miku-discord

Author	SHA1	Message	Date
koko210Serve	305605fde5	docs: add comprehensive COMMANDS.md reference Document all bot commands, features and API endpoints: - 7 voice commands, 4 UNO commands, 2 inline commands - Conversational features (name detection, DMs, media analysis, image gen) - Mood system (14 regular + 4 evil moods) - Personality modes (evil, bipolar, persona dialogue) - Voice chat architecture (dual GPU, STT, TTS, resource locking) - Autonomous behavior system (6 action types) - Memory system (Cheshire Cat declarative + episodic) - Profile picture system - ~126 API endpoints organized into 20 categories - Discord event handlers and environment variables Resolves #18	2026-02-18 12:37:25 +02:00
koko210Serve	d44f08af18	fix(config): persist runtime settings across bot restarts Add restore_runtime_settings() to ConfigManager that reads config_runtime.yaml on startup and restores persisted values into globals: - LANGUAGE_MODE, AUTONOMOUS_DEBUG, VOICE_DEBUG_MODE - USE_CHESHIRE_CAT, PREFER_AMD_GPU, DM_MOOD Add missing persistence calls to API endpoints: - POST /language/set now persists to config_runtime.yaml - POST /voice/debug-mode now persists to config_runtime.yaml - POST /memory/toggle now persists to config_runtime.yaml Call restore_runtime_settings() in on_ready() after evil/bipolar restore. Resolves #22	2026-02-18 12:18:12 +02:00
koko210Serve	8d5137046c	fix(shutdown): implement graceful async shutdown handler Replace the minimal sync-only shutdown (which only saved autonomous state) with a comprehensive async graceful_shutdown() coroutine that: 1. Ends active voice sessions (disconnect, release GPU locks, cleanup audio) 2. Saves autonomous engine state 3. Stops the APScheduler 4. Cancels all tracked background tasks (from task_tracker) 5. Closes the Discord gateway connection Signal handlers (SIGTERM/SIGINT) now schedule the async shutdown on the running event loop. The atexit handler is kept as a last-resort sync fallback. Resolves #5, also addresses #4 (voice cleanup at shutdown)	2026-02-18 12:08:32 +02:00
koko210Serve	7b7abcfc68	fix(tasks): replace fire-and-forget asyncio.create_task with create_tracked_task Add utils/task_tracker.py with create_tracked_task() that wraps background tasks with error logging, cancellation handling, and reference tracking. Replace all 17 fire-and-forget asyncio.create_task() calls across 7 files: - bot/bot.py (5 interjection checks) - bot/utils/autonomous.py (2 check-and-act/react tasks) - bot/utils/bipolar_mode.py (3 argument tasks) - bot/commands/uno.py (1 game loop task) - bot/utils/voice_receiver.py (3 STT/interruption callbacks) - bot/utils/persona_dialogue.py (4 dialogue turn/interjection tasks) Previously-tracked tasks (voice_audio.py, voice_manager.py) were left as-is since they already store task references for cancellation. Closes #1	2026-02-18 12:01:08 +02:00
koko210Serve	cf55b15745	Optimize miku-bot container: remove unused packages and caches Optimizations applied: - Add pip cache purge after pip install (~7.5MB saved) - Remove /usr/share/doc documentation (~7.5MB saved) - Remove pocketsphinx speech recognition packages (~37MB saved) - Remove libflite1 TTS library (~28MB saved) Packages removed: - pocketsphinx-en-us (US English speech model) - pocketsphinx (speech recognition library) - libflite1 (text-to-speech engine) - libpocketsphinx3 (speech recognition frontend) Reason: These packages are not used in Python code: - Speech recognition: Handled by external stt-realtime container - Text-to-speech: Handled by external RVC container Note: Could not remove Vulkan/Mesa drivers (~130MB) because: - Playwright installs them via --with-deps flag - Removing them also removes libgl1 (required by OpenCV) - libgl1 pulls back Mesa graphics drivers Total savings: ~80MB (from previous 2.41GB baseline) Container size remains 2.41GB due to essential package dependencies	2026-02-15 22:21:30 +02:00
koko210Serve	33e5095607	Optimize miku-bot container size by removing unused dependencies Major changes: - Remove unused ML libraries: torch, scikit-learn, langchain-core, langchain-text-splitters, langchain-community, faiss-cpu - Comment out unused langchain imports in utils/core.py (only used in commented-out code) - Keep transformers (used in persona_dialogue.py for sentiment analysis) Results: - Container size reduced from 14.5GB to 2.6GB - 82% reduction (11.9GB saved) - Bot runs correctly without errors - All functionality preserved Removed packages: - torch: ~1.0-1.5GB (not used, only in soprano_to_rvc/) - scikit-learn: ~200-300MB (not used in bot/) - langchain-core: ~50-100MB (not used, only in commented code) - langchain-text-splitters: ~30-50MB (not used, only in commented code) - langchain-community: ~50-80MB (not used, only in commented code) - faiss-cpu: ~100-200MB (not used in bot/) This is Phase 1 of container optimization (Quick Wins). Further optimizations possible: - OpenCV headless (150-200MB) - Evaluate Playwright usage (500MB-1GB) - Alpine base image (1-1.5GB) - Multi-stage builds (200-400MB)	2026-02-15 20:56:25 +02:00
koko210Serve	8d09a8a52f	Implement comprehensive config system and clean up codebase Major changes: - Add Pydantic-based configuration system (bot/config.py, bot/config_manager.py) - Add config.yaml with all service URLs, models, and feature flags - Fix config.yaml path resolution in Docker (check /app/config.yaml first) - Remove Fish Audio API integration (tested feature that didn't work) - Remove hardcoded ERROR_WEBHOOK_URL, import from config instead - Add missing Pydantic models (LogConfigUpdateRequest, LogFilterUpdateRequest) - Enable Cheshire Cat memory system by default (USE_CHESHIRE_CAT=true) - Add .env.example template with all required environment variables - Add setup.sh script for user-friendly initialization - Update docker-compose.yml with proper env file mounting - Update .gitignore for config files and temporary files Config system features: - Static configuration from config.yaml - Runtime overrides from config_runtime.yaml - Environment variables for secrets (.env) - Web UI integration via config_manager - Graceful fallback to defaults Secrets handling: - Move ERROR_WEBHOOK_URL from hardcoded to .env - Add .env.example with all placeholder values - Document all required secrets - Fish API key and voice ID removed from .env Documentation: - CONFIG_README.md - Configuration system guide - CONFIG_SYSTEM_COMPLETE.md - Implementation summary - FISH_API_REMOVAL_COMPLETE.md - Removal record - SECRETS_CONFIGURED.md - Secrets setup record - BOT_STARTUP_FIX.md - Pydantic model fixes - MIGRATION_CHECKLIST.md - Setup checklist - WEB_UI_INTEGRATION_COMPLETE.md - Web UI config guide - Updated readmes/README.md with new features	2026-02-15 19:51:00 +02:00
koko210Serve	bb5067a89e	fix: Add settings.json and enable profile_picture_context plugin - Added empty settings.json required by Cat plugin system - Plugin now appears in ACTIVE PLUGINS list - Enabled via /plugins/toggle API endpoint - Ready to inject PFP descriptions when user asks about it	2026-02-11 00:09:58 +02:00
koko210Serve	eb557f655c	feat: Add profile picture context plugin with regex-based injection - Create profile_picture_context plugin to detect PFP queries via regex - Inject current_description.txt only when user asks about profile picture - Mount bot/memory directory in Cat container for PFP access - Avoids context bloat by only adding PFP description when relevant - Patterns match: 'what does your pfp look like', 'describe your avatar', etc. - Works seamlessly with existing profile picture update system - No manual sync needed - description auto-updates with PFP changes	2026-02-10 23:41:14 +02:00
koko210Serve	985ac60191	Webhook pfp updates properly now	2026-02-10 22:57:55 +02:00
koko210Serve	34167eddae	feat: Restore mood system and implement comprehensive memory editor UI MOOD SYSTEM FIX: - Mount bot/moods directory in docker-compose.yml for Cat container access - Update miku_personality plugin to load mood descriptions from .txt files - Add Cat logger for debugging mood loading (replaces print statements) - Moods now dynamically loaded from working_memory instead of hardcoded neutral	2026-02-10 22:03:54 +02:00
koko210Serve	6ba8e19d99	Ability to edit and add memories from the web UI with fixed escapeHtml	2026-02-10 21:41:28 +02:00
koko210Serve	fbd940e711	fix: Restore declarative memory recall by preserving suffix template Root cause: The miku_personality plugin's agent_prompt_suffix hook was returning an empty string, which wiped out the {declarative_memory} and {episodic_memory} placeholders from the prompt template. This caused the LLM to never receive any stored facts about users, resulting in hallucinated responses. Changes: - miku_personality: Changed agent_prompt_suffix to return the memory context section with {episodic_memory}, {declarative_memory}, and {tools_output} placeholders instead of empty string - discord_bridge: Added before_cat_recalls_declarative_memories hook to increase k-value from 3 to 10 and lower threshold from 0.7 to 0.5 for better fact retrieval. Added agent_prompt_prefix to emphasize factual accuracy. Added debug logging via before_agent_starts hook. Result: Miku now correctly recalls user facts (favorite songs, games, etc.) from declarative memory with 100% accuracy. Tested with: - 'What is my favorite song?' → Correctly answers 'Monitoring (Best Friend Remix) by DECO*27' - 'Do you remember my favorite song?' → Correctly recalls the song - 'What is my favorite video game?' → Correctly answers 'Sonic Adventure'	2026-02-09 12:33:31 +02:00
koko210Serve	beb1a89000	Fix: Optimize Twitter fetching to avoid Playwright hangs - Replaced Playwright browser scraping with direct API media extraction - Both fetch_miku_tweets() and fetch_figurine_tweets_latest() now use twscrape's built-in media info - Reduced tweet fetching from 10-15 minutes to ~5 seconds - Eliminated browser timeout/hanging issues - Relaxed autonomous tweet sharing conditions: * Increased message threshold from 10 to 20 per hour * Reduced cooldown from 3600s to 2400s (40 minutes) * Increased energy threshold from 50% to 70% * Added 'silly' and 'flirty' moods to allowed sharing moods This makes both figurine notifications and tweet sharing much more reliable and responsive.	2026-02-08 14:55:01 +02:00
koko210Serve	b9d1f67d70	llama-swap-rocm now uses official image and adjusted accordingly	2026-02-07 23:43:01 +02:00
koko210Serve	11b90ebb46	fix: Phase 3 bug fixes - memory APIs, username visibility, web UI layout, Docker Critical Bug Fixes: 1. Per-user memory isolation bug - Changed CatAdapter from HTTP POST to WebSocket /ws/{user_id} - User_id now comes from URL path parameter (true per-user isolation) - Verified: Different users can't see each other's memories 2. Memory API 405 errors - Replaced non-existent Cat endpoint calls with Qdrant direct queries - get_memory_points(): Now uses POST /collections/{collection}/points/scroll - delete_memory_point(): Now uses POST /collections/{collection}/points/delete 3. Memory stats showing null counts - Reimplemented get_memory_stats() to query Qdrant directly - Now returns accurate counts: episodic: 20, declarative: 6, procedural: 4 4. Miku couldn't see usernames - Modified discord_bridge before_cat_reads_message hook - Prepends [Username says:] to every message text - LLM now knows who is texting: [Alice says:] Hello Miku! 5. Web UI Memory tab layout - Tab9 was positioned outside .tab-container div (showed to the right) - Moved tab9 HTML inside container, before closing divs - Memory tab now displays below tab buttons like other tabs Code Changes: bot/utils/cat_client.py: - Line 25: Logger name changed to 'llm' (available component) - get_memory_stats() (lines 256-285): Query Qdrant directly via HTTP GET - get_memory_points() (lines 275-310): Use Qdrant POST /points/scroll - delete_memory_point() (lines 350-370): Use Qdrant POST /points/delete cat-plugins/discord_bridge/discord_bridge.py: - Fixed .pop() → .get() (UserMessage is Pydantic BaseModelDict) - Added before_cat_reads_message logic to prepend [Username says:] - Message format: [Alice says:] message content Dockerfile.llamaswap-rocm: - Lines 37-44: Added conditional check for UI directory - if [ -d ui ] before npm install && npm run build - Fixes build failure when llama-swap UI dir doesn't exist bot/static/index.html: - Moved tab9 from lines 1554-1688 (outside container) - To position before container closing divs (now inside) - Memory tab button at line 673: 🧠 Memories Testing & Verification: ✅ Per-user isolation verified (Docker exec test) ✅ Memory stats showing real counts (curl test) ✅ Memory API working (facts/episodic loading) ✅ Web UI layout fixed (tab displays correctly) ✅ All 5 services running (llama-swap, llama-swap-amd, qdrant, cat, bot) ✅ Username prepending working (message context for LLM) Result: All Phase 3 critical bugs fixed and verified working.	2026-02-07 23:27:15 +02:00
koko210Serve	5fe420b7bc	Web UI tabs made into two rows	2026-02-07 22:16:01 +02:00
koko210Serve	14e1a8df51	Phase 3: Unified Cheshire Cat integration with WebSocket-based per-user isolation Key changes: - CatAdapter (bot/utils/cat_client.py): WebSocket /ws/{user_id} for chat queries instead of HTTP POST (fixes per-user memory isolation when no API keys are configured — HTTP defaults all users to user_id='user') - Memory management API: 8 endpoints for status, stats, facts, episodic memories, consolidation trigger, multi-step delete with confirmation - Web UI: Memory tab (tab9) with collection stats, fact/episodic browser, manual consolidation trigger, and 3-step delete flow requiring exact confirmation string - Bot integration: Cat-first response path with query_llama fallback for both text and embed responses, server mood detection - Discord bridge plugin: fixed .pop() to .get() (UserMessage is a Pydantic BaseModelDict, not a raw dict), metadata extraction via extra attributes - Unified docker-compose: Cat + Qdrant services merged into main compose, bot depends_on Cat healthcheck - All plugins (discord_bridge, memory_consolidation, miku_personality) consolidated into cat-plugins/ for volume mount - query_llama deprecated but functional for compatibility	2026-02-07 20:22:03 +02:00
koko210Serve	edb88e9ede	fix: Phase 2 integrity review - v2.0.0 rewrite & bugfixes Memory Consolidation Plugin (828 -> 465 lines): - Replace SentenceTransformer with cat.embedder.embed_query() for vector consistency - Fix per-user fact isolation: source=user_id instead of global - Add duplicate fact detection (_is_duplicate_fact, score_threshold=0.85) - Remove ~350 lines of dead async run_consolidation() code - Remove duplicate declarative search in before_cat_sends_message - Unify trivial patterns into TRIVIAL_PATTERNS frozenset - Remove all sys.stderr.write debug logging - Remove sentence-transformers from requirements.txt (no external deps) Loguru Fix (cheshire-cat/cat/log.py): - Patch Cat v1.6.2 loguru format to provide default extra fields - Fixes KeyError: 'original_name' from third-party libs (fastembed) - Mounted via docker-compose volume Discord Bridge: - Copy discord_bridge.py to cat-plugins/ (was empty directory) Test Results (6/7 pass, 100% fact recall): - 11 facts extracted, per-user isolation working - Duplicate detection effective (+2 on 2nd run) - 5/5 natural language recall queries correct	2026-02-07 19:24:46 +02:00
koko210Serve	83c103324c	feat: Phase 2 Memory Consolidation - Production Ready Implements intelligent memory consolidation system with LLM-based fact extraction: Features: - Bidirectional memory: stores both user and Miku messages - LLM-based fact extraction (replaces regex for intelligent pattern detection) - Filters Miku's responses during fact extraction (only user messages analyzed) - Trivial message filtering (removes lol, k, ok, etc.) - Manual consolidation trigger via 'consolidate now' command - Declarative fact recall with semantic search - User separation via metadata (user_id, guild_id) - Tested: 60% fact recall accuracy, 39 episodic memories, 11 facts extracted Phase 2 Requirements Complete: ✅ Minimal real-time filtering ✅ Nightly consolidation task (manual trigger works) ✅ Context-aware LLM analysis ✅ Extract declarative facts ✅ Metadata enrichment Test Results: - Episodic memories: 39 stored (user + Miku) - Declarative facts: 11 extracted from user messages only - Fact recall accuracy: 3/5 queries (60%) - Pipeline test: PASS Ready for production deployment with scheduled consolidation.	2026-02-03 23:17:27 +02:00
koko210Serve	323ca753d1	feat: Phase 1 - Discord bridge with unified user identity Implements unified cross-server memory system for Miku bot: Core Changes: - discord_bridge plugin with 3 hooks for metadata enrichment - Unified user identity: discord_user_{id} across servers and DMs - Minimal filtering: skip only trivial messages (lol, k, 1-2 chars) - Marks all memories as consolidated=False for Phase 2 processing Testing: - test_phase1.py validates cross-server memory recall - PHASE1_TEST_RESULTS.md documents successful validation - Cross-server test: User says 'blue' in Server A, Miku remembers in Server B ✅ Documentation: - IMPLEMENTATION_PLAN.md - Complete architecture and roadmap - Phase 2 (sleep consolidation) ready for implementation This lays the foundation for human-like memory consolidation.	2026-01-31 18:54:00 +02:00
koko210Serve	0a9145728e	Ability to play Uno implemented in early stages!	2026-01-30 21:43:20 +02:00
koko210Serve	5b1163c7af	Removed KV Cache offloading to increase performance	2026-01-30 21:35:07 +02:00
koko210Serve	7368ef0cd5	Added Japanese and Bulgarian addressing	2026-01-30 21:34:24 +02:00
koko210Serve	38a986658d	moved AI generated readmes to readme folder (may delete)	2026-01-27 19:58:26 +02:00
koko210Serve	c58b941587	moved AI generated readmes to readme folder (may delete)	2026-01-27 19:57:48 +02:00
koko210Serve	0f1c30f757	Added verbose logging to llama-swap-rocm. Not sure if does anything...	2026-01-27 19:57:04 +02:00
koko210Serve	55fd3e0953	Cleanup. Moved prototype and testing STT/TTS to 1TB HDD	2026-01-27 19:55:13 +02:00
koko210Serve	ecd14cf704	Able to now address Miku in Cyrillic, Kanji and both Kanas, incl. Japanese honorifics	2026-01-27 19:53:18 +02:00
koko210Serve	641a5b83e8	Improved Evil Mode toggle to handle edge cases of the pfp and role color change. Japanese swallow model compatible (should be).	2026-01-27 19:52:39 +02:00
koko210Serve	c0aaab0c3a	Disabled KV cache offloading on llama-server and enabled Flash Attention. Performance gains in the tens.	2026-01-27 19:11:49 +02:00
koko210Serve	dca58328e4	Tuned the Japanese mode system prompt and model better	2026-01-23 17:01:47 +02:00
koko210Serve	fe0962118b	Implemented new Japanese only text mode with WebUI toggle, utilizing a llama3.1 swallow dataset model. Next up is Japanese TTS.	2026-01-23 15:02:36 +02:00
koko210Serve	eb03dfce4d	refactor: Implement low-latency STT pipeline with speculative transcription Major architectural overhaul of the speech-to-text pipeline for real-time voice chat: STT Server Rewrite: - Replaced RealtimeSTT dependency with direct Silero VAD + Faster-Whisper integration - Achieved sub-second latency by eliminating unnecessary abstractions - Uses small.en Whisper model for fast transcription (~850ms) Speculative Transcription (NEW): - Start transcribing at 150ms silence (speculative) while still listening - If speech continues, discard speculative result and keep buffering - If 400ms silence confirmed, use pre-computed speculative result immediately - Reduces latency by ~250-850ms for typical utterances with clear pauses VAD Implementation: - Silero VAD with ONNX (CPU-efficient) for 32ms chunk processing - Direct speech boundary detection without RealtimeSTT overhead - Configurable thresholds for silence detection (400ms final, 150ms speculative) Architecture: - Single Whisper model loaded once, shared across sessions - VAD runs on every 512-sample chunk for immediate speech detection - Background transcription worker thread for non-blocking processing - Greedy decoding (beam_size=1) for maximum speed Performance: - Previous: 400ms silence wait + ~850ms transcription = ~1.25s total latency - Current: 400ms silence wait + 0ms (speculative ready) = ~400ms (best case) - Single model reduces VRAM usage, prevents OOM on GTX 1660 Container Manager Updates: - Updated health check logic to work with new response format - Changed from checking 'warmed_up' flag to just 'status: ready' - Improved terminology from 'warmup' to 'models loading' Files Changed: - stt-realtime/stt_server.py: Complete rewrite with Silero VAD + speculative transcription - stt-realtime/requirements.txt: Removed RealtimeSTT, using torch.hub for Silero VAD - bot/utils/container_manager.py: Updated health check for new STT response format - bot/api.py: Updated docstring to reflect new architecture - backups/: Archived old RealtimeSTT-based implementation This addresses low latency requirements while maintaining accuracy with configurable speech detection thresholds.	2026-01-22 22:08:07 +02:00
koko210Serve	2934efba22	Implemented experimental real production ready voice chat, relegated old flow to voice debug mode. New Web UI panel for Voice Chat.	2026-01-20 23:06:17 +02:00
koko210Serve	362108f4b0	Decided on Parakeet ONNX Runtime. Works pretty great. Realtime voice chat possible now. UX lacking.	2026-01-19 00:29:44 +02:00
koko210Serve	0a8910fff8	Changed stt to parakeet — still experiemntal, though performance seems to be better	2026-01-18 03:35:50 +02:00
koko210Serve	50e4f7a5f2	Error in llama-swap catchall implemented + webhook notifier	2026-01-18 01:30:26 +02:00
koko210Serve	d1e6b21508	Phase 4 STT pipeline implemented — Silero VAD + faster-whisper — still not working well at all	2026-01-17 03:14:40 +02:00
koko210Serve	3e59e5d2f6	Phase 3 implemented — Text LLM can now stream to the TTS pipeline with the !miku say command	2026-01-17 00:01:17 +02:00
koko210Serve	9943cecdec	Phase 2 implemented and tested. Added warmup to pipeline and Miku queues tokens while the pipeline is warming up	2026-01-16 23:37:34 +02:00
koko210Serve	b0066f3525	Tested Phase 1, fixed text channel blocking while in voice and implemented joining and leaving VC from Phase 2	2026-01-16 20:39:23 +02:00
koko210Serve	911f11ee9f	Untested Phase 1 (Foundation & Resource management) of voice chat integration	2026-01-16 13:01:08 +02:00
koko210Serve	353c9c9583	Face Detector container now able to be created, started and stopped from within miku-bot container	2026-01-11 02:01:41 +02:00
koko210Serve	2d3b9d0e08	Fix IndentationError in persona_dialogue.py by removing stray docstring delimiter	2026-01-10 23:01:28 +02:00
koko210Serve	f576db0d88	fix: Remove duplicate json import causing runtime error - Removed local 'import json' statement inside get_servers() function - This was shadowing the module-level import and causing 'cannot access local variable' error - json is already imported at the top of the file (line 44)	2026-01-10 21:05:46 +02:00
koko210Serve	32c2a7b930	feat: Implement comprehensive non-hierarchical logging system - Created new logging infrastructure with per-component filtering - Added 6 log levels: DEBUG, INFO, API, WARNING, ERROR, CRITICAL - Implemented non-hierarchical level control (any combination can be enabled) - Migrated 917 print() statements across 31 files to structured logging - Created web UI (system.html) for runtime configuration with dark theme - Added global level controls to enable/disable levels across all components - Added timestamp format control (off/time/date/datetime options) - Implemented log rotation (10MB per file, 5 backups) - Added API endpoints for dynamic log configuration - Configured HTTP request logging with filtering via api.requests component - Intercepted APScheduler logs with proper formatting - Fixed persistence paths to use /app/memory for Docker volume compatibility - Fixed checkbox display bug in web UI (enabled_levels now properly shown) - Changed System Settings button to open in same tab instead of new window Components: bot, api, api.requests, autonomous, persona, vision, llm, conversation, mood, dm, scheduled, gpu, media, server, commands, sentiment, core, apscheduler All settings persist across container restarts via JSON config.	2026-01-10 20:46:19 +02:00
koko210Serve	ce00f9bd95	Changed misleading face detector warning message on startup in the log	2026-01-09 00:13:03 +02:00
koko210Serve	1fc3d74a5b	Add dual GPU support with web UI selector Features: - Built custom ROCm container for AMD RX 6800 GPU - Added GPU selection toggle in web UI (NVIDIA/AMD) - Unified model names across both GPUs for seamless switching - Vision model always uses NVIDIA GPU (optimal performance) - Text models (llama3.1, darkidol) can use either GPU - Added /gpu-status and /gpu-select API endpoints - Implemented GPU state persistence in memory/gpu_state.json Technical details: - Multi-stage Dockerfile.llamaswap-rocm with ROCm 6.2.4 - llama.cpp compiled with GGML_HIP=ON for gfx1030 (RX 6800) - Proper GPU permissions without root (groups 187/989) - AMD container on port 8091, NVIDIA on port 8090 - Updated bot/utils/llm.py with get_current_gpu_url() and get_vision_gpu_url() - Modified bot/utils/image_handling.py to always use NVIDIA for vision - Enhanced web UI with GPU selector button (blue=NVIDIA, red=AMD) Files modified: - docker-compose.yml (added llama-swap-amd service) - bot/globals.py (added LLAMA_AMD_URL) - bot/api.py (added GPU selection endpoints and helper function) - bot/utils/llm.py (GPU routing for text models) - bot/utils/image_handling.py (GPU routing for vision models) - bot/static/index.html (GPU selector UI) - llama-swap-rocm-config.yaml (unified model names) New files: - Dockerfile.llamaswap-rocm - bot/memory/gpu_state.json - bot/utils/gpu_router.py (load balancing utility) - setup-dual-gpu.sh (setup verification script) - DUAL_GPU_*.md (documentation files)	2026-01-09 00:03:59 +02:00
koko210Serve	ed5994ec78	Fix: Resolve webhook send timeout context error ISSUE ===== When using the manual webhook message feature via API, the following error occurred: - 'Timeout context manager should be used inside a task' - 'NoneType' object is not iterable (when sending without files) The error happened because Discord.py's webhook operations were being awaited directly in the FastAPI endpoint context, rather than within a task running in the bot's event loop. SOLUTION ======== Refactored /manual/send-webhook endpoint to properly handle async operations: 1. Moved webhook creation inside task function - get_or_create_webhooks_for_channel() now runs in send_webhook_message() - All Discord operations (webhook selection, sending) happen inside the task - Follows same pattern as working /manual/send endpoint 2. Fixed file parameter handling - Changed from 'files=discord_files if discord_files else None' - To conditional: only pass files parameter when list is non-empty - Discord.py's webhook.send() cannot iterate over None, requires list or omit 3. Maintained proper file reading - File content still read in endpoint context (before form closes) - File data passed to task as pre-read byte arrays - Prevents form closure issues TECHNICAL DETAILS ================= - Discord.py HTTP operations use timeout context managers - Context managers must run inside bot's event loop (via create_task) - FastAPI endpoint context is separate from bot's event loop - Solution: Wrap all Discord API calls in async task function - Pattern: Read files → Create task → Task handles Discord operations TESTING ======= - Manual webhook sending now works without timeout errors - Both personas (Miku/Evil) send correctly - File attachments work properly - Messages without files send correctly	2026-01-07 13:44:13 +02:00

1 2

76 Commits