- Add automatic recording stop when user turns off camera/screenshare - Listen for startStreaming events to detect video_ssrc=0 (camera/screen off) - Add disconnect and connection close event handling - Implement proper cleanup of event listeners to prevent memory leaks - Add targetUserId tracking for accurate event filtering - Update both videoRecording.js and webcamRecording.js services - Update discord.js-selfbot-v13 submodule with latest webcam recording fixes |
||
|---|---|---|
| .. | ||
| ai-architecture.md | ||
| architecture.md | ||
| commands.md | ||
| development.md | ||
| docker-compose-examples.md | ||
| docker.md | ||
| interactions.md | ||
| README.md | ||
| setup.md | ||
| troubleshooting.md | ||
| video-recording.md | ||
Kasane Teto AI Companion Bot Documentation
Welcome to the comprehensive documentation for the Kasane Teto AI Companion Bot! This AI-powered Discord bot brings the beloved virtual singer to life in your server with natural conversation, multimedia interaction, and authentic personality.
🎭 What is Teto?
Kasane Teto is an AI-powered Discord companion that:
- Chats naturally with your friend group using advanced language models
- Joins voice channels and speaks with synthesized voice
- Analyzes images and visual content you share
- Watches video streams and provides real-time commentary
- Records memorable moments for your group to revisit
- Maintains authentic personality as the cheerful virtual singer
Unlike simple command bots, Teto engages in genuine conversations, remembers past interactions, and adapts to your server's unique culture and dynamics.
📚 Documentation Structure
🚀 Getting Started
- Setup Guide - Complete installation and local AI stack configuration
- Quick Start - Get Teto running in 5 minutes
- Configuration - Local models, personality, and customization
💬 Interacting with Teto
- Conversation Guide - How to chat naturally with Teto
- Commands Reference - All available commands and usage
- Personality Guide - Understanding Teto's character and responses
- Voice Interaction - Speaking with Teto in voice channels
🧠 AI Capabilities
- AI Architecture - How Teto's local AI systems work
- Vision System - Image analysis with local multi-modal models
- Memory System - How Teto remembers conversations locally +- Personality Engine](personality-engine.md) - Character consistency and roleplay
🔧 Technical Documentation
- Architecture Overview - System design and components
- API Reference - Service interfaces and integration
- Docker Guide - Container deployment and management
- Development Guide - Contributing and extending Teto
🛠️ Operations & Support
- Troubleshooting - Common issues and solutions
- Performance Tuning - Optimizing your local AI stack +- Security & Privacy - Data handling and safety in a local-first setup
🎯 Quick Navigation by Use Case
"I want to set up Teto for the first time"
- Setup Guide - Installation and local AI stack configuration
- Configuration - vLLM, Piper, and Whisper setup
- Docker Guide - Multi-container deployment for AI services
"I want to understand how to interact with Teto"
- Conversation Guide - Natural chat examples
- Commands Reference - Available commands
- Voice Interaction - Voice channel usage
"I want to understand Teto's capabilities"
- Personality Guide - Character traits and style
- Vision System - Image analysis with local models
- AI Architecture - Technical capabilities
"I want to customize or develop features"
- Development Guide - Contributing guidelines
- Architecture Overview - System design
- API Reference - Integration interfaces
"I'm having issues or want to optimize"
- Troubleshooting - Problem solving
- Performance Tuning - Optimizing your local AI stack +- Security & Privacy - Best practices for a local-first setup
🌟 Key Features Overview
🤖 AI-Powered Conversation
Teto uses advanced language models to maintain natural, contextual conversations while staying true to her cheerful, musical personality. She remembers past interactions and adapts to your server's communication style.
👀 Visual Understanding
With computer vision capabilities, Teto can analyze images, screenshots, memes, and artwork that users share, providing relevant commentary and engaging with visual content naturally.
🎵 Voice Interaction
Teto can join voice channels and speak with synthesized voice, participating in conversations, singing when requested, and providing commentary on ongoing activities.
📹 Stream & Video Analysis
Advanced capabilities allow Teto to watch Discord streams, analyze video content, and provide real-time commentary on what she observes.
🧠 Memory & Learning
Teto maintains conversation history and user preferences, allowing for continuity across interactions and personalized responses based on past conversations.
🎭 Authentic Roleplay
Carefully crafted personality engine ensures Teto maintains consistent character traits, speech patterns, and interests that align with the Kasane Teto persona.
🔧 Technical Architecture
Teto Local AI System
├── Language Model (vLLM) # Self-hosted natural conversation
├── Vision Model (vLLM Multi-modal) # Self-hosted image/video analysis
├── Voice Synthesis (Piper TTS) # Local speech generation via Wyoming
├── Speech Recognition (Whisper STT) # Local speech recognition via Wyoming
├── Memory System (Local Vector DB) # Local conversation history
├── Personality Engine # Character consistency
└── Discord Integration # Platform interface
📋 System Requirements
Minimum Requirements
- VRAM: 8GB+ for 7B models (required for
vLLM) - RAM: 16GB+ (for models and system)
- CPU: Modern multi-core (for processing)
- Storage: 15GB+ SSD (for model weights)
- Network: Local network for inter-service communication
Recommended Setup
- VRAM: 24GB+ for larger models or concurrent tasks
- RAM: 32GB+ for smoother operation
- Storage: NVMe SSD for fast model loading
- GPU: Required for
vLLMandWhisper
🚦 Getting Started Checklist
- Read the Setup Guide
- Download required model weights (LLM, TTS, etc.)
- Configure local endpoints for
vLLMandWyoming - Deploy multi-container stack using Docker
- Customize personality settings
- Test basic conversation features
- Explore voice and vision capabilities
- Review security and privacy settings
🤝 Community & Support
Contributing
We welcome contributions to improve Teto's capabilities:
- Personality Enhancement - Better character responses
- New Features - Additional AI capabilities
- Documentation - Help others understand Teto
- Bug Fixes - Improve stability and performance
See the Development Guide for detailed contribution guidelines.
Getting Help
- Technical Issues: Check Troubleshooting
- Setup Problems: Review Setup Guide
- Feature Questions: See Commands Reference +- AI Behavior: Read Personality Guide
Best Practices
- Privacy First: All data is processed locally, ensuring maximum privacy
- Appropriate Content: Maintain family-friendly interactions
- Resource Management: Monitor local GPU and CPU usage
- Community Guidelines: Foster positive server environments
📊 Documentation Stats
- Total Guides: 15+ comprehensive documents
- Coverage: Setup, usage, development, troubleshooting
- Examples: 50+ practical usage examples
- Code Samples: 100+ implementation snippets
- Last Updated: Current with latest AI capabilities
🔮 Future Enhancements
The documentation will continue to evolve with new features:
- Advanced Memory Systems - Long-term relationship building
- Custom Voice Training - Fine-tuning
Piperfor a unique Teto voice - Multi-Server Consistency - Shared personality across servers
- Game Integration - Interactive gaming experiences
- Creative Tools - Music and art generation with local models
Ready to bring Teto to life? Start with the Setup Guide to get your AI companion running!
For quick questions, try the Troubleshooting guide or explore specific topics using the navigation above.
✨ Let's make your Discord server more magical with Kasane Teto! ✨