History

Mikolaj Wojciech Gorski 81a2318bd2 feat: add auto-stop functionality for webcam and video recordings - Add automatic recording stop when user turns off camera/screenshare - Listen for startStreaming events to detect video_ssrc=0 (camera/screen off) - Add disconnect and connection close event handling - Implement proper cleanup of event listeners to prevent memory leaks - Add targetUserId tracking for accurate event filtering - Update both videoRecording.js and webcamRecording.js services - Update discord.js-selfbot-v13 submodule with latest webcam recording fixes		2025-07-26 16:13:19 +02:00
..
ai-architecture.md	Updated the docs to focus on a local only stack instead of one relient on services like OpenAI, Eleven labs and so on.	2025-07-26 14:26:18 +02:00
architecture.md	Major refactor: Transform into AI-powered Kasane Teto companion bot	2025-07-26 13:08:47 +02:00
commands.md	Updated the docs to focus on a local only stack instead of one relient on services like OpenAI, Eleven labs and so on.	2025-07-26 14:26:18 +02:00
development.md	Major refactor: Transform into AI-powered Kasane Teto companion bot	2025-07-26 13:08:47 +02:00
docker-compose-examples.md	feat: add auto-stop functionality for webcam and video recordings	2025-07-26 16:13:19 +02:00
docker.md	Major refactor: Transform into AI-powered Kasane Teto companion bot	2025-07-26 13:08:47 +02:00
interactions.md	Major refactor: Transform into AI-powered Kasane Teto companion bot	2025-07-26 13:08:47 +02:00
README.md	Updated the docs to focus on a local only stack instead of one relient on services like OpenAI, Eleven labs and so on.	2025-07-26 14:26:18 +02:00
setup.md	Updated the docs to focus on a local only stack instead of one relient on services like OpenAI, Eleven labs and so on.	2025-07-26 14:26:18 +02:00
troubleshooting.md	Updated the docs to focus on a local only stack instead of one relient on services like OpenAI, Eleven labs and so on.	2025-07-26 14:26:18 +02:00
video-recording.md	Major refactor: Transform into AI-powered Kasane Teto companion bot	2025-07-26 13:08:47 +02:00

README.md

Kasane Teto AI Companion Bot Documentation

Welcome to the comprehensive documentation for the Kasane Teto AI Companion Bot! This AI-powered Discord bot brings the beloved virtual singer to life in your server with natural conversation, multimedia interaction, and authentic personality.

🎭 What is Teto?

Kasane Teto is an AI-powered Discord companion that:

Chats naturally with your friend group using advanced language models
Joins voice channels and speaks with synthesized voice
Analyzes images and visual content you share
Watches video streams and provides real-time commentary
Records memorable moments for your group to revisit
Maintains authentic personality as the cheerful virtual singer

Unlike simple command bots, Teto engages in genuine conversations, remembers past interactions, and adapts to your server's unique culture and dynamics.

📚 Documentation Structure

🚀 Getting Started

Setup Guide - Complete installation and local AI stack configuration
Quick Start - Get Teto running in 5 minutes
Configuration - Local models, personality, and customization

💬 Interacting with Teto

Conversation Guide - How to chat naturally with Teto
Commands Reference - All available commands and usage
Personality Guide - Understanding Teto's character and responses
Voice Interaction - Speaking with Teto in voice channels

🧠 AI Capabilities

AI Architecture - How Teto's local AI systems work
Vision System - Image analysis with local multi-modal models
Memory System - How Teto remembers conversations locally +- Personality Engine](personality-engine.md) - Character consistency and roleplay

🔧 Technical Documentation

Architecture Overview - System design and components
API Reference - Service interfaces and integration
Docker Guide - Container deployment and management
Development Guide - Contributing and extending Teto

🛠️ Operations & Support

Troubleshooting - Common issues and solutions
Performance Tuning - Optimizing your local AI stack +- Security & Privacy - Data handling and safety in a local-first setup

"I want to set up Teto for the first time"

Setup Guide - Installation and local AI stack configuration
Configuration - vLLM, Piper, and Whisper setup
Docker Guide - Multi-container deployment for AI services

"I want to understand how to interact with Teto"

Conversation Guide - Natural chat examples
Commands Reference - Available commands
Voice Interaction - Voice channel usage

"I want to understand Teto's capabilities"

Personality Guide - Character traits and style
Vision System - Image analysis with local models
AI Architecture - Technical capabilities

"I want to customize or develop features"

Development Guide - Contributing guidelines
Architecture Overview - System design
API Reference - Integration interfaces

"I'm having issues or want to optimize"

Troubleshooting - Problem solving
Performance Tuning - Optimizing your local AI stack +- Security & Privacy - Best practices for a local-first setup

🌟 Key Features Overview

🤖 AI-Powered Conversation

Teto uses advanced language models to maintain natural, contextual conversations while staying true to her cheerful, musical personality. She remembers past interactions and adapts to your server's communication style.

👀 Visual Understanding

With computer vision capabilities, Teto can analyze images, screenshots, memes, and artwork that users share, providing relevant commentary and engaging with visual content naturally.

🎵 Voice Interaction

Teto can join voice channels and speak with synthesized voice, participating in conversations, singing when requested, and providing commentary on ongoing activities.

📹 Stream & Video Analysis

Advanced capabilities allow Teto to watch Discord streams, analyze video content, and provide real-time commentary on what she observes.

🧠 Memory & Learning

Teto maintains conversation history and user preferences, allowing for continuity across interactions and personalized responses based on past conversations.

🎭 Authentic Roleplay

Carefully crafted personality engine ensures Teto maintains consistent character traits, speech patterns, and interests that align with the Kasane Teto persona.

🔧 Technical Architecture

Teto Local AI System
├── Language Model (vLLM)            # Self-hosted natural conversation
├── Vision Model (vLLM Multi-modal)  # Self-hosted image/video analysis  
├── Voice Synthesis (Piper TTS)      # Local speech generation via Wyoming
├── Speech Recognition (Whisper STT) # Local speech recognition via Wyoming
├── Memory System (Local Vector DB)  # Local conversation history
├── Personality Engine               # Character consistency
└── Discord Integration              # Platform interface

📋 System Requirements

Minimum Requirements

VRAM: 8GB+ for 7B models (required for vLLM)
RAM: 16GB+ (for models and system)
CPU: Modern multi-core (for processing)
Storage: 15GB+ SSD (for model weights)
Network: Local network for inter-service communication

Recommended Setup

VRAM: 24GB+ for larger models or concurrent tasks
RAM: 32GB+ for smoother operation
Storage: NVMe SSD for fast model loading
GPU: Required for vLLM and Whisper

🚦 Getting Started Checklist

Read the Setup Guide
Download required model weights (LLM, TTS, etc.)
Configure local endpoints for vLLM and Wyoming
Deploy multi-container stack using Docker
Customize personality settings
Test basic conversation features
Explore voice and vision capabilities
Review security and privacy settings

🤝 Community & Support

Contributing

We welcome contributions to improve Teto's capabilities:

Personality Enhancement - Better character responses
New Features - Additional AI capabilities
Documentation - Help others understand Teto
Bug Fixes - Improve stability and performance

See the Development Guide for detailed contribution guidelines.

Getting Help

Technical Issues: Check Troubleshooting
Setup Problems: Review Setup Guide
Feature Questions: See Commands Reference +- AI Behavior: Read Personality Guide

Best Practices

Privacy First: All data is processed locally, ensuring maximum privacy
Appropriate Content: Maintain family-friendly interactions
Resource Management: Monitor local GPU and CPU usage
Community Guidelines: Foster positive server environments

📊 Documentation Stats

Total Guides: 15+ comprehensive documents
Coverage: Setup, usage, development, troubleshooting
Examples: 50+ practical usage examples
Code Samples: 100+ implementation snippets
Last Updated: Current with latest AI capabilities

🔮 Future Enhancements

The documentation will continue to evolve with new features:

Advanced Memory Systems - Long-term relationship building
Custom Voice Training - Fine-tuning Piper for a unique Teto voice
Multi-Server Consistency - Shared personality across servers
Game Integration - Interactive gaming experiences
Creative Tools - Music and art generation with local models

Ready to bring Teto to life? Start with the Setup Guide to get your AI companion running!

For quick questions, try the Troubleshooting guide or explore specific topics using the navigation above.

✨ Let's make your Discord server more magical with Kasane Teto! ✨