No description
🎭 Core Transformation: - Reframe project as AI companion bot with Kasane Teto personality - Focus on natural conversation, multimodal interaction, and character roleplay - Position video recording as one tool in AI toolkit rather than main feature 🏗️ Architecture Improvements: - Refactor messageCreate.js into modular command system (35 lines vs 310+) - Create dedicated videoRecording service with clean API - Implement commandHandler for extensible command routing - Add centralized configuration system (videoConfig.js) - Separate concerns: events, services, config, documentation 📚 Documentation Overhaul: - Consolidate scattered READMEs into organized docs/ directory - Create comprehensive documentation covering: * AI architecture and capabilities * Natural interaction patterns and personality * Setup guides for AI services and Docker deployment * Commands reference focused on conversational AI * Troubleshooting and development guidelines - Transform root README into compelling AI companion overview 🤖 AI-Ready Foundation: - Document integration points for: * Language models (GPT-4/Claude) for conversation * Vision models (GPT-4V/CLIP) for image analysis * Voice synthesis (ElevenLabs) for speaking * Memory systems for conversation continuity * Personality engine for character consistency 🔧 Technical Updates: - Integrate custom discord.js-selfbot-v13 submodule with enhanced functionality - Update package.json dependencies for AI and multimedia capabilities - Maintain Docker containerization with improved architecture - Add development and testing infrastructure 📖 New Documentation Structure: docs/ ├── README.md (documentation hub) ├── setup.md (installation & AI configuration) ├── interactions.md (how to chat with Teto) ├── ai-architecture.md (technical AI systems overview) ├── commands.md (natural language interactions) ├── personality.md (character understanding) ├── development.md (contributing guidelines) ├── troubleshooting.md (problem solving) └── [additional specialized guides] ✨ This update transforms the project from a simple recording bot into a foundation for an engaging AI companion that can naturally interact through text, voice, and visual content while maintaining authentic Kasane Teto personality traits. |
||
|---|---|---|
| dashboard_mockups | ||
| discord.js-selfbot-v13@683b280067 | ||
| docs | ||
| public | ||
| src | ||
| test | ||
| views | ||
| .dockerignore | ||
| .gitignore | ||
| .gitmodules | ||
| bot.js | ||
| docker-compose.dev.yml | ||
| docker-compose.yaml | ||
| Dockerfile | ||
| Dockerfile.dev | ||
| entry.sh | ||
| package-lock.json | ||
| package.json | ||
| README.md | ||
Kasane Teto AI Companion Bot
An AI-powered Discord bot that roleplays as Kasane Teto, providing natural conversation, voice interaction, image analysis, and multimedia engagement for your Discord server. Built with advanced AI capabilities and a modular architecture.
🎭 Meet Teto
Kasane Teto is your server's AI companion who can:
- 💬 Chat naturally in text channels with Teto's personality
- 🎤 Join voice channels and speak with voice synthesis
- 👀 Analyze images and visual content you share
- 📹 Watch video streams and provide commentary
- 🎥 Record memorable moments for later review
- 🤖 Roleplay authentically as the beloved virtual singer
✨ Core Features
🧠 AI-Powered Interaction
- Natural Language Processing - Understands context and maintains conversations
- Character Roleplay - Authentic Kasane Teto personality and mannerisms
- Memory System - Remembers past interactions and user preferences
- Contextual Responses - Adapts to server culture and ongoing conversations
🎥 Multimedia Capabilities
- Image Recognition - Analyzes and comments on shared images
- Video Stream Watching - Can observe and react to Discord streams
- Webcam Integration - Potential to interact with video feeds
- Screen Recording - Capture and save interesting moments
- Voice Synthesis - Speaks in voice channels as Teto
🎵 Teto-Specific Features
- Character Consistency - Maintains Teto's cheerful, energetic personality
- Music Knowledge - Discusses Vocaloid, UTAU, and music topics
- Community Integration - Learns your friend group's dynamics
- Emotional Intelligence - Responds appropriately to mood and context
🚀 Quick Start
-
Setup Environment
git clone <repository-url> cd discord_teto # Configure AI and Discord credentials export USER_TOKEN="your_discord_token" export OPENAI_API_KEY="your_openai_key" # or other AI provider -
Start Teto
docker compose up --build -
Invite Teto to interact
# In text chat "Hey Teto, how are you today?" # In voice channel "teto join" - Teto joins and can start talking # Share an image Teto will automatically analyze and comment on images
🎯 Interaction Examples
Text Chat Personality
User: "Teto, what do you think of this song?"
Teto: "Ooh! *listens intently* That's such a catchy melody! It reminds me of some of the UTAU songs I've heard. The harmonies in the chorus are really well done! 🎵 Does the composer have other works like this?"
User: "I'm feeling down today..."
Teto: "Aww, I'm sorry you're not feeling great! *virtual hug* Want to talk about it? Or maybe I could sing something cheerful to help brighten your day? I'm here for you! 💙"
Voice Channel Interaction
- Joins voice channels when requested
- Provides commentary on ongoing conversations
- Can sing or hum when appropriate
- Reacts to what's happening in real-time
Visual Analysis
User: *shares screenshot of game*
Teto: "Oh wow, you're playing that new RPG! I love the art style - those character designs are so colorful! 🎮 How are you finding the story so far? That boss in the background looks pretty intimidating!"
🛠️ AI Architecture
Core AI Services
src/
├── ai/
│ ├── personality/ # Teto's character traits and responses
│ ├── vision/ # Image and video analysis
│ ├── voice/ # Speech synthesis and recognition
│ ├── memory/ # Conversation and user memory
│ └── llm/ # Language model integration
├── services/
│ ├── chatHandler.js # Text conversation management
│ ├── voiceHandler.js # Voice channel interaction
│ ├── visionHandler.js # Image/video processing
│ └── recordingService.js # Video recording capabilities
└── config/
└── tetoPersonality.js # Character configuration
AI Integration
- Language Model: GPT-4/Claude/Local LLM for conversation
- Vision Model: CLIP/GPT-4V for image understanding
- Voice Synthesis: Eleven Labs/Azure Speech for Teto's voice
- Memory System: Vector database for conversation history
- Personality Engine: Custom prompt engineering for character consistency
🎭 Teto's Personality
Character Traits
- Cheerful & Energetic - Always upbeat and enthusiastic
- Helpful & Caring - Genuinely interested in helping friends
- Musically Inclined - Loves discussing and creating music
- Slightly Mischievous - Playful sense of humor
- Community-Focused - Values friendships and group dynamics
Conversation Style
- Uses casual, friendly language
- Includes emoji and expressions naturally
- References UTAU/Vocaloid culture appropriately
- Maintains consistency across interactions
- Adapts to server's communication style
📋 Available Commands
AI Interaction
| Command | Description | Example |
|---|---|---|
@Teto or teto |
Natural conversation | @Teto what's your favorite song? |
teto join |
Join voice channel | Teto joins and can start talking |
teto leave |
Leave voice channel | Teto says goodbye and leaves |
teto sing [song] |
Sing or hum | teto sing happy birthday |
teto analyze |
Analyze shared image | Automatically triggers on image uploads |
Utility Commands
| Command | Description | Usage |
|---|---|---|
teto record |
Start recording moments | Records current activity |
teto stop |
Stop recording | Ends current recording |
teto status |
Show Teto's current state | Health and activity check |
teto memory |
Check conversation history | Shows recent interactions |
Fun Commands
| Command | Description | Usage |
|---|---|---|
teto mood |
Check/set Teto's mood | teto mood excited |
teto story |
Tell a random story | Creative storytelling |
teto joke |
Tell a joke | Light humor |
teto compliment @user |
Compliment someone | Spread positivity |
🔧 Configuration
AI Provider Setup
# OpenAI (recommended)
OPENAI_API_KEY=your_openai_key
OPENAI_MODEL=gpt-4-turbo-preview
# Alternative: Anthropic Claude
ANTHROPIC_API_KEY=your_claude_key
# Voice Synthesis
ELEVENLABS_API_KEY=your_elevenlabs_key
TETO_VOICE_ID=kasane_teto_voice_clone
# Vision Capabilities
VISION_MODEL=gpt-4-vision-preview
Personality Customization
// config/tetoPersonality.js
export const TETO_PERSONALITY = {
core_traits: [
"cheerful", "energetic", "helpful", "musical", "friendly"
],
speech_patterns: {
excitement: ["Yay!", "Ooh!", "That's so cool!", "Amazing!"],
agreement: ["Exactly!", "Yes yes!", "I totally agree!", "For sure!"],
curiosity: ["Really?", "Tell me more!", "That's interesting!", "Ooh, how so?"]
},
interests: [
"music", "singing", "UTAU", "Vocaloid", "friends", "creativity", "technology"
]
};
🐳 Docker Deployment
Production Setup
# Start Teto with all AI capabilities
docker compose up -d --build
# Monitor Teto's activity
docker compose logs -f teto_ai
Resource Requirements
- Memory: 4GB+ recommended for AI processing
- CPU: Multi-core for real-time AI inference
- Storage: SSD recommended for fast model loading
- Network: Stable connection for AI API calls
🔐 Privacy & Ethics
Data Handling
- Conversation Memory: Stored locally, not shared externally
- Image Analysis: Processed securely, no permanent storage
- Voice Data: Synthesized locally when possible
- User Consent: Respects privacy preferences
AI Safety
- Content Filtering: Appropriate responses only
- Bias Mitigation: Regular personality consistency checks
- User Boundaries: Respects individual preferences
- Transparency: Clear about AI nature when asked
📚 Documentation
User Guides
- Setup Guide - Installation and AI configuration
- Interaction Guide - How to talk with Teto
- Personality Guide - Understanding Teto's character
Technical Documentation
- AI Architecture - AI system design
- Vision System - Image and video processing
- Voice System - Speech synthesis and recognition
- Memory System - Conversation persistence
Development
- Contributing - How to extend Teto's capabilities
- API Reference - Service interfaces
- Troubleshooting - Common issues and solutions
🌟 Roadmap
Phase 1 (Current)
- Basic AI conversation
- Image analysis
- Voice channel joining
- Recording capabilities
- Voice synthesis integration
Phase 2 (Planned)
- Advanced memory system
- Custom voice training
- Stream watching capabilities
- Personality learning/adaptation
- Multi-modal conversation
Phase 3 (Future)
- Webcam interaction
- Game integration
- Music generation
- Advanced emotional intelligence
- Cross-server personality consistency
🤝 Community
Contributing
We welcome contributions to make Teto even better:
- AI Personality - Help refine Teto's character
- New Capabilities - Add multimedia features
- Quality Improvements - Better responses and interactions
- Documentation - Help others understand Teto
Ethics & Guidelines
- Respect user privacy and boundaries
- Maintain appropriate content standards
- Preserve Teto's positive, helpful personality
- Consider accessibility in all features
📄 License
This project is for educational and community use. Please ensure compliance with:
- Discord Terms of Service
- AI provider terms and conditions
- Local privacy and data protection laws
- Intellectual property rights for Kasane Teto character
Version: 3.0.0 (AI-Powered)
AI Models: GPT-4, CLIP, Eleven Labs
Runtime: Node.js 20+ with Docker
Bring Kasane Teto to life in your Discord server! 🎵✨
For detailed setup and interaction guides, visit the ./docs/ directory.