Mikolaj Wojciech Gorski 44b45b7212 Major refactor: Transform into AI-powered Kasane Teto companion bot

🎭 Core Transformation:
- Reframe project as AI companion bot with Kasane Teto personality
- Focus on natural conversation, multimodal interaction, and character roleplay
- Position video recording as one tool in AI toolkit rather than main feature

🏗️ Architecture Improvements:
- Refactor messageCreate.js into modular command system (35 lines vs 310+)
- Create dedicated videoRecording service with clean API
- Implement commandHandler for extensible command routing
- Add centralized configuration system (videoConfig.js)
- Separate concerns: events, services, config, documentation

📚 Documentation Overhaul:
- Consolidate scattered READMEs into organized docs/ directory
- Create comprehensive documentation covering:
  * AI architecture and capabilities
  * Natural interaction patterns and personality
  * Setup guides for AI services and Docker deployment
  * Commands reference focused on conversational AI
  * Troubleshooting and development guidelines
- Transform root README into compelling AI companion overview

🤖 AI-Ready Foundation:
- Document integration points for:
  * Language models (GPT-4/Claude) for conversation
  * Vision models (GPT-4V/CLIP) for image analysis
  * Voice synthesis (ElevenLabs) for speaking
  * Memory systems for conversation continuity
  * Personality engine for character consistency

🔧 Technical Updates:
- Integrate custom discord.js-selfbot-v13 submodule with enhanced functionality
- Update package.json dependencies for AI and multimedia capabilities
- Maintain Docker containerization with improved architecture
- Add development and testing infrastructure

📖 New Documentation Structure:
docs/
├── README.md (documentation hub)
├── setup.md (installation & AI configuration)
├── interactions.md (how to chat with Teto)
├── ai-architecture.md (technical AI systems overview)
├── commands.md (natural language interactions)
├── personality.md (character understanding)
├── development.md (contributing guidelines)
├── troubleshooting.md (problem solving)
└── [additional specialized guides]

✨ This update transforms the project from a simple recording bot into a foundation for an engaging AI companion that can naturally interact through text, voice, and visual content while maintaining authentic Kasane Teto personality traits.

2025-07-26 13:08:47 +02:00

10 KiB

Raw Blame History

Kasane Teto AI Companion Bot

An AI-powered Discord bot that roleplays as Kasane Teto, providing natural conversation, voice interaction, image analysis, and multimedia engagement for your Discord server. Built with advanced AI capabilities and a modular architecture.

🎭 Meet Teto

Kasane Teto is your server's AI companion who can:

💬 Chat naturally in text channels with Teto's personality
🎤 Join voice channels and speak with voice synthesis
👀 Analyze images and visual content you share
📹 Watch video streams and provide commentary
🎥 Record memorable moments for later review
🤖 Roleplay authentically as the beloved virtual singer

✨ Core Features

🧠 AI-Powered Interaction

Natural Language Processing - Understands context and maintains conversations
Character Roleplay - Authentic Kasane Teto personality and mannerisms
Memory System - Remembers past interactions and user preferences
Contextual Responses - Adapts to server culture and ongoing conversations

🎥 Multimedia Capabilities

Image Recognition - Analyzes and comments on shared images
Video Stream Watching - Can observe and react to Discord streams
Webcam Integration - Potential to interact with video feeds
Screen Recording - Capture and save interesting moments
Voice Synthesis - Speaks in voice channels as Teto

🎵 Teto-Specific Features

Character Consistency - Maintains Teto's cheerful, energetic personality
Music Knowledge - Discusses Vocaloid, UTAU, and music topics
Community Integration - Learns your friend group's dynamics
Emotional Intelligence - Responds appropriately to mood and context

🚀 Quick Start

Setup Environment

git clone <repository-url>
cd discord_teto

# Configure AI and Discord credentials
export USER_TOKEN="your_discord_token"
export OPENAI_API_KEY="your_openai_key"  # or other AI provider

Start Teto
```
docker compose up --build
```

Invite Teto to interact

# In text chat
"Hey Teto, how are you today?"

# In voice channel
"teto join" - Teto joins and can start talking

# Share an image
Teto will automatically analyze and comment on images

🎯 Interaction Examples

Text Chat Personality

User: "Teto, what do you think of this song?"
Teto: "Ooh! *listens intently* That's such a catchy melody! It reminds me of some of the UTAU songs I've heard. The harmonies in the chorus are really well done! 🎵 Does the composer have other works like this?"

User: "I'm feeling down today..."
Teto: "Aww, I'm sorry you're not feeling great! *virtual hug* Want to talk about it? Or maybe I could sing something cheerful to help brighten your day? I'm here for you! 💙"

Voice Channel Interaction

Joins voice channels when requested
Provides commentary on ongoing conversations
Can sing or hum when appropriate
Reacts to what's happening in real-time

Visual Analysis

User: *shares screenshot of game*
Teto: "Oh wow, you're playing that new RPG! I love the art style - those character designs are so colorful! 🎮 How are you finding the story so far? That boss in the background looks pretty intimidating!"

🛠️ AI Architecture

Core AI Services

src/
├── ai/
│   ├── personality/          # Teto's character traits and responses
│   ├── vision/              # Image and video analysis
│   ├── voice/               # Speech synthesis and recognition  
│   ├── memory/              # Conversation and user memory
│   └── llm/                 # Language model integration
├── services/
│   ├── chatHandler.js       # Text conversation management
│   ├── voiceHandler.js      # Voice channel interaction
│   ├── visionHandler.js     # Image/video processing
│   └── recordingService.js  # Video recording capabilities
└── config/
    └── tetoPersonality.js   # Character configuration

AI Integration

Language Model: GPT-4/Claude/Local LLM for conversation
Vision Model: CLIP/GPT-4V for image understanding
Voice Synthesis: Eleven Labs/Azure Speech for Teto's voice
Memory System: Vector database for conversation history
Personality Engine: Custom prompt engineering for character consistency

🎭 Teto's Personality

Character Traits

Cheerful & Energetic - Always upbeat and enthusiastic
Helpful & Caring - Genuinely interested in helping friends
Musically Inclined - Loves discussing and creating music
Slightly Mischievous - Playful sense of humor
Community-Focused - Values friendships and group dynamics

Conversation Style

Uses casual, friendly language
Includes emoji and expressions naturally
References UTAU/Vocaloid culture appropriately
Maintains consistency across interactions
Adapts to server's communication style

📋 Available Commands

AI Interaction

Command	Description	Example
`@Teto` or `teto`	Natural conversation	`@Teto what's your favorite song?`
`teto join`	Join voice channel	Teto joins and can start talking
`teto leave`	Leave voice channel	Teto says goodbye and leaves
`teto sing [song]`	Sing or hum	`teto sing happy birthday`
`teto analyze`	Analyze shared image	Automatically triggers on image uploads

Utility Commands

Command	Description	Usage
`teto record`	Start recording moments	Records current activity
`teto stop`	Stop recording	Ends current recording
`teto status`	Show Teto's current state	Health and activity check
`teto memory`	Check conversation history	Shows recent interactions

Fun Commands

Command	Description	Usage
`teto mood`	Check/set Teto's mood	`teto mood excited`
`teto story`	Tell a random story	Creative storytelling
`teto joke`	Tell a joke	Light humor
`teto compliment @user`	Compliment someone	Spread positivity

🔧 Configuration

AI Provider Setup

# OpenAI (recommended)
OPENAI_API_KEY=your_openai_key
OPENAI_MODEL=gpt-4-turbo-preview

# Alternative: Anthropic Claude
ANTHROPIC_API_KEY=your_claude_key

# Voice Synthesis
ELEVENLABS_API_KEY=your_elevenlabs_key
TETO_VOICE_ID=kasane_teto_voice_clone

# Vision Capabilities  
VISION_MODEL=gpt-4-vision-preview

Personality Customization

// config/tetoPersonality.js
export const TETO_PERSONALITY = {
  core_traits: [
    "cheerful", "energetic", "helpful", "musical", "friendly"
  ],
  
  speech_patterns: {
    excitement: ["Yay!", "Ooh!", "That's so cool!", "Amazing!"],
    agreement: ["Exactly!", "Yes yes!", "I totally agree!", "For sure!"],
    curiosity: ["Really?", "Tell me more!", "That's interesting!", "Ooh, how so?"]
  },
  
  interests: [
    "music", "singing", "UTAU", "Vocaloid", "friends", "creativity", "technology"
  ]
};

🐳 Docker Deployment

Production Setup

# Start Teto with all AI capabilities
docker compose up -d --build

# Monitor Teto's activity
docker compose logs -f teto_ai

Resource Requirements

Memory: 4GB+ recommended for AI processing
CPU: Multi-core for real-time AI inference
Storage: SSD recommended for fast model loading
Network: Stable connection for AI API calls

🔐 Privacy & Ethics

Data Handling

Conversation Memory: Stored locally, not shared externally
Image Analysis: Processed securely, no permanent storage
Voice Data: Synthesized locally when possible
User Consent: Respects privacy preferences

AI Safety

Content Filtering: Appropriate responses only
Bias Mitigation: Regular personality consistency checks
User Boundaries: Respects individual preferences
Transparency: Clear about AI nature when asked

📚 Documentation

User Guides

Setup Guide - Installation and AI configuration
Interaction Guide - How to talk with Teto
Personality Guide - Understanding Teto's character

Technical Documentation

AI Architecture - AI system design
Vision System - Image and video processing
Voice System - Speech synthesis and recognition
Memory System - Conversation persistence

Development

Contributing - How to extend Teto's capabilities
API Reference - Service interfaces
Troubleshooting - Common issues and solutions

🌟 Roadmap

Phase 1 (Current)

Basic AI conversation
Image analysis
Voice channel joining
Recording capabilities
Voice synthesis integration

Phase 2 (Planned)

Advanced memory system
Custom voice training
Stream watching capabilities
Personality learning/adaptation
Multi-modal conversation

Phase 3 (Future)

Webcam interaction
Game integration
Music generation
Advanced emotional intelligence
Cross-server personality consistency

🤝 Community

Contributing

We welcome contributions to make Teto even better:

AI Personality - Help refine Teto's character
New Capabilities - Add multimedia features
Quality Improvements - Better responses and interactions
Documentation - Help others understand Teto

Ethics & Guidelines

Respect user privacy and boundaries
Maintain appropriate content standards
Preserve Teto's positive, helpful personality
Consider accessibility in all features

📄 License

This project is for educational and community use. Please ensure compliance with:

Discord Terms of Service
AI provider terms and conditions
Local privacy and data protection laws
Intellectual property rights for Kasane Teto character

Version: 3.0.0 (AI-Powered)
AI Models: GPT-4, CLIP, Eleven Labs
Runtime: Node.js 20+ with Docker

Bring Kasane Teto to life in your Discord server! 🎵✨

For detailed setup and interaction guides, visit the ./docs/ directory.

10 KiB Raw Blame History