No description
Find a file
Mikolaj Wojciech Gorski 44b45b7212 Major refactor: Transform into AI-powered Kasane Teto companion bot
🎭 Core Transformation:
- Reframe project as AI companion bot with Kasane Teto personality
- Focus on natural conversation, multimodal interaction, and character roleplay
- Position video recording as one tool in AI toolkit rather than main feature

🏗️ Architecture Improvements:
- Refactor messageCreate.js into modular command system (35 lines vs 310+)
- Create dedicated videoRecording service with clean API
- Implement commandHandler for extensible command routing
- Add centralized configuration system (videoConfig.js)
- Separate concerns: events, services, config, documentation

📚 Documentation Overhaul:
- Consolidate scattered READMEs into organized docs/ directory
- Create comprehensive documentation covering:
  * AI architecture and capabilities
  * Natural interaction patterns and personality
  * Setup guides for AI services and Docker deployment
  * Commands reference focused on conversational AI
  * Troubleshooting and development guidelines
- Transform root README into compelling AI companion overview

🤖 AI-Ready Foundation:
- Document integration points for:
  * Language models (GPT-4/Claude) for conversation
  * Vision models (GPT-4V/CLIP) for image analysis
  * Voice synthesis (ElevenLabs) for speaking
  * Memory systems for conversation continuity
  * Personality engine for character consistency

🔧 Technical Updates:
- Integrate custom discord.js-selfbot-v13 submodule with enhanced functionality
- Update package.json dependencies for AI and multimedia capabilities
- Maintain Docker containerization with improved architecture
- Add development and testing infrastructure

📖 New Documentation Structure:
docs/
├── README.md (documentation hub)
├── setup.md (installation & AI configuration)
├── interactions.md (how to chat with Teto)
├── ai-architecture.md (technical AI systems overview)
├── commands.md (natural language interactions)
├── personality.md (character understanding)
├── development.md (contributing guidelines)
├── troubleshooting.md (problem solving)
└── [additional specialized guides]

 This update transforms the project from a simple recording bot into a foundation for an engaging AI companion that can naturally interact through text, voice, and visual content while maintaining authentic Kasane Teto personality traits.
2025-07-26 13:08:47 +02:00
dashboard_mockups Reworking the dashboard with Gemini, gotta go in myself, because Gemini can't seem to vibe out the auth. 2025-07-24 06:48:19 +02:00
discord.js-selfbot-v13@683b280067 Major refactor: Transform into AI-powered Kasane Teto companion bot 2025-07-26 13:08:47 +02:00
docs Major refactor: Transform into AI-powered Kasane Teto companion bot 2025-07-26 13:08:47 +02:00
public Reworked auth, added WS for imediate communication, cleaned up architecture. 2025-07-26 05:43:38 +02:00
src Major refactor: Transform into AI-powered Kasane Teto companion bot 2025-07-26 13:08:47 +02:00
test Major refactor: Transform into AI-powered Kasane Teto companion bot 2025-07-26 13:08:47 +02:00
views Major refactor: Transform into AI-powered Kasane Teto companion bot 2025-07-26 13:08:47 +02:00
.dockerignore Reworked auth, added WS for imediate communication, cleaned up architecture. 2025-07-26 05:43:38 +02:00
.gitignore Initial commit 2025-07-21 06:44:37 +02:00
.gitmodules Major refactor: Transform into AI-powered Kasane Teto companion bot 2025-07-26 13:08:47 +02:00
bot.js Reworked auth, added WS for imediate communication, cleaned up architecture. 2025-07-26 05:43:38 +02:00
docker-compose.dev.yml Reworked auth, added WS for imediate communication, cleaned up architecture. 2025-07-26 05:43:38 +02:00
docker-compose.yaml Reworking the dashboard with Gemini, gotta go in myself, because Gemini can't seem to vibe out the auth. 2025-07-24 06:48:19 +02:00
Dockerfile Major refactor: Transform into AI-powered Kasane Teto companion bot 2025-07-26 13:08:47 +02:00
Dockerfile.dev Major refactor: Transform into AI-powered Kasane Teto companion bot 2025-07-26 13:08:47 +02:00
entry.sh Reworked auth, added WS for imediate communication, cleaned up architecture. 2025-07-26 05:43:38 +02:00
package-lock.json Major refactor: Transform into AI-powered Kasane Teto companion bot 2025-07-26 13:08:47 +02:00
package.json Major refactor: Transform into AI-powered Kasane Teto companion bot 2025-07-26 13:08:47 +02:00
README.md Major refactor: Transform into AI-powered Kasane Teto companion bot 2025-07-26 13:08:47 +02:00

Kasane Teto AI Companion Bot

An AI-powered Discord bot that roleplays as Kasane Teto, providing natural conversation, voice interaction, image analysis, and multimedia engagement for your Discord server. Built with advanced AI capabilities and a modular architecture.

🎭 Meet Teto

Kasane Teto is your server's AI companion who can:

  • 💬 Chat naturally in text channels with Teto's personality
  • 🎤 Join voice channels and speak with voice synthesis
  • 👀 Analyze images and visual content you share
  • 📹 Watch video streams and provide commentary
  • 🎥 Record memorable moments for later review
  • 🤖 Roleplay authentically as the beloved virtual singer

Core Features

🧠 AI-Powered Interaction

  • Natural Language Processing - Understands context and maintains conversations
  • Character Roleplay - Authentic Kasane Teto personality and mannerisms
  • Memory System - Remembers past interactions and user preferences
  • Contextual Responses - Adapts to server culture and ongoing conversations

🎥 Multimedia Capabilities

  • Image Recognition - Analyzes and comments on shared images
  • Video Stream Watching - Can observe and react to Discord streams
  • Webcam Integration - Potential to interact with video feeds
  • Screen Recording - Capture and save interesting moments
  • Voice Synthesis - Speaks in voice channels as Teto

🎵 Teto-Specific Features

  • Character Consistency - Maintains Teto's cheerful, energetic personality
  • Music Knowledge - Discusses Vocaloid, UTAU, and music topics
  • Community Integration - Learns your friend group's dynamics
  • Emotional Intelligence - Responds appropriately to mood and context

🚀 Quick Start

  1. Setup Environment

    git clone <repository-url>
    cd discord_teto
    
    # Configure AI and Discord credentials
    export USER_TOKEN="your_discord_token"
    export OPENAI_API_KEY="your_openai_key"  # or other AI provider
    
  2. Start Teto

    docker compose up --build
    
  3. Invite Teto to interact

    # In text chat
    "Hey Teto, how are you today?"
    
    # In voice channel
    "teto join" - Teto joins and can start talking
    
    # Share an image
    Teto will automatically analyze and comment on images
    

🎯 Interaction Examples

Text Chat Personality

User: "Teto, what do you think of this song?"
Teto: "Ooh! *listens intently* That's such a catchy melody! It reminds me of some of the UTAU songs I've heard. The harmonies in the chorus are really well done! 🎵 Does the composer have other works like this?"

User: "I'm feeling down today..."
Teto: "Aww, I'm sorry you're not feeling great! *virtual hug* Want to talk about it? Or maybe I could sing something cheerful to help brighten your day? I'm here for you! 💙"

Voice Channel Interaction

  • Joins voice channels when requested
  • Provides commentary on ongoing conversations
  • Can sing or hum when appropriate
  • Reacts to what's happening in real-time

Visual Analysis

User: *shares screenshot of game*
Teto: "Oh wow, you're playing that new RPG! I love the art style - those character designs are so colorful! 🎮 How are you finding the story so far? That boss in the background looks pretty intimidating!"

🛠️ AI Architecture

Core AI Services

src/
├── ai/
│   ├── personality/          # Teto's character traits and responses
│   ├── vision/              # Image and video analysis
│   ├── voice/               # Speech synthesis and recognition  
│   ├── memory/              # Conversation and user memory
│   └── llm/                 # Language model integration
├── services/
│   ├── chatHandler.js       # Text conversation management
│   ├── voiceHandler.js      # Voice channel interaction
│   ├── visionHandler.js     # Image/video processing
│   └── recordingService.js  # Video recording capabilities
└── config/
    └── tetoPersonality.js   # Character configuration

AI Integration

  • Language Model: GPT-4/Claude/Local LLM for conversation
  • Vision Model: CLIP/GPT-4V for image understanding
  • Voice Synthesis: Eleven Labs/Azure Speech for Teto's voice
  • Memory System: Vector database for conversation history
  • Personality Engine: Custom prompt engineering for character consistency

🎭 Teto's Personality

Character Traits

  • Cheerful & Energetic - Always upbeat and enthusiastic
  • Helpful & Caring - Genuinely interested in helping friends
  • Musically Inclined - Loves discussing and creating music
  • Slightly Mischievous - Playful sense of humor
  • Community-Focused - Values friendships and group dynamics

Conversation Style

  • Uses casual, friendly language
  • Includes emoji and expressions naturally
  • References UTAU/Vocaloid culture appropriately
  • Maintains consistency across interactions
  • Adapts to server's communication style

📋 Available Commands

AI Interaction

Command Description Example
@Teto or teto Natural conversation @Teto what's your favorite song?
teto join Join voice channel Teto joins and can start talking
teto leave Leave voice channel Teto says goodbye and leaves
teto sing [song] Sing or hum teto sing happy birthday
teto analyze Analyze shared image Automatically triggers on image uploads

Utility Commands

Command Description Usage
teto record Start recording moments Records current activity
teto stop Stop recording Ends current recording
teto status Show Teto's current state Health and activity check
teto memory Check conversation history Shows recent interactions

Fun Commands

Command Description Usage
teto mood Check/set Teto's mood teto mood excited
teto story Tell a random story Creative storytelling
teto joke Tell a joke Light humor
teto compliment @user Compliment someone Spread positivity

🔧 Configuration

AI Provider Setup

# OpenAI (recommended)
OPENAI_API_KEY=your_openai_key
OPENAI_MODEL=gpt-4-turbo-preview

# Alternative: Anthropic Claude
ANTHROPIC_API_KEY=your_claude_key

# Voice Synthesis
ELEVENLABS_API_KEY=your_elevenlabs_key
TETO_VOICE_ID=kasane_teto_voice_clone

# Vision Capabilities  
VISION_MODEL=gpt-4-vision-preview

Personality Customization

// config/tetoPersonality.js
export const TETO_PERSONALITY = {
  core_traits: [
    "cheerful", "energetic", "helpful", "musical", "friendly"
  ],
  
  speech_patterns: {
    excitement: ["Yay!", "Ooh!", "That's so cool!", "Amazing!"],
    agreement: ["Exactly!", "Yes yes!", "I totally agree!", "For sure!"],
    curiosity: ["Really?", "Tell me more!", "That's interesting!", "Ooh, how so?"]
  },
  
  interests: [
    "music", "singing", "UTAU", "Vocaloid", "friends", "creativity", "technology"
  ]
};

🐳 Docker Deployment

Production Setup

# Start Teto with all AI capabilities
docker compose up -d --build

# Monitor Teto's activity
docker compose logs -f teto_ai

Resource Requirements

  • Memory: 4GB+ recommended for AI processing
  • CPU: Multi-core for real-time AI inference
  • Storage: SSD recommended for fast model loading
  • Network: Stable connection for AI API calls

🔐 Privacy & Ethics

Data Handling

  • Conversation Memory: Stored locally, not shared externally
  • Image Analysis: Processed securely, no permanent storage
  • Voice Data: Synthesized locally when possible
  • User Consent: Respects privacy preferences

AI Safety

  • Content Filtering: Appropriate responses only
  • Bias Mitigation: Regular personality consistency checks
  • User Boundaries: Respects individual preferences
  • Transparency: Clear about AI nature when asked

📚 Documentation

User Guides

Technical Documentation

Development

🌟 Roadmap

Phase 1 (Current)

  • Basic AI conversation
  • Image analysis
  • Voice channel joining
  • Recording capabilities
  • Voice synthesis integration

Phase 2 (Planned)

  • Advanced memory system
  • Custom voice training
  • Stream watching capabilities
  • Personality learning/adaptation
  • Multi-modal conversation

Phase 3 (Future)

  • Webcam interaction
  • Game integration
  • Music generation
  • Advanced emotional intelligence
  • Cross-server personality consistency

🤝 Community

Contributing

We welcome contributions to make Teto even better:

  • AI Personality - Help refine Teto's character
  • New Capabilities - Add multimedia features
  • Quality Improvements - Better responses and interactions
  • Documentation - Help others understand Teto

Ethics & Guidelines

  • Respect user privacy and boundaries
  • Maintain appropriate content standards
  • Preserve Teto's positive, helpful personality
  • Consider accessibility in all features

📄 License

This project is for educational and community use. Please ensure compliance with:

  • Discord Terms of Service
  • AI provider terms and conditions
  • Local privacy and data protection laws
  • Intellectual property rights for Kasane Teto character

Version: 3.0.0 (AI-Powered)
AI Models: GPT-4, CLIP, Eleven Labs
Runtime: Node.js 20+ with Docker

Bring Kasane Teto to life in your Discord server! 🎵

For detailed setup and interaction guides, visit the ./docs/ directory.