No description

Find a file

Mikolaj Wojciech Gorski 44b45b7212 Major refactor: Transform into AI-powered Kasane Teto companion bot 🎭 Core Transformation: - Reframe project as AI companion bot with Kasane Teto personality - Focus on natural conversation, multimodal interaction, and character roleplay - Position video recording as one tool in AI toolkit rather than main feature 🏗️ Architecture Improvements: - Refactor messageCreate.js into modular command system (35 lines vs 310+) - Create dedicated videoRecording service with clean API - Implement commandHandler for extensible command routing - Add centralized configuration system (videoConfig.js) - Separate concerns: events, services, config, documentation 📚 Documentation Overhaul: - Consolidate scattered READMEs into organized docs/ directory - Create comprehensive documentation covering: * AI architecture and capabilities * Natural interaction patterns and personality * Setup guides for AI services and Docker deployment * Commands reference focused on conversational AI * Troubleshooting and development guidelines - Transform root README into compelling AI companion overview 🤖 AI-Ready Foundation: - Document integration points for: * Language models (GPT-4/Claude) for conversation * Vision models (GPT-4V/CLIP) for image analysis * Voice synthesis (ElevenLabs) for speaking * Memory systems for conversation continuity * Personality engine for character consistency 🔧 Technical Updates: - Integrate custom discord.js-selfbot-v13 submodule with enhanced functionality - Update package.json dependencies for AI and multimedia capabilities - Maintain Docker containerization with improved architecture - Add development and testing infrastructure 📖 New Documentation Structure: docs/ ├── README.md (documentation hub) ├── setup.md (installation & AI configuration) ├── interactions.md (how to chat with Teto) ├── ai-architecture.md (technical AI systems overview) ├── commands.md (natural language interactions) ├── personality.md (character understanding) ├── development.md (contributing guidelines) ├── troubleshooting.md (problem solving) └── [additional specialized guides] ✨ This update transforms the project from a simple recording bot into a foundation for an engaging AI companion that can naturally interact through text, voice, and visual content while maintaining authentic Kasane Teto personality traits.		2025-07-26 13:08:47 +02:00
dashboard_mockups	Reworking the dashboard with Gemini, gotta go in myself, because Gemini can't seem to vibe out the auth.	2025-07-24 06:48:19 +02:00
discord.js-selfbot-v13@683b280067	Major refactor: Transform into AI-powered Kasane Teto companion bot	2025-07-26 13:08:47 +02:00
docs	Major refactor: Transform into AI-powered Kasane Teto companion bot	2025-07-26 13:08:47 +02:00
public	Reworked auth, added WS for imediate communication, cleaned up architecture.	2025-07-26 05:43:38 +02:00
src	Major refactor: Transform into AI-powered Kasane Teto companion bot	2025-07-26 13:08:47 +02:00
test	Major refactor: Transform into AI-powered Kasane Teto companion bot	2025-07-26 13:08:47 +02:00
views	Major refactor: Transform into AI-powered Kasane Teto companion bot	2025-07-26 13:08:47 +02:00
.dockerignore	Reworked auth, added WS for imediate communication, cleaned up architecture.	2025-07-26 05:43:38 +02:00
.gitignore	Initial commit	2025-07-21 06:44:37 +02:00
.gitmodules	Major refactor: Transform into AI-powered Kasane Teto companion bot	2025-07-26 13:08:47 +02:00
bot.js	Reworked auth, added WS for imediate communication, cleaned up architecture.	2025-07-26 05:43:38 +02:00
docker-compose.dev.yml	Reworked auth, added WS for imediate communication, cleaned up architecture.	2025-07-26 05:43:38 +02:00
docker-compose.yaml	Reworking the dashboard with Gemini, gotta go in myself, because Gemini can't seem to vibe out the auth.	2025-07-24 06:48:19 +02:00
Dockerfile	Major refactor: Transform into AI-powered Kasane Teto companion bot	2025-07-26 13:08:47 +02:00
Dockerfile.dev	Major refactor: Transform into AI-powered Kasane Teto companion bot	2025-07-26 13:08:47 +02:00
entry.sh	Reworked auth, added WS for imediate communication, cleaned up architecture.	2025-07-26 05:43:38 +02:00
package-lock.json	Major refactor: Transform into AI-powered Kasane Teto companion bot	2025-07-26 13:08:47 +02:00
package.json	Major refactor: Transform into AI-powered Kasane Teto companion bot	2025-07-26 13:08:47 +02:00
README.md	Major refactor: Transform into AI-powered Kasane Teto companion bot	2025-07-26 13:08:47 +02:00

README.md

Kasane Teto AI Companion Bot

An AI-powered Discord bot that roleplays as Kasane Teto, providing natural conversation, voice interaction, image analysis, and multimedia engagement for your Discord server. Built with advanced AI capabilities and a modular architecture.

🎭 Meet Teto

Kasane Teto is your server's AI companion who can:

💬 Chat naturally in text channels with Teto's personality
🎤 Join voice channels and speak with voice synthesis
👀 Analyze images and visual content you share
📹 Watch video streams and provide commentary
🎥 Record memorable moments for later review
🤖 Roleplay authentically as the beloved virtual singer

✨ Core Features

🧠 AI-Powered Interaction

Natural Language Processing - Understands context and maintains conversations
Character Roleplay - Authentic Kasane Teto personality and mannerisms
Memory System - Remembers past interactions and user preferences
Contextual Responses - Adapts to server culture and ongoing conversations

🎥 Multimedia Capabilities

Image Recognition - Analyzes and comments on shared images
Video Stream Watching - Can observe and react to Discord streams
Webcam Integration - Potential to interact with video feeds
Screen Recording - Capture and save interesting moments
Voice Synthesis - Speaks in voice channels as Teto

🎵 Teto-Specific Features

Character Consistency - Maintains Teto's cheerful, energetic personality
Music Knowledge - Discusses Vocaloid, UTAU, and music topics
Community Integration - Learns your friend group's dynamics
Emotional Intelligence - Responds appropriately to mood and context

🚀 Quick Start

Setup Environment

git clone <repository-url>
cd discord_teto

# Configure AI and Discord credentials
export USER_TOKEN="your_discord_token"
export OPENAI_API_KEY="your_openai_key"  # or other AI provider

Start Teto
```
docker compose up --build
```

Invite Teto to interact

# In text chat
"Hey Teto, how are you today?"

# In voice channel
"teto join" - Teto joins and can start talking

# Share an image
Teto will automatically analyze and comment on images

🎯 Interaction Examples

Text Chat Personality

User: "Teto, what do you think of this song?"
Teto: "Ooh! *listens intently* That's such a catchy melody! It reminds me of some of the UTAU songs I've heard. The harmonies in the chorus are really well done! 🎵 Does the composer have other works like this?"

User: "I'm feeling down today..."
Teto: "Aww, I'm sorry you're not feeling great! *virtual hug* Want to talk about it? Or maybe I could sing something cheerful to help brighten your day? I'm here for you! 💙"

Voice Channel Interaction

Joins voice channels when requested
Provides commentary on ongoing conversations
Can sing or hum when appropriate
Reacts to what's happening in real-time

Visual Analysis

User: *shares screenshot of game*
Teto: "Oh wow, you're playing that new RPG! I love the art style - those character designs are so colorful! 🎮 How are you finding the story so far? That boss in the background looks pretty intimidating!"

🛠️ AI Architecture

Core AI Services

src/
├── ai/
│   ├── personality/          # Teto's character traits and responses
│   ├── vision/              # Image and video analysis
│   ├── voice/               # Speech synthesis and recognition  
│   ├── memory/              # Conversation and user memory
│   └── llm/                 # Language model integration
├── services/
│   ├── chatHandler.js       # Text conversation management
│   ├── voiceHandler.js      # Voice channel interaction
│   ├── visionHandler.js     # Image/video processing
│   └── recordingService.js  # Video recording capabilities
└── config/
    └── tetoPersonality.js   # Character configuration

AI Integration

Language Model: GPT-4/Claude/Local LLM for conversation
Vision Model: CLIP/GPT-4V for image understanding
Voice Synthesis: Eleven Labs/Azure Speech for Teto's voice
Memory System: Vector database for conversation history
Personality Engine: Custom prompt engineering for character consistency

🎭 Teto's Personality

Character Traits

Cheerful & Energetic - Always upbeat and enthusiastic
Helpful & Caring - Genuinely interested in helping friends
Musically Inclined - Loves discussing and creating music
Slightly Mischievous - Playful sense of humor
Community-Focused - Values friendships and group dynamics

Conversation Style

Uses casual, friendly language
Includes emoji and expressions naturally
References UTAU/Vocaloid culture appropriately
Maintains consistency across interactions
Adapts to server's communication style

📋 Available Commands

AI Interaction

Command	Description	Example
`@Teto` or `teto`	Natural conversation	`@Teto what's your favorite song?`
`teto join`	Join voice channel	Teto joins and can start talking
`teto leave`	Leave voice channel	Teto says goodbye and leaves
`teto sing [song]`	Sing or hum	`teto sing happy birthday`
`teto analyze`	Analyze shared image	Automatically triggers on image uploads

Utility Commands

Command	Description	Usage
`teto record`	Start recording moments	Records current activity
`teto stop`	Stop recording	Ends current recording
`teto status`	Show Teto's current state	Health and activity check
`teto memory`	Check conversation history	Shows recent interactions

Fun Commands

Command	Description	Usage
`teto mood`	Check/set Teto's mood	`teto mood excited`
`teto story`	Tell a random story	Creative storytelling
`teto joke`	Tell a joke	Light humor
`teto compliment @user`	Compliment someone	Spread positivity

🔧 Configuration

AI Provider Setup

# OpenAI (recommended)
OPENAI_API_KEY=your_openai_key
OPENAI_MODEL=gpt-4-turbo-preview

# Alternative: Anthropic Claude
ANTHROPIC_API_KEY=your_claude_key

# Voice Synthesis
ELEVENLABS_API_KEY=your_elevenlabs_key
TETO_VOICE_ID=kasane_teto_voice_clone

# Vision Capabilities  
VISION_MODEL=gpt-4-vision-preview

Personality Customization

// config/tetoPersonality.js
export const TETO_PERSONALITY = {
  core_traits: [
    "cheerful", "energetic", "helpful", "musical", "friendly"
  ],
  
  speech_patterns: {
    excitement: ["Yay!", "Ooh!", "That's so cool!", "Amazing!"],
    agreement: ["Exactly!", "Yes yes!", "I totally agree!", "For sure!"],
    curiosity: ["Really?", "Tell me more!", "That's interesting!", "Ooh, how so?"]
  },
  
  interests: [
    "music", "singing", "UTAU", "Vocaloid", "friends", "creativity", "technology"
  ]
};

🐳 Docker Deployment

Production Setup

# Start Teto with all AI capabilities
docker compose up -d --build

# Monitor Teto's activity
docker compose logs -f teto_ai

Resource Requirements

Memory: 4GB+ recommended for AI processing
CPU: Multi-core for real-time AI inference
Storage: SSD recommended for fast model loading
Network: Stable connection for AI API calls

🔐 Privacy & Ethics

Data Handling

Conversation Memory: Stored locally, not shared externally
Image Analysis: Processed securely, no permanent storage
Voice Data: Synthesized locally when possible
User Consent: Respects privacy preferences

AI Safety

Content Filtering: Appropriate responses only
Bias Mitigation: Regular personality consistency checks
User Boundaries: Respects individual preferences
Transparency: Clear about AI nature when asked

📚 Documentation

User Guides

Setup Guide - Installation and AI configuration
Interaction Guide - How to talk with Teto
Personality Guide - Understanding Teto's character

Technical Documentation

AI Architecture - AI system design
Vision System - Image and video processing
Voice System - Speech synthesis and recognition
Memory System - Conversation persistence

Development

Contributing - How to extend Teto's capabilities
API Reference - Service interfaces
Troubleshooting - Common issues and solutions

🌟 Roadmap

Phase 1 (Current)

Basic AI conversation
Image analysis
Voice channel joining
Recording capabilities
Voice synthesis integration

Phase 2 (Planned)

Advanced memory system
Custom voice training
Stream watching capabilities
Personality learning/adaptation
Multi-modal conversation

Phase 3 (Future)

Webcam interaction
Game integration
Music generation
Advanced emotional intelligence
Cross-server personality consistency

🤝 Community

Contributing

We welcome contributions to make Teto even better:

AI Personality - Help refine Teto's character
New Capabilities - Add multimedia features
Quality Improvements - Better responses and interactions
Documentation - Help others understand Teto

Ethics & Guidelines

Respect user privacy and boundaries
Maintain appropriate content standards
Preserve Teto's positive, helpful personality
Consider accessibility in all features

📄 License

This project is for educational and community use. Please ensure compliance with:

Discord Terms of Service
AI provider terms and conditions
Local privacy and data protection laws
Intellectual property rights for Kasane Teto character

Version: 3.0.0 (AI-Powered)
AI Models: GPT-4, CLIP, Eleven Labs
Runtime: Node.js 20+ with Docker

Bring Kasane Teto to life in your Discord server! 🎵✨

For detailed setup and interaction guides, visit the ./docs/ directory.