Content
# 🎙️ YouTube Free Deep Research CLI
[](https://python.org)
[](LICENSE)
[](https://github.com/usemanusai/youtube-free-deep-research-cli)
[](https://github.com/usemanusai)
[](https://github.com/usemanusai/youtube-free-deep-research-cli/releases)
[](https://mseep.ai/app/usemanusai-youtube-free-deep-research-cli)
**Production-ready Python 3.13+ CLI/API system with Adaptive RAG, multi-engine TTS, OpenRouter key rotation, FastAPI backend, and Next.js dashboard.**
A comprehensive, modular platform for deep research, content analysis, and intelligent workflow automation. Features adaptive RAG engine (LangGraph-based), multi-engine TTS (MeloTTS/Chatterbox via Python 3.11 bridge), 31-key OpenRouter rotation with intelligent failover, FastAPI backend with REST API and WebSocket support, Next.js dashboard, MCP server for Claude Desktop integration, and comprehensive testing suite.
## 📋 Table of Contents
- [✨ Features](#-features)
- [🚀 Quick Start](#-quick-start)
- [📦 Installation](#-installation)
- [⚙️ Configuration](#️-configuration)
- [🏗️ Architecture](#️-architecture)
- [📚 Documentation](#-documentation)
- [🧪 Testing](#-testing)
- [🐳 Docker](#-docker)
- [🔄 CI/CD](#-cicd)
- [📝 Contributing](#-contributing)
- [📄 License](#-license)
## ✨ Features
### Core Capabilities
- **Adaptive RAG Engine** - LangGraph-based retrieval-augmented generation
- **Multi-Engine TTS** - MeloTTS, Chatterbox, Edge TTS, gTTS, pyttsx3
- **OpenRouter Integration** - 31-key rotation with intelligent failover
- **FastAPI Backend** - REST API with WebSocket support
- **Next.js Dashboard** - Professional web interface
- **MCP Server** - Claude Desktop integration
- **Comprehensive Testing** - Unit and integration tests
### Advanced Features
- **Python 3.13 Compatibility** - Modern Python support with TTS bridge
- **Modular Architecture** - 60+ files organized into services, API, CLI, utils
- **100% Backward Compatible** - Zero breaking changes
- **Production-Ready** - Comprehensive error handling and logging
- **Advanced Voice Cloning** - 13 prosody parameters for complete voice control
- **Voice Cloning Parameters** - Speed, pitch, emotion, emphasis, pauses, breath patterns, and more
### 🎙️ **Advanced Podcast Generation (NEW in v2.1.0)**
- **14 Professional Podcast Styles**: Interview, Debate, News Report, Educational, Storytelling, Panel Discussion, Documentary, Quick Tips, Deep Dive, Roundup, and more
- **Multi-Voice Support**: Different speakers for different roles (host, expert, moderator, panelists)
- **Customizable Length & Tone**: Short (2-8 min) to Extended (30+ min) with 8 different tone options
- **Intelligent Content Synthesis**: AI-powered script generation with n8n RAG integration
### 📁 **Multi-Source Content Processing (NEW in v2.1.0)**
- **20+ File Types**: PDF, DOCX, TXT, MD, CSV, XLSX, MP3, WAV, MP4, AVI, URLs, YouTube videos/playlists/channels, PPTX, code files, images (OCR)
- **Advanced Filtering**: Date range, file type, size, tags, location-based filtering
- **Batch Processing**: Handle hundreds of sources efficiently with parallel processing
- **Smart Content Prioritization**: AI selects most relevant content automatically
### 📋 **Blueprint Generation (NEW in v2.1.0)**
- **5 Blueprint Styles**: Comprehensive, Executive, Technical, Educational, Reference
- **Multiple Output Formats**: Markdown, PDF, HTML, DOCX, JSON
- **Intelligent Documentation**: AI-powered synthesis from multiple sources
- **Structured Output**: Table of contents, citations, metadata, and professional formatting
### 💬 **Interactive Chat Interface (NEW in v2.1.0)**
- **Rich Terminal UI**: Beautiful formatting with syntax highlighting, tables, and markdown rendering
- **Session Management**: Save, load, resume conversations with full history
- **Real-time Streaming**: Live responses from n8n RAG workflows
- **Export Capabilities**: JSON and Markdown export of chat sessions
### 🔄 **Workflow Management (NEW in v2.1.0)**
- **Multiple n8n Workflows**: Manage different RAG workflows for various use cases
- **Connection Testing**: Automated workflow health checks
- **Default Workflow**: Set preferred workflows for different tasks
- **Import/Export**: Backup and share workflow configurations
### YouTube Integration
- **Interactive chat** with YouTube video transcripts using advanced AI models
- **Automated channel monitoring** with configurable intervals (daily, weekly, custom)
- **Bulk import** from channels, playlists, and URL files with comprehensive filtering
- **Advanced filtering** by duration, keywords, view count, exclude shorts/live streams
- **Video metadata extraction** and persistent storage with SQLite database
- **Intelligent transcript processing** with punctuation restoration and formatting
### Text-to-Speech (TTS)
- **Support for 6 TTS libraries**: Kokoro, OpenVoice v2, MeloTTS, Chatterbox, Edge TTS, Google TTS
- **Automated installer** with CPU-only support for compatibility
- **Configurable voice selection** and audio settings per library
- **Retry logic and timeout handling** for robust audio generation
- **Podcast-style audio overviews** with natural speech patterns
### Intelligent Rate Limiting
- **Smart queue system** to prevent YouTube IP blocking and API quota exhaustion
- **Maximum 5 videos per day** processing limit (configurable)
- **1-2 hour delays** between video processing attempts with smart distribution
- **Exponential backoff** on rate limit detection (2+ hour delays)
- **Distributed processing** throughout the day instead of bulk operations
- **Automatic rescheduling** of failed videos with intelligent retry logic
### Background Service
- **Automated channel monitoring** with APScheduler for cross-platform scheduling
- **Daily video discovery scans** at configurable times (default: 8 AM)
- **Continuous queue processing** every 2 hours respecting rate limits
- **Health checks and stuck job detection** every 30 minutes
## 🚀 Quick Start Options
### 📦 **Python Package (PyPI)**
```bash
pip install youtube-chat-cli
youtube-chat --help
```
### 🤖 **MCP Server for AI Assistants (npm)**
```bash
npx jaegis-youtube-chat-mcp
```
### 🔌 **MCP Client Configuration**
```json
{
"mcpServers": {
"jaegis-youtube-chat": {
"command": "npx",
"args": ["jaegis-youtube-chat-mcp"]
}
}
}
```
## 🚀 Quick Start
### Installation
```bash
# Clone repository
git clone https://github.com/usemanusai/youtube-free-deep-research-cli.git
cd youtube-free-deep-research-cli
# Create virtual environment
python -m venv venv
source venv/bin/activate # On Windows: venv\Scripts\activate
# Install dependencies
pip install -r requirements.txt
# Run API server
uvicorn youtube_chat_cli_main.api_server:app --reload --port 8556
```
### Health Endpoints
- `GET /health/live` - Liveness probe
- `GET /health/ready` - Readiness probe
## 📦 Installation
### From Source
```bash
# Install with development dependencies
pip install -e .
pip install -r requirements.txt -r youtube_chat_cli_main/api_requirements.txt
# Run tests
pytest -q
```
### Using uv (Recommended)
```bash
# Install uv
python -m pip install -U uv
# Lock dependencies
uv lock
# Run tests
uv run --with dev pytest -q
```
### Docker
```bash
docker build -t jaegis-api .
docker run --rm -p 8556:8556 jaegis-api
```
## ⚙️ Configuration
### Environment Variables
Create a `.env` file:
```bash
# API Configuration
API_HOST=0.0.0.0
API_PORT=8556
# LLM Configuration
OPENROUTER_API_KEYS=key1,key2,key3,... # 31 keys for rotation
# Database
DATABASE_URL=sqlite:///./data.db
# Logging
LOG_LEVEL=INFO
```
## 🏗️ Architecture
### System Overview
```
youtube_chat_cli_main/
├── services/ # 33 service modules
│ ├── llm/ # LLM implementations
│ ├── tts/ # TTS orchestrator & engines
│ ├── rag/ # RAG engine
│ ├── content/ # Content processing
│ ├── search/ # Search services
│ ├── storage/ # Vector store & sessions
│ ├── integration/ # Google Drive, n8n, embeddings
│ └── background/ # Background tasks
├── api/ # 13 API modules
│ ├── routes/ # API endpoints
│ ├── models/ # Request/response models
│ ├── middleware/ # CORS, error handling
│ └── server.py # FastAPI factory
├── cli/ # 6 CLI modules
│ └── commands/ # Command implementations
├── utils/ # 4 utility modules
└── tests/ # 4 test packages
```
## 🎙️ Advanced Voice Cloning Parameters (v2.0.0)
Generate natural-sounding speech with complete prosody control using 13 advanced parameters:
### CLI Usage
```bash
# Basic generation
youtube-chat voice-clone generate --voice-id 1 --text "Hello world" --output output.wav
# With emotion and pitch
youtube-chat voice-clone generate --voice-id 1 --text "I'm so happy!" \
--emotion happy --pitch 3 --exaggeration 0.7 --output happy.wav
# Professional presentation
youtube-chat voice-clone generate --voice-id 1 --text "Welcome to our report" \
--speed 1.0 --pitch 1 --emotion confident --emphasis "earnings" "report" \
--output presentation.wav
# View available presets
youtube-chat voice-clone presets
```
### Available Parameters
- **Speed** (0.5-2.0) - Speech rate multiplier
- **Pitch** (-12 to +12) - Pitch adjustment in semitones
- **Exaggeration** (0.25-2.0) - Emotional intensity
- **Emotion** (10 types) - neutral, happy, sad, angry, joyful, excited, surprised, curious, confident, empathetic
- **Emphasis** - Words to emphasize in speech
- **Pauses** - Silence before/after speech (0-5000ms)
- **Breath Frequency** - Insert breath every N words
- **CFG Weight, Temperature, Top P, Min P** - Advanced generation parameters
- **Seed** - For reproducible results
### Documentation
- **[Parameter Tuning Guide](docs/voice-cloning/PARAMETER_TUNING_GUIDE.md)** - Complete parameter reference
- **[CLI Usage Guide](docs/voice-cloning/CLI_ADVANCED_USAGE.md)** - CLI examples and patterns
- **[API Reference](docs/voice-cloning/API_ADVANCED_REFERENCE.md)** - REST API documentation
## 📚 Documentation
Comprehensive documentation is available in the `/docs` directory:
- **[Getting Started](docs/getting-started/)** - Installation and setup guides
- **[Architecture](docs/architecture/)** - System design and modular structure
- **[API Reference](docs/api/)** - REST API documentation
- **[Guides](docs/guides/)** - User guides and tutorials
- **[Development](docs/development/)** - Contributing and deployment
- **[Integrations](docs/integrations/)** - N8N, Google Drive, MCP server
- **[Voice Cloning](docs/voice-cloning/)** - Advanced voice cloning with prosody control
## 🧪 Testing
```bash
# Run all tests
pytest -q
# Run with coverage
pytest --cov=youtube_chat_cli_main
# Run specific test file
pytest tests/test_api_endpoints.py
# Run with verbose output
pytest -v
```
## 🐳 Docker
```bash
# Build image
docker build -t jaegis-api .
# Run container
docker run --rm -p 8556:8556 jaegis-api
# Run with environment file
docker run --rm -p 8556:8556 --env-file .env jaegis-api
```
## 🔄 CI/CD
GitHub Actions workflows:
- **Quality Assurance** - `.github/workflows/quality-assurance.yml`
- Lint, test, and coverage on Ubuntu, Windows, macOS
- **Security Audit** - `.github/workflows/security-audit.yml`
- Semgrep, Bandit, Gitleaks, ESLint
## 📝 Contributing
1. Fork the repository
2. Create a feature branch (`git checkout -b feature/amazing-feature`)
3. Commit your changes (`git commit -m 'feat: Add amazing feature'`)
4. Push to the branch (`git push origin feature/amazing-feature`)
5. Open a Pull Request
See [CONTRIBUTING.md](CONTRIBUTING.md) for detailed guidelines.
## 📄 License
This project is licensed under the MIT License - see the [LICENSE](LICENSE) file for details.
---
**Made with ❤️ for researchers and content creators**
Transform your research into actionable insights with AI-powered intelligence! 🚀✨
Connection Info
You Might Also Like
markitdown
Python tool for converting files and office documents to Markdown.
Fetch
Retrieve and process content from web pages by converting HTML into markdown format.
chatbox
User-friendly Desktop Client App for AI Models/LLMs (GPT, Claude, Gemini, Ollama...)
oh-my-opencode
Background agents · Curated agents like oracle, librarians, frontend...
continue
Continue is an open-source project for seamless server management.
semantic-kernel
Build and deploy intelligent AI agents with Semantic Kernel's orchestration...