Automate YouTube Video Summaries & AI Chat on Telegram
Access concise YouTube video summaries and engage in AI-powered Q&A instantly, cutting video research time by over 80%.
Manually sifting through long YouTube videos to find key information is time-consuming and inefficient. This workflow instantly transcribes any YouTube video, generates a comprehensive summary, and allows you to chat with an AI for immediate, precise answers.

Documentation
Automate YouTube Video Summaries & AI Chat on Telegram
This powerful n8n workflow revolutionizes how you consume YouTube content. By integrating OpenAI's GPT-4o-mini with Telegram and Google Docs, it provides instant summaries of any YouTube video and allows for interactive AI-powered Q&A, making video research efficient and insightful.
Key Features
- Instant YouTube Video Summaries: Automatically transcribes YouTube videos and leverages GPT-4o-mini to generate concise, structured summaries.
- Interactive AI Chat: Engage in real-time conversations with an AI assistant about video content, getting precise answers to your specific questions via Telegram.
- Flexible Input Options: Trigger the summarization by simply sending a YouTube URL to your Telegram bot or through a dedicated webhook.
- Persistent Transcript Memory: Video transcripts are stored in Google Docs, allowing the AI chat to maintain context and provide accurate, informed responses over time.
- Seamless Telegram Integration: Receive summaries and AI chat responses directly within your Telegram application for convenient access.
How It Works
This workflow operates in two main modes: summarization and interactive chat. For summarization, it's activated when you provide a YouTube URL via a Telegram message or a webhook. The workflow extracts the video ID, fetches the full transcript, and then processes it using GPT-4o-mini to generate a concise, structured summary. This summary is then delivered back to you on Telegram. For interactive chat, a separate Telegram trigger listens for your messages. Your questions are fed to an AI agent, powered by GPT-4o-mini and LangChain, which retrieves the video transcript (previously stored in a Google Docs document) to provide accurate and context-aware answers. A window buffer memory ensures the AI remembers past conversation context, enabling a fluid chat experience.