
YouTube MCP Servers in 2026: Put Your Videos Inside Claude, ChatGPT & Cursor
The best YouTube MCP servers in 2026, compared — from on-demand transcript fetchers (kimtaeyoon83, ZubeidHendricks, ergut) to a persistent, searchable library (Noverload). How to set one up in Claude Desktop, and why persistence beats re-fetching.
You found a YouTube MCP server, added five lines to your Claude Desktop config, pasted a video URL, and got a summary. It worked. Then you closed the chat — and the summary was gone. Next week you paste the same video again, because Claude has no memory that you ever watched it.
That is the ceiling every YouTube MCP server hits today: they fetch one transcript, on demand, into one conversation. Nothing persists. Nothing is searchable later. Ask "what did that video say about pricing?" a month from now and you are starting from a blank URL bar.
This guide covers what a YouTube MCP server actually is, the options that exist in 2026, how to set one up, and the one architectural difference that decides whether your videos become a searchable library or evaporate after each chat.
What Is a YouTube MCP Server?
MCP (Model Context Protocol) is an open standard from Anthropic that lets an AI assistant — Claude Desktop, Claude Code, Cursor, Windsurf, and a growing list of clients — call external tools directly, without copy-paste. A YouTube MCP server is one of those tools: it gives your AI the ability to reach into YouTube and pull back a video's transcript, metadata, or captions so the model can summarize or analyze it.
Before MCP, summarizing a YouTube video with Claude meant a manual dance: open the video, click "Show transcript," toggle off timestamps, copy the wall of text, paste it into a new chat, and prompt. A YouTube MCP server collapses that into one instruction — "summarize this video: [URL]" — and the server fetches the transcript behind the scenes.
That is a real upgrade. But most YouTube MCP servers stop there, and "there" has a hidden cost.
On-Demand vs. Persistent: The Distinction That Matters
There are two fundamentally different shapes of YouTube MCP server, and the SERP does not make the difference obvious because they all describe themselves the same way.
On-demand transcript fetchers are the majority. You give them a URL, they return the transcript for that one video, for that one conversation. The moment the chat ends, the knowledge is gone — the server stored nothing. Watch fifty videos this way and you have fifty dead conversations, not a library. These are the GitHub repos that dominate the search results, and for a one-off summary they are genuinely fine.
Persistent knowledge servers flip the model. You save a video once; it is transcribed, summarized, and embedded into a searchable store. From then on, every AI client you connect can query the whole collection: "compare what those three founder interviews said about distribution," "find the video where someone explained the pricing framework," "pull every action item across everything I saved this month." The video stops being a URL you re-fetch and becomes part of a memory your AI can reason over.
If you only ever need to summarize the video currently in front of you, an on-demand fetcher is enough. If you watch to learn — and want those learnings to compound instead of scroll away — you need the persistent kind.
The Best YouTube MCP Servers in 2026
Here is what we evaluated each option on:
- Persistence: Does it remember videos after the chat ends, or fetch fresh each time?
- Cross-video search: Can it search across everything you have watched, or one video at a time?
- Multi-source: YouTube only, or also articles, X, Reddit, and PDFs?
- Setup: One-click OAuth, or a config file plus an API key?
- Client portability: Works across Claude, ChatGPT, and Cursor — or tied to one?
1. Noverload — Best for a persistent, searchable video library
Noverload is the only option here built around persistence. You save a YouTube URL once and get an AI summary, the full transcript, key takeaways, and extracted action items — then it stays, embedded and searchable alongside every other thing you have saved. Its MCP integration exposes that whole library to your AI: cross-content search, synthesis across videos, and concept extraction, not just single-video retrieval.
- Persistence: Yes — every saved video is stored and searchable forever
- Cross-video search: Yes — semantic search and synthesis across your entire library
- Multi-source: YouTube, X, Reddit, articles, and PDFs in one place
- Setup: One-click OAuth from /mcp, no API key, no config file
- Portability: MCP-standard — reachable from Claude Desktop, Cursor, Claude Code (ChatGPT MCP support rolling out)
Best for: anyone who watches educational YouTube and wants it to become a second brain their AI can query — not a stack of one-off summaries.
2. kimtaeyoon83/mcp-server-youtube-transcript — Best pure transcript fetcher
The most popular on-demand option. A lightweight npx install that pulls captions from any video, including auto-generated ones, with language selection. No API key required.
- Persistence: No — fetches fresh each time
- Cross-video search: No — single video per call
- Multi-source: No — YouTube transcripts only
- Setup: Config file,
npx -y @kimtaeyoon83/mcp-server-youtube-transcript
Best for: developers who want a dead-simple, free way to drop one transcript into Claude on demand.
3. ZubeidHendricks/youtube-mcp-server — Best for channel and video metadata
A fuller-featured server that goes beyond transcripts to video statistics, channel data, and video listings. More capable, but it requires a Google/YouTube Data API key and setup.
- Persistence: No
- Cross-video search: Within a channel via the API, not across your own saved set
- Multi-source: No — YouTube only
- Setup: Config file plus a YouTube Data API v3 key
Best for: building automations that query YouTube channel data, not personal knowledge.
4. ergut/youtube-transcript-mcp — Best zero-install remote fetcher
A hosted, remote transcript server — no local install, works from mobile. Still on-demand: it returns a transcript per request and stores nothing for you.
- Persistence: No
- Cross-video search: No
- Multi-source: No
- Setup: Point Claude Desktop at the hosted SSE endpoint — zero local dependencies
Best for: quick transcript extraction on any device without installing anything.
5. Smithery "Advanced YouTube" / ytmcp.com — Best for in-transcript search
Directory-hosted servers that add search within a channel's transcripts — ask what a creator said about a topic and get timestamped quotes. Powerful for research on a specific creator, but scoped to live YouTube, not your saved history.
- Persistence: No — searches YouTube live, not a store you own
- Cross-video search: Within a creator's public catalog, not your personal library
- Multi-source: No
- Setup: Via the hosting directory (Smithery), some require a YouTube API key
Best for: researching one creator's back catalog, not building your own knowledge base.
Save a YouTube video and query it inside Claude
Paste any YouTube URL — get an AI summary, full transcript, and action items, then reach it from Claude via MCP. No account needed to try.
10 free saves · No credit card required
How to Set Up a YouTube MCP Server in Claude Desktop
For an on-demand fetcher, you edit your claude_desktop_config.json (Settings → Developer → Edit Config) and add the server:
{
"mcpServers": {
"youtube-transcript": {
"command": "npx",
"args": ["-y", "@kimtaeyoon83/mcp-server-youtube-transcript"]
}
}
}
Restart Claude Desktop, look for the tools icon, and paste a URL with "get the transcript and summarize the main points." That is the whole flow for a single video — and the whole flow, every time, because nothing is saved.
For a persistent library with Noverload, there is no config file. You connect once through /mcp with one-click OAuth, and your entire saved collection becomes available to Claude, Cursor, and Claude Code. Full walkthrough in Introducing Noverload MCP. After that, saving a video anywhere — web, extension, share sheet — adds it to the library your AI can already reach.
Why Persistence Beats Re-Fetching
The friction most people hit is not "I cannot get a transcript into Claude." The on-demand servers solved that. The friction is that the value evaporates: you summarize a video, act on nothing, and a month later you cannot find the one insight you wanted because it lived in a chat you closed.
A YouTube video sits in a watch-later list for six months. You finally watch it, summarize it once in Claude, and the summary dies with the conversation. That is the bookmark graveyard problem wearing a new coat — and re-fetching the same transcript on demand does not fix it, because there is still no memory layer underneath.
Persistence is what turns individual summaries into leverage. When every video you have watched is embedded and searchable, your AI can do things no on-demand fetcher can: surface the three talks that disagree on a point, connect a framework from one video to a note in an article, or answer a question from across a year of watching without you remembering where you saw it.
Beyond YouTube: One Library, Every Source
The other limit of dedicated YouTube MCP servers is in the name. Your learning is not YouTube-only — it is scattered across X threads, Reddit discussions, long articles, and PDFs. A YouTube-only server leaves four-fifths of your knowledge stranded in other tools.
Noverload treats YouTube as one input into a single searchable layer. The same MCP tools that query your saved videos also reach your saved articles, threads, and documents — so your AI answers from everything you have consumed, not just the videos. For the broader landscape of MCP servers built for personal knowledge, see The 7 Best MCP Servers for Personal Knowledge in 2026. For non-MCP ways to summarize video, see The Best AI YouTube Summarizers in 2026.
FAQ
What is a YouTube MCP server? An MCP-compatible tool that lets an AI assistant fetch a YouTube video's transcript or metadata directly, so it can summarize or analyze the video without manual copy-paste.
Do I need a YouTube API key? For pure transcript fetchers like kimtaeyoon83's, no. For metadata-heavy servers like ZubeidHendricks', yes — a YouTube Data API v3 key. Noverload needs neither; you connect with one-click OAuth.
Can Claude summarize a YouTube video without MCP? Yes, but manually — open the transcript, copy it, paste it into a chat, and prompt. A YouTube MCP server automates the fetch. A persistent server also keeps the result.
Will these work with ChatGPT and Cursor? MCP is supported in Claude Desktop, Claude Code, Cursor, and Windsurf today; ChatGPT MCP support is rolling out. A standard MCP server like Noverload is reachable from any compliant client.
What is the difference between a transcript MCP and Noverload? A transcript MCP fetches one video per request and stores nothing. Noverload saves each video permanently — summarized, embedded, and searchable across your whole library and every source you save.
The Bottom Line
If all you need is to drop the video in front of you into Claude, install an on-demand transcript server — kimtaeyoon83's is the simplest, and it is free.
But if you watch YouTube to learn, the on-demand model quietly fails you: every summary dies with its chat, and you rebuild from scratch each time. The fix is a YouTube MCP server with a memory — one that saves each video once and makes your entire watch history queryable by your AI, forever.
That is what Noverload is built for: not fetching one transcript, but turning everything you watch, read, and save into a library your AI can reason over.
Want to see it work? Try Noverload free — 10 saves/month, no credit card. Then connect it to Claude in one click from /mcp.
Tags
Ready to build your second brain?
Start saving content and give your AI the context it needs.