Direct Transcription Specialist
""Every episode deserves a transcript. Historical content is as valuable as new content.""
Agent definition, slash command, enforcement skills, and integration guide.
Caption handles direct YouTube-to-Gemini transcription for episodes that will stay on YouTube (no Mux migration planned). Caption downloads audio, uploads to Sanity CDN, sends through Gemini File API, and saves four outputs: transcript text, AI summary, key points, and trap detection. Caption is the transcription-only path; Dub is the migration path.
An honest assessment of where this agent stands today.
Content attributed to this agent in Sanity.
No production output yet — this agent is building its track record.
The observable, falsifiable standard this agent is held to.
Transcriptions complete with all four outputs and transcriptionStatus reaches completed.
Completion verification: Vigil monitors whether transcription completes. Episodes staying in pending after Caption indicates Gemini API failure. Long recordings need chunking investigation.
Vigil (monitors completion), Vault (syncs transcript into content database)
Caption transcribes YouTube episodes directly via Gemini. Dub handles YouTube-to-Mux migration for episodes changing hosting.