Automatic transcription
Speech is turned into accurate, time-coded subtitles using a Whisper-based engine — no manual typing.
Upload a video, get accurate subtitles automatically, fine-tune them in the browser, and download the result with captions burned in — or as a clean SRT/VTT track. No installs, no timeline wrestling.
Runs in your browser · Parallel transcription · Export hard-burned or soft subtitles
A focused workflow that handles transcription, editing, styling and export — without leaving the tab.
Speech is turned into accurate, time-coded subtitles using a Whisper-based engine — no manual typing.
Audio is chopped into segments and transcribed at the same time, so a long video finishes about as fast as a short one.
Review every cue, fix wording, and nudge timings in a clean editor synced to live video playback.
Control font, size, color and placement, then preview exactly how captions will sit on the video.
Hard-burn styled captions into the video with ffmpeg, or export a separate SRT/VTT subtitle track.
Transcribe English, Spanish, German, Romanian, French and Italian out of the box.
Upload, transcribe, edit, export. The heavy lifting runs on a fast serverless pipeline.
Drop in a file. It uploads straight to secure storage — no waiting in a queue.
The audio is split and transcribed in parallel, then merged into clean, time-coded cues.
Polish the text, adjust timings, and style captions to match your brand.
Export the video with captions baked in, or grab a standalone SRT/VTT file.
Burn them in for social platforms, or keep them as a toggleable track for players that support it.
Styled captions are rendered directly into the footage with ffmpeg.
A standalone subtitle file or muxed soft track alongside the video.
Pick a language and let the engine handle the rest.
Because the audio is split into segments and transcribed in parallel, a long video finishes about as fast as a short one — usually minutes, not hours.
English, Spanish, German, Romanian, French and Italian are supported out of the box.
Yes. You can hard-burn styled captions directly into the video, or export a separate soft SRT/VTT subtitle track — whichever fits your platform.
No. Caption Studio runs entirely in your browser. Upload, edit, and export without installing software.
Absolutely. Every cue is editable — fix wording, adjust timings, and restyle captions, all synced to live video playback.
Upload, auto-transcribe, fine-tune, and export — all in the browser.
Open Caption Studio