MP3 → Stock Video → Auto Subtitles → MP4

MP3 to Video with Subtitles: Automatic & Free

Upload your MP3. We transcribe every word, find stock footage for each sentence, and burn subtitles directly into the video, so your captions are always on, on every platform.

Convert MP3 with Subtitles: Free
Auto-transcription Burned-in captions No watermark No editing
Your MP3: Upload audio
AI Transcription: Vosk speech-to-text
Stock Footage: Pexels per sentence
Subtitle Burn: FFmpeg embed
Final MP4: With captions

How it works: step by step

1. Upload your MP3
Any MP3, WAV, or M4A file. Podcast clips, voice memos, narration recordings, and coaching audio are all supported.
2. Vosk transcribes with timestamps
Our speech-to-text engine assigns a start and end time to every sentence. These timestamps drive both the footage duration and the subtitle timing.
3. Stock clips matched per sentence
Each sentence's text is used as a search query against Pexels. The most relevant clip is downloaded and trimmed to the duration of that sentence.
4. Subtitles burned into the video
FFmpeg bakes the SRT captions permanently into every frame of the final MP4. Viewers need no separate file, so captions always show.
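
For readers curious how steps 2 and 4 might fit together, here is a minimal Python sketch. It is not the service's actual code. It assumes the transcript arrives as a list of word entries with `word`, `start`, and `end` keys (the shape Vosk reports when word timestamps are enabled), and it assumes sentences are delimited by pauses; the 0.6-second threshold and all function names are hypothetical. The FFmpeg command uses the standard `subtitles` video filter, which requires an FFmpeg build with libass.

```python
def srt_timestamp(seconds: float) -> str:
    """Format a time in seconds as an SRT timestamp (HH:MM:SS,mmm)."""
    total_ms = round(seconds * 1000)
    hours, rem = divmod(total_ms, 3_600_000)
    minutes, rem = divmod(rem, 60_000)
    secs, ms = divmod(rem, 1000)
    return f"{hours:02d}:{minutes:02d}:{secs:02d},{ms:03d}"


def group_words(words, max_gap=0.6):
    """Split timestamped words into segments wherever the pause between
    consecutive words exceeds max_gap seconds (an assumed heuristic)."""
    segments, current = [], []
    for word in words:
        if current and word["start"] - current[-1]["end"] > max_gap:
            segments.append(current)
            current = []
        current.append(word)
    if current:
        segments.append(current)
    return [
        {
            "start": seg[0]["start"],
            "end": seg[-1]["end"],
            "text": " ".join(w["word"] for w in seg),
        }
        for seg in segments
    ]


def to_srt(segments) -> str:
    """Render segments as SRT text: index, time range, caption, blank line."""
    blocks = []
    for index, seg in enumerate(segments, start=1):
        time_range = f"{srt_timestamp(seg['start'])} --> {srt_timestamp(seg['end'])}"
        blocks.append(f"{index}\n{time_range}\n{seg['text']}\n")
    return "\n".join(blocks)


def burn_command(video_in: str, srt_path: str, video_out: str) -> list:
    """Build an FFmpeg command that renders the SRT into the video frames.
    Paths with special characters would need escaping for the filter."""
    return [
        "ffmpeg", "-y",
        "-i", video_in,
        "-vf", f"subtitles={srt_path}",
        "-c:a", "copy",  # the audio stream is copied, only video is re-encoded
        video_out,
    ]


words = [
    {"word": "hello", "start": 0.0, "end": 0.4},
    {"word": "world", "start": 0.5, "end": 0.9},
    {"word": "next", "start": 2.0, "end": 2.5},
]
segments = group_words(words)
srt_text = to_srt(segments)
command = burn_command("scenes.mp4", "captions.srt", "final.mp4")
```

The command list could be passed to `subprocess.run(command, check=True)` once `captions.srt` has been written to disk. Because the captions are drawn into the frames, the video stream has to be re-encoded, while the audio can be copied unchanged.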

Why burned-in subtitles matter for social video

85% of social video is watched without sound, so captions are the only way your message lands.
40% more watch-time on videos with captions vs videos without, according to multiple platform studies.
3× more shares on captioned video posts across Instagram and LinkedIn.
100% platform compatibility: burned-in subs work on YouTube, TikTok, Instagram, X, and LinkedIn without importing a file.

MP3 to video with subtitles: FAQs

Are the subtitles generated automatically or do I type them?
Fully automatic. Vosk AI transcribes your MP3 and we generate the SRT file from that. You can review and edit the text before the video is generated if needed.
What language do the subtitles support?
The current transcription model is optimised for English. Other languages may be transcribed less accurately.
Can I get both the video with burned-in subtitles AND a separate SRT file?
Yes. The project dashboard lets you download the full MP4 (with burned-in captions) and the .srt file separately.
What if a subtitle line is wrong?
After transcription you can edit the text of any sentence. The corrected text becomes both the subtitle and the stock footage search query for that scene.
Is there a watermark on the subtitle text or the video?
No. There is no watermark of any kind: not on the video itself, not embedded in the subtitles, not in the corner. Your video is completely clean.
Does this work for music or singing?
Speech-to-text is optimised for spoken word, not singing. For music videos with lyrics, use the text mode and type the lyrics manually as the script.
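
As an illustration of the editing behaviour described in the FAQ above, where one corrected sentence feeds both the caption and the footage search, here is a small Python sketch. It is an assumption about how such a step could look, not the service's implementation. The function names and the scene dictionary are hypothetical; the URL is the Pexels video search endpoint, which expects the API key in an `Authorization` header when the request is actually sent.

```python
from urllib.parse import urlencode

# Pexels video search endpoint; a real request also needs an API key
# sent in the "Authorization" header.
PEXELS_VIDEO_SEARCH = "https://api.pexels.com/videos/search"


def apply_edit(sentence: dict, corrected_text: str) -> dict:
    """Return a copy of the sentence with corrected text; timing is unchanged."""
    return {**sentence, "text": corrected_text.strip()}


def build_scene(sentence: dict) -> dict:
    """Derive the caption, the footage search URL, and the clip length
    from a single timestamped sentence."""
    query = urlencode({"query": sentence["text"], "per_page": 1})
    return {
        "subtitle": sentence["text"],
        "search_url": f"{PEXELS_VIDEO_SEARCH}?{query}",
        "duration": round(sentence["end"] - sentence["start"], 3),
    }


transcribed = {"text": "a come ocean", "start": 1.0, "end": 4.2}
corrected = apply_edit(transcribed, "a calm ocean")
scene = build_scene(corrected)
```

Because the scene is rebuilt from the corrected sentence, fixing a transcription error changes the caption and the footage query together, while the original timestamps keep the clip aligned with the audio.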

Related tools

MP3 to Video Audio to Video Add Subtitles to Video Free Subtitle Generator Voice to Video

Ready to create your first video?

No credit card required. Generate your first reel in under 5 minutes.

Start for Free