To add captions to a demo video, you export a transcript from your narration, time it into a subtitle file (SRT or VTT), then either burn the text directly into the video or attach it as a toggleable track. The whole process takes a few minutes once you have a clean script, and it pays off immediately: most people watch product videos with the sound off, so captions are often the only way your message lands.
Captions are no longer a nice-to-have for product demos. They lift watch time, make your content accessible, and help search engines understand what your video is about. Below is a practical walkthrough of why they matter and exactly how to add them quickly.
Why captions lift engagement on demo videos
The behavior data is consistent across platforms: a large share of social and embedded video plays with the sound muted by default. On a feed-based platform, autoplay is silent. On a landing page, a visitor may be in an open office or a quiet room and never unmute. If your demo relies on narration alone, that audience gets nothing.
Captions change the math in a few concrete ways:
- They keep muted viewers watching. When text tracks the on-screen action, a silent viewer still follows the story and stays past the first few seconds.
- They improve comprehension. Product names, feature labels, and technical terms are easier to absorb when viewers read them, not just hear them.
- They widen your reach. Roughly 1 in 6 adults has some degree of hearing loss, and captions make your demo usable for them and for non-native speakers.
- They feed SEO and discovery. A subtitle file is machine-readable text. Search engines and video platforms use it to index your content, which helps the right people find your demo.
For a SaaS demo specifically, the payoff is sharper. Your viewer is often evaluating whether your product solves their problem in the first 10 seconds. Captions remove the "I'll watch it later with sound" excuse and let the value land instantly.
Burned-in captions vs. soft subtitle tracks
Before you add anything, decide which format fits where the video will live. There are two approaches, and they solve different problems.
Burned-in (open) captions
Burned-in captions are rendered directly into the video pixels. They are always visible and cannot be turned off. Use them when:
- The video autoplays muted on social feeds (LinkedIn, X, Instagram, TikTok).
- You embed the demo on a landing page where you cannot rely on a player's caption controls.
- You want full control over font, color, and placement for brand consistency.
The downside: they are permanent. If you find a typo, you re-export.
Soft (closed) captions
Soft captions live in a separate file (.srt or .vtt) that the player loads alongside the video. Viewers toggle them on or off. Use them when:
- You host on YouTube, Vimeo, or Wistia, which support uploaded caption files.
- You want viewers to choose their language or turn text off.
- You may need to edit or translate captions later without touching the video.
A common best practice: burn captions in for social clips, and use soft tracks for hosted, on-page, and YouTube versions. You can generate both from the same source transcript.
How to add captions to a demo video quickly
Here is the fastest reliable workflow, step by step.
- Start with an accurate script. Captions are only as good as the transcript behind them. If your demo was built from a written narration script, you already have the text and can skip transcription entirely. This is where a tool like InstaDemo helps: it turns a website URL into a narrated demo video and generates the narration script as part of the process, so you have clean, ready-to-caption text from the start instead of running speech-to-text on a recording.
- Generate a timed subtitle file. Take your transcript and align it to the video timeline. Most editors and platforms auto-generate timings from audio; you then nudge the in/out points so each line appears as the speaker says it. Export as
.srtfor broad compatibility or.vttif you need styling. - Clean up the timing and line length. Keep each caption to one or two lines, roughly 32 to 42 characters per line, and on screen for at least one second. Break lines at natural pauses, not mid-phrase.
- Choose burn-in or soft track. For social, burn the captions in during export. For hosted players, upload the
.srt/.vttfile alongside the video. - Review on mute. Play the finished video with the sound off, on a phone-sized screen. If you can follow the demo without audio, the captions are doing their job.
A minimal .srt file looks like this:
1
00:00:00,000 --> 00:00:03,200
Paste any website URL to start.
2
00:00:03,200 --> 00:00:06,500
InstaDemo crawls the page and writes the narration.
The number is the cue index, the timestamps are start and end, and the text follows. You can hand-edit this in any text editor if an auto-generated file is close but not perfect.
Styling captions so they stay readable
Good captions disappear into the experience. Bad ones fight the video. A few rules keep them legible across screen sizes:
- Use a clean sans-serif at a size that is readable on mobile. Test at the smallest screen your audience uses.
- Add a background or outline. A semi-transparent black box or a thin text stroke keeps white text readable over bright UI screenshots.
- Position in the lower third, but raise captions if they would cover an important button or menu in the demo.
- Limit on-screen text. Two lines maximum. If a sentence is long, split it across two cues.
- Match your brand sparingly. A consistent font and accent color is fine; avoid heavy colors or animation that distract from the product.
Avoid these common caption mistakes
A few errors quietly undercut otherwise solid demos:
- Auto-captions left unedited. Speech-to-text gets product names and jargon wrong. Always proofread, especially your own brand and feature names.
- Captions that lag the action. If text appears after the relevant screen, viewers lose the connection. Tighten the timing.
- Walls of text. Dumping a full sentence on screen at once forces speed-reading. Chunk it.
- Forgetting the social cut. A hosted demo with soft captions still needs burned-in text when you repost the clip to a muted feed.
Conclusion
Captions are one of the highest-leverage edits you can make to a product demo. They keep muted viewers engaged, make your content accessible, and give search engines clean text to index, all from a transcript you likely already have. The fastest path is to start from an accurate script, generate a timed .srt or .vtt, then burn it in for social and attach a soft track everywhere else.
If you would rather skip the recording and transcription steps entirely, try InstaDemo: paste a URL and it produces a narrated demo video with a ready-made script you can caption in minutes.