Captions & Transcripts
Upload your video. Captions, transcripts, and chapters — done.
VideoNest Audio automatically transcribes every upload, generating synchronized WebVTT captions, word-level JSON, and searchable plain text. Add chapter markers to help viewers navigate. No manual work, no third-party tools.
Horizontal · 16:9 · captions auto-generated
Transcript formats
Two formats. One transcription.
From a single audio pass, VideoNest generates a caption-ready WebVTT file and a word-level JSON transcript. The examples below are from this video.
WEBVTT

1
00:00:00.000 --> 00:00:02.560
The number one, I believe, sold for a million bucks.

2
00:00:10.560 --> 00:00:14.080
This is the National Stock and Bonds Show in Washington, D.C.
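For readers who want to work with the caption file directly, here is a minimal Python sketch of reading WebVTT cues into start, end, and text values. It is not VideoNest code: the names `parse_vtt`, `to_seconds`, and `CUE_TIMING` are made up for this illustration, and it assumes every timestamp uses the full `hh:mm:ss.mmm` form shown in the sample (the WebVTT format also allows the hours to be omitted).

```python
import re

# Matches a cue timing line such as "00:00:00.000 --> 00:00:02.560".
# Assumption: hours are always present, as in the sample above.
CUE_TIMING = re.compile(
    r"(\d{2}):(\d{2}):(\d{2})\.(\d{3})\s+-->\s+(\d{2}):(\d{2}):(\d{2})\.(\d{3})"
)


def to_seconds(hours, minutes, seconds, millis):
    """Convert the four timestamp fields to seconds as a float."""
    return int(hours) * 3600 + int(minutes) * 60 + int(seconds) + int(millis) / 1000


def parse_vtt(text):
    """Return a list of (start_seconds, end_seconds, caption_text) tuples."""
    cues = []
    # Blocks are separated by blank lines; the WEBVTT header block has no
    # timing line, so it is skipped automatically.
    for block in re.split(r"\n\s*\n", text.strip()):
        lines = block.strip().splitlines()
        for i, line in enumerate(lines):
            match = CUE_TIMING.search(line)
            if match:
                fields = match.groups()
                start = to_seconds(*fields[:4])
                end = to_seconds(*fields[4:])
                # Everything after the timing line is the caption text.
                cues.append((start, end, " ".join(lines[i + 1:])))
                break
    return cues


SAMPLE = """WEBVTT

1
00:00:00.000 --> 00:00:02.560
The number one, I believe, sold for a million bucks.

2
00:00:10.560 --> 00:00:14.080
This is the National Stock and Bonds Show in Washington, D.C.
"""

cues = parse_vtt(SAMPLE)
for start, end, caption in cues:
    print(f"{start:7.2f} to {end:7.2f}  {caption}")
```

For anything beyond a quick script, a dedicated WebVTT parser is the safer choice, since the format also supports cue settings, notes, and styling blocks that this sketch ignores.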
{
  "video_id": 454084,
  "model": "nvidia/parakeet-tdt-0.6b-v3",
  "language": "en",
  "words": [
    { "word": "The", "start": 0.00, "end": 0.16, "confidence": 0.98 },
    { "word": "number", "start": 0.16, "end": 0.48, "confidence": 0.97 },
    { "word": "one,", "start": 0.48, "end": 0.80, "confidence": 0.99 }
    // ... one entry per spoken word
  ]
}
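As a rough illustration of what the word-level JSON makes possible, the Python sketch below rebuilds plain text from the `words` array and flags entries whose `confidence` falls below a threshold. The field names come from the sample above; the helper names `plain_text` and `low_confidence` and the 0.98 threshold are assumptions chosen for the example. The sample is trimmed to the three words shown, because the `// ...` placeholder line is not valid JSON.

```python
import json

# The three entries shown in the sample, without the "// ..." placeholder.
SAMPLE = """
{
  "video_id": 454084,
  "model": "nvidia/parakeet-tdt-0.6b-v3",
  "language": "en",
  "words": [
    {"word": "The", "start": 0.00, "end": 0.16, "confidence": 0.98},
    {"word": "number", "start": 0.16, "end": 0.48, "confidence": 0.97},
    {"word": "one,", "start": 0.48, "end": 0.80, "confidence": 0.99}
  ]
}
"""


def plain_text(transcript):
    """Join the word entries back into a single string."""
    return " ".join(entry["word"] for entry in transcript["words"])


def low_confidence(transcript, threshold):
    """Return the word entries whose confidence is below the threshold."""
    return [e for e in transcript["words"] if e["confidence"] < threshold]


transcript = json.loads(SAMPLE)
text = plain_text(transcript)
flagged = low_confidence(transcript, 0.98)  # hypothetical review threshold

print(text)
print([entry["word"] for entry in flagged])
```

Flagging low-confidence words this way could feed a manual review pass, though a sensible threshold depends on the audio and is something to tune rather than assume.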
Capabilities
Built for every viewer, every context
Transcribed on upload
VideoNest Audio runs on every hosted video. WebVTT, JSON, and plain text are generated automatically. No manual file upload required.
Chapter markers
Add chapter markers to any video so viewers can jump directly to the section they want. Set titles and timestamps in your video settings — the player handles the rest.
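To make the idea concrete, the Python sketch below shows one way a list of chapter titles and start times could be mapped to a playback position. It is an illustration only, not a description of how the VideoNest player works: the chapter titles, the timestamps, and the `chapter_at` helper are all hypothetical.

```python
from bisect import bisect_right

# Hypothetical chapters as (start_seconds, title), sorted by start time.
CHAPTERS = [
    (0.0, "Intro"),
    (10.56, "On the show floor"),
    (45.0, "Closing"),
]


def chapter_at(chapters, position):
    """Return the title of the chapter containing position, or None."""
    starts = [start for start, _ in chapters]
    # bisect_right finds the first chapter that starts after position;
    # the chapter just before it is the one currently playing.
    index = bisect_right(starts, position) - 1
    return chapters[index][1] if index >= 0 else None


current = chapter_at(CHAPTERS, 12.0)
print(current)
```

Keeping the list sorted by start time is what lets the lookup use a binary search instead of scanning every chapter.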
Multiple caption tracks
Attach multiple caption files per video. Viewers choose their language from the player controls. No separate video files needed.
WCAG 2.1 AA controls
Player controls are keyboard-navigable and screen-reader compatible. Meets WCAG 2.1 AA accessibility standards at the player level.
Auto-on captions
Configure captions to display by default — for social embeds, news feeds, and mobile placements where viewers watch without audio.
Searchable transcripts
Word-level JSON transcripts make every second of your video searchable. Index content, power in-site search, and surface video at the right moment.
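A minimal Python sketch of phrase search over word-level entries might look like the following. It assumes entries with the `word` and `start` fields shown in the sample transcript; `normalize` and `find_phrase` are hypothetical helper names, and the matching is a simple exact comparison after lowercasing and stripping punctuation.

```python
import string


def normalize(token):
    """Lowercase a token and strip leading and trailing punctuation."""
    return token.strip(string.punctuation).lower()


def find_phrase(words, phrase):
    """Return the start time in seconds of each occurrence of phrase."""
    target = [normalize(token) for token in phrase.split()]
    tokens = [normalize(entry["word"]) for entry in words]
    hits = []
    if not target:
        return hits
    # Slide a window the length of the phrase across the transcript.
    for i in range(len(tokens) - len(target) + 1):
        if tokens[i:i + len(target)] == target:
            hits.append(words[i]["start"])
    return hits


# The three entries from the sample transcript.
WORDS = [
    {"word": "The", "start": 0.00, "end": 0.16, "confidence": 0.98},
    {"word": "number", "start": 0.16, "end": 0.48, "confidence": 0.97},
    {"word": "one,", "start": 0.48, "end": 0.80, "confidence": 0.99},
]

hits = find_phrase(WORDS, "Number one")
print(hits)
```

Each returned time could be used as a seek target so a search result opens the video at the matching moment. A production index would more likely use a search engine than a linear scan.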
More player features
More ways to control the viewer experience
Autoplay & Sound
Set when and how video plays. Muted autoplay by default, sound-on for approved CTV and interstitial placements.
Learn more →
Branding Controls
Add your logo, set player colors, and remove platform watermarks. Fully white-labeled on Business plans.
Learn more →
Embed Options
One snippet for any CMS: WordPress, Webflow, Squarespace, or custom HTML. Fully responsive with per-placement overrides.
Learn more →
Player Customization
Control colors, control bar layout, and playback behavior to match your brand at every placement.
Learn more →
Player API
Hook into playback events programmatically. Build custom controls, triggers, and integrations on top of the player.
Learn more →
Vertical Video Player
Native 9:16 player format for mobile-first content, social embeds, and short-form video placements.
Learn more →