Video Frame
Extractor
Capture screenshots from any video file. Seek to the perfect moment, grab frames, and download them as PNG or JPG.
Click to upload a video
MP4, WebM, MOV — up to 500MB
How It Works
Upload Video
Select an MP4, WebM, or MOV video file from your device.
Seek & Capture
Use the video controls to find the perfect moment, then capture the frame.
Download
Download individual frames or grab all of them as a ZIP file.
Extract frames from a video online to grab thumbnails, b-roll stills, or freeze frames for slides and articles. ConverterUp's video frame extractor reads MP4, WebM, and MOV files, lets you scrub through the timeline, and saves the current frame as a PNG or JPG. It is handy for journalists pulling stills from press footage, designers capturing reference frames, and content creators preparing thumbnails for YouTube, TikTok, or Reels. The video is decoded in the browser, so source clips and unreleased footage stay on your device throughout the process.
Use cases: thumbnails, b-roll, reaction stills, frame-by-frame editing
YouTube and TikTok thumbnails are the most common case. Scrub to the exact expression or composition you want, export as PNG at full source resolution, then bring into Photoshop or Figma for text overlay. A thumbnail pulled from the source is sharper than one re-extracted from a compressed upload, since you skip the second round of codec loss.
B-roll and editorial stills — journalists, documentary editors, and news producers pull frames from press conference footage or interview clips for use as standalone photographs in articles, social posts, and pitch decks. The frame extractor avoids the awkward screenshot-the-player workflow, which captures UI chrome and reduces resolution.
Frame-by-frame analysis — sports coaches, motion designers, animators, and VFX artists need to see exactly what happens between two motion-blurred frames. Step one frame at a time with arrow keys to inspect a tennis serve at 1/30 s resolution, study a 24 fps animation pose, or grab a hard-to-spot continuity error from a rough cut.
Reaction stills and meme generation — extract the single perfect facial expression from a podcast clip or a film scene to use as a chat reaction, blog header, or shareable image. The freeze-frame is often more usable than the moving clip itself in static contexts (email, PDF, print).
ToolSeo.video-frame-extractor.section1.p5
Codec and keyframe accuracy
Video codecs do not store every frame independently. Most frames are P-frames (predicted from the preceding frame) or B-frames (interpolated between past and future frames). Only I-frames (a.k.a. keyframes) are fully self-contained. To decode any frame, the decoder must walk back to the previous I-frame and replay everything in between.
H.264 typically places an I-frame every 2 seconds (~48–60 frames at 24/30 fps). Seeking is fast and frame-accurate because the GOP (Group of Pictures) is short. H.265 / HEVC uses longer GOPs (10+ seconds) to improve compression; seeking is slower because more frames must be replayed. AV1 can use even longer GOPs and is markedly slower to decode in the browser without hardware acceleration.
ConverterUp uses the MediaSource + requestVideoFrameCallback API to align to the nearest decoded frame, then steps by exact frame duration (1 / fps). This is frame-accurate for H.264 in every modern browser; H.265 support is limited to Safari and to Chrome on macOS with hardware decode. WebM/VP9 works everywhere; AV1 works in Chrome and Firefox.
Variable frame rate (VFR) clips — common in screen recordings, OBS captures, and phone videos with adaptive frame rates — break naive frame-stepping because each frame's duration differs. The extractor reads the per-frame timestamp from the container's presentation time, so VFR clips step correctly without drift. The trade-off is that the stepping cadence is not constant; you may move 33 ms or 50 ms depending on the actual frame timing.
ToolSeo.video-frame-extractor.section2.p5
Batch and interval extraction modes
Interval mode samples one frame every N seconds across the full clip — useful for storyboards, contact sheets, content moderation pre-scans, and ML training-set extraction. Pick 1 fps for a dense storyboard, 0.2 fps (one frame every 5 s) for a thumbnail strip, or 0.033 fps (one frame every 30 s) for long-form podcast clip indexing.
Keyframe-only mode extracts only I-frames, which gives you a natural set of scene-boundary samples for free. The output count is unpredictable (it depends on the encoder's keyframe placement) but the frames are guaranteed to be visually distinct because keyframes typically land on scene cuts in well-encoded video.
Range mode extracts every frame between two timestamps. Use for animation rotoscoping (frame-by-frame paint-over), VFX matte extraction, or short loop creation. Be aware that 1 second of 60 fps source becomes 60 PNG files — a 10-second range is 600 frames, easily hundreds of megabytes. Export as JPG quality 90 if file size matters more than perfect quality.
Output is delivered as a ZIP with sequential filenames (frame_0001.png, frame_0002.png, …) padded to keep alphabetical sort matching temporal order. Filenames embed the source timestamp in milliseconds (frame_0001_t00:01:23.456.png) when the 'include timestamp in filename' toggle is on.
ToolSeo.video-frame-extractor.section3.p5
ToolSeo.video-frame-extractor.section4.heading
ToolSeo.video-frame-extractor.section4.p1
ToolSeo.video-frame-extractor.section4.p2
ToolSeo.video-frame-extractor.section4.p3
ToolSeo.video-frame-extractor.section4.p4
ToolSeo.video-frame-extractor.section4.p5
ToolSeo.video-frame-extractor.section5.heading
ToolSeo.video-frame-extractor.section5.p1
ToolSeo.video-frame-extractor.section5.p2
ToolSeo.video-frame-extractor.section5.p3
ToolSeo.video-frame-extractor.section5.p4
ToolSeo.video-frame-extractor.section5.p5
Frequently asked questions
What is the maximum video size?
Up to 500 MB per file. Larger videos technically work but may take longer to seek because the browser has to demux the container in memory.
How accurate is the frame selection?
Seeking is frame-accurate within the limits of the codec's keyframe interval. Use the arrow keys to step one frame at a time forward or backward for precise selection.
Can I export multiple frames at once?
Yes. Switch to interval mode and the tool will sample frames at a fixed rate (for example one frame per second) and export them as a ZIP archive of PNGs.
Which video formats are supported?
MP4 (H.264, H.265), WebM (VP8, VP9, AV1), and MOV. Output is PNG by default for lossless quality, with JPG available when you need a smaller file.
Does the extracted frame match the encoder's source quality?
It matches the <em>decoded</em> source quality, which is the same picture the player sees. The frame is rendered to a canvas at the video's native resolution before being encoded as PNG (lossless) or JPG (high quality). For a 4K H.264 source, you get a 3840 × 2160 PNG identical to what the codec output — no second pass of compression is applied unless you choose JPG.
Why does my exported frame look slightly blurry compared to the playing video?
Two likely causes. First, motion-blurred frames in 24/30 fps source are <em>inherently</em> blurred — that is how the camera captured motion, not a tool artifact. Second, some video players apply real-time sharpening or upscaling at render time that is not part of the underlying frame. Disable the player's enhancement filters or compare the extracted PNG side-by-side with the raw decoded frame in mpv/VLC.
ToolSeo.video-frame-extractor.q7
ToolSeo.video-frame-extractor.a7
ToolSeo.video-frame-extractor.q8
ToolSeo.video-frame-extractor.a8