Video import PR2b: ffmpeg frame sampling + ffmpeg in Docker image by windoze95 · Pull Request #88 · windoze95/saltybytes

windoze95 · 2026-06-30T03:04:11Z

What

PR 2b of the premium video-link import pipeline. Adds frame extraction and the one infra change (ffmpeg in the runtime image).

FrameSampler (internal/video/frames.go) — downloads the video's media URL and samples up to MaxFrames (default 30) JPEG frames via ffmpeg. Scene-change detection (select='gt(scene,0.2)') catches quick on-screen text and flashy cuts; falls back to even time-sampling across the full duration when a clip has too few cuts. Frames scaled to 640px (keeps Claude image input at standard resolution → avoids the ~3× high-res token cost). The downloaded source video is always deleted before returning — no source media retained. Exec + download seams → fully unit-tested offline.
Dockerfile — runtime base distroless → debian:12-slim + ffmpeg + ca-certificates. Validated with a local docker build (ffmpeg 5.1 present, image assembles & runs).

Cost control

The frame cap is the master cost dial (~$0.0048/frame on Sonnet, $0.0016 on Haiku). 30 frames ≈ $0.14 Sonnet for the AI step; standard-res scaling keeps it there.

Tests

computeFPS clamping, scene-vs-even selection, frame cap, download-error and no-frames paths (all via seams). go test ./... -count=1 → green (12 packages). Local docker build confirms the image builds with ffmpeg.

PR2c: async job + POST /v1/recipes/import/video + poll endpoint, fanning VideoMeta (frames + transcript + caption) into ExtractRecipesFromMedia, with the premium gate, VideoExtractionCache, cost metering, and the daily-budget kill switch — plus the ECS key + a real end-to-end TikTok test.

🤖 Generated with Claude Code

https://claude.ai/code/session_01BU4UWZutHd1AnK3XAf7H19

PR2b of the video-link import pipeline. - internal/video: FrameSampler downloads a video's media URL and samples up to MaxFrames (default 30) JPEG frames via ffmpeg -- scene-change detection (to catch quick on-screen text / flashy cuts) with even time-sampling fallback. Frames scaled to 640px (standard-res tokens); the source video is always deleted (never retained). Exec + download test seams keep it fully unit-tested offline. - Dockerfile: runtime base distroless -> debian:12-slim + ffmpeg + ca-certificates (validated with a local docker build; ffmpeg 5.1 present). The frame cap is the master cost dial (~$0.0048/frame on Sonnet). Next: PR2c orchestration (job + endpoints + gate/cache/metering/kill-switch). Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com> Claude-Session: https://claude.ai/code/session_01BU4UWZutHd1AnK3XAf7H19

chatgpt-codex-connector · 2026-06-30T03:04:15Z

You have reached your Codex usage limits for code reviews. You can see your limits in the Codex usage dashboard.

windoze95 merged commit 6fae57a into main Jun 30, 2026
1 check passed

windoze95 deleted the feat/video-frame-sampling branch June 30, 2026 03:05

windoze95 mentioned this pull request Jun 30, 2026

Assemble video-link import: async pipeline, premium gate, cost controls #89

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Video import PR2b: ffmpeg frame sampling + ffmpeg in Docker image#88

Video import PR2b: ffmpeg frame sampling + ffmpeg in Docker image#88
windoze95 merged 1 commit into
mainfrom
feat/video-frame-sampling

windoze95 commented Jun 30, 2026

Uh oh!

chatgpt-codex-connector Bot commented Jun 30, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

windoze95 commented Jun 30, 2026

What

Cost control

Tests

Next

Uh oh!

chatgpt-codex-connector Bot commented Jun 30, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant