Skip to content

Video import PR2a: ScrapeCreators client (platform detect + TikTok)#87

Merged
windoze95 merged 1 commit into
mainfrom
feat/video-scrapecreators-client
Jun 30, 2026
Merged

Video import PR2a: ScrapeCreators client (platform detect + TikTok)#87
windoze95 merged 1 commit into
mainfrom
feat/video-scrapecreators-client

Conversation

@windoze95

Copy link
Copy Markdown
Owner

What

PR 2a of the premium video-link import pipeline (foundation merged in #86). New internal/video package that acquires a video's caption, transcript, and downloadable media URL via the ScrapeCreators API.

  • DetectPlatform — classifies TikTok / Instagram / YouTube / Facebook / Pinterest URLs by host.
  • ScrapeCreatorsClient.FetchVideo → normalized VideoMeta{VideoID, Caption, Transcript, MediaURL, DurationMS}. Auth via x-api-key. TikTok implemented and verified against the live API (/v2/tiktok/video with get_transcript=trueaweme_detail.desc, .video.download_no_watermark_addr with play_addr fallback, bundled transcript). The other four platforms are stubbed (not yet supported) for fast-follow — same client shape.
  • An httpDo request seam keeps it fully unit-tested offline (no network, no key).

Tests

DetectPlatform (all 5 platforms + unsupported/garbage), TikTok parse (no-watermark + play_addr fallback), unsupported-platform + HTTP-error paths. go test ./... -count=1 -> green (12 packages, 0 failures).

Next

  • PR2b: ffmpeg frame sampling + Dockerfile -> debian-slim.
  • PR2c: async job + POST /v1/recipes/import/video + poll endpoint, fanning VideoMeta (frames + transcript) into ExtractRecipesFromMedia, with the premium gate, cache, metering, and daily-budget kill switch.

🤖 Generated with Claude Code

https://claude.ai/code/session_01BU4UWZutHd1AnK3XAf7H19

PR2a of the video-link import pipeline. New internal/video package:
- DetectPlatform classifies TikTok/Instagram/YouTube/Facebook/Pinterest URLs.
- ScrapeCreatorsClient.FetchVideo resolves a URL to normalized VideoMeta
  (video ID, caption, transcript, no-watermark media URL, duration) via the
  ScrapeCreators API (x-api-key). TikTok implemented + verified against the
  live response; other platforms stubbed for follow-up.
- httpDo request seam keeps it fully unit-tested offline.

Next: PR2b ffmpeg frame sampling + Dockerfile; PR2c orchestration.

Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com>
Claude-Session: https://claude.ai/code/session_01BU4UWZutHd1AnK3XAf7H19
@chatgpt-codex-connector

Copy link
Copy Markdown

You have reached your Codex usage limits for code reviews. You can see your limits in the Codex usage dashboard.

@windoze95 windoze95 merged commit 62bc454 into main Jun 30, 2026
1 check passed
@windoze95 windoze95 deleted the feat/video-scrapecreators-client branch June 30, 2026 02:52
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant