feat: Add LlamaParse + Supabase integration for document extraction #1

Open
Yvette-0508 wants to merge 1 commit into main from dev

@Yvette-0508
Owner

  • Integrated LlamaParse for multi-modal document parsing (PDF, DOCX, XLSX, HTML, images)
  • Added Supabase cloud storage for extraction results
  • Created universal test script (test_extractor.py) with --store flag
  • Added support for audio transcription via OpenAI Whisper
  • Added chunking with table and formula extraction from Markdown
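
To illustrate the last item, here is a minimal sketch of chunking parsed Markdown while pulling out tables and formulas. It is not the code from this PR: the names `Chunk`, `chunk_markdown`, `extract_tables`, and `extract_formulas` are hypothetical, and it assumes tables are pipe-delimited rows and formulas are `$$ ... $$` blocks, which may differ from what the parser actually emits.

```python
import re
from dataclasses import dataclass, field

# Hypothetical helpers for illustration only; not taken from the PR.
TABLE_ROW = re.compile(r"^\s*\|.*\|\s*$")                 # assumed: pipe-delimited table rows
BLOCK_FORMULA = re.compile(r"\$\$(.+?)\$\$", re.DOTALL)   # assumed: $$ ... $$ display math


@dataclass
class Chunk:
    text: str
    tables: list[str] = field(default_factory=list)
    formulas: list[str] = field(default_factory=list)


def extract_tables(markdown: str) -> list[str]:
    """Collect each run of consecutive pipe-delimited lines as one table."""
    tables, current = [], []
    for line in markdown.splitlines():
        if TABLE_ROW.match(line):
            current.append(line.strip())
        elif current:
            tables.append("\n".join(current))
            current = []
    if current:
        tables.append("\n".join(current))
    return tables


def extract_formulas(markdown: str) -> list[str]:
    """Return the contents of every $$ ... $$ block, without the delimiters."""
    return [m.strip() for m in BLOCK_FORMULA.findall(markdown)]


def chunk_markdown(markdown: str, max_chars: int = 500) -> list[Chunk]:
    """Greedily pack blank-line-separated blocks into chunks.

    A single block longer than max_chars is kept whole, so a table
    or formula is never split across two chunks.
    """
    blocks = [b.strip() for b in re.split(r"\n\s*\n", markdown) if b.strip()]
    texts, buffer = [], ""
    for block in blocks:
        candidate = f"{buffer}\n\n{block}" if buffer else block
        if buffer and len(candidate) > max_chars:
            texts.append(buffer)
            buffer = block
        else:
            buffer = candidate
    if buffer:
        texts.append(buffer)
    return [
        Chunk(text=t, tables=extract_tables(t), formulas=extract_formulas(t))
        for t in texts
    ]
```

Splitting on blank lines before packing is one simple way to keep a table or formula intact inside a single chunk; a production chunker would likely also need token-based limits and overlap between chunks.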
