This repository was archived by the owner on Aug 22, 2025. It is now read-only.

Preparing Data with Data Prep Kit

Overview

In this section, we will process PDF documents and save them into a vector database.

Here is the workflow:

Step-1: Setup Python Dev Environment

conda create -n rag-1 -y python=3.11
conda activate rag-1
pip install -r requirements.txt
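
Before moving on to Step-2, it can help to confirm that the activated environment really uses the Python version requested above. The following is a minimal sketch, not part of the original repository; the helper name `check_python_version` is hypothetical and was introduced only for illustration.

```python
import sys


def check_python_version(version_info=sys.version_info, expected=(3, 11)):
    """Return True when the major.minor version matches `expected`.

    `check_python_version` is a hypothetical helper, not part of this repo.
    """
    return tuple(version_info[:2]) == tuple(expected)


if __name__ == "__main__":
    found = f"{sys.version_info.major}.{sys.version_info.minor}"
    if check_python_version():
        print(f"OK: running Python {found}")
    else:
        print(f"Warning: expected Python 3.11, found {found}")
```

If the warning appears, the rag-1 environment is probably not active; running `conda activate rag-1` again and repeating the check is a reasonable first step.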

Step-2: Process Data

Run notebook: process_data.ipynb