Here are some example prompts you can use to test and demonstrate the Formula 1 RAG Agent's retrieval capabilities:
- "How did Ayrton Senna perform in the 1991 Monaco Grand Prix?"
- "Which constructor dominated the 2004 Formula 1 season and why?"
- "Give me a summary of Lewis Hamilton’s 2018 season, including key races he won."
This project is a fully functional Retrieval Augmented Generation (RAG) agent built using Google Cloud Vertex AI and the Agent Development Kit (ADK).
The agent is powered by a custom Formula 1 corpus containing 1,800+ documents.
- Set up a Google Cloud project, fixed SDK conflicts, and authenticated using Application Default Credentials.
- Installed Google Cloud SDK cleanly and configured the correct project, region, and zone.
- Creatd a corpus named
Formula1 - Created a GCS bucket (
f1-corpus-8474-eu) to store all Formula 1 documents. - Extracted data from the public F1 dataset (1950-2020): https://www.kaggle.com/datasets/rohanrao/formula-1-world-championship-1950-2024
- Generated a comprehensive
f1_manifest.txtfile containing 1,825+ GCS file paths. - Built a bootstrap ingestion script that imports files in safe batches of 200.
- Successfully ingested 1,825+ Formula 1 documents into a RAG corpus named
Formula1. - Integrated RAG tools into a single ADK agent.
- Connected the agent to the ADK Web UI for live querying.
- Verified ingestion using
verify_corpusand executed retrieval-based F1 Q&A. - Cleaned the project structure (removed unused folders) and prepared it for GitHub.
This repository contains a Vertex AI-powered Retrieval Augmented Generation (RAG) Agent implemented using the Google Agent Development Kit (ADK).
- Google Cloud account with billing enabled
- Google Cloud project with Vertex AI API enabled
- Python 3.9+
- Google Cloud SDK installed
- Permissions for Vertex AI RAG
- ADC configured locally
Follow: https://cloud.google.com/sdk/docs/install
gcloud initgcloud auth application-default logingcloud auth list
gcloud config listgcloud services enable aiplatform.googleapis.com- Ask natural-language questions
- Retrieves relevant text chunks
- Generates grounded answers
- Lists all RAG corpora in your Google Cloud project
- Creates an empty RAG corpus
- Supports GCS URLs, Google Drive links, Google Docs URLs
- Shows file count, metadata, timestamps
- Permanently deletes a corpus