Add GitHub Action for automatic ReEDS index update (Handled in https://github.com/Yunzhi-Chen/ReEDS-GPT-Index/actions instead)#64
Closed
Yunzhi-Chen wants to merge 1 commit into
Closed
Conversation
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Add this suggestion to a batch that can be applied as a single commit.This suggestion is invalid because no changes were made to the code.Suggestions cannot be applied while the pull request is closed.Suggestions cannot be applied while viewing a subset of changes.Only one suggestion per line can be applied in a batch.Add this suggestion to a batch that can be applied as a single commit.Applying suggestions on deleted lines is not supported.You must change the existing code in this line in order to create a valid suggestion.Outdated suggestions cannot be applied.This suggestion has been applied or marked resolved.Suggestions cannot be applied from pending reviews.Suggestions cannot be applied on multi-line comments.Suggestions cannot be applied while the pull request is queued to merge.Suggestion cannot be applied right now. Please check back later.
Summary
Add a GitHub Actions workflow that automatically rebuilds the FAISS vector index used by ReEDS-GPT whenever code changes are merged to
main. This keeps the AI knowledge base in sync with the latest ReEDS codebase without manual intervention.Technical details
The workflow builds a FAISS vector index from all text-like files in the repo (
.py,.gms,.md,.csv,.yaml, etc.), using Google Gemini embeddings (gemini-embedding-001), and uploads the result to the existing GCS bucket (gs://reeds-agent-index) that powers the ReEDS-GPT ChatGPT assistant.Implementation notes
main: downloads the previous index from GCS, compares file signatures (size + mtime) viamanifest.json, and only re-embeds changed/new files. Cost: <$0.10 per run.git-restore-mtimeso the incremental skip logic works correctly in CI.torchorsentence-transformers).Additional changes
None. No changes to ReEDS model code, inputs, or outputs.
Issues resolved
Resolves the manual index rebuild process described in the future development list (item: "Index ReEDS repo to improve AI chat accuracy; automate the indexing process so it updates together with ReEDS-GPT").
Validation, testing, and comparison report(s)
ast.parse()syntax validationyaml.safe_load()validationmain— pushing to feature branches has no effectGCP_SA_KEY,GOOGLE_API_KEY) must be configured in repo Settings → Secrets before the workflow will succeedChecklist for author
Details to double-check
Charge code provided to reviewersIncluded comparison reports for appropriate test casesDocumentation updated if necessaryIf input data added/modifiedGeneral information to guide review
Did you use LLM tools (chatbot or copilot) in the preparation of this PR? If so, describe how
Yes. GitHub Copilot (Claude) was used to write both the build script and the workflow YAML.
Tag points of contact here if you would like additional review of the relevant parts of the model