Skip to content

Argus-Tech-Solutions/argus-ai-engineering

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

3 Commits
 
 
 
 

Repository files navigation

Argus · AI Engineering

Fine-tune Llama 3 8B on your data · deploy on Vertex AI · wrap with FastAPI

Status Delivery License

What You Get

  • Domain-specific fine-tuned Llama 3 8B via PEFT/LoRA
  • Google Vertex AI deployment — scalable, production-ready endpoint
  • FastAPI wrapper with auth, rate-limit, and OpenAPI docs
  • LangChain RAG pipeline (STANDARD+) — retrieval over your documents
  • Full source code + Dockerfile + deployment guide

Sample Output

POST /v1/chat -> 200 OK  model: argus-llama3-8b-finetuned-v1  latency: 312ms

Stack

Python Llama_3_8B PEFT_LoRA Vertex_AI FastAPI

How to Order

Order on Fiverr - Argus Intelligence · From $180 · 48h delivery

License

MIT © Argus Intelligence · marcosantcs

About

Fine-tune Llama 3 8B + Vertex AI deployment + FastAPI wrapper in 48h

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors

Languages