Argus · AI Engineering

Fine-tune Llama 3 8B on your data · deploy on Vertex AI · wrap with FastAPI

What You Get

Domain-specific fine-tuned Llama 3 8B via PEFT/LoRA
Google Vertex AI deployment — scalable, production-ready endpoint
FastAPI wrapper with auth, rate-limit, and OpenAPI docs
LangChain RAG pipeline (STANDARD+) — retrieval over your documents
Full source code + Dockerfile + deployment guide

Sample Output

POST /v1/chat -> 200 OK  model: argus-llama3-8b-finetuned-v1  latency: 312ms

Stack

How to Order

Order on Fiverr - Argus Intelligence · From $180 · 48h delivery

Name		Name	Last commit message	Last commit date
Latest commit History 3 Commits
README.md		README.md
inference.py		inference.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Argus · AI Engineering

What You Get

Sample Output

Stack

How to Order

License

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Argus · AI Engineering

What You Get

Sample Output

Stack

How to Order

License

About

Topics

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages