Skip to content

Add inference benchmarks and model capabilities to README#17

Merged
arpannookala-12 merged 3 commits intomainfrom
docs/inference-benchmarks
Mar 25, 2026
Merged

Add inference benchmarks and model capabilities to README#17
arpannookala-12 merged 3 commits intomainfrom
docs/inference-benchmarks

Conversation

@arpannookala-12
Copy link
Copy Markdown
Collaborator

@arpannookala-12 arpannookala-12 commented Mar 25, 2026

Summary

  • Add benchmark results table comparing vLLM local, Intel OPEA Enterprise Inference, and OpenAI cloud tiers across the full FinSights document analysis pipeline
  • Add model capability tables for Llama 3.2 3B Instruct, BAAI/bge-base-en-v1.5, text-embedding-3-small, and GPT-4o-mini
  • Add side-by-side comparison summary
  • Fix trivy-action version from 0.24.0 to 0.35.0 (unresolvable version breaking CI)

Test plan

  • Verify README renders correctly on GitHub
  • Verify CI passes with updated trivy-action version

Add benchmark results comparing vLLM local, Intel OPEA Enterprise
Inference, and OpenAI cloud tiers across the full FinSights document
analysis pipeline. Include model capability tables for Llama 3.2 3B
Instruct, BAAI/bge-base-en-v1.5, text-embedding-3-small, and
GPT-4o-mini with a side-by-side comparison summary.
Fix CI failure caused by unresolvable trivy-action version 0.24.0.
Fix CVE-2026-31938 (CRITICAL, XSS) and CVE-2026-31898 (HIGH,
arbitrary code execution) flagged by Trivy scan.
@arpannookala-12 arpannookala-12 merged commit fc7f919 into main Mar 25, 2026
2 checks passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant