🚀 LLMInventory

Note: I currently have API keys for OpenAI, Anthropic, and Google providers. If you'd like to help test and support other providers (xAI, Mistral), please consider buying me a coffee to help with API costs! ☕

A comprehensive Python library for managing and accessing multiple Large Language Model (LLM) APIs through a unified interface. Support for 40 models from 5 major AI providers.

✨ Features

40 AI Models from major providers (OpenAI, Anthropic, Google, xAI, Mistral)
Unified API Interface - Use any model with the same code structure
Comprehensive Model Database with capabilities, pricing, and context windows
Type-Safe Configuration with parameter validation
Extensible Architecture - Easy to add new providers
Rich Model Metadata including capabilities, pricing, and technical specs
Embedding Support - Text embeddings for semantic search
Multimodal Capabilities - Text, vision, audio, and code generation

🎯 Supported Providers & Models

🟢 OpenAI (6 models) ✅ Tested

GPT-4o - Most advanced multimodal model
GPT-4o Mini - Fast and cost-effective
GPT-4 Turbo - Vision capabilities, 128k context
GPT-4.1, GPT-4.1 Mini - 1M+ context window
DALL-E 3 - Image generation

🟠 Anthropic (3 models) ✅ Tested

Claude 3.5 Sonnet - Latest with computer use capabilities
Claude 3.5 Haiku - Fast and efficient
Claude 3 Haiku - Cost-effective option

🔴 Google (19 models) ✅ Tested

Gemini 2.0 Flash - Latest experimental model
Gemini 2.5 Flash - Enhanced capabilities
Gemini 1.5 Flash - Fast multimodal (multiple variants)
Text Embedding 004 - Advanced embeddings
Note: gemini-1.5-pro requires paid tier access

⚫ xAI (3 models) ❌ No API Key - Need Support!

Grok-2 - Advanced reasoning with real-time data
Grok-2 Mini - Faster version
Grok Beta - Next-generation model

🟡 Mistral (9 models) ❌ No API Key - Need Support!

Mistral Large - Most capable model (multiple versions)
Codestral - Code generation specialist
Pixtral - Multimodal with vision
Ministral - Compact edge deployment
Mistral Embed - Text embeddings

📊 Current Test Status

Last Updated: 2025-06-29 00:21:37

Total Models: 40
Working Models: 26 (65.0% success rate)
Providers with API Keys: 3/5

Provider Status:

OpenAI: ✅ Working
Anthropic: ✅ Working
Google: ✅ Working
xAI: ❌ No API Key
Mistral: ❌ No API Key

🛠️ Installation

# Clone the repository
git clone https://github.com/yourusername/LLMInventory.git
cd LLMInventory

# Install dependencies
pip install -r requirements.txt

# Set up your API keys
cp secrets.example.yaml secrets.yaml
# Edit secrets.yaml with your API keys

🔑 API Keys Setup

Edit secrets.yaml with your API keys:

openai:
  api_key: "sk-your-openai-key-here"

anthropic:
  api_key: "sk-ant-your-anthropic-key-here"

google:
  api_key: "your-google-ai-api-key-here"

xai:
  api_key: "your-xai-grok-key-here"

mistral:
  api_key: "your-mistral-api-key-here"

🚀 Quick Start

Basic Usage

from pathlib import Path
from src.llminventory import LLMInventory

# Initialize the inventory
inventory = LLMInventory(
    configs_dir=Path("configs"),
    secrets_file=Path("secrets.yaml")
)

# List all available models
models = inventory.get_supported_models()
print(f"Available models: {len(models)}")

# Use any model with the same interface
response = inventory.invoke(
    provider="openai",
    model="gpt-4o",
    payload={"messages": [{"role": "user", "content": "Hello!"}]},
    parameters={"temperature": 0.7, "max_tokens": 100}
)

print(response)

Text Generation Examples

# OpenAI GPT-4o
response = inventory.invoke(
    provider="openai",
    model="gpt-4o",
    payload={"messages": [{"role": "user", "content": "Explain quantum computing"}]},
    parameters={"temperature": 0.7, "max_tokens": 200}
)

# Anthropic Claude
response = inventory.invoke(
    provider="anthropic",
    model="claude-3-5-sonnet-20241022",
    payload={"messages": [{"role": "user", "content": "Write a Python function"}]},
    parameters={"temperature": 0.3, "max_tokens": 500}
)

# Google Gemini
response = inventory.invoke(
    provider="google",
    model="gemini-2.0-flash",
    payload={
        "contents": [{
            "role": "user",
            "parts": [{"text": "Analyze this data"}]
        }]
    },
    parameters={"temperature": 0.5, "maxOutputTokens": 300}
)

Embedding Usage

# Google Text Embeddings
embedding_response = inventory.invoke(
    provider="google",
    model="text-embedding-004",
    payload={"input": "Your text to embed here"},
    parameters={"dimensions": 768}
)

# Extract embedding vector
embedding_vector = embedding_response['embedding']['values']
print(f"Embedding dimensions: {len(embedding_vector)}")

📊 Model Information

Each model includes comprehensive metadata:

Capabilities: text, vision, audio, reasoning, code, embeddings
Context Window: Maximum input tokens
Max Output: Maximum output tokens
Pricing: Cost per million tokens (where available)
Parameters: Supported configuration options

# Get detailed model information
for model in inventory.get_supported_models():
    print(f"Model: {model['provider']}/{model['model']}")
    print(f"Description: {model['description']}")
    print(f"Capabilities: {model.get('capabilities', [])}")
    print(f"Context Window: {model.get('context_window', 'N/A')}")
    print(f"Pricing: {model.get('pricing', 'N/A')}")
    print("---")

🏗️ Architecture

src/llminventory/
├── __init__.py              # Main LLMInventory class
├── inventory.py             # High-level interface
├── model_config_manager.py  # Configuration management
├── secret_manager.py        # API key management
└── adapters/                # Provider-specific adapters
    ├── base_adapter.py      # Base adapter class
    ├── openai_adapter.py    # OpenAI API adapter
    ├── anthropic_adapter.py # Anthropic API adapter
    ├── google_adapter.py    # Google AI adapter
    ├── xai_adapter.py       # xAI Grok adapter
    └── mistral_adapter.py   # Mistral AI adapter

🧪 Testing

# Run comprehensive tests
python comprehensive_model_test.py

# Test specific providers
python test_single_provider.py

# Run unit tests
python -m pytest tests/

📝 Configuration

Models are configured in supported_models.yaml. Each model includes:

- provider: openai
  model: gpt-4o
  endpoint: https://api.openai.com/v1/chat/completions
  description: Most advanced GPT-4 model with multimodal capabilities
  capabilities: [text, vision, audio]
  context_window: 128000
  max_output: 16384
  required_fields: [messages]
  parameters:
    temperature:
      type: float
      default: 0.7
      description: Controls randomness in generation
    max_tokens:
      type: integer
      default: 4096
      description: Maximum tokens to generate
  pricing:
    input_cost_per_1m_tokens: 2.5
    output_cost_per_1m_tokens: 10.0

🔧 Adding New Providers

Create a new adapter in src/llminventory/adapters/
Inherit from BaseAdapter
Implement the invoke method
Register in adapters/__init__.py
Add models to supported_models.yaml

📋 Requirements

Python 3.8+
requests
PyYAML
google-generativeai (for Google models)
anthropic (for Claude models)
openai (for OpenAI models)

🤝 Contributing

Fork the repository
Create a feature branch (git checkout -b feature/amazing-feature)
Commit your changes (git commit -m 'Add amazing feature')
Push to the branch (git push origin feature/amazing-feature)
Open a Pull Request

☕ Support the Project

If you find this project helpful and would like to support development and testing of additional providers, consider buying me a coffee! Your support helps cover API costs for testing new models and providers.

Current funding needs:

xAI Grok API testing
Mistral API testing
Additional model testing
Documentation improvements

📄 License

This project is licensed under the MIT License - see the LICENSE file for details.

🙏 Acknowledgments

OpenAI for GPT models
Anthropic for Claude models
Google for Gemini models
xAI for Grok models
Mistral AI for Mistral models

📞 Support

If you have questions or need help:

Check the documentation
Open an issue
Read the comprehensive test results

Made with ❤️ for the AI community

Name		Name	Last commit message	Last commit date
Latest commit History 4 Commits
.github/workflows		.github/workflows
configs		configs
scripts		scripts
src		src
tests		tests
.gitignore		.gitignore
LICENSE		LICENSE
MODELS.md		MODELS.md
README.md		README.md
TODO.md		TODO.md
api_monitoring_config.yaml		api_monitoring_config.yaml
comprehensive_test_results.json		comprehensive_test_results.json
database_consistency_report.json		database_consistency_report.json
example_direct_usage.py		example_direct_usage.py
generate_model_list.py		generate_model_list.py
github_upload_commands.txt		github_upload_commands.txt
interactive_test.py		interactive_test.py
main.py		main.py
requirements.txt		requirements.txt
secrets.example.yaml		secrets.example.yaml
setup_api_keys.py		setup_api_keys.py
supported_models.json		supported_models.json
supported_models.yaml		supported_models.yaml
test_all_providers.py		test_all_providers.py
test_fixed_models.py		test_fixed_models.py
test_results.json		test_results.json
test_results_anthropic.json		test_results_anthropic.json
test_results_comprehensive.json		test_results_comprehensive.json
test_results_final.json		test_results_final.json
test_results_google.json		test_results_google.json
test_results_openai.json		test_results_openai.json
test_single_provider.py		test_single_provider.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

🚀 LLMInventory

✨ Features

🎯 Supported Providers & Models

🟢 OpenAI (6 models) ✅ Tested

🟠 Anthropic (3 models) ✅ Tested

🔴 Google (19 models) ✅ Tested

⚫ xAI (3 models) ❌ No API Key - Need Support!

🟡 Mistral (9 models) ❌ No API Key - Need Support!

📊 Current Test Status

Provider Status:

🛠️ Installation

🔑 API Keys Setup

🚀 Quick Start

Basic Usage

Text Generation Examples

Embedding Usage

📊 Model Information

🏗️ Architecture

🧪 Testing

📝 Configuration

🔧 Adding New Providers

📋 Requirements

🤝 Contributing

☕ Support the Project

📄 License

🙏 Acknowledgments

📞 Support

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

🚀 LLMInventory

✨ Features

🎯 Supported Providers & Models

🟢 OpenAI (6 models) ✅ Tested

🟠 Anthropic (3 models) ✅ Tested

🔴 Google (19 models) ✅ Tested

⚫ xAI (3 models) ❌ No API Key - Need Support!

🟡 Mistral (9 models) ❌ No API Key - Need Support!

📊 Current Test Status

Provider Status:

🛠️ Installation

🔑 API Keys Setup

🚀 Quick Start

Basic Usage

Text Generation Examples

Embedding Usage

📊 Model Information

🏗️ Architecture

🧪 Testing

📝 Configuration

🔧 Adding New Providers

📋 Requirements

🤝 Contributing

☕ Support the Project

📄 License

🙏 Acknowledgments

📞 Support

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages