Skip to content
@doublewordai

doublewordai

Popular repositories Loading

  1. control-layer control-layer Public

    The world’s fastest AI model gateway (450x less overhead than LiteLLM). Unified access to LLMs across endpoints (openAI, self-hosted, etc.) behind a single authentication layer - with API key gener…

    Rust 48 4

  2. deepseek-reddit-agent deepseek-reddit-agent Public

    An example notebook which shows how you can build a LLM agent that scrapes information from Reddit and summarize key bullets using a self-hosted DeepSeek-R1-Distill-Llama-8B deployed with Titan Tak…

    Jupyter Notebook 11 2

  3. autobatcher autobatcher Public

    Drop-in AsyncOpenAI replacement that transparently batches requests

    Python 8 1

  4. zerodp zerodp Public

    ZeroDP implements an efficient zero-copy data parallel approach for serving Mixture-of-Experts (MoE) models, where expert weights are shared across data parallel ranks via CUDA IPC (Inter-Process C…

    Python 3 1

  5. inference-stack inference-stack Public

    The Doubleword Inference Stack is the easiest & most performant way to run genAI infrastructure in your private environment.

    Go Template 2

  6. outlet outlet Public

    A high-performance Axum middleware for capturing and correlating HTTP requests and responses with full streaming support.

    Rust 2

Repositories

Showing 10 of 38 repositories
  • control-layer Public

    The world’s fastest AI model gateway (450x less overhead than LiteLLM). Unified access to LLMs across endpoints (openAI, self-hosted, etc.) behind a single authentication layer - with API key generation, user management, request logging, and more

    doublewordai/control-layer’s past year of commit activity
    Rust 48 Apache-2.0 4 21 8 Updated Feb 18, 2026
  • batchbench Public
    doublewordai/batchbench’s past year of commit activity
    Rust 0 0 0 1 Updated Feb 18, 2026
  • shenron-configs Public

    Public Shenron release configs

    doublewordai/shenron-configs’s past year of commit activity
    0 0 0 0 Updated Feb 18, 2026
  • onwards Public

    A router for openAI compatible endpoints

    doublewordai/onwards’s past year of commit activity
    Rust 1 MIT 1 2 3 Updated Feb 18, 2026
  • fusillade Public

    Batched LLM request processing daemon with efficient request coalescing and per-model concurrency control

    doublewordai/fusillade’s past year of commit activity
    Rust 1 0 3 2 Updated Feb 18, 2026
  • control-layer-chart Public

    A Helm chart for the Doubleword control layer

    doublewordai/control-layer-chart’s past year of commit activity
    Go Template 0 Apache-2.0 0 1 2 Updated Feb 17, 2026
  • inference-stack Public

    The Doubleword Inference Stack is the easiest & most performant way to run genAI infrastructure in your private environment.

    doublewordai/inference-stack’s past year of commit activity
    Go Template 2 MIT 0 0 3 Updated Feb 17, 2026
  • documentation Public

    Developer documentation for DoubleWord products

    doublewordai/documentation’s past year of commit activity
    TypeScript 0 0 0 0 Updated Feb 17, 2026
  • inference-lab-web Public

    Interactive web interface for the Inference Lab LLM simulator

    doublewordai/inference-lab-web’s past year of commit activity
    TypeScript 0 MIT 0 1 9 Updated Feb 16, 2026
  • arbiter Public

    DeBERTa inference server for sequence classification

    doublewordai/arbiter’s past year of commit activity
    Rust 0 0 2 10 Updated Feb 16, 2026

People

This organization has no public members. You must be a member to see who’s a part of this organization.

Most used topics

Loading…