SIPHON

Open-source, low-latency Voice AI.
No markups. No lock-in. No middlemen.

Built for teams who want full control over their calling AI stack,
from infrastructure to data to cost.

⭐ Drop a star to help us grow!

What Siphon is

Siphon is a Python framework that handles the hard parts of real-time voice AI:

✅ SIP + telephony integration — Connect to any SIP trunk (Twilio, Telnyx, SignalWire, etc.)
✅ Streaming audio pipelines — Sub-500ms latency powered by WebRTC (LiveKit)
✅ Interruptions & barge-in — Natural conversation flow with configurable turn detection
✅ Agent state management — Recording, transcription, metadata persistence
✅ Horizontal scaling — Run 1 or 1,000 workers with zero-config load balancing

So you can focus on agent behavior, not call plumbing.

You bring:

🤖 Your LLM (OpenAI, Anthropic, Google, DeepSeek, Groq, Cerebras, Mistral, etc.)
🎤 Your STT/TTS providers (Deepgram, Cartesia, ElevenLabs, AssemblyAI, Sarvam, etc.)
📞 Your SIP trunk (Twilio, Telnyx, SignalWire, or self-hosted)
☁️ Your infrastructure (LiveKit Cloud or self-hosted)

You keep:

💰 Your margins — No per-minute markup on AI provider costs
🔒 Your data — Runs on your infrastructure, all logs stay with you
📊 Your observability — Complete control over recording, transcription, metadata
🔑 Your keys — Direct integration with AI providers, no middleman

What Siphon is not

❌ Not a SaaS platform — You host it, you control it
❌ Not a black box — Open-source (Apache 2.0), inspect and modify everything
❌ Not a per-minute tax — No markup on your AI provider costs
❌ Not vendor lock-in — Swap LLM/STT/TTS providers with a config change

Why Siphon exists

Voice agents listen to everything.

Your customers' calls contain sensitive information — personal details, business data, private conversations.

Traditional managed platforms route every call through their infrastructure. You pay per minute and trust them with your data.

Siphon runs on your infrastructure.
You own the keys. You control the data. You keep the margins.

Production-Ready Architecture

⚡ Low Latency: Powered by WebRTC (LiveKit) for sub-500ms voice interactions that feel like real human conversation.
🛡️ Production Ready: Handles the chaotic reality of phone networks, including audio packet loss, SIP signaling, and interruptions.
🚀 Infinite Scale: Define your agent once and run it on 1 or 1,000 servers. It balances the load automatically.

Quick Start

If you're new to Siphon, we recommend checking out:

📖 Documentation
⚡ Quick Start Guide

1. Install

pip install siphon-ai

2. Configure Environment

Siphon requires LiveKit for real-time media and API keys for your AI providers.

Create a .env file:

# LiveKit (Cloud: https://cloud.livekit.io/ or Self-hosted)
LIVEKIT_URL=...
LIVEKIT_API_KEY=...
LIVEKIT_API_SECRET=...

# AI Providers
OPENAI_API_KEY=...
DEEPGRAM_API_KEY=...
CARTESIA_API_KEY=...
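
Before starting a worker, it can help to fail fast when a value from the .env file is missing. The snippet below is a minimal sketch and not part of Siphon: `missing_env_vars` is a hypothetical helper, and `REQUIRED_VARS` simply mirrors the example above, so adjust it to the providers you actually use.

```python
import os

# Names taken from the example .env above (assumption: all six are needed
# for the OpenAI + Deepgram + Cartesia stack used in this Quick Start).
REQUIRED_VARS = [
    "LIVEKIT_URL",
    "LIVEKIT_API_KEY",
    "LIVEKIT_API_SECRET",
    "OPENAI_API_KEY",
    "DEEPGRAM_API_KEY",
    "CARTESIA_API_KEY",
]


def missing_env_vars(required, env=None):
    """Return the names in `required` that are unset or blank in `env`."""
    env = os.environ if env is None else env
    return [name for name in required if not env.get(name, "").strip()]
```

Calling `missing_env_vars(REQUIRED_VARS)` after `load_dotenv()` returns an empty list when everything is set, or the names that still need a value.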

3. Create your Agent

Create a file named agent.py. This simple agent acts as a helpful assistant.

from siphon.agent import Agent
from siphon.plugins import openai, cartesia, deepgram
from dotenv import load_dotenv

load_dotenv()

# Initialize your AI stack
llm = openai.LLM()
tts = cartesia.TTS()
stt = deepgram.STT()

# Define the Agent
agent = Agent(
    agent_name="Receptionist",
    llm=llm,
    tts=tts,
    stt=stt,
    system_instructions="You are a helpful receptionist. Answer succinctly.",
)

if __name__ == "__main__":
    # One-time setup: downloads required files (only needed on fresh machines)
    agent.download_files()

    # Start the agent worker in development mode
    agent.dev()

    # Start the agent worker in production mode
    # agent.start()

For more details on configuring your Agent (latency, interruptions, VAD, etc.) and exploring available Plugins (Deepgram, Cartesia, OpenAI, ElevenLabs, etc.), check out the documentation.

4. Run

Start your agent worker.

python agent.py

Horizontal Scaling: To scale, run this same command on multiple servers. The worker architecture automatically detects new nodes and balances the load with zero configuration. Learn more about Scaling.

Capabilities

📞 Receive Calls (Inbound)

Bind a phone number to your agent using a Dispatch rule.

import os
from siphon.telephony.inbound import Dispatch
from dotenv import load_dotenv

load_dotenv()

dispatch = Dispatch(
    dispatch_name="customer-support",
    agent_name="Receptionist", # Must match the name in agent.py
    sip_trunk_id=os.getenv("SIP_TRUNK_ID"),
    # Or: sip_number=os.getenv("SIP_NUMBER"),
)
dispatch.agent()

Note: For more details, check out the Inbound Documentation. To configure numbers with providers like Twilio, see the Twilio Setup Guide.
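
Because `os.getenv` returns `None` when a variable is unset, a missing `SIP_TRUNK_ID` would be passed through silently. The sketch below shows one way to choose between the two options from the example. `dispatch_target` is a hypothetical helper, and it assumes `Dispatch` accepts either `sip_trunk_id` or `sip_number` on its own, as the commented-out line suggests.

```python
import os


def dispatch_target(env=None):
    """Pick the keyword argument that identifies the inbound trunk or number.

    Prefers SIP_TRUNK_ID and falls back to SIP_NUMBER, mirroring the two
    options shown in the Dispatch example. Raises if neither is set.
    """
    env = os.environ if env is None else env
    trunk_id = env.get("SIP_TRUNK_ID", "").strip()
    number = env.get("SIP_NUMBER", "").strip()
    if trunk_id:
        return {"sip_trunk_id": trunk_id}
    if number:
        return {"sip_number": number}
    raise ValueError("Set SIP_TRUNK_ID or SIP_NUMBER before creating a Dispatch")
```

The returned dictionary could then be unpacked into the constructor, for example `Dispatch(dispatch_name="customer-support", agent_name="Receptionist", **dispatch_target())`.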

📱 Make Calls (Outbound)

Trigger calls programmatically from your code or API.

import os
from siphon.telephony.outbound import Call
from dotenv import load_dotenv

load_dotenv()

call = Call(
    agent_name="Receptionist", # Must match the name in agent.py
    sip_trunk_setup={ ... }, # Your SIP credentials
    # Or: sip_trunk_id=os.getenv("SIP_TRUNK_ID"),
    number_to_call="+15550199"
)
call.start()

Note: For more details, check out the Outbound Documentation. To configure trunks with providers like Twilio, see the Twilio Setup Guide.
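
The example number starts with "+", which suggests an E.164-style format. Whether Siphon or your SIP trunk strictly requires E.164 is an assumption here, so treat the following as an optional pre-check rather than a documented requirement. `normalize_number` is a hypothetical helper, not part of Siphon.

```python
import re

# E.164 shape: a leading "+", then 1 to 15 digits, the first of which is not 0.
_E164 = re.compile(r"\+[1-9][0-9]{0,14}")


def normalize_number(raw):
    """Strip common separators and return the number if it looks like E.164."""
    cleaned = re.sub(r"[\s\-().]", "", raw)
    if not _E164.fullmatch(cleaned):
        raise ValueError(f"Not an E.164-style number: {raw!r}")
    return cleaned
```

The cleaned value can be passed as `number_to_call`, so a malformed number fails in your own code before a call is attempted.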

💾 Persist Call Data

Siphon enables call recordings, transcriptions, and metadata persistence via environment variables.

# Enable saving features
CALL_RECORDING=true
SAVE_METADATA=true
SAVE_TRANSCRIPTION=true

# Configure storage location (locally, S3, Redis, Postgres, etc)
METADATA_LOCATION=Metadata # saves locally
TRANSCRIPTION_LOCATION=postgresql://..... # saves to postgresql

# Configure S3 (Call Recordings are always saved to S3)
AWS_S3_ENDPOINT=
AWS_S3_ACCESS_KEY_ID=
AWS_S3_SECRET_ACCESS_KEY=
AWS_S3_BUCKET=
AWS_S3_REGION=
AWS_S3_FORCE_PATH_STYLE=true

Note: Siphon supports multiple storage backends. For detailed configuration instructions, see the Call Data Documentation.
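
Since call recordings are always saved to S3, enabling `CALL_RECORDING` without the S3 settings is an easy misconfiguration. The sketch below checks for it up front. `recording_config_problems` is a hypothetical helper, and two things are assumptions: that `true` is the value that enables recording, and that all five listed S3 variables are needed (your S3-compatible provider may not need every one).

```python
import os

# Variable names copied from the example above. AWS_S3_FORCE_PATH_STYLE is
# left out on the assumption that it is an optional toggle.
S3_VARS = [
    "AWS_S3_ENDPOINT",
    "AWS_S3_ACCESS_KEY_ID",
    "AWS_S3_SECRET_ACCESS_KEY",
    "AWS_S3_BUCKET",
    "AWS_S3_REGION",
]


def recording_config_problems(env=None):
    """Return the S3 variables that are blank while CALL_RECORDING is enabled."""
    env = os.environ if env is None else env
    if env.get("CALL_RECORDING", "").strip().lower() != "true":
        return []  # recording is off, so nothing to check
    return [name for name in S3_VARS if not env.get(name, "").strip()]
```

An empty list means recording is either disabled or has every S3 value it needs under these assumptions.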

🚀 Examples and demo

More Examples

Example	Description
A 24/7 AI Dental Receptionist in a few lines	A fully functional AI receptionist that handles appointment booking, modifications, and cancellations with Google Calendar integration.

More examples are coming, so stay tuned 👀!

📖 Documentation

For detailed documentation, including a Quickstart Guide, visit the Siphon Documentation.

🤝 Contributing

We love contributions from the community ❤️. For details on contributing or running the project for development, check out our Contributing Guide.

Support us

We are constantly improving, and more features and examples are coming soon. If you love this project, please drop us a star ⭐ on the GitHub repo to stay tuned and help us grow.

License

Siphon is Apache 2.0 licensed.
