Now supporting 5 AI providers

Your phone line,
AI on the call

SIP4AI is a VOIP softphone where AI is on the other end, not a human. Self-host it, register a SIP line, and let AI take the calls. Works with your existing phone system — just add a line.

Scroll

Everything you need
for voice AI

Full SIP protocol support, multiple AI providers, function calling, and a complete HTTP API for programmatic control.

Inbound & Outbound

Answer incoming calls automatically or make outbound calls with task-based objectives. Full control over caller ID and duration limits.

Function Calling

AI can call your webhooks mid-conversation. Book appointments, check orders, or transfer calls — all through simple HTTP endpoints.

Hold Audio

Play custom audio while executing webhooks. Keep callers engaged with professional hold messages during function processing.

Call Transfers

Blind and attended transfers via SIP REFER. Hand off to human agents seamlessly with consultation mode support.

REST API

Full HTTP API for programmatic control. Initiate calls, monitor status, inject context mid-conversation, and receive webhook events.

Native Audio

μ-law 8kHz native support with ElevenLabs and Cartesia. Full codec conversion for all providers including PCMU, PCMA, and G.729.

Five AI providers,
one interface

Choose the provider that fits your needs. From OpenAI's advanced reasoning to ElevenLabs' native μ-law support.

AI

OpenAI Realtime

GPT-4o powered realtime conversations with advanced reasoning capabilities. Built-in function calling and server-side VAD with configurable thresholds.

PCM16 @ 24kHz audio quality
6 voice options: shimmer, alloy, echo, fable, onyx, nova
Server-side VAD with adjustable silence detection
Configuration
{
  "provider": "openai",
  "openai": {
    "instructions": "You are a helpful AI...",
    "voice": "shimmer",
    "turn_detection": {
      "type": "server_vad",
      "threshold": 0.7,
      "silence_duration_ms": 1000
    }
  }
}
11

ElevenLabs

Native μ-law 8kHz support means zero audio conversion — the lowest possible latency. Configure your agent in the ElevenLabs dashboard with client-side tools.

Native μ-law @ 8kHz — no conversion needed
Excellent latency for real-time conversations
Client-side tools with conversation override
Configuration
{
  "provider": "elevenlabs",
  "elevenlabs": {
    "agent_id": "your-agent-id",
    "first_message": "Hello! How can I help?",
    "system_prompt": "You are a helpful..."
  }
}
G

Google Gemini Live

Google's Gemini 2.0 Flash with built-in function calling. Five distinct voices to choose from, perfect for Google ecosystem integration.

PCM16 @ 16kHz in, 24kHz out
Voices: Aoede, Charon, Fenrir, Kore, Puck
Generous free tier for development
Configuration
{
  "provider": "gemini",
  "gemini": {
    "model": "models/gemini-2.0-flash-exp",
    "voice": "Kore",
    "first_message": "Hello!",
    "system_prompt": "You are a helpful..."
  }
}
DG

Deepgram Voice Agent

Flexible component selection with Nova-3 STT and configurable LLM. 13 Aura-2 voices available with excellent latency characteristics.

Nova-3 or Nova-2 STT models
13 Aura-2 voices for TTS
Configurable LLM (GPT-4o-mini default)
Configuration
{
  "provider": "deepgram",
  "deepgram": {
    "model": "aura-2-thalia-en",
    "listen_model": "nova-3",
    "think_provider": "open_ai",
    "think_model": "gpt-4o-mini",
    "greeting": "Hello!"
  }
}
C

Cartesia Sonic

Hybrid architecture combining Deepgram STT, OpenAI LLM, and Cartesia Sonic TTS. Premium voice synthesis with native μ-law support.

Native μ-law @ 8kHz TTS output
Deepgram Nova-2 for STT
OpenAI GPT-4o-mini for reasoning
Configuration
{
  "provider": "cartesia",
  "cartesia": {
    "model": "sonic-2024-10-19",
    "voice_id": "694f9389-aac1...",
    "stt_provider": "deepgram",
    "llm_provider": "openai",
    "llm_model": "gpt-4o-mini"
  }
}

Simple architecture,
powerful results

A single Go binary handles everything: SIP signaling, RTP media, audio conversion, and WebSocket connections to AI providers.

Caller

Phone network via SIP/RTP

SIP4AI

SIP4AI Bridge

Go binary handles all conversion

AI Provider

WebSocket to voice AI

Function calls Your APIs

Full HTTP API
for control

Initiate calls, monitor status, inject context mid-conversation, and receive webhook events. Everything you need for programmatic control.

POST /api/calls

Initiate an outbound call with task and instructions

GET /api/calls/{id}

Get call status, duration, and results

POST /api/calls/{id}/inject

Inject context into an active conversation

DELETE /api/calls/{id}

Hang up an active call

Initiate a call
POST /api/calls

{
  "target": "+61400000000",
  "task": "Confirm appointment for Tuesday",
  "instructions": "Be friendly and brief",
  "event_webhook": "https://your-app.com/webhook",
  "max_duration_seconds": 300
}

// Response
{
  "success": true,
  "call_id": "550e8400-e29b-41d4-a716...",
  "status": "initiating"
}

Built for
real use cases

🏢

AI Receptionist

Answer and route incoming calls automatically

📅

Appointment Booking

Schedule appointments via natural conversation

📊

Customer Surveys

Outbound AI calls for feedback collection

📦

Order Updates

Proactive status calls to customers

Simple pricing,
no surprises

Self-host on your infrastructure. Pay annually for the license, not per minute or per call.

Free

For personal & non-commercial use

$0 /forever
  • 1 SIP line
  • All 5 AI providers
  • Full API access
  • Community support
Get Started

Personal

For freelancers & solo projects

$49 /year
  • 2 SIP lines
  • Commercial use
  • Email support
  • License key
Buy Personal
Most Popular

Business

For teams & growing companies

$299 /year
  • 10 SIP lines
  • Up to 5 team members
  • Priority support
  • Invoice & receipts
Buy Business

Enterprise

For large-scale deployments

Custom
  • Unlimited SIP lines
  • Unlimited team members
  • Dedicated support & SLA
  • Custom contract
Contact Sales

Get started
in seconds

Get started in seconds. One command to install, set your credentials, and you're ready to go.

Terminal
# Install SIP4AI
curl -sSL https://sip4ai.com/install.sh | bash

# Set credentials
export SIP_USERNAME=your_user
export SIP_PASSWORD=your_pass
export OPENAI_API_KEY=sk-...

# Run
sip4ai