AI Calling Agent

A real-time voice AI system that integrates OpenAI's Realtime API with Twilio Voice to create intelligent voice conversations. Perfect for customer service, compliance monitoring, and automated calling systems.

Branches

main - OpenAI Realtime API version (streaming, low latency)
llama3 - Llama3 via Together AI (traditional, cost-effective)

Features

Real-time Voice Processing - Instant speech recognition and response
Smart Interruption Handling - Natural conversation flow with speech detection
Flexible Configuration - Customizable prompts and voice settings
Call Recording - Automatic recording with compliance features
WebSocket Communication - Low-latency audio streaming
Production Ready - Built with FastAPI for scalability

Quick Start

Prerequisites

Python 3.8+
OpenAI API key (with Realtime API access)
Twilio account (SID, Auth Token, Phone Number)
ngrok or similar tunneling tool

Installation

Clone the repository

   git clone https://github.com/intellwe/ai-calling-agent.git
   cd ai-calling-agent

Install dependencies

pip install -r requirements.txt

Configure environment

cp .env.example .env
# Edit .env with your credentials

Start the server
```
uvicorn main:app --port 8000
```
Expose with ngrok
```
ngrok http 8000
```

Configuration

Create a .env file with the following variables:

OPENAI_API_KEY=your_openai_api_key
TWILIO_ACCOUNT_SID=your_twilio_account_sid
TWILIO_AUTH_TOKEN=your_twilio_auth_token
TWILIO_PHONE_NUMBER=your_twilio_phone_number
NGROK_URL=your_ngrok_url
PORT=8000

API Endpoints

Method	Endpoint	Description
GET	`/`	Health check
POST	`/make-call`	Initiate outbound call
POST	`/outgoing-call`	Twilio webhook handler
WebSocket	`/media-stream`	Real-time audio streaming

Making a Call

curl -X POST "http://localhost:8000/make-call" \
  -H "Content-Type: application/json" \
  -d '{"to_phone_number": "+1234567890"}'

Architecture

┌─────────────┐    WebSocket   ┌─────────────┐    HTTP/WS    ┌─────────────┐
│   Twilio    │ ◄────────────► │  FastAPI    │ ◄───────────► │   OpenAI    │
│   Voice     │                │   Server    │               │ Realtime API│
└─────────────┘                └─────────────┘               └─────────────┘

The system creates a bridge between Twilio's voice services and OpenAI's Realtime API, enabling natural voice conversations with AI.

Development

Setup Development Environment

Install development dependencies
```
pip install -r requirements-dev.txt
```
Install pre-commit hooks (optional)
```
pre-commit install
```

Code Quality Tools

Format code: black .
Sort imports: isort .
Lint code: flake8
Type checking: mypy main.py
Security scan: bandit -r .
Run tests: pytest

Customizing AI Behavior

Edit prompts/system_prompt.txt to modify the AI's personality and responses.

🤝 Contributing

Contributions are welcome! Please feel free to submit a Pull Request.

License

This project is licensed under the MIT License - see the LICENSE file for details.

Author

@FardinHash -> LinkedIn
@RianaAzad -> LinkedIn

⚠️ Disclaimer

This project is not officially affiliated with OpenAI or Twilio. Use responsibly and in accordance with their terms of service.

⭐ If you find this project helpful, please give it a star!

Name		Name	Last commit message	Last commit date
Latest commit History 3 Commits
prompts		prompts
.env.example		.env.example
.gitignore		.gitignore
CITATION.cff		CITATION.cff
LICENSE		LICENSE
README.md		README.md
main.py		main.py
pyproject.toml		pyproject.toml
requirements-dev.txt		requirements-dev.txt
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

Repository files navigation

AI Calling Agent

Branches

Features

Quick Start

Prerequisites

Installation

Configuration

API Endpoints

Making a Call

Architecture

Development

Setup Development Environment

Code Quality Tools

Customizing AI Behavior

🤝 Contributing

License

Author

⚠️ Disclaimer

About

Uh oh!

Releases 1

Sponsor this project

Uh oh!

Packages

Contributors 2

Languages

Uh oh!

License

intellwe/ai-calling-agent

Folders and files

Latest commit

History

Repository files navigation

AI Calling Agent

Branches

Features

Quick Start

Prerequisites

Installation

Configuration

API Endpoints

Making a Call

Architecture

Development

Setup Development Environment

Code Quality Tools

Customizing AI Behavior

🤝 Contributing

License

Author

⚠️ Disclaimer

About

Topics

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases 1

Sponsor this project

Uh oh!

Packages 0

Contributors 2

Languages

Packages