openai-realtime-webrtc-python

A Python library for real-time audio streaming with the OpenAI Realtime API over WebRTC.

Features

Real-time audio communication over WebRTC
Support for OpenAI Realtime API
Automatic audio device management
Automatic sample rate conversion
Low-latency audio streaming
Audio buffering management
Pause/resume streaming support

Requirements

Python 3.7+
Supported operating systems: Windows, macOS, Linux
Audio device support

Dependencies

sounddevice>=0.4.6
numpy>=1.24.0
websockets>=11.0.3
openai>=1.3.0
aiohttp>=3.8.5
pyaudio>=0.2.13
python-dotenv>=1.0.0
aiortc>=1.6.0
scipy>=1.12.0

Installation

Clone the repository:

git clone https://github.com/yourusername/openai-realtime-webrtc-python.git
cd openai-realtime-webrtc-python

Create a virtual environment:

python -m venv venv
source venv/bin/activate  # Linux/macOS

Install dependencies:

pip install -r requirements.txt

Install in development mode:

pip install -e .

Usage

Set up environment variables: Create a .env file and add your OpenAI API key:

OPENAI_API_KEY=your-api-key-here

Basic example:

import asyncio
from openai_realtime_webrtc import OpenAIWebRTCClient

async def main():
    # Create client instance
    client = OpenAIWebRTCClient(
        api_key="your-api-key",
        model="gpt-4o-realtime-preview-2024-12-17",
        tools=[
            {
                "name": "display_color_palette",
                "description": "Displays the colors palette",
                "parameters": {
                    "type": "object",
                    "properties": {
                        "colors_hex": {
                            "type": "string",
                            "description": "Comma-separated list of color hex values"
                        }
                    },
                    "required": ["colors_hex"]
                }
            }
        ]
    )

    # Define transcription callback
    def on_transcription(text: str):
        print(f"Transcription: {text}")

    client.on_transcription = on_transcription

    # Define event callback (for tools/function calling)
    def on_event(event: dict):
        print(f"Event: {event}")

    client.on_event = on_event

    try:
        # Start streaming
        await client.start_streaming()
        # Keep the connection alive
        while True:
            await asyncio.sleep(1)
    except KeyboardInterrupt:
        # Stop streaming
        await client.stop_streaming()

if __name__ == "__main__":
    asyncio.run(main())

To add support for tools (function calling) at initialization, pass a tools list to the constructor. The client will automatically send a session.update (embedding tools and tool_choice in the session field) on session creation to register these tools. For an example of handling function call events (e.g. for follow-up requests), see examples/basic_streaming.py.

Run the example:

python examples/basic_streaming.py

Contributing

Pull requests and issues are welcome!

License

MIT License

Changelog

v0.1.0

Initial release
Implement basic WebRTC audio streaming functionality
Support for OpenAI Realtime API
Automatic audio device management
Audio resampling support

Name		Name	Last commit message	Last commit date
Latest commit History 8 Commits
examples		examples
src/openai_realtime_webrtc		src/openai_realtime_webrtc
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
requirements.txt		requirements.txt
setup.py		setup.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

openai-realtime-webrtc-python

A Python library for real-time audio streaming with the OpenAI Realtime API over WebRTC.

Features

Requirements

Dependencies

Installation

Usage

Contributing

License

Changelog

v0.1.0

About

Uh oh!

Releases

Packages

Languages

License

astroseger/openai-realtime-webrtc-python

Folders and files

Latest commit

History

Repository files navigation

openai-realtime-webrtc-python

A Python library for real-time audio streaming with the OpenAI Realtime API over WebRTC.

Features

Requirements

Dependencies

Installation

Usage

Contributing

License

Changelog

v0.1.0

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages