Developer Preview

Intro background
Product

Agents

Build programmable participants that can join sessions and process or synthesize media streams in real-time.

chain

Compose multimodal AI apps with plugins

Chain together prebuilt integrations for speech and video processing, text or image generation, and more.

Build on top of our growing set of plugins

OpenAIGoogleAnthropicElevenLabs
platforms
cross-platform

Use WebRTC on the server

Your agent has access the same LiveKit WebRTC features as our client SDKs.

agent.py
1async def publish_audio(self):
2 audio_source = rtc.AudioSource(sample_rate, num_channels)
3 track = rtc.LocalAudioTrack
4 .create_audio_track("agent-voice", audio_source)
5
6 options = rtc.TrackPublishOptions()
7 options.source = rtc.TrackSource.SOURCE_MICROPHONE
8
9 await self.ctx.room.local_participant
10 .publish_track(track, options)
1
Hello, I'm here.
Hello, I'm here.
multimodal

Create agents that can see and hear

Pipe streaming, raw audio packets or video frames through an AI model or processing workflow.

1async def detect_faces(self, frame: rtc.VideoFrame):
2 faces = detect_faces(frame)
3 annotated_frame = draw_bounding_boxes(faces, frame)
4 video_source.capture_frame(annotated_frame)
1
Face Detector

Example showing detecting a face in realtime

synthesize

Stream output from generative AI

Generate and stream voice or video samples on-the-fly or play back prerecorded media files.

1async def synthesize(self, text: str):
2 generated_frame = await stable_diffusion_generate(prompt=text)
3 video_source.capture_frame(generated_frame)
1
scale

Autoscale and orchestrate your agents in prod

LiveKit’s edge worker service automatically monitors, health checks, and distributes load across your pool of agents.

platforms
use-case

Allow machines to see the world

DirectAI works with LiveKit Agents to help developers instantly build computer vision models without code or training data.

Check out Direct AI
Direct AI
Show me seagulls flying
showcase

See agents in action

Browse some of our agent demos that leverage the latest in AI powered technology

KITT voice assistant

KITT voice assistant

KITT is an AI voice assistant powered by Deepgram, Eleven Labs, and ChatGPT.

Try it out

Try Agents for free

LiveKit Cloud is a cloud real-time platform and the fastest way to prototype and ship to production.

Chevron IconGet StartedTry out Agents playgroundLearn more
LiveKit Logo
GitHub LogoTwitter Logo

Product

SFU

SDKs

Cloud Dashboard

Cloud Status

© 2024 LiveKit. All rights reserved.

Terms of Service
|
Cookie Policy
|
Privacy Policy