Developer Preview
Agents
Build programmable participants that can join sessions and process or synthesize media streams in real-time.
Compose multimodal AI apps with plugins
Chain together prebuilt integrations for speech and video processing, text or image generation, and more.
Build on top of our growing set of plugins
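The plugin-chaining idea above can be sketched as a pipeline of async stages, where each stage's output feeds the next. The `stt`, `llm`, and `tts` names here are illustrative stand-ins, not the actual plugin API; they stub out speech-to-text, text generation, and speech synthesis so the composition pattern is visible:

```python
import asyncio

# Illustrative stand-ins for speech/LLM/TTS plugins -- the real plugin
# classes and signatures will differ; only the chaining shape matters here.
async def stt(audio: bytes) -> str:
    return "hello agent"          # pretend transcription

async def llm(prompt: str) -> str:
    return f"echo: {prompt}"      # pretend text generation

async def tts(text: str) -> bytes:
    return text.encode("utf-8")   # pretend synthesized audio

async def pipeline(audio: bytes) -> bytes:
    # Chain the stages: each plugin's output is the next plugin's input.
    transcript = await stt(audio)
    reply = await llm(transcript)
    return await tts(reply)

result = asyncio.run(pipeline(b"\x00\x01"))
```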
Use WebRTC on the server
Your agent has access to the same LiveKit WebRTC features as our client SDKs.
async def publish_audio(self):
    audio_source = rtc.AudioSource(sample_rate, num_channels)
    track = rtc.LocalAudioTrack.create_audio_track("agent-voice", audio_source)

    options = rtc.TrackPublishOptions()
    options.source = rtc.TrackSource.SOURCE_MICROPHONE

    await self.ctx.room.local_participant.publish_track(track, options)
Create agents that can see and hear
Pipe streaming raw audio packets or video frames through an AI model or processing workflow.
async def detect_faces(self, frame: rtc.VideoFrame):
    faces = detect_faces(frame)
    annotated_frame = draw_bounding_boxes(faces, frame)
    video_source.capture_frame(annotated_frame)
Example: detecting faces in real time
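The `draw_bounding_boxes` step in the snippet above can be illustrated with a minimal pure-Python sketch. A real agent would annotate `rtc.VideoFrame` pixel buffers; this stand-in treats a grayscale frame as a list of pixel rows and just marks a detected face's box edges:

```python
# Minimal sketch (not the real API): mark one bounding box on a raw
# grayscale frame represented as a list of pixel rows.
def draw_bounding_box(frame, x, y, w, h, value=255):
    for col in range(x, x + w):
        frame[y][col] = value              # top edge
        frame[y + h - 1][col] = value      # bottom edge
    for row in range(y, y + h):
        frame[row][x] = value              # left edge
        frame[row][x + w - 1] = value      # right edge
    return frame

frame = [[0] * 8 for _ in range(8)]
annotated = draw_bounding_box(frame, x=2, y=2, w=4, h=4)
```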
Stream output from generative AI
Generate and stream voice or video samples on the fly, or play back prerecorded media files.
async def synthesize(self, text: str):
    generated_frame = await stable_diffusion_generate(prompt=text)
    video_source.capture_frame(generated_frame)
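For the prerecorded-playback case, the core step is splitting raw media bytes into fixed-size frames that can be pushed through a source one at a time. A hedged sketch, with an arbitrary frame size (real code would derive it from the sample rate and channel count):

```python
# Sketch: split raw PCM bytes into fixed-size frames for playback.
# frame_size here is arbitrary; real code computes it from the
# sample rate and channel count of the audio source.
def chunk_audio(pcm: bytes, frame_size: int):
    for i in range(0, len(pcm), frame_size):
        yield pcm[i:i + frame_size]

frames = list(chunk_audio(b"\x00" * 10, frame_size=4))
```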
Autoscale and orchestrate your agents in prod
LiveKit’s edge worker service automatically monitors, health-checks, and load-balances your pool of agents.
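The load-distribution idea can be sketched conceptually: route each new session to the worker currently handling the fewest jobs. The actual edge worker service is managed by LiveKit and its dispatch policy may differ; this stand-in only illustrates the principle:

```python
# Conceptual sketch (not LiveKit's implementation): dispatch each new
# session to the least-loaded worker in the pool.
def dispatch(workers: dict) -> str:
    # workers maps worker id -> number of active jobs
    chosen = min(workers, key=workers.get)
    workers[chosen] += 1
    return chosen

pool = {"worker-a": 2, "worker-b": 0, "worker-c": 1}
first = dispatch(pool)   # least-loaded worker gets the session
second = dispatch(pool)
```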
Allow machines to see the world
DirectAI works with LiveKit Agents to help developers instantly build computer vision models without code or training data.
See agents in action
Browse some of our agent demos that leverage the latest in AI-powered technology.
Try Agents for free
LiveKit Cloud is a real-time cloud platform and the fastest way to prototype and ship to production.