Self-Host LiveKit on Your Own Infrastructure

    SFU deployment on AWS, GCP, Kubernetes, or bare metal. We configure the full LiveKit stack: media server, TURN, Egress recording, Ingress streaming, and the monitoring that keeps it running. We are among the top contributors to the LiveKit community, and references are available on request.

    Deployed On

    AWS Partner Network · NVIDIA Inception Program · LiveKit

    Recognized by Clutch

    What We Deploy and Operate

    Every component of the self-hosted LiveKit stack, from SFU core to Egress recording pipelines, tuned for your scale and use case.

    SFU Infrastructure Setup

    We deploy and configure LiveKit's Selective Forwarding Unit on your own infrastructure. The SFU is the core routing engine that receives media tracks from every participant and selectively forwards them to each subscriber, without decoding or re-encoding the streams. We tune the SFU for your traffic patterns, configure ICE servers, and set bandwidth estimation parameters so your media quality holds up under load.
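    To make that concrete, here is the shape a minimal livekit.yaml takes for a single node. Every value below (ports, Redis address, TURN domain, credentials) is a placeholder assumption to adapt per deployment, not a drop-in production config.

        # Sketch of a livekit.yaml fragment -- all values are placeholders.
        port: 7880                       # HTTP/WebSocket signaling plane
        rtc:
          tcp_port: 7881                 # ICE-over-TCP fallback when UDP is blocked
          port_range_start: 50000        # UDP media range; must be open in the firewall
          port_range_end: 60000
          use_external_ip: true          # advertise the public IP behind cloud NAT
        redis:
          address: redis.internal:6379   # shared state; required for multi-node routing
        turn:
          enabled: true
          domain: turn.example.com       # hypothetical domain; TLS cert must match
          tls_port: 5349
        keys:
          your-api-key: your-api-secret  # placeholder credentials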

    AWS & GCP Cloud Deployments

    Full LiveKit deployments on AWS or GCP, from EC2 instances and EKS clusters to VPC networking and IAM policies. On AWS we configure Application Load Balancers for the API plane, UDP port exposure for WebRTC media, and auto-scaling groups tied to connection count metrics. On GCP we work with GKE, Cloud Load Balancing, and Cloud Armor for DDoS protection. Your data stays in your cloud account, under your security controls.
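    As one sketch of that split between the API plane and the media plane on EKS, a signaling Ingress might look like the fragment below, assuming the AWS Load Balancer Controller is installed. The resource and service names are hypothetical; media UDP bypasses the ALB entirely via host networking and security-group rules.

        # Illustrative ALB ingress for the signaling plane only.
        apiVersion: networking.k8s.io/v1
        kind: Ingress
        metadata:
          name: livekit-signaling        # hypothetical name
          annotations:
            alb.ingress.kubernetes.io/scheme: internet-facing
            alb.ingress.kubernetes.io/target-type: ip
        spec:
          ingressClassName: alb
          rules:
            - http:
                paths:
                  - path: /
                    pathType: Prefix
                    backend:
                      service:
                        name: livekit-server   # hypothetical service
                        port:
                          number: 7880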

    Bare Metal Deployments

    When cloud egress costs or ultra-low latency requirements make bare metal the right choice, we handle the full deployment: OS hardening, network interface configuration, systemd service management, firewall rules for WebRTC UDP ports, and kernel tuning for high-throughput media workloads. We have deployed LiveKit on co-location hardware for clients where data residency requirements ruled out cloud providers.
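    The kernel and firewall work reduces to a handful of host settings. The fragment below expresses them as Ansible tasks purely for illustration; the tooling choice and the buffer sizes are assumptions, and the same settings can be applied by hand with sysctl and ufw.

        # Illustrative host-prep tasks -- buffer sizes are assumptions to tune.
        - name: Raise UDP socket buffers for high-throughput media
          ansible.posix.sysctl:
            name: "{{ item.key }}"
            value: "{{ item.value }}"
            state: present
          loop:
            - { key: net.core.rmem_max, value: "16777216" }
            - { key: net.core.wmem_max, value: "16777216" }

        - name: Open the WebRTC UDP media port range
          community.general.ufw:
            rule: allow
            port: "50000:60000"
            proto: udp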

    Kubernetes & Multi-Node Clusters

    LiveKit on Kubernetes with distributed Redis, horizontal pod autoscaling, and node affinity rules that keep media processing on compute-optimized instances. We configure the LiveKit cluster with proper pod disruption budgets so rolling upgrades never drop active sessions, and we set up liveness and readiness probes that account for the long-lived WebSocket connections LiveKit maintains.
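    Two of those pieces are small enough to show. Below is a sketch of the disruption budget and probe fragments we mean, with illustrative thresholds; the health-check path against the signaling port is an assumption.

        # PodDisruptionBudget: drain one server at a time during node upgrades.
        apiVersion: policy/v1
        kind: PodDisruptionBudget
        metadata:
          name: livekit-server
        spec:
          maxUnavailable: 1
          selector:
            matchLabels:
              app: livekit-server
        ---
        # Probe fragment from the pod spec (not a standalone manifest):
        # a generous liveness threshold tolerates long-lived WebSocket sessions.
        readinessProbe:
          httpGet:
            path: /            # assumed health path on the signaling port
            port: 7880
          periodSeconds: 10
        livenessProbe:
          httpGet:
            path: /
            port: 7880
          periodSeconds: 30
          failureThreshold: 6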

    Egress & Recording Pipelines

    LiveKit Egress lets you record rooms, composite multiple participants into a single video, or stream directly to an RTMP destination like YouTube or Twitch. We deploy Egress as a separate service, configure the Chrome-based compositor with your branding, set up S3 or GCS for recording storage, and wire up webhooks so your application knows when recordings are ready. We also handle the compute scaling: Egress is CPU-heavy, and sizing it wrong means dropped frames.
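    A minimal sketch of the Egress service's own config, assuming it runs beside the main cluster: the credentials, URLs, template key, and storage block are placeholders, and storage can also be supplied per request rather than as a default here.

        # Illustrative Egress config.yaml -- every value is a placeholder.
        api_key: your-api-key
        api_secret: your-api-secret
        ws_url: wss://livekit.example.com   # hypothetical signaling URL
        redis:
          address: redis.internal:6379      # must match the LiveKit server's Redis
        template_base: https://assets.example.com/egress   # branded compositor templates (assumed key)
        s3:
          region: us-east-1
          bucket: your-recordings-bucket    # placeholder default storage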

    Ingress & Live Streaming Intake

    LiveKit Ingress converts existing RTMP, WHIP, or URL-based streams into LiveKit rooms. We deploy Ingress to accept feeds from OBS, hardware encoders, or streaming platforms and make them available as LiveKit participants. This is how you build hybrid architectures where a broadcaster streams via RTMP and viewers receive it through the same WebRTC infrastructure as your interactive participants.
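    The Ingress service carries a similar config. The sketch below uses commonly cited port defaults, but treat the exact keys and values as assumptions for your Ingress version.

        # Illustrative Ingress config.yaml -- ports and keys are assumptions.
        api_key: your-api-key
        api_secret: your-api-secret
        ws_url: wss://livekit.example.com
        redis:
          address: redis.internal:6379
        rtmp_port: 1935   # OBS and hardware encoders publish here
        whip_port: 8085   # WHIP ingest over HTTP (assumed default)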

    Custom Deployment Sizing

    The right deployment depends on your peak concurrent rooms, participants per room, and media quality requirements. A single VM handles a few hundred participants. A Kubernetes cluster on c5n.xlarge instances handles tens of thousands. We model your traffic, recommend instance types, and configure the connection limits, TURN allocations, and memory settings that match your scale without over-provisioning.

    Ongoing Maintenance & Ops

    LiveKit releases regularly, and upgrades require coordinated rolling restarts to avoid session drops. We handle version upgrades, certificate renewals, TURN server maintenance, Redis cluster health, and log aggregation. We set up alerting on ICE failure rates, room creation latency, and egress queue depth so you know about infrastructure problems before your users report them.
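    As a sketch of what that alerting looks like in Prometheus terms, the rule below pages on a rising ICE failure ratio. The metric names are hypothetical stand-ins to be wired to whatever series your deployment actually exposes.

        # Illustrative alerting rule -- metric names are hypothetical.
        groups:
          - name: livekit-infra
            rules:
              - alert: HighIceFailureRate
                expr: >
                  rate(livekit_ice_failures_total[5m])
                  / rate(livekit_ice_attempts_total[5m]) > 0.05
                for: 10m
                labels:
                  severity: page
                annotations:
                  summary: ICE failures above 5% -- check TURN reachability and UDP exposure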

    No Vibe Coding · Top LiveKit Community Contributors

    Self-Hosting LiveKit Is Not a Weekend Project

    The LiveKit quickstart has you running a local server in ten minutes. A production self-hosted deployment is a different undertaking. You are operating a real-time media router that must maintain sub-200ms latency for every participant, recover from node failures without dropping sessions, handle UDP traffic that cloud security groups block by default, and scale horizontally without splitting Redis state across partitioned nodes. Each of those problems requires infrastructure experience that the documentation does not cover.

    WebRTC infrastructure has failure modes that developers encounter once and rarely document. ICE candidate selection behaves differently when your SFU is behind a load balancer versus directly accessible. Egress recording drops frames when the Chrome compositor runs on a CPU that is also handling media forwarding. TURN relay adds asymmetric latency when your TURN server is in a different region than your SFU. Redis pub-sub under high message volume degrades room join latency in ways that are hard to diagnose without metrics instrumentation.

    We are active participants in the LiveKit community forum and rank among the top contributors by activity. We have deployed LiveKit across AWS, GCP, and bare metal in configurations that range from a single VM handling 300 participants to Kubernetes clusters with dedicated Egress nodes for concurrent recording. If you want to speak to someone who has worked with our deployments, ask us for references. We will connect you directly.

    Deployment Options We Support

    We match the deployment model to your scale, budget, and compliance requirements.

    Single VM

    One EC2 or GCE instance running LiveKit Server, Redis, and a Coturn TURN server, as sketched below the feature list. Right for up to a few hundred concurrent participants. Simple to maintain, fast to provision, and easy to understand when something goes wrong.

    Fast setup, low ops overhead
    Predictable monthly cost
    Good for POC to mid-scale
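    A minimal Compose sketch of that single-VM layout, assuming host networking and containerized services; image tags and config paths are placeholders to pin and adapt.

        # Illustrative docker-compose.yml -- pin image tags before production use.
        services:
          livekit:
            image: livekit/livekit-server:latest
            command: --config /etc/livekit.yaml
            network_mode: host            # direct UDP access for media
            volumes:
              - ./livekit.yaml:/etc/livekit.yaml
            restart: unless-stopped
          redis:
            image: redis:7
            network_mode: host
            restart: unless-stopped
          coturn:
            image: coturn/coturn:latest
            network_mode: host            # TURN relay needs its ports directly
            volumes:
              - ./turnserver.conf:/etc/coturn/turnserver.conf
            restart: unless-stopped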

    Kubernetes Cluster

    EKS or GKE cluster with autoscaling LiveKit pods, distributed Redis, and dedicated Egress nodes. Handles thousands of concurrent rooms with rolling upgrades and zero dropped sessions. Right for production platforms with variable traffic.

    Auto-scales with load
    Zero-downtime upgrades
    Multi-region ready

    Bare Metal

    Dedicated servers in a co-location facility with flat-rate bandwidth. No cloud egress costs, kernel-level tuning for media throughput, and deterministic performance without noisy neighbors. Right for high-volume media workloads where cloud egress costs are prohibitive.

    No egress fees
    Maximum throughput per dollar
    Data residency compliance

    Our Self-Hosted LiveKit Stack

    The full technology set for deploying, scaling, and operating LiveKit infrastructure.

    LiveKit Server
    LiveKit Agents
    LiveKit Egress
    LiveKit Ingress
    WebRTC
    SFU
    TURN / STUN
    Redis Cluster
    Kubernetes (EKS/GKE)
    AWS EC2
    AWS EKS
    GCP GKE
    Bare Metal / Co-lo
    Terraform
    Helm
    Prometheus
    Grafana
    Coturn
    Python
    Go

    How We Work

    From infrastructure audit to production deployment with a documented runbook.

    Step 1

    Infrastructure Discovery

    We start with your current setup: cloud provider, existing networking, security requirements, peak load expectations, and budget constraints. We document the full picture before recommending an architecture, because the right LiveKit deployment for a HIPAA healthcare app is very different from one for a gaming platform.

    Step 2

    Deployment Architecture

    We deliver a detailed deployment plan: instance types, network topology, TURN configuration, Egress sizing, Kubernetes manifests or systemd units, and a cost model. You see the exact infrastructure before we build it, with trade-offs documented so you understand what you are getting and why.

    Step 3

    Deploy, Test & Hand Off

    We deploy to a staging environment first, run load tests to validate capacity, then promote to production with a documented runbook. We transfer knowledge to your team with architectural walkthroughs, monitoring dashboards, and escalation procedures so you can maintain what we build.


    Ready to Self-Host LiveKit the Right Way?

    Whether you are starting from scratch or taking over an existing deployment, we will assess your infrastructure and tell you exactly what needs to be done. We provide references on request.

    Free 30-minute infrastructure discovery call
    Detailed deployment architecture proposal within one week
    References available from community and past clients
    Full knowledge transfer and runbook on delivery

    Get a Free Infrastructure Assessment

    Describe your LiveKit deployment requirements and we'll assess the right architecture for your scale, budget, and compliance needs.
