Virtual Event | OCTOBER 22 + 23, 2025
The event for developers who care about high-perfomance, low-latency applications
Filter Videos
Browse our library of talks on low-latency engineering strategies.
ChatGPT Ain’t Got $%@& On Me!
Andy Pavlo
Associate Professor at Carnegie Mellon University
This talk presents research on a new generation of autonomous tuning agents that optimize more parts of a database in…
Clickhouse’s C++ & Rust Journey
Alexey Milovidov
CTO at ClickHouse, Inc.
A full rewrite from C++ to Rust or gradual integration with Rust libraries? For a large C++ codebase, only the…
LLM Inference Optimization
Chip Huyen
LLM Inference Optimization
This talk will discuss why LLM inference is slow and key latency metrics. It also covers techniques that make LLM…
P99 & Me
Dor Laor
CEO of ScyllaDB
At ScyllaDB, P99 is part of our DNA. Predictable low latency has been a hard requirement ever since we launched…
Performance Insights Beyond P99: Tales from the Long Tail
Rachel Stephens
Research Director at Red Monk
Adrian Cockroft
Tech Advisor at Nubank
Beyond the P99” moments are the rare, unpredictable outliers that disproportionately affect performance, reliability, and user experience. In this session,…
Let’s Do a Lot of Fuzzing in the Cloud
Alex Pshenichkin
Principal Engineer, Tech Lead at Antithesis
How do you simulate fast enough to do DST? Performance is a real hurdle to successfully implementing deterministic simulation, because…
A Visual Journey Through Async Rust
Alex Puschinsky
Tech Lead Software Engineer at Trigo
Async programming is tricky, but tinkering and visualization make it click. In this talk, we’ll build an async Rust visualization…
Rivian’s Push Notification Sub Stream with Mega Filter
Marcus Kim
Software Engineer II at Rivian and VW Group Technology, LLC
Saahil Khurana
Staff Software Engineer at Rivian and VW Group Technology, LLC
Rivian vehicles stream over 5500 signals every 5 seconds, but only about 80 are relevant for push notifications. Without filtering,…
Timeseries Storage at Ludicrous Speed
Duarte Nunes
Staff Engineer at Datadog
Datadog’s real-time storage system for timeseries data ingests billions of points per second and serves thousands of queries per second…
Fast and Deterministic Full Table Scans at Scale
Felipe Cardeneti Mendes
Technical Director at ScyllaDB
ScyllaDB’s new tablet replication algorithm replaces static vNodes with dynamic, elastic data distribution that adapts to shifting workloads. This talk…
P99 Latency at 32 Million Concurrent Streams
Ashutosh Agrawal
Staff Software Engineer at Gemini
Tim Koopmans
Senior Director, Product Experience at ScyllaDB
Ashutosh and Tim talk about all things related to glass-to-glass latency: live streaming of sports to 32 million concurrent devices…
Designing Low-Latency Systems with TLA+
Hillel Wayne
Consultant at Windy Coast Consulting
Many costly bugs come not from code, but from flawed designs: a common challenge in complex high-performance systems. TLA+ lets…
Mechanical Sympathy in Cooperative Multitasking
Kenny Chamberlin
Lead Engineer at Momento
This talk applies mechanical sympathy to server workloads that use cooperative multitasking and async/await. We’ll cover three techniques: reducing thread…
Shared Nothing Databases at Scale
Nick Van Wiggeren
CTO at PlanetScale
This talk will discuss how PlanetScale scaled databases in the cloud, focusing on a shared-nothing architecture that is built around…
The Power of Small Optimizations
Maksim Kita
Principal Software Engineer at Tinybird
This session will cover some important small optimizations that I contributed to ClickHouse over the last years — optimizations that…
GPUS and How to Program Them
Manya Bansal
PhD Student at Massachusetts Institute of Technology
CUDA, designed as an extension to C++, preserves its familiar abstractions. However, unlike CPU programming — where compilers and runtime…
Parsing Protobuf as Fast as Possible
Miguel Young de la Sota
Engineer at Buf Technologies
Protobuf is an extremely popular binary data interchange format. This session dives into hyperpb, a Protobuf parser for Go that…
Push the Database Beyond the Edge
Nikita Sivukhin
Software Engineer at Turso
Almost any application can benefit from having data available locally – enabling blazing-fast access and optimized write patterns. This talk…
ZGC: A Decade of Innovation
Stefan Johansson
Principle Member of Technical Staff at Oracle
ZGC has been in development for more or less a decade now. This talk will explore the current state of…
A Deep Dive into the Seastar Event Loop
Pavel Emelyanov
Principal Software Engineer at ScyllaDB
The core and the basis of ScyllaDB’s outstanding performance is the Seastar framework, and the core and the basis of…
A Java Developer’s Quest for I/O Performance
David Vlijmincx
Java Developer at JPoint
My journey optimizing Java’s io_uring bindings taught me what performance truly means. Through misleading benchmarks, midnight debugging sessions, and countless…
40x Faster Binary Search
Ragnar Groot Koerkamp
PhD at ETH Zurich
This talk will first expose the lie that binary search takes O(lg n) time — it very much does not!…
Design Considerations for P99-optimized Hash Tables
Steve Heller
President at Chrysalis Software Corp.
Hash tables are a classic data structure but struggle in P99-optimized applications, especially with variable-length records. Open addressing works well…
The Tale of Taming TigerBeetle’s Tail Latency
Tobias Ziegler
Software Engineer at Tigerbeetle
Learn how we reduced TigerBeetle’s tail latency through algorithm engineering. ‘Algorithm engineering goes beyond studying theoretical complexity and considers how…
Bridging epoll and io_uring in Async Rust
Tzu Gwo
Co-Founder & CEO at Tonbo IO Inc.
Tokio dominates async Rust, but its epoll-based model makes it hard to adopt io_uring. This talk explains why async Rust’s…
Turbocharging MCP: Speed, Smarts, and Scale
Viraj Sharma
Student at Presidium School Delhi India
Learn how to speed up Model Context Protocol (MCP) tools using async servers, caching, batching, and smart data handling—making your…
Patterns of Low Latency
Pekka Enberg
Founder & CTO at Turso
Building for low latency is important, but the tips and tricks are often part of developer folklore and hard to…
Noisy Neighbor Detection with eBPF
Jose Fernandez
Senior Software Engineer at Netflix
Tackling “noisy neighbor” issues in multi-tenant setups! At Netflix, we use eBPF to monitor and mitigate excessive CPU usage in…
Designing a Query Queue for ScyllaDB
Avi Kivity
CTO and Co-Founder of ScyllaDB
Database queries vary widely—from milliseconds to hours. Optimizing concurrency is a delicate balance of CPU, memory, and stability. Bad design…
You’re Doing It All Wrong
Michael Stonebraker
CTO & Co-founder of DBOS
Historically, business apps use a three-tier architecture. Now, cloud-native architectures and DBMS can be combined, allowing for resilient, cost-effective, and…
1BRC – Nerd Sniping the Java Community
Gunnar Morling
Principal Software Engineer at Decodable
Gunnar Morling dives into the tricks that the fastest 1BRC solutions used to process the challenge’s 13 GB input file…
Just In Time LSM Compaction
Aleksei Kladov
Staff Software Engineer at TigerBeetle
Matklad dives into the implementation of TigerBeetle’s JIT compaction algorithm for LSM, which is highly concurrent and uses all available…
Redis Alternatives Compared
Peter Zaitsev
Founder of Percona, Coroot, FerretDB
Join Peter as he dives into Redis alternatives like Valley, DragonflyDB, and Microsoft Garnet. He’ll cover licensing, features, community support,…
One Billion Row Challenge in Golang
Shraddha Agrawal
Senior Software Engineer, Ceph, IBM
Join us as we tackle Gunnar Morling’s One Billion Rows Challenge in Golang! We’ll walk through optimizing a 16GB file…
Taming Discard Latency Spikes
Patryk Wróbel
Software Engineer at ScyllaDB
Learned a crucial lesson on read/write latency when fixing a real ScyllaDB issue! Discover how TRIM requests impact NVMe SSDs…
Why Databases Cache, but Caches Go to Disk
Felipe Cardeneti Mendes
Technical Director at ScyllaDB
Alan Kasindorf
Founder of Cache Forge
ScyllaDB teamed up with Memcached to compare how caches and databases handle storage and memory across different scenarios. We’ll dive…
Low-Latency Mesh Services Using Actors
Nikita Lapkov
Senior Software Engineer
We’re transforming elfo, our Rust actor system, into a distributed mesh of services. Learn how we tackled message serialization, compression,…
Get Low (Latency)
Benjamin Cane
Distinguished Engineer at American Express
Tyler Wedin
Vice President, Global Payments Network SRE at American Express
Building a real-time, low-latency card payments system is a challenge. Join the Amex Payments Network team to learn about their…
Reliable Data Replication
Cameron Morgan
Staff Infrastructure Engineer at Shopify
Data replication ensures high availability—reliable, consistent, and timely access. Dive into the tough problems often skipped: reliable backfills, schema changes,…
Scheduler Tracing With ftrace + eBPF
Jason Rahman
Principal Software Engineer at Microsoft
Dive into understanding app latency by exploring the Linux scheduler with ftrace, eBPF, and Perfetto for visualization. Uncover quirks in…
Building a Cloud Native LSM on Object Storage
Chris Riccomini
Creator of Materialized View
Rohan Desai
Co-Founder of Responsive
Excited to introduce SlateDB, an open-source, cloud-native storage engine. Built as an LSM on object stores like S3/GCS/ABS, it leverages…
Running Low-Latency Workloads on Kubernetes
Jimmy Zelinskie
Co-Founder of AuthZed
Configuring Kubernetes for optimal workload performance is a continuous journey. Best practices can sometimes harm performance. Join us as we…
WebAssembly on the Edge: Sandboxing AND Performance
Brian Sletten
President at Bosatsu Consulting, Inc.
Ramnivas Laddad
Co-Founder of Exograph, Inc
Moving apps to the Edge can complicate performance due to security constraints. Learn how WebAssembly bridges the gap, enabling both…
Queues, Hockey Sticks and Performance
David Collier-Brown
Staff Engineer
Queues: both a blessing and a curse in computer science. They help predict performance but also signal overload. This talk…
Remote CAD that Feels Local
Adam Chalmers
Systems Engineer at Zoo
Adam Sunderland
Lead Cloud Infrastructure Engineer at Zoo
Zoo is creating a CAD suite that runs in the cloud but feels like it’s local. How? Regional deployment, WebRTC…
Profiling your Go Service with pprof
Miriah Peterson
Lead Engineer at Soypete Tech
Optimize your Go code with the powerful pprof tool. Learn how to integrate, access, and interpret pprof metrics, plus best…
High Performance on a Low Budget
Gwen Shapira
Co-founder & CPO of Nile
It is one thing to solve performance challenges when you have plenty of time, money, and expertise available. Many performance…
eBPF vs Sidecars
Liz Rice
Chief Open Source Officer, Isovalent at Cisco
From its vantage point in the kernel, eBPF provides a platform for building a new generation of infrastructure tools for…
Performance Budgets for the Real World
Tammy Everts
Chief Experience Officer at SpeedCurve
Performance budgets have been around for more than ten years. Over those years, we’ve learned a lot about what works,…
Measuring the Impact of Network Latency at Twitter
Widya Salim
Data Scientist at SEEK
Victor Ma
Senior Data Scientist at Airwallex
Zhen Li
Data Scientist at TikTok
Widya Salim, Victor Ma, and Zhen Li will outline the causal impact analysis, framework, and key learnings used to quantify…
ORM is Bad, But is There an Alternative?
Henrietta Dombrovskaya
Database Architect at DRW
It’s a well-known fact, that although the database performance is great, and each query is executed in milliseconds, the overall…
The History of Tracing Oracle
Cary Millsap
Distinguished Product Manager at Oracle
In this presentation, I will explore the history of tracing Oracle and why it has been overlooked despite its usefulness.…
Practical Go Memory Profiling
William Kennedy
Managing Partner at Ardan Labs
In this talk, Bill will show you how to use benchmark profiling in and compiler directives in Go to find…
Noise Canceling RUM
Tim Vereecke
Web Performance Architect at Akamai
Noisy Real User Monitoring (RUM) data can ruin your P99! We introduce a fresh concept called “Human Visible Navigations” (HVN)…
Less Wasm
Piotr Sarna
Founding Engineer at poolside
The presentation explains why getting rid of WebAssembly is good for your latency. More specifically, it’s a short case study…
Reducing P99 Latencies with Generational ZGC
Stefan Johansson
Principle Member of Technical Staff at Oracle
With the low-latency garbage collector ZGC, GC pause times are no longer a big problem in Java. With sub-millisecond pause…
Chihuahua-Sized Load Tests!
Leandro Melendez
Developer Advocate at Grafana Labs
Because bigger isn’t always better. Especially nowadays.Do your teams need help accommodating those humongous load tests in your agile &…
99.99% of Your Traces are Trash
Paige Cruz
Senior Developer Advocate at Chronosphere
Distributed tracing is still finding its footing in many organizations today, one challenge to overcome is the data volume –…
A Deep Dive Into Concurrent React
Matheus Albuquerque
Senior Software Engineer, Front-End at Medallia
Writing fluid user interfaces becomes more and more challenging as the application complexity increases. In this talk, we’ll explore how…
Ingesting in Rust
Armin Ronacher
Creator of Flask and Principal Architect at Sentry
At Sentry we handle hundreds of thousands of events a second — from tiny metric to huge memory dump. What…
Building a 10x More Efficient Edge Platform
Felipe Huici
CEO and Co-Founder of Unikraft UG
Painful cold boots, terrible auto-scale times, minutes-long waits for compute nodes to be up: these are standard headaches that cloud…
HTTP 3: Moving on From TCP
Brian Sletten
President at Bosatsu Consulting, Inc.
Any network class you have taken in the last thirty years will have highlighted that the application layer depends on…
Misery Metrics & Consequences
Gil Tene
CTO and Co-Founder of Azul Systems
Join Azul System’s Gil Tene as he defines “misery metrics,” which describe what happens when our production systems are operating…
From SLO to GOTY
Charity Majors
CTO of Honeycomb
Charity Majors shares the performance lessons we can all learn from game developers, who were among the first to run…
Linux Kernel vs DPDK: HTTP Performance Showdown
Marc Richards
Performance Engineer at Amazon Web Services
AWS’ Marc Richards uses an HTTP benchmark to compare performance of the Linux kernel networking stack with userspace networking doing…
Speedup Your Code Through Asynchronous Programing
Sabina Smajlaj
Operations Developer at Hudson River Trading
Hudson River Trading’s Sabina Smajlaj demonstrates how to take advantage of programming languages’ asynchronous libraries with a few minor tweaks…
A New IO Scheduler Algorithm for Mixed Workloads
Pavel Emelyanov
Principal Software Engineer at ScyllaDB
Discover how ScyllaDB, built on the highly asynchronous Seastar library, implemented an IO scheduler optimized for peak performance on modern…
Why User-Mode Threads Are Good for Performance
Ron Pressler
Project Loom Technical Lead, Java Platform Group at Oracle
Hear from Oracle’s Ron Pressler how Java added virtual threads, an implementation of user-mode threads, to help write high-throughput servers.
Hardware Assisted Latency Investigations
Kshitij Doshi
Senior Principal Engineer, Intel Corportation
Harshad S Sane
Principal Software Engineer at Intel
Intel’s Harshad S Sane & Kshitij Doshi share new ways to use eBPF to better examine latency excursions.
Evaluating Performance In Go
William Kennedy
Managing Partner at Ardan Labs
William Kennedy provides a deep dive training on how to optimize Go’s concurrency and garbage collection.
Implementing Highly Performant Distributed Aggregates
Michal Jadwiszczak
Software Engineer at ScyllaDB
ScyllaDB’s Michał Jadwiszczak explains how can you implement aggregate functions without hammering real-time availability and performance for other read/write operations.
A Deep Dive into Query Performance
Peter Zaitsev
Founder of Percona, Coroot, FerretDB
Percona’s Peter Zaitsev explores overlooked and underappreciated ways to successfully establish a connection and get results to the queries promptly…
Fast and Fault Tolerant
Michael Barker
Independent Consultant at Ephemeris Consulting
Michael Barker draws on knowledge from working on financial exchanges, messaging and clustering systems to describe a model that can…
C# as a System Language
Oren Eini
Founder & CEO of RavenDB
RavenDB’s Oren Eini discusses the features that make C# a viable system language for building high-end systems.
Retaining Goodput with Query Rate Limiting
Piotr Dulikowski
Senior Software Engineer, ScyllaDB
ScyllaDB’s Piotr Dulikowski walks through how they tackled a “hot partition” problem: a single partition accessed with disproportionate frequency that…
It’s Time to Debloat the Cloud with Unikraft
Felipe Huici
CEO and Co-Founder of Unikraft UG
Felipe Huici introduces Unikraft, a cloud operating system that allows for easily building fully-tailored cloud-ready images that boot in a…
Performance Insights Into eBPF, Step by Step
Dmitrii Dolgov
Senior Software Engineer at Red Hat
Red Hat’s Dmitri Dolgov sheds light on using eBPF. How to collect execution metrics, profile programs and common pitfalls to…
cachegrand: A Take on High Performance Caching
Daniele Salvatore Albano
Senior Software Engineer II at Microsoft
Microsoft’s Daniele Salvatore Albano presents cachegrand, a SIMD-accelerated hashtable without locks or busy-wait loops using fibers, io_uring, and much more.
Throw Away Your Nines
Alex Hidalgo
Principal Reliability Advocate at Nobl9
You may encounter problems if you only think about “nines” setting service reliability targets. Throw away your nines. Let’s find…
Cutting Through the Fog of Virtualization
Bernd Bandemer
Head of Data Science at Clockwork Systems Inc.
Clockwork Systems’ Bernd Bandemer details causes of cloud network latency, from its underlying infrastructure, to its physical topology and network…
Three Perspectives on Measuring Latency
Geoffrey Beausire
Senior Site Reliability Engineer at Criteo
Discover from Criteo’s Geoffrey Beausire how to measures latency in key-value infrastructure from both server and client sides, as well…
OSNoise Tracer: Who Is Stealing My CPU Time?
Daniel Bristot de Oliveira
Principal Software Engineer at Red Hat
Daniel Bristot de Oliveira (Red Hat) explores operating system noise (the interference experienced by an application due to activities inside…
How to Measure Latency
Heinrich Hartmann
Principal Engineer at Zalando
Heinrich Hartmann (Zalando) shares strategies for avoiding pitfalls with collecting, aggregating and analyzing latency data for monitoring and benchmarking.
Rust Is Safe. But Is It Fast?
Glauber Costa
Founder & CEO of Turso
Glauber Costa outlines pitfalls and best practices for developing Rust applications with low P99.
G1: To Infinity and Beyond
Stefan Johansson
Principle Member of Technical Staff at Oracle
Stefan Johansson (Oracle) provides insights on the G1 JVM garbage collector — what’s new, how it impacts performance, and what’s…
I/O Rings and You — Optimizing I/O on Windows
Yarden Shafir
Software Engineer at Crowdstrike
Yarden Shafir (Crowdstrike) introduces Windows’ implementation of I/O rings, demonstrating how it’s used, and discusses potential future additions.
Scaling Apache Pulsar to 10 Petabytes/Day
Karthik Ramasamy
Senior Director of Engineering at Splunk
Karthik Ramaswamy (Splunk) demonstrates how data — including logs and metrics — can be processed at scale and speed with…
Seastore: Next Generation Backing Store for Ceph
Sam Just
Senior Principal Software Engineer at Red Hat
Sam Just (Red Hat) shares how they architected their next-generation distributed file system to take advantage of emerging storage technologies…
Object Compaction in Cloud for High Yield
Tejas Chopra
Senior Software Engineer at Netflix
Tejas Chopra shares how Netflix gets massive volumes of media assets and metadata to the cloud fast and cost-efficiently.
Where Did All These Cycles Go?
Thomas Dullien
CEO of optimyze.cloud Inc.
Thomas Dullien (Optimyze.cloud) exposed all the hidden places where you can recover your wasted CPU resources.
Avoiding Data Hotspots at Scale
Konstantin Osipov
Director of Software Engineering at ScyllaDB
Konstantine Osipov (ScyllaDB) addresses the tradeoffs between hash and range-based sharding.
Using eBPF to Measure the k8s Cluster Health
Henrik Rexed
Cloud Native Advocate at Dynatrace
Henrik Rexed (Dynatrace) explains how to use Prometheus + eBPF to understand the inner behavior of Kubernetes clusters and workloads…
Continuous Go Profiling & Observability
Felix Geisendörfer
Staff Engineer at Datadog
Felix Geisendörfer (Datadog) digs into the unique aspects of the Go runtime and interoperability with tools like Linux perf and…
Understanding Apache Kafka P99 Latency at Scale
Pere Urbón-Bayes
Senior Solutions Architect at Confluent
Pere Urbón-Bayes (Confluent) presents strategies for measuring, evaluating, and optimizing the performance of an Apache Kafka-based infrastructure.
Whoops! I Rewrote It in Rust
Brian Martin
Software Engineer at Twitter
Why and how Brian Pelikan rewrote Pelikan, Twitter’s open source and modular framework for in-memory caching, in Rust.
Let’s Fix Logging Once and for All
Peter Portante
Senior Principal Software Engineer at Red Hat
Peter Portante (Red Hat) presents a Linux kernel modification that gives the SRE and logging source owner greater control over…
















































































































































































