GitHub - vladich/pg_jitter: Better JIT for Postgres

A lightweight JIT compilation provider for PostgreSQL that adds three alternative JIT backends - sljit, AsmJit and MIR - delivering faster compilation and competitive query execution across PostgreSQL 14–18.

Why?

JIT compilation was introduced in Postgres 11 in 2018. It solves a problem of Postgres having to interpret expressions and use inefficient per-row loops in run-time in order to do internal data conversions (so-called tuple deforming). On expression-heavy workloads or just wide tables, it can give a significant performance boost for those operations. However, standard LLVM-based JIT is notoriously slow at compilation. When it takes tens to hundreds of milliseconds, it may be suitable only for very heavy, OLAP-style queries, in some cases. For typical OLTP queries, LLVM's JIT overhead can easily exceed the execution time of the query itself. pg_jitter provides native code generation with microsecond-level compilation times instead of milliseconds, making JIT worthwhile for a much wider range of queries.

Performance

Typical compilation time:

sljit: tens to low hundreds of microseconds
AsmJIT: hundreds of microseconds
MIR: hundreds of microseconds to single milliseconds
LLVM (Postgres default): tens to hundreds of milliseconds (seconds to tens(!) of seconds in worst cases)

In reality, the effect of JIT compilation is broader - execution can slow down for up to ~1ms even for sljit, because of other related things, mostly cold processor cache and effects of increased memory pressure (rapid allocations / deallocations related to code generation and JIT compilation). Therefore, on systems executing a lot of queries per second, it's recommended to avoid JIT compilation for very fast queries such as point lookups or queries processing only a few records. By default, jit_above_cost parameter is set to a very high number (100'000). This makes sense for LLVM, but doesn't make sense for faster providers. It's recommended to set this parameter value to something from ~200 to low thousands for pg_jitter (depending on what specific backend you use and your specific workloads).

sljit is the most consistent: 5–25% faster than the interpreter across all workloads. This, and also its phenomenal compilation speed, make it the best choice for most scenarios.
AsmJIT excels on wide-row/deform-heavy queries (up to 32% faster) thanks to specialized tuple deforming
MIR provides solid gains while being the most portable backend
LLVM was supposed to be fast at execution time, due to clang optimization advantages, but in fact, in most cases, it's slower than all 3 pg_jitter backends, even not counting compilation performance differences. pg_jitter instead focuses on fast code generation, direct helper calls, and manual instruction-level optimization for hot expression patterns.

There are some operations that pg_jitter optimizes much better than typical expressions:

LIKE / ILIKE / regexp - by using PCRE2 (compilation) and StringZilla (SIMD), they are typically 2x-5x faster, but could be up to 25x faster for complex patterns. PCRE2 uses sljit compilation internally, and that adds extra optimization opportunities when sljit is also used as a JIT backend
CASE / IN (ANY) kind of expressions. These have special optimizations (pre-compiled binary search tree instead of linear search and hash probing). That approach speeds them up from 1.5x - 2x for IN up to 10x for CASE in extreme scenarios.

Benchmarks

There are several scripts in the /tests folder to run different types of benchmarks.

For the version 0.3.0, more benchmarks have been added:

TPC-C - typical OLTP queries (average speedup 8%)
TPC-H - typical OLAP queries (average speedup 6%)
ECOMMERCE - a synthetic OLTP benchmark for a hypothetical e-commerce app (average speedup 14%)
OLAP - a synthetic benchmark for a hypothetical analytics workload (average speedup 42%)
CRM - a synthetic benchmark for a hypothetical CRM analytics (average speedup ~400%) - it was created to showcase the strongest areas of the JIT backends - text filtering / CASE / IN.

TPC-C and TPC-H numbers are a bit low, because they are single-pass, while the other 3 are continuous. If you run those TPC-C / TPC-H queries continuously, like what most workloads do, you will get better numbers.

[ARM64 / 0.3.0] (bench/ARM64) ->

PG14 | PG15 | PG16 | PG17 | PG18
TPC-C * TPC-H * ECOMMERCE * OLAP * CRM

[x86_64 / 0.3.0] (bench/x86_64) TBD

Features

Simple-configuration - set jit_provider and go
Three independent backends with different strengths
Runtime backend switching via SET pg_jitter.backend = 'sljit' (no restart)
PostgreSQL 14–18 support from one codebase
Tiered function optimization - hot-path PG functions emitted inline or compiled as direct native calls
No LLVM dependency - pure C/C++ with small, embeddable libraries
Supported platforms - aside from AsmJit (which supports only ARM64/x86_64), other providers can be used on most platforms supported by Postgres. pg_jitter builds and passes basic regression testing on Linux/MacOS (ARM64) - all providers, Linux/FreeBSD/Windows (x86_64) - all providers, Linux (PPC64LE) - sljit only, Linux (s390x) - sljit only. It was not yet extensively user-tested outside of ARM64 and x86_64.

Stability

The current source code can be considered beta-quality. It passes all standard Postgres regression tests on all supported platforms and shows good improvements in performance tests. It also successfully passes ~7M test cases of SqlLogicTest. But it lacks large-scale production verification (yet). Stay tuned.

Quick Start

Prerequisites

PostgreSQL 14–18 (with development headers)
CMake >= 3.16
C11 and C++17 compilers
Backend libraries as sibling directories:

parent/
├── pg_jitter/
├── sljit/        
├── asmjit/       
└── mir/

SLJIT | AsmJit | MIR

For MIR and sljit, use the patched versions from MIR-patched and SLJIT-patched - they have a number of bug fixes, extra SIMD opcodes, and memory management improvements.

Build

# Build all backends
./build.sh

# Build a single backend
./build.sh sljit

# Custom PostgreSQL installation
./build.sh --pg-config /opt/pg17/bin/pg_config all

# Custom dependency paths
./build.sh all -DSLJIT_DIR=/path/to/sljit -DMIR_DIR=/path/to/mir

Install

# Install all backends and restart PostgreSQL
./install.sh

# Custom paths
./install.sh --pg-config /opt/pg17/bin/pg_config --pgdata /var/lib/postgresql/data all

Configure

-- Use a specific backend directly
ALTER SYSTEM SET jit_provider = 'pg_jitter_sljit';
SELECT pg_reload_conf();

-- Or use the meta provider for runtime switching (no restart needed)
ALTER SYSTEM SET jit_provider = 'pg_jitter';
SELECT pg_reload_conf();

SET pg_jitter.backend = 'asmjit';  -- switch on the fly

Configuration

All parameters are user-settable (SET in session, ALTER SYSTEM for persistent) and take effect without a restart unless noted.

Backend Selection

Parameter	Type	Default	Description
`pg_jitter.backend`	enum	`auto` (if 2+ backends installed, else the single available one)	Active JIT backend: `sljit`, `asmjit`, `mir`, or `auto`. The `auto` mode is experimental — it uses adaptive statistics to select the fastest backend per expression profile.

Parallel Execution

Parameter	Type	Default	Description
`pg_jitter.parallel_mode`	enum	`per_worker`	Controls JIT in parallel workers. `off` — workers use the PG interpreter. `per_worker` — each worker JIT-compiles independently. `shared` — leader compiles once, workers reuse code via DSM (saves compilation time, slightly higher per-row overhead). The `shared` mode is experimental
`pg_jitter.shared_code_max`	integer (KB)	`4096` (4 MB)	Maximum DSM segment size for shared JIT code. Range: 64 KB – 1 GB.

Expression Tuning

Parameter	Type	Default	Description
`pg_jitter.min_expr_steps`	integer	`4`	Minimum expression step count for JIT compilation. Expressions with fewer steps skip JIT and use the interpreter. Range: 0–1000.
`pg_jitter.deform_cache`	boolean	`on`	Cache compiled deform functions across queries within a backend process. When off, deform is recompiled each query.
`pg_jitter.in_bsearch_max`	integer	`4096`	Maximum IN-list elements for inline binary search tree. Larger integer IN-lists use `pg_jitter.in_hash`. 0 disables inline binary search. Range: 0–4096.
`pg_jitter.in_hash`	enum	`crc32` on x86_64, `pg` elsewhere	SLJIT strategy for integer IN-lists larger than `pg_jitter.in_bsearch_max`: `pg` (PostgreSQL's native hashed scalar-array path), `crc32` (hardware/software CRC32C open-addressing). Other backends use PostgreSQL's native path above the binary-search threshold.
`pg_jitter.in_text_hash`	boolean	`off`	Use pg_jitter's experimental text IN-list hash table. When off, text hashed scalar-array operations use PostgreSQL's native path.

Adaptive Backend Selection (experimental)

These parameters only take effect when pg_jitter.backend = 'auto'. Statistics are not collected when a specific backend is selected.

Parameter	Type	Default	Description
`pg_jitter.adaptive`	boolean	`on`	Enable adaptive backend selection based on runtime performance statistics.
`pg_jitter.adaptive_samples`	integer	`64`	Number of expression evaluations to time before considering a backend profiled. Range: 4–10000.
`pg_jitter.adaptive_epsilon`	real	`0.05`	Exploration probability (epsilon-greedy). 0.0 = always pick the best measured backend, 1.0 = always pick randomly. Range: 0.0–1.0.

PostgreSQL JIT Parameters

These are standard PostgreSQL parameters that control when JIT compilation is triggered:

Parameter	Default	Recommended for pg_jitter
`jit_above_cost`	`100000`	`200`–`2000` (pg_jitter compiles in microseconds, not milliseconds)
`jit_inline_above_cost`	`500000`	`500000` (keep default)
`jit_optimize_above_cost`	`500000`	`500000` (keep default)

Architecture

Expression Compilation

pg_jitter implements PostgreSQL's JitProviderCallbacks interface. When PostgreSQL decides to JIT-compile a query, it calls compile_expr() which:

Walks the ExprState->steps[] array (PostgreSQL's expression evaluation opcodes)
Emits native machine code for hot-path opcodes (arithmetic, comparisons, variable access, tuple deforming, aggregation, boolean logic, jumps)
Delegates remaining opcodes to pg_jitter_fallback_step() which calls the corresponding ExecEval* C functions
Installs the compiled function with a one-time validation wrapper that catches ALTER COLUMN TYPE invalidation

Function Optimization Tiers

Tier 0: Simple operations emitted directly into provider-generated code. This avoids function-call and FunctionCallInfo overhead entirely.
Tier 1: Direct calls to unwrapped native jit_* functions. This bypasses PostgreSQL's fmgr/fcinfo call path.
Tier 2: Pass-by-reference operations (numeric, interval, uuid, selected text paths) called through wrappers or native helpers. This path has no LLVM or PostgreSQL bitcode dependency.

Three JIT Backends

	sljit	AsmJIT	MIR
Language	C	C++	C
IR level	Low-level (register machine)	Low-level (native assembler)	Medium-level (typed ops)
Register allocation	Manual	Virtual (automatic)	Automatic
Architectures	arm64, x86_64, s390x, ppc, mips, riscv	arm64, x86_64	arm64, x86_64, s390x, ppc, mips, riscv
Compilation speed	Fastest (10s to low 100s of μs)	Fast (x3-x5) of sljit	Still fast (x15-x20 of sljit)
Best for	General workloads, lowest overhead	Wide rows, deform-heavy queries	Portability and edge cases
Library size	~200 KB	~900 KB	~900 KB

Meta Provider

The meta provider (jit_provider = 'pg_jitter') is a thin dispatcher that:

Exposes a pg_jitter.backend GUC (user-settable, no restart required)
Lazily loads backend shared libraries on first use
Caches loaded backends for process lifetime
Falls back to the next available backend if the selected one isn't installed

Each backend remains independently usable by setting jit_provider = 'pg_jitter_sljit' directly.

Memory Management

JIT-compiled code is tied to PostgreSQL's ResourceOwner system:

A PgJitterContext is created per query, extending PostgreSQL's JitContext
Each compiled function is registered on a linked list with a backend-specific free callback
When the query's ResourceOwner is released, all compiled code is freed:
- sljit: sljit_free_code() — releases mmap'd executable memory
- AsmJIT: JitRuntime::release() — frees the code buffer
- MIR: MIR_gen_finish() + MIR_finish() — tears down the entire MIR context

Version Compatibility

A single codebase supports PostgreSQL 14–18 via compile-time #if PG_VERSION_NUM guards in src/pg_jitter_compat.h. Key differences handled:

PG14–16: JIT-specific ResourceOwner API (ResourceOwnerEnlargeJIT/RememberJIT/ForgetJIT)
PG17+: Generic ResourceOwner API (ResourceOwnerDesc)
PG18: CompactAttribute, split EEOP_DONE, CompareType rename, new opcodes

Function Wrappers

The native precompiled blob pipeline was removed. Small leaf functions are handled by Tier 0 emitters or ordinary Tier 1 direct calls. Pass-by-reference operations that are not safe to reimplement use always-available C wrappers around PostgreSQL built-ins.

Testing

# Correctness: provider-specific bugs and direct-function surface
./tests/test_provider_regressions.sh --backend all
./tests/test_function_surface.sh --backend all

# Benchmarks
./tests/bench_all_backends.sh
./tests/gen_cross_version_benchmarks.py

# I-cache impact analysis
./tests/bench_cache_compare.sh

# Memory leak detection (10K queries with RSS trend)
./tests/test_leak_trend.sh [port] [backend]

# Multi-version build + test (PG14–18)
./tests/run_all_versions.sh

License

Apache License 2.0. See LICENSE.

Copyrights

All copyrights belong to their respective owners.

Main builds

Full Postgres regression tests (pg_regress) run on each commit to the master branch for each supported combination of main platforms and versions of Postgres (20 combinations total). For each branch commit, they run for a single version only (PG17). Here is the state of the last master run.