Quixotic AI — AI sovereignty for the JVM

AI sovereignty
for the JVM

The JVM powers global finance, big data, and mission-critical infrastructure. It deserves an AI stack built to the same standard.

Experimental · Evolving

Multi-backend tensor engine. Panama, C, CUDA, HIP, Metal, OpenCL, and Mojo. Write once, accelerate everywhere.

Pure Java read/write for llama.cpp's GGUF model format. No native bindings required.

Pure Java read/write for HuggingFace's Safetensors format. Memory-mapped for large models.

Pure Java, efficient, TikToken-compatible + customizable BPE tokenizers for popular LLM models.

Inference runs on any JVM out of the box. Backends for CUDA, Metal, and others give you hardware acceleration when you need it.

Run large language models locally with quantization and efficient memory management.

A single Tensor API across Panama, C, CUDA, HIP, Metal, OpenCL, and Mojo. Switch backends with one line.

Fast vector operations for RAG pipelines and semantic search.

First-class support for Native Image. Small footprint, fast startup.

No Python interop, no ONNX bridges. An AI stack built from first principles for the JVM.

Connect

Quixotic AI is building in the open. Give it a spin, share feedback, contribute.