Langfuse is open source and we want to be fully transparent what we’re working on and what’s next. This roadmap is a living document and we’ll update it as we make progress.
🚀 Released
10 most recent changelog items:
- Langfuse CLI(Feb 17, 2026)
- Evaluate Individual Operations: Faster, More Precise LLM-as-a-Judge(Feb 13, 2026)
- Run Experiments on Versioned Datasets(Feb 11, 2026)
- Corrected Outputs for Traces and Observations(Jan 14, 2026)
- Filter Observations by Tool Calls and add Tool Calls to Dashboard Widgets(Dec 22, 2025)
- v2 Metrics and Observations API (Beta)(Dec 17, 2025)
- Dataset Item Versioning(Dec 15, 2025)
- OpenAI GPT-5.2 support(Dec 12, 2025)
- Batch Add Observations to Datasets(Dec 11, 2025)
Active Development
Agent Observability
- Improve Langfuse to dig into complex, long running agents more intuitively
Evals
- Introduce experiments as a first class citizen, remove the dependency on datasets to allow for more bespoke unit-tests
- Overhaul experiment (dataset run) comparison views to make it easier to work with experiment results
- Dataset management: bulk add traces to datasets
- Improve comments across the product to allow for more qualitative evaluation workflows and collaboration
Playground
- Experiment with prompts/models in playground based on logged traces and datasets with reference inputs
- Langfuse model input/output data schema to increase model interoperability for structured outputs and tool calls
- Make Playground stateful and collaborative
UI/UX
- Improve onboarding experience
- Improve core screens, especially for new and non-technical users
- Increase UI performance for extremely large traces and datasets
Infrastructure / Data Platform
- We strongly increase ingestion throughput, response times, and error rates across APIs by simplifying the core data model.
- Move to an observation-only and immutable data model as it better aligns with complex agents and allows us to scale our platform. Thereby, we remove traces as a first class citizen.
- Improvements across our tracing UI to make it easier to find relevant spans for complex agents.
- Webhooks for observability and evaluation events, useful for routing and alerting
🙏 Feature requests and bug reports
The best way to support Langfuse is to share your feedback, report bugs, and upvote on ideas suggested by others.