Langfuse Roadmap

Langfuse is open source and we want to be fully transparent what we’re working on and what’s next. This roadmap is a living document and we’ll update it as we make progress.

Your feedback is highly appreciated. Feel like something is missing? Add new ideas on GitHub or vote on existing ones. Both are a great way to contribute to Langfuse and help us understand what is important to you.

🚀 Released

10 most recent changelog items:

Prompt Experiments on Datasets with LLM-as-a-Judge Evaluations(Nov 22, 2024)
All new Datasets, Experimentation and Evaluation documentation(Nov 21, 2024)
Full multi-modal support, including audio, images, and attachments(Nov 20, 2024)
LLM-as-a-judge Evaluators for Dataset Experiments(Nov 19, 2024)
Dataset Run Comparison View(Nov 18, 2024)
llms.txt(Nov 17, 2024)
Prompt Management for Vercel AI SDK(Nov 17, 2024)
New Sidebar(Nov 1, 2024)
Event input and output masking(Oct 25, 2024)
Amazon Bedrock support for LLM Playground and Evaluations(Oct 11, 2024)

Subscribe to our mailing list to get occasional email updates about new features.

🚧 In progress

Langfuse v3.0: preparing Langfuse for the next level of scale using an OLAP database, a queue and another container. Parts of it are already available in Langfuse Cloud and once migration is complete, self-hosting will be upgraded as well. Learn more in this GitHub Discussion.
Export traces and sessions from Langfuse dashboard (CSV, JSON)
Improved tables across the Langfuse UI to display all relevant information and be more user-friendly.
Move to SDK references generated from docstrings to improve the developer experience (Intellisense) and reduce the risk of errors.
Improve cost tracking of multi-modal LLMs and more complex pricing models (e.g. Anthropic/Google Context Caching, Google Vertex pricing)
In-UI prompt and model evaluation/benchmarking based on Langfuse-managed custom evaluators.

🔮 Planned

Webhooks to subscribe to changes within your Langfuse project.
Datasets: make them usable in CI (e.g GitHub Actions).
Comments on prompt versions.
Improved datasets UI/UX.
Add non-LLM evaluators to online evaluation within Langfuse UI.
Revamped context-aware JS integration to to remove the need for nesting of tracing calls, similar to Python decorator.
Better support for multi-modal traces that use base64 encoded images.

⚠️ Upcoming breaking changes

Self-hosting: Langfuse v3.0 will add additional containers and database for improve scalability. Learn more in this GitHub Discussion.
OpenAI integration, dropping support of openai < 1.0.0 to greatly simplify the integration and improve the developer experience of everyone on openai >= 1. No timeline on this yet as many libraries still depend on the old version.

🙏 Feature requests and bug reports

The best way to support Langfuse is to share your feedback, report bugs, and upvote on ideas suggested by others.

Langfuse Roadmap

🚀 Released

🚧 In progress

🔮 Planned

⚠️ Upcoming breaking changes

🙏 Feature requests and bug reports

Feature requests

Bug reports

Was this page useful?

Questions? We're here to help

Subscribe to updates