Open source LLM engineering platform. Now part of @clickhousedb. We're hiring: https://t.co/k6dgv4dws2langfuse.com github.com/langfuse/langfuseJoined July 2023
Now live: Langfuse x @OpenAI Codex integration.
Trace every prompt, tool call, shell command, and token across your Codex sessions. Also available for Claude Code.
langfuse.com/integrations/o…
had a lot of fun at the AI Engineer conference to go deep on:
(1) how we think about the role of skills
(2) how to develop/eval/improve them
(3) lessons from building our own set of skills
Skill issue: Lessons from skilling up coding agents
Getting agents to actually use Langfuse was a "skill issue" — literally. Marc Klingen from Clickhouse on teaching coding agents to use new tools, and why it's harder than you think.
youtube.com/watch?v=vNCY9k…
quarterly Langfuse Town Hall on June 11th
catch up on everything we've shipped: v4, the latest releases, and what's coming next on the roadmap. Q&A with the team at the end.
open to the whole community. register: luma.com/7dny2x72
day 5 of launch week: langfuse MCP.
supports: observations, metrics, scores, datasets, comments, annotation queues, models, media, and more.
claude or linear agents can pull a trace, drop a comment, or create dataset items without leaving the chat.
langfuse.com/launch
day 4 of langfuse launch week: code evaluators.
write a python or typescript `evaluate` function in the langfuse UI. attach it to live observations or an experiment. scores land natively next to your existing ones.
@wochinge demos below; langfuse.com/launch
day 3 of langfuse launch week: full-text search.
multi-GB scans drop from many seconds to sub-second on @ClickHouseDB's new text indexes. great work from @sum3rman.
available via UI and API.
more: langfuse.com/launch
day 2 of langfuse launch week 5: langfuse agent skill.
bringing an agent to production is hard.
using the skill you can ask your coding agent to instrument your app, calibrate a judge, or set up evaluators.
@marliessophie demos below; langfuse.com/launch
day 1 of langfuse launch week 5: a github action that runs your langfuse experiments on every PR.
fails the workflow when scores drop below your threshold. posts pass/fail to the PR. every run is tracked in langfuse.
langfuse.com/launch
@langfuse launch week 5 starts monday.
one release per day, mon to fri. agents, evals, and some long-requested features.
we'll be demoing all new features at @ClickHouseDB Open House in San Francisco same week. come say hi.
langfuse.com/launch
Want to see what Claude Code is actually doing? We made a video showing exactly how to observe it in real-time with Langfuse.
Claude Code in Action: Trace Tool Calls & Decisions with Langfuse youtu.be/fsoBHf_WNmQ?si…
This is a great article by @annabellschfr - a lot of teams still get stucks on vibes and don't make it to actually systematically experiment with models, prompts, . context, architectures. dig in!
24 Followers 237 FollowingBuilding agent attestation for regulated AI.
Previously CRED, Vance (YC W22), Partnr.
Mostly write about AI in regulated industries, occasionally other things.
8K Followers 879 FollowingHelping @java developers to build and deploy the best software in the industry. @Java_Champions. Published author. Board member @soujava.
11 Followers 49 FollowingIIT KGP '26 | Building reliable AI systems for business workflows | Systems thinking • Evals • Observability • Long-term reliability | Open to client projects
75K Followers 2K Following✨ AI should be about empowering humans, building understanding, and making dreams realities. 👩💻 DevX Eng. Lead @GoogleDeepMind ex-@GitHub || views = my own!
1K Followers 449 FollowingDistributed systems & database nerd. + gamedev and photography. #DevZen co-host. Mastodon: https://t.co/IxKcKv3MHn
SWE @langfuse. Opinions are my own
80K Followers 897 FollowingCreator of Flask. Building at https://t.co/uGuzfu0LKT. Bypassing Permissions. Can hand crank. Husband and father of 3 — “more nuanced in person”