Is Lilypad worth it for Python developers building LLM apps?

Yes, if you're already using Mirascope and need lightweight, open-source tracing. The @ops.trace decorator adds observability with minimal code changes. But if you need a managed UI or advanced experiment tracking, consider Langfuse instead.

Does Lilypad integrate with Datadog?

Yes, you can export Lilypad traces to Datadog via the OpenTelemetry Collector or directly using Datadog's OTLP endpoint. See the Ops Configuration guide for setup details.

How does Lilypad compare to Langfuse?

Lilypad is open-source and vendor-neutral, exporting traces to any OTel backend. Langfuse offers a hosted UI, prompt management, and experiment tracking out of the box. Lilypad is free (self-managed), while Langfuse has paid plans. Choose Lilypad for flexibility, Langfuse for convenience.

Yes, Lilypad is completely free and open-source under the MIT license. There are no subscription fees, but you must cover the cost of running your own OpenTelemetry backend (e.g., Jaeger, Grafana Tempo).

What are Lilypad's biggest limitations?

Lilypad is Python-only and tightly coupled to the Mirascope framework. It has no hosted UI or built-in experiment tracking. You must manage your own OTel backend, which requires DevOps effort for deployment, retention, and scaling.

Can Lilypad replace LangSmith?

Not fully. Lilypad provides distributed tracing and versioning, but lacks LangSmith's integrated evaluation, dataset management, and playground. If you only need tracing, Lilypad can replace it; for full experiment tracking, keep LangSmith.

How long does Lilypad take to set up?

If you already have an OTel backend, you can add tracing to a Mirascope function in under 10 minutes. Setting up a backend from scratch (e.g., Jaeger via Docker) adds 1–2 hours.

How do I migrate from Langfuse to Lilypad?

Reconfigure your OpenTelemetry exporter in Lilypad to point to your own OTel backend (e.g., Jaeger). Replace Langfuse SDK calls with Lilypad's @ops.trace and ops.session. Export existing traces from Langfuse via its export feature and re-import if needed.

Is Lilypad good for tracing LLM function execution?

Yes, it's specifically designed for that. The @ops.trace decorator automatically captures inputs, outputs, and timing of LLM calls. You can view the traces in any OTel-compatible backend.

Developer Infrastructure

Lilypad

Open-source OpenTelemetry LLM observability for Python devs

69/100MonitorFreeFree

A solid choice for Python devs already running an OTel backend who want lightweight, open-source LLM tracing. Skip it if you need a managed UI or advanced experiment tracking—Langfuse is a better fit.

Best for

Python developers building LLM apps needing OTel-based observability
Teams already using an OTel backend who want to add LLM-specific tracing
Projects requiring vendor-neutral, open-source instrumentation without lock-in
Users of Mirascope framework wanting integrated tracing and versioning

Not ideal for

Teams wanting a fully managed, hosted observability platform with a UI
Non-Python stacks (Python-only library)
Projects needing advanced prompt versioning or experiment tracking

Visit Website

IntermediateFor a Mirascope user: add the decorator and configure a tracer provider in under 10 minutes. For new adopters: budget 1–2 hours to set up an OTel backend (e.g., Jaeger via Docker) and integrate Lilypad into your Python project.API · CLIAPI available6.0k viewsVerified 11d ago

Pricing

Free

FreeFree tier

Learning curve

Intermediate

For a Mirascope user: add the decorator and configure a tracer provider in under 10 minutes. For new adopters: budget 1–2 hours to set up an OTel backend (e.g., Jaeger via Docker) and integrate Lilypad into your Python project.

Runs on

APICLI

API available · 10 integrations

Who it's for

Python developer using MirascopeML engineer setting up LLM observabilityOpen-source contributor iterating on prompts

Live sentiment

Is Lilypad actually worth it?

We scan live Reddit threads, YouTube comments, X posts, G2 reviews and other communities — and hand you an honest verdict in under a minute.

Honest verdict, not marketing
Real pros & cons from real users
Attributed quotes with receipts

Run a free scan

3 free scans · no card needed

Skip it if

Skip Lilypad if you prefer a fully managed observability platform with a UI and don't want to operate your own OpenTelemetry backend.

The 30-second take

Price reality

Lilypad is free and open-source (MIT), making it cost-effective for any team already running an OpenTelemetry backend. Unlike managed solutions like Langfuse or LangSmith, there are no per-seat or per-event fees, but you must cover your own infrastructure costs for the backend.

In short

Lilypad — Open-source OpenTelemetry LLM observability for Python devs. Best for Python developers building LLM apps needing OTel-based observability, Teams already using an OTel backend who want to add LLM-specific tracing, Projects requiring vendor-neutral, open-source instrumentation without lock-in. Free to use.

Viability Score

69/100

Monitor

How likely is Lilypad to still be operational in 12 months? Based on 4 signals — momentum (how recently it shipped), wrapper dependency, revenue model, and web presence.

momentum

funding runway

website health

wrapper dependency

100

Last calculated: July 2026

How we score →

Key Features

OpenTelemetry-based distributed tracing
@ops.trace decorator for automatic function tracing
@ops.version decorator for function versioning
ops.session() context manager to group traces
ops.span() for explicit span creation
Automatic LLM call instrumentation
Context propagation across services
OTLP protocol support
Console exporter for development debugging
Configurable tracer provider
Batch span processing
Custom span attributes and events
Python-first API with type hints
Mirascope ecosystem integration
Open-source (MIT license)

About Lilypad

FreeIntermediateAPI availableAPI · CLI

Lilypad (part of Mirascope) is an open-source observability module for LLM applications built on OpenTelemetry. It provides distributed tracing, function versioning, and session management for Python developers who need production-grade LLM observability without vendor lock-in. Core features include the @ops.trace decorator for automatic function tracing, ops.session() to group related traces, @ops.version to track prompt versions, and auto-instrumentation of LLM calls. Traces can be exported to any OTel-compatible backend such as Langfuse, Jaeger, Zipkin, Grafana Tempo, or Datadog. Unlike managed solutions like Langfuse, Lilypad gives you vendor-neutral, open-source instrumentation but requires you to bring your own OTel backend—no hosted UI is provided. It integrates tightly with Mirascope and is designed for Python developers comfortable managing their own observability infrastructure.

Behind the Verdict

Lilypad does one thing well: it gives Python developers OpenTelemetry-native LLM tracing without forcing a proprietary backend. The @ops.trace decorator and ops.session() context manager feel nearly invisible in existing Mirascope codebases. We'd reach for this when we're already invested in the Mirascope ecosystem and have an OTel pipeline running. Where it bites: there's no hosted UI, no advanced prompt versioning, and Python-only support. If you're not using Mirascope or don't want to manage an OTel backend, Langfuse or Datadog LLM Observability offer more out of the box. For teams that value open-source and vendor neutrality, Lilypad delivers exactly what it promises—no more, no less.

Researching Lilypad? Get your full AI stack in 60 seconds.

Free, no signup — tell us your goal and get tools matched to your budget & existing stack.

Real-world workflow fit

Concrete scenarios for the personas Lilypad actually fits — and what changes day-one when you adopt it.

Python developer using Mirascope

You have a Mirascope-based LLM app in production and need to trace individual LLM calls to debug unexpected responses.

Outcome: Add the @ops.trace decorator to your function; within minutes, traces appear in your OTel console or backend for analysis.

ML engineer setting up LLM observability

Your team wants to monitor LLM latency and error rates across services without committing to a proprietary vendor.

Outcome: Configure Lilypad with your existing Jaeger or Grafana Tempo backend; auto-instrument all LLM calls and propagate context across microservices.

Open-source contributor iterating on prompts

You're versioning prompts in a collaborative project and need to track which version produced which output.

Outcome: Use @ops.version to tag each prompt iteration; group related traces with ops.session() to compare performance across versions.

Use Cases

Trace LLM function execution to debug and optimize responses.
Version control and iterate on LLM prompts collaboratively.
Group related LLM interactions into sessions for analysis.
Automatically instrument LLM calls for observability.
Propagate tracing context across distributed services.

Limitations

Lilypad is an open-source OpenTelemetry-based observability tool designed Python-first for LLM applications.
It requires integration with the Mirascope ecosystem and a compatible OTel backend (e.g., Jaeger or Grafana Tempo), which you must deploy and manage yourself.
The tool focuses on tracing and versioning without providing a hosted UI, prompt management, or evaluation capabilities.

as of 2026-06-30

Where the pricing makes sense

The company stage and team size where Lilypad's pricing actually pencils out — and where peers do it cheaper.

Setup time & first value

How long it actually takes to get something useful out of Lilypad — broken out by persona, not the marketing-page minute.

Switching to or from Lilypad

How to bring data in from common predecessors and how to get it back out — written for the switcher, not the buyer.

Migrating in

→From custom logging: replace print/logging statements with @ops.trace and ops.session for structured traces.
→From Langfuse managed: if you want open-source, export your existing OTel traces to your own backend and switch to Lilypad's decorators.

Migrating out

↗To Langfuse: if you need a managed UI, export your Lilypad traces via OTLP to Langfuse's backend.
↗To LangSmith: if you need experiment tracking, migrate by replacing @ops.trace with LangSmith's run instrumentation.

Integrations

LangfuseJaegerZipkinGrafana TempoDatadogOpenTelemetry CollectorPrometheusGoogle Cloud TraceAzure MonitorAWS X-Ray

Resources & Guides

Official links

Official Website Changelog

Tools that pair well with Lilypad

Common stack mates teams adopt alongside Lilypad, with the specific reason each pairing earns its keep.

Arize Phoenix

Open-source AI observability for LLM agent tracing and evaluation.

Dash0

OpenTelemetry-native observability with autonomous AI agents

Phoenix

Open-source observability and evaluation for AI agents

Alternatives to Lilypad

View all

Frequently Asked Questions

Topics

API Data Analysis Open Source

Used Lilypad? Help shape our editorial sentiment research.