Category

AI Observability & MLOps

9 tools in this category. Ranked by community rating + honest fit.

Braintrust

4.5
AI Observability & MLOps | ✓ Editorial

Enterprise LLM eval platform — logging, evals, and prompt iteration with strong offline scoring.

small, medium | starter

Weights & Biases

AI Observability & MLOps

The MLOps platform for tracking, visualizing, and optimizing ML experiments and model training.

small, medium | free

Langfuse

AI Observability & MLOps | ✓ Editorial

Open-source LLM engineering platform — trace, evaluate, and debug your AI application in production.

small, medium | free

Helicone

AI Observability & MLOps

LLM observability proxy — one line of code to monitor costs, latency, and quality across all AI calls.

small, medium | free

Arize AI

AI Observability & MLOps

ML and LLM observability — model monitoring, drift detection, and agent tracing at enterprise scale.

medium, large | professional

LangSmith

AI Observability & MLOps

The observability platform from LangChain — tracing, eval, and prompt management for LLM apps.

small, medium | starter

Humanloop

AI Observability & MLOps

Prompt management and eval platform for enterprise LLM applications — collaboration between engineers and subject-matter experts.

medium, large | professional

PromptLayer

AI Observability & MLOps

Prompt registry and observability — manage, version, and monitor prompts across LLM providers.

small, medium | starter

AgentOps

AI Observability & MLOps

Observability and monitoring for AI agents — trace runs, measure costs, and debug multi-agent systems.

small, medium | free