Skip to content

LlamaIndex Observability & Monitoring (Python)

LlamaIndex Observability & Monitoring (Python)

Section titled “LlamaIndex Observability & Monitoring (Python)”

Latest: 0.14.20 | Updated: April 2026 Upstream: https://github.com/run-llama/llama_index/releases | https://pypi.org/project/llama-index/

  • Wrap query engine invocations with OTEL spans; include index name, retriever type, tokens
  • Histogram for latency; counters for calls, failures; gauge for cache hit rate
  • Structured logs with request ID and source nodes used