Monitoring & Observability

Monitor platform performance, track workflows, and set up alerting for production deployments.

What You’ll Find Here

  • Metrics - Performance metrics and monitoring dashboards

  • Logging - Log aggregation and analysis

  • Health Checks - Liveness and readiness probes

  • Alerting - Setting up alerts for failures

  • Tracing - Distributed tracing for complex workflows

Key Topics

Metrics & Dashboards

Monitor platform performance with:

  • Workflow execution time

  • Rows processed per second

  • Memory and CPU usage

  • Database connection pool metrics

  • Error rates

Logging

The platform provides structured logging:

  • Application logs

  • Workflow execution logs

  • Database query logs

  • Error stack traces

Health Checks

For Kubernetes and Docker deployments:

  • Liveness probes - Is the platform running?

  • Readiness probes - Is the platform ready to accept requests?

  • Startup probes - Has the platform started successfully?

Where to Start

  • Configure logging levels via environment variables

  • Set up Prometheus for metrics collection

  • Use Grafana for visualization dashboards

  • Configure health check endpoints