Monitoring & Observability
Monitor platform performance, track workflows, and set up alerting for production deployments.
What You’ll Find Here
-
Metrics - Performance metrics and monitoring dashboards
-
Logging - Log aggregation and analysis
-
Health Checks - Liveness and readiness probes
-
Alerting - Setting up alerts for failures
-
Tracing - Distributed tracing for complex workflows
Key Topics
Metrics & Dashboards
Monitor platform performance with:
-
Workflow execution time
-
Rows processed per second
-
Memory and CPU usage
-
Database connection pool metrics
-
Error rates
Where to Start
-
Configure logging levels via environment variables
-
Set up Prometheus for metrics collection
-
Use Grafana for visualization dashboards
-
Configure health check endpoints