Monitoring & Observability

Monitor platform performance, track workflows, and set up alerting for production deployments.

What You’ll Find Here

Metrics - Performance metrics and monitoring dashboards
Logging - Log aggregation and analysis
Health Checks - Liveness and readiness probes
Alerting - Setting up alerts for failures
Tracing - Distributed tracing for complex workflows

Key Topics

Metrics & Dashboards

Monitor platform performance with:

Workflow execution time
Rows processed per second
Memory and CPU usage
Database connection pool metrics
Error rates

Logging

The platform provides structured logging:

Application logs
Workflow execution logs
Database query logs
Error stack traces

Health Checks

For Kubernetes and Docker deployments:

Liveness probes - Is the platform running?
Readiness probes - Is the platform ready to accept requests?
Startup probes - Has the platform started successfully?

Where to Start

Configure logging levels via environment variables
Set up Prometheus for metrics collection
Use Grafana for visualization dashboards
Configure health check endpoints

Related Topics