Metrics and Monitoring

Monitor platform performance and health using built-in metrics and observability tools.

Overview

The platform exposes metrics for:

  • Workflow execution performance

  • Resource utilization

  • Error rates

  • Agent health

Prometheus Integration

The platform exposes Prometheus-compatible metrics at:

http://localhost:8080/actuator/prometheus
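
To check the endpoint without a Prometheus server, the response can be read directly. The following is a minimal sketch that assumes the endpoint returns the standard Prometheus text exposition format and needs no authentication. The helper names parse_metrics and fetch_metrics are illustrative and are not part of the platform.

```python
import urllib.request


def parse_metrics(text):
    """Parse Prometheus text exposition format into {series: value}.

    The key is the metric name plus its label set exactly as written,
    for example 'jvm_memory_used_bytes{area="heap"}'.
    """
    samples = {}
    for line in text.splitlines():
        line = line.strip()
        if not line or line.startswith("#"):
            continue  # skip blank lines and HELP/TYPE comments
        if "}" in line:
            # Labels may contain spaces, so split after the closing brace.
            end = line.rfind("}") + 1
            key, rest = line[:end], line[end:].split()
        else:
            parts = line.split()
            key, rest = parts[0], parts[1:]
        if rest:
            # rest[0] is the value; an optional timestamp may follow it.
            samples[key] = float(rest[0])
    return samples


def fetch_metrics(url="http://localhost:8080/actuator/prometheus", timeout=5.0):
    """Fetch and parse the metrics page (assumes no authentication)."""
    with urllib.request.urlopen(url, timeout=timeout) as resp:
        return parse_metrics(resp.read().decode("utf-8"))
```

Calling fetch_metrics() against a running instance returns a dictionary that can be searched for the tdk_ metrics listed under Key Metrics.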

Key Metrics

Performance Metrics

  • tdk_workflow_duration_seconds: Workflow execution time, in seconds

  • tdk_rows_processed_total: Total number of rows processed

  • tdk_throughput_rows_per_second: Processing throughput, in rows per second
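
An average throughput over any interval can also be derived from two readings of tdk_rows_processed_total. The sketch below assumes that metric is a monotonically increasing counter, as the _total suffix suggests by Prometheus naming convention. The function name rows_per_second is illustrative, and how the platform itself computes tdk_throughput_rows_per_second is not documented here.

```python
def rows_per_second(prev_total, prev_time, curr_total, curr_time):
    """Average rows/second between two readings of a cumulative counter.

    Times are in seconds. If the counter went backwards, the process is
    assumed to have restarted and the counter to have started again at 0.
    """
    elapsed = curr_time - prev_time
    if elapsed <= 0:
        raise ValueError("readings must be taken at increasing times")
    delta = curr_total - prev_total
    if delta < 0:
        # Counter reset: only the rows counted since the restart are known.
        delta = curr_total
    return delta / elapsed
```

For example, readings of 1000 and 4000 rows taken 60 seconds apart give an average of 50 rows per second.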

System Metrics

  • jvm_memory_used_bytes: JVM memory in use, in bytes

  • jvm_threads_live: Live JVM threads

  • system_cpu_usage: CPU utilization

Agent Metrics

  • tdk_agents_connected: Number of connected agents

  • tdk_agent_tasks_active: Active tasks per agent
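
The agent metrics lend themselves to a simple health check. The sketch below works on a dictionary of parsed samples keyed by series name. The thresholds expected_agents and max_tasks_per_agent are hypothetical values you would choose for your deployment, and the code assumes only that per-agent series names start with tdk_agent_tasks_active; the label that identifies each agent is not documented here.

```python
def check_agents(samples, expected_agents, max_tasks_per_agent):
    """Return a list of human-readable problems; empty means healthy.

    samples maps series name (with labels, if any) to its numeric value.
    """
    problems = []
    connected = samples.get("tdk_agents_connected")
    if connected is None:
        problems.append("tdk_agents_connected is missing from the scrape")
    elif connected < expected_agents:
        problems.append(f"only {connected:g} of {expected_agents} agents connected")
    # Flag every per-agent series whose active task count is over the limit.
    for key, value in sorted(samples.items()):
        if key.startswith("tdk_agent_tasks_active") and value > max_tasks_per_agent:
            problems.append(f"{key} = {value:g} exceeds {max_tasks_per_agent}")
    return problems
```

A non-empty result can be logged or forwarded to whatever alerting channel is already in use.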

Grafana Dashboards

Import pre-built Grafana dashboards for monitoring.
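
Dashboards can be imported through the Grafana UI or, for scripted setups, through Grafana's HTTP API. The sketch below shows the API route under several assumptions: the dashboard JSON has already been loaded into a dictionary, a Grafana API token is available, and the Grafana URL shown in the usage note is a placeholder. Where the pre-built dashboard files are located is not specified here, and the function name build_import_request is illustrative.

```python
import json
import urllib.request


def build_import_request(grafana_url, api_token, dashboard, overwrite=True):
    """Build a POST request for Grafana's /api/dashboards/db endpoint.

    dashboard is the dashboard JSON model as a dict. Its id is cleared
    because ids are specific to the Grafana instance that exported it.
    """
    model = dict(dashboard, id=None)
    body = json.dumps({"dashboard": model, "overwrite": overwrite}).encode("utf-8")
    return urllib.request.Request(
        grafana_url.rstrip("/") + "/api/dashboards/db",
        data=body,
        headers={
            "Content-Type": "application/json",
            "Authorization": f"Bearer {api_token}",
        },
        method="POST",
    )
```

Passing the returned request to urllib.request.urlopen sends it, for example with a grafana_url of http://localhost:3000 on a local Grafana installation.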