Ho Ngoc Hai
90839cf542
feat(monitoring): add API latency Grafana dashboard and alerting rules
...
Create comprehensive Grafana dashboard for API latency monitoring with:
- p50/p95/p99 stat panels and time series for all endpoints
- Per-endpoint latency breakdown with route/method template variables
- Top 10 slowest endpoints table and bar chart (by p99)
- Request rate (by method) and error rate (4xx/5xx) panels
- Error rate percentage (5xx/total) with SLO threshold
- Latency heatmap and histogram distribution panels
Add Prometheus alerting rules:
- ApiLatencyP99High: p99 > 1s for 5m (warning)
- ApiEndpointLatencyP99High: per-endpoint p99 > 2s (warning)
- ApiLatencyP99Critical: p99 > 3s for 3m (critical/SLO breach)
- ApiErrorRate5xxHigh: 5xx rate > 1% for 5m (warning)
Fix api-overview.json using wrong metric name
(http_request_duration_seconds → goodgo_api_request_duration_seconds).
Co-Authored-By: Paperclip <noreply@paperclip.ing >
2026-04-10 23:18:09 +07:00
Ho Ngoc Hai
5114f5b87e
chore: update monitoring configs, CI workflow, and web build info
...
Update Grafana datasource and Prometheus configs for monitoring
integration. Improve E2E CI workflow with Prisma generate, browser
caching, and trace artifact collection.
Co-Authored-By: Paperclip <noreply@paperclip.ing >
2026-04-08 23:07:21 +07:00
Ho Ngoc Hai
d99dfbafbc
feat(monitoring): add Prometheus metrics endpoint and Grafana dashboards
...
Add observability stack with @willsoto/nestjs-prometheus for /metrics endpoint,
Prometheus scraping config, and 4 auto-provisioned Grafana dashboards
(API overview, database, search, business metrics).
Co-Authored-By: Paperclip <noreply@paperclip.ing >
2026-04-08 03:08:54 +07:00