feat: add Prometheus metrics for backup recovery window by ermakov-oleg · Pull Request #69 · operasoftware/cnpg-plugin-pgbackrest

ermakov-oleg · 2026-02-13T23:18:23Z

Summary

Port of upstream #459, #467

Problem: No observability into backup health — operators had no way to alert on stale backups or monitor recovery point objectives (RPO) without manually querying pgBackRest.

Fix: Implements the cnpg-i Metrics service, exposing two Prometheus gauges:

cnpg_pgbackrest_first_recoverability_point — unix timestamp of the earliest restore point (first successful backup stop time)
cnpg_pgbackrest_last_available_backup_timestamp — unix timestamp of the most recent completed backup (latest backup stop time)

These allow standard Prometheus alerts like "no backup in last 24h" or "RPO exceeds 1h".

Implementation:

New MetricsServiceImplementation in internal/cnpgi/instance/metrics.go
Registers TYPE_METRICS capability in plugin identity
Collect() calls pgbackrest info to get the backup catalog, then delegates to getRecoveryWindow() which uses catalog.FirstRecoverabilityPoint() and catalog.GetLastSuccessfulBackupTime() — these methods filter out errored backups (Start=0 or Stop=0) and use Time.Stop for recoverability
Returns 0 for both metrics if no backups exist or credentials fail (graceful degradation)

Unit tests in metrics_test.go cover: nil/empty catalog, single backup, multiple backups, errored backups filtering, all-errored catalog.

Related issues

Signed-off-by: ermakov-oleg <ermakovolegs@gmail.com>

ermakov-oleg · 2026-02-20T10:40:34Z

Hi @Agalin, just following up on this PR - would you have a chance to review it when you have time? The changes from all my PRs have been running in our production for a while now without issues, but I’m happy to adjust anything if needed.

feat: add Prometheus metrics for backup/recovery window

331f244

Signed-off-by: ermakov-oleg <ermakovolegs@gmail.com>

ermakov-oleg force-pushed the feat/prometheus-metrics branch from 1250c04 to 331f244 Compare February 16, 2026 15:30

ermakov-oleg mentioned this pull request Feb 17, 2026

feat: port 9 upstream improvements from open PRs ermakov-oleg/cnpg-plugin-pgbackrest#13

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Comments

feat: add Prometheus metrics for backup recovery window#69

feat: add Prometheus metrics for backup recovery window#69
ermakov-oleg wants to merge 1 commit intooperasoftware:mainfrom
ermakov-oleg:feat/prometheus-metrics

ermakov-oleg commented Feb 13, 2026

Uh oh!

ermakov-oleg commented Feb 20, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Comments

Conversation

ermakov-oleg commented Feb 13, 2026

Summary

Related issues

Uh oh!

ermakov-oleg commented Feb 20, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant