warning
8.4.1.1. GitLab Puma high queued connections
GitLab Puma has {{ $value }} queued connections on {{ $labels.instance }}. Requests are waiting for an available worker thread.
# Queued connections indicate Puma workers are saturated.
# Consider increasing puma['worker_processes'] or puma['max_threads'] in gitlab.rb.
- alert: GitLabPumaHighQueuedConnections
expr: puma_queued_connections > 5
for: 5m
labels:
severity: warning
annotations:
summary: GitLab Puma high queued connections (instance {{ $labels.instance }})
description: "GitLab Puma has {{ $value }} queued connections on {{ $labels.instance }}. Requests are waiting for an available worker thread.\n VALUE = {{ $value }}\n LABELS = {{ $labels }}" critical
8.4.1.2. GitLab Puma no available pool capacity
GitLab Puma pool capacity on {{ $labels.instance }} has been at 0 for 5 minutes. All threads are busy.
- alert: GitLabPumaNoAvailablePoolCapacity
expr: puma_pool_capacity == 0
for: 5m
labels:
severity: critical
annotations:
summary: GitLab Puma no available pool capacity (instance {{ $labels.instance }})
description: "GitLab Puma pool capacity on {{ $labels.instance }} has been at 0 for 5 minutes. All threads are busy.\n VALUE = {{ $value }}\n LABELS = {{ $labels }}" warning
8.4.1.3. GitLab Puma workers not running
GitLab Puma on {{ $labels.instance }} has {{ $value }} running workers out of expected total.
- alert: GitLabPumaWorkersNotRunning
expr: puma_running_workers < puma_workers
for: 5m
labels:
severity: warning
annotations:
summary: GitLab Puma workers not running (instance {{ $labels.instance }})
description: "GitLab Puma on {{ $labels.instance }} has {{ $value }} running workers out of expected total.\n VALUE = {{ $value }}\n LABELS = {{ $labels }}" critical
8.4.1.4. GitLab high HTTP error rate
GitLab is returning more than 5% HTTP 5xx errors on {{ $labels.instance }}.
# Threshold is 5% of all requests returning server errors.
# Check GitLab logs at /var/log/gitlab/ for root cause.
- alert: GitLabHighHTTPErrorRate
expr: sum(rate(http_requests_total{status=~"5.."}[5m])) / sum(rate(http_requests_total[5m])) * 100 > 5 and sum(rate(http_requests_total[5m])) > 0
for: 5m
labels:
severity: critical
annotations:
summary: GitLab high HTTP error rate (instance {{ $labels.instance }})
description: "GitLab is returning more than 5% HTTP 5xx errors on {{ $labels.instance }}.\n VALUE = {{ $value }}\n LABELS = {{ $labels }}" warning
8.4.1.5. GitLab high HTTP request latency
GitLab p95 HTTP request latency on {{ $labels.instance }} is above 10 seconds.
# Threshold of 10s may need adjustment based on your instance size and workload.
- alert: GitLabHighHTTPRequestLatency
expr: histogram_quantile(0.95, sum(rate(http_request_duration_seconds_bucket[5m])) by (le)) > 10
for: 5m
labels:
severity: warning
annotations:
summary: GitLab high HTTP request latency (instance {{ $labels.instance }})
description: "GitLab p95 HTTP request latency on {{ $labels.instance }} is above 10 seconds.\n VALUE = {{ $value }}\n LABELS = {{ $labels }}" warning
8.4.1.6. GitLab Sidekiq jobs failing
GitLab Sidekiq jobs are failing at a rate of {{ $value }} per second on {{ $labels.instance }}.
# This metric requires the emit_sidekiq_histogram_metrics feature flag to be enabled.
# A sustained failure rate indicates background processing issues.
- alert: GitLabSidekiqJobsFailing
expr: rate(sidekiq_jobs_failed_total[5m]) > 0.1
for: 10m
labels:
severity: warning
annotations:
summary: GitLab Sidekiq jobs failing (instance {{ $labels.instance }})
description: "GitLab Sidekiq jobs are failing at a rate of {{ $value }} per second on {{ $labels.instance }}.\n VALUE = {{ $value }}\n LABELS = {{ $labels }}" warning
8.4.1.7. GitLab Sidekiq queue too large
GitLab Sidekiq has {{ $value }} running jobs, approaching concurrency limit on {{ $labels.instance }}.
# When running jobs approach the concurrency limit, new jobs will queue up.
# Consider scaling Sidekiq workers or increasing concurrency.
- alert: GitLabSidekiqQueueTooLarge
expr: sum(sidekiq_running_jobs) >= sum(sidekiq_concurrency) * 0.9
for: 10m
labels:
severity: warning
annotations:
summary: GitLab Sidekiq queue too large (instance {{ $labels.instance }})
description: "GitLab Sidekiq has {{ $value }} running jobs, approaching concurrency limit on {{ $labels.instance }}.\n VALUE = {{ $value }}\n LABELS = {{ $labels }}" warning
8.4.1.8. GitLab Sidekiq high job completion time
GitLab Sidekiq job p95 completion time on {{ $labels.instance }} is above 5 minutes ({{ $value | humanizeDuration }}).
# This metric requires the emit_sidekiq_histogram_metrics feature flag to be enabled.
- alert: GitLabSidekiqHighJobCompletionTime
expr: histogram_quantile(0.95, sum(rate(sidekiq_jobs_completion_seconds_bucket[5m])) by (le, worker)) > 300
for: 10m
labels:
severity: warning
annotations:
summary: GitLab Sidekiq high job completion time (instance {{ $labels.instance }})
description: "GitLab Sidekiq job p95 completion time on {{ $labels.instance }} is above 5 minutes ({{ $value | humanizeDuration }}).\n VALUE = {{ $value }}\n LABELS = {{ $labels }}" warning
8.4.1.9. GitLab Sidekiq high queue latency
GitLab Sidekiq jobs on {{ $labels.instance }} are waiting more than 60 seconds before being processed.
# This metric requires the emit_sidekiq_histogram_metrics feature flag to be enabled.
# High queue latency means jobs are stuck waiting. Check Sidekiq concurrency and queue sizes.
- alert: GitLabSidekiqHighQueueLatency
expr: histogram_quantile(0.95, sum(rate(sidekiq_jobs_queue_duration_seconds_bucket[5m])) by (le)) > 60
for: 5m
labels:
severity: warning
annotations:
summary: GitLab Sidekiq high queue latency (instance {{ $labels.instance }})
description: "GitLab Sidekiq jobs on {{ $labels.instance }} are waiting more than 60 seconds before being processed.\n VALUE = {{ $value }}\n LABELS = {{ $labels }}" warning
8.4.1.10. GitLab database connection pool saturation
GitLab database connection pool on {{ $labels.instance }} ({{ $labels.class }}) is {{ $value }}% busy.
# When the pool is near saturation, requests may block waiting for a connection.
# Increase db_pool_size in gitlab.rb or investigate slow queries.
- alert: GitLabDatabaseConnectionPoolSaturation
expr: gitlab_database_connection_pool_busy / gitlab_database_connection_pool_size * 100 > 90 and gitlab_database_connection_pool_size > 0
for: 5m
labels:
severity: warning
annotations:
summary: GitLab database connection pool saturation (instance {{ $labels.instance }})
description: "GitLab database connection pool on {{ $labels.instance }} ({{ $labels.class }}) is {{ $value }}% busy.\n VALUE = {{ $value }}\n LABELS = {{ $labels }}" warning
8.4.1.11. GitLab database connection pool dead connections
GitLab database connection pool on {{ $labels.instance }} ({{ $labels.class }}) has {{ $value }} dead connections.
- alert: GitLabDatabaseConnectionPoolDeadConnections
expr: gitlab_database_connection_pool_dead > 0
for: 5m
labels:
severity: warning
annotations:
summary: GitLab database connection pool dead connections (instance {{ $labels.instance }})
description: "GitLab database connection pool on {{ $labels.instance }} ({{ $labels.class }}) has {{ $value }} dead connections.\n VALUE = {{ $value }}\n LABELS = {{ $labels }}" warning
8.4.1.12. GitLab database connection pool waiting
GitLab on {{ $labels.instance }} has {{ $value }} threads waiting for a database connection.
- alert: GitLabDatabaseConnectionPoolWaiting
expr: gitlab_database_connection_pool_waiting > 0
for: 5m
labels:
severity: warning
annotations:
summary: GitLab database connection pool waiting (instance {{ $labels.instance }})
description: "GitLab on {{ $labels.instance }} has {{ $value }} threads waiting for a database connection.\n VALUE = {{ $value }}\n LABELS = {{ $labels }}" warning
8.4.1.13. GitLab CI pipeline creation slow
GitLab CI pipeline creation p95 latency on {{ $labels.instance }} is above 30 seconds.
- alert: GitLabCIPipelineCreationSlow
expr: histogram_quantile(0.95, sum(rate(gitlab_ci_pipeline_creation_duration_seconds_bucket[5m])) by (le)) > 30
for: 5m
labels:
severity: warning
annotations:
summary: GitLab CI pipeline creation slow (instance {{ $labels.instance }})
description: "GitLab CI pipeline creation p95 latency on {{ $labels.instance }} is above 30 seconds.\n VALUE = {{ $value }}\n LABELS = {{ $labels }}" warning
8.4.1.14. GitLab CI pipeline failures increasing
GitLab CI pipeline failures are increasing on {{ $labels.instance }} ({{ $value }}/s).
# This metric may not exist in all GitLab versions. Verify against your GitLab installation.
- alert: GitLabCIPipelineFailuresIncreasing
expr: deriv(gitlab_ci_pipeline_failure_reasons[5m]) > 0.05
for: 10m
labels:
severity: warning
annotations:
summary: GitLab CI pipeline failures increasing (instance {{ $labels.instance }})
description: "GitLab CI pipeline failures are increasing on {{ $labels.instance }} ({{ $value }}/s).\n VALUE = {{ $value }}\n LABELS = {{ $labels }}" warning
8.4.1.15. GitLab CI runner authentication failures
GitLab CI runners are experiencing authentication failures on {{ $labels.instance }} ({{ $value }} failures).
# Frequent runner auth failures may indicate expired tokens or misconfigured runners.
- alert: GitLabCIRunnerAuthenticationFailures
expr: increase(gitlab_ci_runner_authentication_failure_total[5m]) > 5
for: 5m
labels:
severity: warning
annotations:
summary: GitLab CI runner authentication failures (instance {{ $labels.instance }})
description: "GitLab CI runners are experiencing authentication failures on {{ $labels.instance }} ({{ $value }} failures).\n VALUE = {{ $value }}\n LABELS = {{ $labels }}" warning
8.4.1.16. GitLab high memory usage
GitLab process on {{ $labels.instance }} is using {{ $value | humanize1024 }}B of RSS memory.
# Threshold of 2GB may need adjustment based on your instance size.
# High memory usage can lead to OOM kills and service disruptions.
- alert: GitLabHighMemoryUsage
expr: process_resident_memory_bytes{job=~".*gitlab.*"} > 2e+9
for: 10m
labels:
severity: warning
annotations:
summary: GitLab high memory usage (instance {{ $labels.instance }})
description: "GitLab process on {{ $labels.instance }} is using {{ $value | humanize1024 }}B of RSS memory.\n VALUE = {{ $value }}\n LABELS = {{ $labels }}" warning
8.4.1.17. GitLab Ruby heap fragmentation
GitLab Ruby heap fragmentation on {{ $labels.instance }} is {{ $value }}. High fragmentation wastes memory.
# Heap fragmentation above 50% means a significant amount of memory is wasted.
# A Puma worker restart may help reclaim memory.
- alert: GitLabRubyHeapFragmentation
expr: ruby_gc_stat_ext_heap_fragmentation{job=~".*gitlab.*"} > 0.5
for: 15m
labels:
severity: warning
annotations:
summary: GitLab Ruby heap fragmentation (instance {{ $labels.instance }})
description: "GitLab Ruby heap fragmentation on {{ $labels.instance }} is {{ $value }}. High fragmentation wastes memory.\n VALUE = {{ $value }}\n LABELS = {{ $labels }}" warning
8.4.1.18. GitLab rack uncaught errors
GitLab is experiencing uncaught errors in the Rack layer on {{ $labels.instance }} ({{ $value }}/s).
- alert: GitLabRackUncaughtErrors
expr: rate(rack_uncaught_errors_total[5m]) > 0.05
for: 5m
labels:
severity: warning
annotations:
summary: GitLab rack uncaught errors (instance {{ $labels.instance }})
description: "GitLab is experiencing uncaught errors in the Rack layer on {{ $labels.instance }} ({{ $value }}/s).\n VALUE = {{ $value }}\n LABELS = {{ $labels }}" warning
8.4.1.19. GitLab version mismatch
Multiple GitLab versions are running across the fleet.
# This may happen during a rolling deployment. If it persists, investigate incomplete upgrades.
- alert: GitLabVersionMismatch
expr: count(count by (version) (gitlab_build_info)) > 1
for: 0m
labels:
severity: warning
annotations:
summary: GitLab version mismatch (instance {{ $labels.instance }})
description: "Multiple GitLab versions are running across the fleet.\n VALUE = {{ $value }}\n LABELS = {{ $labels }}" warning
8.4.1.20. GitLab high file descriptor usage
GitLab on {{ $labels.instance }} is using {{ $value }}% of available file descriptors.
- alert: GitLabHighFileDescriptorUsage
expr: process_open_fds{job=~".*gitlab.*"} / process_max_fds * 100 > 80 and process_max_fds > 0
for: 5m
labels:
severity: warning
annotations:
summary: GitLab high file descriptor usage (instance {{ $labels.instance }})
description: "GitLab on {{ $labels.instance }} is using {{ $value }}% of available file descriptors.\n VALUE = {{ $value }}\n LABELS = {{ $labels }}" warning
8.4.1.21. GitLab Ruby threads saturated
GitLab running threads on {{ $labels.instance }} have exceeded the expected maximum ({{ $value }}).
- alert: GitLabRubyThreadsSaturated
expr: sum by (instance) (gitlab_ruby_threads_running_threads) > on(instance) gitlab_ruby_threads_max_expected_threads * 1.5
for: 10m
labels:
severity: warning
annotations:
summary: GitLab Ruby threads saturated (instance {{ $labels.instance }})
description: "GitLab running threads on {{ $labels.instance }} have exceeded the expected maximum ({{ $value }}).\n VALUE = {{ $value }}\n LABELS = {{ $labels }}"