Pingara: The Affordable Monitoring Solution

Your Pingara dashboard is mission control for monitoring. This guide explains every metric, chart, and indicator so you can make informed decisions about your infrastructure.

Dashboard Overview

The dashboard has four main sections:

Overview Cards — High-level KPIs (uptime, Apdex, incidents)
Monitors Table — Real-time status of all your monitors
Latency Chart — Performance trends over time
Incidents List — Recent outages and ongoing issues

Overview Cards

Overall Uptime

What it shows: The percentage of successful checks across all your monitors in the selected time period (24h, 7d, 30d).

How it's calculated:

Uptime % = (Successful Checks / Total Checks) × 100

What the numbers mean:

99.9% or higher — Excellent (less than 8.6 hours downtime per year)
99.5% - 99.9% — Good (less than 43 hours downtime per year)
Below 99.5% — Needs attention

Tip: Industry standard SLAs typically target 99.9% (three nines) or 99.99% (four nines) uptime.

Apdex Score

What it shows: A 0.0-1.0 score representing user satisfaction based on response times.

How it's calculated:

Apdex = (Satisfied + Tolerating/2) / Total Checks

Where:

Satisfied — Response time ≤ T (your threshold)
Tolerating — Response time ≤ 4T
Frustrated — Response time > 4T or errors

What the scores mean:

0.94-1.0 — Excellent
0.85-0.93 — Good
0.70-0.84 — Fair
0.50-0.69 — Poor
Below 0.5 — Unacceptable

Learn more about Apdex scoring.

Active Incidents

What it shows: The count of monitors currently experiencing outages or degraded performance.

Incident statuses:

Investigating — Just detected, root cause unknown
Identified — Root cause determined
Monitoring — Fix deployed, watching for recovery
Resolved — Monitor recovered (2+ consecutive successful checks)

Click the number to view incident details.

Monitors Table

The monitors table shows real-time status for each of your monitors.

Status Indicators

Icon	Status	Meaning
🟢	Up	Responding successfully
🟡	Degraded	Slow responses (> Apdex threshold)
🔴	Down	Failed checks (2+ consecutive failures)
⏸️	Paused	Monitoring temporarily stopped
⏳	Pending	Newly created, first check not yet run

Columns Explained

Monitor Name — Click to view detailed performance history
URL — The endpoint being monitored
Status — Current state (see above)
Response Time — Latest check duration in milliseconds
Uptime — Success rate over last 24 hours
Last Checked — Timestamp of most recent check
Active Incidents — Count of unresolved incidents for this monitor

What triggers a "Down" status?

Pingara uses consecutive failure detection to avoid false alarms:

First failure → Monitor stays "Up" (could be transient)
Second consecutive failure → Status changes to "Down"
Incident is created and alerts fire

This reduces false positives from temporary network blips.

Latency Chart

The latency chart visualizes response time trends across your monitors.

Percentile Lines

p50 (median) — Half of requests are faster than this
p95 — 95% of requests are faster than this (common SLA target)
p99 — 99% of requests are faster than this (captures worst-case)

Why percentiles matter: Average response time can hide outliers. If p95 is 200ms but p99 is 2000ms, some users are experiencing 10x slower responses.

Time Range Selector

Switch between:

24 hours — Spot recent performance changes
7 days — Identify weekly patterns (e.g., traffic spikes)
30 days — Track long-term trends and seasonal effects

Interpreting the Chart

Healthy pattern:

All three lines stay relatively flat
p99 stays within 2-3x of p50
No sudden spikes

Warning signs:

p99 line frequently spikes
Growing gap between p50 and p99
Gradual upward trend (degrading performance)

Action: If you see spikes, drill down into the specific monitor to see which regions or time periods are affected.

Incidents List

Shows recent incidents across all monitors, ordered by start time (newest first).

Incident Information

Each incident shows:

Monitor name — Which monitor went down
Status — investigating / identified / monitoring / resolved
Started — When the incident began
Duration — How long it lasted (or is lasting)
Error type — DNS failure, timeout, 5xx error, etc.

Color Coding

🔴 Red — Active incidents (investigating, identified, monitoring)
🟢 Green — Resolved incidents

Root Cause Hints

Click an incident to see AI-generated root cause analysis. Pingara's AI examines:

DNS lookup time (DNS issues?)
TCP connection time (network problems?)
TLS handshake time (certificate issues?)
Time to first byte (slow backend?)
HTTP status code
Error messages

Learn more about AI root cause analysis.

Dashboard Actions

Add Monitor

Click "Add Monitor" in the top right to create a new monitor. You'll configure:

URL and monitor type
Check interval and regions
Expected status codes
SSL certificate tracking
Keyword validation

Learn more about creating monitors.

Pause/Resume Monitors

Click the ⏸️ icon next to a monitor to:

Pause — Stop checks temporarily (e.g., during planned maintenance)
Resume — Restart monitoring

Note: Paused monitors don't count against your plan limits but won't generate alerts.

Filter Monitors

Use the status filter to show only:

All monitors
Up monitors
Down monitors
Degraded monitors
Paused monitors

Mobile Dashboard

The dashboard is fully responsive. On mobile:

Overview cards stack vertically
Monitor table scrolls horizontally
Charts adapt to smaller screens
All actions remain accessible

Keyboard Shortcuts

Shortcut	Action
`/`	Focus search
`n`	Create new monitor
`r`	Refresh data
`?`	Show keyboard shortcuts

Dashboard Best Practices

Monitor Grouping

Use tags to organize monitors:

By environment: production, staging
By service: api, web, cdn
By criticality: critical, important, non-critical

Alert Fatigue Prevention

If you're getting too many alerts:

Increase Apdex threshold for less critical monitors
Use longer check intervals for non-production services
Set up alert policies with severity filtering

Performance Baselines

After 7-30 days, you'll have baseline data:

Normal p95 response time for each monitor
Typical uptime percentage
Expected Apdex score

Use these baselines to:

Set realistic Apdex thresholds
Detect performance regressions
Plan capacity upgrades

Next Steps

Creating Your First Monitor — Detailed monitor setup guide
Apdex Scoring — Deep dive into user satisfaction metrics
Incident Lifecycle — How incidents are detected and resolved
Setting Up Alerts — Get notified when issues occur

Understanding the Dashboard