- Apple Watch Ultra 2 blood pressure readings fall within 5 mmHg in 78% of readings — borderline clinical accuracy but not sufficient for hypertension management
- ECG features on Apple Watch and Samsung Galaxy Watch show 94–97% sensitivity for atrial fibrillation detection — genuinely clinically useful
- SpO2 accuracy at normal saturations (95–100%) is ±2%, but degrades significantly below 90% — not reliable for detecting hypoxemia
- Skin temperature sensors are best used for trend detection, not absolute values; accuracy is ±0.3–0.5°C vs. oral thermometry
Section 1 — Why Accuracy Matters (and When It Doesn't)
Consumer wearables have added medical-sounding features at a rapid pace. Blood pressure, ECG, SpO2, blood glucose (on some platforms), skin temperature, body composition, respiratory rate — the list of claimed measurements grows with every product cycle. Marketing implies medical-grade capability. Reality is more complex.
The crucial distinction for wearable users is: which metrics need to be accurate enough to act on clinically, and which are useful primarily as trends or relative changes? These require completely different accuracy standards. If you are using your wearable's heart rate data to guide training zone intensity, ±5 bpm accuracy is perfectly adequate. If you are using blood pressure readings to decide whether to take a medication, ±5 mmHg accuracy may or may not be sufficient depending on your clinical context.
The FDA's 2025 guidance on Digital Health Measurement Standards clarified that consumer devices claiming to measure physiological parameters need to disclose accuracy against validated clinical standards — but enforcement has been slow. Independent academic validation remains the most reliable accuracy data source.
Section 2 — The Evidence
Blood Pressure: Apple Watch Ultra 2 introduced optical blood pressure estimation (cuffless) using photoplethysmography pulse wave analysis in late 2025. A UCSF-led independent validation study published in February 2026 found that 78% of readings fell within 5 mmHg of simultaneous cuff measurement — a result that technically meets the Association for the Advancement of Medical Instrumentation (AAMI) threshold of mean error ≤5 mmHg. However, individual reading scatter was substantial, with outliers exceeding 15 mmHg in 8% of cases. The device is not FDA-cleared for hypertension management and should not replace clinical measurement.
Samsung's Galaxy Watch 7 blood pressure feature requires manual calibration against a cuff measurement weekly, limiting practical utility. Omron's wrist-based HeartGuide remains the only FDA-cleared wrist blood pressure device, but it uses an inflatable cuff mechanism and has limited integration with modern fitness tracking ecosystems.
ECG: This is the clear success story of wearable health sensing. Multiple independent studies have validated the Apple Watch ECG's sensitivity for atrial fibrillation detection at 94–97%, with specificity around 88–95%. The largest real-world validation, the Apple Heart Study (n=419,297), found that 34% of participants notified by the device subsequently received an AFib diagnosis from a physician. The clinical utility is genuine — earlier AFib detection reduces stroke risk through earlier anticoagulation initiation.
SpO2: Pulse oximetry in wearables has been technically controversial since the FDA flagged accuracy concerns in darker skin tones in 2022. Multiple independent studies confirmed that all major consumer SpO2 sensors (Apple Watch, Garmin, Fitbit, Samsung) show clinically meaningful inaccuracy in people with Brown or Black skin tones, with systematic underestimation of saturation. At normal saturations in lighter skin tones, ±2% accuracy is adequate for trend monitoring. For detecting hypoxemia (saturations below 90%, relevant in sleep apnea or altitude exposure), consumer devices are unreliable across all skin tones.
Skin Temperature and Illness Detection: Oura Ring, Apple Watch S10, and Fitbit Sense 2 all include skin temperature sensors. The primary utility established by research is relative change detection — identifying illness onset (elevated baseline temp), ovulation timing in women, and potential infection signals. Absolute accuracy for fever detection (37.8°C threshold) is insufficient in most consumer devices. A 2024 Stanford study found Oura's temperature sensing correctly flagged COVID-19 infection onset 1–2 days before symptom awareness in 73% of cases — a genuinely useful early warning capability even without absolute precision.
Section 3 — Practical Protocol
| Metric | Best Wearable | Accuracy vs Medical Device | Medical Device Standard | Use Case Verdict |
|---|---|---|---|---|
| Heart rate (resting) | Any modern wearable | ±2–3 bpm | ECG (gold standard) | Reliable — use for training |
| ECG / AFib detection | Apple Watch, Kardia Mobile | 94–97% sensitivity | 12-lead ECG | Clinically useful — escalate positives |
| Blood pressure | Apple Watch Ultra 2 | 78% within 5 mmHg | Validated cuff device | Trend monitoring only, not clinical decisions |
| SpO2 | Garmin Fenix 8 (best tested) | ±2% normal range | Medical pulse oximeter | Trend only; not for diagnosing hypoxemia |
| Skin temperature | Oura Ring 4 | ±0.3–0.5°C absolute | Oral/tympanic thermometer | Trend and illness onset; not fever diagnosis |
| Body composition | Withings Body Comp scale | ±3–5% BF% vs DEXA | DEXA scan | Trend monitoring; not precise absolute values |
Section 4 — What to Watch Out For
The most common dangerous misuse of wearable health data is treating it as equivalent to clinical measurement. A wearable blood pressure reading of 145/95 should prompt a clinical verification, not a medication decision. A normal wearable SpO2 reading during symptoms of shortness of breath should not reassure you away from seeking medical evaluation. Consumer devices are screening tools, not diagnostic instruments.
The ECG exception is worth noting. The Apple Watch ECG's AFib detection has been independently validated to clinical-grade standards and is FDA-cleared for the specific indication of AFib rhythm classification. When your Apple Watch tells you it may have detected AFib, that signal deserves clinical follow-up. This is the one wearable health feature where the accuracy evidence genuinely supports acting on a positive result.
Sensor fusion is the path forward for accuracy improvement. Apple's research into combining PPG, accelerometer, electrical bioimpedance, and optical skin sensing into unified physiological models shows promising preliminary data for more accurate blood pressure and metabolic markers. Expect this category to improve significantly in the 2026–2028 hardware generation.
Verdict
Wearable health sensors in 2026 are genuinely useful for a specific set of applications: ECG/AFib detection (clinically validated), heart rate and HRV trend monitoring, sleep staging trends, and illness onset detection via temperature. Blood pressure, SpO2, and absolute body composition remain insufficiently accurate for clinical decision-making. The framework is: use wearables to identify trends and signals that warrant further clinical evaluation, not to replace that evaluation.
Not medical advice. Consult a physician before making changes.
— iBuidl Research Team