VO2 Max: Lab Tests vs Watch Estimates
This page is educational. It describes what published research has measured. It is not medical advice and does not replace consultation with a qualified healthcare professional.
This content is educational. It describes how VO2 max is measured by laboratories versus estimated by consumer wearables. It is not training prescription or medical advice.
Why this matters
VO2 max — the maximum rate at which the body can take in and use oxygen — is one of the most-studied single measurements in exercise physiology and one of the strongest predictors of all-cause mortality in research. We covered the mortality association on the healthspan vs lifespan page.
The catch: most people now encounter VO2 max as a number on their watch face. Apple Watch, Garmin, Polar, Fitbit, Whoop, and Suunto all display "VO2 max" estimates. Almost no one has had VO2 max actually measured.
The gap between the number on the watch and the number a lab would produce is meaningful — sometimes small, sometimes large, sometimes mostly noise. This page describes what each measurement actually represents and where the limits sit.
What VO2 max is
VO2 max is the volume of oxygen the body can consume per unit time at maximum exercise intensity. It is expressed in millilitres of oxygen per kilogram of body weight per minute (ml/kg/min).
It is a composite measure that integrates:
- Pulmonary capacity (oxygen uptake at the lungs)
- Cardiac output (the heart's ability to pump oxygenated blood)
- Vascular delivery (blood flow to working tissues)
- Mitochondrial extraction (the working muscles' ability to use the oxygen)
When any of these systems is the limit, VO2 max is constrained. In trained athletes, the limit is usually cardiac output. In sedentary populations, multiple systems may be near their respective limits simultaneously.
VO2 max correlates with age, sex, training status, body composition, and altitude of habituation. It is moderately heritable — twin studies have reported approximately 40-50% genetic contribution to baseline VO2 max [Bouchard et al. 1998].
Lab measurement: cardiopulmonary exercise testing (CPET)
The gold standard for measuring VO2 max is cardiopulmonary exercise testing — CPET. The test takes place in a clinical or sports science laboratory and works as follows:
Setup
The participant wears a mask connected to a metabolic cart that measures inhaled and exhaled gas composition in real time. ECG electrodes track heart rhythm. Blood pressure is monitored. The participant exercises on a treadmill or stationary bike with intensity increasing in stages.
Direct measurement
The metabolic cart directly measures:
- Volume of inhaled air (ventilation)
- Fraction of oxygen in inhaled air (atmospheric: ~21%)
- Fraction of oxygen in exhaled air (lower as oxygen is consumed)
- Volume of exhaled air
From these, VO2 is calculated continuously: how much oxygen is the body actually consuming per unit time.
Reaching maximum
The test continues until one of several criteria is met:
- VO2 plateaus despite increasing exercise intensity (the textbook definition of VO2 max)
- The participant reaches volitional exhaustion (most common in practice)
- Heart rate reaches age-predicted maximum
- Respiratory exchange ratio (CO2 produced / O2 consumed) exceeds 1.10-1.15
- Blood lactate exceeds a defined threshold (some protocols)
The highest sustained VO2 in the final stages of the test is the measured VO2 max.
Accuracy and reproducibility
When performed correctly, CPET has test-retest reproducibility within 3-5%. The measurement is considered the gold standard precisely because it directly measures oxygen consumption rather than inferring it.
Limitations of the lab test:
- Requires a maximal effort, which depends on participant motivation
- Different protocols (Bruce, Astrand, ramp) produce slightly different values
- A single test in a sedentary participant may underestimate true VO2 max if effort isn't sustained
- Expensive — typically several hundred dollars per test
- Not widely available outside academic centres and sports science facilities
Watch estimation: how the algorithms work
Consumer wearables don't measure oxygen consumption. They estimate VO2 max from a combination of:
- Heart rate response during exercise
- Age, sex, weight (entered by the user)
- Pace and altitude data (when GPS is in use)
- Activity type and duration
The specific algorithms are proprietary and vary by manufacturer. Most are based on the Firstbeat methodology, which uses heart rate variability and the relationship between heart rate and pace during submaximal exercise to estimate maximal oxygen uptake [Firstbeat 2014].
What the algorithms actually need
For reasonable estimates, the wearable needs:
- Outdoor running or walking with GPS active. Pace data anchors the heart rate response.
- At least 10-20 minutes of sustained activity. Brief sessions don't give the algorithm enough data.
- A representative sample of intensities. A consistent jog provides better data than an erratic run.
- Accurate user-entered baseline data. Wrong weight or wrong age skews the estimate.
When these conditions are met, the watch processes the heart rate response to pace and extrapolates what your VO2 max would be at theoretical maximum effort — without you ever actually exercising at maximum.
What the watch is implicitly assuming
The estimate relies on assumptions that don't hold equally well across populations:
- Heart rate response to a given pace is consistent within and across individuals
- The relationship between submaximal and maximal effort is linear
- The user's resting and max heart rate fit population norms
For trained recreational runners doing steady outdoor sessions, these assumptions hold reasonably well. For populations the algorithm wasn't trained on — elite athletes, sedentary individuals, older adults, those with cardiovascular conditions — the estimates degrade.
How accurate are watch estimates
Validation studies comparing wearable VO2 max estimates against laboratory CPET have produced a consistent pattern:
- Trained recreational populations — correlation coefficients around 0.85-0.90 with lab measurement; typical error within ~3.5 ml/kg/min [Cao et al. 2022; Düking et al. 2020]
- Sedentary populations — correlations drop to 0.70-0.80; larger absolute error
- Highly trained athletes — wearables tend to underestimate; the algorithms weren't calibrated on elite ranges
- Older adults — limited validation data; available studies suggest wider error margins
A 2022 systematic review of consumer wearable VO2 max estimates concluded that the accuracy is "acceptable for tracking individual trends over time but not for absolute fitness assessment" [Cao et al. 2022].
Practical translation:
- "My Garmin says my VO2 max went from 44 to 47 over six months of training" — meaningful within-device trend
- "My Garmin says my VO2 max is 47 — therefore my actual lab VO2 max is 47" — not supported by the validation data
- "My Garmin says my VO2 max is 47; my friend's Apple Watch says hers is 49 — therefore she's fitter than me" — the cross-device comparison isn't reliable
What about wrist-based vs chest-strap heart rate?
The accuracy of the VO2 max estimate depends on the accuracy of the underlying heart rate data.
- Steady-state outdoor running with wrist HR — usually accurate enough for the estimate to be meaningful
- High-intensity interval training with wrist HR — wrist HR struggles with rapid heart rate changes, which degrades the underlying data
- Chest-strap HR — more accurate during variable-intensity exercise, which improves the VO2 max estimate's foundation
For users who care about the estimate, pairing the watch with a chest strap for key sessions typically improves the algorithm's confidence.
Where each is appropriate
| Goal | Tool |
|---|---|
| Tracking your own fitness trends over months | Watch estimate is sufficient |
| Comparing fitness to population norms | Watch estimate gives a usable percentile range |
| Pre-surgery cardiovascular risk assessment | CPET, not watch |
| Elite athlete training prescription | CPET, periodic re-testing |
| Research on VO2 max in cohorts | CPET; some studies validate against wearables for screening |
| Initial screening for cardiovascular fitness deficits | Watch estimate flags low fitness; clinical evaluation confirms |
The pattern: watches are useful for personal trend monitoring and rough population placement. They are not adequate for clinical decisions or competitive sport prescription.
What this means for the mortality link
The research on VO2 max and mortality (covered in the healthspan vs lifespan page) used laboratory CPET measurements, not watch estimates. The 5x mortality difference between the lowest and highest fitness quintiles in the Mandsager 2018 study used direct treadmill VO2 measurement.
What this means: the strong association between fitness and mortality is real and well-supported. But applying that finding to a personal watch number requires understanding that:
- The watch number has meaningful error against lab measurement
- The mortality association is a population statistic, not an individual causal prediction
- Improving your watch number over time is a reasonable proxy for improving your fitness — but the absolute number on the watch is approximate
Both the mortality association and the watch's utility for personal tracking can be true at the same time. They're describing different things.
Related Proco pages
- Healthspan vs lifespan: what longevity research measures
- Wearables: what they can and can't measure
- How to read a clinical trial
Sources
-
Bouchard C, Daw EW, Rice T, et al. Familial resemblance for VO2max in the sedentary state: the HERITAGE family study. Medicine and Science in Sports and Exercise. 1998;30(2):252-258.
-
Firstbeat Technologies. VO2 estimation method based on heart rate measurement. White paper. 2014.
-
Cao R, Azimi I, Sarhaddi F, et al. Accuracy Assessment of Oura Ring Nocturnal Heart Rate and Heart Rate Variability in Comparison With Electrocardiography. JMIR mHealth and uHealth. 2022;10(1):e27487.
-
Düking P, Giessing L, Frenkel MO, et al. Validity of Garmin Forerunner 935 for the assessment of selected variables in submaximal exercise. European Journal of Sport Science. 2020;20(8):1067-1075.
-
Mandsager K, Harb S, Cremer P, et al. Association of cardiorespiratory fitness with long-term mortality among adults undergoing exercise treadmill testing. JAMA Network Open. 2018;1(6):e183605.
-
Kodama S, Saito K, Tanaka S, et al. Cardiorespiratory fitness as a quantitative predictor of all-cause mortality and cardiovascular events in healthy men and women: a meta-analysis. JAMA. 2009;301(19):2024-2035.
-
Kaminsky LA, Arena R, Myers J. Reference standards for cardiorespiratory fitness measured with cardiopulmonary exercise testing: data from the Fitness Registry and the Importance of Exercise National Database. Mayo Clinic Proceedings. 2015;90(11):1515-1523.
-
Bassett DR Jr, Howley ET. Limiting factors for maximum oxygen uptake and determinants of endurance performance. Medicine and Science in Sports and Exercise. 2000;32(1):70-84.
-
Wagner PD. New ideas on limitations to VO2max. Exercise and Sport Sciences Reviews. 2000;28(1):10-14.
-
Smolander J, Juuti T, Kinnunen ML, et al. A new heart rate variability-based method for the estimation of oxygen consumption without individual laboratory calibration: application example on postal workers. Applied Ergonomics. 2008;39(3):325-331.
Proco provides educational, research-based information. This page describes measurement methodology. It is not training advice or medical advice. Before beginning a new exercise program — particularly if you are over 60, have a cardiovascular condition, or have been previously sedentary — consult a qualified clinician.
Schema (for implementation)
{
"@context": "https://schema.org",
"@type": "Article",
"headline": "VO2 Max: Lab Tests vs Watch Estimates",
"description": "Laboratory VO2 max testing directly measures oxygen consumption during exercise. Wearables estimate it from heart rate and pace data. This page describes the methods and the accuracy gap.",
"datePublished": "2026-06-01",
"dateModified": "2026-05-31",
"author": {"@type": "Organization", "name": "Proco"},
"publisher": {"@type": "Organization", "name": "Proco", "url": "https://procohq.com"},
"about": {"@type": "Thing", "name": "VO2 max measurement methodology"}
}
Proco provides educational, research-based information. It does not diagnose, treat, cure, or prevent any condition. Individual responses to interventions vary based on age, health status, medications, and other factors. If you are pregnant, breastfeeding, take prescription medication, manage a chronic condition, or are considering health changes for a child, talk to a qualified healthcare professional before relying on any information from Proco.
If you are experiencing a medical emergency, contact your local emergency services.