// Clinical brief

How We Evidence-Grade Consumer Health Apps (A–F)

By Joel Kingsley, RD, MS Publicado 18 de mayo de 2026 Actualizado 22 de mayo de 2026 Última revisión 22 de mayo de 2026

The Five Grades

We assign every evaluated app an Evidence Grade — a structured assessment of the published validation evidence supporting the product’s clinical claims.

A — ≥ 1 published RCT or equivalent peer-reviewed validation study validating the app as a clinical intervention versus an active comparator
B — Peer-reviewed observational validation published, or appearance as the intervention tool in published behavioral interventions
C — Manufacturer-cited validation, not independently peer-reviewed; or methodology published and architecturally sound but pre-validation
D — Methodology described but no validation evidence
F — No validation evidence and no methodology published

The grades anchor to the published evidence as it stands at the report date — they update as new evidence publishes.

Why Most Consumer Apps Land in C or D

The consumer wellness category structurally does not produce Grade A evidence because Grade A requires positioning the product as a medical device — which triggers FDA/MHRA regulatory pathways that most consumer publishers explicitly avoid. The publishers that do produce validation evidence (Noom, Cronometer indirectly via database citations, MyFitnessPal via behavioral RCTs) earn Grade B but not A.

Grade C is the modal grade in our 2026 ranking because most publishers document their methodology (sufficient for C) but do not commission independent validation studies (which would earn B).

What an Evidence Grade Is Not

The Evidence Grade does not measure clinical accuracy directly — that’s a separate criterion in our Clinical Evaluation Framework (Clinical Accuracy, 20% of the overall score). It measures the evidence base supporting the publisher’s claims.

A Grade C app with strong architecture can outperform a Grade B app with weak architecture on the Clinical Accuracy criterion. The Evidence Grade tells you what’s been validated; the accuracy sub-score tells you what’s likely true based on architecture and benchmark data.

How the Grade Affects the Recommendation

For most consumer use, Grade C is acceptable — the app is documented, the methodology is sound, validation evidence is plausible but pre-published. For clinical-adjacent use (RD-supervised, research data collection, weight-management under medical supervision), Grade B or higher is the right floor. For users requiring regulatory clearance (Class II medical-device-equivalent use), no consumer calorie tracker is currently appropriate — that requires a dedicated regulated product.

Referencias

Stoyanov SR et al. Mobile App Rating Scale: A new tool for assessing the quality of health mobile apps. JMIR Mhealth Uhealth. 2015;3(1):e27.. 10.2196/mhealth.3422
Mathews SC et al. Digital health: a path to validation. NPJ Digit Med. 2019;2:38.. 10.1038/s41746-019-0111-3
Larsen ME et al. Quantifying app store dynamics. JMIR Mhealth Uhealth. 2016;4(3):e96.. 10.2196/mhealth.6020

Preguntas frecuentes

What is an Evidence Grade?

A Clinical App Report Evidence Grade (A–F) is a structured assessment of the published validation evidence supporting a consumer health app's clinical claims. Grade A is the strongest (an RCT validating the app as an intervention); Grade F is the weakest (no methodology, no validation). Most consumer health apps land in Grades C–D.

Why don't any consumer calorie trackers carry Evidence Grade A?

Grade A requires a published RCT validating the app as a clinical intervention versus an active comparator. The consumer calorie tracker category has been studied as a behavior in multiple RCTs (with MyFitnessPal frequently used as the intervention tool), but no published RCT validates a specific app versus an active comparator with the app itself as the variable of interest. This is normal for the consumer wellness category — the regulatory path for Grade A would require the publisher to position the product as a medical device, which most consumer publishers explicitly avoid.