Wearables for COVID-19 Detection: 1 Year Later

Last May I wrote a post discussing the growing interest in using wearables to aid the battle against COVID-19. Just over 1 year later the preliminary results of several studies on this topic have been published. 

In this article I round up the research related to using wearables to detect COVID-19 before or at the time of symptom onset.

Why wearables?

Wearables could serve as a first-level screening for COVID-19 to alert individuals of a possible infection early in the course of the disease. The rationale is that an alert from a wearable would:

(1) allow the individual to take immediate precautionary measures (isolation, masking, testing) to prevent spreading

(2) help health officials (or businesses, sports organizations, schools, etc.) to prioritize who should get COVID-19 testing on a given day when resources are limited

To be clear, at this point no one is arguing for wearables to serve as a medical-grade diagnostic test. Rather, they may offer a valuable first-line defense against illness spread, because individuals can use them in a hands-off 24/7 manner and receive alerts as soon as they wake up in the morning. And while wearable data is not medical grade, the presence of many data points, over several months, and from multiple physiological data streams (heart rate, respiratory rate, temperature, activity, sleep) could overcome limitations in the accuracy of a single measure.

How this article is organized

For each paper I summarize the purpose, methods, and key results, along with an exemplary graph and caption copied from the paper to help illustrate the findings. Where possible, I separate results that I consider proof-of-concept (looking at data to figure out if there is anything possibly meaningful there) from detection algorithms (actually trying to predict infection from the data using a model that was not built on that same data).

If you don’t want to get into the nitty gritty, you can just jump to the end of the article where I attempt to pull all these findings together and look ahead to what's next in this space.

Note: Most of the methods are complex, so I do not dive heavily into the details here. My main goal is to give a high-level overview. I strongly encourage you to dig into the papers themselves via the links provided. Two papers have not yet been peer-reviewed at the time of publication and are denoted by PREPRINT.

The Papers

Miller, D. J. et al. Analyzing changes in respiratory rate to predict the risk of COVID-19 infection. PLoS ONE 15, e0243693 (2020).

Mishra, T. et al. Pre-symptomatic detection of COVID-19 from smartwatch data. Nat Biomed Eng 4, 1208-1220 (2020).

Natarajan, A., Su, H.-W. & Heneghan, C. Assessment of physiological signs associated with COVID-19 measured using wearable devices. NPJ Digit Med 3, 156–8 (2020).

Smarr, B. L. et al. Feasibility of continuous fever monitoring using wearable devices. Sci Rep 10, 21640–11 (2020).

Hirten, R. P. et al. Use of Physiological Data From a Wearable Device to Identify SARS-CoV-2 Infection and Symptoms and Predict COVID-19 Diagnosis: Observational Study. J Med Internet Res 23, e26107 (2021).

Quer, G. et al. Wearable sensor data and self-reported symptoms for COVID-19 detection. Nat Med 27, 73–77 (2021).

Hassantabar, S. et al. CovidDeep: SARS-CoV-2/COVID-19 Test Based on Wearable Medical Sensors and Efficient Neural Networks. (2020). Preprint.

Liu, S. et al. Fitbeat: COVID-19 Estimation based on Wristband Heart Rate. (2021). Preprint.

Terminology

Several terms come up frequently in this area of study, so I’ll define them here:

HR (or RHR) - heart rate, typically taken during a period of sleep or daytime rest for the purpose of these analyses. The calculation depends on the study and may be the mean, median, or minimum during a given time interval. An unusual increase in resting HR may indicate illness.

HRV - heart rate variability, or how (in)consistent the time between heart beats is. Also typically measured during rest or sleep. There are several different ways to calculate it (for example, root mean sum of squared differences or RMSSD), but generally too little or too much may signal illness.

RR - respiration rate, or how many breaths an individual takes per min. Similar to HR and HRV, this is typically calculated as an average during rest or sleep to maximize accuracy. RR may increase due to an illness that causes breathing difficulties, such as COVID-19.

T - temperature. In the studies presented here it represents skin temperature rather than core body temperature.

Sensitivity - true positive rate, or the percentage of individuals with the disease that are correctly identified by a classification algorithm (sick/not sick).

Specificity - true negative rate, or the percentage individuals without the disease who are correctly identified as not having the disease by a classification algorithm (sick/not sick).

PPV - positive predictive value, or the probability that a person with a positive test actually has the disease.

NPV - negative predictive value, or probability that a person with a negative test actually does not have the disease.

AUC - area under the receiver-operating characteristic (ROC) curve, which is created by plotting the sensitivity of a diagnostic test against its false positive rate. The closer the value is to 1, the better the test is at classifying disease.

Now the good stuff…

 

Round Up of Research on Early Detection of COVID-19 via Wearables

Analyzing changes in respiratory rate to predict the risk of COVID-19 infection

Miller et al. 2020, Results of predictive algorithm for COVID-19 from respiratory rate data. Cumulative percentage of individuals from each dataset that were classified as COVID-19 positive (C+, black) or COVID-19 negative (C-, gray) relative to symptom onset (day = 0).

Miller et al. 2020, Results of predictive algorithm for COVID-19 from respiratory rate data. Cumulative percentage of individuals from each dataset that were classified as COVID-19 positive (C+, black) or COVID-19 negative (C-, gray) relative to symptom onset (day = 0).

Purpose
To predict COVID-19 infection before or during early symptom onset using RR.  

Participants
271 WHOOP Strap users who reported COVID-19 symptoms and test results from March 14 through June 6, 2020. They were divided into 3 groups: training (COVID-19 symptoms March 14-April 14), validation #1 (COVID-19 positive with symptoms April 14-June 6), and validation #2 (COVID-19 negative with symptoms April 14 – June 6).

Wearable
WHOOP Strap

Wearable Measures
RR (nighttime) transformed into several different features. The features essentially examined different ways of normalizing the current RR value to “baseline.” One feature also looked for a linear trend in RR in the preceding nights.

Early Detection Algorithm
The model used several RR features to provide a probability that an individual was infected with COVID-19 on the current day. Days with 30% probability or higher of COVID-19 were classified as C+ and the remaining days as C-. The model had 36.5% sensitivity and 95.3% specificity in individuals with confirmed COVID-19, meaning it accurately classified 36.5% of actual infected days (-2 days symptom to +3 post symptom onset) as C+ and 95.3% of healthy days (defined as 30 to 14 days prior to symptom onset) classified as C-. The PPV was 73.8% and NPV was 80.6%. 20% of COVID-19 cases were detected 2 days before onset, 30% at symptom onset, and 80% by 3 days post onset.

Summary
This is one of the first studies to use RR data as the primary input to its model. The results are promising and provide good proof of concept that RR could support illness detection. Like other studies, the model results suggest it would be best suited for identifying absence of disease. The authors also provide valuable information about the typical variability of RR, finding that across 750,000 nights of data RR appears to be less variable than resting HR or HRV, which may make anomaly detection easier.  

Read more: Miller et al. 2020

Pre-symptomatic detection of COVID-19 from smartwatch data

Mishra et al. 2020, Examples of heart rate metrics during COVID-19 illness for two individual participants from group I (a and b). Red and purple vertical dashed lines indicate the days of symptom onset and diagnosis, respectively. Shown are the sta…

Mishra et al. 2020, Examples of heart rate metrics during COVID-19 illness for two individual participants from group I (a and b). Red and purple vertical dashed lines indicate the days of symptom onset and diagnosis, respectively. Shown are the standardized HROS from the HROS-AD method (bottom plot in each panel; dark blue lines) and the standardized heart rate residuals from the RHR-Diff method (top plot in each panel; black lines). For RHR-Diff, the green dashed line is at 0. Gold solid triangles mark the infection detection window used to score detections as a hit or a miss. Also indicated are time intervals when the heart rate residuals were significantly elevated from RHR-Diff (red arrows in the top plots of each panel) and times when anomalies were detected by HROS-AD (red dots in the bottom plots of each panel).

Purpose
(1) To demonstrate that HR from fitness trackers can be used to detect COVID-19 infection before symptom onset. (2) To present an algorithm to identify early stages of infection from real-time HR monitoring.

Participants
118 individuals from a larger ongoing study (5,000+). Divided into 3 groups: 32 COVID-19 positive, 13 with confirmed illness that was not COVID-19, 73 healthy individuals with no report of illness or symptoms.

Wearable
Fitbit

Wearable Measures
HR (appears to be nighttime but the methods are unclear), steps (daily total), sleep duration (nightly total and by sleep stage)

Proof of Concept Results
- A single episode or tight cluster of episodes of elevated HR and/or elevated HROS (HR divided by total steps) was evident in 26 of the 32 individuals with COVID-19 in the 28 days before or during the illness period.
- How much resting HR increased due to illness did not correlate with types of symptoms or illness duration.
- Steps significantly decreased at the onset of elevated HR whereas sleep duration increased only after elevated HR, which may indicate that sleep is not as suitable as other measures for early detection.
- Elevated HR relative to baseline (28-day rolling average) occurred before symptom onset also in many of the non-COVID illnesses suggesting that these changes can be used to detect multiple illnesses.

Early Detection Algorithm
Based on these findings the researchers implemented an algorithm to detect COVID-19 based on cumulative occurrences of abnormally high HR relative to the individual’s baseline. Alerts occurred when the accumulated deviations from normal exceeded a pre-set threshold. A second alert occurred when abnormal deviations were maintained for at least 24 hours. When the algorithm was applied retrospectively to the dataset, 62.5% of COVID-19 positive individuals had at least 1 alert within 28 days before symptom onset, but alerts also occurred in the healthy dataset and in 9 of the 15 individuals with non-COVID-19 illness. 

Summary
Using accumulations of abnormally elevated HR over a 24-hour period is a promising measure for illness detection in real-time. Further work is needed to improve the algorithm’s predictive power and to validate the algorithm on a new dataset.

Read more: Mishra et al. 2020

Assessment of physiological signs associated with COVID-19 measured using wearable devices

Natarajan et al. 2021, Variation of metrics with day: The Z-scores for respiration rate, heart rate, HRV (RMSSD), and entropy. Day 0 (D0) represents the start of symptoms. The respiration rate and heart rate are elevated during times of sickness, while the RMSSD and entropy are decreased. These metrics may change a few days prior to the start of symptoms. The heart rate decreases on average, following day D+7, and returning to the base value by day D+21. The HRV metrics are slightly elevated on average during this period. We did not notice a decrease in respiration rate during this phase. Error bars represent the standard error of the mean.

Natarajan et al. 2021, Variation of metrics with day: The Z-scores for respiration rate, heart rate, HRV (RMSSD), and entropy. Day 0 (D0) represents the start of symptoms. The respiration rate and heart rate are elevated during times of sickness, while the RMSSD and entropy are decreased. These metrics may change a few days prior to the start of symptoms. The heart rate decreases on average, following day D+7, and returning to the base value by day D+21. The HRV metrics are slightly elevated on average during this period. We did not notice a decrease in respiration rate during this phase. Error bars represent the standard error of the mean.

Purpose
To predict COVID-19 presence prior to symptom presentation with user-provided data and self-reported symptoms.

Participants
2745 Fitbit users diagnosed with COVID-19 between May 21 and September 11, 2020

Wearable
Fitbit

Wearable Measures
HR (average during non-REM sleep), HRV (median 5-min RMSSD and Shannon entropy of nighttime HR), RR (average during deep sleep).

Proof of Concept Results
- Comparing HR, HRV, and RR before and after COVID-19 illness onset, RR was most affected by COVID-19 (large increase) and took the longest to return to normal after illness.
- Increased HR and decreased HRV were also evident with illness.

Early Detection Algorithm
Researchers used machine-learning to predict the probability of COVID-19 on a given day. The algorithm relies on wearable data for the given day and the 4 preceding days and had an AUC of 0.77. At 95% specificity, it has a 44% sensitivity, 96% specificity, and predicts 15% of sick individuals 1 day before illness onset.

Summary
This paper presents a promising algorithm for confirming absence of disease based on HR, HRV, and RR.  A negative result is highly predictive that the individual is healthy while a positive result suggests that precautions (isolation, masking) should be taken until a follow-up medical diagnostic test is obtained. This may seem counterintuitive at first, but if you consider that testing is costly and hard to complete daily (and even then results may take several days), an algorithm that can give you a green light to leave your house and socialize could be very helpful, both for the current pandemic or for the seasonal flu and other diseases. 

Read more: Natarajan et al. 2020

Feasibility of continuous fever monitoring using wearable devices

Smarr et al. 2021, Wearable distal temperature sensors are suitable for developing digital biomarkers for fever with and without paired symptom reports. An example T record (e) with fever-like days identified by exceeding these thresholds before ons…

Smarr et al. 2021, Wearable distal temperature sensors are suitable for developing digital biomarkers for fever with and without paired symptom reports. An example T record (e) with fever-like days identified by exceeding these thresholds before onset of symptom report (e; black dots represent daily min and max above thresholds) has similar changes in heart rate (HR), HR variability (HRV), and respiration rate (RR), to the reported fever event (f by variable, and g,h with overlay, respectively). (fh) All lines are smoothed by 360 min radius, displaying the same smoothing used to generate median minimum and maximum values for each day. Faded blue line in (g,h) is the raw T (1 point/min).

Purpose
To demonstrate that fever detection and prediction is feasible from continuously collected skin temperature.

Participants
The first 50 participants in the TemPredict dataset (now at over 20,000 participants) who reported a period of COVID-19 symptoms (no diagnostic test) prior to enrolling and had accompanying wearable data. 

Wearable
Oura Ring

Wearable Metrics
HR (5 min average during sleep), HRV (RMSSD 5 min average during sleep), RR (average during sleep), skin T (finger, daily maximum and minimum values)

Proof of Concept Results
-
T (both max and min) and RR were elevated at self-reported onset of fever compared to individual baseline values (minimum 14-day window within 40 days prior to symptom onset).
- If T was used to identify fever, rather than self-report, other significant physiological changes were generally evident: increased HR, decreased HRV, and increased RR relative to baseline.
- Anomalies in the circadian rhythms of the T signal were found up to one week before fever onset in 93% of cases.

SUMMARY
This is the only study to present detailed information on skin temperature. These results suggest that finger skin temperature can be used to identify and predict fever. The algorithm needs to be tested in a new dataset, which the study is currently doing, and may be improved by adding other metrics such as HR and RR.

Read more: Smarr et al. 2020

Longitudinal Physiological Data from a Wearable Device Identifies SARS-CoV-2 Infection and Symptoms and Predicts COVID-19 Diagnosis

Hirten et al. 2021, Relationship between HRV circadian rhythm and COVID-19 status. Daily HRV patterns for days on which subjects were healthy, 7 days before a positive COVID-19 test, 7 days after a positive COVID-19 test, and 7-14 days after a positive COVID-19 test. Means and 95% confidence intervals for the acrophase, amplitude, and MESOR of the HRV measured on days when participants were healthy, 7 days before a positive COVID-19 test, 7 days after a positive COVID-19 test, and 7-14 days after a positive COVID-19 test. HRV: heart rate variability; MESOR: midline statistic of rhythm; SDNN: standard deviation of the interbeat interval of normal sinus beats.

Hirten et al. 2021, Relationship between HRV circadian rhythm and COVID-19 status. Daily HRV patterns for days on which subjects were healthy, 7 days before a positive COVID-19 test, 7 days after a positive COVID-19 test, and 7-14 days after a positive COVID-19 test. Means and 95% confidence intervals for the acrophase, amplitude, and MESOR of the HRV measured on days when participants were healthy, 7 days before a positive COVID-19 test, 7 days after a positive COVID-19 test, and 7-14 days after a positive COVID-19 test. HRV: heart rate variability; MESOR: midline statistic of rhythm; SDNN: standard deviation of the interbeat interval of normal sinus beats.

Purpose
To determine whether changes in HRV can (1) differentiate participants infected or not infected with COVID-19 and (2) predict development of COVID-19 prior to diagnosis.

Participants
Healthcare workers in the Mount Sinai Health System (n = 297) including 13 who tested positive for COVID-19 from April 29 to Sep 29, 2020.

Wearable
Apple Watch Series 4 or higher

Wearable metrics
HRV during day (computed by the Apple Watch)

Proof of Concept Results
There were significant differences in the circadian pattern of HRV between COVID-19 positive (reduced amplitude) and negative individuals. Similarly, when the COVID-19 positive patients data was compared more than 7 days before diagnosis (“healthy”) versus 7 days before diagnosis or 7 days after diagnosis, HRV circadian amplitude was significantly smaller during the periods 7 days before and after diagnosis. The first day of COVID-19 symptoms was also marked with significantly higher mean (adjusted for circadian rhythm variations) and lower amplitude of HRV compared to all other days.

Summary
The Apple Watch’s HRV algorithm is proprietary and produces inconsistent and infrequent samples. Despite these challenges, there still appears to be some predictive value in the HRV, particularly when a more sophisticated analysis that accounts for natural variations in HRV throughout the day is used. This is encouraging since real-world wearable data is often messy. This type of analysis now needs to be applied as part of a predictive model and/or paired with additional metrics to improve its power. 

Read more: Hirten et al. 2021

Wearable sensor data and self-reported symptoms for COVID-19 detection 

Quer et al. 2021, Prediction of COVID-19 from self-reports symptoms and sensor data. Receiver operating characteristic curves (ROCs) for the discrimination between COVID-19-positive (54 individuals) and COVID-19-negative (279 individuals) cases base…

Quer et al. 2021, Prediction of COVID-19 from self-reports symptoms and sensor data. Receiver operating characteristic curves (ROCs) for the discrimination between COVID-19-positive (54 individuals) and COVID-19-negative (279 individuals) cases based on the available data: RHR (a); sleep (b); activity (c); all available sensor data (d); symptoms only (e); symptoms with sensor data (f). Models are based on a single decision threshold. Median values and 95% confidence intervals (CIs) for sensitivity (SE), specificity (SP), positive predictive value (PPV) and negative predictive value (NPV) are reported, considering the point on the ROC with the highest average value of sensitivity and specificity. Error bars represent 95% CIs. P values from a one-sided Mann–Whitney U test are reported.

Purpose
To assess whether adding sensor data to self-reported symptoms can better differentiate COVID-19 positive from negative cases than symptoms alone.

Participants
333 individuals from a large study cohort (30,000+) who experienced COVID-19 symptoms, received COVID-19 testing, and had corresponding wearable data. Divided into two groups: 54 COVID-19 positive, 279 COVID-19 negative.

Wearable
Any but primarily Fitbit devices (78.4%)

Wearable metrics
Resting HR (daily, calculated per device’s software), sleep duration (total from 12 pm to 12 pm following day), steps (daily total)

Proof of Concept Results
Sleep and activity but not resting HR were significantly different between the COVID-19 positive and negative individuals.

Early Detection Algorithm
Researchers created a relatively simple illness score from the weighted sums of max resting HR, total sleep, and average daily steps, normalized to “baseline” (median of 7-21 days before symptom onset). When the wearable score was added to a weighted sum of self-report symptoms (loss of taste/smell, cough, fatigue), age, and sex, it could detect COVID-19 reasonably well (AUC = 0.80, sensitivity = 72%, specificity = 73%, PPV = 35%, NPV = 93%). The prediction of the combined model was better than the models based only on wearable data or only on symptoms and demographics. 

Summary
COVID-19 was well detected by a simple calculation using HR, steps, sleep, symptoms, age, and sex. While the researchers frame this as a predictive model for COVID-19, the data suggests that it would be better formatted as a model that confirms the absence of COVID-19. The most effective form of the model requires self-reported symptoms, so it is de facto not a pre-symptomatic model, but the wearable-related findings are useful nonetheless.

Read more: Quer et al. 2021

CovidDeep: SARS-CoV-2/COVID-19 Test Based on Wearable Medical Sensors and Efficient Neural Networks (Preprint)

Hassantabar et al. 2020, Schematic diagram of the CovidDeep framework. GSR: Galvanic skin response, IBI: inter-beat interval, Ox.: oxygen saturation, BP: blood pressure, DT/RF: decision tree/random forest, NN: neural network, KB: knowledge-base, MND: multi-variate Normal distribution, GMM: Gaussian mixture model, KDE: kernel density estimation.

Hassantabar et al. 2020, Schematic diagram of the CovidDeep framework. GSR: Galvanic skin response, IBI: inter-beat interval, Ox.: oxygen saturation, BP: blood pressure, DT/RF: decision tree/random forest, NN: neural network, KB: knowledge-base, MND: multi-variate Normal distribution, GMM: Gaussian mixture model, KDE: kernel density estimation.

Purpose
To develop a framework for daily detection of COVID-19 from wearable data.

Participants
87 individuals divided into 3 groups: 27 COVID-19 positive but asymptomatic, 30 COVID-19 positive with symptoms, 30 healthy

Wearable
Empatica E4

Wearable Measures
HR, T, galvanic skin response (GSR) (also pulse oximetry and blood pressure from other devices)

Early Detection Algorithm
The authors trained models using machine-learning methods and then tested them on the same dataset. They explored several inputs, types of training sets, and model designs. Ultimately the highest accuracy (98.1%) was obtained with a model using GSR, pulse oxygen, blood pressure, and self-reported symptoms. Most interestingly, it has a 4.5% rate of false negatives (says the individual is healthy when they are sick) in the the asymptomatic COVID-19 cohort.

SUMMARY
This is the only study examining GSR or adding in blood pressure or pulse oximetry, which are not wearable measures but are obtainable easily at home. It demonstrates the feasibility of real-time COVID-19 detection from wearable data and provides valuable insight into model design. It now needs to be refined and validated on other datasets.

Read more: Hassantabar 2020 (PREPRINT) 

Fitbeat: COVID-19 Estimation based on Wristband Heart Rate (Preprint)

Liu et al. 2021,  Results of COVID-19 detection algorithm from heart rate data. Continuous binary COVID-19 yes/no classification on each given 14-day heart rate windows of an exemplary individual.

Liu et al. 2021, Results of COVID-19 detection algorithm from heart rate data. Continuous binary COVID-19 yes/no classification on each given 14-day heart rate windows of an exemplary individual.

Purpose
To predict the presence of COVID-19 symptoms using HR. 

Participants
87 individuals with multiple sclerosis from a separate study (IMI2 RADAR-CNS) divided into 3 groups: 19 individuals with COVID-19 (diagnosed by symptoms), 49 individuals with some COVID-19 symptoms but did not meet diagnostic criteria, and 19 matched controls without symptoms.

Wearable
Fitbit Charge 2 or 3

Wearable Measures
HR (5-minute average)

Early Detection Algorithm
Researchers used a machine learning approach to detect the presence of COVID-19. They trained a convolution neural network on the 49 individuals with some symptoms and then applied it to the 19 individuals with COVID-19. They investigated several versions of the model, with the best reporting 100% sensitivity and 92% specificity 1 day before symptom onset and up to 53% sensitivity and 62% specificity 3 days before symptom onset.

Summary
In keeping with other papers, resting HR appears promising for early detection of COVID-19 or other illness. The results reported seem too good to be true if I’m reading them correctly. Nonetheless, it provides a valuable proof of concept along with information about what may be the optimal deep learning approach for this type of illness detection.

Liu et al. 2021 (PREPRINT)

So What Does It All Mean?

Wrist-wearable-on-athlete.jpg

It’s encouraging that there are several groups working on this problem in parallel, each taking slightly different approaches with seemingly similar success. Which approach shakes out to be the most effective remains to be seen. However, several key themes have emerged:

  1. Heart rate, heart rate variability, respiratory rate, and skin temperature have value in detecting COVID-19 —and likely other flu-like illnesses, too. Galvanic skin response, activity, and sleep parameters also may be helpful.

  2. Illness periods are best identified by looking for anomalies in the data relative to an individual’s typical or “baseline” values. There is too much difference between individuals in these types of data to allow single values in isolation to be meaningful.

  3. Complex analyses are not necessarily superior to simpler analyses. Methods used by these studies ranged from a simple mean difference in heart rate to frequency content analysis of skin temperature data. The optimal way to analyze each signal, however, is yet to be determined.

  4. Wearable data seem better suited to identifying absence of disease than the presence of disease. Users with a “negative” result could have good confidence to go about their regular lives, while “positive” results flag a need for medical consultation and isolation measures until a diagnostic test can be obtained.

Not a Silver Bullet

There are several limitations to this research that need to be noted:

  • These are mainly explorations of feasibility. None of this research is ready yet for real-time operation in the wild although several research groups are actively working toward this.

  • Research participants are not representative of the general population. Most of these studies relied on participants who already had wearable devices and also had access to COVID-19 testing in the first half of 2020. Quer et al. 2020 reported that while “a recent survey found no racial or ethnic variation in smartwatch or activity tracker usage (23%, 26% and 21% for Black, Hispanic and White individuals, respectively), the lowest percentage of users were identified in those with the lowest annual earnings (12%), the lowest educational attainment (15%) and in those over age 50 (17%).” 

  • Defining “healthy” and “sick” is fuzzy. COVID-19 has a variable incubation period of 2-14 days from infection to symptom onset. Since most studies relied on participant self-reports of date of symptom onset or positive COVID-19 test, it’s hard to know precisely when someone was infected. Additionally, some individuals classified as “healthy” actually may have been asymptomatic with COVID-19. Finally, COVID-19 diagnostic criteria were evolving in parallel with these studies. All together, this makes labelling “healthy” and “sick” datapoints noisy, which in turn challenges the models.

  • A wearable only works when you wear it. Data was lost in several studies when individuals forgot to wear their device or let it run out of charge. This is common with wearables and worse when someone is feeling unwell. Besides compromising results, it highlights the practical challenges of relying on a wearable to manage health issues.

On the Horizon

I expect we will see more evolved algorithms emerge over the coming months. Jawbone, one of the earliest consumer wearable manufacturers, appears to be coming back to market with a focus on illness detection. Other wearable companies also have begun staking a claim in this space. Meanwhile many of the research groups cited in this article are taking their proof-of-concept findings and deploying them in real-time detection studies. And we await the first results of ongoing projects at West Virginia University, Duke University, and the Department of Defense.

The Big Picture

Even though an end to COVID-19 seems in sight (thank you vaccines!), this research is important and will remain important for many years to come. Ultimately, advancing this area of science is my main motivation for summarizing these studies here.

A wearable could help with disease screening and triage in the interim. Recent estimates call for at least another year until herd immunity from COVID-19 but likely longer and with significant geographic disparity. Given the high cost of most wearable devices and lower adherence in older populations, however, it remains to be seen whether this can be done in an accessible manner. 

This research can and should be leveraged to develop broader early illness detection strategies. Most of this work looks at detecting flu-like illness rather than COVID-19 specifically. As Joseph Patterson noted in our University of Michigan panel discussion last year, a bad case of strep throat ripping through a US Army barracks is just as problematic for military readiness as the presence of COVID-19. Many illnesses inflict a serious human and financial cost, so the opportunity to detect illness before transmission to others can be tremendously beneficial to employers, the military, sports, and of course, the general public.

It appears very feasible for wearable devices that provide physiological data to be used to reliably alert an individual to possible illness. Like any scientific endeavor, much more work needs to be done, so I’ll keep this space updated as the research evolves.

Until then, wishing you low resting heart rates, ample sleep durations, and a healthy summer 2021!


Extra Credit: Other Interesting COVID-19 Wearables Research

Observational study on wearable biosensors and machine learning‐based remote monitoring of COVID‐19 patients 

The study used the Everion sensor (HR, HRV, RR, pulse oxygen saturation, skin temp, actigraphy) and a proprietary illness index (Biovitals Sentinal platform) to predict disease severity, clinical status, and length of stay for COVID-19 patients in the hospital. The proprietary index outperformed the NIH clinical status index (NEWS2) for discriminating viral load, predicting clinical worsening events, and predicting hospital discharge.

Read more: Un et al. 2021

Corona-Datenspende (Corona Data Donation)

This is an ongoing effort to create a fever map for Germany using vital signals collected by wearable health and fitness tracking devices. It aims to identify COVID-19 hot spots by detecting regions in which the number of residents exhibiting fever symptoms is higher than average. The fever detection algorithm uses resting HR (increased) and daily step count (decreased) from fitness tracking devices, while also exploring the benefit of sleep duration and quality as well. 

Read more: Robert Koch Institute

Learning from Large-Scale Wearable Device Data for Predicting the Epidemic Trend of COVID-19 

Researchers at Huami Corporation developed a framework to predict COVID-19 outbreak trends at the city and regional level from data collected from Huami connected wearable devices. The framework relies on detecting anomalies in heart rate (>+1.5 SD) and sleep duration (>0.5SD) accounting for geographic region. It has been validated against COVID-19 reported cases in several cities in China, Italy, and Spain.

Read more: Zhu et al. 2020

Previous
Previous

U-W Sports Medicine Grand Rounds 2021

Next
Next

5 Tips for Selecting a Sleep Wearable