Accuracy of 11 Wearable, Nearable, and Airable Consumer Sleep Trackers: Prospective Multicenter Validation Study.

Fitbit Sense 2, Amazon Halo Rise, SleepRoutine airables artificial intelligence comparative study consumer sleep trackers deep learning multicenter study nearables polysomnography sleep monitoring sleep stage wearables

Journal

JMIR mHealth and uHealth
ISSN: 2291-5222
Titre abrégé: JMIR Mhealth Uhealth
Pays: Canada
ID NLM: 101624439

Informations de publication

Date de publication:
02 Nov 2023
Historique:
received: 18 07 2023
accepted: 20 09 2023
revised: 08 08 2023
medline: 3 11 2023
pubmed: 2 11 2023
entrez: 2 11 2023
Statut: epublish

Résumé

Consumer sleep trackers (CSTs) have gained significant popularity because they enable individuals to conveniently monitor and analyze their sleep. However, limited studies have comprehensively validated the performance of widely used CSTs. Our study therefore investigated popular CSTs based on various biosignals and algorithms by assessing the agreement with polysomnography. This study aimed to validate the accuracy of various types of CSTs through a comparison with in-lab polysomnography. Additionally, by including widely used CSTs and conducting a multicenter study with a large sample size, this study seeks to provide comprehensive insights into the performance and applicability of these CSTs for sleep monitoring in a hospital environment. The study analyzed 11 commercially available CSTs, including 5 wearables (Google Pixel Watch, Galaxy Watch 5, Fitbit Sense 2, Apple Watch 8, and Oura Ring 3), 3 nearables (Withings Sleep Tracking Mat, Google Nest Hub 2, and Amazon Halo Rise), and 3 airables (SleepRoutine, SleepScore, and Pillow). The 11 CSTs were divided into 2 groups, ensuring maximum inclusion while avoiding interference between the CSTs within each group. Each group (comprising 8 CSTs) was also compared via polysomnography. The study enrolled 75 participants from a tertiary hospital and a primary sleep-specialized clinic in Korea. Across the 2 centers, we collected a total of 3890 hours of sleep sessions based on 11 CSTs, along with 543 hours of polysomnography recordings. Each CST sleep recording covered an average of 353 hours. We analyzed a total of 349,114 epochs from the 11 CSTs compared with polysomnography, where epoch-by-epoch agreement in sleep stage classification showed substantial performance variation. More specifically, the highest macro F1 score was 0.69, while the lowest macro F1 score was 0.26. Various sleep trackers exhibited diverse performances across sleep stages, with SleepRoutine excelling in the wake and rapid eye movement stages, and wearables like Google Pixel Watch and Fitbit Sense 2 showing superiority in the deep stage. There was a distinct trend in sleep measure estimation according to the type of device. Wearables showed high proportional bias in sleep efficiency, while nearables exhibited high proportional bias in sleep latency. Subgroup analyses of sleep trackers revealed variations in macro F1 scores based on factors, such as BMI, sleep efficiency, and apnea-hypopnea index, while the differences between male and female subgroups were minimal. Our study showed that among the 11 CSTs examined, specific CSTs showed substantial agreement with polysomnography, indicating their potential application in sleep monitoring, while other CSTs were partially consistent with polysomnography. This study offers insights into the strengths of CSTs within the 3 different classes for individuals interested in wellness who wish to understand and proactively manage their own sleep.

Sections du résumé

BACKGROUND BACKGROUND
Consumer sleep trackers (CSTs) have gained significant popularity because they enable individuals to conveniently monitor and analyze their sleep. However, limited studies have comprehensively validated the performance of widely used CSTs. Our study therefore investigated popular CSTs based on various biosignals and algorithms by assessing the agreement with polysomnography.
OBJECTIVE OBJECTIVE
This study aimed to validate the accuracy of various types of CSTs through a comparison with in-lab polysomnography. Additionally, by including widely used CSTs and conducting a multicenter study with a large sample size, this study seeks to provide comprehensive insights into the performance and applicability of these CSTs for sleep monitoring in a hospital environment.
METHODS METHODS
The study analyzed 11 commercially available CSTs, including 5 wearables (Google Pixel Watch, Galaxy Watch 5, Fitbit Sense 2, Apple Watch 8, and Oura Ring 3), 3 nearables (Withings Sleep Tracking Mat, Google Nest Hub 2, and Amazon Halo Rise), and 3 airables (SleepRoutine, SleepScore, and Pillow). The 11 CSTs were divided into 2 groups, ensuring maximum inclusion while avoiding interference between the CSTs within each group. Each group (comprising 8 CSTs) was also compared via polysomnography.
RESULTS RESULTS
The study enrolled 75 participants from a tertiary hospital and a primary sleep-specialized clinic in Korea. Across the 2 centers, we collected a total of 3890 hours of sleep sessions based on 11 CSTs, along with 543 hours of polysomnography recordings. Each CST sleep recording covered an average of 353 hours. We analyzed a total of 349,114 epochs from the 11 CSTs compared with polysomnography, where epoch-by-epoch agreement in sleep stage classification showed substantial performance variation. More specifically, the highest macro F1 score was 0.69, while the lowest macro F1 score was 0.26. Various sleep trackers exhibited diverse performances across sleep stages, with SleepRoutine excelling in the wake and rapid eye movement stages, and wearables like Google Pixel Watch and Fitbit Sense 2 showing superiority in the deep stage. There was a distinct trend in sleep measure estimation according to the type of device. Wearables showed high proportional bias in sleep efficiency, while nearables exhibited high proportional bias in sleep latency. Subgroup analyses of sleep trackers revealed variations in macro F1 scores based on factors, such as BMI, sleep efficiency, and apnea-hypopnea index, while the differences between male and female subgroups were minimal.
CONCLUSIONS CONCLUSIONS
Our study showed that among the 11 CSTs examined, specific CSTs showed substantial agreement with polysomnography, indicating their potential application in sleep monitoring, while other CSTs were partially consistent with polysomnography. This study offers insights into the strengths of CSTs within the 3 different classes for individuals interested in wellness who wish to understand and proactively manage their own sleep.

Identifiants

pubmed: 37917155
pii: v11i1e50983
doi: 10.2196/50983
pmc: PMC10654909
doi:

Types de publication

Multicenter Study Journal Article

Langues

eng

Sous-ensembles de citation

IM

Pagination

e50983

Informations de copyright

©Taeyoung Lee, Younghoon Cho, Kwang Su Cha, Jinhwan Jung, Jungim Cho, Hyunggug Kim, Daewoo Kim, Joonki Hong, Dongheon Lee, Moonsik Keum, Clete A Kushida, In-Young Yoon, Jeong-Whun Kim. Originally published in JMIR mHealth and uHealth (https://mhealth.jmir.org), 02.11.2023.

Références

J Perinatol. 2004 Mar;24(3):135-6
pubmed: 15044928
Med Sci Sports Exerc. 2019 Jul;51(7):1538-1557
pubmed: 30789439
Sensors (Basel). 2016 May 05;16(5):
pubmed: 27164110
Nat Sci Sleep. 2021 Dec 24;13:2239-2250
pubmed: 35002345
Expert Rev Med Devices. 2019 Dec;16(12):1041-1052
pubmed: 31774330
Sleep Sci. 2021 Jan-Mar;14(1):83-86
pubmed: 34104344
Sleep. 2013 Nov 01;36(11):1747-55
pubmed: 24179309
Chronobiol Int. 2018 Apr;35(4):465-476
pubmed: 29235907
J Sleep Res. 2020 Feb;29(1):e12935
pubmed: 31674096
J Pers Med. 2017 May 24;7(2):
pubmed: 28538708
Clocks Sleep. 2021 May 03;3(2):274-288
pubmed: 34063579
J Clin Sleep Med. 2019 Mar 15;15(3):483-487
pubmed: 30853052
Nat Sci Sleep. 2022 Mar 22;14:493-516
pubmed: 35345630
Sensors (Basel). 2021 Jun 23;21(13):
pubmed: 34201861
Sleep. 2021 Feb 12;44(2):
pubmed: 32882005
PLoS One. 2020 Dec 11;15(12):e0243214
pubmed: 33306678
Chronobiol Int. 2021 Jul;38(7):1010-1022
pubmed: 33792456
Expert Rev Med Devices. 2016 May;13(5):497-506
pubmed: 27043070
Sleep. 2021 May 14;44(5):
pubmed: 33378539
J Med Internet Res. 2019 Nov 28;21(11):e16273
pubmed: 31778122
Sensors (Basel). 2022 Aug 18;22(16):
pubmed: 36015949
Curr Sleep Med Rep. 2019;5(3):156-163
pubmed: 33134038
Sleep Breath. 2019 Mar;23(1):13-24
pubmed: 29687190
Sleep. 2015 Jun 01;38(6):843-4
pubmed: 26039963
J Med Internet Res. 2023 Jun 1;25:e46216
pubmed: 37261889
Circulation. 1995 Apr 1;91(7):1918-22
pubmed: 7895347
J Community Hosp Intern Med Perspect. 2014 Nov 25;4(5):24983
pubmed: 25432643
J Clin Sleep Med. 2008 Oct 15;4(5):487-504
pubmed: 18853708
J Clin Sleep Med. 2019 Jul 15;15(7):1051-1061
pubmed: 31383243

Auteurs

Taeyoung Lee (T)

Asleep Co., Ltd., Seoul, Republic of Korea.

Younghoon Cho (Y)

Asleep Co., Ltd., Seoul, Republic of Korea.
Clionic Lifecare Clinic, Seoul, Republic of Korea.

Kwang Su Cha (KS)

Asleep Co., Ltd., Seoul, Republic of Korea.

Jinhwan Jung (J)

Asleep Co., Ltd., Seoul, Republic of Korea.

Jungim Cho (J)

Asleep Co., Ltd., Seoul, Republic of Korea.

Hyunggug Kim (H)

Asleep Co., Ltd., Seoul, Republic of Korea.

Daewoo Kim (D)

Asleep Co., Ltd., Seoul, Republic of Korea.

Joonki Hong (J)

Asleep Co., Ltd., Seoul, Republic of Korea.

Dongheon Lee (D)

Asleep Co., Ltd., Seoul, Republic of Korea.

Moonsik Keum (M)

Clionic Lifecare Clinic, Seoul, Republic of Korea.

Clete A Kushida (CA)

Department of Psychiatry and Behavioral Sciences, Stanford University Medical Center, Redwood City, CA, United States.

In-Young Yoon (IY)

Department of Psychiatry, Seoul National University Bundang Hospital, Seongnam-si, Republic of Korea.

Jeong-Whun Kim (JW)

Department of Otorhinolaryngology-Head and Neck Surgery, Seoul National University Bundang Hospital, Seongnam-si, Republic of Korea.

Articles similaires

[Redispensing of expensive oral anticancer medicines: a practical application].

Lisanne N van Merendonk, Kübra Akgöl, Bastiaan Nuijen
1.00
Humans Antineoplastic Agents Administration, Oral Drug Costs Counterfeit Drugs

Smoking Cessation and Incident Cardiovascular Disease.

Jun Hwan Cho, Seung Yong Shin, Hoseob Kim et al.
1.00
Humans Male Smoking Cessation Cardiovascular Diseases Female
Humans United States Aged Cross-Sectional Studies Medicare Part C
1.00
Humans Yoga Low Back Pain Female Male

Classifications MeSH