Dynamic Region of Interest Selection in Remote Photoplethysmography: Proof-of-Concept Study.
algorithm
biomedical sensing
contactless vital sign measurement
facial camera PPG
machine learning
mobile app
region of interest (ROI)
remote photoplethysmography
signal processing
skin tone
smart device
vital sign
vital sign measurement
Journal
JMIR formative research
ISSN: 2561-326X
Titre abrégé: JMIR Form Res
Pays: Canada
ID NLM: 101726394
Informations de publication
Date de publication:
30 Mar 2023
30 Mar 2023
Historique:
received:
24
11
2022
accepted:
08
02
2023
revised:
08
02
2023
medline:
31
3
2023
entrez:
30
3
2023
pubmed:
31
3
2023
Statut:
epublish
Résumé
Remote photoplethysmography (rPPG) can record vital signs (VSs) by detecting subtle changes in the light reflected from the skin. Lifelight (Xim Ltd) is a novel software being developed as a medical device for the contactless measurement of VSs using rPPG via integral cameras on smart devices. Research to date has focused on extracting the pulsatile VS from the raw signal, which can be influenced by factors such as ambient light, skin thickness, facial movements, and skin tone. This preliminary proof-of-concept study outlines a dynamic approach to rPPG signal processing wherein green channel signals from the most relevant areas of the face (the midface, comprising the cheeks, nose, and top of the lip) are optimized for each subject using tiling and aggregation (T&A) algorithms. High-resolution 60-second videos were recorded during the VISION-MD study. The midface was divided into 62 tiles of 20×20 pixels, and the signals from multiple tiles were evaluated using bespoke algorithms through weighting according to signal-to-noise ratio in the frequency domain (SNR-F) score or segmentation. Midface signals before and after T&A were categorized by a trained observer blinded to the data processing as 0 (high quality, suitable for algorithm training), 1 (suitable for algorithm testing), or 2 (inadequate quality). On secondary analysis, observer categories were compared for signals predicted to improve categories following T&A based on the SNR-F score. Observer ratings and SNR-F scores were also compared before and after T&A for Fitzpatrick skin tones 5 and 6, wherein rPPG is hampered by light absorption by melanin. The analysis used 4310 videos recorded from 1315 participants. Category 2 and 1 signals had lower mean SNR-F scores than category 0 signals. T&A improved the mean SNR-F score using all algorithms. Depending on the algorithm, 18% (763/4212) to 31% (1306/4212) of signals improved by at least one category, with up to 10% (438/4212) improving into category 0, and 67% (2834/4212) to 79% (3337/4212) remaining in the same category. Importantly, 9% (396/4212) to 21% (875/4212) improved from category 2 (not usable) into category 1. All algorithms showed improvements. No more than 3% (137/4212) of signals were assigned to a lower-quality category following T&A. On secondary analysis, 62% of signals (32/52) were recategorized, as predicted from the SNR-F score. T&A improved SNR-F scores in darker skin tones; 41% of signals (151/369) improved from category 2 to 1 and 12% (44/369) from category 1 to 0. The T&A approach to dynamic region of interest selection improved signal quality, including in dark skin tones. The method was verified by comparison with a trained observer's rating. T&A could overcome factors that compromise whole-face rPPG. This method's performance in estimating VS is currently being assessed. ClinicalTrials.gov NCT04763746; https://clinicaltrials.gov/ct2/show/NCT04763746.
Sections du résumé
BACKGROUND
BACKGROUND
Remote photoplethysmography (rPPG) can record vital signs (VSs) by detecting subtle changes in the light reflected from the skin. Lifelight (Xim Ltd) is a novel software being developed as a medical device for the contactless measurement of VSs using rPPG via integral cameras on smart devices. Research to date has focused on extracting the pulsatile VS from the raw signal, which can be influenced by factors such as ambient light, skin thickness, facial movements, and skin tone.
OBJECTIVE
OBJECTIVE
This preliminary proof-of-concept study outlines a dynamic approach to rPPG signal processing wherein green channel signals from the most relevant areas of the face (the midface, comprising the cheeks, nose, and top of the lip) are optimized for each subject using tiling and aggregation (T&A) algorithms.
METHODS
METHODS
High-resolution 60-second videos were recorded during the VISION-MD study. The midface was divided into 62 tiles of 20×20 pixels, and the signals from multiple tiles were evaluated using bespoke algorithms through weighting according to signal-to-noise ratio in the frequency domain (SNR-F) score or segmentation. Midface signals before and after T&A were categorized by a trained observer blinded to the data processing as 0 (high quality, suitable for algorithm training), 1 (suitable for algorithm testing), or 2 (inadequate quality). On secondary analysis, observer categories were compared for signals predicted to improve categories following T&A based on the SNR-F score. Observer ratings and SNR-F scores were also compared before and after T&A for Fitzpatrick skin tones 5 and 6, wherein rPPG is hampered by light absorption by melanin.
RESULTS
RESULTS
The analysis used 4310 videos recorded from 1315 participants. Category 2 and 1 signals had lower mean SNR-F scores than category 0 signals. T&A improved the mean SNR-F score using all algorithms. Depending on the algorithm, 18% (763/4212) to 31% (1306/4212) of signals improved by at least one category, with up to 10% (438/4212) improving into category 0, and 67% (2834/4212) to 79% (3337/4212) remaining in the same category. Importantly, 9% (396/4212) to 21% (875/4212) improved from category 2 (not usable) into category 1. All algorithms showed improvements. No more than 3% (137/4212) of signals were assigned to a lower-quality category following T&A. On secondary analysis, 62% of signals (32/52) were recategorized, as predicted from the SNR-F score. T&A improved SNR-F scores in darker skin tones; 41% of signals (151/369) improved from category 2 to 1 and 12% (44/369) from category 1 to 0.
CONCLUSIONS
CONCLUSIONS
The T&A approach to dynamic region of interest selection improved signal quality, including in dark skin tones. The method was verified by comparison with a trained observer's rating. T&A could overcome factors that compromise whole-face rPPG. This method's performance in estimating VS is currently being assessed.
TRIAL REGISTRATION
BACKGROUND
ClinicalTrials.gov NCT04763746; https://clinicaltrials.gov/ct2/show/NCT04763746.
Identifiants
pubmed: 36995742
pii: v7i1e44575
doi: 10.2196/44575
pmc: PMC10131655
doi:
Banques de données
ClinicalTrials.gov
['NCT04763746']
Types de publication
Journal Article
Langues
eng
Pagination
e44575Informations de copyright
©Adam Kiddle, Helen Barham, Simon Wegerif, Connie Petronzio. Originally published in JMIR Formative Research (https://formative.jmir.org), 30.03.2023.
Références
Med J Aust. 1999 Jul 5;171(1):22-5
pubmed: 10451667
Breast. 2020 Feb;49:267-273
pubmed: 31935669
Annu Int Conf IEEE Eng Med Biol Soc. 2015 Aug;2015:2758-61
pubmed: 26736863
J Ambient Intell Humaniz Comput. 2023;14(7):8871-8880
pubmed: 35043065
J Med Imaging (Bellingham). 2020 Sep;7(5):057501
pubmed: 33062803
NPJ Digit Med. 2019 Jun 26;2:60
pubmed: 31388564
Telemed J E Health. 2017 Aug;23(8):678-683
pubmed: 28140834
Bioengineering (Basel). 2016 Sep 22;3(4):
pubmed: 28952584
Physiol Meas. 2019 Feb 26;40(2):025006
pubmed: 30699397
Eur J Appl Physiol. 2013 Apr;113(4):1035-41
pubmed: 23064980
NPJ Digit Med. 2021 Feb 26;4(1):38
pubmed: 33637822
JMIR Res Protoc. 2021 Jan 28;10(1):e14326
pubmed: 33507157
Comput Biol Med. 2022 Feb;141:105146
pubmed: 34942393
JMIR Res Protoc. 2023 Jan 11;12:e41533
pubmed: 36630158
Sensors (Basel). 2021 Nov 27;21(23):
pubmed: 34883926
J Clin Monit Comput. 2000;16(4):309-15
pubmed: 12578078
Med Oral Patol Oral Cir Bucal. 2016 Mar 01;21(2):e135-41
pubmed: 26827055
Annu Int Conf IEEE Eng Med Biol Soc. 2015 Aug;2015:4938-41
pubmed: 26737399
Sci Rep. 2022 Jan 7;12(1):281
pubmed: 34996908
JMIR Form Res. 2022 Nov 14;6(11):e36340
pubmed: 36374541
IEEE Trans Nucl Sci. 2013 Jun;60(3):1609-1618
pubmed: 25346545
Aesthet Surg J. 2015 Nov;35(8):1007-13
pubmed: 26508650
J Clin Monit Comput. 1999 Dec;15(7-8):461-7
pubmed: 12578044
IEEE Trans Biomed Eng. 2013 Oct;60(10):2878-86
pubmed: 23744659
Anesth Analg. 2002 Jan;94(1 Suppl):S1-3
pubmed: 11900029
Head Neck. 2019 Jul;41(7):2065-2073
pubmed: 30684276