Dynamic Region of Interest Selection in Remote Photoplethysmography: Proof-of-Concept Study.

algorithm biomedical sensing contactless vital sign measurement facial camera PPG machine learning mobile app region of interest (ROI) remote photoplethysmography signal processing skin tone smart device vital sign vital sign measurement

Journal

JMIR formative research
ISSN: 2561-326X
Titre abrégé: JMIR Form Res
Pays: Canada
ID NLM: 101726394

Informations de publication

Date de publication:
30 Mar 2023
Historique:
received: 24 11 2022
accepted: 08 02 2023
revised: 08 02 2023
medline: 31 3 2023
entrez: 30 3 2023
pubmed: 31 3 2023
Statut: epublish

Résumé

Remote photoplethysmography (rPPG) can record vital signs (VSs) by detecting subtle changes in the light reflected from the skin. Lifelight (Xim Ltd) is a novel software being developed as a medical device for the contactless measurement of VSs using rPPG via integral cameras on smart devices. Research to date has focused on extracting the pulsatile VS from the raw signal, which can be influenced by factors such as ambient light, skin thickness, facial movements, and skin tone. This preliminary proof-of-concept study outlines a dynamic approach to rPPG signal processing wherein green channel signals from the most relevant areas of the face (the midface, comprising the cheeks, nose, and top of the lip) are optimized for each subject using tiling and aggregation (T&A) algorithms. High-resolution 60-second videos were recorded during the VISION-MD study. The midface was divided into 62 tiles of 20×20 pixels, and the signals from multiple tiles were evaluated using bespoke algorithms through weighting according to signal-to-noise ratio in the frequency domain (SNR-F) score or segmentation. Midface signals before and after T&A were categorized by a trained observer blinded to the data processing as 0 (high quality, suitable for algorithm training), 1 (suitable for algorithm testing), or 2 (inadequate quality). On secondary analysis, observer categories were compared for signals predicted to improve categories following T&A based on the SNR-F score. Observer ratings and SNR-F scores were also compared before and after T&A for Fitzpatrick skin tones 5 and 6, wherein rPPG is hampered by light absorption by melanin. The analysis used 4310 videos recorded from 1315 participants. Category 2 and 1 signals had lower mean SNR-F scores than category 0 signals. T&A improved the mean SNR-F score using all algorithms. Depending on the algorithm, 18% (763/4212) to 31% (1306/4212) of signals improved by at least one category, with up to 10% (438/4212) improving into category 0, and 67% (2834/4212) to 79% (3337/4212) remaining in the same category. Importantly, 9% (396/4212) to 21% (875/4212) improved from category 2 (not usable) into category 1. All algorithms showed improvements. No more than 3% (137/4212) of signals were assigned to a lower-quality category following T&A. On secondary analysis, 62% of signals (32/52) were recategorized, as predicted from the SNR-F score. T&A improved SNR-F scores in darker skin tones; 41% of signals (151/369) improved from category 2 to 1 and 12% (44/369) from category 1 to 0. The T&A approach to dynamic region of interest selection improved signal quality, including in dark skin tones. The method was verified by comparison with a trained observer's rating. T&A could overcome factors that compromise whole-face rPPG. This method's performance in estimating VS is currently being assessed. ClinicalTrials.gov NCT04763746; https://clinicaltrials.gov/ct2/show/NCT04763746.

Sections du résumé

BACKGROUND BACKGROUND
Remote photoplethysmography (rPPG) can record vital signs (VSs) by detecting subtle changes in the light reflected from the skin. Lifelight (Xim Ltd) is a novel software being developed as a medical device for the contactless measurement of VSs using rPPG via integral cameras on smart devices. Research to date has focused on extracting the pulsatile VS from the raw signal, which can be influenced by factors such as ambient light, skin thickness, facial movements, and skin tone.
OBJECTIVE OBJECTIVE
This preliminary proof-of-concept study outlines a dynamic approach to rPPG signal processing wherein green channel signals from the most relevant areas of the face (the midface, comprising the cheeks, nose, and top of the lip) are optimized for each subject using tiling and aggregation (T&A) algorithms.
METHODS METHODS
High-resolution 60-second videos were recorded during the VISION-MD study. The midface was divided into 62 tiles of 20×20 pixels, and the signals from multiple tiles were evaluated using bespoke algorithms through weighting according to signal-to-noise ratio in the frequency domain (SNR-F) score or segmentation. Midface signals before and after T&A were categorized by a trained observer blinded to the data processing as 0 (high quality, suitable for algorithm training), 1 (suitable for algorithm testing), or 2 (inadequate quality). On secondary analysis, observer categories were compared for signals predicted to improve categories following T&A based on the SNR-F score. Observer ratings and SNR-F scores were also compared before and after T&A for Fitzpatrick skin tones 5 and 6, wherein rPPG is hampered by light absorption by melanin.
RESULTS RESULTS
The analysis used 4310 videos recorded from 1315 participants. Category 2 and 1 signals had lower mean SNR-F scores than category 0 signals. T&A improved the mean SNR-F score using all algorithms. Depending on the algorithm, 18% (763/4212) to 31% (1306/4212) of signals improved by at least one category, with up to 10% (438/4212) improving into category 0, and 67% (2834/4212) to 79% (3337/4212) remaining in the same category. Importantly, 9% (396/4212) to 21% (875/4212) improved from category 2 (not usable) into category 1. All algorithms showed improvements. No more than 3% (137/4212) of signals were assigned to a lower-quality category following T&A. On secondary analysis, 62% of signals (32/52) were recategorized, as predicted from the SNR-F score. T&A improved SNR-F scores in darker skin tones; 41% of signals (151/369) improved from category 2 to 1 and 12% (44/369) from category 1 to 0.
CONCLUSIONS CONCLUSIONS
The T&A approach to dynamic region of interest selection improved signal quality, including in dark skin tones. The method was verified by comparison with a trained observer's rating. T&A could overcome factors that compromise whole-face rPPG. This method's performance in estimating VS is currently being assessed.
TRIAL REGISTRATION BACKGROUND
ClinicalTrials.gov NCT04763746; https://clinicaltrials.gov/ct2/show/NCT04763746.

Identifiants

pubmed: 36995742
pii: v7i1e44575
doi: 10.2196/44575
pmc: PMC10131655
doi:

Banques de données

ClinicalTrials.gov
['NCT04763746']

Types de publication

Journal Article

Langues

eng

Pagination

e44575

Informations de copyright

©Adam Kiddle, Helen Barham, Simon Wegerif, Connie Petronzio. Originally published in JMIR Formative Research (https://formative.jmir.org), 30.03.2023.

Références

Med J Aust. 1999 Jul 5;171(1):22-5
pubmed: 10451667
Breast. 2020 Feb;49:267-273
pubmed: 31935669
Annu Int Conf IEEE Eng Med Biol Soc. 2015 Aug;2015:2758-61
pubmed: 26736863
J Ambient Intell Humaniz Comput. 2023;14(7):8871-8880
pubmed: 35043065
J Med Imaging (Bellingham). 2020 Sep;7(5):057501
pubmed: 33062803
NPJ Digit Med. 2019 Jun 26;2:60
pubmed: 31388564
Telemed J E Health. 2017 Aug;23(8):678-683
pubmed: 28140834
Bioengineering (Basel). 2016 Sep 22;3(4):
pubmed: 28952584
Physiol Meas. 2019 Feb 26;40(2):025006
pubmed: 30699397
Eur J Appl Physiol. 2013 Apr;113(4):1035-41
pubmed: 23064980
NPJ Digit Med. 2021 Feb 26;4(1):38
pubmed: 33637822
JMIR Res Protoc. 2021 Jan 28;10(1):e14326
pubmed: 33507157
Comput Biol Med. 2022 Feb;141:105146
pubmed: 34942393
JMIR Res Protoc. 2023 Jan 11;12:e41533
pubmed: 36630158
Sensors (Basel). 2021 Nov 27;21(23):
pubmed: 34883926
J Clin Monit Comput. 2000;16(4):309-15
pubmed: 12578078
Med Oral Patol Oral Cir Bucal. 2016 Mar 01;21(2):e135-41
pubmed: 26827055
Annu Int Conf IEEE Eng Med Biol Soc. 2015 Aug;2015:4938-41
pubmed: 26737399
Sci Rep. 2022 Jan 7;12(1):281
pubmed: 34996908
JMIR Form Res. 2022 Nov 14;6(11):e36340
pubmed: 36374541
IEEE Trans Nucl Sci. 2013 Jun;60(3):1609-1618
pubmed: 25346545
Aesthet Surg J. 2015 Nov;35(8):1007-13
pubmed: 26508650
J Clin Monit Comput. 1999 Dec;15(7-8):461-7
pubmed: 12578044
IEEE Trans Biomed Eng. 2013 Oct;60(10):2878-86
pubmed: 23744659
Anesth Analg. 2002 Jan;94(1 Suppl):S1-3
pubmed: 11900029
Head Neck. 2019 Jul;41(7):2065-2073
pubmed: 30684276

Auteurs

Adam Kiddle (A)

Xim Ltd, Southampton, United Kingdom.

Helen Barham (H)

The Text Doctor, Wantage, United Kingdom.

Simon Wegerif (S)

Xim Ltd, Southampton, United Kingdom.

Connie Petronzio (C)

Xim Ltd, Southampton, United Kingdom.

Classifications MeSH