Application of Aligned-UMAP to longitudinal biomedical studies.

Alzheimer's disease Parkinson's disease clinical data genomics iPSC longitudinal data machine learning proteomics time-series unsupervised learning

Journal

Patterns (New York, N.Y.)
ISSN: 2666-3899
Titre abrégé: Patterns (N Y)
Pays: United States
ID NLM: 101767765

Informations de publication

Date de publication:
09 Jun 2023
Historique:
received: 23 11 2022
revised: 02 02 2023
accepted: 07 04 2023
medline: 6 7 2023
pubmed: 6 7 2023
entrez: 6 7 2023
Statut: epublish

Résumé

High-dimensional data analysis starts with projecting the data to low dimensions to visualize and understand the underlying data structure. Several methods have been developed for dimensionality reduction, but they are limited to cross-sectional datasets. The recently proposed Aligned-UMAP, an extension of the uniform manifold approximation and projection (UMAP) algorithm, can visualize high-dimensional longitudinal datasets. We demonstrated its utility for researchers to identify exciting patterns and trajectories within enormous datasets in biological sciences. We found that the algorithm parameters also play a crucial role and must be tuned carefully to utilize the algorithm's potential fully. We also discussed key points to remember and directions for future extensions of Aligned-UMAP. Further, we made our code open source to enhance the reproducibility and applicability of our work. We believe our benchmarking study becomes more important as more and more high-dimensional longitudinal data in biomedical research become available.

Identifiants

pubmed: 37409055
doi: 10.1016/j.patter.2023.100741
pii: S2666-3899(23)00081-8
pmc: PMC10318357
doi:

Types de publication

Journal Article

Langues

eng

Pagination

100741

Déclaration de conflit d'intérêts

A.D., H.I., M.A.N., and F.F. declare the following competing financial interests, as their participation in this project was part of a competitive contract awarded to Data Tecnica International, LLC, by the NIH to support open science research. M.A.N. also currently serves on the scientific advisory board for Character Bio and is an advisor to Neuron23, Inc. The study’s funders had no role in the study design, data collection, data analysis, data interpretation, or writing of the report. All authors and the public can access all data and statistical programming code used in this project for the analyses and results generation. F.F. takes final responsibility for the decision to submit the paper for publication.

Références

Mov Disord. 2008 Nov 15;23(15):2129-70
pubmed: 19025984
Brain. 2023 May 16;:
pubmed: 37192343
PLoS One. 2019 Jul 8;14(7):e0218942
pubmed: 31283759
Sci Data. 2016 May 24;3:160035
pubmed: 27219127
Sci Rep. 2021 Apr 27;11(1):9068
pubmed: 33907199
Lancet Digit Health. 2022 May;4(5):e359-e369
pubmed: 35341712
Nat Commun. 2020 Jul 16;11(1):3559
pubmed: 32678092
Cell Rep Med. 2021 May 18;2(5):100287
pubmed: 33969320
NPJ Parkinsons Dis. 2022 Apr 1;8(1):35
pubmed: 35365675
Nat Commun. 2018 Nov 22;9(1):4931
pubmed: 30467425
Nat Biotechnol. 2018 Dec 03;:
pubmed: 30531897
Philos Trans A Math Phys Eng Sci. 2016 Apr 13;374(2065):20150202
pubmed: 26953178
J Hum Genet. 2021 Jan;66(1):85-91
pubmed: 33057159
NPJ Parkinsons Dis. 2022 Dec 16;8(1):172
pubmed: 36526647
J Am Geriatr Soc. 2005 Apr;53(4):695-9
pubmed: 15817019
Nat Biotechnol. 2017 Jun;35(6):551-560
pubmed: 28459448
J Am Med Inform Assoc. 2017 Nov 01;24(6):1142-1148
pubmed: 29016973

Auteurs

Anant Dadu (A)

Department of Computer Science, University of Illinois at Urbana-Champaign, Champaign, IL 61820, USA.
Center for Alzheimer's and Related Dementias (CARD), National Institute on Aging and National Institute of Neurological Disorders and Stroke, National Institutes of Health, Bethesda, MD 20892, USA.
Data Tecnica International, Washington, DC 20037, USA.

Vipul K Satone (VK)

Department of Industrial and Enterprise Systems Engineering, University of Illinois at Urbana-Champaign, Champaign, IL 61820, USA.

Rachneet Kaur (R)

Department of Industrial and Enterprise Systems Engineering, University of Illinois at Urbana-Champaign, Champaign, IL 61820, USA.

Mathew J Koretsky (MJ)

Center for Alzheimer's and Related Dementias (CARD), National Institute on Aging and National Institute of Neurological Disorders and Stroke, National Institutes of Health, Bethesda, MD 20892, USA.
Laboratory of Neurogenetics, National Institute on Aging, National Institutes of Health, Bethesda, MD 20892, USA.

Hirotaka Iwaki (H)

Center for Alzheimer's and Related Dementias (CARD), National Institute on Aging and National Institute of Neurological Disorders and Stroke, National Institutes of Health, Bethesda, MD 20892, USA.
Data Tecnica International, Washington, DC 20037, USA.
Laboratory of Neurogenetics, National Institute on Aging, National Institutes of Health, Bethesda, MD 20892, USA.

Yue A Qi (YA)

Center for Alzheimer's and Related Dementias (CARD), National Institute on Aging and National Institute of Neurological Disorders and Stroke, National Institutes of Health, Bethesda, MD 20892, USA.

Daniel M Ramos (DM)

Center for Alzheimer's and Related Dementias (CARD), National Institute on Aging and National Institute of Neurological Disorders and Stroke, National Institutes of Health, Bethesda, MD 20892, USA.

Brian Avants (B)

Invicro, Image Analysis, Needham, MA, USA.

Jacob Hesterman (J)

Invicro, Image Analysis, Needham, MA, USA.

Roger Gunn (R)

Invicro, Image Analysis, Needham, MA, USA.

Mark R Cookson (MR)

Center for Alzheimer's and Related Dementias (CARD), National Institute on Aging and National Institute of Neurological Disorders and Stroke, National Institutes of Health, Bethesda, MD 20892, USA.
Laboratory of Neurogenetics, National Institute on Aging, National Institutes of Health, Bethesda, MD 20892, USA.

Michael E Ward (ME)

Center for Alzheimer's and Related Dementias (CARD), National Institute on Aging and National Institute of Neurological Disorders and Stroke, National Institutes of Health, Bethesda, MD 20892, USA.
National Institute of Neurological Disorders and Stroke, National Institutes of Health, Bethesda, MD, USA.

Andrew B Singleton (AB)

Center for Alzheimer's and Related Dementias (CARD), National Institute on Aging and National Institute of Neurological Disorders and Stroke, National Institutes of Health, Bethesda, MD 20892, USA.
Laboratory of Neurogenetics, National Institute on Aging, National Institutes of Health, Bethesda, MD 20892, USA.

Roy H Campbell (RH)

Department of Computer Science, University of Illinois at Urbana-Champaign, Champaign, IL 61820, USA.

Mike A Nalls (MA)

Center for Alzheimer's and Related Dementias (CARD), National Institute on Aging and National Institute of Neurological Disorders and Stroke, National Institutes of Health, Bethesda, MD 20892, USA.
Data Tecnica International, Washington, DC 20037, USA.
Laboratory of Neurogenetics, National Institute on Aging, National Institutes of Health, Bethesda, MD 20892, USA.

Faraz Faghri (F)

Center for Alzheimer's and Related Dementias (CARD), National Institute on Aging and National Institute of Neurological Disorders and Stroke, National Institutes of Health, Bethesda, MD 20892, USA.
Data Tecnica International, Washington, DC 20037, USA.
Laboratory of Neurogenetics, National Institute on Aging, National Institutes of Health, Bethesda, MD 20892, USA.

Classifications MeSH