Machine learning enables identification of an alternative yeast galactose utilization pathway.


Journal

Proceedings of the National Academy of Sciences of the United States of America
ISSN: 1091-6490
Titre abrégé: Proc Natl Acad Sci U S A
Pays: United States
ID NLM: 7505876

Informations de publication

Date de publication:
30 Apr 2024
Historique:
medline: 26 4 2024
pubmed: 26 4 2024
entrez: 26 4 2024
Statut: ppublish

Résumé

How genomic differences contribute to phenotypic differences is a major question in biology. The recently characterized genomes, isolation environments, and qualitative patterns of growth on 122 sources and conditions of 1,154 strains from 1,049 fungal species (nearly all known) in the yeast subphylum Saccharomycotina provide a powerful, yet complex, dataset for addressing this question. We used a random forest algorithm trained on these genomic, metabolic, and environmental data to predict growth on several carbon sources with high accuracy. Known structural genes involved in assimilation of these sources and presence/absence patterns of growth in other sources were important features contributing to prediction accuracy. By further examining growth on galactose, we found that it can be predicted with high accuracy from either genomic (92.2%) or growth data (82.6%) but not from isolation environment data (65.6%). Prediction accuracy was even higher (93.3%) when we combined genomic and growth data. After the

Identifiants

pubmed: 38669185
doi: 10.1073/pnas.2315314121
doi:

Substances chimiques

Galactose X2RN3Q8DNE

Types de publication

Journal Article Research Support, N.I.H., Extramural Research Support, Non-U.S. Gov't Research Support, U.S. Gov't, Non-P.H.S.

Langues

eng

Sous-ensembles de citation

IM

Pagination

e2315314121

Subventions

Organisme : NIAID NIH HHS
ID : R01 AI153356
Pays : United States

Déclaration de conflit d'intérêts

Competing interests statement:A.R. is a scientific consultant for LifeMine Therapeutics, Inc. The authors declare no other competing interests.

Auteurs

Marie-Claire Harrison (MC)

Department of Biological Sciences and Evolutionary Studies Initiative, Vanderbilt University, Nashville, TN 37235.

Emily J Ubbelohde (EJ)

Laboratory of Genetics, Department of Energy (DOE) Great Lakes Bioenergy Research Center, Center for Genomic Science Innovation, J. F. Crow Institute for the Study of Evolution, Wisconsin Energy Institute, University of Wisconsin-Madison, Madison, WI 53726.

Abigail L LaBella (AL)

Department of Biological Sciences and Evolutionary Studies Initiative, Vanderbilt University, Nashville, TN 37235.
Department of Bioinformatics and Genomics, University of North Carolina at Charlotte, Charlotte, NC 28262.

Dana A Opulente (DA)

Laboratory of Genetics, Department of Energy (DOE) Great Lakes Bioenergy Research Center, Center for Genomic Science Innovation, J. F. Crow Institute for the Study of Evolution, Wisconsin Energy Institute, University of Wisconsin-Madison, Madison, WI 53726.
Department of Biology, Villanova University, Villanova, PA 19085.

John F Wolters (JF)

Laboratory of Genetics, Department of Energy (DOE) Great Lakes Bioenergy Research Center, Center for Genomic Science Innovation, J. F. Crow Institute for the Study of Evolution, Wisconsin Energy Institute, University of Wisconsin-Madison, Madison, WI 53726.

Xiaofan Zhou (X)

Guangdong Province Key Laboratory of Microbial Signals and Disease Control, Integrative Microbiology Research Center, South China Agricultural University, Guangzhou 510642, China.

Xing-Xing Shen (XX)

Key Laboratory of Biology of Crop Pathogens and Insects of Zhejiang Province, Institute of Insect Sciences, College of Agriculture and Biotechnology, Zhejiang University, Hangzhou 310058, China.

Marizeth Groenewald (M)

Westerdijk Fungal Biodiversity Institute, Utrecht 3584, The Netherlands.

Chris Todd Hittinger (CT)

Laboratory of Genetics, Department of Energy (DOE) Great Lakes Bioenergy Research Center, Center for Genomic Science Innovation, J. F. Crow Institute for the Study of Evolution, Wisconsin Energy Institute, University of Wisconsin-Madison, Madison, WI 53726.

Antonis Rokas (A)

Department of Biological Sciences and Evolutionary Studies Initiative, Vanderbilt University, Nashville, TN 37235.

Articles similaires

Exploring blood-brain barrier passage using atomic weighted vector and machine learning.

Yoan Martínez-López, Paulina Phoobane, Yanaima Jauriga et al.
1.00
Blood-Brain Barrier Machine Learning Humans Support Vector Machine Software

Understanding the role of machine learning in predicting progression of osteoarthritis.

Simone Castagno, Benjamin Gompels, Estelle Strangmark et al.
1.00
Humans Disease Progression Machine Learning Osteoarthritis
Humans Artificial Intelligence Neoplasms Prognosis Image Processing, Computer-Assisted

Classifications MeSH