CHESS 3: an improved, comprehensive catalog of human genes and transcripts based on large-scale expression data, phylogenetic analysis, and protein structure.


Journal

Genome biology
ISSN: 1474-760X
Titre abrégé: Genome Biol
Pays: England
ID NLM: 100960660

Informations de publication

Date de publication:
30 10 2023
Historique:
received: 21 12 2022
accepted: 16 10 2023
medline: 1 11 2023
pubmed: 31 10 2023
entrez: 31 10 2023
Statut: epublish

Résumé

CHESS 3 represents an improved human gene catalog based on nearly 10,000 RNA-seq experiments across 54 body sites. It significantly improves current genome annotation by integrating the latest reference data and algorithms, machine learning techniques for noise filtering, and new protein structure prediction methods. CHESS 3 contains 41,356 genes, including 19,839 protein-coding genes and 158,377 transcripts, with 14,863 protein-coding transcripts not in other catalogs. It includes all MANE transcripts and at least one transcript for most RefSeq and GENCODE genes. On the CHM13 human genome, the CHESS 3 catalog contains an additional 129 protein-coding genes. CHESS 3 is available at http://ccb.jhu.edu/chess .

Identifiants

pubmed: 37904256
doi: 10.1186/s13059-023-03088-4
pii: 10.1186/s13059-023-03088-4
pmc: PMC10614308
doi:

Substances chimiques

Proteins 0

Types de publication

Journal Article Research Support, N.I.H., Extramural Research Support, U.S. Gov't, Non-P.H.S. Research Support, Non-U.S. Gov't

Langues

eng

Sous-ensembles de citation

IM

Pagination

249

Subventions

Organisme : NHGRI NIH HHS
ID : R01 HG006677
Pays : United States
Organisme : NIMH NIH HHS
ID : R01 MH123567
Pays : United States
Organisme : NIGMS NIH HHS
ID : R35 GM130151
Pays : United States

Informations de copyright

© 2023. The Author(s).

Références

Nucleic Acids Res. 2021 Jan 8;49(D1):D165-D171
pubmed: 33196801
Nature. 2021 Aug;596(7873):590-596
pubmed: 34293799
J Mol Diagn. 2022 Mar;24(3):219-223
pubmed: 35041928
Nat Methods. 2022 Jun;19(6):679-682
pubmed: 35637307
Biophys Rev. 2017 Jun;9(3):201-206
pubmed: 28510117
Nature. 2021 Aug;596(7873):583-589
pubmed: 34265844
Nature. 2022 Apr;604(7905):310-315
pubmed: 35388217
Bioinform Adv. 2022 Jan 09;2(1):vbab043
pubmed: 36699409
FASEB J. 2012 Jan;26(1):93-103
pubmed: 21940993
Bioinformatics. 2021 Jul 19;37(12):1639-1643
pubmed: 33320174
Trends Biochem Sci. 2017 Feb;42(2):98-110
pubmed: 27712956
Science. 2022 Apr;376(6588):eabl3533
pubmed: 35357935
Bioinformatics. 2010 Mar 15;26(6):841-2
pubmed: 20110278
Biol Sex Differ. 2020 Jul 21;11(1):42
pubmed: 32693839
Science. 2022 Apr;376(6588):44-53
pubmed: 35357919
Genome Res. 2020 Dec 23;:
pubmed: 33361112
Bioinformatics. 2021 Oct 25;37(20):3650-3651
pubmed: 33964128
Sci Data. 2020 Oct 5;7(1):326
pubmed: 33020484
Trends Biochem Sci. 2017 Jun;42(6):407-408
pubmed: 28483376
Nat Biotechnol. 2019 Aug;37(8):907-915
pubmed: 31375807
Nature. 2023 Oct;622(7981):41-47
pubmed: 37794265
Nucleic Acids Res. 2022 Jan 7;50(D1):D54-D59
pubmed: 34755885
Nucleic Acids Res. 2019 Jan 8;47(D1):D135-D139
pubmed: 30371849
Elife. 2022 Dec 15;11:
pubmed: 36519529
Nucleic Acids Res. 2023 Jan 6;51(D1):D942-D949
pubmed: 36420896
Annu Rev Genomics Hum Genet. 2022 Aug 31;23:153-172
pubmed: 35395170
Nucleic Acids Res. 2021 Jan 8;49(D1):D212-D220
pubmed: 33106848
Genome Res. 2002 Jun;12(6):996-1006
pubmed: 12045153
Genome Biol. 2018 Nov 28;19(1):208
pubmed: 30486838
Genome Biol. 2019 Dec 16;20(1):278
pubmed: 31842956
F1000Res. 2020 Apr 28;9:304
pubmed: 32489650
Nature. 2017 Mar 9;543(7644):199-204
pubmed: 28241135
Nat Genet. 2009 Oct;41(10):1061-7
pubmed: 19718026
Science. 2015 May 8;348(6235):648-60
pubmed: 25954001
Nucleic Acids Res. 2016 Jan 4;44(D1):D733-45
pubmed: 26553804
PLoS One. 2018 Dec 5;13(12):e0207531
pubmed: 30517151

Auteurs

Ales Varabyou (A)

Center for Computational Biology, Johns Hopkins University, Baltimore, MD, USA. ales.varabyou@jhu.edu.
Department of Computer Science, Johns Hopkins University, Baltimore, MD, USA. ales.varabyou@jhu.edu.
Department of Biomedical Engineering, Johns Hopkins School of Medicine and Whiting School of Engineering, Baltimore, MD, USA. ales.varabyou@jhu.edu.

Markus J Sommer (MJ)

Center for Computational Biology, Johns Hopkins University, Baltimore, MD, USA.
Department of Biomedical Engineering, Johns Hopkins School of Medicine and Whiting School of Engineering, Baltimore, MD, USA.

Beril Erdogdu (B)

Center for Computational Biology, Johns Hopkins University, Baltimore, MD, USA.
Department of Biomedical Engineering, Johns Hopkins School of Medicine and Whiting School of Engineering, Baltimore, MD, USA.

Ida Shinder (I)

Center for Computational Biology, Johns Hopkins University, Baltimore, MD, USA.
Cross Disciplinary Graduate Program in Biomedical Sciences, Johns Hopkins School of Medicine, Baltimore, MD, USA.

Ilia Minkin (I)

Center for Computational Biology, Johns Hopkins University, Baltimore, MD, USA.
Department of Biomedical Engineering, Johns Hopkins School of Medicine and Whiting School of Engineering, Baltimore, MD, USA.

Kuan-Hao Chao (KH)

Center for Computational Biology, Johns Hopkins University, Baltimore, MD, USA.
Department of Computer Science, Johns Hopkins University, Baltimore, MD, USA.

Sukhwan Park (S)

School of Biological Sciences, Seoul National University, Seoul, South Korea.
Artificial Intelligence Institute, Seoul National University, Seoul, South Korea.

Jakob Heinz (J)

Center for Computational Biology, Johns Hopkins University, Baltimore, MD, USA.
Department of Biomedical Engineering, Johns Hopkins School of Medicine and Whiting School of Engineering, Baltimore, MD, USA.

Christopher Pockrandt (C)

Center for Computational Biology, Johns Hopkins University, Baltimore, MD, USA.
Department of Biomedical Engineering, Johns Hopkins School of Medicine and Whiting School of Engineering, Baltimore, MD, USA.

Alaina Shumate (A)

Center for Computational Biology, Johns Hopkins University, Baltimore, MD, USA.
Department of Biomedical Engineering, Johns Hopkins School of Medicine and Whiting School of Engineering, Baltimore, MD, USA.

Natalia Rincon (N)

Center for Computational Biology, Johns Hopkins University, Baltimore, MD, USA.
Department of Biomedical Engineering, Johns Hopkins School of Medicine and Whiting School of Engineering, Baltimore, MD, USA.

Daniela Puiu (D)

Center for Computational Biology, Johns Hopkins University, Baltimore, MD, USA.
Department of Biomedical Engineering, Johns Hopkins School of Medicine and Whiting School of Engineering, Baltimore, MD, USA.

Martin Steinegger (M)

School of Biological Sciences, Seoul National University, Seoul, South Korea.
Artificial Intelligence Institute, Seoul National University, Seoul, South Korea.
Institute of Molecular Biology and Genetics, Seoul National University, Seoul, South Korea.

Steven L Salzberg (SL)

Center for Computational Biology, Johns Hopkins University, Baltimore, MD, USA. salzberg@jhu.edu.
Department of Computer Science, Johns Hopkins University, Baltimore, MD, USA. salzberg@jhu.edu.
Department of Biomedical Engineering, Johns Hopkins School of Medicine and Whiting School of Engineering, Baltimore, MD, USA. salzberg@jhu.edu.
Department of Genetic Medicine, Johns Hopkins School of Medicine, Baltimore, MD, USA. salzberg@jhu.edu.
Department of Biostatistics, Johns Hopkins University, Baltimore, MD, USA. salzberg@jhu.edu.

Mihaela Pertea (M)

Center for Computational Biology, Johns Hopkins University, Baltimore, MD, USA. mpertea@jhu.edu.
Department of Computer Science, Johns Hopkins University, Baltimore, MD, USA. mpertea@jhu.edu.
Department of Biomedical Engineering, Johns Hopkins School of Medicine and Whiting School of Engineering, Baltimore, MD, USA. mpertea@jhu.edu.
Department of Genetic Medicine, Johns Hopkins School of Medicine, Baltimore, MD, USA. mpertea@jhu.edu.

Articles similaires

Genome, Chloroplast Phylogeny Genetic Markers Base Composition High-Throughput Nucleotide Sequencing

[Redispensing of expensive oral anticancer medicines: a practical application].

Lisanne N van Merendonk, Kübra Akgöl, Bastiaan Nuijen
1.00
Humans Antineoplastic Agents Administration, Oral Drug Costs Counterfeit Drugs

Smoking Cessation and Incident Cardiovascular Disease.

Jun Hwan Cho, Seung Yong Shin, Hoseob Kim et al.
1.00
Humans Male Smoking Cessation Cardiovascular Diseases Female
Humans United States Aged Cross-Sectional Studies Medicare Part C

Classifications MeSH