Microarray Normalization Revisited for Reproducible Breast Cancer Biomarkers.


Journal

BioMed research international
ISSN: 2314-6141
Titre abrégé: Biomed Res Int
Pays: United States
ID NLM: 101600173

Informations de publication

Date de publication:
2020
Historique:
received: 04 12 2019
revised: 30 03 2020
accepted: 11 05 2020
entrez: 25 8 2020
pubmed: 25 8 2020
medline: 24 4 2021
Statut: epublish

Résumé

Precision medicine for breast cancer relies on biomarkers to select therapies. However, the reliability of biomarkers drawn from gene expression arrays has been questioned and calls for reassessment, in particular for large datasets. We revisit widely used data-normalization procedures and evaluate differences in outcome in order to pinpoint the most reliable reprocessing methods biomarkers can be based upon. We generated a database of 3753 breast cancer patients out of 38 studies by downloading and curating patient samples from NCBI-GEO. As gene-expression biomarkers, we select the assessment of receptor status and breast cancer subtype classification. Each normalization procedure is applied separately, and biomarkers are then evaluated for each patient. Differences between normalization pipelines are quantified as percentages of patients having outcomes different for each pipeline. Some normalization procedures lead to quite consistent biomarkers, differing only in 1-2% of patients. Other normalization procedures-some of them have been used in many clinical studies-end up with distrusting discrepancies (10% and more). A good deal of doubt regarding the reliability of microarrays may root in the haphazard application of inadequate preprocessing pipelines. Several modes of batch corrections are evaluated regarding a possible improvement of receptor prediction from gene expression versus the golden standard of immunohistochemistry. Finally, we nominate those normalization methods yielding consistent and trustable results. Adequate bioinformatics data preprocessing is key and crucial for any subsequent statistics to arrive at trustable results. We conclude with a suggestion for future bioinformatics development to further increase the reliability of cancer biomarkers.

Identifiants

pubmed: 32832541
doi: 10.1155/2020/1363827
pmc: PMC7428878
doi:

Substances chimiques

Biomarkers, Tumor 0

Types de publication

Journal Article

Langues

eng

Sous-ensembles de citation

IM

Pagination

1363827

Informations de copyright

Copyright © 2020 Michael Kenn et al.

Déclaration de conflit d'intérêts

The authors declare that there are no conflicts of interests regarding the publication of this paper.

Références

Bioinformatics. 2012 Mar 15;28(6):882-3
pubmed: 22257669
BMC Genomics. 2020 Jan 2;21(1):6
pubmed: 31898477
Bioinformatics. 2016 Apr 1;32(7):1097-9
pubmed: 26607490
Hum Genomics Proteomics. 2009 Jan 12;2009:
pubmed: 20948564
Nat Biotechnol. 2006 Sep;24(9):1115-22
pubmed: 16964225
J Clin Oncol. 2008 Mar 1;26(7):1186-7; author reply 1187-8
pubmed: 18309960
Breast Cancer Res Treat. 2018 Nov;172(2):313-326
pubmed: 30117066
Nat Biotechnol. 2004 Jun;22(6):656-8; author reply 658
pubmed: 15175677
Biostatistics. 2010 Apr;11(2):242-53
pubmed: 20097884
Nat Rev Genet. 2013 Feb;14(2):89-99
pubmed: 23269463
J Natl Cancer Inst. 2012 Feb 22;104(4):311-25
pubmed: 22262870
PLoS One. 2016 Jun 07;11(6):e0156594
pubmed: 27272489
Bioinformatics. 2006 Apr 1;22(7):789-94
pubmed: 16410320
Biostatistics. 2007 Jan;8(1):118-27
pubmed: 16632515
Bioinformatics. 2007 Jul 1;23(13):i282-8
pubmed: 17646307
Nucleic Acids Res. 2002 Jan 1;30(1):207-10
pubmed: 11752295
Nat Biotechnol. 2003 Jul;21(7):818-21
pubmed: 12794640
Biostatistics. 2003 Apr;4(2):249-64
pubmed: 12925520
PLoS One. 2014 Jun 06;9(6):e98879
pubmed: 24905456
Bioinformatics. 2004 Feb 12;20(3):307-15
pubmed: 14960456
Oncotarget. 2017 Aug 24;8(44):77341-77359
pubmed: 29100391
Nat Biotechnol. 2010 Aug;28(8):761
pubmed: 20697381
Nucleic Acids Res. 2003 Feb 15;31(4):e15
pubmed: 12582260
Stat Methods Med Res. 1999 Jun;8(2):135-60
pubmed: 10501650
Phys Rev E Stat Nonlin Soft Matter Phys. 2002 Apr;65(4 Pt 1):040902
pubmed: 12005798
PeerJ. 2014 Sep 23;2:e561
pubmed: 25332844
Microarrays (Basel). 2015 Aug 21;4(3):389-406
pubmed: 27600230
Front Genet. 2018 Mar 16;9:83
pubmed: 29616078
Biostatistics. 2016 Jan;17(1):29-39
pubmed: 26272994
Lancet. 2005 Feb 19-25;365(9460):671-9
pubmed: 15721472
PLoS One. 2014 Jun 18;9(6):e99380
pubmed: 24941114
BMC Med Genomics. 2008 Oct 21;1:52
pubmed: 18940004
Nucleic Acids Res. 2014 Jan;42(Database issue):D938-43
pubmed: 24271388
Int J Technol Assess Health Care. 2017 Jan;33(1):32-45
pubmed: 28486999
Trends Genet. 2006 Feb;22(2):101-9
pubmed: 16380191
Nat Biotechnol. 2006 Sep;24(9):1123-31
pubmed: 16964226
Sci Rep. 2018 Apr 20;8(1):6351
pubmed: 29679021
PLoS Genet. 2007 Sep;3(9):1724-35
pubmed: 17907809
N Engl J Med. 2002 Dec 19;347(25):1999-2009
pubmed: 12490681
J Comput Biol. 2005 Jul-Aug;12(6):882-93
pubmed: 16108723
Bioinformatics. 2003 Jan 22;19(2):185-93
pubmed: 12538238
Nat Genet. 2002 Dec;32 Suppl:496-501
pubmed: 12454644
Pharmacogenomics J. 2010 Aug;10(4):278-91
pubmed: 20676067
BMC Bioinformatics. 2013 May 07;14:153
pubmed: 23647742
Nat Rev Genet. 2010 Oct;11(10):733-9
pubmed: 20838408

Auteurs

Michael Kenn (M)

Section of Biosimulation and Bioinformatics, Center for Medical Statistics, Informatics and Intelligent Systems (CeMSIIS), Medical University of Vienna, Spitalgasse 23, 1090 Vienna, Austria.

Dan Cacsire Castillo-Tong (D)

Translational Gynecology Group, Department of Obstetrics and Gynecology, Comprehensive Cancer Center, Medical University of Vienna, Waehringer Guertel 18-20, 1090 Vienna, Austria.

Christian F Singer (CF)

Department of General Gynecology and Gynecologic Oncology and Comprehensive Cancer Center, Medical University of Vienna, Waehringer Guertel 18-20, 1090 Vienna, Austria.

Michael Cibena (M)

Section of Biosimulation and Bioinformatics, Center for Medical Statistics, Informatics and Intelligent Systems (CeMSIIS), Medical University of Vienna, Spitalgasse 23, 1090 Vienna, Austria.

Heinz Kölbl (H)

Department of General Gynecology and Gynecologic Oncology and Comprehensive Cancer Center, Medical University of Vienna, Waehringer Guertel 18-20, 1090 Vienna, Austria.

Wolfgang Schreiner (W)

Section of Biosimulation and Bioinformatics, Center for Medical Statistics, Informatics and Intelligent Systems (CeMSIIS), Medical University of Vienna, Spitalgasse 23, 1090 Vienna, Austria.

Articles similaires

[Redispensing of expensive oral anticancer medicines: a practical application].

Lisanne N van Merendonk, Kübra Akgöl, Bastiaan Nuijen
1.00
Humans Antineoplastic Agents Administration, Oral Drug Costs Counterfeit Drugs

Smoking Cessation and Incident Cardiovascular Disease.

Jun Hwan Cho, Seung Yong Shin, Hoseob Kim et al.
1.00
Humans Male Smoking Cessation Cardiovascular Diseases Female
Humans United States Aged Cross-Sectional Studies Medicare Part C
1.00
Humans Yoga Low Back Pain Female Male

Classifications MeSH