Moving translational mass spectrometry imaging towards transparent and reproducible data analyses: a case study of an urothelial cancer cohort analyzed in the Galaxy framework.
Bladder
Formalin-fixed paraffin-embedded tissues
MALDI imaging
Mass spectrometry imaging
Reproducibility
Spatial proteomics
Urothelial cancer
Urothelial tissue
Journal
Clinical proteomics
ISSN: 1542-6416
Titre abrégé: Clin Proteomics
Pays: England
ID NLM: 101184586
Informations de publication
Date de publication:
19 Apr 2022
19 Apr 2022
Historique:
received:
12
08
2021
accepted:
04
04
2022
entrez:
20
4
2022
pubmed:
21
4
2022
medline:
21
4
2022
Statut:
epublish
Résumé
Mass spectrometry imaging (MSI) derives spatial molecular distribution maps directly from clinical tissue specimens and thus bears great potential for assisting pathologists with diagnostic decisions or personalized treatments. Unfortunately, progress in translational MSI is often hindered by insufficient quality control and lack of reproducible data analysis. Raw data and analysis scripts are rarely publicly shared. Here, we demonstrate the application of the Galaxy MSI tool set for the reproducible analysis of a urothelial carcinoma dataset. Tryptic peptides were imaged in a cohort of 39 formalin-fixed, paraffin-embedded human urothelial cancer tissue cores with a MALDI-TOF/TOF device. The complete data analysis was performed in a fully transparent and reproducible manner on the European Galaxy Server. Annotations of tumor and stroma were performed by a pathologist and transferred to the MSI data to allow for supervised classifications of tumor vs. stroma tissue areas as well as for muscle-infiltrating and non-muscle infiltrating urothelial carcinomas. For putative peptide identifications, m/z features were matched to the MSiMass list. Rigorous quality control in combination with careful pre-processing enabled reduction of m/z shifts and intensity batch effects. High classification accuracy was found for both, tumor vs. stroma and muscle-infiltrating vs. non-muscle infiltrating urothelial tumors. Some of the most discriminative m/z features for each condition could be assigned a putative identity: stromal tissue was characterized by collagen peptides and tumor tissue by histone peptides. Immunohistochemistry confirmed an increased histone H2A abundance in the tumor compared to the stroma tissues. The muscle-infiltration status was distinguished via MSI by peptides from intermediate filaments such as cytokeratin 7 in non-muscle infiltrating carcinomas and vimentin in muscle-infiltrating urothelial carcinomas, which was confirmed by immunohistochemistry. To make the study fully reproducible and to advocate the criteria of FAIR (findability, accessibility, interoperability, and reusability) research data, we share the raw data, spectra annotations as well as all Galaxy histories and workflows. Data are available via ProteomeXchange with identifier PXD026459 and Galaxy results via https://github.com/foellmelanie/Bladder_MSI_Manuscript_Galaxy_links . Here, we show that translational MSI data analysis in a fully transparent and reproducible manner is possible and we would like to encourage the community to join our efforts.
Sections du résumé
BACKGROUND
BACKGROUND
Mass spectrometry imaging (MSI) derives spatial molecular distribution maps directly from clinical tissue specimens and thus bears great potential for assisting pathologists with diagnostic decisions or personalized treatments. Unfortunately, progress in translational MSI is often hindered by insufficient quality control and lack of reproducible data analysis. Raw data and analysis scripts are rarely publicly shared. Here, we demonstrate the application of the Galaxy MSI tool set for the reproducible analysis of a urothelial carcinoma dataset.
METHODS
METHODS
Tryptic peptides were imaged in a cohort of 39 formalin-fixed, paraffin-embedded human urothelial cancer tissue cores with a MALDI-TOF/TOF device. The complete data analysis was performed in a fully transparent and reproducible manner on the European Galaxy Server. Annotations of tumor and stroma were performed by a pathologist and transferred to the MSI data to allow for supervised classifications of tumor vs. stroma tissue areas as well as for muscle-infiltrating and non-muscle infiltrating urothelial carcinomas. For putative peptide identifications, m/z features were matched to the MSiMass list.
RESULTS
RESULTS
Rigorous quality control in combination with careful pre-processing enabled reduction of m/z shifts and intensity batch effects. High classification accuracy was found for both, tumor vs. stroma and muscle-infiltrating vs. non-muscle infiltrating urothelial tumors. Some of the most discriminative m/z features for each condition could be assigned a putative identity: stromal tissue was characterized by collagen peptides and tumor tissue by histone peptides. Immunohistochemistry confirmed an increased histone H2A abundance in the tumor compared to the stroma tissues. The muscle-infiltration status was distinguished via MSI by peptides from intermediate filaments such as cytokeratin 7 in non-muscle infiltrating carcinomas and vimentin in muscle-infiltrating urothelial carcinomas, which was confirmed by immunohistochemistry. To make the study fully reproducible and to advocate the criteria of FAIR (findability, accessibility, interoperability, and reusability) research data, we share the raw data, spectra annotations as well as all Galaxy histories and workflows. Data are available via ProteomeXchange with identifier PXD026459 and Galaxy results via https://github.com/foellmelanie/Bladder_MSI_Manuscript_Galaxy_links .
CONCLUSION
CONCLUSIONS
Here, we show that translational MSI data analysis in a fully transparent and reproducible manner is possible and we would like to encourage the community to join our efforts.
Identifiants
pubmed: 35439943
doi: 10.1186/s12014-022-09347-z
pii: 10.1186/s12014-022-09347-z
pmc: PMC9016955
doi:
Types de publication
Journal Article
Langues
eng
Pagination
8Subventions
Organisme : German-Israeli Foundation for Scientific Research and Development
ID : 1444
Organisme : Bundesministerium für Bildung und Forschung
ID : 01KU1916
Organisme : Bundesministerium für Bildung und Forschung
ID : 01KU1915A
Organisme : Deutsche Forschungsgemeinschaft
ID : PA 2807/3-1
Organisme : Deutschen Konsortium für Translationale Krebsforschung
ID : Im- pro-Rec
Organisme : Deutsche Forschungsgemeinschaft
ID : INST 39/766-3 (Z1)
Organisme : Deutsche Forschungsgemeinschaft
ID : SCHI 871/15-1
Organisme : Deutsche Forschungsgemeinschaft
ID : GR 4553/5-1
Organisme : Foundation for the National Institutes of Health
ID : 1R01LM013115
Organisme : NLM NIH HHS
ID : R01 LM013115
Pays : United States
Organisme : Deutsche Forschungsgemeinschaft
ID : INST 39/1244-1 (P12)
Organisme : NSF-BIO/DBI
ID : 1950412
Organisme : Deutsche Forschungsgemeinschaft
ID : GRK 2606 "ProtPath"
Organisme : Deutsche Forschungsgemeinschaft
ID : SCHI 871/11- 1
Informations de copyright
© 2022. The Author(s).
Références
J Proteome Res. 2012 Mar 2;11(3):1996-2003
pubmed: 22224404
Nat Rev Urol. 2013 Jul;10(7):386-95
pubmed: 23670183
Anal Chim Acta. 2008 Oct 3;627(1):71-81
pubmed: 18790129
J Pers Med. 2020 Jun 22;10(2):
pubmed: 32580362
Nucleic Acids Res. 2018 Jul 2;46(W1):W537-W544
pubmed: 29790989
Analyst. 2017 Jul 24;142(15):2690-2712
pubmed: 28642940
Proteomics Clin Appl. 2019 Jan;13(1):e1800029
pubmed: 30408343
Eur Urol. 2006 May;49(5):798-805
pubmed: 16442207
Proteomics Clin Appl. 2014 Oct;8(9-10):796-804
pubmed: 24888792
Clin Cancer Res. 2009 Sep 1;15(17):5541-51
pubmed: 19690195
Bioinformatics. 2015 Jul 15;31(14):2418-20
pubmed: 25777525
J Invest Dermatol. 2002 Nov;119(5):1137-49
pubmed: 12445204
Mol Cell Proteomics. 2016 Oct;15(10):3081-3089
pubmed: 27473201
J Pathol. 2015 Jan;235(1):3-13
pubmed: 25201776
Adv Cancer Res. 2017;134:27-66
pubmed: 28110654
Chimia (Aarau). 2014;68(3):146-9
pubmed: 24801845
Mol Cell Proteomics. 2016 May;15(5):1761-72
pubmed: 26796117
J Proteome Res. 2010 Sep 3;9(9):4315-28
pubmed: 20597554
J Proteomics. 2012 Aug 30;75(16):5093-5105
pubmed: 22634080
Anal Bioanal Chem. 2018 Sep;410(23):5969-5980
pubmed: 29968108
Nucleic Acids Res. 2016 Jan 4;44(D1):D447-56
pubmed: 26527722
Gigascience. 2019 Dec 1;8(12):
pubmed: 31816088
J Proteome Res. 2014 Feb 7;13(2):1138-42
pubmed: 24313301
Anal Bioanal Chem. 2011 Jul;401(1):167-81
pubmed: 21479971
Oncotarget. 2020 Nov 03;11(44):3998-4015
pubmed: 33216824
Sci Data. 2016 Mar 15;3:160018
pubmed: 26978244
Proteomics. 2016 Jun;16(11-12):1793-801
pubmed: 27061135
Lab Invest. 2015 Apr;95(4):422-31
pubmed: 25621874
Proteomics Clin Appl. 2019 Jan;13(1):e1700173
pubmed: 30411850
Exp Cell Res. 2007 Jun 10;313(10):2050-62
pubmed: 17512929
Int J Mol Sci. 2017 Sep 15;18(9):
pubmed: 28914774
J Proteome Res. 2010 May 7;9(5):2182-90
pubmed: 20141219
Mol Cell Proteomics. 2014 Dec;13(12):3497-506
pubmed: 25225357
Arch Pathol Lab Med. 2001 Jul;125(7):921-3
pubmed: 11419977
Gigascience. 2018 Oct 1;7(10):
pubmed: 30124809
Biochim Biophys Acta Proteins Proteom. 2017 Jul;1865(7):907-915
pubmed: 27599305
Anal Chem. 2012 Feb 7;84(3):1310-9
pubmed: 22148759