Display options
Share it on

Adv Bioinformatics. 2016;2016:3791214. doi: 10.1155/2016/3791214. Epub 2016 May 17.

A Support Vector Machine Classification of Thyroid Bioptic Specimens Using MALDI-MSI Data.

Advances in bioinformatics

Manuel Galli, Italo Zoppis, Gabriele De Sio, Clizia Chinello, Fabio Pagni, Fulvio Magni, Giancarlo Mauri

Affiliations

  1. Department of Medicine and Surgery, University of Milano-Bicocca, Via Cadore 48, 20900 Monza Brianza, Italy.
  2. Department of Informatics, Systems and Communication, University of Milano-Bicocca, Viale Sarca 336, 20125 Milan, Italy.
  3. Department of Surgery and Translational Medicine, Section of Pathology, University of Milano-Bicocca, Milan, Italy.

PMID: 27293431 PMCID: PMC4886047 DOI: 10.1155/2016/3791214

Abstract

Biomarkers able to characterise and predict multifactorial diseases are still one of the most important targets for all the "omics" investigations. In this context, Matrix-Assisted Laser Desorption/Ionisation-Mass Spectrometry Imaging (MALDI-MSI) has gained considerable attention in recent years, but it also led to a huge amount of complex data to be elaborated and interpreted. For this reason, computational and machine learning procedures for biomarker discovery are important tools to consider, both to reduce data dimension and to provide predictive markers for specific diseases. For instance, the availability of protein and genetic markers to support thyroid lesion diagnoses would impact deeply on society due to the high presence of undetermined reports (THY3) that are generally treated as malignant patients. In this paper we show how an accurate classification of thyroid bioptic specimens can be obtained through the application of a state-of-the-art machine learning approach (i.e., Support Vector Machines) on MALDI-MSI data, together with a particular wrapper feature selection algorithm (i.e., recursive feature elimination). The model is able to provide an accurate discriminatory capability using only 20 out of 144 features, resulting in an increase of the model performances, reliability, and computational efficiency. Finally, tissue areas rather than average proteomic profiles are classified, highlighting potential discriminating areas of clinical interest.

References

  1. J Integr Bioinform. 2012 Mar 21;9(1):189 - PubMed
  2. J Clin Bioinforma. 2014 Jan 24;4(1):2 - PubMed
  3. Int J Mass Spectrom. 2007 Feb 1;260(2-3):212-221 - PubMed
  4. Lab Invest. 2015 Apr;95(4):422-31 - PubMed
  5. Cytopathology. 2015 Oct;26(5):318-24 - PubMed
  6. J Proteomics. 2012 Aug 30;75(16):5106-10 - PubMed
  7. Proteomics. 2014 Apr;14(7-8):807-9 - PubMed
  8. Endocr Pathol. 2013 Dec;24(4):250-3 - PubMed
  9. Anal Bioanal Chem. 2011 Jul;401(1):167-81 - PubMed
  10. J Clin Bioinforma. 2013 Jan 14;3(1):1 - PubMed
  11. Bioinformatics. 2012 Sep 1;28(17):2270-1 - PubMed
  12. Cytopathology. 2014 Jun;25(3):170-6 - PubMed

Publication Types