TY - JOUR
T1 - Analysis of premalignant pancreatic cancer mass spectrometry data for biomarker selection using a group search optimizer
AU - CUI, Zhihua
AU - SHI, Zhongzhi
AU - ABONYI, Janos
AU - HE, S.
AU - COOPER, H.J.
AU - WARD, D.G.
AU - YAO, X.
AU - HEATH, J.K.
PY - 2012/8
Y1 - 2012/8
N2 - This paper presents a novel group search optimizer (GSO)-based biomarker discovery method for pancreatic cancer diagnosis using mass spectrometry (MS) data. The GSO was inspired by animal social searching behaviour. It has been shown that the global search performance of the GSO is competitive to other biologically inspired optimization algorithms. In this study, we applied a GSO as a feature selection method to MS data analysis for premalignant pancreatic cancer biomarker discovery. We first applied a smooth non-linear energy operator to detect peaks. Then a GSO with linear discriminant analysis was used to select a parsimonious set of peak windows (biomarkers) that can distinguish cancer. After selecting a set of biomarkers, a support vector machine was then applied to build a classifier to diagnosis premalignant cancer cases. We compared the GSO algorithm with a genetic algorithm, evolution strategies, evolutionary programming and a particle swarm optimizer. The results showed that the GSO-based feature selection algorithm is capable of selecting a parsimonious set of biomarkers to achieve better classification performance than other algorithms. The source code of the proposed GSO-based feature selection algorithm is available at www.cs.bham.ac.uk/~szh. © 2011, SAGE Publications. All rights reserved.
AB - This paper presents a novel group search optimizer (GSO)-based biomarker discovery method for pancreatic cancer diagnosis using mass spectrometry (MS) data. The GSO was inspired by animal social searching behaviour. It has been shown that the global search performance of the GSO is competitive to other biologically inspired optimization algorithms. In this study, we applied a GSO as a feature selection method to MS data analysis for premalignant pancreatic cancer biomarker discovery. We first applied a smooth non-linear energy operator to detect peaks. Then a GSO with linear discriminant analysis was used to select a parsimonious set of peak windows (biomarkers) that can distinguish cancer. After selecting a set of biomarkers, a support vector machine was then applied to build a classifier to diagnosis premalignant cancer cases. We compared the GSO algorithm with a genetic algorithm, evolution strategies, evolutionary programming and a particle swarm optimizer. The results showed that the GSO-based feature selection algorithm is capable of selecting a parsimonious set of biomarkers to achieve better classification performance than other algorithms. The source code of the proposed GSO-based feature selection algorithm is available at www.cs.bham.ac.uk/~szh. © 2011, SAGE Publications. All rights reserved.
KW - Cancer biomarkers
KW - feature selection
KW - group search optimization
KW - proteomics
UR - http://www.scopus.com/inward/record.url?scp=84864221517&partnerID=8YFLogxK
U2 - 10.1177/0142331211402893
DO - 10.1177/0142331211402893
M3 - Journal Article (refereed)
SN - 0142-3312
VL - 34
SP - 668
EP - 676
JO - Transactions of the Institute of Measurement and Control
JF - Transactions of the Institute of Measurement and Control
IS - 6
ER -