INTERNATIONAL JOURNAL OF DATA MINING AND BIOINFORMATICS, cilt.23, sa.2, ss.101-118, 2020 (SCI-Expanded)
One area of metabolic data analysis is processes that involve the detection and discovery of biomarkers used in the early diagnosis of diseases and development of alternative treatments. Classification and feature selection are frequently used in the statistical analysis of metabolomics data for the detection and discovery of biomarkers. Recently, automatic programming methods have begun to be used instead of conventional methods. In this paper, three conventional classification and feature selection methods (PLS-DA, RF, SVM) and two automatic programming methods (ABCP and GP) are applied to classification problems where they are evaluated on synthetic and real data sets. The selection performances on the biomarker discovery of the algorithms have been compared. It has been found that automatic programming methods are more successful in classifying metabolic data and ABCP is superior to GP in biomarker discovery.