MEPAR-miner: Multi-expression programming for classification rule mining


Baykasoglu A., Ozbakir L.

EUROPEAN JOURNAL OF OPERATIONAL RESEARCH, vol.183, pp.767-784, 2007 (Journal Indexed in SCI)

  • Volume: 183 Issue: 2
  • Publication Date: 2007
  • DOI: 10.1016/j.ejor.2006.10.015
  • Journal Name: EUROPEAN JOURNAL OF OPERATIONAL RESEARCH
  • Pages: pp.767-784

Abstract

Classification and rule induction are two important tasks for extracting knowledge from data. In rule induction, knowledge is represented as IF-THEN rules, which are easily understood and applied by problem-domain experts. In this paper, a new chromosome representation and solution technique based on Multi-Expression Programming (MEP), named MEPAR-miner (Multi-Expression Programming for Association Rule Mining), is proposed for rule induction. MEP is a relatively new technique in evolutionary programming that was first introduced in 2002 by Oltean and Dumitrescu. MEP uses a linear chromosome structure. In MEP, multiple logical expressions of different sizes are used to represent different logical rules. MEP expressions can be encoded and implemented in a flexible and efficient manner. MEP is generally applied to prediction problems; in this paper a new algorithm is presented that enables MEP to discover classification rules. The performance of the developed algorithm is tested on nine publicly available binary and n-ary classification data sets. Extensive experiments are performed to demonstrate that MEPAR-miner can discover effective classification rules that are as good as (or better than) the ones obtained by traditional rule induction methods. It is also shown that an effective gene encoding structure directly improves the predictive accuracy of logical IF-THEN rules. (c) 2006 Elsevier B.V. All rights reserved.
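
To make the idea of a linear chromosome that holds several logical expressions of different sizes more concrete, the following is a minimal sketch in Python. It is not the authors' implementation: the gene layout (attribute tests as terminals, AND/OR/NOT genes that point to earlier genes), the attribute names, the toy records, and the use of plain accuracy as the fitness measure are all assumptions made for illustration only.

```python
from dataclasses import dataclass
from typing import Dict, List, Tuple, Union
import operator


@dataclass(frozen=True)
class Terminal:
    """Hypothetical terminal gene: a single attribute test."""
    attr: str
    op: str
    value: float


@dataclass(frozen=True)
class Function:
    """Hypothetical function gene: refers to earlier genes by index."""
    name: str               # "AND", "OR", or "NOT"
    args: Tuple[int, ...]   # indices strictly lower than this gene's own


Gene = Union[Terminal, Function]
COMPARE = {"<=": operator.le, ">": operator.gt, "==": operator.eq}


def evaluate_all(chromosome: List[Gene], record: Dict[str, float]) -> List[bool]:
    """Evaluate every gene once; gene i yields the truth value of expression i."""
    results: List[bool] = []
    for gene in chromosome:
        if isinstance(gene, Terminal):
            results.append(COMPARE[gene.op](record[gene.attr], gene.value))
        elif gene.name == "NOT":
            results.append(not results[gene.args[0]])
        elif gene.name == "AND":
            results.append(results[gene.args[0]] and results[gene.args[1]])
        elif gene.name == "OR":
            results.append(results[gene.args[0]] or results[gene.args[1]])
        else:
            raise ValueError(f"unknown function gene: {gene.name}")
    return results


def best_expression(chromosome, records, labels, target):
    """Score each encoded expression as the IF-part of 'IF expr THEN target'.

    Assumption: fitness is plain accuracy; the paper may use another measure.
    Returns (index of the best gene, its accuracy).
    """
    correct = [0] * len(chromosome)
    for record, label in zip(records, labels):
        for i, fired in enumerate(evaluate_all(chromosome, record)):
            if fired == (label == target):
                correct[i] += 1
    best = max(range(len(chromosome)), key=lambda i: correct[i])
    return best, correct[best] / len(records)


# Toy chromosome: five genes, hence five candidate rules of different sizes.
chromosome = [
    Terminal("age", ">", 30),       # gene 0
    Terminal("income", "<=", 50),   # gene 1
    Function("AND", (0, 1)),        # gene 2: age > 30 AND income <= 50
    Function("NOT", (1,)),          # gene 3: NOT income <= 50
    Function("OR", (2, 3)),         # gene 4: gene 2 OR gene 3
]
records = [
    {"age": 25, "income": 40},
    {"age": 35, "income": 40},
    {"age": 45, "income": 60},
    {"age": 28, "income": 70},
]
labels = ["no", "yes", "no", "no"]

best_index, accuracy = best_expression(chromosome, records, labels, "yes")
print(best_index, accuracy)
```

Because each function gene only refers to genes that appear earlier in the chromosome, a single left-to-right pass evaluates every encoded expression, and the best-scoring gene can then be taken as the rule the chromosome represents. On this toy data, gene 2 classifies all four records correctly.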