Data mining and preprocessing application on component reports of an airline company in Turkey


GÜRBÜZ F., ÖZBAKIR L., YAPICI H.

EXPERT SYSTEMS WITH APPLICATIONS, cilt.38, sa.6, ss.6618-6626, 2011 (SCI-Expanded) identifier identifier

  • Yayın Türü: Makale / Tam Makale
  • Cilt numarası: 38 Sayı: 6
  • Basım Tarihi: 2011
  • Doi Numarası: 10.1016/j.eswa.2010.11.076
  • Dergi Adı: EXPERT SYSTEMS WITH APPLICATIONS
  • Derginin Tarandığı İndeksler: Science Citation Index Expanded (SCI-EXPANDED), Scopus
  • Sayfa Sayıları: ss.6618-6626
  • Anahtar Kelimeler: Data mining, Preprocessing, Rough sets, Find laws
  • Erciyes Üniversitesi Adresli: Evet

Özet

Risk and safety have always been important considerations in aviation. With the rapid growth in air travel, flight delays, cancellations and incidents/accidents have also dramatically increased in recent years (Nazeri & Jianping, 2002). There is a large amount of knowledge and data accumulation in aviation industry. These data could be stored in the form of pilot reports, maintenance reports, incident reports or delay reports. This paper focuses on different preprocessing and feature selection techniques applied on the 15 component reports of an airline company in Turkey to understand and clean the data set. Regression analysis, anomaly detection analysis, find dependencies and rough sets are used in this study in order to reduce the data set. Also the classification techniques of data mining are used to predict the warning level of the component as the class attribute. For this purpose Polyanalyst, SPSS Clementine, Minitab and Rosetta software tools are used. Find laws module of Polyanalyst is used to find the relations and information retrieval about the components warning level. (C) 2010 Elsevier Ltd. All rights reserved.