Fitness Functions in Genetic Programming for Classification with Unbalanced Data
This paper describes a genetic programming (GP) approach to binary classification with class imbalance problems. This approach is examined on two benchmark and two synthetic data sets. The results show that when using the overall classification accuracy as the fitness function, the GP system is stro...
        Saved in:
      
    
          | Published in | AI 2007: Advances in Artificial Intelligence Vol. 4830; pp. 769 - 775 | 
|---|---|
| Main Authors | , | 
| Format | Book Chapter | 
| Language | English | 
| Published | 
        Germany
          Springer Berlin / Heidelberg
    
        2007
     Springer Berlin Heidelberg  | 
| Series | Lecture Notes in Computer Science | 
| Subjects | |
| Online Access | Get full text | 
| ISBN | 9783540769262 3540769269  | 
| ISSN | 0302-9743 1611-3349  | 
| DOI | 10.1007/978-3-540-76928-6_90 | 
Cover
| Summary: | This paper describes a genetic programming (GP) approach to binary classification with class imbalance problems. This approach is examined on two benchmark and two synthetic data sets. The results show that when using the overall classification accuracy as the fitness function, the GP system is strongly biased toward the majority class. Two new fitness functions are developed to deal with the class imbalance problem. The experimental results show that both of them substantially improve the performance for the minority class, and the performance for the majority and minority classes is much more balanced. | 
|---|---|
| ISBN: | 9783540769262 3540769269  | 
| ISSN: | 0302-9743 1611-3349  | 
| DOI: | 10.1007/978-3-540-76928-6_90 |