A scalable bottom-up data mining algorithm for relational databases
Machine learning induction algorithms are difficult to scale to very large databases because of their memory-bound nature. Using virtual memory results in a significant performance degradation. To overcome such shortcomings, we developed a classification rule induction algorithm for relational datab...
        Saved in:
      
    
          | Published in | Proceedings. Tenth International Conference on Scientific and Statistical Database Management (Cat. No.98TB100243) pp. 206 - 209 | 
|---|---|
| Main Authors | , , | 
| Format | Conference Proceeding | 
| Language | English | 
| Published | 
            IEEE
    
        1998
     | 
| Subjects | |
| Online Access | Get full text | 
| ISBN | 0818685751 9780818685750  | 
| ISSN | 1099-3371 | 
| DOI | 10.1109/SSDM.1998.688125 | 
Cover
| Summary: | Machine learning induction algorithms are difficult to scale to very large databases because of their memory-bound nature. Using virtual memory results in a significant performance degradation. To overcome such shortcomings, we developed a classification rule induction algorithm for relational databases. Our algorithm uses a bottom-up rule generation strategy that is more effective for mining databases having large cardinality of nominal variables. We have successfully used our algorithm to mine a retail grocery database containing more than 1.6 million records in about 5 hours on a dual Pentium processor PC. | 
|---|---|
| ISBN: | 0818685751 9780818685750  | 
| ISSN: | 1099-3371 | 
| DOI: | 10.1109/SSDM.1998.688125 |