Machine Learning

Bibliographic Details
Published in Scientific Data Mining and Knowledge Discovery, pp. 7-52
Main Authors Hoffmann, Achim, Mahidadia, Ashesh
Format Book Chapter
Language English
Published Berlin, Heidelberg Springer Berlin Heidelberg 2010
ISBN 3642027873
9783642027871
DOI 10.1007/978-3-642-02788-8_2


Abstract This chapter presents fundamental ideas and techniques of machine learning suited to the field of this book, i.e., automated scientific discovery. It focuses on symbolic machine learning methods, which produce results that humans can interpret and understand. This is particularly important in automated scientific discovery, because the scientific theories produced by machines are usually meant to be interpreted by humans. The chapter covers some of the most influential ideas and concepts in machine learning research to give the reader a basic insight into the field. After the introduction in Sect. 1, Sect. 2 gives general ideas of how learning problems can be framed, providing useful perspectives for understanding what learning algorithms actually do. Section 3 presents the version space model, which is both an early learning algorithm and a conceptual framework that provides important insight into the general mechanisms behind most learning algorithms. Section 4 presents the AQ family of algorithms for learning classification rules, which belongs to the early approaches in machine learning. Section 5 presents the basic principles of decision tree learners, which are among the most influential classes of inductive learning algorithms today. Section 6 presents a more recent group of learning systems, which learn relational concepts within the framework of logic programming. This group is particularly interesting because the framework also allows background knowledge to be incorporated, which may assist in generalisation. Section 7 discusses association rules, a technique that comes from the related field of data mining. Section 8 presents the basic idea of the Naive Bayesian Classifier. While this is a very popular learning technique, its learning result is not well suited to human comprehension, as it is essentially a large collection of probability values. Section 9 presents a generic method for improving the accuracy of a given learner by generating multiple classifiers using variations of the training data. While this works well in most cases, the resulting classifiers are significantly more complex and hence tend to destroy the human readability that the result of a single learner may have. Section 10 contains a summary, briefly mentions other techniques not discussed in this chapter, and presents an outlook on the potential of machine learning in the future.
Copyright Springer-Verlag Berlin Heidelberg 2009
Discipline Computer Science
EISBN 9783642027888
3642027881
Editor Gaber, Mohamed Medhat
Publication Subtitle Principles and Foundations
Subject Terms Association Rule
Concept Space
Frequent Itemset
Inductive Logic Programming
Logic Program