k-Nearest Neighbour Classifiers: 2nd Edition (with Python examples)

Perhaps the most straightforward classifier in the arsenal or machine learning techniques is the Nearest Neighbour Classifier -- classification is achieved by identifying the nearest neighbours to a query example and using those neighbours to determine the class of the query. This approach to classi...

Full description

Saved in:
Bibliographic Details
Published inarXiv.org
Main Authors Cunningham, Padraig, Delany, Sarah Jane
Format Paper Journal Article
LanguageEnglish
Published Ithaca Cornell University Library, arXiv.org 29.04.2020
Subjects
Online AccessGet full text
ISSN2331-8422
DOI10.48550/arxiv.2004.04523

Cover

Abstract Perhaps the most straightforward classifier in the arsenal or machine learning techniques is the Nearest Neighbour Classifier -- classification is achieved by identifying the nearest neighbours to a query example and using those neighbours to determine the class of the query. This approach to classification is of particular importance because issues of poor run-time performance is not such a problem these days with the computational power that is available. This paper presents an overview of techniques for Nearest Neighbour classification focusing on; mechanisms for assessing similarity (distance), computational issues in identifying nearest neighbours and mechanisms for reducing the dimension of the data. This paper is the second edition of a paper previously published as a technical report. Sections on similarity measures for time-series, retrieval speed-up and intrinsic dimensionality have been added. An Appendix is included providing access to Python code for the key methods.
AbstractList Perhaps the most straightforward classifier in the arsenal or machine learning techniques is the Nearest Neighbour Classifier -- classification is achieved by identifying the nearest neighbours to a query example and using those neighbours to determine the class of the query. This approach to classification is of particular importance because issues of poor run-time performance is not such a problem these days with the computational power that is available. This paper presents an overview of techniques for Nearest Neighbour classification focusing on; mechanisms for assessing similarity (distance), computational issues in identifying nearest neighbours and mechanisms for reducing the dimension of the data. This paper is the second edition of a paper previously published as a technical report. Sections on similarity measures for time-series, retrieval speed-up and intrinsic dimensionality have been added. An Appendix is included providing access to Python code for the key methods.
Perhaps the most straightforward classifier in the arsenal or machine learning techniques is the Nearest Neighbour Classifier -- classification is achieved by identifying the nearest neighbours to a query example and using those neighbours to determine the class of the query. This approach to classification is of particular importance because issues of poor run-time performance is not such a problem these days with the computational power that is available. This paper presents an overview of techniques for Nearest Neighbour classification focusing on; mechanisms for assessing similarity (distance), computational issues in identifying nearest neighbours and mechanisms for reducing the dimension of the data. This paper is the second edition of a paper previously published as a technical report. Sections on similarity measures for time-series, retrieval speed-up and intrinsic dimensionality have been added. An Appendix is included providing access to Python code for the key methods.
Author Delany, Sarah Jane
Cunningham, Padraig
Author_xml – sequence: 1
  givenname: Padraig
  surname: Cunningham
  fullname: Cunningham, Padraig
– sequence: 2
  givenname: Sarah
  surname: Delany
  middlename: Jane
  fullname: Delany, Sarah Jane
BackLink https://doi.org/10.1145/3459665$$DView published paper (Access to full text may be restricted)
https://doi.org/10.48550/arXiv.2004.04523$$DView paper in arXiv
BookMark eNotz01Lw0AQBuBFFKy1P8CTAS96SJzMfiXeJNQPKNVD72Frds3WNIm7aW3_vWvrXIaBl-F9Lshp27WakKsUEpZxDvfK7ew2QQCWAONIT8gIKU3jjCGek4n3KwBAIZFzOiLFVzzXymk_RHNtP-tlt3FR0SjvrbHa-YcI2yqaVnawXRvd_tihjt73Qx0OvVPrvtH-7pKcGdV4PfnfY7J4mi6Kl3j29vxaPM5iFWrEQnKu5AdiCpRRJrCSVSYVgyrPFSJVXFTUCKaMESllOSxzLmWaGQEawQg6JtfHtwdh2Tu7Vm5f_knLgzQkbo6J3nXfm2AqV4HThk4l0ixLBadhfgHxEVVk
ContentType Paper
Journal Article
Copyright 2020. This work is published under http://arxiv.org/licenses/nonexclusive-distrib/1.0/ (the “License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.
http://arxiv.org/licenses/nonexclusive-distrib/1.0
Copyright_xml – notice: 2020. This work is published under http://arxiv.org/licenses/nonexclusive-distrib/1.0/ (the “License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.
– notice: http://arxiv.org/licenses/nonexclusive-distrib/1.0
DBID 8FE
8FG
ABJCF
ABUWG
AFKRA
AZQEC
BENPR
BGLVJ
CCPQU
DWQXO
HCIFZ
L6V
M7S
PHGZM
PHGZT
PIMPY
PKEHL
PQEST
PQGLB
PQQKQ
PQUKI
PRINS
PTHSS
AKY
EPD
GOX
DOI 10.48550/arxiv.2004.04523
DatabaseName ProQuest SciTech Collection
ProQuest Technology Collection
Materials Science & Engineering Collection
ProQuest Central (Alumni)
ProQuest Central
ProQuest Central Essentials
ProQuest Central
ProQuest Technology Collection
ProQuest One Community College
ProQuest Central Korea
SciTech Premium Collection
ProQuest Engineering Collection
Engineering Database
ProQuest Central Premium
ProQuest One Academic (New)
Publicly Available Content Database
ProQuest One Academic Middle East (New)
ProQuest One Academic Eastern Edition (DO NOT USE)
ProQuest One Applied & Life Sciences
ProQuest One Academic
ProQuest One Academic UKI Edition
ProQuest Central China
Engineering Collection
arXiv Computer Science
arXiv Statistics
arXiv.org
DatabaseTitle Publicly Available Content Database
Engineering Database
Technology Collection
ProQuest One Academic Middle East (New)
ProQuest Central Essentials
ProQuest One Academic Eastern Edition
ProQuest Central (Alumni Edition)
SciTech Premium Collection
ProQuest One Community College
ProQuest Technology Collection
ProQuest SciTech Collection
ProQuest Central China
ProQuest Central
ProQuest One Applied & Life Sciences
ProQuest Engineering Collection
ProQuest One Academic UKI Edition
ProQuest Central Korea
Materials Science & Engineering Collection
ProQuest Central (New)
ProQuest One Academic
ProQuest One Academic (New)
Engineering Collection
DatabaseTitleList
Publicly Available Content Database
Database_xml – sequence: 1
  dbid: GOX
  name: arXiv.org
  url: http://arxiv.org/find
  sourceTypes: Open Access Repository
– sequence: 2
  dbid: 8FG
  name: ProQuest Technology Collection
  url: https://search.proquest.com/technologycollection1
  sourceTypes: Aggregation Database
DeliveryMethod fulltext_linktorsrc
Discipline Physics
EISSN 2331-8422
ExternalDocumentID 2004_04523
Genre Working Paper/Pre-Print
GroupedDBID 8FE
8FG
ABJCF
ABUWG
AFKRA
ALMA_UNASSIGNED_HOLDINGS
AZQEC
BENPR
BGLVJ
CCPQU
DWQXO
FRJ
HCIFZ
L6V
M7S
M~E
PHGZM
PHGZT
PIMPY
PKEHL
PQEST
PQGLB
PQQKQ
PQUKI
PRINS
PTHSS
AKY
EPD
GOX
ID FETCH-LOGICAL-a523-6755a7c2210343462d7d87a40d99a223a56d3f64aff613490b957718f60e20f63
IEDL.DBID 8FG
IngestDate Tue Jul 22 22:00:08 EDT 2025
Mon Jun 30 09:22:43 EDT 2025
IsDoiOpenAccess true
IsOpenAccess true
IsPeerReviewed false
IsScholarly false
Language English
LinkModel DirectLink
MergedId FETCHMERGED-LOGICAL-a523-6755a7c2210343462d7d87a40d99a223a56d3f64aff613490b957718f60e20f63
Notes SourceType-Working Papers-1
ObjectType-Working Paper/Pre-Print-1
content type line 50
OpenAccessLink https://www.proquest.com/docview/2388165333?pq-origsite=%requestingapplication%
PQID 2388165333
PQPubID 2050157
ParticipantIDs arxiv_primary_2004_04523
proquest_journals_2388165333
PublicationCentury 2000
PublicationDate 20200429
PublicationDateYYYYMMDD 2020-04-29
PublicationDate_xml – month: 04
  year: 2020
  text: 20200429
  day: 29
PublicationDecade 2020
PublicationPlace Ithaca
PublicationPlace_xml – name: Ithaca
PublicationTitle arXiv.org
PublicationYear 2020
Publisher Cornell University Library, arXiv.org
Publisher_xml – name: Cornell University Library, arXiv.org
SSID ssj0002672553
Score 1.7238252
SecondaryResourceType preprint
Snippet Perhaps the most straightforward classifier in the arsenal or machine learning techniques is the Nearest Neighbour Classifier -- classification is achieved by...
Perhaps the most straightforward classifier in the arsenal or machine learning techniques is the Nearest Neighbour Classifier -- classification is achieved by...
SourceID arxiv
proquest
SourceType Open Access Repository
Aggregation Database
SubjectTerms Classifiers
Computer Science - Learning
K-nearest neighbors algorithm
Machine learning
Python
Similarity
Statistics - Machine Learning
SummonAdditionalLinks – databaseName: arXiv.org
  dbid: GOX
  link: http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwdV1NSwMxEA1tT15EUWm1Sg4e9BDN5mu73qS0FMHqoUJvS3aTQBFW6bZS_70z2S0exGtIIMmEvDeTzBtCrhPg8MF4wyzwXabgGmaF04E5oLY6LUWpJCYKP8_N7E09LfWyQ-g-F8aud6uvRh-4qO_RhHco-i27pAtEAZN5X5bN42SU4mr7__YDjhmb_lytES-mR-SwJXr0sbHMMen46oSM39kcRWPrDZ1jTBJDijTWpVwFrEn9QEXl6MTFf1T0BoOk9PUb0_up31kU8q1vT8liOlmMZ6wtY8AsTIgBI9cWVg2-lVRSGeFSN0qt4i7LLICz1cbJYJQNwaBWIC8ynQJiBMO94MHIM9KrPirfJ1Ql3EvrubOlUSNwnKzIghZl6pKyTGwYkH5cfP7ZKFVgjUmVx30ZkOF-P_L2lNY5wPUoMUD45Pn_Iy_IgUAfkysmsiHpbdZbfwlAvCmuojV-AFh1hl4
  priority: 102
  providerName: Cornell University
Title k-Nearest Neighbour Classifiers: 2nd Edition (with Python examples)
URI https://www.proquest.com/docview/2388165333
https://arxiv.org/abs/2004.04523
hasFullText 1
inHoldings 1
isFullTextHit
isPrint
link http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwfV3PS8MwFA66IXjzJ1PnyMGDHuLaJE1bL4Kj2xBWh0zYbWT5AUPo5jplXvzbfck6PQheCm0v7Ut43_e-JN9D6CoEDm-FEUQC3yUc0jCZ6sgSDdQ2ihVVnLmDwoNc9F_44zgaV4JbWW2r3OZEn6j1XDmNvA3QkoQCyAm7X7wR1zXKra5WLTR2UR2AOnWzOun2fjQWKmJgzGyzmOmtu9pyuZ59-LLw1pmJM-Ck_tGfVOzxpXuA6kO5MMtDtGOKI7Tnt2Wq8hh1XknuTGbLFc6dhukkSOz7WM6s62F9h2mhcab9vit87URVPPx0dgDYrKUz_i1vTtCom406fVK1PSASPogAg48kRAlqMcYZF1THOoklD3SaSgBzGQnNrODSWuG8BYNpGsWAMFYEhgZWsFNUK-aFaSDMw8AwaQItleAJFFqSpjaiKtahUqG0Z6jhf36y2DhbuJ6UfOLjcoaa23hMqlldTn7H4Pz_1xdon7q6NOCEpk1UWy3fzSWA92ra8iPUQvWHLB8-w13vaQzXwVf2DYTXmvU
linkProvider ProQuest
linkToHtml http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwtV1dS8MwFL3MDdE3P_HbPCjoQ7RN0nQVRFA35lcZMsG3kjUJiDDnOnX-OP-bN1mnD4Jve21paJP03nNuknMA9kLE8FYaSRXiXSowDNOujizVCG2jOGe54O6g8F0qWw_i-jF6rMDX5CyM21Y5iYk-UOuX3NXIjzG11EOJ4ISf9V-pc41yq6sTCw1VWivoUy8xVh7suDGfH0jhitOrSxzvfcaajc5Fi5YuA1QhCaMImCOFL4XUhwsuJNOxrsdKBDpJFOZOFUnNrRTKWumk_IJuEsUY0K0MDAus5NjsDNTw0QS5X-28kbbvf4o8TMYI2fl4NdVrhx2rwejp3fPSI6dmzhEU-0t_coFPcM0FqLVV3wwWoWJ6SzDr94XmxTJcPNPUqdwWQ5K6IqqrgRJvpPlknYn2CWE9TRrab_wiB66qS9qfTo-AmJFyysPF4Qp0ptEjq1DtvfTMGhARBoYrE2iVS1FHpqdYYiOWxzrM81DZdVjzH5_1x9IazhRTZL5f1mFr0h9Z-VsV2e8k2Pj_9i7MtTp3t9ntVXqzCfPMkeRAUJZsQXU4eDPbiCSG3Z1yvAhkU54h3yts1tg
openUrl ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Ajournal&rft.genre=article&rft.atitle=k-Nearest+Neighbour+Classifiers%3A+2nd+Edition+%28with+Python+examples%29&rft.jtitle=arXiv.org&rft.au=Cunningham%2C+Padraig&rft.au=Delany%2C+Sarah+Jane&rft.date=2020-04-29&rft.pub=Cornell+University+Library%2C+arXiv.org&rft.eissn=2331-8422&rft_id=info:doi/10.48550%2Farxiv.2004.04523