Binary k-nearest neighbor for text categorization

Purpose - With the ever-increasing volume of text data via the internet, it is important that documents are classified as manageable and easy to understand categories. This paper proposes the use of binary k-nearest neighbour (BKNN) for text categorization.Design methodology approach - The paper des...

Full description

Saved in:
Bibliographic Details
Published inOnline information review Vol. 29; no. 4; pp. 391 - 399
Main Author Tan, Songbo
Format Journal Article
LanguageEnglish
Published Bradford Emerald Group Publishing Limited 01.01.2005
Emerald
Subjects
Online AccessGet full text
ISSN1468-4527
1468-4535
DOI10.1108/14684520510617839

Cover

Abstract Purpose - With the ever-increasing volume of text data via the internet, it is important that documents are classified as manageable and easy to understand categories. This paper proposes the use of binary k-nearest neighbour (BKNN) for text categorization.Design methodology approach - The paper describes the traditional k-nearest neighbor (KNN) classifier, introduces BKNN and outlines experiemental results.Findings - The experimental results indicate that BKNN requires much less CPU time than KNN, without loss of classification performance.Originality value - The paper demonstrates how BKNN can be an efficient and effective algorithm for text categorization. Proposes the use of binary k-nearest neighbor (BKNN ) for text categorization.
AbstractList Purpose - With the ever-increasing volume of text data via the internet, it is important that documents are classified as manageable and easy to understand categories. This paper proposes the use of binary k-nearest neighbour (BKNN) for text categorization.Design methodology approach - The paper describes the traditional k-nearest neighbor (KNN) classifier, introduces BKNN and outlines experiemental results.Findings - The experimental results indicate that BKNN requires much less CPU time than KNN, without loss of classification performance.Originality value - The paper demonstrates how BKNN can be an efficient and effective algorithm for text categorization. Proposes the use of binary k-nearest neighbor (BKNN ) for text categorization.
With the ever-increasing volume of text data via the internet, it is important that documents are classified as manageable and easy to understand categories. This paper proposes the use of binary k-nearest neighbour (BKNN) for text categorization.
Purpose: With the ever-increasing volume of text data via the internet, it is important that documents are classified as manageable and easy to understand categories. This paper proposes the use of binary k-nearest neighbour (BKNN) for text categorization. Design/methodology/approach: The paper describes the traditional k-nearest neighbor (KNN) classifier, introduces BKNN and outlines experiemental results. Findings: The experimental results indicate that BKNN requires much less CPU time than KNN, without loss of classification performance. Originality/value: The paper demonstrates how BKNN can be an efficient and effective algorithm for text categorization. Proposes the use of binary k-nearest neighbor (BKNN) for text categorization. (Original abstract)
With the ever-increasing volume of text data via the internet, it is important that documents are classified as manageable and easy to understand categories. This paper proposes the use of binary k-nearest neighbour (BKNN) for text categorization. The paper describes the traditional k-nearest neighbor (KNN) classifier, introduces BKNN and outlines experiemental results. The experimental results indicate that BKNN requires much less CPU time than KNN, without loss of classification performance. The paper demonstrates how BKNN can be an efficient and effective algorithm for text categorization. Proposes the use of binary k-nearest neighbor (BKNN) for text categorization.
Author Tan, Songbo
Author_xml – sequence: 1
  givenname: Songbo
  surname: Tan
  fullname: Tan, Songbo
  organization: Software Department, Institute of Computing Technology, Chinese Academy of Sciences, People's Republic of China
BackLink http://pascal-francis.inist.fr/vibad/index.php?action=getRecordDetail&idt=17071132$$DView record in Pascal Francis
BookMark eNqF0UtLAzEQAOAgCtbqD_BWBPXiaiaPJjlq8QUFL3pestnZurrN1mQL6q83pcVCRT2EhOGbzCSzR7Z965GQQ6DnAFRfgBhqIRmVQIegNDdbpLeIZUJyuf19ZmqX7MX4QikwwWWPwFXtbfgYvGYebcDYDTzWk-eiDYMqrQ7fu4GzHU7aUH_arm79PtmpbBPxYLX3ydPN9ePoLhs_3N6PLseZE2zYZU7pEqwyKHWhgUmuCiMrVypecGGQGSELLTR1jnNkWhtEU1IpaKkNCCt5n5wu752F9m2eOsundXTYNNZjO4-5EpwaoxhN8uRPKdVQUMbhX8g0CC3MovbRBnxp58Gn5-aQGheccZHQ8QrZ6GxTBetdHfNZqKfpR3NQVAEk2SewdC60MQas1oTmi-HlP4aXcs6WOTjFYJtynbJJ81lZJU5_4b9W-AKJ6aX7
Cites_doi 10.1145/312624.312647
10.1145/243199.243277
10.1007/s100440050003
10.1023/A:1009982220290
10.1016/0167-8655(94)90095-7
ContentType Journal Article
Copyright Emerald Group Publishing Limited
2006 INIST-CNRS
Copyright Emerald Group Publishing, Limited 2005
Copyright_xml – notice: Emerald Group Publishing Limited
– notice: 2006 INIST-CNRS
– notice: Copyright Emerald Group Publishing, Limited 2005
DBID AAYXX
CITATION
IQODW
0-V
7RV
7SC
7WY
7WZ
7XB
8AO
8FD
8FE
8FG
8FI
ABUWG
AFKRA
ALSLI
ARAPS
AZQEC
BENPR
BEZIV
BGLVJ
CCPQU
CJNVE
CNYFK
DWQXO
E3H
F2A
FYUFA
F~G
GNUQQ
GUQSH
HCIFZ
JQ2
K6~
K7-
L.-
L7M
L~C
L~D
M0C
M0N
M0P
M1O
M2O
MBDVC
NAPCQ
P5Z
P62
PHGZM
PHGZT
PKEHL
PPXIY
PQBIZ
PQEDU
PQEST
PQGLB
PQQKQ
PQUKI
PRINS
PRQQA
Q9U
7TA
JG9
DOI 10.1108/14684520510617839
DatabaseName CrossRef
Pascal-Francis
ProQuest Social Sciences Premium Collection
ProQuest Nursing and Allied Health Journals - PSU access expires 11/30/25.
Computer and Information Systems Abstracts
ProQuest ABI/INFORM Collection
ABI/INFORM Global (PDF only)
ProQuest Central (purchase pre-March 2016)
ProQuest Pharma Collection
Technology Research Database
ProQuest SciTech Collection
ProQuest Technology Collection
ProQuest Hospital Collection
ProQuest Central (Alumni)
ProQuest Central UK/Ireland
Social Science Premium Collection
Advanced Technologies & Computer Science Collection
ProQuest Central Essentials
ProQuest Central
Business Premium Collection
Technology collection
ProQuest One Community College
ProQuest Education Collection
ProQuest Library & Information Science Collection
ProQuest Central Korea
Library & Information Sciences Abstracts (LISA)
Library & Information Science Abstracts (LISA)
Health Research Premium Collection
ABI/INFORM Global (Corporate)
ProQuest Central Student
Research Library Prep
SciTech Premium Collection
ProQuest Computer Science Collection
ProQuest Business Collection
Computer Science Database
ABI/INFORM Professional Advanced
Advanced Technologies Database with Aerospace
Computer and Information Systems Abstracts – Academic
Computer and Information Systems Abstracts Professional
ABI/INFORM Global
Computing Database
Education Database
Library Science Database
Research Library
Research Library (Corporate)
Nursing & Allied Health Premium
Advanced Technologies & Aerospace Database
ProQuest Advanced Technologies & Aerospace Collection
ProQuest Central Premium
ProQuest One Academic (New)
ProQuest One Academic Middle East (New)
ProQuest One Health & Nursing
ProQuest One Business
ProQuest One Education
ProQuest One Academic Eastern Edition (DO NOT USE)
ProQuest One Applied & Life Sciences
ProQuest One Academic
ProQuest One Academic UKI Edition
ProQuest Central China
ProQuest One Social Sciences
ProQuest Central Basic
Materials Business File
Materials Research Database
DatabaseTitle CrossRef
ProQuest One Education
Research Library Prep
Computer Science Database
ProQuest Central Student
Library and Information Science Abstracts (LISA)
ProQuest Advanced Technologies & Aerospace Collection
ProQuest Central Essentials
ProQuest Computer Science Collection
Computer and Information Systems Abstracts
SciTech Premium Collection
ProQuest Central China
ABI/INFORM Complete
ProQuest One Applied & Life Sciences
Health Research Premium Collection
Library & Information Science Collection
ProQuest Central (New)
Advanced Technologies & Aerospace Collection
Business Premium Collection
Social Science Premium Collection
ABI/INFORM Global
Education Collection
ProQuest One Academic Eastern Edition
ProQuest Hospital Collection
ProQuest Technology Collection
ProQuest Business Collection
Nursing & Allied Health Premium
ProQuest Social Sciences Premium Collection
ProQuest One Academic UKI Edition
ProQuest One Academic
ProQuest One Academic (New)
ABI/INFORM Global (Corporate)
ProQuest One Business
Technology Collection
Technology Research Database
Computer and Information Systems Abstracts – Academic
ProQuest One Academic Middle East (New)
ProQuest Central (Alumni Edition)
ProQuest One Community College
ProQuest One Health & Nursing
ProQuest Pharma Collection
ProQuest Central
ABI/INFORM Professional Advanced
ProQuest Library Science
ProQuest Central Korea
ProQuest Research Library
Advanced Technologies Database with Aerospace
ProQuest Computing
ProQuest One Social Sciences
ProQuest Central Basic
ProQuest Education Journals
ProQuest Nursing & Allied Health Source
ProQuest SciTech Collection
Computer and Information Systems Abstracts Professional
Advanced Technologies & Aerospace Database
Materials Research Database
Materials Business File
DatabaseTitleList
Computer and Information Systems Abstracts
Library and Information Science Abstracts (LISA)
ProQuest One Education
Materials Research Database
Database_xml – sequence: 1
  dbid: 8FG
  name: ProQuest Technology Collection
  url: https://search.proquest.com/technologycollection1
  sourceTypes: Aggregation Database
DeliveryMethod fulltext_linktorsrc
Discipline Engineering
Library & Information Science
EISSN 1468-4535
EndPage 399
ExternalDocumentID 913587411
17071132
10_1108_14684520510617839
10.1108/14684520510617839
Genre Feature
GroupedDBID -ET
.DC
.X0
0-V
0R~
123
1WG
1XV
29N
3FY
3V.
4.4
5VS
70U
77K
7RV
7WY
8AO
8FE
8FG
8FI
8FW
8R4
8R5
9E0
9F-
AAGBP
AAMCF
AAOWE
AAPSD
AAUDR
AAWTL
AAYOK
ABEAN
ABHCV
ABIJV
ABSDC
ABUWG
ACGFS
ACHQT
ADBBV
ADOMW
AEBZA
AEDOK
AEMMR
AENEX
AETHF
AFKRA
AFNZV
AGZLY
AIAFM
AJEBP
AJFKA
ALIPV
ALMA_UNASSIGNED_HOLDINGS
ALSLI
AODMV
APPLU
ARALO
ARAPS
ASPBG
ATGMP
AUCOK
AVWKF
AZFZN
AZQEC
BENPR
BEZIV
BGLVJ
BKEYQ
BLEHN
BPHCQ
BTXLY
BUONS
BVLZF
BVXVI
CAG
CCPQU
CJNVE
CNYFK
COF
CS3
DU5
DWQXO
EBS
EJD
EX3
FNNZZ
FYUFA
GEA
GEC
GEI
GMM
GMN
GNUQQ
GQ.
GROUPED_ABI_INFORM_COMPLETE
GUQSH
H13
HCIFZ
HZ~
H~9
IPNFZ
J1Y
JI-
JL0
K6V
K6~
K7-
KLENG
M0C
M0N
M0P
M1O
M2O
M42
NAPCQ
O9-
OHT
P2P
P62
PCD
PQBIZ
PQEDU
PQQKQ
PRG
PROAC
Q2X
RIG
ROL
SCAQC
SDURG
SLOBJ
TDX
TEM
TET
TGG
TMD
TMF
TMI
TMK
TMT
TMX
UKHRP
WOW
XSW
Z11
Z12
Z21
Z22
ZCA
1JL
2RR
77I
8NV
AABYC
AAYXX
ABJNI
ABKIT
ABXQL
ABYQI
ACXJU
ACZUD
ADIOT
ADQUB
ADYJY
AEACZ
AFQLH
AFVFF
AGQPQ
AGSTH
AGUEF
AHAFT
AHMHQ
AJNYF
AJZCB
AKXVL
ALJBP
ASJQZ
CITATION
OXR
PHGZM
PHGZT
PPXIY
PQGLB
PRQQA
PUEGO
SQT
0B8
AADXL
AAPBV
ABPTK
ABTMD
ACMTK
AEUCW
ASMFL
AVELQ
IQODW
V1G
7SC
7XB
8FD
AFNTC
E3H
F2A
JQ2
L.-
L7M
L~C
L~D
MBDVC
PKEHL
PQEST
PQUKI
PRINS
Q9U
7TA
JG9
ID FETCH-LOGICAL-c426t-c78d1a79e58b812537b95fcd73b349e2945b8480cc33e2889ee9d0540d8914a53
IEDL.DBID M1O
ISSN 1468-4527
IngestDate Wed Oct 01 08:10:54 EDT 2025
Thu Oct 02 15:42:21 EDT 2025
Wed Oct 01 14:24:00 EDT 2025
Sat Aug 23 14:20:10 EDT 2025
Sun Oct 22 16:08:06 EDT 2023
Wed Oct 01 05:41:52 EDT 2025
Tue Nov 26 02:56:51 EST 2024
Wed Jul 31 14:17:48 EDT 2019
IsPeerReviewed true
IsScholarly true
Issue 4
Keywords Data handling
Information retrieval
Classification
Nearest neighbour
Classifier
Algorithm
Categorization
Language English
License CC BY 4.0
LinkModel DirectLink
MergedId FETCHMERGED-LOGICAL-c426t-c78d1a79e58b812537b95fcd73b349e2945b8480cc33e2889ee9d0540d8914a53
Notes SourceType-Scholarly Journals-1
ObjectType-Feature-1
content type line 14
ObjectType-Article-2
content type line 23
ObjectType-Article-1
ObjectType-Feature-2
PQID 194543234
PQPubID 23500
PageCount 9
ParticipantIDs proquest_miscellaneous_743099720
proquest_miscellaneous_57640231
proquest_journals_194543234
crossref_primary_10_1108_14684520510617839
proquest_miscellaneous_28148495
pascalfrancis_primary_17071132
emerald_primary_10_1108_14684520510617839
ProviderPackageCode CITATION
AAYXX
PublicationCentury 2000
PublicationDate 2005-01-01
PublicationDateYYYYMMDD 2005-01-01
PublicationDate_xml – month: 01
  year: 2005
  text: 2005-01-01
  day: 01
PublicationDecade 2000
PublicationPlace Bradford
PublicationPlace_xml – name: Bradford
PublicationTitle Online information review
PublicationYear 2005
Publisher Emerald Group Publishing Limited
Emerald
Publisher_xml – name: Emerald Group Publishing Limited
– name: Emerald
References b2
b10
b3
b12
b14
b8
b9
References_xml – ident: b14
  doi: 10.1145/312624.312647
– ident: b9
  doi: 10.1145/243199.243277
– ident: b2
  doi: 10.1007/s100440050003
– ident: b12
  doi: 10.1023/A:1009982220290
– ident: b3
– ident: b8
– ident: b10
  doi: 10.1016/0167-8655(94)90095-7
SSID ssj0012435
Score 1.6554837
Snippet Purpose - With the ever-increasing volume of text data via the internet, it is important that documents are classified as manageable and easy to understand...
With the ever-increasing volume of text data via the internet, it is important that documents are classified as manageable and easy to understand categories....
Purpose: With the ever-increasing volume of text data via the internet, it is important that documents are classified as manageable and easy to understand...
SourceID proquest
pascalfrancis
crossref
emerald
SourceType Aggregation Database
Index Database
Enrichment Source
Publisher
StartPage 391
SubjectTerms Binary system
Classification
Exact sciences and technology
Indexing
Information and communication sciences
Information management
Information processing and retrieval
Information retrieval
Information retrieval. Man machine relationship
Information science. Documentation
Internet
Online information retrieval
Research process. Evaluation
Sciences and techniques of general use
Searches
Studies
Text categorization
Vocabularies & taxonomies
Title Binary k-nearest neighbor for text categorization
URI https://www.emerald.com/insight/content/doi/10.1108/14684520510617839/full/html
https://www.proquest.com/docview/194543234
https://www.proquest.com/docview/28148495
https://www.proquest.com/docview/57640231
https://www.proquest.com/docview/743099720
Volume 29
hasFullText 1
inHoldings 1
isFullTextHit
isPrint
journalDatabaseRights – providerCode: PRVPQU
  databaseName: Library Science Database
  customDbUrl:
  eissn: 1468-4535
  dateEnd: 20241105
  omitProxy: false
  ssIdentifier: ssj0012435
  issn: 1468-4527
  databaseCode: M1O
  dateStart: 20010101
  isFulltext: true
  titleUrlDefault: https://search.proquest.com/libraryscience
  providerName: ProQuest
– providerCode: PRVPQU
  databaseName: ProQuest Central
  customDbUrl: http://www.proquest.com/pqcentral?accountid=15518
  eissn: 1468-4535
  dateEnd: 20241105
  omitProxy: true
  ssIdentifier: ssj0012435
  issn: 1468-4527
  databaseCode: BENPR
  dateStart: 20010101
  isFulltext: true
  titleUrlDefault: https://www.proquest.com/central
  providerName: ProQuest
– providerCode: PRVPQU
  databaseName: ProQuest Technology Collection
  customDbUrl:
  eissn: 1468-4535
  dateEnd: 20241105
  omitProxy: true
  ssIdentifier: ssj0012435
  issn: 1468-4527
  databaseCode: 8FG
  dateStart: 20000101
  isFulltext: true
  titleUrlDefault: https://search.proquest.com/technologycollection1
  providerName: ProQuest
link http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwnV1LT9wwEB4VuLRCfdBWpJStD1CplQKb2Fk7p6pUbFEloKqKxC1K7EkPSNmFZC_8emYc7_JYUVW9OiMlzoztGc_M9wHsMOCISSg6Id9Ax8oZjCtEjGtLZ4EslU49zvbxyejoTP04z85DbU4byirne6LfqN3E8h35PgXbmZKpVF-mlzGTRnFyNTBorMAa5x6ZwOA4OV0kEVLl-TV9c5HKUh2Smkx8w2M0xBbJPXJMFX7nWLrtzV2fli39q7onuVjar_0hNH7RM622HruQa08u9mZdtWevHyA7_vf8XsLz4J6Kr709vYIn2GzAszughRuwHVodxEcReplYtyJsEq8hOfAtvuIibhgft-1Ew9evZGuCpAVXmgguw_pDr-97QN_A2fjw97ejOBAzxJYO9C622rik1DlmpiIHIZO6yrPaOi0rqXJMaSqVUWZorZSYGpMj5o59Q2fyRJWZfAurzaTBTRCI1o1yo8q0RkXRpnG2Gmm0ErXLpVQRfJ7rpZj2-BuFj1uGplhSYgSfgub-RXZ3SfahTDF1dQSDezZwK63JO6NYPoKtuUKLsP7bYqHNCD4sntLC5WxM2eBk1hapoUiUwtPHJSgUVAzPF4F4RILcP9_5PHz314_YgqcecdbfHL2H1e5qhtvkS3XVAFbM-PsA1g4OT37-Gvj1cwOaPxsQ
linkProvider ProQuest
linkToHtml http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwtV1RTxQxEJ4QeFBjDKLGFYE-iIkmG-_a7rX7QAwg5BC4GAMJb-tuO-uDyd7pHjH-OP-bM73ugVwwvvC6O8luZtrOfG2_-QBeccMR2yd0QrWBSbW3mFaImNaOcoEqtZGhz_bpaDA81x8vsosl-N1xYfhaZbcmhoXajx3vkb8jsJ1pJZV-P_mesmgUH652ChplVFbwO6HDWOR1HOOvn4Tg2p2jDxTubSkPD872h2kUGUgdJadp6oz1_dLkmNmKkl2mTJVntfNGVUrnKOmzldW255xSKK3NEXPPdY63eV-XLBpBGWBFkzFhv5W9g9Gnz_NjDKmDwmegN-lMmnisytI7_Iwe8Zxglh6LlV9LjFfs4IeTsqVo1TOZjYWMEdLg4So8ivWr2J0NuMewhM0aPLjW1XANNiIXQrwWkezEwRdxFXkC_b3AARbf0oYb6LZT0fD-LA1GQdaCr6IIvqf1lfw_I4k-hfM7ceozWG7GDT4Hgej8ILe6lDVqgqPWu2pg0Ck0PldKJ_C2c1sxmTXoKAKw6dliwccJvImO_R_b7QXbmzbFxNcJbP4VoitrQ-Ubgf0E1ruYFXGBaIv5cE5ga_6WZjYf15QNji_bQlqCqoRfb7cgrKi5f18C4hYLqg8DNbr34p8_sQX3hmenJ8XJ0eh4He6H9rRhm-klLE9_XOIGFV7TajMObwFf7npG_QG39Dfn
linkToPdf http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMw1V3NbtQwEB5VRUIgxE8BEUpbHygSSGk3trN2DggBZdVSKByo1Fua2BMOlbILmxWCN-NVeBpmnGRbuiri0gNXZ6TE8ef5seebAXjMBUdsQtEJ-QYm1t5iXCJiXDmyBarQRoY62-8PhruH-u1RerQEP3suDKdV9joxKGo_dnxGvk3BdqqVVHq76rIiPu6MXky-xNxAii9a-24aLUL28fs3it6mz_d2aKk3pRy9-fR6N-4aDMSODFMTO2N9UpgMU1uSoUuVKbO0ct6oUukMJb2ytNoOnFMKpbUZYubZx_E2S3TBDSNI-18xHEVw1mDyYX6BIXXo7RmITTqVprtQ5aY7PEZDvBuYn8dtys-YxFNe8I1JMaV1qtoGGwu2IhjA0S341f-6Nu_lZGvWlFvux7mqkv_lv70NNzu3XLxs99EdWMJ6Ba6fKda4AmsdxUM8ER2HizEtOuV4F5JXgdosTuKa6wJPG1HzsTPtMUHSgucrOP3sM0235b7eg8NLmdN9WK7HNT4Agej8MLO6kBVqirKtd-XQoFNofKaUjuBZj4l80tYdyUO8NrD5AoAieNqh5l9kNxdkz8vkE19FsP4H_k6lDXmliZIRrPZgyju9N83nSIpgY_6UFBbfQhU1jmfTXFqKwCksv1iCQmDNZQkjEBdIkNsbGN-Dh3_9iA24SkDO3-0d7K_CtVB0NxyePYLl5usM18idbMr1sHEFHF82mn8Dybl8HA
openUrl ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Ajournal&rft.genre=article&rft.atitle=Binary+k-nearest+neighbor+for+text+categorization&rft.jtitle=Online+information+review&rft.au=Tan%2C+Songbo&rft.date=2005-01-01&rft.pub=Emerald+Group+Publishing+Limited&rft.issn=1468-4527&rft.eissn=1468-4535&rft.volume=29&rft.issue=4&rft.spage=391&rft_id=info:doi/10.1108%2F14684520510617839&rft.externalDBID=HAS_PDF_LINK&rft.externalDocID=913587411
thumbnail_l http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/lc.gif&issn=1468-4527&client=summon
thumbnail_m http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/mc.gif&issn=1468-4527&client=summon
thumbnail_s http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/sc.gif&issn=1468-4527&client=summon