A model-based algorithm for quantity and parameters of clusters discovery

Bibliographic Details
Main Authors Jiang, Kunpeng; Yang, Kun; Qu, Haipeng; Li, Miao; Wang, Shijun
Format Conference Proceeding
Language English
Published SPIE 03.02.2023
ISBN 9781510661363
1510661360
ISSN 0277-786X
DOI 10.1117/12.2660137

Abstract Unsupervised learning solves pattern recognition problems using training samples whose categories are unknown (unlabeled). Clustering algorithms are one kind of unsupervised learning algorithm. Although many clustering algorithms have been studied and applied in many fields, they share a common problem: the number of clusters has to be given. This paper proposes a model-based algorithm for quantity and parameters of clusters discovery (QPCD), which calculates the number and parameters of clusters from the characteristics of the data themselves. The algorithm is a first step toward filling this gap in existing clustering algorithms. The paper proposes an elementary judgment rule for deciding whether a cluster center is appropriate. Using this rule, the proposed algorithm can calculate the correct number of clusters and give the corresponding clustering parameters according to the characteristics of the data. Monte Carlo simulation is used to evaluate the effectiveness of the proposed algorithm. The experimental results show that the algorithm can start from an arbitrarily chosen cluster center and arrive at cluster centers close to the actual cluster centers of the data, thereby completing the clustering without supervision.
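The abstract does not state the QPCD judgment rule itself, so the following is only an illustrative sketch of the general idea, not the authors' method. It assumes a hypothetical stand-in rule: a cluster center is treated as appropriate only if every point assigned to it lies within a chosen `radius`; a point that is too far from all centers starts a new center, and centers that attract no points are dropped. The names `sample_blobs`, `discover_centers`, `radius`, and all numeric settings are assumptions made for this example. Synthetic Gaussian data drawn with a fixed seed plays the role of a small Monte Carlo trial.

```python
import math
import random


def sample_blobs(true_centers, n_per_cluster, sigma, rng):
    """Monte Carlo draw: Gaussian points scattered around each true center."""
    return [
        (rng.gauss(cx, sigma), rng.gauss(cy, sigma))
        for cx, cy in true_centers
        for _ in range(n_per_cluster)
    ]


def mean(group):
    """Coordinate-wise mean of a non-empty list of 2-D points."""
    n = len(group)
    return (sum(x for x, _ in group) / n, sum(y for _, y in group) / n)


def discover_centers(points, start, radius, max_iter=50):
    """Hypothetical stand-in rule (NOT the paper's QPCD rule).

    Starting from one arbitrary center, assign each point to its nearest
    center. If the nearest center is farther than `radius`, the point
    becomes a new center. Centers are then moved to the mean of their
    points, and centers with no points are discarded.
    """
    centers = [start]
    for _ in range(max_iter):
        groups = [[] for _ in centers]
        for p in points:
            d, i = min((math.dist(p, c), i) for i, c in enumerate(centers))
            if d > radius:
                # No existing center is appropriate for this point.
                centers.append(p)
                groups.append([p])
            else:
                groups[i].append(p)
        updated = [mean(g) for g in groups if g]
        if updated == centers:
            break  # assignments and centers no longer change
        centers = updated
    return centers


# One simulated trial with assumed settings: three well-separated clusters.
rng = random.Random(0)
true_centers = [(0.0, 0.0), (10.0, 0.0), (0.0, 10.0)]
points = sample_blobs(true_centers, n_per_cluster=100, sigma=0.3, rng=rng)

# The starting center is deliberately far from all of the data.
found = discover_centers(points, start=(100.0, 100.0), radius=3.0)
print(len(found), found)
```

Note that this sketch replaces the cluster count with a distance scale (`radius`) that still has to be chosen, so it only illustrates the behavior the abstract describes (arbitrary start, discovered count, centers near the true ones) rather than reproducing it. Repeating the trial over many seeds and recording how often the true count is recovered would turn it into a fuller Monte Carlo evaluation.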
Author Wang, Shijun
Jiang, Kunpeng
Qu, Haipeng
Yang, Kun
Li, Miao
Author_xml – sequence: 1
  givenname: Kunpeng
  surname: Jiang
  fullname: Jiang, Kunpeng
  organization: Zhengzhou University of Science and Technology (China)
– sequence: 2
  givenname: Kun
  surname: Yang
  fullname: Yang, Kun
  organization: Zhengzhou University of Science and Technology (China)
– sequence: 3
  givenname: Haipeng
  surname: Qu
  fullname: Qu, Haipeng
  organization: Zhengzhou University of Science and Technology (China)
– sequence: 4
  givenname: Miao
  surname: Li
  fullname: Li, Miao
  organization: Zhengzhou University of Science and Technology (China)
– sequence: 5
  givenname: Shijun
  surname: Wang
  fullname: Wang, Shijun
  organization: Zhengzhou University of Science and Technology (China)
ContentType Conference Proceeding
Copyright COPYRIGHT SPIE. Downloading of the abstract is permitted for personal use only.
Copyright_xml – notice: COPYRIGHT SPIE. Downloading of the abstract is permitted for personal use only.
DOI 10.1117/12.2660137
DatabaseTitleList
DeliveryMethod fulltext_linktorsrc
Discipline Engineering
Editor Yang, Ting
Zhang, Tao
Editor_xml – sequence: 1
  givenname: Tao
  surname: Zhang
  fullname: Zhang, Tao
  organization: North China University of Technology (China)
– sequence: 2
  givenname: Ting
  surname: Yang
  fullname: Yang, Ting
  organization: Tianjin University (China)
EndPage 125112A-6
ExternalDocumentID 10_1117_12_2660137
GroupedDBID 29O
4.4
5SJ
ACGFS
ALMA_UNASSIGNED_HOLDINGS
EBS
F5P
FQ0
R.2
RNS
RSJ
SPBNH
UT2
ID FETCH-LOGICAL-s1057-a582abea432d55a0412d8cff5c0fd382d6a5442ff86e87583e6b891f0ff110c83
ISBN 9781510661363
1510661360
ISSN 0277-786X
IngestDate Wed Apr 12 04:40:34 EDT 2023
IsPeerReviewed false
IsScholarly true
Language English
LinkModel OpenURL
MergedId FETCHMERGED-LOGICAL-s1057-a582abea432d55a0412d8cff5c0fd382d6a5442ff86e87583e6b891f0ff110c83
Notes Conference Location: Hulun Buir, China
Conference Date: 2022-08-19|2022-08-21
ParticipantIDs spie_proceedings_10_1117_12_2660137
PublicationCentury 2000
PublicationDate 20230203
PublicationDateYYYYMMDD 2023-02-03
PublicationDate_xml – month: 2
  year: 2023
  text: 20230203
  day: 3
PublicationDecade 2020
PublicationYear 2023
Publisher SPIE
Publisher_xml – name: SPIE
SSID ssj0028579
ssib050757843
Score 2.211397
Snippet It is called unsupervised learning to solve various problems in pattern recognition based on training samples with unknown categories (unlabeled). Clustering...
SourceID spie
SourceType Publisher
StartPage 125112A
Title A model-based algorithm for quantity and parameters of clusters discovery
URI http://www.dx.doi.org/10.1117/12.2660137
Volume 12511
hasFullText 1
inHoldings 1
isFullTextHit
isPrint
linkProvider EBSCOhost