Performance Comparison of Bagging Classifiers Between Multi-Core And Single-Core Processors Using Python Multiprocessing

In recent years, the demand for efficient and scalable machine learning algorithms has surged. Bagging (Bootstrap Aggregating) stands out as a widely used ensemble technique that combines multiple base classifiers to enhance predictive accuracy and mitigate over fitting. However, when implementing b...

Full description

Saved in:
Bibliographic Details
Published in2024 2nd International Conference on Advancements and Key Challenges in Green Energy and Computing (AKGEC) pp. 1 - 6
Main Authors Rawat, Aman, Krishna, Charu, Kumar, Divya
Format Conference Proceeding
LanguageEnglish
Published IEEE 21.11.2024
Subjects
Online AccessGet full text
DOI10.1109/AKGEC62572.2024.10867840

Cover

Abstract In recent years, the demand for efficient and scalable machine learning algorithms has surged. Bagging (Bootstrap Aggregating) stands out as a widely used ensemble technique that combines multiple base classifiers to enhance predictive accuracy and mitigate over fitting. However, when implementing bagging classifiers in Python, a crucial consideration lies in their effective utilization of multi-core processors. Python, as a language, faces parallelism limitations due to the Global Interpreter Lock (GIL) present in its C implementation (cPython). The GIL restricts Python threads from fully exploiting multiple CPU cores, thereby limiting their concurrency. Consequently, relying solely on regular threads for concurrency would not fully harness the potential of multi-core processors. Python, as a language, faces parallelism limitations due to the Global Interpreter Lock (GIL) present in its C implementation (cPython). The GIL restricts Python threads from fully exploiting multiple CPU cores, thereby limiting their concurrency. Consequently, relying solely on regular threads for concurrency would not fully harness the potential of multi-core processors. To overcome this limitation, we recommend leveraging Python's multiprocessing module. Unlike threads, which share the same memory space and are subject to the GIL, multiprocessing creates separate processes, each with its own Python interpreter instance. As a result, these processes can run in parallel, effectively utilizing multiple CPU cores. Our study delves into the performance comparison of bagging classifiers on multi-core processors versus single-core using Python multiprocessing. Our experiments reveal that multiprocessing significantly improves execution time, particularly for CPU-bound tasks. By leveraging multiple cores, bagging classifiers can better generalize to unseen data and achieve enhanced predictive accuracy.
AbstractList In recent years, the demand for efficient and scalable machine learning algorithms has surged. Bagging (Bootstrap Aggregating) stands out as a widely used ensemble technique that combines multiple base classifiers to enhance predictive accuracy and mitigate over fitting. However, when implementing bagging classifiers in Python, a crucial consideration lies in their effective utilization of multi-core processors. Python, as a language, faces parallelism limitations due to the Global Interpreter Lock (GIL) present in its C implementation (cPython). The GIL restricts Python threads from fully exploiting multiple CPU cores, thereby limiting their concurrency. Consequently, relying solely on regular threads for concurrency would not fully harness the potential of multi-core processors. Python, as a language, faces parallelism limitations due to the Global Interpreter Lock (GIL) present in its C implementation (cPython). The GIL restricts Python threads from fully exploiting multiple CPU cores, thereby limiting their concurrency. Consequently, relying solely on regular threads for concurrency would not fully harness the potential of multi-core processors. To overcome this limitation, we recommend leveraging Python's multiprocessing module. Unlike threads, which share the same memory space and are subject to the GIL, multiprocessing creates separate processes, each with its own Python interpreter instance. As a result, these processes can run in parallel, effectively utilizing multiple CPU cores. Our study delves into the performance comparison of bagging classifiers on multi-core processors versus single-core using Python multiprocessing. Our experiments reveal that multiprocessing significantly improves execution time, particularly for CPU-bound tasks. By leveraging multiple cores, bagging classifiers can better generalize to unseen data and achieve enhanced predictive accuracy.
Author Kumar, Divya
Rawat, Aman
Krishna, Charu
Author_xml – sequence: 1
  givenname: Aman
  surname: Rawat
  fullname: Rawat, Aman
  email: amanrawat262@gmail.com
  organization: MNNIT Allahabad,Computer Science Department,Prayagraj,India
– sequence: 2
  givenname: Charu
  surname: Krishna
  fullname: Krishna, Charu
  email: charu.2020rcs02@mnnit.ac.in
  organization: MNNIT Allahabad,Computer Science Department,Prayagraj,India
– sequence: 3
  givenname: Divya
  surname: Kumar
  fullname: Kumar, Divya
  email: divyak@mnnit.ac.in
  organization: MNNIT Allahabad,Computer Science Department,Prayagraj,India
BookMark eNqFjkFrwkAUhFewB635Bz28P2DcTdRkj7pEhVII2J5lsS_pQrIv7Eba_HsjSc-ehplvBmbOppYsMgaCh0Jwudq9HzO1jTZJFEY8WoeCp9skXfMJC2Qi03jDY8lFImfsL0dXkKu1vSIoqhvtjCcLVMBel6WxJahKe28Kg87DHttfRAsft6o1S0UOYWe_4dz3Khx87uiK3lPf_vKPfd61PzROmgH28YK9FLryGIz6yt4O2ac6LQ0iXhpnau26y__x-Am-AwsCTqk
ContentType Conference Proceeding
DBID 6IE
6IL
CBEJK
RIE
RIL
DOI 10.1109/AKGEC62572.2024.10867840
DatabaseName IEEE Electronic Library (IEL) Conference Proceedings
IEEE Xplore POP ALL
IEEE Xplore All Conference Proceedings
IEEE Xplore (NTUSG)
IEEE Proceedings Order Plans (POP All) 1998-Present
DatabaseTitleList
Database_xml – sequence: 1
  dbid: RIE
  name: IEEE Electronic Library (IEL)
  url: https://proxy.k.utb.cz/login?url=https://ieeexplore.ieee.org/
  sourceTypes: Publisher
DeliveryMethod fulltext_linktorsrc
EISBN 9798350390179
EndPage 6
ExternalDocumentID 10867840
Genre orig-research
GroupedDBID 6IE
6IL
CBEJK
RIE
RIL
ID FETCH-ieee_primary_108678403
IEDL.DBID RIE
IngestDate Wed Feb 26 09:44:16 EST 2025
IsPeerReviewed false
IsScholarly false
Language English
LinkModel DirectLink
MergedId FETCHMERGED-ieee_primary_108678403
ParticipantIDs ieee_primary_10867840
PublicationCentury 2000
PublicationDate 2024-Nov.-21
PublicationDateYYYYMMDD 2024-11-21
PublicationDate_xml – month: 11
  year: 2024
  text: 2024-Nov.-21
  day: 21
PublicationDecade 2020
PublicationTitle 2024 2nd International Conference on Advancements and Key Challenges in Green Energy and Computing (AKGEC)
PublicationTitleAbbrev AKGEC
PublicationYear 2024
Publisher IEEE
Publisher_xml – name: IEEE
Score 3.7856188
Snippet In recent years, the demand for efficient and scalable machine learning algorithms has surged. Bagging (Bootstrap Aggregating) stands out as a widely used...
SourceID ieee
SourceType Publisher
StartPage 1
SubjectTerms Accuracy
Bagging
bagging classifier
Concurrent computing
ensemble learning
Faces
global interpreter lock
Instruction sets
Limiting
Machine learning algorithms
multi-core
Multicore processing
Parallel processing
python multiprocessing
single core
Training
Title Performance Comparison of Bagging Classifiers Between Multi-Core And Single-Core Processors Using Python Multiprocessing
URI https://ieeexplore.ieee.org/document/10867840
hasFullText 1
inHoldings 1
isFullTextHit
isPrint
link http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwjV1LS8NAEB5sT55UjPiosgevSeu62bTHNvSBQgmo0FvZV0SUpEgC6q93Z7dpsSh4y-5OsgNzmNnN930DcG3rNZYbmoSDnKqQcS5CIZW2ARGSMxVrI5za55zPntjdIl6syeqOC2OMceAzE-Gj-5evS1XjVVkXuwIl9kTSglbS556s1aBzeoPu8H46Tm09nyDBirKoMf_ROMXljckBzJsdPVzkNaorGamvHTHGf7t0CMGWokeyTfI5gj1THMNHtqUBkHTTYZCUORkJvFp-Jq4J5kuODbDJyGO0iCPhhqndmgwLTR6s3Zvx4zWPoLTWDlxAsk8UG_CvrPyinQ6gMxk_prMQXV-uvILFsvH69gTaRVmYUyBC9aVGvUamOYt7SuaoNW_TvS0pE0HpGQS_fuL8j_kL2McgIG2P3nSgXb3X5tLm70peubh9A_GWon4
linkProvider IEEE
linkToHtml http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwjV1LS8NAEB60HvSkYsRH1T14TVrXTWKPbWiNtoaAFXoL-4pIJSmSgPrr3UfToih4Sza72YE5zGTzPQAuVb9GcolDt5dj7pIgoC5lXKiEUBYQ7gtJjdpnEsRP5H7mz5ZkdcOFkVIa8Jn09KX5ly9KXuujso52BQrVF8kmbPmEEN_StRp8TrfX6Y9vh5Hq6ENNscLEaxZ8s04xlWO0C0mzpwWMzL26Yh7__CHH-O-g9sBZk_RQuio_-7AhiwN4T9dEABStPAZRmaMB1YfLz8jYYL7k2gIbDSxKCxkarhuprVG_EOhRzXuV9n7JJCjVbAMvQOmHlhuwSxb2oRp2oD0aTqPY1aFnC6thkTVRXx9CqygLeQSI8hsmtGIjEQHxu5zlWm1eFXzVVIYU42Nwfn3FyR_jF7AdTx8m2eQuGZ_Cjk6IJvHhqza0qrdanqlqXrFzk8MvnWClyw
openUrl ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Abook&rft.genre=proceeding&rft.title=2024+2nd+International+Conference+on+Advancements+and+Key+Challenges+in+Green+Energy+and+Computing+%28AKGEC%29&rft.atitle=Performance+Comparison+of+Bagging+Classifiers+Between+Multi-Core+And+Single-Core+Processors+Using+Python+Multiprocessing&rft.au=Rawat%2C+Aman&rft.au=Krishna%2C+Charu&rft.au=Kumar%2C+Divya&rft.date=2024-11-21&rft.pub=IEEE&rft.spage=1&rft.epage=6&rft_id=info:doi/10.1109%2FAKGEC62572.2024.10867840&rft.externalDocID=10867840