Performance Comparison of Bagging Classifiers Between Multi-Core And Single-Core Processors Using Python Multiprocessing
In recent years, the demand for efficient and scalable machine learning algorithms has surged. Bagging (Bootstrap Aggregating) stands out as a widely used ensemble technique that combines multiple base classifiers to enhance predictive accuracy and mitigate over fitting. However, when implementing b...
Saved in:
| Published in | 2024 2nd International Conference on Advancements and Key Challenges in Green Energy and Computing (AKGEC) pp. 1 - 6 |
|---|---|
| Main Authors | , , |
| Format | Conference Proceeding |
| Language | English |
| Published |
IEEE
21.11.2024
|
| Subjects | |
| Online Access | Get full text |
| DOI | 10.1109/AKGEC62572.2024.10867840 |
Cover
| Abstract | In recent years, the demand for efficient and scalable machine learning algorithms has surged. Bagging (Bootstrap Aggregating) stands out as a widely used ensemble technique that combines multiple base classifiers to enhance predictive accuracy and mitigate over fitting. However, when implementing bagging classifiers in Python, a crucial consideration lies in their effective utilization of multi-core processors. Python, as a language, faces parallelism limitations due to the Global Interpreter Lock (GIL) present in its C implementation (cPython). The GIL restricts Python threads from fully exploiting multiple CPU cores, thereby limiting their concurrency. Consequently, relying solely on regular threads for concurrency would not fully harness the potential of multi-core processors. Python, as a language, faces parallelism limitations due to the Global Interpreter Lock (GIL) present in its C implementation (cPython). The GIL restricts Python threads from fully exploiting multiple CPU cores, thereby limiting their concurrency. Consequently, relying solely on regular threads for concurrency would not fully harness the potential of multi-core processors. To overcome this limitation, we recommend leveraging Python's multiprocessing module. Unlike threads, which share the same memory space and are subject to the GIL, multiprocessing creates separate processes, each with its own Python interpreter instance. As a result, these processes can run in parallel, effectively utilizing multiple CPU cores. Our study delves into the performance comparison of bagging classifiers on multi-core processors versus single-core using Python multiprocessing. Our experiments reveal that multiprocessing significantly improves execution time, particularly for CPU-bound tasks. By leveraging multiple cores, bagging classifiers can better generalize to unseen data and achieve enhanced predictive accuracy. |
|---|---|
| AbstractList | In recent years, the demand for efficient and scalable machine learning algorithms has surged. Bagging (Bootstrap Aggregating) stands out as a widely used ensemble technique that combines multiple base classifiers to enhance predictive accuracy and mitigate over fitting. However, when implementing bagging classifiers in Python, a crucial consideration lies in their effective utilization of multi-core processors. Python, as a language, faces parallelism limitations due to the Global Interpreter Lock (GIL) present in its C implementation (cPython). The GIL restricts Python threads from fully exploiting multiple CPU cores, thereby limiting their concurrency. Consequently, relying solely on regular threads for concurrency would not fully harness the potential of multi-core processors. Python, as a language, faces parallelism limitations due to the Global Interpreter Lock (GIL) present in its C implementation (cPython). The GIL restricts Python threads from fully exploiting multiple CPU cores, thereby limiting their concurrency. Consequently, relying solely on regular threads for concurrency would not fully harness the potential of multi-core processors. To overcome this limitation, we recommend leveraging Python's multiprocessing module. Unlike threads, which share the same memory space and are subject to the GIL, multiprocessing creates separate processes, each with its own Python interpreter instance. As a result, these processes can run in parallel, effectively utilizing multiple CPU cores. Our study delves into the performance comparison of bagging classifiers on multi-core processors versus single-core using Python multiprocessing. Our experiments reveal that multiprocessing significantly improves execution time, particularly for CPU-bound tasks. By leveraging multiple cores, bagging classifiers can better generalize to unseen data and achieve enhanced predictive accuracy. |
| Author | Kumar, Divya Rawat, Aman Krishna, Charu |
| Author_xml | – sequence: 1 givenname: Aman surname: Rawat fullname: Rawat, Aman email: amanrawat262@gmail.com organization: MNNIT Allahabad,Computer Science Department,Prayagraj,India – sequence: 2 givenname: Charu surname: Krishna fullname: Krishna, Charu email: charu.2020rcs02@mnnit.ac.in organization: MNNIT Allahabad,Computer Science Department,Prayagraj,India – sequence: 3 givenname: Divya surname: Kumar fullname: Kumar, Divya email: divyak@mnnit.ac.in organization: MNNIT Allahabad,Computer Science Department,Prayagraj,India |
| BookMark | eNqFjkFrwkAUhFewB635Bz28P2DcTdRkj7pEhVII2J5lsS_pQrIv7Eba_HsjSc-ehplvBmbOppYsMgaCh0Jwudq9HzO1jTZJFEY8WoeCp9skXfMJC2Qi03jDY8lFImfsL0dXkKu1vSIoqhvtjCcLVMBel6WxJahKe28Kg87DHttfRAsft6o1S0UOYWe_4dz3Khx87uiK3lPf_vKPfd61PzROmgH28YK9FLryGIz6yt4O2ac6LQ0iXhpnau26y__x-Am-AwsCTqk |
| ContentType | Conference Proceeding |
| DBID | 6IE 6IL CBEJK RIE RIL |
| DOI | 10.1109/AKGEC62572.2024.10867840 |
| DatabaseName | IEEE Electronic Library (IEL) Conference Proceedings IEEE Xplore POP ALL IEEE Xplore All Conference Proceedings IEEE Xplore (NTUSG) IEEE Proceedings Order Plans (POP All) 1998-Present |
| DatabaseTitleList | |
| Database_xml | – sequence: 1 dbid: RIE name: IEEE Electronic Library (IEL) url: https://proxy.k.utb.cz/login?url=https://ieeexplore.ieee.org/ sourceTypes: Publisher |
| DeliveryMethod | fulltext_linktorsrc |
| EISBN | 9798350390179 |
| EndPage | 6 |
| ExternalDocumentID | 10867840 |
| Genre | orig-research |
| GroupedDBID | 6IE 6IL CBEJK RIE RIL |
| ID | FETCH-ieee_primary_108678403 |
| IEDL.DBID | RIE |
| IngestDate | Wed Feb 26 09:44:16 EST 2025 |
| IsPeerReviewed | false |
| IsScholarly | false |
| Language | English |
| LinkModel | DirectLink |
| MergedId | FETCHMERGED-ieee_primary_108678403 |
| ParticipantIDs | ieee_primary_10867840 |
| PublicationCentury | 2000 |
| PublicationDate | 2024-Nov.-21 |
| PublicationDateYYYYMMDD | 2024-11-21 |
| PublicationDate_xml | – month: 11 year: 2024 text: 2024-Nov.-21 day: 21 |
| PublicationDecade | 2020 |
| PublicationTitle | 2024 2nd International Conference on Advancements and Key Challenges in Green Energy and Computing (AKGEC) |
| PublicationTitleAbbrev | AKGEC |
| PublicationYear | 2024 |
| Publisher | IEEE |
| Publisher_xml | – name: IEEE |
| Score | 3.7856188 |
| Snippet | In recent years, the demand for efficient and scalable machine learning algorithms has surged. Bagging (Bootstrap Aggregating) stands out as a widely used... |
| SourceID | ieee |
| SourceType | Publisher |
| StartPage | 1 |
| SubjectTerms | Accuracy Bagging bagging classifier Concurrent computing ensemble learning Faces global interpreter lock Instruction sets Limiting Machine learning algorithms multi-core Multicore processing Parallel processing python multiprocessing single core Training |
| Title | Performance Comparison of Bagging Classifiers Between Multi-Core And Single-Core Processors Using Python Multiprocessing |
| URI | https://ieeexplore.ieee.org/document/10867840 |
| hasFullText | 1 |
| inHoldings | 1 |
| isFullTextHit | |
| isPrint | |
| link | http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwjV1LS8NAEB5sT55UjPiosgevSeu62bTHNvSBQgmo0FvZV0SUpEgC6q93Z7dpsSh4y-5OsgNzmNnN930DcG3rNZYbmoSDnKqQcS5CIZW2ARGSMxVrI5za55zPntjdIl6syeqOC2OMceAzE-Gj-5evS1XjVVkXuwIl9kTSglbS556s1aBzeoPu8H46Tm09nyDBirKoMf_ROMXljckBzJsdPVzkNaorGamvHTHGf7t0CMGWokeyTfI5gj1THMNHtqUBkHTTYZCUORkJvFp-Jq4J5kuODbDJyGO0iCPhhqndmgwLTR6s3Zvx4zWPoLTWDlxAsk8UG_CvrPyinQ6gMxk_prMQXV-uvILFsvH69gTaRVmYUyBC9aVGvUamOYt7SuaoNW_TvS0pE0HpGQS_fuL8j_kL2McgIG2P3nSgXb3X5tLm70peubh9A_GWon4 |
| linkProvider | IEEE |
| linkToHtml | http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwjV1LS8NAEB60HvSkYsRH1T14TVrXTWKPbWiNtoaAFXoL-4pIJSmSgPrr3UfToih4Sza72YE5zGTzPQAuVb9GcolDt5dj7pIgoC5lXKiEUBYQ7gtJjdpnEsRP5H7mz5ZkdcOFkVIa8Jn09KX5ly9KXuujso52BQrVF8kmbPmEEN_StRp8TrfX6Y9vh5Hq6ENNscLEaxZ8s04xlWO0C0mzpwWMzL26Yh7__CHH-O-g9sBZk_RQuio_-7AhiwN4T9dEABStPAZRmaMB1YfLz8jYYL7k2gIbDSxKCxkarhuprVG_EOhRzXuV9n7JJCjVbAMvQOmHlhuwSxb2oRp2oD0aTqPY1aFnC6thkTVRXx9CqygLeQSI8hsmtGIjEQHxu5zlWm1eFXzVVIYU42Nwfn3FyR_jF7AdTx8m2eQuGZ_Cjk6IJvHhqza0qrdanqlqXrFzk8MvnWClyw |
| openUrl | ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Abook&rft.genre=proceeding&rft.title=2024+2nd+International+Conference+on+Advancements+and+Key+Challenges+in+Green+Energy+and+Computing+%28AKGEC%29&rft.atitle=Performance+Comparison+of+Bagging+Classifiers+Between+Multi-Core+And+Single-Core+Processors+Using+Python+Multiprocessing&rft.au=Rawat%2C+Aman&rft.au=Krishna%2C+Charu&rft.au=Kumar%2C+Divya&rft.date=2024-11-21&rft.pub=IEEE&rft.spage=1&rft.epage=6&rft_id=info:doi/10.1109%2FAKGEC62572.2024.10867840&rft.externalDocID=10867840 |