Memory Dependence Speculation for Simultaneous Multi-Threading Processors
Simultaneous Multi-Threading (SMT) processors provide improvement over the traditional out-of-order superscalar architecture by allowing instructions from several independent threads to execute out-of-order concurrently. Maintaining the accuracy of values read from and written to memory is a great b...
Saved in:
| Published in | Parallel processing letters Vol. 34; no. 2 |
|---|---|
| Main Authors | , |
| Format | Journal Article |
| Language | English |
| Published |
Singapore
World Scientific Publishing Company
01.06.2024
World Scientific Publishing Co. Pte., Ltd |
| Subjects | |
| Online Access | Get full text |
| ISSN | 0129-6264 1793-642X |
| DOI | 10.1142/S0129626424500014 |
Cover
| Abstract | Simultaneous Multi-Threading (SMT) processors provide improvement over the traditional out-of-order superscalar architecture by allowing instructions from several independent threads to execute out-of-order concurrently. Maintaining the accuracy of values read from and written to memory is a great bottleneck for processor performance, as loads must stall execution until all prior store addresses are known or risk reading invalid data. Prior research in this area has mainly focused on superscalar architecture, as such, it is only natural to extend memory dependence speculation techniques to an SMT architecture. In this paper, we allow for loads among threads to execute as soon as their addresses are resolved without checking for prior memory address conflicts. Stores also perform a check on all later loads to see if any read was too early due to an address match, if so, the processor state is recovered, and the load re-issued. This aggressive technique allows for the greatest potential instructions per clock cycle gains over predictive techniques as the pipeline is never stalled for loads. Our simulations show that an overall IPC gain up to 12% and 10% is possible for both 4-threaded and 8-threaded workloads respectively. Conversely, a maximum overall IPC loss of at least 2.3% and 2% for 4-threaded and 8-threaded workloads respectively was also observed. |
|---|---|
| AbstractList | Simultaneous Multi-Threading (SMT) processors provide improvement over the traditional out-of-order superscalar architecture by allowing instructions from several independent threads to execute out-of-order concurrently. Maintaining the accuracy of values read from and written to memory is a great bottleneck for processor performance, as loads must stall execution until all prior store addresses are known or risk reading invalid data. Prior research in this area has mainly focused on superscalar architecture, as such, it is only natural to extend memory dependence speculation techniques to an SMT architecture. In this paper, we allow for loads among threads to execute as soon as their addresses are resolved without checking for prior memory address conflicts. Stores also perform a check on all later loads to see if any read was too early due to an address match, if so, the processor state is recovered, and the load re-issued. This aggressive technique allows for the greatest potential instructions per clock cycle gains over predictive techniques as the pipeline is never stalled for loads. Our simulations show that an overall IPC gain up to 12% and 10% is possible for both 4-threaded and 8-threaded workloads respectively. Conversely, a maximum overall IPC loss of at least 2.3% and 2% for 4-threaded and 8-threaded workloads respectively was also observed. |
| Author | Lin, Wei-Ming Flores, Jonathan |
| Author_xml | – sequence: 1 givenname: Jonathan surname: Flores fullname: Flores, Jonathan – sequence: 2 givenname: Wei-Ming surname: Lin fullname: Lin, Wei-Ming |
| BookMark | eNplkFFLwzAUhYNMcJv-AN8KPleTNk3WR5k6BxsKm-BbuaY3mtElNWmR_XtTJr7s6V7uOd-5cCZkZJ1FQq4ZvWWMZ3cbyrJSZIJnvKCUMn5GxkyWeRov7yMyHuR00C_IJITd4JCcjslyjXvnD8kDtmhrtAqTTYuqb6Azziba-WRj9n3TgUXXh2QdV5NuvzxCbexn8uqdwhCcD5fkXEMT8OpvTsnb0-N2_pyuXhbL-f0qVZkQPAVQGgBKOuMfWlKObCYFLTBnQmkJrAauRIkQFawZFExqKXiNGQXBKWf5lNwcc1vvvnsMXbVzvbfxZRUzClnIGc-jix1dyrsQPOqq9WYP_lAxWg2NVSeNRYYemR_nmzoog7Yz2qh_9BT5BdZ1b48 |
| Cites_doi | 10.1145/3239567 10.1016/S0141-9331(98)00086-6 10.1145/1596510.1596514 |
| ContentType | Journal Article |
| Copyright | 2024, World Scientific Publishing Company 2024. World Scientific Publishing Company |
| Copyright_xml | – notice: 2024, World Scientific Publishing Company – notice: 2024. World Scientific Publishing Company |
| DBID | AAYXX CITATION 7SP 8FD L7M |
| DOI | 10.1142/S0129626424500014 |
| DatabaseName | CrossRef Electronics & Communications Abstracts Technology Research Database Advanced Technologies Database with Aerospace |
| DatabaseTitle | CrossRef Technology Research Database Advanced Technologies Database with Aerospace Electronics & Communications Abstracts |
| DatabaseTitleList | CrossRef Technology Research Database |
| DeliveryMethod | fulltext_linktorsrc |
| Discipline | Computer Science |
| EISSN | 1793-642X |
| ExternalDocumentID | 10_1142_S0129626424500014 S0129626424500014 |
| GroupedDBID | -~X .DC 0R~ 123 4.4 8V8 ADSJI ALMA_UNASSIGNED_HOLDINGS CAG COF CS3 DU5 EBS EJD ESX HZ~ H~9 K1G O9- P2P P71 PQQKQ RWJ WSC AAYXX AMVHM CITATION 7SP 8FD L7M |
| ID | FETCH-LOGICAL-c2664-aacfaaa9084bf704e187605e316cf7a1da4c69ea04eed1a517f764de20a640413 |
| ISSN | 0129-6264 |
| IngestDate | Mon Jun 30 12:44:39 EDT 2025 Tue Jul 01 00:34:07 EDT 2025 Fri Aug 23 08:19:25 EDT 2024 |
| IsPeerReviewed | true |
| IsScholarly | true |
| Issue | 2 |
| Keywords | speculation Multi-threading SMT |
| Language | English |
| LinkModel | OpenURL |
| MergedId | FETCHMERGED-LOGICAL-c2664-aacfaaa9084bf704e187605e316cf7a1da4c69ea04eed1a517f764de20a640413 |
| Notes | ObjectType-Article-1 SourceType-Scholarly Journals-1 ObjectType-Feature-2 content type line 14 |
| ORCID | 0009-0006-6983-606X |
| PQID | 3165757843 |
| PQPubID | 4437618 |
| ParticipantIDs | worldscientific_primary_S0129626424500014 proquest_journals_3165757843 crossref_primary_10_1142_S0129626424500014 |
| ProviderPackageCode | CITATION AAYXX |
| PublicationCentury | 2000 |
| PublicationDate | 20240600 2024-06-00 20240601 |
| PublicationDateYYYYMMDD | 2024-06-01 |
| PublicationDate_xml | – month: 06 year: 2024 text: 20240600 |
| PublicationDecade | 2020 |
| PublicationPlace | Singapore |
| PublicationPlace_xml | – name: Singapore |
| PublicationTitle | Parallel processing letters |
| PublicationYear | 2024 |
| Publisher | World Scientific Publishing Company World Scientific Publishing Co. Pte., Ltd |
| Publisher_xml | – name: World Scientific Publishing Company – name: World Scientific Publishing Co. Pte., Ltd |
| References | Calder B. (S0129626424500014BIB002) 2000 Tullsen D. M. (S0129626424500014BIB001) Önder S. (S0129626424500014BIB011) S0129626424500014BIB006 S0129626424500014BIB004 Moshovos A. (S0129626424500014BIB003) S0129626424500014BIB005 Sherwood T. (S0129626424500014BIB010) Tyson G. S. (S0129626424500014BIB007) |
| References_xml | – ident: S0129626424500014BIB004 doi: 10.1145/3239567 – volume-title: Proc. of the 10th International Conference on Architectural Support for Programming Languages and Operating Systems ident: S0129626424500014BIB010 – volume-title: Proc. of 22nd Annual International Symposium on Computer Architecture ident: S0129626424500014BIB001 – ident: S0129626424500014BIB005 doi: 10.1016/S0141-9331(98)00086-6 – volume-title: Proc. Sixth International Symposium on High-Performance Computer Architecture ident: S0129626424500014BIB003 – year: 2000 ident: S0129626424500014BIB002 publication-title: Journal of Instruction-Level Parallelism – volume-title: Proc. of Micro-30 ident: S0129626424500014BIB007 – volume-title: Proc. of the 2002 International Conference on Parallel Architectures and Compilation Techniques ident: S0129626424500014BIB011 – ident: S0129626424500014BIB006 doi: 10.1145/1596510.1596514 |
| SSID | ssj0014740 ssib019635363 |
| Score | 2.321047 |
| Snippet | Simultaneous Multi-Threading (SMT) processors provide improvement over the traditional out-of-order superscalar architecture by allowing instructions from... |
| SourceID | proquest crossref worldscientific |
| SourceType | Aggregation Database Index Database Publisher |
| SubjectTerms | Computer architecture Microprocessors Processors Read-only memory devices Workload Workloads |
| Title | Memory Dependence Speculation for Simultaneous Multi-Threading Processors |
| URI | http://www.worldscientific.com/doi/abs/10.1142/S0129626424500014 https://www.proquest.com/docview/3165757843 |
| Volume | 34 |
| hasFullText | 1 |
| inHoldings | 1 |
| isFullTextHit | |
| isPrint | |
| journalDatabaseRights | – providerCode: PRVEBS databaseName: EBSCOhost Mathematics Source - HOST customDbUrl: eissn: 1793-642X dateEnd: 20241105 omitProxy: false ssIdentifier: ssj0014740 issn: 0129-6264 databaseCode: AMVHM dateStart: 19910901 isFulltext: true titleUrlDefault: https://www.ebsco.com/products/research-databases/mathematics-source providerName: EBSCOhost |
| link | http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwnV3dT9swELdYedkLY2No3WDyw4S0IXe268T0ESFQh1qGRDr1LXIcR6qE2kK7h-2v5_yROKPTxHiJqqQ6OXeX8_nufncIfSokK2Gj0kSKQUkEoxUpWCVIpQvdL6md7WOxw-OrdDgRl9Nk2kJcW3TJuujp33_FlTxHqnAP5GpRsv8h2YYo3IDfIF-4goTh-iQZj22Z7C-wGX6QrW2quXTDbpsCwpuZrRhUc2MrXR3YlmQgPVc4X4MEFiGhE1zUa3Vv56vcHi_9Y_vPWwf6iVAROOSbVTv4HhM8MzIOY1JGoal3iClwEWufvOVydTzOuLiCpXZI7GzRO75eG2e6YuGRC03aQhrue5L3jDen8PUTOOFM2_Y2BC9n8di7acYFd4lkIGkpcpG4s1zcs-o8_dX3_GIyGuXZ-TQ7Wt4RO03MZt3DaJUXaJuDtacdtH06_jFs4jjW7CSu8VjINwnpoLPNO4T8N6zj68Yq_vRg4rFkx_W4XTVca_kp2S7aCQcMfOq15TXaMvM36FU9vAMHW76HvnnlwVF5cEt5MCgPbisPfqQ8OCrPWzS5OM_OhiTM1SAa3DFBlNKVUmpAT0RRSSoMgy2RJqbPUl1JxUoldDowCp6YkqmEyUqmojScqlRQ8Hr2UWe-mJt3CCtTUS7LQvLSAHtEkaaaAQ3dtylXKbroS82rfOnbp-QeCs_zDcZ20UHNzTx8ZascFpXYmQui30WfH3G4IblB6v2_SX1AL6PaH6DO-v6nOQTncl18DJryAOlTeig |
| linkProvider | EBSCOhost |
| openUrl | ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Ajournal&rft.genre=article&rft.atitle=Memory+Dependence+Speculation+for+Simultaneous+Multi-Threading+Processors&rft.jtitle=Parallel+processing+letters&rft.au=Flores%2C+Jonathan&rft.au=Wei-Ming%2C+Lin&rft.date=2024-06-01&rft.pub=World+Scientific+Publishing+Co.+Pte.%2C+Ltd&rft.issn=0129-6264&rft.eissn=1793-642X&rft.volume=34&rft.issue=2&rft_id=info:doi/10.1142%2FS0129626424500014&rft.externalDBID=NO_FULL_TEXT |
| thumbnail_l | http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/lc.gif&issn=0129-6264&client=summon |
| thumbnail_m | http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/mc.gif&issn=0129-6264&client=summon |
| thumbnail_s | http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/sc.gif&issn=0129-6264&client=summon |