Memory Dependence Speculation for Simultaneous Multi-Threading Processors

Bibliographic Details
Published in Parallel processing letters Vol. 34; no. 2
Main Authors Flores, Jonathan; Lin, Wei-Ming
Format Journal Article
Language English
Published Singapore: World Scientific Publishing Company, 01.06.2024
World Scientific Publishing Co. Pte., Ltd
ISSN 0129-6264
EISSN 1793-642X
DOI 10.1142/S0129626424500014

Abstract Simultaneous Multi-Threading (SMT) processors improve on the traditional out-of-order superscalar architecture by allowing instructions from several independent threads to execute out of order concurrently. Maintaining the accuracy of values read from and written to memory is a major bottleneck for processor performance, because loads must either stall until all prior store addresses are known or risk reading invalid data. Prior research in this area has mainly focused on superscalar architectures, so it is natural to extend memory dependence speculation techniques to an SMT architecture. In this paper, we allow loads across threads to execute as soon as their addresses are resolved, without checking for conflicts with prior memory addresses. Each store then checks all later loads to see whether any of them read too early, as indicated by an address match; if so, the processor state is recovered and the load is re-issued. This aggressive technique offers the greatest potential gain in instructions per clock cycle (IPC) over predictive techniques, as the pipeline is never stalled for loads. Our simulations show that an overall IPC gain of up to 12% and 10% is possible for 4-threaded and 8-threaded workloads, respectively. Conversely, a maximum overall IPC loss of at least 2.3% and 2% was also observed for 4-threaded and 8-threaded workloads, respectively.
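
The mechanism described in the abstract (loads issue without waiting for older stores, and each store checks younger loads for an address match) can be illustrated with a small Python sketch. This is not the authors' simulator: the class `SpeculativeLSQ`, its methods, and the per-thread sequence numbers are all hypothetical names chosen for illustration, and the model assumes one queue per hardware thread and treats recovery as a simple re-read of the affected load.

```python
from dataclasses import dataclass


@dataclass
class Load:
    seq: int           # program-order position within its thread (assumed)
    addr: int
    value: int = 0
    reissues: int = 0  # number of times the load was replayed


class SpeculativeLSQ:
    """Hypothetical per-thread load/store queue with blind load speculation."""

    def __init__(self, memory):
        self.memory = memory  # committed memory state: addr -> value
        self.stores = []      # executed, uncommitted stores: (seq, addr, value)
        self.loads = []       # executed loads that may still be replayed

    def _read(self, seq, addr):
        # Forward from the youngest executed store that is older than the
        # load; otherwise fall back to committed memory.
        older = [s for s in self.stores if s[0] < seq and s[1] == addr]
        if older:
            return max(older, key=lambda s: s[0])[2]
        return self.memory.get(addr, 0)

    def execute_load(self, seq, addr):
        # Issue as soon as the address is known; older stores with
        # unresolved addresses are not waited for.
        load = Load(seq, addr, self._read(seq, addr))
        self.loads.append(load)
        return load

    def execute_store(self, seq, addr, value):
        # Record the store, then scan younger loads that already executed.
        # An address match means the load may have read too early.
        self.stores.append((seq, addr, value))
        replayed = []
        for load in self.loads:
            if load.seq > seq and load.addr == addr:
                load.value = self._read(load.seq, load.addr)  # re-issue
                load.reissues += 1
                replayed.append(load)
        return replayed


lsq = SpeculativeLSQ({0x40: 1})
early = lsq.execute_load(seq=2, addr=0x40)      # runs ahead of the older store
stale = early.value                             # value read before the store
lsq.execute_store(seq=1, addr=0x40, value=7)    # detects the match, replays
```

In this sketch the load first reads the stale committed value and is corrected once the older store resolves its address. A real pipeline would also squash instructions that consumed the stale value, which is where the IPC losses mentioned in the abstract would come from; that cost is not modeled here.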
Copyright 2024, World Scientific Publishing Company
IsPeerReviewed true
IsScholarly true
Keywords speculation; Multi-threading; SMT
ORCID 0009-0006-6983-606X
Subject Terms Computer architecture; Microprocessors; Processors; Read-only memory devices; Workload; Workloads
URI http://www.worldscientific.com/doi/abs/10.1142/S0129626424500014
https://www.proquest.com/docview/3165757843