Synthesizing memory-level parallelism aware miniature clones for SPEC CPU2006 and ImplantBench workloads

We generate and provide miniature synthetic benchmark clones for modern workloads to solve two pre-silicon design challenges, namely: 1) huge simulation time (weeks to months) when using complete runs of modern workloads like SPEC CPU2006 having trillions of instructions on pre-silicon design models...

Full description

Saved in:
Bibliographic Details
Published in2010 IEEE International Symposium on Performance Analysis of Systems and Software pp. 33 - 44
Main Authors Ganesan, Karthik, Jungho Jo, John, Lizy K
Format Conference Proceeding
LanguageEnglish
Published IEEE 01.03.2010
Subjects
Online AccessGet full text
ISBN1424460239
9781424460236
DOI10.1109/ISPASS.2010.5452076

Cover

Abstract We generate and provide miniature synthetic benchmark clones for modern workloads to solve two pre-silicon design challenges, namely: 1) huge simulation time (weeks to months) when using complete runs of modern workloads like SPEC CPU2006 having trillions of instructions on pre-silicon design models 2) unavailability of access to their specific target applications for computer architects, as some of them are proprietary in nature and vendors hesitate to share them. We first provide a detailed characterization of the SPEC CPU2006 and the ImplantBench suites based on microarchitecture-independent metrics. Our metrics include the Memory Level Parallelism (MLP) of these workloads to estimate the burstiness of accesses to the main memory. Secondly, our proposed framework, that uses this characterized information (including MLP) to generate synthetic clones is explained and evaluated. We provide the synthetic clones generated for CPU2006 workloads for download and use. The efficacy of the synthetic clones for CPU2006 and ImplantBench is demonstrated by comparing their performance and power characteristics with their original counterparts. We show that the synthetic clones generated using our MLP-aware methodology have an error of only 2.8% in terms of Instruction Per Cycle (IPC) as compared to an error of 15.3% when using the previous MLP-unaware approaches for CPU2006. We also evaluate their effectiveness in assessing the change in performance and power consumption for various microarchitecture design changes. For CPU2006, with synthetics limited to 1 million dynamic instructions, the average correlation coefficient for assessing design changes for IPC is 0.95 (0.98 for power-per-cycle). For ImplantBench, we have an average error of 2.9% in assessing the IPC and the correlation coefficient for assessing design changes is 0.94 (0.97 for power-per-cycle).
AbstractList We generate and provide miniature synthetic benchmark clones for modern workloads to solve two pre-silicon design challenges, namely: 1) huge simulation time (weeks to months) when using complete runs of modern workloads like SPEC CPU2006 having trillions of instructions on pre-silicon design models 2) unavailability of access to their specific target applications for computer architects, as some of them are proprietary in nature and vendors hesitate to share them. We first provide a detailed characterization of the SPEC CPU2006 and the ImplantBench suites based on microarchitecture-independent metrics. Our metrics include the Memory Level Parallelism (MLP) of these workloads to estimate the burstiness of accesses to the main memory. Secondly, our proposed framework, that uses this characterized information (including MLP) to generate synthetic clones is explained and evaluated. We provide the synthetic clones generated for CPU2006 workloads for download and use. The efficacy of the synthetic clones for CPU2006 and ImplantBench is demonstrated by comparing their performance and power characteristics with their original counterparts. We show that the synthetic clones generated using our MLP-aware methodology have an error of only 2.8% in terms of Instruction Per Cycle (IPC) as compared to an error of 15.3% when using the previous MLP-unaware approaches for CPU2006. We also evaluate their effectiveness in assessing the change in performance and power consumption for various microarchitecture design changes. For CPU2006, with synthetics limited to 1 million dynamic instructions, the average correlation coefficient for assessing design changes for IPC is 0.95 (0.98 for power-per-cycle). For ImplantBench, we have an average error of 2.9% in assessing the IPC and the correlation coefficient for assessing design changes is 0.94 (0.97 for power-per-cycle).
Author Jungho Jo
Ganesan, Karthik
John, Lizy K
Author_xml – sequence: 1
  givenname: Karthik
  surname: Ganesan
  fullname: Ganesan, Karthik
  email: karthik@mail.utexas.edu
  organization: Dept. of Electr. & Comput. Eng., Univ. of Texas at Austin, Austin, TX, USA
– sequence: 2
  surname: Jungho Jo
  fullname: Jungho Jo
  email: jungho.jo@mail.utexas.edu
  organization: Dept. of Electr. & Comput. Eng., Univ. of Texas at Austin, Austin, TX, USA
– sequence: 3
  givenname: Lizy K
  surname: John
  fullname: John, Lizy K
  email: ljohn@ece.utexas.edu
  organization: Dept. of Electr. & Comput. Eng., Univ. of Texas at Austin, Austin, TX, USA
BookMark eNpFUNFKAzEQjKigrf2CvuQHrm5yuVzvsR5VCwULZ59LmuzZaC5XklOpX2_Agvsys8uwzMyIXPneIyFTBjPGoLpfNZtF08w4pEMhCg6lvCAjJrgQErgoL_-XvLohkxjfIU1SSshvyaE5-eGA0f5Y_0Y77Ppwyhx-oaNHFZRz6GzsqPpWAWlnvVXDZ2LaJReRtn2gzWZZ03qz5QCSKm_oqjs65YcH9PpAv_vw4Xpl4h25bpWLODnjmGwfl6_1c7Z-eVrVi3VmuWBDJjiydg7SVKY1rCw0oJzzQgLolGyuQbK9xjwRw1NmZphWQu25KvdYpT7yMZn-_bWIuDsG26lw2p2byX8BYZhZ5A
ContentType Conference Proceeding
DBID 6IE
6IL
CBEJK
RIE
RIL
DOI 10.1109/ISPASS.2010.5452076
DatabaseName IEEE Electronic Library (IEL) Conference Proceedings
IEEE Xplore POP ALL
IEEE Xplore All Conference Proceedings
IEEE Xplore
IEEE Proceedings Order Plans (POP All) 1998-Present
DatabaseTitleList
Database_xml – sequence: 1
  dbid: RIE
  name: IEEE Electronic Library (IEL)
  url: https://proxy.k.utb.cz/login?url=https://ieeexplore.ieee.org/
  sourceTypes: Publisher
DeliveryMethod fulltext_linktorsrc
EISBN 1424460247
9781424460243
EndPage 44
ExternalDocumentID 5452076
Genre orig-research
GroupedDBID 6IE
6IF
6IK
6IL
6IN
AAJGR
AAWTH
ADFMO
ALMA_UNASSIGNED_HOLDINGS
BEFXN
BFFAM
BGNUA
BKEBE
BPEOZ
CBEJK
IEGSK
IERZE
OCL
RIE
RIL
ID FETCH-LOGICAL-i241t-42e1f806d9dfd175c0e6825600c0768c061bce38c0d22011d1ca4ab2a7be91093
IEDL.DBID RIE
ISBN 1424460239
9781424460236
IngestDate Wed Aug 27 02:24:34 EDT 2025
IsPeerReviewed false
IsScholarly false
Language English
LinkModel DirectLink
MergedId FETCHMERGED-LOGICAL-i241t-42e1f806d9dfd175c0e6825600c0768c061bce38c0d22011d1ca4ab2a7be91093
PageCount 12
ParticipantIDs ieee_primary_5452076
PublicationCentury 2000
PublicationDate 2010-03
PublicationDateYYYYMMDD 2010-03-01
PublicationDate_xml – month: 03
  year: 2010
  text: 2010-03
PublicationDecade 2010
PublicationTitle 2010 IEEE International Symposium on Performance Analysis of Systems and Software
PublicationTitleAbbrev ISPASS
PublicationYear 2010
Publisher IEEE
Publisher_xml – name: IEEE
SSID ssj0000452603
Score 1.6004242
Snippet We generate and provide miniature synthetic benchmark clones for modern workloads to solve two pre-silicon design challenges, namely: 1) huge simulation time...
SourceID ieee
SourceType Publisher
StartPage 33
SubjectTerms Application software
Character generation
Cloning
Computational modeling
Computer aided instruction
Computer applications
Computer simulation
Microarchitecture
Parallel processing
Time sharing computer systems
Title Synthesizing memory-level parallelism aware miniature clones for SPEC CPU2006 and ImplantBench workloads
URI https://ieeexplore.ieee.org/document/5452076
hasFullText 1
inHoldings 1
isFullTextHit
isPrint
link http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwjZ1LT8JAEMc3wMmTGjC-swePLvSx3bZHJRA00TSpJNzIPgOxFAMlBj69O23BaDx42-6hnew2mdnZ__wGoTvrwrg9BRgSqNAnNGacCEZDEnp-6Fl_L5QpaZ-vbDSmz5Ng0kD3h1oYrXUpPtNdGJZ3-WopN5Aq60E_bHvubqJmGLGqVuuQTwE0OHP8fe0Wg6LNPdKpfmY1dch14t5TmjykaSXtql_7o79K6V6Gx-hlb1ilKnnvbgrRlbtfzMb_Wn6COt-FfDg5uKhT1NB5G83SbW7DvvV8Z-fwAqS2W5KBeAgDCDzLdDZfLzD_5CuNAT1Ssj-xzADrj22Qi9Nk0Mf9ZAyJAsxzhYExbLfo0X5whkHqlS25WnfQeDh4649I3XCBzK0jLwj1tGsih6lYGWXjCuloFpUxkYQLO2l9v5DatwPlQeCgXMkpFx4PhY6BS3WGWrm15RxhPzBURr42xjVUKSGop7RSPgADlcuDC9SGVZp-VEyNab1Al39PX6Gj6tYetF_XqFWsNvrGBgOFuC3_gi-VzbBM
linkProvider IEEE
linkToHtml http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwjZ1LT8JAEMc3iAc9qQHj2z14tNDHdgtHJRBQIE0KCTeyz0gsxUCJgU_vTlswGg_etntoJ7tNZnb2P79B6MG4MGZOAdryZeBZpEmZxSkJrMD1Atf4ey51Rvsc0u6YvEz8SQk97mthlFKZ-EzVYJjd5cuFWEOqrA79sM25-wAd-oQQP6_W2mdUAA5ObW9XvUWhbHMHdSqeacEdcuxmvReFT1GUi7uKF__osJI5mM4JGuxMy3Ul77V1ymti-4va-F_bT1H1u5QPh3sndYZKKqmgt2iTmMBvNduaOTwHse3GikE-hAEFHscqnq3mmH2ypcIAH8non1jEAPbHJszFUdhu4VY4hlQBZonEQBk2m_RsPviGQewVL5hcVdG40x61ulbRcsGaGVeeWsRVjm7YVDalliayELaijSwqEnBlJ4z350J5ZiBdCB2kIxhh3GUBV00gU52jcmJsuUDY8zURDU9p7WgiJefElUpKD5CB0mH-JarAKk0_cqrGtFigq7-n79FRdzToT_u94es1Os7v8EEJdoPK6XKtbk1okPK77I_4AgLSs5k
openUrl ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Abook&rft.genre=proceeding&rft.title=2010+IEEE+International+Symposium+on+Performance+Analysis+of+Systems+and+Software&rft.atitle=Synthesizing+memory-level+parallelism+aware+miniature+clones+for+SPEC+CPU2006+and+ImplantBench+workloads&rft.au=Ganesan%2C+Karthik&rft.au=Jungho+Jo&rft.au=John%2C+Lizy+K&rft.date=2010-03-01&rft.pub=IEEE&rft.isbn=9781424460236&rft.spage=33&rft.epage=44&rft_id=info:doi/10.1109%2FISPASS.2010.5452076&rft.externalDocID=5452076
thumbnail_l http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=9781424460236/lc.gif&client=summon&freeimage=true
thumbnail_m http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=9781424460236/mc.gif&client=summon&freeimage=true
thumbnail_s http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=9781424460236/sc.gif&client=summon&freeimage=true