Cache contention and application performance prediction for multi-core systems

The ongoing move to chip multiprocessors (CMPs) permits greater sharing of last-level cache by processor cores but this sharing aggravates the cache contention problem, potentially undermining performance improvements. Accurately modeling the impact of inter-process cache contention on performance a...

Full description

Saved in:
Bibliographic Details
Published in2010 IEEE International Symposium on Performance Analysis of Systems and Software pp. 76 - 86
Main Authors Chi Xu, Xi Chen, Dick, Robert P, Mao, Zhuoqing Morley
Format Conference Proceeding
LanguageEnglish
Published IEEE 01.03.2010
Subjects
Online AccessGet full text
ISBN1424460239
9781424460236
DOI10.1109/ISPASS.2010.5452065

Cover

Abstract The ongoing move to chip multiprocessors (CMPs) permits greater sharing of last-level cache by processor cores but this sharing aggravates the cache contention problem, potentially undermining performance improvements. Accurately modeling the impact of inter-process cache contention on performance and power consumption is required for optimized process assignment. However, techniques based on exhaustive consideration of process-to-processor mappings and cycle-accurate simulation are inefficient or intractable for CMPs, which often permit a large number of potential assignments. This paper proposes CAMP, a fast and accurate shared cache aware performance model for multi-core processors. CAMP estimates the performance degradation due to cache contention of processes running on CMPs. It uses reuse distance histograms, cache access frequencies, and the relationship between the throughput and cache miss rate of each process to predict its effective cache size when running concurrently and sharing cache with other processes, allowing instruction throughput estimation.We also provide an automated way to obtain process-dependent characteristics, such as reuse distance histograms, without offline simulation, operating system (OS) modification, or additional hardware. We tested the accuracy of CAMP using 55 different combinations of 10 SPEC CPU2000 benchmarks on a dual-core CMP machine. The average throughput prediction error was 1.57%.
AbstractList The ongoing move to chip multiprocessors (CMPs) permits greater sharing of last-level cache by processor cores but this sharing aggravates the cache contention problem, potentially undermining performance improvements. Accurately modeling the impact of inter-process cache contention on performance and power consumption is required for optimized process assignment. However, techniques based on exhaustive consideration of process-to-processor mappings and cycle-accurate simulation are inefficient or intractable for CMPs, which often permit a large number of potential assignments. This paper proposes CAMP, a fast and accurate shared cache aware performance model for multi-core processors. CAMP estimates the performance degradation due to cache contention of processes running on CMPs. It uses reuse distance histograms, cache access frequencies, and the relationship between the throughput and cache miss rate of each process to predict its effective cache size when running concurrently and sharing cache with other processes, allowing instruction throughput estimation.We also provide an automated way to obtain process-dependent characteristics, such as reuse distance histograms, without offline simulation, operating system (OS) modification, or additional hardware. We tested the accuracy of CAMP using 55 different combinations of 10 SPEC CPU2000 benchmarks on a dual-core CMP machine. The average throughput prediction error was 1.57%.
Author Dick, Robert P
Xi Chen
Mao, Zhuoqing Morley
Chi Xu
Author_xml – sequence: 1
  surname: Chi Xu
  fullname: Chi Xu
  email: xuchi@umn.edu
  organization: ECE Dept., Univeristy of Minnesota, Minneapolis, MN, USA
– sequence: 2
  surname: Xi Chen
  fullname: Xi Chen
  email: chexi@umich.edu
  organization: EECS Dept., Univ. of Michigan, Ann Arbor, MI, USA
– sequence: 3
  givenname: Robert P
  surname: Dick
  fullname: Dick, Robert P
  email: dickrp@eecs.umich.edu
  organization: EECS Dept., Univ. of Michigan, Ann Arbor, MI, USA
– sequence: 4
  givenname: Zhuoqing Morley
  surname: Mao
  fullname: Mao, Zhuoqing Morley
  email: zmao@eecs.umich.edu
  organization: EECS Dept., Univ. of Michigan, Ann Arbor, MI, USA
BookMark eNpFj89Kw0AYxFdU0NY-QS_7Aqn7N8keS1BbKCpEz2Xz7RdcSTZhNx769oZacC7D_BgGZkFuwhCQkDVnG86ZedzX79u63gg2A620YLm-IguuhFI5E6q4_g_S3JFVSt9s1tzMmbwnr5WFL6QwhAnD5IdAbXDUjmPnwZ7ziLEdYm8DIB0jOg9nPDPa_3STz2CISNMpTdinB3Lb2i7h6uJL8vn89FHtssPby77aHjLPCz1lhjcFQ3QOWleWGgCks41Voi3AWpcLA44xUFo3SorcCWAcFRqtS8lNI-SSrP92PSIex-h7G0_Hy3_5C2y4Utg
ContentType Conference Proceeding
DBID 6IE
6IL
CBEJK
RIE
RIL
DOI 10.1109/ISPASS.2010.5452065
DatabaseName IEEE Electronic Library (IEL) Conference Proceedings
IEEE Xplore POP ALL
IEEE Xplore All Conference Proceedings
IEEE Xplore Digital Library
IEEE Proceedings Order Plans (POP All) 1998-Present
DatabaseTitleList
Database_xml – sequence: 1
  dbid: RIE
  name: IEEE Electronic Library (IEL)
  url: https://proxy.k.utb.cz/login?url=https://ieeexplore.ieee.org/
  sourceTypes: Publisher
DeliveryMethod fulltext_linktorsrc
EISBN 1424460247
9781424460243
EndPage 86
ExternalDocumentID 5452065
Genre orig-research
GroupedDBID 6IE
6IF
6IK
6IL
6IN
AAJGR
AAWTH
ADFMO
ALMA_UNASSIGNED_HOLDINGS
BEFXN
BFFAM
BGNUA
BKEBE
BPEOZ
CBEJK
IEGSK
IERZE
OCL
RIE
RIL
ID FETCH-LOGICAL-i175t-91b70eeddcfd885ccc3daba42f7caad629cd00c455b4326d2c01e4e9558319b23
IEDL.DBID RIE
ISBN 1424460239
9781424460236
IngestDate Wed Aug 27 02:24:34 EDT 2025
IsPeerReviewed false
IsScholarly false
Language English
LinkModel DirectLink
MergedId FETCHMERGED-LOGICAL-i175t-91b70eeddcfd885ccc3daba42f7caad629cd00c455b4326d2c01e4e9558319b23
PageCount 11
ParticipantIDs ieee_primary_5452065
PublicationCentury 2000
PublicationDate 2010-March
PublicationDateYYYYMMDD 2010-03-01
PublicationDate_xml – month: 03
  year: 2010
  text: 2010-March
PublicationDecade 2010
PublicationTitle 2010 IEEE International Symposium on Performance Analysis of Systems and Software
PublicationTitleAbbrev ISPASS
PublicationYear 2010
Publisher IEEE
Publisher_xml – name: IEEE
SSID ssj0000452603
Score 1.6960907
Snippet The ongoing move to chip multiprocessors (CMPs) permits greater sharing of last-level cache by processor cores but this sharing aggravates the cache contention...
SourceID ieee
SourceType Publisher
StartPage 76
SubjectTerms Degradation
Energy consumption
Frequency estimation
Hardware
Histograms
Multicore processing
Operating systems
Power system modeling
Predictive models
Throughput
Title Cache contention and application performance prediction for multi-core systems
URI https://ieeexplore.ieee.org/document/5452065
hasFullText 1
inHoldings 1
isFullTextHit
isPrint
link http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwjV1NS8NAEB1qT55UWvGbPXg07Sa72SRHKZYqtBRqobeyXwER0lLTi7_e2d20VfHgLRlCsuySzMvOe28A7hGvyZRREwmephEXSRnlAoGctFlhbSlj5u2LxxMxmvOXRbpowcNeC2Ot9eQz23OHvpZvVnrrtsr6rh82pswjOMpyEbRa-_0UZw0uKNtpt4QTbe4snZpz0bgOxbToP8-mj7NZoHY1t_3RX8Wnl-EJjHcDC6yS9962Vj39-cuz8b8jP4XuQchHpvsUdQYtW3VgMnAmzsSR1APXkcjKkG-VbLI-qAnIeuNKOT6MMeIJiJHzviTBBPqjC_Ph0-tgFDVtFaI3xAo1ft5URvG5Rpcmz1OtNTNSSZ6UmZbSiKTQhlKNa6c4gjuTaBpbbos0zfF9VQk7h3a1quwFkFIiwGMiLzleL2VcIJqzXDOlTYb_RsUldNxcLNfBOWPZTMPV3-FrOA61ecfwuoF2vdnaW0z5tbrza_0Fy1aocA
linkProvider IEEE
linkToHtml http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwjV1NT8JAEJ0gHvSkBozf7sGjhX7sbtujIRJQICRAwo3sVxNjUgiWi7_e2W4BNR68tZOm3eymnded994APCBeEyzytccpYx7lYeYlHIGcMHFqTCaCqLQvHo54b0Zf5mxeg8edFsYYU5LPTMselrV8vVQbu1XWtv2wMWUewCGjlDKn1trtqFhzcO5HW_UWt7LNralTdc4r36HAT9v9yfhpMnHkrurGPzqslAmmewLD7dAcr-S9tSlkS33-cm3879hPobmX8pHxLkmdQc3kDRh1rI0zsTR1x3YkItfkWy2brPZ6ArJa22JOGcYYKSmInnW_JM4G-qMJs-7ztNPzqsYK3huihQI_cDL28blaZTpJmFIq0kIKGmaxEkLzMFXa9xWunqQI73So_MBQkzKW4Bsrw-gc6vkyNxdAMoEQL-JJRvF6IYIU8ZyhKpJKx_h3lF5Cw87FYuW8MxbVNFz9Hb6Ho950OFgM-qPXazh2lXrL97qBerHemFsEAIW8K9f9C9BCq70
openUrl ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Abook&rft.genre=proceeding&rft.title=2010+IEEE+International+Symposium+on+Performance+Analysis+of+Systems+and+Software&rft.atitle=Cache+contention+and+application+performance+prediction+for+multi-core+systems&rft.au=Chi+Xu&rft.au=Xi+Chen&rft.au=Dick%2C+Robert+P&rft.au=Mao%2C+Zhuoqing+Morley&rft.date=2010-03-01&rft.pub=IEEE&rft.isbn=9781424460236&rft.spage=76&rft.epage=86&rft_id=info:doi/10.1109%2FISPASS.2010.5452065&rft.externalDocID=5452065
thumbnail_l http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=9781424460236/lc.gif&client=summon&freeimage=true
thumbnail_m http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=9781424460236/mc.gif&client=summon&freeimage=true
thumbnail_s http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=9781424460236/sc.gif&client=summon&freeimage=true