Visualizing complex dynamics in many-core accelerator architectures

While many-core accelerator architectures, such as today's Graphics Processing Units (GPUs), offer orders of magnitude more raw computing power than contemporary CPUs, their massive parallelism often produces complex dynamic behaviors even with the simplest applications. Using a fixed set of ha...

Full description

Saved in:
Bibliographic Details
Published in2010 IEEE International Symposium on Performance Analysis of Systems and Software pp. 164 - 174
Main Authors Ariel, Aaron, Fung, Wilson W L, Turner, Andrew E, Aamodt, Tor M
Format Conference Proceeding
LanguageEnglish
Published IEEE 01.03.2010
Subjects
Online AccessGet full text
ISBN1424460239
9781424460236
DOI10.1109/ISPASS.2010.5452029

Cover

Abstract While many-core accelerator architectures, such as today's Graphics Processing Units (GPUs), offer orders of magnitude more raw computing power than contemporary CPUs, their massive parallelism often produces complex dynamic behaviors even with the simplest applications. Using a fixed set of hardware or simulator performance counters to quantify behavior over a large interval of time such as an entire application execution run or program phase may not capture this behavior. Software and/or hardware designers may consequently miss out on opportunities to optimize for better performance. Similarly, significant effort may be expended to find metrics that explain anomalous behavior in architecture design studies. Moreover, the increasing complexity of applications developed for today's GPU has created additional difficulties for software developers when attempting to identify bottlenecks of an application for optimization. This paper presents a novel GPU performance visualization tool, AerialVision, to address these two problems. It interfaces with the GPGPU-Sim simulator to capture and visualize the dynamic behavior of a GPU architecture throughout an application run. Similar to existing performance analysis tools for CPUs, it can annotate individual lines of source code with performance statistics to simplify the bottleneck identification process. To provide further insight, AerialVision introduces a novel methodology to relate pathological dynamic architectural behaviors resulting in performance loss with the part of the source code that is responsible. By rapidly providing insight into complex dynamic behavior, AerialVision enables research on improving many-core accelerator architectures and will help ensure applications written for these architectures reach their full performance potential.
AbstractList While many-core accelerator architectures, such as today's Graphics Processing Units (GPUs), offer orders of magnitude more raw computing power than contemporary CPUs, their massive parallelism often produces complex dynamic behaviors even with the simplest applications. Using a fixed set of hardware or simulator performance counters to quantify behavior over a large interval of time such as an entire application execution run or program phase may not capture this behavior. Software and/or hardware designers may consequently miss out on opportunities to optimize for better performance. Similarly, significant effort may be expended to find metrics that explain anomalous behavior in architecture design studies. Moreover, the increasing complexity of applications developed for today's GPU has created additional difficulties for software developers when attempting to identify bottlenecks of an application for optimization. This paper presents a novel GPU performance visualization tool, AerialVision, to address these two problems. It interfaces with the GPGPU-Sim simulator to capture and visualize the dynamic behavior of a GPU architecture throughout an application run. Similar to existing performance analysis tools for CPUs, it can annotate individual lines of source code with performance statistics to simplify the bottleneck identification process. To provide further insight, AerialVision introduces a novel methodology to relate pathological dynamic architectural behaviors resulting in performance loss with the part of the source code that is responsible. By rapidly providing insight into complex dynamic behavior, AerialVision enables research on improving many-core accelerator architectures and will help ensure applications written for these architectures reach their full performance potential.
Author Fung, Wilson W L
Ariel, Aaron
Turner, Andrew E
Aamodt, Tor M
Author_xml – sequence: 1
  givenname: Aaron
  surname: Ariel
  fullname: Ariel, Aaron
  email: aaronariel@hotmail.com
  organization: Univ. of British Columbia, Vancouver, BC, Canada
– sequence: 2
  givenname: Wilson W L
  surname: Fung
  fullname: Fung, Wilson W L
  email: wwlfung@ece.ubc.ca
  organization: Univ. of British Columbia, Vancouver, BC, Canada
– sequence: 3
  givenname: Andrew E
  surname: Turner
  fullname: Turner, Andrew E
  email: aturner@ece.ubc.ca
  organization: Univ. of British Columbia, Vancouver, BC, Canada
– sequence: 4
  givenname: Tor M
  surname: Aamodt
  fullname: Aamodt, Tor M
  email: aamodtaamodt@ece.ubc.ca
  organization: Univ. of British Columbia, Vancouver, BC, Canada
BookMark eNpFj19LwzAUxSMq6OY-wV7yBTpvkiY1j6P4ZzBQqPo6btNbjbTpSDqwfnoHDjwvh_PjcODM2EUYAjG2FLASAuztpnpZV9VKwhHoXEuQ9ozNRC7z3IDMi_P_oOwVW6T0BUcdmwbUNSvffTpg5398-OBu6PcdffNmCth7l7gPvMcwZW6IxNE56ijiOESO0X36kdx4iJRu2GWLXaLFyefs7eH-tXzKts-Pm3K9zbzMxZgVyiLIQtattLqWDeoGdG1BIaABR5a0slQUpq2dIkMknBBOyca6mlpzp-Zs-bfriWi3j77HOO1Op9UvMaBOdQ
ContentType Conference Proceeding
DBID 6IE
6IL
CBEJK
RIE
RIL
DOI 10.1109/ISPASS.2010.5452029
DatabaseName IEEE Electronic Library (IEL) Conference Proceedings
Accès INSA - IEEE Xplore POP ALL
IEEE Xplore All Conference Proceedings
IEEE Electronic Library (IEL)
IEEE Proceedings Order Plans (POP All) 1998-Present
DatabaseTitleList
Database_xml – sequence: 1
  dbid: RIE
  name: IEEE Electronic Library (IEL)
  url: https://proxy.k.utb.cz/login?url=https://ieeexplore.ieee.org/
  sourceTypes: Publisher
DeliveryMethod fulltext_linktorsrc
EISBN 1424460247
9781424460243
EndPage 174
ExternalDocumentID 5452029
Genre orig-research
GroupedDBID 6IE
6IF
6IK
6IL
6IN
AAJGR
AAWTH
ADFMO
ALMA_UNASSIGNED_HOLDINGS
BEFXN
BFFAM
BGNUA
BKEBE
BPEOZ
CBEJK
IEGSK
IERZE
OCL
RIE
RIL
ID FETCH-LOGICAL-i241t-739a0272bf295b2da5d05b903a0a60ce9e539e776fbc3e6ee1c11c32d9cbef683
IEDL.DBID RIE
ISBN 1424460239
9781424460236
IngestDate Wed Aug 27 02:24:34 EDT 2025
IsPeerReviewed false
IsScholarly false
Language English
LinkModel DirectLink
MergedId FETCHMERGED-LOGICAL-i241t-739a0272bf295b2da5d05b903a0a60ce9e539e776fbc3e6ee1c11c32d9cbef683
PageCount 11
ParticipantIDs ieee_primary_5452029
PublicationCentury 2000
PublicationDate 2010-03
PublicationDateYYYYMMDD 2010-03-01
PublicationDate_xml – month: 03
  year: 2010
  text: 2010-03
PublicationDecade 2010
PublicationTitle 2010 IEEE International Symposium on Performance Analysis of Systems and Software
PublicationTitleAbbrev ISPASS
PublicationYear 2010
Publisher IEEE
Publisher_xml – name: IEEE
SSID ssj0000452603
Score 1.5229745
Snippet While many-core accelerator architectures, such as today's Graphics Processing Units (GPUs), offer orders of magnitude more raw computing power than...
SourceID ieee
SourceType Publisher
StartPage 164
SubjectTerms Accelerator architectures
Application software
Computational modeling
Computer architecture
Concurrent computing
Counting circuits
Graphics
Hardware
Parallel processing
Visualization
Title Visualizing complex dynamics in many-core accelerator architectures
URI https://ieeexplore.ieee.org/document/5452029
hasFullText 1
inHoldings 1
isFullTextHit
isPrint
link http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwjV3Pa8MgGJWup5220Y79xsOOszWxajyOstINOgpdR28l6ieUsXSsKYz-9VOTdj_YYYeA8SCaiE_93nsfQteZjM5SjkDGBen5SUM0s5I4kIK7BLRVQSg8ehTDae9hxmcNdLPTwgBAJJ9BJxRjLN8uzTpclXVDPmyaqj20JzNRabV29ynBGlxQttVuiSDa3Fo61e-idh1KqOreT8a3k0lF7aqb_ZFfJcLL4ACNth2rWCUvnXWpO2bzy7Pxvz0_RO0vIR8e7yDqCDWgaKH-82IVpJQbX4UjpRw-sK0y06_wosCvfoEgwd4S58Z4WIqRePw95LBqo-ng7qk_JHUuBbLwGF0SyVTuT6CpdqniOrU5t5RrRVlOc0ENKOBMgZTCacNAACQmSQxLrTIanMjYMWoWywJOEHY55Kl_FLd-8-KSzB8iMwvBqF5SK-0paoUPMH-r7DLm9djP_q4-R_tVQD7Qui5Qs3xfw6XH-VJfxR_8CW3bpOc
linkProvider IEEE
linkToHtml http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwjV3LTgIxFG0QF7pSA8a3Xbi00Hm0nS4NkaACIQEMOzJtbxNiHIxAYvh625kBH3HhYpJOF0070_S0veeci9BNInJnKUsgYZzEbtIQFRlBLAjObADKSC8U7vV5Zxw_Ttikgm63WhgAyMln0PDFPJZv5nrlr8qaPh82DeUO2mVxHLNCrbW9UfHm4JxGG_UW97LNjalT-c5L36GAyubDcHA3HBbkrrLhHxlWcoBpH6DepmsFr-SlsVqqhl7_cm38b98PUf1LyocHW5A6QhXIaqj1PFt4MeXaVeGcVA4f2BS56Rd4luFXt0QQb3CJU60dMOWxePw96LCoo3H7ftTqkDKbApk5lF4SEcnUnUFDZUPJVGhSZihTkkYpTTnVIIFFEoTgVukIOECgg0BHoZFageVJdIyq2TyDE4RtCmnoHsmM277YIHHHyMSAt6oX1Ahzimr-A0zfCsOMaTn2s7-rr9FeZ9TrTrsP_adztF-E5z3J6wJVl-8ruHSov1RX-c_-BEToqDQ
openUrl ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Abook&rft.genre=proceeding&rft.title=2010+IEEE+International+Symposium+on+Performance+Analysis+of+Systems+and+Software&rft.atitle=Visualizing+complex+dynamics+in+many-core+accelerator+architectures&rft.au=Ariel%2C+Aaron&rft.au=Fung%2C+Wilson+W+L&rft.au=Turner%2C+Andrew+E&rft.au=Aamodt%2C+Tor+M&rft.date=2010-03-01&rft.pub=IEEE&rft.isbn=9781424460236&rft.spage=164&rft.epage=174&rft_id=info:doi/10.1109%2FISPASS.2010.5452029&rft.externalDocID=5452029
thumbnail_l http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=9781424460236/lc.gif&client=summon&freeimage=true
thumbnail_m http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=9781424460236/mc.gif&client=summon&freeimage=true
thumbnail_s http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=9781424460236/sc.gif&client=summon&freeimage=true