Data flow algorithms for processors with vector extensions: Handling actors with internal state

Full use of the parallel computation capabilities of present and expected CPUs and CPUs require use of vector extensions. Yet many actors in data flow systems for digital signal processing have internal state (or, equivalently, an edge that loops from the actor back to itself) that impose serial dep...

Full description

Saved in:
Bibliographic Details
Published in2014 IEEE Global Conference on Signal and Information Processing (GlobalSIP) pp. 20 - 24
Main Authors Barford, Lee, Bhattacharyya, Shuvra S., Yanzhou Liu
Format Conference Proceeding
LanguageEnglish
Published IEEE 01.12.2014
Subjects
Online AccessGet full text
DOI10.1109/GlobalSIP.2014.7032070

Cover

Abstract Full use of the parallel computation capabilities of present and expected CPUs and CPUs require use of vector extensions. Yet many actors in data flow systems for digital signal processing have internal state (or, equivalently, an edge that loops from the actor back to itself) that impose serial dependencies between actor invocations that make vectorizing across actor invocations impossible. Ideally, issues of inter-thread coordination required by serial data dependencies should be handled by code written by parallel programming experts that is separate from code specifying signal processing operations. The purpose of this paper is to present one approach for so doing in the case of actors that maintain state. We propose a methodology for using the parallel scan (also known as prefix sum) pattern to create algorithms for multiple simultaneous invocations of such an actor that results in vectorizable code. Two examples of applying this methodology are given: (1) infinite impulse response filters and (2) finite state machines. The correctness and performance of the resulting IIR filters are studied.
AbstractList Full use of the parallel computation capabilities of present and expected CPUs and CPUs require use of vector extensions. Yet many actors in data flow systems for digital signal processing have internal state (or, equivalently, an edge that loops from the actor back to itself) that impose serial dependencies between actor invocations that make vectorizing across actor invocations impossible. Ideally, issues of inter-thread coordination required by serial data dependencies should be handled by code written by parallel programming experts that is separate from code specifying signal processing operations. The purpose of this paper is to present one approach for so doing in the case of actors that maintain state. We propose a methodology for using the parallel scan (also known as prefix sum) pattern to create algorithms for multiple simultaneous invocations of such an actor that results in vectorizable code. Two examples of applying this methodology are given: (1) infinite impulse response filters and (2) finite state machines. The correctness and performance of the resulting IIR filters are studied.
Author Barford, Lee
Bhattacharyya, Shuvra S.
Yanzhou Liu
Author_xml – sequence: 1
  givenname: Lee
  surname: Barford
  fullname: Barford, Lee
  email: lee.barford@keysight.com
  organization: Keysight Lechnologies, Keysight Labs., Reno, NV, USA
– sequence: 2
  givenname: Shuvra S.
  surname: Bhattacharyya
  fullname: Bhattacharyya, Shuvra S.
  organization: Univ. of Maryland, College Park, MD, USA
– sequence: 3
  surname: Yanzhou Liu
  fullname: Yanzhou Liu
  organization: Univ. of Maryland, College Park, MD, USA
BookMark eNotT81KAzEYjKAHW30CQfICXZNNs9l4k6ptoaCgnpcv2y81kCYlCVbf3i3d0zA_DDMTchliQELuOas4Z_ph6aMB_7F-r2rG55ViomaKXZAJnyutFWtbcU26ZyhArY9HCn4Xkyvf-0xtTPSQYo85x5TpcVDpD_ZlkPG3YMguhvxIVxC23oUdhZM15lwomAJ4mgsUvCFXFnzG2xGn5Ov15XOxmm3eluvF02bmeMPKTBstsUEjzDCyrhsNRjKtZA-9wpZxw4RoatVu-YkbOzBhjVTQG6WklWJK7s69DhG7Q3J7SH_d-Fn8A9_XVDI
ContentType Conference Proceeding
DBID 6IE
6IL
CBEJK
RIE
RIL
DOI 10.1109/GlobalSIP.2014.7032070
DatabaseName IEEE Electronic Library (IEL) Conference Proceedings
IEEE Xplore POP ALL
IEEE Xplore All Conference Proceedings
IEEE/IET Electronic Library
IEEE Proceedings Order Plans (POP All) 1998-Present
DatabaseTitleList
Database_xml – sequence: 1
  dbid: RIE
  name: IEEE/IET Electronic Library
  url: https://proxy.k.utb.cz/login?url=https://ieeexplore.ieee.org/
  sourceTypes: Publisher
DeliveryMethod fulltext_linktorsrc
EISBN 1479970883
9781479970889
EndPage 24
ExternalDocumentID 7032070
Genre orig-research
GroupedDBID 6IE
6IL
CBEJK
RIE
RIL
ID FETCH-LOGICAL-i160t-9b95e6eb3b3202269ab50975cac7e801b0336278d1c7e8bf3363fb57acb775f53
IEDL.DBID RIE
IngestDate Thu Jun 29 18:37:51 EDT 2023
IsPeerReviewed false
IsScholarly false
Language English
LinkModel DirectLink
MergedId FETCHMERGED-LOGICAL-i160t-9b95e6eb3b3202269ab50975cac7e801b0336278d1c7e8bf3363fb57acb775f53
PageCount 5
ParticipantIDs ieee_primary_7032070
PublicationCentury 2000
PublicationDate 2014-Dec.
PublicationDateYYYYMMDD 2014-12-01
PublicationDate_xml – month: 12
  year: 2014
  text: 2014-Dec.
PublicationDecade 2010
PublicationTitle 2014 IEEE Global Conference on Signal and Information Processing (GlobalSIP)
PublicationTitleAbbrev GlobalSIP
PublicationYear 2014
Publisher IEEE
Publisher_xml – name: IEEE
Score 1.5631745
Snippet Full use of the parallel computation capabilities of present and expected CPUs and CPUs require use of vector extensions. Yet many actors in data flow systems...
SourceID ieee
SourceType Publisher
StartPage 20
SubjectTerms data flow computing
digital signal processing
Graphics processing units
Indexes
Kernel
parallel algorithms
Signal processing
Signal processing algorithms
vector processors
Vectors
Title Data flow algorithms for processors with vector extensions: Handling actors with internal state
URI https://ieeexplore.ieee.org/document/7032070
hasFullText 1
inHoldings 1
isFullTextHit
isPrint
link http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwjV1NSwMxEA21J08qrfhNDh7d7X4mG69qqUKloIXeSpJOtFi7pd0q-OudJGtF8eBts4TskoF9mbfz5hFyDikGeSKjINFJFmQa0x2pAZMVBmBpi6KIrXa4f896w-xulI8a5GKjhQEAV3wGob10__InpV5bqqzDrds3xwR9ixfMa7Vq0W8ciY5vkv9wO7AFW1lYT_7hmuJAo7tD-l-P87UiL-G6UqH--NWJ8b_vs0va3_I8OtgAzx5pwLxFxteyktTMyncqZ08l5vzPryuKR1K68FqAcrmilnWlb46op47-tlzZ6pL2bK8FXIt69x0_b-rJwhl1oqM2GXZvHq96Qe2eEExjFlWBUCIHhrmyshbpCRNS4eGA51pqDohLKkoRvHgxie1YGRylRuVcasV5bvJ0nzTn5RwOCAWVGRAmNjoWmUgSiUvwlBmR4pdSRukhadnNGS98g4xxvS9Hf98-Jts2QL4m5IQ0q-UaThHZK3XmQvoJ-Iynbg
linkProvider IEEE
linkToHtml http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwjV3PT8IwFG4IHvSkBoy_7cGjG_vRrtSrSoYCIRESbqQtrRKRERia-Nf7uk6MxoO3dWm6pS_Z1_ftfe9D6FLHEOSJCLxIRcQjCtIdoTQkK4nWlrZoNkOrHe72knRI7kd0VEFXGy2M1rooPtO-vSz-5U8ytbZUWYNZt28GCfoWJYRQp9YqZb9hwBuuTf5ju29LtohfTv_hm1LARmsXdb8e6KpFXvx1Ln318asX43_faA_VvwV6uL-Bnn1U0fMaGt-KXGAzy96xmD1lkPU_v64wHErxwqkBsuUKW94VvxVUPS4IcMuWra5xarstwFrY-e-4eVNHF85wITuqo2HrbnCTeqV_gjcNkyD3uORUJ5AtS2uSHiVcSDgeMKqEYhqQSQYxwBdrTkI7lgZGsZGUCSUZo4bGB6g6z-b6EGEtidHchEaFnPAoErAEixPDY_hWiiA-QjW7OeOFa5ExLvfl-O_bF2g7HXQ7406793CCdmywXIXIKarmy7U-A5zP5XkR3k_oLaq7
openUrl ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Abook&rft.genre=proceeding&rft.title=2014+IEEE+Global+Conference+on+Signal+and+Information+Processing+%28GlobalSIP%29&rft.atitle=Data+flow+algorithms+for+processors+with+vector+extensions%3A+Handling+actors+with+internal+state&rft.au=Barford%2C+Lee&rft.au=Bhattacharyya%2C+Shuvra+S.&rft.au=Yanzhou+Liu&rft.date=2014-12-01&rft.pub=IEEE&rft.spage=20&rft.epage=24&rft_id=info:doi/10.1109%2FGlobalSIP.2014.7032070&rft.externalDocID=7032070