Data flow algorithms for processors with vector extensions: Handling actors with internal state

Full use of the parallel computation capabilities of present and expected CPUs and CPUs require use of vector extensions. Yet many actors in data flow systems for digital signal processing have internal state (or, equivalently, an edge that loops from the actor back to itself) that impose serial dep...

Full description

Saved in:

Bibliographic Details
Published in	2014 IEEE Global Conference on Signal and Information Processing (GlobalSIP) pp. 20 - 24
Main Authors	Barford, Lee, Bhattacharyya, Shuvra S., Yanzhou Liu
Format	Conference Proceeding
Language	English
Published	IEEE 01.12.2014
Subjects	data flow computing digital signal processing Graphics processing units Indexes Kernel parallel algorithms Signal processing Signal processing algorithms vector processors Vectors
Online Access	Get full text
DOI	10.1109/GlobalSIP.2014.7032070

Cover

Abstract	Full use of the parallel computation capabilities of present and expected CPUs and CPUs require use of vector extensions. Yet many actors in data flow systems for digital signal processing have internal state (or, equivalently, an edge that loops from the actor back to itself) that impose serial dependencies between actor invocations that make vectorizing across actor invocations impossible. Ideally, issues of inter-thread coordination required by serial data dependencies should be handled by code written by parallel programming experts that is separate from code specifying signal processing operations. The purpose of this paper is to present one approach for so doing in the case of actors that maintain state. We propose a methodology for using the parallel scan (also known as prefix sum) pattern to create algorithms for multiple simultaneous invocations of such an actor that results in vectorizable code. Two examples of applying this methodology are given: (1) infinite impulse response filters and (2) finite state machines. The correctness and performance of the resulting IIR filters are studied.
AbstractList	Full use of the parallel computation capabilities of present and expected CPUs and CPUs require use of vector extensions. Yet many actors in data flow systems for digital signal processing have internal state (or, equivalently, an edge that loops from the actor back to itself) that impose serial dependencies between actor invocations that make vectorizing across actor invocations impossible. Ideally, issues of inter-thread coordination required by serial data dependencies should be handled by code written by parallel programming experts that is separate from code specifying signal processing operations. The purpose of this paper is to present one approach for so doing in the case of actors that maintain state. We propose a methodology for using the parallel scan (also known as prefix sum) pattern to create algorithms for multiple simultaneous invocations of such an actor that results in vectorizable code. Two examples of applying this methodology are given: (1) infinite impulse response filters and (2) finite state machines. The correctness and performance of the resulting IIR filters are studied.
Author	Barford, Lee Bhattacharyya, Shuvra S. Yanzhou Liu
Author_xml	– sequence: 1 givenname: Lee surname: Barford fullname: Barford, Lee email: lee.barford@keysight.com organization: Keysight Lechnologies, Keysight Labs., Reno, NV, USA – sequence: 2 givenname: Shuvra S. surname: Bhattacharyya fullname: Bhattacharyya, Shuvra S. organization: Univ. of Maryland, College Park, MD, USA – sequence: 3 surname: Yanzhou Liu fullname: Yanzhou Liu organization: Univ. of Maryland, College Park, MD, USA
BookMark	eNotT81KAzEYjKAHW30CQfICXZNNs9l4k6ptoaCgnpcv2y81kCYlCVbf3i3d0zA_DDMTchliQELuOas4Z_ph6aMB_7F-r2rG55ViomaKXZAJnyutFWtbcU26ZyhArY9HCn4Xkyvf-0xtTPSQYo85x5TpcVDpD_ZlkPG3YMguhvxIVxC23oUdhZM15lwomAJ4mgsUvCFXFnzG2xGn5Ov15XOxmm3eluvF02bmeMPKTBstsUEjzDCyrhsNRjKtZA-9wpZxw4RoatVu-YkbOzBhjVTQG6WklWJK7s69DhG7Q3J7SH_d-Fn8A9_XVDI
ContentType	Conference Proceeding
DBID	6IE 6IL CBEJK RIE RIL
DOI	10.1109/GlobalSIP.2014.7032070
DatabaseName	IEEE Electronic Library (IEL) Conference Proceedings IEEE Xplore POP ALL IEEE Xplore All Conference Proceedings IEEE/IET Electronic Library IEEE Proceedings Order Plans (POP All) 1998-Present
DatabaseTitleList
Database_xml	– sequence: 1 dbid: RIE name: IEEE/IET Electronic Library url: https://proxy.k.utb.cz/login?url=https://ieeexplore.ieee.org/ sourceTypes: Publisher
DeliveryMethod	fulltext_linktorsrc
EISBN	1479970883 9781479970889
EndPage	24
ExternalDocumentID	7032070
Genre	orig-research
GroupedDBID	6IE 6IL CBEJK RIE RIL
ID	FETCH-LOGICAL-i160t-9b95e6eb3b3202269ab50975cac7e801b0336278d1c7e8bf3363fb57acb775f53
IEDL.DBID	RIE
IngestDate	Thu Jun 29 18:37:51 EDT 2023
IsPeerReviewed	false
IsScholarly	false
Language	English
LinkModel	DirectLink
MergedId	FETCHMERGED-LOGICAL-i160t-9b95e6eb3b3202269ab50975cac7e801b0336278d1c7e8bf3363fb57acb775f53
PageCount	5
ParticipantIDs	ieee_primary_7032070
PublicationCentury	2000
PublicationDate	2014-Dec.
PublicationDateYYYYMMDD	2014-12-01
PublicationDate_xml	– month: 12 year: 2014 text: 2014-Dec.
PublicationDecade	2010
PublicationTitle	2014 IEEE Global Conference on Signal and Information Processing (GlobalSIP)
PublicationTitleAbbrev	GlobalSIP
PublicationYear	2014
Publisher	IEEE
Publisher_xml	– name: IEEE
Score	1.5631745
Snippet	Full use of the parallel computation capabilities of present and expected CPUs and CPUs require use of vector extensions. Yet many actors in data flow systems...
SourceID	ieee
SourceType	Publisher
StartPage	20
SubjectTerms	data flow computing digital signal processing Graphics processing units Indexes Kernel parallel algorithms Signal processing Signal processing algorithms vector processors Vectors
Title	Data flow algorithms for processors with vector extensions: Handling actors with internal state
URI	https://ieeexplore.ieee.org/document/7032070
hasFullText	1
inHoldings	1
isFullTextHit
isPrint
link	http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwjV1NSwMxEA21J08qrfhNDh7d7X4mG69qqUKloIXeSpJOtFi7pd0q-OudJGtF8eBts4TskoF9mbfz5hFyDikGeSKjINFJFmQa0x2pAZMVBmBpi6KIrXa4f896w-xulI8a5GKjhQEAV3wGob10__InpV5bqqzDrds3xwR9ixfMa7Vq0W8ciY5vkv9wO7AFW1lYT_7hmuJAo7tD-l-P87UiL-G6UqH--NWJ8b_vs0va3_I8OtgAzx5pwLxFxteyktTMyncqZ08l5vzPryuKR1K68FqAcrmilnWlb46op47-tlzZ6pL2bK8FXIt69x0_b-rJwhl1oqM2GXZvHq96Qe2eEExjFlWBUCIHhrmyshbpCRNS4eGA51pqDohLKkoRvHgxie1YGRylRuVcasV5bvJ0nzTn5RwOCAWVGRAmNjoWmUgSiUvwlBmR4pdSRukhadnNGS98g4xxvS9Hf98-Jts2QL4m5IQ0q-UaThHZK3XmQvoJ-Iynbg
linkProvider	IEEE
linkToHtml	http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwjV3PT8IwFG4IHvSkBoy_7cGjG_vRrtSrSoYCIRESbqQtrRKRERia-Nf7uk6MxoO3dWm6pS_Z1_ftfe9D6FLHEOSJCLxIRcQjCtIdoTQkK4nWlrZoNkOrHe72knRI7kd0VEFXGy2M1rooPtO-vSz-5U8ytbZUWYNZt28GCfoWJYRQp9YqZb9hwBuuTf5ju29LtohfTv_hm1LARmsXdb8e6KpFXvx1Ln318asX43_faA_VvwV6uL-Bnn1U0fMaGt-KXGAzy96xmD1lkPU_v64wHErxwqkBsuUKW94VvxVUPS4IcMuWra5xarstwFrY-e-4eVNHF85wITuqo2HrbnCTeqV_gjcNkyD3uORUJ5AtS2uSHiVcSDgeMKqEYhqQSQYxwBdrTkI7lgZGsZGUCSUZo4bGB6g6z-b6EGEtidHchEaFnPAoErAEixPDY_hWiiA-QjW7OeOFa5ExLvfl-O_bF2g7HXQ7406793CCdmywXIXIKarmy7U-A5zP5XkR3k_oLaq7
openUrl	ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Abook&rft.genre=proceeding&rft.title=2014+IEEE+Global+Conference+on+Signal+and+Information+Processing+%28GlobalSIP%29&rft.atitle=Data+flow+algorithms+for+processors+with+vector+extensions%3A+Handling+actors+with+internal+state&rft.au=Barford%2C+Lee&rft.au=Bhattacharyya%2C+Shuvra+S.&rft.au=Yanzhou+Liu&rft.date=2014-12-01&rft.pub=IEEE&rft.spage=20&rft.epage=24&rft_id=info:doi/10.1109%2FGlobalSIP.2014.7032070&rft.externalDocID=7032070