Data flow algorithms for processors with vector extensions: Handling actors with internal state
Full use of the parallel computation capabilities of present and expected CPUs and GPUs requires use of vector extensions. Yet many actors in data flow systems for digital signal processing have internal state (or, equivalently, an edge that loops from the actor back to itself) that imposes serial dep...
| Published in | 2014 IEEE Global Conference on Signal and Information Processing (GlobalSIP) pp. 20 - 24 |
|---|---|
| Main Authors | Barford, Lee; Bhattacharyya, Shuvra S.; Yanzhou Liu |
| Format | Conference Proceeding |
| Language | English |
| Published | IEEE, 01.12.2014 |
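| Subjects | data flow computing; digital signal processing; Graphics processing units; Indexes; Kernel; parallel algorithms; Signal processing; Signal processing algorithms; vector processors; Vectors |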
| DOI | 10.1109/GlobalSIP.2014.7032070 |
| Abstract | Full use of the parallel computation capabilities of present and expected CPUs and GPUs requires use of vector extensions. Yet many actors in data flow systems for digital signal processing have internal state (or, equivalently, an edge that loops from the actor back to itself) that imposes serial dependencies between actor invocations, making vectorization across actor invocations impossible. Ideally, issues of inter-thread coordination required by serial data dependencies should be handled by code written by parallel programming experts that is separate from code specifying signal processing operations. The purpose of this paper is to present one approach for so doing in the case of actors that maintain state. We propose a methodology for using the parallel scan (also known as prefix sum) pattern to create algorithms for multiple simultaneous invocations of such an actor that results in vectorizable code. Two examples of applying this methodology are given: (1) infinite impulse response (IIR) filters and (2) finite state machines. The correctness and performance of the resulting IIR filters are studied. |
|---|---|
| Author | Barford, Lee; Bhattacharyya, Shuvra S.; Yanzhou Liu |
| Author_xml | – sequence: 1 givenname: Lee surname: Barford fullname: Barford, Lee email: lee.barford@keysight.com organization: Keysight Technologies, Keysight Labs., Reno, NV, USA – sequence: 2 givenname: Shuvra S. surname: Bhattacharyya fullname: Bhattacharyya, Shuvra S. organization: Univ. of Maryland, College Park, MD, USA – sequence: 3 surname: Yanzhou Liu fullname: Yanzhou Liu organization: Univ. of Maryland, College Park, MD, USA |
| ContentType | Conference Proceeding |
| EISBN | 1479970883 9781479970889 |
| EndPage | 24 |
| ExternalDocumentID | 7032070 |
| Genre | orig-research |
| IsPeerReviewed | false |
| IsScholarly | false |
| Language | English |
| PageCount | 5 |
| ParticipantIDs | ieee_primary_7032070 |
| PublicationCentury | 2000 |
| PublicationDate | 2014-Dec. |
| PublicationDateYYYYMMDD | 2014-12-01 |
| PublicationDate_xml | – month: 12 year: 2014 text: 2014-Dec. |
| PublicationDecade | 2010 |
| PublicationTitle | 2014 IEEE Global Conference on Signal and Information Processing (GlobalSIP) |
| PublicationTitleAbbrev | GlobalSIP |
| PublicationYear | 2014 |
| Publisher | IEEE |
| Publisher_xml | – name: IEEE |
| SourceID | ieee |
| SourceType | Publisher |
| StartPage | 20 |
| SubjectTerms | data flow computing; digital signal processing; Graphics processing units; Indexes; Kernel; parallel algorithms; Signal processing; Signal processing algorithms; vector processors; Vectors |
| Title | Data flow algorithms for processors with vector extensions: Handling actors with internal state |
| URI | https://ieeexplore.ieee.org/document/7032070 |
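
The abstract describes recasting stateful actors as parallel scans. The sketch below is not taken from the paper; it is a generic textbook illustration of that idea, and every name in it (`inclusive_scan`, `compose_affine`, `iir_scan`, `fsm_scan`) is hypothetical. It assumes a first-order IIR filter of the form y[n] = a*y[n-1] + b*x[n] and a finite state machine given as a table of per-symbol transition tuples. In both cases each input is turned into a function on the state, and the associative operation being scanned is function composition.

```python
def inclusive_scan(items, combine):
    """Inclusive scan with an associative (not necessarily commutative) operator.

    Uses the Hillis-Steele doubling scheme: within one round, every element
    is computed only from the previous round's values, so a round could be
    evaluated in parallel or as a vector operation.
    """
    out = list(items)
    step = 1
    while step < len(out):
        out = [out[i] if i < step else combine(out[i - step], out[i])
               for i in range(len(out))]
        step *= 2
    return out


def compose_affine(first, second):
    """Compose two affine maps y -> a*y + c, applying `first` and then `second`."""
    a1, c1 = first
    a2, c2 = second
    return (a2 * a1, a2 * c1 + c2)


def iir_scan(x, a, b=1.0, y_init=0.0):
    """First-order IIR filter y[n] = a*y[n-1] + b*x[n], computed via a scan."""
    maps = [(a, b * xn) for xn in x]            # one affine map per sample
    prefixes = inclusive_scan(maps, compose_affine)
    return [A * y_init + C for A, C in prefixes]


def iir_serial(x, a, b=1.0, y_init=0.0):
    """Reference implementation with the usual serial dependency on y."""
    y, out = y_init, []
    for xn in x:
        y = a * y + b * xn
        out.append(y)
    return out


def compose_transitions(first, second):
    """Compose two state-transition tuples, applying `first` and then `second`."""
    return tuple(second[s] for s in first)


def fsm_scan(symbols, table, start):
    """State reached after each input symbol, computed via a scan."""
    maps = [table[sym] for sym in symbols]      # one transition map per symbol
    return [m[start] for m in inclusive_scan(maps, compose_transitions)]
```

For example, `iir_scan([1.0, 2.0, 3.0], a=0.5)` should agree with `iir_serial` on the same input up to floating-point rounding, since the scan reassociates the arithmetic. The doubling scan performs more total work than the serial loop (on the order of n log n combines), which is the usual trade made to remove the serial dependency between invocations.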