Vyzkoušejte nový nástroj s podporou AI
Summon Research Assistant
BETA
Design of an energy-efficient accelerator for training of convolutional neural networks using frequency-domain computation
Jong Hwan Ko, Mudassar, Burhan, Na, Taesik, Mukhopadhyay, Saibal
Published in 2017 54th ACM/EDAC/IEEE Design Automation Conference (DAC) (18.06.2017)
Published in 2017 54th ACM/EDAC/IEEE Design Automation Conference (DAC) (18.06.2017)
Get full text
Conference Proceeding
Accelerating Fourier and Number Theoretic Transforms using Tensor Cores and Warp Shuffles
Durrani, Sultan, Chughtai, Muhammad Saad, Hidayetoglu, Mert, Tahir, Rashid, Dakkak, Abdul, Rauchwerger, Lawrence, Zaffar, Fareed, Hwu, Wen-mei
Published in 2021 30th International Conference on Parallel Architectures and Compilation Techniques (PACT) (01.09.2021)
Published in 2021 30th International Conference on Parallel Architectures and Compilation Techniques (PACT) (01.09.2021)
Get full text
Conference Proceeding
Bitwidth-Optimized Energy-Efficient FFT Design via Scaling Information Propagation
Liu, Xinzhe, Chen, Fupeng, Muhamad, Raees Kizhakkumkara, Blinder, David, Nikolova, Dessislava, Schelkens, Peter, Catthoor, Francky, Ha, Yajun
Published in 2021 58th ACM/IEEE Design Automation Conference (DAC) (05.12.2021)
Published in 2021 58th ACM/IEEE Design Automation Conference (DAC) (05.12.2021)
Get full text
Conference Proceeding
Möbius Convolutions for Spherical CNNs
Published in Special Interest Group on Computer Graphics and Interactive Techniques Conference Proceedings
Get full text
Conference Proceeding
A high performance split-radix FFT with constant geometry architecture
Kwong, Joyce, Goel, Manish
Published in Proceedings of the Conference on Design, Automation and Test in Europe (12.03.2012)
Published in Proceedings of the Conference on Design, Automation and Test in Europe (12.03.2012)
Get full text
Conference Proceeding
A flexible and fast software implementation of the FFT on the BPE platform
Cupaiuolo, Teo, Iacono, Daniele Lo
Published in Proceedings of the Conference on Design, Automation and Test in Europe (12.03.2012)
Published in Proceedings of the Conference on Design, Automation and Test in Europe (12.03.2012)
Get full text
Conference Proceeding
High performance model based image reconstruction
Published in Proceedings of the 21st ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming
(27.02.2016)
Get full text
Conference Proceeding
List decoding algorithms for certain concatenated codes
Published in Proceedings of the thirty-second annual ACM symposium on Theory of computing
(01.05.2000)
Get full text
Conference Proceeding
Optimization of fast Fourier transforms on the Blue Gene/L supercomputer
Sabharwal, Yogish, Garg, Saurabh K., Garg, Rahul, Gunnels, John A., Sahoo, Ramendra K.
Published in Proceedings of the 15th international conference on High performance computing (17.12.2008)
Published in Proceedings of the 15th international conference on High performance computing (17.12.2008)
Get full text
Conference Proceeding
Auto-tuning 3-D FFT library for CUDA GPUs
Nukada, Akira, Matsuoka, Satoshi
Published in Proceedings of the Conference on High Performance Computing Networking, Storage and Analysis (14.11.2009)
Published in Proceedings of the Conference on High Performance Computing Networking, Storage and Analysis (14.11.2009)
Get full text
Conference Proceeding
Interconnection Networks for Scalable Quantum Computers
Isailovic, Nemanja, Patel, Yatish, Whitney, Mark, Kubiatowicz, John
Published in 33rd International Symposium on Computer Architecture (ISCA'06) (01.05.2006)
Published in 33rd International Symposium on Computer Architecture (ISCA'06) (01.05.2006)
Get full text
Conference Proceeding
Reproducibility in Benchmarking Parallel Fast Fourier Transform based Applications
Published in Companion of the 2019 ACM/SPEC International Conference on Performance Engineering
(27.03.2019)
Get full text
Conference Proceeding