SIMD algorithms for matrix multiplication on the hypercube
Presents a new algorithm for n/spl times/n matrix multiplication on a hypercube of p processors, which outperforms, in terms of time complexity, the best algorithms known in the literature, due to Dekel, Nassimi and Sahni (1981). These authors presented algorithms of O/spl lsqb/n/sup /spl lambda/p/s...
        Saved in:
      
    
          | Published in | Parallel Processing Symposium, 8th International (IPPS '94 pp. 492 - 496 | 
|---|---|
| Main Authors | , | 
| Format | Conference Proceeding | 
| Language | English | 
| Published | 
            IEEE Comput. Soc. Press
    
        1994
     | 
| Subjects | |
| Online Access | Get full text | 
| ISBN | 0818656026 9780818656026  | 
| DOI | 10.1109/IPPS.1994.288258 | 
Cover
| Summary: | Presents a new algorithm for n/spl times/n matrix multiplication on a hypercube of p processors, which outperforms, in terms of time complexity, the best algorithms known in the literature, due to Dekel, Nassimi and Sahni (1981). These authors presented algorithms of O/spl lsqb/n/sup /spl lambda/p/sup (/spl lambdaspl minus/12)/spl rsqb/, with 2/spl lesspl lambda/<3 and 1/spl les/p/spl les/n/sup 2/, and O/spl lsqb/log(p/n/sup 2/)+n/sup 3p/spl rsqb/, for n/sup 2spl les/p/spl les/n/sup 3/. The MMM/sub 1/ algorithm presented in this paper is O/spl lsqb/(n/sup 2p/sup 2/3/)log p + n/sup /spl lambda/p/sup /spl lambda3spl rsqb/, where 1/spl les/p/spl les/n/sup 3/. It can be shown that MMM/sub 1/ is better for 1/spl les/p/spl les/n/sup 3log/sup 3/n. The algorithm is derived by using the matricial visualization of the hypercube, suggested by Nassimi and Sahni (1982).< > | 
|---|---|
| ISBN: | 0818656026 9780818656026  | 
| DOI: | 10.1109/IPPS.1994.288258 |