PetRBF — A parallel O( N) algorithm for radial basis function interpolation with Gaussians

We have developed a parallel algorithm for radial basis function ( rbf) interpolation that exhibits O( N) complexity, requires O( N) storage, and scales excellently up to a thousand processes. The algorithm uses a gmres iterative solver with a restricted additive Schwarz method ( rasm) as a precondi...

Full description

Saved in:
Bibliographic Details
Published inComputer methods in applied mechanics and engineering Vol. 199; no. 25; pp. 1793 - 1804
Main Authors Yokota, Rio, Barba, L.A., Knepley, Matthew G.
Format Journal Article
LanguageEnglish
Published Kidlington Elsevier B.V 01.05.2010
Elsevier
Subjects
Online AccessGet full text
ISSN0045-7825
1879-2138
DOI10.1016/j.cma.2010.02.008

Cover

More Information
Summary:We have developed a parallel algorithm for radial basis function ( rbf) interpolation that exhibits O( N) complexity, requires O( N) storage, and scales excellently up to a thousand processes. The algorithm uses a gmres iterative solver with a restricted additive Schwarz method ( rasm) as a preconditioner and a fast matrix-vector algorithm. Previous fast rbf methods — achieving at most O( Nlog N) complexity — were developed using multiquadric and polyharmonic basis functions. In contrast, the present method uses Gaussians with a small variance with respect to the domain, but with sufficient overlap. This is a common choice in particle methods for fluid simulation, our main target application. The fast decay of the Gaussian basis function allows rapid convergence of the iterative solver even when the subdomains in the rasm are very small. At the same time we show that the accuracy of the interpolation can achieve machine precision. The present method was implemented in parallel using the petsc library (developer version). Numerical experiments demonstrate its capability in problems of rbf interpolation with more than 50 million data points, timing at 106 s (19 iterations for an error tolerance of 10 − 15 ) on 1024 processors of a Blue Gene/L (700 MHz PowerPC processors). The parallel code is freely available in the open-source model.
Bibliography:ObjectType-Article-2
SourceType-Scholarly Journals-1
ObjectType-Feature-1
content type line 23
ISSN:0045-7825
1879-2138
DOI:10.1016/j.cma.2010.02.008