PetRBF — A parallel O( N) algorithm for radial basis function interpolation with Gaussians
We have developed a parallel algorithm for radial basis function ( rbf) interpolation that exhibits O( N) complexity, requires O( N) storage, and scales excellently up to a thousand processes. The algorithm uses a gmres iterative solver with a restricted additive Schwarz method ( rasm) as a precondi...
Saved in:
| Published in | Computer methods in applied mechanics and engineering Vol. 199; no. 25; pp. 1793 - 1804 |
|---|---|
| Main Authors | , , |
| Format | Journal Article |
| Language | English |
| Published |
Kidlington
Elsevier B.V
01.05.2010
Elsevier |
| Subjects | |
| Online Access | Get full text |
| ISSN | 0045-7825 1879-2138 |
| DOI | 10.1016/j.cma.2010.02.008 |
Cover
| Summary: | We have developed a parallel algorithm for radial basis function (
rbf) interpolation that exhibits
O(
N) complexity, requires
O(
N) storage, and scales excellently up to a thousand processes. The algorithm uses a
gmres iterative solver with a restricted additive Schwarz method (
rasm) as a preconditioner and a fast matrix-vector algorithm. Previous fast
rbf methods — achieving at most
O(
Nlog
N) complexity — were developed using multiquadric and polyharmonic basis functions. In contrast, the present method uses Gaussians with a small variance with respect to the domain, but with sufficient overlap. This is a common choice in particle methods for fluid simulation, our main target application. The fast decay of the Gaussian basis function allows rapid convergence of the iterative solver even when the subdomains in the
rasm are very small. At the same time we show that the accuracy of the interpolation can achieve machine precision. The present method was implemented in parallel using the
petsc library (developer version). Numerical experiments demonstrate its capability in problems of
rbf interpolation with more than 50
million data points, timing at 106
s (19 iterations for an error tolerance of 10
−
15
) on 1024 processors of a Blue Gene/L (700
MHz PowerPC processors). The parallel code is freely available in the open-source model. |
|---|---|
| Bibliography: | ObjectType-Article-2 SourceType-Scholarly Journals-1 ObjectType-Feature-1 content type line 23 |
| ISSN: | 0045-7825 1879-2138 |
| DOI: | 10.1016/j.cma.2010.02.008 |