An algorithm for a constrained P-spline

Regression splines are largely used to investigate and predict data behavior, attracting the interest of mathematicians for their beautiful numerical properties, and of statisticians for their versatility with respect to the applications. Several penalized spline regression models are available in t...

Full description

Saved in:
Bibliographic Details
Published inBIT Vol. 65; no. 2
Main Authors Campagna, Rosanna, Crisci, Serena, Santin, Gabriele, Toraldo, Gerardo, Viola, Marco
Format Journal Article
LanguageEnglish
Published Dordrecht Springer Netherlands 01.06.2025
Springer Nature B.V
Subjects
Online AccessGet full text
ISSN0006-3835
1572-9125
1572-9125
DOI10.1007/s10543-025-01071-y

Cover

More Information
Summary:Regression splines are largely used to investigate and predict data behavior, attracting the interest of mathematicians for their beautiful numerical properties, and of statisticians for their versatility with respect to the applications. Several penalized spline regression models are available in the literature, and the most commonly used ones in real-world applications are P-splines, which enjoy the advantages of penalized models while being easy to generalize across different functional spaces and higher degree order, because of their discrete penalty term. To face the different requirements imposed by the nature of the problem or the physical meaning of the expected values, the P-spline definition is often modified by additional hypotheses, often translated into constraints on the solution or its derivatives. In this framework, our work is motivated by the aim of getting approximation models that fall within pre-established thresholds. Specifically, starting from a set of observed data, we consider a P-spline constrained between some prefixed bounds. In our paper, we just consider 0 as lower bound, although our approach applies to more general cases. We propose to get nonnegativity by imposing lower bounds on selected sample points. The spline can be computed through a sequence of linearly constrained problems. We suggest a strategy to dynamically select the sample points, to avoid extremely dense sampling, and therefore try to reduce as much as possible the computational burden. We show through some computational experiments the reliability of our approach and the accuracy of the results compared to some state-of-the-art models.
Bibliography:ObjectType-Article-1
SourceType-Scholarly Journals-1
ObjectType-Feature-2
content type line 14
ISSN:0006-3835
1572-9125
1572-9125
DOI:10.1007/s10543-025-01071-y