An Interpretable Machine-learning Framework for Modeling High-resolution Spectroscopic Data

Comparison of échelle spectra to synthetic models has become a computational statistics challenge, with over 10,000 individual spectral lines affecting a typical cool star échelle spectrum. Telluric artifacts, imperfect line lists, inexact continuum placement, and inflexible models frustrate the sci...

Full description

Saved in:

Bibliographic Details
Published in	The Astrophysical journal Vol. 941; no. 2; p. 200
Main Authors	Gully-Santiago, Michael, Morley, Caroline V.
Format	Journal Article
Language	English
Published	Philadelphia IOP Publishing 01.12.2022
Subjects	Acceleration Algorithms Archaeology Astrophysics Back propagation networks Cool stars Data comparison Deep learning Line spectra Machine learning Magnetic fields Modelling Neural networks Radial velocity Remote sensing Scaffolding Source code Spectroscopy Spectrum analysis Stellar models Telluric lines
Online Access	Get full text
ISSN	0004-637X 1538-4357 1538-4357
DOI	10.3847/1538-4357/aca0a2

Cover

More Information
Summary:	Comparison of échelle spectra to synthetic models has become a computational statistics challenge, with over 10,000 individual spectral lines affecting a typical cool star échelle spectrum. Telluric artifacts, imperfect line lists, inexact continuum placement, and inflexible models frustrate the scientific promise of these information-rich data sets. Here we debut an interpretable machine-learning framework blasé that addresses these and other challenges. The semiempirical approach can be viewed as “transfer learning”—first pretraining models on noise-free precomputed synthetic spectral models, then learning the corrections to line depths and widths from whole-spectrum fitting to an observed spectrum. The auto-differentiable model employs back-propagation, the fundamental algorithm empowering modern deep learning and neural networks. Here, however, the 40,000+ parameters symbolize physically interpretable line profile properties such as amplitude, width, location, and shape, plus radial velocity and rotational broadening. This hybrid data-/model-driven framework allows joint modeling of stellar and telluric lines simultaneously, a potentially transformative step forward for mitigating the deleterious telluric contamination in the near-infrared. The blasé approach acts as both a deconvolution tool and semiempirical model. The general-purpose scaffolding may be extensible to many scientific applications, including precision radial velocities, Doppler imaging, chemical abundances for Galactic archeology, line veiling, magnetic fields, and remote sensing. Its sparse-matrix architecture and GPU acceleration make blasé fast. The open-source PyTorch-based code blase includes tutorials, Application Programming Interface documentation, and more. We show how the tool fits into the existing Python spectroscopy ecosystem, demonstrate a range of astrophysical applications, and discuss limitations and future extensions.
Bibliography:	ObjectType-Article-1 SourceType-Scholarly Journals-1 ObjectType-Feature-2 content type line 14
ISSN:	0004-637X 1538-4357 1538-4357
DOI:	10.3847/1538-4357/aca0a2