Machine learning classification of SDSS transient survey images

We show that multiple machine learning algorithms can match human performance in classifying transient imaging data from the Sloan Digital Sky Survey (SDSS) supernova survey into real objects and artefacts. This is a first step in any transient science pipeline and is currently still done by humans,...

Full description

Saved in:
Bibliographic Details
Published inMonthly notices of the Royal Astronomical Society Vol. 454; no. 2; pp. 2026 - 2038
Main Authors du Buisson, L., Sivanandam, N., Bassett, Bruce A., Smith, M.
Format Journal Article
LanguageEnglish
Published London Oxford University Press 01.12.2015
Subjects
Online AccessGet full text
ISSN0035-8711
1365-8711
1365-2966
1365-2966
DOI10.1093/mnras/stv2041

Cover

More Information
Summary:We show that multiple machine learning algorithms can match human performance in classifying transient imaging data from the Sloan Digital Sky Survey (SDSS) supernova survey into real objects and artefacts. This is a first step in any transient science pipeline and is currently still done by humans, but future surveys such as the Large Synoptic Survey Telescope (LSST) will necessitate fully machine-enabled solutions. Using features trained from eigenimage analysis (principal component analysis, PCA) of single-epoch g, r and i difference images, we can reach a completeness (recall) of 96 per cent, while only incorrectly classifying at most 18 per cent of artefacts as real objects, corresponding to a precision (purity) of 84 per cent. In general, random forests performed best, followed by the k-nearest neighbour and the SkyNet artificial neural net algorithms, compared to other methods such as naive Bayes and kernel support vector machine. Our results show that PCA-based machine learning can match human success levels and can naturally be extended by including multiple epochs of data, transient colours and host galaxy information which should allow for significant further improvements, especially at low signal-to-noise.
Bibliography:SourceType-Scholarly Journals-1
ObjectType-Feature-1
content type line 14
ObjectType-Article-1
ObjectType-Feature-2
content type line 23
ISSN:0035-8711
1365-8711
1365-2966
1365-2966
DOI:10.1093/mnras/stv2041