Discriminating Variable Star Candidates in Large Image Databases from the HiTS Survey Using NMF
Author(s) -
P. Huijse,
P. A. Estévez,
F. Förster,
Emanuel Berrocal
Publication year - 2015
Publication title -
procedia computer science
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 0.334
H-Index - 76
ISSN - 1877-0509
DOI - 10.1016/j.procs.2015.07.276
Subject(s) - computer science , star (game theory) , variable (mathematics) , information retrieval , database , data mining , pattern recognition (psychology) , artificial intelligence , astrophysics , mathematics , physics , mathematical analysis
New instruments and technologies are allowing the acquisition of large amounts of data from astronomical surveys. Nowadays there is a pressing need for autonomous methods to discriminate the interesting astronomical objects in the vast sky. The High Cadence Transient Survey (HiTS) project is an astronomical survey that is trying to find a rare transient event that occurs during the first instants of a supernova. In this paper we propose an autonomous method to discriminate stellar variability from the HiTS database, that uses a feature extraction scheme based on Non-negative matrix factorization (NMF). Using NMF, dictionaries of image prototypes that represent the data in a compact way are obtained. The projections of the dataset into these dictionaries are fed into a random forest classifier. NMF is compared with other feature extraction schemes, on a subset of 500,000 transient candidates from the HiTS survey. With NMF a better class separability at feature level is obtained which enhances the classification accuracy significantly. Using the NMF features less than 4% of the true stellar transients are lost, at a manageable false positive rate of 0.1%
Accelerating Research
Robert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom
Address
John Eccles HouseRobert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom