Comparing Multi-class, Binary and Hierarchical Machine Learning Classification schemes for variable stars
Upcoming synoptic surveys are set to generate an unprecedented amount of data. This requires an automatic framework that can quickly and efficiently provide classification labels for several new object classification challenges. Using data describing 11 types of variable stars from the Catalina Real-Time Transient Survey (CRTS), we illustrate how to capture the most important information from computed features and describe in detail how to use Information Theory robustly for feature selection and evaluation. We apply three Machine Learning (ML) algorithms and demonstrate how to optimize these classifiers via cross-validation techniques. For the CRTS dataset, we find that the Random Forest (RF) classifier performs best in terms of balanced-accuracy and geometric means. We demonstrate substantially improved classification results by converting the multi-class problem into a binary classification task, achieving a balanced-accuracy rate of $\sim$99 per cent for the classification of ${\delta}$-Scuti and Anomalous Cepheids (ACEP). Additionally, we describe how classification performance can be improved by converting a 'flat-multi-class' problem into a hierarchical taxonomy. We develop a new hierarchical structure and propose a new set of classification features, enabling the accurate identification of subtypes of Cepheids, RR Lyrae and eclipsing binary stars in CRTS data.
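The two evaluation metrics named in the abstract can be illustrated with a minimal sketch. It assumes the common definitions: balanced accuracy as the arithmetic mean of per-class recall, and the geometric mean as the geometric mean of per-class recall. The paper's exact definitions may differ. The function names, the `to_binary` helper and the toy labels are all hypothetical and are not taken from the paper or from CRTS data.

```python
from collections import Counter
from math import prod


def per_class_recall(y_true, y_pred):
    """Return {label: recall} for every label that occurs in y_true."""
    totals = Counter(y_true)
    hits = Counter(t for t, p in zip(y_true, y_pred) if t == p)
    return {label: hits[label] / totals[label] for label in totals}


def balanced_accuracy(y_true, y_pred):
    """Arithmetic mean of per-class recall (assumed definition)."""
    recalls = per_class_recall(y_true, y_pred).values()
    return sum(recalls) / len(recalls)


def geometric_mean(y_true, y_pred):
    """Geometric mean of per-class recall (assumed definition)."""
    recalls = per_class_recall(y_true, y_pred).values()
    return prod(recalls) ** (1 / len(recalls))


def to_binary(labels, positive):
    """Hypothetical helper: collapse a multi-class labelling to one-vs-rest."""
    return [label if label == positive else "other" for label in labels]


# Made-up, imbalanced toy labels (not CRTS data).
y_true = ["RR Lyrae"] * 4 + ["ACEP"] * 2 + ["Eclipsing binary"] * 4
y_pred = (["RR Lyrae"] * 3 + ["Eclipsing binary"]
          + ["ACEP", "RR Lyrae"]
          + ["Eclipsing binary"] * 4)

multi_ba = balanced_accuracy(y_true, y_pred)
multi_gm = geometric_mean(y_true, y_pred)

# The same predictions scored as a binary "ACEP vs. everything else" task.
binary_ba = balanced_accuracy(to_binary(y_true, "ACEP"),
                              to_binary(y_pred, "ACEP"))

print(f"balanced accuracy: {multi_ba:.3f}")
print(f"geometric mean:    {multi_gm:.3f}")
print(f"binary (ACEP) balanced accuracy: {binary_ba:.3f}")
```

Both metrics weight every class equally regardless of how many objects it contains, which is why they are often preferred over plain accuracy on imbalanced label sets. The one-vs-rest relabelling only changes how predictions are scored here; it does not reproduce the paper's approach of training dedicated binary classifiers.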
Authors

Zafiirah Hosenie
Robert Lyon
Benjamin Stappers
Arrykrishna Mootoovaloo
Other
Sample Sizes (N=): None
Inserted: 07/18/19 06:01PM
Words Total: 11,337
Words Unique: 3,029
Tweets
arxivml: "Comparing Multi-class, Binary and Hierarchical Machine Learning Classification schemes for variable stars", Zafiir… https://t.co/5RysDHDPL3
StatsPapers: Comparing Multi-class, Binary and Hierarchical Machine Learning Classification schemes for variable stars. https://t.co/a6chYUuGXl
DARABigData: Publication 📝 alert! Comparing multi-class, binary and hierarchical machine learning classification schemes for variable stars 🌟 💫 by @DARABigData PhD student @ZHosenie is accepted by @RoyalAstroSoc #MNRAS https://t.co/YsNKEbY1AY