Galaxy zoo: Probabilistic morphology through Bayesian CNNs and active learning
Monthly Notices of the Royal Astronomical Society Oxford University Press 491:2 (2019) 1554-1574
Abstract:
We use Bayesian convolutional neural networks and a novel generative model of Galaxy Zoo volunteer responses to infer posteriors for the visual morphology of galaxies. Bayesian CNN can learn from galaxy images with uncertain labels and then, for previously unlabelled galaxies, predict the probability of each possible label. Our posteriors are well-calibrated (e.g. for predicting bars, we achieve coverage errors of 11.8 per cent within a vote fraction deviation of 0.2) and hence are reliable for practical use. Further, using our posteriors, we apply the active learning strategy BALD to request volunteer responses for the subset of galaxies which, if labelled, would be most informative for training our network. We show that training our Bayesian CNNs using active learning requires up to 35–60 per cent fewer labelled galaxies, depending on the morphological feature being classified. By combining human and machine intelligence, Galaxy zoo will be able to classify surveys of any conceivable scale on a time-scale of weeks, providing massive and detailed morphology catalogues to support research into galaxy evolution.A ghost in the toast: TESS background light produces a false “transit” across τ Ceti
Research Notes of the AAS American Astronomical Society 3:10 (2019) 145
Radio galaxy zoo: Unsupervised clustering of convolutionally auto-encoded radio-astronomical images
Publications of the Astronomical Society of the Pacific IOP Publishing 131:1004 (2019) 108011
Abstract:
This paper demonstrates a novel and efficient unsupervised clustering method with the combination of a self-organizing map (SOM) and a convolutional autoencoder. The rapidly increasing volume of radio-astronomical data has increased demand for machine-learning methods as solutions to classification and outlier detection. Major astronomical discoveries are unplanned and found in the unexpected, making unsupervised machine learning highly desirable by operating without assumptions and labeled training data. Our approach shows SOM training time is drastically reduced and high-level features can be clustered by training on auto-encoded feature vectors instead of raw images. Our results demonstrate this method is capable of accurately separating outliers on a SOM with neighborhood similarity and K-means clustering of radio-astronomical features. We present this method as a powerful new approach to data exploration by providing a detailed understanding of the morphology and relationships of Radio Galaxy Zoo (RGZ) data set image features which can be applied to new radio survey data.Author Correction: Time-lapse imagery and volunteer classifications from the Zooniverse Penguin Watch project.
Scientific data (2019)
Abstract:
An amendment to this paper has been published and can be accessed via a link at the top of the paper.Secularly powered outflows from AGN: the dominance of non-merger driven supermassive black hole growth
Monthly Notices of the Royal Astronomical Society Oxford University Press (OUP) (2019)