Skip to main content
eScholarship
Open Access Publications from the University of California

UC Berkeley

UC Berkeley Previously Published Works bannerUC Berkeley

Weakly supervised anomaly detection in the Milky Way

Abstract

Large-scale astrophysics data sets present an opportunity for new machine learning techniques to identify regions of interest that might otherwise be overlooked by traditional searches. To this end, we demonstrate how Classification Without Labels (CWoLa), a weakly supervised anomaly detection method, can help identify cold stellar streams within the more than one billion Milky Way stars observed by the Gaia satellite. CWoLa operates without the use of labelled streams or knowledge of astrophysical principles. Instead, it uses a classifier to distinguish between mixed samples for which the proportions of signal and background samples are unknown. As a proof of concept, we demonstrate that this computationally lightweight strategy is able to detect both simulated streams and the known stream GD-1 in data. Originally designed for high-energy collider physics, this technique may have broad applicability within astrophysics as well as other domains interested in identifying localized anomalies.

Many UC-authored scholarly publications are freely available on this site because of the UC's open access policies. Let us know how this access is important for you.

Main Content
For improved accessibility of PDF content, download the file to your device.
Current View