New Metrics for Identifying Variables and Transients in Large Astronomical Surveys

The Astrophysical Journal American Astronomical Society 992:1 (2025) 109

Authors:

Shih Ching Fu, Arash Bahramian, Aloke Phatak, James CA Miller-Jones, Suman Rakshit, Alexander Andersson, Robert Fender, Patrick A Woudt

Abstract:

A key science goal of large sky surveys such as those conducted by the Vera C. Rubin Observatory and precursors to the Square Kilometre Array is the identification of variable and transient objects. One approach is analyzing time series of the changing brightness of sources, namely, light curves. However, finding adequate statistical representations of light curves is challenging because of the sparsity of observations, irregular sampling, and nuisance factors inherent in astronomical data collection. The wide diversity of objects that a large-scale survey will observe also means that making parametric assumptions about the shape of light curves is problematic. We present a Gaussian process (GP) regression approach for characterizing light-curve variability that addresses these challenges. Our approach makes no assumptions about the shape of a light curve and, therefore, is general enough to detect a range of variable and transient source types. In particular, we propose using the joint distribution of GP amplitude hyperparameters to distinguish variable and transient candidates from nominally stable ones and apply this approach to 6394 radio light curves from the ThunderKAT survey. We compare our results with two variability metrics commonly used in radio astronomy, namely ην and Vν, and show that our approach has better discriminatory power and interpretability. Finally, we conduct a rudimentary search for transient sources in the ThunderKAT data set to demonstrate how our approach might be used as an initial screening tool. Computational notebooks in Python and R are available to help deploy this framework to other surveys.

Textual interpretation of transient image classifications from large language models

(2025)

Authors:

Fiorenzo Stoppa, Turan Bulmus, Steven Bloemen, Stephen J Smartt, Paul J Groot, Paul Vreeswijk, Ken W Smith

Textual interpretation of transient image classifications from large language models

Nature Astronomy Nature Research (2025) 1-10

Authors:

Fiorenzo Stoppa, Turan Bulmus, Steven Bloemen, Stephen J Smartt, Paul J Groot, Paul Vreeswijk, Ken W Smith

Abstract:

Modern astronomical surveys deliver immense volumes of transient detections, yet distinguishing real astrophysical signals (for example, explosive events) from bogus imaging artefacts remains a challenge. Convolutional neural networks are effectively used for real versus bogus classification; however, their reliance on opaque latent representations hinders interpretability. Here we show that large language models (LLMs) can approach the performance level of a convolutional neural network on three optical transient survey datasets (Pan-STARRS, MeerLICHT and ATLAS) while simultaneously producing direct, human-readable descriptions for every candidate. Using only 15 examples and concise instructions, Google’s LLM, Gemini, achieves a 93% average accuracy across datasets that span a range of resolution and pixel scales. We also show that a second LLM can assess the coherence of the output of the first model, enabling iterative refinement by identifying problematic cases. This framework allows users to define the desired classification behaviour through natural language and examples, bypassing traditional training pipelines. Furthermore, by generating textual descriptions of observed features, LLMs enable users to query classifications as if navigating an annotated catalogue, rather than deciphering abstract latent spaces. As next-generation telescopes and surveys further increase the amount of data available, LLM-based classification could help bridge the gap between automated detection and transparent, human-level understanding.

Angular correlation functions of bright Lyman-break galaxies at 3 ≲ z ≲ 5

Monthly Notices of the Royal Astronomical Society Oxford University Press (OUP) (2025) staf1651

Authors:

Isabelle Ye, Philip Bull, Rebecca AA Bowler, Rachel K Cochrane, Nathan J Adams, Matt J Jarvis

Abstract:

Abstract We investigate the clustering of Lyman-break galaxies at redshifts of 3 ≲ z ≲ 5 within the COSMOS field by measuring the angular two-point correlation function. Our robust sample of ~60,000 bright (mUV ≲ 27) Lyman-break galaxies was selected based on spectral energy distribution fitting across 14 photometric bands spanning optical and near-infrared wavelengths. We constrained both the 1- and 2-halo terms at separations up to 300 arcsec, finding an excess in the correlation function at scales corresponding to <20 kpc, consistent with enhancement due to clumps in the same galaxy or interactions on this scale. We then performed Bayesian model fits on the correlation functions to infer the Halo Occupation Distribution parameters, star formation duty cycle, and galaxy bias in three redshift bins. We examined several cases where different combinations of parameters were varied, showing that our data can constrain the slope of the satellite occupation function, which previous studies have fixed. For an MUV-limited sub-sample, we found galaxy bias values of $b_g=3.18^{+0.14}_{-0.14}$ at z ≃ 3, $b_g=3.58^{+0.27}_{-0.29}$ at z ≃ 4, $b_g=4.27^{+0.25}_{-0.26}$ at z ≃ 5. The duty cycle values are $0.62^{+0.25}_{-0.26}$, $0.40^{+0.34}_{-0.22}$, and $0.39^{+0.31}_{-0.20}$,respectively. These results suggest that, as the redshift increases, there is a slight decrease in the host halo masses and a shorter timescale for star formation in bright galaxies, at a fixed rest-frame UV luminosity threshold.

SN 2019tsf: Evidence for Extended Hydrogen-poor CSM in the Three-peaked Light Curve of Stripped Envelope of a Type Ib Supernova

The Astrophysical Journal American Astronomical Society 992:1 (2025) 9

Authors:

Yossef Zenati, Qinan Wang, Alexey Bobrick, Lindsay DeMarchi, Hila Glanz, Mor Rozner, Jacob E Jencson, Armin Rest, Brian D Metzger, Raffaella Margutti, Sebastian Gomez, Nathan Smith, Silvia Toonen, Joe S Bright, Colin Norman, Ryan J Foley, Alexander Gagliano, Julian H Krolik, Stephen J Smartt, Ashley V Villar, Gautham Narayan, Ori Fox, Katie Auchettl, Daniel Brethauer

Abstract:

We present multiband ATLAS and ZTF photometry for SN 2019tsf, a Type Ib stripped-envelope supernova (SESN). The slow spectral evolution could be associated with an uncommon explosion mechanism specific to this SN. Possible explanations include fallback accretion onto a compact remnant or a long-lived central engine, both of which could provide extended energy injection responsible for the late-time rebrightening and unusual spectral features. The rebrightening observations represent the latest photometric measurements of a multipeaked Type Ib SN. As late-time photometry and spectroscopy suggest no hydrogen, the potential circumstellar material (CSM) must be H-poor. The absence of a nebular phase and the lack of narrow emission lines in the late-time spectra (>142 days) of the SNe suggest that any CSM interaction is likely asymmetric and enveloped by the SN ejecta. However, an extended CSM structure is evident through a follow-up radio campaign with the Karl G. Jansky Very Large Array (VLA), indicating a source of bright optically thick radio emission at late times, which is highly unusual among H-poor SESNe. We attribute this phenomenology to an interaction of the supernova ejecta with asymmetric CSM, potentially disk-like, and we present several models that may explain the origin of this rare Type Ib supernova. We propose a warped disk model in which a tertiary companion—commonly present around massive stars—perturbs the progenitor’s CSM, producing density enhancements that may explain the observed multipeaked SN 2019tsf light curve. This SN 2019tsf is a unique SN Type Ib among the recently discovered class of SNe that undergo mass transfer at the moment of explosion.