TiDES: The 4MOST Time Domain Extragalactic Survey

The Astrophysical Journal American Astronomical Society 992:1 (2025) 158

Authors:

C Frohmaier, M Vincenzi, M Sullivan, SF Hönig, M Smith, H Addison, T Collett, G Dimitriadis, RS Ellis, P Gandhi, O Graur, I Hook, L Kelsey, Y-L Kim, C Lidman, K Maguire, L Makrygianni, B Martin, A Möller, RC Nichol, M Nicholl, P Schady, BD Simmons, SJ Smartt

Abstract:

The Time Domain Extragalactic Survey (TiDES) conducted on the 4 m Multi-Object Spectroscopic Telescope will perform spectroscopic follow-up of extragalactic transients discovered in the era of the NSF-DOE Vera C. Rubin Observatory. TiDES will conduct a 5 yr survey, covering >14,000 square degrees, and use around 250,000 fibre hours to address three main science goals: (i) spectroscopic observations of >30,000 live transients, (ii) comprehensive follow-up of >200,000 host galaxies to obtain redshift measurements, and (iii) repeat spectroscopic observations of active galactic nuclei to enable reverberation mapping studies. The live spectra from TiDES will be used to reveal the diversity and astrophysics of both normal and exotic supernovae across the luminosity-timescale plane. The extensive host-galaxy redshift campaign will allow exploitation of the larger sample of supernovae and improve photometric classification, providing the largest-ever sample of SNe Ia, capable of a sub-2% measurement of the equation-of-state of dark energy. Finally, the TiDES reverberation mapping experiment of 700–1000 AGN will complement the SN Ia sample and extend the Hubble diagram to z ∼ 2.5.

New Metrics for Identifying Variables and Transients in Large Astronomical Surveys

The Astrophysical Journal American Astronomical Society 992:1 (2025) 109

Authors:

Shih Ching Fu, Arash Bahramian, Aloke Phatak, James CA Miller-Jones, Suman Rakshit, Alexander Andersson, Robert Fender, Patrick A Woudt

Abstract:

A key science goal of large sky surveys such as those conducted by the Vera C. Rubin Observatory and precursors to the Square Kilometre Array is the identification of variable and transient objects. One approach is analyzing time series of the changing brightness of sources, namely, light curves. However, finding adequate statistical representations of light curves is challenging because of the sparsity of observations, irregular sampling, and nuisance factors inherent in astronomical data collection. The wide diversity of objects that a large-scale survey will observe also means that making parametric assumptions about the shape of light curves is problematic. We present a Gaussian process (GP) regression approach for characterizing light-curve variability that addresses these challenges. Our approach makes no assumptions about the shape of a light curve and, therefore, is general enough to detect a range of variable and transient source types. In particular, we propose using the joint distribution of GP amplitude hyperparameters to distinguish variable and transient candidates from nominally stable ones and apply this approach to 6394 radio light curves from the ThunderKAT survey. We compare our results with two variability metrics commonly used in radio astronomy, namely $\eta_\nu$ and $V_\nu$, and show that our approach has better discriminatory power and interpretability. Finally, we conduct a rudimentary search for transient sources in the ThunderKAT data set to demonstrate how our approach might be used as an initial screening tool. Computational notebooks in Python and R are available to help deploy this framework to other surveys.
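
For readers unfamiliar with the two baseline metrics named in the abstract, the following is a minimal sketch of how they are commonly defined in radio transient pipelines: $\eta_\nu$ as the reduced chi-square of the fluxes about their inverse-variance weighted mean, and $V_\nu$ as the sample standard deviation divided by the mean flux. The abstract does not state these formulas, so the definitions, the function name `variability_metrics`, and its arguments are assumptions made for illustration. This is not the paper's GP method.

```python
import numpy as np


def variability_metrics(flux, flux_err):
    """Return (eta, V) for one light curve (hypothetical helper).

    Assumed definitions, not taken from the abstract:
      eta = 1/(N-1) * sum((F_i - xi)^2 / sigma_i^2), xi = weighted mean flux
      V   = sample standard deviation of F / mean of F
    """
    flux = np.asarray(flux, dtype=float)
    flux_err = np.asarray(flux_err, dtype=float)
    n = flux.size
    if n < 2:
        raise ValueError("need at least two flux measurements")

    # Inverse-variance weights and the weighted mean flux.
    weights = 1.0 / flux_err**2
    weighted_mean = np.sum(weights * flux) / np.sum(weights)

    # Reduced chi-square against a constant-flux model.
    eta = np.sum(weights * (flux - weighted_mean) ** 2) / (n - 1)

    # Coefficient of variation using the unbiased (ddof=1) standard deviation.
    v = np.std(flux, ddof=1) / np.mean(flux)
    return eta, v


if __name__ == "__main__":
    # Toy light curve: two epochs with unit uncertainties.
    eta, v = variability_metrics([1.0, 3.0], [1.0, 1.0])
    print(f"eta = {eta:.3f}, V = {v:.3f}")
```

Under these definitions a perfectly constant light curve gives zero for both metrics, while a source whose scatter exceeds its measurement uncertainties gives an $\eta_\nu$ well above 1.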

Textual interpretation of transient image classifications from large language models

Nature Astronomy Nature Research (2025) 1-10

Authors:

Fiorenzo Stoppa, Turan Bulmus, Steven Bloemen, Stephen J Smartt, Paul J Groot, Paul Vreeswijk, Ken W Smith

Abstract:

Modern astronomical surveys deliver immense volumes of transient detections, yet distinguishing real astrophysical signals (for example, explosive events) from bogus imaging artefacts remains a challenge. Convolutional neural networks are effectively used for real versus bogus classification; however, their reliance on opaque latent representations hinders interpretability. Here we show that large language models (LLMs) can approach the performance level of a convolutional neural network on three optical transient survey datasets (Pan-STARRS, MeerLICHT and ATLAS) while simultaneously producing direct, human-readable descriptions for every candidate. Using only 15 examples and concise instructions, Google’s LLM, Gemini, achieves a 93% average accuracy across datasets that span a range of resolution and pixel scales. We also show that a second LLM can assess the coherence of the output of the first model, enabling iterative refinement by identifying problematic cases. This framework allows users to define the desired classification behaviour through natural language and examples, bypassing traditional training pipelines. Furthermore, by generating textual descriptions of observed features, LLMs enable users to query classifications as if navigating an annotated catalogue, rather than deciphering abstract latent spaces. As next-generation telescopes and surveys further increase the amount of data available, LLM-based classification could help bridge the gap between automated detection and transparent, human-level understanding.

Angular correlation functions of bright Lyman-break galaxies at 3 ≲ z ≲ 5

Monthly Notices of the Royal Astronomical Society Oxford University Press (OUP) (2025) staf1651

Authors:

Isabelle Ye, Philip Bull, Rebecca AA Bowler, Rachel K Cochrane, Nathan J Adams, Matt J Jarvis

Abstract:

We investigate the clustering of Lyman-break galaxies at redshifts of 3 ≲ z ≲ 5 within the COSMOS field by measuring the angular two-point correlation function. Our robust sample of ~60,000 bright ($m_{\rm UV}$ ≲ 27) Lyman-break galaxies was selected based on spectral energy distribution fitting across 14 photometric bands spanning optical and near-infrared wavelengths. We constrained both the 1- and 2-halo terms at separations up to 300 arcsec, finding an excess in the correlation function at scales corresponding to <20 kpc, consistent with enhancement due to clumps in the same galaxy or interactions on this scale. We then performed Bayesian model fits on the correlation functions to infer the Halo Occupation Distribution parameters, star formation duty cycle, and galaxy bias in three redshift bins. We examined several cases where different combinations of parameters were varied, showing that our data can constrain the slope of the satellite occupation function, which previous studies have fixed. For an $M_{\rm UV}$-limited sub-sample, we found galaxy bias values of $b_g=3.18^{+0.14}_{-0.14}$ at z ≃ 3, $b_g=3.58^{+0.27}_{-0.29}$ at z ≃ 4, and $b_g=4.27^{+0.25}_{-0.26}$ at z ≃ 5. The duty cycle values are $0.62^{+0.25}_{-0.26}$, $0.40^{+0.34}_{-0.22}$, and $0.39^{+0.31}_{-0.20}$, respectively. These results suggest that, as the redshift increases, there is a slight decrease in the host halo masses and a shorter timescale for star formation in bright galaxies, at a fixed rest-frame UV luminosity threshold.
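
As an illustration of what measuring an angular two-point correlation function involves, the sketch below uses the Landy-Szalay estimator, $w(\theta) = (DD - 2DR + RR)/RR$, with pair counts normalised by the number of possible pairs. The abstract does not say which estimator the authors used, so this choice is an assumption, as are the flat-sky approximation (positions as small x, y offsets in the same angular unit as the bins) and the hypothetical function names `pair_counts` and `landy_szalay`.

```python
import numpy as np


def pair_counts(a, b, bins, auto=False):
    """Normalised histogram of pair separations (hypothetical helper).

    a, b : arrays of shape (N, 2) with flat-sky x, y offsets.
    auto : if True, count each unordered pair within `a` once.
    """
    dx = a[:, None, 0] - b[None, :, 0]
    dy = a[:, None, 1] - b[None, :, 1]
    sep = np.hypot(dx, dy)
    if auto:
        # Upper triangle only: each unordered pair once, no self-pairs.
        sep = sep[np.triu_indices(len(a), k=1)]
        norm = len(a) * (len(a) - 1) / 2.0
    else:
        sep = sep.ravel()
        norm = float(len(a) * len(b))
    counts, _ = np.histogram(sep, bins=bins)
    return counts / norm


def landy_szalay(data, rand, bins):
    """Landy-Szalay estimate of w(theta) in each separation bin."""
    dd = pair_counts(data, data, bins, auto=True)
    rr = pair_counts(rand, rand, bins, auto=True)
    dr = pair_counts(data, rand, bins)
    # Bins with no random-random pairs yield nan or inf rather than an error.
    with np.errstate(divide="ignore", invalid="ignore"):
        return (dd - 2.0 * dr + rr) / rr


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    # Unclustered toy catalogue and a denser random catalogue, both uniform
    # over a 300 x 300 patch (units arbitrary, e.g. arcsec).
    data = rng.uniform(0.0, 300.0, size=(300, 2))
    rand = rng.uniform(0.0, 300.0, size=(600, 2))
    bins = np.linspace(10.0, 100.0, 6)
    print(landy_szalay(data, rand, bins))
```

For an unclustered catalogue the estimate should scatter around zero. A real analysis would use great-circle separations, a survey mask for the random catalogue, and a tree-based pair counter rather than the full distance matrix built here.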