Textual interpretation of transient image classifications from large language models

(2025)

Authors:

Fiorenzo Stoppa, Turan Bulmus, Steven Bloemen, Stephen J Smartt, Paul J Groot, Paul Vreeswijk, Ken W Smith

Textual interpretation of transient image classifications from large language models

Nature Astronomy Nature Research (2025) 1-10

Authors:

Fiorenzo Stoppa, Turan Bulmus, Steven Bloemen, Stephen J Smartt, Paul J Groot, Paul Vreeswijk, Ken W Smith

Abstract:

Modern astronomical surveys deliver immense volumes of transient detections, yet distinguishing real astrophysical signals (for example, explosive events) from bogus imaging artefacts remains a challenge. Convolutional neural networks are effectively used for real versus bogus classification; however, their reliance on opaque latent representations hinders interpretability. Here we show that large language models (LLMs) can approach the performance level of a convolutional neural network on three optical transient survey datasets (Pan-STARRS, MeerLICHT and ATLAS) while simultaneously producing direct, human-readable descriptions for every candidate. Using only 15 examples and concise instructions, Google’s LLM, Gemini, achieves a 93% average accuracy across datasets that span a range of resolution and pixel scales. We also show that a second LLM can assess the coherence of the output of the first model, enabling iterative refinement by identifying problematic cases. This framework allows users to define the desired classification behaviour through natural language and examples, bypassing traditional training pipelines. Furthermore, by generating textual descriptions of observed features, LLMs enable users to query classifications as if navigating an annotated catalogue, rather than deciphering abstract latent spaces. As next-generation telescopes and surveys further increase the amount of data available, LLM-based classification could help bridge the gap between automated detection and transparent, human-level understanding.

A measurement of the high-mass τ τ ¯ production cross-section at s = 13 TeV with the ATLAS detector and constraints on new particles and couplings

Journal of High Energy Physics Springer 2025:10 (2025) 54

Authors:

G Aad, E Aakvaag, B Abbott, S Abdelhameed, K Abeling, NJ Abicht, SH Abidi, M Aboelela, A Aboulhorma, H Abramowicz, Y Abulaiti, BS Acharya, A Ackermann, C Adam Bourdarios, L Adamczyk, SV Addepalli, MJ Addison, J Adelman, A Adiguzel, T Adye, AA Affolder, Y Afik, MN Agaras, A Aggarwal

Abstract:

The production cross-section of high-mass τ-lepton pairs is measured as a function of the dilepton visible invariant mass, using 140 fb−1 of s=13 TeV proton-proton collision data recorded with the ATLAS detector at the Large Hadron Collider. The measurement agrees with the predictions of the Standard Model. A fit to the invariant mass distribution is performed as a function of b-jet multiplicity, to constrain the non-resonant production of new particles described by an effective field theory or in models containing leptoquarks or Z′ bosons that couple preferentially to third-generation fermions. The constraints on new particles improve on previous results, and the constraints on effective operators include those affecting the anomalous magnetic moment of the τ-lepton.

MIGHTEE-H i: the direct detection of neutral hydrogen in galaxies at z > 0.25

Monthly Notices of the Royal Astronomical Society Oxford University Press 544:1 (2025) 193-210

Authors:

Matt J Jarvis, Madalina N Tudorache, I Heywood, Anastasia A Ponomareva, M Baes, Natasha Maddox, Kristine Spekkens, Andreea Vărăşteanu, CL Hale, Mario G Santos, RG Varadaraj, Elizabeth AK Adams, Alessandro Bianchetti, Barbara Catinella, Jacinta Delhaize, M Maksymowicz-Maciata, Pavel E Mancera Piña, Hengxing Pan, Amélie Saintonge, Gauri Sharma, O Ivy Wong

Abstract:

Atomic hydrogen constitutes the gas reservoir from which molecular gas and star formation in galaxies emerges. However, the weakness of the line means it has been difficult to directly detect in all but the very local Universe. Here, we present results from the first search using the MeerKAT International Tiered Extragalactic Exploration (MIGHTEE) Survey for high-redshift () H i emission from individual galaxies. By searching for 21-cm emission centred on the position and redshift of optically selected emission-line galaxies we overcome difficulties that hinder untargeted searches. We detect 11 galaxies at , forming the first sample of detections with an interferometer, with the highest redshift detection at . We find they have much larger H i masses than their low-redshift H i-selected counterparts for a given stellar mass. This can be explained by the much larger cosmological volume probed at these high redshifts, and does not require any evolution of the H i mass function. We make the first-ever measurement of the baryonic Tully–Fisher relation (bTFr) with H i at and find consistency with the local bTFr, but with tentative evidence of a flattening in the relation at these redshifts for higher-mass objects. This may signify evolution, in line with predictions from hydrodynamic simulations, or that the molecular gas mass in these high-mass galaxies could be significant. This study paves the way for future studies of H i beyond the local Universe, using both searches targeted at known objects and via pure H i selection.

Robust cosmic shear with small-scale nulling

Journal of Cosmology and Astroparticle Physics IOP Publishing 2025:10 (2025) 017

Authors:

Giulia Piccirilli, Matteo Zennaro, Carlos García-García, David Alonso

Abstract:

Standard cosmological weak lensing analyses using cosmic shear are inevitably sensitive to small-scale, non-linear clustering from low-redshift structures. The need to adequately model the clustering of matter on this non-linear regime, accounting for both gravitational and baryonic effects, adds significant uncertainty to weak lensing studies, particularly in the context of near-future Stage-IV datasets. In this paper, inspired by previous work on so-called “nulling” techniques, we present a general method that selects the linear combinations of a given tomographic cosmic shear dataset that are least sensitive to small-scale non-linearities, by essentially suppressing the contribution from low-redshift structures. We apply this method to the latest public cosmic shear data from the Dark Energy Survey, DES-Y3, that corresponds to 3 years of observation, and show: a) that a large fraction of the signal is dominated by the single mode that is most affected by non-linear scales, and b) that removing this mode leads to a ∼ 1σ upwards shift in the preferred value of S 8 ≡ σ 8√(ΩM/0.3), alleviating the tension with current CMB data. However, the removal of the most contaminated mode also results in a significant increase in the statistical uncertainties. Taking this into account, we find this shift to be compatible with a random fluctuation caused by removing this most-contaminated mode at the ∼ 1.4σ level. We also show that this technique may be used by future Stage-IV surveys to mitigate the sensitivity of the final constraints to baryonic effects, trading precision for robustness.