Exhaustive symbolic regression

IEEE Transactions on Evolutionary Computation IEEE (2023)

Authors:

Deaglan Bartlett, Harry Desmond, Pedro Ferreira

Abstract:

Symbolic Regression (SR) algorithms attempt to learn analytic expressions which fit data accurately and in a highly interpretable manner. Conventional SR suffers from two fundamental issues which we address here. First, these methods search the space stochastically (typically using genetic programming) and hence do not necessarily find the best function. Second, the criteria used to select the equation optimally balancing accuracy with simplicity have been variable and subjective. To address these issues we introduce Exhaustive Symbolic Regression (ESR), which systematically and efficiently considers all possible equations—made with a given basis set of operators and up to a specified maximum complexity— and is therefore guaranteed to find the true optimum (if parameters are perfectly optimised) and a complete function ranking subject to these constraints. We implement the minimum description length principle as a rigorous method for combining these preferences into a single objective. To illustrate the power of ESR we apply it to a catalogue of cosmic chronometers and the Pantheon+ sample of supernovae to learn the Hubble rate as a function of redshift, finding 40 functions (out of 5.2 million trial functions) that fit the data more economically than the Friedmann equation. These low-redshift data therefore do not uniquely prefer the expansion history of the standard model of cosmology. We make our code and full equation sets publicly available.

Can we constrain structure growth from galaxy proper motions?

(2023)

Authors:

Iain Duncan, David Alonso, Anže Slosar, Kate Storey-Fisher

The Highest-Redshift Balmer Breaks as a Test of $Λ$CDM

ArXiv 2305.15459 (2023)

Authors:

Charles L Steinhardt, Albert Sneppen, Thorbjørn Clausen, Harley Katz, Martin P Rey, Jonas Stahlschmidt

Relativistic drag forces on black holes from scalar dark matter clouds of all sizes

ArXiv 2305.10492 (2023)

Authors:

Dina Traykova, Rodrigo Vicente, Katy Clough, Thomas Helfer, Emanuele Berti, Pedro G Ferreira, Lam Hui

Bursts from Space: MeerKAT – the first citizen science project dedicated to commensal radio transients

Monthly Notices of the Royal Astronomical Society Oxford University Press 523:2 (2023) 2219-2235

Authors:

Alex Andersson, chris Lintott, rob Fender, joe Bright, francesco Carotenuto, ian Heywood, Lauren Rhodes, Sara Motta, David Williams

Abstract:

The newest generation of radio telescopes is able to survey large areas with high sensitivity and cadence, producing data volumes that require new methods to better understand the transient sky. Here, we describe the results from the first citizen science project dedicated to commensal radio transients, using data from the MeerKAT telescope with weekly cadence. Bursts from Space: MeerKAT was launched late in 2021 and received ∼89 000 classifications from over 1000 volunteers in 3 months. Our volunteers discovered 142 new variable sources which, along with the known transients in our fields, allowed us to estimate that at least 2.1 per cent of radio sources are varying at 1.28 GHz at the sampled cadence and sensitivity, in line with previous work. We provide the full catalogue of these sources, the largest of candidate radio variables to date. Transient sources found with archival counterparts include a pulsar (B1845-01) and an OH maser star (OH 30.1–0.7), in addition to the recovery of known stellar flares and X-ray binary jets in our observations. Data from the MeerLICHT optical telescope, along with estimates of long time-scale variability induced by scintillation, imply that the majority of the new variables are active galactic nuclei. This tells us that citizen scientists can discover phenomena varying on time-scales from weeks to several years. The success both in terms of volunteer engagement and scientific merit warrants the continued development of the project, while we use the classifications from volunteers to develop machine learning techniques for finding transients.