Marginalised Normal Regression: Unbiased curve fitting in the presence of x-errors
ArXiv 2309.00948 (2023)
The information on halo properties contained in spectroscopic observations of late-type galaxies
Monthly Notices of the Royal Astronomical Society Oxford University Press 525:4 (2023) 5066-5079
Abstract:
Rotation curves are the key observational manifestation of the dark matter distribution around late-type galaxies. In a halo model context, the precision of constraints on halo parameters is a complex function of properties of the measurements as well as properties of the galaxy itself. Forthcoming surveys will resolve rotation curves to varying degrees of precision, or measure their integrated effect in the HI linewidth. To ascertain the relative significance of the relevant quantities for constraining halo properties, we study the information on halo mass and concentration as quantified by the Kullback–Leibler divergence of the kinematics-informed posterior from the uninformative prior. We calculate this divergence as a function of the different types of spectroscopic observation, properties of the measurement, galaxy properties, and auxiliary observational data on the baryonic components. Using the SPARC (Spitzer Photometry & Accurate Rotation Curves) sample, we find that fits to the full rotation curve exhibit a large variation in information gain between galaxies, ranging from ~1 to ~11 bits. The variation is predominantly caused by the vast differences in the number of data points and the size of velocity uncertainties between the SPARC galaxies. We also study the relative importance of the minimum HI surface density probed and the size of velocity uncertainties on the constraining power on the inner halo density slope, finding the latter to be significantly more important. We spell out the implications of these results for the optimization of galaxy surveys aiming to constrain galaxies’ dark matter distributions, highlighting the need for precise velocity measurements.Priors for symbolic regression
Association for Computing Machinery (ACM) (2023) 2402-2411
Exhaustive symbolic regression
IEEE Transactions on Evolutionary Computation IEEE (2023)
Abstract:
Symbolic Regression (SR) algorithms attempt to learn analytic expressions which fit data accurately and in a highly interpretable manner. Conventional SR suffers from two fundamental issues which we address here. First, these methods search the space stochastically (typically using genetic programming) and hence do not necessarily find the best function. Second, the criteria used to select the equation optimally balancing accuracy with simplicity have been variable and subjective. To address these issues we introduce Exhaustive Symbolic Regression (ESR), which systematically and efficiently considers all possible equations—made with a given basis set of operators and up to a specified maximum complexity— and is therefore guaranteed to find the true optimum (if parameters are perfectly optimised) and a complete function ranking subject to these constraints. We implement the minimum description length principle as a rigorous method for combining these preferences into a single objective. To illustrate the power of ESR we apply it to a catalogue of cosmic chronometers and the Pantheon+ sample of supernovae to learn the Hubble rate as a function of redshift, finding 40 functions (out of 5.2 million trial functions) that fit the data more economically than the Friedmann equation. These low-redshift data therefore do not uniquely prefer the expansion history of the standard model of cosmology. We make our code and full equation sets publicly available.Inferring dark matter halo properties for H i-selected galaxies
Monthly Notices of the Royal Astronomical Society Oxford University Press 526:4 (2023) 5861-5882