MetaBeeAI: An AI pipeline for structured evidence extraction from biological literature
Ecological Informatics Elsevier 96 (2026) 103813
Abstract:
The volume and complexity of scientific literature are expanding rapidly, making it increasingly difficult to extract and synthesise information across studies. This challenge is particularly acute in the biological sciences, where evidence spans multiple levels of organisation and heterogeneous experimental designs. Large Language Model (LLM) pipelines offer a scalable route to evidence synthesis, but many existing approaches lack transparency, modularity, and effective mechanisms for human oversight. We present MetaBeeAI, an open-source, modular pipeline that integrates established LLM techniques into a coherent, auditable workflow for structured data extraction in biology. MetaBeeAI combines modular prompting, multi-pass extraction, and expert-in-the-loop validation within an interface that presents model outputs alongside source text, enabling inspection, correction, and iterative refinement. The pipeline produces machine-readable records of prompts, configurations, and expert annotations, supporting reproducibility and continuous improvement. We apply MetaBeeAI to 924 research papers on bees and pesticides, extracting structured information on species, compounds, exposure designs, and experimental context. Evaluation demonstrates improved consistency, convergence with expert judgement, and robustness across heterogeneous biological studies, highlighting the value of expert-guided refinement. MetaBeeAI provides a transparent and extensible framework for scalable evidence synthesis, supporting reliable integration of LLMs into biological research workflows.A Python client for the ATLAS API
The Journal of Open Source Software Open Journals 11:119 (2026) 9462-9462
Abstract:
The Asteroid Terrestrial-impact Last Alert System (ATLAS) is an all-sky optical sky survey with a cadence of 24 to 48 hours (Tonry et al., 2018), and the ATLAS Transient Server (Smith et al., 2020) processes the alert stream to enable the discovery and follow-up of extra-galactic transients. The data from the ATLAS server can be accessed through a REST API, which has allowed the development of bots that need direct access to the data to help rank alerts and trigger follow-up observations of promising targets. Here we present the Python client we have developed for the ATLAS API to help connect bots and scientists to our data.ATLAS100 -- I. A volume-limited sample of supernovae and related transients within 100 Mpc
(2026)
Anomaly Hunter for Alerts (AHA): Anomaly Detection in the ZTF Transient Alert Stream
(2026)
Search for the Optical Counterpart of Einstein Probe–discovered Fast X-Ray Transients from the Lulin Observatory
The Astrophysical Journal: Supplement Series American Astronomical Society 281:1 (2025) 20