An introduction to the Zooniverse

AAAI Workshop - Technical Report WS-13-18 (2013) 103

Authors:

AM Smith, S Lynn, CJ Lintott

Abstract:

The Zooniverse (zooniverse.org) began in 2007 with the launch of Galaxy Zoo, a project in which more than 175,000 people provided shape analyses of more than 1 million galaxy images sourced from the Sloan Digital Sky Survey. These galaxy 'classifications', some 60 million in total, have subsequently been used to produce more than 50 peer-reviewed publications based not only on the original research goals of the project but also because of serendipitous discoveries made by the volunteer community. Based upon the success of Galaxy Zoo the team have gone on to develop more than 25 web-based citizen science projects, all with a strong research focus in a range of subjects from astronomy to zoology where human-based analysis still exceeds that of machine intelligence. Over the past 6 years Zooniverse projects have collected more than 300 million data analyses from over 1 million volunteers providing fantastically rich datasets for not only the individuals working to produce research from their projects but also the machine learning and computer vision research communities. The Zooniverse platform has always been developed to be the 'simplest thing that works', implementing only the most rudimentary algorithms for functionality such as task allocation and user-performance metrics. These simplifications have been necessary to scale the Zooniverse so that the core team of developers and data scientists can remain small and the cost of running the computing infrastructure relatively modest. To date these simplifications have been acceptable for the data volumes and analysis tasks being addressed. This situation however is changing: next generation telescopes such as the Large Synoptic Sky Telescope (LSST) will produce data volumes dwarfing those previously analyzed. If citizen science is to have a part to play in analyzing these next-generation datasets then the Zooniverse will need to evolve into a smarter system capable for example of modeling the abilities of users and the complexities of the data being classified in real time. In this session we will outline the current architecture of the Zooniverse platform and introduce new functionality being developed that should be of interest to the HCOMP community. Our platform is evolving into a system capable of integrating human and machine intelligence in a live environment. Data APIs providing realtime access to 'event streams' from the Zooniverse infrastructure are currently being tested as well as API endpoints for making decisions about for example what piece of data to show next to a volunteer as well as when to retire a piece of data from the live system because a consensus has been reached.

CFHTLenS tomographic weak lensing cosmological parameter constraints: Mitigating the impact of intrinsic galaxy alignments

Monthly Notices of the Royal Astronomical Society 432:3 (2013) 2433-2453

Authors:

C Heymans, E Grocutt, A Heavens, M Kilbinger, TD Kitching, F Simpson, J Benjamin, T Erben, H Hildebrandt, H Hoekstra, Y Mellier, L Miller, L Van Waerbeke, ML Brown, J Coupon, L Fu, J Harnois-Déraps, MJ Hudson, K Kuijken, B Rowe, T Schrabback, E Semboloni, S Vafaei, M Velander

Abstract:

We present a finely binned tomographic weak lensing analysis of the Canada-France-Hawaii Telescope Lensing Survey (CFHTLenS) mitigating contamination to the signal from the presence of intrinsic galaxy alignments via the simultaneous fit of a cosmological model and an intrinsic alignment model. CFHTLenS spans 154 square degrees in five optical bands, with accurate shear and photometric redshifts for a galaxy sample with a median redshift of zm = 0.70. We estimate the 21 sets of cosmic shear correlation functions associated with six redshift bins, each spanning the angular range of 1.5 < θ < 35 arcmin. We combine this CFHTLenS data with auxiliary cosmological probes: the cosmic microwave background with data from WMAP7, baryon acoustic oscillations with data from Baryon Oscillation Spectroscopic Survey and a prior on the Hubble constant from the Hubble Space Telescope distance ladder. This leads to constraints on the normalization of the matter power spectrum σ8 = 0.799 ± 0.015 and the matter density parameter ωm = 0.271 ± 0.010 for a flat Λ cold dark matter (ΛCDM) cosmology. For a flat wCDM cosmology, we constrain the dark energy equation-of-state parameter w = -1.02 ± 0.09. We also provide constraints for curved ΛCDM and wCDM cosmologies.We find the intrinsic alignment contamination to be galaxy-type dependent with a significant intrinsic alignment signal found for early-type galaxies, in contrast to the late-type galaxy sample for which the intrinsic alignment signal is found to be consistent with zero. © 2013 The Author. Published by Oxford University Press on behalf of the Royal Astronomical Society.

CFHTLenS: Mapping the large-scale structure with gravitational lensing

Monthly Notices of the Royal Astronomical Society 433:4 (2013) 3373-3388

Authors:

L Van Waerbeke, J Benjamin, T Erben, C Heymans, H Hildebrandt, H Hoekstra, TD Kitching, Y Mellier, L Miller, J Coupon, J Harnois-Déraps, L Fu, M Hudson, M Kilbinger, K Kuijken, B Rowe, T Schrabback, E Semboloni, S Vafaei, E van Uitert, M Velander

Abstract:

We present a quantitative analysis of the largest contiguous maps of projected mass density obtained from gravitational lensing shear. We use data from the 154 deg2 covered by the Canada-France-Hawaii Telescope Lensing Survey (CFHTLenS). Our study is the first attempt to quantitatively characterize the scientific value of lensing maps, which could serve in the future as a complementary approach to the study of the dark universe with gravitational lensing. We show that mass maps contain unique cosmological information beyond that of traditional two-point statistical analysis techniques. Using a series of numerical simulations, we first show how, reproducing the CFHTLenS observing conditions, gravitational lensing inversion provides a reliable estimate of the projected matter distribution of large-scale structure. We validate our analysis by quantifying the robustness of the maps with various statistical estimators. We then apply the same process to the CFHTLenS data. We find that the two-point correlation function of the projected mass is consistent with the cosmological analysis performed on the shear correlation function discussed in the CFHTLenS companion papers. The maps also lead to a significant measurement of the third-order moment of the projected mass, which is in agreement with analytic predictions, and to a marginal detection of the fourth-order moment. Tests for residual systematics are found to be consistent with zero for the statistical estimators we used. A new approach for the comparison of the reconstructed mass map to that predicted from the galaxy distribution reveals the existence of giant voids in the dark matter maps as large as 3° on the sky. Our analysis shows that lensing mass maps are not only consistent with the results obtained by the traditional shear approach, but they also appear promising for new techniques such as peak statistics and the morphological analysis of the projected dark matter distribution. © 2013 The Authors. Published by Oxford University Press on behalf of the Royal Astronomical Society.

Newborn spheroids at high redshift: When and how did the dominant, old stars in today's massive galaxies form?

Monthly Notices of the Royal Astronomical Society 428:2 (2013) 925-934

Authors:

S Kaviraj, S Cohen, RS Ellis, S Peirani, RA Windhorst, RW O'Connell, J Silk, BC Whitmore, NP Hathi, RE Ryan, MA Dopita, JA Frogel, A Dekel

Abstract:

We study ~330 massive (M* > 109.5M⊙), newborn spheroidal galaxies (SGs) around the epoch of peak star formation (1 < z < 3) to explore the high-redshift origin of SGs and gain insight into when and how the old stellar populations that dominate today's Universe formed. The sample is drawn from the Hubble Space Telescope (HST)/WFC3 Early-Release Science programme, which provides deep 10-filter (0.2-1.7 μm) HST imaging over one-third of the GOODS-South field. We find that the star formation episodes that built our SGs likely peaked in the redshift range 2 < z < 5 (with a median of z ~ 3) and have decay time-scales shorter than ~1.5Gyr. Starburst time-scales and ages show no trend with stellar mass in the range 109.5 < M* < 1010.5 M⊙. However, the time-scales show increased scatter towards lower values (<0.3 Gyr) for M* > 1010.5M⊙, and an age trend becomes evident in this mass regime: SGs with M* > 1011.5M⊙ are ~2 Gyr older than their counterparts with M* < 1010.5M⊙. Nevertheless, a smooth downsizing trend with galaxy mass is not observed, and the large scatter in starburst ages indicates that SGs are not a particularly coeval population. Around half of the blue SGs appear not to drive their star formation via major mergers, and those that have experienced a recent major merger show only modest enhancements (~40 per cent) in their specific star formation rates. Our empirical study indicates that processes other than major mergers (e.g. violent disc instability driven by cold streams and/or minor mergers) likely play a dominant role in building SGs, and creating a significant fraction of the old stellar populations that dominate today's Universe. © 2012 The Authors Published by Oxford University Press on behalf of the Royal Astronomical Society.

On the shear estimation bias induced by the spatial variation of colour across galaxy profiles

Monthly Notices of the Royal Astronomical Society 432:3 (2013) 2385-2401

Authors:

E Semboloni, H Hoekstra, Z Huang, VF Cardone, M Cropper, B Joachimi, T Kitching, K Kuijken, M Lombardi, R Maoli, Y Mellier, L Miller, J Rhodes, R Scaramella, T Schrabback, M Velander

Abstract:

The spatial variation of the colour of a galaxy may introduce a bias in the measurement of its shape if the point spread function (PSF) profile depends on wavelength. We study how this bias depends on the properties of the PSF and the galaxies themselves. The bias depends on the scales used to estimate the shape, which may be used to optimize methods to reduce the bias. Here, we develop a general approach to quantify the bias. Although applicable to any weak lensing survey, we focus on the implications for the ESA Euclid mission. Based on our study of synthetic galaxies, we find that the bias is a few times 10-3 for a typical galaxy observed by Euclid. Consequently, it cannot be neglected and needs to be accounted for. We demonstrate how one can do so using spatially resolved observations of galaxies in two filters. We show that Hubble Space Telescope (HST) observations in the F606W and F814W filters allow us to model and reduce the bias by an order of magnitude, sufficient to meet Euclid's scientific requirements. The precision of the correction is ultimately determined by the number of galaxies for which spatially resolved observations in at least two filters are available. We use results from the Millennium simulation to demonstrate that archival HST data will be sufficient for the tomographic cosmic shear analysis with the Euclid data set. © 2013 The Author. Published by Oxford University Press on behalf of the Royal Astronomical Society.