Deterministic Motif Mining in Protein Databases
Chapter in Data Warehousing and Mining, IGI Global (2008) 1722-1746
Clover-measuring the CMB B-mode polarisation
Proceedings of the Eighteenth International Symposium on Space Terahertz Technology 2007, ISSTT 2007 (2007) 238-243
Abstract:
We describe the objectives, design and predicted performance of Clover, a fully-funded, UK-led experiment to measure the B-mode polarisation of the Cosmic Microwave Background (CMB). Three individual telescopes will operate at 97, 150 and 225 GHz, each populated by up to 256 horns. The detectors, TES bolometers, are limited by unavoidable photon noise, and coupled to an optical design which gives very low systematic errors, particularly in cross-polarisation. The telescopes will sit on three-axis mounts on a site in the Atacama Desert. The angular resolution of around 8 ́ and sky coverage of around 1000 deg2 provide multipole coverage of 20<ℓ<1000. Combined with the high sensitivity, this should allow the B-mode signal to be measured (or constrained) down to a level corresponding to a tensor-to-scalar ratio of r = 0.01, providing the emission from polarised foregrounds can be subtracted. This in turn will allow constraints to be placed on the energy scale of inflation, providing an unprecedented insight into the early history of the Universe.Evaluating deterministic motif significance measures in protein databases.
Algorithms for molecular biology : AMB 2 (2007) 16
Abstract:
Background
Assessing the outcome of motif mining algorithms is an essential task, as the number of reported motifs can be very large. Significance measures play a central role in automatically ranking those motifs, and therefore alleviating the analysis work. Spotting the most interesting and relevant motifs is then dependent on the choice of the right measures. The combined use of several measures may provide more robust results. However caution has to be taken in order to avoid spurious evaluations.Results
From the set of conducted experiments, it was verified that several of the selected significance measures show a very similar behavior in a wide range of situations therefore providing redundant information. Some measures have proved to be more appropriate to rank highly conserved motifs, while others are more appropriate for weakly conserved ones. Support appears as a very important feature to be considered for correct motif ranking. We observed that not all the measures are suitable for situations with poorly balanced class information, like for instance, when positive data is significantly less than negative data. Finally, a visualization scheme was proposed that, when several measures are applied, enables an easy identification of high scoring motifs.Conclusion
In this work we have surveyed and categorized 14 significance measures for pattern evaluation. Their ability to rank three types of deterministic motifs was evaluated. Measures were applied in different testing conditions, where relations were identified. This study provides some pertinent insights on the choice of the right set of significance measures for the evaluation of deterministic motifs extracted from protein databases.On the growth of structure in theories with a dynamical preferred frame
(2007)
On the growth of structure in theories with a dynamical preferred frame
ArXiv 0711.0520 (2007)