Freshly Printed - allow 4 days lead
Modern Statistics for Modern Biology
A far-reaching course in practical advanced statistics for biologists using R/Bioconductor, data exploration, and simulation.
Susan Holmes (Author), Wolfgang Huber (Author)
9781108705295, Cambridge University Press
Paperback / softback, published 28 February 2019
402 pages
27.9 x 21.7 x 1.6 cm, 1.14 kg
'... the book is extremely readable and engaging, it explains complicated concepts in simple terms, and uses illuminating graphics and examples. Any researcher who wants to learn or teach up-to-date statistics to biologists will find this an essential volume for modern teaching of modern statistics to modern biologists.' Noa Pinter-Wollman, The Quarterly Review of Biology
If you are a biologist and want to get the best out of the powerful methods of modern computational statistics, this is your book. You can visualize and analyze your own data, apply unsupervised and supervised learning, integrate datasets, apply hypothesis testing, and make publication-quality figures using the power of R/Bioconductor and ggplot2. This book will teach you 'cooking from scratch', from raw data to beautiful illuminating output, as you learn to write your own scripts in the R language and to use advanced statistics packages from CRAN and Bioconductor. It covers a broad range of basic and advanced topics important in the analysis of high-throughput biological data, including principal component analysis and multidimensional scaling, clustering, multiple testing, unsupervised and supervised learning, resampling, the pitfalls of experimental design, and power simulations using Monte Carlo, and it even reaches networks, trees, spatial statistics, image data, and microbial ecology. Using a minimum of mathematical notation, it builds understanding from well-chosen examples, simulation, visualization, and above all hands-on interaction with data and code.
Introduction
1. Generative models for discrete data
2. Statistical modeling
3. High-quality graphics in R
4. Mixture models
5. Clustering
6. Testing
7. Multivariate analysis
8. High-throughput count data
9. Multivariate methods for heterogeneous data
10. Networks and trees
11. Image data
12. Supervised learning
13. Design of high-throughput experiments and their analyses
Statistical concordance
Bibliography
Index.
Subject Areas: Data mining [UNF], Data capture & analysis [UNC], Biotechnology [TCB], Probability & statistics [PBT]