synbreed: A Framework for the Analysis of Genomic Prediction Data using R
Valentin Wimmer
Plant Breeding Technische Universit¨ at M¨ unchen G¨
- ttingen, 28/29 March 2011
Plant Breeding
1
synbreed: A Framework for the Analysis of Genomic Prediction Data - - PowerPoint PPT Presentation
synbreed: A Framework for the Analysis of Genomic Prediction Data using R Valentin Wimmer Plant Breeding Technische Universit at M unchen G ottingen, 28/29 March 2011 1 Plant Breeding Outline Day 1 Introduction to the synbreed
Plant Breeding Technische Universit¨ at M¨ unchen G¨
Plant Breeding
1
Plant Breeding
2
Plant Breeding
3
1
2
3
4
5
Plant Breeding
4
Phenotype Genotype
Estimation Set Test Set
Estimate model effects Validate models
Progeny
Cross Validation
Genotype
Predict Genomic Breeding Values
Plant Breeding
5
Plant Breeding
6
Plant Breeding
7
Plant Breeding
8
Plant Breeding
9
Plant Breeding
10
Plant Breeding
11
Plant Breeding
12
Plant Breeding
13
within 1 cM
11 21 32 42 53 120 100 80 60 40 20 chr pos 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 X 1044 948 857 778 770 709 658 615 630 481 706 550 573 590 527 497 535 456 302 319
Plant Breeding
14
Plant Breeding
15
Plant Breeding
16
A B C D E
Plant Breeding
17
◮ additive numerator relationship matrix A (default)
◮ dominance relationship matrix D
◮ kinship matrix K = 1
2A
◮ gametic relationship matrix (dimension 2n ×2n)
Plant Breeding
18
Plant Breeding
19
◮ random imputation by marginal allele distribution ◮ imputation by full-sib family information (only for homozygous inbred lines) ◮ Beagle (Browning and Browning 2009) ◮ Beagle after family ◮ a fixed value Plant Breeding
20
1
2
3
4
5
6
7
Plant Breeding
21
Plant Breeding
22
◮ LDDist : LD decay as scatterplot or stacked histogram ◮ LDMap : LD heatmap using package LDheatmap (Shin et al. 2006)
Plant Breeding
23
50 100 150 0.0 0.2 0.4 0.6 0.8 1.0
chromosome 1
dist [cM] r2 (0,25] (25,50] (50,75] (75,160]
chromosome 1
dist [cM] fraction of SNP pairs 0.0 0.2 0.4 0.6 0.8 1.0 LD (r2) (0,0.05] (0.05,0.1] (0.1,0.2] (0.2,0.3] (0.3,0.5] (0.5,1]
Plant Breeding
24
Plant Breeding
25
Plant Breeding
26
Plant Breeding
27
Plant Breeding
28
Plant Breeding
29
Plant Breeding
30
Plant Breeding
31
Plant Breeding
32
Plant Breeding
33
Plant Breeding
34
Plant Breeding
35
Plant Breeding
36
Plant Breeding
37
Plant Breeding
38
◮ Common standards (S3 or S4) ◮ Common example data sets ◮ Standards for documentation ◮ Quality checks (beyond R CMD check) Plant Breeding
39
Plant Breeding
40
Plant Breeding
41
Browning, B. L., and S. R. Browning, 2009 A unified approach to genotype imputation and haplotype-phase inference for large data sets of trios and unrelated individuals. The American Journal of Human Genetics 846: 210–223. Chambers, J. M., and T. J. Hastie, 1992 Statistical Models in S. Chapman & Hall. ISBN 9780412830402. Clifford, D., and P. McCullagh, 2009 regress: Gaussian Linear Models with Linear Covariance Structure. R package version 1.1-2. Habier, D., R. Fernando, and J. Dekkers, 2007 The impact of genetic relationship information on Genome-Assisted breeding values. Genetics 177: 2389 – 2397. Heffner, E. L., M. E. Sorrells, and J.-L. Jannink, 2009 Genomic selection for crop improvement. Crop Science 49: 1–12. Meuwissen, T. H. E., B. J. Hayes, and M. E. Goddard, 2001 Prediction of total genetic value using Genome-Wide dense marker maps. Genetics 157: 1819–1829. R Development Core Team, 2010 R: A Language and Environment for Statistical Computing. R Foundation for Statistical Computing, Vienna, Austria. Scheet, P., and M. Stephens, 2006 A fast and flexible statistical model for large-scale population genotype data: Applications to inferring missing genotypes and haplotypic phase. The American Journal of Human Genetics 78: 629–644. Shin, J.-H., S. Blay, B. McNeney, and J. Graham, 2006 Ldheatmap: An R function for graphical display of pairwise linkage disequilibria between single nucleotide polymorphisms. J Stat Soft 16: Code Snippet 3. Theußl, S., and A. Zeileis, 2009 Collaborative Software Development Using R-Forge. The R Journal 1: 9–14. Valdar, W., L. Solberg, D. Gauguier, W. Cookson, J. Rawlins, et al., 2006 Genetic and environmental effects
Plant Breeding
42
This research was funded by the German Federal Ministry of Education and Research (BMBF) within the AgroClustEr “Synbreed - Synergistic plant and animal breeding” (Grant ID 0315528A).
Plant Breeding
43