Leif ¡ Leif ¡Väremo mo
leif.varemo@scilifelab.se
Bioinforma6cs ¡Long-‑term ¡Support ¡(WABI) Systems ¡Biology ¡Facility ¡@ ¡Chalmers
Gene-set analysis and data integra/on Leif Leif Vremo mo - - PowerPoint PPT Presentation
Gene-set analysis and data integra/on Leif Leif Vremo mo leif.varemo@scilifelab.se Bioinforma6cs Long-term Support (WABI) Systems Biology Facility @ Chalmers Outline Gene-set
Bioinforma6cs ¡Long-‑term ¡Support ¡(WABI) Systems ¡Biology ¡Facility ¡@ ¡Chalmers
2
Immune ¡response Pyruvate
Gene-‑level ¡data Gene-‑set ¡data ¡(results)
PPARG
Gen Gene-‑s e-‑set ¡ ¡analy analysis sis GO-‑terms Pathways Chromosomal ¡loca6ons Transcrip6on ¡factors Histone ¡modifica6ons Diseases etc… Samples Genes
We ¡will ¡focus ¡on ¡transcriptomics ¡and ¡differen3al ¡expression ¡analysis ¡ However, ¡GSA ¡can ¡in ¡principle ¡be ¡used ¡on ¡all ¡types ¡of ¡genome-‑wide ¡data. ¡
3
4
5
Images sources: Garg et al. Sci Rep 5 (2015); Gutteridge et al. PLoS ONE 8 (2013); Han et al. BMC Genomics 15 (2014)
6
7
GO-‑terms Pathways Chromosomal ¡loca6ons Transcrip6on ¡factors Histone ¡modifica6ons Diseases Metabolites etc…
8
9
10
11
“Hallmark gene sets summarize and represent specific well-defined biological states or processes and display coherent expression. These gene sets were generated by a computational methodology based on identifying gene set overlaps and retaining genes that display coordinate
and provide a better delineated biological space for GSEA.”
12
http://amp.pharm.mssm.edu/Enrichr/#stats http://software.broadinstitute.org/gsea/msigdb/index.jsp
13
14
http://www.ensembl.org/biomart/martview
One way to map different gene IDs to each other, or to assemble a gene-set collection with the gene IDs used by your data
See also: https://david.ncifcrf.gov/content.jsp?file=conversion.html
15
16
http://omictools.com/gene-set-analysis-category
https://bioconductor.org/packages/release/BiocViews.html#___GeneSetEnrichment
17
18
https://david.ncifcrf.gov/home.jsp http://amp.pharm.mssm.edu/Enrichr/
19
20
Samples Genes
Gene-set 1 Gene-set 2
Permute the gene-labels (or sample labels) and redo the calculations over and over again (e.g. 10,000 times)!
21
22
Mootha et al Nature Genetics, 2003; Subramanian PNAS 2005
23
Disclaimer: The author of this presentation is the developer of piano 24
25
Disclaimer: The author of this presentation is the developer of piano 26
27
Metabolite-reaction-gene relations
Reaction Enzymes/genes Products Substrates
m m m m
Gene-sets (metabolites)
Samples Genes
Disclaimer: The author of this presentation is the developer of Kiwi 28
29
“Heavily annotated (“multifunctional”) genes are likely to appear in genomics study results and drive the generation of biologically nonspecific enrichment results as well as highly fragile significances”
Ballouz et al. (Oct 2016) NAR. doi:10.1093/nar/gkw957
Uniqueness constraint “To assess uniqueness, we compared the output of each algorithm when given the experimental input hit lists to that of the algorithm when the top 100 multifunctional genes was the input“
30
Robustness constraint “Assessed robustness by removing the 5%
functional genes from the experimental hit lists”
Ballouz et al. (Oct 2016) NAR. doi:10.1093/nar/gkw957
31