SLIDE 4 S.Will, 18.417, Fall 2011
Whole Genome Realignment for ncRNA Prediction
UU U G A A _ _ _ _ _ _ C G U U U C U U C G A U U C A U C A A G _ _ _ G U U U A A __ U G A A U C U A U G G A G C G A G U A A U G G G C U U G A A G C U G U _ G U U U A U C U G G U C A C A U G U A U U G A _ _ _ _ _ _ _ _ _ _ A U G G C G U A U G U U C C G U A A U A U A C C G U A _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ U U U G A G G G _ U U U C UUGGC U U C A U U A A G G U U U A AUG A A U C U A U G G A G CG A G UA A UG C G C U U GA A G C U G G G U G U U U C U G G U C A C A U G U A U _ _ _ U G A _ _ A G U U A U A C G G C G G G C
whole genome alignment
1
stable loci
realign based on sequence and structure
realigned loci
2 3
slice and filter by thermodynamic stability
estimate ncRNA likelihood
predictions (q-values)
unstable conserved structure of original locus alignment stable conserved structure
............................................................ DroMel_CAF1
- ----------------------TTTGAGTG-TTTCTTGTGTTCATTAAGGTTTAATGAA 36
DroSim_CAF1
- ----------------------TTTTAGTG-TTTCTTGGGTTCATTAAGGTTTAATGAA 36
DroYak_CAF1
- ----------------------TTTGATGG-TTACTTTGCTTCATCAAGGTTTAATGGT 36
DroEre_CAF1
- ----------------------TTTGATGG-TTTCTTTGCTTCATCAAGTTTTAATGAT 36
DroPse_CAF1 GGGCCATGGCCTCCTCTGATCGATTAG-GGGTTTTCTTGCTTGATTTATCGGTTGATGGA 59 DroPer_CAF1 GGGCCATGGCCTCCTCTGATCGATTAG-GGGTTTTCTTGCTTGATTTATCGGTTGATGGA 59 .........10........20........30........40........50......... ..((((..((.((....((((........))))....)).))..))))............ DroMel_CAF1 TCTATGGAGCGAGTAATGCGCTTGAAGCTGTGTTTATCTGGTCACATGTAT---TGA--A 91 DroSim_CAF1 TCTATGGAGCGAGTACTGGGCTTGAAGCTGGGCTTATCTGGTCACATGTAT---TGA--A 91 DroYak_CAF1 TCTATGGAGCGAGTATTGGGCTTGAAGCTGTGTGTTTCTGGTCGCATGTAT---TGA--A 91 DroEre_CAF1 TCTATGAAGCGAGTATTGCGCTTGAAGCTGTGTGTTTCTGGTCACATGTAT---TGA--A 91 DroPse_CAF1 GCAATGGGGTG----ATGCTAGTGA--GTGGGTGATTCTGGCCATGGCCATAGGTGAATA 113 DroPer_CAF1 GCAATGGGGTG----ATGCTAGTGA--GTGGGTGATTCTGGCCATGGCCATAGGTGAATA 113 .........70........80........90........100.......110........
_ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ U U U G A G G G _ U U U C UUGGC U U C A U U A A G G U U U A AUG A A U C U A U G G A G CG A G UA A UG C G C U U GA A G C U G G G U G U U U C U G G U C A C A U G U A U _ _ _ U G A _ _ A G U U A U A C G G C G G G C
structure- based realignment − − −− →
.((((.......((((((.((((((((((((......))))..)))))))).)))))).. DroMel_CAF1 UUUGAG------UGUUUCUUGUGUUCAUUAAG---GUUUAA--UGAAUCUAUGGAGCGAG 49 DroSim_CAF1 UUUUAG------UGUUUCUUGGGUUCAUUAAG---GUUUAA--UGAAUCUAUGGAGCGAG 49 DroYak_CAF1 UUUGAU------GGUUACUUUGCUUCAUCAAG---GUUUAA--UGGUUCUAUGGAGCGAG 49 DroEre_CAF1 UUUGAU------GGUUUCUUUGCUUCAUCAAG---UUUUAA--UGAUUCUAUGAAGCGAG 49 DroPse_CAF1 GGGCCAUGGCCUCCUCUGAUCGAUUAGGGGUUUUCUUGCUUGAUUUAUCGGUUGAUGGAG 60 DroPer_CAF1 GGGCCAUGGCCUCCUCUGAUCGAUUAGGGGUUUUCUUGCUUGAUUUAUCGGUUGAUGGAG 60 .........10........20........30........40........50......... .....((((.(((((....)))..)).)))).......))))........... DroMel_CAF1 UAAUGCGCUUGAAGCUGU-GUUUAUCUGGUCACAUGUAUUGA----------A 91 DroSim_CAF1 UACUGGGCUUGAAGCUGG-GCUUAUCUGGUCACAUGUAUUGA----------A 91 DroYak_CAF1 UAUUGGGCUUGAAGCUGU-GUGUUUCUGGUCGCAUGUAUUGA----------A 91 DroEre_CAF1 UAUUGCGCUUGAAGCUGU-GUGUUUCUGGUCACAUGUAUUGA----------A 91 DroPse_CAF1 CAAUGGGGUGAUGCUAGUGAGUGGGUGAUUCUGGCCAUGGCCAUAGGUGAAUA 113 DroPer_CAF1 CAAUGGGGUGAUGCUAGUGAGUGGGUGAUUCUGGCCAUGGCCAUAGGUGAAUA 113 .........70........80........90........100.......110.
UU U G A A _ _ _ _ _ _ C G U U U C U U C G A U U C A U C A A G _ _ _ G U U U A A _ _ U G A A U C U A U G G A G C G A G U A A U G G G C U U G A A G C U G U _ G U U U A U C U G G U C A C A U G U A U U G A _ _ _ _ _ _ _ _ _ _ A U G G C G U A U G U U C C G U A A U A U A C C G U A