[Back to mitochondrial introns by organism] [Back to home page]
Information of Marchantia polymorpha SSU.I1 intron (Format of information for each intron)
[intron with flanking sequence]
The boundaries of the intron are marked as red and the ORF is marked as blue.
For many organellar introns, the ORF is translated in frame with the upstream
exon ORF. The precursor protein is then processed to a mature form that cuts
off all of the exon-encoded and some of the intron-encoded polypeptide. This
situation is indicated here by blue-coded ORF that extends to the start of
the intron rather than by a translation start approximately 500 bp after the
start of the intron.
Note: This intron sequence occurs on the reverse complement of the original
GenBank sequence. Therefore, all sequences below are reverse complements of
the GenBank sequence.
3' end
gt gcgccatttc
29761 acgatatatt aggcccgcgc ccccagatca aaatgatcag gcaacaggcg ccgtgcgagt
29821 tgcaaatcac gcacgtaatg ctagcagctc atccgacgct ataaccaaaa gcaatccggc
29881 gaaagcctag gggagtatag catgctgcta aacggagaca cggatacaag cacatatatc
29941 gaggccgtct tcggagcaac cagtcgttca taggttctat ctctgagtct actgcgggtc
30001 tgccttcccg aagcagtcaa actgcttaag agtagggggt ctcctcagag tttcggggga
30061 tcccataaag caaccgctca tcagaaacac caaaggtgtg caatgactcg aacctaaaaa
30121 cgacactgac ctaaatatcg gaacttgggg tccctggtat tcgccgccgg ggacggatgc
30181 ctgatagtag cttaacgcgc taagcaggca agtatcacta ctccatcaag aacattctgt
30241 accctcccgg agcatcgtgt ctcttatgat agaaatcaag cctccggaga taggaggagt
30301 ttataagttc caaaaagaaa aaaagaagag ttatcatgac aaaatgcaat tacgagcaac
30361 tactagaccc cgagatattt aggctagctt acgagctaaa gaaatcgaaa tcaggcaata
30421 tgaaacctgg tgcggataaa gaaactctcg acggtttctc ccaagcctat gttgagaagg
30481 tcgtccgtca actaaaagac gaatcatttc aattccgtcc gtcacgaaga gaattcattc
30541 ctaaagcaga cggcaagctt cgttccctgg gcataccatc acctagagac aagatagtac
30601 aagaggtcat gaggaggatc cttgaacctg tatttgaacc gcgattcctg gattcgtctc
30661 acggatttag acctcatcga tcaccgcaca cggctctacg acaaatccgt cgatggacag
30721 gcacctcctg gatgatcgaa ggagacatca aaggatactt cgacaacatc gatcatcacc
30781 tactcgcggg attcatagca gagttggtaa aggaccaacg gcttctcgcg ctttattgga
30841 aattggtacg cgctggctat gtaaatcaag gcaaagcaga gccacacttg ctaacaggag
30901 tacctcaagg aaggatacta tcgcctctgc tttccaacat ctacctacac cagttcgatc
30961 tattcatgga ggaaatcaaa gtcaaatata caacgaccgg tgcgctttcc aaaaacaacc
31021 cgatttactt gaaggcgcgg aataaatact acaaacttgt gaaatcatta aaggcttctt
31081 ccgccgaaat catccgagcg agacgcgata tgttgaaaat gacttacggg attcaaacag
31141 gttctagggt gcgttatgtt agatacgcgg acgattgggt gatcggggtc acgggtccaa
31201 aagccctggc cgtacaaatc aaagaagagg tctctacctt cctccaagaa aaactaaaac
31261 tttcgcttca ggccgaaaaa acacgtatta ccaatctatc aagaagcgaa gctttattcc
31321 taggaacctt aataagcata acaactcgta aatacgtgca aagccagaag gtgggcgggg
31381 ggcacaggcg agcctccctc ggtagaatac gcctgtgcat ccccatcgat atccttatcg
31441 ggaagctctc acaaatgggg gcgtgcgacg aaaagggaac gcccaaagcg gtgaccaaat
31501 ggatctttct aaacgtggga gaaataatca acaaatatat ggctgtgttc cggggatact
31561 acaactacta ctcattcgca gacgatatcc atcaccttct ccaaataata tacatactaa
31621 gatactcggc tatcaacacg gtcgcccgta aactggggct taacacagcc aaagtaataa
31681 aacgcttcgg cgtggaccta atcttccggg accacacgaa tgagatcaag cataagctca
31741 atttcccgcg atccctacct aataagcgca tgaacttcgc cttgagtccg ccttcggacc
31801 ctagagttct atttgataca agctgcgctc gcattcagtg ttgaacgacc tgtgtcaagt
31861 atgcggctcc tacgaacagg tggagatgca ccacataaaa agactaagcc gcgataacgc
31921 ggtttcacta ggttgatggt ctaattgaat cgtaaacaac tcccggtctg ccgtaactgt
31981 cataggaaga ttcatagagg ggaatacaac ggaatgaggc tggaacgatt actcgcaagg
32041 aacaaaagac atcttgatta gttagtgaca gggagagcca tatgccggga aactggcacg
32101 tatggttcgg agggaggtat atgtcttcca catgggagat gggtaccgac cctac
Genbank entry, intron is marked as red
g ccgcggtaag acgggggggg caagtgttat
29281 tcggaatgac taggcgtaaa gggcacgtag gcggtgaatc cagttgaaag tgaaagtcgc
29341 cagctcaact ggcggaatgc tttcaaaacc aatttactag agtaaggcat agaggaaagc
29401 ggaatttcgt gtgtagcgat aaaatgcaaa aatatacgaa ggaacgccaa aagcgaaggc
29461 agctttctgg ttctttaact gacgctaaag tgcgaaagca tggggagcaa acaggattag
29521 ataccctggt agtccatgct gtaaacgatg agtgttcgtt cttggtctac tgatatcgca
29581 ctcttttggg ctgacttaag ctcggcttaa tgcttaaatt actgcaaagg tagtgtgact
29641 cgattgtttt cttcaagttc caacaatcgt gaaaaatatg tgatgatcag gggctgagct
29701 aacgcgttaa acactccgcc tggggagtac ggtcgcaagg ctgaaactgt gcgccatttc
29761 acgatatatt aggcccgcgc ccccagatca aaatgatcag gcaacaggcg ccgtgcgagt
29821 tgcaaatcac gcacgtaatg ctagcagctc atccgacgct ataaccaaaa gcaatccggc
29881 gaaagcctag gggagtatag catgctgcta aacggagaca cggatacaag cacatatatc
29941 gaggccgtct tcggagcaac cagtcgttca taggttctat ctctgagtct actgcgggtc
30001 tgccttcccg aagcagtcaa actgcttaag agtagggggt ctcctcagag tttcggggga
30061 tcccataaag caaccgctca tcagaaacac caaaggtgtg caatgactcg aacctaaaaa
30121 cgacactgac ctaaatatcg gaacttgggg tccctggtat tcgccgccgg ggacggatgc
30181 ctgatagtag cttaacgcgc taagcaggca agtatcacta ctccatcaag aacattctgt
30241 accctcccgg agcatcgtgt ctcttatgat agaaatcaag cctccggaga taggaggagt
30301 ttataagttc caaaaagaaa aaaagaagag ttatcatgac aaaatgcaat tacgagcaac
30361 tactagaccc cgagatattt aggctagctt acgagctaaa gaaatcgaaa tcaggcaata
30421 tgaaacctgg tgcggataaa gaaactctcg acggtttctc ccaagcctat gttgagaagg
30481 tcgtccgtca actaaaagac gaatcatttc aattccgtcc gtcacgaaga gaattcattc
30541 ctaaagcaga cggcaagctt cgttccctgg gcataccatc acctagagac aagatagtac
30601 aagaggtcat gaggaggatc cttgaacctg tatttgaacc gcgattcctg gattcgtctc
30661 acggatttag acctcatcga tcaccgcaca cggctctacg acaaatccgt cgatggacag
30721 gcacctcctg gatgatcgaa ggagacatca aaggatactt cgacaacatc gatcatcacc
30781 tactcgcggg attcatagca gagttggtaa aggaccaacg gcttctcgcg ctttattgga
30841 aattggtacg cgctggctat gtaaatcaag gcaaagcaga gccacacttg ctaacaggag
30901 tacctcaagg aaggatacta tcgcctctgc tttccaacat ctacctacac cagttcgatc
30961 tattcatgga ggaaatcaaa gtcaaatata caacgaccgg tgcgctttcc aaaaacaacc
31021 cgatttactt gaaggcgcgg aataaatact acaaacttgt gaaatcatta aaggcttctt
31081 ccgccgaaat catccgagcg agacgcgata tgttgaaaat gacttacggg attcaaacag
31141 gttctagggt gcgttatgtt agatacgcgg acgattgggt gatcggggtc acgggtccaa
31201 aagccctggc cgtacaaatc aaagaagagg tctctacctt cctccaagaa aaactaaaac
31261 tttcgcttca ggccgaaaaa acacgtatta ccaatctatc aagaagcgaa gctttattcc
31321 taggaacctt aataagcata acaactcgta aatacgtgca aagccagaag gtgggcgggg
31381 ggcacaggcg agcctccctc ggtagaatac gcctgtgcat ccccatcgat atccttatcg
31441 ggaagctctc acaaatgggg gcgtgcgacg aaaagggaac gcccaaagcg gtgaccaaat
31501 ggatctttct aaacgtggga gaaataatca acaaatatat ggctgtgttc cggggatact
31561 acaactacta ctcattcgca gacgatatcc atcaccttct ccaaataata tacatactaa
31621 gatactcggc tatcaacacg gtcgcccgta aactggggct taacacagcc aaagtaataa
31681 aacgcttcgg cgtggaccta atcttccggg accacacgaa tgagatcaag cataagctca
31741 atttcccgcg atccctacct aataagcgca tgaacttcgc cttgagtccg ccttcggacc
31801 ctagagttct atttgataca agctgcgctc gcattcagtg ttgaacgacc tgtgtcaagt
31861 atgcggctcc tacgaacagg tggagatgca ccacataaaa agactaagcc gcgataacgc
31921 ggtttcacta ggttgatggt ctaattgaat cgtaaacaac tcccggtctg ccgtaactgt
31981 cataggaaga ttcatagagg ggaatacaac ggaatgaggc tggaacgatt actcgcaagg
32041 aacaaaagac atcttgatta gttagtgaca gggagagcca tatgccggga aactggcacg
32101 tatggttcgg agggaggtat atgtcttcca catgggagat gggtaccgac cctaccaaag
32161 gaattgacgg gggcctgcac aagcggtgga gcatgtggtt taattcgatt caacgcgcaa
32221 aaccttacca gcccttgaca tatgaataag tgtgcttgtc cttaacggga tggtacgaaa
32281 attcatacag gtgttgcatg gctgtcgtca gctcgtgtct tgagacgttg ggttaagtcc
32341 tataacgagc gcaacccttg ttttgtgttg ctaagacatg ctttggttca atccttgacc
32401 actggagact gacgaagact acgccgtgaa aatggaggat accgacgagc taagtttttg
32461 caaacgccgt gtggctcatt ctgaaaagaa tatgaccata caaagttttc atacggtatt
32521 ctccgacgaa cgaagctctc agagcattgt acacttcaca ctgaggaact cgatcttagt
32581 caaaatgatt gacactccaa gccatgtgct gcactcacaa aaaactgcca gtgatatact
32641 ggaggaaggt gggga
573 a.a.
Note: Published size is 501 amino acids; two frameshifts and a termination readthrough add 72
amino acids with similarity to the Zn domain
MTKCNYEQLLDPEIFRLAYELKKSKSGNMKPGADKETLDGFSQAYVEKVVRQLKDESF
QFRPSRREFIPKADGKLRSLGIPSPRDKIVQEVMRRILEPVFEPRFLDSSHGFRPHRS
PHTALRQIRRWTGTSWMIEGDIKGYFDNIDHHLLAGFIAELVKDQRLLALYWKLVRAG
YVNQGKAEPHLLTGVPQGRILSPLLSNIYLHQFDLFMEEIKVKYTTTGALSKNNPIYL
KARNKYYKLVKSLKASSAEIIRARRDMLKMTYGIQTGSRVRYVRYADDWVIGVTGPKA
LAVQIKEEVSTFLQEKLKLSLQAEKTRITNLSRSEALFLGTLISITTRKYVQSQKVGG
GHRRASLGRIRLCIPIDILIGKLSQMGACDEKGTPKAVTKWIFLNVGEIINKYMAVFR
GYYNYYSFADDIHHLLQIIYILRYSAINTVARKLGLNTAKVIKRFGVDLIFRDHTNEI
KHKLNFPRSLPNKRMNFALSPPSDPRVLFDTSCARIQCLNDLCQVCGSYEQVEMHHIK
RLSRDRGFTRLMVLNRKQLPVCRNCHRKIHRGEYNGMRLERLLARNKRKLD