[Back to mitochondrial introns by organism] [Back to home page]

Information of Marchantia polymorpha SSU.I1 intron   (Format of information for each intron)

[intron sequence]

[intron with flanking sequence]

[ORF sequence]

[secondary structure]

[intron sequence]

 

The boundaries of the intron are marked as red and the ORF is marked as blue.

For many organellar introns, the ORF is translated in frame with the upstream

exon ORF. The precursor protein is then processed to a mature form that cuts 

off all of the exon-encoded and some of the intron-encoded polypeptide. This 

situation is indicated here by blue-coded ORF that extends to the start of 

the intron rather than by a translation start approximately 500 bp after the 

start of the intron.

 

Note:  This intron sequence occurs on the reverse complement of the original

GenBank sequence.  Therefore, all sequences below are reverse complements of

the GenBank sequence.

3' end

                    

                                                          gt gcgccatttc 

29761 acgatatatt aggcccgcgc ccccagatca aaatgatcag gcaacaggcg ccgtgcgagt 

29821 tgcaaatcac gcacgtaatg ctagcagctc atccgacgct ataaccaaaa gcaatccggc 

29881 gaaagcctag gggagtatag catgctgcta aacggagaca cggatacaag cacatatatc 

29941 gaggccgtct tcggagcaac cagtcgttca taggttctat ctctgagtct actgcgggtc 

30001 tgccttcccg aagcagtcaa actgcttaag agtagggggt ctcctcagag tttcggggga 

30061 tcccataaag caaccgctca tcagaaacac caaaggtgtg caatgactcg aacctaaaaa 

30121 cgacactgac ctaaatatcg gaacttgggg tccctggtat tcgccgccgg ggacggatgc 

30181 ctgatagtag cttaacgcgc taagcaggca agtatcacta ctccatcaag aacattctgt 

30241 accctcccgg agcatcgtgt ctcttatgat agaaatcaag cctccggaga taggaggagt 

30301 ttataagttc caaaaagaaa aaaagaagag ttatcatgac aaaatgcaat tacgagcaac 

30361 tactagaccc cgagatattt aggctagctt acgagctaaa gaaatcgaaa tcaggcaata 

30421 tgaaacctgg tgcggataaa gaaactctcg acggtttctc ccaagcctat gttgagaagg 

30481 tcgtccgtca actaaaagac gaatcatttc aattccgtcc gtcacgaaga gaattcattc 

30541 ctaaagcaga cggcaagctt cgttccctgg gcataccatc acctagagac aagatagtac 

30601 aagaggtcat gaggaggatc cttgaacctg tatttgaacc gcgattcctg gattcgtctc 

30661 acggatttag acctcatcga tcaccgcaca cggctctacg acaaatccgt cgatggacag 

30721 gcacctcctg gatgatcgaa ggagacatca aaggatactt cgacaacatc gatcatcacc 

30781 tactcgcggg attcatagca gagttggtaa aggaccaacg gcttctcgcg ctttattgga 

30841 aattggtacg cgctggctat gtaaatcaag gcaaagcaga gccacacttg ctaacaggag 

30901 tacctcaagg aaggatacta tcgcctctgc tttccaacat ctacctacac cagttcgatc 

30961 tattcatgga ggaaatcaaa gtcaaatata caacgaccgg tgcgctttcc aaaaacaacc 

31021 cgatttactt gaaggcgcgg aataaatact acaaacttgt gaaatcatta aaggcttctt 

31081 ccgccgaaat catccgagcg agacgcgata tgttgaaaat gacttacggg attcaaacag 

31141 gttctagggt gcgttatgtt agatacgcgg acgattgggt gatcggggtc acgggtccaa 

31201 aagccctggc cgtacaaatc aaagaagagg tctctacctt cctccaagaa aaactaaaac 

31261 tttcgcttca ggccgaaaaa acacgtatta ccaatctatc aagaagcgaa gctttattcc 

31321 taggaacctt aataagcata acaactcgta aatacgtgca aagccagaag gtgggcgggg 

31381 ggcacaggcg agcctccctc ggtagaatac gcctgtgcat ccccatcgat atccttatcg 

31441 ggaagctctc acaaatgggg gcgtgcgacg aaaagggaac gcccaaagcg gtgaccaaat 

31501 ggatctttct aaacgtggga gaaataatca acaaatatat ggctgtgttc cggggatact 

31561 acaactacta ctcattcgca gacgatatcc atcaccttct ccaaataata tacatactaa 

31621 gatactcggc tatcaacacg gtcgcccgta aactggggct taacacagcc aaagtaataa 

31681 aacgcttcgg cgtggaccta atcttccggg accacacgaa tgagatcaag cataagctca 

31741 atttcccgcg atccctacct aataagcgca tgaacttcgc cttgagtccg ccttcggacc 

31801 ctagagttct atttgataca agctgcgctc gcattcagtg ttgaacgacc tgtgtcaagt 

31861 atgcggctcc tacgaacagg tggagatgca ccacataaaa agactaagcc gcgataacgc 

31921 ggtttcacta ggttgatggt ctaattgaat cgtaaacaac tcccggtctg ccgtaactgt 

31981 cataggaaga ttcatagagg ggaatacaac ggaatgaggc tggaacgatt actcgcaagg 

32041 aacaaaagac atcttgatta gttagtgaca gggagagcca tatgccggga aactggcacg 

32101 tatggttcgg agggaggtat atgtcttcca catgggagat gggtaccgac cctac

 

                                                                   

[top]


 

 

[intron& flanking sequence]

 

Genbank entry, intron is marked as red

 

                                     g ccgcggtaag acgggggggg caagtgttat 

29281 tcggaatgac taggcgtaaa gggcacgtag gcggtgaatc cagttgaaag tgaaagtcgc 

29341 cagctcaact ggcggaatgc tttcaaaacc aatttactag agtaaggcat agaggaaagc 

29401 ggaatttcgt gtgtagcgat aaaatgcaaa aatatacgaa ggaacgccaa aagcgaaggc 

29461 agctttctgg ttctttaact gacgctaaag tgcgaaagca tggggagcaa acaggattag 

29521 ataccctggt agtccatgct gtaaacgatg agtgttcgtt cttggtctac tgatatcgca 

29581 ctcttttggg ctgacttaag ctcggcttaa tgcttaaatt actgcaaagg tagtgtgact 

29641 cgattgtttt cttcaagttc caacaatcgt gaaaaatatg tgatgatcag gggctgagct 

29701 aacgcgttaa acactccgcc tggggagtac ggtcgcaagg ctgaaactgt gcgccatttc 

29761 acgatatatt aggcccgcgc ccccagatca aaatgatcag gcaacaggcg ccgtgcgagt 

29821 tgcaaatcac gcacgtaatg ctagcagctc atccgacgct ataaccaaaa gcaatccggc 

29881 gaaagcctag gggagtatag catgctgcta aacggagaca cggatacaag cacatatatc 

29941 gaggccgtct tcggagcaac cagtcgttca taggttctat ctctgagtct actgcgggtc 

30001 tgccttcccg aagcagtcaa actgcttaag agtagggggt ctcctcagag tttcggggga 

30061 tcccataaag caaccgctca tcagaaacac caaaggtgtg caatgactcg aacctaaaaa 

30121 cgacactgac ctaaatatcg gaacttgggg tccctggtat tcgccgccgg ggacggatgc 

30181 ctgatagtag cttaacgcgc taagcaggca agtatcacta ctccatcaag aacattctgt 

30241 accctcccgg agcatcgtgt ctcttatgat agaaatcaag cctccggaga taggaggagt 

30301 ttataagttc caaaaagaaa aaaagaagag ttatcatgac aaaatgcaat tacgagcaac 

30361 tactagaccc cgagatattt aggctagctt acgagctaaa gaaatcgaaa tcaggcaata 

30421 tgaaacctgg tgcggataaa gaaactctcg acggtttctc ccaagcctat gttgagaagg 

30481 tcgtccgtca actaaaagac gaatcatttc aattccgtcc gtcacgaaga gaattcattc 

30541 ctaaagcaga cggcaagctt cgttccctgg gcataccatc acctagagac aagatagtac 

30601 aagaggtcat gaggaggatc cttgaacctg tatttgaacc gcgattcctg gattcgtctc 

30661 acggatttag acctcatcga tcaccgcaca cggctctacg acaaatccgt cgatggacag 

30721 gcacctcctg gatgatcgaa ggagacatca aaggatactt cgacaacatc gatcatcacc 

30781 tactcgcggg attcatagca gagttggtaa aggaccaacg gcttctcgcg ctttattgga 

30841 aattggtacg cgctggctat gtaaatcaag gcaaagcaga gccacacttg ctaacaggag 

30901 tacctcaagg aaggatacta tcgcctctgc tttccaacat ctacctacac cagttcgatc 

30961 tattcatgga ggaaatcaaa gtcaaatata caacgaccgg tgcgctttcc aaaaacaacc 

31021 cgatttactt gaaggcgcgg aataaatact acaaacttgt gaaatcatta aaggcttctt 

31081 ccgccgaaat catccgagcg agacgcgata tgttgaaaat gacttacggg attcaaacag 

31141 gttctagggt gcgttatgtt agatacgcgg acgattgggt gatcggggtc acgggtccaa 

31201 aagccctggc cgtacaaatc aaagaagagg tctctacctt cctccaagaa aaactaaaac 

31261 tttcgcttca ggccgaaaaa acacgtatta ccaatctatc aagaagcgaa gctttattcc 

31321 taggaacctt aataagcata acaactcgta aatacgtgca aagccagaag gtgggcgggg 

31381 ggcacaggcg agcctccctc ggtagaatac gcctgtgcat ccccatcgat atccttatcg 

31441 ggaagctctc acaaatgggg gcgtgcgacg aaaagggaac gcccaaagcg gtgaccaaat 

31501 ggatctttct aaacgtggga gaaataatca acaaatatat ggctgtgttc cggggatact 

31561 acaactacta ctcattcgca gacgatatcc atcaccttct ccaaataata tacatactaa 

31621 gatactcggc tatcaacacg gtcgcccgta aactggggct taacacagcc aaagtaataa 

31681 aacgcttcgg cgtggaccta atcttccggg accacacgaa tgagatcaag cataagctca 

31741 atttcccgcg atccctacct aataagcgca tgaacttcgc cttgagtccg ccttcggacc 

31801 ctagagttct atttgataca agctgcgctc gcattcagtg ttgaacgacc tgtgtcaagt 

31861 atgcggctcc tacgaacagg tggagatgca ccacataaaa agactaagcc gcgataacgc 

31921 ggtttcacta ggttgatggt ctaattgaat cgtaaacaac tcccggtctg ccgtaactgt 

31981 cataggaaga ttcatagagg ggaatacaac ggaatgaggc tggaacgatt actcgcaagg 

32041 aacaaaagac atcttgatta gttagtgaca gggagagcca tatgccggga aactggcacg 

32101 tatggttcgg agggaggtat atgtcttcca catgggagat gggtaccgac cctaccaaag 

32161 gaattgacgg gggcctgcac aagcggtgga gcatgtggtt taattcgatt caacgcgcaa 

32221 aaccttacca gcccttgaca tatgaataag tgtgcttgtc cttaacggga tggtacgaaa 

32281 attcatacag gtgttgcatg gctgtcgtca gctcgtgtct tgagacgttg ggttaagtcc 

32341 tataacgagc gcaacccttg ttttgtgttg ctaagacatg ctttggttca atccttgacc 

32401 actggagact gacgaagact acgccgtgaa aatggaggat accgacgagc taagtttttg 

32461 caaacgccgt gtggctcatt ctgaaaagaa tatgaccata caaagttttc atacggtatt 

32521 ctccgacgaa cgaagctctc agagcattgt acacttcaca ctgaggaact cgatcttagt 

32581 caaaatgatt gacactccaa gccatgtgct gcactcacaa aaaactgcca gtgatatact 

32641 ggaggaaggt gggga                                                               

 

[top]


[ORF sequence]

 

573 a.a.

Note: Published size is 501 amino acids; two frameshifts and a termination readthrough add 72

amino acids with similarity to the Zn domain

MTKCNYEQLLDPEIFRLAYELKKSKSGNMKPGADKETLDGFSQAYVEKVVRQLKDESF

QFRPSRREFIPKADGKLRSLGIPSPRDKIVQEVMRRILEPVFEPRFLDSSHGFRPHRS

PHTALRQIRRWTGTSWMIEGDIKGYFDNIDHHLLAGFIAELVKDQRLLALYWKLVRAG

YVNQGKAEPHLLTGVPQGRILSPLLSNIYLHQFDLFMEEIKVKYTTTGALSKNNPIYL

KARNKYYKLVKSLKASSAEIIRARRDMLKMTYGIQTGSRVRYVRYADDWVIGVTGPKA

LAVQIKEEVSTFLQEKLKLSLQAEKTRITNLSRSEALFLGTLISITTRKYVQSQKVGG

GHRRASLGRIRLCIPIDILIGKLSQMGACDEKGTPKAVTKWIFLNVGEIINKYMAVFR

GYYNYYSFADDIHHLLQIIYILRYSAINTVARKLGLNTAKVIKRFGVDLIFRDHTNEI

KHKLNFPRSLPNKRMNFALSPPSDPRVLFDTSCARIQCLNDLCQVCGSYEQVEMHHIK

RLSRDRGFTRLMVLNRKQLPVCRNCHRKIHRGEYNGMRLERLLARNKRKLD  

[top]


[secondary structure]

 

 

[top]