[Back to mitochondrial introns by organism] [Back to home page]

Information of Marchantia polymorpha atpA.I2 intron   (Format of information for each intron)

[intron sequence]

[intron with flanking sequence]

[ORF sequence]

[secondary structure]

[intron sequence]

 

The boundaries of the intron are marked as red and the ORF is marked as blue.

For many organellar introns, the ORF is translated in frame with the upstream

exon ORF. The precursor protein is then processed to a mature form that cuts 

off all of the exon-encoded and some of the intron-encoded polypeptide. This 

situation is indicated here by blue-coded ORF that extends to the start of 

the intron rather than by a translation start approximately 500 bp after the 

start of the intron.

3' end

                    

                                              gtgc gacgtgacgt gtgtgggatc 

117961 ggtgagttgg gggcgcatcg tcgtcccctg cgacagagcc aaggattaag gggtagttgg 

118021 acttttccta ccccaagtag caacaccgta ggcgatctta acagagcttc ttcgggggag 

118081 tcctccgaga ttcgagtgga agccccctac cacggtttgt ctgttagcac taggccgtat 

118141 ccttttgata ctgttagtcc tagccccgat acaaaagccc aggtccatgg gcggttttcg 

118201 ggaatcggtg ggcggttgcc cacagggatt agaagcttgg cgttaagaaa tgtcgatcct 

118261 cgtacgagcg cggcgagctc cttcggcgga tccctccaaa gacacaacta cagttcgctg 

118321 tcaaaagact tgagggccgc cgcgggcgcg gagatggagg tggaaaaccc cgaattagcc 

118381 ggtggatgcg cggaaaagca gaacccggtc aacgcgggac ctaatacctt ctgcatagaa 

118441 gatgtgcagg ctaaagcgag cgtgagctcg tcctttcatt tggagtctac ggactccata 

118501 gcaaaagttc tagcctcgaa aaggagtgat aaaggcaaat accaagatct ttttagcctt 

118561 ataataagcc cagagaattt aaaacaggcg tggctagaga taagaaacaa aacaggcaat 

118621 ctgacgcccg gtggcacggg cgaaaccggg gaagactggt ttacggagac cagcataagt 

118681 tttaataagg gaatctataa ataccaacta gccagaaaag ttggcatacc gggcgagcgc 

118741 cgcccactta ctataacatc tccccgggac aaaataattc aacaggcctt caaaagaatt 

118801 ttagagccta tcttcgaagg ttatgttgtc ggggttacag tccctcggaa ccagggcaac 

118861 gacatggtta aacactggat cctcccagtt gtctttagca acttatccca cggattcaga 

118921 cccggaaaaa atgcccattc ggcgctccaa caaatgcaat ttggttggag ggacgtacat 

118981 tggcttctcg actacgacgt gaggaaagct tttgataacg ttaaccacca cgtactgtgt 

119041 gctgcgctgg aagagcagat agcggacctt agggttatcg acgagattcg caagatgctt 

119101 aatgtgaaag ggttaaactt tgacatccag gaaaccctgg gaaccccaca ggggtccgtt 

119161 ctatcccctt tcctctttaa cgtttacatg cataaatttg accaattcat gcataatctt 

119221 atagcacact acgaccgttt gggcaagagg ttaataaatc ccgaatggaa agaagttagc 

119281 cgagtcagag ctgatgagaa gggtcagctg tcggaccagg ctactaggaa ccggcagaga 

119341 cgtttagccg ctgcccgcaa aggtatcaag cgatacatct accatccggc caatatcaag 

119401 atccgatacg ttcgttatgc tgacgatttt ttggtaggta tcgagggccc gaaggacctg 

119461 agtaagacca ttagagggga aatcaacgat ttccttcaat ctgccctgca cctagagata 

119521 aagaaggaca acttggtcca cgcgcgttcc gattgggttc gttttatagg cttcgacatt 

119581 caagctagaa tgacaggttc ctgctttacc aagggtaagg agcttgaggc tattaacaga 

119641 tttaaacaaa gagcccgtag ggccaaagag aagtcacaca agaagtacca atctatgatg 

119701 aacaaggtgg gagcttcgat ccaacgccga accatagagg atacctttgc tagggctgag 

119761 caaacctacc ttaagcaagt acaggtcgca gtgggagccg aaacccttag tcgatcgcaa 

119821 gctttggacg ctctgcaaga atcactcaat gtactaagac atgagtactc atgggaacag 

119881 ggtaacggac cagcccagat gcctgagaat tatcggaaca ccgattcggc aagcataagg 

119941 gaagtggctg ggcagtggat tcgagaggct aaatcccttg gaaacctcag ggtcgaacaa 

120001 gaagttagac aagccctgaa agatctatac ggggtagaca ttgtggagaa ggtggaggaa 

120061 ctcagtaagt tcctggacaa cttagactct ggtcgaggta aaataacacg gtacatcaaa 

120121 gataagtata aaggtaagag tgcagctata gcctcggtca acctttatat aagatgccca 

120181 atatctcaca tactggaatg tctgcggtcc caaaatattg tagccaagaa tacccgaaga 

120241 cctattgcgg tcactactat ttccaatctg gatgatctaa ctattatcga ctggtttaaa 

120301 tccaaagccc atgggatcct aagctattac aaatgcgcac acaacatctg ggaagtccga 

120361 gatctggtca actaccattt aagatattct ctccttagca ccttagccct caagcataag 

120421 tcctcaattt ccgggattat aagagggtac ggtaaagccc cgaaaatacg aacaattaat 

120481 gagaagagcc aggaatccac ggagttcccc acagtacaca gagtgaatag cacccgcaga 

120541 ggatttaaca ccaaaccaat aagcctaata acgttacagg attttcagga tctcctgatg 

120601 agaccgtgcg tggcgcggaa acagtactat gacccgcttc atgcagctaa acgccaaaat 

120661 cggtagtatc acccgataaa actactgcag aactgggaga ctagccgcca ctggtctacc 

120721 tagatgcctt tgatcattcc tctagtctcc tatgggagcc ggatgaggga gaaatttctc 

120781 acgtccggat ctttgagagc ctccgggcta ctctct

 

[top]


 

 

[intron& flanking sequence]

 

Genbank entry, intron is marked as red

 

                        agct ccatcacatt cggaagctgg taaaaaggtt caaaggcaac 

117481 atcgtcacca tcaacgttaa gggcaaacgc accggaaaac aggatctgga caaggtgcgt 

117541 ctttcagccc tatccagaaa acaaatcccc ctctgcccca aacatcactc cgactttcat 

117601 gaaggcacca taacattgga aaacatagac agcaagtatt cgtatcctaa gaccttatat 

117661 gtgggcggag aacaagggaa gttggtgaac atagaagacg tcaccaaaca acgttttctg 

117721 gaaaaaaaaa aacactagtt ttttttaccc ccccttttat ccccgtcagg ggataattgc 

117781 agggcctcgt tttcaccccc ggaatgttcc agtaggagcc gtatgatggg aaaccatcac 

117841 gtacggttcg gtgacggcct gcgggctagt tctacaacac aagctggaga cgtatccgct 

117901 tatattccta ccaatgtgat ttccattaca gatggagtgc gacgtgacgt gtgtgggatc 

117961 ggtgagttgg gggcgcatcg tcgtcccctg cgacagagcc aaggattaag gggtagttgg 

118021 acttttccta ccccaagtag caacaccgta ggcgatctta acagagcttc ttcgggggag 

118081 tcctccgaga ttcgagtgga agccccctac cacggtttgt ctgttagcac taggccgtat 

118141 ccttttgata ctgttagtcc tagccccgat acaaaagccc aggtccatgg gcggttttcg 

118201 ggaatcggtg ggcggttgcc cacagggatt agaagcttgg cgttaagaaa tgtcgatcct 

118261 cgtacgagcg cggcgagctc cttcggcgga tccctccaaa gacacaacta cagttcgctg 

118321 tcaaaagact tgagggccgc cgcgggcgcg gagatggagg tggaaaaccc cgaattagcc 

118381 ggtggatgcg cggaaaagca gaacccggtc aacgcgggac ctaatacctt ctgcatagaa 

118441 gatgtgcagg ctaaagcgag cgtgagctcg tcctttcatt tggagtctac ggactccata 

118501 gcaaaagttc tagcctcgaa aaggagtgat aaaggcaaat accaagatct ttttagcctt 

118561 ataataagcc cagagaattt aaaacaggcg tggctagaga taagaaacaa aacaggcaat 

118621 ctgacgcccg gtggcacggg cgaaaccggg gaagactggt ttacggagac cagcataagt 

118681 tttaataagg gaatctataa ataccaacta gccagaaaag ttggcatacc gggcgagcgc 

118741 cgcccactta ctataacatc tccccgggac aaaataattc aacaggcctt caaaagaatt 

118801 ttagagccta tcttcgaagg ttatgttgtc ggggttacag tccctcggaa ccagggcaac 

118861 gacatggtta aacactggat cctcccagtt gtctttagca acttatccca cggattcaga 

118921 cccggaaaaa atgcccattc ggcgctccaa caaatgcaat ttggttggag ggacgtacat 

118981 tggcttctcg actacgacgt gaggaaagct tttgataacg ttaaccacca cgtactgtgt 

119041 gctgcgctgg aagagcagat agcggacctt agggttatcg acgagattcg caagatgctt 

119101 aatgtgaaag ggttaaactt tgacatccag gaaaccctgg gaaccccaca ggggtccgtt 

119161 ctatcccctt tcctctttaa cgtttacatg cataaatttg accaattcat gcataatctt 

119221 atagcacact acgaccgttt gggcaagagg ttaataaatc ccgaatggaa agaagttagc 

119281 cgagtcagag ctgatgagaa gggtcagctg tcggaccagg ctactaggaa ccggcagaga 

119341 cgtttagccg ctgcccgcaa aggtatcaag cgatacatct accatccggc caatatcaag 

119401 atccgatacg ttcgttatgc tgacgatttt ttggtaggta tcgagggccc gaaggacctg 

119461 agtaagacca ttagagggga aatcaacgat ttccttcaat ctgccctgca cctagagata 

119521 aagaaggaca acttggtcca cgcgcgttcc gattgggttc gttttatagg cttcgacatt 

119581 caagctagaa tgacaggttc ctgctttacc aagggtaagg agcttgaggc tattaacaga 

119641 tttaaacaaa gagcccgtag ggccaaagag aagtcacaca agaagtacca atctatgatg 

119701 aacaaggtgg gagcttcgat ccaacgccga accatagagg atacctttgc tagggctgag 

119761 caaacctacc ttaagcaagt acaggtcgca gtgggagccg aaacccttag tcgatcgcaa 

119821 gctttggacg ctctgcaaga atcactcaat gtactaagac atgagtactc atgggaacag 

119881 ggtaacggac cagcccagat gcctgagaat tatcggaaca ccgattcggc aagcataagg 

119941 gaagtggctg ggcagtggat tcgagaggct aaatcccttg gaaacctcag ggtcgaacaa 

120001 gaagttagac aagccctgaa agatctatac ggggtagaca ttgtggagaa ggtggaggaa 

120061 ctcagtaagt tcctggacaa cttagactct ggtcgaggta aaataacacg gtacatcaaa 

120121 gataagtata aaggtaagag tgcagctata gcctcggtca acctttatat aagatgccca 

120181 atatctcaca tactggaatg tctgcggtcc caaaatattg tagccaagaa tacccgaaga 

120241 cctattgcgg tcactactat ttccaatctg gatgatctaa ctattatcga ctggtttaaa 

120301 tccaaagccc atgggatcct aagctattac aaatgcgcac acaacatctg ggaagtccga 

120361 gatctggtca actaccattt aagatattct ctccttagca ccttagccct caagcataag 

120421 tcctcaattt ccgggattat aagagggtac ggtaaagccc cgaaaatacg aacaattaat 

120481 gagaagagcc aggaatccac ggagttcccc acagtacaca gagtgaatag cacccgcaga 

120541 ggatttaaca ccaaaccaat aagcctaata acgttacagg attttcagga tctcctgatg 

120601 agaccgtgcg tggcgcggaa acagtactat gacccgcttc atgcagctaa acgccaaaat 

120661 cggtagtatc acccgataaa actactgcag aactgggaga ctagccgcca ctggtctacc 

120721 tagatgcctt tgatcattcc tctagtctcc tatgggagcc ggatgaggga gaaatttctc 

120781 acgtccggat ctttgagagc ctccgggcta ctctctcaaa tctttttgga aacagaactc 

120841 ttttatcgtg gaagtcgacc tgctattaac gtgggattat ctgtgagtcg cgtcggttct 

120901 gcggctcagt tgaaagccat gaaacaagta tgcggtagct taaaactaga attggcacaa 

120961 tatcgtgaag tagccgcttt tgctcaattt ggttcagacc ttgatgctgc gactcagtat 

121021 ttattaaatc gtggagctag gttaactgag atacttaaac aagcacaata tagcccaatt 

121081 cctattgaaa aacaaatcgt ggttatttat gcagctgtca aaggttactt agaccaaata 

121141 cccgtcgctc tcataacgca ctatgaacaa gagctattga aatctattga cccaggtcta 

121201 ctttctgcta ttgtacaaca aaaaaacatc actgagcaaa ttagtagtca actggctacc 

121261 ttttgccaaa aatttacgca gagcttctta gcaactcatc aatcataaaa aaagaa                                                                

 


[ORF sequence]

 

909 a.a.

 

VRRDVCGIGELGAHRRPLRQSQGLRGSWTFPTPSSNTVGDLNRASSGESSEIRVEAPY

HGLSVSTRPYPFDTVSPSPDTKAQVHGRFSGIGGRLPTGIRSLALRNVDPRTSAASSF

GGSLQRHNYSSLSKDLRAAAGAEMEVENPELAGGCAEKQNPVNAGPNTFCIEDVQAKA

SVSSSFHLESTDSIAKVLASKRSDKGKYQDLFSLIISPENLKQAWLEIRNKTGNLTPG

GTGETGEDWFTETSISFNKGIYKYQLARKVGIPGERRPLTITSPRDKIIQQAFKRILE

PIFEGYVVGVTVPRNQGNDMVKHWILPVVFSNLSHGFRPGKNAHSALQQMQFGWRDVH

WLLDYDVRKAFDNVNHHVLCAALEEQIADLRVIDEIRKMLNVKGLNFDIQETLGTPQG

SVLSPFLFNVYMHKFDQFMHNLIAHYDRLGKRLINPEWKEVSRVRADEKGQLSDQATR

NRQRRLAAARKGIKRYIYHPANIKIRYVRYADDFLVGIEGPKDLSKTIRGEINDFLQS

ALHLEIKKDNLVHARSDWVRFIGFDIQARMTGSCFTKGKELEAINRFKQRARRAKEKS

HKKYQSMMNKVGASIQRRTIEDTFARAEQTYLKQVQVAVGAETLSRSQALDALQESLN

VLRHEYSWEQGNGPAQMPENYRNTDSASIREVAGQWIREAKSLGNLRVEQEVRQALKD

LYGVDIVEKVEELSKFLDNLDSGRGKITRYIKDKYKGKSAAIASVNLYIRCPISHILE

CLRSQNIVAKNTRRPIAVTTISNLDDLTIIDWFKSKAHGILSYYKCAHNIWEVRDLVN

YHLRYSLLSTLALKHKSSISGIIRGYGKAPKIRTINEKSQESTEFPTVHRVNSTRRGF

NTKPISLITLQDFQDLLMRPCVARKQYYDPLHAAKRQNR

 

[top]


[secondary structure]

 

 

[top]