[Back to mitochondrial introns by organism] [Back to home page]

Information of Marchantia polymorpha atpA.I1 intron   (Format of information for each intron)

[intron sequence]

[intron with flanking sequence]

[ORF sequence]

[secondary structure]

[intron sequence]

 

The boundaries of the intron are marked as red and the ORF is marked as blue.

For many organellar introns, the ORF is translated in frame with the upstream

exon ORF. The precursor protein is then processed to a mature form that cuts 

off all of the exon-encoded and some of the intron-encoded polypeptide. This 

situation is indicated here by blue-coded ORF that extends to the start of 

the intron rather than by a translation start approximately 500 bp after the 

start of the intron.

3' end

                    

                                                                      

114541 tgcgccccga cgggacgtag tatgattcaa acaatggttg gaggggaagt cgctggcagg 

114601 tactaccaaa ccacgacgag aagcaacccg aaagaccaga gcgctaggag gataggaaga 

114661 gcggatattc acggagtcca gaagccaata aataagcctt cttcgcctgt tattaacctt 

114721 gttcaatggg tgaacgcagc atcgtcgaaa tacgatgaga gctctgggta taagcatata 

114781 ccttacccaa gaatccactc cggcagcgct caccttacga gacaggggcg tctactgcgt 

114841 ccgagcgggg ttgttactga gggtgataag ccaaaggctt acgctctcgg tacgaggaat 

114901 tattccaaag acagctttca gcctccgtta actgaaggcg tccatacgaa tgaaggtact 

114961 ggaacgaatt tcgagctcct aacagggctt cgaaacagat ggaaggtgaa aacgggaaaa 

115021 tatgaagaca tcttttcgct tatcgcgagg gaagataatc taatcctggc atggaacagc 

115081 ataagaagca agccaggtaa cttaaccccc ggaacaactc ccgaaactct agatggtatc 

115141 gacaccaagt ggttccaaga cactagttcg aaactaaaga gcgggactta caggtatgcc 

115201 ccggcgagga gagttgatgt tccaaaaccc aacaaacccg gggagacgcg tcccctaaca 

115261 gtttgtaacc cccgagacaa gatcattcag caagcgttct taaacgtatt gcaacccatc 

115321 ttcgaaggcg gctccgtctg gcaagaaagc gagcaatcga cctacgaaaa ccactcccgg 

115381 gaatacaccc aatgtcatct ctccctaacc ggaaggaaaa tatcacactt caaaggtgag 

115441 gacggtagct ataccaagaa gttcctgacc aggcattggc tattaccccc tatattcctt 

115501 aacgtagccc acgggttcag acctaaccga ggcgttcact cggcgcttaa agaaatcaag 

115561 caattctggg gcgccccaaa ttggttcctt aagttcgcta ttaagaaggc cttcgacaat 

115621 acgaaccata acctgattct aaatctcttg aagcaacacg ttgcggacca acgtgtggaa 

115681 gacgagcttc ggaaaatgtt gaaggcccgc gtcatcaact tcgtgcttgg agaagaatca 

115741 actatgacct taggtgtgcc tcagggttct gtcttaagtc ccttcctatt caacgtagcc 

115801 atgcatcatc tggacgtgtt catgatcgat ctcaaaaaaa aacatcaggt gccctcggaa 

115861 aaaataccca ataaggtgta cgtaagcgaa agacgtaaat tgctaacgtt agcaaaaaaa 

115921 caagaatggg gtatcagaaa aaaactgaga gccaacaggg acttaagaaa agacctcaag 

115981 cgcagggggg ttagcctttt tgaatacaag gtgaatcctc gtaacattta ttatgtccga 

116041 tatgcggacg attttatcgt cggggtccag ggagataaag cctttgccaa ggacacggct 

116101 acgtcgatct ctaattttat taagagtagt atgcactttg aagtctcaga gtacatattt 

116161 cacaccaaaa gcggcaaaac tcctttctta gggtttcacc tatcggtcgg actccttggc 

116221 ccctcggtaa agggtaaaat agcagagcgc tttgcacggc tcaagacacg aattcgaggc 

116281 gcgcgcaggg gagaatataa gcaataccta aaaatggtga gcaagacgta caccaaatac 

116341 tataggaagg tcctagaggg tcacctccga caggcccatc agactatgtt gtctagcaag 

116401 ctcttgaaat ccggaggctt gactctggcg cgccaaagaa ctatcgacgc attggaagag 

116461 acgctgaacc agatcaagaa ggaagcgcag cgaggctcgg agggccgaag gccggagtgc 

116521 tttccgacgg gcattttagt atcagacccg gcggcagccc gcagggctct cgggaatccg 

116581 cctgctaacc ctaggtacca agaggcagag gaggaatggg ataaggaatg ggaattcttg 

116641 gtctcggctt ggggttcgag ggccctatca ctagccagga ttgagcccga caaggaggcc 

116701 gacctgttca acatcatggg gagggaagac cttagcaaac tgattagtct gagggagcaa 

116761 taccatagcg cgctcgagga actcctgagc agagagacct cgggcaagat ggtaaaatac 

116821 gttcagggta agttcgcggc cgcaggcaga ggagaaggaa acgccaacgc tattttggct 

116881 tcacagctca ctaacaatag atatcgtagt acggtttatc tggagatgcc ttattcaaaa 

116941 attagagaga agctgcgatc ggttggattc attcgggacg aacagggtgc attccctaaa 

117001 tccctgcccc agatcacaga gcttgaggat attcaaataa tcgatttttt taagcagaag 

117061 gcctatggtc tgcttaactt ctaccaatgt gcggacaatc gatgggagct tatcaagatc 

117121 gtgaattgga tgcttagata ttctctctta cacacactag cgcacaagta taacactact 

117181 gtgtcaaaaa taatgggcgt atactccaaa tctccgaaag tattcattga agctgagaac 

117241 cctggcgaat attctatgct cactggtttt cccaccccac tggagataaa cactagacgc 

117301 aaagaattcc tcgtttcggg ggatgaaact cccgaaagct tggaaaaact ccttgacgat 

117361 aggcggaccc ttctcaccca atcgaaggcg aagttcaata aatgctgtgt cgctgattgc 

117421 cctgaaacta acataaagct ccatcacatt cggaagctgg taaaaaggtt caaaggcaac 

117481 atcgtcacca tcaacgttaa gggcaaacgc accggaaaac aggatctgga caaggtgcgt 

117541 ctttcagccc tatccagaaa acaaatcccc ctctgcccca aacatcactc cgactttcat 

117601 gaaggcacca taacattgga aaacatagac agcaagtatt cgtatcctaa gaccttatat 

117661 gtgggcggag aacaagggaa gttggtgaac atagaagacg tcaccaaaca acgttttctg 

117721 gaaaaaaaaa aacactagtt ttttttaccc ccccttttat ccccgtcagg ggataattgc 

117781 agggcctcgt tttcaccccc ggaatgttcc agtaggagcc gtatgatggg aaaccatcac 

117841 gtacggttcg gtgacggcct gcgggctagt tctac

 

[top]


 

 

[intron& flanking sequence]

 

Genbank entry, intron is marked as red

 

                                                 c aacgagaact tataatagga 

114061 gacagacaaa ctggaaagac cgctatagct attgatacca tattgaacca gaaacaaatc 

114121 aacgcacagg gcacctctga tagtgaaaaa ttgtattgtg tatatgtagc gattggacag 

114181 aaacgttcaa ccgtggcaca attagttaag attctttcag aagcgggtgc tttagaatat 

114241 tccattattg tagcagccac tgcttcggat ccagctcctt tgcaattcct ggcaccatat 

114301 tcaggttgtg ctatgggaga atattttaga gataatggaa tgcacgcatt aataatctat 

114361 gatgatctga gtaagcaatc agtggcgtat cgccaaatgt cattattatt acgtcgacca 

114421 ccaggtcgtg aggcgttccc aggagatgtg ttctatttac attctcgttt attagaaaga 

114481 gccgctaaaa tgtcagacca gactggtgcg ggtagcttga ctgcgctacc tgttattga

114541 tgcgccccga cgggacgtag tatgattcaa acaatggttg gaggggaagt cgctggcagg 

114601 tactaccaaa ccacgacgag aagcaacccg aaagaccaga gcgctaggag gataggaaga 

114661 gcggatattc acggagtcca gaagccaata aataagcctt cttcgcctgt tattaacctt 

114721 gttcaatggg tgaacgcagc atcgtcgaaa tacgatgaga gctctgggta taagcatata 

114781 ccttacccaa gaatccactc cggcagcgct caccttacga gacaggggcg tctactgcgt 

114841 ccgagcgggg ttgttactga gggtgataag ccaaaggctt acgctctcgg tacgaggaat 

114901 tattccaaag acagctttca gcctccgtta actgaaggcg tccatacgaa tgaaggtact 

114961 ggaacgaatt tcgagctcct aacagggctt cgaaacagat ggaaggtgaa aacgggaaaa 

115021 tatgaagaca tcttttcgct tatcgcgagg gaagataatc taatcctggc atggaacagc 

115081 ataagaagca agccaggtaa cttaaccccc ggaacaactc ccgaaactct agatggtatc 

115141 gacaccaagt ggttccaaga cactagttcg aaactaaaga gcgggactta caggtatgcc 

115201 ccggcgagga gagttgatgt tccaaaaccc aacaaacccg gggagacgcg tcccctaaca 

115261 gtttgtaacc cccgagacaa gatcattcag caagcgttct taaacgtatt gcaacccatc 

115321 ttcgaaggcg gctccgtctg gcaagaaagc gagcaatcga cctacgaaaa ccactcccgg 

115381 gaatacaccc aatgtcatct ctccctaacc ggaaggaaaa tatcacactt caaaggtgag 

115441 gacggtagct ataccaagaa gttcctgacc aggcattggc tattaccccc tatattcctt 

115501 aacgtagccc acgggttcag acctaaccga ggcgttcact cggcgcttaa agaaatcaag 

115561 caattctggg gcgccccaaa ttggttcctt aagttcgcta ttaagaaggc cttcgacaat 

115621 acgaaccata acctgattct aaatctcttg aagcaacacg ttgcggacca acgtgtggaa 

115681 gacgagcttc ggaaaatgtt gaaggcccgc gtcatcaact tcgtgcttgg agaagaatca 

115741 actatgacct taggtgtgcc tcagggttct gtcttaagtc ccttcctatt caacgtagcc 

115801 atgcatcatc tggacgtgtt catgatcgat ctcaaaaaaa aacatcaggt gccctcggaa 

115861 aaaataccca ataaggtgta cgtaagcgaa agacgtaaat tgctaacgtt agcaaaaaaa 

115921 caagaatggg gtatcagaaa aaaactgaga gccaacaggg acttaagaaa agacctcaag 

115981 cgcagggggg ttagcctttt tgaatacaag gtgaatcctc gtaacattta ttatgtccga 

116041 tatgcggacg attttatcgt cggggtccag ggagataaag cctttgccaa ggacacggct 

116101 acgtcgatct ctaattttat taagagtagt atgcactttg aagtctcaga gtacatattt 

116161 cacaccaaaa gcggcaaaac tcctttctta gggtttcacc tatcggtcgg actccttggc 

116221 ccctcggtaa agggtaaaat agcagagcgc tttgcacggc tcaagacacg aattcgaggc 

116281 gcgcgcaggg gagaatataa gcaataccta aaaatggtga gcaagacgta caccaaatac 

116341 tataggaagg tcctagaggg tcacctccga caggcccatc agactatgtt gtctagcaag 

116401 ctcttgaaat ccggaggctt gactctggcg cgccaaagaa ctatcgacgc attggaagag 

116461 acgctgaacc agatcaagaa ggaagcgcag cgaggctcgg agggccgaag gccggagtgc 

116521 tttccgacgg gcattttagt atcagacccg gcggcagccc gcagggctct cgggaatccg 

116581 cctgctaacc ctaggtacca agaggcagag gaggaatggg ataaggaatg ggaattcttg 

116641 gtctcggctt ggggttcgag ggccctatca ctagccagga ttgagcccga caaggaggcc 

116701 gacctgttca acatcatggg gagggaagac cttagcaaac tgattagtct gagggagcaa 

116761 taccatagcg cgctcgagga actcctgagc agagagacct cgggcaagat ggtaaaatac 

116821 gttcagggta agttcgcggc cgcaggcaga ggagaaggaa acgccaacgc tattttggct 

116881 tcacagctca ctaacaatag atatcgtagt acggtttatc tggagatgcc ttattcaaaa 

116941 attagagaga agctgcgatc ggttggattc attcgggacg aacagggtgc attccctaaa 

117001 tccctgcccc agatcacaga gcttgaggat attcaaataa tcgatttttt taagcagaag 

117061 gcctatggtc tgcttaactt ctaccaatgt gcggacaatc gatgggagct tatcaagatc 

117121 gtgaattgga tgcttagata ttctctctta cacacactag cgcacaagta taacactact 

117181 gtgtcaaaaa taatgggcgt atactccaaa tctccgaaag tattcattga agctgagaac 

117241 cctggcgaat attctatgct cactggtttt cccaccccac tggagataaa cactagacgc 

117301 aaagaattcc tcgtttcggg ggatgaaact cccgaaagct tggaaaaact ccttgacgat 

117361 aggcggaccc ttctcaccca atcgaaggcg aagttcaata aatgctgtgt cgctgattgc 

117421 cctgaaacta acataaagct ccatcacatt cggaagctgg taaaaaggtt caaaggcaac 

117481 atcgtcacca tcaacgttaa gggcaaacgc accggaaaac aggatctgga caaggtgcgt 

117541 ctttcagccc tatccagaaa acaaatcccc ctctgcccca aacatcactc cgactttcat 

117601 gaaggcacca taacattgga aaacatagac agcaagtatt cgtatcctaa gaccttatat 

117661 gtgggcggag aacaagggaa gttggtgaac atagaagacg tcaccaaaca acgttttctg 

117721 gaaaaaaaaa aacactagtt ttttttaccc ccccttttat ccccgtcagg ggataattgc 

117781 agggcctcgt tttcaccccc ggaatgttcc agtaggagcc gtatgatggg aaaccatcac 

117841 gtacggttcg gtgacggcct gcgggctagt tctacaacac aagctggaga cgtatccgct 

117901 tatattccta ccaatgtgat ttccattaca gatggagtgc gacgtgacgt gtgtgggatc 

117961 ggtgagttgg gggcgcatcg tcgtcccctg cgacagagcc aaggattaag gggtagttgg 

118021 acttttccta ccccaagtag caacaccgta ggcgatctta acagagcttc ttcgggggag 

118081 tcctccgaga ttcgagtgga agccccctac cacggtttgt ctgttagcac taggccgtat 

118141 ccttttgata ctgttagtcc tagccccgat acaaaagccc aggtccatgg gcggttttcg 

118201 ggaatcggtg ggcggttgcc cacagggatt agaagcttgg cgttaagaaa tgtcgatcct 

118261 cgtacgagcg cggcgagctc cttcggcgga tccctccaaa gacacaacta cagttcgctg 

118321 tcaaaagact tgagggccgc cgcgggcgcg gagatggagg tggaaaaccc cgaat                                                                

 

[top]


[ORF sequence]

 

1064 a.a.

APTGRSMIQTMVGGEVAGRYYQTTTRSNPKDQSARRIGRADIHGVQKPINKPSSPVIN

LVQWVNAASSKYDESSGYKHIPYPRIHSGSAHLTRQGRLLRPSGVVTEGDKPKAYALG

TRNYSKDSFQPPLTEGVHTNEGTGTNFELLTGLRNRWKVKTGKYEDIFSLIAREDNLI

LAWNSIRSKPGNLTPGTTPETLDGIDTKWFQDTSSKLKSGTYRYAPARRVDVPKPNKP

GETRPLTVCNPRDKIIQQAFLNVLQPIFEGGSVWQESEQSTYENHSREYTQCHLSLTG

RKISHFKGEDGSYTKKFLTRHWLLPPIFLNVAHGFRPNRGVHSALKEIKQFWGAPNWF

LKFAIKKAFDNTNHNLILNLLKQHVADQRVEDELRKMLKARVINFVLGEESTMTLGVP

QGSVLSPFLFNVAMHHLDVFMIDLKKKHQVPSEKIPNKVYVSERRKLLTLAKKQEWGI

RKKLRANRDLRKDLKRRGVSLFEYKVNPRNIYYVRYADDFIVGVQGDKAFAKDTATSI

SNFIKSSMHFEVSEYIFHTKSGKTPFLGFHLSVGLLGPSVKGKIAERFARLKTRIRGA

RRGEYKQYLKMVSKTYTKYYRKVLEGHLRQAHQTMLSSKLLKSGGLTLARQRTIDALE

ETLNQIKKEAQRGSEGRRPECFPTGILVSDPAAARRALGNPPANPRYQEAEEEWDKEW

EFLVSAWGSRALSLARIEPDKEADLFNIMGREDLSKLISLREQYHSALEELLSRETSG

KMVKYVQGKFAAAGRGEGNANAILASQLTNNRYRSTVYLEMPYSKIREKLRSVGFIRD

EQGAFPKSLPQITELEDIQIIDFFKQKAYGLLNFYQCADNRWELIKIVNWMLRYSLLH

TLAHKYNTTVSKIMGVYSKSPKVFIEAENPGEYSMLTGFPTPLEINTRRKEFLVSGDE

TPESLEKLLDDRRTLLTQSKAKFNKCCVADCPETNIKLHHIRKLVKRFKGNIVTINVK

GKRTGKQDLDKVRLSALSRKQIPLCPKHHSDFHEGTITLENIDSKYSYPKTLYVGGEQ

GKLVNIEDVTKQRFLEKKKH

 

[top]


[secondary structure]

 not available

 

[top]