
Information on the Marchantia polymorpha cox2.I2 intron   (Format of information for each intron)

[intron sequence]

[intron with flanking sequence]

[ORF sequence]

[secondary structure]

[intron sequence]

 

The boundaries of the intron are marked in red and the ORF is marked in blue.

For many organellar introns, the ORF is translated in frame with the upstream
exon ORF. The precursor protein is then processed to a mature form, which
removes all of the exon-encoded and some of the intron-encoded polypeptide.
This situation is indicated here by a blue-coded ORF that extends to the start
of the intron, rather than by a translation start approximately 500 bp after
the start of the intron.
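As a rough illustration of the in-frame arrangement described above, the following Python sketch translates the first bases of the intron. The reading-frame offset of 2 bases, the helper name `translate`, and the use of the standard genetic code are assumptions made for this example, not annotations taken from the entry. Under those assumptions the result agrees with the opening residues of the ORF sequence listed on this page.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons ordered t, c, a, g.
# Assumption: the standard code applies to this plant mitochondrial ORF.
BASES = "tcag"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna, offset=0):
    """Translate the complete codons of `dna`, starting `offset` bases in."""
    dna = dna.replace(" ", "").lower()
    codons = (dna[i:i + 3] for i in range(offset, len(dna) - 2, 3))
    return "".join(CODON_TABLE[c] for c in codons)


# First 79 bases of the intron, copied from the listing on this page.
intron_start = (
    "gggcgaccg ttaggtcacc "
    "catagttatg tcaatggggc tcaacacatg ggctctaaac cgcgcgagcc tatcttccgc"
)

# Assumption: the ORF frame begins at the third base of the intron (offset 2).
peptide = translate(intron_start, offset=2)
print(peptide)  # ATVRSPIVMSMGLNTWALNRASLSS
```

The two bases skipped by the offset would complete a codon begun in the upstream exon, which is consistent with the ORF being read in frame with the exon.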

3' end

                    

                                                   gggcgaccg ttaggtcacc 

82441 catagttatg tcaatggggc tcaacacatg ggctctaaac cgcgcgagcc tatcttccgc 

82501 gcgccaggta tacgactcat ccttgccacg taagtggtgg gagaccaaag catgtcgtaa 

82561 aagcgagatt gaggggtcct tgtcgttcgc agctggactg tggaaagccc aagatcatct 

82621 tgctaacctg ccatcgctat tcgagatgag gagctgttct ggcgaatcta agttcctttc 

82681 ggtcgaattg aacctgatgc tatccggatc caacccggtg acacgcacga gcgcacgcat 

82741 taaagctctc agcctgacaa tggtccaaag tgtacaggcc ggagatagtg cggtgcctat 

82801 accccgcatg agagcagatt acctacatca ctggggacag ggaactaaaa gacatctcgt 

82861 ggcgacacgg cctatcggtg ataccggtga gatcccagcg tggtcacacg tcctgggaag 

82921 tcagaggatc catatgacct gcctactgtt aacgcaccct cgtaagagga aagcgctttg 

82981 cgaacacaaa gcaacgggga agggatctaa gcatctgcca tacctttgca gattttactc 

83041 gatatcgtct acaaactcag ccccctggga tcccaaggtg gatgtgtccg actatgtgcg 

83101 gaatggggtt gatgcccttg tggacctttg gatctcgtct tttcgaaggc gtgactggat 

83161 ataccatgat ctaagcaatt acctaaagag tatggatata tggagcatag cctaccaaaa 

83221 actgagacct aatccagggt caatgaatcc tgggactcac ggtctgacca tcgatggcac 

83281 gtctttccgt aagcttcaag cgctgagaga tgcagtcctg gacagcgaaa gcccatatga 

83341 atggggaggc acgaaaatca ttaccaagcc gggtaagcgc gagaaaatat cacttggaat 

83401 tccctgcttc caagaccgta tcgtgcaaga ggtactgaaa atgcttctag agcctatcta 

83461 tgaatcaata ttctctcgga gatcccacgg ctggcgacct gggcgtagcg cgcacacggc 

83521 tctacgaacc atacgcagcg acttcaaaaa aactaattgg attgtaccag gcaacattaa 

83581 caaattattc gacatcgtga accatggcat cctctgccac atcatgagac gtaagatccg 

83641 agacaaaaaa ctgctaaaac tcattgctgg cggattgaaa gcaaagatac acatgcccta 

83701 tgggaacatt gaggagtcaa acttgggaac cccgcaaggc agaatactaa gtccaatact 

83761 gtccaatata tatctacacg agttcgacat atggatagaa gagcgcatcc agcaatacaa 

83821 tctaggaagg aaggagactc gtagctgggt gctgcttcgg aagcagggaa agatgcgtaa 

83881 agcgcgcctc cgcagtgacc cattcaaccc attgtatcga agaatggagt acagacgata 

83941 cggagatgac ttcctcatag ctatacgtgg ccccctgtct gatgctaaag ccattcgtca 

84001 ggaatgcgaa accttcctaa gagagaaact caaattgcta ctgaacatgg aaaagaccca 

84061 tataaaacat atatccgtgg gaattccttt cttggggcac cgtatcggac gccgagtagt 

84121 acacacgaaa caaagatacc agacccaaga gggttggcga tggaggataa aaaagaaagt 

84181 tatcctcacc atggacgcag acatgaacca gttgaagctg cgattgcaac agcaggctta 

84241 ccttgctggg aatggagatc ctttaccaaa cttcggcttg atgagcttac ctcaaagcga 

84301 agcaaaccga cgaatgaact caatcttgcg cggatacgcc aattggtatc agtttgccgg 

84361 aaacaaacgc ggggccatcg cctatttggc ttacgtcttg cgttcctcct tggccaaaat 

84421 gtttgcagcg aagtttaaac tgcactcttt gaagaaggta ttccagatag caggtaacga 

84481 cttaggtagg gcgctcgaag cccgatatcc agtgggagtc actgactcac aagtggaagc 

84541 ttggcaacag tctgtgagtg ggaaaggtac gcctagagac atcctgggct tttgggagat 

84601 cagtcaacgg aacagggtcc gaagtagact tctcggacac acgaaaaagc gttgattggc 

84661 ctgagcgata aagcagagat gtattaataa gagtgcgcga aatttcggaa gtacgtgtac 

84721 agtcgtgtag tttcaactgc cccaatgacg cgctactcgt gtggcgtttt gctcaaggag 

84781 tgaaaagaat ccccatgtgg acgatctccg gacaatgtca aaatcatgtg cggggcccgc 

84841 cccctcgccc gaacacgctc cccgcacgtc cgtgcctatg ggtccgaaga cttctttgcc 

84901 tttacagggc ccggaggccc gaagaagtgc gcggaaattg taaagccaaa ggggttctac 

84961 aagctaaaga agtatccttc ggacccttcg gcccaaaaaa gcgtaccgcg tagcgccgcg 

85021 ccttccttct aggtgcccac cgggcaactg gcagactagg ccagccccgc tatctgaact 

85081 cggaaagtaa tccgaagtaa gtcgagttag tgagccgtag cacggtgacg tgctcatacg 

85141 gttcggagag cacttgagta atcttataat gaggtgaaat ttttactcta t
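The coordinates of the intron can be estimated from the row numbering of the listing above. The sketch below assumes, as in GenBank flat files, that the number on a row gives the position of that row's first base, and that the unnumbered first row is the tail of the row ending just before position 82441. The helper name `row_bases` and the variable names are illustrative only.

```python
def row_bases(row):
    """Return the bases of a listing row, dropping any position number and blanks."""
    return "".join(tok for tok in row.split() if not tok.isdigit())


# Rows copied from the intron listing on this page.
first_partial = "gggcgaccg ttaggtcacc"          # unnumbered first row
first_numbered = "82441 catagttatg tcaatggggc"  # next row (truncated here)
last_row = "85141 gttcggagag cacttgagta atcttataat gaggtgaaat ttttactcta t"

# The partial row ends at the base just before the next numbered row.
next_row_start = int(first_numbered.split()[0])
intron_start = next_row_start - len(row_bases(first_partial))

# The last row begins at its number and runs for as many bases as it holds.
last_row_start = int(last_row.split()[0])
intron_end = last_row_start + len(row_bases(last_row)) - 1

intron_length = intron_end - intron_start + 1
print(intron_start, intron_end, intron_length)  # 82422 85191 2770
```

The length here is derived from the row numbers alone, not from counting every base in the listing.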

 



 

 

[intron with flanking sequence]

 

GenBank entry; the intron is marked in red.

 

                             gtaatgacc atgtgtagtc ttttgggggg atcacaatgg 

81961 gaacgagtgt atctacacca catgaaagta ggcggcttct cctcgtgaca caaaatttgc 

82021 aaatgaatct agaatgccct ctctctttgg tcgagtgttt gaaggacact ctttgccgaa 

82081 atggctgaga aagaagggtc ttcgcactac gtgcctctcc ctatagtgct cgtgcgtgga 

82141 aatcgtcaag ccatttccgg tgcgtaggcg tgtaagggag ttttgagaat gaaatggagc 

82201 tgtatgaagg taaactttca cgtacagttc ttagggggga aagcccgtaa gggcctacct 

82261 atccaaacta attgacttac ataatgatat ttttttcttt ttaatcgtta ttttgatctt 

82321 tgtattatgg atgttggttc gcgctttatg gcattttcac tataaaagaa atccaattcc 

82381 agaaaggatt gttcatggaa ctactataga aattatttgg agggcgaccg ttaggtcacc 

82441 catagttatg tcaatggggc tcaacacatg ggctctaaac cgcgcgagcc tatcttccgc 

82501 gcgccaggta tacgactcat ccttgccacg taagtggtgg gagaccaaag catgtcgtaa 

82561 aagcgagatt gaggggtcct tgtcgttcgc agctggactg tggaaagccc aagatcatct 

82621 tgctaacctg ccatcgctat tcgagatgag gagctgttct ggcgaatcta agttcctttc 

82681 ggtcgaattg aacctgatgc tatccggatc caacccggtg acacgcacga gcgcacgcat 

82741 taaagctctc agcctgacaa tggtccaaag tgtacaggcc ggagatagtg cggtgcctat 

82801 accccgcatg agagcagatt acctacatca ctggggacag ggaactaaaa gacatctcgt 

82861 ggcgacacgg cctatcggtg ataccggtga gatcccagcg tggtcacacg tcctgggaag 

82921 tcagaggatc catatgacct gcctactgtt aacgcaccct cgtaagagga aagcgctttg 

82981 cgaacacaaa gcaacgggga agggatctaa gcatctgcca tacctttgca gattttactc 

83041 gatatcgtct acaaactcag ccccctggga tcccaaggtg gatgtgtccg actatgtgcg 

83101 gaatggggtt gatgcccttg tggacctttg gatctcgtct tttcgaaggc gtgactggat 

83161 ataccatgat ctaagcaatt acctaaagag tatggatata tggagcatag cctaccaaaa 

83221 actgagacct aatccagggt caatgaatcc tgggactcac ggtctgacca tcgatggcac 

83281 gtctttccgt aagcttcaag cgctgagaga tgcagtcctg gacagcgaaa gcccatatga 

83341 atggggaggc acgaaaatca ttaccaagcc gggtaagcgc gagaaaatat cacttggaat 

83401 tccctgcttc caagaccgta tcgtgcaaga ggtactgaaa atgcttctag agcctatcta 

83461 tgaatcaata ttctctcgga gatcccacgg ctggcgacct gggcgtagcg cgcacacggc 

83521 tctacgaacc atacgcagcg acttcaaaaa aactaattgg attgtaccag gcaacattaa 

83581 caaattattc gacatcgtga accatggcat cctctgccac atcatgagac gtaagatccg 

83641 agacaaaaaa ctgctaaaac tcattgctgg cggattgaaa gcaaagatac acatgcccta 

83701 tgggaacatt gaggagtcaa acttgggaac cccgcaaggc agaatactaa gtccaatact 

83761 gtccaatata tatctacacg agttcgacat atggatagaa gagcgcatcc agcaatacaa 

83821 tctaggaagg aaggagactc gtagctgggt gctgcttcgg aagcagggaa agatgcgtaa 

83881 agcgcgcctc cgcagtgacc cattcaaccc attgtatcga agaatggagt acagacgata 

83941 cggagatgac ttcctcatag ctatacgtgg ccccctgtct gatgctaaag ccattcgtca 

84001 ggaatgcgaa accttcctaa gagagaaact caaattgcta ctgaacatgg aaaagaccca 

84061 tataaaacat atatccgtgg gaattccttt cttggggcac cgtatcggac gccgagtagt 

84121 acacacgaaa caaagatacc agacccaaga gggttggcga tggaggataa aaaagaaagt 

84181 tatcctcacc atggacgcag acatgaacca gttgaagctg cgattgcaac agcaggctta 

84241 ccttgctggg aatggagatc ctttaccaaa cttcggcttg atgagcttac ctcaaagcga 

84301 agcaaaccga cgaatgaact caatcttgcg cggatacgcc aattggtatc agtttgccgg 

84361 aaacaaacgc ggggccatcg cctatttggc ttacgtcttg cgttcctcct tggccaaaat 

84421 gtttgcagcg aagtttaaac tgcactcttt gaagaaggta ttccagatag caggtaacga 

84481 cttaggtagg gcgctcgaag cccgatatcc agtgggagtc actgactcac aagtggaagc 

84541 ttggcaacag tctgtgagtg ggaaaggtac gcctagagac atcctgggct tttgggagat 

84601 cagtcaacgg aacagggtcc gaagtagact tctcggacac acgaaaaagc gttgattggc 

84661 ctgagcgata aagcagagat gtattaataa gagtgcgcga aatttcggaa gtacgtgtac 

84721 agtcgtgtag tttcaactgc cccaatgacg cgctactcgt gtggcgtttt gctcaaggag 

84781 tgaaaagaat ccccatgtgg acgatctccg gacaatgtca aaatcatgtg cggggcccgc 

84841 cccctcgccc gaacacgctc cccgcacgtc cgtgcctatg ggtccgaaga cttctttgcc 

84901 tttacagggc ccggaggccc gaagaagtgc gcggaaattg taaagccaaa ggggttctac 

84961 aagctaaaga agtatccttc ggacccttcg gcccaaaaaa gcgtaccgcg tagcgccgcg 

85021 ccttccttct aggtgcccac cgggcaactg gcagactagg ccagccccgc tatctgaact 

85081 cggaaagtaa tccgaagtaa gtcgagttag tgagccgtag cacggtgacg tgctcatacg 

85141 gttcggagag cacttgagta atcttataat gaggtgaaat ttttactcta tctatttttc 

85201 caagtattat tctgatgttt attgcaatac cttcgttcgc ccttctttat tcaatggacg 

85261 aggtagtaga tccagctatt actatcaaag ctattggaca tcaatggtat tggacttatg 

85321 agtattcaga ctataacagt tctgatgaac agtcactaac ttttgacagt tatatgattc 

85381 cagaggatga tttagaattg ggtcaattac gtttattaga agtggacaat cgagtggttg 

85441 taccagcaaa aactcatcta cgtatgatta ttacttctgc tgatgtactt catagttggg 

85501 ctgtaccttc cttaggtgta aaatgtgatg ctgtacctgg tcgtttaaat cagacttcca 

85561 tttttattaa acgagaaggt gtttactatg gtcagtgcag tgaactttgt ggaactaatc 

85621 atggctttat gcctattgtc gtagaggcag tttccttgga tgattatgtt tcttgggtat 

85681 ccaataaatt a                                                                

 



[ORF sequence]

 

743 a.a.

 

ATVRSPIVMSMGLNTWALNRASLSSARQVYDSSLPRKWWETKACRKSEIEGSLSFAAG

LWKAQDHLANLPSLFEMRSCSGESKFLSVELNLMLSGSNPVTRTSARIKALSLTMVQS

VQAGDSAVPIPRMRADYLHHWGQGTKRHLVATRPIGDTGEIPAWSHVLGSQRIHMTCL

LLTHPRKRKALCEHKATGKGSKHLPYLCRFYSISSTNSAPWDPKVDVSDYVRNGVDAL

VDLWISSFRRRDWIYHDLSNYLKSMDIWSIAYQKLRPNPGSMNPGTHGLTIDGTSFRK

LQALRDAVLDSESPYEWGGTKIITKPGKREKISLGIPCFQDRIVQEVLKMLLEPIYES

IFSRRSHGWRPGRSAHTALRTIRSDFKKTNWIVPGNINKLFDIVNHGILCHIMRRKIR

DKKLLKLIAGGLKAKIHMPYGNIEESNLGTPQGRILSPILSNIYLHEFDIWIEERIQQ

YNLGRKETRSWVLLRKQGKMRKARLRSDPFNPLYRRMEYRRYGDDFLIAIRGPLSDAK

AIRQECETFLREKLKLLLNMEKTHIKHISVGIPFLGHRIGRRVVHTKQRYQTQEGWRW

RIKKKVILTMDADMNQLKLRLQQQAYLAGNGDPLPNFGLMSLPQSEANRRMNSILRGY

ANWYQFAGNKRGAIAYLAYVLRSSLAKMFAAKFKLHSLKKVFQIAGNDLGRALEARYP

VGVTDSQVEAWQQSVSGKGTPRDILGFWEISQRNRVRSRLLGHTKKR
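A quick consistency check on the stated length: 743 residues correspond to 2229 coding bases. The sketch below assumes that the ORF frame begins at position 82424 of the GenBank entry (the third base of the intron, if the intron starts at 82422 as the row numbering suggests) and that the standard stop codons apply. Both are inferences made for illustration, not annotations from the entry. The code then inspects the codon immediately after the ORF, in the row numbered 84601.

```python
ORF_START = 82424                    # assumption: inferred from row numbering
ORF_RESIDUES = 743                   # length stated on this page
STOP_CODONS = {"taa", "tag", "tga"}  # assumption: standard genetic code

# Position of the last base of the last sense codon.
orf_end = ORF_START + 3 * ORF_RESIDUES - 1

# Row of the listing that contains the end of the ORF (copied from this page).
ROW_START = 84601
row = (
    "cagtcaacgg aacagggtcc gaagtagact tctcggacac acgaaaaagc gttgattggc"
).replace(" ", "")

# Codon immediately downstream of the last sense codon.
i = orf_end + 1 - ROW_START
next_codon = row[i:i + 3]
print(orf_end, next_codon, next_codon in STOP_CODONS)  # 84652 tga True
```

A stop codon directly after the computed end is what one would expect if both the assumed start position and the 743-residue length are right.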

 



[secondary structure]

 

 
