[Back to mitochondrial introns by organism] [Back to home page]

Information of Marchantia polymorpha cox1.I1 intron   (Format of information for each intron)

[intron sequence]

[intron with flanking sequence]

[ORF sequence]

[secondary structure]

[intron sequence]

 

The boundaries of the intron are marked as red and the ORF is marked as blue.

For many organellar introns, the ORF is translated in frame with the upstream

exon ORF. The precursor protein is then processed to a mature form that cuts 

off all of the exon-encoded and some of the intron-encoded polypeptide. This 

situation is indicated here by blue-coded ORF that extends to the start of 

the intron rather than by a translation start approximately 500 bp after the 

start of the intron.

 

Note:  This intron sequence occurs on the reverse complement of the original

GenBank sequence.  Therefore, all sequences below are reverse complements of

the GenBank sequence.

3' end

                    

           gtgcg acgctatgca tgctataatc gctgggtcaa accgcgccgg gacaattggt 

42121 gctaaacccc acgtaactca gccgaaagtt agtaggagag tgatgtcctg cgttggggta 

42181 gacggtctgg taagtctagg gaaacgaata ctaacgatgc gccctccgtt tctgtggagc 

42241 gtaggcggtt ttctgcctac ggcttacgag tgccgttctc atccgaatat cttactttgt 

42301 ggggagaaag gagcgtccct gagaaccgaa acgaaacgct cgtactgcgg tgatactaca 

42361 agctcacctg atgcgctagg taagacaagc cattctgcgc gatttaggga tgacggcggc 

42421 gagggtgggg ccgggtcacg ccccgggatg agttgtaaac cccatgccaa tggcggacgg 

42481 gatagtgact caactgagaa ccggtcgggt aatgaacccc ctgctgtaac cgtccatatg 

42541 gacggtggtg gttggaggcg gaaactcgaa actctcaaac ggaacgaaaa atccgggaaa 

42601 tttgaaaaca tctactccat atgcacggac ccgaacttct tgattgcagc ctatgaacag 

42661 attaagtccc acacgagtaa catgaccccg gaggggggcg aggaaaggga aaacctcttt 

42721 cttcgtcaag tggcttcgcc cctggaaagt ctagatcggg cttggttcga aagaaccgcc 

42781 gaattacttc gaagcgaaca gtttcgcttc aaactcgcac gtaggattat gatacccacg 

42841 cctaacaagc ccagggaatt cagaccttta actattggga gcgacaatat tgttcagcaa 

42901 gctatgaaaa tagtcatgga acatatatac gaacctaaat tcctcgacac ctcccacggg 

42961 ttccgtcccg gaagagggtg tcactcgggg ttagaacaaa tttgcctgaa atggacagga 

43021 gcgtcgtggt tccttgaatt tgacataaaa aggtgcttca attccatgga tcgacacaag 

43081 ctggtcttca tcttacaaaa ggatatagaa gatcagaggt ggatggatct cgtgcataaa 

43141 ctgttcaccg cgggacttgt tggcggcgag cttggtggtc cagaccccct tcaaggatca 

43201 gtcctctccc cttggtcgag tcccccttgg gccctcgccc ctctattttg taatatttac 

43261 ttgcacgatt tggaccaaga agtggccaag atggctaatg aactctcacg aagccgcaag 

43321 cgaagagtcg acaaaaggac cacggcggct acgaggacgc ccagaacgaa ggcgttcaga 

43381 gcgctgaccc cgcaggcgga aatcatgagg gtccgtagaa aggccgcccg gggactgtcc 

43441 ccgactgata ggaaggaccc caactacgca cgcgcatttt acgttcgata tgcggggaat 

43501 ttcctcctag gcattgccgg acccagggaa ctggtagcca cggtcaagag taggattgtg 

43561 cagttcgtta attcggagct gcacctggaa ctcaccggag gttctatatc ccacataagc 

43621 gccgaaagcg tgaaatttct gggaatggaa atcaaggttg taccctcctc gaaacttcga 

43681 aggagattcg gcaaggccat ggagaagcga cgtagggtta ggaaccgtat ctttacccta 

43741 aaagtacaaa aacggaaacg ccaagactcc ttggtccacg atgccttggt tagggcgctg 

43801 ggtaaattgt cgaggaagca taactctgcg gggctggcaa aactcctgag tcccaaatac 

43861 ccggaaatgg cggagttagc taaagccttg ttaagagaga tgcgagccga taacgagcta 

43921 gtaaccgaca ttcgggactc gcaaaaaaac ttccacctgg ctttagcgag caggctagac 

43981 tttgccccgg atgaggcacg ggaagccatg gagaaccttg agaggaaact cgacaagtgg 

44041 tgcgatgagt gtgaagtccg aacccttttt gataaagata gggagaaggc tcgacgagaa 

44101 cacgtgggaa gatacgaggc gctacctctg caaattttgg ccccattaaa ggagataagg 

44161 aaaaggctta aattgtgggg ccttattacg gagaataaca aaccttgttg tgtggttaga 

44221 ttgattcaac tcaacgacga agacattgtg ctttggttta attccgtagc tcgagaccta 

44281 ttgaattact accgttgttg tgccaatttc tacaaggtac gagactacgt ggattatttc 

44341 ttacgctggt ccttgataca cacaatggcc gggaaacata agacatcggc gacgaagctc 

44401 atcagggcat tatcgataga tatggtcata aaggataacg agggggaaaa aataataagc 

44461 tttctaagct ccaacgaaat tcgacggatg ggaagaatgt tcttgagggg catcccgtgt 

44521 gactctgata tgcggactct ggaccgtatt tacgccacgt tcaggcgctc ccgtgcattg 

44581 cggtgctctg tgaaggggcg cttcgaggat aaggtggaaa tgcaccatgt gcgcaagagg 

44641 ctcctggatg cttttgggag gataacagta gtgaccaaaa agaataagag gggttacggg 

44701 aatggatgcg ttcaaggttg ccataaatag aaagcaaata cctttgtgta cgtatcatca 

44761 cgataaattg cacaagaggg aattgaagtt cgggggactg gattgggagt aggaccgata 

44821 aaattccaag tgtatacgtc caatcgagag taggttcttg gagagccgtg tgatggaaga 

44881 ctatcaagca cggttccacg agaagggcta atcctttact cgac

 

                                                                   

[top]


 

 

[intron& flanking sequence]

 

Genbank entry, intron is marked as red

 

                                                                                                              aagtt ttgtgtgttt 

41581 tgttatattt agtcaaaaaa catttggtga aactataaaa gctatttttg atgcgagaag 

41641 cgaagctctt ctttccgatt tacagcaatg gatgagttat caagaagcta tgttgtccga 

41701 attaaaaaaa cagcatgaat tacgtagtat aagcttgcgt tcaagtacac aaatgattgg 

41761 agaatcatgt ataaatgata tggttacgcg ctgtgcgcca aagtgcaaac agacagtaaa 

41821 atctgtgtta tgccaacaaa tagagcaaaa gttaaaaaca ctgttagcta ttcaagagca 

41881 ttctcgtatc agtttacagg agaagatagt aacttgtttt cgcgaaacag tttgtgacga 

41941 atttcgcttt tccaaattgc gaaaacatca gtcaaaacta gttcaacaaa gcatggtatt 

42001 attgaaagat ggggttccga aatgaacaat tttgcacaaa gatggctgtt ttccacgaac 

42061 cacaagtgcg acgctatgca tgctataatc gctgggtcaa accgcgccgg gacaattggt 

42121 gctaaacccc acgtaactca gccgaaagtt agtaggagag tgatgtcctg cgttggggta 

42181 gacggtctgg taagtctagg gaaacgaata ctaacgatgc gccctccgtt tctgtggagc 

42241 gtaggcggtt ttctgcctac ggcttacgag tgccgttctc atccgaatat cttactttgt 

42301 ggggagaaag gagcgtccct gagaaccgaa acgaaacgct cgtactgcgg tgatactaca 

42361 agctcacctg atgcgctagg taagacaagc cattctgcgc gatttaggga tgacggcggc 

42421 gagggtgggg ccgggtcacg ccccgggatg agttgtaaac cccatgccaa tggcggacgg 

42481 gatagtgact caactgagaa ccggtcgggt aatgaacccc ctgctgtaac cgtccatatg 

42541 gacggtggtg gttggaggcg gaaactcgaa actctcaaac ggaacgaaaa atccgggaaa 

42601 tttgaaaaca tctactccat atgcacggac ccgaacttct tgattgcagc ctatgaacag 

42661 attaagtccc acacgagtaa catgaccccg gaggggggcg aggaaaggga aaacctcttt 

42721 cttcgtcaag tggcttcgcc cctggaaagt ctagatcggg cttggttcga aagaaccgcc 

42781 gaattacttc gaagcgaaca gtttcgcttc aaactcgcac gtaggattat gatacccacg 

42841 cctaacaagc ccagggaatt cagaccttta actattggga gcgacaatat tgttcagcaa 

42901 gctatgaaaa tagtcatgga acatatatac gaacctaaat tcctcgacac ctcccacggg 

42961 ttccgtcccg gaagagggtg tcactcgggg ttagaacaaa tttgcctgaa atggacagga 

43021 gcgtcgtggt tccttgaatt tgacataaaa aggtgcttca attccatgga tcgacacaag 

43081 ctggtcttca tcttacaaaa ggatatagaa gatcagaggt ggatggatct cgtgcataaa 

43141 ctgttcaccg cgggacttgt tggcggcgag cttggtggtc cagaccccct tcaaggatca 

43201 gtcctctccc cttggtcgag tcccccttgg gccctcgccc ctctattttg taatatttac 

43261 ttgcacgatt tggaccaaga agtggccaag atggctaatg aactctcacg aagccgcaag 

43321 cgaagagtcg acaaaaggac cacggcggct acgaggacgc ccagaacgaa ggcgttcaga 

43381 gcgctgaccc cgcaggcgga aatcatgagg gtccgtagaa aggccgcccg gggactgtcc 

43441 ccgactgata ggaaggaccc caactacgca cgcgcatttt acgttcgata tgcggggaat 

43501 ttcctcctag gcattgccgg acccagggaa ctggtagcca cggtcaagag taggattgtg 

43561 cagttcgtta attcggagct gcacctggaa ctcaccggag gttctatatc ccacataagc 

43621 gccgaaagcg tgaaatttct gggaatggaa atcaaggttg taccctcctc gaaacttcga 

43681 aggagattcg gcaaggccat ggagaagcga cgtagggtta ggaaccgtat ctttacccta 

43741 aaagtacaaa aacggaaacg ccaagactcc ttggtccacg atgccttggt tagggcgctg 

43801 ggtaaattgt cgaggaagca taactctgcg gggctggcaa aactcctgag tcccaaatac 

43861 ccggaaatgg cggagttagc taaagccttg ttaagagaga tgcgagccga taacgagcta 

43921 gtaaccgaca ttcgggactc gcaaaaaaac ttccacctgg ctttagcgag caggctagac 

43981 tttgccccgg atgaggcacg ggaagccatg gagaaccttg agaggaaact cgacaagtgg 

44041 tgcgatgagt gtgaagtccg aacccttttt gataaagata gggagaaggc tcgacgagaa 

44101 cacgtgggaa gatacgaggc gctacctctg caaattttgg ccccattaaa ggagataagg 

44161 aaaaggctta aattgtgggg ccttattacg gagaataaca aaccttgttg tgtggttaga 

44221 ttgattcaac tcaacgacga agacattgtg ctttggttta attccgtagc tcgagaccta 

44281 ttgaattact accgttgttg tgccaatttc tacaaggtac gagactacgt ggattatttc 

44341 ttacgctggt ccttgataca cacaatggcc gggaaacata agacatcggc gacgaagctc 

44401 atcagggcat tatcgataga tatggtcata aaggataacg agggggaaaa aataataagc 

44461 tttctaagct ccaacgaaat tcgacggatg ggaagaatgt tcttgagggg catcccgtgt 

44521 gactctgata tgcggactct ggaccgtatt tacgccacgt tcaggcgctc ccgtgcattg 

44581 cggtgctctg tgaaggggcg cttcgaggat aaggtggaaa tgcaccatgt gcgcaagagg 

44641 ctcctggatg cttttgggag gataacagta gtgaccaaaa agaataagag gggttacggg 

44701 aatggatgcg ttcaaggttg ccataaatag aaagcaaata cctttgtgta cgtatcatca 

44761 cgataaattg cacaagaggg aattgaagtt cgggggactg gattgggagt aggaccgata 

44821 aaattccaag tgtatacgtc caatcgagag taggttcttg gagagccgtg tgatggaaga 

44881 ctatcaagca cggttccacg agaagggcta atcctttact cgacagatat aggtactcta 

44941 tatttaattt tcggtgccat tgctggagta atgggtacat gcttttcagt actaattcgt 

45001 atggaattag cacaacccgg caaccaaatt cttggtggaa atcatcaact ttataatggt 

45061 gcgcccggat atacatcgtc cgacaagaag agctatccac tcttctttgt gccatccagg 

45121 ctagctaaca agctagggca caggctcaac ttgcagaggc gccatagtaa tatggcttac 

45181 tcgggagcag caagtttcaa tcggggaacg gctgggctgc cacggctagg acgtcctgag 

45241 agagcagaag gtagggccag gataactgac ttgacacgca atgctcgtat tacagtagac 

45301 ctcttgtctc aagcagcaca ttatggcaag agcacgataa gtaagcctct acggaggtca 

45361 gaactgtgca caatacattg tgtgtgcaca gtccctggaa aaatgtgggg cactctacac 

45421 agat                                                                

 

[top]


[ORF sequence]

 

914 a.a.

Note: Published size is 887 amino acids, a single frameshift replaces the last 16 amino acids with

43 amino acids with similarity to other Zn domains

 

CDAMHAIIAGSNRAGTIGAKPHVTQPKVSRRVMSCVGVDGLVSLGKRILTMRPPFLWS

VGGFLPTAYECRSHPNILLCGEKGASLRTETKRSYCGDTTSSPDALGKTSHSARFRDD

GGEGGAGSRPGMSCKPHANGGRDSDSTENRSGNEPPAVTVHMDGGGWRRKLETLKRNE

KSGKFENIYSICTDPNFLIAAYEQIKSHTSNMTPEGGEERENLFLRQVASPLESLDRA

WFERTAELLRSEQFRFKLARRIMIPTPNKPREFRPLTIGSDNIVQQAMKIVMEHIYEP

KFLDTSHGFRPGRGCHSGLEQICLKWTGASWFLEFDIKRCFNSMDRHKLVFILQKDIE

DQRWMDLVHKLFTAGLVGGELGGPDPLQGSVLSPWSSPPWALAPLFCNIYLHDLDQEV

AKMANELSRSRKRRVDKRTTAATRTPRTKAFRALTPQAEIMRVRRKAARGLSPTDRKD

PNYARAFYVRYAGNFLLGIAGPRELVATVKSRIVQFVNSELHLELTGGSISHISAESV

KFLGMEIKVVPSSKLRRRFGKAMEKRRRVRNRIFTLKVQKRKRQDSLVHDALVRALGK

LSRKHNSAGLAKLLSPKYPEMAELAKALLREMRADNELVTDIRDSQKNFHLALASRLD

FAPDEAREAMENLERKLDKWCDECEVRTLFDKDREKARREHVGRYEALPLQILAPLKE

IRKRLKLWGLITENNKPCCVVRLIQLNDEDIVLWFNSVARDLLNYYRCCANFYKVRDY

VDYFLRWSLIHTMAGKHKTSATKLIRALSIDMVIKDNEGEKIISFLSSNEIRRMGRMF

LRGIPCDSDMRTLDRIYATFRRSRALRCSVKGRFEDKVEMHHVRKRLLDAFGRITVVT

KRIRGVTGMDAFKVAINRKQIPLCTYHHDKLHKRELKFGGLDWE

 

[top]


[secondary structure]

 

 

[top]