Information on the Marchantia polymorpha cox1.I1 intron
[intron with flanking sequence]
The boundaries of the intron are marked in red and the ORF is marked in blue.
For many organellar introns, the ORF is translated in frame with the upstream
exon ORF. The precursor protein is then processed to a mature form, removing
all of the exon-encoded and some of the intron-encoded polypeptide. This
situation is indicated here by a blue-coded ORF that extends to the start of
the intron, rather than by a translation start approximately 500 bp after the
start of the intron.
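As a rough illustration of reading the ORF in frame from the start of the intron, the following sketch translates the first few codons of the sequence shown on this page. It assumes the standard genetic code (the page does not name a translation table) and assumes the frame begins at the first tgc codon of the listing; the names translate and CODON_TABLE are illustrative and not part of the database.

```python
from itertools import product

# Standard genetic code in TCAG order (an assumption; the page names no table).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate DNA codon by codon, ignoring any trailing partial codon."""
    dna = dna.upper()
    usable = len(dna) - len(dna) % 3
    # Unknown codons (for example, ones containing N) are reported as X.
    return "".join(CODON_TABLE.get(dna[i:i + 3], "X") for i in range(0, usable, 3))


# First 27 bases of the assumed reading frame, copied from the listing below.
orf_start = "tgcgacgctatgcatgctataatcgct"
print(translate(orf_start))  # CDAMHAIIA
```

The printed residues agree with the first nine residues of the protein sequence given at the end of this page, which is consistent with the ORF being read from the start of the intron.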
Note: This intron sequence occurs on the reverse complement of the original
GenBank sequence. Therefore, all sequences below are reverse complements of
the GenBank sequence.
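For readers who want to move between the two strands, a minimal sketch of a reverse-complement helper is given here. It assumes plain a/c/g/t input without ambiguity codes; reverse_complement is an illustrative name, not a function from the database.

```python
# Complement map for unambiguous DNA bases in either case.
_COMPLEMENT = str.maketrans("acgtACGT", "tgcaTGCA")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA string."""
    # Complement every base, then reverse the result.
    return seq.translate(_COMPLEMENT)[::-1]


print(reverse_complement("gtgcg"))  # cgcac
```

Applying the function twice returns the original string, which is a quick sanity check when comparing a listing against its source record.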
3' end
gtgcg acgctatgca tgctataatc gctgggtcaa accgcgccgg gacaattggt
42121 gctaaacccc acgtaactca gccgaaagtt agtaggagag tgatgtcctg cgttggggta
42181 gacggtctgg taagtctagg gaaacgaata ctaacgatgc gccctccgtt tctgtggagc
42241 gtaggcggtt ttctgcctac ggcttacgag tgccgttctc atccgaatat cttactttgt
42301 ggggagaaag gagcgtccct gagaaccgaa acgaaacgct cgtactgcgg tgatactaca
42361 agctcacctg atgcgctagg taagacaagc cattctgcgc gatttaggga tgacggcggc
42421 gagggtgggg ccgggtcacg ccccgggatg agttgtaaac cccatgccaa tggcggacgg
42481 gatagtgact caactgagaa ccggtcgggt aatgaacccc ctgctgtaac cgtccatatg
42541 gacggtggtg gttggaggcg gaaactcgaa actctcaaac ggaacgaaaa atccgggaaa
42601 tttgaaaaca tctactccat atgcacggac ccgaacttct tgattgcagc ctatgaacag
42661 attaagtccc acacgagtaa catgaccccg gaggggggcg aggaaaggga aaacctcttt
42721 cttcgtcaag tggcttcgcc cctggaaagt ctagatcggg cttggttcga aagaaccgcc
42781 gaattacttc gaagcgaaca gtttcgcttc aaactcgcac gtaggattat gatacccacg
42841 cctaacaagc ccagggaatt cagaccttta actattggga gcgacaatat tgttcagcaa
42901 gctatgaaaa tagtcatgga acatatatac gaacctaaat tcctcgacac ctcccacggg
42961 ttccgtcccg gaagagggtg tcactcgggg ttagaacaaa tttgcctgaa atggacagga
43021 gcgtcgtggt tccttgaatt tgacataaaa aggtgcttca attccatgga tcgacacaag
43081 ctggtcttca tcttacaaaa ggatatagaa gatcagaggt ggatggatct cgtgcataaa
43141 ctgttcaccg cgggacttgt tggcggcgag cttggtggtc cagaccccct tcaaggatca
43201 gtcctctccc cttggtcgag tcccccttgg gccctcgccc ctctattttg taatatttac
43261 ttgcacgatt tggaccaaga agtggccaag atggctaatg aactctcacg aagccgcaag
43321 cgaagagtcg acaaaaggac cacggcggct acgaggacgc ccagaacgaa ggcgttcaga
43381 gcgctgaccc cgcaggcgga aatcatgagg gtccgtagaa aggccgcccg gggactgtcc
43441 ccgactgata ggaaggaccc caactacgca cgcgcatttt acgttcgata tgcggggaat
43501 ttcctcctag gcattgccgg acccagggaa ctggtagcca cggtcaagag taggattgtg
43561 cagttcgtta attcggagct gcacctggaa ctcaccggag gttctatatc ccacataagc
43621 gccgaaagcg tgaaatttct gggaatggaa atcaaggttg taccctcctc gaaacttcga
43681 aggagattcg gcaaggccat ggagaagcga cgtagggtta ggaaccgtat ctttacccta
43741 aaagtacaaa aacggaaacg ccaagactcc ttggtccacg atgccttggt tagggcgctg
43801 ggtaaattgt cgaggaagca taactctgcg gggctggcaa aactcctgag tcccaaatac
43861 ccggaaatgg cggagttagc taaagccttg ttaagagaga tgcgagccga taacgagcta
43921 gtaaccgaca ttcgggactc gcaaaaaaac ttccacctgg ctttagcgag caggctagac
43981 tttgccccgg atgaggcacg ggaagccatg gagaaccttg agaggaaact cgacaagtgg
44041 tgcgatgagt gtgaagtccg aacccttttt gataaagata gggagaaggc tcgacgagaa
44101 cacgtgggaa gatacgaggc gctacctctg caaattttgg ccccattaaa ggagataagg
44161 aaaaggctta aattgtgggg ccttattacg gagaataaca aaccttgttg tgtggttaga
44221 ttgattcaac tcaacgacga agacattgtg ctttggttta attccgtagc tcgagaccta
44281 ttgaattact accgttgttg tgccaatttc tacaaggtac gagactacgt ggattatttc
44341 ttacgctggt ccttgataca cacaatggcc gggaaacata agacatcggc gacgaagctc
44401 atcagggcat tatcgataga tatggtcata aaggataacg agggggaaaa aataataagc
44461 tttctaagct ccaacgaaat tcgacggatg ggaagaatgt tcttgagggg catcccgtgt
44521 gactctgata tgcggactct ggaccgtatt tacgccacgt tcaggcgctc ccgtgcattg
44581 cggtgctctg tgaaggggcg cttcgaggat aaggtggaaa tgcaccatgt gcgcaagagg
44641 ctcctggatg cttttgggag gataacagta gtgaccaaaa agaataagag gggttacggg
44701 aatggatgcg ttcaaggttg ccataaatag aaagcaaata cctttgtgta cgtatcatca
44761 cgataaattg cacaagaggg aattgaagtt cgggggactg gattgggagt aggaccgata
44821 aaattccaag tgtatacgtc caatcgagag taggttcttg gagagccgtg tgatggaaga
44881 ctatcaagca cggttccacg agaagggcta atcctttact cgac
GenBank entry; the intron is marked in red.
aagtt ttgtgtgttt
41581 tgttatattt agtcaaaaaa catttggtga aactataaaa gctatttttg atgcgagaag
41641 cgaagctctt ctttccgatt tacagcaatg gatgagttat caagaagcta tgttgtccga
41701 attaaaaaaa cagcatgaat tacgtagtat aagcttgcgt tcaagtacac aaatgattgg
41761 agaatcatgt ataaatgata tggttacgcg ctgtgcgcca aagtgcaaac agacagtaaa
41821 atctgtgtta tgccaacaaa tagagcaaaa gttaaaaaca ctgttagcta ttcaagagca
41881 ttctcgtatc agtttacagg agaagatagt aacttgtttt cgcgaaacag tttgtgacga
41941 atttcgcttt tccaaattgc gaaaacatca gtcaaaacta gttcaacaaa gcatggtatt
42001 attgaaagat ggggttccga aatgaacaat tttgcacaaa gatggctgtt ttccacgaac
42061 cacaagtgcg acgctatgca tgctataatc gctgggtcaa accgcgccgg gacaattggt
42121 gctaaacccc acgtaactca gccgaaagtt agtaggagag tgatgtcctg cgttggggta
42181 gacggtctgg taagtctagg gaaacgaata ctaacgatgc gccctccgtt tctgtggagc
42241 gtaggcggtt ttctgcctac ggcttacgag tgccgttctc atccgaatat cttactttgt
42301 ggggagaaag gagcgtccct gagaaccgaa acgaaacgct cgtactgcgg tgatactaca
42361 agctcacctg atgcgctagg taagacaagc cattctgcgc gatttaggga tgacggcggc
42421 gagggtgggg ccgggtcacg ccccgggatg agttgtaaac cccatgccaa tggcggacgg
42481 gatagtgact caactgagaa ccggtcgggt aatgaacccc ctgctgtaac cgtccatatg
42541 gacggtggtg gttggaggcg gaaactcgaa actctcaaac ggaacgaaaa atccgggaaa
42601 tttgaaaaca tctactccat atgcacggac ccgaacttct tgattgcagc ctatgaacag
42661 attaagtccc acacgagtaa catgaccccg gaggggggcg aggaaaggga aaacctcttt
42721 cttcgtcaag tggcttcgcc cctggaaagt ctagatcggg cttggttcga aagaaccgcc
42781 gaattacttc gaagcgaaca gtttcgcttc aaactcgcac gtaggattat gatacccacg
42841 cctaacaagc ccagggaatt cagaccttta actattggga gcgacaatat tgttcagcaa
42901 gctatgaaaa tagtcatgga acatatatac gaacctaaat tcctcgacac ctcccacggg
42961 ttccgtcccg gaagagggtg tcactcgggg ttagaacaaa tttgcctgaa atggacagga
43021 gcgtcgtggt tccttgaatt tgacataaaa aggtgcttca attccatgga tcgacacaag
43081 ctggtcttca tcttacaaaa ggatatagaa gatcagaggt ggatggatct cgtgcataaa
43141 ctgttcaccg cgggacttgt tggcggcgag cttggtggtc cagaccccct tcaaggatca
43201 gtcctctccc cttggtcgag tcccccttgg gccctcgccc ctctattttg taatatttac
43261 ttgcacgatt tggaccaaga agtggccaag atggctaatg aactctcacg aagccgcaag
43321 cgaagagtcg acaaaaggac cacggcggct acgaggacgc ccagaacgaa ggcgttcaga
43381 gcgctgaccc cgcaggcgga aatcatgagg gtccgtagaa aggccgcccg gggactgtcc
43441 ccgactgata ggaaggaccc caactacgca cgcgcatttt acgttcgata tgcggggaat
43501 ttcctcctag gcattgccgg acccagggaa ctggtagcca cggtcaagag taggattgtg
43561 cagttcgtta attcggagct gcacctggaa ctcaccggag gttctatatc ccacataagc
43621 gccgaaagcg tgaaatttct gggaatggaa atcaaggttg taccctcctc gaaacttcga
43681 aggagattcg gcaaggccat ggagaagcga cgtagggtta ggaaccgtat ctttacccta
43741 aaagtacaaa aacggaaacg ccaagactcc ttggtccacg atgccttggt tagggcgctg
43801 ggtaaattgt cgaggaagca taactctgcg gggctggcaa aactcctgag tcccaaatac
43861 ccggaaatgg cggagttagc taaagccttg ttaagagaga tgcgagccga taacgagcta
43921 gtaaccgaca ttcgggactc gcaaaaaaac ttccacctgg ctttagcgag caggctagac
43981 tttgccccgg atgaggcacg ggaagccatg gagaaccttg agaggaaact cgacaagtgg
44041 tgcgatgagt gtgaagtccg aacccttttt gataaagata gggagaaggc tcgacgagaa
44101 cacgtgggaa gatacgaggc gctacctctg caaattttgg ccccattaaa ggagataagg
44161 aaaaggctta aattgtgggg ccttattacg gagaataaca aaccttgttg tgtggttaga
44221 ttgattcaac tcaacgacga agacattgtg ctttggttta attccgtagc tcgagaccta
44281 ttgaattact accgttgttg tgccaatttc tacaaggtac gagactacgt ggattatttc
44341 ttacgctggt ccttgataca cacaatggcc gggaaacata agacatcggc gacgaagctc
44401 atcagggcat tatcgataga tatggtcata aaggataacg agggggaaaa aataataagc
44461 tttctaagct ccaacgaaat tcgacggatg ggaagaatgt tcttgagggg catcccgtgt
44521 gactctgata tgcggactct ggaccgtatt tacgccacgt tcaggcgctc ccgtgcattg
44581 cggtgctctg tgaaggggcg cttcgaggat aaggtggaaa tgcaccatgt gcgcaagagg
44641 ctcctggatg cttttgggag gataacagta gtgaccaaaa agaataagag gggttacggg
44701 aatggatgcg ttcaaggttg ccataaatag aaagcaaata cctttgtgta cgtatcatca
44761 cgataaattg cacaagaggg aattgaagtt cgggggactg gattgggagt aggaccgata
44821 aaattccaag tgtatacgtc caatcgagag taggttcttg gagagccgtg tgatggaaga
44881 ctatcaagca cggttccacg agaagggcta atcctttact cgacagatat aggtactcta
44941 tatttaattt tcggtgccat tgctggagta atgggtacat gcttttcagt actaattcgt
45001 atggaattag cacaacccgg caaccaaatt cttggtggaa atcatcaact ttataatggt
45061 gcgcccggat atacatcgtc cgacaagaag agctatccac tcttctttgt gccatccagg
45121 ctagctaaca agctagggca caggctcaac ttgcagaggc gccatagtaa tatggcttac
45181 tcgggagcag caagtttcaa tcggggaacg gctgggctgc cacggctagg acgtcctgag
45241 agagcagaag gtagggccag gataactgac ttgacacgca atgctcgtat tacagtagac
45301 ctcttgtctc aagcagcaca ttatggcaag agcacgataa gtaagcctct acggaggtca
45361 gaactgtgca caatacattg tgtgtgcaca gtccctggaa aaatgtgggg cactctacac
45421 agat
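The listings above use a coordinate at the start of each line followed by blocks of ten bases. A small sketch for turning such lines into one continuous string is shown here; it assumes every line holds an optional leading coordinate followed by base letters, and parse_listing is a hypothetical helper name. The two sample lines are shortened copies of lines from this page, used only to show the mechanics.

```python
import re


def parse_listing(lines):
    """Join sequence lines, dropping leading coordinates and spacing."""
    chunks = []
    for line in lines:
        # Remove an optional leading coordinate such as "42121 ".
        body = re.sub(r"^\s*\d+\s+", "", line)
        # Keep only base letters; spaces between blocks are discarded.
        chunks.append(re.sub(r"[^acgtnACGTN]", "", body))
    return "".join(chunks)


sample = [
    "gtgcg acgctatgca",
    "42121 gctaaacccc acgtaactca",
]
print(parse_listing(sample))
```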
914 a.a.
Note: The published size is 887 amino acids; a single frameshift replaces the last 16 amino acids with
43 amino acids that show similarity to other Zn domains.
CDAMHAIIAGSNRAGTIGAKPHVTQPKVSRRVMSCVGVDGLVSLGKRILTMRPPFLWS
VGGFLPTAYECRSHPNILLCGEKGASLRTETKRSYCGDTTSSPDALGKTSHSARFRDD
GGEGGAGSRPGMSCKPHANGGRDSDSTENRSGNEPPAVTVHMDGGGWRRKLETLKRNE
KSGKFENIYSICTDPNFLIAAYEQIKSHTSNMTPEGGEERENLFLRQVASPLESLDRA
WFERTAELLRSEQFRFKLARRIMIPTPNKPREFRPLTIGSDNIVQQAMKIVMEHIYEP
KFLDTSHGFRPGRGCHSGLEQICLKWTGASWFLEFDIKRCFNSMDRHKLVFILQKDIE
DQRWMDLVHKLFTAGLVGGELGGPDPLQGSVLSPWSSPPWALAPLFCNIYLHDLDQEV
AKMANELSRSRKRRVDKRTTAATRTPRTKAFRALTPQAEIMRVRRKAARGLSPTDRKD
PNYARAFYVRYAGNFLLGIAGPRELVATVKSRIVQFVNSELHLELTGGSISHISAESV
KFLGMEIKVVPSSKLRRRFGKAMEKRRRVRNRIFTLKVQKRKRQDSLVHDALVRALGK
LSRKHNSAGLAKLLSPKYPEMAELAKALLREMRADNELVTDIRDSQKNFHLALASRLD
FAPDEAREAMENLERKLDKWCDECEVRTLFDKDREKARREHVGRYEALPLQILAPLKE
IRKRLKLWGLITENNKPCCVVRLIQLNDEDIVLWFNSVARDLLNYYRCCANFYKVRDY
VDYFLRWSLIHTMAGKHKTSATKLIRALSIDMVIKDNEGEKIISFLSSNEIRRMGRMF
LRGIPCDSDMRTLDRIYATFRRSRALRCSVKGRFEDKVEMHHVRKRLLDAFGRITVVT
KRIRGVTGMDAFKVAINRKQIPLCTYHHDKLHKRELKFGGLDWE
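The length stated above can be checked against the figures in the note with a line of arithmetic. The variable names below are illustrative; the three numbers are taken from the note.

```python
published_length = 887  # amino acids, published size from the note
removed_tail = 16       # C-terminal residues replaced by the frameshift
added_tail = 43         # residues read in the shifted frame

revised_length = published_length - removed_tail + added_tail
print(revised_length)  # 914
```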