[Back to mitochondrial introns by organism] [Back to home page]

Information of Marchantia polymorpha cox1.I2 intron   (Format of information for each intron)

[intron sequence]

[intron with flanking sequence]

[ORF sequence]

[secondary structure]

[intron sequence]

 

The boundaries of the intron are marked as red and the ORF is marked as blue.

For many organellar introns, the ORF is translated in frame with the upstream

exon ORF. The precursor protein is then processed to a mature form that cuts 

off all of the exon-encoded and some of the intron-encoded polypeptide. This 

situation is indicated here by blue-coded ORF that extends to the start of 

the intron rather than by a translation start approximately 500 bp after the 

start of the intron.

 

Note:  This intron sequence occurs on the reverse complement of the original

GenBank sequence.  Therefore, all sequences below are reverse complements of

the GenBank sequence.

3' end

                    

                                                                     gt 

45061 gcgcccggat atacatcgtc cgacaagaag agctatccac tcttctttgt gccatccagg 

45121 ctagctaaca agctagggca caggctcaac ttgcagaggc gccatagtaa tatggcttac 

45181 tcgggagcag caagtttcaa tcggggaacg gctgggctgc cacggctagg acgtcctgag 

45241 agagcagaag gtagggccag gataactgac ttgacacgca atgctcgtat tacagtagac 

45301 ctcttgtctc aagcagcaca ttatggcaag agcacgataa gtaagcctct acggaggtca 

45361 gaactgtgca caatacattg tgtgtgcaca gtccctggaa aaatgtgggg cactctacac 

45421 agataccagc tgagtaatca gctaattgcg caagggccta acccttcagg atctaagatc 

45481 tttcttggct ttacgaagaa aggatcgaag tgtcttccgg acacgaagaa caaggactat 

45541 gtaggagccg ggaaaataac ccgccctaat cgggatgggt ttcggagggc ccgtaatagc 

45601 tgcggatttt tgcaatccag cctggcagtt aaggagccta gctctcgatc gtactgttct 

45661 atcccggagg tctcggataa cgaaacatcg tctcgttcca acggctccgc ccgcccagcc 

45721 cccaaagctg cctggtccga ccgtgtccgt atgtccgttt cggaacagat tcaagcttat 

45781 ttaggcccgg acaataggta caatgggctg attcatatca tatcagaccc tacatttttg 

45841 gccttgtgct atgaatccat ccgaggtaag ccagggacct ccgggtctga tgcgaaacct 

45901 ttggatggtc cagaatggtt tgtacaagta ggtgagaaac ttaagaaggg ccaatttgag 

45961 ttctcgcccg cgcgaagaat tacaaagccc ggcaaaaagg agaagcggcc cttaggcatc 

46021 aacagccctg tcaaacaaaa aaagtgctac ggggaaaaga tcgtacaaaa agccttacag 

46081 cttgtgcttg aggccatcta tgaacctatt tttctagatt gctcacatgg atttcgtatc 

46141 caccgaagct gtcacacagc attaaagagg ctctgcttag agggaggtca ctatccttgg 

46201 gttgtggagg gaaacattcg aaaatttttc gattcgatac ctcataaagt gatccttcac 

46261 aaaatttcgc aaaaggtcaa gtgtcatcgt acgctcgaat tactccaaag ggctctccgt 

46321 gcgggttaca aggatcctac ttccggtcag gtcataagtt tagacgaagg cacttcgcag 

46381 ggctcggtgc tgagtccgct cctctgcaat atcatactac attacctcga cgaatttgtg 

46441 atgaaacttc gtgatcgatt taacaagggc aaaagtagaa gaattaaccc agagtacaag 

46501 ctattgactc gtcatatgaa tgcgaacaga caggatagat ctctcctgat taagcgcaga 

46561 ttaatcccgt cgaaagatcc cttggaccct tactttcgaa gaattttata tgtacgatat 

46621 gccgatgact ttgtcatttt agtaagcgga actcggttag aaactttcgc gattcaagcg 

46681 tcgttacaaa actttctgca ccggagtctt aggttagagc ttagtttaga gaaaaccgtg 

46741 gtctcacatc ttgcaaacaa gggattccat tttttaggta catactgcaa acgtacccga 

46801 tctcgtcatc ggatcttcca tgtgcgcacg gtaagaggca agacgattaa gcagcgttca 

46861 acagaacgtc ttcgggtttg tgcccccatt acgaaacttt tttacaagtt aaaagagaaa 

46921 ggttttgtaa aacggaatga aatggggaaa tatgtaccca cggcgaggag gaatctaacc 

46981 ccattggatc atgcagacat tttagaatta tacaatcaga aagtccgggg aaccctgaat 

47041 tactactcgt tcgcgtcgaa tcgcagtagt cttaatcaga ttgtacatgt gttgcatatg 

47101 tcctgtgccc tgactcttgc tttaaaatac aaactgaaga cagctagtaa gacttttaac 

47161 cgcttcggta agtgccttac ctgtcctgct acgggtatga gtctatttcg accaagcgcc 

47221 tacaaagcca tacacttata caatcctagt ccgattgccc gggctgagca ggttattgat 

47281 attagctaga ggtcgacccg atccgctctt tttagaacat gtgctctttg cggatcaacg 

47341 aaagtcgaga tgcatcatat ccgttatgtc aaagacgtta aggccaggat ccgatacgtc 

47401 acttgcgctt attctgagtg gaagggcgcc ttgaaacgga agcagacccc cctctgttct 

47461 caccatcaca gtgtgtatca caacggccag ttatctgagt cggaaatgtt agcactgtct 

47521 ttgtacacaa tccatggaga gatgttcaat tataagattg aaagttcata gcaataagtg 

47581 gagaccggag ttgcgagccg tttgaggtga aagcctcacg tacggttctg agggcagctg 

47641 tgttgaaaga cctgactgac ccctac

 

                                                                   

[top]


 

 

[intron& flanking sequence]

 

Genbank entry, intron is marked as red

 

                                               gt tcaggcgctc ccgtgcattg 

44581 cggtgctctg tgaaggggcg cttcgaggat aaggtggaaa tgcaccatgt gcgcaagagg 

44641 ctcctggatg cttttgggag gataacagta gtgaccaaaa agaataagag gggttacggg 

44701 aatggatgcg ttcaaggttg ccataaatag aaagcaaata cctttgtgta cgtatcatca 

44761 cgataaattg cacaagaggg aattgaagtt cgggggactg gattgggagt aggaccgata 

44821 aaattccaag tgtatacgtc caatcgagag taggttcttg gagagccgtg tgatggaaga 

44881 ctatcaagca cggttccacg agaagggcta atcctttact cgacagatat aggtactcta 

44941 tatttaattt tcggtgccat tgctggagta atgggtacat gcttttcagt actaattcgt 

45001 atggaattag cacaacccgg caaccaaatt cttggtggaa atcatcaact ttataatggt 

45061 gcgcccggat atacatcgtc cgacaagaag agctatccac tcttctttgt gccatccagg 

45121 ctagctaaca agctagggca caggctcaac ttgcagaggc gccatagtaa tatggcttac 

45181 tcgggagcag caagtttcaa tcggggaacg gctgggctgc cacggctagg acgtcctgag 

45241 agagcagaag gtagggccag gataactgac ttgacacgca atgctcgtat tacagtagac 

45301 ctcttgtctc aagcagcaca ttatggcaag agcacgataa gtaagcctct acggaggtca 

45361 gaactgtgca caatacattg tgtgtgcaca gtccctggaa aaatgtgggg cactctacac 

45421 agataccagc tgagtaatca gctaattgcg caagggccta acccttcagg atctaagatc 

45481 tttcttggct ttacgaagaa aggatcgaag tgtcttccgg acacgaagaa caaggactat 

45541 gtaggagccg ggaaaataac ccgccctaat cgggatgggt ttcggagggc ccgtaatagc 

45601 tgcggatttt tgcaatccag cctggcagtt aaggagccta gctctcgatc gtactgttct 

45661 atcccggagg tctcggataa cgaaacatcg tctcgttcca acggctccgc ccgcccagcc 

45721 cccaaagctg cctggtccga ccgtgtccgt atgtccgttt cggaacagat tcaagcttat 

45781 ttaggcccgg acaataggta caatgggctg attcatatca tatcagaccc tacatttttg 

45841 gccttgtgct atgaatccat ccgaggtaag ccagggacct ccgggtctga tgcgaaacct 

45901 ttggatggtc cagaatggtt tgtacaagta ggtgagaaac ttaagaaggg ccaatttgag 

45961 ttctcgcccg cgcgaagaat tacaaagccc ggcaaaaagg agaagcggcc cttaggcatc 

46021 aacagccctg tcaaacaaaa aaagtgctac ggggaaaaga tcgtacaaaa agccttacag 

46081 cttgtgcttg aggccatcta tgaacctatt tttctagatt gctcacatgg atttcgtatc 

46141 caccgaagct gtcacacagc attaaagagg ctctgcttag agggaggtca ctatccttgg 

46201 gttgtggagg gaaacattcg aaaatttttc gattcgatac ctcataaagt gatccttcac 

46261 aaaatttcgc aaaaggtcaa gtgtcatcgt acgctcgaat tactccaaag ggctctccgt 

46321 gcgggttaca aggatcctac ttccggtcag gtcataagtt tagacgaagg cacttcgcag 

46381 ggctcggtgc tgagtccgct cctctgcaat atcatactac attacctcga cgaatttgtg 

46441 atgaaacttc gtgatcgatt taacaagggc aaaagtagaa gaattaaccc agagtacaag 

46501 ctattgactc gtcatatgaa tgcgaacaga caggatagat ctctcctgat taagcgcaga 

46561 ttaatcccgt cgaaagatcc cttggaccct tactttcgaa gaattttata tgtacgatat 

46621 gccgatgact ttgtcatttt agtaagcgga actcggttag aaactttcgc gattcaagcg 

46681 tcgttacaaa actttctgca ccggagtctt aggttagagc ttagtttaga gaaaaccgtg 

46741 gtctcacatc ttgcaaacaa gggattccat tttttaggta catactgcaa acgtacccga 

46801 tctcgtcatc ggatcttcca tgtgcgcacg gtaagaggca agacgattaa gcagcgttca 

46861 acagaacgtc ttcgggtttg tgcccccatt acgaaacttt tttacaagtt aaaagagaaa 

46921 ggttttgtaa aacggaatga aatggggaaa tatgtaccca cggcgaggag gaatctaacc 

46981 ccattggatc atgcagacat tttagaatta tacaatcaga aagtccgggg aaccctgaat 

47041 tactactcgt tcgcgtcgaa tcgcagtagt cttaatcaga ttgtacatgt gttgcatatg 

47101 tcctgtgccc tgactcttgc tttaaaatac aaactgaaga cagctagtaa gacttttaac 

47161 cgcttcggta agtgccttac ctgtcctgct acgggtatga gtctatttcg accaagcgcc 

47221 tacaaagcca tacacttata caatcctagt ccgattgccc gggctgagca ggttattgat 

47281 attagctaga ggtcgacccg atccgctctt tttagaacat gtgctctttg cggatcaacg 

47341 aaagtcgaga tgcatcatat ccgttatgtc aaagacgtta aggccaggat ccgatacgtc 

47401 acttgcgctt attctgagtg gaagggcgcc ttgaaacgga agcagacccc cctctgttct 

47461 caccatcaca gtgtgtatca caacggccag ttatctgagt cggaaatgtt agcactgtct 

47521 ttgtacacaa tccatggaga gatgttcaat tataagattg aaagttcata gcaataagtg 

47581 gagaccggag ttgcgagccg tttgaggtga aagcctcacg tacggttctg agggcagctg 

47641 tgttgaaaga cctgactgac ccctactgtt aataacagct cacgcttttt taatgatctt 

47701 ctttatggtt atgccggcga tgataggtgg ttttggtaat tggtttgttc ctattcttat 

47761 aggaagtccg gatatggcat tccctagatt aaataatatt tcattttggc ttttgccacc 

47821 gtcattgtta cttcttctaa gctcagcctt agtagaggtg ggttgctaaa gccgcagccc 

47881 accccaaaaa cttcactata tgctggaaat cttatggaca atcagcaggg aagtagaaaa 

47941 taccccctca gagactatac gtgaagcgag cttccagctt aaagaaatag tccaagaaac 

48001 tccagaagag cctagttcgg gcaaatatca ggcaccaggt gataagcttg atacttattt 

48061 ggctggttaa ggggatggtt ctctaataac tcctgaaagc gataggggct gtggggggtc 

48121 ggtctcccaa cgtaaagatt gtgttccgtt gtgaagatga gccctt                                                                

 

[top]


[ORF sequence]

 

835 a.a.

Note: Published size is 742 amino acids; a single termination readthrough adds 94 amino acids

with similarity to the Zn domain 

 

APGYTSSDKKSYPLFFVPSRLANKLGHRLNLQRRHSNMAYSGAASFNRGTAGLPRLGR

PERAEGRARITDLTRNARITVDLLSQAAHYGKSTISKPLRRSELCTIHCVCTVPGKMW

GTLHRYQLSNQLIAQGPNPSGSKIFLGFTKKGSKCLPDTKNKDYVGAGKITRPNRDGF

RRARNSCGFLQSSLAVKEPSSRSYCSIPEVSDNETSSRSNGSARPAPKAAWSDRVRMS

VSEQIQAYLGPDNRYNGLIHIISDPTFLALCYESIRGKPGTSGSDAKPLDGPEWFVQV

GEKLKKGQFEFSPARRITKPGKKEKRPLGINSPVKQKKCYGEKIVQKALQLVLEAIYE

PIFLDCSHGFRIHRSCHTALKRLCLEGGHYPWVVEGNIRKFFDSIPHKVILHKISQKV

KCHRTLELLQRALRAGYKDPTSGQVISLDEGTSQGSVLSPLLCNIILHYLDEFVMKLR

DRFNKGKSRRINPEYKLLTRHMNANRQDRSLLIKRRLIPSKDPLDPYFRRILYVRYAD

DFVILVSGTRLETFAIQASLQNFLHRSLRLELSLEKTVVSHLANKGFHFLGTYCKRTR

SRHRIFHVRTVRGKTIKQRSTERLRVCAPITKLFYKLKEKGFVKRNEMGKYVPTARRN

LTPLDHADILELYNQKVRGTLNYYSFASNRSSLNQIVHVLHMSCALTLALKYKLKTAS

KTFNRFGKCLTCPATGMSLFRPSAYKAIHLYNPSPIARAEQVIDISRSTRSALFRTCA

LCGSTKVEMHHIRYVKDVKARIRYVTCAYSEWKGALKRKQTPLCSHHHSVYHNGQLSE

SEMLALSLYTIHGEMFNYKIESS

 

[top]


[secondary structure]

 

[top]