Information on the Marchantia polymorpha cob1.I3 intron

[intron sequence]

[intron with flanking sequence]

[ORF sequence]

[secondary structure]

[intron sequence]

 

The boundaries of the intron are marked in red and the ORF is marked in blue.

For many organellar introns, the ORF is translated in frame with the upstream exon ORF. The precursor protein is then processed to a mature form, which removes all of the exon-encoded and some of the intron-encoded polypeptide. This situation is indicated here by a blue-coded ORF that extends to the start of the intron, rather than by a translation start approximately 500 bp after the start of the intron.
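To illustrate the reading frame described above, here is a minimal Python sketch that translates the first 37 nucleotides of the intron listing on this page. It rests on two assumptions that the page does not state: that the standard genetic code applies, and that the ORF frame begins one base into the intron (inferred from the listed ORF sequence). The names `CODON_TABLE` and `translate` are hypothetical helpers written for this example.

```python
from itertools import product

# Standard genetic code (an assumption for this sketch), codons ordered t, c, a, g.
BASES = "tcag"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna, offset=0):
    """Translate dna codon by codon from offset, stopping at the first stop codon."""
    protein = []
    for i in range(offset, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3].lower()]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 37 nucleotides of the intron, copied from the listing with spaces removed.
intron_start = "gcgcggttcggacccaacaatatccgctaccgacgga"

# Assumed offset of 1: the first intron base would belong to the last exon codon.
print(translate(intron_start, offset=1))
```

Under these assumptions the result can be compared with the first 12 residues of the ORF sequence given further down the page (RGSDPTISATDG).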

3' end

 

                                gcgcggt tcggacccaa caatatccgc taccgacgga 

104941 aatcggtgcc ttgagggcgt taaggctaat ccctaccgaa actggggagt gagtgacctc 

105001 ctccccaggg caagctcttt ggatgggaaa atgctctttg tggttcctga tacaagccca 

105061 agccgccttc ttactataag gggctgccgc cttcgcctag gcgggccggc ggtcacgtcg 

105121 ttttgccgga ctcacgcgag tactcctaat tccgggggag gaaaggggcc ctctgaggaa 

105181 ggaccaaaca agacggcgtc gccaagttcg cagactggta aatgcttagg gaggaccaca 

105241 ctgagtgctt cggattggtt gggaccgaaa caaaattggc cggggggaac cccggaactg 

105301 ataagcgaag cctcgaataa agccttaggc cttgatgagg tacgggccca agatgaggcg 

105361 aacccgatct atggacagat gagtggagcg attaagagtt cgaatcttga cgttcctctt 

105421 aaccccttgg agttttcaca tctggcccaa cttgtagaga aacaattcat cgatggcaaa 

105481 tatcgccatc tggtaaggga gattatatct aaaccggaag tgctgcttac ggcctataac 

105541 aacatcaaat ccaaacccgg gaatattccc ggaagtccag agagcgacac gctcttcctt 

105601 cgtcgagtgg cttcgcccgg cataagcctc aaatggtttc atgaaacggg gcggaaactg 

105661 agggagggta catacgagtt tgagccgatt tggaggtccg aaggaccaaa gacgggtaaa 

105721 gctgaaaagc gccccctaac catagcgaac cctagggaca agataataca ggaggctttc 

105781 cggatggtac tggaaattat ctacgaacca aggttcaaag atatatccca tggtttccga 

105841 aggaacaagg gtactcactc agccctgagg gacatcaaaa tgcggtggaa gaacccatcg 

105901 tggtggctcg aattcgatat tcggaaatgc ttcgacacta tcaacaagaa aatactaatg 

105961 tcaatactaa gcgaaaccat tcaagataat agactcgaaa gtaccttgaa ccaaatgtgg 

106021 aacgcaaaga ttatggatgt cgaattggga ggccccgggg ttcctcaggg aagccttata 

106081 tctcccatcc tgaccaattt atacctcgac agactcgaca gggagatctt aaggatccga 

106141 aaggaactcg aaaaaggctc tccgaggcat cgtcgagcca atcccgtata tgagcaactg 

106201 ctgtacatcc cgaaaaactc ggtgatgcag atgggaccgg cagccctgct tcgaaagaag 

106261 aggacacggc ttaaaattgt cagaagtata ccctttgcgg atcccaagga ccctaaattc 

106321 gtgcgaatct acgcgtgccg ttacgccgac gacattctga tggcagtttc ggggtcgaaa 

106381 gctttggccc gcgaagtcat ggaacgagtc tcccggttcc tcaaagacgt tctccacctg 

106441 gagataaacc cagagcaaac ccgtctggga cacgtagtgg aagagaaggc aaccttctta 

106501 gggatgaggc tcttaggtcc gaaaccaagt caattgcacg tgaggtcgga caaagcgact 

106561 cgggccagga ataaatatcg gggccgcgtc cgaaaggccg cgttggagct ttcgaatggg 

106621 tgggagaaag ggcttaagaa gctcggagag aagttactgg tgtgcgccct gaagagggcc 

106681 ttgaaggagg ccggcaaaac ggggaacctt acacttatga aacctgacca agaagtgcgg 

106741 aaaatgctag aacaaatttg tcgagaagtt gtgagtgagg tacaaaggcc aacggggatt 

106801 ggtcaggatg ccatgttccg gtgggcacgt gaatcgagta aaggggacat cttcacgaat 

106861 attattgaac gggatggaca gggagtggtg agaaaattcg acgaattcgt cgtatcggtg 

106921 cacaagctct tatacccaga gtccaaggct gagcgaaaag aatcaacggg tccaggcccg 

106981 gaaagggacg cccccacgaa ccaacgccag gcgttccgaa tacagatata cgcaccgctt 

107041 tcgcggatac tagacaagtt aagggccaga ggaatcatta acgctgaggg aagaccaacc 

107101 tccgtccccc tcctcgcgac ccaagacgat gtcactatca cgcaatggta cggaagtgtg 

107161 gcgcacgggt tcttaagtta ttaccgatgt tgtgataact tctataaggt caaaaaggtg 

107221 gtagactatc agctcagatg gtcctgccta cacaccttat cacacaagat taaagccaag 

107281 ggcgtgggta aagtcataga caaatatacc cacgagctgc gggtggaaac ggcgaagggc 

107341 cctagggtat atttccctac acctactgaa ctgagggtta tgggaaaaca attcttggtg 

107401 aagagcatca aagacccgga gagaatcctt ggtctgatgt tcttacgcac cactaggcac 

107461 ccggccgata gatgctcagt tataggttgc ctggagagta agatcgaaat gcaccatatc 

107521 cgtgcgttga aacgcagcgg agcgggcggc cccaaaatgt ccatagtaaa ctctagcagc 

107581 gaccgaatca gtgggctaga agctcttcac gcggcgctta accggaaaca aatttcctta 

107641 tgcagggagc accacaaggc gatgcatgcg ggggatataa gtctgcagga tatagatgta 

107701 tctgtggttc tgaacccgaa cccaagaagg gggcaggcct gtgattctcg atgattaaac 

107761 tctacaacga aaaagattgg ggttgggagc cgtatgagcg gtaacgttca cgtacggttc 

107821 tgtgagaagg gttggggacg gaaatggccc cattccctta ctctcg                    

                                                                   



 

 

[intron with flanking sequence]

 

GenBank entry; the intron is marked in red.

 

          acaggag tctaaacgac atcttaagca cagggcgttc aaaaacgaag gtttttggac 

104461 gagccgtgta ggaaacaacg gtgaggtcct aggggtcgct tttatcccta ggaagtcaga 

104521 gggcccgtat tacccgccta cctgaaaggg acctagcaat aggaaagcgc ttgataacaa 

104581 gcagcaggga aggagcctat gtacttacta aatggttacc gtttggcaag tttcactttg 

104641 acgagtctat caggccatct tccaaggacc gtgtaatagg cgctcgaaag agagggaatg 

104701 tgcgttaaat agaccttccg ggtttgaaga agctccgaag aaagtcaggt tagagagccg 

104761 tgtgatgggc gactatctcg tacggttcgg agagcacttg agtagcccag atcggtgaac 

104821 ggggtaaccc tgggaccgca tagggtggat tcttgactct atcccgcaaa tcccatgtca 

104881 accccggctc atatagtgcc ggagcgcggt tcggacccaa caatatccgc taccgacgga 

104941 aatcggtgcc ttgagggcgt taaggctaat ccctaccgaa actggggagt gagtgacctc 

105001 ctccccaggg caagctcttt ggatgggaaa atgctctttg tggttcctga tacaagccca 

105061 agccgccttc ttactataag gggctgccgc cttcgcctag gcgggccggc ggtcacgtcg 

105121 ttttgccgga ctcacgcgag tactcctaat tccgggggag gaaaggggcc ctctgaggaa 

105181 ggaccaaaca agacggcgtc gccaagttcg cagactggta aatgcttagg gaggaccaca 

105241 ctgagtgctt cggattggtt gggaccgaaa caaaattggc cggggggaac cccggaactg 

105301 ataagcgaag cctcgaataa agccttaggc cttgatgagg tacgggccca agatgaggcg 

105361 aacccgatct atggacagat gagtggagcg attaagagtt cgaatcttga cgttcctctt 

105421 aaccccttgg agttttcaca tctggcccaa cttgtagaga aacaattcat cgatggcaaa 

105481 tatcgccatc tggtaaggga gattatatct aaaccggaag tgctgcttac ggcctataac 

105541 aacatcaaat ccaaacccgg gaatattccc ggaagtccag agagcgacac gctcttcctt 

105601 cgtcgagtgg cttcgcccgg cataagcctc aaatggtttc atgaaacggg gcggaaactg 

105661 agggagggta catacgagtt tgagccgatt tggaggtccg aaggaccaaa gacgggtaaa 

105721 gctgaaaagc gccccctaac catagcgaac cctagggaca agataataca ggaggctttc 

105781 cggatggtac tggaaattat ctacgaacca aggttcaaag atatatccca tggtttccga 

105841 aggaacaagg gtactcactc agccctgagg gacatcaaaa tgcggtggaa gaacccatcg 

105901 tggtggctcg aattcgatat tcggaaatgc ttcgacacta tcaacaagaa aatactaatg 

105961 tcaatactaa gcgaaaccat tcaagataat agactcgaaa gtaccttgaa ccaaatgtgg 

106021 aacgcaaaga ttatggatgt cgaattggga ggccccgggg ttcctcaggg aagccttata 

106081 tctcccatcc tgaccaattt atacctcgac agactcgaca gggagatctt aaggatccga 

106141 aaggaactcg aaaaaggctc tccgaggcat cgtcgagcca atcccgtata tgagcaactg 

106201 ctgtacatcc cgaaaaactc ggtgatgcag atgggaccgg cagccctgct tcgaaagaag 

106261 aggacacggc ttaaaattgt cagaagtata ccctttgcgg atcccaagga ccctaaattc 

106321 gtgcgaatct acgcgtgccg ttacgccgac gacattctga tggcagtttc ggggtcgaaa 

106381 gctttggccc gcgaagtcat ggaacgagtc tcccggttcc tcaaagacgt tctccacctg 

106441 gagataaacc cagagcaaac ccgtctggga cacgtagtgg aagagaaggc aaccttctta 

106501 gggatgaggc tcttaggtcc gaaaccaagt caattgcacg tgaggtcgga caaagcgact 

106561 cgggccagga ataaatatcg gggccgcgtc cgaaaggccg cgttggagct ttcgaatggg 

106621 tgggagaaag ggcttaagaa gctcggagag aagttactgg tgtgcgccct gaagagggcc 

106681 ttgaaggagg ccggcaaaac ggggaacctt acacttatga aacctgacca agaagtgcgg 

106741 aaaatgctag aacaaatttg tcgagaagtt gtgagtgagg tacaaaggcc aacggggatt 

106801 ggtcaggatg ccatgttccg gtgggcacgt gaatcgagta aaggggacat cttcacgaat 

106861 attattgaac gggatggaca gggagtggtg agaaaattcg acgaattcgt cgtatcggtg 

106921 cacaagctct tatacccaga gtccaaggct gagcgaaaag aatcaacggg tccaggcccg 

106981 gaaagggacg cccccacgaa ccaacgccag gcgttccgaa tacagatata cgcaccgctt 

107041 tcgcggatac tagacaagtt aagggccaga ggaatcatta acgctgaggg aagaccaacc 

107101 tccgtccccc tcctcgcgac ccaagacgat gtcactatca cgcaatggta cggaagtgtg 

107161 gcgcacgggt tcttaagtta ttaccgatgt tgtgataact tctataaggt caaaaaggtg 

107221 gtagactatc agctcagatg gtcctgccta cacaccttat cacacaagat taaagccaag 

107281 ggcgtgggta aagtcataga caaatatacc cacgagctgc gggtggaaac ggcgaagggc 

107341 cctagggtat atttccctac acctactgaa ctgagggtta tgggaaaaca attcttggtg 

107401 aagagcatca aagacccgga gagaatcctt ggtctgatgt tcttacgcac cactaggcac 

107461 ccggccgata gatgctcagt tataggttgc ctggagagta agatcgaaat gcaccatatc 

107521 cgtgcgttga aacgcagcgg agcgggcggc cccaaaatgt ccatagtaaa ctctagcagc 

107581 gaccgaatca gtgggctaga agctcttcac gcggcgctta accggaaaca aatttcctta 

107641 tgcagggagc accacaaggc gatgcatgcg ggggatataa gtctgcagga tatagatgta 

107701 tctgtggttc tgaacccgaa cccaagaagg gggcaggcct gtgattctcg atgattaaac 

107761 tctacaacga aaaagattgg ggttgggagc cgtatgagcg gtaacgttca cgtacggttc 

107821 tgtgagaagg gttggggacg gaaatggccc cattccctta ctctcgatgg tatttcttac 

107881 cagtctatgc gattcttcga agtataccta acaaattagg gggtgtagcc gccataggac 

107941 tagtttttgt gtcattattg gctttacctt tcattaacac ttcatatgta cgtagttcaa 

108001 gttttcgacc aattcaccaa aaattttttt ggttgcttgt agcagattgc ttgcttttag 

108061 gttggattgg atgtcaaccc gtggaagcac catatgttac tattggacaa attgcttcag 

108121 tgggtttttt cttctatttt gctataacgc ccattcttgg caaatgtgaa gccagattaa 

108181 tcaaaaattc taatgcttgc gaggcgcgta gcgtcctagc aagctttctc acttctattg 

108241 gcttgctttg gtggtgaaat ccaaagctat cagccctccg ggccttacgc aattctcctt 

108301 tttttatacc aataataata tacttttcct aaaggttatt agttgcacca agtatatcgc 

108361 actttg                                                                
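Because the colour marking does not survive in plain text, the coordinates in the listing are the most reliable way to locate a region. The following Python sketch joins GenBank-style lines into one sequence and recovers the coordinate of the first base. It assumes that an unnumbered partial first line runs directly into the next numbered line; `parse_numbered_lines` is a hypothetical helper name, not part of any library.

```python
def parse_numbered_lines(lines):
    """Return (start_coordinate, sequence) from GenBank-style sequence lines.

    A leading integer on a line is taken as the coordinate of that line's
    first base. Blocks seen before the first numbered line are assumed to
    be contiguous with it, so the start is found by counting backwards.
    """
    blocks = []
    start = None
    for line in lines:
        tokens = line.split()
        if not tokens:
            continue  # skip blank or whitespace-only lines
        if tokens[0].isdigit():
            if start is None:
                start = int(tokens[0]) - sum(len(b) for b in blocks)
            tokens = tokens[1:]
        blocks.extend(tokens)
    return start, "".join(blocks)


# First two lines of the flanking-sequence listing above.
listing = [
    "          acaggag tctaaacgac atcttaagca cagggcgttc aaaaacgaag gtttttggac",
    "104461 gagccgtgta ggaaacaacg gtgaggtcct aggggtcgct tttatcccta ggaagtcaga",
]

start, seq = parse_numbered_lines(listing)
print(start, len(seq))
```

With the start coordinate known, a region such as the intron could be sliced out as `seq[a - start : b - start + 1]` for 1-based inclusive coordinates `a` and `b`.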

 



[ORF sequence]

 

949 a.a.

 

RGSDPTISATDGNRCLEGVKANPYRNWGVSDLLPRASSLDGKMLFVVPDTSPSRLLTI

RGCRLRLGGPAVTSFCRTHASTPNSGGGKGPSEEGPNKTASPSSQTGKCLGRTTLSAS

DWLGPKQNWPGGTPELISEASNKALGLDEVRAQDEANPIYGQMSGAIKSSNLDVPLNP

LEFSHLAQLVEKQFIDGKYRHLVREIISKPEVLLTAYNNIKSKPGNIPGSPESDTLFL

RRVASPGISLKWFHETGRKLREGTYEFEPIWRSEGPKTGKAEKRPLTIANPRDKIIQE

AFRMVLEIIYEPRFKDISHGFRRNKGTHSALRDIKMRWKNPSWWLEFDIRKCFDTINK

KILMSILSETIQDNRLESTLNQMWNAKIMDVELGGPGVPQGSLISPILTNLYLDRLDR

EILRIRKELEKGSPRHRRANPVYEQLLYIPKNSVMQMGPAALLRKKRTRLKIVRSIPF

ADPKDPKFVRIYACRYADDILMAVSGSKALAREVMERVSRFLKDVLHLEINPEQTRLG

HVVEEKATFLGMRLLGPKPSQLHVRSDKATRARNKYRGRVRKAALELSNGWEKGLKKL

GEKLLVCALKRALKEAGKTGNLTLMKPDQEVRKMLEQICREVVSEVQRPTGIGQDAMF

RWARESSKGDIFTNIIERDGQGVVRKFDEFVVSVHKLLYPESKAERKESTGPGPERDA

PTNQRQAFRIQIYAPLSRILDKLRARGIINAEGRPTSVPLLATQDDVTITQWYGSVAH

GFLSYYRCCDNFYKVKKVVDYQLRWSCLHTLSHKIKAKGVGKVIDKYTHELRVETAKG

PRVYFPTPTELRVMGKQFLVKSIKDPERILGLMFLRTTRHPADRCSVIGCLESKIEMH

HIRALKRSGAGGPKMSIVNSSSDRISGLEALHAALNRKQISLCREHHKAMHAGDISLQ

DIDVSVVLNPNPRRGQACDSR

 



[secondary structure]

 
