[Back to mitochondrial introns by organism] [Back to home page]

Information of Marchantia polymorpha atp9.I1 intron   (Format of information for each intron)

[intron sequence]

[intron with flanking sequence]

[ORF sequence]

[secondary structure]

[intron sequence]

 

The boundaries of the intron are marked as red and the ORF is marked as blue.

For many organellar introns, the ORF is translated in frame with the upstream

exon ORF. The precursor protein is then processed to a mature form that cuts 

off all of the exon-encoded and some of the intron-encoded polypeptide. This 

situation is indicated here by blue-coded ORF that extends to the start of 

the intron rather than by a translation start approximately 500 bp after the 

start of the intron.

 

Note:  This intron sequence occurs on the reverse complement of the original

GenBank sequence.  Therefore, all sequences below are reverse complements of

the GenBank sequence.

3' end

                    

                                                                 gtgcg

16141 ctcggcagag cgcgtttgga tcagcgaagg ttaacagatt atcgcacggc cttagcccta 

16201 acctgtagcg ggaacagacc gcagggaacc atagcatggg tttcggctga gtttcgtcgc 

16261 acggtgttca cgagtgcttc agagtcaagc gttactccct ccaacgaagg aggcccggaa 

16321 ggcactaagg accaactaaa ccctttgtgc aaacaaccct acgggggaaa ctgcaggaag 

16381 caggaaaagt tgctcactaa cctaaacgac gagcaaacaa ccaaacgtga aaacgaggga 

16441 tcaccccaat ggaccccaaa aaggttagtg cctaggtgcc aaaccccggg ggaccgaggt 

16501 atatccaaaa atgtaatggg tgacagatca gtcataagta ctccgaggga tacactgggc 

16561 aaaccaagtc gcgtatacga cgatgcccgt aatgtgaaag actctgaggg aaggactgta 

16621 gatggtaatg tgccggcgga aaaagctttt ttttgtcaaa aggagagggt cgaccaacag 

16681 gacagggcag aaaatggaag actgagaaaa gcagaaatat ttttttttgg gagtatagca 

16741 aaatcagcga aagcaactac taaaaccaag gccaataaag gtgagtctag accggtgacg 

16801 cctcccgcgc tcggtcgcgt cttctatgaa gacatttaca atatagacaa ccttagggct 

16861 ggctacaaaa ggctaaaagg aaatgtagcc ccgggaatcg acggtagaac taaagcggac 

16921 atgaccgaca aggcccttga aaaactatcc aaagagttga gaaggcaggc atatgcccca 

16981 aaaccggcca aacggataat tattactaaa ccagatggcg gtagtaggcc cctttctatt 

17041 gcttcgacgg ttgacaaagt tgtgcaatcc actttaaaag aactggtgga accgcacttt 

17101 gagtcgcttt ttagagactc aagccacggg ttccgccctg ggagaagctg tcataaagcc 

17161 ctccgagacc tccgttactc gtggacggca ctgacttggc ttgtacaaat tgatattaag 

17221 aaagactttg acaagattca tcatgacctg cttattaagg aaatggaatc agtactgcga 

17281 tctaaagcac tacaggatct tatgcgaaag ctacttaacg caggatacat agatgtatat 

17341 aatcttacag atcgaacgca atacaacaca gagggcgtta cgcagggctc tataatctcc 

17401 ccgctttgtg ccaatatctt cctacataag ttggattgtt atgtcgaaga catactcata 

17461 cccaactaca atgtagggaa tatgcgcccc gcgtcggcgg aatataagaa aaggcttaat 

17521 atccattcaa aggacaaagc cttcttcaag tattatacag aattagagca ggccatcaaa 

17581 aacatcaaac acctaaagtg gataaatcgc gaacaacaaa aaaaatctat tttagtaaaa 

17641 aaaaaatatt tttttgaaaa tttgtttttt tttcgaaacc ctaaagtttc gtgccctttg 

17701 ggccgaagga cgcttctaga aatggctgag aaagaaggat taaaaagact taagtacctc 

17761 cggtacgcgg ataacataat tttaggtgtt ataggcagca aacaagatgc cctagatatt 

17821 agaaaagccg tgcaaaactt cctgcaggaa gaactgaagt tggacataaa cgaacagaaa 

17881 agtaaaattt tacacgcaaa gtccgagatg gccaaatact taggcgcatt agtaatatat 

17941 tacggcaccg gaagtgtgga aatcttaagc aaggtctccg atgtcaaaca aataagactc 

18001 cgatctcgtc cacaactaat agctcccata aaggacttaa taactaaagc cttggaactt 

18061 ggatacgcga aaaaaaaacg ccaagggctt agctcgagca acctccaatc cacgattggt 

18121 ttcctcaggg gacaaacaca ttgttatcca cttctcatcg gtgatacgcg gcattgttaa 

18181 ctactattca tttgtgaaca agcgctcttc cctgtggaaa gttgtgtcca tctataaaaa 

18241 gtcctgcgcg ctaaccttgg ccagaaaaca cggcctacgg tcagccaaag ctgctttact 

18301 caagtttggt ccaaacctgc gaatcacaga aaaaggcaag gaggtagcgt cactatacta 

18361 ccccacctct ctaaaaacta caggaaagtt ccacgctact aggttcagtc aggttactat 

18421 actcgaggaa ccatattatg gagtcaggtg actatattcg aggaaccctg tgatggagtg 

18481 gcgtggatag agcggacaga ctactcacaa caaaaggctc tatcccgcgc ctccgacaaa 

18541 aaaaatatgt ttaggctctc ttcctttggt cgagtgtctc gctttggtcg agtgtctctc 

18601 tttggtcttc gcgcctctcc cctcttctcc ctttagcact cgcgcgtgga aatcgtaaag 

18661 ccatttccaa cctgcggacc cttctttatc agccatttag actagttcaa aatttttttt 

18721 gcagctttaa ctgtatcact tgagacctta tatcccggtc tcggcgctct tgctagggca 

18781 accatagcca aatcaagaaa gcggatcacg ctatgcctac aatgacttag ttacaaggtg 

18841 cagggagggg gaattgcaca gtagtcgcgc gtcccggtgt aaggagggct agtagcgctt 

18901 atacggccaa gccacaaaag ctgaagaaat tgccatggac gagccacatg cagggaaact 

18961 tgcacgtgtg gttctgaccg gggggaataa ccctatcggt at

                                                                   

[top]


 

 

[intron& flanking sequence]

 

Genbank entry, intron is marked as red

 

                                                aaagtc cgtatgacaa ccgagtgcat 

15661 cagcctttct atttctgttt ctatttctaa attgaaggat atgctagcca gtaaggtccg 

15721 tttattacga gccttggtgc taagctacaa atgaagcgcg taaagtctac caaatactat 

15781 ttcgtagaaa ataattcatt atatccaccc atgcttttcc taatctacag aaatcttgag 

15841 tatttcgtgc ctcaggcatg gcccgggggc caaaggttct cgtttatata tggtattata 

15901 catattatgt ctcttctgtt aacccccccc tttgggccat aagggttcgg gttgatttta 

15961 gtatattggg ttgtttttag tttggctgca tttttattga tcagccatga atggtcattg 

16021 ttttgacaaa aacaaaagta aacggttatg ctagaaggtg caaaattaat tggagcagga 

16081 gcagctacca ttgctttagc gggagctgct gtaggtattg gaaacgtttt tagtgtgcgc 

16141 ctcggcagag cgcgtttgga tcagcgaagg ttaacagatt atcgcacggc cttagcccta 

16201 acctgtagcg ggaacagacc gcagggaacc atagcatggg tttcggctga gtttcgtcgc 

16261 acggtgttca cgagtgcttc agagtcaagc gttactccct ccaacgaagg aggcccggaa 

16321 ggcactaagg accaactaaa ccctttgtgc aaacaaccct acgggggaaa ctgcaggaag 

16381 caggaaaagt tgctcactaa cctaaacgac gagcaaacaa ccaaacgtga aaacgaggga 

16441 tcaccccaat ggaccccaaa aaggttagtg cctaggtgcc aaaccccggg ggaccgaggt 

16501 atatccaaaa atgtaatggg tgacagatca gtcataagta ctccgaggga tacactgggc 

16561 aaaccaagtc gcgtatacga cgatgcccgt aatgtgaaag actctgaggg aaggactgta 

16621 gatggtaatg tgccggcgga aaaagctttt ttttgtcaaa aggagagggt cgaccaacag 

16681 gacagggcag aaaatggaag actgagaaaa gcagaaatat ttttttttgg gagtatagca 

16741 aaatcagcga aagcaactac taaaaccaag gccaataaag gtgagtctag accggtgacg 

16801 cctcccgcgc tcggtcgcgt cttctatgaa gacatttaca atatagacaa ccttagggct 

16861 ggctacaaaa ggctaaaagg aaatgtagcc ccgggaatcg acggtagaac taaagcggac 

16921 atgaccgaca aggcccttga aaaactatcc aaagagttga gaaggcaggc atatgcccca 

16981 aaaccggcca aacggataat tattactaaa ccagatggcg gtagtaggcc cctttctatt 

17041 gcttcgacgg ttgacaaagt tgtgcaatcc actttaaaag aactggtgga accgcacttt 

17101 gagtcgcttt ttagagactc aagccacggg ttccgccctg ggagaagctg tcataaagcc 

17161 ctccgagacc tccgttactc gtggacggca ctgacttggc ttgtacaaat tgatattaag 

17221 aaagactttg acaagattca tcatgacctg cttattaagg aaatggaatc agtactgcga 

17281 tctaaagcac tacaggatct tatgcgaaag ctacttaacg caggatacat agatgtatat 

17341 aatcttacag atcgaacgca atacaacaca gagggcgtta cgcagggctc tataatctcc 

17401 ccgctttgtg ccaatatctt cctacataag ttggattgtt atgtcgaaga catactcata 

17461 cccaactaca atgtagggaa tatgcgcccc gcgtcggcgg aatataagaa aaggcttaat 

17521 atccattcaa aggacaaagc cttcttcaag tattatacag aattagagca ggccatcaaa 

17581 aacatcaaac acctaaagtg gataaatcgc gaacaacaaa aaaaatctat tttagtaaaa 

17641 aaaaaatatt tttttgaaaa tttgtttttt tttcgaaacc ctaaagtttc gtgccctttg 

17701 ggccgaagga cgcttctaga aatggctgag aaagaaggat taaaaagact taagtacctc 

17761 cggtacgcgg ataacataat tttaggtgtt ataggcagca aacaagatgc cctagatatt 

17821 agaaaagccg tgcaaaactt cctgcaggaa gaactgaagt tggacataaa cgaacagaaa 

17881 agtaaaattt tacacgcaaa gtccgagatg gccaaatact taggcgcatt agtaatatat 

17941 tacggcaccg gaagtgtgga aatcttaagc aaggtctccg atgtcaaaca aataagactc 

18001 cgatctcgtc cacaactaat agctcccata aaggacttaa taactaaagc cttggaactt 

18061 ggatacgcga aaaaaaaacg ccaagggctt agctcgagca acctccaatc cacgattggt 

18121 ttcctcaggg gacaaacaca ttgttatcca cttctcatcg gtgatacgcg gcattgttaa 

18181 ctactattca tttgtgaaca agcgctcttc cctgtggaaa gttgtgtcca tctataaaaa 

18241 gtcctgcgcg ctaaccttgg ccagaaaaca cggcctacgg tcagccaaag ctgctttact 

18301 caagtttggt ccaaacctgc gaatcacaga aaaaggcaag gaggtagcgt cactatacta 

18361 ccccacctct ctaaaaacta caggaaagtt ccacgctact aggttcagtc aggttactat 

18421 actcgaggaa ccatattatg gagtcaggtg actatattcg aggaaccctg tgatggagtg 

18481 gcgtggatag agcggacaga ctactcacaa caaaaggctc tatcccgcgc ctccgacaaa 

18541 aaaaatatgt ttaggctctc ttcctttggt cgagtgtctc gctttggtcg agtgtctctc 

18601 tttggtcttc gcgcctctcc cctcttctcc ctttagcact cgcgcgtgga aatcgtaaag 

18661 ccatttccaa cctgcggacc cttctttatc agccatttag actagttcaa aatttttttt 

18721 gcagctttaa ctgtatcact tgagacctta tatcccggtc tcggcgctct tgctagggca 

18781 accatagcca aatcaagaaa gcggatcacg ctatgcctac aatgacttag ttacaaggtg 

18841 cagggagggg gaattgcaca gtagtcgcgc gtcccggtgt aaggagggct agtagcgctt 

18901 atacggccaa gccacaaaag ctgaagaaat tgccatggac gagccacatg cagggaaact 

18961 tgcacgtgtg gttctgaccg gggggaataa ccctatcggt attctttgat taattctgtt 

19021 gcgcgaaatc catcattggc taagcaatta tttggttatg ccattttagg ttttgcttta 

19081 actgaagcta ttgctttgtt tgccttaatg atggcatttt taatattatt cgtcttttaa 

19141 tgtgctgcaa atatttcttt ttttcttttt tatttcctaa aacgagcacc tcagggctcg 

19201 aacgtttcgt acgtttgagc ccttccctcg ccgcacacgc ataaaagagc taacaaaaaa 

19261 aaacaagggg ggtatttgac ctttgcatct gcttagacag tagagccccg aaggaattca 

19321 aaaaagattg ttcttgggta aggtaaacag ctttgtctct ctacccgtta gggtagagcg 

19381 caaagagcag aaaacgtaat atttttttgc ttacttctcc agggtccaaa ggatacttat 

19441 ttggcttgta aaaccccttc tttcgtaaag ccaaataagg aaaatcgtca agccatttcc 

19501 gc                                                                

[top]


[ORF sequence]

 

771 a.a.

(Note: Published size is 681 amino acids; a single frameshift in domain X replaces 34 C-terminal

amino acids with 124 amino acids similar to other domain X sequences)

 

VRLGRARLDQRRLTDYRTALALTCSGNRPQGTIAWVSAEFRRTVFTSASESSVTPSNE

GGPEGTKDQLNPLCKQPYGGNCRKQEKLLTNLNDEQTTKRENEGSPQWTPKRLVPRCQ

TPGDRGISKNVMGDRSVISTPRDTLGKPSRVYDDARNVKDSEGRTVDGNVPAEKAFFC

QKERVDQQDRAENGRLRKAEIFFFGSIAKSAKATTKTKANKGESRPVTPPALGRVFYE

DIYNIDNLRAGYKRLKGNVAPGIDGRTKADMTDKALEKLSKELRRQAYAPKPAKRIII

TKPDGGSRPLSIASTVDKVVQSTLKELVEPHFESLFRDSSHGFRPGRSCHKALRDLRY

SWTALTWLVQIDIKKDFDKIHHDLLIKEMESVLRSKALQDLMRKLLNAGYIDVYNLTD

RTQYNTEGVTQGSIISPLCANIFLHKLDCYVEDILIPNYNVGNMRPASAEYKKRLNIH

SKDKAFFKYYTELEQAIKNIKHLKWINREQQKKSILVKKKYFFENLFFFRNPKVSCPL

GRRTLLEMAEKEGLKRLKYLRYADNIILGVIGSKQDALDIRKAVQNFLQEELKLDINE

QKSKILHAKSEMAKYLGALVIYYGTGSVEILSKVSDVKQIRLRSRPQLIAPIKDLITK

ALELGYAKKNAKGLARATSNPRLVSSGDKHIVIHFSSVIRGIVNYYSFVNKRSSLWKV

VSIYKKSCALTLARKHGLRSAKAALLKFGPNLRITEKGKEVASLYYPTSLKTTGKFHA

TRFSQVTILEEPYYGVR

 

[top]


[secondary structure]

 

 

[top]