[Back to mitochondrial introns by organism] [Back to home page]
Information of Marchantia polymorpha atp9.I1 intron (Format of information for each intron)
[intron with flanking sequence]
The boundaries of the intron are marked as red and the ORF is marked as blue.
For many organellar introns, the ORF is translated in frame with the upstream
exon ORF. The precursor protein is then processed to a mature form that cuts
off all of the exon-encoded and some of the intron-encoded polypeptide. This
situation is indicated here by blue-coded ORF that extends to the start of
the intron rather than by a translation start approximately 500 bp after the
start of the intron.
Note: This intron sequence occurs on the reverse complement of the original
GenBank sequence. Therefore, all sequences below are reverse complements of
the GenBank sequence.
3' end
gtgcgc
16141 ctcggcagag cgcgtttgga tcagcgaagg ttaacagatt atcgcacggc cttagcccta
16201 acctgtagcg ggaacagacc gcagggaacc atagcatggg tttcggctga gtttcgtcgc
16261 acggtgttca cgagtgcttc agagtcaagc gttactccct ccaacgaagg aggcccggaa
16321 ggcactaagg accaactaaa ccctttgtgc aaacaaccct acgggggaaa ctgcaggaag
16381 caggaaaagt tgctcactaa cctaaacgac gagcaaacaa ccaaacgtga aaacgaggga
16441 tcaccccaat ggaccccaaa aaggttagtg cctaggtgcc aaaccccggg ggaccgaggt
16501 atatccaaaa atgtaatggg tgacagatca gtcataagta ctccgaggga tacactgggc
16561 aaaccaagtc gcgtatacga cgatgcccgt aatgtgaaag actctgaggg aaggactgta
16621 gatggtaatg tgccggcgga aaaagctttt ttttgtcaaa aggagagggt cgaccaacag
16681 gacagggcag aaaatggaag actgagaaaa gcagaaatat ttttttttgg gagtatagca
16741 aaatcagcga aagcaactac taaaaccaag gccaataaag gtgagtctag accggtgacg
16801 cctcccgcgc tcggtcgcgt cttctatgaa gacatttaca atatagacaa ccttagggct
16861 ggctacaaaa ggctaaaagg aaatgtagcc ccgggaatcg acggtagaac taaagcggac
16921 atgaccgaca aggcccttga aaaactatcc aaagagttga gaaggcaggc atatgcccca
16981 aaaccggcca aacggataat tattactaaa ccagatggcg gtagtaggcc cctttctatt
17041 gcttcgacgg ttgacaaagt tgtgcaatcc actttaaaag aactggtgga accgcacttt
17101 gagtcgcttt ttagagactc aagccacggg ttccgccctg ggagaagctg tcataaagcc
17161 ctccgagacc tccgttactc gtggacggca ctgacttggc ttgtacaaat tgatattaag
17221 aaagactttg acaagattca tcatgacctg cttattaagg aaatggaatc agtactgcga
17281 tctaaagcac tacaggatct tatgcgaaag ctacttaacg caggatacat agatgtatat
17341 aatcttacag atcgaacgca atacaacaca gagggcgtta cgcagggctc tataatctcc
17401 ccgctttgtg ccaatatctt cctacataag ttggattgtt atgtcgaaga catactcata
17461 cccaactaca atgtagggaa tatgcgcccc gcgtcggcgg aatataagaa aaggcttaat
17521 atccattcaa aggacaaagc cttcttcaag tattatacag aattagagca ggccatcaaa
17581 aacatcaaac acctaaagtg gataaatcgc gaacaacaaa aaaaatctat tttagtaaaa
17641 aaaaaatatt tttttgaaaa tttgtttttt tttcgaaacc ctaaagtttc gtgccctttg
17701 ggccgaagga cgcttctaga aatggctgag aaagaaggat taaaaagact taagtacctc
17761 cggtacgcgg ataacataat tttaggtgtt ataggcagca aacaagatgc cctagatatt
17821 agaaaagccg tgcaaaactt cctgcaggaa gaactgaagt tggacataaa cgaacagaaa
17881 agtaaaattt tacacgcaaa gtccgagatg gccaaatact taggcgcatt agtaatatat
17941 tacggcaccg gaagtgtgga aatcttaagc aaggtctccg atgtcaaaca aataagactc
18001 cgatctcgtc cacaactaat agctcccata aaggacttaa taactaaagc cttggaactt
18061 ggatacgcga aaaaaaaacg ccaagggctt agctcgagca acctccaatc cacgattggt
18121 ttcctcaggg gacaaacaca ttgttatcca cttctcatcg gtgatacgcg gcattgttaa
18181 ctactattca tttgtgaaca agcgctcttc cctgtggaaa gttgtgtcca tctataaaaa
18241 gtcctgcgcg ctaaccttgg ccagaaaaca cggcctacgg tcagccaaag ctgctttact
18301 caagtttggt ccaaacctgc gaatcacaga aaaaggcaag gaggtagcgt cactatacta
18361 ccccacctct ctaaaaacta caggaaagtt ccacgctact aggttcagtc aggttactat
18421 actcgaggaa ccatattatg gagtcaggtg actatattcg aggaaccctg tgatggagtg
18481 gcgtggatag agcggacaga ctactcacaa caaaaggctc tatcccgcgc ctccgacaaa
18541 aaaaatatgt ttaggctctc ttcctttggt cgagtgtctc gctttggtcg agtgtctctc
18601 tttggtcttc gcgcctctcc cctcttctcc ctttagcact cgcgcgtgga aatcgtaaag
18661 ccatttccaa cctgcggacc cttctttatc agccatttag actagttcaa aatttttttt
18721 gcagctttaa ctgtatcact tgagacctta tatcccggtc tcggcgctct tgctagggca
18781 accatagcca aatcaagaaa gcggatcacg ctatgcctac aatgacttag ttacaaggtg
18841 cagggagggg gaattgcaca gtagtcgcgc gtcccggtgt aaggagggct agtagcgctt
18901 atacggccaa gccacaaaag ctgaagaaat tgccatggac gagccacatg cagggaaact
18961 tgcacgtgtg gttctgaccg gggggaataa ccctatcggt at
Genbank entry, intron is marked as red
aaagtc cgtatgacaa ccgagtgcat
15661 cagcctttct atttctgttt ctatttctaa attgaaggat atgctagcca gtaaggtccg
15721 tttattacga gccttggtgc taagctacaa atgaagcgcg taaagtctac caaatactat
15781 ttcgtagaaa ataattcatt atatccaccc atgcttttcc taatctacag aaatcttgag
15841 tatttcgtgc ctcaggcatg gcccgggggc caaaggttct cgtttatata tggtattata
15901 catattatgt ctcttctgtt aacccccccc tttgggccat aagggttcgg gttgatttta
15961 gtatattggg ttgtttttag tttggctgca tttttattga tcagccatga atggtcattg
16021 ttttgacaaa aacaaaagta aacggttatg ctagaaggtg caaaattaat tggagcagga
16081 gcagctacca ttgctttagc gggagctgct gtaggtattg gaaacgtttt tagtgtgcgc
16141 ctcggcagag cgcgtttgga tcagcgaagg ttaacagatt atcgcacggc cttagcccta
16201 acctgtagcg ggaacagacc gcagggaacc atagcatggg tttcggctga gtttcgtcgc
16261 acggtgttca cgagtgcttc agagtcaagc gttactccct ccaacgaagg aggcccggaa
16321 ggcactaagg accaactaaa ccctttgtgc aaacaaccct acgggggaaa ctgcaggaag
16381 caggaaaagt tgctcactaa cctaaacgac gagcaaacaa ccaaacgtga aaacgaggga
16441 tcaccccaat ggaccccaaa aaggttagtg cctaggtgcc aaaccccggg ggaccgaggt
16501 atatccaaaa atgtaatggg tgacagatca gtcataagta ctccgaggga tacactgggc
16561 aaaccaagtc gcgtatacga cgatgcccgt aatgtgaaag actctgaggg aaggactgta
16621 gatggtaatg tgccggcgga aaaagctttt ttttgtcaaa aggagagggt cgaccaacag
16681 gacagggcag aaaatggaag actgagaaaa gcagaaatat ttttttttgg gagtatagca
16741 aaatcagcga aagcaactac taaaaccaag gccaataaag gtgagtctag accggtgacg
16801 cctcccgcgc tcggtcgcgt cttctatgaa gacatttaca atatagacaa ccttagggct
16861 ggctacaaaa ggctaaaagg aaatgtagcc ccgggaatcg acggtagaac taaagcggac
16921 atgaccgaca aggcccttga aaaactatcc aaagagttga gaaggcaggc atatgcccca
16981 aaaccggcca aacggataat tattactaaa ccagatggcg gtagtaggcc cctttctatt
17041 gcttcgacgg ttgacaaagt tgtgcaatcc actttaaaag aactggtgga accgcacttt
17101 gagtcgcttt ttagagactc aagccacggg ttccgccctg ggagaagctg tcataaagcc
17161 ctccgagacc tccgttactc gtggacggca ctgacttggc ttgtacaaat tgatattaag
17221 aaagactttg acaagattca tcatgacctg cttattaagg aaatggaatc agtactgcga
17281 tctaaagcac tacaggatct tatgcgaaag ctacttaacg caggatacat agatgtatat
17341 aatcttacag atcgaacgca atacaacaca gagggcgtta cgcagggctc tataatctcc
17401 ccgctttgtg ccaatatctt cctacataag ttggattgtt atgtcgaaga catactcata
17461 cccaactaca atgtagggaa tatgcgcccc gcgtcggcgg aatataagaa aaggcttaat
17521 atccattcaa aggacaaagc cttcttcaag tattatacag aattagagca ggccatcaaa
17581 aacatcaaac acctaaagtg gataaatcgc gaacaacaaa aaaaatctat tttagtaaaa
17641 aaaaaatatt tttttgaaaa tttgtttttt tttcgaaacc ctaaagtttc gtgccctttg
17701 ggccgaagga cgcttctaga aatggctgag aaagaaggat taaaaagact taagtacctc
17761 cggtacgcgg ataacataat tttaggtgtt ataggcagca aacaagatgc cctagatatt
17821 agaaaagccg tgcaaaactt cctgcaggaa gaactgaagt tggacataaa cgaacagaaa
17881 agtaaaattt tacacgcaaa gtccgagatg gccaaatact taggcgcatt agtaatatat
17941 tacggcaccg gaagtgtgga aatcttaagc aaggtctccg atgtcaaaca aataagactc
18001 cgatctcgtc cacaactaat agctcccata aaggacttaa taactaaagc cttggaactt
18061 ggatacgcga aaaaaaaacg ccaagggctt agctcgagca acctccaatc cacgattggt
18121 ttcctcaggg gacaaacaca ttgttatcca cttctcatcg gtgatacgcg gcattgttaa
18181 ctactattca tttgtgaaca agcgctcttc cctgtggaaa gttgtgtcca tctataaaaa
18241 gtcctgcgcg ctaaccttgg ccagaaaaca cggcctacgg tcagccaaag ctgctttact
18301 caagtttggt ccaaacctgc gaatcacaga aaaaggcaag gaggtagcgt cactatacta
18361 ccccacctct ctaaaaacta caggaaagtt ccacgctact aggttcagtc aggttactat
18421 actcgaggaa ccatattatg gagtcaggtg actatattcg aggaaccctg tgatggagtg
18481 gcgtggatag agcggacaga ctactcacaa caaaaggctc tatcccgcgc ctccgacaaa
18541 aaaaatatgt ttaggctctc ttcctttggt cgagtgtctc gctttggtcg agtgtctctc
18601 tttggtcttc gcgcctctcc cctcttctcc ctttagcact cgcgcgtgga aatcgtaaag
18661 ccatttccaa cctgcggacc cttctttatc agccatttag actagttcaa aatttttttt
18721 gcagctttaa ctgtatcact tgagacctta tatcccggtc tcggcgctct tgctagggca
18781 accatagcca aatcaagaaa gcggatcacg ctatgcctac aatgacttag ttacaaggtg
18841 cagggagggg gaattgcaca gtagtcgcgc gtcccggtgt aaggagggct agtagcgctt
18901 atacggccaa gccacaaaag ctgaagaaat tgccatggac gagccacatg cagggaaact
18961 tgcacgtgtg gttctgaccg gggggaataa ccctatcggt attctttgat taattctgtt
19021 gcgcgaaatc catcattggc taagcaatta tttggttatg ccattttagg ttttgcttta
19081 actgaagcta ttgctttgtt tgccttaatg atggcatttt taatattatt cgtcttttaa
19141 tgtgctgcaa atatttcttt ttttcttttt tatttcctaa aacgagcacc tcagggctcg
19201 aacgtttcgt acgtttgagc ccttccctcg ccgcacacgc ataaaagagc taacaaaaaa
19261 aaacaagggg ggtatttgac ctttgcatct gcttagacag tagagccccg aaggaattca
19321 aaaaagattg ttcttgggta aggtaaacag ctttgtctct ctacccgtta gggtagagcg
19381 caaagagcag aaaacgtaat atttttttgc ttacttctcc agggtccaaa ggatacttat
19441 ttggcttgta aaaccccttc tttcgtaaag ccaaataagg aaaatcgtca agccatttcc
19501 gc
771 a.a.
(Note: Published size is 681 amino acids; a single frameshift in domain X replaces 34 C-terminal
amino acids with 124 amino acids similar to other domain X sequences)
VRLGRARLDQRRLTDYRTALALTCSGNRPQGTIAWVSAEFRRTVFTSASESSVTPSNE
GGPEGTKDQLNPLCKQPYGGNCRKQEKLLTNLNDEQTTKRENEGSPQWTPKRLVPRCQ
TPGDRGISKNVMGDRSVISTPRDTLGKPSRVYDDARNVKDSEGRTVDGNVPAEKAFFC
QKERVDQQDRAENGRLRKAEIFFFGSIAKSAKATTKTKANKGESRPVTPPALGRVFYE
DIYNIDNLRAGYKRLKGNVAPGIDGRTKADMTDKALEKLSKELRRQAYAPKPAKRIII
TKPDGGSRPLSIASTVDKVVQSTLKELVEPHFESLFRDSSHGFRPGRSCHKALRDLRY
SWTALTWLVQIDIKKDFDKIHHDLLIKEMESVLRSKALQDLMRKLLNAGYIDVYNLTD
RTQYNTEGVTQGSIISPLCANIFLHKLDCYVEDILIPNYNVGNMRPASAEYKKRLNIH
SKDKAFFKYYTELEQAIKNIKHLKWINREQQKKSILVKKKYFFENLFFFRNPKVSCPL
GRRTLLEMAEKEGLKRLKYLRYADNIILGVIGSKQDALDIRKAVQNFLQEELKLDINE
QKSKILHAKSEMAKYLGALVIYYGTGSVEILSKVSDVKQIRLRSRPQLIAPIKDLITK
ALELGYAKKNAKGLARATSNPRLVSSGDKHIVIHFSSVIRGIVNYYSFVNKRSSLWKV
VSIYKKSCALTLARKHGLRSAKAALLKFGPNLRITEKGKEVASLYYPTSLKTTGKFHA
TRFSQVTILEEPYYGVR