[Back to mitochondrial introns by organism] [Back to home page]

Information of Arabidopsis thaliana nad1.I4  intron   (Format of information for each intron)

[intron sequence]

[intron with flanking sequence]

[ORF sequence]

[secondary structure]

[intron sequence]

 

The boundaries of the intron are marked as red and the ORF is marked as blue.

For many organellar introns, the ORF is translated in frame with the upstream

exon ORF. The precursor protein is then processed to a mature form that cuts 

off all of the exon-encoded and some of the intron-encoded polypeptide. This 

situation is indicated here by blue-coded ORF that extends to the start of 

the intron rather than by a translation start approximately 500 bp after the 

start of the intron.

3' end

                    

                                          gtgcgg aactttgcat ctgacattcg 

 541 ttgggcttcc ctcttcggga gcctgcgtcc cggccgtttt tgtgcaataa acccctccgg 

 601 ccgaagacta gtggtaggtg gtcccgcgta cgtttcggag aagggtagcc ttgtgtgtaa 

 661 gcacagcaat gaaccgcggc gaaccctcag acgacctatc taagattagg ggggggggag 

 721 atcctcagta gtggtgaccc tttcacctct tcccacggac tgatacatgt accgaatgct 

 781 catacgggaa agttgactcc tgggtctgga acctgggggg ttgctccgag aaatcctttc 

 841 tttctcgtcc actcaggggg gtgcggacac acctgcgcgg attacaggtg acggttacaa 

 901 gaatgcgggg aagtgaacag tacccgacga cattcaggga tggatgtaga cccatcgggc 

 961 agggataatc attccggtcc tgggagaggt attgattact cgactaaaag gagaggaacc 

1021 ccatcgactg aaagaagagg agaggtggcg accattttca agaaccaaaa agactgaact 

1081 gagggaagcc ctatgagtca ctgaaacgac ggcaggaggg ggcctttgat tataaaaaag 

1141 gggagcaaaa aacgagcttt tccccccttt acaatatgaa gaaagaaaga agggttgaag 

1201 tttagaccgc tcacagtagt tctacctata gaaaagatca tgaaagaggc gatcagaatg 

1261 gtactcgaat ccatttacga tcccgagttt ccagacacat cgcatttccg ctcgggtcaa 

1321 ggctgccact cggtcctaag acggatcaaa gaagagtggg gaatctctcg ctggttttta 

1381 gaattcgaca tcaggaagtg ttttcacacc atcgaccgac atcgactcat cccaattttg 

1441 aaggaagaga tcgacgatcc caagttcttt tactccattc agaaagtatt ttccgccgga 

1501 cgactcgtag gagttgagag gggcccttac tccgtcccac acagtgtact actatcggcc 

1561 ctaccaggca acatctacct acacaagctc gatcaggaga tagggaggat ccgacagaag 

1621 tacgaaattc cgattgttca gagagtcaga tcggttctat taaggacagg tcgtcgtatt 

1681 gatgaccaag aaaaccctgg agaagaagca agcttcaacg ctccccaaga caacagagcc 

1741 atcattgtgg ggagcgttaa gagcatgcaa cgcaaagcgg cctttcattc ccttgtttcg 

1801 tcgtggcaca ccccccccac aagcaccctc cggctcaggg gggaccagaa aaggcctttc 

1861 gttttccccc cttcgtcggc ccttgccgtc ttccttaaca agccctcgag ccttctttgc 

1921 gccgccttcc tcatagaagc cgccgggttg accccgaagg ctgaattcta tggtggagaa 

1981 cgctgtaata ataattgggc catgagagac cttcttaagt attgcaaaag aaagggcctg 

2041 ctgatagagc tgggcgggga ggcgatacta gttatcaggt cagagagagg cctggcccgt 

2101 aagcaggccc ccttaaaaac ccattactta ataaggattt gttacgcgcg atatgccgac 

2161 gacttactac tgggaatcgt gggtgccgta gagcttctca tagaaataca aaaacgtatc 

2221 gcccatttcc tacaatctgg cctgaacctt tgggtaggct ccgcaggatc aacaacaata 

2281 gctgcacgga gtacggtaga attccttggt acggtcattc gggaagtccc tccgaggacg 

2341 actcccatac aatttttgcg agagctggaa aagcgtctac gggtaaagca ccgtatccat 

2401 ataactgctt gccacctacg ctccgccatc cattcaaagt ttaggaacct aggtgatagt 

2461 atcccgatca aacagctgac gaaggggatg agcaaaacag ggagtctaca ggacggggtt 

2521 caactagcgg agactcttgg aacagctgga gtcagaagtc cccaagttag cgtattatgg 

2581 gggaccgtca agcacatccg gcaaggatca agggggatct cgttcttgca tagctcaggt 

2641 cggagcaacg cgtcatcgga cgttcaacag gtagtctcac gatcgggcac tcatgcccgt 

2701 aagttgtcat tgtatactcc cccgggtcgg aaggcggcgg gggagggagg aggacactgg 

2761 gcgggatcta tcagcagcga attccccata aagatagagg cacctataaa aaagatactc 

2821 cgaaggcttc gggatcgagg tatcattagc cgaagaagac cctggccaat ccacgtggcc 

2881 tgtttgacga acgtcagcga cgaagacatc gtaaattggt ccgcgggcat cgcgataagt 

2941 cctctgtcct actacaggtg ccgcgacaac ctttatcaag tccgaacgat tgtcgaccac 

3001 cagattcgct ggtctgcaat attcacccta gcccacaagc acaaatcctc ggcgcggaat 

3061 ataatcctca agtactccaa agactcaaat attgtaaatc aagaaggtgg caagatcctt 

3121 gcagagttcc ccaacagcat agagcttggg aagctcggac ccggtcaaga cctgaacaag 

3181 aaggaacact caactactag tctagtctag tagtcgtttt tttctattag ttgcgatgcg 

3241 aacaggcgtt tacttatgag attagttgag tagacttgcc tgagttgtct gctataagat 

3301 agagctagtt ttggggtagg gcttttgaca taaaaaagcc ggataggctt ggcttcgcta 

3361 tcgctcatga cttgtattgt agtcggcccg gaatgcctcg gtagtctttc taatgccttc 

3421 ttccttcatt cattttcttt tagttgcggt agcttccgcg ccagcaagat acggacagcg 

3481 aagccaaagc aatactaaac aagcgagaaa agtccttgtt attagtaaag cgctaacgat 

3541 caagaaaagg ccccttacta ctagataggc taacacgcct ttactaatta tatatatata 

3601 ataaggtatt tctcaaagta aagtttctag cttgtttctt tagaaagatt gcggggggcg 

3661 ctcacgtttt ttggcccctt cccggcccgg aagttcgctt ccggcgacta gcttttacgc 

3721 tatcgcttgg acttgtcact tcgtaccttg aatcaatcaa tgaatgaatg aaaacgcctg 

3781 actaggaata gaaaggaagg acaggttggt tcgaggaccc cttggtcaaa ggaaaggtac 

3841 aaaggaactc gaccacttgt tgggagaggt tgtgaaacaa actcgactaa aaggagaggt 

3901 ataaaaatga ttccggggcg gagccgtatg acgcgagagt gtcacgtacg gtttctttga 

3961 gaagggtgtg ataccaccac ctatcaggcc cgacgagcgg tccacggagc tgcatcccta 

4021 ctcacc

 

                                                                   

[top]


 

[intron& flanking sequence]

 

Genbank entry, intron is marked as red

 

                    ctctcc ttttagtcga gtaatcaatc cctcgactca tctctctatc 

  61 cataaaaaag cccttcccag aggagagcgg aagctccagg atgagtatgc cctggattcc 

 121 cggattcatt ctcgtcctga tccactctgg aattttcgga atctacaaaa acattctagg 

 181 aaagggctcg taatgatctt ctcaaagatc acaaggtgct gaagtggcag gataggaggg 

 241 ggatccgtag tttttgtgaa agggatgctc ttacttatca tgtctgaaaa gaaaaaccga 

 301 attcctcgat ttcattcgat tcatccacat ttccatttct tgtagaggaa ggctaactgt 

 361 gcttgctggc tgggagctgt atgagcggta acgtccacgt acggttccgt gagaagggcg 

 421 gtggacagaa atggccttgt tgtaccttac tctcgtcttc aatggggtct gctctttttt 

 481 ttttgggaga gtatgccaat atgatcttaa tgaggtgcgg aactttgcat ctgacattcg 

 541 ttgggcttcc ctcttcggga gcctgcgtcc cggccgtttt tgtgcaataa acccctccgg 

 601 ccgaagacta gtggtaggtg gtcccgcgta cgtttcggag aagggtagcc ttgtgtgtaa 

 661 gcacagcaat gaaccgcggc gaaccctcag acgacctatc taagattagg ggggggggag 

 721 atcctcagta gtggtgaccc tttcacctct tcccacggac tgatacatgt accgaatgct 

 781 catacgggaa agttgactcc tgggtctgga acctgggggg ttgctccgag aaatcctttc 

 841 tttctcgtcc actcaggggg gtgcggacac acctgcgcgg attacaggtg acggttacaa 

 901 gaatgcgggg aagtgaacag tacccgacga cattcaggga tggatgtaga cccatcgggc 

 961 agggataatc attccggtcc tgggagaggt attgattact cgactaaaag gagaggaacc 

1021 ccatcgactg aaagaagagg agaggtggcg accattttca agaaccaaaa agactgaact 

1081 gagggaagcc ctatgagtca ctgaaacgac ggcaggaggg ggcctttgat tataaaaaag 

1141 gggagcaaaa aacgagcttt tccccccttt acaatatgaa gaaagaaaga agggttgaag 

1201 tttagaccgc tcacagtagt tctacctata gaaaagatca tgaaagaggc gatcagaatg 

1261 gtactcgaat ccatttacga tcccgagttt ccagacacat cgcatttccg ctcgggtcaa 

1321 ggctgccact cggtcctaag acggatcaaa gaagagtggg gaatctctcg ctggttttta 

1381 gaattcgaca tcaggaagtg ttttcacacc atcgaccgac atcgactcat cccaattttg 

1441 aaggaagaga tcgacgatcc caagttcttt tactccattc agaaagtatt ttccgccgga 

1501 cgactcgtag gagttgagag gggcccttac tccgtcccac acagtgtact actatcggcc 

1561 ctaccaggca acatctacct acacaagctc gatcaggaga tagggaggat ccgacagaag 

1621 tacgaaattc cgattgttca gagagtcaga tcggttctat taaggacagg tcgtcgtatt 

1681 gatgaccaag aaaaccctgg agaagaagca agcttcaacg ctccccaaga caacagagcc 

1741 atcattgtgg ggagcgttaa gagcatgcaa cgcaaagcgg cctttcattc ccttgtttcg 

1801 tcgtggcaca ccccccccac aagcaccctc cggctcaggg gggaccagaa aaggcctttc 

1861 gttttccccc cttcgtcggc ccttgccgtc ttccttaaca agccctcgag ccttctttgc 

1921 gccgccttcc tcatagaagc cgccgggttg accccgaagg ctgaattcta tggtggagaa 

1981 cgctgtaata ataattgggc catgagagac cttcttaagt attgcaaaag aaagggcctg 

2041 ctgatagagc tgggcgggga ggcgatacta gttatcaggt cagagagagg cctggcccgt 

2101 aagcaggccc ccttaaaaac ccattactta ataaggattt gttacgcgcg atatgccgac 

2161 gacttactac tgggaatcgt gggtgccgta gagcttctca tagaaataca aaaacgtatc 

2221 gcccatttcc tacaatctgg cctgaacctt tgggtaggct ccgcaggatc aacaacaata 

2281 gctgcacgga gtacggtaga attccttggt acggtcattc gggaagtccc tccgaggacg 

2341 actcccatac aatttttgcg agagctggaa aagcgtctac gggtaaagca ccgtatccat 

2401 ataactgctt gccacctacg ctccgccatc cattcaaagt ttaggaacct aggtgatagt 

2461 atcccgatca aacagctgac gaaggggatg agcaaaacag ggagtctaca ggacggggtt 

2521 caactagcgg agactcttgg aacagctgga gtcagaagtc cccaagttag cgtattatgg 

2581 gggaccgtca agcacatccg gcaaggatca agggggatct cgttcttgca tagctcaggt 

2641 cggagcaacg cgtcatcgga cgttcaacag gtagtctcac gatcgggcac tcatgcccgt 

2701 aagttgtcat tgtatactcc cccgggtcgg aaggcggcgg gggagggagg aggacactgg 

2761 gcgggatcta tcagcagcga attccccata aagatagagg cacctataaa aaagatactc 

2821 cgaaggcttc gggatcgagg tatcattagc cgaagaagac cctggccaat ccacgtggcc 

2881 tgtttgacga acgtcagcga cgaagacatc gtaaattggt ccgcgggcat cgcgataagt 

2941 cctctgtcct actacaggtg ccgcgacaac ctttatcaag tccgaacgat tgtcgaccac 

3001 cagattcgct ggtctgcaat attcacccta gcccacaagc acaaatcctc ggcgcggaat 

3061 ataatcctca agtactccaa agactcaaat attgtaaatc aagaaggtgg caagatcctt 

3121 gcagagttcc ccaacagcat agagcttggg aagctcggac ccggtcaaga cctgaacaag 

3181 aaggaacact caactactag tctagtctag tagtcgtttt tttctattag ttgcgatgcg 

3241 aacaggcgtt tacttatgag attagttgag tagacttgcc tgagttgtct gctataagat 

3301 agagctagtt ttggggtagg gcttttgaca taaaaaagcc ggataggctt ggcttcgcta 

3361 tcgctcatga cttgtattgt agtcggcccg gaatgcctcg gtagtctttc taatgccttc 

3421 ttccttcatt cattttcttt tagttgcggt agcttccgcg ccagcaagat acggacagcg 

3481 aagccaaagc aatactaaac aagcgagaaa agtccttgtt attagtaaag cgctaacgat 

3541 caagaaaagg ccccttacta ctagataggc taacacgcct ttactaatta tatatatata 

3601 ataaggtatt tctcaaagta aagtttctag cttgtttctt tagaaagatt gcggggggcg 

3661 ctcacgtttt ttggcccctt cccggcccgg aagttcgctt ccggcgacta gcttttacgc 

3721 tatcgcttgg acttgtcact tcgtaccttg aatcaatcaa tgaatgaatg aaaacgcctg 

3781 actaggaata gaaaggaagg acaggttggt tcgaggaccc cttggtcaaa ggaaaggtac 

3841 aaaggaactc gaccacttgt tgggagaggt tgtgaaacaa actcgactaa aaggagaggt 

3901 ataaaaatga ttccggggcg gagccgtatg acgcgagagt gtcacgtacg gtttctttga 

3961 gaagggtgtg ataccaccac ctatcaggcc cgacgagcgg tccacggagc tgcatcccta 

4021 ctcacctggt ccatgcacat tgttctttcc aggaggttgg ccgcctatcc tagatcttcc 

4081 cattttcaag aagatcccgg gctcgatctg gtttagtatc aaggttcttc tctttctgtt 

4141 cctatatata tgggtccgtg cagcatttcc acgatatcgt tatgatcaat taatgggact 

4201 tggccggaaa gtgttcttgc ctctatcatt agctcgggta gtccctgttt ctggtctttt 

4261 agtcaccttt caatggcttc cttaatttta ttatgtgcga ggaattgccc tcttggagta 

4321 atgggaagcg ggctagtccc cgaaaatgcc cgttaatcaa gcaagttggg gaacaaaatc 

4381 ttccttgtta gttactcatt tcttcggtcg agcgttctcc ggacgtcgag aaatctatca 

4441 ctcaatcact ggccgctctg taattgtctt gattttaggt ttttgatcac actcgaaatt 

4501 aaattatgta tctacttatc gtattt

[top]


[ORF sequence]

 

656 a.a.

 

MKEAIRMVLESIYDPEFPDTSHFRSGQGCHSVLRRIKEEWGISRWFLEFDIRKCFHTI

DRHRLIPILKEEIDDPKFFYSIQKVFSAGRLVGVERGPYSVPHSVLLSALPGNIYLHK

LDQEIGRIRQKYEIPIVQRVRSVLLRTGRRIDDQENPGEEASFNAPQDNRAIIVGSVK

SMQRKAAFHSLVSSWHTPPTSTLRLRGDQKRPFVFPPSSALAVFLNKPSSLLCAAFLI

EAAGLTPKAEFYGGERCNNNWAMRDLLKYCKRKGLLIELGGEAILVIRSERGLARKQA

PLKTHYLIRICYARYADDLLLGIVGAVELLIEIQKRIAHFLQSGLNLWVGSAGSTTIA

ARSTVEFLGTVIREVPPRTTPIQFLRELEKRLRVKHRIHITACHLRSAIHSKFRNLGD

SIPIKQLTKGMSKTGSLQDGVQLAETLGTAGVRSPQVSVLWGTVKHIRQGSRGISFLH

SSGRSNASSDVQQVVSRSGTHARKLSLYTPPGRKAAGEGGGHWAGSISSEFPIKIEAP

IKKILRRLRDRGIISRRRPWPIHVACLTNVSDEDIVNWSAGIAISPLSYYRCRDNLYQ

VRTIVDHQIRWSAIFTLAHKHKSSARNIILKYSKDSNIVNQEGGKILAEFPNSIELGK

LGPGQDLNKKEHSTTSLV

[top]


[secondary structure]

 not available

 

[top]