[Back to mitochondrial introns by organism] [Back to home page]

Information of Thalassiosira pseudooana cox1.I2 intron   (Format of information for each intron)

[intron sequence]

[intron with flanking sequence]

[ORF sequence]

[secondary structure]

[intron sequence]

 

The boundaries of the intron are marked as red and the ORF is marked as blue.

For many organellar introns, the ORF is translated in frame with the upstream

exon ORF. The precursor protein is then processed to a mature form that cuts 

off all of the exon-encoded and some of the intron-encoded polypeptide. This 

situation is indicated here by blue-coded ORF that extends to the start of 

the intron rather than by a translation start approximately 500 bp after the 

start of the intron.

 

Note:  This intron sequence occurs on the reverse complement of the original

GenBank sequence.  Therefore, all sequences below are reverse complements of

the GenBank sequence.

3' end

                     gtgcgg cgtctaacgt cgttttcgcc tgtcttgact ttgatcaaga 

41041 ctcacattac ttgcttattc gttaattaat gaatatattg attaaacgat ttgtgataaa 

41101 gcatgatgct tacctgagag ggctaacctc gcaagggacc atagcatgca tcagaaggcg 

41161 ctctgaaaga gtgagtgcca agctattaac taatatgggt aggtttacga atcccagtta 

41221 caactttctg gcagggtctg aagtccctat taaacgattg agtgttagaa attacgtagt 

41281 ctttctttca tcaagcaagt tcaacaagcc ggctgcaagt agcagtttta gaaaggggag 

41341 ccctaagtta gttttgcgac gaaaacgaac ggaattcgcg atgacccgaa attcgaaaga 

41401 atgggggtta cggagggtcc atagtactgc gcaatcttgc aggaaggaac ctagttatgt 

41461 taggtttgac aaagatacgc tttcgggtgt atacgaatca aatcagttgg acttattgag 

41521 atcttatatt atcagtaaca aaaaatgtgt caatttgagt agtatcatgt ctgatccaaa 

41581 ttttctaatt gccgcttggg ctagaatccg ctctaatagt ggaagtttaa cttttgcttt 

41641 gagtaaagaa accttagatg gaatcgctct atcttggtta gaggaaactg caaataccat 

41701 gcggaacgga atattccaat tttctccttc tagaagaact tacatttcta agtctgatgg 

41761 ggggaagaga cctttaacta ttccttctcc gagagataag attgtccaag aagccatgcg 

41821 ttttttatta atgctggttt tcgaaggtga ttttagcaaa aactctcatg gttgggtatc 

41881 cggtagaggc tgtcatactg ctttaaacca aattaaaatg gaatttgccc acgataattg 

41941 gcttattgaa ggggatattg atcaacaatt cccaagcttg aatcatcagg tattggtcaa 

42001 tttactgaaa accaagatag atgaccaagc ttttatagac ttaatctaca aataccttag 

42061 agtgggttat ggtgagtccc cagataaaat agttaaaatg cgcattggga caagccaagg 

42121 aggagtttta tctcctgttc tagcgaacat ttatatgact ccttttgaca aatgggttga 

42181 aagagatctt atacctaaat atactaaagg taaaagaagg aaagccaacc ctgtttatac 

42241 aaaaatgata cgttccggaa aagttactga tcactctatc cctagtttgt atgcacatga 

42301 tcgaaatttt attagacttc actatgttcg ttatgcggac gatttcatca tgggtttaaa 

42361 tggtccaaaa gtttattgta agcaaatagt tgatgagtgc aaaacgtttt tgttcgaaca 

42421 actaaaactt accttaaaca tcgaaaagac caagataact catagtcagt tagattccgc 

42481 aacattttta ggttatcgtg tatacaaaac taagctttct aaaatgaaaa tagctcataa 

42541 tctcaaaggt caactttctc gtagaaccac taatactgtt ttagacggtc ctaccgatca 

42601 aattgtaaaa aagttgaatg aacgaggtta caccaagaaa gacggttcac ctaccagaaa 

42661 tggaagattt ataaatcata cgttgtatga tatgatagaa cattataaaa cggtggaaag 

42721 aggtattctt caatactaca agttagctaa taattacggt agagtcgctg ctagagtaca 

42781 ttatatctta aaatactctt gtgcccttac cattgcctct aagatgaaac ttactactct 

42841 acggagggta tttaacaaat acggcaagaa tcttaatatt aaagatgaat cgggtaaaat 

42901 tattattagt taccctacgg tcgattatcg tcgtccgaaa aagtttacta tagctcctat 

42961 attagactat tcttcattag aagcatatat tgaccagtat gatcgtcgag tacaaagagg 

43021 tcgtaaagat cttaaaggcc cttgtgtatt atgtggcagt aatcaggata ttgaaattca 

43081 tcacgttcga aaatttagta aaactaagcg taaagactac ttatccagta tgatgtctag 

43141 aatgaatcga aaacaagttc ctgtttgcaa aaaatgtcat ataaaaatac atcagggcgt 

43201 atacgatggc aaacgagtca aataaaactt ctgtccaatt ttaacatgag tgagagccgt 

43261 atgcagggaa acttgcacgt acggttcggg gtcgatttat tgataagcaa cgagattggt 

43321 aaattaggcc ac

 

                                                                   

[top]


 

 

[intron& flanking sequence]

 

Genbank entry, intron is marked as red

 

                                                                 tagaaa 

40501 tttcaatacc acattttttg atcctgcagg aggaggtgat cctgtgttat ttcaacatct 

40561 tttttgattt tttggtcacc ctgaagttta tatacttatt ttacctggat ttggtattat 

40621 aagccacatt gtagttagta ccgcaaaaaa acctattttt ggttaccttg gtatggttta 

40681 cgctatgttt tctatcggtg ttttaggttt tatcgtatgg gctcaccata tgtttactgt 

40741 aggtttggat atagatacca gagcttactt tacagcagca actatgatta ttgctattcc 

40801 aacaggaatt aaaatattta gttgacttgc tacattatga ggtggttcta ttgatctacg 

40861 gaccccaggt ctttttgcaa ttggtttcat atttttattc accgtaggtg gcgttacagg 

40921 agttgttctt gctaattctg gtattgatat agctttacat gatacctatt atgtggtagc 

40981 acattttcat tacggtgcgg cgtctaacgt cgttttcgcc tgtcttgact ttgatcaaga 

41041 ctcacattac ttgcttattc gttaattaat gaatatattg attaaacgat ttgtgataaa 

41101 gcatgatgct tacctgagag ggctaacctc gcaagggacc atagcatgca tcagaaggcg 

41161 ctctgaaaga gtgagtgcca agctattaac taatatgggt aggtttacga atcccagtta 

41221 caactttctg gcagggtctg aagtccctat taaacgattg agtgttagaa attacgtagt 

41281 ctttctttca tcaagcaagt tcaacaagcc ggctgcaagt agcagtttta gaaaggggag 

41341 ccctaagtta gttttgcgac gaaaacgaac ggaattcgcg atgacccgaa attcgaaaga 

41401 atgggggtta cggagggtcc atagtactgc gcaatcttgc aggaaggaac ctagttatgt 

41461 taggtttgac aaagatacgc tttcgggtgt atacgaatca aatcagttgg acttattgag 

41521 atcttatatt atcagtaaca aaaaatgtgt caatttgagt agtatcatgt ctgatccaaa 

41581 ttttctaatt gccgcttggg ctagaatccg ctctaatagt ggaagtttaa cttttgcttt 

41641 gagtaaagaa accttagatg gaatcgctct atcttggtta gaggaaactg caaataccat 

41701 gcggaacgga atattccaat tttctccttc tagaagaact tacatttcta agtctgatgg 

41761 ggggaagaga cctttaacta ttccttctcc gagagataag attgtccaag aagccatgcg 

41821 ttttttatta atgctggttt tcgaaggtga ttttagcaaa aactctcatg gttgggtatc 

41881 cggtagaggc tgtcatactg ctttaaacca aattaaaatg gaatttgccc acgataattg 

41941 gcttattgaa ggggatattg atcaacaatt cccaagcttg aatcatcagg tattggtcaa 

42001 tttactgaaa accaagatag atgaccaagc ttttatagac ttaatctaca aataccttag 

42061 agtgggttat ggtgagtccc cagataaaat agttaaaatg cgcattggga caagccaagg 

42121 aggagtttta tctcctgttc tagcgaacat ttatatgact ccttttgaca aatgggttga 

42181 aagagatctt atacctaaat atactaaagg taaaagaagg aaagccaacc ctgtttatac 

42241 aaaaatgata cgttccggaa aagttactga tcactctatc cctagtttgt atgcacatga 

42301 tcgaaatttt attagacttc actatgttcg ttatgcggac gatttcatca tgggtttaaa 

42361 tggtccaaaa gtttattgta agcaaatagt tgatgagtgc aaaacgtttt tgttcgaaca 

42421 actaaaactt accttaaaca tcgaaaagac caagataact catagtcagt tagattccgc 

42481 aacattttta ggttatcgtg tatacaaaac taagctttct aaaatgaaaa tagctcataa 

42541 tctcaaaggt caactttctc gtagaaccac taatactgtt ttagacggtc ctaccgatca 

42601 aattgtaaaa aagttgaatg aacgaggtta caccaagaaa gacggttcac ctaccagaaa 

42661 tggaagattt ataaatcata cgttgtatga tatgatagaa cattataaaa cggtggaaag 

42721 aggtattctt caatactaca agttagctaa taattacggt agagtcgctg ctagagtaca 

42781 ttatatctta aaatactctt gtgcccttac cattgcctct aagatgaaac ttactactct 

42841 acggagggta tttaacaaat acggcaagaa tcttaatatt aaagatgaat cgggtaaaat 

42901 tattattagt taccctacgg tcgattatcg tcgtccgaaa aagtttacta tagctcctat 

42961 attagactat tcttcattag aagcatatat tgaccagtat gatcgtcgag tacaaagagg 

43021 tcgtaaagat cttaaaggcc cttgtgtatt atgtggcagt aatcaggata ttgaaattca 

43081 tcacgttcga aaatttagta aaactaagcg taaagactac ttatccagta tgatgtctag 

43141 aatgaatcga aaacaagttc ctgtttgcaa aaaatgtcat ataaaaatac atcagggcgt 

43201 atacgatggc aaacgagtca aataaaactt ctgtccaatt ttaacatgag tgagagccgt 

43261 atgcagggaa acttgcacgt acggttcggg gtcgatttat tgataagcaa cgagattggt 

43321 aaattaggcc actgttgtcg atgggtgcag tattttctat gctagggggt atctattttt 

43381 ggtttgaaaa aatcactgga gttagatatt cagaaatact aggaaaaatt catttttgga 

43441 gttttttcgt tggggtaaat ttaacctttt tcccaatgca ttttttgggt gttgcaggta 

43501 tgccaagacg aatccctgat tatcctgacg catattttac gtttaataaa attgcatctt 

43561 gaggttctta cgtttctgct atctcttcat tattcttttt ttatgtggta tttgaagctt 

43621 tttcatctaa caaaataagt aaaaaagaat actaattacc ctttcttaaa ttttttatag 

43681 attaatagat aaacgatttt tataacgcaa ccattcattt tgaattatga taaattcata 

43741 acttaaaaag tttatataat ttattgatta aatcaaatct ttggtatatc tatgcttatt 

43801 ttctttaaca tcatatcttt tatttag

[top]


[ORF sequence]

 

718 a.a.

 

MNILIKRFVIKHDAYLRGLTSQGTIACIRRRSERVSAKLLTNMGRFTNPSYNFLAGSE

VPIKRLSVRNYVVFLSSSKFNKPAASSSFRKGSPKLVLRRKRTEFAMTRNSKEWGLRR

VHSTAQSCRKEPSYVRFDKDTLSGVYESNQLDLLRSYIISNKKCVNLSSIMSDPNFLI

AAWARIRSNSGSLTFALSKETLDGIALSWLEETANTMRNGIFQFSPSRRTYISKSDGG

KRPLTIPSPRDKIVQEAMRFLLMLVFEGDFSKNSHGWVSGRGCHTALNQIKMEFAHDN

WLIEGDIDQQFPSLNHQVLVNLLKTKIDDQAFIDLIYKYLRVGYGESPDKIVKMRIGT

SQGGVLSPVLANIYMTPFDKWVERDLIPKYTKGKRRKANPVYTKMIRSGKVTDHSIPS

LYAHDRNFIRLHYVRYADDFIMGLNGPKVYCKQIVDECKTFLFEQLKLTLNIEKTKIT

HSQLDSATFLGYRVYKTKLSKMKIAHNLKGQLSRRTTNTVLDGPTDQIVKKLNERGYT

KKDGSPTRNGRFINHTLYDMIEHYKTVERGILQYYKLANNYGRVAARVHYILKYSCAL

TIASKMKLTTLRRVFNKYGKNLNIKDESGKIIISYPTVDYRRPKKFTIAPILDYSSLE

AYIDQYDRRVQRGRKDLKGPCVLCGSNQDIEIHHVRKFSKTKRKDYLSSMMSRMNRKQ

VPVCKKCHIKIHQGVYDGKRVK

[top]


[secondary structure]

 not available

 

[top]