Path: utzoo!utgpu!watserv1!watmath!uunet!cs.utexas.edu!usc!ucselx!bionet!root From: GenBank-Updates@genbank.bio.net Newsgroups: bionet.molbio.genbank.updates Subject: C.reinhardtii carbonic anhydrase (CAH2) gene, complete cds. Message-ID: Date: 21 Feb 91 13:03:33 GMT Sender: root@genbank.bio.net Distribution: bionet Lines: 196 Approved: lear@genbank.bio.net Checksum: 38471 11 LOCUS CRECAH2G 4858 bp ds-DNA PLN 21-FEB-1991 DEFINITION C.reinhardtii carbonic anhydrase (CAH2) gene, complete cds. ACCESSION D90207 X54488 KEYWORDS carbonic anhydrase. SOURCE C.reinhardtii (strain C-9 mt(-) haploid), genomic DNA, clones F1 and F9. ORGANISM Chlamydomonas reinhardtii Eukaryota; Plantae; Thallobionta; Chlorophycota; Chlorophyceae; Volvocales; Chlamydomonadaceae. REFERENCE 1 (bases 1 to 4858) AUTHORS Fukuzawa,H., Fujiwara,S., Tachiki,A. and Miyachi,S. TITLE Nucleotide sequences of two genes CAH1 and CAH2 which encode carbonic anhydrase polypeptides in Chlamydomonas reinhardtii JOURNAL Nucleic Acids Res. 18, 6441-6442 (1990) STANDARD full staff_entry REFERENCE 2 (bases 1 to 4858) AUTHORS Fujiwara,S., Fukuzawa,H., Tachiki,A. and Miyachi,S. TITLE Structure and differential expression of two genes encoding carbonic anhydrase in Chlamydomonas reinhardtii JOURNAL Proc. Natl. Acad. Sci. U.S.A. 87, 9779-9783 (1991) STANDARD full staff_entry COMMENT These data kindly submitted in computer readable form by: Hideya Fukuzawa Institute of Applied Microbiology University of Tokyo 1-1-1 Yayoi, Bunkyo-ku Tokyo 113 Japan Phone: 03-812-2111 x7846 Fax: 03-812-2910 All the features are predicted by comparing with CAH1. FEATURES Location/Qualifiers precursor_RNA <491..>4667 /note="carbonic anhydrase mRNA and introns" CDS join(491..586,855..904,1074..1128,1245..1319,1654..1708, 2022..2084,2288..2413,2592..2726,2968..3212,3414..3567, 3714..3802) /note="carbonic anhydrase (EC 4.2.1.1) precursor" mat_peptide join(551..586,855..904,1074..1128,1245..1319,1654..1708, 2022..2084,2288..2413,2592..2726,2968..3212,3414..3542, 3543..3567,3714..3799) /note="carbonic anhydrase, large subunit" misc_signal 245..248 /note="CAAT box" misc_signal 303..312 /note="AGCGGCTCGC element" misc_signal 340..345 /note="TATA box" misc_signal 372..375 /note="CAAT box" misc_signal 428..431 /note="TATA box" exon 491..586 /note="carbonic anhydrase (EC 4.2.1.1) precursor, exon 1" sig_peptide 491..550 /note="carbonic anhydrase signal peptide" intron 587..854 /note="intron 1" exon 855..904 /note="carbonic anhydrase, exon 2" intron 905..1073 /note="intron 2" exon 1074..1128 /note="carbonic anhydrase, exon 3" intron 1129..1244 /note="intron 3" exon 1245..1319 /note="carbonic anhydrase, exon 4" intron 1320..1653 /note="intron 4" exon 1654..1708 /note="carbonic anhydrase, exon 5" misc_feature 1678..1680 /note="glycosylation site" intron 1709..2021 /note="intron 5" exon 2022..2084 /note="carbonic anhydrase, exon 6" intron 2085..2287 /note="intron 6" exon 2288..2413 /note="carbonic anhydrase, exon 7" misc_feature 2296..2298 /note="glycosylation site" misc_binding 2380..2382 /note="zinc binding histidine residue" misc_binding 2386..2388 /note="zinc binding histidine residue" intron 2414..2591 /note="intron 7" exon 2592..2726 /note="carbonic anhydrase, exon 8" misc_binding 2615..2617 /note="zinc binding histidine residue" intron 2727..2967 /note="intron 8" exon 2968..3212 /note="carbonic anhydrase, exon 9" misc_feature 3201..3203 /note="glycosylation site" intron 3213..3413 /note="intron 9" exon 3414..3567 /note="carbonic anhydrase, exon 10" intron 3568..3713 /note="intron 10" exon 3714..3802 /note="carbonic anhydrase, exon 11" misc_signal 4663..4667 /note="poly(A) signal" BASE COUNT 954 a 1456 c 1439 g 1009 t ORIGIN 1 ctgcaggaag gtgaccagca tccacggatg agcaccagag gcaaccgcag agcaaggctg 61 acagtcgtcg tccgaggcca gccatggtta ttgccgaaag gggagctcca ggggagcggg 121 ggccgggggg aggggggctg gagacagctg gaggcggggg caaggcgcag acatgcagcg 181 agtcgaagcc caaacgcacg gagcccgaca ggcgctcaca tgaaacttcc tcgtgtgact 241 catgcaatct gatggcagac aacggcccgc tccaagctgc agagcggcgc ggcttcagat 301 gcagcggctc gcccacgtcc gcgacaaaca actggtttgt atatatgtct gttgtaaagc 361 ttgctggctg gcaatcctgg cgcgctgacg atgaaagtac ggcttgatgc cacgcttttc 421 ggcgcgataa aagcatgctt cgtagggcaa tttgggatag tttgttacta caattgatca 481 attgaacacc atggcgcgta ctggcgctct actcctggcc gcgctggcgc ttgcgggctg 541 cgcgcaggct tgcatctaca agttcggcac gtcgccggac tccaaggtgc gcacgtccga 601 agctgcatgt ctcgcaatga tcaagtgcga gttggaacga ggatgttgtt ttggcccctg 661 gttgccgttt ggcataaacg tgaatgtctc accgcgactg cgtcactgag cattcaactg 721 ctcgtttccg gctgcaaatg aagtcatatc catgggtatt actactagtc ggcattctgc 781 gcagcttcgg gcaggcatgc ccgttgcttg ggtgagtcct gaccgccgct tgcgccctcg 841 cttttgtttg acaggccact cacacaggcg accactggga tcatagtctc aatggcgaga 901 actggtaagg acctggcctg ctcacatgaa tggagtgact tgtggccgcc gtccggcacc 961 cactgctttg cacccctgaa ttattagtgc ggcgctaggc gcttgcgttg gttgcgcggg 1021 atcgcggcga ggccatgaga gaccaaacgg acttttttcg ccatcgcacg cagggagggc 1081 aaggacggcg cgggcaaccc ctgggtctgc aagactggcc gcaagcaggt ccgtaacaat 1141 acatgttgag agtttctgcc gactgggcat tgtcacgtga actttctggg aggttgccct 1201 cgaattcgct cccgcccctt cctcccccgc aaccgcacgc gcagtcgccc atcaacgtgc 1261 cccagtacca tgtcctggac gggaagggtt ccaagattgc caccggcctg cagacccagg 1321 tgagtagcca gcatgacaca acaggccagc gtagtgcgcc gccttgcgcc gcgtgtgcga 1381 gggtgtgtgc ctgtcacggc gccatccaac ttaaccgcct ggcttgcgcc ccgaagcgtg 1441 cttgcgcgta gactggagca ggggcaatgg tttcaagcat cctaacatgg gaggcatcag 1501 ggtttccatg gttgggtggc ccggttttgc tggtgcgaac ctcctgattg cgatcaacgg 1561 ccggttttgt gccgcaacgc cagccagatt gacagcgggt ttttaaggcc ccgttactct 1621 ccattctctt tccctgcctt attaatactg cagtggtcgt accctgacct gatgtccaac 1681 ggcagctcgg ttcaagtcat caacaacggt gcgagagaga cggcaaggtg gccggtgggc 1741 ttttgcaacg cggcggaccg cacagaaagg attcatggcg ccaagccgtt tagttgcatt 1801 tgctacaaaa gggacgcagt ccacaagccg gtagcgtgga gtgcgagcac gcttggtcac 1861 cccgccttgt gcggtagctg agacaccacg gcgcttgggg tctgttgtgt caaggattca 1921 agcaccagac ccaatccatg caagcacgtg gtgtttccgc taaacctgac tgaagcattt 1981 tgcctgcctg cctccatccc gccactccat cctgctctca ggccacacca tccaggtgca 2041 gtggacctac gactacgccg gccatgccac catcgccatc cctggtgcgt cagctggtgc 2101 tgcatctacc gcattacgcc agtgcaggcc ctgtcgacaa cccaatccgc aatcactggc 2161 cccttttctt cccatgacga catcggtcct tgatgcttac aagtgttagt gcttcccaga 2221 taggcatatc cacgtaccgt cttgacctct tctcgccccc catgcccgtc ctcccacgtg 2281 ctcgcagcca tgcgcaacca gagcaaccgc atcgtggacg tgctggagat gcgccccaac 2341 gacgcctccg accgcgtgac tgccgtgccc acccagttcc acttccactc cacctcggag 2401 cacctgctgg cgggtgcgtg ggcgtggtcg taactcgtga cttctgacag ggccaggtgt 2461 tgggaggggt gtttcaagca ttcggctccc ataaagcgat aaggtcccac caggtatcaa 2521 gcaagcacct gccttccgcg tgtcataaaa catccgctgg tgtcctcctt cgttacgtct 2581 accgcctgca ggcaagatct ttcctcttga gttgcacatt gtgcacaagg tgactgacaa 2641 gctagaggcc tgcaagggcg gctgcttcag cgtcaccggc atcctgttcc agctcgacaa 2701 cggccccgat aacgagctgc ttgagcgtga gtcgggcagc agaggcgggg cggtggaggc 2761 ggggcggggg cgggtttgtg cgaagctgtc acagtttgtg tggaatcggt acgcacgtgc 2821 atgccttatg agcgcttctc aggaggaccg ctgagcgctt ctcaggaggt gcctgatcca 2881 gtgcctctat ttcatcctct gttgtgccct tacatgctcg tttgcacgtt gactgatctc 2941 cacacttcat cgacccgtcc tctgcagcca tctttgcgaa catgcccacg cgcgagggca 3001 ccttcaccaa cctgccggcg ggcaccacca tcaagctggg tgagctgctg cccagcgacc 3061 gcgactacgt cacctacgag ggcagcctca ccaccccgcc ctgcagcgag ggcctgctgt 3121 ggcacgtcat gacccagccg cagcgcatca gcttcggcca gtggaaccgc taccgcctgg 3181 ctgtgggcga gaaggagtgc aactccacgg aggtaagata tgggggcgac agctgtgggg 3241 acgcaggctc gcaggtttgg aggagccctt cagggatttg ctccattatt ggttgggcga 3301 gcctttgcgt gagggcgcgc accttgccgc ctctgggtcc gcacacgcct ttctggtacc 3361 cagggttgtt cccggctaaa cgcaacgctg tgtccatctc atcattgcta cagaccgatg 3421 ctgcccacgc ggacgccggc catcatcacc accaccaccg ccgcctgctg cacaaccacg 3481 cgcacctgga ggaggtgcct gccgccacct ccgagcccaa gcactacttc cgccgcgtga 3541 tggaggagac cgagaacccc gatgcttgtg agtacgcacc gtgacgcaat gcgcacatac 3601 agtagccctg cacggaagct aagatagcta accccatcga cacgggttgc tctgatccga 3661 cctacgatcc ggcctgcatg ccccctgcct gcctctctcc tcccgcgcga cagacacctg 3721 cacgaccgtt gcctttggcc agaacttccg caacgcccag tacgccaacg gccgcaccat 3781 caagctggcc cgctacgagt aagacaggat tggcagaatt cgaaactact atgtgagcac 3841 tgttcgcgcg cgtatgacac tggtcgtgga cgagaggaag agcgcctgcg gcgctaaaag 3901 ctttagcacg gaactgaagc ataacacacg aggtcgggtt atcgtactga actcggaagg 3961 gcgaggccgc ttacactgcc ggaaggtagt tggcctcgtg actgtgaatg gtagcaaatc 4021 gagtgaggtt tttttttgta ggtcggatgc gttgcggatg gatcaaggag gatcaaggtg 4081 gggttgattg tgaggggttg ataagtacac cgggaacccc gaaggcagga gcggggagcc 4141 gatggcggcg gtggcagatg ctttccagcg tactgaataa tggctgccct gcctggaggt 4201 tttgagcgag tgcagttgat accttttttg aggtagttgt ggcgccttgg catggttttg 4261 ccgccggcaa cggccctgcc tttggccgtg cgtggtgctt gagggctgac cccatccacg 4321 gcccatgggt gatcagggca tcaagtttcg tgccccgcac cgccacatca cgcaataaac 4381 gcgttgatct tcccatatac tgtttgacat cggttttcag gtcgatgcta actaatttca 4441 cggctgagtc acacaatgct cgtgaccatt tatgcggcga tggatggcgg ggtgacagtg 4501 cagagccgta ggcccgtagc gcattatgtc gttggaggcg gtgtgtcaga cttcaaatcg 4561 gactcgaact cgcgacgtga atgatgcgtg tgagcttcac aggcgttgta gctttgtgca 4621 tggtgtacga gtggctgcaa gagagcacag ctgcttaggg cgtgtaagta tcaccggtac 4681 tgacacatca cccagcctga cctcgctgcc ccagagagcc cacacaaccc atgagcgcag 4741 caccggcagg cacctcctgg cacaacccat gagcgcaggt tgtgatcgcc agtgttgccg 4801 ctgccagtgc gctcatctcg tgccgtggtg cgcccgcatc aaccaggagc gtagggcc //