Path: utzoo!utgpu!watserv1!watmath!uunet!zaphod.mps.ohio-state.edu!usc!ucselx!bionet!root From: GenBank-Updates@genbank.bio.net Newsgroups: bionet.molbio.genbank.updates Subject: C.reinhardtii carbonic anhydrase (CAH1) gene, complete cds. Message-ID: Date: 21 Feb 91 13:03:30 GMT Sender: root@genbank.bio.net Distribution: bionet Lines: 209 Approved: lear@genbank.bio.net Checksum: 15228 12 LOCUS CRECAH1G 5009 bp ds-DNA PLN 21-FEB-1991 DEFINITION C.reinhardtii carbonic anhydrase (CAH1) gene, complete cds. ACCESSION D90206 D90114 X54487 KEYWORDS carbonic anhydrase. SOURCE C.reinhardtii (strain C-9 mt(-) haploid), genomic DNA, clones F1 and gtCA3. ORGANISM Chlamydomonas reinhardtii Eukaryota; Plantae; Thallobionta; Chlorophycota; Chlorophyceae; Volvocales; Chlamydomonadaceae. REFERENCE 1 (bases 3938 to 4819) AUTHORS Fukuzawa,H., Fujiwara,S., Yamamoto,Y., Dionisio-Sese,M.L. and Miyachi,S. TITLE cDNA cloning, sequence, and expression of carbonic anhydrase in Chlamydomonas reinhardtii: Regulation by environmental CO2 concentration JOURNAL Proc. Natl. Acad. Sci. U.S.A. 87, 4383-4387 (1990) STANDARD full staff_entry REFERENCE 2 (bases 1 to 5009) AUTHORS Fukuzawa,H., Fujiwara,S., Tachiki,A. and Miyachi,S. TITLE Nucleotide sequences of two genes CAH1 and CAH2 which encode carbonic anhydrase polypeptides in Chlamydomonas reinhardtii JOURNAL Nucleic Acids Res. 18, 6441-6442 (1990) STANDARD full staff_entry REFERENCE 3 (bases 1 to 5009) AUTHORS Fujiwara,S., Fukuzawa,H., Tachiki,A. and Miyachi,S. TITLE Structure and differential expression of two genes encoding carbonic anhydrase in Chlamydomonas reinhardtii JOURNAL Proc. Natl. Acad. Sci. U.S.A. 87, 9779-9783 (1991) STANDARD full staff_entry COMMENT These data kindly submitted in computer readable form by: Hideya Fukuzawa Institute of Applied Microbiology University of Tokyo 1-1-1 Yayoi, Bunkyo-ku Tokyo 113 Japan Phone: 03-812-2111 x7846 Fax: 03-812-2910 The amino acid sequence Thr-168 reported by Kamo et. al. in Eur. J. Biochem. 192:557-562 (1990) was revised to Ser-168 in [Proc. Natl. Acad. Sci. U.S.A. 87, 4383-4387 (1990)]. FEATURES Location/Qualifiers precursor_RNA <562..4819 /note="carbonic anhydrase mRNA and introns" CDS join(592..687,950..999,1185..1239,1455..1529,1850..1904, 2225..2287,2409..2534,2741..2875,3098..3342,3499..3643, 3938..4026) /note="carbonic anhydrase (EC 4.2.1.1) precursor" mat_peptide join(652..687,950..999,1185..1239,1455..1529,1850..1904, 2225..2287,2409..2534,2741..2875,3098..3342,3499..3618, 3619..3643,3938..4023) /note="carbonic anhydrase, large subunit" misc_signal 382..391 /note="AGCGGCTCGC element" misc_signal 442..445 /note="CAAT box" misc_signal 521..524 /note="TATA box" exon 592..687 /note="carbonic anhydrase (EC 4.2.1.1) precursor, exon 1" sig_peptide 592..651 /note="carbonic anhydrase signal peptide" intron 688..949 /note="carbonic anhydrase intron 1" exon 950..999 /note="carbonic anhydrase precursor, exon 2" intron 1000..1184 /note="carbonic anhydrase intron 2" exon 1185..1239 /note="carbonic anhydrase precursor, exon 3" intron 1240..1454 /note="carbonic anhydrase intron 3" exon 1455..1529 /note="carbonic anhydrase precursor, exon 4" intron 1530..1849 /note="carbonic anhydrase intron 4" exon 1850..1904 /note="carbonic anhydrase precursor, exon 5" misc_feature 1874..1876 /note="glycosylation site" intron 1905..2224 /note="carbonic anhydrase intron 5" exon 2225..2287 /note="carbonic anhydrase precursor, exon 6" intron 2288..2408 /note="carbonic anhydrase intron 6" exon 2409..2534 /note="carbonic anhydrase precursor, exon 7" misc_feature 2417..2419 /note="glycosylation site" misc_binding 2501..2503 /note="zinc binding histidine residue" misc_binding 2507..2509 /note="zinc binding histidine residue" intron 2535..2740 /note="carbonic anhydrase intron 7" exon 2741..2875 /note="carbonic anhydrase precursor, exon 8" misc_binding 2764..2766 /note="zinc binding histidine residue" intron 2876..3097 /note="carbonic anhydrase intron 8" exon 3098..3342 /note="carbonic anhydrase precursor, exon 9" misc_feature 3331..3333 /note="glycosylation site" intron 3343..3498 /note="carbonic anhydrase intron 9" exon 3499..3643 /note="carbonic anhydrase precursor, exon 10" intron 3644..3937 /note="carbonic anhydrase intron 10" exon 3938..4026 /note="carbonic anhydrase precursor, exon 11" misc_signal 4796..4800 /note="poly(A) signal" misc_feature 4814..4819 /note="poly(A) site" repeat_region 4932..4981 /note="A-C repeat" BASE COUNT 1004 a 1553 c 1423 g 1029 t ORIGIN 1 gaattcccta cctccacgct tccgtgcctc cccaaccctc cgaagacccg ccagaatgtg 61 tccataggca cccatatgcg tgcagagcac gcatgatttc tgctcctgga cctggagcca 121 atcaaatcca gtccgcaggg ctccggtcca gcccagcctt ccacaccggc tgggcgccca 181 gcaacccaac agccgcaaga ataaaatagt caaggctgta gcctttcaag ccgcgccacc 241 acgttggcag cgttgtagat tttcaccggt tggaaggagg tttgacgcga gggatttagg 301 tgcgagtccg acttacgaac tgcgaaaagg gttccccctt tgccccgggg tatgacatgg 361 gtgccggaac tccaaccaag tagcggctcg caaccccggc tgcagactgt gcgcatgcag 421 aaatgcagct tctggtttta acaattgttt gatggcatat gcgatttctc tccacgccca 481 tgtgacggcc agcgaccgcc gtaccgttgt cactttttga tataagtccg gatgcaacca 541 tagttgtcca caccctgcgt tgagtcatta cctgcaaccc acttgaacac catggcgcgt 601 actggcgctc tactcctggt cgcgctggcg cttgcgggct gcgcgcaggc ttgcatctac 661 aagttcggca cgtcgccgga ctccaaggtg cgcacttgaa ataggctaac cgactcgaac 721 aatgaaccaa gaacaagctt gtcaagcaga tgcatgattc ggctgttggt tgccgatgcg 781 caattctatg cctcgacaag gccgccgccg cgtcgtgagc atatcgatcg gccaacctgt 841 ccagccatct gttgtgatag gctgtttgcg cccccactca gccttgctgt gccaagtcgt 901 tgcttttcgc gttgtgagaa cagatttcgt catcgcgatt gttttacagg ccaccgtttc 961 gggtgatcac tgggaccatg gcctcaacgg cgagaactgg taaggccctt ggatgcagtc 1021 agacacatgc atgcatgact tgacgtttgc gacggatgct gggcgcgcac tgctttgcat 1081 cgctgcattt gcaacacgcc cttgtgtgct caacatggga ttgtggtcag gctagggcga 1141 tgaacggttc tggtccctgt cgacttgggc atccccacat acagggaggg caaggacggc 1201 gcaggcaacg cctgggtttg caagactggc cgcaagcagg tccgtggaca acgcggatca 1261 atgacggctt ttcagcaaga ccgcttagca tgaaaacacg actgcacggg acgggttgca 1321 tgggacggac tggcggaccg aactcttgcc gtgcccccca cccccccccc ccccgccccc 1381 caccccccaa cagttcatca tgttgcgcgt gaccccttac cccgcccctt cctcccccgc 1441 aaccgcacgc gcagtcgccc atcaacgtgc cccagtacca ggtcctggac gggaagggtt 1501 ccaagattgc caacggcctg cagacccagg tgggtagcca acagggtgta acggtgggag 1561 gcagcgtgtc gcgccacgtg tgctgtcacg gcgccatcca acctaaccgg cccgccagcg 1621 ccccggaagg cgttgcgcgg agactggagc acgggcaagg gttcaagcac cctaacatgg 1681 gaggagtcag ggtttccatg gttggatggc ccggtttatt ttgccggggc gagccgcctg 1741 attgcgatta accaccggtt ttgtaccgcg caacgccggc cagattgcca gcgggtttca 1801 ctggccactt taacctccct cctctttccc tgccttgttc atactgcagt ggtcgtaccc 1861 tgacctgatg tccaacggca cctcggtcca agtcatcaac aacggtgaga gcgaggcgat 1921 gtggctggat gggttttgca acgcggcgga ccgcacagaa ggattcatag cgccaagcag 1981 tttagattgc atttgctacg gaagggacct gccagctgac gcagttcacg agccggtagc 2041 gtggagtgcg agcacgctta gtaacccagc cttgtgcgca gtcatgcatc tgggtgcctt 2101 ggtttcttgc aggcgttgtg gggcgaagaa gtttgttgca gtatctggga cactgcggtg 2161 gttttctacg gtcggtaacc tcaccaaatc ctgcctgcct ccatcccgcc acccctatcc 2221 tcaggccaca ccatccaggt gcagtggact tacaactacg ccggccatgc caccatcgcc 2281 atccctggtg cgtcagctgg tgctgcatct ctgcatctcc tatctattcg gtaccatagc 2341 gtgccaacgc cgcgtatctt gtctcacgtc attggtaacc tcctccttct cctcccacgt 2401 gctcgcagcc atgcacaacc agaccaaccg catcgtggac gtgctggaga tgcgccccaa 2461 cgacgccgcc gaccgcgtga ctgccgtgcc cacccagttc cacttccact ccacctcgga 2521 gcacctgctg gcgggtgcgt gggcgtgggc gcaactcttg acttttgaca ggggcgggtg 2581 ttgagaaggg tgttgggaga ggtgtaaagc tctaagtagg caatgggctc caaccaaacg 2641 aaaggtccca caagtcccaa cccccacccc acctagcacc tgccttccgc acacaacaca 2701 tccgctggtg cctttccttc gttacgtcta ccgcctacag gcaagatcta tccccttgag 2761 ttgcacattg tgcaccaggt gactgagaag ctggaggcct gcaagggcgg ctgcttcagc 2821 gtcaccggca tcctgttcca gctcgacaac ggccccgata acgagctgct tgagcgtgag 2881 tcgggcagca ggggcggggt ggtggaggcg gggcgggggc gggtttgtac gaagctgtca 2941 cggtttgtgt ggatttggta cgtacgtgca tggcagaccg ctgagcgctt ctcaggaggt 3001 gcctcatcca gtgcctctat ttcatcctct gttgtgccct tacatgctcg tttgcacgtt 3061 tactgatcgc cacacttcat cggcccgtcc tctgcagcca tctttgcgaa catgccctcg 3121 cgcgagggca ccttcagcaa cctgccggcg ggcaccacca tcaagctggg tgagctgctg 3181 cccagcgacc gcgactacgt aacgtacgag ggcagcctca ccaccccgcc ctgcagcgag 3241 ggcctgctgt ggcacgtcat gacccagccg cagcgcatca gcttcggcca gtggaaccgc 3301 taccgcctgg ctgtgggcct gaaggagtgc aactccacgg aggtaagacg cccctacaaa 3361 cgggggagtc ctgggaacgg gcaagaagag cgaggtgtcg ggtgacggtg gcggctgccg 3421 ggtcaaatct gcgtcactgg tcagtgctcc tgtcacgcac gcctatgtgc tgcccatgtt 3481 catttaccac gcgtgcagac cgccgcggac gccggccacc accaccacca ccgccgcctg 3541 ctgcacaacc acgcgcacct ggaggaggtg cctgccgcca cctccgagcc caagcactac 3601 ttccgccgcg tgatgctggc cgagtccgcg aaccccgatg cctgtgagtt tacctgagag 3661 tggtaaaaag ggagtaccgg catgcattgt accctagaag ctgggcgagg gcgcggtatc 3721 ccaaaggctc ccccaagggg agacaccgag tcatgcgtgc tgagttccat tggattgcgt 3781 ctgcccaata gtcggtccga gctgctgcca agcaagtccg tgcacgtggt cgtccactcc 3841 tgcgtgtgcc tgcgcgcctg cgcgccgtgt ttcttttgta acgcccccta accccgacct 3901 tattgcctcc ctctttgcct cccccccaat gcgacagaca cctgcaaggc cgttgccttt 3961 ggccagaact tccgcaaccc ccagtacgcc aacggccgca ccatcaagct ggcccgctat 4021 cactaaactt cccagtagtt agtcacgcta ccaccgtcgg cacggccagc aggcattcca 4081 ttttccaggc tttgcttcac ggtttggtgt gtcattcgat ggtgttcttg acgacccgcg 4141 cttggcgggc ctttccaatt ttttccatag tacaccgaaa tagttctgcg gtgcagcacg 4201 catacacaca gtaccggacg ggcggcggga cctcctgttt tctcctgact agtaaagaag 4261 taaggaaggt atggagttgg ttccacgatg gggcagtctg agagcggaat aaagtcagtg 4321 ggccggacgt tgtggcgatg gatggtagtg aggcaagtaa tacgtacgta gagggcgtac 4381 gcgggtaata acgggaactt cgacagcaat cgagagtgtc tgcacgcgag acatttgcgt 4441 acaggggagg caccggttct cctcgatgag tgatccgtac ttatgcaagt tatataaggc 4501 tggtgtgggg cgcttcagca cggtatggtt gccagcatgc acggtccggc ctctgtctgg 4561 ctggctgggt tgttggcggg ctggcctcat gcgcgccgtc gcacatgccg atcaatgcag 4621 ttgctctcca gtagctgcaa ggcctggctg ggcaatccca tagccatgtc gaatgtgaag 4681 cattgttttc ttggagatgg aggacaggag acgctgaccg gatgttttaa gacgtgcagg 4741 atgtggggag cgaggtagct acaacggtgc agttgaggca gagacgtgta cgacatgtaa 4801 gatgcccatg gacaaaaaag gccctgggtg tccctcgcaa agggaaaacg tgggctgcgc 4861 cccaaaagtg ggcacaagtc acgcctcccg tctgaggccg caagtgtcta cctcatcgcc 4921 aggggatagc aacacacaca cacacacaca cacacacaca cacacacaca cacacacaca 4981 ccgcaacgcc acacactgtg tgtcccgag //