Path: utzoo!attcan!uunet!shelby!rutgers!bionet!root
From: GenBank-Updates@genbank.bio.net
Newsgroups: bionet.molbio.genbank.updates
Subject: Database Update
Message-ID:
Date: 24 Jul 90 12:00:28 GMT
Sender: root@genbank.BIO.NET
Distribution: bionet
Lines: 7298
Approved: lear@genbank.bio.net
Checksum: 19767 452

LOCUS       HUMHPRTB    56736 bp ds-DNA    PRI    24-JUL-1990
DEFINITION  Human hypoxanthine phosphoribosyltransferase (HPRT) gene, complete
            cds.
ACCESSION   M26434
KEYWORDS    Alu repetitive element; LINE repetitive element; hypoxanthine
            phosphoribosyltransferase.
SOURCE      Human cell line 4X,Y (GM1202), fetal liver, and beta thalassemia
            patient DNA, clones pAE[23,28], Hu-lambda-[2,3,14] and AE29.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 56736)
  AUTHORS   Edwards,A., Voss,H., Rice,P., Civitello,A., Stegemann,J.,
            Schwager,C., Zimmermann,J., Erfle,H., Caskey,C.T. and Ansorge,W.
  TITLE     Automated DNA sequencing of the human HPRT locus
  JOURNAL   Genomics 6, 593-608 (1990)
  STANDARD  full staff_entry
REFERENCE   2  (sites; mutations causing gout)
  AUTHORS   Argos,P., Hanei,M., Wilson,J.M. and Kelley,W.N.
  TITLE     A possible nucleotide-binding domain in the tertiary fold of
            phosphoribosyltransferase
  JOURNAL   J. Biol. Chem. 258, 6450-6457 (1983)
  STANDARD  full staff_entry
REFERENCE   3  (sites; mutations causing gout)
  AUTHORS   Cariello,N.F., Scott,J.K., Kat,A.G., Thilly,W.G. and Keohavong,P.
  TITLE     Resolution of a missense mutant in human genomic DNA by denaturing
            gradient gel electrophoresis and direct sequencing using in vitro
            DNA amplification: HPRT-Munich
  JOURNAL   Am. J. Hum. Genet. 42, 726-734 (1988)
  STANDARD  full staff_entry
REFERENCE   4  (sites; mutations causing gout)
  AUTHORS   Davidson,B.L., Chin,S.J., Wilson,J.M., Kelley,W.N. and Palella,T.D.
  TITLE     Hypoxanthine-guanine phosphoribosyltransferase. Genetic evidence
            for identical mutations in two partially deficient subjects
  JOURNAL   J. Clin. Invest. 82, 2164-2167 (1988)
  STANDARD  full staff_entry
REFERENCE   5  (sites; mutations causing Lesch-Nyhan syndrome)
  AUTHORS   Davidson,B.L., Palella,T.D. and Kelley,W.N.
  TITLE     Human hypoxanthine-guanine phosphoribosyltransferase: A single
            nucleotide substitution in cDNA clones isolated from a patient
            with Lesch-Nyhan syndrome (HPRT-Midland)
  JOURNAL   Gene 68, 85-91 (1988)
  STANDARD  full staff_entry
REFERENCE   6  (sites; mutations causing Lesch-Nyhan syndrome)
  AUTHORS   Davidson,B.L., Pashmforoush,M., Kelley,W.N. and Palella,T.D.
  TITLE     Genetic basis of hypoxanthine guanine phosphoribosyltransferase
            deficiency in a patient with Lesch-Nyhan syndrome (HPRT-Flint)
  JOURNAL   Gene 63, 331-336 (1988)
  STANDARD  full staff_entry
REFERENCE   7  (sites; mutations causing Lesch-Nyhan syndrome and gout)
  AUTHORS   Davidson,B.L., Pashmforoush,M., Kelley,W.N. and Palella,T.D.
  TITLE     Human hypoxanthine-guanine phosphoribosyltransferase deficiency.
            The molecular defect in a patient with gout (HPRT-Ashville)
  JOURNAL   J. Biol. Chem. 264, 520-525 (1989)
  STANDARD  full staff_entry
REFERENCE   8  (sites; mutations causing gout)
  AUTHORS   Davidson,B.L., Tarle,S.A., Palella,T.D. and Kelley,W.N.
  TITLE     Molecular basis of hypoxanthine-guanine phosphoribosyltransferase
            deficiency in ten subjects determined by direct sequencing of
            amplified transcripts
  JOURNAL   J. Clin. Invest. 84, 342-346 (1989)
  STANDARD  full staff_entry
REFERENCE   9  (sites; mutations causing gout)
  AUTHORS   Dush,M.K., Sikela,J.M., Khan,S.A., Tischfield,J.A. and
            Stambrook,P.J.
  TITLE     Nucleotide sequence and organization of the mouse adenine
            phosphoribosyltransferase gene: Presence of a coding region common
            to animal and bacterial phosphoribosyltransferases that has a
            variable intron/exon arrangement
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 82, 2731-2735 (1985)
  STANDARD  full staff_entry
REFERENCE   10 (sites; mutations causing Lesch-Nyhan syndrome)
  AUTHORS   Fujimori,S., Davidson,B.L., Kelley,W.N. and Palella,T.D.
  TITLE     Identification of a single nucleotide change in the
            hypoxanthine-guanine phosphoribosyltransferase gene (HPRT-Yale)
            responsible for Lesch-Nyhan syndrome
  JOURNAL   J. Clin. Invest. 83, 11-13 (1989)
  STANDARD  full staff_entry
REFERENCE   11 (sites; mutations causing gout)
  AUTHORS   Fujimori,S., Hidaka,Y., Davidson,B.L., Palella,T.D. and
            Kelley,W.N.
  TITLE     Identification of a single nucleotide change in a mutant gene for
            hypoxanthine-guanine phosphoribosyltransferase (HPRT-Ann Arbor)
  JOURNAL   Hum. Genet. 79, 39-43 (1988)
  STANDARD  full staff_entry
REFERENCE   12 (sites; mutations causing Lesch-Nyhan syndrome)
  AUTHORS   Gibbs,R.A., Nguyen,P.N., Edwards,A., Civitello,A.B. and
            Caskey,C.T.
  TITLE     Multiplex DNA deletion detection and exon sequencing of the
            hypoxanthine phosphoribosyltransferase gene in Lesch-Nyhan
            families
  JOURNAL   Genomics 7, 235-244 (1990)
  STANDARD  full staff_entry
REFERENCE   13 (sites; mutations causing Lesch-Nyhan syndrome and gout)
  AUTHORS   Gibbs,R.A., Nguyen,P.N., McBride,L.J., Koepf,S.M. and Caskey,C.T.
  TITLE     Identification of mutations leading to the Lesch-Nyhan syndrome by
            automated direct DNA sequencing of in vitro amplified cDNA
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 86, 1919-1923 (1989)
  STANDARD  full staff_entry
REFERENCE   14 (sites; mutations causing gout)
  AUTHORS   Hershey,H.V. and Taylor,M.W.
  TITLE     Nucleotide sequence and deduced amino acid sequence of Escherichia
            coli adenine phosphoribosyltransferase and comparison with other
            analogous enzymes
  JOURNAL   Gene 43, 287-293 (1986)
  STANDARD  full staff_entry
REFERENCE   15 (sites; conserved domain)
  AUTHORS   King,A. and Melton,D.W.
  TITLE     Characterisation of cDNA clones for hypoxanthine-guanine
            phosphoribosyltransferase from the human malarial parasite,
            Plasmodium falciparum: Comparisons to the mammalian gene and
            protein
  JOURNAL   Nucleic Acids Res. 15, 10469-10481 (1987)
  STANDARD  full staff_entry
REFERENCE   16 (sites; mutations causing Lesch-Nyhan syndrome)
  AUTHORS   Ogasawara,N., Stout,J.T., Goto,H., Sonta,S.I., Matsumoto,A. and
            Caskey,C.T.
  TITLE     Molecular analysis of a female Lesch-Nyhan patient
  JOURNAL   J. Clin. Invest. 84, 1024-1027 (1989)
  STANDARD  full staff_entry
REFERENCE   17 (sites; mutations causing Lesch-Nyhan syndrome)
  AUTHORS   Stout,J.T. and Caskey,C.T.
  TITLE     HPRT: Gene structure, expression, and mutation
  JOURNAL   Annu. Rev. Genet. 19, 127-148 (1985)
  STANDARD  full staff_entry
REFERENCE   18 (sites;
  AUTHORS   Wilson,J.M. and Kelley,W.N.
  TITLE     Molecular basis of hypoxanthine-guanine phosphoribosyltransferase
            deficiency in a patient with the Lesch-Nyhan syndrome
  JOURNAL   J. Clin. Invest. 71, 1331-1335 (1983)
  STANDARD  full staff_entry
REFERENCE   19 (sites; mutations causing Lesch-Nyhan syndrome and gout)
  AUTHORS   Wilson,J.M. and Kelley,W.N.
  TITLE     Human hypoxanthine-guanine phosphoribosyltransferase. Structural
            alteration in a dysfunctional enzyme variant (HPRT-Munich)
            isolated from a patient with gout
  JOURNAL   J. Biol. Chem. 259, 27-30 (1984)
  STANDARD  full staff_entry
REFERENCE   20 (sites; mutations causing gout)
  AUTHORS   Wilson,J.M., Kobayashi,R., Fox,I.H. and Kelley,W.N.
  TITLE     Human hypoxanthine-guanine phosphoribosyltransferase. Molecular
            abnormality in a mutant form of the enzyme (HPRT-Toronto)
  JOURNAL   J. Biol. Chem. 258, 6458-6460 (1983)
  STANDARD  full staff_entry
REFERENCE   21 (sites; mutations causing gout)
  AUTHORS   Wilson,J.M., Tarr,G.E. and Kelley,W.N.
  TITLE     Human hypoxanthine (guanine) phosphoribosyltransferase: An amino
            acid substitution in a mutant form of the enzyme isolated from a
            patient with gout
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 80, 870-873 (1983)
  STANDARD  full staff_entry
REFERENCE   22 (sites; mutations causing Lesch-Nyhan syndrome)
  AUTHORS   Yang,T.P., Patel,P.I., Chinault,A.C., Stout,J.T., Jackson,L.G.,
            Hildebrand,B.M. and Caskey,C.T.
  TITLE     Molecular evidence for new mutation at the hprt locus in
            Lesch-Nyhan patients
  JOURNAL   Nature 310, 412-414 (1984)
  STANDARD  full staff_entry
REFERENCE   23 (sites; mutant reversion)
  AUTHORS   Yang,T.P., Stout,J.T., Konecki,D.S., Patel,P.I., Alford,R.L. and
            Caskey,C.T.
  TITLE     Spontaneous reversion of novel Lesch-Nyhan mutation by HPRT gene
            rearrangement
  JOURNAL   Somat. Cell Mol. Genet. 14, 293-303 (1988)
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by A.Edwards, 26-JUL-1989.

            Mutant               Description
            RJK 1780             missing intron 1 (partial) and exon 2 [12]
            RJK 849              missing intron 3 (partial) and exons 4-9 (no
                                 mRNA) [22]
            RJK 984              missing intron 5 (partial) and exons 6-9 (no
                                 mRNA) [17] [22]
            GM 3467              missing intron 8 (partial) and exon 9 (no
                                 mRNA) [12] [22]
            RJK 853              complete gene deletion [12] [22] [16]
            GM 2227              inversion of exons 6-9 (no mRNA) [23]
            GM 1662 and GM 6804  duplication of exons 2 and 3 and elongated
                                 mRNA [22] [23]
            Connersville         missing intron 7 (partial) and exon 8 [7]
FEATURES         from  to/span     description
    pept         1677     1703     hypoxanthine phosphoribosyltransferase, exon 1
                                   /nomgen="HPRT" /map="Xq26"
                                   /hgml_locus_uid="LE0031K"
                14780    14886     hypoxanthine phosphoribosyltransferase, exon 2
                16603    16786     hypoxanthine phosphoribosyltransferase, exon 3
                27891    27956     hypoxanthine phosphoribosyltransferase, exon 4
                31617    31634     hypoxanthine phosphoribosyltransferase, exon 5
                34937    35019     hypoxanthine phosphoribosyltransferase, exon 6
                39815    39861     hypoxanthine phosphoribosyltransferase, exon 7
                40033    40109     hypoxanthine phosphoribosyltransferase, exon 8
                41454    41501     hypoxanthine phosphoribosyltransferase, exon 9
    IVS          1704    14779     HPRT intron A
    IVS         14887    16602     HPRT intron B
    IVS         16787    27890     HPRT intron C
    IVS         27957    31616     HPRT intron D
    IVS         31635    34936     HPRT intron E
    IVS         35020    39814     HPRT intron F
    IVS         39862    40032     HPRT intron G
    IVS         40110    41453     HPRT intron H
    rpt           258       41 (c) Alu repeat copy A
    rpt          1012      694 (c) Alu repeat copy B
    rpt          3532     3342 (c) Alu repeat copy C
    rpt          3982     3343 (c) Alu repeat copy D
    rpt          6232     5937 (c) Alu repeat copy E
    rpt          6524     6288 (c) Alu repeat copy F
    rpt          7504     7807     Alu repeat copy G
    rpt          8713     8408 (c) Alu repeat copy H
    rpt          9477     9169 (c) Alu repeat copy I
    rpt          9699    10142     Alu repeat copy J
    rpt         10407    10293 (c) Alu repeat copy K
    rpt         12154    12310     Alu repeat copy L
    rpt         13302    12998 (c) Alu repeat copy M
    rpt         13638    13314 (c) Alu repeat copy N
    rpt         14238    13923 (c) Alu repeat copy O
    rpt         14610    14239 (c) Alu repeat copy P
    rpt         16090    15843 (c) Alu repeat copy Q
    rpt         16518    16293 (c) Alu repeat copy R
    rpt         18783    18439 (c) Alu repeat copy S
    rpt         18962    19084     Alu repeat copy T
    rpt         19397    19104 (c) Alu repeat copy U
    rpt         20064    19778 (c) Alu repeat copy V
    rpt         20486    20636     Alu repeat copy W
    rpt         20990    20696 (c) Alu repeat copy X
    rpt         21293    20996 (c) Alu repeat copy Y
    rpt         22171    21878 (c) Alu repeat copy Z
    rpt         23772    23460 (c) Alu repeat copy AA
    rpt         25310    25426     Alu repeat copy AB
    rpt         26388    26279 (c) Alu repeat copy AC
    rpt         29863    29560 (c) Alu repeat copy AD
    rpt         33618    33148 (c) Alu repeat copy AE
    rpt         36575    36645     Alu repeat copy AF
    rpt         38613    39039     Alu repeat copy AG
    rpt         39091    39388     Alu repeat copy AH
    rpt         40622    40925     Alu repeat copy AI
    rpt         40952    41262     Alu repeat copy AJ
    rpt         42640    42936     Alu repeat copy AK
    rpt         44776    44486 (c) Alu repeat copy AL
    rpt         46627    46305 (c) Alu repeat copy AM
    rpt         47520    47763     Alu repeat copy AN
    rpt         49257    49551     Alu repeat copy AO
    rpt         49705    49993     Alu repeat copy AP
    rpt         50002    50312     Alu repeat copy AQ
    rpt         51066    50774 (c) Alu repeat copy AR
    rpt         52276    51978 (c) Alu repeat copy AS
    rpt         52407    52674     Alu repeat copy AT
    rpt         54653    54551 (c) Alu repeat copy AU
    rpt         55465    55164 (c) Alu repeat copy AV
    rpt         56523    56689     Alu repeat copy AW
    rpt         11312    11254 (c) LINE repeat copy A (partial)
    rpt         18284    18242 (c) LINE repeat copy B (partial)
    rpt         27091    26843 (c) LINE repeat copy C (partial)
    rpt         37558    37300 (c) LINE repeat copy D (partial)
    rpt         39451    39427 (c) LINE repeat copy E (partial)
    rpt         51317    51100 (c) LINE repeat copy F (partial)
    rpt           261      321     short interspersed repeat copy A
    rpt          1013     1054     short interspersed repeat copy B
    rpt          2902     2869 (c) short interspersed repeat copy C
    rpt          3078     3107     short interspersed repeat copy D
    rpt          4228     4254     short interspersed repeat copy E
    rpt          4463     4494     short interspersed repeat copy F
    rpt         21582    21635     short interspersed repeat copy G
    rpt         28430    28394 (c) short interspersed repeat copy H
    rpt         28923    28981     short interspersed repeat copy I
    rpt         35562    35604     short interspersed repeat copy J
    rpt         44042    44108     short interspersed repeat copy K
    rpt         46628    46688     short interspersed repeat copy L
    rpt         51715    51687 (c) short interspersed repeat copy M
    unsure      17479    17479     polymorphism or cloning error
    unsure      24903    24903     polymorphism or cloning error
    mut          1664     1678     ggccggctccgttat in wt; gt in gout RJK 951 [13]
    mut         14778    14778     a in wt; t in Lesch-Nyhan RJK 1760, no exon 2 [12]
    mut         14808    14809     tt in wt; ttt in Lesch-Nyhan Chicago [7]
    mut         14874    14874     t in wt; c in Lesch-Nyhan Detroit Leu->Pro [7]
    mut         14877    14879     tta in wt; ta in Lesch-Nyhan RJK 1939 [12]
    mut         14886    14886     g in wt; a in Lesch-Nyhan RJK 2163 Arg->Lys [12]
    mut         16619    16619     c in wt; g in gout Toronto Arg->Gly [20]
    mut         16623    16664     atgtgatgaaggagatgggaggccatcacattgtagccctct in wt;
                                   at in Lesch-Nyhan RJK 2108 [12]
    mut         16675    16676     ag in wt; agg in Lesch-Nyhan RJK 866 [13]
    mut         16677    16677     g in wt; a in Lesch-Nyhan New Haven Gly->Glu [7]
    mut         16679    16679     g in wt; c in Lesch-Nyhan Yale Gly->Arg [10]
    mut         16690    16690     c in wt; a in Lesch-Nyhan Flint RJK 892 Phe->Leu
                                   [5] [13]
    mut         16707    16707     a in wt; t in gout Arlingen Asp->Val [7]
    mut         16756    16759     tgta in wt; ta in Lesch-Nyhan RJK 1332 [13]
    mut         16780    16780     c in wt; a in gout Munich Ser->Arg [3] [19]
    mut         16784    16787     tgtg in wt; tg in Lesch-Nyhan RJK 1747 [12]
    mut         27897    27897     c in wt; t in Lesch-Nyhan RJK 1930 Gln->TAG [12]
    mut         27911    27911     c in wt; t in gout London Ser->Leu [2] [4] [12]
                                   [14]
    mut         31621    31621     t in wt; a in Lesch-Nyhan Midland RJK 896
                                   Val->Asp [6] [13]
    mut         31622    31624     ctt in wt; ct in Lesch-Nyhan RJK 2019 [12]
    mut         31624    31624     t in wt; c in Lesch-Nyhan RJK 1784 Leu->Ser [13]
    mut         31628    31628     t in wt; g in gout Ann Arbor Ile->Met [11]
    mut         34962    34970     tgcag in wt; agcaaagcaa in Lesch-Nyhan RJK 1210
                                   Met->Lys [13]
    mut         35015    35015     g in wt; t in gout Milwaukee RJK 949 Ala->Ser
                                   [7] [13]
    mut         39837    39837     c in wt; t in Lesch-Nyhan RJK 974 Arg->TGA [13]
    mut         39842    39846     tgttg in wt; tg in Lesch-Nyhan RJK 1894 [12]
    mut         39858    39858     g in wt; t in Lesch-Nyhan RJK 2185 Asp->Tyr [12]
    mut         39866    39866     g in wt; a in Lesch-Nyhan RJK 1934, no exon 7 [12]
    mut         40034    40038     tgttg in wt; tg in Lesch-Nyhan Michigan RJK 855
                                   [7] [13]
    mut         40080    40080     g in wt; a in Lesch-Nyhan Kingston RJK 2188
                                   Asp->Asn [12] [19]
    mut         40095    40095     t in wt; g in Lesch-Nyhan New Briton RJK 950
                                   Phe->Val [7] [13]
    mut         40102    40102     a in wt; g in gout Ashville Asp->Gly [8]
    mut         40114    40114     g in wt; a in Lesch-Nyhan RJK 888, GM 7092, no
                                   exon 8 [12]
    mut         41486    41509     aaaatacaaagcctaagatgagag in wt; ag in Lesch-Nyhan
                                   Evansville RJK 894 [7] [12]
    mut         41450    41452     ata in wt; ttt in Lesch-Nyhan RJK 906, GM 1899,
                                   downstream cryptic splice site used
    mut         41454    41454     c in wt; g in Lesch-Nyhan RJK 1874, RJK 2079,
                                   His->Asp [12] [13]
    mut         41461    41461     g in wt; a in Lesch-Nyhan RJK 1727 Cys->Tyr [13]
BASE COUNT    15689 a  11281 c  11599 g  18167 t
ORIGIN      1 bp upstream of EcoRI site; chromosome Xq26.
1 gaattctcgt aaaactcttc atggcagtag ttattattct ctctctctct ctttttcttt 61 tttcttgaga caggatattt ctctgttgcc caggctggag tgcagtggca cagtcttggc 121 tcactgcagc ctggacctcc tgggctcaag ccatcctccc acctcagcct cccaagtagc 181 tggggctaca ggcacatggc caccaggcca gataattttt catttttgta gagactgagt 241 ctcaccatgt tacccaggtt tattattctc attttttaga tgaagagact gaggtccaga 301 gaagctcaat gacttgccta gttttacaaa tctcctgcca tcacataccc ctcagcgtcc 361 ttaataagag ggaggccacc aactatgtgc tgggcactgt ggtggatgct ggagctatag 421 ggttgagtat ataagaaatg gtgttgctgg agcaactgtt gcttgcttac ctgacctatc 481 tgagaattaa ttagcagggg aacatatttt tgttttcaga ttcaatataa gaacttgtgt 541 gggcaaaaat aaagatcagt agtaataaca gtagttccca tttgctgact gtactgtcct 601 aagtgcatat atatatacat acacacacgc atacctatac tcctctaata ctcaaaatga 661 tcctgtttat gtattgttaa tatgctcatt ttatttttaa atttttattt atttttattt 721 ttatttattt ttgagacgga gtctcattct gtcgcggagg ctgaagtgca gtggtgcgat 781 ctcagctcag tgcgacctcc gcctcccggg ttcaagtgat tctcctgcct cagctccgga 841 ctagctggga ttacaggcgc ccgcctccac gcccagctaa tttttgtatt tttagtagag 901 atggggtttc gccatgttgg ccaggctggt ctcgtactcc tgaccttgag tgatccacct 961 gcctcggcct cccaaagtgc tgggattaca ggcatgagcc accgcgccgg gctaatatgc 1021 tcattttagt gaggcaaaaa tagaggctca gagtctgatt tgtacaaaac tacagagcag 1081 ttaagtgtcc tctcagatgt gtaccctgat ctgggtgact ctaggactct aggtctcaac 1141 tgttacaacc agttaagggt ttggggaagc actgggccaa gagtcaggaa aatggaagcc 1201 acaggtagtg caaggtcttg ggaatgggac gtctggtcca aggattcacg cgatgactgg 1261 aacccgaaga gccggggccc ggtttacggc cgccatgaag caacgcgcgc cggtaggttt 1321 gggaatcagg gagccctctg aataggagac tgagttggga gggaaagggg cttcgctggg 1381 ggagcctcgg cttcttctgg gagaaaattc ccacggctac ctagtgagcc tgcaaactgg 1441 taggcgccgg cgtaggcgcg cgggcggggc cgggggcggg gcctgcgggg cgtggcgggg 1501 cgggcagagg gcggggcctg cttctcctca gcttcaggcg gctgcgacga gccctcaggc 1561 gaacctctcg gctttcccgc gcggcgccgc ctcttgctgc gcctccgcct cctcctctgc 1621 tccgccaccg gcttcctcct cctgagcagt cagcccgcgc gccggccggc tccgttatgg 1681 cgacccgcag ccctggcgtc 
gtggtgagca gctcggcctg ccggccctgg ccggttcagg 1741 cccacgcggc aggtggcggc cgggccctga ggcgcgggat ccgcagtgcg ggctcgggcg 1801 gccgggccca gggaaccccg caggcggggg cggccagttt cccgggttcg gctttacgtc 1861 acgcgagggc ggcagggagg acggaatggc ggggtttggg gtgggtccct cctcggggga 1921 gccctgggaa aagaggactg cgtgtgggaa gagaaggtgg aaatggcgtt ttggttgaca 1981 tgtgccgcct gcgagcgtgc tgcggggagg ggccgagggc agattcggga atgatggcgc 2041 ggggtggggg cgtgggggct ttctcgggag aggcccttcc ctggaagttt ggggtgcgat 2101 ggtgaggttc tcggggcacc tctggagggg cctcggcacg gaaagcgacc acctgggagg 2161 gcgtgtgggg accaggtttt gcctttagtt ttgcacacac tgtagttcat ctttatggag 2221 atgctcatgg cctcattgaa gccccactac agctctggta gcggtaacca tgcgtatttg 2281 acacacgaag gaactaggga aaaggcatta ggtcatttca agccgaaatt cacatgtgct 2341 agaatccaga ttccatgctg accgatgccc caggatatag aaaatgagaa tctggtcctt 2401 accttcaaga acattcttaa ccgtaatcag cctctggtat cttagctcca ccctcactgg 2461 ttttttcttg tttgttgaac cggccaagct gctggcctcc ctcctcaacc gttctgatca 2521 tgcttgctaa aatagtcaaa accccggcca gttaaatatg ctttagcctg ctttattatg 2581 attatttttg ttgttttggc aatgacctgg ttacctgttg tttctcccac taaaactttt 2641 taagggcagg aatcaccgcc gtaactctag cacttagcac agtacttggc ttgtaagagg 2701 tcctcgatga tggtttgttg aatgaataca ttaaataatt aaccacttga accctaagaa 2761 agaagcgatt ctatttcata ttaggcattg taatgactta aggtaaagag cagtgctatt 2821 aacggagtct aactgggaat ccagcttgtt tgggctattt actagttgtg tggctgtggg 2881 caacttactt cacctctctg ggcttaagtc attttatgta tatctgaggt gctggctacc 2941 tcttggagtt attgagagga ttataagaca gtctatgtga atcagcaacc cttgcatggc 3001 ccctggcggg gaacagtaat aatagccatc atcatgttta cttacatagt cctaattagt 3061 cttcaaaaca gccctgtagc aatggtatga ttattaccat tttacagatg aggaaccttt 3121 gaagcctcag agaggctaac agacataccc taggtcatac agttattaag agaaggagct 3181 ctgtctcgaa cctagctctc tctctctcga gtaataccag ttaaaaaata ggctacaaat 3241 aggtactcaa aaaaatggta gtggctgttg tttttattca gttgctgagg aaaaaatgtt 3301 gatttttcat ctctaaacat caacttactt aattctgcca atttcttttt tttgagacag 3361 ggtctcactc tgtcacctag gatggagtgc 
agtggcacaa tcactgctca ctgcagcctc 3421 gacttcccgg gctcgggtga ttctccccag gctcagggga ttctcccact tcagcctccc 3481 aagtagctgg gactacaggt gcgcaccacc atccctggct aatatttgta ctttatttta 3541 tttatttatt tatttatttt ttgagatgga gtttcgctct tgttgcccgg gctggagtac 3601 agtggcatga tctcggctca gtgcaacctc tgcctcccgg gttcaagcga ttctcctacc 3661 tcatccccct gagtagctgg gattacaggc gcctgccacc atgcctggct aattttttgt 3721 atttttaata gagacgaggt ttcaccatgt tggccaggct actctcgaac tcctgatctc 3781 aggtgatcca cccgccttgg cctcccaaag tgctgggatt acaggcgtga gccactgcgc 3841 ccggcctaat atttgtattt tttgtagaga tggtgttttg ccatgttgtc caggctggtc 3901 ttgaactcct gagctcaagc gatctgcccg cctctgcttc ccaaagtgct gggattacag 3961 gcatgagcca ccgtgcctgg cctaggtaga cgcttttagc tttggggtgt gatgcctgcc 4021 ccagtatata gtgaatttaa ttattgctag agctggctgt ttgttagttt tctttgaaca 4081 taagatactc attgttttta gtttgcaaat ccctcttcct ttttaaaaaa tttctttccc 4141 ttaaattgtt tgcatgttag caataacaaa tgcttaaatg gtgctatgtg ctagatactc 4201 ttctaagccc tgttatgtat attaactaat tttttaaatt acacaaatca gagaggttaa 4261 gtaacttgcc caagattacc caacaatact aggatttgaa cctaagtttg tctcacccca 4321 gattctgctc ttaatctcta aacttttaag ttagtagtga caatagtagg tatttattga 4381 atacttaact atgttttagg cgttgaagta aatattttgc aggcattatc taatgtaaac 4441 accctaaagt tacataacag gtacccttta ggtaaataaa cactagtatg accttggagg 4501 cacagatagt tgaagtaact tgcccaatat cacttacatg aaattggccc tcaaatgtgt 4561 ctgatacaac ccatgctgct tgtaactatc gttttaaact gccagggtaa acttggacac 4621 acttgagcta agaaaaagct tttagatttt tgcaaattaa tgtgaaagat atgctttatg 4681 tggatataat atcttctaaa tttcggggat ggtagtccta gaaatgtaat cctgccctag 4741 ccgagcttac cctgccaata attttttaca gaattggtaa aacggagcac cttttttttg 4801 tccttggcca cactgttatc aacagggtgt agattgacat caatctgtag gtgtaaacca 4861 gaattactct ttgtgaccac caggaaatag agcagttcag ttcaggggtt tctttctgtg 4921 aatttagcac tgtgacctgc atactacaag tctactttgt tttctatcca ttgtttgtat 4981 ctgggtattg caaaaggtag gaaaaggacc aaccagatca gcagagaaga gttgccttgg 5041 agttttcttt tagttttctg cagttcatta gatagtaact 
aggccatgtc attttactcc 5101 cttgtagtga agatatgttg aagttgtact ggtatactct tctacctttc tgtaatttta 5161 tattgtgtag acttgataaa atttatgtgt caatcaccac cattaatatc aatattgagc 5221 ctcaattctt atttttctgc ccagtggctg ccaaattact aacatttaca ataattcact 5281 actactaaga taatctacta gttcgatcac atacttcaaa ttgttatgga actactgtct 5341 tcagcattgt gcttctgata actgataagt ataatttttt ttttgtccag agtgaacatg 5401 tctattcttc cactgtacac actaataaaa ggaaaaattg taatattggg taaattcatg 5461 tccttacaca tgtagtagtt atgagcccat gtccctagaa tgagtaataa tttatccctc 5521 ccttggttga atagtcaaga atgctgattt taattcttct aacagcttta tccctcagaa 5581 gggaaggcaa gcaagttata tatgtagttt atttgtaaga ctgatatgaa attggaagat 5641 gaatctacta ttagctttaa ttatttttac atttaggaat attgcatcag taactcataa 5701 ttttggtttt ctgttatcct gagttaacac aaattatcca aggagatggc ggatcatctg 5761 ctttgaggtg tttttttttg agaattttaa tgtatctgaa tataaaaggt aaaaatatgc 5821 caactagcaa tttctgccca ttccagaagt ttggaaatat tactcattac taggaattaa 5881 ataaaatatg gtttatctat tgttatacct cttttaattc acatagctca tttttatctt 5941 ttatttttgt ttgttttttt tgagatggag tcttgctctg tcaccaggca ggagtgcagt 6001 gatgcaaatc tcggctcact ctagccaccg actccctggt tcaagcgatt ctcctgcctg 6061 agccttctga gtagctggga ttacaggcag gcaccaccac gcccagctaa tttttgtaga 6121 gacaggattt caccgtgttg gccaggatgg tctccatctc ctgacctcat gatctgcctg 6181 cttcggcctc ccaaagtgct gggattacag gtgggagcca ctacgcctgg cccacatagc 6241 tcatttttag actcacttcc attaagtctt gtttggaccc acgaacattg tctttttttt 6301 tttaagatgg agtttcactt ttgttgccca gactgtagtg caatggtgca atctcagctc 6361 actgcaatct ctgcctcctg ggttctagca attctcctgc ctcagcctcc cgagtagctg 6421 gaattacagg cgcccgccac cacgcccagc taatttttgt gtttttagta gagacggggt 6481 ttcaccatgt tgggcaggcc aggggtgatc cgcccacctc agcctcccaa agtgctggga 6541 ttacaggtgt gagccaccgc atctggccaa catgtctttt tttttttttt cctttttaac 6601 cacaaagaga cttaagcagt ccttgtcaca gatgatgaat tgatgttgca agtattgtct 6661 tagcttggat taattttctt gcttactgta attttagata atatagcttt gtaattagag 6721 attttatgtg taaaccacaa aaatgtttac atgaaggcca ttattacaga 
tgtgacgtgc 6781 ataattatta gtaatttgta tgtttacatg ggtcagtctg gcaaaaaatt atgaagtttt 6841 aaaaattaaa aaaaattata atgccagttt tactggaaag taaaattatt tcagtaatcg 6901 attatagcaa aagtattgat tttcattcca gacaaaagtc agaatgaaag gtaatttctc 6961 aatactcttt cagattaata aaagtacctg tagcgatttt tatcattcac aagtatatca 7021 caagtaagtt agaatttgag aactgtgttc tagatctctg aggagatgca gtcagatttc 7081 tgaactgtct cagcaaatgg taagtaactt agagctagta attaataacc tgtcctttga 7141 tttctgattc agccaagaat ggccatattt gggaaaggca gatctggaga gtaaccacgt 7201 tttcattcat ttaccacttc taggcccctc cagagctctc agatattttg gggttgagcc 7261 cttccccaaa gccatacagg accttttttt tgtgatctgt tctagccatt tttatgttgg 7321 gtgcttgtta tggactgagc atttatgtcc tcccacaccc cccccatacc ttttttgaag 7381 tcctaacccc cagtgtgatg gtatttggag acagggcctt tggaaggtaa ttacagttag 7441 aagaagtcgg gagggttggg cccaggtctg attggattag tgcccttata tgaaaagaca 7501 ccaggacggg cgcagtggct cacacctgta atcccagcac tttgggaggc caaggtgggt 7561 ggatcacgag gtcaggagtt tgagaccagc ctggccaatg tagtgaaaca ccatctctac 7621 taaaaataca aaaattagct gggtgtggta gcgggctcct gtcatccaag ctactcggga 7681 gggtgaggca tgagaatcac ttgaacccgg gagttggagg ttgcagtgag cccagattgt 7741 gccactgtac tccagcctgg gtgacagagt gagactctgt ctcaaaaaag aaaaaaaaaa 7801 aaaaagagac accagagagc ttgttagaag aggtcatgtg agcacacagt tagaagacct 7861 tcaagccaaa gaagaggcct gagattgaaa cctaccttgc aggtacctta attttggact 7921 tcccagcctc caaaactgtg agaaataagt ttctgttaag tcactcagtc tgtggtattt 7981 tgttatggca gcctgagcag gtagttgttc tttcagaagg tgttgataat aaccacatgc 8041 aacaccaagt cacaaataat aaaacagatg taacttatat tcatacagaa agttgggcac 8101 tgccattgcc ttgttggttt acacggctgt gctagttcag tagcagaaag gtgctggtct 8161 cctttactca gtttacaatc taggcagtag aatgtaatca ctgctttaaa cttgatactg 8221 cttagggaga gaatcattgg tgctgggtaa ctttgggttc taggtttact ttttgtgtat 8281 atataactgt ttttggtaaa tcacaagttt ctgggcttgt cgaattagat tttgttacag 8341 attatgagct ttattatgct atacagttag ttgtatgtat atatgccttt cccactagat 8401 tttaagcttt tttttttttt ttttttttgt gacggagtct tgctcttgtc gcccaggctg 
8461 aagtggagtg cagtggcaca atctcggctc actgcagcct ccacctccta ggttcaagcg 8521 attctcctgc ctcggcctcc caagtaactg ggactacagg cacgtgccac cacacccggc 8581 taatttttgt attttttgta gagacagggt ttcgccatgt tggctaggct ggtcttgaac 8641 ttctggcctc aggtgatcca cccgcctcag cctcccaaag tgctgggatt tacaggcatg 8701 agccaccacg cccagctata gctctttaag ggttgtaaat ttataatcat tcttttactc 8761 tcctgcaaat tctgttgcac actgccttaa tcaaggtaga tgctgaatgc atttttgtat 8821 aattgaatat gttgcaatcc ccaactctct ccaactgttc ctgtcaaagc agccactgga 8881 ttgttaacta atccatatta gatggggtta attaatatca gatgggacaa gtaagggcta 8941 ataagattat aggccaccaa gtagatttct gtctagctct tatagagatt gagtttattg 9001 gacctgtttg ataggaagtt ttggtgtttg ggatgattaa aactgaagtt cctatttatt 9061 gaattatacc tatttatatt atttcatatc agtggtccac atgcaagtga ggcttctgag 9121 acagagtttg agttctctct tcaactacca taacacttaa cctgtatctt tttttttttt 9181 ttttttttta gacaggagtc tcgctctgtc actcaggctg gagtgtagtg gtatgatctc 9241 ggctcactgt aacctctgcc tcctggattc aagcagttct ccatgtctca gcctccctag 9301 tagctgggat tacaggcctg tgccaccatg cctggctaat tttttttttg tatttttagt 9361 agagacgggg ttttaccacg ttggccaggc tggtctcgaa ctcttgacct cgagcgatca 9421 acttgccttg gcctcccaaa gtgctgggat tacaggcatg agccacagcg cccagccgtc 9481 ttttttttta aatagcaatt taacactgtt cacagttact catgtacatg tcatgccatc 9541 tattacactg taagttctgt gagggtagct gtatcaaatt tatctaactc tctctagtat 9601 gcatgacata gtaagtattc aataaatatt tgcatattag tgataaggat acaggttctg 9661 aatagtgggt ccttaccatt taagaattag tatttgatgg ccgggcgggg tggctcacgc 9721 ctgtaatccc agcactttgg gaggctgagg cgggcggatc atgagatcag gagatcgaga 9781 ccatcctggc taacatggtg aaatcccgtc tttacaaaaa aaatacaaaa gaattaacca 9841 agtgtggtgg tgggtgcctg tagtcccagc tactgctttg tgaggctgag gcaggcagat 9901 cacctgaggt gggaaattca agaccagcct gaccaacatg gagaaacccc atctctacta 9961 aaaatacaaa attagccggg cgtggtggcg catgtctgta atcccagcta ctcgggaggc 10021 tgaggcagga gaatggcgtg aacccgggag gcggagcttg cagtgagcca ggatcgcgcc 10081 actgcactcc agcctgggcg acagagcgag actccgtctc aaaaaaaaaa aaaaaaaaaa 10141 
aattagtatt tgatatttga tcattaaata tgaattaaga ggacttagac tttttgttaa 10201 atgtcaagct gggaaaagtt gtcatttaaa tgaattgcct cttatttaat ttcgtctgat 10261 gatacatttt gtttttattt tgtaaaaaat tatttttttt ctttttggag acagggtctt 10321 gctctgttgc ccaggctggt cacaaactcc tgacctcaag caatcctcct gccttagcct 10381 cccaaaatgc tgggattaca ggcgtgacga cctcgcccgg ccttgtatta tgatacattt 10441 tgaacaacta caagtagact tggtataatg aacctgcacg tacccattgc caagttctga 10501 caactgtctg tctatagcca attatgcatt tcttaaatta gaaccccccc aatataccca 10561 aatatatata tatgtgtgca tatatatagt aagttgtaac aaagttgtga attcatacct 10621 gaagtatctc aagtgatgca agttttatga atttttgttt atgccttttg ggaagagttg 10681 tattgacaaa ttttttatgc ttaaagtaaa ccataaatca aaaaaataaa atctaggatg 10741 caataaaaca aaacaacttc ttgacataag tatggtatgt aaatctgttt tgattggaaa 10801 tcaatttgtt atattgccag aattcctgtt ttagaataca tctctgctga tctgtctgta 10861 ttcttagact gcatatctgg gatgaactct gggcagaatt cacatgggct tcctttgaaa 10921 taaacaagac ttttcaaatt cttagtcgat ctgcagaacc tgtagccagg cactgaacca 10981 ttttgataga tgcagtaatc gttgcaagtg tatatttcaa gggagttctg gctgggtcct 11041 agtttatgct tgtggcagaa gcagtgagta actgggagga agttggtgag taagcttcaa 11101 ggaagaagtc atttttagta ctctggatct tcctgatttt aaagcactac aaaatggtgc 11161 attttcattc ttgtcaagtg ataacagata tattctgatg agcctgaaat gaatatatat 11221 tgtatcattt ttataatatc tagcaaggtt tgtattttcc tagaacttga actaaatttc 11281 agttcataaa atttataaaa tacttagttg ttgtaaaata tttttggaat gttcacatag 11341 gtgacacaca aatgtcccat tttcattctt tctatagtaa atatgttctg atatgtgaag 11401 gtttagcaga tgcatcagca tttaatccta gaggatctgg cataatcttt tcccccaaga 11461 atagaaattt tttctgctta tgaaagtagt acatgtttct ttaaaaacaa atcaatattg 11521 acttctgcct gctgtatagc actatgcctc cacctggcca tgaccagggg catgtcctgg 11581 tccacctacc tgaaaatgtt tgcaaccagc ctcctggcca tgtgcacagg ggctgaagtt 11641 gtcccacagg tattacgggc caacctgaca atacatgaag ttccaccaaa gtctgagaac 11701 tcagaactga gctttgggga ctgaaagaca gcacaaacct caaatttctc agcactggaa 11761 acctcaaaat ataactgaat tccataaata agattttaag tcttaaatat 
gtatttttaa 11821 atgtattaaa agtcaagctg cttgtattta agcacctaat acaatgctta ggttgtaaaa 11881 ggagatgctc aataggtact aactgatata ttgagattta attatggttt gaccaatatt 11941 tattggaaac cgccaaagct taaatcatca gcttcttgaa tgtgatttga aaggtaattt 12001 agtattgaat agcatgtgag ctagagtatt tcattctttc tggtttattt cttcaaatag 12061 actttgaata taatggtgaa tgggtattat aaattaacta ataaaaatga cattgaaaat 12121 gaaaaaatat atatattaaa gtgtagaaag tgaccaggcg tggtggctca cacctgtaat 12181 ccaagcacct tgggaggctg aggcaggagg atctcttgat cccaggagtt caagaccagc 12241 ctgggcaaca tagcgagact tcgtctctaa aaaaaaaaaa gagagagaaa aaaatttttt 12301 ttatttaaaa aaagtgtaga aagtgtcaag accccacttc ttaccattat ttggtatatt 12361 tctctatacc cacccaccct tcctccttac tccctccctc ccttcccaat ctttttatct 12421 ttttgtattc tgattttttg tttgtatatt ttgctttaat ttaatgtatc ctttaaaaat 12481 ttcccataca ttttatatgt atatataaaa acgcatgctg ccaaagataa tttataagaa 12541 agaccattga atttttttaa aagtgatata tattcattga aaaaaattta gaatatatag 12601 caaagcaata aagaactaaa taaaattgct gtaactcctc tttcaaagat aagtgctttt 12661 atgattttgt tgtatttttt tctgtatata ggtacatata tagtatttat aaagctgtac 12721 tcatagtaca ttttcacatc acaggtacca tatcagtgtt attaaatatt ttgtatgcca 12781 ggggctagac ataccaagac aaccaatatg tggttctact taaataatat tagagtatct 12841 tttatgatga cacttcatga gttgactata ataatcttag acttctaaga gtttgggttt 12901 tcaaaagatc acttagcttt tttgggtgat ttttccccct tactgtgaga tgagagaggc 12961 tgtttggatt tgggattggg gtagcgggga cagcaacttt tcttttcttt ttctttttta 13021 ttttgaggta gggtattgct gtgtcaccca ggctggagtg cagtggtgtg atctcggctc 13081 actgcaacct ccacctcccg ggctcaggtg atcctcctgc ttcagcctcc cagtaactgg 13141 gactacaggc gcgtgccaca tgcctggcta attttgtatt tttagtagag atggggtttc 13201 accatgttgg ccaggctggt ctctaactcc tgacctcagg tgatacgccc acctgggcct 13261 cccaaaatac tgggattaca ggcatgagcc gctgcatcag ccagcagttt ttcttgtggt 13321 tttttttgtt tgttttgttt tgttttgttt ttgagatagg gtcttactct gttgtccacg 13381 ctggagtgct gtggtatgat cgtagctcac tgcagcctca aactcctggg ctcaagtgat 13441 tccttctgcc tccgcctccc gagtagctgg 
gactacaggt atgcaccacc atacctggca 13501 aatttttaca aagttttttg tagggacggg gtcttgctac attccccatg tcggtcttga 13561 actcctggcc tcaagcaact ctcctgtctc agcctcccaa agcactggga ttacaagtgt 13621 gagccaccac accatgccag tttttcctgt tcagtgtgat attttatctt gttagactac 13681 agtgtgttaa aacttgtttt actaaatttt caaacatact caaaagtgga gagaatagta 13741 taatgaatac ccgtatgttc atcacccatg tttagaatat tattaaatat aaagattttg 13801 ctgcgtttgt cttagctctt taaaattttt ctttttctct ttgtgaccta aaggaaattc 13861 catatcttat cactttactt ctacattctt gactaagatg actaagacat atagttacat 13921 ggttttttgt tttgtttttg ttttttaaag acgaaatctc gctcttgtcc cccaggctgg 13981 agtgcaatgg tgccatctca gctcagtgca acctctgcct tctgggtaca agcgattctc 14041 ctgcctcagc ctcccaagta gctgggatta caggctcctg ccaccacgcc tggctaattt 14101 ttgtattttt agtagagacg gcggggggag gtttcaccat gttgacaagg ctggtctgga 14161 actcctgacc tcaggtgatc cacccgcctc ggcctcccaa agtgctggga ttacaggcgt 14221 gagccaccgc gcccagcctg tttttttgtt tgtgtgtttt gttttttttg agacagagtc 14281 ttgctctgtt tcccaggctg gagtgaagtg gtgccatctc agctcagaga cagagtcttg 14341 ctctgtttcc caggctggag tgaagtggtg ccatcttggc tcactgcaac cttcacctcc 14401 caggttcaag tgattctcct gcctcagcct cccaagtagc tgggactaca ggcatgtgtc 14461 accacacccg gctaattttt ttgtattttt agtagagacg ggatttcacc gtgttgccca 14521 ggctggtctc gaactcctga gctcaggcag tctgcctgcc tcagcctccc aaagtgctgg 14581 gattacacgt gtgaaccaac ccgcccggcc tgttgttttc ttacataatt cattatcata 14641 cctacaaagt taacagttac taatatcatc ttacacctaa atttctctga tagactaagg 14701 ttatttttta acatcttaat ccaatcaaat gtttgtatcc tgtaatgctc tcattgaaac 14761 agctatattt ctttttcaga ttagtgatga tgaaccaggt tatgaccttg atttattttg 14821 catacctaat cattatgctg aggatttgga aagggtgttt attcctcatg gactaattat 14881 ggacaggtaa gtaagatctt aaaatgaggt tttttacttt ttcttgtgtt aatttcaaac 14941 atcagcagct gttctgagta cttgctattt gaacataaac taggccaact tattaaataa 15001 ctgatgcttt ctaaaatctt ctttattaaa aataaaagag gagggcctta ctaattactt 15061 agtatcagtt gtggtatagt gggactctgt agggaccaga acaaagtaaa cattgaaggg 15121 agatggaaga 
aggaactcta gccagagtct tgcatttctc agtcctaaac agggtaatgg 15181 actggggctg aatcacatga aggcaaggtc agatttttat tattatgcac atctagcttg 15241 aaaattttct gttaagtcaa ttacagtgaa aaaccttacc tggtattgaa tgcttgcatt 15301 gtatgtctgg ctattctgtg tttttatttt aaaattataa tatcaaaata tttgtgttat 15361 aaaatattct aactatggag gccataaaca agaagactaa agttctctcc tttcagcctt 15421 ctgtacacat ttcttctcaa gcactggcct atgcatgtat actatatgca aaagtacata 15481 tatacattta tattttaacg tatgagtata gttttaaatg ttattggaca cttttaatat 15541 tagtgtgtct agagctatct aatatatttt aaaggttgca tagcattctg tcttatggag 15601 ataccataac tgatttaacc agtccactat tgatagacac tattttgttc ttaccgactg 15661 tactagaaga aacattcttt tacatgtttg gtacttgttc agctttattc aagtggaatt 15721 tctgggtcaa ggggaaagag tttattgaat attttggtat tgccaaattt tcctctaaga 15781 agttgaatca ttttatactc ctgatgttat atgagagtac ctttctcttc acaatttgtc 15841 tctttttttt ttttttttga gacaaggtct ctgttgccca ggctggggtg cagtgcagca 15901 gaatgatcac agttcactgc agtctcaacc tcctgggttc aagcgatcct tccacctcag 15961 cctcctgagt agctgggact ataggtgtgc gccaccactc ccagctaata tttttatttt 16021 gtagaaacag ggttcgccat gttacccagc ctcccaaagt gctgggatta caggcatgag 16081 ccactggccc agtttctaca gtctctctta atattgtata ttatccagaa aatttcattt 16141 aatcagaacc tgccagtctg ataggtgaaa atggtatctt gtttttattt gcatttaaaa 16201 aaaattatga tagtggtatg cttggttttt ttgaaggtat caaatttttt accttatgaa 16261 acatgagggc aaaggatgtg atacgtggaa gatttaaaaa aaatttttaa tgcatttttt 16321 tgagacaagg tcttgctcta ttgtccaggc tggagtgcag tggcacaatc acagttcact 16381 ccagcctcaa catcctgcac taaagtgatt ttcccacctc acctctcaag tagctgggac 16441 tacaggtaca tgctaccatg cctggctaat tttttttttt ttgcaggcat ggggtctcac 16501 tatattgccc aggttggtgt ggaagtttaa tgactaagag gtgtttgtta taaagtttaa 16561 tgtatgaaac tttctattaa attcctgatt ttatttctgt aggactgaac gtcttgctcg 16621 agatgtgatg aaggagatgg gaggccatca cattgtagcc ctctgtgtgc tcaagggggg 16681 ctataaattc tttgctgacc tgctggatta catcaaagca ctgaatagaa atagtgatag 16741 atccattcct atgactgtag attttatcag actgaagagc tattgtgtga gtatatttaa 
16801 tatatgattc tttttagtgg caacagtagg ttttcttata ttttctttga atctctgcaa 16861 accatacttg ctttcatttc acttggttac agtgagattt ttctaacata ttcactagta 16921 ctttacatca aagccaatac tgttttttta aaactagtca ccttggagga tatatactta 16981 ttttacaggt gtgtgtggtt ttttaaataa actcctttta ggaattgctg ttgggacttg 17041 ggatactttt ttcactatac atactggtga cagataccct ctcttgagct acatcggttt 17101 gtggggagtc aaaagtcctt tggagctagg tttgacaaat aaggtgggtt aacacttgtt 17161 tcctagaaag cacatggaga gctagagtat tggcgaattg aagaaatccc cctttttttt 17221 taacacactt aagaaagggg actgcaggta tactcaagag agtaagtcgc accagaaacc 17281 acttttgatc cacagtctgc ctgtgtcaca caattgaaat gcatcacaac attgacactg 17341 tggatgaaac aaaatcagtg tgaattttag tagtgaattt cattcataat ttgatcgtgc 17401 aaacgtttga tttttattac tttagactat tgtttctgat tttatgttgg gttggtattt 17461 cctgtgagtt actgttttac ctttaaaata ggaatttttc atactcttca aagattagaa 17521 caaatgtcca gtttttgctg tttcatgaat gagtcctgtc catctttgta gaaactcgcc 17581 ttatgttcac atttttattg agaataagac cacttatcta catttaacta tcaacctcat 17641 cctctccatt aatcatctat tttagtgacc caagtttttg accttttcca tgtttacatc 17701 aatcctgtag gtgattgggc agccatttaa gtattattat agacattttc actatcccat 17761 taaaaccctt tatgcccata catcataaca ctacttccta cccataagct ccttttaact 17821 tgttaaagtc ttgcttgaat taaagacttg tttaaacaca aaatttagac ttttactcaa 17881 caaaagtgat tgattgattg attgattgat tgatggttta cagtaggact tcattctagt 17941 cattatagct gctggcagta taactggcca gcctttaata cattgctgct tagagtcaaa 18001 gcatgtactt tagagttggt atgatttatc tttttggtct tctatagcct ccttccccat 18061 ccccatcagt cttaatcagt cttgttacgt tatgactaat ctttggggat tgtgcagaat 18121 gttattttag ataagcaaaa acgagcaaaa taggggagtt taactttaat attttctttt 18181 aaaaagcatt tcatgttata agatcaattc tgagtggtag aaaatgcttt gacattttat 18241 ttccattttc tacttttagt ttttttccta tttgtttaag atcttagagg attattaagc 18301 tgaactcctc aactgataaa aagcatgaca tcttaaacat aagcaaagca tatttttagg 18361 ttaattttca catagaaaac agtttatttt atgtgaaatt ctatgtagat atactatttt 18421 tttggtattt attgatatgt ttattttatt ttattttatt 
ttattttatt ttattttatt 18481 ttatttattt attttttttt ttgagacaga gtctcactct gttgcccagg ctggagtgca 18541 gtggcatgat cgtagctcac tgcaacctcc actcccgggt tcaagcaatt cttctgtctc 18601 agcctcccga gtagctggga ctacaggtgc ctgccactat gcccggctaa tttttgtgtt 18661 tttagtagag atggggtttc accttgttgg tcaggctggt ctcgaacccc tgacctcagg 18721 tgatccaccc acctcagcct cccaaagtgc tgggattata ggcatgagcc acgtgcccgg 18781 ccgacatgtt aattttttaa aaaaggcttt actggggtat attttatata atataataat 18841 cacatgtttt aactatacaa ttccaagctt tttagtatat ttatagggct atgcaaggaa 18901 gatatactgt taaacagtag aaattgagaa agctcttctg ataatatctc ttgatttgat 18961 gatggctcat gcctgtaatc tcagtgcttt ggaaggccaa gacagcagaa tcacttgagg 19021 ccaggggttc gagaccagcc tgggcaacac agcaataccc tatctttaca aataataaaa 19081 atatctgttg atttgaagta aagttttttt ttaaagacaa ggtctcattc tgtcacccag 19141 gctggaatgc agtagcaaga tcacagctca ctgtggcctt gaccttctgg gctcaagtga 19201 ttctcccact tcggcctccc gagtagctgg gactaacagg tgtgcaccac catggctggc 19261 taattttttt ttatgtttgt agagattggg tcttactgtg ttgcccaggc tgatcccgaa 19321 ctcctgggct caagcagtct tcctgcctca gcctctaaaa ttgctgggat tacaggcttg 19381 agtcaccatg cccagcctga agtagcattt ctaccctgtt taataattca gcagcttgtc 19441 atgtaagata ttcatatatg catataaaca ttaggcagct taatttggta aaactgtaaa 19501 atggaaattt taaattgttt gcagcatcaa taacattgat gtcagtatga tttttacatg 19561 ctgatcttga ccaatttgaa acagtgagtt aaaatctggc tgatccgtac taatcctaaa 19621 gaaatattct atgaactatt aaatgtttcc agaatatata aagaaacatt atgatgtcaa 19681 cacacccatc tatttttttt tggaaataaa aactccattt ttcttattaa agaaaacatg 19741 cttattagaa aacatacggc tgggtgcagt ggcacacatg taattccagt gctttgggag 19801 atcgaggtgg gagaatcact tgaggccagg agtttgagac cagcctagac aacataatga 19861 gaccccctct ctacacaaaa agaattagtt gtgcatggtg gcgtgcacct gtagtcccag 19921 ctacttggga ggcagaggca ggagcatccc ttgagcctag gagtttgaga ctgcaggagt 19981 tcgagactga gtggaatgca gtggaactgc attccagcct gagtgacaga gggagaccct 20041 gtcttaaaaa aataagaaag aaaacacaac tgcagaaaat tataaaggat ttaagtcatt 20101 ccaaatatca ctgccacttt 
ttatttagaa tattctaaag aattctctct ctgtgtacac 20161 acacacatat gcgtactctt aatccaagta gcttggtagg attttattta cctagtgcct 20221 agatgggaaa ttgcctgggg attccaaata cctatttcat taaattaaag atgtcactga 20281 ttttaagact taacactatt tttcatactg ccaagaaaga aaacactacc agttataaat 20341 gtaaattgcc atcaattgta atacatcaat tttagagcta ttattaataa aatgtgaatg 20401 tgcatcttag agcaatgaaa tatagtacta tatatttgat gaccttttct gccctgtgat 20461 attcagaaag tgaaagttaa atatgggctg agcatggtgg ctcacacctg taatcccagt 20521 actttgggaa gtcaagacgg gaggctggct tgaacccagg agttcaagac cagcctaggc 20581 aatgtagcga gacgccatct caaaatatta aaaataagta aataagtaaa taaaaagaag 20641 gttaagtata caaatgtatt tcctttgttg tgaatttatt tcaattttat agtgattttt 20701 tttttttgag acgaagtctc actcttgtcc cccaggctgg agtgcgatgg cgtgatctca 20761 gctcactgca acctctgcct cccaggttca agctatactc ctgccttggc cccccgagta 20821 gctgggatta caggcgcctg ctaccatgcc tggctaattt ttgtattttt agttgagatg 20881 gggtttcacc atgttggcca ggctggtcta gaactcttga cctctggtga tccacccgcc 20941 tcggactccc aaaatgctgg gattacaggc gtgagccacc gtgcctggcc agtggttttt 21001 tgttgttgtt gttgttgttt tgttttgttt ttgtttttgt ttttgttttg agacaggatc 21061 ttgctctgtc acccaggctg gagtgcagtg gtgccatctt ggttcactgc aacctctgcg 21121 tgggctcaag caatcctccc acctcccttt ccagagtagc ggggaccaca ggtgtgtgcc 21181 accacacctg actaattttt gcattttttt ttgtagaaac agggttttgc catgttgccc 21241 aggttggtct gaaactcctg agctcaaaca atccaactgc cttggcttcc ctaagtgaaa 21301 ttacaggcat gggccactgt acccagtcta gtgatttttt tatttttatt tttattttat 21361 tttattttat ttttttacca aaaaaacaac aaagcctcag gaggaaaagt tgatacacaa 21421 gtaaatttta ttggaaatgt ttttgtgtgg accttaagca gagggaaaat tagtctgcat 21481 tatggtgtat ccagactaaa tgactgatat taaaatgaaa ttattcttag gatttgcaat 21541 cttagagaaa actttttcat ttttattttt ttgagttaca aattatcttc atttacattt 21601 gagaacagtg agtcacagag ggattaagta acttactcaa gatcatacaa gtctttgatt 21661 tgaacccaat cttttaactc tgcagaactc agagtcactc ttatttggaa aaacttttta 21721 actgatgtgg atcctctaat atgggcttcc tattattcat tctctattag tcagaagttt 21781 
tgcaagcaga cagaattcat tttgccaatt acgggatttt ccctcagttg cagtcaaggt 21841 tcataaaact ataactcttt atctttaatt agaaatgttt ttttttttga gacaaggtct 21901 tgctctgttg cccagactgg aatgcagtgg catagtggcc cattgcagct ttgaactcct 21961 gggctcaagg gatcctctgc ctcagcctcc caagtatctg agactacaag tgcgtgccat 22021 cacccatggc tattttaaaa aaaaaaaaaa ttgtagagat agggtcttgc tgtgttgccc 22081 aggctggtct caaactcctg gtctcaagca atccttctgc cttggtctcc caaagtgctg 22141 agattacagg tgtcagccgt tgcacctggc caaaacgata acttaaaata cacacacaca 22201 cacacacaca caaacacata tgtgtattgt gtgtgtgtgt gtgtgtgtgt gtctcaaaag 22261 gtatcaaaag agaatagcta taactttagt gttgatcttg atagtgactt gattaggctc 22321 tgtttaacat caaagatgca aattaatact ttctttgaac atattaaaaa tgcagaaaat 22381 attggagtat tttattttaa ataaattgta ttctgtatat ttaaggtata caacatgatg 22441 ttatgggata catataggtg gttaaaagat tactgcagtg aagcaaatta acgtatccct 22501 caactcacat agttacccat tttttttttg ttttggtggc aagaggagct taaaatctca 22561 tttagtgtga atcccaaata cagcacaatt ttattaccta tatacttcat gttgtacatt 22621 atatttctag acttgttcat cctacatatc tgctactttg tatcctctga gctacatctc 22681 cccattttct cacttgcccc ccaagtagtt tcttaaagtg tctcatgtaa gagggcagta 22741 gctttcagct taaacttttt ctctgtatgt agtcgatttc tttgaggtat acttttctct 22801 ccagaatagt tagatgtagg tataccactt tgatgttgac actagtttac ctagaactta 22861 tcttctgtaa atctgtctct atttccatct ctgtctccat ctttgtctct atctctatct 22921 gtctatctct atctatctat ctatctatct atctatctat ctatctatct atctatctaa 22981 agcaaattca tgcccttctc ctatttattg aatcgagacc atagacaggg gtgagagaaa 23041 gaatttggca ggaatgggga tgtgtattat ctgtggcata aggaaacttt acagaactag 23101 gttcaaaagt atactttcta gttctttccc atggcttttc actttgatgt agtccttatc 23161 aggtaactga ggttttatat aagtcccctg attcttagaa catgaaggtg tagtagtcaa 23221 ggttggtccc ttgaaaccac aaattttgtg aaaaaaaatt aagaaaattt gaataatttc 23281 ctcagcaaat acatattgat catctgttat acagccatga gaagtggttc tgttgcacac 23341 gtttatttta tcagatccta atcccaaacc aggcataaaa tggaaaccat gaagatagga 23401 tgaaataact tctgaatgtt tgaatgtttg aaaatagtgt acttaaaaat 
accaggtggt 23461 ttttgtttgt tttttgtttt tttctttttt tgagacaggg tctcactctg tcacccaggc 23521 tggagtgtag tggtgcaatc tcatctcatt gcagtcttga cctcccaggc tcaggttatc 23581 tcccacctca gcctcccaag tagctgggac tacaggcaca tgccaccacg cccagctaat 23641 tttttgtatt ttttgtagag acggggtttc accctgttgc ccaggctggt ctagaactcc 23701 tgggcttaag cgatcctccc acctcagcct cccaaagtgc taggattaca ggcatgagcc 23761 accatgcctg gcagaaaata ccaggttttt aagtatcagc acttactctt caatcttttc 23821 tattactatg ttgtgctaaa tggtattttt tatttaatta gagcaatgct gttcaataga 23881 actttctttg aggatggaaa tcttttatgt ttctgctatg tggtacagag ccactagtga 23941 catgtggctt ttgagcgctt gacacatctt gtgcaacaca ggaactgaat ttttaagtaa 24001 tttatattgc cacatgtggc taccgtatgg gacagtgtag tactagatga tctgtaaggg 24061 ctgtgcttca tcagtgtcgt tttttaactg acaaaaacct ttagtttttt ttttagtaat 24121 gtgtttattt aaaagaattc ataaaataca agtaaacaaa ttaacttgtt acctgagcat 24181 atgtcctttc atacttattt tttctgcata catattttgg aaaatggaat atctgcccct 24241 ttttttttat ctgagataca gtctacctct aaaaatacat gattctaaca ttctcacttt 24301 ttgttggcat ttgatcaggg tatagaaaaa cagttaaaag gacagagaat ggttgagaga 24361 ttatgatatg aagagaaaat gtgattgagt gtggtagact tggggcctgc ttgaatgttg 24421 agagaatgac tgttttccga taaaaaaaaa aagtccattc taggatccta aaagaagggt 24481 ctgaagttca ctgcagaaag caagctacat agtactaagc cactaagggg acatggagcc 24541 cttagtaatt cctaccttag taatagtctc atcatgccct cttgggaacc cagccttgtt 24601 gattagcctc tctgctttct ctccttatag ttcaacctcc ctgtttgttc caagcagttc 24661 ttttcctgcc catttattat gcatttctat acagctttcc tcctcttttt ctataccatg 24721 ctgcagttct tattgctacc tagaggtttt caaaattcct aggggcggat aagtaggcat 24781 aaacaaagtt cttccctatt atccttccta ttttttcacc tagactgaag aggtagacaa 24841 aatagaaata aagacattaa gggtatgtgt ttgtagtccc aaagagcttc tctggcaatt 24901 ttgatgtagt tgacagtgac gctctgagtt caggacagat tggactcctt ggctgagagg 24961 agtgaggaga taggacggta gaggagaggg tagagcaact ctggaggaag ctttcccctc 25021 acctttgcca gtcctgttat cctagactta accataatta aagatgaggg aggcactcag 25081 taaagggatc tagtgggaag cttgttccag 
acagccaagg agggaggttc gcgcagttcc 25141 tttggccacc caggtggggt aattgatcca tgtatgccat tcatgtacaa tgtaggcact 25201 tatacctgta ttccaatgta gtgaactata ccattactct taaattaata ttctttatta 25261 gcttccatgg tggctatagg ccaggcaaga gagttaagaa aaaataaata gccaggtatg 25321 gtgactcaag cctgtaatct cggcacttta ggaggccgag gcaggaggat agcttgagtc 25381 caggagttca agaccagcct gagcaaaata gtgagatcct gtctctattt tttaaaaaag 25441 ccttggggca aacaggagta tggaggtttg gatgctaata gaacagcagt gtcttactgc 25501 ttggagttct cttgtttctt gtcctatcac cgtagccttt ggatcacagc aatttttcca 25561 tgactccata cttttcagtt cttgaatatt ttttccttta ttcctcttgt ctctgtaaag 25621 acatcaactg gagttggact gtaataccag gtatctccag aagatggcac tatttaacag 25681 attttataaa taatttgatg tgagtcactg tcatctgaag cttgttgcct tttctttctt 25741 tcttctttct tttttttccc catcaattct gtatgtttga aatgctggga tttaagttag 25801 ttagaataag ggatgtctgt aatttcccta aattgagaag taatatgcaa aggttgatat 25861 cagaagtcat atgctcacct tgcaacacca aataatactg gcccatttgt gatttttgaa 25921 agtaacactc cataataaat ggatgtatat atagaagcat aacaaaaata gaagcacata 25981 aaagtgaaaa gtctcataaa cgccattgtc actactcatg taattgctgt tacaaatttg 26041 tttaaatgtt gaataaaaat ggtgtcatag gcaacacagt gttccactac ttggtgtttt 26101 taatagcatt attctgtctc agtgtgcttt ggattatcag gtgcttttta atagttgcat 26161 ggtattacat tgtgtagatg aacttgatta atttaaatgg ttccctgtta atggacatgt 26221 tggtttgttt ttgtgaacaa ctgatacagt gaacatttat tttttaaata aaaaaaagag 26281 agacagggtc ttgctgtgtt tctcgggctg gccttgaact cctggggtca agcgatcgtc 26341 ttgcctctgc ctccctggga ttacaggcat gaagccaccg cacccggccc agtgaacact 26401 cttgaatgta tctttgtata cttgtcaagt gtttttgtag caattgattc ccagaagtgg 26461 gaattacatg gaattaagtg acatgcatgt ttgcaatttt aacaggtatt gctatgtcat 26521 tttcaaaaga agctatgcca attaatactc tcaccaacaa gagtgcttat ttcccctcag 26581 catattatca ggcttaagtt ttgccagtat gggtgggaga acagtagaat cacattgttt 26641 tagtgtttgt ttctcagata gatataattt tacaccttat aaccttctct tctataaatt 26701 gtctatttgt gttcattctc cattttccta tgggttctta ttgttggagc ccaatatata 26761 aaagggggta 
tttgttacag aacctcttca gttttggttc atgtcatgcc tgggttttta 26821 ccctttctac ggatgttaaa aaaaattctc tattttcttc cagtccactt atggctttat 26881 tttttacatt tagattttaa tccgtctgga atttattttt gtgtatgctg tgaggtaggg 26941 accatacttt tattttttcc caaatgggtt actagttggc caaacatcat ttattgaata 27001 attcatcttt tccctactga ctcgaaatac catctttatt gtatactaaa tcctcatata 27061 gttctgggtc tgtttctggg ctctactttg ttcatttact gtgctggtac tgcaccgttg 27121 taattgctgt ggctttgtgg tatggtatgg cttgctctct gctagggcaa gtcgaagctc 27181 ttttgttcac ctgctctttc acccaaattt tctgtcctga atccagcaca gccaaattat 27241 ggtcattgtc accaccaact acagtgggtg ttgagcattt cccattgaat ctcctgtaag 27301 ggttttattg gattctgtga tagcagtaaa atgggagcct aagaggtatt ccttaaagga 27361 ctactaatca gacctggttt cccagatgat gctgaagatg acggggcctg ggctagactt 27421 ttgagggaca tatccttggg gttgggtgtg atatagacca gcccttacaa tttgcttgac 27481 tcatgggaat cgtacagggc cagaaccaga cacctgtcat gctaataact tccctcacaa 27541 ttcagaaatc actgtgattg aagatgggtg gctgttataa tactacccac ttaaaaatgg 27601 atgtaaccca ttttttagga ctcttaaaaa catcaaatca gtaatggccg attaggactt 27661 tttaattttt actaatctct acttgaaagt tttctagtca ttcatttcag gaaacctaat 27721 tcttataatt catatcattt agaatatcat aatgctatgg atattagcta gctaacttct 27781 caaatcttct agttctcatt taatttgaag tttgtgtgtg tacataagga tatacatata 27841 catatgtgtg tgtagatata tatatatata gttttttttt ttttaactag aatgaccagt 27901 caacagggga cataaaagta attggtggag atgatctctc aactttaact ggaaaggtat 27961 gtatcttgaa agggaagaaa aaaaagcact tcataccgag tcaattagta acagtgtgct 28021 ttcaatcaat cactaagaga taatttacat agtataacta aatgggttat ttaacccttg 28081 gaagcagtct aggttaatta tcgttcccta ggtcatgtag taaaaagaca gtagaatcca 28141 acattaacct taaatgtcca tattgtcaag tactgctgtc tgcctctgtg ggactctaat 28201 ttgggatcct tcaaaaaaca ttgatggggg aaaagatagc ctttaaaaaa aaaaaaaaaa 28261 caaacctatg tgagtctatg tgaggtagac tcacatagtt tcctaaaaga tagcaaagca 28321 gtattatgta gtggctgaaa gtgtgagttc cggagcctga caactgattc aaagcatggc 28381 ttagtacttc ctaactctga ccttgggcaa gttacttaac ctctctgtgt cccatatgtg 
28441 attagggtga ggttgataat agcagccata gagttaagag gattaagtgc tataatgcaa 28501 gtagagctct tacaacagtt tctggtaaat cactcaataa attcagacat actattattt 28561 taagaaatct caaagagttt tcttgtacct taaaattctc ctagtgtgaa ccattggttt 28621 tggtatattg tgcttccatg tagtttaata tcaagatgtt tttagatttc ccttttaatt 28681 tatttgttga cccattggtt gttcaggagc atgctgttta cctgaaaata atggagatat 28741 taaggtattt gaatatttat cttctagtac attgaaaaac tttttgagag taaccaataa 28801 taaatgatgg aatgctactg cttttttttt ttgaagctgc cagttattgt ttacttacac 28861 tatgccaaat ataaaggcat taatctcata aaagtttcac aacaatcctg tgagggagac 28921 gatatcccca ttttacaaat caggaaatta agacttaata aggttaaaag acttgcccca 28981 aagtcacaga accagtaagt ggtagagctt gaatttgaat acagacctga ctctaaagct 29041 cttttctttc tttagatttt agtgttcatt gcttacttga atgagtatct ataagaaaac 29101 tttaacatgt aaaacttctg tgaaattatc ttgtcccata tcagggtcat gtcaaactaa 29161 tgtcctcctc agcatctttg gaaaacttca gaggagaaat gagctttgcc cctcctgttc 29221 atttcatata ccactgttag acctgtcctt ccctttcagc atgctttgtc catatttaga 29281 agctgttgaa gccattactt gtctggtcag tttttagtgc tggaatggac ctagcctttt 29341 aggccttctg agatttagtt tgatctcgtc tttcccacct aatggctctg ttctactaca 29401 tagatttgat ctgaaacagt tctctgtttc taaaataact ttcttttcat gatagtcaca 29461 gtaaagtaca tttattatgg aaaaatcaat aagtataacg agtgaaagtt atttcttggt 29521 ggtaagatta tgggattatt tgaactttct gtttcattgt attttattta tttatttatt 29581 tttgtgatgg agtctcactc tgctgcccag gctggagtgc agtagtacga tcttggctca 29641 ctgcaacctc cccttcccag ttcaagtgat tctcctgcct cagactccca agtagctggg 29701 attacaggcg cacgccacca tgcctggcta atttttttat ctttagtaga gacagggttt 29761 caccatgttg accaggctga tctccaactc ctgatctcag gtatccacct gcctcagcct 29821 cccaaagtac cgggattacg ggtgtgagcc accctgcctg gcctcatttt gtcttttggg 29881 ggtatttttg tgtgcagata tatatgtata taaatatttt tccctctttt ccccagttag 29941 tatttgagca gatgaacttt ggacccgaat acctgtattc aagtctctaa taccacttct 30001 tggctatttt cattttatca aatggcctct tatcctcgtt tttctcattt attaagtaga 30061 gatgtaacta cttgatataa ttcaaaaact caataatggc 
attcttttgt tttttagact 30121 ctagtgtctg tactccttgt accatgctgg gattcatttg aacaattgca tggctttttt 30181 agtgtattat taaatttgca gtttacttag aatttactgg gacctcatac aaatgggaaa 30241 aaaacataac tgtgttactc atttgctgtg tgcctttgga ttgaccctat tttttgtatt 30301 cattttctcc ccatgtcctg agttccactt tgaataaaaa agtaattttt ttcctgcctg 30361 taaaataggc taccaatagg ctgcagttgt ctatagtagc tgcttcactg aggagagctc 30421 agcatgagag aaatagtatg aattgcttgc cacaagttat gggctagcct tacttcattc 30481 tgtacttgga cctgtttagg cttctaagag atcttacctc caacaataaa ctgctttgag 30541 acatgaaaag gtggaagctt tacttggtta taactttact tttaatacct agaacagtga 30601 gtcttcaaac ttgtatttgc atgcccaatt tataaaaagt ttcctgagca tttaccccta 30661 atatatgcat tttaaattat atatgattta tggtaataat aatatatatg ttacaaaata 30721 catacaaaaa tatagattaa acaaggtgag gttaaaaaat ttaaaagttc taatctttct 30781 tgcaaaccag tggatctttt gtgccttact ctggtaaaca ctgtcttaga agaatatata 30841 gaacattaaa atcttaatgc tatagttata tgacagagta tgatgagagc tacagataaa 30901 caacacatca tgaatcttct tgtggcagtg tttataacca ttatgtgaaa tgctgcctca 30961 ttcttataac tagcataaga acagatagga ctttctcgat tttgaggggt aattattaga 31021 tggtattttc tgttaaggac tcttccagct ataaaattct taaatgtaga aagcgaagtg 31081 agggtttatg gtgagaggaa gcattggtat catgttttag tgtagtccaa gaatatggac 31141 acatccagaa aatgcagatc aagtttagcc taatgagaaa atatattttg gagtccatat 31201 ggtaaattaa attatgtgat ttttgagtta ttgtacaaat ataattctta gaatgttaga 31261 gtcaggagac tataagagac caactgcttc aagtttcatt taacacatgg gaaactaagg 31321 cgagagaaat ttcaagactt gcccaagatt agacctcttg ttaagtaatg aaagtgtttt 31381 aaaaacaggt gggtcaaatt ctgtttttaa aatttccatt atgatgaaaa tttcagtatt 31441 acaggcttcc aaatcccagc agatgggcca cttgtttaaa ggagagtttg atataataaa 31501 gcatctaaaa acaagagttt ggataattcc ttagggttgt tatgatgtga tttgacttat 31561 aattggaaat accgttttat tcattgtact gattttcatt tctctttttc ttctagaatg 31621 tcttgattgt ggaagtaagt tcacatttac ttttaatata acatttatga cttttctaac 31681 ttagtatgca ccatcctaaa ggtaagccag ggagagaaat tcctctgcat cagttttaat 31741 ggtgggcttg tgttctaaag 
gagtgagatt ggttttttgt aaagactact tagtaatttg 31801 tttttaccaa taatggaatg gtatacttcc tacctctctt tttttagttt gaagtatttt 31861 ctttctaaac ataactctct ctctctattt atctatatat aatatataca tatatatctt 31921 atattttatg tatatatata tatatcttgc ttagattttg tcttatgtaa tatttggtac 31981 ataaaaaata atatttataa tttatagact attttccatg tgttattatg tgctaaagta 32041 ttttgtatct tagcaccgag aggctaagca gtttcctagg gttaccagct agtaaactaa 32101 gggaaacctt tacttccttt agctcagtgg ttctcaaaat gtggttccct agaccaaaag 32161 tattaatatc agacaagaac ctaccgaatc aaaatatctg tgatgaggcc cagcaagcta 32221 tgctttaaca agtttccgag tgattctgat gcatgctaag gtttaggatc ccttgttttt 32281 actcataagt cactttctca ttaaggcctt ccctggccat cctatataaa atctcatgtt 32341 ttcacaccgt caacttcgta ttcctcctca atacttttat tttcctgatc acttatcact 32401 aacagcctct ctctctctct ctctctctct ctatgtatat atatatatat atcacttatc 32461 actgtctaac agcctctctt tatatatata taatctatag attatatata tatgcagcat 32521 tgtgcaatca ttatcacgct caattttaaa acattttcat ttccccacaa agaaacccaa 32581 tccccttagc catcactccc aattttccct tcccccagca cctagcaaac tgatcatcta 32641 cctacttgct gtctataaga tttgcctatt ctggacattt tgtataaata gaatcataca 32701 atatgtggcc ttttgtatct ggcttctctc acttaatgtt ttcaaggttc attcatgttg 32761 tggagtatat ctgcactcat ttccttttta ttgccaaatt gtatggatag acaggtgttc 32821 ctcaactgtg tcctgataaa cccatctgaa gttgaaaata tcataagttg aaaatggatt 32881 tactactttg ataaatctat cctaaagtca gaaaaatctc atgttggaac catcgtaagt 32941 tggataccat ctgaattaca tttttgttat ccattcactg gttgacagac gttaggttgt 33001 ttccactgat gctccttatt tctcgtacct gaaatgtcct tattccctcc cttcttatcc 33061 catgtttaag tcatttaaga cccagctcaa acgtcacctc cacaaaacct tccttgatac 33121 ccctttcctc ttcaattcac ttggaccttt tgcatttaat tttaattttt atttttttta 33181 agacagagtc tcactctgtc accaggctgg agtgcagtgg tatgatctca gctcactaac 33241 tactctgcct cccaggttca agcaattctc atgtctcagc ctcccaagta gctgggacta 33301 caggtgtgcg ccaccatgcc tggctaattg tgtgtgtgtg tgtgtgtatg tatgtatgta 33361 tatatgtgtg tgtgtgtata tatatatata cacaaacata tataaatata tatacatata 33421 
tatatataca cacatatata aatatatata catatatata tatacacaca cacacacata 33481 tatatatata tagttttttt ttttttaagt agagatgggg ttttgccatg ttggccaggc 33541 tggtctggcc tcaagccatc ctcccacctc ggcctcgcaa agtgctggta ttataggcat 33601 gagccactgt gcctggcctg catttcattt taattataaa atattttgaa ctcagaaaaa 33661 agggtatgct gaatacctac gtacccacaa aagtattaac attttgccat atttgcttct 33721 gatcttattt tttttgagaa attaaagatc ataatacaac taaagcccca tttctttccc 33781 ttcattccca gaagtatgac aattatcctt aaagttgata tatatcattc ccatgcatgt 33841 tttttatact tccctagtac aagttagctg tatcctctgc tcaggggctc atcaagctga 33901 atcaagggac tcatgatcct cttcaaagtt ccttcaggtt gttggcagaa tttagttcct 33961 tgtgattgta ggactgaggg cccgttttct cactggctgc tggccagggg ttgctcccag 34021 atatttaaag gctcatgccc tagcccatga cagtctcaca acatggcagc tgacttcttc 34081 aaaaccagca ggagaatctt gctctagtct accacataac ctaatcacag gagcggctat 34141 cccgttattt tcacagatcc tggtcacatt caaggggagg gaacccttct gtgtgtgtac 34201 accaggaggc aggaattttt tttttctttt tcttttttgt taaaaagtct taaagtcttt 34261 tatccctaaa ggaggcagga attttgagag ccatcagaat tctgcctacc acagcccaga 34321 aatctgcatt tttcacaagt ctccagccat gatgtttctg atggctcaca ctgctttatt 34381 ccatttttaa agagtatttt tattgaaaag cattagggtt atggtttaaa aaatattttc 34441 cctaacaaag atgggtttgt ttagagtcct acttttgact aaatagctga gattcacttt 34501 tatgtaaagt tcattttata gcgttattaa tttgggtgcc tttaaaaata gtataaagca 34561 tgtttctcga gtgtagtctg ttagccacct atattggaga gttgggagga gagagtctct 34621 atcttgaatt tatgggaaaa attctaaaat actttttata atgaaggaca acatcataac 34681 tccctaataa aatgtgcatg tatatattca aatttgctgt cattgatcct gcacctacaa 34741 aatccagtcc tgggggctgg cattcttact gcttgctgag ggccagatga tatagattcc 34801 agaatatctc catgtagatt ttggtgagaa ttactgtgct gaaaagaatg acagtattgc 34861 agttatacat gggggttttg gtactttata ttgtgactct gaatttaaag ctatgcaatg 34921 tcttcttttt tgaaaggata taattgacac tggcaaaaca atgcagactt tgctttcctt 34981 ggtcaggcag tataatccaa agatggtcaa ggtcgcaagg tatgtatgac attttgacac 35041 agaatatttt cctcatttga agggggatta agtgattgct tctttttaag 
gataaatgtt 35101 ttcaactgtc attttatctt cgaaaagtaa tgtaatctca tataagactt aagatataat 35161 ccttttaaat aattttgtca tgtgttaata aagctcataa ttacagtcac ttccttgcta 35221 atattaacat ttggttttca gcatgctaat tatatcagtt tgtcctgaat agcatggcag 35281 aggattttgg gcccccttgc aaaattaaga ataaggattc caaagcgggt gaggaagtga 35341 taggaagggg tgggccctga agatctggac ctcctggaat tgagtgatga atgctgcatc 35401 ttctttgtgt ctgtagtgaa attttataat gcctgcttcc ttttttatta agtcggcctc 35461 acctcctcac cttacctatg ctgttttact tttgctttta tagttctacc tgtgtttatt 35521 tctcattttc gtttcatctc tcaacaactc tggggtggca ttattattcc cacttttcag 35581 ataaggttac tgaggcatag ggaattgtcc aaaggtacag agctagtccg ctatagagat 35641 gagatttgaa cccagggaac ctggctcaca gtttatgctt ttgcctacct taagttttta 35701 atagagtgac atcaaacaaa catttaagaa tatgtttttc ttttcctttt ataatttcat 35761 taaaaacatt aagtctctga tcagtctgca gtttttatgt aggggtcagg taatgttcta 35821 acttctgctt tttcctaagt gattaacagg tttttataag cccttttgaa aaaatcacgg 35881 tatctgtcga gcatctttga atcagagtaa gccttctagt gagtcatatg tcagcagttt 35941 gactgtatgg gcttttctaa tatccagttc aagtgtttat cagtgagttt ttcttttaaa 36001 tagatttggg acaggtacta tgagagtata taagtgatac gttataggac actaactagt 36061 atcctatgaa atggcaaaaa ctgcaatcac ttttgcacca accaaataga aactaatcag 36121 tgcacttgct tatttttcta catgctcttt agggttttaa atgtcaacct actgtggcat 36181 agactttaat cctctgggta ttcttttgtt gttctttcct ggtatatgct gtggaattga 36241 gatagactgg ttcgtgagcg agagattttg tgttgccaca ggtaggacat gctcaaacaa 36301 tacttgggtc atttcttgac ccaagtcatc tattcaccat agttttgtag caccgatctt 36361 gcatacattt catgtatctt ctttgaaccc cacgtcagtg ctgcttatat gatactcaga 36421 aattaaacac taaggaataa gattttcagg taggattgag ttttggaggg tcacaaatct 36481 tgtaatgtct aatatttcca ctctccctgc tgagaattag ttttggcttc cttggaggtg 36541 atatcgcctc tgttgagtat aagtggccta ctgtgatcac accactgcac tccagcctgg 36601 gtgacagagt gagaccctgt ctcagaaaaa aaaaaaaaaa aaaagaatgc atggcctaga 36661 tgacttctaa ggtttttccc acccagttcc agttttcatg ttctaggcag agcagtaaag 36721 tgagaaacac atggacttgg gagtttagtc 
tcgcatttca ctgccactta atctgagcga 36781 ctattccata tttaatctct ctgaatgtat ttactcatct ttaaagggga atgattatta 36841 acatcttttt ctcagggaaa ctatatgagt caaggagata atatatttga aaatcttttt 36901 aactgcaaag cgctgtttca ctgttggtta taatgtgatt gatctcattg tagtgagcag 36961 ctgcttaatt gcgttttaga atgtagggaa gatagtaata tttttcacat tatatatgta 37021 gctggttctg gaactgtaaa catactcctt ttttatggag atctgagtca cgtaccataa 37081 aattcactct tttaaagttg tacaatccag tggtttttga tatattcaga gttgtgcatc 37141 tgctaccact atttcatttt ggaacccaaa gaaaccttgt acccattagc agtcattctc 37201 ccttctccca gcccctggca actactaatc tactttctac agaaagtccg tacagatttg 37261 tgtattatgg acattccata taaatggact catgcaatat cctgtcttct ttcacttagc 37321 atagtgtttt caaggttcat ctaggttggg gcatgtatca gtacttcatc ccttgttttg 37381 gctgaataat atttcattgt acaaatatat cacattttgc ttatccatct gttggtgaac 37441 atttgagttt ctacctgttg gcttttatga ataatgttga tttgaatgtt tgtgtacaag 37501 tatgaatacc tgttttcagg tctcttgagt atatagttgc taggtcatat agtaactctg 37561 tgtttaacat tttgaggaat tgcccgacta tttaacaagg tatatgtact gttttacacc 37621 agtaacatat gagggttcca atatctccac atccttgaca acacttgtta ctgtcctttt 37681 tattgtagcc atcctagtgg ctatgatgtg gtatctcatt gtggttttga tttgtgtttc 37741 tctgatgctg atgatgttga acatgttttc atctgcttat tggccattta catatatctt 37801 cttaagaacg gttacccatt tacagtatgg aaaatgcttc agatgcaact ctagtcatgc 37861 cttagagatg gagctttatt aaacattcag atctctaggc atatgaagtg ctgagttctc 37921 ttgaactcct aatacagatt gcactgagtt tagtgatacc ttttctggag cattcctgag 37981 ttcaggtagg gagaagggtt tttgctgtga ttggcttgtt atgttctttc taaatggaaa 38041 tagaattgaa gtgtctcctc tctccattta ttggaagagt catgagggac ataattagat 38101 gatcccttgg agtctccggc ttaggtcagt ggttatctac ttaggctgca cattggaatc 38161 acctgagagt taaaaaacca ggataacctc tgcctgtgtc tcatctccag caattctgat 38221 gtaattggtc aggctgtggc ccgagtaggt gagttctggt tttttaaagc tcccaggtga 38281 ttctgatgtg caatccaggt tgagatcact ttgggccctt tccagctctt taaacatata 38341 tatttatcta ggaaggtatg aaagcataag ttttcttgag actgccttta acatctgtaa 38401 aggctttcaa 
agcagcttct gtagtttttt ttaaatggct gaatattttt caacaggcag 38461 catttgggtt ataaaattag cttttggtag agttgactta taccacctcc agcttttgtt 38521 ccaaaaataa atactggttc ttttggcaca ctagttgttt taccctaaag ttcctctttg 38581 taagccagtt attaaaagtt gtgatgcagc cagggcgaag tggtacacat ctgtagtccc 38641 agctactcgg aaggctgagg ggggaggatc gctagagccc aagaagtcaa ggctgcagtg 38701 aactgtgatt acaccactgc actgcagcct gggccacaga gcgagactca tctctttaaa 38761 aaaagaatgt tgtgaggccg ggcgcagtgc tcacgcctgt gatcccagca ctttgggagg 38821 ccgaggtgga cggatcacct gaggttggga gttcgagacc agcctgacca acatggagaa 38881 accctgtctc tactaaaaaa aatacaaaat tagccgggcg tggtggcaca tgcctgtagt 38941 cccagctact cggcaggctg aggcaggaga atcgcttgaa cctgggaggc agaggttgtg 39001 gtgagttggg cgagccattg cactccagcc tgggcaacaa gagcaaaact ccatctcaaa 39061 aaaaagaaaa gaaaagaaaa gaatgttgtg gccaggcgcg gtggcttacg cctgtaattt 39121 cagcactttg ggagaccgag gtgggcggat cacgaggtca ggagatcaag accatcctgg 39181 ctaacacagt aaaaccccat ctctactaaa tacaaaaaaa aattagccgg gagtgctggc 39241 gggtgcctgt agtcccagct actcaggagg ctgaggcggg agaatggcgt gaacccagga 39301 ggcagagctt gcagtgagcg gagatcgcgc cactgcactc cagcctgggc aacagagcga 39361 gattccgtct aaaaaaaaaa aaaaagaatg ttgtgataaa aggtgatgct cacctctccc 39421 acaccctttt atagtttagg gattgtattt ccaaggtttc tagactgaga gcccttttca 39481 tctttgctca ttgacactct gtacccatta atcctcctta ttagctcccc ttcaatggac 39541 acatgggtag tcagggtgca ggtctcagaa ctgtccttca ggttccaggt gatcaaccaa 39601 gtgccttgtc tgtagtgtca actcattgct gccccttcct agtaatcccc ataatttagc 39661 tctccatttc atagtctttc cttgggtgtg ttaaaagtga ccatggtaca ctcagcacgg 39721 atgaaatgaa acagtgttta gaaacgtcag tcttctcttt tgtaatgccc tgtagtctct 39781 ctgtatgtta tatgtcacat tttgtaatta acagcttgct ggtgaaaagg accccacgaa 39841 gtgttggata taagccagac tgtaagtgaa ttactttttt tgtcaatcat ttaaccatct 39901 ttaacctaaa agagttttat gtgaaatggc ttataattgc ttagagaata tttgtagaga 39961 ggcacatttg ccagtattag atttaaaagt gatgttttct ttatctaaat gatgaattat 40021 gattcttttt agttgttgga tttgaaattc cagacaagtt tgttgtagga tatgcccttg 
40081 actataatga atacttcagg gatttgaatg taagtaattg cttctttttc tcactcattt 40141 ttcaaaacac gcataaaaat ttaggaaaga gaattgtttt ctccttccag cacctcataa 40201 tttgaacaga ctgatggttc ccattagtca cataaagctg tagtctagta cagacgtcct 40261 tagaactgga acctggccag gctagggtga cacttcttgt tggctgaaat agttgaacag 40321 ctttaatata caataattgt tgcattatta tttcagatga taaatgtggt cataagtaag 40381 aaataaatga tcgagtttag tcttttaatt cactgtcctt tgaatacctg cctcttactc 40441 tggaggcaga agtcccatgg atgtgtttat gaacatggtt gaggaagatt taggaagact 40501 gcaacagtac actacctaaa gcaggttttt tactccatct ttttttgcca cgtacactgg 40561 cctcccactt tgatatgctt gaaattatct ccttgatttg tctttcaaaa ctacatattg 40621 aggctggttg cggtggctca cacctgtaat cctagcactt tgggaggcca agccggacag 40681 atcacttgag gtcaggagtt cgagaccagc ctggcaaaca tgatgaaacc ccacctttac 40741 taaaaataca aaaattagcc aggcgtagtg gtgtgtgcct gtaacccagc tacctgggag 40801 gctgaggcag gagaatcact ggaacccggg aggcagaggc tacagtgagc caacatcacg 40861 ccactgcact ccagcctggg tgacagagca agactctgtc tcaaaacaaa acaaaaaaca 40921 aaaaactacg tattaagaca agaaacagac tgggcgcggt ggctcacgcc tgtaatccca 40981 gcactttggg aggctgaggc gggcggatca caaggtcagg agatcgagac catcctggct 41041 aacacggtga aaccccgtct ctactaaaaa atagaaaaaa ttagctgggg tggtggcggg 41101 cgcctatagt ctcagctact cgggaggctg aggcaggaga atggcgtgaa cccgggaggc 41161 agagcttgca gtgagcagag atcgtgccac tgcactccag tctgggtgac agagcaagac 41221 tccgtctcaa aaaaaaaaac aaaaacaaga aacaaattaa actaatgtga tagactactg 41281 ctttgttttc aaaagataca ctccccaaaa gttactgatc taaatacagt agtactatct 41341 ctgtttagta agaaccctga caactaatag tgttcttata tgtaaaatgc tattcttgcc 41401 tttcatttca gaatatactt tttaaatgtg aatttctgga ttttttttta tagcatgttt 41461 gtgtcattag tgaaactgga aaagcaaaat acaaagccta agatgagagt tcaagttgag 41521 tttggaaaca tctggagtcc tattgacatc gccagtaaaa ttatcaatgt tctagttctg 41581 tggccatctg cttagtagag ctttttgcat gtatcttcta agaattttat ctgttttgta 41641 ctttagaaat gtcagttgct gcattcctaa actgtttatt tgcactatga gcctatagac 41701 tatcagttcc ctttgggcgg attgttgttt aacttgtaaa 
tgaaaaaatt ctcttaaacc 41761 acagcactat tgagtgaaac attgaactca tatctgtaag aaataaagag aagatatatt 41821 agttttttaa ttggtatttt aatttttata tatgcaggaa agaatagaag tgattgaata 41881 ttgttaatta taccaccgtg tgttagaaaa gtaagaagca gtcaattttc acatcaaaga 41941 cagcatctaa gaagttttgt tctgtcctgg aattatttta gtagtgtttc agtaatgttg 42001 actgtatttt ccaacttgtt caaattatta ccagtgaatc tttgtcagca gttccctttt 42061 aaatgcaaat caataaattc ccaaaaattt aactgcttta tgaattcaat ttaaaaatcc 42121 ttaaaataag tcctgtctct ttaaaaaaac ctatgcatag ttatcatttc tctacaaatt 42181 aacctagttt agttttctgt tggttccatt ttccttgttt gttaagtttt agtagctagt 42241 ttaattgtaa tctcaatgat tatgtggtag aatgggttgg cggacgtaca aaaattccta 42301 gctacttcag agacattaaa tttcagacac atggtacact ttatattaca ttttactatg 42361 ctaaaataac acggctttct tttggaattc tgttcagttt ttcagattgt aatctcagct 42421 acatctcaac agattgttct cagatatgtc ctattacctt ctttgtgtag atagtgcttt 42481 attgactaag aacaatgaca acaacacctt ttgttttctg ggaataggag aaaagtttta 42541 agccaaaact cttaattgct tatctgctcc acgtgaggta tgaactatca aacttaggag 42601 ccatctagct tacacgtgtt ccttaaaaag tttgctgtag gccgggcaca gtggctcgta 42661 cctgtagtcc cagcactttt gggagcccag ggtgggggat cacttgagct caggagttca 42721 agaccagcct gggcaacatg gcaaaacgcc atctctacaa aaatacaaaa aaaaaaaaaa 42781 acgctgggtg tggtggcgca caactgtagt cccagctact tgggaggctg aggtgggagg 42841 attgcttgag cttgggaggt gaaggctgca gtgagccttg acagtgccac tacactccag 42901 cctggatgac agagtgagac cctgtctcaa aaaaaagagt ttgctgtaat tcccagcaac 42961 aaagtaggag actcaaacta aataattttc tatagtccta gaacttctta gtttacaaaa 43021 catttttact tctgttatct catttgatct tcatacccat gtaagggttg aggtagatgt 43081 taccacatgt gagtgcaata tccagaactc tgaatccctt cttcccctaa aatgtcagcc 43141 cgctgaggtc cacttggcta ccctcttgaa tactgcatcc agcttcccac tgctgaacct 43201 ctttactctt tttttttcag ttgcacttac cgccttctag taagttgaac catatgaaat 43261 taccattttt gcaggtaaaa aatggccggt gataggcagt ttggcgtcgt ataacccaat 43321 aacatgttat ataatttacc cacaagtggt gggttgctat gtcctggagg agtcagcttc 43381 agactctagc taaatgattg 
tataaccttg cagctctccc ctaagtgagg aggcaatgtt 43441 gaaagtccca tgtcttatca gaaccaggga ggcagatgag aaactgcctt atggcagctc 43501 ccacaacata gggaggtggg tgacaaatgg ccttgggaca gcttcttccc aagactggtt 43561 atgttacagt gttcctggga ggatcacatg gcattcctcc aagatgggtc agactgctgt 43621 tggccttgtc tgtgtggcgt atgtgaagac attcatggcc agagctgttc ccttagaagc 43681 atctactaaa ttgatctttt cctttcttac ttactgtctg tctcccttag taggctgtca 43741 gctccgtgag tgcaggacct tgccagtcct ggtcactgct atatccccag cacctacaag 43801 agtgcctgga aaattgtagt gctcaataaa tatttgttgg ataaatgata gaatgatagg 43861 aagttaaaaa gcaattaaaa tacttgaaaa gaagcaaaac atttttcatg ttaagcaaaa 43921 aaaaaaaaaa aacttattaa ggatagctaa catgtattga attctatatg caatggaatg 43981 atacttagcg cctttgaatc cttatgataa ccctataagg taggttgttt gggttttttt 44041 aattgtccca actttacaga tgaagaagtg caggtccaga gaggtcacat aatttgccca 44101 ggatcacaca gctagtaagt agcagatgag gaatttgaac ccaggcagtt gtattccacc 44161 atctgccctc ttagttcatt gccacttaac ctataatgcc cagctcttgt gtagaaatta 44221 atacactgat aacatagagg aaaacattaa gctcattgaa tgtaataagt ccagatgact 44281 tgtacattaa acacagcttt ttgaggtcac agctgatctc taagaatgta aactgatttc 44341 ctctggcact aaaaagcatt ttcaaagact gttaagagag tttctccaac attctcttca 44401 gatttttctg ctggcttatt ttatgattct gtggacagct tcagacaaaa taactttctg 44461 gtatgaagga ttgtgtttac tctgcttttt tttttgttgt tttttgggtt ttttgttttg 44521 ttttgttttg tttttgagac agtgccttgc tctgctgctg cccaggctga gtgcaatggc 44581 atgatctcgg ctcactgcaa cctctgcctc cctgggctca ggccaggtgt atgctaccac 44641 tctcagctaa tttttaattt atttttttag agatacggtc ccactctgtt tcccaggctg 44701 gtctcagaac tcctgggctc agacagtccc ccgccatggc ctcccacagt gctgggatta 44761 caggcatcag ccaccatgct cagcttgttc tgccattttc aaatgtgaat tttatagaca 44821 ctttaaacca cttgaaagag tgatgatgtt ttaatgattt tcattattat ttgcaacttc 44881 aagcattaaa cactgccaaa ttaagtttca agttttctct ttacacaata tggatgtact 44941 tcataatgga cttcctcatc atgattaatg agtgaagtga cattcaaact tggtagcttt 45001 tcagtagaac ttcctttccc aacatttttt ctgttccttt aattatggca atatctgaga 45061 
gctctgaaca taagtcaaag gtttgattat ttttcatgtg gcttcctctg cttggaactt 45121 tctgccccgc atcttcccgt tgccccctgt gtcctcttgt catgccccta cccttttttg 45181 agtgtgtcta ttttctggca ctacaagaca taacaggctc atcttgtgtt ttccctaccc 45241 tgacccagaa tcagccatta cttcaaggag ccctggttcc attattggag aatactatta 45301 gaaaccagga tctggtgcta ggcatgctca tttctattgg agtgtcatac aaacaatttg 45361 taaattgttt gtaggtcctc ccagtggata ggattaggaa ataaaacatg catactaacc 45421 atgcatacac acacatctac gtctatttct gtatctgtct gtatacatat taaaataaac 45481 atgggttgat aactaatgtt tctgctgtaa tccacagcct tcatcctagc ctgccactct 45541 tcttcttttt agctttttca acagtgggaa atgtggctct tgttatgtac actttattca 45601 cttatttgtt tgaccctagt atcataaagt agttccgtat gcctgtaaca gatcgactaa 45661 ctagagtcca ttatttgcgg aaagatcttt ttgtccgaac gttaccgcag gggtgtccaa 45721 tcttttggct tccctgggcc acactagaaa aagaagaatt gtcttgggcc acacgtaaaa 45781 tacactaaca ctaacgatag ctgataagct aaaaaaaaaa aatcaaaaaa atttcatgat 45841 gttttaagaa agtttactaa tttgtgttgg gccacgttta aagccatccc gggtcgcagg 45901 ttggacaagc ttgccttaca gtatccagtc aaaataatgt tttccaaaat tacttctttt 45961 ctttttcatc cctttcagtg tggccgttat ttataatgca gtttggttca ttagtgtttt 46021 tattacaaat acaccctcag ccttcatatc ctagttttaa tgaattatta cggtgaaaca 46081 taataagagt cagagctata cagaaaggtc tactcagagg tgctttgttc cctcctattc 46141 tgttcccact actcctactt tccactgacc ctgtaagcat catatttatt tttaatggca 46201 gttacatttt taccaagtgc ttactatctg taggcacttg gtgtgtattg cttcttttgg 46261 tgttcacagc aacctcttga ggtaagcact attattatcc cccctttttc ttttttcttt 46321 ctttctttct tttttttttt tttttttttt gacagtctta ctctgttgcc caggcaggag 46381 tgcagtggcg caatctcggc tcactgcaac ctctgcctcc caggttcaag tgattctcct 46441 gcctcagcct cctgagtagc tgcgagtaca ggcacaagcc accacgcccg gctaattttt 46501 gtatttttag tagggatgag gttttgccat gttggccagg gtggtctcga actcctgacc 46561 tcaggtgatc tgcccgcctc gacctcccaa agtgctggga ttacaggcac gaaccactgc 46621 acccggctat tatccccatt ttttagatga gaaagctgaa tcccagagag cataagaagc 46681 ttgtccagag tgacatctct gatgcataac cagtactcaa acctattttt 
ctgacaccaa 46741 ggcctgtgtg taaactgtaa aggggctgct tggcacctac tttcctaaag ttgtcctatc 46801 ccttctctgt ctgggtcttc ctgaagcttg gcacttctga agtcacctct ctgaaaacat 46861 tctggtaact gttaaatccc ttgttctagc tattcatgtg ttctgtgtgg ttaaacaagg 46921 ttcacaatgg ccacctggcc tttggaactt gggtgaagag gctgccttca gttgatcctc 46981 cccactccca ttttcaaaac atgggtttac atgagttatt tgtgaattag gaaacataac 47041 catgttttga gccttcatag aaaacaaacg tctggggtca tacaggttaa aaggagtaac 47101 caaattcggc actatcattg ttctattcag tagacaattc tggggccttt ctgtgtctca 47161 ggttctgtac tagttgtttc aggactttgg gataaataca aactatccct gccctcaggg 47221 ggcttaaggt caggtgtaca agtgactcta atgtgaggca aggctggatt cagtgctgca 47281 tatctaatgc tatgggaatt caaagaggaa gtgatcagaa tgagaaggga gggatggatc 47341 attccaggag aagcttcagg gaaaagcaac atttaaaatg agacttttga gagtgaggga 47401 aatttggaca ggtggatata gaggatgcaa ggctagagga aaggttttag ccagaaagtc 47461 tgcttgggca aatgcctggg taaaaaaaga aaatccactt tgggaggaca aggcgggcaa 47521 tcgcctgagg tcaggagttc gagaccagcc tggccaacat ggtgaaaccc cgtctctact 47581 aaaaatacaa aaattagctg ggcgtggtgg tgggtgcctg taatcccagc tacttggaag 47641 gctgaggcag gagaatcact tgaacccagg aggcagaggt ttcagtgagc cgagattgcg 47701 ccactgcact ccagcctggg caacaagagt gaaacatcta aaaaaaaaaa aaagaaaatc 47761 acagggcagt gtggggaatg gtgagtattc taatttggtt gtggcagaga ggatgtagaa 47821 ggaagtgata agagagaaag ccggatagga gggcctttgt gccagttagg atgttctaga 47881 cttccagcca ggttgcccag ctcaaactgg cttaaacaat gagggggttt attggctatg 47941 taattgggaa gtgcagaggt agctcaggcc agatcagttt gatccactgc tccattatga 48001 tgtcaaagac ccatgcgatt tccacctcat tattctgctg tccatagagc caacttcatc 48061 ctaaggccag tccttgtggt cagacaaggg ctgccaatag taatctgggt gcaagtttct 48121 ttgagaaaat ctttctgtgt caactctctt aaaaggggtg aaaaatctct ccttaagtcc 48181 cactggccag aatgggccca tgcacccatt tcttaaccag tcactggcaa ctgggggtgg 48241 gattgccgtt tgcccaatca ggtccatttc tggagctaag attaaactcc atttcccttg 48301 ggacacattg aacagaatca gaattcgatg aagaaggaag aagcggagaa ttggtttggt 48361 gttgggtagg caaccaaaaa taacctctgt 
tgcctcaagt gccaagaaag tggtgttttg 48421 tgcttgttag ggtaaaaatg gggatcatgg aaaatatttt aagtttcata gaccaaaaaa 48481 tattccagtg tttcatcaaa tctaagaggc tatcaattat aagatatacc attattttat 48541 gtaccaccaa ggaagaaaaa atgctgccag tgaagttagg atgtattgca ggttgggttc 48601 tctgggaagc aggctgaaaa ggaggtgaga atgcaggaca tttatgggag aacacccttg 48661 ggattaatac tggaggagga gaaccaagca gggttggtgg ggcacaggga gaagttggga 48721 tgccatgcag tcacaacaaa ggcctcagcc aaccccacgg ggagctcgag aagctgagat 48781 ggcccttcag tgttgccctg ccttgtggtg agtgaattgg gtcttcatat ccccatgttg 48841 actggtcatt ggatgtgggc tcccttagga atgggcatct cttcagcaga ggtagctttc 48901 ttcaaaagag gtgattccaa agagtcaccc actcactgag ggctgtctgc tggcagcatt 48961 ctcagccact actcaaagat gacctgtcca ggaaggggaa cctaggtggc atgacacatt 49021 gtctattaca acatgctact gattataaga gccgggaggt ggggggcaac acaatgtctg 49081 agatattaaa atggaagtct cttagaagaa atggataatt ctataattat agttaatcag 49141 aaaggggaag aagtggggaa atggaccaag ggcctgagag agaaaacaga cgcaacaggc 49201 cactagaaag ataggacact ggagggtggg aagccctagc agtttcttcc agggtgggct 49261 gggcacggtg gctcattcct gtaatcccag aactttggga ggccgaggcg ggcagatcat 49321 ttgaagtcag gagttggaga ccagcctggc caactcctgt ttcaccctgt ctctgccaaa 49381 aatataaaaa attagccggg tgtggttgca tgcgcctgta atcccagcta cttgggaagc 49441 tgaggcagga gaatcgcttg aacccaggag gcagaggttg cagcgaggaa aaatcgtgcc 49501 actgcacttg agcctgggtg acagagtgag actgtctcaa aaaaaaaaaa agtttcttcc 49561 agggtggctt ctgtgccaga gtcaggtgcc ccagctacct ctaatttatg gtcctcctgc 49621 actgggaaac agattttcta cttttggttt catgataaat aacatttccc cctgatttta 49681 aaagttatgg atttggctgg gcatggtggc tcatgcctgt aatcctagca ctttgggagg 49741 tcaaggcagg cagatcactt aaggtcagga gttccagacc agactgggca acatggtgaa 49801 aacccgtgtc taccaaaaaa aaaaaaaaaa aaaaaattag ccaagtgtgg tggtacatgc 49861 cagtagccct agctactcag gagactgagg tgggaggatt acctgagccc aggagatcag 49921 gcctgcagtg agctgtgatt gtgccatttt actccagcct gggtgacaga gtaagaccct 49981 gtctcaaaaa taatagtaat aggctgggcg cggtggctca agcctgtaat cccaacactt 50041 tgggaggcca 
aggcgggcga tcaattgagg tcaggaactc aagaacagcc ttgccaaaat 50101 ggtgaaactc cgtctctact aaaaatacaa aaatgagccg ggtgtggtgg cgcatgctgc 50161 attcccagct actcaggagg ctgaggcagg agaatcgctt gaactcggga ggcagaggtt 50221 gcagtgagcc gagattgcac cactgcactc cagcctgggt gacagagtga gactccatct 50281 taataataat aaaataataa aaattttaaa aagttatgga tctggatgga gggaaatgga 50341 atgtataaaa gaagtaaaca tacacaagaa gatacaaata cagaataaaa gtaaaatgca 50401 accatcatcc cactaccccg ataccagggt atccgttttt acatcttttc tttcattctt 50461 tctgtcttta tataattgta taaatgctgc ataaacctcc tcttgcctgc tgcctcctca 50521 aagacctccc tccctccttc actgcccttc tgctcctgga gagccaccct ctctccattt 50581 atccttccta tcagcttcag gttcttacca tgttaacaaa aagaaaatct tataagcctg 50641 tcactctcta catacgccgc acctcctttc attcatagcc tttaaaacat atatatagca 50701 gttattgtgg ttatttttct gttcacaaaa taaaaaaaca ctctttctag aaaactggaa 50761 tatagaggca agcttttttt ttttttcaga cggagtttcg ttctgtcgcc ccaggctgga 50821 gtgcagtaac gaaattacag cttactgtaa cctctgcctc ctgggttcaa gatattctct 50881 tgcctcagcc tcctgagtag ctgggattat aggtgcctgc caccacaccc ggctaatttt 50941 tgtattttta gtggaaatgg ggtttcgcca tgttggtcag gctggtctcg aactcctgac 51001 cttgtgatct gcccatctcg gccttccaaa gtgctgggat tacaggtgtg agccactgca 51061 ccctgccgag gcaagatttt tttttttttt tttaagaaaa cccagttatt ccattaccca 51121 atgaaactct aaacatgttg atgtacatcc ttccaaaatt tctttttatg acaacatgct 51181 ttttattttt aattattttt attttatttt aaggtccggg gtacatgtga aggatgtgca 51241 ggtttgttac ataggtaaac gtgtgccttg gtggtttgct gcaccctgtc aacccatcac 51301 ctacgtatta agccccacat gcattagcta ttgatcctga tgctctctct ccctgctggc 51361 tccccagcag gccccggtgt gtgttgttcc cctccctgtt tatgagaaca ctttcttgac 51421 ataaagattt catttattcc catggaattc taaaggcttt tcatacttgt gaaggaataa 51481 tagtttagaa ataaactgaa ctttaaaaga taccattttg aaaaataata tacagccatc 51541 aaaaattata tttatgggaa ctatgcaata atattaaact ctatcatctg ttgactgcct 51601 cctatattcc agaaacttta catacaccaa ttctaatcct tacaagaacg ctgtgtaggc 51661 tttagcatta gatggaccag gtttcaccaa ctgtatggtc ttggataagt acccaacctc 
51721 ctgtccctaa gtttcctcac ctgtgaaaac acggtttcta ccagctttca aataagatga 51781 tcaatataag gcacttggaa cagaacctga cacatcataa gcactctata aatgtctatt 51841 atcaccaaat aattccaggt gccttgaaaa tttaaatgaa aaacaaaatc aaaccatgac 51901 aatactagaa gcaaatttag gtgaacactt ttctaatccg ggggtgggcg ggggctgggg 51961 ggaggcaggg agaagacctt ttttttttct ttttgagatg gagtcttgct ctgtccccaa 52021 gctggagtgc agaggcgtga tctcagctca ctgcaacctc tgcctcctgg attcaagtga 52081 ttctcctgcc tcagcctccc gagtagctgg gactatacag gtgcacacca ccacggccag 52141 ctaatttttg tatttttagt agagatgggg tttacaccct gttagccagg atggtctcaa 52201 tttcttgacc tcgtgatccc atccgcgttg gccttccaaa gtgctgggat taccagcatg 52261 agccaccgtg cccggctggg agaagacctt tctaagcatg ataccaaagg cagagacaat 52321 aaaggcaaag aattgacaga attcactatc cgataaaaat cacttctgtg gccgggcgcg 52381 gtggctcaca cctgtaatcc cagcactggg aagccgaggt gggcggattg cttgaggcca 52441 ggagttcaag accagcctgg ccaacatggc aaacctcctg tctctactaa aaatacaaaa 52501 aattagctag gcatggtggc atgcctgtag tcccagctac tcaggaagct gaggcatgag 52561 aatcacttga acctgggagg tagaggttgc agtgagccaa gatcatgcca ctgcactcca 52621 acctgggtga caaagtgaga ctctgtctca aaaaaaataa caattaaaat aaaatcactt 52681 ctgaatggtg gaaagcacca caaagttaga ggtcaagcaa taatttggag aaaagaatta 52741 gtaatttgtt ggacagacaa aagacttttt taatataaca aaaactttaa aaattaaaaa 52801 aatacacatt cgaggacatt ttcctaaaaa cacaggcaaa ggacataaac agcaaagcaa 52861 gaagacagct tgatgtggcc attttatcca gggggacatt ttggtgagcc ctatggacac 52921 agctgccatg atgccaacaa tgtgacagct gtccccttca aaatgcgtta gccccagctc 52981 ttcctctccc ccaacctcca gtccaaagga cttgcacttt ctactttact cctttctgca 53041 ttgtttaatt ttcttttaca aatatgttac ttgtcatcag aaaaaataaa gaaataaata 53101 aactgttaga gtgttagccc cttaaagggg agcaagaatc acctttctaa aagaaagttt 53161 atgttaaata taatattagc atatgtgaat cctgagagaa aagttaacag tttagttgag 53221 ttatttcctc tgtagtctgg agctaaaaat agggaatctt attctgtcct aaatcttttc 53281 cttcctccac ccagtgtctg tctggatcga attcattcat tcactcagta ggcactcact 53341 cagccaggca tggtgctagg cctcaggacc tcgctgtgaa 
ccagaaactg tccctacccc 53401 catggtgcag gcattctgct tgggagttgg aggaggaaca ggtaaaaaat aattaaatat 53461 tcaggttaac gatatattgt caggtttgag gattgaggaa agggcgcaga gagtggcaag 53521 ggctgctgtt tagatacagt ggccaggagg ctccgatgag gtgacctttg aggagagaca 53581 tgcaggagat gaggggacag tgaagaggat ttctaagaac actccaggca gacagaacag 53641 cgacagccaa ggccctgaag tgggtagggg cctggtgtgt gtgaggaacc tcaggattgc 53701 catcatggct ggagcagaga catgaagcaa gaaggccatg gagatgaggg cagggagatc 53761 ccggagtggg gagatcagat ggggctctgt gtatcatgca aaggactttg cattctgttc 53821 caagagctgg gaaggttgac ataattagga aaaaagccca gaaaagcaga ggtatccatt 53881 tttcatggta aagatgataa tttcaattaa aacacgattc ctggatatat gtaatttgta 53941 ggccaaatgg tgcccaatcc ctacctccct caccccctca cttccctatc cctaaaacct 54001 gtacctcaac tcccgttcgt aagtgatggg agttaggaat agagaaatct cccggttggg 54061 ttttctgagc aaagaggtaa catagcagct ctgttatttc tttcacgtct ccaagggaac 54121 catgactcac ccttagctat cccccgggaa tgtggccctc agagtgttct tttactgatt 54181 cgtgattttg ttatgtacac ctggagtgat ggaacatacc ataccagctt gtcagggttg 54241 ctttgtgcaa agatcgatga cgtgtgtgaa cccggatcca tgcttggggt cctgagtttc 54301 aggtgccatg gccagttgct agcaggttgt atgtgtgtga ccagccccta tgtgagtctc 54361 tcagaccctg aaactccaaa caggcttccc tgggcagaga cattctgtcc atgctctgtg 54421 gcttgctgct cgagagggat agatcacatc ctgtgtggct tcttcttaaa tgaagaagga 54481 cattggaagc ctgtgctggg cttctctgga ccccccgatg tatatgtatg tatattaaag 54541 agagaccagg gtctcactct gttggccagg ctggtcttga actgctagcc tcaagaaatc 54601 ctcccgcttt ggcctcccaa agtgctggga ttacaggcat gagtcaccat gcctgatgta 54661 tatatttttc cagctccctt cttttctgta tcatttgcta ttactacctc ttagctatta 54721 gtataaactg atcttgagtt gtgtaaatct ttctggtgat tcactgtgat gggatgattg 54781 tgtcctctca aaattcctat gttggagtcc tgacccatgg tacctcagaa agtgactgta 54841 tttgaagata ggtctttaaa gaggtcattg taaattaatt aataaggtca ttagggtgga 54901 ctctaatccg atatgactgg tatccttata agaaaaggaa attagcacac agacacacaa 54961 tcagagggag aagacagcca gtcatctaca agccgaggag acagacctca gaagaaacca 55021 accctgcctg caccttgatc 
ttggacttct agtcgccaga actgtgagaa aacaaatctc 55081 atgtttaagc cagaacctag cacgtggtac ttgttaaggc atccctagaa aactaataca 55141 ctcactgaat gaggcaggta gctgtttctt ttattttttg agacagagtc tcactttgtc 55201 tccaaggcca gagtgcagtg gagcgatcac agctcactgc agcccctgcc ttccaggctc 55261 aagccatcct cccacctcag cttctcaagt agctgggact acaggcatgc accaccacgc 55321 ccagctaatt tttgtatttt tttttttttt ttgtagagac ggggttcacc gtgttgccta 55381 ggctggtctc aaacccctga gctcaagcaa tctgccctcc ttggcctccc aaagtgttgg 55441 atttacaggc gtgagccact gtgcctggat atggtaactt tttcatatgc tatttgcttg 55501 atgattattt ttctgtttct gatataatgc tttttattag agagttatct gtttgttttt 55561 attttttaat gtttgaattt aaaaaattag tataatttgc ataattgaaa aattatattt 55621 gaataattga aatatatttg tataacctta aatttaaaaa ctatgatagc gtatacagtg 55681 aaattttcct ctcatccctt ttttccattt aaccagtgca cttcccaaca gccaacagat 55741 aattttagtt tcctcactcc ctgagctatt ttatgtatat gcaagtagat atgtacatac 55801 atatttctgc cttgtaacac aaatagtagc atactataca actgctctgc ttcttccttt 55861 ttttagctaa gaatattaaa agagtgaaaa agatgtacgc taacaaaaat caaaagaaaa 55921 ctagagtgac attataagaa ctgatgatgt agatttcaga gcaatgatta ctgctaggaa 55981 aaaagggtca ttttacattg atcaaagagg tcaactcatc aggaagacat aataatccta 56041 aacacttatg tacttaacag agcatcaaaa tacatgaagc ataaatgaaa gaaccgtggg 56101 agaaagtaga caaattaatg actgtagttg aagatttcag tatccctcta tgaaaatcag 56161 ggtagtacaa gtacacagaa aattggtaaa gatatatgac ttgaacaaca ttatcaacca 56221 aattgacctc atttacattt gtggaatgtt ccaactaaga acgtcagaaa acatactctt 56281 ttcaagtgca catggaacat ttaccaagat agacaatatt ttgggtcacc gcaagtctca 56341 acacattgaa aggattcaga tcatataaag tatgctccat gaccatgatg gaattgaatt 56401 agaaaccaat aatgtatctc tggaaaatac acaaatattt ggaaattaat atgcccttct 56461 aaaaaattta tgcatcaaga agaaatcaaa aagggatatt tgaaaagtac tatgaaactg 56521 atggccaggc atggtgctca tcgcctgtaa tcccagcact ttgggaggcc gagaaagatg 56581 gatgaagtca ggagttcaag accagcctgg gcaacatggc agaaccccgt ctctactaaa 56641 aatacaaaaa attagccggg cgtggtggtg ggcgcctgta atcccagcag tccacgtgtc 56701 
gccgcccctg gtgatggacc agcggggctt cgacga
//
LOCUS       CHTMOMPCT    1407 bp ds-DNA             BCT       24-JUL-1990
DEFINITION  C.trachomatis major outer membrane protein gene, complete cds.
ACCESSION   J03813
KEYWORDS    major outer membrane protein.
SOURCE      C.trachomatis (serotype A) DNA.
  ORGANISM  Chlamydia trachomatis
            Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Rickettsias and
            Chlamydias; Chlamydiales; Chlamydiaceae.
REFERENCE   1  (sites; amino acid sequences)
  AUTHORS   Baehr,W., Zhang,Y.-X., Joseph,T., Su,H., Nano,F.E., Everett,K.D.E.
            and Caldwell,H.D.
  TITLE     Mapping antigenic domains expressed by Chlamydia trachomatis major
            outer membrane protein genes
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 85, 4000-4004 (1988)
  STANDARD  full staff_entry
REFERENCE   2  (bases 1 to 1407)
  AUTHORS   Baehr,W.
  JOURNAL   Unpublished (1988) Rocky Mountain laboratories, Hamilton, MT 59840
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence for [2] kindly provided
            by W.Baehr, 04-MAY-1988
FEATURES       from  to/span    description
    pept        169     1359    major outer membrane protein
BASE COUNT      403 a    283 c    311 g    410 t
ORIGIN      1 bp upstream of XhoI site.
        1 ctcgagcatt gaacgacatg ttctcgatta aggctgcttt tacttgcaag acattcctca
       61 ggccattaat tgctacagga catcttgtct ggctttaact aggacacagt gccgccagaa
      121 aaagatagcg agcacaaaga gagctaatta tacaatttag aggtaagaat gaaaaaactc
      181 ttgaaatcgg tattagtatt tgccgctttg agttctgctt cctccttgca agctctgcct
      241 gtggggaatc ctgctgaacc aagccttatg atcgacggaa ttctgtggga aggtttcggc
      301 ggagatcctt gcgatccttg caccacttgg tgtgacgcta tcagcatgcg tatgggttac
      361 tatggtgact ttgttttcga ccgtgttttg aaaacagatg tgaataaaga atttcagatg
      421 ggagcggcgc ctactaccag cgatgtagca ggcttagaaa aggatccagt agcaaatgtt
      481 gctcgcccaa atcccgctta tggcaaacac atgcaagatg ctgaaatgtt tacgaacgct
      541 gcttacatgg cattaaatat ctgggatcgt tttgatgtat tttgtacatt gggagcaact
      601 accggttatt taaaaggaaa ctccgcttcc ttcaacttag ttggattatt cggaacaaaa
      661 acacaatctt ctggctttga tacagcgaat attgttccta acactgcttt gaatcaagct
      721 gtggttgagc tttatacaga cactaccttt gcttggagcg taggtgctcg tgcagctctc
      781 tgggaatgtg ggtgtgcaac gttaggagct tctttccaat atgctcaatc taaacctaaa
      841 gtagaagagt tgaatgttct ttgtaatgca tccgaattta ctattaataa gccgaaagga
      901 tatgttgggg cggaatttcc acttgatatt accgcaggaa cagaagctgc gacaggaact
      961 aaggatgcct ctattgacta ccatgagtgg caagcaagtt tagccctttc ttacagatta
     1021 aatatgttca ctccttacat tggagttaaa tggtctagag taagttttga tgccgacacg
     1081 atccgtatcg ctcagcctaa attggctaaa ccagtcttgg ataccactac tctaaacccg
     1141 accatcgctg gtaaaggaac tgtggtctct tccgcagaaa acgaactggc tgatacaatg
     1201 caaatcgttt ccttgcagtt gaacaagatg aaatctagaa aatcttgcgg tattgcagta
     1261 ggaacaactg ttgtagatgc agataaatac gcagttacaa ttgagactcg cttgatcgat
     1321 gagagagcag ctcacgtaaa tgcacaattc cgcttctaat attctaataa ttgtataatt
     1381 atattagaat cttttcttag taagctt
//
LOCUS       DROD1P       1734 bp ss-mRNA            INV       24-JUL-1990
DEFINITION  D.melanogaster chromosomal protein D1 mRNA, complete cds.
ACCESSION   J04725
KEYWORDS    chromosomal protein.
SOURCE      D.melanogaster (strain Oregon R) embryo, cDNA to mRNA, (library of
            T.-S.Hsieh), clones D1-[9,9.14,15].
  ORGANISM  Drosophila melanogaster
            Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta;
            Pterygota; Neoptera; Holometabola; Diptera; Brachycera;
            Cyclorrhapha; Schizophora; Drosophiloidea; Drosophilidae.
REFERENCE   1  (bases 1 to 1734)
  AUTHORS   Ashley,C.T., Pendleton,C.G., Jennings,W.W., Saxena,A. and
            Glover,C.V.C.
  TITLE     Isolation and sequencing of cDNA clones encoding Drosophila
            chromosomal protein D1: A repeating motif in proteins which
            recognize AT DNA
  JOURNAL   J. Biol. Chem. 264, 8394-8401 (1989)
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly provided
            by C.V.C.Glover, 29-MAR-1989.
FEATURES       from  to/span    description
    pept        228     1295    chromosomal protein D1
    mRNA  <       1     1734    D1 mRNA
    signal     1599     1604    polyadenylation signal
    signal     1717     1722    polyadenylation signal
BASE COUNT      558 a    410 c    462 g    304 t
ORIGIN      Map position 85D1-2; polytene band.
        1 acatcactac tttgaactgt actagcgtgc gcgtatttta caattacaaa ttgagaaggc
       61 aaacttgatt ttcggcaaac aaaaaggtga agaagaagag cggagccaag cgaaaagagt
      121 gagcgtgtgt tcgtgagaga gagcatttgt gcgcgtgtgt gtgtgagggg acgcgaagaa
      181 acaaggattt tttttattcg cttgctaatt gaatcccgta cagagaaatg gaggaagttg
      241 cggtaaagaa gcgcggccga ccgtcaaagg catccgtcgg tggtaaatct tcaacagcgg
      301 cggttgcagc aatctcgcca ggcatcaaaa agcgcggtcg tcccgccaag aacaagggct
      361 ccagtggcgg tggtggacag cggggtcgcc caccgaaggc gtcaaaaatc caaaatgacg
      421 aagatcccga agacgagggg gaggaggacg gcgatggcga tggctccggc gctgagcttg
      481 caaacaactc atcaccctcg ccgacgaagg gcaggggacg acccaagagc agtggtggcg
      541 ccggatcagg atccggggat tcggtgaaga cgcccggttc cgccaagaag cgcaaggccg
      601 gcaggcccaa gaagcaccag cccagcgata gcgagaatga agacgatcag gatgaggacg
      661 acgacggcaa cagcagcatc gaagaacgtc gcccagtggg ccgaccgtcg gctggatccg
      721 tcaacctgaa catatcgcgc acgggacgcg gcctgggaag acccaagaag cgggctgtcg
      781 agtccaatgg cgatggggag ccacaagtgc ccaagaaacg cggtcgtcca ccacagaaca
      841 agtcgggtag cggtggcagt actggctacg tacccaccgg tcgtccgcgc ggtcgtccaa
      901 aggccaatgc tgcacccgtc gagaagcacg aggacaatga cgacgatcag
gacgatgaga
      961 actccgggga ggaggaacac agctcgcctg agaagacggt agtggcgccg aagaaacgtg
     1021 gacgtccctc tcttgccgct ggcaaggtgt ccaaagagga aacaacaaag ccccgtagtc
     1081 gcccagcgaa aaatattgat gacgatgcag atgatgctga ttccgccgat cagggccaac
     1141 acaactccaa gaaggagtcg aacgacgagg accgtgccgt ggatgggaca ccaaccaaag
     1201 gagatggcct caaatggaac tccgatggcg aaaatgatgc caacgacggt tatgtttcgg
     1261 ataactacaa cgattccgaa tcggtagctg cctaaaacat acaatttata aacatttcca
     1321 cacacccctc cccaaaaatc aacacaaaaa aaaaaagaat gttgtaaaag caaaaaaaaa
     1381 aaaaacgaaa aaaaatctgc aagtaagaga agaaaacaca aaaatggata ggacctactt
     1441 ggactttaga gaaccttccc ctcctaaaac aacacacttt gtctacatat atattccgat
     1501 ttattgctga gaaatcggac aaaccgttta caactgcact gcactgtaat gacgacacat
     1561 agctcggaaa acgatttcaa cttggattga aattaccaaa taaataacac ataaataaca
     1621 tatttgggga agatttcaag ttccgaagta gaactgaact ttattgtaac aacaaatttg
     1681 atttgattgc agaaatttca atcgtataca agtttaaata aacctactat acct
//
LOCUS       ECOESTA2A     549 bp ds-DNA             BCT       24-JUL-1990
DEFINITION  E.coli estA2 gene encoding heat-stable enterotoxin A2 (STa2)
            protein, complete cds.
ACCESSION   M18345
KEYWORDS    STa2 protein; estA2 gene; heat-stable enterotoxin A2.
SOURCE      E.coli (strain CRL 25090; isolated from human in Bangladesh) HB101
            pGK22 cell line DNA, clones 2,12,21, and 27.
  ORGANISM  Escherichia coli
            Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Facultatively
            anaerobic rods; Enterobacteriaceae.
REFERENCE   1  (bases 1 to 549)
  AUTHORS   Guzman-Verduzio,L.M. and Kupersztoch,Y.M.
  TITLE     Rectification of two Escherichia coli heat-stable enterotoxin
            allele sequences and lack of biological effect of the change of
            the carboxyterminal tyrosine for histidine
  JOURNAL   Infect. Immun. 57, 645-648 (1989)
  STANDARD  simple staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by Y.M.Kupersztoch, 08-NOV-1988.
FEATURES       from  to/span    description
    pept        149      367    heat-stable enterotoxin A2 precursor
    sigp        149      205    heat-stable enterotoxin A2 signal peptide
    matp        308      364    heat-stable enterotoxin A2
    mRNA        132      549    heat-stable enterotoxin A2 mRNA
BASE COUNT      182 a     93 c    115 g    159 t
ORIGIN      Unreported.
        1 gaattccccg ccctaaaaca taatattatt atgctcttcg tagcggagag tatagtatga
       61 tacacatcac aaaaaaaaaa taaaaaagtt tgcgcaatcg ttctgatttt gatttaaata
      121 ttcgtggacg acgtgtttcg gaggtaatat gaaaaaatca atattattta tttttctttc
      181 tgtattatct ttttcacctt tcgctcagga tgctaaacca gcagggtctt caaaagaaaa
      241 aattacacta gaatcgaaaa aatgtaacat tgtaaaaaaa aataatgaaa gtagtcctga
      301 aagcatgaat agtagcaatt actgctgtga attgtgttgt aatcctgctt gtaccgggtg
      361 ctattaataa tataaaggga actaaacagt tccctttata tttgtgtgcg ccgtggctgg
      421 cgctgttctt caactgtgga ggctgaagaa cgactaagag gtgaaagtcc tccacacacc
      481 cggtgagggg aagtgttagc ggaaggcaag gtgatcctac ccacgtaata tggacacagg
      541 ggaattcgt
//
LOCUS       ECOESTA3A     403 bp ds-DNA             BCT       24-JUL-1990
DEFINITION  E.coli estA3 gene encoding heat-stable enterotoxin A3 (STa3)
            protein, complete cds.
ACCESSION   M18346
KEYWORDS    STa3 protein; estA3 gene; heat-stable enterotoxin A3.
SOURCE      E.coli (strain 153837-2; isolated from human in Bangladesh) HB101
            pYK117 cell line DNA, clones 5 and 7.
  ORGANISM  Escherichia coli
            Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Facultatively
            anaerobic rods; Enterobacteriaceae.
REFERENCE   1  (bases 1 to 403)
  AUTHORS   Guzman-Verduzio,L.M. and Kupersztoch,Y.M.
  TITLE     Rectification of two Escherichia coli heat-stable enterotoxin
            allele sequences and lack of biological effect of the change of
            the carboxyterminal tyrosine for histidine
  JOURNAL   Infect. Immun. 57, 645-648 (1989)
  STANDARD  simple staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by Y.M.Kupersztoch, 08-NOV-1988.
FEATURES       from  to/span    description
    pept        147      365    heat-stable enterotoxin A3 precursor
    sigp        147      203    heat-stable enterotoxin A3 signal peptide
    matp        204      362    heat-stable enterotoxin A3
    mRNA        130  >   403    heat-stable enterotoxin A3 mRNA
BASE COUNT      139 a     61 c     72 g    131 t
ORIGIN      Unreported.
        1 ttttcggtcg ccgaaaaaga taatattact atgctcttcg tagcggagag tatagtatga
       61 tgttcatcac aaaaaaaata aaaaagtttg cgcaaccgtt ctgattttga ttcaaatgtt
      121 cgtggatgcc atgttccgga ggtaatatga agaaatcaat attatttatt tttctttctg
      181 tattgtcttt ttcacctttc gctcaggatg ctaaaccagt agagtcttca aaagaaaaaa
      241 tcacactaga atcaaaaaaa tgtaacattg caaaaaaaag taataaaagt ggtcctgaaa
      301 gcatgaatag tagcaattac tgctgtgaat tgtgttgtaa tcctgcttgt accgggtgct
      361 attaataata taaagggaac taaacagttc cctttatatt tgt
//
LOCUS       HIV2ISY      9636 bp ss-RNA             VRL       24-JUL-1990
DEFINITION  Human immunodeficiency virus type 2, isolate SBLISY, complete
            genome.
ACCESSION   J04498
KEYWORDS    .
SOURCE      Human immunodeficiency virus type 2 (HIV-2) proviral DNA, isolate
            SBLISY.
  ORGANISM  Human immunodeficiency virus type 2
            Viridae; ss-RNA enveloped viruses; Positive strand RNA virus;
            Retroviridae; Lentivirinae.
REFERENCE   1  (bases 1 to 9636)
  AUTHORS   Franchini,G., Fargnoli,K.A., Giombini,F., Jagodzinski,L., De
            Rossi,A., Bosch,M., Biberfeld,G., Fenyo,A.M., Albert,J., Gallo,R.C.
            and Wong-Staal,F.
  TITLE     Molecular and biological characterization of a replication
            competent human immunodeficiency type 2 (HIV-2) proviral clone
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 86, 2433-2437 (1989)
  STANDARD  full staff_review
COMMENT     Clean copy of sequence [1] kindly provided by G. Franchini,
            30-JAN-1989.
FEATURES       from  to/span    description
    pept        547     2109    gag polyprotein
    pept  <    1827     4934    pol polyprotein (NH2 terminus uncertain; AA at
                                1827)
    pept       4864     5511    vif protein
    pept       5339     5677    vpx protein
    pept       5677     5994    vpr protein
    pept       5840     6135    tat, exon 2 (first expressed exon)
               8281     8377    tat, exon 3 (AA at 8282)
    pept       6066     6135    rev, exon 2 (first expressed exon)
               8281     8717    rev, exon 3 (AA at 8283)
    pept       6145     8685    envelope polyprotein
    pept       8519     9289    nef protein
    pre-msg       1     9636    genomic mRNA
    pre-msg       1     9636    tat, rev, nef subgenomic mRNA
    IVS        6136     8280    tat cds intron 2
    IVS        6136     8280    rev cds intron 2
    IVS        6136     8280    tat, rev, nef subgenomic mRNA intron 2
    LTR   <       1      300    5' LTR
    LTR        8904  >  9636    3' LTR
    rpt           1      174    R repeat 5' copy
    rpt        9457     9636    R repeat 3' copy
    binding     304      320    primer (Lys-tRNA) binding site
    signal     9609     9614    mRNA polyadenylation signal
BASE COUNT     3291 a   1940 c   2396 g   2009 t
ORIGIN
        1 agtcgctctg cggagaggct ggcagattga gccctgggag gttctctcca gcactagcag
       61 gtagagcctg ggtgttccct gctggactct caccagtgct tggccggcgc tgggcagacg
      121 gctccacgct tgcttgctta aaagacctct taataaagct gccagttaga agcaagttaa
      181 gtgtgtgttc ccatctctcc tagtcgccgc ctggtcattc ggtgttctcc tgagtaacaa
      241 gaccctggtc tgttaggacc cttcttgctt tgggaaaccg aggcaggaaa atccctagca
      301 ggttggcgcc cgaacaggga cttgaggaag actgagaagc cttggaacac ggctgagtga
      361 aggcagtaag ggcggcagga acaaaccacg acggagtgct cctagaaagg cgcaggccaa
      421 ggtaccaaag gcggcgtgtg gagcgggagt caagaggcct ccgggtgaag gtaagtacct
      481 acaccaaaaa ctgtagccgg aaaaggcttg ttatcctacc tttagacagg tagaagattg
      541 tgggagatgg gcgcgaaaaa ctccgtcttg agagggaaaa aggcagatga attagaaaaa
      601 attaggttac ggcccggcgg gaagaaaaaa tacagattaa aacatattgt gtgggcagcg
      661 aatgaattgg acagattcgg attaacagag agcctgttgg agtcaaaaga aggttgccaa
      721 aaaattattt cagttttaga accattagta ccaacagggt cagaaaattt aaaaagcctt
      781 tataatacta cctgcgtcat ttggtgcttg cacgcagaag agaaagtgaa agatactgaa
      841 gaagcaaaaa gaatagtagg gagacatcta gtggcagaaa cagaaactgc agagaaaatg
      901 ccaaatataa gtagaccaac agcaccacct agcgggaaag ggggaaactt ccccgtgcaa
      961
caaataggcg gcaactatgt ccatctgccg ctgagtcccc gaaccctaaa tgcttgggta 1021 aagttagtag aggagaaaaa gttcggggca gaagtagtgc cgggatttca ggcactctca 1081 gaaggctgca cgccctatga tattaatcaa atgcttaatt gtgtgggcga ccatcaagca 1141 gcgatgcaaa taatcagaga aattattaat gaagaagcag cagactggga tgtacaacat 1201 ccaataccag gccccttacc agcggggcag ctcagagatc cacgaggatc tgacatagca 1261 gggacaacaa gcacagtaga ggaacagatc gaatggatgt ataggcaaga aaatcctgta 1321 ccagtaggaa acatctatag gagatggatc cagataggac tgcagaagtg tgtcagaatg 1381 tacaatccaa ccaacattct agacataaaa caaggaccaa aagagtcgtt ccaaagctat 1441 gtggatagat tctacaaaag cttaagggca gaacagacag atgcagcagt gaagaattgg 1501 atgacccaga cgctgctagt gcaatcgaac ccagactgta agttagtact aaagggacta 1561 gggatgaatc ctaccttaga agagatgcta accgcctgtc aagggatagg tggaccaggc 1621 cagaaggcca gactaatggc agaagcctta aaagaggcca tgcgaccagc ccctatccca 1681 tttgcagcag cccaacagaa aagggcaatt aagtgttgga attgtggaaa ggaagggcac 1741 tcggcaagac aatgccgagc gcctagaaga cagggctgct ggaaatgtgg caagtcagga 1801 cacatcatgg caaactgccc agatagacag gctggttttt tagggcttgg accatgggga 1861 aagaagcccc gcaacttccc cgtggtccca agttcgcagg ggctaacacc aacagcaccc 1921 ccaatggatc cagcagtgga cctactggag aagtacatgc agcaagggag aaaacagaga 1981 gagcagagac aaagaccata caaagaagtg acagaggact tgctgcatct cgagcaagga 2041 gagacaccac acagagagac gacagaggac ttgctgcacc tcaattctct ctttggaaac 2101 gaccagtagt cacagcatac attgaggatc agccagtaga agttttacta gacacagggg 2161 ctgacgactc aatagtagca ggaatagagt tagggagcaa ttatagtcca aaaatagtag 2221 ggggaatagg aggattcata aataccaaag aatataaaga tgtagaaata agagtgctaa 2281 ataaaaaggt aagagccacc ataatgacag gtgatacccc aatcaacatt tttggcagaa 2341 atatcctgac agccttgggc atgtcattaa atttaccagt cgccaaaata gaaccagtaa 2401 aagtaacatt aaagccagga aaagatgggc caaaacaaag acaatggccc ttaacaagag 2461 aaaaaataga agcactaaga gaaatctgtg aaaaaatgga aagagaaggt cagctagaag 2521 aagcgcctcc aactaatccc tataataccc ctacatttgc aattaagaaa aaggacaaaa 2581 acaaatggag gatgctgata gattttagag aactaaacaa ggtaactcaa gatttcacag 2641 aggttcagtt 
aggaattcca cacccagcag gattagccaa gaaaagaaga attactgtgt 2701 tagatgtagg agatgcctac ttttccatac ccctatatga ggattttaga cagtatactg 2761 catttactct gccatcagta aacaatgcag aaccaggaaa aagatatata tacaaagtct 2821 taccacaggg atggaagggg tcaccagcaa tttttcaata cacaatgagg caagtcttag 2881 aaccattcag aaaagcaaac ccagatgtca ttatcgttca gtacatggat gatatcttaa 2941 tagctagtga caggacagat ttggaacatg acaaagtagt cctacagcta aaggaacttc 3001 taaatggcct gggattttcc accccagacg aaaagttcca aaaggaccct ccataccaat 3061 ggatgggcta tgaactgtgg ccaaccaaat ggaaattgca aaaaatacaa ttgccccaaa 3121 aggaagtatg gacagttaat gacatccaga aactagtggg tgtcctaaac tgggcggcac 3181 aaatctaccc aggaataaag accaaacact tatgtaagct aattagagga aagatgacac 3241 ccacggaaga agtacagtgg acagaattag cagaagcaga gctggaggaa aacaaaatta 3301 tcttaagcca ggaacaggag ggacactatt accaagagga aaaagagtta gaagcaacag 3361 ttcaaaagga tcaagacaat cagtggacat ataaagtaca ccagggagaa aaaattctaa 3421 aagtaggaaa atatgcaaag ataaaaaata cccataccaa cggggtcagg ttgttggcac 3481 aggtagttca gaaaatagga aaagaagcac tagtcatttg gggacgaata ccaaaatttc 3541 acctaccagt agaaagagag acctgggaac agtggtggga taactattgg caagtgacat 3601 ggatcccaga ctgggacttc gtatccaccc caccgttggt caggttagca tttaacctgg 3661 taaaagatcc tataccaggc gcagagacct tctacacgga tggatcttgc aataggcaat 3721 caaaagaggg aaaagcagga tatataacag atagaggaaa agacaaagta aggatattag 3781 agcaaactac caaccaacaa gcagaattag aagcctttgc aatggcagta acagactcag 3841 gtccaaaagt caatattgta gtagactcac agtatgtaat gggaatagta acaggccaac 3901 cggctgaatc agagagtaga atagtaaata aaattataga agagatgata aaaaaggaag 3961 caatctatgt tgcatgggtc ccggcccaca aaggcatagg aggaaatcaa gaaattgacc 4021 acttagtaag tcagggcatc agacaagtat tattcctaga gagaatagag cccgctcagg 4081 aagaacatgg aaaatatcat agcaatgtaa aagaactagc ccataagttt ggattaccca 4141 acctggtggc aagacaaata gtaaacacat gtgcccagtg ccaacaaaaa ggggaagcta 4201 tacatgggca agtaaatgca gaactaggca cctggcaaat ggactgcaca cacttagaag 4261 gaaaaatcat tatagtagca gtacatgttg caagtggatt tatagaagca gaagtcatcc 4321 cacaggaatc aggaaggcaa 
acagcactct tcctattaaa actggccagt aggtggccaa 4381 taacacactt gcacacagat aatggtgcca acttcacttc acaggaggta aagatggtag 4441 catggtgggt aggcatagaa caatcctttg gagtacctta caatccacaa agccagggag 4501 tagtagaagc aatgaatcac cacctgaaaa atcagataga aagaattaga gagcaggcaa 4561 atacaatgga aacaatagta ctaatggcag ttcattgcat gaattttaaa agaaggggag 4621 gaatagggga tatgacccca gtagaaagac tagtcaatat gatcaccaca gaacaagaaa 4681 tacaattcct ccaagcaaaa aattcaaaat taaaaaattt tcgggtctat ttcagagaag 4741 gcagaaatca actgtggcaa ggacctgggg agctactgtg gaaaggggac ggagcagtca 4801 tagtcaaggt agggacagat ataaaagtaa taccaagaag aaaggccaag atcatcagag 4861 actatggacc aaggcaagag atggatagcg gttcccacct ggagggtgcc agggaggatg 4921 gagaaatggc atagccttat caagtatcta aaatacagaa caaaagatct agaacaggtg 4981 cgctatgttc cccaccataa ggtggggtgg gcatggtgga cttgcagcag ggtaatattc 5041 ccattaaaag gaaacagtca tctagagata caggcatatt ggaacctaac accagaaaaa 5101 ggatggctct cctcttattc agtaagaatg acttggtact cagaagggtt ctggacagat 5161 gttaccccag actgtgcaga caccctaata cacagcactt atttctcttg ctttacggca 5221 ggtgaagtaa gaagagccat caggggagaa aagtcattgt cctgctgcaa ttatccccaa 5281 gcccataagt ccaaggtacc gtcactccaa tttctggcct tagtagtagt acagcaaaat 5341 gacaaacccc agagagacaa taccaccagg aaacagtggc gaagaaacta tcgaagaggc 5401 cttcgattgg ctagacagga cggtagaagc cataaacaga gaggcagtga accacctgcc 5461 cagggagctt attttccagg tgtggcaaag gtcctggaga tactggcatg atgagcaagg 5521 gatgtcacga agctacacaa agtatagata tttgtgctta atgcagaaag ctgtgttcat 5581 gcatttcaag aaagggtgca cttgccgggg ggaaggacat gggccaggag ggtggagatc 5641 aggacctccc cctcctcctc ccccaggttt agtctaatga ctgaagcacc agcagagttt 5701 cccccggagg atgggacccc accgagggaa ccaggggatg agtgggtaat agaaattctg 5761 agagaaataa aggaagaagc tttaaagcat tttgaccctc gcttgctaac tgctcttggc 5821 tactatatct atactagaca tggagacacc cttgaaggcg ccagagagct cattagggtc 5881 ctacaacgag ccctcttcac gcacttcaga gcaggatgtg gccactcaag aattggccaa 5941 ccaaggggaa gaaatcctct ctcagctata ccgaccccta gaaacatgca ataacaaatg 6001 cttttgtaag gggtgctgct tccattgcca 
gctgtgtttt ttaaacaagg ggctcgggat 6061 atgttatgac cgaaagggca gacgaagaag gagtccgaag aaaactaagg ctcattcgtc 6121 tcctgcatca gacaagtgag tacaatgagt ggtaaaattc agctgcttgt tgcctttctg 6181 ctaactagtg cttgcttaat atattgcacc aaatatgtga ctgttttcta tggagtaccc 6241 gtgtggaaaa atgcatccat tcccctcttt tgtgcaacta aaaatagaga tacttgggga 6301 accatacagt gcttgccaga caatgatgat tatcaagaga tacctttgaa tgtaacagag 6361 gcttttgacg catgggataa tatagtaaca gaacaagcag tagaagatgt ctggaatcta 6421 tttgagacat caataaaacc atgtgtcaaa ttaacgcctt tatgtgtaac aatgaactgt 6481 aacgcaagta cagagagcgc agttgcaact acaagcccat ctggacctga tatgataaat 6541 gatactgatc catgcataca attgaacaat tgctcaggac tgagggagga agacatggtc 6601 gagtgtcagt tcaatatgac aggactagag ttagataaga aaaaacagta tagtgaaacc 6661 tggtactcaa aagatgtggt ttgtgaatca gataacagca cagaccgaaa aagatgttac 6721 atgaaccatt gcaacacatc agtcatcaca gagtcatgtg acaagcacta ttgggatgct 6781 atgagattta gatactgtgc accaccgggt tttgtcttgc taaggtgcaa tgataccaat 6841 tactcaggct ttgagcccaa ttgctctaaa gtagtagctt ctacatgtac aagaatgatg 6901 gaaacgcaac cttctacttg gcttggcttt aatggcacta gggcagaaaa tagaacatat 6961 atctattggc atggtaggga taacagaact attatcagct taaacaaata ttataatctc 7021 accatacttt gtaggagacc agaaaataaa acagttgtac caataacact catgtcaggc 7081 cgcagatttc actcccagaa gatcatcaat aaaaaaccca ggcaagcatg gtgccggttc 7141 aaaggcgagt ggagggaagc catgcaggag gtgaaacaaa cccttgtaaa acatcccagg 7201 tataaaggaa ccaatgacac aaataaaatt aactttacag caccagaaaa agactcagac 7261 ccagaagtag catatatgtg gactaactgc agaggagaat tcctctattg caacatgact 7321 tggttcctta attgggtaga aaacaagacg ggtcaacagc ataactatgt gccgtgccat 7381 atagagcaaa taattaatac ctggcataag gtagggaaaa atgtatattt gcctcctagg 7441 gaaggagagt tgtcctgcga atcaacagtg accagtatca ttgctaacat tgatgttgat 7501 ggagataacc ggacaaatat tacctttagt gcagaggtgg cagaactata ccgattggaa 7561 ttgggggatt ataaattagt agaagtaaca ccaattggct tcgcccctac agcagaaaaa 7621 agatactcct ctgctccagg gagacataag agaggtgtgc ttgtgctagg gttcctaggt 7681 tttctcacga cagcaggtgc tgcaatgggg gcggcgtctc 
tgacgctgtc ggctcagtct 7741 cggactttat tccgtgggat agtgcagcaa cagcaacagc tgttggacgt ggtcaagaga 7801 caacaagaaa tgttgcgact gaccgtctgg ggaactaaaa acctccaggc aagagtcact 7861 gctattgaga agtacctagc agaccaggcg cgactaaatt catggggatg tgcgtttaga 7921 caagtctgcc acactactgt accatgggta aatgacacct taacacctga gtggaacaac 7981 atgacatggc aagaatggga acacaaaatc cgcttcctag aggcaaatat cagtgagagt 8041 ttagaacagg cacaaatcca gcaagaaaag aatatgtatg agctgcaaaa gctaaatagc 8101 tgggatgttt ttggcaattg gtttgactta acctcctgga tcaagtatat tcaatatgga 8161 gtcatgatag tagtaggaat agtagctctc agaatagtaa tatatgtagt acaaatgcta 8221 agtagactta gaaagggcta taggcctgtt ttctcttccc cccccggtta tattcaacag 8281 atccatatcc acaaggactg ggaacagcca gacagagaag aaacagaaga agacgttggg 8341 aacgacgttg gaagcagatc ctggccttgg ccgatagaat atatacattt cctgatccgc 8401 ctgctgatcc gcctcttgac cagactatac aacagctgca gggacttact atccagactc 8461 tacctgatcc tccaaccact cagagactgg ctgagactca aggcagccta cctgcagtat 8521 gggtgcgagt ggatccaaga agcgttccag gccctcgcga gggttacaag agagactctt 8581 acgagcgcgg ggaggagctt gtggggggct ctgggacgaa tcggaagggg gatactcgca 8641 gttccacgaa ggatcaggca gggagcagaa attgccctcc tgtgagggac agagatatca 8701 gcaaggagac tttatgaata ccccatggag aaccccagca acagaaaagg aaaaagaatc 8761 gtacaggcaa caaaatatgg atgatgtaga ttcagatgat gatgacctag taggggtctc 8821 tgacacatca agagtaccat tgagagcaat gacatataga atggcagtag acatgtcaga 8881 tttaataaaa gataaggggg gactggaagg gatgtattac agtgagagaa gacatagaat 8941 cctagacata tacttagaaa aggaagaagg gataattcca gattggcaga actatactca 9001 tgggctagga gtaaggtacc caatgttctt tgggtggcta tggaagctag taccagtaac 9061 tgtcccacaa gaaggggagg acactgagac tctctgctta atgcactcag cacaagtaag 9121 cagatttgat gacccgcatg gggagacact agtctggaag tttgacccca tgctggctca 9181 tgagtacacg acctttattc tatacccaga ggaatttggg cacaagtcag gaatggaaga 9241 agatgactgg aaggcaaaac tgaaagcaag agggatacca tttagttaaa aacaggaaca 9301 accatacttg gtcaggacag gaagtagcta ctgaaaacag ctgagactgc agggactttc 9361 cagaaggggc tgtaaccagg ggagggacat gggaggagct ggtggggaac 
gccctcatac 9421 tttctgtata aatgtacccg ctgctcgcat tgtattcagt cgctctgcgg agaggctggc 9481 agattgagcc ctgggaggtt ctctccagca ctagcaggta gagcctgggt gttccctgct 9541 ggactctcac cagtgcttgg ccggcgctgg gcagacggct ccacgcttgc ttgcttaaaa 9601 gacctcttaa taaagctgcc agttagaagc aagtta // LOCUS MUSIGHAAL 411 bp ss-mRNA ROD 24-JUL-1990 DEFINITION Mouse Ig productively rearranged H-chain V-region VDJ mRNA from hybridoma HY2-2C8. ACCESSION M22438 KEYWORDS immunoglobulin; immunoglobulin heavy chain; joining exon; variable region. SOURCE Mouse (strain SM/J) hybridoma HY2-2C8 cell, cDNA to mRNA, clone 1. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 411) AUTHORS Hardy,R.R., Carmack,C.E., Shinton,S.A., Riblet,R.J. and Hayakawa,K. TITLE A single V-H gene is utilized predominantly in anti-BrMRBC hybridomas derived from purified Ly-1 B cells: Definition of the V-H11 family JOURNAL J. Immunol. 142, 3643-3651 (1989) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence [1] kindly submitted by C.E.Carmack 03-FEB-1989. FEATURES from to/span description pept 1 / 411 Ig heavy-chain (V-J-region) precursor sigp 1 57 Ig heavy-chain (V-J-region) signal peptide matp 58 / 411 Ig heavy-chain (V-J-region) recomb 147 148 framework 1 end/CDR 1 start recomb 162 163 CDR 1 end/framework 2 start recomb 204 205 framework 2 end/CDR 2 start recomb 255 256 CDR 2 end/framework 3 start recomb 351 352 framework 3 end/CDR 3 or D-region start recomb 360 361 CDR 3 or D-region end/J-region start BASE COUNT 100 a 86 c 118 g 107 t ORIGIN Chromosome 12. 
1 atggagtggg aactgagctt aattttcatt tttgctcttt taaaagatgt ccagtgtgaa 61 gtgcagctgt tggagactgg aggaggcttg gtgcaacctg gggggtcacg gggactctct 121 tgtgaaggct cagggtttac ttttagtggc ttctggatga gctgggttcg acagacacct 181 gggaagaccc tggagtggat tggagacatt aattctgatg gcagtgcaat aaactacgca 241 ccatccataa aggatcgatt cactatcttc agagacaatg acaagagcac cctgtacctg 301 cagatgagca atgtgcgatc tgaggacaca gccacgtatt tctgtatgag atatgggccc 361 tactggtact tcgatgtctg gggcacaggg accacggtca ccgtctcctc a // LOCUS MUSIGHAAM 411 bp ss-mRNA ROD 24-JUL-1990 DEFINITION Mouse Ig productively rearranged H-chain V-region VDJ mRNA from hybridoma HY1-1H2. ACCESSION M22439 KEYWORDS immunoglobulin; immunoglobulin heavy chain; joining exon; variable region. SOURCE Mouse (strain SM/J) hybridoma HY1-1H2 cell, cDNA to mRNA, clone 1. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 411) AUTHORS Hardy,R.R., Carmack,C.E., Shinton,S.A., Riblet,R.J. and Hayakawa,K. TITLE A single V-H gene is utilized predominantly in anti-BrMRBC hybridomas derived from purified Ly-1 B cells: Definition of the V-H11 family JOURNAL J. Immunol. 142, 3643-3651 (1989) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence [1] kindly submitted by C.E.Carmack 03-FEB-1989. FEATURES from to/span description pept 1 / 411 Ig heavy-chain (V-J-region) precursor sigp 1 54 Ig heavy-chain (V-J-region) signal peptide matp 55 / 411 Ig heavy-chain (V-J-region) recomb 141 142 framework 1 end/CDR 1 start recomb 159 160 CDR 1 end/framework 2 start recomb 204 205 framework 2 end/CDR 2 start recomb 252 253 CDR 2 end/framework 3 start recomb 348 349 framework 3 end/CDR 3 or D-region start recomb 360 361 CDR 3 or D-region end/J-region start BASE COUNT 94 a 115 c 96 g 106 t ORIGIN Chromosome 12. 
1 atgagagtgc tgggattttt gtgcctggtg acagtccttc ctggtagcct gtctcagatg 61 cagcttcagg agtcaggacc tggcctggtg aaaccctcac agtcactctt cctcgcctgc 121 tctattactg gtttccccat caccagtggt tactactgga tctggatccg tcagtcacct 181 gggaaacccc tagaatggat ggggtacatc actcatagtg gggaaacttt ctacaaccca 241 tccctccaga gccccatctc cattactaga gaaacatcca agaaccagtt ctttctgcaa 301 ttgaactctg tgaccacaga ggacacagcc atgtattact gtgcaggaga ctatgatggt 361 tactggtact tcgatgtctg gggcacaggg accacggtca ccgtctcctc a // LOCUS RATCKBR 4360 bp ds-DNA ROD 24-JUL-1990 DEFINITION Rat creatine kinase gene, complete cds. ACCESSION M18668 M26669 KEYWORDS creatine kinase. SOURCE Rat (Sprague Dawley) brain DNA, clone lambda ckb-2. ORGANISM Rattus rattus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 4360) AUTHORS Benfield,P.A., Graf,D., Korolkoff,P.N., Hobson,G. and Pearson,M.L. TITLE Isolation of four rat creatine kinase genes and identification of multiple potential promoter sequences within the rat brain creatine kinase promoter region JOURNAL Gene 63, 227-243 (1988) STANDARD simple staff_review COMMENT Draft entry and computer-readable sequence [1] kindly submitted by P.A.Benfield, 23-MAR-1988. 
FEATURES from to/span description pept 1276 1468 creatine kinase, exon 2 (first expressed exon) 1656 1810 creatine kinase, exon 3 1888 2020 creatine kinase, exon 4 2197 2368 creatine kinase, exon 5 2941 3064 creatine kinase, exon 6 3174 3363 creatine kinase, exon 7 3443 3621 creatine kinase, exon 8 pre-msg 941 3818 creatine kinase mRNA and introns IVS 1015 1254 creatine kinase intron A IVS 1469 1655 creatine kinase intron B IVS 1811 1887 creatine kinase intron C IVS 2021 2196 creatine kinase intron D IVS 2369 2940 creatine kinase intron E IVS 3065 3173 creatine kinase intron F IVS 3364 3442 creatine kinase intron G BASE COUNT 801 a 1314 c 1278 g 967 t ORIGIN 1 aagtactgct aaacctactt tccagagaag atgaaactgc tggcagctgg aggcagggtg 61 tgaacgcctg tgttcacaca ttccaattca cagccggcac tggctgtggt caaattccag 121 gtttccccgt acctgcgtgt gtgggacacg gggtgggggt caaagagact ctttcaatct 181 ggtggatcta ttaatagtta tttttgactg gctgcttcct ccacccccag tttgccctgc 241 cgtaggtact gtctttgtag gtggggtggt agaaaccggt ccctcaggaa agaacctggg 301 gatttgaaga ttcaaaaaca gtctctagga gctcagtgtc ttaatttttc agggtccggg 361 tccaagggtc ttggctaggt tccttagggc ccgcccaagg tcagaacacc tggtcttccg 421 gcggacgagc cagatgcgct ctgtgtggac caagaacacc caggagatgc ccgcaggcac 481 cttgaggaag gtcagctccc ctgtaccgtt ttgtaccgcc tggatgcagt ggatggaagc 541 cgtaccccat gctaaaaacc tggtatctct tgccatgacc ccctcaacac tgctgtcggc 601 ctgacaaggc aggaagcagg gaaggcagga acctcaagtg ctaactggct ttccagactc 661 agtctctcgc cccaggcatg tgtgtgaagc aggtctaggt atctacggtc ctgggatttc 721 gtcttcgaga tcctgagcga gcgccacggg gttgaccaaa ctgcgcagcg gggtcgagac 781 ttggggacca tggggtgggc cgctgggggt gtcgagggtt gttgcctcgg acaaagcggc 841 ggcaccaccc caaagcgcgg gccaatggaa tgaatgggct ataaatagcc gccaatggga 901 ggccggcgac gcgcccctta agagctcagg gagcagcgag cggccgtcgt tcttctgcgt 961 gcgcccggag ctgcaagcac aagcagtctg cgttcctgct ccgtcggaat cccggtgagc 1021 gggtcggagg gtgaggggct agtcctgagg gcgtacgcgg ttgggtgcat cacgcgcagc 1081 tcgttgcccg gagcgcgtcc ggaccgccgg ggaacttggg atgcgccgga ctcaaaagat 
1141 gcagacctcg ctgaccttgg tgacgtccga ggaaagcttc gggggtccgg attggtctgc 1201 agggtctctg gagaccggtg tgtaaagctc ctctgacccc gctcttcccc gcagcctgcc 1261 gccgccgccg ccgccatgcc cttctccaac agccacaaca cgcagaagct gcgcttcccg 1321 gccgaggatg aattccctga tctgagcagc cacaacaacc atatggccaa ggtgctgacc 1381 cccgagctgt acgcggagct ccgtgccaag tgcacgccga gcggctttac gttggacgac 1441 gccatccaga ctggcgtaga caatccgggt atgcacaccc ttgtagcgtc aggctttcgt 1501 ctccccaaca agccccccgg gcaaggatcc cactgctctt ccctgaacct cggcggggtg 1561 gggtctcctg gtcctctctg cgcttagcca taagggcctc agtgccttgc tctagcttgg 1621 cgctgacgtc actgtcgccg cgccccatcc tccaggccac ccgtacatca tgacagtggg 1681 tgcagtggcg ggcgacgagg agagttacga cgtattcaag gaccttttcg accccatcat 1741 cgaggaccgg cacggcggct accagcccag tgatgagcac aagactgacc tcaacccaga 1801 caacctgcag gtgcggggcg tcagggtccg ggcgtgctgg ggagaggggg tctcggcgct 1861 cactccggcc actgcttgca tccctagggc ggcgatgacc tggaccccaa ctacgtgctg 1921 agctcgcggg tgcgcacagg ccgaagcatc cgcggcttct gcctcccccc tcactgcagc 1981 cgtggggagc gccgcgccat cgagaagctg gcagtagaag gtagggtccg gtatcagcca 2041 ccagagctgc tgcgttcttg tctgggcgcg cagcgttccc ccagggtctg agggtggaag 2101 acccaactaa gacagcgggg acactgccgc gggcgggtgg cgagctttct agctcgccca 2161 cccacccacc ggctccttga actctgtact ccgcagccct gtccagccta gatggcgacc 2221 tgtctggcag gtactatgcg ctcaagagca tgaccgaggc ggacgagcag cagctcattg 2281 acgaccactt cctcttcgac aagcctgtgt cgcctctgct gctggcctcc gcgatggccc 2341 gcgactggcc ggatgctcgc ggcatttggt acgagccctc ttccctcaca gtccccggaa 2401 ggtggggcac ggcctgaatt cgtagattgt gcgttaccac actgtgcgca ggtgaccttg 2461 gttctgcgga tgactgaggt ccctgcgccg cgggaggccg cagtctccag ggattcaagg 2521 gtggtgacca gtttctttgg cgtctgttct ccgccctcct cctgggagcc ggcgcttctt 2581 gttttctctc ctccttctca cccccttatt ccgccgggat tctgccaggt gccagtgacg 2641 caaaagcctc cgcacccgtc cggcagggct cctacccctg cagactgcgc gggcgagcgt 2701 gcttcctctg gtgggctgct ctggctgctc cagccccttg cgggccacgc agcacatgac 2761 tggtgattga ggtgccctgg tctccctgcc ttcgagtgtt gctgggtggg cgtcttagag 2821 
acagcccaca ctcctgcact tggaaagcgc tttggaggtc acccacctag ggcgtcctgc 2881 tgggtgtgac ctaactaagg ctgtagactt tgactttgtc tgcccgaccc cctcctccag 2941 gcacaatgac aataagacgt tcctggtgtg gatcaacgag gaggaccacc tgcgggttat 3001 ctccatgcag aaagggggca acatgaagga agttttcacg cgattctgca ctggcctcac 3061 tcaggtctgg cctggtgcta aggagggagg gggtcggggg gatttgagtc cacatttccc 3121 agggtgcttg gggttccagc tacattttcc cttaactttc ccttcacctt cagattgaaa 3181 ctctcttcaa gtctaagaac tatgagttca tgtggaaccc tcacctgggc tacatcctca 3241 cgtgcccatc caacctgggc actgggcttc gggcaggcgt gcacatcaag ctgccccacc 3301 tgggaaagca cgagaagttc tcggaggtgc tcaagcgact gcggcttcag aagcgaggca 3361 caggtgaggg gcagacgact caggtgaagc agcaaacgcg gctttcccca gccctgctga 3421 cctgcctgtc tcccatcccc aggtggtgtg gacaccgctg ctgtgggtgg agtttttgat 3481 gtctccaacg ctgaccgcct gggcttctcg gaggtggagc tggtgcagat ggtggtggac 3541 ggagtgaagc tactcattga gatggagcag cggcttgagc agggtcagcc cattgacgac 3601 ctcatgcctg cccagaagtg aagcctggcc ctagccacca ccaggctgcc gcttcctaac 3661 ttattacccg ggcagtgccc gccatgcatc cttgatgttt gccgcctggc gtgagccctt 3721 agcctcgctg tagagacttc tgtcgccctg ggtagagttt attttttctg atggctaagc 3781 tgttgcagac actgaaataa attagggttt ggcctgccct atgtctgagt gttgcttctc 3841 ctttcttaga gacagtcgtg tgtgtgtgtg tgtgtgtgtg tgtgtgtgtg tgtgtgtgtg 3901 tgtgtgtgtc catccgtcct tcctggcctg tcacttcacc tagcagatga aacatgagtc 3961 ttaagataca gggcagaggg agggagactt gagtgcagcc tgaacatcta aggacatttg 4021 tacttgcagg gtggagccta gtatttcctt agctccagcc agctaaaggg agtcagcaac 4081 ttgacatctt ccccagtttc tttctgccgc attcctggca gcctgaatat ctaaagttct 4141 atctcttcac ttccttcccc tccaggcttc aacacttact ccagttctta aataattctc 4201 tccctacttc ctccagcttc caaccccagg ttgagtgcct cctcctgatc cctcagccca 4261 gctctccagc cccatccaga caagagcttt tcagaaccac ccgtccctct ctcttgcagc 4321 ttcctctctg gggccctcct ccttgggagg agacaagctt // LOCUS RATTHY 797 bp ss-mRNA ROD 24-JUL-1990 DEFINITION Rat prothymosin-alpha mRNA, complete cds. ACCESSION M20035 KEYWORDS prothymosin. 
SOURCE Rat (strain Charles River CD, 5 week old) spleen, cDNA to mRNA, clone pRSpro-187. ORGANISM Rattus norvegicus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 797) AUTHORS Frangou-Lazaridis,M., Clinton,M., Goodall,G.J. and Horecker,B.L. TITLE Prothymosin alpha and parathymosin: Amino acid sequences deduced from the cloned rat spleen cDNAs JOURNAL Arch. Biochem. Biophys. 263, 305-310 (1988) STANDARD full staff_entry COMMENT Draft entry and printed copy of sequence for [1] kindly provided by B.L.Horecker, 26-APR-1988. FEATURES from to/span description pept 174 512 prothymosin precursor matp 177 509 prothymosin BASE COUNT 248 a 187 c 219 g 143 t ORIGIN Unreported. 1 gccatctttg catttgttcc cgggtcgtgc tccgcgctca catgcaagcc aacctatcgg 61 ccgacccacc ggtcatcctc caaacgcgga cttccggcta gcttatctcg gccaagagtc 121 ctcgaaactc gacatattct tacgcgtaga ccagaccacc ggcgtgcccc accatgtcag 181 acgcggcagt ggacaccagc tccgagatca ccaccaagga cttgaaggag aagaaggaag 241 ttgtggagga ggcagagaat ggaagagacg cacctgccaa tgggaacgct caaaatgagg 301 aaaatgggga gcaggaggct gacaatgagg tagatgaaga agaggaagaa ggtggggagg 361 aagaggagga ggaggaagaa ggtgatggtg aggaagaaga tggagatgaa gatgaggaag 421 ctgaggctcc tacgggcaag cgggtagctg aggatgatga ggatgatgat gttgagacca 481 agaagcagaa gaagactgat gaggatgact agacagcaaa aggaaaagct aaccttacgc 541 accgtgacct attcaccctc cacttcccgt ctcagaattt aaacgtggtc accttcgagt 601 agagaagcag gcccccgccg cccactccag gtgcacccac acatgacatg cgctcttcac 661 acccacaaaa cacaacatga ttgcacatgg aggagaaaga acagaactcc aggcctcttt 721 ttttcttaaa aatctttaaa ggaattgttg attttattaa ctaagcaaat tgaggcagca 781 ttaataaatt cgacaca // LOCUS RATTHYP 378 bp ss-mRNA ROD 24-JUL-1990 DEFINITION Rat spleen parathymosin-alpha mRNA, 3' end. ACCESSION M20616 KEYWORDS parathymosin. SOURCE Rat (strain Charles River CD, 5 week old) spleen, cDNA to mRNA, clone pRSpara-5187. 
ORGANISM Rattus norvegicus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 378) AUTHORS Frangou-Lazaridis,M., Clinton,M., Goodall,G.J. and Horecker,B.L. TITLE Prothymosin-alpha and parathymosin: Amino acid sequences deduced from the cloned rat spleen cDNAs-1 JOURNAL Arch. Biochem. Biophys. 263, 305-310 (1988) STANDARD full staff_entry COMMENT Draft entry and printed copy of sequence for [1] kindly provided by B.L.Horecker, 26-APR-1988. FEATURES from to/span description pept < 1 245 parathymosin precursor (AA at 3) BASE COUNT 111 a 69 c 144 g 54 t ORIGIN Unreported. 1 acaaggtgga ggagaaggct ggccggaaag aacggaagaa agaagtagtg gaggaggagg 61 agaatggagc tgaggaggag gaagaagaaa ctgctgagga tggagaggat gatgatgaag 121 gagacgaaga agatgaggag gaagaggagg aggaggatga aggccccgtg cggaagagaa 181 ctgctgaaga ggaggatgaa gcggatccca agaggcagaa gacagaaaac ggggcgtcgg 241 cttgagcccc tgcccgtggc ttggggatgg gaggcccctc aggtctggag gtgggcagga 301 acacacaatc agccccttct ctgctcctgc tctgcctgcc agagcttgac tgcttgacag 361 cttattcatt tcagcatc // LOCUS STYNUSAINF 228 bp ds-DNA BCT 24-JUL-1990 DEFINITION S.typhimurium nusA-infB operon promoter region. ACCESSION M28868 KEYWORDS . SOURCE S.typhimurium (strain LT2) DNA, clone 40. ORGANISM Salmonella typhimurium Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Facultatively anaerobic rods; Enterobacteriaceae. REFERENCE 1 (bases 1 to 228) AUTHORS Granston,A.E., Tompson,D.L. and Friedman,D.I. TITLE Identification of a second promoter for the metY-nusA-infB operon of Escherichia coli JOURNAL J. Bacteriol. 172, 2336-2342 (1990) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by A.E.Granston, 27-SEP-1989. 
FEATURES from to/span description signal 159 164 -35 region promoter P1 signal 183 188 -10 region promoter P1 BASE COUNT 69 a 39 c 54 g 66 t ORIGIN 1 aaaataaatg agttttcatt taatcatctt ttagcagaga caggaagcgt ttagtgaagg 61 tttactgaaa taacgttgtg acatgcgtgt ataagccgga gaggtgggat gagtataacg 121 aagagtgata ttcattcatc tgcatattgg gaatagggtt gacataactc atgccaggtc 181 actatactgt gcgcagattt tacgtcccgt cctcggtacc aaatccca // LOCUS SYNAA113M 9186 bp ds-DNA SYN 24-JUL-1990 DEFINITION Sequencing vector pAA113M DNA. ACCESSION M20189 KEYWORDS cloning vector; synthetic DNA. SOURCE Synthetic DNA. ORGANISM Artificial gene Artificial sequences; Genes. REFERENCE 1 (bases 1 to 9186) AUTHORS Ahmed,A. TITLE A vector for sequencing long (40-kb) DNA fragments JOURNAL Gene 75, 315-321 (1989) STANDARD full staff_review COMMENT Draft entry and computer readable copy of sequence [1] kindly provided by A.Ahmed, 28-SEP-1988. pAA113M is a cosmid vector specially developed to sequence long (40kb) DNA fragments. It makes use of the cos site to clone long segments, IS1 to subdivide the cloned fragment by deletions, galKT to select such deletions, and some unique restriction sites to further shorten the fragment for supercoil or M13 sequencing. It contains a normal plasmid ori of replication and also the M13 ori of replication to obtain DNA in the double-stranded or single-stranded form. FEATURES from to/span description pept 3787 4977 tet protein ORF 7433 8293 amp protein BASE COUNT 2051 a 2446 c 2453 g 2236 t ORIGIN Unreported.
1 caattactgc aatgccctcg taattaagtg aatttacaat atcgtcctgt tcggagggaa 61 gaacgcggga tgttcattct tcatcacttt taattgatgt atatgctctc ttttctgacg 121 ttagtctccg acggcaggct tcaatgaccc aggctgagaa attcccggac cctttttgct 181 caagagcgat gttaatttgt tcaatcattt ggttaggaaa gcggatgttg cgggttgttg 241 ttctgcgggt tctgttcttc gttgacatga ggttgccccg tattcagtgt cgctgatttg 301 tattgtctga agttgttttt acgttaagtt gatgcagatc aattaatacg atacctgcgt 361 cataattgat tatttgacgt ggtttgatgg cctccacgca cgttgtgata tgtagatgat 421 aatcattatc actttacggg tcctttccgg tgatccgaca ggttacgggg cggcgacctc 481 gcgggttttc gctatttatg aaaattttcc ggtttaaggc gtttccgttc ttcttcgtca 541 taacttaatg tttttattta aaataccctc tgaaaagaaa ggaaacgaca ggtgctgaaa 601 gcgaggcttt ttggcctctg tcgtttcctt tctctgtttt tgtccgtgga atgaacaatg 661 gaagtcaaca aaaagcagct ggctgacatt ttcggtgcga gtatccgtac cattcagaac 721 tggcaggaac agggaatgcc cgttctgcga ggcggtggca agggtaatga ggtgctttat 781 gactctgccg ccgtcataaa atggtatgcc gaaagggatg ctgaaattga gaacgaaaag 841 ctgcgccggg aggttgaaga actgcggcag gccagcgagg cagatcaaca gtcggtacgg 901 ctgaccatcg ggtgccagtg cgggagtttc gttcagcact gtcctgctcc ttgtgatggt 961 ttacaaacgt aaaaagtctc tttaatacct gtttttgctt catattgttc agcgacacgt 1021 tgctgtacgg caggcaccag ctcttccggg atcagcgcga cgatacagcc gccaaatccg 1081 ccgccggtca tgcgtacgcc acctttgtcg ccaatcacag ctttgacgat ttctaccaga 1141 gtgtcaattt gcggcacggt gatttcgaaa tcatcgcgca tagaggcatg agactccgcc 1201 atcaactcgc ccatacgttt caggtcgcct tgctccagcg cgctggcagc ttcaacggtg 1261 cgggcgtttt cagtcagtat atgacgcacg cgttttgcca cgatcgggtc cagttcatgc 1321 gcaacagcgt tgaactcttc aatggtgaca tcacgcaggg ctggctgctg gaagaaacgc 1381 gcaccggttt cgcactgttc acgacgggtg ttgtattcgc tgccaaccag ggtacgtttg 1441 aagttactgt tgatgatgac gacagccaca cctttgggca tggaaactgc tttggtcccc 1501 agtgagcggc aatcgatcag caaggcatga tctttcttgc cgagcgcgga aattagctga 1561 tccatgatcc cgcagttaca gcctacaaac tggttttctg cttcctgacc gttaagcgcg 1621 atttgtgcgc cgtccagcgg cagatgataa agctgctgca atacggttcc gaccgcgact 1681 tccagtgaag cggaagaact 
taacccggca ccctgcggca cattgccgtg atcaaccatg 1741 tccacgccgc cgaagctgtt gttacgcagt tgcagatgtt tcaccacgcc acgaacgtag 1801 ttagcccatt gatagttttc atgtgcgaca atgggcgcat cgagggaaaa ctcgtcgagc 1861 tgattttcat aatcggctgc catcacgcga actttacggt catcgcgtgg tgcacaactg 1921 atcacggttt gataatcaat cgcgcagggc agaacgaaac cgtcgttgta gtcggtgtgt 1981 tcaccaatca aattcacgcg gccaggcgcc tgaatggtgt gagtggcagg gtagccaaat 2041 gcgttggcaa acagagattg tgttttttct ttcagactca tttcttacac tccggattcg 2101 cgaaaatgga tatcgctgac tgcgcgcaaa cgctctgctg cctgttctgc ggtcaggtct 2161 cgctgggtct ctgccagcat ttcataacca accataaatt tacgtacggt ggcggacggc 2221 agcagaggcg gataaaagtg cgcgtgcagc tgccagtgtt gattctcttc gccattaaat 2281 ggcgcgccgt gccagcccat agagtagggg aaggagcact ggaagaggtt gtcataacga 2341 ctggtcagct ttttcaacgc cagcgccaga tcgctgcgct gggcgtcggt caaatcggtg 2401 atccgtaaaa cgtgggcttt gggcagcagt agcgtttcga acggccaggc agcccagtaa 2461 ggcacgacgg ctaaccagtg ttcggtttcg acaacggtac ggctaccgtc tgccagctcg 2521 cgctgaacat aatccaccag cattggtgat ttctgttcgg caaaatattc tttttgcagg 2581 cggtcttcgc gctcagcttc gttaggcagg aagctatttg cccaaatctg accgtgcgga 2641 tgcgggttag agcagcccat cgccgcgcct ttgttttcaa aaacctgcac ccatgggtac 2701 gttttcccca gttctgcggt ttgctcctgc caggttttga cgatttccgt caatgctgca 2761 acgctgagct ctggcagcgt tttactgtga tccggtgaaa agcagatcac ccggctggtg 2821 ccgcgcgcgc tctggcaacg catcagcgga tcgtgacttt ctggcgcatc tggcgtgtca 2881 gacatcaaag ccgcaaagtc attagtgaaa acgtaagtcc cggtgtaatc ggggttttta 2941 tcgcctgtca cccgcacatt acctgcgcag aggaagcaat ctggatcgtg cgcaggtaac 3001 acctgtttgg ctggcgtttc ctgcgccccc tgccagggcg ttagccgggt gcggtgaaac 3061 cagaatccat tgcccggtga gcggttgtag cggcgatgtg gatgatcaac gggattaaat 3121 tgcgtcatgg tcgttcctta atcgggatat ccctgtggat ggcgtgactg ccagtgccag 3181 gtgtcctgcg ccatttcatc gagtgtgcgc gttacgcagt tcagttcacg gtcggctttg 3241 ctggcgtccg ccagtaggcc ggaaggtcgc cctcgcgagc ggtgcaaaat gataattaac 3301 cggtttgccg cacgtttgct gaaggcatta accacgtcca gcacgctgtt gcctacgcca 3361 gcgccgaggt tgtagatgtg tacgcctggc 
ttgttcgcca gtttttccat cgccacgacg 3421 tgaccgtccg ccagatccat tacgtggatg taatcgcgta cgccagtacc atcttcggtc 3481 ggataatcgt taccaaaaat cgccagcgag tcgcgacggc ctacagcaac ctgggcgatg 3541 tatggcatca ggttattcgg aatgccttgc ggatcttcgc ccatatcgcc cgacggatgc 3601 gcgccaaccg ggttgaagta gcgcacgagg gcaatgctcc agtccggctg ggctttttgc 3661 agatcggtga ggatctgttc caccatcagc ttgcttttgc cgtaagggct ttgcggtgtg 3721 ccggtcggga agctataatg cggtagttta tcacagttaa attgctaacg cagtcaggca 3781 ccgtgtatga aatctaacaa tgcgctcatc gtcatcctcg gcaccgtcac cctggatgct 3841 gtaggcatag gcttggttat gccggtactg ccgggcctct tgcgggatat cgtccattcc 3901 gacagcatcg ccagtcacta tggcgtgctg ctagcgctat atgcgttgat gcaatttcta 3961 tgcgcacccg ttctcggagc actgtccgac cgctttggcc gccgcccagt cctgctcgct 4021 tcgctacttg gagccactat cgactacgcg atcatggcga ccacacccgt cctgtggatc 4081 ctctacgccg gacgcatcgt ggccggcatc accggcgcca caggtgcggt tgctggcgcc 4141 tatatcgccg acatcaccga tggggaagat cgggctcgcc acttcgggct catgagcgct 4201 tgtttcggcg tgggtatggt ggcaggcccc gtggccgggg gactgttggg cgccatctcc 4261 ttgcatgcac cattccttgc ggcggcggtg ctcaacggcc tcaacctact actgggctgc 4321 ttcctaatgc aggagtcgca taagggagag cgtcgaccga tgcccttgag agccttcaac 4381 ccagtcagct ccttccggtg ggcgcggggc atgactatcg tcgccgcact tatgactgtc 4441 ttctttatca tgcaactcgt aggacaggtg ccggcagcgc tctgggtcat tttcggcgag 4501 gaccgctttc gctggagcgc gacgatgatc ggcctgtcgc ttgcggtatt cggaatcttg 4561 cacgccctcg ctcaagcctt cgtcactggt cccgccacca aacgtttcgg cgagaagcag 4621 gccattatcg ccggcatggc ggccgacgcg ctgggctacg tcttgctggc gttcgcgacg 4681 cgaggctgga tggccttccc cattatgatt cttctcgctt ccggcggcat cgggatgccc 4741 gcgttgcagg ccatgctgtc caggcaggta gatgacgacc atcagggaca gcttcaagga 4801 tcgctcgcgg ctcttaccag cctaacttcg atcattggac cgctgatcgt cacggcgatt 4861 tatgccgcct cggcgagcac atggaacggg ttggcatgga ttgtaggcgc cgccctatac 4921 cttgtctgcc tccccgcgtt gcgtcgcggt gcatggagcc gggccacctc gacctgaatg 4981 gaagccggcg gcacctcgct aacggattca ccactccaag aattggagcc aatcaattct 5041 tgcggagaac tgtgaatgcg caaaccaacc cttggcagaa 
catatccatc gcgtccgcca 5101 tctccagcag ccgcacgcgg cgcatctcgg gcagcgttgg gtcctggcca cgggtgcgca 5161 tgatcgtgct cctgtcgttg aggacccggc taggctggcg gggttgcctt actggttagc 5221 agaatgaatc accgatacgc gagcgaacgt gaagcgactg ctgctgcaaa acgtctgcga 5281 cctgagcaac aacatgaatg gtcttcggtt tccgtgtttc gtaaagtctg gaaacgcgga 5341 agtcagcgcc ctgcaccatt atgttccgga tctgcatcgc aggatgctgc tggctaccct 5401 gtggaacacc tacatctgta ttaacgaagc gctggcattg accctgagtg atttttctct 5461 ggtcccgccg catccatacc gccagttgtt taccctcaca acgttccagt aaccgggcat 5521 gttcatcatc agtaacccgt atcgtgagca tcctctctcg tttcatcggt atcattaccc 5581 ccatgaacag aaattccccc ttacacggag gcatcaagtg accaaacagg aaaaaaccgc 5641 ccttaacatg gcccgcttta tcagaagcca gacattaacg cttctggaga aactcaacga 5701 gctggacgcg gatgaacagg cagacatctg tgaatcgctt cacgaccacg ctgatgagct 5761 ttaccgcagc tgcctcgcgc gtttcggtga tgacggtgaa aacctctgac acatgcagct 5821 cccggagacg gtcacagctt gtctgtaagc ggatgccggg agcagacaag cccgtcaggg 5881 cgcgtcagcg ggtgttggcg ggtgtcgggg cgcagccatg acccagtcac gtagcgatag 5941 cggagtgtat actggcttaa ctatgcggca tcagagcaga ttgtactgag agtgcaccat 6001 atgcggtgtg aaataccgca cagatgcgta aggagaaaat accgcatcag gcgctcttcc 6061 gcttcctcgc tcactgactc gctgcgctcg gtcgttcggc tgcggcgagc ggtatcagct 6121 cactcaaagg cggtaatacg gttatccaca gaatcagggg ataacgcagg aaagaacatg 6181 tgagcaaaag gccagcaaaa ggccaggaac cgtaaaaagg ccgcgttgct ggcgtttttc 6241 cataggctcc gcccccctga cgagcatcac aaaaatcgac gctcaagtca gaggtggcga 6301 aacccgacag gactataaag ataccaggcg tttccccctg gaagctccct cgtgcgctct 6361 cctgttccga ccctgccgct taccggatac ctgtccgcct ttctcccttc gggaagcgtg 6421 gcgctttctc aatgctcacg ctgtaggtat ctcagttcgg tgtaggtcgt tcgctccaag 6481 ctgggctgtg tgcacgaacc ccccgttcag cccgaccgct gcgccttatc cggtaactat 6541 cgtcttgagt ccaacccggt aagacacgac ttatcgccac tggcagcagc cactggtaac 6601 aggattagca gagcgaggta tgtaggcggt gctacagagt tcttgaagtg gtggcctaac 6661 tacggctaca ctagaaggac agtatttggt atctgcgctc tgctgaagcc agttaccttc 6721 ggaaaaagag ttggtagctc ttgatccggc aaacaaacca ccgctggtag 
cggtggtttt 6781 tttgtttgca agcagcagat tacgcgcaga aaaaaaggat ctcaagaaga tcctttgatc 6841 ttttctacgg ggtctgacgc tcagtggaac gaaaactcac gttaagggat tttggtcatg 6901 agattatcaa aaaggatctt cacctagatc cttttaaatt gtaaacgtta atattttgtt 6961 aaaattcgcg ttaaattttt gttaaatcag ctcatttttt aaccaatagg ccgaaatcgg 7021 caaaatccct tataaatcaa aagaatagcc cgagataggg ttgagtgttg ttccagtttg 7081 gaacaagagt ccactattaa agaacgtgga ctccaacgtc aaagggcgaa aaaccgtcta 7141 tcagggcgat ggcccactac gtgaaccatc acccaaatca agttttttgg ggtcgaggtg 7201 ccgtaaagca ctaaatcgga accctaaagg gagcccccga tttagagctt gacggggaaa 7261 gccggcgaac gtggcgagaa aggaagggaa gaaagcgaaa ggagcgggcg ctagggcgct 7321 ggcaagtgta gcggtcacgc tgcgcgtaac caccacaccc gccgcgctta atgcgccgct 7381 acagggcgcg taaatcaatc taaagtatat atgagtaaac ttggtctgac agttaccaat 7441 gcttaatcag tgaggcacct atctcagcga tctgtctatt tcgttcatcc atagttgcct 7501 gactccccgt cgtgtagata actacgatac gggagggctt accatctggc cccagtgctg 7561 caatgatacc gcgagaccca cgctcaccgg ctccagattt atcagcaata aaccagccag 7621 ccggaagggc cgagcgcaga agtggtcctg caactttatc cgcctccatc cagtctatta 7681 attgttgccg ggaagctaga gtaagtagtt cgccagttaa tagtttgcgc aacgttgttg 7741 ccattgctgc aggcatcgtg gtgtcacgct cgtcgtttgg tatggcttca ttcagctccg 7801 gttcccaacg atcaaggcga gttacatgat cccccatgtt gtgcaaaaaa gcggttagct 7861 ccttcggtcc tccgatcgtt gtcagaagta agttggccgc agtgttatca ctcatggtta 7921 tggcagcact gcataattct cttactgtca tgccatccgt aagatgcttt tctgtgactg 7981 gtgagtactc aaccaagtca ttctgagaat agtgtatgcg gcgaccgagt tgctcttgcc 8041 cggcgtcaac acgggataat accgcgccac atagcagaac tttaaaagtg ctcatcattg 8101 gaaaacgttc ttcggggcga aaactctcaa ggatcttacc gctgttgaga tccagttcga 8161 tgtaacccac tcgtgcaccc aactgatctt cagcatcttt tactttcacc agcgtttctg 8221 ggtgagcaaa aacaggaagg caaaatgccg caaaaaaggg aataagggcg acacggaaat 8281 gttgaatact catactcttc ctttttcaat attattgaag catttatcag ggttattgtc 8341 tcatgagcgg atacatattt gaatgtattt agaaaaataa acaaataggg gttccgcgca 8401 catttccccg aaaagtgcgg taatgactcc aacttattga tagtgtttta tgttcagata 
8461 atgcccgatg actttgtcat gcagctccac cgattttgag aacgacagcg acttccgtcc 8521 cagccgtgcc aggtgctgcc tcagattcag gttatgccgc tcaattcgct gcgtatatcg 8581 cttgctgatt acgtgcagct ttcccttcag gcgggattca tacagcggcc agccatccgt 8641 catccatatc accacgtcaa agggtgacag caggctcata agacgcccca gcgtcgccat 8701 agtgcgttca ccgaatacgt gcgcaacaac cgtcttccgg agactgtcat acgcgtaaaa 8761 cagccagcgc tggcgcgatt tagccccgac atagccccac tgttcgtcca tttccgcgca 8821 gacgatgacg tcactgcccg gctgtatgcg cgaggttacc gactgcggcc tgagtttttt 8881 aagtgacgta aaatcgtgtt gaggccaacg cccataatgc gggctgttgc ccggcatcca 8941 acgccattca tggccatatc aatgattttc tggtgcgtac cgggttgaga agcggtgtaa 9001 gtgaactgca gttgccatgt tttacggcag tgagagcaga gatagcgctg atgtccggcg 9061 gtgcttttgc cgttacgcac caccccgtca gtagctgaac aggagggaca gctgatagaa 9121 acagaagcca ctggagcacc tcaaaaacac catcatacac taaatcagta agttggcagc 9181 atcacc // LOCUS SYNHUMUBA 246 bp ds-DNA SYN 24-JUL-1990 DEFINITION Synthetic human ubiquitin gene (Thr14->Cys), complete cds. ACCESSION M21581 KEYWORDS ubiquitin. SOURCE Synthetic DNA. ORGANISM Artificial gene Artificial sequences; Genes. REFERENCE 1 (sites) AUTHORS Ecker,D.J., Butt,T.R., Marsh,J., Sternberg,E., Shatzman,A., Dixon,J.S., Weber,P.L. and Crooke,S.T. TITLE Ubiquitin function studied by disulfide engineering JOURNAL J. Biol. Chem. 264, 1887-1893 (1989) STANDARD full staff_review REFERENCE 2 (bases 1 to 246; for [1]) AUTHORS Ecker,D.J., Butt,T.R., Marsh,J., Sternberg,E., Shatzman,A., Dixon,J.S., Weber,P.L. and Crooke,S.T. JOURNAL Unpublished (1989) See comment for author address STANDARD full staff_review COMMENT Draft entry and computer readable sequence for [1] kindly provided by D.Ecker, 01-DEC-1988. Author address [1] Smith Kline and French Laboratories P.O. Box 1539 King of Prussia, PA 19046 FEATURES from to/span description pept 10 240 synthetic ubiquitin BASE COUNT 81 a 51 c 54 g 60 t ORIGIN 1 bp upstream of EcoRI site. 
1 gaattcatta tgcagatctt cgtcaagacg ttaaccggta aaaccatatg tctagaagtt 61 gaaccatccg ataccatcga aaacgttaag gctaaaattc aagacaagga aggcattcca 121 cctgatcaac aaagattgat ctttgccggt aagcagctcg aggacggtag aacgctgtct 181 gattacaaca ttcagaagga gtcgacctta catcttgtct taagactaag aggtggttga 241 ggtacc // LOCUS SYNHUMUBB 246 bp ds-DNA SYN 24-JUL-1990 DEFINITION Synthetic human ubiquitin gene (Phe4->Cys), complete cds. ACCESSION M22214 KEYWORDS ubiquitin. SOURCE Synthetic DNA. ORGANISM Artificial gene Artificial sequences; Genes. REFERENCE 1 (sites) AUTHORS Ecker,D.J., Butt,T.R., Marsh,J., Sternberg,E., Shatzman,A., Dixon,J.S., Weber,P.L. and Crooke,S.T. TITLE Ubiquitin function studied by disulfide engineering JOURNAL J. Biol. Chem. 264, 1887-1893 (1989) STANDARD full staff_review REFERENCE 2 (bases 1 to 246; for [1]) AUTHORS Ecker,D.J., Butt,T.R., Marsh,J., Sternberg,E., Shatzman,A., Dixon,J.S., Weber,P.L. and Crooke,S.T. JOURNAL Unpublished (1989) See comment for author address STANDARD full staff_review COMMENT Draft entry and computer readable sequence for [1] kindly provided by D.Ecker, 01-DEC-1988. Author address [1] Smith Kline and French Laboratories P.O. Box 1539 King of Prussia, PA 19046 FEATURES from to/span description pept 10 240 synthetic ubiquitin BASE COUNT 82 a 51 c 54 g 59 t ORIGIN 1 bp upstream of EcoRI site. 1 gaattcatta tgcagatctg tgtcaagacg ttaaccggta aaaccataac tctagaagtt 61 gaaccatccg ataccatcga aaacgttaag gctaaaattc aagacaagga aggcattcca 121 cctgatcaac aaagattgat ctttgccggt aagcagctcg aggacggtag aacgctgtct 181 gattacaaca ttcagaagga gtcgacctta catcttgtct taagactaag aggtggttga 241 ggtacc // LOCUS SYNHUMUBC 246 bp ds-DNA SYN 24-JUL-1990 DEFINITION Synthetic human ubiquitin gene (Phe4->Cys, Thr14->Cys), complete cds. ACCESSION M22215 KEYWORDS ubiquitin. SOURCE Synthetic DNA. ORGANISM Artificial gene Artificial sequences; Genes. REFERENCE 1 (sites) AUTHORS Ecker,D.J., Butt,T.R., Marsh,J., Sternberg,E., Shatzman,A., Dixon,J.S., Weber,P.L. and Crooke,S.T. 
TITLE Ubiquitin function studied by disulfide engineering JOURNAL J. Biol. Chem. 264, 1887-1893 (1989) STANDARD full staff_review REFERENCE 2 (bases 1 to 246; for [1]) AUTHORS Ecker,D.J., Butt,T.R., Marsh,J., Sternberg,E., Shatzman,A., Dixon,J.S., Weber,P.L. and Crooke,S.T. JOURNAL Unpublished (1989) See comment for author address STANDARD full staff_review COMMENT Draft entry and computer readable sequence for [1] kindly provided by D.Ecker, 01-DEC-1988. Author address [1] Smith Kline and French Laboratories P.O. Box 1539 King of Prussia, PA 19046 FEATURES from to/span description pept 10 240 synthetic ubiquitin BASE COUNT 81 a 50 c 55 g 60 t ORIGIN 1 bp upstream of EcoRI site. 1 gaattcatta tgcagatctg tgtcaagacg ttaaccggta aaaccatatg tctagaagtt 61 gaaccatccg ataccatcga aaacgttaag gctaaaattc aagacaagga aggcattcca 121 cctgatcaac aaagattgat ctttgccggt aagcagctcg aggacggtag aacgctgtct 181 gattacaaca ttcagaagga gtcgacctta catcttgtct taagactaag aggtggttga 241 ggtacc // LOCUS SYNHUMUBD 246 bp ds-DNA SYN 24-JUL-1990 DEFINITION Synthetic human ubiquitin gene (Phe4->Cys, Thr14->Cys, Thr66->Cys), complete cds. ACCESSION M22216 KEYWORDS ubiquitin. SOURCE Synthetic DNA. ORGANISM Artificial gene Artificial sequences; Genes. REFERENCE 1 (sites) AUTHORS Ecker,D.J., Butt,T.R., Marsh,J., Sternberg,E., Shatzman,A., Dixon,J.S., Weber,P.L. and Crooke,S.T. TITLE Ubiquitin function studied by disulfide engineering JOURNAL J. Biol. Chem. 264, 1887-1893 (1989) STANDARD full staff_review REFERENCE 2 (bases 1 to 246; for [1]) AUTHORS Ecker,D.J., Butt,T.R., Marsh,J., Sternberg,E., Shatzman,A., Dixon,J.S., Weber,P.L. and Crooke,S.T. JOURNAL Unpublished (1989) See comment for author address STANDARD full staff_review COMMENT Draft entry and computer readable sequence for [1] kindly provided by D.Ecker, 01-DEC-1988. Author address [1] Smith Kline and French Laboratories P.O. 
Box 1539 King of Prussia, PA 19046 FEATURES from to/span description pept 10 240 synthetic ubiquitin BASE COUNT 80 a 48 c 56 g 62 t ORIGIN 1 bp upstream of EcoRI site. 1 gaattcatta tgcagatctg tgtcaagacg ttaaccggta aaaccatatg tctagaagtt 61 gaaccatccg ataccatcga aaacgttaag gctaaaattc aagacaagga aggcattcca 121 cctgatcaac aaagattgat ctttgccggt aagcagctcg aggacggtag aacgctgtct 181 gattacaaca ttcagaagga gtcgtgttta catcttgtct taagactaag aggtggttga 241 ggtacc // LOCUS SYNHUMUBE 246 bp ds-DNA SYN 24-JUL-1990 DEFINITION Synthetic human ubiquitin gene (Phe4->Cys, Thr66->Cys), complete cds. ACCESSION M22217 KEYWORDS ubiquitin. SOURCE Synthetic DNA. ORGANISM Artificial gene Artificial sequences; Genes. REFERENCE 1 (sites) AUTHORS Ecker,D.J., Butt,T.R., Marsh,J., Sternberg,E., Shatzman,A., Dixon,J.S., Weber,P.L. and Crooke,S.T. TITLE Ubiquitin function studied by disulfide engineering JOURNAL J. Biol. Chem. 264, 1887-1893 (1989) STANDARD full staff_review REFERENCE 2 (bases 1 to 246; for [1]) AUTHORS Ecker,D.J., Butt,T.R., Marsh,J., Sternberg,E., Shatzman,A., Dixon,J.S., Weber,P.L. and Crooke,S.T. JOURNAL Unpublished (1989) See comment for author address STANDARD full staff_review COMMENT Draft entry and computer readable sequence for [1] kindly provided by D.Ecker, 01-DEC-1988. Author address [1] Smith Kline and French Laboratories P.O. Box 1539 King of Prussia, PA 19046 FEATURES from to/span description pept 10 240 synthetic ubiquitin BASE COUNT 81 a 49 c 55 g 61 t ORIGIN 1 bp upstream of EcoRI site. 1 gaattcatta tgcagatctg tgtcaagacg ttaaccggta aaaccataac tctagaagtt 61 gaaccatccg ataccatcga aaacgttaag gctaaaattc aagacaagga aggcattcca 121 cctgatcaac aaagattgat ctttgccggt aagcagctcg aggacggtag aacgctgtct 181 gattacaaca ttcagaagga gtcgtgttta catcttgtct taagactaag aggtggttga 241 ggtacc // LOCUS SYNHUMUBF 246 bp ds-DNA SYN 24-JUL-1990 DEFINITION Synthetic human ubiquitin gene (Thr66->Cys), complete cds. ACCESSION M22218 KEYWORDS ubiquitin. SOURCE Synthetic DNA. 
ORGANISM Artificial gene Artificial sequences; Genes. REFERENCE 1 (sites) AUTHORS Ecker,D.J., Butt,T.R., Marsh,J., Sternberg,E., Shatzman,A., Dixon,J.S., Weber,P.L. and Crooke,S.T. TITLE Ubiquitin function studied by disulfide engineering JOURNAL J. Biol. Chem. 264, 1887-1893 (1989) STANDARD full staff_review REFERENCE 2 (bases 1 to 246; for [1]) AUTHORS Ecker,D.J., Butt,T.R., Marsh,J., Sternberg,E., Shatzman,A., Dixon,J.S., Weber,P.L. and Crooke,S.T. JOURNAL Unpublished (1989) See comment for author address STANDARD full staff_review COMMENT Draft entry and computer readable sequence for [1] kindly provided by D.Ecker, 01-DEC-1988. Author address [1] Smith Kline and French Laboratories P.O. Box 1539 King of Prussia, PA 19046 FEATURES from to/span description pept 10 240 synthetic ubiquitin BASE COUNT 81 a 50 c 54 g 61 t ORIGIN 1 bp upstream of EcoRI site. 1 gaattcatta tgcagatctt cgtcaagacg ttaaccggta aaaccataac tctagaagtt 61 gaaccatccg ataccatcga aaacgttaag gctaaaattc aagacaagga aggcattcca 121 cctgatcaac aaagattgat ctttgccggt aagcagctcg aggacggtag aacgctgtct 181 gattacaaca ttcagaagga gtcgtgttta catcttgtct taagactaag aggtggttga 241 ggtacc // LOCUS XELC3 1087 bp ss-mRNA VRT 24-JUL-1990 DEFINITION X.laevis complement component 3 (C3) mRNA, 3' end. ACCESSION M21790 J04493 KEYWORDS complement component 3; serum protein. SOURCE X.laevis (adult) liver, cDNA to mRNA. ORGANISM Xenopus laevis Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Amphibia; Lissamphibia; Anura; Archeobatrachia; Pipoidea; Pipidae. REFERENCE 1 (bases 1 to 1087) AUTHORS Grossberger,D., Marcuz,A., Du Pasquier,L. and Lambris,J.D. TITLE Conservation of structural and functional domains in complement component C3 of Xenopus laevis and mammals JOURNAL Proc. Natl. Acad. Sci. U.S.A. 86, 1323-1327 (1989) STANDARD simple staff_review COMMENT Draft entry and computer-readable sequence [1] kindly submitted by D.Grossberger, 14-DEC-1988. 
FEATURES from to/span description pept < 1 969 complement component 3 (AA at 1) BASE COUNT 327 a 211 c 258 g 291 t ORIGIN Unreported. 1 gggagggtgg tgaccgtgta ccagtctctt gtcatagaga aagaaagaaa gtgccttaac 61 tttgaactgt cttgtgaatg taaaggaagt acaacttgca agacctcccg aagtgcaaag 121 gcaacagtgt ctatagaagc ctgtgcaagg catctcaaga acgttgatgc caccatgtcc 181 attattgata tttccatgat gactggcttt tctcctgata ccgattccct ggataggcta 241 atgaagggag tggacaaata catctctaaa tatgaagtca acaaaggagc aaatgacaag 301 ggaacactta ttctctactt ggacaaagtc tcccacatag acgaagaatg tgtgaagttt 361 tatgctcatc aatactttga agtgggtttc atccagccag cttctgtaac tgtgtatgac 421 tattatactc cagataatcg ttgcactaaa ttttaccatg tggaagaagg cagtgccttg 481 ctggcaggga tttgccaagg ggatatatgc cgatgtgcag aagagaactg tttcatgcag 541 cagcaaattg agggtaaaat cactgctgac atgagagtca acatggcttg gctcccggag 601 tggaattttg tgtacaaggc tactctcacg gagttgcagc ccagcgacaa ctatgacaac 661 tatgtaatga caattaagaa ggtcattaag caaggcacag atgaggatcc tgaggacaag 721 acacgcaatt ttatcagcca tatcaaatgc cgaaaagctt taaatatgca gctgaaccga 781 gattatctga tttggggggt aactggtgac ctctggcgcc acgatggata ttcctacatc 841 attgggaagg acacatggat ggagtggtgg cccaatgaga gggagtgtca acaacgtgag 901 aaccaggatc tctgcgatga ctttgagaca gtttctgaca acctggagat tgttggttgt 961 cccaactgag gaacctaaat tcagaccaaa atgtttccat cttcccaagc tcatctgtga 1021 tgtttaattt tgtaaatgtt gttcagtgtt ttatatacag ttccaataaa gcatttattt 1081 gcttaac // LOCUS YSCKEX2A 4106 bp ds-DNA PLN 24-JUL-1990 DEFINITION S.cerevisiae prohormone processing enzyme (KEX2) gene, complete cds. ACCESSION M24201 KEYWORDS Ca-dependent serine protease; furin-related protein; processing enzyme; protease; subtilisin-related protease. SOURCE S.cerevisiae (strain DBY939) DNA, clone YEp24-pJ2B. ORGANISM Saccharomyces cerevisiae Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes; Endomycetales; Saccharomycetaceae. REFERENCE 1 (bases 1 to 4106) AUTHORS Fuller,R.S., Brake,A. and Thorner,J. 
TITLE Yeast prohormone processing enzyme (KEX2 gene product) is a Ca2+ -dependent serine protease JOURNAL Proc. Natl. Acad. Sci. U.S.A. 86, 1434-1438 (1989) STANDARD simple staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by A.Brake, 09-MAY-1989 FEATURES from to/span description pept 1329 3773 prohormone processing enzyme (KEX2) BASE COUNT 1327 a 803 c 827 g 1149 t ORIGIN Chromosome 14, left arm. 1 gaattctctg ttgactacta aactgagaga atttgccgag actctaagaa cagctttgaa 61 agagcgttct gccgatgatt ccataattgt cactctgaga gagcaaatgc aaagagaaat 121 cttcaggttg atgtcgttgt tcatggacat acctccagtg caaccaaacg agcaattcac 181 ttgggaatac gttgacaaag acaagaaaat ccacactatc aaatcgactc cgttagaatt 241 tgcctccaaa tacgcaaaat tggacccttc cacgccagtc tcattgatca atgatccaag 301 acaccatatg gtaaattaat taagatcgat cgtttaggaa acgtccttgg cggagatgcc 361 gtgatttact taaatgttga caatgaaaca ctatctaaat tggttgttaa gagattacaa 421 aataacaaag ctgtcttttt tggatctcac actccaaagt tcatggacaa gaaaactggt 481 gtcatggata ttgaattgtg gaactatcct gccatggcta taatttacct cagcaaaagg 541 catccggtat tagataccat gaaagtttga tgactcatgc tatgttggat cactggctgc 601 cacgtcgatg aaacgtctaa attaccactt cgctaccgtc tgaaaattcc tggggtaaag 661 actccggtaa agacggatta tacgtgatga ctcaaaagta cttcgaggag tactgctttc 721 aaattgtggt cgatatcaat gaattgccaa aagagctggc ttcaaaattc acctcaggta 781 aggaagagcc gattgtcttg cccatctgga cccaatggtg ctttggccaa ataaatagtt 841 tcagcagctc tgatgtagat acacgtatct cgacatgttt tatttttact atacatacat 901 aaaagaaata aaaaatgata acgtgtatat tattattcat ataatcaatg agggtcattt 961 tctgaaacgc aaaaaacggt aaatggaaaa aaaataaaga tagaaaaaga aaacaaacaa 1021 aggaaaggtt agcatattaa ataactgagc tgatacttca acagcatcgc tgaagagaac 1081 agtattgaaa ccgaaacatt ttctaaaggc aaacaaggta ctccatattt gctggacgtg 1141 ttctttctct cgtttcatat gcataattct gtcataagcc tgttcttttt cctggcttaa 1201 acatcccgtt ttgtaaaaga gaaatctatt ccacatattt cattcattcg gctaccatac 1261 taaggataaa ctaatcccgt tgttttttgg cctcgtcaca taattataaa ctactaaccc 1321 attatcagat 
gaaagtgagg aaatatatta ctttatgctt ttggtgggcc ttttcaacat 1381 ccgctcttgt atcatcacaa caaattccat tgaaggacca tacgtcacga cagtattttg 1441 ctgtagaaag caatgaaaca ttatcccgct tggaggaaat gcatccaaat tggaaatatg 1501 aacatgatgt tcgagggcta ccaaaccatt atgttttttc aaaagagttg ctaaaattgg 1561 gcaaaagatc atcattagaa gagttacagg gggataacaa cgaccacata ttatctgtcc 1621 atgatttatt cccgcgtaac gacctattta agagactacc ggtgcctgct ccaccaatgg 1681 actcaagctt gttaccggta aaagaagctg aggataaact cagcataaat gatccgcttt 1741 ttgagaggca gtggcacttg gtcaatccaa gttttcctgg cagtgatata aatgttcttg 1801 atctgtggta caataatatt acaggcgcag gggtcgtggc tgccattgtt gatgatggcc 1861 ttgactacga aaatgaagac ttgaaggata atttttgcgc tgaaggttct tgggatttca 1921 acgacaatac caatttacct aaaccaagat tatctgatga ctaccatggt acgagatgtg 1981 caggtgaaat agctgccaaa aaaggtaaca atttttgcgg tgtcggggta ggttacaacg 2041 ctaaaatctc aggcataaga atcttatccg gtgatatcac tacggaagat gaagctgcgt 2101 ccttgattta tggtctagac gtaaacgata tatattcatg ctcatggggt cccgctgatg 2161 acggaagaca tttacaaggc cctagtgacc tggtgaaaaa ggctttagta aaaggtgtta 2221 ctgagggaag agattccaaa ggagcgattt acgtttttgc cagtggaaat ggtggaactc 2281 gtggtgataa ttgcaattac gacggctata ctaattccat atattctatt actattgggg 2341 ctattgatca caaagatcta catcctcctt attccgaagg ttgttccgcc gtcatggcag 2401 tcacgtattc ttcaggttca ggcgaatata ttcattcgag tgatatcaac ggcagatgca 2461 gtaatagcca cggtggaacg tctgcggctg ctccattagc tgccggtgtt tacactttgt 2521 tactagaagc caacccaaac ctaacttgga gagacgtaca gtatttatca atcttgtctg 2581 cggtagggtt agaaaagaac gctgacggag attggagaga tagcgccatg gggaagaaat 2641 actctcatcg ctatggcttt ggtaaaatcg atgcccataa gttaattgaa atgtccaaga 2701 cctgggagaa tgttaacgca caaacctggt tttacctgcc aacattgtat gtttcccagt 2761 ccacaaactc cacggaagag acattagaat ccgtcataac catatcagaa aaaagtcttc 2821 aagatgctaa cttcaagaga attgagcacg tcacggtaac tgtagatatt gatacagaaa 2881 ttaggggaac tacgactgtc gatttaatat caccagcggg gataatttca aaccttggcg 2941 ttgtaagacc aagagatgtt tcatcagagg gattcaaaga ctggacattc atgtctgtag 3001 cacattgggg tgagaacggc 
gtaggtgatt ggaaaatcaa ggttaagaca acagaaaatg 3061 gacacaggat tgacttccac agttggaggc tgaagctctt tggggaatcc attgattcat 3121 ctaaaacaga aactttcgtc tttggaaacg ataaagagga ggttgaacca gctgctacag 3181 aaagtaccgt atcacaatat tctgccagtt caacttctat ttccatcagc gctacttcta 3241 catcttctat ctcaattggt gtggaaacgt cggccattcc ccaaacgact actgcgagta 3301 ccgatcctga ttctgatcca aacactccta aaaaactttc ctctcctagg caagccatgc 3361 attatttttt aacaatattt ttgattggcg ccacattttt ggtgttatac ttcatgtttt 3421 ttatgaaatc aaggagaagg atcagaaggt caagagcgga aacgtatgaa ttcgatatca 3481 ttgatacaga ctctgagtac gattctactt tggacaatgg aacttccgga attactgagc 3541 ccgaagaggt tgaggacttc gattttgatt tgtccgatga agaccatctt gcaagtttgt 3601 cttcatcaga aaacggtgat gctgaacata caattgatag tgtactaaca aacgaaaatc 3661 catttagtga ccctataaag caaaagttcc caaatgacgc caacgcagaa tctgcttcca 3721 ataaattaca agaattacag cctgatgttc ctccatcttc cggacgatcg tgattcgata 3781 tgtacagaaa gcttcaaatt acaaaatagc atttttttct tatagattat aatactctct 3841 catacgtata cgtatatgtg tatatgatat ataaacaaac attaatatcc tattccttcc 3901 gtttgaaatc cctatgatgt actttgcatt gtttgcaccc gcgaataaaa tgaaaactcc 3961 gaaccgatat atcaagcaca taaaagggga gggtccaatt aatgcatatt taagaccaca 4021 gctgaataac tttaaaacgg cagacaaaac aaaaaatagg tcgaataaac cttacctgcc 4081 tagaaggaat gacagcagct aataag // LOCUS YSCTASPT 2641 bp ds-DNA PLN 24-JUL-1990 DEFINITION S.cerevisiae mitochondrial aspartyl-tRNA synthetase (MSD1) gene, complete cds. ACCESSION M26020 KEYWORDS aspartyl-tRNA synthetase. SOURCE S.cerevisiae (strain D273-10B) DNA. ORGANISM Saccharomyces cerevisiae Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes; Endomycetales; Saccharomycetaceae. REFERENCE 1 (bases 1 to 2641) AUTHORS Tzagoloff,A. JOURNAL Unpublished (1989) Columbia U. Dept. Biol. Sci. NY NY 10027 STANDARD full staff_review REFERENCE 2 (sites; for [1]) AUTHORS Gampel,A. and Tzagoloff,A. TITLE Homology of aspartyl- and lysyl-tRNA synthetases JOURNAL Proc. Natl. Acad. Sci. U.S.A. 
86, 6023-6027 (1989) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by A.Tzagoloff, 06-JUL-1989. FEATURES from to/span description pept 601 2577 aspartyl-tRNA synthetase BASE COUNT 878 a 466 c 551 g 746 t ORIGIN 1 aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaacctg taacaatagc aataccccaa 61 atacctaatg tagttccagc aagcaagcta aaaagtaaag caacaacata actcacccct 121 gcatctgcag cttttgcccg ggcagcctgc tctgcctgtg ttttctttaa ttgagcagta 181 gaccatttag cagttgcatg aatagctgca gcgtcacatc ggataataat gatggcagcc 241 attgtagaag tgccttttgc atttctagtc tctttctcgg tctagctagt tttactacat 301 cgcgaagata gaatcttaga tcacactgcc aatcttgtgc aagctgtggc aaaatcgacc 361 ttctccatgc gtccatggta aatgaaatgt tttgctttct ttttcgtgct tgcaaagttt 421 ggtagatgaa atttcagaaa gctactgccg taccaaagta caataataag gtgaaaatgg 481 gagtaggaag taatagatgg cattgaatgc ctcatttatt cattattcat gttttcttaa 541 cattttgatt cagttgagtg catgcttttt tgaaaagact aaacattaca agttttaatc 601 atgttggccc gttccagggt gtgcttgcag acaatcacta gacggttggc agactttcca 661 gaagccaatg ctattaagaa aaaatttctc tttaggaagg acacctcgac catcaaacag 721 ttaaaaggac tgtctagcgg ccagaaaata gtactcaatg gatggataga gcagaagcca 781 aaaagagttg ggaaaaattt gatcttcgga cttttaaggg actctaacgg tgacattatc 841 cagttggttg ataacaaatc gttgttgaaa ggctttactt tagaggatgt ggttcaagcc 901 gtaggtatac tctctttgaa gaggaagcta tcaaatgagg acgcagatga gtacgaagtg 961 caattggagg atattactgt gttaaatgca tctaataaaa aaccagcgca aatgcaggat 1021 tttaaattgt cagccatata cccacctgag ttccgctatt tgcaattgag aaatcccaaa 1081 tatcaagatt ttttaaagaa gaggtcatct atctctaagg aaataagaaa ctccttcaac 1141 aattttgatt ttacggaggt cgagacccca atgttattta aagctacccc agaaggcgca 1201 agagagtttc tggttccaac aaggacaaag agatccgatg gtaaaccatc gttttatgct 1261 ctagatcaga gccctcaaca atacaagcaa ctcttaatgg ctagtggtgt caacaaatat 1321 tatcaaatgg caaggtgctt tagagatgaa gatttaagag cagacaggca gcctgagttt 1381 acacaggttg atatggaaat ggcctttgct aattctgaag atgtcatgaa aatcatagaa 1441 aagacagttt ctggggtatg gagtaaattt 
tccaaaaaac gaggattatt gactttagac 1501 agtaagggta cattagtgcc tgcgaaaaag gaaaacggca cagtatctat ctttcgtatg 1561 acctacgaac aagccatgac ctcatatggt attgacaagc cagatttgag agcgccagat 1621 ttgaagatta tcaatttagg cgagttcaat gcctttagtc atttgaacaa aaaatttccc 1681 gtttttgaag taattattct aagaagtgcc ttttcaaata tggaagagta caaagaacga 1741 tggtcgtttc tgacaaataa cagtaattac aattatagag ttccaatagt gctaccaatt 1801 gaaaatgacg aacaagctaa ttcaaattgg tttgagaatt ttcatgcaat tgccacgttt 1861 gaaaacccac atctaataac caaatttctg aaactgaaaa aaggtgacat tgtatgcggt 1921 tgtacgagag agccaaacca ttccattttc gagaatccta ctcccctggg aagattgaga 1981 cagttggtgc tacaaagtga gcatgggaaa aatatctatc atgctgtcaa taaggatgtt 2041 gcctcatgga ttgtggattt cccgttattt tctcccgtta taattgaaga taagtctggt 2101 aaaaaagaaa agcttgcata tccggagtac gaaaaggata gactatgttc cacgcatcat 2161 ccttttacta tggtgaagct taaagactac gaaaaattag aaaagacgcc ggaaaagtgc 2221 ttgggtcggc attatgacct cgtagttaac ggtgtggaac ttggtggtgg ctcaacaaga 2281 attcacgatc caagattaca agactatatt ttcgaagata tcctcaaaat agataatgca 2341 tatgaactat ttggccattt actgaatgct tttgatatgg gaacaccgcc acacgctgga 2401 tttgctattg gttttgatcg tatgtgcgct atgatctgtg aaactgagag tataagggac 2461 gtaatcgcct tcccaaaaag tattaccggg gctgatttgg ttgtcaagag tccaagtgtg 2521 atacctgaaa gcattctgga accttacaat atcaagtata gtaattcaaa aaaatgagca 2581 gaagttgatg ttacctagat gtgagagaat cttagtgtgg ctgcagtatg taggtttttc 2641 t // LOCUS YSCTRMSD 2604 bp ds-DNA PLN 24-JUL-1990 DEFINITION S.cerevisiae MSD gene encoding mitochondrial aspartyl-tRNA synthetase, complete cds. ACCESSION M24418 KEYWORDS aspartyl-tRNA synthetase. SOURCE S.cerevisiae (strain D273-10B) DNA, clone pG94/T1. ORGANISM Saccharomyces cerevisiae Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes; Endomycetales; Saccharomycetaceae. REFERENCE 1 (bases 1 to 2604) AUTHORS Gampel,A. and Tzagoloff,A. TITLE Homology of aspartyl- and lysyl-tRNA synthetases JOURNAL Proc. Natl. Acad. Sci. U.S.A. 
86, 6023-6027 (1989) STANDARD full staff_review COMMENT Draft entry of sequence [1] kindly submitted by A.Tzagoloff, 01-MAY-1989. FEATURES from to/span description pept 565 2541 mitochondrial aspartyl-tRNA synthetase BASE COUNT 842 a 466 c 551 g 745 t ORIGIN 1 cctgtaacaa tagcaatacc ccaaatacct aatgtagttc cagcaagcaa gctaaaaagt 61 aaagcaacaa cataactcac ccctgcatct gcagcttttg cccgggcagc ctgctctgcc 121 tgtgttttct ttaattgagc agtagaccat ttagcagttg catgaatagc tgcagcgtca 181 catcggataa taatgatggc agccattgta gaagtgcctt ttgcatttct agtctctttc 241 tcggtctagc tagttttact acatcgcgaa gatagaatct tagatcacac tgccaatctt 301 gtgcaagctg tggcaaaatc gaccttctcc atgcgtccat ggtaaatgaa atgttttgct 361 ttctttttcg tgcttgcaaa gtttggtaga tgaaatttca gaaagctact gccgtaccaa 421 agtacaataa taaggtgaaa atgggagtag gaagtaatag atggcattga atgcctcatt 481 tattcattat tcatgttttc ttaacatttt gattcagttg agtgcatgct tttttgaaaa 541 gactaaacat tacaagtttt aatcatgttg gcccgttcca gggtgtgctt gcagacaatc 601 actagacggt tggcagactt tccagaagcc aatgctatta agaaaaaatt tctctttagg 661 aaggacacct cgaccatcaa acagttaaaa ggactgtcta gcggccagaa aatagtactc 721 aatggatgga tagagcagaa gccaaaaaga gttgggaaaa atttgatctt cggactttta 781 agggactcta acggtgacat tatccagttg gttgataaca aatcgttgtt gaaaggcttt 841 actttagagg atgtggttca agccgtaggt atactctctt tgaagaggaa gctatcaaat 901 gaggacgcag atgagtacga agtgcaattg gaggatatta ctgtgttaaa tgcatctaat 961 aaaaaaccag cgcaaatgca ggattttaaa ttgtcagcca tatacccacc tgagttccgc 1021 tatttgcaat tgagaaatcc caaatatcaa gattttttaa agaagaggtc atctatctct 1081 aaggaaataa gaaactcctt caacaatttt gattttacgg aggtcgagac cccaatgtta 1141 tttaaagcta ccccagaagg cgcaagagag tttctggttc caacaaggac aaagagatcc 1201 gatggtaaac catcgtttta tgctctagat cagagccctc aacaatacaa gcaactctta 1261 atggctagtg gtgtcaacaa atattatcaa atggcaaggt gctttagaga tgaagattta 1321 agagcagaca ggcagcctga gtttacacag gttgatatgg aaatggcctt tgctaattct 1381 gaagatgtca tgaaaatcat agaaaagaca gtttctgggg tatggagtaa attttccaaa 1441 aaacgaggat tattgacttt agacagtaag ggtacattag 
tgcctgcgaa aaaggaaaac 1501 ggcacagtat ctatctttcg tatgacctac gaacaagcca tgacctcata tggtattgac 1561 aagccagatt tgagagcgcc agatttgaag attatcaatt taggcgagtt caatgccttt 1621 agtcatttga acaaaaaatt tcccgttttt gaagtaatta ttctaagaag tgccttttca 1681 aatatggaag agtacaaaga acgatggtcg tttctgacaa ataacagtaa ttacaattat 1741 agagttccaa tagtgctacc aattgaaaat gacgaacaag ctaattcaaa ttggtttgag 1801 aattttcatg caattgccac gtttgaaaac ccacatctaa taaccaaatt tctgaaactg 1861 aaaaaaggtg acattgtatg cggttgtacg agagagccaa accattccat tttcgagaat 1921 cctactcccc tgggaagatt gagacagttg gtgctacaaa gtgagcatgg gaaaaatatc 1981 tatcatgctg tcaataagga tgttgcctca tggattgtgg atttcccgtt attttctccc 2041 gttataattg aagataagtc tggtaaaaaa gaaaagcttg catatccgga gtacgaaaag 2101 gatagactat gttccacgca tcatcctttt actatggtga agcttaaaga ctacgaaaaa 2161 ttagaaaaga cgccggaaaa gtgcttgggt cggcattatg acctcgtagt taacggtgtg 2221 gaacttggtg gtggctcaac aagaattcac gatccaagat tacaagacta tattttcgaa 2281 gatatcctca aaatagataa tgcatatgaa ctatttggcc atttactgaa tgcttttgat 2341 atgggaacac cgccacacgc tggatttgct attggttttg atcgtatgtg cgctatgatc 2401 tgtgaaactg agagtataag ggacgtaatc gccttcccaa aaagtattac cggggctgat 2461 ttggttgtca agagtccaag tgtgatacct gaaagcattc tggaacctta caatatcaag 2521 tatagtaatt caaaaaaatg agcagaagtt gatgttacct agatgtgaga gaatcttagt 2581 gtggctgcag tatgtaggtt tttc // LOCUS MUSCD14A 1873 bp ds-DNA ROD 24-JUL-1990 DEFINITION Mouse monocyte/granulocyte cell surface glycoprotein homologous (CD14) gene, complete cds. ACCESSION M34510 KEYWORDS CD14 receptor protein; monocyte/granulocyte cell surface glycoprotein. SOURCE Mouse (strain BALB/c) adult liver macrophage cell line J774A.1 DNA, clone pMCD14/5, and cDNA to mRNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 1873) AUTHORS Ferrero,E., Hsieh,C.-L., Francke,U. and Goyert,S.M. 
TITLE CD14 is a member of the family of leucine-rich proteins and is encoded by a gene syntenic with multiple receptor genes JOURNAL J. Immunol. 145, 331-336 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable [or printed] sequence for [1] kindly submitted by E.M.Ferrero, 25-MAY-1990. FEATURES from to/span description pept 173 175 murine CD14 protein (MCD14), exon 1 273 1370 murine CD14 protein, exon 2 sigp 173 175 murine CD14 protein signal peptide 273 314 murine CD14 protein signal peptide matp 315 1367 murine CD14 protein pre-msg 62 1602 murine CD14 mRNA and introns IVS 176 272 murine CD14 protein intron A signal 1433 1438 poly-A signal signal 1583 1588 poly-A signal BASE COUNT 427 a 500 c 478 g 468 t ORIGIN Chromosome 18. 1 aggaagggac agggtgacac cccaggatta cataaattta caggggctgc cgaattggtc 61 gaacaagccc gtggaacctg gaagccagag aacaccatcg ctgtaaagga aagaaactga 121 agcttttctc ggagcctatc tgggctgctc aaactttcag aatctaccga ccatggtgag 181 tcagacagac tgtcttgggg tggaactgga gccaacctga ggaatctcag ggtcctggca 241 ggagtctccc tgtcccccta ctttctcctc aggagcgtgt gcttggcttg ttgctgttgc 301 ttctggtgca cgcctctccc gccccaccag agccctgcga gctagacgag gaaagttgct 361 cctgcaactt ctcagatccg aagccagatt ggtccagcgc tttcaattgt ttgggggcgg 421 cagatgtgga attgtacggc ggcggccgca gcctggaata ccttctaaag cgtgtggaca 481 cggaagcaga tctggggcag ttcactgata ttatcaagtc tctgtcctta aagcggctta 541 cggtgcgggc cgcgcggatt cctagtcgga ttctattcgg agccctgcgt gtgctcggga 601 tttccggcct ccaggaactg actcttgaaa atctcgaggt aaccggcacc gcgccgccac 661 cgcttctgga agccaccgga cccgatctca acatcttgaa cctccgcaac gtgtcgtggg 721 caacaaggga tgcctggctc gcagaactgc agcagtggct aaagcctgga ctcaaggtac 781 tgagtattgc ccaagcacac tcactcaact tttcctgcga acaggtccgc gtcttccctg 841 ccctctccac cttagacctg tctgacaatc ctgaattggg cgagagagga ctgatctcag 901 ccctctgtcc cctcaagttc ccgaccctcc aagttttagc gctgcgtaac gcggggatgg 961 agacgcccag cggcgtgtgc tctgcgctgg ccgcagcaag ggtacagctg caaggactag 1021 accttagtca caattcactg cgggatgctg caggcgctcc 
gagttgtgac tggcccagtc 1081 agctaaactc gctcaatctg tctttcactg ggctgaagca ggtacctaaa gggctgccag 1141 ccaagctcag cgtgctggat ctcagttaca acaggctgga taggaaccct agcccagatg 1201 agctgcccca agtggggaac ctgtcactta aaggaaatcc ctttttggac tctgaatccc 1261 actcggagaa gtttaactct ggcgtagtca ccgccggagc tccatcatcc caagcagtgg 1321 ccttgtcagg aactctggct ttgctcctag gagatcgcct ctttgtttaa ggaacatttg 1381 catcctcctg gtttctgagg gtcctcgtca acgaatcctc tgctttaaat ttattaaaat 1441 cttaatccac gatgtaagga aagaaaggca gtcaagatgg ttcagtgggt aaaagccagc 1501 aaacttgacc cctgatttta accctcagga tccacacgga aggggaaaac tcactcctga 1561 aagttgtcca tctgtgctca caaataaata ttttttaaaa taacaatgtg tttgttggtt 1621 ttgtttttgt ttgggttttg ttgtggtttt gtttgttttg ttttgttttt gagacagtct 1681 ggctatgtat ccttggctgg cctcaaactc ataaagatca agatcggcct gcctctacct 1741 ccaaatgctc tggttaaagg gatgtgcctc catgcccagt tgaagtcatc ctgaaccacg 1801 agtccaggcc actcactctt tactaagatc tttactaagt atctgccata ggtcaggctc 1861 agtgctaaat gag // LOCUS YSTAKYAMT 563 bp ss-mRNA PLN 24-JUL-1990 DEFINITION Yeast sp. mitochondrial adenylate kinase (AKY2) gene, 5' end. ACCESSION M35511 KEYWORDS adenylate kinase. SOURCE Yeast sp. (wild type strain D273-10B, ATCC 25657), cDNA to mRNA. ORGANISM Yeast sp. Eukaryota; Plantae; Thallobionta; Eumycota; Ascomycotina. REFERENCE 1 (bases 1 to 563) AUTHORS Oechsner,U., Magdolen,V., Zoglowek,C., Haecker,U. and Bandlow,W. TITLE Yeast adenylate kinase is transcribed constitutively from a promoter in the short intergenic region to the histone H2A-1 gene JOURNAL FEBS Lett. 
242, 187-193 (1988) STANDARD simple staff_review FEATURES from to/span description pept 561 > 563 adenylate kinase (AKY2) BASE COUNT 153 a 100 c 98 g 212 t ORIGIN 1 taagatcggt tctggtattt taaagaaggc ggaaggaact aaactgcttc tccattgtaa 61 ctatattaag tgtctattag acctttctgt ttttcctttt tatttttcta tatataattt 121 attgggtata taattaacgg taacatatgt catgcatgat ataaattttt tgttgtgtct 181 tcctgttttc tctgttgctc tgtttaggtt cattgggcac tgttgcttac ttgaacatga 241 ttgagtagca aacacattct gtgctttgtt cactttgata gtgtgacgaa attttcacaa 301 ccaaaggaaa aaagctcacg attgcgcgat cctgtcacgg agaactgtcc gcagcagccc 361 gcggcttctg tccttattcg cccatttttt tttgattttc gactttttca ctctggctag 421 ttttattacg catatttgta tatatatata cgcataaatt tctgaaatgg attacctggt 481 ttatcagaaa cgagagcttt aataaagaaa cattaacgtt tctctggtaa agtcaccaca 541 cagcatcaaa tataacagta atg // LOCUS MLMPOLA 1680 bp ss-RNA VRL 24-JUL-1990 DEFINITION Moloney murine leukemia virus pol gene, partial cds. ACCESSION M32803 KEYWORDS . SOURCE Moloney murine leukemia virus RNA, clone 3-1R. ORGANISM Moloney murine leukemia virus Viridae; ss-RNA enveloped viruses; Positive strand RNA virus; Retroviridae; Oncovirinae; Type C oncovirus group; Mammalian type C oncoviruses; Murine leukemia viruses. REFERENCE 1 (bases 1 to 1680) AUTHORS Martinelli,S.C. and Goff,S.P. 
TITLE Rapid reversion of a deletion mutation in Moloney murine leukemia virus by recombination with a closely related endogenous provirus JOURNAL Virology 174, 135-144 (1990) STANDARD simple staff_review FEATURES from to/span description pept < 1 684 pol protein (AA at 1) 688 > 1680 pol protein site 685 687 pol protein in-frame stop codon BASE COUNT 469 a 467 c 435 g 309 t ORIGIN 1 gcttttcccc tcgagcgccc agactgggat tacaccaccc aggcaggtag gaaccaccta 61 gtccactatc gccagttgct cctagcgggt ctccaaaacg cgggcagaag ccccaccaat 121 ttggccaagg taaaagggat aacacaggga cctaatgagt ctccctcagc ctttttagag 181 agactcaagg aggcctatcg caggtacact ccttatgacc ctgaggaccc agggcaagaa 241 accaatgtgt ctatgtcatt catctggcag tctgccccgg atatcgggcg aaagttaggg 301 cggttagaag atttaaagag caagacctta ggagacttag ttagggaggc tgaaaagatc 361 tttaataaac gtgaaacccc ggaagaaaga gaggaacgta tcaggagaga aacagaggaa 421 aaagaagaac gccgtaggac agtggatgag cagaaagaga aagaaaggga ccgcagaaga 481 catagagaga tgagcaagct cttggccact gtagttattg gtcaggaaca ggatagacag 541 gagggagagc ggaagaggcc ccaacttgat aaggatcaat gcgcctactg caaagaaaag 601 gggcactggg ctaaagactg tcccaagaag ccacgagggc cccgaggacc caggccccag 661 acctccctcc tgaccttagg tgactaggga ggtcagggtc aggacccccc ccctgaaccc 721 aggataaccc tcaaagtcgg ggggcaaccc gtcaccttcc tggtagatac tggggcccaa 781 cactccgtgc tgacccaaaa tcctggaccc ctaagtgaca agtctgcctg ggtccaaggg 841 gctactggag gaaagcggta tcgctggacc acggatcgca aagtgcatct agctaccggt 901 aaggtcaccc actctttcct ccatgtacca gactgcccct atcctctgct aggaagagat 961 ttgctgacta aactaaaagc ccaaatccac tttgagggat caggagctca ggttgtggga 1021 ccaatgggac agcccctgca agtgctgacc ctaaacatag aagatgagta tcggctacat 1081 gagacctcaa aagagccgga tgtttctcta gggttcacat ggctttctga ttttcctcag 1141 gcctgggcgg aatccggggg catgggactg gcagttcgcc aagctcctct gatcatacct 1201 ctgaaggcaa cctctacccc cgtgtccata aaacaatacc ccatgtcaca agaagccaga 1261 ctggggatca agccccacat acagagactg ttggaccagg gaatactggt accctgccag 1321 tccccctgga acacgcccct gctacccgtt aagaaaccag ggactaatga ttacaggcct 
1381 gtccaagatc tgagagaagt caacaagcgg gtggaagaca tccaccccac cgtgcccaac 1441 ccttacaacc tcttgagcgg gctcccaccg tcccaccagt ggtacactgt gcttgactta 1501 aaggatgcct ttttctgcct gagactccac cccaccagtc agcctctctt cgcctttgag 1561 tggagagacc cagagatggg aatctcagga caattaacct ggaccagact cccacagggt 1621 ttcaaaaaca gtcccaccct gtttgatgag gcactgcaca gagacctagc agacttccgg // LOCUS RHIISRA 1350 bp ds-DNA BCT 24-JUL-1990 DEFINITION Plasmid RP4 of Rhizobium transposon Tn1 transposable element ISR1 open reading frames A, A1, A2 and A3. ACCESSION X06616 KEYWORDS transposable element. SOURCE Plasmid RP4 transposon Tn 1 of Rhizobium sp. DNA. ORGANISM Rhizobium sp. Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Rhizobiaceae. REFERENCE 1 (bases 1 to 1350) AUTHORS Priefer,U.B., Kalinowski,J., Rueger,B., Heumann,W. and Puehler,A. TITLE ISR1, a transposable DNA sequence resident in Rhizobium class IV strains, shows structural characteristics of classical insertion elements JOURNAL Plasmid 21, 120-128 (1989) STANDARD simple staff_review FEATURES from to/span description ORF 94 360 ORF A3 ORF 433 714 ORF A1 ORF 787 1272 ORF A2 ORF 900 64 (c) ORF A BASE COUNT 258 a 415 c 470 g 207 t ORIGIN 1 acggaaattg ctcatcagct cagtattgcc tgacgtgacc ccctgaaact cctccaggaa 61 tagctagagt ccgcccaacg aaggacggac agaatgaagc gatcgaggtt cacggaagag 121 cagatcatcg ggatcctgcg ggagcaggag gccggtgtag cgacggcgga ggtgtgccga 181 cgccacgggg tcagctcggc gaccttttac aaatggaagg ccaagtttgg cgggctcgac 241 gtgtcggagg ctcggcggct caaggcgctc gaggatgaga acgcccggct gaagcggatg 301 ctggcggacg cgatgctgga caacgtggcc ctgaaggacc tgctgggaaa aaagtggtga 361 cgcccgccgg ctatcgcgag gcggctggcc atctgcaggc cgcctacgag atgagcgaaa 421 ggcgggcgtg ccgtgttctg ggcgtcgatc ggacgagcgt gcgctatcag gcgacgcgcc 481 cggacgacgc gctctgcgcg accggctgaa ggccctggcc aggaacgtcg acggttcggc 541 tatcgtcgcc tgcacgtgct gctgcggcgc gagggccatg cggtcaacag gaaacgggtc 601 cagcggatct atcgcgacga gcaactgacg gtgcgccggc gcgcgccgca agcgagcgat 661 gggcacgcgg cgaccgtgac ctgccgctgg cgcccaacca 
gcgctggtcg ctgacttcgt 721 cgcagaccag ctgacggacg ggcggcgctt ccgcatcctg acggtgatcg ataactgtac 781 gcgcgagtgc ttggcgctgg tggccgacac ctcgctgtcg ggcgcgcggg tcgtgcggga 841 gttggacgcc gtcatccggc agcgagggcg gcccgacacc atcgtcagcg acaacggcac 901 ggagtacacc tcgaacgcgg tcctggcctg ggcgacgaca ccggcgtcgg ctggcactac 961 atcgcgccgg gcaagcccca gcagaacggc ttcaacgaaa gcttcaacgg gcggctgcgt 1021 gacgagctgc tgaacgagac gctgttccgc tcgctgccgc cgctcgcgcc gtgcggaggc 1081 gtggcgacgc gattacaacg agcagcgtcc gcactcgaag ctcgggtggc tgacgccgca 1141 ggcttatgca gaagcgctca ccgcacacaa cggccggcct gctgcgtggt tgatggctgc 1201 gcagaccggc ctcttgccaa cccagccgat cacagctcag atcaacctag gactctcgtt 1261 atggctggat gagaaacggg ggtcacgtca tgcccgctcc acggtttata aaattcttga 1321 agacgaaagg gcctcgtgat acgcctattt // LOCUS HAMACAPT 1837 bp ss-mRNA ROD 24-JUL-1990 DEFINITION Hamster N-acetylglucosamine-1-phosphate transferase (GPT) mRNA, partial cds. ACCESSION J05590 M22755 KEYWORDS N-acetylglucosamine-1-phosphate transferase; glycosyl transferase; tunicamycin-resistance. SOURCE Chinese hamster ovary cell line CHO-K1, cDNA to mRNA, clone TRG-[4,22]. ORGANISM Cricetulus longicaudatus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Cricetidae; Cricetinae; Cricetini. REFERENCE 1 (bases 889 to 960) AUTHORS Lehrman,M.A., Zhu,X. and Khounlo,S. TITLE Amplification and molecular cloning of the hamster tunicamycin- sensitive N-acetylglucosamine-1-phosphate transferase gene: The hamster and yeast enzymes share a common peptide sequence JOURNAL J. Biol. Chem. 263, 19796-19803 (1988) STANDARD full staff_review REFERENCE 2 (bases 1 to 1837) AUTHORS Zhu,X. and Lehrman,M.A. TITLE Cloning, sequence and expression of a cDNA encoding hamster UDP-GlcNAc: dolichol phosphate N-acetylglucosamine-1 phosphate transferase JOURNAL J. Biol. Chem. (1990) In press STANDARD full staff_entry COMMENT Draft entry and sequence for [1] kindly submitted by M. Lehrman, 04-APR-1989. 
Draft entry and computer-readable sequence for [1] kindly submitted by M.Lehrman, 12-JUN-1990. FEATURES from to/span description pept 148 1374 N-acetylglucosamine-1-phosphate transferase (GPT) binding 67 79 dolichol binding site binding 222 234 dolichol binding site BASE COUNT 363 a 515 c 418 g 541 t ORIGIN 1 tcaggtcata gttgggtttt gttcctgctg gcgcggaata ctcttgcgtg taggaagcgg 61 ctctaccgac tgccgaggcc tctgggattg tgccgggttg ctagttagct cagttggcgg 121 cggagcggcg gcccctggcg ggtcaccatg tgggccttcc cggagttgcc gctgccgctg 181 ctggtgaatt tgttcggctc gctgctggga tttgtggcta ctgtgaccct catccctgcc 241 ttccgtagcc actttatcgc cgcgcgcctc tgtggccagg acctcaacaa gctcagccgg 301 cagcagatcc cagaatccca gggagtgatc tgcggtgctg ttttccttat catcctcttc 361 tgcttcatcc ctttcccctt cctgaactgc tttgtggagg agcagtgtaa ggcattcccc 421 caccatgaat ttgtggccct gataggtgcc ctccttgcca tctgctgcat gatcttcctg 481 ggcttcgctg atgatgtact caatctgccg tggcgccata agctgctgct gcccacagct 541 gcctctctac ctctcctcat ggtttacttc actaactttg gcaatacaac cattgtggta 601 cccaagccct tccgctggat tcttggcctg catttggact tgggaatcct atactatgtc 661 tacatgggac tgcttgcggt gttctgtacc aatgccatca acatcctagc aggaattaat 721 ggcctagagg ctggtcagtc actagtcatc tctgcttcta tcattgtctt caacctggta 781 gagctggaag gtgattatcg ggatgatcat gtcttttccc tctacttcat gataccattt 841 ttttttacca ccttgggatt gctataccat aactggtacc catcacaggt gtttgtggga 901 gataccttct gttattttgc tggcatgacc tttgccgtgg tgggaatctt gggacacttc 961 agcaagacca tgctactctt ctttattcca caagtgttca atttcctcta ctcgctgcct 1021 cagctccttc acgccatccc ctgccctcga caccgcatac ccagactcaa tccgaagacg 1081 ggcaaactgg agatgagcta ttccaagttc aagaccaaga acctctcttt cttgggcacc 1141 tttattttaa aggtagcaga gcgcctccag ctagtgacag ttcaccgagg cgagagtgag 1201 gatggtgcct tcactgaatg taacaacatg accctcatca acttgctact caaaatcttt 1261 gggcccatac atgagagaaa cctcacactg ctcctgctgc ttttgcagat cctgagcagc 1321 gctgtcacct tctccattcg ataccagctt gtccgactct tctatgatgt ctgagttccc 1381 tgaagactga ctgcccatag cctccgttgg acctcaccca ggaccaacct gtctggtcca 1441 
agactgcctt ctggtccagg cctctccaac tgttggtttt ttcagttttg ttttcagctt 1501 ctcccatcac ctgtaatgat tattggcctg gaccttattg gactttaaag tccattagtt 1561 gcactttgcc cctggctttc ttcagcttgc tactcttccc ctttctgtcc catctgcagc 1621 accataaggg ggatgtagca gctcttatgc aaatacagct caactttcag agccctgcat 1681 ttgctggacc ttgagagaga acctgggcta tgtgctagag ttagggccct atactccaag 1741 gtaacctcac atttgacttt aaaattaagt gttctgatta ggaagatcag aggcagggcc 1801 atgtgctcaa aatggtgaca ataaaggatt gtctttt // LOCUS HS4NA3ABC 10089 bp ds-DNA VRL 24-JUL-1990 DEFINITION Epstein-Barr virus nuclear antigen EBNA-3A, EBNA-3B, and EBNA-3C genes, complete cds. ACCESSION M34440 KEYWORDS nuclear antigen. SOURCE Epstein-Barr virus type 2 (isolate AG876) DNA. ORGANISM Epstein-Barr virus Viridae; ds-DNA enveloped viruses; Herpesviridae; Gammaherpesviridae. REFERENCE 1 (bases 1 to 10089) AUTHORS Sample,J., Young,L., Martin,B., Chatman,T., Kieff,E., Rickinson,A. and Kieff,E. TITLE Epstein-Barr virus type 1 (EBV-1) and 2 (EBV-2) differ in their EBNA-3A, EBNA-3B and EBNA-3C genes JOURNAL Unpublished (1990) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by J.T.Sample, 23-MAY-1990. 
Kieff Laboratory Harvard Medical School 75 Francis Street Boston, MA 02115 FEATURES from to/span description pept 189 524 nuclear antigen EBNA-3A, exon 1 610 3051 nuclear antigen EBNA-3A, exon 2 pept 3237 3593 nuclear antigen EBNA-3B, exon 1 3672 6155 nuclear antigen EBNA-3B, exon 2 pept 6277 6636 nuclear antigen EBNA-3C, exon 1 6711 9560 nuclear antigen EBNA-3C, exon 2 IVS 525 609 EBNA-3A intron IVS 3594 3671 EBNA-3B intron IVS 6637 6710 EBNA-3C intron BASE COUNT 2495 a 2837 c 2619 g 2138 t ORIGIN 1 gaattccaga atctcaacat tgaaaaaacc aggatcatca cgcgtgagtt ggataaggct 61 ctggatggtg tactgacaca caagcaaggc tgcctccatt gtctcagcac cgatttctag 121 gcagcaccct ctttaatagg tgcaaggggg gtgcggtgtt ggtgagtcac actttcgttg 181 cagacaaaat ggacaaggac aggccgggtc tcccggcccc ggatgacaac atagaagaag 241 taccatctac ctcgggtgtt caggaacggg cgtctgaggg agattgggaa aatgtcctca 301 tagagatatc agatagcagc tcagaagagg aagcagaaga tgcccacctg gagccatccc 361 agaggggtaa aaagagaaaa cgggtcgatg atgatgccgg tggttcagct ccagcacaac 421 acgtgccccc ccctcagctg gatcaccctg gtcgagaagc cattctctac aggtttccgc 481 tagatttaag aaggtttatt caagcaattg gagccgcagc tacggtgagc ttccctatgg 541 cccaggtgtg tgatgtgtgt ttttgcccat cgcacaacaa ggtaagtgat ttgttgccgt 601 tggtttcagc accccgacac gcgagccata gaccagtttt tcggatccca gatttcaaat 661 accgacctgt acgtaatgta tgccatggcc attcgacagg ccattagaga tcgtcggagg 721 aatccagctt ctcgtagaag tcaggtcaaa tggagaatga ccaccctggc cgctggttgg 781 cctatgggtt accaggcata cagtagctgg atgtacagct acaccgatcc ccaggtgact 841 gccacgatca tacatctgca ggcgactctt gggtgcgcaa gtggccgtag gtgtcatgtg 901 accttttctg ccggcacctt taggccgccg cgatgtagtc ccggggatcg ccagtggttg 961 tatgttcaga gccgcgtggg tgaccttgtg cagagttcta atccatgcta cagtattttc 1021 tttgactaca tggccataca taggagcctc acgaaaatct gggatgaagt ggtaacacct 1081 gatcagcgtg ttacatttat ggaattcctg ggctttttgc aacgcacgga gttggtctac 1141 atcaagagct ttgtcagcta tgccttgggc accactagta tcgaaacacc gtggatggat 1201 gagaatccta gcacagagac ggcacaggct tggaatgccg gcttgctccg ggggcgtgcg 1261 tacgggcaag 
acttgcttag aactgaagga gaacatggcg aaggtgctac ctgtgaaaca 1321 cgggaagaaa gtgaggacac agagagcgat ggggatgatg aagaacttcc tcgtgtagtg 1381 tccagggatg gaactaagca cagacgaccc cctatatttt taagacgcct gcacaggttg 1441 ctgttgatga gagcgggcaa aggaaaggaa cgggccaggg agacactggc gaaggcccct 1501 aggcgcactt atggcacacc taggccgcca gttcagaaac caagaccaga ggtcccgcaa 1561 agctatgaga cagctaccag tcacgggtcg gcgcaagtcc cagaaccccc acccacccac 1621 ccattacatc agcaacacag catggccccg tgtatggtag ctcagaaccc acgtgcaccc 1681 ttaggggacc aactcccagg tgttcctaaa gatggacgag gggcgtgtgc accggtaccc 1741 gccctggccg ggcctattgt ccggccctgg gagtcatccc tgttacagtc tccgggaagg 1801 gcctttgcac ccgttagccc acaacccatg ccagtagaac ccgtccctgt ccctactgtg 1861 gcacttgagc gaccagtttg tcccgcacct cctgagattg ctatgcaggg cccgggggaa 1921 ccttctggca ttaaacgcac acgggagcgt tggaggcccg caccgtggac gccaaaccca 1981 ccccgctctc ccagtcagat gtccgtgcgt gaccgtctgg ctcgtttgcg tgctgaggca 2041 caggcccgtc aggctagtgt tgaggtgcag cccacccagt tgacccaagt atcccctcag 2101 caaccaatgg agaggccgtt ggaaccagag cagcagatgt tccctggttc cccctttagc 2161 caggtcgctg atgttgcccg ggaatctggg gtacctgcaa tgcagcctca gtactttgac 2221 ctccccttaa ctcaacccat tagccagggg gcacccgcgg ccccgttgag ggctagtatg 2281 ggcccggtac ctccggtacc ggcaacacag ccacagtatt ttgacatccc cttaactgaa 2341 cccattaacc agggggcatc cgcggcccat tttctccctc agcaaccaat ggaggggccg 2401 ttggtacccg agcggtggat gttccaaggt gccaccctga gccagagtgt taggccaggg 2461 gtagcgcagt cacaatattt tgacctcccc ttaactcaac ccattaacca tggggcaccc 2521 gcagcccatt tcctccatca gccaccaatg gaggggccgt gggtacccga gcagtggatg 2581 ttccaaggtg ccccccctag ccaaggcact gacgtggtcc aacatcagct ggatgatttg 2641 gggtatccac tccatgatct caaccatccc ggggttcccg tgtctcctgc cgttaaccaa 2701 tatcatttca gccaggctgc ctttgggtta cctattgatg aggatgagag tggcgagagg 2761 tccgatacct ccgagccgta tgaagctctt gatttgtcaa tccatggcag gccctgccct 2821 caggcccccg aatggcctgt tcaaggggag ggtggccagg atgccaccga ggttcttgat 2881 ttgtcaatcc atggcaggcc ccgccctcgg acccccgagt ggcctgttca aggggagagt 2941 ggccagaatg tcacagacca 
tgaacctaga agggtggtgg tgtcagctat tgttcacatg 3001 tgtcaggatg acgagtttcc ggatctacaa gatcctccag atgaggccta agcaaaggtg 3061 tagaagtgtg tccccctcca ttccacccac tgatatacgc ccgacaataa agttgatgat 3121 attgaattcc acacctactc gtgtttgtga ttttatttca tattccatga gagagacctc 3181 gcatatttgc agagggtcac tgaaacattt tatcttaaaa cagttacacc tgaaaaatga 3241 agaaagcgtg gctcagcaga gcacagcaag ccgatgccgg gggggcatct ggctccgagg 3301 acccaccaga ttatggagat caaggtaatg tgcaacaggt gggatctgat cctatttcac 3361 ctgcgattgg cccctttgaa ctctctgcgg ccagtgagga tgatcctcaa tctgggccag 3421 tggaagagaa tttagatgcc gctgcaagag aggaagagga acctgatgag caggagcaca 3481 atggtggtga tgatcccttg gaagtccata ctcggcagcc tagatttgtg gatgtgaacc 3541 caacgcaggc tccagtgatc caactagtcc atgctgttta tgattccatg ttggtaagag 3601 gcacctagaa catttccaga tgtttagctt ggatattttg gccagtctta atttattgtc 3661 attggtttca gcaatcggac ctccggtctc taggcagttt attccttgag caaaacctga 3721 acatcgaaga gtttatatgg atgtgcatga cagtgcgtca cagatgtcag gccatcagac 3781 aaaaaccatt accgattgat aagcagaggc ggtggaagct cctgtcacct tacagaacct 3841 ggcgtatggg ttaccgtacg caaaccctca atgtaaacag ttttgagaca gggggagata 3901 aagtccaccc actccttgtg actgctacgc taggatgtga agagggcctg cggcatgcaa 3961 taacttacag tgctggcatt gtacagctac cacgaatgtc agaccaaaac caaaagatag 4021 aaacagcctt cttgatggca cgtcgtgcta ggtcactttc ggcagaaaga tatactttgt 4081 tctttgattt agtatcctct ggaaacaccc tgtatgccat atggattggg ctgggcacga 4141 gaaaccgagt tgcatttgtc gagtttgtag gatggctatg taagaaggac cacactcata 4201 tacgtgaatg gttccgccag tgcaccggga gaccctcacc atccaagcca tggatgagag 4261 cgcatcccgt cgccgttcct tatgacgatc cattaacaag tgaggagact gacctggcct 4321 atgcccgtgg actggccatg agtatcgagg ctgctagact gccagatgat ccaataattg 4381 ttgaggatga cgatgaaagt gaggaaattg aagataaatg tgataaggat gaagaggaaa 4441 gtggaacgga agatgttaca agcataccgc aaacactgcc gcacagtcca acagtatacg 4501 gcaggccctc ggtgttttac cgaaagccag atactaaatc aaccaaaaaa tgcagggcca 4561 tagtgactga ccttagtata atcaaggtca ttgaagatga acacagaaag aagaagacag 4621 ccagaacaga gcaaccaaga gccaagcctg 
attcccctgc ccccacagtg gtccttcggc 4681 gaccacccac gcaaaaggtg actggccctg ccggttcact gagtgtccag gctcagctgg 4741 agccatggca acctttgtcc tggccacatg agacaagagt tatacttcac ggaccaccca 4801 cgcagggtga ccaagcacac ggttccatgc tagaccttct tgaaaaggac gaccagcaca 4861 tggagcagca ggttatggca accctactgc caccagaacc acaccagccc cggtctggga 4921 gaagagcccc ttgtgtctac accgctgacc tagacataga aagtgatgag cccgccacgt 4981 cagagccggt tcttgatcag ctactgcccg ccccaggact tggacctctt gcaattcaac 5041 cattaacgtc ccccaccacg tctcaactcc gcagttcagc accgagccac gcacaaactc 5101 catggccggt tacccaccca agtcagactc caggtggccc aacgacacag tccctggcac 5161 cggaaacaga agccccgcgc cagtggccaa tgccactgcg acctatccct ctgcacccct 5221 tgcggatgca gccaatatca tttaatcctg cagtgagacc cactccccat cagccacctc 5281 aggtggagcc cactttctat cagtccactt gggtgaaacc ccctcaacaa taccagcctc 5341 agatggggca cattccatat cagccccgac caacgggtca ctctactatg ctccggcccc 5401 agtgggcacc caccaccatg cagccaccac caagggcgcc cactcccatg ccgccacctc 5461 aggggccacc caccgctatg cagaggcctc agggggcgcc cactcccatg ccgccacctc 5521 aggggccacc caccgccatg cagaggcctc ggggggcgcc cactcccatg ccgccacctc 5581 aggggccacc caccgccatg cagctgtcac caagggcact taccggccag aaggggccag 5641 caaagcacat tctgcgccag ttgttaacgg ggggcgtcaa gagtgggaga ccatcactta 5701 agtttaaggc tgcccttgag cgtcaagccg ctgcgggctt gcgaccttca ccagggtctg 5761 gaacgggtgc caagattgtg caggcacctg ttttctatcc acccgtccta cagcccatac 5821 aagttatgtg gcaagtgggt tcctcaaagg ccgtggccgc ctcaacggtg acacaggcac 5881 ccacggaata taccggggaa aggaggttag gggggcctat gtctcccact gatattccgc 5941 cgtctaaacg ggtgaagaaa aaggcctatc cagagcgcaa gacgccgcat ggggggccct 6001 cacactcttc caccgttatg tgggagaatg tcagccaggg acaacagcag actctggagt 6061 gcggaggaac tgataaacag gaaaggaaca tgttggggat gggggacatt gcagtttctt 6121 ccccttcctc ttctgaaaca tcgaatgatg agtgatttca accacgtaac aagaactggg 6181 atgaaccctc gggcagtatc agactgcggg gaggggcagt gataagtcat gacaatttta 6241 gatgaggtag acattttgca tattttcaga cccaccatgg aatcatttga aggagagggg 6301 gactctatac agtcacctga caatgcgcgg ggagatgatg 
tacagaatac tggtgagcat 6361 attcaggacc ccgggccggg gccttcaacc ggcggggctt ctgagggatt ggtgcagaac 6421 gagccggact caagagatca acagtcccgg gggcagagaa ggggtgatga aaacagaggc 6481 tggatgcagc gcatcaggcg aaggcggaga agacgggccg cattgtccgg ccatctttta 6541 gacatggaag acaatgtgcc gccgtggttt cctccacacg atatcacacc atatgtcgca 6601 aggaatatca gggacgctgc ctgccaggct gtcaaggtga gcatgcctct aactgggttc 6661 atgggggcca tctaaggccc acgtgtgacc catgtttcca ttaattttag cactcgcacc 6721 tgcaagcgct atcaaacctg atactcgata gtggtttaga cacacaacac ctcttgtgct 6781 tcgtgatggc agccaggcag cgtcttcagg acattcgacg tggacccttg gttgtagagg 6841 gaggtgttgg ttggcgacat tggcttctga catctcccag ccgatcctgg tccatgggat 6901 atcgcacagc aacactacgc acattaactc ccgtgcctaa cagggttggg gctgacagca 6961 tcatgttaac tgccacgttt ggatgccaaa atggggcact agctataaac accttctccg 7021 ccaccgtgtg gataccaccc cctgctggac caagagagca agaaagatac gctcgggaag 7081 ccgaggtgcg cttccttcgt ggtaaatggc agaggcggtt ccgaagaatc tttgatttga 7141 tagaactgtg tggctctctg caccacgtct ggcaaaacat gctgcagacc gaggagaacc 7201 ttttagattt cgtgcgtttc atgggtgtca tgtccagctg caatagttca tctgtgaatt 7261 actggtttca caagacaatc ggaaacttta agccatatta cccgtggaat gcaccaccta 7321 atgaaaatcc atatcacgca cggagaggca taaaagaaca agtaatccag aaagcatttc 7381 taaaggcaca aagacagggt ttatcaatgt tagcaacggg aggtggaccc agaggtgatg 7441 ctactagtga aacgagcagc gatgaggata ccggtagaca gggttcggat gtggagctag 7501 agtcatcgga cgatgagctg ccatatatcg atcccaacat ggagccagtt cagcagaggc 7561 ccgtcatgtt tgtgagccgt gtgcctgtaa ggaaaccgag gacactgccg tggcctacac 7621 ccaagacgca cccagtgaag cgcacaattg ttaagacctc ctatagatct gatgaggcag 7681 aagaagcaca gagcacccct gaaaggccgg gcccttccaa acaaccatca gagcccgtgg 7741 agcccgccca cacaacccca gcggggaggt caacggtgat tctccacgaa ccacctcgag 7801 agcccgaagc tgtttccttc aagcctccgc caccaccttc ccggaggaga aggggagcgt 7861 gtgttgtata tgacgatgat atcatagagg tgattgatgt tgaaaccacc gaagaggaga 7921 cgacgtcaat gcaaagacag cctccgctcg ggcaacaacc gccccccccc gtgatttcta 7981 ccgggagtgc catgtcttcc agtcacacgg atccatcagt aacacagcca 
agtaagccac 8041 atcggaaacc tcaagacgga tttcaacgtt caggccgacg tcaaaaacga gccatgcctc 8101 ccccagtgag tccttctgac gctgggcctc cttccaccag gcctcgtgtc atggcgcctc 8161 cttccaccgg gcctcgtgtc atggcgactc cttccaccgg gcctcgtgac atggcgcctc 8221 cttccaccgg gcctcgtgac atggcgcctc cttccaccgg gcctcgtgac atggcgcctc 8281 cttccaccgg gcctcgtgac atggcgccta ccgtcgtaca tatgtttacg agggagcgcc 8341 tgctcaccca gtccaccggc cctgcacctc ggagcttctg ggaaatgcgg gccggccgtg 8401 atgctcccaa aattcaacaa gaaccaagtt cacaacagca gccagccact cagtctacac 8461 cgccttgcca atcatgggtc ccatctgtct atgtcctccc ggcagtggat gctggtaatg 8521 cccagcccct acaaatatca cacttgagct ccatgtcgcc cacgcagccg atatcgcatg 8581 aagaacaacc ccggtatgag gatcctgaca ctcctctgga tttgagttta catccagata 8641 ctgcaactct gccgcccacc caggatttat accctggacg cgaggatctg caggccaccc 8701 aggctccata cccgggatac gaggagccac ggccccccca ggctcctttt gtaggtgact 8761 atggttttgt tcaaattccc tcggctcaat gggagccaca cccctctcag ggcacatacc 8821 agggccacat tgatccccag cttccggctg ctctagattt agggccagag cagccccggt 8881 tcccacaaga tccatatgtg tattccgggg gccaattgtc ttcatgtcca ggttatgcag 8941 gtccctggcc ctcgaggcct caacatccaa gatataggca caccttggca ttgtggcctc 9001 gagaacccag gcatgggcac tctcagggcc catggaaacc ctggtcagca catctcccac 9061 ctcagtggga tggatctgca ggacatggcc aggatcaggt ctcccagttc ccacatctgc 9121 attcggagac aggcccacca cgtcttcaac tttcatcggt gccacaggtg ctgtacccgc 9181 agccactggt ctcatcctct gcaccatcgt ggtcatctcc ccagccccga gcccccatac 9241 gccccattcc aacaagattc ccccctcccc ctatgccgtt acaagatagc atggccgtgg 9301 ggtgcgactc atcaggtaca gcatgcccaa gcatgccctt tgccagtgat tacagtcaag 9361 gtgcatttac cccactggac attaatgccc ccacgccaaa aagtcctcga gtagaagaaa 9421 gttctcacgg acccgcccgg tgttcccaag ctacttctga agcacaggag attcttagtg 9481 acaattctga gatctccgtg ttcccaaaag atgcaaagca gactgactat gatgcatcca 9541 ctgaaagtga gctagattaa ggggatccaa ggtgacccct gttagctatt tgatctttga 9601 ctgacacata aacatggttt aaggaatgaa cactcatggt gtgagactgg aactgtacta 9661 aatttgctga catatgtaca atgagagcca aaaatttgat aaaccttaaa agtcccccca 
9721 tctaatgatg tccagttccc ttctcccacc ctgtacaccc cgacccaaag ggactcaatg 9781 gcattcagat ttctagttac cacaggtaga atatcgggcg ttggcccata aaaataagtg 9841 catggatata gctctgcaca ggcttggaaa cacccattcc aggtgtgctt ctttttggtg 9901 aaataaaaac agcgttcttt atatgaaaat gtgtattctc ttgtgttgca gtatgtacag 9961 ttagctttgg tatagttttt gggtacctga aacgtgtgca gggtgggtgt ccaatgtggc 10021 agttttatct ctttgtcccc atactcctgc tcggccgtct tgttaaagtt aaccggcggt 10081 ggaggatcc // LOCUS HUMPAPB 3061 bp ss-mRNA PRI 24-JUL-1990 DEFINITION Human prostatic acid phosphatase mRNA, complete cds. ACCESSION M34840 KEYWORDS glycoprotein; prostatic acid phosphatase; secretory protein. SOURCE Human prostate, cDNA to mRNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 1199) AUTHORS Van Etten,R.L., Davidson,R., Stevis,P.E., MacArthur,H. and Moore,D.L. TITLE Covalent structure, disulfide bonding and identification of reactive surface and active site residues of human prostatic acid phosphatase JOURNAL Unpublished (1990) STANDARD full staff_entry REFERENCE 2 (bases 1 to 3061) AUTHORS Van Etten,R.L., Davidson,R., Stevis,P.E., MacArthur,H. and Moore,D.L. TITLE Covalent structure, disulfide bonding and identification of reactive surface and active site residues of human prostatic acid phosphatase JOURNAL Unpublished (1990) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by R.L.Van Etten, 08-JUN-1990. Chemistry Department Purdue University West Lafayette, IN 47907 FEATURES from to/span description pept 15 1175 prostatic acid phosphatase precursor (EC 3.1.3.2) /hgml_locus_uid="LA0168F" /nomgen="ACPP" /map="3q21-qter" sigp 15 110 prostatic acid phosphatase signal peptide matp 111 1172 prostatic acid phosphatase BASE COUNT 917 a 642 c 681 g 821 t ORIGIN Chromosome 3q21-qter. 
1 cggctctcct caacatgaga gctgcacccc tcctcctggc cagggcagca agccttagcc 61 ttggcttctt gtttctgctt tttttctggc tagaccgaag tgtactagcc aaggagttga 121 agtttgtgac tttggtgttt cggcatggag accgaagtcc cattgacacc tttcccactg 181 accccataaa ggaatcctca tggccacaag gatttggcca actcacccag ctgggcatgg 241 agcagcatta tgaacttgga gagtatataa gaaagagata tagaaaattc ttgaatgagt 301 cctataaaca tgaacaggtt tatattcgaa gcacagacgt tgaccggact ttgatgagtg 361 ctatgacaaa cctggcagcc ctgtttcccc cagaaggtgt cagcatctgg aatcctatcc 421 tactctggca gcccatcccg gtgcacacag ttcctctttc tgaagatcag ttgctatacc 481 tgcctttcag gaactgccct cgttttcaag aacttgagag tgagactttg aaatcagagg 541 aattccagaa gaggctgcac ccttataagg attttatagc taccttggga aaactttcag 601 gattacatgg ccaggacctt tttggaattt ggagtaaagt ctacgaccct ttatattgtg 661 agagtgttca caatttcact ttaccctcct gggccactga ggacaccatg actaagttga 721 gagaattgtc agaattgtcc ctcctgtccc tctatggaat tcacaagcag aaagagaaat 781 ctaggctcca agggggtgtc ctggtcaatg aaatcctcaa tcacatgaag agagcaactc 841 agataccaag ctacaaaaaa cttatcatgt attctgcgca tgacactact gtgagtggcc 901 tacagatggc gctagatgtt tacaacggac tccttcctcc ctatgcttct tgccacttga 961 cggaattgta ctttgagaag ggggagtact ttgtggagat gtactatcgg aatgagacgc 1021 agcacgagcc gtatcccctc atgctacctg gctgcagccc tagctgtcct ctggagaggt 1081 ttgctgagct ggttggccct gtgatccctc aagactggtc cacggagtgt atgaccacaa 1141 acagccatca aggtactgag gacagtacag attagtgtgc acagagatct ctgtagaaag 1201 agtagctgcc ctttctcagg gcagatgatg ctttgagaac atactttggc cattaccccc 1261 cagctttgag gaaaatgggc tttggatgat tattttatgt tttagggacc cccaacctca 1321 ggcaattcct acctcttcac ctgaccctgc ccccacttgc cataaaactt agctaagttt 1381 tgttttgttt ttcagcgtta atgtaaaggg gcagcagtgc caaaatataa tcagagataa 1441 agcttaggtc aaagttcata gagttcccat gaactatatg actggccaca caggatcttt 1501 tgtatttaag gattctgaga ttttgcttga gcaggattag ataagtctgt tctttaaatt 1561 tctgaaatgg aacagatttc aaaaaaaatt cccacaatct agggtgggaa caaggaagga 1621 aagatgtgaa taggctgatg gggaaaaaac caatttaccc atcagttcca gccttctctc 1681 aaggagaggc aaagaaagga 
gatacagtgg agacatctgg aaagttttct ccactggaaa 1741 actgctacta tctgttttta tatttctgtt aaaatatatg aggctacaga actaaaaatt 1801 aaaacctctt tgtgtccctt ggtcctggaa catttatgtt ccttttaaag aaacaaaaat 1861 caaactttac agaaagattt gatgtatgta atacatatag cagctcttga agtatatata 1921 tcatagcaaa taagtcatct gatgagaaca agctatttgg gcacaacaca tcaggaaaga 1981 gagcaccacg tgatggagtt tctccagaag ctccagtgat aagagatgtt gactctaaag 2041 ttgatttaag gccaggcatg gtggtttacg cctataatcc cagcattttg ggactccgag 2101 gtgggcagat cacttgagct caggagctca agatcagcct gggcaacatg gtgaaacctt 2161 gtctctacat aaaatacaaa aacttagatg ggcatggtgc tgtgtgccta tagtccacta 2221 cttgtggggc taaggcagga ggatcacttg agccccggag gtcgaggcta cagtgaccca 2281 agagtgcact actgtactcc agccagggca agagagcgag accctgtctc aataaataaa 2341 taaataaata aataaataaa taaataaaaa caaagttgat taagaaagga agtataggcc 2401 aggcacagtg gctcacacct gtaatccttg cattttggaa ggctgaggca ggaggatcac 2461 tttaggcctg gtgtgttcaa gaccagcctg gtcaacatag tgagacactg tctctaccaa 2521 aaaaaggaag gaagggacac atatcaaact gaaacaaaat tagaaatgta attatgttat 2581 gttctaagtg cctccaagtt caaaacttat tggaatgttg agagtgtggt tacgaaatac 2641 gttaggagga caaaaggaat gtgtaagtct ttaatgccga tatcttcaga aaacctaagc 2701 aaacttacag gtcctgctga aactgcccac tctgcaagaa gaaatcatga tatagctttc 2761 catgtggcag atctacatgt ctagagaaca ctgtgctcta ttaccattat ggataaagat 2821 gagatggttt ctagagatgg tttctactgg ctgccagaat ctagagcaaa gccatccccc 2881 ctcctggttg gtcacagaat gactgacaaa gacatcgatt gatatgcttc tttgtgttat 2941 ttccctccca agtaaatgtt tgtccttggg tccattttct atgcttgtaa ctgtcttcta 3001 gcagtgagcc aaatgtaaaa tagtgaataa agtcattatt aggaagttca aaaaaaaaaa 3061 a // LOCUS LDV3ECPA 1064 bp ss-RNA VRL 24-JUL-1990 DEFINITION Lactate dehydrogenase-elevating virus capsid protein gene, complete cds. ACCESSION M34856 KEYWORDS viral capsid protein. SOURCE Lactate dehydrogenase-elevating virus, cDNA to viral RNA, clones A16, B24, B63, B90, B103, C44, dt4. 
ORGANISM Lactate dehydrogenase-elevating virus Viridae; ss-RNA enveloped viruses; Positive strand RNA virus; Togaviridae. REFERENCE 1 (bases 1 to 1064) AUTHORS Godeny,E.K., Speicher,D.W. and Brinton,M.A. TITLE Map location of lactate dehydrogenase-elevating virus (LDV) capsid protein (Vp1) gene JOURNAL Virology (1990) In press STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by E.K.Godeny, 06-JUN-1990. FEATURES from to/span description pept 637 984 capsid protein pept 135 650 unidentified ORF BASE COUNT 267 a 223 c 265 g 309 t ORIGIN 3' terminus region. 1 gtttttcgga accgcagttc cgtccttgtt gagcagcacg gaaaagtatt gctgcaaggc 61 caacccattg aggtgaagac agttgtgctc gacggtgtca aagctgtccg ggctaaaacc 121 gttccagccg aaaaatggga ggcctagaat tttgtgatca aacttcctgg tatcagattt 181 tcattgcctt tagtttgact tataccccaa ttgccatcta tagcttaaaa gtttttcgtg 241 gcactttggc aggtattgta aatattttta tttttattaa ttgttgtgta agttttgtgt 301 atttgatgta tcatcattct gtcaccaaca caattgcttt gtcattgggt gctgtgattg 361 ctcttgtgtg gggcatttac acccttgtga agatagttga ttggttggtg atcaggtgcc 421 ggttgtgctt cctaggtaga agttacatcc tagccccacc cagccacgtg gacacctctg 481 acgggcgtca gagcctaacc acatcgttaa caaccgcctt tgtggttagg aagccaggta 541 gtacccttgt aaacgggcag ttggtcccgg actttcaaag actcgtgctt gggggcaaga 601 aggccgtgtc aaaaggtgcg gttaaccttt tgaaatatgt ctcaaaataa gaagaagggc 661 ggccagaata aaggggcaaa ccagcaactt aatcagctga ttagtgcgtt gcttcgcaat 721 gccggacaaa ataaaggaaa ggggcaaaag aaaaagaagc agcccaagct gcatttcccc 781 atggcaggtc catctgacct gcggcatgtt atgactccca acgaggttca gatgtgtcga 841 tcttcattgg tgaccctttt taatcagggc ggaggtcagt gcacattggt tgactctggt 901 ggaataaatt tcaccgtgag tttcatgttg ccaacgcatg ccacagtccg gttaattaat 961 gcttctgcta attcatctgc ttaggagtta caatgtaagt catgtcagtc agatgcagcg 1021 actcagcctt ttgtaattaa ttgcgatttg gctgggccgg aatt // LOCUS MBIFRHADGB 3903 bp ds-DNA BCT 24-JUL-1990 DEFINITION M.thermoautotrophicum hydroxy-5-deazaflavin-reducing hydrogenase (frhADGB) gene, 
complete cds. ACCESSION J02914 KEYWORDS 8-hydroxy-5-deazaflavin-reducing hydrogenase. SOURCE M.thermoautotrophicum (strain delta-H) DNA. ORGANISM Methanobacterium thermoautotrophicum Prokaryota; Bacteria; Mendosicutes; Archaeobacteria; Methanobacteriales; Methanobacteriaceae. REFERENCE 1 (bases 1 to 3903) AUTHORS Alex,L.A., Reevell,J.N., Orme-Johnson,W.H. and Walsh,C.T. TITLE Cloning, sequence determination, and expression of the genes encoding the subunits of the Ni containing 8-hydroxy-5-deazaflavin reducing hydrogenase from Methanobacterium thermoautotrophicum delta-H JOURNAL Biochemistry (1990) In press STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by L.A.Alex, 25-MAY-1990. FEATURES from to/span description pept 298 1515 F420 hydrogenase alpha-subunit (frhA) pept 1524 2000 F420 hydrogenase delta-subunit (frhD) pept 2114 2824 F420 hydrogenase gamma-subunit (frhG) pept 2837 3682 F420 hydrogenase beta-subunit (frhB) binding 284 293 ribosomal binding site (put.) binding 1514 1520 ribosomal binding site (put.) binding 2100 2106 ribosomal binding site (put.) binding 2823 2832 ribosomal binding site (put.) 
BASE COUNT 1201 a 878 c 965 g 859 t ORIGIN 1 ctgcagcaac agctccaatt atcaggacta tcgccaccga tgctagaacc tttgagttta 61 tcatcaactc accttcctgg tgttaacaac acatttagta tgtgggtcaa gatatataac 121 actcagtatt actgaacaaa aaattaatat tataaactct gttttctact aataataagt 181 gatctataaa atgatataaa tagttttcac tttttttata attgaaaaac ttataacatt 241 taaaaatagt tataatattg aaggtttttt gaactttaaa aacaagaggt gatacatttg 301 agcgaaagga ttgttatatc gccgacatca cgacaagaag gacatgcaga acttgtcatg 361 gaagtcgatg atgagggaat cgtgacaaag gggcgatact tcagtattac tcctgtcagg 421 ggccttgaga aaatagtgac aggtaaagca cctgaaacag cacctgtcat cgtccagagg 481 atatgtggag tatgccctat accacacacc ctggcttccg ttgaggcaat agacgactcc 541 cttgacattg aggttccaaa ggccggaaga cttctccgtg aacttacact tgcagcacac 601 cacgtaaaca gccacgcaat acatcacttc ctcatagcac cggacttcgt gcctgaaaac 661 ctcatggcag acgccataaa ctcagtctcc gagataagga aaaacgcaca gtacgtagtt 721 gacatggttg caggtgaggg tatacaccca tccgatgtca ggataggtgg aatggccgac 781 aacataacgg aactcgcaag gaaaaggctg tatgcgaggt taaagcagct caaaccaaaa 841 gtcgacgaac acgttgaact catgatagga ttaatcgagg ataagggcct tccaaagggc 901 cttggagtcc acaaccagcc aacacttgca agccaccaga tctacggtga caggacaaaa 961 ttcgatctgg acaggttcac cgaggtaatg cctgagagct ggtacgatga ccctgaaata 1021 gccaagaggg cctgctcaac aataccactc tacgatggaa gaaacgtgga ggtcggtcca 1081 agagcaagga tggttgaatt ccagggattc aaggagaggg gtgttgttgc acagcacgtt 1141 gcaagggcac ttgagatgaa aacggcactc gcaagggcaa ttgaaatcct tgatgaactg 1201 gacacatccg caccggtaag ggcagacttc gatgagagag gcacaggtaa actgggtgta 1261 ggtgcaattg aaggtccaag gggactggac gttcacatgg cccaggttga aaatggtaag 1321 atacagttct acagtgcact cgtcccaaca acctggaaca tcccaacaat gggtcctgca 1381 actgaaggat tccaccatga atacgggcca cacgtcatac gcgcttacga cccctgtctc 1441 tcatgcgcaa cacacgtgat ggttgttgat gatgaggaca gatcagtaat cagggatgaa 1501 atggtcagac tctagggaat aacatgccat acgacgctga gattctagtg gtgggctgtg 1561 gaaacatcct tttcaaggac gatggattcg gcccagaggt tatcaaggcc cttgaagagt 1621 acttcaagga tagggaaaaa ccagataatg ttatgttcat tgatgctggg 
actggcggtc 1681 cgcactttgt cttctccctc ccccatgagg agtggaagaa gatgatagtt gtggatgttg 1741 ttgaattcaa cgcagaaccc ggaacactca ggaaattcga tgtcactgag attccaaagg 1801 gatcctatga aaatatgcat acatggccag tgagccagcc cctccatgaa ctcagtgaaa 1861 agatcgatgt tgtggtaata gggtgtaaac ccaaggagat atcagctccc aatgtggaaa 1921 tgggcctcac acccccagta aaaaaggcta ttcccagagc cattcagatg atcttagatg 1981 agattggggt ttctaaatga gcttaattgc ccgcatcaaa agatttttag gattggaggc 2041 tgaagctaag agggaagaac ccgaaaaaga aaaatcggaa cctgttggag cttcaaaaga 2101 ggaggttgaa aaagtggctg aagaaaatgc aaaaccaaga ataggttaca ttcacctcag 2161 tggatgtacc ggagatgcca tgtcgttaac tgaaaattac gacattctag cagaattact 2221 caccaacatg gtggacatag tatacggaca gaccctggtg gatctctggg agatgccaga 2281 gatggatctg gcccttgttg agggatctgt ctgtctgcag gacgaacaca gcctgcacga 2341 actcaaagaa ctgagggaga aggcaaaact cgtctgcgcc ttcggttcat gcgcacagac 2401 aggctgcttc acaaggtact caaggggcgg acagcaggca cagccatcac acgagtcctt 2461 tgtaccaata gcagacctca tagatgtgga cctcgccatt ccagggtgcc caccatcacc 2521 tgagataata gcaaaggcag tcgttgcact cctcaacaat gacatggagt acctccagcc 2581 aatgctggac cttgcaggct acacagaggc atgcggatgc gacctccaga caaaggttgt 2641 aaaccagggt ctctgcactg gatgtggaac atgtgcaatg gcctgccaga caagggccct 2701 tgacatgacc aacggaagac ccgaactcaa cagcgaccgc tgtataaaat gtggaatctg 2761 ctatgtgcag tgcccaagaa gctggtggcc agaagaacag atcaaaaagg agttagggct 2821 ataggaggct ggaaaaatgg ttttaggtac ttacaaggaa atagtttccg ccagatcaac 2881 tgacagagag attcagaaat tagcccagga tggaggaata gttacaggtc ttctagcata 2941 tgcccttgac gagggcataa tcgaaggcgc agttgttgca ggacccggaa aggagttctg 3001 gaagccagaa ccaatggtcg ccatgacctc agatgaactc aaggcggctg caggtaccaa 3061 gtacacattc tcaccaaacg tcctgatgct caagaaggca gtgaggcagt acggtataga 3121 gaaactgggt acagttgcaa taccctgcca gaccatgggt ataaggaagg cccagaccta 3181 cccattcggt gtcaggtttg ttgcagataa gataaaactc ctggtcggta tctactgcat 3241 ggagaacttc ccatacacat cactgcagac cttcatctgt gaaaaactcg gattaaacat 3301 ggagctcgtt gagaagatgg acataggtaa aggaaaattc tgggtctaca cccaggacga 
3361 tgtctacaca ctccccctca aggagaccca tggatacgaa caggcaggat gcaagatctg 3421 caaggactat gtggctgagc tggcagacgt atcaacaggt tcagtgggat caccggatgg 3481 ctggtcaaca gtcataacaa ggaccgattc aggggactca atattcaagc aggctgttga 3541 ggcaggtata ttcgagacca aaccaataga ggaagtcaaa cctggcctgg gacttcttga 3601 aaaactctct gcacagaaga aggaaaaggc agagaagaac atcgctgcaa ggaaagagat 3661 gggattacca acaccatact gatcccatcc ctaaattttc tatttttcag aacattcagt 3721 ccattcaact gcatcctcaa gttaactatt ctgttataac cgtgcagccg tcctcctcaa 3781 caatgactgt atgttcccac tgggcgacca ttgcaccgct cttttccctc agcacatggt 3841 aggggtatat ggcccttgac tgtatcagca ttctcatgga ggcattcagt ctctttgcat 3901 cga // LOCUS MUSLRCNA 1751 bp ds-DNA ROD 24-JUL-1990 DEFINITION Mouse loricrin gene, complete cds. ACCESSION M34398 KEYWORDS cell envelope protein; loricrin. SOURCE Mouse (strain BALB/c) new born epidermal keratinocyte, cDNA to mRNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 1751) AUTHORS Mehrel,T., Hohl,D., Rothnagel,J.A., Longley,M.A., Bundman,D., Cheng,C., Lichti,U., Bisher,M.E., Steven,A.C., Steinart,P.M., Yuspa,S.H. and Roop,D.R. TITLE Identification of a major keratinocyte cell envelope protein, loricrin JOURNAL Cell 61, 1103-1112 (1990) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by J.A.Rothnagel, 22-MAY-1990. 
FEATURES from to/span description pept 71 1516 loricrin signal 1740 1745 polyA signal BASE COUNT 223 a 493 c 706 g 329 t ORIGIN 1 acatcagcat cacctccttc cctcactcat cttccctggt gcttcagggt tccccttctc 61 cttaaacaag atgtctcacc agaaaaagca gcccactccc tgccctcctg tgggttgtgg 121 aaagacctct ggtggaggag gaggcggcgg cggctattat agcggtggcg gctctggctg 181 cggaggcggc tcatctggag gaggctctag ctgtggaggc ggaggcggtg gttcctatgg 241 aggtggttcc agctgcggcg gtggaggcgg ctccggtggg ggcgtcaagt actccggagg 301 cggcggtggc tctagctgcg gcggcggcta ctccggaggc ggtggtggct ctagctgcgg 361 cggtggctac tctgggggcg gcggcggctc cagctgcgga ggtggctact ccggaggcgg 421 cggcggctcc agctgcggcg gcggcagcta ctccgggggt ggctccagct gtggaggcgg 481 tggcggctct ggtgggggcg tcaagtactc cggaggtggt ggcggcggcg gctctagctg 541 cggcggcggc tcctccgggg gcggcggcgg cggctccagc tgcggaggcg gatcaggagg 601 cggcggctcc tactgcggag gctcctctgg aggcggcagc tccggtggct gcggcggcgg 661 ttccggaggc ggcaagtact ctggtggcgg cggtggctcc agctgcggag gcggctattc 721 cggcggcggt ggaagcagcg gcggctctag ctgtggcggc ggctactcag gtggcggtgg 781 atccagctgc ggcggcggcg gcggctattc cggtggcggc ggcacgagct gcggaggtgg 841 ttcctccggt ggcggcggcg gcggatcgtc ccaacagtat cagtgccaga gctacggagg 901 cggttctagc ggtggctcca gctgcggcgg cggctactcc gggggcggag gctccagctg 961 cggtggcggc tactccgggg gcggaggctc tagctgcgga ggcggctcct ctggtggtgg 1021 ctccagttgc ggcggcagcg gcggcggcgg ctattccggt ggtggcggtg gcagctgcgg 1081 cggcggctcc tctggcggcg gagggggcta ttactcctct cagcagacca gtcagacctc 1141 ctgcgccccc cagcagagct acggaggggg ctcttccgga ggaggtggta gctgtggagg 1201 tggctcctct ggcggcggtg gcggcggtgg ctgctactcc agcggtggtg gcggcagcag 1261 cggtggctgc ggtggaggct actccggagg cggcggtggc tgtggcggcg gctcttccgg 1321 gggcagcggc ggtggctgcg gaggtggctc ttccggaggc agcggcggtg gctgcggagg 1381 aggctactcc ggaggcggag gcggtggctc cagctgcgga ggcggctcct ctggtggcgg 1441 ctctggaggt ggcaagggtg tgccagtctg ccaccagacc cagcagaagc aggcgcctac 1501 ctggccgtgc aagtaaggtc accgggttgc aacggagaca acagagctgg aagagttctc 1561 cgtgggcgcc gatgggctta actttctcat 
gaatttgcct gaggtttcca aacccttcac
     1621 attttaagcg ccccttcccc cagaagaagc cattgagtcg ctcaaggtgt atcctgttct
     1681 gcagattttt catcttggtt tctgaatgac tacctcccaa ttctagtgtc tcctcagtca
     1741 ataaatttgc t
//
LOCUS       PFAGIPA      1906 bp ds-DNA             INV       24-JUL-1990
DEFINITION  P.falciparum glucosephosphate isomerase (Pfggi.R1) gene, complete
            cds.
ACCESSION   J05544
KEYWORDS    glucosephosphate isomerase.
SOURCE      P.falciparum (strain 7G8) DNA.
  ORGANISM  Plasmodium falciparum
            Eukaryota; Animalia; Protozoa; Microspora; Microsporea;
            Microsporida; Haemosporina; Plasmodiidae.
REFERENCE   1  (bases 1 to 1906)
  AUTHORS   Kaslow,D.C. and Hill,S.
  TITLE     Cloning metabolic pathway genes by complementation in E.coli:
            Isolation and expression of Plasmodium falciparum
            glucosephosphate isomerase
  JOURNAL   J. Biol. Chem. (1990) In press
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly
            submitted by D.C.Kaslow, 04-JUN-1990.
FEATURES       from  to/span    description
    pept         28   1803      glucosephosphate isomerase (EC 5.3.1.9)
BASE COUNT      782 a    174 c    289 g    661 t
ORIGIN      Chromosome 14.
1 tttttttttt tttattaata tttttctatg aatatggaga ttacaaattt gaagagctat 61 aaggaacttg tcaccttaag cgctgaagaa aaaacaaagg atctaaagga ctatttaaat 121 gataagaata gatctgaatc gttaataaag aagtttaaga atttttatat ggatttatct 181 cgtcaaaggt atagcgaaaa gactttaaac aaattagttg aatatgctga agaggtagaa 241 ttaaagaaga aggttgaaaa aacatttatg ggagaaaaag ttaatatgac agagaatcgc 301 agtgttttac atacagcttt aagaataccc atagagaaaa taaatacaca taaaataata 361 attgataata agaatgtgtt agaagacgta catggagtat taaaaaaaat agagaaatat 421 tcagatgaca ttagaaatgg ggtaataaaa acttgtaaaa atacaaaatt taaaaatgtg 481 atttgtattg gtattggagg ttcatattta ggtactgaat ttgtttatga agctatgaaa 541 tactattatt ataacatgga attaaataag aatgagaaag atcaagtaaa taattttaat 601 aataattatg atcaagataa tgtatttaat gtaaggtttt tagctaatgt agatccaaat 661 gatgtaaata gagctattca aaatttggat cagtatgata ctttagttat tataatatct 721 aaaactttta ctactgctga aactatgtta aatgctaggt caataaaaaa atggttaagt 781 ttaaaaataa aagatgatga aaatttaagt aaacatatgg tagctgtaag tactaatttg 841 aaattaactg atgaatttgg aatatcaaga gataatgtat ttgaattttg ggattgggta 901 ggaggtcgat tttctgtaac tagttctgtt ggtatattac ccttatcaat tgcttttgga 961 tataaaaata tgagaaattt tttaaatggt tgtcatgata tggatgaaca ttttttacat 1021 gcagatttaa aagaaaacat tcctgtctta ttagcattaa ctagttttta taatagtcat 1081 ttttttgatt ataaaaatgt tgctatatta ccatatttcc aaaatttatt aaaattttca 1141 gcacatattc aacaattatc tatggaaagt aatggaaaat cagtagatag aaataatcaa 1201 ccaatccatt ataatacttg tcaagtatat tttggtgaac ctggtacaaa tggtcagcat 1261 agtttttatc aattaataca tcaaggacaa gttatacctg ttgaattaat tggatttaaa 1321 cattctcatt tcccaattaa atttgataaa gaagtagtaa gtaatcatga tgaattaatg 1381 actaactttt ttgcacaagc tgatgcatta gctattggga aaacatacga acaagtaaaa 1441 gaagaaaatg aaaaaaataa aatgtctcca gaattattaa ctcataaagt ttttaacggt 1501 aatagaccat caaccttatt attatttgat gaattaaatt tctatacatg tggtttatta 1561 ttatccttat atgaaagtag aattgtcgct gaaggatttc tattaaatat taacagcttt 1621 gatcaatggg gggtagagtt gggtaaggtt ctagcaaaag aagtaagaaa ttattttaat 1681 gacacaagaa atcaaaagaa 
atcagataat acctataatt ttaatgaatc tacaaaaaat
     1741 tttattaaat tattacttgt ccaaataaaa aaaaaaaaaa aaattaacac aaatttaaag
     1801 taatatatgt aaatatatat atataaatat ataaatatat atatatatat atatatactt
     1861 atatatttat gtgtgaaagt caaatatggg tttcttttct tctttt
//
LOCUS       RATGADA      3198 bp ss-mRNA            ROD       24-JUL-1990
DEFINITION  Rat glutamic acid decarboxylase (GAD) mRNA, complete cds.
ACCESSION   M34445
KEYWORDS    glutamic acid decarboxylase.
SOURCE      Rat brain and hypothalamus, cDNA to mRNA, (libraries of
            D.Chikaraishi and S.Heinemann) clones RB10.7, ZAPG, pCR24RI and
            pCR22HP.
  ORGANISM  Rattus rattus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 3198)
  AUTHORS   Wyborski,R.J., Bond,R.W. and Gottlieb,D.I.
  TITLE     Characterization of a cDNA coding for rat glutamic acid
            decarboxylase
  JOURNAL   Mol. Brain Res. (1990) In press
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly
            submitted by D.I.Gottlieb, 23-MAY-1990.
FEATURES from to/span description pept 184 1965 glutamic acid decarboxylase mRNA < 1 3198 glutamic acid decarboxylase mRNA signal 3174 3179 polyA signal BASE COUNT 869 a 742 c 767 g 820 t ORIGIN 1 gaattccgaa gcagccccgg ggtgacaccc agcacgtacg tctgtggcag agcaaagcca 61 agcgggggac gttcgcggag gagtcgcggg agggtccagc tccctgtggc tgaatcgagc 121 ccgttcctgc gcccagaccg cgggggacac ttgaacagta gagaccccaa gaccaccgag 181 ctgatggcat cttccacgcc ttcgcctgca acctcctcga acgcgggagc ggatcctaat 241 actaccaacc tgcgtcctac aacatatgat acttggtgtg gcgtagccca tggatgcacc 301 agaaaactgg gcctgaagat ctgtggtttc ttgcaaagga ccaatagcct ggaagagaag 361 agtcgtcttg tgagcgcctt cagggagagg caggcctcca agaacctgct ttcctgtgaa 421 aacagtgacc ctggtgcccg cttccggcgc acagagacgg acttctccaa cctgtttgct 481 caagatctgc ttccagctaa gaacggggag gagcaaactg tgcagttctt actggaggtg 541 gttgacatac tcctcaacta tgtccgcaag acgtttgatc gctccaccaa ggttttggac 601 ttccaccacc cacaccagtt gctggaaggc atggaaggtt ttaatttgga gctgtctgac 661 caccccgagt ctctggagca gatcctggtt gactgtagag acaccctaaa gtacggggtt 721 cgcacaggtc accctcggtt tttcaaccag ctctctactg gtttggatat cattggttta 781 gctggcgaat ggctgacatc aactgccaat accaatatgt tcacatatga aattgcaccc 841 gtgtttgttc ttatggaaca gatcacactt aagaagatgc gagagatcat tggatggtca 901 aataaagatg gtgatgggat attttctcct gggggagcca tatccaacat gtacagcatc 961 atggcggctc ggtacaagta ctttccagaa gtgaaaacaa aaggcatggc ggctgtgccc 1021 aaactggtcc tcttcacctc agaacacagt cactattcca taaagaaggc tggggctgcg 1081 cttggctttg gaactgacaa tgtgatcctg ataaagtgca atgaaagggg gaagataatt 1141 ccagctgatt tagaggcaaa aattcttgat gccaaacaaa agggatttgt tcccctttat 1201 gtcaatgcaa ccgcaggcac gactgtttat ggagcgtttg atccgatcca ggagattgca 1261 gacatatgtg agaaatacaa cctttggctg catgtggatg ctgcctgggg tggtgggctg 1321 ctcatgtccc ggaagcatcg ccacaaactc agcggcatag aaagggccaa ttcagtcacc 1381 tggaaccctc acaagatgat gggtgtgctg ctccagtgtt ctgccatcct ggtcaaggaa 1441 aagggtatac tccaaggatg caaccagatg tgtgcaggct acctcttcca gccagacaag 1501 cagtatgacg tctcctacga tacaggggac aaggcgattc 
agtgtggccg ccatgtggac 1561 atcttcaagt tctggctgat gtggaaagca aagggcactg tgggatttga aaatcagatc 1621 aacaaatgcc tggagctggc tgaatacctc tatgccaaga ttaaaaacag agaagagttt 1681 gagatggttt tcaatggtga gcctgagcac acaaatgtct gtttctggta cataccacaa 1741 agcctccgag gggttccaga cagccctgag cgacgagaaa aactacacag ggtggctccc 1801 aagatcaaag ccctgatgat ggagtcagga acaaccatgg ttggttacca gcctcaaggg 1861 gacaaggcca acttcttccg gatggtcatc tccaacccag ccgccaccca gtctgacatc 1921 gatttcctca tcgaggaaat agagaggttg ggtcaagatc tgtaatcccc ccccctttgc 1981 agaaccgtaa tcaccggcca tagccctgcc cctctggcac cctagaacgc agttctgtca 2041 gtagttgaca cacctaggcc atttcactga aggaaattat aatctcttaa agaatatttg 2101 tcacatctca cgtaagcttg tttgttagaa ttagcgggaa ataatgttct tttaaaaaaa 2161 ttgcacatta gaaacacagt atatatgtac agttatatat atatatatac ctctctctat 2221 gtactatgta tgtatagtga atgtggctta gtcctagacc acagcatgtc gctcgctccc 2281 agggaattag cctcaccccc agcaattact aagaggctaa ccatctcgca agcaactgca 2341 ggacagatgg tacccccact gcagtgtcct agggacccag ggaaaggccg ttgctgggag 2401 cctaccaccc tgctagagct gtccccacct gaatggatgg atgatggatg gcaggaagta 2461 ccagtagatg acaaatgtca caccctccct gttagtgccc tgctagggga tatagtagca 2521 gagtctttgt cacaattgtg ctgttgctgc gttttagaga ttaatctgtg taaactgtac 2581 attaccgttg tctatctttg ggcaggggga agtgcatata atgatttaat tgtacgtcag 2641 tgagatattt gcttatttat attcaaatat atatcatgtt aaagagacat cttgtatttt 2701 cttcccattt gtaatgtatc ttatttatat attaatggag taagttctgg atactgttta 2761 tggtattttc gtgcatttgt gagccaaaga gaaaagatga aaattagtga gacttgcatt 2821 tatatttgag tgcccttaac ataatgattt gaacacatgt gtactgtctg gaaacgaatt 2881 ctgatactgt acatagagtg gcgttatgga aatcttgctt cagtagcctt tgctcttctc 2941 tttcccccct caggctgtac gtcaggtgtt ctcaaagctt ttctagtaac tgttgaaaaa 3001 taatgactag atctcctgta attttgtagt agtatatgac caatctctgt gactcgctta 3061 gctgaaacct aaggctatgt ttccgaagat ctccgataca ctgaccagtc ccacaagtgt 3121 ttttgaagac atgaaaccca cactgtgcat ttagagtatg caagaagaat ataaataaaa 3181 taaaaaatat tctccatg // LOCUS RATTSHRA 5270 bp ss-mRNA 
ROD 24-JUL-1990 DEFINITION Rat thyroid stimulating hormone receptor mRNA, complete cds. ACCESSION M34842 KEYWORDS TSH receptor; thyroid stimulating hormone receptor; transmembrane protein. SOURCE Rat (strain Fisher) thyroid epithelial cell line FRTL-5, cDNA to mRNA. ORGANISM Rattus norvegicus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 5270) AUTHORS Akamizu,T., Ikuyama,S., Saji,M., Kosugi,S., Kozak,C., McBride,O.W. and Kohn,L.D. TITLE Cloning, chromosomal assignment and regulation of the rat thyrotropin receptor by thyrotropin, agents which increase cAMP levels and thyroid autoantibodies JOURNAL Proc. Natl. Acad. Sci. U.S.A. (1990) In press STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by L.D.Kohn, 05-JUN-1990. FEATURES from to/span description pept 55 2349 thyroid stimulating hormone receptor precursor sigp 55 117 thyroid stimulating hormone receptor signal peptide matp 118 2346 thyroid stimulating hormone receptor signal 5256 5261 poly-A signal BASE COUNT 1415 a 1343 c 1176 g 1336 t ORIGIN 1 gcggccgcgt tgacagcgcg caacgatgaa gtagcactgg aggtcccttg gaaaatgagg 61 ccagggtccc tgctccagct cactctgctg ctcgccctgc ccaggagcct ctggggcaga 121 gggtgtactt ctccaccctg cgaatgccac caggaggacg acttcagagt cacctgcaag 181 gaactccacc aaatccccag cctaccgccc agcacccaga ctctgaagct catcgagact 241 cacctgaaga ccattcccag tcttgccttt tcgagcctgc ccaatatttc caggatctat 301 ctatccatag atgccactct gcagcgactg gagccacatt ctttctacaa tttgagtaaa 361 atgactcaca tagaaatccg gaacaccaga agcttgactt acatagaccc tgatgcctta 421 acagagctcc ccttgctcaa gtttcttgga attttcaata ctggacttag aatattccct 481 gacttgacca aaatttattc cacggatgtg ttctttatac tagaaatcac agacaaccct 541 tacatgacgt cagtccctga aaacgcattc cagggactat gcaatgaaac cttgaccctg 601 aaactataca acaatggatt tacttcaatc caaggacatg ctttcaatgg aacaaagctg 661 gatgctgttt acctaaacaa gaataaatat ctgacagcta tagacaaaga tgcctttgga 
721 ggagtataca gtggacccac tttgctagac gtgtcttcca ccagcgttac tgctcttcct 781 tccaaaggcc tggagcacct caaagagctg atcgcgaaga acacctggac tctcaaaaag 841 ctccccctgt ccttgagctt cctccacctc actcgggctg acctctctta cccaagtcac 901 tgctgtgctt ttaagaacca gaagaaaatc aggggaatcc tagagtcttt gatgtgtaat 961 gagagtagta tccggaacct gcgtcaaaga aagtcagtga acgtcatgag gggtcccgtc 1021 taccaggaat atgaagaagg tctgggtgac aaccatgttg ggtacaaaca aaactccaag 1081 ttccaggagg gcccaagcaa ctctcactat tacgtcttct ttgaagaaca agaggacgag 1141 atcatcggtt tcggccaaga gctcaaaaat cctcaggaag agactctcca agccttcgac 1201 agccactatg actacactgt gtgtggggac aacgaggaca tggtgtgtac ccccaagtca 1261 gacgagttta acccctgtga agatatcatg ggctacaagt tcctgagaat cgtggtatgg 1321 tttgtcagtc cgatggctct cctgggcaac gtctttgtcc tgttcgtcct gcttactagc 1381 cactacaaac tgaccgtgcc acgcttcctg atgtgcaacc tggcctttgc agatttctgc 1441 atgggggttt acctgcttct cattgcctcc gtagacctgt acacacacac tgagtactac 1501 aaccacgcca ttgactggca gacgggccct ggatgcaaca cggctggttt cttcactgtt 1561 tttgccagcg agttatcagt gtacacactg acggtcatca ccctggagcg atggtatgcc 1621 atcaccttcg ccatgcgcct ggataggaag atccgcctca ggcacgctta taccatcatg 1681 gcgggggggt gggtttcctg cttccttctc gccttgctcc caatggtggg aataagcagc 1741 tacgccaagg tcagcatctg cctgccgatg gacacagaca cccctcttgc cctggcgtac 1801 atcgcccttg ttctcctgct caacgttgtt gcctttgtta ttgtctgctc ctgctatgtg 1861 aagatttaca tcaccgtccg aaatccccag tacaaccctc gagataaaga caccaagatt 1921 gccaagagga tggccgtgtt gatcttcact gacttcatgt gcatggcgcc catctccttc 1981 tatgccctgt cagcgcttat gaacaagcct ctgatcactg tcactaactc cggggtcttg 2041 ttggttctct tctacccact caactcctgt gctaacccat ttctctatgc cattttcacc 2101 aaggccttcc agagggatgt gttcatccta ctcagcaagt ttggcctctg caaacaccag 2161 gcccaagcct atcaggccca gagagtctgt cccaacaaca acactggtat tcagatccaa 2221 aagatccccc aggacacaag gcagagtctc cccaatgtgc aagataccta tgaaccgctt 2281 ggaagctccc acctgactcc gaaactacag ggacggatct cagaagagta cacgcaaaca 2341 gccctgtaaa ggaagcgtac gctagtcatg gtgggactta caaaaggctg gtttctagaa 2401 catgcactcc 
agttccagga cacgtgaatg cctaggtcca cggggtcaga gttcatctct 2461 agggagtatt gcctccaaga agggagagaa gctatcggcg tgactgagtc ctaggtgatc 2521 taaaagtcgt ccaatctctc tggaagattt gctcgatgct tgcctgtgat gccgggcaac 2581 gcgctccact gataaccagc tggactgttc ccacagctct ctactccatt ctacggagtg 2641 ttttatatga aagattcagc aagtggccag tgttcattta cgcgagcagc tccacattag 2701 ctcccctgag gtaaccaaag aaagtctttg ttccttaaaa ataaaatgcc atacttggga 2761 ggtggaggca gggggaccca gttcatactt aactacccag aaaattccaa gccaacctat 2821 gctccaacaa tagcactggt attcagatcc aaaagatccc ccaggacgca aggtagagtc 2881 cttctccatc tcaagagacc aaaagaaccc tgaaacgcca aacataactt gatgtcacat 2941 aattgctttt attaaactga aaaaagagaa agtctagttt tgagagttct ttttttcctt 3001 tgactctgaa acagttatct cttcaccggg cttgatgagc tcgatgtaac ccagctgtta 3061 cacacattgg ctagagaaac tgctaagatt tcagcttttg tggctgagcg aactaagaat 3121 tatcttctcg gacaagacat gagccctggg agagatttct aaggagcata tggtgaggac 3181 agggcgcact tagtcacctg ttgaataaga caactatccc cagttaaccg tcagctttga 3241 gcaagtccct ggacctctaa attctgtaga aatggggaga cttgatgccc tctcctactt 3301 aaaaaaaacc ctcagtatat ctctccctca aaagatagtg tctgtgacaa agagaaaaaa 3361 gcattactta agtggaatct aatttttttt ttctaccttg tgaggctgct ggctgcttct 3421 ttttggaacc gggacatgtc accgaaaaca ctttttcttt atttcatttt gatcatgcgt 3481 ctgagccaat actattgagt ggctatacgg agccagatct taattattca tctaattatt 3541 atattctgta ctgctctcaa gtccaaaaca taacagaact tagagaatct gttactaata 3601 cagcctggct tggtagtgta cgcctttaat ctcagcactg gggaggcaga ggctgaattt 3661 gaggccagcc tggtccaaga cagccaggtc tacacagaga aaccctgttt caaagaaaac 3721 aaaacaaaat agactaataa tggagttaac acgtgactgt gtaggctcac ttcaaaacat 3781 ctttttcttt tacttttttt tatagctctc tgggctcaca ccacactaac actgtcaaga 3841 gagaagtcgt ttcacccgac actgccagtt ccttctatcc ctgccagtca catgagacat 3901 gtgtgccctg atagtagcct tgcttccaac aggaactgta cctacaggtg ttccttctag 3961 gttgcctgtc tccttagctc cttaccatat tgttacggtg tccccccaag cagatctgcc 4021 atttgcttct cagtactcat gaaccaaaca ggaaagataa gggttttgcc accttaagtc 4081 tgtgaagtag ggtttatggg 
tcacctgaca tggctcctat ctgactggcc tggaactctg 4141 tcacagtcac gtaagtgtag gctttcccag gtgctcgctt gtaactactg aaataaataa 4201 atgtggtgcc ctgaggcatc cgtttctctc ttacgtcact tttggaatgt cacatctgga 4261 cagataaaat atttcatctc gagtagccga tctaatgctt tgaaaaccaa gtcctagaag 4321 agtgacttcg acaaaattgc tagtaaaatt caacttgaca gcatgaagta aagaaacctc 4381 ctgtagtaag atatatagta ttttttttta atttgtcaat ttcttttgag gttaaccttg 4441 tatgtgcatg tgttaaaagc aaaaagggaa catctgctgt ggcactcagg agagtagccc 4501 tgcccctccc cttggtcata cggtagatct gaccccggat gtgggagttg caggtgagct 4561 ggcccggagg acatgaattc agaagagcgg gccctgcttt gccttctgtg catgcagtgg 4621 catagactag ggaaagatgt ccttctaccc tctcttatta cttgtcacct atggcaggca 4681 ggagagctgt gcccaaagac atgagagcag gggtatggcc ctgtccctca ctatctgcaa 4741 cactctggag agcaggccct acaccttgcc tgggcagcag ggtagagctg gccctggtta 4801 tgggagtcac tggtgagcca gttctgaagg caggagaacc agtggactga ccagctcaga 4861 taccactcag cccagatcca gggctttaaa ttggctcact ctaacatcta cccatcagtg 4921 aactactgga gtacatgaag caggcagtcc tacagatcca aaactacaca caggatctcc 4981 atgatgcacg acaacaacag gatatccaag aagagccctg gtgaggttcc agcattgatg 5041 gtggagcaga aggcagaggg cttataccag aacaaggact cgctgcaatg aacatttgca 5101 agcaaaagaa gtgtggacaa aagggtatac tggacactac agcatctgca gtgaggtttt 5161 gtccctctgt tgggacagga agcatgcgtg gaggatggat atgaggggaa aagatgagtg 5221 gtttggggta cagggtgtaa agttcacaaa gaatcaataa aaaattttcg // LOCUS STMBLIPA 840 bp ds-DNA BCT 24-JUL-1990 DEFINITION S.clavuligerus beta-lactamase inhibitory protein gene, complete cds. ACCESSION M34538 KEYWORDS beta-lactamase inhibitory protein. SOURCE S.clavuligerus (strain NRRL 3585) DNA, clone pBLIP. ORGANISM Streptomyces clavuligerus Prokaryota; Bacteria; Firmicutes; Streptomycetaceae. REFERENCE 1 (bases 1 to 840) AUTHORS Doran,J.L., Leskiw,B.K., Aippersbach,S. and Jensen,S.E. 
TITLE Isolation and characterization of a beta-lactamase inhibitory protein from Streptomyces clavuligerus and cloning and analysis of corresponding gene JOURNAL Unpublished (1990) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by S.E.Jensen, 25-MAY-1990. Dept. of Microbiology University of Alberta Edmonton, Alberta Canada T6G 2E9 FEATURES from to/span description pept 67 672 beta-lactamase inhibitory protein precursor (bliP) sigp 67 174 beta-lactamase inhibitory protein signal peptide matp 175 669 beta-lactamase inhibitory protein binding 52 63 ribosomal binding site signal 686 724 transcription termination signal BASE COUNT 140 a 248 c 320 g 132 t ORIGIN 1 atcgatgcca gggttgagca ggccctcagc gagggcggag gcttggtcga cggaagggga 61 cgggcgatga ggacagtggg gatcggcgcg ggggttcgga ggctggggcg ggccgtggtc 121 atggcggcgg ccgtgggtgg gctggtgctc gggtcggcgg gggcttcgaa cgcggcgggg 181 gtgatgaccg gggcgaagtt cacgcagatc cagttcggga tgacacgtca gcaggtcctc 241 gacatagccg gtgcggagaa ctgtgagacc ggcgggtcgt tcggggacag catccactgc 301 cgggggcacg cggcagggga ctactacgcc tacgccacct tcggcttcac cagcgccgcc 361 gccgacgcga aggtggactc gaagagccag gagaagctgc tggccccgag cgccccgacg 421 ctcaccctcg ccaagttcaa ccaggtcacc gtggggatga ccagggccca ggtactggcg 481 accgtcgggc aggggtcctg caccacctgg agtgagtact acccggccta tccgtcgacg 541 gccggggtga ccctcagcct gtcctgcttc gatgtggacg gttactcgtc gacggggttc 601 taccgaggct cggcgcacct ctggttcacg gacggggtgc ttcagggcaa gcggcagtgg 661 gaccttgtat aaggacgcgg tttcactgtg cggggcggat cacccggtga tccgccccgc 721 acggccatga accaggggga tgggaaacat cagtacgtca gaaattctcg tcgccgaagg 781 agtcgacctc tcctacggcg atcagcccgc cgtgcgggat gcccggatct cggtggtacc // LOCUS TRBPARPA 763 bp ss-mRNA INV 24-JUL-1990 DEFINITION T.brucei procyclin (pPS-1) mRNA, complete cds. ACCESSION M34790 KEYWORDS procyclic acidic repetitive protein; procyclin. SOURCE T.brucei brucei (strain TREU 427, MITat 1.4), cDNA to mRNA, clone pPS-1. 
ORGANISM Trypanosoma brucei Eukaryota; Animalia; Protozoa; Sarcomastigophora; Mastigophora; Kinetoplastida; Trypanosomatina; Trypanosomatidae. REFERENCE 1 (bases 1 to 763) AUTHORS Dorn,P.L., Aman,R.A. and Boothroyd,J.C. TITLE Inhibition of protein synthesis results in super-induction of procyclin RNA JOURNAL Unpublished (1990) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by J.C.Boothroyd, 04-JUN-1990. Stanford University School of Medicine Dept of Microbiology and Immunology Sherman Fairchild Science Building Stanford, CA 94305-5402 USA FEATURES from to/span description pept 29 466 procyclin mRNA < 1 763 procyclin mRNA rpt 215 393 12 bp degenerate repeats BASE COUNT 192 a 215 c 156 g 200 t ORIGIN 1 ccaattacac caaaaagtaa aattcacaat ggcacctcgt tccctttatc tgctcgctct 61 tcttctgttc agccccaacc tcttcgctgg cgtgggattt gccgcagccg ctgaaggacc 121 agaagacaag ggtcttacta agggaggcaa aggcaaaggc gagaagggaa ccaaggtcgg 181 cgccgacgat accaatggca ctgaccccga ccctgaaccc gaacctgaac ccgaacctga 241 acctgaaccc gaacctgaac ccgaacctga acctgaacct gaacccgaac ccgaacctga 301 acctgaacct gaacctgaac ccgaacctga acctgaacct gaacctgaac ccgagccgga 361 acctgaacca gaacctgaac cagaacctga acctggtgct gcaacgctga aatccgttgc 421 acttccgttc gcaatcgcgg ctgctgctct cgttgccgca ttctaagcgg atgcaagcgt 481 gtaaagcgcc tcggaggaac gaaacccttt gaaaaggttc ctttcattta tatcgcctcc 541 atatggtgca tcgtgtttgt ttcctgctgt ttcttgtaaa acaagtgtgg acattcattt 601 aatatttttt cgttatattt ttttggtgac atcctttcta atgccttatt aaccatcgcc 661 tgagacccac agccctgtag atttctgtga tgtttcggtt gcgtattcca taattttaag 721 cgtttcactt ctattttttt tcattccttt gaatttggat ctt // LOCUS W10PPIBCA 4761 bp ds-DNA BCT 24-JUL-1990 DEFINITION Plasmid pWR100 (from S.flexneri) antigen ippI, ipaB, ipaC genes, complete cds, and ippD gene, 3' end. ACCESSION M34849 KEYWORDS antigen. SOURCE Plasmid pWR100 (from Shigella flexneri type 5, strain M90T) DNA, clones HB101/pHS[4108,5357]. ORGANISM Plasmid pWR100 Prokaryota; Bacteria. 
REFERENCE 1 (bases 1 to 4761) AUTHORS Baudry,B., Kaczorek,M. and Sansonetti,P.J. TITLE Nucleotide sequence of the invasion plasmid antigen B and C genes (ipaB and ipaC) of Shigella flexneri JOURNAL Microb. Pathog. 4, 345-357 (1988) STANDARD full staff_entry REFERENCE 2 (bases 1 to 538 and 3987 to 4761) AUTHORS Baudry,B., Kaczorek,M. and Sansonetti,P.J. JOURNAL Unpublished (1988) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by B.Baudry, 05-JUN-1990. Center for Vaccine Development University of Maryland at Baltimore 10 South Pine Street Baltimore, MD 21201 FEATURES from to/span description pept 606 1073 polypeptide I (ippI) pept 1079 2821 polypeptide B (ipaB) pept 2841 3932 polypeptide C (ipaC) pept 3983 > 4761 ipaD protein pept < 1 549 unidentified ORF binding 595 600 ippI ribosomal binding site binding 1065 1070 ipaB ribosomal binding site binding 2829 2834 ipaC ribosomal binding site binding 3971 3977 ipaD ribosomal binding site signal 556 561 ippI -10 region signal 2723 2728 ipaC -35 region signal 2745 2750 ipaC -10 region BASE COUNT 1769 a 952 c 805 g 1235 t ORIGIN 1 aagcttgtaa agaaaatatt atctgttttt aacttaaaac aacgctttcc tcagaagaat 61 tttggatgcc cggtgaatat aaataaaatc agggacagcg tcatagacaa aataaaagat 121 agtaattcgg ggaatcaact attttgttgg atgagccaag agcgaaccac ttatgtctct 181 tcaatgataa acagaagtat agatgaaatg gccatacaca atggagttgt tttgacctca 241 gataacaaaa aaaacatatt tgcagctatt gaaaaaaaat tcccggatat aaaacttgat 301 gaaaaatcag cacaaacatc tatcagtcat acagcactaa acgagattgc ctcatctggc 361 ctcagagcaa agattttaaa acgctatagc agtgacatgg atttgtttaa cacacaaatg 421 aaggatctta caaatctagt atccagttcc gtctacgata aaatatttaa tgaatcaaca 481 aaagttttac aaatagaaat aagcgcagag gttctaaagg ctgtataccg tcaaagcaat 541 acaaattaaa tcttttatat taaatcttat acttattaaa gtaaagcaga caaaaaggag 601 accttatgtc tttaaatatc accgaaaatg aaagcatctc tactgcagta attgatgcaa 661 ttaactctgg cgctacactg aaagatatta atgcaattcc tgatgatatg atggatgaca 721 tttattcata 
tgcttatgac ttttacaaca aaggaagaat agaggaagct gaagttttct 781 tcaggttttt atgtatatac gacttttaca atgtagacta cattatggga ctcgcagcta 841 tttatcagat aaaagaacag ttccaacaag cagcagacct ttatgctgtc gcttttgcat 901 taggaaaaaa tgactataca ccagtattcc atactggaca atgtcagctt cggttgaaag 961 cccccttaaa agctaaagag tgcttcgaac tcgtaattca acacagcaat gatgaaaaat 1021 taaaaataaa agcacaatca tacttggacg caattcagga tatcaaggag taattattat 1081 gcataatgta agcaccacaa ccactggttt tcctcttgcc aaaatattga cttccactga 1141 gcttggagac aatactatcc aagctgcaaa tgatgcagct aacaaattat tttctcttac 1201 aattgctgat cttactgcta accaaaatat taatacaact aatgcacact caacttcaaa 1261 tatattaatc cctgaactta aagcaccaaa gtcattaaat gcaagttccc aactaacgct 1321 tttaattgga aaccttattc aaatactcgg tgaaaaatct ttaactgcat taacaaataa 1381 aattactgct tggaagtccc agcaacaggc aagacagcaa aaaaacctag aattctccga 1441 taaaattaac actcttctat ctgaaactga aggactaacc agagactatg aaaaacaaat 1501 taataaacta aaaaacgcag attctaaaat aaaagaccta gaaaataaaa ttaaccaaat 1561 tcaaacaaga ttatccgaac tcgatccaga gtcaccagaa aagaaaaaat taagccggga 1621 agaaatacaa ctcactatca aaaaagacgc agcagttaaa gacaggacat tgattgagca 1681 gaaaaccctg tcaattcata gcaaacttac agataaatca atgcaactcg aaaaagaaat 1741 agactctttt tctgcatttt caaacacagc atctgctgaa cagctatcaa cccagcagaa 1801 atcattaacc ggacttgcca gtgttactca attgatggca acctttattc aactagttgg 1861 aaaaaataat gaagaatctt taaaaaatga tctggctcta ttccagtctc tccaagaatc 1921 aagaaaaact gaaatggaga gaaaatctga tgagtatgct gctgaagtac gtaaagcaga 1981 agaactcaac agagtaatgg gttgtgttgg gaaaatactt ggggcacttt taactatcgt 2041 tagtgttgtt gcagcagctt tttctggagg agcctctcta gcactggcag ctgttggttt 2101 agctcttatg gttacggatg ctatagtaca agcagcgacc ggcaattcct tcatggaaca 2161 agccctgaat ccgatcatga aagcagtcat tgaaccctta atcaaactcc tttcagatgc 2221 atttacaaaa atgctcgaag gcttgggcgt cgactcgaaa aaagccaaaa tgattggctc 2281 tattctgggg gcaatcgcag gcgctcttgt cctagttgca gcagtcgttc tcgtagccac 2341 tgttggtaaa caggcagcag caaaacttgc agaaaatatt ggcaaaataa taggtaaaac 2401 cctcacagac cttataccaa 
agtttctcaa gaatttttct tctcaactgg acgatttaat 2461 cactaatgct gttgccagat taaataaatt tcttggtgca gcgggtgatg aagtaatatc 2521 caaacaaatt atttccaccc atttaaacca agcagtttta ttaggagaaa gtgttaactc 2581 tgccacacaa gcgggaggaa gtgtcgcttc tgctgttttc cagaacagcg cgtcgacaaa 2641 tctagcagac ctgacattat cgaaatatca agttgaacaa ctgtcaaaat atatcagtga 2701 agcaatagaa aaattcggcc aattgcagga agtaattgca gatctattag cctcaatgtc 2761 caactctcag gctaatagaa ctgatgttgc aaaagcaatt ttgcaacaaa ctactgcttg 2821 atacaaataa ggagaatgtt atggaaattc aaaacacaaa accaacccag actttatata 2881 cagatatatc cacaaaacaa actcaaagtt cttccgaaac acaaaaatca caaaattatc 2941 agcagattgc agcgcatatt ccacttaatg tcggtaaaaa tcccgtatta acaaccacat 3001 taaatgatga tcaactttta aagttatcag agcaggttca gcatgattca gaaatcattg 3061 ctcgccttac tgacaaaaag atgaaagatc tttcagagat gagtcacacc cttactccag 3121 agaacactct ggatatttcc agtctttctt ctaatgctgt ttctttaatt attagtgtag 3181 ccgttctact ttctgctctc cgcactgcag aaactaaatt gggctctcaa ttgtcattga 3241 ttgcgttcga tgctacaaaa tcagctgcag agaacattgt tcggcaaggc ctggcagccc 3301 tatcatcaag cattactgga gcagtcacac aagtaggtat aacgggtatc ggtgccaaaa 3361 aaacgcattc agggattagc gaccaaaaag gagccttaag aaagaacctt gccactgctc 3421 aatctcttga aaaagagctt gcaggttcta aattagggtt aaataaacaa atagatacaa 3481 atatcacctc accacaaact aactctagca caaaattttt aggtaaaaat aaactggcgc 3541 cagataatat atccctgtca actgaacata aaacttctct tagttctccc gatatttctt 3601 tgcaggataa aattgacacc cagagaagaa cttacgagct caataccctt tctgcgcagc 3661 aaaaacaaaa cattggccgt gcaacaatgg aaacatcagc cgttgctggt aatatatcca 3721 catcaggagg gcgttatgca tctgctcttg aagaagaaga acaactaatc agtcaggcca 3781 gcagtaaaca agcagaggaa gcatcccaag tatctaaaga agcatcccaa gcgacaaatc 3841 aattaataca aaaattattg aatataattg acagcatcaa ccaatcaaag aattcggcag 3901 ccagtcagat tgctggtaac attcgagctt aatatatcca agagccataa taatatatgg 3961 ctcttcctgt aaggaaataa ccatgaatat aacaactctg actaatagta tttccacctc 4021 atcattcagt ccaaacaata ccaacggttc atcaaccgaa acagttaatt ctgatataaa 4081 aacaacgacc agttctcatc ctgtaagttc 
ccttactatg ctcaacgaca cccttcataa 4141 tatcagaaca acaaatcagg cattaaagaa agagctttca caaaaaacgt tgactaaaac 4201 atcgctagaa gaaatagcat tacattcatc tcagattagc atggatgtaa ataaatccgc 4261 tcaactattg gatattcttt ccaggaacga atatccaatt aataaagacg caagagaatt 4321 attacattca gccccgaaag aagccgagct tgatggagat caaatgatat ctcatagaga 4381 actgtgggct aaaattgcaa actccatcaa tgatattaat gaacagtatc tgaaagtata 4441 tgaacatgcc gttagttcat atactcaaat gtatcaagat tttagcgctg ttctttccag 4501 tcttgccggc tggatctctc ccggaggtaa cgacggaaac tccgtgaaat tacaagtcaa 4561 ctcgcttaaa aaggcattgg aagaactcaa ggaaaaatat aaagataaac cgctatatcc 4621 agcaaataat actgttagtc aggaacaagc aaataaatgg cttacagaat taggtggaac 4681 aatcggcaag gtatctcaaa aaaacggggg atatgttgtc agtataaaca tgaccccaat 4741 agacaatatg ttaaaaagct t // LOCUS YEPOMPHA 824 bp ss-mRNA BCT 24-JUL-1990 DEFINITION Y.enterocolitica cationic outer membrane protein (ompH) gene, complete cds. ACCESSION M34854 KEYWORDS cationic outer membrane protein; enterobacterial outer membrane protein; lipopolysaccharide binding protein; secretory protein; structural membrane protein. SOURCE Y.enterocolitica (strain 03 isolate EH822) DNA, clone pUCHS115. ORGANISM Yersinia enterocolitica Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Facultatively anaerobic rods; Enterobacteriaceae. REFERENCE 1 (bases 1 to 824) AUTHORS Hirvas,L., Koski,P. and Vaara,M. TITLE The ompH gene of Yersinia enterocolitica: Cloning, sequencing, and comparison to known ompH sequences JOURNAL Unpublished (1990) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by L.H.Hirvas, 06-JUN-1990. 
University of Helsinki FEATURES from to/span description pept 324 818 ompH protein precursor (gtg start codon) sigp 324 392 ompH protein signal peptide matp 393 815 ompH protein binding 308 317 ompH ribosomal binding site signal 120 125 ompH -35 region signal 143 148 ompH -10 region BASE COUNT 257 a 147 c 198 g 221 t 1 others ORIGIN 1 tattcctgat tacagtaaag ccagcaatat tcgtgtctct gccggtgtgg cactacaatg 61 gatgtctcca ttggggcctt tagtgttcgc atatgctaaa ccggttaaag attacgaagg 121 tgataagtca gagcaattcc agtttaacat ggtaaaactt ggtaattgat tggcaaaagt 181 agttgcaaat tttaaagaca gcactggtag ttgaaataac ctaaggtagc gctgatgagt 241 ttggcgctat tttaggcatt attcagtatt aagtgcactt caaggtgtct ctgacacaaa 301 caggtaatgg taaggagttt atagtgaaaa agtggttgtg tgcggcaagt cttggtttag 361 cattggcagc ttctgccagy gttcaagccg caaaaattgc tattgttaac gtttccagaa 421 ttttccaaca attacctgag agtgaaaccg tagctaaaca gttggagaat gaattcaaag 481 gccgagcaac cgaactgcaa ggaatggaga gcgatctgca gactaaaatg cagaaactgc 541 aacgtgacgg ttctaccatg aaagccagtg atcgtaccaa actggaaaat gacgtaatga 601 aacagcgtga aactttctct actaaagccc aggcttttga gcaagataat cgccgtcgtc 661 agatggaaga gcgtaataaa atcctgagcc gtattcagga tgctgttaaa tctgttgcta 721 gcaaaggcgg ttatgacgtg gtgattgatg caaatgctgt tgcatatgca gatccttcta 781 aagatatcac tgctgacgtg ctgaaacagg ttaaataaaa gtac // LOCUS YSCEXG1A 2056 bp ds-DNA PLN 24-JUL-1990 DEFINITION S.cerevisiae exo-1,3-B-glucanase/1,3-B-D-glucan glucanohydrolase (EXG1) gene, complete cds. ACCESSION M34341 KEYWORDS 1,3-glucan glucanohydrolase; exo-1,3-B-glucanase. SOURCE S.cerevisiae (strain AB320) DNA, clone YEp13. ORGANISM Saccharomyces cerevisiae Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes; Endomycetales; Saccharomycetaceae. REFERENCE 1 (bases 1 to 2056) AUTHORS Vazquez de Aldana,C.R., Correa,J., San Segundo,P., Bueno,A., Nebreda,A.R., Mendez,E. and del Rey,F. 
TITLE Nucleotide sequence of the exo-1,3-B-glucanase-encoding gene, EXG1, in the yeast Saccharomyces cerevisiae JOURNAL Unpublished (1990) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by F.del Rey, 17-MAY-1990. Instituto de Microbiologia bioquimica Facultad de Biologia CSIC/Universidad de Salamanca 37008 Salamanca, SPAIN FEATURES from to/span description pept 363 1709 exo-1,3-B-glucanase/1,3-B-D-glucan glucanohydrolase precursor (EXG1, EC 3.2.1.58) sigp 363 482 exo-1,3-B-glucanase/1,3-B-D-glucan glucanohydrolase signal peptide matp 483 1706 exo-1,3-B-glucanase/1,3-B-D-glucan glucanohydrolase mRNA 265 > 1709 EXG1 mRNA signal 153 159 TATA box BASE COUNT 610 a 395 c 396 g 655 t ORIGIN Chromosome XII-right arm. 1 ctcgagcacg ccgttataat tttagcgtgt tcctgtacct gtgtgcacat caataagcgg 61 tgtaacaaac ttgaacttgc catctcatat cgtcatatga gcagttgcag agaaaggcac 121 tttaaataaa aaggcgtgga tgataaaaaa tgtatataag ttggatggat ttttgggaaa 181 aagtaatgtt tttgcagacg ttttaaatac tccctccctt ttcttagtaa tttttattat 241 gtattgacta agtcaaaaat aactatagaa aactaaagtt tacgagagga cccaaaagtt 301 ttgaataaca cgtgcctttg attttttgtt tactttcttt ttctagttaa ttaccaacta 361 aaatgctttc gcttaaaacg ttactgtgta cgttgttgac tgtgtcatca gtactcgcta 421 ccccagtccc tgcaagagac ccttcttcca ttcaatttgt tcatgaggag aacaagaaaa 481 gatactacga ttatgaccac ggttccctcg gagaaccaat ccgtggtgtc aacattggtg 541 gttggttact tcttgaacca tacattactc catctttgtt cgaggctttc cgtacaaatg 601 atgacaacga cgaaggaatt cctgtcgacg aatatcactt ctgtcaatat ttaggtaagg 661 atttggctaa aagccgttta cagagccatt ggtctacttt ctaccaagaa caagatttcg 721 ctaatattgc ttcccaaggt ttcaaccttg tcagaattcc tatcggttac tgggctttcc 781 aaactttgga cgatgatcct tatgttagcg gcctacagga atcttaccta gaccaagcca 841 tcggttgggc tagaaacaac agcttgaaag tttgggttga tttgcatggt gccgctggtt 901 cgcagaacgg gtttgataac tctggtttga gagattcata caagtttttg gaagacagca 961 atttggccgt tactacaaat gtcttgaact acatattgaa aaaatactct gcggaggaat 1021 acttggacac tgttattggt atcgaattga 
ttaatgagcc attgggtcct gttctagaca 1081 tggataaaat gaagaatgac tacttggcac ctgcttacga atacttgaga aacaacatca 1141 agagtgacca agttatcatc atccatgacg ctttccaacc atacaattat tgggatgact 1201 tcatgactga aaacgatggc tactggggtg tcactatcga ccatcatcac taccaagtct 1261 ttgcttctga tcaattggaa agatccattg atgaacacat taaagtagct tgtgaatggg 1321 gtaccggagt tttgaatgaa tcccactgga ctgtttgtgg tgagtttgct gccgctttga 1381 ctgattgtac aaaatggttg aatagtgttg gcttcggcgc tagatacgac ggttcttggg 1441 tcaatggtga ccaaacatct tcttacattg gctcttgtgc taacaacgat gatatagctt 1501 actggtctga cgaaagaaag gaaaacacaa gacgttatgt ggaggcacaa ctagatgcct 1561 ttgaaatgag agggggttgg attatctggt gttacaagac agaatctagt ttggaatggg 1621 atgctcaaag attgatgttc aatggtttat tccctcaacc attgactgac agaaagtatc 1681 caaaccaatg tggcacaatt tctaactaag tcgccctcag tccgctcatt ttagctgaat 1741 tttctaatgt tatttttcat cagcaaaact taacagaacg ttaatttatc tacccctttt 1801 agttcattat ctctttttta tccaacattt tacagagatc tctcacttaa gtctaagtaa 1861 agacattatt ttatatggta cacttataga atatacgata ataataataa aaactatgta 1921 acataacctt cagaatttaa tattagtttc ctttttacct cattgcacta ataaaaaaat 1981 tctacagaat ctccgaaaaa gaaaatccag cttactcttt ttgttttctt cttcacacgt 2041 gagcttttcc gccggc // LOCUS HUMPSG6A 1409 bp ss-mRNA PRI 24-JUL-1990 DEFINITION Human pregnancy-specific glycoprotein-6 (PSG6) mRNA, complete cds. ACCESSION M33666 KEYWORDS pregnancy-specific glycoprotein. SOURCE Human fetal liver first trimester, cDNA to mRNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 1409) AUTHORS Zimmermann,W., Weiss,M. and Thompson,J.A. TITLE cDNA cloning demonstrates the expression of pregnancy-specific glycoprotein genes, a subgroup of the carcinoembryonic antigen gene family, in fetal liver JOURNAL Biochem. Biophys. Res. Commun. 
163, 1197-1209 (1989) STANDARD simple staff_review FEATURES from to/span description pept 53 1360 pregnancy-specific glycoprotein-6 BASE COUNT 395 a 403 c 294 g 317 t ORIGIN 1 gggcgggcct aggctcatct ccacagggga gaacacacag acagcagaga ccatgggacc 61 cctctcagcc cctccctgca ctcagcacat cacctggaag gggctcctgc tcacagcatc 121 acttttaaac ttctggaacc tgcccaccac tgcccaagta ataattgaag ccaagccacc 181 caaagtttcc gaggggaagg atgttcttct acttgtccac aatttgcccc agaatcttac 241 tggctacatc tggtacaaag ggcaaatgac ggacctctac cattacatta catcatatgt 301 agtacacggt caaattatat atgggcctgc ctacagtgga cgagaaacag tatattccaa 361 tgcatccctg ctgatccaga atgtcacaca ggaggatgca ggatcctaca ccttacacat 421 cataaagcga ggcgatggga ctggaggagt aactggatat ttcactgtca ccttatactc 481 ggagactccc aagccctcca tctccagcag caacttaaac cccagggagg tcatggaggc 541 tgtgcgctta atctgtgatc ctgagactcc ggatgcaagc tacctgtggt tgctgaatgg 601 tcagaacctc cctatgactc acaggttgca gctgtccaaa accaacagga ccctctatct 661 atttggtgtc acaaagtata ttgcaggacc ctatgaatgt gaaatacgga acccagtgag 721 tgccagccgc agtgacccag tcaccctgaa tctcctcccg aagctgccca tgccttacat 781 caccatcaac aacttaaacc ccagggagaa gaaggatgtg ttagccttca cctgtgaacc 841 taagagtcgg aactacacct acatttggtg gctaaatggt cagagcctcc cggtcagtcc 901 gagggtaaag cgacccattg aaaacaggat actcattcta cccagtgtca cgagaaatga 961 aacaggaccc tatcaatgtg aaatacggga ccgatatggt ggcatccgca gtaacccagt 1021 caccctgaat gtcctctatg gtccagacct ccccagaatt tacccttcat tcacctatta 1081 ccgttcagga gaaaacctcg acttgtcctg ctttgcggac tctaacccac cggcagagta 1141 ttcttggaca attaatggga agtttcagct atcaggacaa aagctcttta tcccccaaat 1201 tactacaaat catagcgggc tctatgcttg ctctgttcgt aactcagcca ctggcaagga 1261 aatctccaaa tccatgatag tcaaagtctc tgagacagca tctccccagg ttacctatgc 1321 tggtccaaac acctggtttc aagaaatcct tctgctgtga cctcccaaag tgctaggatt 1381 aaaacatgac ccaccatgaa acccgccca // LOCUS HUMPSGA 1930 bp ss-mRNA PRI 24-JUL-1990 DEFINITION Human pregnancy-specific glycoprotein-1a (PSG-1a) mRNA, complete cds. 
ACCESSION M33664 KEYWORDS pregnancy-specific glycoprotein. SOURCE Human fetal liver first trimester, cDNA to mRNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 1930) AUTHORS Zimmermann,W., Weiss,M. and Thompson,J.A. TITLE cDNA cloning demonstrates the expression of pregnancy-specific glycoprotein genes, a subgroup of the carcinoembryonic antigen gene family, in fetal liver JOURNAL Biochem. Biophys. Res. Commun. 163, 1197-1209 (1989) STANDARD simple staff_review FEATURES from to/span description pept 1 1260 pregnancy-specific glycoprotein-1a BASE COUNT 560 a 520 c 390 g 460 t ORIGIN 1 atgggaaccc tctcagcccc tccctgcaca cagcgcatca aatggaaggg gctcctgctc 61 acagcatcac ttttaaactt ctggaacctg cccaccactg cccaagtcac gattgaagcc 121 cagccaccca aagtttccga ggggaaggat gttcttctac ttgtccacaa tttgccccag 181 aatcttaccg gctacatctg gtacaaaggg caaatgaggg acctctacca ttacattaca 241 tcatatgtag tagacggtga aataattata tatgggcctg catatagtgg acgagaaaca 301 gcatattcca atgcatccct gctgatccag aatgtcaccc gggaggacgc aggatcctac 361 accttacaca tcataaaggg agatgatggg actagaggag taactggacg tttcaccttc 421 accttacacc tggagactcc taagccctcc atctccagca gcaacttaaa tcccagggag 481 accatggagg ctgtgagctt aacctgtgac cctgagactc cagacgcaag ctacctgtgg 541 tggatgaatg gtcagagcct ccctatgact cacagcttga agctgtccga aaccaacagg 601 accctctttc tattgggtgt cacaaagtat actgcaggac cctatgaatg tgaaatacgg 661 aacccagtga gtgccagccg cagtgaccca gtcaccctga atctcctccc gaagctgccc 721 aagccctaca tcaccatcaa caacttaaac cccagggaga ataaggatgt cttaaacttc 781 acctgtgaac ctaagagtga gaactacacc tacatttggt ggctaaatgg tcagagcctc 841 ccggtcagtc ccagggtaaa gcgacccatt gaaaacagga tcctcattct acccagtgtc 901 acgagaaatg aaacaggacc ctatcaatgt gaaatacggg accgatatgg tggcatccgc 961 agtgacccag tcaccctgaa tgtcctctat ggtccagacc tccccagaat ttacccttca 1021 ttcacctatt accgttcagg agaagtcctc tacttgtcct gttctgcgga ctctaaccca 1081 ccggcacagt attcttggac 
aattaatgaa aagtttcagc taccaggaca aaagctcttt 1141 atccgccata ttactacaaa gcatagcggg ctctatgttt gctctgttcg taactcagcc 1201 actggcaagg aaagctccaa atccatgaca gtcgaagtct ctgactggac agttccctga 1261 attctactag tacctccaat tccattttct cccatggaat cactaagagc aagacccact 1321 ctgttccaga agccctataa gctggaggtg gacaactcaa tgtaaatttc atgggaaaac 1381 ccttgtacct gaagcatgag ccactcagaa ctcaccaaaa tattcgacac cataacaaca 1441 gatgctcaaa ctgtaaacca ggacaacaag tggatgactt cacactgtgg acagtttttc 1501 ccaagatgtc agaagaagac tccccatcat gatgaggctc tcccccctct taactgtcct 1561 tgctcatgcc tgcctctttc acttggcagg ataatgcagt cattagaatt tcacatgtag 1621 tagcttctga gagtaacaac agagtgtcag atatgtcatc tcaacctcaa acttttatgt 1681 aacatctcag ggggaaatgt ggctctctcc accttgcata cagggctccc aatagaaatc 1741 aacacagaga tattgcctgt gtgttttcag agaagatggt ttgtatgaag acgtaggaaa 1801 gctgaaatta taatagagtc ccctttaaat ccacattgtg tggatggctc ttgccgtttc 1861 ctaagagata cattgtaaaa cgtgacagta agacattcta gcagaataaa acatgtacca 1921 cctttgcccc // LOCUS HUMPSGC 1307 bp ss-mRNA PRI 24-JUL-1990 DEFINITION Human pregnancy-specific glycoprotein-4 (PSG4) mRNA, 3' end. ACCESSION M33665 KEYWORDS pregnancy-specific glycoprotein. SOURCE Human fetal liver first trimester, cDNA to mRNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 1307) AUTHORS Zimmermann,W., Weiss,M. and Thompson,J.A. TITLE cDNA cloning demonstrates the expression of pregnancy-specific glycoprotein genes, a subgroup of the carcinoembryonic antigen gene family, in fetal liver JOURNAL Biochem. Biophys. Res. Commun. 
163, 1197-1209 (1989) STANDARD simple staff_review FEATURES from to/span description pept < 1 639 pregnancy-specific glycoprotein-4 (AA at 1) BASE COUNT 394 a 339 c 249 g 325 t ORIGIN 1 gcggggtata ttgcaggacc ctatgaatgt gaaatacgga acccagtgag tgccagccgc 61 agtgacccag tcaccctgaa tctcctccca aagctgtcca agccctacat cacaatcaac 121 aacttaaacc ccagagagaa taaggatgtc ttaaccttca cctgtgaacc taagagtgag 181 aactacacct acatttggtg gctaaatggt cagagcctcc ctgtcagtcc cagggtaaag 241 cgacccattg aaaacaggat cctcattcta cccaatgtca cgagaaatga aacaggacct 301 tatcaatgtg aaatacggga ccgatatggt ggcatccgca gtgacccagt caccctgaat 361 gtcctctatg gtccagacct ccccagcatt tacccttcat tcacctatta ccgttcagga 421 gaaaacctct acttgtcctg cttcgccgag tctaacccac gggcacaata ttcttggaca 481 attaatggga agtttcagct atcaggacaa aagctctcta tcccccaaat aactacaaag 541 catagtgggc tctatgcttg ctctgttcgt aactcagcca ctggcaagga aagctccaaa 601 tccatcacag tcaaagtctc tgactggata ttaccctgaa ttctactagt tcctccaatt 661 ccattttctc ccatggaatc acgaagagca agacccactc tgttccagaa gccctataag 721 ctggaggtgg acaactcgat gtaaatttca tgggaaaacc cttgtacctg acatgtgagc 781 cactcagaac tcaccaaaat gttcgacacc ataacaacag ctactcaaac tgtaaaccag 841 gataagaagt tgatgacttc acactgtgga cagtttttcc aaagatgtca gaacaagact 901 ccccatcatg ataaggctcc cacccctctt aactgtcctt gctcatgcct gcctctttca 961 cttggcagga taatgcagtc attagaattt cacatgtagt agcttctgag ggtaacaaca 1021 gagtgtcaga tatgtcatct caacctcaaa cttttacgta acatctcagg cgaaatgtgg 1081 ctctctccat cttgcataca gggctcccaa tagaaatgaa cacagagata ttgcctgtgt 1141 gtttgcagag aagatggttt ctataaagag taggaaagct gaaattatag tagagtctcc 1201 tttaaatgca cattgtgtgg atggctctca ccatttccta agagatacag tgtaaaacgt 1261 gacagtaata ctgattctag cagaataaaa catgtaccac atttccc // LOCUS HUMPSGD 1699 bp ss-mRNA PRI 24-JUL-1990 DEFINITION Human pregnancy-specific glycoprotein-1d (PSG-1d) mRNA, complete cds. ACCESSION M33663 KEYWORDS pregnancy-specific glycoprotein. SOURCE Human fetal liver first trimester, cDNA to mRNA. 
ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 1699) AUTHORS Zimmermann,W., Weiss,M. and Thompson,J.A. TITLE cDNA cloning demonstrates the expression of pregnancy-specific glycoprotein genes, a subgroup of the carcinoembryonic antigen gene family, in fetal liver JOURNAL Biochem. Biophys. Res. Commun. 163, 1197-1209 (1989) STANDARD simple staff_review FEATURES from to/span description pept 64 1344 pregnancy-specific glycoprotein-1d BASE COUNT 493 a 442 c 343 g 421 t ORIGIN 1 ctcagagagt ttctggatcc taggcttatc tccacagagg agaacacaca agcagcagag 61 accatgggaa ccctctcagc ccctccctgc acacagcgca tcaaatggaa ggggctcctg 121 ctcacagcat cacttttaaa cttctggaac ctgcccacca ctgcccaagt cacgattgaa 181 gcccagccac ccaaagtttc cgaggggaag gatgttcttc tacttgtcca caatttgccc 241 cagaatctta ccggctacat ctggtacaaa gggcaaatga gggacctcta ccattacatt 301 acatcatatg tagtagacgg tgaaataatt atatatgggc ctgcatatag tggacgagaa 361 acagcatatt ccaatgcatc cctgctgatc cagaatgtca cccgggagga cgcaggatcc 421 tacaccttac acatcataaa gggagatgat gggactagag gagtaactgg acgtttcacc 481 ttcaccttac acctggagac tcctaagccc tccatctcca gcagcaactt aaatcccagg 541 gagaccatgg aggctgtgag cttaacctgt gaccctgaga ctccagacgc aagctacctg 601 tggtggatga atggtcagag cctccctatg actcacagct tgaagctgtc cgaaaccaac 661 aggaccctct ttctattggg tgtcacaaag tatactgcag gaccctatga atgtgaaata 721 cggaacccag tgagtgccag ccgcagtgac ccagtcaccc tgaatctcct cccgaagctg 781 cccaagccct acatcaccat caacaactta aaccccaggg agaataagga tgtcttaaac 841 ttcacctgtg aacctaagag tgagaactac acctacattt ggtggctaaa tggtcagagc 901 ctcccggtca gtcccagggt aaagcgaccc attgaaaaca ggatcctcat tctacccagt 961 gtcacgagaa atgaaacagg accctatcaa tgtgaaatac gggaccgata tggtggcatc 1021 cgcagtgacc cagtcaccct gaatgtcctc tatggtccag acctccccag aatttaccct 1081 tcattcacct attaccgttc aggagaagtc ctctacttgt cctgttctgc ggactctaac 1141 ccaccggcac agtattcttg gacaattaat gaaaagtttc agctaccagg 
acaaaagctc 1201 tttatccgcc atattactac aaagcatagc gggctctatg tttgctctgt tcgtaactca 1261 gccactggca aggaaagctc caaatccatg acagtcgaag tctctggtaa gtggatccca 1321 gcatcgttgg caatagggtt ttaggtggag tctatctggc attcagagaa gagtcaggaa 1381 aacaattgta ttcccagcct gtgtccctag ggcacaagca aatcccaaat tctcctcctg 1441 aaccctccaa atttgtctaa gaacttcgaa aactttaaca aacaggctga tatcttcata 1501 atattcccag cctagaccaa gcaggaagaa cattgatttc attgaaataa ttgataataa 1561 tgaagataat gtttttatga tttttatttg aaaatttgct aattctttaa atggtttgtt 1621 ttctacattg atggaatttt tctcttttaa tctatctaca gcttatagca gttcaataaa 1681 ctatacttct gggaaccgt // LOCUS RHMDCTAB 764 bp ds-DNA BCT 24-JUL-1990 DEFINITION R.meliloti dctA and dctB genes, 5' end. ACCESSION M33555 KEYWORDS regulatory protein. SOURCE R.meliloti DNA. ORGANISM Rhizobium meliloti Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Rhizobiaceae. REFERENCE 1 (bases 1 to 764) AUTHORS Wang,Y.-P., Birkenhead,K., Boesten,B., Manian,S. and O'Gara,F. TITLE Genetic analysis and regulation of the Rhizobium meliloti genes controlling C-4-dicarboxylic acid transport JOURNAL Gene 85, 135-144 (1989) STANDARD simple staff_review FEATURES from to/span description pept 726 > 764 dctA protein BASE COUNT 149 a 239 c 252 g 124 t ORIGIN 1 cccgggccgc ttgctgaccg tgcccatggc gaaatgttcg gccatgccgt cggcggacgg 61 cgagccggaa ataatcgcgg aaggcatagt cgttgccgac gaagctcgtc ggctcctgcc 121 agttgctggc cgcgacggcg acgccgctgc ggtcgatcag ataaatgacc gcggcttcgg 181 cgcttgtcgc cagggcctcg agcttacggt tgatgcggtc gagcgacggc ctgtccggcg 241 aaagcaatgc gccacgaatg gctgcgtcgt cggcgaggac aagcggcagg gcgcgctgcc 301 gttccacgac tgctcgaaga agcgaggctt tcaggctggc gtcgatccgg ctctgaccgg 361 caaggccggc gagcgcctgc gaccggccgt agtctcgcgc aagaagaagg ccggctgcca 421 gaaggaccag cgcgactgcc gcgaaaacga gccaggaccg tcgggcccgg ctgcgaacgc 481 atgcgatcgc tcgcttctgc aggaagtttg accatgcgaa cattgtgcat gttttcgccc 541 aggacgccag cacttctgtg cggaaatccg cacatatcca cgaacggcaa gcgagcaccg 601 ctcccaaaaa tgtcatggat tcaatcgcat aggccttccg 
cggcaaactg gcacgcatgt 661 tgctgacaag ctccacaagg cagccactgc tgtcgatctt cggaaagcgg cccgggaggc 721 ccggcatgtt gccggactgg gcctgccacg tggaggatat catg // LOCUS MNKSVAA 589 bp ds-DNA PRI 24-JUL-1990 DEFINITION Monkey sp. DNA fragment found also in SV40 genome. ACCESSION M33902 KEYWORDS . SOURCE Monkey sp. DNA. ORGANISM Monkey sp. Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Cercopithecidae. REFERENCE 1 (bases 1 to 589) AUTHORS Szymanski,P. and Woodworth,M. TITLE A 69-base-pair monkey DNA sequence enhances simian virus 40 replication and transcription through multiple motifs JOURNAL J. Virol. 64, 1360-1365 (1990) STANDARD simple staff_review FEATURES from to/span description binding 221 230 AP-4 binding site binding 243 249 AP-1 binding site BASE COUNT 162 a 161 c 118 g 148 t ORIGIN 1 aagcttcgta gaatctcaga aagtcataaa gctacaaacg agttcacaaa caagtgatgc 61 atagaccaca cccagcccac agatgagttc tgtgtggctc acacttattt tttaaaacag 121 agtttgctgc caacacttca aaacccaatg attttgcata aaaatccaga tttattagaa 181 aatctgtgga cttttctgcc tggcaacaac catctggaac tcagcagcgg ttgcccctga 241 gatgagtcac agttcccacc ccactcaccc atgtcaccta tatggcttgg aaggcatgtg 301 agtttgcaac ctgctcaaag ccacacccca gaaacggcct gctaaaatgg gattccagaa 361 tccacttgca cctttcgtct ttctcagtct cccctgactc aacacagcac actggcattg 421 ttttttttcc tctcagaaaa ctgagcatgc acttttgaga atcagagaga aatccgcctc 481 tcaaatgtca gaaaacagct gatgctgcct cttacttggg ggcctcttga cttcccacgt 541 gtcttactgc cacttgagag ggaggggtgc aggcaggcag gctccctgg // LOCUS MNKSVAB 69 bp ds-DNA PRI 24-JUL-1990 DEFINITION Monkey sp. DNA fragment also found in SV40 genome. ACCESSION M33903 KEYWORDS . SOURCE Monkey sp. DNA. ORGANISM Monkey sp. Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Cercopithecidae. REFERENCE 1 (bases 1 to 69) AUTHORS Szymanski,P. and Woodworth,M. 
TITLE A 69-base-pair monkey DNA sequence enhances simian virus 40 replication and transcription through multiple motifs JOURNAL J. Virol. 64, 1360-1365 (1990) STANDARD simple staff_review FEATURES from to/span description binding 1 7 AP-1 binding site binding 41 50 AP-4 binding site binding 63 69 AP-1 binding site BASE COUNT 16 a 21 c 15 g 17 t ORIGIN 1 tgactcatca cttttctgcc tggcaacaac catctggaac tcagcagcgg ttgcccctga 61 gatgagtca // LOCUS RSSFBPPRK 2099 bp ds-DNA BCT 24-JUL-1990 DEFINITION R.sphaeroides fructose 1,6-bisphosphatase (fbpB) and phosphoribulokinase (prkB) genes, complete cds. ACCESSION J02922 KEYWORDS 1,6-bisphosphatase; phosphoribulokinase. SOURCE R.sphaeroides (strain HR) DNA. ORGANISM Rhodobacter sphaeroides Prokaryota; Bacteria; Gracilicutes; Anoxyphotobacteria; Purple nonsulfur bacteria. REFERENCE 1 (bases 1 to 2099) AUTHORS Gibson,J.L., Chen,J.-H., Tower,P.A. and Tabita,F.R. TITLE The form II fructose 1,6-bisphosphatase and phosphoribulokinase genes form part of a large operon in Rhodobacter sphaeroides: Primary structure and insertional mutagenesis analysis JOURNAL Biochemistry (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by J.L.Gibson, 15-JUN-1990. 
FEATURES from to/span description pept 44 1039 fructose 1,6-bisphosphatase (fbpB) pept 1043 1921 phosphoribulokinase (prkB) BASE COUNT 361 a 694 c 720 g 324 t ORIGIN 1 ggatcctgac gcaacgtccg ccgcgacaga ggcaggagga gccatggcca tcgagctgga 61 ggacctgggg ctgagccccg atgtggcgga cgtgatgcag cgtctggcgc gcgtgggggc 121 aggcatcgcc cgcatcatct cgcgcaacgg gctcgagcgc gatctgggcg cgggcgtcgg 181 caccaatgcc ggaggagacg ggcagaaggc gctcgacgtg atcgcggacg acgcgttccg 241 cgcggcgctc gaaggctctg cggtggctta ttacgcctcc gaggagcagg acgaagtggt 301 gacgctgggc gagggaagcc tcgcgctcgc catcgacccg ctggacggct cgtccaacat 361 cgatgtgaac gtgtcgatcg ggacgatctt ctcgatcttc ccggcggcgg ctggccccga 421 ggccagcttc ctgcgcccgg gcaccgagca gattgccggc ggctacatca tctacgggcc 481 gcaatgcgcg ctggtctgca gcttcgggca gggcgtgcag cactgggtgc tcgacctcga 541 tgcgggcatc ttccggcgga tgcccgacat ccgcccgctg ccggccgaga cgtccgagtt 601 tgcgatcaac gcctcgaact accgccactg gccgcagccg atccgcgcct tcgtcgacga 661 tctggtcgcc ggggccgagg ggccgcgcgg caagaacttc aacatgcgct ggatcgcctc 721 gctggtggcc gagacgcacc gcatcctgat gcggggcggg gtgtttctct atcccggcga 781 cgagcgcaag ggctacgagc ggggccggct gcgccatgtc tacgaatgcg cgcccatcgc 841 cttcctgatc gcgaatgtcg gggggggcgc caccgacggc tgcgccgaca tcctgaccgc 901 gctgcccgac cggctgcacg cccgcacccc cttcgtcttc ggctgcgcga gcaaggtcgc 961 ccgcgtcgcc gcctatcacg atctggcctg cgaagagacg tccgctctct tcggcagccg 1021 gggcctgttc cggagttaaa gagtgtcgaa gaaatatccc atcatttccg tggtcggctc 1081 gtccggcgcg ggcacctcga cggtcaagaa cacgttcgag cagatcttcc gccgcgaggg 1141 ggtcaagtcc gtctcgatcg agggcgacgc cttccaccgc ttcaaccggg ccgacatgaa 1201 ggccgaactc gagcggcgct atgcggcggg cgatgcgacc ttctcgcatt tctcctacga 1261 ggcgaacgaa ctgaaggagc tggagcgcgt cttccgcgaa tatggcgaga cggggcgcgg 1321 ccgcacccgc acctatgtcc atgacgatgc cgaagccgcc cggacgggcg tggcccccgg 1381 caatttcacc caatgggcgc cgttcgagga caacagcgac ctgcttttct acgaggggct 1441 gcacggctgc gtggtcaatg acgaggtgaa cctcgtccgc catgccgatc tgaagctcgg 1501 cgtggcgccg gtcatcaacc ttgaatggat ccagaagatc caccgcgacc gggcgcagcg 1561 
cggctatacg accgaagccg tcaccgacgt gatcctgcgc cggatgtatg cctacgtcca 1621 ctgtatcgtc ccgcaattct ccgagacgga catcaacttc cagcgcgtgc cggtggtgga 1681 cacctcgaac ccgttcatcg cgcgctggat ccccacgccg gacgagagcc tgatcgtgat 1741 ccggttcaag aacccgcgcg ggatcgactg cccctatctc acctcgatga tcgcgggctc 1801 gtggatgagc cgggcgaatt ccatcgtggt gccgggcaac aagcaggatc tggcgatgca 1861 gctgatcctg acgccgctca tcgagcggat ggtgcgcgag gcgcgccgcg cgcgggcctg 1921 aggcgcgaca gacagacgga gagggggccg cggatcgagc cgcggcccgg caaccgggac 1981 gcacagcgaa ggatgagagc catgaaggac attggagccg cgcaggagac gcggatggcg 2041 aacgcatccg ggccctcgcg atggatgccg tgaagaaggc caagtcgggc catcccggg // LOCUS YSCACH1A 2466 bp ss-mRNA PLN 24-JUL-1990 DEFINITION S.cerevisiae acetyl-CoA hydrolase (ACH1) mRNA, complete cds. ACCESSION M31036 KEYWORDS acetyl-CoA hydrolase. SOURCE S.cerevisiae (strain T3A), cDNA to mRNA, clones pBA[4,5]. ORGANISM Saccharomyces cerevisiae Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes; Endomycetales; Saccharomycetaceae. REFERENCE 1 (bases 1 to 2466) AUTHORS Lee,F.-J.S., Lin,L.-W. and Smith,J.A. TITLE A glucose-repressible gene encodes acetyl-CoA hydrolase from Saccharomyces cerevisiae JOURNAL J. Biol. Chem. 265, 7413-7418 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by L.-W.Lin, 27-DEC-1989. FEATURES from to/span description pept 614 2194 acetyl-CoA hydrolase (ACH1) (EC 3.1.2.1) BASE COUNT 730 a 529 c 558 g 649 t ORIGIN Chromosome II. 
1 gttagccatc attatcatta aaatatcaac ccgaagaaca atgtatacat atacatatac 61 gtacacatat acatatgtac atatgacata cgtattagcc gctgaggacg cggacgtata 121 aaaggacaat acttatatgg agctaagggg agcagttacg caactccgtg atcgcgcgcc 181 acgggccgtc ggcggctgtt aattgaagaa aaaaaaaatg aagaaccaca aggggtgatc 241 catataggtg actagcatca tcccctgcga cgcgcggccc gccgggcaaa ggcgggcaat 301 gcgcgctgct gattggcctc gaggacaacg ccctcaacca catccgcaac agccaatccc 361 atcggagcgt caaactacca aagtagtgat tgtatggatc accactgtat tgtggacggt 421 aagcgcttgc tggagcaaat gtgtaatcaa gttgctgtgt atatatagac gttagatgtg 481 ttctacccct tcttttgtct tgtgcccacc gggcttacat tagcacacaa agcagcaaga 541 gaccgtctta ctagacaata gcggcaaaac aaacaacaca tttctttttt tctttttcac 601 atattgcact aaaatgacaa tttctaattt gttaaagcag agagttaggt atgctcccta 661 tctgaaaaaa gttaaggaag ctcacgagct tattccattg ttcaagaatg gtcagtacct 721 tgggtggtcc ggttttacag gagtgggtac tcccaaggca gtgccggagg cactgataga 781 tcacgtggag aagaacaatt tacaagggaa gttgagattc aacctttttg ttggagcttc 841 tgctggtcca gaggaaaacc gttgggctga acacgacatg atcattaaga gagcccctca 901 tcaagtaggg aaacccattg caaaggcaat taaccagggt agaattgagt tctttgataa 961 acatctgtcc atgttccctc aggatctgac atacgggttc tacaccaggg aaagaaaaga 1021 caacaaaatc cttgattata ctataatcga ggcaacggcc attaaagagg acgggtctat 1081 cgtcccaggt ccctctgtcg gtggttctcc agaattcatt acagtcagtg ataaagtgat 1141 tattgaggtt aacacggcta cgccttcgtt cgagggtatt cacgatatag acatgcccgt 1201 gaacccacct ttcaggaaac catacccata tctgaaagtg gacgacaagt gtggtgttga 1261 ctccatcccg gttgatcctg aaaaggttgt tgcgattgtg gagtccacca tgagggacca 1321 ggtcccacca aatacgccct ctgacgacat gtccagggct attgcaggtc atttggtcga 1381 gtttttcaga aacgaggtaa aacatggtag gctacctgaa aacctgctgc ctttacaaag 1441 tggtataggt aacattgcta acgctgtcat tgaagggctt gctggcgccc aattcaagca 1501 cttgactgta tggacggaag tgctgcagga ctcgttattg gatcttttcg agaacggatc 1561 tttggactac tccactgcta cttccgtgag attgactgaa aagggtttcg acagagcctt 1621 tgcaaactgg gaaaatttca aacacagatt gtgtttgaga tctcaagttg tctcgaacaa 1681 tccggaaatg atccgtagat 
tccctgtcat cgccatgaat accccagtag aagttgacat 1741 ttacgcgcac gccaattcta caaatgtgaa tggttcccgt atgttgaacg ggttgggtgg 1801 atctgctgat ttcttgagaa atgcaaagtt gtccatcatg catgccccct ctgcaagacc 1861 aactaaagta gaccctaccg gtatctctac cattgttcct atggcctctc atgtagatca 1921 aactgagcat gacctggaca tcttggtcac tgaccaaggt ttggcggatc taagaggtct 1981 atcgcctaag gaaagagccc gtgaaatcat caacaagtgt gctcatcccg attatcaagc 2041 tttgttgacc gattacttgg acagagcaga gcattacgct aaaaagcaca attgcttgca 2101 tgaaccacac atgctaaaga atgctttcaa gttccacacc aacttagctg aaaagggtac 2161 aatgaaggtc gacagctggg aaccagttga ctagtgtttg tgcgcaaacc gagagatgag 2221 tatttaacaa aaaaaagaaa ggaaatgata tgattatgat tttatgttta taaagctttt 2281 atccaatgcg ttgttttttc ttgcatattt ataccttttg cgctcatgga gggagttaat 2341 caatacgcat gacgtctagt taattcacag gtagtactgt atatttatat gtttacacaa 2401 taattatgta ttaagtagtg attagtaaaa aaaactaaga ggttgaaagt catcaaccct 2461 tatatt // LOCUS YSCYEF3B 3969 bp ds-DNA PLN 24-JUL-1990 DEFINITION S.cerevisiae elongation factor 3 (EF-3) gene, complete cds. ACCESSION J05583 KEYWORDS elongation factor. SOURCE S.cerevisiae DNA, clone YEpEF3. ORGANISM Saccharomyces cerevisiae Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes; Endomycetales; Saccharomycetaceae. REFERENCE 1 (bases 1 to 3969) AUTHORS Sandbaken,M.G., Lupisella,J.A., DiDomenico,B. and Chakraburtty,K. TITLE Protein synthesis in yeast: Structural and functional analysis of the gene encoding elongation factor three (EF-3) JOURNAL J. Biol. Chem. (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by K.Chakraburtty, 15-JUN-1990. 
FEATURES from to/span description pept 489 3623 elongation factor 3 signal 202 212 regulatory UAS-RPG binding 1875 1898 nucleotide binding consensus GX4GK(S/T) binding 2589 2612 nucleotide binding consensus GX4GK(S/T) binding 463 470 nucleotide binding consensus GX4GK(S/T) binding 701 708 nucleotide binding consensus GX4GK(S/T) binding 2117 2213 nucleotide binding consensus binding 3176 3272 nucleotide binding consensus binding 544 575 nucleotide binding consensus binding 897 928 nucleotide binding consensus BASE COUNT 1203 a 758 c 809 g 1199 t ORIGIN 1 caacttacaa tcattgttcg ccccttccat acttactgcc actcgcaaaa gggcccaacc 61 agggcaatta cgtatcaaaa aatcatgaca ggctgggtaa taaatattcg tgaagaaaga 121 agaaattaaa aaaagaaacg aagaagcaaa aaaaagaaaa gactccgttt aatcactttc 181 aaccgcggtt tatccggccc cacccatgca taaccctaaa ttattagatc acttagcacg 241 tgaaaaagaa acgtttttaa tgtttttttt ttttttcttt ttcttttttt gcgttggtga 301 aaattttttc gcttcctcga gtataattat ctcatctcat ctttcatata agataagaag 361 ttttataaaa accttttgca tcaaaatttt gttgaataac tctttttctt acgctctctt 421 tctttcctta attgttttct aaagaaccgt gtatttttct agttcgaatc catcgataac 481 attaaaagat gtctgattcc cagcaatcca ttaaggttct agaagaacta ttccagaagt 541 tatctgttgc cactgctgac aacagacacg aaatcgcttc tgaagtcgct tctttcttga 601 atggtaacat cattgaacat gatgttccag aacacttctt cggtgaattg gccaagggta 661 tcaaggacaa gaagaccgct gctaacgcca tgcaagctgt tgctcacatt gctaaccaat 721 ctaacttgtc tccatctgtt gaaccataca tcgtccaatt ggttccagct atctgtacca 781 acgcaggtaa caaggacaaa gaaattcaat ctgttgcttc cgaaactttg atttccatcg 841 ttaacgctgt taacccagtt gccatcaaag ctttgttgcc acatttgact aacgctattg 901 tagaaactaa caaatggcaa gaaaagattg ctattttggc agctttctct gctatggtcg 961 atgctgctaa ggatcaagtt gccctaagaa tgccagaatt gattccagtc ttgtctgaaa 1021 ccatgtggga caccaagaag gaagtcaagg ctgctgctac tgccgccatg accaaggcta 1081 ccgaaactgt tgacaacaag gatattgaac gtttcattcc aagtttgatt caatgtattg 1141 ctgacccaac tgaagttcca gaaaccgttc atttgctagg tgctactact ttcgttgctg 1201 aagttactcc agctactttg tccatcatgg 
tcccattgtt gtccagaggt ttgaacgaaa 1261 gagaaaccgg tatcaagcgt aagtctgctg ttattattga caacatgtgt aagttggtcg 1321 aagacccaca agttattgct cctttcttgg gtaaattgtt gccaggtttg aagagtaact 1381 ttgctaccat tgctgaccca gaagccagag aagttacttt gagagctttg aagactttga 1441 gaagagttgg taacgttggt gaagacgatg ctattccaga actttctcac gctggtgacg 1501 tttctactac tttgcaagtc gttaacgaat tgttgaagga cgaaaccgtt gctccaagat 1561 ttaagattgt cgtcgagtac attgccgcca ttggtgctga tttgatcgat gaaagaatca 1621 ttgaccaaca agcttggttc acccacatca ccccatacat gactatcttc ttgcacgaaa 1681 agaaggccaa ggacatcttg gacgaattca gaaagagagc tgtcgacaac attccagttg 1741 gtccaaactt cgacgacgaa gaagacgaag gtgaagactt atgtaactgt gaattttctt 1801 tggcttatgg tgctaaaatc ttgttgaaca agacccaatt aagattgaag agagccagaa 1861 gatatggtat ctgtggtcca aacggttgtg gtaagtccac tttaatgaga gctattgcca 1921 acggtcaagt tgatggtttc ccaacccaag aagaatgtag aaccgtctac gtcgaacacg 1981 acattgatgg tactcactct gacacttccg tcttggattt cgttttcgaa tctggtgttg 2041 gtactaaaga agctatcaag gacaaattga ttgaattcgg tttcaccgat gaaatgattg 2101 ctatgccaat ctctgcttta tctggtggtt ggaagatgaa gttggctcta gctagagctg 2161 tgttgagaaa tgctgatatc ttgttgttag atgaaccaac taaccatttg gataccgtca 2221 acgttgcttg gttagttaac tacttgaaca cctgtggtat cacttctatc actatttctc 2281 acgactccgt tttcttagat aacgtctgtg aatatattat taactacgaa ggtttgaagt 2341 tgagaaagta caagggtaac tttaccgaat tcgttaagaa gtgtccagct gctaaggctt 2401 acgaagaatt atccaacact gatttggaat tcaagttccc agaaccaggt tacttggaag 2461 gtgttaagac taagcaaaag gctattgtca aggttaccaa catggaattc caatatccag 2521 gtacctctaa gccacaaatc actgacatta acttccaatg ttctttgtct tccagaattg 2581 ctgtcattgg tccaaatggt gctggtaagt ctactttgat taacgtcttg actggtgaac 2641 tattaccaac ctctggtgaa gtctacaccc acgaaaattg tcgtatcgct tacattaagc 2701 aacacgcttt tgctcatatc gaatctcatt tggacaagac tccatctgaa tatatccaat 2761 ggagattcca aaccggtgaa gatagagaaa ccatggacag agctaacaga caaatcaacg 2821 aaaacgatgc tgaagctatg aacaagatct tcaagattga aggtacccct agaagaattg 2881 ccggtatcca ctccagaaga aagttcaaga acacttacga 
atatgaatgt tctttcttat 2941 tgggtgaaaa cattggtatg aaatctgaaa gatgggttcc aatgatgtcc gtcgacaacg 3001 cttggattcc aagaggtgaa ttggttgaat ctcactctaa gatggttgct gaagttgata 3061 tgaaggaagc tttggcttct ggtcaattcc gtccattaac cagaaaagaa attgaagaac 3121 attgttccat gttgggtttg gacccagaaa ttgtttctca ctccagaatt agaggtttgt 3181 ctggtggtca aaaggttaag ttggtcttag ctgccggtac atggcaaaga cctcacttga 3241 ttgtcttaga tgaacctacc aactatctgg acagagattc tttgggtgct ttgtctaagg 3301 ctttgaagga atttgaaggt ggtgttatta tcattactca ctctgctgaa ttcacaaaga 3361 acttgactga agaagtctgg gccgtcaagg acggtagaat gactccatct ggtcacaact 3421 gggttagtgg tcaaggtgct ggtccaagaa tcgaaaagaa ggaagacgaa gaagataaat 3481 tcgatgctat gggtaacaag attgccggtg gtaagaagaa gaagaagttg tcttctgcgg 3541 aattgagaaa gaagaagaag gaaagaatga agaagaagaa ggaattgggt gatgcttacg 3601 tttcttctga cgaagaattc taatcttttt gatcactgct ttcacagttt tctttaagat 3661 ttttattgat caataattta tgtatatttt aatttctatg tttttgtaat attgtttatt 3721 ttggtaaaat atagacgcaa cttccttatt ataaagaaag gcattattta aaagaaaaag 3781 cgttccatta gtcagacatc tttttttttt tcatacattc ttaagcctca ggcaaattga 3841 gcattgcctc ataccctttt cggtaagagg gtaacgaaaa tatttttttg gaagaataaa 3901 aataggtgac ggatcataga ctaggaagct ttaaaacatg attgagcgta atattatatt 3961 ccttctaga // LOCUS HUMKSAA 1504 bp ss-mRNA PRI 24-JUL-1990 DEFINITION Human adenocarcinoma-associated antigen (KSA) mRNA, complete cds. ACCESSION M32325 KEYWORDS adenocarcinoma-associated antigen. SOURCE Human cell line UCLA-P3, cDNA to mRNA, clone AG[1,1338,933]. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 1504) AUTHORS Strnad,J., Hamilton,A.E., Beavers,L.S., Gamboa,G.C., Apelgren,L.D., Taber,L.D., Sportsman,J.R., Bumol,T.F., Sharp,J.D. and Gadski,R.A. TITLE Molecular cloning and characterization of a human adenocarcinoma/epithelial cell surface antigen complementary DNA JOURNAL Cancer Res. 
49, 314-317 (1989) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by J.D. Sharp, 22-FEB-1990. FEATURES from to/span description pept 155 1099 adenocarcinoma-associated antigen precursor (KSA) sigp 155 223 adenocarcinoma-associated antigen signal peptide matp 398 1096 adenocarcinoma-associated antigen mRNA < 1 1504 adenocarcinoma-associated antigen mRNA site 1486 1491 polyadenylation site BASE COUNT 442 a 302 c 356 g 404 t ORIGIN 1 gagcgagcac cttcgacgcg gtccggggac cccctcgtcg ctgtcctccc gacgcggacc 61 cgcgtgcccc aggcctcgcg ctgcccggcc ggctcctcgt gtcccactcc cggcgcacgc 121 cctcccgcgc ccctcttctc ggcgcgcgcg cagcatggcg cccccgcagg tcctcgcgtt 181 cgggcttctg cttgccgcgg cgacggcgac ttttgccgca gctcaggaag aatgtgtctg 241 tgaaaactac aagctggccg taaactgctt tgtgaataat aatcgtcaat gccagtgtac 301 ttcagttggt gcacaaaata ctgtcatttg ctcaaagctg gctgccaaat gtttggtgat 361 gaaggcagaa atgaatggct caaaacttgg gagaagagca aaacctgaag gggccctcca 421 gaacaatgat gggctttatg atcctgactg cgatgagagc gggctcttta aggccaagca 481 gtgcaacggc acctccacgt gctggtgtgt gaacactgct ggggtcagaa gaacagacaa 541 ggacactgaa ataacctgct ctgagcgagt gagaacctac tggatcatca ttgaactaaa 601 acacaaagca agagaaaaac cttatgatag taaaagtttg cggactgcac ttcagaagga 661 gatcacaacg cgttatcaac tggatccaaa atttatcacg agtattttgt atgagaataa 721 tgttatcact attgatctgg ttcaaaattc ttctcaaaaa actcagaatg atgtggacat 781 agctgatgtg gcttattatt ttgaaaaaga tgttaaaggt gaatccttgt ttcattctaa 841 gaaaatggac ctgacagtaa atggggaaca actggatctg gatcctggtc aaactttaat 901 ttattatgtt gatgaaaaag cacctgaatt ctcaatgcag ggtctaaaag ctggtgttat 961 tgctgttatt gtggttgtgg tgatggcagt tgttgctgga attgttgtgc tggttatttc 1021 cagaaagaag agaatggcaa agtatgagaa ggctgagata aaggagatgg gtgagatgca 1081 tagggaactc aatgcataac tatataattt gaagattata gaagaaggga aatagcaaat 1141 ggacacaaat tacaaatgtg tgtgcgtggg acgaagacat ctttgaaggt catgagtttg 1201 ttagtttaac atcatatatt tgtaatagtg aaacctgtac tcaaaatata agcagcttga 1261 aactggcttt accaatcttg aaatttgacc 
acaagtgtct tatatatgca gatctaatgt 1321 aaaatccaga acttggactc catcgttaaa attatttatg tgtaacattc aaatgtgtgc 1381 attaaatatg cttccacagt aaaatctgaa aaactgattt gtgattgaaa gctgcctttc 1441 tatttacttg agtcttgtac atacatactt ttttatgagc tatgaaataa aacattttaa 1501 actg // LOCUS HAMHITAO1 1581 bp ss-mRNA ROD 24-JUL-1990 DEFINITION Hamster G-o protein alpha subunit 1 (GNAO I) mRNA, complete cds. ACCESSION M33661 J05476 KEYWORDS G-o protein alpha subunit. SOURCE Hamster insulinoma cell line HIT T-125 M2.2.2, cDNA to mRNA. ORGANISM Cricetulus sp. Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Cricetidae; Cricetinae; Cricetini. REFERENCE 1 (bases 1 to 1581) AUTHORS Hsu,W.H., Rudolph,U., Sanford,J., Bertrand,P., Olate,J., Nelson,C., Moss,L.G., Boyd,A.E.III., Codina,J. and Birnbaumer,L. TITLE Molecular cloning of a novel splice variant of the alpha subunit of the mammalian G-o protein JOURNAL J. Biol. Chem. 265, 11220-11226 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by L.Birnbaumer, 10-APR-1990, for release after publication. 
FEATURES from to/span description pept 458 1522 G-o protein alpha subunit 1 BASE COUNT 357 a 472 c 405 g 347 t ORIGIN 1 gcctccgcct cctcctcctc cgcctccggc agccgcggca gaaggaccca ccctgccccc 61 caccccaccc tccgccggct ccggctgcgg atccagcctc tactctcatt ttatttattt 121 tgggtcgtgc actagcctca gtgcctgcag cccgcgcctc ccgggcccgc gggcacctcc 181 tccctcggct ccggaggccc agaccccggc caccctcacc tcgacaccct cagaccccgg 241 ccagtcgccg ctagtctccg ccgctggaat cttgatagag gctgtcattt tttgggggtt 301 ctggcctttt cgacaatttt gttcccagcc aaggagagga tatcgtgatt ttctcccctt 361 tgagcccagg ctctgctctg tgggggggtg gggggcgcgc cgacccgggg agtcgtgcca 421 gccgagtcgt gcgggctgtg gcagggaagg ggccaccatg ggatgtactc tgagcgcaga 481 ggagagagcc gccctcgagc ggagcaaggc gattgagaaa aacctcaaag aagatggcat 541 cagcgccgcc aaagatgtga aattactcct gctgggggct ggagaatcag gaaaaagcac 601 cattgtgaag cagatgaaga tcatccatga agatggcttc tctggagagg acgtaaagca 661 gtacaagcct gttgtctata gcaacaccat ccagtctctg gcagccatcg tccgggccat 721 ggatactttg ggtgtggagt atggtgacaa ggagagaaag gcggactcca agatggtgtg 781 tgatgtggtg agccgtatgg aagacactga acccttctct gcagagctgc tttctgccat 841 gatgcgactc tggggtgact cagggatcca ggagtgcttc aaccgatctc gggagtatca 901 gctcaatgat tctgccaaat actacctgga cagcctggat cggattggag ccgctgacta 961 ccagcccact gagcaggaca tcctccgaac cagggtcaaa accaccggca tcgtagaaac 1021 ccacttcaca ttcaagaacc tccacttcag gctgtttgat gttgggggcc agcgatctga 1081 acgcaagaag tggatccact gctttgagga tgtcacggcc atcatcttct gtgtcgccct 1141 cagcggctat gaccaggtgc tccacgagga cgaaaccacg aaccgcatgc acgagtctct 1201 catgctcttc gactccatct gtaacaacaa gtttttcatc gatacctcca tcattctctt 1261 cctcaacaag aaagacctct ttggcgagaa gattaagaag tcacccttga ccatctgctt 1321 ccctgaatac ccaggctcca acacctatga agatgcagct gcctacatcc aaacacagtt 1381 tgaaagcaaa aaccgctcac ctaacaaaga aatctattgt cacatgactt gtgccacaga 1441 cacgaataat atccaggtgg tattcgacgc tgtcaccgac atcatcattg ccaacaatct 1501 ccggggctgc ggcttgtact gacctcttgt cctgtatagc aacctatttg actgcttcat 1561 ggactctttg ctgttgatgt t // LOCUS HAMHITAO2 1678 bp 
ss-mRNA ROD 24-JUL-1990 DEFINITION Hamster G-o protein alpha subunit 2 mRNA, complete cds. ACCESSION M33662 J05476 KEYWORDS G-o protein alpha subunit. SOURCE Hamster insulinoma cell line HIT T-125 M2.2.2, cDNA to mRNA. ORGANISM Cricetulus sp. Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Cricetidae; Cricetinae; Cricetini. REFERENCE 1 (bases 1 to 1678) AUTHORS Hsu,W.H., Rudolph,U., Sanford,J., Bertrand,P., Olate,J., Nelson,C., Moss,L.G., Boyd,A.E.III., Codina,J. and Birnbaumer,L. TITLE Molecular cloning of a novel splice variant of the alpha subunit of the mammalian G-o protein JOURNAL J. Biol. Chem. 265, 11220-11226 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by L.Birnbaumer, 10-APR-1990, for release after publication. FEATURES from to/span description pept 458 1522 G-o protein alpha subunit 2 BASE COUNT 374 a 514 c 448 g 342 t ORIGIN 1 gcctccgcct cctcctcctc cgcctccggc agccgcggca gaaggaccca ccctgccccc 61 caccccaccc tccgccggct ccggctgcgg atccagcctc tactctcatt ttatttattt 121 tgggtcgtgc actagcctca gtgcctgcag cccgcgcctc ccgggcccgc gggcacctcc 181 tccctcggct ccggaggccc agaccccggc caccctcacc tcgacaccct cagaccccgg 241 ccagtcgccg ctagtctccg ccgctggaat cttgatagag gctgtcattt tttgggggtt 301 ctggcctttt cgacaatttt gttcccagcc aaggagagga tatcgtgatt ttctcccctt 361 tgagcccagg ctctgctctg tgggggggtg gggggcgcgc cgacccgggg agtcgtgcca 421 gccgagtcgt gcgggctgtg gcagggaagg ggccaccatg ggatgtactc tgagcgcaga 481 ggagagagcc gccctcgagc ggagcaaggc gattgagaaa aacctcaaag aagatggcat 541 cagcgccgcc aaagatgtga aattactcct gctgggggct ggagaatcag gaaaaagcac 601 cattgtgaag cagatgaaga tcatccatga agatggcttc tctggagagg acgtaaagca 661 gtacaagcct gttgtctata gcaacaccat ccagtctctg gcagccatcg tccgggccat 721 ggatactttg ggtgtggagt atggtgacaa ggagagaaag gcggactcca agatggtgtg 781 tgatgtggtg agccgtatgg aagacactga acccttctct gcagagctgc tttctgccat 841 gatgcgactc tggggtgact cagggatcca ggagtgcttc 
aaccgatctc gggagtatca 901 gctcaatgat tctgccaaat actacctgga cagcctggat cggattggag ccgctgacta 961 ccagcccact gagcaggaca tcctccgaac cagggtcaaa accaccggca tcgtagaaac 1021 ccacttcaca ttcaagaacc tccacttcag gctgtttgat gttgggggcc agcgatctga 1081 acgcaagaag tggatccact gctttgagga tgtcacggcc atcatcttct gtgtcgccct 1141 cagcggctat gaccaggtgc tccacgagga cgaaaccacg aaccgaatgc atgagtccct 1201 gaagcttttt gacagcatct gcaacaacaa gtggttcaca gacacgtcta tcatcctgtt 1261 tctcaacaag aaggacatat ttgaggagaa gatcaccagg tccccactga ccatctgctt 1321 tcctgaatac acaggcccca gtgccttcac agaagctgtg gctcacatcc aggggcagta 1381 tgagagtaag aacaaatcag ctcacaagga aatctacacc catttcacct gtgccaccga 1441 caccaacaac atccagtttg tctttgatgc tgtgacggac gtcatcatcg ccaaaaatct 1501 gcggggctgt ggactctact gagccctggc ctcctgccca gcctgctgct cacttctccc 1561 ctggacccag agctctgcta ccactcgatg ctctgtgcac tgagggaaaa cccagaggct 1621 ggccctgggg gaggaggaag caccctctga gcatccccac cctaccccac tccagagg // LOCUS HUMPEPD1 610 bp ds-DNA PRI 24-JUL-1990 DEFINITION Human peptidase D gene, exon 1. ACCESSION M36549 J05513 KEYWORDS peptidase D; prolidase. SEGMENT 1 of 2 SOURCE Human leukocyte DNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 610) AUTHORS Tanoue,A., Endo,F. and Matsuda,I. TITLE Structural organization of the gene for human prolidase (peptidase D) and demonstration of a partial gene deletion in a patient with prolidase deficiency JOURNAL J. Biol. Chem. 265, 11306-11311 (1990) STANDARD full staff_review COMMENT Draft entry and printed sequence for [1] kindly submitted by F.Endo, 23-MAY-1990. 
FEATURES       from  to/span    description
    pept        372  /  388     peptidase D, exon 1 (EC 3.4.13.9)
                                /hgml_locus_uid="LF0121K"
                                /nomgen="PEPD"
                                /map="19q12-q13.2"
    pre-msg     241  >  610     PEPD mRNA and introns
    IVS         389  >  610     PEPD intron A
BASE COUNT       96 a    225 c    185 g    104 t
ORIGIN
        1 gtggtgtacg tgtacgtaag atcggggact gaacggagaa tcaagatgtc ctctgcttca
       61 ggacctccct cccaccacga aaaggctggc agatgcagat gctgcccgct aacccgacct
      121 actgtctgca gccggcagcc cgctcattta tcctggccga gctcttgggc ccccaatctc
      181 caggagcccc gccccatcgg ctccgccctt ccccaggtac cgccccttcc cgcccgtgga
      241 atttggctct ggaggaggtg ggtcaaggac tccatgcccc gccccctgcc ccgcccttac
      301 cccgccccat gcattggcac cggaggggct catggacgcg cgcacttcag tgacgccggt
      361 gccgggcgaa catggcggcg gccaccgggt gagtgctgcg cctccctcgc tctttcccgc
      421 tggcgtggag aggggcatgg gtgtcccggc cggaccgcgg gtgagagacc gtcaccgggt
      481 cgcgtgaatg ccgcgggtag gcgcctcttc ccgttggccc ggaagtccca gcgccggtca
      541 ggcgagcctg cacgcgtccg agtcaaaggt caacccgagc aggacccgac tacctctcct
      601 aacctctcca
//
LOCUS       HUMPEPD2      530 bp ds-DNA             PRI       24-JUL-1990
DEFINITION  Human peptidase D gene, 3' end.
ACCESSION   M35496 J05513
KEYWORDS    peptidase D; prolidase.
SEGMENT     2 of 2
SOURCE      Human leukocyte DNA.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 1 to 530)
  AUTHORS   Tanoue,A., Endo,F. and Matsuda,I.
  TITLE     Structural organization of the gene for human prolidase
            (peptidase D) and demonstration of a partial gene deletion in a
            patient with prolidase deficiency
  JOURNAL   J. Biol. Chem. 265, 11306-11311 (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and printed sequence for [1] kindly submitted by
            F.Endo, 23-MAY-1990.
FEATURES       from  to/span    description
    pept     /    1      3      peptidase D, exon 15 (EC 3.4.13.9)
                                /hgml_locus_uid="LF0121K"
                                /nomgen="PEPD"
                                /map="19q12-q13.2"
    pre-msg  <    1    394      PEPD mRNA and introns
    signal      373    378      poly-A signal
BASE COUNT      133 a    132 c    114 g    151 t
ORIGIN      About 110 kb after segment 1.
        1 tagagccagc cagaaatccc agcgcacctg ggggcctggc cttgcaacct cttttcgtga
       61 tgggcagcct gctggtcagc actccagtag cgagagacgg cacccagaat cagatcccag
      121 cttcggcatt tgatcagacc aaacagtgct gtttcccggg gaggaaacac ttttttaatt
      181 acccttttgc aggcaccacc tttaatctgt tttatacctt gcttattaaa tgagcgactt
      241 aaaatgattg aaaataatgc tgttctttag tagcaactaa aatgtgtctt gctgtcattt
      301 atattccttt tcccaggaaa gaagcatttc tgatactttc tgtcaaaaat caatatgcag
      361 aatggcattt gcaataaaag gtttcctaaa atggtccttt gtgcgtcggc ttcccctttc
      421 attcctttgc agatgagcat ttgggacccc gtccgacccc tgaaaagatc ggctcacctg
      481 gtgtcagtat cagtcattcc aggaagatgg ctcgtggcag cacaggccct
//
LOCUS       SYNPMAMA     4354 bp ds-DNA             SYN       24-JUL-1990
DEFINITION  Artificial cloning vector pMAMPF.
ACCESSION   M33637
KEYWORDS    .
SOURCE      Synthetic DNA from E.coli cell line WK6-lambd-pMAMPF3, from
            pMA58.
  ORGANISM  Cloning vector
            Artificial sequences; Cloning vehicles.
REFERENCE   1  (bases 1 to 4354)
  AUTHORS   Szardenings,M. and Collins,J.
  TITLE     A plasmid optimized for protein design projects: pMAMPF
  JOURNAL   Gene (1990) In press
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly
            submitted by J.Collins, 09-APR-1990.
FEATURES from to/span description signal 4121 4183 OmpA signal sequence site 4181 4186 NaeI cloning site site 4184 4308 Stuffer (pBR325:5900 to 29) BASE COUNT 1152 a 1054 c 1012 g 1136 t ORIGIN 1 aattcacctc gaaagcaagc tgataaaccg atacaattaa aggctccttt tggagccttt 61 ttttttggag attttcaacg tgaaaaaatt attattcgca attccaagct aattcacctc 121 gaaagcaagc tgataaaccg atacaattaa aggctccttt tggagccttt ttttttggag 181 attttcaacg tgaaaaaatt attattcgca attccaagct ctgcctcgcg cgtttcggtg 241 atgacggtga aaacctctga cacatgcagc tcccggagac ggtcacagct tgtctgtaag 301 cggatgcaga tcacgcgccc tgtagcggcg cattaagcgc ggcgggtgtg gtggttacgc 361 gcagcgtgac cgctacactt gccagcgccc tagcgcccgc tcctttcgct ttcttccctt 421 cctttctcgc cacgttcgcc agctttcccc gtcaagctct aaatcggggg ctccctttag 481 ggttccgatt tagtgcttta cggcacctcg accccaaaaa acttgattag ggtgatggtt 541 cacgtagtgg gccatcgccc tgatagacgg tttttcgccc tttgacgttg gagtccacgt 601 tctttaatag tggactcttg ttccaaactg gaacaacact caaccctatc tcggtctatt 661 cttttgattt ataagggatt ttgccgattt cggcctattg gttaaaaaat gagctgattt 721 aacaaaaatt taacgcgaat tttaacaaaa tattaacgtt tacaatttga tctgcgctcg 781 gtcgttcggc tgcggcgagc ggtatcagct cactcaaagg cggtaatacg gttatccaca 841 gaatcagggg ataacgcagg aaagaacatg tgagcaaaag gccagcaaaa ggccaggaac 901 cgtaaaaagg ccgcgttgct ggcgtttttc cataggctcc gcccccctga cgagcatcac 961 aaaaatcgac gctcaagtca gaggtggcga aacccgacag gactataaag ataccaggcg 1021 tttccccctg gaagctccct cgtgcgctct cctgttccga ccctgccgct taccggatac 1081 ctgtccgcct ttctcccttc gggaagcgtg gcgctttctc aatgctcacg ctgtaggtat 1141 ctcagttcgg tgtaggtcgt tcgctccaag ctgggctgtg tgcacgaacc ccccgttcag 1201 cccgaccgct gcgccttatc cggtaactat cgtcttgagt ccaacccggt aagacacgac 1261 ttatcgccac tggcagcagc cactggtaac aggattagca gagcgaggta tgtaggcggt 1321 gctacagagt tcttgaagtg gtggcctaac tacggctaca ctagaaggac agtatttggt 1381 atctgcgctc tgctgaagcc agttaccttc ggaaaaagag ttggtagctc ttgatccggc 1441 aaacaaacca ccgctggtag cggtggtttt tttgtttgca agcagcagat tacgcgcaga 1501 aaaaaaggat ctcaagaaga tcctttgatc ttttctacgg 
ggtctgacgc tcagtggaac 1561 gaaaactcac gttaagggat tttggtcatg agattatcaa aaaggatctt cacctagatc 1621 cttttaaatt aaaaatgaag ttttaaatca atctaaagta tatatgagta aacttggtct 1681 gacagttacc aatgcttaat cagtgaggca cctatctcag cgatctgtct atttcgttca 1741 tccatagttg cctgactccc cgtcgtgtag ataactacga tacgggaggg cttaccatct 1801 ggccccagtg ctgcaatgat accgcgagac ccacgctcac cggctccaga tttatcagca 1861 ataaaccagc cagccggaag ggccgagcgc agaagtggtc ctgcaacttt atccgcctcc 1921 atccagtcta ttaattgttg ccgggaagct agagtaagta gttcgccagt taatagtttg 1981 cgcaacgttg ttgccattgc tgcaggcatc gtggtgtcac gctcgtcgtt tggtatggct 2041 tcattcagct ccggttccca acgatcaagg cgagttacat gatcccccat gttgtgcaaa 2101 aaagcggtta gctccttcgg tcctccgatc gttgtcagaa gtaagttggc cgcagtgtta 2161 tcactcatgg ttatggcagc actgcataat tctcttactg tcatgccatc cgtaagatgc 2221 ttttctgtga ctggtgagta ctcaaccaag tcattctgag aatagtgtat gcggcgaccg 2281 agttgctctt gcccggcgtc aacacgggat aataccgcgc cacatagcag aactttaaaa 2341 gtgctcatca ttggaaaacg ttcttcgggg cgaaaactct caaggatctt accgctgttg 2401 agatccagtt cgatgtaacc cactcgtgca cccaactgat cttcagcatc ttttactttc 2461 accagcgttt ctgggtgagc aaaaacagga aggcaaaatg ccgcaaaaaa gggaataagg 2521 gcgacacgga aatgttgaat actcatactc ttcctttttc aatattattg aagcagacag 2581 ttttattgtt catgatgata tatttttatc ttgtgcaatg taacatcaga gattttgaga 2641 cacaacgtgg ctttgttgaa taaatcgaac ttttgctgag ttgactcccc gcgcgcgatg 2701 ggtcgaattt gctttcgaaa aaaaagcccg ctcattaggc gggctaaaaa aaagcccgct 2761 cattaggcgg gctcgaattt ctgccattca tccgcttatt atcacttatt caggcgtagc 2821 aaccaggcgt ttaagggcac caataactgc cttaaaaaaa ttacgccccg ccctgccact 2881 catcgcagta ctgttgtaat tcattaagca ttctgccgac atggaagcca tcacagacgg 2941 catgatgaac ctgaatcgcc agcggcatca gcaccttgtc gccttgcgta taatatttgc 3001 ccatagtgaa aacgggggcg aagaagttgt ccatattcgc cacgtttaaa tcaaaactgg 3061 tgaaactcac ccagggattg gctgagacga aaaacatatt ctcaataaac cctttaggga 3121 aataggccag gttttcaccg taacacgcca catcttgcga atatatgtgt agaaactgcc 3181 ggaaatcgtc gtggtattca ctccagagcg atgaaaacgt ttcagtttgc 
tcatggaaaa 3241 cggtgtaaca agggtgaaca ctatcccata tcaccagctc accgtctttc attgccatac 3301 gaaattccgg atgagcattc atcaggcggg caagaatgtg aataaaggcc ggataaaact 3361 tgtgcttatt tttctttacg gtctttaaaa aggccgtaat atccagctga acggtctggt 3421 tataggtaca ttgagcaact gactgaaatg cctcaaaatg ttctttacga tgccattggg 3481 atatatcaac ggtggtatat ccagtgattt ttttctccat tttagcttcc ttagctcctg 3541 aaaatctcga taactcaaaa aatacgcccg gtagtgatct tatttcatta tggtgaaagt 3601 tggaacctct tacgtgccga tcaacgtctc attttcgcca aaagttggcc cagggcttcc 3661 cggtatcaac agggacacca ggatttattt attctgcgaa gtgatcttcc gtcacaggta 3721 tttattcgaa gacgaaaggg catcgcgcgc ggggaattgg ccacgatgcg tccggcgtag 3781 aggatctctc acctaccaaa caatgccccc ctgcaaaaaa taaattcata taaaaaacat 3841 atagataacc atctgcggtg ataaattatc tctggcggtg ttgacataaa taccactggc 3901 ggtgatactg agcacatcag caggacgcac tgaccaccat gaaggtgacg ctcttaaaat 3961 taagccctga agaagggcag cattcaaagc agaaggcttt ggggtgtgtg atacgaaacg 4021 aagcattgga attctacaac ttgcttggat tcctacaaag aagcagcaat tttcagtgtc 4081 agaagtcgac caaggaggtc tagataacga ggcgcaaaaa atgaaaaaga cagctatcgc 4141 gattgcagtg gcactggctg gtttcgctac cgtagcgcag gccggcgata atggcctgct 4201 tctcgccgaa acgtttggtg gcgggaccag tgacgaaggc ttgagcgagg gcgtgcaaga 4261 ttccgaatac cgcaagcgac aggccatgtt tgacagctta tcatcgataa gcttggatcc 4321 gtcgacctgc agccaagctt ggtctagagg tcga // LOCUS DSBRR1610 1376 bp ss-rRNA BCT 24-JUL-1990 DEFINITION D.autotrophicum 16S ribosomal rRNA. ACCESSION M34409 KEYWORDS ribosomal 16S rRNA. SOURCE D.autotrophicum (strain DSM 3382) rRNA. ORGANISM Desulfobacterium autotrophicum Prokaryota; Bacteria; Mendosicutes; Archaeobacteria; Sulfolobales. REFERENCE 1 (bases 1 to 1376) AUTHORS Devereux,R., Delaney,M., Widdel,F. and Stahl,D.A. TITLE Natural relationships among sulfate-reducing eubacteria JOURNAL J. Bacteriol. 171, 6689-6695 (1989) STANDARD full staff_review COMMENT Draft entry and computer-readable [or printed] sequence for [1] kindly submitted by D.A.Stahl, 22-MAY-1990. 
FEATURES from to/span description rRNA < 1 1376 16S rRNA BASE COUNT 340 a 287 c 399 g 293 t 57 others ORIGIN 1 aactggagag tttgattctg gctcagaatg aacgctggcg gcgtgcttaa cacatgcaag 61 tcgaacgaga aagggattgc ttgcaatcct gagtagagtg gcgcacgggt gagtaacacg 121 tggataatct acctttgaat ccgggataac tgttcgaaag aatcgctaat accggatgaa 181 gtcagtnttt ccttggaagg attgatgaaa gacagcctct tcttgaaagc tnttgtttga 241 agatgagtcc gcgcaccatt agttngttgg tggggtaacg gcctaccaag acnttnatgg 301 ttagctggtc tgagaggatg atcagccacn ctggaactga cacacggtcc ngactcctnc 361 gggaggcagc agtgaggaat tttgcgcaat gggngcnacc ctnacgcagc aacgccgcgt 421 gagtgaagaa ggcctntggg tcgtaaagct ctgtcaatgg ggaagaagtt rcttcatatt 481 aatrgtatgt ngtattgacg gtacccatnn nngaagcacc ggcnnactcc gtgccagcan 541 nnncggtaat anggagngtg cnagcgtnat tcggaatnat tnggngtnaa gggcacgcag 601 gcggtnttgc cagtcagatg tgaaagcccg gggctcaacc ctggaagtgc atttgaaact 661 acaaggcttg agtacggtng aggaaagggg nattcctngt gtagaggtga aattcgtaga 721 tatcaggagg aacaccggtg gcgaaggcgc cnntctggac csnnactgac gctnaggtnc 781 naaggcgtgg gnagcgaaca ggattagata ccctggtagt ccacgcagta aacgttgtat 841 actaggtgta gcgggntatt gacccntgct gtgccggagc taacgcatta agtataccgc 901 ctgggaagta cggtcgcaag actaaaactc aaagaaattg acgggggccc gcacaagcgg 961 tggagcatgt ggtttaattc ganncaacgc gcagaacctt acccagactt gacatcctga 1021 gaatctgtta gagatattag agtgccttcg ggaactcaga gacaggtgct gcatggctgt 1081 cgtcagctcg tgtcgtgaga tgttcggtta agtccggcaa cgagcgcaac ccctatcttc 1141 agttgccagc acatcatggt gggaactctg aagagactgc cccggttaac ggggaggaag 1201 gnggggatga cgtcaagtcc tcatggccct tatgtctggg gctacacacg tgntacaatg 1261 gtatgtacna cgggcagcga acctgcnnag gtgngcgaat ccctnaaagc atatcccagt 1321 ccggattgga gtctgcaact cgactccatg aagttggaat cgctagtaat cgtgga // LOCUS DSBRR1677 1375 bp ss-rRNA BCT 24-JUL-1990 DEFINITION D.niacini 16S ribosomal rRNA. ACCESSION M34406 KEYWORDS ribosomal 16S rRNA. SOURCE D.niacini (strain DSM 2650) rRNA. ORGANISM Desulfobacterium niacini Prokaryota; Bacteria; Mendosicutes; Archaeobacteria; Sulfolobales. 
REFERENCE 1 (bases 1 to 1375) AUTHORS Devereux,R., Delaney,M., Widdel,F. and Stahl,D.A. TITLE Natural relationships among sulfate-reducing eubacteria JOURNAL J. Bacteriol. 171, 6689-6695 (1989) STANDARD full staff_review COMMENT Draft entry and computer-readable [or printed] sequence for [1] kindly submitted by D.A.Stahl, 22-MAY-1990. FEATURES from to/span description rRNA < 1 1375 16S rRNA BASE COUNT 292 a 252 c 340 g 256 t 235 others ORIGIN 1 taactggaga gtttgattct ggctcagaat gaacgctggc ggcgtgctta acacatgcaa 61 gtcgaacgag aaagtgcctg cttgcaggaa cgagtagagt ggcgcacggg tgagtaacac 121 gtggataatc taccttcaaa tcgggaataa ccattcgaaa ggatggctaa tgccggataa 181 agtcagatrt tctttggagt atttgatgaa agggtgcctc tgattgaaag catctgtttg 241 aagatgagtc cgcgcaccat tagtttgttg gtggggtaac ggcctaccaa gactncgatg 301 gtnagctngt ctgagaggat gatcagccac actggaactg acacacggtc crgrctccta 361 cgggaggcag cagtgaggaa ttttgcgcaa tgggggcaac cctnacgcag caacgcngcg 421 tgagtgaaga aggccnnngg gtcgtaaagc tntgtccatg gggaagaaat tatttagtgt 481 tratrgcatn ntttattgac nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 541 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 601 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 661 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnttcctgg tgtagaggtg aaattcgtag 721 atatcaggag gaacaccggt ggcgaaggcg ccngtctgga ccgatactga cgctsaggtg 781 cgaaggcgtg ggnagcgaac gggattagat rccccggtag tccacgcagt aaacgttgta 841 tgctaggtgt agcgggtatt gcccntgctn cgccgcagtt aacacattaa gcataccgcc 901 tnnnnngtac ggtcgcaaga ctaaaactca aagaaattga cgggggcccg cacaagcggt 961 ggagcatgtg gtttrattcg acnnaacgcg aagaacctta cccagacttg acatcttgag 1021 aatcctttcg aaagttagga gtgccttcgg nngctcaaag acagnngctg catggctgtc 1081 gtcagctcgt gtcgtgagat gtttggttaa gtccagcaac gagcgcaacc cttgtcttca 1141 gttgccagcg cgtaatggcg gaactctgaa gagactgccc cggtnaacgg ggaggaaggt 1201 ggggatgacg tcaagtcctc atggcccnna tgtctgggct acrcrcgtgc tacaatggta 1261 tgtacaacgg gcagcaangc agcgatgcca agcaaatccc ttaaagcata tcccrgtccg 1321 
gattggagtc tgcractcga ctccatgaag ttggaatcgc tagtaatcgt ggatc // LOCUS DSBRR169 1383 bp ss-rRNA BCT 24-JUL-1990 DEFINITION D.vacuolatum 16S ribosomal rRNA. ACCESSION M34408 KEYWORDS ribosomal 16S rRNA. SOURCE D.vacuolatum (strain DSM 3385) rRNA. ORGANISM Desulfobacterium vacuolatum Prokaryota; Bacteria; Mendosicutes; Archaeobacteria; Sulfolobales. REFERENCE 1 (bases 1 to 1383) AUTHORS Devereux,R., Delaney,M., Widdel,F. and Stahl,D.A. TITLE Natural relationships among sulfate-reducing eubacteria JOURNAL J. Bacteriol. 171, 6689-6695 (1989) STANDARD full staff_review COMMENT Draft entry and computer-readable [or printed] sequence for [1] kindly submitted by D.A.Stahl, 22-MAY-1990. FEATURES from to/span description rRNA < 1 1383 16S rRNA BASE COUNT 345 a 295 c 411 g 294 t 38 others ORIGIN 1 ttctggctca gratnaacgc tggcggngtg crnaacacat gcaagtcgaa cgrgraagtt 61 sctgcttgca ggaacgagga gagtggcgct cgggngagta actcgtggat aatctacctt 121 caaatcggga ataaccattc gaaaggatgg ctaatgccgg ataaagtcag atrttctttg 181 grrtntttga tgaaagggag cctctgnata taagctnttg tttgaagatg agtccgcgca 241 ccattagttt gttggtgggg taacggccta ccaagactgt gatggtnagc tggtctgaga 301 ggatgatcag ccacactgga actgacacac ggtccagact cctacgggag gcagcagtga 361 ggaattttgc gcaatggggg aaaccctgac gcagcaacgc cgcgtgagtg aagaaggccc 421 ttgggtcgta aagctctgtc acagggaaga aattattatg ctttaataga gtgtrgtatt 481 gacggtaccc tgtgaagaag caccggctaa ctccgtgcca gcagccgcgg taatacgggg 541 ggtgcaagcg ttattcggaa ttattnggcg taaagggcac gcaggcggtc ctgtcagtca 601 gatgtgaaag ccggggctca accccggacg tgcatttgaa acagcaggnc ttgagtacgg 661 nngaggaaag ggnnnttcct ggtgtagagg tgaaattcgt agatatcagg aggaacaccg 721 gtggcgaagg cgccnntctr gaccgatact gacgctnagg tacnaaggcg tgggtagcga 781 acgggattag ataccccggt agtccacgca gtaaacgttg tatgctaggt gtagcrgggt 841 attgacccct gctgtgccgc agttaacaca ttaagcatac cgcctgggga gtacggtcgc 901 aagactnaaa ctnaaagaaa ttgacggggg cccgcacaag cggtggagca tgtggtttaa 961 ttcgacgnaa cgcgaagaac cttacccaga cttgacatct tgagaatcct ntagaaattt 1021 aggagtgcct 
tcgggagctc aaagacaggt gctccatggc tgtcgtcagc tcgtgtcgtg 1081 agatgtttgg ttaagtccag caacgagcgc aacccttgtc ttcagttgcc agcacataat 1141 ggtgggaact ctgaagagac tgccccggtt aacggggagg aaggtggggn tgacgtcaag 1201 tcctcatggc cctnatgtct ggggctacac acgtgctaca atggtatgta caacgggcrg 1261 caaggcagcg atgccaagca aatcccttaa agcatatccc agtccggatt ggagtctgcg 1321 actcgactcc atgaagttgg aatcgctagt aatcgtggat cagcatgccn cggtgaatac 1381 gtt // LOCUS DSFRR166 1372 bp ss-rRNA BCT 24-JUL-1990 DEFINITION D.multivorans 16S ribosomal rRNA. ACCESSION M34405 KEYWORDS ribosomal 16S rRNA. SOURCE D.multivorans (ATCC 33890) rRNA. ORGANISM Desulfococcus multivorans Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Sulfate- or sulfur-reducing dissimilatory bacteria. REFERENCE 1 (bases 1 to 1372) AUTHORS Devereux,R., Delaney,M., Widdel,F. and Stahl,D.A. TITLE Natural relationships among sulfate-reducing eubacteria JOURNAL J. Bacteriol. 171, 6689-6695 (1989) STANDARD full staff_review COMMENT Draft entry and computer-readable [or printed] sequence for [1] kindly submitted by D.A.Stahl, 22-MAY-1990. 
FEATURES from to/span description rRNA < 1 1372 16S rRNA BASE COUNT 329 a 295 c 429 g 291 t 28 others ORIGIN 1 taattgacga gtttgatcct ggctcagaat gaacgctggc ggcgtgctta acacatgcaa 61 gtcgcacgag aacgcttctg cttgcagggg ctagtaagtg gcgcacgggt gagtaacgcg 121 tgggtaatct acccttggat tcgggataac ccttcgaaag gagtgctaat accggatgaa 181 gtcatttagg ctgtggttta gatgatgaaa ggtggcctct ccgaggaagc taccgtttgg 241 ggatgagccc gcgtaccatt agcttgttgg tggggtaacg gcctaccaag gcgacgatgg 301 ttagctngtc tgagaggatg atcagccaca ctggaactga cacacggtcn ngnctcctac 361 gggaggcagc agtgaggaat tttgcnnnnn gggggnaacc ntnacgcagc aacgccgcgt 421 gagtgaagaa ggccttcggg tcgtaaagct ctgtcagtgg gaagaacctt cgtatgttga 481 atatgcatac ggactgacgg taccacagaa ggaagcaccg gctaactccg tgccagcagc 541 cgcggtaata cggagggtgc aagcgttatt cggaattatt gggcgtaaag agcgcgtagg 601 cggctttgca agtcaggtgt gaaatccggg gctnaacccc ggaagagcat ttgatactgt 661 ggagcttgag tatgggagag gnnagtggna ttcctggtgt agcggtgaaa tgcgtagata 721 tcaggaggaa caccggtggc gaaggcggyt tcctggacca atactgacgc tgaggcgcga 781 aggcgtgggg agcaaacagg attagatacc ctngtagtcc acgcagtaaa cgttgatcac 841 taggtgtagc gggtattgac ccctgctgtg ccggagttaa cgcattaagt gatccgcctg 901 gggagtacga tcgcaagatt aaaactcaaa ggaattgacg ggggcccgca caagcggtgg 961 agcatgtggt ttaattcgan nnaacgcgaa gaaccttacc tggatttgac atccgcggrr 1021 ttcttatgaa aatagggagt gccttcgggg agccgcgaga caggtgctgc atggctgtcg 1081 tcagctcgtg tcgtgagatg ttgggttaag tcccgcaacg agcgcaaccc ctrtcttcag 1141 ttaccatcat taagttgggg actctgaaga tactgccccg gttaacggga ggaaggtggg 1201 gatgacgtca agtcctcatg gcctttatgt ccagggctac acacgtgcta caatgggctg 1261 tacnnagggn agcgatctcg cgagagcaag ccaatcccaa aaagcagtcc ttagttcggr 1321 ttggagtctg caactcgact ccatgaagct ggaatcgcta gtaatcgcgg at // LOCUS DSLRR165 1381 bp ss-rRNA BCT 24-JUL-1990 DEFINITION D.pigra 16S ribosomal rRNA. ACCESSION M34404 KEYWORDS ribosomal 16S rRNA. SOURCE D.pigra (ATCC 29098) rRNA. 
ORGANISM Desulfomonas pigra Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Sulfate- or sulfur-reducing dissimilatory bacteria. REFERENCE 1 (bases 1 to 1381) AUTHORS Devereux,R., Delaney,M., Widdel,F. and Stahl,D.A. TITLE Natural relationships among sulfate-reducing eubacteria JOURNAL J. Bacteriol. 171, 6689-6695 (1989) STANDARD full staff_review COMMENT Draft entry and computer-readable [or printed] sequence for [1] kindly submitted by D.A.Stahl, 22-MAY-1990. FEATURES from to/span description rRNA < 1 1381 16S rRNA BASE COUNT 322 a 294 c 388 g 250 t 127 others ORIGIN 1 gaactggaga gtttgattct ggctcagatt gaacgctggc ggcgtgctta acacatgcaa 61 gtcgtacgcg aaangggact tcggtcccga gtaaagtggc gcgacgggtg agtaacgacg 121 tggataatct gcctctatga tgggnataac rgttggaaac gactgctaat accgaatacg 181 ctcatgatga ncttgtggaa aggtggcctn tgcttgcaag ctatcgcata gagatgagtc 241 cgcgtcccat tagctagttg gtggggtaac ggcctaccaa ggcaacgatg ggtagccgat 301 ctgagaggat gatcggccac actggaactg aaacacggtc cagactccta cgggaggcag 361 cagtggggaa tattgcgcaa tgggcgaaag cctgacgcag cgacgccgcg tgagggatga 421 aggtcttcgg atcgtaaacc tctgtcagaa gggaagaaac tagggtgctc taatcatcat 481 cctaactgac ggtaccttca aaggaagcac cggctaactc cgtgccagca gccgcggtaa 541 tacggagggt gcaagcgtta atcggaatca ctgggcgtaa agcgcacgta ggctnttatg 601 taagtcaggg gtgaaagccc acggctnaac cgtggaactg cccttgatac tgcacggctn 661 gaatnnnnnn nnnnggtncg gnattccagg tgtaggagtg aaatccgtag atatctggag 721 gaacatcagt ggcgaaggcg gcnacctnga cctngaccgn nattgacgct naggtgcnna 781 agcgtgggga gcaaacagga ttagataccc tngtngtcca cgccgtaaac gatggatgct 841 agatgtcggg atgtannnct cggtgtcgta gtnaacgcgt naagcatccg gcctggggag 901 tacggnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 961 nnnnnnnnnn nnnnnnnaac gcgaagaacc ttacctaggt ttgacatctg gggaaccctc 1021 ccgaarrtga ggrrtrrcct tcggggagcc ccaagacagg tgctgcatgg ctgtcgtcag 1081 ctnnnnnnnt gagatgttgg gttaagtccc gcaacgagcg caacccctat gcatagttgc 1141 cagcaagtaa agttgggcac tctatgcaga ctgnncgggt tracagggga ggaaggtggg 1201 gacgacgtca 
agtcatnatg gcccttacgc ctagggctac anacgtacta cnatggcacg 1261 cacaaagggc agcgataccg tgagatggag ccaatcccaa aaaacgtgtc ccagtccgga 1321 ttgcagtctg caactcgact gcatgaagtc ggaatcgcta gtaattcgag gtcagcatnc 1381 c // LOCUS DSORR1618 1402 bp ss-rRNA BCT 24-JUL-1990 DEFINITION Desulfotomaculum orientis 16S ribosomal rRNA. ACCESSION M34417 KEYWORDS ribosomal 16S rRNA. SOURCE D.orientis (strain Singapore I, NCIMB 8382) rRNA. ORGANISM Desulfotomaculum orientis Prokaryota; Bacteria; Firmicutes; Endospore-forming rods and cocci; Bacillaceae. REFERENCE 1 (bases 1 to 1402) AUTHORS Devereux,R., Delaney,M., Widdel,F. and Stahl,D.A. TITLE Natural relationships among sulfate-reducing eubacteria JOURNAL J. Bacteriol. 171, 6689-6695 (1989) STANDARD full staff_review COMMENT Draft entry and computer-readable [or printed] sequence for [1] kindly submitted by D.A.Stahl, 22-MAY-1990. FEATURES from to/span description rRNA < 1 1402 16S rRNA BASE COUNT 301 a 253 c 386 g 255 t 207 others ORIGIN 1 ttttttggag agtttgrtcc tggctcagga ngaacgctgg cggcgtgcct aacacatgca 61 agtcgaacgg agaattgaat aagcttgctt agacaattst tagtggcgga cgggtgagta 121 acgcgtgggt aacctgccca taaakccggg acaacccttg gaaacgaggg ctaataccgg 181 atatcttaga tyytggcatc aggayttaag gaaagatggc ctctgaacat gctatcgatt 241 atggatggac ccgcgtctga ttagctagtt ggtggggtaa aggcctacca aggcgacgat 301 cagtagccgg cctgagaggg tgaacggcca cactgggact gagacacggc cnngactcct 361 acgggaggca gcagtgggga atcttccgca atggacgaaa gtctgacgga gcaacgcngc 421 gtgtatgatg aaggtcttcg gattgtaaag tactgtcagt gtggggacgg tgctntttga 481 aaatattgag gagacatgac ggtacccaag gaggaagccc cggctaannn nnnnnnnnnn 541 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nncggaatta ctgggcgtta agcgcgcgtt 601 ggcggatgct taagtcckgt gtgaaagatc agggctcaac cctnagagtg cattcggtaa 661 tgggtattgn nnnnnnnnga gaggaaagtg gaattccacg tgtagcggtg aaatgcgtag 721 atatgtggag gaacaccagc ggcgaaggcg actttctgga ctgtaactga cgctgaggcg 781 cgaaagcgtg gggagcaaac aggattagat accctggtag tccacgccgt aaacgatgag 841 tgctaggtgt agagggtatc nncacnttct 
gtgccgcagt taacgcgata agcactccgc 901 ctggggagta cggccgcaag gttnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 961 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 1021 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnngctg tgagacaggt ggtgcatggt 1081 tgtcgtcagc ttgtgtcgtg agatgttggg ttaagtcccg caacgagcgc aacccctgta 1141 tttagttgct acgagtaagg ttgagcnctc tagatagact gccggtgaca aaccggagga 1201 aggtggggat gacgtcaaat catnatgccc nttatgtcct gggctacaca cgtgctacaa 1261 tggccggtac agacggaagc gaagccgcga ggtgaagcca atccgagaaa gccggtctcr 1321 gttcggattg caggctgcaa ctcgcctgca tgaagtcgga atcgctagta atcgcaggtc 1381 agcatactgc ggtgaatacg tt // LOCUS DSORR1619 1368 bp ss-rRNA BCT 24-JUL-1990 DEFINITION D.ruminis 16S ribosomal rRNA. ACCESSION M34418 KEYWORDS ribosomal 16S rRNA. SOURCE D.ruminis (strain DL NCIMB 8452) rRNA. ORGANISM Desulfotomaculum ruminis Prokaryota; Bacteria; Firmicutes; Endospore-forming rods and cocci; Bacillaceae. REFERENCE 1 (bases 1 to 1368) AUTHORS Devereux,R., Delaney,M., Widdel,F. and Stahl,D.A. TITLE Natural relationships among sulfate-reducing eubacteria JOURNAL J. Bacteriol. 171, 6689-6695 (1989) STANDARD full staff_review COMMENT Draft entry and computer-readable [or printed] sequence for [1] kindly submitted by D.A.Stahl, 22-MAY-1990. 
FEATURES from to/span description rRNA < 1 1368 16S rRNA BASE COUNT 341 a 284 c 420 g 267 t 56 others ORIGIN 1 ttatggagag tttrgtcctg gctcaggacg aacgctggcg gcgtgcttaa cacatgcaag 61 tcgaacggag tttaaaatga aacctngtga tttttraact tagtggcgga cgggtgagta 121 acgcgtggat aacctggcgg atagaccggg ataacagttg gaaacgactg ctaataccgg 181 atacgctcct aggaacgcat gttctaagga ggaaagattt atcgctaacc gatggatccg 241 cgtcccatta gctagttgga ggtgtaacgg acccctaagg cgacgatggg tagccggcct 301 gagagggcga acggccacac tggaactgag acacggcctn gactcctacg ggaggcagca 361 gtggggaata ttccgcaatg ggcgaaagcc tgagggagca acgcngcgtg agggaagaag 421 gctttcgggt tgtaaacctc tgtcttaagg gaagaacaaa atgacggtac cttgagagga 481 agccccggct annnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnag cgttgtccgg 541 aattactggg cgtaaagggc gcgtaggtgg tgcattaagt tagaggtgaa agtgccgggc 601 nnaaccccgt gaggcctctg atactggtgg acttgagtgc aggagagggg agcagaattc 661 ccagtgtagc ggtgaaatgc gtagatattg ggaggaatac cagtggcgaa ggcggcnntc 721 tngactgtaa ctgacactga ggcgcgaaac cgtggggagc gaacaggatt agataccctg 781 gtagtccacg ccgtaaacga tgagtgctag gtgttggggg tatcgaccac ccccagtgcc 841 gtagttaaca caataagcac tccgcctggg gagtacggtc gcaagactga aactcaaagg 901 aattgacggg ggcccgcaca agcggtggag tatgtggttt aattcgacgn aacgcgaaga 961 accttaccag ggcttgacat ccgtctnaaa agtataggga tatactcttg ccttcggggn 1021 nacagagaga caggtggtgc atngttgtcg tcagctcgtg tcgtgagatg ttgggttaag 1081 tcccgcaacg agcgcaaccc ctracattcg ttgccagcga gtaatgtcgg gaactcgaat 1141 gtgactgccg ttgacaaaac ggaggaaggt ggggatgacg tcaaatcatc atgcccctta 1201 tgtcctgggc tacrcgcgta ctacnatggc cggtacagac ggaagcgaag ctgtgaagtg 1261 aagcaaatct gagaaagccg gtcccagttc ggattgtagt ctgcaactcg actacatggg 1321 gtcggaatcg ctagtaatcg caggtcagca tactgcggtn aatacgtt // LOCUS DSPRR1613 1390 bp ss-rRNA BCT 24-JUL-1990 DEFINITION D.hydrogenophilus 16S ribosomal rRNA. ACCESSION M34412 KEYWORDS ribosomal 16S rRNA. SOURCE D.hydrogenophilus (DSM 3380) rRNA. 
ORGANISM Desulfobacter hydrogenophilus Prokaryota; Bacteria; Mendosicutes; Archaeobacteria; Sulfolobales. REFERENCE 1 (bases 1 to 1390) AUTHORS Devereux,R., Delaney,M., Widdel,F. and Stahl,D.A. TITLE Natural relationships among sulfate-reducing eubacteria JOURNAL J. Bacteriol. 171, 6689-6695 (1989) STANDARD full staff_review COMMENT Draft entry and computer-readable [or printed] sequence for [1] kindly submitted by D.A.Stahl, 22-MAY-1990. FEATURES from to/span description rRNA < 1 1390 16S rRNA BASE COUNT 354 a 297 c 416 g 292 t 31 others ORIGIN 1 actgagagtt tggacctggc tcagaatgaa cgctggcggc gtgcttaaca catgcaagtc 61 gaacgagaaa argattgctt gcaatcctga gtagagtggc gcrcgggtga gtaacrcgta 121 gataatctgc cttcaagcct ggaataacta ttcgaaaggg tagctaatac cggataaagt 181 cgatttacac nagtrgattg atgaaagatt gcctcttctt gaaagcaatt gtttggggat 241 gagtctgcgg accattagct tgttggtggg gtaaaggcct accaaggcag cgatggttag 301 ctggtctgag aggatgatca gccacactgg aactggaaca cggtccagac tcctacggga 361 ggcagcagtg aggaattttg cgcaatgggg gcaaccctga cgcagcaacg ccgcgtgagt 421 gaagaaggcc tttgggtcgt aaagctctgt caactgggaa gaaattagct tctattaata 481 gtagttgcta ttgacggtac cagtngacga agcgccggcn aactcngtgc cagcagcngc 541 ggtaacacgg ggggcgcnag cgtnattcgg aattattggg cgtaaagggc gcgcaggcgg 601 tnttgtccgt caggtgtgaa agctcggggc tcaaccccgg aagtgcactt gaaacagcaa 661 gacttgaata cgggagagga gagaggaatt cctggtgtag aggtgaaatt cgtagatatc 721 aggaggaaca ccgatggcga aggcatctct ctrgaccgat attgacgctn aggcgcnaag 781 gcgtgggtag cgaacgggat tagatacccc ggtagtccac gcagtaaacg ttgtacactc 841 ggtgtngcgg atattnaaat ctgctgtgcc taagctaacg cattaagtgt accgcctggn 901 ragtacggtc gcaagactaa aactcaaagg aattgacggg ggcccgcaca agcggtggag 961 catgtggttt aattcgacgn aacgcgaaga accttacctg ggtttgacat cctgtgaata 1021 ttgtgtaatt gcaatagtgc cttcgggagc acagagacag gtgctgcatg gctgtcgtca 1081 gctcgtntcg tgagatgttt ggttaagtcc agcaacgagc gcaaccctta tcgtcagttg 1141 ccagcacntr atggtgggaa ctctggcgag actgccccrg nnnacgggra ggaaggtggg 1201 gatgacgtca agtcctcatg gcccttatac 
ccagggctac acacgtgcta caatggtagg 1261 tacaaagggc agcgactctg crragggaag cgaatcccaa aagcctatct cagtccggat 1321 tggggtctgc aactcgaccc catgaagttg gaatcgctag taatcgcgga tcagcatgcc 1381 gcggtnaata // LOCUS DSPRR1614 1396 bp ss-rRNA BCT 24-JUL-1990 DEFINITION D.curvatus 16S ribosomal rRNA. ACCESSION M34413 KEYWORDS ribosomal 16S rRNA. SOURCE D.curvatus (DSM 3379) rRNA. ORGANISM Desulfobacter curvatus Prokaryota; Bacteria; Mendosicutes; Archaeobacteria; Sulfolobales. REFERENCE 1 (bases 1 to 1396) AUTHORS Devereux,R., Delaney,M., Widdel,F. and Stahl,D.A. TITLE Natural relationships among sulfate-reducing eubacteria JOURNAL J. Bacteriol. 171, 6689-6695 (1989) STANDARD full staff_review COMMENT Draft entry and computer-readable [or printed] sequence for [1] kindly submitted by D.A.Stahl, 22-MAY-1990. FEATURES from to/span description rRNA < 1 1396 16S rRNA BASE COUNT 339 a 292 c 432 g 285 t 48 others ORIGIN 1 actggagagt ttgatcctgg ctcagaatga acgctggcgg cgtgctnaac acatgcaagt 61 cgaacgagaa agggattgct tgcaatcccg agtagagtgg cgcacgggtg agnnacacgt 121 rgatratctg ccttcaagcc tgggataact rttcgaaagg gtagctaata ccggataaag 181 tcggnncacn nnagtggatt gatgaaagat tgcctcttct tgaaagcaat tgtttgggga 241 tgagtttgcg taccattagc ttgttggtgg ggtaacggcc taccaaggca acgatggtna 301 gctggtctga gaggatgaca ggccacactg gaactgagac acggtcgact cgtgaacggg 361 aggcagcagt gaggaatttt gcgcaatggg ggaaaccctg acgcagcgac gccgcgtgag 421 tgaagaaggc ctttgggtcg taaagctctg tcgactggga agaagttcca atgttttaat 481 agaaggtggt attgagggta ccaggggagg aacgaccggc nnactccgtg ccagcagcag 541 cggtaacacn nngggggcna gcgttattcg gaanattggg cntaaagggc gcgcaggcgg 601 tcttgtccgt caggtgtgaa agctcggggc tcaaccccgg aagtgcactt gaaacagcaa 661 gacttgaata ccgtagagga gagaggaatt cctggtgtag aggtgaaatt cgtagatatc 721 aggaggaaca ccgatggcga aggcatctct ctggaccgat attgacgctn aggcgnnaag 781 gcgtgggtag cgaacgggat tagatacccc ggtagtccac gcagtnaacg ttgtacnctc 841 ggtgtngcgg atatnnaaat ctgctgtgcc naagctaacg cattaagtgt accgcctggg 901 gagtacggtc gcaagactaa aactcnaagg 
aattgacggg ggcccgcaca gtcggtggag 961 catgtggttt aattcgacgn aacgcgaaga accttacctg ggtttgacat cctgtgaata 1021 yygrgtaatn gnrrtngtgc cttcgggagc acagagacag gtgctgcatg gctgtcgtca 1081 gctcgtgtcg tgagatgttt ggttaagtcc agcaacgagc gcaaccctta tcgttagttg 1141 ccagcacgtt atgtgggaac tctaacgaga ctgcccgggt nnaccgggag gaaggcgggg 1201 atgacgtcaa gtcctcatgg cccttatatc cagggctacg cgcgtgctac natggtaggt 1261 acaaagggca gcgactttgc rraaagaagc gaatgcccaa aagcctatcg cngtccggat 1321 tggggtctgc nactcgaccc catgaagttg gaatcgctag taatcgcgga tcngcatgcc 1381 gcggtnaata tgttcc // LOCUS DSPRR1615 1373 bp ss-rRNA BCT 24-JUL-1990 DEFINITION Desulfobacter latus 16S ribosomal rRNA. ACCESSION M34414 KEYWORDS ribosomal 16S rRNA. SOURCE D.latus (DSM 3381) rRNA. ORGANISM Desulfobacter latus Prokaryota; Bacteria; Mendosicutes; Archaeobacteria; Sulfolobales. REFERENCE 1 (bases 1 to 1373) AUTHORS Devereux,R., Delaney,M., Widdel,F. and Stahl,D.A. TITLE Natural relationships among sulfate-reducing eubacteria JOURNAL J. Bacteriol. 171, 6689-6695 (1989) STANDARD full staff_review COMMENT Draft entry and computer-readable [or printed] sequence for [1] kindly submitted by D.A.Stahl, 22-MAY-1990. 
FEATURES from to/span description rRNA < 1 1373 16S rRNA BASE COUNT 322 a 268 c 393 g 267 t 123 others ORIGIN 1 ctggagagtt tgatcctggc tcagaatgaa cgctggcggc gtgcttaaca catgcaagtc 61 gaacgagaaa gggattgctt gcaatcctga gtagagtggc gcacgggtga gtaacacgta 121 gataatctgc cttcaagcct gggataacta ttcgaaaggg tagctaatac cggataaagt 181 cgattyacay nagtnnattg atgaaagatt gcctcttctt gaaagcaatt gtttggggat 241 gagtttgcgt accattagct agttggtggg gtnaaggcct accaaggcnn cgatggnnag 301 ctggtctgag aggatgatca gccacactgg aactggaaca cggtccngnc tcctacggga 361 ggcagcagtg aggaattttg cgcratgggg gcaacccnna cgcagcaatg ccgcgtgagt 421 gaagaaggcc tttgggtcgt aaagctctgt caacaaggaa gaaattagga attattaata 481 gtngtttcta ttgacggtac ttgtngagga agcgccggct nactccgtgc cagcagccgc 541 ggtaacacgg ggggcgcnag cgttattcgg aattattggg cgtaaagggc gcgcaggcgg 601 tnttgtccgt naggtgtgaa agcccggggc tcaaccccgg aagagcactt gaaacagcaa 661 gacttgaata cgggagagga gagaggaatt cctggtgtag aggtgaaatt cgtagatatc 721 aggaggaaca ccgatggcga aggcatctct ctngaccgrt attgacgctn aggcgnnaag 781 gcgtggggag cgaacgggat tagatacccc ggtagtccac gcagtaaacg ttgtacgctc 841 ggtgtngcgg atattnaaat ctgctgtgcc naagctaacg cattaagtgt accgcctggg 901 nagtacggtc gcaagactaa aactcraagg aattgrcggg ggcccgcrca agcggtggag 961 catgtggttt aattcgatnn aacgcgaaga accttacctg ggtntgacat cctgtgaata 1021 tcccgtnatt gggatngtgc cttcgggagc acagagacag gtgctgcatg gctgtcgtcn 1081 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 1141 nnnnnnnnnn nnnntgggaa ctctggcgag actgccccgg tnnacgggga ggaaggtggg 1201 gatgacgtca agtcctsrtg gccctnatat ccagggctac acacgtgcta caatggtagg 1261 tacaaagggc agcgactttg crrraagaag cgaatcccaa aangcctntc ccagtccgaa 1321 ttggggtctg caactcgacc ccatgaagtt ggaatcgcta gtaatcgcgg atc // LOCUS DSPRR1616 1397 bp ss-rRNA BCT 24-JUL-1990 DEFINITION Desulfobacter sp. Vulgaris 16S ribosomal rRNA. ACCESSION M34415 KEYWORDS ribosomal 16S rRNA. SOURCE Desulfobacter sp. (strain 3ac10 DSM 2035) rRNA. ORGANISM Desulfobacter sp. 
Prokaryota; Bacteria; Mendosicutes; Archaeobacteria; Sulfolobales. REFERENCE 1 (bases 1 to 1397) AUTHORS Devereux,R., Delaney,M., Widdel,F. and Stahl,D.A. TITLE Natural relationships among sulfate-reducing eubacteria JOURNAL J. Bacteriol. 171, 6689-6695 (1989) STANDARD full staff_review COMMENT Draft entry and computer-readable [or printed] sequence for [1] kindly submitted by D.A.Stahl, 22-MAY-1990. FEATURES from to/span description rRNA < 1 1388 16S rRNA BASE COUNT 363 a 290 c 404 g 296 t 44 others ORIGIN 1 actggagagt ttgrtcctgg ctcagaatga acgctggcgg cgtgcttaac acatgcaagt 61 cgaacgagaa agggattgct tgcratcctg agtagagtgg cgcacgggtg agtaacgcgt 121 agataatctg ccttcaagcc tgggataact attcgaaagg gtagctaata ccggataaag 181 tcgattcaca taagtaaatt gatgaaagat tgcctcttct tgaaagcaat tgtttgggga 241 tgagtttgcg taccattagc ttgttggtgg ggtaaaggcc taccaaggca acgatggtna 301 gctggtctga gaggatgatc agccacactg gaactggaac acggtccaga ctcctacggg 361 aggcagcagt gaggaatttt gcrgcrratg ggggcaaccc tgacgcagca acgccgcgtg 421 agtgaagaag gcctttgggt cgtaaagctc tgtcaacaag gaagaaatta caattattta 481 atagataatt gtattgacgg tacttgtaga ggaagcgccg gcnaactcnn nnnnnnnnnn 541 nnnggtaaca cggggggcgc aagcgttaat tggaattatt gggcgtaaag ggcgcgcagg 601 cggtcttgtc cgtcaggtgt gaaagctcgg ggctcaaccc cggaagtgca cttgaaacag 661 caagacttga atacgggaga ggagagagga attcctggtg tagaggtgaa attcgtagat 721 atcaggagga acaccgatgg cgaaggcatc tctctggacc gatattgacg ctnaggcgct 781 aaggcgtggg tagcgaacgg gattagatac cccggtagtc cacgcagtaa acgttgtaca 841 ctcggtgtgg cggatattta aatctgctgt gcccaagcta acgcattaag tgtaccgcct 901 gggaagtacg gtcgcaagac taaaactcaa aggaattgac gggggcccgc acaagcggtg 961 gagcatgtgg tttaattcga cgnaacgcga agaaccttac ctgggtttga catcctgtga 1021 atatcccgta attgggatag tgccttcggg agcacagaga caggtgctgc atggctgtcg 1081 tcagctcgtg tngtgagatg tttggttaag tccagcaacg agcgcaaccc ttacgtcagt 1141 tgccagcaca taatggtggg aactctggcg aactgcccgc tanannnnnn nnnnnnnnnn 1201 nntgacgtca agtcctcatg gcccttatat ccagggctac acacgtgcta caatggtagg 1261 tacaaagggc 
accgactctg caaagggaag cgaaatccca aaagcctatc tcagtccgga 1321 ttggggtctg caactcgacc ccatgaagtt ggaatcgcta gtaatcgcgg atcagcatgc 1381 cgcggagaat atgttcc // LOCUS DSPRR1617 1472 bp ss-rRNA BCT 24-JUL-1990 DEFINITION Desulfobacter sp. 16S ribosomal rRNA. ACCESSION M34416 KEYWORDS ribosomal 16S rRNA. SOURCE Desulfobacter sp. (strain 4ac11 DSM 2057) rRNA. ORGANISM Desulfobacter sp. Prokaryota; Bacteria; Mendosicutes; Archaeobacteria; Sulfolobales. REFERENCE 1 (bases 1 to 1472) AUTHORS Devereux,R., Delaney,M., Widdel,F. and Stahl,D.A. TITLE Natural relationships among sulfate-reducing eubacteria JOURNAL J. Bacteriol. 171, 6689-6695 (1989) STANDARD full staff_review COMMENT Draft entry and computer-readable [or printed] sequence for [1] kindly submitted by D.A.Stahl, 22-MAY-1990. FEATURES from to/span description rRNA < 1 1472 16S rRNA BASE COUNT 368 a 312 c 442 g 313 t 37 others ORIGIN 1 aactggagag tttgrtcctg gctcagaatg aacgctggcg gcgtgcttaa cacatgcaag 61 tcgaacgaga aagggattgc ttgcaatcct gagtagagtg gcgcrcgggt gagtaacrcg 121 tagataatct gccttcaagc ctgggataac tatccgaaag gatagctaat accggataaa 181 gtcgatttac acaagtagac cgatgaaaga ttgcctcttc ttgaaagcaa ttgtttgggg 241 atgagtttgc gtaccattag ctaggttggt ggggtaaagg cctaccaagg cngngatggt 301 nagctggtct gagaggatga tcagtcacac tggaactgga acgcggtcca gnctcctacg 361 ggaggcagca gtgaggaatt ttgcgcaatg ggggcaaccc tgacgcagcg acgccgcgtg 421 agtgaagaag gccnntgggt cgtaaagctc tgtcwacwag gaagaaatta ggatctatta 481 atagtgggtt ctattgacgg tacttttgga ggaagcgccg gctnactckk tgccagcagc 541 ngcggtaaca cggggggcgc nagcgttatt cggaattatt gggcgtnaag ggngcgcagg 601 cggtcttgtc tgttaggtgt gaaagcgggg gctcaacccc ggaagtgcac ttgaaacagc 661 aagacttgaa tacgggagag gagagaggaa ttcctggtgt agaggtgaaa ttcgtagata 721 tcaggaggaa caccgatggc gaaggcatct ctctggaccg atattgacgc tnaggcgnna 781 aggcgtgggt agcgaacggg attagatacc ccggtagtcc acgcagtnaa cgttgtacac 841 tcggtgtagc ggatattnaa atctgctgtg ccnaagctaa cgcattaagt gtaccgcctn 901 gnnngtacgg tcgcaagact aaaactcaaa ggaattgacg ggggcccgca 
caagcggtgg 961 agcatgtggt ttaattcgac nnaacgcgaa gaaccttacc tgggtttgac atcctgtgaa 1021 tatcccgtaa ttgggrtrgt gccttcggga gcacagagac aggtgctgca tggctgtcgt 1081 cagctcgtgt cgtgagatgt ttggttaagt ccagcaacga gcgcaaccct tatcgtcagt 1141 tgccagcacg taatggtggg aactctggcg agactgcccc ggtcaacggg gaggaaggtg 1201 gggatgacgt caagtcctca tggcccttat atccagggct acacacgtgc tacaatggta 1261 ggtacaaagg gcagcgactt tgcaaaaagg agcgaatccc naaaagccta tctcagtccg 1321 aattggggtc tgcaactnga ccgnatgaag ttggaatcgc tagtaatcgc ggatcagcat 1381 gccgcggtna atatgttccc gggccttgta cacaccgccc gtcacaccat ggaagttgat 1441 tatacccgac gtcrctgggc taactattta ga // LOCUS DSURR1611 1371 bp ss-rRNA BCT 24-JUL-1990 DEFINITION D.propionicus 16S ribosomal rRNA. ACCESSION M34410 KEYWORDS ribosomal 16S rRNA. SOURCE D.propionicus (ATCC 33891) rRNA. ORGANISM Desulfobulbus propionicus Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Sulfate- or sulfur-reducing dissimilatory bacteria. REFERENCE 1 (bases 1 to 1371) AUTHORS Devereux,R., Delaney,M., Widdel,F. and Stahl,D.A. TITLE Natural relationships among sulfate-reducing eubacteria JOURNAL J. Bacteriol. 171, 6689-6695 (1989) STANDARD full staff_review COMMENT Draft entry and computer-readable [or printed] sequence for [1] kindly submitted by D.A.Stahl, 22-MAY-1990. 
FEATURES from to/span description rRNA < 1 1371 16S rRNA BASE COUNT 333 a 271 c 397 g 276 t 94 others ORIGIN 1 naactggaga gtttnatcct ggctcagaac gaacgctggc ggcgtgctta acacatgcaa 61 gtcgaacgcg aaagggactt cggtcctgag taaagtggcg cacgggtgag taacgcgtag 121 ataacctnnc ttcatgtctg gaataatacg ccgaaagggg trctaatacc ggatattntn 181 gctnttcata agttatgcaa gcraaaggtg gcctctgcat atgctactgc rtgaagaggg 241 gtctgcgtac cattagctag taggtggggt aatggcctac ctaggcgacg atggtnagcg 301 ggtctgagag gatgatccgc nacactggca ctggaacacg ggcnngnctc ctacgggagg 361 cagcagtgag gaatattgcg caatgggggn aacccnnacg cagcgacgcn gcgtgagtga 421 ggaaggcctt cgggtcgtaa agctctgtca agaggaaaga agtgtacaat ggctnatacc 481 tnttgtnttt gacggtacct ctnaaggaag caccggctna ctccgtncca gcagcnncgg 541 taatacggag ggtgcaagcg tnnttcggaa tcactnggng tnaagggcgc gtaggcggnt 601 tggtaagtca gatgtgaaag cccacggctn aactgtggaa gtgcatttga aactgtcagg 661 cntgagtacc agaggggaaa gtggaattcc cggtgtagag gtnaaattcg tagatatcgg 721 gaggaatacc ggtggcgaag gcgactttct ggctngatac tnacgctnag gcgcnaaagc 781 gtggggagca aacaggatta gataccctng tagtccacgc tgtaaacgat gtgaactaga 841 tgcagggggt gtnnatccnn nctgtgtngc agctaacgca ttnagttnnn nnnnnnnnna 901 gtacggnngc aagattaaaa ctcaaaggaa ttgacggggn ncngcacaag cggtggagta 961 tgtggtttaa ttcgatnnaa cgcgaagaac ctnacctngt nttgacatcc cggnaatctn 1021 ttggaaacaa gagagtgctt ccgnnagnat ctggagacag nnnctgcatn gctntcgtna 1081 nnnnntgttg tgagatgttg ggttaagtcc cnnaacgagc gcaacccttg cctttagttg 1141 ccagcagttc ggctggrcac tctaaaggga ctgccggtgt taaaccggag gaaggtgggg 1201 atgacgtcaa gtcctnatgg ccnntatgac cagggctaca cacgtcctac aatggccgat 1261 acaaagggca gcgacactgc gaggtggagc taatcccata aaatcggnct cagtccggat 1321 tggagtctgc aactcgactc catgaagttg gaatcgctag taatcgngga t // LOCUS DSURR1612 1365 bp ss-rRNA BCT 24-JUL-1990 DEFINITION Desulfobulbus sp. 16S ribosomal rRNA. ACCESSION M34411 KEYWORDS ribosomal 16S rRNA. SOURCE Desulfobulbus sp. (strain 3pr10, DSM 2058) rRNA. ORGANISM Desulfobulbus sp. 
Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Sulfate- or sulfur-reducing dissimilatory bacteria. REFERENCE 1 (bases 1 to 1365) AUTHORS Devereux,R., Delaney,M., Widdel,F. and Stahl,D.A. TITLE Natural relationships among sulfate-reducing eubacteria JOURNAL J. Bacteriol. 171, 6689-6695 (1989) STANDARD full staff_review COMMENT Draft entry and computer-readable [or printed] sequence for [1] kindly submitted by D.A.Stahl, 22-MAY-1990. FEATURES from to/span description rRNA < 1 1365 16S rRNA BASE COUNT 299 a 267 c 389 g 267 t 143 others ORIGIN 1 gaagagtttn attctggctc agaacgaacg ctggcggcgt gcntnncaca tgcaagtcga 61 accgnnnnnn nnnnnnnntc ctagtagtgg cgcacgggtg agtracgcgt agatracctg 121 ncctnatgtc tgggatrccc caccgaaagg tgggctaata ccggatgtnc tnntntttca 181 tnagcgtaat aaagaaaagg tggcctctga tattaagctn ctgcrtgagr aggggtctgc 241 gtaccattag ctagttggtg gggtnagngc ctaccaaggc atcgatggtn agcgggtctg 301 agngnatgat ccgtcacact ggcactgaaa cncgggccng rctccnacgg gaggcngcng 361 tgaggaatnt tgcgcnatgg gggcnaccct nacgcagcga cgccgcgtga gtgagnaagg 421 ctttcgggtc gtaaagctct gtnnaaggga agaaatgcat aatnattnat ncttnttatg 481 tttgacggta cctttaaagg aagcaccggc taactccgtg ccagnagccg cggtaatacg 541 gagggtgcaa gcgttgtccg gaattactgg gcgtnaaggg cacgcaggcg gcctgataag 601 tcagatgtga aagcccacgg ctnnaccgtg gaagtgcatt tgaaactatt aggnnngagt 661 atcagagggg aaagtggaat tcctggtgta gaggtgaaat tcgtagatat caggaggaat 721 rccagtggcg aaggcgactt tctggctgaa tactgacgct gaggtgcgaa agcgtgggga 781 gsaaacagga ttagataccc tggtagtcca cgccgtaaac gatgtcaact aggtgtaggg 841 ggggttnatc csstctgtgc cgcagctrac gcattragtt gacngcctgg ggagtacggt 901 cgcaagatta aaactcnaag naattgrcgg gggcccgcrc nagcggtgga gtatgtggtt 961 taattcgatg nnacgcgaag aaccttacct ggtcttgaca tcccaagaat cttctagaaa 1021 tagaagagtg cttttcggag aacttggtga caggtgctgc atggctgtcg tcagcnnnnn 1081 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn ttgccagtag 1141 ttcggctgag cactctaaag ggactgccgg tgtnnaaccg gaggaaggtg gggatgacgt 1201 caagtcctca tggcsnntat gaccagggct acacacgtac 
tacnatggcc ggtacnaagg 1261 gcagcgacag ggcgrcctna agcgaatccc gtraagccgg nctcngtccg gattggagtc 1321 tgcnactcga ctccatgaag tcggaatcgc tagtaatcgt ggatc // LOCUS DSVRR168 1527 bp ss-rRNA BCT 24-JUL-1990 DEFINITION D.variabilis 16S ribosomal rRNA. ACCESSION M34407 KEYWORDS ribosomal 16S rRNA. SOURCE D.variabilis (strain DSM 2060) rRNA. ORGANISM Desulfosarcina variabilis Prokaryota; Bacteria; Mendosicutes; Archaeobacteria; Sulfolobales. REFERENCE 1 (bases 1 to 1527) AUTHORS Devereux,R., Delaney,M., Widdel,F. and Stahl,D.A. TITLE Natural relationships among sulfate-reducing eubacteria JOURNAL J. Bacteriol. 171, 6689-6695 (1989) STANDARD full staff_review COMMENT Draft entry and computer-readable [or printed] sequence for [1] kindly submitted by D.A.Stahl, 22-MAY-1990. FEATURES from to/span description rRNA < 1 1527 16S rRNA BASE COUNT 380 a 338 c 458 g 328 t 23 others ORIGIN 1 ttaattggag agtttgatcc tggctcagaa tgaacgctgg cggcgtgctt aacacatgca 61 agtcgtacga gaacgctcta gcttgctaga gcaagtaaag tggcgcacgg gtgagtaacg 121 cgtgggtaac ctaccttcga attggggata acgttgcgaa agcgacgcta ataccgaata 181 atatccaaat atcttcggat ttttggatca agggtggcct cttcttgaaa gctactgttt 241 gaagatgggc ccgcgtacca ttagcttgtt ggtggggtaa tggcctacca aggcaacgat 301 ggttagctgg tctgagagga tgatcagcca cactggaact gacacacggt ccagactcct 361 acgggaggca gcagtgagga attttgcgca atgggcgaaa gcctgacgca gcaacgccgc 421 gtgagtgatg aaggctttcg ggtcgtaaag ctctgtcaag tgggaagaac ctatataatg 481 ttaatatcat tatatactga cggtaccact gaaggaagca ccggctaact ccgtgccagc 541 agccgcggta atacgggggg tgcaagcgtt attcggaatc attgggcgta aagagcacgt 601 aggcggcctg gtaagtcaga tgtgaaagcc cggggcttaa ccccggaagt gcatttgata 661 ctgtcaggct tgagtatggg agagggaagt ggaattcctg gtgtagaggt gaaattcgta 721 gatatcagga ggaacaccgg tggcgaaggc gacttcctgg accaatactg acgctgaggt 781 gcgaaggcgt gggtagcaaa caggattaga taccctggta gtccacgcag taaacgttga 841 tcactaggtg tagcrgggta ttgacccctg ctgtgccgca gttaacgcat taagtgatcc 901 gcctggggag tacggtcgca agattaaaac tcaaaggaat tgacgggggc ccgcacaagc 
961 ggtggagcat gtggtttaat tcgannnaac gcgaagaacc ttacctggat ttgacatccc 1021 gggaatccta tcgaaagact ggagtgccct tcggggagcc cggwgacagg tgctgcatgg 1081 ctgtcgtcag ctcgtgtcgt gagatgttgg gttaagtccc gcaacgagcg caacccttgt 1141 ctttagttac catcatttag ttggggactc taaagatact gccccggtca acggggagga 1201 aggtggggat gacgtcaagt cctcatggcc tttatgtcca gggctacaca cgtgctacaa 1261 tgggcggtac aaagggtagc gaacccgcga gggtaagcca atcccaaaaa gccgtcccaa 1321 gttcggattg gagtctgcaa ctcgactcca tgaagctgga atcgctagta atcgcggatc 1381 agcatgccgc ggtgaatacg ttcccgggcn ttgtacacac cgcccgtcac accatgagag 1441 ttggttgtac ccgaagtcgt cgggctaacc cttcggggat gcangcgcct aaggtatggc 1501 tgannnnnnn nnnnnnnnng taacaag // LOCUS DVURR16 1388 bp ss-rRNA BCT 24-JUL-1990 DEFINITION D.vulgaris 16S ribosomal rRNA. ACCESSION M34399 KEYWORDS ribosomal 16S rRNA. SOURCE D.vulgaris (strain Hildenborough DSM 644) rRNA. ORGANISM Desulfovibrio vulgaris Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Sulfate- or sulfur-reducing dissimilatory bacteria. REFERENCE 1 (bases 1 to 1388) AUTHORS Devereux,R., Delaney,M., Widdel,F. and Stahl,D.A. TITLE Natural relationships among sulfate-reducing eubacteria JOURNAL J. Bacteriol. 171, 6689-6695 (1989) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by D.A.Stahl, 22-MAY-1990. 
FEATURES from to/span description rRNA < 1 1388 16S rRNA BASE COUNT 306 a 315 c 418 g 248 t 101 others ORIGIN 1 taactggaga gtttgattct ggctcagatt gaacgctggc ggcgtgctta acacatgcaa 61 gtcgngcgtg aaaggacttc ggyccgagta aagcggcgcr cgggtgagta acgcgtggat 121 gatctaccca tgagttggga ataacggctg gaaacggtcg ctaataccga atacgctccg 181 atttcracnt cgggggaaag gtggcctctg cttgcaagct actgctcatg gatgagtccg 241 cgtcccatta gctagttggt ggggtaacgg cccacnaagg cgacratggg nagccggtct 301 gagaggatga ccggccacac tgggactgga acacggccna gactcctacg ggaggcagca 361 gtggggaata ttgcgcaatg ggcgaaagcc tnacgcagcg acgccgcgtr agggatgaag 421 gtcctcggat cgtnaacctc tgtcnggagg gaagaaccgc cacggtgctn atcagccgtn 481 gtctgacggt acctccagag gaagcaccgg ctaactccgt gccagcagcc gcggtaatac 541 ggagggtgca agcgttaatc ggaatcactg ggcgtaaagc gcacgtaggc tgcttggtaa 601 gtcaggggtg aaagccggnn nctcaaccgc ggaattgcct ttgatactgc caagcnagag 661 tccgggagag ggtagtggaa ttccaggtgt aggagtgaaa tcmgtagaga tctggaggaa 721 catcagtggc gaaggcgact acctggaccg gnactgacgc tnaggtgcnn aagcgtgggg 781 agcaaacagg attagatacc ctngtagtcc acgccgtaaa cgatggatgc taggtrtcgg 841 ggccttgarr cttcggtgcc gtagttaacg cgttaagcaa tccgcctngg gagtacggtc 901 gcnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 961 nnnnnnnnnn aacgcgaaga accttaccta ggtttgacat ccggaagacc ttcccgaaaa 1021 ggaaggntgc cttcggggaa ttccgagaca ggtgctgcat ggctgtcgtc agctcgtgcc 1081 gtgaggtgtt gggttaagtc ccgcaacgag cgcaacccct attgccagtt gctaccaggt 1141 aatgctgggc tctctggtga gactgccccg gttaacgggg aggaaggtgg ggacgacgtc 1201 aagtcatgat ggcccttacg cctagggcta cacacgtact acaatggcgc atacaaaggg 1261 cagcgatacc rcraggtgga gccaatccca aaaagtgcgt cccagtccgg attgcagtct 1321 gcaactcgac tgcatgaagt tggaatcgct agtaattcga gatcagcatg cncgggtgaa 1381 tgcgttcc // LOCUS DVURR161 1370 bp ss-rRNA BCT 24-JUL-1990 DEFINITION D.gigas 16S ribosomal rRNA. ACCESSION M34400 KEYWORDS ribosomal 16S rRNA. SOURCE D.gigas (ATCC 19364) rRNA. 
ORGANISM Desulfovibrio gigas Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Sulfate- or sulfur-reducing dissimilatory bacteria. REFERENCE 1 (bases 1 to 1370) AUTHORS Devereux,R., Delaney,M., Widdel,F. and Stahl,D.A. TITLE Natural relationships among sulfate-reducing eubacteria JOURNAL J. Bacteriol. 171, 6689-6695 (1989) STANDARD full staff_review COMMENT Draft entry and computer-readable [or printed] sequence for [1] kindly submitted by D.A.Stahl, 22-MAY-1990. FEATURES from to/span description rRNA < 1 1370 16S rRNA BASE COUNT 310 a 303 c 393 g 243 t 121 others ORIGIN 1 taactggaga gtttgattct ggctcagatt gaacgctrgc ggcgtgctta acacatgcaa 61 gtcgagcgag anngggtctt tcgggacctg agtacagcgg cgcacgggtg agtaacgcgt 121 ggatgatcta cctcattgrc cgggataacg gtgggaaacc gctgctaata ctggatacgc 181 ttcatggaaa catgagggaa aggtggcctc tgatttaagc trtcacaatg agatgagtcc 241 gcgtctcatt agcttgttgg tagggtaatg gcctaccaag gctacgatga gtagcggccc 301 tgagagggtg gccrgccaca ctggaactga aacacggtcc agactcctac gggaggcagc 361 artggggaat attgcgcaat gggcgaaagc ctnacgcagc gacgccgcgt gagggaagaa 421 ggccttcggg tcgtaaacct ctgtctggag gaaagaaacc cataggtgtg aatagggctt 481 atggctgacg gtacctccga aggaagcacc ggctaactcc gtgccagcag ccgcggtaat 541 acggagggtg caagcgttaa tcggaatcac tgggcgtaaa gcgcacgtag gctgtcttgc 601 aagtcaggtg tgaaagccct cggctcaacc ggggaactgc acttnaaact gcatgacttg 661 agtacaggag aggatggcag aattccgggt gtaggggtna aatccgtaga tatccggagg 721 aataccagtg gcgaaggcgg ccatctggac tgtcactgac gctgaggtgc rnaagcgtgg 781 ggagcaaaca ggattagata ccctngtagt ccacgctgta aacgatggac gctnggtrtc 841 grggggtcgc ccctcggcgc cgaactaacg cgttnannnn nnnnnnnnnn nnnnnnnnnn 901 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 961 nnnnnnnnnn aacgcgaaga accttaccta ggcttgacat cccgggaacc ctcccgaaaa 1021 ggaggnntgc tcttcggaga gcccggagac aggtgctgca tggctrtcgt cagctcgtgc 1081 cgtgaggtgt tgggttaagt cccgcaacga gcgcaacccc tgtccatagt tgccnnnngg 1141 taaagctngn cactctatgg agactgcctt ggcaacaagg aggaaggtgg ggatgacgtc 1201 aagtcatcat 
ggcccttacg cctagggcta cacacgtact acaatggcga gtacaaaggg 1261 ttgcgaaacc gcgaggttga gccaatccca aaaagctmgt cccagtccgg attgcagtct 1321 gcaactcgac tgcatgaagt cggaatcgct agtaatcccg gatcagcatg // LOCUS DVURR162 1400 bp ss-rRNA BCT 24-JUL-1990 DEFINITION D.salexigens 16S ribosomal rRNA. ACCESSION M34401 KEYWORDS ribosomal 16S rRNA. SOURCE D.salexigens (ATCC 14822) rRNA. ORGANISM Desulfovibrio salexigens Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Sulfate- or sulfur-reducing dissimilatory bacteria. REFERENCE 1 (bases 1 to 1400) AUTHORS Devereux,R., Delaney,M., Widdel,F. and Stahl,D.A. TITLE Natural relationships among sulfate-reducing eubacteria JOURNAL J. Bacteriol. 171, 6689-6695 (1989) STANDARD full staff_review COMMENT Draft entry and computer-readable [or printed] sequence for [1] kindly submitted by D.A.Stahl, 22-MAY-1990. FEATURES from to/span description rRNA < 1 1400 16S rRNA BASE COUNT 320 a 306 c 393 g 270 t 111 others ORIGIN 1 actggagagt ttgattctgg ctcagattga acgctggtgg cgtgcttaac acatgcaagt 61 cgtgcgagaa cagttccttc gggaacctag tagagcggcg cacgggtgag taacgcgtgg 121 ataatctacc cagaagactg ggataacagt tggaaacgac tgctaatacc ggatacgttt 181 catatttaac tnnatnagag aaaggtggcc tctntttcaa gctatcactt ttggatgagt 241 ccgcgtttca ttagctagtt ggtaaggtaa cggcttacca aggcgacgat gaatagctgg 301 tctgagagga tgaccagcca cactgggact ggaacacggc cnagactcct acgggaggca 361 gcagtgggnn atattgcgca atgggggaaa ccctgacgca gcgacgccat gtgagggacg 421 aaggctttcg ggtcgtaaac ctctgtcrrg rrggaagaaa ctgtttgarg ctaataccct 481 ntttcacttg acggtaccct cagaggaagc accggctaac tccgtgccag cagccgcggt 541 aatacggagg gtgcgagcgt taatcggaat cactgggcgt aaagcgcgcg taggtggcgc 601 gataagtcag gcgtgaaagc cctcggctca accggggaat tgcrcttgat actgtcgtgc 661 ttgagtctcg gagagggtgg cggaattcca ggtgtaggag tgaaatccgt agatatctgg 721 aggaacacca gtggcgaagg cggccacctg gacgagtact gacgctgagg tgcgaaagcg 781 tggggagcaa acaggattag ataccctngt agtccacgcc gtaaacgatg gatgctagat 841 gtcgggcctt aaccaggttc ggtgtcgaag ttaacgcgat aagcatccnn nnnnnnnnnn 901 
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 961 nnnnnnnnnn nnnnnnaacg cgaagnacct tacctggact tgacatcctg agaatcctct 1021 agaaatagag gagtgccttc gggnnattca gtgacaggtn ctgcatggct ntcgtcagct 1081 cgtgccgtga ggtgttgggt taagtcccgc aacgagcgca acccctattg ctagttgcca 1141 tcacataatg gtgggcactc tagtgagact gcccgggtca accgggagga aggtgggnac 1201 gacgtcaagt catcatggcc cttacgtcca gggctacaca cgtactacaa tggtggatac 1261 aaagggttgc caagccgcga ggccgagcca atcccaaaaa gtccrtccca gtccggatcg 1321 cagtctgcaa ctcgactgtg tgaagttgga atcgctagta atcccggatc agcatgccgg 1381 ggtnaatacg ttcccgggcc // LOCUS DVURR163 1395 bp ss-rRNA BCT 24-JUL-1990 DEFINITION D.sapovorans 16S ribosomal rRNA. ACCESSION M34402 KEYWORDS ribosomal 16S rRNA. SOURCE D.sapovorans (strain 1pa3 ATCC 33892) rRNA. ORGANISM Desulfovibrio sapovorans Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Sulfate- or sulfur-reducing dissimilatory bacteria. REFERENCE 1 (bases 1 to 1395) AUTHORS Devereux,R., Delaney,M., Widdel,F. and Stahl,D.A. TITLE Natural relationships among sulfate-reducing eubacteria JOURNAL J. Bacteriol. 171, 6689-6695 (1989) STANDARD full staff_review COMMENT Draft entry and computer-readable [or printed] sequence for [1] kindly submitted by D.A.Stahl, 22-MAY-1990. 
FEATURES from to/span description rRNA < 1 1395 16S rRNA BASE COUNT 334 a 291 c 427 g 286 t 57 others ORIGIN 1 gaacggagag tttgatcctg gctcagaatg aacgctggcg gcgtgctnaa cacatgcaag 61 tcgaacgaga aagttccttc gggggataag taaagtggcg cacgggtgag taatgcgtgg 121 ataatctacc rttgaattcg gaataacatc gcgaaagcgg tgctaatacc ggatratgtt 181 gtrttttctt cggggaatgc aaccaaagaa tgcctcttca tgaaagcatt tgtttgagga 241 tgagtccgcg tcccattagc tagttggcgg ggtaacggcc caccaaggca gtgatgggna 301 gctngtctga gaggatgatc agccacactg ggactgacac acggccnnga ctcctacggg 361 aggcagcagt gaggaatttt gcgcaatggg ggcaaccctg acgcagcaac gccgcgtgag 421 tgaagaaggc ccttgggtcg taaagctctg tcnatgggga agaagttgtg tggttcaaac 481 agggccatgc attgacggta cccatnaagg aagcaccggc tnactccgtg ccagcagccg 541 cggtnatacg gagggtgcaa gcgttattcg gaattattgg gcgtnaaggg ngcgtaggcg 601 gcctgtnaag tcatctgtga aatttcgggg ctnaaccccg gagctgcatg tgatactggc 661 aggnttgagt atggcagagg aaagcggaat tcctggtgta gcggtgaaat gcgtagatat 721 caggaggaac accrgtggcg aaggcggctn tctngaccna tactgacgct gatgcgcgaa 781 ggcgtgggta gcaaacagga ttagataccc tngtagtcca cgcagtaaac gttgatcact 841 aggtgttgcg ggtattgacc cctgagtgcn gaagctaacg cattaagtga tcngcctggg 901 gagtacgatc gcaagattaa aactcaaagg aattgacggg ggnccgcana agcggtngag 961 crtgtggttt aattcgannn aacgcgcann accttacctg ggcttgaaat ctgtggaact 1021 tgtatgaaag tatggggnnn cttcggggaa ccgcaagaca ggtgctgcat ggctgtngtc 1081 agctcgtgtc gtgagatgtt gggtnaagtc ccgcaacgag cgcaaccctc gtcttcagtt 1141 gccagcacgt aaaggtggga actctgaaga tactgccccg gtnnncgggg nggaaggtgg 1201 ggntgacgtc aagtcctnat ngcnnntntg cccagggctn cacacgtgct ncaatggtat 1261 atacaaaggg nnncgatncc gcgaggtgga nccaatccca tnaagtatgc cacagttcgg 1321 attggagtct gcaactcgac tccatgaagt tggaatcgct agtaatcgcg gatcagcatg 1381 ccgcggtgaa tacgt // LOCUS DVURR164 1395 bp ss-rRNA BCT 24-JUL-1990 DEFINITION D.baarsi 16S ribosomal rRNA. ACCESSION M34403 KEYWORDS ribosomal 16S rRNA. SOURCE D.baarsi (strain 2st14 DSM 2075) rRNA. 
ORGANISM Desulfovibrio baarsi Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Sulfate- or sulfur-reducing dissimilatory bacteria. REFERENCE 1 (bases 1 to 1395) AUTHORS Devereux,R., Delaney,M., Widdel,F. and Stahl,D.A. TITLE Natural relationships among sulfate-reducing eubacteria JOURNAL J. Bacteriol. 171, 6689-6695 (1989) STANDARD full staff_review COMMENT Draft entry and computer-readable [or printed] sequence for [1] kindly submitted by D.A.Stahl, 22-MAY-1990. FEATURES from to/span description rRNA < 1 1395 16S rRNA BASE COUNT 334 a 291 c 427 g 286 t 57 others ORIGIN 1 gaacggagag tttgatcctg gctcagaatg aacgctggcg gcgtgctnaa cacatgcaag 61 tcgaacgaga aagttccttc gggggataag taaagtggcg cacgggtgag taatgcgtgg 121 ataatctacc rttgaattcg gaataacatc gcgaaagcgg tgctaatacc ggatratgtt 181 gtrttttctt cggggaatgc aaccaaagaa tgcctcttca tgaaagcatt tgtttgagga 241 tgagtccgcg tcccattagc tagttggcgg ggtaacggcc caccaaggca gtgatgggna 301 gctngtctga gaggatgatc agccacactg ggactgacac acggccnnga ctcctacggg 361 aggcagcagt gaggaatttt gcgcaatggg ggcaaccctg acgcagcaac gccgcgtgag 421 tgaagaaggc ccttgggtcg taaagctctg tcnatgggga agaagttgtg tggttcaaac 481 agggccatgc attgacggta cccatnaagg aagcaccggc tnactccgtg ccagcagccg 541 cggtnatacg gagggtgcaa gcgttattcg gaattattgg gcgtnaaggg ngcgtaggcg 601 gcctgtnaag tcatctgtga aatttcgggg ctnaaccccg gagctgcatg tgatactggc 661 aggnttgagt atggcagagg aaagcggaat tcctggtgta gcggtgaaat gcgtagatat 721 caggaggaac accrgtggcg aaggcggctn tctngaccna tactgacgct gatgcgcgaa 781 ggcgtgggta gcaaacagga ttagataccc tngtagtcca cgcagtaaac gttgatcact 841 aggtgttgcg ggtattgacc cctgagtgcn gaagctaacg cattaagtga tcngcctggg 901 gagtacgatc gcaagattaa aactcaaagg aattgacggg ggnccgcana agcggtngag 961 crtgtggttt aattcgannn aacgcgcann accttacctg ggcttgaaat ctgtggaact 1021 tgtatgaaag tatggggnnn cttcggggaa ccgcaagaca ggtgctgcat ggctgtngtc 1081 agctcgtgtc gtgagatgtt gggtnaagtc ccgcaacgag cgcaaccctc gtcttcagtt 1141 gccagcacgt aaaggtggga actctgaaga tactgccccg gtnnncgggg nggaaggtgg 1201 ggntgacgtc 
aagtcctnat ngcnnntntg cccagggctn cacacgtgct ncaatggtat 1261 atacaaaggg nnncgatncc gcgaggtgga nccaatccca tnaagtatgc cacagttcgg 1321 attggagtct gcaactcgac tccatgaagt tggaatcgct agtaatcgcg gatcagcatg 1381 ccgcggtgaa tacgt // LOCUS DRORNP70K 3569 bp ds-DNA INV 24-JUL-1990 DEFINITION D.melanogaster U1 70K small nuclear ribonucleoprotein gene, complete cds. ACCESSION M31162 KEYWORDS ribonucleoprotein; small nuclear ribonucleoprotein. SOURCE D.melanogaster (strain Oregon R) DNA, clones lambda D331.[1,2]; and adult head, cDNA to mRNA, clones pBS9.[1,3] and lambda R3-2.1. ORGANISM Drosophila melanogaster Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta; Pterygota; Neoptera; Holometabola; Diptera; Brachycera; Cyclorrhapha; Schizophora; Drosophiloidea; Drosophilidae. REFERENCE 1 (bases 1 to 3569) AUTHORS Mancebo,R., Lo,P.C.H. and Mount,S.M. TITLE Structure and expression of the Drosophila melanogaster gene for the U1 small nuclear ribonucleoprotein particle 70K protein JOURNAL Mol. Cell. Biol. 10, 2492-2502 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by R.Mancebo, 09-JAN-1990. FEATURES from to/span description pept 614 1960 U1 70K small nuclear ribonucleoprotein, exon 2 (first expressed exon) pre-msg < 1 2259 snRNP mRNA and intron (alt.) pre-msg < 1 3367 snRNP mRNA and intron (alt.) IVS 488 566 U1 70K snRNP, intron A site 103 182 RNA recognition motif BASE COUNT 972 a 839 c 919 g 839 t ORIGIN Chromosome II; map position 27D1,2. 
1 gcatgaagga aaatattcta caaaaaactt caattttata aaattcattt aaaatacaaa 61 attgtacgta aacttaacgt aaccgttact cagttatgga atgtgtgagc gagatggtga 121 agcagcagca agtgatgtag caaattgcaa ttgaacgcga gtgggaaaag gggcaactat 181 aaaaccgaga aacttgcttt tagcatggat tcgaacccct tatttatagt actctggatg 241 tccgagacac acacctattt gtggtattta tattttataa cgtaagtagt atatttaatt 301 actaatcagt atttcatgcg gaattcttcc gcttaattca tagaccgcgc gggggtcaca 361 cttgctactc aagccaggcg aaaaactaaa gaaaatcggg aaaatacttg gtctgcaccg 421 aattatattg ctggtactta aacgaagtac cctagattta ttcttgccaa gcggatggct 481 gtttaaggtg agttgcgcca gcgcttacta tcccttgtgg agtaaacaaa ctccaaccta 541 acctcaaact gaccgttttt ttgcagacga ggaacttcag gaaaaggtaa aacaaaacaa 601 aaaagcccac aaaatgaccc aatatctgcc gccgaatctg ctggcgctgt tcgcggcacg 661 ggagcccatc ccgttcatgc cgccggtgga caagctgccg cacgagaaga agtctcgcgg 721 ctacctggga gtggccaagt tcatggccga tttcgaggat cccaaggaca cgccgctgcc 781 gaaaacggtg gaaacgcgtc aggagcggct ggagcgacgc cggcgcgaga aggccgagca 841 ggtggcctac aagctggagc gtgagatagc gctgtgggac cccacagaga tcaaaaatgc 901 cacggaggac ccgtttcgca cgctgttcat tgcacgcatc aactacgaca cgtccgagtc 961 gaagctgcgg cgtgagttcg agttctacgg gcccatcaag aagatcgtcc tgatccacga 1021 ccaggaatca ggtaaaccca agggctacgc cttcatcgag tacgagcacg agcgggacat 1081 gcatgccgcc tacaagcacg ccgatggtaa gaagatcgac agcaagcgcg tcctggtgga 1141 cgtggagcgg gctcgcacgg tcaagggctg gctgcctcga cgcctgggcg gcggtctggg 1201 tggaacgcgc cgcggcggca acgatgtcaa cattaagcac tccggccgcg aggacaacga 1261 gagggaacgc gagcgctacc ggctggagcg ggagcgtgag gatcgcgagg gtcctggacg 1321 cggcggcggc tccaatggcc tggatgcccg gcccggacgc ggtttcggtg cggaacgtcg 1381 acgttcccgc tccagggaac gccgcgaccg tgaacgagat cgcggacggg gcgctgtggc 1441 tagcagcggt cgctcgcgca gccgttctcg cgagcgcaga aaacgacgag cgggcagccg 1501 ggagcggtac gacgagttcg accgccgcga tcggcgggac agggagcgcg agcgtgatcg 1561 cgatcgcgag cgtgagaaga aaaagaagcg ctccaagtct cgcgaacgcg aatcctccag 1621 ggagcgtcgc gaacggaagc gagagagaag ggaccgtgaa cgcggcaccg gatccggcgg 1681 cgatgtcaag gagcgcaagc 
ccgatttccg tgatatggat gtcatcaaga tcaaggagga 1741 gcccgtcgac gatggctatc ccacatttga ctaccagaac gcgaccatca agcgtgagat 1801 cgacgatgag gatgaggaga agtaccggcc gccgcctgcg catcacaata tgttcagtgt 1861 gccgccgccg cccattttgg ggcgtggaaa tgccagcacg aatcccaatc ccgacaatgg 1921 ccagcagagc tccggcgacc cgagttggtg gcgtcagtag agtcttggag cgatgtgtca 1981 aggttaatat ttctagaagt cagacgtctg tgggtcgcaa ctatttattc attcgactcc 2041 gaggcacccc aatcctggag agcctaaccg tagctatcca ttttacacag aaaattttaa 2101 ggtaccactt ggaggagggc caagtagaac aagattatag aaaacccgac cgctaaacgc 2161 agaatccgct aatgtgtgcg tagcttaaat cacttaaatt tataagtaac tcttaacaaa 2221 tgaatatgaa aacagtaagt aaaataaagc tagccctcat gtgtttgttt ccccaccttt 2281 ggtaaggggg ttaaagggaa tacggagagt caggagctgg aacgctttcg gtggcgcata 2341 caccgtacta tatggttact ccatcccatg gtggttcctg ggattttcta actcacctaa 2401 cataataagc tgaacaatac aaacccttgc actaactcgt gccttttatt ttctctgttt 2461 ttttgcagtt ttcaatcaat tgaaaatctg actctgacta gtgtgaaagc aaaagcataa 2521 gtatttaatc aaacaaacag taatccaaaa acggaaatta gttccgccag tattcgtagc 2581 ccatgcccaa gtctaaattc caagcccaca tcaggtaatt tggtctacgc acaaacctca 2641 ctaatccatg cgtctaccgt tctaggacag ctctagaatc aagacagcta ccgcaatact 2701 tttccaatct cctccgctct gggttgcctg tgttgtgtgg tggtgtggcg tgtaagttga 2761 ttccgggcta agaaattttg taaaccaaaa ccttttccgt aagtttgccc cggtaagatt 2821 acgatatcct cgcctaaccg ccgtggatcg gatggatgag tgagttagtg taagggagct 2881 ttcctctgtt tggtacacat tgcgaactgc tccgatggcc tgtggcaatc tactccattc 2941 atccattatg tctgtaacca atttaccatt tcgatctttt catgtacgtt gagctgattg 3001 ttatcgtact gaagacgaat ccgcggcgga aactcgcata gaacaaacag aaaactgcgc 3061 aaaggtttgc ttttggtaac tggtaacgat tggttgtggt tggtcaggtc aggggtcttt 3121 cgagctggtt agcctccact ttggattgcg gacggtgaca aaattctggg ggtcttttat 3181 ttttgtattg tattgtttgg cagacaggat ccatgcaatt gccttttcgt accgtgcgac 3241 atacgggaaa caaactatcg tgtagcaggg catttttcct attacaccat tattaagaaa 3301 agcggacaga gatcaatgtc acatttagat ttgatcaaat aaagaaaaga attcctcctt 3361 acacgaaaaa gtacttgttt catttctaag 
tatttatcaa gatgacttga attttttaca 3421 attttcgctt gttaaactgg tctatatgtc ctggataaga tttgcagtgg cgtccttgga 3481 atctctaaaa atgtatacat ttgttgcaaa taaagcattg tgaaatctat ggacattaaa 3541 tgcgacctct attggaaaac attcacata // LOCUS FSBAFP05A 946 bp ds-DNA VRT 24-JUL-1990 DEFINITION Macrozoarces americanus antifreeze protein OP5A gene, complete cds. ACCESSION J03924 KEYWORDS antifreeze protein. SOURCE Ocean pout adult testis DNA, clone lambda-OP5. ORGANISM Macrozoarces americanus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Osteichthyes; Actinopterygii; Gadiformes; Zoarcoidei; Zoarcidae. REFERENCE 1 (bases 1 to 946) AUTHORS Hew,C.L., Wang,N.-C., Joshi,S., Fletcher,G.L., Scott,G.K., Hayes,P.H., Buettner,B. and Davies,P.L. TITLE Multiple genes provide the basis for antifreeze protein diversity and dosage in the ocean pout, Macrozoarces americanus JOURNAL J. Biol. Chem. 263, 12049-12055 (1988) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence [1] kindly submitted by P.H.Hayes, 13-JUN-1988. FEATURES from to/span description pept 270 338 antifreeze protein, exon 1 521 715 antifreeze protein, exon 2 IVS 339 520 antifreeze protein, intron A BASE COUNT 283 a 209 c 186 g 268 t ORIGIN 211 bp upstream of Sau3AI site. 
1 aagcttgtga tagtttggac aaaaacaagt tatactttac ttataagaat ataaaatttc 61 cattgcaatt ggcataagga ggtgtgacac agtgacctac tttcaggcca ataggaaacg 121 ggatatgccg gttaagtcct cccacatact gtatattaga tgcagcacat ggacctgtcc 181 tgtcagaagt ctcagctaca gctttcactt cgatctccga taattaatta attaattaat 241 tattaattaa ttaagtctca gccacagcca tgaagtccgt tattttaacc ggtttgcttt 301 tcgtcctcct ttgtgtcgac cacatgacag ccagccaggt gagatattct tgctccacta 361 aaaaatattc aaaaatgtga gctacagtaa aattcaacag tgttctgttt agaaagacag 421 agaacctttt aagtaaacat ttttagaatt ttctttttca actgtgccat gagaacaata 481 ataacgtctg accttttatc ttccattctt caacggtcag tccgtggtgg ccacccagct 541 gatccccata aatactgccc tgactccggt gatgatggag gggaaggtga ccaacccaat 601 aggcatcccg ttcgcagaga tgtcccaaat agtggggaag caagtgaaca cgccagtggc 661 taagggccaa accctcatgc caaacatggt gaaaacgtac gccgcgggaa agtagttctg 721 agggtgccaa ggagcttctt cccaaaacca aaagaagaaa tgccccctct cacaattaac 781 cctgtttttg tcacaaaccc aagtctgtta actgaacatg tcaaaacctg tggagactgt 841 tgagatttga tgttctgaaa agataaagcc tataaataaa atgttgccca aatttcctgc 901 ctgatgtttt tctttgtcgt tgctacatgg ctttgctgct cggatc // LOCUS HUMCRPX 78 bp ss-mRNA PRI 24-JUL-1990 DEFINITION Human C-reactive protein (CRP) mRNA, 5' end. ACCESSION M35163 KEYWORDS C-reactive protein. SOURCE Human adult liver, cDNA to mRNA, clone pCRP5. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 78) AUTHORS Tucci,A., Goldberger,G., Whitehead,A.S., Kay,R.M., Woods,D.E. and Colten,H.R. TITLE Biosynthesis and postsynthetic processing of human C-reactive protein JOURNAL J. Immunol. 
131, 2416-2419 (1983) STANDARD simple staff_review FEATURES from to/span description pept 1 > 78 C-reactive protein precursor /hgml_locus_uid="LR0093W" /nomgen="CRP" /map="1q21-q23" sigp 1 54 C-reactive protein signal peptide matp 55 > 78 C-reactive protein BASE COUNT 15 a 17 c 22 g 24 t 0 others ORIGIN 1 atggagaagc tgttgtgttt cttggtcttg accagcctct ctcatgcttt tggccagaca 61 gacatgtcga ggaaggct // LOCUS PT3TAUF 117 bp ds-DNA PHG 24-JUL-1990 DEFINITION Bacteriophage T3 early terminator (T3Te) region. ACCESSION M35171 KEYWORDS early terminator region. SOURCE Bacteriophage T3 DNA. ORGANISM Bacteriophage T3 Viridae; ds-DNA nonenveloped viruses; Podoviridae. REFERENCE 1 (bases 1 to 117) AUTHORS Briat,J.-F., Bollag,G., Kearney,C.A., Molineux,I. and Chamberlin,M.J. TITLE Tau factor from Escherichia coli mediates accurate and efficient termination of transcription at the bacteriophage T3 early termination site in vitro JOURNAL J. Mol. Biol. 198, 43-49 (1987) STANDARD simple staff_review FEATURES from to/span description signal 85 106 transcription termination signal BASE COUNT 28 a 37 c 28 g 24 t ORIGIN 1 cggacggctc cctgcgacac cctagcttcg agaagttccg aggaactgag gacaaccctc 61 aagagaaaat gtaaccaact cactggctca ccttcacggg tgggcctttc ttcgttc // LOCUS VIBRHP 1489 bp ds-DNA BCT 24-JUL-1990 DEFINITION V.harveyi Arg-tRNA, two Pro-tRNA and two His-tRNA genes. ACCESSION M29762 M31832 KEYWORDS transfer RNA-Arg; transfer RNA-His; transfer RNA-Pro. SOURCE V.harveyi (strain ATCC 14126) DNA. ORGANISM Vibrio harveyi Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Facultatively anaerobic rods; Vibrionaceae. REFERENCE 1 (bases 1 to 1489) AUTHORS Giroux,S. and Cedergren,R. TITLE Evolution of a tRNA operon in gamma purple bacteria JOURNAL J. Bacteriol. 171, 6446-6454 (1989) STANDARD simple staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by R.J.Cedergren, 06-NOV-1989. 
FEATURES from to/span description tRNA 60 136 Arg-tRNA tRNA 171 247 His-tRNA tRNA 324 400 Pro-tRNA tRNA 450 523 His-tRNA tRNA 588 664 Pro-tRNA anticdn 94 96 Arg-tRNA anticodon ccg anticdn 205 207 His-tRNA anticodon gtg anticdn 358 360 Pro-tRNA anticodon tgg anticdn 481 483 His-tRNA anticodon gtg anticdn 617 619 Pro-tRNA anticodon tgg signal 11 16 -35 region signal 34 39 -10 region BASE COUNT 351 a 331 c 360 g 447 t ORIGIN 1 aaaaaagtac tagacggcat agtgtgatat ccgtattatt cctctccgcc gatgggcatg 61 cgcccgtagc tcagttggat agagcgttgg cctccggagc caaaggtcga aggttcgaat 121 cctttcgggc gtgccattcc ggaatacaaa ttaaaggcac gaacaatagt ggtggctata 181 gctcagttgg tagagccctg gattgtgatt ccggtggtcg cgagttcgaa tctcgttagc 241 caccccatta tttcggtaac tcattaaatt agtgaagttt ccagttttga ttcaaatatc 301 ccaagagtcg aaacaaaatt cgtcggtgaa tagcgcagct tggtagcgca tctggtttgg 361 gaccagaggg tcgggggttc gaatccctct tcaccgacca ctatttagtt tatggctata 421 atagagcggt aaacggcaaa attgatggtg gctatagctc agttggtaga gccctggatt 481 gtgattccgg tggtcgcgag ttcgaatctc gttagccacc ccattaattt aggtgacttc 541 ttagttgaag tatcctgtct gatttgagag aatcagcaaa tacagttcgg tgaatagcgc 601 agcttggtag cgcatctggt ttgggaccag agggtcgggg gttcgaatcc ctcttcaccg 661 accactatta gaaagcctgc tttttagcag gctttcgtcg ttttagagct tatgatctcg 721 cttggactag atgagtcggc aatatttcaa tggatttgaa cccatatggg gttcgcctga 781 agttagaata tggctcttca tcgaccaata tttaaagcct cagtcgaaag actagggctt 841 ttttgtattt ctcaccctaa aaaatctgca ttccatttca atatcagtaa acatgtacag 901 ctctggaagc ttttgaccac ctatgatcta aatgcgattc gagctgataa aagaaacccc 961 agcacgttgg ctggggtttt gtgtttcttg aattaagcgc tttactgagc aagcgttttt 1021 aggatttcat tcagctcaac cgtctctcgc acttgatagt tgggtttgat atcggttgta 1081 tttccgtacc tgttgcgttt acccacacgc tcagaatacc catattgttg gcacccgcaa 1141 tgtctgctgg tagagagtcg ccaatatgaa tcacctcttc tggctttgca tccacaggtt 1201 gagtgctttg cggaaaatgt tggctgctgg cttctcttct ggctcttccc caccgacgat 1261 aatgtgatca acccattcgc tcatttgtgt tgctttgagt tttgggtgct gtgagaagat 1321 tggaccgtta gtgatcacca 
ctaatttgta gtgttgacgt agctcggtca gcatctcttt 1381 tacgccaggg aagaagtcaa aagcccccat gcgagccgag tcaaaatagc tctgcgcctg 1441 ttgagccttt tctgtatcaa tctctatact ttgctctgcc aagatggcc // LOCUS YSCPOX1 2915 bp ds-DNA PLN 24-JUL-1990 DEFINITION Yeast (S.cerevisiae) acyl-coenzyme A oxidase (POX1) gene, complete cds. ACCESSION M27515 KEYWORDS acyl-coenzyme A oxidase. SOURCE Yeast (S. cerevisiae) DNA, clone PAD17. ORGANISM Saccharomyces cerevisiae Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes; Endomycetales; Saccharomycetaceae. REFERENCE 1 (bases 1 to 2915) AUTHORS Dmochowska,A., Dignard,D., Maleszka,R. and Thomas,D.Y. TITLE Structure and transcriptional control of the Saccharomyces cerevisiae POX1 gene encoding acyl-coenzyme A oxidase JOURNAL Gene 88, 247-252 (1990) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly provided by D.Dignard, 01-SEP-1989. FEATURES from to/span description pept 455 2701 acyl-coenzyme A oxidase mRNA 364 2843 POX1 mRNA (minor alt.) mRNA 365 2843 POX1 mRNA (minor alt.) mRNA 369 2843 POX1 mRNA (minor alt.) mRNA 386 2843 POX1 mRNA (major alt.) mRNA 400 2843 POX1 mRNA (major alt.) mRNA 406 2843 POX1 mRNA (minor alt.) mRNA 413 2843 POX1 mRNA (minor alt.) BASE COUNT 895 a 532 c 600 g 888 t ORIGIN 1 bp upstream of BglII site; chromosome VII. 
1 agatctcgac caaaaaaagg gggtagggta ataaaattaa ccctattatt ttttaacttt 61 aaaacctata atgtgctaat attttattat aaacctcctt tttttgcgtt caaaccctga 121 cacattttaa gccctatatt tacggtatta gttgattaaa ctccgaagcg aaaggaattc 181 ggtcattagc ggctaatagc cgttggggta aatcacctac aagcaagtac acaagagaac 241 gttggcgttg ttaagtcaaa gcactaatac attggggctt taagagtgtt tataaaggtc 301 taacctgtaa aaattattta aacaacttga acaggcctta aagttttcct cattccgctc 361 atcatcacta atattgctct ccgtttttga atacacactt gacactaata agtatcacag 421 aaaaaaagaa aatataataa attagtattg cgatatgacg agacgtacta ctattaatcc 481 cgattcggtg gttctgaatc ctcaaaaatt tatccagaaa gaaagggcgg attcgaaaat 541 caaagttgac caagttaaca catttttaga gtcatccccg gagaggagaa ctctgacgca 601 cgccttaata gaccaaatag tgaatgatcc tatattgaaa actgatacgg actattacga 661 tgctaaaaaa atgcaagaga gagaaattac tgccaaaaaa atagctaggc ttgctagtta 721 tatggagcac gatatcaaaa cagtgcgcaa acactttcgc gacactgacc tgatgaaaga 781 gttgcaagca aatgatccag acaaagcttc gcctttaaca aacaaagacc tttttatatt 841 cgataagaga ttgtcacttg tagcaaatat tgatcctcaa ttgggtacgc gcgtgggtgt 901 acacttgggg ctatttggta attgtatcaa gggcaatggt actgatgagc aaatccggta 961 ttggttgcag gagagaggtg ccactttgat gaaaggtata tatggctgtt ttgcaatgac 1021 tgagttagga catggttcca atgttgccca gctgcagact agggctgtgt acgataagca 1081 aaatgatact tttgtaattg atacacctga tctaactgcc accaaatggt ggattggtgg 1141 ggctgcccat tctgccacgc acgctgccgt gtacgccaga ttgatcgttg aaggtaaaga 1201 ctacggtgta aaaacattcg ttgttcctct gagagaccct tcgactttcc aactgttagc 1261 tggtgtttcc ataggggata ttggagcgaa gatgggtcgt gacggtattg ataatggctg 1321 gatccagttc agaaacgtag ttatccctag agaatttatg ctaagtagat ttaccaaagt 1381 tgtccgttct ccagatggtt cagtcaccgt caaaactgag ccacaattgg atcaaatttc 1441 tggttatagt gcattgttaa gtggtagagt taacatggtc atggattcat ttaggtttgg 1501 ctccaaattt gctactattg ctgtacgtta cgcggttggt cgtcagcaat tcgcacctag 1561 aaagggattg tctgaaacac aattaatcga ctatcccctt caccaatatc gtgttttacc 1621 acaattgtgt gttccatatt tggtgtcacc tgtagctttt aagttaatgg acaactatta 1681 ttccactttg gacgagttat 
acaacgcttc ctcatctgca tacaaagctg ctctggttac 1741 cgtgagtaaa aagttgaaga atttatttat tgatagcgcc acgttgaaag ccaccaatac 1801 ttggttaatt gctacactga ttgatgagtt gagacagact tgcggaggac atgggtattc 1861 acagtataac ggatttggta aaggctatga cgactgggtg gttcagtgca catgggaggg 1921 tgataataat gttttatctt taacttcagc aaaatcaata ttgaaaaaat ttatcgattc 1981 agccacaaag ggtagatttg acaacacact ggatgtggac tcattctctt acttaaaacc 2041 tcagtacata ggatctgtgg tttctggaga aataaagagt ggtttaaagg agttgggtga 2101 ttatactgaa atttggtcta tcaccttaat caaattactg gcacatattg gtactttagt 2161 tgaaaaatca agaagtattg atagcgtttc taagctttta gtcttagtat ccaaatttca 2221 tgccttgcgc tgcatgttga aaacctatta cgacaagtta aactctcgtg attcacatat 2281 ttccgatgaa attacaaagg aatctatgtg gaatgtttat aagttatttt ccttgtattt 2341 tattgacaag cattccggag aattccaaca attcaagatc ttcactcctg atcagatctc 2401 taaagttgtg cagccacaac tattggctct tttgccaatt gtgaggaaag actgtatagg 2461 tctgacagac tcctttgaat tacctgacgc gatgttaaat tctcctatag gttactttga 2521 tggcgatatc tatcacaatt acttcaatga agtttgccgc aataatccag tggaggcaga 2581 tggggcaggg aagccttctt atcatgcgct gttgagcagc atgctcggta gaggtttcga 2641 atttgaccaa aagttaggtg gtgcagctaa tgcggaaatt ttatcgaaaa taaacaagtg 2701 agtagaggtt tcctgttttc cttcgaaccc tctgttttgc gacttttgtt tcaattcaac 2761 tagtgtcgcc aagttttaac aaaaagttac aaaatcctag tgagaggcca tcttatgtgc 2821 ataacggtac tctctatcta tttacatatc taatactatt cacataacta tgacgaatca 2881 atgacatgac tacatttacc aatgtatagt agtaa // LOCUS YSPUGU1 507 bp ds-DNA PLN 24-JUL-1990 DEFINITION S.pombe U1 small nuclear RNA gene (snu1), complete cds. ACCESSION M29062 KEYWORDS U1 small nuclear RNA; small nuclear RNA. SOURCE S.pombe (sub-species lindner; strain h-972) DNA, clone Bb. ORGANISM Schizosaccharomyces pombe Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes; Endomycetales; Saccharomycetaceae. REFERENCE 1 (bases 1 to 507) AUTHORS Porter,G., Brennwald,P. and Wise,J.A. 
TITLE U1 small nuclear RNA from Schizosaccharomyces pombe has unique and conserved features and is encoded by an essential single-copy gene JOURNAL Mol. Cell. Biol. 10, 2874-2881 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by G.L.Porter, 13-OCT-1989. FEATURES from to/span description uRNA 240 387 U1 small nuclear RNA BASE COUNT 148 a 82 c 89 g 188 t ORIGIN 3 bp upstream of HincII site. 1 gttaactatt catggcttga aagcaacgaa atcatttcct attatgaaat tttggagatt 61 tatataatta caaatcgctg aacgatttca ctaatcagca aaatttaacg atttaaatat 121 cgtttttgtt aaataattat actgtacttc tttattgcat gtaagaatgt gactactttc 181 gtgactgctt cgttccgcgt cgttactata aatatgggtg cttacactta cggttcttta 241 cttacctggc atgagtttct gcagcacaag aattgtggag actcagttat ttgtcttggc 301 attgcactga gccctgacga ataactgtgg actggctaag gtcagctccg gatgcatcat 361 ttttgagttc gtccctcatt tggggcaatt ttactaatta aggcgatatg gttgccttaa 421 atagtaaagt tatccgaaat ggattatgta aagtaacatt ttctttcatc atatataatt 481 gtaaattatt gtatcataat taatatt // LOCUS MUSIGHAAZ 149 bp ss-mRNA ROD 24-JUL-1990 DEFINITION Mouse Ig rearranged H-chain mRNA V-D-JH4-region, Id+ hybridoma C16-11D11, partial cds. ACCESSION M33399 KEYWORDS diversity exon; immunoglobulin heavy chain; joining exon; processed gene; variable region. SOURCE Mouse (strain Balb.b) adult Id+ hybridoma C16-11D11, cDNA to mRNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 149) AUTHORS Borriero,L., Giorgetti,C.A., Smith,G., Landry,D., Selsing,E., Zhukovsky,E. and Press,J.L. TITLE Neonatal and adult primary B cells use the same germ-line V-H and V-kappa genes in their (T,G)-A-L-specific repertoire JOURNAL J. Immunol. 
144, 583-592 (1990) STANDARD simple staff_review FEATURES from to/span description pept < 1 > 149 Ig H-chain V-D-JH4-region (AA at 1) recomb 87 88 V-region end/D-region start recomb 102 103 D-region end/JH4-region start BASE COUNT 39 a 45 c 35 g 30 t ORIGIN Chromosome 12. 1 ataacagcag acacatcctc caacacagcc tacctgcagc tcagcagcct gacatctgag 61 gacactgccg tctattactg tgctagagct ggtaagggag gctatgctat ggactactgg 121 ggtcaaggaa cctcagcacc gtctcctca // LOCUS MUSIGHABB 136 bp ss-mRNA ROD 24-JUL-1990 DEFINITION Mouse Ig rearranged H-chain mRNA V-D-JH4-region, Id+ hybridoma C20-7D6, partial cds. ACCESSION M33401 KEYWORDS diversity exon; immunoglobulin heavy chain; joining exon; processed gene; variable region. SOURCE Mouse (strain Balb.b) adult Id+ hybridoma C20-7D6, cDNA to mRNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 136) AUTHORS Borriero,L., Giorgetti,C.A., Smith,G., Landry,D., Selsing,E., Zhukovsky,E. and Press,J.L. TITLE Neonatal and adult primary B cells use the same germ-line V-H and V-kappa genes in their (T,G)-A-L-specific repertoire JOURNAL J. Immunol. 144, 583-592 (1990) STANDARD simple staff_review FEATURES from to/span description pept < 1 > 135 Ig H-chain V-D-JH4-region (AA at 1) recomb 87 88 V-region end/D-region start recomb 105 106 D-region end/JH4-region start BASE COUNT 36 a 37 c 32 g 26 t 5 others ORIGIN Chromosome 12. 1 ataacagcag acacatcctc caacacagcc tacctgcagc tcagcagcct gncatctgag 61 gacactncng tctattactg tgctagaggg nataggtacg acggtnctat ggactactgg 121 ggtcaaggaa cctcag // LOCUS MUSIGHABC 131 bp ss-mRNA ROD 24-JUL-1990 DEFINITION Mouse Ig rearranged H-chain mRNA V-D-JH4-region, Id+ hybridoma 29-1G5, partial cds. ACCESSION M33402 KEYWORDS diversity exon; immunoglobulin heavy chain; joining exon; processed gene; variable region. SOURCE Mouse (strain Balb.b) neonatal Id+ hybridoma 29-1G5, cDNA to mRNA. 
ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 131) AUTHORS Borriero,L., Giorgetti,C.A., Smith,G., Landry,D., Selsing,E., Zhukovsky,E. and Press,J.L. TITLE Neonatal and adult primary B cells use the same germ-line V-H and V-kappa genes in their (T,G)-A-L-specific repertoire JOURNAL J. Immunol. 144, 583-592 (1990) STANDARD simple staff_review FEATURES from to/span description pept < 1 > 131 Ig H-chain V-D-JH4-region (AA at 1) recomb 84 85 V-region end/D-region start recomb 87 88 D-region end/JH4-region start BASE COUNT 34 a 40 c 27 g 28 t 2 others ORIGIN Chromosome 12. 1 ataacagcag acacatcctc caacacagcc tacctgcagc tcagcagcct gacatctgag 61 gacactgcng tctattactg tgctccgtac tatgctatgg actactgggg tcaaggaacc 121 tcagtnaccg t // LOCUS MUSIGHABD 123 bp ss-mRNA ROD 24-JUL-1990 DEFINITION Mouse Ig rearranged H-chain mRNA V-D-JH4-region, Id+ hybridoma 23-9D10, partial cds. ACCESSION M33403 KEYWORDS diversity exon; immunoglobulin heavy chain; joining exon; processed gene; variable region. SOURCE Mouse (strain Balb.b) neonatal Id+ hybridoma 23-9D10, cDNA to mRNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 123) AUTHORS Borriero,L., Giorgetti,C.A., Smith,G., Landry,D., Selsing,E., Zhukovsky,E. and Press,J.L. TITLE Neonatal and adult primary B cells use the same germ-line V-H and V-kappa genes in their (T,G)-A-L-specific repertoire JOURNAL J. Immunol. 144, 583-592 (1990) STANDARD simple staff_review FEATURES from to/span description pept < 1 > 123 Ig H-chain V-D-JH4-region (AA at 1) recomb 84 85 V-region end/D-region start recomb 87 88 D-region end/JH4-region start BASE COUNT 28 a 37 c 29 g 27 t 2 others ORIGIN Chromosome 12. 
1 ataacagcag acacatcctc caacacagcc tacctgcagc tcagcagcct gacatctgag 61 gacactgcng tctattactg tgctggggct tactggggcc aagggactct ggtnactgtc 121 tct // LOCUS MUSIGHABE 129 bp ss-mRNA ROD 24-JUL-1990 DEFINITION Mouse Ig rearranged H-chain mRNA V-D-JH4-region, Id+ hybridoma 41-4F5, partial cds. ACCESSION M33404 KEYWORDS diversity exon; immunoglobulin heavy chain; joining exon; processed gene; variable region. SOURCE Mouse (strain Balb.b) neonatal Id+ hybridoma 41-4F5, cDNA to mRNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 129) AUTHORS Borriero,L., Giorgetti,C.A., Smith,G., Landry,D., Selsing,E., Zhukovsky,E. and Press,J.L. TITLE Neonatal and adult primary B cells use the same germ-line V-H and V-kappa genes in their (T,G)-A-L-specific repertoire JOURNAL J. Immunol. 144, 583-592 (1990) STANDARD simple staff_review FEATURES from to/span description pept < 1 > 129 Ig H-chain V-D-JH4-region (AA at 1) recomb 84 85 V-region end/D-region start recomb 90 91 D-region end/JH4-region start BASE COUNT 34 a 42 c 24 g 25 t 4 others ORIGIN Chromosome 12. 1 ataacagcag acacatcctc caacacagcc tacctgcagc tcagcagcct nacatctgag 61 gacactgcng tctattactg tgctaactgg gactactggg gnnaaggcac cactctcaca 121 gtctcctca // LOCUS MUSIGHABF 132 bp ss-mRNA ROD 24-JUL-1990 DEFINITION Mouse Ig rearranged H-chain mRNA V-D-JH4-region, Id+ hybridoma 49-1G5, partial cds. ACCESSION M33405 KEYWORDS diversity exon; immunoglobulin heavy chain; joining exon; processed gene; variable region. SOURCE Mouse (strain Balb.b) neonatal Id+ hybridoma 49-1G5, cDNA to mRNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 132) AUTHORS Borriero,L., Giorgetti,C.A., Smith,G., Landry,D., Selsing,E., Zhukovsky,E. and Press,J.L. 
TITLE Neonatal and adult primary B cells use the same germ-line V-H and V-kappa genes in their (T,G)-A-L-specific repertoire JOURNAL J. Immunol. 144, 583-592 (1990) STANDARD simple staff_review FEATURES from to/span description pept < 1 > 132 Ig H-chain V-D-JH4-region (AA at 1) recomb 87 88 V-region end/D-region start recomb 90 91 D-region end/JH4-region start BASE COUNT 34 a 44 c 25 g 27 t 2 others ORIGIN Chromosome 12. 1 ataacagcag acacatcctc caacacagcc tacctgcagc tcagcagcct gacatctgag 61 gacacngcng tctattactg tgctaggtac tactttgact actggggcca aggcaccact 121 ctcacagtct cc // LOCUS MUSIGHABG 141 bp ss-mRNA ROD 24-JUL-1990 DEFINITION Mouse Ig rearranged H-chain mRNA V-D-JH4-region, Id+ hybridoma 23-10E7, partial cds. ACCESSION M33406 KEYWORDS diversity exon; immunoglobulin heavy chain; joining exon; processed gene; variable region. SOURCE Mouse (strain Balb.b) neonatal Id+ hybridoma 23-10E7, cDNA to mRNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 141) AUTHORS Borriero,L., Giorgetti,C.A., Smith,G., Landry,D., Selsing,E., Zhukovsky,E. and Press,J.L. TITLE Neonatal and adult primary B cells use the same germ-line V-H and V-kappa genes in their (T,G)-A-L-specific repertoire JOURNAL J. Immunol. 144, 583-592 (1990) STANDARD simple staff_review FEATURES from to/span description pept < 1 > 141 Ig H-chain V-D-JH4-region (AA at 1) recomb 87 88 V-region end/D-region start recomb 99 100 D-region end/JH4-region start BASE COUNT 39 a 43 c 30 g 27 t 2 others ORIGIN Chromosome 12. 1 ataacagcag acacatcctc caacacagcc tacctgcagc tcagcagcct aacatctgag 61 gacactgcng tctattactg tgctagaagg ggnaactacg ctatggacta ctggggtcaa 121 ggaacctcag tcaccgtctc c // LOCUS MUSIGHABH 151 bp ss-mRNA ROD 24-JUL-1990 DEFINITION Mouse Ig rearranged H-chain mRNA V-D-JH4-region, Id+ hybridoma 44-2F5, partial cds. 
ACCESSION M33407 KEYWORDS diversity exon; immunoglobulin heavy chain; joining exon; processed gene; variable region. SOURCE Mouse (strain Balb.b) neonatal Id+ hybridoma 44-2F5, cDNA to mRNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 151) AUTHORS Borriero,L., Giorgetti,C.A., Smith,G., Landry,D., Selsing,E., Zhukovsky,E. and Press,J.L. TITLE Neonatal and adult primary B cells use the same germ-line V-H and V-kappa genes in their (T,G)-A-L-specific repertoire JOURNAL J. Immunol. 144, 583-592 (1990) STANDARD simple staff_review FEATURES from to/span description pept < 1 > 151 Ig H-chain V-D-JH4-region (AA at 1) recomb 84 85 V-region end/D-region start recomb 105 106 D-region end/JH4-region start BASE COUNT 39 a 48 c 27 g 34 t 3 others ORIGIN Chromosome 12. 1 ataacagcag acacatcctc caacacagcc tacctgcagc tcagcagcct nacatctgag 61 gacactgcng tctattactg tgcnctcatc tactatggta acacctatgc tatggactac 121 tggggtcaag gaacctcagt caccgtctcc t // LOCUS MUSIGHABI 160 bp ss-mRNA ROD 24-JUL-1990 DEFINITION Mouse Ig rearranged H-chain mRNA V-D-JH4-region, Id+ hybridoma 25-5E10, partial cds. ACCESSION M33408 KEYWORDS diversity exon; immunoglobulin heavy chain; joining exon; processed gene; variable region. SOURCE Mouse (strain Balb.b) neonatal Id+ hybridoma 25-5E10, cDNA to mRNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 160) AUTHORS Borriero,L., Giorgetti,C.A., Smith,G., Landry,D., Selsing,E., Zhukovsky,E. and Press,J.L. TITLE Neonatal and adult primary B cells use the same germ-line V-H and V-kappa genes in their (T,G)-A-L-specific repertoire JOURNAL J. Immunol. 
144, 583-592 (1990) STANDARD simple staff_review FEATURES from to/span description pept < 1 > 160 Ig H-chain V-D-JH4-region (AA at 1) recomb 87 88 V-region end/D-region start recomb 117 118 D-region end/JH4-region start BASE COUNT 44 a 44 c 36 g 33 t 3 others ORIGIN Chromosome 12. 1 ataacagcag acacatcctc caacacagcc tacctgcagc tcagcagcct nacatctgag 61 gacactgcng tctattactg tgctagaaaa gactattact acggtagtag cgggggagct 121 atggactact ggggtcaagg aacctcagtn accgtctcct // LOCUS MUSIGHABJ 125 bp ss-mRNA ROD 24-JUL-1990 DEFINITION Mouse Ig rearranged H-chain mRNA V-D-JH4-region, Id+ hybridoma 49-1E7, partial cds. ACCESSION M33409 KEYWORDS diversity exon; immunoglobulin heavy chain; joining exon; processed gene; variable region. SOURCE Mouse (strain Balb.b) neonatal Id+ hybridoma 49-1E7, cDNA to mRNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 125) AUTHORS Borriero,L., Giorgetti,C.A., Smith,G., Landry,D., Selsing,E., Zhukovsky,E. and Press,J.L. TITLE Neonatal and adult primary B cells use the same germ-line V-H and V-kappa genes in their (T,G)-A-L-specific repertoire JOURNAL J. Immunol. 144, 583-592 (1990) STANDARD simple staff_review FEATURES from to/span description pept < 1 > 125 Ig H-chain V-D-JH4-region (AA at 1) recomb 87 88 V-region end/D-region start recomb 90 91 D-region end/JH4-region start BASE COUNT 32 a 42 c 26 g 24 t 1 others ORIGIN Chromosome 12. 1 ataacagcag acacatcctc caacacagcc tacctgcagc tcagcagcct gacatctgag 61 gacactgcng tctattactg tgctagctgg gactactggg gcaaggcacc actctcacag 121 tctcc // LOCUS MUSIGHABK 91 bp ss-mRNA ROD 24-JUL-1990 DEFINITION Mouse Ig rearranged H-chain mRNA V-D-JH4-region, Id+ hybridoma 50-2G3, partial cds. ACCESSION M33410 KEYWORDS diversity exon; immunoglobulin heavy chain; joining exon; processed gene; variable region. SOURCE Mouse (strain Balb.b) neonatal Id+ hybridoma 50-2G3, cDNA to mRNA. 
ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 91) AUTHORS Borriero,L., Giorgetti,C.A., Smith,G., Landry,D., Selsing,E., Zhukovsky,E. and Press,J.L. TITLE Neonatal and adult primary B cells use the same germ-line V-H and V-kappa genes in their (T,G)-A-L-specific repertoire JOURNAL J. Immunol. 144, 583-592 (1990) STANDARD simple staff_review FEATURES from to/span description pept < 1 > 91 Ig H-chain V-D-JH4-region (AA at 3) recomb 29 30 V-region end/D-region start recomb 47 48 D-region end/JH4-region start BASE COUNT 21 a 18 c 23 g 24 t 5 others ORIGIN Chromosome 12. 1 gagacactnc ngtctattac tgtgctagag gtactacggt agtagtctat gctatggact 61 actggggtna aggaacctna gtnaccgtct c // LOCUS MUSIGHABL 107 bp ss-mRNA ROD 24-JUL-1990 DEFINITION Mouse Ig rearranged H-chain mRNA V-D-JH4-region, Id+ hybridoma 50-6D8, partial cds. ACCESSION M33411 KEYWORDS diversity exon; immunoglobulin heavy chain; joining exon; processed gene; variable region. SOURCE Mouse (strain Balb.b) neonatal Id+ hybridoma 50-6D8, cDNA to mRNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 107) AUTHORS Borriero,L., Giorgetti,C.A., Smith,G., Landry,D., Selsing,E., Zhukovsky,E. and Press,J.L. TITLE Neonatal and adult primary B cells use the same germ-line V-H and V-kappa genes in their (T,G)-A-L-specific repertoire JOURNAL J. Immunol. 144, 583-592 (1990) STANDARD simple staff_review FEATURES from to/span description pept < 1 > 107 Ig H-chain V-D-JH4-region (AA at 1) recomb 48 49 V-region end/D-region start recomb 57 58 D-region end/JH4-region start BASE COUNT 26 a 28 c 24 g 28 t 1 others ORIGIN Chromosome 12. 
1 ctcagcagcc taacatctga ggacactgcn gtctattact gtgctagtag gtacgattac 61 tatgctatgg actactgggg tcaaggaacc tcagtcaccg tctctca // LOCUS MUSIGHADE 404 bp ds-DNA ROD 24-JUL-1990 DEFINITION Mouse Ig rearranged H-chain gene V-region, Id+ hybridoma C16-15F6, partial cds. ACCESSION M33390 KEYWORDS immunoglobulin heavy chain; processed gene; variable region. SOURCE Mouse (strain Balb.b) adult Id+ hybridoma C16-15F6 DNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 404) AUTHORS Borriero,L., Giorgetti,C.A., Smith,G., Landry,D., Selsing,E., Zhukovsky,E. and Press,J.L. TITLE Neonatal and adult primary B cells use the same germ-line V-H and V-kappa genes in their (T,G)-A-L-specific repertoire JOURNAL J. Immunol. 144, 583-592 (1990) STANDARD simple staff_review FEATURES from to/span description pept < 1 20 Ig H-chain V-region, exon 1 (AA at 2) 100 > 404 Ig H-chain V-region, exon 2 IVS 21 99 Ig H-chain V-region intron A BASE COUNT 105 a 100 c 106 g 91 t 2 others ORIGIN Chromosome 12. 1 cctgatggca gtggttacag gtaaggggct ccnaagtccc aaacttgagg gtccataaac 61 tctgtgacag tggcaatcac tttgcctttc tttctacagg ggtcaattca gaggttcagc 121 tgcagcagtc tggggcagag cttgtgaagc caggggcctc agtcaagttg tcctgcacag 181 cttctggctt caacattaaa gacacctata tgcactgggt gaagcagagg cctgaacagg 241 gcctggagtg gattggaagg attgatcctg cgaatggtaa tactaaatat gacccgaagt 301 tccagggnaa ggccactata acagcagaca catcctccaa cacagcctac ctgcagctca 361 gcagcctgac atctgaggac actgccgtct attactgtgc taga // LOCUS MUSIGHADF 141 bp ss-mRNA ROD 24-JUL-1990 DEFINITION Mouse Ig rearranged H-chain mRNA V-D-JH4-region, Id+ hybridoma J7-11H3, partial cds. ACCESSION M33400 KEYWORDS diversity exon; immunoglobulin heavy chain; joining exon; processed gene; variable region. SOURCE Mouse (strain Balb.b) adult Id+ hybridoma J7-11H3, cDNA to mRNA. 
ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 141) AUTHORS Borriero,L., Giorgetti,C.A., Smith,G., Landry,D., Selsing,E., Zhukovsky,E. and Press,J.L. TITLE Neonatal and adult primary B cells use the same germ-line V-H and V-kappa genes in their (T,G)-A-L-specific repertoire JOURNAL J. Immunol. 144, 583-592 (1990) STANDARD simple staff_review FEATURES from to/span description pept < 1 > 141 Ig H-chain V-D-JH4-region (AA at 1) recomb 87 88 V-region end/D-region start recomb 96 97 D-region end/JH4-region start BASE COUNT 36 a 43 c 28 g 29 t 5 others ORIGIN Chromosome 12. 1 ataacagcag acacatcctc caacacagcc tacctgcagc tnagcagcct gacatctgag 61 gacactgcng tctattactg tgctaggagt agctacgact actttgacta ctggggnnna 121 ggcaccactc tcacagtctc c // LOCUS MUSIGHZD 234 bp ss-mRNA ROD 24-JUL-1990 DEFINITION Mouse Ig rearranged H-chain mRNA V-D-JH4-region, Id+ hybridoma C16-15F6, partial cds. ACCESSION M17008 M33398 KEYWORDS diversity exon; immunoglobulin heavy chain; joining exon; processed gene; variable region. SOURCE Mouse (strain Balb.b) adult Id+ hybridoma C16-15F6, cDNA to mRNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 234) AUTHORS Busto,P., Gerstein,R., Dupre,L., Giorgetti,C.A., Selsing,E. and Press,J.L. TITLE Molecular analysis of heavy and light chains used by primary and secondary anti-(T,G)-A--L antibodies produced by normal and Xid mice JOURNAL J. Immunol. 139, 608-618 (1987) STANDARD simple staff_review REFERENCE 2 (bases 82 to 234) AUTHORS Borriero,L., Giorgetti,C.A., Smith,G., Landry,D., Selsing,E., Zhukovsky,E. and Press,J.L. TITLE Neonatal and adult primary B cells use the same germ-line V-H and V-kappa genes in their (T,G)-A-L-specific repertoire JOURNAL J. Immunol. 
144, 583-592 (1990) STANDARD simple staff_review FEATURES from to/span description pept < 1 > 234 Ig H-chain V-D-JH4-region (AA at 1) recomb 168 169 V-region end/D-region start recomb 186 187 D-region end/JH4-region start BASE COUNT 65 a 61 c 58 g 47 t 3 others ORIGIN Chromosome 12. 1 cagggccnnn agtggattgg aaggattgat cctgcgaatg gtaatactaa atatgacccg 61 aagttccagg gcaaggccac tataacagca gacacatcct ccaacacagc ctacctgcag 121 ctcagcagcc tgacatctga ggacactgcc gtctattact gtgctagaag agggggcaat 181 ggtaactatg ctatggacta ctggggtcaa ggaacctcag tcaccgtctc ctca // LOCUS MUSIGHZL 291 bp ds-DNA ROD 24-JUL-1990 DEFINITION Mouse Ig germline H-chain gene V-region, Id+ hybridoma 29-1G5, partial cds. ACCESSION M33391 KEYWORDS germline; immunoglobulin heavy chain; variable region. SOURCE Mouse (strain Balb.b) neonatal Id+ hybridoma 29-1G5 DNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 291) AUTHORS Borriero,L., Giorgetti,C.A., Smith,G., Landry,D., Selsing,E., Zhukovsky,E. and Press,J.L. TITLE Neonatal and adult primary B cells use the same germ-line V-H and V-kappa genes in their (T,G)-A-L-specific repertoire JOURNAL J. Immunol. 144, 583-592 (1990) STANDARD simple staff_review FEATURES from to/span description pept < 1 > 291 Ig H-chain V-region (AA at 1) BASE COUNT 77 a 69 c 70 g 57 t 18 others ORIGIN Chromosome 12. 1 gaggntcagc tncagnagtc tggggnagag ctngtnaagc caggngcctc agtcaagttg 61 tcctgcacag cttctnnnnn naacattaaa gacacctata tgcactgggt naagcagagg 121 cctnaacagg gcctngagtg gattggaagg attgatcctg cgaatggtaa tactaaatat 181 gacccgaagt tccagggnaa ggccactata acagcagaca catcctccaa cacagcctac 241 ctgcagctca gcagcctgac atctgaggac actgcngtct attactgtgc t // LOCUS MUSIGHZM 291 bp ds-DNA ROD 24-JUL-1990 DEFINITION Mouse Ig germline H-chain gene V-region, Id+ hybridoma 23-9D10, partial cds. ACCESSION M33392 KEYWORDS germline; immunoglobulin heavy chain; variable region. 
SOURCE Mouse (strain Balb.b) neonatal Id+ hybridoma 23-9D10 DNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 291) AUTHORS Borriero,L., Giorgetti,C.A., Smith,G., Landry,D., Selsing,E., Zhukovsky,E. and Press,J.L. TITLE Neonatal and adult primary B cells use the same germ-line V-H and V-kappa genes in their (T,G)-A-L-specific repertoire JOURNAL J. Immunol. 144, 583-592 (1990) STANDARD simple staff_review FEATURES from to/span description pept < 1 > 291 Ig H-chain V-region (AA at 1) BASE COUNT 77 a 71 c 75 g 60 t 8 others ORIGIN Chromosome 12. 1 gaggttcagc tgnagnagtc tggggcagag cttgtgaagc caggggcctc agtcaagttg 61 tcctgcacag cttctgnntt caacattaaa gacacctata tgcactgggt gaagcagagg 121 cctnaacagg gcctngagtg gattggaagg atngatcctg cgaatggtaa tactaaatat 181 gacccgaagt tccagggcaa ggccactata acagcagaca catcctccaa cacagcctac 241 ctgcagctca gcagcctgac atctgaggac actgcngtct attactgtgc t // LOCUS MUSIGHZN 291 bp ds-DNA ROD 24-JUL-1990 DEFINITION Mouse Ig germline H-chain gene V-region, Id+ hybridoma 41-4F5, partial cds. ACCESSION M33393 KEYWORDS germline; immunoglobulin heavy chain; variable region. SOURCE Mouse (strain Balb.b) neonatal Id+ hybridoma 41-4F5 DNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 291) AUTHORS Borriero,L., Giorgetti,C.A., Smith,G., Landry,D., Selsing,E., Zhukovsky,E. and Press,J.L. TITLE Neonatal and adult primary B cells use the same germ-line V-H and V-kappa genes in their (T,G)-A-L-specific repertoire JOURNAL J. Immunol. 144, 583-592 (1990) STANDARD simple staff_review FEATURES from to/span description pept < 1 > 291 Ig H-chain V-region (AA at 1) BASE COUNT 77 a 71 c 76 g 60 t 7 others ORIGIN Chromosome 12. 
1 gaggntcagc tgcagnagtc tggggnagag cttgtgaagc caggggcctc agtcaagttg 61 tcctgcacag cttctggntt caacattaaa gacacctata tgcactgggt naagcagagg 121 cctgaacagg gcctggagtg gattggaagg attgatcctg cgaatggtaa tactaaatat 181 gacccgaagt tccagggcaa ggccactata acagcagaca catcctccaa cacagcctac 241 ctgcagctca gcagcctnac atctgaggac actgcngtct attactgtgc t // LOCUS MUSIGHZO 294 bp ds-DNA ROD 24-JUL-1990 DEFINITION Mouse Ig germline H-chain gene V1A-J2-region, Id+ hybridoma 49-1G5, partial cds. ACCESSION M33394 KEYWORDS germline; immunoglobulin heavy chain; variable region. SOURCE Mouse (strain Balb.b) neonatal Id+ hybridoma 49-1G5 DNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 294) AUTHORS Borriero,L., Giorgetti,C.A., Smith,G., Landry,D., Selsing,E., Zhukovsky,E. and Press,J.L. TITLE Neonatal and adult primary B cells use the same germ-line V-H and V-kappa genes in their (T,G)-A-L-specific repertoire JOURNAL J. Immunol. 144, 583-592 (1990) STANDARD simple staff_review FEATURES from to/span description pept < 1 > 294 Ig H-chain V-region (AA at 1) BASE COUNT 78 a 69 c 76 g 59 t 12 others ORIGIN Chromosome 12. 1 gaggttcagc tgcagcagtc tggggcagan cttgtnaagc caggggcctc agtcaagttg 61 tcctgcacag cttctggnnt naacattaaa gacacctata tgcactgggt gaagcagagg 121 cctnaacagg gcctggagtg gattggaagg attgatcctg cgaatggtaa tactaaatat 181 gacnngaagt tcnagngcaa ggccactata acagcagaca catcctccaa cacagcctac 241 ctgcagctca gcagcctgac atctgaggac acngcngtct attactgtgc tagg // LOCUS MUSIGHZP 294 bp ds-DNA ROD 24-JUL-1990 DEFINITION Mouse Ig germline H-chain gene V-region, Id+ hybridoma 23-10E7, partial cds. ACCESSION M33395 KEYWORDS germline; immunoglobulin heavy chain; variable region. SOURCE Mouse (strain Balb.b) neonatal Id+ hybridoma 23-10E7 DNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. 
REFERENCE 1 (bases 1 to 294) AUTHORS Borriero,L., Giorgetti,C.A., Smith,G., Landry,D., Selsing,E., Zhukovsky,E. and Press,J.L. TITLE Neonatal and adult primary B cells use the same germ-line V-H and V-kappa genes in their (T,G)-A-L-specific repertoire JOURNAL J. Immunol. 144, 583-592 (1990) STANDARD simple staff_review FEATURES from to/span description pept < 1 > 294 Ig H-chain V-region (AA at 1) BASE COUNT 80 a 72 c 78 g 60 t 4 others ORIGIN Chromosome 12. 1 gaggttcagc tgcagnagtc tggggcagag cttgtgaagc caggggcctc agtcaagttg 61 tcctgcacag cttctggnnt caacattaaa gacacctata tgcactgggt gaagcagagg 121 cctgaacagg gcctggagtg gattggaagg attgatcctg cgaatggtaa tactaaatat 181 gacccgaagt tccagggcaa ggccactata acagcagaca catcctccaa cacagcctac 241 ctgcagctca gcagcctaac atctgaggac actgcngtct attactgtgc taga // LOCUS MUSIGHZQ 291 bp ds-DNA ROD 24-JUL-1990 DEFINITION Mouse Ig germline H-chain gene V-region, Id+ hybridoma 44-2F5, partial cds. ACCESSION M33396 KEYWORDS germline; immunoglobulin heavy chain; variable region. SOURCE Mouse (strain Balb.b) neonatal Id+ hybridoma 44-2F5 DNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 291) AUTHORS Borriero,L., Giorgetti,C.A., Smith,G., Landry,D., Selsing,E., Zhukovsky,E. and Press,J.L. TITLE Neonatal and adult primary B cells use the same germ-line V-H and V-kappa genes in their (T,G)-A-L-specific repertoire JOURNAL J. Immunol. 144, 583-592 (1990) STANDARD simple staff_review FEATURES from to/span description pept < 1 > 291 Ig H-chain V-region (AA at 1) BASE COUNT 77 a 72 c 76 g 59 t 7 others ORIGIN Chromosome 12. 
1 gaggttcagc tgcagcagtc tggggnagag cttgtgaagc caggggcctc agtcaagttg 61 tcctgcacag cttctggnnt caacattaaa gacacctata tgcactgggt gaagcagagg 121 cctnaacagg gcctggagtg gattggaagg attgatcctg cgaatggtaa tactaaatat 181 gacccgaagt tccagggcaa ggccactata acagcagaca catcctccaa cacagcctac 241 ctgcagctca gcagcctnac atctgaggac actgcngtct attactgtgc n // LOCUS MUSIGHZT 294 bp ds-DNA ROD 24-JUL-1990 DEFINITION Mouse Ig germline H-chain gene V-region, Id+ hybridoma 25-5E10, partial cds. ACCESSION M33397 KEYWORDS germline; immunoglobulin heavy chain; variable region. SOURCE Mouse (strain Balb.b) neonatal Id+ hybridoma 25-5E10 DNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 294) AUTHORS Borriero,L., Giorgetti,C.A., Smith,G., Landry,D., Selsing,E., Zhukovsky,E. and Press,J.L. TITLE Neonatal and adult primary B cells use the same germ-line V-H and V-kappa genes in their (T,G)-A-L-specific repertoire JOURNAL J. Immunol. 144, 583-592 (1990) STANDARD simple staff_review FEATURES from to/span description pept < 1 > 294 Ig H-chain V-region (AA at 1) BASE COUNT 79 a 73 c 78 g 60 t 4 others ORIGIN Chromosome 12. 1 gaggttcagc tgcagcagtc tggggcagag cttgtgaagc caggggcctc agtcaagttg 61 tcctgcacag cttctggcnt caacattaaa gacacctata tgcactgggt gaagcagagg 121 cctgaacagg gcctggagtg gattggaagg attgatcctg cgaatggtaa tactaaatat 181 gacccgaagt tcnagggcaa ggccactata acagcagaca catcctccaa cacagcctac 241 ctgcagctca gcagcctnac atctgaggac actgcngtct attactgtgc taga // LOCUS MUSIGKABO 339 bp ds-DNA ROD 24-JUL-1990 DEFINITION Mouse Ig rearranged kappa-chain gene V1A-J2-region, Id+ hybridoma 29-1G5, partial cds. ACCESSION M33389 KEYWORDS immunoglobulin light chain; joining exon; kappa-immunoglobulin; processed gene; variable region. SOURCE Mouse (strain Balb.b) neonatal Id+ hybridoma 29-1G5 DNA. 
ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 339) AUTHORS Borriero,L., Giorgetti,C.A., Smith,G., Landry,D., Selsing,E., Zhukovsky,E. and Press,J.L. TITLE Neonatal and adult primary B cells use the same germ-line V-H and V-kappa genes in their (T,G)-A-L-specific repertoire JOURNAL J. Immunol. 144, 583-592 (1990) STANDARD simple staff_review FEATURES from to/span description pept < 1 > 339 Ig kappa-chain V1A-J2-region (AA at 1) recomb 300 301 V1A-region end/J2-region start BASE COUNT 88 a 84 c 83 g 82 t 2 others ORIGIN Chromosome 6. 1 gatgttgtga tgacccaaac tccnctctcc ctgcctgtca gtcttggaga tcaagcctcc 61 atctcttgca gatctagtca gagccttgta cacagtaatg gaaacaccta tttacattgg 121 tacctgcaga agccaggcca gtctccaaag ctcctgatct acaaagtttc caaccgattt 181 tctggggtcc cagacaggtt cagtggcagt ggatcaggga cagatttcac actcaagatc 241 agcagagngg aggctgagga tctgggagtt tatttctgct ctcaaagtac acatgttccg 301 tacacgttcg gaggggggac caagctggaa ataaaacgg // LOCUS M36836 2651 bp - UNA 24-JUL-1990 DEFINITION Figure 1. Nucleotide sequence of the characterized locust mtDNA segment. ACCESSION M36836 REFERENCE 1 (bases 1 to 2651) AUTHORS Haucke,H.-R. and Gellissen,G. TITLE Different mitochondrial gene orders among insects: exchanged tRNA gene positions in the COII/COIII region between an orthopteran and a dipteran species JOURNAL Curr. Genet. 
14, 471-476 (1988) STANDARD unannotated staff_entry BASE COUNT 1034 a 419 c 303 g 895 t ORIGIN 1 cacagatact cagaactacc actaattaat ttatagattc taatatggca gattaatgca 61 ttagatttaa gctctaaaaa taaagttttg accttttatt agaaaaatat ggcaacatga 121 tcaaacttat cattacaaga tggagcttca ccattaatag aacaattatc attctttcat 181 gatcatacaa taattgatct attattaatc acaataattg taggatattc gcttagatat 241 atattattaa caaaatatac aaaccgaaat atacttcatg gtcatttaat tgaaactatc 301 tgaacagcac taccagcaat tacattaatc tttattgcat taccatcatt acgattacta 361 tatttacttg atgattcatc agatgctata attacaatta aaacaattgg acgacaatga 421 tactgaagtt atgaatattc agatttcatt aatgtagaat ttgatacata tataactcct 481 gaaaatgaat taaatacaga tgaattccga ctcttagaag ttgataatcg aacaacatta 541 cctataaata cagaagtacg agtattaact agagcatctg atgtactcca ctcatgagca 601 gtaccagctt tagtattaaa aattgacgca acacccggac gactaaacca aggtatattt 661 ataattaacc gcccaggtct atttttcggt caatgctcag aaatctgtgg agctaatcat 721 agatttatac caattgtaat tgaaagaaca tcaattaaac ttttcattaa atgattatct 781 aacataatat aaggagttag ttaaaatata acattagaat gtcaatctaa aataactaaa 841 aatagtacac cttgaaccat cagatgactg aaagtaagta atggtctctt aaaccaaaaa 901 atagtaaatt aacgtttact tctgatgagg taaattaaaa acttaatccc tcaaatatca 961 ccaataatat gattttcact attcattata ttctcaataa caataatatt atttaatcaa 1021 ttaaacttct tctcttataa accaaataaa attataagaa gtaataataa aattaaaaaa 1081 aaaaacatta attgaatatg ataacaaatc tattctcaac attcgaccca tcaactaatt 1141 tatttaattt atcattaaat tgaactagaa cattccttgg actattatta attccatcaa 1201 tattttgatt aataccttct cgaattaata ttttatgaaa taaaataaat ctaaacttac 1261 ataatgaatt taaaacatta ttaggaaaaa actcatttca aggatcaaca ctaattctaa 1321 tttcaatttt tatcataata ttatttaaca attttatagg attattccca tacattttta 1381 caagaactag tcatataaca ttaacattct caattgcatt accaatatga ataagattta 1441 tattatttgg atgaattaat aatactaatc atatatttac acatcttgtc ccacaaggaa 1501 caccaaatgc acttatatca tttatggttc taattgaaac aattagaaat gttattcggc 1561 cagggaccct agctgtacga ttagctgcaa atatgattgc aggacatctt ttattaacat 1621 
tattaggaaa tacaggaccg tcactaacaa caagaattat attatttcta attattggac 1681 aaatattact tttaattctt gaatcagccg tagctataat tcaagcatac gtattctcaa 1741 ttctaagaac attatattca agagaagttt actaaactta tgttaacaaa taacaacaat 1801 cacccatttc atatagtaga ttatagacca tgaccattaa ctggagcaat cggagcaata 1861 attctaacct caggaataac taaatgattt catacattca atataaattt attaataatt 1921 ggaataactg tcattgtatt aacaataatt caatggtgac gagatgtagt tcgagaaggt 1981 acatttcaag gtctacacac taaattagta tcaaaaggac tgcgatgagg aataatttta 2041 tttattgcat cagaagtttt atttttcgcc tcattcttct gagcattctt taatagtaga 2101 ttagcaccaa caattgaact aggaataaaa tgacctccaa taggaattca accatttaat 2161 ccaatccaaa ttccactact taatactgca attttactag cgtcaggagt tacaattaca 2221 tgagcacatc acagaatcat agaatgtaat cattcacaag cattacaagg attattcttt 2281 acagtaatat taggatttta ttttacttta ttacaaatat acgaatattg agaagcacct 2341 tttaccattg cagacgcagt atatggatct acattcttcg tagcaacagg attccatggg 2401 ctacatgtaa tcattggtac aacattctta ttaacatgcc taattcgaca tataataaac 2461 caattctcat ctaatcacca ctttggattt gaagctgcag catgatactg acactttgtt 2521 gatgtagtat gattattttt atatttatca atttactgat gaggtagata attaattttt 2581 ctagtataat tagtacattt gacttccaat caaaagattg aaatatcaag aaaaataatt 2641 ataatattaa t // LOCUS ECOFPP 1950 bp ds-DNA BCT 24-JUL-1990 DEFINITION E.coli dipeptide transport protein (fpp) gene, complete cds. ACCESSION M35045 KEYWORDS fpp gene; periplasmic protein. SOURCE E.coli (strain K12; isolate E1222) DNA, clone lambda-10. ORGANISM Escherichia coli Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Facultatively anaerobic rods; Enterobacteriaceae. REFERENCE 1 (bases 1 to 1950) AUTHORS Olson,E.R., Dunyak,D.S., Jurss,L.M. and Poorman,R.A. TITLE Identification and characterization of fpp, an E.coli gene involved in dipeptide transport JOURNAL Unpublished (1990) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by E.R.Olson, 04-JUN-1990. 
Author address: E.R.Olson Molecular Biology Research, 7242-209-7 The UpJohn Company Kalamazoo, MI 49001 FEATURES from to/span description pept 147 1754 dipeptide transport protein (fpp) precursor sigp 147 230 dipeptide transport protein signal peptide matp 231 1751 dipeptide transport protein site 1792 1888 REP sequence BASE COUNT 548 a 499 c 491 g 412 t ORIGIN 79.5 min on the K12 map. 1 ggatccgcac tgttacactg atgttaatta gtacggcatc cccacctcat aacgttgacc 61 cgaccgggca aaaaacaaaa aaggtcaggc agcgacaacc cactgcaaag ggttaaaaca 121 acaaacatca caattggagc agaataatgc gtatttcctt gaaaaagtca gggatgctga 181 agcttggtct cagcctggtg gctatgaccg tcgcagcaag tgttcaggct aaaactctgg 241 tttattgctc agaaggatct ccggaagggt ttaacccgca gctgtttacc tccggcacca 301 cctatgacgc ctcttccgtc ccgctttata accgtctggt tgaatttaaa atcggcacca 361 ccgaagtgat cccgggcctc gctgaaaagt gggaagtcag cgaagacggt aaaacctata 421 ccttccatct gcgtaaaggt gtgaagtggc acgacaataa agaattcaaa ccgacgcgtg 481 aactgaacgc cgatgatgtg gtgttctcgt tcgatcgtca gaaaaacgcg caaaacccgt 541 accataaagt ttctggcggc agctacgaat acttcgaagg catgggcttg ccagagctga 601 tcagtgaagt gaaaaaggtg gacgacaaca ccgttcagtt tgtgctgact cgcccggaag 661 cgccgttcct cgctgacctg gcaatggact tcgcctctat tctgtcaaaa gaatatgctg 721 atgcgatgat gaaagccggt acaccggaaa aactggacct caacccaatc ggaaccggtc 781 cgttccagtt acagcagtat caaaaagatt cccgtatccg ctacaaagcg tttgatggct 841 actggggcac caaaccgcag atcgatacgc tggttttctc tattacccct gacgcttccg 901 tgcgttacgc gaaattgcag aagaatgaat gccaggtgat gccgtacccg aacccggcag 961 atatcgctcg catgaagcag gataaatcca tcaatctgat ggaaatgccg gggctgaacg 1021 tcggttatct ctcgtataac gtgcagaaaa aaccactcga tgacgtgaaa gttcgccagg 1081 ctctgaccta cgcggtgaac aaagacgcga tcatcaaagc ggtttatcag ggcgcgggcg 1141 tatcagcgaa aaacctgatc ccgccaacca tgtggggcta taacgacgac gttcaggact 1201 acacctacga tcctgaaaaa gcgaaagcct tgctgaaaga agcgggtctg gaaaaaggtt 1261 tctccatcga cctgtgggcg atgccggtac aacgtccgta taacccgaac gctcgccgca 1321 tggcggagat gattcaggca gactgggcga aagtcggcgt gcaggccaaa 
attgtcacct 1381 acgaatgggg tgagtacctc aagcgtgcga aagatggcga gcaccagacg gtaatgatgg 1441 gctggactgg cgataacggg gatccggata acttcttcgc caccctgttc agctgcgccg 1501 cctctgaaca aggctccaac tactcaaaat ggtgctacaa accgtttgaa gatctgattc 1561 aaccggcgcg tgctaccgac gaccacaata aacgcgttga actgtacaaa caagcgcagg 1621 tggtgatgca cgatcaggct ccggcactga tcatcgctca ctccaccgtg tttgaaccgg 1681 tacgtaaaga agttaaaggc tatgtggttg atccattagg caaacatcac ttcgaaaacg 1741 tctctatcga ataattaaaa gccatacaag actgatggca aaggcaaaaa tgcctgatgc 1801 gctccgctta tcaggcctac gaaaattctg caatgtattg aatttgcacg attttgtagg 1861 ccggataagg cgttaacgcg catccggcat aaacaaagcg cactttgtca acaatctgta 1921 tacccggtgg cgctgtgcct ctgcgtgcag // LOCUS ECOSHL 1820 bp ds-DNA BCT 24-JUL-1990 DEFINITION E.coli shl gene, complete cds, and acetolactate synthase II (ilvH) gene, 3' end. ACCESSION M35034 KEYWORDS acetolactate synthase. SOURCE E.coli (strain K12) DNA. ORGANISM Escherichia coli Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Facultatively anaerobic rods; Enterobacteriaceae. REFERENCE 1 (bases 1 to 1820) AUTHORS Leclerc,G., Noel,G. and Drapeau,G. TITLE Molecular cloning, nucleotide sequence and expression of shl JOURNAL J. Bacteriol. (1990) In press STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by G.Drapeau, 01-JUN-1990. FEATURES from to/span description pept < 1 105 acetolactate synthase (ilvH) (AA at 1) pept 117 203 leader protein pept 285 1289 shl protein signal 66 73 -35 region signal 89 95 -10 region BASE COUNT 446 a 433 c 463 g 478 t ORIGIN 1 bp upstream HindIII site. 
1 aagcttgatg catttttagc atcgattcgc gatgtggcga aaattgtgga ggttgctcgc 61 tctggtgtgg tcggactttc gcgcggcgat aaaataatgc gttgagaatg atctcaatgc 121 gcaatttaca gcccaacatg tcacgttggg ctttttttgc gaaatcagtg ggaacctgga 181 ataaaagcag ttgccgcagt taattttctg cgcttagatg ttaatgaatt taacccatac 241 cagtacaatg gctatggttt ttacatttta cgcaaggggc aattgtgaaa ctggatgaaa 301 tcgctcggct ggcgggagtg tcgcggacca ctgcaagcta tgttattaac ggcaaagcga 361 agcaataccg tgtgagcgac aaaaccgttg aaaaagtcat ggctgtggtg cgtgagcaca 421 attaccaccc gaacgccgtg gcagctgggc ttcgtgctgg acgcacacgt tctattggtc 481 ttgtgatccc cgatctggag aacaccagct atacccgcat cgctaactat cttgaacgcc 541 aggcgcggca acggggttat caactgctga ttgcctgctc agaagatcag ccagacaacg 601 aaatgcggtg cattgagcac cttttacagc gtcaggttga tgccattatt gtttcgacgt 661 cgttgcctcc tgagcatcct ttttatcaac gctgggctaa cgacccgttc ccgattgtcg 721 cgctggaccg cgccctcgat cgtgaacact tcaccagcgt ggttggtgcc gatcaggatg 781 atgccgaaat gctggcggaa gagttacgta agtttcccgc cgagacggtg ctttatcttg 841 gtgcgctacc ggagctttct gtcagcttcc tgcgtgaaca aggtttccgt actgcctgga 901 aagatgatcc gcgcgaagtg catttcctgt atgccaacag ctatgagcgg gaggcggctg 961 cccagttatt cgaaaaatgg ctggaaacgc atccgatgcc gcaggcgctg ttcacaacgt 1021 cgtttgcgtt gttgcaagga gtgatggatg tcacgctgcg tcgcgacggc aaactgcctt 1081 ctgacctggc aattgccacc tttggcgata acgaactgct cgacttctta cagtgtccgg 1141 tgctggcagt ggctcaacgt caccgcgatg tcgcagagcg tgtgctggag attgtcctgg 1201 caagcctgga cgaaccgcgt aagccaaaac ctggtttaac gcgcattaaa cgtaatctct 1261 atcgccgcgg cgtgctcagc cgtagctaag ccgcgaacaa aaatacgcgc caggtgaatt 1321 tccctctggc gcgtagagta cgggactgga catcaatatg cttaaagtaa ataagactat 1381 tcctgactat tattgataaa tgcttttaaa cccgcccgtt aattaactca ccagctgaaa 1441 ttcacaataa ttaagtgata tcgacagcgc gtttttgcat tattttgtta catgcggcga 1501 tgaattgccg atttaacaaa cacttttctt tgcttttgcg caaacccgct ggcatcaagc 1561 gccacacaga cgtaacaagg actgttaacc ggggaagata tgtcctaaaa tgccgctcgc 1621 gtcgcaaact gacactttat atttgctgtg gaaaatagtg agtcatttta aaacggtgat 1681 gacgatgagg gattttttct 
tacagctatt cataacgtta atttgcttcg cacgttggac 1741 gtaaaataaa caacgctgat attagccgta aacatcgggt tttttacctc ggtatgcctt 1801 gtgactggct tgacaagctt // LOCUS HUMATCT4A 1742 bp ds-DNA PRI 24-JUL-1990 DEFINITION Human T4 surface glycoprotein CD4 gene, complete cds. ACCESSION M35160 KEYWORDS glycoprotein; immunoglobulin supergene family; recognition antigen; surface antigen. SOURCE Human (haplotype OKT4-negative) DNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 1742) AUTHORS Hodge,T.W., Sasso,D.R., Steve,J. and McDougal,S. TITLE Humans with OKT4-epitope deficiency have a single base change in the CD4 gene resulting in substitutions of TRP-240 for ARG-240 JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by T.W.Hodge, 12-JUN-1990. A cytosine to thymidine transition at nucleotide position 868 results in substitution of TRP-240 for ARG-240. Author address: T.W.Hodge; A-25 Bldg 1 Rm 1226 Centers for Disease Control 1600 Clifton Rd. 
Atlanta, GA 30333 FEATURES from to/span description pept 76 1452 T4 surface glycoprotein precursor sigp 76 150 T4 surface glycoprotein signal peptide matp 151 1449 T4 surface glycoprotein mature peptide mut 792 792 g in OKT4 epitope deficiency; c in wt BASE COUNT 405 a 491 c 466 g 380 t ORIGIN 1 caagcccaga gccctgccat ttctgtgggc tcaggtccct actgctcagc cccttcctcc 61 ctcggcaagg ccacaatgaa ccggggagtc ccttttaggc acttgcttct ggtgctgcaa 121 ctggcgctcc tcccagcagc cactcaggga aagaaagtgg tgctgggcaa aaaaggggat 181 acagtggaac tgacctgtac agcttcccag aagaagagca tacaattcca ctggaaaaac 241 tccaaccaga taaagattct gggaaatcag ggctccttct taactaaagg tccatccaag 301 ctgaatgatc gcgctgactc aagaagaagc ctttgggacc aaggaaactt ccccctgatc 361 atcaagaatc ttaagataga agactcagat acttacatct gtgaagtgga ggaccagaag 421 gaggaggtgc aattgctagt gttcggattg actgccaact ctgacaccca cctgcttcag 481 gggcagagcc tgaccctgac cttggagagc ccccctggta gtagcccctc agtgcaatgt 541 aggagtccaa ggggtaaaaa catacagggg gggaagaccc tctccgtgtc tcagctggag 601 ctccaggata gtggcacctg gacatgcact gtcttgcaga accagaagaa ggtggagttc 661 aaaatagaca tcgtggtgct agctttccag aaggcctcca gcatagtcta taagaaagag 721 ggggaacagg tggagttctc cttcccactc gcctttacag ttgaaaagct gacgggcagt 781 ggcgagctgt ggtggcaggc ggagagggct tcctcctcca agtcttggat cacctttgac 841 ctgaagaaca aggaagtgtc tgtaaaatgg gttacccagg accctaagct ccagatgggc 901 aagaagctcc cgctccacct caccctgccc caggccttgc ctcagtatgc tggctctgga 961 aacctcaccc tggcccttga agcgaaaaca ggaaagttgc atcaggaagt gaacctggtg 1021 gtgatgagag ccactcagct ccagaaaaat ttgacctgtg aggtgtgggg acccacctcc 1081 cctaagctga tgctgagctt gaaactggag aacaaggagg caaaggtctc gaagcgggag 1141 aaggcggtgt gggtgctgaa ccctgaggcg gggatgtggc agtgtctgct gagtgactcg 1201 ggacaggtcc tgctggaatc caacatcaag gttctgccca catggtccac cccggtgcag 1261 ccaatggccc tgattgtgct ggggggcgtc gccggcctcc tgcttttcat tgggctaggc 1321 atcttcttct gtgtcaggtg ccggcaccga aggcgccaag cagagcggat gtctcagatc 1381 aagagactcc tcagtgagaa gaagacctgc cagtgccctc accggtttca gaagacatgt 1441 agccccattt 
gaggcacgag gccaggcaga tcccacttgc agcctcccca ggtgtctgcc 1501 ccgcgtttcc tgcctgcgga ccagatgaat gtagcagatc ccacgctctg gcctcctgtt 1561 cgtcctccct acaatttgcc attgtttctc ctgggttagg ccccggcttc actggttgag 1621 tgttgctctc tagtttccag aggcttaatc acaccgtcct ccacgccatt tccttttcct 1681 tcaagcctag cccttctctc attatttctc tctgaccctc tccccactgc tcatttggat 1741 cc // LOCUS HUMFUSED 555 bp ds-DNA SYN 24-JUL-1990 DEFINITION Human interleukin-1beta/growth hormone fusion protein gene, complete cds. ACCESSION M35049 KEYWORDS growth hormone; interleukin-1-beta. SOURCE Human synthetic DNA. ORGANISM Artificial gene Artificial sequences; Genes. REFERENCE 1 (bases 1 to 555) AUTHORS Pecceu,R., Dousset,P., Shire,D., Cavrois,E., Marchese,E., Ferrara,P., Kaghad,M., Dumont,X. and Lupker,J. TITLE Human interleukin-1-beta fused to the human growth hormone signal peptide is N-glycosylated and secreted by Chinese hamster ovary cells JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by F.Pecceu, 08-JUN-1990. Author address: F.Pecceu SANOFI ELF BIO-RECHERCHES LABEGE INNOPOLE B.P. 
137 31328 LABEGE CEDEX FRANCE FEATURES from to/span description pept 11 550 human growth hormone/interleukin--1beta fusion protein sigp 11 88 human growth hormone signal peptide matp 89 547 interleukin-1-beta BASE COUNT 154 a 142 c 135 g 124 t ORIGIN 1 agcttccgcc atggctacag gctcccggac gtccctgctc ctggcttttg gcctgctctg 61 cctgccctgg cttcaagagg gcagtgccgc acctgtaaga tctctgaact gcacgctccg 121 ggactcacag caaaagagct tggtgatgtc tggtccatat gaactgaaag ctctccacct 181 gcagggacag gatatggagc aacaagtggt gttctccatg tcctttgtcc aaggagaaga 241 aagtaatgac aaaatacctg tggcgttggg cctcaaggaa aagaatctgt acttaagctg 301 cgtgttgaaa gatgataagc ccactctaca gctggagagt gtagacccca aaaattaccc 361 aaagaagaag atggaaaagc gatttgtctt caacaagata gaaatcaata acaagctgga 421 atttgagtct gcccagttcc ccaactggta catcagcacc tctcaagcag aaaacatgcc 481 cgtcttcctg ggtggtacca aaggcggcca ggatataact gacttcacca tgcaatttgt 541 gagctcctag gatcc // LOCUS HUMGLYA 1200 bp ds-DNA PRI 24-JUL-1990 DEFINITION Human Miltenberger V-like glycophorin gene, exons 3 and 4. ACCESSION J05563 KEYWORDS glycophorin A; transmembrane protein. SOURCE Human leukocyte DNA, clone Mi51. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 1200) AUTHORS Kudo,S., Chagnovich,D., Rearden,A., Mattei,M.-G. and Fukuda,M. TITLE Molecular analysis of a hybrid gene encoding human glycophorin variant Miltenberger V-like molecule JOURNAL J. Biol. Chem. (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by S.Kudo and M.Fukuda, 04-JUN-1990. 
FEATURES from to/span description pept / 81 176 Miltenberger V-like glycophorin, exon 3 (AA at 81) 1110 / 1148 Miltenberger V-like glycophorin, exon 4 IVS < 1 80 Miltenberger V-like glycophorin intron B IVS 177 1109 Miltenberger V-like glycophorin intron C site 1079 1116 crossing-over site BASE COUNT 387 a 236 c 212 g 365 t ORIGIN 1 agatttgtct ttcatgatac gctctatgtc cacgcagtca cctcattctt gacccctttc 61 tcaacttctc ttatatgcag atacgcacaa acgggacaca tatgcagcca ctcctagagc 121 tcatgaagtt tcagaaattt ctgttagaac tgtttaccct ccagaagagg aaaccggtat 181 gttcttagtt ttaaatagtt gctctggagt cattgttgtg attgaactct gtttacacga 241 gctgtaactc atgacagttc tcaagctttc gtgacagaaa acccatctct tttactccaa 301 agcccatata gcacccacaa ctattaactg tgaccaagaa agagaaggca agccccaatt 361 aacctttgta cgtaaagcct aaagaatgaa aaaatatacc tgaatcctca atcatcaaac 421 agcatagtat atactaagta atttgtaata attaaactct agaaaattgt gtggcttcgg 481 tagtaagaga gcttcatgac ataaaatggc aagtggagac agagacaaaa gtaggatgtg 541 gactgagagg gaaggttagc acaggtggaa cagtaaggca accatactat caattgctac 601 tgacatagaa tccagagaga ctattggcaa aagctcaaat gagacacagt aacagtttag 661 attcagacag tggctgtggc ataaatcaga aaattgatag ccgcatgatc cctctttgca 721 tgggactggc atctgtgtgg agtaatggtt ccatatgcct cctttcttct ccttattttt 781 aaatttttta aaaatgcatt gcttcttgtg gaagtcaata agtgattctt ccaatacttt 841 ctcattcctt ccccctcagt tatgagacaa tttgcttatt tctcatccat gaatacttgt 901 tgggtcatta aaagtagata ctgaaattac taatggtacg actgacatat tacctcataa 961 atgttactag ctagatgttg aaagttgacc aacaactctc aaaatatgat taagaaaagg 1021 aaacccacag aacagtttga ttccaaaatg atttttttct ttgcacatgc cttacttatt 1081 tggacttaca ttgaaatttt gctttatagg agaaatggga caacttgtcc atcgtttcac 1141 tgtaccaggt atgttaatat ttgacaaaga ataaaagtca ttccatttta aactatccat // LOCUS HUMMHDRB 1172 bp ds-DNA PRI 24-JUL-1990 DEFINITION Human MHC class II HLA-DR-beta DR2 gene, complete cds. ACCESSION M35159 KEYWORDS cell surface glycoprotein; class II gene; integral membrane protein; major histocompatibility complex. 
SOURCE Human (black with haplotype DR2) DNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 1172) AUTHORS Demopulos,J.T., Hodge,T.W., Wooten,V. and Acton,R.T. TITLE A novel DRB1 allele in DR2-positive American blacks JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by T.W.Hodge, 12-JUN-1990. A thymidine to cytosine transition at nucleotide position 88 results in an amino acid change from tyrosine to histidine at residue 30. Author address: T.W.Hodge; A-25 Bldg 1 Rm 1226 Centers for Disease Control 1600 Clifton Rd. Atlanta, GA 30333 FEATURES from to/span description pept 6 806 MHC HLA-DR-beta chain precursor /hgml_locus_uid="LV0063D" /nomgen="HLA-DRB1" /map="6p21.3" sigp 6 92 MHC HLA-DR-beta chain signal peptide matp 93 803 MHC-HLA-DR-beta chain BASE COUNT 272 a 315 c 324 g 261 t ORIGIN 1 ccagcatggt gtgtctgaag ctccctggag gttcctacat ggcaaagctg acagtgacac 61 tgatggtgct gagctcccca ctggcttcgg ctggggacac ccgaccacgt ttcttgcagc 121 aggataagta tgagtgtcat ttcttcaacg ggacggagcg ggtgcggttc ctgcacagag 181 acatctataa ccaagaggag gacttgcgct tcgacagcga cgtgggggag taccgggcgg 241 tgacggagct ggggcggcct gacgctgagt actggaacag ccagaaggac ttcctggaag 301 acaggcgcgc cgcggtggac acctactgca gacacaacta cggggttggt gagagcttca 361 cagtgcagcg gcgagttgag cctaaggtga ctgtgtatcc tgcaaggacc cagaccctgc 421 agcaccacaa cctcctggtc tgctctgtga atggtttcta tccaggcagc attgaagtca 481 ggtggttccg gaacagccag gaagagaagg ctggggtggt gtccacaggc ctgattcaga 541 atggagactg gaccttccag accctggtga tgctggaaac agttcctcga agtggagagg 601 tttacacctg ccaagtggag cacccaagcg tgacgagccc tctcacagtg gaatggagag 661 cacagtctga atctgcacag agcaagatgc tgagtggagt cgggggcttt gtgctgggcc 721 tgctcttcct tggggccggg ctattcatct acttcaagaa tcagaaaggg cactctggac 781 ttcacccaac aggactcgtg agctgaagtg cagatgacca cattcaaggg ggaaccttct 841 gccccagctt 
tgcatgatga aaagctttcc tgcttggctc ttattcttcc acaagagagg 901 actttctcag gccctggttg ctaccggttc agcaactctg cagaaaatgt ccatccttgt 961 ggcttcctca gctcctgccc cttggcctga agtcccagca ttgatggcag tgcctcatct 1021 tcaactttag tgctcccctt tacctaaccc tacggcctcc catgcatctg tactccccct 1081 gtgtgccaca aatgcactac gttattaaat ttttctgaag cccagagtta aaaatcatct 1141 gtccacctgg ctccaaagac aaaaaataaa aa // LOCUS MUM3TER 55 bp ss-RNA VRL 24-JUL-1990 DEFINITION Mumps virus genome 3' terminus mRNA. ACCESSION M35284 KEYWORDS . SOURCE Mumps virus (strain SBL-1), cDNA to mRNA, passed in Vero cells. ORGANISM Mumps virus Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses; Paramyxoviridae; Paramyxovirus. REFERENCE 1 (bases 1 to 55) AUTHORS Elango,N., Varsanyi,T.M., Koevamees,J. and Norrby,E. TITLE Molecular cloning and characterization of six genes, determination of gene order and intergenic sequences and leader sequence of mumps virus JOURNAL J. Gen. Virol. 69, 2893-2900 (1988) STANDARD simple staff_review BASE COUNT 9 a 17 c 4 g 24 t 1 others ORIGIN 1 nggttcccct ctttcttcta ccctataacc atcttgttta tcacattctt tgtca // LOCUS MUSIGCR 2210 bp ds-DNA ROD 24-JUL-1990 DEFINITION Murine Ig germline membrane-bound gamma 2a-chain gene C region, exon 2. ACCESSION M35032 KEYWORDS constant region; gamma-immunoglobulin; germline; immunoglobulin heavy-chain. SOURCE Murine (strain BALB/c) myeloma cell line RPC 5 DNA, clone CH.M.IgG2a-11. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 2210) AUTHORS Hall,B. and Milcarek,C. TITLE Sequence and polyadenylation site determination of the murine immunoglobulin gamma 2a membrane 3' UT region JOURNAL Mol. Immunol. 26, 819-826 (1989) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by B.L.Hall, 08-JUN-1990. 
FEATURES from to/span description pept / 37 120 IgG-2a chain, exon 2 pre-msg < 1 1682 IgG-2a chain mRNA (alt.) pre-msg < 1 1759 IgG-2a chain mRNA (alt.) IVS < 1 36 IgG-2a chain intron A (no splice consensus) signal 1506 1511 major Poly-A signal signal 1585 1590 minor Poly-A signal rpt 1270 1305 (GA)17 repeats rpt 1395 1440 (CT)23 repeats site 1536 1543 McLauchlan consensus sequence site 1552 1559 McLauchlan consensus sequence site 1625 1633 McLauchlan consensus sequence site 1536 1540 Berget consensus sequence BASE COUNT 596 a 543 c 557 g 514 t ORIGIN 1 aagagaagcg ccaagagcac agcgccgact cctaccgtaa agtggatctt ctcctctgtg 61 gtggagctga agcagacgat ctcccctgac tacagaaaca tgattgggca gggagcctag 121 gccacttcct ctgggatcag aagagcttcc taggccctgc agaagcccat ccatcctact 181 gtgcagccta acagggaggc cacactctag ccctatgact ctctgatcag aactcccatg 241 gtctcctctt tggaggacca cgtgcagtgc aggctttgcc cagacctaaa cacttccaca 301 gcagtcgcca gatatctaac tactccggac cagaagaacc atctccttcc aaaccagcac 361 tagggatctg agatctcaga atgtttgcct aagaagagct ggaaatccag gcttcctgtg 421 ttctgctaca aggacatcag cctggatttg acctggacca cacattttca tctaaatgag 481 ttttccacaa aggacacgtt tcagatcctt gaatgagacc tctacatgga agaccagagt 541 cactataccc aaaggtcact ctgtatcctt gcaccagcta tactggacag cttccttcct 601 ggtacttcag tgaccctggc tgaggaaagg atctgtgacc tcaactgttt ggagagcctc 661 tggaagatgt agtcttctct tcctgctacc accaacatgc tggatctcag atgcagaatc 721 caatccacag acaccactga ccacacaacc tgaagacaag gccattgcca cctccacaga 781 gatgccatcc acactctgtg gagaaataag gagtgctttg tgcagcctct gcaaagctct 841 ggcagggatt agagtataca cactgagtac tgactaggtg accaggcaga aaaacctcca 901 ggagaaggaa caatggggga gagatgtgaa cagatagtta gaaaaagcat ggtgtcacag 961 gtctgctctg tggactgatt tccagattgg accacctaca gcagaaacca tcggttgcag 1021 tggcaatcta ggaggaccaa cctggaatag gagggctgct gtggtcaatg gagagtagac 1081 ctgtatctat ttctccactg cctcttatga ccaataagaa gccagagtct ccagacagaa 1141 agaaagaaag aaagaaagaa agaaagaaag aaagaaagag agagagagag agagagagag 1201 agagaggaag gaaggaagga aggaaggaag 
gaaggaagga aggaggagga ggaggaggag 1261 gaggaggagg agagagagag agagagagag agagagagag agagagcacc agcttttctg 1321 tgactggaag gaaatgctta gagagcttgg atctttaaag cttctttttt ctagagacca 1381 tgaatgtctt tgttctctct ctctctctct ctctctctct ctctctctct ctctctctct 1441 gtgtgtgtgt gtgtgtgtgt gtgtgtgtgt gcgtgcatgc acgctattgt tttggcattt 1501 gaaacaataa aacattcttt taatattctg tatctcatgg ttccccttct gtgtggatca 1561 gccctaacac ccaggaacag ggacaataaa cagaccacag ccatgtacag ccttctacct 1621 cccttctggt tctgacctcc cagaggtccc tcagtgggcc cctcacagct gggtttcttc 1681 cctggcagtg ccaccaagag ctcaggcacc tctgagctgg aggctgtcct gatgccatag 1741 gcaggctatg gagcagagat gatgaccacg gtgactcagg tgagccaggc aaagcctccc 1801 atgccagaag agaagcgtgt ggtactcact ggcctcgggc tgctacggat tcagcaaaga 1861 gcatggatcc agctacctcc atagtggatg cttattgtct caagctgtgt tcagagaaat 1921 ttcagatcag gtattaatag ggtgggttct ccttacctgg taggtgcctt ctcccatggc 1981 acctcacagg ctctccatct gtgtgtctgg gtcctgatct cttctcataa gtacacagac 2041 aggctggaat ggtacaacct ggcctcatta taacttacca gttatgatcc tatctgcaag 2101 gcaggctcag tctgaggtct ggcggcttaa atgacaattg gtgtgtcttc tagaacagaa 2161 tttagtccac attagcgcgc ctgtggacca cagtatatct ctctggcctc // LOCUS YSCARF2 1012 bp ds-DNA PLN 24-JUL-1990 DEFINITION S.cerevisiae ADP-ribosylation factor 2 (ARF2) gene, complete cds. ACCESSION M35158 KEYWORDS ADP-ribosylation factor 2. SOURCE S.cerevisiae DNA. ORGANISM Saccharomyces cerevisiae Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes; Endomycetales; Saccharomycetaceae. REFERENCE 1 (bases 1 to 1012) AUTHORS Stearns,T., Hoyt,M.A., Botstein,D. and Kahn,R.A. TITLE ADP-ribosylation factor is an essential protein in yeast and is encoded by two genes JOURNAL Unpublished (1990) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by R.A.Kahn, 05-JUN-1990. Author address: R.A.Kahn Lab. of Biol. Chem., DCT Bldg. 
37; Room 5D-02
            National Cancer Institute
            Bethesda, MD 20892
FEATURES       from  to/span    description
    pept        266    811      ADP-ribosylation factor 2 (ARF2)
    binding      24     30      consensus GTP-binding site
    binding      67     71      consensus GTP-binding site
    binding     126    129      consensus GTP-binding site
BASE COUNT      301 a    180 c    222 g    309 t
ORIGIN
        1 gcctgcaggt acgccccttt atttgatcag gaagccgtat tgattatcta atagggccta
       61 gttatcctaa ttgtggggag tcgagcagta cggctctgat gtttttcgaa cgaagataag
      121 gagttgacat acaaagtcaa cagaagttct tcttgttagc gtctctgtgc tcaatatctc
      181 tctttttttc tttaagtagt aattactaac atcagccaac caatagagat aaaaaaaaaa
      241 ggaattaaga tttcatagag aaaagatggg tctatacgct tctaagttat tcagcaatct
      301 ttttggcaac aaagagatgc gtatacttat ggttggtcta gatggtgccg gtaagaccac
      361 cgttttgtac aagttgaagt tgggcgaagt tatcactacc attccaacca ttggtttcaa
      421 cgttgagact gtccaatata agaacatttc cttcactgtc tgggacgtcg gtggacaaga
      481 caggattaga tctttatgga gacactacta cagaaacacc gaaggtgtta tttttgtcat
      541 cgattccaac gatagatcgc gtattggtga agccagagaa gtcatgcaga gaatgctgaa
      601 tgaagatgaa ttgagaaatg ctgtctggtt agtcttcgct aacaaacaag atttgccaga
      661 agccatgtct gctgctgaaa tcaccgaaaa attaggttta cattctatta gaaaccgtcc
      721 atggtttatc cagtctactt gtgcaacctc gggtgaaggt ctgtacgaag gtctggagtg
      781 gttaagcaac aacttgaaga atcaatccta atctaaatct gtatagaacg tttagtcatg
      841 cgcaccttgt gtgttttgtt tctagattgt tttattttta tgattgttga agatataaac
      901 cactgtatag ttgtataaga taggataatg atggtgcact gaaaataaac ttactagctc
      961 tttaatattg caacggcttg taacgggcga ctctagagga tccccgggta cc
//
LOCUS       CODCPRRKA      94 bp ss-RNA             RNA       24-JUL-1990
DEFINITION  Codium fragile chloroplast 4.5S RNA, complete cds.
ACCESSION   M35276 M15192
KEYWORDS    4.5S RNA.
SOURCE      C.fragile chloroplast RNA.
  ORGANISM  Chloroplast Codium fragile
            Eukaryota; Plantae; Thallobionta; Chlorophycota; Chlorophyceae;
            Brypsidales; Codiaceae; Codium fragile.
REFERENCE   1  (bases 1 to 94)
  AUTHORS   Francis,M.A., Balint,R.F. and Dudock,B.S.
  TITLE     A novel variety of 4.5 S RNA from Codium fragile chloroplasts
  JOURNAL   J. Biol. Chem.
262, 1848-1854 (1987)
  STANDARD  simple staff_review
FEATURES       from  to/span    description
    RNA           1     94      4.5 S RNA
BASE COUNT       35 a     11 c     14 g     34 t
ORIGIN
        1 aagtcctagt tgctataaat tcttaaatca aattatgtca gatttttaat aaaaagcagc
       61 atttgtattt gaaaattgtt taggaactag gcac
//
LOCUS       HAMAPBRBD    2339 bp ds-DNA             ROD       24-JUL-1990
DEFINITION  Hamster apolipoprotein (apoB) gene, partial cds (LDL
            receptor-binding domain).
ACCESSION   M35187
KEYWORDS    apolipoprotein B.
SOURCE      Hamster DNA.
  ORGANISM  Mesocricetus auratus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
            Theria; Eutheria; Rodentia; Myomorpha; Cricetidae; Cricetinae;
            Cricetini.
REFERENCE   1  (bases 1 to 2339)
  AUTHORS   Smith,T.J., Hautamaa,D. and Maeda,N.
  TITLE     Sequence of the putative low-density lipoprotein
            receptor-binding regions of apolipoprotein B in mouse and
            hamster
  JOURNAL   Gene 87, 309-310 (1990)
  STANDARD  simple staff_review
COMMENT     Phone call to T.J.Smith on 26-JUN-1990 made sequence
            clarifications on line 4 and line 17 of printed sequence. The
            hamster sequence should be 1 bp to the left on these lines.
FEATURES       from  to/span    description
    pept   <      1 > 2339      apolipoprotein (apoB) (LDL receptor-binding
                                domain) (AA at 1)
BASE COUNT      725 a    519 c    450 g    645 t
ORIGIN      1 bp upstream of EcoRI site.
1 gaattccagc ttcctcgcct ctcacacaca attgagatac ctgcttttgg cagacttcat 61 ggaatcctga aaatccagtc tcccctcttt atattagatg caaatgccaa catacagaat 121 gtaactactt tagagaacaa agcagagatt gtggcctcca tcgctgctac aggagagtcc 181 gaaattgaag ctctcaattt tgattttcaa gcacaagctc aattcttgga gctaaaccct 241 aatcctctga tcctgaagga atccatgaac ttctccagca agcatgcgag aatggagcat 301 gagggtgaga tactattttc tggaaagttc attgagggaa aattggacac ggtcgcaagt 361 ttacagacag agaaaaatat ggtggagttt aataatggta tgattgtcaa gataaacaat 421 ccaatcatcc ttgacagtca cacaaagtat tttcacaagt tgagtatccc caggctggac 481 ttctccagta aggcttcctt taacaatgaa atcaagatgc tattagaagc tggacatgta 541 gcatggactt cttcagggac tgggtcatgg aattgggcct gtcccaactt ctcagatgag 601 ggcacacatt cgtccaaaat tagcttcact gtagaaggac ccattgcttt ttttggcttg 661 tctaataaca tcaacggcaa acacctgagg gttatccaga aattggctta tgaatctggc 721 ttcctcaact attccatgtt ggaagttgag tcaaaagttg aatctcagca tgtgggttcc 781 agcattctaa ctggcaaggg aacggtactg ctcagggagg caaaggcaga aatgactggc 841 gagcacaatg ctgacttgaa tggaaaagtt attgggactt tgaaaaactc tctttccttt 901 tcagcacaac catttatgat tactgcatcc acaaataatg atgggaattt gaaagttagt 961 tttccactaa agttgactgg gaaaatagac ttcctgaata actatgcact atttttgagt 1021 cctcatgccc agcaagcaag ctggcaagtg agtgctaggt tcaatcagta caaatataat 1081 caaaattttt ctgctataaa caatgaacat aacatagaag cccatgtagg aatgaatgga 1141 gatgccaacc tggatttctt aaccatacct ctaacaattc ctgaagtgaa actaccttac 1201 atagggctca cgactccctt gctgaaggat ttctccatat gggaagaaac aggcttgaaa 1261 gaatttttga agacaacaaa gcaatcgttt gatttaagtg taaaagctca atataaaaag 1321 aacagagaca ggcattccat tgcgattcct ctgaatgggt tttatgagtt tattctcaac 1381 aatgtcgact ccgggatagg gaagattggg aaagtcagag acagcgcatt agactatctt 1441 atttcatcct ataatgaagc aaaaaacaag tttgaaaatt cccttattca gccctccagg 1501 acctttcaaa agcgtggata cactatccca tttgtcaaca ttgaagtgac tccattcact 1561 gtagagacac tggcctccag ccatgtgatc ccaaaagcaa taaatacccc cagtgttcac 1621 attctgggcc ctaatgtcat tgtgccttca tacaggttag tgctgccctc cctggagctg 1681 ccagtccttc gtgtccccag 
gaatctactc aagttttccc tcccagattt caaggaattg 1741 agaacaattg acaatattta tattccagct cttggcaatt ttacctatga tttttccttt 1801 aaatcaagtg tcatcacgct gaataccaac gttggacttt ataaccggtc agacatcgtt 1861 gctcatttcc tttcttcctc ttcatttgtc acggatgccc tgcagtacaa attagagggt 1921 acttcacgtc tgactcggaa aagaggattg aagctagcca cagccgactc tctcactaac 1981 aaatttgtaa agggcaatca tgatagcacc tttagcttaa ccaagaaaaa catggaagca 2041 tcagtgaaaa caactgcaaa cctccatgct cccattttaa caatgaactt caagcaggaa 2101 cttaatggaa atgccaagtc aaagcccatt gtctcatcat ccattgaact aaactatgac 2161 ttcaattcct caaagctgta ctctactgct aaaggaggtg ttgaccacaa gtttagctta 2221 gaaagtctca cttcctactt ttccattgag tcatccacca aaggaaatat caagggatct 2281 gtcctttccc aggaatattc aggaagtgtt gccagtgagg ccaacacata cctgaattc // LOCUS MUSAPBRBD 2354 bp ds-DNA ROD 24-JUL-1990 DEFINITION Mouse apolipoprotein (apoB) gene, partial cds (LDL receptor-binding domain). ACCESSION M35186 KEYWORDS apolipoprotein B. SOURCE Mouse DNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 2354) AUTHORS Smith,T.J., Hautamaa,D. and Maeda,N. TITLE Sequence of the putative low-density lipoprotein receptor-binding regions of apolipoprotein B in mouse and hamster JOURNAL Gene 87, 309-310 (1990) STANDARD simple staff_review FEATURES from to/span description pept < 1 > 2354 apolipoprotein (apoB) (LDL receptor-binding domain) (AA at 1) BASE COUNT 757 a 525 c 423 g 649 t ORIGIN 1 bp upstream of EcoRI site. 
1 gaattccaac ttcctcacct ctcacataca attgaaatac ctgcttttgg caaactgcat 61 agcatcctta agatccaatc tcctctcttt atattagatg ctaatgccaa catacagaat 121 gtaacaactt cagggaacaa agcagagatt gtggcttctg tcactgctaa aggagagtcc 181 caatttgaag ctctcaattt tgattttcaa gcacaagctc aattcctgga gttaaatcct 241 catcctccag tcctgaagga atccatgaac ttctccagta agcatgtgag aatggagcat 301 gagggtgaga tagtatttga tggaaaggcc attgagggga aatcagacac agtcgcaagt 361 ttacacacag agaaaaatga agtagagttt aataatggta tgactgtcaa agtaaacaat 421 cagctcaccc ttgacagtca cacaaagtac ttccacaagt tgagtgttcc taggctggac 481 ttctccagta aggcttctct taataatgaa atcaagacac tattagaagc tggacatgtg 541 gcattgacat cttcagggac agggtcatgg aactgggcct gtcccaactt ctcggatgaa 601 ggcatacatt cgtcccaaat tagctttact gtggatggtc ccattgcttt tgttggacta 661 tccaataaca taaatggcaa acacttacgg gtcatccaaa aactgactta tgaatctggc 721 ttcctcaact attctaagtt tgaagttgag tcaaaagttg aatctcagca cgtgggctcc 781 agcattctaa cagccaatgg tcgggcactg ctcaaggacg caaaggcaga aatgactggt 841 gagcacaatg ccaacttaaa tggaaaagtt attggaactt tgaaaaattc tctcttcttt 901 tcagcacaac catttgagat tactgcatcc acaaataatg aaggaaattt gaaagtgggt 961 tttccactaa agctgactgg gaaaatagac ttcctgaata actatgcatt gtttctgagt 1021 ccccgtgccc aacaagcaag ctggcaagcg agtaccagat tcaatcagta caaatacaat 1081 caaaactttt ctgctataaa caatgaacac aacatagaag ccagtatagg aatgaatgga 1141 gatgccaacc tggatttctt aaacatacct ttaacaattc ctgaaattaa cttgccttac 1201 acggagttca aaactccctt actgaaggat ttctccatat gggaagaaac aggcttgaaa 1261 gaatttttga agacaacaaa gcaatcattt gatttgagtg taaaggctca atataaaaag 1321 aacagtgaca agcattccat tgttgtccct ctgggtatgt tttatgaatt tattctcaac 1381 aatgtcaatt cgtgggacag aaaatttgag aaagtcagaa acaatgcttt acattttctt 1441 accacctcct ataatgaagc aaaaattaag gttgataagt acaaaactga aaattccctt 1501 aatcagccct ctgggacctt tcaaaatcat ggctacacta tcccagttgt caacattgaa 1561 gtatctccat ttgctgtaga gacactggct tccaggcatg tgatccccac agcaataagc 1621 accccaagtg tcacaatccc tggtcctaac atcatggtgc cttcatacaa gttagtgctg 1681 ccacccctgg agttgccagt 
tttccatggt cctgggaatc tattcaagtt tttcctccca 1741 gatttcaagg gattcaacac tattgacaat atttatattc cagccatggg caactttacc 1801 tatgactttt cttttaaatc aagtgtcatc acactgaata ccaatgctgg actttataac 1861 caatcagata tcgttgccca tttcctttct tcctcttcat ttgtcactga cgccctgcag 1921 tacaaattag agggaacatc acgtctgatg cgaaaaaggg gattgaaact agccacagct 1981 gtctctctaa ctaacaaatt tgtaaagggc agtcatgaca gcaccattag tttaaccaag 2041 aaaaacatgg aagcatcagt gagaacaact gccaacctcc atgctcccat attctcaatg 2101 aacttcaagc aggaacttaa tggaaatacc aagtcaaaac ccactgtttc atcatccatt 2161 gaactaaact atgacttcaa ttcctcaaag ctgcactcta ctgcaacagg aggcattgat 2221 cacaagttca gcttagaaag tctcacttcc tacttttcca ttgagtcatt caccaaagga 2281 aatatcaaga gttccttcct ttctcaggaa tattcaggaa gtgttgccaa tgaagccaat 2341 gtatatctga attc // LOCUS DDINDK 518 bp ss-mRNA INV 24-JUL-1990 DEFINITION D.discoideum nucleoside diphosphate kinase Gip17 mRNA, complete cds. ACCESSION J05457 KEYWORDS nucleoside diphosphate kinase. SOURCE D.discoideum (strain AX3) 3 hour starvation, cDNA to mRNA, clones G[821,951]. ORGANISM Dictyostelium discoideum Eukaryota; Animalia; Protozoa; Sarcomastigophora; Sarcodina; Rhizopoda; Eumycetozoa; Dictyostelia; Dictyosteliida; Dictyosteliidae. REFERENCE 1 (bases 1 to 518) AUTHORS Lacombe,M.L., Wallet,V., Troll,H. and Veron,M. TITLE Functional cloning of a nucleoside diphosphate kinase from Dictyostelium discoideum JOURNAL J. Biol. Chem. 265, 10012-10018 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by M.L.Lacombe, 16-MAY-1990, for release after publication. 
FEATURES       from  to/span    description
    pept         19    486      nucleoside diphosphate kinase Gip17 (EC
                                2.7.4.6)
    mRNA   <      1    518      Gip17 mRNA
    signal      485    490      poly-A signal
BASE COUNT      164 a     97 c     97 g    160 t
ORIGIN
        1 taacctaata agattaaaat gtccacaaat aaagtaaaca aagaaagaac tttccttgct
       61 gttaaaccag acggtgttgc tcgtggttta gttggtgaaa tcatcgccag atacgaaaag
      121 aaaggtttcg ttttagttgg tttaaaacaa ttagttccaa ccaaagactt agctgaatct
      181 cactatgctg aacacaaaga aagaccattc ttcggtggtt tagtctcatt cattacctct
      241 ggtccagtcg ttgctatggt cttcgaaggt aaaggtgttg ttgcctctgc ccgtttaatg
      301 atcggtgtta ccaacccatt agcctcagcc ccaggttcaa ttcgtggtga tttcggtgtt
      361 gatgttggta gaaacatcat ccacggttct gattcagttg aatctgccaa cagagaaatt
      421 gctttatggt tcaaaccaga agaattatta actgaagtta aaccaaaccc aaatttatac
      481 gaataaataa tttagctaaa ttctaaatca tttaatgt
//
LOCUS       DDINDKA       184 bp ss-mRNA            INV       24-JUL-1990
DEFINITION  D.discoideum nucleoside diphosphate kinase Gip17 mRNA, 5' end.
ACCESSION   M36679
KEYWORDS    nucleoside diphosphate kinase.
SOURCE      D.discoideum (strain AX3) 3 hour starvation, cDNA to mRNA, clone
            G821.
  ORGANISM  Dictyostelium discoideum
            Eukaryota; Animalia; Protozoa; Sarcomastigophora; Sarcodina;
            Rhizopoda; Eumycetozoa; Dictyostelia; Dictyosteliida;
            Dictyosteliidae.
REFERENCE   1  (bases 1 to 184)
  AUTHORS   Lacombe,M.L., Wallet,V., Troll,H. and Veron,M.
  TITLE     Functional cloning of a nucleoside diphosphate kinase from
            Dictyostelium discoideum
  JOURNAL   J. Biol. Chem. 265, 10012-10018 (1990)
  STANDARD  full staff_review
FEATURES       from  to/span    description
    pept        137 >  184      nucleoside diphosphate kinase Gip17 (EC
                                2.7.4.6)
BASE COUNT       77 a     21 c     14 g     72 t
ORIGIN
        1 atttttgatt ttctttattc ttatttgtaa ttgtatataa tagtaaaaaa aaaaaattct
       61 cagatttata atttattatt atttttcaaa tttcttcaat tctttgtatt aaatattact
      121 caaacataaa ataaaaatgg cctcaaccaa aagaagatta aacagagaag aatcaatgtc
      181 caca
//