Path: utzoo!attcan!uunet!shelby!apple!bionet!root From: GenBank-Updates@genbank.bio.net Newsgroups: bionet.molbio.genbank.updates Subject: Database Update Message-ID: Date: 27 Jul 90 12:00:54 GMT Sender: root@genbank.BIO.NET Distribution: bionet Lines: 10498 Approved: lear@genbank.bio.net Checksum: 18808 649 LOCUS HUMHPRTB 56736 bp ds-DNA PRI 27-JUL-1990 DEFINITION Human hypoxanthine phosphoribosyltransferase (HPRT) gene, complete cds. ACCESSION M26434 KEYWORDS Alu repetitive element; LINE repetitive element; hypoxanthine phosphoribosyltransferase. SOURCE Human cell line 4X,Y (GM1202), fetal liver, and beta thalassemia patient DNA, clones pAE[23,28], Hu-lambda-[2,3,14] and AE29. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 56736) AUTHORS Edwards,A., Voss,H., Rice,P., Civitello,A., Stegemann,J., Schwager,C., Zimmermann,J., Erfle,H., Caskey,C.T. and Ansorge,W. TITLE Automated DNA sequencing of the human HPRT locus JOURNAL Genomics 6, 593-608 (1990) STANDARD full staff_entry REFERENCE 2 (sites; mutations causing gout) AUTHORS Argos,P., Hanei,M., Wilson,J.M. and Kelley,W.N. TITLE A possible nucleotide-binding domain in the tertiary fold of phosphoribosyltransferase JOURNAL J. Biol. Chem. 258, 6450-6457 (1983) STANDARD full staff_entry REFERENCE 3 (sites; mutations causing gout) AUTHORS Cariello,N.F., Scott,J.K., Kat,A.G., Thilly,W.G. and Keohavong,P. TITLE Resolution of a missense mutant in human genomic DNA by denaturing gradient gel electrophoresis and direct sequencing using in vitro DNA amplification: HPRT-Munich JOURNAL Am. J. Hum. Genet. 42, 726-734 (1988) STANDARD full staff_entry REFERENCE 4 (sites; mutations causing gout) AUTHORS Davidson,B.L., Chin,S.J., Wilson,J.M., Kelley,W.N. and Palella,T.D. TITLE Hypoxanthine-guanine phosphoribosyltransferase. Genetic evidence for identical mutations in two partially deficient subjects JOURNAL J. Clin. Invest. 82, 2164-2167 (1988) STANDARD full staff_entry REFERENCE 5 (sites; mutations causing Lesch-Nyhan syndrome) AUTHORS Davidson,B.L., Palella,T.D. and Kelly,W.N. TITLE Human hypoxanthine-guanine phosphoribosyltransferase: A single nucleotide substitution in cDNA clones isolated from a patient with Lesch-Nyhan syndrome (HPRT-Midland) JOURNAL Gene 68, 85-91 (1988) STANDARD full staff_entry REFERENCE 6 (sites; mutations causing Lesch-Nyhan syndrome) AUTHORS Davidson,B.L., Pashmforoush,M., Kelly,W.N. and Palella,T.D. TITLE Genetic basis of hypoxanthine guanine phosphoribosyltransferase deficiency in a patient with Lesch-Nyhan syndrome (HPRT-Flint) JOURNAL Gene 63, 331-336 (1988) STANDARD full staff_entry REFERENCE 7 (sites; mutations causing Lesch-Nyhan syndrome and gout) AUTHORS Davidson,B.L., Pashmforoush,M., Kelly,W.N. and Palella,T.D. TITLE Human hypoxanthine-guanine phosphoribosyltransferase deficiency. The molecular defect in a patient with gout (HPRT-Ashville) JOURNAL J. Biol. Chem. 264, 520-525 (1989) STANDARD full staff_entry REFERENCE 8 (sites; mutations causing gout) AUTHORS Davidson,B.L., Tarle,S.A., Palella,T.D. and Kelley,W.N. TITLE Molecular basis of hypoxanthine-guanine phosphoribosyltransferase deficiency in ten subjects determined by direct sequencing of amplified transcripts JOURNAL J. Clin. Invest. 84, 342-346 (1989) STANDARD full staff_entry REFERENCE 9 (sites; mutations causing gout) AUTHORS Dush,M.K., Sikela,J.M., Khan,S.A., Tischfield,J.A. and Stambrook,P.J. TITLE Nucleotide sequence and organization of the mouse adenine phosphoribosyltransferase gene: Presence of a coding region common to animal and bacterial phosphoribosyltransferases that has a variable intron/exon arrangement JOURNAL Proc. Natl. Acad. Sci. U.S.A. 82, 2731-2735 (1985) STANDARD full staff_entry REFERENCE 10 (sites; mutations causing Lesch-Nyhan syndrome) AUTHORS Fujimori,S., Davidson,B.L., Kelley,W.N. and Palella,T.D. TITLE Identification of a single nucleotide change in the hypoxanthine-guanine phosphoribosyltransferase gene (HPRT-Yale) responsible for Lesch-Nyhan syndrome JOURNAL J. Clin. Invest. 83, 11-13 (1989) STANDARD full staff_entry REFERENCE 11 (sites; mutations causing gout) AUTHORS Fujimori,S., Hidaka,Y., Davidson,B.L., Palella,T.D. and Kelley,W.N. TITLE Identification of a single nucleotide change in a mutant gene for hypoxanthene-guanine phosphoribosyltransferase (HPRT-Ann Arbor) JOURNAL Hum. Genet. 79, 39-43 (1988) STANDARD full staff_entry REFERENCE 12 (sites; mutations causing Lesch-Nyhan syndrome) AUTHORS Gibbs,R.A., Nguyen,P.N., Edwards,A., Civitello,A.B. and Caskey,C.T. TITLE Multiplex DNA deletion detection and exon sequencing of the hypoxanthine phosphoribosyltransferase gene in Lesch-Nyhan families JOURNAL Genomics 7, 235-244 (1990) STANDARD full staff_entry REFERENCE 13 (sites; mutations causing Lesch-Nyhan syndrome and gout) AUTHORS Gibbs,R.A., Nguyen,P.N., McBride,L.J., Koepf,S.M. and Caskey,C.T. TITLE Identification of mutations leading to the Lesch-Nyhan syndrome by automated direct DNA sequencing of in vitro amplified cDNA JOURNAL Proc. Natl. Acad. Sci. U.S.A. 86, 1919-1923 (1989) STANDARD full staff_entry REFERENCE 14 (sites; mutations causing gout) AUTHORS Hershey,H.V. and Taylor,M.W. TITLE Nucleotide sequence and deduced amino acid sequence of Escherichia coli adenine phosphoribosyltransferase and comparison with other analogous enzymes JOURNAL Gene 43, 287-293 (1986) STANDARD full staff_entry REFERENCE 15 (sites; conserved domain) AUTHORS King,A. and Melton,D.W. TITLE Characterisation of cDNA clones for hypoxanthine-guanine phosphoribosyltransferase from the human malarial parasite, Plasmodium falciparum: Comparisons to the mammalian gene and protein JOURNAL Nucleic Acids Res. 15, 10469-10481 (1987) STANDARD full staff_entry REFERENCE 16 (sites; mutations causing Lesch-Nyhan syndrome) AUTHORS Ogasawara,N., Stout,J.T., Goto,H., Sonta,S.I., Matsumoto,A. and Caskey,C.T. TITLE Molecular analysis of a female Lesch-Nyhan patient JOURNAL J. Clin. Invest. 84, 1024-1027 (1989) STANDARD full staff_entry REFERENCE 17 (sites; mutations causing Lesch-Nyhan syndrome) AUTHORS Stout,J.T. and Caskey,C.T. TITLE HPRT: Gene structure, expression, and mutation JOURNAL Annu. Rev. Genet. 19, 127-148 (1985) STANDARD full staff_entry REFERENCE 18 (sites; Lesch-Nyhan mutation) AUTHORS Wilson,J.M. and Kelley,W.N. TITLE Molecular basis of hypoxanthine-guanine phosphoribosyltransferase deficiency in a patient with the Lesch-Nyhan syndrome JOURNAL J. Clin. Invest. 71, 1331-1335 (1983) STANDARD full staff_entry REFERENCE 19 (sites; mutations causing Lesch-Nyhan syndrome and gout) AUTHORS Wilson,J.M. and Kelley,W.N. TITLE Human hypoxanthine-guanine phosphoribosyltransferase. Structural alteration in a dysfunctional enzyme variant (HPRT-Munich) isolated from a patient with gout JOURNAL J. Biol. Chem. 259, 27-30 (1984) STANDARD full staff_entry REFERENCE 20 (sites; mutations causing gout) AUTHORS Wilson,J.M., Kobayashi,R., Fox,I.H. and Kelley,W.N. TITLE Human hypoxanthine-guanine phosphoribosyltransferase. Molecular abnormality in a mutant form of the enzyme (HPRT-Toronto) JOURNAL J. Biol. Chem. 258, 6458-6460 (1983) STANDARD full staff_entry REFERENCE 21 (sites; mutations causing gout) AUTHORS Wilson,J.M., Tarr,G.E. and Kelley,W.N. TITLE Human hypoxanthine (guanine) phosphoribosyltransferase: An amino acid substitution in a mutant form of the enzyme isolated from a patient with gout JOURNAL Proc. Natl. Acad. Sci. U.S.A. 80, 870-873 (1983) STANDARD full staff_entry REFERENCE 22 (sites; mutations causing Lesch-Nyhan syndrome) AUTHORS Yang,T.P., Patel,P.I., Chinault,A.C., Stout,J.T., Jackson,L.G., Hildebrand,B.M. and Caskey,C.T. TITLE Molecular evidence for new mutation at the hprt locus in Lesch-Nyhan patients JOURNAL Nature 310, 412-414 (1984) STANDARD full staff_entry REFERENCE 23 (sites; mutant reversion) AUTHORS Yang,T.P., Stout,J.T., Konecki,D.S., Patel,P.I., Alford,R.L. and Caskey,C.T. TITLE Spontaneous reversion of novel Lesch-Nyhan mutation by HPRT gene rearrangement JOURNAL Somat. Cell Mol. Genet. 14, 293-303 (1988) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by A.Edwards, 26-JUL-1989. Mutant Description RJK 1780 missing intron 1 (partial and exon 2 [12] RJK 849 missing intron 3 (partial) and exons 4-9 (no mRNA) [22] RJK 984 missing intron 5 (partial) and exons 6-9 (no mRNA) [17] [22] GM 3467 missing intron 8 (partial) and exon 9 (no mRNA) [12] [22] RJK 853 complete gene deletion [12] [22] [16] GM 2227 inversion of exons 6-9 (no mRNA) [23] GM 1662 and GM 6804 duplication of exons 2 and 3 and elongated mRNA [22] [23] Connersville missing intron 7 (partial and exon 8 [7] FEATURES from to/span description pept 1677 1703 hypoxanthine phosphoribosyltransferase, exon 1 /nomgen="HPRT" /map="Xq26" /hgml_locus_uid="LE0031K" 14780 14886 hypoxanthine phosphoribosyltransferase, exon 2 16603 16786 hypoxanthine phosphoribosyltransferase, exon 3 27891 27956 hypoxanthine phosphoribosyltransferase, exon 4 31617 31634 hypoxanthine phosphoribosyltransferase, exon 5 34937 35019 hypoxanthine phosphoribosyltransferase, exon 6 39815 39861 hypoxanthine phosphoribosyltransferase, exon 7 40033 40109 hypoxanthine phosphoribosyltransferase, exon 8 41454 41501 hypoxanthine phosphoribosyltransferase, exon 9 IVS 1704 14779 HPRT intron A IVS 14887 16602 HPRT intron B IVS 16787 27890 HPRT intron C IVS 27957 31616 HPRT intron D IVS 31635 34936 HPRT intron E IVS 35020 39814 HPRT intron F IVS 39862 40032 HPRT intron G IVS 40110 41453 HPRT intron H rpt 258 41 (c) Alu repeat copy A rpt 1012 694 (c) Alu repeat copy B rpt 3532 3342 (c) Alu repeat copy C rpt 3982 3343 (c) Alu repeat copy D rpt 6232 5937 (c) Alu repeat copy E rpt 6524 6288 (c) Alu repeat copy F rpt 7504 7807 Alu repeat copy G rpt 8713 8408 (c) Alu repeat copy H rpt 9477 9169 (c) Alu repeat copy I rpt 9699 10142 Alu repeat copy J rpt 10407 10293 (c) Alu repeat copy K rpt 12154 12310 Alu repeat copy L rpt 13302 12998 (c) Alu repeat copy M rpt 13638 13314 (c) Alu repeat copy N rpt 14238 13923 (c) Alu repeat copy O rpt 14610 14239 (c) Alu repeat copy P rpt 16090 15843 (c) Alu repeat copy Q rpt 16518 16293 (c) Alu repeat copy R rpt 18783 18439 (c) Alu repeat copy S rpt 18962 19084 Alu repeat copy T rpt 19397 19104 (c) Alu repeat copy U rpt 20064 19778 (c) Alu repeat copy V rpt 20486 20636 Alu repeat copy W rpt 20990 20696 (c) Alu repeat copy X rpt 21293 20996 (c) Alu repeat copy Y rpt 22171 21878 (c) Alu repeat copy Z rpt 23772 23460 (c) Alu repeat copy AA rpt 25310 25426 Alu repeat copy AB rpt 26388 26279 (c) Alu repeat copy AC rpt 29863 29560 (c) Alu repeat copy AD rpt 33618 33148 (c) Alu repeat copy AE rpt 36575 36645 Alu repeat copy AF rpt 38613 39039 Alu repeat copy AG rpt 39091 39388 Alu repeat copy AH rpt 40622 40925 Alu repeat copy AI rpt 40952 41262 Alu repeat copy AJ rpt 42640 42936 Alu repeat copy AK rpt 44776 44486 (c) Alu repeat copy AL rpt 46627 46305 (c) Alu repeat copy AM rpt 47520 47763 Alu repeat copy AN rpt 49257 49551 Alu repeat copy AO rpt 49705 49993 Alu repeat copy AP rpt 50002 50312 Alu repeat copy AQ rpt 51066 50774 (c) Alu repeat copy AR rpt 52276 51978 (c) Alu repeat copy AS rpt 52407 52674 Alu repeat copy AT rpt 54653 54551 (c) Alu repeat copy AU rpt 55465 55164 (c) Alu repeat copy AV rpt 56523 56689 Alu repeat copy AW rpt 11312 11254 (c) LINE repeat copy A (partial) rpt 18284 18242 (c) LINE repeat copy B (partial) rpt 27091 26843 (c) LINE repeat copy C (partial) rpt 37558 37300 (c) LINE repeat copy D (partial) rpt 39451 39427 (c) LINE repeat copy E (partial) rpt 51317 51100 (c) LINE repeat copy F (partial) rpt 261 321 short interspersed repeat copy A rpt 1013 1054 short interspersed repeat copy B rpt 2902 2869 (c) short interspersed repeat copy C rpt 3078 3107 short interspersed repeat copy D rpt 4228 4254 short interspersed repeat copy E rpt 4463 4494 short interspersed repeat copy F rpt 21582 21635 short interspersed repeat copy G rpt 28430 28394 (c) short interspersed repeat copy H rpt 28923 28981 short interspersed repeat copy I rpt 35562 35604 short interspersed repeat copy J rpt 44042 44108 short interspersed repeat copy K rpt 46628 46688 short interspersed repeat copy L rpt 51715 51687 (c) short interspersed repeat copy M unsure 17479 17479 polymorphism or cloning error unsure 24903 24903 polymorphism or cloning error mut 1664 1678 ggccggctccgttat in wt; gt in gout RJK 951 [13] mut 14778 14778 a in wt; t in Lesch-Nyhan RJK 1760, no exon 2 [12] mut 14808 14809 tt in wt; ttt in Lesch-Nyhan Chicago [7] mut 14874 14874 t in wt; c in Lesch-Nyhan Detroit Leu->Pro [7] mut 14877 14879 tta in wt; ta in Lesch-Nyhan RJK 1939 [12] mut 14886 14886 g in wt; a in Lesch-Nyhan RJK 2163 Arg->Lys [12] mut 16619 16619 c in wt; g in gout Toronto Arg->Gly [20] mut 16623 16664 atgtgatgaaggagatgggaggccatcacattgtagccctct in wt; at in Lesch-Nyhan RJK 2108 [12] mut 16675 16676 ag in wt; agg in Lesch-Nyhan RJK 866 [13] mut 16677 16677 g in wt; a in Lesch-Nyhan New Haven Gly->Glu [7] mut 16679 16679 g in wt; c in Lesch-Nyhan Yale Gly->Arg [10] mut 16690 16690 c in wt; a in Lesch-Nyhan Flint RJK 892 Phe->Leu [5] [13] mut 16707 16707 a in wt; t in gout Arlingen Asp->Val [7] mut 16756 16759 tgta in wt; ta in Lesch-Nyhan RJK 1332 [13] mut 16780 16780 c in wt; a in gout Munich Ser->Arg [3] [19] mut 16784 16787 tgtg in wt; tg in Lesch-Nyhan RJK 1747 [12] mut 27897 27897 c in wt; t in Lesch-Nyhan RJK 1930 Gln->TAG [12] mut 27911 27911 c in wt; t in gout London Ser->Leu [2] [4] [12] [14] mut 31621 31621 t in wt; a in Lesch-Nyhan Midland RJK 896 Val->Asp [6] [13] mut 31622 31624 ctt in wt; ct in Lesch-Nyhan RJK 2019 [12] mut 31624 31624 t in wt; c in Lesch-Nyhan RJK 1784 Leu->Ser [13] mut 31628 31628 t in wt; g in gout Ann Arbor Ile->Met [11] mut 34962 34970 tgcag in wt; agcaaagcaa in Lesch-Nyhan RJK 1210 Met->Lys [13] mut 35015 35015 g in wt; t in gout Milwaukee RJK 949 Ala->Ser [7] [13] mut 39837 39837 c in wt; t in Lesch-Nyhan RJK 974 Arg->TGA [13] mut 39842 39846 tgttg in wt; tg in Lesch-Nyhan RJK 1894 [12] mut 39858 39858 g in wt; t in Lesch-Nyhan RJK 2185 Asp->Tyr [12] mut 39866 39866 g in wt; a in Lesch-Nyhan RJK 1934. no exon 7 [12] mut 40034 40038 tgttg in wt; tg in Lesch-Nyhan Michigan RJK 855 [7] [13] mut 40080 40080 g in wt; a in Lesch-Nyhan Kingston RJK 2188 Asp->Asn [12] [19] mut 40095 40095 t in wt; g in Lesch-Nyhan New Briton RJK 950 Phe->Val [7] [13] mut 40102 40102 a in wt; g in gout Ashville Asp->Gly [8] mut 40114 40114 g in wt; a in Lesch-Nyhan RJK 888, GM 7092, no exon 8 [12] mut 41486 41509 aaaatacaaagcctaagatgagag in wt; ag in Lesch-Nyhan Evansville RJK 894 [7] [12] mut 41450 41452 ata in wt; ttt in Lesch-Nyhan RJK 906, GM 1899, downstream cryptic splice site used mut 41454 41454 c in wt; g in Lesch-Nyhan RJK 1874, RJK 2079, His->Asp [12] [13] mut 41461 41461 g in wt; a in Lesch-Nyhan RJK 1727 Cys->Tyr [13] BASE COUNT 15689 a 11281 c 11599 g 18167 t ORIGIN 1 bp upstream of EcoRI site; chromosome Xq26. 1 gaattctcgt aaaactcttc atggcagtag ttattattct ctctctctct ctttttcttt 61 tttcttgaga caggatattt ctctgttgcc caggctggag tgcagtggca cagtcttggc 121 tcactgcagc ctggacctcc tgggctcaag ccatcctccc acctcagcct cccaagtagc 181 tggggctaca ggcacatggc caccaggcca gataattttt catttttgta gagactgagt 241 ctcaccatgt tacccaggtt tattattctc attttttaga tgaagagact gaggtccaga 301 gaagctcaat gacttgccta gttttacaaa tctcctgcca tcacataccc ctcagcgtcc 361 ttaataagag ggaggccacc aactatgtgc tgggcactgt ggtggatgct ggagctatag 421 ggttgagtat ataagaaatg gtgttgctgg agcaactgtt gcttgcttac ctgacctatc 481 tgagaattaa ttagcagggg aacatatttt tgttttcaga ttcaatataa gaacttgtgt 541 gggcaaaaat aaagatcagt agtaataaca gtagttccca tttgctgact gtactgtcct 601 aagtgcatat atatatacat acacacacgc atacctatac tcctctaata ctcaaaatga 661 tcctgtttat gtattgttaa tatgctcatt ttatttttaa atttttattt atttttattt 721 ttatttattt ttgagacgga gtctcattct gtcgcggagg ctgaagtgca gtggtgcgat 781 ctcagctcag tgcgacctcc gcctcccggg ttcaagtgat tctcctgcct cagctccgga 841 ctagctggga ttacaggcgc ccgcctccac gcccagctaa tttttgtatt tttagtagag 901 atggggtttc gccatgttgg ccaggctggt ctcgtactcc tgaccttgag tgatccacct 961 gcctcggcct cccaaagtgc tgggattaca ggcatgagcc accgcgccgg gctaatatgc 1021 tcattttagt gaggcaaaaa tagaggctca gagtctgatt tgtacaaaac tacagagcag 1081 ttaagtgtcc tctcagatgt gtaccctgat ctgggtgact ctaggactct aggtctcaac 1141 tgttacaacc agttaagggt ttggggaagc actgggccaa gagtcaggaa aatggaagcc 1201 acaggtagtg caaggtcttg ggaatgggac gtctggtcca aggattcacg cgatgactgg 1261 aacccgaaga gccggggccc ggtttacggc cgccatgaag caacgcgcgc cggtaggttt 1321 gggaatcagg gagccctctg aataggagac tgagttggga gggaaagggg cttcgctggg 1381 ggagcctcgg cttcttctgg gagaaaattc ccacggctac ctagtgagcc tgcaaactgg 1441 taggcgccgg cgtaggcgcg cgggcggggc cgggggcggg gcctgcgggg cgtggcgggg 1501 cgggcagagg gcggggcctg cttctcctca gcttcaggcg gctgcgacga gccctcaggc 1561 gaacctctcg gctttcccgc gcggcgccgc ctcttgctgc gcctccgcct cctcctctgc 1621 tccgccaccg gcttcctcct cctgagcagt cagcccgcgc gccggccggc tccgttatgg 1681 cgacccgcag ccctggcgtc gtggtgagca gctcggcctg ccggccctgg ccggttcagg 1741 cccacgcggc aggtggcggc cgggccctga ggcgcgggat ccgcagtgcg ggctcgggcg 1801 gccgggccca gggaaccccg caggcggggg cggccagttt cccgggttcg gctttacgtc 1861 acgcgagggc ggcagggagg acggaatggc ggggtttggg gtgggtccct cctcggggga 1921 gccctgggaa aagaggactg cgtgtgggaa gagaaggtgg aaatggcgtt ttggttgaca 1981 tgtgccgcct gcgagcgtgc tgcggggagg ggccgagggc agattcggga atgatggcgc 2041 ggggtggggg cgtgggggct ttctcgggag aggcccttcc ctggaagttt ggggtgcgat 2101 ggtgaggttc tcggggcacc tctggagggg cctcggcacg gaaagcgacc acctgggagg 2161 gcgtgtgggg accaggtttt gcctttagtt ttgcacacac tgtagttcat ctttatggag 2221 atgctcatgg cctcattgaa gccccactac agctctggta gcggtaacca tgcgtatttg 2281 acacacgaag gaactaggga aaaggcatta ggtcatttca agccgaaatt cacatgtgct 2341 agaatccaga ttccatgctg accgatgccc caggatatag aaaatgagaa tctggtcctt 2401 accttcaaga acattcttaa ccgtaatcag cctctggtat cttagctcca ccctcactgg 2461 ttttttcttg tttgttgaac cggccaagct gctggcctcc ctcctcaacc gttctgatca 2521 tgcttgctaa aatagtcaaa accccggcca gttaaatatg ctttagcctg ctttattatg 2581 attatttttg ttgttttggc aatgacctgg ttacctgttg tttctcccac taaaactttt 2641 taagggcagg aatcaccgcc gtaactctag cacttagcac agtacttggc ttgtaagagg 2701 tcctcgatga tggtttgttg aatgaataca ttaaataatt aaccacttga accctaagaa 2761 agaagcgatt ctatttcata ttaggcattg taatgactta aggtaaagag cagtgctatt 2821 aacggagtct aactgggaat ccagcttgtt tgggctattt actagttgtg tggctgtggg 2881 caacttactt cacctctctg ggcttaagtc attttatgta tatctgaggt gctggctacc 2941 tcttggagtt attgagagga ttataagaca gtctatgtga atcagcaacc cttgcatggc 3001 ccctggcggg gaacagtaat aatagccatc atcatgttta cttacatagt cctaattagt 3061 cttcaaaaca gccctgtagc aatggtatga ttattaccat tttacagatg aggaaccttt 3121 gaagcctcag agaggctaac agacataccc taggtcatac agttattaag agaaggagct 3181 ctgtctcgaa cctagctctc tctctctcga gtaataccag ttaaaaaata ggctacaaat 3241 aggtactcaa aaaaatggta gtggctgttg tttttattca gttgctgagg aaaaaatgtt 3301 gatttttcat ctctaaacat caacttactt aattctgcca atttcttttt tttgagacag 3361 ggtctcactc tgtcacctag gatggagtgc agtggcacaa tcactgctca ctgcagcctc 3421 gacttcccgg gctcgggtga ttctccccag gctcagggga ttctcccact tcagcctccc 3481 aagtagctgg gactacaggt gcgcaccacc atccctggct aatatttgta ctttatttta 3541 tttatttatt tatttatttt ttgagatgga gtttcgctct tgttgcccgg gctggagtac 3601 agtggcatga tctcggctca gtgcaacctc tgcctcccgg gttcaagcga ttctcctacc 3661 tcatccccct gagtagctgg gattacaggc gcctgccacc atgcctggct aattttttgt 3721 atttttaata gagacgaggt ttcaccatgt tggccaggct actctcgaac tcctgatctc 3781 aggtgatcca cccgccttgg cctcccaaag tgctgggatt acaggcgtga gccactgcgc 3841 ccggcctaat atttgtattt tttgtagaga tggtgttttg ccatgttgtc caggctggtc 3901 ttgaactcct gagctcaagc gatctgcccg cctctgcttc ccaaagtgct gggattacag 3961 gcatgagcca ccgtgcctgg cctaggtaga cgcttttagc tttggggtgt gatgcctgcc 4021 ccagtatata gtgaatttaa ttattgctag agctggctgt ttgttagttt tctttgaaca 4081 taagatactc attgttttta gtttgcaaat ccctcttcct ttttaaaaaa tttctttccc 4141 ttaaattgtt tgcatgttag caataacaaa tgcttaaatg gtgctatgtg ctagatactc 4201 ttctaagccc tgttatgtat attaactaat tttttaaatt acacaaatca gagaggttaa 4261 gtaacttgcc caagattacc caacaatact aggatttgaa cctaagtttg tctcacccca 4321 gattctgctc ttaatctcta aacttttaag ttagtagtga caatagtagg tatttattga 4381 atacttaact atgttttagg cgttgaagta aatattttgc aggcattatc taatgtaaac 4441 accctaaagt tacataacag gtacccttta ggtaaataaa cactagtatg accttggagg 4501 cacagatagt tgaagtaact tgcccaatat cacttacatg aaattggccc tcaaatgtgt 4561 ctgatacaac ccatgctgct tgtaactatc gttttaaact gccagggtaa acttggacac 4621 acttgagcta agaaaaagct tttagatttt tgcaaattaa tgtgaaagat atgctttatg 4681 tggatataat atcttctaaa tttcggggat ggtagtccta gaaatgtaat cctgccctag 4741 ccgagcttac cctgccaata attttttaca gaattggtaa aacggagcac cttttttttg 4801 tccttggcca cactgttatc aacagggtgt agattgacat caatctgtag gtgtaaacca 4861 gaattactct ttgtgaccac caggaaatag agcagttcag ttcaggggtt tctttctgtg 4921 aatttagcac tgtgacctgc atactacaag tctactttgt tttctatcca ttgtttgtat 4981 ctgggtattg caaaaggtag gaaaaggacc aaccagatca gcagagaaga gttgccttgg 5041 agttttcttt tagttttctg cagttcatta gatagtaact aggccatgtc attttactcc 5101 cttgtagtga agatatgttg aagttgtact ggtatactct tctacctttc tgtaatttta 5161 tattgtgtag acttgataaa atttatgtgt caatcaccac cattaatatc aatattgagc 5221 ctcaattctt atttttctgc ccagtggctg ccaaattact aacatttaca ataattcact 5281 actactaaga taatctacta gttcgatcac atacttcaaa ttgttatgga actactgtct 5341 tcagcattgt gcttctgata actgataagt ataatttttt ttttgtccag agtgaacatg 5401 tctattcttc cactgtacac actaataaaa ggaaaaattg taatattggg taaattcatg 5461 tccttacaca tgtagtagtt atgagcccat gtccctagaa tgagtaataa tttatccctc 5521 ccttggttga atagtcaaga atgctgattt taattcttct aacagcttta tccctcagaa 5581 gggaaggcaa gcaagttata tatgtagttt atttgtaaga ctgatatgaa attggaagat 5641 gaatctacta ttagctttaa ttatttttac atttaggaat attgcatcag taactcataa 5701 ttttggtttt ctgttatcct gagttaacac aaattatcca aggagatggc ggatcatctg 5761 ctttgaggtg tttttttttg agaattttaa tgtatctgaa tataaaaggt aaaaatatgc 5821 caactagcaa tttctgccca ttccagaagt ttggaaatat tactcattac taggaattaa 5881 ataaaatatg gtttatctat tgttatacct cttttaattc acatagctca tttttatctt 5941 ttatttttgt ttgttttttt tgagatggag tcttgctctg tcaccaggca ggagtgcagt 6001 gatgcaaatc tcggctcact ctagccaccg actccctggt tcaagcgatt ctcctgcctg 6061 agccttctga gtagctggga ttacaggcag gcaccaccac gcccagctaa tttttgtaga 6121 gacaggattt caccgtgttg gccaggatgg tctccatctc ctgacctcat gatctgcctg 6181 cttcggcctc ccaaagtgct gggattacag gtgggagcca ctacgcctgg cccacatagc 6241 tcatttttag actcacttcc attaagtctt gtttggaccc acgaacattg tctttttttt 6301 tttaagatgg agtttcactt ttgttgccca gactgtagtg caatggtgca atctcagctc 6361 actgcaatct ctgcctcctg ggttctagca attctcctgc ctcagcctcc cgagtagctg 6421 gaattacagg cgcccgccac cacgcccagc taatttttgt gtttttagta gagacggggt 6481 ttcaccatgt tgggcaggcc aggggtgatc cgcccacctc agcctcccaa agtgctggga 6541 ttacaggtgt gagccaccgc atctggccaa catgtctttt tttttttttt cctttttaac 6601 cacaaagaga cttaagcagt ccttgtcaca gatgatgaat tgatgttgca agtattgtct 6661 tagcttggat taattttctt gcttactgta attttagata atatagcttt gtaattagag 6721 attttatgtg taaaccacaa aaatgtttac atgaaggcca ttattacaga tgtgacgtgc 6781 ataattatta gtaatttgta tgtttacatg ggtcagtctg gcaaaaaatt atgaagtttt 6841 aaaaattaaa aaaaattata atgccagttt tactggaaag taaaattatt tcagtaatcg 6901 attatagcaa aagtattgat tttcattcca gacaaaagtc agaatgaaag gtaatttctc 6961 aatactcttt cagattaata aaagtacctg tagcgatttt tatcattcac aagtatatca 7021 caagtaagtt agaatttgag aactgtgttc tagatctctg aggagatgca gtcagatttc 7081 tgaactgtct cagcaaatgg taagtaactt agagctagta attaataacc tgtcctttga 7141 tttctgattc agccaagaat ggccatattt gggaaaggca gatctggaga gtaaccacgt 7201 tttcattcat ttaccacttc taggcccctc cagagctctc agatattttg gggttgagcc 7261 cttccccaaa gccatacagg accttttttt tgtgatctgt tctagccatt tttatgttgg 7321 gtgcttgtta tggactgagc atttatgtcc tcccacaccc cccccatacc ttttttgaag 7381 tcctaacccc cagtgtgatg gtatttggag acagggcctt tggaaggtaa ttacagttag 7441 aagaagtcgg gagggttggg cccaggtctg attggattag tgcccttata tgaaaagaca 7501 ccaggacggg cgcagtggct cacacctgta atcccagcac tttgggaggc caaggtgggt 7561 ggatcacgag gtcaggagtt tgagaccagc ctggccaatg tagtgaaaca ccatctctac 7621 taaaaataca aaaattagct gggtgtggta gcgggctcct gtcatccaag ctactcggga 7681 gggtgaggca tgagaatcac ttgaacccgg gagttggagg ttgcagtgag cccagattgt 7741 gccactgtac tccagcctgg gtgacagagt gagactctgt ctcaaaaaag aaaaaaaaaa 7801 aaaaagagac accagagagc ttgttagaag aggtcatgtg agcacacagt tagaagacct 7861 tcaagccaaa gaagaggcct gagattgaaa cctaccttgc aggtacctta attttggact 7921 tcccagcctc caaaactgtg agaaataagt ttctgttaag tcactcagtc tgtggtattt 7981 tgttatggca gcctgagcag gtagttgttc tttcagaagg tgttgataat aaccacatgc 8041 aacaccaagt cacaaataat aaaacagatg taacttatat tcatacagaa agttgggcac 8101 tgccattgcc ttgttggttt acacggctgt gctagttcag tagcagaaag gtgctggtct 8161 cctttactca gtttacaatc taggcagtag aatgtaatca ctgctttaaa cttgatactg 8221 cttagggaga gaatcattgg tgctgggtaa ctttgggttc taggtttact ttttgtgtat 8281 atataactgt ttttggtaaa tcacaagttt ctgggcttgt cgaattagat tttgttacag 8341 attatgagct ttattatgct atacagttag ttgtatgtat atatgccttt cccactagat 8401 tttaagcttt tttttttttt ttttttttgt gacggagtct tgctcttgtc gcccaggctg 8461 aagtggagtg cagtggcaca atctcggctc actgcagcct ccacctccta ggttcaagcg 8521 attctcctgc ctcggcctcc caagtaactg ggactacagg cacgtgccac cacacccggc 8581 taatttttgt attttttgta gagacagggt ttcgccatgt tggctaggct ggtcttgaac 8641 ttctggcctc aggtgatcca cccgcctcag cctcccaaag tgctgggatt tacaggcatg 8701 agccaccacg cccagctata gctctttaag ggttgtaaat ttataatcat tcttttactc 8761 tcctgcaaat tctgttgcac actgccttaa tcaaggtaga tgctgaatgc atttttgtat 8821 aattgaatat gttgcaatcc ccaactctct ccaactgttc ctgtcaaagc agccactgga 8881 ttgttaacta atccatatta gatggggtta attaatatca gatgggacaa gtaagggcta 8941 ataagattat aggccaccaa gtagatttct gtctagctct tatagagatt gagtttattg 9001 gacctgtttg ataggaagtt ttggtgtttg ggatgattaa aactgaagtt cctatttatt 9061 gaattatacc tatttatatt atttcatatc agtggtccac atgcaagtga ggcttctgag 9121 acagagtttg agttctctct tcaactacca taacacttaa cctgtatctt tttttttttt 9181 ttttttttta gacaggagtc tcgctctgtc actcaggctg gagtgtagtg gtatgatctc 9241 ggctcactgt aacctctgcc tcctggattc aagcagttct ccatgtctca gcctccctag 9301 tagctgggat tacaggcctg tgccaccatg cctggctaat tttttttttg tatttttagt 9361 agagacgggg ttttaccacg ttggccaggc tggtctcgaa ctcttgacct cgagcgatca 9421 acttgccttg gcctcccaaa gtgctgggat tacaggcatg agccacagcg cccagccgtc 9481 ttttttttta aatagcaatt taacactgtt cacagttact catgtacatg tcatgccatc 9541 tattacactg taagttctgt gagggtagct gtatcaaatt tatctaactc tctctagtat 9601 gcatgacata gtaagtattc aataaatatt tgcatattag tgataaggat acaggttctg 9661 aatagtgggt ccttaccatt taagaattag tatttgatgg ccgggcgggg tggctcacgc 9721 ctgtaatccc agcactttgg gaggctgagg cgggcggatc atgagatcag gagatcgaga 9781 ccatcctggc taacatggtg aaatcccgtc tttacaaaaa aaatacaaaa gaattaacca 9841 agtgtggtgg tgggtgcctg tagtcccagc tactgctttg tgaggctgag gcaggcagat 9901 cacctgaggt gggaaattca agaccagcct gaccaacatg gagaaacccc atctctacta 9961 aaaatacaaa attagccggg cgtggtggcg catgtctgta atcccagcta ctcgggaggc 10021 tgaggcagga gaatggcgtg aacccgggag gcggagcttg cagtgagcca ggatcgcgcc 10081 actgcactcc agcctgggcg acagagcgag actccgtctc aaaaaaaaaa aaaaaaaaaa 10141 aattagtatt tgatatttga tcattaaata tgaattaaga ggacttagac tttttgttaa 10201 atgtcaagct gggaaaagtt gtcatttaaa tgaattgcct cttatttaat ttcgtctgat 10261 gatacatttt gtttttattt tgtaaaaaat tatttttttt ctttttggag acagggtctt 10321 gctctgttgc ccaggctggt cacaaactcc tgacctcaag caatcctcct gccttagcct 10381 cccaaaatgc tgggattaca ggcgtgacga cctcgcccgg ccttgtatta tgatacattt 10441 tgaacaacta caagtagact tggtataatg aacctgcacg tacccattgc caagttctga 10501 caactgtctg tctatagcca attatgcatt tcttaaatta gaaccccccc aatataccca 10561 aatatatata tatgtgtgca tatatatagt aagttgtaac aaagttgtga attcatacct 10621 gaagtatctc aagtgatgca agttttatga atttttgttt atgccttttg ggaagagttg 10681 tattgacaaa ttttttatgc ttaaagtaaa ccataaatca aaaaaataaa atctaggatg 10741 caataaaaca aaacaacttc ttgacataag tatggtatgt aaatctgttt tgattggaaa 10801 tcaatttgtt atattgccag aattcctgtt ttagaataca tctctgctga tctgtctgta 10861 ttcttagact gcatatctgg gatgaactct gggcagaatt cacatgggct tcctttgaaa 10921 taaacaagac ttttcaaatt cttagtcgat ctgcagaacc tgtagccagg cactgaacca 10981 ttttgataga tgcagtaatc gttgcaagtg tatatttcaa gggagttctg gctgggtcct 11041 agtttatgct tgtggcagaa gcagtgagta actgggagga agttggtgag taagcttcaa 11101 ggaagaagtc atttttagta ctctggatct tcctgatttt aaagcactac aaaatggtgc 11161 attttcattc ttgtcaagtg ataacagata tattctgatg agcctgaaat gaatatatat 11221 tgtatcattt ttataatatc tagcaaggtt tgtattttcc tagaacttga actaaatttc 11281 agttcataaa atttataaaa tacttagttg ttgtaaaata tttttggaat gttcacatag 11341 gtgacacaca aatgtcccat tttcattctt tctatagtaa atatgttctg atatgtgaag 11401 gtttagcaga tgcatcagca tttaatccta gaggatctgg cataatcttt tcccccaaga 11461 atagaaattt tttctgctta tgaaagtagt acatgtttct ttaaaaacaa atcaatattg 11521 acttctgcct gctgtatagc actatgcctc cacctggcca tgaccagggg catgtcctgg 11581 tccacctacc tgaaaatgtt tgcaaccagc ctcctggcca tgtgcacagg ggctgaagtt 11641 gtcccacagg tattacgggc caacctgaca atacatgaag ttccaccaaa gtctgagaac 11701 tcagaactga gctttgggga ctgaaagaca gcacaaacct caaatttctc agcactggaa 11761 acctcaaaat ataactgaat tccataaata agattttaag tcttaaatat gtatttttaa 11821 atgtattaaa agtcaagctg cttgtattta agcacctaat acaatgctta ggttgtaaaa 11881 ggagatgctc aataggtact aactgatata ttgagattta attatggttt gaccaatatt 11941 tattggaaac cgccaaagct taaatcatca gcttcttgaa tgtgatttga aaggtaattt 12001 agtattgaat agcatgtgag ctagagtatt tcattctttc tggtttattt cttcaaatag 12061 actttgaata taatggtgaa tgggtattat aaattaacta ataaaaatga cattgaaaat 12121 gaaaaaatat atatattaaa gtgtagaaag tgaccaggcg tggtggctca cacctgtaat 12181 ccaagcacct tgggaggctg aggcaggagg atctcttgat cccaggagtt caagaccagc 12241 ctgggcaaca tagcgagact tcgtctctaa aaaaaaaaaa gagagagaaa aaaatttttt 12301 ttatttaaaa aaagtgtaga aagtgtcaag accccacttc ttaccattat ttggtatatt 12361 tctctatacc cacccaccct tcctccttac tccctccctc ccttcccaat ctttttatct 12421 ttttgtattc tgattttttg tttgtatatt ttgctttaat ttaatgtatc ctttaaaaat 12481 ttcccataca ttttatatgt atatataaaa acgcatgctg ccaaagataa tttataagaa 12541 agaccattga atttttttaa aagtgatata tattcattga aaaaaattta gaatatatag 12601 caaagcaata aagaactaaa taaaattgct gtaactcctc tttcaaagat aagtgctttt 12661 atgattttgt tgtatttttt tctgtatata ggtacatata tagtatttat aaagctgtac 12721 tcatagtaca ttttcacatc acaggtacca tatcagtgtt attaaatatt ttgtatgcca 12781 ggggctagac ataccaagac aaccaatatg tggttctact taaataatat tagagtatct 12841 tttatgatga cacttcatga gttgactata ataatcttag acttctaaga gtttgggttt 12901 tcaaaagatc acttagcttt tttgggtgat ttttccccct tactgtgaga tgagagaggc 12961 tgtttggatt tgggattggg gtagcgggga cagcaacttt tcttttcttt ttctttttta 13021 ttttgaggta gggtattgct gtgtcaccca ggctggagtg cagtggtgtg atctcggctc 13081 actgcaacct ccacctcccg ggctcaggtg atcctcctgc ttcagcctcc cagtaactgg 13141 gactacaggc gcgtgccaca tgcctggcta attttgtatt tttagtagag atggggtttc 13201 accatgttgg ccaggctggt ctctaactcc tgacctcagg tgatacgccc acctgggcct 13261 cccaaaatac tgggattaca ggcatgagcc gctgcatcag ccagcagttt ttcttgtggt 13321 tttttttgtt tgttttgttt tgttttgttt ttgagatagg gtcttactct gttgtccacg 13381 ctggagtgct gtggtatgat cgtagctcac tgcagcctca aactcctggg ctcaagtgat 13441 tccttctgcc tccgcctccc gagtagctgg gactacaggt atgcaccacc atacctggca 13501 aatttttaca aagttttttg tagggacggg gtcttgctac attccccatg tcggtcttga 13561 actcctggcc tcaagcaact ctcctgtctc agcctcccaa agcactggga ttacaagtgt 13621 gagccaccac accatgccag tttttcctgt tcagtgtgat attttatctt gttagactac 13681 agtgtgttaa aacttgtttt actaaatttt caaacatact caaaagtgga gagaatagta 13741 taatgaatac ccgtatgttc atcacccatg tttagaatat tattaaatat aaagattttg 13801 ctgcgtttgt cttagctctt taaaattttt ctttttctct ttgtgaccta aaggaaattc 13861 catatcttat cactttactt ctacattctt gactaagatg actaagacat atagttacat 13921 ggttttttgt tttgtttttg ttttttaaag acgaaatctc gctcttgtcc cccaggctgg 13981 agtgcaatgg tgccatctca gctcagtgca acctctgcct tctgggtaca agcgattctc 14041 ctgcctcagc ctcccaagta gctgggatta caggctcctg ccaccacgcc tggctaattt 14101 ttgtattttt agtagagacg gcggggggag gtttcaccat gttgacaagg ctggtctgga 14161 actcctgacc tcaggtgatc cacccgcctc ggcctcccaa agtgctggga ttacaggcgt 14221 gagccaccgc gcccagcctg tttttttgtt tgtgtgtttt gttttttttg agacagagtc 14281 ttgctctgtt tcccaggctg gagtgaagtg gtgccatctc agctcagaga cagagtcttg 14341 ctctgtttcc caggctggag tgaagtggtg ccatcttggc tcactgcaac cttcacctcc 14401 caggttcaag tgattctcct gcctcagcct cccaagtagc tgggactaca ggcatgtgtc 14461 accacacccg gctaattttt ttgtattttt agtagagacg ggatttcacc gtgttgccca 14521 ggctggtctc gaactcctga gctcaggcag tctgcctgcc tcagcctccc aaagtgctgg 14581 gattacacgt gtgaaccaac ccgcccggcc tgttgttttc ttacataatt cattatcata 14641 cctacaaagt taacagttac taatatcatc ttacacctaa atttctctga tagactaagg 14701 ttatttttta acatcttaat ccaatcaaat gtttgtatcc tgtaatgctc tcattgaaac 14761 agctatattt ctttttcaga ttagtgatga tgaaccaggt tatgaccttg atttattttg 14821 catacctaat cattatgctg aggatttgga aagggtgttt attcctcatg gactaattat 14881 ggacaggtaa gtaagatctt aaaatgaggt tttttacttt ttcttgtgtt aatttcaaac 14941 atcagcagct gttctgagta cttgctattt gaacataaac taggccaact tattaaataa 15001 ctgatgcttt ctaaaatctt ctttattaaa aataaaagag gagggcctta ctaattactt 15061 agtatcagtt gtggtatagt gggactctgt agggaccaga acaaagtaaa cattgaaggg 15121 agatggaaga aggaactcta gccagagtct tgcatttctc agtcctaaac agggtaatgg 15181 actggggctg aatcacatga aggcaaggtc agatttttat tattatgcac atctagcttg 15241 aaaattttct gttaagtcaa ttacagtgaa aaaccttacc tggtattgaa tgcttgcatt 15301 gtatgtctgg ctattctgtg tttttatttt aaaattataa tatcaaaata tttgtgttat 15361 aaaatattct aactatggag gccataaaca agaagactaa agttctctcc tttcagcctt 15421 ctgtacacat ttcttctcaa gcactggcct atgcatgtat actatatgca aaagtacata 15481 tatacattta tattttaacg tatgagtata gttttaaatg ttattggaca cttttaatat 15541 tagtgtgtct agagctatct aatatatttt aaaggttgca tagcattctg tcttatggag 15601 ataccataac tgatttaacc agtccactat tgatagacac tattttgttc ttaccgactg 15661 tactagaaga aacattcttt tacatgtttg gtacttgttc agctttattc aagtggaatt 15721 tctgggtcaa ggggaaagag tttattgaat attttggtat tgccaaattt tcctctaaga 15781 agttgaatca ttttatactc ctgatgttat atgagagtac ctttctcttc acaatttgtc 15841 tctttttttt ttttttttga gacaaggtct ctgttgccca ggctggggtg cagtgcagca 15901 gaatgatcac agttcactgc agtctcaacc tcctgggttc aagcgatcct tccacctcag 15961 cctcctgagt agctgggact ataggtgtgc gccaccactc ccagctaata tttttatttt 16021 gtagaaacag ggttcgccat gttacccagc ctcccaaagt gctgggatta caggcatgag 16081 ccactggccc agtttctaca gtctctctta atattgtata ttatccagaa aatttcattt 16141 aatcagaacc tgccagtctg ataggtgaaa atggtatctt gtttttattt gcatttaaaa 16201 aaaattatga tagtggtatg cttggttttt ttgaaggtat caaatttttt accttatgaa 16261 acatgagggc aaaggatgtg atacgtggaa gatttaaaaa aaatttttaa tgcatttttt 16321 tgagacaagg tcttgctcta ttgtccaggc tggagtgcag tggcacaatc acagttcact 16381 ccagcctcaa catcctgcac taaagtgatt ttcccacctc acctctcaag tagctgggac 16441 tacaggtaca tgctaccatg cctggctaat tttttttttt ttgcaggcat ggggtctcac 16501 tatattgccc aggttggtgt ggaagtttaa tgactaagag gtgtttgtta taaagtttaa 16561 tgtatgaaac tttctattaa attcctgatt ttatttctgt aggactgaac gtcttgctcg 16621 agatgtgatg aaggagatgg gaggccatca cattgtagcc ctctgtgtgc tcaagggggg 16681 ctataaattc tttgctgacc tgctggatta catcaaagca ctgaatagaa atagtgatag 16741 atccattcct atgactgtag attttatcag actgaagagc tattgtgtga gtatatttaa 16801 tatatgattc tttttagtgg caacagtagg ttttcttata ttttctttga atctctgcaa 16861 accatacttg ctttcatttc acttggttac agtgagattt ttctaacata ttcactagta 16921 ctttacatca aagccaatac tgttttttta aaactagtca ccttggagga tatatactta 16981 ttttacaggt gtgtgtggtt ttttaaataa actcctttta ggaattgctg ttgggacttg 17041 ggatactttt ttcactatac atactggtga cagataccct ctcttgagct acatcggttt 17101 gtggggagtc aaaagtcctt tggagctagg tttgacaaat aaggtgggtt aacacttgtt 17161 tcctagaaag cacatggaga gctagagtat tggcgaattg aagaaatccc cctttttttt 17221 taacacactt aagaaagggg actgcaggta tactcaagag agtaagtcgc accagaaacc 17281 acttttgatc cacagtctgc ctgtgtcaca caattgaaat gcatcacaac attgacactg 17341 tggatgaaac aaaatcagtg tgaattttag tagtgaattt cattcataat ttgatcgtgc 17401 aaacgtttga tttttattac tttagactat tgtttctgat tttatgttgg gttggtattt 17461 cctgtgagtt actgttttac ctttaaaata ggaatttttc atactcttca aagattagaa 17521 caaatgtcca gtttttgctg tttcatgaat gagtcctgtc catctttgta gaaactcgcc 17581 ttatgttcac atttttattg agaataagac cacttatcta catttaacta tcaacctcat 17641 cctctccatt aatcatctat tttagtgacc caagtttttg accttttcca tgtttacatc 17701 aatcctgtag gtgattgggc agccatttaa gtattattat agacattttc actatcccat 17761 taaaaccctt tatgcccata catcataaca ctacttccta cccataagct ccttttaact 17821 tgttaaagtc ttgcttgaat taaagacttg tttaaacaca aaatttagac ttttactcaa 17881 caaaagtgat tgattgattg attgattgat tgatggttta cagtaggact tcattctagt 17941 cattatagct gctggcagta taactggcca gcctttaata cattgctgct tagagtcaaa 18001 gcatgtactt tagagttggt atgatttatc tttttggtct tctatagcct ccttccccat 18061 ccccatcagt cttaatcagt cttgttacgt tatgactaat ctttggggat tgtgcagaat 18121 gttattttag ataagcaaaa acgagcaaaa taggggagtt taactttaat attttctttt 18181 aaaaagcatt tcatgttata agatcaattc tgagtggtag aaaatgcttt gacattttat 18241 ttccattttc tacttttagt ttttttccta tttgtttaag atcttagagg attattaagc 18301 tgaactcctc aactgataaa aagcatgaca tcttaaacat aagcaaagca tatttttagg 18361 ttaattttca catagaaaac agtttatttt atgtgaaatt ctatgtagat atactatttt 18421 tttggtattt attgatatgt ttattttatt ttattttatt ttattttatt ttattttatt 18481 ttatttattt attttttttt ttgagacaga gtctcactct gttgcccagg ctggagtgca 18541 gtggcatgat cgtagctcac tgcaacctcc actcccgggt tcaagcaatt cttctgtctc 18601 agcctcccga gtagctggga ctacaggtgc ctgccactat gcccggctaa tttttgtgtt 18661 tttagtagag atggggtttc accttgttgg tcaggctggt ctcgaacccc tgacctcagg 18721 tgatccaccc acctcagcct cccaaagtgc tgggattata ggcatgagcc acgtgcccgg 18781 ccgacatgtt aattttttaa aaaaggcttt actggggtat attttatata atataataat 18841 cacatgtttt aactatacaa ttccaagctt tttagtatat ttatagggct atgcaaggaa 18901 gatatactgt taaacagtag aaattgagaa agctcttctg ataatatctc ttgatttgat 18961 gatggctcat gcctgtaatc tcagtgcttt ggaaggccaa gacagcagaa tcacttgagg 19021 ccaggggttc gagaccagcc tgggcaacac agcaataccc tatctttaca aataataaaa 19081 atatctgttg atttgaagta aagttttttt ttaaagacaa ggtctcattc tgtcacccag 19141 gctggaatgc agtagcaaga tcacagctca ctgtggcctt gaccttctgg gctcaagtga 19201 ttctcccact tcggcctccc gagtagctgg gactaacagg tgtgcaccac catggctggc 19261 taattttttt ttatgtttgt agagattggg tcttactgtg ttgcccaggc tgatcccgaa 19321 ctcctgggct caagcagtct tcctgcctca gcctctaaaa ttgctgggat tacaggcttg 19381 agtcaccatg cccagcctga agtagcattt ctaccctgtt taataattca gcagcttgtc 19441 atgtaagata ttcatatatg catataaaca ttaggcagct taatttggta aaactgtaaa 19501 atggaaattt taaattgttt gcagcatcaa taacattgat gtcagtatga tttttacatg 19561 ctgatcttga ccaatttgaa acagtgagtt aaaatctggc tgatccgtac taatcctaaa 19621 gaaatattct atgaactatt aaatgtttcc agaatatata aagaaacatt atgatgtcaa 19681 cacacccatc tatttttttt tggaaataaa aactccattt ttcttattaa agaaaacatg 19741 cttattagaa aacatacggc tgggtgcagt ggcacacatg taattccagt gctttgggag 19801 atcgaggtgg gagaatcact tgaggccagg agtttgagac cagcctagac aacataatga 19861 gaccccctct ctacacaaaa agaattagtt gtgcatggtg gcgtgcacct gtagtcccag 19921 ctacttggga ggcagaggca ggagcatccc ttgagcctag gagtttgaga ctgcaggagt 19981 tcgagactga gtggaatgca gtggaactgc attccagcct gagtgacaga gggagaccct 20041 gtcttaaaaa aataagaaag aaaacacaac tgcagaaaat tataaaggat ttaagtcatt 20101 ccaaatatca ctgccacttt ttatttagaa tattctaaag aattctctct ctgtgtacac 20161 acacacatat gcgtactctt aatccaagta gcttggtagg attttattta cctagtgcct 20221 agatgggaaa ttgcctgggg attccaaata cctatttcat taaattaaag atgtcactga 20281 ttttaagact taacactatt tttcatactg ccaagaaaga aaacactacc agttataaat 20341 gtaaattgcc atcaattgta atacatcaat tttagagcta ttattaataa aatgtgaatg 20401 tgcatcttag agcaatgaaa tatagtacta tatatttgat gaccttttct gccctgtgat 20461 attcagaaag tgaaagttaa atatgggctg agcatggtgg ctcacacctg taatcccagt 20521 actttgggaa gtcaagacgg gaggctggct tgaacccagg agttcaagac cagcctaggc 20581 aatgtagcga gacgccatct caaaatatta aaaataagta aataagtaaa taaaaagaag 20641 gttaagtata caaatgtatt tcctttgttg tgaatttatt tcaattttat agtgattttt 20701 tttttttgag acgaagtctc actcttgtcc cccaggctgg agtgcgatgg cgtgatctca 20761 gctcactgca acctctgcct cccaggttca agctatactc ctgccttggc cccccgagta 20821 gctgggatta caggcgcctg ctaccatgcc tggctaattt ttgtattttt agttgagatg 20881 gggtttcacc atgttggcca ggctggtcta gaactcttga cctctggtga tccacccgcc 20941 tcggactccc aaaatgctgg gattacaggc gtgagccacc gtgcctggcc agtggttttt 21001 tgttgttgtt gttgttgttt tgttttgttt ttgtttttgt ttttgttttg agacaggatc 21061 ttgctctgtc acccaggctg gagtgcagtg gtgccatctt ggttcactgc aacctctgcg 21121 tgggctcaag caatcctccc acctcccttt ccagagtagc ggggaccaca ggtgtgtgcc 21181 accacacctg actaattttt gcattttttt ttgtagaaac agggttttgc catgttgccc 21241 aggttggtct gaaactcctg agctcaaaca atccaactgc cttggcttcc ctaagtgaaa 21301 ttacaggcat gggccactgt acccagtcta gtgatttttt tatttttatt tttattttat 21361 tttattttat ttttttacca aaaaaacaac aaagcctcag gaggaaaagt tgatacacaa 21421 gtaaatttta ttggaaatgt ttttgtgtgg accttaagca gagggaaaat tagtctgcat 21481 tatggtgtat ccagactaaa tgactgatat taaaatgaaa ttattcttag gatttgcaat 21541 cttagagaaa actttttcat ttttattttt ttgagttaca aattatcttc atttacattt 21601 gagaacagtg agtcacagag ggattaagta acttactcaa gatcatacaa gtctttgatt 21661 tgaacccaat cttttaactc tgcagaactc agagtcactc ttatttggaa aaacttttta 21721 actgatgtgg atcctctaat atgggcttcc tattattcat tctctattag tcagaagttt 21781 tgcaagcaga cagaattcat tttgccaatt acgggatttt ccctcagttg cagtcaaggt 21841 tcataaaact ataactcttt atctttaatt agaaatgttt ttttttttga gacaaggtct 21901 tgctctgttg cccagactgg aatgcagtgg catagtggcc cattgcagct ttgaactcct 21961 gggctcaagg gatcctctgc ctcagcctcc caagtatctg agactacaag tgcgtgccat 22021 cacccatggc tattttaaaa aaaaaaaaaa ttgtagagat agggtcttgc tgtgttgccc 22081 aggctggtct caaactcctg gtctcaagca atccttctgc cttggtctcc caaagtgctg 22141 agattacagg tgtcagccgt tgcacctggc caaaacgata acttaaaata cacacacaca 22201 cacacacaca caaacacata tgtgtattgt gtgtgtgtgt gtgtgtgtgt gtctcaaaag 22261 gtatcaaaag agaatagcta taactttagt gttgatcttg atagtgactt gattaggctc 22321 tgtttaacat caaagatgca aattaatact ttctttgaac atattaaaaa tgcagaaaat 22381 attggagtat tttattttaa ataaattgta ttctgtatat ttaaggtata caacatgatg 22441 ttatgggata catataggtg gttaaaagat tactgcagtg aagcaaatta acgtatccct 22501 caactcacat agttacccat tttttttttg ttttggtggc aagaggagct taaaatctca 22561 tttagtgtga atcccaaata cagcacaatt ttattaccta tatacttcat gttgtacatt 22621 atatttctag acttgttcat cctacatatc tgctactttg tatcctctga gctacatctc 22681 cccattttct cacttgcccc ccaagtagtt tcttaaagtg tctcatgtaa gagggcagta 22741 gctttcagct taaacttttt ctctgtatgt agtcgatttc tttgaggtat acttttctct 22801 ccagaatagt tagatgtagg tataccactt tgatgttgac actagtttac ctagaactta 22861 tcttctgtaa atctgtctct atttccatct ctgtctccat ctttgtctct atctctatct 22921 gtctatctct atctatctat ctatctatct atctatctat ctatctatct atctatctaa 22981 agcaaattca tgcccttctc ctatttattg aatcgagacc atagacaggg gtgagagaaa 23041 gaatttggca ggaatgggga tgtgtattat ctgtggcata aggaaacttt acagaactag 23101 gttcaaaagt atactttcta gttctttccc atggcttttc actttgatgt agtccttatc 23161 aggtaactga ggttttatat aagtcccctg attcttagaa catgaaggtg tagtagtcaa 23221 ggttggtccc ttgaaaccac aaattttgtg aaaaaaaatt aagaaaattt gaataatttc 23281 ctcagcaaat acatattgat catctgttat acagccatga gaagtggttc tgttgcacac 23341 gtttatttta tcagatccta atcccaaacc aggcataaaa tggaaaccat gaagatagga 23401 tgaaataact tctgaatgtt tgaatgtttg aaaatagtgt acttaaaaat accaggtggt 23461 ttttgtttgt tttttgtttt tttctttttt tgagacaggg tctcactctg tcacccaggc 23521 tggagtgtag tggtgcaatc tcatctcatt gcagtcttga cctcccaggc tcaggttatc 23581 tcccacctca gcctcccaag tagctgggac tacaggcaca tgccaccacg cccagctaat 23641 tttttgtatt ttttgtagag acggggtttc accctgttgc ccaggctggt ctagaactcc 23701 tgggcttaag cgatcctccc acctcagcct cccaaagtgc taggattaca ggcatgagcc 23761 accatgcctg gcagaaaata ccaggttttt aagtatcagc acttactctt caatcttttc 23821 tattactatg ttgtgctaaa tggtattttt tatttaatta gagcaatgct gttcaataga 23881 actttctttg aggatggaaa tcttttatgt ttctgctatg tggtacagag ccactagtga 23941 catgtggctt ttgagcgctt gacacatctt gtgcaacaca ggaactgaat ttttaagtaa 24001 tttatattgc cacatgtggc taccgtatgg gacagtgtag tactagatga tctgtaaggg 24061 ctgtgcttca tcagtgtcgt tttttaactg acaaaaacct ttagtttttt ttttagtaat 24121 gtgtttattt aaaagaattc ataaaataca agtaaacaaa ttaacttgtt acctgagcat 24181 atgtcctttc atacttattt tttctgcata catattttgg aaaatggaat atctgcccct 24241 ttttttttat ctgagataca gtctacctct aaaaatacat gattctaaca ttctcacttt 24301 ttgttggcat ttgatcaggg tatagaaaaa cagttaaaag gacagagaat ggttgagaga 24361 ttatgatatg aagagaaaat gtgattgagt gtggtagact tggggcctgc ttgaatgttg 24421 agagaatgac tgttttccga taaaaaaaaa aagtccattc taggatccta aaagaagggt 24481 ctgaagttca ctgcagaaag caagctacat agtactaagc cactaagggg acatggagcc 24541 cttagtaatt cctaccttag taatagtctc atcatgccct cttgggaacc cagccttgtt 24601 gattagcctc tctgctttct ctccttatag ttcaacctcc ctgtttgttc caagcagttc 24661 ttttcctgcc catttattat gcatttctat acagctttcc tcctcttttt ctataccatg 24721 ctgcagttct tattgctacc tagaggtttt caaaattcct aggggcggat aagtaggcat 24781 aaacaaagtt cttccctatt atccttccta ttttttcacc tagactgaag aggtagacaa 24841 aatagaaata aagacattaa gggtatgtgt ttgtagtccc aaagagcttc tctggcaatt 24901 ttgatgtagt tgacagtgac gctctgagtt caggacagat tggactcctt ggctgagagg 24961 agtgaggaga taggacggta gaggagaggg tagagcaact ctggaggaag ctttcccctc 25021 acctttgcca gtcctgttat cctagactta accataatta aagatgaggg aggcactcag 25081 taaagggatc tagtgggaag cttgttccag acagccaagg agggaggttc gcgcagttcc 25141 tttggccacc caggtggggt aattgatcca tgtatgccat tcatgtacaa tgtaggcact 25201 tatacctgta ttccaatgta gtgaactata ccattactct taaattaata ttctttatta 25261 gcttccatgg tggctatagg ccaggcaaga gagttaagaa aaaataaata gccaggtatg 25321 gtgactcaag cctgtaatct cggcacttta ggaggccgag gcaggaggat agcttgagtc 25381 caggagttca agaccagcct gagcaaaata gtgagatcct gtctctattt tttaaaaaag 25441 ccttggggca aacaggagta tggaggtttg gatgctaata gaacagcagt gtcttactgc 25501 ttggagttct cttgtttctt gtcctatcac cgtagccttt ggatcacagc aatttttcca 25561 tgactccata cttttcagtt cttgaatatt ttttccttta ttcctcttgt ctctgtaaag 25621 acatcaactg gagttggact gtaataccag gtatctccag aagatggcac tatttaacag 25681 attttataaa taatttgatg tgagtcactg tcatctgaag cttgttgcct tttctttctt 25741 tcttctttct tttttttccc catcaattct gtatgtttga aatgctggga tttaagttag 25801 ttagaataag ggatgtctgt aatttcccta aattgagaag taatatgcaa aggttgatat 25861 cagaagtcat atgctcacct tgcaacacca aataatactg gcccatttgt gatttttgaa 25921 agtaacactc cataataaat ggatgtatat atagaagcat aacaaaaata gaagcacata 25981 aaagtgaaaa gtctcataaa cgccattgtc actactcatg taattgctgt tacaaatttg 26041 tttaaatgtt gaataaaaat ggtgtcatag gcaacacagt gttccactac ttggtgtttt 26101 taatagcatt attctgtctc agtgtgcttt ggattatcag gtgcttttta atagttgcat 26161 ggtattacat tgtgtagatg aacttgatta atttaaatgg ttccctgtta atggacatgt 26221 tggtttgttt ttgtgaacaa ctgatacagt gaacatttat tttttaaata aaaaaaagag 26281 agacagggtc ttgctgtgtt tctcgggctg gccttgaact cctggggtca agcgatcgtc 26341 ttgcctctgc ctccctggga ttacaggcat gaagccaccg cacccggccc agtgaacact 26401 cttgaatgta tctttgtata cttgtcaagt gtttttgtag caattgattc ccagaagtgg 26461 gaattacatg gaattaagtg acatgcatgt ttgcaatttt aacaggtatt gctatgtcat 26521 tttcaaaaga agctatgcca attaatactc tcaccaacaa gagtgcttat ttcccctcag 26581 catattatca ggcttaagtt ttgccagtat gggtgggaga acagtagaat cacattgttt 26641 tagtgtttgt ttctcagata gatataattt tacaccttat aaccttctct tctataaatt 26701 gtctatttgt gttcattctc cattttccta tgggttctta ttgttggagc ccaatatata 26761 aaagggggta tttgttacag aacctcttca gttttggttc atgtcatgcc tgggttttta 26821 ccctttctac ggatgttaaa aaaaattctc tattttcttc cagtccactt atggctttat 26881 tttttacatt tagattttaa tccgtctgga atttattttt gtgtatgctg tgaggtaggg 26941 accatacttt tattttttcc caaatgggtt actagttggc caaacatcat ttattgaata 27001 attcatcttt tccctactga ctcgaaatac catctttatt gtatactaaa tcctcatata 27061 gttctgggtc tgtttctggg ctctactttg ttcatttact gtgctggtac tgcaccgttg 27121 taattgctgt ggctttgtgg tatggtatgg cttgctctct gctagggcaa gtcgaagctc 27181 ttttgttcac ctgctctttc acccaaattt tctgtcctga atccagcaca gccaaattat 27241 ggtcattgtc accaccaact acagtgggtg ttgagcattt cccattgaat ctcctgtaag 27301 ggttttattg gattctgtga tagcagtaaa atgggagcct aagaggtatt ccttaaagga 27361 ctactaatca gacctggttt cccagatgat gctgaagatg acggggcctg ggctagactt 27421 ttgagggaca tatccttggg gttgggtgtg atatagacca gcccttacaa tttgcttgac 27481 tcatgggaat cgtacagggc cagaaccaga cacctgtcat gctaataact tccctcacaa 27541 ttcagaaatc actgtgattg aagatgggtg gctgttataa tactacccac ttaaaaatgg 27601 atgtaaccca ttttttagga ctcttaaaaa catcaaatca gtaatggccg attaggactt 27661 tttaattttt actaatctct acttgaaagt tttctagtca ttcatttcag gaaacctaat 27721 tcttataatt catatcattt agaatatcat aatgctatgg atattagcta gctaacttct 27781 caaatcttct agttctcatt taatttgaag tttgtgtgtg tacataagga tatacatata 27841 catatgtgtg tgtagatata tatatatata gttttttttt ttttaactag aatgaccagt 27901 caacagggga cataaaagta attggtggag atgatctctc aactttaact ggaaaggtat 27961 gtatcttgaa agggaagaaa aaaaagcact tcataccgag tcaattagta acagtgtgct 28021 ttcaatcaat cactaagaga taatttacat agtataacta aatgggttat ttaacccttg 28081 gaagcagtct aggttaatta tcgttcccta ggtcatgtag taaaaagaca gtagaatcca 28141 acattaacct taaatgtcca tattgtcaag tactgctgtc tgcctctgtg ggactctaat 28201 ttgggatcct tcaaaaaaca ttgatggggg aaaagatagc ctttaaaaaa aaaaaaaaaa 28261 caaacctatg tgagtctatg tgaggtagac tcacatagtt tcctaaaaga tagcaaagca 28321 gtattatgta gtggctgaaa gtgtgagttc cggagcctga caactgattc aaagcatggc 28381 ttagtacttc ctaactctga ccttgggcaa gttacttaac ctctctgtgt cccatatgtg 28441 attagggtga ggttgataat agcagccata gagttaagag gattaagtgc tataatgcaa 28501 gtagagctct tacaacagtt tctggtaaat cactcaataa attcagacat actattattt 28561 taagaaatct caaagagttt tcttgtacct taaaattctc ctagtgtgaa ccattggttt 28621 tggtatattg tgcttccatg tagtttaata tcaagatgtt tttagatttc ccttttaatt 28681 tatttgttga cccattggtt gttcaggagc atgctgttta cctgaaaata atggagatat 28741 taaggtattt gaatatttat cttctagtac attgaaaaac tttttgagag taaccaataa 28801 taaatgatgg aatgctactg cttttttttt ttgaagctgc cagttattgt ttacttacac 28861 tatgccaaat ataaaggcat taatctcata aaagtttcac aacaatcctg tgagggagac 28921 gatatcccca ttttacaaat caggaaatta agacttaata aggttaaaag acttgcccca 28981 aagtcacaga accagtaagt ggtagagctt gaatttgaat acagacctga ctctaaagct 29041 cttttctttc tttagatttt agtgttcatt gcttacttga atgagtatct ataagaaaac 29101 tttaacatgt aaaacttctg tgaaattatc ttgtcccata tcagggtcat gtcaaactaa 29161 tgtcctcctc agcatctttg gaaaacttca gaggagaaat gagctttgcc cctcctgttc 29221 atttcatata ccactgttag acctgtcctt ccctttcagc atgctttgtc catatttaga 29281 agctgttgaa gccattactt gtctggtcag tttttagtgc tggaatggac ctagcctttt 29341 aggccttctg agatttagtt tgatctcgtc tttcccacct aatggctctg ttctactaca 29401 tagatttgat ctgaaacagt tctctgtttc taaaataact ttcttttcat gatagtcaca 29461 gtaaagtaca tttattatgg aaaaatcaat aagtataacg agtgaaagtt atttcttggt 29521 ggtaagatta tgggattatt tgaactttct gtttcattgt attttattta tttatttatt 29581 tttgtgatgg agtctcactc tgctgcccag gctggagtgc agtagtacga tcttggctca 29641 ctgcaacctc cccttcccag ttcaagtgat tctcctgcct cagactccca agtagctggg 29701 attacaggcg cacgccacca tgcctggcta atttttttat ctttagtaga gacagggttt 29761 caccatgttg accaggctga tctccaactc ctgatctcag gtatccacct gcctcagcct 29821 cccaaagtac cgggattacg ggtgtgagcc accctgcctg gcctcatttt gtcttttggg 29881 ggtatttttg tgtgcagata tatatgtata taaatatttt tccctctttt ccccagttag 29941 tatttgagca gatgaacttt ggacccgaat acctgtattc aagtctctaa taccacttct 30001 tggctatttt cattttatca aatggcctct tatcctcgtt tttctcattt attaagtaga 30061 gatgtaacta cttgatataa ttcaaaaact caataatggc attcttttgt tttttagact 30121 ctagtgtctg tactccttgt accatgctgg gattcatttg aacaattgca tggctttttt 30181 agtgtattat taaatttgca gtttacttag aatttactgg gacctcatac aaatgggaaa 30241 aaaacataac tgtgttactc atttgctgtg tgcctttgga ttgaccctat tttttgtatt 30301 cattttctcc ccatgtcctg agttccactt tgaataaaaa agtaattttt ttcctgcctg 30361 taaaataggc taccaatagg ctgcagttgt ctatagtagc tgcttcactg aggagagctc 30421 agcatgagag aaatagtatg aattgcttgc cacaagttat gggctagcct tacttcattc 30481 tgtacttgga cctgtttagg cttctaagag atcttacctc caacaataaa ctgctttgag 30541 acatgaaaag gtggaagctt tacttggtta taactttact tttaatacct agaacagtga 30601 gtcttcaaac ttgtatttgc atgcccaatt tataaaaagt ttcctgagca tttaccccta 30661 atatatgcat tttaaattat atatgattta tggtaataat aatatatatg ttacaaaata 30721 catacaaaaa tatagattaa acaaggtgag gttaaaaaat ttaaaagttc taatctttct 30781 tgcaaaccag tggatctttt gtgccttact ctggtaaaca ctgtcttaga agaatatata 30841 gaacattaaa atcttaatgc tatagttata tgacagagta tgatgagagc tacagataaa 30901 caacacatca tgaatcttct tgtggcagtg tttataacca ttatgtgaaa tgctgcctca 30961 ttcttataac tagcataaga acagatagga ctttctcgat tttgaggggt aattattaga 31021 tggtattttc tgttaaggac tcttccagct ataaaattct taaatgtaga aagcgaagtg 31081 agggtttatg gtgagaggaa gcattggtat catgttttag tgtagtccaa gaatatggac 31141 acatccagaa aatgcagatc aagtttagcc taatgagaaa atatattttg gagtccatat 31201 ggtaaattaa attatgtgat ttttgagtta ttgtacaaat ataattctta gaatgttaga 31261 gtcaggagac tataagagac caactgcttc aagtttcatt taacacatgg gaaactaagg 31321 cgagagaaat ttcaagactt gcccaagatt agacctcttg ttaagtaatg aaagtgtttt 31381 aaaaacaggt gggtcaaatt ctgtttttaa aatttccatt atgatgaaaa tttcagtatt 31441 acaggcttcc aaatcccagc agatgggcca cttgtttaaa ggagagtttg atataataaa 31501 gcatctaaaa acaagagttt ggataattcc ttagggttgt tatgatgtga tttgacttat 31561 aattggaaat accgttttat tcattgtact gattttcatt tctctttttc ttctagaatg 31621 tcttgattgt ggaagtaagt tcacatttac ttttaatata acatttatga cttttctaac 31681 ttagtatgca ccatcctaaa ggtaagccag ggagagaaat tcctctgcat cagttttaat 31741 ggtgggcttg tgttctaaag gagtgagatt ggttttttgt aaagactact tagtaatttg 31801 tttttaccaa taatggaatg gtatacttcc tacctctctt tttttagttt gaagtatttt 31861 ctttctaaac ataactctct ctctctattt atctatatat aatatataca tatatatctt 31921 atattttatg tatatatata tatatcttgc ttagattttg tcttatgtaa tatttggtac 31981 ataaaaaata atatttataa tttatagact attttccatg tgttattatg tgctaaagta 32041 ttttgtatct tagcaccgag aggctaagca gtttcctagg gttaccagct agtaaactaa 32101 gggaaacctt tacttccttt agctcagtgg ttctcaaaat gtggttccct agaccaaaag 32161 tattaatatc agacaagaac ctaccgaatc aaaatatctg tgatgaggcc cagcaagcta 32221 tgctttaaca agtttccgag tgattctgat gcatgctaag gtttaggatc ccttgttttt 32281 actcataagt cactttctca ttaaggcctt ccctggccat cctatataaa atctcatgtt 32341 ttcacaccgt caacttcgta ttcctcctca atacttttat tttcctgatc acttatcact 32401 aacagcctct ctctctctct ctctctctct ctatgtatat atatatatat atcacttatc 32461 actgtctaac agcctctctt tatatatata taatctatag attatatata tatgcagcat 32521 tgtgcaatca ttatcacgct caattttaaa acattttcat ttccccacaa agaaacccaa 32581 tccccttagc catcactccc aattttccct tcccccagca cctagcaaac tgatcatcta 32641 cctacttgct gtctataaga tttgcctatt ctggacattt tgtataaata gaatcataca 32701 atatgtggcc ttttgtatct ggcttctctc acttaatgtt ttcaaggttc attcatgttg 32761 tggagtatat ctgcactcat ttccttttta ttgccaaatt gtatggatag acaggtgttc 32821 ctcaactgtg tcctgataaa cccatctgaa gttgaaaata tcataagttg aaaatggatt 32881 tactactttg ataaatctat cctaaagtca gaaaaatctc atgttggaac catcgtaagt 32941 tggataccat ctgaattaca tttttgttat ccattcactg gttgacagac gttaggttgt 33001 ttccactgat gctccttatt tctcgtacct gaaatgtcct tattccctcc cttcttatcc 33061 catgtttaag tcatttaaga cccagctcaa acgtcacctc cacaaaacct tccttgatac 33121 ccctttcctc ttcaattcac ttggaccttt tgcatttaat tttaattttt atttttttta 33181 agacagagtc tcactctgtc accaggctgg agtgcagtgg tatgatctca gctcactaac 33241 tactctgcct cccaggttca agcaattctc atgtctcagc ctcccaagta gctgggacta 33301 caggtgtgcg ccaccatgcc tggctaattg tgtgtgtgtg tgtgtgtatg tatgtatgta 33361 tatatgtgtg tgtgtgtata tatatatata cacaaacata tataaatata tatacatata 33421 tatatataca cacatatata aatatatata catatatata tatacacaca cacacacata 33481 tatatatata tagttttttt ttttttaagt agagatgggg ttttgccatg ttggccaggc 33541 tggtctggcc tcaagccatc ctcccacctc ggcctcgcaa agtgctggta ttataggcat 33601 gagccactgt gcctggcctg catttcattt taattataaa atattttgaa ctcagaaaaa 33661 agggtatgct gaatacctac gtacccacaa aagtattaac attttgccat atttgcttct 33721 gatcttattt tttttgagaa attaaagatc ataatacaac taaagcccca tttctttccc 33781 ttcattccca gaagtatgac aattatcctt aaagttgata tatatcattc ccatgcatgt 33841 tttttatact tccctagtac aagttagctg tatcctctgc tcaggggctc atcaagctga 33901 atcaagggac tcatgatcct cttcaaagtt ccttcaggtt gttggcagaa tttagttcct 33961 tgtgattgta ggactgaggg cccgttttct cactggctgc tggccagggg ttgctcccag 34021 atatttaaag gctcatgccc tagcccatga cagtctcaca acatggcagc tgacttcttc 34081 aaaaccagca ggagaatctt gctctagtct accacataac ctaatcacag gagcggctat 34141 cccgttattt tcacagatcc tggtcacatt caaggggagg gaacccttct gtgtgtgtac 34201 accaggaggc aggaattttt tttttctttt tcttttttgt taaaaagtct taaagtcttt 34261 tatccctaaa ggaggcagga attttgagag ccatcagaat tctgcctacc acagcccaga 34321 aatctgcatt tttcacaagt ctccagccat gatgtttctg atggctcaca ctgctttatt 34381 ccatttttaa agagtatttt tattgaaaag cattagggtt atggtttaaa aaatattttc 34441 cctaacaaag atgggtttgt ttagagtcct acttttgact aaatagctga gattcacttt 34501 tatgtaaagt tcattttata gcgttattaa tttgggtgcc tttaaaaata gtataaagca 34561 tgtttctcga gtgtagtctg ttagccacct atattggaga gttgggagga gagagtctct 34621 atcttgaatt tatgggaaaa attctaaaat actttttata atgaaggaca acatcataac 34681 tccctaataa aatgtgcatg tatatattca aatttgctgt cattgatcct gcacctacaa 34741 aatccagtcc tgggggctgg cattcttact gcttgctgag ggccagatga tatagattcc 34801 agaatatctc catgtagatt ttggtgagaa ttactgtgct gaaaagaatg acagtattgc 34861 agttatacat gggggttttg gtactttata ttgtgactct gaatttaaag ctatgcaatg 34921 tcttcttttt tgaaaggata taattgacac tggcaaaaca atgcagactt tgctttcctt 34981 ggtcaggcag tataatccaa agatggtcaa ggtcgcaagg tatgtatgac attttgacac 35041 agaatatttt cctcatttga agggggatta agtgattgct tctttttaag gataaatgtt 35101 ttcaactgtc attttatctt cgaaaagtaa tgtaatctca tataagactt aagatataat 35161 ccttttaaat aattttgtca tgtgttaata aagctcataa ttacagtcac ttccttgcta 35221 atattaacat ttggttttca gcatgctaat tatatcagtt tgtcctgaat agcatggcag 35281 aggattttgg gcccccttgc aaaattaaga ataaggattc caaagcgggt gaggaagtga 35341 taggaagggg tgggccctga agatctggac ctcctggaat tgagtgatga atgctgcatc 35401 ttctttgtgt ctgtagtgaa attttataat gcctgcttcc ttttttatta agtcggcctc 35461 acctcctcac cttacctatg ctgttttact tttgctttta tagttctacc tgtgtttatt 35521 tctcattttc gtttcatctc tcaacaactc tggggtggca ttattattcc cacttttcag 35581 ataaggttac tgaggcatag ggaattgtcc aaaggtacag agctagtccg ctatagagat 35641 gagatttgaa cccagggaac ctggctcaca gtttatgctt ttgcctacct taagttttta 35701 atagagtgac atcaaacaaa catttaagaa tatgtttttc ttttcctttt ataatttcat 35761 taaaaacatt aagtctctga tcagtctgca gtttttatgt aggggtcagg taatgttcta 35821 acttctgctt tttcctaagt gattaacagg tttttataag cccttttgaa aaaatcacgg 35881 tatctgtcga gcatctttga atcagagtaa gccttctagt gagtcatatg tcagcagttt 35941 gactgtatgg gcttttctaa tatccagttc aagtgtttat cagtgagttt ttcttttaaa 36001 tagatttggg acaggtacta tgagagtata taagtgatac gttataggac actaactagt 36061 atcctatgaa atggcaaaaa ctgcaatcac ttttgcacca accaaataga aactaatcag 36121 tgcacttgct tatttttcta catgctcttt agggttttaa atgtcaacct actgtggcat 36181 agactttaat cctctgggta ttcttttgtt gttctttcct ggtatatgct gtggaattga 36241 gatagactgg ttcgtgagcg agagattttg tgttgccaca ggtaggacat gctcaaacaa 36301 tacttgggtc atttcttgac ccaagtcatc tattcaccat agttttgtag caccgatctt 36361 gcatacattt catgtatctt ctttgaaccc cacgtcagtg ctgcttatat gatactcaga 36421 aattaaacac taaggaataa gattttcagg taggattgag ttttggaggg tcacaaatct 36481 tgtaatgtct aatatttcca ctctccctgc tgagaattag ttttggcttc cttggaggtg 36541 atatcgcctc tgttgagtat aagtggccta ctgtgatcac accactgcac tccagcctgg 36601 gtgacagagt gagaccctgt ctcagaaaaa aaaaaaaaaa aaaagaatgc atggcctaga 36661 tgacttctaa ggtttttccc acccagttcc agttttcatg ttctaggcag agcagtaaag 36721 tgagaaacac atggacttgg gagtttagtc tcgcatttca ctgccactta atctgagcga 36781 ctattccata tttaatctct ctgaatgtat ttactcatct ttaaagggga atgattatta 36841 acatcttttt ctcagggaaa ctatatgagt caaggagata atatatttga aaatcttttt 36901 aactgcaaag cgctgtttca ctgttggtta taatgtgatt gatctcattg tagtgagcag 36961 ctgcttaatt gcgttttaga atgtagggaa gatagtaata tttttcacat tatatatgta 37021 gctggttctg gaactgtaaa catactcctt ttttatggag atctgagtca cgtaccataa 37081 aattcactct tttaaagttg tacaatccag tggtttttga tatattcaga gttgtgcatc 37141 tgctaccact atttcatttt ggaacccaaa gaaaccttgt acccattagc agtcattctc 37201 ccttctccca gcccctggca actactaatc tactttctac agaaagtccg tacagatttg 37261 tgtattatgg acattccata taaatggact catgcaatat cctgtcttct ttcacttagc 37321 atagtgtttt caaggttcat ctaggttggg gcatgtatca gtacttcatc ccttgttttg 37381 gctgaataat atttcattgt acaaatatat cacattttgc ttatccatct gttggtgaac 37441 atttgagttt ctacctgttg gcttttatga ataatgttga tttgaatgtt tgtgtacaag 37501 tatgaatacc tgttttcagg tctcttgagt atatagttgc taggtcatat agtaactctg 37561 tgtttaacat tttgaggaat tgcccgacta tttaacaagg tatatgtact gttttacacc 37621 agtaacatat gagggttcca atatctccac atccttgaca acacttgtta ctgtcctttt 37681 tattgtagcc atcctagtgg ctatgatgtg gtatctcatt gtggttttga tttgtgtttc 37741 tctgatgctg atgatgttga acatgttttc atctgcttat tggccattta catatatctt 37801 cttaagaacg gttacccatt tacagtatgg aaaatgcttc agatgcaact ctagtcatgc 37861 cttagagatg gagctttatt aaacattcag atctctaggc atatgaagtg ctgagttctc 37921 ttgaactcct aatacagatt gcactgagtt tagtgatacc ttttctggag cattcctgag 37981 ttcaggtagg gagaagggtt tttgctgtga ttggcttgtt atgttctttc taaatggaaa 38041 tagaattgaa gtgtctcctc tctccattta ttggaagagt catgagggac ataattagat 38101 gatcccttgg agtctccggc ttaggtcagt ggttatctac ttaggctgca cattggaatc 38161 acctgagagt taaaaaacca ggataacctc tgcctgtgtc tcatctccag caattctgat 38221 gtaattggtc aggctgtggc ccgagtaggt gagttctggt tttttaaagc tcccaggtga 38281 ttctgatgtg caatccaggt tgagatcact ttgggccctt tccagctctt taaacatata 38341 tatttatcta ggaaggtatg aaagcataag ttttcttgag actgccttta acatctgtaa 38401 aggctttcaa agcagcttct gtagtttttt ttaaatggct gaatattttt caacaggcag 38461 catttgggtt ataaaattag cttttggtag agttgactta taccacctcc agcttttgtt 38521 ccaaaaataa atactggttc ttttggcaca ctagttgttt taccctaaag ttcctctttg 38581 taagccagtt attaaaagtt gtgatgcagc cagggcgaag tggtacacat ctgtagtccc 38641 agctactcgg aaggctgagg ggggaggatc gctagagccc aagaagtcaa ggctgcagtg 38701 aactgtgatt acaccactgc actgcagcct gggccacaga gcgagactca tctctttaaa 38761 aaaagaatgt tgtgaggccg ggcgcagtgc tcacgcctgt gatcccagca ctttgggagg 38821 ccgaggtgga cggatcacct gaggttggga gttcgagacc agcctgacca acatggagaa 38881 accctgtctc tactaaaaaa aatacaaaat tagccgggcg tggtggcaca tgcctgtagt 38941 cccagctact cggcaggctg aggcaggaga atcgcttgaa cctgggaggc agaggttgtg 39001 gtgagttggg cgagccattg cactccagcc tgggcaacaa gagcaaaact ccatctcaaa 39061 aaaaagaaaa gaaaagaaaa gaatgttgtg gccaggcgcg gtggcttacg cctgtaattt 39121 cagcactttg ggagaccgag gtgggcggat cacgaggtca ggagatcaag accatcctgg 39181 ctaacacagt aaaaccccat ctctactaaa tacaaaaaaa aattagccgg gagtgctggc 39241 gggtgcctgt agtcccagct actcaggagg ctgaggcggg agaatggcgt gaacccagga 39301 ggcagagctt gcagtgagcg gagatcgcgc cactgcactc cagcctgggc aacagagcga 39361 gattccgtct aaaaaaaaaa aaaaagaatg ttgtgataaa aggtgatgct cacctctccc 39421 acaccctttt atagtttagg gattgtattt ccaaggtttc tagactgaga gcccttttca 39481 tctttgctca ttgacactct gtacccatta atcctcctta ttagctcccc ttcaatggac 39541 acatgggtag tcagggtgca ggtctcagaa ctgtccttca ggttccaggt gatcaaccaa 39601 gtgccttgtc tgtagtgtca actcattgct gccccttcct agtaatcccc ataatttagc 39661 tctccatttc atagtctttc cttgggtgtg ttaaaagtga ccatggtaca ctcagcacgg 39721 atgaaatgaa acagtgttta gaaacgtcag tcttctcttt tgtaatgccc tgtagtctct 39781 ctgtatgtta tatgtcacat tttgtaatta acagcttgct ggtgaaaagg accccacgaa 39841 gtgttggata taagccagac tgtaagtgaa ttactttttt tgtcaatcat ttaaccatct 39901 ttaacctaaa agagttttat gtgaaatggc ttataattgc ttagagaata tttgtagaga 39961 ggcacatttg ccagtattag atttaaaagt gatgttttct ttatctaaat gatgaattat 40021 gattcttttt agttgttgga tttgaaattc cagacaagtt tgttgtagga tatgcccttg 40081 actataatga atacttcagg gatttgaatg taagtaattg cttctttttc tcactcattt 40141 ttcaaaacac gcataaaaat ttaggaaaga gaattgtttt ctccttccag cacctcataa 40201 tttgaacaga ctgatggttc ccattagtca cataaagctg tagtctagta cagacgtcct 40261 tagaactgga acctggccag gctagggtga cacttcttgt tggctgaaat agttgaacag 40321 ctttaatata caataattgt tgcattatta tttcagatga taaatgtggt cataagtaag 40381 aaataaatga tcgagtttag tcttttaatt cactgtcctt tgaatacctg cctcttactc 40441 tggaggcaga agtcccatgg atgtgtttat gaacatggtt gaggaagatt taggaagact 40501 gcaacagtac actacctaaa gcaggttttt tactccatct ttttttgcca cgtacactgg 40561 cctcccactt tgatatgctt gaaattatct ccttgatttg tctttcaaaa ctacatattg 40621 aggctggttg cggtggctca cacctgtaat cctagcactt tgggaggcca agccggacag 40681 atcacttgag gtcaggagtt cgagaccagc ctggcaaaca tgatgaaacc ccacctttac 40741 taaaaataca aaaattagcc aggcgtagtg gtgtgtgcct gtaacccagc tacctgggag 40801 gctgaggcag gagaatcact ggaacccggg aggcagaggc tacagtgagc caacatcacg 40861 ccactgcact ccagcctggg tgacagagca agactctgtc tcaaaacaaa acaaaaaaca 40921 aaaaactacg tattaagaca agaaacagac tgggcgcggt ggctcacgcc tgtaatccca 40981 gcactttggg aggctgaggc gggcggatca caaggtcagg agatcgagac catcctggct 41041 aacacggtga aaccccgtct ctactaaaaa atagaaaaaa ttagctgggg tggtggcggg 41101 cgcctatagt ctcagctact cgggaggctg aggcaggaga atggcgtgaa cccgggaggc 41161 agagcttgca gtgagcagag atcgtgccac tgcactccag tctgggtgac agagcaagac 41221 tccgtctcaa aaaaaaaaac aaaaacaaga aacaaattaa actaatgtga tagactactg 41281 ctttgttttc aaaagataca ctccccaaaa gttactgatc taaatacagt agtactatct 41341 ctgtttagta agaaccctga caactaatag tgttcttata tgtaaaatgc tattcttgcc 41401 tttcatttca gaatatactt tttaaatgtg aatttctgga ttttttttta tagcatgttt 41461 gtgtcattag tgaaactgga aaagcaaaat acaaagccta agatgagagt tcaagttgag 41521 tttggaaaca tctggagtcc tattgacatc gccagtaaaa ttatcaatgt tctagttctg 41581 tggccatctg cttagtagag ctttttgcat gtatcttcta agaattttat ctgttttgta 41641 ctttagaaat gtcagttgct gcattcctaa actgtttatt tgcactatga gcctatagac 41701 tatcagttcc ctttgggcgg attgttgttt aacttgtaaa tgaaaaaatt ctcttaaacc 41761 acagcactat tgagtgaaac attgaactca tatctgtaag aaataaagag aagatatatt 41821 agttttttaa ttggtatttt aatttttata tatgcaggaa agaatagaag tgattgaata 41881 ttgttaatta taccaccgtg tgttagaaaa gtaagaagca gtcaattttc acatcaaaga 41941 cagcatctaa gaagttttgt tctgtcctgg aattatttta gtagtgtttc agtaatgttg 42001 actgtatttt ccaacttgtt caaattatta ccagtgaatc tttgtcagca gttccctttt 42061 aaatgcaaat caataaattc ccaaaaattt aactgcttta tgaattcaat ttaaaaatcc 42121 ttaaaataag tcctgtctct ttaaaaaaac ctatgcatag ttatcatttc tctacaaatt 42181 aacctagttt agttttctgt tggttccatt ttccttgttt gttaagtttt agtagctagt 42241 ttaattgtaa tctcaatgat tatgtggtag aatgggttgg cggacgtaca aaaattccta 42301 gctacttcag agacattaaa tttcagacac atggtacact ttatattaca ttttactatg 42361 ctaaaataac acggctttct tttggaattc tgttcagttt ttcagattgt aatctcagct 42421 acatctcaac agattgttct cagatatgtc ctattacctt ctttgtgtag atagtgcttt 42481 attgactaag aacaatgaca acaacacctt ttgttttctg ggaataggag aaaagtttta 42541 agccaaaact cttaattgct tatctgctcc acgtgaggta tgaactatca aacttaggag 42601 ccatctagct tacacgtgtt ccttaaaaag tttgctgtag gccgggcaca gtggctcgta 42661 cctgtagtcc cagcactttt gggagcccag ggtgggggat cacttgagct caggagttca 42721 agaccagcct gggcaacatg gcaaaacgcc atctctacaa aaatacaaaa aaaaaaaaaa 42781 acgctgggtg tggtggcgca caactgtagt cccagctact tgggaggctg aggtgggagg 42841 attgcttgag cttgggaggt gaaggctgca gtgagccttg acagtgccac tacactccag 42901 cctggatgac agagtgagac cctgtctcaa aaaaaagagt ttgctgtaat tcccagcaac 42961 aaagtaggag actcaaacta aataattttc tatagtccta gaacttctta gtttacaaaa 43021 catttttact tctgttatct catttgatct tcatacccat gtaagggttg aggtagatgt 43081 taccacatgt gagtgcaata tccagaactc tgaatccctt cttcccctaa aatgtcagcc 43141 cgctgaggtc cacttggcta ccctcttgaa tactgcatcc agcttcccac tgctgaacct 43201 ctttactctt tttttttcag ttgcacttac cgccttctag taagttgaac catatgaaat 43261 taccattttt gcaggtaaaa aatggccggt gataggcagt ttggcgtcgt ataacccaat 43321 aacatgttat ataatttacc cacaagtggt gggttgctat gtcctggagg agtcagcttc 43381 agactctagc taaatgattg tataaccttg cagctctccc ctaagtgagg aggcaatgtt 43441 gaaagtccca tgtcttatca gaaccaggga ggcagatgag aaactgcctt atggcagctc 43501 ccacaacata gggaggtggg tgacaaatgg ccttgggaca gcttcttccc aagactggtt 43561 atgttacagt gttcctggga ggatcacatg gcattcctcc aagatgggtc agactgctgt 43621 tggccttgtc tgtgtggcgt atgtgaagac attcatggcc agagctgttc ccttagaagc 43681 atctactaaa ttgatctttt cctttcttac ttactgtctg tctcccttag taggctgtca 43741 gctccgtgag tgcaggacct tgccagtcct ggtcactgct atatccccag cacctacaag 43801 agtgcctgga aaattgtagt gctcaataaa tatttgttgg ataaatgata gaatgatagg 43861 aagttaaaaa gcaattaaaa tacttgaaaa gaagcaaaac atttttcatg ttaagcaaaa 43921 aaaaaaaaaa aacttattaa ggatagctaa catgtattga attctatatg caatggaatg 43981 atacttagcg cctttgaatc cttatgataa ccctataagg taggttgttt gggttttttt 44041 aattgtccca actttacaga tgaagaagtg caggtccaga gaggtcacat aatttgccca 44101 ggatcacaca gctagtaagt agcagatgag gaatttgaac ccaggcagtt gtattccacc 44161 atctgccctc ttagttcatt gccacttaac ctataatgcc cagctcttgt gtagaaatta 44221 atacactgat aacatagagg aaaacattaa gctcattgaa tgtaataagt ccagatgact 44281 tgtacattaa acacagcttt ttgaggtcac agctgatctc taagaatgta aactgatttc 44341 ctctggcact aaaaagcatt ttcaaagact gttaagagag tttctccaac attctcttca 44401 gatttttctg ctggcttatt ttatgattct gtggacagct tcagacaaaa taactttctg 44461 gtatgaagga ttgtgtttac tctgcttttt tttttgttgt tttttgggtt ttttgttttg 44521 ttttgttttg tttttgagac agtgccttgc tctgctgctg cccaggctga gtgcaatggc 44581 atgatctcgg ctcactgcaa cctctgcctc cctgggctca ggccaggtgt atgctaccac 44641 tctcagctaa tttttaattt atttttttag agatacggtc ccactctgtt tcccaggctg 44701 gtctcagaac tcctgggctc agacagtccc ccgccatggc ctcccacagt gctgggatta 44761 caggcatcag ccaccatgct cagcttgttc tgccattttc aaatgtgaat tttatagaca 44821 ctttaaacca cttgaaagag tgatgatgtt ttaatgattt tcattattat ttgcaacttc 44881 aagcattaaa cactgccaaa ttaagtttca agttttctct ttacacaata tggatgtact 44941 tcataatgga cttcctcatc atgattaatg agtgaagtga cattcaaact tggtagcttt 45001 tcagtagaac ttcctttccc aacatttttt ctgttccttt aattatggca atatctgaga 45061 gctctgaaca taagtcaaag gtttgattat ttttcatgtg gcttcctctg cttggaactt 45121 tctgccccgc atcttcccgt tgccccctgt gtcctcttgt catgccccta cccttttttg 45181 agtgtgtcta ttttctggca ctacaagaca taacaggctc atcttgtgtt ttccctaccc 45241 tgacccagaa tcagccatta cttcaaggag ccctggttcc attattggag aatactatta 45301 gaaaccagga tctggtgcta ggcatgctca tttctattgg agtgtcatac aaacaatttg 45361 taaattgttt gtaggtcctc ccagtggata ggattaggaa ataaaacatg catactaacc 45421 atgcatacac acacatctac gtctatttct gtatctgtct gtatacatat taaaataaac 45481 atgggttgat aactaatgtt tctgctgtaa tccacagcct tcatcctagc ctgccactct 45541 tcttcttttt agctttttca acagtgggaa atgtggctct tgttatgtac actttattca 45601 cttatttgtt tgaccctagt atcataaagt agttccgtat gcctgtaaca gatcgactaa 45661 ctagagtcca ttatttgcgg aaagatcttt ttgtccgaac gttaccgcag gggtgtccaa 45721 tcttttggct tccctgggcc acactagaaa aagaagaatt gtcttgggcc acacgtaaaa 45781 tacactaaca ctaacgatag ctgataagct aaaaaaaaaa aatcaaaaaa atttcatgat 45841 gttttaagaa agtttactaa tttgtgttgg gccacgttta aagccatccc gggtcgcagg 45901 ttggacaagc ttgccttaca gtatccagtc aaaataatgt tttccaaaat tacttctttt 45961 ctttttcatc cctttcagtg tggccgttat ttataatgca gtttggttca ttagtgtttt 46021 tattacaaat acaccctcag ccttcatatc ctagttttaa tgaattatta cggtgaaaca 46081 taataagagt cagagctata cagaaaggtc tactcagagg tgctttgttc cctcctattc 46141 tgttcccact actcctactt tccactgacc ctgtaagcat catatttatt tttaatggca 46201 gttacatttt taccaagtgc ttactatctg taggcacttg gtgtgtattg cttcttttgg 46261 tgttcacagc aacctcttga ggtaagcact attattatcc cccctttttc ttttttcttt 46321 ctttctttct tttttttttt tttttttttt gacagtctta ctctgttgcc caggcaggag 46381 tgcagtggcg caatctcggc tcactgcaac ctctgcctcc caggttcaag tgattctcct 46441 gcctcagcct cctgagtagc tgcgagtaca ggcacaagcc accacgcccg gctaattttt 46501 gtatttttag tagggatgag gttttgccat gttggccagg gtggtctcga actcctgacc 46561 tcaggtgatc tgcccgcctc gacctcccaa agtgctggga ttacaggcac gaaccactgc 46621 acccggctat tatccccatt ttttagatga gaaagctgaa tcccagagag cataagaagc 46681 ttgtccagag tgacatctct gatgcataac cagtactcaa acctattttt ctgacaccaa 46741 ggcctgtgtg taaactgtaa aggggctgct tggcacctac tttcctaaag ttgtcctatc 46801 ccttctctgt ctgggtcttc ctgaagcttg gcacttctga agtcacctct ctgaaaacat 46861 tctggtaact gttaaatccc ttgttctagc tattcatgtg ttctgtgtgg ttaaacaagg 46921 ttcacaatgg ccacctggcc tttggaactt gggtgaagag gctgccttca gttgatcctc 46981 cccactccca ttttcaaaac atgggtttac atgagttatt tgtgaattag gaaacataac 47041 catgttttga gccttcatag aaaacaaacg tctggggtca tacaggttaa aaggagtaac 47101 caaattcggc actatcattg ttctattcag tagacaattc tggggccttt ctgtgtctca 47161 ggttctgtac tagttgtttc aggactttgg gataaataca aactatccct gccctcaggg 47221 ggcttaaggt caggtgtaca agtgactcta atgtgaggca aggctggatt cagtgctgca 47281 tatctaatgc tatgggaatt caaagaggaa gtgatcagaa tgagaaggga gggatggatc 47341 attccaggag aagcttcagg gaaaagcaac atttaaaatg agacttttga gagtgaggga 47401 aatttggaca ggtggatata gaggatgcaa ggctagagga aaggttttag ccagaaagtc 47461 tgcttgggca aatgcctggg taaaaaaaga aaatccactt tgggaggaca aggcgggcaa 47521 tcgcctgagg tcaggagttc gagaccagcc tggccaacat ggtgaaaccc cgtctctact 47581 aaaaatacaa aaattagctg ggcgtggtgg tgggtgcctg taatcccagc tacttggaag 47641 gctgaggcag gagaatcact tgaacccagg aggcagaggt ttcagtgagc cgagattgcg 47701 ccactgcact ccagcctggg caacaagagt gaaacatcta aaaaaaaaaa aaagaaaatc 47761 acagggcagt gtggggaatg gtgagtattc taatttggtt gtggcagaga ggatgtagaa 47821 ggaagtgata agagagaaag ccggatagga gggcctttgt gccagttagg atgttctaga 47881 cttccagcca ggttgcccag ctcaaactgg cttaaacaat gagggggttt attggctatg 47941 taattgggaa gtgcagaggt agctcaggcc agatcagttt gatccactgc tccattatga 48001 tgtcaaagac ccatgcgatt tccacctcat tattctgctg tccatagagc caacttcatc 48061 ctaaggccag tccttgtggt cagacaaggg ctgccaatag taatctgggt gcaagtttct 48121 ttgagaaaat ctttctgtgt caactctctt aaaaggggtg aaaaatctct ccttaagtcc 48181 cactggccag aatgggccca tgcacccatt tcttaaccag tcactggcaa ctgggggtgg 48241 gattgccgtt tgcccaatca ggtccatttc tggagctaag attaaactcc atttcccttg 48301 ggacacattg aacagaatca gaattcgatg aagaaggaag aagcggagaa ttggtttggt 48361 gttgggtagg caaccaaaaa taacctctgt tgcctcaagt gccaagaaag tggtgttttg 48421 tgcttgttag ggtaaaaatg gggatcatgg aaaatatttt aagtttcata gaccaaaaaa 48481 tattccagtg tttcatcaaa tctaagaggc tatcaattat aagatatacc attattttat 48541 gtaccaccaa ggaagaaaaa atgctgccag tgaagttagg atgtattgca ggttgggttc 48601 tctgggaagc aggctgaaaa ggaggtgaga atgcaggaca tttatgggag aacacccttg 48661 ggattaatac tggaggagga gaaccaagca gggttggtgg ggcacaggga gaagttggga 48721 tgccatgcag tcacaacaaa ggcctcagcc aaccccacgg ggagctcgag aagctgagat 48781 ggcccttcag tgttgccctg ccttgtggtg agtgaattgg gtcttcatat ccccatgttg 48841 actggtcatt ggatgtgggc tcccttagga atgggcatct cttcagcaga ggtagctttc 48901 ttcaaaagag gtgattccaa agagtcaccc actcactgag ggctgtctgc tggcagcatt 48961 ctcagccact actcaaagat gacctgtcca ggaaggggaa cctaggtggc atgacacatt 49021 gtctattaca acatgctact gattataaga gccgggaggt ggggggcaac acaatgtctg 49081 agatattaaa atggaagtct cttagaagaa atggataatt ctataattat agttaatcag 49141 aaaggggaag aagtggggaa atggaccaag ggcctgagag agaaaacaga cgcaacaggc 49201 cactagaaag ataggacact ggagggtggg aagccctagc agtttcttcc agggtgggct 49261 gggcacggtg gctcattcct gtaatcccag aactttggga ggccgaggcg ggcagatcat 49321 ttgaagtcag gagttggaga ccagcctggc caactcctgt ttcaccctgt ctctgccaaa 49381 aatataaaaa attagccggg tgtggttgca tgcgcctgta atcccagcta cttgggaagc 49441 tgaggcagga gaatcgcttg aacccaggag gcagaggttg cagcgaggaa aaatcgtgcc 49501 actgcacttg agcctgggtg acagagtgag actgtctcaa aaaaaaaaaa agtttcttcc 49561 agggtggctt ctgtgccaga gtcaggtgcc ccagctacct ctaatttatg gtcctcctgc 49621 actgggaaac agattttcta cttttggttt catgataaat aacatttccc cctgatttta 49681 aaagttatgg atttggctgg gcatggtggc tcatgcctgt aatcctagca ctttgggagg 49741 tcaaggcagg cagatcactt aaggtcagga gttccagacc agactgggca acatggtgaa 49801 aacccgtgtc taccaaaaaa aaaaaaaaaa aaaaaattag ccaagtgtgg tggtacatgc 49861 cagtagccct agctactcag gagactgagg tgggaggatt acctgagccc aggagatcag 49921 gcctgcagtg agctgtgatt gtgccatttt actccagcct gggtgacaga gtaagaccct 49981 gtctcaaaaa taatagtaat aggctgggcg cggtggctca agcctgtaat cccaacactt 50041 tgggaggcca aggcgggcga tcaattgagg tcaggaactc aagaacagcc ttgccaaaat 50101 ggtgaaactc cgtctctact aaaaatacaa aaatgagccg ggtgtggtgg cgcatgctgc 50161 attcccagct actcaggagg ctgaggcagg agaatcgctt gaactcggga ggcagaggtt 50221 gcagtgagcc gagattgcac cactgcactc cagcctgggt gacagagtga gactccatct 50281 taataataat aaaataataa aaattttaaa aagttatgga tctggatgga gggaaatgga 50341 atgtataaaa gaagtaaaca tacacaagaa gatacaaata cagaataaaa gtaaaatgca 50401 accatcatcc cactaccccg ataccagggt atccgttttt acatcttttc tttcattctt 50461 tctgtcttta tataattgta taaatgctgc ataaacctcc tcttgcctgc tgcctcctca 50521 aagacctccc tccctccttc actgcccttc tgctcctgga gagccaccct ctctccattt 50581 atccttccta tcagcttcag gttcttacca tgttaacaaa aagaaaatct tataagcctg 50641 tcactctcta catacgccgc acctcctttc attcatagcc tttaaaacat atatatagca 50701 gttattgtgg ttatttttct gttcacaaaa taaaaaaaca ctctttctag aaaactggaa 50761 tatagaggca agcttttttt ttttttcaga cggagtttcg ttctgtcgcc ccaggctgga 50821 gtgcagtaac gaaattacag cttactgtaa cctctgcctc ctgggttcaa gatattctct 50881 tgcctcagcc tcctgagtag ctgggattat aggtgcctgc caccacaccc ggctaatttt 50941 tgtattttta gtggaaatgg ggtttcgcca tgttggtcag gctggtctcg aactcctgac 51001 cttgtgatct gcccatctcg gccttccaaa gtgctgggat tacaggtgtg agccactgca 51061 ccctgccgag gcaagatttt tttttttttt tttaagaaaa cccagttatt ccattaccca 51121 atgaaactct aaacatgttg atgtacatcc ttccaaaatt tctttttatg acaacatgct 51181 ttttattttt aattattttt attttatttt aaggtccggg gtacatgtga aggatgtgca 51241 ggtttgttac ataggtaaac gtgtgccttg gtggtttgct gcaccctgtc aacccatcac 51301 ctacgtatta agccccacat gcattagcta ttgatcctga tgctctctct ccctgctggc 51361 tccccagcag gccccggtgt gtgttgttcc cctccctgtt tatgagaaca ctttcttgac 51421 ataaagattt catttattcc catggaattc taaaggcttt tcatacttgt gaaggaataa 51481 tagtttagaa ataaactgaa ctttaaaaga taccattttg aaaaataata tacagccatc 51541 aaaaattata tttatgggaa ctatgcaata atattaaact ctatcatctg ttgactgcct 51601 cctatattcc agaaacttta catacaccaa ttctaatcct tacaagaacg ctgtgtaggc 51661 tttagcatta gatggaccag gtttcaccaa ctgtatggtc ttggataagt acccaacctc 51721 ctgtccctaa gtttcctcac ctgtgaaaac acggtttcta ccagctttca aataagatga 51781 tcaatataag gcacttggaa cagaacctga cacatcataa gcactctata aatgtctatt 51841 atcaccaaat aattccaggt gccttgaaaa tttaaatgaa aaacaaaatc aaaccatgac 51901 aatactagaa gcaaatttag gtgaacactt ttctaatccg ggggtgggcg ggggctgggg 51961 ggaggcaggg agaagacctt ttttttttct ttttgagatg gagtcttgct ctgtccccaa 52021 gctggagtgc agaggcgtga tctcagctca ctgcaacctc tgcctcctgg attcaagtga 52081 ttctcctgcc tcagcctccc gagtagctgg gactatacag gtgcacacca ccacggccag 52141 ctaatttttg tatttttagt agagatgggg tttacaccct gttagccagg atggtctcaa 52201 tttcttgacc tcgtgatccc atccgcgttg gccttccaaa gtgctgggat taccagcatg 52261 agccaccgtg cccggctggg agaagacctt tctaagcatg ataccaaagg cagagacaat 52321 aaaggcaaag aattgacaga attcactatc cgataaaaat cacttctgtg gccgggcgcg 52381 gtggctcaca cctgtaatcc cagcactggg aagccgaggt gggcggattg cttgaggcca 52441 ggagttcaag accagcctgg ccaacatggc aaacctcctg tctctactaa aaatacaaaa 52501 aattagctag gcatggtggc atgcctgtag tcccagctac tcaggaagct gaggcatgag 52561 aatcacttga acctgggagg tagaggttgc agtgagccaa gatcatgcca ctgcactcca 52621 acctgggtga caaagtgaga ctctgtctca aaaaaaataa caattaaaat aaaatcactt 52681 ctgaatggtg gaaagcacca caaagttaga ggtcaagcaa taatttggag aaaagaatta 52741 gtaatttgtt ggacagacaa aagacttttt taatataaca aaaactttaa aaattaaaaa 52801 aatacacatt cgaggacatt ttcctaaaaa cacaggcaaa ggacataaac agcaaagcaa 52861 gaagacagct tgatgtggcc attttatcca gggggacatt ttggtgagcc ctatggacac 52921 agctgccatg atgccaacaa tgtgacagct gtccccttca aaatgcgtta gccccagctc 52981 ttcctctccc ccaacctcca gtccaaagga cttgcacttt ctactttact cctttctgca 53041 ttgtttaatt ttcttttaca aatatgttac ttgtcatcag aaaaaataaa gaaataaata 53101 aactgttaga gtgttagccc cttaaagggg agcaagaatc acctttctaa aagaaagttt 53161 atgttaaata taatattagc atatgtgaat cctgagagaa aagttaacag tttagttgag 53221 ttatttcctc tgtagtctgg agctaaaaat agggaatctt attctgtcct aaatcttttc 53281 cttcctccac ccagtgtctg tctggatcga attcattcat tcactcagta ggcactcact 53341 cagccaggca tggtgctagg cctcaggacc tcgctgtgaa ccagaaactg tccctacccc 53401 catggtgcag gcattctgct tgggagttgg aggaggaaca ggtaaaaaat aattaaatat 53461 tcaggttaac gatatattgt caggtttgag gattgaggaa agggcgcaga gagtggcaag 53521 ggctgctgtt tagatacagt ggccaggagg ctccgatgag gtgacctttg aggagagaca 53581 tgcaggagat gaggggacag tgaagaggat ttctaagaac actccaggca gacagaacag 53641 cgacagccaa ggccctgaag tgggtagggg cctggtgtgt gtgaggaacc tcaggattgc 53701 catcatggct ggagcagaga catgaagcaa gaaggccatg gagatgaggg cagggagatc 53761 ccggagtggg gagatcagat ggggctctgt gtatcatgca aaggactttg cattctgttc 53821 caagagctgg gaaggttgac ataattagga aaaaagccca gaaaagcaga ggtatccatt 53881 tttcatggta aagatgataa tttcaattaa aacacgattc ctggatatat gtaatttgta 53941 ggccaaatgg tgcccaatcc ctacctccct caccccctca cttccctatc cctaaaacct 54001 gtacctcaac tcccgttcgt aagtgatggg agttaggaat agagaaatct cccggttggg 54061 ttttctgagc aaagaggtaa catagcagct ctgttatttc tttcacgtct ccaagggaac 54121 catgactcac ccttagctat cccccgggaa tgtggccctc agagtgttct tttactgatt 54181 cgtgattttg ttatgtacac ctggagtgat ggaacatacc ataccagctt gtcagggttg 54241 ctttgtgcaa agatcgatga cgtgtgtgaa cccggatcca tgcttggggt cctgagtttc 54301 aggtgccatg gccagttgct agcaggttgt atgtgtgtga ccagccccta tgtgagtctc 54361 tcagaccctg aaactccaaa caggcttccc tgggcagaga cattctgtcc atgctctgtg 54421 gcttgctgct cgagagggat agatcacatc ctgtgtggct tcttcttaaa tgaagaagga 54481 cattggaagc ctgtgctggg cttctctgga ccccccgatg tatatgtatg tatattaaag 54541 agagaccagg gtctcactct gttggccagg ctggtcttga actgctagcc tcaagaaatc 54601 ctcccgcttt ggcctcccaa agtgctggga ttacaggcat gagtcaccat gcctgatgta 54661 tatatttttc cagctccctt cttttctgta tcatttgcta ttactacctc ttagctatta 54721 gtataaactg atcttgagtt gtgtaaatct ttctggtgat tcactgtgat gggatgattg 54781 tgtcctctca aaattcctat gttggagtcc tgacccatgg tacctcagaa agtgactgta 54841 tttgaagata ggtctttaaa gaggtcattg taaattaatt aataaggtca ttagggtgga 54901 ctctaatccg atatgactgg tatccttata agaaaaggaa attagcacac agacacacaa 54961 tcagagggag aagacagcca gtcatctaca agccgaggag acagacctca gaagaaacca 55021 accctgcctg caccttgatc ttggacttct agtcgccaga actgtgagaa aacaaatctc 55081 atgtttaagc cagaacctag cacgtggtac ttgttaaggc atccctagaa aactaataca 55141 ctcactgaat gaggcaggta gctgtttctt ttattttttg agacagagtc tcactttgtc 55201 tccaaggcca gagtgcagtg gagcgatcac agctcactgc agcccctgcc ttccaggctc 55261 aagccatcct cccacctcag cttctcaagt agctgggact acaggcatgc accaccacgc 55321 ccagctaatt tttgtatttt tttttttttt ttgtagagac ggggttcacc gtgttgccta 55381 ggctggtctc aaacccctga gctcaagcaa tctgccctcc ttggcctccc aaagtgttgg 55441 atttacaggc gtgagccact gtgcctggat atggtaactt tttcatatgc tatttgcttg 55501 atgattattt ttctgtttct gatataatgc tttttattag agagttatct gtttgttttt 55561 attttttaat gtttgaattt aaaaaattag tataatttgc ataattgaaa aattatattt 55621 gaataattga aatatatttg tataacctta aatttaaaaa ctatgatagc gtatacagtg 55681 aaattttcct ctcatccctt ttttccattt aaccagtgca cttcccaaca gccaacagat 55741 aattttagtt tcctcactcc ctgagctatt ttatgtatat gcaagtagat atgtacatac 55801 atatttctgc cttgtaacac aaatagtagc atactataca actgctctgc ttcttccttt 55861 ttttagctaa gaatattaaa agagtgaaaa agatgtacgc taacaaaaat caaaagaaaa 55921 ctagagtgac attataagaa ctgatgatgt agatttcaga gcaatgatta ctgctaggaa 55981 aaaagggtca ttttacattg atcaaagagg tcaactcatc aggaagacat aataatccta 56041 aacacttatg tacttaacag agcatcaaaa tacatgaagc ataaatgaaa gaaccgtggg 56101 agaaagtaga caaattaatg actgtagttg aagatttcag tatccctcta tgaaaatcag 56161 ggtagtacaa gtacacagaa aattggtaaa gatatatgac ttgaacaaca ttatcaacca 56221 aattgacctc atttacattt gtggaatgtt ccaactaaga acgtcagaaa acatactctt 56281 ttcaagtgca catggaacat ttaccaagat agacaatatt ttgggtcacc gcaagtctca 56341 acacattgaa aggattcaga tcatataaag tatgctccat gaccatgatg gaattgaatt 56401 agaaaccaat aatgtatctc tggaaaatac acaaatattt ggaaattaat atgcccttct 56461 aaaaaattta tgcatcaaga agaaatcaaa aagggatatt tgaaaagtac tatgaaactg 56521 atggccaggc atggtgctca tcgcctgtaa tcccagcact ttgggaggcc gagaaagatg 56581 gatgaagtca ggagttcaag accagcctgg gcaacatggc agaaccccgt ctctactaaa 56641 aatacaaaaa attagccggg cgtggtggtg ggcgcctgta atcccagcag tccacgtgtc 56701 gccgcccctg gtgatggacc agcggggctt cgacga // LOCUS VSVGLYPA 1657 bp ss-RNA VRL 27-JUL-1990 DEFINITION Vesicular stomatitis virus (strain 84-PN-H) glycoprotein gene, complete cds. ACCESSION M35207 KEYWORDS glycoprotein; surface antigen. SOURCE Vesicular stomatitis virus (strain 84-PN-H, Indiana serotype), cDNA to viral RNA. ORGANISM Vesicular stomatitis virus Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses; Rhabdoviridae; Vesiculovirus. REFERENCE 1 (bases 1 to 1657) AUTHORS Bilsel,P.A. and Nichol,S.T. TITLE Polymerase errors accumulating during natural evolution of the glycoprotein gene of vesicular stomatitis Indiana serotype viruses JOURNAL J. Virol. (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by S.T.Nichol, 14-JUN-1990. FEATURES from to/span description pept 30 1565 glycoprotein BASE COUNT 482 a 364 c 364 g 447 t ORIGIN 1 aacagagatc aatctgtttc ctcgacacca tgaagtgcct tttgtactta gctctgttat 61 tcattggggt gaattgtaaa ttcaccatag tttttccaca caatcaaaag ggtaattgga 121 aaaatgttcc ttccagttac cattactgcc catcaagctc agatttgaat tggcataatg 181 gcctaatagg cacagcctta caggtcaaga tgcccaagag tcacaaggct attcaagcag 241 acggttggat gtgtcatgct tccaaatggg ttactacttg cgacttccgc tggtatggac 301 cgaaatatat aacacattcc atccggtcct tcacaccatc cgtggagcaa tgcaaggaaa 361 gcattgaaca gacgaaacag ggaacatggc tgaatccagg ctttcctcct caaagttgcg 421 gatatgcaac tgtgacggat gctgaagcag tgattgtcca ggtgactccc catcatgtac 481 ttgtcgatga gtacacggga gaatgggttg attcccaatt catcaacggg aaatgcagca 541 atgacatctg ccctactgtg cataactcca caacctggca ttccgattac aaagttaaag 601 ggctgtgtga ttctaacctc atttccatgg acatcacctt cttctcagaa gacggggagc 661 tatcatctct agggaaagag ggcactggat tcaggagtaa ctactttgct tacgaaactg 721 gagacaaggc ctgcagaatg cagtactgca agcattgggg agtcagactc ccttcaggtg 781 tttggttcga aatggcagac aaagatctgt ttgctgcagc aaaattccca gaatgtccag 841 aagggtcaag catatctgct ccatctcaga catcagtgga cgtgagtctc atccaggatg 901 ttgagagaat cctggattat tccctttgtc aagaaacctg gagcaagatc aaagcaggtc 961 ttcccatctc tccggtagac ctaagctatc ttgctcctaa aaatccagga accggcccag 1021 ctttcacaat catcaatggt actctgaaat attttgagac caggtacatc agagtcgata 1081 ttgctgctcc aattctccca agaatggttg gaacaatcag tggaactacc acagaaagag 1141 aactgtggga agactgggca ccatatgaag atgtggaaat cgggcctaat ggagttctga 1201 aaaccagttc aggatataaa tttcctttat acatgattgg acatggcatg ctggactctg 1261 atctccatct cagctcgaag gctcaagtgt ttgaacaccc tcacattcag gatgctgcat 1321 cgcagcttcc tgatgatgag actttatttt ttggtgatac cgggttgtcc aaaaatccaa 1381 tcgagctcgt agaagggtgg ttcagcggtt ggaagagctc cattgcttct tttttcttta 1441 tcataggttt gatcattgga ttattcttgg tcctccgagt tggtatctat ctttgcatca 1501 agttaaaaca gaccaaaaaa agaaaaattt acacagacat tgagatgaat cgacttggaa 1561 agtaaccaga atcggacacg acagagtaat catatttaaa ccaaatcgac ttgtgatatc 1621 atgctccaag aggctttaat tatgtttcaa ttttatg // LOCUS VSVGLYPB 1657 bp ss-RNA VRL 27-JUL-1990 DEFINITION Vesicular stomatitis virus (strain 85-CR-B1) glycoprotein gene, complete cds. ACCESSION M35208 KEYWORDS glycoprotein; surface antigen. SOURCE Vesicular stomatitis virus (strain 85-CR-B1, Indiana serotype), cDNA to viral RNA. ORGANISM Vesicular stomatitis virus Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses; Rhabdoviridae; Vesiculovirus. REFERENCE 1 (bases 1 to 1657) AUTHORS Bilsel,P.A. and Nichol,S.T. TITLE Polymerase errors accumulating during natural evolution of the glycoprotein gene of vesicular stomatitis Indiana serotype viruses JOURNAL J. Virol. (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by S.T.Nichol, 14-JUN-1990. FEATURES from to/span description pept 30 1565 glycoprotein BASE COUNT 487 a 358 c 366 g 446 t ORIGIN 1 aacagagatc aatctgtttc actgacacca tgaagtgcct tttgtactta gctcttttat 61 tcataggagt gaactgtaaa ttcaccatag tattcccaca caatcagaaa gggaattgga 121 aaaatgttcc ttccggttac cattactgcc cgtccagctc agatttaaat tggcacaacg 181 gcttgatagg cacagcttta caagtcaaga tgcccaagag tcacaaggct attcaggcag 241 acggttggat gtgccatgct tctaaatggg ttactacttg cgatttccgc tggtatggac 301 cgaaatatat aacacattcc atcagatcct tcacaccatc agtggaacaa tgcaaggaaa 361 gcattgaaca gacaaaacaa ggaacatggc tgaatccggg ttttcctcct cagagttgcg 421 gatatgcaac tgtgacggat gctgaagccg tgattgtcca agtgactcct catcatgtac 481 tggttgatga gtacacagga gagtgggtcg attcccagtt catcaatgga aaatgcagca 541 acgacatatg ccccactgta cataattcca caacctggca ttcagattac aaagtgaaag 601 ggttgtgtga tgccaacctc atctccatgg acatcacttt tttctcagag gaaggagaac 661 tttcatccct agggaaagaa ggcacgggat tcagaagcaa ttattttgct tacgagactg 721 gagacaaagc ctgtagaatg cagtattgca agcattgggg agtcagactc ccgtcaggtg 781 tttggtttga aatggctgat aacgatctgt ttgctgcagc aaaattccca gaatgtcccg 841 aaggatcgag tatctccgcc ccatctcaga catcagtgga tgtaagtctc attcaggatg 901 ttgagagaat cctggattac tctctttgcc aagaaacctg gagcaagatt agagcaggta 961 ttcccatttc cccagtggat ctaagctatc tcgctcctaa aaacccaggg acaggcccag 1021 cctttacgat aatcaacggt actctgaaat actttgagac cagatacatc agagtcgata 1081 ttgctgctcc aattcttcca agaatggtcg ggaccatcag tggaaccacc acagaaaggg 1141 aactgtggga tgactgggca ccatatgagg acgtggaaat tggtccaaac ggagtcttga 1201 aaaccagctc aggatataaa tttcctttat atatgattgg acatggtatg ttggactccg 1261 atctccgcct cagttcaaag gcccaagtgt tcgaacaccc tcacattcag gacgctgcat 1321 cacagcttcc tgatgatgag actttatttt ttggtgatac cgggttatcc aaaaacccaa 1381 ttgagctcgt agaaggttgg tttagtggtt ggaagagctc cattgcttct tttttcttta 1441 tcgtagggtt gataattgga ttattcttgg tgctccgagt tggtatctat ctctgcatca 1501 acttaaggca caccaaaaaa agaaaaattt acacagacat tgagatgaat cgacttggaa 1561 agtaatcaca atactgcaca acagaataat catatttaga ccaaatcgac ttgtgatatc 1621 atgctttaag aggctttaac gatgttttga ttgtatg // LOCUS VSVGLYPC 1657 bp ss-RNA VRL 27-JUL-1990 DEFINITION Vesicular stomatitis virus (strain 87-CR-B2) glycoprotein gene, complete cds. ACCESSION M35209 KEYWORDS glycoprotein; surface antigen. SOURCE Vesicular stomatitis virus (strain 87-CR-B2, Indiana serotype), cDNA to viral RNA. ORGANISM Vesicular stomatitis virus Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses; Rhabdoviridae; Vesiculovirus. REFERENCE 1 (bases 1 to 1657) AUTHORS Bilsel,P.A. and Nichol,S.T. TITLE Polymerase errors accumulating during natural evolution of the glycoprotein gene of vesicular stomatitis Indiana serotype viruses JOURNAL J. Virol. (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by S.T.Nichol, 14-JUN-1990. FEATURES from to/span description pept 30 1565 glycoprotein BASE COUNT 489 a 358 c 364 g 446 t ORIGIN 1 aacagagatc aatctgtttc actgacacca tgaagtgcct tttgtactta gctcttttat 61 tcataggagt gaactgtaaa ttcaccatag tattcccaca caatcagaaa gggaattgga 121 aaaatgttcc ttccggttac cattactgcc cgtccagctc agatttaaat tggcacaacg 181 gcttgatagg cacagcttta caagtcaaga tgcccaagag tcacaaggct attcaagcag 241 acggttggat gtgccatgct tctaaatggg ttactacttg cgatttccgc tggtacggac 301 cgaaatatat aacacattcc atcagatcct tcacaccatc agtggaacaa tgcaaggaaa 361 gcattgaaca gacaaaacaa ggaacatggc tgaatccagg ttttcctcct cagagttgcg 421 gatatgcaac tgtgacggat gctgaagccg tgattgtcca agtgactcct catcatgtac 481 tggttgatga gtacacagga gagtgggtcg attcccagtt catcaatgga aaatgcagca 541 acgacatatg ccccactgta cataattcca caacctggca ttcagattac aaagtgaaag 601 ggttgtgtga tgccaacctc atctccatgg acatcacttt tttctcagag gatggggaac 661 tttcatccct agggaaagaa ggcacgggat tcagaagcaa ttattttgct tacgagactg 721 gagacaaagc ctgtagaatg cagtattgca agcattgggg agtcagactc ccgtcaggtg 781 tttggtttga aatggctgat aacgatctgt ttgctgcagc aaaattccca gaatgtcccg 841 aaggatcgag tatctccgcc ccatctcaga catcagtgga tgtaagtctc attcaggatg 901 ttgagagaat cctggattac tctctttgcc aagaaacctg gagcaagatt agaacaggta 961 ttcccatttc cccagtggat ctaagctatc tcgctcctaa aaacccaggg acaggcccag 1021 cctttacgat aatcaacggt actctgaaat actttgagac cagatacatc agagtcgata 1081 ttgctgctcc aattcttcca agaatggtcg ggaccatcag tggaaccacc acagaaaggg 1141 aactgtggga tgactgggca ccatacgagg acgtggaaat tggtccaaac ggagttttga 1201 aaaccagctc aggatataaa tttcctttat atatgattgg acatggtatg ttggactccg 1261 atctccgcat cagttcaaag gcccaagtgt tcgaacaccc tcacattcag gacgctgcat 1321 cacagcttcc tgatgatgag actttatttt ttggtgatac cgggttatcc aaaaacccaa 1381 ttgagctcgt agaaggttgg tttagtggtt ggaagagctc cattgcttct tttttcttta 1441 tcgtagggtt gataattgga ttattcttgg tgctccgagt tggtatctat ctctgcatca 1501 acttaaggca caccaaaaaa agaaaaattt acacagacat tgagatgaat cgacttggaa 1561 agtaatcaca atactgcaca acagaataat catatttaga ccaaatcgac ttgtgatatc 1621 atgctttaag aggctttaac gatgttttga ttgtatg // LOCUS VSVGLYPD 1657 bp ss-RNA VRL 27-JUL-1990 DEFINITION Vesicular stomatitis virus (strain 84-CR-B) glycoprotein gene, complete cds. ACCESSION M35210 KEYWORDS glycoprotein; surface antigen. SOURCE Vesicular stomatitis virus (strain 84-CR-B, Indiana serotype), cDNA to viral RNA. ORGANISM Vesicular stomatitis virus Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses; Rhabdoviridae; Vesiculovirus. REFERENCE 1 (bases 1 to 1657) AUTHORS Bilsel,P.A. and Nichol,S.T. TITLE Polymerase errors accumulating during natural evolution of the glycoprotein gene of vesicular stomatitis Indiana serotype viruses JOURNAL J. Virol. (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by S.T.Nichol, 14-JUN-1990. FEATURES from to/span description pept 30 1565 glycoprotein BASE COUNT 488 a 357 c 365 g 447 t ORIGIN 1 aacagagatc aatctgtttc actgacacca tgaagtgcct tttgtactta gctcttttat 61 tcataggagt gaactgtaaa ttcaccatag tattcccaca caatcagaaa gggaattgga 121 aaaatgttcc ttccggttac cattactgcc cgtccagctc agatttaaat tggcacaacg 181 gcttgatagg cacagcttta caagtcaaga tgcccaagag tcacaaggct attcaggcag 241 acggttggat gtgccatgct tctaaatggg ttactacttg cgatttccgc tggtatggac 301 cgaaatatat aacacattcc atcagatcct tcacaccatc agtggaacaa tgcaaggaaa 361 gcattgaaca gacaaaacaa ggaacatggc tgaatccggg ttttcctcct cagagttgcg 421 gatatgcaac tgtgacggat gctgaagccg tgattgtcca agtgactcct catcatgtac 481 tggttgatga gtacacagga gagtgggtcg attcccagtt catcaatgga aaatgcagca 541 acgacatatg ccccactgta cataattcca caacctggca ttcagattac aaagtgaaag 601 gattgtgtga tgccaacctc atctccatgg acatcacttt tttctcagag gaaggagaac 661 tttcatccct agggaaagaa ggcacgggat tcagaagcaa ttattttgct tacgagactg 721 gagacaaagc ctgtagaatg cagtattgca agcattgggg agtcagactc ccgtcaggtg 781 tttggtttga aatggctgat aacgatctgt ttgctgcagc aaaattccca gaatgtcccg 841 aaggatcgag tatctccgcc ccatctcaga catcagtgga tgtaagtctc attcaggatg 901 ttgagagaat cctggattac tctctttgcc aagaaacctg gagcaagatt agagcaggta 961 ttcccatttc cccagtggat ctaagctatc tcgctcctaa aaacccaggg acaggcccag 1021 cctttacgat aatcaacggt actctgaaat actttgagac cagatacatc agagtcgata 1081 ttgctgctcc aattcttcca agaatggtcg ggaccatcag tggaaccacc acagaaaggg 1141 aactgtggga tgactgggca ccatatgagg acgtggaaat tggtccaaac ggagttttga 1201 aaaccagctc aggatataaa tttcctttat atatgattgg acatggtatg ttggactccg 1261 atctccgcct cagttcaaag gcccaagtgt tcgaacaccc tcacattcag gacgctgcat 1321 cacagcttcc tgatgatgag actttatttt ttggtgatac cgggttatcc aaaaacccaa 1381 ttgagctcgt agaaggttgg tttagtggtt ggaagagctc cattgcttct tttttcttta 1441 tcgtagggtt gataattgga ttattcttgg tgctccgagt tggtatctat ctctgcatca 1501 acttaaggca caccaaaaaa agaaaaattt acacagacat tgagatgaat cgacttggaa 1561 agtaatcaca atactgcaca acagaataat catatttaga ccaaatcgac ttgtgatatc 1621 atgctttaag aggctttaac gatgttttga ttgtatg // LOCUS VSVGLYPE 1665 bp ss-RNA VRL 27-JUL-1990 DEFINITION Vesicular stomatitis virus (strain 87-OA-B) glycoprotein gene, complete cds. ACCESSION M35211 KEYWORDS glycoprotein; surface antigen. SOURCE Vesicular stomatitis virus (strain 87-OA-B, Indiana serotype), cDNA to viral RNA. ORGANISM Vesicular stomatitis virus Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses; Rhabdoviridae; Vesiculovirus. REFERENCE 1 (bases 1 to 1665) AUTHORS Bilsel,P.A. and Nichol,S.T. TITLE Polymerase errors accumulating during natural evolution of the glycoprotein gene of vesicular stomatitis Indiana serotype viruses JOURNAL J. Virol. (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by S.T.Nichol, 14-JUN-1990. FEATURES from to/span description pept 30 1565 glycoprotein BASE COUNT 476 a 364 c 358 g 467 t ORIGIN 1 aacagagatc gatctgtttc cttgacacta tgaagtgcct tttgtactta gcctttttat 61 tcattggggt gaattgcaag ttcaccatag tttttccaca caaccaaaaa ggaaactgga 121 aaaatgttcc ttccaattac cattattgcc cgtcaagctc agatttaaat tggcataatg 181 acttaatagg cacagccttg caagtcaaaa tgcccaagag tcacaaggct attcaagcag 241 acggttggat gtgtcatgct tccaaatggg tcactacttg tgatttccgc tggtatggac 301 caaagtatat aacacattcc atccgatcct tcactccatc tgtagaacaa tgcaaggaaa 361 gcattgaaca aacgaaacaa ggaacttggc tgaacccagg tttccctcct caaagttgtg 421 gatatgcaac tgtgacggat gctgaagcag tgattgtcca ggtgactcct caccatgtgc 481 ttgttgatga atacacagga gaatgggttg attcacagtt catcgacgga aaatgcagca 541 atgacatatg ccccactgtc cataactcca caacctggca ttctgactat aaggtcaaag 601 ggctatgtga ttctaacctt atttccatgg acatcacctt cttctcagag gacggagagc 661 tatcatccct aggaaaggag ggtacagggt tcagaagtaa ctactttgct tatgaaactg 721 gagacaaggc ctgcaaaatg cagtactgca agcattgggg agtcagactc ccatcaggtg 781 tctggttcga gatggctgat aaggatctct ttgctgcagc cagattccct gaatgcccag 841 aagggtcaag tatctctgct ccatctcaga cctcagtgga tgtaagtctc attcaggacg 901 ttgagaggat cttggattat tccctctgcc aagaaacctg gagcaaaatc agagcgggtc 961 ttcccatctc tccagtggat ctcagctatc ttgctcctaa aaacccagga accggccctg 1021 ctttcaccat aatcaatggg accctaaaat actttgaaac cagatacatc agggtcgaga 1081 ttgctgctcc gatcctctca agaatggtcg gaatgatcag tggaactacc accgagaggg 1141 aactgtggga tgactgggca ccatatgaag acgtggaaat tggacccaat ggagttctga 1201 ggaccagttc aggatataag tttcctttat atatgattgg acatggtatg ttggactccg 1261 atcttcgtct tagctcaaag gctcaggtat tcgaacatcc tcacattcaa gacgctgctt 1321 cgcagcttcc tgatgatgag actttatttt ttggtgatac tgggctatcc aaaaatccaa 1381 tcgagcttgt agaaggttgg ttcagtggtt ggaagagctc tattgcctcc tttttcttta 1441 tcatagggtt gatcattgga ctattcttgg ttctccgagt tagtatttat ctttgcatta 1501 aattaaaaca cactaagaaa agacagattt atacagacat agagatgaac cgacttggaa 1561 agtaactcaa atcctgcaca acagattctt catgcttgaa ccaaatcgac ttgtgatatc 1621 atgctcaaag aggccttaat tatattttaa tttttagttt ttatg // LOCUS VSVGLYPF 1665 bp ss-RNA VRL 27-JUL-1990 DEFINITION Vesicular stomatitis virus (strain 87-VC-B) glycoprotein gene, complete cds. ACCESSION M35212 KEYWORDS glycoprotein; surface antigen. SOURCE Vesicular stomatitis virus (strain 87-VC-B, Indiana serotype), cDNA to viral RNA. ORGANISM Vesicular stomatitis virus Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses; Rhabdoviridae; Vesiculovirus. REFERENCE 1 (bases 1 to 1665) AUTHORS Bilsel,P.A. and Nichol,S.T. TITLE Polymerase errors accumulating during natural evolution of the glycoprotein gene of vesicular stomatitis Indiana serotype viruses JOURNAL J. Virol. (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by S.T.Nichol, 14-JUN-1990. FEATURES from to/span description pept 30 1565 glycoprotein BASE COUNT 476 a 362 c 358 g 469 t ORIGIN 1 aacagagatc gatctgtttc cttgacacta tgaagtgcct tttgtactta gcctttttat 61 tcattggggt gaattgcaag ttcaccatag tttttccaca caaccaaaaa ggaaactgga 121 aaaatgttcc ttccaattac cattattgcc cgtcaagctc agatttaaat tggcataatg 181 acttaatagg cacagccttg caagttaaaa tgcccaagag tcacaaggct attcaagcag 241 acggttggat gtgtcatgct tccaaatggg tcactacttg tgatttccgc tggtatggac 301 caaagtatat aacacattcc atccgatcct tcactccatc tgtagaacaa tgcaaggaaa 361 gcattgaaca aacgaaacaa ggaacttggc tgaacccagg tttccctcct caaagttgtg 421 gatatgcaac tgtgacggat gctgaagcag tgattgtcca ggtgactcct caccatgtgc 481 ttgttgatga atacacagga gaatgggttg attcacagtt catcgacgga aaatgcagca 541 atgacatatg ccccactgtc cataactcca caacctggca ttctgactat aaggtcaaag 601 ggctatgtga ttctaacctt atttccatgg acatcacctt cttctcagag gacggagagc 661 tatcatccct aggaaaggag ggtacagggt tcagaagtaa ctactttgct tatgaaactg 721 gagacaaggc ctgcaaaatg cagtactgca agcattgggg agtcagactc ccatcaggtg 781 tctggttcga gatggctgat aaggatctct ttgctgcagc cagattccct gaatgcccag 841 aagggtcaag tatctctgct ccatctcaga cctcagtgga tgtaagtctc attcaggacg 901 ttgagaggat cttggattat tccctctgcc aagaaacctg gagcaaaatc agagcgggtc 961 ttcccatctc tccagtggat ctcagctatc ttgctcctaa aaacccagga accggccctg 1021 ctttcaccat aatcaatggg accctaaaat actttgaaac cagatacatc agggtcgaga 1081 ttgctgctcc gatcctctca agaatggtcg gaatgatcag tggaactacc accgagaggg 1141 aactgtggga tgactgggca ccatatgaag acgtggaaat tggacccaat ggagttctga 1201 ggaccagttc aggatataag tttcctttat atatgattgg acatggtatg ttggactccg 1261 atcttcgtct tagctcaaag gctcaggtat ttgaacatcc tcacattcaa gacgctgctt 1321 cgcagcttcc tgatgatgag actttatttt ttggtgatac tgggctatcc aaaaatccaa 1381 tcgagcttgt agaaggttgg ttcagtggtt ggaagagctc tattgcctcc tttttcttta 1441 tcatagggtt gatcattgga ctattcttgg ttctccgagt tagtatttat ctttgcatta 1501 aattaaaaca cactaagaaa agacagattt atacagacat agagatgaac cgacttggaa 1561 agtaactcaa atcctgcaca acagattctt catgcttgaa ccaaatcgac ttgtgatatc 1621 atgctcaaag aggccttaat tatattttaa tttttagttt ttatg // LOCUS VSVGLYPG 1657 bp ss-RNA VRL 27-JUL-1990 DEFINITION Vesicular stomatitis virus (strain 84-PN-B1) glycoprotein gene, complete cds. ACCESSION M35213 KEYWORDS glycoprotein; surface antigen. SOURCE Vesicular stomatitis virus (strain 84-PN-B1, Indiana serotype), cDNA to viral RNA. ORGANISM Vesicular stomatitis virus Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses; Rhabdoviridae; Vesiculovirus. REFERENCE 1 (bases 1 to 1657) AUTHORS Bilsel,P.A. and Nichol,S.T. TITLE Polymerase errors accumulating during natural evolution of the glycoprotein gene of vesicular stomatitis Indiana serotype viruses JOURNAL J. Virol. (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by S.T.Nichol, 14-JUN-1990. FEATURES from to/span description pept 30 1565 glycoprotein BASE COUNT 488 a 356 c 366 g 447 t ORIGIN 1 aacagagatc aatctgtttc actgacacca tgaagtgcct tttgtactta gctcttttat 61 tcataggagt gaactgtaaa ttcaccatag tattcccaca caatcagaaa gggaattgga 121 aaaatgttcc ttctggttac cattactgcc cgtccagctc agatttaaat tggcacaacg 181 gcttgatagg cacagcttta caagtcaaga tgcccaagag tcacaaggct attcaagcag 241 acggttggat gtgccatgct tctaaatggg ttactacttg cgatttccgc tggtatggac 301 cgaaatatat aacacattcc atcagatcct tcacaccatc agtggaacaa tgcaaggaaa 361 gcattgaaca gacaaaacaa ggaacatggc tgaatccagg ttttcctcct cagagttgcg 421 gatatgcaac tgtgacggat gctgaagccg tgattgtcca agtgactcct catcatgtac 481 tggttgatga gtacacagga gagtgggtcg attcccagtt catcaatgga aaatgcagca 541 acgacatatg ccccactgta cataattcca caacctggca ttcagattac aaagtgaaag 601 ggttgtgtga tgccaacctc atctccatgg acatcacttt tttctcagag gaaggagaac 661 tttcatccct agggaaagaa ggcacgggat tcagaagcaa ttattttgct tacgagactg 721 gagacaaagc ctgtagaatg cagtattgca agcattgggg agtcagactc ccgtcaggtg 781 tttggtttga aatggctgat aacgatctgt ttgctgcagc aaaattccca gaatgtcccg 841 aaggatcggg tatctccgcc ccatctcaga catcagtgga tgtaagtctc attcaggatg 901 ttgagagaat cctggattac tctctttgcc aagaaacctg gagcaagatt agagcaggta 961 ttcccatttc cccagtggat ctaagctatc tcgctcctaa aaacccaggg acaggcccag 1021 cctttacgat aatcaacggt actctgaaat actttgagac cagatacatc agagtcgata 1081 ttgctgctcc aattcttcca agaatggtcg ggaccatcag tggaaccacc acagaaaggg 1141 aactgtggga tgactgggca ccatatgagg acgtggaaat tggtccaaac ggagttttga 1201 aaaccagctc aggatataaa tttcctttat atatgattgg acatggtatg ttggactccg 1261 atctccgcct cagttcaaag gcccaagtgt tcgaacaccc tcacattcag gacgctgcat 1321 cacagcttcc tgatgatgag actttatttt ttggtgatac cgggttatcc aaaaacccga 1381 ttgagctcgt agaaggttgg tttagtggtt ggaagagctc cattgcttct tttttcttta 1441 tcgtagggtt gataattgga ttattcttgg tgctccgagt tggtatctat ctatgcatca 1501 acttacggca caccaaaaaa agaaaaattt acacagacat tgagatgaat cgacttggaa 1561 agtaatcaca atactgcaca acagaataat catatttaga ccaaatcgac ttgtgatatc 1621 atgctttaag aggctttaac gatgtttgaa ttgtatg // LOCUS VSVGLYPH 1665 bp ss-RNA VRL 27-JUL-1990 DEFINITION Vesicular stomatitis virus (strain Mudd-Summers) glycoprotein gene, complete cds. ACCESSION M35214 KEYWORDS glycoprotein; surface antigen. SOURCE Vesicular stomatitis virus (strain Mudd-Summers, Indiana serotype), cDNA to viral RNA. ORGANISM Vesicular stomatitis virus Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses; Rhabdoviridae; Vesiculovirus. REFERENCE 1 (bases 1 to 1665) AUTHORS Bilsel,P.A. and Nichol,S.T. TITLE Polymerase errors accumulating during natural evolution of the glycoprotein gene of vesicular stomatitis Indiana serotype viruses JOURNAL J. Virol. (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by S.T.Nichol, 14-JUN-1990. FEATURES from to/span description pept 30 1565 glycoprotein BASE COUNT 477 a 366 c 353 g 469 t ORIGIN 1 aacagagatc gatctgtttc cttgacacca tgaagtgcct tttgtactta gcttttttat 61 tcatcggggt gaattgcaag ttcaccatag tttttccaca caaccaaaaa ggaaactgga 121 aaaatgttcc ttccaattac cattattgcc cgtcaagctc agatttaaat tggcataatg 181 acttagtagg cacagcctta caagtcaaaa tgcccaagag tcacaaggct attcaagcag 241 acggttggat gtgtcatgct tccaaatggg tcactacttg tgatttccgc tggtacggac 301 cgaagtatat aacacattcc atccgatcct tcactccatc tgtagaacaa tgcaaggaaa 361 gcattgaaca aacgaaacaa ggaacttggc tgaatccagg cttccctcct caaagttgtg 421 gatatgcaac tgtgacggat gctgaagcag cgattgtcca ggtgactcct caccatgtgc 481 ttgttgatga atacacagga gaatgggttg attcacagtt catcaacgga aaatgcagca 541 atgacatatg ccccactgtc cataactcca caacctggca ttccgactat aaggtcaaag 601 ggctatgtga ttctaacctc atttccatgg acatcacctt cttctcagag gacggagagc 661 tatcatccct aggaaagaag ggcacagggt tcagaagtaa ctactttgct tatgaaactg 721 gagacaaggc ctgcaaaatg cagtactgca agcattgggg agtcagactc ccatcaggtg 781 tctggttcga gatggctgat aaggatctct ttgctgcagc cagattccct gaatgcccag 841 aagggtcaag tatctctgct ccatctcaga cctcagtgga tgtaagtctc attcaggacg 901 ttgagaggat cttggattat tccctctgcc aagaaacctg gagcaaaatc agagcgggtc 961 ttcccatctc tccagtggat ctcagctatc ttgctcctaa aaacccagga accggtcctg 1021 tctttaccat aatcaatggt accctaaaat actttgagac cagatacatc agagtcgata 1081 ttgctgctcc aatcctctca agaatggtcg gaatgatcag tggaactacc acagaaaggg 1141 tactgtggga tgactgggct ccatatgaag acgtgggaat tggacccaat ggagttctga 1201 ggaccagttc aggatataag tttcctttat atatgattgg acatggtatg ttggactccg 1261 atcttcatct tagctcaaag gctcaggtgt ttgaacatcc tcacattcaa gacgctgctt 1321 cgcagcttcc tgatggtgag actttatttt ttggtgatac tgggctatcc aaaaatccaa 1381 tcgagtttgt agaaggttgg ttcagtagtt ggaagagctc tattgcctct tttttcttta 1441 ccatagggtt aatcattgga ctattcttgg ttctccgagt tggtatttat ctttgcatta 1501 aattaaagca caccaagaaa agacagattt atacagacat agagatgaac cgacttggaa 1561 cgtaactcaa atcctgcaca acagattctt catgtttgaa ccaaatcaac ttgtgatatc 1621 atgctcaaag aggccttaat tatattttaa tttttaattt ttatg // LOCUS VSVGLYPI 1665 bp ss-RNA VRL 27-JUL-1990 DEFINITION Vesicular stomatitis virus (strain 86-DF-P) glycoprotein gene, complete cds. ACCESSION M35215 KEYWORDS glycoprotein; surface antigen. SOURCE Vesicular stomatitis virus (strain 86-DF-P, Indiana serotype), cDNA to viral RNA. ORGANISM Vesicular stomatitis virus Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses; Rhabdoviridae; Vesiculovirus. REFERENCE 1 (bases 1 to 1665) AUTHORS Bilsel,P.A. and Nichol,S.T. TITLE Polymerase errors accumulating during natural evolution of the glycoprotein gene of vesicular stomatitis Indiana serotype viruses JOURNAL J. Virol. (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by S.T.Nichol, 14-JUN-1990. FEATURES from to/span description pept 30 1565 glycoprotein BASE COUNT 477 a 363 c 357 g 468 t ORIGIN 1 aacagagatc gatctgtttc cttgacacta tgaagtgcct tttgtactta gcctttttat 61 tcattggggt gaattgcaag ttcaccatag tttttccaca caaccaaaaa ggaaactgga 121 aaaatgttcc ttccaattac cattattgcc cgtcaagctc agatttaaat tggcataatg 181 acttaatagg cacagccttg caagtcaaaa tgcccaagag tcacaaggct attcaagcag 241 acggttggat gtgtcatgct tccaaatggg tcactacttg tgatttccgc tggtatggac 301 caaagtatat aacacattcc atccgatcct tcactccatc tgtagaacaa tgcaaggaaa 361 gcattgaaca aacgaaacaa ggaacttggc tgaacccagg tttccctcct caaagttgtg 421 gatatgcaac tgtgacggat gctgaagcag tgattgtcca ggtgactcct caccatgtgc 481 ttgttgatga atacacagga gaatgggttg attcacagtt catcgacgga aaatgcagca 541 atgacatatg ccccactgtc cataactcca caacctggca ttctgactat aaggtcaaag 601 ggctatgtga ttctaacctt atttccatgg acatcacctt cttctcagag gacggagagc 661 tatcatccct aggaaaggag ggtacagggt tcagaagtaa ctactttgct tatgaaactg 721 gagacaaggc ctgcaaaatg cagtactgca agcattgggg agtcagactc ccatcaggtg 781 tctggttcga gatggctgat aaggatctct ttgctgcagc cagattccct gaatgcccag 841 aagggtcaag tatctctgct ccatctcaga cctcagtgga tgtaagtctc attcaggacg 901 ttgagaggat cttggattat tccctctgcc aagaaacctg gagcaaaatc agagcgggtc 961 ttcccatctc tccagtggat ctcagctatc ttgctcctaa aaacccagga accggccctg 1021 ctttcaccat aatcaatggg accctaaaat actttgaaac cagatacatc agggtcgaga 1081 ttgctgctcc gatcctctca agaatggtcg gaatgatcag tggaactacc accgagaggg 1141 aactgtggga tgactgggca ccatatgaag acgtggaaat tggacccaat ggagttctga 1201 ggaccagttc aggatataag tttcctttat atatgattgg acatggtatg ttggactccg 1261 atcttcatct tagctcaaag gctcaggtat tcgaacatcc tcacattcaa gacgctgctt 1321 cgcagcttcc tgatgatgag actttatttt ttggtgatac tgggctatcc aaaaatccaa 1381 tcgagcttgt agaaggttgg ttcagtggtt ggaagagctc tattgcctct tttttcttta 1441 tcatagggtt gatcattgga ctattcttgg ttctccgagt tagtatttat ctttgcatta 1501 aattaaaaca cactaagaaa agacagattt atacagacat agagatgaac cgacttggaa 1561 agtaactcaa atcctgcaca acagattctt catgcttgaa ccaaatcgac ttgtgatatc 1621 atgctcaaag aggccttaat tatattttaa tttttagttt ttatg // LOCUS VSVGLYPJ 1788 bp ss-RNA VRL 27-JUL-1990 DEFINITION Vesicular stomatitis virus (strain 82-HD-B) glycoprotein gene, complete cds. ACCESSION M35216 KEYWORDS glycoprotein; surface antigen. SOURCE Vesicular stomatitis virus (strain 82-HD-B, Indiana serotype), cDNA to viral RNA. ORGANISM Vesicular stomatitis virus Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses; Rhabdoviridae; Vesiculovirus. REFERENCE 1 (bases 1 to 1788) AUTHORS Bilsel,P.A. and Nichol,S.T. TITLE Polymerase errors accumulating during natural evolution of the glycoprotein gene of vesicular stomatitis Indiana serotype viruses JOURNAL J. Virol. (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by S.T.Nichol, 14-JUN-1990. FEATURES from to/span description pept 30 1565 glycoprotein BASE COUNT 508 a 362 c 344 g 574 t ORIGIN 1 aacagagatc aatctgtttc cttgacacta tgaagtgcct tttgtactta gctttgttat 61 tcattggggt gtattgcaag ttcaccacag ttttccctca caacaaaaaa ggagactgga 121 aaaatgttcc ttccaattac cattattgtc catcaagctc agatttaaat tggcataatg 181 acttaatagg cacagcctta caagtcaaaa tgcccaagag tcacaaggct attcaagcag 241 atggttggat gtgtcatgct tccaagtggg tcactacttg tgatttccgc tggtatggac 301 cgaaatatat aacacattcc atcagatcct tcactccatc tgtagaacaa tgcaaggaaa 361 gcatcgaaca gacaaaacaa ggaacctggc tgaatccagg cttccctccc caaagttgcg 421 gatacgcaac tgtgacagat gctgaagcag tgattgtcca ggtgactcct caccatgtgc 481 ttgttgatga atacacagga gaatgggtcg attcacagtt catcaacgga aaatgcagtg 541 atgatatatg ccccactgtt cataactcca caacctggca ttctgactat aaggttaaag 601 gactatgtga ttctaacctc atttccatgg acatcacctt cttctcagaa gacggagagc 661 tatcatccct aggaaaagag ggtacagggt tcagaagtaa ctactttgct tatgaaactg 721 gagacaaggc ctgcaaaatg cagtactgca agcattgggg agtcagactc ccatcaggtg 781 tctggttcga gatggctgat aagaatctct ttgctgcagc caaattccct gaatgcccag 841 aaggttcaag tatctctgct ccttctcaga cctcagtgga tgtgagtctc attcaggatg 901 ttgagaggat cttggattat tccctctgcc aagaaacctg gagcaaaatc agagcgggtc 961 ttcccatatc tccagtggat ctcagctatc ttgctcctaa aaacccagga accggccctg 1021 ccttcaccat aatcaatggt accctaaaat actttgagac cagatacatc agggtcgata 1081 ttgctgctcc aatcctctca agaatggtcg gaatgattag tggaactacc accgaaagag 1141 aactatggga tgactgggca ccatatgaag acgtggagat tggacccaat ggtgttctga 1201 ggaccagttc aggatataag tttcctttat atatgattgg acatggtatg ttggactccg 1261 atcttcatct tagctcaaag gctcaggtgt tcgaacatcc tcacattcca gacgctactt 1321 cgcagcttcc tgatgatgag actttatttt ttggtgatac agggctatcc aaaaatccaa 1381 tcgagcttgt agaaggttgg ttcagtggtt ggaagagctc tattgcttcc tttttcttta 1441 tcatagggtt aatcattgga ttattctttg ttctccgaat tggtgtttat ctttgcatta 1501 aattaaagca caccaataaa agacagattt atacagacat agagatgaac cgacttggaa 1561 attaactcaa atcctgcaca acagattctt catgtttgaa ttaaatccac ttgtgatgca 1621 aagctcaaag aggccttaat catattttaa ttttttaatt tttaattttt aatttttaat 1681 ttttaatttt taatttttat tttttatttt tattttttat tttttatttt tattttttat 1741 ttttattttt tatcttttta ttttttattt ttatttttat tttttatg // LOCUS VSVGLYPK 1788 bp ss-RNA VRL 27-JUL-1990 DEFINITION Vesicular stomatitis virus (strain 83-HD-B1) glycoprotein gene, complete cds. ACCESSION M35217 KEYWORDS glycoprotein; surface antigen. SOURCE Vesicular stomatitis virus (strain 83-HD-B1, Indiana serotype), cDNA to viral RNA. ORGANISM Vesicular stomatitis virus Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses; Rhabdoviridae; Vesiculovirus. REFERENCE 1 (bases 1 to 1788) AUTHORS Bilsel,P.A. and Nichol,S.T. TITLE Polymerase errors accumulating during natural evolution of the glycoprotein gene of vesicular stomatitis Indiana serotype viruses JOURNAL J. Virol. (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by S.T.Nichol, 14-JUN-1990. FEATURES from to/span description pept 30 1565 glycoprotein BASE COUNT 508 a 362 c 344 g 574 t ORIGIN 1 aacagagatc aatctgtttc cttgacacta tgaagtgcct tttgtactta gctttgttat 61 tcattggggt gtattgcaag ttcaccacag ttttccctca caacaaaaaa ggagactgga 121 aaaatgttcc ttccaattac cattattgtc catcaagctc agatttaaat tggcataatg 181 acttaatagg cacagcctta caagtcaaaa tgcccaagag tcacaaggct attcaagcag 241 atggttggat gtgtcatgct tccaagtggg tcactacttg tgatttccgc tggtatggac 301 cgaaatatat aacacattcc atcagatcct tcactccatc tgtagaacaa tgcaaggaaa 361 gcatcgaaca gacaaaacaa ggaacctggc tgaatccagg cttccctccc caaagttgcg 421 gatacgcaac tgtgacagat gctgaagcag tgattgtcca ggtgactcct caccatgtgc 481 ttgttgatga atacacagga gaatgggtcg attcacagtt catcaacgga aaatgcagtg 541 atgatatatg ccccactgtt cataactcca caacctggca ttctgactat aaggttaaag 601 gactatgtga ttctaacctc atttccatgg acatcacctt cttctcagaa gacggagagc 661 tatcatccct aggaaaagag ggtacagggt tcagaagtaa ctactttgct tatgaaactg 721 gagacaaggc ctgcaaaatg cagtactgca agcattgggg agtcagactc ccatcaggtg 781 tctggttcga gatggctgat aagaatctct ttgctgcagc caaattccct gaatgcccag 841 aaggttcaag tatctctgct ccttctcaga cctcagtgga tgtgagtctc attcaggatg 901 ttgagaggat cttggattat tccctctgcc aagaaacctg gagcaaaatc agagcgggtc 961 ttcccatatc tccagtggat ctcagctatc ttgctcctaa aaacccagga accggccctg 1021 ccttcaccat aatcaatggt accctaaaat actttgagac cagatacatc agggtcgata 1081 ttgctgctcc aatcctctca agaatggtcg gaatgattag tggaactacc accgaaagag 1141 aactatggga tgactgggca ccatatgaag acgtggagat tggacccaat ggtgttctga 1201 ggaccagttc aggatataag tttcctttat atatgattgg acatggtatg ttggactccg 1261 atcttcatct tagctcaaag gctcaggtgt tcgaacatcc tcacattcca gacgctactt 1321 cgcagcttcc tgatgatgag actttatttt ttggtgatac agggctatcc aaaaatccaa 1381 tcgagcttgt agaaggttgg ttcagtggtt ggaagagctc tattgcttcc tttttcttta 1441 tcatagggtt aatcattgga ttattctttg ttctccgaat tggtgtttat ctttgcatta 1501 aattaaagca caccaataaa agacagattt atacagacat agagatgaac cgacttggaa 1561 attaactcaa atcctgcaca acagattctt catgtttgaa ttaaatccac ttgtgatgca 1621 aagctcaaag aggccttaat catattttaa ttttttaatt tttaattttt aatttttaat 1681 ttttaatttt taatttttat tttttatttt tattttttat tttttatttt tattttttat 1741 ttttattttt tatcttttta ttttttattt ttatttttat tttttatg // LOCUS VSVGLYPL 1815 bp ss-RNA VRL 27-JUL-1990 DEFINITION Vesicular stomatitis virus (strain 83-HD-B2) glycoprotein gene, complete cds. ACCESSION M35218 KEYWORDS glycoprotein; surface antigen. SOURCE Vesicular stomatitis virus (strain 83-HD-B2, Indiana serotype), cDNA to viral RNA. ORGANISM Vesicular stomatitis virus Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses; Rhabdoviridae; Vesiculovirus. REFERENCE 1 (bases 1 to 1815) AUTHORS Bilsel,P.A. and Nichol,S.T. TITLE Polymerase errors accumulating during natural evolution of the glycoprotein gene of vesicular stomatitis Indiana serotype viruses JOURNAL J. Virol. (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by S.T.Nichol, 14-JUN-1990. FEATURES from to/span description pept 30 1565 glycoprotein BASE COUNT 512 a 358 c 346 g 599 t ORIGIN 1 aacagagatc aatctgtttc cttgacacta tgaagtgcct tttgtactta gctttgttat 61 tcattggggt gtattgcaag ttcaccacag ttttccctca caacaaaaaa ggagactgga 121 aaaatgttcc ttccaattac cattattgcc catcaagctc agatttaaat tggcataatg 181 acttaatagg cacagcctta caagtcaaaa tgcccaagag tcacaaggct attcaagcag 241 atggttggat gtgtcatgct tcaaagtggg tcactacttg tgatttccgc tggtatggac 301 cgaaatatat aacacattcc atcagatcct tcactccatc tgtagaacaa tgcaaggaaa 361 gcattgaaca gacaaaacaa ggaacctggc tgaatccagg cttccctccc caaagttgcg 421 gatacgcaac tgtgacagat gctgaagcag tgattgtcca ggtgactcct caccatgtgc 481 ttgttgatga atacacagga gaatgggtcg attcacagtt catcaacgga aaatgcagtg 541 atgatatatg ccccactgtt cataactcca caacctggca ttctgactat aaggttaaag 601 gactatgtga ttctaacctc atttccatgg acatcacctt cttctcagaa gacggagagc 661 tatcatccct aggaaaagag ggtacagggt tcagaagtaa ctactttgct tatgaaactg 721 gagacaaggc ctgcaaaatg cagtactgca agcattgggg agtcagactc ccatcaggtg 781 tctggttcga gattgctgat aagaatctct ttgctgcagc caaattccct gaatgcccag 841 aaggttcaag tatctctgct ccttctcaga cctcagtgga tgtgagtctc attcaggatg 901 ttgagaggat cttggattat tccctctgcc aagaaacctg gagcaaaatt agagcgggtc 961 ttcccatatc tccagtggat ctcagctatc ttgctcctaa aaacccagga accggccctg 1021 ccttcactat aatcaatggt accctaaaat actttgagac cagatacatc agggtcgata 1081 ttgctgctcc aatcctctca agaatggtcg gaatgattag tggaactacc accgaaaggg 1141 aactatggga ggactgggca ccatatgaag acgtggagat tggacccaat ggtgttctga 1201 ggactagttc aggatataag tttcctttat atatgattgg acatggtatg ttggactccg 1261 atcttcatct tagctcaaag gctcaggtgt tcgaacatcc tcacattcca gacgctactt 1321 cgcagcttcc tgatgatgag actttatttt ttggtgatac agggctatcc aaaaatccaa 1381 tcgagcttgt agaaggttgg ttcagtggtt ggaagagctc tattgcttcc tttttcttta 1441 tcatagggtt aatcattgga ttattctttg ttctccgaat tggtgtttat ctttgcatta 1501 aattaaagca caccaataaa agacagattt atacagacat agagatgaac cgacttggaa 1561 agtaactcaa atcctgcaca acagattctt catgtttgaa ttaaatccac ttgtgatgca 1621 aagctcaaag aggcctaatt catattttaa ttttttaatt tttaattttt aatttttaat 1681 ttttaatttt taattttatt tttttatttt tattttttat tttttatttt ttatttttta 1741 ttttttattt tttattttta ttttttatct ttttattttt tattttttat ttttattttt 1801 atttttattt ttatg // LOCUS VSVGLYPM 1665 bp ss-RNA VRL 27-JUL-1990 DEFINITION Vesicular stomatitis virus (strain San Juan 56-NM-B) glycoprotein gene, complete cds. ACCESSION M35219 KEYWORDS glycoprotein; surface antigen. SOURCE Vesicular stomatitis virus (strain San Juan 56-NM-B, Indiana serotype), cDNA to viral RNA. ORGANISM Vesicular stomatitis virus Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses; Rhabdoviridae; Vesiculovirus. REFERENCE 1 (bases 1 to 1665) AUTHORS Bilsel,P.A. and Nichol,S.T. TITLE Polymerase errors accumulating during natural evolution of the glycoprotein gene of vesicular stomatitis Indiana serotype viruses JOURNAL J. Virol. (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by S.T.Nichol, 14-JUN-1990. FEATURES from to/span description pept 30 1565 glycoprotein BASE COUNT 484 a 364 c 355 g 462 t ORIGIN 1 aacagagatc gatctgtttc cttgacacta tgaagtgcct tttgtactta gcctttttat 61 tcattggggt gaattgcaag ttcaccatag tttttccaca caaccaaaaa ggaaactgga 121 aaaatgttcc ttctaattac cattattgcc cgtcaagctc agatttaaat tggcataatg 181 acttaatagg cacagccata caagtcaaaa tgcccaagag tcacaaggct attcaagcag 241 acggttggat gtgtcatgct tccaaatggg tcactacttg tgatttccgc tggtatggac 301 cgaagtatat aacacagtcc atccgatcct tcactccatc tgtagaacaa tgcaaggaaa 361 gcattgaaca aacgaaacaa ggaacttggc tgaatccagg cttccctcct caaagttgtg 421 gatatgcaac tgtgacggat gccgaagcag tgattgtcca ggtgactcct caccatgtgc 481 tggttgatga atacacagga gaatgggttg attcacagtt catcaacgga aaatgcagca 541 attacatatg ccccactgtc cataactcta caacctggca ttctgactat aaggtcaaag 601 ggctatgtga ttctaacctc atttccatgg acatcacctt cttctcagag gacggagagc 661 tatcatccct gggaaaggag ggcacagggt tcagaagtaa ctactttgct tatgaaactg 721 gaggcaaggc ctgcaaaatg caatactgca agcattgggg agtcagactc ccatcaggtg 781 tctggttcga gatggctgat aaggatctct ttgctgcagc cagattccct gaatgcccag 841 aagggtcaag tatctctgct ccatctcaga cctcagtgga tgtaagtcta attcaggacg 901 ttgagaggat cttggattat tccctctgcc aagaaacctg gagcaaaatc agagcgggtc 961 ttccaatctc tccagtggat ctcagctatc ttgctcctaa aaacccagga accggtcctg 1021 ctttcaccat aatcaatggt accctaaaat actttgagac cagatacatc agagtcgata 1081 ttgctgctcc aatcctctca agaatggtcg gaatgatcag tggaactacc acagaaaggg 1141 aactgtggga tgactgggca ccatatgaag acgtggaaat tggacccaat ggagttctga 1201 ggaccagttc aggatataag tttcctttat acatgattgg acatggtatg ttggactccg 1261 atcttcatct tagctcaaag gctcaggtgt tcgaacatcc tcacattcaa gacgctgctt 1321 cgcaacttcc tgatgatgag agtttatttt ttggtgatac tgggctatcc aaaaatccaa 1381 tcgagcttgt agaaggttgg ttcagtagtt ggaaaagctc tattgcctct tttttcttta 1441 tcatagggtt aatcattgga ctattcttgg ttctccgagt tggtatccat ctttgcatta 1501 aattaaagca caccaagaaa agacagattt atacagacat agagatgaac cgacttggaa 1561 agtaactcaa atcctgcaca acagattctt catgtttgga ccaaatcaac ttgtgatacc 1621 atgctcaaag aggcctcaat tatatttgag tttttaattt ttatg // LOCUS VSVGLYPN 1812 bp ss-RNA VRL 27-JUL-1990 DEFINITION Vesicular stomatitis virus (strain 85-ES-B1) glycoprotein gene, complete cds. ACCESSION M35220 KEYWORDS glycoprotein; surface antigen. SOURCE Vesicular stomatitis virus (strain 85-ES-B1, Indiana serotype), cDNA to viral RNA. ORGANISM Vesicular stomatitis virus Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses; Rhabdoviridae; Vesiculovirus. REFERENCE 1 (bases 1 to 1812) AUTHORS Bilsel,P.A. and Nichol,S.T. TITLE Polymerase errors accumulating during natural evolution of the glycoprotein gene of vesicular stomatitis Indiana serotype viruses JOURNAL J. Virol. (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by S.T.Nichol, 14-JUN-1990. FEATURES from to/span description pept 30 1565 glycoprotein BASE COUNT 512 a 362 c 347 g 591 t ORIGIN 1 aacagagatc aatctgtttc cttgacacta tgaagtgcct tttgtactta gctttgttat 61 tcattggggt gtattgcaag ttcaccacag ttttccctca caacaaaaaa ggagactgga 121 aaaatgttcc ttccaattac cattattgcc catcaagctc agatttaaat tggcataatg 181 acttaatagg cacagcctta caagtcaaaa tgcccaagag tcacaaggct attcaagcag 241 atggttggat gtgtcatgct tcaaagtggg tcactacttg tgatttccgc tggtatggac 301 cgaaatatat aacacattcc atcagatcct tcactccatc tgtagaacaa tgcaaggaaa 361 gcattgaaca gacaaaacaa ggaacctggc tgaatccagg cttccctccc caaagttgcg 421 gatacgcaac tgtgacagat gctgaagcag tgattgtcca ggtgactcct caccatgtgc 481 ttgttgatga atacacagga gaatgggtcg attcacagtt catcaacgga aaatgcagtg 541 atgatatatg ccccactgtt cataactcca caacctggca ttctgactat aaggttaaag 601 gactatgtga ttctaacctc atttccatgg acatcacctt cttctcagaa gatggagagc 661 tatcatccct aggaaaagag ggtacagggt tcagaagtaa ctactttgct tatgaaactg 721 gagacaaggc ctgcaaaatg cagtactgca agcattgggg agtcagactc ccatcaggtg 781 tctggttcga gatggctgat aagaatctct ttgctgcagc caaattccct gaatgcccag 841 aaggttcaag tatctctgct ccttctcaga cctcagtgga tgtgagtctc attcaggatg 901 ttgagaggat cttggattat tccctctgcc aagaaacctg gagcaaaatc agagcgggtc 961 ttcccatatc tccagtggat ctcagctatc ttgctcctaa aaacccagga accggccctg 1021 ccttcaccat aatcaatggt accctaaaat actttgagac cagatacatc agggtcgata 1081 ttgctgctcc aatcctctca agaatggtcg gaatgattag tggaactacc accgaaaggg 1141 aactatggga ggactgggca ccatatgaag acgtggagat tggacccaat ggtgttctga 1201 ggaccagttc aggatataag tttcctttat atatgattgg acatggtatg ttggactccg 1261 atcttcatct tagctcaaag gctcaggtgt tcgaacatcc tcacattcca gacgctactt 1321 cgcagcttcc tgatgatgag actttatttt ttggtgatac agggctatcc aaaaatccaa 1381 tcgagcttgt agaaggttgg ttcagtggtt ggaagagctc tattgcttcc tttttcttta 1441 tcatagggtt aatcattgga ttattctttg ttctccgaat tggtgtttac ctttgcatta 1501 aattaaagca caccaataaa agacagattt atacagacat agagatgaac cgacttggaa 1561 agtaactcaa atcctgcaca acagattctt catgtttgaa ttaaatccac ttgtgatgca 1621 aagctcaaag aggccttaat catattttaa ttttttaatt tttaattttt aatttttaat 1681 ttttaatttt taattttatt tttttatttt tattttttta ttttttatta ttttttattt 1741 tttatttttt attttttatt tttatttttt atctttttat ttcttatttt tatttttatt 1801 tttattttta tg // LOCUS VSVGLYPO 1861 bp ss-RNA VRL 27-JUL-1990 DEFINITION Vesicular stomatitis virus (strain 86-ES-B2) glycoprotein gene, complete cds. ACCESSION M35221 KEYWORDS glycoprotein; surface antigen. SOURCE Vesicular stomatitis virus (strain 86-ES-B2, Indiana serotype), cDNA to viral RNA. ORGANISM Vesicular stomatitis virus Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses; Rhabdoviridae; Vesiculovirus. REFERENCE 1 (bases 1 to 1861) AUTHORS Bilsel,P.A. and Nichol,S.T. TITLE Polymerase errors accumulating during natural evolution of the glycoprotein gene of vesicular stomatitis Indiana serotype viruses JOURNAL J. Virol. (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by S.T.Nichol, 14-JUN-1990. FEATURES from to/span description pept 30 1565 glycoprotein BASE COUNT 520 a 363 c 347 g 631 t ORIGIN 1 aacagagatc aatctgtttc cttgacacta tgaagtgcct tttgtactta gctttgttat 61 tcattggggt gcattgcaag ttcaccacag ttttccctca caacaaaaaa ggagactgga 121 aaaatgttcc ttccaattac cattattgcc catcaagctc agatttaaat tggcataatg 181 acttaatagg cacagcctta caagtcaaaa tgcccaagag tcacaaggct attcaagcag 241 atggttggat gtgtcatgct tcaaagtggg tcactacttg tgatttccgc tggtatggac 301 cgaaatatat aacacattcc atcagatcct tcactccatc tgtagaacaa tgcaaggaaa 361 gcattgaaca gacaaaacaa ggaacctggc tgaatccagg cttccctccc caaagttgcg 421 gatacgcaac tgtgacagat gctgaagcag tgattgtcca ggtgactcct caccatgtgc 481 ttgttgatga atacacagga gaatgggtcg attcacagtt catcaacgga aaatgcagtg 541 atgatatatg ccccactgtt cataactcca caacctggca ttctgactat aaggttaaag 601 gactatgtga ttctaacctc atttccatgg acatcacctt cttctcagaa gacggagagc 661 tatcatccct aggaaaagag ggtacagggt tcagaagtaa ctactttgct tatgaaactg 721 gagacaaggc ctgcaaaatg cagtactgca agcattgggg agtccgactc ccatcaggtg 781 tctggttcga gatggctgat aagaatctct ttgctgcagc caaattccct gaatgcccag 841 aaggttcaag tatctctgct ccttctcaga cctcagtgga tgtgagtctc attcaggatg 901 ttgagaggat cttggattat tccctctgcc aagaaacctg gagcaaaatc agagcgggtc 961 ttcccatatc tccagtggat ctcagctatc ttgctcctaa aaacccagga accggccctg 1021 ccttcaccat aatcaatggt accctaaaat acttcgagac cagatacatc agggtcgata 1081 ttgctgctcc aatcctctca agaatggtcg gaatgattag tggaactacc accgaaaggg 1141 aactatggga ggactgggca ccatatgaag acgtggagat tggacccaat ggtgttctga 1201 ggactagttc aggatataag tttcctttat atatgattgg acatggtatg ttggactccg 1261 atcttcatct tagctcaaag gctcaggtgt tcgaacatcc tcacattcca gacgctactt 1321 cgcagcttcc tgatgatgag actttatttt ttggtgatac agggctatcc aaaaatccaa 1381 tcgagcttgt agaaggttgg ttcagtggtt ggaagagctc tattgcttcc tttttcttta 1441 tcatagggtt aatcattgga ttattctttg ttctccgaat tggtgtttat ctttgcatta 1501 aattaaagca caccaataaa agacagattt atacagacat agagatgaac cgacttggaa 1561 agtaactcaa atcctgcaca acagactctt catgtttgaa ttaaatccac ttgtgatgca 1621 aagctcaaag aggccttaat catattttaa ttttttaatt tttaattttt aatttttaat 1681 ttttaatttt taatttttaa tttttaattt tatttttatt tttatttttt attttttatt 1741 ttttattttt tattttttta ttttttattt tttatttttt atttttattt tttatttttt 1801 tattttttta ttttttattt ttatttttat ttttattttt atttttattt ttatttttat 1861 g // LOCUS VSVGLYPP 1802 bp ss-RNA VRL 27-JUL-1990 DEFINITION Vesicular stomatitis virus (strain Mudd-Summers 84-GM-B) glycoprotein gene, complete cds. ACCESSION M35222 KEYWORDS glycoprotein; surface antigen. SOURCE Vesicular stomatitis virus (strain Mudd-Summers 84-GM-B, Indiana serotype), cDNA to viral RNA. ORGANISM Vesicular stomatitis virus Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses; Rhabdoviridae; Vesiculovirus. REFERENCE 1 (bases 1 to 1802) AUTHORS Bilsel,P.A. and Nichol,S.T. TITLE Polymerase errors accumulating during natural evolution of the glycoprotein gene of vesicular stomatitis Indiana serotype viruses JOURNAL J. Virol. (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by S.T.Nichol, 14-JUN-1990. FEATURES from to/span description pept 30 1565 glycoprotein BASE COUNT 506 a 362 c 348 g 586 t ORIGIN 1 aacagagatc aatctgtttc cttgacacta tgaagtgcct tttgtacttg gctttgttat 61 tcattggggt gtattgtaag ttcaccacag ttttccctca caacaaaaaa ggagactgga 121 aaaatgttcc ttccaattac cattattgcc catcaagctc agatttaaat tggcataatg 181 acttaatagg cacagcctta caagtcaaaa tgcccaagag tcacaaggct attcaagcag 241 atggttggat gtgtcatgct tccaagtggg tcactacttg tgatttccgc tggtatggac 301 cgaaatatat aacacattcc atcagatcct tcactccatc tgtagaacaa tgcaaggaaa 361 gcattgaaca gacaaaacaa ggaacctggc tgaatccagg cttccctccc caaagttgcg 421 gatacgcaac tgtgacagat gctgaagcag tgattgtcca ggtgactcct caccatgtgc 481 ttgttgatga atacacagga gaatgggtcg attcacagtt catcaacgga aaatgcagtg 541 atgatatatg ccccactgtt cataactcca caacctggca ttctgactat aaggttaaag 601 gactatgtga ttctaacctc atttccatgg acatcacctt cttctcagaa gacggagagc 661 tatcatccct aggaaaagag ggtacagggt tcagaagtaa ctactttgct tatgaaactg 721 gagacaaggc ctgcaaaatg cagtactgca agcattgggg agtcagactc ccatcaggtg 781 tctggttcga gatggctgat aagaatctct ttgctgcagc caaattccct gaatgccccg 841 aaggttcaag tatctctgct ccttctcaga cctcagtgga tgtgagtctc attcaggatg 901 ttgagaggat cttggattat tccctctgcc aagaaacctg gagcaaaatc agagcgggtc 961 ttcccatatc tccagtggat ctcagctatc ttgctcctaa aaacccagga accggccctg 1021 ccttcaccat aatcaatggt accctaaaat actttgagac cagatacatc agggtcgata 1081 ttgctgctcc aatcctctca agaatggtcg gaatgattag tggaactacc accgaaaggg 1141 aactatggga tgactgggca ccatatgaag acgtggagat tggacccaat ggtgttctga 1201 ggaccagttc aggatataag tttcctttat atatgattgg acatggtatg ttggactccg 1261 atcttcatct tagctcaaag gctcaggtgt tcgaacatcc tcacattcca gacgctactt 1321 cgcagcttcc tgatgatgag actttatttt ttggtgatac agggctatcc aaaaatccaa 1381 tcgagcttgt agaaggttgg ttcagtggtt ggaagagctc tattgcttcc tttttcttta 1441 tcatagggtt aatcattgga ttattctttg ttctccgaat tggtgtttat ctttgcatta 1501 aattaaagca caccaataaa agacagattt atgcagacat agagatgaac cgacttggaa 1561 agtaactcaa atcctgcaca acagattctt catgtttgaa ttaaatccac ttgtgatgca 1621 aagctcaaag aggtcttaat catattttaa ttttttaatt tttaattttt aatttttaat 1681 ttttaatttt taatttttat tttttatttt tattttttta ttttttattt tttattttta 1741 ttttttattt ttatttttta tctctttatt ttttattttt atttttattt ttatttttta 1801 tg // LOCUS VSVGLYPQ 1868 bp ss-RNA VRL 27-JUL-1990 DEFINITION Vesicular stomatitis virus (strain 87-ES-B) glycoprotein gene, complete cds. ACCESSION M35223 KEYWORDS glycoprotein; surface antigen. SOURCE Vesicular stomatitis virus (strain 87-ES-B, Indiana serotype), cDNA to viral RNA. ORGANISM Vesicular stomatitis virus Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses; Rhabdoviridae; Vesiculovirus. REFERENCE 1 (bases 1 to 1868) AUTHORS Bilsel,P.A. and Nichol,S.T. TITLE Polymerase errors accumulating during natural evolution of the glycoprotein gene of vesicular stomatitis Indiana serotype viruses JOURNAL J. Virol. (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by S.T.Nichol, 14-JUN-1990. FEATURES from to/span description pept 30 1565 glycoprotein BASE COUNT 520 a 362 c 347 g 639 t ORIGIN 1 aacagagatc aatctgtttt cttgacacta tgaagtgcct tttgtactta gctttgttat 61 tcattggggt gcattgcaag ttcaccacag ttttccctca caacaaaaaa ggagactgga 121 aaaatgttcc ttccaattac cattattgcc catcaagctc agatttaaat tggcataatg 181 acttaatagg cacagcctta caagtcaaaa tgcccaagag tcacaaggct attcaagcag 241 atggttggat gtgtcatgct tcaaagtggg tcactacttg tgatttccgc tggtatggac 301 cgaaatatat tacacattcc atcagatcct tcactccatc tgtagaacaa tgcaaggaaa 361 gcattgaaca gacaaaacaa ggaacctggc tgaatccagg cttccctccc caaagttgcg 421 gatacgcaac tgtgacagat gctgaagcag tgattgtcca ggtgactcct catcatgtgc 481 ttgttgatga atacacagga gaatgggtcg attcacagtt catcaacgga aaatgcagtg 541 atgatatatg ccccactgtt cataactcca caacctggca ttctgactat aaggttaaag 601 gactatgtga ttctaacctc atttccatgg acatcacctt cttctcagaa gacggagagc 661 tatcatccct aggaaaagag ggtacagggt tcagaagtaa ctactttgct tatgaaactg 721 gagacaaggc ctgcaaaatg cagtactgca agcattgggg agtccgactc ccatcaggtg 781 tctggttcga gatggctgat aagaatctct ttgctgcagc caaattccct gaatgcccag 841 aaggttcaag tatctctgct ccttctcaga cctcagtgga tgtgagtctc attcaggatg 901 ttgagaggat cttggattat tccctctgcc aagaaacctg gagcaaaatc agagcgggtc 961 ttcccatatc tccagtggat ctcagctatc ttgctcctaa aaacccagga accggccctg 1021 ccttcaccat aatcaatggt accctaaaat acttcgagac cagatacatc agggtcgata 1081 ttgctgctcc aatcctctca agaatggtcg gaatgattag tggaactacc accgaaaggg 1141 aactatggga ggactgggca ccatatgaag acgtggagat tggacccaat ggtgttctga 1201 ggactagttc aggatataag tttcctttat atatgattgg acatggtatg ttggactccg 1261 atcttcatct tagctcaaag gctcaggtgt tcgaacatcc tcacattcca gacgctactt 1321 cgcagcttcc tgatgatgag actttatttt ttggtgatac agggctatcc aaaaatccaa 1381 tcgagcttgt agaaggttgg ttcagtggtt ggaagagctc tattgcttcc tttttcttta 1441 tcatagggtt aatcattgga ttattctttg ttctccgaat tggtgtttat ctttgcatta 1501 aattaaagca caccaataaa agacagattt atacagacat agagatgaac cgacttggaa 1561 agtaactcaa atcctgcaca acagactctt catgtttgaa ttaaatccac ttgtgatgca 1621 aagctcaaag aggccttaat catatttcaa ttttttaatt tttaattttt aatttttaat 1681 ttttaatttt taatttttaa tttttaattt tatttttatt tttatttttt attttttatt 1741 ttttattttt tattttttat tttttatttt ttatttttta tttttatttt ttattttttt 1801 atttttttat ttttttattt tttattttta tttttatttt tatttttatt tttattttta 1861 tttttatg // LOCUS VSVGLYPR 1860 bp ss-RNA VRL 27-JUL-1990 DEFINITION Vesicular stomatitis virus (strain 86-ES-B1) glycoprotein gene, complete cds. ACCESSION M35224 KEYWORDS glycoprotein; surface antigen. SOURCE Vesicular stomatitis virus (strain 86-ES-B1, Indiana serotype), cDNA to viral RNA. ORGANISM Vesicular stomatitis virus Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses; Rhabdoviridae; Vesiculovirus. REFERENCE 1 (bases 1 to 1860) AUTHORS Bilsel,P.A. and Nichol,S.T. TITLE Polymerase errors accumulating during natural evolution of the glycoprotein gene of vesicular stomatitis Indiana serotype viruses JOURNAL J. Virol. (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by S.T.Nichol, 14-JUN-1990. FEATURES from to/span description pept 30 1565 glycoprotein BASE COUNT 520 a 362 c 347 g 631 t ORIGIN 1 aacagagatc aatctgtttc cttgacacta tgaagtgcct tttgtactta gctttgttat 61 tcattggggt gcattgcaag ttcaccacag ttttccctca caacaaaaaa ggagactgga 121 aaaatgttcc ttccaattac cattattgcc catcaagctc agatttaaat tggcataatg 181 acttaatagg cacagcctta caagtcaaaa tgcccaagag tcacaaggct attcaagcag 241 atggttggat gtgtcatgct tcaaagtggg tcactacttg tgatttccgc tggtatggac 301 cgaaatatat aacacattcc atcagatcct tcactccatc tgtagaacaa tgcaaggaaa 361 gcattgaaca gacaaaacaa ggaacctggc tgaatccagg cttccctccc caaagttgcg 421 gatacgcaac tgtgacagat gctgaagcag tgattgtcca ggtgactcct caccatgtgc 481 ttgttgatga atacacagga gaatgggtcg attcacagtt catcaacgga aaatgcagtg 541 atgatatatg ccccactgtt cataactcta caacctggca ttctgactat aaggttaaag 601 gactatgtga ttctaacctc atttccatgg acatcacctt cttctcagaa gacggagagc 661 tatcatccct aggaaaagag ggtacagggt tcagaagtaa ctactttgct tatgaaactg 721 gagacaaggc ctgcaaaatg cagtactgca agcattgggg agtccgactc ccatcaggtg 781 tctggttcga gatggctgat aagaatctct ttgctgcagc caaattccct gaatgcccag 841 aaggttcaag tatctctgct ccttctcaga cctcagtgga tgtgagtctc attcaggatg 901 ttgagaggat cttggattat tccctctgcc aagaaacctg gagcaaaatc agagcgggtc 961 ttcccatatc tccagtggat ctcagctatc ttgctcctaa aaacccagga accggccctg 1021 ccttcaccat aatcaatggt accctaaaat acttcgagac cagatacatc agggtcgata 1081 ttgctgctcc aatcctctca agaatggtcg gaatgattag tggaactacc accgaaaggg 1141 aactatggga ggactgggca ccatatgaag acgtggagat tggacccaat ggtgttctga 1201 ggactagttc aggatataag tttcctttat atatgattgg acatggtatg ttggactccg 1261 atcttcatct tagctcaaag gctcaggtgt tcgaacatcc tcacattcca gacgctactt 1321 cgcagcttcc tgatgatgag actttatttt ttggtgatac agggctatcc aaaaatccaa 1381 tcgagcttgt agaaggttgg ttcagtggtt ggaagagctc tattgcttcc tttttcttta 1441 tcatagggtt aatcattgga ttattctttg ttctccgaat tggtgtttat ctttgcatta 1501 aattaaagca caccaataaa agacagattt atacagacat agagatgaac cgacttggaa 1561 agtaactcaa atcctgcaca acagactctt catgtttgaa ttaaatccac ttgtgatgca 1621 aagctcaaag aggccttaat catattttaa ttttttaatt tttaattttt aatttttaat 1681 ttttaatttt taatttttaa tttttaattt tatttttatt tttatttttt attttttatt 1741 ttttattttt tattttttat tttttatttt ttatttttta tttttatttt ttattttttt 1801 atttttttat tttttatttt tatttttatt tttattttta tttttatttt tatttttatg // LOCUS VSVGLYPS 1657 bp ss-RNA VRL 27-JUL-1990 DEFINITION Vesicular stomatitis virus (strain 69-PN-L) glycoprotein gene, complete cds. ACCESSION M35225 KEYWORDS glycoprotein; surface antigen. SOURCE Vesicular stomatitis virus (strain 69-PN-L, Indiana serotype), cDNA to viral RNA. ORGANISM Vesicular stomatitis virus Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses; Rhabdoviridae; Vesiculovirus. REFERENCE 1 (bases 1 to 1657) AUTHORS Bilsel,P.A. and Nichol,S.T. TITLE Polymerase errors accumulating during natural evolution of the glycoprotein gene of vesicular stomatitis Indiana serotype viruses JOURNAL J. Virol. (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by S.T.Nichol, 14-JUN-1990. FEATURES from to/span description pept 30 1565 glycoprotein BASE COUNT 487 a 358 c 364 g 448 t ORIGIN 1 aacagagatc aatctgtttc actgacacca tgaagtgcct tttgtactta gctcttttat 61 tcattggagt gaactgtaaa ttcaccatag tattcccaca caatcagaaa gggaattgga 121 aaaatgttcc ttccggttac cattattgcc cgtccagctc agatttaaat tggcacaacg 181 gcttgatagg cacagcttta caagtcaaga tgcccaagag tcacaaggct attcaagcag 241 atggttggat gtgccacgct tctaaatggg ttactacttg cgatttccgc tggtatggac 301 ctaaatatat aacacattcc atcagatcct tcacaccatc agtggaacaa tgcaaggaaa 361 gcattgaaca gacaaaacaa ggaacatggc tgaatccagg ttttcctcct cagagttgcg 421 gatatgcaac tgtgacggat gctgaagcag tgattgtcca agtgactcct catcatgtat 481 tggttgatga gtacacagga gagtgggtcg attcccagtt catcaatgga aaatgcagca 541 acgacatatg ccccactgta cataattcca caacctggca ttcagattac aaagtgaaag 601 ggttgtgtga tgccaacctc atctccatgg acatcacttt tttctcagag gatggagaac 661 tctcatccct agggaaggaa ggcacgggat tcagaagcaa ttactttgct tacgagactg 721 gagacaaagc ctgtagaatg cagtattgca agcattgggg agtcagactc ccgtcaggtg 781 tttggtttga aatggctgac aaagatctat ttgctgcagc aaaattcccg gagtgtcccg 841 aaggatcgag tatctccgct ccatctcaga catcagtgga tgtaagtctc attcaggatg 901 ttgagagaat cctggattac tctctttgcc aagaaacctg gagcaagatt ggagcaggtc 961 ttcccatttc cccagtggat ctaagctatc tcgctcctaa aaacccaggg acaggcccag 1021 cctttacgat aatcaacggt actctaaaat actttgagac cagatacatc agagtcgata 1081 ttgctgctcc aattcttcca agaatggtcg ggaccatcag tggaaccacc acagaaaggg 1141 aactgtggga tgactgggca ccatatgagg acgtggaaat tggtccaaac ggagttttga 1201 aaaccagctc aggatataaa tttcctttat atatgattgg acatggtatg ttggactccg 1261 atctccacct cagttcaaag gcccaagtgt tcgaacaccc tcacattcag gacgctgcat 1321 cacagcttcc tgatgatgag actttatttt ttggtgatac cgggttatcc aaaaacccaa 1381 ttgagctcgt agaaggttgg ttcagtggtt ggaagagctc cattgcttct tttttcttta 1441 tcgtagggtt gataattgga ttgttcttgg tgctccgagt tggtatctat ctctgcatca 1501 acttaaggca caccaaaaaa agaaaaattt acacagacat tgagatgaat cgacttggaa 1561 agtaatcaca ataccacaca acagaataat catatttaga ccaaatcgac ttgtgatatc 1621 atgctttaag aggctttaac gatgttttga ttgtatg // LOCUS VSVGLYPT 1657 bp ss-RNA VRL 27-JUL-1990 DEFINITION Vesicular stomatitis virus (strain 87-CR-B1) glycoprotein gene, complete cds. ACCESSION M35226 KEYWORDS glycoprotein; surface antigen. SOURCE Vesicular stomatitis virus (strain 87-CR-B1, Indiana serotype), cDNA to viral RNA. ORGANISM Vesicular stomatitis virus Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses; Rhabdoviridae; Vesiculovirus. REFERENCE 1 (bases 1 to 1657) AUTHORS Bilsel,P.A. and Nichol,S.T. TITLE Polymerase errors accumulating during natural evolution of the glycoprotein gene of vesicular stomatitis Indiana serotype viruses JOURNAL J. Virol. (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by S.T.Nichol, 14-JUN-1990. FEATURES from to/span description pept 30 1565 glycoprotein BASE COUNT 490 a 358 c 363 g 446 t ORIGIN 1 aacagagatc aatctgtttc actgacacca tgaagtgcct tttgtactta gctcttttat 61 tcataggagt gaactgtaaa ttcaccatag tattcccaca caatcagaaa gggaattgga 121 aaaatgttcc ttccggttac cattactgcc cgtccagctc agatttaaat tggcacaacg 181 gcttgatagg cacagcttta caagtcaaga tgcccaagag tcacaaggct attcaagcag 241 acggttggat gtgccatgct tctaaatggg ttactacttg cgatttccgc tggtacggac 301 cgaaatatat aacacattcc atcagatcct tcacaccatc agtggaacaa tgcaaggaaa 361 gcattgaaca gacaaaacaa ggaacatggc tgaatccagg ttttcctcct cagagttgcg 421 gatatgcaac tgtgacggat gctgaagccg tgattgtcca agtgactcct catcatgtac 481 tggttgatga gtacacagga gagtgggtcg attcccagtt catcaatgga aaatgcagca 541 acgacatatg ccccactgta cataattcca caacctggca ttcagattac aaagtgaaag 601 ggttgtgtga tgccaacctc atctccatgg acatcacttt tttctcagag gatggggaac 661 tttcatccct agggaaagaa ggcacgggat tcagaagcaa ttattttgct tacgagactg 721 gagacaaagc ctgtagaatg cagtattgca agcattgggg aatcagactc ccgtcaggtg 781 tttggtttga aatggctgat aacgatctgt ttgctgcagc aaaattccca gaatgtcccg 841 aaggatcgag tatctccgcc ccatctcaga catcagtgga tgtaagtctc attcaggatg 901 ttgagagaat cctggattac tctctttgcc aagaaacctg gagcaagatt agaacaggta 961 ttcccatttc cccagtggat ctaagctatc tcgctcctaa aaacccaggg acaggcccag 1021 cctttacgat aatcaacggt actctgaaat actttgagac cagatacatc agagtcgata 1081 ttgctgctcc aattcttcca agaatggtcg ggaccatcag tggaaccacc acagaaaggg 1141 aactgtggga tgactgggca ccatacgagg acgtggaaat tggtccaaac ggagttttga 1201 aaaccagctc aggatataaa tttcctttat atatgattgg acatggtatg ttggactccg 1261 atctccgcat cagttcaaag gcccaagtgt tcgaacaccc tcacattcag gacgctgcat 1321 cacagcttcc tgatgatgag actttatttt ttggtgatac cgggttatcc aaaaacccaa 1381 ttgagctcgt agaaggttgg tttagtggtt ggaagagctc cattgcttct tttttcttta 1441 tcgtagggtt gataattgga ttattcttgg tgctccgagt tggtatctat ctctgcatca 1501 acttaaggca caccaaaaaa agaaaaattt acacagacat tgagatgaat cgacttggaa 1561 agtaatcaca atactgcaca acagaataat catatttaga ccaaatcgac ttgtgatatc 1621 atgctttaag aggctttaac gatgttttga ttgtatg // LOCUS VSVGLYPU 1657 bp ss-RNA VRL 27-JUL-1990 DEFINITION Vesicular stomatitis virus (strain 85-CR-B2) glycoprotein gene, complete cds. ACCESSION M35227 KEYWORDS glycoprotein; surface antigen. SOURCE Vesicular stomatitis virus (strain 85-CR-B2, Indiana serotype), cDNA to viral RNA. ORGANISM Vesicular stomatitis virus Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses; Rhabdoviridae; Vesiculovirus. REFERENCE 1 (bases 1 to 1657) AUTHORS Bilsel,P.A. and Nichol,S.T. TITLE Polymerase errors accumulating during natural evolution of the glycoprotein gene of vesicular stomatitis Indiana serotype viruses JOURNAL J. Virol. (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by S.T.Nichol, 14-JUN-1990. FEATURES from to/span description pept 30 1565 glycoprotein BASE COUNT 486 a 357 c 367 g 447 t ORIGIN 1 aacagagatc aatctgtttc actgacacca tgaagtgcct tttgtactta gctcttttat 61 tcataggagt gaactgtaaa ttcaccatag tattcccaca caatcagaaa gggaattgga 121 aaaatgttcc ttccggttac cattactgcc cgtccagctc agatttaaat tggcacaacg 181 gcttgatagg cacagcttta caagtcaaga tgcccaagag tcacaaggct attcaggcag 241 acggttggat gtgccatgct tctaaatggg ttactacttg cgatttccgc tggtatggac 301 cgaaatatat aacacattcc atcagatcct tcacaccatc agtggaacaa tgcaaggaaa 361 gcattgaaca gacaaaacaa ggaacatggc tgaatccggg ttttcctcct cagagttgcg 421 gatatgcaac tgtgacggat gctgaagccg tgattgtcca agtgactcct catcatgtac 481 tggttgatga gtacacagga gagtgggtcg attcccagtt catcaatgga aaatgcagca 541 acgacatatg ccccactgta cataattcca caacctggca ttcagattac aaagtgaaag 601 ggttgtgtga tgccaacctc atctccatgg acatcacttt tttctcagag gaaggagaac 661 tttcatccct agggaaagaa ggcacgggat tcagaagcaa ttattttgct tacgagactg 721 gagacaaagc ctgtagaatg cagtattgca agcattgggg agtcagactc ccgtcaggtg 781 tttggtttga aatggctgat aacgatctgt ttgctgcagc aaaattccca gaatgtcccg 841 aaggatcgag tatctccgcc ccatctcaga catcagtgga tgtaagtctc attcaggatg 901 ttgagagaat cctggattac tctctttgcc aagaaacctg gagcaagatt agagcaggta 961 ttcccatttc cccagtggat ctaagctatc tcgctcctaa aaacccaggg acaggcccag 1021 cctttacgat aatcaacggt actctgaaat actttgagac cagatacatc agagtcgata 1081 ttgctgctcc aattcttcca agaatggtcg ggaccatcag tggaaccacc acagaaaggg 1141 aactgtggga tgactgggca ccatatgagg acgtggaaat tggtccaaac ggagttttga 1201 aaaccagctc aggatataaa tttcctttat atatgattgg acatggtatg ttggactccg 1261 atctccgcct cagttcaaag gcccaagtgt tcgaacaccc tcacattcag gacgctgcat 1321 cacagcttcc tgatgatgag actttatttt ttggtgatac cgggttatcc aaaaacccaa 1381 ttgagctcgt agaaggttgg tttagtggtt ggaagagctc cattgcttct tttttcttta 1441 tcgtagggtt gataattgga ttattcttgg tgctccgagt tggtatctat ctctgcatca 1501 acttaaggca caccaaaaaa agaaaaattt acacagacat tgagatgaat cgacttggaa 1561 agtaatcaca atactgcaca acagaataat catatttgga ccaaatcgac ttgtgatatc 1621 atgctttaag aggctttaac gatgttttga ttgtatg // LOCUS VSVGLYPV 1796 bp ss-RNA VRL 27-JUL-1990 DEFINITION Vesicular stomatitis virus (strain 86-GM-B) glycoprotein gene, complete cds. ACCESSION M35228 KEYWORDS glycoprotein; surface antigen. SOURCE Vesicular stomatitis virus (strain 86-GM-B, Indiana serotype), cDNA to viral RNA. ORGANISM Vesicular stomatitis virus Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses; Rhabdoviridae; Vesiculovirus. REFERENCE 1 (bases 1 to 1796) AUTHORS Bilsel,P.A. and Nichol,S.T. TITLE Polymerase errors accumulating during natural evolution of the glycoprotein gene of vesicular stomatitis Indiana serotype viruses JOURNAL J. Virol. (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by S.T.Nichol, 14-JUN-1990. FEATURES from to/span description pept 30 1565 glycoprotein BASE COUNT 503 a 362 c 349 g 582 t ORIGIN 1 aacagagatc aatctgtttc cttgacacta tgaagtgcct tttgtacttg gctttgttat 61 tcattggggt gtattgtaag ttcaccacag ttttccctca caacaaaaaa ggagactgga 121 aaaatgttcc ttcccattac cattattgcc catcaagctc agatttaaat tggcataatg 181 acttaatagg cacagcctta caagtcaaaa tgcccaagag tcacaaggct attcaagcag 241 atggttggat gtgtcatgct tccaagtggg tcactacttg tgatttccgc tggtatggac 301 cgaaatatat aacacattcc atcagatcct tcactccatc tgtagaacaa tgcaaggaaa 361 gcattgaaca gacaaaacaa ggaacctggc tgaatccagg cttccctccc caaagttgcg 421 gatacgcaac tgtgacagat gctgaagcag tgattgtcca ggtgactcct caccatgtgc 481 ttgttgatga atacacagga gaatgggtcg attcacagtt catcaacgga aaatgcagtg 541 atgatatatg ccccactgtt cataactcca caacctggca ttctgactat aaggttaaag 601 gactatgtga ttctaacctc atttccatgg acatcacctt cttctcagaa gacggagagc 661 tatcatccct aggaaaagag ggtacagggt tcagaagtaa ctactttgct tatgaaactg 721 gagacaaggc ctgcaaaatg cagtactgca agcattgggg agtcagactc ccatcaggtg 781 tctggttcga gatggctgat aagaatctct ttgctgcagc caaattccct gaatgccccg 841 aaggttcaag tatctctgct ccttctcaga cctcagtgga tgtgagtctc attcaggatg 901 ttgagaggat cttggattat tccctctgcc aagaaacctg gagcaaaatc agagcgggtc 961 ttcccatatc tccagtggat ctcagctatc ttgctcctaa aaacccagga accggccctg 1021 ccttcaccat agtcaatggt accctaaaat actttgagac cagatacatc agggtcgata 1081 ttgctgctcc aatcctctca agaatggtcg gaatgattag tggaactacc accgaaaggg 1141 aactatggga tgactgggca ccatatgaag acgtggagat tggacccaat ggtgttctga 1201 ggaccagttc aggatataag tttcctttat atatgattgg acatggtatg ttggactccg 1261 atcttcatct tagctcaaag gctcaggtgt tcgaacatcc tcacattcca gacgctactt 1321 cgcagcttcc tgatgatgag actttatttt ttggtgatac agggctatcc aaaaatccaa 1381 tcgagcttgt agaaggttgg ttcagtggtt ggaagagctc tattgcttcc tttttcttta 1441 tcatagggtt aatcattgga ttattctttg ttctccgaat tggtgtttat ctttgcatta 1501 aattaaagca caccaataaa agacagattt atgcagacat agagatgaac cgacttggaa 1561 agtaactcaa atcctgcaca acagattctt catgtttgaa ttaaatccac ttgtgatgca 1621 aagctcaaag aggtcttaat catattttaa ttttttaatt tttaattttt aatttttaat 1681 ttttaatttt taatttttat tttttatttt tattttttta ttttttattt tttattttta 1741 ttttttattt tttattttta ttttttatct ttttattttt atttttattt tttatg // LOCUS VSVGLYPW 1800 bp ss-RNA VRL 27-JUL-1990 DEFINITION Vesicular stomatitis virus (strain 85-GM-B) glycoprotein gene, complete cds. ACCESSION M35229 KEYWORDS glycoprotein; surface antigen. SOURCE Vesicular stomatitis virus (strain 85-GM-B, Indiana serotype), cDNA to viral RNA. ORGANISM Vesicular stomatitis virus Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses; Rhabdoviridae; Vesiculovirus. REFERENCE 1 (bases 1 to 1800) AUTHORS Bilsel,P.A. and Nichol,S.T. TITLE Polymerase errors accumulating during natural evolution of the glycoprotein gene of vesicular stomatitis Indiana serotype viruses JOURNAL J. Virol. (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by S.T.Nichol, 14-JUN-1990. FEATURES from to/span description pept 30 1565 glycoprotein BASE COUNT 506 a 362 c 348 g 584 t ORIGIN 1 aacagagatc aatctgtttc cttgacacta tgaagtgcct tttgtacttg gctttgttat 61 tcattggggt gtattgtaag ttcaccacag ttttccctca caacaaaaaa ggagactgga 121 aaaatgttcc ttccaattac cattattgcc catcaagctc agatttaaat tggcataatg 181 acttaatagg cacagcctta caagtcaaaa tgcccaagag tcacaaggct attcaagcag 241 atggttggat gtgtcatgct tccaagtggg tcactacttg tgatttccgc tggtatggac 301 cgaaatatat aacacattcc atcagatcct tcactccatc tgtagaacaa tgcaaggaaa 361 gcattgaaca gacaaaacaa ggaacctggc tgaatccagg cttccctccc caaagttgcg 421 gatacgcaac tgtgacagat gctgaagcag tgattgtcca ggtgactcct caccatgtgc 481 ttgttgatga atacacagga gaatgggtcg attcacagtt catcaacgga aaatgcagtg 541 atgatatatg ccccactgtt cataactcca caacctggca ttctgactat aaggttaaag 601 gactatgtga ttctaacctc atttccatgg acatcacctt cttctcagaa gacggagagc 661 tatcatccct aggaaaagag ggtacagggt tcagaagtaa ctactttgct tatgaaactg 721 gagacaaggc ctgcaaaatg cagtactgca agcattgggg agtcagactc ccatcaggtg 781 tctggttcga gatggctgat aagaatctct ttgctgcagc caaattccct gaatgccccg 841 aaggttcaag tatctctgct ccttctcaga cctcagtgga tgtgagtctc attcaggatg 901 ttgagaggat cttggattat tccctctgcc aagaaacctg gagcaaaatc agagcgggtc 961 ttcccatatc tccagtggat ctcagctatc ttgctcctaa aaacccagga accggccctg 1021 ccttcaccat aatcaatggt accctaaaat actttgagac cagatacatc agggtcgata 1081 ttgctgctcc aatcctctca agaatggtcg gaatgattag tggaactacc accgaaaggg 1141 aactatggga tgactgggca ccatatgaag acgtggagat tggacccaat ggtgttctga 1201 ggaccagttc aggatataag tttcctttat atatgattgg acatggtatg ttggactccg 1261 atcttcatct tagctcaaag gctcaggtgt tcgaacatcc tcacattcca gacgctactt 1321 cgcagcttcc tgatgatgag actttatttt ttggtgatac agggctatcc aaaaatccaa 1381 tcgagcttgt agaaggttgg ttcagtggtt ggaagagctc tattgcttcc tttttcttta 1441 tcatagggtt aatcattgga ttattctttg ttctccgaat tggtgtttat ctttgcatta 1501 aattaaagca caccaataaa agacagattt atgcagacat agagatgaac cgacttggaa 1561 agtaactcaa atcctgcaca acagattctt catgtttgaa ttaaatccac ttgtgatgca 1621 aagctcaaag aggtcttaat catattttaa ttttttaatt tttaattttt aatttttaat 1681 ttttaatttt taatttttat tttttatttt tattttttta ttatttttta ttttttattt 1741 ttatttttta tttttatttt ttcatctttt tattttttat ttttattttt attttttatg // LOCUS VSVGLYPX 1652 bp ss-RNA VRL 27-JUL-1990 DEFINITION Vesicular stomatitis virus (strain 59-PN-L) glycoprotein gene, complete cds. ACCESSION M35230 KEYWORDS glycoprotein; surface antigen. SOURCE Vesicular stomatitis virus (strain 59-PN-L, Indiana serotype), cDNA to viral RNA. ORGANISM Vesicular stomatitis virus Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses; Rhabdoviridae; Vesiculovirus. REFERENCE 1 (bases 1 to 1652) AUTHORS Bilsel,P.A. and Nichol,S.T. TITLE Polymerase errors accumulating during natural evolution of the glycoprotein gene of vesicular stomatitis Indiana serotype viruses JOURNAL J. Virol. (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by S.T.Nichol, 14-JUN-1990. FEATURES from to/span description pept 30 1565 glycoprotein BASE COUNT 487 a 344 c 367 g 453 t 1 others ORIGIN 1 aacagagatc aatctgtctt cctagcatca tgaaatgnat tttgtacttc gcttttctcc 61 tggtaggagt gaactgtaaa tttacaatag ttttcccaca taatcaaaaa gggaactgga 121 aaaatgtgcc atccagttac cattattgtc catctagttc cgatcttaat tggcacaatg 181 gcctgattgg tacagcatta caagtcaaga tgcccaaaag tcacaaagca attcaggcag 241 acggctggat gtgtcatgca tccaaatggg tcaccacatg tgacttcagg tggtacggac 301 ccaaatatat tacgcattct attcggtcat tcacaccctc tgtggaacag tgcagagaaa 361 gcattgaaca gaccaaacaa ggaatctgga taaatcctgg ctttcctcct cagagttgtg 421 gatatgcaac tgtgacggac gcggaagcag taattgtcca ggtcacccca catcatgtgc 481 tcgtagatga gtacacaggc gaatgggtag attcccaatt tttgaacgga aagggcaaag 541 acgacatatg ccccacagtc cataattcga ccacatggca ttcagactac aaagtcaaag 601 ggctttgtga tgctaaccta atctcaatgg acatcacttt cttctcagaa gatggtgagc 661 tgtcctcctt gggcaaagat gggactggat tcagaagcaa ttactttgct tatgaaaatg 721 gagataaggc ttgccgcatg cagtattgca aacattgggg agtcagactc ccttctggag 781 tctggttcga gatggcagac aaagatttgt ttgcaggagc caagttcccc gagtgtccag 841 acggaaccag tatctctgct ccttctcaga catccgtaga tgtaagctta attcaggacg 901 tcgaaagaat cttggactat tccttgtgtc aagagacttg gagtaaaatc aaggcaggcc 961 ttccgatttc tccagttgat ctaagctatc tcgctccaaa aaatccaggg acaggtcccg 1021 ctttcaccat catcaatggg accttgaagt actttgagac cagatacatc agagtggaca 1081 ttgctgctcc aatcataccc agaatggtgg gaactatcag tggaacatct acagaaagag 1141 aattgtggga tgactggtac tcttatgaag atgtggagat tgggcccaat ggagtattga 1201 aaacaacctc aggatataaa tttcctcttt atatgattgg acacggaatg ctggactctg 1261 atcttcacct gacatcgaag gcccaggttt ttgaacatcc ccacattcag gacgctgcat 1321 cacagcttcc tgacggcgag actttatttt ttggtgatac cgggctctca aaaaacccca 1381 ttgaattagt agagggatgg ttcagtgggt ggaaaagctc tattgcttcg ttcttctttg 1441 tcattggatt gatcattgga ttatatttgg ttttaagggt gggaatatat ctttgtctta 1501 aactaaagca tacaaaaaag agaaaaattt acacagacat tgagatgaac agacttggaa 1561 aataaccaaa attgtgtgat ggatttgctt ttttgccaaa gaagcctgtg atatccttct 1621 cagagaagct ttaatctttt attgacaata tg // LOCUS VSVGLYPY 1657 bp ss-RNA VRL 27-JUL-1990 DEFINITION Vesicular stomatitis virus (strain 84-PN-B2) glycoprotein gene, complete cds. ACCESSION M35231 KEYWORDS glycoprotein; surface antigen. SOURCE Vesicular stomatitis virus (strain 84-PN-B2, Indiana serotype), cDNA to viral RNA. ORGANISM Vesicular stomatitis virus Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses; Rhabdoviridae; Vesiculovirus. REFERENCE 1 (bases 1 to 1657) AUTHORS Bilsel,P.A. and Nichol,S.T. TITLE Polymerase errors accumulating during natural evolution of the glycoprotein gene of vesicular stomatitis Indiana serotype viruses JOURNAL J. Virol. (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by S.T.Nichol, 14-JUN-1990. FEATURES from to/span description pept 30 1565 glycoprotein BASE COUNT 489 a 367 c 357 g 444 t ORIGIN 1 aacagagatc catctgtttc ctcgacacca tgaagtgcct tttgtactta gctcttttat 61 tcattggggt gaattgtaaa ttcaccatag tttttccaca caatcaaaag ggaaattgga 121 aaaatgttcc ttccagttac cattactgcc cgtcaagttc agatttgaac tggcataacg 181 gcctaatagg cacagcccta caggtcaaga tgcccaagag tcacaaggct attcaagcag 241 acggttggat gtgtcatgct tccaaatggg ttactacttg cgacttccgc tggtatggac 301 cgaaatatat aacacattcc atccggtcct tcacaccatc tgtggagcaa tgcaaggaaa 361 gcattgaaca gacgaaacag ggaacatggc tgaatccagg ctttcctcct caaagttgcg 421 gatatgcaac tgtgacggat gctgaagcag tgattgtcca ggtaactccc catcatgtac 481 ttgtcgatga gtacacggga gagtgggtag attcccaatt catcaacggg aaatgcagca 541 atgacatatg ccctactgtc cataactcca caacctggca ttccgactac aaagtcaaag 601 ggctgtgtga ttctaacctc atctccatgg acatcacctt cttctcagaa gatggggagc 661 tatcatctct agggaaagag ggcactggat tcaggagtaa ctactttgca tacgaaacag 721 gagacaaggc ctgcagaatg cagtactgca agcattgggg agtcagactc ccttcaggtg 781 tctggtttga aatggccgac aaagatctgt ttgctgcagc aaaattccca gaatgtccag 841 aagggtcaag catctctgct ccatctcaga catcagtgga cgtgagtctt attcaggacg 901 ttgagagaat cctggattat tccctttgtc aagaaacctg gagcaagatc aaagcaggtc 961 ttcccatctc tccggtagat ctaagctatc ttgctccaaa aaacccagga accgggccag 1021 cctttacaat aatcaatggt actctgaaat actttgagac cagatacatc agagtcgata 1081 ttgctgctcc aattctccca agaatggtcg gaacaatcag tggaactacc accgagagag 1141 aattgtggga cgactgggca ccatatgaag atgttgaaat cggaccaaat ggagttctga 1201 aaaccagttc aggatataaa ttccctttat atatgattgg acatggcatg ctggactctg 1261 atctccatct cagctcaaag gctcaggtat ttgaacaccc tcacattcag gatgctgcat 1321 cgcagcttcc tgatgatgag actttatttt ttggtgatac cgggttatcc aaaaatccaa 1381 tcgagctcgt agaagggtgg ttcagtggtt ggaaaagctc cattgcttct tttttcttta 1441 tcataggttt gatcattgga ttattcttgg ttctccgagt tggtatctat ctctgcatca 1501 agttaaaaca taccaaaaaa agaaaaatct atacagacat tgagatgaat cgactcggaa 1561 agtaactgga atcagacacg acagaataat catatttaaa ccaaatcgac ttgtgatatc 1621 atgctctaag aggctttaat tatgttttaa ttttatg // LOCUS VSVGLYPZ 1812 bp ss-RNA VRL 27-JUL-1990 DEFINITION Vesicular stomatitis virus (strain 85-ES-B2) glycoprotein gene, complete cds. ACCESSION M35232 KEYWORDS glycoprotein; surface antigen. SOURCE Vesicular stomatitis virus (strain 85-ES-B2, Indiana serotype), cDNA to viral RNA. ORGANISM Vesicular stomatitis virus Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses; Rhabdoviridae; Vesiculovirus. REFERENCE 1 (bases 1 to 1812) AUTHORS Bilsel,P.A. and Nichol,S.T. TITLE Polymerase errors accumulating during natural evolution of the glycoprotein gene of vesicular stomatitis Indiana serotype viruses JOURNAL J. Virol. (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by S.T.Nichol, 14-JUN-1990. FEATURES from to/span description pept 30 1565 glycoprotein BASE COUNT 513 a 360 c 346 g 593 t ORIGIN 1 aacagagatc aatctgtttc cttgacacta tgaagtgcct tttgtactta gctttgttat 61 tcattggggt gtattgcaag ttcaccacag ttttccctca caacaaaaaa ggagactgga 121 aaaatgttcc ttccaattac cattattgcc catcaagctc agatttaaat tggcataatg 181 acttaatagg cacagcctta caagtcaaaa tgcccaagag tcacaaggct attcaagcag 241 atggttggat gtgtcatgct tcaaagtggg tcactacttg tgatttccgc tggtatggac 301 cgaaatatat aacacattcc atcagatcct tcactccatc tgtagaacaa tgcaaggaaa 361 gcattgaaca gacaaaacaa ggaacttggc tgaatccagg cttccctccc caaagttgcg 421 gatacgcaac tgtgacagat gctgaagcag tgattgtcca ggtgactcct caccatgtgc 481 ttgttgatga atacacagga gaatgggtcg attcacagtt catcaacgga aaatgcagtg 541 atgatatatg ccccactgtt cataactcca caacctggca ttctgactat aaggttaaag 601 gactatgtga ttctaacctc atttccatgg acatcacctt cttctcagaa gatggagagc 661 tatcatccct aggaaaagag ggtacagggt tcagaagtaa ctactttgct tatgaaactg 721 gagacaaggc ctgcaaaatg cagtactgca agcattgggg agtcagactc ccatcaggtg 781 tctggttcga gatggctgat aagaatctct ttgctgcagc caaattccct gaatgcccag 841 aaggttcaag tatctctgct ccttctcaga cctcagtgga tgtgagtctc attcaggatg 901 ttgagaggat cttggattat tccctctgcc aagaaacctg gagcaaaatc agagcgggtc 961 ttcccatatc tccagtggat ctcagctatc ttgctcctaa aaacccagga accggccctg 1021 ccttcaccat aatcaatggt accctaaaat actttgagac cagatacatc agggtcgata 1081 ttgctgctcc aatcctctca agaatggtcg gaatgattag tggaactacc accgaaaggg 1141 aactatggga ggactgggca ccatatgaag acgtggagat tggacccaat ggtgttctga 1201 ggaccagttc aggatataag tttcctttat atatgattgg acatggtatg ttggactccg 1261 atcttcatct tagctcaaag gctcaggtgt tcgaacatcc tcacattcca gacgctactt 1321 cgcagcttcc tgatgatgag actttatttt ttggtgatac agggctatcc aaaaatccaa 1381 tcgagcttgt agaaggttgg ttcagtggtt ggaagagctc tattgcttcc tttttcttta 1441 tcatagggtt aatcattgga ttattctttg ttctccgaat tggtgtttac ctttgcatta 1501 aattaaagca caccaataaa agacagattt atacagacat aaagatgaac cgacttggaa 1561 agtaactcaa atcctgcaca acagattctt catgtttgaa ttaaatccac ttgtgatgca 1621 aagctcaaag aggccttaat catattttaa ttttttaatt tttaattttt aatttttaat 1681 ttttaatttt taattttatt tttttatttt tattttttta ttttttatta ttttttattt 1741 tttatttttt attttttatt tttatttttt atctttttat tttttatttt tatttttatt 1801 tttattttta tg // LOCUS CAVXDHB 8062 bp ds-DNA INV 27-JUL-1990 DEFINITION C.vicina xanthine dehydrogenase (Xdh) gene allele 1, exons 2-4. ACCESSION M30316 KEYWORDS xanthine dehydrogenase. SOURCE C.vicina third instar larvae fat body DNA (library of Schenkel et al.), clone Lcv1. ORGANISM Calliphora vicina Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta; Pterygota; Neoptera; Holometabola; Diptera; Brachycera; Cyclorrhapha; Schizophora; Muscoidea; Calliphoridae. REFERENCE 1 (bases 1 to 8062) AUTHORS Houde,M., Tiveron,M.-C. and Bregegere,F. TITLE Divergence of the nucleotide sequences encoding xanthine dehydrogenase in Calliphora vicina and Drosophila melanogaster JOURNAL Gene 85, 391-402 (1989) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by F.Bregegere, 01-DEC-1989. FEATURES from to/span description pept / 2538 6299 xanthine dehydrogenase, exon 2 (AA at 2538) (EC 1.1.1.204) 6367 6534 xanthine dehydrogenase, exon 3 7233 7283 xanthine dehydrogenase, exon 4 pre-msg < 1 7400 xanthine dehydrogenase mRNA and introns IVS < 1 2537 xanthine dehydrogenase intron A IVS 6300 6366 xanthine dehydrogenase intron B IVS 6535 7232 xanthine dehydrogenase intron C signal 7370 7375 poly-A signal signal 7391 7396 poly-A signal BASE COUNT 2516 a 1358 c 1473 g 2715 t ORIGIN 1 aagcttcaac ctgtaaaata aatttcaata aaatgttata aaatcttata ttttggtaaa 61 aattccattt gaataaaaca aatctagacg aaattataca tacaatcaaa tttatgcact 121 tcagatgttg aaaacagatt ggaaaattac ttattatacg tttcttagtg ttcgatttat 181 ttcgcttagc aataattgca aaatatttga ttgatttgta accataataa catcgttttt 241 aaatatgagt acttacttgt cttatcaact ttgttgttta tatatcaata tatatgtata 301 tttcattaaa aaacaataac aattttaaaa tggcaacttt ttgtgtaacc ttaatgaaaa 361 aaaaaaaata acacataaaa catcttatca atgactgctg tgagaccccc ctgtgacgta 421 cagataataa caatcttaaa actgttatac tgcacttttg taggactaat gaattagtaa 481 acaaataaat aactctgtat aaatatagaa cattttatta tccacatatt agttaaattt 541 gtcttggagg tctaatagct agtttctggg ataataaaat ctacattctt cgtttaaacc 601 tagattaagt aaactacaaa ttttatttct cactaatggc gtaaagagta tatgtaatct 661 aggtttaact gagcattatt gacgagttaa acctatgaaa aaaattattc caattgattt 721 tgtttcttca gattccattt gatttcgaca gttcccaaaa tattttataa ttctgaattt 781 ccaaatttaa tttggttttt ggtaaccccc tgtttatact tgacaaattt tgatcataat 841 aattaatgac gtttgttgcc aagacaaagt tgtcctgcaa tagcactaac aattgtgtca 901 taacgaagca ataatactaa taaatgaaga cattacctga aatttttttt atcttgacaa 961 caattttgac ttttatcatg ataaaaattt atcaggtata gataattatt taaaaatatc 1021 tgaaaaacaa ataattacaa actaataaca attcgacata taagacattc acacgatgaa 1081 ttttcagtca tgcaattgta gcaattctct tttgaatttt cttttactcc acccctcttg 1141 ccaacaaact cataaaatca cttagagtga catgacaaaa caagacatcg taaaaattcc 1201 ttattaaaaa aaaacaagtt agagagctat attcggcagt gccgaatctt atataccctt 1261 caccaaatta tacttcaaaa taaaaatttt aaatattttt aagtaaacaa attttttttt 1321 ccaaagttgt ttttttaagt ttttaggaaa tttttttttt cgaattgtta ttttaaaatt 1381 tttaaacagc tgttacgaac aaaatcaact atgtattgtg aatgaaaagt tcatgagaat 1441 attacgatag caattttccc tttgagggaa cataaatttc acgtattatt gccgataggg 1501 tctattgact agcatgtatt ttgttgttgc aagagttagg ttttggacaa cagacttagg 1561 tttttgactc gtctcgtctt ttattacaca gttttatttt atatttatta ttacagaagt 1621 ttaaaagttt ccattgtttt gccaatgtta agcattatat ccacatatac atgtattttt 1681 gttaattatt ccctatattt taatagaaaa cctggtgaaa ggtcatattt tttaaaactg 1741 cacagagaaa atgattcggt tgcagacata gacattttcg gttctatcaa cagaaaatca 1801 gttgataaag aagaatttcc gttgaagcaa ccaaactttt gttacctctt ctaaaatgtt 1861 gtagccataa ctgtaaaatt cggtcactaa gaatcattcg attggcaaca aattcggttg 1921 ctatcacgaa tctgttttct ctgtgtgaaa gtatggaata ttgttttaaa aaagtttcca 1981 ttttttcgct agtaggtctg ggttaattaa taactcgcgg cttcgtataa cgattcctgt 2041 tctacgtcgt ttacaccaac cgcatgtatt taatttgaac aaatcctggt ggagaccata 2101 aaaacccttc gatactggtg gagtatcatt tcaaactcaa atcgtaaatt ggtagagaac 2161 catataaaac tcgacggatc aactggagca gtatcattat cactcttgat tttaataaca 2221 aggtttaagc attctcactt ctatgaagac atatgtaggt catttaggct tacttaaata 2281 gacatgggat aacctggaaa atttgtgaga cgttatactc aaccaacaaa ctttaaaaca 2341 attttattgt cactctctgt ctaaagcatc gtaatctata catcacttta cttcacaaat 2401 gttacaatac tctccgtaat taatcttaaa tatcatcaat tacacatttt gtttaagtaa 2461 tttaaaaata ttaagttctg cattcttaaa atacagcttt gattcaagaa atataactat 2521 ttgtcttatt ttcctaggta attgatacca atcccgatcc agaatgtacc cttctaacat 2581 atttacgtga aaaactaaga ttatgtggca caaaattggg ctgtggcgag ggtggttgtg 2641 gtgcctgcac cgtcatgatt tctcgcattg atacgctaac gaatcggatt aaacacatag 2701 cggtgaatgc ctgcctgact ccggtatgtg ccatgcatgg tagtgccgtg actaccgtgg 2761 aaggtattgg cagtacgcga actcgattgc atccagtaca agagcgtctg gctaaagccc 2821 acggcagtca gtgtggtttt tgtacacccg gtattgtcat gtctatgtat gcactgttga 2881 gaaatttatc ccagccctcg atgaaagatt tggaaattgc tttccaggga aatttgtgca 2941 gatgcacggg ctatagaccc attttagagg gttataaaac atttactaaa gaattcggct 3001 gtgctatggg tgataaatgt tgtaaggtga atggcaataa atgtggagag ggtatggaaa 3061 atggtggtga catggtcgac gataaactat ttgagaaaag tgaatttgta cctttcgatc 3121 cctcacaaga acccatattt cctccggagc tgcagctgaa taaagattgg gactctcaaa 3181 cgttagtgta taagggcgaa cgtgctactt ggtatagacc tggcaattta gaagatttac 3241 tgaagattaa ggcccaattt ccggaagcca agttggttgt gggcaacact gaaataggcg 3301 tggaggttaa atttaaacat tttctatatc cagtcttggt aaatcccaca aaagttaagg 3361 aaatgattga tgtccaggaa ctcgaggata gtatttattt tggtgcttcc gttagtttaa 3421 tggatataga taggatttta agatccagca tagaaaagct acccgaacat caaacaagat 3481 ttttccaatg tgccgtgaat atgttgcatt attttgctgg taaacaaatt cgtaatgtgg 3541 cgtcattggg tggcaatatt atgacgggta gtccaatatc cgatatgaat ccggtgttga 3601 tggcgggtgc tgttaagttg aaagtggcca agtatgttga gggtcaaata aaatacagag 3661 aagtttgcat ggcatctggc ttctttacgg gttatcgcaa gaatgttata gaacccacag 3721 aaattttggt gggtctgtat ttccccaaga ctttggaaca tcaatatgtg gtggctttca 3781 agcaggccaa acgtagagat gatgatattg ctattgtaaa tgcagccata aatgtattca 3841 ttgatcccag gtccataact gtggataagg tgtatatggc ttttggtggt atggctccta 3901 ccactgtcct ggccacacgc actgccgaca taatggttaa acaacaatgg aataaggtgc 3961 ttatggagcg tgtggtggaa aatctgtgtg cagagttgcc tttggctccc tcagctcctg 4021 gtggcatgat tgcctatcgt cgttccctgg tggtgagctt atttttcaag gcctacttaa 4081 caataaccca acaattaatt aaatctggca ttttgcccca agattcttta ccccaagaag 4141 aactaagcgg ttctgatgtc ttccatactc ctgccttaaa aagtgctcaa ctgtttgaaa 4201 aagtttcaaa caaacaatcg gaatgtgatc ctattggcag accaaagata catgcctccg 4261 cattgaagca ggctaccgga gaagccatct actgcgatga tatgcctcgc atggaaaatg 4321 aattatattt ggcgctggtg ttaagtacaa aagctcatgc gaaaatccta agcattgatg 4381 cttcggaagc tctggcaatg cccggtgttc atgctttctt ctccagcaaa gacataaccc 4441 aacatgaaaa tgaagttggt ccagtgtttc atgatgaaga ggttttcgcc tcagatatgg 4501 tttactgtca aggtcaagtt attggagcta tagcagccga taatccaaac tttagctcaa 4561 agacggctcg taaggttaca atcgaatatg aagatataaa gccagttatt ataactattg 4621 aacaggctat cgagcataaa tcctacttcc ctgattatcc ccgttttacg gaaataggtg 4681 atgtagagaa ggctttcagt gaggccgatc atgtctatga aggttcttgt cgcatgggtg 4741 gccaggaaca tttctatttg gaaacccatg cctctttagc cgtgccccga gattcagatg 4801 aaattgaaat cttctgttca acacaacatc cctctgaggt acaaaaacta gtggcccatg 4861 tattgtcaac ttcagcacat cgagtagtgt gtcgtgccaa acgtttggga ggaggttttg 4921 gtggcaaaga gtctcgtgct atagctgttg ccttgcccgt ggctttggcc tgccatcgtc 4981 ttcgtagacc aatacgttgt atgctcgatc gtgatgaaga tatgatgatc accggtactc 5041 gtcatccctt tttatttaaa tacaaaattg cttttacctc tgaaggtcgt ttaaccggtt 5101 gctatattga gtgctacaat aatgccggtt ggtctatgga tctttccttc tccgtattgg 5161 aaagagccat gtttcatttt gaaaattgct ataaaatacc caacattaag gtgggtggtt 5221 gggtttgtaa aacaaactta ccatcgaaca cggcattccg tggctttggt ggtcctcagg 5281 gcatgtttgc gggagagcat ataatacgtg atgtggcccg catattgggc aaggactatt 5341 tggaaattat gaaacaaaat ttctataaag aaggcgatat aacgcactat caacaaaagt 5401 tggataactt tcccattgag aaatgtttct acgattgcct gcaacagtcg aactattacc 5461 agaaaaggaa agaaattgaa gagtttaatc gtaaccaccg ttggcgcaag cgtggcatat 5521 cattagttcc cacaaaatat ggcatagcct ttggtgtttc acatctcaat caagctggtg 5581 ctttaataaa tatttatgct gatggctcgg tgctgctttc gcatggtggt gtggaaattg 5641 gccaaggttt acataccaaa atgatccagt gttgtgctcg tgctttacaa atacccatag 5701 aattcattca catttccgag actgccactg ataaagtgcc aaatacatca cctacggctg 5761 ctagttcggg ttccgatcta aacggcatgg ccgttctgga tgcttgtgaa aagttaaata 5821 aacgtctggc tccgattaag gaggccaatc cgaatggctc ttggacagaa tggattaata 5881 aagcctattt cgaacgtgtc agcctgtcgg ctaccggctt ctatagaatg cctgatatcg 5941 gctatgatcc tgttcaaaat ccaaacgctc taatgtacaa ctattttacc aatggtgttg 6001 gttcgtcgat agttgaaatc gattgtctaa ctggagatca tcaggtccta agcacagata 6061 ttgttatgga tattggttcg agtttgaatc cggccattga tattggtcaa attgagggag 6121 ccttcatgca aggctacggc ctattcacat tggaagaaat gatttactcg cctcagggtg 6181 tactctattc tcgcggcccc ggcatgtaca aattacctgg ttttgctgat ataccgggag 6241 aatttaatgt gacaattttg acaggagctg ctaatccccg ggcggtttac tcctccaagg 6301 taagagataa tttgtgtttt aattgaagtt gtatgtttta ttatattgtg tttatgttta 6361 tttcaggctg ttggcgaacc tcctttgttt attggctgtt ctgtgttctt tgccattaaa 6421 gaagccatta cgtcggctcg actaatgaat ggcttgagcg aggactttaa attagaatca 6481 ccagctactt cggctcgtat acgtatggct tgtcaggatg agtttacgaa tttggtaaga 6541 aaatattaac aatttgaaat ggctcttatt aaatctttat atattgtttt aatgaagatc 6601 tagttagttt tctataggaa aatttcaatt gattcggcat tatattaaaa acgttggata 6661 caactagttt tctttaggaa atttttggat tcaacttgtt tttattagaa aattatttga 6721 ttcgattaga ttttagactt cttttatatg aaaatttgca ttcgactagt tttttacagc 6781 aagtattctt ttttataccc ttcaccttcg ggagcagggg atatataagt ttgtcattcc 6841 gtttgtaatt tccacaatat aattttccga ccctataaag tcgattaagc catgtccgtc 6901 tgtctatctg tcggcttgcc tgtctgttga aattttctga atacatttga ttaattaatt 6961 cgacatgttt tctgtggcaa acttgagatt cgactaggtt tatttcaaaa acttttttat 7021 tcaattagtt ttcattataa aatttaccat ccaattgttt ttattagaaa atttccgttt 7081 aattataagt tttcattaaa aactattcga ttcgagtagt tgtctgaaga aactgtttgt 7141 tttcctaaag aaaaccttcg attagactag ttttgattaa aaaacctttc gattcgaaca 7201 tttttaatgc caaatttttt ttatttttac agattgagca gccccctgct ggaagttatg 7261 tgccatggaa tattgtacct taaaattaat atttttttta atttattagc aaaaatttag 7321 ctagaaataa acgcaataga catttttttt gtatacaaat aaactgtgaa attaaatgaa 7381 atatttagaa aattaaatat aatactcgta tttgcattta ttacaatgaa atataaaact 7441 gattaaatgg aaaatgaaat gatgtgcaat taaatgttaa tggcagagtc tgtattaaca 7501 gaattgacct caattaaatg atcattactt tgttcattta tcacattggc attatcatct 7561 ccggacaaag tttcatacgg ttgattgtgg ctgaaaagga aatggaaaat tttaacagta 7621 tttcttcaca ccttaaagta tgctttaaaa tgtagataat aaataatttt attaaaagca 7681 ctgggagaaa acgtttaact gttaactact atctacttta ccttggtttt ttttcaattt 7741 aaacaatttt tatttgtttt ttttttaatt caacacatca actactttta tttgtttaat 7801 ttataaaatt ttttcataca aaatttgagc atgcatgcga tacttgatga cttttgtcaa 7861 cttttggcgt tgaattgtat ttaacgatat ttgagttttg aacattcgaa cagctacaaa 7921 cactttttct aaatagtttg ctaaatttag ggcaatgaca ggtaaatgaa ggacgtcttc 7981 taacgctaaa gtaagtcaaa tttcaagtaa ttaaaatatt aatacaaaaa aaaaatttga 8041 gtgtggttta aaagtaagcg ct // LOCUS CAVXDHC 3346 bp ds-DNA INV 27-JUL-1990 DEFINITION C.vicina xanthine dehydrogenase (Xdh) gene allele 2, exon 1. ACCESSION M30488 KEYWORDS xanthine dehydrogenase. SOURCE C.vicina third instar larvae fat body DNA (library of Schenkel et al.), clone Lcv2. ORGANISM Calliphora vicina Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta; Pterygota; Neoptera; Holometabola; Diptera; Brachycera; Cyclorrhapha; Schizophora; Muscoidea; Calliphoridae. REFERENCE 1 (bases 1 to 3346) AUTHORS Houde,M., Theron,M.-C. and Bregegere,F. TITLE Divergence of the nucleotide sequences encoding xanthine dehydrogenase in Calliphora vicina and Drosophila melanogaster JOURNAL Gene 85, 391-402 (1989) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by F.Bregegere, 01-DEC-1989. FEATURES from to/span description pept 1529 / 1609 xanthine dehydrogenase, exon 1 (AA at 1529) (EC 1.1.1.204) pre-msg 1305 > 3346 xanthine dehydrogenase mRNA and introns IVS < 1610 3346 xanthine dehydrogenase intron A BASE COUNT 1213 a 444 c 419 g 1270 t ORIGIN 1 aactgttcga ataatttgaa attgccgatt ttcaaataac aaataaaccg attaatttgt 61 gtcaattaac cgattaaccg aaattccttt tttttgcgct ttaaaatttt tttttgtatt 121 tatttttccg tttcaattca aatacaaatt aaaatttgat attttttgaa catccaataa 181 acatgattaa tgagtatgta taacaactga catgttttga aactttgttt gtatttacat 241 gtatagacta aacttttttg aattgagtct tttatcataa ctaaatgaat ctattaaatt 301 aatccaaatc taaaacaatc caaaaaagaa tattctttat catgtttttg atttctccaa 361 catataaaat cagtgagaga gttttttcca agaaaagttt tattaaatct gaagaaaaaa 421 atcgtttaaa ttatattctt tttataaaag attatttcag aagatctctt gctgcaacaa 481 cgtaataatt aaaaaaaagt cgtttccaaa aaatgtcttt gaatatttta tgacatttta 541 ctatttctta tataaatttt caacaaatca tgcgatttca gaaatataaa tagtagatgt 601 taatatcttt ctaatctgta tttaacccaa atttttactt atgaaatata cgagaaaaca 661 tgaattttaa ttttgatgtt tgcaataata aaaaacactc tcacacaaaa aataattgac 721 ttttttgaaa actatatgaa agattataga tcagcatatt ttgtattact cagttcataa 781 aacacggatt ttgagttatg acgttgttgc agcaaataca aaaagattac aaaactgatc 841 tcaaatacct tatttgctgt tttttatgtt tttgtacaca aaattcgttg ttgtattttc 901 aatttaaaat gaaaatagaa attattcggt taatcgaata attttaaatt aaccgattat 961 taaccgaata aatctaaacc tcgattaatt atttgctcga ttaaccgatt aaaccagaga 1021 cccgattaat tgaataccct acccccatct ttttatgcaa tttttgccct gtaaatgtca 1081 atgcgacaac actaccacta ctgtatattt cttattgtag cgccctcatg agtattaaga 1141 catttactct cgccctcgat tttatttgta ttattctctg gttatcatca cacttgttga 1201 aaatttatac acaaatatct tagcgactct ctaccgtagc cagccggttg tacaattaat 1261 tattaatacc agtataaata aaatttttat attcaaataa ataaataatt tgtattcgac 1321 tctcagttct ccagtttgca acgtgtgatc attgtgtcga gacgtatcgt ttaaaagtgc 1381 ttataatata cgtacaatat ttcatattgc gtttattatc gattgtttga ttaaaaaaat 1441 tgaattgaaa cgaaaataaa gacatcattg caattagtta aataaaaatt gataaaacaa 1501 gtggaattca gacagctaaa cgtttaatat gacgcaagaa cataatgcag cagtcctaga 1561 tttaaatcca acattttcta cattgatatt ttttgttaat ggcaaaaagg taattaatgt 1621 aatagtattt aaaaataatt tatgtatgaa aacagttagt ataatatttg gttaaaacat 1681 aactatacat tgtgattgag ggggggttta gggaacaaat tttattaaca cacatattta 1741 aatacacgaa aattttataa ataaaagata tttgttgatt atttattgag tcaattccct 1801 aatatttgtt gtaatcgtac tcgcattgtt attgtctgtc atacataaaa acttctgtct 1861 gtctgtctca acttctcatc aaacgagtaa tcatcatgaa cacttatacc tggtttgacg 1921 gcataaataa taaaatagag ttgcacagtg aggtcgatcg taaaacatgc taataaatct 1981 gaatcttctg acctatttcg attacaaaat gatgtataaa aaactaactt tgaaaaaata 2041 aattttagta taatttgtat gcaattaaaa tctgctcccc tgagcttgtt ttaggacggg 2101 tgattaatgt ttaaatgtca gattaatatt ctaattaata tgtttattat aatttgtaaa 2161 aaaaatttaa gaactagatg tgacacttag aatttttttt tttttaaatc actattttct 2221 tcctttgcct gaagtagggt acatatcgat tttaaaatct tatcgaaaat gaaaaaatat 2281 tacaattaag ctacattttt gtcaaaatgc atccattttt atatccttca ccatgagtgg 2341 cataagtctg tcattccgat tgtaaaatct acatttttca ttttcgaccc cacaaagaaa 2401 atatattcta gatcgttata catcaatata tcgggaattc ttccggctcg gaaactattt 2461 aaaaccgaaa aaatcggctg agatataagg aataaaccgg gacaacctcg actcttggcc 2521 tatttttgat ttatattttt ttgtcataaa ttaaaaaaag aattaaaaat aaaatttgga 2581 aaaaacttct ttaaacaaaa ttaaaaaaca agttagagag atacatatat tcggccgtgc 2641 atcttatata cccttcacca aattatactt caaaataaaa attttaaata gttttaggta 2701 aacaaaattt attttttttc caaagtagtt tttttttaat tttttggaaa aaaaaatttt 2761 cgaattgttt tttaaaattt taattttttt tttgtttttt taatatttag cgaaaaaaaa 2821 tttttttgtg aaaaaaaaat taacaaaaaa ctatttacat gtcattataa attttttgaa 2881 ataagatttt gacattctcg taaaatcagc ctaaggtgtt ccaacatttt gattaaataa 2941 tatctagttg atttttttgt tgttgttgct tttgtctatc attcaggttc agttgtaaac 3001 attaacgtgt tttgtttgcc ttatcacttg tttgcatttt tagaaaatgt gttttaatta 3061 aattagttta gaaaagtttt ttaaatggaa attattttaa gagcattgtc tggacattta 3121 tagatatata ttattgttaa ttgtattagt taaattttga ataaattgtg gaaaaatatc 3181 agttattttt tggttaaatt aaggaattag tacataaaaa cgaaagatag ctgagaattt 3241 ccatattagc gaatgcttct tggcttccga gtaaaaactc gcccgttagc gaattccatc 3301 atagcgttgt caatgattct tacagaaaca tcaagtatta aatgtt // LOCUS RATCGM1A 1104 bp ds-DNA ROD 27-JUL-1990 DEFINITION Rat carcinoembryonic antigen (rnCGM1) gene, partial cds. ACCESSION M22226 J04626 KEYWORDS carcinoembryonic antigen. SOURCE Rat (strain Wistar) adult male DNA, clone lambda-rnCGM1-1. ORGANISM Rattus norvegicus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 1104) AUTHORS Kodelja,V., Lucas,K., Barnert,S., von Kleist,S., Thompson,J.A. and Zimmermann,W. TITLE Identification of a carcinoembryonic antigen gene family in the rat: Analysis of the N-termianl domains reveals immunoglobulin-like, hypervariable regions JOURNAL J. Biol. Chem. 264, 6906-6912 (1989) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence [1] kindly submitted by W.Zimmermann 12-JAN-1989. FEATURES from to/span description pept / 723 / 1082 carcinoembryonic antigen rnCGM1, exon x (AA at 725) pre-msg < 1 > 1104 carcinoembryonic antigen rnCGM1, mRNA and introns IVS < 1 722 carcinoembryonic antigen rnCGM1, intron y IVS 1083 > 1104 carcinoembryonic antigen rnCGM1, intron y+1 BASE COUNT 314 a 262 c 232 g 296 t ORIGIN Unreported. 1 gaattcactc ctcagctctc acagcataga tggacataca gactcctgaa ggctcttctc 61 ttccctccac actggtgtgt gtcacgtacc tgtagtgtgc acactgggac atgtaccttc 121 ccaaaccctc acgaacaata cagaaatatt aaattacact tgaatataat tatttttatg 181 tgctataaac atggaaatta tgtagacaaa cccagagata tcttttcttc cttccttcct 241 tccttcttcc ttccttcctt ccttcctctt tttccatact agtttctgag attttttgag 301 gaactgaacc ttccaaaaag accataccaa tccctgtcct caaaaagcct tttttattct 361 aatggactgg aaatcattgt atccagagga gaaagtcaat gatttagtgg aaccataaat 421 agaacagaaa acattcagga agtgaggatt gtatggagga ggaaaaagag gaggaggagg 481 aggaagagga ggaggaggag gaggaggacc gagagccggt tctccactca ccagacactt 541 tatggaaaga gtgatatggg gacacctgag tagaggattc cacagagagg aaatgacacc 601 ctttgaggtt ctgagggcat ggaggtcatg ctgctcacct ccattaaggg tgcatcctac 661 ctacaggctg agggatgctc acacctgctc aggattgtca acttttctct cttcccttct 721 agcctccctc ttaacctgct ggctcctgcc caccactgcc caagtctcca ttgaatcctt 781 accaccccag gtggttgaag gagaaaatgt tcttctacgt gttgacaatt tgccagagaa 841 tctcatagcc tttgtctggt acaaagggct gacaaacatg agcctcggag ttgcactgta 901 ttcactaacc tataacgtaa ctgtgacggg acctgtgcac agtggtagag agacattgta 961 cagcaatggg tccctgtgga tccaaaatgt cacccagaag gacacaggat tctacaccct 1021 acgaaccata agtaatcatg gagaaattgt atcaaataca tccctgcacc ttcatgtgta 1081 ctgtaagtaa ttctttgtga attc // LOCUS RATCGM1B 654 bp ds-DNA ROD 27-JUL-1990 DEFINITION Rat carcinoembryonic antigen rnCGM2 gene, partial cds. ACCESSION M22227 J04626 KEYWORDS carcinoembryonic antigen. SOURCE Rat (strain Wistar) adult male DNA, clone lambda-rnCGM2-1. ORGANISM Rattus norvegicus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 654) AUTHORS Kodelja,V., Lucas,K., Barnert,S., von Kleist,S., Thompson,J.A. and Zimmermann,W. TITLE Identification of a carcinoembryonic antigen gene family in the rat: Analysis of the N-termianl domains reveals immunoglobulin-like, hypervariable regions JOURNAL J. Biol. Chem. 264, 6906-6912 (1989) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence [1] kindly submitted by W.Zimmermann 12-JAN-1989. FEATURES from to/span description pept / 249 / 608 carcinoembryonic antigen rnCGM2, exon x (AA at 251) pre-msg < 1 > 654 carcinoembryonic antigen rnCGM2, mRNA and introns IVS < 1 248 carcinoembryonic antigen rnCGM2, intron y IVS 609 > 654 carcinoembryonic antigen rnCGM2, intron y+1 BASE COUNT 168 a 169 c 171 g 146 t ORIGIN Unreported. 1 aatgtaattc ttgttggaga gtgagtgggg agccatgcag acacgggagg agagagaccc 61 gtacaaaagg tcactccagc ttcgggggac tgggaacata gatgatgaag tttccctgca 121 ccaatgagag cgacgccctc accccacacc tcggcagaag atgaacacac ctacctgttc 181 cggacttggg cctcctctca gcgatcacta agcttctgac actgatggag ttttttcctt 241 ctccctagct tccttcttaa cctgctggaa tgcacccgcc gctgccgagc tcactattga 301 attagtgcca cccatggttg ctgaaggcgg aaactccgtt ttgtttgtgc atgaaatgcc 361 attgaatgtc caggcgtttt actggtacaa acagagagat ccgacgaaga gctatgaagt 421 cgcgcggtac ttaacaccca ccaacgaaag ttcgaagatg cctcagcaca gcggccggaa 481 aaccgtattc tacagtggat ccctgctgat cagaaacgtc acccaggccg acagtggagt 541 ctacacctta ctaacattta acacagaaat gcaaagcgaa ttaacacatg tgcatctgga 601 agtacgcggt aggtggttgc gggatctctg ggtgctaggg gtcggggtga gctc // LOCUS RATCGM1C 1185 bp ds-DNA ROD 27-JUL-1990 DEFINITION Rat carcinoembryonic antigen rnCGM3 gene, partial cds. ACCESSION M22228 J04626 KEYWORDS carcinoembryonic antigen. SOURCE Rat (strain Wistar) adult male DNA, clone lambda-rnCGM3-1. ORGANISM Rattus norvegicus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 1185) AUTHORS Kodelja,V., Lucas,K., Barnert,S., von Kleist,S., Thompson,J.A. and Zimmermann,W. TITLE Identification of a carcinoembryonic antigen gene family in the rat: Analysis of the N-terminal domains reveals immunoglobulin-like, hypervariable regions JOURNAL J. Biol. Chem. 264, 6906-6912 (1989) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence [1] kindly submitted by W.Zimmermann 12-JAN-1989. FEATURES from to/span description pept / 729 / 1088 carcinoembryonic antigen rnCGM3, exon x (AA at 731) pre-msg < 1 > 1185 carcinoembryonic antigan rnCGM3, mRNA and introns IVS < 1 728 carcinoembryonic antigen rnCGM3, intron y IVS 1089 > 1185 carcinoembryonic antigen rnCGM3, intron y+1 BASE COUNT 312 a 279 c 277 g 317 t ORIGIN Unreported. 1 ctgcagtgtg cacagcaaga cattgtgctt tcccgaaccc acacgaacac actgaattat 61 taaatcacac ttgaatatat tgatttccct ttgctctgag cctgggcact atgtagataa 121 gtccatggaa atattaatct ttccttcctt ccttccttcc ttccttcctt ccttccttcc 181 ttccttcctt tcttccttct tctagttctt ttcacgtttt cccttttctt tttctctcca 241 atttgtttct aatctatttt caggaactga accttccaaa aagatgattc cagtccctgt 301 cctcacaaag cccttttctt gtggactgga agtcagagta tccagagaaa ggcaatggtt 361 taatggaacc tcaaacagaa cagaaaacaa ttctgagagt gagcattgca tgaggaagag 421 gaggaacggg aagaggagga ggaagaggag gaggaagagg aggaggaaga ggaatgggaa 481 gaggaggagg aagaggagga agaggaggaa gaggtcagac agctgcttca cctctcacca 541 gacactctat gggaagaatg atatggggac acctgagtag aggattcctg gagaggaaat 601 gacagctttt gagtctttga gggcatggag gtcatgctgc tcacctccat taagggtgca 661 tcctacctac aggctgaggg atgctcacac ctgctcagga tcggtgactt ttttctcttc 721 ccttctagcc tcacttttaa cctgctggct cctgcccacc actgcccacg tcaccctcaa 781 gtcctcaccg ccccaggtgg ttgaaggaga aaacgttctt ctaagtgctg acaatctgcc 841 agagaacatt atagctttcg cctggtacaa aggggagacc gacatgaacc gtggaattgc 901 actgtattca ctgaggtata ctgtaagttt gacggggcct gtgcacagtg gtcgagagac 961 attgtacagc gacgggtccc tgtggatcaa aaatgtcacc caggaggaca caggatttta 1021 tacctttcga atcataaata atcatggaaa aattcaatca aatacaaccc tgttccttca 1081 cgtgaaatgt aagtaactct ttgtgaactg tgggttttgg gtggtgtcct tccactagac 1141 acatagaagt atcaggccag ggctgtgtct cccttccccc tgcag // LOCUS RATCGM1D 748 bp ds-DNA ROD 27-JUL-1990 DEFINITION Rat carcinoembryonic antigen rnCGM4 gene, partial cds. ACCESSION M22229 KEYWORDS carcinoembryonic antigen. SOURCE Rat (strain Wistar) adult male DNA, clone lambda-rnCGM4-1. ORGANISM Rattus norvegicus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 748) AUTHORS Kodelja,V., Lucas,K., Barnert,S., von Kleist,S., Thompson,J.A. and Zimmermann,W. TITLE Identification of a carcinoembryonic antigen gene family in the rat: Analysis of the N-terminal domains reveals immunoglobulin-like, hypervariable regions JOURNAL J. Biol. Chem. 264, 6906-6912 (1989) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence [1] kindly submitted by W.Zimmermann 12-JAN-1989. FEATURES from to/span description pept / 90 / 434 carcinoembryonic antigen rnCGM4, exon x (AA at 92) pre-msg < 1 > 748 carcinoembryonic antigen rnCGM4, mRNA and introns IVS < 1 89 carcinoembryonic antigen rnCGM4, intron y IVS 435 > 748 carcinoembryonic antigen rnCGM4, intron y+1 BASE COUNT 197 a 158 c 183 g 210 t ORIGIN Unreported. 1 gagctctggg aaggcagaag tgtgattttt taaaaaacca acagatttca cctgctcaat 61 atcgatggtt gctctgtctt cccttttagc ctcccttcta acctgttggc tcctgactac 121 tgcccaggtc aacattgaat cggtgccatt caatgtggtt gaaggggaaa acgtccttct 181 tcttgtccac aatctgccag agaatctcat agcctttgcc tggtatagag ggctgaggaa 241 aattggagta tacatactga acactgaagt aagtgtgacg gggccaatgt acagcggtag 301 agagacagtg tacagcaatg gttccctgtg tatccgcaat gtcacccaga aggacacagg 361 attctacact ctacgaacag tcaacacacg tggagaaact gtatcaacaa catccttgta 421 cctctatgtg tacagtaagt gatactttgt gaactctggg tgttgtgtgg ggttcattcc 481 gtagacacac acagaagagg caggcctacc taccctttgc attgtgtctc cttattgagg 541 tgtgaacatt taactcaggc taaggagagt aatgccaatt gaatagaatc cttcttttga 601 ctttaccttg tagtcagctg gatgtgtggt taactcagtg aaggacatca gcccttgtct 661 agacttctgg ggttcttagc agtaatgtgt ccttgggaaa gaccttgagg gaaggagatt 721 gggtttgaat gagatagcca taggatcc // LOCUS RATCGM1E 750 bp ds-DNA ROD 27-JUL-1990 DEFINITION Rat carcinoembryonic antigen rnCGM5 gene, partial cds. ACCESSION M22230 KEYWORDS carcinoembryonic antigen. SOURCE Rat (strain Wistar) adult male DNA, clone lambda-rnCGM5-1. ORGANISM Rattus norvegicus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 750) AUTHORS Kodelja,V., Lucas,K., Barnert,S., von Kleist,S., Thompson,J.A. and Zimmermann,W. TITLE Identification of a carcinoembryonic antigen gene family in the rat: Analysis of th N-terminal domains reveals immunoglobulin-like, hypervariable regions JOURNAL J. Biol. Chem. 264, 6906-6912 (1989) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence [1] kindly submitted by W.Zimmermann 12-JAN-1989. FEATURES from to/span description pept / 118 / 477 carcinoembryonic antigen rnCGM5, exon x (AA at 120) pre-msg < 1 > 750 carcnioembryonic antigen rnCGM5, mRNA and introns IVS < 1 117 carcinoembryonic antigen rnCGM5, intron y IVS 478 > 750 carcinoembryonic antigen rnCGM5, intron y+1 BASE COUNT 180 a 207 c 178 g 185 t ORIGIN Unreported. 1 agcgtaggca ggagactcca cacctcagct gaccactgga cacagctgct cggactcagg 61 caccatctta gccaaatact aaagtcctga tgttgacgga tctctcttcc cttctagcct 121 ctcttttcat ctgtgggcgt ccttttaacc ctgccaagct cactattgaa tcagtgccgc 181 ccagtgttgc tgaaggggga agcgttcttc tcctcgttca caatctccag gacgagcttc 241 gagggttttt ctggtacaaa ggggcgtcta tgtctagcaa ccatgagata gcccgataca 301 gaacagcaaa gaattcaagt gtgccaggcc ctgcccacag tggtagagag acggtgtaca 361 gcaatggatc cctcctgctc cagaatgtca cccggaatga cactgggttc tacaccctac 421 gcactctgaa aagacatcag aaaatggaat tggcacacgt gcaacttcag gtggacagta 481 agtgattttc cgtgatcgtt cagtgctggg tgggtctttg acacacagga ctgtcacccc 541 tggcatgtgg ctacctcctc tctgcctttt tatccccatg ttgtggttaa ccactatgtg 601 caggacacat gtgatggaaa gaaatgccca tgggtcagac ttatcatctg actctcccct 661 gtatcaagga cagtaactca accctaggtg ctagactctg cccagtcatc tggggcatct 721 tgccatgcaa cgtgaggaaa ccatggatcc // LOCUS RATGSTAA 1116 bp ss-mRNA ROD 27-JUL-1990 DEFINITION Rat glutathione S-transferase mRNA, complete cds. ACCESSION M28241 KEYWORDS glutathione S-transferase. SOURCE Rat liver, cDNA to mRNA. ORGANISM Rattus norvegicus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 1116) AUTHORS Hsieh,J.-C., Liu,L.-F., Chen,W.-L. and Tam,M.F. TITLE Expression of Yb1 glutathione S-transferase using a Baculovirus expression system JOURNAL Biochem. Biophys. Res. Commun. 162, 1147-1154 (1989) STANDARD full staff_review COMMENT Draft entry and computer-readable copy of sequence [1] kindly submitted by J.-C.Hsieh, 21-SEP-1989. FEATURES from to/span description pept 103 753 glutathione S-transferase (EC 2.5.1.18) BASE COUNT 268 a 334 c 246 g 268 t ORIGIN 1 tttgtccggc ccacgtttct ctggtaatct gtataaatcg caacacacag gtcaattctg 61 ctgaagccaa attgagaaga ccacagcgcc agaaccatgc ctatgatact gggatactgg 121 aacgtccgcg ggctgacaca cccgatccgc ctgctcctgg aatacacaga ctcaagctat 181 gaggagaaga gatacgccat gggcgacgct cccgactatg acagaagcca gtggctgaat 241 gagaagttca aactgggcct ggacttcccc aatctgccct acttaattga tggatcgcgc 301 aagattaccc agagcaatgc cataatgcgc taccttgccc gcaagcacca cctgtgtgga 361 gagacagagg aggagcggat tcgtgcagac attgtggaga accaggtcat ggacaaccgc 421 atgcagctca tcatgctttg ttacaacccc gactttgaga agcagaagcc agagttcttg 481 aagaccatcc ctgagaagat gaagctctac tctgagttcc tgggcaagcg accatggttt 541 gcaggggaca aggtcaccta tgtggatttc cttgcttatg acattcttga ccagtaccac 601 atttttgagc ccaagtgcct ggacgccttc ccaaacctga aggacttcct ggcccgcttc 661 gagggcctga agaagatctc tgcctacatg aagagcagcc gctacctctc aacacctata 721 ttttcgaagt tggcccaatg gagtaacaag taggcccttg ctacactggc actcacagag 781 aggacctgtc cacattggat cctgcaggca ccctggcctt ctgcactgtg gttctctctc 841 cttcctgctc ccttctccag ctttgctagc cccatctcct caacctcacc ccagtcatgc 901 ccacatagtc ttcattctcc ccactttctt tcatagtggt ccccttcttt attgacacct 961 taacacaacc tcacagtcct tttctgtgat ttgaggtctg ccctgaactc agtctcccta 1021 gacttacccc aaatgtaaca ctgtctcagt gccagcctgt tcctggtggg ggagctgccc 1081 caggcctgtc tcatctttaa taaagcctga aacacc // LOCUS MUSTCGXI 354 bp ss-mRNA ROD 27-JUL-1990 DEFINITION Mouse T-cell receptor gamma-chain mRNA V4-J1 region, 3' end. ACCESSION M30171 KEYWORDS T-cell receptor beta-chain; joining exon; variable region. SOURCE Mouse (strain BALB/cByJ) fetal thymus, cDNA to mRNA, clone FT2. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 354) AUTHORS Heilig,J.S. and Tonegawa,S. TITLE Diversity of murine gamma genes and expression in fetal and adult T lymphocytes JOURNAL Nature 322, 836-840 (1986) STANDARD simple staff_review FEATURES from to/span description pept < 1 342 T-cell receptor beta-chain V4-J1 region (AA at 1) recomb 293 294 V-region DNA end/J-region DNA start BASE COUNT 118 a 72 c 69 g 95 t ORIGIN 1 aagttggagc aacctgaaat atcaatttcc agaccaagag atgagactgc acaaatatcc 61 tgtaaagttt tcatcgaaag ctttaggagt gtaaccatac actggtaccg gcagaaacca 121 aaccaaggtt tagagtttct attatatgtc cttgcaaccc ctacccatgt tttcttagat 181 aaggagtaca agaaaatgga ggcaagtaaa aatcctagtg cttctacatc gatattgaca 241 atatattcct tggaggaaga agacgaagct atctactact gttcctacgg ccaagtagct 301 caggttttca caaggtattt gcagaaggaa ctaagctcat agtaattccc tctg // LOCUS MUSTCGXJ 353 bp ss-mRNA ROD 27-JUL-1990 DEFINITION Mouse T-cell receptor beta-chain mRNA V2-J2 region, 3' end. ACCESSION M30172 KEYWORDS T-cell receptor beta-chain; joining exon; variable region. SOURCE Mouse (strain BALB/cByJ) fetal thymus, cDNA to mRNA, clone FT12. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 353) AUTHORS Heilig,J.S. and Tonegawa,S. TITLE Diversity of murine gamma genes and expression in fetal and adult T lymphocytes JOURNAL Nature 322, 836-840 (1986) STANDARD simple staff_review FEATURES from to/span description pept < 1 297 T-cell receptor beta-chain V2-J2 region (AA at 1) recomb 294 295 V-region DNA end/J-region DNA start BASE COUNT 131 a 71 c 65 g 86 t ORIGIN 1 cagctggagc aaactgaatt atcggtcacc agagagacag atgagaatgt gcaaatatcc 61 tgtatagttt atcttccata tttctccaac acagctatac attggtaccg gcaaaaaaca 121 aatcaacagt ttgagtatct aatatatgtc gcaacaaact acaatcaacg acccttagga 181 gggaagcaca aaaaaattga agcaagtaaa gattttaaaa gttctacctc aaccttggaa 241 ataaattact tgaagaaaga agatgaagcc acctactact gtgcagtctg gatatagctc 301 gggctttcac aaggtatttg cagaaggaac aaagctcata gtaattccct ccg // LOCUS MUSTCGXK 76 bp ss-mRNA ROD 27-JUL-1990 DEFINITION Mouse T-cell receptor beta-chain mRNA V5-J1 region, 3' end. ACCESSION M30173 KEYWORDS T-cell receptor beta-chain; joining exon; variable region. SOURCE Mouse (strain BALB/cByJ) fetal thymus, cDNA to mRNA, clone FT6. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 76) AUTHORS Heilig,J.S. and Tonegawa,S. TITLE Diversity of murine gamma genes and expression in fetal and adult T lymphocytes JOURNAL Nature 322, 836-840 (1986) STANDARD simple staff_review FEATURES from to/span description pept < 1 21 T-cell receptor beta-chain V5-J1 region (AA at 1) recomb 18 19 V-region DNA end/J-region DNA start BASE COUNT 19 a 18 c 16 g 23 t ORIGIN 1 acgtactact gtgcctgcta gctcaggttt tcacaaggta tttgcagaag gaactagctc 61 atagtaattc cctctg // LOCUS MUSTCGXL 926 bp ss-mRNA ROD 27-JUL-1990 DEFINITION Mouse T-cell receptor beta-chain mRNA C1 region, 3' end. ACCESSION M30174 KEYWORDS T-cell receptor beta-chain; constant region. SOURCE Mouse (strain BALB/cByJ) fetal thymus, cDNA to mRNA, clone FT2. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 926) AUTHORS Heilig,J.S. and Tonegawa,S. TITLE Diversity of murine gamma genes and expression in fetal and adult T lymphocytes JOURNAL Nature 322, 836-840 (1986) STANDARD simple staff_review FEATURES from to/span description pept < 1 518 T-cell receptor beta-chain C1 region (AA at 3) BASE COUNT 267 a 231 c 169 g 259 t ORIGIN 1 acaaaagact tgatgcagac atttccccca agcccactat tttccttcct tctgttgctg 61 aaacaaatct ccataagact gggacatacc tttgtctcct tgaaaagttc tttccagatg 121 ttataagggt gtattggaaa gaaaaggatg gcaatactat cctggactcc caggaagggg 181 atacgctgaa gactaacgac acatacatga agtttagctg gcttacagtg cccgaaaggg 241 caatggggaa agagcacaga tgtattgtca aacatgagaa caacaaagga ggagcagatc 301 aagagatttt ctttccttca ataaagaaag ttgctgtgag taccaagcct acaacttgct 361 ggcaagataa aaatgatgtg ctgcagctcc agttcacgat cacctctgcc tactacacct 421 accttctcct gctcctcaag agtgtgatct acttggccat catcagcttc tctctgctta 481 gaagaacatc tgtctgtggc aatgagaaga aatcctaaag aaacgagtgg tggtacagca 541 agtcagctgg atttcatcct cactgccata aaggtgcctt aacgaggtcc agctgccttc 601 acttgttggc tttcacttct ataaagtccc tcactcatgt tgcataaaca ttttctgaac 661 gtttgtgtgc aatttctgca acttctctct aactcacatc acctcattcc tgattccatc 721 cactgcagaa gtcccctccc ccagaagtct gaagcatcaa aattctagta cccatagcct 781 acagctttta cccatagccc ttgttgctct tgaaccagct aactccatga agacccccac 841 tctagacaac ctgaagagcc ttctttactg tttgattctg tgccttgtat ttcacattat 901 ggacaatata ttttttaaaa attaaa // LOCUS MUSTCGXM 913 bp ss-mRNA ROD 27-JUL-1990 DEFINITION Mouse T-cell receptor beta-chain mRNA C2 region, 3' end. ACCESSION M30175 KEYWORDS T-cell receptor beta-chain; constant region. SOURCE Mouse (strain BALB/cByJ) fetal thymus, cDNA to mRNA, clone FT12. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 913) AUTHORS Heilig,J.S. and Tonegawa,S. TITLE Diversity of murine gamma genes and expression in fetal and adult T lymphocytes JOURNAL Nature 322, 836-840 (1986) STANDARD simple staff_review FEATURES from to/span description pept < 1 503 T-cell receptor beta-chain C2 region (AA at 3) BASE COUNT 263 a 223 c 168 g 259 t ORIGIN 1 acaaaaggct tgatgcagac atttccccca agcccactat ttttcttcct tctgttgctg 61 aaacaaatct ccataagact gggacatacc tttgtctcct tgaaaagttc tttcccgatg 121 tcataagggt gtattggaaa gaaaagaatg gcaatactat cctggactcc caggaagggg 181 atacgctgaa gactaagggc acatacatga agtttagctg gcttactgtg cccgaaaggg 241 caatggggaa agagcacagt tgtattgtca aacatgagaa caacaaagga ggagcagatc 301 aagagatttt cttcccttca ataaagaaag ttgctacaac ttgctggcaa gataaaaatg 361 atgtgctgca gttccagttc acgagcacct ctgcctacta cacctacctc ctcctgctcc 421 tcaagagtgt gatctacttg gccatcatca gcttctctct gcttagaaga acatctgtct 481 gtggcaatga gaagaagtcc taaagaaacg agtggtggta cagcaagtca gctggatttc 541 atcctcactg ccataaaggt gccttaacgg ggaaccagat gccttctctt gttggctttc 601 acttctataa agtccctcac tcatgttgca taaacatttt ctgaactttt gtatgcaatt 661 tcagcaactt tttttaaact gaactcacct tcttcctgat tccatccact ccagaagtcc 721 cctcccccag aagcctgaaa cattaaaatt ctagtaccca tagcctacag cttttaccca 781 tggcccttgt tgctcttgaa ccagctaact ccatgaagac ccccactcta gacaacctga 841 agagccttct ttactgtttg attctgtgcc ttgtatttca gattattgac aataaatttt 901 ttaaaaaatt aaa // LOCUS DVUHYDAB 3180 bp ds-DNA BCT 27-JUL-1990 DEFINITION D.fructosovorans periplasmic hydrogenase (hydA and hydB) gene, complete cds. ACCESSION M35333 KEYWORDS periplasmic hydrogenase. SOURCE D.fructosovarans (strain DSM 3604) DNA, clone DHS-alpha-[pHH7]. ORGANISM Desulfovibrio fructosovorans Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Sulfate- or sulfur-reducing dissimilatory bacteria. REFERENCE 1 (bases 1 to 3180) AUTHORS Rousset,M., Dermoun,Z., Matchikian,C.E. and Belaich,J.P. TITLE Cloning and sequencing of the locus encoding the large and small subunit genes of the periplasmic [Ni,Fe] hydrogenase Desulfovibrio fructosovorans JOURNAL Gene (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by M.Rousset, 15-JUN-1990. FEATURES from to/span description pept 301 1242 periplasmic hydrogenase-alpha subunit precursor (hydA) (EC 1.18.99,1) sigp 301 450 periplasmic hydrogenase-alpha subunit signal peptide matp 452 1239 periplasmic hydrogenase pept 1314 3008 periplasmic hydrogenase-beta subunit (hydB) mRNA 129 3060 periplasmic hydrogenase mRNA signal 97 102 -35 region signal 120 125 -10 region binding 289 293 ribosome binding site for hydA binding 1303 1307 ribosome binding site for hydB signal 3025 3060 termination signal BASE COUNT 660 a 1150 c 866 g 504 t ORIGIN 1 cgtgctgcat acgatttctt tattcatatc ctttacttcc ccggcctcat caccccggta 61 acacccttga atcacgtgaa aaatttgcca aaccccattg acgttcaaac aaactcgtga 121 ttaactgcaa aagggaaatc gaacccggcg acgcaggaac gcacgcgtcc gtcacataga 181 ggcggcggcg gccggttcgc ctgcgacggc cgcgtgccga acgggtcaac ggatgctacg 241 tggcctgggg atcatgggac gccccggacc atgcctctcg taaaacaaag gaggacgttt 301 atgaactttt ccgtgggtct tggcaggatg aatgcggaaa aacggcttgt gcaaaacggc 361 gtctcccgcc gcgacttcat gaaattttgc gccaccgtgg ccgcggccat gggcatgggc 421 ccggcgttcg cgcccaaggt cgccgaagca ttgacggcca aacaccgtcc gtcggtggtc 481 tggctgcaca acgccgagtg caccggctgc accgaagcgg cgatccggac gatcaaacct 541 tatatagacg cgctcattct cgacaccatc tccctggatt accaggagac catcatggcc 601 gcggccggcg agacgtccga ggcggccctg caccaggccc tcgaaggcaa ggacggctac 661 tacctcgtgg tcgagggcgg cctgcccacc atcgacggcg gccagtgggg catggttgcc 721 ggccatccca tgatcgagac caccaagaag gccgcggcca aggccaaggg catcatctgc 781 atccggcacc tgcctcacgg cggcgtccag aaggccaaac ccaatcccag ccaggccaag 841 ggcgtgtccg aagccctcgg cgtcaagacc atcaacatcc ccggctgccc gcccaacccc 901 atcaacttcg tgggcgccgt ggtccatgtc ctgaccaagg gcatcccgga tctcgacgag 961 aacggccgtc cgaagctctt ctacggcgag ctggtccacg acaactgtcc gcgcctgccc 1021 cacttcgagg cctccgaatt cgcgccctcc ttcgattccg aagaggccaa gaaaggcttc 1081 tgcctctacg aactcggctg caagggcccc gttacctaca acaactgccc caaggtgctg 1141 ttcaaccagg tcaactggcc cgtccaggcc ggccacccct gcctcggctg cagcgagccg 1201 gacttctggg acaccatgac gccgttctac gagcagggct aaccccctcc tttgtagcgg 1261 ccagtacccg gacaccttca acagcaccga acgtctcgtg acggaggaag catatggctg 1321 agagcaaacc cacgccacaa tccaccttca ccggccccat cgtggtcgac cccattaccc 1381 ggatcgaagg tcacttgcgg atcatggtcg aggtggaaaa cggtaaggtc aaggacgcct 1441 ggagctcctc gcagctcttc cgcggcctgg aaatcatcct caaaggccgc gatccccgcg 1501 acgcccagca cttcacccag cgcgcctgcg gcgtgtgcac gtacgtccac gccctggcct 1561 ccagccgctg cgtcgatgac gccgtcaagg tcagcatccc ggccaacgcc cgcatgatgc 1621 gcaacctggt catggcctcc cagtatctcc atgaccacct cgtccacttc tatcacctgc 1681 acgccctcga ctgggtcgac gtgaccgccg ccctcaaggc cgatcccaac aaggccgcca 1741 aactggcagc ctccatcgac acggctcgga ccggcaactc ggaaaaggcc ctcaaggcag 1801 tccaggacaa gctgaaagcc ttcgtcgagt ccggacagct cggcatcttc accaacgcct 1861 acttcctcgg cggccacaaa gcctactacc tgccgcccga ggtcaacctc atcgccaccg 1921 cccactacct ggaagccctg cacatgcagg tcaaggcggc cagcgccatg gccatcctcg 1981 gcggcaagaa cccccacacc cagttcaccg tcgtgggcgg ctgctccaac taccagggcc 2041 tgaccaagga cccgctggcc aactacctgg ccctgagcaa ggaagtctgc cagttcgtca 2101 acgagtgcta catccctgac ctgctggccg tggccggctt ctacaaggac tggggcggca 2161 tcggcggcac cagcaactac ctggccttcg gcgagttcgc caccgacgac agctcccccg 2221 agaaacacct ggccacctcg cagtttccgt ccggcgtcat caccggccgc gacctcggca 2281 aggtggataa cgtggacctc ggcgccatct acgaagacgt caagtactcc tggtacgccc 2341 ccggcggcga cggcaagcac ccctacgact gcgtcaccga tcccaagtac accaagctcg 2401 acgacaagga ccactactcc tggatgaagg ccccccgcta caagggcaag gccatggaag 2461 tcggtccctt ggcccgcacc ttcatcgcct acgccaaggg gcagcccgac ttcaaaaagg 2521 tcgtggacat ggtcctcggc aaactctccg tcccggccac ggccctgcat tcgaccctcg 2581 gacgcaccgc cgcccgcggc atcgagaccg ccatcgtctg cgccaacatg gagaagtgga 2641 tcaaggaaat ggccgacagc ggcgccaagg acaacaccct gtgcgccaag tgggagatgc 2701 ccgaggagtc caagggcgtc ggcctggccg atgctccccg cggctccctg tcccactgga 2761 tccgcatcaa gggcaagaag atcgacaact tccagctggt tgtcccctcg acctggaacc 2821 tcggtccccg ggggccccag ggcgacaaga gcccggtgga agaggccctt atcggcacgc 2881 ccatcgccga tcccaaacgc ccggtcgaaa tcctgcgcac ggtccacgcc ttcgacccct 2941 gcatcgcctg cggcgtgcac gtcatcgagc ccgagaccaa cgagatcctc aagttcaagg 3001 tttgctaaga gcggcaaccc tctaatacga aggacccgct tcgggccggg ttcttcgttt 3061 ctcaaggagg tcgcatgtcc gataccccgc gcaaaatcct catcctcggc ctcggcaaca 3121 tcctctacac cgacgagggc gtcggcgtgc gggccgtgga gcgccttctc gaaacccacg // LOCUS MTFRECA 1349 bp ds-DNA BCT 27-JUL-1990 DEFINITION M.flagellatum recA gene, complete cds. ACCESSION M35325 KEYWORDS co-protease; recombinase. SOURCE M.flagellatum (strain MFK1) DNA. ORGANISM Methylobacillus flagellatum Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Methylococcaceae. REFERENCE 1 (bases 1 to 1349) AUTHORS Gomelsky,M., Bolstin,A. and Tsygankov,Y.D. TITLE The nucleotide sequence and regulation of recA gene from Methylobacillus flagellatum JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by M.V.Gomelsky, 19-JUN-1990. Author address:M.V.Gomelsky Institute of Genetics and Section of Industrial Microorganisms 1st Dorozhry pr-d,1 Moscow 113545 U.S.S.R. FEATURES from to/span description pept 293 1327 RecA protein site 1 292 LexA repressor binding site BASE COUNT 320 a 413 c 386 g 230 t ORIGIN 1 aggagatggc cctgggcgcg ctggcctaca gcaccgccga cattagtgct tcctacacgg 61 gcatcgctgg gccgatggcg gaagcgagca aaaaccggtc ggtaccgtat gctttggctg 121 ggcccatcgc gatggccgga tcatcaccca aacccaccac ttcctgggcg agagacgccg 181 tgcgcagaca gtctgtcaaa accgcattaa ccggcttgtt gcagcttact ttacccactg 241 atttatgaaa taattgcagg ctttagcagg caactcgaaa aggtattcac tgatggatga 301 aaacagaagc aaagcgctag ccgccgccct ctcgcaaatc gagaaacaat ttggcaaagg 361 ctctatcatg cgcatgggcg acaccgatgt cgctgctgac atccaggcag tttctaccgg 421 ctccctgggc ctggacatcg ccctgggtat tggcggcttg ccgcgcggcc gcatcgtcga 481 gatttacggc ccggaatcct ccggcaagac cacgctgaca ttgtctgtga tcgcgcagat 541 gcaaaagctg ggcggcacgg cagcattcat cgacgcagaa cacgcgctcg acccggtcta 601 cgcgcagaag ctcggtgtca atgtttccga tctgctcatc tcccagcccg acaccggcga 661 acaggcgctc gagatcgccg acatgctggt gcgttccggc tcggtagatg tcgtggtggt 721 cgactccgtc gctgcgctga cgcccaaggc cgaaatcgaa ggcgaaatgg gcgactcgca 781 catgggcctg caggcccgcc tgatgtcgca ggcactgcgc aagctcaccg ccaacatcaa 841 gcgcaccaac accttggtca tcttcatcaa ccagatccgc atgaagatcg gcctgatgtt 901 cggcaacccc gagaccacca ccggcggcaa tgcactcaag ttctacgcct ccgtgcgcct 961 ggacatccgc cgcaccggcg cgatcaagaa gggggacgag gtcacgggat cagaaacccg 1021 ggtcaaggta gtgaagaaca acgtggcccc tccgttcaag ctagcggaat tcgacatcct 1081 ctatggcgaa ggtatctcgc gcgagggcga aatcatcgaa ctcggcgtca accttaaatt 1141 gatcgagaag gccggtgcct ggtacagcta caagggcgag aagatcggcc agggcaagga 1201 caacgcgcgc gagttcctca gggagcaccc ggaaatcgcc aatgagatcg acgccaagat 1261 tcgcgagcat tccaatctgg ccaacgccgc gatgacgacg gcaccggatg aagaaagcga 1321 cgaataagcc taccaccatg aatgcgaga // LOCUS CHKIGLAO 329 bp ds-DNA VRT 27-JUL-1990 DEFINITION Chicken Ig rearranged lambda-chain gene V1-J region, partial cds. ACCESSION M34804 KEYWORDS immunoglobulin light-chain; joining exon; lambda-immunoglobulin; processed gene; variable region. SOURCE Chicken (inbred strain B19) 3 month old bursa DNA, clone 2. ORGANISM Gallus gallus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Aves; Neornithes; Neognathae; Galliformes; Phasianidae. REFERENCE 1 (bases 1 to 329) AUTHORS Parvari,R., Ziv,E., Lantner,F., Heller,D. and Schechter,I. TITLE Somatic diversification of chicken immunoglobulin light chains by point mutations JOURNAL Proc. Natl. Acad. Sci. U.S.A. 87, 3072-3076 (1990) STANDARD simple staff_review FEATURES from to/span description pept / 34 > 329 Ig lambda-chain V1-J region, exon N precursor (AA at 36) sigp / 34 50 Ig lambda-chain V1-J region, exon N signal peptide (AA at 36) matp 51 > 329 Ig lambda-chain V1-J region, exon N IVS < 1 33 Ig lambda-chain V1-J region intron X BASE COUNT 65 a 107 c 82 g 75 t ORIGIN 1 ccgttttctc ccctctttcc tcctcctctc caggttccct ggtgcaggca gcactgactc 61 agccggcctc ggtgtcagca aacctgggag aaaccgtcga gatcacctgc tccgggggta 121 gtggcagcta cggctggttc cagcagaagt ctcctggcag tgcccctgtc actctgatct 181 atgctaacac caacagaccc tcagacatcc cttcacgatt ctccggttcc aaatccggct 241 ccacgggcac attaaccatc actggggtcc aagccgagga cgaggctgtc tatttctgtg 301 ggagcaggga cagcaactat gttggtata // LOCUS CHKIGLAP 332 bp ds-DNA VRT 27-JUL-1990 DEFINITION Chicken Ig rearranged lambda-chain gene V1-J region, partial cds. ACCESSION M34805 KEYWORDS immunoglobulin light-chain; joining exon; lambda-immunoglobulin; processed gene; variable region. SOURCE Chicken (inbred strain B19) 3 month old bursa DNA, clone 36. ORGANISM Gallus gallus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Aves; Neornithes; Neognathae; Galliformes; Phasianidae. REFERENCE 1 (bases 1 to 332) AUTHORS Parvari,R., Ziv,E., Lantner,F., Heller,D. and Schechter,I. TITLE Somatic diversification of chicken immunoglobulin light chains by point mutations JOURNAL Proc. Natl. Acad. Sci. U.S.A. 87, 3072-3076 (1990) STANDARD simple staff_review FEATURES from to/span description pept / 34 > 332 Ig lambda-chain V1-J region, exon N precursor (AA at 36) sigp / 34 50 Ig lambda-chain V1-J region, exon N signal peptide (AA at 36) matp 51 > 332 Ig lambda-chain V1-J region, exon N IVS < 1 33 Ig lambda-chain V1-J region intron X BASE COUNT 70 a 110 c 79 g 73 t ORIGIN 1 ccgttttctc ccctctctcc tcctcctctc caggttccct ggtgcaggca gcgctgactc 61 agccgtcctc ggtgtcagca aacccgggag aaaccgtcaa gatcacctgc tccgggagta 121 gtggcagcta ctatggctgg taccagcaga aggcacctgg cagtgcccct gtcactgtga 181 tctatgacaa caccaacaga ccctcggaca tcccttcacg attctccggt tccaaatccg 241 gctccacggc cacattaacc atcactgggg tccaagccga cgacgaggct gtctatttct 301 gtgggagtac agacagcagc tatattggta ta // LOCUS CHKIGLAQ 335 bp ds-DNA VRT 27-JUL-1990 DEFINITION Chicken Ig rearranged lambda-chain gene V1-J region, partial cds. ACCESSION M34806 KEYWORDS immunoglobulin light-chain; joining exon; lambda-immunoglobulin; processed gene; variable region. SOURCE Chicken (inbred strain B19) 3 month old bursa DNA, clone 24. ORGANISM Gallus gallus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Aves; Neornithes; Neognathae; Galliformes; Phasianidae. REFERENCE 1 (bases 1 to 335) AUTHORS Parvari,R., Ziv,E., Lantner,F., Heller,D. and Schechter,I. TITLE Somatic diversification of chicken immunoglobulin light chains by point mutations JOURNAL Proc. Natl. Acad. Sci. U.S.A. 87, 3072-3076 (1990) STANDARD simple staff_review FEATURES from to/span description pept / 34 > 335 Ig lambda-chain V1-J region, exon N precursor (AA at 36) sigp / 34 50 Ig lambda-chain V1-J region, exon N signal peptide (AA at 36) matp 51 > 335 Ig lambda-chain V1-J region, exon N IVS < 1 33 Ig lambda-chain V1-J region intron X BASE COUNT 71 a 110 c 77 g 77 t ORIGIN 1 ccgttttctc ccctctctcc tcctcctctc caggttccct ggtgcaggca gcgctgactc 61 agccggcctc ggtgtcagca aacctgggag aaaccgtcaa gatcacctgc tcagggagta 121 gcaatactta ttactattat ggctggttcc agcagaagtc tcctggcagt gcccctgtca 181 ctgtgatcta taacaacgac aagagaccct cggacatccc ttcacgattc tccggtgccc 241 tatccggctc cacagccaca ttaaccatca ctggggtcca agccgacgac gaggctgtct 301 atttctgtgg gagcaacgac agcagtactg ctgca // LOCUS CHKIGLAR 330 bp ds-DNA VRT 27-JUL-1990 DEFINITION Chicken Ig rearranged lambda-chain gene V1-J region, partial cds. ACCESSION M34807 KEYWORDS immunoglobulin light-chain; joining exon; lambda-immunoglobulin; processed gene; variable region. SOURCE Chicken (inbred strain B19) 3 month old spleen DNA, clone 1. ORGANISM Gallus gallus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Aves; Neornithes; Neognathae; Galliformes; Phasianidae. REFERENCE 1 (bases 1 to 330) AUTHORS Parvari,R., Ziv,E., Lantner,F., Heller,D. and Schechter,I. TITLE Somatic diversification of chicken immunoglobulin light chains by point mutations JOURNAL Proc. Natl. Acad. Sci. U.S.A. 87, 3072-3076 (1990) STANDARD simple staff_review FEATURES from to/span description pept / 29 > 330 Ig lambda-chain V1-J region, exon N precursor (AA at 31) sigp / 29 45 Ig lambda-chain V1-J region, exon N signal peptide (AA at 31) matp 46 > 330 Ig lambda-chain V1-J region, exon N IVS < 1 28 Ig lambda-chain V1-J region intron X BASE COUNT 71 a 101 c 80 g 78 t ORIGIN 1 ccccattttt tctcctctcc ttctccaggt tccctggtgc aggcagcgct gactcagccg 61 gcctcggtgt cagcaaacct gggagaaacc gtcgagatca cctgctccgg gggtagcagc 121 gactatggct ggtaccagca gaagtcacct ggcagtgccc ctctcactgt gatctataat 181 ggcaataaca gaccctcaaa catcccttca cgattctccg gttccctatc cggctccaca 241 aacacattaa ccatcactgg ggtccaagtc gaggacgagg ctgtctattt ctgtgggagt 301 gcagacagca gtgctggtta tgctgatata // LOCUS CHKIGLAS 332 bp ds-DNA VRT 27-JUL-1990 DEFINITION Chicken Ig rearranged lambda-chain gene V1-J region, partial cds. ACCESSION M34808 KEYWORDS immunoglobulin light-chain; joining exon; lambda-immunoglobulin; processed gene; variable region. SOURCE Chicken (inbred strain B19) 3 month old spleen DNA, clone 126. ORGANISM Gallus gallus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Aves; Neornithes; Neognathae; Galliformes; Phasianidae. REFERENCE 1 (bases 1 to 332) AUTHORS Parvari,R., Ziv,E., Lantner,F., Heller,D. and Schechter,I. TITLE Somatic diversification of chicken immunoglobulin light chains by point mutations JOURNAL Proc. Natl. Acad. Sci. U.S.A. 87, 3072-3076 (1990) STANDARD simple staff_review FEATURES from to/span description pept / 34 > 332 Ig lambda-chain V1-J region, exon N precursor (AA at 36) sigp / 34 50 Ig lambda-chain V1-J region, exon N signal peptide (AA at 36) matp 51 > 332 Ig lambda-chain V1-J region, exon N IVS < 1 33 Ig lambda-chain V1-J region intron X BASE COUNT 70 a 106 c 79 g 77 t ORIGIN 1 ccgttttctc ccctctctcc tctccctctc caggttccct ggtgcaggca gcgctgactc 61 agccgtcctc ggtgtcagca aacccgggag aaaccgtcaa gatcacctgc tccgggggtt 121 acagcgacta tggctggttc cagcagaagt ctcctggcag tgcccctgtc actctgatct 181 atgctaacac caacagaccc tcgaacatcc cttcacgatt ctccggttcc aaatctggct 241 ccacaaacac attaaccatc actggggtcc aagccgagga cgaggctgtc tattactgtg 301 ggagctggga tagcagtggt aatagtggta ta // LOCUS CHKIGLAT 108 bp ds-DNA VRT 27-JUL-1990 DEFINITION Chicken Ig rearranged lambda-chain gene V1-J region, partial cds. ACCESSION M34809 KEYWORDS immunoglobulin light-chain; joining exon; lambda-immunoglobulin; processed gene; variable region. SOURCE Chicken (inbred strain B19) 3 month old spleen DNA, clone 34. ORGANISM Gallus gallus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Aves; Neornithes; Neognathae; Galliformes; Phasianidae. REFERENCE 1 (bases 1 to 108) AUTHORS Parvari,R., Ziv,E., Lantner,F., Heller,D. and Schechter,I. TITLE Somatic diversification of chicken immunoglobulin light chains by point mutations JOURNAL Proc. Natl. Acad. Sci. U.S.A. 87, 3072-3076 (1990) STANDARD simple staff_review FEATURES from to/span description pept < 1 > 108 Ig lambda-chain V1-J region, exon N (AA at 1) BASE COUNT 22 a 27 c 35 g 24 t ORIGIN 1 ggatccggct ccacgggcac attaaccatc actggggtcc gagccgagga cgaggctgtc 61 tattactgtg gtggctacga cggcagcagt gcttatgctg gtgatata // LOCUS CHKIGLAU 529 bp ds-DNA VRT 27-JUL-1990 DEFINITION Chicken Ig rearranged light-chain gene J region, partial cds. ACCESSION M34811 KEYWORDS immunoglobulin light-chain; joining exon; processed gene. SOURCE Chicken (inbred strain B19) 3 month old bursa DNA, clone 2. ORGANISM Gallus gallus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Aves; Neornithes; Neognathae; Galliformes; Phasianidae. REFERENCE 1 (bases 1 to 529) AUTHORS Parvari,R., Ziv,E., Lantner,F., Heller,D. and Schechter,I. TITLE Somatic diversification of chicken immunoglobulin light chains by point mutations JOURNAL Proc. Natl. Acad. Sci. U.S.A. 87, 3072-3076 (1990) STANDARD simple staff_review FEATURES from to/span description pept < 1 / 37 Ig light-chain J region, exon N (AA at 3) IVS 38 > 529 IgL intron X BASE COUNT 94 a 123 c 155 g 157 t ORIGIN 1 ggtatatttg gggccgggac aaccttgacc gtcctaggtg agtcgctgac ctcgtctcgg 61 tctttcttcc cccatcgtga aattgtgaca ttttgtcgat ttttggtgat ttgggggttt 121 ttcttggact tggcggcagg ctggggtctg ccacggcgca gggccgggca ctcagcgcgg 181 cagcctgggc tgagtcttgt ccccaccgag ccggagggct ccggtgtgcg ccatggagga 241 cttagggtta ttttgtcaat ggaaagttct taaaatttga ccagaaaatg tgcccgaggt 301 ctgtctctgc cacacaattt cagaaattgt gtctaggtcg atgagaagac agtttttgtc 361 tttgtcagga aattagttgt gagttgttag tccttccctc ttagtcctaa ggactaagac 421 ctttgtcccc ggtctggtct ctcactgggg actcttggct ccagtgccat ggggagccca 481 agtgtcactg acacagtgtc cttgggggtg aaattcgagt ttttcagct // LOCUS CHKIGLAV 529 bp ds-DNA VRT 27-JUL-1990 DEFINITION Chicken Ig rearranged light-chain gene J region, partial cds. ACCESSION M34812 KEYWORDS immunoglobulin light-chain; joining exon; processed gene. SOURCE Chicken (inbred strain B19) 3 month old spleen DNA, clone 1. ORGANISM Gallus gallus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Aves; Neornithes; Neognathae; Galliformes; Phasianidae. REFERENCE 1 (bases 1 to 529) AUTHORS Parvari,R., Ziv,E., Lantner,F., Heller,D. and Schechter,I. TITLE Somatic diversification of chicken immunoglobulin light chains by point mutations JOURNAL Proc. Natl. Acad. Sci. U.S.A. 87, 3072-3076 (1990) STANDARD simple staff_review FEATURES from to/span description pept < 1 / 37 Ig light-chain J region, exon N (AA at 3) IVS 38 > 529 IgL intron X BASE COUNT 95 a 124 c 154 g 156 t ORIGIN 1 gatatatttg gggccgggac aaccctgacc gtcctaggtg agtcgctgac ctcgtctcgg 61 tctttcttcc cccatcgtga aattgtgaca ttttgtcgat ttttggtgat ttgggggttt 121 ttcttggact tggcggcagg ctggggtctg ccacggcgca gggccgggca ctcagcgcgg 181 cagcctgggc tgagtcttgt ccccaccgag ccggagggct ccggtgtgcg ccatggagga 241 cttagggtta ttttgtcaat ggaaagttct taaaatttga ccagaaaatg tgcccgaggt 301 ctgtctctgc cacacaattt cagaaattgt gtctaggtcg atgagaagac agtttttgtc 361 tttgtcagga aattagttgt gagttgttag tccttccctc ttagtcctaa ggactaagac 421 ctttgtcccc ggtctggtct ctcactgggg actcttggct ccagtgccat ggggagccca 481 agtgtcactg acacagtgtc cttgggggtg aaattcgagt ttttcagct // LOCUS CHKIGLAW 529 bp ds-DNA VRT 27-JUL-1990 DEFINITION Chicken Ig rearranged light-chain gene J region, partial cds. ACCESSION M34813 KEYWORDS immunoglobulin light-chain; joining exon; processed gene. SOURCE Chicken (inbred strain B19) 3 month old spleen DNA, clone 126. ORGANISM Gallus gallus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Aves; Neornithes; Neognathae; Galliformes; Phasianidae. REFERENCE 1 (bases 1 to 529) AUTHORS Parvari,R., Ziv,E., Lantner,F., Heller,D. and Schechter,I. TITLE Somatic diversification of chicken immunoglobulin light chains by point mutations JOURNAL Proc. Natl. Acad. Sci. U.S.A. 87, 3072-3076 (1990) STANDARD simple staff_review FEATURES from to/span description pept < 1 / 37 Ig light-chain J region, exon N (AA at 3) IVS 38 > 529 IgL intron X BASE COUNT 94 a 123 c 155 g 155 t 2 others ORIGIN 1 ggtanntttg gggacgggac aaccctgacc gtcctaggtg agtcgctgac ctcgtctcgg 61 tctttcttcc cccatcgtga aattgtgaca ttttgtcgat ttttggtgat ttgggggttt 121 ttcttggact tggcggcagg ctggggtctg ccacggcgca gggccgggca ctcagcgcgg 181 cagcctgggc tgagtcttgt ccccaccgag ccggagggct ccggtgtgcg ccatggagga 241 cttagggtta ttttgtcaat ggaaagttct taaaatttga ccagaaaatg tgcccgaggt 301 ctgtctctgc cacacaattt cagaaattgt gtctaggtcg atgagaagac agtttttgtc 361 tttgtcagga aattagttgt gagttgttag tccttccctc ttagtcctaa ggactaagac 421 ctttgtcccc ggtctggtct ctcactgggg actcttggct ccagtgccat ggggagccca 481 agtgtcactg acacagtgtc cttgggggtg aaattcgagt ttttcagct // LOCUS CHKIGLAX 529 bp ds-DNA VRT 27-JUL-1990 DEFINITION Chicken Ig rearranged light-chain gene J region, partial cds. ACCESSION M34814 KEYWORDS immunoglobulin light-chain; joining exon; processed gene. SOURCE Chicken (inbred strain B19) 3 month old spleen DNA, clone 34. ORGANISM Gallus gallus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Aves; Neornithes; Neognathae; Galliformes; Phasianidae. REFERENCE 1 (bases 1 to 529) AUTHORS Parvari,R., Ziv,E., Lantner,F., Heller,D. and Schechter,I. TITLE Somatic diversification of chicken immunoglobulin light chains by point mutations JOURNAL Proc. Natl. Acad. Sci. U.S.A. 87, 3072-3076 (1990) STANDARD simple staff_review FEATURES from to/span description pept < 1 / 37 Ig light-chain J region, exon N (AA at 3) IVS 38 > 529 IgL intron X BASE COUNT 97 a 126 c 152 g 154 t ORIGIN 1 gatatattag gggccgggac aaccctgacc gtcctaggtg agtcactgac ctcgtctcgg 61 tctttcttcc cccatcgtga aattgtgaca ttttgtcgat tcttggtgat ttgggggttt 121 ttcttggact tggcggcagg ctggggtctg ccacggcgca gggccgggca ctcagcgcgg 181 cagcctgggc tgagtcttgt ccccaccgag ccggagggct ccggtgtgcg ccatggagga 241 cttagggtta ttttgtcaat ggaaagttct taaaatttga ccagaaaatg tgcccgaggt 301 ctgtctctgc cacacaattt cagaaattgt gtctaggtcg atgagaagac agtttttgtc 361 tttgtcagga aattacttgt gagttgttag tccttccctc ttagtcctaa ggactaagac 421 ctttgtcccc ggtctggtct ctcactgggg actcttggct ccagtgccat ggggagccca 481 agtgtcactg acacagtgtc cttgggggtg aaattcgagt ttttcagct // LOCUS CHKIGLJA 529 bp ds-DNA VRT 27-JUL-1990 DEFINITION Chicken Ig germline light-chain gene J region, partial cds. ACCESSION M34810 KEYWORDS germline; immunoglobulin light-chain; joining exon. SOURCE Chicken (inbred strain B19) 3 month old bursa DNA, clone B19-Eco. ORGANISM Gallus gallus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Aves; Neornithes; Neognathae; Galliformes; Phasianidae. REFERENCE 1 (bases 1 to 529) AUTHORS Parvari,R., Ziv,E., Lantner,F., Heller,D. and Schechter,I. TITLE Somatic diversification of chicken immunoglobulin light chains by point mutations JOURNAL Proc. Natl. Acad. Sci. U.S.A. 87, 3072-3076 (1990) STANDARD simple staff_review FEATURES from to/span description pept < 1 / 37 Ig light-chain J region, exon N (AA at 3) IVS 38 > 529 IgL intron X BASE COUNT 94 a 124 c 155 g 156 t ORIGIN 1 ggtatatttg gggccgggac aaccctgacc gtcctaggtg agtcgctgac ctcgtctcgg 61 tctttcttcc cccatcgtga aattgtgaca ttttgtcgat ttttggtgat ttgggggttt 121 ttcttggact tggcggcagg ctggggtctg ccacggcgca gggccgggca ctcagcgcgg 181 cagcctgggc tgagtcttgt ccccaccgag ccggagggct ccggtgtgcg ccatggagga 241 cttagggtta ttttgtcaat ggaaagttct taaaatttga ccagaaaatg tgcccgaggt 301 ctgtctctgc cacacaattt cagaaattgt gtctaggtcg atgagaagac agtttttgtc 361 tttgtcagga aattagttgt gagttgttag tccttccctc ttagtcctaa ggactaagac 421 ctttgtcccc ggtctggtct ctcactgggg actcttggct ccagtgccat ggggagccca 481 agtgtcactg acacagtgtc cttgggggtg aaattcgagt ttttcagct // LOCUS CHKIGLVE 335 bp ds-DNA VRT 27-JUL-1990 DEFINITION Chicken Ig germline lambda-chain gene V1-J region, partial cds. ACCESSION M34803 KEYWORDS germline; immunoglobulin light-chain; joining exon; lambda-immunoglobulin; variable region. SOURCE Chicken (inbred strain B19) 3 month old bursa DNA, clone B19/g1. ORGANISM Gallus gallus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Aves; Neornithes; Neognathae; Galliformes; Phasianidae. REFERENCE 1 (bases 1 to 335) AUTHORS Parvari,R., Ziv,E., Lantner,F., Heller,D. and Schechter,I. TITLE Somatic diversification of chicken immunoglobulin light chains by point mutations JOURNAL Proc. Natl. Acad. Sci. U.S.A. 87, 3072-3076 (1990) STANDARD simple staff_review FEATURES from to/span description pept / 34 > 335 Ig lambda-chain V1-J region, exon N precursor (AA at 36) sigp / 34 50 Ig lambda-chain V1-J region, exon N signal peptide (AA at 36) matp 51 > 335 Ig lambda-chain V1-J region, exon N IVS < 1 33 Ig lambda-chain V1-J region intron X BASE COUNT 70 a 111 c 82 g 72 t ORIGIN 1 ccgttttctc ccctctctcc tcctcctctc caggttccct ggtgcaggca gcgctgactc 61 agccgtcctc ggtgtcagca aacccgggag aaaccgtcaa gatcacctgc tccggggata 121 ggagctacta tggctggtac cagcagaagg cacctggcag tgcccctgtc actgtgatct 181 atgctaacac caacagaccc tcggacatcc cttcacgatt ctccggttcc aaatccggct 241 ccacagccac attaaccatc actggggtcc aagccgacga cgaggctgtc tattactgtg 301 ggagtgcaga cagcagcagt actgctggtg gtata // LOCUS CH1L1CG 7502 bp ds-DNA circular BCT 27-JUL-1990 DEFINITION Plasmid pCHL1 (from C.trachomatis) virulence proteins pGP1-D, pGP2-D, pGP3-D, pGP4-D, pGP5-D, pGP6-D, pGP7-D, and pGP8-D, complete cds. ACCESSION J03321 KEYWORDS complete genome; virulence protein. SOURCE Plasmid pCHL1 (from Chlamydia trachomatis trachoma biovar, serotype D) DNA. ORGANISM Plasmid pCHL1 Prokaryota; Bacteria. REFERENCE 1 (bases 1 to 7502) AUTHORS Comanducci,M., Ricci,S., Cevenini,R. and Ratti,G. TITLE Diversity of the Chlamydia trachomatis common plasmid in biovars with different pathogenicity JOURNAL Plasmid 23, 149-154 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by G.Ratti, 28-MAR-1990. FEATURES from to/span description pept 1480 488 (c) pGP8-D pept 1579 2934 pGP1-D pept 2928 3992 pGP2-D pept 4054 4848 pGP3-D pept 4918 5226 pGP4-D pept 5317 6048 pGP5-D (gtg start codon) pept 6045 6788 pGP6-D (gtg start codon) pept 7022 7502 pGP7-D 1 437 pGP7-D rpt 6857 6945 four tandem 22bp repeats BASE COUNT 2460 a 1285 c 1433 g 2324 t ORIGIN 1 ggatccgtaa gttagacgaa attttgtctt tgcgcacaga cgatctattt tttgcatcca 61 atcagatttc ctttcgcatt aaaaaaagac agaataaaga aaccaaaatt ctaatcacat 121 ttcctatcag cttaatggaa gagttgcaaa aatacacttg tgggagaaat gggagagtat 181 ttgtttctaa aatagggatt cctgtaacaa caagtcaggt tgcgcataat tttaggcttg 241 cagagttcca tagtgctatg aaaataaaaa ttactcccag agtacttcgt gcaagcgctt 301 tgattcattt aaagcaaata ggattaaaag atgaggaaat catgcgtatt tcctgtcttt 361 catcgagaca aagtgtgtgt tcttattgtt ctggggaaga ggtaattcct ctagtacaaa 421 cacccacaat attgtgatat aattaaaatt atattcatat tctgttgcca gaaaaaacac 481 ctttaggcta tattagagcc atcttctttg aagcgttgtc ttctcgagaa gatttatcgt 541 acgcaaatat catctttgcg gttgcgtgtc ctgtgacctt cattatgtcg gagtctgagc 601 accctaggcg tttgtactcc gtcacagcgg ttgctcgaag cacgtgcggg gttattttaa 661 aagggattgc agcttgtagt cctgcttgag agaacgtgcg ggcgatttgc cttaacccca 721 ccatttttcc ggagcgagtt acgaagacaa aacctcttcg ttgaccgatg tactcttgta 781 gaaagtgcat aaacttctga ggataagtta taataatcct cttttctgtc tgacggttct 841 taagctggga gaaagaaatg gtagcttgtt ggaaacaaat ctgactaatc tccaagctta 901 agacttcaga ggagcgttta cctccttgga gcattgtctg ggcgatcaac caatcccggg 961 cattgatttt ttttagctct tttaggaagg atgctgtttg caaactgttc atcgcatccg 1021 tttttactat ttccctggtt ttaaaaaatg ttcgactatt ttcttgttta gaaggttgcg 1081 ctatagcgac tattccttga gtcatcctgt ttaggaatct tgttaaggaa atatagcttg 1141 ctgctcgaac ttgtttagta ccttcggtcc aagaagtctt ggcagaggaa acttttttaa 1201 tcgcatctag gattagatta tgatttaaaa gggaaaactc ttgcagattc atatccaagg 1261 acaatagacc aatcttttct aaagacaaaa aagatcctcg atatgatcta caagtatgtt 1321 tgttgagtga tgcggtccaa tgcataataa cttcgaataa ggagaagctt ttcatgcgtt 1381 tccaatagga ttcttggcga atttttaaaa cttcctgata agacttttca ctatattcta 1441 acgacatttc ttgctgcaaa gataaaatcc ctttacccat gaaatccctc gtgatataac 1501 ctatccgtaa aatgtcctga ttagtgaaat aatcaggttg ttaacaggat agcacgctcg 1561 gtattttttt atataaacat gaaaactcgt tccgaaatag aaaatcgcat gcaagatatc 1621 gagtatgcgt tgttaggtaa agctctgata tttgaagact ctactgagta tattctgagg 1681 cagcttgcta attatgagtt taagtgttct catcataaaa acatattcat agtatttaaa 1741 cacttaaaag acaatggatt acctataact gtagactcgg cttgggaaga gcttttgcgg 1801 cgtcgtatca aagatatgga caaatcgtat ctcgggttaa tgttgcatga tgctttatca 1861 aatgacaagc ttagatccgt ttctcatacg gttttcctcg atgatttgag cgtgtgtagc 1921 gctgaagaaa atttgagtaa tttcattttc cgctcgttta atgagtacaa tgaaaatcca 1981 ttgcgtagat ctccgtttct attgcttgag cgtataaagg gaaggcttga tagtgctata 2041 gcaaagactt tttctattcg cagcgctaga ggccggtcta tttatgatat attctcacag 2101 tcagaaattg gagtgctggc tcgtataaaa aaaagacgag tagcgttctc tgagaatcaa 2161 aattctttct ttgatggctt cccaacagga tacaaggata ttgatgataa aggagttatc 2221 ttagctaaag gtaatttcgt gattatagca gctagaccat ctatagggaa aacagcttta 2281 gctatagaca tggcgataaa tcttgcggtt actcaacagc gtagagttgg tttcctatct 2341 ctagaaatga gcgcaggtca aattgttgag cggattattg ctaatttaac aggaatatct 2401 ggtgaaaaat tacaaagagg ggatctctct aaagaagaat tattccgagt agaagaagct 2461 ggagaaacgg ttagagaatc acatttttat atctgcagtg atagtcagta taagcttaac 2521 ttaatcgcga atcagatccg gttgctgaga aaagaagatc gagtagacgt aatatttatc 2581 gattacttgc agttgatcaa ctcatcggtt ggagaaaatc gtcaaaatga aatagcagat 2641 atatctagaa ccttaagagg tttagcctca gagctaaaca ttcctatagt ttgtttatcc 2701 caactatcta gaaaagttga ggatagagca aataaagttc ccatgctttc agatttgcga 2761 gacagcggtc aaatagagca agacgcagat gtgattttgt ttatcaatag gaaggaatcg 2821 tcttctaatt gtgagataac tgttgggaaa aatagacatg gatcggtttt ctcttcggta 2881 ttacatttcg atccaaaaat tagtaaattc tccgctatta aaaaagtatg gtaaattata 2941 gtaactgcca cttcatcaaa agtcctatcc accttgaaaa tcagaagttt ggaagaagac 3001 ctggtcaatc tattaagata tctcccaaat tggctcaaaa tgggatggta gaagttatag 3061 gtcttgattt tctttcatct cattaccatg cattagcagc tatccaaaga ttactgaccg 3121 caacgaatta caaggggaac acaaaagggg ttgttttatc cagagaatca aatagttttc 3181 aatttgaagg atggatacca agaatccgtt ttacaaaaac tgaattctta gaggcttatg 3241 gagttaagcg gtataaaaca tccagaaata agtatgagtt tagtggaaaa gaagctgaaa 3301 ctgctttaga agccttatac catttaggac atcaaccgtt tttaatagtg gcaactagaa 3361 ctcgatggac taatggaaca caaatagtag accgttacca aactctttct ccgatcatta 3421 ggatttacga aggatgggaa ggtttaactg acgaagaaaa tatagatata gacttaacac 3481 cttttaattc accacctaca cggaaacata aagggttcgt tgtagagcca tgtcctatct 3541 tggtagatca aatagaatcc tactttgtaa tcaagcctgc aaatgtatac caagaaataa 3601 aaatgcgttt cccaaatgca tcaaagtatg cttacacatt tatcgactgg gtgattacag 3661 cagctgcgaa aaagagacga aaattaacta aggataattc ttggccagaa aacttgttat 3721 taaacgttaa cgttaaaagt cttgcatata ttttaaggat gaatcggtac atctgtacaa 3781 ggaactggaa aaaaatcgag ttagctatcg ataaatgtat agaaatcgcc attcagcttg 3841 gctggttatc tagaagaaaa cgcattgaat ttctggattc ttctaaactc tctaaaaaag 3901 aaattctata tctaaataaa gagcgctttg aagaaataac taagaaatct aaagaacaaa 3961 tggaacaatt agaacaagaa tctattaatt aatagcaagc ttgaaactaa aaacctaatt 4021 tatttaaagc tcaaaataaa aaagagtttt aaaatgggaa attctggttt ttatttgtat 4081 aacactgaaa actgcgtctt tgctgataat atcaaagttg ggcaaatgac agagccgctc 4141 aaggaccagc aaataatcct tgggacaaca tcaacacctg tcgcagccaa aatgacagct 4201 tctgatggaa tatctttaac agtctccaat aattcatcaa ccaatgcttc tattacaatt 4261 ggtttggatg cggaaaaagc ttaccagctt attctagaaa agttgggaga tcaaattctt 4321 gatggaattg ctgatactat tgttgatagt acagtccaag atattttaga caaaatcaaa 4381 acagaccctt ctctaggttt gttgaaagct tttaacaact ttccaatcac taataaaatt 4441 caatgcaacg ggttattcac tcccagtaac attgaaactt tattaggagg aactgaaata 4501 ggaaaattca cagtcacacc caaaagctct gggagcatgt tcttagtctc agcagatatt 4561 attgcatcaa gaatggaagg cggcgttgtt ctagctttgg tacgagaagg tgattctaag 4621 ccctgcgcga ttagttatgg atactcatca ggcattccta atttatgtag tctaagaacc 4681 agtattacta atacaggatt gactccgaca acgtattcat tacgtgtagg cggtttagaa 4741 agcggtgtgg tatgggttaa tgccctttct aatggcaatg atattttagg aataacaaat 4801 acttctaatg tatctttttt agaggtaata cctcaaacaa acgcttaaac aatttttatt 4861 ggatttttct tataggtttt atatttagag aaaacagttc gaattacggg gtttgttatg 4921 caaaataaaa gaaaagtgag ggacgatttt attaaaattg ttaaagatgt gaaaaaagat 4981 ttccccgaat tagacctaaa aatacgagta aacaaggaaa aagtaacttt cttaaattct 5041 cccttagaac tctaccataa aagtgtctca ctaattctag gactgcttca acaaatagaa 5101 aactctttag gattattccc agactctcct gttcttgaaa aattagagga taacagttta 5161 aagctaaaaa aggctttgat tatgcttatc ttgtctagaa aagacatgtt ttccaaggct 5221 gaatagacaa cttactctaa cgttggagtt gatttgcaca ccttagtttt ttgctctttt 5281 aagggaggaa ctggaaaaac aacactttct ctaaacgtgg gatgcaactt ggcccaattt 5341 ttagggaaaa aagtgttact tgctgaccta gacccgcaat ccaatttatc ttctggattg 5401 ggggctagtg tcagaagtga ccaaaaaggc ttgcacgaca tagtatacac atcaaacgat 5461 ttaaaatcaa tcatttgcga aacaaaaaaa gatagtgtgg acctaattcc tgcatcattt 5521 tcatccgaac agtttagaga attggatatt catagaggac ctagtaacaa cttaaagtta 5581 tttctgaatg agtactgcgc tcctttttat gacatctgca taatagacac tccacctagc 5641 ctaggagggt taacgaaaga agcttttgtt gcaggagaca aattaattgc ttgtttaact 5701 ccagaacctt tttctattct agggttacaa aagatacgtg aattcttaag ttcggtcgga 5761 aaacctgaag aagaacacat tcttggaata gctttgtctt tttgggatga tcgtaactcg 5821 actaaccaaa tgtatataga cattatcgag tctatttaca aaaacaagct tttttcaaca 5881 aaaattcgtc gagatatttc tctcagccgt tctcttctta aagaagattc tgtagctaat 5941 gtctatccaa attctagggc cgcagaagat attctgaagt taacgcatga aatagcaaat 6001 attttgcata tcgaatatga acgagattac tctcagagga caacgtgaac aaactaaaaa 6061 aagaagcgga tgtctttttt aaaaaaaatc aaactgccgc ttctctagat tttaagaaga 6121 cgcttccctc cattgaacta ttctcagcaa ctttgaattc tgaggaaagt cagagtttgg 6181 atcgattatt tttatcagag tcccaaaact attcggatga agaattttat caagaagaca 6241 tcctagcggt aaaactgctt actggtcaga taaaatccat acagaagcaa cacgtacttc 6301 ttttaggaga aaaaatctat aatgctagaa aaatcctgag taaggatcac ttctcctcaa 6361 caactttttc atcttggata gagttagttt ttagaactaa gtcttctgct tacaatgctc 6421 ttgcatatta cgagcttttt ataaacctcc ccaaccaaac tctacaaaaa gagtttcaat 6481 cgatccccta taaatccgca tatattttgg ccgctagaaa aggcgattta aaaaccaagg 6541 tcgatgtgat agggaaagta tgtggaatgt cgaactcatc ggcgataagg gtgttggatc 6601 aatttcttcc ttcatctaga aacaaagacg ttagagaaac gatagataag tctgattcag 6661 agaagaatcg ccaattatct gatttcttaa tagagatact tcgcatcatg tgttccggag 6721 tttctttgtc ctcctataac gaaaatcttc tacaacagct ttttgaactt tttaagcaaa 6781 agagctgatc ctccgtcagc tcatatatat atatctatta tatatatata tttagggatt 6841 tgatttcacg agagagattt gcaactcttg gtggtagact ttgcaactct tggtggtaga 6901 ctttgcaact cttggtggta gactttgcaa ctcttggtgg tagacttggt cataatggac 6961 ttttgttaaa aaatttatta aaatcttaga gctccgattt tgaatagctt tggttaagaa 7021 aatgggctcg atggctttcc ataaaagtag attgttttta acttttgggg acgcgtcgga 7081 aatttggtta tctactttat cttatctaac tagaaaaaat tatgcgtctg ggattaactt 7141 tcttgtttct ttagagattc tggatttatc ggaaaccttg ataaaggcta tttctcttga 7201 ccacagcgaa tctttgttta aaatcaagtc tctagatgtt tttaatggaa aagttgtttc 7261 agaggcatct aaacaggcta gagcggcatg ctacatatct ttcacaaagt ttttgtatag 7321 attgaccaag ggatatatta aacccgctat tccattgaaa gattttggaa acactacatt 7381 ttttaaaatc cgagacaaaa tcaaaacaga atcgatttct aagcaggaat ggacagtttt 7441 ttttgaagcg ctccggatag tgaattatag agactattta atcggtaaat tgattgtaca 7501 ag // LOCUS MUSMIP1X 638 bp ss-mRNA ROD 27-JUL-1990 DEFINITION Mouse macrophage inflammatory protein 1-beta (MIP-1) mRNA, complete cds. ACCESSION M35590 KEYWORDS macrophage inflammatory protein 1-beta. SOURCE Mouse macrophage cell line RAW 264.7, cDNA to mRNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 638) AUTHORS Sherry,B., Tekamp-Olson,P., Gallegos,C., Bauer,D., Davatelis,G., Wolpe,S.D., Masiarz,F., Coit,D. and Cerami,A. TITLE Resolution of the two components of macrophage inflammatory protein 1, and cloning and characterization of one of those components, macrophage inflammatory protein 1-beta JOURNAL J. Exp. Med. 168, 2251-2259 (1988) STANDARD simple staff_review FEATURES from to/span description pept 63 341 macrophage inflammatory protein 1-beta (MIP-1) precursor sigp 63 131 macrophage inflammatory protein 1-beta signal peptide matp 132 338 macrophage inflammatory protein 1-beta mRNA < 1 638 macrophage inflammatory protein 1 mRNA BASE COUNT 125 a 175 c 139 g 199 t ORIGIN 1 gcttctgaag cttctgggcc ctgcagtccc agctctgtgc aaacctaacc ccgagcaaca 61 ccatgaagct ctgcgtgtct gccctctctc tcctcttgct cgtggctgcc ttctgtgctc 121 cagggttctc agcaccaatg ggctctgacc ctcccacttc ctgctgtttc tcttacacct 181 cccggcagct tcacagaagc tttgtgatgg attactatga gaccagcagt ctttgctcca 241 agccagctgt ggtattcctg accaaaagag gcagacagat ctgtcctaac cccagtcagc 301 cctgggtcac tgagtacatg agtcacttgg agttgaactg agcagctcca gcggcagggc 361 aggaggagcc acttcaggag aggcctcctc agccctgatg cttctcactg agaagcgtcc 421 ttgctcctca cgttcagatt tcctgcccct cttcttaatt taaatctctg tgtagacttt 481 gttttgtttt tttgggggag tattatttct attatttatg ttttagttat aggacgcgtc 541 tctcccatgg agatggtcca ccattgctgt ttctctgcta ttgtggatat gactgtgaaa 601 ttgatttcat gcattttcat aataaatctt tctttaag // LOCUS DENAAA 699 bp ss-RNA VRL 27-JUL-1990 DEFINITION Dengue virus type 2 RNA. ACCESSION M36806 KEYWORDS . SOURCE Dengue virus type 2, cDNA to viral RNA. ORGANISM Dengue virus type 2 Viridae; ss-RNA enveloped viruses; Positive strand RNA virus; Flaviviridae; Flavivirus (arbovirus group B). REFERENCE 1 (bases 1 to 699) AUTHORS Hahn,C.S., Hahn,Y.S., Rice,C.M., Lee,E., Dalgarno,L., Strauss,E.G. and Strauss,J.H. TITLE Conserved elements in the 3' untranslated region of flavivirus RNAs and potential cyclization sequences JOURNAL J. Mol. Biol. 198, 33-41 (1987) STANDARD simple staff_review BASE COUNT 170 a 160 c 183 g 186 t ORIGIN 1 cctccatcag agcatggctg tcttagggct actcttctgc ctagtgacat tcccaagctg 61 tgagtgtttc agggtttcag aagagggact aggacatgtc agataggaga tgtgtgactt 121 atgttgatgt tgcttgtccc caggtgtcct atcccaggtg cagctgaagc agtcaggacc 181 tggcctggtg cagccctcac agagcctgtc catcacctgc acagtctctg gtttctcatt 241 aactacctat ggtgtaaact gggttcgcca gtctccagga aagggtctgg agtggctggg 301 agtgatatgg agtggtggaa gcacagagta taatgcagct ttcatatcca gactgaacat 361 caccaaggac aattccaaga ccaagttttc tttaaaatga acagtctgca atctaatgac 421 acagccatat attactgtgc cagaagggac tacgttagta ggtacgaccc gtttgcttac 481 tggggccaag ggactctggt cactgtctct gcaggtgagt cctaacttct cccattctag 541 atgcatcttg gggggattct gagccttcag gaccaagatt ctctgcaaac gaatcaagat 601 tcaacccctt tgtaccaaaa ttgagacagg ggtctgggct agggactctc tgcctgctgg 661 tctgtggtgg cattagaaat gaagtatgat gaaggatct // LOCUS MVEAAA 558 bp ss-RNA VRL 27-JUL-1990 DEFINITION Murray Valley encephalitis virus RNA. ACCESSION M35172 KEYWORDS repetitive DNA. SOURCE Murray Valley encephalitis virus, cDNA to viral RNA. ORGANISM Murray Valley encephalitis virus Viridae; ss-RNA enveloped viruses; Positive strand RNA virus; Flaviviridae; Flavivirus (arbovirus group B). REFERENCE 1 (bases 1 to 558) AUTHORS Hahn,C.S., Hahn,Y.S., Rice,C.M., Lee,E., Dalgarno,L., Strauss,E.G. and Strauss,J.H. TITLE Conserved elements in the 3' untranslated region of flavivirus RNAs and potential cyclization sequences JOURNAL J. Mol. Biol. 198, 33-41 (1987) STANDARD simple staff_review FEATURES from to/span description ORF < 1 30 open reading frame rpt 184 211 repeat rpt 333 359 repeat BASE COUNT 170 a 122 c 162 g 104 t ORIGIN 1 actcatgtga gtgaagatag ggtcttgtaa ataacattga tagaaaattt tgtaaatatt 61 taatgtaata tagtataggt aaaatttttt gaaattaagt aaaattaagt agcaagactt 121 gatagtcagg ccagccggtt aggctgcacc cgaaggttgg tagacggtgc tgcctgcgac 181 caaccccagg aggactgggt taccaaagct gattctccac ggttggaaag cctcccagaa 241 ccgtctcgga agaggagtcc ctgccaacaa tggagatgaa gcccgtgtca gagccgaaag 301 cgccacttcg ccgaggagtg caatctgtga ggccccagga ggactgggta aacaaagccg 361 taaggccccc gcagcccggg ccgggaggag gtgatgcaaa ccccggcgaa ggactagagg 421 ttagaggaga ccctgcggaa gaaatgagtg gcccaagctc gccgaagctg taaggcgggt 481 ggacggacta gaggttagag gagaccccac tctcaaaagc atcaaacaac agcatattga 541 cacctgggaa aagactag // LOCUS ECOSTII 806 bp ds-DNA BCT 27-JUL-1990 DEFINITION E.coli heat-stable toxin II (STII) gene, complete cds. ACCESSION M35586 KEYWORDS heat-stable toxin II. SOURCE E.coli (strain K12/711) DNA, pCHL6. ORGANISM Escherichia coli Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Facultatively anaerobic rods; Enterobacteriaceae. REFERENCE 1 (bases 1 to 806) AUTHORS Lee,C.H., Moseley,S.L., Moon,H.W., Whipp,S.C., Gyles,C.L. and So,M. TITLE Characterization of the gene encoding heat-stable toxin II and preliminary molecular epidemiological studies of enterotoxigenic Escherichia coli heat-stable toxin II producers JOURNAL Infect. Immun. 42, 264-268 (1983) STANDARD simple staff_review FEATURES from to/span description pept 189 40 (c) ORF 1 pept 443 658 heat-stable toxin II (put.) BASE COUNT 287 a 141 c 135 g 243 t ORIGIN 1 aagcttagta ttgttgacat gaacagcaaa tcgacttaat caaaaaacta tcagagaacg 61 ctgcattatt gattttagga ctttcaatgc aaatatcatc gaggaacgta catcattgcc 121 cacagataca acggatagtt gtatgaaagc ataaagcaaa atcaagattg caaacacaga 181 agggggcatt cctccccctc tcttttgcac ttctttccta tatttttaca agaaaagaaa 241 atgattttat atgttgaata ataaaacaat caatatgtaa atacctacaa cgggtgattg 301 acactacact cattaactat actgcaagta gcattaaaaa tcttaataaa ggagagcttc 361 gtcacatttt tttgacttga ctcatataaa ggcccactgg tataagtttt attgcttata 421 gcaataaggt tgaggtgatt ttatgaaaaa gaatatcgca tttcttcttg catctatgtt 481 cgttttttct attgctacaa atgcctatgc atctacacaa tcaaataaaa aagatctgtg 541 tgaacattat agacaaatag ccaaggaaag ttgtaaaaaa ggttttttag gggttagaga 601 tggtactgct ggagcatgct ttggcgccca aataatggtt gcagcaaaag gatgctaata 661 tatttatcaa tagcattcag caccatatac acaaaaataa tttttcataa aaagaactct 721 ataaaataaa tattttttgt gacaatgtcc taacgcaaga cggacattgt ccatttctca 781 ctgcaggctg cggctgagcg ataccc // LOCUS TRNTETE 830 bp ds-DNA BCT 27-JUL-1990 DEFINITION Transposon Tn10 (from E.coli) tetracycline resistance E (tetE) gene, complete cds. ACCESSION M34933 KEYWORDS tetracycline resistance. SOURCE Transposon Tn10 (from E.coli) DNA, clone pSL1503. ORGANISM Transposon Tn10 Prokaryota; Bacteria. REFERENCE 1 (bases 1 to 830) AUTHORS Tovar,K., Ernst,A. and Hillen,W. TITLE Identification and nucleotide sequence of the class E tet regulatory elements and operator and inducer binding of the encoded purified Tet repressor JOURNAL Mol. Gen. Genet. 215, 76-80 (1988) STANDARD simple staff_review FEATURES from to/span description pept 152 787 tetracycline resistance determinant E BASE COUNT 244 a 179 c 187 g 220 t ORIGIN 1 agttaacgtt ctctatcgat gatagggttt gaaaaataac tctatcagtg atagattgtc 61 aacagcaagt atcaattgca agagatagct actatcccaa actttttatt gagatagtca 121 ctatctaaca gttgtccaaa aggagatagt gatggcacga ctaagcttgg acgacgtaat 181 ttcaatggcg ctcaccctgc tggacagcga agggctagag ggcttgacta cgcgtaagct 241 ggcgcagtcc ctaaaaattg agcaaccgac tctgtattgg cacctgcgca acaagcagac 301 tcttatgaac atgctttcag aggcaatact ggcgaagcat cacacccgtt cagcaccgtt 361 accgactgag agttggcagc agtttctcca ggaaaatgct ctgagtttcc gtaaagcatt 421 actggtccat cgtgatggag cccgattgca tatagggacc tctcctacgc ccccccagtt 481 tgaacaagca gaggcgcaac tacgctgtct atgcgatgca gggttttcgg tcgaggaggc 541 tcttttcatt ctgcaatcta tcagccattt tacgttgggt gcagtattag aggagcaagc 601 aacaaaccag atagaaaata atcatgtgat agacgctgca ccaccattat tacaagaggc 661 atttaatatt caggcgagaa cctctgctga aatggccttc catttcgggc tgaaatcatt 721 aatatttgga ttttctgcac agttagatga aaaaaagcat acacccattg aggatggtaa 781 taaatgatgc tatctatgtg tcaactctaa tttatagtta tggatagtgt // LOCUS CLONIFC 1191 bp ds-DNA BCT 27-JUL-1990 DEFINITION C.pasteurianum nitrogenase C (nifC) gene, complete cds. ACCESSION M34365 KEYWORDS nitrogen-fixation protein; nitrogenase C. SOURCE C.pasteurianum (strain W5) DNA. ORGANISM Clostridium pasteurianum Prokaryota; Bacteria; Firmicutes; Endospore-forming rods and cocci; Bacillaceae. REFERENCE 1 (bases 1 to 1191) AUTHORS Wang,S.-Z., Chen,J.-S. and Johnson,J.L. TITLE A nitrogen-fixation gene (nifC) in Clostridium pasteurianum with sequence similarity to chlJ of Escherichia coli JOURNAL Biochem. Biophys. Res. Commun. 169, 1122-1128 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by S.-Z.Wang, 18-MAY-1990. FEATURES from to/span description pept 331 1191 nitrogenase C (nifC) BASE COUNT 403 a 132 c 178 g 478 t ORIGIN 1 tttagtacat ctgaaatctg taattaagga gataataagt caaagtttaa tatatttgat 61 ttgttatttc cttaattttt tataaataac tgctatagta taactgtata aaatatggta 121 taatgtcgta gttgataata attatatttt tttatatatc attctgtgcc tataataatg 181 tatgaactaa tatttattga tacaataaat tataaaatta tataagaata tatagaatat 241 tatattgttt tatataagtt atagtatttt atattaaaca acttaattct aaatattttt 301 ttatgtattt aacaggaggt aatataatac atggaaaata ataaaaaaat attagagagt 361 tcaaaaaaac tttcttccta tggagatgga gagagcaggt tttcttttct tgaaaaaata 421 ttagctcctt tatttttagc tctaacggct atatattttg taatgcttat atttccaata 481 atttcaatga ttcggtattc aggtggttct cacattattc aaacccttta tgatcaagat 541 aatataaaga ctataatttt aagctttgta acttcactta tagcacttat atttactttt 601 atcataggaa ctccaacagc tttttgtatc aattttgtta gaaataaagt tttatctaaa 661 atattagata tatttgtaga aattcctgta gttttgcctc cagcagtagc tggtattgct 721 ttgctactag cttttggtaa aaatggtgta gtgggtaatt ttttatctaa tcatgggata 781 aatgttatat ttacctctac agctgtaata attgcacaat tttttgtatc ttctgctctt 841 tatgtaagag ttcttagaga ttctgttaaa tcagtaccta ttgaactttt tgaagtgagt 901 tatgtgctag gggcaggaaa aattgaaaca ataattaaaa ttatgattcc aatgcttaaa 961 aagtcaatcg tgtctggact tattttggct tggataagat ctcttggaga gtttggagcc 1021 actttgatgt ttgcaggaaa cattatagga aaaacaagaa ccataccact tcaaatctat 1081 acttatatgc aggatgatat taaaatggct actgcttttg ctacgatttt gtatataatg 1141 acttttgtat tgttattatt agttagactt agtattaggg atgatgatta g // LOCUS LACISTRAN 808 bp ds-DNA BCT 27-JUL-1990 DEFINITION L.lactis insertion sequence IS946 transposase gene, complete cds. ACCESSION M33868 KEYWORDS transposase. SOURCE L.lactis DNA. ORGANISM Lactococcus lactis Prokaryota; Bacteria; Firmicutes; Regular asporogenous rods; Lactobacillaceae. REFERENCE 1 (bases 1 to 808) AUTHORS Romero,D.A. and Klaenhammer,T.R. TITLE Characterization of gram-positive insertion sequence IS946, an Iso-ISS1 element, isolated from the conjugative Lactococcal plasmid pTR2030 JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by D.A.Romero, 26-APR-1990. Author address: D.A.Romero North Carolina State University Dept. of Food Science Box 7624, Raleigh, NC 27695 FEATURES from to/span description pept 74 754 transposase (put.) rpt 1 18 inverted repeat rpt 791 808 inverted repeat signal 6 35 promoter region binding 62 65 ribosomal binding site BASE COUNT 281 a 137 c 158 g 232 t ORIGIN 1 ggttctgttg caaagttttc tgataagtct attttagtgt aaaatgaata aaaatgacag 61 cgaggatata tcaatgaact attttaaagg taaacaattt caaaaagatg tgattattgt 121 cgctgttggt tactacctgc gttacaatct aagctatcgt gaagttcagg aattgttata 181 tgatcgtgga ataaatgttt gtcatactac gatttatcgt tgggtgcaag aatacagtaa 241 agtcctctat catctctgga aaaagaaaaa tagacagtcc ttctattcgt ggaaaatgga 301 cgaaacctat atcaaaatca aaggtcgttg gcattatctc tatcgtgcaa ttgatgcgga 361 tggcttaacc ttagatatct ggttacgaaa gaaacgggat acgcaagcag cctatgcttt 421 cttaaaacga ctccataaac agtttggtca accaagagta attgtcacgg ataaagcgcc 481 ctctattggt tctgcattta gaaagttaca gagtaacggt ttatatacta agacagagca 541 tcgaaccgtg aagtatctca ataacctcat tgagcaagac catcgaccaa tcaaacgacg 601 caataaattt tatcgaagtc tacgaactgc ctcaaccacg attaagggca tggaaacaat 661 tcgaggaata tacaaaaaga accgaagaaa tggaacgctc ttcggatttt cggtatctac 721 tgagattaag gtcttaatgg gaatattagc ttaagaacaa gaaggattat aaaccttgta 781 tttgattttt aaactttgca acagaacc // LOCUS MUSMB1 5629 bp ds-DNA ROD 27-JUL-1990 DEFINITION Murine B cell 1 (mb-1) gene, complete cds. ACCESSION M31773 KEYWORDS murine B cell 1. SOURCE Mouse (strain Balb/c) liver DNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 5629) AUTHORS Kashiwamura,S.-I., Koyama,T., Matsuo,T., Steinmetz,M., Kimoto,M. and Sakaguchi,N. TITLE Structure of the murine mb-1 gene encoding a putative sIgM-associated molecule JOURNAL J. Immunol. 145, 337-343 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by N.Sakaguchi, 01-FEB-1990. FEATURES from to/span description pept 277 343 B cell 1, exon 1 1823 2116 B cell 1, exon 2 2308 2426 B cell 1, exon 3 4450 4518 B cell 1, exon 4 4646 4759 B cell 1, exon 5 pre-msg 268 5072 B cell 1 mRNA and introns IVS 344 1822 mb-1 intron A IVS 2117 2307 mb-1 intron B IVS 2427 4449 mb-1 intron C IVS 4519 4645 mb-1 intron D BASE COUNT 1332 a 1515 c 1400 g 1382 t ORIGIN 1 gatcctttct cagggatcca tggtgatgaa ccaggccttg aaccaccctc tccccgaccc 61 cacgcactag agagagactc aagggaattg tggccaccca ggtgcagggc agttcctctc 121 cactcagagc ccacacatat ggcaaataaa gggccaggag taagggccac tccagcccat 181 ctccggcacg gctgaacagg aagtgaggcg gagtttggag tggaatctgg tacctcaagc 241 tacggccacc caaactaacc gactcactgg gagacgatgc cagggggtct agaagccctc 301 agagccctgc ctctcctcct cttcttgtca tacgcctgtt tgggtaatgt ggccaaaggg 361 aaggtgcggg tggggcgggg gtgttgcggc tgacaggaga ggtgcagagt ggagtcagaa 421 atggccaggc agggtgatgc ataacgtggc atctcttaaa gtaacgccag aactctagga 481 aactgaggag ggaggaacat gcatttgagg ccagctggcg tacataatga cttctatcta 541 ggccaacctc gcaagttact cattaaaact ctctcaaaaa gaaaaggagt gtcgggaggg 601 aatcagggaa agcctctctc agctagctca gacagctgag taagtccact ttaggaccaa 661 cactaccttg cccacgacac acgaccttag ctcacagtca gaatgagaac tgagtaagtg 721 tgactgaaca cacaccagtg agtgtagcaa tggtgtcggt gagggaacgg gttttattgt 781 tgttgttgtt gttgttgttg ttttttcctc tgccttacat ctggaggcag gcttgccatg 841 tgaccacata gacccaggtt tgcagtggag gtcttacgag actggggccc tggtctctgt 901 gtcttaagcc tgtgtgacag gaactggctg gggctctttc ccctcccttt ctttcttcct 961 cacttccgac cttcttcttt tctgtcatgt ccacaaaagg gacctacaag ccagtgtgga 1021 gctggggagg ctgagaacag ggtggggggt ggggggggcg ggtcaagata aatacaagag 1081 cagacaccat agccaccagt ccaccccgtc tcttagcctc ctccgatgcc ccagacagaa 1141 ctcagtccca gtgcctaggt taatactggt ttctctgttt tgtttttcga gacagggatt 1201 ctgtgtgcaa ctggaacgta ctctgttaga ccagcaggcc tcaaactccc agagatctgc 1261 ctgacttctc tctcctgagt gctgggatta aaggcatgta ccagtactgc ctggcagtcg 1321 ccacatttaa attaaattac agagatgaga aacagcctag gacaaaaaaa caaaaaaaca 1381 aacaaacaaa caaacaaaaa aaacaaaaac aaaaaaaaaa acaaggccaa gcgcagtaaa 1441 ggagagattc aggaaggtgg gaattgggct ctactatctt ttccaatagc ctggttactg 1501 ctgagctgag ttttaccgct gcatctaagg ttctgcccca ccttcacccc ctccccgtgg 1561 ggttcctaag agttgtaagg aatggggaac aaatgggatg gttttcatga aagaatccct 1621 ggggtttttg gggaacatca aaatacaaag gggaaaggga agaaagaggc catagtgtgc 1681 ccccccccaa gtctcccatc atctttttct agtacattac cattggtacc ggctccaatt 1741 ctgatgtgaa ggggtctggg ctagagcaat catctccatt ctccaggcta tgacccatct 1801 gtctcctctc ctctctccac aggtcccgga tgccaggccc tgcgggtaga agggggtcca 1861 ccatccctga cggtgaactt gggcgaggag gcccgcctca cctgtgaaaa caatggcagg 1921 aaccctaata tcacatggtg gttcagcctt cagtctaaca tcacatggcc cccagtgcca 1981 ctgggtcctg gccagggtac cacaggccag ctgttcttcc ccgaagtaaa caagaacaca 2041 ggggcttgta ctgggtgcca agtgatagaa aacaacatat taaaacgctc ctgtggtact 2101 tacctccgcg tgcgcagtga gtagggaggg cgctggcctc ttgcgttccc tgctccctct 2161 ttcttccaaa acattaggag cagacgtagc tcctccctcc tggacctgcc acccagccac 2221 agagactggt tgcgagggct ccctgacctc gcaagggtct aggcgagaag aagggacaca 2281 gaatgctgag cagcacctgt cttcaagatc cagtccctag gcccttcctg gacatggggg 2341 aaggtaccaa gaaccgcatc atcacagcag aagggatcat cttgctgttc tgtgcagtgg 2401 tgccagggac gctgctgcta ttcagggtga gtctccccca agtgacaccc ttcaagtcat 2461 cttggcttac ctgggcctgg actgtgactc ccagtgtcca gctggtcctc tgaattcgag 2521 ctcgcccgga gatcttgtac tgatgccaac tgatagaaaa caacatatta aaacgctcct 2581 tgtacttacc atcccgtgcg cagtgagtag gaggcgcctg gcctccttgc tttccctgct 2641 ccctctttct tcaaaacatt agagcaagct agctcctcct cctgacctcg acagcacaga 2701 gatggtggtt caggcgcgtc cgaccctcgc aagggtctgg gctgccagaa gaagggacac 2761 cagaatgctg agcagcaccc tgtcttcaca gatccagtcc ctaggccctt cctggacatg 2821 ggggaaggta ccaagaaccg catcatcaca gcagaaggga tcatcttgct gttctgtgca 2881 gtggtgccag ggacgctgct gctattcagg gtgagtctcc cccaagtgac acccctcaag 2941 tcatcttggc ttccctgggc ctggactgtg actcccagtg tccagctggt ccttgggtct 3001 tccgaattcc aaggtttgtc cttccaattt gggaggctgg gaataaccac caaaaggatt 3061 ccaggggggt cagctgtccc cacggccctc atagaaaccc tctcacttct ggccgccaac 3121 gggagtccct gcaaatgcag ccaagacaag ggaagaaggt gtgtcctccc agagcccctg 3181 cgtctcctta ggccagaaga caacattttt caccacattg gggcttcatt attgtacgcc 3241 tggctggcct tgaactcaca gagatctgcc tatttctgcc tcccaagtgc tgctgggatt 3301 aaagggccac catgcctgcc cttcttcctt tctttaactt tttcttttcc ccaccaccta 3361 accccccaag actgggtctc tctgtgtagc cctggtgtcc tagatataag gctggctcag 3421 actcacagag atccacctgc ctctgcctcc tgagtgctgg gattacagtt gtgcacccct 3481 atcgccctgc actttattta tgtcttagtg tgtacgtggg catgttgagt gtggaggtcg 3541 aagaacaact ttcaggagtt gattctttct ccactgtgga ttcgcacaca aattcatatc 3601 ctcaggtttg caggcataca ttaccagttg agctatcatc tgacttcttt gcctgagcta 3661 tcatctgact tctttgcttt catttttatt ttagtttatt ttattttatt tttttaaagc 3721 tagatattag tctagctagc ctcaaacttg ctacatcaca tggctggcct gattcctctt 3781 taccttccaa gcactgaaat tacagggcag tggcaccacc agttttattt tgtttgtttt 3841 aatttttgtt ttgtgttttg ttttgttttt gatggggagg gttagacagc attttgctat 3901 gtaattcagg ctgggctaga actcactcta tagatcatat tggtctggaa ttctcctgcc 3961 tcagtttgcc aagcactgga attacagatc tccagcacca ctcattccct gctcgccgtt 4021 ttgcttgtca ttgaggagca gtagagagtc agcatggtgg tgcacagctt tgaccctagc 4081 agcactcagg aggcagaggc aggcggatct ctcgggagcc aaggccagcc tggtccacgt 4141 actgagttcc caggtagttc ttgctagata gagatcctca gagcatcagt ggagtggcag 4201 ggaggtgcag cagttccact tgagacaggg agattgctta aagtttgagg cccataaaaa 4261 ggaaagagag gagaggggca gagactagaa tcttgacaag atacctgatg gtgggaatct 4321 agaccttgat ggtggggagc caagtatccc tcactctttc tgataaagca cttgctagta 4381 ggagacagtc taaaaaccct tcacctccct ccttgctaaa cgtcgcttct acttcatccc 4441 cttctgtaga aacggtggca aaatgagaag tttggggtgg acatgccaga tgactatgaa 4501 gatgaaaatc tctatgaggt aagcactgtt tgcagcaggc gtcgggtgca tggggtgtgg 4561 tgtctgcctt gagctctcac cctggagttg actggactgg ggctccagct gtcagggtac 4621 tgatgtctcc tctctgaccc tgcagggcct gaaccttgat gactgttcta tgtatgagga 4681 catctccagg ggactccagg gcacctacca ggatgtgggc aacctccaca ttggagatgc 4741 ccagctggaa aagccatgac tgacatgtcc cacccttccc tgcctgccat atgtctgact 4801 ccagcatccc tgcctcccca ctcttcccct ggagtccccc cttcccagtg cttgctccag 4861 ccactcgagt gcgctaccct tgcctccagg ctgccccacc ctattctccc acctaatctt 4921 gtccttcccc aagtccagct ccctctctag tgggtaatga gcccttaatc gctgcctcta 4981 ggggagctga ttacagcatc ctcgttagtg tcaccccccc tcctcccaga tctgtcatgg 5041 ccacttaagt gataataaat ccttcccaaa gcagccttgg caggagtcat ggatctcatg 5101 tagactgctc ctcaacactg ccagggaacc aggagctaga aaacccaagt cccctccccc 5161 ctcagaccca ggattctagg ttcaagttct cacctgacca cttctagatc tacctgacaa 5221 ctcttcgaac ggtacagata gactctgcgt atgtcttcga cttcccctca acccctcccc 5281 ccgaaaataa cggtcaaaga gctctcttgg ctaatcccca ccccaattgt aaatgtctgg 5341 tttatctctt atgggcctca ccctcctctg tcccaaaaca aggggaacca gcccggtgtt 5401 aacaaatttt tacctatatt taaactctga gaacacaaga tattattttc tggagtgggg 5461 cctaattcct catgggttca cttgcataag gcagatttgg agagtggctg tcaaaaaccg 5521 gggggagggg gatgacagag agagctagag tggtgtgaca gtcattggga tgggaagagc 5581 agactcaggt tgttaaaaga caggaggcca agaggagact cagtgggga // LOCUS FRNTUL4 1233 bp ds-DNA BCT 27-JUL-1990 DEFINITION F.tularensis 17 kDa major membrane protein (TUL4) and 13-kDa protein genes, complete cds. ACCESSION M32059 KEYWORDS 13-kDa protein; 17 kDa major membrane protein. SOURCE F.tularensis (strain live vaccine) DNA. ORGANISM Francisella tularensis Prokaryota; Bacteria; Gracilicutes; Scotobacteria. REFERENCE 1 (bases 1 to 1233) AUTHORS Sjoestedt,A., Sandstroem,G., Taernvik,A. and Jaurin,B. TITLE Nucleotide sequence and T cell epitopes of a membrane protein of Francisella tularensis JOURNAL J. Immunol. 145, 311-317 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence [1] kindly submitted by A.Sjoestedt, 12-FEB-1990. FEATURES from to/span description pept 99 470 13-kDa protein pept 551 1000 17 kDa major membrane protein (TUL4) precursor sigp 551 607 17 kDa major membrane protein (TUL4) signal peptide matp 608 997 17 kDa major membrane protein (TUL4) binding 85 97 ribosome binding site binding 539 548 ribosome binding site BASE COUNT 440 a 171 c 204 g 418 t ORIGIN 1 gaattcatag tcgcctcaaa atatgtttag atatgttgtt aataagttat aatcttatat 61 atagttaact atgttattat aatcaaggtt tttttgctat gatgacttta caagaagaaa 121 aaatacaagc gccagtattt tttaaagaat atgtcaaagg taggtttata cttaatatag 181 gtgagtataa tcatccattg attttatcag caactcaagt gcttgagtat caagataaaa 241 ttgatgatat ccaaagtatc aaaaaaagtc atcttgatct tatcttagcg actaatcctg 301 aaataatact cataggtacc ggtgaaaaac aacttttgcc tccacttgag ataattaatc 361 aaatcgcaaa agctggtaaa agtgttgatt ttatggcgag tgatactgct tgtaagacat 421 ataatttgct tgttaatgaa aatcgtaatg ttagctgtat catcatttaa taaactgctg 481 tttattttat tttaattaat gttataatcg atttgagtat atgtgaatat ttaaaaatag 541 gagtatctat atgaaaaaaa taattaagct tagtctttta tctttatcaa tcgcaggttt 601 agcgagctgt tctactctag ggttaggtgg ctctgatgat gcaaaagctt cagctaaaga 661 tactgctgct gctcagacag ctactactga gcaagctgct gctgtatcta agccaactgc 721 aaaagtaagt ttaaataaac ttggtcagga taaaataaaa gcaactgtat atacagcata 781 caataataac ccacaaggaa gtgtaagatt acaatggcag gctccagaag gttctaagtg 841 ccatgataca agcttcccaa ttactaagta tgctgagaag aacgataaaa cttgggcaac 901 tgtaacagtt aagcaaggta ataacttctg tagcggtaag tggacagcta atgtagttta 961 tgacaaagaa gtaatcgctt ctgattcaat aaatatttaa tttcttcaaa tatttataat 1021 tttctagttt ttgcttttct caaaaaaaaa taacggtgtg attaattttg cttagctaga 1081 ttagttgtac ttttaacttt ttactaaaaa gatttgctat aattgcttaa atataacaaa 1141 taaaaaagta atagtttaat gaaaaacatt atcttgttat taatgatatt ctcgatgtta 1201 gttgtactta gccatgtgca ccaaccacag ctg // LOCUS TIPREGU 29802 bp ds-DNA BCT 27-JUL-1990 DEFINITION Plasmid Ti (from A.tumefaciens) virulence regulon (vir) genes, complete cds. ACCESSION J03320 KEYWORDS virulence regulon. SOURCE Plasmid Ti (strain C58) (from A.tumefaciens) DNA. ORGANISM Plasmid Ti Prokaryota; Bacteria. REFERENCE 1 (bases 1 to 29802) AUTHORS Rogowsky,P.M., Powell,B.S., Shirasu,K., Lin,T.-S., Morel,P., Zyprian,E.M., Steck,T.R. and Kado,C.I. TITLE Molecular characterization of the vir regulon of Agrobacterium tumefaciens: Complete nucleotide sequence and gene organization of the 28.63-kbp regulon cloned as a single unit JOURNAL Plasmid 23, 85-106 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by C.Kado, 02-MAR-1990. FEATURES from to/span description pept 1983 4484 virA protein pept 5029 5766 virB1 protein pept 5766 6131 virB2 protein pept 6131 6457 virB3 protein pept 6457 8823 virB4 protein pept 8840 9502 virB5 protein pept 9603 10487 virB6 protein pept 10523 10690 virB7 protein pept 10677 11390 virB8 protein pept 11387 12268 virB9 protein pept 12265 13398 virB10 protein pept 13439 14473 virB11 protein pept 14728 15453 virG protein (ttg start codon) pept 17174 17617 virD1 protein pept 17651 18994 virD2 protein pept 19009 21030 virD3 protein pept 21104 23101 virD4 protein pept 23098 25734 ORF 5 pept 26802 27053 virE1 protein pept 27169 27360 virE2 pept 27374 29044 virE3 29044 site 1983 29044 vir regulon region BASE COUNT 7085 a 7819 c 8213 g 6685 t ORIGIN 1 taccgacgat atgtgcagtc catcagcagt ataatgatta caggagacca aaaattggtt 61 tcaaatgtaa ccgaataagc ctgctgtacg taatggcaat cggtcgatat cagtcggcac 121 agccactcgt cttgctgaga tgcaatgtcg acgtgtaacc aaaatgactt tgatggaagt 181 cgactttgaa gcctgaccga caccggctaa tcgaaatctg caactgctgg aactcggaag 241 cggccaaatc cctcgaggac cgagcgccgg ccgaggtgct tcgaaagaaa tcacttactc 301 aaataagatg tgcgaatagc tttggcgcgc cgcatgtcgc gcttggaggt gacgccgacc 361 gatgctgcgc cgccgatggc catccaacac gcaggtcaaa tgatgaagcg gcgaatcacc 421 gaattcgcgt cagcgtgacc gggcaaacag gctgtcctgc gccatcttct ggccactgag 481 gaaaatcgcg ctcctgcgag agcgcatgtt caagatattc attagctatg gcctctatga 541 gctctacgtg ccgccctgca tcaatatttg gcaactggct gattgcgaga tcgtgtttgc 601 gcgcaaaacg aattgcgcag cgatatccgt cgatatcttc caaaatcggt cgagcggcag 661 ggtagttcca gagttccgcc aactcctcca acaacgaggg gcgatcttcc cggatggcaa 721 acatttccgc aacgcgttgc ttggctcggg tgagaaaggc gtccgaatcc ccaagacgta 781 gccgcttgac atgccattga aaacccgatc tccaaaacgg acttttagcc atgcaattga 841 gaagcgaaat cgacccgccc tcgagaataa gaccctcttc ggacttccgc caatccactt 901 cgaatatgag tcgacgatgg gcactctcag cgtcaaggat gccctcggtg agggggcggg 961 aatccaaata tattctccgc gttgattgca attccgattc caaaggtctt ccgctacctg 1021 tcgcgatttg aggacagcat tgcacacgat caagggcaac caccggccac ccggtttctt 1081 gtgcgatttg gatcgccatg tccgttttgc cgctgcaagt cggtccgtag atgagatgga 1141 gtaacatgtg cgcctcatat aaatttaatg ggatatccca tctgttccaa accagatttt 1201 tcaattcgga ttatgatgtt ttactgcatg atttttacca gctttacgca atctgctgaa 1261 ctaatatggc gctacatttg taatcgtaca tttttgacgt tgccggctca ttcggccatg 1321 agagtttcca tcccgttgtc ctcctgaaca attgaggagt ttggccccga cacctgctta 1381 tggtgaatgc agcggacccc gttaacgcaa cgtcgaaaag cgataggcga tttgagtagg 1441 aagccacatc ggccgcgcgg tatgctccgt cagcacctga cagttcgtgt tgcagctttg 1501 cttggtcacc tgaaacgaaa aatgccgctg aagaatattt aacggatatg cacacgaaaa 1561 agccggttgg gtctcatatg ccgctgacgt cctggcttcg tttcgaagcc gacagcctct 1621 aaggtggtac gaacacagac ctaatatccg ctagtgtccg cggcagccgc cacggcagtt 1681 ctcgctgcgc atagcatagc aacttgccct gtcgacgcat gatcagccat attgttaatg 1741 tcgacgattt catttttctc ggtgcaaacg gcgacatcct gaccatatcg atcctaccag 1801 ccaccgcgcg gagtcaaaga acacctcggc aaccggaaac atccattgga atgccgaggc 1861 tgtgtttcat ttgaaacaca ctgagtcgac gtttgttgct tcaaacccat ttacaaaccc 1921 tactgtgcgg cctaagggcc aacggggtgg gactgagcgc tggtacgagg acgtaagtgc 1981 ggatgaatgg aaggtattca ccgtctcggc aagatttcaa gacaggcgcc aagccttggt 2041 ctatcctggc cttggtagtt gctgcaatga tttttgcctt gatggcgatt acgtcttggc 2101 aggacaatga aaccaatcgg gcgatcctga cccaattgcg agctattaac atcgacagtg 2161 cttcgctgca gcgggatgta ctctccgcgg aagcgggtgt ggtggcgaac taccggccca 2221 ttatctccag gttgggagct ttgcggaaga acctggaaaa tttgaagcga ctatttaaac 2281 aatctcatct tgtgatcggc aatgatttct ctcaactgct cgacaagcta aaggtgtctg 2341 tggatacgac cgacgcggcc gttgcagcct tcggagcgca aaacgtgctc ctgcaagatt 2401 cgcttgccag cttcactcgc gcgcttagta ttcttcccaa aatgtcgtcg acggatcaga 2461 cggtcgaaaa ttcgaacgaa ttgggcagcc tgatgctgcg gtttgtgcgt cagccaagcc 2521 cagcactctc gttggagatc agtcacgaac tcgacatgct ccaaaaagct agcggtgggg 2581 ctgaagttcc tatccgtata cttgcacgcg agggtcgcgt catcttgtcg attttgcccc 2641 gtgtgaacga tgccgtaaac atgattcaga cctccgacac cgctgaaatt gccgaaagat 2701 tggagcgcaa gtgtttggag gcctatagct tgcaaagcgt gagggagcag cgggcacgga 2761 tcttcttggg ttccgtttcg gtgggccttt gcatctacat catctcactg gtctataggc 2821 tgcgtcggaa aacggcttgg ttaacgcggc gtttggatta cgaagaggta atcaaagaga 2881 ttggggtttg tttcgagggg ggaggggcca cagcgtcgtc cctcaattcg tccgcgcaag 2941 ctgccgttgg aattattcaa cgcttcttta atgcggaatc gtgtgcacta gcattggtgg 3001 accatggtga caggtgggct gtcgaaagtt tcgctgcgaa gctgcctgag cccgtctggg 3061 aggacctcgc gctacgcgag atggtttctc ttgccagagc ggatgagcgt gcgtcagtat 3121 tccgcatcat gtcgacgcga aaggtcagct gccttcctcc ggagactccg ggcgtttcta 3181 tgctgctggc acacaaatct acggatcaac tgatagcgat ttgttccctc ggttaccagg 3241 gctatcgtct gaaatcttgt ccaggcgaag ttcagcttct tgaactcgcc accgcctgcc 3301 tctgccacta tatcgatgtc cggcgtaagc agaccgaatg cgattttctg gagaggcgat 3361 tagagcatgc ggaacgtctt caggcagttg gtacacttgc tggtggaata gcgcatgagt 3421 tcaacaacat tttgggagca atcctcgggt acgctgaaat ggcgcaaaac atgctgcgtc 3481 ggtcatctgt cacccgaaga cacattgacc aaattatttc gtcgggtgac agagccaggc 3541 tcattatcga tcagatcttg actctaagtc gaaaactaga acgcgtgaca aagccgttca 3601 gtgtctccga actcgtaatg gaaattgctc ccttattgcg tgttgcattg cagcgcaaca 3661 tcgagctgaa gttcaagttt gacgacaaga agagcgtagt cgaaggaagc ccgctagagg 3721 ttcagcagat gctgatgaat ctttgcaaga acgcttccca ggcgtttacc gccgatggtc 3781 aaatcgacat catcgttagc cgaatttttg tatctcgaca gaaagtactg gcgcatggtg 3841 ttatgccagc tggcgactat gttcttcttt ctgtcagcga cgacggtgaa ggcattgccg 3901 aaactgtgct cccccacatt tttgagcctt tctttacaac acgctcttgc agcggcggca 3961 cgggtctagg tcttgctgcc gtgcacggtc atgtcagcgc gcttgcagga tatattgacg 4021 ttacttcagc cgtagggcga gggacgcgct tcgacattta tctacctcct tcctcgaaga 4081 agcccgtcag cccggacgcg ttttttgggc cctgtaaaac accgcgtggc aacggagaaa 4141 ttgtggcatt gattgagcca gatcctgtct tgcgggaggt gtacgaagac aagatcgccg 4201 ctctgggcta tgagccggtg ggctttaaga catgtgcaga cctttgcaat tggatatcaa 4261 aaggcaagca agccgatctg gttctagttg accaatcgtc tcttcccgag aatcagagtg 4321 ctactgcttt gcacgcagcc ttcaagacgg cgtccatcat cattggagga agtgatctta 4381 aaatgtcact ttccagcgat gacatgacgt ccgcgctttt tctgcccaaa cctatatcct 4441 ccaggacgat ggcctacgcg attcgtacca agatcaaagc ctagagttga ggatgttttt 4501 caggagactc gaccagggta atcagcacgt ctgttcggcg cgcctatagc tatggtttct 4561 cgcaatcgct ttgggcagtc ggttgtggcg gtgcgtcttc tgagacccga aggttggcga 4621 ttatctgctg gaggtcttcg ataccgtctt cgaagaacaa gacatatgtt cgctgagctc 4681 ggacatagga tagtcggcat aggcggatac cctcgccacg caaatggcgc gatcgacacc 4741 ctgatcggag cctgcgaagt gccacagttt attggaacca tcccccgttc attccgccct 4801 ggaatttcac ttcgacggca ttctagatcc tcgccgttgt cttagagagc catctttcgt 4861 cggcgcaccc ttaagtgcat ggaaagccgt tttcgcttca aatgaaatcg aaaagaagaa 4921 aacaaaaatc ctagagtaac cgaccctccc gataatcgtg aacatcagat cacagcattt 4981 cttccgaccg aagtggctgt gttggttatg agcttgggga gattgggaat gttgaaggca 5041 acagggccgc tgtcgattat cttactggcc tccacgtgca cgtcgagtgg tgctgcccca 5101 ctttcatttg ctgagttcaa taattttgca cgcgaatgcg ctccatccgt tgctccatct 5161 acgcttgcag cgatcgctca ggtcgaaagt cgctttgatc cgcttgctgt gcatgacaat 5221 accaccggcg aaacgcttca ctggcagaac caggctcaag caacgcaagt cgtgatggac 5281 ggtctcgaag cacggcattc gctggatgtt gggctcatgc agatcaattc ccggaatttt 5341 tccgtgctcg gtctgacacc tgacggagcc cttcaacctt gcacgtcatt atctgtcgcc 5401 gcaaacttgc ttgggagccg ctacgctggc ggcaacacgg ctgacgacga gcaattgtcg 5461 cttcgtcggg caatctccgc ctataacacc ggtgatttca cgcacggctt cgcgaacggc 5521 tacgtgcgaa aagttgaaac ggccgctcaa cagctcgtcc ccccgttaac cgcgcgtcca 5581 aaagatgatc gtgagaagcc gggatccgag gaaacatggg atgtttgggg ggcatataag 5641 cgtcgttccc cggagggcgg agctggcggg tcatccggtc cgcctccgcc gccagacgag 5701 gacaaccgca aatccgaaga cgacgatcaa ctcttgttcg acttaaatca aggaggtccg 5761 caataatgcg atgctttgaa agataccgtg tacatctgaa tcgcctctcg ctctcgaacg 5821 cggtgatgcg catggtatcc ggctatgcgc cgagcgtggt cggtgcaatg gggtggagca 5881 ttttctcctc tgggccggcc gcggcccaat ctgcaggtgg cggcactgac ccagccacaa 5941 tggttaacaa catatgcacg tttatccttg gtccgttcgg ccagtcactc gctgttcttg 6001 gcatcgtggc catcggaatc tcctggatgt tcggtcgcgc ttcactcggt ctcgttgccg 6061 gcgtcgtcgg cggcattgtc atcatgtttg gagccagctt cctgggcaaa acgctgactg 6121 gaggtggcta atgaatgatc gtctggaaga agcaaccctt tacttggcgg cgacacggcc 6181 cgcattgttt cttggcgtgc cgctgacgtt ggcggggcta ctcgtgatgt ttgccggctt 6241 tgtcatcgtc atcgttcaga acccgctgta cgaagtcgtt ctcgtgccct tgtggttcgg 6301 agcgcggctt gtggtggaac gagactataa cgcggccagc gtcgttctac tttttttgca 6361 gacggcggga aggagcgttg atggtctgat ttggggcggc gcaagcgtta gcccaaatcc 6421 aatcaaggtt cccgcgcgag ggagaggaat ggcgtaatgc tcggagcaag tggcacaacc 6481 gaaaggtctg gcgaggtcta tctaccctac gtcgggcacg tcagcgacca tattgtcctt 6541 ctagaagatg gatcgatcat gacgatggcg cacgtaagtg gcatggcctt cgaactcgaa 6601 gatgccgaaa tgcgcaatcg acgttgccgt gcatttaata cgctcttgcg caatatcgct 6661 gatgatcatg tgtcaatata tgctcacctc gtacgtcatg acgatgtgcc gccgtcaccc 6721 gcgcgacatt tccgcagcgc tttttccgcc agtctgagcg aagcttttga ggagcgcgtt 6781 ctctccggca aactccttcg caatgaccac ttccttacgc tgatcgtgtc tccccgggcc 6841 gcgcttggca aagtgaggag aaggttcacc aaacgctaca gacaaaaaga aaacgatctc 6901 acagctcaaa ccaggaacct ggaagatctc tggcatcttg tcgctggcgc tctcgaagcg 6961 tacggcctgc gtcgtcttgg tattcgtgag aagcaagatg tgctttttac ggaggttgga 7021 gaagctctgc ggctgataat gactggtcga ttcacgccgg ttcccgtcgt tagcggttcg 7081 ctcggcgcct cgatctatac cgaccgagtt atttgcggca agcggggact cgagatccga 7141 acaccaaaag atagttatgt gggatctatt tactcgtttc gcgaataccc cgcaacgacg 7201 ggaccgggta tgctcaacgt gctactgtct ctcgattttc cgcttgttct gacgcagagc 7261 ttctcgtttc tgactcgctc gcaagcccac tcgaagctca gcctcaagtc cagccaaatg 7321 ttgagttctg gcgacaaagc cgtcacccaa atcagcaagt tatccgaggc ggaggacgca 7381 ctagcgagca acgaattcgt attgggggcg catcatgtga gtctttgcat atatgcaaat 7441 gatctcaata atcttgcaga tagaggtgcc cgcgcccgga cgcgattggc ggatgcggga 7501 gctgttgttg tccaagaggg catcggcatg gaggcggctt attggtcgca gctgccaggc 7561 aactataagt ggcgcacgcg tccgggagcg atcacatcgc gcaacttcgc tggtttagtc 7621 tcattcgaga attttcccga gggatccggc tcaggtcact ggggcaacgc gattgcgcgc 7681 tttcgtacca atggtggaac ccctttcgac tacatcccgc acgagcacga tgtcggcatg 7741 acggcgatat tcggtcccat cgggaggggt aaaacgacgc tcatgacctt tatcctcgct 7801 atgctcgagc agagcatggt cgaccgcgcg ggtgcggttg tcctcttcga caaggaccgc 7861 ggcagtgagc tgctggttcg cgccaccggg ggaacatatt tggcgctccg tagaggagga 7921 agcggattgg cgccattgcg tggcctggaa aatacagcgg cttcacatga ttttctgcgc 7981 gaatggatcg tggcgctcat tgagagcgat ggccgtggag gaatatcccc ccaggaaaat 8041 cgccgtctgg tgcggggtat ccatcggcag ctctcgtttg atccccacat gcgctcaatc 8101 gcggggttac gcgaattttt gttgcatgga cccgccgaag gggcgggagc gagactccaa 8161 cgctggtgcc gtggcaatgc actaggctgg gcgttcgacg gcgagctcga cgaagtaaag 8221 ttggatcctt cgattactgg tttcgacatg acgcatcttc tcgaatatga ggaagtatgc 8281 gctgccgcgc gagcatatct tctgcaccgc attggagcca tggttgacgg ccgtcggttt 8341 gtgatgagtt gcgatgagtt tcgcgcctat ttgctaaatc ctaaatttgc ggcggtcgtc 8401 gacaagttcc tgcttactgt ccgcaaaaac aatgggatgc tgatactggc aacgcagcaa 8461 cctgagcatg tcctggaatc gcagctaggc gccagtctcg tcgcgcaatg tatgacgaag 8521 attttctatc cttcacccac ggcagatcga tcggcttaca tcgatggact gaaatgtact 8581 gaaaaggaat ttcaggcgat ccgtgaagac atggcagtag gtagccggaa gtttctgctt 8641 aaacgagaaa gcggaagcgt cgtctgcgaa ttcgatctgc gggaaatgcg cgaatatgtc 8701 gccgtacttt cggggcgcgc caacacggtg cgcttcgcgg atcagcttcg caaagtacag 8761 ggggacaacc catcagcctg gctcagcgaa tttatggctc gttaccacga ggcaaaagat 8821 tgatcaaagg tggggaacta tgaagatcat gcaacttgtt gctgcggcca tggccgtcag 8881 ccttctttcg gtcgggcccg cgcgggcgca gttcgttgtc agcgatccgg cgacggaagc 8941 tgagacgctg gcgacggcgc tcgagactgc ggcaaatctc gaacagacca taacgatggt 9001 ggcgatgtta acctcggctt atggcgtcac cggcctacta acttcgctca accaaaaaaa 9061 tcagtatccc tcgaccaggg acttggacac ggaaatgttt tcgccgcgaa tgccaatgtc 9121 gaccacggca cgtgcgatca ccaccgatac agatcgcgcc gtagttggtg gcgacgctga 9181 agcggatctg ttgcgatcgc agatcaccgg ttccgcaaat agcgcgggca ttgcggctga 9241 caacctggag acgatggaca aacgcttaac agcgaatgcc gagacctcga cacagctttc 9301 tcgctctcgc aatatcatgc aggcaaccgt taccaacggt ttgcttctca agcagatcca 9361 tgacgcaatg attcaaaatg tacaggcgac cagcttgtta acgatgacca ccgcgcaggc 9421 tggccttcat gaggcggaag aggcggccgc tcaacgtaag gagcatcaaa agactgcggt 9481 catctttggg gccgtcccct gagactgggc gacttattcg tgccgcccat ctgcacattt 9541 caggatgcga gttcattgta tccgacattc tgcgacaagc cagtcaaggt caggtccaaa 9601 cgatgaattt cacgatcccg gcgccgttta cggccattca tacgatcttc gatctagcct 9661 ttacgacaag cctggacaca atgcttggga cgatccaaga ggcggtgagc gcgccattgg 9721 tcgcctgcgt cactctttgg attatcgttc agggtattct ggtcatgcgt ggcgaaatcg 9781 acacgcgcgg cggtatcact cgggtgatca cggtcaccgt cgttgttgcc cttgtcgtcg 9841 ggcaggccaa ctaccacgac tatgtggttt cagtctttga agagacgatt ccaaacttta 9901 ttcagcagtt tagtggcagc ggcctgcctc tgcagaccat tcccgctcag ctcgatacaa 9961 tgttcgccct aacccaggct gcatttcaga gaattgcatc tgaaatcggc ccgatgaatg 10021 accaggacat ccttgctttc cagggggctc agtgggtctt ttacggcacg ctctggtctg 10081 ccttcggaat ctacgacgcc gtcggaattc tcacgaaagt tcttctggcg atcggccctt 10141 tgatcctcac gggatatatc tttgatcgca cgcgcgacat cgcggcaaag tggatcgggc 10201 aacttatcac ctatggtctc ctacttctcc tgctaaacct cgtggcgaca atagtcatcc 10261 taaccgaagc gactgcactc acgcttatgc ttggtgtaat cacccttgcc ggtacgaccg 10321 cggccaagat cattggtctt tacgaactcg acatgttctt tctgactggt gacgcactca 10381 ttgtcgcttt gccggcaatc gccaacatcg gaggcagtta ttggagcggc gcgacccaat 10441 ctgccaacag cttgtaccgt cgcttcgctc aggtcgatcg tcgttaggtc ccgcaaaaaa 10501 attcacgtcg gtggagaact ctatgaaata ttgcctgctg tgcctggctc tcgctttaag 10561 cggctgccag acaaacgata aattggcgag ctgcaaaggc ccaatattcc cgctgaatgt 10621 ggggcgatgg cagcctacgc cgtcagatct tcagctcagc aacgtaggtg gtcgccatga 10681 aggggtctga atacgccttg ctagtagcgc gggaaaccct ggctgagcac tacaaggaag 10741 tggaagcttt ccaaactgcg cgtgcgaaat cagcgcggcg cctctccaag gtcattgcag 10801 ctgtcgcaac catcgcggtt ttggggaatg ttgcgcaagc cttcacaatt gccaccatgg 10861 tgccgctgat caggcttgtg ccggtatatc tttggatacg gccggatggc accgttgaca 10921 gcgaggtgtc cgtctcccga ttgcctgcaa ctcaagagga ggccgtcgtt aacgcctcac 10981 tgtgggagta tgttcggctg cgcgagagct atgatgccga taccgcccag tatgcctatg 11041 acctggtctc gaacttcacg cgcccaatgg tgcgccaaaa ttatcagcaa ttcttcaatt 11101 atcccaatcc aacttcgcct caagtcatcc tcggcaaaca cggcaggcta gaggtcgaac 11161 acatcgcttc gaatgatgtt actccgggtg tgcagcaaat tcgctataag cgaaccctca 11221 tcgttgacgg caaaatgccg atggcgagca cttggactgc tacggttcgt tatgaaaagg 11281 tgaccagctt gcccggcaga ttgagactga ccaacccggg aggcttggtt gtcacctcct 11341 accagacatc ggaagatacc gtttcgaacg caggccacag cgaaccatga cgaaaaaagc 11401 atttctcact ctggcatgtt tactttttgc ggcgattggc gcgagggctg aagacacgcc 11461 aacggcgggc agacttgatc cgcgcatgcg ttatctcgct tacaatcccg atcaagtggt 11521 gcgcctttca acggcggttg gagccacttt ggttgttact ttcggggcta acgaaacggt 11581 gacagctgtt gccgtttcca atagcaaaga tctcgcggcc cttccacgcg gaaattatct 11641 tttcttcaag gctagcaagg ttctcccacc ccagccagtg gtcgtgctaa ctgcgagtga 11701 cgccggtatg cgacgctacg ttttcagcat ctcttccaag acgctgccgc acctcgataa 11761 agagcaggcc gatctctact atagcgtaca attcgcttac cctgccgatg acgcagcggc 11821 tccgcagaag gcggcacaag agaaggctgt tgcagaccgt atacgtgcgg aagcgcaata 11881 tcaacagaga gcagagggtt tattggagca gcctgccacg accgttggtg ccgaggacaa 11941 gaattggcac tatgtcgctc agggcgatcg ttcgctgttg ccgctcgaag tcttcgatga 12001 tggatttacg acggtatttc acttcccagg taatgtacgc ataccctcca tctacacgat 12061 aaatccggat ggaaaggaag ctgtcgctaa ctattcagtc aaagggagct atgtcgagat 12121 ttcttcggtt tcccgtggtt ggcgtctgag ggatggccac acggtattat gcatttggaa 12181 taccgcctac gatcccgtcg gccgcaggcc ggagacgggc actgtgaggc ccgatgtgaa 12241 gcgcgtccta aaggaggtga gaggatgaac gacgacaatc agcaatcggc gcatgatgtc 12301 gatgcgtcgg ggtccctggt ctccgacaca catcaccggc gcctttcggg ggctcaaaag 12361 ttgatcgtag gaggtgtagt tctcgcgcta tcacttagcc tcatttggct tggcgggcgt 12421 gaaaagaagg aaaacgggga cgcacccccg tcaaccatga tcgccacgaa caccaagcca 12481 tttcatccgg ctccgattga cgttacactt gatcctccgg ctgcccagga agctgttcag 12541 ccgactgctc ctccgccagc acgaagtgag ccggaacggc atgagccgcg gccggaagaa 12601 acaccgattt ttgcgtacac cagtggtgat caagggacca gcaagcgcgt tcaacaaggc 12661 gaaacggacc gaagacgcga aggcaatggg gaagacagtc ctttgccgaa ggtcgaagtg 12721 tccgccgaga atgatctctc gatacgcatg aagcccaccg agctgcagcc caccagggct 12781 acgctcttgc ctcatcccga cttcatggtg acggagggga cgattattcc atgtatcttg 12841 caaacggcaa tcgacaccag tctggcaggc tatgtaaaat gcgtgttacc ctgggatgtt 12901 cgtggaacaa cgaacaacgt tgtgcttctt gatcgcggca ccaccgttgt tggcgagatc 12961 cagcgcggtt tgcaacaggg agatgcgcgt gtttttgtgc tctgggatcg ggcggagaca 13021 cccgaccatg ccatgatttc gcttgcgtca ccaagcgctg acgaactcgg tcgctcggga 13081 ttgccgggca ccgtcgacaa tcacttctgg cagcgcttta gcggggccat gctcttgagt 13141 gtcgtccaag gtccgttcca ggcagcgagc acctacgctg gcagctcggg tggagggacg 13201 agcttcaaca gcgtccagaa taacggtgaa caaacggcag acacagccct caaggccacg 13261 atcaacatac cgccaaccct gaagaagaat cagggcgaca cggtctccat ttttgtcgca 13321 cgggatctcg atttctcagg catataccag cttcgtatgg ctggtcgcgc ggcgcggggg 13381 cgggatcgcc gtccataacg aattcaactt atcacttaca gatggagata cgatacaaat 13441 ggaggtggat ccgcaattac gaatccttct caagccgatt ttggaatggc tcgatgaccc 13501 gcggaccgaa gaagttgcga taaatcgacc tggggaggca tttgtgcgcc aggccggcgc 13561 cttcctcaag ttccctttgc ctgtctccta tgacgatctc gaagatatcg ctattttagc 13621 aggcgcgctg agaaaacagg acgttggacc acgcaaccca ctttgcgcaa ctgaacttcc 13681 agacggcgag cggctgcaga tctgtttgcc gccgacggta ccatcgggca ccgtcagctt 13741 gacgattcga cggccaagtt cccgtgtttc tagtctcaaa gaagtctcgt cccgttacga 13801 tgctccgagg tggaatcagt ggaaggaacg aaaaaaacgg catgctcagc atgatgaagc 13861 tatccttcgg tactatgaca acggggatct ggaggcgttt ctgcacgcat gtgtcgttgg 13921 tcggttgacg atgctgcttt gcggacccac cgggagtggc aagacaacga tgagcaagac 13981 cttgatcaac gctatcccgc cgcaggaaag gctgattacc atcgaagata cgctcgaact 14041 cgtcattcca cacgagaacc acgtaaggct gctttattct aagaatgggg ctgggctggg 14101 cgcagttacc gctgagcacc tgctacaggc tagcctgcgc atgcgaccgg accgaatact 14161 gctcggcgag atacgcgacg atgccgcgtg ggcttatctg agtgaagtcg tctcagggca 14221 tccgggatcg atttccacaa tacatggtgc caatcccgtc caaggtttca aaaagctatt 14281 ttcgctcgtg aaaagcagcg ctcagggggc tagcttggaa gatcgcaccc tgattgacat 14341 gctcgcaacc gcagttgatg tcatcgtacc cttccgtgcc cacggtgaca tttacgaggt 14401 gggcgaaatc tggctcgctg ccgatgcgcg tcggcgcggt gagacaatag gcgatcttct 14461 taaccagcag tagttgtgat ccatgtttct aaatgccgca tggcgcgttg tagaattacg 14521 tttgtagcaa tgctcagcaa tctttgtcat caaacggaga catctagttt gcatttctgt 14581 cgtgcgcggt ttggtcgaaa tcttgccgaa atgcccgtgt agtgagagaa aattaaagag 14641 tggagtctag caaatacaac ctttacgtgt ataaattctg ttgagctgca aatggctggc 14701 caggatccta gattgagagg tgaaccgttg aaacacgttc ttgtcatcga tgacgatgtc 14761 gctatgcggc atcttatagt cgagtatctt acgatccatg cctttaaggt gactgcggta 14821 gccgacagca agcagttcaa tcgtgtactc tgctccgaga cggtcgatgt cgtggtcgtc 14881 gatcttaatt tgggtcgcga agatgggctt gaaattgttc gtagtctggc cacgaagtcc 14941 gatgttccaa tcataattat tagcggcgct cgcctcgaag aggcggacaa agttattgcg 15001 ctcgagttgg gagcaaccga ttttattgcc aagccttttg ggacgcggga atttctggcg 15061 cgcatccgtg ttgcgttacg cgtgcggccc agtgtcgcgc gaaccaaaga tcgacgctca 15121 tttagtttcg ctgactggac acttaatctc aggcgacgcc gcttgatttc ggaagagggc 15181 agtgaggtga aactcacggc aggtgagttt aatctcctgg ttgctttcct ggagaagccg 15241 cgcgacgtcc tatcccggga gcagcttctg atcgccagtc gggtacgcga ggaggaggtg 15301 tatgacagaa gtattgatgt cctcattttg cggctgcgcc ggaagcttga gggggatccg 15361 acgacccctc agttgatcaa gactgcaaga ggtgctggct atttctttga cgctgacgtg 15421 gatgtttcgt acgggggtgt gatggcggcc tgaggtagag gtgcatttcg cctttagcaa 15481 tctgttccca acgtgagcag attgctatgc ggcttggcaa agctgccttt cctcggtact 15541 atccgaaaaa ctcagcactg cggagtgatt ggatgggtcc tatcttttga gagatcagct 15601 gttcgttgcc ttctcccgag caaagaaaca tgcaagcgct gcggtagcca gcttgtggcc 15661 gaaagcccgg gcggtctcca atcccaatgg atcaaagtga ttgcgagcgg cttctattag 15721 cgagaccggg aacatgcggg aggtctgaac gatgattgat ttttcgaaag ctgtgtgagg 15781 gatcggataa ctcttcggag ccgcacgaaa cgatccatcc gccagcatgt tttcaaaatc 15841 gccaagcgca cggcgcaaga tcatttgtag cgacttggaa ggactgtatt gcaggatcag 15901 gttgtcatat atcttcgata cttcaggcgc gggcgggcgc gctgaaagga agacctggat 15961 cttttctggc gctgtcgtcg aactcaaagc atccacggtc agcatcgatt gctgatcaca 16021 gctgtgacaa cgcttggcgg tggctggggc aggtcgtcga tcttcctcgt cgagattttc 16081 aggcggctgc ggcagggtcg agttttgggt ggcaacaggc aaagaaggat ggacgatttc 16141 gggtcgagcg gcggcaagcc gcctggcctc cccgacagac aaagcgggtt tgcgaattcc 16201 catcttcacc cctccaaggc ttcgctaacc aatttggaga tagtgacgag ttcctccatg 16261 gcgattctga gattccgttc gaggaggcgc attgtcggat cggttctcat attcagcaat 16321 gtgagatgca acatgccacg ttccttcatc gcggcaaatg cgtctctctc gtgcatggga 16381 gactgtacaa ctggaaggct tgcgagcatg tccgacatcg cgcgctgcga tgtggtcaat 16441 cgaccaaccg gcacgcgttg gcgcaatacg gctgtcggaa ttgccaagtt ctcgctcagc 16501 agcagttcaa tgacatagcg gtaggtcgac aatgcttcat cgatatcgag cggagttaac 16561 atggtcggga tcagaagcag gtttgagctg gcaatgatcg tgttgttgag ttcgctcgaa 16621 ccaccatgcg tatcggccag cgcataatca aatccctgga gttcggcgtc ctcataggcc 16681 gcctcaagga gtgccatttc ctcggcggcg tagacttcgc agaaggagcc ccaggtattg 16741 ctgcgaaggg cgttttcttt ccatcgcgtt agtggtcggt tttcatcagc gtcgaagaga 16801 gccaatcgtt tgccgtcact tgcaaaggcg gcgcaaagac ccatgagtgc cgtggttttg 16861 ccggctcctc ccttgaagga gcaaaatgtc agaagtttca tgtccttatc ctgtcgattt 16921 tgtgaagcgg aagtgtgtct gtacttttat ttgtgtgtat gattttgcga taattcataa 16981 gtaatgtagt aattacctga ttttatattt caattttatt gtaatataat ttcaattgta 17041 ataatataaa aataaatatc ccttatgtgt tcttgatttc gttttgtata tggctagatt 17101 cccatctgcc acgacgagga aatgctacgg cggggcaagt tccaagatct ttccgtcttc 17161 tatggaggaa gctatgtcgc aaggcagtag gcccacctca agtgacattg ccgtcaacca 17221 gcgcgaatgc gtgaaggttg aaggcttcaa ggtcgtcagt acccgattaa gatcggccga 17281 atatgagagt ttttctcatc aggcacgctt gctgggcctc tccgacagca tggccatacg 17341 ggttgcggtg cgccgcattg gtggctttct tgaaatcgac gcagagactc gtcataggat 17401 ggaggccata ctacaatcca taggaacact ctcaagcaac attgccgcgc tgctatctgc 17461 ctatgccgaa aatccgacaa tggatttgga ggctttgcga gctgaacgta tcgccttcgg 17521 taaatctttc gctgacctcg acggcttgct ccgttccatt ttgtccgtat cacggcggcg 17581 gatcgacggt tgctcgctgc tgaaagacgc cttgtagcac tgacgtagca cttggcgggg 17641 aacatattcg atgcccgatc gagctcaagt tatcattcgc attatgccgg gaggtggcac 17701 caagaccctt caacaaatta tcaatcagtt ggagtatcta tcccggaagg gcaggctgga 17761 gctgcagcgt tcagcccgac atctcgatat tcccctgcca ccggatcaaa tccacgaact 17821 tgcccgaagc tgggttcaag agactggaac ttatgacgaa agtcagccag acgaggaaag 17881 gcaacaggag ttgaccaccc atattattgt aagcttcccc gccggtacaa gccaggtagc 17941 ggcttatgcg gcgagccggg agtgggcagc cgagatgttt gggtcaggcg cagggggggg 18001 ccgatacaac tatcttacgg ccttccacat cgatcgcgac cacccacatc tgcatgtcgt 18061 cgtcaatcgg cgcgaacttt taggacacgg ctggctgaag atatctcggc gccatcccca 18121 actgaattac gacgccctgc gcataaagat ggccgagatt tcacttcgtc atggcattgc 18181 cctcgatgcg agccgacgag cagaacgtgg catcaccgag cggccgatca cttatgccca 18241 atatcggcgc cttgagcggg agcaggctcg ccaaatccgt ttcgaagacg cggatttgga 18301 acagtcgtcg ccgcaaggag atcatccaga attcagccaa cctttcgata catccccatt 18361 tgaagcatcc gcgggcggac cggaggacat gcctcggccc aacaatcggc agaatgagtc 18421 gcaagttcat ctccaggagc cagctggtgt cagcaacgaa gccggtgtcc ttgtgcgggt 18481 tgcattggag acggagcgcc ttgctcaacc attcgtttcc gaaaccattc tcgcggacga 18541 catagggagc ggctcttcgc gtgttgccga gggccgtgtg gagagcgcaa accgcactcc 18601 cgatattcct cgcgcagcaa ctgaagctgc cacgcacacg acacacgacc ggcagcggcg 18661 tgcaaagcgt cctcatgatg acgacggagg gccgagtgga gcaaaacgtg tgacattgga 18721 aggcatcgcg gttgggcccc aggcgaacgc cggcgaacag gatggcagta gtggcccctt 18781 agtacggcaa gctggaacgt ctcggccatc tccaccgacg gccacgacgc gggccagcac 18841 cgcaaccgat tcattgtctg ctacagccca cctccagcaa cggagaggtg tcctttcaaa 18901 gcgtccgcgt gaagatgatg atggagaacc gagtgaacgc aaacgcgaga gagatgagcg 18961 cagcaaggac gggcgtgggg gaaataggag atagagaagg accgagtgat ggcaaatgaa 19021 gagttcacca gacactatgc gtggcccgtt cctgtggctt cgaatgatga agggcgtggg 19081 accgcgcgga tccccatcca ggcacaatca atcgttgctg gagaagacgg tcgggacact 19141 tcggtcccaa cggctttgtc gcgaccgcca attgaagata tgccgcacgg cgtccaagaa 19201 acatcggcga gtggcggacg actgggagcg gcccgtctgc gggattccgt aatcccgcca 19261 ggaatatccg aagcccgcac ggacctatcc gcaattttgc ggaaaaaaag cggttctttc 19321 cgcaccggta tgcagtatct gcgtgggctt gaacgggaaa attttgataa acaagacagg 19381 gaagccagtg cgttgccaga tttaagtgca aggggcataa agcgaccgcg cgaaattgag 19441 tatcccggca atgcaagcgg attaaccata aagaggcaag acggcttagg catagagatc 19501 aatactatct cggcatcttc gcccgtgaac cgggccgcgc attcgtcgaa ctggcaaggc 19561 gcgccggaac cgggcgtgta caatgttcag ccatcggcag atagagcaca gaactctgcg 19621 caggaaagtt cgacgtttcc cgatggcact tctgtttccg ctctctattc aggcccactc 19681 gcagaatggt tcgagaggga tactggcagc gagacgacca ggaattcagg caacaccatt 19741 tcgtcaccgc tccgagggct ggaggagttc ggcgattccg cggacagccg gtatcttggg 19801 cgcgaagctc agagtctttc agttaccgta acaacgccta attcgaatgc tgaggcaagt 19861 tctcatagcg cacacactga aactctcgac gatgtcagca gtgaccgctc gagcgaacag 19921 gggaggggtc cccttggcgc cgcaatcctt ggatcccatc atgacctttc accgcgtgcg 19981 cagaagttat cacaaacaaa ccgcgattcc cctgaactga ccgacgccga cctggcaaaa 20041 gtcgatgcgg tgttcgaatc tctctccaag gggcccccgg gggcggagag cgccgcaccg 20101 gactttcgtg agcgcggacc aggcagcgct ttccagaaag aaggcgtgtc cgacagggcg 20161 aacggggtgc ccactaactg ggaggtacct tttggtcgcg gtggcgggca ttcaccgcaa 20221 gcgctgaggt cctcaggagt ggagctcgat gacttccctg atttcaccga ggccgaactg 20281 gcaaaaatcg acgcactagt cgaatctcac tccaacagat ctttatcagt gcgaaatata 20341 gtaccagatt tgcgcggagc gggagcagac aacgtcttcc ggaaagaagg cgttgtcgaa 20401 cgcgcggaga agatgccgat cgatagtgtg agtcttacca gactgaatgg cgagcgttca 20461 cgttcaccga agacatcgca agcaagcctt gaggatttcc ccgatctaac cgatgccgat 20521 ctggcccata tcgaagagtc ggagcggatc gcaagaactg ctgttgaaaa ggggaagcaa 20581 aaaatctcga ccgaagccga tacgcggttt gatttgggca actcttctgc accccgggtt 20641 agccctcgat cggttacccc gttggttccc aatgccaatc agccgattac atcgtggttc 20701 tacgaggcac aaaagacttg tgacaaattg gtagagaaca cttacgtgaa gccggccgtt 20761 gacagttcta gggcgcgaaa tgacgtcgaa aacaccgctg ccaggcttgg tgaccctgct 20821 cctgctttgg ggcatgacaa tcttggtaga actcgcgcgt taacgcccgt tcgcgacgtg 20881 atgtctagac ccagcgcgga ccgacaactg gctagtcatg cggcagaaca ttcggcaata 20941 gacgacattt ggaaacggga tgaccgtgat cgcaggacac acccttatag aggattggat 21001 tcgcgttcac gtgaaggttg cgggcggtaa aggctgtgtt gttcgcagca acacccaaat 21061 caaaagccca atttgcgatc gatcgaagaa tctcggtgaa gtgatgaatt cgagcaagac 21121 tacgccccag cgtttagctg tcagcatcgt atgttcgctg gcagccggtt tttgcgcggc 21181 aagtctctat gtaacatttc gccatggttt caatggcgaa gcgatgatga cgtttagcgt 21241 cttcgccttt tggtacgaga ccccgctcta tatgggtcat gcgacccccg tcttctattg 21301 cggtttagcc attgtcgtct cgacgtctat tgttgtgctg ttaagccaac ttatcatatc 21361 gtttcgcaat cacgagcatc atggcacggc tcgttgggcg ggatttggcg aaatgcggca 21421 cgccggttac ctgcagcgct ataatcgtat caaggggcca atctttggca agacgtgtgg 21481 ccctcgttgg ttcggcagct atctgaccaa tggcgaacag cctcacagtc ttgttgtcgc 21541 gccaacgcgc gccggcaaag gcgttggcgt cgttattccg acgctcctaa ccttcaaggg 21601 ttcggtgata gccctcgacg tcaaaggcga gctttttgag ctgacttcca gagcacgcaa 21661 agcgggccgc gacgccgttt tcaagttctc gcctctggat ccagagcgac ggactcattg 21721 ttacaatccc gtcctcgata ttgcagcttt accgcccgag cggcagttta ccgagacgcg 21781 ccgcctcgcc gcaaacctca tcacggccaa aggcaaggga gcggaaggct ttatcgacgg 21841 cgcgcgggac cttttcgttg cgggcatcct tacctgtatt gaccgcggta cgccaacaat 21901 tggtgcggtc tacgacttat ttgctcaacc tggagagaag tacaaacttt ttgcgcacct 21961 cgcggaagaa agccgaaata aagaggctca gcgcattttc gacaatatgg cgggtaacga 22021 cacgaaaatt ttgacttctt atacatcagt gctcggcgac ggcggactta acctgtgggc 22081 tgatccattg gttaaagcag cgacaagtcg atcagatttt tccgtctatg atctgcgtcg 22141 gaagaggacc tgcgtctatc tttgcgtcag tcccaacgac ctcgaggtcg tggcgccatt 22201 gatgcgcctc ctttttcagc aggtcgtgtc aattctgcag cgatcactgc caggtaaaga 22261 tgagcggcat gaagttttgt ttctccttga cgaatttaaa cacctgggga agcttgaggc 22321 aatcgagacc gcgattacaa ccatcgccgg ttacaaaggc cgcttcatgt ttattattca 22381 gagtctctcg gccttgacgg gcatctacga tgacgcgggc aaacaaaact ttctcagtaa 22441 cactggcgtg caagtattta tggccacggc cgacgacgaa acaccgacct atatctcaaa 22501 agctatcggc gactatacgt tcaaggcgcg ttcgacctca tacagtcaag cccgaatgtt 22561 cgatcataac atccagattt ccgatcaggg ggcaccgctt ttgcgccccg aacaagtgcg 22621 cttgcttgac gataacaatg aaatcgtcct tattaaaggg catccgcctc tcaaactacg 22681 aaaggtgcga tattattccg atcgtatgct gaggcgcctt ttcgaatgcc aaattggcgc 22741 cctccctgag cccgcatctt tgatgctttc ggaaggtgtc catcgggatg ggcaagacct 22801 cagtcaacaa gccgccgtca cggaggcagc tagtactcga tcgataccta ataatatgga 22861 agccgctaca ccgcaaaaca gtgaaatgga tgacgagcag gacagcctcc caactggaat 22921 tgacgtcccc cagggcctta ttgaaagcga tgaggtgaaa gaagacgccg gtggcgtggt 22981 gccagatttc ggtgtttcag ctgaaatggc tccagctatg attgcacaac agcagctgct 23041 ggagcagatc attgcgcttc agcaacgata tggacccgcg tcctcgcact cggtgaaatg 23101 acttcaagct gcctttcaca ttggaatcat tcgatgcggg gcgtcgtcag atatccggtt 23161 gtattcgaat gcggactaga accccgaggt tgccgattaa cagatatcgc catgagacct 23221 tcaggaaacc cgaacgtcga tcttagcggt tcgactgcat cgcttgccga agttcccgcc 23281 ggagctaccc ctgtccttaa tctaatcgag cccaggaacc gtccggctga cgactcgctt 23341 gagggccaaa ccgatcgcgg cgagcatcca tctgcatcat ttgactatga tggcatgaag 23401 cttggcgccg cggagcgtga agcatacgag aactggtgtc catcgaaccg gcctacatgg 23461 aaagatctgg tactcagggc gcgccttgat gcaatcgaca gttccgcttg gctccccgat 23521 ttgggcgagg agtcgccttt gatcttcaga tatgaaggga ttccgctggg tgagggggaa 23581 cggcaagcct acaaagaatg gcaagaggag gctcagccca catgggaaga cctcgttgtc 23641 aacgcacgaa tggcggtacc tgatccttgt gctgacgttg cagacgagca caatcccctc 23701 aaagaaggcg aggagtttcg gtctgaagcg tcgaaacgca agcggaaaaa accgatcgac 23761 caggacgaga attctcctac atcgttttac tatgacggga tgaggctcgg agaacccgag 23821 cgcgaggcat atgataactg gggcaacgcg gagccgccca cgtggaaaga cctggtactt 23881 aaggcgcgcc ttgatgcaat tgacagctcc gcctggctct ttgcttcaga agggtcttcc 23941 tcgacttttg agtatgaggg aattccactg ggtgaggggg aacggcaagc ctacaaagaa 24001 tggcaagagg acgctcagcc cacgtgggag gacctcgtca ttaatgcacg catggcagaa 24061 ctcgaccatc cttcttggat tacagacgag cacaattccc ttgaagaaaa cttagagttt 24121 cggcccgatg caagacaggc cagcctgaag gactcgaccg accagcggaa gagttcttcc 24181 gcgtcattta tctatgatgg aatgaagctc ggggaacccg agactgctgc atacgagaac 24241 tggagcaaac cggaacgacc gtcatgggaa gccctcatcc tagatgggcg ccaggcttcc 24301 atagcaagct cttcggtttc gaattcgtta cttgcaaaga catcctcgcc agtctttcta 24361 tacgagggaa tgtcgctagg ggatgcggaa cgtcaatcgt atccaaggtg gaggcagcct 24421 gcccaaccgc gatggcaaaa tcttgtggtg aacgcgcgcc ttgcggatct tgatccgttg 24481 gcctggattc ccgatgaaca tgatccgttt gcggaagccg aggcgcttag ccccacttcg 24541 caatcgagtg gcgccaacaa gtccaaccgc gctttgggtc aatcagattc aggccggccc 24601 gccttcgcat atctggcagc acaagaggcg agtcacctgc aatcaccggc atgctcacaa 24661 ttggaaacga ggcgtgcatt aaatttcggg tcgcctggac cggatgcaaa tccaacggaa 24721 agcatcgcca aatgtaatcg tttggatggc gtcagtaaaa ttaagcgatt aggcaccaaa 24781 ggccgccggg cagtaaacgc gactattcac ggcggcaagt tcggtgcgca aggactgttg 24841 tccgaagatt gcggacaagc ggctgagccc tctccatcgg aacaaactac tcggccgcga 24901 accgataata ttggtaccta tgcaagccgg aaaaacgaac gagctcggct ggctacagag 24961 accgggaaat acgaatcgga gcacattttc ggatttaagg tcgtccacga tactttgcgg 25021 gcgaccaaag aaggccgtcg tctcgaaagg ccaatgccgg catacctcga atacaaggag 25081 cttcatcggc aacacgtagg gacaggaaga ggacggactg gactggtcgg gcgcggatgg 25141 ccagatgatg cgagctatcg cgcagatcaa agggcaaccc tgtcggaccc tgttgcgtcc 25201 gcggaaggtg caacagcttc aaatgggtac cagttgaacc agctgggcta cgcgcaccag 25261 ctcgccaccc atggtctcca aagtgaaacg cccgatgggg ttaccatgcc aattcaagtt 25321 gcgacaatta gctacaacta tacagtgagt cgtgatccgg tcctttcacc acccagcaaa 25381 gaacaagcgc ccccattgct acacctgggc cttcgtgggc aaacggaagc tgtgcttgcc 25441 cgcgagaccg cattgaccgg aaaatggcca actctcgagc gtgagtgtca agtgtatcag 25501 cggtttttgg ccctgtacga cgtaaagaag gatctggacg ccaaacaact tggtgtacgg 25561 cagaaaaagc aggcactcgt tgcggcgttg aaccggaccg ccggcttaat tggcgcgtca 25621 cctttggaag cccaatcgtc gaccgctgag gttgaatata ctaccgacga gcccgatgaa 25681 cgacgggttt atgatccgcg cgatcgaggc agagacaaag cgttcaatcg ctgacaaagt 25741 cgcctagaca atgattgtta cggacttcgc aatcgatttc ttgacagcag tgccattcaa 25801 cttttgatcc accatcggaa aataatcggc ggagaaacgc acgacagctt ctattgccaa 25861 gccccgtcga tcgatctcat ggcatattcc cgttatggtc gatttttcga ggggccgcgc 25921 aacctctctc agcaaagatg aagttctcgc gggtccgcca tgggtctcca ggaaaatcct 25981 gggcgtcacg gtcggtaaag aattgaagac tttaagcaaa cgcgaacttg acgttcaaag 26041 attacgcgcc gcggaacgcg ccctcgataa acaagtctgg cagaatcctc ccgttaaccc 26101 ctcccgtcga aacgaggaga tgccaatagt catcttcgaa acgaacgcga tgagcggcta 26161 gagtcggaga tgacgatcgc gcatcaagcc gcttccgtct ccgttggcga ggatcgccaa 26221 gctcttgatg ccgctcgatc gaccttgaat gaactgcaca actcgccatc gtcggatgat 26281 agggcaaact tgagtttgcc gagagccgaa atttgcgacc gtacaacgta tccgccgtct 26341 ccatcccatt gaatacggat atcctgagca ttaggtccgt gccaatacgt ctccggcaag 26401 agctgcagat ttggcatggc ggatgatgag ggatttgctg ggcgctttga gattgtcgag 26461 gcgcgtcgag gaatactatc cccctcgaca agcgaagagt ctgatccgtg gggtcgaggc 26521 ctgcagaaca ccgtaaacgc taacggcggg atcccgaatg accagtcatt gtggcattcc 26581 gaaagaatta cattcacacg gcaccggtta ctcgatcact atctcctgcg gccttattcg 26641 ataaagtcgt gctaactccg tgacttggca agaattacat atgaaacgcg atattcgttt 26701 aatgacggag gctcggatgg tagattcacc gaaggctcaa ttctattagg agtcttgatt 26761 gtgtccgaag agtgactgcc gctgcagcag cctagaaagc aatgcacggt gatgatgttg 26821 atcggacggc tttcgtcccg cgcaggctgc cggctgaagc cgcttacgtc gatagtcaag 26881 ttgacctggc ggcaaaccga caacctcact gtgtgacttg ttttgagcga cttttggttc 26941 aagagaaaga tgaaaaccgt cgacgattgt gggcttccgg caaagcatgc gccacgtctg 27001 aagagagttg tgcaggaggc cctacactcg caccgggcca ttcggggcca tgaagatgga 27061 gcaaacctta atttgcaccg ataattttcc gttcgatgga tttgggctgg gcaggatcgg 27121 agcggaaggc actaggaaaa tcgtaatttc tgtaggagaa acaatgatat ggtgatcatc 27181 aagctaaatg cgaataaaaa catgcctgtc ttggcggttg agaagccgca agaaattcac 27241 aaagaggagt tgagcgacca tcaccagtca aatggcttta cgagtttgga tctcgaaatg 27301 attgaactgg agaattttgt ccttcactgc ccgctccccg aagaaaacct agccggctaa 27361 aaggagttag acgatggatc cgaaggccga aggcaatggt gaaaatatca ctgagaccgc 27421 agcaggcaat gtcgaaactt ctgatttcgt gaatttgaag cgccagaaga gggagggcgt 27481 aaattccacc gggatgtccg aaattgatat gacgggtagc caagaaactc ccgaacacaa 27541 catgcacgga agcccgactc acacggatga tctcggcccg cggttggatg cggacatgct 27601 cgattctcag tcaagtcatg tttctagcag cgctcaaggc aatcggtctg aggttgaaaa 27661 tgagctatcc aacttattcg cgaagatggc tttaccaggc catgatcggc gtaccgacga 27721 gtatattctt gtgcggcaaa ccggacaaga caagttcgca ggtactacta aatgtaacct 27781 cgatcatctg cccaccaagg cggaattcaa tgcgagctgc cggctctata gggatggagt 27841 cggcaactac tatcccccgc ccctcgcatt cgagaggatc gatatcccgg agcaattggc 27901 tgcacaattg cataacctgg agccaagaga acagagtaaa cagtgttttc agtacaagtt 27961 ggaagtctgg aatcgcgctc acgcagagat gggcatcact ggcaccgaca tcttctatca 28021 aacagacaag aatattaagc tcgaccgaaa ttataaattg aggcctgagg atagatatat 28081 acaaacagag aaatacgggc gcagagaaat tcaaaaacgc tatgagcacc agtttcaagc 28141 tggttcactg ctgccggata tcttaatcaa gaccccgcaa aatgatatac atttctcgta 28201 caggtttgcg ggcgaccgtt acgctaacaa gcgatttgag gaattcgaac gcgcaatcaa 28261 aactaaatac ggtagcgata ccgagatcaa gctcaaatcc aaatctggga ttatgcatga 28321 ctccaaatat ttggaatcat gggagcgggg cagtgcggat atccgtttcg cagagttcgc 28381 cggcgagaat cgagctcaca acaagcagtt tccggctgcg actgtgaata tgggaaggca 28441 gccagatggc cagggaggga tgactcgcga tcgccatgta agcgttgact acctattgca 28501 aaacctaccc aactcccctt ggacgcaagc cttgaaagag ggaaagttgt gggatcgagt 28561 tcaggtcctt gctcgcgacg gaaaccgtta catgtcacct tcaagactgg aatattccga 28621 ccccgaacac tttacccaac tgatggatca agttggtctg cccgtgtcga tgggtcggca 28681 aagtcatgcg aatagtgtca agtttgagca gtttgacaga caggcagcgg ttattgttgc 28741 ggatggcccg aacttacgtg aggttccaga tttgtccccg gaaaagttgc aacaactgtc 28801 tcaaaaagat gtcctgatag cggatcgcaa tgaaaagggg caaagaaccg gcacttacac 28861 taatgttgtg gaatatgagc gcctgatgat gaaattaccg agcgacgcag cgcagcttct 28921 cgctgaaccg tccgatagat attcacgtgc ttttgtccgg ccggagccag cattgccccc 28981 catcagtgac agccggcgga cttatgaaag ccgaccgcgc ggcccaaccg taaacagtct 29041 gtagatttcc tgataccgcg tcagtgacga agccctttag ttcaaaatat gcatttgaaa 29101 ggattcaaca atggtcgaca cgacgaagaa gagtgtcgcg aagtcgctta cggctgacat 29161 gcgccgttct gctaagcggc tttctaagca aatgcgtaaa gcctcgctta ctgaagagga 29221 ggcaacaagg aatctagccc ggctcgaaac gccggaccag aagcgaaaat atgtcgccga 29281 tatgcagata atcgacaagc tggaagacgg ctttcgaggc gaaataagct ataaaatgct 29341 gggaaataaa cagcttcggg tcgacagccc aaaagaatta acgcgcgagc atggtataat 29401 aagaaaaaca agaaaggttc tgaagcgtaa cgcagagact ggcaatgttt acttgggtct 29461 ccacgaaaag aagacctgga ggagcgttag cagccatctt tatgccgagg acggtacact 29521 tcgcgcgaag catgtgaaat acaaagacgg acgctttgaa gaaaaatggg aacgagacga 29581 aaatggcctg ctgttccgca cgcagtttgt caaccgaaat cggctatttc aacctatttc 29641 cgagaaggtc agcacaccct accggagcgg accggaaaac cggctctttc gtgaactaac 29701 ccgtcgaaaa ggttccaagc aggaaacttt tgagcgggac gaaaaaggca acctcgagct 29761 catcggcagc aaacgtctcg gcttttccaa gaattcgacg aa // LOCUS HUMPCOLLA2 700 bp ds-DNA PRI 27-JUL-1990 DEFINITION Human procollagen alpha2(I) chain gene, exon 6. ACCESSION M35391 KEYWORDS procollagen alpha2(I). SOURCE Human DNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 700) AUTHORS Weil,D., D'Alessio,M., Ramirez,F. and Eyre,D. TITLE Structural and functional characterization of a splicing mutation in the pro-alpha2(I) collagen gene of an Ehlers-Danlos type VII patient JOURNAL J. Biol. Chem. (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable [or printed] sequence for [1] kindly submitted by F.Ramirez, 19-JUN-1990. FEATURES from to/span description pept / 6 / 59 procollagen alpha2(I) chain, exon 6 (AA at 6) /hgml_locus_uid="LP0002V" /nomgen="COL1A2" /map="7q21.3-q22.1" IVS < 1 5 procollagen alpha2(I) chain intron E IVS 60 > 700 procollagen alpha2(I) chain intron F BASE COUNT 235 a 120 c 101 g 244 t ORIGIN 1 tctagaactt tgctgctcag tatgatggaa aaggagttgg acttggccct ggaccaatgg 61 tatgcttatc tgtttatctt agccaaaaaa attgctaaat aaatcattca ttttatgtca 121 cattttacca cgccatttat ttagctacct aagttaacac tcaatactta gattatataa 181 aaaacaactc tttttgtttt caaatttatg aaaacataag ttaaggagtt cacttttctt 241 tacaaaagaa agattaattg atcttttatg attatatgat ctttttgatt atatgatcct 301 cattaagata gatcatatac ttatgtccga agaaataatc tttcggcaat atgccaaatc 361 aattcgggaa atcaatttaa tttcaaaagt aatccatctg attaattttt catatcttca 421 atcggtagat ttagcaacac tagaggcata gaactattta ttaagttctc tgaacttgtt 481 ggaaaggatc aacaagttct atctagtcca gctaactcat ttttaaaatg ggagagttta 541 agcccttttc tcaaagtcat ccaggtaact aatgacataa ctagaactag atgccaggca 601 agatgtctaa tatttgctta catcatggtt tatgtaccta gtccttgaat aaaccactca 661 tttagtcaac agatattaat cagatgcctt caatgggccc // LOCUS MUSC4P 1739 bp ds-DNA ROD 27-JUL-1990 DEFINITION Mouse complement C4 5' flank. ACCESSION D90052 KEYWORDS C4; complement C4; promoter. SOURCE Mouse (strain B10.BR) genomic DNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 1739) AUTHORS Nakayame K Pattanakitsakul,S.-N., Yokoyama,S., Kimura,H., Nosaka,M. and Takahashi,M. TITLE Post-transcriptional regulation of the complement C4 gene in the low C4-producing strain of mouse JOURNAL Immunogenetics 31, 361-367 (1990) STANDARD full staff_entry COMMENT These data kindly submitted in computer readable form by: Masaru Nonaka Cancer Research Institute, Kanazawa University 13-1 Takaramachi Kanazawa 920 Japan Phone: 0762-62-8151 FEATURES from to/span description site 1691 1691 transcription initiation site BASE COUNT 376 a 464 c 476 g 423 t ORIGIN chromosome 17. 1 gtcaacagtt attgcacatg cttaatgtaa acctgtagcc tctacatgct tgtgtgcctg 61 cccagacagt atatgacacc cacacccaca cccacaccca cacccacccc accccacttc 121 acacccaggc agcagtagtg gacccaacag ggtaagtggg tccctgcttg ggttgcctgt 181 gtttctggaa gctcaaggtc catgcatcag cagtagggga agggagggtt ctcagcccag 241 gctgcatcat catggccgcc taccacctgc aggcacctgg tgaacgctgg ggtcctcact 301 gtccgagatg ctggaagctg gtggctggct gtgcctggag ctgggagatt catcaagtgc 361 tttgttaaag gtatggtgcc tgaggagcaa cacccatggg gggtctttag gtctccaaat 421 ttgactcact tcttcctttt cctggacctt ctcctccagg gcgccaggct gtactgagca 481 tggtgtggaa ggccaagtac tgggagcttg ccctgtcaga gctcctgggc cgcagggccc 541 ctttggcagt gcggctaggt cttgcctacc atgtgcacga cctcattgga gcccagctgg 601 tggactggtg agtctagtcc tgagacttgg gaaacgtgca gggacccagg gttggtgtcc 661 ccctaagtgt tgcttctgtc atcagtgaag ccaagattct ctcattcccg gactcgggag 721 aagactggac taaattctgg atttgcacac tttgggtcta ggcagctggc tccagtcttg 781 tctttctgtt acacagtgtc cccaccactt ctggaaccct ccttcgcctg ccagatacat 841 gaggagccaa tgcttggatt ctgcaactca ctgagtgagg ttcctggaag tgccacccca 901 gggtggctga gcaagtcacc gcagtgggtg ccaggctcta ctgctgcaag ctgggcttct 961 acctgagccg ggctgtgggc attgcagctc ttgcttctgt gcgtgtggag tcaggagccg 1021 tgccaagggg atgagaaggt gggattgcta gagacactgg agcagggagt agaaaactct 1081 gcccttcacg tcaggctgaa attgccaaat aaaatacttg tgcctgtact tctttctgtg 1141 tcctttgttc agtgtggtgt atgtacggct gtgtgaagcc caacttgggg ctgagcattt 1201 catcattgtt tagagagaga acaaggctta ctcaggccaa tttccctgtc tggggtgaga 1261 gtgtacctca tcttctgagc tataggcttt ctgttagagc cataccaagc agaaggctgc 1321 cttgaatatg aggccatcga ctggggaacc cattctagtt ttgttaggac ctttacctgt 1381 cttatctatg tgtcatgttg gagctgaata cacgcacaca tggacaagca catgtattcg 1441 tacacacaca cacacacacg cacacgcaca caaacacaca cacaagggga agtccctggt 1501 actgggaata aacaagcacc agatcaattt tggggccaag atttctacag agaccctggt 1561 gtttgggttg cctccccagc tctgggccta gggccagttc tgtttcttgg ccatcacgtg 1621 gtttcccggc tcatggggca aagggagggg caaggtccag atcaaactct gccccaaccc 1681 taggttgatc agaagggagc agacagtcag accagacagg tctgaccttt cctggatcc // LOCUS BSURPOF 3078 bp ds-DNA BCT 27-JUL-1990 DEFINITION B.subtilis minor sigma-37 factor of RNA polymerase (rpoF, sigB), complete cds. ACCESSION M34995 KEYWORDS RNA polymerase; minor sigma-37 factor. SOURCE B.subtilis (strains 168 and Marburg), clones lambda-[21,gt11,1,11,21,31]. ORGANISM Bacillus subtilis Prokaryota; Bacteria; Firmicutes; Endospore-forming rods and cocci; Bacillaceae. REFERENCE 1 (bases 368 to 2297) AUTHORS Duncan,M.L., Kalman,S.S., Thomas,S.M. and Price,C.W. TITLE Gene encoding the 37,000-dalton minor sigma factor of Bacillus subtilis RNA polymerase: Isolation, nucleotide sequence, chromosomal locus, and cryptic function JOURNAL J. Bacteriol. 169, 771-778 (1987) STANDARD full staff_review REFERENCE 2 (bases 1 to 3078) AUTHORS Kalman,S., Duncan,M.L., Thomas,S.M. and Price,C.W. TITLE Similar organization of the sigB and spoIIA operons encoding alternate sigma factors of Bacillus subtilis RNA polymerase JOURNAL J. Bacteriol. (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by C.W.Price, 18-FEB-1987. Draft entry and computer-readable sequence for [2] kindly submitted by C.W.Price, 07-JUN-1990. FEATURES from to/span description pept 200 529 ORF V pept 526 1008 ORF W pept 968 1762 37 kd minor sigma factor (rpoF, sigB; ttg start codon) pept 1762 2361 ORF X mRNA 131 2368 sigB operon mRNA signal 131 168 sigB operon promoter signal 2368 2401 sigB transcription termination signal binding 188 194 ORF V ribosomal binding site binding 511 519 ORF W ribosomal binding site binding 954 961 sigB ribosomal binding site binding 1750 1753 ORF X ribosomal binding site BASE COUNT 999 a 575 c 755 g 749 t ORIGIN 40 degrees on B.subtilis map. 1 ctgcagaagc tcattgagga acatatgtgt tcctctgcgc aggaaatggt caaaaacatt 61 tatgacagcc tcctcaaatt gcaggatttt cagcttcacg atgattttac gttaattgtt 121 ttgcggagaa aggtttaacg tctgtcagac gagggtataa agcaactagt gatttgaagg 181 aaaatttgag gtgatacgaa tgaatataaa tgttgatgtg aagcaaaacg agaatgatat 241 acaagtaaac attgcaggag aaattgatgt atactcagcc ccggtgctta gagagaagct 301 cgttcctctg gcagaacaag gagctgactt aagaatttgc ctgaaagatg tcagctacat 361 ggacagtacc ggattgggcg tttttgtagg gacctttaaa atggtgaaaa aacaaggtgg 421 ttcgctgaaa cttgaaaatc tttctgaacg gctgatccga ctgtttgaca ttacaggctt 481 gaaggacatc attgatattt ctgcaaagtc agaaggtgga gtgcaatgaa gaataatgct 541 gattacatcg aaatgaaagt gccggcccaa cctgaatatg tgggaattat aagactgacg 601 ctgtcagggg tcgcaagcag aatgggctat acgtacgatg aaattgaaga cttgaaaatc 661 gcagtcagtg aggcgtgcac aaatgcggtt cagcacgctt acaaagaaga taaaaatggg 721 gaagtgtcaa tacgattcgg tgtgtttgaa gaccgtttag aggttattgt ggcggatgaa 781 ggagacagct ttgactttga tcaaaagcag caggatctag ggccgtacac accttcgcac 841 acagttgatc aattatcaga aggagggctc ggtctatatt taatggaaac gctcatggat 901 gaagtcagag tgcaaaacca ctccggcgtc accgtagcga tgacaaagta tttaaatggg 961 gagcgagttg atcatgacac aaccatcaaa aactacgaaa ctaactaaag atgaagtcga 1021 tcggctcata agcgattacc aaacaaagca agatgaacaa gcgcaggaaa cgcttgtgcg 1081 ggtgtataca aatctggttg acatgcttgc gaaaaaatac tcaaaaggca aaagcttcca 1141 cgaggatctc cgccaggtcg gcatgatcgg gctgctaggc gcgattaagc gatacgatcc 1201 tgttgtcggc aaatcgtttg aagcttttgc aatcccgaca atcatcggtg aaattaaacg 1261 tttcctcaga gataaaacat ggagcgttca tgtgccgaga cgaattaaag aactcggtcc 1321 aagaatcaaa atggcggttg atcagctgac cactgaaaca caaagatcgc cgaaagtcga 1381 agagattgcc gaattcctcg atgtttctga agaagaggtt cttgaaacga tggaaatggg 1441 caaaagctat caagccttat ccgttgacca cagcattgaa gcggattcgg acggaagcac 1501 tgtcacgatt cttgatatcg tcggatcaca ggaggacgga tatgagcggg tcaaccagca 1561 attgatgctg caaagcgtgc ttcatgtcct ttcagaccgt gagaaacaaa tcatagacct 1621 tacgtatatt caaaacaaaa gccaaaaaga aactggggac attctcggta tatctcaaat 1681 gcacgtctcg cgcttgcaac gcaaagctgt gaagaagctc agagaggcct tgattgaaga 1741 tccctcgatg gagttaatgt aatgatccag gttgaagaaa acgagcatat tcaaactcta 1801 gtatatcaac tgaataaaga agggaaatcg atttgcggtg acagtttttt tatgaaagct 1861 gatgataagg agttaatttg cgcggttgct gacggactgg gaagtggatc acttgctaac 1921 gaatcctctg cagccatcaa agacttagtg gaaaactatg cgagtgaaga cgtagaaagc 1981 attatcgaac gctgtaatca ggcgatgaaa aacaaaagag gcgctacagc atctatcctg 2041 aaaatcaatt ttgagcaaag gcagtttacg tactgctctg tcggaaatgt acggtttatt 2101 ctgcattccc cgtctggtga aagcttttat cctctgccga tttcgggcta tttatcaggc 2161 aagccgcaaa aatacaaaac gcacaccgcc acctatgaaa agggttcaaa gttcattata 2221 catacagatg gactcaacgt acctgatatc cgctcccatt tgaaaaaagg ccaatcggta 2281 gaagaaatat cgaattcgct gaaaatgtat acgacatcga gaaaagatga cctgacctat 2341 attctcggac agctgtccta aaaaaccaga aaaagaagct ggacatccgg cttctttttt 2401 ttgcggttga ctattaataa gtgcgcttat tataattaat aagtgaactt atcattctgg 2461 gagcttatgg gatgtggaat tatgaataca cagcgatagc tgatgtggat gtgggaaaac 2521 ctggaactgt tatcaggaaa cagaaacaat ggatcaatgg gagacgggta ttgcatcaga 2581 ttctttgaaa ggacaatttc aaaatggaac ggaaggaaca ttgcttctgc aaggaaaaac 2641 aattctctta tggcttgagc atgtggagaa acatcgggcg ttttcttgtc atggtcaaca 2701 tgccccgcat agcaatgtaa taatggagtt ttttcacgtg cttcaaaagc aaaaaagaag 2761 atttgtgtat cacatatgga attgtattag aaggcgctga agcaaaaaag tggggagaaa 2821 tcattgggac tgaactgtct aaagacatgc cgacggctgt cagcaggctt gtccatcttt 2881 acggaggagt gataaaatga acacaaatta ctggatcggc gttgtgtcag agcaacatgt 2941 attaaaaggc gctgccggcg gttttgcaca gctgtgccat ggaaaaaaag ccccgctcgc 3001 caaaatgaaa gagggagatt ggctgattta ttattctccg agagacgcat atccagacgg 3061 caaactgctg cgaagctt // LOCUS HUMERPR 1624 bp ss-mRNA PRI 27-JUL-1990 DEFINITION Human erythropoietin receptor mRNA, complete cds. ACCESSION M34986 KEYWORDS erythropoietin receptor. SOURCE Human cell line OCI-M1, cDNA to mRNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 1624) AUTHORS Ehrenman,K. and St John,T. TITLE Sequence of the human erythropoietin gene JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by T.St.John, 07-JUN-1990. Author address: T.St.John Hutchinson Cancer Res Center 1124 Columbia St. Seattle, WA Email: TSTJOHN@FHCRCVAX FEATURES from to/span description pept 1 1527 erythropoietin receptor precursor sigp 1 72 erythropoietin receptor signal peptide matp 73 1524 erythropoietin receptor BASE COUNT 276 a 537 c 498 g 313 t ORIGIN 1 atggaccacc tcggggcgtc cctctggccc caggtcggct ccctttgtct cctgctcgct 61 ggggccgcct gggcgccccc gcctaacctc ccggacccca agttcgagag caaagcggcc 121 ttgctggcgg cccgggggcc cgaagagctt ctgtgcttca ccgagcggtt ggaggacttg 181 gtgtgtttct gggaggaagc ggcgagcgct ggggtgggcc cgggcaacta cagcttctcc 241 taccagctcg aggatgagcc atggaagctg tgtcgcctgc accaggctcc cacggctcgt 301 ggtgcggtgc gcttctggtg ttcgctgcct acagccgaca cgtcgagctt cgtgccccta 361 gagttgcgcg tcacagcagc ctccggcgct ccgcgatatc accgtgtcat ccacatcaat 421 gaagtagtgc tcctagacgc ccccgtgggg ctggtggcgc ggttggctga cgagagcggc 481 cacgtagtgt tgcgctggct cccgccgcct gagacaccca tgacgtctca catccgctac 541 gaggtggacg tctcggccgg caacggcgca gggagcgtac agagggtgga gatcctggag 601 ggccgcaccg agtgtgtgct gagcaacctg cggggccgga cgcgctacac cttcgccgtc 661 cgcgcgcgta tggctgagcc gagcttcggc ggcttctgga gcgcctggtc ggagcctgtg 721 tcgctgctga cgcctagcga cctggacccc ctcatcctga cgctctccct catcctcgtg 781 gtcatcctgg tgctgctgac cgtgctcgcg ctgctctccc accgccgggc tctgaagcag 841 aagatctggc ctggcatccc gagcccagag agcgagtttg aaggcctctt caccacccac 901 aagggtaact tccagctgtg gctgtaccag aatgatggct gcctgtggtg gagcccctgc 961 acccccttca cggaggaccc acctgcttcc ctggaagtcc tctcagagcg ctgctggggg 1021 acgatgcagg cagtggagcc ggggacagat gatgagggcc ccctgctgga gccagtgggc 1081 agtgagcatg cccaggatac ctatctggtg ctggacaaat ggttgctgcc ccggaacccg 1141 cccagtgagg acctcccagg gcctggtggc agtgtggaca tagtggccat ggatgaaggc 1201 tcagaagcat cctcctgctc atctgctttg gcctcgaagc ccagcccaga gggagcctct 1261 gctgccagct ttgagtacac tatcctggac cccagctccc agctcttgcg tccatggaca 1321 ctgtgccctg agctgccccc taccccaccc cacctaaagt acctgtacct tgtggtatct 1381 gactctggca tctcaactga ctacagctca ggggactccc agggagccca agggggctta 1441 tccgatggcc cctactccaa cccttatgag aacagcctta tcccagccgc tgagcctctg 1501 ccccccagct atgtggcttg ctcttaggac accaggctgc agatgatcag ggatccaata 1561 tgactcagag aaccagtgca gactcaagac ttatggaaca gggatggcga ggcctctctc 1621 agga // LOCUS HUMGFP40H 4379 bp ds-DNA PRI 27-JUL-1990 DEFINITION Human P40 T-cell and mast cell growth factor (hP40) gene, complete cds. ACCESSION M30135 KEYWORDS T-cell growth factor; cytokine; mast cell growth factor; megakeryoblastic leukemia cell growth factor. SOURCE Human (cell line CESS) DNA, clone lambda-H40.3a1. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 4379) AUTHORS Renauld,J.C., Goethals,A., Houssiau,F., Merz,H., Van Roost,E. and Van Snick,J. TITLE Human P40/Il9: Expression in activated CD4+ cells, genomic organization, and comparison with the mouse gene JOURNAL J. Immunol. 144, 4235-4241 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by J.Van Snick, 12-DEC-1989. FEATURES from to/span description pept 796 909 P40 T-cell and mast cell growth factor precursor, exon 1 1022 1057 P40 T-cell and mast cell growth factor precursor, exon 2 1140 1172 P40 T-cell and mast cell growth factor precursor, exon 3 2461 2592 P40 T-cell and mast cell growth factor precursor, exon 4 4096 4215 P40 T-cell and mast cell growth factor precursor, exon 5 sigp 796 849 P40 T-cell and mast cell growth factor signal peptide matp 850 909 P40 T-cell and mast cell growth factor 1022 1057 P40 T-cell and mast cell growth factor 1140 1172 P40 T-cell and mast cell growth factor 2461 2592 P40 T-cell and mast cell growth factor 4096 4212 P40 T-cell and mast cell growth factor IVS 910 1021 hP40 intron A IVS 1058 1139 hP40 intron B IVS 1173 2460 hP40 intron C IVS 2593 4095 hP40 intron D signal 742 747 TATA box signal 4354 4359 polyA signal rpt 2965 3265 Alu repeat binding 370 376 AP-2 binding site binding 624 629 IRF-1 binding site binding 627 633 AP-1 binding site BASE COUNT 1208 a 921 c 904 g 1346 t ORIGIN 1 ctagtatgta gtaagttctc agtaaatgtt agctactata ctctttcaag tgctgggttt 61 ttacttgatg tcatacagtg ttatataaga tctccaaaga tactgaggag tcctcaaggc 121 caattttaac aagcatggtt gccgcattct tgtgcttata gttgaacatt tcttctttca 181 gacacttgca caaagggata cttctaagat gcatttgcat taggtggcaa acttcatcct 241 gggtatgaaa aacattgaga tttgggaata aagcatagta agactgaggt tgcaattact 301 aaaggaaaac cccaacagag ataagtgaag ttctgcaata tcatgcaccc tcccccaacc 361 cgctctgtct ccccaggccc cccttcgtta gaacacccat gactggctat attatatcag 421 catttcccat aatgtaaaaa gggaaaatac agacctgggc gttcatggaa agtattctaa 481 ctctcacaac cagaatccct gtctttgaat tttttttctt ggtttttaga tctttaactt 541 ttccttcagc atttcagtac tcaacttttt gaaaatcatc ttttctgagg aatgatattt 601 cctggcacag catcatctct gtcaagtgac tcagtttgat ttttttgttt gttagtataa 661 agtggcccca acttacagag aaaaagtggg ctcttggtat cagtttgatg tcagggtttt 721 tccgtgtttg agagggagct ttaaatacca ctcgatttga aggtgtctgc aagcgagctc 781 cagtccgctg tcaagatgct tctggccatg gtccttacct ctgccctgct cctgtgctcc 841 gtggcaggcc aggggtgtcc aaccttggcg gggatcctgg acatcaactt cctcatcaac 901 aagatgcagg taggctgcag ggggagccca tgggaaagac agctactgac aaagtgaaat 961 atgtatgagg atgaaaaaac tcggggctga ctaaaggttc ttatctctct atctacttta 1021 ggaagatcca gcttccaagt gccactgcag tgctaatgtg agtgaatgct ctttaagaac 1081 tttccaaatt aattttaatt ttcacatctg gaatcttcac tctgaaattt cccttgcagg 1141 tgaccagttg tctctgtttg ggcattccct ctgtaagtat agtgaaataa cataatgttg 1201 accttggatt tttttggttt gtttttaagt aaaaataagt tgctttattt aatatttaat 1261 gttatacatt gttgcttaat ttaattgtta cagattagta ttccctgtta aaaccacatt 1321 gttacaaatt attccctttt aaaactacga tcttgaaatc ctatattatg aacatttctt 1381 tgtatttaat taactttatg cctcttgaga agtttgaaca cttttcaaca ttaaaaaaag 1441 aatcctgaat atctttttag ataggtggcc atgtgcacaa ttaaataaaa ctggaactaa 1501 ggatataata attgctgtag ctcatatcat attgctttct aactcattta ctgataactc 1561 tagagttgtg aaacaatgta aataaaatga caactcctta tctttcatct gtcatgaatg 1621 atctatgcgc tatacctccc cctccctgcc tcctcccttc ctccccacca ccctgttgtc 1681 tgtctagctg attagagtga ctgttggttt gaatgctgcc ctctgggcag gtagaggatc 1741 tgaggttgtg agtggaagga gggcttccag agggccactg cccactacgg caggaaggat 1801 gggtggcagg aaagttctga ttcctaattc aaactcctgg ttagggtgag gaggaggcac 1861 ttctccaagg tgcagtgctt tattctttct catgcaaggc ctgggagaat ctgaagaatc 1921 tgagcttctt gccctggcta gggtaagaca tcgcacccat cgcggtccat ccattagatg 1981 agaagaggat agagtgcctt ctgggcagga accaggcaga cagcacagcc cctgtccctt 2041 ggagtacagt ccatgttttt agctgctgct gaaataccag ctgcattcaa ttgtcacatc 2101 ccattagctg gtgtgaaaag gcttttcctc actctgcact ttcagactta caagccttga 2161 agccgggaag cacccgttga aaagaacatt cagagccgac tatttcaggg cccagagccc 2221 tcatgtttcc tggatgtaac atacaggaag tctcctccag gggatgtcac tgtggaaaaa 2281 tggcatcccc tttaaatacg ggagatcact tcctacattg gcaagggacc tgtctaaaaa 2341 taatgcaagt ttgagtaatg gtgattaaat aaaaatcatc tctattatat tgctctttgt 2401 gatatatttc caaagctgtc ctcagaatat ttctttgaat aaatccttac tatttaccag 2461 gacaactgca ccagaccatg cttcagtgag agactgtctc agatgaccaa taccaccatg 2521 caaacaagat acccactgat tttcagtcgg gtgaaaaaat cagttgaagt actaaagaac 2581 aacaagtgtc cagtaagttt gttttcatat gtgatatgtt cctgttggtg atttctatgt 2641 gaatggtgat gccaaccctg tttgaacgca aaaggatgat aaagttggaa ttggtagttc 2701 aaggttgata aaagacatct aagaatttta atcagaagta atataattaa agtgagatcc 2761 actgaaacaa tagaattaaa gtgagataga tcattgttcc tgacgaggcc atttacttct 2821 ctctactatg gaataatgaa agaatccttt ctgagtgtaa ttagaagcta caatctagag 2881 aatcagggat gtagctcaca taatactaaa ttatcctaga gattcaatgt actaactgaa 2941 tggatgttgt taacagggat ttttttttcc tgttggttaa ggaggttttg ttttgttttg 3001 gagacagagt cttgctctgt tgcccaggct ggagtgcagt ggtgccatct gagctcactg 3061 cagcctctgc ctcccgggtt caagtgatta tcctgcctca gcctcccgag tagctggcat 3121 tacaggtgcg tgccaccatg cctggctaat ttttgtattt ttaatagaga tggggtttca 3181 ccatgttggc caggttgctc tccaactcct gaactcaagt gatttgcccg ccttgacctc 3241 ccaaagtgct gggatgacag gtgtgagcca ccatgcctgg cctgcattaa ggaggtattt 3301 aaagggcaat gcacccaggt caaggtggaa gcttgctact catcctgaat gcccatccac 3361 acattctttt cttcagcata taccctagtc cctgacagca gactgggatg gcaagttggg 3421 tagaggtgac ctccctctgt tttttgggta ttagcatctc cacacaagat cctagaaggc 3481 tgaaagccct gagctcagct gtttagctgc atgcgtttct accatcaatg gcatctagtt 3541 ctaagtgctt aatatatgct gtctcactga ataaatacat accttaggga caattattca 3601 atttattact ctcagtgagg ttaactaatt tgcctaaggc tgcatatttg ataagtggca 3661 gagctgagat ttgaactcag gcctatatga cctcagagcc ccactcttag ccattgtact 3721 gtcaaatgac cttggaaaga caacctaaaa ggataatgat acaattttag gcctcaaaga 3781 gtccccagaa aaggctttct ctaatgcaga gatttagggc cacttaatag gggtgtgtgt 3841 gtgtgtgtgt gtgtgtgtgt gtgtgtgtgt gtaaagaccc ctgaaatcca atttgaggtc 3901 aaccacctat gctgtcttta caccacatga gctagcctgg acctgcccac ctatttgctc 3961 tgtgtctcaa gccacttccc ttcccatccc cacaatcctc accaccgact ctggctcttg 4021 gcaggtaggc ttctggggct gcttggctct acatcatttg agtcactctg tccttatcaa 4081 ctttcatccc cacagtattt ttcctgtgaa cagccatgca accaaaccac ggcaggcaac 4141 gcgctgacat ttctgaagag tcttctggaa attttccaga aagaaaagat gagagggatg 4201 agaggcaaga tatgaagatg aaatattatt tatcctattt attaaattta aaaagctttc 4261 tctttaagtt gctacaattt aaaaatcaag taagctactc taaatcagta tcagttgtga 4321 ttatttgttt aacattgtat gtctttattt tgaaataaat acatatgtgg aaaaaacaa // LOCUS MEFDNABP 1710 bp ds-DNA BCT 27-JUL-1990 DEFINITION M.fervidus DNA binding protein HMfB gene, complete cds. ACCESSION M34778 KEYWORDS DNA binding protein HMfB. SOURCE M.fervidus DNA. ORGANISM Methanothermus fervidus Prokaryota; Bacteria; Mendosicutes; Archaeobacteria; Methanobacteriales; Methanothermaceae. REFERENCE 1 (sites; for [2]) AUTHORS Sandman,K., Krzycki,J.A., Dobrinski,B., Lurz,R. and Reeve,J.N. TITLE DNA binding protein HMf, isolated from the hypothermophilic archaeal Methanothermus fervidus is most closely related to histones JOURNAL Proc. Natl. Acad. Sci. U.S.A. (1990) In press STANDARD full staff_review REFERENCE 2 (bases 1 to 1710) AUTHORS Sandman,K., Krzycki,J.A., Dobrinski,B., Lurz,R. and Reeve,J.N. JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by K.Sandman, 01-JUN-1990. Author address: K.Sandman Department of Microbiology Ohio State University 484 W. 12th Ave. Columbus, Ohio 43210 E-mail KENDRICK-K@OSU-20.IRCC.OHIO-STATE.EDU FEATURES from to/span description pept 578 787 DNA binding protein HMfB pept 1662 805 (c) ORF285 signal 450 457 promoter signal 523 530 promoter signal 790 814 transcription termination signal BASE COUNT 619 a 263 c 265 g 561 t 2 others ORIGIN 1 bp upstream of HindIII site. 1 aagctttgag agcaatatat gaatccaatn gaactgccct aancgtcagt gatgaagaaa 61 tacttaaggc tcaaagacta ttagcaaaaa aaagaggtat aggtgttgag ccagcatcag 121 ctgcaagtat tgcagactaa tcaaattaat agatgatggc tatttagatc atggagaaag 181 agttgtatgc gtattgacag ggcatcttct aaaagatcca aaaattgcta taaatgcttg 241 taaaaagcct ttagaagtag aaccagattt taatttacta aaaaagatta ttggataaat 301 agaaaatttt attctataaa aaaacgcctt tgatacaata tctatattat ccatttatat 361 ggctatcaaa gcacaatata aactttttat ggttaataag aatttcacaa ccaccatcaa 421 aataaaacat caacgatgca aacaataaat ttatatagga taaatttgat aatattcttt 481 cgtaagaata gagaggaaaa catcaacgat gcaaacaata aatttatata ggataaattt 541 gataatattc tttcgtaaga atagagaggt ggtaagtatg gaattaccaa tagcaccaat 601 tggtagaatt ataaaagatg caggagccga aagagtcagt gatgatgcaa gaataactct 661 agcaaaaatt ttagaagaaa tgggtagaga cattgcttca gaagctataa aactagcaag 721 acacgccgga aggaaaacaa taaaagcaga ggacattgaa ctagcagttc gaagatttaa 781 gaaataatat ttttcttttt ccttttatat ttttattgca aataaaaaat catacaaaat 841 ttttgctgca ttcatcgctg cggggtcgcc tattctatct gttgcaactt caacaatgtc 901 aatgccaata attttttttc tagctatttt ttcaatgaat ttttccatat ggtagggagt 961 gatacctaca ggcgttggat ttccaacacc tggagcataa ccaggatcta atacatctat 1021 atccactgtt acatagaatg gtccatctaa tttatttatt tcatttaaaa ccatgttaaa 1081 tttttcaatt atgtctcttg atgtatagta tttgatattt tcattaagaa caaattcatg 1141 ttcttctttt gtacaagatc ttacaccaat ttggacaatt ttcttcggat gtaattcata 1201 tattcgtcgc atgactgttg catgcgaaaa ctttttacca gcataagtat ttgccatatc 1261 catatgtgca tcaaagtgta gaatagttac atcacttaga tcaaatgttt tcacaacacc 1321 atatgttatg ctatgttctc caccaaggac aataggaatc attccttttt ttaaaacttc 1381 ttgaactttt tcagaaatat tacgacaagt atttttaaag tcacctagaa ttacctcaat 1441 atcacctaaa tcaatgatag gaacatctag tttttttcta tagtgtaaac tatagttttc 1501 aaaattgtag gatgcttgac gaatcatcaa tggtccaaat cttgaccctg gcttatatgt 1561 ggatgtgcta tcaaaaggta ctcctaacaa tccaaaaact tttttattag aaaggtattt 1621 aagggaggaa gggtccttag aatatgcaaa ttttagaccc atattattta actcttaaaa 1681 gttttgtttt acccattgct tctaaatatt // LOCUS MUSINT1A 5607 bp ds-DNA ROD 27-JUL-1990 DEFINITION Mouse mammary proto-oncogene Wnt-1 (int-1), complete cds. ACCESSION K02593 M34750 KEYWORDS Wnt-1 oncogene; int-1 oncogene; oncogene; proto-oncogene. SOURCE Mouse (BALB/c) DNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1098 to 5606) AUTHORS Van Ooyen,A. and Nusse,R. TITLE Structure and nucleotide sequence of the putative mammary oncogene int-1; Proviral insertions leave the protein-encoding domain intact JOURNAL Cell 39, 233-240 (1984) STANDARD full staff_review REFERENCE 2 (bases 1 to 5607) AUTHORS Nusse,R., Theunisseen,H., Wagenaar,E., Rijsewijk,F., Gennissen,A., Otte,A., Schuuring,E. and Van Ooyen,A. TITLE The promoter of the Wnt-1 (int-1) oncogene and its mechanism of activation by insertion of proviral DNA of the mouse mammary tumor virus JOURNAL Mol. Cell. Biol. (1990) In press STANDARD full staff_review COMMENT Draft entry and computer readable sequence for [2] kindly submitted by R.Nusse, 31-MAY-1990. FEATURES from to/span description pept 1779 1882 Wnt-1 protein (exon 1) 2452 2705 Wnt-1 protein (exon 2) 3279 3544 Wnt-1 protein (exon 3) 4002 4490 Wnt-1 protein (exon 4) IVS 1883 2451 Wnt-1 cds intron A IVS 2706 3278 Wnt-1 cds intron B IVS 3545 4001 Wnt-1 cds intron C site 1133 1134 MMTV insertion site in tumor 17 [1] site 1394 1395 MMTV insertion site in tumor 35 [1] site 1631 1632 MMTV insertion site in tumor 102 [1] site 4495 4496 MMTV insertion site in tumor 53 [1] mRNA 1428 5398 Wnt-1 mRNA (alt.) mRNA 1595 5398 Wnt-1 mRNA (alt.) signal 1562 1566 TATA box signal 1573 1577 TATA box BASE COUNT 1188 a 1681 c 1519 g 1219 t ORIGIN Chromosome 15. 1 atgtatgtat gtatgtatgt atgtatgtat acgtgcgtgc acctgtgtgt gcttggtgtc 61 agtggggctc agacatcacc tgattccctg gaactggagt tacaggtggc tataagccac 121 cacttgggtg ctgagaacag agtccgggcc tctggcagag cagtcagtgc ttttagccac 181 tgagccactc tcatcccccc aattatgttc atcttgagtt gggcaggtac ggtggcggaa 241 taggcctgta atcccagcag tcactggacc atcatgggtt ctacatatta aacctttatg 301 ttaggtaggg tcacacagca agatccggtc acaaaaccag caacaacaaa aaccaaaagg 361 agccagcttc ttcccacaag cattctttcc ctcaggtctt cagctccatc tgacagctac 421 tcggctggtg gtcctatcct ttctgagcct agttgccaga gaaacaagcc cggttcatct 481 tcatgactag cacatctaat gataagcaca ggttgactca aggtgccata gagtgacact 541 aggtacccag agcgacagaa tgacacctat gagtgcacgt cgttaatcac aaacacacac 601 acacacacac acacacacac acacacacac tcatgcaccc acctgcaaac acaattgcag 661 ccttctggac gtctcctgtc acagccccac ctccttcctg atacactgcg ttaagtggtg 721 actgtaacaa aatgacttca tgctctccct gtcctgagcc aaattacaca attatttgga 781 aagggctcaa aatgttcttc gttagaagtt tctggataca ccaatacaca ggagcgtgca 841 ccctcagaac acatgtacac tttgacttaa tctcacgggt gacacaccga cgcttacact 901 ccccctagcc cacagaggca aactgctggg cgcttctgag tttctcactg ccaccagctc 961 ggtttgctca gcctaccccc gcaccccgcg cccgggaatc cctgaccaca gctccaccca 1021 tgctctgtct ccttcttttc cttctctgtc cagccgtcgg ggttcctggg tgaggaagtg 1081 tctccacgga gtcgctggct agaaccacaa ctttcatcct gccattcaga atagggaaga 1141 gaagagacca cagcgtaggg gggacagagg agacggactt cgagaggaca gccccaccgg 1201 cgcgtgtggg ggaggcaatc caggctgcaa acaggttgtc cccagcgcat tgtccccgcg 1261 ccccctggcg gatgctggtc cccgacgggc tccggacgcg cagaagagtg aggccggcgc 1321 gcgtgggagg ccatcccaag gggaggggtc ggcggccagt gcagacctgg aggcggggcc 1381 accaggcagg gggcgggggt gagccccgac ggttagcctg tcagctcttt gctcagaccg 1441 gcaagagcca cagcttcgct cgccactcat tgtctgtggc cctgaccagt gcgccctggt 1501 gcttttagtg ccgcccgggc ccggaggggc agcctcttct cactgcagtc agcgccgcaa 1561 ctataagagg cctataagag gcggtgcctc ccgcagtggc tgcttcagcc cagcagccag 1621 gacagcgaac catgctgcct gcggcccgcc tccagactta ttagagccag cctgggaact 1681 cgcatcactg ccctcaccgc tgtgtccagt cccaccgtcg cggacagcaa ccacagtcgt 1741 cagaaccgca gcacagaacc agcaaggcca ggcaggccat ggggctctgg gcgctgctgc 1801 ccagctgggt ttctactacg ttgctactgg cactgaccgc tctgcccgca gccctggctg 1861 ccaacagtag tggccgatgg tggtaagtga gctagtacgg ggtccgccac ttgtcctggg 1921 gcaaagagcc aggcacgggc cttacccagc tcccacgctg tggggatcac caacctacag 1981 acccccctcg tgcattgtga cttcacatcc agggtgctca cacctagaac tagctctgct 2041 gaagtggggc acatcattgg catgcagaag cccagataca ccaggctcag agaccattcc 2101 catttaatac gaccccgttt ctgctgagca acaggtccca acctcgctgt ggtgggtgct 2161 caggtgtccc ttaggtcttg aaccaaaaaa aaaaaaaaaa aaaaaaaaaa accagatatt 2221 agctttgagg tgagggagtg gaattcctaa gtttttcaag gtgggcaagg ctgcaggtgg 2281 ggtttctcct cgggggctga cttgaagaaa ggaagagcta aggtagccat gccttttctg 2341 tccactcact agactctgga gctcagggcc aggcaaggat agggtggtac agcctgtatg 2401 gttaggatgc aggtcccctc ccctggactg aacccttatg catcccgcca ggggcatcgt 2461 gaacatagcc tcctccacga acctgttgac ggattccaag agtctgcagc tggtgctcga 2521 gcccagtctg cagctgctga gccgcaagca gcggcgactg atccgacaga acccggggat 2581 cctgcacagc gtgagtggag ggctccagag cgctgtgcga gagtgcaaat ggcaattccg 2641 aaaccgccgc tggaactgcc ccactgctcc ggggccccac ctcttcggca agatcgtcaa 2701 ccgaggtggg tgcccaggaa agcgacgctt ccgggattaa gggaaaagca gggtcatctc 2761 cagggcatag gcgggcgaag gcagggaaga catcccaggg ttatatgtga tcaaactgag 2821 aatcgcctgg tgccggcagt taccgtaggt cagcaccaga ttctttctag ccttgcgttg 2881 tgagcatgat ctttaacgtt gctggccact ggcccacaga aagggaattc cggatcgtgg 2941 gcgctgggcg acagctgttt ttccctagcc ttcctcaaag gtacctggga agctgatctc 3001 tgagggctag ctagggttgt gcttcgcacc cagcaaagtt tgcactgcca atactagtag 3061 cgatcttggc tatgcagatt tgttctactt gggaatctcc ccttggagct gctctgctag 3121 ggctctggag tctcagtaaa gcttagagag gagggcattc catgcttcgc acacatgact 3181 ccaaggatgt tggactgtag ggtaccaagt cttccaaaca gggtgctgag ttggccccac 3241 gccttctctc aactgatgcg gggtcgcttc acccacaggc tgccgagaaa cagcgttcat 3301 cttcgcaatc acctccgccg gggtcacaca ttccgtggcg cgctcctgct ccgaaggctc 3361 catcgagtcc tgcacctgcg actaccggcg gcgcggccct gggggccccg actggcactg 3421 ggggggctgc agtgacaaca tcgattttgg tcgcctcttt ggccgagagt tcgtggactc 3481 cggggagaag gggcgggacc tacgcttcct catgaacctt cacaacaacg aggcagggcg 3541 aacggtacgt cggtgtgtcc ggaaccaatg gcaggggaga tgtaagacag gtgcacgggg 3601 acagaggcac agggaggggc ttcccgagag agtgggactc taggagggaa gacagagaag 3661 aggtggtggt tgagggcaaa gaggttcctg agctgatgac agaacagaag agattagcag 3721 gctatcaaca cgtgggatgt attgagatgg ctccatggca cacttttgaa agataaaagt 3781 gacttgctgg cgtggagcag agtctggccg aatgtcccta tctcagcggg ccattttgca 3841 cttcctctct cccgagctta gtcacacctg gaccttggct gaagtttcca cagcatcgac 3901 gtgacccggg tggggtgggg gtggggaagt atgggtggtg gttcgtggga tgttggcttt 3961 gaccttttct tccctcctcc cctcgtcccc tcctccccca gaccgtgttc tctgagatgc 4021 gccaagagtg caaatgccac gggatgtccg gctcctgcac ggtgcgcacg tgttggatgc 4081 ggctgcccac gctgcgcgct gtgggcgacg tgctgcgcga ccgcttcgac ggcgcctccc 4141 gcgtccttta cggcaaccga ggcagcaacc gcgcctcgcg ggcggagctg ctgcgcctgg 4201 agcccgaaga ccccgcgcac aagcctccct cccctcacga cctcgtctac ttcgagaaat 4261 cgcccaactt ctgcacgtac agtggccgcc tgggcacagc tggcacagct ggacgagctt 4321 gcaacagctc gtctcccgcg ctggacggct gtgagctgct gtgctgtggc cgaggccacc 4381 gcacgcgcac gcagcgcgtc acggagcgct gcaactgcac cttccactgg tgctgccacg 4441 tcagctgccg caactgcacg cacacgcgcg ttctgcacga gtgtctatga ggtgccgcgc 4501 ctccgggaac gggaacgctc tcttccagtt ctcagacaca ctcgctggtc ctgatgtttg 4561 cccaccctac cgcgtccagc cacagtccca gggttcatag cgatccatct ctcccacctc 4621 ctacctgggg actcctgaaa ccacttgcct gagtcggctc gaaccctttt gccatcctga 4681 gggccctgac ccagcctacc tccctccctc tttgagggag actccttttg cactgccccc 4741 caatttggcc agagggtgag agaaagattc ttcttctggg gtgggggtgg ggaggtcaac 4801 tcttgaaggt gttgcggttc ctgatgtatt ttgcgctgtg acctctttgg gtattatcac 4861 ctttccttgt ctctcgggtc cctataggtc ccttgagttc tctaaccagc acctctgggc 4921 ttcaaggcct ttcccctccc acctgtagct gaagagtttc cgagttgaaa gggcacggaa 4981 agctaagtgg gaaaggaggt tgctggaccc agcagcaaaa ccctacattc tccttgtctc 5041 tgcctcggag ccattgaaca gctgtgaacc atgcctccct cagcctcctc ccaccccttc 5101 ctgtcctgcc tcctcatcac tgtgtaaata atttgcaccg aaatgtggcc gcagagccac 5161 gcgttcggtt atgtaaataa aactatttat tgtgctgggt tccagcctgg gttgcagaga 5221 ccaccctcac cccacctcac tgctcctctg ttctgctcgc cagtcctttt gttatccgac 5281 cttttttctc ttttacccag cttctcatag gcgcccttgc ccaccggatc agtatttcct 5341 tccactgtag ctattagtgg ctcctcgccc ccaccaatgt agtatcttcc tctgaggaat 5401 aaaatatcta tttttatcaa cgactctggt ccttgaatcc agaacacagc atggcttcca 5461 acgtcctctt cccttccaat ggacttgctt ctcttctcat agccaaacaa aagagataga 5521 gttgttgaag atctcttttc cagggcctga gcaaggaccc tgagatcctg acccttggat 5581 gaccctaaat gagaccaact agggatc // LOCUS MUSP40M 3808 bp ds-DNA ROD 27-JUL-1990 DEFINITION Mouse P40 T-cell and mast cell growth factor (mP40) gene, complete cds. ACCESSION M30136 KEYWORDS T-cell growth factor; cytokine; mast cell growth factor. SOURCE Mouse (strain C57B1/6; haplotype H2b) lymphoid T cell line L1210 DNA, clone lambda-S40.1a. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 3808) AUTHORS Renauld,J.C., Goethals,A., Houssiau,F., Merz,H., Van Roost,E. and Van Snick,J. TITLE Human P40/Il9: Expression in activated CD4+ cells, genomic organization, and comparison with the mouse gene JOURNAL J. Immunol. 144, 4235-4241 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by J.Van Snick, 21-NOV-1989. FEATURES from to/span description pept 644 757 P40 T-cell and mast cell growth factor precursor, exon 1 846 881 P40 T-cell and mast cell growth factor precursor, exon 2 965 997 P40 T-cell and mast cell growth factor precursor, exon 3 2184 2315 P40 T-cell and mast cell growth factor precursor, exon 4 3393 3512 P40 T-cell and mast cell growth factor precursor, exon 5 sigp 644 697 P40 T-cell and mast cell growth factor signal peptide matp 698 757 P40 T-cell and mast cell growth factor 846 881 P40 T-cell and mast cell growth factor 965 997 P40 T-cell and mast cell growth factor 2184 2315 P40 T-cell and mast cell growth factor 3393 3509 P40 T-cell and mast cell growth factor IVS 758 845 mP40 intron A IVS 882 964 mP40 intron B IVS 998 2183 mP40 intron C IVS 2316 3392 mP40 intron D signal 589 594 TATA box binding 227 233 AP-2 binding site binding 475 480 IRF-1 binding site binding 478 484 AP-1 binding site BASE COUNT 1012 a 903 c 869 g 1024 t ORIGIN Chromosome 13. 1 ggatcctcaa ggccaatgct agcaagcaca gttccagcat ccacgtactt acagaaaaca 61 actcatatac tcgcatagag cgacttctca gaggtgtatg tacgaggtag aacttttgtt 121 cctaggcctg acatactgtg agtatctagc aatcaagtaa actgaggctc caatagccag 181 aggaaaaccc caatgagtga aaggcaagtc ttgctttccc ctgatatccc cagtgtgacc 241 ccttcattac cacccctgta actcactgtc tatcagcatt tctcactaat gtggagggga 301 aaacacagac ctgggctttc atggaaagtg ttttgattct cacaaccaga attcctgctt 361 ttaaaggggg ttggggctag atctttaact tttcctttag tatttcagaa cccgactatt 421 tgaagagcat cttttctgag gaatggtatt tcctggcata agacagcctt tgtcaagtga 481 ctcagactga tttttttttc aatatctcaa ttggcctcaa cttacagaga aaaaagtggg 541 cactgggtat cagtttgatg tcagggtttt tcccggtttg aagagcttta aatacagcta 601 gactggaaga tgctggtaga ctgagttcca gactcccgtc aacatgttgg tgacatacat 661 ccttgcctct gttttgctct tcagttctgt gctgggccag agatgcagca ccacatgggg 721 catcagagac accaattacc ttattgaaaa tctgaaggta gatttagggg actggggaga 781 gtcatctgct ggaggatggc aaagggggct caactcactg ggggttatct ctttgcctac 841 tgtaggatga tccaccgtca aaatgcagct gcagcggcaa cgtgagtgaa taatctttag 901 aactttccag acttctctca gtgttcacag ctagaatgct catgctgaga tttctctctt 961 gcaggtgacc agctgcttgt gtctctccgt cccaactgta agtatagtcg gaaaaaggtc 1021 aggggacact gtttacgctt tgaagagaaa tcacgcaact catttgacat tctaaattct 1081 aaattactgt ttaagtggga ctgttaaagt atagacccca cctttaaaac ttatatttat 1141 atatgcatca tacacacaac tatatgttta agctagcatc atgcccttga gaaatctgga 1201 tatttttcag tgtctaaaac gtattgacta tcttctgaga taaatgggca tgtaaatgaa 1261 aaaagaaaag aatattgctt actggttgat ctgtggtaag tccagatttg tgaaacaaac 1321 aaaatagtaa catcaggcag ggatcatcta gatatcacaa acacacacac acacacacac 1381 acacacacac acacacacac acacacacct cctttctctg gcatcttcct actctttccc 1441 cttcccagtc tctatctccc cctccttctg cttctccttg ctggtcttta gtggctccca 1501 aatggttcca actgtcctct gggaagacag tatctaagtt tggtgccgaa tgggacccta 1561 agagagcaac tcccatcctg gcaggaaggg gaaggcagga ggctctggaa ctattctaac 1621 tcctggttat gatgagggag cagtcaccca ggcctgtagc cactgcaggg ctcactcttc 1681 aagttggacc agggaaaagc tgagcttaga accccatgga agtcagaagc cccatttgct 1741 ggagttctat ggctagggga gggtccaatc taccccctcc cttctcgatg cagggttgta 1801 gcttgcggct aacaggagtc ctatgcaaat ggcttgtccc atcagctggc atgagaaaat 1861 gcttttcctt tctctggact ttcaaacact cagccactgg acctggacag ggccaccagg 1921 tgcgagacag atgcagtacc acttttgcta gggcatcagg tctacatttc ctgaggagtc 1981 tcctcaggaa tgtcactggg aaactgacct cacctcaaat tacttaagat agtttcctcg 2041 ggtggccagg aagaagagca tgagtttggg cgacactggt taaattaagt tgctctttaa 2101 aatcctcttg gcgattcttc ctgaaagcag tcctaggcag tttttattat aaatcctttt 2161 tataagtcct tgttatttac caggatgatt gtaccacacc gtgctacagg gagggactgt 2221 tacagctgac caatgccaca cagaaatcaa gactcttgcc tgttttccat cgggtgaaaa 2281 ggatagttga agtcctaaag aacataacgt gtccggtgag ttttttgtct gagtgtgaca 2341 aaagtagggc ctctgggtca gagcctgtgg cagggggcct tgagagcaac aactgtctgc 2401 tcacatccaa gcaaatgaga gaggatgata aagttagagc aggtaggccc aggctggaag 2461 ggggtcaggc aaaaaatgta accaggaaca agatcactgc agtggaacag atctgctctg 2521 gggtctgtgg ttcccctggt taaatggaga ccgttgacac ccaagtggcc aatgtcttct 2581 ctccacctgt tggatagtga aagaattcac ctaattgtaa gcaaaagcca gactcaaaag 2641 agtcagactg taactcatgc agcactaaat catcctaaaa attcacttta ctaatcgagg 2701 atgtgggagc ctctgagtca ggtttgtgtg tgtgtgttgg tttgttgaat gactgaaaag 2761 ttttgtaaaa gaaaagacac caaggtctag gtaagtactt ccacctgccc tactgactat 2821 cagcatacac cctggttcct gaggcagtgg cccgtcgctg tccgttctcc ctgtgtgtcc 2881 tttcatagga gaacaccagc agtcccaagt gctgtggttg gccagcctag tcacatatct 2941 ctgcccctag tggcacctgc cttggctatg ttacgtgagc cattaccctg agcacctgcc 3001 ttagaggcca gctttagctc tgttcgccag ctccataggg aggaatgtga tgcccgggga 3061 aagttatttt tctgatgcta ttgtatttta taagagacaa cgctgtgact taaaccctgg 3121 tctatgtgaa tccaaagtct cactcagaac tacgtgtgtg tacgtgcaca gactcctgga 3181 atcgtgaccc caaattaaca cttacatggt cttaccaaat gctagctagc ccaggatcaa 3241 ccttcttatt tccctcccat ctcattcctc tctcccctgc ccttcaactg gccaccaggt 3301 ttgagggaag ctggtcctgg gctgctagga gtgtattgct gcaaagtggg ctgcggatgc 3361 tcaccctgtt ctgcccactt ctctccctgc agtccttttc ctgcgaaaag ccatgcaacc 3421 agaccatggc aggcaacaca ctgtcatttc tgaagagtct cctggggacg ttccagaaga 3481 cagagatgca aaggcagaaa agccgaccat gaagacagat gctatttatt ctatttattg 3541 aatttacaaa acctcccctc cttaactgtt acagtgaaga aataaactaa gctattctag 3601 accaaggccc ttttgtgtcc ttttgcttca cagttgcttt tggaactatg gggagggaga 3661 agtaaaaggt atctggaata aagatccacg gggagggaga gagacaagct gtctccagac 3721 cctttccccg tgcctactcc ttcctgttgc tcagagcaaa tgaagctccc ggtgttgcat 3781 aggtaggatg gagcaaagac ttggtaat // LOCUS PIGMHDQAA 1105 bp ss-mRNA MAM 27-JUL-1990 DEFINITION Pig MHC class II SLA-DQ alpha chain mRNA (haplotype c), complete cds. ACCESSION M29938 KEYWORDS cell surface glycoprotein; class II gene; integral membrane protein; major histocompatibility complex. SOURCE Miniature pig (haplotype c) adult spleen, cDNA to mRNA, clone ZQ30-1. ORGANISM Sus scrofa Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Artiodactyla; Suiformes; Suidae. REFERENCE 1 (bases 1 to 1105) AUTHORS Hirsch,F., Sachs,D.H., Gustafsson,K., Pratt,K., Germana,S. and LeGuern,C. TITLE Class II genes of miniature swine: III. Characterization of an expressed pig class II gene homologous to HLA-DQA JOURNAL Immunogenetics 31, 52-56 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by C.LeGuern, 17-NOV-1989. FEATURES from to/span description pept 13 777 MHC SLA-DQ alpha chain precursor sigp 13 81 MHC SLA-DQ alpha chain signal peptide matp 82 774 MHC SLA-DQ alpha chain mRNA < 1 1105 MHC SLA-DQ mRNA BASE COUNT 254 a 290 c 255 g 306 t ORIGIN 1 cttgagaaga gcatggtccc aggccgagtt ctgatgtggg gggccctcgc cctgaccacc 61 gtgatgagcg cctgtggagg tgaagacatt gcggccgacc atgttgcctc ctatggctta 121 aatgtctacc agtcttacgg tcccagaggc tactttaccc atgaatttga tggcgacgag 181 caattctacg tggacctgga gaagaaggag actgtctggc ggctgcctct gtttagtgaa 241 tttacaagtt ttgacccgca gggtgcactg aggaacatag ctacgttaaa acataacttg 301 aacattgtga ctaaacgctc caacaacact gcggctgtca ataaggttcc tgaggtgact 361 gtgttttcca agtctccagt gatactgggt cagcccaaca ccctcatctg tcatgtggac 421 agcatctttc ctcctgtgat caacatcacg tggttgaaga acgggcactc tgtcaaaggt 481 ttttctgaga ccagcttcct ctccaaaaat gatcattcct tcctcaagat cagttatctc 541 accttcctcc cttctgatga tgatttttat gactgcaaag tggagcactg gggcctggat 601 aagccacttc tgaaacactg ggaacctgag attccagccc ccatgtcaga gctgacagag 661 actgtggtct gcgccctggg attgatcgtg ggccttgtgg gcatcgtggt gggcactgtc 721 ttcatcattc aaggcctgcg ctcaggtggt ccctctagac accaagggtc cttgtgagtc 781 acactccaga agggaaggag cactgcccgc ctacaagagc tgaagagtgg atgtgctcaa 841 cgacctagaa ctattttctg gccaaattca tcatatacct tctctcttcc tacattcttc 901 ttctcacctc ttctctggaa cttaaggtgc tgtgtcatct cagaattcat atgcccttgg 961 aatcctcttc ctgaccttct aggtcttttt cttttctcag ttgttacctg ttacgggatc 1021 tctgagatat cccactcggc tacctgatcc ctcagtgacc ctaaactaat atgtacatgg 1081 aaacaataaa tcacttcatg aggtt // LOCUS PIGMHDQAB 1174 bp ss-mRNA MAM 27-JUL-1990 DEFINITION Pig MHC class II SLA-DQ alpha chain mRNA (haplotype d), complete cds. ACCESSION M29939 KEYWORDS cell surface glycoprotein; class II gene; integral membrane protein; major histocompatibility complex. SOURCE Miniature pig (haplotype d) adult spleen, cDNA to mRNA, clone 16.2. ORGANISM Sus scrofa Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Artiodactyla; Suiformes; Suidae. REFERENCE 1 (bases 1 to 1174) AUTHORS Hirsch,F., Sachs,D.H., Gustafsson,K., Pratt,K., Germana,S. and LeGuern,C. TITLE Class II genes of miniature swine: III. Characterization of an expressed pig class II gene homologous to HLA-DQA JOURNAL Immunogenetics 31, 52-56 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by C.LeGuern, 17-NOV-1989. FEATURES from to/span description pept 69 836 MHC SLA-DQ alpha chain precursor sigp 69 137 MHC SLA-DQ alpha chain signal peptide matp 138 833 MHC SLA-DQ alpha chain BASE COUNT 273 a 313 c 266 g 322 t ORIGIN 1 ctcagctcca tcctcatcat tgctctacaa ctccgaagag caacagctga gaccaccttg 61 agaagagcat ggtcccaggc cgagttctga tgtggggggc cctcgccctg accgccgtga 121 tgagcgcctg tggaggtgaa gacattgcgg ccgaccatgt tgcctcctat ggcttaaatg 181 tctaccagtc ttacggtccc agcggctatt atacccatga atttgatggc gacgaggaat 241 tctacgtgga cctggggaag aaggagactg tctggcagct gcctctgttt agcaaattta 301 gaagttttga cccacagggt gcactgagga acatagctac ggcaaaacat aatttgaaca 361 tcctgattaa acgttccaac aacaccgcgg ctgtcaatca ggttcctgag gtgactgtgt 421 ttcccaagtc tccagtgatg ctgggtcagc ccaacaccct catctgtcat gtggacaaca 481 tctttcctcc tgtgatcaac atcacgtggt tgaagaacgg gcactctgtc acagaaggtt 541 tttctgagac cagcttcctc tccaaaaatg atcattcctt cctcaagatc agttatctca 601 ccttcctccc ttctgatgat gatttttatg actgcaaagt ggagcactgg ggcctggata 661 agccacttct gaaacactgg gaacctgaga ttccagcccc catgtcagag ctgacagaga 721 cagtggtctg cgccctggga ttgatcgtgg gccttgtggg catcgtggtg ggcactgtct 781 tcatcattca aggcctacgc tcaggtggtc cctctagaca ccaagggtcc ttgtgagtca 841 cactccagaa gggaaggtgc actgcccgcc tacaagagct gaagagtgga tgtgctcaac 901 gacctagaac tattttctgg ccaaattcat catatacctt ctctcttcct acattcttct 961 tctcacctct tctctggaac ttaaggtgct gtgtcatctc agaattcata tgcccttgga 1021 atcctcttcc tgaccttcta ggtttttttt cttttctcag ttgttacctg ttatgggatc 1081 tctgagatat cccactcggc tacctgatcc ctcagtgacc ctaaactaat atgtacatgg 1141 aaacaataaa tcacttcatg aggtttttgt actg // LOCUS PIGMHDQBA 212 bp ds-DNA MAM 27-JUL-1990 DEFINITION Pig MHC class II SLA-DO beta chain gene (haplotype c/c), 3' end. ACCESSION M29944 KEYWORDS cell surface glycoprotein; class II gene; integral membrane protein; major histocompatibility complex. SOURCE Pig (haplotype c/c) adult spleen DNA, clone D26. ORGANISM Sus scrofa Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Artiodactyla; Suiformes; Suidae. REFERENCE 1 (bases 1 to 212) AUTHORS Pratt,K., Sachs,D.H., Germana,S., El-Gamil,M., Hirsch,F., Gustafsson,K. and LeGuern,C. TITLE Class II genes of miniature swine: II. Molecular identification and characterization of B (beta) genes from SLA-c haplotype JOURNAL Immunogenetics 31, 1-6 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by C.LeGuern, 17-NOV-1989. FEATURES from to/span description pept < 1 12 MHC SLA-DO beta chain BASE COUNT 38 a 58 c 52 g 64 t ORIGIN 1 cagccacact aaggtcctag gcgaacgtct ccccctggag cctgaagtgg aactgagtgg 61 cctgggccct ggacaaagtc aaggacgttc aggaggtcga tgttctggat gattcctttc 121 cacgagccct tggaggtgtc ctgaactgat tctagttttg tgttctgtga tcattcatct 181 cttacccttc tgcctttctt cctcctactt gt // LOCUS PP2LEND 266 bp ds-DNA PHG 27-JUL-1990 DEFINITION Bacteriophage P2 left end DNA. ACCESSION M34756 KEYWORDS . SOURCE Bacteriophage P2 DNA. ORGANISM Bacteriophage P2 Viridae; ds-DNA nonenveloped viruses; Myoviridae. REFERENCE 1 (bases 1 to 266) AUTHORS Ziermann,R. and Calendar,R. TITLE Characterization of the cos sites of Bacteriophages P2 and P4 JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by R.Ziermann, 05-JUN-1990. Author address: R.Ziermann University of California Dept. Mol. Cell Biology Berkeley, CA 94720 Email: calendar@garnet.berkeley.edu FEATURES from to/span description site 1 19 cohesive end of phage DNA site 20 266 left end of phage DNA BASE COUNT 41 a 58 c 99 g 68 t ORIGIN Map position 0.0-0.8%. 1 ggcgaggcgg ggaaagcact gcgcgctgac ggtggtgctg attgtatttt ttcagcgtct 61 cagcgcgtcg tgacggcact tagtctgccc gttgaggcgt tgtgtgtctg cggggtgttt 121 tgtgcggtgg tgagcgtgtg aggggggatg acggggtgta aaaaagccgc ccgcaggcgg 181 cgatgttcag tcgttgtcag tgtccagtga gtagttttta aagcggatga cctcctgacc 241 gagccagccg tttatctcgc ggatcc // LOCUS RCABCHH 314 bp ds-DNA BCT 27-JUL-1990 DEFINITION R.capsulata bacteriochlorophyll biosynthesis protein (bchH and F108) genes, 3' end, and 5' end respectively. ACCESSION M34843 KEYWORDS . SOURCE R.capsulata DNA. ORGANISM Rhodobacter capsulatus Prokaryota; Bacteria; Gracilicutes; Anoxyphotobacteria; Purple nonsulfur bacteria. REFERENCE 1 (bases 1 to 314) AUTHORS Yang,Z. and Bauer,C.E. TITLE Rhodobacter capsulatus genes involved in early steps of the bacteriochlorophyll biosynthetic pathway JOURNAL J. Bacteriol. (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by C.E.Bauer, 04-JUN-1990. FEATURES from to/span description pept < 1 79 bacteriochlorophyll biosynthesis protein (bchH) (AA at 2) pept 108 > 314 bacteriochlorophyll biosynthesis protein (F108) BASE COUNT 59 a 87 c 119 g 49 t ORIGIN 1 bp upstream of BamHI site. 1 ggatccggcg acgctggcgg ctttgcaagc cgcggcggac gagctcgaag accggatgga 61 aggcgttgcg gcagaatgag atggacgact acgaaaagga gggccgcatg agcccgcgcg 121 acgatattcc cgacctgaag ggcttcgacg gtgacggcga gggctcggtg caggtgcacg 181 acagcgaaga catcggcctc gatgtcggcg gcgcgcgggt gttctcggtc tatggcaagg 241 gggggatcgg caaatcgacg acttcctcga acctctcggc cgcgttctcg cttctgggca 301 agcgcgtgct gcag // LOCUS TREANTAT 999 bp ds-DNA ORG 27-JUL-1990 DEFINITION T.evansi AnTat 3/3 kinetoplast DNA minicircle. ACCESSION M34848 KEYWORDS . SOURCE T.evansi AnTat 3/3 kinetoplast minicircle DNA. ORGANISM Kinetoplast Trypanosoma evansi Eukaryota; Animalia; Protozoa; Sarcomastigophora; Mastigophora; Kinetoplastida; Trypanosomatina; Trypanosomatidae; Trypanosoma evansi. REFERENCE 1 (bases 1 to 999) AUTHORS Bajyana Songa,E., Paindavoine,P., Viseshakul,N., Muyldermans,S., Steinert,M. and Hamers,R. TITLE Evidence for kinetoplast and nuclear DNA homogeneity in T.evansi JOURNAL Mol. Biochem. Parasitol. (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by E.Bajyana Songa, 05-JUN-1990. FEATURES from to/span description site 367 516 kinetoplast minicircle common region (ORI) BASE COUNT 391 a 78 c 203 g 327 t ORIGIN 1 gaatcagtgt cttttgaggg aagtaaagta atataataga tagaaacata ataataattt 61 aatttgatag tatatacata tcaacaacga caaagagtca gtgaaattag agataaagtt 121 attgtagtta tataattaaa ttaatctatc tattatttat tttcttttat agaggagagg 181 gaataagagg gaaaattcat ggagatacta gggtgagaga gttaatagag taattgtagt 241 tgggaatatg gagtagttat aattatattg gtgaaaggaa agggctaaaa gtcgtgtagt 301 agaatagagg ttgataggaa taagtggtgg aatttgtaga aggggttggt aaaaatctat 361 aaaaattgtt aaaattggct aaaaatcggg ctgaaaaaac ggaaaatctt atgggcggtg 421 caaattttca ccatacacaa acacgtgcta ttttcggggg tttttaggtc cgaggtactt 481 cgagaggggt tggtgtaata cacacacggt ttttctcagg gtttttgagg caattcgcag 541 ttttcctgag gttctcagta cacttaattt ggatttaatt gattctatag agaaaaatag 601 aataatagat aagtaagtaa ttcatgaata tagatatata taattgtaca tataccaaca 661 aacagaataa ctaatgcaca gtgatgatga tagttaatta attatatata aagttctaat 721 ctatctatta ttatatttaa ttgagtgatg tgagaataag gtgatatttc aatcctaaac 781 aaaagaaatg tgtaatagat agaagataat gagaagttaa ttataaatat atcatacaaa 841 ataacaatga tcacagataa tgagtgaata tagatagaga atttaattat tattattgtg 901 tatattgaat tacatattta ttattttatt ttagtatata ggacgcagaa atagcagtat 961 aaaataatgg ataaaagagt ttataggtga agttgaagt // LOCUS VIBSCRRE 1524 bp ds-DNA BCT 27-JUL-1990 DEFINITION V.alginolyticus Scr repressor protein gene (scrR), complete cds, and sucrose transport (scrA) protein, 5' end. ACCESSION M35009 KEYWORDS Scr repressor protein; sucrose transport protein. SOURCE V.alginolyticus DNA. ORGANISM Vibrio alginolyticus Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Facultatively anaerobic rods; Vibrionaceae. REFERENCE 1 (bases 1 to 1524) AUTHORS Blatch,G.L. and Woods,D.R. TITLE Nucleotide sequence and analysis of the Vibrio alginolyticus Scr repressor-encoding gene JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by G.L.Blatch, 10-JUN-1990. Author address: G.L.Blatch University of Cape Town Dept. Microbiology Private Bag Rondebosch 7700 SOUTH AFRICA Email: bltgre01.uctvax@f4.n494.z5.fidonet.org FEATURES from to/span description pept 36 < 1 (c) sucrose transport protein (scrA) pept 271 555 Scr repressor protein (scrR) pept 791 1090 ORF5 binding 264 267 scrR ribosomal binding site binding 780 785 ORF5 ribosomal binding site signal 178 183 scrA -10 region signal 203 208 scrA -35 region signal 190 195 scrR -10 region signal 162 167 scrR -10 region signal 762 767 ORF5 -35 region signal 742 747 ORF5 -35 region site 78 91 repressor operator site (put.) site 229 242 repressor operator site (put.) site 12 71 ScrR helix-turn-helix motif BASE COUNT 466 a 344 c 321 g 393 t ORIGIN 1 agttaacaac tcttttgcta ctgccggata attcatttca taccccattg gattattatc 61 tgagcttgca gaattttcgg gaacgtttgc aaaacagatt tttgtcaata aaaccaaatg 121 atcaaagata aagcagagag aatgtgagaa tgatcgtcaa attctctccc taagagcagt 181 ataggaggtt agaatcgcta cttatcaaag agcctttcat cttctctttg caaacaatcc 241 caaaaacgaa aaacagaatg gataggaaca atggcgagtc tgcatgatgt cgcccgtctg 301 gctggcgtat caaaatcaac agtgtcacgg gttataaacg atgagtatgg cgtaaaagaa 361 gcgacaaaac agaaggtccg ccaagcagta gcggagtgtg gctatgtgcc gaaccaagtc 421 gctaaagatc ttaaagaaga aagctctgcc attattgttc aagacaaaac aggcaaagtg 481 ctgtttgtaa aagaaggcgc acttgagcaa gatgaaattg caaaggtgat cgagctaatt 541 aaacagaaca tctagtctcg ttgcttggct ccttaatatc cgtaatacac aatataagga 601 gccatgcgac caaacttaaa gctgagattt ctgttgcccc tagttgaacg cttagtgatt 661 ccaccaagga gactgtctca gattcggcga catcacatgt gttgacgtaa attttcccac 721 tagccaacaa aagagtgaac ttttactcat tatgggtgtt atattataac agtcggtaag 781 aggagctcag atgtaccacc accagcaaaa gatacggaag cattggcatc gcactgtttt 841 atttttcagt gtcgcgttgc tgatcgcttg gaactttgcg gtaatccttc atcaagttga 901 tctgactccc gaacaccaca cacaccatca ttgccagcta ttttctgggg ttcagcacgg 961 catagccaaa gctcaaccga ccctatcgac gccaacattt acgcgcatcc aataccatga 1021 tgtctttcag cgccttgtta atagtgaaga cattcgtggt gcagctcgtg ccccgcctta 1081 ttttgcttaa tttccaaact attccacact caacataatt tgtgcgtatt cctgcacctc 1141 acgtgtcagc gaatcgcacc ttagtaaata caaatctgat ccggtattgc tcgggttagt 1201 gccagcaaaa ttaggaaaca caaaatgtca tcgaaacata ttttagcgct tgtcattggt 1261 ctctctttat cgacagcagc aaatgcggac gaataccgcc agcacggtgc tcatgtgcat 1321 ggtcacgtcg aatttaatat cgcccaagat ggaaaggatc tgttagtgga gattactgct 1381 cccggtgccg atgtggtcgg tttcgagcat gctccagaaa atgaacaaca agagcaagca 1441 cttaaacaag ccattgccac cctagaagac agcaacacac tgtttgcgat taacgcgcaa 1501 gccaattgtg atattgaaga agct // LOCUS XELNRP1A 2431 bp ss-mRNA VRT 27-JUL-1990 DEFINITION X.laevis nervous system-specific RNA-binding protein (nrp-1A) mRNA, complete cds. ACCESSION M34894 KEYWORDS RNA-binding protein; nervous system-specific protein. SOURCE X.laevis stage 24, cDNA to mRNA, clones 24-39. ORGANISM Xenopus laevis Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Amphibia; Lissamphibia; Anura; Archeobatrachia; Pipoidea; Pipidae. REFERENCE 1 (bases 1 to 2431) AUTHORS Richter,K., Good,P.J. and Dawid,I.B. TITLE A developmentally regulated, nervous system-specific gene in Xenopus encodes a putative RNA-binding protein JOURNAL New Biol. (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by P.J.Good, 05-JUN-1990. FEATURES from to/span description pept 74 1117 pot. RNA-binding protein (nrp-1B) mRNA < 1 2431 nrp-1B mRNA signal 2407 2418 poly-A signal site 22 27 RNP2 consensus sequence site 58 68 RNP1 consensus sequence site 111 116 RNP2 consensus sequence site 147 157 RNP1 consensus sequence site 921 941 alternative exon (put.) BASE COUNT 663 a 522 c 543 g 703 t ORIGIN 1 gtggctacac cgccccttcc aatcctcttg tccgcgggag ctgcctccga tctcaggagt 61 cgccagagac cgaatggaga cagaagcgcc ccagcccgga ctgtcgcccc cggactccgc 121 gcacgatccc tgcaaaatgt tcattggggg actgagctgg cagacaacac aagagggact 181 gcgggagtac ttcagtcatt ttggggatgt aaaagagtgt cttgtcatga gagatccatt 241 gaccaaaaga tcaaggggtt tcggatttgt aactttcatg gaccaagcgg gagtggacaa 301 agttttggct caatcacgac atgagctgga ctcaaagacg attgatccaa aagtagcatt 361 tcctcgtaga gctcaaccca agatggtaac acggacaaag aagatttttg tagggggctt 421 gtcagtgaat acaacagttg aagatgtgaa acagtatttt gagcaatttg ggaaggtgga 481 tgatgccatg ttaatgtttg ataaaacaac aaacaggcac agaggattcg ggtttgtgac 541 ttttgaagga gaggatattg tggagaaaat ttgcgatatt cacttccatg agatcaacaa 601 taagatggtt gagtgtaaga aggcccaacc caaggaggtg atgtcaccaa caggatctgt 661 gagaggccga tctcgggtca tgccctatgg aatggatgct ttcatgctgg gaataggcat 721 gctaggatac ccgggatttc aggcagctac atatgccagt cgcagctaca caggcattgc 781 tcctgggtat acttatcagt tcccagagtt tcgtgtagag agaacccctc tcccgggcgc 841 cccagtcctc ccagaactca cagctatccc cctgacggca tatggtcctg tagcggcagc 901 agctgcagca gtggtgagag gctccactcc aacccgcaca ggggggtttc ttggaacaag 961 cagcccagga ccaatggcag aactttatgg agcagccaat caagaatcgg cagttagcag 1021 ctatatcagt gcagcaagcc cagcacccag cacaggcttt ggacacagcc ttgggggtcc 1081 tctgattgcc actgctttca ccaacggcta ccactgaaac agcaagtcac tgcagaggct 1141 tgagtctgga agcatctctt gcacgattct gttcctcccc tctctgaata tcacatggct 1201 gtacctcacc ttgtgttcct gcacagtaac tggtgcctcc tgcacaacct tcacctagga 1261 atttcttatt tatttctatt tttgttttgg ggtttttttt tcattctcct tttattagat 1321 ttgtgttttg tccaaattag aaatcaggga gacatttcca taatcttcat ggttattctg 1381 ttctggtaag atgcagagaa tcctcggtac gaaacgattc tgcagagaga gaggggagat 1441 tgtatttcga ctgtctggaa atgtgttttt tttcttcttt tttttttata tttctatgta 1501 aatttcggct cttaaaaaca tgtaaaactg aaatttggag acaacagtgc acaagaacaa 1561 gacaaatgta tattttgcta aatgaaaaag aactatttgt agaacagaaa ttattttaat 1621 tttctcaaag ctggtttgaa gttttttttt ttctgtgtaa aatatgtaaa tattgcagac 1681 tgattgttct gccaccggaa aggtctcttt ggcccagatt atcttttttg cctacttggg 1741 cctgtgtttt agcaatattt tcctaaagcc tgaatggatt gtcgggtttt tttatacatc 1801 ttttattatg atggatgtca gtccacagga atatgatgag ttactctgat gcaggggaat 1861 gcagatgcca tgctgcaaaa cttcttactc tttagcctct gataattaac tctttaaggc 1921 tcacttgtcc tttgctcatg gatataatga gagggctata tggagggcta taaggtggga 1981 ctaacctcag agaaccagaa aagctacagt aaagtgctgt tacacactga catacttcct 2041 cagaaccctc agaaaagtaa gtgtatgccc agtcccgatt tatgaactgt ctgccctttg 2101 atctgctctg cactctttat agttttgggt cctataaatt aaatgcagat ctaccttttc 2161 tttatgtgca gtgtgtaagt ccacattttt tgttatgagg aatcttctta acaataaaat 2221 catgtggaat tagcccccaa ccaaaccccc cccacaaact gtctctcacg tgagcaaagg 2281 aaggtgctac actataaaga gtttcttatc actgggccac acacattact gcaatgtaca 2341 atgtactgca acgtcaaagc aataatcttt gttactgcaa ctggactttt ttaaaagata 2401 gttcaaaata aactatgaat tttattcatt t // LOCUS XELNRP1B 2324 bp ss-mRNA VRT 27-JUL-1990 DEFINITION X.laevis nervous system-specific RNA-binding protein (nrp-1B) mRNA, complete cds. ACCESSION M34895 KEYWORDS RNA-binding protein; nervous system-specific protein. SOURCE X.laevis stage 24, cDNA to mRNA. ORGANISM Xenopus laevis Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Amphibia; Lissamphibia; Anura; Archeobatrachia; Pipoidea; Pipidae. REFERENCE 1 (bases 1 to 2324) AUTHORS Richter,K., Good,P.J. and Dawid,I.B. TITLE A developmentally regulated, nervous system-specific gene in Xenopus encodes a putative RNA-binding protein JOURNAL New Biol. (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by P.J.Good, 05-JUN-1990. FEATURES from to/span description pept 29 1051 pot. RNA-binding protein (nrp-1B) mRNA < 1 2324 nrp-1B mRNA signal 2306 2311 poly-A signal site 22 27 RNP2 consensus sequence site 58 68 RNP1 consensus sequence site 111 116 RNP2 consensus sequence site 147 157 RNP1 consensus sequence BASE COUNT 647 a 467 c 520 g 690 t ORIGIN 1 tccgatctca ggagtcgcca gagaccgaat ggagacagaa gcgccccagc ccggactgtc 61 gcccccggac tccgcgcacg atccctgcaa aatgttcatt gggggactga gctggcagac 121 aacacaagag ggcctgcggg agtactttag tcattttggg gatgtaaaag agtgtcttgt 181 catgagagat ccattgacca aaagatcaag gggttttgga ttcgtaactt tcatggacca 241 agcaggagtg gacaaagttt tggctcaatc acgacatgag ctggactcaa agacgattga 301 cccaaaagta gcatttcctc ggagagctca acccaagatg gtaacacgga caaagaagat 361 ttttgtaggg ggcttgtcag tgaatacaac agttgaagat gtgaaacaat attttgagca 421 gtttgggaag gtggatgatg ccatgttaat gtttgataaa acaacaaaca ggcacagagg 481 gttcgggttt gtgacttttg aaggagagga tattgtggag aaagtttgtg atatacactt 541 ccatgagatc aacaataaaa tggttgagtg taagaaggcc cagcccaagg aagtgatgtc 601 accaacaggg tctgtgagag gccgatctcg ggtcatgcca tatggaatgg atgccttcat 661 gttgggcata ggcatgctag gatacccggg atttcaggct gctacatatg ccagtcgcag 721 ctacacaggc attgcacctg gatatactta tcagttccca gagtttcgtg tagagagaac 781 ccctctcccg ggcgccccag tcctcccaga actcacagct atccccctga ctgcatatgg 841 tcccgtagca gcagctgctg cagcagtggt gagagggggg tttcttggaa caagcagccc 901 agggccaatg gcagaacttt atggagcagc caatcaagaa tcagcagtta gcagctatat 961 cagtgccgca agcccagcac ccagcacagg ctttggacac agtcttgggg gccctctgat 1021 tgccactgct tttaccaacg gctaccactg aaacagcaag tcactgcaga ggcttgagtg 1081 tggaagcatc tcttgcacga ttctgttcct cccctctctg aatatcagat ggctgtacct 1141 caccttgtgt tcctgcacag taactggtgc ctcctgcaca accttcacct aggaatttat 1201 ttatttcttt tttttttttt tttaatcctt ctttattaga tttgtgtttt gtccaaatta 1261 gaaatcaggg agacatttcc gaaatattca tggtattttg ttttggtaag atgcggagaa 1321 tcctcggtac taaatgattt tgcagaggga gaggggagat tgtatttcgc ctgtctggaa 1381 atgtgttttt tttttttcct tctttcctat tttttttatc ttatattttt atgtaaatct 1441 cggctcttaa aaacatgtaa aactgaaatt tgaagacaac agtgcacaag aacaagacaa 1501 atgtatattt tgctaaatga aaaaaaaact atttgtagaa cagaaattat tttaattttc 1561 tcaaagctgg tttgaagttt tttttttttt tctttgtaaa atatgtaaat attgcagact 1621 gattgttctg ccaccggaag gtctctttgg cccagattat cttttgccta cttgggcctg 1681 tgttttagca atattttcct aaagcctgaa tggattgtca ggttttttta tacatctttt 1741 attataatgg atgtcagtcc acaggaatat gatgcagggg aatgcagaac ttcttactct 1801 ttagcctctg ataattaact ctttaagttt cacatgtcct ttgctcatgg atataaggag 1861 agggctgtaa agtggggcta acctcagaga atcagctcca gggcacagga tttcggtgag 1921 ctacaaagct gctgtaaaat accgttacac actgacatac ttaggaatcc tcagcaaagt 1981 aaatgtatga actgtctgct ctttgatccg ctctgcactc tttatagttt tggagtacta 2041 taaattaaat gcagatctag ttcttctata tgtgcagtgt gcaagtccat ttgtgggcac 2101 ccactctcat tatggggcat cttttcaaca ataaaatcat atagaaattc ccatgaagtt 2161 gtcgctcacg tgagataagg aagttgcaac actataaaga gtttcttatc actgggccat 2221 atacaatact gcaatgtaca atgtactgca aattcaaagc aataatcttt gttactgcaa 2281 ctggactttt ttaaagatag ttcaaaataa attgaatttt attc // LOCUS YSJSACDHY 2569 bp ds-DNA PLN 27-JUL-1990 DEFINITION Y.lipolytica saccharopine dehydrogenase (LYS5) gene, complete cds. ACCESSION M34929 KEYWORDS saccharopine dehydrogenase. SOURCE Y.lipolytica (strain W29) DNA. ORGANISM Yarrowia lipolytica Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes; Endomycetales; Saccharomycetaceae. REFERENCE 1 (bases 1 to 2569) AUTHORS Xuan J,-W., Fournier,P.M., Declerck,N., Chasles,M. and Gaillardin,C. TITLE Overlapping reading-frames at the LYS5 locus in the yeast Yarrowia lipolytica JOURNAL Mol. Cell. Biol. (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by P.M.Fournier, 06-JUN-1990. ORF1 is probably not translated, since a lacZ fusion/expression experiment did not produce results. FEATURES from to/span description pept 649 1929 ORF1 pept 1764 655 (c) saccharopine dehydrogenase mRNA 582 2060 ORF1 mRNA (alt.) mRNA 598 2060 ORF1 mRNA (alt.) mRNA 599 2060 ORF1 mRNA (alt.) mRNA 602 2060 ORF1 mRNA (alt.) mRNA 606 2060 ORF1 mRNA (alt.) mRNA 609 2060 ORF1 mRNA (alt.) mRNA 1807 639 (c) ORF2 mRNA (alt.) mRNA 1809 639 (c) ORF2 mRNA (alt.) mRNA 1811 639 (c) ORF2 mRNA (alt.) mRNA 1812 639 (c) ORF2 mRNA (alt.) mRNA 1814 639 (c) ORF2 mRNA (alt.) signal 521 528 TATA box signal 565 575 TATA box BASE COUNT 668 a 573 c 666 g 662 t ORIGIN 1 gagtctggaa cggtccctct tctcagaatt gtgaacgaaa ggaacaggtt caacgaacag 61 aacagattcc atagccacac ctttgcctat ggcttcacaa ccgaaggcaa ttcgagaggt 121 cgcgcttatg gaatcgactc gtataaagct gaagggaaag ggagacgttc cgagcgctca 181 gatgcaatag tcgtccagct aatgtggatt caaaaacaac cccaacagta atcttgaaaa 241 tttgaacgga tcaatctgaa cactcttgct ccaggtcatt cttctaacgc acatccccag 301 agtctagagg gagttgtgtt gtgaacatcc taataaacaa tgcaatggat tcgggatatc 361 ttctgtctcg ccccctactc gatgtcgagt aaaccgatca ccaactaaca atactcctcc 421 gcgttctgcc attgactctc aaacagacat cgctatcaac ggaacagcat attttagctt 481 cttaggacaa taaatattga taatgccggc tctccctcgg tatattaagc aatccattca 541 tacactcatt catcaggtta attttatata tataatttgt ctattcaaac accgtaaatt 601 actggtacca tcatctcctc cttttcaaat acacgtctat ttgcattaat gaaattactc 661 gccaattcgc agaacgtgtt tgtcgaacag agccttagct cgggtccaga caggagcagt 721 gtctcgctga ggaagctgca ggagagaagg cagcagagcc tcagaaaagg cctcggaagc 781 ctctcgcggc agaagagagg gcaggtggtc gatcgagcac acggacagct tggggccagc 841 agtggtctca acaggcacgg tgggatggtc gaacgtggta gcaattgtgt acacggggac 901 agggttgtgg gggttggtgg tgtcagcaga gacgtcgaca atgacactga gctttcgggt 961 ctccttgttg agcagatcgt agttgatgaa aggaggaatg ggctgagaca ggtagatgca 1021 gttgatgaag atatccgcgt cagcaatctc ttggaagggt ccacccttct tggtctcgtt 1081 catgtcccat cgaatgatgt tctcttcggg gataccgacc tttcgggcca gatcaatggc 1141 accggatcca catcggccaa gagcaccaat caccaggacg gtagggagct tggagccctt 1201 ctcaacagca gcggcaagat ccttcttaat cttgtcgaca agctcggtct cattgggata 1261 ggcagagaca ccgggcaggt tttcgctgtc ggggtgggtc tgctggaagg cccaagtctc 1321 gacaccgatg gcggcaccgg cgaatccagc gtggaagcca aaggcggcaa ctcgtcgtcc 1381 attgtcatcc tccaggaact caaggtcgta cagagttccg tttcctgcgg ggaatcggct 1441 cagaacgtcc ttccatccgc cctgatcctt gtagcagtga gcaaactgga tgtgctcgtg 1501 agacagaggg aaagattcct caggcagctc cttaagacca ataatcattc ggtcctcggg 1561 ggcagagacc caagagccct cctcgacaag agtggctccg acatcgacaa actcctggtc 1621 gtcgaagatt cgaaggggag acttctccac aaagacctcg aatccagcat caagcagctt 1681 tcgggtggta gtaggcgtga gagcagatcg gtgctcgaga ggcttggtct cggctcggag 1741 atggagcttc actggtgcgg tcattgtgtc tgtgttgttg tatgtggtga agtgacaatg 1801 gatggtgtgt cgtatcaggc tggggaacaa ttgtgcttaa gtatgctgca gttgagtaag 1861 agtcatcgat ccaccaaaat aaagtttgcc attagggttg gagagagaga tggtggctgg 1921 aagaattaaa tgacatcaag ctgaggattg tgggtgtgca ataacacatg ttaggggtga 1981 cctgtggctc gaaatctgat aattattttg taactttatg attattctta gattttttaa 2041 tattcctcta tataacacat aagtagctgt cgtctagttg ttcatagcct gactcctgca 2101 atagattagt gcagagtgat tttgtgcaat tgagagccac ggttgagtca agtgactttg 2161 tgtgtgaagt catcttacgt ttcaagtctc acaggttact caattggttg gttgtctgcc 2221 ctttacagat atttacagta cctgagcgta aagtcgttca tccacggaat gactgttcct 2281 gtcacgcagt catgatcatg gatgtggctg gtcaggaacc attttggata ggagacttag 2341 ggattggact attattgaaa aaactgagcc gaatatgata tagttctatt tgaatgcaga 2401 acttctgatg gtcaattcac ttatttcagg catatcggtc atggtggcag ctgccacgat 2461 gttatctcgt tggaaacctc gtccgccatt aaccggaaca tgcttctaga agatgagaat 2521 acagcatgta cttgtacaag attctcggca ttgatggtct caggcatgc // LOCUS HUMMHDQ1A 1515 bp ds-DNA PRI 27-JUL-1990 DEFINITION Human MHC class II HLA-DQ-alpha gene, regulatory region. ACCESSION M33765 KEYWORDS cell surface antigen; cell surface glycoprotein; class II gene; major histocompatibility complex. SOURCE Human (DR4, DR4) DNA, clone H11A. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 1515) AUTHORS Ombra,M.N., Del Pozzo,G., Perfetto,C., Maffe,A. and Guardiola,J. TITLE Effect of the AIR-I locus on the activation of an enhancerless HLA-DQA1 promoter JOURNAL Immunogenetics 31, 368-376 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by J.Guardiola, 18-APR-1990. FEATURES from to/span description pre-msg 655 > 1515 HLA-DQA1 mRNA and introns /hgml_locus_uid="LY0151Z" /nomgen="HLA-DQA1" /map="6p21.3" IVS 785 > 1515 HLA-DQA1 intron A BASE COUNT 441 a 322 c 340 g 412 t ORIGIN Chromosome 6p21.3. 1 atataactga tgacctcatc tatccatgag gctgctcacg aaatgccctc ccctggccag 61 gcacggtggc tcatgcctgt aatccaagca ctttgggagg ctgaggcagg cagatcatga 121 ggtcaggagt tcaagaccag cgtggccaac atagtgaaac tctgtctcta ctaaaaatac 181 agaattagcc gggcatggtg gcagcgctgc agtcaccagc tactcgggag ggtgaggcag 241 gagaatcgct tgaacctggg aggcaaaggt tgcagtgagc cgagaccatg ccattgtacc 301 tcagcctggg tgacagagtg agactacgtc tcgaaaagaa aagaaaaaga aaagaaaaag 361 aaaaaaaaag aaaaatgccc atccctcttg cgaatggcag acatgcacac accagagaag 421 attccaattt agtgtcctcc ctctgttcat agaacaattc ctcaagtcca ctctgagtag 481 aggctgcatc acaacaaggg gattgccctg tctccttcca gggctcttaa tacaaactct 541 tcagctagta actgagatgt caccatgggg gatttttcta attggccaaa acctgacctg 601 gcagggtttg gtttgggtgt cttcagattt ccttgtcttg aggccctcac aattgctcta 661 cagctcagaa cagcaactgc tgaggctgcc ttgggaagag gatgatccta aacaaagctc 721 tggatgctgg gggccctcgc cctgaccacc gtgatgagcc cttgtggagg tgaagacatt 781 gtgggtgagt gcatgagtga ggaaatgttc tctggagctg aaaaacagta aattgaagga 841 aaagagagaa agcgatttgc agagaaattg tagagatttc ctaagaccct ttcactatta 901 agagaaatta aaaattatag ctgttcctcc ttcaggaaac cagagaccca atccactctt 961 tttgttatgt atgcttttgt gttcactaag gatgctattc tgtttatatt atattcagtg 1021 acaacagcct ggaggtctct atgtcgttcc gtcatgattg cctcaaaaat tagtgaagtt 1081 tccatcagtg gaataatttt ttattattaa aaatgtatga agtgtcattc tcaaatttcc 1141 ctgaacaact tttgaagctt ttcgtatgtc tcctgtagta gatcttgggg tcgttccatc 1201 aattatatac tctatagata ttaaaaaagt tgcccgtttc tttctctcag acttactcac 1261 atttccacat gggaactggc acaggtgggg agtgggtaaa ggagtccagc aggctgaatg 1321 ccttcaacaa tcattttacc acatggtcct cacttactct cagctgcctc atatgtgtca 1381 cctcacaaaa taatcaaaat aaaaatgggc atgtagctaa gctttgtaaa tagtgaaaac 1441 atggaatgtc aattgttttt acatatttct attacaggta tagcttcaca tttttcttta 1501 gcaaaataag ggatc // LOCUS BSUAMYEA 727 bp ds-DNA BCT 27-JUL-1990 DEFINITION B.subtilis alpha-amylase (amyE) gene, 5' end. ACCESSION M35517 KEYWORDS alpha-amylase. SOURCE B.subtilis (strain 2633) DNA. ORGANISM Bacillus subtilis Prokaryota; Bacteria; Firmicutes; Endospore-forming rods and cocci; Bacillaceae. REFERENCE 1 (bases 1 to 727) AUTHORS Emori,M., Tojo,T. and Maruo,B. TITLE Molecular cloning and expression of an alpha-amylase gene from an alpha-amylase extrahyper producing Bacullis subtilis JOURNAL Agric. Biol. Chem. 52, 399-406 (1988) STANDARD simple staff_review FEATURES from to/span description pept 599 > 727 alpha-amylase structural protein (amyE) BASE COUNT 207 a 125 c 179 g 216 t ORIGIN 1 gatcatccgc ggcattatgt ttgaatttcc gtttaaagaa tgggctgcaa gcttgttcat 61 cattatctta tattactgca tcagggctgc ggcatccgga atgctcatgc cgagaataga 121 caccaaagaa gaactgcaaa aacgggtgaa gcagcagcga atagaatcag ttgcggtcgc 181 ctttgcggta gtggtgctta cgatgtacga cagggggatt ccccatacat tcttcgcttg 241 gctgaaaatg attcttcttt ttatcgtctg cggcggcgtt ctgtttctgc ttcggtatgt 301 gattgtgaag ctggcttaca gaagagcggt aaaagaagaa ataaaaaaga aatcatcttt 361 tttgtttgga aagcgaggga agcgtacaca gtctcgggca gtttttttat aggaacattg 421 atttgtattc actctgccaa gttgttttga tagagtgatt gtgataattt aaaatgtaag 481 tgttaacaaa attctccagt cttcacatca gtttgaaagg aggaagcgga agaatgaagt 541 aagagggatt tttgactccg aagtaagtct tcaaaaaatc aaataaggag tgtcaagaat 601 gtttgcaaaa cgattcaaaa cctctttact gccgttattc gctggatttt tattgctgtt 661 ttatttggtt ctggcaggac cggcggctgc gagtgctgaa acggcgaaca aatcgaatga 721 gcttaca // LOCUS CHKCKMB 274 bp ds-DNA VRT 27-JUL-1990 DEFINITION Chicken b-creatine kinase gene, 3' end. ACCESSION M35381 KEYWORDS B-creatine kinase. SOURCE Chicken brain DNA, clone CK-B. ORGANISM Gallus gallus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Aves; Neornithes; Neognathae; Galliformes; Phasianidae. REFERENCE 1 (bases 1 to 274) AUTHORS Kwiatkowski,R.W., Ehrismann,R., Schweinfest,C.W. and Dottin,R.P. TITLE Accumulation of creatine kinase mRNA during myogenesis: Molecular cloning of a b-creatine kinase cDNA JOURNAL Dev. Biol. 112, 84-88 (1985) STANDARD simple staff_review FEATURES from to/span description pept < 1 51 M-creatine kinase (AA at 1) 97 243 M-creatine kinase BASE COUNT 60 a 81 c 75 g 58 t ORIGIN 1 ctgcagaaac gaggcacagg tggtgtggac acagctgcag ccgaagcctc ttgaggagcc 61 cccccccccc cccccccccc tgcagtcgga ggggtgtttg atgtatccaa tgctgatcgt 121 cttggcttct ctgaggtaga gctggtgcag atggtggtag atggtgtgaa gctgctcatt 181 gaaatggaaa aacgccttga gaaaggccag tccattgatg acctcatgcc agctcagaaa 241 taaagcactt tattctcccc ccgcccccct gcag // LOCUS CHKCMYBA1 172 bp ds-DNA VRT 27-JUL-1990 DEFINITION Chicken c-myb gene, exon 1. ACCESSION M35506 KEYWORDS c-myb protein. SEGMENT 1 of 4 SOURCE Chicken hematopoietic cell DNA. ORGANISM Gallus gallus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Aves; Neornithes; Neognathae; Galliformes; Phasianidae. REFERENCE 1 (bases 1 to 172) AUTHORS Soret,J., Vellard,M., Martinerie,C. and Perbal,B. TITLE Organization of 5'-proximal c-myb exons in chicken DNA: Implications for c-myb tissue-specific transcription JOURNAL FEBS Lett. 232, 227-234 (1988) STANDARD simple staff_review FEATURES from to/span description pept 140 + 162 thymic c-myb protein, exon 1 IVS 163 > 172 thymic c-myb protein intron A BASE COUNT 16 a 73 c 62 g 21 t ORIGIN 1 aacctgtttc ctcctcctcc tcccgcggct cctcggcact tcgctgcgcg gcggcggagc 61 gcctgggcgc agcccggcag cagggcgcgg cggcgggcgc gggctccccc gtcctcttcc 121 cccagcggcc gccgcgagga tggcccggag accccggcac aggtaacggg gc // LOCUS CHKCMYBA2 290 bp ds-DNA VRT 27-JUL-1990 DEFINITION Chicken c-myb gene, exon 2. ACCESSION M35507 KEYWORDS c-myb protein. SEGMENT 2 of 4 SOURCE Chicken hematopoietic cell DNA. ORGANISM Gallus gallus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Aves; Neornithes; Neognathae; Galliformes; Phasianidae. REFERENCE 1 (bases 1 to 290) AUTHORS Soret,J., Vellard,M., Martinerie,C. and Perbal,B. TITLE Organization of 5'-proximal c-myb exons in chicken DNA: Implications for c-myb tissue-specific transcription JOURNAL FEBS Lett. 232, 227-234 (1988) STANDARD simple staff_review FEATURES from to/span description pept + 163 + 280 thymic c-myb protein, exon 2 pep$ 200 + 280 yolk sac c-myb protein, exon 1 IVS < 1 162 thymic c-myb protein intron A IVS 281 > 290 thymic c-myb protein intron B IVS 281 > 290 yolk sac c-myb protein intron A BASE COUNT 78 a 59 c 71 g 82 t ORIGIN 1 aatactagaa caacagaatg cagaacaaaa tcttgttgtg caagttttaa aacttttatc 61 tgtgtaacct ttgtaaagat tgtgtgtttt ctgtgtgctg tgtaattaag tagctccagt 121 cctcacgttc cgctctgtgt ctcgctgtgt ttcctgctgc agcatataca gcagcgatga 181 cgatgaagaa gatgttgaga tgtacgacca cgattacgac ggcctgcttc ctaaggctgg 241 gaaacgtcac ctagggaaaa ccaggtggac ccgtgaagag gtaattgccg // LOCUS CHKCMYBA3 92 bp ds-DNA VRT 27-JUL-1990 DEFINITION Chicken c-myb gene, exon 3. ACCESSION M35508 KEYWORDS c-myb protein. SEGMENT 3 of 4 SOURCE Chicken hematopoietic cell DNA. ORGANISM Gallus gallus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Aves; Neornithes; Neognathae; Galliformes; Phasianidae. REFERENCE 1 (bases 1 to 92) AUTHORS Soret,J., Vellard,M., Martinerie,C. and Perbal,B. TITLE Organization of 5'-proximal c-myb exons in chicken DNA: Implications for c-myb tissue-specific transcription JOURNAL FEBS Lett. 232, 227-234 (1988) STANDARD simple staff_review FEATURES from to/span description pept + 11 + 82 thymic c-myb protein, exon 3 pep$ + 11 + 82 yolk sac c-myb protein, exon 2 IVS < 1 10 thymic c-myb protein intron B IVS 83 > 92 thymic c-myb protein intron C IVS < 1 10 yolk sac c-myb protein intron A IVS 83 > 92 yolk sac c-myb protein intron B BASE COUNT 32 a 16 c 24 g 20 t ORIGIN 1 aatgatccag gatgagaaac tgaagaaact tgtggaacag aatggcacag aagactggaa 61 agtcattgcc agtttccttc ctgtaggtag ac // LOCUS CHKCMYBA4 34 bp ds-DNA VRT 27-JUL-1990 DEFINITION Chicken c-myb gene, exon 4. ACCESSION M35509 KEYWORDS c-myb protein. SEGMENT 4 of 4 SOURCE Chicken hematopoietic cell DNA. ORGANISM Gallus gallus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Aves; Neornithes; Neognathae; Galliformes; Phasianidae. REFERENCE 1 (bases 1 to 34) AUTHORS Soret,J., Vellard,M., Martinerie,C. and Perbal,B. TITLE Organization of 5'-proximal c-myb exons in chicken DNA: Implications for c-myb tissue-specific transcription JOURNAL FEBS Lett. 232, 227-234 (1988) STANDARD simple staff_review FEATURES from to/span description pept + 11 > 34 thymic c-myb protein, exon 4 pep$ + 11 > 34 yolk sac c-myb protein, exon 3 IVS < 1 10 thymic c-myb protein intron C IVS < 1 10 yolk sac c-myb protein intron B BASE COUNT 10 a 6 c 8 g 10 t ORIGIN 1 atttcattag aatcggacag atgttcagtg ccag // LOCUS MUSFOS 3673 bp ds-DNA ROD 27-JUL-1990 DEFINITION Mouse c-fos gene; cellular homolog to viral oncogene. ACCESSION J00370 KEYWORDS fos oncogene; oncogene. SOURCE Mouse DNA, clones pc-fos(mouse)-1 and lambda-c-fos(mouse)-2. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 3548) AUTHORS Van Beveren,C., van Straaten,F., Curran,T., Mueller,R. and Verma,I.M. TITLE Analysis of FBJ-MuSV provirus and c-fos (mouse) gene reveals that viral and cellular fos gene products have different carboxy termini JOURNAL Cell 32, 1241-1255 (1983) STANDARD full staff_review REFERENCE 2 (bases 2799 to 3673) AUTHORS Meijlink,F., Curran,T., Miller,A.D. and Verma,I.M. TITLE Removal of a 67-base-pair sequence in the noncoding region of protooncogene fos converts it to a transforming gene JOURNAL Proc. Natl. Acad. Sci. U.S.A. 82, 4987-4991 (1985) STANDARD simple staff_review COMMENT A TATA box was found at bp 101-106 and a poly-A signal at 3493-3498. [1] also reported the proviral FBJ-MuSV sequence, which is found in the viral section of the data base. FEATURES from to/span description pept 284 424 c-fos protein exon 1 1179 1430 c-fos protein exon 2 1836 1943 c-fos protein exon 3 2061 2702 c-fos protein exon 4 pre-msg 133 > 2702 c-fos protein mRNA IVS 425 1178 c-fos protein intron A IVS 1431 1835 c-fos protein intron B IVS 1944 2060 c-fos protein intron C variant 2856 2856 g in [1]; c in [2] variant 3135 3136 gt in [1]; gtt in [2] BASE COUNT 817 a 987 c 974 g 895 t ORIGIN 997 bp upstream of EcoRI site. 1 gagttgacga cagagcgccc gcagagggcc ttggggcgcg cttccccccc cttccagttc 61 cgcccagtga cgtaggaagt ccatccattc acagcgcttc tataaaggcg ccagctgagg 121 cgcctactac tccaaccgcg actgcagcga gcaactgaga agactggata gagccggcgg 181 ttccgcgaac gagcagtgac cgcgctccca cccagctctg ctctgcagct cccaccagtg 241 tctacccctg gaccccttgc cgggctttcc ccaaacttcg accatgatgt tctcgggttt 301 caacgccgac tacgaggcgt catcctcccg ctgcagtagc gcctccccgg ccggggacag 361 cctttcctac taccattccc cagccgactc cttctccagc atgggctctc ctgtcaacac 421 acaggtgagt ttggctttgt gtagccgcca ggtccgcgct gagggtcgcc gtggaggaga 481 cactggggtg tgactcgcag gggcgggggg gtcttccttt ttcgctctgg agggagactg 541 gcgcggtcag agcagcctta gcctgggaac ccaggacttg tctgagcgcg tgcacacttg 601 tcatagtaag acttagtgac cccttcccgc gcggcaggtt tattctgagt ggcctgcctg 661 cattcttctc tcggccgact tgtttctgag atcagccggg gccaacaagt ctcgagcaaa 721 gagtcgctaa ctagagtttg ggaggcggca aaccgcggca atcccccctc ccggggcagc 781 ctggagcagg gaggagggag gagggaggag ggtgctgcgg gcgggtgtgt aaggcagttt 841 cattgataaa aagcgagttc attctggaga ctccggagca gcgcctgcgt cagcgcagac 901 gtcagggata tttataacaa accccctttc gagcgagtga tgccgaaggg ataacgggaa 961 cgcagcagta ggatggagga gaaaggctgc gctgcggaat tcaagggagg atattgggag 1021 agcttttatc tccgatgagg tgcatacagg aagacataag cagtctctga ccggaatgct 1081 tctctctccc tgcttcatgc gacactaggg ccacttgctc cacctgtgtc tggaacctcc 1141 tcgctcacct ccgctttcct ctttttgttt tgtttcagga cttttgcgca gatctgtccg 1201 tctctagtgc caactttatc cccacggtga cagccatctc caccagccca gacctgcagt 1261 ggctggtgca gcccactctg gtctcctccg tggccccatc gcagaccaga gcgccccatc 1321 cttacggact ccccacccag tctgctgggg cttacgccag agcgggaatg gtgaagaccg 1381 tgtcaggagg cagagcgcag agcatcggca gaaggggcaa agtagagcag gtgagcagcg 1441 attctggacc tttgtgggct gggggggggg gggggggcgg agactgacgc acagaccaca 1501 caacagagaa gggacgctac tgactgcact tcctgaccag gagctgtggc tgctagccct 1561 ttccctccct tgtcagattt tgacagttgg acccaagaca aactctagac agtttccctg 1621 acagcttcct acttcattct ctagccgggg agcttctttg ttcccctgct aaagatctca 1681 ctttaaatgc aaatcacact ctgcctgcca actgcaggtt agaaaaactg cttcaccgag 1741 aggtgcgggt gctgtaggag ccagtttcac tggggtgact gaatggaggt gacactagac 1801 aaccttaact gaatgttggt ccttttcttc tatagctatc tcctgaagag gaagagaaac 1861 ggagaatccg aagggaacgg aataagatgg ctgcagccaa gtgccggaat cggaggaggg 1921 agctgacaga tacactccaa gcggtaggtt gaaccagctg ctgctcctga aactttatta 1981 aagttggagc ttgggactat gggcgcaggg tccttgagca tgcccgtgtc ttatgctttc 2041 ttatatctct ccctatgcag gagacagatc aacttgaaga tgagaagtct gcgttgcaga 2101 ctgagattgc caatctgctg aaagagaagg aaaaactgga gtttattttg gcagcccacc 2161 gacctgcctg caagatcccc gatgaccttg gcttcccaga ggagatgtct gtggcctccc 2221 tggatttgac tggaggtctg cctgaggctt ccaccccaga gtctgaggag gccttcaccc 2281 tgccccttct caacgaccct gagcccaagc catccttgga gccagtcaag agcatcagca 2341 acgtggagct gaaggcagaa ccctttgatg acttcttgtt tccggcatca tctaggccca 2401 gtggctcaga gacctcccgc tctgtgccag atgtggacct gtccggttcc ttctatgcag 2461 cagactggga gcctctgcac agcaattcct tggggatggg gcccatggtc acagagctgg 2521 agcccctgtg tactcccgtg gtcacctgta ctccgggctg cactacttac acgtcttcct 2581 ttgtcttcac ctaccctgaa gctgactcct tcccaagctg tgccgctgcc caccgaaagg 2641 gcagcagcag caacgagccc tcctccgact ccctgagctc acccacgctg ctggccctgt 2701 gagcagtcag agaaggcaag gcagccggca tccagacgtg ccactgcccg agctggtgca 2761 ttacagagag gagaaacacg tcttccctcg aaggttcccg tcgacctagg gaggacctta 2821 cctgttcgtg aaacacacca ggctgtgggc ctcaaggact tgcaagcatc cacatctggc 2881 ctccagtcct cacctcttcc agagatgtag caaaaacaaa acaaaacaaa acaaaaaacc 2941 gcatggagtg tgttgttcct agtgacacct gagagctggt agttagtaga gcatgtgagt 3001 caaggcctgg tctgtgtctc ttttctcttt ctccttagtt ttctcatagc actaactaat 3061 ctgttgggtt cattattgga attaacctgg tgctggattg tatctagtgc agctgatttt 3121 aacaatacct actgtgttcc tggcaatagc gtgttccaat tagaaacgac caatattaaa 3181 ctaagaaaag ataggacttt attttccagt agatagaaat caatagctat atccatgtac 3241 tgtagtcctt cagcgtcaat gttcattgtc atgttactga tcatgcattg tcgaggtggt 3301 ctgaatgttc tgacattaac agttttccat gaaaacgttt ttattgtgtt ttcaatttat 3361 ttattaagat ggattctcag atatttatat ttttatttta tttttttcta ccctgaggtc 3421 tttcgacatg tggaaagtga atttgaatga aaaattttaa gcattgtttg cttattgttc 3481 caggacattg tcaataaaag catttaagtt gaatgcgacc accttcttgc tctctttatt 3541 ctcagttttg tatggtttca ggaaggcctc tgaggagacc agtttgtcaa gatgggtggg 3601 tcctggaggg gaacacgccc tctgtcccct tgtcactcag aggacacgta gttcagggta 3661 ttttacagat gtg // LOCUS MUSIGHACX 312 bp ss-mRNA ROD 27-JUL-1990 DEFINITION Mouse Ig rearranged kappa-chain mRNA V-J-region hybridoma C3H.SW. ACCESSION M34880 KEYWORDS immunoglobulin light chain; joining exon; kappa-immunoglobulin; processed gene; variable region. SOURCE Mouse hybridoma C3H.SW, cDNA to mRNA, clone C10. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 312) AUTHORS Andria,M.L., Levy,S. and Benjamini,E. TITLE Diverse V-H and V-L genes are used to produce antibodies against a defined protein epitope JOURNAL J. Immunol. 144, 2614-2619 (1990) STANDARD simple staff_review FEATURES from to/span description pept < 1 > 312 Ig kappa-chain V-J-region (AA at 1) recomb 303 304 V-region end/J-region start BASE COUNT 83 a 77 c 72 g 80 t ORIGIN Chromosome 6. 1 gacattgtga tgacacagtc tccatcctcc ctggctatgt cagtaggaca gaaggtcact 61 atgaggtgca agtccagtca gagcctttta aatagtagca gtcaaaagaa ctatttggcc 121 tggtaccagc agaaaccagg acagtctcct aaacttctgg tatactttgc atccactagg 181 gaatctgggg tccctgatcg cttcataggc agtggatctg ggacagattt cactcttacc 241 atcagcagtg tgcaggctga agacctggca gattacttct gtcagcaaca ttatagcact 301 ccgtacacgt tc // LOCUS MUSIGHACY 312 bp ss-mRNA ROD 27-JUL-1990 DEFINITION Mouse Ig rearranged kappa-chain mRNA V-J-region hybridoma C3H.SW. ACCESSION M34883 KEYWORDS immunoglobulin light chain; joining exon; kappa-immunoglobulin; processed gene; variable region. SOURCE Mouse hybridoma C3H.SW, cDNA to mRNA, clone CM2. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 312) AUTHORS Andria,M.L., Levy,S. and Benjamini,E. TITLE Diverse V-H and V-L genes are used to produce antibodies against a defined protein epitope JOURNAL J. Immunol. 144, 2614-2619 (1990) STANDARD simple staff_review FEATURES from to/span description pept < 1 > 312 Ig kappa-chain V-J-region (AA at 1) recomb 303 304 V-region end/J-region start BASE COUNT 80 a 78 c 68 g 83 t 3 others ORIGIN Chromosome 6. 1 gatgttttga tgacccaaac tccactctcc ctgcctgtca gtcttggaga tcaagcctcc 61 atctcttgca gatctagtca gaacattgta catactnnna atggaaacac ctatttagaa 121 tggtacctgc agaaaccagg ccagtctcca aagctcctga tctacaaagt ttccaaccga 181 ttttctgggg tcccagacag gttcagtggc agtggatcag ggacagattt cacactcaag 241 atcagcagag tggaggctga ggatctggga gtttattact gctttcaagg ttcacatgtt 301 cctccgacgt tc // LOCUS MUSIGHACZ 312 bp ss-mRNA ROD 27-JUL-1990 DEFINITION Mouse Ig rearranged kappa-chain mRNA V-J-region hybridoma C3H.SW. ACCESSION M34884 KEYWORDS immunoglobulin light chain; joining exon; kappa-immunoglobulin; processed gene; variable region. SOURCE Mouse hybridoma C3H.SW, cDNA to mRNA, clone CM4. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 312) AUTHORS Andria,M.L., Levy,S. and Benjamini,E. TITLE Diverse V-H and V-L genes are used to produce antibodies against a defined protein epitope JOURNAL J. Immunol. 144, 2614-2619 (1990) STANDARD simple staff_review FEATURES from to/span description pept < 1 > 312 Ig kappa-chain V-J-region (AA at 1) recomb 303 304 V-region end/J-region start BASE COUNT 78 a 76 c 72 g 83 t 3 others ORIGIN Chromosome 6. 1 gatgttttga tgacccaaac tccactctcc ctgcctgtca gtcttggaga tcaagcctcc 61 atctcttgca gattcagtca gagcattgta catactnnna atggaaacac ctatttagaa 121 tggtacctgc agaaaccagg ccagtctcca aagctcctga tctacagagt ttccaaccga 181 ttttctgggg tcccagacag gttcagtggc agtggatcag ggacagattt cacactcaag 241 atcagcagag tggaggctga ggatctggga gtttattact gctttcaagg ttcacatgtt 301 ccgtggacgt tc // LOCUS MUSIGHADA 312 bp ss-mRNA ROD 27-JUL-1990 DEFINITION Mouse Ig rearranged kappa-chain mRNA V-J-region hybridoma C3H.SW. ACCESSION M34885 KEYWORDS immunoglobulin light chain; joining exon; kappa-immunoglobulin; processed gene; variable region. SOURCE Mouse hybridoma C3H.SW, cDNA to mRNA, clone CR1. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 312) AUTHORS Andria,M.L., Levy,S. and Benjamini,E. TITLE Diverse V-H and V-L genes are used to produce antibodies against a defined protein epitope JOURNAL J. Immunol. 144, 2614-2619 (1990) STANDARD simple staff_review FEATURES from to/span description pept < 1 > 312 Ig kappa-chain V-J-region (AA at 1) recomb 303 304 V-region end/J-region start BASE COUNT 69 a 75 c 78 g 87 t 3 others ORIGIN Chromosome 6. 1 gatattgtga tgactcaggc tgcaccctct gtacctgtca ctcctggaga gtcagtatcc 61 atctcctgca ggtctagtaa gagtctcctg catagtnnna atggcaacac ttacttgtat 121 tggttcctgc agaggccagg ccagtctcct cagctcctga tatatcggat gtccaacctt 181 gcctcaggcg tcccagacag gttcagtggc agtgggtcag gaactgcttt cacactgaga 241 atcagtagag tggaggctga ggatgtgggt gtttattact gtatgcaaca tctagaatat 301 cctttgacgt tc // LOCUS MUSIGHADB 312 bp ss-mRNA ROD 27-JUL-1990 DEFINITION Mouse Ig rearranged kappa-chain mRNA V-J-region hybridoma A/J. ACCESSION M34886 KEYWORDS immunoglobulin light chain; joining exon; kappa-immunoglobulin; processed gene; variable region. SOURCE Mouse hybridoma A/J, cDNA to mRNA, clone AF1. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 312) AUTHORS Andria,M.L., Levy,S. and Benjamini,E. TITLE Diverse V-H and V-L genes are used to produce antibodies against a defined protein epitope JOURNAL J. Immunol. 144, 2614-2619 (1990) STANDARD simple staff_review FEATURES from to/span description pept < 1 > 312 Ig kappa-chain V-J-region (AA at 1) recomb 303 304 V-region end/J-region start BASE COUNT 86 a 73 c 62 g 73 t 18 others ORIGIN Chromosome 6. 1 gatatccaga tgacacagac tacatcctcc ctgtctgcct ctctgggaga cagagtcacc 61 atcagttgca gggcaagtca gnnnnnnnnn nnnnnnnnng acattagcaa ttatttaaac 121 tggtatcagc agaaaccaga tggaactgtt aaactcctga tctactacac atcaagatta 181 cactcaggag tcccatcaag gttcagtggc agtgggtctg gaacagatta ttctctcacc 241 attagcaacc tggagcaaga agatattgcc acttactttt gccaacaggg taatacgctt 301 ccgtggacgt tc // LOCUS MUSIGHADC 312 bp ss-mRNA ROD 27-JUL-1990 DEFINITION Mouse Ig rearranged kappa-chain mRNA V-J-region hybridoma A/J. ACCESSION M34887 KEYWORDS immunoglobulin light chain; joining exon; kappa-immunoglobulin; processed gene; variable region. SOURCE Mouse hybridoma A/J, cDNA to mRNA, clone AF2. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 312) AUTHORS Andria,M.L., Levy,S. and Benjamini,E. TITLE Diverse V-H and V-L genes are used to produce antibodies against a defined protein epitope JOURNAL J. Immunol. 144, 2614-2619 (1990) STANDARD simple staff_review FEATURES from to/span description pept < 1 > 312 Ig kappa-chain V-J-region (AA at 1) recomb 303 304 V-region end/J-region start BASE COUNT 87 a 79 c 70 g 76 t ORIGIN Chromosome 6. 1 gacattgtga tgacacagtc tccatcctcc ctgactgtca cagcaggaga gaaggtcact 61 atgagttgca agtccagtca gagtctgtta aacagtggaa atcaaaacaa ctacttgacc 121 tggtaccagc agaaaccagg gcagcctcct aaactgttaa tctactgggc atccactaga 181 gaatctgggg tccctgatcg cttcacaggc agtggatctg gaacagattt cactctcacc 241 atcagcagtg tgcaggctga agacctggca gtttattact gtcagaatga ttatcgttat 301 ccgctcacat tc // LOCUS MUSIGHADG 327 bp ss-mRNA ROD 27-JUL-1990 DEFINITION Mouse Ig H-chain mRNA V-D-J-region hybridoma C3H.SW. ACCESSION M34888 KEYWORDS diversity exon; immunoglobulin heavy chain; joining exon; processed gene; variable region. SOURCE Mouse hybridoma C3H.SW, cDNA to mRNA, clone C10. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 327) AUTHORS Andria,M.L., Levy,S. and Benjamini,E. TITLE Diverse V-H and V-L genes are used to produce antibodies against a defined protein epitope JOURNAL J. Immunol. 144, 2614-2619 (1990) STANDARD simple staff_review FEATURES from to/span description pept < 1 > 327 Ig H-chain V-D-J-region (AA at 1) recomb 294 295 V-region end/D-region start recomb 297 298 D-region end/J-region start BASE COUNT 87 a 78 c 87 g 75 t 0 others ORIGIN Chromosome 12. 1 gaggtccagc tgcaacagtc tggacctgag ctggtgaagc ctggggattc agtgaagatg 61 tcctgcaagg cttctggcta cacattcact gactcctaca tggactgggt gaagcagagc 121 cctggaaaga gccttgagtg gattggatat atttatccta acaatggtgg tactaactac 181 aatcagaagt tcaagggcaa ggccacattg actgtagaca agtcctccag cacagcctac 241 atggaactcc acagcctgac atctgaggac tctgcagtct attactgtgc aagatggact 301 tactggggcc aagggactct ggtcact // LOCUS MUSIGHADH 345 bp ss-mRNA ROD 27-JUL-1990 DEFINITION Mouse Ig H-chain mRNA V-D-J-region hybridoma C3H.SW. ACCESSION M34889 KEYWORDS diversity exon; immunoglobulin heavy chain; joining exon; processed gene; variable region. SOURCE Mouse hybridoma C3H.SW, cDNA to mRNA, clone CM2. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 345) AUTHORS Andria,M.L., Levy,S. and Benjamini,E. TITLE Diverse V-H and V-L genes are used to produce antibodies against a defined protein epitope JOURNAL J. Immunol. 144, 2614-2619 (1990) STANDARD simple staff_review FEATURES from to/span description pept < 1 > 345 Ig H-chain V-D-J-region (AA at 1) recomb 293 294 V-region end/D-region start recomb 309 310 D-region end/J-region start BASE COUNT 81 a 93 c 86 g 85 t 0 others ORIGIN Chromosome 12. 1 cagcctgggg ctgagcttgt gaagcctggg acttctgtga agttgtcctg caggccttct 61 ggcttcacct acaccgactt ctggattcac tgggtgaaac agaggcctgg acaaggcctt 121 gagtggattg gagagattaa tcctagcaat ggtggtacta gctacaatga gaagttcaag 181 accaaggcca cactgactgc agacaaatcc tccagcacag tccttacatg caactcaacg 241 agcctgacat ctgacgactc tgcggtctat ttctgtacaa gaccccctct ctcctatggt 301 aacttcgcct ggtttactta ctggggccaa gggactctgg tcact // LOCUS MUSIGHADI 264 bp ss-mRNA ROD 27-JUL-1990 DEFINITION Mouse Ig H-chain mRNA V-D-J-region hybridoma C3H.SW. ACCESSION M34890 KEYWORDS diversity exon; immunoglobulin heavy chain; joining exon; processed gene; variable region. SOURCE Mouse hybridoma C3H.SW, cDNA to mRNA, clone CM4. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 264) AUTHORS Andria,M.L., Levy,S. and Benjamini,E. TITLE Diverse V-H and V-L genes are used to produce antibodies against a defined protein epitope JOURNAL J. Immunol. 144, 2614-2619 (1990) STANDARD simple staff_review FEATURES from to/span description pept < 1 > 264 Ig H-chain V-D-J-region (AA at 1) recomb 223 224 V-region end/D-region start recomb 230 231 D-region end/J-region start BASE COUNT 70 a 68 c 67 g 59 t 0 others ORIGIN Chromosome 12. 1 ttcactacct actggatgca ctgggtgaag cagaggcctg gacaaggcct tgagtggatc 61 ggagagattg atccttctga tacttatact taccacaatc aaaacttcaa gggcaaggcc 121 acattgactg tagacaaatc ctccagcaca gcctacatgc aactcagcag cctgacatct 181 gaggactctg cggtctatta ctgtgcaaga aggtcgatgg gccactacgg gcttgcatac 241 tggggccaag ggactctggt cact // LOCUS MUSIGHADJ 329 bp ss-mRNA ROD 27-JUL-1990 DEFINITION Mouse Ig H-chain mRNA V-D-J-region hybridoma C3H.SW. ACCESSION M34891 KEYWORDS diversity exon; immunoglobulin heavy chain; joining exon; processed gene; variable region. SOURCE Mouse hybridoma C3H.SW, cDNA to mRNA, clone CR1. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 329) AUTHORS Andria,M.L., Levy,S. and Benjamini,E. TITLE Diverse V-H and V-L genes are used to produce antibodies against a defined protein epitope JOURNAL J. Immunol. 144, 2614-2619 (1990) STANDARD simple staff_review FEATURES from to/span description pept < 1 > 329 Ig H-chain V-D-J-region (AA at 1) recomb 273 274 V-region end/D-region start recomb 294 295 D-region end/J-region start BASE COUNT 80 a 84 c 82 g 83 t ORIGIN Chromosome 12. 1 ttagtgaagc ttggagggtc cctgcaactc tcctgtgcag cctctggatt cactttcagt 61 agctattaca tgtcttgggt tcgccagact cctgagaaga ggctggagtg ggtcgcaacc 121 attagtaatg gtggtgttag gcacctacta tccagacagt gtgaaggacc gcttcacatc 181 tccagagaca atgccaggaa caccctctac ctgcgaatga gcagtctgaa ttctgaggac 241 acagccgtgt attactgtgc aagagatgga tcttatggta actaccacgt accttacttt 301 gactactggg gccaaggcac cactctcac // LOCUS MUSIGHADK 271 bp ss-mRNA ROD 27-JUL-1990 DEFINITION Mouse Ig H-chain mRNA V-D-J-region hybridoma A/J. ACCESSION M34892 KEYWORDS diversity exon; immunoglobulin heavy chain; joining exon; processed gene; variable region. SOURCE Mouse hybridoma C3H.SW, cDNA to mRNA, clone AF1. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 271) AUTHORS Andria,M.L., Levy,S. and Benjamini,E. TITLE Diverse V-H and V-L genes are used to produce antibodies against a defined protein epitope JOURNAL J. Immunol. 144, 2614-2619 (1990) STANDARD simple staff_review FEATURES from to/span description pept < 1 > 271 Ig H-chain V-D-J-region (AA at 2) recomb 226 227 V-region end/D-region start recomb 238 239 D-region end/J-region start BASE COUNT 69 a 63 c 73 g 66 t 0 others ORIGIN Chromosome 12. 1 attcactttc agtagctatg ccatgtcttg ggttcgccag actccagaga agaggctgga 61 gtgggtcgca tccattaata gtggtggtag tacctactat ccagacagtg tgaagggccg 121 attcaccatc tccagagata atggcaggaa caccctgtac ctgcaaatga gcagtctgag 181 gtctgaggac acggccatgt attattgtgg aagaggcagt tacccctatt actatggtct 241 ggactactgg ggtcaaggaa cctcagtcac c // LOCUS MUSIGHADL 348 bp ss-mRNA ROD 27-JUL-1990 DEFINITION Mouse Ig H-chain mRNA V-D-J-region hybridoma A/J. ACCESSION M34893 KEYWORDS diversity exon; immunoglobulin heavy chain; joining exon; processed gene; variable region. SOURCE Mouse hybridoma C3H.SW, cDNA to mRNA, clone AF2. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 348) AUTHORS Andria,M.L., Levy,S. and Benjamini,E. TITLE Diverse V-H and V-L genes are used to produce antibodies against a defined protein epitope JOURNAL J. Immunol. 144, 2614-2619 (1990) STANDARD simple staff_review FEATURES from to/span description pept < 1 > 348 Ig H-chain V-D-J-region (AA at 1) recomb 303 304 V-region end/D-region start recomb 315 316 D-region end/J-region start BASE COUNT 86 a 80 c 99 g 83 t 0 others ORIGIN Chromosome 12. 1 gaagtgaagc tggtggagtc tgggggaggc ttagtgacgc ctggagggtc cctgaaactc 61 tcctgtgcag cctctggatt cactttcagt agctatgcca tgtcttgggt tcgccagact 121 ccagagaaga ggctggaatg ggtcgcatcc attaataatg gtggtaatac ctactatcca 181 gacagtgtga agggccgatt caccatctcc agagataatg gcaggaacac cctgtacctg 241 caaatgagca gtctgaggtc tgaggacacg gccatgtatt actgtggaag aggcagttat 301 ccctattact atggtctaga ctactggggt caaggaacct cagtcacc // LOCUS NEUMTNDCO 839 bp ds-DNA ORG 27-JUL-1990 DEFINITION N.crassa mitochondrial pseudo-NADH dehydrogenase (ND5) gene, 3' end, and cytochrome oxidase subunit 2 (co2) gene, 5' end. ACCESSION M35513 KEYWORDS NADH dehydrogenase; cytochrome oxidase subunit 2; pseudogene. SOURCE N.crassa (strain RL3-8A) stopper mutant ER-3 mitochondrial DNA. ORGANISM Mitochondrion Neurospora crassa Eukaryota; Plantae; Thallobionta; Eumycota; Ascomycotina; Pyrenomycetes; Sordariales; Sordariaceae; Neurospora crassa. REFERENCE 1 (bases 1 to 839) AUTHORS Almasan,A. and Mishra,N.C. TITLE Molecular characterization of the mitochondrial DNA of a new stopper mutant ER-3 of Neurospora crassa JOURNAL Genetics 120, 935-945 (1988) STANDARD simple staff_review FEATURES from to/span description pept.ps > 1 270 pseudo-NADH dehydrogenase (ND5; AA at 1) pept 645 > 839 cytochrome oxidase subunit 2 (co2) BASE COUNT 313 a 124 c 119 g 283 t ORIGIN 1 tgtatgaaat atttagctga aatattagga ggtaaaaaac atttattaaa aagttacgaa 61 ggttacaatg taacagttaa taccactaaa ttatctccta ttgtacaata ttttaatctt 121 tatcctttaa aaactaaaaa gtatattaca tattttaact gaataaaaat atataagtta 181 gtaatagata aaaaacataa tgatcctgaa aatttattat tgattacgaa atataaaaat 241 aacattaata aatctgatta taataaataa atcaactttc ggcggacccc gccccttcgg 301 ggcgggggcc cgtttttagt aataactaaa aattaaacac agctacataa atgaagatat 361 agtccgatca gttaagtaat taactgcgta ttccacccct cccttccccc gccccgaagg 421 cgcccccgaa aaaacaaagg acgctagcgt agctaacctt atgagcagca ggctataaaa 481 acccattagg cacagggtat acttcgttag tggataagaa ttagcctaag ccttaatata 541 caatacagtt taatgttata tttaaattct atatttatga aaagatataa tatatttatt 601 attatggttt aatattcaat atagaatatt ctatttcttt ttttatggga ttattattta 661 ataatttaat tatgaatttt gatgctccaa gcccatgagg tatctatttt caagatagtg 721 ctactcctca gatggaaggt ttagttgaat tacatgacaa tattatgtac tatctggtgg 781 taatactatt tgttgtagga tgaatcttat tatctataat aagaaattat atcagtaca // LOCUS PUMCHSA1 90 bp ds-DNA PLN 27-JUL-1990 DEFINITION P.crispum chalcone synthase gene, 5' flank. ACCESSION M35515 KEYWORDS chalcone synthasse. SEGMENT 1 of 2 SOURCE P.crispum (strain Mooskrause) DNA, clone PcCHS-1. ORGANISM Petroselinum crispum Eukaryota; Plantae; Embryobionta; Magnoliophyta; Magnoliopsida; Rosidae; Apiales; Apiaceae. REFERENCE 1 (bases 1 to 90) AUTHORS Herrmann,A., Schulz,W. and Hahlbrock,K. TITLE Two alleles of the single-copy chalcone synthase gene in parsley differ by a transposon-like element JOURNAL Mol. Gen. Genet. 212, 93-98 (1988) STANDARD simple staff_review FEATURES from to/span description rpt 36 43 5' insertion target sequence rpt 44 58 5' inverted terminal repeat trns 44 > 90 transposon-like sequence BASE COUNT 29 a 14 c 23 g 24 t ORIGIN 1 tgaaaatgca tgttacaaac gttccctaat aatatagtag aagtagggtg taaatgagtc 61 gatacgctca cgaacagttc ggtgttcggg // LOCUS PUMCHSA2 732 bp ds-DNA PLN 27-JUL-1990 DEFINITION P.crispum chalcone synthase gene, exon 1. ACCESSION M35516 KEYWORDS chalcone synthasse. SEGMENT 2 of 2 SOURCE P.crispum (strain Mooskrause) DNA, clone PcCHS-1. ORGANISM Petroselinum crispum Eukaryota; Plantae; Embryobionta; Magnoliophyta; Magnoliopsida; Rosidae; Apiales; Apiaceae. REFERENCE 1 (bases 1 to 732) AUTHORS Herrmann,A., Schulz,W. and Hahlbrock,K. TITLE Two alleles of the single-copy chalcone synthase gene in parsley differ by a transposon-like element JOURNAL Mol. Gen. Genet. 212, 93-98 (1988) STANDARD simple staff_review FEATURES from to/span description pept 725 > 732 chalcone synthase, exon 1 mRNA 591 > 732 chalcone synthase mRNA (alt.) mRNA 639 > 732 chalcone synthase mRNA (alt.) trns < 1 53 transposon-like sequence rpt 38 53 3' inverted terminal repeat rpt 54 61 3' insertion target sequence BASE COUNT 245 a 139 c 126 g 222 t ORIGIN About 810 bp after segment 1. 1 ctgaaacgtg aacacttaag aattcggctc ggttcggctc atttacagcc ctaagtagaa 61 gtgatccata gcattttata ataactaaca ttacatatat attacccttt ttctacattc 121 gtgtagagag aaaatgactc aataattcaa ttagtattat agaatttctt gttggtaact 181 cttaatttta tttattaata gaataacgga aacgaaacga gatgggattg gtagatggga 241 aacagaaata ggcacgtcac ggggtgtatg tatttattaa aaacaatggc atataggtaa 301 gttgaggata acaattaaat cgtggtttca ttacaatagt tctgaaagaa gaggtagaat 361 cgattgtcct tcacttgatg tatctactca ctttggatgt acgtggaggc ctaaatcccc 421 accatggttg caagtagcag ttacatccac ataactcctc ccttattcca cgtggccatc 481 cggtggccgt ccctccaacc taacctccct tgaatttcta tcagtaaatt ttcaaccctc 541 tctcttctct gtcgtcacta tataatttcc ctacctacat ctctcttgat atctctttct 601 ctctataaat atttaacaag gagaaaggaa aaataacaac acaaatcaga cagatcagat 661 catcccttta cataaagata aggggtaagg gggggtttat agagagattt ataaaaaaaa 721 aaaaatggca aa // LOCUS R11ORIA 589 bp ds-DNA BCT 27-JUL-1990 DEFINITION Plasmid R1162 origin of replication region DNA. ACCESSION M35512 KEYWORDS origin of replication. SOURCE Plasmid R1162 DNA. ORGANISM Plasmid R1162 Prokaryota; Bacteria. REFERENCE 1 (bases 1 to 589) AUTHORS Meyer,R.J., Lin,L.-S., Kim,K. and Brasch,M.A. TITLE Broad host-range plasmid R1162: Replication, incompatibility, and copy-number control JOURNAL Basic Life Sci. 30, 173-188 (1985) STANDARD simple staff_review BASE COUNT 114 a 156 c 199 g 120 t ORIGIN 1 ccgggctgaa tgatcgaccg agacaggccc tgcggggctg cacacgcgcc cccacccttc 61 gggtaggggg aaaggccgct aaagcggcta aaagcgctcc agcgtatttc tgcggggttt 121 ggtgtggggt ttagcgggct ttgcccgcct ttccccctgc cgcgcagcgg tggggcggtg 181 tgtagcctag cgcagcgaat agaccagcta tccggcctct gggcatattg ggcagggcag 241 cagcgcccca cagggcgtga ctaaccgcgc ctagtggatt attcttagat aatcatggat 301 ggatttttcc aacaccccgc cagcccccgc ccctgctggg tttgcaggtt tgggggcgtg 361 acagttattg caggggttcg tgacagttat tgcagggggg cgtgacagtt attgcagggg 421 ttcgtgacag ttagtacggg atgacgggca ctggctggca atgtctagca acggcaggca 481 tgtcggctga cggtaaaaca actttccgct aagcgataga ctgtatgtga aacacagtat 541 tgcaaggacg cggaacatgc ctcatgtggc ggccaggacg gccagccgg // LOCUS STRSPEC 936 bp ds-DNA BCT 27-JUL-1990 DEFINITION S.pyogenes streptococcal pyrogenic exotoxin type C (speC) gene, complete cds. ACCESSION M35514 KEYWORDS pyrogenic exotoxin. SOURCE S.pyogenes DNA. ORGANISM Streptococcus pyogenes Prokaryota; Bacteria; Firmicutes; Gram-positive cocci; Streptococcaceae. REFERENCE 1 (bases 1 to 936) AUTHORS Goshorn,S.C. and Schlievert,P.M. TITLE Nucleotide sequence of Streptococcal pyrogenic exotoxin type C JOURNAL Infect. Immun. 56, 2518-2520 (1988) STANDARD simple staff_review FEATURES from to/span description pept 154 861 streptococcal pyrogenic exotoxin type C (speC) precursor sigp 154 234 streptococcal pyrogenic exotoxin type C signal peptide (put.) matp 235 858 streptococcal pyrogenic exotoxin type C BASE COUNT 364 a 144 c 127 g 301 t ORIGIN 1 caaccttgac tatttaaatg gaactgccac tcctaaaaac taaaatataa atacatttat 61 aaaatttcta aataaacaga aatctgattt ttaactactt actgctattt catgtattct 121 cgtacgagta atacatttaa ttaaggagaa aaaatgaaaa agattaacat catcaaaata 181 gttttcataa ttacagtcat actgatttct acttatttca cctatcatca aagtgactct 241 aagaaagaca tttcgaatgt taaaagtgat ttactttatg catacactat aactccttat 301 gattataaag attgcagggt aaatttttca acgacacaca cattaaacat tgatactcaa 361 aaatatagag ggaaagacta ttatattagt tccgaaatgt cttatgaggc ctctcaaaaa 421 tttaaacgag atgatcatgt agatgttttt ggattatttt atattcttaa ttctcacacc 481 ggtgagtaca tctatggagg aattacgcct gctcaaaata ataaagtaaa tcataaatta 541 ttgggaaatc tatttatttc gggagaatct caacagaact taaataacaa gattattcta 601 gaaaaggata tcgtaacttt ccaggaaatt gactttaaaa tcagaaaata ccttatggat 661 aattataaaa tttatgacgc tacttctcct tatgtaagcg gcagaatcga aattggcaca 721 aaagatggga aacatgagca aatagactta tttgactcac caaatgaagg gactagatca 781 gatatttttg caaaatataa agataataga attatcaata tgaagaactt tagtcatttc 841 gatatttatc ttgaaaaata attcatcata cacaaaaaac cgcccagaat aatctgagcg 901 gttttgtctt atctcggagc tttacctcct aattta // LOCUS HUMTROPI 816 bp ss-mRNA PRI 27-JUL-1990 DEFINITION Human slow-twitch skeletal troponin I (TNN1) mRNA, complete cds. ACCESSION J04760 KEYWORDS slow-twitch skeletal troponin I; troponin I. SOURCE Human adult slow-twitch skeletal muscle, cDNA to mRNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 816) AUTHORS Wade,R., Eddy,R., Shows,T.B. and Kedes,L. TITLE cDNA sequence, tissue-specific expression, and chromosomal mapping of the human slow-twitch skeletal muscle isoform of troponin I JOURNAL Genomics 7, 346-357 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by R.Wade, 15-MAR-1990. FEATURES from to/span description pept 77 640 slow-twitch skeletal troponin I (TNN1) mRNA < 1 816 TNN1 mRNA BASE COUNT 180 a 247 c 244 g 145 t ORIGIN Chromosome 1q12-qter. 1 tagtctgcag tctacggcga ggcacaggcc agcccagctc cacgaggact gaacaaggtg 61 ctgtctcact gccaccatgc cggaagtcga gagaaaaccc aagatcactg cctcccgcaa 121 actcttgctg aagagcctga tgctggccaa ggccaaggaa tgctgggagc aggagcacga 181 ggagcgcgag gctgagaagg tgcgctacct ggcagagcgc atccccacgc tgcagacccg 241 tggcctgtcc ctcagtgccc tgcaggacct gtgccgggag ctgcacgcca aggtggaggt 301 ggtggatgag gagcgatacg acattgaggc caaatgcctc cacaacacca gggagattaa 361 ggacctgaag ctgaaggtga tggacctccg tgggaagttc aagcgcccgc ccctgcgtcg 421 agtccgtgtc tcggctgacg ccatgctccg ggccctgctg ggctccaagc acaaggtgtc 481 catggatctg cgggccaacc tcaagtctgt gaagaaggaa gacacagaga aggagcggcc 541 tgtggaggtg ggtgactgga ggaagaacgt ggaggccatg tctggcatgg aaggccggaa 601 gaagatgttt gatgccgcca atgctccgac ctcacaatag aggccagctt gctgtgctgc 661 gctctgagct cctgcttcat gcttcttctc caacccagct cactcacctc tctgcctgtg 721 tctggagcat cccttcccac ctctccccca cttcttccct ccagcctgca atgccctcct 781 ctggaactgg gattaaacag atacccaaga ggcagg // LOCUS PPB4TSP 349 bp ds-DNA VRL 27-JUL-1990 DEFINITION Bovine papillomavirus major transcript, 5' flank. ACCESSION M35265 KEYWORDS major transcript. SOURCE Bovine papillomavirus (type 4) DNA. ORGANISM Bovine papillomavirus Viridae; ds-DNA nonenveloped viruses; Papovaviridae; Papillomavirus. REFERENCE 1 (bases 1 to 349) AUTHORS Stamps,A.C. and Campo,M.S. TITLE Mapping of two novel transcripts of Bovine papillomavirus type 4 JOURNAL J. Gen. Virol. 69, 3033-3045 (1988) STANDARD simple staff_review FEATURES from to/span description pre-msg 320 > 349 BPV-4 major transcript BASE COUNT 109 a 54 c 88 g 98 t ORIGIN 1 ctttgcagat gaattagaac aggaggaaca gtaaatagaa tattgctgtg cttttacatt 61 tgagaaggat tgtagtgtgg gcatttaaaa gctgaccttc cagtcttaat tgcagtaggc 121 gcctaagagg gtggtggtgg tataagttca agtttttcaa gaaaatgaag gggcagaacg 181 tgacattaca ggacattgca atagaattag aggatacaat tagtccaatt aacttgcatt 241 gtgaagagga gattgaaact gaggaggtgg acacccctaa cccttttgca ataacagcaa 301 cttgttatgc ttgcgagcaa gtccttcgtt tagctgttgt aacgtcaac // LOCUS PPBBPV4X 69 bp ds-DNA VRL 27-JUL-1990 DEFINITION Bovine papillomavirus X protein gene, partial cds. ACCESSION M35264 KEYWORDS . SOURCE Bovine papillomavirus (type 4) DNA. ORGANISM Bovine papillomavirus Viridae; ds-DNA nonenveloped viruses; Papovaviridae; Papillomavirus. REFERENCE 1 (bases 1 to 69) AUTHORS Stamps,A.C. and Campo,M.S. TITLE Mapping of two novel transcripts of Bovine papillomavirus type 4 JOURNAL J. Gen. Virol. 69, 3033-3045 (1988) STANDARD simple staff_review FEATURES from to/span description pept < 1 > 69 X protein (AA at 3) BASE COUNT 18 a 27 c 19 g 5 t ORIGIN 1 acaacgcggg acccaaaccg gggaccacgc ccgaggacgt agcagaccgt ccccccgatc 61 ttccagaga // LOCUS BLIPGECR 314 bp ds-DNA BCT 27-JUL-1990 DEFINITION B.licheniformis regulatory extracellular protease gene, complete cds. ACCESSION M35503 KEYWORDS extracellular protease. SOURCE B.licheniformis DNA, clone pAN2. ORGANISM Bacillus licheniformis Prokaryota; Bacteria; Firmicutes; Endospore-forming rods and cocci; Bacillaceae. REFERENCE 1 (bases 1 to 314) AUTHORS Watanabe,K., Sato,N., Asano,K., Hatanaka,Y., Okada,J., Murata,K. and Kimura,A. TITLE Nucleotide sequence of the gene increasing the extracellular proteolytic activities of Bacillus licheniformis; Comparison with similar phenotypic genes from other Bacillus sp JOURNAL Agric. Biol. Chem. 51, 2807-2809 (1987) STANDARD simple staff_entry FEATURES from to/span description pept 27 149 regulatory extracellular protease binding 12 16 ribosomal binding site (put.) binding 20 23 ribosomal binding site (put.) BASE COUNT 118 a 53 c 55 g 88 t ORIGIN 1 gatcttccaa aaggagtgtg gaaccggtgg aaaagcaaca aattgaagaa ttaaaacaac 61 tgctttggcg gctagagaat gaaatcagag aaacaaagga ctccttgcgc aagattaaca 121 aaagcattga tctgttaata caagtttaaa gggatttggt ttctttttca ccgaaaagtt 181 gcatataaat tcatgcgtta caccgcaaac tttaaataga ccataacacc cattttacac 241 aacttgcgat tagttctcaa atcctaatga aacataaaat ttttgtttgt ttttcaagga 301 gtcaaataaa gatc // LOCUS DOGPLBA 832 bp ss-mRNA MAM 27-JUL-1990 DEFINITION Dog cardiac phospholamban mRNA, complete cds. ACCESSION M35393 KEYWORDS phospholamban. SOURCE Dog heart, cDNA to mRNA, clone pPLB1. ORGANISM Canis sp. Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Carnivora; Caniformia; Canoidea; Canidae. REFERENCE 1 (bases 1 to 832) AUTHORS Uyeda,A., Kitano,K., Fujii,J., Kadoma,M., Tada,M. and Tanaka,S. TITLE Characterization of recombinant cDNA clones for canine cardiac phospholamban JOURNAL Nucleic Acids Symp Ser 17, 121-124 (1986) STANDARD simple staff_entry FEATURES from to/span description pept 181 339 cardiac phospholamban (EC 2.7..1.37) BASE COUNT 277 a 161 c 125 g 269 t ORIGIN 1 agaaaacttt ctaactaaac accgataaga cttcatacaa ctcacaatac tttatattgt 61 aatcatcaca agagccaagg ctacctaaaa gaagagagtg gttgagctca catttggccg 121 ccagcttttt acctttctct tcaccattta aaacttgaga cttcctgctt tcctggggtc 181 atggataaag tccaatacct cactcgctct gctattagaa gagcttcaac cattgaaatg 241 cctcaacaag cacgtcaaaa tcttcagaac ctatttataa atttctgtct cattttaata 301 tgtctcttgt tgatctgcat cattgtgatg cttctctgaa gttctgctgc aatctccagt 361 gatgcaactt gtcaccatca acttaatatc tgccatccca tgaagagggg aaaataatac 421 tatataacag accacttcta agtagaagat tttacttgtg aaaaggtcaa gattcagaac 481 aaaagaaatt attaacaaat gtcttcatct gtgggatttt gtaaacatga aaagagcttt 541 attttcaaaa attaacttca aaatgactat aggtgcgcat aatgtaattg ctgaattcct 601 caacaaagct tgtaaaagtt tctatgccaa attttttctg agggtaaagt aggagtttag 661 ttttaaaact gctctgctaa ccagttcact tcacatataa agcattagct tcactatttg 721 agctaaatat ttatattgta ctgtaaatgc ctatgtaatg tttattaaga tttttcaagt 781 ctccgctaag tacgaaaata atcatccaaa tgaagtcatc atttgaaata gc // LOCUS DVUFDXA 707 bp ds-DNA BCT 27-JUL-1990 DEFINITION D.salexigens flavodoxin gene, complete cds. ACCESSION M35475 KEYWORDS flavodoxin. SOURCE D.salexigens (ATCC 14822) DNA. ORGANISM Desulfovibrio salexigens Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Sulfate- or sulfur-reducing dissimilatory bacteria. REFERENCE 1 (bases 1 to 707) AUTHORS Helms,L.R., Krey,G.D. and Swenson,R.P. TITLE Identification, sequence determination, and expression of the flavodoxin gene form Desulfovibrio salexigens JOURNAL Biochem. Biophys. Res. Commun. 168, 809-817 (1990) STANDARD simple staff_entry FEATURES from to/span description pept 238 678 flavodoxin binding 227 232 ribosome binding site (put.) signal 129 135 flavodoxin -35 region signal 152 157 flavodoxin -10 region BASE COUNT 223 a 152 c 149 g 183 t ORIGIN 1 agactaacaa taaaaatttt gttttctcac aaacaacatt ttgaacgcgc atatatcggt 61 acgtcttaag gagtgcataa caccactcag gcgcggcgta aacactcaat taccgattgg 121 ctgacatttt cacattttct taatattttt ataaattgat attgactttg aatttcactc 181 tcactacaaa ctaacacaac gttatcaaac cgattacaga tttataagga ggacactatg 241 tccaaatcac tgatcgttta cggctctact accggaaata ctgaaacagc cgccgaatac 301 gtggctgaag cgtttgaaaa caaagaaatt gatgtggaac ttaaaaatgt tactgatgtc 361 agtgttgccg atctcggcaa cggatacgac atcgtgctat tcggctgctc tacctggggc 421 gaagaagaaa ttgaattgca ggacgacttc atccccctct acgattccct cgaaaacgca 481 gacctgaagg gcaagaaagt atctgtcttc ggatgcggtg actccgatta cacttatttt 541 tgcggtgcag tagatgccat cgaagaaaaa ctcgaaaaaa tgggggccgt cgtcataggt 601 gacagcctca agatcgatgg cgacccggaa cgcgatgaga ttgtaagctg gggttcagga 661 atagcggaca aaatttaaat taatacagct cagctatcct tacgaag // LOCUS HUMCKRASA 450 bp ss-mRNA PRI 27-JUL-1990 DEFINITION Human PR310 c-K-ras protein mRNA, 5' end. ACCESSION M35504 KEYWORDS c-K-ras protein; oncogene. SOURCE Human (patient PR310) lung carcinoma, cDNA to mRNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 450) AUTHORS Yamamoto,F., Nakano,H., Neville,C. and Perucho,M. TITLE Structure and mechanisms of activation of c-K-ras oncogenes in human lung cancer JOURNAL Prog. Med. Virol. 32, 101-114 (1985) STANDARD simple staff_entry FEATURES from to/span description pept 1 > 450 PR310 c-K-ras oncogene BASE COUNT 155 a 71 c 106 g 118 t ORIGIN 1 atgactgaat ataaacttgt ggtagttgga gctggtggcg taggcaagag tgccttgacg 61 atacagctaa ttgacaatca ttttgtggac gaatatgatc caacaataga ggattcctac 121 aggaagcaag tagtaattga tggagaaacc tgtctcttgg atattctcga cacagcaggt 181 catgaggagt acagtgcaat gagggaccag tacatgagga ctggggaggg ctttctttgt 241 gtatttgcca taaataatac taaatcattt gaagatattc accattatag agaacaaatt 301 aaaagagtta aggactctga agatgtacct atggtcctag taggaaataa atgtgatttg 361 ccttctagaa cagtagacac aaaacaggct caggacttag caagaagtta tggaattcct 421 tttattcaaa catcagcaaa gacaagacag // LOCUS HUMCKRASB 450 bp ss-mRNA PRI 27-JUL-1990 DEFINITION Human PR371 c-K-ras oncogene mRNA, 5' end. ACCESSION M35505 KEYWORDS c-K-ras oncogene. SOURCE Human (patient PR371) lung carcinoma, cDNA to mRNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 450) AUTHORS Yamamoto,F., Nakano,H., Neville,C. and Perucho,M. TITLE Structure and mechanisms of activation of c-K-ras oncogenes in human lung cancer JOURNAL Prog. Med. Virol. 32, 101-114 (1990) STANDARD simple staff_entry FEATURES from to/span description pept 1 > 450 PR371 c-K-ras oncogene BASE COUNT 156 a 71 c 105 g 118 t ORIGIN 1 atgactgaat ataaacttgt ggtagttgga gcttgtggcg taggcaagag tgccttgacg 61 atacagctaa ttgacaatca ttttgtggac gaatatgatc caacaataga ggattcctac 121 aggaagcaag tagtaattga tggagaaacc tgtctcttgg atattctcga cacagcaggt 181 caagaggagt acagtgcaat gagggaccag tacatgagga ctggggaggg ctttctttgt 241 gtatttgcca taaataatac taaatcattt gaagatattc accattatag agaacaaatt 301 aaaagagtta aggactctga agatgtacct atggtcctag taggaaataa atgtgatttg 361 ccttctagaa cagtagacac aaaacaggct caggacttag caagaagtta tggaattcct 421 tttattcaaa catcagcaaa gacaagacag // LOCUS HUMTCRAER 237 bp ds-DNA PRI 27-JUL-1990 DEFINITION Human T cell receptor-alpha enhancer region DNA. ACCESSION M35477 KEYWORDS T-cell receptor. SOURCE Human T-lymphocyte DNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 237) AUTHORS Winto,A. and Baltimore,D. TITLE Developmental regulation of the TCR-alpha-delta locus JOURNAL Cold Spring Harb. Symp. Quant. Biol. 54, 87-92 (1989) STANDARD simple staff_entry FEATURES from to/span description binding 28 60 NF-alpha-1 protein binding site binding 128 139 NF-alpha-2 protein binding site binding 159 172 NF-alpha-3 protein binding site binding 175 189 NF-alpha-5 protein binding site binding 192 203 NF-alpha-4 protein binding site mut 180 187 gcagaagc in NF-alpha-5; tactccta in Mut-alpha-5 mut 192 197 tcctct in NF-alpha-4; gaagag in Mut-alpha-4 BASE COUNT 73 a 58 c 57 g 49 t ORIGIN 1 agacagctgc accctgaaat gggtaagctg gtcagatagt gaatcaatag ccagaagtag 61 aacaggaaat ggaaaaagtt tcccacttcc ctccaggtgt ttgggtctga acagcctccc 121 acttccatga cgtcacggct gctgacatgg gcaaacaggt ccccctttga agctctcccg 181 cagaagccac atcctctgga aagaggagtt aaaaatacag agttagagat aagatct // LOCUS HUMTGIGMU 352 bp ss-mRNA PRI 27-JUL-1990 DEFINITION Human Ig mu-chain transgenic mRNA, partial cds. ACCESSION M35478 KEYWORDS immunoglobulin heavy-chain; mu-immunoglobulin; transgenic gene. SOURCE Human leukemic patient mononuclear cell line, cDNA to mRNA, clones pTMMG1-[4,5,12]. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 352) AUTHORS Shimizu,A., Kinashi,T., Nussenzweig,M.C., Mizuta,T.-R., Leder,P. and Honjo,T. TITLE Molecular mechanism for immunoglobulin double-isotype expression JOURNAL Cold Spring Harb. Symp. Quant. Biol. 54, 175-181 (1989) STANDARD simple staff_entry FEATURES from to/span description pept < 1 > 352 mu-immunoglobulin transgene protein (AA at 2) /hgml_locus_uid="LW0029F" /nomgen="IGHM" /map="14q32.33" recomb 226 227 V region end/D region start recomb 274 275 D region end/J6 region start recomb 332 333 J6 region end/C1 region start BASE COUNT 84 a 99 c 104 g 65 t ORIGIN 1 ggcttctgga ggcaccttca gcagctatgc tatcagctgg gtgcgacagg cccctggaca 61 agggcttgag tggatgggag ggatcatccc tatctttggt acagcaaact acgcacagaa 121 gttccagggc agagtcacga ttaccgcgga cgaatccacg agcacagcct acatggagct 181 gagcagcctg agatctgagg acacggccgt gtattactgt gcgaaaaccg ggatcctggg 241 gcggtatagc agtggctggt acccgaactc ggactactac tactacggta tggacgtctg 301 gggccaaggg accacggtca ccgtctcctc agccaaaacg acacccccat ct // LOCUS MUSIGHADD 411 bp ss-mRNA ROD 27-JUL-1990 DEFINITION Mouse Ig mu-chain mRNA VDJ region, 5' end. ACCESSION M35502 KEYWORDS immunoglobulin heavy-chain; processed gene; variable region. SOURCE Mouse hybridoma CP8 B3-D3, cDNA to mRNA, clone pCP12. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 411) AUTHORS Reininger,L., Kaushik,A., Izui,S. and Jaton,J.-C. TITLE A member of a new V-H gene family encodes anti-bromelinized mouse red blood cell autoantibodies JOURNAL Eur. J. Immunol. 18, 1521-1526 (1988) STANDARD simple staff_entry FEATURES from to/span description pept 1 > 411 Ig mu-chain VDJ region precursor sigp 1 57 Ig mu-chain signal peptide matp 58 > 411 Ig mu-chain recomb 351 352 V region end/D region start recomb 360 361 D region end/J1 region start BASE COUNT 101 a 84 c 118 g 108 t ORIGIN 1 atggagtggg aactgagctt aattttcatt tttgctcttt taaaagatgt ccagtgtgaa 61 gtgcagctgt tggagactgg aggaggcttg gtgcaacctg gggggtcacg gggactctct 121 tgtgaaggct cagggtttac ttttagtggc ttctggatga gctgggttcg acagacacct 181 gggaagaccc tggagtggat tggagacatt aattctgatg gcagtgcaat aaactacgca 241 ccatccataa aggatcgatt cactatcttc agagacaatg acaagagcac cctgtacctg 301 cagatgagca atgtgcgatc tgaggacaca gccacgtatt tctgtatgag atatggtaac 361 tactggtact tcgatgtctg gggcgcaggg accacggtca ccgtctcctc a // LOCUS PRW1707PPS 66 bp ds-DNA SYN 27-JUL-1990 DEFINITION Plasmid pRW1707 DNA with polypurine-polypyrimidine sequences. ACCESSION M35455 KEYWORDS . SOURCE Synthetic DNA, clone pRW1707. ORGANISM Artificial gene Artificial sequences; Genes. REFERENCE 1 (bases 1 to 66) AUTHORS Shimizu,M., Hanvey,J.C. and Wells,R.D. TITLE Multiple non-B-DNA conformations of polypurine-polypyrimidine sequences in plasmids JOURNAL Biochemistry 29, 4704-4713 (1990) STANDARD simple staff_entry BASE COUNT 36 a 5 c 21 g 4 t ORIGIN 1 ggatccgaag aagaagaaga agaagaagaa gaattcgaag aagaagaaga agaagaagaa 61 ggatcc // LOCUS PRW1718PPS 72 bp ds-DNA SYN 27-JUL-1990 DEFINITION Plasmid pRW1718 DNA with polypurine-polypyrimidine sequences. ACCESSION M35460 KEYWORDS . SOURCE Synthetic DNA, clone pRW1718. ORGANISM Artificial gene Artificial sequences; Genes. REFERENCE 1 (bases 1 to 72) AUTHORS Shimizu,M., Hanvey,J.C. and Wells,R.D. TITLE Multiple non-B-DNA conformations of polypurine-polypyrimidine sequences in plasmids JOURNAL Biochemistry 29, 4704-4713 (1990) STANDARD simple staff_entry BASE COUNT 38 a 6 c 22 g 6 t ORIGIN 1 ggatccgaag aagaagaaga agaagaagaa gaattaattc gcgaagaaga agaagaagaa 61 gaagaaggat cc // LOCUS PRW1724PPS 66 bp ds-DNA SYN 27-JUL-1990 DEFINITION Plasmid pRW1724 DNA with polypurine-polypyrimidine sequences. ACCESSION M35458 KEYWORDS . SOURCE Synthetic DNA, clone pRW1724. ORGANISM Artificial gene Artificial sequences; Genes. REFERENCE 1 (bases 1 to 66) AUTHORS Shimizu,M., Hanvey,J.C. and Wells,R.D. TITLE Multiple non-B-DNA conformations of polypurine-polypyrimidine sequences in plasmids JOURNAL Biochemistry 29, 4704-4713 (1990) STANDARD simple staff_entry BASE COUNT 19 a 6 c 38 g 3 t ORIGIN 1 ggatccggag gaggaggagg aggaggagga ggatccggag gaggaggagg aggaggagga 61 ggatcc // LOCUS PRW1725PPS 66 bp ds-DNA SYN 27-JUL-1990 DEFINITION Plasmid pRW1725 DNA with polypurine-polypyrimidine sequences. ACCESSION M35459 KEYWORDS . SOURCE Synthetic DNA, clone pRW1725. ORGANISM Artificial gene Artificial sequences; Genes. REFERENCE 1 (bases 1 to 66) AUTHORS Shimizu,M., Hanvey,J.C. and Wells,R.D. TITLE Multiple non-B-DNA conformations of polypurine-polypyrimidine sequences in plasmids JOURNAL Biochemistry 29, 4704-4713 (1990) STANDARD simple staff_entry BASE COUNT 27 a 6 c 30 g 3 t ORIGIN 1 ggatccggag gaggaggagg aggaggagga ggatccgaag aagaagaaga agaagaagaa 61 ggatcc // LOCUS PT1INC3AA 330 bp ds-DNA BCT 27-JUL-1990 DEFINITION Plasmid pT181 (from S.aureus) inc3A region DNA with repC gene, 5' end. ACCESSION M35476 KEYWORDS . SOURCE Plasmid pT181 (from Staphylococcus aureus) DNA. ORGANISM Plasmid pT181 Prokaryota; Bacteria. REFERENCE 1 (bases 1 to 330) AUTHORS Highlander,S.K. and Novick,R.P. TITLE Mutational and physiological analyses of plasmid pT181 functions expressing incompatibility JOURNAL Plasmid 23, 1-15 (1990) STANDARD simple staff_entry FEATURES from to/span description pept 11 < 1 (c) RepC protein mRNA 191 < 1 (c) RepC mRNA III mRNA 291 < 1 (c) RepC mRNA IV RNA 106 190 RNA I RNA 106 250 RNA II binding 27 22 (c) RepC ribosomal binding site (put.) site 151 176 inverted repeat terminator structure site 218 249 inverted repeat terminator structure BASE COUNT 113 a 51 c 37 g 129 t ORIGIN 1 tttttataca taaaaaatcg actccttaat ctcaatttcg tttaaggaat cgctcaccca 61 aatatatatc ttgatgtata tttaaatatc gtttaatatc taaatataca agattataaa 121 aacaactcag tgtttttttc tttgaatgat gtcgttcaca aactttggtc agggcgtgag 181 cgactccttt ttattttgtt attaatataa cactatcaaa agatttggtc taatcagatc 241 aagtcttttt ttatttaagc atttgtatta tctggtaaac agttaaaact actaaaacac 301 caagtacata cttacttgtt ataaaattct // LOCUS RICRGC1 145 bp ds-DNA PLN 27-JUL-1990 DEFINITION Rice 17S rRNA gene. ACCESSION M35382 M35383 KEYWORDS 17S ribosomal RNA. SEGMENT 1 of 4 SOURCE Rice (variety IR20) 48 hr-germinated embryo DNA. ORGANISM Oryza sativa Eukaryota; Plantae; Embryobionta; Magnoliophyta; Liliopsida; Commelinidae; Cyperales; Poaceae. REFERENCE 1 (bases 1 to 145) AUTHORS Hariharan,N. and Padayatty,J.D. TITLE Ribosomal RNA gene repeat units in rice embryos JOURNAL Indian J Biochem Biophys 24, 157-164 (1987) STANDARD simple staff_entry FEATURES from to/span description rRNA 1 > 145 17S rRNA variant 105 106 cg in IR20; cgcggccg in Mangetsumochi variant 112 113 gt in IR20; gcgt in Mangetsumochi BASE COUNT 30 a 40 c 43 g 32 t ORIGIN 1 gatccattgg agggcaagtc tggtgccagc agccgcggta attccagctc caatagcgta 61 tatttaagtt gttgcagtta aaaagctcgt agttggacct tgggcgggcc ggtccgcctc 121 acggcaggca ccgacctgct cgacc // LOCUS RICRGC2 260 bp ds-DNA PLN 27-JUL-1990 DEFINITION Rice 5.8S rRNA gene. ACCESSION M35384 KEYWORDS 5.8S ribosomal RNA. SEGMENT 2 of 4 SOURCE Rice (variety IR20) 48 hr-germinated embryo DNA. ORGANISM Oryza sativa Eukaryota; Plantae; Embryobionta; Magnoliophyta; Liliopsida; Commelinidae; Cyperales; Poaceae. REFERENCE 1 (bases 1 to 260) AUTHORS Hariharan,N. and Padayatty,J.D. TITLE Ribosomal RNA gene repeat units in rice embryos JOURNAL Indian J Biochem Biophys 24, 157-164 (1987) STANDARD simple staff_entry FEATURES from to/span description rRNA 85 247 5.8S rRNA BASE COUNT 61 a 85 c 76 g 38 t ORIGIN About 1.0 kbp after segment 1. 1 cagaacccac ggcgccgacg gcgtcaagga acacagcgag gccctggccg tccggcggcg 61 cgcgcgatac cacgagttaa atccacacga ctctcggcaa cggatatctc ggctctcgca 121 tcgatgaaga acgtagcgaa atgcgatacc tggtgtgaat tgcagaatcc cgtgaaccat 181 cgagtctttg aacgcaagtt gcgcccgagg ccatccggcc gagggcacgc ctgcctgggc 241 gtcacgccaa aagacgctcc // LOCUS RICRGC3 91 bp ds-DNA PLN 27-JUL-1990 DEFINITION Rice 25S rRNA gene. ACCESSION M35385 KEYWORDS 25S ribosomal RNA. SEGMENT 3 of 4 SOURCE Rice (variety IR20) 48 hr-germinated embryo DNA. ORGANISM Oryza sativa Eukaryota; Plantae; Embryobionta; Magnoliophyta; Liliopsida; Commelinidae; Cyperales; Poaceae. REFERENCE 1 (bases 1 to 91) AUTHORS Hariharan,N. and Padayatty,J.D. TITLE Ribosomal RNA gene repeat units in rice embryos JOURNAL Indian J Biochem Biophys 24, 157-164 (1987) STANDARD simple staff_entry FEATURES from to/span description rRNA < 1 > 91 25S rRNA BASE COUNT 30 a 12 c 29 g 20 t ORIGIN About 1.6 kbp after segment 2. 1 gatcttggtg gtagtagcaa atattcaaat gagaacttga agggcgaaga ggagaaaggt 61 tccatgtgaa cggcacttgc acatgggtaa g // LOCUS RICRGC4 161 bp ds-DNA PLN 27-JUL-1990 DEFINITION Rice 25S rRNA gene. ACCESSION M35386 KEYWORDS 25S ribosomal RNA. SEGMENT 4 of 4 SOURCE Rice (variety IR20) 48 hr-germinated embryo DNA. ORGANISM Oryza sativa Eukaryota; Plantae; Embryobionta; Magnoliophyta; Liliopsida; Commelinidae; Cyperales; Poaceae. REFERENCE 1 (bases 1 to 161) AUTHORS Hariharan,N. and Padayatty,J.D. TITLE Ribosomal RNA gene repeat units in rice embryos JOURNAL Indian J Biochem Biophys 24, 157-164 (1987) STANDARD simple staff_entry FEATURES from to/span description rRNA < 1 161 25S rRNA variant 24 25 cc in IR20; ccgc in Mangetsumochi variant 63 68 gccctt in IR20; tccacg in Mangetsumochi variant 128 128 c in IR20; g in Mangetsumochi BASE COUNT 36 a 52 c 48 g 25 t ORIGIN About 250 bp after segment 3. 1 cgcacgtcgc gcggtgtccg gtgcccccgg cggcccttga aaatccggag gaccgagtac 61 cggcccttcc cggtcgtact cataaccgca tcaggtctcc aaggtgaacg acctctggcc 121 aatggaacaa tgtaggcaag ggaagtcggc aaaacggatc c // LOCUS STRRRA 5100 bp ds-DNA BCT 27-JUL-1990 DEFINITION Plasmid pAM-beta-1 (from S.faecalis) replication region DNA. ACCESSION X17092 KEYWORDS . SOURCE plasmid pAM-beta-1 (from Streptococcus faecalis) DNA. ORGANISM Plasmid pAM-beta-1 Prokaryota; Bacteria. REFERENCE 1 (bases 1 to 5100) AUTHORS Swinfield,T.-J., Oultram,J.D., Thompson,D.E., Brehm,J.K. and Minton,N.P. TITLE Physical characterization of the replication region of the Streptococcus faecalis plasmid pAM-beta-1 JOURNAL Gene 87, 79-90 (1990) STANDARD simple staff_entry FEATURES from to/span description pept < 1 476 ORFA (AA at 3) pept 505 873 ORFB pept 933 1799 ORFC pept 2121 2405 ORFF pept 2772 3071 ORFD pept 3114 4604 ORFE pept 4954 5091 ORFG binding 488 496 ORFB ribosomal binding site (put.) binding 919 927 ORFC ribosomal binding site (put.) binding 2759 2765 ORFD ribosomal binding site (put.) binding 3100 3106 ORFE ribosomal binding site (put.) binding 4942 4948 ORFG ribosomal binding site (put.) BASE COUNT 1953 a 806 c 944 g 1397 t ORIGIN 1 gaattcattt gttcatatct attttgactc tttttattgt attattgata ggaataagtg 61 ttattcaaca agtacaactt caatcagtca aaaaagagag cgaattattg gaagaacaaa 121 ttgaacgtgt caaagaaacg gatatttctc aatcgaaaat agatactttt ggaagatact 181 ttttgactta ctatttttca caggaaaaaa atcaagaaaa ttatcagtca agtttaagaa 241 catatgtatc cgaaaaggta gatatatccg attggaaagc gttagggaaa acgcttaaga 301 gtgtcaatta ctatggtagt gaacaaacaa aaaaagggta tagcgttgaa taccttctaa 361 atgtttcagt agacaatcgt agtaaaatgc aaaaaatcac ttttgaggtt gaaccaacta 421 aaaacggctt tcttgtgaca actcaaccaa aacttacaga cttctctttt aactgaaata 481 cgctaaaaag agagtgctta aacaatggga aaaattaatt taaatcaaat ttacacagca 541 aaagaaatga gtgaaaggat aggtaagaat agaaattacc tatcccaagc ttatcgtaac 601 aacaaacatg aaatacttaa aaattttaat tatcgaaaaa ttggtggaac aattattttc 661 tctgataatc ccaataatga tttatcgcaa ttgattacag caaaagaagc aagccagttg 721 ttaggaaaga atgatgaata ctttgcccac atttataaac ggtttcctca tagactagag 781 ggaatcgatc acatttatac aggtaaaaca ttattcttaa cgaaagaatc gctagaagtt 841 tttaaaaaaa agatgaacaa aaatgtcaga taaagcatat ttatgatata ctatttatgt 901 acacaaatat taatttttaa ggagagaaga aaatgaaaaa aattgccctt ttaacacttt 961 ttagtacagc gctattggta tcagccccac ttgttagttt tgcggacgaa gcaacaggta 1021 gttcaggtgt caatgttcta gctgatgatg tagttgatcc agtagaacca accgatccaa 1081 ctactccggt tgatccagtt gatccagttg atccagttga tccagttgat ccagttgatc 1141 cagttgatcc agtcgatcca gttgatccag tcgatccaac agagccaacg gaaccaacag 1201 agccaacgga accaacagag ccaacggaac caacagagcc aacagagcca acggaaccaa 1261 cggaaccaac ggaaccaacg gaaccaacag agccaacgga accaacggaa ccaacagagc 1321 caacggaacc aacagagcca acggaaccaa cggaaccaac agagccaaca gagccaacgg 1381 aaccaacgga accaacagag ccaacggaac caacagagcc aacagagcca acagagccaa 1441 cggaaccaac ggagccaacg gaaccaacgg aaccaagcaa gccgacagag ccaacagaac 1501 caagtaaacc agtagacaca aacccaatcg aaaacccagt taatacagat acaggcgttg 1561 tgattgttgc agttgaagat agtaaaccaa tcattcaatt agcagacgga acaacaaaaa 1621 aagttgaagc taaagaaatt ggcgcagacg ttcaaaaaga cggaacagta acggttaaag 1681 gttcagacgg aaaaatgaaa gttttaccaa aaacaggaga aacagaaaat attgcattgt 1741 ctgtcctagg ttcattaatg gtattaggat cagcattcat ttttaaaaag agaatctaat 1801 taagtatttt taaaagcatg gtcgcaagtt tcactagcag ccatgctttt tttgaatctt 1861 tgtaggacgt tttaaagaga attacccact ttaaaattaa acatgcgctt agaatcgctt 1921 taggaaacac gatccagtcc aataatcgtc gataaaaact tttgaaaaag gttggtgaaa 1981 ttacctactt ttggaataat cacaaatcac aagtgattaa tcacaaatca caagtgatta 2041 atcacttgtt tattaagata ttaaaagcta taatttaaat aaagcgtgaa ttttattaca 2101 caaaaagagg ggggagaaac ttggaactag catttagaga aagcttaaaa aagatgagag 2161 gtaccaaatc aaaagaaaaa ttctcccaag aattagaaat gagtagatca aattattcac 2221 gaatagaatc aggaaaatca gatccaacca taaaaacact agaacaaatt gcaaagttaa 2281 ctaactcaac gctagtagtg gatttaatcc caaatgagcc aacagaacca gaaccagaaa 2341 cagaatcaga acaagtaaca ttggatttag aaatggaaga agaaaaaagc aatgacttcg 2401 tgtgaataat gcacgaaatc gttgcttatt tttttttaaa agcggtatac tagatataac 2461 gaaacaacga actgaataga aacgaaaaaa gagccatgac acatttataa aatgtttgac 2521 gacattttat aaatgcatag cccgataaga ttgccaaacc aacgcttatc agttagtcag 2581 atgaactctt ccctcgtaag aagttattta attaactttg tttgaagacg gtatataacc 2641 gtactatcat tatataggga aatcagagag ttttcaagta tctaagctac tgaatttaag 2701 aattgttaag caatcaatcg gaaatcgttt gattgctttt tttgtattca tttatagaag 2761 gtggagtttg tatgaatcat gatgaatgta aaacttatat aaaaaatagt ttattggaga 2821 taagaaaatt agcaaatatc tatacactag aaacgtttaa gaaagagtta gaaaagagaa 2881 atatctactt agaaacaaaa tcagataagt atttttcttc ggagggggaa gattatatat 2941 ataagttaat agaaaataac aaaataattt attcgattag tggaaaaaaa ttgacttata 3001 aaggaaaaaa atctttttca aaacatgcaa tattgaaaca gttgaatgaa aaagcaaacc 3061 aagttaatta aacaacctat tttataggat ttataggaaa ggagaacagc tgaatgaata 3121 tcccttttgt tgtagaaact gtgcttcatg acggcttgtt aaagtacaaa tttaaaaata 3181 gtaaaattcg ctcaatcact accaagccag gtaaaagcaa aggggctatt tttgcgtatc 3241 gctcaaaatc aagcatgatt ggcggtcgtg gtgttgttct gacttccgag gaagcgattc 3301 aagaaaatca agatacattt acacattgga cacccaacgt ttatcgttat ggaacgtatg 3361 cagacgaaaa ccgttcatac acgaaaggac attctgaaaa caatttaaga caaatcaata 3421 ccttctttat tgattttgat attcacacgg caaaagaaac tatttcagca agcgatattt 3481 taacaaccgc tattgattta ggttttatgc ctactatgat tatcaaatct gataaaggtt 3541 atcaagcata ttttgtttta gaaacgccag tctatgtgac ttcaaaatca gaatttaaat 3601 ctgtcaaagc agccaaaata atttcgcaaa atatccgaga atattttgga aagtctttgc 3661 cagttgatct aacgtgtaat cattttggta ttgctcgcat accaagaacg gacaatgtag 3721 aattttttga tcctaattac cgttattctt tcaaagaatg gcaagattgg tctttcaaac 3781 aaacagataa taagggcttt actcgttcaa gtctaacggt tttaagcggt acagaaggca 3841 aaaaacaagt agatgaaccc tggtttaatc tcttattgca cgaaacgaaa ttttcaggag 3901 aaaagggttt aatagggcgt aataacgtca tgtttaccct ctctttagcc tactttagtt 3961 caggctattc aatcgaaacg tgcgaatata atatgtttga gtttaataat cgattagatc 4021 aacccttaga agaaaaagaa gtaatcaaaa ttgttagaag tgcctattca gaaaactatc 4081 aaggggctaa tagggaatac attaccattc tttgcaaagc ttgggtatca agtgatttaa 4141 ccagtaaaga tttatttgtc cgtcaagggt ggtttaaatt caagaaaaaa agaagcgaac 4201 gtcaacgtgt tcatttgtca gaatggaaag aagatttaat ggcttatatt agcgaaaaaa 4261 gcgatgtata caagccttat ttagtgacga ccaaaaaaga gattagagaa gtgctaggca 4321 ttcctgaacg gacattagat aaattgctga aggtactgaa ggcgaatcag gaaattttct 4381 ttaagattaa accaggaaga aatggtggca ttcaacttgc tagtgttaaa tcattgttgc 4441 tatcgatcat taaagtaaaa aaagaagaaa aagaaagcta tataaaggcg ctgacaaatt 4501 cttttgactt agagcataca ttcattcaag agactttaaa caagctagca gaacgcccta 4561 aaacggacac acaactcgat ttgtttagct atgatacagg ctgaaaataa aacccgcact 4621 atgccattac atttatatct atgatacgtg tttgtttttt ctttgctgtt tagcgaatga 4681 ttagcagaaa tatacagagt aagattttaa ttaattatta gggggagaag gagagagtag 4741 cccgaaaact tttagttggc ttggactgaa cgaagtgagg gaaaggctac taaaacgtcg 4801 aggggcagtg agagcgaagc gaacacttga ttttttaatt ttctatcttt tataggtcat 4861 tagagtatac ttatttgtcc tataaactat ttagcagcat aatagattta ttgaataggt 4921 catttaagtt gagcatatta gaggaggaaa atcttggaga aatatttgaa gaacccgatt 4981 acatggattg gattagttct tgtggttacg tggtttttaa ctaaaagtag tgaatttttg 5041 atttttggtg tgtgtgtctt gttgttagta tttgctagtc aaagtgatta aatagaattc // LOCUS SUSISDNA 241 bp ds-DNA INV 27-JUL-1990 DEFINITION Sea urchin integration site DNA. ACCESSION M34912 KEYWORDS . SOURCE Sea urchin DNA, clone 1. ORGANISM Strongylocentrotus purpuratus Eukaryota; Animalia; Eumetazoa; Echinodermata; Echinozoa; Echinoidea; Echinacea; Echinoida; Strongylocentrotidae. REFERENCE 1 (bases 1 to 241) AUTHORS Flytzanis,C.N., McMahon,A.P., Hough-Evans,B.R., Katula,K.S., Britten,R.J. and Davidson,E.H. TITLE Persistence and integration of cloned DNA in postembryonic sea urchins JOURNAL Dev. Biol. 108, 431-442 (1985) STANDARD simple staff_entry BASE COUNT 70 a 42 c 33 g 96 t ORIGIN 1 gaaaagtaac gatattcgtt tttataaaag gtcattttca cgtagagatg aatttacctg 61 gtactaataa ctgatgcaga gtcgttttca aacagtgtca aatgcaaacc tttttgtctg 121 actggctctt ttttaaattt gtattaatgc cataacactc actttaattt gcgacatggc 181 tctttccatg tcactttttt attcttcaca tgataaatat tctaaaacta tctgttcttt 241 t // LOCUS MSQD7AA 3819 bp ds-DNA INV 27-JUL-1990 DEFINITION A.aegypti D7 gene, exons 1-5. ACCESSION M33156 KEYWORDS . SOURCE A.aigypti (strain Rockefeller) female salivary gland DNA, clone AEGD7. ORGANISM Aedes aegypti Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta; Pterygota; Neoptera; Holometabola; Diptera; Nematocera; Culicoidea; Culicidae. REFERENCE 1 (bases 1 to 3819) AUTHORS James,A.A., Blackmer,K., Marinotti,O., Ghosn,C. and Racioppi,J. TITLE Molecular analysis of a gene expressed in the female salivary glands of the vector mosquito, Aedes aegypti JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by A.A.James, 22-MAR-1990. Author address:A.A.James: U of C, Irvine, Dept Biochem Irvine, CA 92717 FEATURES from to/span description pept 2377 2421 D7 protein, exon 1 2481 2696 D7 protein, exon 2 2757 2975 D7 protein, exon 3 3038 3232 D7 protein, exon 4 3288 3578 D7 protein, exon 5 pre-msg 2350 > 3819 D7 protein mRNA and introns IVS 2422 2480 D7 protein intron A IVS 2697 2756 D7 protein intron B IVS 2976 3037 D7 protein intron C IVS 3233 3287 D7 protein intron D signal 2321 2326 TATA box signal 3634 3639 poly-A signal BASE COUNT 1202 a 721 c 744 g 1152 t ORIGIN 1 gaattcaagg aacctttaaa tgaatctctt gaaatatccc tggaaacatt actcgtgaag 61 tcctggagaa ttttcagtta taaatctggg aagaatcctt gcatgaatcc ctggggtaat 121 ttatgaagaa atcctttgag gtattcttgc aagaatctct tgacgttgaa tgcatttggt 181 ttggcttgaa atattttaag gttttggtga aattagaata cttggagaaa tctgtgataa 241 tattttgaat atattcttga aataatttta gaaagagtta ccgcattaat tggaactgaa 301 atcgctggcg gaatttttgt gaaaatcttc aagaaatttc aaaatgtctc tgaaaaaacc 361 ctgaaggaaa ttactaaagg ataatttgac agattttttt tattattgac tctttattat 421 ggagaaattc agcccttagc tagttcatct ccttgacaga attactgaaa aacgactctg 481 ttttttgtaa tctactaatt cagagtccaa cgctcgggtt ttgaatttcc catatttcct 541 acacaaatcc atgaaagaat aatagtcgag aactgtcaaa aagttatgaa aaattattga 601 aaacggaatc gagactagca ctgtaaaaga gtcatctatt gtcatttcag ttacaagggc 661 caatcaattg acacaaacac aagcttcacc aactcttcca gaaattttaa agcatctact 721 gatattggct tagataattt atacgtattc ctcctagatt tggcccagtt gttagttaca 781 tgaaaggcca aatgttaaag gttaaatatc cgcagagtcc attgtcaatc agtgaagtat 841 tttaagccag catgaaaggg tattatcaac attttttcca aaaccctccg ggttcattta 901 tgctgagaag gaaaaaccga agtgattttt ttgcgctctc gtgctctgca taataacctt 961 accgtccttc aattaaactt tccgtcaaag tcttcatagt ccaagcaata aaaaaaaaga 1021 tgaaaatttc ttcaataagg tatcaccaaa gacacatacc ataccgaggg accaaatgca 1081 gtactagaag tggtaccaaa tcagagcccg agtgggacgg acctggtatc actcgacata 1141 ttcactgttt actgatgtcg tacaaacagc tattcgtcgt aattcttcgt ctgccacttc 1201 aaatgaaaac ctgctaggtg aaacggacga acaatccaaa ctttccttca gcaaatcttg 1261 tcctacccat cagaaagtgt tatttatgag gactcctgag cggtactgca atgctctcat 1321 atgagcgctg gacggcactg acgttcacct tccggatcct cgtggagtgc ttgctgtttc 1381 gacgccatct tcgattgaac tgtcagttat tactctcaac agaagcagta agcttcacct 1441 taaggctaag taacccgtca ttcgttttgg caacaatgat gacttttcag cttgcatttc 1501 aaaatgataa aacgcagtct tgatagttta tactgacttg aaaaagtatc actgtacgca 1561 tttacatgca taaagtatgc tgatactttt tcagctgtgt cagtgcaaaa gcaactgatt 1621 ttctttgatt cgaaatcatg agatgaatta gcaacaacca tcaacgacgc gtacaaattt 1681 taatgacggc ctacttcgcc ttaacagcct tgtcgtgatg acgatccttg cagttgcgac 1741 aggaatactt cgacaaatcc tagctagatt tcccgatctg aagcagtttg ttaaaaaaaa 1801 atattttcag ttgagaaatg attttcctca cctcagagag ttaccggatc taaccgtatt 1861 aaaggttgtc aatcaggata tgcgcatcca gtaccgattc cttggataac gatggcaaat 1921 cgattaaaac ttgtgcctac gcaatttaat agttctagtt gtctctaagc aatctatcag 1981 gtcttttggt tctcgcgaaa tctgattaat tcatcgacat taccgtcgag agcctggagc 2041 ttgatttgcg acaaacatat tgatcgattg cttgaataca taagcttttg ctaagcctat 2101 taatccaata atagatctct tcagagacga atggaagcac accatagtac tttataatat 2161 tgaacatttg ttagtactgt gcacgtgtgc acacattcat gtacatatgg atgatttgat 2221 cagctttact caattctttc ttcacactct gctggttgtt tatatttgcg taaatctatt 2281 atctgttatc agctgactgc taaatcacat tcaacttgca tataaaacgc attaggcctg 2341 tctaacattc atcccgatca tacttacaat actagaatga agctacttct attactcgca 2401 atttttacaa ccttttccgt ggtaagatgt tcctagtgca tccaactctt catttcttaa 2461 caaatactcc accttttaag gttgcctcaa tgggaccatt tgatccggag gagatgctgt 2521 tcatctttac gcgctgcatg gaagacaatt tggaagatgg agcgaatcga cttccaatgt 2581 tagcgaaatg gaaagaatgg attaacgaac cggtagacag ccccgcaact cagtgtttcg 2641 gcaaatgcgt cctggtaaga acaggtctgt acgatccggt agcccaaaag ttcgatgtaa 2701 gttggtggtt gatatccggg tggaattgga taatttcatc gcttcaattg ttgtaggcgt 2761 cggtgatcca ggagcaattt aaggcttatc cgtccttggg ggaaaagagc aaagttgaag 2821 catatgctaa cgcagttaaa cagttgcctt ccacaaataa cgactgtgcc gctgttttca 2881 aagcgtacga tcctgttcat aaggcgcata aggacaccag caagaacttg ttccatggaa 2941 acaaggagtt gaccaagggc ctctatgaga agttggtaag tgtttccgtg ttattcgatt 3001 gaaacaattg tgtgtatgaa catattaatt gcttcaggga aaagacattc gccagaagaa 3061 gcaatcctac ttcgagtttt gcgagaacaa gtactaccca gctgggtcag ataagcgtca 3121 gcaactttgt cagataaggc aatacactgt cttagatgat gcgctgttca aggagcacac 3181 tgattgcgtg atgaagggta ttcgctacat tacgaaggat aatcaactgg atgtaagaaa 3241 ataataaata tcatgcaaac atcatagtaa tcgacatttc ctttcaggtg gaagaggtga 3301 agcgggactt caagctagtg aataaagata cgaaggcact cgaaaaggtt ttgaatgact 3361 gtaagtctaa ggaaccaagc aacgcaaaag agaaatcatg gcactactac aaatgtttgg 3421 tggaatcttc ggttaaggat gatttcaaag aggcttttga ctatcgtgag gtacggtcac 3481 agatttatgc cttcaatttg cccaagaaac aggcttacag caaaccagca gtgcaatctc 3541 aagtgatgga aatcgacggt aaacagtgtc cacaatagaa tagtgcattg aaaatgtctt 3601 ccaattcaat ggtttcatcg aagtagaaag ttaaataaag atataataat catcacgcag 3661 aggaactttg ttttaactta tattgtttat ttggtaattg ttgtagttat ttttaaatta 3721 ttttaatttg ggaagccgaa aaaccgttaa taggaaatta aagtagtttt tgaatcattg 3781 ttattataaa atatcgcata gttctgtact gaagaattc // LOCUS MSQD7AB 1040 bp ss-mRNA INV 27-JUL-1990 DEFINITION A.aegypti D7 mRNA, complete cds. ACCESSION M33157 KEYWORDS . SOURCE A.aigypti (strain Rockefeller) female salivary gland, cDNA to mRNA, clone AEGD7. ORGANISM Aedes aegypti Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta; Pterygota; Neoptera; Holometabola; Diptera; Nematocera; Culicoidea; Culicidae. REFERENCE 1 (bases 1 to 1040) AUTHORS James,A.A., Blackmer,K., Marinotti,O., Ghosn,C. and Racioppi,J. TITLE Molecular analysis of a gene expressed in the female salivary glands of the vector mosquito, Aedes aegypti JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by A.A.James, 22-MAR-1990. Author address:A.A.James U of C, Irvine, Dept Biochem Irvine, CA 92717 FEATURES from to/span description pept 13 978 D7 protein signal 1016 1021 poly-A signal BASE COUNT 335 a 208 c 251 g 246 t ORIGIN 1 gaattcccta gaatgaagct gcctctatta ctcgcaatag ttacaacctt ttccgtggtt 61 gcttcaacgg gaccatttga tccggaggag atgctgttca cctttacgcg ctgcatggaa 121 gacaatttgg aagatggacc gaatcgactt ccaatgttag cgaaatggaa agaatggatt 181 aacgaaccgg tagacagccc cgcaactcag tgtttcggca aatgcgtcct ggtaagaaca 241 ggtctgtacg atccggtagc ccaaaagttc gatgcgtcgg tgatccagga gcaatttaag 301 gcttatccgt ccttggggga aaagagcaaa gttgaagcat atgctaacgc agttcaacag 361 ttgccttcca caaataacga ctgtgccgct gttttcaaag cgtacgatcc tgttcataag 421 gcgcataagg acaccagcaa gaacttgttc catggaaaca aggagttgac caagggcctc 481 tatgagaagt tgggaaaaga cattcgccag aagaagaaat cctacttcga gttttgcgag 541 aacaagtact acccagctgg atcagataag cgccagcaac tttgtaagat aaggcaatac 601 actgtcttag atgatgcgct gttcaaggag cacactgatt gcgtgatgaa gggtattcgc 661 tacataacga agaataatga actggatgct gaagaggtga aacgggactt caagctagtg 721 aataaagata cgaaggcact tgaaaaggtt ttgaatgact gtaagtctaa ggaaccaagc 781 aacgcaaaag agaaatcatg gcactactac aaatgtttgg tggaatcttc ggttaaggat 841 gatttcaaag aggcttttga ctatcgtgag gtacggtcac agatttatgc cttcaatttg 901 cccaagaagc aggcttacag caaaccagca gtgcaatctc aagtgatgga aatcgacggt 961 aaacagtgtc cacaatagaa tagtgcattg aaaatgtctt ccaattcaat ggttaaataa 1021 agaaataatg atcatcacgc // LOCUS ADGLS 1477 bp ds-DNA VRL 27-JUL-1990 DEFINITION Adenovirus type 7 (simian) left end of genome: 0.0-4.2 map units. ACCESSION K03325 KEYWORDS alternate splicing; terminal repeat. SOURCE Adenovirus type 7 (simian) DNA. ORGANISM Mastadenovirus s7 Unclassified. REFERENCE 1 (bases 1 to 1477) AUTHORS Kimelman,D., Miller,J.S., Porter,D. and Roberts,B.E. TITLE E1a regions of the human adenoviruses and of the highly oncogenic simian adenovirus 7 are closely related JOURNAL J. Virol. 53, 399-409 (1985) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly provided by D.Kimelman (15-AUG-1985). The mRNA transcripts (mRNAs I and II) were determined by mRNA-DNA hybridization and S1 nuclease digestion. The peptide coding regions were deduced by analogy to other adenoviruses and following the most likely mode of translation. The intron of the mRNA III transcribed from the E1a of other adenoviruses has not yet been determined by S1 mapping. Enhancer sequences are located at positions 165-173 and 265-270. A TATA box is located at position 408-413. A polyadenylation signal is located at posityion 1447-1452. FEATURES from to/span description pept 486 837 E1a 28 K protein from mRNA I, exon 1 1178 1407 E1a 28 K protein from mRNA I, exon 2 pept 486 1056 E1a 24 K protein from mRNA II, exon 1 1178 1407 E1a 24 K protein from mRNA II, exon 2 pre-msg 439 1477 E1a mRNA IVS 838 1177 E1a 24K pept intron A IVS 1057 1177 E1a 28K pept intron A BASE COUNT 331 a 334 c 442 g 370 t ORIGIN 5' end of the l strand of the genome. 1 tctatataat ataccttatt tgggaacggt gccaatatgc taatgaggtg ggcggagttt 61 ggtgacgtat gcggaaatgg gcggagttag gggcggggtt tggcggtagg cgtggctggg 121 ggagtgtccg ggcgtgggaa cggaagtgac gtagggggcg cgccggaggt gacgtcgtgt 181 ggggagtttt aaaccggaag caaggtattt taaacgcttg caagcgcaat tttgtcggtt 241 ttggcgcgaa aactgataaa aagcggaagt tcggttaatc attaattttt acgataggga 301 ggaatattta ccgagggccg gtgaactttg agcggtgacg cggtggtttc gttacgtggc 361 accaccacgc gactgctcaa agtccccgtt tattgtctag gtgagggtat ttaaaccggc 421 tcagaacgtc aagaggccac tcttgagtgc ccgcgagaag agctttctcc tctttcgctg 481 cgaaaatgag acacttggcg ttggaaatga tttctgaact gctggattta ggactggata 541 ccattgatgg ctggctgcac accgaatttc ggccggtacc ggcgggggtg agtcataaca 601 tgtcgctgca cgaaatgtac gacctggacg ttaccggcca ggaggatgag aacgaagagg 661 cggtagatgg tgttttttcc gatgcgatgc tcctggccgc ggaggaggga atagaaatgc 721 ctaatcttta ttctccggga cctctggttg ggggaggtga aatgcctgaa cttcagcctg 781 aggaggaaga tcttttctgc tacgaagatg gcttccctcc cagtgactct gaggaaggtg 841 agcattcgca ggtggagaca gaacgtaaaa tggcggaggc ggcggcagca ggtgcggcgg 901 cggccgcgcg gcgggagcaa gatgactttc gcttagactg tcctagcgta cctggccatg 961 gctgtagctc ctgtgactac catcgcaaaa ctagcggctg tcctgaaatt ctgtgctcgc 1021 tgtgctatct gagggctaac agcatgttta tttatagtaa gtaaattttt tctactaact 1081 ttctcgttgt gtgtttgctc gctcgctcgc tcgcaactgc tggggtgctt gctgttggga 1141 ctgagcttac aggtattttc tctgtaattt tccataggtc cagtttctga ctctgagcca 1201 gacgagcccg actccacaac agctgattca aatcatggca gcccgccaac ccttcgctgc 1261 accccaccca gggacttgcc gcgacctgtg ccagtgaagg cctctcctgg caagcgccca 1321 gcggtgaaca gcttgcatga cctcatagag gaggttgaac aaacagtacc tttggacctg 1381 tccctaaagc gctctaggag caattagggt tataaaaccc ctccccttcc ccttaagtta 1441 tcaggaaata aaaagattaa ctggattctt tgtgcct // LOCUS ADGRS 116 bp ds-DNA VRL 27-JUL-1990 DEFINITION adenovirus type 7 (simian) right end of genome. ACCESSION V00035 J01991 KEYWORDS terminal repeat. SOURCE adenovirus type 7 (simian) dna. ORGANISM Mastadenovirus s7 Unclassified. REFERENCE 1 (bases 1 to 116) AUTHORS Broker,T.R. TITLE terminal nucleotide sequences and restriction endonuclease cleavage maps of simian and murine adenoviruses JOURNAL (in) Tooze,J. (Ed.); DNA Tumor Viruses, Second Edition Revised: 1037-1046; Cold Spring Harbor Laboratory, Cold Spring Harbor (1981). STANDARD simple staff_review REFERENCE 2 (bases 1 to 116) AUTHORS Tolun,A., Alestroem,P. and Pettersson,U. TITLE sequence of inverted terminal repetitions from different adenoviruses: demonstration of conserved sequences and homology between sa7 termini and sv40 dna JOURNAL Cell 17, 705-713 (1979) STANDARD simple staff_review COMMENT L-strand is given, from 5' to 3'. BASE COUNT 25 a 15 c 46 g 30 t ORIGIN 1 atcaataata taccttattt gggaacggtg ccaatatgct aatgaggtgg gcggagtttg 61 gtgacgtatg cggaaatggg cggagttagg ggcggggttt ggcggtaggc gtggct // LOCUS ADGSA7PE1 2338 bp ds-DNA VRL 27-JUL-1990 DEFINITION Simian adenovirus SA7P, leftmost 2338 bp. ACCESSION X01027 KEYWORDS E1A protein; E1b protein; inverted terminal repeat; overlapping genes; transforming gene. SOURCE Simian adenovirus SA 7P. ORGANISM Mastadenovirus s7 Unclassified. REFERENCE 1 (bases 1 to 2338; enum. 1 to 2338) AUTHORS Dekker,B.M.M., Konings,D.A.M., Denisova,T.S., Gibadulin,R.A. and Van Ormondt,H. TITLE The nucleotide sequence of the leftmost XhoI fragment (6%) of simian adenovirus SA7P JOURNAL J. Gen. Virol. 65, 1699-1708 (1984) STANDARD simple automatic COMMENT **map: leftmost XhoI fragment (6%); EMBL features not translated to GenBank features: key from to description SITE 1 183 inverted terminal repeat (ITR) of SA7P PRM 409 416 TATA E1A region CAP 440 440 cap site E1A RNAs MSG 440 838 "12S" E1A RNA MSG 440 838 1st exon "12S" E1A RNA IVS 839 1175 intron "12S" E1A RNA MSG 1176 >1445 2nd exon "12S" E1A RNA MSG 440 1057 1st exon "13S" E1A RNA IVS 1058 1175 intron "13S" E1A RNA MSG 1176 >1445 2nd exon "13S" E1A RNA SITE 1445 1450 AATAAA E1A PRM 1508 1514 TATA E1B CAP 1538 1538 cap E1B RNA's This virus, an independent isolate from Moscow, USSR, is highly related to simian adenovirus 7 (SA7). The E1A sequences are 95% homologous to those of SA7 see $$ J. VVrol. 5:399-409(1985) FEATURES from to/span description pept 487 838 E1A 21K protein (part 1) 1176 / 1401 E1A 21K protein (part 2) pept 487 1057 E1A 29K protein (part 1) 1176 / 1401 E1A 29K protein (part 2) pept 1569 2111 pot. 21K E1B protein pept 1874 > 2338 pot. 55K E1B protein BASE COUNT 514 a 535 c 725 g 564 t ORIGIN 1 catcatcaat aatatacctt atttgggaac ggtgccaata tgctaatgag gtgggcggag 61 tttggtgacg tatgcggaag tgggcggagc aaggggcggg gcgagaggcg gggcttttgg 121 taggcgtggc cgggcgtggg aacggaagtg acgtcggggg cgcgccggac gtgacgtgtt 181 ttgaggcgtt ttaaaccgga agcaaggtat tttaaacgct tgcagcgcaa ttttgccggt 241 tttggcgcga aaactgataa aaagcggaag ttcggttaat cattaatttt tacgataggg 301 aggaatattt accgagggcc ggtgaacttt gagcgatgac gcggtggttt cgttacgtgg 361 caccaccacg cgactgctca aagtccccgt ttattgtcta ggtgagggta tttaaaccgg 421 ctcagaccgt caagaggcca ctcttgagtg cccgcgagaa gagctttctc ctctttcgct 481 gcgaaaatga gacacttggc gttggagatg atgtctgaac tgctggattt aggactggat 541 accatcgata gctggctgca caccgaattc gcgccggtac cgacgggggt gagtcataac 601 atgtcgctgc acgaaatgta cgacctggac gttaccggcc aggaggatga gaacgaagag 661 gcggtagacg gtgttttttc cgatgcgatg ctcctggccg ctgaagaggg agtagaaatg 721 cctagtcttt attctccggg acctctggtt gggggaggtg agatgcctga gctacagcct 781 gaggaggtag atcttttctg ctacgaagat ggcttccctc ccagtgactc tgaggaaggt 841 gagcattcgc aggtggagac ggaacgtaaa atggcggagg tggcggcggc aggtgcggcg 901 gcggccgtcc gcggggagca agatgacttt cgcttagact gtcctagcgt acctggccat 961 ggctgtagct cctgtgacta ccatcgcaaa aatagcggct gtcctgaaat tctgtgctcg 1021 ctgtgctatc tgagggctaa cagcatgttt atttatagta agtgaatttt tctactaact 1081 ttctcgctgt gtgtttgctc gctcgttcgc taactgctgg ggtgcttggt gttgggactg 1141 agcttacagg tatttttctc tgtaattttc cacaggtcca gtttctgact ctgagccaga 1201 cgagcctgac tccacaacag ctgattcaaa tcatggcagc ccgccaaccc ttcgctgcac 1261 cccacccagg gacttgccgc gacctgtgcc agtgaaggcc tctcatggca agcgcccagc 1321 ggtgaacagc ttgcatgacc tcatagagga ggttgaacaa acagtacctt tggacctgtc 1381 cctaaagcgc tctaggagca attagggtca taaaacccct ccccttcccc ttaagttata 1441 aggaaataaa aagattaact ggattctttg tgcctgcttt gtttgttcgt gggcggtcct 1501 tggggattat aaaaggggtg agtcaggtta taagggtact taggcaacgc tcaggtaaca 1561 tcgcctccat ggatctccga acggcgcttc agacttttga gagcacccgc cgcttgctgg 1621 agctctgttc caatagaacc tcttttttgt ggaggtggtt atttggaact ccgctcagcc 1681 ggctggttag gcaggtgaaa ttagaatacg agaaggattt tgaaagaatt ttagatcagt 1741 gtcccggggt gtttgagtcc ctggagctgg gctatcataa ggtttttgag gagaagattg 1801 taaaggagtt ggatttttct tctcccggtc gggcggtcgc ggctgtggcc tttgcttcct 1861 acctgctgga tagatggaac acccggaccc acctgtcccc ggggtaccag atggattaca 1921 tcagcctgaa cctgtggaag ttttggttgc gccggcgggt ttacaattac tcgcgggggc 1981 tgcctcagct agggccggcg gcgccgctgg cgaggcaggg gtcgcagcag gaggagcagc 2041 agcagcggca ggaggaggag caggtgcagg aggagatgag gtccggcctg gaccctccaa 2101 cggagaactg atggcgcagg tggccggcgc ggcggatgcc agcgggccgc cttccaagcg 2161 ccctaagaat gaggcagagc agatgcagga gggtttaact cagttgaccc tgagtctcat 2221 taacaagtat cggcctgaga ccgtgtactg gaatgaactg gagcaggaat ttcagcatgg 2281 ggagatgcac ttgcagttcc actttagctt tgagcaatta aaaactcact ggctcgag // LOCUS ADGVARNA 850 bp ds-DNA VRL 27-JUL-1990 DEFINITION Simian adenovirus type 7 (SA7) virus-associated (VA) RNA gene. ACCESSION M14082 KEYWORDS virus-associated RNA. SOURCE Simian adenovirus type 7 DNA. ORGANISM Mastadenovirus s7 Unclassified. REFERENCE 1 (bases 1 to 850) AUTHORS Larsson,S., Svensson,C. and Akusjaervi,G. TITLE Characterization of a low-molecular-weight virus-associated (VA) RNA encoded by simian adenovirus type 7 which functionally can substitute for adenovirus type 5 VA RNA-1 JOURNAL J. Virol. 60, 635-644 (1986) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly provided by S.Larsson, 15-JAN-1987. Simian adenovirus type 7 VA RNA stimulates protein synthesis in a transient expression assay and can functionally substitute for VA RNA-I during lytic growth of human Ad5. The SA7 genome encodes only one VA RNA species. FEATURES from to/span description RNA 181 352 virus-associated RNA (5' and 3' ends +/-2bp) ORF 105 < 1 (c) ORF A cds ORF 366 > 850 ORF B cds BASE COUNT 140 a 260 c 338 g 112 t ORIGIN 150 bp upstream of XbaI site; 30 mu on viral map. 1 gtagatgtag cgcgacatcc aggtgaggcc ggcggcggtg gtggcggcgc gggtgaactc 61 gcggacgcgg ttccagaggt tgcgcagggg ggcgaagcgt tgcatggtgg tcacgctctg 121 gccggtgaga cgggcgcagt cctgcacgct ctagacggaa acagagagag gcgtcaccga 181 ctcctctccg tagcttgggg gttaggtcgc aagggtgcgg tggcggggaa ccccggttcg 241 aaaccggccg gatccgccac tcccgacgcg gcggccccgc gtccacgacc ccaccagagg 301 ccgagaccca gccgcgacgc gcgcacccca aatacggagg ggagtctttt ggtgcttttt 361 tgtagatgca tcccgtgctg cggcagatgc gccccacgcg gggggcctcc tcgcgggaag 421 aagagctgga ggcgcaggag gaggcgcgcg gtggcgccgc gggccccgcc gagctggagc 481 tggaggaggg cgaaggcttg gcgcggctgg gcgccggagt gccggagcgc cacccgcggg 541 tgcaactggc tcgcgaccag cgccaggcct acgtgccgcc gcagaacctg tttagggacc 601 ggagcgggga ggaggcggag gaaatgcgtg actgccgttt tcgggccggg cgggagctgc 661 gggcgggctt ggagcgcgag cggctgctgc ggcccgagga ctttgagccc gaggagcgca 721 acggtatcag tccggcgcgc gcgcacgtct cggccgccaa cctggtgacg gcctacgagc 781 agacggtgaa cgaggagcgc aactttcaaa agagcttcaa caaccacgtg cgcaccctcg 841 tggccccggg // LOCUS ADZITRL 150 bp ds-DNA VRL 27-JUL-1990 DEFINITION adenovirus 31 inverted terminal repeat & repl ori left end. ACCESSION J01958 KEYWORDS origin of replication; terminal repeat. SOURCE ad31 (adenovirus 31). ORGANISM Mastadenovirus 31 Viridae; ds-DNA nonenveloped viruses; Adenoviridae. REFERENCE 1 (bases 1 to 150) AUTHORS Stillman,B.W., Topp,W.C. and Engler,J.A. TITLE conserved sequences at the origin of adenovirus dna replication JOURNAL J. Virol. 44, 530-537 (1982) STANDARD simple staff_review BASE COUNT 38 a 29 c 48 g 35 t ORIGIN 1 catcatcaat aatatacctt acactggact tgagccaata ttaaaatgaa gtgggcggag 61 tgaatagtta attgaccgta ggcgtggttt gcaagtttgc cgaagccgga tgtgacgcgt 121 gtgggagccg ggcgcgccgg atgtgacgcg // LOCUS ADZITRR 150 bp ds-DNA VRL 27-JUL-1990 DEFINITION adenovirus 31 inverted terminal repeat & repl ori right end. ACCESSION J01959 KEYWORDS origin of replication; terminal repeat. SOURCE ad31 (adenovirus 31). ORGANISM Mastadenovirus 31 Viridae; ds-DNA nonenveloped viruses; Adenoviridae. REFERENCE 1 (bases 1 to 150) AUTHORS Stillman,B.W., Topp,W.C. and Engler,J.A. TITLE conserved sequences at the origin of adenovirus dna replication JOURNAL J. Virol. 44, 530-537 (1982) STANDARD simple staff_review BASE COUNT 40 a 28 c 47 g 35 t ORIGIN 1 catcatcaat aatatacctt acactggact tgagccaata ttaaaatgaa gtgggcggag 61 tgaatagtta attgaccgta ggcgtggttt gcaagtttgc cgaagccgga tgtgacgcgt 121 gtgggagccg ggcgcgccgg atgtgacgaa // LOCUS HSKGAORA 3726 bp ds-DNA VRL 27-JUL-1990 DEFINITION Herpesvirus of turkey glycoprotein A (gA) gene and ORF of unknown function. ACCESSION M33303 KEYWORDS glycoprotein A. SOURCE Herpesvirus of turkey DNA. ORGANISM Gallid herpesvirus type 2 Viridae; ds-DNA enveloped viruses; Herpesviridae; Gammaherpesviridae. REFERENCE 1 (bases 1 to 3726) AUTHORS Kato,A., Sato,I., Ihara,T., Ueda,S., Ishihama,A. and Hirai,K. TITLE Homologies between herpesvirus of turkey and Marek's disease virus type-1 DNAs within two co-linearly arranged open reading frames, one encoding glycoprotein A JOURNAL Gene 84, 399-405 (1989) STANDARD simple staff_entry FEATURES from to/span description pept 207 1676 glycoprotein A pept 2118 2756 ORF2 BASE COUNT 1016 a 884 c 815 g 1011 t ORIGIN 1 tcgtgcgcat cgcccatggc tggcggaacg ctctttcgaa ccgtgaataa aactttgtat 61 ctactaaaca ataactttgt gttttattga gcggtcgaaa acaatgagga gctgcaattt 121 aaagctaacc gcatacgccg ggcgggtaaa gaccatttta taccatatta cgcatctatc 181 gaaacttgtt cgagaaccgc aagtatatgg tttccaacat gcgcgttcta cgcgtactgc 241 gcctgacggg atgggtgggc atatttctag ttctgtcttt acagcaaacc tcttgtgccg 301 gattgcccca taacgtcgat acccatcata tcctaacttt caacccttct cccatttcgg 361 ccgatggcgt tcctttgtca gaggtgccca attcgcctac gaccgaatta tctacaactg 421 tcgccaccaa gacagctgta ccgacgactg aaagcactag ttcctccgaa gcgcaccgca 481 actcttctca caaaatacct gatataatct gcgaccgaga agaagtattc gtattcctta 541 acaatacagg aagaattttg tgtgacctta tagtcgaccc cccttcagac gatgaatggt 601 ccaacttcgc tcttgacgtc acgttcaatc caatcgaata ccacgccaac gaaaagaatg 661 tagaggttgc ccgagtggcc ggtctatacg gagtaccggg gtcggattat gcatacccta 721 ggaaatcgga attaatatcc tccattcgac gggatcccca gggttctttc tggactagtc 781 ctacaccccg tggaaataaa tatttcatat ggattaataa aacaatgcac accatgggcg 841 tggaagttag aaatgtcgac tacaaagaca acggctactt tcaagtgata ctgcgtgata 901 gatttaatcg cccattggta gaaaaacata tttacatgcg tgtgtgccaa cgacccgcat 961 ccgtggatgt attggcccct ccagttctca gcggagaaaa ctacaaagca tcttgcatcg 1021 ttagacattt ttatcccccg ggatctgtct acgtatcttg gagacgtaac ggaaacattg 1081 ccacaccccg caaggaccgt gacgggagtt tttggtggtt cgaatctggc cgcggggcca 1141 cactagtatc cacaataacc ctcggaaact ctggactcga atctcctcca aaggtttcct 1201 gcttggtagc gtggaggcaa ggcgatatga taagcacatc gaatgctaca gctgtaccga 1261 cggtatatta tcacccccgt atctctctgg catttaaaga tgggtatgca atatgtacta 1321 tagaatgtgt tccctctggg attactgtga ggtggttagt tcatgatgaa ccccagccta 1381 acacaactta tgatactgtg gttacaggtc tctgcaggac catcgatcgt tatagaaatc 1441 tcgccagtcg gattccagtc caggacaact gggcgaaaac gaagtatacg tgcagactaa 1501 ttggatatcc gttcgacgtg gatagatttc aaaattccga atattatgat gcaacgccgt 1561 cggcaagagg aatgccgatg attgtaacaa ttacggccgt tctaggactg gccttgtttt 1621 taggtattgg tatcattatc acagccctat gcttttacct accggggcgg aattaagatt 1681 aaccatcgta tgtgatataa aaattattaa gtgttataac cgatcgcatt cttctgtttc 1741 gattcacaat aaataaaatg gtattgtaat cagcaccatc gcattgtttc gtagatgact 1801 catgttcagt ccgcgtgatg tcaaaaatac gtatttttgg tatcacgcag cggccaaaat 1861 gcccattatg ttatttttac tccaaacgcg gtatttaaaa catcgggacg tacatcatgt 1921 ggcgcacgtt aatcgtatac ggtgccgcta cattaaaaat cgcaagtctc cgaatatcaa 1981 gctcacggcc aaaacgtcgg taataatctt acgcatcgaa tgtgatacgg ataccgtaca 2041 atcgctgagt agatttccta tatagttact cagtagtgat acacaatcac aaaatcgctg 2101 gggtatatca tataagaatg atgtcgccca cccctgaaga tgatcgcgat ctcgttgtgg 2161 ttcgtggacg tctccgaatg atggatagcg gcacggaaac agatagagag caacgacatc 2221 cacgtacgac ttggcgatcg atctgttgtg ggtgtacgat aggaatggta tttaccatat 2281 tcgttctcgt agcggcagta ttgttgggat cactattcac tgtttcatac atggccatgg 2341 aatcgggaac atgtcccgat gaatggattg gtttgggtta tagttgcatg cgcgtggccg 2401 ggaaaaatgc aactgatctt gaggcgttgg atacatgtgc tcggcataac agcaaactta 2461 ttgacttcgc aaacgccaaa gttctggttg aagctatcgc cccattcggt gtgccaaatg 2521 cagcatatgg ggaagtcttc cggttaaggg acagcaaaac cacgtgtata cgacctacca 2581 tgggaggacc cgtgtcggca gactgtcctg taacatgtac cgttatatgt cagcgaccca 2641 ggcctctaag taccatgtct tccatcatta gagatgcccg cgtgtatctt catttagaac 2701 gacgcgatta ttatgaagtc tacgcctctg tcctctctaa tgcgatgagt aaataaaaac 2761 gcacctctaa cggttactgt gttttattta tccaatcaca ccatagacat tataaacata 2821 atatggatct ttatttcata taatgtatta tacagaccgc gcatcgcgtg taaaatatta 2881 ttcgctggac actttcagcc tacgtaaaat atgttctacc ttggacactg gtatatctcc 2941 acgcattgtt ttttcatgaa agtatttggt tgtcggagta gacaccctgc cagattccga 3001 aactggaggg tcattgctcg ccaaagaagg ccccccagac ccatcggtct tacgaattgc 3061 accagtcaga ccgtttccga tgttcgcaca ttgagaactc gaccccgccc ccgcaaattc 3121 agtcaaacca ggaaccgtat cgccacagcc acattcgaca cgatcgccta attgatgtgt 3181 gggcgtactt cccgacttgt tcgagttttt atcttgcttc cgaatagatt tcataagccg 3241 tgcattaaca tcgcttattc gctttggcag taattttatc cccagtagag ccagggcgtt 3301 cttagacagt ttaacatcat cggttgcagc gcggccgaac caagtattta tgctgcactc 3361 gagagccccc catgacacgg gcaacatgct cggaaacaaa ttcccaagct gatgttcaaa 3421 ttggtcttgc tgcaacaggg cctcttttaa tatagtattt tccgtattac cgtccaacca 3481 tatcacatat cccctctgag ttaaattaat taggtactga caatgatggt tgagaagtga 3541 aaccaacgaa accactgcta gtatggcacg tgagttccga tcaccatcaa agtccaccag 3601 gcgggtttgc gaccataata cgctggtttg tagtaagggg gcgatgacat tgcctataat 3661 tcgacaaagc aatgatgtcg aactccattt gtctgatggt aaggaggaaa tttcttccat 3721 gaattc // LOCUS HSKTK 2039 bp ds-DNA VRL 27-JUL-1990 DEFINITION Herpesvirus of turkey thymidine kinase gene, complete cds. ACCESSION M26659 KEYWORDS thymidine kinase. SOURCE Herpesvirus of turkey (strain FC126) DNA, passed in chicken embryo fibroblasts. ORGANISM Gallid herpesvirus type 2 Viridae; ds-DNA enveloped viruses; Herpesviridae; Gammaherpesviridae. REFERENCE 1 (bases 1 to 2039) AUTHORS Martin,S.L., Aparisio,D.I. and Bandyopadhyay,P.K. TITLE Genetic and biochemical characterization of the thymidine kinase gene from herpesvirus of turkeys JOURNAL J. Virol. 63, 2847-2852 (1989) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence [1] kindly submitted by S.L.Martin, 03-AUG-1989. FEATURES from to/span description pept 608 1540 thymidine kinase (EC 2.7.1.21) BASE COUNT 570 a 458 c 437 g 574 t ORIGIN 649 bp upstream of XbaI site. 1 atcgatgaaa gtctgtacat atagccttct tttttaatgg tgtcgaacgt ataactacaa 61 gcccgcgtct cgcgacaaac actaaaacgg ggcaaatgat agtatgatca gatcccctgg 121 gggccatgcg ctcgatcaac gttttagact ccagcaactg cttagtccct gttgcgcgtt 181 gctcattctt gctggccgtc ttgaggttgc ggtaaaatct gcaagttttc agttcgatta 241 tgacacagac attgttacag ccttccacca ataacccgcc agataaggtg aacatgcaaa 301 tgcaatctgg ccttctgcgc cccaggtcta cttcaaaggc caatcgaaac gccgacgccc 361 ctttaagtag ttcagaaggc aggatcttgc ttaatatctt ctgtaatgtt ctaccaactg 421 tattccccgc cgacgactgg tcaaagtttt cagcagcaag ctttttgtaa aaacgattat 481 gaccacggac acccgctttt agcaatcctg ccataaggtg gtttcgccgc gtgcttgcct 541 cgaagacaat tgccagctaa tccagcatta ccatatttcc ttggcttgca tttggatctg 601 cgcgtcgatg gcattgccga gaagaccgcc cacgttaacg cgagtttatc tagacggacc 661 gtttggtata ggcaaaacgt ctatactaaa cgctatgccc gaccacacgc ccgatggggc 721 tcctatattg aaagtgtacg aaccaatgaa atattggaga tgccagtcta ccgatttggt 781 ggtagctgcc aacgaaacgc cagaacgtag gcgtggtgga gctttatcac gattccaatc 841 tgacatgatc atggcatcta tacaagccag atttgccgat ccatatttgc tttttcacga 901 acggttatca tctaaatgta gaggaaaaat agaaatatgc gatactccag caattatatt 961 aatgctggat aggcaccctg tggcggcgat attatgtttc ccaatcactc gctatttact 1021 tggagaatat tctttggaaa tgttgattag ctctataata agacttccgt tggaatcccc 1081 cggatgcaac ctgacagtca caatccttcc cgacgaaaag gaacacgtta ataggatttg 1141 ttcaagagat agaccgggtg aaacggcaga tagaaatatg ctcagaacac tcaatgccgt 1201 atacgcatct ttggtggaca cggttaaata cgcaaatcta acatgccctt acgagaaaga 1261 aagctgggaa atggaatggt tgggacttcc ctggtttgaa gagtcattac ttgaagaatt 1321 catctcgcgt ccccgccctg ttatttgttc gagaactcga atgccgctgg accgaactct 1381 cctggccatt tttaaacgga aagagctgtg tagcgaaaat ggggagctgt taactcagta 1441 ttcttggata ttgtggggat tactgactaa actacacacc attaatgtcg aattatttga 1501 cattagcggt atgtcacgtc gagaatgcgc cacgctataa tgcatactat gccggagaga 1561 ttgtctactc tcgctagctg gaatgattta tgcgagcttg aagatgatgt aatttcctat 1621 aataagggaa tgtgtaacga ggttggagcg tctcgataat tcttcttaat ctgctggtat 1681 tggttactgc cataacttaa tattggtcca tgctagaata gtcatacgct acgatctgtt 1741 gctatatatg actatcgcca aactgttaaa ccgcgaagaa tatatttcat ataaacctaa 1801 gggcccctca gtctgatttt ttgtgaaaac gtgtatacat gaagttttac tgcataatcc 1861 gtttcatgat catagcgaat ctttattcat cttaccaaat atcgcttcca ggcacatatc 1921 catcgcaaat attgcttgac atgaagaact cgccgctcgt acgctttaat atatcgacgc 1981 gtgattataa agacgagaca ctctggatac ggaaaaattc gacatttgtt tatatcgat // LOCUS LACHSP 3095 bp ds-DNA BCT 27-JUL-1990 DEFINITION Plasmid pTR2030 (from L.lactis) aborts phage infection (hsp) gene, complete cds. ACCESSION M30192 KEYWORDS . SOURCE Plasmid pTR2030 (from Lactococcus lactis lactis (strain ME2)) DNA. ORGANISM Lactococcus lactis Prokaryota; Bacteria; Firmicutes; Regular asporogenous rods; Lactobacillaceae. REFERENCE 1 (bases 1 to 3095) AUTHORS Hill,C., Miller,L.A. and Klaenhammer,T.R. TITLE Nucleotide sequence and distribution of the pTR2030 resistance determinant (hsp) which aborts Bacteriophage infection in Lactococci JOURNAL Appl. Environ. Microbiol. 56, 2255-2258 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by C.Hill, 29-NOV-1989. FEATURES from to/span description pept 1105 2991 hsp gene (aborts phage infection) BASE COUNT 1129 a 439 c 518 g 1009 t ORIGIN 1 ataagagtca gaatgtgacc gtcataatga ttacgaggat agctgcaacg cttggtttcg 61 agttaattgt cgtgcaacga ttgacggaga attgaaaaat ttcaaagttc acgatgttga 121 gacttacgat aaaaagaaaa atagatttca tcggagattg tctgatgcct tggtcccaat 181 aatatcttca gaagatgttg aatttgaagc cgagcaattt ttgaaattgt atttcccagt 241 agctatggaa attcctcagc gtattgaccc gcttttgata gctgaaaaaa tgggacttac 301 ggttgagtac catgaaattt cagaagatgg gaacattttt ggtcaaattt attttcatga 361 tgctcttctt gacgggaaag aaatcaaggc aaaaactatc ctgattgatc ctcgtgtcat 421 cgaaagccgt gggattggtg gacttaataa cacaataatg cacgagtgtg tccattggca 481 taaacatagg cttgcatttg aattggttag gcttttccaa ccagagcttt cgaatattac 541 cacaactaaa gaagagttcg atgggctcat tgagaaaaat atgactccaa cagattggct 601 ggagattcaa gctagaaaga ttacgcctaa aatcttgatg cctaagaaga tgtttaaaca 661 agaagttgaa acgtttatgc gtcctgatgg tggaagtgga atagtggacc agctcctcat 721 aatcgaggga actatttcag agcttgctag ttttttcacc gtatcaaaat tatctgcaaa 781 aattcggatg gttgagctgg gatatgagat tgcgataggt gcttcaaact atgttgatgg 841 gcatcctgta cccccacact cttggaagca gggagctgta tctgctaatc agacttattc 901 aattggtttt gttgatgcaa ctatagagac attaaagaac ccaagattac tcgtcgcaat 961 aaaaaaagaa cttaacctca tatttcatag ggattgaagt tcttgttttt tatttttggt 1021 agctaaaatg gtatccaatt aataattagc tattttattt tggtaaaata gaataaatag 1081 atttttagaa cagggagtag gtaaatgata actttgcaac accaagattg ggaacgtgca 1141 gtcaatatga ttaaaaacat tccaccaagt gccaaaaata aatactttca aacttttcct 1201 ttctttttgt taagtgaaac cagctgggaa gagttacttt ctgaaaattt tttctactca 1261 tatataaagt caggagagtt tctgacttat caagaaaacc tttcattcta cgataggaca 1321 attcaaaaga gccatggagc ataccgacaa actcgaattg tatctccaat catctatata 1381 tttttaatag caattgcttc tcaagtggaa agaatatatg ttgaaaagcg tactaatgat 1441 atgtcagtat atttttcagg ttcttttgaa aaagaaaaaa acactgcaca ttataaacaa 1501 agttataaca cttatatgac tgaacttaat gcctgtcaag aagagtttga ttattatttt 1561 caaacagatt tttcaacatt ttttcatcta gtagatacag acaacttatt caataaaata 1621 gaccgattag atccgaaaag tgcattagtc tactcttccc taataaaaat gattggacaa 1681 ggaagaatgc ccatagttga tggaaattca ggtctttctt ttttaaatac tgtagtttat 1741 ctagatgact ttgataaaga aattattgat tccttaaaaa cgatagttga aatcgaaagt 1801 tttaaactag ttcgatatgt agatgactta catattttca taaaatgtgc aaataaagat 1861 ttagattttt taaattataa agtatataat ttgttatgcg aaaaagcaac gaaacatcat 1921 cttgagataa attcttcaaa aacaaaatca tttactccca caagtgaact aagcactaaa 1981 atgaatacag atttgtataa tttctttgtg tataacgaag atgttgattt tgagcaatat 2041 ttttcaaaaa ataccttaat agaatttcta gataaattaa ataatatgtc tgttaatgct 2101 gatttttcag agtacgaaaa agaagtcctg tatacattgg aaaaccctga aatagtatct 2161 gacggaagtt atatattaaa cgctattgtt tacaataaaa gtacgtggtc acaggattat 2221 gatataaaaa ataaaattag tttattagta aatagtaatt atagaaaact aagatattct 2281 gcaaaagcac tcataactct tgtcttgaat acaagagatg gtgatatcat aaaaggacta 2341 cttaataatt tatttactac attcaaaaat ggtacaaatg atattattga tgaaattata 2401 ttaattgaat acttggtaca aagaaagttt aatcataaag atttaatgac tatcttgaaa 2461 gctgacgacc atggtattaa ggaatatatt aaggcatatc aaacttctga tttcataaaa 2521 agtctagaaa aaaataaagt tattttttat actaatcaga aagaagttta tcctttaata 2581 agtaaggata aaatactcaa tttcatatat tttagagcta agtattttga atccttagac 2641 ctagttcttg aatcctttgc atattataaa aactattttg ataggtttgt tgctcacgct 2701 atgttctgta caggaattga ctctggaaga aaacctaatt ataagctata ttatacagaa 2761 ggaaagttaa tagatgggtt aaaacaacta aattttttat cttctgatga gattacaaaa 2821 ataattaatg aagcacataa aattagaaat agtaaccccg taagtcactc tagtgctggc 2881 ttacttcaaa acgaagattt tagtagatat agagttaagt caagtcttaa tgacttgaaa 2941 attatcatag aacaactttc aacattgcta caaaataaaa acagattata aaatattatt 3001 atctgtttta tttttttcaa aagaatcaaa gtttattaga tttttcaact gtcctgttat 3061 ctcctcatcc gcatttggat aaagatgact ataaa // LOCUS PH15RER 2016 bp ds-DNA PHG 27-JUL-1990 DEFINITION Bacteriophage phi-15 right early region containing genes 17, 16.5-16.9, complete cds. ACCESSION M28830 KEYWORDS DNA replication protein. SOURCE Bacteriophage phi-15 DNA. ORGANISM Bacteriophage phi-15 Viridae; ds-DNA nonenveloped viruses; Podoviridae. REFERENCE 1 (bases 1 to 2016) AUTHORS Benes,V., Arnold,L., Smrt,J. and Paces,V. TITLE Nucleotide sequence of the right early region of Bacillus phage phi-15 and comparison with related phages: Reorganization of gene 17 during evolution JOURNAL Gene 75, 341-347 (1989) STANDARD simple staff_entry FEATURES from to/span description pept 203 664 gene 17 peptide pept 666 992 gene 16.9 peptide (put.) pept 989 1306 gene 16.8 peptide (put.) pept 1308 1700 gene 16.7 peptide (put.) pept 1697 1861 gene 16.6 peptide (put.) pept 1858 1971 gene 16.5 peptide (put.) BASE COUNT 744 a 374 c 382 g 516 t ORIGIN 1 aaagtagggt acagcgacaa cataccccat ttccccattg accgactatc ttcgacaaca 61 atctaacaac taaaacacaa ctatatacct atactaaata acaacatcaa tttgtcgaaa 121 agggtagaca aactactgtt taacatggta tactataata gaagtaagga agtaagacaa 181 ccaatcatag gaggaattac acatgaacaa ctaccaatta actattaacg aggtaatcga 241 gatcatcaac cgcaatacag acattaataa gttagtcgct aaaaaggaca acttatatcc 301 aacagatttg tatgacttag acaaacaaca attaatcgct atcattctca acagcgattt 361 cgcattatct agcatcaaaa gagcattatt agaggtaact gtcgaagagc taagagaaca 421 ggacaatgat gacgatttag atgaaataga taacgagtta tacgagggcg cagaggcggc 481 tgatgttcca cgtgaaacaa tagtgaaagt attcgaggca gacaaatcaa tagtaacgtt 541 caacggtgag aaactaaaac attatgttaa tgttgatgac aatagctcat ctgttgatga 601 agtaaagaag atcgccaagg aaatatcaga acatgacttt aacgaaaaca aggagaacaa 661 gtaacatgag cgtacaactt aatgcattta cattcatctt agaacgccgt ggatggcgca 721 tggtatgcta cgaacaatta acaacgaacg ggacacgcat attacatttc tatctcaaag 781 ataacccaac gttctttgct acatactcat cacaattctt atcagacacc aaaatgataa 841 gacgctttgc atcatggagc gggcaattac ttgaaggttc aaactctgtg ttctggacta 901 acataacacc atttgagcca attgatgaag aaacagcaga agacatcaag aatctcgaca 961 aagtcgttga aggaatgaat tttacgttat gattgacatt attgtaaaag aggacaagcg 1021 cttaataact gttcaaacac cagagggaga cgaagtattt tacactttgt ctttctcaga 1081 cgagcataag ttactaaaac gatcaagtgc cagactaaga aacaacatat atgccattgg 1141 tgtagcaaat atcagatggg tgttagttga catggataac atgatactta gtgagtatat 1201 gcatcacgtt gacatcctaa aagacataga caggaaaatg agacaattag ggtatatagt 1261 catttcagaa tggcaacacg caaataaaaa agggacgagg agataatatg gaagctattt 1321 tgatgatcgg tgtaatcaca ttgtgcgtta tattcctttt atcaggacgc aataataaaa 1381 agaaacaaga aataagggag ctagaagatt accttgaaga cctcaatcaa agaatcgtgc 1441 aacgaacaca gatacttagt gagcttaacg aagtcatcac aaacagaagt gttgataagt 1501 cagtcaatat gtcagcttgt gaaatcgccg tccttgacct ctatgagcag tcaaatatcc 1561 gcattcctag tgatatcatc gaagatatgg ttaatcaacg tttacaatca gaacaagatg 1621 tgttaaacta tatagagaca cagcggacat actggaaact ggagaatcag aaaaaactat 1681 atcgggggtc attgaaatga agttgctcac tcacacgtgt cattactgta gcttctcttt 1741 cttcactcga aagtttgatg tatttggtgc aataaccaag aaagatactc ctgttgtttt 1801 ctgccccgca tgtgggaatc aatccctttc agtatcacac attgaggagg agatcatatg 1861 aatcagaaag aattccaagc cgttttagac tggatgttgt cacacactat tatacaattc 1921 cacgaataca actatatgct acaaaagagc ctaccgttcc tcagacgata ggcttttctt 1981 ttgattgttt cacgtggaac atcactgtat acgcat // LOCUS STLENV 4122 bp ss-RNA VRL 27-JUL-1990 DEFINITION Simian T-cell leukemia virus, pol-env-pX-3' LTR region. ACCESSION M11373 M11374 KEYWORDS env gene; pol gene. SOURCE Simian T-cell leukaemia virus genomic RNA. ORGANISM Simian T-cell lymphotropic virus type I Viridae; ss-RNA enveloped viruses; Positive strand RNA virus; Retroviridae; Lentivirinae. REFERENCE 1 (bases 1 to 4122) AUTHORS Watanabe,T., Seiki,M., Tsujimoto,H., Miyoshi,I., Hayami,M. and Yoshida,M. TITLE Sequence homology of the simian retrovirus genome with human T-cell leukemia virus type I JOURNAL Virology 144, 59-65 (1985) STANDARD simple staff_entry FEATURES from to/span description pept 311 1777 env protein BASE COUNT 905 a 1456 c 741 g 1020 t ORIGIN 1 catttaaacg tattaaccca ttgccacaaa acccgatggc aacttcacca ctccctccgt 61 ctgcaaccta tcccagagac acactccccc atcagtaaac aaacccattg gtattatttc 121 aagcttcctg gtcttaatag ccgccagtgg aaaggaccac aggaagctct ccaagaagcc 181 gccggcgctg ctctcattcc ggtaagcgct aattctgccc agtggatccc gtggaggctc 241 ctcaagcaag ctgcatgccc aagacccgtc gggggccccg ccgatcccaa agaaaaagac 301 caccaacacc atgggtaagt ttcttaccac cttactttta ttcctccaac tccgcccccc 361 cattgtctgt gatcacagtc ccagctgctg cactctcaca gtcggagtct cctcatatca 421 ttctaaaccc tgcaatcccg cccaaccagt ctgctcatgg accctcgacc tgctagctct 481 ttcagcagac caagccctac aacccccctg ccctaatctg gtgggttact ccaactacca 541 tgctacctat tccctatatc tattccctca ttggattaaa aagccaaacc gaaatggcgg 601 aggctattat tcggcctctt attcagaccc ttgttcccta aagtgccctt acctaggatg 661 ccaatcatgg acctgtccct acacaggagc cgtctccagc ccctattgga aatatcagca 721 agatgtcaat ttcactcaag aagtctcacg cctcagcctt aatctccact tttcaaaatg 781 cggtttctcc ttctcccttc tggtcgatgc actaggatac gaccccatct ggttccttaa 841 caccgaacct agccaactgc cccccactgc ccttcctcta ctcccccatt ccaacttgga 901 ccacatcctc gagccttcca taccatggaa gtcaaaactc ctaactcttg tccaattaac 961 tctacaaagc actaactaca cttgtattgt ctgcatagat cgtgccagct tatctacttg 1021 gcacgtctta tactctccca acatctctgt tccatcctcc ccctctactc ccctccttta 1081 cccatcgtta gcgcttccag ccccccatct gacattacca ttcaactgga ctcactactt 1141 tgaccctcag attcaagcca tagtctcctc cccctgtcac aactccctca tcttgccccc 1201 cttttctctg tcacctgttc ccgccccagg gtcccgttcc cgccgggcag taccggtggc 1261 ggtctggctc gtctctgccc tggccatggg aaccggagtt gctggcggga ttaccggctc 1321 catgtccctc gcctcaggaa aaagtctcct acatgaagta gacaaagata tttcccaatt 1381 gacccaagca attgtcaaaa accacaaaaa tctactcaag attgcacaat atgctgccca 1441 aaacagacga ggccttgatc tcctgttttg ggagcaagga ggactatgta aagcactgca 1501 agaacaatgt tgtttcctaa atattactaa ttcccatgtc tcaatacttc aagaaagacc 1561 cccccttgaa aatcgagttc taactggttg gggcctcaac tgggaccttg gcctctcgca 1621 atgggcccga gaggccttac aaactggaat cacccttgtc gcattactcc ttctggtcat 1681 ccttgtaggg ccatgcatcc ttcgccaact acgacaactc ccctcgcgtg tcagataccc 1741 ccactactct cttataaacc ccgagtcctc cctataaacc tagcagttac tgcgaccaca 1801 ttgcttgcag cctcccctgc caatagttaa cctcccccat caaatcctcc ttttcctgca 1861 acaacctctt ccgttcggcc tccaaggagt ctacctcact ctccagctgt ctagcataac 1921 catcaactcc caactccttt attttttctt tcctagcatc acgctgtctc gccctcttag 1981 ccccttgttc tgccagtctc cgcttgcgct caagccgctt ctcctcctcc cgcctcctgc 2041 gccgagcctt cttctcttcc ttccttttca ggaactcggc aatctgttcc tcctcctctt 2101 tctcccgctc tctttttcgc ttcctcttct cctcagcccg tcgctgccga tcacgatgcg 2161 tccccccgcg aggtggcgtc tcctccctta gagggacccg ttgctgctgg ccattacatt 2221 cctcttcgag ggatagtaaa ccgtcaagca cagactcctc ctccgtcctt tgtttgttta 2281 agtcctcttc tagggataat agtccgtcca ccaagtcttc caccagcagg tcctccgggc 2341 agggaacagg taaacatcga aacggcccta cacatacaaa attaatcata cttattatca 2401 gcccacttcc caggttttgg acagagcctt ctttatggat acccagtcta cgtatttgga 2461 aactgtgtgc aaggcgactg gtgccccatc tctgggggac tatgctcggc ccgcctgcac 2521 cgtcacgccc tactggccac ctgtccagag catcagatca cctgggaccc caccgatgaa 2581 cgcgttatcg gctcagctct gcagttcctt atccctcgac tcccctccct ccccacccaa 2641 agaacctcta agaccctcaa ggtccttacc ccgccggcta ctcatacaac ccccaacatt 2701 ccaccctcct tcttccaggc cgtacgaaaa tactccccct tccgaaacgg atacatggag 2761 cccacccttg ggcaacaact cccaaccctg tcctttcccg accccggcct ccggccccaa 2821 aacttgtata ccctctgggg aaactccgtt gtctgcatgt acctctacca actttccccc 2881 cccatcacct ggcccctcct gccccatgta attttctgcc acccaggcca acttggggcc 2941 tttctcacca acgttcccta caaacgaatg gaagaacttc tctataaaat tttccttaat 3001 acaggggcca caatcattct acccgaggac tgcttgccca ccaccctatt ccgacccact 3061 agggcacccg cctcgctgac agccaggcaa aatggcctcc ttccgttcca gtcaaccctt 3121 actacgccag gccttatttg gacatttacc gatggcacgc ccatggtttc cgggccctgc 3181 cctagggacg gtcagccatc tttggtgttg cagtcctcct cctttatatt tcacaaattt 3241 caaaccaagg cctaccatcc ctctttccta ctctcacatg gcctcataca gtactcgtcc 3301 tttcataact tacatctcct ctttgaagag tacaccaata ttcccatttc tttacttttt 3361 aacaaagaag aggcaaatga tactgaccat gagccccaaa tgctccccgg gggcctaaaa 3421 cctcccaatg gaaaacattt ccgcgaaaca gatgtctgaa aaaggtcagg gcccagacta 3481 gggctctgac gtctcccccc ggagggcagc tcaacacaag gccctgacgt gtccccctgg 3541 ggacaaatca tgagcccaga cctccgggaa gccaccgaga accacccatt tcctccccat 3601 gtttgtcaag ccgtccccag gcgttgacga caacccctcg cctcaaaaac ttttcatggc 3661 acgcatacag ctcaataaaa caacaggagt ctataaaagc gtggagacag ttcaggaggg 3721 ggctagcttc tgctctacac gcgcccgccg ccctacctga ggccgccatc cacaccggtt 3781 gagttgcgtt ctgccgcctc ccgcctgtgg tgcctcctga actgcgaccg ccgtctgggt 3841 aagttcggag ctcgggtcga gaccgggcct ttgtccggcg ctcccttgga gcctacctag 3901 attcggccgg ctctccacgc tttgcctgac cctgcttgct taactccaca tctttgtttc 3961 gttttctgtt ccgcgccgct gcagatcgaa agttccaccc ctttcccttt cattcacgtc 4021 tgactgccgg cttggcccac ggccaagcac cggcatcctc actggctcgg agccagtagc 4081 agcccatttt acaactctct ccagagagaa gtttagtaca ca // LOCUS HUMIGHDA 368 bp ss-mRNA PRI 27-JUL-1990 DEFINITION Human Ig H-chain mRNA V-D-J region, partial cds. ACCESSION M33858 KEYWORDS diversity region; immunoglobulin heavy chain; joining exon; mu-immunoglobulin; processed gene; variable region; variable region V-H4. SOURCE Human peripheral blood lymphocyte (isolate LES), cDNA to mRNA, clone C. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 368) AUTHORS Roudier,J., Silverman,G.J., Chen,P.P., Carson,D.A. and Kipps,T.J. TITLE Intraclonal diversity in the V-H genes expressed by CD5- chronic lymphocytic leukemia-producing pathologic IgM rheumatoid factor JOURNAL J. Immunol. 144, 1526-1530 (1990) STANDARD simple staff_review FEATURES from to/span description pept < 1 > 368 Ig H-chain V4-D-J region (AA at 1) BASE COUNT 73 a 110 c 112 g 73 t ORIGIN 1 caggtgcatc tacagcagtg gggcgcagga ctgttgaagc gttcggagac cctgtccctc 61 acctgcgctg tctatggtgg gcccttcagt ggttactatt ggagctgggt ccgccagccc 121 ccagggaagg gaccggagtg gattggggaa atcaatcata gtggacgcac cacctacaac 181 ccgtccctca cgagtcgagt caccatgtca ctggacacgt ccaagaacca gttctccctg 241 aagctgacct ctgtgaccgc cgcggacacg gctgtttatt actgtgcgag aggcccttgc 301 gaggcatact gtactgatga tgccccgcaa ggcggtgcgc ggtacttcca gcactggggc 361 cagggcac // LOCUS HUMIGHDB 260 bp ss-mRNA PRI 27-JUL-1990 DEFINITION Human Ig H-chain mRNA V-D-J region, partial cds. ACCESSION M33859 KEYWORDS diversity region; immunoglobulin heavy chain; joining exon; mu-immunoglobulin; processed gene; variable region; variable region V-H4. SOURCE Human peripheral blood lymphocyte (isolate LES), cDNA to mRNA, clone A. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 260) AUTHORS Roudier,J., Silverman,G.J., Chen,P.P., Carson,D.A. and Kipps,T.J. TITLE Intraclonal diversity in the V-H genes expressed by CD5- chronic lymphocytic leukemia-producing pathologic IgM rheumatoid factor JOURNAL J. Immunol. 144, 1526-1530 (1990) STANDARD simple staff_review FEATURES from to/span description pept < 1 > 260 Ig H-chain V4-D-J region (AA at 1) BASE COUNT 57 a 79 c 75 g 49 t ORIGIN 1 atccgccagc ccccagggaa gggactggag tggattgggg aaatcaatca tagtggacgc 61 accacctaca acccgtccct cacgagtcga gtcaccatat cagtggacac gtccaagaac 121 cagttctccc tgaagttgac ctctgtgacc gccgcggaca cggctgttta tttctgtgcg 181 agaggccctt gcgaggcata ctgtactgat gatgccccgc aaggcggtgc gcggtacttc 241 cagcactggg gccagggcac // LOCUS HUMIGHDC 260 bp ss-mRNA PRI 27-JUL-1990 DEFINITION Human Ig H-chain mRNA V-D-J region, partial cds. ACCESSION M33860 KEYWORDS diversity region; immunoglobulin heavy chain; joining exon; mu-immunoglobulin; processed gene; variable region; variable region V-H4. SOURCE Human peripheral blood lymphocyte (isolate LES), cDNA to mRNA, clone B. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 260) AUTHORS Roudier,J., Silverman,G.J., Chen,P.P., Carson,D.A. and Kipps,T.J. TITLE Intraclonal diversity in the V-H genes expressed by CD5- chronic lymphocytic leukemia-producing pathologic IgM rheumatoid factor JOURNAL J. Immunol. 144, 1526-1530 (1990) STANDARD simple staff_review FEATURES from to/span description pept < 1 > 260 Ig H-chain V4-D-J region (AA at 1) BASE COUNT 58 a 82 c 74 g 46 t ORIGIN 1 atccgccagc ccccagggaa gggactggag tggattgggg aaatcaatca tagtggacgc 61 accacctaca acccgtccct cacgagtcga gtcaccatat cagtggacac gtccaagaac 121 cagttctccc tgaagctgac ctctgcgacc gccgcggaca cggctgttta ttactgtgcg 181 agaggccctt gcgaggcata ctgtactgat gatgccccgc aagccggtgc gcggtacttc 241 cagcactggg gccagggcac // LOCUS HUMHLADQA 1468 bp ss-mRNA PRI 27-JUL-1990 DEFINITION Human MHC class II HLA-DQA1 mRNA, complete cds. ACCESSION M33906 KEYWORDS cell surface glycoprotein; class II gene; integral membrane protein; major histocompatibility complex. SOURCE Human lymphoblastoid cell cell line SPL, cDNA to mRNA, clone pII-alpha-7. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 1468) AUTHORS Jonsson,A.-K., Andersson,L. and Rask,L. TITLE Complete sequences of DQA1 and DQB1 cDNA clones corresponding to the DQw4 specificity JOURNAL Immunogenetics 30, 232-234 (1989) STANDARD simple staff_review FEATURES from to/span description pept 44 811 MHC HLA-DQA1-alpha protein /hgml_locus_uid="LY0151Z" /nomgen="HLA-DQA1" /map="6p21.3" signal 1119 1124 poly-A signal (pot.) BASE COUNT 356 a 381 c 309 g 422 t ORIGIN 1 cacagctcag agcagcaact gctgaggctg ccttgggaag aggatgatcc taaacaaagc 61 tctgctgctg ggggcccttg ccctgaccac cgtgatgagc ccctgtggag gtgaagacat 121 tgtggctgac catgttgcct cttatggtgt aaacttgtac cagtcttacg gtccctctgg 181 ccagtacacc catgaatttg atggagacga gcagttctac gtggacctgg ggaggaagga 241 gactgtctgg tgtttgcctg ttctcagaca atttagattt gacccgcaat ttgcactgac 301 aaacatcgct gtgacaaaac acaacttgaa catcctgatt aaacgctcca actctactgc 361 tgctaccaat gaggttcctg aggtcacagt gttttccaag tctcccgtga cgctgggtca 421 gcccaacacc ctcatctgtc ttgtggacaa catctttcct cctgtggtca acatcacatg 481 gctgagcaat gggcactcag tcacagaagg tgtttctgag accagcttcc tctccaagag 541 tgatcattcc ttcttcaaga tcagttacct caccttcctc ccttctgctg atgagattta 601 tgactgcaag gtggagcact ggggcctgga cgagcctctt ctgaaacact gggagcctga 661 gattccagcc cctatgtcag agctcacaga gactgtggtc tgcgccctgg gattgtctgt 721 gggcctcgtg ggcattgtgg tgggcactgt cttcatcatc atccgaggcc tgcgttcagt 781 tggtgcttcc agacaccaag ggcccttgtg aatcccatcc tggaagggaa ggtgcatcgc 841 catctacagg agcagaagag tggacttgct acatgaccta gcattatttt ctggccccat 901 ttatcatatc ccttttctcc tccaaatgtt tctcctctca cctcttctgt gggacttaaa 961 ttgctatatc tgctcagagc tcacaaatgc ctttgaatta tttccctgac ttcctgattt 1021 ttttcttttc tcaagtgtta cctactaaga gatgcctgga gtaagccacc cagctaccta 1081 attcctcagt aacctccatc tataatctcc atggaagcaa caaattccct ttatgagata 1141 tatgtcaaat ttttccatct ttcatccagg gctgactgaa accgtggcta agaattggga 1201 gactctcttg tttcaagcca atttaacatc atttaccaga tcatttgtca tgtccagtaa 1261 cacagaagca accaactaca gtatagcctg ataacatgtt gatttcttag ctgacgttaa 1321 tatttctttc ttccttgtgt tcccaccctt ggcattgcca cccacccctc aattaaggca 1381 acaatgaagt taatggatac cctctgcctt tggctcagaa atgttatagc aaaaatttta 1441 aaataaaaaa gtaagtctgt attaattg // LOCUS HUMHLADQB 896 bp ss-mRNA PRI 27-JUL-1990 DEFINITION Human MHC class II HLA-DQB1 mRNA, complete cds. ACCESSION M33907 KEYWORDS cell surface glycoprotein; class II gene; integral membrane protein; major histocompatibility complex. SOURCE Human lymphoblastoid cell cell line SPL, cDNA to mRNA, clone pII-beta-10. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 896) AUTHORS Jonsson,A.-K., Andersson,L. and Rask,L. TITLE Complete sequences of DQA1 and DQB1 cDNA clones corresponding to the DQw4 specificity JOURNAL Immunogenetics 30, 232-234 (1989) STANDARD simple staff_review FEATURES from to/span description pept 46 831 MHC HLA-DQB1-beta protein /hgml_locus_uid="LS0098W" /nomgen="HLA-DQB1" /map="6p21.3" BASE COUNT 185 a 252 c 263 g 196 t ORIGIN 1 gtccaagctg tgttgactac cactactttt cccttcgtct caattatgtc ttggaagaag 61 gctttgcgga tccctggagg ccttcgggta gcaactgtga ccttgatgct ggcgatgctg 121 agcaccccgg tggctgaggg cagagactct cccgaggatt tcgtgttcca gtttaagggc 181 atgtgctact tcaccaacgg gaccgagcgc gtgcggggtg tgaccagata catctataac 241 cgagaggagt acgcgcgctt cgacagcgac gtgggggtgt atcgggcggt gacgccgctg 301 gggcggcttg acgccgagta ctggaatagc cagaaggaca tcctggagga ggaccgggcg 361 tcggtggaca ccgtatgcag acacaactac cagttggagc tccgcacgac cttgcagcgg 421 cgagtggagc ccacagtgac catctcccca tccaggacag aggccctcaa ccaccacaac 481 ctgctggtct gctcagtgac agatttctat ccagcccaga tcaaagtccg gtggtttcgg 541 aatgaccagg aggagacaac tggcgttgtg tccacccccc ttattaggaa cggtgactgg 601 accttccaga tcctggtgat gctggaaatg actccccagc gtggagacgt ctacacctgc 661 cacgtggagc accccagcct ccagaacccc atcatcgtgg agtggcgggc tcagtctgaa 721 tctgcccaga gcaagatgct gagtggcatt ggaggcttcg tgctggggct gatcttcctc 781 gggctgggcc ttattatcca tcacaggagt cagaaagggc tcctgcactg actcctgaga 841 ctattttaac tgggattggt tatcactttt ctgtaacgcc tgcttgtccc tgccca // LOCUS HAMRPS14 5300 bp ds-DNA ROD 27-JUL-1990 DEFINITION C.griseus ribosomal protein S14 (RPS14) gene, complete cds. ACCESSION M35008 KEYWORDS ribosomal protein S14. SOURCE C.griseus (haplotype EmtB wild type) ovary transformed fibroblast cell line CHO DNA. ORGANISM Cricetulus griseus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Cricetidae; Cricetinae; Cricetini. REFERENCE 1 (base 1 to 5300) AUTHORS Rhoads,D. and Roufa,D. JOURNAL Unpublished (1990) STANDARD full staff_review REFERENCE 2 (sites for [1]) AUTHORS Diaz,J.-J., Rhoads,D. and Roufa,D.J. TITLE Genetic analysis of a vital mammalian housekeeping locus using CHO cells that express a transfected mutant allele JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable [or printed] sequence for [1] kindly submitted by D.J.Roufa, 08-JUN-1990. Author address: D.J.Roufa Kansas State University Division of Biology Ackert Hall Manhattan, Kansas 66506 FEATURES from to/span description pept 1977 2125 ribosomal protein S14, exon 2 (first expressed exon) 2576 2737 ribosomal protein S14, exon 3 4074 4150 ribosomal protein S14, exon 4 4724 4791 ribosomal protein S14, exon 5 pre-msg 183 4841 ribosomal protein S14 mRNA and introns (5' end +/- 4 bp) IVS 220 1974 ribosomal protein S14 intron A IVS 2126 2575 ribosomal protein S14 intron B IVS 2738 4073 ribosomal protein S14 intron C IVS 4151 4723 ribosomal protein S14 intron D signal 4819 4824 poly-A signal mut 4783 4783 c in wt; g in mutant Emr-75 [ref. 2] null Arg-150 -> Gly with ThaI rflp. BASE COUNT 1315 a 1121 c 1415 g 1449 t ORIGIN 1 agatctggtc acagcgacac gcagcacccg gagtccttaa taagccatct tggtcgccgg 61 caggcttagc ctccttggaa accgcaatag aggcttcttc cgggagccgg gattccctcc 121 gtgacctcag aggtacactt ccggaatacg gaagtgaccc gtctgcccgc cccttccttc 181 tctctctttc cggtggagaa gtctggagac gacgttcagg tagggaggcc cggcggccca 241 caatcgtgcg gcatcctctg gccaggggcg tctgcgggct tggggtcgcg gtttgtgcgc 301 ggtaaccgag ccaggtgcgt gatggtgacg ggcggctgtc cgctcgccgc gatccacccg 361 gcctggggct gcgccgtgtc cgggaggcgg ccttctggcc tgagatcgga aggtctgagg 421 tggaagtgcg gggcctgcct gggccttgcc gtcacgccac agtcgtcgcc cgagcgttcg 481 gggagcgtgt ccctagtcgc gggtgggccg tcaccggcac cccattagat tcttaggggc 541 cgtcgacgac ctgcacctaa cctcatgccc ctcatgtgga aaatacgagc agcgttcgta 601 tctgcccctt ctgtaaaccg aagcgtaaaa tgggatgtca gaaaacctgg tcacgcagga 661 ttagagtgct tctaacccat ggtggttacc cctccgagtc ggacacattg cctgttgggg 721 tgccccctag tagagtagga ctgatgccat atgtttacat ttctctgaat agccctctcg 781 tctccactga ggacggcaaa gctacactga acagatgtca tgtcttatgg aatcgtgttt 841 gttagcctaa tttgagggta tgagcaggtg actacatcct ggacctgccc ttgtgctttt 901 ggcaaataag tcccaaatag tgtgtgaggg aggcactaca gagatcccag gtgtggttga 961 aacttttaag gcctaggagg aactttgtgg agtaggagca tcttggttct aggaggagta 1021 taagacaggc tttggtgcag tattgtcctg gacagactta gcctcgggtt ggtcaaaggc 1081 atgcctgcag ctttggcaaa aaaaaaaaaa gcctctttgc ttgtgtattt tccctgggtc 1141 ctgtgactgt acttacatta ttgactaggc aggaaggggt tacatgtcag gtgtatttgc 1201 agttcataac aatagcaaaa ttagagttat gaagtagtag caatgacttt ttgtgtgggg 1261 ggcaccacaa tatgagaatt gtactaaagg gtcacagcgt taggttgaga accaaagggt 1321 agactcactt gatgctgaac actgaatttt aggtatggat atctttacca ttaaacttcc 1381 tggaaggcat accctgacca tcctgggact tgctacgtag accataacta cgatatcctc 1441 ctgcctgtga ctcccaggtg ctggggctaa agttatacac aagcacacat tgctgtggca 1501 gtgcttttta atgtttatta ctagtaaagg gtgaaagatt tggttctaaa cagtgagagg 1561 ctgtcagatt aagaattgtg tgtggacagg gagtccgtta atctgcagtt ggcttcagtt 1621 gtgtcatgtt aatcaaataa acttcccagc ttaaggagag ggagagggga gttgtgaatg 1681 gggctttgcc ccaacacctg agttcagcat cctctatttt ggataactac ttgtgaaagt 1741 ttgaggtgca tattgttcat tgctggaatc ttaggctcat cctgaaaatg cttggagctt 1801 ttgtttagaa attgtctcct ctcttggcgc ctccctttta gttacaactc agtaattccc 1861 caagtacaaa tatgttggat attagagagc caaaggagct tgggttttcc tttagtaggc 1921 taaagtattt ggggatgttt tgtctgttaa catggaggtt ctccttccac tcagaaatgg 1981 cacctcgcaa gggaaaggaa aagaaggaag aacaggtcat cagcctcgga cctcaagtgg 2041 ctgaaggaga gaatgtattt ggtgtctgcc acatttttgc atctttcaac gacacctttg 2101 tccatgttac tgatctttct ggcaagtgag tacttgggga catgaagaca taaaaccaag 2161 cctcacaggg actgaggtgg gagaaaaagg cataggagtt tatgggattc cctctaaaga 2221 attactgatt ttcttatttt cccataaatt ataagagaaa acagttgaat gtagctgagg 2281 attcatccac agaaaatcac cgcaaatagg aagtggaagc agaggatcag aaactcaatt 2341 atcctcagct acacgcacat gtaaaatata tgcattatgg aagtgattat ggggataaaa 2401 tgcaaaacta attgttgtat taacagggag tttgtctaga tagggtcctt ggacccctgt 2461 aattatacac aactcctgtc ttgttcagaa ccaggatgtg tgcttgttgg ggttcagcaa 2521 gtgtttcctg tgtccttgtt ggaaaatgaa ggcatggatc tgcttttgtc tctagggaaa 2581 ctatctgccg ggtgactggt ggaatgaagg taaaggctga cagagatgag tcttctcctt 2641 atgcagccat gttggctgcc caggatgttg cccagaggtg caaggagctg ggcatcactg 2701 ccctgcatat caaactccga gctacgggag gaaacaggta tggaagtcta gggctggaca 2761 tttgtaagtt cactgggctt gggggtggat ctgagatttg agtctctgct gtgtctgaga 2821 gcaagccgtt ttaggaatgt atgtatgttc ttcgtcaggc atggtagctg atacctgtaa 2881 tcccagcact ccggaaccaa tataggagga ggatctaaag tttggtgcta acctgggcta 2941 cacagtgagc ctcaaaaata agatagtttg ttcctttatg tttgtccctg tagagtgctg 3001 cttttatttt ttttttttta aagcatacat gagactgact cttccccagc accaggtaga 3061 ctgggtgtgg tagtatacac ctgttgtctc agaaagtgga ggtgaggaga gctagagttt 3121 caagttggtt attggttgca tattgagtag gtggccagct tgagctgaac agtgatgctt 3181 aaatgatgaa ctgggggtgc atagtccaga tagagacaaa acttcaagag aagcatcttg 3241 gggtgagaaa gaagttgggc tttacaggga tgattggtgt aagatatgat ctaagaagca 3301 gggatggagc ctctggggtc ttaagtttta caaaacagag caagggcttg ggcatgaaaa 3361 ctagaatagg tggctactga gtggaaagat actgcatagg aagccctaag gaagttacat 3421 actggatgta ggagctggga agtaggcatc acagcaaggg gaggacagcc acaggccagg 3481 tgactatccc ctcccccccc cacacacaca catgtacact tttaaaggat ttgtaggcat 3541 ggtggctcac acctttaatc ccagcatctc ttggggggcg tccccccaca aaagtgggaa 3601 aaattacatg cgtatgttgc ctgtgtgcat gcatgcgtgc attatcggtt ccctgtggag 3661 gacagaagag gaccatgaat ctcttggaac tggtgttgct gagccacctc tccaggcctc 3721 agttagcagt tttgaaaggg tttggttttg ttttgttttt tcgagacagg gtttcttctg 3781 tgtagctttg gagcctatcc tggcactggc tttggagacc aggctggcct caaactcaca 3841 gagatccgac tgcctcccga gtgctgggat taaaggtgtg caccaacgca ggatttggtt 3901 ttgttgtgag attgtataat atatgaaaac cgtagcagcc tacaaagtct agaattttgt 3961 gtaatcccct cacagtgtgc agatctttaa ctatagcccc accaccattc agaagtgaat 4021 taaaaccatg tgaacattcc cactgacttg accaacaatc cttcattttt taggaccaag 4081 acccctggac ctggagccca gtcagccctc agagctcttg ctcgttcagg gatgaagatt 4141 gggcggattg gtaagtgacc cttctagcta acattgggtt tattttgaag cattgacccc 4201 ttccaaacta tgtgatgtgc atattggttc aagcttgtct ggtgactggt gcagttaact 4261 tttgttaacc tgtaaaagac attgagagcg ttgatttctg taaatgcccc ttaactgtta 4321 tattttccca tttgcaattg ggaagactta aaaaacactt gtgaatctgt cctagcagcc 4381 tttttccctg ctagctaaat gttaagggta tgcccttggg acagtccaag tgaataattg 4441 cacacacctg gcctaaagag caaacagcca caggtgctca gtgtaggggc tctgagttag 4501 aaacttgaag gagtccttcc cctctgatgt ttgtgtgcct tctcagagcc tgtgttgtcc 4561 tcgtccaatg tgggttgaga tgaggaaatg acaggtgagg tgggcccttg tttattgacc 4621 actggtgata catttgggga agaataaagt ctgctagtta gtctaatggc tagggaggct 4681 gtgtttatgt agctatgggc aatgacattt tttcttcttg cagaggatgt cacccccatc 4741 ccctctgaca gcacccgaag gaagggtggt cgtcgtggtc gccgtctgtg aacaggactt 4801 ctcaaattat tttctgttaa taaattgctt tgtataagct attttggttc tgatgtttgt 4861 ttgttttgag gcaggatctc tctccatgtc tatgtagaag agtctggcct ccaactcaga 4921 tccacatacc tgtctctgcc tcccaagtac tggggataaa ggaatgcacc accacacccc 4981 ggtgacctta ggggccaccc agggcaacac caaggcagtg ttttgagggg atgttaagat 5041 tgcatgtagg agcttctaat gtaggttggg ggttggcaaa cctgggtttg caggctggct 5101 ttctgagaat gggaacctga gagaggagat gctgaggctg ccactctggc attaatttgc 5161 cagccagtat tttcaaagtg gaactgttag tagcccatgc tgtcatggga cagctaaaca 5221 ccacacacac acacacccca agcagggtag tagagatcac aagttggaag ccagtttaat 5281 atggtctact tggggaattc // LOCUS AFATFDCD 7956 bp ds-DNA BCT 27-JUL-1990 DEFINITION A.eutrophus plasmid pJP4 chlorocatechol oxidative (tfdCDEF) operon, complete cds; and 2,4-dichlorophenol hydroxylase (tfdB) gene, complete cds. ACCESSION M35097 X07754 KEYWORDS 2,4-dichlorophenol hydroxylase; chlorocatechol 1,2-dioxygenase; chloromuconate cycloisomerase; tdfF gene; tfdE gene. SOURCE A.eutrophus (strain JMP134) plasmid pJP4 DNA. ORGANISM Plasmid pJP4 Prokaryota; Bacteria. REFERENCE 1 (bases 1 to 1598; enum. 1 to 1598) AUTHORS Perkins,E.J. JOURNAL Unpublished (1988) Dept of Biochem,Univ of Wash.,Seattle,WA 98195 STANDARD simple automatic REFERENCE 2 (bases 1 to 339) AUTHORS Perkins,E.J., Bolton,G.W., Gordon,M.P. and Lurquin,P.F. TITLE Partial nucleotide sequence of the chlorocatechol degradative operon tfdCDEF of pJP4 and similarity to promoters of the chlorinated aromatic degradative operons tfdA and clcABD JOURNAL Nucleic Acids Res. 16, 7200 (1988) STANDARD simple automatic REFERENCE 3 (bases 1 to 7956) AUTHORS Perkins,E.J., Gordon,M.P., Caceres,O. and Lurquin,P.F. TITLE Organization and sequence analysis of the 2,4-dichlorophenol hydroxylase and dichlorocatechol oxidative operons of plasmid pJP4 JOURNAL J. Bacteriol. 172, 2351-2359 (1990) STANDARD simple staff_review FEATURES from to/span description pept 337 1104 chlorocatechol 1,2-dioxygenase (tfdC) pept 1101 2213 chloromuconate cycloisomerase (tfdD) pept 2288 2992 tfdE product pept 2989 4053 tfdF product pept 4398 6194 2,4-dichlorophenol hydroxylase (tfdB) conflict 681 682 gg in [3]; cc in [1] BASE COUNT 1716 a 2209 c 2403 g 1628 t ORIGIN 1 aagctttgcc ccaatatcgc gttccaaggc ctggatctga cgagtaagcg gtggctgcga 61 tatgtgcatc ctctgtgccg ctgttccgaa acctcccgcc tccgcgaccg cgacgaagta 121 tttcaactgt cttatttcca tgttccgtcc cgttgcgctg cctgttacaa ggcgattgtg 181 agcaagcacc tgcggtctcg caactaggga aagacatacc gaaaccgtat ggtctaggcg 241 gagaaaaggt attggacggt atgggattgc cgtctctatt atgactgtgc ggggcaattg 301 tcgcccacgt gttcatgttt tcatgacgga ggcaaagtga acaaaagagt caaggatgtt 361 gtcgatgcga tcgtcgcggc ggtacagcgg gtgctcgacc agaaggaggt tactgaggcg 421 gaataccgaa ctgcggttca ttacctcatg caagtagccg aacagcgtga aaccgctctt 481 ctttgcgatg tttttttcaa tagcaccgtg gctgcgacga aggctcgtat tagcgaaggt 541 tcaacgcctg ctatcgaggg accctactat cgcgacgacg cccctctggt cgatgaccgg 601 ctcaagactt acgacacgga cgaccacaag cctttgctta tccagggaac ggtcaaagcg 661 gtcgacggga gcgttgtcga ggacgtgacg attgatgtct ggcattcgac gcccgatggg 721 aagtacagcg gtttccatga cgacatcccg actgattttt atcgagggaa gctcagggtg 781 ggcaccgatg gcagcttccg cgtgcgcaca acgatgccgg tgccgtatca gatcccggat 841 cagggtccca cgggcgcatt gctcgaaacc atgggtggtc actcgtggcg tcccgctcat 901 gtacatttca aggtgaaggc gccgggatat gaaacgttga ccacgcagta ctacttcgaa 961 ggtggagatt ggatcacgga cgactgttgc aacggcgtcc agtctagtct gattactccc 1021 gatattgtgg aagagggtgt ccgcctgatg aacatcaact tcgttatcga gcccgcgcgt 1081 gcgcaggcgg gggcaaaccc gtgaagattg acgcgattga agcagtgatc gtggatgtgc 1141 cgaccaagcg gccgatccag atgtcgatca ctaccgtgca ccagcagagc tacgttatcg 1201 tccgggtgta ttcggagggg ctcgttggtg tcggcgaggg tggaagcgtt ggtggtcccg 1261 tctggagcgc agagtgtgcg gagacgatca agatcatcgt ggaacggtat ctcgcgcccc 1321 acctcctcgg aactgatgcg ttcaacgttt caggtgcact gcaaaccatg gcgcgtgccg 1381 tcaccggaaa cgcctctgca aaggctgcgg tcgagatggc gttactggat ctcaaagctc 1441 gagcgttagg cgtatcgatc gccgagttac ttggcgggcc gttgcgcagt gcgattccga 1501 ttgcctggac attggcgagc ggagatacga aacgcgatct cgattctgcc gtcgagatga 1561 ttgaaagacg acgacacaat cgcttcaaag tcaagcttgg cttccggtcg ccccaagacg 1621 atctcatcca tatggaggct ttgtcaaata gtctcggatc gaaggcctac cttcgcgttg 1681 acgtgaatca ggcttgggac gagcaagtgg cgtccgtcta cattcctgaa ctggaggcgc 1741 ttggcgtgga actcatcgaa cagccggtcg gccgcgaaaa tacgcaagcg ttgaggcggc 1801 tctccgacaa caaccgcgtg gccatcatgg ccgatgagag cctgagtacg ttggcctcgg 1861 cattcgatct cgcgcgcgac cgaagtgtgg atgtcttttc gctgaagctt tgcaacatgg 1921 gaggggtctc ggcgacgcaa aagatagcag cggtcgcgga agcaagcggg attgcatcgt 1981 atggcggaac aatgcttgac tcgacgatcg gcacatcggt tgcacttcag ctctattcta 2041 cggttccatc gcttccgttc ggttgcgaac tgatcggtcc cttcgtgttg gcagacacgc 2101 tgagccacga gccactcgag atccgggatt acgaactgca ggttcccact ggcgtaggtc 2161 acggcatgac gcttgacgag gacaaggtgc gccaatacgc acgcgtcagc taggcaaaac 2221 gcgcagcgca ggccgcggct gtggatcgcc tcgaatgcaa ctgcgcatgt gtaactggag 2281 actaaatatg ttatcagacg gcgttgagat cacgtcgcgc tcgggtggtc gctttggtgc 2341 ctacctcgga aagccgacga cggactccgc acccatcgtc gtgatcgcgc aggagatttt 2401 cgggatcacg cctttcataa gagaaaccgt ggaatggctc gttggtgcgg gctttggttg 2461 cgtgtgtccg gatctgtact ggagacaggc gccgaatatc gagcttgatg caaacgtacc 2521 atcggaacgg gaacaggcgc ttgcgttgtt tcgcgacttt gacatggagg caggagtcaa 2581 tgacctttca tgcgccattg aatacgctcg tgcgctccct ttctcgaacg gtcgtgtcgc 2641 tgtagtcggg tactgcctgg gaggtgcgct ggcgtttgat gtggcggcgc gatcattggc 2701 cgattgctca atcggctatt acggcgtggg acttgaaaaa aaggtctcac tggtgccagc 2761 tattacacga ccggcgatgt tccacatggg taccaaagat cactatgtca cagaagaggc 2821 ccgtagcatt ctcgaagagc atttcggtcg aaacaaaaat ctgagtctgc actggtatcc 2881 agtggggcat tcatttgcac ggtcatccag cccaaatttc gatcaggcgg cgacaactgt 2941 ggccaatgct cggacgctcg aactgcttgc aatgttgaag gacccttcat gaagaagttc 3001 acgcttgact acctgagccc gagggtcgtc ttcggggcgg gcactgcttc tgcattgcca 3061 gatgaaatag gacgccttgg cgcacgccgg cccttggtat taagcagccc ggaacaacgc 3121 gagttagcga aggatatcgt ccgtccgata ggtgacaggg tagctggata tttcgatggc 3181 gcgacgatgc atgttcccgt cgacgtcatc cagaaagccg agcgggcttt taacgatact 3241 gacgccgact caatcatcgc gatcggggga ggatcgacca ccggactcgc aaaaatcctt 3301 tcgatgaacc ttgacgtccc aagtctggtt ataccaacga cctatgccgg tagtgaaatg 3361 actaccattt ggggtgtcac ggaaggcgga atgaagagga ccggccgcga ccccaaggtg 3421 ctaccgaaga ccgtgattta tgatccattg ctcacggtcg atttgccgct tgctatctcg 3481 gtgacgagcg ccttgaatgc gatcgctcac gccgcagaag gtctgtactc ggccgacctc 3541 aatcccgttc tcgagaccat gtgtaagcag ggcatatgcg ccttgttcga tgcaatcccg 3601 cgcctggtgg caaagccgac tgacgccgaa gcgcgtacgg atgccctttt tggggcatgg 3661 atgtgtggca ctgcactgtg ccacttgggc atggggctac atcacaaact ctgccacacg 3721 cttgggggaa cccttaatct tccccacgcg gagacacatg caatcgtact accacacgca 3781 ctggcataca atctgccgta cgccgcgcca gctgagcgac tgcttcagga agtcgccggc 3841 agtagtgacg tcccgagcgc gctatatgat ctcgccagaa atgctggagc accactcagt 3901 ctcgccgaaa tcggtatgcg gcctgaagat attccgaggg tacgcgacct cgcgctaagg 3961 gaccaatatc cgaatccgcg tccgctggaa tcggacgcat tggaaacatt gttagtcaat 4021 gcgtttcgtg ggcgaagacc ggatttcaaa taatgtgacc tgcactccgc gtttagtacg 4081 gtagcggtga agagcccgtt ccaaagtcga catcgggtct tcaagctaaa gcgaagcaca 4141 tgaatttgca tccactgttc cttgtggagc atcgccagcc ttcgcgcgaa aaaggccgac 4201 tgtactggag gtggctcgga ttttctttgc gcgaagtggc tcatttttac tttgcgcgca 4261 acaaaggctc ttcagcctta ctattcatcg cgcgttaaag acggtgaagt ttaaaaaatc 4321 gacttggccg aacgcgagat gaacaatccc tgtttcttct cacaaagaga gagacataac 4381 aattgacgga ggaagacatg gcattgacga tcgaaaccga cgtgctggtc gttggcaccg 4441 ggccggcggg cgcttcggcg ggggcactgc ttgcgcgata cggtgtcagg acgatgctca 4501 ttaacaagta caactggacg gctccgacgc cgcgtgccca catcaccaac caacgcacaa 4561 tggagatcct gcgtgatctc ggtctcgaag ctgaagcccg cctctacgcc gcgccgaatg 4621 acctaatggg cgagaacacc atctgcgcct cgttggccgg cgaggagttc ggtcggattc 4681 gcacctgggg gacggatgtt cgccggcgcg ccgattacga cgaatgttcg ccgacctcca 4741 tgtgcgacct gccacagaac tatctcgagc cgattctcgt caagtcggcc gcactcgacg 4801 gctgcaaggt gcgcttcgac acagagtatc ttggacacga acaggatgct gacggcgtgt 4861 cgtcgcgctt gcgcgatcgt ctgaatggcg aagagtttac cgtccggtcg aaatatctga 4921 tcggcgccga cggcgccaac agccgcgtcg tgtcagacct ggacctgccg ctcgaaggta 4981 ccatgggcaa gtccggttcc atcaacttgc tgttcgaggc tgatctcgac cgctacgtcg 5041 cgcatcgtcc atcggtcctt tactgggtca tccagcccgg ctccgatatc ggcggtctcg 5101 gcatcggcgt ggtgcgcatg gtccggccgt ggaacaaatg gcttgcgatc tgggggtacg 5161 acgtcgagca gggcccgccg gagatctcgg aatccttcgc ccgccgcatc gtccacaatc 5221 tgatcggcga tgacagcgtg ccgctgaaaa ttgagggcat ctcgacctgg acggtcaacg 5281 acatgtacgc gacccggttg cagcaaggac gggtcttctg cgccggcgac gcggtccacc 5341 ggcatccgcc gaccaacgga ctcggctcaa acacctccat ccaggacagc ttcaatctcg 5401 cctggaagat cgccatggtg ctgaacggca cggccgacga gagcctgctg gacacctaca 5461 cgatcgagcg ggcgccgatc gcgaagcagg tggtctgcag ggcgaacaag tcgcttgagg 5521 atttcccgcc gatcgcgatg gctctggggt tgccgcaggc caagagcgcc gacgagatga 5581 agagcaacat ggctcgccgc aaggaacctg ggcccgaggc gcaagcgcag cgcacgcggt 5641 tacgcgaggc catcgccggg accaattacg tttacaacgc ccacggcgtc gaaatgaatc 5701 agcgttacga cagtcccgcg atcgtggcag acaattcgcc ggacgaagtc ttccgcgacg 5761 tcgagctcta tcaccaggcg tcgacgcggc ctggagcgcc catgccgcat gtctgggtct 5821 atgcgagtgg tgatggacac cgtatctcaa caaaggatct ctgcgggaag ggcaacttca 5881 ctctctttac cgggattggc ggcgcagcct ggcaggatgc cgcggcggcc gtgagccggc 5941 aactcggggt ggcagtgacg gtgcgcatca ttgggcctgg ccaggcctac gaggatcatt 6001 acggcgactt cgcgagaatt tccgagatca tcgacaccgg cgccatcctg gtgcggccgg 6061 atttccacgt cgcctatcgg gcgacgtcct tgccggccga tgcggcaggc gacctcgtgt 6121 ccgccatgag gcggatcctc ggacgccagt cagagaggtc gagcgcgctg cgcgttacgt 6181 ctcgagctat ataaggagtg gcgtcgcgat ctattcgagg tgagacggaa gcgctcacct 6241 cgggttgcgg gaggggaagc gtgatcttgc gcagcgtgag atggcccgtg gcgacgcggg 6301 aaggttgtcg cgcgattaag gcaaggcggt gcgcaacgac gtgacgtgtt ttctggaccg 6361 gacgtaactg cagatcatct aatttgggaa gttcgtaccg gtgtcgaccg ccttgctgat 6421 gatgctatta agcacgtgag cagacgacgg gccggtgcag ccaacgttga aaaaagagcg 6481 cttgattgca atgtcgacga cacgatctac attactgtca tttgaacgac cgcttgctcg 6541 tgcggattca accggtggat gcaacagttt gctgaaatcg ttcggcgggc gttccgtggt 6601 tcagggtttt cctcggtcgc ccatccagcc gcctcgccac ggtattgagt agccaattgg 6661 tacaccgaaa tgtaggtgcc gttggggatg tctcgatagc ccgttcgcat tttcattcga 6721 cccgcgctgc caaggatgct gggggttcag taactgcagg ctgaactgcg ccaggtgaac 6781 caaacgctga tcgtcaagca aaatgaaata acccagttga acagagaaag cgcgcggctc 6841 gtcgccgaag ccggcgtcgc gacaaggcat ctacgcgagc tgcaagcgca cagcaagtag 6901 ttacagaata cgttgaacca gtcgctcgcc gaccacgcac gcaccaaggc cgagcacgat 6961 gcgctgctaa caccgcgagg actgaggcga ctgaactgac gcggctgcgc gaagcacgtg 7021 aatcgctgac cgccgatctg acgaagctca cagcgcagtt cgacgagcag cagttgctcg 7081 gggactatcg gatgcgtctg ggcctcacgg acgcggcggg atgaaagcgc cagattccgc 7141 atcaaaagga tggggatcta cgccggaacc ccagaaaatt ccgtcgttcc acattgtgaa 7201 cattaatcgg cttattccag ccgaagaatg ttggagcgac gtccggtgag tgacatgacg 7261 gtctcggtca cacggaccat caggaccgtt acgcggggcc cagttcgatt acattgtggg 7321 acaggtcgag acaatcttcg cagtcccggc aaacccagtg cagcatcggc ccgaccggat 7381 ctggatggtc gctgaacgtt gtcagcgccg ggtcgtacta gcatcggacg tgctcgagcg 7441 cagtatccgc tcgagattag actcgatgcg gtccaggctg cattcaccga tgccggactt 7501 gaacaccgtc acaccgcctg acgggttgac cgcgaagaag aaaaacgcct tcgtgtcgac 7561 catgtgcgac aggacttgtc ctggcgatga agttgctcgg attgatccag cgcattgaag 7621 atgcggtagc cgtaaaactc taagccgaag tgtacgacct cgaagtcgtc ggtgccgcgg 7681 atcgagtggc caaagttgaa accgctgcgt gttacgtcgt tcggaatgtc atccaccggc 7741 cgcagtgcca ggcaagcata gtgcgtctgg ccctcggcca cgagtcccgt cgcaaagcag 7801 ggcgcgcgct cggccagcaa ggcctcgaca aaattccggt tcatgcggag cgttggcagc 7861 atcgggcatc cttcatcgtg aatcaaatca acatttgccg acctggctgt gcggtgcgtc 7921 gacgtcgaac atcaaaggct atacatcgtc gagctc // LOCUS ECOCLPAA 3380 bp ds-DNA BCT 27-JUL-1990 DEFINITION E.coli ATP-dependent C1p protease (clpA) gene, complete cds. ACCESSION M31045 KEYWORDS C1p protease; clpA gene. SOURCE E.coli (strain K12) DNA. ORGANISM Escherichia coli Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Facultatively anaerobic rods; Enterobacteriaceae. REFERENCE 1 (bases 1 to 3380) AUTHORS Gottesman,S., Clark,W.P. and Maurizi,M.R. TITLE The ATP-dependent Clp protease of Escherichia coli: Sequence of ClpA and identification of a Clp-specific substrate JOURNAL J. Biol. Chem. 265, 7886-7893 (1990) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by S.Gottesman, 03-JAN-1990. FEATURES from to/span description pept 1000 3276 ClpA protein signal 779 784 -35 region signal 805 810 -10 region binding 990 994 ribosome binding site binding 214 221 ATP binding site binding 495 502 ATP binding site BASE COUNT 877 a 759 c 943 g 801 t ORIGIN 1 bp upstream of BamHI site; 19 min. on K12 map. 1 ggatccagta gatgctctgg cacgagagta atttttgaga taaaaatcga gccgatcggg 61 cggcattttg cctttaggat gtacacaatg agacagaaga gctatgcgac tgccgcttct 121 acttcgacgg gcacaataac actggcgtga ttgccttttg gcccctggtg gacatcaaac 181 tgaacggatt gtccagcttt tagcgttctg taaccatcca tctgaatggt ggaataatga 241 gcgaaaatat cttcgccgcc gccttcaggg cagatgaaac caaacccttt ggcattgttg 301 aaccacttaa cagtaccctt ttccatgctt cgacatcctt cgcaaatctt atacaagtaa 361 gatggaataa accggggtca gagagggggc tgttcaaaac ctcgccaact ctagaaatac 421 aatttagaga attagggcga gccgtcaagc atttgacagg ggacaagggg caggtatgaa 481 tcaaaaattt gaagcagtta acgctattga caggaatgtg acagatgtcg ctgatgccaa 541 cgatagatga tagttatcta tcatgtggag tagattggtc aggcaaataa gctcttgtca 601 gcggcagggc gttctgccga taaccgtaac cgaagatgat aactgacaat gggtaaaacg 661 aacgactggc tggactttga tcaactggcg gaagaaaaag ttcgcgacgc gctaaaaccg 721 ccatctatgt ataaagtgat attagtcaat gatgattaca ctccgatgga gtttgttatt 781 gacgtgttac aaaaattctt ttcttatgat gtagaacgtg caacgcaatt gatgctcgct 841 gttcactacc aggggaaggc catttgcgga gtctttaccg ccgaggttgc agaaaccaaa 901 gtggcgatgg tgaacaagta cgcgagggag aatgagcatc cattgctgtg tacgctagaa 961 aaagcctgaa tgcaggcata aaaattgggg gaggtgccta tgctcaatca agaactggaa 1021 ctcagtttaa atatggcttt cgccagagcg cgcgagcacc gtcatgagtt tatgaccgtc 1081 gagcacttgt tactggcgct gctcagtaac ccatctgccc gggaggcgct ggaagcgtgt 1141 tctgtggatt tggttgcgct ccgtcaggaa ctggaagcct ttattgaaca aaccacaccc 1201 gttctgcctg ccagtgaaga ggagcgcgac acacagccga cgctgagttt tcagcgtgta 1261 ctgcaacgtg cggtcttcca tgtccagtcc tccggtcgca atgaggttac cggtgcaaac 1321 gttctggtcg ctatctttag cgaacaggag tcgcaggcgg catatctgtt gcgtaaacac 1381 gaagtcagcc gtctcgatgt ggtgaacttt atctctcatg gcacgcgtaa agacgagccg 1441 acacagtctt ctgatcctgg cagccagcca aacagcgaag aacaagctgg tggggaggaa 1501 cgtatggaga atttcacgac gaacctgaat cagcttgcgc gcgtgggcgg aatcgaccca 1561 ctgattggtc gtgagaagga gctggagcgt gctattcagg ttctctgccg tcgccgtaaa 1621 aacaacccgc tgctggtggg ggaatctggt gtcggtaaaa ccgcgattgc ggaaggtctt 1681 gcctggcgaa ttgttcaggg cgatgtgccg gaagtgatgg ctgactgtac gatttactct 1741 ctcgatatcg gttctctgtt agcgggcaca aaatatcgcg gcgactttga aaaacgtttt 1801 aaagcgttgc tcaagcagct ggagcaggac actaacagca tcctgtttat tgatgagatc 1861 cacaccatta tcggtgcggg tgcagcgtct ggtggtcagg tcgatgcggc taacctaatc 1921 aaaccgttgc tctccagcgg taaaattcgt gtaattggtt cgacaaccta tcaggagttc 1981 agcaacattt tcgagaaaga ccgtgctctg gcgcgtcgct tccagaaaat tgatattact 2041 gaaccgtcga tcgaagaaac tgttcaaatc atcaatggcc tgaaaccgaa gtatgaaggc 2101 caccacgacg tgcgttatac cgcaaaagcg gtgcgtgcgg cggtagagct ggcggtgaaa 2161 tacattaacg atcgtcatct gccggataaa gccattgatg ttatcgacga agcgggcgct 2221 cgcgcacgcg tgatgccggt aagcaaacgc aagaaaaccg ttaatgtggc ggatattgag 2281 tccgtggtgg cccgtattgc acgcattcca gagaagagtg tttctcagag tgatcgtgat 2341 accctgaaaa acctcggcga tcgcttgaaa atgctggtct tcggtcagga taaagccatt 2401 gaggcgctga ctgaagccat taagatggcg cgtgcaggtt taggtcacga acataaaccg 2461 gttggttcgt tcctgtttgc cggccctacc ggggtcggga aaacagaggt gacggtacag 2521 ctttcgaaag ctttgggcat tgagcttctg cgctttgata tgtccgagta tatggaacgc 2581 cataccgtca gccgtgttat tggtgcgcct ccgggatacg ttggttttga tcagggcggt 2641 ttgctgactg atgcggtcat caagcatcca catgcggtgc tgctgctgga cgaaatcgag 2701 aaagcgcacc cggacgtgtt caatattctg ttgcaggtga tggataacgg tacgctgacc 2761 gataacaacg gacgcaaagc agacttccgt aacgtggtgc tggtgatgac caccaacgcc 2821 ggggtacggg aaactgagcg caaatccatt ggtcttatcc accaggataa cagcaccgat 2881 gcgatggaag agatcaagaa gatctttaca ccggaattcc gtaaccgtct cgacaacatt 2941 atctggtttg atcatctgtc aaccgacgtg atccatcagg tggtggataa attcatcgtc 3001 gagttgcagg ttcagctgga tcagaaaggt gtttctctgg aagtgagcca ggaagcgcgt 3061 aactggctgg ccgagaaagg ttacgaccgg gcaatgggcg ctcgtccgat ggcgcgtgtc 3121 atccaggaca acctgaaaaa accgctcgcc aacgaactgc tgtttggttc gctggtggac 3181 ggcggtcagg tcaccgtcgc gctggataaa gagaaaaatg agctgactta cggattccag 3241 agtgcacaaa agcacaaggc ggaagcagcg cattaatctg attgttaggt aggttggtca 3301 agtccgtaat ctcgaaagag gttacggact ttttgtttat ggggtggagg agttcagacc 3361 ctttttttat gatgatggta // LOCUS HUMCSAE 1372 bp ss-mRNA PRI 27-JUL-1990 DEFINITION Human epithelial glycoprotein (EGP) mRNA, complete cds. ACCESSION M32306 KEYWORDS cell surface antigen; cell surface glycoprotein; epithelial glycoprotein; transmembrane protein; tumor-associated antigen. SOURCE Human colon carcinoma cell line HT-29, cDNA to mRNA, (library HT-29/cdm8). ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 1372) AUTHORS Simon,B., Podolsky,D.K., Moldenhauer,G., Isselbacher,K.J., Gattoni-Celli,S. and Brand,S.J. TITLE Epithelial glycoprotein is a member of a family of epithelial cell surface antigens homologous to nidogen, a matrix adhesion protein JOURNAL Proc. Natl. Acad. Sci. U.S.A. 87, 2755-2759 (1990) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence [1] kindly submitted by B.Simon, 23-FEB-1990. FEATURES from to/span description pept 22 966 epithelial glycoprotein (EGP) precursor sigp 22 84 epithelial glycoprotein signal peptide matp 85 963 epithelial glycoprotein mRNA < 1 1372 EGP mRNA BASE COUNT 436 a 233 c 317 g 386 t ORIGIN 1 gctctaaagg cgcgcgcagg catggcgccc ccgcaggtcc tcgcgttcgg gcttctgctt 61 gccgcggcga cggcgacttt tgccgcagct caggaagaat gtgtctgtga aaactacaag 121 ctggccgtaa actgctttgt gaataataat cgtcaatgcc agtgtacttc agttggtgca 181 caaaatactg tcatttgctc aaagctggct gccaaatgtt tggtgatgaa ggcagaaatg 241 aatggctcaa aacttgggag aagagcaaaa cctgaagggg ccctccagaa caatgatggg 301 ctttatgatc ctgactgcga tgagagcggg ctctttaagg ccaagcagtg caacggcacc 361 tccatgtgct ggtgtgtgaa cactgctggg gtcagaagaa cagacaagga cactgaaata 421 acctgctctg agcgagtgag aacctactgg atcatcattg aactaaaaca caaagcaaga 481 gaaaaacctt atgatagtaa aagtttgcgg actgcacttc agaaggagat cacaacgcgt 541 tatcaactgg atccaaaatt tatcacgagt attttgtatg agaataatgt tatcactatt 601 gatctggttc aaaattcttc tcaaaaaact cagaatgatg tggacatagc tgatgtggct 661 tattattttg aaaaagatgt taaaggtgaa tccttgtttc attctaagaa aatggacctg 721 acagtaaatg gggaacaact ggatctggat cctggtcaaa ctttaattta ttatgttgat 781 gaaaaagcac ctgaattctc aatgcagggt ctaaaagctg gtgttattgc tgttattgtg 841 gttgtggtga tagcagttgt tgctggaatt gttgtgctgg ttatttccag aaagaagaga 901 atggcaaagt atgagaaggc tgagataaag gagatgggtg agatgcatag ggaactcaat 961 gcataactat ataatttgaa gattatagaa gaagggaaat agcaaatgga cacaaattac 1021 aaatgtgtgt gcgtgggacg aagacatctt tgaaggtcat gagtttgtta gtttaacatc 1081 atatatttgt aatagtaaag cctgtactca aaatataagc agcttgaaac tggctttacc 1141 aatcttgaaa tttgaccaca agtgtcttat atatgcagat ctaatgtaaa atccagaact 1201 tggactccat cgttaaaatt atttatgtgt aacattcaaa tgtgtgcatt aaatatgctt 1261 ccacagtaaa atctgaaaaa ctgatttgtg attgaaagct gcctttctat ttacttgagt 1321 cttgtacata catacttttt tatgagctat gaaataaaac cattttaaac tg // LOCUS HUMMHCAD 1093 bp ss-mRNA PRI 27-JUL-1990 DEFINITION Human MHC HLA protein, allele B7, complete cds. ACCESSION M32317 KEYWORDS cell surface antigen; cell surface glycoprotein; class I gene; integral membrane protein; major histocompatibility complex. SOURCE Human blood B-cell cell line JY cDNA to mRNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 1093) AUTHORS Ennis,P.D., Zemmour,J., Salter,R.D. and Parham,P. TITLE Rapid cloning of HLA-A,B cDNA by using the polymerase chain reaction: Frequency and nature of errors produced in amplification JOURNAL Proc. Natl. Acad. Sci. U.S.A. 87, 2833-2837 (1990) STANDARD full staff_entry COMMENT Authorin copy of sequence [1] kindly submitted by E.Warren, 22-FEB-1990. FEATURES from to/span description pept 1 1089 HLA protein allele B7 sigp 1 72 HLA protein signal peptide matp 73 1086 HLA protein mature peptide BASE COUNT 221 a 335 c 364 g 173 t ORIGIN 1 atgctggtca tggcgccccg aaccgtcctc ctgctgctct cggcggccct ggccctgacc 61 gagacctggg ccggctccca ctccatgagg tatttctaca cctccgtgtc ccggcccggc 121 cgcggggagc cccgcttcat ctcagtgggc tacgtggacg acacccagtt cgtgaggttc 181 gacagcgacg ccgcgagtcc gagagaggag ccgcgggcgc cgtggataga gcaggagggg 241 ccggagtatt gggaccggaa cacacagatc tacaaggccc aggcacagac tgaccgagag 301 agcctgcgga acctgcgcgg ctactacaac cagagcgagg ccgggtctca caccctccag 361 agcatgtacg gctgcgacgt ggggccggac gggcgcctcc tccgcgggca tgaccagtac 421 gcctacgacg gcaaggatta catcgccctg aacgaggacc tgcgctcctg gaccgccgcg 481 gacaccgcgg ctcagatcac ccagcgcaag tgggaggcgg cccgtgaggc ggagcagcgg 541 agagcctacc tggagggcga gtgcgtggag tggctccgca gatacctgga gaacgggaag 601 gacaagctgg agcgcgctga ccccccaaag acacacgtga cccaccaccc catctctgac 661 catgaggcca ccctgaggtg ctgggccctg ggtttctacc ctgcggagat cacactgacc 721 tggcagcggg atggcgagga ccaaactcag gacactgagc ttgtggagac cagaccagca 781 ggagatagaa ccttccagaa gtgggcagct gtggtggtgc cttctggaga agagcagaga 841 tacacatgcc atgtacagca tgaggggctg ccgaagcccc tcaccctgag atgggagccg 901 tcttcccagt ccaccgtccc catcgtgggc attgttgctg gcctggctgt cctagcagtt 961 gtggtcatcg gagctgtggt cgctgctgtg atgtgtagga ggaagagttc aggtggaaaa 1021 ggagggagct actctcaggc tgcgtgcagc gacagtgccc agggctctga tgtgtctctc 1081 acagcttgaa aag // LOCUS HUMMHCAE 1089 bp ss-mRNA PRI 27-JUL-1990 DEFINITION Human MHC HLA protein, allele B57, complete cds. ACCESSION M32318 KEYWORDS cell surface antigen; cell surface glycoprotein; class I gene; integral membrane protein; major histocompatibility complex. SOURCE Human blood B-cell cell line WIN and cell line MOC cDNA to mRNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 1089) AUTHORS Ennis,P.D., Zemmour,J., Salter,R.D. and Parham,P. TITLE Rapid cloning of HLA-A,B cDNA by using the polymerase chain reaction: Frequency and nature of errors produced in amplification JOURNAL Proc. Natl. Acad. Sci. U.S.A. 87, 2833-2837 (1990) STANDARD full staff_entry COMMENT Authorin copy of sequence [1] kindly submitted by E.Warren, 22-FEB-1990. FEATURES from to/span description pept 1 1089 HLA protein allele B57 sigp 1 72 HLA protein signal peptide matp 73 1086 HLA protein mature peptide BASE COUNT 217 a 331 c 366 g 175 t ORIGIN 1 atgcgggtca cggcaccccg aaccgtcctc ctgctgctct ggggggcagt ggccctgacc 61 gagacctggg ccggctccca ctccatgagg tatttctaca ccgccatgtc ccggcccggc 121 cgcggggagc cccgcttcat cgcagtgggc tacgtggacg acacccagtt cgtgaggttc 181 gacagcgacg ccgcgagtcc gaggatggcg ccccgggcgc catggataga gcaggagggg 241 ccggagtatt gggacgggga gacacggaac atgaaggcct ccgcgcagac ttaccgagag 301 aacctgcgga tcgcgctccg ctactacaac cagagcgagg ccgggtctca catcatccag 361 gtgatgtatg gctgcgacgt ggggccggac gggcgcctcc tccgcgggca tgaccagtcc 421 gcctacgacg gcaaggatta catcgccctg aacgaggacc tgagctcctg gaccgcggcg 481 gacacggcgg ctcagatcac ccagcgcaag tgggaggcgg cccgtgtggc ggagcagctg 541 agagcctacc tggagggcct gtgcgtggag tggctccgca gatacctgga gaacgggaag 601 gagacgctgc agcgcgcgga ccccccaaag acacatgtga cccaccaccc catctctgac 661 catgaggcca ccctgaggtg ctgggccctg ggcttctacc ctgcggagat cacactgacc 721 tggcagcggg atggcgagga ccaaactcag gacaccgagc ttgtggagac cagaccagca 781 ggagatagaa ccttccagaa gtgggcagct gtggtggtgc cttctggaga agagcagaga 841 tacacatgcc atgtacagca tgaggggctg ccaaagcccc tcaccctgag atgggagcca 901 tcttcccaat ccaccgtccc catcgtgggc attgttgctg gcctggctgt cctagcagtt 961 gtggtcatcg gagctgtggt cgctgctgtg atgtgtagga ggaagagctc aggtggaaaa 1021 ggagggagct actctcaggc tgcgtgcagc gacagtgccc agggctctga tgtgtctctc 1081 acagcttga // LOCUS HUMMHCAF 1093 bp ss-mRNA PRI 27-JUL-1990 DEFINITION Human MHC HLA protein, allele B51, complete cds. ACCESSION M32319 KEYWORDS cell surface antigen; cell surface glycoprotein; class I gene; integral membrane protein; major histocompatibility complex. SOURCE Human blood B-cell cell line BM92 cDNA to mRNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 1093) AUTHORS Ennis,P.D., Zemmour,J., Salter,R.D. and Parham,P. TITLE Rapid cloning of HLA-A,B cDNA by using the polymerase chain reaction: Frequency and nature of errors produced in amplification JOURNAL Proc. Natl. Acad. Sci. U.S.A. 87, 2833-2837 (1990) STANDARD full staff_entry COMMENT Authorin copy of sequence [1] kindly submitted by E.Warren, 22-FEB-1990. FEATURES from to/span description pept 1 1089 HLA protein allele B51 sigp 1 72 HLA protein signal peptide matp 73 1086 HLA protein mature peptide BASE COUNT 228 a 334 c 358 g 173 t ORIGIN 1 atgcgggtca cggcgccccg aaccgtcctc ctgctgctct ggggggcagt ggccctgacc 61 gagacctggg ccggctccca ctccatgagg tatttctaca ccgccatgtc ccggcccggc 121 cgcggggagc cccgcttcat tgcagtgggc tacgtggacg acacccagtt cgtgaggttc 181 gacagcgacg ccgcgagtcc gaggacggag ccccgggcgc catggataga gcaggagggg 241 ccggagtatt gggaccggaa cacacagatc ttcaagacca acacacagac ttaccgagag 301 aacctgcgga tcgcgctccg ctactacaac cagagcgagg ccgggtctca cacttggcag 361 acgatgtatg gctgcgacgt ggggccggac gggcgcctcc tccgcgggca taaccagtac 421 gcctacgacg gcaaagatta catcgccctg aacgaggacc tgagctcctg gaccgcggcg 481 gacaccgcgg ctcagatcac ccagcgcaag tgggaggcgg cccgtgaggc ggagcagctg 541 agagcctacc tggagggcct gtgcgtggag tggctccgca gacacctgga gaacgggaag 601 gagacgctgc agcgcgcgga ccccccaaag acacacgtga cccaccaccc cgtctctgac 661 catgaggcca ccctgaggtg ctgggccctg ggcttctacc ctgcggagat cacactgacc 721 tggcagcggg atggcgagga ccaaactcag gacactgagc ttgtggagac cagaccagca 781 ggagatagaa ccttccagaa gtgggcagct gtggtggtgc cttctggaga agagcagaga 841 tacacatgcc atgtacagca tgaggggctg ccgaagcccc tcaccctgag atgggagcca 901 tcttcccagt ccaccatccc catcgtgggc attgttgctg gcctggctgt cctagcagtt 961 gtggtcatcg gagctgtggt cgctactgtg atgtgtagga ggaagagctc aggtggaaaa 1021 ggagggagct actctcaggc tgcgtccagc gacagtgccc agggctctga tgtgtctctc 1081 acagcttgaa aag // LOCUS HUMMHCAG 1093 bp ss-mRNA PRI 27-JUL-1990 DEFINITION Human MHC HLA protein, allele B37, complete cds. ACCESSION M32320 KEYWORDS cell surface antigen; cell surface glycoprotein; class I gene; integral membrane protein; major histocompatibility complex. SOURCE Human blood B-cell cell line KASO and cell line MG cDNA to mRNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 1093) AUTHORS Ennis,P.D., Zemmour,J., Salter,R.D. and Parham,P. TITLE Rapid cloning of HLA-A,B cDNA by using the polymerase chain reaction: Frequency and nature of errors produced in amplification JOURNAL Proc. Natl. Acad. Sci. U.S.A. 87, 2833-2837 (1990) STANDARD full staff_entry COMMENT Authorin copy of sequence [1] kindly submitted by E.Warren, 22-FEB-1990. FEATURES from to/span description pept 1 1089 HLA protein allele B37 sigp 1 72 HLA protein signal peptide matp 73 1086 HLA protein mature peptide BASE COUNT 224 a 337 c 358 g 174 t ORIGIN 1 atgcgggtca cggcgccccg aaccctcctc ctgctgctct ggggggcagt ggccctgacc 61 gagacctggg ctggctccca ctccatgagg tatttccaca cctccgtgtc ccggcccggc 121 cgcggggagc cccgcttcat ctcagtgggc tacgtggacg acacccagtt cgtgaggttc 181 gacagcgacg ccgcgagtcc gaggacggag ccccgggcgc cgtggataga gcaggagggg 241 ccggagtatt gggaccggga gacacagatc tccaagacca acacacagac ttaccgagag 301 gacctgcgga ccctgctccg ctactacaac cagagcgagg ccgggtctca caccatccag 361 aggatgtctg gctgcgacgt ggggccggac gggcgcctcc tccgcgggta taaccagttc 421 gcctacgacg gcaaggatta catcgccctg aacgaggacc tgagctcctg gaccgcggcg 481 gacaccgcgg ctcagatcac ccagcgcaag tgggaggcgg cccgtgtggc ggagcaggac 541 agagcctacc tggagggcac gtgcgtggag tggctccgca gatacctgga gaacgggaag 601 gagacgctgc agcgcgcgga ccccccaaag acacatgtga cccaccaccc catctctgac 661 catgaggcca ccctgaggtg ctgggccctg ggcttctacc ctgcggagat cacactgacc 721 tggcagcggg atggcgagga ccaaactcag gacaccgagc ttgtggagac cagaccagca 781 ggagatagaa ccttccagaa gtgggcagct gtggtggtgc cttctggaga agagcagaga 841 tacacatgcc atgtacagca tgaggggctg ccgaagcccc tcaccctgag atgggagcca 901 tcttcccagt ccaccatccc catcgtgggc attgttgctg gcctggctgt cctagcagtt 961 gtggtcatcg gagctgtggt cgctactgtg atgtgtagga ggaagagctc aggtggaaaa 1021 ggagggagct actctcaggc tgcgtccagc gacagtgccc agggctctga tgtgtctctc 1081 acagcttgaa aag // LOCUS HUMMHCAH 1098 bp ss-mRNA PRI 27-JUL-1990 DEFINITION Human MHC HLA protein, allele A25, complete cds. ACCESSION M32321 KEYWORDS cell surface antigen; cell surface glycoprotein; class I gene; integral membrane protein; major histocompatibility complex. SOURCE Human blood B-cell cell line BM92 cDNA to mRNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 1098) AUTHORS Ennis,P.D., Zemmour,J., Salter,R.D. and Parham,P. TITLE Rapid cloning of HLA-A,B cDNA by using the polymerase chain reaction: Frequency and nature of errors produced in amplification JOURNAL Proc. Natl. Acad. Sci. U.S.A. 87, 2833-2837 (1990) STANDARD full staff_entry COMMENT Authorin copy of sequence [1] kindly submitted by E.Warren, 22-FEB-1990. FEATURES from to/span description pept 1 1098 HLA protein allele A25 sigp 1 72 HLA protein signal peptide matp 73 1095 HLA protein mature peptide BASE COUNT 216 a 329 c 372 g 181 t ORIGIN 1 atggccgtca tggcgccccg aaccctcgtc ctgctactct cgggggccct ggccctgacc 61 cagacctggg cgggctccca ctccatgagg tatttctaca cctccgtgtc ccggcccggc 121 cgcggggagc cccgcttcat cgccgtgggc tacgtggacg acacgcagtt cgtgcggttc 181 gacagcgacg ccgcgagcca gaggatggag ccgcgggcgc cgtggataga gcaggagggg 241 ccggagtatt gggaccggaa cacacggaat gtgaaggccc actcacagac tgaccgagag 301 agcctgcgga tcgcgctccg ctactacaac cagagcgagg acggttctca caccatccag 361 aggatgtatg gctgcgacgt ggggccggac gggcgcttcc tccgcgggta ccagcaggac 421 gcttacgacg gcaaggatta catcgccctg aacgaggacc tgcgctcttg gaccgcggcg 481 gacatggcgg ctcagatcac ccagcgcaag tgggagacgg cccatgaggc ggagcagtgg 541 agagcctacc tggagggccg gtgcgtggag tggctccgca gatacctgga gaacgggaag 601 gagacgctgc agcgcacgga cgcccccaag acgcatatga ctcaccacgc tgtctctgac 661 catgaggcca ccctgaggtg ctgggccctg agcttctacc ctgcggagat cacactgacc 721 tggcagcggg atggggagga ccagacccag gacacggagc tcgtggagac caggcctgca 781 ggggatggga ccttccagaa gtgggcgtct gtggtggtgc cttctggaca ggagcagaga 841 tacacctgcc atgtgcagca tgagggtctg cccaagcccc tcaccctgag atgggagccg 901 tcttcccagc ccaccatccc catcgtgggc atcattgctg gcctggttct ctttggagct 961 gtgatcgctg gagctgtggt cgctgctgtg atgtggagga ggaagagctc agatagaaaa 1021 ggagggagct actctcaggc tgcaagcagt gacagtgccc agggctctga tatgtctctc 1081 acagcttgta aagtgtga // LOCUS HUMMHCAI 1098 bp ss-mRNA PRI 27-JUL-1990 DEFINITION Human MHC HLA protein, allele A2, complete cds. ACCESSION M32322 KEYWORDS cell surface antigen; cell surface glycoprotein; class I gene; integral membrane protein; major histocompatibility complex. SOURCE Human blood B-cell cell line JY cDNA to mRNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 1098) AUTHORS Ennis,P.D., Zemmour,J., Salter,R.D. and Parham,P. TITLE Rapid cloning of HLA-A,B cDNA by using the polymerase chain reaction: Frequency and nature of errors produced in amplification JOURNAL Proc. Natl. Acad. Sci. U.S.A. 87, 2833-2837 (1990) STANDARD full staff_entry COMMENT Authorin copy of sequence [1] kindly submitted by E.Warren, 22-FEB-1990. FEATURES from to/span description pept 1 1098 HLA protein allele A2 sigp 1 72 HLA protein signal peptide matp 73 1095 HLA protein mature peptide BASE COUNT 220 a 323 c 368 g 187 t ORIGIN 1 atggccgtca tggcgccccg aaccctcgtc ctgctactct cgggggctct ggccctgacc 61 cagacctggg cgggctctca ctccatgagg tatttcttca catccgtgtc ccggcccggc 121 cgcggggagc cccgcttcat cgcagtgggc tacgtggacg acacgcagtt cgtgcggttc 181 gacagcgacg ccgcgagcca gaggatggag ccgcgggcgc cgtggataga gcaggagggt 241 ccggagtatt gggacgggga gacacggaaa gtgaaggccc actcacagac tcaccgagtg 301 gacctgggga ccctgcgcgg ctactacaac cagagcgagg ccggttctca caccgtccag 361 aggatgtatg gctgcgacgt ggggtcggac tggcgcttcc tccgcgggta ccaccagtac 421 gcctacgacg gcaaggatta catcgccctg aaagaggacc tgcgctcttg gaccgcggcg 481 gacatggcag ctcagaccac caagcacaag tgggaggcgg cccatgtggc ggagcagttg 541 agagcctacc tggagggcac gtgcgtggag tggctccgca gatacctgga gaacgggaag 601 gagacgctgc agcgcacgga cgcccccaaa acgcatatga ctcaccacgc tgtctctgac 661 catgaagcca ccctgaggtg ctgggccctg agcttctacc ctgcggagat cacactgacc 721 tggcagcggg atggggagga ccagacccag gacacggagc tcgtggagac caggcctgca 781 ggggatggaa ccttccagaa gtgggcggct gtggtggtgc cttctggaca ggagcagaga 841 tacacctgcc atgtgcagca tgagggtttg cccaagcccc tcaccctgag atgggagccg 901 tcttcccagc ccaccatccc catcgtgggc atcattgctg gcctggttct ctttggagct 961 gtgatcactg gagctgtggt cgctgctgtg atgtggagga ggaagagctc agatagaaaa 1021 ggagggagct actctcaggc tgcaagcagt gacagtgccc agggctctga tgtgtctctc 1081 acagcttgta aagtgtga // LOCUS HUMNOXF 2206 bp ss-mRNA PRI 27-JUL-1990 DEFINITION Human neutrophil oxidase factor (p67-phox) mRNA, complete cds. ACCESSION M32011 KEYWORDS neutrophil oxidase factor. SOURCE Human promyelocytic leukemia myeloid cell line HL60, cDNA to mRNA, clone 10. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 2206) AUTHORS Leto,T.L., Lomax,K.J., Volpp,B.D., Nunoi,H., Sechler,J.M.G., Nauseef,W.M., Clark,R.A., Gallin,J.I. and Malech,H.L. TITLE Cloning of a 67kD neutrophil oxidase factor with similarity to a noncatalytic region of P60-c-src JOURNAL Science 248, 727-730 (1990) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence [1] kindly submitted by H.L.Malech, 08-FEB-1990. FEATURES from to/span description pept 68 1648 neutrophil oxidase factor (p67-phox) mRNA < 1 2206 p67-phox signal 2030 2035 poly-A signal signal 2181 2186 poly-A signal BASE COUNT 624 a 499 c 573 g 510 t ORIGIN 1 ctagtctttc agccttcagg ctgtttttgg cttgaagctc tcttggcctc ctagtttcta 61 cctaatcatg tccctggtgg aggccatcag cctctggaat gaaggggtgc tggcagcgga 121 caagaaggac tggaagggag ccctggatgc cttcagtgcc gtccaggacc cccactcccg 181 gatttgcttc aacattggct gcatgtacac tatcctgaag aacatgactg aagcagagaa 241 ggcctttacc agaagcatta accgagacaa gcacttggca gtggcttact tccaacgagg 301 gatgctctac taccagacag agaaatatga tttggctatc aaagacctta aagaagcctt 361 gattcagctt cgagggaacc agctgataga ctataagatc ctggggctcc agttcaagct 421 gtttgcctgt gaggtgttat ataacattgc tttcatgtat gccaagaagg aggaatggaa 481 aaaagctgaa gaacagttag cattggccac gagcatgaag tctgagccca gacattccaa 541 aatcgacaag gcgatggagt gtgtctggaa gcagaagcta tatgagccag tggtgatccc 601 tgtgggcaag ctgtttcgac caaatgagag acaagtggct cagctggcca agaaggatta 661 cctaggcaag gcgacggtcg tggcatctgt ggtggatcaa gacagtttct ctgggtttgc 721 ccctctgcaa ccacaggcag ctgagcctcc acccagaccg aaaaccccag agatcttcag 781 ggctctggaa ggggaggctc accgtgtgct atttgggttt gtgcctgaga caaaagaaga 841 gctccaggtc atgccaggga acattgtctt tgtcttgaag aagggcaatg ataactgggc 901 cacggtcatg ttcaacgggc agaaggggct tgttccctgc aactaccttg aaccagttga 961 gttgcggatc caccctcagc agcagcccca ggaggaaagc tctccgcagt ccgacatccc 1021 agctcctcct agttccaaag cccctggaaa accccagctg tcaccaggcc agaaacaaaa 1081 agaagagcct aaggaagtga agctcagtgt tcccatgccc tacacactca aggtgcacta 1141 caagtacacg gtagtcatga agactcagcc cgggctcccc tacagccagg tccgggacat 1201 ggtgtctaag aaactggagc tccggctgga acacactaag ctgagctatc ggcctcggga 1261 cagcaatgag ctggtgcccc tttcagaaga cagcatgaag gatgcctggg gccaggtgaa 1321 aaactactgc ctgactctgt ggtgtgagaa cacagtgggt gaccaaggct ttccagatga 1381 acccaaggaa agtgaaaaag ctgatgctaa taaccagaca acagaacctc agcttaagaa 1441 aggcagccaa gtggaggcac tcttcagtta tgaggctacc caaccagagg acctggagtt 1501 tcaggaaggg gatataatcc tggtgttatc aaaggtgaat gaagaatggc tggaagggga 1561 gtgcaaaggg aaggtgggca ttttccccaa agtttttgtt gaagactgcg caactacaga 1621 tttggaaagc actcggagag aagtctagga tgtttcacaa actacaaagc tgaagaaaat 1681 gaagccctat tacttgtttg taagatttag cacccttctg ctgtatactg tactgagaca 1741 ttacagtttg gaagtgttaa ctatttattc cctgttaaaa tttaacctac tagacaatga 1801 tgtgagtacc caggatgatt tcctggggca cagtgggtga ggagatgggg acaggtgaat 1861 ggaggagtta ggggagagga aaagtggatg gaagtgtctg gaaagggcac gagagagtct 1921 tccaggtact gatcctgttt cttgctctga gtgctagcta gccagctgtg ttcacactgt 1981 aaacattcat caagctgtac atttggtgca cttttctgtg tcataccaca ataaaaaaaa 2041 acctatcatc atcttacaaa aacaagacac ccaagtccag gcccaaggag taagtacaaa 2101 tattcctgtt tctgaaccat tactgtaatt ggctcttaag gcttgaagta accttatagg 2161 ttactcataa ggcatataca aataaacttg tttgttttct tttttc // LOCUS HUMPTPBX 3247 bp ss-mRNA PRI 27-JUL-1990 DEFINITION Human phosphotyrosyl-protein phosphatase (PTP-1B) mRNA, complete cds. ACCESSION M31724 KEYWORDS phosphotyrosyl-protein phosphatase. SOURCE Human placenta, cDNA to mRNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 3247) AUTHORS Chernoff,J., Schievella,A.R., Jost,C.A., Erikson,R.L. and Neel,B.G. TITLE Cloning of a cDNA for a major human protein-tyrosine-phosphatase JOURNAL Proc. Natl. Acad. Sci. U.S.A. 87, 2735-2739 (1990) STANDARD simple staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by Chernoff,J., 13-MAR-1990. FEATURES from to/span description pept 91 1398 phosphotyrosyl-protein phosphatase (EC 3.1.3.48) mRNA < 1 3247 phosphotyrosyl-protein phosphatase mRNA BASE COUNT 836 a 834 c 806 g 771 t ORIGIN 1 gggcgggcct cggggctaag agcgcgacgc ctagagcggc agacggcgca gtgggccgag 61 aaggaggcgc agcagccgcc ctggcccgtc atggagatgg aaaaggagtt cgagcagatc 121 gacaagtccg ggagctgggc ggccatttac caggatatcc gacatgaagc cagtgacttc 181 ccatgtagag tggccaagct tcctaagaac aaaaaccgaa ataggtacag agacgtcagt 241 ccctttgacc atagtcggat taaactacat caagaagata atgactatat caacgctagt 301 ttgataaaaa tggaagaagc ccaaaggagt tacattctta cccagggccc tttgcctaac 361 acatgcggtc acttttggga gatggtgtgg gagcagaaaa gcaggggtgt cgtcatgctc 421 aacagagtga tggagaaagg ttcgttaaaa tgcgcacaat actggccaca aaaagaagaa 481 aaagagatga tctttgaaga cacaaatttg aaattaacat tgatctctga agatatcaag 541 tcatattata cagtgcgaca gctagaattg gaaaacctta caacccaaga aactcgagag 601 atcttacatt tccactatac cacatggcct gactttggag tccctgaatc accagcctca 661 ttcttgaact ttcttttcaa agtccgagag tcagggtcac tcagcccgga gcacgggccc 721 gttgtggtgc actgcagtgc aggcatcggc aggtctggaa ccttctgtct ggctgatacc 781 tgcctcctgc tgatggacaa gaggaaagac ccttcttccg ttgatatcaa gaaagtgctg 841 ttagaaatga ggaagtttcg gatggggttg atccagacag ccgaccagct gcgcttctcc 901 tacctggctg tgatcgaagg tgccaaattc atcatggggg actcttccgt gcaggatcag 961 tggaaggagc tttcccacga ggacctggag cccccacccg agcatatccc cccacctccc 1021 cggccaccca aacgaatcct ggagccacac aatgggaaat gcagggagtt cttcccaaat 1081 caccagtggg tgaaggaaga gacccaggag gataaagact gccccatcaa ggaagaaaaa 1141 ggaagcccct taaatgccgc accctacggc atcgaaagca tgagtcaaga cactgaagtt 1201 agaagtcggg tcgtgggggg aagtcttcga ggtgcccagg ctgcctcccc agccaaaggg 1261 gagccgtcac tgcccgagaa ggacgaggac catgcactga gttactggaa gcccttcctg 1321 gtcaacatgt gcgtggctac ggtcctcacg gccggcgctt acctctgcta caggttcctg 1381 ttcaacagca acacatagcc tgaccctcct ccactccacc tccacccact gtccgcctct 1441 gcccgcagag cccacgcccg actagcaggc atgccgcggt aggtaagggc cgccggaccg 1501 cgtagagagc cgggccccgg acggacgttg gttctgcact aaaacccatc ttccccggat 1561 gtgtgtctca cccctcatcc ttttactttt tgccccttcc actttgagta ccaaatccac 1621 aagccatttt ttgaggagag tgaaagagag taccatgctg gcggcgcaga gggaaggggc 1681 ctacacccgt cttggggctc gccccaccca gggctccctc ctggagcatc ccaggcggcg 1741 cacgccaaca gcccccccct tgaatctgca gggagcaact ctccactcca tatttattta 1801 aacaattttt tccccaaagg catccatagt gcactagcat tttcttgaac caataatgta 1861 ttaaaatttt ttgatgtcag ccttgcatca agggctttat caaaaagtac aataataaat 1921 cctcaggtag tactgggaat ggaaggcttt gccatgggcc tgctgcgtca gaccagtact 1981 gggaaggagg acggttgtaa gcagttgtta tttagtgata ttgtgggtaa cgtgagaaga 2041 tagaacaatg ctataatata taatgaacac gtgggtattt aataagaaac atgatgtgag 2101 attactttgt cccgcttatt ctcctccctg ttatctgcta gatctagttc tcaatcactg 2161 ctcccccgtg tgtattagaa tgcatgtaag gtcttcttgt gtcctgatga aaaatatgtg 2221 cttgaaatga gaaactttga tctctgctta ctaatgtgcc ccatgtccaa gtccaacctg 2281 cctgtgcatg acctgatcat tacatggctg tggttcctaa gcctgttgct gaagtcattg 2341 tcgctcagca atagggtgca gttttccagg aataggcatt tgctaattcc tggcatgaca 2401 ctctagtgac ttcctggtga ggcccagcct gtcctggtac agcagggtct tgctgtaact 2461 cagacattcc aagggtatgg gaagccatat tcacacctca cgctctggac atgatttagg 2521 gaagcaggga caccccccgc cccccacctt tgggatcagc ctccgccatt ccaagtcaac 2581 actcttcttg agcagaccgt gatttggaag agaggcacct gctggaaacc acacttcttg 2641 aaacagcctg ggtgacggtc ctttaggcag cctgccgccg tctctgtccc ggttcacctt 2701 gccgagagag gcgcgtctgc cccaccctca aaccctgtgg ggcctgatgg tgctcacgac 2761 tcttcctgca aagggaactg aagacctcca cattaagtgg ctttttaaca tgaaaaacac 2821 ggcagctgta gctcccgagc tactctcttg ccagcatttt cacattttgc ctttctcgtg 2881 gtagaagcca gtacagagaa attctgtggt gggaacattc gaggtgtcac cctgcagagc 2941 tatggtgagg tgtggataag gcttaggtgc caggctgtaa gcattctgag ctggcttgtt 3001 gtttttaagt cctgtatatg tatgtagtag tttgggtgtg tatatatagt agcatttcaa 3061 aatggacgta ctggtttaac ctcctatcct tggagagcag ctggctctcc accttgttac 3121 acattatgtt agagaggtag cgagctgctc tgctatatgc cttaagccaa tatttactca 3181 tcaggtcatt attttttaca atggccatgg aataaaccat ttttacaaaa ataaaaacaa 3241 aaaaagc // LOCUS HUMUDPGTA 1855 bp ss-mRNA PRI 27-JUL-1990 DEFINITION Human 3,4-catechol estrogen UDP-glucuronosyltransferase mRNA, complete cds. ACCESSION J05428 KEYWORDS 3,4-catechol estrogen UDP-glucuronosyltransferase. SOURCE Human liver, cDNA to mRNA, clone 63-11. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 1855) AUTHORS Ritter,J.K., Sheen,Y.Y. and Owens,I.S. TITLE Cloning and expression of human liver UDP-glucuronosyltransferase in COS-1 cells: 3,4-catechol estrogens and estriol as primary substrates JOURNAL J. Biol. Chem. 265, 7900-7906 (1990) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by I.S.Owens, 22-FEB-1990. FEATURES from to/span description pept 15 1604 UDP-glucuronosyltransferase (EC 2.4.1.17) BASE COUNT 571 a 337 c 394 g 553 t ORIGIN 1 tgcattgcac caggatgtct gtgaaatgga cttcagtaat tttgctaata caactgagct 61 tttgctttag ctctgggaat tgtggaaagg tgctggtgtg ggcagcagaa tacagccatt 121 ggatgaatat aaagacaatc ctggatgagc ttattcagag aggtcatgag gtgactgtac 181 tggcatcttc agcttccatt ctttttgatc ccaacaactc atccgctctt aaaattgaaa 241 tttatcccac atctttaact aaaactgagt tggagaattt catcatgcaa cagattaaga 301 gatggtcaga ccttccaaaa gatacatttt ggttatattt ttcacaagta caggaaatca 361 tgtcaatatt tggtgacata actagaaagt tctgtaaaga tgtagtttca aataagaaat 421 ttatgaaaaa agtacaagag tcaagatttg acgtcatttt tgcagatgct atttttccct 481 gtagtgagct gctggctgag ctatttaaca taccctttgt gtacagtctc agcttctctc 541 ctggctacac ttttgaaaag catagtggag gatttatttt ccctccttcc tacgtacctg 601 ttgttatgtc agaattaact gatcaaatga ctttcatgga gagggtaaaa aatatgatct 661 atgtgcttta ctttgacttt tggttcgaaa tatttgacat gaagaagtgg gatcagtttt 721 atagtgaagt tctaggaaga cccactacgt tatctgagac aatggggaaa gctgacgtat 781 ggcttattcg aaactcctgg aattttcagt ttcctcatcc actcttacca aatgttgatt 841 ttgttggagg actccactgc aaacctgcca aacccctgcc taaggaaatg gaagactttg 901 tacagagctc tggagaaaat ggtgttgtgg tgttttctct ggggtcaatg gtcagtaaca 961 tgacagaaga aagggccaac gtaattgcat cagccctggc ccagatccca caaaaggttc 1021 tgtggagatt tgatgggaat aaaccagata ccttaggtct caatactcgg ctgtataagt 1081 ggatacccca gaatgacctt ctaggtcatc caaagaccag agcttttata actcatggtg 1141 gagccaatgg catctacgag gcaatctacc atgggatccc tatggtgggg attccattgt 1201 ttgccgatca acctgataac attgctcaca tgaaggccag gggagcagct gttagagtgg 1261 acttcaacac aatgtcgagt acagacttgc tgaatgcatt gaagagagta attaatgatc 1321 cttcatataa agagaatgtt atgaaattat caagaattca acatgatcaa ccagtgaagc 1381 ccctggatcg agcagtcttc tggattgaat ttgtcatgcg ccacaaagga gctaaacacc 1441 ttcgggttgc agcccacgac ctcacctggt tccagtacca ctctttggat gtgattgggt 1501 tcctgctggt ctgtgtggca actgtgatat ttatcgtcac aaaatgttgt ctgttttgtt 1561 tctggaagtt tgctagaaaa gcaaagaagg gaaaaaatga ttagttatat ctgagatttg 1621 aagctggaaa acctgatagg tgagactact tcagtttatt ccagcaagaa agattgtgat 1681 gcaagatttc tttcttcctg agacaaaaaa aaaaaaaaga aaaaaaaatc ttttcaaaat 1741 ttactttgtc aaataaaaat ttgtttttca gagatttacc acccagttca tggttagaaa 1801 tattttgtgg caatgaagaa aacactacgg aaaataaaaa ataagataaa gcctt // LOCUS RATGFGDVA 630 bp ss-mRNA ROD 27-JUL-1990 DEFINITION Rat glioma-derived vascular endothelial cell growth factor mRNA, complete cds. ACCESSION M32167 KEYWORDS endothelial cell growth factor protein. SOURCE Rat cell line GS-9L, cDNA to mRNA. ORGANISM Rattus norvegicus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 630) AUTHORS Conn,G., Bayne,M.L., Soderman,D.D., Kwok,P.W., Sullivan,K.A., Palisi,T.M., Hope,D.A. and Thomas,K.A. TITLE Amino acid and cDNA sequences of a vascular endothelial cell mitogen that is homologous to platelet-derived growth factor JOURNAL Proc. Natl. Acad. Sci. U.S.A. 87, 2628-2632 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by M.L.Bayne, 16-FEB-1900. FEATURES from to/span description pept 5 577 endothelial cell growth factor protein precursor sigp 5 82 endothelial cell growth factor protein signal peptide matp 83 574 endothelial cell growth factor protein BASE COUNT 169 a 163 c 172 g 126 t ORIGIN 1 aaccatgaac tttctgctct cttgggtgca ctggaccctg gctttactgc tgtacctcca 61 ccatgccaag tggtcccagg ctgcacccac gacagaaggg gagcagaaag cccatgaagt 121 ggtgaagttc atggacgtct accagcgcag ctattgccgt ccgattgaga ccctggtgga 181 catcttccag gagtaccccg atgagataga gtatatcttc aagccgtcct gtgtgcccct 241 aatgcggtgt gcgggctgct gcaatgatga agccctggag tgcgtgccca cgtcggagag 301 caacgtcact atgcagatca tgcggatcaa acctcaccaa agccagcaca taggagagat 361 gagcttcctg cagcatagca gatgtgaatg cagaccaaag aaagatagaa caaagccaga 421 aaatcactgt gagccttgtt cagagcggag aaagcatttg tttgtccaag atccgcagac 481 gtgtaaatgt tcctgcaaaa acacagactc gcgttgcaag gcgaggcagc ttgagttaaa 541 cgaacgtact tgcagatgtg acaagccaag gcggtgagcc aggctgcagg aaggagcctc 601 ctcagggttt cgggaactag acctctcacc // LOCUS RATYWKII 1821 bp ss-mRNA ROD 27-JUL-1990 DEFINITION Rat sperm membrane protein (YWK-II) mRNA, 3' end. ACCESSION M31322 KEYWORDS sperm membrane protein. SOURCE Rat testis cDNA to mRNA. ORGANISM Rattus norvegicus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 1821) AUTHORS Yan,Y.C., Bai,Y., Wang,L.F. and Miao,S.Koide.S.S. TITLE Characterization of cDNA encoding a rat sperm membrane protein related to A4 amyloid protein JOURNAL Proc. Natl. Acad. Sci. U.S.A. 87, 2405-2408 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by S.S.Koide 17-JAN-1990. FEATURES from to/span description pept < 1 576 sperm membrane protein (AA at 1) mRNA < 1 1821 YWK-II mRNA BASE COUNT 447 a 427 c 416 g 531 t ORIGIN 1 bp upstream of EcoRI site. 1 gaattcgtta cctcctccat ctcagagaac cctgtggatg tccgggtgag ctctgaggaa 61 agtgaggaga tcccgccgtt ccaccctttc catcccttcc catccttgtc tgagaacgaa 121 gacactcagc cggagttgta ccacccaatg aaaaaaggat ctggaatggc agagcaagat 181 gggggcctga ttggtgcaga agaaaaggtg atcaacagta agaataaaat ggatgaaaat 241 atggtcattg acgagactct ggatgttaag gaaatgattt tcaatgctga gagagttggg 301 ggtctggagg aagagccgga ctctgtgggg cctctacggg aggacttcag tttgagcagc 361 agtgccctta ttggcttgct ggtcatcgcg gtggccattg ctacagtcat cgtcatcagc 421 ctggtgatgc tgaggaagag gcaatacggc accatcagcc atggcattgt ggaggttcat 481 ccaatgctca ccccagaaga gcgtcacttg aacaagatgc agaaccacgg ctatgaaaac 541 ccaacctaca aatacctgga gcagatgcag atttaaggac agcagcgtgt gcgacaccct 601 ggctgaggct gctgcaggtg ggctggaaga gcctcagcgt ttgtgcttga ctgctgacca 661 ccagcggtgc cagaggcctc atcctacatc ctgctctcct ggattgttaa gactataaag 721 tactactgta ggattgcaat ttccattctt ttaaatgggt ttaaaagatg ttaatataac 781 aatatatgat atataaacct taagtgaaaa aaagatctat tgcagatatc tgatggatgt 841 agttttcttt ttttaaatta gaaatgccac ttctattgta ttgtctcaca catgctctat 901 ataaatggaa aatgttgatt tttcaatgat agactatata cacaggctgt tcccgttatg 961 taagtctgtt ctttaggctc gtttgctggg ctggttttgt cgtcatttgt tttaatgtat 1021 aaaggcagta ttcccctttt caggttgctg agaaatgtaa gtggaactga agtacattgt 1081 atgcagttac tgactgtttt aggcatagtc tccttggaag cctagagctt ccagtgccgg 1141 gtgtccagtg cctgtcacca aagcaagggc taagtcacct tgagctagct ggatgcaaac 1201 tagatccact gtgctttcct tcaaatccag ttcttccaca gcaaccagcc catagttgtt 1261 ctgtgttctt ccacagctgt ttacggtagc ctcctagcca ctctcctcag caagtgcatc 1321 caagagtgca ccaccccctt ctttggacgt ctccgtccca tgcactgacc ctctgcttgc 1381 cttcgtacct cacttcctcc accgctcttc agcccctttg atgtcccctc agagaatacc 1441 gatatacaca tggctaagga cccaggagac ttcacgggag gcctcattag gtgaaaggac 1501 gatgttctgg gctgtacatg aaattggatc tgtagacact gtgtttcctt cactgacttg 1561 taatgtcacg cagctggagt tgatgccaca acccttagtg ctttgttgct gttttgtttt 1621 tcagggttct ggtaacctgc tactgttttt gttttggttt tggtttggtt ttttttgtat 1681 ttttctgtga tttccctccc cttccccccc atgcctcttc ccactatgca cagatggaaa 1741 ctttacctac aaactccttc gtatgatctg tggagaatgt acagaactta ttacatcaat 1801 aaaacacttt aacttccccc g // LOCUS RHBNODVW 4227 bp ds-DNA BCT 27-JUL-1990 DEFINITION B.japonicum nodulation protein (nodV, nodW) genes, complete cds. ACCESSION M31765 KEYWORDS nodulation protein; regulatory protein. SOURCE B.japonicum (strain USDA3I1b110spc4) DNA, clone pRJ141. ORGANISM Bradyrhizobium japonicum Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Rhizobiaceae. REFERENCE 1 (bases 1 to 4227) AUTHORS Goettfert,M., Grob,P. and Hennecke,H. TITLE Proposed regulatory pathway encoded by the nodV and nodW genes, determinants of host specificity in Bradyrhizobium japonicum JOURNAL Proc. Natl. Acad. Sci. U.S.A. 87, 2680-2684 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by M.Goettfert, 01-FEB-1990. FEATURES from to/span description pept 648 3317 nodulation protein (nodV) pept 3314 3997 nodulation protein (nodW; gtg start) BASE COUNT 919 a 1147 c 1228 g 933 t ORIGIN 1 ggatcctgcg gacgcagctt gtcgttctcg ttacgatgcg cgctgcgtta gcagagcatg 61 gtgggtgtga tcagcgtgaa tggcaccgac aaagccgtgg cgagtcatac gaggctgccc 121 tgcgttctgc gacgccgctc gcttccagcg gaaccagttt ttcgcagcta aagcgcgata 181 agattggaac gaatcatcat cgcgcgtagg actcgcgcga ctccggcttc tgtgtcggca 241 ggcgaacttg cgctttcttc gctttagcgg ctgagtcggg gagacgagtc attgggcagc 301 ttttttctga tccttgccgg tccggaggcg cgccttcgaa cctatcggct gaactgaaac 361 cactgtcctc attccgcggc ctaatggggc gcattggccg taggctcgag cgatgcgcaa 421 aagcggacgt ttcagcttcg actaaagtcg cgtcagcgac acctactgaa ggtttaggac 481 aaatatacgt atgtttataa tgcgtttttt ccccgcgcgt gctgagttcg gcagccgatg 541 atcgatcttg cacacgctat catgcggacc caagtgttca agattgcctc atttttgcct 601 aaaatgctat agctctcccg ccggataaca ggccggacgg tctcaaaatg tataggatca 661 gcagtttcaa acgttcagcc cctcagttga ccttgggcag catagcgcta gccgcggtta 721 cactgacttg cgtgtacttt caggcgcatt tcgccgccgc ggcgttcgcc tatttgttag 781 tagtcttact attttcgttg atgggcagct tcattgcttc atcagcgctt tgcatcgtcg 841 caatcgctgc tctcgcatac tactttgcgc cgccggcgtt tagtttacga atcgatgatc 901 cccgcgatgt tcctgtggtt gttgcatttc ttattgtctc tgttgtcgga acgtacctga 961 ttggaaaact ccgccaggaa agagaggctg cacgtgtggc tgcggccaag cttcagcgga 1021 gtgcctcgga tttggaggat cgtgaaaaac ggtggcgcgc aattttcgag cacaatccag 1081 ccatgtactt catggtcgat gaggccggca ttgtcctcaa cgtcaatacg ttgggcgcga 1141 cacaactggg atttgcttgt gcagaactat tgggccaatc cgtgctcgac gtatttctgg 1201 aggaggatcg cgcattcgtc cgcaaatgca ttcagacgtg tcttgaggat gttggacaat 1261 cgcgcacttg ggacgtccgg aaagtcagga aggacggttc tgtattgtgg gtgcgtgaaa 1321 acgccaaggc catgctttgg gccggcgacc gccccgtcat cctcatggcg tgcgaagata 1381 ttacggagcg caagcagacc gagcttgcgc tgcagcggag cgaagcgcat ttggctcacg 1441 cgcaggagtt gagtcataca ggcagcttca gctggaacgc ctctaccggc gaggccttct 1501 ggtctaagga gacatttcgg attttccaaa tcgatcttca gacgacaccg gcgccacaac 1561 tcgtcattga gcgcacgcac ccagatgata gggcttctgt caaagagatt atcgatgaag 1621 cgatgcgaga cctgagggat ttcgagcacg agtaccggct gctgctacct gacggctccg 1681 tgaagcacat ccatgcgcag gcacgagtca cgcgaaccgc ctctggtgaa attgagtttg 1741 ttggggcagc caccgatatt acggcagcta ggcgagcaga acagcagttg cgccgaagcg 1801 aggcctatct ggccgaggct cagcatctca ctcacacagg cagctggtcc tgggacgtcc 1861 acacacgaga tttcgtttat cgctccgctg aggtcgaccg cctgtttggc tttaacccac 1921 aagagccggt ttcgctagag actattcgat cgcgcatcca tccggaagac ttgccagggt 1981 tgcaggaggt gcagcgtcag gcgattgacc aggagcacga acggttcgaa tatgatttcc 2041 gtgttattct gccagatggc gggataaggc gcatacactc cgttgcacac gttgtcgtcg 2101 gcagcgatgg taatgtcagc gagctgatcg gaacacatat ggatgttacc gagcaacacg 2161 cagctaggga acgcttggaa aacacgcttg tagcgctgcg cgagagcgaa cagcgctttc 2221 gcgactatgc cgagactgct tccgactggc tctgggaaac cgggccagat catcgggtca 2281 ctcacttgtc cgagcacacc agcgctgcgg gaattttggc gacagggtta acgggcctgc 2341 ttcgctggga catcgcgtgc gacatggaag aagaaccgga gaagtggcgt cagcatcggg 2401 cgacgttgca ggcacacctg ccgttccggg atctcatcta ccgtaccgta aataggatgg 2461 gatctccgat ctacgtccgc actagtggca agcccttctt cgacggaaac ggaaattttt 2521 tgggctatcg cggcgtcagc actgacatca ccgctaccat tcgcgctgat caggccgaac 2581 aagagctgcg aaaggcacag gcggagcttg cacatgtgac gcgtgtaacg accttgggag 2641 aaatgacaac ttctatcgcc cacgagataa cccaaccact cgccgctatc ctcagcaacg 2701 ccgatgcgtg cctcggctgg atggctcgcg atgttcccaa tcttgcagcc gcgcgctctt 2761 cagtcgaatg gatcatagaa gatgcaatcc gggcaagcga ggtgatccgt agtattcgcg 2821 cactcgcgaa aaagggcgag atcgagatgg tgccgctcga cattaatcag gtggttaggg 2881 acgtcagcgc gctggtaaca cgagagctgg tgagccacca agtgacgttg cgaagcgagt 2941 tggcgtctgc gctgcctagg gtcctcggtg atcgaattca gctacaacaa gtgatcatca 3001 atctggtgat gaacggaatc gaggccatgg acgcagttac agaccggccg cgtgaacttc 3061 tgattcaatc atctacggac gatctggggt acgtgcagct ttccgtgacc gattgcggcg 3121 tcgggatcgc cgagaatgac gcggaccgcg tcttggaccc cttcttcacc accaaatcga 3181 gcggcctagg aatgggcctt tcgatctgcc ggtcgatcgt ggaagttcac ggaggacgaa 3241 tttcagtggt tcagaaaaat ggaccgggcg cgacgttcca gtttgccctt ccgctgcata 3301 aggaggccat ctcgtgacag gacgatttga ctggagaggc caaggcggac ataccgaggc 3361 ttcgacgaag gcaatcgtct ttgtcgtcga ggatgacatc tctatgcgtc gctcgcttac 3421 gaaccttttt cgatcggtag gcttggaggt cgtggcgttc ggatcggccc gtgaaatgct 3481 gcagagcaca atgccggacg tcacaagctg tctagttctt gatgtccggc tgccgggcct 3541 gagcggcctt gactaccaga ccgagctcgc caggttgaac atacacatcc caatcatctt 3601 cattaccggc catggcgaca ttcccatgac cgtcagggcc atgaagggag gcgcggtcga 3661 tttcctcagc aagcccttcc gcgatcagga actgcttgat gccgtcgttg cggcgaccga 3721 acgcgatcgc aaaagacgag aagctcagcg aaccgtggcg aacctgaaat ctctatttga 3781 gaccctaagc ccgcgagaac aggcagtgat gaaactggtc gcgacggggc tgatgaacaa 3841 gcaggtagcc gccgaacttg ggctcgccga gatcaccgtc aagatctacc ggggacacgt 3901 aatgaaaaag atgcgtgccc gctcgctggc tgacttgatc agaatgagcg agacgctcgg 3961 aattagcgcc aatcacactg aacaaaccca agtatgattt tacaattcca tcacttaagc 4021 ccactttcgc gaaagtggct gacggtttgg cagccgctgt actgcgtcgg gagggctcat 4081 cttgtccacg cctttgattt ccgtcgttga cgacgacccc tcggtccgtg cggcgacaga 4141 aaaccttttg aaatcgcgtg gctacgtcgt gcagatattt gcctcggccg aggcgctcct 4201 gcggtcgccg cggttgaacg agatatc // LOCUS ANAPETF1A 1682 bp ds-DNA BCT 27-JUL-1990 DEFINITION A.variabilis ferredoxin (petF1) gene, complete cds. ACCESSION M35126 KEYWORDS ferredoxin. SOURCE A.variabilis (ATCC 29413; PCC 7937) DNA, clone pVA1. ORGANISM Anabaena variabilis Prokaryota; Bacteria; Gracilicutes; Oxyphotobacteria; Cyanobacteria; Nostocales; Nostocaceae. REFERENCE 1 (bases 1 to 1682) AUTHORS Van Der Plas,J., De Groot,R., Woortman,M., Cremers,F., Borrias,M., Van Arkel,G. and Weisbeek,P. TITLE Genes encoding ferredoxins from Anabaena sp. PCC 7937 and Synechococcus sp. PCC 7942: Structure and regulation JOURNAL Photosyn. Res. 18, 179-204 (1988) STANDARD simple staff_entry FEATURES from to/span description pept 794 1093 ferredoxin (petF1) binding 782 785 ribosomal binding site (put.) BASE COUNT 480 a 353 c 322 g 527 t ORIGIN 1 aagcttgatg taagccacct aaaagttgaa tgtgttcgac gtggagtaat ttttgctgtg 61 ctttggctaa ggcaatcagt tcttctgctt cagttaaatc tacagataaa ggatattcta 121 caattacgtg tttgccggca gttagagcag cacggcgatc gcaccatgat cacgattgat 181 ggtacaaata accactaaat ctatatcttc tcgttctact aactgttgcc agccagttat 241 cgcttcagcc tggtaatctt gagccaaagc ctgggttcgc tccagtgtac tacccgctat 301 ggcgactagg tgcgatcgct tatcctccaa aaaggcttct gcccgcagtt ttgctgcata 361 ccctgtccaa ctatgccaat acgtattgtt gcttgtgcca aagctacctc cgactcctat 421 ataatcctca ccagtcctgc taccaagcac agtaataaat ttttattttt caaaaaaagc 481 taggttaacg cagtaccagc ctaactgaag gttatccctg ttccagaact tttttatcat 541 attcttgatt caagccataa atttttctaa taactgtggg tttaacgtct ttaaagtata 601 aaaaaaattt attgttatct gctaactaaa tttcattact aatcgtggtg gatttcaagt 661 tatatacttt gatttttccc gtagtatcag aattgaacta aatttaattc tgtggctaat 721 cccctgagaa tagccgctaa gttctgcttt agcataactt atactgccga ttacaaaaga 781 gaggattacg gaaatggcaa cttttaaagt tacattgatc aacgaagcag aaggaaccag 841 caacacaatt gacgttcctg atgatgagta tattttagac gctgccgaag aacagggtta 901 tgacctaccc ttttcctgtc gtgcaggtgc ttgctccacc tgcgccggta aactagtatc 961 cggtactgtt gaccagtctg accaatcatt cttggatgac gatcaaatcg aagctggata 1021 tgtattgacc tgtgttgctt atccaacctc tgatgtaacc atccaaaccc acaaagaaga 1081 agacctctac taagagtcaa gttagcttta aaaatagagg ctagaggcta gcagctacag 1141 actagtccct agccttttta ttttttttca ttagggaaca gtctattggg actgggtgta 1201 agggtgtaag ggtatagggg tgtaatagtt tcaaacattt atactctttt caacccttga 1261 tatttcgttt tcatccgtaa gtcccacaaa aaataaatta tccaaaattg atggtttggt 1321 agggtgcgtc aatagaaatc atttctgagt gtatttaggc tctatcgcac tgacggacac 1381 tacattttgg atattttttg atctgaaagt cccttaaaat ttgatttctt cattcaggct 1441 gtgaagttgt gtaccaggat aataaaattg gagaattttg ctgttagacc aacctagttt 1501 agctaaagtt tgagcgccag tttgacttaa gcccactcca tgtcctaaac caccaccaat 1561 aaaagcgtat cccacaactc ttctttgcct ttgttgagag gtagtatgta aaaaagcgtg 1621 cttcttggcg cagcaaaggc actacgcact tcatctttgt gtagggtaaa aatgcccata 1681 tc // LOCUS ANIPETF1A 1074 bp ds-DNA BCT 27-JUL-1990 DEFINITION A.nidulans ferredoxin (petF1) gene, complete cds. ACCESSION M35127 KEYWORDS ferredoxin. SOURCE A.nidulans (strain R2, PCC7942) DNA, clone pA612. ORGANISM Anacystis nidulans Prokaryota; Bacteria; Gracilicutes; Oxyphotobacteria; Cyanobacteria. REFERENCE 1 (bases 1 to 1074) AUTHORS Van Der Plas,J., De Groot,R., Woortman,M., Cremers,F., Borrias,M., Van Arkel,G. and Weisbeek,P. TITLE Genes encoding ferredoxins from Anabaena sp. PCC 7937 and Synechococcus sp. PCC 7942: Structure and regulation JOURNAL Photosyn. Res. 18, 179-204 (1988) STANDARD simple staff_entry FEATURES from to/span description pept 504 803 ferredoxin (petF1) binding 493 496 ribosomal binding site (put.) BASE COUNT 234 a 298 c 281 g 261 t ORIGIN 1 ctgcagccgc tgcatgcgca gggcgttcga tttgagggct ttttgtggaa catcttggcg 61 cacatcatcg aagctctaac accgaggctg tctgagttgg ctagtcctgg cagctgggca 121 atctttagcg gcttgctaac cagtcaagcc gacactgtca gcgtcacttt ggaagagtac 181 ttttgggtga tccgcgatcg cgccagtcag ggagattggt gtcgtttggt cgcggatttt 241 cggccggaac gataaatctc actaatgctt agcttagagg gcttactggg agcgggccga 301 gtttgagccg tgattacccc tacgaacttt ccggccacgc tccattgctt agacataaaa 361 ttcccttatg tctagactgg cgattgatag catttctcgc ggcgcagttc gccctttggc 421 aacccatagt atcaatggga aaggtacggg caggctgtca atcgatgagc tctgccaccc 481 caaaagcgat agaggacacg ctcatggcaa cctacaaggt tacgctcgtc aatgctgccg 541 aaggcttgaa caccacgatc gacgtggctg acgataccta catcttggac gccgctgaag 601 agcaaggcat tgacctgcct tactcctgcc gtgctggtgc ttgctcgacc tgtgctggca 661 aagtcgtctc tggtaccgtc gaccaatcgg atcaatcctt cttggatgac gaccaaattg 721 cagcaggctt tgtcctgacc tgcgtcgcct atccgacctc cgatgtgacg atcgaaaccc 781 acaaagaaga agacctctac taagtcttgc ttcgactgct tcaatcctta gaattcaaat 841 caaattgcgg cttccaaatt gggagtcgct tttttatcgc cgtagtcagc agcgatcgcc 901 tcactaccgt ccagaaacag cctgcgatcg ccctgactta acgcccctcc cctagcaatc 961 ctgactcggt agagttcagg ccaaggactc aaggttgagg cgatcgttca gcaacgtctt 1021 gctctaggcg tccggcgtga ctgttgtggg gaagaggcga tcgccttcct gcag // LOCUS ASAASPPA 1764 bp ds-DNA PLN 27-JUL-1990 DEFINITION A.awamori aspergillopepsin A (pepA) gene, complete cds. ACCESSION M34454 KEYWORDS aspartic proteinase; aspergillopepsin. SOURCE A.awamori (NRRL3112 mutant strain UVK143f) DNA. ORGANISM Aspergillus awamori Eukaryota; Plantae; Thallobionta; Eumycota; Ascomycotina; Plectomycetes; Eurotiales; Trichocomaceae. REFERENCE 1 (bases 1 to 1764) AUTHORS Berka,R.M., Ward,M., Wilson,L.J., Hayenga,K.J., Kodama,K.H., Carlomagno,L.P. and Thompson,S.A. TITLE Molecular cloning and deletion of the gene encoding aspergillopepsin A from Aspergillus awamori JOURNAL Gene 86, 153-162 (1990) STANDARD simple staff_entry FEATURES from to/span description pept 177 496 aspergillopepsin (pepA), exon 1 548 825 aspergillopepsin, exon 2 878 1126 aspergillopepsin, exon 3 1186 1523 aspergillopepsin, exon 4 sigp 177 308 aspergillopepsin signal peptide matp 455 496 aspergillopepsin 548 825 aspergillopepsin 878 1126 aspergillopepsin 1186 1523 aspergillopepsin pre-msg 124 1764 pepA mRNA and introns (put.) pre-msg 125 1764 pepA mRNA and introns (put.) pre-msg 127 1764 pepA mRNA and introns (put.) IVS 497 547 pepA intron A IVS 826 877 pepA intron B IVS 1127 1185 pepA intron C signal 54 58 TATA box BASE COUNT 387 a 515 c 445 g 417 t ORIGIN 1 gtcgacttgg atgatggaga gttgatcgag gtcaatgagg agaggcttgc aagtataaga 61 agagactgct cgaccagcag aatggatctt cttgttcatc aaccaagagt ccaaggcttc 121 tttgtctggt tctatctctt ctccgaactc tcttgcttga cattctcgtg gtcaaaatgg 181 tcgtcttcag caaaaccgct gccctcgttc tgggtctgtc ctccgccgtc tctgcggcgc 241 cggctcctac tcgcaagggc ttcaccatca accagattgc ccggcctgcc aacaagaccc 301 gcaccatcaa cctgccaggc atgtacgccc gttccctggc caagtttggc ggtacggtgc 361 cccagagcgt gaaggaggct gccagcaagg gtagtgccgt gaccacgccc cagaacaatg 421 acgaggagta cctgactccc gtcactgtcg gaaagtccac cctccatctg gactttgaca 481 ccggatctgc agatctgtaa gcttccctgc tcgggtgttc gggcaaatcg tgactaacct 541 ggactagctg ggtcttctcg gacgagctcc cttcctcgga gcagaccggt cacgatctgt 601 acacgcctag ctccagcgcg accaagctga gcggctacac ttgggacatc tcctacggtg 661 acggcagctc ggccagcgga gacgtgtacc gggatactgt cactgtcggc ggtgtcacca 721 ccaacaagca ggctgttgaa gcagccagca agatcagctc cgagttcgtt cagaacacgg 781 ccaatgacgg ccttttggga ctggccttta gctccatcaa cactggtgag tcaatcctac 841 atcagccggg ttgacctacc tgctgaccga tagacagtcc agccaaaggc gcagacaacc 901 ttcttcgaca ccgtcaagtc ccagctggac tctccccttt tcgccgtgca gctgaagcac 961 gacgcccccg gtgtttacga ctttggctac atcgatgact ccaagtacac cggttctatc 1021 acctacacgg atgccgatag ctcccagggt tactggggct tcagcaccga cggctacagt 1081 atcggtgacg gcagctccag ctccagcggc ttcagcgcca ttgctggtaa gaaccgcctt 1141 catttaacac acaacttgtc cacctcttta ctaactagtg tatagacacc ggtaccaccc 1201 tcatcctcct cgatgacgaa atcgtctccg cctactacga gcaggtttct ggcgcctcag 1261 gagagacgga agccggtggc tacgttttct cttgctcgac caacccccct gacttcactg 1321 tcgtgattgg cgactacaag gccgttgttc cgggcaagta catcaactac gctcccatct 1381 cgactggcag ctccacctgc tttggcggta tccagagcaa cagcggtctg ggactgtcca 1441 tcctgggtga tgttttcttg aagagccagt acgtggtctt caactctgag ggccctaagc 1501 tgggattcgc cgctcaggct tagattatcc actgaagtgg agtctatgat ctgctgattg 1561 atccctcgac gatgaactac atgtggaaat gcatagcaga cgagggtgat ggtgatgatg 1621 ttgatttgat gatgacccgt acatacttga tgaagctcgg tacatatgca atgtgactgt 1681 atctatgtga tgatatatgt atcatctcat aacttttggc tatgagtgca ggataacact 1741 gaaccagtag tagtactttc cact // LOCUS BSUPSIF2A 2758 bp ds-DNA BCT 27-JUL-1990 DEFINITION B.subtilis protein synthesis initiation factor 2 (infB) gene, complete cds. ACCESSION M34836 KEYWORDS protein synthesis initiation factor 2. SOURCE B.subtilis (strain RS410) DNA, clones lambda-JET[1,2],pUK, and pJET1. ORGANISM Bacillus subtilis Prokaryota; Bacteria; Firmicutes; Endospore-forming rods and cocci; Bacillaceae. REFERENCE 1 (bases 1 to 2758) AUTHORS Shazand,K., Tucker,J., Chiang,R., Stansmore,K., Sperling-Petersen,H.U., Grunberg-Manago,M., Rabinowitz,J.C. and Leighton,T. TITLE Isolation and molecular genetic characterization of the Bacillus subtilis gene (infB) encoding protein synthesis initiation factor 2 JOURNAL J. Bacteriol. 172, 2675-2687 (1990) STANDARD simple staff_entry FEATURES from to/span description pept 381 2531 protein synthesis initiation factor 2 (infB) binding 369 375 ribosomal binding site (put.) BASE COUNT 917 a 535 c 662 g 644 t ORIGIN 1 atcgatgacc agattttcga tgaattgctg gaactggcgg aaaaggtgaa aaaataaaat 61 gtctggaatg gaatggtttc ccttgctggg tctggccaat cgagctcgta aggtcgtgtc 121 aggcgaagac ttggtaataa aagaaatcag gaatgcgcgt gcaaagcttg tcctgcttac 181 agaggatgca tcatctaaca cagcaaaaaa agtaactgac aagtgcaatt attataaagt 241 cccttataaa aaagtcgaga gtcgcgcggt tcttggacgc tctatcggta aagaagcccg 301 tgtcgttgtc gccgtcactg accaaggttt tgcgaataag ctgatcagct tgctcgatta 361 atatttttgg gggtgaacga atggctaaaa tgagagtata cgaatatgca aaagcgttaa 421 atgtttcaag taaggaaatt ttgaccgcac tgaagaacat ggatttagaa gtgaataatc 481 acatggccat gcttgaagaa aaggccatta aaaagctaga tgccaaatat aaaaaaggcg 541 gcgcacgtgc taaatctcaa aagccagcag aaacgaacaa aaacaaacag ccacaagggg 601 ttaatcagca atcagctgga aatcaaccaa ataaaattcg agacggaaag aagaatgacg 661 tgcagaataa tcaatttaac aaaaacaaga agaataacaa caacaaaaaa aataaacgca 721 acaacaacaa taataaaaac caacatcagc aaaagcctgt aaagccgaaa aaagagcttc 781 ctgagaaaat tacattctct ggcactttaa cagttggcgc acttgctgaa gagcttggca 841 aagagccttc agaactcatt aaaaagctga tgcttcttgg cgtaatggca accattaacc 901 aagagcttga taaagacaca atcgaactca ttgcatcaga atatggtgtt gaaacagaag 961 aggtcattgt gcttgaagaa acagagctgg aaaaatacga agagcctgat aatgaagagg 1021 atcttgaaat tcgtcctcct gtcgtgacaa tcatgggcca cgttgaccat gggaaaacaa 1081 cgcttcttga cagcatccgt aaaacaaagg ttgttgaagg ggaagcaggc ggaatcactc 1141 agcatatcgg cgcttatcaa attgaagaga acggcaagaa aatcacgttc ttggatacac 1201 cgggccacgc cgcatttaca acaatgcgcg cacgcggtgc agaagtaact gatattacga 1261 ttcttgtcgt agctgccgat gacggtgtca tgccgcaaac agttgaagcg attaaccatg 1321 caaaagcagc agaggttcca atcatcgttg ctgtgaataa aatagataaa gaatccgcaa 1381 accctgaccg tgtaatgcaa gaactgacgg aatacggact tgtccctgaa gcttggggcg 1441 gagaaaccat ttttgtgccg ctttccgctt taactggaaa aggcattgat gagctcgttg 1501 aaatgatttt gcttgtcagt gaggtagagg aactgaaagc gaatccgaat cgtcaggcaa 1561 aaggaacggt tattgaagct gaactcgata aaggcagagg gtcagtcgcg acgttgctcg 1621 tacagactgg aacactgcat gtcggtgatc cgatcgtagt cggcaataca tttggccgtg 1681 tccgtgcaat ggtcaacgac attggccgcc gtgtgaaaac tgccggcccg tcaactccgg 1741 ttgaaattac cggtttgaat gatgtccctc aagcgggaga ccaattcctt gtctttaagg 1801 atgaaaaaac agctcgttct gtcggtgaag cccgtgcttc gaaacagctt gaagagcagc 1861 gcagcgataa agcgaagctc agtcttgatg atttatttga gcaaattaag caaggtgatg 1921 taaaagacat caacctcatc gtaaaagctg acgttcaagg atctgctgaa gctttaacgg 1981 ctgcacttca aaaaattgaa gtagaaggcg ttaaagtgaa aatcatccat acaggcgttg 2041 gtgcgattac tgaatcagac attatcttgg catctgcttc caatgcaatt gttatcgggt 2101 ttaatgtgag accggacgga aatgctaaga gtacggctga agctgaaaat gtagatattc 2161 gacttcaccg tatcatttac aaagtaatcg acgagattga agctgccatg aaaggtatgc 2221 ttgatcctga atatgaagaa aaagtaattg gtcaagtaga agtacgccaa acattcaaag 2281 tatctaaaat cggtacaatt gccggcggat atgttactga aggaaccatt acgcgcgaca 2341 gcggcctccg tttaattcgt gacggcgtcg tcatctttga aggcgaagta gatgttctga 2401 aacgctttaa agacgatgtg aaagaagttt cacaaggcta tgaatgtggt attacaatta 2461 agaaatacaa tgacattcgt gaaggtgaca tccttgaagc gtttgtcatg caagaaattg 2521 aaagaacgtg atcggatttg cggagtgtga atgcatcatt tatgatgcag gatcgctaaa 2581 agaaaagcgt gccgttctga agcggatttt aaccagggtt caaaacaagt tcaatgtttc 2641 gatttcggag attggctatc aggacacctg gcaaagaacc agcttcggaa tcgccgctgt 2701 ttcttcctct cgcgttcaaa cagaaaaaga actgcagcgc gtcctagcgt ttatcgat // LOCUS CIBRCRA 1845 bp ds-DNA BCT 27-JUL-1990 DEFINITION Plasmid Colicin Ib-P9 replication control region replicon initiator protein (repZ) gene, complete cds. ACCESSION M34837 KEYWORDS replicon initiator protein. SOURCE Plasmid Colicin Ib-P9 (from E.coli) DNA, clone pCH10-delta-1. ORGANISM Plasmid Colicin Ib-P9 Prokaryota; Bacteria. REFERENCE 1 (bases 1 to 1845) AUTHORS Hama,C., Takizawa,T., Moriwaki,H., Urasaki,Y. and Mizobuchi,K. TITLE Organization of the replication control region of plasmid Co1Ib-P9 JOURNAL J. Bacteriol. 172, 1983-1991 (1990) STANDARD simple staff_entry FEATURES from to/span description pept 455 1486 repZ protein mRNA 157 1762 repZ protein mRNA (alt.) mRNA 158 1762 repZ protein mRNA (alt.) mRNA 363 < 1 (c) inc RNA binding 442 448 ribosomal binding site (put.) signal 377 372 (c) inc -10 region signal 400 395 (c) inc -35 region mut 353 353 c in wild type; t in inc3 mut 374 374 t in wild type; c in inc2 mut 400 400 a in wild type; c in inc1 BASE COUNT 505 a 467 c 490 g 383 t ORIGIN 1 gatctaaagc agaaaaatct gcttttaaaa atagatttta tttttttgac gcaggtcaag 61 attaacttat tagaggtatc agtgaggagg cactggaaga gaagagatcg ttgtaatgct 121 tttcaaatta acgtaaagcg ggtatatttc ggctgttatt agctgtgcag agggtggcac 181 tctgtggagc aaagcggcga aagccggacg gcagaatgcg ccataaggca ttcaggagag 241 atggcatgta cgggcagtaa gtcagaagac tgaagatgtt ccggaagcca taaaaggaaa 301 acccccacta tctttcttac gaacttggcg gaaagacgaa agatagtggg ggcctcacag 361 aatacgggta aagtataatg aaaccgtacc agagattcaa ccctgtgcag tgtataaata 421 cacggcacaa tcgctccgcc ataagcgaca gcttgtggca ggtctgaaga atactccata 481 taacgcagta cactggagtc agttagcacc cgaagagcag atccgtttct gggaagacta 541 tgaagcggga agggcgacca ctttcctggt tgaaccggaa aggaagcgca cgaagcgtcg 601 tcgcggtgag cactccacca aacccaaatg cgaaaatccg tcctggtatc gtcctgagcg 661 ctataaagcg ctgaaagggc agctcgggca cgcctataac cgtctggtga aaaaggaccc 721 ggtgaccggc gagcagagcc tgcgcatgcg catgtcgcgg catccttttt atgtacagaa 781 acggacgttc gtcggtcgca aatacgcttt ccggccggaa aaacaacgtc tcctcgatgc 841 catctggccg gttctggtca gcttcagtga tgccggcaca cacaccgtgg ggatgagcgt 901 cactcggctg gctgaagaaa tcagcccgaa agacagcgag ggacatgtta ttccggaact 961 ggaagtgacg gtctcccgtc tttcccgtct gctggccgaa caggtgcgct ttggtgtgct 1021 gggtgtgtca gaggaaacaa tgtgggaccg tgaacatcgt cagcgtctgc cacgttacgt 1081 ctggataaca ccggcaggct ggcagatgct gggcgtcgac atggtaaaac ttcacgaaca 1141 gcagcagaaa cgactgcgtg aaagtgaaat ccgccagcag ctcattcggg aaggtgttct 1201 gcgtgaggat gaagatatct ccgttcatgc ggccagaaaa cgctggtatc tgcagcgcag 1261 ccaggatgcc ctgaagaagc gccgagaaaa agccgcagcc agcaagcgtg ccaaccgcct 1321 gaagaaatta cctgttgacc agcagattta tgagatggct gagtatctcc ggaagcgtct 1381 gccgccggat gaagcctatt tctgttccga tgaccatctg aagcgactgg ccatcaggga 1441 gttgcgtcag cttgaactga cgctggctgc cccgccaccg cactagacag catcattccc 1501 tcagcactga atcatcgcca gcccctccgg ggctttcggc gcagattccg ctcagcccaa 1561 aatccttagt agtcacctta aatcccctca gaggggcata tctgcccata aaaccacgca 1621 tcagtcatca gaacgtggcc acgttgtttc agttatccac ataaatccgc aaacaaagaa 1681 ttttaagaag ctgcaaacct gaaacagcaa acctgcaata tattcttaac cccatcattt 1741 aatcccctgc gttgcttcgc cgcagggaaa gtctttatct ctgaaaccac tgtgaacaaa 1801 tacaaaaagg ccttcgcttg cagcgggcaa ggccgcgccg gaatt // LOCUS ECOHEMBA 1085 bp ds-DNA BCT 27-JUL-1990 DEFINITION E.coli porphobilinogen synthase (hemB) gene, complete cds. ACCESSION M35121 KEYWORDS 5-aminolevulinic acid dehydratase; hemB gene; porphobilinogen synthase. SOURCE E.coli (strain K12) DNA. ORGANISM Escherichia coli Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Facultatively anaerobic rods; Enterobacteriaceae. REFERENCE 1 (bases 1 to 1085) AUTHORS Echelard,Y., Dymetryszyn,J., Drolet,M. and Sasarman,A. TITLE Nucleotide sequence of the hemB gene of Escherichia coli K12 JOURNAL Mol. Gen. Genet. 214, 503-508 (1988) STANDARD simple staff_entry FEATURES from to/span description pept 80 1054 porphobilinogen synthase (hemB) signal 15 20 hemB -35 region signal 41 46 hemB -10 region BASE COUNT 254 a 277 c 301 g 253 t ORIGIN 1 tcgcgacaac tttcgtaaaa catccctacc ctgcttcagg tatactatgc ccctcgattc 61 cacaaacatc aggcagacca tgacagactt aatccaacgc cctcgtcgcc tgcgcaaatc 121 tcctgcgctg cgcgtatgtt tgaagagaca acacttagcc ttaacgacct ggtgttgccg 181 atctttgttg aagaagaaat tgacggacta caaagccgtt gaagccatgc caggcgtgat 241 gcgcattcca gagaaacatc tggcacgcga aattgaacgc atcgccaacg ccggtattcg 301 ttccgtgatg acttttggca tctctcacca taccgatgaa accggcagcg atgcctggcg 361 ggaagatgga ctggtggcgc gtatgtcgcg catctgcaag cagaccgtgc cagaaatgat 421 cgttatgtca gacacctgct tctgtgaata cacttctcac ggtcactgcg gtgtgctgtg 481 cgagcatggc gtcgacaacg acgcgactct ggaaaattta ggcaagcaag ccgtggttgc 541 agctgctgca ggtgcagact tcatcgcccc ttccgccgcg atggacggcc aggtacaggc 601 gattcgtcag gcgctggacg ctgcgggatt taaagatacg gcgattatgt cgtattcgac 661 caagttcgcc tcctcctttt atggcccgtt ccgtgaagct gccggaagcg cattaaaagg 721 cgaccgcaaa agctatcaga tgaacccaat gaaccctcgt gaggcgattc gtgaatcact 781 gctggatgaa gcccagggcg cagactgcct gatggttaaa cctgctggag cgtacctcga 841 catcgtgcgt gagctgcgtg aacgtactga attgccgatt ggcgcgtatc aggtgagcgg 901 tgagtatgcg atgattaagt tcgccgcgct ggcgggtgct atagatgaag agaaagtcgt 961 gctcgaaagc ttaggttcga ttaagcgtgc gggtgcggat ctgattttca gctactttgc 1021 gctggatttg gctgagaaga agattctgcg ttaattttgt ttcctctcag aaagggggag 1081 ttaac // LOCUS ECOSTMA 1418 bp ds-DNA BCT 27-JUL-1990 DEFINITION E.coli F factor (stm) gene, complete cds. ACCESSION M35124 KEYWORDS F factor; stm gene. SOURCE E.coli DNA, clones pEG[157,159,175,176,220]. ORGANISM Escherichia coli Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Facultatively anaerobic rods; Enterobacteriaceae. REFERENCE 1 (bases 1 to 1418) AUTHORS Golub,E.I. and Panzer,H.A. TITLE The F factor of Escherichia coli carries a locus of stable plasmid inheritance stm, similar to the parB locus of plasmid RI JOURNAL Mol. Gen. Genet. 214, 353-357 (1988) STANDARD simple staff_entry FEATURES from to/span description pept 830 1009 F factor (stm) signal 1013 1039 pot. transcription termination signal signal 1074 1105 pot. transcription termination signal BASE COUNT 279 a 364 c 429 g 345 t 1 others ORIGIN 1 atcgctgtct gacctgcgat ttttcatgcc gtccctgacc gcagaagaac tgcacggcaa 61 ccgtcttcag tggctgtatg gcatcgatgt gctgattgaa acacagggcg aggtctgcct 121 ccttcccctt cccggtgatg cagcagagcg gtcgtttccg tcggtgcggt ttcgtgtcag 181 ggagcgcagt cgccataaat ccgcgctggt catgcagaag tacagccgcc agcaggcacg 241 ggaggcagag cagaaagccc gtgcgtacca ggcgctggtc gctcaggctg aaatcgaact 301 ggcattccat tcgccggaaa cagtcggaag ctggcatgca cgctggagtg accgggtggc 361 agagcatgac ctggaaaccc tgttctggca gtggggcgag cgttttccgt cactggccgg 421 aatggaacgc tggcagtggc aggatatgcc gttcgtggca ggttattgcg gaacgcagtc 481 ttgcagcaag agaggccggt cacgctgtcc gtgagatgag cgtgatgtgc cgaacaaact 541 ccggaggcag cgtgatgcgg gcaacaatca cacagattac ccgtaaacag cctgaatgag 601 cgggttattt tcaggaaaag tatgtgtggt cagcgtgcag ggatatggct atgatgtgcc 661 cggcgcttga ggctttctgc ctcatgacgt gaaggtggtt tgttgccgtg ttgtgtggca 721 gaaggacaaa agccccatag ttaatttttc attaacccac gaggcccctg catgcttaga 781 caacatcagg atagcctctt actgcgcttt cgacgaggag aagaaggcca tgaaactacc 841 acgcagctct cttgtctggt gtgtgttgat cgtgtgtctc acactgttga tattcactta 901 tctgacacga aaatcgctgt gcgagattcg ttacagagac ggatacaggg aggtngcggc 961 cggatacagg gaggtggcgg ctttcatggc ttacgaatcc ggtaagtagc aacctggagg 1021 cgggcgcagg cttttcaggg ctgatgctgg tctgactgca ctgatgcctt tataaagggg 1081 ctgctggttc gccggtggcc cctttctcct tgctggcatt gtacgggcat gaacaaccag 1141 actgccagga aggccgtttc cctctgggaa cggggaagaa cgtgcgcctg cggccgttct 1201 tttcccccgc cttctctgtt tatgacctgt cagaaatcac atcaccttcc tttgattatt 1261 ctctttttct ctgttctgac tctgattact gctgccctga ttcctgtctg ttttgttgtt 1321 ctcctggcag tgtcttttcc gggcgtcctg gcggggcggc tggcgtgaac ggagccagcg 1381 aagctgtctc tggcgggctt ccatccctga cgggaaac // LOCUS HUMGPIIBA 3333 bp ss-mRNA PRI 27-JUL-1990 DEFINITION Human platelet glycoprotein IIb (GPIIb) mRNA, complete cds. ACCESSION M34480 KEYWORDS platelet glycoprotein IIb. SOURCE Human megakaryocytes, cDNA to mRNA, clone IIb[3,4]. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 3333) AUTHORS Frachet,P., Uzan,G., Thevenon,D., Denarier,E., Prandini,M.H. and Marguerie,G. TITLE GPIIb and GPIIIa amino acid sequences deduced from human megakaryocyte cDNAs JOURNAL Mol. Biol. Rep. 14, 27-33 (1990) STANDARD simple staff_entry FEATURES from to/span description pept 33 3152 platelet glycoprotein IIb (GPIIb) mRNA 1 3333 GPIIb mRNA BASE COUNT 626 a 998 c 1040 g 669 t ORIGIN 1 attcctgcct gggaggttgt ggaagaagga agatggccag agctttgtgt ccactgcaag 61 ccctctggct tctggagtgg gtgctgctgc tcttgggagc ttgtgctgcc cctccagcct 121 gggccttgaa cctggaccca gtgcagctca ccttctatgc aggccccaat ggcagccagt 181 ttggattttc actggacttc cacaaggaca gccatgggag agtggccatc gtggtgggcg 241 ccccgcggac cctgggcccc agccaggagg agacgggcgg cgtgttcctg tgcccctgga 301 gggccgaggg cggccagtgc ccctcgctgc tctttgacct ccgtgatgag acccgaaatg 361 taggctccca aactttacaa accttcaagg cccgccaagg actgggggcg tcggtcgtca 421 gctggagcga cgtcattgtg gcctgcgccc cctggcagca ctggaacgtc ctagaaaaga 481 ctgaggaggc tgagaagacg cccgtaggta gctgcttttt ggctcagcca gagagcggcc 541 gccgcgccga gtactccccc tgtcgcggga acaccctgag ccgcatttac gtggaaaatg 601 attttagctg ggacaagcgt tactgtgaag cgggcttcag ctcggtggtc actcaggccg 661 gagagctggt gcttggggct cctggcggct attatttctt aggtctcctg gcccaggctc 721 cagttgcgga tattttctcg agttaccgcc caggcatcct tttgtggcac gtgtcctccc 781 agagcctctc ctttgactcc agcaacccag agtacttcga cggctactgg gggtactcgg 841 tggccgtggg cgagttcgac ggggatctca acactacaga atatgtcgtc ggtgccccca 901 cttggagctg gaccctggga gcggtggaaa ttttggattc ctactaccag aggctgcatc 961 ggctgcgcgc agagcagatg gcgtcgtatt ttgggcattc agtcgctgtc actgacgtca 1021 acggggatgg gaggcatgat ctgctggtgg gcgctccact gtatatggac agccgggcag 1081 accgaaaact ggccgaagtg gggcgtgtgt atttgttcct gcagccgcga ggcccccacg 1141 cgctgggtgc ccccagcctc ctgctgactg gcacacagct ctatgggcga ttcggctctg 1201 ccatcgcacc cctgggcgac ctcgaccggg atggctacaa tgacattgca gtggctgccc 1261 cctacggggg tcccagtggc cggggccaag tgctggtgtt cctgggtcag agtgaggggc 1321 tgaggtcacg tccctcccag gtcctggaca gccccttccc cacaggctct gcctttggct 1381 tctcccttcg aggtgccgta gacatcgatg acaacggata cccagacctg atcgtgggag 1441 cttacggggc caaccaggtg gctgtgtaca gagctcagcc agtggtgaag gcctctgtcc 1501 agctactggt gcaagattca ctgaatcctg ctgtgaagag ctgtgtccta cctcagacca 1561 agacacccgt gagctgcttc aacatccaga tgtgtgttgg agccactggg cacaacattc 1621 ctcagaagct atccctaaat gccgagctgc agctggaccg gcagaagccc cgccagggcc 1681 ggcgggtgct gctgctgggc tctcaacagg caggcaccac cctggacctg gatctgggcg 1741 gaaagcacag ccccatctgc cacaccacca tggccttcct tcgagatgag gcagacttcc 1801 gggacaagct gagccccatt gtgctcagcc tcaatgtgtc cctaccgccc acggaggctg 1861 gaatggcccc tgctgtcgtg ctgcatggag acacccatgt gcaggagcag acacgaatcg 1921 tcctggactg tggggaagat gacgtatgtg tgccccagct tcagctcact gccagcgtga 1981 cgggctcccc gctcctagtt ggggcagata atgtcctgga gctgcagatg gacgcagcca 2041 acgagggcga gggggcctat gaagcagagc tggcggtgca cctgccccag ggcgcccact 2101 acatgcgggc cctaagcaat gtcgagggct ttgagagact catctgtaat cagaagaagg 2161 agaatgagac cagggtggtg ctgtgtgagc tgggcaaccc catgaagaag aacgcccaga 2221 taggaatcgc gatgttggtg agcgtgggga atctggaaga ggctggggag tctgtgtcct 2281 tccagctgca gatacggagc aagaacagcc agaatccaaa cagcaagatt gtgctgctgg 2341 acgtgccggt ccgggcagag gcccaagtgg agctgcgagg gaactccttt ccagcctccc 2401 tggtggtggc agcagaagaa ggtgagaggg agcagaacag cttggacagc tggggaccca 2461 aagtggagca cacctatgag ctccacaaca atggccctgg gactgtgaat ggtcttcacc 2521 tcagcatcca ccttccggga cagtcccagc cctccgacct gctctacatc ctggatatac 2581 agccccaggg gggccttcag tgcttcccac agcctcctgt caaccctctc aaggtggact 2641 gggggctgcc catccccagc ccctccccca ttcacccggc ccatcacaag cgggatcgca 2701 gacagatctt cctgccagag cccgagcagc cctcgaggct tcaggatcca gttctcgtaa 2761 gctgcgactc ggcgccctgt actgtggtgc agtgtgacct gcaggagatg gcgcgcgggc 2821 agcgggccat ggtcacggtg ctggccttcc tgtggctgcc cagcctctac cagaggcctc 2881 tggatcagtt tgtgctgcag tcgcacgcat ggttcaacgt gtcctccctc ccctatgcgg 2941 tgcccccgct cagcctgccc cgaggggaag ctcaggtgtg gacacagctg ctccgggcct 3001 tggaggagag ggccattcca atctggtggg tgctggtggg tgtgctgggt ggcctgctgc 3061 tgctcaccat cctggtcctg gccatgtgga aggtcggctt cttcaagcgg aaccggcaca 3121 ccctggaaga agatgatgaa gagggggagt gatggtgcag cctacactat tctagcagga 3181 gggttgggcg tgctacctgc accgcccctt ctccaacaag ttgcctccaa gctttgggtt 3241 ggagctgttc cattgggtcc tcttggtgtc gtttccctcc caacagagct gggctacccc 3301 ccctcctgct gcctaataaa gagactgagc cct // LOCUS HUMGPIIIAA 4487 bp ss-mRNA PRI 27-JUL-1990 DEFINITION Human platelet glycoprotein IIIa (GPIIIa) mRNA, complete cds. ACCESSION M35999 KEYWORDS platelet glycoprotein IIIa. SOURCE Human megakaryocytes, cDNA to mRNA, clone IIIa[1,2,3]. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 4487) AUTHORS Frachet,P., Uzan,G., Thevenon,D., Denarier,E., Prandini,M.H. and Marguerie,G. TITLE GPIIb and GPIIIa amino acid sequences deduced from human megakaryocyte cDNAs JOURNAL Mol. Biol. Rep. 14, 27-33 (1990) STANDARD simple staff_entry FEATURES from to/span description pept 19 2385 plate glycoprotein IIIa (GPIIIa) mRNA < 1 2603 GPIIIa mRNA BASE COUNT 1046 a 1090 c 1186 g 1165 t ORIGIN 1 ccgcgggagg cggacgagat gcgagcgcgg ccgcggcccc ggccgctctg ggtgactgtg 61 ctggcgctgg gggcgctggc gggcgttggc gtaggagggc ccaacatctg taccacgcga 121 ggtgtgagct cctgccagca gtgcctggct gtgagcccca tgtgtgcctg gtgctctgat 181 gaggccctgc ctctgggctc acctcgctgt gacctgaagg agaatctgct gaaggataac 241 tgtgccccag aatccatcga gttcccagtg agtgaggccc gagtactaga ggacaggccc 301 ctcagcgaca agggctctgg agacagctcc caggtcactc aagtcagtcc ccagaggatt 361 gcactccggc tccggccaga tgattcgaag aatttctcca tccaagtgcg gcaggtggag 421 gattaccctg tggacatcta ctacttgatg gacctgtctt actccatgaa ggatgatctg 481 tggagcatcc agaacctggg taccaagctg gccacccaga tgcgaaagct caccagtaac 541 ctgcggattg gcttcggggc atttgtggac aagcctgtgt caccatacat gtatatctcc 601 ccaccagagg ccctcgaaaa cccctgctat gatatgaaga ccacctgctt gcccatgttt 661 ggctacaaac acgtgctgac gctaactgac caggtgaccc gcttcaatga ggaagtgaag 721 aagcagagtg tgtcacggaa ccgagatgcc ccagagggtg gctttgatgc catcatgcag 781 gctacagtct gtgatgaaaa gattggctgg aggaatgatg catcccactt gctggtgttt 841 accactgatg ccaagactca tatagcattg gacggaaggc tggcaggcat tgtccagcct 901 aatgacgggc agtgtcatgt tggtagtgac aatcattact ctgcctccac taccatggat 961 tatccctctt tggggctgat gactgagaag ctatcccaga aaaacatcaa tttgatcttt 1021 gcagtgactg aaaatgtagt caatctctat cagaactata gtgagctcat cccagggacc 1081 acagttgggg ttctgtccat ggattccagc aatgtcctcc agctcattgt tgatgcttat 1141 gggaaaatcc gttctaaagt cgagctggaa gtgcgtgacc tccctgaaga gttgtctcta 1201 tccttcaatg ccacctgcct caacaatgag gtcatccctg gcctcaagtc ttgtatggga 1261 ctcaagattg gagacacggt gagcttcagc attgaggcca aggtgcgagg ctgtccccag 1321 gagaaggaga agtcctttac cataaagccc gtgggcttca aggacagcct gatcgtccag 1381 gtcacctttg attgtgactg tgcctgccag gcccaagctg aacctaatag ccatcgctgc 1441 aacaatggca atgggacctt tgagtgtggg gtatgccgtt gtgggcctgg ctggctggga 1501 tcccagtgtg agtgctcaga ggaggactat cgcccttccc agcaggacga gtgcagcccc 1561 cgggagggtc agcccgtctg cagccagcgg ggcgagtgcc tctgtggtca atgtgtctgc 1621 cacagcagtg actttggcaa gatcacgggc aagtactgcg agtgtgacga cttctcctgt 1681 gtccgctaca agggggagat gtgctcaggc catggccagt gcagctgtgg ggactgcctg 1741 tgtgactccg actggaccgg ctactactgc aactgtacca cgcgtactga cacctgcatg 1801 tccagcaatg ggctgctgtg cagcggccgc ggcaagtgtg aatgtggcag ctgtgtctgt 1861 atccagccgg gctcctatgg ggacacctgt gagaagtgcc ccacctgccc agatgcctgc 1921 acctttaaga aagaatgtgt ggagtgtaag aagtttgacc ggggagccct acatgacgaa 1981 aatacctgca accgttactg ccgtgacgag attgagtcag tgaaagagct taaggacact 2041 ggcaaggatg cagtgaattg tacctataag aatgaggatg actgtgtcgt cagattccag 2101 tactatgaag attctagtgg aaagtccatc ctgtatgtgg tagaagagcc agagtgtccc 2161 aagggccctg acatcctggt ggtcctgctc tcagtgatgg gggccattct gctcattggc 2221 cttgccgccc tgctcatctg gaaactcctc atcaccatcc acgaccgaaa agaattcgct 2281 aaatttgagg aagaacgcgc cagagcaaaa tgggacacag ccaacaaccc actgtataaa 2341 gaggccacgt ctaccttcac caatatcacg taccggggca cttaatgata agcagtcatc 2401 ctcagatcat tatcagcctg tgccaggatt gcaggagtcc ctgccatcat gtttacagag 2461 gacagtattt gtggggaggg atttcggggc tcagagtggg gtaggttggg agaatgtcag 2521 tatgtggaag tgtgggtctg tgtgtgtgta tgtgggggtc tgtgtgttta tgtgtgtgtg 2581 ttgtgtgtgg gagtgtgtaa tttaaaattg tgatgtgtcc tgataagctg agctccttag 2641 cctttgtccc agaatgcctc ctgcagggat tcttcctgct tagcttgagg gtgactatgg 2701 agctgagcag gtgttcttca ttacctcagt gagaagccag ctttcctcat caggccattg 2761 tccctgaaga gaagggcagg gctgaggcct ctcattccag aggaagggac accaagcctt 2821 ggctctaccc tgagttcata aatttatggt tctcaggcct gactctcagc agctatggta 2881 ggaactgctg gcttggcagc ccgggtcatc tgtacctctg cctcctttcc cctccctcag 2941 gccgaaggag gagtcaggga gagctgaact attagagctg cctgtgcctt ttgccatccc 3001 ctcaacccag ctatggttct ctcgcaaggg aagtccttgc aagctaattc tttgacctgt 3061 tgggagtgag gatgtctggg ccactcaggg gtcattcatg gcctggggga tgtaccagca 3121 tctcccagtt cataatcaca acccttcaaa gatttgcctt attggcagct ctactctgga 3181 ggtttgttta gaagaagtgt gtcaccctta ggccagcacc atctctttac ctcctaattc 3241 cacaccctca ctgctgtaga catttgctat gacctgggga tgtctctcat gaccaaatgc 3301 ttttcctcaa agggagagag tgctattgta gagccagagg tctggcccta tgcttccggc 3361 ctcctgtccc tcatccatag cacctccaca tacctggccc tgagccttgg tgtgctgtat 3421 ccatccatgg ggctgattgt atttaccttc tacctcttgg ctgccttgtg aaggaattat 3481 tcccatgagt tggctgggaa taagtgccag gatggaatga tgggtcagtt gtatcagcac 3541 gtgtggcctg ttcttctatg ggttacaacc tcatttaact cagtctttaa tctgagaggc 3601 cacagtgcaa ttttatttta tttttctcat gatgaggttt tcttaactta aaagaacatg 3661 tatataaaca tgcttgcatt atatttgtaa atttatgtgt atggcaaaga aggagagcat 3721 aggaaaccac acagacttgg gcagggtaca gacactccca cttggcatca ttcacagcaa 3781 gtcactggcc agtggctgga tctgtgaggg gctctctcat gatagaaggc tatggggata 3841 gatgtgtgga cacattggac ctttcctgag gaagagggac tgttcttttg tcccagaaaa 3901 gcagtggctc cattggtgtt gacatacatc caacattaaa agccaccccc aaatgcccaa 3961 gaaaaaaaga aagacttatc aacatttgtt ccatgagcag aaaactggag ctctggcctc 4021 agtgttacag ctaaataatc tttaattaag gcaagtcact ttcttcttct taaagctgtt 4081 tctagtttga gaaatgatgg gattttagca gccagtcttg aaggtctctt tcagtatcaa 4141 cattctaaga tgctgggact tactgtgtca tcaaatgtgc ggttaagatt ctctgggata 4201 ttgatactgt ttgtgttttt agttgggaga tctgagagac ctggctttgg caagagcaga 4261 tgtcattcca tatcaccttt ctcaatgaaa gtctcattct atcctctctc caaacccgtt 4321 ttccaacatt tgttaatagt tacgtctctc ctgatgtagc acttaagctt catttagtta 4381 ttatttcttt cttcactttg cacacatttg catccacata ttagggaagg aataagtagc 4441 tgcaaactat ctattcctgt attattgtgt taacattgag ataaacc // LOCUS MUSCDX1A 1597 bp ss-mRNA ROD 27-JUL-1990 DEFINITION Mouse Cdx-1 homeobox protein mRNA, complete cds. ACCESSION M37163 M35125 KEYWORDS Cdx-1 homeobox protein. SOURCE Mouse 8.5 day embryo, cDNA to mRNA, clone BH8. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 1597) AUTHORS Duprey,P., Chowdhury,K., Dressler,G.R., Balling,R., Simon,D., Guenet,J.-L. and Gruss,P. TITLE A mouse gene homologous to the Drosophila gene caudal is expressed in epithelial cells from the embryonic intestine JOURNAL Genes Dev. 2, 1647-1654 (1988) STANDARD simple staff_entry FEATURES from to/span description pept 158 736 Cdx-1 homeobox protein site 389 571 Cdx-1 homeobox BASE COUNT 326 a 523 c 444 g 304 t ORIGIN 1 gccgcgcccg ctgggccctc cgacctatgc gccccctggc ccggcgcccg cgcccgcagt 61 accccgactt cgcgggttac acgcacgtgg agccggcgcc cgcgccctcc gacctgggct 121 gcgcttccct gcgcccaagg acgactgggc agctgctatg gcccgggccc cacggcctca 181 gcccgcagcc cggccccgct ggccttcggg ccctccggac tttagcccgg tgcccgcgcc 241 tcccgggcct ggtcccggca tcctagcgca gtccctcggg gctccgggcg caccgtcctc 301 gccaggagcg ccaaggcgga cgccctacga atggatgcgg cgcagcgtgg cggctgcagg 361 cggctgtggc agcggtaaga cccgaaccaa ggacaagtca cgtgtggtct acacagacca 421 ccaacgccta gagctggaaa aggagtttca ctacagccgg tacatcacta tccggcgcaa 481 gtccgagctg gctgctaacc tggggctcac agagcggcag gtaaagatct ggttccagaa 541 ccgccgggcc aaggagcgca aagtaaacaa gaagaaacag cagcagcagc agcccctgcc 601 tcccacacag ctgcccctgc ccctggatgg cactcccaca ccatcagggc cacccctagg 661 aagtctatgc cctactaatg ctggccttct gggcaccccc tccccagtgc ccgtcaagga 721 ggagtttcta ccctagcccc ttgcagcctg gggtctaggg atctagggac ttgaatgctg 781 ggcagctggc cttttcgggg cccaaggacg tgttctgagt gcagccctgc cttggccctt 841 tggggtcact gtggacaaac tgcctaccta ggacaagtta gcttgccctc ttcctgcctt 901 ccgttggctg ggctgtgtgg tgagcctgtt ggaccaagga gctcttccag ctcttgtgtc 961 ctagggcaca cagggtgatg gggaagccca attagaagtc tcaatcacct ggacatttcc 1021 caaagattca gaggctcaac gagctgtcaa gacaagattg aggcttcagc gtctcctcca 1081 agggttccag agtgaggtgg gaggctggta tctggaccag actggctctg gagagtaggc 1141 ggccattgaa acaaggtggt acaccacagt ctagactccc atgtgaaggg aagatgctat 1201 ctgcccacct gtctcctctt ccagcctcac tttcacctgc cctagcaccc catgctctgg 1261 ccctccccag cctggagcta caaagccatc aggactgaca tgaggtagag gctgggcaat 1321 tggccactgc tcttgagaga gaactcgccc ccaccttagt gtccagggag gcctgcggat 1381 ggagagaagg cactatgagt ctacctacct gcacatgccc accttcccac aggctgccca 1441 tcacacccca tttagccccc acaccatggg ggactcgcag ctgtacaggg cttatgtagg 1501 agtgtaatct cttggatgca gcttcagaaa taagttttcc cccttttaaa caatttataa 1561 aaatcataca aggcattaaa gaatttttaa aaagttt // LOCUS MUSHSP70B 3518 bp ds-DNA ROD 27-JUL-1990 DEFINITION Mouse heat shock protein 70.1 (hsp70.1) gene, complete cds. ACCESSION M35021 KEYWORDS heat shock protein 70.1. SOURCE Mouse (strain AJ) kidney DNA, clone pM[1.2,2.3]. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 3518) AUTHORS Hunt,C. and Calderwood,S. TITLE Characterization and sequence of a mouse hsp70 gene and its expression in mouse cell lines JOURNAL Gene 87, 199-204 (1990) STANDARD simple staff_entry FEATURES from to/span description pept 806 2734 heat shock protein 70.1 (hsp70.1) signal 507 512 inverted CCAAT box signal 547 552 TATA box BASE COUNT 827 a 993 c 1014 g 684 t ORIGIN 1 gatctcttct atttccctat tcaaacctaa aatgaagagg gagggggaga catggacaag 61 caagcattcc acaggcgccc ctgcccaacg ctgtcactca aaccaggacc caatcacaga 121 ctttttagcc aagccttatc ccgcctctct tgagaaactt tctgcgtccg ccatcctgta 181 ggaaggattt gtacacttta aactccctcc ctggtctgag tcccacactc tcaccaccca 241 gcaccttcag gagctgaccc ttaacagctt cacccacagg gaccccgaag ttgcgtcgcc 301 tccgcaacag tgtcaatagc agcaccagca cttccccaca ccctccccct caggaatccg 361 tactctctag cgaaccccag aaacctctgg agagttctgg acaagggcgg aacccacaac 421 tccgattact caagggaggc ggggaagctc caccagacgc gaaactgctg gaagattcct 481 ggccccaagg cctcctccgg ctcgctgatt ggcccagcgg agagtgggcg gggccggtga 541 agactcctta aaggcgcagg gcggcgagca gggcaccaga cgctgacagc tactcagaat 601 caaatctggt tccatccaga gacaagcgaa gacaagagaa gcagagcgag cggcgcgttc 661 ccgatcctcg gccaggacca gccttcccca gagcatccac gccgcggagc gcaaccttcc 721 caggagcatc cctgccgcgg agcgcaactt tccccggagc atccacgccg cggagcgcag 781 ccttccagaa gcagagcgcg gcgccatggc caagaacacg gcgatcggca tcgacctggg 841 caccacctac tcgtgcgtgg gcgtgttcca gcacggcaag gtggagatca tcgccaacga 901 ccagggcaac cgcacgaccc ccagctacgt ggccttcacc gacaccgagc gcctcatcgg 961 ggacgccgcc aagaaccagg tggcgctgaa cccgcagaac accgtgttcg acgcgaagcg 1021 gctgatcggc cgcaagttcg gcgatgcggt ggtgcagtcc gacatgaagc actggccctt 1081 ccaggtggtg aacgacggcg acaagcccaa ggtgcaggtg aactacaagg gcgagagccg 1141 gtcgttcttc ccggaggaga tctcgtccat ggtgctgacg aagatgaagg agatcgctga 1201 ggcgtacctg ggccacccgg tgaccaacgc ggtgatcacg gtgcccgcct acttcaacga 1261 ctctcagcgg caggccacca aggacgcggg cgtgatcgcc ggtctaaacg tgctgcggat 1321 catcaacgag cccacggcgg ccgccatcgc ctacgggctg gaccggaccg gcaagggcga 1381 gcgcaacgtg ctcatcttcg acctgggggg cggcacgttc gacgtgtcca tcctgacgat 1441 cgacgacggc atcttcgagg tgaaggccac ggcgggcgac acgcacctgg gaggggagga 1501 cttcgacaac cggctggtga gccacttcgt ggaggagttc aagaggaagc acaagaagga 1561 catcagccag aacaagcgcg cggtgcggcg gctgcgcacg gcgtgtgaga gggccaagag 1621 gacgctgtcg tccagcaccc aggccagcct ggagatcgac tctctgttcg agggcatcga 1681 cttctacaca tccatcacgc gggcgcggtt cgaagagctg tgctcggacc tgttccgcgg 1741 cacgctggag cccgtggaga aggccctgcg cgacgccaag atggacaagg cgcagatcca 1801 cgacctggtg ctggtgggcg gctcgacgcg catccccaag gtgcagaagc tgctgcagga 1861 cttcttcaac gggcgcgacc tgaacaagag catcaacccg gacgaggcgg tggcctacgg 1921 ggcggcggtg caggcggcca tcctgatggg ggacaagtcg gagaacgtgc aggacctgct 1981 gctgctggac gtggcgccgc tgtcgctggg cctggagact gcgggcggcg tgatgacggc 2041 gctcatcaag cgcaactcca ccatccccac caagcagacg cagaccttca ccacctactc 2101 ggacaaccag cccggggtgc tgatccaggt gtacgagggc gagagggcca tgacgcgcga 2161 caacaacctg ctggggcgct tcgagctgag cggcatcccg ccggcgccca ggggcgtgcc 2221 gcagatcgag gtgaccttcg acatcgacgc caacggcatc ctgaacgtca cggccaccga 2281 caagagcacc ggcaaggcca acaagatcac catcaccaac gacaagggcc gcctgagcaa 2341 ggaggagatc gagcgcatgg tgcaggaggc cgagcgctac aaggccgagg acgaggtgca 2401 gcgcgacagg gtggccgcca agaacgcgct cgagtcctat gccttcaaca tgaagagcgc 2461 cgtggaggac gagggtctca agggcaagct cagcgaggct gacaagaaga aggtgctgga 2521 caagtgccag gaggtcatct cctggctgga ctccaacacg ctggccgaca aggaggagtt 2581 cgtgcacaag cgggaggagc tggagcgggt gtgcagcccc atcatcagtg ggctgtacca 2641 gggtgcgggt gctcctgggg ctgggggctt cggggcccag gcgccgccga aaggagcctc 2701 tggctcagga cccaccatcg aggaggtgga ttagaggcct ctgctggctc tcccggtgtg 2761 gtctagaaaa cagactcttt gcacttgata gctgcttggg caccgattac tgtcaaggtt 2821 atttaaagtc ttcttcatgg ttcagtttaa agttacagtc tttcttaagg taattgcgtt 2881 gactgttaaa ttttgtatgc atatatatat atatatatat atatatatat atattcaaat 2941 atattcaaag taatgttggg agcagcactg tgcactgtac caggggatta tgttttatag 3001 ctaatgatgt gtaaagtcta aagatttttt tgtaattttt atatcagtgt tccagtagcc 3061 tgggaagaca tatagtctag ctgcccagtt ccctggagat ggtcatctct aagacaaagt 3121 gtcttaaaca aacgtcttgg cactgtgtac tacataactt tactcttttg tacttaaaac 3181 tttatctgct tgtccatgtt aaggttttgt ggtataacca gtatgttctt tgcatttaat 3241 ctaagtaggt taaagatggt gtatccttcc tgcatacatg tctacactgc caccctgtgt 3301 acattttttt ctttgcatca ctacaaacta atgaaaaaaa cttttatgac ttaaatattc 3361 aaaataaaag gttacaagta tattttgtct gtttgtatgt tggaagggct aatggattct 3421 gggcttctgt ggatttctta agtttttttt aagatttatt attatatgtg aacacattgt 3481 agctatcttc agacacacca gaaaagggca tcagatct // LOCUS RDVS10 1319 bp ds-RNA VRL 27-JUL-1990 DEFINITION Rice dwarf virus segment S10 mRNA, encoding an ORF, complete cds. ACCESSION D00241 KEYWORDS S10 RNA. SOURCE Rice dwarf virus, cDNA to viral RNA. ORGANISM Rice dwarf virus Viridae; ds-RNA nonenveloped viruses; Reoviridae; Fijivirus. REFERENCE 1 (bases 1 to 1319) AUTHORS Omura,T., Minobe,Y. and Tsuchizaki,T. TITLE Nucleotide sequence of segment S10 of the rice dwarf virus genome JOURNAL J. Gen. Virol. 69, 227-231 (1988) STANDARD simple automatic COMMENT Three contiguous termination codons were detected at the third triplet downstream from the first stop codon. Corresponds to DDBJ (rel3.0) entry (D00241) dated 20-JUN-1988 FEATURES from to/span description pept 27 1088 ORF BASE COUNT 354 a 279 c 284 g 402 t ORIGIN 5' end of the plus-sense strand of segment S10. 1 ggtaaacttg cgcctttctg acgaacatgg aagtagacac tgctacgttt gttcggcttc 61 atcatgagct cctttgtgct cacgaaggac caagtattat ttccaagttt gatgcaatta 121 aaaaagtcaa gcttggtaca cttgctaatc aatctggtgg tgctaataac atcaccgaag 181 ctttcttggc taagcttcga aattttgaaa gaaagtccga agcctatttg gcatcagact 241 tagctgaacg tgaattgact agggatacgc acaaggccat tgtgtttgta accaagtctg 301 tattgttagg agggaaaagt ttgaaagact tgctacctta tggagtgata gtgtgtgctt 361 ttatctttat ccctgagact gcttctgtct tggacaacgt ccctgtcatg atcgggaacc 421 aaaaacgacc attgacagta gctttgataa aatacatcgc taaatctttg aattgcgatt 481 tagttggtga ctcatatgat actttttatt actgcaattc gtctgcttat ggtaagaatc 541 tgatttcagt atcagacaat gatttttcta atccccaaag ggctctcctc tcagtgggag 601 atttatgtta tcaagcagcg cgttcactcc acgttgcagc agctaattat ataaggatat 661 ttgaccgcat gcctcctggc tttcagccgt caaaacacct tttccgtatc ataggtgtac 721 tagatatgga aactttgaag actatggtga catcaaacat tgctcgtgag ccaggcatgt 781 tttgtcatga taatgttaaa gatgtactgc accgtatagg tgtttactct ccgaaccacc 841 acttttccgc agtcatcttg tggagggggt gggcttccac gtatgcatat atgtttaacc 901 aagaacaatt aaatatgcta tcaggaacat ctggtttagc tggagacttt ggcaagtaca 961 aattaacata tggatccact ttcgatgaag gtgttattca tgtgcaatat caatttgtta 1021 ctccagaagt cgtccgcaag cgaaatatat acccggatct gtctgccctt aaaggcggca 1081 gttcctgatc gcactgatga tagtgctttt catttatcaa aatattacgg tggggtcatc 1141 accatcgtgt aatacgtcga cggattgggt gaattatccg tcttgcaact gctgaacaac 1201 ccaaccagtt gatccccatg ttcaaaaact gcccgagctt cccataggtg ggttagagtc 1261 gtgtgcgcac gcacttagtt ctactaatgc gtgcccccac actgcgcagg gattctgat // LOCUS RDVS10A 1320 bp ds-RNA VRL 27-JUL-1990 DEFINITION Rice dwarf virus segment 10 mRNA, encoding an ORF, complete cds. ACCESSION M35118 KEYWORDS . SOURCE Rice dwarf virus, cDNA to viral RNA, clones pRD[545,112,367]. ORGANISM Rice dwarf virus Viridae; ds-RNA nonenveloped viruses; Reoviridae; Fijivirus. REFERENCE 1 (bases 1 to 1320) AUTHORS Uyeda,I., Matsumura,T., Sano,T., Ohshima,K. and Shikata,E. TITLE Nucleotide sequence of rice dwarf virus genome segment 10 JOURNAL Proc. Jpn. Acad. 63, 227-230 (1987) STANDARD simple staff_entry FEATURES from to/span description pept 27 1088 ORF BASE COUNT 357 a 280 c 282 g 401 t ORIGIN 1 ggtaaacttg cgcctttctg acgaacatgg aagtagacac tgctacgttt gttcggcttc 61 atcatgagct cctttctgct cacgaaggac caagtattat ttccaagttt gatgcaatta 121 aaaaagtcaa gcttggtaca cttgctaatc aatctggtgg tgctaataac atcaccgaag 181 ctttcttggc taagcttcga aattttgaaa gaaagtccga agcctatttg gcatcagact 241 tagctgaacg tgaattgact agggatacgc acaaggccat tgtgtttgta accaagtctg 301 tattgttagg agggaaaagt ttgaaagact tgctacctta tggagtgata gtgtgtgctt 361 ttatctttat ccctgaaact gcttctgtct tggacaacgt ccctgtcatg atcgggaacc 421 aaaaacgacc attgacagta gctttgataa aatacatcgc taaatctttg aattgcgatt 481 tagttggtga ctcatatgat actttttatt actgcaattc gtctgcttat ggtaagaatc 541 tgatttcagt atcagacaat gatttttcta atccccaaag ggctctcctc tcagtgggag 601 atttatgtta tcaagcagcg cgttcactcc acgttgcagc agctaattat ataaggatat 661 ttgaccgcat gcctcctggc tttcagccgt caaaacacct tttccgtatc ataggtgtac 721 tagatatgga aactttgaag actatggtga catcaaacat tgctcgtgag ccaggcatgt 781 tttgtcatga taatgttaaa gatgtactgc accgtatagg tgtttactct ccgaaccacc 841 acttttccgc agtcatcttg tggagggggt gggcttccac gtatgcatat atgtttaacc 901 aagaacaatt aaatatgcta tcaggaacat ctggtttagc tggagacttt ggcaagtaca 961 aattaacata tggatccact ttcgatgaag gtgttattca tgtgcaatat caatttgtta 1021 ctccagaagt cgtccgcaag cgaaatatat acccggatct gtctgccctt aaaggcggca 1081 gttcctaatc gcactgatga tagtgcttat catttatcaa aatattacgg tggggtcatc 1141 accatcgtgt aatacgtcga cggattgggt gaattatccg tcttgcaact gctgaacaac 1201 ccaaccagtt gatccccatg ttcaaaaact gcccgagctt cccatagggt gggttagagt 1261 cgtgtgcgca cgccacttag ttctactaat gcgtgccccc acatgcgcag ggattctgat // LOCUS SRVAGSM 3651 bp ss-RNA VRL 27-JUL-1990 DEFINITION Sapporo rat virus mRNA M segment, encoding an ORF, complete cds. ACCESSION M34882 KEYWORDS . SOURCE Sapporo rat virus (strain SR-11), cDNA to viral RNA, passed in Vero E6 cell line C1008. ORGANISM Sapporo rat virus Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses; Bunyaviridae; Hantavirus. REFERENCE 1 (bases 1 to 3651) AUTHORS Arikawa,J., Lapenotiere,H.F., Iacono-Connors,L., Wang,M. and Schmaljohn,C.S. TITLE Coding properties of the S and the M genome segments of Sapporo rat virus: Comparison to other causative agents of hemorrhagic fever with renal syndrome JOURNAL Virology 176, 114-125 (1990) STANDARD simple staff_entry FEATURES from to/span description pept 47 3448 M segment ORF BASE COUNT 1119 a 692 c 755 g 1085 t ORIGIN 1 tagtagtaga ctccgcaaga aacagcagtt aaagaacaat aggatcatgt ggagtttgct 61 attactggcc gctttagttg gccaaggctt tgcattaaaa aatgtatttg acatgagaat 121 tcagttgccc cactcagtca actttgggga aacaagtgtg tcaggctata cagaatttcc 181 cccactctca ttacaggagg cagaacagct agtgccagag agctcatgca acatggacaa 241 ccaccagtca ctctcaacaa taaataaatt aaccaaggtc atatggcgga aaaaagcaaa 301 tcaggaatca gcaaaccaga attcatttga agttgtggaa agtgaagtca gctttaaagg 361 gttgtgtatg ttaaagcata gaatggttga agaatcatat agaaatagga gatcagtaat 421 ctgttatgat ctagcctgta atagtacatt ctgtaaacca actgtttata tgattgttcc 481 tatacatgct tgcaacatga tgaaaagctg tttgattggc cttggcccct acagaatcca 541 ggttgtctat gaaaggacat actgcactac gggtatattg acagaaggaa aatgctttgt 601 ccctgacaag gctgttgtca gtgcattgaa aagaggcatg tatgctatag caagcataga 661 gacaatctgc ttttttattc atcagaaagg gaatacatat aagatagtga ctgccattac 721 atcagcaatg ggctccaaat gtaataatac agatactaaa gttcaaggat attatatctg 781 tattattggt ggaaactccg cccctgtata tgcccctgct ggtgaagact tcagagcaat 841 ggaggttttt tctgggatta ttacatcacc acatggagaa gaccatgacc tacccggcga 901 agaaatcgca acgtaccaga tttcagggca gatagaggca aaaatccctc atacagtgag 961 ctccaaaaac ttaaaattga ctgcttttgc aggtattcca tcatactcat caactagtat 1021 attggctgct tcagaagatg gtcgtttcat atttagtcct ggtttatttc ctaacctaaa 1081 tcagtcagtc tgtgacaaca atgcactccc tttaatctgg aggggcctaa ttgatttaac 1141 gggatactat gaggcagtcc acccttgcaa tgtgttctgt gtcttatcag gaccaggtgc 1201 ttcatgtgag gccttttcag aaggaggtat tttcaatatt acttctccaa tgtgtctggt 1261 gtctaagcaa aatagattta gagcagctga gcagcagatt agctttgtct gccaaagagt 1321 tgatatggat attatagtgt actgtaatgg tcagaaaaaa acaatcctaa caaaaacatt 1381 agttataggc caatgtattt atactattac aagtctcttt tcactgttac caggggttgc 1441 ccattctatt gctattgagt tgtgtgttcc agggtttcat ggctgggcca cagctgcact 1501 tttgattaca ttctgcttcg gctgggtatt gattcctgca tgtacattag ctattctttt 1561 agtccttaag ttctttgcaa atatccttca tacaagcaat caagagaacc gattcaaagc 1621 cattctacgg aaaataaagg aggagtttga aaaaacaaag ggttccatgg tttgtgagat 1681 ctgtaagtat gagtgtgaaa cattaaagga attgaaggca cataacctat catgtgttca 1741 aggagagtgc ccatattgct ttacccactg tgaaccgaca gaaactgcaa ttcaggcaca 1801 ttacaaagtt tgtcaagcca cccaccgatt cagagaagat ttaaaaaaga ctgtaactcc 1861 tcaaaatatt gggccaggct gttaccgaac actaaatctt tttaggtata aaagtaggtg 1921 ttatattctg acaatgtgga ctcttcttct cattattgaa tccatcctct gggcagcaag 1981 tgcagcagaa atcccccttg tccctctctg gacagataat gctcatggcg ttgggagtgt 2041 tcctatgcat acggatcttg aattagactt ctctttgcca tccagttcta agtacacata 2101 caaaagacat ctcacaaacc cagttaatga ccaacagagt gtctcattgc atatagaaat 2161 tgaaagtcaa ggcattggtg ctgctgttca tcatcttgga cattggtatg atgcaagatt 2221 gaatctaaaa acctcatttc attgttatgg tgcctgcaca aaatatcaat acccatggca 2281 cactgcaaaa tgccattttg agaaagatta tgagtatgaa aatagctggg cttgcaaccc 2341 cccagattgc ccaggggttg gtacaggttg tactgcttgt ggattatatc tagatcaatt 2401 gaagccggta ggaacagcct ttaaaattat aagtgtaaga tacagtagaa aagtgtgcgt 2461 gcagtttggt gaagaacacc tttgtaaaac aattgatatg aatgattgct ttgtgactag 2521 gcatgccaaa atatgtataa ttgggactgt atctaagttt tctcaaggtg acactctact 2581 atttctgggg cccatggaag gaggtggtat aatctttaaa cactggtgta catctacctg 2641 tcactttgga gaccctggtg atgtcatggg tccaaaagat aaaccattta tttgccctga 2701 attcccaggg caatttagga aaaaatgtaa ctttgccaca actccagttt gtgaatatga 2761 tggaaacatt atatcaggct ataagaaagt acttgcaaca attgattctt tccaatcatt 2821 taacacaagc aatatacact tcactgatga gagaattgaa tggagagacc ctgatggcat 2881 gcttcgggat catattaata ttgttatttc taaagatatt gattttgaaa atttggctga 2941 gaatccttgt aaagtagggc tccaggcagc aaacatagaa ggtgcctggg gttcaggtgt 3001 cgggtttaca ctcacatgca aggtgtctct cacagaatgc ccaacatttc ttacatcaat 3061 aaaggcctgt gacatggcaa tttgttatgg tgcagaaagt gtgacactct cacgaggaca 3121 aaatactgtc aaaattaccg ggaaaggtgg ccatagtggt tcttcattca aatgctgtca 3181 tgggaaagaa tgttcatcaa ctggcctcca agccagtgca ccacatctgg ataaggtaaa 3241 tggtatctct gagttagaaa acgagaaagt ttatgatgac ggtgcacctg aatgtggcat 3301 tacttgttgg tttaaaaaat caggtgaatg ggttatgggt ataatcaatg ggaactgggt 3361 tgtcctaatt gtcttgtgtg tactgctgct cttttctctt atcctgttga gcatcttgtg 3421 tcctgttaga aagcataaaa aatcataaat cccacctaac aatcttcaca tcatgtatcg 3481 attttcaaac actttatcat ttagaactta acttggcact actatctgat aactgacttt 3541 catttttatt tttatatgga ttaattacta aaaaaaatac tctcttctat ctcccaatct 3601 tttattgatt caccggggtg ctgtcttgac atctggcggc gtctactact a // LOCUS SRVAGSS 1769 bp ss-RNA VRL 27-JUL-1990 DEFINITION Sapporo rat virus mRNA S segment, encoding an ORF, complete cds. ACCESSION M34881 KEYWORDS . SOURCE Sapporo rat virus (strain SR-11), cDNA to viral RNA, passed in Vero E6 cell line C1008. ORGANISM Sapporo rat virus Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses; Bunyaviridae; Hantavirus. REFERENCE 1 (bases 1 to 1769) AUTHORS Arikawa,J., Lapenotiere,H.F., Iacono-Connors,L., Wang,M. and Schmaljohn,C.S. TITLE Coding properties of the S and the M genome segments of Sapporo rat virus: Comparison to other causative agents of hemorrhagic fever with renal syndrome JOURNAL Virology 176, 114-125 (1990) STANDARD simple staff_entry FEATURES from to/span description pept 43 1332 S segment ORF BASE COUNT 569 a 331 c 410 g 459 t ORIGIN 1 tagtagtaga ctccctaaag agctactaca ctaacaagaa aaatggcaac tatggaagaa 61 atccagagag aaatcagtgc tcacgagggg cagcttgtga tagcacgcca gaaggtcaag 121 gatgcagaaa agcagtatga gaaggatcct gatgacttaa acaagagggc actgcatgat 181 cgggagagtg tcgcagcttc aatacaatca aaaattgatg aattgaagcg ccaacttgcc 241 gacagattgc agcagggaag aacatccggg caggaccggg atcctacagg ggtagagcca 301 ggtgatcatc ttaaggaaag atcagcacta agctacggga atacactgga cctgaatagt 361 cttgacattg atgaacctac aggacagaca gctgattggc tgaccataat tgtctatctg 421 acatcattcg tggtcccgat catcttgaag gcactgtaca tgttaacaac acgaggtagg 481 cagacttcaa aggacaacaa ggggatgagg atcagattca aggatgacag ctcatatgag 541 gatgtcaatg gaatcagaaa gcccaaacat ctgtatgtgt caatgccaaa cgcccaatcc 601 agcatgaagg ctgaagagat aacaccagga agattccgca ctgcagtatg tggactatat 661 cctgcacaga taaaggcaag gaatatggta agccctgtca tgagtgtagt tgggttcttg 721 gcactggcaa aagactggac atcgagaatt gaagaatggc tcggtgcacc ctgcaaattc 781 atggcggagt ctcttattgc cgggagttta tctgggaatc ctgtgaatcg tgactatatc 841 agacagagac aaggtgcact tgcagggatg gagccaaagg aatttcaagc cctcaggcaa 901 cattcaaagg atgctggatg tacactagtt gaacatattg agtcaccatc atcaatatgg 961 gtgtttgctg gggcccctga taggtgtcca ccaacatgct tgtttgtcgg agggatggct 1021 gaattaggtg ccttcttttc tatacttcag gatatgagga acacaatcat ggcttcaaaa 1081 actgtgggca cagctgatga aaagcttcga aagaaatcat cattctatca atcatacctc 1141 agacgcacac aatcaatggg aatacaactg gaccagagga taattgttat gtttatggtt 1201 gcctggggaa aggaggcagt ggacaacttt catctcggtg atgacatgga tccagagctt 1261 cgtagcctgg ctcagatctt gattgaccag aaagtgaagg aaatctcaaa ccaggaacct 1321 atgaaattat aagtacataa atatataatc aatactaact ataggttaag aaatactaat 1381 cattagttaa taagaatata gatttattga ataatcatat taaataatta ggtaagttaa 1441 ctagtattta gttaagttag ctaattgatt tatatgattg tcacaattaa atgtaatcat 1501 aagcacaatc actgccatgt ataatcacgg gtatacgggt ggttttcata tggggaacag 1561 ggtgggctta gggccaggtc accttaagtg accttttttt gtatatatgg atgtagattt 1621 caattgatcg aatactaatc ctactgtcct cttttctttt cctttctcct tctttactaa 1681 caacaacaaa ctacctcaca ccttaatata tactacttta ttaagttgtt aagttgtgtc 1741 tttttgggga gtaagggagt ctactacta // LOCUS TRNSTIIA 1315 bp ds-DNA BCT 27-JUL-1990 DEFINITION Transposon Tn4521 (from E.coli) heat-stable toxin II gene, complete cds. ACCESSION M35123 KEYWORDS heat-stable toxin II; insertion sequence; transposase. SOURCE Transposon Tn4521 (from Escherichia coli) DNA. ORGANISM Transposon Tn4521 Prokaryota; Bacteria. REFERENCE 1 (bases 1 to 1315) AUTHORS Hu,S.T. and Lee,C.H. TITLE Characterization of the transposon carrying the STII gene of enterotoxigenic Escherichia coli JOURNAL Mol. Gen. Genet. 214, 490-495 (1988) STANDARD simple staff_entry FEATURES from to/span description pept 879 400 (c) heat-stable toxin II (STII) trns 88 607 insertion sequence IS2 (bases 17-536) trns 608 970 insertion sequence IS2 (bases 969-1327) BASE COUNT 313 a 349 c 320 g 333 t ORIGIN 1 gccgccggca gatggagctg gatattctaa aaaagacccg ggcatcagta tcagtcacct 61 gaacaacaga gaaaaaacaa agcttgcttt ccagacacct gttatcactt aacccattac 121 tggcccgctg ccgtagatat tcccgtggcg agcgataacc cagtgcacta tgcggatgcc 181 attcgttata atgctcgaac gcctctgcaa ggttctttgc tgccgttaac ccgtctggtt 241 tgggcatgat actgatgtaa tcacgcttta tcgttttcac gaagctctct gctatgccgt 301 tactctccgg actccgcacc gccgtgttct tcggttcaag tcccaacatc cgggcgaact 361 ggcgtgtttc attagcccgg tagcatgaac cattatccgt cagccattcc actggcgata 421 tcggaagctc gctgccgaag gggcgtacca ccgctcccag catgacgtcc tgtactgttt 481 cactgtcgaa gccgcccgta gtaaccgccc agtgcagtgc ctcacggtca cagcagtcca 541 gcgcgaacgt gactcgcagt ttttctccgt tatcacagcg gaactcgaac ccgtcagagc 601 accatcgttt tttttgccga gcaggcgctg gagttcttta atctgcttca tggcagcagc 661 aagttcagag gcaggaacga cctgttctcc tgcggccaca gcagtaagac ttcccttctg 721 gtattgcttg cgccagagaa ataactggct ggctgccgca ccgtgttgcc gggtaacaag 781 ggagcccgtc attcccggtt caaagctctg ctgaacaata gcgatctttt cctgtgtggt 841 acgccgtctg cgtttctccg gccctaagac atcactcatc tgctctccaa tgactagtct 901 aaaaactagt attaagacta tcacttaaat aagtgatatt ggttgtctgg agattcaggg 961 ggccagtcta cggcctatgt cggtctggtg tcaaaacagt caggctccgg agataaagtt 1021 cgactactgg gaataagcaa gtgtggtgat acttatctcc gaacattatt tattcatgga 1081 gcaagggcgg cagcattgct ggtcaaagaa ctagggccat ggatacacaa cttaagatac 1141 aacgactgtc tggtgtttca gggggccagc tcaaacattt ggcaaagata tacgtcttct 1201 ttacaatccg gaggtgttaa cggctctact tgtatcagct acgttctggt tatacgtaat 1261 agcataagaa tggtcgtgcg catacacgct ctggaaagca gaactagctg gacga //