Path: utzoo!attcan!uunet!snorkelwacker!bionet!root From: GenBank-Updates@genbank.bio.net Newsgroups: bionet.molbio.genbank.updates Subject: Database Update Message-ID: Date: 11 Jul 90 12:00:16 GMT Sender: root@genbank.BIO.NET Distribution: bionet Lines: 4117 Approved: lear@genbank.bio.net Checksum: 30461 252 LOCUS CDCXYNAB 6067 bp ds-DNA BCT 11-JUL-1990 DEFINITION C.saccharolyticum xylanase A (XynA), beta-xylosidase (XynB) and acetyl esterase (XynC) genes, complete cds. ACCESSION M34459 KEYWORDS acetyl esterase; beta-xylosidase; xylanase. SOURCE C.saccharolyticum DNA, clone pNZ1400. ORGANISM Caldocellum saccharolyticum Prokaryota; Bacteria. REFERENCE 1 (bases 1 to 6067) AUTHORS Luethi,E., Love,D.R., McAnulty,J., Wallace,C., Caughey,P.A., Saul,D. and Bergquist,P.L. TITLE Cloning, sequence analysis, and expression of genes encoding xylan-degrading enzymes from the thermophile "Caldocellum saccharolyticum" JOURNAL Appl. Environ. Microbiol. 56, 1017-1024 (1990) STANDARD simple staff_review FEATURES from to/span description pept 195 1223 xylanase/beta-xylosidase (XynA) precursor sigp 195 293 xylanase/beta-xylosidase signal peptide matp 294 1220 xylanase/beta-xylosidae pept 1257 2057 acetyl esterase (XynC) pept 2198 2491 ORF 3 pept 2491 3429 ORF 4 pept 3445 4911 beta-xylosidase (XynB) (3445 could be 3463) pept 5439 > 6067 ORF 6 BASE COUNT 2230 a 787 c 1243 g 1807 t ORIGIN 1 bp upstream of BamHI site. 1 ggatccccgc aaagcctaaa ataagtacat ttagaatgat ggcagaaaat ggttatatta 61 cccttgaatt tacgttaagt aaaaatgctg tggtgctttt tgaggtaagc aaggttgtag 121 atgagtcaga tacttatata ggacttgacg atagtaaaat accaggttat tagttgcttt 181 ataaaataaa aggaatgagg tgtttaattg tgtgcgaaaa tttagagatg ctaaacttat 241 cattagcaaa aacatacaaa gattacttta aaataggtgc tgcagtaact gcgaaagatt 301 tagaaggagt tcatagggat attcttttga agcattttaa tagcctcaca ccagaaaatg 361 ccatgaagtt tgaaaatatt catccagaag agcagagata taattttgaa gaggttgcca 421 ggataaaaga gtttgcaatt aaaaatgaca tgaagttaag aggacataca tttgtttggc 481 ataatcaaac tccggggtgg gtgtttttag ataagaatgg ggaagaagcc tcaaaagagt 541 tagttattga aaggttaaga gagcatataa aaactttgtg tgagagatac aaggatgtag 601 tatatgcgtg ggatgtggtg aacgaagcag tagaagataa aacagaaaag cttttgcgag 661 aatcaaactg gagaaaaatt attggagatg attatattaa aattgctttt gagatagcaa 721 gagaatatgc aggagatgca aagttatttt ataacgatta taacaatgaa atgccttata 781 aattagaaaa aacctacaaa gttctaaaag agcttttaga aagaggtact ccaatagatg 841 gaattggtat acaagcacac tggaatatat gggataaaaa tcttgttagt aatttaaaaa 901 aggctataga agtatatgct tccttaggtt tagaaattca tattacagaa cttgacattt 961 cagtatttga gtttgaagat aagaggactg acttgtttga accaaccccg gaaatgcttg 1021 aactacaagc aaaagtatat gaagatgtat ttgcagtttt tcgagaatat aaagatgtaa 1081 taacttctgt tacattatgg ggtattagcg acagacacac atggaaagat aacttccctg 1141 taaagggtcg aaaagattgg cctctcttat tcgacgtaaa tggaaaacca aaagaagcct 1201 tgtacaggat attaagattt taaagatttt ttaacgaaga aaggggttct tttaatatgg 1261 ctatcatgca aatcaacttt tattcaaaga tgttgaaaaa gaacacaaca attttggcca 1321 ttttacccgt agataaacca gataagaaat tccagaaaga tgttgatagt gaaaatttga 1381 aaaccttata tcttttgcat ggttatgctg gtaactacat ggattggttg tgtggagccc 1441 gaattgttga attatcaatg cgatataatg ttgctgtgtt tctgccatca ggtgaaaata 1501 gtttttattt agatgatgaa gaaaaggaag aatattttgg tgaatttgtg ggaaatgaaa 1561 ttatagaatt tacaagaagc gtttttccta ttcctcaaaa aagggaaaaa acttttattg 1621 gcggtttatc aatgggaggt tacggtgctc ttagaaatgg gcttaaatat aacaagaatt 1681 ttgtaggtat aatagcttta tcatcagcac taataattca taagattgca ggtattccta 1741 aggattatag gaatgcttat gcaagttata actattatag acgagtgttt ggagacctaa 1801 actctttaat aggtagcgat aaagacataa atgccttagt tactaagcta aaacaagaaa 1861 aaggtagtat tccaaaaata tacatggcat gcggcagaga tgacttttta gttcaagaaa 1921 acagagattt atttaatttt ttgaaaaatg aaggtataga cgtggtttat gaggaagacg 1981 aaggtggaca tgactgggat ttttggaaca aatatattgc aaatgctttt gagtggatga 2041 gtaaggtttc tgattaagtc ttcacgtacc ctgttttaag ttttacaaat agatttgtgg 2101 ggtgaatagg tttttttaac actattttat taaggaagag gatgaaaaat aaaaaaagtg 2161 gacaaatttc ttgttaattg taattacatg cattgcaatg gttttctttt ttacatcgtg 2221 tactattcag tctgctatag agcagaagaa aactgttgag gaaatcttgg gaaaaatagg 2281 tgagagtgag gacaaaacaa attcaagggg gcaaccagca acaatgaaag aggatgaagt 2341 tgaagataat cctttaaaag atgtatataa agattatttc ctggttggag cagcaattaa 2401 tggctattct gttgaaactg ctgctatcaa tcatcctggt atggctgcaa ttttgaaaaa 2461 aactttaaca gtacaaccct atctaatttg atgaaacaac aatacctttt agattatgaa 2521 gctacaaaag caagtaaaaa tggaatgcca gtgtgtaaat ttgacagctg cattcctgct 2581 ttacaatttt gtaaggaaaa tggcataaaa atgagaggac atgtgttagt atggcataat 2641 cagacaccag aatggttttt ccacaaagac tatgatgtat cgaaaccact tgtagatgct 2701 gctactatgg aacgccggtt ggaaagttat atcaaacagg taattgaatt ttgtcaaaaa 2761 aattatcccg gtgtagtcta ttgctgggat gttgttaacg aagctatact tgatgatggt 2821 tcatggagag aaatcaataa taattggtat accattatga aagaaaagta tgtggaaaag 2881 gcattttatt atgcaagaaa atatgccaaa aaagatgttg ccctgtttta caatgattac 2941 aatgtttttc tccctgcaaa gagagaagca atttataatc ttgctcagaa acttaaagaa 3001 aaaggattga ttgacgggtt gggtcttcaa cctacagtag gcttgaatta tcctgaatta 3061 gattctgatg atatagattc attcaaaacg acattagaaa catttgcaaa acttggctta 3121 caaattcata ttactgagtt aaattttgaa ataaagggag atgagagcaa tcgtactcct 3181 gaaaatctca aaaaacaagc agataggtat tacgaaatga tgaagttatt attgaaggaa 3241 gatactgata atggtgggcc ttgcaacata acttgtgtta ctgtttttgg tatctgtgac 3301 gattatccac tatataaaaa ttttaagcag tgcatgtatc tttgggataa aaattgcaat 3361 cctaaaccat gtttttattc atttctccaa gcaggtttag actggaaagc atctttatta 3421 agcaaataag aatgaacaac acttatggag aggaggaaaa taatgaaaat aactattaat 3481 tatggaaaga gacttgggaa aataaacaaa ttttgggcaa aatgtgttgg aagctgtcat 3541 gctacaactg cgttaagaga agactggcga aagcaattaa aaaaatgtcg tgacgaactt 3601 ggttttgagt atattcgatt tcatggttgg ttgaatgatg atatgagtgt ttgttttaga 3661 aatgatgatg ggctactttc attctcattc ttcaacatag attctataat tgattttctt 3721 ttggagatag gtatgaaacc atttattgaa ctgagcttta tgccagaagc gttagcgtca 3781 ggtacaaaga cagttttcca ttacaaagga aatataacac cgccgaaatc ttatgaagaa 3841 tggggtcagc tgattgagga gttagcaagg catcttatta gcagatatgg gaaaaatgaa 3901 gtaagagaat ggttttttga ggtatggaac gaaccaaatc taaaggattt cttctgggca 3961 ggaacaatgg aagaatattt taagctttac aaatatgctg cttttgcaat aaagaaagtg 4021 gactctgaac taagggtagg tggaccagct actgcaatcg atgcatggat acctgaacta 4081 aaagattttt gtacaaaaaa tggtgttcca atagatttta tttcaacgca tcaatatcca 4141 acagatttag cattcagtac aagctcaaat atggaagagg ctatggcaaa agcaaagaga 4201 ggtgaattag cagagagggt aaaaaaggct ttagaggaag catatccatt gcctgtttac 4261 tacactgaat ggaataactc tccaagtcct cgagacccat atcacgacat accttacgat 4321 gctgctttta ttgtaaaaac aataattgac attatagatt taccacttgg gtgttattct 4381 tattggacat ttacagatat ctttgaagaa tgtggacaga gttctttacc ttttcatggg 4441 ggattcgggc ttctaaatat tcatggtata ccaaaaccat cctatagagc atttcaaatt 4501 ttagataaac taaacggtga gaggattgag atagagtttg aagataaaag cccaaccatt 4561 gattgtatag ctgtccagaa tgagagagag ataatacttg tgatctcaaa ccataatgtt 4621 ccgctgtctc ctattgatac cgaaaatata aaagttgttt taaaaggtat tgagaattgc 4681 cgagaagttt ttgttgagag aatagatgaa tataatgcca atccaaaaag agtatggctt 4741 gaaatgggca gtcctgcgta tctcaataga gaacagattg aggagttgat aaaagcatca 4801 gaactaaaga aagagaaagt ttcatggggg attgtgaata ataatgaaat tacatttgat 4861 ttaagtgttt tacctcactc agttgtggct gttacaatta agaatggtta gtgaaatgtt 4921 aagagagaaa agcaattttg tatatctctt ttaattttta cctttgacac atcaaacaat 4981 ctaaattaaa attaaagtat agtgttttgc atactcaaca tagtataaat tatataaggg 5041 taacattaat accctttttg tttttgtaag ggggtgtttt tgtggcaaag cacacgcaaa 5101 aaggtaaatc agctgccaca gccgccgtgt cagacaaaga aaaagcaagg tttgttccta 5161 aaaatattca agctgagata aaagaaaaga ttaaagacac tggtgaaaaa gtagcaaagg 5221 ctgagggtaa ggacaaagca cttttacagt taaagctgga gagcaacaaa aaggttgata 5281 agaaaaaatt caaaaaggat agaagtgttg agaggaataa aacttcatta aatagatttt 5341 taagtttaga taaaattaaa tccctatatt caaaagagat acataataaa ctttcacaca 5401 tctttgaaga tgcagtttct gaggtttata gaattttaat ggggctaaag tatatcaaaa 5461 aggcgccaaa ttacaccgaa attgttctga aggcaaagat attttcaacc ttgattttga 5521 tgattgtaat attattttta atcaacaaaa tgccttctac atacaaaaaa gcgtatgcag 5581 ttgttttgaa caatcagatt gtagggtatg tgaaggacaa gactgaagca caaaaccttc 5641 ttacccagat taaaaaagaa gtagaggaaa gacacaatac agacagtttc attttacaaa 5701 gtaagcttca actaaagagc attgagcctg gtcaatatcg tgagacaagg gttgatgagc 5761 tgaaaaatac tatcatagaa aaggggaagg tccttgtaaa aaggtatgct atttttgtta 5821 attcaaaacc atattttgta tttgaaaatc cacaaactcc aaataatatt cttaacaagc 5881 taaaaaaggt ctattataat gacaaggcat cacaggcaaa attcttagag aaggtagaaa 5941 taaaaccagt ttatgtctca ccagctatta aagtagctga tgaagctact gccttaacaa 6001 agattatgtt tgggaaagac caggtaatag aatatacagt caaggaagga gatactcttt 6061 gggatcc // LOCUS PFAAMA1 2307 bp ds-DNA INV 11-JUL-1990 DEFINITION P.fragile apical membrane antigen 1 (AMA1/AG352) gene, complete cds. ACCESSION M29898 KEYWORDS apical membrane antigen. SOURCE P.fragile (Nilgiri strain) DNA, from Macaca mulatta, clone AG352VATV1. ORGANISM Plasmodium fragile Eukaryota; Animalia; Protozoa; Microspora; Microsporea; Microsporida; Haemosporina; Plasmodiidae. REFERENCE 1 (bases 1 to 2250) AUTHORS Peterson,M.G., Nguyen-Dinh,P., Marshall,V.M., Elliott,J.F., Collins,W.E., Anders,R.F. and Kemp,D.J. TITLE Apical membrane antigen of Plasmodium fragile JOURNAL Mol. Biochem. Parasitol. 39, 279-284 (1990) STANDARD full staff_review REFERENCE 2 (bases 2251 to 2307) AUTHORS Peterson,M.G., Nguyen-Dinh,P., Marshall,V.M., Elliott,J.F., Collins,W.E., Anders,R.F. and Kemp,D.J. JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by V.Marshall, 15-NOV-1989. FEATURES from to/span description pept 275 1963 apical membrane antigen 1 (AMA1/AG352) precursor sigp 284 322 put. apical membrane antigen 1 signal peptide matp 323 1960 apical membrane antigen 1 BASE COUNT 794 a 445 c 499 g 569 t ORIGIN 1 taagttccct ttctacaccc ggatgcctcc tagagcaaat aggagtttca agcgtttaca 61 tgtaatttac caagcgtttg taattttgca actttgcaat ttttctactg cgcaagtttg 121 taaccgtgaa gctgctcacc tgtgtgacgg ccaattttta ccaacggtta aacctgttag 181 tggctatttt tttctcgccc ccctcctgat tgatgtgcag agggagagaa ccaaatagct 241 gcctttttct tgagtcacaa tttaacaaca caatatgaat aaaatatact gcatactgtt 301 tttaagtgcc cagtgccttg tgcacatggg taagtgcgag ccaaaccaga agccgagcag 361 gctgacccgc agcgctaaaa acgttttgtt ggaacaggag cctatggttg agagaagtac 421 acgaatgagt aacccatgga aagcattcat ggaaaagtac gatatcgaaa aaacacacag 481 ttctggtatt cgagtagatt taggggaaga tgcagaagtg ggaaattcca gctatagaat 541 accagcagga aaatgtcctg tttttggaaa gggtatcgtt atacagaatt ctgaggttag 601 tttcttaaca cctgtagcta caggcaatca aaagttgaag gatggaggtt tcgcctttcc 661 acaagcaaat gatcatattt cccctatatc cataaaaaac cttagagaaa ggtataaaga 721 gaatccagat ttgatgaagc taaacgattt agctttgtgt aaaactcatg cagccagctt 781 tgtaatggaa atggataaaa attcgtccta tagacaccca gctgtatatg atgaagataa 841 aaaaatatgt tacatgttgt atttatcagc gcaagaaaat atgggtccaa gatactgtag 901 taaagatgca gaaaataaag atgctatgtt ttgcttcaag ccagataaaa atgaaacatt 961 tgaccatctt gcctatttaa gcaaaaatgt ggttaatgat tggcaaaaca aatgcccccg 1021 taaaaattta ggaaattcta aatttggatt atgggtggat ggaaactgtg aagaaatccc 1081 atacgttcaa gacgtgcagg caaaggatct acgcgaatgt aacagaatcg ttttcgaagc 1141 tagcgcttca gatcaaccaa ctcagtacga agaagaacta accgattatc aaaaaataca 1201 agaaggcttt agacaaaacg atcagggtat gattaaaagt gcttttcttc cagtaggtgc 1261 attcaactcg gacaatttta agagtaaagg aagaggatat aactgggcaa atttcgatac 1321 tgaaaataag gtttgttacc tttttaatgc caaacccact tgcctcatta atgacaaaaa 1381 ctttatcgca acaacagcgt tatctcatcc ccaagaagta gacaatgagt ttccatgcag 1441 catatacaaa gatgaaatgg aaagggaaat gaggaaagaa tcgaggaaca tgagtctgta 1501 caatgttgat aaggcacgga ttgttctgcc aaggatattt atctccaacg ataaggacag 1561 tctcaaatgt ccatgcgcac cagaacacat taccaacagt acctgcaact tttacgtttg 1621 taactgtgta gagaaaaggg cagaaattaa agaaaataac gaagtggcca taaaggaaga 1681 atttaagcaa gattaccaat acgcgcaagg tgaatccaaa aatcagatgc tcctaattat 1741 tatcggaata actggaggtg tgtgtgtggt cgcactggct tccatgtttt acttcaggaa 1801 gaaagctcac aatgataagt atgacaagat ggagcaggca gacgggtacg ggaaacccac 1861 caccaggaaa gacgagatgc tcgaccccga ggcgtccttc tggggtgaag aaaagcgggc 1921 ctcccacacc acccctgtgc tgatggagaa gccttactac tgagcgggga agcaaccgaa 1981 ttggtgaggg cctctttggt cgtaaacaaa gtgggggtgc ctcacaatgc atattttcaa 2041 cccgcgtcat gtaaaaaaga aaaacgagac acacccagct ggccaacaaa ttgcccacaa 2101 gggaggagaa atggagcaag ctaaaattgg gctattgtca tcatcaccag ttaccgagga 2161 aatgaaaaca acaacaaaaa aaaacgtaac acatggtaaa gtaactgatt ggttaagcaa 2221 agccgagtga aaatttaccc cacttgcgat ttaaaagcat gatttgcctc caccaaatgg 2281 acctctccac tattaatatt accggag // LOCUS RICAAMYA 1553 bp ss-mRNA PLN 11-JUL-1990 DEFINITION Rice alpha-amylase mRNA, complete cds, clone pOS103. ACCESSION M24286 KEYWORDS 1,4-alpha-D-glucan glucanohydrolase; alpha-amylase. SOURCE Rice (strain M202), cDNA to mRNA, clone pOS103. ORGANISM Oryza sativa Eukaryota; Plantae; Embryobionta; Magnoliophyta; Liliopsida; Commelinidae; Cyperales; Poaceae. REFERENCE 1 (bases 1 to 1553) AUTHORS O'Neil,S.D., Kumagai,M.H., Majumdar,A., Huang,N., Sutliff,T.D. and Rodriguez,R.L. TITLE The alpha-amylase genes in Oryza sativa: Characterization of cDNA clones and mRNA expression during seed germination JOURNAL Mol. Gen. Genet. 221, 235-244 (1990) STANDARD simple staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by M.H.Kumagai, 25-APR-1989. Author address: M.H.Kumagi UC Davis, Dept. of Genetics, Davis, Ca. 95616 FEATURES from to/span description pept 34 1338 alpha-amylase (EC 3.2.1.1) BASE COUNT 340 a 486 c 446 g 281 t ORIGIN 1 atcaatcatc catctccgaa gtgtgtctgc agcatgcagg tgctgaacac catggtgaac 61 aaacacttct tgtccctttc ggtcctcatc gtcctccttg gcctctcctc caacttgaca 121 gccgggcaag tcctgtttca gggattcaac tgggagtcgt ggaaggagaa tggcgggtgg 181 tacaacttcc tgatgggcaa ggtggacgac atcgccgcag ccggcatcac ccacgtctgg 241 ctccctccgc cgtctcactc tgtcggcgag caaggctaca tgcctgggcg gctgtacgat 301 ctggacgcgt ctaagtacgg caacgaggcg cagctcaagt cgctgatcga ggcgttccat 361 ggcaagggcg tccaggtgat cgccgacatc gtcatcaacc accgcacggc ggagcacaag 421 gacggccgcg gcatctactg cctcttcgag ggcgggacgc ccgactcccg cctcgactgg 481 ggcccgcaca tgatctgccg cgacgacccc tacggcgatg gcaccggcaa cccggacacc 541 ggcgccgact tcgccgccgc gccggacatc gaccacctca acaagcgcgt ccagcgggag 601 ctcattggct ggctcgactg gctcaagatg gacatcggct tcgacgcgtg gcgcctcgac 661 ttcgccaagg gctactccgc cgacatggca aagatctaca tcgacgccac cgagccgagc 721 ttcgccgtgg ccgagatatg gacgtccatg gcgaacggcg gggacggcaa gccgaactac 781 gaccagaacg cgcaccggca ggagctggtc aactgggtcg atcgtgtcgg cggcgccaac 841 agcaacggca cggcgttcga cttcaccacc aagggcatcc tcaacgtcgc cgtggagggc 901 gagctgtggc gcctccgcgg cgaggacggc aaggcgcccg gcatgatcgg gtggtggccg 961 gccaaggcga cgaccttcgt cgacaaccac gacaccggct cgacgcagca cctgtggccg 1021 ttcccctccg acaaggtcat gcagggctac gcatacatcc tcacccaccc cggcaaccca 1081 tgcatcttct acgaccattt cttcgattgg ggtctcaagg aggagatcga gcgcctggtg 1141 tcaatcagaa accggcaggg gatccacccg gcgagcgagc tgcgcatcat ggaagctgac 1201 agcgatctct acctcgcgga gatcgatggc aaggtgatca caaagattgg accaagatac 1261 gacgtcgaac acctcatccc cgaaggcttc caggtcgtcg cgcacggtga tggctacgca 1321 atctgggaga aaatctgagc gcacgatgac gagactctca gtttagcaga tttaacctgc 1381 gatttttacc ctgaccggta tacgtatata cgtgccggca acgagctgta tccgatccga 1441 attacggatg caattgtcca cgaagtactt cctccgtaaa taaagtagga tcagggacat 1501 acatttgtat ggttttacga ataatgctat gcaataaaat ttgcactgct taa // LOCUS RICAAMYB 1682 bp ss-mRNA PLN 11-JUL-1990 DEFINITION Rice alpha-amylase mRNA, complete cds, clone pOS137. ACCESSION M24287 KEYWORDS 1,4-alpha-D-glucan glucanohydrolase; alpha-amylase. SOURCE Rice (strain M202), cDNA to mRNA, clone pOS137. ORGANISM Oryza sativa Eukaryota; Plantae; Embryobionta; Magnoliophyta; Liliopsida; Commelinidae; Cyperales; Poaceae. REFERENCE 1 (bases 1 to 1682) AUTHORS O'Neil,S.D., Kumagai,M.H., Majumdar,A., Huang,N., Sutliff,T.D. and Rodriguez,R.L. TITLE The alpha-amylase genes in Oryza sativa: Characterization of cDNA clones and mRNA expression during seed germination JOURNAL Mol. Gen. Genet. 221, 235-244 (1990) STANDARD simple staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by M.H.Kumagai, 25-APR-1989. Author address: M.H.Kumagi UC Davis, Dept. of Genetics, Davis, Ca. 95616 FEATURES from to/span description pept 78 1382 alpha-amylase (EC 3.2.1.1) BASE COUNT 355 a 491 c 519 g 317 t ORIGIN 1 atccatcatc tacaagagat cgatcagtag tggttagcag caactcacta tcgaacacgg 61 tttcagctta cacagatatg aagaacacca gcagcttgtg tttgctgctc ctcgtggtgc 121 tctgcagctt gacctgtaac tcgggtcaag cacaggtcct cttccagggt ttcaactggg 181 agtcgtggaa gcagcagggt ggctggtaca acatgttgaa aggccaagtc gacgacatcg 241 ccaaggccgg ggtcacccac gtctggctgc cgccgccgtc gcactccgtg gcgcgagggt 301 acatgccggg gcgtctctac gacctggacg cgtccaagta cggcacggcg gcggagctca 361 agtcgctgat cgcggcgttc cacgggaagg gcgtccagtg cgtcgccgac gtcgtgatca 421 accaccggtg cgccgagaag aaggacgccc gcggcgtgta ctgcgtgttc gagggcggga 481 cgcgcgaccg cctcgactgg ggccccggca tgatctgcag cgacgacacg cagtactccg 541 acggcacggg ccaccgcgac accggcgagg ggttcggcgc ggcgcccgac atcgaccacc 601 tcaacccgcg cgtccagcgg gagctcaccg actggctcaa ctggctcaag tccgacgtcg 661 gcttcgacgg ctggcgcctc gacttcgcca agggatactc cacggacatc gctaagatgt 721 acgtcgagag ctgcaagccg ggcttcgtcg tcgccgagat atggaactcg ctgagctaca 781 acggcgacgg caagccggcg gccaaccagg accagggccg gcaggagctg gtgaactggg 841 tgaacgccgt cggcgggccg gcgatgacgt tcgacttcac caccaagggc ctcctgcagg 901 cgggcgtcca gggcgagctg tggcggctgc gcgacggcaa cggcaaggcg cccggcatga 961 tcgggtggct gccagagaag gccgtcacgt tcgtcgacaa ccacgacacc ggctcgacgc 1021 agaagctttg gccgttcccc tccgacaagg tcatgcaggg ctacgcctac atcctcaccc 1081 accccggagt cccctgcatc ttctacgacc acatgttcga ctggaacctg aagcaggaga 1141 taaccgcgct ggcggcgatc agggagagga acggcatcaa cgccgggagc aagctccgga 1201 tcgtcgtcgc cgacgccgac gcatacgtcg ccgtcgtcga cgagaaggtc atggtgaaga 1261 tcgggacgag gtacgacgtg ggcaacgcgg tgccgtcgga tttccatcag acggtgcacg 1321 gcaaggacta cagcgtctgg gagaaggggt ccctccgcgt cccggcgggg cggcacctat 1381 agcgggctca agccctaaac tgaacgggat agtcatgctc aaaccagttt ctacacggca 1441 agaatttact gattcttata ctttttcagt caattaaatt atggttttta tatatgtaat 1501 tttgtatccg attgtagcgt tcgaataagt aggcaggctc tctagcctct aggttaattg 1561 cgggcatatg tagcttgcca gttaattgtg tttgtatcac gcagtttgta accgttggtg 1621 catatatatg tcaggttcag gatgcagtaa aaaatcatac tgcaccgatc agtgagtttt 1681 ta // LOCUS HUMCEAPX 494 bp ss-mRNA PRI 11-JUL-1990 DEFINITION Human cell adhesion protein (SQM1) mRNA, complete cds. ACCESSION M33374 KEYWORDS cell adhesion protein. SOURCE Human squamous carcinoma cell line SCC25, cDNA to mRNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 494) AUTHORS Wong,Y.-C., Tsao,S.-W., Kakefuda,M. and Bernal,S.D. TITLE cDNA cloning of a novel cell adhesion protein expressed in human squamous carcinoma cells JOURNAL Biochem. Biophys. Res. Commun. 166, 984-992 (1990) STANDARD simple staff_review FEATURES from to/span description pept 36 443 cell adhesion protein (SQM1) mRNA < 1 494 SQM1 mRNA BASE COUNT 107 a 154 c 163 g 70 t ORIGIN 1 ccctcggtgc tgcagggatc tgcaggactg cagccatggg ggcgcacctg gtccggcgct 61 acctgggcga tgcttcggtg gagcccgacc ccctgcagat gccaaccttc ccgccagact 121 acggcttccc cgaacgcaag gagcgcgaga tggtggccac acagcaggag atgatggacg 181 cgagtgaggc tcagctgcgg gactactgcg cccaccacct catccggctg ctcaagtgca 241 agcgtgacag cttcccaagt tgctggcctg caagcaggaa gcggcacgac tcgggactac 301 tgcgcaccgc aagctatgtg atgcgcatga aggagtttga gcgggacgag ggctgctcca 361 gcggaagaag cggcgggaga agaaggcggc aaatctgcaa aggccaggga cccggggaag 421 tggaccccaa ggtggccctg taggggtgca ccccccaccc tatggaccag tcaaataaaa 481 ccttcaggcc cctc // LOCUS REOCEAP1 1463 bp ds-RNA VRL 11-JUL-1990 DEFINITION Reovirus sp. (serotype ST1) sigma-1 protein gene, complete cds. ACCESSION M32860 KEYWORDS cell attachment protein; sigma-1 protein. SOURCE Reovirus sp. (serotype ST1), cDNA to viral RNA. ORGANISM Reovirus sp. Viridae; ds-RNA nonenveloped viruses; Reoviridae. REFERENCE 1 (bases 1 to 1463) AUTHORS Duncan,R., Horne,D., Cashdollar,L.W., Joklik,W.K. and Lee,P.W.K. TITLE Identification of conserved domains in the cell attachment proteins of the three serotypes of Reovirus JOURNAL Virology 174, 399-409 (1990) STANDARD simple staff_review FEATURES from to/span description pept 14 1426 sigma-1 protein BASE COUNT 426 a 291 c 369 g 377 t ORIGIN 1 gctattcgcg cctatggatg catctctcat tacagagata cggaaaatag tactccaact 61 atctgtatca agcaatggct cccagtcaaa agaaatcgag gaaatcaaga aacaagtcca 121 ggtcaacgtt gatgatatca gggctgccaa tattaaactc gacggacttg gaagacagat 181 tgctgacatc agcaatagca tctcaaccat tgagtcaaga ttgggtgaga tggataatcg 241 acttgtgggt atctcgagtc aggtcacgca attatctaac tcagttagcc agaacactca 301 gagcatatcc tcattgggtg acagaatcaa tgctgtcgaa ccacgagttg acagtctgga 361 tacggtcacg tctaatctca ctggacgaac atccactttg gaggcagatg ttggaagctt 421 acggacagaa ctagcagcgc taacaacacg ggtgacaact gaggttacaa ggttagatgg 481 tctaatcaat agtggccaga attcgattgg tgagctatcc acaagactat ccaatgtgga 541 gacgtctatg gtgacgacgg ctggacgggg actgcagaaa aacggaaaca ccttgaacgt 601 cattgtaggt aatggaatgt ggtttaatag ttctaatcaa ttgcagctcg acctttcggg 661 gcaatcaaaa ggggtgggat ttgtcggcac aggaatggtg gttaagattg atactaatta 721 ttttgcttac aatagtaatg gagagattac attggtgagt caaatcaatg aattgccatc 781 gcgcgtatca acactggaat cagcgaaaat cgattcagtt ttacctccat taaccgtacg 841 cgaagcgagc ggcgtacgta ccctgagctt tggttatgat acgagcgatt ttacaatcat 901 caactccgta ctgtcgttac ggtcacgttt gactcttccg acatacaggt accctctgga 961 gctcgacaca gcaaataata gagtgcaggt ggcagatcgt tttggcatgc gcacgggtac 1021 ttggacggga caattgcaat atcagcaccc acaattgagt tggagagcaa atgtcacttt 1081 gaatttgatg aaggtggatg attggttggt gttgagcttt tctcagatga cgactaactc 1141 aataatggca gatgggaaat ttgtgattaa ttttgtgtct gggttatctt ctggatggca 1201 gacgggggat actgaaccat cgtcaactat tgatccattg tctacgacat ttgccgcggt 1261 ccaatttcta aataacggtc aacgcattga tgcgtttagg atcatgggag tatcggaatg 1321 gacggatgga gaattagaga ttaagaatta tggtggcaca tacaccggtc atactcaagt 1381 atattgggct ccgtggacga tcatgtatcc atgcaatgtg aggtgaatct agcgcgaacc 1441 ctcggcacaa ggggtcaatc atc // LOCUS REOCEAP2 1440 bp ss-RNA VRL 11-JUL-1990 DEFINITION Reovirus sp. (serotype ST2) sigma-1 protein gene, complete cds. ACCESSION M32861 KEYWORDS cell attachment protein; sigma-1 protein. SOURCE Reovirus sp. (serotpe ST2), cDNA to viral RNA. ORGANISM Reovirus sp. Viridae; ds-RNA nonenveloped viruses; Reoviridae. REFERENCE 1 (bases 1 to 1440) AUTHORS Duncan,R., Horne,D., Cashdollar,L.W., Joklik,W.K. and Lee,P.W.K. TITLE Identification of conserved domains in the cell attachment proteins of the three serotypes of reovirus JOURNAL Virology 174, 399-409 (1990) STANDARD simple staff_review FEATURES from to/span description pept 14 1402 sigma-1 protein BASE COUNT 384 a 316 c 381 g 359 t ORIGIN 1 gctattcgca ctcatgtcgg atctagtgca gctcataaga agggagatct tactgttaac 61 tgggaatgga gaatcagcca actcgaaaca cgagatcgag gaaattaaga aacaaattaa 121 agacatctct gctgatgtca acaggatcag taacatcgtt gattcaatcc aaggacaact 181 gggtggatta tctgtacgcg tgtcagccat tgaatcggga gttagtgaga acggcaatcg 241 aattgataga ctcgagcgag atgtctccgg catatcggct agcgttagcg gaatcgattc 301 gcgtttatcc gagctgggtg accgagtcaa tgttgcagaa cagcgaattg gccagttgga 361 tacagtcacg gataatctcc ttgagcgagc atcaagactg gaaactgaag tatcagccat 421 tactaatgac cttggatcat tgaatacgag gctgacgact gaattgaacg atgtccgcca 481 aactattgct gcgatagaca cgcgtctcac gacactggag accgatgccg tgacgtcggt 541 tggtcaaggg cttcagaaga ctgggaactc gattaaggtt attgtgggta cggggatgtg 601 gttcgaccgc aataatgttc tgcagttatt cttatcgaac cagcagaaag ggttgggatt 661 catagacaat ggaatggtag tgaaaataga tacccagtat ttcagcttcg atagcaatgg 721 caacataact ctgaacaaca acataagtgg tctgccggcg cgaacaggtt ccctcgaggc 781 atctcgtatc gatgtggtag cgccaccgct tgtgatacag tctactggta gcactcggct 841 actgcgtctc atgtacgagg ctgtggactt cgtggttact aacaacgttc tcacactgag 901 aaatcgatcg gtcacgccaa cattcaagtt tcctctggag ttgaatagtg ctgataactc 961 agtgagcatt catagaaatt accgcattag acttgggcaa tggtcaggtc aattggaata 1021 tcacacgccg agtttgcgtt ggaatgctcc cgtcacggtt aatttgatgc gagtagacga 1081 ttggctcatt ttgagtttta ctcggttttc gacgagcggc atcttagcgt caggaaagtt 1141 tgtattgaac ttcgtaactg gtttgtctcc agggtgggcg actgggagta ccgagccctc 1201 gacaactact aacccactgt caacgacgtt tgctgcaatt cagttcatca atgggtcatc 1261 tcgcgtagac gcctttagaa tcttgggagt cgcagagtgg aatgccgggg aactagagat 1321 cacgaatcat ggcggaacat atacagcgca taccaatgtc gactgggcgc cgatgaccat 1381 tatgtaccca tgtctgggct gaggatccgg gtgctccact cggcacagtg gcgactcatc // LOCUS REOCEAP3 1416 bp ss-RNA BAD 11-JUL-1990 DEFINITION Reovirus sp. (serotype ST3) sigma-1 protein gene, complete cds. ACCESSION M32862 KEYWORDS cell attachment protein; sigma-1 protein. SOURCE Reovirus sp. (serotype ST3) viral DNA. ORGANISM Reovirus sp. Viridae; ds-RNA nonenveloped viruses; Reoviridae. REFERENCE 1 (bases 1 to 1416) AUTHORS Duncan,R., Horne,D., Cashdollar,L.W., Joklik,W.K. and Lee,P.W.K. TITLE Identification of conserved domains in the cell attachment proteins of the three serotypes of reovirus JOURNAL Virology 174, 399-409 (1990) STANDARD simple staff_review COMMENT Secondary reference. Please see: Proc. Natl. Acad. Sci. U.S.A. 82, 24-28 (1985), accession m10262. FEATURES from to/span description pept 13 1380 sigma-1 protein BASE COUNT 376 a 301 c 365 g 374 t ORIGIN 1 gctattggtc ggatggatcc tcgcctacgt gaagaagtag tacggctgat aatcgcatta 61 acgagtgata atggagcatc actgtcaaaa gggcttgaat caagggtctc ggcgctcgag 121 aagacgtctc aaatacactc tgatactatc ctccggatca cccagggact cgatgatgca 181 aacaaacgaa tcatcgctct tgagcaaagt cgggatgact tggttgcatc agtcagtgat 241 gctcaacttg caatctccag attggaaagc tctatcggag ccctccaaac agttgtcaat 301 ggacttgatt cgagtgttac ccagttgggt gctcgagtgg gacaacttga gacaggactt 361 gcagacgtac gcgttgatca cgacaatctc gttgcgagag tggatactgc agaacgtaac 421 attggatcat tgaccactga gctatcaact ctgacgttac gagtaacatc catacaagcg 481 gatttcgaat ctaggatatc cacgttagag cgcacggcgg tcactagcgc gggagctccc 541 ctctcaatcc gtaataaccg tatgaccatg ggattaaatg atggactcac gttgtcaggg 601 aataatctcg ccatccgatt gccaggaaat acgggtctga atattcaaaa tggtggactt 661 cagtttcgat ttaatactga tcaattccag atagttaata ataacttgac tctcaagacg 721 actgtgtttg attctatcaa ctcaaggata ggcgcaactg agcaaagtta cgtggcgtcg 781 gcagtgactc ccttgagatt aaacagtagc acgaaggtgc tggatatgct aatagacagt 841 tcaacacttg aaattaattc tagtggacag ctaactgtta gatcgacatc cccgaatttg 901 aggtatccga tagctgatgt tagcggcggt atcggaatga gtccaaatta taggtttagg 961 cagagcatgt ggataggaat tgtctcctat tctggtagtg ggctgaattg gagggtacag 1021 gtgaactccg acatttttat tgtagatgat tacatacata tatgtcttcc agcttttgac 1081 ggtttctcta tagctgacgg tggagatcta tcgttgaact ttgttaccgg attgttacca 1141 ccgttactta caggagacac tgagcccgct tttcataatg acgtggtcac atatggagca 1201 cagactgtag ctatagggtt gtcgtcgggt ggtgcgcctc agtatatgag taagaatctg 1261 tgggtggagc agtggcagga tggagtactt cggttacgtg ttgagggggg tggctcaatt 1321 acgcactcaa acagtaagtg gcctgccatg accgtttcgt acccgcgtag tttcacgtga 1381 ggatcagacc accccgcggc actggggcat ttcatc // LOCUS RATGLYSN 2386 bp ss-mRNA ROD 11-JUL-1990 DEFINITION Rat glycogen synthase mRNA, complete cds. ACCESSION J05446 KEYWORDS UDP glucose:glycogen 4-alpha-D-glucosyltransferase; glycogen synthase. SOURCE Rat adult liver, cDNA to mRNA. ORGANISM Rattus norvegicus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 2386) AUTHORS Bai,G., Zhang,Z., Werner,R., Nuttall,F.Q., Tan,A.W.H. and Lee,E.Y.C. TITLE The primary structure of rat liver glycogen synthase deduced by cDNA cloning: Absence of phosphorylation sites 1a and 1b JOURNAL J. Biol. Chem. 265, 7843-7848 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by E.Y.C.Lee, 16-MAR-1990. FEATURES from to/span description pept 46 2157 glycogen synthase (EC 2.4.1.11) mRNA < 1 2386 glycogen synthase mRNA signal 2365 2370 poly-A signal BASE COUNT 657 a 581 c 579 g 569 t ORIGIN 1 ctgcaaccgg tccccttcgg caccagacac acagctggac gaagaatgct caggggccgc 61 tccttgtctg tgacgtccct tggtgggctc cctgcatggg aagctgaaag actccccgtg 121 gaagacttat tgctttttga agtttcctgg gaagtgacca acaaagttgg gggcatctgt 181 actgtgatcc agagcaaagc caaaaccaca gccaatgaat ggggagagaa ttacttcctg 241 ataggtccgt attttgagca taatgtgaag actcaagtag agccatgcag gcccgccaac 301 gatgccgtca gaaaagctgt ggatgccatg aacaaacatg gctgccaggt gcattttgga 361 agatggctga tagaagggag tccgtatgtg gtgctttttg acatcagctc ctcagtgtgg 421 aacctggaca ggtggaaggg agacttctgg gaagcatgtg gcgttggcat ccctcacgac 481 gaccgagaag ccaatgacat gctcatattt gggtctttaa ctgcctggtt cttaaaggag 541 gtgacggacc atgcagacgg taaacacgtc attgcccaat tccatgaatg gcaggctgga 601 actgggctga tcctttctcg tgccaggaaa ctccccatcg ccacaatatt tacaacccat 661 gccacactgc tggggcggta tctctgtgca gcaaatattg acttctacaa ccagcttgat 721 aagttcaaca tagacaaaga ggccggggag aggcagattt atcaccgcta ctgcatggag 781 cgggcttccg tgcactgtgc gcacgtgttt accacagtgt cagaaatcac agccatcgag 841 gcggacgaca tgctgaagag gaagcctgat gtggtgactc caaacggctt gaacgttaag 901 aagttttctg cggtgcacga atttcaaaat ctccatgcca catacaaggc caggatacag 961 gattttgttc gaggtcattt ctatggccac ctggacttcg atcttgaaaa gacgttattt 1021 cttttcattg ctgggaggta tgagttctcc aacaagggag cagacatctt cctagaatcc 1081 ttatccaggc tcaatttcct cctaaggatg cataagagta acgtcactgt ggtagtgttt 1141 ttcatcatgc ctgccaagac aaacaatttc aacgtggaaa ccctgaaggg ccaggcggtg 1201 cggaaacagc tgtgggacac tgtgcactgt atgaaggaaa agtttggcaa gaaactctac 1261 gatgggttat taagaggaga aatacccgac atgaatagta ttttggatcg agatgactta 1321 acaattatga aaagagccat tttttcaact cagagacact ctttgcctcc tgtgaccact 1381 cacaatatga tcgacgattc cacggatccc atcctcagca ccattcgacg aattggactt 1441 ttcaacaatc gcacagacag agtcaaggtg attttacacc cagaattcct gtcctccacc 1501 agccccctac taccaatgga ttatgaagag tttgtccgag gctgtcacct tggggtattt 1561 ccatcatact atgagccctg gggttacacg ccagccgaat gcacagtgat gggcatcccc 1621 agtgtgacta cgaacctctc tggtttcggg tgtttcatgc aggagcatgt ggctgaccct 1681 accgcgtacg gtatttatat cgtcgacagc gtccgctctc cagatgattc ttgcaaccag 1741 ctgactcagt ttctctatgg gttctgtaaa cagtcccgcc gccaaagaat catccagagg 1801 aaccgcaccg agaggctctc agatcttctg gactggagat acctgggcag atattaccag 1861 catgccagac atctgacact gagcagggct tttccagaca aattctacct ggagcccaca 1921 tccccaccaa cgacggatgg ctttaagtat cccaggccct cctcagtacc accttcccca 1981 tcaggatccc agacttcaag tcctcagagc agcgatgtgg aaaacgaagg ggatgaggat 2041 gagagatatg atgaggaaga ggaggctgag agggaccggc taaacatcaa gtcaccattt 2101 tccctgaacc acatcccaaa ggggaagaaa aagcttcatg gagaatataa gaactgagct 2161 caaatgaaat gattccaaat ccacaagaaa atgagctgag cccaagtcca tccctgatgc 2221 ataccgacag atatttacag aatgacgtcg gaaatctaga atctgtgtcc agatcactga 2281 tagtaacttg tagccaccga catgtgtcac cgtactgtga tggtactttt gttgtctaat 2341 tggaaatttc aatctgttat tgataataaa ttaccaaatc taaatg // LOCUS RABCYP2C16 2006 bp ss-mRNA MAM 11-JUL-1990 DEFINITION Rabbit cytochrome P450IIC16 (CYP2C16) mRNA, complete cds. ACCESSION M29968 KEYWORDS cytochrome P450; monooxygenase. SOURCE Rabbit (strain New Zealand White) adult liver, cDNA to mRNA. ORGANISM Oryctolagus sp. Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Lagomorpha; Leporidae. REFERENCE 1 (bases 1 to 2006) AUTHORS Hassett,C. and Omiecinski,C.J. TITLE Sequence and gene expression of rabbit cytochrome P450 IIC16: Comparison ti highly related family members JOURNAL Nucleic Acids Res. 18, 1429-1434 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by C.Hassett, 20-NOV-1989. Author Address [1]: C.Hasset University of Washington Department of Environmental Health SC-34 Seattle, WA 98195 FEATURES from to/span description pept 66 1529 cytochrome P450IIC16 (CYP2C16) mRNA < 1 2006 CYP2C16 mRNA signal 1985 1989 poly-A signal BASE COUNT 524 a 474 c 472 g 536 t ORIGIN 1 cggcatcggt accaaataag atagacagtg ctactcagaa atccaagaaa atggtggaag 61 aaataatgga tccagttgtg gtcctggtgt tgggtctctg ctgtttgctt ctcctttcac 121 actggaagca aaattccggg agggggaagc tccctcccgg ccccactcct ttccccatta 181 ttggaaatat tctccagata gatgctaagg acatcagcaa atccctaact aagttctcag 241 aacgctatgg ccccgtgttc actgtgtatc tgggcatgaa gcccgctgta gtgctgcatg 301 gataccaggc agtgaaggag gccctggttg atcttggaga ggagtttgct ggaagaggca 361 gttttcctat gcttgataaa gttagtaagg gactcggaat cgttttcacc aatggaaaga 421 gatggaaaga gatccggcgc ttctcgctca tgaccctgcg gaatttcggg atggggaaga 481 ggagcattga ggaccgagtt caagaggagg cccgctgcct ggtggaggag ctgagaaaaa 541 ccaacgcctc accctgtgat cccaccttta tcctgggctg tgctccctgc aatgtgatct 601 gctccattat tttccataat cgctttgatt ataaagatga ggagtttctt aaactattgg 661 aaaaattcaa tgaaaatgtt aggattctga gttctccatg gttgcaggtc tgcaataatt 721 tccctgctct tattgattac ttaccaggaa gtcataagac cttactaaag aattctgatt 781 atgtgaaaaa ttttattatg gagaaagtga aggaacacca aaaattcctg gatgttaaca 841 atcctcggga ctttatagat tgtttcttga tcaaaatgga gcaggaaaac catttggagt 901 tcactcttga aagcttggta accactgtgt ttgatttgtt tggagctggg actgagacaa 961 cgagcacaac gctgagatac tccctcctgc tcctgctgaa gcaccccgag gtcgcagata 1021 aagtgcagga ggagattgag cgtgtgattg gcaggcaccg gagcccctgc atgcaggaca 1081 ggagccgcat gccttacaca gatgccgtaa tacatgagat ccagagattc attgacctgg 1141 tccccaataa tctgccccac acagtgaccc gtgacattaa attcagaaac tactttatcc 1201 ccaagggtac ggacatcatg acatcactga catccgtgct acatgatgaa aaagcatttc 1261 ctaacccaaa ggtatttgac cctggacact ttctggatga gagtggcaac ttcaagaaga 1321 gtgactactt catgcctttc tcagcaggaa aacggatctg tgtgggagag gccctggccc 1381 gcatggagct gtttttgttc ctgacctcca ttttgcagaa ctttaaactg caatctctgg 1441 ttgagccaaa ggacctggac atcactgcag ttctcaatgg atttgtttct gtgccacctt 1501 cgttccagct ctgcttcatt cctgtttgaa aaggagcaga ctggcttcta ctgtgccatc 1561 atttcaaagg cattgcccat caccttactg catttgagac acttctttaa cttttctcac 1621 atcttactat tcccttaaga tctagtgaaa acctaacttc tgtgggtgat cccctgagac 1681 tgcctgccct gaccatgcaa gaggtagaga gggcatggca agccatgctc ctgggaggga 1741 ccccacagcc tggctgctgg caggtggcgg gacccaggca catttctctc cattcctgcc 1801 tgtcaggtaa actgctccta gctgtgtcca aagcccatca agaaagctac cgtaggctat 1861 gtgaccttca agatgattgt aggagcatat cagtaccaat attgcctcta tcctatagaa 1921 ttagtactgc cctgaattag ttacaccctt tctgcctgcc ctttagaaag tgtgcatgct 1981 cattaataaa gtggatgcat tcactg // LOCUS HUMGAPA 4307 bp ss-mRNA PRI 11-JUL-1990 DEFINITION Human GTPase-activating protein ras p21 (GAP) mRNA, complete cds. ACCESSION M23379 KEYWORDS GTPase-activating protein. SOURCE Human placenta, cDNA to mRNA, clone 101. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 4307) AUTHORS Trahey,M., Wong,G., Halenbeck,R., Rubinfeld,B., Martin,G.A., Ladner,M., Long,C.M., Crosier,W.J., Watt,K., Koths,K. and McCormick,F. TITLE Molecular cloning of two types of GAP complementary DNA from human placenta JOURNAL Science 242, 1697-1700 (1988) STANDARD full staff_review COMMENT Draft entry and computer readable sequence for [1] kindly submitted by C.M.Long, 31-MAR-1989. For sequence of clone 16 refer to M23612. FEATURES from to/span description pept 119 3262 ras p21 GTP-ase-activating protein (GAP) BASE COUNT 1280 a 856 c 957 g 1214 t ORIGIN 1 cctcagcctg gggagctgaa ggggagacgc gtctgggtgg ggctgctcgg agcccgggcc 61 tggtggcccc tggggctccc gggcgggcag ggtagggcag agtagagcgg gcttcaacat 121 gatggcggcc gaggccggca gtgaggaggg cggcccggta acagccggag ctggaggagg 181 cggcgcggca gcgggctcca gtgcctatcc cgcagtgtgt cgggtgaaga tacccgcggc 241 cctgcctgtg gcagccgccc cctatcctgg gctggtggag accggagtgg ctggaactct 301 gggtggcgga gccgctttgg ggtcagagtt cctaggagcc gggtctgtgg caggggcact 361 ggggggagct ggactgacag ggggaggtac tgctgctggc gtagctggtg ctgctgctgg 421 cgtggccggt gctgctgttg ctggacctag tggagacatg gctctcacca aactgcccac 481 ttcgttgctt gctgagactc tcgggccagg cggcggtttt ccccctctgc cccctccccc 541 ttacctgccc cctttggggg cgggcctcgg gacagtggac gaaggtgact ctctggatgg 601 accagaatac gaggaggaag aggtggccat accgttgacc gctcctccaa ctaaccagtg 661 gtatcacgga aaacttgaca gaacgatagc agaagaacgc ctcaggcagg cagggaagtc 721 tggcagttat cttataagag agagtgatcg gaggccaggg tcctttgtac tttcatttct 781 tagccagatg aatgttgtca accattttag gattattgct atgtgtggag attactacat 841 tggtggaaga cgtttttctt cactgtcaga cctaataggt tattacagtc atgtttcttg 901 tttgcttaaa ggagaaaaat tactttaccc agttgcacca ccagagccag tagaagatag 961 aaggcgtgta cgagctattc taccttacac aaaagtacca gacactgatg aaataagttt 1021 cttaaaagga gatatgttca ttgttcataa tgaattagaa gatggatgga tgtgggttac 1081 aaatttaaga acagatgaac aaggccttat tgttgaagac ctagtagaag aggtgggccg 1141 ggaagaagat ccacatgaag gaaaaatatg gttccatggg aagatttcca aacaggaagc 1201 ttataattta ctaatgacag ttggtcaagt ctgcagtttt cttgtgaggc cctcagataa 1261 tactcctggc gattattcac tttatttccg gaccaatgaa aatattcagc gatttaaaat 1321 atgtccaacg ccaaacaatc agtttatgat gggaggccgg tattataaca gcattgggga 1381 catcatagat cactatcgaa aagaacagat tgttgaagga tattatctta aggaacctgt 1441 accaatgcag gatcaagaac aagtactcaa tgacacagtg gatggcaagg aaatctataa 1501 taccatccgt cgtaaaacaa aggatgcctt ttataaaaac attgttaaga aaggttatct 1561 tctgaaaaag ggcaaaggaa aacgttggaa aaatttatat tttatcttag agggtagtga 1621 tgcccaactt atttattttg aaagcgaaaa acgagctacc aaaccaaaag gattaataga 1681 tctcagtgta tgttctgtct atgtcgttca tgatagtctc tttggcaggc caaactgttt 1741 tcagatagta gttcagcact ttagtgaaga acattacatc ttttactttg caggagaaac 1801 tccagaacaa gcagaggatt ggatgaaagg tctgcaggca ttttgcaatt tacggaaaag 1861 tagtccaggg acatccaata aacgccttcg tcaggtcagc agccttgttt tacatattga 1921 agaagcccat aaactcccag taaaacattt tactaatcca tattgtaaca tctacctgaa 1981 tagtgtccaa gtagcaaaaa ctcatgcaag ggaagggcaa aacccagtat ggtcagaaga 2041 gtttgtcttt gatgatcttc ctcctgacat caatagattt gaaataactc ttagtaataa 2101 aacaaagaaa agcaaagatc ctgatatctt atttatgcgc tgccagttga gccgattaca 2161 gaaagggcat gccacagatg aatggtttct gctcagctcc catataccat taaaaggtat 2221 tgaaccaggg tccctgcgtg ttcgagcacg atactctatg gaaaaaatca tgccagaaga 2281 agagtacagt gaatttaaag agcttatact gcaaaaggaa cttcatgtag tctatgcttt 2341 atcacatgta tgtggacaag accgaacact actggccagc atcctactga ggatttttct 2401 tcacgaaaag cttgaatcgt tgttgttatg cacactaaat gacagagaaa taagcatgga 2461 agatgaagcc actaccctat ttcgagccac aacacttgca agcaccttga tggagcagta 2521 tatgaaagcc actgctacac agtttgttca tcatgctttg aaagactcta ttttaaagat 2581 aatggaaagc aagcagtctt gtgagttaag tccatcaaag ttagaaaaaa atgaagatgt 2641 gaacactaat ttaacacacc tattgaacat actttcagag cttgtggaga aaatattcat 2701 ggcttcagaa atacttccac cgacattgag atatatttat gggtgtttac agaaatctgt 2761 tcagcataag tggcctacaa ataccaccat gagaacaaga gttgttagtg gttttgtttt 2821 tcttcgactc atctgtcctg ccatcctgaa tccacggatg ttcaatatca tctcagattc 2881 tccatctcct attgctgcaa gaacactgat attagtggct aaatctgtgc agaacttagc 2941 aaatcttgtg gaatttggag ctaaggagcc ctacatggaa ggtgtcaatc cattcatcaa 3001 aagcaacaaa catcgtatga tcatgttttt agatgaactt gggaatgtac ctgaacttcc 3061 ggacactaca gagcattcta gaacggacct gtcccgtgat ttagcagcat tgcatgagat 3121 ttgcgtggct cattcagatg aacttcgaac gctcagtaat gagcgtggtg cacagcagca 3181 cgtattgaaa aagcttctgg ctataacaga actgcttcaa caaaaacaaa accagtatac 3241 aaaaaccaat gatgtcaggt agcagccttc gccccagtgt tctgcatgga ttcagcatgt 3301 ccaacatggt aattcacttc agtttaatgt ctcctttgct cttgccaaaa aatagcacac 3361 ttttccacat tccagtgatg tgtgagctat gcaaacaaaa tccaagattc tgctggtgaa 3421 taactatgcc agcaaccttg taagctatct gtgcaggata tttgcactat ttccacatgg 3481 aatcaatctt taacaacctc tgagccttgg tgtacagacc acctttcaca aaacgaaatg 3541 ctatgactgt atcttgatat ctcgaacttt caaaatatat tttcagtaca cccagttgcc 3601 aaagttttgc tgtctcttag agaaagaact atgaaatcaa ctgacaagaa acacattctt 3661 attgacaatt gtgtataact ggattgcaga ctgttcttac tgtaactact tcctgattag 3721 gaatatgacc atttgactgt tcaatgatta tttgtattta cagtttccag agtttgtcat 3781 tataatagga acaatctttg ctgtatactt ttaaaaaata ctctgctatt tctcttgctg 3841 gaactgttga aagaaaatat atagaatgat ctattgctca tcagctttat tttttaaaca 3901 tacgacttat tttgttgaaa ttgtcaaaga ctgtatttag atctcataat gctttgttaa 3961 atgtttacaa gtaaatagtt tgaattcagt aaatattatt ggttgttgta ttgatcaatg 4021 catgttaccc attcaaccat tttatagact accaatttct tttatgttaa ctagaatgct 4081 tttgttaaaa gttatttgtt cattatttgt gctacccctt tgattatgca gacaacctca 4141 tcagctgcct aacttatcca tctttgaact tctgactact tgttgtatct gctggatatt 4201 tagttcaact gtatagtttt atttacttct gtatgtgtat ttttgtgaag tattcacaaa 4261 ggttaagtta aaataaaacc aagggatatc ttgcaaaaaa aaaaaaa // LOCUS HUMGAPB 3456 bp ss-mRNA PRI 11-JUL-1990 DEFINITION Human GTPase-activating protein ras p21 (GAP) mRNA, complete cds. ACCESSION M23612 KEYWORDS GTP-ase-activating protein. SOURCE Human placenta, cDNA to mRNA, clone 16. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 3456) AUTHORS Trahey,M., Wong,G., Halenbeck,R., Rubinfeld,B., Martin,G.A., Ladner,M., Long,C.M., Crosier,W.J., Watt,K., Koths,K. and McCormick,F. TITLE Molecular cloning of two types of GAP complementary DNA from human placenta JOURNAL Science 242, 1697-1700 (1988) STANDARD full staff_review COMMENT Draft entry and computer readable sequence for [1] kindly submitted by C.M.Long, 31-MAR-1989. For sequence of clone 101 refer to M23379. FEATURES from to/span description pept 100 2712 ras p21 GTP-ase-activating protein (GAP) site 49 51 5' in frame termination codon BASE COUNT 1134 a 640 c 687 g 995 t ORIGIN 1 ggaagaggtg gccataccgt tgaccgctcc tccaactaac cagtaagtta agactgctgt 61 tcaggaattt gggaagctgg ctccagaaaa gaagtggaaa tgaaggggtg gtatcacgga 121 aaacttgaca gaacgatagc agaagaacgc ctcaggcagg cagggaagtc tggcagttat 181 cttataagag agagtgatcg gaggccaggg tcctttgtac tttcatttct tagccagatg 241 aatgttgtca accattttag gattattgct atgtgtggag attactacat tggtggaaga 301 cgtttttctt cactgtcaga cctaataggt tattacagtc atgtttcttg tttgcttaaa 361 ggagaaaaat tactttaccc agttgcacca ccagagccag tagaagatag aaggcgtgta 421 cgagctattc taccttacac aaaagtacca gacactgatg aaataagttt cttaaaagga 481 gatatgttca ttgttcataa tgaattagaa gatggatgga tgtgggttac aaatttaaga 541 acagatgaac aaggccttat tgttgaagac ctagtagaag aggtgggccg ggaagaagat 601 ccacatgaag gaaaaatatg gttccatggg aagatttcca aacaggaagc ttataattta 661 ctaatgacag ttggtcaagt ctgcagtttt cttgtgaggc cctcagataa tactcctggc 721 gattattcac tttatttccg gaccaatgaa aatattcagc gatttaaaat atgtccaacg 781 ccaaacaatc agtttatgat gggaggccgg tattataaca gcattgggga catcatagat 841 cactatcgaa aagaacagat tgttgaagga tattatctta aggaacctgt accaatgcag 901 gatcaagaac aagtactcaa tgacacagtg gatggcaagg aaatctataa taccatccgt 961 cgtaaaacaa aggatgcctt ttataaaaac attgttaaga aaggttatct tctgaaaaag 1021 ggcaaaggaa aacgttggaa aaatttatat tttatcttag agggtagtga tgcccaactt 1081 atttattttg aaagcgaaaa acgagctacc aaaccaaaag gattaataga tctcagtgta 1141 tgttctgtct atgtcgttca tgatagtctc tttggcaggc caaactgttt tcagatagta 1201 gttcagcact ttagtgaaga acattacatc ttttactttg caggagaaac tccagaacaa 1261 gcagaggatt ggatgaaagg tctgcaggca ttttgcaatt tacggaaaag tagtccaggg 1321 acatccaata aacgccttcg tcaggtcagc agccttgttt tacatattga agaagcccat 1381 aaactcccag taaaacattt tactaatcca tattgtaaca tctacctgaa tagtgtccaa 1441 gtagcaaaaa ctcatgcaag ggaagggcaa aacccagtat ggtcagaaga gtttgtcttt 1501 gatgatcttc ctcctgacat caatagattt gaaataactc ttagtaataa aacaaagaaa 1561 agcaaagatc ctgatatctt atttatgcgc tgccagttga gccgattaca gaaagggcat 1621 gccacagatg aatggtttct gctcagctcc catataccat taaaaggtat tgaaccaggg 1681 tccctgcgtg ttcgagcacg atactctatg gaaaaaatca tgccagaaga agagtacagt 1741 gaatttaaag agcttatact gcaaaaggaa cttcatgtag tctatgcttt atcacatgta 1801 tgtggacaag accgaacact actggccagc atcctactga ggatttttct tcacgaaaag 1861 cttgaatcgt tgttgttatg cacactaaat gacagagaaa taagcatgga agatgaagcc 1921 actaccctat ttcgagccac aacacttgca agcaccttga tggagcagta tatgaaagcc 1981 actgctacac agtttgttca tcatgctttg aaagactcta ttttaaagat aatggaaagc 2041 aagcagtctt gtgagttaag tccatcaaag ttagaaaaaa atgaagatgt gaacactaat 2101 ttaacacacc tattgaacat actttcagag cttgtggaga aaatattcat ggcttcagaa 2161 atacttccac cgacattgag atatatttat gggtgtttac agaaatctgt tcagcataag 2221 tggcctacaa ataccaccat gagaacaaga gttgttagtg gttttgtttt tcttcgactc 2281 atctgtcctg ccatcctgaa tccacggatg ttcaatatca tctcagattc tccatctcct 2341 attgctgcaa gaacactgat attagtggct aaatctgtgc agaacttagc aaatcttgtg 2401 gaatttggag ctaaggagcc ctacatggaa ggtgtcaatc cattcatcaa aagcaacaaa 2461 catcgtatga tcatgttttt agatgaactt gggaatgtac ctgaacttcc ggacactaca 2521 gagcattcta gaacggacct gtcccgtgat ttagcagcat tgcatgagat ttgcgtggct 2581 cattcagatg aacttcgaac gctcagtaat gagcgtggtg cacagcagca cgtattgaaa 2641 aagcttctgg ctataacaga actgcttcaa caaaaacaaa accagtatac aaaaaccaat 2701 gatgtcaggt agcagccttc gccccagtgt tctgcatgga ttcagcatgt ccaacatggt 2761 aattcacttc agtttaatgt ctcctttgct cttgccaaaa aatagcacac ttttccacat 2821 tccagtgatg tgtgagctat gcaaacaaaa tccaagattc tgctggtgaa taactatgcc 2881 agcaaccttg taagctatct gtgcaggata tttgcactat ttccacatgg aatcaatctt 2941 taacaacctc tgagccttgg tgtacagacc acctttcaca aaacgaaatg ctatgactgt 3001 atcttgatat ctcgaacttt caaaatatat tttcagtaca cccagttgcc aaagttttgc 3061 tgtctcttag agaaagaact atgaaatcaa ctgacaagaa acacattctt attgacaatt 3121 gtgtataact ggattgcaga ctgttcttac tgtaactact tcctgattag gaatatgacc 3181 atttgactgt tcaatgatta tttgtattta cagtttccag agtttgtcat tataatagga 3241 acaatctttg ctgtatactt ttaaaaaata ctctgctatt tctcttgctg gaactgttga 3301 aagaaaatat atagaatgat ctattgctca tcagctttat tttttaaaca tacgacttat 3361 tttgttgaaa ttgtcaaaga ctgtatttag atctcataat gctttgttaa atgtttacaa 3421 gtaaatagtt tgaattcagt aaatattaaa aaaaaa // LOCUS YSCSDH 1665 bp ds-DNA PLN 11-JUL-1990 DEFINITION S.cerevisiae succinate dehydrogenase iron-protein subunit (SDH) gene, complete cds. ACCESSION J05487 KEYWORDS succinate dehydrogenase iron-protein subunit. SOURCE S.cerevisiae DNA. ORGANISM Saccharomyces cerevisiae Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes; Endomycetales; Saccharomycetaceae. REFERENCE 1 (bases 1 to 1665) AUTHORS Lombardo,A., Carine,K. and Scheffler,I.E. TITLE Cloning and characterization of the iron-sulfur subunit gene of succinate dehydrogenase from Saccharomyces cerevisiae JOURNAL J. Biol. Chem. 265, 10419-10423 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by I.E.Scheffler, 13-APR-1990. FEATURES from to/span description pept 738 1538 succinate dehydrogenase iron-protein subunit (SDH) (EC 1.3.99.1) signal 585 589 CAAT box signal 616 622 TATA box BASE COUNT 532 a 349 c 345 g 439 t ORIGIN Chromosome VII. 1 atcttacaag taacttaagt caaggcgtga aaagtaccac cactgtgctt gacatgcaaa 61 agggttgcag agtgcgtcta ccaaggtacg tggaccatga tcaaatcatc aagccttatg 121 atctacgtga ggcccaagga caatactggc tcaagaccgt gaatggagga gtattatgaa 181 tgatgaaatc ctgtcgcacg tatattgcca ggcaaagaac tagcagtaat tgtgtcatgt 241 cagcacattg ctgaggtgca aatggccacc caagagctta ttggagcaca ggatatcttc 301 atcagggaat tacattggaa agatccggtc ttcaaattaa ctcaagtctc aatccgaata 361 cttcattccc atcagcgatc ctgaagaacg tcggtccttg tacaggaaca tcgccattgc 421 tgttagagaa tacaataagt actgtgaagc tatcctatga tcacatatga aagtatatac 481 ccgcttttgt acactatgta gctataattc aatcgtatta ttgtacgtcc gcacgaccat 541 gccttagaaa tatccgcagc gcgcaaaagg cggcctcgca ttggcccaat tagctccggt 601 gtaaaaaggg caaactatat aagggattaa tgactttcta tgagaatgcc aaaaaatgtt 661 aggctaaagg aagggattga aaggaatata gttgagctat actttcttga aatactggag 721 tatacatatt tatagggatg ttgaacgtgc tattgagaag gaaggccttt tgtttggtga 781 cgaagaaggg tatggctact gccacaacag ctgcagctac gcataccccc agattgaaaa 841 cttttaaagt ttacagatgg aatccagacg agccaagtgc taaacctcat ttacagtcat 901 atcaagtgga tctgaatgac tgtgggccca tggtacttga tgcgctgtta aagatcaaag 961 acgaacagga ttctacccta acttttagaa gatcatgtag agaaggtatc tgcggttcat 1021 gtgccatgaa cattggcggt agaaacacgc tagcttgtat atgtaagatc gaccagaacg 1081 aatccaaaca actcaagatc tatccattac cccacatgtt tattgtcaaa gatttggtac 1141 ctgatttaac taacttctac caacaataca aatctatcca accttactta cagagatcat 1201 cgtttccaaa ggatggaacg gaagtgctac aaagtattga agatcgtaag aaactggatg 1261 gtctttacga atgtattctg tgtgcatgct gctctacttc atgtccatcg tactggtgga 1321 accaagaaca gtatttgggc cctgccgtgc taatgcaagc ctaccgttgg ctaattgact 1381 ctagagacca agctacaaag acaagaaagg ccatgctaaa caactccatg tcattgtaca 1441 gatgtcacac catcatgaac tgtactagaa cttgtccaaa gggcttgaat cctggtttgg 1501 ctattgctga aattaagaaa tctttggcat ttgcctagac tatcagaaaa acagctagcc 1561 ccgaagaact cagaagcctc tcaaatgatt ttggcactaa taaaagcacc aactattatt 1621 attattattt tcaaggacga aactcaccat tctcacacat tcctt // LOCUS BOVPDEAP 585 bp ss-mRNA MAM 11-JUL-1990 DEFINITION Bovine cone photoreceptor cyclic nucleotide phosphodiesterase alpha'-subunit (PDE), partial cds. ACCESSION M33140 M29465 KEYWORDS cone photoreceptor cyclic nucleotide phosphodiesterase. SOURCE Bovine dark-adapted frozen retina, cDNA to mRNA, clone BC-alpha-1. ORGANISM Bos taurus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Artiodactyla; Ruminantia; Pecora; Bovidae. REFERENCE 1 (bases 1 to 585) AUTHORS Charbonneau,H., Prusti,R.K., LeTrong,H., Sonnenburg,W.K., Mullaney,P.J., Walsh,K.A. and Beavo,J.A. TITLE Identification of a noncatalytic cGMP-binding domain conserved in both the cGMP-stimulated and photoreceptor cyclic nucleotide phosphodiesterases JOURNAL Proc. Natl. Acad. Sci. U.S.A. 87, 288-292 (1990) STANDARD simple staff_review FEATURES from to/span description pept < 1 > 585 cone photoreceptor cyclic nucleotide phosphodiesterase alpha'-subunit (AA at 1) BASE COUNT 198 a 115 c 126 g 146 t ORIGIN 1 agagaagtca tcttttataa aatcatcgat tacattttac atggaaaaga agagatcaaa 61 gtcattccga cacctcccat ggaccactgg actctcatta gtgggttgcc aacatatgtt 121 gctgaaaatg gatttatctg caacatgctg aacgccccgg cggatgaata cttcacgttt 181 cagaaaggac ctgtagatga aactggctgg gtcattaaaa atgtcttgtc cctgcctatt 241 gtcaacaaaa aggaagacat cgtgggcgta gctacatttt acaacaggaa ggatggaaag 301 ccttttgatg aatatgatga gcacatcgct gagactctca cacagtttct tggatggtct 361 ctcttaaata ctgacaccta tgagaaaatg aataagctgg agaacagaaa ggacatagcc 421 caggaaatgc tcatgaacca caccaaggct acacctgatg agatcaagtc tattttgaaa 481 tttaaagaga agttaaatat agatgtaatt gaagactgtg aagaaaaaca gcttgtcaca 541 attttgaagg aggacctgcc agacccacgg actgcagacc tgtat // LOCUS CHKG1CLSE 240 bp ds-DNA VRT 11-JUL-1990 DEFINITION Chicken delta-1-crystallin gene, intron 3 lens-specific enhancer cor segments B3 and B4. ACCESSION M33954 KEYWORDS delta-1-crystallin. SOURCE Chicken DNA. ORGANISM Gallus gallus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Aves; Neornithes; Neognathae; Galliformes; Phasianidae. REFERENCE 1 (bases 1 to 240) AUTHORS Goto,K., Okada,T.S. and Kondoh,H. TITLE Functional cooperation of lens-specific and nonspecific elements in the delta-1-crystallin enhancer JOURNAL Mol. Cell. Biol. 10, 958-964 (1990) STANDARD simple staff_review FEATURES from to/span description site 17 116 core segment B3 site 112 235 core segment B4 BASE COUNT 61 a 52 c 53 g 74 t ORIGIN 1 gtcagtgagg tgtgctcagc atgacctgcc ctcccaccct cttcagactg aacattcctg 61 aggaattgtt tcagtatgaa ttaggaatat tctttttcca atggcacttg ggatcccttt 121 gtgtctggct gcctgagtta gtagaagaca atgcacaata ttgtataggg gtgaagaaga 181 gtcagccact aagcactttt tctgaaatat tcattgttgt tgctcaccta ccatggacaa // LOCUS CHKOVAL 9206 bp ds-DNA VRT 11-JUL-1990 DEFINITION Chicken ovalbumin gene, complete cds. ACCESSION J00895 KEYWORDS ovalbumin. SOURCE Chicken oviduct DNA. ORGANISM Gallus gallus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Aves; Neornithes; Neognathae; Galliformes; Phasianidae. REFERENCE 1 (bases 1343 to 8906) AUTHORS Woo,S.L.C., Beattie,W.G., Catterall,J.F., Dugaiczyk,A., Staden,R., Brownlee,G.G. and O'Malley,B.W. TITLE Complete nucleotide sequence of the chicken chromosomal ovalbumin gene and its biological significance JOURNAL Biochemistry 20, 6437-6446 (1981) STANDARD full staff_review REFERENCE 2 (bases 1043 to 1562; 2675 to 4732; 8885 to 9206) AUTHORS Benoist,C., O'Hare,K., Breathnach,R. and Chambon,P. TITLE The ovalbumin gene-sequence of putative control regions JOURNAL Nucleic Acids Res. 8, 127-142 (1980) STANDARD full staff_review REFERENCE 3 (bases 1357 to 1389; 2941 to 3052; and ivs junctions) AUTHORS Breathnach,R., Benoist,C., O'Hare,K., Gannon,F. and Chambon,P. TITLE Ovalbumin gene: evidence for a leader sequence in mRNA and DNA sequences at the exon-intron boundaries JOURNAL Proc. Natl. Acad. Sci. U.S.A. 75, 4853-4857 (1978) STANDARD full staff_review REFERENCE 4 (bases 1282 to 1420; 2952 to 2999) AUTHORS Gannon,F., O'Hare,K., Perrin,F., LePennec,J.P., Benoist,C., Cochet,M., Breathnach,R., Royal,A., Garapin,A., Cami,B. and Chambon,P. TITLE Organisation and sequences at the 5' end of a cloned complete ovalbumin gene JOURNAL Nature 278, 428-434 (1979) STANDARD full staff_review REFERENCE 5 (bases 1343 to 8906; exons only) AUTHORS McReynolds,L., O'Malley,B.W., Nisbet,A.D., Fothergill,J.E., Givol,D., Fields,S., Robertson,M. and Brownlee,G.G. TITLE Sequence of chicken ovalbumin mRNA JOURNAL Nature 273, 723-728 (1978) STANDARD full staff_review REFERENCE 6 (bases 2675 to 5042) AUTHORS Robertson,M.a., Staden,R., Tanaka,Y., Catterall,J.F., O'Malley,B.W. and Brownlee,G.G. TITLE Sequence of three introns in the chick ovalbumin gene JOURNAL Nature 278, 370-372 (1979) STANDARD full staff_review REFERENCE 7 (bases 1 to 1042) AUTHORS Heilig,R., Muraskowsky,R. and Mandel,J.L. TITLE The ovalbumin gene family: The 5' end region of the X and Y genes JOURNAL J. Mol. Biol. 156, 1-19 (1982) STANDARD full staff_review REFERENCE 8 (bases 5576 to 5624) AUTHORS Lai,E.C., Woo,S.L.C., Dugaiczyk,A. and O'Malley,B.W. TITLE The ovalbumin gene: Alleles created by mutations in the intervening sequences of the natural gene JOURNAL Cell 16, 201-211 (1979) STANDARD simple staff_entry REFERENCE 9 (bases 443 to 611) AUTHORS Schweers,L.A., Frank,D.E., Weigel,N.L. and Sanders,M.M. TITLE The steroid-dependent regulatory element in the ovalbumin gene does not function as a typical steroid response element JOURNAL J. Biol. Chem. 265, 7590-7595 (1990) STANDARD simple staff_entry COMMENT Eight exons reported. Sequence homologies with adenovirus early 1a, adenovirus major late, chicken conalbumin, chkx and chky genes noted for 5' flanking sequence. OV1.3 is identical to OV1.8 except that intron E is shorter by 522 nucleotides [8]. FEATURES from to/span description pept 2996 3163 ovalbumin, exon 1 3415 3465 ovalbumin, exon 2 4047 4175 ovalbumin, exon 3 4576 4693 ovalbumin, exon 4 5652 5794 ovalbumin, exon 5 6126 6281 ovalbumin, exon 6 7864 8259 ovalbumin, exon 7 pre-msg 1343 8906 oval mRNA and introns IVS 1390 2978 oval intron A IVS 3164 3414 oval intron B IVS 3466 4046 oval intron C IVS 4176 4575 oval intron D IVS 4694 5651 oval intron E IVS 5795 6125 oval intron F IVS 6282 7863 oval intron G allele 1282 1282 g may be c [1] allele 1309 1309 g may be a [1] allele 1376 1376 g may be c [1],[7] allele 1385 1385 g may be a [1],[7] allele 1393 1393 a may be g [1] allele 3010 3010 t may be c [1] allele 3154 3154 a may be g [1] allele 5747 5747 g may be a [1] allele 8032 8032 can be g [5] conflict 1471 1471 c in [1]; t in [2],[7] conflict 1523 1523 a in [1]; t in [2],[7] conflict 1538 1539 at in [1]; a in [2] conflict 2681 2681 t in [1],[6]; g in [2] conflict 2696 2696 a in [1],[6]; g in [2] conflict 3552 3552 a in [1],[6]; g in [2] conflict 3683 3683 c in [1],[6]; t in [2] conflict 3934 3934 a in [1],[6]; c in [2] conflict 3960 3960 a in [1],[6]; g in [2] conflict 3994 3994 t in [1],[6]; c in [2] conflict 4207 4208 tt in [1],[6]; ttt in [2] conflict 4396 4396 t in [1],[6]; c in [2] conflict 5636 5636 c in [1]; g in [3] conflict 6100 6100 t in [1]; tc in [3] conflict 6119 6119 t in [1]; tg in [3] conflict 8492 8505 gactcacagtactg in [1]; g in [5] site 443 611 steroid-dependent regulatory element [9] BASE COUNT 2994 a 1749 c 1721 g 2742 t ORIGIN 5 bp upstream of PstI site. 1 ctgcagactg acatgcattt cataggtaga gataacattt actgggaagc acatctatca 61 tcataaaaag caggcaagat tttcagactt tcttagtggc tgaaatagaa gcaaaagacg 121 tgattaaaaa caaaatgaaa caaaaaaaat cagttgatac ctgtggtgta gacatccagc 181 aaaaaaatat tatttgcact accatcttgt cttaagtcct cagacttggc aaggagaatg 241 tagatttcta cagtatatat gttttcacaa aaggaaggag agaaacaaaa gaaaatggca 301 ctgactaaac ttcagctagt ggtataggaa agtaattctg cttaacagag attgcagtga 361 tctctatgta tgtcctgaag aattatgttg tacttttttc ccccattttt aaatcaaaca 421 gtgctttaca gaggtcagaa tggtttcttt actgtttgtc aattctatta tttcaataca 481 gaacaatagc ttctataact gaaatatatt tgctattgta tattatgatt gtccctcgaa 541 ccatgaacac tcctccagct gaatttcaca attcctctgt catctgccag gccattaagt 601 tattcatgga agatctttga ggaacactgc aagttcatat cataaacaca tttgaaattg 661 agtattgttt tgcattgtat ggagctatgt tttgctgtat cctcagaaaa aaagtttgtt 721 ataaagcatt cacacccata aaaagataga tttaaatatt ccagctatag gaaagaaagt 781 gcgtctgctc ttcactctag tctcagttgg ctccttcaca tgcatgcttc tttatttctc 841 ctattttgtc aagaaaataa taggtcacgt cttgttctca cttatgtcct gcctagcatg 901 gctcagatgc acgttgtaga tacaagaagg atcaaatgaa acagacttct ggtctgttac 961 tacaaccata gtaataagca cactaactaa taattgctaa ttatgttttc catctctaag 1021 gttcccacat ttttctgttt tcttaaagat cccattatct ggttgtaact gaagctcaat 1081 ggaacatgag caatatttcc cagtcttctc tcccatccaa cagtcctgat ggattagcag 1141 aacaggcaga aaacacattg ttacccagaa ttaaaaacta atatttgctc tccattcaat 1201 ccaaaatgga cctattgaaa ctaaaatcta acccaatccc attaaatgat ttctatggcg 1261 tcaaaggtca aacttctgaa gggaacctgt gggtgggtca caattcaggc tatatattcc 1321 ccagggctca gccagtgtct gtacatacag ctagaaagct gtattgcctt tagcagtcaa 1381 gctcgaaagg taagcaactc tctggaatta ccttctctct atattagctc ttacttgcac 1441 ctaaacttta aaaaattaac aattattgtg ctatgtgttg tatctttaag ggtgaagtac 1501 ctgcgtgata ccccctataa aaacttctca cctgtgtatg cattctgcac tattttatta 1561 tgtgtaaaag ctttgtgttt gttttcagga ggcttattct ttgtgcttaa aatatgtttt 1621 taatttcaga acatcttatc ctgtcgttca ctatctgata tgctttgcag tttgcttgat 1681 taacttctag ccctacagag tgcacagaga gcaaaatcat ggtgttcagt gaattctggg 1741 gagttatttt aatgtgaaaa ttctctagaa gtttaattcc tgcaaagtgc agctgctgat 1801 cactacacaa gataaaaatg tggggggtgc ataaacgtat attcttacaa taatagatac 1861 atgtgaactt atatacagaa aagaaaatga gaaaaatgtg tgtgtgtata ctcacacacg 1921 tggtcagtaa aaacttttga ggggtttaat acagaaaatc caatcctgag gccccagcac 1981 tcagtacgca tataaagggc tgggctctga aggacttctg actttcacag attatataaa 2041 tctcaggaaa gcaactagat tcatgctggc tccaaaagct gtgctttata taagcacact 2101 ggctatacaa tagttgtaca gttcagctct ttataataga aacagacaga acaagtataa 2161 atcttctatt ggtctatgtc atgaacaaga attcattcag tggctctgtt ttatagtaaa 2221 cattgctatt ttatcatgtc tgcatttctc ttctgtctga atgtcaccac taaaatttaa 2281 ctccacagaa agtttatact acagtacaca tgcatatctt tgagcaaagc aaaccatacc 2341 tgaaagtgca atagagcaga atatgaatta catgcgtgtc tttctcctag actacatgac 2401 cccatataaa ttacattact tatctattct gccatcacca aaacaaaggt aaaaatactt 2461 ttgaagatct actcatagca agtagtgtgc aacaaacaga tatttctcta catttatttt 2521 tagggaataa aaataagaaa taaaatagtc agcaagcctc tgctttctca tatatctgtc 2581 caaacctaaa gtttactgaa atttgctctt tgaatttcca gttttgcaag cctatcagat 2641 tgtgttttaa tcagaggtac tgaaaagtat caatgaattc tagctttcac tgaacaaaaa 2701 tatgtagagg caactggctt ctgggacagt ttgctaccca aaagacaact gaatgcaaat 2761 acataaatag atttatgaat atggttttga acatgcacat gagaggtgga tatagcaaca 2821 gacacattac cacagaatta ctttaaaact acttgttaac atttaattgc ctaaaaactg 2881 ctcgtaattt actgttgtag cctaccatag agtaccctgc atggtactat gtacagcatt 2941 ccatccttac attttcactg ttctgctgtt tgctctagac aactcagagt tcaccatggg 3001 ctccatcggt gcagcaagca tggaattttg ttttgatgta ttcaaggagc tcaaagtcca 3061 ccatgccaat gagaacatct tctactgccc cattgccatc atgtcagctc tagccatggt 3121 atacctgggt gcaaaagaca gcaccaggac acaaataaat aaggtgagcc tacagttaaa 3181 gattaaaacc tttgccctgc tcaatggagc cacagcactt aattgtatga taatgtccct 3241 tggaaactgc atagctcaga ggctgaaaat ctgaaaccag agttatctaa aagtgtggcc 3301 acctccaact cccagagtgt tacccaaatg cactagctag aaatcttgaa actggattgc 3361 ataacttctt tttgtcataa ccattatttc agctactatt attttcaatt acaggttgtt 3421 cgctttgata aacttccagg attcggagac agtattgaag ctcaggtaca gaaataattt 3481 cacctccttc tctatgtccc tttcctctgg aagcaaaata cagcagatga agcaatctct 3541 tagctgttcc aagccctctc tgatgagcag ctagtgctct gcatccagca gttgggagaa 3601 cactgttcat aagaacagag aaaaagaagg aagtaacagg ggattcagaa caaacagaag 3661 ataaaactca ggacaaaaat accgtgtgaa tgaggaaact tgtggatatt tgtacgctta 3721 agcaagacag ctagatgatt ctggataaat gggtctggtt ggaaaagaag gaaagcctgg 3781 ctgatctgct ggagctagat tattgcagca ggtaggcagg agttccctag agaaaagtat 3841 gagggaatta cagaagaaaa acagcacaaa attgtaaata ttggaaaagg accacatcag 3901 tgtagttact agcagtaaga cagacaggat gaaaaatagt tttgtaaaca gaagtatcta 3961 actactttac tctgttcata cactacgtaa aacttactaa gtaataaaac tagaataaca 4021 acatctttct ttctctttgt attcagtgtg gcacatctgt aaacgttcac tcttcactta 4081 gagacatcct caaccaaatc accaaaccaa atgatgttta ttcgttcagc cttgccagta 4141 gactttatgc tgaagagaga tacccaatcc tgccagtaag ttgctctaaa atctgatctg 4201 agtgtattcc atgccaaagc tctaccattc tgtaatgcaa aaacagtcag agttccacat 4261 gtttcactaa gaaaatttct ttttctcttg tttttacaaa tgaaagagag gacaaataac 4321 atttctctat caccgacctg aaactctaca gtcttcagag aatgaatggc ttgctaaaag 4381 aatgtcaaat cttactatac agctatttca tattacacta ctaaatacac tataaggcat 4441 agcatgtagt aatacagtgt aaaatagctt tttacactac tatattatta atatctgtta 4501 attccagtct tgcatttcac atttgcaaaa cgttttgaaa ttcgtatctg aaagctgaat 4561 actcttgctt tacaggaata cttgcagtgt gtgaaggaac tgtatagagg aggcttggaa 4621 cctatcaact ttcaaacagc tgcagatcaa gccagagagc tcatcaattc ctgggtagaa 4681 agtcagacaa atggtaaggt agaacatgct ttgtacatag tgagagttgg ttcaccctaa 4741 tactgagaac ttggatatag ctcagccagc gtgctttgcg ttcaagctta ccagagctgt 4801 tgtatgcctg ttaagcaggg catacagtca tgaggctctt gaaaaatctt aacagacaaa 4861 gggcaatgga aaatcggagt taagggatgg tagggataaa atgcatagaa agaggtacca 4921 caattttgat ttttgcccta atgcctctct gcgtggttcc tcaatttttc tacttcattc 4981 ctcatctcct cagagcattc ctttccctca tgcttgaaac acagatgaaa gactgtgaat 5041 tctaactgag atgaaaacat ccacaaccac acaacctctg gtgtggagtc acattctgtg 5101 aaggcaaaaa ctaggccacg taatctatgc gtgcaagcta cgcgtaagct atgtgtgtga 5161 caggacaatg tgaggaacat actatgtgca caaggactgc agaataaaca ggagcaaagt 5221 ttttgaagaa aacagagtaa aatcctgttt tcctcttttg ttacattctt tacatatatc 5281 tcaaatttcc tctttggtta gaagcaagta atatttatgt ttcttggtac tgtttgggtt 5341 gaagaccatt ctgggataag agaaattcca gtggttcttc ccctaatcat aaaatgtcag 5401 gtttagtttt tttgtaacac agaaatctct tcatctttta tcttttgttg tgattcttga 5461 tagagagaga aacaagactt actgacaata gcagcaagaa aatcaatctt ggaagaacaa 5521 gattgcaatt gcaaaaacaa accaatgtcc ttgcccctac atcctcttcc ccataaattc 5581 tacattctct atctaccttg tgcttgccaa catgatatac gtaaactctc ttttcctatt 5641 cattcttaaa ggaattatca gaaatgtcct tcagccaagc tccgtggatt ctcaaactgc 5701 aatggttctg gttaatgcca ttgtcttcaa aggactgtgg gagaaagcat ttaaggatga 5761 agacacacaa gcaatgcctt tcagagtgac tgaggtatat gggcatacct tagagatgta 5821 atctagaatt tatgaagaga gtagacatgt tgttatatga acactgcatt agcgtatctg 5881 ctcatttgtc tgcatctctt tcagacactg tgttaaaagc agggaatttt ccttatgtct 5941 ctctcgtcac aatattcctg acattgcaaa gctcctgaga aataacttca gattccactt 6001 ttcctaggaa ggcttctgga tgagaactaa tcatcttaac tgtaactaga catttctgca 6061 tccaagaata atctttgtta aaactatatt ctctctctct tttttttttt tttttggttc 6121 tccagcaaga aagcaaacct gtgcagatga tgtaccagat tggtttattt agagtggcat 6181 caatggcttc tgagaaaatg aagatcctgg agcttccatt tgccagtggg acaatgagca 6241 tgttggtgct gttgcctgat gaagtctcag gccttgagca ggtatggccc tagaagttgg 6301 cttcagaata ttaaaaacac atggaaattt agctgttgta aagctctttt caacacagtt 6361 atcctaaaac atttaaccag cacaaatttc atcatgattc aatatgtgat tgttgcatag 6421 aagtgtagat ttgtcccact gggtcctgca atagcccatg ctgagcatgg cttgctgaaa 6481 gaactgcttt agagggtgaa aagtttgaca cagcagacaa gatgattctc acctaagcag 6541 ctgttactgt agtggcttga actctaaagg tcttgtatct ccattcctgt gcactgagga 6601 gcttcttgga aagttcatat aaggtttact agttctaact attatctcat ttggtggcac 6661 tcaatgtgct ttgttcacgt cttcataaat taatctatct aaaaattgga tgtggttaaa 6721 gcaatttcag aaataacatg tacataatgt acaattattg atatgaacag aacacaggca 6781 tagcatattg taattaggag gactgtagtt attttgaata ggaaacacaa tgtaataaat 6841 gagaattcat tgaaatgtta gtatgctaac tcaatctaaa ttataaagat aaagaggcat 6901 ttaatcacag ctagatttcc atcacttgtg acagacaggc atatgaatga ttatgtacag 6961 ctctaggaaa aaaagtatgt aggaaaacta gtacattttg attagaaagt ctgaaaatga 7021 ggtgccttga tcaaagagaa tacgtgtgtt tgagaaaaaa aaagtttgga tagaggtggt 7081 aagagagaat atattgaaat ggtgtttcta caaactgcca tggccagatt tgtgtaagag 7141 acattcagta agtaggcaag gaaagaaata ttactaggta caaagcaaca tcagtaatac 7201 caaaagaaac caattattcc agatgccaat ctcgtaatag ggttaagaga tttccacccc 7261 tctagtggtc accagtgcaa ccagtaactt tgctaattta cattttcttt ttttaaatgg 7321 cagatatagc tttgaactga gtgatcatga actggtactg tgtaatagat gaagacatac 7381 ttgacgacta aacttctgat ttttaaaaac tcaaattctc ttgaaagatc agttcccagt 7441 ctagtaacag ctgatagttt aagtatcagt aattggctac cattaacaac tggctcctga 7501 gaggtcttaa atgtagagac agctttaaac tcaaaagcac agagtgattt ttagaataga 7561 tttcccaagc aaagaaaata aacagggagg agctttaagg gagtagccat ctcattatta 7621 ttattattta aagaaatggc agcaagccta caaaagaaaa ataagacaga gcagagaaga 7681 aagagtcatg gtatgctttt ctatcttagc aaaattaatc tctacatgcc taggaaaaag 7741 ccatgacaag agcaatcagt tcaaaaggtg tatgcaaaaa accacataat agtaactagt 7801 actgcattgc caggaaggaa gttatgtcgc cattccatgg atctcattct catttccttg 7861 cagcttgaga gtataatcaa ctttgaaaaa ctgactgaat ggaccagttc taatgttatg 7921 gaagagagga agatcaaagt gtacttacct cgcatgaaga tggaggaaaa atacaacctc 7981 acatctgtct taatggctat gggcattact gacgtgttta gctcttcagc caatctgtct 8041 ggcatctcct cagcagagag cctgaagata tctcaagctg tccatgcagc acatgcagaa 8101 atcaatgaag caggcagaga ggtggtaggg tcagcagagg ctggagtgga tgctgcaagc 8161 gtctctgaag aatttagggc tgaccatcca ttcctcttct gtatcaagca catcgcaacc 8221 aacgccgttc tcttctttgg cagatgtgtt tccccttaaa aagaagaaag ctgaaaaact 8281 ctgtcccttc caacaagacc cagagcactg tagtatcagg ggtaaaatga aaagtatgtt 8341 atctgctgca tccagacttc ataaaagctg gagcttaatc tagaaaaaaa atcagaaaga 8401 aattacactg tgagaacagg tgcaattcac ttttccttta cacagagtaa tactggtaac 8461 tcatggatga aggcttaagg gaatgaaatt ggactcacag tactgagtca tcacactgaa 8521 aaatgcaacc tgatacatca gcagaaggtt tatgggggaa aaatgcagcc ttccaattaa 8581 gccagatatc tgtatgacca agctgctcca gaattagtca ctcaaaatct ctcagattaa 8641 attatcaact gtcaccaacc attcctatgc tgacaaggca attgcttgtt ctctgtgttc 8701 ctgatactac aaggctcttc ctgacttcct aaagatgcat tataaaaatc ttataattca 8761 catttctccc taaactttga ctcaatcatg gtatgttggc aaatatggta tattactatt 8821 caaattgttt tccttgtacc catatgtaat gggtcttgtg aatgtgctct tttgttcctt 8881 taatcataat aaaaacatgt ttaagcaaac acttttcact tgtagtattt gaagtacagc 8941 aaggttgtgt agcagggaaa gaatgacatg cagaggaata agtatggaca cacaggctag 9001 cagcgactgt agaacaagta ctagtgggtg agaagttgaa caagagtccc ctacaagcaa 9061 cttaatctaa taagctagtg gtctacatca gctaaaagag catagtgagg gatgaaattg 9121 gttctccttt ctaagcatca cctgggacaa ctcatctgga gcagtgtgtc caatctgccg 9181 ctgccctgat ctcggctggg gtgatg // LOCUS PMUCEN 150 bp ds-DNA PHG 11-JUL-1990 DEFINITION Bacteriophage Mu wild type DNA fragment with a gyrase cleavage site. ACCESSION M32302 KEYWORDS . SOURCE Bacteriophage Mu (wild type) DNA. ORGANISM Bacteriophage mu Viridae; ds-DNA nonenveloped viruses; Myoviridae. REFERENCE 1 (bases 1 to 150) AUTHORS Pato,M., Howe,M. and Higgins,P. TITLE DNA gyrase binds to a centrally located replication enhancer (CEN) in the bacteriophage Mu genome JOURNAL Unpublished (1990) In Press STANDARD full staff_review COMMENT Draft entry and computer-readable [or printed] sequence for [1] kindly submitted by M.L.Pato, 23-FEB-1990. FEATURES from to/span description site 49 50 DNA gyrase cleavage site in complimentary strand mut 44 44 a in wt; g in Mu nuB103 mut 50 50 g in wt; c in Mu nuB1 BASE COUNT 40 a 38 c 27 g 45 t ORIGIN Map position at 18.0 kb. 1 acgcgtcagc gccgctctga ggcaataaac agaatcaggc ataaaatcag ccgcacagat 61 tttttaaaac gcgccacggg atttttaaac cggtatttaa cggtgtatga atcccgtttt 121 atcttccttt cactttcttt ctccagtact // LOCUS RATRNRTR 2577 bp ds-DNA ROD 11-JUL-1990 DEFINITION Rat snRNP-associated polypeptide N, complete cds. ACCESSION J05497 KEYWORDS snRNP-associated polypeptide N. SOURCE Rat male adult (Fisher) DNA, clones rgV and rgIII2. ORGANISM Rattus rattus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 2577) AUTHORS Schmauss,C. and Lerner,M.R. TITLE The closely related small nuclear ribonucleoprotein polypeptides N and B/B' are distinguishable by antibodies as well as by differences in their mRNAs and gene structures JOURNAL J. Biol. Chem. 265, 10733-10739 (1990) STANDARD full staff_entry COMMENT Draft entry and computer-readable [or printed] sequence for [1] kindly submitted by C.Schmauss, 13-APR-1990. FEATURES from to/span description pept 1314 2036 snRNP-associated polypeptide N signal 774 779 TATA box signal 698 702 CAAT box BASE COUNT 822 a 537 c 581 g 637 t ORIGIN 1 taactagaga actgagaaca gaatccctgt tagaggaatt agagaccaaa ttaaaagagg 61 tgaaggggct tgcaacccca ttagaacaac aatgccaacc aaccagagct cccagggact 121 aaaccactac ccaaagacta cacatggctc caactgcata tgtagcagag gatggcctag 181 ttgggcacaa tggaaagaga agcccttgga cctgcccagg ctggacccac cagtgtagag 241 taatgtctga ggggtagaag caggtggttg ggaatgggaa tacccttata tgtgaaggga 301 agcaggatga ggtagggaac ttatgttcgt aaaccaggaa agggaataac gtttgaaatg 361 taaataaaaa tatatccagt gaaaaaaaaa ctgaagtcta taataaaagc ttttaatcct 421 ctcagccctt aataaaagtt aattatatta cttatgttaa aaaaaacata aaacagcatg 481 gtattgtact tttttttttc agacaaaagg tctatggcac acagccaaat cagtgacctc 541 tggggcacaa tttccagaaa tcaacatcct agagttgacc tctggcttcc atgtatacgt 601 gcatgcacac acacatatgc atatacatac aaaattacat atatgcagtt gtctaaatca 661 tatgaagaat ttcaagttgt tttaagttta atatcagcaa atacatgcaa atgtgattat 721 aagaagctgg atggaatcct gagttgttga ctaaagagct aagaaggggc aattataaaa 781 caaaaatgac acatgaaatt ccacccgagg ttagaaataa ttaaagaagg ccattgcggc 841 aagtctagca cagagagtag agggtgctgg aggatgacag acggttggtt ctgaggaggg 901 attttgcaac gaatggagcg aggaagggat cgtttacact tgagaagaac tactgaacag 961 cacgtcccag agattgaggt ccaggtcaaa cgtagaagga cagcctcact gagcaaccaa 1021 gagtgtcact tgtacccacg gcattctcag caacagcaaa ttcctgtggt ggatttccag 1081 gcagaactga gacaggcgtt cttagctgag acaccaagag gtggttaaag cagtattgga 1141 acttcaaggt ggtggaagtc aacaaacaca ggacctatcc actgattgtg aaactttggt 1201 caagcttaca ctgtgttaat aaccctgcat caaaccttta tttattgccc ttccccaagt 1261 tttaaggatc ttgtaatttt agtgttgaca actgctattg tggaacagca atcatgactg 1321 tgggtaagag tagcaagatg ctgcagcata ttgactatag aatgagatgt atcctgcaag 1381 atggaagatt cttcattggc acctttaagg cttttgacaa gcatatgaat ttgatcctct 1441 gtgattgtga tgagttcagg aagatcaagc caaagaatgc aaaacagcca gaacgtgaag 1501 aaaaacgggt tttgggtctg gtcttgctac gtggagagaa cttggtttcc atgacagtgg 1561 agggtccacc tcctaaagat actggcattg ctcgtgtgcc acttgctagt gctgcaggtg 1621 gccctggtgt tggaagagca gctggcagag gagtaccagc aggtgtacct attccccaag 1681 ctcctgctgg attagcaggc cctgtccgag gagtgggagg cccatcccag caggtcatga 1741 ccccacaggg aagaggcact gttgcagctg ctgctgttgc tgctactgct agcattgcag 1801 gagccccaac ccagtacccg ccaggacggg gaactccacc tccacctgta ggcagagcaa 1861 ccccacctcc aggcattatg gctcctccac ctggaatgag accacccatg ggcccaccaa 1921 ttggacttcc ccctgctcaa gggagaccta taggcatgcc ccctccagga atgagactcc 1981 ctcctccagg aattagaggc ccacctcccc caggaatgcg tccaccaaga ccctaagata 2041 cagttgataa atctcagccc ttctctttcc ctacaatgct tcttgtgaaa ttgtgtcgcc 2101 tgcaagcttt tgacccctct tactgcatta actatagata ataaatacat agcgcaattg 2161 aattgaaaaa aaaagaaata attaaagaaa gtaagtcaca atgactattt gctattgaca 2221 ttttttttaa atgcccgaat gagagccagt ggagacgata gaaagtccag aagaagctaa 2281 gataatttca aaacacataa tgtcagtaga acgagggaag gtaagaaccc acagaacaca 2341 agaaaccact catgaaactc ctcacacaca ggaagaaaag gaagaatgta atttttaaaa 2401 aaaaagttat agtcaagtta aactatattt tctcattggt ttttttttgt gactttgtat 2461 ttatttttat gtttctttgt gtatattgta catgtctcag tcaaaggcca acggtgagtg 2521 ttttcctcta aaaaacctta ttgtttaaga cagggtctct tcctgagctc agaattc // LOCUS WUCSSP 1323 bp ds-DNA INV 11-JUL-1990 DEFINITION W.bancrofti species specific DNA fragment. ACCESSION M27140 KEYWORDS . SOURCE W.bancrofti DNA, clone IWb35. ORGANISM Wuchereria bancrofti Eukaryota; Animalia; Metazoa; Nemata; Secernentea; Spiruria; Spirurida; Spirurina; Filarioidea; Filariidae. REFERENCE 1 (bases 1 to 1323) AUTHORS Dissanayake,S. and Piessens,W.F. TITLE Cloning and characterization of a Wuchereria bancrofti-specific DNA sequence JOURNAL Mol. Biochem. Parasitol. 39, 147-150 (1990) STANDARD simple staff_entry BASE COUNT 399 a 204 c 251 g 469 t ORIGIN 1288 bp upstream of SacI site. 1 gatctctgtt tcattatacc gagtaaatat tggagaaaag aaaaatttgt tcaacgtgtt 61 aaagattaac ttgctttcta tataatggaa acattttgca tattggatta gtcagtaaat 121 taataatgga caattgtgat aagtaaaact aaaaagacat cgtcactctc ttccttatta 181 tagcatttcc ttgcttaaaa ccacttgcga cgtcactttt tgttataaat catatggtga 241 atacttttcc tcatttaaga tcgtttatta gcttttgcat tacaaattgt tcattttagt 301 tgtgaacgca ttttgtacat ttaaatgctt gctttagaat tttaggtttc aactggtacg 361 tttatgccgt ttatatgaaa ttatgggata acaaagaaaa ataaagataa agaagtaaaa 421 attcgaatga ttaaatgaat tattagtacc ctgattgcta tagccctttt ctacgttttg 481 gcaagaagtc ccaaattggt tctcactttt cagaatgaaa atttttagtt gtttatagcg 541 ccaaaagaaa tgattaacag cagtttggct ttgtggacgg aatgatatgc ttttctgcat 601 acctttcata aattggaaaa aacaaaataa tttggctaag agtgaatgga gtattcgttc 661 gtttgtgata ttttcaatgt ttgttgatgt atattcgaag cgtctctgct cactactgtc 721 aaaccctttt taagaacgtt gcttctacgg tcactgggca gctactacgt attgagtgag 781 cgatatgaaa agaatataca gtatctaatg actgccaatg tcaaataaat ttttgtatcg 841 tcactcagcg gtcacaaatg tttcataaat atttcacatg cattctattt taggttcaaa 901 tatgctttta aaattctgct aaatttgcaa actaacgaga ttttgtttgg cagctcttct 961 tatgataacg cagttcaatc ctggtggtga agaatttgcc acagtcttcg cattttggat 1021 gaggttcatg cgtttgtttg tgtttgtgaa atgttgattt atggtcgaat gtccgcccac 1081 aaccgggtac tttgcattca tagatgaaag gctgaccgtg tgtttcctat gtgttatata 1141 ttcgttgtaa ttgttcgtga tcaataggaa acaactggca ggatggcaga ttttaataca 1201 accatatcaa taattatatt aaatgtaaat gttctagctg ggtagagtgg cgtgcatctg 1261 tagtctcggc cacttggaag actgagctca ggaagattac ttgcacccag gagcttgagg 1321 agc // LOCUS YSCHXT2 2890 bp ds-DNA PLN 11-JUL-1990 DEFINITION S.cerevisiae high affinity hexose transporter-2 (HXT2) gene, complete cds. ACCESSION M33270 KEYWORDS high affinity hexose transporter-2. SOURCE S.cerevisiae (isogenic strain to S288C) DNA. ORGANISM Saccharomyces cerevisiae Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes; Endomycetales; Saccharomycetaceae. REFERENCE 1 (bases 1 to 2890) AUTHORS Kruckeberg,A.L. and Bisson,L.F. TITLE The HXT2 gene of Saccharomyces cerevisiae is required for high affinity glucose transport JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by A.L.Kruckeberg, 26-MAR-1990. FEATURES from to/span description pept 818 2443 high affinity hexose transporter-2 BASE COUNT 808 a 571 c 535 g 976 t ORIGIN 1 aaaaagaaat attattcatt actatcaaga taccgtagaa aagaaaaaga accggggatg 61 aataataaca aaacgggctg ctttttcttt ttctctttct ttttcatttg gtccctctcc 121 actctttctc cacgtggctt tgcttcccgt atttttcttc gtcagagaga ctacatgata 181 gtccaaagaa aagaaacagg ggggacgaag aagaggagag gaaaaaccaa aatataattt 241 tccgtgaaat agattctttt tctccactgc acgacttctt ctcctcccac aaaaaatgac 301 gcctcataga cagccccgca gcttcacttt taagtttctt tttctcctca cggcgcaacc 361 gctaacttaa gctaatcctt atgaatccgg agaaaagcgg ggtcttttaa ctcaataaaa 421 ttttccgaaa tcctttttcc tacgcgtttt cttcgggaac tagataggtg gctcttccac 481 ctgtttttcc atcattttag tttttcgcaa gccatgcgtg ccttttcgtt tttgcgatgg 541 cgaacgaggg ctggaaaaat taacggtacg ccgcctaacg atagtaatag gccacgcaac 601 tggcgtggac gacaacaata agtcgcccat tttttatgtt ttcaaaacct agcaaccccc 661 accaaacttg tcatcgttcc cggattcaca aatgatataa aaagcgatta caattctaca 721 ttctaaccag atttgagatt tcctctttct caattcctct tatattagat tataagaaca 781 acaaattaaa ttacaaaaag acttataaag caacataatg tctgaattcg ctactagccg 841 cgttgaaagt ggctctcaac aaacttctat ccactctact ccgatagtgc agaaattaga 901 gacggatgaa tctcctattc aaaccaaatc tgaatacact aacgctgaac tcccagcaaa 961 gccaatcgcc gcatattgga ctgttatctg tttatgtcta atgattgcat ttggtgggtt 1021 tgtctttggt tgggatactg gtaccatctc tggttttgtt aatcaaaccg atttcaaaag 1081 aagatttggt caaatgaaat ctgatggtac ctattatctt tcggacgtcc ggactggttt 1141 gatcgttggt atcttcaata ttggttgtgc ctttggtggg ttaaccttag gacgtctggg 1201 tgatatgtat ggacgtagaa ttggtttgat gtgcgtcgtt ctggtataca tcgttggtat 1261 tgtgattcaa attgcttcta gtgacaaatg gtaccaatat ttcattggta gaattatctc 1321 tggtatgggt gtcggtggta ttgctgtcct atctccaact ttgatttccg aaacagcacc 1381 aaaacacatt agaggtacct gtgtttcttt ctatcagtta atgatcactc taggtatttt 1441 cttaggttac tgtaccaact atggtactaa agactactcc aattcagttc aatggagagt 1501 gcctttgggt ttgaactttg ccttcgctat tttcatgatc gctggtatgc taatggttcc 1561 agaatctcca agattcttag tcgaaaaagg cagatacgaa gacgctaaac gttctttggc 1621 aaaatctaac aaagtcacca ttgaagatcc aagtattgtt gctgaaatgg atacaattat 1681 ggccaacgtt gaaactgaaa gattagccgg taacgcttct tggggtgagt tattctccaa 1741 caaaggtgct attttacctc gtgtgattat gggtattatg attcaatcct tacaacaatt 1801 aactggtaac aattacttct tctattatgg tactactatt ttcaacgccg tcggtatgaa 1861 agattctttc caaacttcca tcgttttagg tatagtcaac ttcgcatcca ctttcgtggc 1921 cttatacact gttgataaat ttggtcgtcg taagtgtcta ttgggtggtt ctgcttccat 1981 ggccatttgt tttgttatct tctctactgt cggtgtcaca agcttatatc caaatggtaa 2041 agatcaacca tcttccaagg ctgccggtaa cgtcatgatt gtctttacct gtttattcat 2101 tttcttcttc gctattagtt gggccccaat tgcctacgtt attgttgccg aatcctatcc 2161 tttgcgtgtc aaaaatcgtg ctatggctat tgctgttggt gccaactgga tttggggttt 2221 cttgattggt ttcttcactc ccttcattac aagtgcaatt ggattttcat acgggtatgt 2281 cttcatgggc tgtttggtat tttcattctt ctacgtgttt ttctttgtct gtgaaaccaa 2341 gggcttaaca ttagaggaag ttaatgaaat gtatgttgaa ggtgtcaaac catggaaatc 2401 tggtagctgg atctcaaaag aaaaaagagt ttccgaggaa taagagatta tacttaaact 2461 agcactgatt tttttaaggc taatggctac taatacttta atagatgatc ttcatacttt 2521 tttatttaac gatttttaat gatgttttta tttgtaccac tcatttatct agattttttt 2581 aatactgatc aaatcttacg gactcgacgt taaaaagttc ctacatacgt ctggtacttg 2641 aaacgctgct tcgaggtatt gacactataa gaatacgatc caaatactta caccgcatgt 2701 aaaaatatgc cgacaatatg aatacttgtt gatgaatgat atttgatttt aatccggcaa 2761 tttacctcct ttatataatc caataattgt tgataattag tggttaggtt gcagtactaa 2821 taagaattaa gacaaatatt cttctactat ataaaaggtg caaacaaaac acacgccgat 2881 cggccatact // LOCUS RATGAH 1003 bp ss-mRNA ROD 11-JUL-1990 DEFINITION Rat L-glutamine amidohydrolase mRNA, 3' end. ACCESSION J05499 KEYWORDS L-glutamine amidohydrolase. SOURCE Rat (strain Sprague-Dawley) liver, cDNA to mRNA. ORGANISM Rattus norvegicus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 1003) AUTHORS Smith,E.M. and Watford,M. TITLE Molecular cloning of a cDNA for rat hepatic glutaminase: Sequence similarity to kidney-type glutaminase JOURNAL J. Biol. Chem. 265, 10631-10636 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by M.Waterford, 08-MAY-1990. FEATURES from to/span description pept < 1 660 L-glutamine amidohydrolase (EC 3.5.1.2) BASE COUNT 270 a 243 c 258 g 232 t ORIGIN 1 ttccggatgt gtctgtcacc cccgttagac aagctgggga acagccacag gggcatcagc 61 ttctgccaga agttggtgtc tctgtttaac ttccacaact acgacaacct gcggcactgt 121 gctcggaagt tagacccacg gagggaaggg ggggaagttc ggaacaagac cgtggtgaac 181 ctgttatttg ctgcatatag tggagatgtc tcagctcttc gaaggtttgc cttgtctgcc 241 gtggatatgg agcagaagga ctatgattcc cgcacagccc tacatgtggc ggcagcggaa 301 ggacacattg acgttgtcaa gtttctgatc gaggcttgca aagtgaatcc ttttgtcaag 361 gacaggtggg gcaacattcc cctggatgat gccctgcagt tcaatcacct ggaggtggtc 421 aaactgcttc aggattacca tgactcctac atgctgtctg agactcaagc tgaggtacag 481 ctgagactct gtcaaaagag aactgagaga gcatgtgtga gcacaggcca gggcagcccg 541 tgctcaagaa aaagcatgag cgggccacaa tttaacccaa ggccaccaaa aatactattg 601 caagctgctt cagtgggatc aacacagcca tctggtgaca caggccagtg ttttctgtga 661 gaatcaaaat gccccattcc ctcatcggac agcacagaga aaagcttcag tggacacctg 721 agcagagcta gccacggaga cctcaaggta tagcttaagt gacatcctcc accagaaagt 781 agcccaggct tttacccagg tccccatttc aacttccttg gagagcgtct agctacatgc 841 atatgtatct gtcacagagc aagagaggtg ggtgagagcc caatcacctg gctttagaaa 901 tctgcagaga tctgtccatc ttagccaaga catgctgcta ctgctgacag gagttttata 961 gacaaagtat tttgtgttca aataaacttt aattaccgga att // LOCUS CEACAEVA 264 bp ds-DNA VRL 11-JUL-1990 DEFINITION Caprine arthritis-encephalitis lentivirus tat protein gene, complete cds. ACCESSION M34092 KEYWORDS tat protein. SOURCE Caprine arthritis-encephalitis lentivirus (strain Cork) DNA, from goat synovial membrane, clone pCol.9. ORGANISM Caprine arthritis encephalitis virus Viridae; ss-RNA enveloped viruses; Positive strand RNA virus; Retroviridae; Lentivirinae. REFERENCE 1 (bases 1 to 264) AUTHORS Jackson,M.K., Knowles,D.P., Stem,T.A., Harwood,W.G., Robinson,M.M. and Cheevers,W.P. TITLE Genetic structure of the pol-env region of the Caprine arthritis- encephalitis lentivirus genome: Possible role in trans-activation of the viral long terminal repeat JOURNAL Unpublished (1990) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by W.P.Cheevers, 08-MAY-1990. Author address: W.P.Cheevers Washington State University Dept Veterinary Microbiology Pullman, WA 94164-7040 email: b384@wsuvmsl.bitnet FEATURES from to/span description pept 1 264 tat protein BASE COUNT 97 a 40 c 80 g 47 t ORIGIN 1 atgagtgaag aactgcctca aagaagggag acacatccag aagaacttgt aaggaacgta 61 cgggaaagag aaagggatac atggcaatgg acaagcatca gagtacctga ggaaatactg 121 caaagatggc ttgctatgct taggtcaggc agaaatagaa agaaagtgta tagagaaatg 181 caaaaatgga tgtggataca tcccaagggg cctgtgatta gggcctgtgg atgcagacta 241 tgtaacccgg ggtggggaac ataa // LOCUS CEACAEVB 264 bp ds-DNA VRL 11-JUL-1990 DEFINITION Caprine arthritis-encephalitis lentivirus tat protein gene, complete cds. ACCESSION M34093 KEYWORDS tat protein. SOURCE Caprine arthritis-encephalitis lentivirus (strain G63) DNA, from goat synovial membrane, clone pC63-49. ORGANISM Caprine arthritis encephalitis virus Viridae; ss-RNA enveloped viruses; Positive strand RNA virus; Retroviridae; Lentivirinae. REFERENCE 1 (bases 1 to 264) AUTHORS Jackson,M.K., Knowles,D.P., Stem,T.A., Harwood,W.G., Robinson,M.M. and Cheevers,W.P. TITLE Genetic structure of the pol-env region of the Caprine arthritis- encephalitis lentivirus genome: Possible role in trans-activation of the viral long terminal repeat JOURNAL Unpublished (1990) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by W.P.Cheevers, 08-MAY-1990. Author address: W.P.Cheevers Washington State University Dept Veterinary Microbiology Pullman, WA 94164-7040 email: b384@wsuvmsl.bitnet FEATURES from to/span description pept 1 264 tat protein BASE COUNT 99 a 43 c 75 g 47 t ORIGIN 1 atgagtgaaa gactgcctca aagaagggaa gtacatccag aggaacgtgt aaggaacata 61 tgggaaagag aaagggacac atggcaatgg acaagcatca gagtacctga agaaatactg 121 caaagatggc tcgctatgct taggtcaggc agaaatagaa acaaagtgta tagagaaatg 181 caaaaatgga tgtcgataca tcccaaggcg cctgtgatta ggccttgtgg atgcagacta 241 tgtaacccgg ggtgggaaac ataa // LOCUS FLAPR834HA 540 bp ss-RNA VRL 11-JUL-1990 DEFINITION Influenza A/PR/8/34, hemagglutinin (seg 4) gene. partial cds. ACCESSION M34335 KEYWORDS glycoprotein; hemagglutinin. SOURCE Influenza A/PR/8/34 RNA, passed in bovine MBDK cells, originally from human. ORGANISM Influenza virus type A Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses; Orthomyxoviridae; Influenzavirus; Influenza A viruses. REFERENCE 1 (bases 1 to 540) AUTHORS Bressoud,A., Whitcomb,J., Pourzand,C., Haller,O. and Cerutti,P. TITLE Rapid detection of influenza virus H1 by the polymerase chain reaction JOURNAL Biochem. Biophys. Res. Commun. 167, 425-430 (1990) STANDARD simple staff_review COMMENT Sequence reported is + strand. FEATURES from to/span description pept < 1 > 540 hemagglutinin (AA at 2) BASE COUNT 179 a 121 c 123 g 117 t ORIGIN 1 cctactggtc ctgttatgtg cacttgcagc tgcagatgca gacacaatat gtataggcta 61 ccatgcgaac aattcaaccg acactgttga cacagtactc gagaagaatg tgacagtgac 121 acactctgtt aacctgctcg aagacagcca caacggaaaa ctatgtagat taaaaggaat 181 agccccacta caattgggga aatgtaacat cgccggatgg ctcttgggaa acccagaatg 241 cgacccactg cttccagtga gatcatggtc ctacattgta gaaacaccaa actctgagaa 301 tggaatatgt tatccaggag atttcatcga ctatgaggag ctgagggagc aattgagctc 361 agtgtcatca ttcgaaagat tcgaaatatt tcccaaagaa agctcatggc ccaaccacaa 421 cacaaacgga gtaacggcag catgctccca tgaggggaaa agcagttttt acagaaattt 481 gctatggctg acggagaagg agggctcata cccaaagctg aaaaattctt atgtgaacaa // LOCUS HUMGPPSBAA 355 bp ds-DNA PRI 11-JUL-1990 DEFINITION Human pregnancy-specific beta-1 glycoprotein C-D gene, intron C1. ACCESSION M34422 KEYWORDS beta-1 glycoprotein. SOURCE Human placenta, clone PS-beta-G C. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 355) AUTHORS Streydio,C., Swillens,S., Georges,M., Szpirer,C. and Vassart,G. TITLE structure, evolution and chromosomal localization of the human pregnancy-specific beta-1 glycoprotein gene family JOURNAL Genomics 6, 579-592 (1990) STANDARD simple staff_review FEATURES from to/span description IVS 1 355 pregnancy-specific beta-1 glycoprotein intron C1 BASE COUNT 110 a 64 c 62 g 119 t ORIGIN Chromosome 19. 1 gtaagtggat cccagcatcg ttggcaatag ggttttaggt ggagtctatc tggcattcag 61 agaagagtca ggaaaacaat tgtattccca gcctgtgtcc catgggcaca agcaaatccc 121 aaattctcct cctgaaccct ccaaatttgt ctaagaactt cgaaaacttt aacaaacagg 181 ctgatatctt cataatattc ccagcctaga ccaagcagga agaacattga tttcattgaa 241 ataattgata ataatgaaga taatgttttt atgattttta tttgaaaatt tgctgattct 301 ttaaatggtt tgttttctac attgatggaa tttttctctt ttaatctatc tacag // LOCUS HUMGPPSBD 1418 bp ss-mRNA PRI 11-JUL-1990 DEFINITION Human pregnancy-specific beta-1 glycoprotein mRNA, complete cds. ACCESSION M34421 KEYWORDS beta-1 glycoprotein. SOURCE Human placenta, cDNA to mRNA, clone PS-beta-G B. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 1418) AUTHORS Streydio,C., Swillens,S., Georges,M., Szpirer,C. and Vassart,G. TITLE Structure, evolution and chromosomal localization of the human pregnancy-specific beta-1 glycoprotein gene family JOURNAL Genomics 6, 579-592 (1990) STANDARD simple staff_review FEATURES from to/span description pept 79 1359 pregnancy-specific beta-1 glycoprotein precursor /hgml_locus_uid="LG0073P" /nomgen="PSBG1" /map="19q13.1-q13.2" sigp 79 180 pregnancy-specific beta-1 glycoprotein signal peptide matp 181 1356 pregnancy-specific beta-1 glycoprotein mRNA 1 1418 pregnancy-specific beta-1 glycoprotein mRNA BASE COUNT 408 a 398 c 296 g 316 t ORIGIN Chromosome 19. 1 cagctgacag ccgtgctcag acagcttctg gatcctaggc tcatctccac agaggagaac 61 acgcaggcag cagagaccat ggggcccctc ccagcccctt cctgcacaca gcgcatcacc 121 tggaaggggc tcctgctcac agcatcactt ttaaacttct ggaacccgcc caccactgcc 181 gaagtcacga ttgaagccca gccacccaaa gtttctgagg ggaaggatgt tcttctactt 241 gtccacaatt tgccccagaa tcttcctggc tacttctggt acaaagggga aatgacggac 301 ctctaccatt acattatatc gtatatagtt gatggtaaaa taattatata tgggcctgca 361 tacagtggaa gagaaacagt atattccaac gcatccctgc tgatccagaa tgtcacccgg 421 aaggatgcag gaacctacac cttacacatc ataaagcgag gtgatgagac tagagaagaa 481 attcgacatt tcaccttcac cttatacttg gagactccca agccctacat ctccagcagc 541 aacttaaacc ccagggaggc catggaggct gtgcgcttaa tctgtgatcc tgagactctg 601 gacgcaagct acctatggtg gatgaatggt cagagcctcc ctgtgactca caggttgcag 661 ctgtccaaaa ccaacaggac cctctatcta tttggtgtca caaagtatat tgcaggaccc 721 tatgaatgtg aaatacggaa cccagtgagt gccagtcgca gtgacccagt caccctgaat 781 ctcctcccga agctgcccat cccctacatc accatcaaca acttaaaccc cagggagaat 841 aaggatgtct tagccttcac ctgtgaacct aagagtgaga actacaccta catttggtgg 901 ctaaacggtc agagcctccc cgtcagtccc ggggtaaagc gacccattga aaacaggata 961 ctcattctac ccagtgtcac gagaaatgaa acaggaccct atcaatgtga aatacgggac 1021 cgatatggtg gcctccgcag taacccagtc atcctaaatg tcctctatgg tccagacctc 1081 cccagaattt acccttcatt cacctattac cgttcaggag aaaacctcga cttgtcctgc 1141 ttcacggaat ctaacccacc ggcagagtat ttttggacaa ttaatgggaa gtttcagcaa 1201 tcaggacaaa agctctttat cccccaaatt actagaaatc atagcgggct ctatgcttgc 1261 tctgttcata actcagccac tggcaaggaa atctccaaat ccatgacagt caaagtctct 1321 ggtccctgcc atggagacct gacagagtct cagtcatgac tgcaacaact gagacactga 1381 gaaaaagaac aggctgatac cttcatgaaa ttcaagac // LOCUS HUMGPPSBE 1856 bp ss-mRNA PRI 11-JUL-1990 DEFINITION Human pregnancy-specific beta-1 glycoprotein mRNA, complete cds. ACCESSION M34420 KEYWORDS beta-1 glycoprotein. SOURCE Human placenta, cDNA to mRNA, clone PS-beta-G A. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 1856) AUTHORS Streydio,C., Swillens,S., Georges,M., Szpirer,C. and Vassart,G. TITLE Structure, evolution and chromosomal localization of the human pregnancy-specific beta-1 glycoprotein gene family JOURNAL Genomics 6, 579-592 (1990) STANDARD simple staff_review FEATURES from to/span description pept 82 1368 pregnancy-specific beta-1 glycoprotein precursor /hgml_locus_uid="LG0073P" /nomgen="PSBG1" /map="19q13.1-q13.2" sigp 82 183 pregnancy-specific beta-1 glycoprotein signal peptide matp 184 1365 pregnancy-specific beta-1 glycoprotein mRNA 1 1856 pregnancy-specific beta-1 glycoprotein mRNA BASE COUNT 544 a 476 c 364 g 472 t ORIGIN Chromosome 19. 1 gcacagctga gagccatgct caggaagttt ctggatccta ggctcagctc cacagaggag 61 aacacgcagg cagcagagac catggggccc ctctcagccc ctccctgcac acagcgcatc 121 acctggaagg ggctcctgct cacagcatca cttttaaact tctggaaccc gcctaccact 181 gcccaagtca cgattgaagc cgagccaacc aaagtttcca aggggaagga cgttcttcta 241 cttgtccaca atttgcccca gaatcttgct ggctacatct ggtacaaagg gcaaatgaag 301 gacctctacc attacattac atcatacgta gtagatggtc aaataattat atatgggcct 361 gcatacagtg gacgagaaac agtatattcc aatgcatccc tgctgatcca gaatgtcacc 421 cgggaggacg caggatccta caccttacac atcgtaaagc gaggtgatgg gactagagga 481 gaaactggac atttcacctt caccttatac ctggagactc ccaagccctc catctccagc 541 agcaacttat accccaggga ggacatggag gctgtgagct taacctgtga tcctgagact 601 ccggacgcaa gctacctgtg gtggatgaat ggtcagagcc tccctatgac tcacagcttg 661 cagttgtcca aaaacaaaag gaccctcttt ctatttggtg tcacaaagta cactgcagga 721 ccctatgaat gtgaaatacg gaacccagtg agtgccagcc gcagtgaccc agtcaccctg 781 aatctcctcc cgaagctgcc caagccctac atcaccatca acaacttaaa ccccagggag 841 aataaggatg tcttagcctt cacctgtgaa cctaagagtg agaactacac ctacatttgg 901 tggctaaatg gtcagagcct cccggtcagt cccagggtaa agcgacccat tgaaaacagg 961 atcctcattc tacccagtgt cacgagaaat gaaacaggac cctatcaatg tgaaatacag 1021 gaccgatatg gtggcatccg cagttaccca gtcaccctga atgtcctcta tggtccagac 1081 ctccccagaa tttacccttc attcacctat taccattcag gagaaaacct ctacttgtcc 1141 tgcttcgcgg actctaaccc accagcagaa tattcttgga caattaatgg gaagtttcag 1201 ctatcaggac aaaagctctt tatcccccag attactacaa agcatagcgg gctctatgct 1261 tgctctgttc gtaactcagc cactggcatg gaaagctcca aatccatgac agtcaaagtc 1321 tctgctcctt caggaacagg acatcttcct ggccttaatc cattatagca gccgtgatgt 1381 catttctgta tttcaggaag actggcagac agttgctttc attcttcctc aaagtattta 1441 ccatcagcta cagtccaaaa ttgctttttg ttcaaggaga tttatgaaaa gactctgaca 1501 aggactcttg aatacaagtt cctgataact tcaagatcat accactggac taagaacttt 1561 caaaatttta atgaacaggc tgatacttca tgaaattcaa gacaaagaaa aaaacccaat 1621 tttattggac taaatagtca aaacaatgtt ttcataattt tctatttgaa aatgtgctga 1681 ttctttgaat gttttattct ccagatttat gcactttttt tcttcagcaa ttggtaaagt 1741 atacttttgt aaacaaaaat tgaaacattt gcttttgctc cctaagtgcc ccagaattgg 1801 gaaactattc aggagtattc atatgtttat ggtaataaag ttatctgcac aagttc // LOCUS HUMGPPSBF 2004 bp ss-mRNA PRI 11-JUL-1990 DEFINITION Human pregnancy-specific beta-1 glycoprotein mRNA, complete cds. ACCESSION M23575 KEYWORDS beta-1 glycoprotein. SOURCE Human placenta, cDNA to mRNA, clone pSP1-i. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 2004) AUTHORS Rooney,B.C., Horne,C.H.W. and Hardman,N. TITLE Molecular cloning of a cDNA for human pregnancy-specific beta-1-glycoprotein: Homology with human carcinoembryonic antigen and related proteins JOURNAL Gene 71, 439-449 (1988) STANDARD simple staff_review FEATURES from to/span description pept 232 1518 pregnancy-specific beta-1 glycoprotein precursor /hgml_locus_uid="LG0073P" /nomgen="PSBG1" /map="19q13.1-q13.2" sigp 232 333 pregnancy-specific beta-1 glycoprotein signal peptide matp 334 1515 pregnancy-specific beta-1 glycoprotein BASE COUNT 579 a 510 c 406 g 509 t ORIGIN 1 gggcgggaca actggtctga gtactatggc tgattttcgc tgtctggcat tgagaagcca 61 cacgcccctt ttgcttagga ggcctctctg ctggaggatg acgatggcat ggtttatcta 121 aggccactga caagtcatca atataggaca gcacagctga gagccatgct caggaagttt 181 ctggatccta ggctcagctc cacagaggag aacacgcagg cagcagagac catggggccc 241 ctctcagccc ctccctgcac acagcgcatc acctggaagg ggctcctgct cacagcatca 301 cttttaaact tctggaaccc gcctaccact gcccaagtca cgattgaagc cgagccaacc 361 aaagtttcca aggggaagga cgttcttcta cttgtccaca atttgcccca gaatcttgct 421 ggctacatct ggtacaaagg gcaaatgaag gacctctacc attacattac atcatacgta 481 gtagatggtc aaataattat atatgggcct gcatacagtg gacgagaaac agtatattcc 541 aatgcatccc tgctgatcca gaatgtcacc cgggaggacg caggatccta caccttacac 601 atcgtaaagc gaggtgatgg gactagagga gaaactggac atttcacctt caccttatac 661 ctggagactc ccaagccctc catctccagc agcaacttat accccaggga ggacatggag 721 gctgtgagct taacctgtga tcctgagact ccggacgcaa gctacctgtg gtggatgaat 781 ggtcagagcc tccctatgac tcacagcttg cagttgtcca aaaacaaaag gaccctcttt 841 ctatttggtg tcacaaagta cactgcagga ccctatgaat gtgaaatacg gaacccagtg 901 agtgccagcc gcagtgaccc agtcaccctg aatctcctcc cgaagctgcc caagccctac 961 atcaccatca acaacttaaa ccccagggag aataaggatg tcttagcctt cacctgtgaa 1021 cctaagagtg agaactacac ctacatttgg tggctaaatg gtcagagcct cccggtcagt 1081 cccagggtaa agcgacccat tgaaaacagg atcctcattc tacccagtgt cacgagaaat 1141 gaaacaggac cctatcaatg tgaaatacag gaccgatatg gtggcatccg cagttaccca 1201 gtcaccctga atgtcctcta tggtccagac ctccccagaa tttacccttc attcacctat 1261 taccattcag gagaaaacct ctacttgtcc tgcttcgcgg actctaaccc accagcagaa 1321 tattcttgga caattaatgg gaagtttcag ctatcaggac aaaagctctt tatcccccag 1381 attactacaa agcatagcgg gctctatgct tgctctgttc gtaactcagc cactggcatg 1441 gaaagctcca aatccatgac agtcaaagtc tctgctcctt caggaacagg acatcttcct 1501 ggccttaatc cattatagca gccgtgatgt catttctgta tttcaggaag actggcagac 1561 agttgctttc attcttcctc aaagtattta ccatcagcta cagtccaaaa ttgctttttg 1621 ttcaaggaga tttatgaaaa gactctgaca aggactcttg aatacaagtt cctgataact 1681 tcaagatcat acatggacta agaactttca aaattttaat gaacaggctg atacttcatg 1741 aaattcaaga caaagaaaaa aacccaattt tattggacta aatagtcaaa acaatgtttt 1801 cataattttc tatttgaaaa tgtgctgatt ctttgaatgt tttattctcc agatttatgc 1861 actttttttc ttcagcaatt ggtaaagtat acttttgtaa acaaaaattg aaacatttgc 1921 ttttgctccc taagtgcccc agaattggga aactattcag gagtattcat atgtttatgg 1981 taataaagtt atctgcacaa accc // LOCUS HUMLEUELA 920 bp ss-mRNA PRI 11-JUL-1990 DEFINITION Human elastase/medullasin mRNA, complete cds. ACCESSION M34379 KEYWORDS elastase; medullasin. SOURCE Human leukemic cell line ML3, cDNA to mRNA, clone pSRHLE. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 920) AUTHORS Okano,K., Aoki,Y., Shimizu,H. and Naruto,M. TITLE Functional expression of human leukocyte elastase (HLE)/medullasin in eukaryotic cells JOURNAL Biochem. Biophys. Res. Commun. 167, 1326-1332 (1990) STANDARD full staff_review FEATURES from to/span description pept 39 842 elastase/medullasin precursor (EC 3.4.21.37) sigp 39 119 elastase/medullasin signal peptide matp 126 839 elastase/medullasin BASE COUNT 141 a 329 c 287 g 163 t ORIGIN 1 gcacggaggg gcagagaccc cggagcccca gccccaccat gaccctcggc cgccgactcg 61 cgtgtctttt cctcgcctgt gtcctgccgg ccttgctgct ggggggcacc gcgctggcct 121 cggagattgt ggggggccgg cgagcgcggc cccacgcgtg gcccttcatg gtgtccctgc 181 agctgcgcgg aggccacttc tgcggcgcca ccctgattgc gcccaacttc gtcatgtcgg 241 ccgcgcactg cgtggcgaat gtaaacgtcc gcgcggtgcg ggtggtcctg ggagcccata 301 acctctcgcg gcgggagccc acccggcagg tgttcgccgt gcagcgcatc ttcgaaaacg 361 gctacgaccc cgtaaacttg ctcaacgaca tcgtgattct ccagctcaac gggtcggcca 421 ccatcaacgc caacgtgcag gtggcccagc tgccggctca gggacgccgc ctgggcaacg 481 gggtgcagtg cctggccatg ggctggggcc ttctgggcag gaaccgtggg atcgccagcg 541 tcctgcagga gctcaacgtg acggtggtga cgtccctctg ccgtcgcagc aacgtctgca 601 ctctcgtgag gggccggcag gccggcgtct gtttcgggga ctccggcagc cccttggtct 661 gcaacgggct aatccacgga attgcctcct tcgtccgggg aggctgcgcc tcagggctct 721 accccgatgc ctttgccccg gtggcacagt ttgtaaactg gatcgactct atcatccaac 781 gctccgagga caacccctgt ccccaccccc gggacccgga cccggccagc aggacccact 841 gagaagggct gcccgggtca cctcagctgc ccacacccac actctccagc atctggcaca 901 ataaacattc tctgttttgt // LOCUS MSGIS6110 1360 bp ds-DNA BCT 11-JUL-1990 DEFINITION M.tuberculosis-50 complex IS6110 insertion sequence-like element. ACCESSION M29899 KEYWORDS insertion sequence. SOURCE M.tuberculosis (strain H37RV) DNA (cosmid library pHC79), clone I21. ORGANISM Mycobacterium tuberculosis Prokaryota; Bacteria; Firmicutes; Mycobacteria; Mycobacteriaceae. REFERENCE 1 (bases 1 to 1360) AUTHORS Thierry,D., Cave,M.D., Eisenach,K.D., Crawford,J.T., Bates,J.H., Gicquel,B. and Guesdon,J.L. TITLE IS6110 an IS-like element of Mycobacterium tuberculosis-50 complex JOURNAL Nucleic Acids Res. 18, 188-188 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by D.Thierry 15-NOV-1989. BASE COUNT 269 a 439 c 432 g 220 t ORIGIN 1 cgatgaaccg ccccggcatg tccggagact ccagttcttg gaaaggatgg ggtcatgtca 61 ggtggttcat cgaggaggta cccgccggag ctgcgtgagc gggcggtgcg gatggtcgca 121 gagatccgcg gtcagcacga ttcggagtgg gcagcgatca gtgaggtcgc ccgtctactt 181 ggtgttggct gcgcggagac ggtgcgtaag tgggtgcgcc aggcgcaggt cgatgccggc 241 gcacggcccg ggaccacgac cgaagaatcc gctgagctga agcgcttagc ggcgggacaa 301 cgccgaattg cgaagggcga acgcgatttt aaagaccgcg tcggctttct tcgcggccga 361 gctcgaccgg ccagcacgct aattaacggt tcatcgccga tcatcagggc caccgcgagg 421 gccccgatgg tttgcggtgg ggtgtcgagt cgatctgcac acagctgacc gagctgggtg 481 tgccgatcgc cccatcgacc tactacgacc acatcaaccg ggagcccagc cgccgcgagc 541 tgcgcgatgg cgaactcaag gagcacatca gccgcgtcca cgccgccaac tacggtgttt 601 acggtgcccg caaagtgtgg ctaaccctga accgtgaggg catcgaggtg gccagatgca 661 ccgtcgaacg gctgatgacc aaactcggcc tgtccgggac cacccgcggc aaagcccgca 721 ggaccacgat cgctgatccg gccacagccc gtcccgccga tctcgtccag cgccgcttcg 781 gaccaccagc acctaaccgg ctgtgggtag cagacctcac ctatgtgtcg acctgggcag 841 ggttcgccta cgtggccttt gtcaccgacg cctacgtcgc aggatcctgg gctggcgggt 901 cgcttccacg atggccacct ccatggtcct cgacgcgatc gagcaagcca tctggacccg 961 ccaacaagaa ggcgtactcg acctgaaaga cgttatccac catacggata ggggatctca 1021 gtacacatcg atccggttca gcgagcggct cgccgaggca ggcatccaac cgtcggtcgg 1081 agcggtcgga agctcctatg acaatgcact agccgagacg atcaacggcc tatacaagac 1141 cgagctgatc aaacccggca agccctggcg gtccatcgag gatgtcgagt tggccaccgc 1201 gcgctgggtc gactggttca accatcgccg cctctaccag tactgcggcg acgtcccgcc 1261 ggtcgaactc gaggctgcct actacgctca acgccagaga ccagccgccg gctgaggtct 1321 cagatcagag agtctccgga ctcaccgggg cggttcacga // LOCUS MUSIGHAAT 348 bp ss-mRNA ROD 11-JUL-1990 DEFINITION Mouse Ig J558 family active H-chain mRNA V-J3 region from hybridoma CE5, partial cds. ACCESSION M34119 KEYWORDS diversity exon; immunoglobulin heavy chain; processed gene. SOURCE Mouse (Balb/c) hybridoma CE5, cDNA to mRNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 348) AUTHORS Caton,A.J., Herlyn,D., Ross,A.H. and Koprowski,H. TITLE Identical D region sequences expressed by murine monoclonal antibodies specific for a human tumor-associated antigen JOURNAL J. Immunol. 144, 1965-1968 (1990) STANDARD full staff_review FEATURES from to/span description pept < 1 > 348 Ig H-chain V-J3 region (AA at 1) recomb 294 295 J-region end/D-region start recomb 303 304 D-region end/J-region start BASE COUNT 89 a 82 c 97 g 80 t ORIGIN Chromosome 12. 1 caggttcagc tgcagcagtc tggagctgaa ctgatgaagc ctggggcctc agtgaagata 61 tcctgcaagg ctactggcta cacattcagt aagtactgga tagagtgggt aaagcagagg 121 cctggacatg gccttgagtg gattggagag attttacctg gaagtggtag tactaaccat 181 gatgagaagt tcaagggcaa ggccacattc actgcagata catcctccaa cacagcctac 241 atgcaactca gcagcctgac atctgaggac tctgccgtct attactgtgc aagagacggt 301 ccctggtttg cttactgggg ccaagggact ctggtcactg tctctgca // LOCUS MUSIGKCSR 321 bp ss-mRNA ROD 11-JUL-1990 DEFINITION Mouse Ig active kappa-chain mRNA V-region from hybridoma GA733, partial cds. ACCESSION M34120 KEYWORDS immunoglobulin light chain; kappa-immunoglobulin; processed gene. SOURCE Mouse (Balb/c) hybridoma GA733, cDNA to mRNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 321) AUTHORS Caton,A.J., Herlyn,D., Ross,A.H. and Koprowski,H. TITLE Identical D region sequences expressed by murine monoclonal antibodies specific for a human tumor-associated antigen JOURNAL J. Immunol. 144, 1965-1968 (1990) STANDARD full staff_review FEATURES from to/span description pept < 1 > 321 Ig kappa-chain (AA at 1) BASE COUNT 87 a 83 c 74 g 77 t ORIGIN Chromosome 6. 1 gacattgtga tgacccagtc tcacaaattc atgtccacat cagtaggaga cagtgtcagc 61 atcacctgca aggccagtca ggatgtgagt actgctgtag cctggtatca acagaaacca 121 ggacaatctc ctaaactact gatttactcg gcatccgacc ggtacactgg agtccctgat 181 cgcttcactg gcagtggatc tgggacggat ttcactttca ccatcagcag tgtgcaggct 241 gaagacctgg cagtttatta ctgtcaccaa cattatatta ctcctcggac gttcggtgga 301 ggcaccaaac tggaaatcaa a // LOCUS MUSIGKCSS 321 bp ss-mRNA ROD 11-JUL-1990 DEFINITION Mouse Ig active kappa-chain mRNA V-region from hybridoma C017-1A, partial cds. ACCESSION M34121 KEYWORDS immunoglobulin light chain; kappa-immunoglobulin; processed gene. SOURCE Mouse (Balb/c) hybridoma C017-1A, cDNA to mRNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 321) AUTHORS Caton,A.J., Herlyn,D., Ross,A.H. and Koprowski,H. TITLE Identical D region sequences expressed by murine monoclonal antibodies specific for a human tumor-associated antigen JOURNAL J. Immunol. 144, 1965-1968 (1990) STANDARD full staff_review FEATURES from to/span description pept < 1 > 321 Ig kappa-chain (AA at 1) BASE COUNT 89 a 77 c 78 g 77 t ORIGIN Chromosome 6. 1 aacattgtaa tgacccaatc tcccaaatcc atgtccatgt cagtaggaga gagggtcacc 61 ttgacctgca aggccagtga gaatgtggtt acttatgttt cctggtatca acagaaacca 121 gagcaatctc ctaaactctt gatttacggg gcctccaacc ggtacactgg ggtccccgat 181 cgcttcacag gtagtggatc tgcaacagat ttcactctga ccattagtag tgtgcaagct 241 gaagaccttg cagattatca ctgtggacag ggttacagct atccgtacac gttcggaggg 301 gggaccaagc tggaaataaa a // LOCUS MUSIGKCST 318 bp ss-mRNA ROD 11-JUL-1990 DEFINITION Mouse Ig active kappa-chain mRNA V-region from hybridoma CE5, partial cds. ACCESSION M34122 KEYWORDS immunoglobulin light chain; kappa-immunoglobulin; processed gene. SOURCE Mouse (Balb/c) hybridoma CE5, cDNA to mRNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 318) AUTHORS Caton,A.J., Herlyn,D., Ross,A.H. and Koprowski,H. TITLE Identical D region sequences expressed by murine monoclonal antibodies specific for a human tumor-associated antigen JOURNAL J. Immunol. 144, 1965-1968 (1990) STANDARD full staff_review FEATURES from to/span description pept < 1 > 318 Ig kappa-chain (AA at 1) BASE COUNT 84 a 78 c 80 g 76 t ORIGIN Chromosome 6. 1 gacattgtga tgacccagtc tcagaaattc atgtccacat cagtaggaga cagggtcggc 61 atcacctgca aggccagtca ggatgtgagt actgctgtag cctggtatca acagaaatca 121 ggacaatctc ctaaactact gatttactcg gcatcctacc ggtacactgg agtccctgag 181 cgcttcgctg gcagtggatc tgggacggat ttcactttca ccatcagcag tgtgcaggct 241 gaagacctgg cagtttatta ctgtcatcaa cattatagta ctcggacgtt cggtggaggc 301 accaagctgg aaatcaaa // LOCUS PSERRSAA 1517 bp ss-rRNA RNA 11-JUL-1990 DEFINITION P.aeruginosa 16S ribosomal RNA. ACCESSION M34133 KEYWORDS 16S ribosomal RNA; ribosomal RNA; small subunit ribosomal RNA. SOURCE P.aeruginosa (strain 25330) ribosomal RNA. ORGANISM Pseudomonas aeruginosa Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Pseudomonadaceae. REFERENCE 1 (bases 1 to 1517) AUTHORS Woese,C.R. JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable [or printed] sequence for [1] kindly submitted by C.R.Woese, 09-MAY-1990. Author address: C.R.Woese University of Illinois Dept of Microbiology 407 S. Goodwin Avenue Urbana, IL 61801 email: carl@ninja.life.uiuc.edu FEATURES from to/span description rRNA 1 1517 16S rRNA BASE COUNT 379 a 334 c 470 g 301 t 33 others ORIGIN 1 ntactgaaga gtttgatcat ggctcagatt gaacgctggc ggcaggccta acacatgcaa 61 gtcgagcgga tgaagggagc ttgctcctgg attcagcggc ggacgggtga gtaatgccta 121 ggaatctgcc tgatagtggg ggataacgtc cggaaacggg cgctaatacc gcatacgtcc 181 tgagggagaa agggggggat cctcggacct cacgctatcn gatgagccta ggtcggatta 241 gctagttggt ggggtaaagg cctaccaagg cgacgatccg taactggtct gagaggacga 301 tcagtcacac tggaactgag acacggtcca gactcctacg ggaggcagca gtggggaata 361 ttggacaatg ggcgaaagcc ngatccagcc atgccgcgtg tgtgaagaag gtcttcggat 421 tgtaaagcac tttaagttgg gaggaagggc agtaagttaa taccttgctg ttttgacgtt 481 accaacagaa taagcaccgg ctaacttcgt gccagcagcc gcggtaatac gaagggtgcg 541 agcgttaatc ggaattactg ggcgtaaagc gcgcgtaggt ggttcagcaa gttggatgtg 601 aaatccccgg gctcaacctg ggaactgcat ccnaaactac tgagctagag tacggtagag 661 ggtggtggaa tttcctgtgt agcggtgaaa tgcgtagata taggaaggaa caccagtggc 721 gaaggcgacc acctggactg atactgacac tgaggtgcga aagcgtgggg agcaaacagg 781 attagatacc ctggtagtcc acgccgtaaa cgatgtcgac tagccgttgg gatccttgag 841 atcttagtgg cgcagctaac gcgataagtc gaccgcctgg ggagtacggc cgcaaggtta 901 aaactcaaat gaattgacgg gggcnngcac aagcggtgga gcatgtggtt taattcgaag 961 caacgcgaag aaccttacct ggccttgaca tgctgagaac tttccagaga tggattggtg 1021 ccttcgggaa ctcagacaca ggtgctgcat ggctgtcgtc agctcgtgtc gtgagatgtt 1081 gggttaagtc ccgtaacgag cgcaaccctt gtccttagtt accagcacct cgggtgggca 1141 ctctaaggag actgccggtg acaaaccgga ggaaggtggg gatgacgtca agtcatcatg 1201 gcccttacgg cnagggctac acacgtgcta caatggtcgg tacaaagggt tgcgaagccg 1261 cgaggtggag ctaatcccat aaaaccgatc gtagtccgga tcgcagtctg caactcgact 1321 gcgtgaagtc ggaatcgcta gtaatcgtga atcagaatgt cacggtgaat acgttcccgg 1381 gccttgtaca caccgcccgt cacaccatgg gagtgggttg ctccagaagt agctagtcta 1441 accgcaaggg ggacggttac cacggagtga ttcatgnnnn nnnnnnnnnn gtaacaagnn 1501 nnnnnnnnnn gaacctg // LOCUS RATNESTIN 5946 bp ss-mRNA ROD 11-JUL-1990 DEFINITION Rat nestin mRNA, complete cds. ACCESSION M34384 KEYWORDS intermediate filament protein; nestin. SOURCE Rat (strain E15) embryo central nervous system, cDNA to mRNA. ORGANISM Rattus norvegicus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 5946) AUTHORS Lendahl,U., Zimmerman,L.B. and McKay,R.D.G. TITLE CNS stem cells express a new class of intermediate filament protein JOURNAL Cell 60, 585-595 (1990) STANDARD simple staff_review FEATURES from to/span description pept 128 5545 nestin (128 could be 161) mRNA 1 5495 nestin mRNA BASE COUNT 1600 a 1401 c 1871 g 1074 t ORIGIN 1 tgctggagtt ctccgcttcc gctgggtcac tgtcgccgct acttcttttc aacccctaaa 61 agctccacgg gccactccct tctctagtgc tccacgtccg cttgccctcg ggggccagac 121 cagcgacatg gagggttgcg tcggggaaga atcttttcag atgtgggagc tcaatcgacg 181 cctggaggcc tacctgaccc gggtcaagac gctagaagag caaaaccagc tgctcagcgc 241 cgagcttggg ggactccggg cgcagtccgg agacacctcc tggagagccc gagccgatga 301 cgagctggca tccctgcgga tcctcgtcga tcagcgctgg cgggagaagc tcgaggctga 361 ggtgcagcgc gacaaccttg cggaagagct ggagagcgtg gcgggccggt gccagcaggt 421 gcggcttgct cgggagcgga ccgtccagga ggccgcctgc agccggcgcg cactcgaggc 481 ggagaagaat gcgcggggct ggctgagcac ccaggcggcc gagctggagc gggagttaga 541 ggctctgcga gccgcgcacg aggaggagcg cgcacacctg aacgcccagg ccgcctgtgc 601 gcctcgccgg ccccccgcac cgccccaccg gatccccggt ccggcccccg aagtcgagga 661 tctggccagg cgactaggcg aagtgtggcg cggggcggtg cgtgactacc aggagcgcgt 721 ggctcacatg gaaagctcgc tgggtcaggc acgcgagcgg ctgagccaag ccgtgcgggg 781 cgctcgggag tgtcgcttag aggtgcaaca gctgcaggct gatcgcgaca gcctccagga 841 gcgcagagaa gcgctggaac agagattgga aggccgctgg caggaccggc tgcaggccac 901 tgataagttc cagctggctg tggaagccct ggagcaggag aagcaaggtc tacagagtca 961 gatcgctcag atcctggaag gtgggcagca actggcacac ctcaagatgt cccttagtct 1021 ggaggtggct acatacagga ctctgctgga ggctgagaac tctcggttgc agacacctgg 1081 acgaggttcc caggcttctc ttggctttct ggaccccaag ctgaagccga atttccttgg 1141 gataccagag gaccagtacc tgggatctgt gctccctgcc ctcagcccca catccttccc 1201 ttcccccttg cctaataccc ttgagactcc tgtgacagcc ttcctgaaga ctcaggagtt 1261 ccttcaggcc agaaccccca ccttggccag cactcccatc ccacctatat ctgaggctcc 1321 ctgtcctcca aatgcagagg tgagagccca ggaggtccct ctttctctgc tccagacaca 1381 ggctccagag cccctttggc tgaaggccac agtgcctagt tcttctgcta tcctcccaga 1441 actagaggaa cctgggggca agcagcaggg tcacttccct gatgatctga cctccttagc 1501 cacaaacctc aaccctcacc accctacttt agaggctaaa gatggagaat ccagtgagtc 1561 tagagtttct agcatattcc aggaagatga ggggcaaatc tgggaactgg tagagaaaga 1621 agcagatata gaggtaaaag tagaaaacag ctcagcccag aaaacacaag aaagtggtct 1681 ggacacagaa gaaacccagg attcccaggg acctttgcag aaggaaacac tgaaggctct 1741 aggagaggag ccactgatgt ctctgaaaat ccagaactat gagacagcag ggaaagagaa 1801 ttgcaattct tctacagaag gccacctggg aacactagaa ggcccagaaa aagaaaagca 1861 aataccacta aagtctttag aagaaaagaa tgtagagtca gagaaaactc tagaaaatgg 1921 ggttcctgta ctatctgagc ttttaggaaa agaagacaca agaacagagg atcaagaatt 1981 aatgtctcct aaaggtacac taaagagatt ttcatctcta ggaaaggaaa gtcaagaagt 2041 agtgaggcct tcaaaagagg ggaacctaga atcatggaca gcttttaaag aggagagcca 2101 acacccactg ggatttccag gagctgagga ccagatgctt gagagactgg tagagaaaga 2161 ggatcagagc ttcccaaggt ctccagagga agaggaccag gaggcatgta gacctctgca 2221 gaaagagaat caggaaccac tagggtatga agaagcagag ggccagatac ttgagagact 2281 gatagaaaaa gagagtcagg agtccctgag gtctccagaa gaagaggacc aggaggcagg 2341 tagatctctg cagaaagaga atcaggagcc actagggtat gaagaagcag aggaccagat 2401 gcttgagaga ctgatagaaa aagagagtca ggagtccctg aagtctccag aagaaaacca 2461 gaggattggg aagcctctag aaagagagaa tcagaaatct ctgaggtatc ttgaagaaaa 2521 ccaggagact tttgtaccac tagaaagcag gaaccagagg ccactgagat ctctagaagt 2581 agaagaggag gagcagagaa ttgtgaaacc tctagaaaaa gtgagtcagg attccctcgg 2641 atctctagca gaagagaatg tgcagccact gaggtatctg gaagaagatg actgcataaa 2701 taagagcctt ctagaagaca agactcacaa gtccttgggg tctcttgaag atagaaatgg 2761 ggatagcatt attataccac aagaaagtga gacccaggtt tcattgaggc ctccagaaga 2821 ggaggaccag aggattgtga accatctaga aaaagaaagt caggagttct cgaggtcttc 2881 agaagaagaa gagcaggtga tggagagatc tctagaagga gagaaccatg aatcactgag 2941 ttctgtagaa aaagaggacc agatggttga gagccaacta gagaaagaga gtcaggactc 3001 agggaagtct cttgaagatg agagccagga gacctttgga cctctggaaa aagagaatgc 3061 agagtccctg agatctctag caggacagga ccaagaggaa cagaagcttg aacaagagac 3121 ccaacaaaca ctgagggctg tagggaatga gcagatggca gtgagcccac cagaaaaggt 3181 ggatccagag ttaccgaagc ctcttggaaa tgaccaggaa atagctagat ctcttggaaa 3241 agagaatcaa gagtcactag tgtcactgaa agaaaaaggt atagagacag tgaagtcttt 3301 agaaacagag atcatagaac cactggagac tgcagaagag gacctggaaa gaaggaagtc 3361 tatagatact caggagccat tgtggtctac tgaagtggct agagagacag tagaacctcc 3421 agaagatgag cccccaggat cgctagggtc tgtggatgag aaccgagaga cactgacatc 3481 ccttgaaaag gagagtcaag aactgagctc tctgggcaag tggaacgtag agaccagggt 3541 agaggacagt cagcagtgcc tgcaagtaga agagggtctg caggaggaac agcaccaaga 3601 gtctctgaga gaggtgaagc aggagctgcc tagctctgga aatcaacagc ggtgggagga 3661 tgtggtggag ggcaaagcag tgggtcagga agcacctctg gcaaccacag gagtgggaac 3721 tgaggataag gcagagttgc atctgagggg gcaaggtgga gaggaagaag ctgcagcaga 3781 gggagagctg ttgcaggata ttgtggggga ggcctggagt ctggggagct ctgagcccaa 3841 ggagcagagg gtccctgctg aggccctcga caacctggaa ggaggggcct tagaggtccc 3901 agttgctcag tcaatgccag aggtgacaga gcgagatgag gatagagccc aagcaggtga 3961 acaagactcc atagaggtga cccttgggtt agaggctgcc agaactggac tggaactcga 4021 gcaggaagtg gtagggctag aggacccaag gcattttgcc agggaggagg ccattccccc 4081 atccctgggg gaggaaagtg tgaaggcaaa gatagctcag ggcttggaag ggcctggaaa 4141 ggaaccaaaa gaggcaggtg ctctggactc ggggatcctt gaattgccca agactagcag 4201 cgaggctctg gaatgccagg gccatgaaga gtctgagtcc atggagggct gggaagaaga 4261 ggaggcctca ctggagactt cagatcatga gggcagtgat gcccctcagc ccaggccccc 4321 agaaacagaa gaagatgagg gtgcacaggc agcactgaca gcccctggtc ccaagctctt 4381 ggaaccctgt tcacccatcc caatcctgac agatgcccat gagctgcagc cccaggctga 4441 ggggatccag gaggctggct ggcagccaga agctgggtct gaagcactag aaagggtaga 4501 aaatgagcca gagtttggtc ttggggagat cccggagggc ctccaggatt gggaagaggg 4561 cagagaagaa agcgaggcag atgatctagg ggaaactctc cctgactcta ctcccctggg 4621 cctctacctg aggtcccctg cttctccaaa gtgggatctg gctggagaac agaggctttc 4681 ccctcaaggg gatgccggga aggaagactg gggtcctgct gtccccgctg cccagggcct 4741 cagtggtcca ccggaagagg aggaggagca aggccatggc tctgacctat catctgagga 4801 gtttgaggac ctagggactg aggcctctct tcttccaggg gttcccaagg aggtggcaga 4861 tcacgtgggc caagtgcccc cggtactgca gcctgcatgc tgggatcagg gtggggaatc 4921 tgatgggttt gctgatgagg aagaaagtgg ggaggaggga gaggaagaag atgctgatga 4981 ggaaggagca gagtcaggag ctcagtggtg ggggtcaggg gcctctggtg gaggctgcaa 5041 ggtccaggat attgcccaaa gaggagaccc ggtacaggag tctgtgggtg tcagtggtct 5101 ctgggatgat ggcttgagag gtgctgcagc taatgttcct gccctagaga tggtatctca 5161 ggacagtgct gagccttctg ggtcagagga gtctgagtct gcttccttgg agggggagga 5221 aggtcaagtg actgaccatt tagatgctcc ccaggaggtg accagcatgg tcccgggggt 5281 aggagatgcc tttgacattg gtggccagag ccccaacttg gactcagaac aagtgaatgg 5341 gaaaatggag aatggactag aacaggctga ggggcaggtg gtcctggatg gggacgagga 5401 tcaagaactc ctattacagg gacaggaggt gggtgctcta aaggttcctt tggtagcatc 5461 tcctgtgcat ctaggcccaa gccagcccct gaagttcact ctgagtgggg tagatgggga 5521 ttcctggtcc tcaggggaag actagaaact gcccctctgg ctctgaggat gtactggtgg 5581 ggatgtccct ccctgctctg ggtgaccact cttagctttg ataacttgac ccatggtatt 5641 tgtcctggag agttgtggct gggctgagca agggaggtga gatcctcctg aaggctcagg 5701 agttccaggc ctatagttct accccctctt tcttctgtgg ctcacctgct ggaagaggcc 5761 tgggcccaga gctttcccac aaggctgttc tggccacagc ttgctagcct tgcctaccac 5821 ctgcacaagg tctggtctgg tgtatgacca ggggagctga gggcagcatt tatctgaccc 5881 ttcatctcag cctgctgaga gcttgttcct ctcttcctcc ctgaataaag ccgtatccct 5941 acctac // LOCUS SYNCMPA 1885 bp ds-DNA BCT 11-JUL-1990 DEFINITION Synechococcus sp. 42-kD membrane protein (cmpA) gene, complete cds. ACCESSION M32999 KEYWORDS membrane protein. SOURCE Synechococcus sp. (strain PCC 7942) DNA. ORGANISM Synechococcus sp. Prokaryota; Bacteria; Gracilicutes; Oxyphotobacteria; Cyanobacteria; Chroococcales. REFERENCE 1 (bases 1 to 1885) AUTHORS Omata,T., Carlson,T.J., Ogawa,T. and Pierce,J. TITLE Sequencing and modification of the gene encoding the 42 kilodalton protein in the cytoplasmic membrane of Synechococcus PCC 7942 JOURNAL Plant Physiol. 93, 305-311 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by J.Pierce, 20-MAR-1990. FEATURES from to/span description pept 141 1493 49-kD membrane protein (cmpA) BASE COUNT 452 a 508 c 487 g 438 t ORIGIN 1 ggttatcagc cttatcggtc tggaataacc agttggccta aagtcatgca gacagagcgt 61 ttctgcgcct ctcgtgaagc aattcgcaca acttgtccat ctttagaggc atctcctgtt 121 gtgggatgta ggggagacgt atgaacgaat ttcaaccagt caatcgtcgt cagtttctgt 181 tcacgctcgg agcaaccgct gctagcgcta ttttgctgaa gggttgcggt aatcctcctt 241 ccagtagcgg cggcgggact tctagtacaa ctcagccaac tgctgcaggg gcgagtgatc 301 tggaagtcaa gacaatcaaa ttgggctaca tccccatctt tgaagcggct ccactgatca 361 ttggccgcga aaaaggcttt tttgccaaat atggcttgga tgttgaagtc tcgaaacaag 421 ccagctgggc agctgctcgc gataacgtca ttctcggttc tgctggtggc ggcatcgatg 481 gcggtcagtg gcaaatgccg atgcctgcct tgctaacgga aggtgcgatc agcaacggtc 541 aaaaagttcc catgtatgtc ttggcttgct tgagcaccca aggcaatggc atcgctgttt 601 ccaatcagct caaggcccaa aatctgggct tgaagctagc gcccaaccgc gactttatcc 661 tcaactaccc gcaaactagc ggccggaagt tcaaagcatc ctacaccttc ccgaacgcca 721 accaagactt ctggattcgc tattggtttg cagctggcgg tatcgatcct gataaagaca 781 ttgaactctt gaccgttccc agcgcagaaa ctctacaaaa tatgcgcaat ggcacgatcg 841 attgcttcag taccggcgat ccctggccgt cgcggattgc caaagatgac atcggctatc 901 aagctgcgct gacaggtcaa atgtggcctt accaccccga ggaattcttg gcgctgcgag 961 cagactgggt agacaaacat ccgaaagcta cgctcgcctt gctgatgggc ttgatggaag 1021 cgcagcaatg gtgcgatcag aaagcaaatc gggcagagat ggccaagatc ctctccggtc 1081 gcaacttctt taacgtgccg gtttcgatcc tgcagccgat tctggaaggt caaatcaaag 1141 ttggagcaga cggaaaagat ctcaacaact ttgatgccgg cccgctcttc tggaagagtc 1201 cgcgcggcag tgtctcctat ccctacaaag ggctcaccct ctggttcttg gtggagtcga 1261 tccgctgggg cttcaacaag caagtgctac ctgacattgc agccgcccag aaactcaacg 1321 atcgcgtgac tcgtgaagac ctctggcaag aggcagccaa gaaattaggg gtgcccgctg 1381 cggatatccc aaccggatcg actcgcggta ccgagacctt ctttgatggc atcacctaca 1441 acccagacag tccgcaagct tatctccaaa gcttgaagat taaacgggca taagtagggg 1501 cttcaatcat caaccttagt tcagtcacta tcaggagata gacagaccat ggttactgca 1561 cgggaaacaa gacgaaacgg aagtcgtcct tctggcttaa aaaaatggcg tcagaaactc 1621 gatggcatct tgctaccgct agcaggaatt ttgggtttcc tcatcatttg gcagatcttt 1681 tctagcacgg gcaacccgct tgcccggccc tgctcagtct cttcacagaa gagagaacac 1741 gcgagttgct gccctatccc ttcttggatc gcggcgggct tgataaaggt ctgttctggc 1801 agacgtatcg cttagttctg acgcgggtgg cccagggctt ttcgatccgc agccatcatc 1861 ggcatcggca tttccgttgg aattc // LOCUS ECOOXYR 1264 bp ds-DNA BCT 11-JUL-1990 DEFINITION E.coli oxyR regulatory protein gene, complete cds. ACCESSION J04553 KEYWORDS oxyR gene; regulatory protein. SOURCE E.coli (strain K12, CSH50) DNA, clones pAQ17 and pMomR1200. ORGANISM Escherichia coli Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Facultatively anaerobic rods; Enterobacteriaceae. REFERENCE 1 (bases 1 to 1264) AUTHORS Christman,M.F., Storz,G. and Ames,B.N. TITLE Oxyr, a positive regulator of hydrogen peroxide-inducible genes in Escherichia coli and Salmonella typhimurium, is homologous to a family of bacterial regulatory proteins JOURNAL Proc. Natl. Acad. Sci. U.S.A. 86, 3484-3488 (1989) STANDARD full staff_review COMMENT Draft entry and printed sequence for [1] kindly submitted by G.Storz, 23-MAY-1989. FEATURES from to/span description pept 203 1120 oxyR protein mRNA 170 > 1264 oxyR mRNA signal 133 138 -35 region signal 157 162 -10 region binding 191 195 ribosome binding site BASE COUNT 289 a 308 c 368 g 299 t ORIGIN 89.6 minutes on K12 map. 1 ggatcctgga gatccgcaaa agttcacgtt ggctttagtt attcgagttg agaaactctc 61 gaaacgggca gtgacttcaa gggttaaaag aggtgccgct ccgtttctgt gagcaattat 121 cagtcagaat gcttgatagg gataatcgtt cattgctatt ctacctatcg ccatgaacta 181 tcgtggcgat ggaggatgga taatgaatat tcgtgatctt gagtacctgg tggcattggc 241 tgaacaccgc cattttcggc gtgcggcaga ttcctgccac gttagccagc cgacgcttag 301 cgggcaaatt cgtaagctgg aagatgagct gggcgtgatg ttgctggagc ggaccagccg 361 taaagtgttg ttcacccagg cgggaatgct gctggtggat caggcgcgta ccgtgctgcg 421 tgaggtgaaa gtccttaaag agatggcaag ccagcagggc gagacgatgt ccggaccgct 481 gcacattggt ttgattccca cagttggacc gtacctgcta ccgcatatta tccctatgct 541 gcaccagacc tttccaaagc tggaaatgta tctgcatgaa gcacagaccc accagttact 601 ggcgcaactg gacagcggca aactcgattg cgtgatcctc gcgctggtga aagagagcga 661 acgattcatt gaagtgccgt tgtttgatga gccaatgttg ctggctatct atgaagatca 721 cccgtgggcg aaccgcgaat gcgtaccgat ggccgatctg gcaggggaaa aactgctgat 781 gctggaagat ggtcactgtt tgcgcgatca ggcaatgggt ttctgttttg aagccggggc 841 ggatgaagat acacacttcc gcgcgaccag cctggaaact ctgcgcaaca tggtggcggc 901 aggtagcggg atcactttac tgccagcgct ggctgtgccg ccggagcgca aacgcgatgg 961 ggttgtttat ctgccgtgca ttaagccgga accacgccgc actattggcc tggtttatcg 1021 tcctggctca ccgctgcgca gccgctatga gcagctggca gaggccatcc gcgcaagaat 1081 ggatggccat ttcgataaag ttttaaaaca ggcggtttaa accgtttaac gcagctaccc 1141 gatagcttcc gccatcgtcg ggtagttaaa ggtggtgttg acgaagtact caatagtgtt 1201 gccgccacct ttctgttcca taatcgcctg accgatatga ataatttcgg cgagcgcgct 1261 cgcc // LOCUS CLLRRE 1860 bp ss-rRNA RNA 11-JUL-1990 DEFINITION C.sapidus 18S rRNA, 3' end. ACCESSION M34360 KEYWORDS 18S ribosomal RNA; ribosomal RNA. SOURCE C.sapidus rRNA. ORGANISM Callinectes sapidus Eukaryota; Animalia; Metazoa; Arthropoda; Crustacea; Malacostraca; Eucarida; Decapoda; Pleocyemata; Brachyura; Brachyrhyncha; Portunoidea; Portunidae. REFERENCE 1 (bases 1 to 1860) AUTHORS Kim,W. and Abele,L.G. TITLE Molecular phylogeny of selected decapod crustraceans based on 18S rRNA nucleotide sequences JOURNAL J. Crust. Biol. 10, 1-113 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable [or printed] sequence for [1] kindly submitted by L.G.Abele, 19-MAY-1990. FEATURES from to/span description rRNA 1 1860 18S ribosoma RNA BASE COUNT 359 a 375 c 421 g 367 t 338 others ORIGIN 1 nncctggtng atcctgccag nagtcntnng cttgtctcaa annttaagcc nngcatgtct 61 nagtacaagc cgaatnaagg cgaaaccgcg aatggctnnn taaatcagct atgattcatt 121 nnatctgtac ccncncnnac ttggataact gtggtaattc tanagctaat acatgcatta 181 cgtctctgac cgcaagggaa gagngctttt attagttcaa aaccggtcgg gcctcggtcc 241 gnnnccccac tgtgttgaat ctgaataact ttttgctgag cgcacggtct cngcncgcgc 301 ngcctctttc aagtgtctgc cttatcagct ttcgattgta ggttatacgc ctacnatggc 361 tntnacgggt nacggggaat gagggttcga ttccggagag ngagcctgag aaacggctac 421 cacntctnag gnnggcagca ggcacgcnna ttacccactc cggcncgggg aggtagtgac 481 naaaaataac gatgcgagac tcatccgngg cctcgnnatc ggaatgagtn cactttaaat 541 cctttnacga ggatctattg gagggcnagt ctggtgccng cagccncggt nattccagct 601 gcaatanngt atattaaagt tgttgcggtt annaaagctc gtagttnnat ttcagttctg 661 gactgacggt tnccgcnngg tgcacactgt cacnctccga acagccacaa caccgctggc 721 cnnnggggtg ctcttcnccn ggtgtccnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 781 nnnnnnnnnn nnnnnnnnnn nnnnnnnncc tgaatgccta tgcantggaa taatggaata 841 ggacctcggn nctnttttgt cggttttctg aacccgaggt aatgactaat aggaacnggc 901 gggggcnttc gtattgcgac gctagaggtg aaattcttgg accgtcgcna gacgaactnc 961 tgcgaaagca tttgccnagg atgtttcntt natcnagaan gaaagttaga ggttcgaagg 1021 cgatcagata ccgcnnnnnn nnnaaccnta aacgatgctg accagcgatc cgccggnntt 1081 attnncatga cccggccncc agcttccggg aaaccaaagt ctttgggttc cgggggaagt 1141 atggttgcaa agctgaaact caaaggaatt gacggnnnnn nnnnnnnnnn nnnnnnnnnn 1201 nnnnnnnnnn nnnnnnnnac acggggaacc tcaccaggcc cagacaccgg aagganngac 1261 agattgagag ctcnntctca ttnggtgggt ggtngtgcat nncgngttct tagttggtgg 1321 agcgnnnnnn nnnnnnnnnn ncgatnacga acgagannnn nnnnnnnnnn nnnnnnnnnn 1381 nnnnnnnnnn nnnnnngtgt ccagttcgca gcttcttctt agagggataa cggcaattct 1441 agccgcacga gattgagcaa taacaagtct gtgatgccct tagatgttct gggcgcacgc 1501 gcgctacact gaagggatca acgtgtcctc ccnctccgag aggagcgggn nncccgttga 1561 aatccnttca tgatagggat tggggtttgc aattgtctcc catgaannng gaattcccag 1621 taagcgcaag tcatgagctt gcgntgattn ngtccctncc nnttgtacac accnnnnntc 1681 gctactaccg attgaatgat ttagtgaggc ttcggactgg cgctcttgga tgccggnccc 1741 gagnggttcn ncgccggnnc ncggcgcctc gagctgacgg aaagatgtcc aaacttgatn 1801 nnnnnnnnnn nnnnnaagtc gtaacaaggt nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn // LOCUS HUMNCSRC 138 bp ds-DNA PRI 11-JUL-1990 DEFINITION Human membrane-associated tyrosine protein kinase (C-SRC) gene, exons 3, 4, NI, and NII, partial cds. ACCESSION M34469 KEYWORDS membrane-associated tryosine protein kinase. SOURCE Human adult brain DNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 138) AUTHORS Pyper,J.M. and Bolen,J.B. TITLE Identification of a novel neuronal C-SRC exon expressed in human brain JOURNAL Mol. Cell. Biol. 10, 2035-2040 (1990) STANDARD simple staff_review FEATURES from to/span description pept < 1 29 membrane-associated tyrosine protein kinase (C-SRC), exon 3 (AA at 1) 30 47 membrane-associated tyrosine protein kinase, exon NI 48 80 membrane-associated tyrosine protein kinase, exon NII 81 > 136 membrane-associated tyrosine protein kinase, exon 4 variant 117 117 a or g variant 135 135 c or t BASE COUNT 36 a 40 c 40 g 22 t ORIGIN 1 cggctccagc tccagattgt caacaacacg aggaaggtgg atgtcagcca gacctggttc 61 acattcagat ggctgcaaag agagggagac tggtggctgg cccactcgct cagcacagga 121 cagacaggct acatcccc // LOCUS MHVAPEPA 1000 bp ss-RNA VRL 11-JUL-1990 DEFINITION Murine coronavirus peplomer (S) protein gene. ACCESSION M34435 KEYWORDS peplomer protein. SOURCE Murine (strain JHM-DL) RNA. ORGANISM Murine coronavirus Viridae; ss-RNA enveloped viruses; Positive strand RNA virus; Coronaviridae. REFERENCE 1 (bases 1 to 1000) AUTHORS Banner,L.R., Keck,J.G. and Lai,M.M.C. TITLE A clustering of RNA recombination sites adjacent to a hypervariable region of the peplomer gene of Murine coronavirus JOURNAL Virology 175, 548-555 (1990) STANDARD simple staff_review BASE COUNT 258 a 192 c 227 g 317 t 6 others ORIGIN 1 tgagtctttg tcgtgtaata atattgatgc gtccaaagtg tatggtatgt gctttggtag 61 tgtctcagtt gataagtttg ctttcccccg aagccgtcaa attgattttc aaattggcaa 121 ctccggattt ttgcaaacgg ctaattataa gattgatatc gctgccacat catgtcagct 181 gtattacagt cttcctaaga ataatgttac cattaataac tataacccct cgtcttggaa 241 taggaggtat ggttttaatg atgctggtgt gtttggcaaa agtaaacatg atgttgccta 301 cgcccagnna tgttttnttg tgcgacctag ctattgtccg tgtgcacaac cggaaatagt 361 tagtgcttgc actagtcaga ccaaacccat gtctgcttat tgccccacag gcacaattca 421 tcgtgagtgt tctctttgga atgggcccca tttgcgctcg gcacgtgtag gttccggcac 481 gtacacgtgt gagtgcactt gtaaacccaa tccatttgat acgtatgatc tccgctgtgg 541 gcaaattaaa actattgtta atgtgggcga tcattgtgaa ggtctgggtg ttttagaaga 601 taaaggtggc aatagcgatc cacataaggg ctgttcttgt gccaatgatt cttttatcgg 661 atggtcacat gacacttgtt tagtaaatga tcgctgccca atttttgcta acatattgtt 721 aaatggcatt aatagtggga ctacgtgttc cacagattta caattgccta atactgaagt 781 ggccactggc gtttgcgtca gatatgacct ctatggtatt actggtcnag gtgtttttaa 841 agaggtcaag gcagnntatt ataatagctg gcaggcccta ttatatgatg ttaatggtaa 901 cttaaacggg ttccgtgacc ttaccactaa caagacttat acgataagga gctgttatag 961 tggccgtgtt tctgctgcat atcataaaga agcacccgaa // LOCUS MHVAPEPB 843 bp ss-RNA VRL 11-JUL-1990 DEFINITION Murine coronavirus peplomer (S) protein gene. ACCESSION M34436 KEYWORDS peplomer protein. SOURCE Murine (strain A59) RNA. ORGANISM Murine coronavirus Viridae; ss-RNA enveloped viruses; Positive strand RNA virus; Coronaviridae. REFERENCE 1 (bases 1 to 843) AUTHORS Banner,L.R., Keck,J.G. and Lai,M.M.C. TITLE A clustering of RNA recombination sites adjacent to a hypervariable region of the peplomer gene of Murine coronavirus JOURNAL Virology 175, 548-555 (1990) STANDARD simple staff_review FEATURES from to/span description site 400 401 deletion site BASE COUNT 227 a 153 c 183 g 280 t ORIGIN 1 tgagagtttg ttttgtaata atatcgatgc ttccaaagtg tatggcaggt gctttggtag 61 tatttcagtt gataagtttg ctgtaccccg aagtaggcaa gttgatttac agcttggtaa 121 ctctggattt ctgcagactg ctaattataa gattgataca gctgccactt cgtgtcagct 181 gcattacacc ttgcctaaga ataatgtcac cataaacaac cataacccct cgtcttggaa 241 taggaggtat ggctttaatg atgctggcgt ctttggcaaa aaccaacatg acgttgttta 301 cgctcagcaa tgttttactg taagatctag ttattgcccg tgtgctcaac cggacatagt 361 tagcccttgc actactcaga ctaagcctaa gtctgctttt ttaatgtggg tgaccattgt 421 gaaggcttag gtgttttaga agataattgt ggcaatgctg atccacataa gggttgtatc 481 tgtgccaaca attcatttat tggatggtca catgatacct gccttgttaa tgatcgctgc 541 caaatttttg ctaatatatt gttaaatggc attaatagtg gtaccacatg ttccacagat 601 ttgcagttgc ctaatactga agtggttact ggcatttgtg tcaaatatga cctctacggt 661 attactggac aaggtgtttt taaagaggtt aaggcagact attataatag ctggcaaacc 721 cttctgtatg atgttaatgg taatttgaat ggttttcgtg atcttaccac taacaagact 781 tatacgataa ggagctgtta tagtggccgt gtttctgctg catttcataa agatgcaccc 841 gaa // LOCUS MHVSP 3780 bp ss-mRNA VRL 11-JUL-1990 DEFINITION Mouse hepatitis virus surface protein S mRNA, complete cds. ACCESSION X04797 M34437 KEYWORDS glycoprotein; membrane glycoprotein; surface glycoprotein; surface projection glycoprotein. SOURCE Murine hepatitis virus (strain JHM). cDNA to viral RNA, clones pJMS1010, pJS112, and pJS92. ORGANISM Murine hepatitis virus A59 Unclassified. REFERENCE 1 (bases 1 to 3780) AUTHORS Schmidt,I., Skinner,M. and Siddell,S. TITLE Nucleotide Sequence of the Gene Encoding the Surface Projection Glycoprotein of Coronavirus MHV-JHM JOURNAL J. Gen. Virol. 68, 47-56 (1987) STANDARD simple automatic COMMENT EMBL features not translated to GenBank features: key from to description SITE 24 30 put. region of homology preceeding mRNA 5' initiation codons SITE 121 129 pot. N-glycosylation site SITE 208 216 pot. N-glycosylation site SITE 430 438 pot. N-glycosylation site SITE 604 612 pot. N-glycosylation site SITE 1099 1107 pot. N-glycosylation site SITE 1333 1341 pot. N-glycosylation site SITE 1636 1644 pot. N-glycosylation site SITE 1732 1740 pot. N-glycosylation site SITE 1756 1764 pot. N-glycosylation site SITE 1825 1833 pot. N-glycosylation site SITE 1900 1914 put. proteolytic cleavage site SITE 1972 1980 pot. N-glycosylation site SITE 2023 2031 pot. N-glycosylation site SITE 2293 2301 pot. N-glycosylation site SITE 2440 2448 pot. N-glycosylation site SITE 3139 3147 pot. N-glycosylation site SITE 3301 3309 pot. N-glycosylation site SITE 3331 3339 pot. N-glycosylation site SITE 3388 3396 pot. N-glycosylation site SITE 3436 3444 pot. N-glycosylation site SITE 3499 3507 pot. N-glycosylation site SITE 3553 3654 pot. transmembrane domain SITE 3622 3675 cysteine-rich region SITE 3655 3660 charge cluster SITE 3715 3723 pot. N-glycosylation site FEATURES from to/span description pept 31 3738 surface protein S precursor sigp 31 60 surface protein S signal peptide matp 61 3736 surface protein S BASE COUNT 1030 a 718 c 800 g 1232 t ORIGIN 1 cttgtagttt aaatctaatc taatctaaac atgctgttcg tctttatttt actattaccc 61 tcttgtttag ggtatattgg tgattttaga tgtatccaga ccgtgaatta taacggcaat 121 aatgcttctg cgcctagcat tagcaccgaa gcagtcgatg tttccaaagg tcggggcact 181 tactatgttt tagatcgtgt ttacttaaat gccacgttat tgcttactgg ttattatcct 241 gtggacggtt ccaattatcg gaatctcgcg cttacaggca ctaatacctt aagccttacg 301 tggtttaaac caccctttct aagtgagttt aatgatggta tatttgctaa ggtccagaac 361 ctcaagacaa atacgccaac aggtgcaacc tcatattttc ccactatagt tataggtagt 421 ttgtttggta acacttccta taccgtagtt ttagagccat ataataatat tataatggct 481 tctgtttgta catataccat ttgtcaatta ccttacacac cctgtaagcc taataccaat 541 ggtaatcgtg ttattggatt ttggcacaca gatgtcaaac cgccgatttg tcttttaaag 601 cgtaatttta cgtttaatgt taatgcccct tggctttatt tccattttta tcagcagggt 661 ggtacttttt atgcgtacta tgcggataaa ccttccgcta ctacgttttt gtttagtgtg 721 tatattggcg acattttaac acagtatttt gtgttacctt ttatttgtac tccaacagct 781 ggtagcactt tagctccgct ctattgggtt acacctttac ttaagcgcca atatttgttt 841 aattttaatg aaaagggtgt cattactagt gctgttgatt gcgccagcag ctacattagt 901 gaaataaaat gtaagaccca aagtctctta ccgagtactg gtgtctatga tctatccggt 961 tacacggtcc aacctgttgg agttgtgtac cggcgtgttc ctaacctacc tgattgtaaa 1021 atagaggaat ggctcactgc taaatctgtg ccgtcacctc tcaattggga gcgtaggact 1081 ttccaaaatt gtaattttaa tttaagcagc ctgctacgtt atgtccaggc tgagtctttg 1141 tcgtgtaata atattgatgc gtccaaagtg tatggtatgt gctttggtag tgtctcagtt 1201 gataagtttg ctatcccccg aagccgtcaa attgatttac aaattggcaa ctccggattt 1261 ttgcaaacgg ctaattataa gattgatacc gctgccacat catgtcagct gtattacagt 1321 cttcctaaga ataatgttac cataaataac tataacccct cgtcttggaa taggaggtat 1381 ggttttaaag taaatgatcg ctgccaaatt tttgctaaca tattgttaaa tggcattaat 1441 agtgggacta cgtgttccac agatttacaa ttgcctaata ctgaagtggc cactggcgtt 1501 tgcgtcagat atgacctcta tggtattact ggtcaaggtg tttttaaaga ggtcaaggct 1561 gactattata atagctggca ggccctatta tatgatgtta atggtaactt aaacgggttc 1621 cgtgacctta ccactaacaa gacttatacg ataaggagct gttatagtgg ccgtgtttct 1681 gctgcatatc ataaagaagc acccgaaccg gctctgctct atcgtaatat aaattgtagt 1741 tatgttttta ctaataatat ttcccgtgag gaaaaccccc ttaactattt tgatagttat 1801 ttgggttgtg ttgttaatgc tgataaccgc acggatgagg cgcttcctaa ttgcaatctc 1861 cgtatgggtg ctggactatg cgtagattat tcaaagtcac gcagagcccg ccgatcagtt 1921 tctactggct atcgattaac cacattcgag ccatacatgc cgatgttagt caatgatagc 1981 gttcaatccg taggtggatt atatgagatg caaataccaa ccaattttac tattggtcat 2041 catgaggaat tcatccagat aagggctccc aaggtgacta tagattgtgc tgcatttgtt 2101 tgtggtgata acgctgcatg cagacagcag ttggttgagt atggctcttt ttgtgataat 2161 gttaatgcca ttcttaatga ggttaataac ctcttggata atatgcaatt acaagttgct 2221 agtgcattaa tgcagggtgt tactataagt tcgaggctgc cagatggcat ctccggccct 2281 atagatgaca ttaatttcag tcctctactt ggatgcatag gttcaacatg tgctgaagac 2341 ggcaatggac ctagtgcgat acgggggcgt tcagctatag aggatttatt atttgacaag 2401 gtcaaactat ctgacgttgg ctttgtcgag gcttataaca attgcactgg tggtcaagaa 2461 gttcgcgacc tcctttgcgt acagtctttt aatggcatca aagtattacc tcccgtgttg 2521 tctgagagtc aaatctctgg ctacacagcg ggtgctactg cggcagctat gttcccacct 2581 tggactgcag ctgctggtgt gccattcagt ttaaatgttc aatataggat taatggttta 2641 ggtgtcacta tgaatgttct tagtgagaac caaaagatga ttgctagtgc ttttaacaac 2701 gcgctcggtg ctattcagga agggttcgat gcaaccaatt ctgctctagg taagatccag 2761 tccgttgtta atgcaaacgc tgaagcactt aataatttat taaaccaact ttctaatagg 2821 tttggtgcta ttagtgcttc tttacaagaa attctaacgc ggcttgacgc tgtagaagca 2881 aaggcccaga tagatcgtct tattaatggc aggttaactg cacttaatgc gtatatatcc 2941 aagcaactca gtgatagtac gcttattaaa tttagtgctg ctcaggccat cgaaaaggtc 3001 aatgagtgcg ttaagagcca aactacgcgc attaatttct gtggcaatgg taatcacata 3061 ttatcacttg tccagaatgc gccttatggc ttatgtttta ttcatttcag ctacgtgcca 3121 acatccttta aaacggcaaa tgtgagtcct ggactatgca tttctggtga tagaggattg 3181 gcacctaaag ctggatattt tgttcaagat aatggagagt ggaagttcac aggcagtaat 3241 tattactacc ctgaacccat tacagataaa aatagtgttg ccatgatcag ttgcgctgtg 3301 aattacacaa aagcgcctga agttttcttg aacaactcaa taccaaatct acccgacttt 3361 aaggaggagt tagataaatg gtttaagaat cagacgtcta ttgcgcctga tttatccctc 3421 gatttcgaga agttaaatgt tactttcctg gacctgactt atgagatgaa caggattcag 3481 gatgcaatta agaagttaaa tgagagctac atcaacctca aggaagttgg cacatatgaa 3541 atgtatgtga aatggccttg gtatgtttgg ttgctaattg gtttagctgg tgtagctgtt 3601 tgtgtgttat tattctttat atgttgctgc acaggttgcg gctcatgttg ttttagaaaa 3661 tgcggaagtt gttgtgatga gtatggagga caccaggaca gtattgtgat acataatatt 3721 tcagcccatg aggattgact atcacagcct ctcctggaaa gacagaaaat ctaaacaatt // LOCUS MUSIGLAZ 713 bp ss-mRNA ROD 11-JUL-1990 DEFINITION Mouse Ig active lambda-chain mRNA Vx-J2-C2-region, complete cds. ACCESSION M34598 M29013 J03562 KEYWORDS constant region; immunoglobulin; immunoglobulin light chain; joining exon; lambda-immunoglobulin; processed gene; variable region. SOURCE Mouse (strain Balb/c AnPt) liver hybridoma B6, cDNA to mRNA, clone Y31. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 713) AUTHORS Sanchez,P., Marche,P.N., Le Guern,C. and Cazenave,P.-A. TITLE Structure of a third murine immunoglobulin lambda light chain variable region that is expressed in laboratory mice JOURNAL Proc. Natl. Acad. Sci. U.S.A. 84, 9185-9188 (1987) STANDARD simple staff_entry REFERENCE 2 (bases 2 to 450) AUTHORS Sanchez,P., Marche,P.N., Rueff-Juy,D. and Cazenave,P.-A. TITLE Mouse V-lambda-x gene sequence generates no junctonal diversity and is conserved in mammalian species JOURNAL J. Immunol. 144, 2816-2820 (1990) STANDARD simple staff_review REFERENCE 3 (bases 266 to 429) AUTHORS Sanchez,P. and Cazenave,P.-A. TITLE A new variable region in mouse immunoglobulin lambda light chains JOURNAL J. Exp. Med. 166, 265-270 (1987) STANDARD simple staff_entry FEATURES from to/span description pept 12 > 713 Ig lambda chain precursor V-x,J-2,C-2 region sigp 12 68 Ig lambda chain signal peptide matp 69 > 713 Ig lambda chain recomb 379 380 V-region end/J2-region start recomb 414 415 J2-region end/C2-region start BASE COUNT 175 a 190 c 159 g 189 t ORIGIN 1 gtacctgcat tatggcctgg actcctctct tcttcttctt tgttcttcat tgctcaggtt 61 ctttctccca acttgtgctc actcagtcat cttcagcctc tttctccctg ggagcctcag 121 caaaactcac gtgcaccttg agtagtcagc acagtacgta caccattgaa tggtatcagc 181 aacagccact caagcctcct aagtatgtga tggagcttaa gaaagatgga agccacagca 241 caggtgatgg gattcctgat cgcttctctg gatccagctc tggtgctgat cgctacctta 301 gcatttccaa catccagcct gaagatgaag caatatacat ctgtggtgtg ggtgatacaa 361 ttaaggaaca atttgtgtat gttttcggcg gtggaaccaa ggtcactgtc ctaggtcagc 421 ccaagtccac tcccactctc accgtgtttc caccttcctc tgaggagctc aaggaaaaca 481 aagccacact ggtgtgtctg atttccaact tttccccgag tggtgtgaca gtggcctgga 541 aggcaaatgg tacacctatc acccagggtg tggacacttc aaatcccacc aaagagggca 601 acaagttcat ggccagcagc ttcctacatt tgacatcgga ccagtggaga tctcacaaca 661 gttttacctg tcaagttaca catgaagggg acactgtgga gaagagtctg tct // LOCUS MUSIGLVD 681 bp ds-DNA ROD 11-JUL-1990 DEFINITION Mouse Ig germline lambda-chain gene Vx-J2-C2-region, complete cds. ACCESSION M34597 KEYWORDS constant region; germline; immunoglobulin light chain; joining exon; lambda-immunoglobulin; variable region. SOURCE Mouse (strain Balb/c AnPt) liver DNA, clone 30X2. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 681) AUTHORS Sanchez,P., Marche,P.N., Rueff-Juy,D. and Cazenave,P.-A. TITLE Mouse V-lambda-x gene sequence generates no junctonal diversity and is conserved in mammalian species JOURNAL J. Immunol. 144, 2816-2820 (1990) STANDARD simple staff_review FEATURES from to/span description pept 59 104 Ig lambda-chain precursor Vx-J2-C2 region, exon 1 209 531 Ig lambda-chain precursor Vx-J2-C2 region, exon 2 sigp 59 104 Ig lambda-chain signal peptide 209 219 Ig lambda-chain signal peptide matp 220 528 Ig lambda-chain IVS 105 208 Ig lambda-chain Vx-J2-C2 region intron A recomb 530 531 Vx-region end/J2-region start recomb 565 566 J2-region end/C2-region start BASE COUNT 179 a 156 c 136 g 210 t ORIGIN Chromosome 16. 1 tgaaccatag agagaactac aacctgctgt ctcagcagag atcagtagta cctgcattat 61 ggcctggact cctctcttct tcttctttgt tcttcattgc tcaggtcagg agaaccattt 121 gtaccctgaa cctcagttca tctgagaggc agatacattc tatatctgtc tgtaaatgtc 181 aggaaataaa cagtttctct attttcaggt tctttctccc aacttgtgct cactcagtca 241 tcttcagcct ctttctccct gggagcctca gcaaaactca cgtgcacctt gagtagtcag 301 cacagtacgt acaccattga atggtatcag caacagccac tcaagcctcc taagtatgtg 361 atggagctta agaaagatgg aagccacagc acaggtgatg ggattcctga tcgcttctct 421 ggatccagct ctggtgctga tcgctacctt agcatttcca acatccagcc tgaagatgaa 481 gcaatataca tctgtggtgt gggtgataca attaaggaac aatttgtgta accacagtaa 541 cggagataaa ggaggaagca ggacagaaac tttttttttt ctcttcaaag gtcttttcta 601 ccagaatcat tggttttttt ttttcttttt tgcttattaa taaagtagat agtctagcaa 661 tcctcttgga cttcgtaggg c // LOCUS PAERRE 1877 bp ss-rRNA RNA 11-JUL-1990 DEFINITION P.kadiakensis 18S rRNA, 3' end. ACCESSION M34359 KEYWORDS 18S ribosomal RNA; ribosomal RNA. SOURCE P.kadiakensis rRNA. ORGANISM Palaemonetes kadiakensis Eukaryota; Animalia; Metazoa; Arthropoda; Crustacea; Malacostraca; Eucarida; Decapoda; Dendrobranchiata; Caridea; Palaemonoidea; Palaemonidae. REFERENCE 1 (bases 1 to 1877) AUTHORS Kim,W. and Abele,L.G. TITLE Molecular phylogeny of selected decapod crustraceans based on 18S rRNA nucleotide sequences JOURNAL J. Crust. Biol. 10, 1-113 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable [or printed] sequence for [1] kindly submitted by L.G.Abele, 19-MAY-1990. FEATURES from to/span description rRNA 1 1877 18S ribosoma RNA BASE COUNT 339 a 321 c 399 g 313 t 505 others ORIGIN 1 nncctggtng atcctgccag nagtcntnng cttgtctcaa annttaagcc angcatgtgt 61 cagtacaggc cgctctaagg cgaaaccgcg aatggctnnn taaatcagtt atcattcatt 121 tnatctaaaa cnnnnnnnnn nnnnggnnaa nnnnggnaan ncnanagcnn nanacgtgac 181 ttgtnaacnc cgacnggaag ggaggagngc ttntattagt tgaaaaccaa gcgggccncg 241 gtccgnnnnn nnnnctgtga tgactctgaa tnactttgtg cagagagcac ggnctnngca 301 ccggctccgt atctttcgag tttctgcctt atcatgctgt ggattgtagg ccatgcgcct 361 ncngtngctg ttncgggtga cggagaatca ggnntcgatt ccggagaggg agcctgagna 421 acggctacca catccaaggn nggcagcagg cacnnnnatt acccaatccc agctctggga 481 ggtagtgacn aaaaataaca atgcgggact cttccgagtc tgcgtaattg gaatgagcac 541 actttaaatc ctttagcaac naccnattgg agggcaagtc tggtgccagc agccgcggtn 601 attcnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 661 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 721 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 781 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnatgtncct tgcatggaac 841 tgatggaaga ctgatctcgg ttccacnttc ttggtggtgg gagccagagg taatgatcna 901 gagggnctgt cnnnnnnntc cgtactacga cgcgagaggt gaaattcagt gaccgtcgta 961 ggacgaacca cagcgaaagc atttgccnag aatgtcttcg ttgatcnaga angaaagtta 1021 gaggatcgaa ggcgatcaga tacnnnnnan gaaagaaccn taaacgatgc tgactngcaa 1081 ttcgcngnng ttnttcccat gacgtgcgag acgcccccgg gaaacctcaa gtctttgagt 1141 tccgggggaa gtatggttgc aaaactgaaa ctcaaaggaa ttgacggnnn nnnnnnnnnn 1201 nnnnnnnnnn nnnnnnnnnn nnnnnnnnna acacgggaaa cctcaccagg cccggacacc 1261 agaagganng acagatnnag agctctttct cgatttggtg ggtnnnnntg catggcngtt 1321 cttagttggt ggagtgannn nnnnnnnnnc gatnacgaac gagannnnnn nnnnnnnnnn 1381 nnnnnnnnnn nnnnnnnnnn nnnnccccng ttcgannnng tcttcttnga gggatgagcn 1441 gcgagtntag ctgcaggaga ttgagcaata acangtctgt gatgccctta gatgtcctgg 1501 gcgcacgcgc gctacactga atgggttagc gggttgtcct tctccgagag gagcgggnna 1561 tcgcgtgaaa accattcgtg atngggattg gggcttgcaa ttgtttcccn atgaangagg 1621 aattcccagt aagcgcaagt catcagcttg cgntgattnn gtccctnccc nttgtacaca 1681 cngnnnntcg ctactaccga ttgaatgatt agtgaggctt cggactggcg gtcctggact 1741 gggtcggcgg gtcncnccca gcnntgggnt tccgccnnct cgcctggacg ggccggaaag 1801 atgtccaaac ttgatnnnnn nnnnnnnnnn naagtcgtaa caaggtnnnn nnnnnnnnnn 1861 nnnnnnnnnn nnnnnnn // LOCUS PBESVBRA 584 bp ds-DNA INV 11-JUL-1990 DEFINITION P.berghei telomeric repeat region subfragment alpha DNA. ACCESSION M34601 KEYWORDS . SOURCE P.berghei DNA, clone pTel.1. ORGANISM Plasmodium berghei Eukaryota; Animalia; Protozoa; Microspora; Microsporea; Microsporida; Haemosporina; Plasmodiidae. REFERENCE 1 (bases 1 to 584) AUTHORS Dore,E., Pace,T., Ponzi,M., Picci,L. and Frontali,C. TITLE Organization of subtelomeric repeats in Plasmodium berghei JOURNAL Mol. Cell. Biol. 10, 2423-2427 (1990) STANDARD simple staff_review FEATURES from to/span description rpt 461 541 27 bp repeats BASE COUNT 205 a 41 c 85 g 251 t 2 others ORIGIN 1 tcgacaanta caacattatc tataaaagat gttttataca tctaacattt ttagtaatac 61 ataaaaaata cactatatat atgtgtataa taaattcata aattataaat atatataatc 121 atcacttttt taatttcaat aatttacatt tatgttaaaa ttataattta tattgatata 181 aatagttctc tatatattaa tttatttact ataaaggtat aataatatat taatcactat 241 taatttataa atttgatagt tttgaggtat aaataaatta tattttaaat agttaaatat 301 aatatataat aaatgtaatg tcatattttc tataatactt ataaacaatt cgtatataaa 361 attagcgtta ttgtactaat atatataata ttgtatcaat gactaaaact gaaatatgtt 421 aatttggttt agggtttatg gttcaggttt aggtttntgg tttagggttc aggtttatgg 481 ttcagggttt agggttcagg tttatggttc agggtttagg gttcaggttt atggttcagg 541 gtttagggtt tgtggtttag ggtttatggt ctatggttgt tcga // LOCUS PBESVBRB 593 bp ds-DNA INV 11-JUL-1990 DEFINITION P.berghei telomeric repeat region subfragment a DNA. ACCESSION M34602 KEYWORDS . SOURCE P.berghei DNA, clone pTel.1. ORGANISM Plasmodium berghei Eukaryota; Animalia; Protozoa; Microspora; Microsporea; Microsporida; Haemosporina; Plasmodiidae. REFERENCE 1 (bases 1 to 593) AUTHORS Dore,E., Pace,T., Ponzi,M., Picci,L. and Frontali,C. TITLE Organization of subtelomeric repeats in Plasmodium berghei JOURNAL Mol. Cell. Biol. 10, 2423-2427 (1990) STANDARD simple staff_review FEATURES from to/span description rpt 484 510 27 bp repeat motif BASE COUNT 209 a 40 c 97 g 247 t ORIGIN 1 tcgacaatac aacattatct ataaaagatg ttttatacat ctaacatttt tagtaataca 61 taaaaaatac actatatata tgtgtataat aaattcataa attataaata tatataatac 121 tcactttttt aatttcaata atttacattt atgttaaaat tataatttat attgatataa 181 atagttctct atatattaat ttatttacta taaaggtata ataatatatt aatcactatt 241 aatttataaa tttgatagtt ttgaggtata aataaattat attttaaata gttaaaatat 301 aaatatataa ataaaatgta atgtcatatt tttctataat acttataaac aattcggtat 361 ataaaattag cgttattgta ctaatatata taatattgta tcaatgacta aaactgaaat 421 atgttaattt gggtttaggg gtttatggtt cagggtttag ggtttgtggt ttagggtttg 481 tggtttaggg ttcaggttta tggttcaggg tttagggttc agggttcagg tttagggttt 541 agggtttagg gttcagggtt cagggttcag ggtttagggt ttagggttta ggg // LOCUS PEURRE 1902 bp ss-rRNA RNA 11-JUL-1990 DEFINITION P.aztecus 18S rRNA, 3' end. ACCESSION M34362 KEYWORDS 18S ribosomal RNA; ribosomal RNA. SOURCE P.aztecus rRNA. ORGANISM Penaeus aztecus Eukaryota; Animalia; Metazoa; Arthropoda; Crustacea; Malacostraca; Eucarida; Decapoda; Dendrobranchiata; Penaeoidea; Penaeidae. REFERENCE 1 (bases 1 to 1902) AUTHORS Kim,W. and Abele,L.G. TITLE Molecular phylogeny of selected decapod crustraceans based on 18S rRNA nucleotide sequences JOURNAL J. Crust. Biol. 10, 1-13 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable [or printed] sequence for [1] kindly submitted by L.G.Abele, 19-MAY-1990. FEATURES from to/span description rRNA 1 1902 18S ribosoma RNA BASE COUNT 340 a 369 c 427 g 332 t 434 others ORIGIN 1 nncctggtng atcctgccag nngtcntnng cttgtctcaa agattaagcc nngcatgtgt 61 aagtacaggc cgacnnaagg cgaaaccgcg gacggcnnnn taaatcagat ataactcatt 121 nnatctctgc tgaacnncnt nnnnnnttgg ataactgtgg taattctaga nnnnnacatg 181 cctttgtann ctccgaccgc gagggaggag ngcttttatt agaccaaaac cctcggcagc 241 nnnntcccgc aagggncnag cagcacacat cttggtgaat cagaataact tttgccgagg 301 cacgacccct ccgtaacnng ggntgggncg gcgccgcgtc ctgcaggcgt ctgccttatc 361 agctctcgat tgtaggttaa acgcctacaa tggctatnnn gggtnacggg gaatnnnnnn 421 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnggcag 481 cangcgcnnn nattacccnc tcggcncggg gagnnagtga cnaaaaatac tgttngnnga 541 ccccgngncc tcgcnattgg aatgagtnca ctttaaatcc ttgtacgagg atcgagtgga 601 gggcaannnn nnnnnnagcn gccgcgnnna ttccagctcc actagcgtat attaaagttg 661 ttgcggttga aacgctcgta gtttgacttc tgctcggacg gcggncttnn cngctactgc 721 cgnnttccga gctgtgtccc cngccggcgc acatggggnt nnnntgcctt aannncgggn 781 gtcccctnnn nnnnnnnccg ttactttgaa aaaattagag ngcnnagagc aggcnngnnn 841 nnnnnnncag cccgaatggt cgtgcatgga atgatggaac aggacctcgg ntctattttg 901 tcggtttttc ggaacccgag gnnatgattn atagaagcag acgggggnnt tcgtactgcg 961 acgctagagg tgaaattctt agaccgtcgc atgacgacct nctgcgaaag catctgccna 1021 ggatgttttc attgatcaag aangaaagtt agaggttcga aggcgatcag atacngcncn 1081 ngttctaacc ttaaacgatg ctgactagcg atccgccgca gttattnnca tgacccggcg 1141 nnnagcttcc gggaaaccaa agtctttggg ttccggggga agtatggttg caaagctgaa 1201 actcaaagga attgacggnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 1261 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 1321 nnnnnnnnnn nnnnngtgca tgggtcgngt tcttagttgg tggagtgatc nnnnnnnnnn 1381 nnnnncgatn acgaacgaga nnnnnnnnnn nnnnnnnnnn nnnnnggcgc cggnaacngg 1441 cgntcntcgc ngtcttcttc ttagagggat aagcggcagc naaaaatata ctagccgcac 1501 gagagtttga gccataacan gtctgtgatg cccttagatg ttctgggcgc acgcgcgcta 1561 caatggagag ttcagcgagc tngncccnct ccgagaggag cgggnncctg cgtgaaagct 1621 gtccttaaag gggattgggg cttgcaaatg ttcccnatga nnnnggaatt cccagtagcg 1681 caattcncca gattgcgcgg atttagtccc tacccnttgt acacaccgcc nntcgctact 1741 accgattgaa tggtctagtg agggnnccgg actngcgccc ntggagccct accctcngcg 1801 ncngcgccct cgggtcgacg gaaaggtgtc caagctgggt nnnnnnnnnn nnnnnnaagt 1861 cgtaacaagg tnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nn // LOCUS POCRRE 1874 bp ss-rRNA RNA 11-JUL-1990 DEFINITION P.ascensionis 18S rRNA, 3' end. ACCESSION M34358 KEYWORDS 18S ribosomal RNA; ribosomal RNA. SOURCE P.ascensionis rRNA. ORGANISM Procaris ascensionis Eukaryota; Animalia; Metazoa; Arthropoda; Crustacea; Malacostraca; Eucarida; Decapoda; Dendrobranchiata; Caridea; Procaridoidea; Procarididae. REFERENCE 1 (bases 1 to 1874) AUTHORS Kim,W. and Abele,L.G. TITLE Molecular phylogeny of selected decapod crustraceans based on 18S rRNA nucleotide sequences JOURNAL J. Crust. Biol. 10, 1-13 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable [or printed] sequence for [1] kindly submitted by L.G.Abele, 19-MAY-1990. FEATURES from to/span description rRNA 1 1874 18S ribosoma RNA BASE COUNT 298 a 291 c 331 g 266 t 688 others ORIGIN 1 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nngcatgtct 61 aagcacaggc cgaactaagg ctaagccgcg aatggcnnnn taaatcagtt atggttcatt 121 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnac 181 ccacgctccg accgcgaggg aggagngctt ttattagttg aaaaccaacc gggccncggt 241 ccgcnaaaga canctgtggt gaagctgaat aactttgtgc cgagcgcacn gncnnnncac 301 cggcgccgat tccttcgagt gtctcgctta tcaggcngtc gattgtaggt tatgtgccnn 361 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 421 nnnnnnnnnn nnnnnnnnnn nnnnnnnngg cagcaggcan nnnnattacc cactcccggc 481 ttggggaggt agtgacnaaa aataacgatg cgggactcat ccgaggccnc gcaattggaa 541 tgagtacact ttaantcctt taacgaggac ccannnnnnn nnnnnnnnnn nnnnnnnnnn 601 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 661 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 721 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnagtt 781 taccttgaac aaatcagagt gctcagagca ggctaattna atggcccgct atgtttcctg 841 catggaatga tggaagatga cctcggttcc attttgtttg ttttcggaac ccgaggnnat 901 gatgaataga gacggacggg ggcatccgnn ctgcgacgtg agaggtgaaa ttcttggaat 961 gtcgnnagac gaacgacagc gaaagcattt gccaagtatg tcttcgttaa tcaagaanga 1021 aagttagagg ttcgaaggcg atcagatacc gcccnngttc taaccataaa cgatgctgac 1081 cagcgatccg ccggcgttat tcccatgacg cggcggnnag ctactccggg aaaccaaagt 1141 cnntgagttc cgggggtann nnnnnnnnnn nnnnnaaact caaaggaatt gacggnnnnn 1201 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 1261 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnntgca 1321 tnnnngttct tagttggtgg agcgatttgt ctggttaatt ccgataacga angagactnt 1381 ggcctactaa ctagtcgacg ggtctccagc nnttggtgcc cagttcgcaa catcttctta 1441 gagggataag cggcaattct agccgcacga gattgagcaa taacaagtct gtgatgccct 1501 tagatgtcct gggcncacgc gcgctacact gaagggggca gcgggnntcc nctccgagag 1561 gagcgggnaa ccncttgaaa acctntcatg atagggactg gggcntgtaa ttgnttccca 1621 tgaacgagga anncccagta agcgcaagtg nnnnnnntgc gctgattnng tcccnnccnn 1681 ttgtacacac cnnnnntcgc tactaccgat tgaatgattt agtgaggctt cggactggcg 1741 ctcctngaac gaccccatcc ganngggnnc ccnggnnctc ctcgagtcga cgganngatg 1801 tccaaacttg annnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 1861 nnnnnnnnnn nnnn // LOCUS PRARRE 1869 bp ss-rRNA RNA 11-JUL-1990 DEFINITION P.leonensis 18S rRNA, 3' end. ACCESSION M34363 KEYWORDS 18S ribosomal RNA; ribosomal RNA. SOURCE P.leonensis rRNA. ORGANISM Procambarus leonensis Eukaryota; Animalia; Metazoa; Arthropoda; Crustacea; Malacostraca; Eucarida; Decapoda; Pleocyemata; Astacidea; Astacoidea; Cambaridae. REFERENCE 1 (bases 1 to 1869) AUTHORS Kim,W. and Abele,L.G. TITLE Molecular phylogeny of selected decapod crustraceans based on 18S rRNA nucleotide sequences JOURNAL J. Crust. Biol. 10, 1-13 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable [or printed] sequence for [1] kindly submitted by L.G.Abele, 19-MAY-1990. FEATURES from to/span description rRNA 1 1869 18S ribosoma RNA BASE COUNT 409 a 412 c 459 g 419 t 170 others ORIGIN 1 nncctggttg atcctgccag nagtcatnng cttgtctcaa anattaagcc nngcatgtgt 61 aagtacaagc cgagttaagg cgaaaccgcg aatggcncnn taaatcagct atgtttcatt 121 ggatctgtaa acnnncnnnn acttggataa ctgtggtaat tctagagctn atacatgcat 181 cacgtctctg accgcaaggg aagagcgctt ttattagttc aaaactggtc gggcctcggt 241 ccgttnaccc acccgtggtg aatctgaata actttttgct gagcgcacgg nctccgcacc 301 ggcgccgcat ccttcaagtg tctgccttat cagctttcga ttgtaggtta tgcgcctaca 361 atggctataa cgggtaacgg ggaatcaggn ttcnattccg gagagggagc ctgagaaacg 421 gctaccacat ctaaggcagg cagcaggcac gcnnattacc cactcccggc acggggaggt 481 agtgacnaaa aataacgatg cgagactcat ccgaggcctc gcaatcggaa tgagtacact 541 ttaaancctt taacgaggat ctattggagg gcnagtctgg tgccagcagc cgcggtaatt 601 ccagctccaa tanngtatat taaagttgtt gcggttnnaa agctcgtagt tggatctcag 661 ttccggactg acggtacacg cnnggtgctt actgtcacgc tccgaacagc taactagccc 721 cgccggccag tggggtgctc ttcatcgagt gtcccgagtg gccggnncgt ttactttgnn 781 nnnattagag tgctcagagc nggcnncnnn natggcctga atgtctatgc actggaataa 841 tggaatagga cctcggttct attttgttgg ttttcggaac ctgaggtaat gactaatagg 901 aacaggcggg ggcattcgta ttgcgacgct agaggtgaaa ttcttggacc gtcgcnagac 961 gaactactgc gaaagcattt gccaaggatg ttttcattaa tcaagaanga aagttagagg 1021 ttcgaaggcg atcagatacc gcncnngttn naaccataaa cgatgccaac tagcgatccg 1081 ccggcgttat tcccatgacc cggcngncag cttccgggaa accaaagtct ttgggttccg 1141 ggggaagtat ggttgcaaag ctgaaactca aaggaattga cggnnnnnnn nnnnnnnnnn 1201 nnnnnnnnnn nnnnnnnnnn nnnnnaacac ggggaacctc accaggccca gacaccggaa 1261 ggatngacag attgagagct ctttctcgat tcggtgggtg gtngtgcatg gccgttctta 1321 gttggtggag cgatttgtct ggttaattcc gatnnnnnnn gagactctgg cctattaact 1381 agtcgacgga tctccagcnn ttggtgtcca gttcgcaact tcttcttaga gggattacgg 1441 caattctagc cgcacgagat tgagcaataa caggtctgtg atgcccttag atgttctggg 1501 cgcacgcgcg ctacactgaa gagatcaacg tgttctcccc ctccgagagg agcgggnaac 1561 ccgttcaatc cccttcatga tagggattgg ggcttgcaat tgtttcccat gaacgaggaa 1621 ttcccagtaa gtgcaagtca tcacgttgcg ctgattnngt ccctgcccnt tgtacacacn 1681 nnnnntcgct actaccgatt gaatgattta gtgaggcttc ggactggcgc tcttggatgt 1741 tctacccctc gcgtctcggc gcaaggnnnt ctcgcctcga gctgacggaa agatgtccaa 1801 acttgatnnn nnnnnnnnnn nnnaagtcgt aacaaggtnn nnnnnnnnnn nnnnnnnnnn 1861 nnnnnnnnn // LOCUS PVIC1RPTA 711 bp ds-DNA INV 11-JUL-1990 DEFINITION P.vivax circumsporozoite protein gene, partial cds. ACCESSION M28745 M25758 KEYWORDS circumsporozoite protein. SOURCE P.vivax sporozoite (isolate VK247) sporozoite DNA. ORGANISM Plasmodium vivax Eukaryota; Animalia; Protozoa; Microspora; Microsporea; Microsporida; Haemosporina; Plasmodiidae. REFERENCE 1 (bases 1 to 711) AUTHORS Rosenberg,R., Wirtz,R.A., Lanar,D.E., Sattabongkot,J., Hall,T., Waters,A.P. and Prasittisuk,C. TITLE Circumsporozoite protein heterogeneity in the human malaria parasite Plasmodium vivax JOURNAL Science 245, 973-976 (1989) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by D.E.Lanar, 25-JUN-1989. FEATURES from to/span description pept < 1 > 711 circumsporozoite protein (AA at 1) site 1 15 region 1 site 697 711 region 2 site 502 597 post repeat variable region site 598 696 post repeat constant region BASE COUNT 261 a 151 c 226 g 73 t ORIGIN 1 aagctgaaac aaccagaaga tggggcaggc aatcaaccag gagcaaatgg agcaggcaat 61 caaccaggag caaatggggc aggcaatcaa ccaggagcaa atggggcagg caatcaacca 121 ggagcaaatg gggctggcaa tcaaccagga gcaaatgggg ctggcaatca accaggagca 181 aatggggctg gcaatcaacc aggagcaaat ggggctggca atcaaccagg agcaaatgga 241 gcaggcaatc aaccaggagc aaatggggca ggcaatcaac caggagcaaa tggggctggc 301 aatcaaccag gagcaaatgg agcaggcaat caaccaggag caaatggggc tggcaatcaa 361 ccaggagcaa atggagcagg caatcaacca ggagcaaatg gggcgggcaa tcaaccagga 421 gcaaatgggg ccggcaatca accaggagca aatggggcag gcaatcaacc aggagcaaat 481 ggggctggca atcaaccagg agcaaatggg gcaggtaatc aaccaggagc aaatggtgca 541 ggtggacagg cagcaggagg aaatgctgca aacaaaaagg caggagacgc aggagcagga 601 cagggacaaa ataatgaagg tgcgaatgcc ccaaatgaaa agtctgtgaa agaataccta 661 gataaagtta gagctaccgt tggcaccgaa tggactccat gcagtgtaac c // LOCUS PVIC1RPTB 657 bp ds-DNA INV 11-JUL-1990 DEFINITION P.vivax circumsporozoite protein gene, partial cds. ACCESSION M28746 M25759 KEYWORDS circumsporozoite protein. SOURCE P.vivax sporozoite (isolate VK210) DNA. ORGANISM Plasmodium vivax Eukaryota; Animalia; Protozoa; Microspora; Microsporea; Microsporida; Haemosporina; Plasmodiidae. REFERENCE 1 (bases 1 to 657) AUTHORS Rosenberg,R., Wirtz,R.A., Lanar,D.E., Sattabongkot,J., Hall,T., Waters,A.P. and Prasittisuk,C. TITLE Circumsporozoite protein heterogeneity in the human malaria parasite Plasmodium vivax JOURNAL Science 245, 973-976 (1989) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by D.E.Lanar, 25-JUN-1989. FEATURES from to/span description pept < 1 > 657 circumsporozoite protein (AA at 1) site 1 15 region 1 site 643 657 region 2 rpt 16 477 27 bp repeats site 478 544 3' post repeat variable region site 545 642 3' post repeat constant region BASE COUNT 229 a 135 c 224 g 69 t ORIGIN 1 aagctgaaac aaccagcagg tgatagagca gatggacagc cagcaggtga tagagcagat 61 ggacagccag caggtgatag agcagatgga caaccagcag gtgatagagc agctggacaa 121 ccagcaggtg atagagcaga tggacagcca gcaggcgata gagcagctgg acaaccagca 181 ggtgatagag cagatggaca gccagcagga gatagagcag ctggacagcc agcaggcgat 241 agagcagatg gacagccagc aggtgataga gcagctggac aaccagcagg tgatagagca 301 gctggacaac cagcaggtga tagagcagat ggacagccag caggcgatag agcagctgga 361 caaccagcag gtgatagagc agatggacaa ccagcaggag atagagcagc tggacagcca 421 gcaggagata gagcagctgg acagccagca ggagatagag cagctggaca gccagcagga 481 aatggtgcag gtggacaggc cgcaggagga aacgcaggag gaaacgcagg aggaaacgca 541 ggaggacagg gacaaaataa tgaaggtgcg aatgccccaa atgaaaagtc tgtgaaagaa 601 tacctagata aagttagagc taccgttggc accgaatgga ctccatgcag tgtaacc // LOCUS SHV2A 554 bp ss-RNA VRL 11-JUL-1990 DEFINITION Simian hepatitis A virus segment 2A-encoded protein mRNA, partial cds. ACCESSION M34085 KEYWORDS . SOURCE Simian hepatitis A virus (strain PA21), cDNA to viral RNA. ORGANISM Simian hepatitis A virus Viridae; ss-RNA nonenveloped viruses; Isometric ss-RNA viruses; Picornaviridae. REFERENCE 1 (bases 1 to 554) AUTHORS Brown,E.A., Jansen,R.W. and Lemon,S.M. TITLE Characterization of a Simian hepatitis A virus (HAV): Antigenic and genetic comparison with human HAV JOURNAL Unpublished (1989) STANDARD simple staff_review COMMENT Draft entry and computer-readable [or printed] sequence for [1] kindly submitted by E.A.Brown, 04-MAY-1990. Author address: E.A.Brown 547 Burnett-Womack CB# 7030 Chapel Hill, NC 27599 FEATURES from to/span description pept < 1 > 554 segment 2A-encoded protein (AA at 1) BASE COUNT 184 a 71 c 138 g 161 t ORIGIN Segment 2A; map position 3108-3662. 1 agtcatattg aaaagtggaa accttataaa gagttaagat tggaggtagg taagcaaagg 61 ctaaagtatg ctcaggaaga gttgtcaaat gaagtgttgc ctcctcctcg taaaattaag 121 ggtgtgtttt cacaagcaaa aatctcattg ttttacacag aagatcatga aattatgaaa 181 ttttcctgga aaggaattac tgctgacact agagctttga ggagatttgg cttttcattg 241 gctgctggta ggagtgtgtg gacattggaa atggatgctg gagttttgac tggcaggctg 301 gtgagggtca atgatgaaaa atggacagaa atgaaagatg acaaaatagt ttctttggtg 361 gagaaattta ctagtaataa acactggtcc aaagttaatt ttcctcatgg aatgctagat 421 ttggaagaaa ttgctgcaaa tgcaaaagaa tttccaaata tgtcagaaac tgatttgtgt 481 ttcttgttgc attggctgaa ccccaaaaag ataaacttgg cagatagaat gttgggtctg 541 tcaggaatac agga // LOCUS SHVVP1CP 2373 bp ss-RNA VRL 11-JUL-1990 DEFINITION Simian hepatitis A virus capsid protein VP1 mRNA, partial cds. ACCESSION M34084 KEYWORDS capsid protein VP1. SOURCE Simian hepatitis A virus (strain PA21), cDNA to viral RNA, passed in cwll line BS-C-1. ORGANISM Simian hepatitis A virus Viridae; ss-RNA nonenveloped viruses; Isometric ss-RNA viruses; Picornaviridae. REFERENCE 1 (bases 1 to 2373) AUTHORS Brown,E.A., Jansen,R.W. and Lemon,S.M. TITLE Characterization of a Simian hepatitis A virus (HAV): Antigenic and genetic comparison with human HAV JOURNAL J. Virol. 63, 4932-4937 (1989) STANDARD simple staff_review COMMENT Draft entry and computer-readable [or printed] sequence for [1] kindly submitted by E.A.Brown, 04-MAY-1990. FEATURES from to/span description pept < 1 > 2373 capsid protein VP1 (AA at 1) BASE COUNT 661 a 447 c 491 g 774 t ORIGIN Segment P1; map position 735-3107. 1 atgaatatgt ccaggcaagg tattttccag actgttggga gtggccttga ccacattctg 61 tctttggcag atgtggagga ggaacaaatg attcagtctg tggatcgtac cgcagttact 121 ggggcttcat atttcacttc tgtggatcaa tcttctgttc atacagctga agttggctca 181 caccaacctg aacctttgaa aacctctgtt gacaaaccag gctctaagag gacacaagga 241 gagaaatttt tccttgttca ttctgctgac tggttgacga cacatgcttt gtttcatgaa 301 gttgcaaaat tggatgtggt caaactgttg tacaatgagc aatttgctgt tcagggtctg 361 ttgaggtatc acacttatgc aagatttgga attgagatac aagttcagat caatcctaca 421 ccattccagc aaggtggttt gatatgtgcc atggtgccag gagatcagag ctatggatct 481 atagcttctt tgacagttta tcctcatggt ttgttgaatt gtaatatcaa caatgtggtc 541 agaattaagg ttccttttat ttatacaaga ggagcttatc actttaagga ccctcaatat 601 cccgtttggg agttgactat tagagtttgg tctgagctaa acattggaac tggtacctct 661 gcttacacat cactgaatgt gctggctaga tttactgatt tggaactcca tgggctaaca 721 cccctgtcta cacagatgat gagaaatgaa tttagagtca gtacaacaga aaatgtagtt 781 aatttgtcca attatgaaga tgctagagca aaaatgtctt ttgctcttga tcaggaagat 841 tggaaatctg atgcctctca agggggagga attaaaatta cacattttac aacctggaca 901 tcaattccta ctttggctgc tcagtttcca ttcaatgcct ctgattcagt tgggcaacag 961 atcaaggtta ttccagttga tccatatttc ttccaaatga ctaacacaaa tcctgaacaa 1021 aaatgtataa ctgcattggc ttcaatatgt caaatgttct gtttttggag aggagacttg 1081 gtttttgact tccaggtttt tcctacaaaa tatcactcag ggagattatt attttgtttt 1141 gttcctggaa atgaactgat tgatgtttcc cacataacat tgaaacaagc cactactgcc 1201 ccttgtgctg tgatggatat tactggagta cagtcaactt taagatttcg tgttccttgg 1261 atttcagata ctccttatag agttaataga tataccaaat cgtcacatca gaaaggagag 1321 tatactgcca taggaaagtt gattgtttat tgttacaaca gactgacttc tccctccaat 1381 gtggcttctc atgttagagt taatgtttat ctctcagcta ttaatttgga atgttttgct 1441 ccactctatc atgctatgga tgtcacaact caggttgggg atgattctgg aggcttctct 1501 accactgttt caacaaaaca gaatgttcca gaccctcaag ttggcattac aacagtgaag 1561 gatcttaaag gtagagcaaa ccaagggaaa atggatgttt cgggtatcca agctcctgta 1621 ggagctatca ctaccattga ggatccagtt ttggcaaaga aagtgcctga gaccttccca 1681 gaattgaagc ctggagagtc aagacatact tctgatcata tgtctattta caaatttatg 1741 ggcagatctc atttcttatg tacatttaca tttaattcta ataacaaaga gtacactttt 1801 cctatcactt tgtcatcaac ttctaatcct cctcatggat tgccttcaac tctgagatgg 1861 ttttttaacc tttttcagct ttataggggt cccttggatt tgacaataat tataactggg 1921 gctactgatg ttgatggaat ggcttggttt actcccgttg ggttagcagt agatacccca 1981 tgggttgaga aggagtctgc tctttctatt gattacaaga cagctcttgg tgctgttagg 2041 tttaatacta gaagaacagg aaacattcag attaggttgc cctggtactc ctatctttat 2101 gctgtctcag gggcactgga tgggcttgga gacaaaacag attcaacttt tggacttgtc 2161 tccattcaaa ttgcaaatta caatcactca gatgaatatt tgtcttttag ttgttacttg 2221 tctgtgactg aacagtctga gttttatttt cctagagcac ctttgaatac caatgctatg 2281 atgtcatcag aaacaatgat ggatagaatt gctcttggtg atcttgaatc ctcagttgat 2341 gatcctcgaa ctgaagagga tcgtaaattt gaa // LOCUS STNRRE 1885 bp ss-rRNA RNA 11-JUL-1990 DEFINITION S.hispidus 18S rRNA, 3' end. ACCESSION M34361 KEYWORDS 18S ribosomal RNA; ribosomal RNA. SOURCE S.hispidus rRNA. ORGANISM Stenopus hispidus Eukaryota; Animalia; Metazoa; Arthropoda; Crustacea; Malacostraca; Eucarida; Decapoda; Pleocyemata; Stenopodidea; Stenopodidae. REFERENCE 1 (bases 1 to 1885) AUTHORS Kim,W. and Abele,L.G. TITLE Molecular phylogeny of selected decapod crustraceans based on 18S rRNA nucleotide sequences JOURNAL J. Crust. Biol. 10, 1-13 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable [or printed] sequence for [1] kindly submitted by L.G.Abele, 19-MAY-1990. FEATURES from to/span description rRNA 1 1885 18S ribosoma RNA BASE COUNT 327 a 336 c 430 g 368 t 424 others ORIGIN 1 nncctggtng atcctgccag nngtcntnng cttgtctcaa annttnagcc nngcatgtgt 61 gagtacaagc ccaaggaagg tgaaaccgcg aatggcnnnn taaatcagct atggtttact 121 ggacctgtac tncnntnnnn nnnnnnnnnn nnnnggtaat tctagagctn anncnngccn 181 cgagcncnga cgcgggagcg ggaagagcgc nnnannagta cnaaaaccng ngtctgtgta 241 tcggcttagg tcgttgcata gncnnnnnnn tgtggtgact ctgaataact tttggctgag 301 cgcatggtct ccgcacctgg cgccgcatct ttcaagtgtc tgccttatca gctgtcgatt 361 gtaggttatg cgcctnnnat ggcgatnnng ggtnacgggg aatcngggtt nnnttccgga 421 ganngngcct gagnnncggc tnccnnntnt nnnnnnnnnn nnnnggcngn aggcnnnnnn 481 attacccntt ccggcncggg gaggtagtga cnaaaaataa cgatgcgaga ctcatccgag 541 gcctcgcnat cggaatgaga acactttaaa tcctttntcg aggatcgatt ggagggcaag 601 tctngtgcca gcagccncgg tnattccagc tccaatagng tatattaaag ttgctgcggn 661 tnnaaagctc gtagttnnat ctcagttcgg acggccgncn tccnnngtgc nttttgcggc 721 ttgatccgaa cactnctgtt gtgggcgcgc agggggtgct cttgatcgag tgtgcnnnnn 781 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnccctg 841 aatgactttg catggaataa tggaatagga cctcggttct attttgctgg ttttgtctgg 901 aacccgaggt aatgactaat agaaacnggc gggggnnttc gtactgcgac gctagaggtg 961 aaattcttgg accgtcgcna gacgaactna tgcgaaagca tctgccnagg atgttttcnt 1021 tnatcnagaa ngaaagttag aggttcgaag gcgatcagat acnnnnnnng ttctaaccgt 1081 aaacgatgct naccagcnat ccgcccgcgt tnttcccatg accgggcnnn nngcttcggg 1141 gaaaccaaag tctttgagtt ccgggggaag tatggttgca aannngaaac tcaaaggaat 1201 tgacggnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 1261 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 1321 nntgcatggt nnnnnnnnnn nnnnggtgga gcgattgctg gttaattccg atnacgaacg 1381 agactcggac ctactaacta gtcgacggat cttcgtccga cggtgtccag ttcgtnaagt 1441 cttcttagag ggataacggc aagtgtagcc gcaggagatc gagcaataac angtctgtga 1501 tgcccttaga tgttctgggc gcacgcgcgc tacactgaag tgttcaacgt gttgtcccng 1561 tccgagagga tcgggnnncc cgctgaaagc ntttcttgat ngggatgggg gcttgcaatt 1621 gttcccnntg aannnggaat tcccagtaag cgcaagtcaa tagcttgcgn tgatnnngtc 1681 cctncnnntt gtncncnccn nnnntcgcta ctaccgattg aatgatttag tgaggcttcg 1741 gactggcgcc ctgggtctga tgcangttgg ccttagtgcc ttgtgtatcg cctagggncg 1801 acggaaagat gtccaaactt gatnnnnnnn nnnnnnnnna agtcgtaaca aggtnnnnnn 1861 nnnnnnnnnn nnnnnnnnnn nnnnn // LOCUS MSQMUD76A 124 bp ds-DNA BAD 11-JUL-1990 DEFINITION A.dirus DNA probe pMU-D76. ACCESSION M34656 KEYWORDS . SOURCE A.dirus (Strain D) wild-caught female DNA, clone pMU-D76. ORGANISM Anopheles dirus Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta; Pterygota; Neoptera; Holometabola; Diptera; Nematocera; Culicoidea; Culicidae; Anophelelinae. REFERENCE 1 (bases 1 to 124) AUTHORS Panyim,O., Yasothornsrikul,S., Tungpradubkul,S., Baimai,V., Rosenberg,R., Andre,R.G. and Green,C.A. TITLE Identification of isomorphic malaria vectors using a DNA probe JOURNAL Am. J. Trop. Med. Hyg. 38, 47-49 (1988) STANDARD simple staff_review BASE COUNT 32 a 33 c 34 g 25 t ORIGIN 1 gatctgcact cggcgtgaat ttggttacca tcgaatgtgc ggaaaaagtt ttaccccgtg 61 cgcagtgcgg aacacgccag acttgttaca cacggaaacg gaccacgaac gtgttacgcg 121 cacg // LOCUS ACCCITSYN 1895 bp ds-DNA BCT 11-JUL-1990 DEFINITION A.anitratum citrate synthase gene, complete cds. ACCESSION M33037 KEYWORDS citrate synthase. SOURCE A.anitratum DNA, clone pLJD1. ORGANISM Acinetobacter anitratum Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Neisseriaceae. REFERENCE 1 (bases 1 to 1895) AUTHORS Donald,L.J. and Duckworth,H.W. TITLE Expression and base sequence of the citrate synthase gene of Acinetobacter anitratum JOURNAL Biochem. Cell Biol. 65, 930-938 (1987) STANDARD simple staff_review FEATURES from to/span description pept 264 1538 citrate synthase precursor matp 267 1535 citrate synthase signal 120 125 -35 signal signal 141 146 -10 signal signal 1560 1585 pot. transcription termination signal BASE COUNT 534 a 406 c 357 g 598 t ORIGIN 1 gtactcaacg cttaattttt ttctgcacgt tcttcttgaa ttgacttatg ataccatccc 61 gatgcagtga ttttactgac tttttttgct cgggtcttga tgactaactc tctgtgggaa 121 cgtcattttt tatccataag tataattgac aaaatttcag tactcactaa tcttatagca 181 aattttgaca ccgtctgatt cgcacatgag aaaattagga tttcgagtca gataatcatt 241 caccaggaca ggagatctat tgaatgtctg aagcaactgg caaaaaagcc gtattacatc 301 ttgatggcaa agaaattgaa ttaccaattt acagtggcac attaggtccc gatgtaatcg 361 acgttaaaga tgtattggcc tcaggtcact ttacttttga tcctggtttt atggcgacag 421 cttcatgcga gtctaaaatc acatttatcg atggtgacaa aggtatttta ttacaccgcg 481 gttacccgat tgaccagtta gcgactcaag cagactacct tgaaacttgt tatttattat 541 taaatggcga gttaccaact gctgaacaaa aagttgagtt cgatgcgaaa gttcgtgctc 601 atactatggt tcatgatcaa gttagccgtt tcttcaatgg tttccgtcgt gatgctcacc 661 ctatggcaat catggttggt gtagtaggcg cattatctgc tttctatcac aacaaccttg 721 acattgaaga catcaaccac cgcgaaatta ctgcgattcg tttgattgct aaaattccaa 781 cgcttgctgc ttggagctac aaatatactg taggtcagcc attcatctat ccacgtaatg 841 acttaaatta cgcggaaaac ttcttacaca tgatgtttgc aactcctgca gaccgtgact 901 acaaagtaaa ccctgttctt gctcgtgcaa tggatcgtat ctttacgctt cacgctgacc 961 acgaacaaaa cgcgtctact tctacagttc gtcttgctgg ttctactggt gcgaatccat 1021 atgcgtgtat ctctgctggt atctctgctc tttggggtcc tgcacacggt ggtgcgaacg 1081 aagcagttct taaaatgctt gatgaaatcg gtagcgttga aaatgttgct gagttcatgg 1141 aaaaagttaa acgcaaagaa gttaaactta tgggcttcgg tcaccgcgtt tacaaaaact 1201 tcgatccacg cgctaaagtg atgaagcaaa cttgtgacga agttcttgaa gcattaggta 1261 tcaatgatcc tcaattagcg cttgctatgg aacttgaacg tattgcattg aacgacccgt 1321 actttgttga acgtaaactt taccctaacg tagacttcta ctctggtatc atccttaaag 1381 cgattggtat cccaacagaa atgtttaccg ttatcttcgc tcttgcacgt acagttggct 1441 ggatcagtca ctggttagaa atgcacagcg gtccttacaa aattggtcgt cctcgtcagc 1501 tttacactgg tgaagtgcaa cgtgacatca agcgttaata ttcgaaagaa tattaatgta 1561 aaaagctgcc taatggcagt tttttttata aataagtttt aaaagttatt cttcttcaaa 1621 catatttaat aagtgatgac taataccatc agctcttagc caagccaact cataacttgc 1681 ttcggccaaa gctaaaatac gtctttcaaa ctcagtccat acttgtttaa cttgcgcttc 1741 tgaatcccta aaccactgtc atagctaaat gcttattctt ttcacatatt tttaaggcat 1801 ggtagagttt agccctttac tcgccccttc attaacctga cacgtttacc taatataaat 1861 ccttctacat gctgtagact gggaacatag gtacc // LOCUS ECOGUAC 1991 bp ds-DNA BCT 11-JUL-1990 DEFINITION E.coli GMP reductase (guaC) gene, complete cds. ACCESSION M33020 KEYWORDS GMP reductase. SOURCE E.coli (strain K12) DNA, clone pDS89. ORGANISM Escherichia coli Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Facultatively anaerobic rods; Enterobacteriaceae. REFERENCE 1 (bases 1 to 1991) AUTHORS Andrews,S.C. and Guest,J.R. TITLE Nucleotide sequence of the gene encoding the GMP reductase of Escherichia coli K12 JOURNAL Biochem. J. 255, 35-43 (1988) STANDARD simple staff_review FEATURES from to/span description pept 210 1250 GMP reductase (guaC) (E.C. 1.6.6.8) mRNA 25 < 1250 guaC mRNA (put.) mRNA 90 < 1250 guaC mRNA (put.) mRNA 126 < 1250 guaC mRNA (put.) BASE COUNT 493 a 494 c 508 g 496 t ORIGIN 1 gaattcatca tgattatcaa aacgttaaaa atgagtgcac gaaagcgaaa ttgatgaaac 61 gttcgctcac tatttaccag gtaaatttat gggattgtag cgtaaaaaaa gacaatttcg 121 cagtcttgcg ccgcattgat tagtgcgtat gatagcgtca ctggagttgc gctcttaccc 181 ttatagccat taaccccagg aatccgcaca tgcgtattga agaagatctg aagttaggtt 241 ttaaagacgt tctcatccgc cctaaacgct ccactcttaa aagccgttcc gatgttgaac 301 tggaacgtca attcaccttc aaacattcag gtcagagctg gtccggcgtg ccgattatcg 361 ccgcaaatat ggacaccgta ggcacatttt ctatggcctc tgcgctggct tcttttgata 421 ttttgactgc tgtgcataaa cactattctg tcgaagagtg gcaagcgttt atcaacaatt 481 cttccgctga tgtgctgaaa catgtgatgg tttctaccgg tacgtctgat gcggatttcg 541 aaaaaactaa acagattctc gacctgaacc cggcattaaa cttcgtttgt attgacgtgg 601 cgaatggtta ttccgaacac ttcgtgcagt tcgttgcgaa agcgcgtgaa gcgtggccga 661 ccaaaaccat ttgtgctggt aacgtagtga ctggtgaaat gtgtgaggag cttatcctct 721 caggtgccga tatcgttaaa gttggcattg gcccaggttc tgtttgtaca actcgcgtca 781 aaacaggcgt cggttatccg caactttctg cggtaatcga atgtgccgat gctgcgcacg 841 gtctgggcgg aatgatcgtc agcgatggtg gctgcaccac gccgggcgat gtggcgaaag 901 cctttgcgcg tgccgatttc gtcatgcttg gcggcatgct ggcgggccac gaagagagcg 961 gcggtcgcat cgttgaggag aacggcgaga aatttatgct gttctacggc atgagctccg 1021 agtctgcgat gaaacgtcac gttggcggcg ttgcggaata tcgcgcagca gaaggtaaaa 1081 ccgttaagct gccgctgcga ggcccggttg aaaataccgc gcgagatatt ttgggcggcc 1141 tgcgttcagc ttgtacatac gttggggctt cacgcctgaa agagctgacc aagcgcacca 1201 cgtttattcg tgtgcaggaa caagaaaacc gcatcttcaa caacctgtaa tctcccaacg 1261 ctggcgtgga gcaacacgcc acggttatcc catcccactc atcgcatcgc ctaaatggaa 1321 aattggcaga tacattgcca ccaccagcgt accaataatt cctcccgtta tgatcagcaa 1381 cgcggttcag taaggctgcg aggttatccg ccagcgccat tgtgttttcc cgatgatgat 1441 gggcgaggtt gtctaacatg agatccagag agccggatgc ctctcctgtt ctcactaatt 1501 gcaaacagag cgggctaaac tcaccggtat tttttagcgc cagccagatg ggttgaccgt 1561 tactgatatc gtgctggatt tgtgtcagaa gttgcaccca gtacgggcag cgcattgttt 1621 ctctgacgct ctctacgccc tgtaaaaaag taatgcctgc actttgtgtc agcgccagaa 1681 tcgtaaagat ctgcgtgagt ttttgtcccc gcatcagtga acccataatc gggatgcgta 1741 acagcaattt ctgccgcact ataagccagg tcggtcggcg catcagcaac ttattggcta 1801 tcgccagcag aaagccgaac acaccagcag ccagctccat tcgccactaa agtctgccag 1861 cgtcatgatc ccctgcgtta gtgccggtag tggggtgttg aaggtcttat agatagcggc 1921 aaactccggc agacacaaaa tgcagcattg ccacaaccac catgattagc catcgctaaa 1981 atgatgatgg g // LOCUS HUMDKERB 8815 bp ds-DNA PRI 11-JUL-1990 DEFINITION Human cytokeratin 8 (CK8) gene, complete cds. ACCESSION M34482 KEYWORDS cytokeratin 8. SOURCE Human DNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 8815) AUTHORS Krauss,S. and Franke,W.W. TITLE Organization and sequence of the human gene encoding cytockeratin 8 JOURNAL Gene 86, 241-249 (1990) STANDARD simple staff_review FEATURES from to/span description pept 1113 1436 cytokeratin 8, exon 1 3972 4180 cytokeratin 8, exon 2 4809 4869 cytokeratin 8, exon 3 5344 5439 cytokeratin 8, exon 4 5958 6248 cytokeratin 8, exon 5 7113 7333 cytokeratin 8, exon 6 7492 7550 cytokeratin 8, exon 7 8380 8567 cytokeratin 8, exon 8 IVS 1437 3971 CK8 intron A IVS 4181 4808 CK8 intron B IVS 4870 5343 CK8 intron C IVS 5440 5957 CK8 intron D IVS 6249 7112 CK8 intron E IVS 7334 7491 CK8 intron F IVS 7551 8379 CK8 intron G signal 1007 1011 TATA box signal 8779 8784 poly-A signal BASE COUNT 1868 a 2324 c 2481 g 2142 t ORIGIN 1 tcaacggatc tcgctctttt ttttctttgg agatggaatc tcgctctgtc gcccaggctg 61 gagtgcagtg gcaagtctca gctcactgca actctgcctc ccgggttcaa gtgattctcc 121 tgcctcagcc tcctgagtag ctgggattac accatggcca gctaattttt gtatttttag 181 tagagatggg gtttcaccat gttggtcagg cttgtcttga actcctgacc tcgtgatccg 241 cctacctcag cctcccaaag tgctgggatt acaggcgtgc acagcgtgcc ctggccttgg 301 atctcttttt atcttgcacc ttcagatgta gagggacgac agccactgtg tgtgtatgtg 361 tatgtgtgtg tgtgtgtgtg tgtgtgcgcg tgtgatgttt attcactcat ttatttattc 421 attcattcat tccacaaata tctacccaga ccctcttggc actgcaccag gtcgtagggg 481 tagaacagta acctggaaag atgaggcaaa tggttgattt cagattcaag gctttggact 541 ccagctgttc tgtcatccag ctcaggcagg ccctcataat cgcttcaatc agggagaaca 601 caggagagtt tctctggggt gtcggcagct cagaggagac ccaaatacta ggagacccct 661 tttcccatgc ttcccagtcc tccagtttat ttcccccagg aaggagggag acaagaccca 721 gagtcagggt tgtagtggct gggcggccca ggcaagtctg cttgttacac gacttgtgcc 781 aggacaggat ttcttccagt ttcatattca ctgaactgcc ttttcctggg tttctggggg 841 tggtgctgga gtgggctcca gggttggaac gggcccttgc gacgcgtctc tgctgccccc 901 acctgagtct gccccgaggt ggcaggtgac gggttcacgc gacgcctctg gcctagccac 961 tcaggtacga ggcctttccc ccactccccg gggctgggat ctcttttata aaaggccatt 1021 cctgagagct ctcctcacca agaagcagct tctccgctcc ttctaggatc tccgcctggt 1081 tcggcccgcc tgcctccact cctgcctcta ccatgtccat cagggtgacc cagaagtcct 1141 acaaggtgtc cacctctggc ccccgggcct tcagcagccg ctcctacacg agtgggcccg 1201 gttcccgcat cagctcctcg agcttctccc gagtgggcag cagcaacttt cgcggtggcc 1261 tgggcggcgg ctatggtggg gccagcggca tgggaggcat caccgcagtt acggtcaacc 1321 agagcctgct gagccccctt gtcctggagg tggaccccaa catccaggcc gtgcgcaccc 1381 aggagaagga gcagatcaag accctcaaca acaagtttgc ctccttcata gacaaggtga 1441 gggtcccctg cgtggctgac tgtgccccgc agcccctttc tcctggtagt cccggtccct 1501 atgcacatct ccagccccca gctggcgtcc tgctgggcct cacccgccct gggcacactc 1561 tcccttccat cctccgacct cacccctccc gtgcaccttg gtttgggctg ggtgagggtg 1621 gggagagggt ctggacagcc gggatgaatc ctggggcttc cttcttccct tttaaactgg 1681 agggtcttgg aagagagaga caacttaagg gtacagccta gttcccacca cccctctcta 1741 caaatcccgt tcttcctcag gtcattctgt cccaaattat aaaaaataat agcggttatt 1801 gttctcaccc caacccagtt ctgaccgtct tttaacgtat gcctgcggca gtcccagctg 1861 ttcgggacta ccctcctcca ggttcgcctc ttcgccagca ctacccaagg ctccccagtg 1921 gtgcctttgt gatttttttt ctttcttttt tttacatagg ggtttggtgt gattctagca 1981 ttctaggaga aggaagtggg tgtctcggtt caaacgggca aatattgatt gaggcctttg 2041 gccgccggag gcctgagtgc gggggtcaca gaatgagtca tacggcccct ggcccggcag 2101 cgtgggcggg gccgagggcg gggtgagggc tgcgggcagc agtctgcggg acgctctcct 2161 ccactggcgg agctcggcgt cgggggcggt gtgggtgggg tggggtgggg tggggtgggc 2221 tggggtgggg tggaggaggc gagggcctgg cctcggaaag cccatgcagg attcaaagtc 2281 tcctgggacg ccgcccgggg tttacgtcct gttaagttta tggcttcaga taacgcggtc 2341 gcccaccaac gcccctcgcc cattcagccc gtgtcccttt ctcggcgtcc tgtccctgct 2401 gcccccagcc tcggctccac tttccacaca gcaggagcca gggccgggtt ttgcagcctg 2461 ggactccgct gcctgagccc cggcccccgg cggccccgag gattgggccc ttcacgctga 2521 ctggctcctg ggaggcattg tgggaacggg aggagggaaa tcctggggca gagtaagccg 2581 ggaggaaccg gagccccagg aacccagtgg tcgggggccc tcgctgtcca agcgcctgga 2641 cttgacttgt tgactgcgtt ttgctagccc tggggtcctt atagagagca gctaagcata 2701 ggctttggaa tctgaattct tggtctgcac tcgtctgccg gttcctggtt atggactccc 2761 ttgccaagtc ttatttcctc atctataaaa tgaatatgag agcccctaaa tccatatagc 2821 aaaagttttt gccttattca aacttacata tgtaaagagt tcagcagtgc ttggcccaca 2881 ttccattagg ataagatgtt ataatcactt ttttttaaaa aataattttg gggcagaatg 2941 actggggaag aaagcgattt gcagagagtg gtggagggaa ctaggctgta cccttaaaag 3001 atttctgtcc cctccagttt agaaggagtt acaagttttt ttgtttgttt gagacagagt 3061 tactctgtgc ccaggctgga gtgcagtggt gtgatctcag ctcactgcaa cgctccgctt 3121 cctgggttca agcgattctc ctgcctcagc caccgagtag ctgggactac aagtgcgtgc 3181 acagcccggt taattttgta attattgtag gcaaggttca atatgttggc aggctggtct 3241 cgaactctga cttcagaaat ccgcctgcct tgaccaccca aagtgctgga attacagcgt 3301 gagcctccac gcccggcctc tttttcaatc ttaacatctt tagaaaggtt ggctattttt 3361 ggccgggcgc gggcttacgc ctataatccc agcactttgg gaggccaagg cgggccaatc 3421 acaaggtcag gagttcgaga ccatcctgcc taagacggtg aaaccctgtc tctactaaaa 3481 atacaaaaaa attagtgggg cgtggtggca cgcacggctg cctgtagccc cagccactcg 3541 ggaggctgag gcaggggcag gagaatggca tgaacttggg aggcggagct tgcagtgagc 3601 tgagatcttg cactgcactc tagcctgggc cggagactcc caaagaaagc ttggctattt 3661 ttattgatgt gtaatataca acctatgtaa atgaagttag gcctattggt ttgcaaatgc 3721 agctttaaca taattacctt acctgtctcc ttcccctacc caatgctgag ggacattgct 3781 ccccacctca ccatcatgcc atgctttctc cccctggtca taggtgatct ttccagaaca 3841 gctaaccagg tgcctggggt ctggagactt actgcttgag gagtgaatta agagaaaaga 3901 ctgcttgctt tcctccagac tttgagccct ggcctgatgt agaccttttt gctctctcct 3961 ccttcgtata ggtacggttc ctggagcagc agaacaagat gctggagacc aagtggagcc 4021 tcctgcagca gcagaagacg gctcgaagca acatggacaa catgttcgag agctacatca 4081 acaaccttag gcggcagctg gagactctgg gccaggagaa gctgaagctg gaggcggagc 4141 ttggcaacat gcaggggctg gtggaggact tcaagaacaa gtgagcaact ccaccctcca 4201 cccaactgaa gtcacctgct ctcctccacc ccttgacctt gggactaagt ccatggccct 4261 ctgttgtggg aagtgcagtc ctatctaatt agggtgacca cctgatgagg tttctcggac 4321 agtctgtgtt tatgccaggt tctagcacat tgttgatagt acccacccct ttcaatctaa 4381 ctgtctggat ttgaagaaca aattatgtgt caatgttgac atggtaaacc tgagacggga 4441 gagataggca gcctgtgggc ctcacttttg tacttaacat tctggcccct ctttagtctt 4501 gacccttgac ctctagcaaa ctctagaaag ttctgtctga ggtctcatgt caggccctgc 4561 tgttaacact ctcaaggtgt ccaatccgat gtgtattcat ggatttggag agagatttcc 4621 tgcttcccac gggctaaggg aggggtgagg gtggagaggg cagctgggga aggcagaagg 4681 accagccttc tcatatcctc atctctgtga actgaatttc ctgatttcac aacgcccctg 4741 tctcccaaaa gaccaagggc aacctccctt ttgccttcat cctctaattg taagtctttt 4801 cctcacaggt atgaggatga gatcaataag cgtacagaga tggagaacga atttgtcctc 4861 atcaagaagg tgagggagtc tcccttctcc tatctggaca ctggaggctg gggctcagag 4921 actcagacca agaagctttc tgggttttgt ccctaaatat tcctaagtag tgggacaaac 4981 tcatttatgt aaacatttgg gtgcacagaa aggtagacaa ggatggagtg gtaggtgcat 5041 ttggacagaa ctcttgacat cggtgttggg acatggttca gaaaacagag cagtagaact 5101 ggagatctgg ctctagaagg ctccctagag aaggaggtgg aagagggtgt gttgcaggaa 5161 gcagaggtga aggtgtgtgg gctgagaatg cacatgtgat gggcagaggc tgggctggaa 5221 gatcaatcca caaagtggca actagaaagt cctgtgacca ggccattggg tggaccttgg 5281 gagccccttg gttggggttg ggtgtggaaa cccagctcag gctcccctct cctcatcccc 5341 caggatgtgg atgaagctta catgaacaag gtagagctgg agtctcgcct ggaagggctg 5401 accgacgaga tcaacttcct caggcagcta tatgaagagg tatgttcctg gtcgcaggag 5461 agtgagggtc cccagccttg tcagcgcctc caccctgaga ctcaaccaga ggctcctccc 5521 agcccccagc acactaataa gacaaaggac cccactgctg actaattaca gccaccaata 5581 tttgctcggc tagtatttat tgggtctata tgttctgtcc ctcgcatgag gtgagtcatt 5641 accccatttc acagacgaga aagtgggctc agagaagtga aataacgtat ccaaggtcat 5701 catagggtgt ggtgattcag cagcaactct gtccccaaag cccttgttcc taatctttga 5761 gctgcattgg atccctctgt gcacctagta ttggtgaccc agttcctttt tcaggaactt 5821 tgcccctctc cctgaccctg actcccacct gctcctctcc tctgctgccc ctgtcttata 5881 cctaagaaag gctgttgtgg aaaagggggc tcctgtgtgc agagacaggg cctcaccact 5941 tgccctcttc cccacaggag atccgggagc tgcagtccca gatctcggac acatctgtgg 6001 tgctgtccat ggacaacagc cgctccctgg acatggacag catcattgct gaggtcaagg 6061 cacagtacga ggatattgcc aaccgcagcc gggctgaggc tgagagcatg taccagatca 6121 agtatgagga gctgcagagc ctggctggga agcacgggga tgacctgcgg cgcacaaaga 6181 ctgagatctc tgagatgaac cggaacatca gccggctcca ggctgagatt gagggcctca 6241 aaggccaggt atgggccggg ttgggggtgg gagggttcct tggacacaat cctggtgaga 6301 ggagataatg taggaagagt gaagtttctg ggagtcgggg aaggaatcct agaccagggt 6361 tcaggagttg gaggggcagc cacagttcag cttctcagtc tgcttctgag aagcaaaggg 6421 atgcagggaa ggtcccttgg gccaggacag aggtgaaagg ggactggggc aggtatgttg 6481 gggactcgtg atacatgctc caagcctgct ttaatcagtc atatgcatca ggggtaaggt 6541 tgagctctgc tgctttaagg aaagtctaga acccagggat ctagtccagt tagggtaggg 6601 ggaccttaca gtgtcgcagg tcgagaaggg tgtggagggg aagcacctgg aaactgctca 6661 tgtctccctg atctgcttcc ttagtctcgt ttatttattt atttattttt gagacagagt 6721 cttgctctgt cgcccaggct ggagtgcagt ggcgtgatct cggctcactg caagctccgc 6781 ctcctgggtt cacactattc tcctgactca gcctcctgag tagctgggac tacaggcgcc 6841 cgcaccaggc tggctaattt tttttgtatt tttgctagag acggggtttc actgtgttag 6901 ccaggactcg tcgatctcct gaccttgtga tctgcccgcc tcgcctccca aagtgctggg 6961 attacaggca tgagcactgt gcccggccct tagtctcatt aattgagctg gggagtcagc 7021 ctagtgtgtg gaggacctga gggagggtgg acgcacggag gaagagaagg catacccaac 7081 ctgacctact tacctgtccc ctacccacag agagggcttc cctggaggcc gccattgcag 7141 atgccgagca gcgtggagag ctggccatta aggatgccaa cgccaagttg tccgagctgg 7201 aggccgccct gcagcgggcc aagcaggaca tggcgcggca gctgcgtgag taccaggagc 7261 tgatgaacgt caagctggcc ctggacatcg agatcgccac ctacaggaag ctgctggagg 7321 gcgaggagag ccggtgggtg tgggtacctc tgaccggacc tgcttcccta tccctgggac 7381 ctggggtggg gacggtggga gccccctgaa gccccttgga cttggggtcc tgttgttctg 7441 ggccaagaag ggctaggagt tggtcctgac accccatttg acagggtaca ggctggagtc 7501 tgggatgcag aacatgagta ttcatacgaa gaccaccggc ggctatgcag gtggtgtccc 7561 agggccctgg atgagggcgg gaggcagggc cagggaggct cagctccagg gagggggctg 7621 tgctcagtcg ctcacagtga cctcagcctg agcactcatg ttcttgggag aatcctaggg 7681 tggggaggca catattcagg gaactccagt aataacttta ttacttagta acttcatatt 7741 agaagataca ccaataacca tagctgtgtg ccaggcactt gcgtaagtat cctacaggtt 7801 ttatgtgatt tattttattt attaatttaa tttaattttt ttgagacgaa gtctcgctgt 7861 caccaagctg agtgcagtgc tgatctcagc tcactgtaac ctcacctcct gggttcaaga 7921 gattctcctc cgtcaggcct cccaagtagc tgggactaca ggcgcatacc accatgccca 7981 tgctaatttt tgtattttta gtagagacgg ggtttcactg tgttgggcag gctggtctcg 8041 aactcctgac cttgtgatca gtgctgggat tacaggcatg agacactggg cctggctgta 8101 atttattttt tatatgacac ctgtaaacgt cttcagttga ggaaggctga ggtgcagcta 8161 aatgtccaag ctgacacagg ctatatatat ggcagctgtt ttccaccctg ctcctggttt 8221 tccctgacag ttctggagta gtgaaccatg caatcactga tcaggagagc tgggttaacc 8281 tccatccctg gggctatgtt gggaatgagc agggagaagg gcatggagcc tgccatggtg 8341 ggcttctgta ctcatgtggc tacctctgtc cctcaccagg tggtctgagc tcggcctatg 8401 ggggctcaca agccggcctc agctacagcc tgggctccag ctttggctct ggcgcgggct 8461 ccagctcctt cagccgcacc agctcctcca gggccgtggt tgtgaagaag atcgagacac 8521 gtgatgggaa gctggtgtct gagtcctctg acgtcctgcc caagtgaaca gctgcggcag 8581 cccctcccag cctacccctc ctgcgctgcc ccagagcctg ggaaggaggc cgctatgcag 8641 ggtagcactg ggaacaggag acccacctga ggctcagccc tagccctcag cccacctggg 8701 gagtttacta cctggggacc ccccttgccc atgcctccag ctacaaaaca attcaattgc 8761 tttttttttt tggtccaaaa taaaacctca gctagctcgc cgaatgtcct tgctt // LOCUS HUMSRU30S 179 bp ss-RNA RNA 11-JUL-1990 DEFINITION Human 30S small nuclear ribonucleotide protein pre-mRNA complex, exons 1 and 2 (partial). ACCESSION M34493 KEYWORDS small nuclear ribonucleoprotein. SOURCE Human Hela cell pre-mRNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 179) AUTHORS Pruzan,R., Furneaux,H., Lassota,P., Hong,G.Y. and Hurwitz,J. TITLE Assemblage of the prespliceosome complex with separated fractions isolated from Hela cells JOURNAL J. Biol. Chem. 265, 2804-2813 (1990) STANDARD simple staff_review FEATURES from to/span description pept < 1 > 55 small nuclear ribonucleotide protein (snRNP), exon 1 (AA at 2) 142 > 179 small nuclear ribonucleotide protein, exon 2 pre-msg < 1 > 179 snRNP pre-mRNA complex IVS 56 141 30S small nuclear RNA intron A BASE COUNT 30 a 48 c 46 g 55 t ORIGIN 1 aatacacgga attcactctc ttccgcatcg ctgtctgcga gggccagctg ttggggtgag 61 tgtgacctgc acgtctaggg cgcagtagtc cagggtttcc ttgatgatgt catacttatc 121 ctgtcccttt tttttccaca gctcgcggtt gaggacaaac tcttcgcggt ctttccagt // LOCUS K5TPA1PRO 307 bp ds-DNA PHG 11-JUL-1990 DEFINITION Bacteriophage BK5-T promotor DNA. ACCESSION M34486 KEYWORDS . SOURCE Bacteriophage BK5-T DNA from Lactococcus lactis, clone pMU1266. ORGANISM Bacteriophage BK5-T Viridae; Nonclassified viruses. REFERENCE 1 (bases 1 to 307) AUTHORS Lakshmidevi,G., Davidson,B.E. and Hillier,A.J. TITLE Molecular characterization of promoters of the Lactococcus lactis subsp. cremoris temperate bacteriophage BK5-T and identification of a phage gene implicated in the regulation of promoter activity JOURNAL Appl. Environ. Microbiol. 56, 934-942 (1990) STANDARD simple staff_review FEATURES from to/span description mRNA 304 > 307 bacteriophage BK5-2 mRNA BASE COUNT 80 a 56 c 43 g 128 t ORIGIN 1 gatcaaggtg tgtaggtgta atctctagct taggaacgct tttgatacag aacgtgtgat 61 tgtccgtttt taactttctt gttttgtcat cttcataaac tcacaaagtt tatttttgga 121 acaaattttt cttttttatc gtatgacgta acttttttca tttggtccat cataagcttt 181 tttaatattg tcagcttttg ctttttcgac gttctctacc gacgctttca aaatctttaa 241 tgaaaaaaac cgtaaccatc gaatttttct tccatatttt caaagaatcc gttactatct 301 aacgatc // LOCUS K5TPA3PRO 182 bp ds-DNA PHG 11-JUL-1990 DEFINITION Bacteriophage BK5-T promotor Pa3 DNA. ACCESSION M34488 KEYWORDS . SOURCE Bacteriophage BK5-T DNA from Lactococcus lactis, clone pMU1268. ORGANISM Bacteriophage BK5-T Viridae; Nonclassified viruses. REFERENCE 1 (bases 1 to 182) AUTHORS Lakshmidevi,G., Davidson,B.E. and Hillier,A.J. TITLE Molecular characterization of promoters of the Lactococcus lactis subsp. cremoris temperate bacteriophage BK5-T and identification of a phage gene implicated in the regulation of promoter activity JOURNAL Appl. Environ. Microbiol. 56, 934-942 (1990) STANDARD simple staff_review FEATURES from to/span description mRNA 169 > 182 promotor region mRNA BASE COUNT 57 a 21 c 24 g 80 t ORIGIN 1 ttttcagaat atgaagttaa aagttctcta atatttttat ccgttaaaga gtatcctata 61 aataaaattg gggattctgt taagtttgac aatatttttc gcatttacta atgctaattt 121 agattcatta tttttataat cctcactagt tatacatata gtatttgggt ttttgactga 181 tc // LOCUS K5TPF1PRO 177 bp ds-DNA PHG 11-JUL-1990 DEFINITION Bacteriophage BK5-T promotor Pf1 DNA. ACCESSION M34490 KEYWORDS . SOURCE Bacteriophage BK5-T DNA from Lactococcus lactis, clone pMU1262. ORGANISM Bacteriophage BK5-T Viridae; Nonclassified viruses. REFERENCE 1 (bases 1 to 177) AUTHORS Lakshmidevi,G., Davidson,B.E. and Hillier,A.J. TITLE Molecular characterization of promoters of the Lactococcus lactis subsp. cremoris temperate bacteriophage BK5-T and identification of a phage gene implicated in the regulation of promoter activity JOURNAL Appl. Environ. Microbiol. 56, 934-942 (1990) STANDARD simple staff_review FEATURES from to/span description pept 149 > 177 ORF mRNA 110 > 177 ORF mRNA BASE COUNT 63 a 29 c 37 g 48 t ORIGIN 1 cctttattct tcgtgcaagg aggcgcaaga tggtcaaaac ttacaaaccg attgatttta 61 acagaaaatg taagattgga gttactaaaa cagtaactta ctccaactgg aggtaagatt 121 gaaaaaattg acccaggaac ggttttaaat gttcgatttc gcggctaaaa tgagatc // LOCUS K5TPF2PRO 1209 bp ds-DNA PHG 11-JUL-1990 DEFINITION Bacteriophage BK5-T promotor Pf2 and an ORF, partial cds. ACCESSION M34487 KEYWORDS . SOURCE Bacteriophage BK5-T DNA from Lactococcus lactis, clone pMU1261. ORGANISM Bacteriophage BK5-T Viridae; Nonclassified viruses. REFERENCE 1 (bases 1 to 1209) AUTHORS Lakshmidevi,G., Davidson,B.E. and Hillier,A.J. TITLE Molecular characterization of promoters of the Lactococcus lactis subsp. cremoris temperate bacteriophage BK5-T and identification of a phage gene implicated in the regulation of promoter activity JOURNAL Appl. Environ. Microbiol. 56, 934-942 (1990) STANDARD simple staff_review FEATURES from to/span description mRNA 249 > 1209 bacteriophage BK5-2 mRNA BASE COUNT 377 a 213 c 235 g 384 t ORIGIN 1 bp upstream of EcoRI site. 1 gaattctgaa tatggttcgt aaccctatgg catttctcaa tactctttca tctaaaactg 61 aaactagcgg aagtgatagt gctgctggac ttactattcc gcaagatatc cgtactatga 121 ttaacacatt ggttcgccaa tatgactcac tacaacaata tgtacgtgtt gagagtgttt 181 ctacttcaaa cggtagtcgt gtatatgaaa aatggactga tgtaactccg ttgactgtaa 241 tggatgcaga agatggaaaa attcctgatc ttgataatcc acgtttggac aattattaaa 301 tacttgatta aacgttatgc gggaatcatc aatgccaact aatacattgc ttaaagatac 361 agcagaaaat attcttgcat ggttatcaag ctggattgct aagaaagtgg ttgtgactcg 421 taaccaagcg attattgcag caatgggtac agttcctaaa aaaccaacaa tcgctaaatt 481 tgatgatgtt attactatga ttaatacatc tgttgatcct gcgattatcg ccacttcaag 541 tcttttgact aaccagtcag ggttgaataa acttgctttg gttaaaactg ctgaaggtaa 601 atatttgctc gaaccagacc caacaaaacc taattcatat ctaattaaag gtaaaaaagt 661 tattgttgtt gcagatcgct ggcttccaaa tagtggatca acagtttatc cactttacta 721 tggagatatg tcgcaagcta ttacattgtt tgaccgtgaa aacatgtcat tacttccaac 781 aaatattggt gctggtgcat ttgaaactga tactactaaa attcgtgtaa tcgatcgctt 841 cgatgttaaa actgctgact cagaagcttt agttgctggt tcacttactg caattgcaga 901 ccaagtaggt aattttactg caggaaagta ggtaatttat gacagtaact gttgatgact 961 tactagatca gttatcagaa gatgatgatc gcaaaccgca acttcaaatt tatttgatac 1021 agcaaaagca tatgtgaaaa atgcagtgag ttctgataca gttgatgctc catttttcag 1081 tgtagaaaac gtttatccga tttatgatgt agctgttctt agctattcta tggatttgtg 1141 gattaatcgt tctacgacta tgccgcctac tacggctgta gatcacatgg ttggtcagtt 1201 gagaggcct // LOCUS K5TPG2PRO 195 bp ds-DNA PHG 11-JUL-1990 DEFINITION Bacteriophage BK5-T promotor Pg2 DNA. ACCESSION M34489 KEYWORDS . SOURCE Bacteriophage BK5-T DNA from Lactococcus lactis, clone pMU1265. ORGANISM Bacteriophage BK5-T Viridae; Nonclassified viruses. REFERENCE 1 (bases 1 to 195) AUTHORS Lakshmidevi,G., Davidson,B.E. and Hillier,A.J. TITLE Molecular characterization of promoters of the Lactococcus lactis subsp. cremoris temperate bacteriophage BK5-T and identification of a phage gene implicated in the regulation of promoter activity JOURNAL Appl. Environ. Microbiol. 56, 934-942 (1990) STANDARD simple staff_review FEATURES from to/span description mRNA 154 > 195 promotor region mRNA BASE COUNT 71 a 28 c 40 g 56 t ORIGIN 1 agagatttac gaaaagttga gtgctttagc tgaaattgat agacttttcc attggtctag 61 ccatttacat caagaacgat tacaatttgt tagtaaatat ccaaatgtta tggaaaaata 121 cagacaagca aactaaggag ggtatattga atgaccgaca aactaatatc gctggtcatc 181 aaagtgtgtg actgg // LOCUS MUSH2A 1805 bp ds-DNA ROD 11-JUL-1990 DEFINITION Mouse (H-2a haplotype) DNA fragment. ACCESSION D90007 KEYWORDS . SOURCE Mouse (strain B10.A, haplotype H-2a) DNA, clone B10.A.1. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 1805) AUTHORS Shiroishi,T., Hanzawa,N., Sagai,T., Ishiura,M., Gojobori,T., Steinmetz,M. and Moriwaki,K. TITLE Recombinational hotspot specific to female meiosis in the mouse major histocompatibility complex JOURNAL Immunogenetics 31, 79-88 (1990) STANDARD full staff_entry COMMENT These data kindly submitted in computer readable form by: Toshihiko Shiroishi National Institute of Genetics Yata-1111 Mishima, Shizuoka 411 Japan Phone: 0559-75-0771 FEATURES from to/span description site 1 288 MT-consensus rpt 1227 1242 TCTG repeat LTR 1533 1568 Xenotropic solitary LTR sequence BASE COUNT 391 a 400 c 409 g 605 t ORIGIN Chromosome 17. 1 acgtctggaa caactttcta aattagtgat tgatagggga gggccaagcc cattgtgggt 61 ggcgccattc ctgggctggc agtcctggtt tctataagaa agcaggctga gcaagtgatg 121 aggacgcccc tccatggcct ctgcatcagc tcctgcctcc agattcctgt cctgatttct 181 tcggtgacta acagctatgt ggaagtgtaa acaggatgaa cgctttcctt cccaggtagc 241 tttggtcctg gggtttcatt gcagtaatag taaccctaga tgggacaaga ctttgatcaa 301 gtgttccctt tcattgtccc cttcctgtag acatgacttc tcttcctata gacagtctct 361 cctctgcttt cctggacatg taattttttt ttttgagaca aggtcattct tgttgtctat 421 tcttgactgg ctttgaattc agaatctgca ggctctgcct ctctggtaac atgtaacatt 481 ttccatatgt aacattttta ccagccattt cccagtaaat gagttacttc atttgaggtt 541 ttgtcttaaa tccccgtgag caatgttttg ttagtttcca aagcacgagg attctaagtg 601 tctatttgtt gctaagttgc caggctgtta cagagcacag tttctgggac cctggctctc 661 tgaaactgac tagggattgc tttagtataa acataaacca ctgggactct ggctctttga 721 aactgactag ggattgcttt agtacaagta taaaccactc agtcctggtc ctacttggct 781 tcaaaagttg aatatcgctt ttggtatttg agatggagat ttaaagatgg aattttatta 841 gtcttctgcc tggttttctt tctttctttg ctcttactgc cttgtggctc agaaccagct 901 gttgcctgtt tgatagtttg tgaccaatac ctgtactgtt aaattggcca tttgagaact 961 caaaaagtcc caacttgtag tgttttcggt ttccatggtc ttagatattt ccactgcaga 1021 caacatcaag ttgccagtgg ttaacaactg tctttcagaa ctctcaagta tttcggtggg 1081 tctgccagcc cttgtaacgt agcgccacgt ggtatatgct tatttgtctg tctgtctgtc 1141 tgttgtgcaa gatgcctgtg tgccctgagg tcagaggaca gcttcaaggg ctctccattc 1201 ttccctgacc acgtggatcc agggaataga actttgacca ttacccacgg gccatgttat 1261 ttcttgacag ttctgttgta catttgtttt agtctttggc tttatttatt tttctcaccc 1321 tcagtttccc tttgtctcag atgctttttt ttttttttta aatcttgcct tgggagatgt 1381 ttcaaactct tggaacgaat gatacagttg tttgattgat agaacgaagc cttccagtgt 1441 gaatgcgttt gcatttcagc ttgttgctgg ctggctgtgt ggtgctggtt cagacatgtc 1501 acaggcttga ggtgttaagg ctaactgagt tcggagagtc cccacctgac cccttctccg 1561 ttcccctcac cagggagacc tccctcctgg ctgcagttga gcagggtgca ccggggctgg 1621 tttcagggca ggctggtagt cttctgactc tgctcactgg ccactttcag ttcctgcttt 1681 ctgaatccta tccagagttc tcagtggtca tcagactctg gagaggacga ggggaagggg 1741 tgggctctta aactatcatt tatatttaaa aaaaattaaa caacagagtt agaagcagat 1801 ccagg // LOCUS MUSH2B 1634 bp ds-DNA ROD 11-JUL-1990 DEFINITION Mouse (H-2b haplotype) DNA fragment. ACCESSION D90008 KEYWORDS . SOURCE Mouse (strain C57BL/10, haplotype H-2b) DNA, clone B10.30. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 1634) AUTHORS Shiroishi,T., Hanzawa,N., Sagai,T., Ishiura,M., Gojobori,T., Steinmetz,M. and Moriwaki,K. TITLE Recombinational hotspot specific to female meiosis in the mouse major histocompatibility complex JOURNAL Immunogenetics 31, 79-88 (1990) STANDARD full staff_entry COMMENT These data kindly submitted in computer readable form by: Toshihiko Shiroishi National Institute of Genetics Yata-1111 Mishima, Shizuoka 411 Japan Phone: 0559-75-0771 FEATURES from to/span description site 1 285 MT-consensus rpt 1128 1143 TCTG repeat LTR 1534 1569 Xenotropic solitary LTR sequence BASE COUNT 340 a 373 c 367 g 554 t ORIGIN Chromosome 17. 1 acgtctggaa caactttcta aattagtgat tgatagggga gggccaagcc cattgtgggt 61 ggcgccattc ctgggctggc agtcctggct tctataagaa agcaggctga gcaagtgatg 121 acgcccctcc atggcctctg catcagctcc tgcctccaga ttcctgtcct gatttcttcg 181 gtgactaaca gctatgtgga agtgtaaaca ggatgaacgc tttccttccc aggtagcttt 241 ggtcctgggg tttcattgca gtaatagtaa ccctagatgg gacaagactt tgatcaagcg 301 ttccctttca ttgtcccctt cctgtagaca tgacttctct tcctatagac agtctcccct 361 ctgctttcct ggacacggaa tttttttttt tttttttttg agacaaggtc tttcttgtct 421 attctcgact ggctttgaat tcagaatctg cagctctgcc tctctagtaa catgtagcat 481 tttccatatg taacattttt accagccatt tcccagtaaa tgagttactt catttggggt 541 tttatcctaa atccccgtga gcaatgtttt gttagtttcc aaagcacgag gattctaagt 601 gtctatttgt tgccaagttg ccaggctgtt acagagcaca gtttctggga ccctggctct 661 ctgaaactga ctagggattg ctttagtata aacataaacc actgggactc tggctctttg 721 aaactgacta gggattgctt tagtacaagt ataaaccact cagtcctggt cctacttggc 781 ttcaaaagtt gaatatcgca tttggtattt gagatggaga tttaaagacg gaattttatt 841 agtcttctgc ctggttttct ttctttcttt gctcttactg ccttgtggct cagaaccagc 901 tgttgcctgt ttgatagttt gtgaccaata cctgtactgt taaattggcc atttgagaac 961 tcaaaaagtc ccaacttgta gtgttttcgg tttccatggt cttagatatt tccactgcag 1021 acaacatcaa gttgccagtg gttaacaact gtctttcaga actctcaagt gtttcggtgg 1081 gtctgccagc ccttgtaacg tagcgccacg tggtatatgc ttatttgtct gtctgtctgt 1141 ctgttgtgca agatgcctgt gtgccctgag gtcagaggac agcttcaagg gctctgcatt 1201 cttccctgac cacgtggatc cagggaatag aactttgacc attacccacg ggccatgtta 1261 tttcttgaca gttctgttgt acatttgttt tagtctttgg ctttatttat ttttctcacc 1321 ctcagtttcc ctttgtctca gatgcttttt tttttttttt aatcttgcct ctgggagatg 1381 tttcaaactc ttggaacgaa tgatacagtt gtttgattga tagaacgaag ccttccagtg 1441 tgaatgcgtt tgcatttcag cttgttgctg gctggctgtg tggtgctggt tcagacatgt 1501 cacaggcttg aggtgttaag gctaactgag ttcggagagt ccccacctga ccccttctcc 1561 gttcccctca ccagggagac ctccctcctg gctgcagttg agcagggtgc accggggctg 1621 gtttcagggc atgc // LOCUS MUSH2WM7 1630 bp ds-DNA ROD 11-JUL-1990 DEFINITION Mouse (H-2wm7 haplotype) DNA fragment. ACCESSION D90009 KEYWORDS . SOURCE Mouse (strain B10.MOL-SGR, haplotype H-2wm7) DNA, clone SGR.31. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 1630) AUTHORS Shiroishi,T., Hanzawa,N., Sagai,T., Ishiura,M., Gojobori,T., Steinmetz,M. and Moriwaki,K. TITLE Recombinational hotspot specific to female meiosis in the mouse major histocompatibility complex JOURNAL Immunogenetics 31, 79-88 (1990) STANDARD full staff_entry COMMENT These data kindly submitted in computer readable form by: Toshihiko Shiroishi National Institute of Genetics Yata-1111 Mishima, Shizuoka 411 Japan Phone: 0559-75-0771 FEATURES from to/span description site 1 288 MT-consensus rpt 1126 1141 TCTG repeat LTR 1533 1568 Xenotropic solitary LTR sequence BASE COUNT 342 a 363 c 370 g 555 t ORIGIN Chromosome 17. 1 acgtctggaa caactttcta aattagtgat tgatagggga gggccaagcc cattgtgggt 61 ggcgccattc ctgggctggc agtcctggtt tctataagaa agcaggctga gcaagtgatg 121 aggacgcccc tccatggcct ctgcatcagc tcctgcctcc agattcctgt cctgatttct 181 tcggtgacta acagctatgt ggaagtgtaa acaggatgaa cgctttcctt cccaggtagc 241 tttggtcctg gggtttcatt gcagtaatag taaccctaga tgggacaaga ctttgatcaa 301 gtgttccctt tcattgtccc cttcctgtag acatgacttc tcttcctata gacagtctct 361 cctctgcttt actggacatg taattttttt tttgagacaa ggtcattctt gttgtctatt 421 cttgactggc tttgaattca gaatctgcag gctctgcctc tctggtaaca tgtaacattt 481 tccatatgta acatttttac cagccatttc ccagtaaatg agttacttca tttgaggttt 541 tgtcttaaat ccccgtgagc aatgttttgt tagtttccaa agcacgagga ttctaagtgt 601 ctatttgttg ctaagttgcc aggctgttac agagcacagt ttctgggacc ctggctctct 661 gaaactgact agggattgct ttagtataaa cataaaccac tgggactctg gctctttgaa 721 actgactagg gattgcttta gtacaagtat aaaccactca gtcctggtcc tacttggctt 781 caaaagttga atatcgcttt tggtatttga gatggagatt taaagatgga attttattag 841 tcttctgcct ggttttcttt ctttctttgc tcttactgcc ttgtggctca gaaccagctg 901 ttgcctgttt gatagtttgt gaccaatacc tgtactgtta aattggccat ttgagaactc 961 aaaaagtccc aacttgtagt gttttcggtt tccatggtct tagatatttc cactgcagac 1021 aacatcaagt tgccagtggt taacaactgt ctttcagaac tctcaagtgt ttcggtgggt 1081 ctgccagccc ttgtaacgta gcgccacgtg gtatatgctt atttgtctgt ctgtctgtct 1141 gttgtgcaag atgccggtgt gccctgaggt cagaggacag cttcaagggc tctgcattct 1201 tccctgacca cgtggatcca gggaacagaa ctttgaccat tatccacggg ccatgttatt 1261 tcttgacagt tctgttgtac atttgtttta gtctttggct ttatttattt ttctcaccct 1321 cagtttccct ttgtctcaga tgcttttttt ttttttttta atcttgcctc tgggagatgt 1381 ttcaaactct tggaacgaat gatacagttg tttgattgat agaacgaagc cttccagtgt 1441 gaatgcgttt gcatttcagc ttgttgctgg ctggctgtgt ggtgctggtt cagacatgtc 1501 acaggcttga ggtgttaagg ctaactgagt tcggagagtc cccacctgac cccttctccg 1561 ttcccctcac cagggagacc tccctcctgg ctgcagttga gcagggtgca ccggggctgg 1621 tttcagggca // LOCUS MUSMHH2IE 576 bp ss-mRNA ROD 11-JUL-1990 DEFINITION Mouse MHC class II I-E-beta-1 (haplotype H2b/K) gene, partial cds. ACCESSION M28408 KEYWORDS cell surface glycoprotein; class II gene; integral membrane glycoprotein; major histocompatibility complex. SOURCE Mouse (strain B10 (3R)) adult spleen (haplotype H2b/k), cDNA to mRNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 576) AUTHORS Gorski,J. and Hayes,C.E. TITLE The I-J-disparate mouse strains B10.A(3R) and B10.A(5R) have identical I-E beta sequences JOURNAL Immunogenetics 39, 127-129 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable copy of sequence [1] kindly submitted J.Gorski, 27-SEP-1989. The sequence for Mouse (strain B10.A (5R)) is identical to [1]. FEATURES from to/span description pept < 1 > 576 H2-I-E-beta (AA at 1) BASE COUNT 140 a 149 c 178 g 109 t ORIGIN Chromosome 17. 1 gtcagagact ccagaccatg gtttttggaa tactgtaaat ctgagtgtca tttctacaac 61 gggacgcagc gcgtgcggct tctggaaaga tacttctaca acctggagga gaacctgcgc 121 ttcgacagcg acgtgggcga gttccgcgcg gtgaccgagc tggggcggcc agacgccgag 181 aactggaaca gccagccgga gttcctggag caaaagcggg ccgaggtgga cacggtgtgc 241 agacacaact atgagatctc ggataaattc cttgtgcggc ggagagttga gcctacggtg 301 actgtgtacc ccacaaagac gcagcccctg gaacaccaca acctcctggt ctgctctgtg 361 agtgacttct accctggcaa cattgaagtc agatggttcc ggaatggcaa ggaggagaaa 421 acaggaattg tgtccacggg cctggtccga aatggagact ggaccttcca gacactggtg 481 atgctggaga cggttcctca gagtggagag gtttacacct gccaggtgga gcatcccagc 541 ctgaccgacc ctgtcacggt cgagtggaaa gcacac // LOCUS RATFAPS 1271 bp ss-mRNA ROD 11-JUL-1990 DEFINITION Rat testis-specific farnesyl pyrophosphate synthetase mRNA, complete cds. ACCESSION M34477 KEYWORDS farnesyl pyrophosphate synthetase. SOURCE Rat adult (Sprague-Dawley), cDNA to mRNA, clone TF1.4. ORGANISM Rattus rattus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 1271) AUTHORS Teruya,J.H., Kutsunai,S.Y., Spear,D.H., Edwards,P.A. and Clarke,C.F. TITLE Testis-specific transcriptional initiation sites of rat farnesyl pyrophosphate synthetase mRNA JOURNAL Mol. Cell. Biol. 10, 2315-2326 (1990) STANDARD simple staff_review FEATURES from to/span description pept 158 1219 farnesyl pyrophosphate synthetase mRNA 1 1271 farnesyl pyrophosphate synthetase mRNA BASE COUNT 329 a 319 c 347 g 276 t ORIGIN 1 ttatatttgg gttctgccta ctgagccggg agtctgggaa ctacaactcc cagagtgctg 61 agcggatgca cgctctgctt ttaggtgtaa gccgcaaaca tcttggaccc cgggagaatc 121 cgcgttgaag cacagagcat ttagctcctc tgtcagaatg aatggggacc agaaactgga 181 tgttcataac caagaaaagc agaatttcat ccagcacttc tcccagattg tcaaggtgct 241 gactgaggat gaactgggac acccagagaa gggagatgct attacccgga tcaaagaggt 301 cctggagtac aacactgtag gaggcaagta caatcggggt ctgacggtgg tacagacctt 361 ccaggaactg gtggaaccaa ggaaacagga tgctgagagc ctacagcggg ccctgacggt 421 gggctggtgt gtagaactgc tccaggcttt cttcctcgtg ttagatgaca tcatggactc 481 ttcccacact cgccgggggc agatctgctg gtatcagaag ccgggcatag gcttggatgc 541 catcaacgat gctctgcttc tggaagccgc tatctaccgc ctgcttaagt tctactgcag 601 ggagcagccc tactacctca acctgctgga gctctttcta cagagttcct atcagactga 661 gatcgggcag actctcgacc tcatcacagc accccagggc caagtggatc ttggtagata 721 cactgaaaag aggtacaaat ctatcgtcaa gtacaagaca gctttctact ctttctacct 781 gcctatcgcg gctgccatgt acatggctgg aattgatggg gagaaggaac acgctaatgc 841 cctgaagatc ctgctggaga tgggcgagtt cttccagatc caggacgact accttgatct 901 ctttggagac cccagtgtga ccggaaaggt cggcactgac atccaggaca acaaatgcag 961 ctggctggtg gttcagtgtc tgctacgagc cactcctcag cagcgccaga tcttagagga 1021 gaattatggg cagaaggacc cagaaaaagt ggcgcgggtg aaagcactgt acgaggagct 1081 ggatctgcgg agtgtgttct tcaagtacga ggaagacagt tacaaccgcc tcaagagtct 1141 catagagcag tgctccgcgc ccctgccccc atccatcttc ctggaactag caaacaagat 1201 ctacaagcgg agaaagtaac ctcgaattgt agaggctgcg agggaggggt ctcaataaat 1261 tattgttcaa c // LOCUS TTHRPEGL 2340 bp ds-DNA BCT 11-JUL-1990 DEFINITION Thermus thermophilus trpL, anthranilate synthase I and II (trpE and trpG) genes, complete cds. ACCESSION X07744 KEYWORDS anthranilate synthase I; anthranilate synthase II; trpE gene; trpG gene; trpL gene. SOURCE Thermus thermophilus (strain HB8 (ATCC 27634) DNA. ORGANISM Thermus thermophilus Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Neisseriaceae. REFERENCE 1 (bases 1 to 2340) AUTHORS Sato,S., Nakada,Y., Kanaya,S. and Tanaka,T. TITLE Molecular cloning and nucleotide sequence of Thermus thermophilus HB8 trpE and trpG JOURNAL Biochim. Biophys. Acta 950, 303-312 (1988) STANDARD simple automatic COMMENT EMBL features not translated to GenBank features: key from to description PRM 37 42 pot. -35 region PRM 60 65 pot. -10 region [1] Author address: Sato S., Mitsubishi Kasei, Institute of Life Sciences, 11, Minamiooya Machida-Shi, Tokyo, Japan. Submitted (24-MAY-1988) on tape to the EMBL data library. FEATURES from to/span description pept 72 107 trpL protein pept 169 1557 anthranilate synthase I (trpE) (EC 4.1.3.27) pept 1603 2217 anthranilate synthase II (trpG) BASE COUNT 351 a 764 c 855 g 370 t ORIGIN 1 bp upstream of BamHI site. 1 ggatccgggc cctggagggg cggccccttt agcccctgga cagggccccc gtgtcccgct 61 atcctgaggc catggccctt ccctccgccc tctggtggcc cggctaggcc ccggggcggg 121 aggcctttcc ccggggcaca ccccggggct ttgtttttgg gggacggcat ggagcggatc 181 cgaccttacc gcaaaacctt cctcgcggac ctggagaccc cggtgaccgc ctacctgaag 241 cttgccgaga aggctccggt gagcttcctt ttggagtcgg tggagcgggg gcgccaaagc 301 cgcttctcca tcgtgggggt gggggcgcgg cgcaccttcc gcctgaagga cggggtcttc 361 acggtgaacg gggagcgggt ggaaacccgt gatcccttgc gcgccctcta cgagagggtc 421 tacgccccct tggagcgcca ccccgacctc ccccccttct tcggcggggt ggtgggctac 481 gccgcctacg acctcgtccg ctactacgaa aggcttccga gcctcaagcc cgacgacctc 541 ggcctccccg acctcctctt cgtggagccc gaggtggtgg ccgtctttga ccacctgaag 601 aacctcctcc acctcgtggc cccagggagg gaccccgagg aggcggaggc ccgcctcttt 661 tgggcggaga ggcggctcaa gggccccttg cccggggtgc cgggggagag ggcggggggg 721 agggcccgct tccaggcgga cttttcccgg gaggcctacc tggaggcggt gaggagggcc 781 ctggactaca tccgggcggg ggacatcttc caggtggtcc tctccttgag gctctcctcc 841 cccctcaccg tccacccctt cgccctctac cgggcgctga ggagcgtgaa cccgagcccc 901 tacatgggct acctggacct gggggaggtg gtcttggtct cggcgagccc ggaaagcctc 961 ctccgctcgg acggccgaag ggtggtcacc cggcccatcg cgggcacgag gccgaggggg 1021 aaggacgagg aggaggacaa aaggcttgcc gaggagctcc ttagggacga gaaggaggtc 1081 gcggagcacg tgatgcttct ggacctctcc cgcaacgaca tcggccgggt cgccgccttc 1141 ggcacggtgc gggtcctcga gcccctccac gtggagcact actcccacgt gatgcacctg 1201 gtctccacgg tggagggcat cttggccgag gggaagaccc ccctggacgc cctggccagc 1261 gtgctgccca tggggacggt ctccggggcc ccgaagatcc gggccatgga gatcattgaa 1321 gaactggagc cccaccgccg ggggccctac gggggaagct tcggctacct cgcctacgac 1381 ggggccatgg acatggccct caccctgcgc accttcgtgg tggcgaaggg gtggatgcac 1441 gtccaggcgg gggcggggat cgtggcggac tcggtgccgg agagggagta cgaggagtgc 1501 tggaacaagg cgcgggcgct cctcaaggcg gtggagatgg cggaggcggg gctgtgatcc 1561 caccccatgc cggcaggggc ccggtaagga ggcctggtag gcatggctgc taacggagcg 1621 aaggggagaa aggttatgag ggtcttggtg gtggacaact acgacagctt cacctacaac 1681 ctggtgcagt acctggggga gctcggggcg gagcccatcg tgtggcggaa cgaccgcttc 1741 cggctggagg aggtggaggc cctggacccg gaccggatcc tcatcagccc ggggccttgc 1801 accccctttg aggcggggct ttccgtcccc ttggtccagc gctacgcccc ccgctacccc 1861 atcctggggg tctgcctcgg acaccaggcc atcggggcgg ccttcggggg gaaggtggtc 1921 cccgcccccg tcctcatgca cggcaaggtg agccccatcc accacgacgg caccggggtc 1981 ttccgggggc tagatagccc cttccccgcc acccgctacc actccctggc ggtggtggag 2041 gtgccggagg ccctcgtggt gaacgcctgg gcggaggagg cgggggggcg gacggtgatg 2101 ggcttccgcc accgggacta ccccacccac ggggtgcagt tccacccgga aagctacctt 2161 acggaggcgg gtaaactcat cctcaagaac ttcctggagg acccatggac gcggtgaaga 2221 aggccattct gggcgaggtt ttggaggaag aggaggccta cgaggtcatg cgggccctga 2281 tggcggggga ggtctccccg gtgcgggcgg cggggctttt ggtggccttg agcctgaggg // LOCUS XELGBBBLI 6777 bp ds-DNA VRT 11-JUL-1990 DEFINITION X.laevis beta-L-I globin gene, upstream region. ACCESSION M34470 KEYWORDS beta-L-I. SOURCE X.laevis DNA. ORGANISM Xenopus laevis Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Amphibia; Lissamphibia; Anura; Archeobatrachia; Pipoidea; Pipidae. REFERENCE 1 (bases 1 to 6777) AUTHORS Meyerhof,W., Stalder,J., Koester,M., Wirthmueller,U. and Knoechel,W. TITLE Sequence analysis of the upstream regions of Xenopus laevis beta- globin genes and arrangement of repetitive elements within the globin gene clusters JOURNAL Mol. Biol. Rep. 14, 17-26 (1990) STANDARD simple staff_review BASE COUNT 2121 a 1247 c 1228 g 2181 t ORIGIN 1 bp upstream of EcoRI site. 1 gaattcaaag cttttttttt tattaaacag ttttattgta ttttcaaacg aaaaacaagc 61 agaggtaaga cagtcaacag gttaacatta atgctgcgtg aagggtacta tacattgtgt 121 ttacattaca acttgttgga cattgatatg tcacttctgt gaatttgaag ctttacattt 181 aactaaaatt tgtgatggaa tgtctaacct gcatcccagt ccaaagaaat ttcaaagtag 241 aagatgacat aatgttggta gagatagtga tgagcggatt ttttgccagg tatggatttt 301 ggcaaaattc cgcgcttctt cgtctgcaat tttttttccc aaaactgcag caaaaatcca 361 ccataaccaa aaagtcacaa agacaaaatt gtcgcagaga caagaaagtc acagtaagac 421 ttgatgctcc tgattcactt gcactgacca caccactgta ttaaagggca gagaggggac 481 tataatgcag agacccatgg tccaggctcc tatgaccatg gggtctgctg tatagctgag 541 ctagttacac cagtaaggcc aaaacaaaat ggacttgcat gctggaataa tatgcaaaac 601 tgctgcagtg cctgttttta atctgctggg taaagagtgc aattacaaat gttaggataa 661 ttgcttatca tactctgctg catacactta ggggcccatt tacttagctc gagtgaagga 721 atagaggaaa aaaacttaga atttcgattg ttttttttgg ctacttcgac ttcgacctta 781 gacttcgaat cgaacgattc aaactaaaaa tcgtttgact attcgaccat tcgatagtca 841 aagtactgtc tctttaagaa aaaactttga ccacctagtt cgccacatta aagctaccga 901 agtcaatgtt agcctatggg gaaggtcccc atatgctttg ctagcttttt ttggtcaaaa 961 ataaaccatt cgatcgatgg attaaaatcc ttcgaatcga tcgaacgaat aatgctaaat 1021 cctttgactt cgatattcga actcgaagga tttaacttcg acagtcgaaa atcgagggtt 1081 aattaaccct cgatattcga ccttaagtaa atttgcactt attattgcaa atatttgggt 1141 ccatgacaga gtcatctgta tataatgtga aattacaaat actggtgcct cccctgtttt 1201 actttgctct atgtgagaaa aataatggag tcagtgccat acatatcctt gtgtgtatgg 1261 tggaaattgt agatgtcttg ggggcaaatt tactaaaggg cgaagtggct aacgctaggg 1321 aaaattcgcc agcgttacgt caatttgcca cttcgacaat ttagtttacg gttaccatgg 1381 cgaaaattcg ctagcaatgt aaatagacca gcgcaacttc acaccctaac gctggcgaag 1441 tcaggatgcc cacattcccc ctacatttcc taacatatgg cacctgaatt atactagggg 1501 cacatgtgta gggctttttt taagtttccc tgggcctctg tagtgttatg tatttgctgc 1561 agcaatatac atgtatacaa atttccaatc ggtagcgtaa cctcgaaccg ctgatcgtaa 1621 catcactagc gcaacttcgc aaatgattgg taacttgtgt gcaacttcgg atcttcgtga 1681 atttgcgcag ccactgcgaa gctatgcctg gcgaagtgcg gcgaatgcaa gtctcgggat 1741 ctccgcaggt aagtaaattt gccccatggt cagaggcaag gccagattat gtactaggtg 1801 acctaagaat caatactgtc cattctaaaa gtgcaagttc ataagtgccc gcaactacag 1861 aaacaatagg ggagaactaa caatctgttg taaacaacat tacaaggttg gctccctcat 1921 tgtttatatt atagctgtat aactgtaatg atgagtacga tctaagatat aatgaatctt 1981 attgcaggca aaacaatcct gttgattaat taatgcttaa attatcagaa attacaaaaa 2041 cctcaggtcc tgtgcattct ggataacagg tcccatacct gtactaaaac atgggaccag 2101 ggtgtctgca ttgatcaatc acctctttta tgattgtttt gggccatcac tctacttcaa 2161 gatgctgatg atatattacc aataaatgtt atattatata cttaaaaatc ttaattgaat 2221 taatatagtc aaatccttga tggagacaga cctagtagta tcatggataa taaaactagc 2281 aacagcaagc attggcccga cttgccatct tggagtcttg aaggaatctt ccacctttga 2341 ggaaaattgg agacagcttg tctatttttc aacctcttct aatatctaat tgaagaagat 2401 ccttacatac tgtatgtggt ggaaaatgca tgtttcttta aagatatgct gattgttgca 2461 ccaatctttg ctcaaagatc ttataagaaa tctttaagca tgactgtctg caactatgac 2521 tattataaaa tcctttccat gtagagtttt catccttttt gtgggtcaaa ggctgcccct 2581 cagcaatatc aggggaatga aattaaagtc acaaagagca aaacaattcg caccaatagg 2641 actaaaaatc cacatctcgc aatgcaatat tgttccttaa actgttattg taattgcgaa 2701 ttttaattgg ccattgcgga ttttaattgc gcactcttaa gaagtgcttg aagttgtcgt 2761 aatcttttgg agcaaacata acgacttttt cattaagagg tttaattaca ttgacgcatt 2821 ggcgcaaact ataaaatttg caaatggtct tccactgtcg gaagtggtcg caaaacagtt 2881 tctgggctcg caaaagctat attaaatttg cgaaagcaaa atgtgttcgc gcaaaggtat 2941 aacttttgca ttgcgaatag ttttccgtta gcaactttta ttgcattccc ctgtaaatat 3001 ctaataagca tggcctcgag cccaaaagac acctttttag gtaaagaaat aaatggggat 3061 ttcattctat aagtaattga atttgcacta aatattagta agtcggtttc ttgccctact 3121 ccaaccaaac tcaagaactt tcatttatta aagcacaaga aactctaact cacatattaa 3181 caaatagtta tagttggtca aattgtagct cagttaaggg tatattatat atttctgttt 3241 gttcgggtgt ggtgggccag tttttgaaac agtcaactgt tttacttaca gcagatgtcc 3301 aggtggcttg ccatatcttt gtcaaaaaca aatatattgt cagtattgtt ttttcaacat 3361 ctgccttagt tagataagaa ttgacaatat agaaccaagg gtatctaaaa atgctgctct 3421 gccttgcaat ctaatggtgg gtgggtccaa tgatttgtgt atttgcctga aaaaaaggga 3481 atattgttct ctcccttacc ttttttccca aagaaattgt ttcttttaat gtgtccaaaa 3541 tacagcaact tcagtcttgt gatttgagct tcaagtgaga tcagagacat gatttgctca 3601 gggatccatt tgtttgtgtt cctttcttcc acagtattct caaaagtctt tactaaaacc 3661 aaaatttatt agtacatttc cttgtactgc cacatttaca tctattaaga gtgacatcaa 3721 atactataac tggacaattc ccaaagtaac ctcagtacat gttaaaatat cgttgacgtc 3781 ttccatgtct cattctaagt gtcaatctgc tacttgacta taagattttt gttgtttata 3841 agtgacccag taaggcaaaa gctatacata actagctgcc cacaaactgg ccaatataaa 3901 gggagaagga aaattgttgt tcccactgga gttgttcccc tggttgggga aaaaatacta 3961 ttttgtatac aaaatgctgt tctggggtca ccaggagaga gcttctgatg ttcagggcca 4021 ggtagtgaca taagcctgag aataagactt aggggcacat tcaagctcgg gtgaatgaat 4081 agagggaaaa aaactcctcg actatcgaat tggcgtaaat tcgcctgagt agaatgattc 4141 aaatagattg agcgaaaaaa cgctgcgact attcgcccat cgatagtcga agtattgtct 4201 cttttaaaaa tcatttgact gcctacttcg ccagataaaa cctaccgaat tgctttaaaa 4261 gcctatggga aagtcccata ggcttctttt ctacgttttt gatcgaataa aaaggcattc 4321 gatcgaatat tcgatcgaat gaaaatcctt cgattgaata ttcgatcgtg cccattcgat 4381 tattcgccag cgcgtaaatt tgcccgaatt ccctattcga ttccattctc cagtcgaatt 4441 tcgagggatt taacccctcg aaattcgacc cttgatacat ctgcccctta gtgtgccaac 4501 ttgctcattg tgtgcatgtg tgtgacatgc cataaggctc tcttattaag cgcatgtatg 4561 tgatgaaaca taaccatccc cactgggagc tccttcatgg tttagcagaa tagcgctcac 4621 taccagcttt ttattcaaaa actgatattg tttccctcaa ccagagtata agctctatta 4681 gcttgcacca tcagtggggg aatttttttt cccctattag gtttccttta agctgcaaac 4741 ttgacctctc cttcccatct gcagtatatt gaccaatata agggaccaac cccacagtaa 4801 gatatctatt gtgtatgttt caaaatccca ttaggtaagg acagtacatt tatgtggtcc 4861 ctataggccc tcattatgat ctaattattg ggtcaatccg tcgtttttgg tacagtggtg 4921 ccagccttga actagagtgg taaaagaggg ctttgttggc tctttgagca tatcatagag 4981 ccttcagcaa aagttcactt tttaaatgta caccaatgaa tggagatttt tgaggccccc 5041 aaaattgtat tgctgtagat cctgcaacag ccaatgatcc ctttatctgc tctgaaatct 5101 tttttgtcgc tgctgctgct actggttaaa tacagtatag ttgaaaaaat ataggctttg 5161 agaataaaac ctgatgttca tttgcttttt aattattact ttacatcccc tttaaaaata 5221 tatacacatc actattccat gcattacact catttttaat tagacaaatc tataagaaat 5281 tctgcgagat gacacttttc atgataagca ttttgtaaaa ttgtaatatg ttcagttttt 5341 ttttttaaaa gttcaatgcc acactttatt tcaaaatgta ttaaggtgca gtaattatat 5401 taaataaatg tattgtaggg tacatgaata tatgtaacat ttaaaatgtg tgtttatgca 5461 cttctttcaa gtacagtaca tttgcactgt gatcaaatat taatttgaac tttaacagtc 5521 ctatctctac acctttatct tgtcctgggg atcagtctgt tttttagtga tatcttgtaa 5581 cacagaactt taaacaaaag ggctccgttt tgcacgtaga cctgtttgtg aatccatggc 5641 aattctgcca cctaaagcat acataacatt tagcatcttt ttttggtgtt ttttagacag 5701 atgatggtat agccatttgt gcaaataaaa tcagatattt tatcccaaat tatttgtgct 5761 gttagttgta tagggtttca acaaaatatc ttatttatca tttagagcaa atacttatgt 5821 gttacagtat ctgcaagtag tcaagtttga gcttaaaatt cccataattc ataattaagg 5881 ggatggctta gtataaaaaa acgtggaaaa aaaaacgtgt acagttatgc ttttatattg 5941 ccttgtaagt tcttttttat actattatta ttttaatgac cacgttttga attattgcat 6001 ggatttatga aaaccagttt aattgcaaag aggctcctaa aaattattta ttataagtta 6061 aaatttagta tatgcgtgca tgtatatgta acaatgcact ctcatatcta gtaaaaatca 6121 aagttgaagt aaagtgtata actaagtttg acctttctca ggcattaatg atcccagagg 6181 aaggccacac tatgtgacca aaacattgga ctacatttat taaatacatt taccttgatt 6241 tcttcaacac aatttgaaag ttcctccatg agctaatata aatttataaa gagagagagt 6301 gagtaaaaca tttttatcag aaaacagtgg cagagtaaat tctttcatac ttacaaaaga 6361 gtgctactat gcgcaacatt aacttgacat ttttgaattg tacctaatgc aattcatgat 6421 atttaaattg aatacattaa ttttaattat ttaattgtcc tgaaatctct acaggttcaa 6481 aaaaataatt ccatttatta catttatttt gtacacttaa ttatctactg ttaagtgtca 6541 caattgccct catttgatgt gggtttaagt ttcatgttgt tataaagaat caactttaca 6601 atttaagaac tatatggcat tccacatata caaaagatat attagcttaa ggttaaaaat 6661 ttattttgaa ggcaataggg tggggtggag gaaaaaaaat atgacacagc agaaatgcac 6721 aatgggtgtg actcagcatg gccatataaa gcaaggccaa caactcaaag gaacagc // LOCUS XELHBBBAI 2027 bp ds-DNA VRT 11-JUL-1990 DEFINITION X.laevis beta-A-I globin gene, upstream region. ACCESSION M34471 KEYWORDS beta-A-I. SOURCE X.laevis DNA. ORGANISM Xenopus laevis Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Amphibia; Lissamphibia; Anura; Archeobatrachia; Pipoidea; Pipidae. REFERENCE 1 (bases 1 to 2027) AUTHORS Meyerhof,W., Stalder,J., Koester,M., Wirthmueller,U. and Knoechel,W. TITLE Sequence analysis of the upstream regions of Xenopus laevis beta- globin genes and arrangement of repetitive elements within the globin gene clusters JOURNAL Mol. Biol. Rep. 14, 17-26 (1990) STANDARD simple staff_review BASE COUNT 792 a 266 c 284 g 685 t ORIGIN 1 bp upstream of BglII site. 1 agatcttgat acgttaactt tactagaaaa taatttaaac cccaatagcc tggttttgct 61 tccaatatgg tttaattata ccttagtttt caggataatg gatctttctg taatttggat 121 cttcatgcct taactgtacc agaaaatcat ttaaacttta aataaaccca atttgcttcc 181 agtacagttt aattatatct tagtttggat aagtacaagg tactgtttta ttattacagt 241 gaaaaaggta atcattttaa aaaaaaaata tatattattt ggataaaatg gagtctatgt 301 gtgatggcct ttccgtaatt ctcggtttct ggcaaacgga tctcatacct gtaataggta 361 tataaaaaac acacattaaa aaatactaca tatatattta tattcttttt tttttttaaa 421 gtgtgtaaat tcatgtcttt aaaataataa aatgtattta tatatatata tatatatata 481 tatatatata tatatatata tatatatata tatatatata tacttcaaca aaaaatttgc 541 caaattcata catacaaaaa aaataaaata ataattttaa ataattgaat ctgtctagct 601 gtttatattc tctgctctgc tggatctgac tcctgaaaaa atgtgcagaa gccatttgat 661 ttacagagct ggaggagaat ggctacatta gtttaaaagc cagaaccagg agaggatgca 721 ggcaacaaaa atggatacac acaaattaac gtctattaca attatattta caaataacct 781 taaagccaac ttttttaaaa attattatat attgtaaagt tgcttagaaa ccaatttttt 841 acttataggg agcaaaaaat agggagatcc tgtaaaacag aagctgcacc aaacatagat 901 caagctatcg agctttccat acgtatacat ttatttgaaa ggcactgtta aggagccacg 961 gtgctgtaca gtgcataaaa gtacaatata tatatataaa agtatacaca gggaagacaa 1021 atcacacaat gaatatacac agagctcata tcagaacaaa cagcttaagt gctttgtggt 1081 aagagacaca gtgggaagga ggtccctgtc ccgtagagct tacagtctta cagagctcag 1141 ttcaatccat atgactcaac catttaatta ataaaataat ttgcctttta atcattaatt 1201 aattccacac ttccatgtat aaatggaata tatatgtaag atttatatgt aatagctata 1261 tatgtaagat ttgatatttt ttttgtagga ataaaatgaa aatcaggcaa ataaaaaaca 1321 acatatatat gtttaaaaac ggtgttaatt tctatgcaac atgacatgaa aaagactttt 1381 caatattttt acatatgtat acataagata tatgattgaa ccatttgaat aataaaatag 1441 cttttgcctt accatcatta attattctac catactgtat gtaaaaagca ttgctatatg 1501 taagatttga ttatattttg ttgtaggaat aaaatgaatt ccaggcatat aaaaacacat 1561 ttataaaaaa catttataaa aaacactaca tatacatata catatatata tatatatata 1621 tatatatata tatatatatg tatatatata agtttaaaaa gtgtgttaat ttataatgtc 1681 tttctggaaa tagaatttca cacttcattg tatacaaaat tattaatatt tgtaatattt 1741 gattatatta tgttgtaggg ataaaatgaa taccaggcat ataaaaacac actttaaaaa 1801 aaaaaaaata catagataat aatttaataa tttgtattta ttttttctta atattctagc 1861 tctgctgtaa taaaaaaaac atgcatctaa aagtggtgcc aaatgggagg gtacaaatgg 1921 gctgggcaaa tgtaacgtgt gcttatccta gccaatcaac aggcagagtg gaaaggggca 1981 gtgcatcctt acagctacat aaagtctgat ggatggagaa ttagagc // LOCUS XELHBBBLII 910 bp ds-DNA VRT 11-JUL-1990 DEFINITION X.laevis beta-L-II globin gene, upstream region. ACCESSION M34472 KEYWORDS beta-L-II. SOURCE X.laevis DNA. ORGANISM Xenopus laevis Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Amphibia; Lissamphibia; Anura; Archeobatrachia; Pipoidea; Pipidae. REFERENCE 1 (bases 1 to 910) AUTHORS Meyerhof,W., Stalder,J., Koester,M., Wirthmueller,U. and Knoechel,W. TITLE Sequence analysis of the upstream regions of Xenopus laevis beta- globin genes and arrangement of repetitive elements within the globin gene clusters JOURNAL Mol. Biol. Rep. 14, 17-26 (1990) STANDARD simple staff_review BASE COUNT 322 a 152 c 128 g 308 t ORIGIN 1 bp upstream of BamHI site. 1 ggatcccttg tctggaaacc agttatccag agggctccaa attatggaaa ggccatctcc 61 catagactca attttaatca aattattatt attttttttt tacaaattaa tgcctttttc 121 aatgtattaa taaaacagta ccttgatccc aaaattggag gcaaaacaat ccagtttgtt 181 ttatttaatg tttaaatatt ttttttaata ttgttttgat ccaaattaca gaaaggcccc 241 ttatccagaa aaacctccat ttaggataag gataacaggt ccaatacatt cataccctgt 301 acaaatctat gctatgttta attacttata aatagatcca catttcaatg gatatttcta 361 gaatatcgta ataacggtat atacttgttc aaagacaaac acatttaatg acctatgcct 421 aactggaata acagtcaagg aaatttaatg gaataatagg tatttcggag ctttccattt 481 attaacccta caaacaacta gttgttgttt caggaaacag cagtagttct atttggctta 541 catcttgaac aaaagcaaag ttgctatagt tttctttttc gtgtaaggaa agaaatgact 601 tgtgtcttta tctctacatt aaaaatgtat ctgccacaca gaatactttc tttttttaac 661 ttatctatag ataacgtatg tgcacccaaa ttgtagctgt gttacatcag cataattaag 721 tgcacacatg aagaaaaaaa atgacagatt gacaaaatgt tatattatat ggtaaggtct 781 cttggataat agcccttatc agtcataact ggttacaaat acagaaaaaa tgaggtgaca 841 cagcataaat gatatgaata cgtcactaac ttacacccct ataaatcaca aggttaaaat 901 attttttttt // LOCUS CLONEUR 4835 bp ds-DNA BCT 11-JUL-1990 DEFINITION C.botulinum neurotoxin gene, complete cds. ACCESSION M30196 KEYWORDS neurotoxin. SOURCE C.botulinum (strain 62A, subtype A) DNA. ORGANISM Clostridium botulinum Prokaryota; Bacteria; Firmicutes; Endospore-forming rods and cocci; Bacillaceae. REFERENCE 1 (bases 1 to 4835) AUTHORS Binz,T., Kurazono,H., Wille,M., Frevert,J., Wernars,K. and Niemann,H. TITLE The complete sequence of the botulinum type A neurotoxin and its comparison with other Clostridial neurotoxins JOURNAL J. Biol. Chem. 265, 9153-9158 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by H.Niemann, 29-NOV-1989. FEATURES from to/span description pept 358 4248 neurotoxin mRNA 240 > 4835 neurotoxin mRNA signal 344 349 ribosome binding site site 4400 4432 potential terminator BASE COUNT 1934 a 517 c 756 g 1628 t ORIGIN 1 aagcttctaa atttaaatta ttaagtataa atccaaataa acaatatgtt caaaaacttg 61 atgaggtaat aatttctgta ttagataata tggaaaaata tatagatata tctgaagata 121 atagattgca actaatagat aacaaaaata acgcaaagaa gatgataatt agtaatgata 181 tatttatttc caattgttta accctatctt ataacggtaa atatatatgt ttatctatga 241 aagatgaaaa ccataattgg atgatatgta ataatgatat gtcaaagtat ttgtatttat 301 ggtcatttaa ataattaata atttaattaa ttttaaatat tataagaggt gttaaatatg 361 ccatttgtta ataaacaatt taattataaa gatcctgtaa atggtgttga tattgcttat 421 ataaaaattc caaatgcagg acaaatgcaa ccagtaaaag cttttaaaat tcataataaa 481 atatgggtta ttccagaaag agatacattt acaaatcctg aagaaggaga tttaaatcca 541 ccaccagaag caaaacaagt tccagtttca tattatgatt caacatattt aagtacagat 601 aatgaaaaag ataattattt aaagggagtt acaaaattat ttgagagaat ttattcaact 661 gatcttggaa gaatgttgtt aacatcaata gtaaggggaa taccattttg gggtggaagt 721 acaatagata cagaattaaa agttattgat actaattgta ttaatgtgat acaaccagat 781 ggtagttata gatcagaaga acttaatcta gtaataatag gaccctcagc tgatattata 841 cagtttgaat gtaaaagctt tggacatgaa gttttgaatc ttacgcgaaa tggttatggc 901 tctactcaat acattagatt tagcccagat tttacatttg gttttgagga gtcacttgaa 961 gttgatacaa atcctctttt aggtgcaggc aaatttgcta cagatccagc agtaacatta 1021 gcacatgaac ttatacatgc tggacataga ttatatggaa tagcaattaa tccaaatagg 1081 gtttttaaag taaatactaa tgcctattat gaaatgagtg ggttagaagt aagctttgag 1141 gaacttagaa catttggggg acatgatgca aagtttatag atagtttaca ggaaaacgaa 1201 tttcgtctat attattataa taagtttaaa gatatagcaa gtacacttaa taaagctaaa 1261 tcaatagtag gtactactgc ttcattacag tatatgaaaa atgtttttaa agagaaatat 1321 ctcctatctg aagatacatc tggaaaattt tcggtagata aattaaaatt tgataagtta 1381 tacaaaatgt taacagagat ttacacagag gataattttg ttaagttttt taaagtactt 1441 aacagaaaaa catatttgaa ttttgataaa gccgtattta agataaatat agtacctaag 1501 gtaaattaca caatatatga tggatttaat ttaagaaata caaatttagc agcaaacttt 1561 aatggtcaaa atacagaaat taataatatg aattttacta aactaaaaaa ttttactgga 1621 ttgtttgaat tttataagtt gctatgtgta agagggataa taacttctaa aactaaatca 1681 ttagataaag gatacaataa ggcattaaat gatttatgta tcaaagttaa taattgggac 1741 ttgtttttta gtccttcaga agataatttt actaatgatc taaataaagg agaagaaatt 1801 acatctgata ctaatataga agcagcagaa gaaaatatta gtttagattt aatacaacaa 1861 tattatttaa cctttaattt tgataatgaa cctgaaaata tttcaataga aaatctttca 1921 agtgacatta taggccaatt agaacttatg cctaatatag aaagatttcc taatggaaaa 1981 aagtatgagt tagataaata tactatgttc cattatcttc gtgctcaaga atttgaacat 2041 ggtaaatcta ggattgcttt aacaaattct gttaacgaag cattattaaa tcctagtcgt 2101 gtttatacat ttttttcttc agactatgta aagaaagtta ataaagctac ggaggcagct 2161 atgtttttag gctgggtaga acaattagta tatgatttta ccgatgaaac tagcgaagta 2221 agtactacgg ataaaattgc ggatataact ataattattc catatatagg acctgcttta 2281 aatataggta atatgttata taaagatgat tttgtaggtg ctttaatatt ttcaggagct 2341 gttattctgt tagaatttat accagagatt gcaatacctg tattaggtac ttttgcactt 2401 gtatcatata ttgcgaataa ggttctaacc gttcaaacaa tagataatgc tttaagtaaa 2461 agaaatgaaa aatgggatga ggtctataaa tatatagtaa caaattggtt agcaaaggtt 2521 aatacacaga ttgatctaat aagaaaaaaa atgaaagaag ctttagaaaa tcaagcagaa 2581 gcaacaaagg ctataataaa ctatcagtat aatcaatata ctgaggaaga gaaaaataat 2641 attaatttta atattgatga tttaagttcg aaacttaatg agtctataaa taaagctatg 2701 attaatataa ataaattttt gaatcaatgc tctgtttcat atttaatgaa ttctatgatc 2761 ccttatggtg ttaaacggtt agaagatttt gatgctagtc ttaaagatgc attattaaag 2821 tatatatatg ataatagagg aactttaatt ggtcaagtag atagattaaa agataaagtt 2881 aataatacac ttagtacaga tatacctttt cagctttcca aatacgtaga taatcaaaga 2941 ttattatcta catttactga atatattaag aatattatta atacttctat attgaattta 3001 agatatgaaa gtaatcattt aatagactta tctaggtatg catcaaaaat aaatattggt 3061 agtaaagtaa attttgatcc aatagataaa aatcaaattc aattatttaa tttagaaagt 3121 agtaaaattg aggtaatttt aaaaaatgct attgtatata atagtatgta tgaaaatttt 3181 agtactagct tttggataag aattcctaag tattttaaca gtataagtct aaataatgaa 3241 tatacaataa taaattgtat ggaaaataat tcaggatgga aagtatcact taattatggt 3301 gaaataatct ggactttaca ggatactcag gaaataaaac aaagagtagt ttttaaatac 3361 agtcaaatga ttaatatatc agattatata aacagatgga tttttgtaac tatcactaat 3421 aatagattaa ataactctaa aatttatata aatggaagat taatagatca aaaaccaatt 3481 tcaaatttag gtaatattca tgctagtaat aatataatgt ttaaattaga tggttgtaga 3541 gatacacata gatatatttg gataaaatat tttaatcttt ttgataagga attaaatgaa 3601 aaagaaatca aagatttata tgataatcaa tcaaattcag gtattttaaa agacttttgg 3661 ggtgattatt tacaatatga taaaccatac tatatgttaa atttatatga tccaaataaa 3721 tatgtcgatg taaataatgt aggtattaga ggttatatgt atcttaaagg gcctagaggt 3781 agcgtaatga ctacaaacat ttatttaaat tcaagtttgt atagggggac aaaatttatt 3841 ataaaaaaat atgcttctgg aaataaagat aatattgtta gaaataatga tcgtgtatat 3901 attaatgtag tagttaaaaa taaagaatat aggttagcta ctaatgcatc acaggcaggc 3961 gtagaaaaaa tactaagtgc attagaaata cctgatgtag gaaatctaag tcaagtagta 4021 gtaatgaagt caaaaaatga tcaaggaata acaaataaat gcaaaatgaa tttacaagat 4081 aataatggga atgatatagg ctttatagga tttcatcagt ttaataatat agctaaacta 4141 gtagcaagta attggtataa tagacaaata gaaagatcta gtaggacttt gggttgctca 4201 tgggaattta ttcctgtaga tgatggatgg ggagaaaggc cactgtaatt aatctcaaac 4261 tacatgagtc tgtcaagaat tttctgtaaa catccataaa aattttaaaa ttaatatgtt 4321 taagaataac tagatatgag tattgtttga actgcccctg tcaagtagac aggtaaaaaa 4381 ataaaaatta agatactatg gtctgatttc gatattctat cggagtcaga ccttttaact 4441 tttcttgtat cctttttgta ttgtaaaact ctatgtattc atcaattgca agttccaatt 4501 agtcaaaatt atgaaacttt ctaagataat acatttctga ttttataatt tcccaaaatc 4561 cttccatagg accattatca atacatctac caactcgaga catactttga gttgcgccta 4621 tctcattaag tttattcttg aaagatttac ttgtatattg aaaaccgcta tcactgtgaa 4681 aaagtggact agcatcagga ttggaggtaa ctgctttatc aaaggtttca aagacaagga 4741 cgttgttatt tgattttcca agtacatagg aaataatgct attatcatgc aaatcaagta 4801 tttcactcaa gtacgccttt gtttcgtctg ttaac //