Path: utzoo!attcan!uunet!zephyr.ens.tek.com!uw-beaver!milton!dali.cs.montana.edu!uakari.primate.wisc.edu!zaphod.mps.ohio-state.edu!usc!rutgers!bionet!root From: GenBank-Updates@genbank.bio.net Newsgroups: bionet.molbio.genbank.updates Subject: Database Update Message-ID: Date: 14 Jul 90 12:00:15 GMT Sender: root@genbank.BIO.NET Distribution: bionet Lines: 3273 Approved: lear@genbank.bio.net Checksum: 51321 194 LOCUS RATCYP7 3535 bp ss-mRNA ROD 14-JUL-1990 DEFINITION Rat cholesterol 7-alpha-hydroxylase (CYP7) mRNA, complete cds. ACCESSION J05430 KEYWORDS cholesterol 7-alpha-hydroxylase. SOURCE Rat (Sprague Dawley) adult male liver, cDNA to mRNA. ORGANISM Rattus norvegicus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 3535) AUTHORS Jelinek,D.F., Andersson,S., Slaughter,C.A. and Russell,D.W. TITLE Cloning and regulation of cholesterol 7-alpha-hydroxylase, the rate -limiting enzyme in bile acid biosynthesis JOURNAL J. Biol. Chem. 265, 8190-8197 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by D.W.Russell, 07-MAR-1990, for release after publication. FEATURES from to/span description pept 30 1541 cholesterol 7-alpha hydroxylase (CYP7) (EC 1.14.13.17) rpt 2195 2313 Alu repeat BASE COUNT 1035 a 705 c 696 g 1099 t ORIGIN 1 cgctttggaa attttcctgc ttttgcaaaa tgatgactat ttctttgatt tggggaattg 61 ccgtgttggt gagctgttgc atatggttta ttgttggaat aaggagaagg aaagctggtg 121 aacctccttt ggagaacggg ttgattccgt acctgggctg tgctctgaaa tttggatcta 181 atcctcttga gttcctaaga gctaatcaaa ggaagcatgg tcacgttttt acctgcaaac 241 tgatggggaa atatgtccat ttcatcacaa actccctgtc ataccacaaa gtcttatgtc 301 atggaaaata ttttgactgg aaaaaatttc attacactac ttctgcgaag gcatttggac 361 acagaagcat tgacccaaat gatggaaata ccacggaaaa tataaacaac acttttacca 421 aaaccctcca gggagatgct ctgtgttcac tttctgaagc catgatgcaa aacctccaat 481 ctgtcatgag acctcctggc cttcctaaat caaagagcaa tgcctgggtc acggaaggga 541 tgtatgcctt ctgttaccga gtgatgtttg aagctggcta tctaacactg tttggcagag 601 atatttcaaa gacagacaca caaaaagcac ttattctaaa caaccttgac aacttcaaac 661 aatttgacca agtctttccg gcactggtgg caggccttcc tattcacttg ttcaagaccg 721 cacataaagc tcgggaaaag ctggctgagg gattgaagca caagaacctg tgtgtgaggg 781 accaggtctc tgaactgatc cgtctacgta tgtttctcaa tgacacgctc tccacctttg 841 acgacatgga gaaggccaag acgcacctcg ctattctctg ggcatctcaa gcaaacacca 901 ttcctgcaac cttttggagc ttatttcaaa tgatcaggag tcctgaagca atgaaagcag 961 cctctgaaga agtgagtgga gctttacaga gtgctggcca agagctcagc tctggaggga 1021 gtgccattta cttggatcaa gtgcaactga atgacctgcc ggtactagac agcatcatca 1081 aggaggctct gaggctttcc agtgcatcct tgaatatccg cacagctaag gaggacttca 1141 ctctccatct tgaggacggt tcctataaca tccgaaaaga tgacatgata gctctttatc 1201 cacagttaat gcacttggat cctgaaatct acccagaccc tttgactttc aaatatgacc 1261 ggtaccttga tgaaagcggg aaagcaaaga ccaccttcta cagtaatgga aacaagctga 1321 agtgtttcta catgcccttc ggatcaggcg cgacaatatg tcctggaaga ctctttgccg 1381 tccaagaaat caagcagttt ttgatcctga tgctctcctg ctttgaactg gagtttgtgg 1441 agagccaagt caagtgtccc cctctagacc agtcccgggc aggcttggga attttgccac 1501 cactacatga tattgagttt aaatataaac tgaaacactg atacgtggtt ggaagaagcg 1561 aacactggat gatgtcactt ggcggctgag agtcatcact aaacaggcct tcgggaccaa 1621 tgctcactga tgcgccctag cgactggatt agtgggaaga actttgttct cgctgcccac 1681 attcctgggt gttcacatag ctggggccag agcttcatca ctttcagaaa gcaatgtctt 1741 ttgtatttat tttcaaaatg aagatattcc aattggcagg atatttttcc taaggaaatt 1801 gctttatatt tttatgaaaa ctaccaatta attatgaaag ggcttgaaat tcacgtttta 1861 gtgaaattac tgatttttca ctagtaaggt tcttcaggtg tgaaactgta ttataaaaat 1921 gttgtaatgg gtcacactgt gctttgcata aaggtaaagg aaactatgtt tcagcctttt 1981 ctgtgtctat gagcttcgaa aataatctta ctgttctaga aacactgggg aggtttcgac 2041 atgctctcgc tatattttat tttactgttg ctagaaattt tcattccagt tttcaactac 2101 cttatctttc ccccattttg acatgcatgc caatgagaag agtatttttt aggaattaac 2161 aaggcacctc ccagaaccct accctgagac ttttaagcct ttaatcccag cactcgagaa 2221 gtagagccag gcagatctct gagtctgagg ttattctggt ctacatcagc tccagacaag 2281 ccaggactac agaatgggat cttgtctaaa aaatacagct aatctttatg tcataactga 2341 ttatgaatca acctaaaaga taaattttca atcaggactc agagaaaatg agcaattaaa 2401 aaacttagct ctgaggtatg tggaattcat taagtacaag ttgacattac atgttcttta 2461 aaaatagttt atgttttatc tctaaatgcc ctgcagatga agaataataa tgaaaagttg 2521 aataatactg tttaaacact aagtgcaata atgctttggt aatgtacttt aagagaatca 2581 ttagccgtgc cagttttact aaaatatatt tatatgtaaa ttatatttat ctttttctta 2641 taccataaat ataaaaatat tgcaacattt agtaatttta aaattatata cctttcagaa 2701 aatgatgtat gatgtttgta tgttttttaa ctttgaacag aacatttaaa ttattcatct 2761 acggtgattt ttatcttatt tatttctttt tgtctcattc atatcttgaa gaaatccaaa 2821 aatatctgaa ggaatcgctc actcaaatgt ctccctatgg ttacagaaaa attcaatacc 2881 atgtttttgt cctcggggac tgaagcaggg tgtcgtgggt gcgagcagag gctcctgctg 2941 cagcgagctt tatccacggg actccttaaa cttttaaaat cttatcacta ttatcatgca 3001 tttattacct aagtaggata tttccctttc ctttttcatt tcagcagagt cccttagcaa 3061 cccaggctga ctgggaccct ccatgtagct taagctgtga actcactgta cttcctgttt 3121 tcacttattt taggaagtaa ttttccctat cagaaatttt aattgtttag atgatgtata 3181 agagtaacac aattctgtta tatactaatc tgtagtaaac taaatttgtt cttagaacaa 3241 gtttgatgac tctcaaattg aatgtatcca tacatctttc catggcttct tgaatgccca 3301 tttctcatac acagaatgat gggtttcacg gtgatgtctt cctttcatgt ctttattctt 3361 gtgcggtgat ggttggcaaa tgatacccat ggagcaaggt tactcttcct atttctgtgc 3421 agcctaagtg ttaagaataa tttttaaata cttggaggga aggcacattt tgtgtcatat 3481 gtgaagtgac atgtgacaca cagactagca aatccatgag taaaatttta ttggg // LOCUS PDECYT550 1047 bp ds-DNA BCT 14-JUL-1990 DEFINITION P.denitrificans cytochrome c550 gene, complete cds, and iso-cytochrome oxidase subunit I (iso-COI) gene, 5' end. ACCESSION M27304 KEYWORDS cytochrome c550; cytochrome oxidase. SOURCE P.denitrificans (strain NCIB 8944) DNA, clone pRS59. ORGANISM Paracoccus denitrificans Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Neisseriaceae. REFERENCE 1 (bases 1 to 1047) AUTHORS Van Spanning,R.J.M., Wansell,C., Harms,N., Oltmann,L.F. and Stouthamer,A.H. TITLE Mutagenesis of the gene encoding cytochrome c-550 of Paracoccus denitrificans and analysis of the resultant physiological effects JOURNAL J. Bacteriol. 172, 986-996 (1990) STANDARD full staff_review REFERENCE 2 (sites; amino acid correction for [1]) AUTHORS Van Spanning,R.J.M., Wansell,C., Harms,N., Oltmann,L.F. and Stouthamer,A.H. TITLE Errata: Mutagenesis of the gene encoding cytochrome c-550 of Paracoccus denitrificans and analysis of the resultant physiological effects JOURNAL J. Bacteriol. 172, 3534-3534 (1990) STANDARD full staff_review COMMENT Printed sequence for [1] kindly submitted by R.J.M.Van Spanning, 25-AUG-1989. FEATURES from to/span description pept 235 702 cytochrome c550 precursor pept 874 > 1047 iso-cytochrome oxidase subunit I sigp 235 294 signal peptide cytochrome c550 matp 295 699 cytochrome c550 ORF < 1 90 open reading frame (AA at 1) rpt 734 743 inverted repeat copy A rpt 757 766 inverted repeat copy B BASE COUNT 215 a 340 c 309 g 183 t ORIGIN 1 bp upstream of SalI site. 1 gtcgacatgg gcctgccggt cggcatggat gacctggacc tggccctgcg ccgcagcttc 61 gcacggaact ttccgccgct cgccggctga ttttcggccc tgcgacatat tctccctcag 121 ccgttctacg acgtttttcc tgtattctgc ccgcttggca catgatagcc ctgtcaatcg 181 gggaagcgga gaggtccgtt gcccggacat gagcgatacc caagaggaaa cgcgatgaag 241 atcagcatct atgccactct cgccgccatc accctcgccc tgcccgctgc ggcccaggat 301 ggcgacgccg ccaaaggcga gaaagaattc aacaagtgca aggcttgcca catgatccag 361 gcgccggacg gcaccgacat catcaagggc ggcaagaccg ggcccaacct ttacggcgtc 421 gtcggccgca agatcgcctc ggaggagggc ttcaaatacg gcgaaggcat cctcgaggtc 481 gccgaaaaga accccgacct gacctggacc gaggccgacc tgatcgaata cgtcaccgac 541 cccaagccct ggctggtcaa gatgaccgac gacaagggcg ccaagaccaa gatgaccttc 601 aagatgggca agaaccaggc cgacgtggtg gccttcctgg cccagaactc gcccgatgcg 661 ggcggcgacg gcgaggctgc ggccgagggc gaatcgaact gatcccgacg cctccgagat 721 gtctcgacgg ccccgcccct gcgtgcgaca aaagcgcgca ggggcgcgac ttttagtcct 781 ttcttcgcat tgcgaggggt aggattgtgg tctagaaaca ggcgagtccg tcggcctttg 841 cgccggccag ccgcatgtct agggagtcca cgcatggcag acgcagccgt tcacggccac 901 ggtgaccatc atgacacccg cgggttcttc acccgctggt tcatgtcaac aaaccacaag 961 gatatcggta tcctttacct gttcacggcc ggcatcgtcg gcctgatctc ggtatgcttc 1021 accgtctata tgcggatgga actgcag // LOCUS ECOSPEC 2330 bp ds-DNA BCT 14-JUL-1990 DEFINITION E.coli ornithine decarboxylase (speC) and cyclic AMP receptor protein genes, complete cds. ACCESSION M33766 KEYWORDS cyclic AMP receptor protein; ornithine decarboxylase; speC gene. SOURCE E.coli (strain K12) DNA, clone pLC20-5. ORGANISM Escherichia coli Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Facultatively anaerobic rods; Enterobacteriaceae. REFERENCE 1 (bases 1 to 2330) AUTHORS Barroso,L., Moore,R., Wright,J., Patel,T. and Boyle,S.M. TITLE Analysis and sequence of the speC (ornithine decarboxylase) gene of Escherichia coli JOURNAL Unpublished (1990) STANDARD unannotated staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by S.M.Boyle, 22-MAY-1990. Author address: S.M.Boyle Virginia Polytechnic Institute and State University College of Veterinary Medicine, VPI & SU Blacksburg, VA 24061 FEATURES from to/span description pept 86 2281 ornithine decarboxylase (speC) (EC 4.1.1.17) pept 719 630 (c) cyclic AMP receptor protein (CRP) mRNA 15 2289 speC mRNA (alt., primer extension) mRNA 16 2289 speC mRNA (alt., S1 mapping) mRNA 27 2289 speC mRNA (alt., primer extension) mRNA 30 2289 speC mRNA (alt., S1 mapping) mRNA 41 2289 speC mRNA (alt., S1 mapping) mRNA 42 2289 speC mRNA (alt., primer extension) signal 14 19 -10 region signal 733 728 (c) -10 region signal 755 750 (c) -35 region signal 2289 2320 rho-independent transcription termination signal binding 364 367 pyridoxal-PO4 binding site binding 66 72 GTP phosphoryl binding site binding 263 266 GTP phosphoryl binding site binding 379 382 GTP guanine binding site BASE COUNT 508 a 546 c 664 g 612 t ORIGIN 1 ctgcaggaca acagtaaaat cagagcgttt ctgcttttac tgatgtctgg cggtcggagc 61 tggtgaccag tttgacccat atctcatggg gcagggtttt ccaccttgtc cggtattctt 121 acttccccga aacgggtttg cgcttatgaa atcaatgaat attgccgcca gtagtgaact 181 ggtatcccga ctttcttctc atcgtcgcgt ggtggcgttg ggagatactg attttacgga 241 cgtcgcggca gtcgtcatta ccgctgcgga tagtcgcagt ggcattcttg cgttgcttaa 301 gcgcaccggt tttcatctac cggtgttttt gtattccgaa catgctgttg aattacctgc 361 gggcgttacg gcggtaatca acggcaacga gcagcagtgg ctggagctgg aatccgcagc 421 ctgtcagtat gaagagaatt tgctgccacc gttttatgac acgctgacgc agtacgttga 481 gatgggcaac agcacctttg cttgccctgg acatcaacat ggtgcgtttt ttaaaaagca 541 tcctgccgga cgccattttt acgatttctt tggtgagaac gtctttcgcg ccgatatgtg 601 taacgctgac gtaaaattgg gcgatctgct tattcatgaa ggatcggcga aagatgcgca 661 gaaattcgca gccaaagtct ttcatgccga taaaacctat tttgtgctga acggcacatc 721 ggcagcgaat aaagtggtga cgaatgcgct gttaacgcgt ggcgatctgg tgctcttcga 781 ccgtaacaac cataagtcga atcatcacgg cgcgctgatt caggcggggg cgacgccggt 841 ctatctggaa gcttcacgca acccgtttgg tttcattggc ggtattgatg cgcactgttt 901 taatgaagag tatctgcgcc agcaaattcg cgacgttgcg ccagaaaaag ccgacctgcc 961 gcgcccgtat cgcctggcga ttattcagct gggaacctat gacggcactg tctataacgc 1021 ccgtcaggtg atcgataccg ttgggcatct gtgtgattac attctgtttg attccgcgtg 1081 ggtcggttat gaacaattta tcccgatgat ggcggatagc tcgccgctgc tgttagaact 1141 taacgaaaac gatccgggga tctttgtgac tcagtcggtg cacaaacagc aggcgggatt 1201 ctcacagacg tcgcagatcc ataaaaaaga taaccatatc cgcggacagg cgcgtttttg 1261 cccgcataag cggttgaata acgcctttat gctccatgct tctaccagcc ctttctatcc 1321 gctgtttgct gcactggatg ttaacgccaa aattcatgaa ggggagagtg ggcgtcggct 1381 gtgggctgag tgtgttgaga tagggattga agcgcgcaag gctattcttg cgcgctgtaa 1441 gctgttccgc ccgtttatcc cgcccgttgt tgatggcaaa ttgtggcagg attatccgac 1501 atcagtgtta gccagcgacc gccgtttttt cagttttgag ccgggggcga agtggcacgg 1561 ctttgaagga tatgccgcgg atcagtattt tgttgatccg tgcaagctgt tactcactac 1621 accaggtatc gatgccgaaa ccggcgaata tagcgacttt ggcgttccgg cgacgattct 1681 ggcgcactat ctgcgtgaga acggcattgt gccggagaag tgcgatctca actccattct 1741 gtttttatta actccggcgg aaagccacga gaagctggca caactggtgg cgatgctggc 1801 gcaatttgaa cagcatattg aggatgactc gccgctggtt gaggtgttgc cgagcgttta 1861 taacaagtat ccggtgcgct atcgcgacta caccctgcgc cagttgtgtc aggagatgca 1921 cgatctgtat gtcagtttcg acgtcaaaga cctacaaaaa gcgatgttcc gccagcagag 1981 tttcccgtca gtggtgatga acccccagga tgcgcatagc gcttatattc gcggtgacgt 2041 ggagttggtg cggattcgtg atgccgaagg gcgaattgcg gcagaagggg cgttgcctta 2101 tccacctggc gtgctttgcg tggtacccgg ggaagtctgg ggtggggcgg ttcaacgtta 2161 tttccttgca ctggaagaag gggtgaattt gttgccggga ttttcgccgg agctgcaagg 2221 tgtttatagc gaaaccgatg cggatggcgt gaaacggttg tacggttatg tgttgaagta 2281 agaataaaaa aaacgggtca ccttctggcg acccgttttt ctttgcgtaa // LOCUS HUMLPLAS 557 bp ss-mRNA PRI 14-JUL-1990 DEFINITION Human L-plastin mRNA, 5' end. ACCESSION M34426 KEYWORDS L-plastin. SOURCE Human, cDNA to mRNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 557) AUTHORS Lin,C.-S., Aebersold,R.H. and Leavitt,J. TITLE Correction of the N-terminal sequences of the human plastin isoforms by using anchored polymerase chain reaction: Identification of a potential calcium-binding domain JOURNAL Mol. Cell. Biol. 10, 1818-1821 (1990) STANDARD simple staff_review FEATURES from to/span description pept 138 > 557 L-plastin BASE COUNT 174 a 111 c 133 g 139 t ORIGIN 1 acttcctgcc ttgtgaccac acacccaggc ttgacaaagc tgttctgcag atcagaaaga 61 aggggttcct ggtcatacac cagtactacc aaggacagct tttttcctgc aagatctgtt 121 acctaaagca ataaaaaatg gccagaggat cagtgtccga tgaggaaatg atggagctca 181 gagaagcttt tgccaaagtt gatactgatg gcaatggata catcagcttc aatgagttga 241 atgacttgtt caaggctgct tgcttgcctt tgcctgggta tagagtacga gaaattacag 301 aaaacctgat ggctacaggt gatctggacc aagatggaag gatcagcttt gatgagttta 361 tcaagatttt ccatggccta aaaagcacag atgttgccaa gacctttaga aaagcaatca 421 ataagaagga agggatttgt gcaatcggtg gtacttcaga gcagtctagc gttggcaccc 481 aacactccta ttcagaggaa gaaaagtatg cctttgtcaa ctggataaac aaagccctgg 541 aaaatgatcc tgattgt // LOCUS HUMTPLAS 516 bp ss-mRNA PRI 14-JUL-1990 DEFINITION Human T-plastin mRNA, 5' end. ACCESSION M34427 KEYWORDS T-plastin. SOURCE Human, cDNA to mRNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 516) AUTHORS Lin,C.-S., Aebersold,R.H. and Leavitt,J. TITLE Correction of the N-terminal sequences of the human plastin isoforms by using anchored polymerase chain reaction: Identification of a potential calcium-binding domain JOURNAL Mol. Cell. Biol. 10, 1818-1821 (1990) STANDARD simple staff_review FEATURES from to/span description pept 88 > 516 T-plastin BASE COUNT 169 a 85 c 125 g 137 t ORIGIN 1 aaagattccg aggtgcagaa gttgtctgag tgggttggtc ggcggcagtc gggccagacc 61 caggactctg cgactttaca tctttaaatg gatgagatgg ctaccactca gatttccaaa 121 gatgagcttg atgaactcaa agaggccttt gcaaaagttg atctcaacag caacggattc 181 atttgtgact atgaacttca tgagctcttc aaggaagcta atatgccatt accaggatat 241 aaagtgagag aaattattca gaaactcatg ctggatggtg acaggaataa agatgggaaa 301 ataagttttg acgaatttgt ttatattttt caagaggtaa aaagtagtga tattgccaag 361 accttccgca aagcaatcaa caggaaagaa ggtatttgtg ctctgggtgg aacttcagag 421 ttgtccagcg aaggaacaca gcattcttac tcagaggaag aaaaatatgc ttttgttaac 481 tggataaaca aagctttgga aaatgatcct gattgt // LOCUS MUSICSBP 2544 bp ss-mRNA ROD 14-JUL-1990 DEFINITION Mouse interferon consensus sequence binding protein mRNA, complete cds. ACCESSION M32489 KEYWORDS DNA binding protein; interferon consensus sequence binding protein. SOURCE Mouse (strain Balb/c; haplotype H-2d) neonatal spleen and lung, cDNA to mRNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 2544) AUTHORS Driggers,P.H., Ennist,D.L., Gleason,S.L., Mak W,-H., Marks,M.S., Levi,B.-Z., Flanagan,J.R., Appella,E. and Ozato,K. TITLE An interferon gamma-regulated protein that binds the interferon-inducible enhancer element of major histocompatibility complex class I genes JOURNAL Proc. Natl. Acad. Sci. U.S.A. 87, 3743-3747 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by K.Ozato, 23-MAR-1990, for release after publication. FEATURES from to/span description pept 21 1295 interferon consensus sequence binding protein binding 21 362 DNA binding site BASE COUNT 584 a 680 c 739 g 541 t ORIGIN 1 gcgtgggaac cggcggcagg atgtgtgacc ggaacggcgg gcggcggctg cggcagtggc 61 tgatcgaaca gatcgacagc agcatgtacc cggggctgat ctgggaaaat gatgagaaga 121 ccatgttccg tatcccctgg aagcatgccg gcaagcagga ttacaatcag gaggtggatg 181 cttccatctt caaggcctgg gcagttttta aagggaagtt taaagaggga gacaaagctg 241 aaccagccac gtggaagacg aggttacgct gtgctctgaa caagagccca gattttgaag 301 aagtgactga ccggtcccag ctggacattt ctgagccata taaagtttac cgaattgtcc 361 ccgaggaaga acaaaaatgc aagctgggcg tggcacctgc aggctgcatg agcgaagttc 421 ctgagatgga gtgtggccgc tcagagattg aggagctgat caaggaacct tctgtggatg 481 agtacatggg tatgaccaag aggagcccat ccccaccaga ggcctgcagg agccagatcc 541 tccctgactg gtgggtccag cagcccagtg caggcctgcc actggtgacc ggatatgccg 601 cctatgacac acaccattca gctttctccc agatggtcat cagcttctac tacgggggca 661 agctggtggg ccaggccacc accacctgcc ttgaaggctg ccgtctctcc ctgagccagc 721 cggggctgcc taagttgtat gggccggatg gcctggaacc cgtgtgcttt ccgacggccg 781 acaccatccc cagtgagcgg cagaggcagg tgacccggaa gctgtttggg cacctggaac 841 gtggcgtgct actgcacagc aaccgcaagg gcgtgttcgt gaagcggctg tgccagggcc 901 gcgtgttctg cagcggcaac gcggtggtgt gcaagggcag gcccaacaag ctggagcggg 961 acgaggtggt gcaggtcttt gacaccaacc agttcatccg agagctgcag caattctacg 1021 ccacccagag ccgcctacct gacagcaggg tggtcctgtg cttcggggag gagtttccgg 1081 acactgtgcc cttgcgctcc aaactcattc tggtgcaggt agagcagctg tatgccaggc 1141 agctggtgga ggaagcgggc aagagctgcg gtgctggctc cctgatgcca gccctggagg 1201 agccccagcc ggaccaggct ttccgcatgt ttccggatat ctgtacctca caccagagac 1261 ccttttttag agaaaatcaa cagatcaccg tctaagcctc agtccgggca ccccacctcg 1321 cctgagctca agcttcaaga gtctgtgact aagagaattc cgaaaggatg tggagccctc 1381 tgactggggt gggcgggtgt cctccaaggg gcctccggaa gcccacagag ggatgcgctc 1441 ctgctcaggc aggtgtcaga agcttgcagg ggctgtggcc gcaacctgtg attaaagcat 1501 tccttttcct tacactattt cacccctcac cactaatggc tggcctttct gtgtgctgag 1561 gtctttcgac agttcaaatc atctggtggc agcagactcg cctttgccct tctgcggccg 1621 agggcggaga tttatgactt tctctgcttg gttggagaag aagaatcttt actattcagc 1681 ttcttttctt ttggccagaa ctctgaaaaa aaaaaaaaac tcttttttaa gaccaatatt 1741 tgtattctca caggctcagc tgtcaatcac ttgagacctt ccctgtaaag tggggcagat 1801 tttaaatatg ggtgtagata ctcgttgcag ccttcgcagg aattttggtt gtggttcatt 1861 gattcacaca gactctgtgt cagctgacag ggctgtgtgg ggcatcaaag gaggaccagg 1921 cactgtggag aagacccatt cactggcatc tcacccttcc ttgtccagct ccatacccag 1981 tcctaagacc cagtgaaaag ccacgtccaa actgtgctct gggctcatca gtgcccaccc 2041 acgtaccagg gaaaggcaca cacccatccc agtgggcaca gagcggaatg tccccctacc 2101 gcaccatttg cgcccccaat ctggctgtcc aacctagttt gtaagtaatc taaatcagtg 2161 actatagccc cgcctaaggg acacttcccg gaggagggag ccgctgaaaa ggagttagtt 2221 tgagggtcag tacacaacag gggcagaaag ccaagcagat gtgggggcag ggaggtcatc 2281 atctgctttt gtctgagaga aggagagctt ctccgtttgt tcaactttgt aacaagctgg 2341 gttacatgct ccacgcagct agagaagcct aggtgctctg cattccctgg ggaactgcag 2401 gaaagcctta cctgctgact gttgctctgg ggaaaagcct gagggtccag agcagctaca 2461 agctacaggc cataccttac aacctgaaaa gctaaggacc acggtgacct tcccggctac 2521 tgtgtgaagg tgctgggtgg ggcc // LOCUS DDIUMPCMP 694 bp ss-mRNA INV 14-JUL-1990 DEFINITION D.discoideum UMP-CMP kinase mRNA, complete cds. ACCESSION M34568 KEYWORDS UMP-CMP kinase. SOURCE D.discoideum, cDNA to mRNA. ORGANISM Dictyostelium discoideum Eukaryota; Animalia; Protozoa; Sarcomastigophora; Sarcodina; Rhizopoda; Eumycetozoa; Dictyostelia; Dictyosteliida; Dictyosteliidae. REFERENCE 1 (bases 1 to 694) AUTHORS Wiesmueller,L., Noegel,A.A., Barzu,O., Gerisch,G. and Schleicher,M. TITLE cDNA-derived sequence of UMP-CMP kinase from Dictyostelium discoideum and expression of the enzyme in Escherichia coli JOURNAL J. Biol. Chem. 265, 6339-6345 (1990) STANDARD simple staff_review FEATURES from to/span description pept 54 638 UMP-CMP kinase mRNA < 1 694 UMP-CMP kinase mRNA BASE COUNT 287 a 74 c 110 g 223 t ORIGIN 1 tttaaatata taaattttgt aaaacaactt caaatttttt aaattcaaga atgatggaaa 61 aaagtaaacc aaatgttgta tttgttttag gtggaccagg ttcaggtaaa ggtactcaat 121 gcgccaacat tgtcagagat tttggatggg tacatttatc agcaggtgat ttattacgtc 181 aagaacaaca atctggttca aaagatggtg aaatgattgc aacaatgatc aaaaatggtg 241 aaatcgttcc atcaattgta actgtaaaat tattaaagaa tgctattgat gctaatcaag 301 gtaaaaactt tttagttgat ggtttcccac gtaatgaaga aaataataat tcttgggaag 361 aaaatatgaa agattttgtt gatacaaaat ttgttttatt ctttgattgt ccagaagaag 421 ttatgactca aagattatta aagagaggtg aatcaagtgg tagatcagat gataatattg 481 aatcaattaa aaagagattc aatacattca acgttcaaac taaattagtt atcgatcatt 541 ataataaatt tgacaaagtt aaaatcattc cagcaaatag agatgttaat gaagtttata 601 atgatgttga aaaccttttt aaatcaatgg gtttttaaaa aaaaaaaaaa aaaaaccaat 661 aaattttaaa attactaaat aataaatata cccc // LOCUS HUMBGALXX 2409 bp ss-mRNA PRI 14-JUL-1990 DEFINITION Human beta-galactosidase (GLB1) mRNA, complete cds. ACCESSION M34423 KEYWORDS beta-galactosidase. SOURCE Human testis, cDNA to mRNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 2409) AUTHORS Yamamoto,Y., Hake,C.A., Martin,B.M., Kretz,K.A., Ahern-Rindell,A.J., Naylor,S.L., Mudd,M. and O'Brien,J.S. TITLE Isolation, characterization, and mapping of a human acid beta- galactosidase cDNA JOURNAL DNA 9, 119-127 (1990) STANDARD simple staff_review FEATURES from to/span description pept 61 2094 beta-galactosidase precursor (EC 3.2.1.23) /nomgen="GLB1" /map="3pter-p21" /hgml_locus_uid="LA0059G " sigp 61 129 beta-galactosidase signal peptide (put.) matp 130 2091 beta-galactosidase BASE COUNT 566 a 635 c 613 g 595 t ORIGIN 1 gaattcgggc gcgaagcggc cggcctgggc gccgactgca gagccgggag gctggtggtc 61 atgccggggt tcctggttcg catcctcctt ctgctgctgg ttctgctgct tctgggccct 121 acgcgcggct tgcgcaatgc cacccagagg atgtttgaaa ttgactatag ccgggactcc 181 ttcctcaagg atggccagcc atttcgctac atctcaggaa gcattcacta ctcccgtgtg 241 ccccgcttct actggaagga ccggctgctg aagatgaaga tggctgggct gaacgccatc 301 cagacgtatg tgccctggaa ctttcatgag ccctggccag gacagtacca gttttctgag 361 gaccatgatg tggaatattt tcttcggctg gctcatgagc tgggactgct ggttatcctg 421 aggcccgggc cctacatctg tgcagagtgg gaaatgggag gattacctgc ttggctgcta 481 gagaaagagt ctattcttct ccgctcctcc gacccagatt acctggcagc tgtggacaag 541 tggttgggag tccttctgcc caagatgaag cctctcctct atcagaatgg agggccagtt 601 ataacagtgc aggttgaaaa tgaatatggc agctactttg cctgtgattt tgactacctg 661 cgcttcctgc agaagcgctt tcgccaccat ctgggggatg atgtggttct gtttaccact 721 gatggagcac ataaaacatt cctgaaatgt ggggccctgc agggcctcta caccacggtg 781 gactttggaa caggcagcaa catcacagat gctttcctaa gccagaggaa gtgtgagccc 841 aaaggaccct tgatcaattc tgaattctat actggctggc tagatcactg gggccaacct 901 cactccacaa tcaagaccga agcagtggct tcctccctct atgatatact tgcccgtggg 961 gcgagtgtga acttgtacat gtttataggt gggaccaatt ttgcctattg gaatggggcc 1021 aactcaccct atgcagcaca gcccaccagc tacgactatg atgccccact gagtgaggct 1081 ggggacctca ctgagaagta ttttgctctg cgaaacatca tccagaagtt tgaaaaagta 1141 ccagaaggtc ctatccctcc atctacacca aagtttgcat atggaaaggt cactttggaa 1201 aagttaaaga cagtgggagc agctctggac attctgtgtc cctctgggcc catcaaaagc 1261 ctttatccct tgacatttat ccaggtgaaa cagcattatg ggtttgtgct gtaccggaca 1321 acacttcctc aagattgcag caacccagca cctctctctt cacccctcaa tggagtccac 1381 gatcgagcat atgttgctgt ggatgggatc ccccagggag tccttgagcg aaacaatgtg 1441 atcactctga acataacagg gaaagctgga gccactctgg accttctggt agagaacatg 1501 ggacgtgtga actatggtgc atatatcaac gattttaagg gtttggtttc taacctgact 1561 ctcagttcca atatcctcac ggactggacg atctttccac tggacactga ggatgcagtg 1621 cgcagccacc tggggggctg gggacaccgt gacagtggcc accatgatga agcctgggcc 1681 cacaactcat ccaactacac gctcccggcc ttttatatgg ggaacttctc cattcccagt 1741 gggatcccag acttgcccca ggacaccttt atccagtttc ctggatggac caagggccag 1801 gtctggatta atggctttaa ccttggccgc tattggccag cccggggccc tcagttgacc 1861 ttgtttgtgc cccagcacat cctgatgacc tcggccccaa acaccatcac cgtgctggaa 1921 ctggagtggg caccctgcag cagtgatgat ccagaactat gtgctgtgac gttcgtggac 1981 aggccagtta ttggctcatc tgtgacctac gatcatccct ccaaacctgt tgaaaaaaga 2041 ctcatgcccc cacccccgca aaaaaacaaa gattcatggc tggaccatgt atgatgatga 2101 aagcctgtgt ctttgaggga ttctaccctg aacatacctc acagatcctc cctgtcatgc 2161 cacatttcac tgattggaat gtggaaatgg aaaaggaatt taggatgtgc attttcacct 2221 gaggtttccc tgcatccctg cagtgccaaa gccccacctt cagggaccac ctggaatgtg 2281 tgaggggctg acagcacagt aacgtgcata catatctgca gggctggaat ggaagcttta 2341 aaggtggtag tgatttttat tttggaagaa tcatgttacc tttttgttaa ataaaatttg 2401 cccgaattc // LOCUS HUMGAA 3441 bp ss-mRNA PRI 14-JUL-1990 DEFINITION Human acid alpha-glucosidase (GAA) mRNA, complete cds. ACCESSION M34424 KEYWORDS acid alpha-glucosidase; acid maltase. SOURCE Human, cDNA to mRNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 3441) AUTHORS Martiniuk,F., Mehler,M., Tzall,S., Meredith,G. and Hirschhorn,R. TITLE Sequence of the cDNA and 5'-flanking region for human acid alpha- glucosidase, detection of an intron in the 5' untranslated leader sequence, definition of 18-bp polymorphisms, and differences with previous cDNA and amino acid sequences JOURNAL DNA 9, 85-94 (1990) STANDARD simple staff_review FEATURES from to/span description pept 33 2891 acid alpha-glucosidase /nomgen="GAA" /map="17q23" /hgml_locus_uid="LZ0096H" BASE COUNT 612 a 1170 c 1027 g 632 t ORIGIN 1 gcctgtagga gctgtccagg ccatctccaa ccatgggagt gaggcacccg ccctgctccc 61 accggctcct ggccgtctgc gccctcgtgt ccttggcaac cgctgcactc ctggggcaca 121 tcctactcca tgatttcctg ctggttcccc gagagctgag tggctcctcc ccagtcctgg 181 aggagactca cccagctcac cagcagggag ccagcagacc agggccccgg gatgcccagg 241 cacaccccgg ccgtcccaga gcagtgccca cacagtgcga cgtccccccc aacagccgct 301 tcgattgcgc ccctgacaag gccatcaccc aggaacagtg cgaggcccgc ggctgctgct 361 acatccctgc aaagcagggg ctgcagggag cccagatggg gcagccctgg tgcttcttcc 421 cacccagcta ccccagctac aagctggaga acctgagctc ctctgaaatg ggctacacgg 481 ccaccctgac ccgtaccacc cccaccttct tccccaagga catcctgacc ctgcggctgg 541 acgtgatgat ggagactgag aaccgcctcc acttcacgat caaagatcca gctaacaggc 601 gctacgaggt gcccttggag accccgcatg tccacagccg ggcaccgtcc ccactctaca 661 gcgtggagtt ctccgaggag cccttcgggg tgatcgtgcg ccggcagctg gacggccgcg 721 tgctgctgaa cacgacggtg gcgcccctgt tctttgcgga ccagttcctt cagctgtcca 781 cctcgctgcc ctcgcagtat atcacaggcc tcgccgagca cctcagtccc ctgatgctca 841 gcaccagctg gaccaggatc accctgtgga accgggacct tgcgcccacg cccggtgcga 901 acctctacgg gtctcaccct ttctacctgg cgctggagga cggcgggtcg gcacacgggg 961 tgttcctgct aaacagcaat gccatggatg tggtcctgca gccgagccct gcccttagct 1021 ggaggtcgac aggtgggatc ctggatgtct acatcttcct gggcccagag cccaagagcg 1081 tggtgcagca gtacctggac gttgtgggat acccgttcat gccgccatac tggggcctgg 1141 gcttccacct gtgccgctgg ggctactcct ccaccgctat cacccgccag gtggtggaga 1201 acatgaccag ggcccacttc cccctggacg tccagtggaa cgacctggac tacatggact 1261 cccggaggga cttcacgttc aacaaggatg gcttccggga cttcccggcc atggtgcagg 1321 agctgcacca gggcggccgg cgctacatga tgatcgtgga tcctgccatc agcagctcgg 1381 gccctgccgg gagctacagg ccctacgacg agggtctgcg gaggggggtt ttcatcacca 1441 acgagaccgg ccagccgctg attgggaagg tatggcccgg gtccactgcc ttccccgact 1501 tcaccaaccc cacagccctg gcctggtggg aggacatggt ggctgagttc catgaccagg 1561 tgcccttcga cggcatgtgg attgacatga acgagccttc caacttcatc aggggctctg 1621 aggacggctg ccccaacaat gagctggaga acccacccta cgtgcctggg gtggttgggg 1681 ggaccctcca ggcggccacc atctgtgcct ccagccacca gtttctctcc acacactaca 1741 acctgcacaa cctctacggc ctgaccgaag ccatcgcctc ccacagggcg ctggtgaagg 1801 ctcgggggac acgcccattt gtgatctccc gctcgacctt tgctggccac ggccgatacg 1861 ccggccactg gacgggggac gtgtggagct cctgggagca gctcgcctcc tccgtgccag 1921 aaatcctgca gtttaacctg ctgggggtgc ctctggtcgg ggccgacgtc tgcggcttcc 1981 tgggcaacac ctcagaggag ctgtgtgtgc gctggaccca gctgggggcc ttctacccct 2041 tcatgcggaa ccacaacagc ctgctcagtc tgccccagga gccgtacagc ttcagcgagc 2101 cggcccagca ggccatgagg aaggccctca ccctgcgcta cgcactcctc ccccacctct 2161 acacactgtt ccaccaggcc cacgtcgcgg gggagaccgt ggcccggccc ctcttcctgg 2221 agttccccaa ggactctagc acctggactg tggaccacca gctcctgtgg ggggaggccc 2281 tgctcatcac cccagtgctc caggccggga aggccgaagt gactggctac ttccccttgg 2341 gcacatggta cgacctgcag acggtgccaa tagaggccct tggcagcctc ccacccccac 2401 ctgcagctcc ccgtgagcca gccatccaca gcgaggggca gtgggtgacg ctgccggccc 2461 ccctggacac catcaacgtc cacctccggg ctgggtacat catccccctg cagggccctg 2521 gcctcacaac cacagagtcc cgccagcagc ccatggccct ggctgtggcc ctgaccaagg 2581 gtggggaggc ccgaggggag ctgttctggg acgatggaga gagcctggaa gtgctggagc 2641 gaggggccta cacacaggtc atcttcctgg ccaggaataa cacgatcgtg aatgagctgg 2701 tacgtgtgac cagtgaggga gctggcctgc agctgcagaa ggtgactgtc ctgggcgtgg 2761 ccacggcgcc ccagcaggtc ctctccaacg gtgtccctgt ctccaacttc acctacagcc 2821 ccgacaccaa ggtcctggac atctgtgtct cgctgttgat gggagagcag tttctcgtca 2881 gctggtgtta gccgggcgga gtgtgttagt ctctccagag ggaggctggt tccccaggga 2941 agcagagcct gtgtgcgggc agcagctgtg tgcgggcctg ggggttgcat gtgtcacctg 3001 gagctgggca ctaaccattc caagccgccg catcgcttgt ttccacctcc tgggccgggg 3061 ctctggcccc caacgtgtct aggagagctt tctccctaga tcgcactgtg ggccggggcc 3121 tggagggctg ctctgtgtta ataagattgt aaggtttgcc ctcctcacct gttgccggca 3181 tgcgggtagt attagccacc cccctccatc tgttcccagc accggagaag ggggtgctca 3241 ggtggaggtg tggggtatgc acctgagctc ctgcttcgcg cctgctgctc tgccccaacg 3301 cgaccgctgc ccggctgccc agagggctgg atgcctgccg gtccccgagc aagcctggga 3361 actcaggaaa attcacagga cttgggagat tctaaatctt aagtgcaatt atttttaata 3421 aaaggggcat ttggaatcaa a // LOCUS HUMGAAX 298 bp ds-DNA PRI 14-JUL-1990 DEFINITION Human acid alpha-glucosidase (GAA) gene, 5' flank. ACCESSION M34425 KEYWORDS acid alpha-glucosidase; acid maltase. SOURCE Human DNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 298) AUTHORS Martiniuk,F., Mehler,M., Tzall,S., Meredith,G. and Hirschhorn,R. TITLE Sequence of the cDNA and 5'-flanking region for human acid alpha- glucosidase, detection of an intron in the 5' untranslated leader sequence, definition of 18-bp polymorphisms, and differences with previous cDNA and amino acid sequences JOURNAL DNA 9, 85-94 (1990) STANDARD simple staff_review FEATURES from to/span description IVS 274 > 298 acid alpha-glucosidase, intron A /hgml_locus_uid="LZ0096H" /nomgen="GAA" /map="17q23" BASE COUNT 38 a 88 c 128 g 44 t ORIGIN 1 cgtgcggagg tgagccgggc cggggctgcg gggcttccct gagcgcgggc cgggtcggtg 61 gggcggtcgg ctgcccgcgc ggcctctcag tgggaaagct gaggttgtcg ccggggccgc 121 gggtggaggt cggggatgag gcagcaggta ggacagtgac ctcggtgacg cgaaggaccc 181 cggccacctc taggttctcc tcgtccgccc gttgttcagc gagggaggct ctgggcctgc 241 cgcagctgac ggggaaactg aggcacggag cgggtgagac acctgacgtc tgccccgc // LOCUS HUMMAK 171 bp ds-DNA PRI 14-JUL-1990 DEFINITION Human male germ cell-associated kinase (mak) gene, exon N. ACCESSION M35863 M34569 KEYWORDS kinase. SOURCE Human DNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 171) AUTHORS Matsushime,H., Jinno,A., Takagi,N. and Shibuya,M. TITLE A novel mammalian protein kinase gene (mak) is highly expressed in testicular germ cells at and after meiosis JOURNAL Mol. Cell. Biol. 10, 2261-2268 (1990) STANDARD simple staff_review FEATURES from to/span description pept / 25 / 157 male germ cell-associated kinase (mak), exon N (AA at 27) IVS < 1 24 mak intron X IVS 158 > 171 mak intron X+1 BASE COUNT 48 a 30 c 38 g 55 t ORIGIN 1 tttttttctc cgtatatcat caaggctttt ttcataggga catgaaacca gaaaacttgc 61 tttgtatggg tccagagctt gtgaaaattg ctgattttgg acttgcaaga gaattaaggt 121 cacagccacc atacactgac tatgtatcta ccagatggtg agtaggggtt a // LOCUS RATMAK 2372 bp ss-mRNA ROD 14-JUL-1990 DEFINITION Rat male germ cell-associated kinase (mak) mRNA, complete cds. ACCESSION M35862 M34569 KEYWORDS kinase. SOURCE Rat, cDNA to mRNA. ORGANISM Rattus norvegicus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 2372) AUTHORS Matsushime,H., Jinno,A., Takagi,N. and Shibuya,M. TITLE A novel mammalian protein kinase gene (mak) is highly expressed in testicular germ cells at and after meiosis JOURNAL Mol. Cell. Biol. 10, 2261-2268 (1990) STANDARD simple staff_review FEATURES from to/span description pept 237 2105 male germ cell-associated kinase (mak) mRNA < 1 2372 mak mRNA BASE COUNT 683 a 575 c 574 g 540 t ORIGIN 1 tctgaatatg tggagactat tatagcaaaa tggactcagt gtctctttgc cagattcctt 61 attgctgtat cccatgagcc tacctgacct tgaggaagag ccgtgtttaa tgatgacaag 121 cctgtctccc gtgtagtgac aaaaaggagg aggaccacaa ggaaagacgg atttagattt 181 ctactcagca aggaagcgag ctctgtcagc aacctctctg gattaccctt cccaagatga 241 accgatatac aaccatgagg cagctggggg acggcacgta cgggagtgtg cttatgggca 301 agagcaatga gtctggggag ctggtggcaa tcaagaggat gaagagaaag ttctattctt 361 gggatgagtg tatgaacttg cgagaagtta agtccctgaa gaaactcaat catgccaatg 421 tgattaaact aaaagaagtt atcagagaaa atgaccatct ttattttata tttgaatata 481 tgaaagaaaa cctctatcag ctaatgaaag acagaaacaa actgttccct gagtcagtca 541 tcagaaatat tatgtatcaa atactacagg ggctggcatt tatccacaaa catggctttt 601 ttcacaggga catgaaacct gagaatttgc tttgcatggg tccagagctg gtgaagattg 661 ctgattttgg acttgcgaga gaattaagat cacagccacc atatactgac tatgtgtcta 721 ccagatggta ccgtgctcct gaagttttgc taaggtcttc agtgtacagc tctcccattg 781 acgtgtgggc cgtgggaagt ataatggccg agctatatac gtttagaccg cttttcccag 841 ggaccagtga agttgatgag atctttaaaa tttgccaagt gttagggact cccaagaaaa 901 gtgactggcc ggaggggtac cagctggcat cctccatgaa cttccgcttt ccccagtgca 961 ttcctataaa cctgaaaact ctcattccca atgccagtag tgaggctatt cagcttatga 1021 cagaaatgct taactgggat ccaaagaaac ggccaactgc aagccaggca ctgaagcacc 1081 catattttca agtcggtcag gtattgggcc cttctgcaca ccatctggat gcaaaacaga 1141 ctttgcacaa gcagctgcag cctccagagc caaagccatc ttcctctgaa cgggatccta 1201 agcctttgcc aaacatcctt gatcagcctg ccgggcagcc ccagccaaaa cagggccacc 1261 aaccactgca ggccattcag ccaccacaga acacagtggt tcagccacct ccaaagcagc 1321 agggtcacca taagcaacca caaacgatgt ttccaagtat cgtcaaaacc ataccaacga 1381 atccagtcag cacagtaggc cataaggggg cccggagacg gtggggtcag acagtcttca 1441 agtctggaga cagctgtgac aacatcgagg actgcgactt gggagcctcc cactccaaga 1501 agccgagcat ggatgccttc aaggaaaaaa agaagaagga gtctccattt cggtttccag 1561 aagcaggact cccagtctcc aaccacttga agggggaaaa tagaaattta catgcatcct 1621 taaaatctga cacaaacttg tcaactgctt caaccgctaa gcagtactat ttgaaacaat 1681 caagatacct tccgggtgtg aaccccaaga acgtgtcttt ggtagctggc ggcaaggata 1741 taaattcaca ctcttggaat aatcagctat ttcctaagtc tctgggatcc atgggggcgg 1801 acctcgcttt caagaggagt aacgcagcag ggaaccttgg gagctacagc gcttacagcc 1861 agacgggatg cgtgccttcc tttctcaaga aagaagtggg atcagctggc cagaggatcc 1921 acttggcgcc tctgggtgcg tcggctgccg actatacctg gagcacaaaa actggccgag 1981 gccagttttc aggacgaact tacaatccca cagccaaaaa tctcaatatt gtgaaccgca 2041 cacagccagt cccctcggtg cacgggagga cagactgggt ggctaagtac ggaggccacc 2101 ggtaggagaa gtggaagcct gaagcattgc tccgtagagg acaatcacgc cccttgatcc 2161 tgggagacgt ctacagagtc tatttctacc gagttccaca ggacgcacac aaccgtgggc 2221 gcctcagaga ccggaagtca gctcccgatt tcttcctttt ccggaaatgc aatgcatttt 2281 cttaacttgt ggccagcagt gctgatgcag caccgtgctg aaacttttga gcgggacttt 2341 taaagagtat tgaataaatt atttgccaaa gt // LOCUS BNRCFIA 1930 bp ds-DNA BCT 14-JUL-1990 DEFINITION B.fragilis imipenem-cefoxitin-hydrolyzing enzyme (cfiA) gene, complete cds. ACCESSION M34831 KEYWORDS imipenem-cefoxitin-hydrolyzing enzyme. SOURCE B.fragilis (strain TAL2480) DNA. ORGANISM Bacteroides fragilis Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Anaerobic gram-negative straight, curved and helical rods; Bacteroidaceae. REFERENCE 1 (bases 1 to 1930) AUTHORS Thompson,J.S. and Malamy,M.H. TITLE Sequencing the gene for an imipenem-cefoxitin-hydrolyzing enzyme (cfiA) from Bacteroides fragilis TAL2480 reveals strong similarity between cfiA and Bacillus cereus beta-lactamase II JOURNAL J. Bacteriol. 172, 2584-2593 (1990) STANDARD simple staff_review FEATURES from to/span description pept 533 1282 imipenem-cefoxitin-hydrolyzing enzyme precursor sigp 533 586 imipenem-cefoxitin-hydrolyzing enzyme signal peptide matp 587 1279 imipenem-cefoxitin-hydrolyzing enzyme BASE COUNT 563 a 435 c 434 g 498 t ORIGIN 1 gatcaataag cagttcatct tgtctgcagt attgaaatca taggagttgt ccagaagcag 61 atttataggt gatgttctta caagtcagtt cttcgatagc acgcaatatg gtgtctgcgc 121 tgcaagtgcg aagagttgga tgaagagaca aatgtttcat caggtgagtt gtaacatcct 181 caatacatga gccgccacaa agatatacgc acatcagaga gcgtagaatt tcgctatatt 241 gataaccaaa catagtgcat ctcaatccca aggtggaatc tatggtttga gctaaaagag 301 catcaaattg ctccataata gaaaaaattc ctccaaaagg agtgagtttc tcagatttta 361 tttgtacttt tgccatgtca tattagagtt ttgcttgtct tcttttcgca acactaaggt 421 aagtgaaaat tctgacatgg caaaatcctg ggcaactttt tgttgctcag gcacttataa 481 ataatgttaa actatagtgt tgcggaatta aggcatcata taaaagaata aaatgaaaac 541 agtatttatc cttatctcca tgcttttccc tgtcgcagtt atggcacaga aaagcgtaaa 601 aatatccgat gacatcagta tcacccaact ctcggacaaa gtgtacactt atgtatccct 661 cgccgaaatc gaaggatggg gtatggtacc ttccaacggg atgattgtta tcaacaacca 721 ccaggcagcg ttgctggaca caccgatcaa tgacgcacaa acggaaatgc tggtcaactg 781 ggtgacagac tctttgcatg ccaaagtcac cacgtttatc ccgaaccact ggcacggcga 841 ttgtattggc ggactgggtt acctgcaaag gaaaggtgtc caatcatacg cgaaccagat 901 gacgatagac ctcgccaagg aaaaagggtt gcccgtaccg gaacatggat tcaccgattc 961 actgaccgtc agcttggacg gcatgcctct ccaatgttat tatttaggag gcggacatgc 1021 gaccgacaat atcgtggttt ggctgccgac agagaatatc ctttttggcg gatgtatgct 1081 taaagacaac caggcgacaa gcatcggcaa catctcggac gcggacgtga cggcatggcc 1141 gaaaactctc gataaggtaa aagccaagtt cccctcggcc cgttacgtcg tgcccggaca 1201 tggcgactat ggcggaaccg aactgataga gcataccaag cagatcgtga accaatatat 1261 agaaagcact tcaaagccat agcccgaatt gtaggcatct acccggtatt ccatgaacga 1321 agaggcgaaa gcttttatcc cgtccacaga atgtaacccg gccacaaacg agacataaaa 1381 atatcaattg aacaacacaa cagatccttg tttcatctac ggcgtcgcag ttaggtcttt 1441 ttttatcaat tgttaagcgt ttcttcccga ttattagcga tctttgccat caaaaaataa 1501 acgatatgaa ttggattatc ctaacgattg ccggatgctt tgaggtgctg ttcactttct 1561 gcatggggaa agtgaaagaa acaaccggaa cgaaatgtac ctgtggttca ccggattcct 1621 gatcgcaact tttataagta tgttactact ggtgaaagcc acccagaatc tgcccatcgg 1681 cacagcttat gccgtgtgga caggaatagg cgctgtgggt acggtgctcg taggtatcct 1741 tgtgtttagc gaacccgtca gttttggaga ctcttcttta tagtgatgct gatcagttca 1801 ataatcgggc tgaagatggt ttcgcattga gggggaaatt catccctcag cacctccggc 1861 aattccgcca acgaacggat catcagatac tcctccgacg atgaatctcc tcgtgttcac 1921 atcgctcagc // LOCUS CLTSTRRA 142 bp ds-DNA BCT 14-JUL-1990 DEFINITION Callothrix sp. short tandem repeat family 1 (STRR1). ACCESSION M34684 KEYWORDS . SOURCE Callothrix sp. (strain PCC 7601) DNA, clone 1-A. ORGANISM Calothrix sp. Prokaryota; Bacteria; Gracilicutes; Oxyphotobacteria; Cyanobacteria; Nostocales; Rivulariaceae. REFERENCE 1 (bases 1 to 142) AUTHORS Mazel,D., Houmard,J., Castets,A.M. and Tandeau de Marsac,N. TITLE Highly repetitive DNA sequences in Cyanobacterial genomes JOURNAL J. Bacteriol. 172, 2755-2761 (1990) STANDARD simple staff_review FEATURES from to/span description rpt 36 77 STRR1 sequence rpt 78 119 STRR1 sequence BASE COUNT 28 a 64 c 19 g 31 t ORIGIN 1 tatttattcc catcacccaa tccccaatcc ccagtcccca ttacccctta tcccagaggg 61 gccccgagtt ccccaatccc cattacccct tatcccagag ggtccccgag ttccccagtc 121 cccagtcccc aatcctgact gg // LOCUS CLTSTRRB 73 bp ds-DNA BCT 14-JUL-1990 DEFINITION Callothrix sp. short tandem repeat family 1 (STRR1). ACCESSION M34685 KEYWORDS . SOURCE Callothrix sp. (strain PCC 7601) DNA, clone 1-B. ORGANISM Calothrix sp. Prokaryota; Bacteria; Gracilicutes; Oxyphotobacteria; Cyanobacteria; Nostocales; Rivulariaceae. REFERENCE 1 (bases 1 to 73) AUTHORS Mazel,D., Houmard,J., Castets,A.M. and Tandeau de Marsac,N. TITLE Highly repetitive DNA sequences in Cyanobacterial genomes JOURNAL J. Bacteriol. 172, 2755-2761 (1990) STANDARD simple staff_review FEATURES from to/span description BASE COUNT 19 a 33 c 5 g 16 t ORIGIN 1 ggagttctcc aatccccaat ccccagtccc caatccccat tccccattcc ccattcccct 61 caaactaaag aat // LOCUS CLTSTRRC 64 bp ds-DNA BCT 14-JUL-1990 DEFINITION Callothrix sp. short tandem repeat family 1 (STRR1). ACCESSION M34686 KEYWORDS . SOURCE Callothrix sp. (strain PCC 7601) DNA, clone 1-C. ORGANISM Calothrix sp. Prokaryota; Bacteria; Gracilicutes; Oxyphotobacteria; Cyanobacteria; Nostocales; Rivulariaceae. REFERENCE 1 (bases 1 to 64) AUTHORS Mazel,D., Houmard,J., Castets,A.M. and Tandeau de Marsac,N. TITLE Highly repetitive DNA sequences in Cyanobacterial genomes JOURNAL J. Bacteriol. 172, 2755-2761 (1990) STANDARD simple staff_review FEATURES from to/span description BASE COUNT 16 a 26 c 2 g 20 t ORIGIN 1 tttttccaaa tccccaatcc ctagtcccca atccctagtc cccaatccct aatcatttca 61 attc // LOCUS CLTSTRRD 49 bp ds-DNA BCT 14-JUL-1990 DEFINITION Callothrix sp. short tandem repeat family 1 (STRR1). ACCESSION M34687 KEYWORDS . SOURCE Callothrix sp. (strain PCC 7601) DNA, clone 1-C. ORGANISM Calothrix sp. Prokaryota; Bacteria; Gracilicutes; Oxyphotobacteria; Cyanobacteria; Nostocales; Rivulariaceae. REFERENCE 1 (bases 1 to 49) AUTHORS Mazel,D., Houmard,J., Castets,A.M. and Tandeau de Marsac,N. TITLE Highly repetitive DNA sequences in Cyanobacterial genomes JOURNAL J. Bacteriol. 172, 2755-2761 (1990) STANDARD simple staff_review FEATURES from to/span description BASE COUNT 9 a 23 c 5 g 12 t ORIGIN 1 ttcctctgct tccccagtcc ccagtcccca gtccccaccc ataaaagtt // LOCUS CLTSTRRE 74 bp ds-DNA BCT 14-JUL-1990 DEFINITION Callothrix sp. short tandem repeat family 1 (STRR1). ACCESSION M34688 KEYWORDS . SOURCE Callothrix sp. (strain PCC 7601) DNA, clone 1-E. ORGANISM Calothrix sp. Prokaryota; Bacteria; Gracilicutes; Oxyphotobacteria; Cyanobacteria; Nostocales; Rivulariaceae. REFERENCE 1 (bases 1 to 74) AUTHORS Mazel,D., Houmard,J., Castets,A.M. and Tandeau de Marsac,N. TITLE Highly repetitive DNA sequences in Cyanobacterial genomes JOURNAL J. Bacteriol. 172, 2755-2761 (1990) STANDARD simple staff_review FEATURES from to/span description BASE COUNT 6 a 33 c 3 g 32 t ORIGIN 1 aggttttgta ttttcccttt cccctttcac cttttcccct tcccctttca cctttcccct 61 ttccccaccc ctat // LOCUS CLTSTRRF 75 bp ds-DNA BCT 14-JUL-1990 DEFINITION Callothrix sp. short tandem repeat family 2 (STRR2). ACCESSION M34689 KEYWORDS . SOURCE Callothrix sp. (strain PCC 7601) DNA, clone 2-A. ORGANISM Calothrix sp. Prokaryota; Bacteria; Gracilicutes; Oxyphotobacteria; Cyanobacteria; Nostocales; Rivulariaceae. REFERENCE 1 (bases 1 to 75) AUTHORS Mazel,D., Houmard,J., Castets,A.M. and Tandeau de Marsac,N. TITLE Highly repetitive DNA sequences in Cyanobacterial genomes JOURNAL J. Bacteriol. 172, 2755-2761 (1990) STANDARD simple staff_review FEATURES from to/span description BASE COUNT 14 a 8 c 15 g 38 t ORIGIN 1 ctggtttgtt ttgttattgg tcattggtca tttgtcattg gtcatttgtc attagttatt 61 ggtaattcat atcaa // LOCUS CLTSTRRG 64 bp ds-DNA BCT 14-JUL-1990 DEFINITION Callothrix sp. short tandem repeat family 2 (STRR2). ACCESSION M34690 KEYWORDS . SOURCE Callothrix sp. (strain PCC 7601) DNA, clone 2-B. ORGANISM Calothrix sp. Prokaryota; Bacteria; Gracilicutes; Oxyphotobacteria; Cyanobacteria; Nostocales; Rivulariaceae. REFERENCE 1 (bases 1 to 64) AUTHORS Mazel,D., Houmard,J., Castets,A.M. and Tandeau de Marsac,N. TITLE Highly repetitive DNA sequences in Cyanobacterial genomes JOURNAL J. Bacteriol. 172, 2755-2761 (1990) STANDARD simple staff_review FEATURES from to/span description BASE COUNT 10 a 10 c 11 g 33 t ORIGIN 1 tttttgttgt ttgtcatttg tcattggtca tttgtcatta gtcctttgtc attcaaaccg 61 agtt // LOCUS CLTSTRRH 77 bp ds-DNA BCT 14-JUL-1990 DEFINITION Callothrix sp. short tandem repeat family 2 (STRR2). ACCESSION M34691 KEYWORDS . SOURCE Callothrix sp. (strain PCC 7601) DNA, clone 2-C. ORGANISM Calothrix sp. Prokaryota; Bacteria; Gracilicutes; Oxyphotobacteria; Cyanobacteria; Nostocales; Rivulariaceae. REFERENCE 1 (bases 1 to 77) AUTHORS Mazel,D., Houmard,J., Castets,A.M. and Tandeau de Marsac,N. TITLE Highly repetitive DNA sequences in Cyanobacterial genomes JOURNAL J. Bacteriol. 172, 2755-2761 (1990) STANDARD simple staff_review FEATURES from to/span description BASE COUNT 18 a 6 c 17 g 36 t ORIGIN 1 ccaatttgta ttagtcattt gttattagtc atttgtcatt tgtcatttgg ggtttgtaat 61 tggtaatagg aggaatt // LOCUS CLTSTRRI 85 bp ds-DNA BCT 14-JUL-1990 DEFINITION Callothrix sp. short tandem repeat family 2 (STRR2). ACCESSION M34692 KEYWORDS . SOURCE Callothrix sp. (strain PCC 7601) DNA, clone 2-D. ORGANISM Calothrix sp. Prokaryota; Bacteria; Gracilicutes; Oxyphotobacteria; Cyanobacteria; Nostocales; Rivulariaceae. REFERENCE 1 (bases 1 to 85) AUTHORS Mazel,D., Houmard,J., Castets,A.M. and Tandeau de Marsac,N. TITLE Highly repetitive DNA sequences in Cyanobacterial genomes JOURNAL J. Bacteriol. 172, 2755-2761 (1990) STANDARD simple staff_review FEATURES from to/span description BASE COUNT 25 a 0 c 25 g 35 t ORIGIN 1 atggggtaat tggtaattgg taattggtaa ttggtaattg gtaattggta attggtaatt 61 ggtaattggt aattggtaat agatt // LOCUS CLTSTRRJ 80 bp ds-DNA BCT 14-JUL-1990 DEFINITION Callothrix sp. short tandem repeat family 2 (STRR2). ACCESSION M34693 KEYWORDS s. SOURCE Callothrix sp. (strain PCC 7601) DNA, clone 2-E. ORGANISM Calothrix sp. Prokaryota; Bacteria; Gracilicutes; Oxyphotobacteria; Cyanobacteria; Nostocales; Rivulariaceae. REFERENCE 1 (bases 1 to 80) AUTHORS Mazel,D., Houmard,J., Castets,A.M. and Tandeau de Marsac,N. TITLE Highly repetitive DNA sequences in Cyanobacterial genomes JOURNAL J. Bacteriol. 172, 2755-2761 (1990) STANDARD simple staff_review FEATURES from to/span description BASE COUNT 13 a 13 c 17 g 37 t ORIGIN 1 tcccatgtca tttgtcattt gtcattggtt attggttatt ggtcatttgt catttggggt 61 ttggtaattg acctcaactc // LOCUS CLTSTRRK 76 bp ds-DNA BCT 14-JUL-1990 DEFINITION Callothrix sp. short tandem repeat family 3 (STRR3). ACCESSION M34694 KEYWORDS . SOURCE Callothrix sp. (strain PCC 7601) DNA, clone 3. ORGANISM Calothrix sp. Prokaryota; Bacteria; Gracilicutes; Oxyphotobacteria; Cyanobacteria; Nostocales; Rivulariaceae. REFERENCE 1 (bases 1 to 76) AUTHORS Mazel,D., Houmard,J., Castets,A.M. and Tandeau de Marsac,N. TITLE Highly repetitive DNA sequences in Cyanobacterial genomes JOURNAL J. Bacteriol. 172, 2755-2761 (1990) STANDARD simple staff_review FEATURES from to/span description BASE COUNT 25 a 15 c 15 g 21 t ORIGIN 1 ttttaagtct gtgaacagtc aacagtcaac agtcaacagt taacagttga aaggacacag 61 cagtgctatg tccttt // LOCUS CPACM559B 1076 bp ds-DNA ORG 14-JUL-1990 DEFINITION C.paradoxa cyanelle cytochrome b-559 (psbE, psbF, psbI, and psbJ) genes, complete cds. ACCESSION M35129 KEYWORDS cytochrome b-559. SOURCE C.paradoxa cyanelle DNA. ORGANISM Cyanelle Cyanophora paradoxa Eukaryota; Plantae; Thallobionta; Chromophycota; Cryptophyceae; Cryptomonadales; Kathablepharidaceae; Cyanophora paradoxa. REFERENCE 1 (bases 1 to 1076) AUTHORS Cantrell,A. and Bryant,D.A. TITLE Nucleotide sequence of the genes encoding cytochrome b-559 from the cyanelle genome of Cyanophora paradoxa JOURNAL Photosyn. Res. 16, 65-81 (1988) STANDARD simple staff_review FEATURES from to/span description pept 246 470 cytochrome b-559 apoprotein (psbE) pept 510 638 cytochrome b-559 apoprotein (psbF) pept 648 764 cytochrome b-559 apoprotein (psbI) pept 787 909 cytochrome b-559 apoprotein (psbJ) signal 926 951 pot. transcription termination signal BASE COUNT 367 a 142 c 160 g 407 t ORIGIN 1 tctttaagta gtcagatatc tggaaatact tggaaaaaag aaacaagtac agacaatatt 61 ccttcaaatt tctataaaat tagttttata gataaagaag taggtttata ctaggaaatc 121 agggaacact tttacgatat gtatcattat aatagaataa ttaatatctt ataatttgaa 181 taaaatcttt tattatacat aaaagcatca tatatatata tatataattt ttaggagtga 241 tttatatgtc tggaggaact actggcgaac gcccattttc tgacattgtt actagtattc 301 gttattgggt tattcatact gtaactattc cattctttat tgttgcaggt tggctttttg 361 taagtactgg tttagcttat gatgtatttg gtactccaag accagatgaa tacttcactg 421 aagaacgtca agaagtacca attattaatc aacgtttttc aactaattaa tcgttattaa 481 tttatagttt ctaaaagagg aaaaaggaaa tgaataatcc taatcaaccg gtttcttatc 541 caatttttac agttagatgg ttagcaattc atgctattgg aattccagct gtatttttta 601 ttggatctat tactgcaatg caatttattc aacgatagga gatatatatg gttagccaaa 661 atcctaatag acaaaaagtt gaattaaatc gtacttccct attctgggga ttacttttaa 721 ttttcgtatt agcaatttta ttctctagct acatttttaa ctaaatttct tcaataagga 781 gtctttatgg caaatactgg tggacgcatt cctttatggc ttgttgctac agttgcaggt 841 ttagcagcta ttggtgtact aggaatcttt ttctatggtg gttactctgg tttaggttcc 901 tctatttaat aaaatcaatt taagattttt tgtaataaat taattaaaaa aaaatagaga 961 gttaaatttt tttaactctc tattctttag acatttataa ctaataaaag aaattaagca 1021 atttcatctg tttcaatata aacaaataat aatgccataa cattgctggt aagaca // LOCUS KVSM121M 612 bp ds-RNA VRL 14-JUL-1990 DEFINITION Killer virus of S.cerevisiae (class M1, type 1), M1-2 region mRNA fragment. ACCESSION M35150 KEYWORDS . SOURCE Killer virus of S.cerevisiae (strain A364A x S7) class M1 type 1, cDNA to viral RNA, clones pEH-[2,3]. ORGANISM Killer virus of S.cerevisiae Viridae; ds-RNA nonenveloped viruses. REFERENCE 1 (bases 1 to 612) AUTHORS Georgopoulos,D.E., Hannig,E.M. and Leibowitz,M.J. TITLE Sequence of the M1-2 region of killer virus double-stranded RNA JOURNAL Basic Life Sci. 40, 203-213 (1986) STANDARD simple staff_review BASE COUNT 205 a 140 c 141 g 125 t 1 others ORIGIN 1 aaagaaagaa cnaaaagaaa cagaaaaaga gagaacagga caacaaacgc aacaaaacac 61 aaacacaagc acactcacct tgagtctaac tggtggcacg cagcatatct caccctgaga 121 ctaactggcg gcaggcgacc gtgagcatac agcatgcccc actcgattcg agacgcgatt 181 cgcgctcgta ggtatcgagc ggctacgttg agctattatg gcagtgacat gcgattcgcg 241 cactgccaag atcagctcag caaagttaag accagtatcg gatatggtag actactacaa 301 ttcgcacagg tatgagattc tcagtctagt gtatggatga gtagttgagc caatgaatct 361 agggtttaaa ttactatgca ttgacatata acaggtacaa gcgtagataa tacttactag 421 gccccagccg gtacaccctg tattgaataa atacgactat ttggccaggt ctggacgggg 481 cagtcgaatt actaggttga gcacacacac gtgaatcaca caacataaca gtgtaggaac 541 ataatgtgcc attcgtagtc tgagacgccg ctagcctggt ttaatgcaac agcatagaag 601 aaacacacat ca // LOCUS PIGHMG2 1153 bp ss-mRNA MAM 14-JUL-1990 DEFINITION Pig non-histone chromosomal protein (HMG2) mRNA, complete cds. ACCESSION J02895 KEYWORDS non-histone chromosomal protein. SOURCE Pig thymus, cDNA to mRNA, clone pCD-MK02. ORGANISM Sus scrofa Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Artiodactyla; Suiformes; Suidae. REFERENCE 1 (bases 1 to 1153) AUTHORS Shirakawa,H., Tsuda,K.-i. and Yoshida,M. TITLE Primary structure of non-histone chromosomal protein HMG2 revealed by the nucleotide sequence JOURNAL Biochemistry 29, 4419-4423 (1990) STANDARD full staff_entry COMMENT Draft entry and computer readable sequence for [1] kindly provided by M.Yoshida, 07-DEC-1989. FEATURES from to/span description pept 144 776 non-histone protein HMG2 precursor matp 147 773 non-histone protein HMG2 mRNA < 1 1153 HMG2 mRNA BASE COUNT 361 a 224 c 301 g 267 t ORIGIN 53 bp upstream of HaeIII site. 1 acaaaccagt tcacgccgga gccgcaggta ggcagcgtcg cggtcggacc cggccgctgc 61 gcgggagcct gaggagaagc taccaccagg caagagaccc ctccgggccc cggtggacgc 121 gtcgtcgccg gcgccgcagc accatgggga agggcgaccc caacaagccg cggggcaaga 181 tgtcctcgta cgccttcttc gtgcagacct gccgggagga gcacaagaag aaacaccccg 241 attcctcggt caacttcgcc gagttctcca agaagtgctc cgagcgatgg aagactatgt 301 ctgccaagga aaagtccaag tttgaagata tggcaaaaag tgacaaagct cgctatgacc 361 gggagatgaa aaattacgtc cctcccaagg gtgacaagaa gggcaagaaa aaagatccca 421 atgctcccaa aaggcctcca tctgccttct tcctgttttg ctctgaacat cgcccaaaga 481 tcaaaagtga acaccctggc ttatccattg gggatactgc aaagaaattg ggtgaaatgt 541 ggtctgagca gtcagccaaa gataaacaac cgtatgaaca gaaagcagct aagctaaagg 601 aaaaatatga aaaggatatt gctgcatacc gtgccaaggg taagggtgaa gcaggaaaga 661 agggccctgg caggccaaca ggctctaaga agaagaatga accagaagat gaggaggaag 721 aggaagagga ggaagaagat gaagacgagg aggaagaaga tgaagatgag gaataaatgg 781 ctatcctgta atgttgtgtg tggagtgtgt gtgtgtgctc aggcaattat tttgctaaga 841 atgtgaattc aagtgcagct caatattagc ttcagtataa aaactgtaca gatttttgta 901 tagctaataa gattctttgt agagaaaata ctttttttaa aagtgcaggt tgcagctttt 961 tgaggggcta ctacatacag ttagatttta aagcttctga tgttgaatgt ttctaaatat 1021 ttaatggttt ctttaatttc ttgtgtatgg taacagcaaa ctcgtagaaa taagtatcaa 1081 tagcaaattt tgggttttct agaatgttgc attttgtttt ttaaaaaaat tttgtaataa 1141 aattatgtat att // LOCUS PSEHUTH 2249 bp ds-DNA BCT 14-JUL-1990 DEFINITION P.putida histidine ammonia-lyase (hutH) gene, complete cds. ACCESSION M35140 KEYWORDS histidase; histidine ammonia-lyase. SOURCE P.putida (ATCC 12633) DNA, clone pMC4. ORGANISM Pseudomonas putida Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Pseudomonadaceae. REFERENCE 1 (bases 1 to 2249) AUTHORS Consevage,M.W. and Phillips,A.T. TITLE Sequence analysis of the hutH gene encoding histidine ammonia-lyase in Pseudomonas putida JOURNAL J. Bacteriol. 172, 2224-2229 (1990) STANDARD simple staff_review FEATURES from to/span description pept 316 1848 histidine ammonia-lyase (hutH) precursor (gtg start codon (E.C. 4.3.1.3) matp 319 1845 histidine ammonia-lyase (hutH) BASE COUNT 392 a 777 c 699 g 381 t ORIGIN 1 ccatggtggt ggcgtgggca tgggcttctc tcagcactcg ggcatggtca tcgtctgcga 61 cggcaccgat gaagccgccg agcgcatcgc ccgtgtactg accaacgacc cagggactgg 121 cgtcatgcgc cacgccgatg ccggttatga catcgccatc gactgcgcca aggagcaggg 181 cctggacctg ccgatgatca ctggctgatt gccacgcttt ggatcgcacc ccgccagggg 241 tgatactgaa caacaagaag gagcgcgcag gcacccacga acggcctgcg gctccacgcg 301 atttggagta gtaacgtgac cgaactcacc ctcaagcccg gcaccctgac cctggcccag 361 ctgcgcgcga tccatgccgc gcccgtgcgc ctgcaactgg atgccagcgc cgcgccggcc 421 atcgatgcca gcgtcgcctg tgtcgagcag atcattgccg aagaccgcac tgcctacggc 481 atcaacaccg gtttcggcct gctggcctcg acccgcatcg ccagccacga cctggaaaac 541 ctgcagcgct cgctggtgct gtcccacgcc gctggcattg gcgcgccgct ggatgacgat 601 ctggtgcggt tgatcatggt gctgaaaatc aacagcctca gccgtggctt ctccggcatt 661 cgccgcaaag tcatcgatgc gctgatcgcc ctggtcaacg ccgaagtcta cccgcacatc 721 ccgctgaaag gctcggtggg tgcttccggc gacctggcgc cgctggcgac catgtcgctg 781 gtgctgctgg gcgaaggcaa agcccgctac aagggccagt ggctgtcggc caccgaagcc 841 ctggcggttg ccggcctcga accgctgacc ctggctgcca aagagggcct ggccctgctc 901 aacggcaccc aggcgtccac cgcctatgcc ctgcgtggcc tgttctatgc cgaagacttg 961 tacgctgcgg ctatcgcctg cggcggcctg agcgtcgaag ccgtactggg ctcgcgttcg 1021 ccgttcgatg cgcgtattca cgaagcgcgt ggccagcgcg gccagatcga cactgccgcg 1081 tgcttccgcg acctgctggg cgattccagc gaagtgtcgc tgtcgcacaa gaactgcgac 1141 aaggtccagg acccgtactc gctgcgctgc cagccgcagg tcatgggcgc gtgcctgacc 1201 cagctgcgcc aggctgccga ggtgctgggc atcgaagcca acgccgtgtc ggacaacccg 1261 ctggtgttcg ctgccgaggg tgacgtgatt tccggtggca acttccacgc agaaccggtg 1321 gccatggccg ccgacaacct ggccctggcc atcgccgaaa tcggttcgct cagcgagcgc 1381 cgcatctcgc tgatgatgga caagcacatg tcccagctgc cgccgttcct ggtggaaaac 1441 ggtggggtca actccggctt catgatcgcc caggtcaccg ctgccgctct ggccagcgag 1501 aacaaggccc tgtcgcaccc gcacagcgtc gacagcctgc cgacctcggc caaccaggaa 1561 gaccacgtat cgatggcccc ggctgccggc aagcgcctgt gggaaatggc cgaaaacacc 1621 cgtggcgtgc ctgccatcga atggctgggc gcatgccagg gcctggacct gcgcaaaggc 1681 ctgaagactt cggccaagct ggagaaagca cgccaggcgc tgcgcagcga agtggcgcac 1741 tacgaccgtg accgtttctt cgcgccggac atcgaaaagg ctgtggaact gttggccaag 1801 ggtagtttga ccggtttgct gccggcaggt gtgctgccaa gcctgtaatg ccccctgggg 1861 cgctgcgcct tcgcagcaca aggctgctcc tgcaaaagcg cacgcgccct tgtacggaca 1921 gcctgtgctg cgatgggctg catgccagcc ccctgacctc acgaccgcag gacgattcaa 1981 ggacgccgac acgcaacaag ctcaaggtct caagtgcggg ctaaatgccc gccacatccg 2041 tttcatggcc ctcggttctg ccatcggaac cgggctgttc tacggctccg cctcagccat 2101 ccagatggcc ggcccggccg tgctgctggc ttacctgatc ggcggtgccg ctgtgttcat 2161 ggtcatgcgc gccctcggcg aaatggccgt gcacaacccg gtggccggct ccttcggcca 2221 ctacgccagt acctacctcg gccccatgg // LOCUS RATGABAAD 1799 bp ss-mRNA ROD 14-JUL-1990 DEFINITION Rat GABA-A receptor delta subunit mRNA, complete cds. ACCESSION M35162 KEYWORDS GABA-A receptor delta subunit; gamma-aminobutyric acid receptor. SOURCE Rat adult brain, cDNA to mRNA. ORGANISM Rattus rattus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 1799) AUTHORS Zhao,Z.-Y. and Joho,R.H. TITLE Isolation of distantly related members in a multigene family using the polymerase chain reaction technique JOURNAL Biochem. Biophys. Res. Commun. 167, 174-182 (1990) STANDARD simple staff_review REFERENCE 2 (sites; corrections to figure 3 for [1]) AUTHORS Zhao,Z.-Y. and Joho,R.H. TITLE Isolation of distantly related members in a multigene family using the polymerase chain reaction technique JOURNAL Biochem. Biophys. Res. Commun. 168, 887-887 (1990) STANDARD simple staff_review FEATURES from to/span description pept 50 1399 GABA-A receptor delta subunit BASE COUNT 385 a 512 c 500 g 402 t ORIGIN 1 bp upstream of EcoRI site. 1 gaattccctg gggctagccg gacctgtccc gcgcacagcc cgcaaggcca tggacgttct 61 gggctggctg ctgctgccgc tccttctgct gtgcacgcag ccgcaccatg gcgccagagc 121 aatgaatgac attggggact acgtgggctc caacctggag atatcctggc tccccaacct 181 ggatggacta atggagggct acgcccgaaa cttccgacca ggcattggag gtcctccagt 241 gaatgtggcg cttgccctag aggtggccag cattgaccac atctcagaag caaatatgga 301 atacaccatg acagtgttcc tgcaccagag ctggcgagac agcaggctgt cctacaacca 361 taccaacgag accctgggcc tggatagccg cttcgtggac aagctgtggc tccctgacac 421 cttcattgtg aatgccaagt ctgcctggtt ccatgatgtg accgtggaaa acaagcttat 481 ccgcctacag cccgacggtg tgattttata cagcatccgc atcacctcca cagtggcctg 541 tgacatggac cttgccaagt accccatgga cgagcaggag tgcatgctgg acctggagag 601 ctatggctac tcttctgagg acattgtcta ttattggtca gaaaaccagg agcagatcca 661 cgggctggac aggctgcaac tggcccagtt cactatcacc agttaccgct tcaccacgga 721 gctgatgaac ttcaaatcag ctggccagtt ccctcgactc agcttacact tccagcttcg 781 gaggaaccgg ggtgtctaca tcatccagtc ttacatgccc tctgtcctcc tggttgccat 841 gtcctgggtc tccttctgga ttagccaagc agcagtgcct gccagagtat ctctaggcat 901 caccactgtg ctgacaatga ccacactcat ggttagtgcc cgctcctccc tcccgcgggc 961 ttctgctatc aaggctctgg atgtgtattt ctggatctgc tatgtcttcg tgtttgctgc 1021 cctggtggag tatgcatttg cccacttcaa tgctgactac aggaagaaac ggaaagccaa 1081 ggtcaaggtc acgaagccaa gggcagagat ggacgtgagg aacgccattg tcctcttctc 1141 cctctctgct gctggggtca gccaggagtt ggctatctcc cgccgtcaag gccgggtccc 1201 tgggaacctc atgggttcct ataggtctgt agaagtggag gcaaagaagg agggggggtc 1261 ccgcccagga ggcccaggag gcatccgttc cagactcaaa cccatcgatg cagacaccat 1321 cgacatctat gcccgcgctg tgttcccggc agcctttgca gcagtcaaca tcatctactg 1381 ggcggcgtat accatgtgac ggcagtgcgt acaccacatg acggctttac atgtaccacg 1441 tgcggtggtt gccacaaact cctggaggaa cagccctctc cgtggatccc aaagtgggac 1501 agagaggagc gtccgctccc tgggctttac ctcaattttg ctctaccagc ctgcacttcc 1561 tgtagatgga gcaggatggg ctttgcacag agggccatta gttggcctgt ccctgaagct 1621 gagcctgcac taaggtgccc ttgggagctg tgaagtttta gactcaggat gggcctgatt 1681 tctgggcctt tgctctgtgg gatcaggatc agagagaaag agagtagggg tggatgacca 1741 ctctcatggt gaactggagg agaagggcat tatgaggcct ctttggtccc agggaattc // LOCUS RPVCTSI 2195 bp ds-DNA VRL 14-JUL-1990 DEFINITION Raccoonpox virus Sal1 end fragment DNA. ACCESSION M23018 KEYWORDS . SOURCE Raccoonpox virus (strain V71-I-85A) DNA, passed in human tk- 143 cells. ORGANISM Raccoonpox virus Viridae; ds-DNA enveloped viruses; Poxvirinae; Orthopoxvirus. REFERENCE 1 (bases 1 to 2195) AUTHORS Parsons,B.L. and Pickup,D.J. TITLE Tandemly repeated sequences are present at the ends of the DNA of Raccoonpox virus JOURNAL Virology 161, 45-53 (1987) STANDARD simple staff_review BASE COUNT 880 a 225 c 373 g 717 t ORIGIN 1 ataattaatt taattttaaa tattttattt aatgtctaga aaaaaatgtg taacccagac 61 tgtagagaac tctctaatgt ggtgaaaaaa tcaatccctt aatcgagact atcagaaaga 121 gacgaaacat tttttagtga gacattagaa agagaaagag acgaaacatt ttttagtgag 181 acattagaaa gagaaagaga cgaaacattt ttagtgagac attagaaaga gaaatatttt 241 tttgtaaaac ttttttatga ctccattaga aagagatgaa acatttttag tgagacatta 301 gaaagagaaa tatttttttg taaaactttt ttatgactcc attagaaaga gacgaaacat 361 ttttagtgag acattagaaa gagaaatatt tttttgtaaa acttttttat gactccatta 421 gaaagagacg aaacattttt agtgagacat tagaaagaga aatatttttt tgtaaaacgt 481 tttcttaggt caaaagtcaa tgttttaaaa tatggactag aatatgtctg tgtaaaaact 541 tgtgaaattt tgtaccaaag aaaaacgtga gcagtatccc ctacatggat tttactagat 601 catttatgta acaaaaaatt atattattcg tgtacgttaa ttataaaaat attgtatgat 661 atatcgaggt ccattagcaa aaggtgttga tagacaatta atcgtgtgtg atgtaaccat 721 gattattaca gcagaaagag taagagtatt ttttgtaaaa cttttttatg actccattag 781 aaagagaaag agacgaaaca tttttagtga gacattagaa agagacgaaa catttttttt 841 gtaaaacttt tttatgactc cattagaaag agaaagagac gaaacatttt tagtgagaca 901 ttagaaagag aaatattttt tgtaaaactt tttttatgac tccattagaa agagaaatat 961 tttttgtaaa acttttttat gactccatta gaaagagacg aaacattttt attgagacat 1021 tagaaagaga aagagacgaa acattttttt gtaaaacttt tttatgactc cattagaaag 1081 agaaagagac gaaacatttt tagtgataca ttagaaagag aaatattttt tgtaaaactt 1141 ttttatgact ccattagaaa gagatgaaac atttttagtg agacattaga aagagaaaga 1201 gacgaaacat tttttttgta aaactttttt tatgactcca ttagaaagag aaagagacga 1261 aacattttta gtgagacatt agaaagataa agagacgaaa catttttttt gtaaaacttt 1321 ttttatgact ccattagaaa gagaaagaga cgaaacattt ttagtgagac attagaaaga 1381 gacgaaacat ttttagtgag acattagaaa gataaagaga cgaaacattt tttttgtaaa 1441 acttttttat gactccatta gaaagagaaa gagacgaaac atttttagtg agacattaga 1501 aagagacgaa acattttttt tgtaaaactt tttttatgac tccattagaa agagacgaaa 1561 catttttagt gagacattag aaagataaag agacgaaaca ttttttttgt aaaacttttt 1621 tatgactcca ttagaaagag aaagagacga aacattttta gtgagacatt agaaagagaa 1681 atattttttg taaaactttt ttatgactcc attagaaaga gatgaaacat ttttagtgag 1741 acattagaaa gagaaatatt tttttgtaaa acttttttat gactccatta gaaagagatg 1801 aaacattttt agtgagacat tagaaagaga aagagacgaa acattttttt ttgtaaaact 1861 tttttatgac tccattagaa agagatgaaa catttttagt gagacattag aaagagaaag 1921 agacgaaaca tttttttttt gtaaaacttt ttatgactcc attagaaaga gatgaaacat 1981 ttttagtgag acattagaaa gagaaagagt gacacgataa aacttgttac ttattgttag 2041 agaaaaagaa tcgactttaa tgcgaataca tttttaaatt tcaaaccgga aataagtgtc 2101 cggtacactt ttaaaattaa acgtatgtcg gaatccacaa ctaatcctga ggattacacg 2161 attcagacac acgatttaag ttttgtgtcg tcgac // LOCUS STYFLICDA 612 bp ds-DNA BCT 14-JUL-1990 DEFINITION S.muenchen phase 1 flagellin (antigen-determining region) (fliCd) gene, partial cds. ACCESSION M34824 KEYWORDS phase 1 flagellin antigen d. SOURCE S.muenchen (strain RKS3121) DNA. ORGANISM Salmonella typhimurium Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Facultatively anaerobic rods; Enterobacteriaceae. REFERENCE 1 (bases 1 to 612) AUTHORS Smith,N.H., Beltran,P. and Selander,R.K. TITLE Recombination of Salmonella phase 1 flagellin genes generates new serovars JOURNAL J. Bacteriol. 172, 2209-2216 (1990) STANDARD simple staff_review FEATURES from to/span description pept < 1 > 612 phase 1 flagellin (fliCd) (AA at 1) BASE COUNT 207 a 114 c 137 g 154 t ORIGIN 1 attgatttaa aagaaattag ctctaaaaca ctgggacttg ataagcttaa tgtccaggat 61 gcctacaccc cgaaagaaac tgctgtaacc gttgataaaa ctacctataa aaatggtaca 121 gatactatta cagcccagag caatactgat atccaaactg caattggcgg tggtgcaacg 181 ggggttactg gggctgatat caaatttaaa gatggtcaat actatttaga tgttaaaggc 241 ggtgcttctg ctggtgttta taaagccact tatgatgaaa ctacaaagaa agttaatatt 301 gatacgactg ataaaactcc gttagcaact gcggaagcta cagctattcg gggaacggcc 361 actataaccc acaaccaaat tgctgaagta acaaaagagg gtgttgatac gaccacagtt 421 gcggctcaac ttgctgctgc aggggttact ggtgccgata aggacaatac tagccttgta 481 aaactatcgt ttgaggataa aaacggtaag gttattgatg gtggctatgc agtgaaaatg 541 ggcgacgatt tctatgccgc tacatatgat gagaaacagg tacaattact gctaaacaac 601 cactatacag at // LOCUS STYFLICI 585 bp ds-DNA BCT 14-JUL-1990 DEFINITION S.typhimurium phase 1 flagellin (antigen-determining region) (fliCi) gene, partial cds. ACCESSION M34822 KEYWORDS phase 1 flagellin antigen i. SOURCE S.typhimurium (strain RKS3333) DNA. ORGANISM Salmonella typhimurium Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Facultatively anaerobic rods; Enterobacteriaceae. REFERENCE 1 (bases 1 to 585) AUTHORS Smith,N.H., Beltran,P. and Selander,R.K. TITLE Recombination of Salmonella phase 1 flagellin genes generates new serovars JOURNAL J. Bacteriol. 172, 2209-2216 (1990) STANDARD simple staff_review FEATURES from to/span description pept < 1 > 585 phase 1 flagellin (fliCi) (AA at 1) BASE COUNT 180 a 108 c 142 g 155 t ORIGIN 1 atcgatctga agcagatcaa ctctcagacc ctgggtctgg atacgctgaa tctgcaacaa 61 aaatataagg tcagcgatac ggctgcaact gttacaggat atgccgatac tacgattgct 121 ttagacaata gtacttttaa agcctcggct actggtcttg gtggtactga ccagaaaatt 181 gatggcgatt taaaatttga tgatacgact ggaaaatatt acgccaaagt taccgttacg 241 gggggaactg gtaaagatgg ctattatgaa gtttccgttg ataagacgaa cggtgaggtg 301 actcttgctg gcggtgcgac ttccccgctt acaggtggac tacctgcgac agcaactgag 361 gatgtgaaaa atgtacaagt tgcaaatgct gatttgacag aggctaaagc cgcattgaca 421 gcagcaggtg ttaccggcac agcatctgtt gttaagatgt cttatactga taataacggt 481 aaaactattg atggtggttt agcagttaag gtaggcgatg attactattc tgcaactcaa 541 aataaagatg gttccataag tattaatact acgaaataca ctgca // LOCUS STYFLICR 579 bp ds-DNA BCT 14-JUL-1990 DEFINITION S.heidelberg phase 1 flagellin (antigen-determining region) (fliCr) gene, partial cds. ACCESSION M34823 KEYWORDS phase 1 flagellin antigen r. SOURCE S.heidelberg (strains RKS539) DNA. ORGANISM Salmonella typhimurium Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Facultatively anaerobic rods; Enterobacteriaceae. REFERENCE 1 (bases 1 to 579) AUTHORS Smith,N.H., Beltran,P. and Selander,R.K. TITLE Recombination of Salmonella phase 1 flagellin genes generates new serovars JOURNAL J. Bacteriol. 172, 2209-2216 (1990) STANDARD simple staff_review FEATURES from to/span description pept < 1 > 579 phase 1 flagellin (fliCr) (AA at 1) BASE COUNT 181 a 118 c 126 g 154 t ORIGIN 1 atcgatctga agcagatcaa ctctcagacc ctgggtctgg atacgctgaa tgttcaacaa 61 aaatataagg tcagcgatac ggctgcaact gtcactggct atacagattc tgctactgct 121 attgacaaat ctacgtttgc tgcatcagca actaccttag gtggtactcc tgctattact 181 ggtgatctga agtttgatga tactactgga aaatattacg ctgatgtttc aggtactacg 241 gctaaagatg gtgtttatga agtaacagtt gcagccgatg gaaaagtcac tttaactggc 301 acaccaacag gaccaattac tgctggcttc ccttcaactg caacaaaaga tgttaaacaa 361 actcagcaag aaaacgctga tttgacagag gccaaagccg cattgacagc agcgggtgtt 421 gcagcggccg gcacagcatc tgttgttaag atgtcttata ctgataataa cggtaaaact 481 attgatggtg gtttagcagt taaggtaggc gatgattact attctgcaac tcaaaataaa 541 gatggttcca taagtattaa tactacgaaa tacactgca // LOCUS HUMFGF1A 3343 bp ss-mRNA PRI 14-JUL-1990 DEFINITION Human fibroblast growth factor (FGF) receptor-1 mRNA, complete cds. ACCESSION M34641 KEYWORDS FGF receptor-1; fibroblast growth factor receptor; transmembrane glycoprotein; tyrosine kinase. SOURCE Human teratocarcinoma cell line Tera-2, clone 13, cDNA to mRNA, clone 1:1. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 3343) AUTHORS Wennstroem,S., Sandstroem,C. and Claesson-Welsh,L. TITLE cDNA cloning and expression of a human FGF receptor which binds acidic and basic FGF JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable [or printed] sequence for [1] kindly submitted by L.Claesson-Welsh, 25-MAY-1990. Author address: L.Claesson-Welsh Ludwig Institute for Cancer Research Biomedical Center Box 595 S-751 24 Uppsala SWEDEN FEATURES from to/span description pept 10 2472 FGF receptor-1 precursor sigp 10 72 FGF receptor-1 signal peptide matp 73 2469 FGF receptor-1 BASE COUNT 766 a 957 c 911 g 709 t ORIGIN 1 bp upstream of EcoRI site. 1 gaattcggga tgtggagctg gaagtgcctc ctcttctggg ctgtgctggt cacagccaca 61 ctctgcaccg ctaggccgtc cccgaccttg cctgaacaag cccagccctg gggagcccct 121 gtggaagtgg agtccttcct ggtccacccc ggtgacctgc tgcagcttcg ctgtcggctg 181 cgggacgatg tgcagagcat caactggctg cgggacgggg tgcagctggc ggaaagcaac 241 cgcacccgca tcacagggga ggaggtggag gtgcaggact ccgtgcccgc agactccggc 301 ctctatgctt gcgtaaccag cagcccctcg ggcagtgaca ccacctactt ctccgtcaat 361 gtttcagatg ctctcccctc ctcggaggat gatgatgatg atgatgactc ctcttcagag 421 gagaaagaaa cagataacac caaaccaaac cccgtagctc catattggac atccccagaa 481 aagatggaaa agaaattgca tgcagtgccg gctgccaaga cagtgaagtt caaatgccct 541 tccagtggga ccccaaaccc cacactgcgc tggttgaaaa atagcaaaga attcaaacct 601 gaccacagaa ttggaggcta caaggtccgt tatgccacct ggagcatcat aatggactct 661 gtggtgccct ctgacaaggg caactacacc tgcattgtgg agaatgagta cggcagcatc 721 aaccacacat accagctgga tgtcgtggag cggtcccctc accggcccat cctgcaagca 781 gggttgcccg ccaacaaaac agtggccctg ggtagcaacg tggagttcat gtgtaaggtg 841 tacagtgacc cgcagccgca catccagtgg ctaaagcaca tcgaggtgaa tgggagcaag 901 attggcccag acaacctgcc ttatgtccag atcttgaaga ctgctggagt taataccacc 961 gacaaagaga tggaggtgct tcacttaaga aatgtctcct ttgaggacgc aggggagtat 1021 acgtgcttgg cgggtaactc tatcggactc tcccatcact ctgcatggtt gaccgttctg 1081 gaagccctgg aagagaggcc ggcagtgatg acctcgcccc tgtacctgga gatcatcatc 1141 tattgcacag gggccttcct catctcctgc atggtggggt cggtcatcgt ctacaagatg 1201 aagagtggta ccaagaagag tgacttccac agccagatgg ctgtgcacaa gctggccaag 1261 agcatccctc tgcgcagaca ggtaacagtg tctgctgact ccagtgcatc catgaactct 1321 ggggttcttc tggttcggcc atcacggctc tcctccagtg ggactcccat gctagcaggg 1381 gtctctgagt atgagcttcc cgaagaccct cgctgggagc tgcctcggga cagactggtc 1441 ttaggcaaac ccctgggaga gggctgcttt gggcaggtgg tgttggcaga ggctatcggg 1501 ctggacaagg acaaacccaa ccgtgtgacc aaagtggctg tgaagatgtt gaagtcggac 1561 gcaacagaga aagacttgtc agacctgatc tcagaaatgg agatgatgaa gatgatcggg 1621 aagcataaga atatcatcaa cctgctgggg gcctgcacgc aggatggtcc cttgtatgtc 1681 atcgtggagt atgcctccaa gggcaacctg cgggagtacc tgcaggcccg gaggccccca 1741 gggctggaat actgctacaa ccccagccac aacccagagg agcagctctc ctccaaggac 1801 ctggtgtcct gcgcctacca ggtggcccga ggcatggagt atctggcctc caagaagtgc 1861 atacaccgag acctggcagc caggaatgtc ctggtgacag aggacaatgt gatgaagata 1921 gcagactttg gcctcgcacg ggacattcac cacatcgact actataaaaa gacaaccaac 1981 ggccgactgc ctgtgaagtg gatggcaccc gaggcattat ttgaccggat ctacacccac 2041 cagagtgatg tgtggtcttt cggggtgctc ctgtgggaga tcttcactct gggcggctcc 2101 ccataccccg gtgtgcctgt ggaggaactt ttcaagctgc tgaaggaggg tcaccgcatg 2161 gacaagccca gtaactgcac caacgagctg tacatgatga tgcgggactg ctggcatgca 2221 gtgccctcac agagacccac cttcaagcag ctggtggaag acctggaccg catcgtggcc 2281 ttgacctcca accaggagta cctggacctg tccatgcccc tggaccagta ctcccccagc 2341 tttcccgaca cccggagctc tacgtgctcc tcaggggagg attccgtctt ctctcatgag 2401 ccgctgcccg aggagccctg cctgccccga cacccagccc agcttgccaa tggcggactc 2461 aaacgccgct gactgccacc cacacgccct ccccagactc caccgtcagc tgtaaccctc 2521 acccacagcc cctgctgggc ccaccacctg tccgtccctg tcccctttcc tgctggcagg 2581 agccggctgc ctaccagggg ccttcctgtg tggcctgcct tcaccccact cagctcacct 2641 ctccctccac ctcctctcca cctgctggtg agaggtgcaa agaggcagat ctttgctgcc 2701 agccacttca tcccctccca gatgttggac caacacccct ccctgccaca gcatcgcctg 2761 gagggcaggg agtgggagcc aatgaacagg catgcaagtg agagcttcct gagctttctc 2821 tgtcggtttg gtctgttttg ccttcaccca taagcccctc gcactctggt ggcaggtgcc 2881 ttgtcctcag ggctacagca gtagggaggt cagtgcttcg tgcctcgatt gaaggtgacc 2941 tctgccccag ataggtggtg cagtggctta ttaattccga tactagtttg ctttgctgac 3001 caaatgcctg gtaccagagg atggtgaggc gaaggccagg ttgggggcag tgttgtggcc 3061 ctggggccag ccccaaactg ggggctctgt atatagctat gaagaaaaca caaagtgtat 3121 aaatctgagt atatatttac atgtcttttt aaaagggtcg ttaccagaga tttacccatc 3181 gggtaagatg ctcctggtgg ctgggaggca tcagttgcta tatattaaaa acaaaaaaga 3241 aaaaaaagga aaatgttttt aaaaaggtca tatatttttt gctacttttg ctgttttatt 3301 tttttaaatt atgttctaaa ctcgtgccgc tcgtgccgaa ttc // LOCUS BSUSPCR 2852 bp ds-DNA BCT 14-JUL-1990 DEFINITION B.subtilis spectinomycin resistance (spc) genes, complete cds. ACCESSION M31102 KEYWORDS spectinomycin resistance. SOURCE B.subtilis (strain 1A241) DNA. ORGANISM Bacillus subtilis Prokaryota; Bacteria; Firmicutes; Endospore-forming rods and cocci; Bacillaceae. REFERENCE 1 (bases 1 to 2852) AUTHORS Yoshikawa,H. and Doi,R.H. TITLE The nucleotide sequence of the Bacillus subtilis spectinomycin resistance gene region JOURNAL Nucleic Acids Res. 18, 1647-1647 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by R.H.Doi, 04-JAN-1990. FEATURES from to/span description pept 52 549 spc ORF1 pept 563 742 spc ORF2 pept 773 1213 spc ORF3 pept 1215 2450 spc ORF4 (ttg start codon) pept 2505 > 2852 spc ORF5 BASE COUNT 830 a 535 c 673 g 814 t ORIGIN 1 tgatgcgctc gtgaagctgg acttaaattt taataaaaga aggagggaca catgattatg 61 cgtcgtattg acccaagcaa attagagtta gaagaacgct tagttacggt taaccgcgta 121 gcgaaaggtg gtcgtcgttt ccgcttcgca gctctagtcg ttgtcggtga caaaaacgga 181 cacgtaggat tcggtactgg taaagcacaa gaagtaccag aagcgattcg caaagctgtt 241 gaagatgcga aaaagaattt gattgaagta ccaatggttg gaactacaat tccacacgaa 301 atcatcggac gtttcggtgc aggtaacatc ttgttaaaac ctgcttctga aggtactgga 361 gttatcgctg gaggccctgt acgtgcggta cttgagctag ctggtgtagc tgatatcctt 421 tctaagtctt taggttctaa cacaccgatc aacatgattc gtgcaacact tcaaggttta 481 agtgaactta aacgtgctga agacgttgcg aagcttcgtg gaaaatctgt agaagaactg 541 ttaggataag gagggaacat taatggctaa attagaaatt accctcaaac gaagtgtaat 601 cggtcgcccg gaagaccaac gcgttactgt aagaacactt ggtttaaaga aaacaaacca 661 aactgttgtt catgaagaca acgctgcgat ccgtggtatg attaataaag tatctcattt 721 agtttctgtt aaagaacaat aaaaatattt gatcggataa ggaggtgtcc taatgaaact 781 tcatgaatta aaaccttcag aaggttcacg caaaacgcgt aatcgcgtag gtcgtggtat 841 tggttctggc aacggtaaaa cagctggtaa aggtcacaaa ggtcaaaacg ctcgttctgg 901 cggcggtgta cgccctggat tcgagggggg acaaatgcct ttattccaac gtcttcctaa 961 acgtggtttc actaacatca accgcaagga atacgctgta gtcaacctag acaaattgaa 1021 cggttttgca gaaggaacgg aagtcactcc tgaacttctt ctagagactg gtgttattag 1081 caaacttaat gcaggagtaa agattcttgg caacggtaaa ttagagaaaa aattaactgt 1141 aaaagccaat aaattctctg cttctgcgaa agaagctgtt gaagctgctg gcggtacagc 1201 tgaggtgatc taacttgttt aaaacaatct ccaactttat gcgtgtgagt gatatcagga 1261 ataaaatcat attcacttta ctcatgctta tcgtctttcg cataggtgcg tttattcctg 1321 tgccttacgt taacgctgaa gcgttacagg cacagtctca aatgggtgtt tttgatctcc 1381 ttaatacatt tggcggcggt gcgctttacc aattttccat tttcgcaatg ggaattactc 1441 cttatatcac ggcttcgatc atcattcagc tgcttcagat ggatgtggta ccgaagttta 1501 ccgagtggtc taagcaaggt gaagttggcc gccgtaaatt agctcagttc acaaggtact 1561 ttacgattgt gcttggtttc atccaagcgt taggtatgtc atatggattc aacaatctgg 1621 caaacggtat gctgatcgaa aaatccggtg tatcgacata tcttatcatt gctttagtgc 1681 tcactggcgg aactgccttt ttaatgtggc ttggggaaca aattacttct catggagtag 1741 gcaacggaat atcgatcatt atcttcgcgg ggattgtgtc tagtattcca aaaacaattg 1801 ggcaaatata tgagactcaa tttgtcggca gcaacgatca gttgtttatt catattgtga 1861 aagtcgcact tcttgtgatt gcgattttag cagttattgt tggagttatt ttcattcagc 1921 gaggtcagtc tacacacctt ccattgaaag tgaatcctgc aggggttatt ccggtaatct 1981 ttgcggttgc gtttttgata acgccgcgga cgatcgcgtc attctttgga acaaacgatg 2041 tgacaaagtg gattcaaaac aactttgata atacgcatcc ggtgggtatg gcgatatatg 2101 ttgcgttgat tattgccttt acgtactttt atgcttttgt acaggtaaac cctgaacaaa 2161 tggctgataa ccttaaaaaa cagggtggct atatcccggg ggttcgtcca gggaaaatga 2221 ctcaagatag aattacgagc attttgtatc gacttacgtt tgtgggttct atattcttag 2281 ccgtgatttc cattcttcct atctttttca ttcaattcgc tggattgcct caaagtgcac 2341 aaattggcgg aacatctttg ttaattgttg tcggggtagc cttggagaca atgaaacaac 2401 tagaaagcca gttggtgaaa cgaaactacc gtggatttat gaaaaactag aggaaatgga 2461 tttatccatt ccctcttaat aaagagagga cgggggaaat ccgaatgaac ttagtcttaa 2521 tggggcttcc tggtgccggt aaaggcacac agggcgaacg aattgttgag gattatggga 2581 ttcctcatat ctcaacagga gatatgttcc gtgctgctat gaaagaagaa acaccactcg 2641 gactcgaagc aaaatcttat attgataagg gagagctcgt acctgacgaa gtcacaatcg 2701 gaattgtaaa agagagactt ggcaaagatg attgtgaaag aggttttctt ctggacggat 2761 ttccgcgaac agtcgctcaa gccgaagctc ttgaagaaat tcttgaggaa tacggcaagc 2821 cgattgatta tgtcattaac attgaagtcg ac // LOCUS RATHDNFNT 1020 bp ss-mRNA ROD 14-JUL-1990 DEFINITION Rat neurotrophin-3 (HDNF/NT-3) mRNA, complete cds. ACCESSION M34643 KEYWORDS neurotrophic factor; neurotrophin-3. SOURCE Rat adult (strain Sprague-Dawley) hippocampus, cDNA to mRNA. ORGANISM Rattus rattus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 1020) AUTHORS Ernfors,P., Ibanez,C.F., Ebendal,T., Olson,L. and Persson,H. TITLE Molecular cloning and neurotrophic activities of a protein with structural similarities to b-nerve growth factor: Developmental and topographical expression JOURNAL Proc. Natl. Acad. Sci. U.S.A. (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by P.P.Ernfors, 18-MAY-1990. FEATURES from to/span description pept 73 849 neurotrophin-3 (HDNF/NT-3) precursor sigp 73 120 neurotrophic factor/neurotrophin-3 (HDNF/NT-3) signal peptide matp 490 846 neurotrophic factor/neurotrophin-3 BASE COUNT 320 a 229 c 226 g 245 t ORIGIN 1 gtcgacgtcc ctggaaatag tcatacggat gccatggtta cttctgccac gatcttacag 61 gtgaacaagg tgatgtccat cttgttttat gtgatatttc ttgcttatct ccgtggcatc 121 caaggcaaca acatggatca aaggagtttg ccagaagact ctctcaattc cctcattatc 181 aagttgatcc aggcggatat cttgaaaaac aagctctcca agcagatggt agatgttaag 241 gaaaattacc agagcaccct gcccaaagca gaggcaccca gagaaccaga gcagggagag 301 gccaccaggt cagaattcca gccgatgatt gcaacagaca cagaactact acggcaacag 361 agacgctaca attcaccccg ggtcctgctg agtgacagca cccctttgga gccccctccc 421 ttatatctaa tggaagatta tgtgggcaac ccggtggtaa ccaatagaac atcaccacgg 481 aggaaacgct atgcagagca taagagtcac cgaggagagt actcagtgtg tgacagtgag 541 agcctgtggg tgaccgacaa gtcctcagcc attgacattc ggggacacca ggttacagtg 601 ttgggagaga tcaaaaccgg caactctcct gtgaaacaat atttttatga aacgaggtgt 661 aaagaagcca ggccagtcaa aaacggttgc agggggattg atgacaaaca ctggaactct 721 cagtgcaaaa cgtcgcaaac ctacgtccga gcactgactt cagaaaacaa caaactcgta 781 ggctggcgct ggatacgaat agacacttcc tgtgtgtgtg ccttgtcaag aaaaatcgga 841 agaacatgaa ttggcatctg tccccacata taaattatta ctttaaatta tatgatatgc 901 atgtagcata taaatgttta tattgttttt atatattata agttgacctt tatttattaa 961 acttcagcaa cccttacagt atataagctt ttttcataat cgggctgctc aaaaaaaaaa // LOCUS ABCRRAA 118 bp ss-rRNA RNA 14-JUL-1990 DEFINITION Acetobacter sp. (strain MB 58) 5S ribosomal RNA, complete sequence. ACCESSION M34766 KEYWORDS 5S ribosomal RNA. SOURCE Acetobacter sp. (strain MB 58) rRNA. ORGANISM Acetobacter sp. Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Aerobic rods and cocci; Azotobacteraceae. REFERENCE 1 (bases 1 to 118) AUTHORS Bulygina,E.S., Galchenko,V.F., Govorukhina,N.I., Netrusov,A.I., Nikitin,D.I., Trotsenko,Y.A. and Chumakov,K.M. TITLE Taxonomic studies of methylotrophic bacteria by 5S ribosomal RNA sequencing JOURNAL J. Gen. Microbiol. 136, 441-446 (1990) STANDARD simple staff_entry FEATURES from to/span description rRNA 1 118 5S ribosomal RNA BASE COUNT 27 a 40 c 32 g 17 t 2 others ORIGIN 1 gatctggtgg ccatggcggg agcaaatcag ccgatcccat cccgaactcg gccgtcaaat 61 gccccagcgc ccatgatact ctgcctcaag gcacggaaaa gtcggtcgcc gccagayy // LOCUS ABCRRAB 118 bp ss-rRNA RNA 14-JUL-1990 DEFINITION Acetobacter sp. (strain 914) 5S ribosomal RNA, complete sequence. ACCESSION M34767 KEYWORDS 5S ribosomal RNA. SOURCE Acetobacter sp. (strain 914) rRNA. ORGANISM Acetobacter sp. Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Aerobic rods and cocci; Azotobacteraceae. REFERENCE 1 (bases 1 to 118) AUTHORS Bulygina,E.S., Galchenko,V.F., Govorukhina,N.I., Netrusov,A.I., Nikitin,D.I., Trotsenko,Y.A. and Chumakov,K.M. TITLE Taxonomic studies of methylotrophic bacteria by 5S ribosomal RNA sequencing JOURNAL J. Gen. Microbiol. 136, 441-446 (1990) STANDARD simple staff_entry FEATURES from to/span description rRNA 1 118 5S ribosomal RNA BASE COUNT 27 a 42 c 29 g 18 t 2 others ORIGIN 1 nntctgatgg ccatggcggg agcaaatcac ccgatcccat cccgaactcg gccgtcaaat 61 gccccagcgc ccatgatact ctgcctcaag gcacggaaaa gtcggtcgcc gccagatc // LOCUS ANCRRAA 117 bp ss-rRNA RNA 14-JUL-1990 DEFINITION A.aquaticus 5S ribosomal RNA, complete sequence. ACCESSION M34775 KEYWORDS 5S ribosomal RNA. SOURCE A.aquaticus (strain Ma) rRNA. ORGANISM Ancylobacter aquaticus Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Budding and/or appendaged bacteria; Nonprosthecate bacteria. REFERENCE 1 (bases 1 to 117) AUTHORS Bulygina,E.S., Galchenko,V.F., Govorukhina,N.I., Netrusov,A.I., Nikitin,D.I., Trotsenko,Y.A. and Chumakov,K.M. TITLE Taxonomic studies of methylotrophic bacteria by 5S ribosomal RNA sequencing JOURNAL J. Gen. Microbiol. 136, 441-446 (1990) STANDARD simple staff_entry FEATURES from to/span description rRNA 1 117 5S ribosomal RNA BASE COUNT 23 a 40 c 32 g 22 t ORIGIN 1 cctggtggcc attgcgagga gccccaaccc gatcccatcc cgaactcggc cgttaaactc 61 ctcagcgccg atggtactat gtctcaagac ctgggagagt aggtcgctgc caggcct // LOCUS BLTRRAA 118 bp ss-rRNA RNA 14-JUL-1990 DEFINITION B.viscosus 5S ribosomal RNA, complete sequence. ACCESSION M34774 KEYWORDS 5S ribosomal RNA. SOURCE B.viscosus (strain D7) rRNA. ORGANISM Blastobacter viscosus Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Budding and/or appendaged bacteria; Nonprosthecate bacteria. REFERENCE 1 (bases 1 to 118) AUTHORS Bulygina,E.S., Galchenko,V.F., Govorukhina,N.I., Netrusov,A.I., Nikitin,D.I., Trotsenko,Y.A. and Chumakov,K.M. TITLE Taxonomic studies of methylotrophic bacteria by 5S ribosomal RNA sequencing JOURNAL J. Gen. Microbiol. 136, 441-446 (1990) STANDARD simple staff_entry FEATURES from to/span description rRNA 1 118 5S ribosomal RNA BASE COUNT 23 a 38 c 35 g 22 t ORIGIN 1 gcctggtggc tatggcgagg agtccgaacc cgatcccatc ccgaactcgg ccgttaaact 61 cctcagcgcc aatggtactt cgtctcaaga cgcgggagag taggtcgctg ccaggcct // LOCUS HYVRRAA 119 bp ss-rRNA RNA 14-JUL-1990 DEFINITION H.vulgare 5S ribosomal RNA, complete sequence. ACCESSION M34768 KEYWORDS 5S ribosomal RNA. SOURCE H.vulgare (strain NP-160) rRNA. ORGANISM Hyphomicrobium vulgare Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Budding and/or appendaged bacteria; Prosthecate bacteria. REFERENCE 1 (bases 1 to 119) AUTHORS Bulygina,E.S., Galchenko,V.F., Govorukhina,N.I., Netrusov,A.I., Nikitin,D.I., Trotsenko,Y.A. and Chumakov,K.M. TITLE Taxonomic studies of methylotrophic bacteria by 5S ribosomal RNA sequencing JOURNAL J. Gen. Microbiol. 136, 441-446 (1990) STANDARD simple staff_entry FEATURES from to/span description rRNA 1 119 5S ribosomal RNA BASE COUNT 22 a 34 c 37 g 26 t ORIGIN 1 gacctggtga ttatggcggg tggtctgcac ccgatctcat tccgaactcg gccgtgaaac 61 cgcccagcgc caatggtact tcgtcttaag acgcgggaga gtaggtcgtc gccaggtct // LOCUS HYVRRAB 119 bp ss-rRNA RNA 14-JUL-1990 DEFINITION Hyphomicrobium sp. 5S ribosomal RNA, complete sequence. ACCESSION M34769 KEYWORDS 5S ribosomal RNA. SOURCE Hyphomicrobium sp. (strain G10) rRNA. ORGANISM Hyphomicrobium sp. Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Budding and/or appendaged bacteria; Prosthecate bacteria. REFERENCE 1 (bases 1 to 119) AUTHORS Bulygina,E.S., Galchenko,V.F., Govorukhina,N.I., Netrusov,A.I., Nikitin,D.I., Trotsenko,Y.A. and Chumakov,K.M. TITLE Taxonomic studies of methylotrophic bacteria by 5S ribosomal RNA sequencing JOURNAL J. Gen. Microbiol. 136, 441-446 (1990) STANDARD simple staff_entry FEATURES from to/span description rRNA 1 119 5S ribosomal RNA BASE COUNT 20 a 34 c 39 g 26 t ORIGIN 1 gacctggtga ttatggcggg gtggctgcac ccgatcccat tccgaactcg gccgtgaaac 61 gcccctgcgc cgatggtact tcgtcttaag acgcgggaga gtaggtcgtt gccaggtct // LOCUS MAGRRAA 119 bp ss-rRNA RNA 14-JUL-1990 DEFINITION M.agile 5S ribosomal RNA, complete sequence. ACCESSION M34759 KEYWORDS 5S ribosomal RNA. SOURCE M.agile (strain A20) rRNA. ORGANISM Methylomonas agile Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Methylococcaceae. REFERENCE 1 (bases 1 to 119) AUTHORS Bulygina,E.S., Galchenko,V.F., Govorukhina,N.I., Netrusov,A.I., Nikitin,D.I., Trotsenko,Y.A. and Chumakov,K.M. TITLE Taxonomic studies of methylotrophic bacteria by 5S ribosomal RNA sequencing JOURNAL J. Gen. Microbiol. 136, 441-446 (1990) STANDARD simple staff_entry FEATURES from to/span description rRNA 1 119 5S ribosomal RNA BASE COUNT 29 a 29 c 36 g 22 t 3 others ORIGIN 1 gcttggtggc catagcgagc gtgaaccacc cgatcccatc ccgaactcgg aagtgaaacc 61 gcttagcgcc gatgatagtg cgacaggttg tcgtgcgaaa gtagggaatt gtmaagcnn // LOCUS MAGRRAB 119 bp ss-rRNA RNA 14-JUL-1990 DEFINITION M.rubra 5S ribosomal RNA, complete sequence. ACCESSION M34760 KEYWORDS 5S ribosomal RNA. SOURCE M.rubra (strain 15S) rRNA. ORGANISM Methylomonas rubra Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Methylococcaceae. REFERENCE 1 (bases 1 to 119) AUTHORS Bulygina,E.S., Galchenko,V.F., Govorukhina,N.I., Netrusov,A.I., Nikitin,D.I., Trotsenko,Y.A. and Chumakov,K.M. TITLE Taxonomic studies of methylotrophic bacteria by 5S ribosomal RNA sequencing JOURNAL J. Gen. Microbiol. 136, 441-446 (1990) STANDARD simple staff_entry FEATURES from to/span description rRNA 1 119 5S ribosomal RNA BASE COUNT 30 a 31 c 33 g 23 t 2 others ORIGIN 1 gcttggtgac catagcaagc gtgaaccacc cgatcccatc ccgaactcgg aagtgaaacc 61 gcttagcgcc gatgatagtg tggcaggttg ccatgtgaaa gtaggtcatc gcmaagctn // LOCUS MAGRRAC 117 bp ss-rRNA RNA 14-JUL-1990 DEFINITION M.methanica 5S ribosomal RNA, complete sequence. ACCESSION M34761 KEYWORDS 5S ribosomal RNA. SOURCE M.methanica (strain 12) rRNA. ORGANISM Methylomonas methanica Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Methylococcaceae. REFERENCE 1 (bases 1 to 117) AUTHORS Bulygina,E.S., Galchenko,V.F., Govorukhina,N.I., Netrusov,A.I., Nikitin,D.I., Trotsenko,Y.A. and Chumakov,K.M. TITLE Taxonomic studies of methylotrophic bacteria by 5S ribosomal RNA sequencing JOURNAL J. Gen. Microbiol. 136, 441-446 (1990) STANDARD simple staff_entry FEATURES from to/span description rRNA 1 117 5S ribosomal RNA BASE COUNT 27 a 30 c 32 g 24 t 4 others ORIGIN 1 ncttggtgac catagcgagc gtgraccacc cgatcccatc ccgaactcgg aagtgaaacc 61 gctttgcgcc gatgatagtg tggcagtttg ccatgtgaaa gtaggtcatc gcmaagn // LOCUS MLCRRAA 118 bp ss-rRNA RNA 14-JUL-1990 DEFINITION M.capsulatus (strain ATCC 19069) 5S ribosomal RNA, complete sequence. ACCESSION M34758 KEYWORDS 5S ribosomal RNA. SOURCE M.capsulatus (strain ATCC 19069) rRNA. ORGANISM Methylococcus capsulatus Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Methylococcaceae. REFERENCE 1 (bases 1 to 118) AUTHORS Bulygina,E.S., Galchenko,V.F., Govorukhina,N.I., Netrusov,A.I., Nikitin,D.I., Trotsenko,Y.A. and Chumakov,K.M. TITLE Taxonomic studies of methylotrophic bacteria by 5S ribosomal RNA sequencing JOURNAL J. Gen. Microbiol. 136, 441-446 (1990) STANDARD simple staff_entry FEATURES from to/span description rRNA 1 118 5S ribosomal RNA BASE COUNT 30 a 36 c 36 g 16 t ORIGIN 1 gcctggcggc cagagcgagc gggaaccacc cgatcccatc ccgacctcgg aagtgaaacc 61 gcttagcgcc gatgatagtg caagaatacc tgtgcgaaag tagggaactg ccaggctc // LOCUS MLCRRAB 117 bp ss-rRNA RNA 14-JUL-1990 DEFINITION M.capsulatus (strain Y) 5S ribosomal RNA, complete sequence. ACCESSION M34762 KEYWORDS 5S ribosomal RNA. SOURCE M.capsulatus (strain Y) rRNA. ORGANISM Methylococcus capsulatus Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Methylococcaceae. REFERENCE 1 (bases 1 to 117) AUTHORS Bulygina,E.S., Galchenko,V.F., Govorukhina,N.I., Netrusov,A.I., Nikitin,D.I., Trotsenko,Y.A. and Chumakov,K.M. TITLE Taxonomic studies of methylotrophic bacteria by 5S ribosomal RNA sequencing JOURNAL J. Gen. Microbiol. 136, 441-446 (1990) STANDARD simple staff_entry FEATURES from to/span description rRNA 1 117 5S ribosomal RNA BASE COUNT 31 a 28 c 31 g 23 t 4 others ORIGIN 1 nyttggtgac catagcgagc gtgacccacc cgatcccatc ccgaactcgg aagtgaaacc 61 gcttagtgcc gatgatagtg tggcaatttg ccatgcgaaa gtagggaatt gcmaaan // LOCUS MTBRRAA 115 bp ss-rRNA RNA 14-JUL-1990 DEFINITION M.organophilum 5S ribosomal RNA, complete sequence. ACCESSION M34770 KEYWORDS 5S ribosomal RNA. SOURCE M.organophilum (strain NP-220) rRNA. ORGANISM Methylobacterium organophilum Prokaryota; Bacteria; Methylmonadaceae. REFERENCE 1 (bases 1 to 115) AUTHORS Bulygina,E.S., Galchenko,V.F., Govorukhina,N.I., Netrusov,A.I., Nikitin,D.I., Trotsenko,Y.A. and Chumakov,K.M. TITLE Taxonomic studies of methylotrophic bacteria by 5S ribosomal RNA sequencing JOURNAL J. Gen. Microbiol. 136, 441-446 (1990) STANDARD simple staff_entry FEATURES from to/span description rRNA 1 115 5S ribosomal RNA BASE COUNT 23 a 41 c 34 g 17 t ORIGIN 1 ccggcggcct gagcggtgtg ccagaacccg atcccatctc gaactcggcc gttaaacaca 61 ccagcgccca tggtactgtg ctccaagaca cgggagagtc ggtcgccgcc aggct // LOCUS MTBRRAB 118 bp ss-rRNA RNA 14-JUL-1990 DEFINITION M.extorquens 5S ribosomal RNA, complete sequence. ACCESSION M34771 KEYWORDS 5S ribosomal RNA. SOURCE M.extorquens (strain AM1) rRNA. ORGANISM Methylobacterium extorquens Prokaryota; Bacteria; Methylmonadaceae. REFERENCE 1 (bases 1 to 118) AUTHORS Bulygina,E.S., Galchenko,V.F., Govorukhina,N.I., Netrusov,A.I., Nikitin,D.I., Trotsenko,Y.A. and Chumakov,K.M. TITLE Taxonomic studies of methylotrophic bacteria by 5S ribosomal RNA sequencing JOURNAL J. Gen. Microbiol. 136, 441-446 (1990) STANDARD simple staff_entry FEATURES from to/span description rRNA 1 118 5S ribosomal RNA BASE COUNT 24 a 39 c 37 g 18 t ORIGIN 1 gcctggcggc ctgagcggtg tggaagaacc cgatcccatc tcgaactcgg ccgttaaaca 61 caccagcgcc gatggtactg tgtctcaaga cacgggagag tcggtcgccg ccaggccc // LOCUS MYLRRAA 119 bp ss-rRNA RNA 14-JUL-1990 DEFINITION M.trichosporium 5S ribosomal RNA, complete sequence. ACCESSION M34763 KEYWORDS 5S ribosomal RNA. SOURCE M.trichosporium (strain 4E) rRNA. ORGANISM Methylosinus trichosporium Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Methylococcaceae. REFERENCE 1 (bases 1 to 119) AUTHORS Bulygina,E.S., Galchenko,V.F., Govorukhina,N.I., Netrusov,A.I., Nikitin,D.I., Trotsenko,Y.A. and Chumakov,K.M. TITLE Taxonomic studies of methylotrophic bacteria by 5S ribosomal RNA sequencing JOURNAL J. Gen. Microbiol. 136, 441-446 (1990) STANDARD simple staff_entry FEATURES from to/span description rRNA 1 119 5S ribosomal RNA BASE COUNT 28 a 29 c 33 g 24 t 5 others ORIGIN 1 nncttggtga ctatagtgag cgtgaaccac ccgatcccat cccgaactcg gaagtgaaac 61 cgcttagcgc cgatgatagt gtggccgttt gccatgcgaa agtagggcat tgcmaagyn // LOCUS MYMRRAA 115 bp ss-rRNA RNA 14-JUL-1990 DEFINITION M.methylotrophus 5S ribosomal RNA, complete sequence. ACCESSION M34765 KEYWORDS 5S ribosomal RNA. SOURCE M.methylotrophus (strain NCIB 10515) rRNA. ORGANISM Methylophilus methylotrophus Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Methylococcaceae. REFERENCE 1 (bases 1 to 115) AUTHORS Bulygina,E.S., Galchenko,V.F., Govorukhina,N.I., Netrusov,A.I., Nikitin,D.I., Trotsenko,Y.A. and Chumakov,K.M. TITLE Taxonomic studies of methylotrophic bacteria by 5S ribosomal RNA sequencing JOURNAL J. Gen. Microbiol. 136, 441-446 (1990) STANDARD simple staff_entry FEATURES from to/span description rRNA 1 115 5S ribosomal RNA BASE COUNT 29 a 33 c 28 g 25 t ORIGIN 1 cttggggaca atagcggttt ggaaccaccc cttcccatct cgaacagggc cgtgaaacga 61 acttgcgcca atgatagtgt actcttcgta tgcgaaagta ggtcatcccc aagcc // LOCUS MYPRRAA 119 bp ss-rRNA RNA 14-JUL-1990 DEFINITION M.parvus 5S ribosomal RNA, complete sequence. ACCESSION M34764 KEYWORDS 5S ribosomal RNA. SOURCE M.parvus (strain 492) rRNA. ORGANISM Methylocystis parvus Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Budding and/or appendaged bacteria. REFERENCE 1 (bases 1 to 119) AUTHORS Bulygina,E.S., Galchenko,V.F., Govorukhina,N.I., Netrusov,A.I., Nikitin,D.I., Trotsenko,Y.A. and Chumakov,K.M. TITLE Taxonomic studies of methylotrophic bacteria by 5S ribosomal RNA sequencing JOURNAL J. Gen. Microbiol. 136, 441-446 (1990) STANDARD simple staff_entry FEATURES from to/span description rRNA 1 119 5S ribosomal RNA BASE COUNT 28 a 32 c 33 g 22 t 4 others ORIGIN 1 nncttggtga ccatagcgag cgtgaaccac ccgatcccat cccgaactcg gaagtgaaac 61 cgcttagcgc cgatgatagt gtggccgttt gccatgcgaa agtagggcat tgcmaagcn // LOCUS PSERRAA 118 bp ss-rRNA RNA 14-JUL-1990 DEFINITION P.stutzeri 5S ribosomal RNA, complete sequence. ACCESSION M34776 KEYWORDS 5S ribosomal RNA. SOURCE P.stutzeri (strain 8) rRNA. ORGANISM Pseudomonas stutzeri Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Pseudomonadaceae. REFERENCE 1 (bases 1 to 118) AUTHORS Bulygina,E.S., Galchenko,V.F., Govorukhina,N.I., Netrusov,A.I., Nikitin,D.I., Trotsenko,Y.A. and Chumakov,K.M. TITLE Taxonomic studies of methylotrophic bacteria by 5S ribosomal RNA sequencing JOURNAL J. Gen. Microbiol. 136, 441-446 (1990) STANDARD simple staff_entry FEATURES from to/span description rRNA 1 118 5S ribosomal RNA BASE COUNT 23 a 35 c 35 g 25 t ORIGIN 1 gcctggtggc tatggcgagg aggcctaacc cgatcccatc ccgaactcgg ccgttaaact 61 cctcagtgcc gatggtacta tgtcttaaga cctgggagag taggtcgtcg ccaggcct // LOCUS RECRRAA 116 bp ss-rRNA RNA 14-JUL-1990 DEFINITION R.vacuolatum 5S ribosomal RNA, complete sequence. ACCESSION M34773 KEYWORDS 5S ribosomal RNA. SOURCE R.vacuolatum (strain Rv) rRNA. ORGANISM Renobacter vacuolatum Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Budding and/or appendaged bacteria; Nonprosthecate bacteria. REFERENCE 1 (bases 1 to 116) AUTHORS Bulygina,E.S., Galchenko,V.F., Govorukhina,N.I., Netrusov,A.I., Nikitin,D.I., Trotsenko,Y.A. and Chumakov,K.M. TITLE Taxonomic studies of methylotrophic bacteria by 5S ribosomal RNA sequencing JOURNAL J. Gen. Microbiol. 136, 441-446 (1990) STANDARD simple staff_entry FEATURES from to/span description rRNA 1 116 5S ribosomal RNA BASE COUNT 23 a 36 c 32 g 25 t ORIGIN 1 ctggtggcca ttgcgaggag ccccaattcg atctcactcc gaactcggcc gttaaactcc 61 tcagcgccga tggtactatg tctcaagacc tgggagagta ggtcgctgcc aggcct // LOCUS TUMRRAA 116 bp ss-rRNA RNA 14-JUL-1990 DEFINITION T.mutans 5S ribosomal RNA, complete sequence. ACCESSION M34772 KEYWORDS 5S ribosomal RNA. SOURCE T.mutans (strain U2) rRNA. ORGANISM Tuberoidobacter mutans Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Budding and/or appendaged bacteria; Nonprosthecate bacteria. REFERENCE 1 (bases 1 to 116) AUTHORS Bulygina,E.S., Galchenko,V.F., Govorukhina,N.I., Netrusov,A.I., Nikitin,D.I., Trotsenko,Y.A. and Chumakov,K.M. TITLE Taxonomic studies of methylotrophic bacteria by 5S ribosomal RNA sequencing JOURNAL J. Gen. Microbiol. 136, 441-446 (1990) STANDARD simple staff_entry FEATURES from to/span description rRNA 1 116 5S ribosomal RNA BASE COUNT 23 a 36 c 33 g 24 t ORIGIN 1 ctggtggcca ttgcgaggag cctgaacccg actccactcc gaactcggcc gttaaactcc 61 tcagcgccga tggtactatg tctcaagacc tgggagagta ggtcgctgcc aggtct // LOCUS ANAPETF1A 1682 bp ds-DNA BCT 14-JUL-1990 DEFINITION A.variabilis ferredoxin (petF1) gene, complete cds. ACCESSION M35126 KEYWORDS ferredoxin. SOURCE A.variabilis (ATCC 29413; PCC 7937) DNA, clone pVA1. ORGANISM Anabaena variabilis Prokaryota; Bacteria; Gracilicutes; Oxyphotobacteria; Cyanobacteria; Nostocales; Nostocaceae. REFERENCE 1 (bases 1 to 1682) AUTHORS Van Der Plas,J., De Groot,R., Woortman,M., Cremers,F., Borrias,M., Van Arkel,G. and Weisbeek,P. TITLE Genes encoding ferredoxins from Anabaena sp. PCC 7937 and Synechococcus sp. PCC 7942: Structure and regulation JOURNAL Photosyn. Res. 18, 179-204 (1988) STANDARD simple staff_entry FEATURES from to/span description pept 794 1093 ferredoxin (petF1) binding 782 785 ribosomal binding site (put.) BASE COUNT 480 a 353 c 322 g 527 t ORIGIN 1 aagcttgatg taagccacct aaaagttgaa tgtgttcgac gtggagtaat ttttgctgtg 61 ctttggctaa ggcaatcagt tcttctgctt cagttaaatc tacagataaa ggatattcta 121 caattacgtg tttgccggca gttagagcag cacggcgatc gcaccatgat cacgattgat 181 ggtacaaata accactaaat ctatatcttc tcgttctact aactgttgcc agccagttat 241 cgcttcagcc tggtaatctt gagccaaagc ctgggttcgc tccagtgtac tacccgctat 301 ggcgactagg tgcgatcgct tatcctccaa aaaggcttct gcccgcagtt ttgctgcata 361 ccctgtccaa ctatgccaat acgtattgtt gcttgtgcca aagctacctc cgactcctat 421 ataatcctca ccagtcctgc taccaagcac agtaataaat ttttattttt caaaaaaagc 481 taggttaacg cagtaccagc ctaactgaag gttatccctg ttccagaact tttttatcat 541 attcttgatt caagccataa atttttctaa taactgtggg tttaacgtct ttaaagtata 601 aaaaaaattt attgttatct gctaactaaa tttcattact aatcgtggtg gatttcaagt 661 tatatacttt gatttttccc gtagtatcag aattgaacta aatttaattc tgtggctaat 721 cccctgagaa tagccgctaa gttctgcttt agcataactt atactgccga ttacaaaaga 781 gaggattacg gaaatggcaa cttttaaagt tacattgatc aacgaagcag aaggaaccag 841 caacacaatt gacgttcctg atgatgagta tattttagac gctgccgaag aacagggtta 901 tgacctaccc ttttcctgtc gtgcaggtgc ttgctccacc tgcgccggta aactagtatc 961 cggtactgtt gaccagtctg accaatcatt cttggatgac gatcaaatcg aagctggata 1021 tgtattgacc tgtgttgctt atccaacctc tgatgtaacc atccaaaccc acaaagaaga 1081 agacctctac taagagtcaa gttagcttta aaaatagagg ctagaggcta gcagctacag 1141 actagtccct agccttttta ttttttttca ttagggaaca gtctattggg actgggtgta 1201 agggtgtaag ggtatagggg tgtaatagtt tcaaacattt atactctttt caacccttga 1261 tatttcgttt tcatccgtaa gtcccacaaa aaataaatta tccaaaattg atggtttggt 1321 agggtgcgtc aatagaaatc atttctgagt gtatttaggc tctatcgcac tgacggacac 1381 tacattttgg atattttttg atctgaaagt cccttaaaat ttgatttctt cattcaggct 1441 gtgaagttgt gtaccaggat aataaaattg gagaattttg ctgttagacc aacctagttt 1501 agctaaagtt tgagcgccag tttgacttaa gcccactcca tgtcctaaac caccaccaat 1561 aaaagcgtat cccacaactc ttctttgcct ttgttgagag gtagtatgta aaaaagcgtg 1621 cttcttggcg cagcaaaggc actacgcact tcatctttgt gtagggtaaa aatgcccata 1681 tc // LOCUS ANIPETF1A 1074 bp ds-DNA BCT 14-JUL-1990 DEFINITION A.nidulans ferredoxin (petF1) gene, complete cds. ACCESSION M35127 KEYWORDS ferredoxin. SOURCE A.nidulans (strain R2, PCC7942) DNA, clone pA612. ORGANISM Anacystis nidulans Prokaryota; Bacteria; Gracilicutes; Oxyphotobacteria; Cyanobacteria. REFERENCE 1 (bases 1 to 1074) AUTHORS Van Der Plas,J., De Groot,R., Woortman,M., Cremers,F., Borrias,M., Van Arkel,G. and Weisbeek,P. TITLE Genes encoding ferredoxins from Anabaena sp. PCC 7937 and Synechococcus sp. PCC 7942: Structure and regulation JOURNAL Photosyn. Res. 18, 179-204 (1988) STANDARD simple staff_entry FEATURES from to/span description pept 504 803 ferredoxin (petF1) binding 493 496 ribosomal binding site (put.) BASE COUNT 234 a 298 c 281 g 261 t ORIGIN 1 ctgcagccgc tgcatgcgca gggcgttcga tttgagggct ttttgtggaa catcttggcg 61 cacatcatcg aagctctaac accgaggctg tctgagttgg ctagtcctgg cagctgggca 121 atctttagcg gcttgctaac cagtcaagcc gacactgtca gcgtcacttt ggaagagtac 181 ttttgggtga tccgcgatcg cgccagtcag ggagattggt gtcgtttggt cgcggatttt 241 cggccggaac gataaatctc actaatgctt agcttagagg gcttactggg agcgggccga 301 gtttgagccg tgattacccc tacgaacttt ccggccacgc tccattgctt agacataaaa 361 ttcccttatg tctagactgg cgattgatag catttctcgc ggcgcagttc gccctttggc 421 aacccatagt atcaatggga aaggtacggg caggctgtca atcgatgagc tctgccaccc 481 caaaagcgat agaggacacg ctcatggcaa cctacaaggt tacgctcgtc aatgctgccg 541 aaggcttgaa caccacgatc gacgtggctg acgataccta catcttggac gccgctgaag 601 agcaaggcat tgacctgcct tactcctgcc gtgctggtgc ttgctcgacc tgtgctggca 661 aagtcgtctc tggtaccgtc gaccaatcgg atcaatcctt cttggatgac gaccaaattg 721 cagcaggctt tgtcctgacc tgcgtcgcct atccgacctc cgatgtgacg atcgaaaccc 781 acaaagaaga agacctctac taagtcttgc ttcgactgct tcaatcctta gaattcaaat 841 caaattgcgg cttccaaatt gggagtcgct tttttatcgc cgtagtcagc agcgatcgcc 901 tcactaccgt ccagaaacag cctgcgatcg ccctgactta acgcccctcc cctagcaatc 961 ctgactcggt agagttcagg ccaaggactc aaggttgagg cgatcgttca gcaacgtctt 1021 gctctaggcg tccggcgtga ctgttgtggg gaagaggcga tcgccttcct gcag // LOCUS ASAASPPA 1764 bp ds-DNA PLN 14-JUL-1990 DEFINITION A.awamori aspergillopepsin A (pepA) gene, complete cds. ACCESSION M34454 KEYWORDS aspartic proteinase; aspergillopepsin. SOURCE A.awamori (NRRL3112 mutant strain UVK143f) DNA. ORGANISM Aspergillus awamori Eukaryota; Plantae; Thallobionta; Eumycota; Ascomycotina; Plectomycetes; Eurotiales; Trichocomaceae. REFERENCE 1 (bases 1 to 1764) AUTHORS Berka,R.M., Ward,M., Wilson,L.J., Hayenga,K.J., Kodama,K.H., Carlomagno,L.P. and Thompson,S.A. TITLE Molecular cloning and deletion of the gene encoding aspergillopepsin A from Aspergillus awamori JOURNAL Gene 86, 153-162 (1990) STANDARD simple staff_entry FEATURES from to/span description pept 177 496 aspergillopepsin (pepA), exon 1 548 825 aspergillopepsin, exon 2 878 1126 aspergillopepsin, exon 3 1186 1523 aspergillopepsin, exon 4 sigp 177 308 aspergillopepsin signal peptide matp 455 496 aspergillopepsin 548 825 aspergillopepsin 878 1126 aspergillopepsin 1186 1523 aspergillopepsin pre-msg 124 1764 pepA mRNA and introns (put.) pre-msg 125 1764 pepA mRNA and introns (put.) pre-msg 127 1764 pepA mRNA and introns (put.) IVS 497 547 pepA intron A IVS 826 877 pepA intron B IVS 1127 1185 pepA intron C signal 54 58 TATA box BASE COUNT 387 a 515 c 445 g 417 t ORIGIN 1 gtcgacttgg atgatggaga gttgatcgag gtcaatgagg agaggcttgc aagtataaga 61 agagactgct cgaccagcag aatggatctt cttgttcatc aaccaagagt ccaaggcttc 121 tttgtctggt tctatctctt ctccgaactc tcttgcttga cattctcgtg gtcaaaatgg 181 tcgtcttcag caaaaccgct gccctcgttc tgggtctgtc ctccgccgtc tctgcggcgc 241 cggctcctac tcgcaagggc ttcaccatca accagattgc ccggcctgcc aacaagaccc 301 gcaccatcaa cctgccaggc atgtacgccc gttccctggc caagtttggc ggtacggtgc 361 cccagagcgt gaaggaggct gccagcaagg gtagtgccgt gaccacgccc cagaacaatg 421 acgaggagta cctgactccc gtcactgtcg gaaagtccac cctccatctg gactttgaca 481 ccggatctgc agatctgtaa gcttccctgc tcgggtgttc gggcaaatcg tgactaacct 541 ggactagctg ggtcttctcg gacgagctcc cttcctcgga gcagaccggt cacgatctgt 601 acacgcctag ctccagcgcg accaagctga gcggctacac ttgggacatc tcctacggtg 661 acggcagctc ggccagcgga gacgtgtacc gggatactgt cactgtcggc ggtgtcacca 721 ccaacaagca ggctgttgaa gcagccagca agatcagctc cgagttcgtt cagaacacgg 781 ccaatgacgg ccttttggga ctggccttta gctccatcaa cactggtgag tcaatcctac 841 atcagccggg ttgacctacc tgctgaccga tagacagtcc agccaaaggc gcagacaacc 901 ttcttcgaca ccgtcaagtc ccagctggac tctccccttt tcgccgtgca gctgaagcac 961 gacgcccccg gtgtttacga ctttggctac atcgatgact ccaagtacac cggttctatc 1021 acctacacgg atgccgatag ctcccagggt tactggggct tcagcaccga cggctacagt 1081 atcggtgacg gcagctccag ctccagcggc ttcagcgcca ttgctggtaa gaaccgcctt 1141 catttaacac acaacttgtc cacctcttta ctaactagtg tatagacacc ggtaccaccc 1201 tcatcctcct cgatgacgaa atcgtctccg cctactacga gcaggtttct ggcgcctcag 1261 gagagacgga agccggtggc tacgttttct cttgctcgac caacccccct gacttcactg 1321 tcgtgattgg cgactacaag gccgttgttc cgggcaagta catcaactac gctcccatct 1381 cgactggcag ctccacctgc tttggcggta tccagagcaa cagcggtctg ggactgtcca 1441 tcctgggtga tgttttcttg aagagccagt acgtggtctt caactctgag ggccctaagc 1501 tgggattcgc cgctcaggct tagattatcc actgaagtgg agtctatgat ctgctgattg 1561 atccctcgac gatgaactac atgtggaaat gcatagcaga cgagggtgat ggtgatgatg 1621 ttgatttgat gatgacccgt acatacttga tgaagctcgg tacatatgca atgtgactgt 1681 atctatgtga tgatatatgt atcatctcat aacttttggc tatgagtgca ggataacact 1741 gaaccagtag tagtactttc cact // LOCUS BSUPSIF2A 2758 bp ds-DNA BCT 14-JUL-1990 DEFINITION B.subtilis protein synthesis initiation factor 2 (infB) gene, complete cds. ACCESSION M34836 KEYWORDS protein synthesis initiation factor 2. SOURCE B.subtilis (strain RS410) DNA, clones lambda-JET[1,2],pUK, and pJET1. ORGANISM Bacillus subtilis Prokaryota; Bacteria; Firmicutes; Endospore-forming rods and cocci; Bacillaceae. REFERENCE 1 (bases 1 to 2758) AUTHORS Shazand,K., Tucker,J., Chiang,R., Stansmore,K., Sperling-Petersen,H.U., Grunberg-Manago,M., Rabinowitz,J.C. and Leighton,T. TITLE Isolation and molecular genetic characterization of the Bacillus subtilis gene (infB) encoding protein synthesis initiation factor 2 JOURNAL J. Bacteriol. 172, 2675-2687 (1990) STANDARD simple staff_entry FEATURES from to/span description pept 381 2531 protein synthesis initiation factor 2 (infB) binding 369 375 ribosomal binding site (put.) BASE COUNT 917 a 535 c 662 g 644 t ORIGIN 1 atcgatgacc agattttcga tgaattgctg gaactggcgg aaaaggtgaa aaaataaaat 61 gtctggaatg gaatggtttc ccttgctggg tctggccaat cgagctcgta aggtcgtgtc 121 aggcgaagac ttggtaataa aagaaatcag gaatgcgcgt gcaaagcttg tcctgcttac 181 agaggatgca tcatctaaca cagcaaaaaa agtaactgac aagtgcaatt attataaagt 241 cccttataaa aaagtcgaga gtcgcgcggt tcttggacgc tctatcggta aagaagcccg 301 tgtcgttgtc gccgtcactg accaaggttt tgcgaataag ctgatcagct tgctcgatta 361 atatttttgg gggtgaacga atggctaaaa tgagagtata cgaatatgca aaagcgttaa 421 atgtttcaag taaggaaatt ttgaccgcac tgaagaacat ggatttagaa gtgaataatc 481 acatggccat gcttgaagaa aaggccatta aaaagctaga tgccaaatat aaaaaaggcg 541 gcgcacgtgc taaatctcaa aagccagcag aaacgaacaa aaacaaacag ccacaagggg 601 ttaatcagca atcagctgga aatcaaccaa ataaaattcg agacggaaag aagaatgacg 661 tgcagaataa tcaatttaac aaaaacaaga agaataacaa caacaaaaaa aataaacgca 721 acaacaacaa taataaaaac caacatcagc aaaagcctgt aaagccgaaa aaagagcttc 781 ctgagaaaat tacattctct ggcactttaa cagttggcgc acttgctgaa gagcttggca 841 aagagccttc agaactcatt aaaaagctga tgcttcttgg cgtaatggca accattaacc 901 aagagcttga taaagacaca atcgaactca ttgcatcaga atatggtgtt gaaacagaag 961 aggtcattgt gcttgaagaa acagagctgg aaaaatacga agagcctgat aatgaagagg 1021 atcttgaaat tcgtcctcct gtcgtgacaa tcatgggcca cgttgaccat gggaaaacaa 1081 cgcttcttga cagcatccgt aaaacaaagg ttgttgaagg ggaagcaggc ggaatcactc 1141 agcatatcgg cgcttatcaa attgaagaga acggcaagaa aatcacgttc ttggatacac 1201 cgggccacgc cgcatttaca acaatgcgcg cacgcggtgc agaagtaact gatattacga 1261 ttcttgtcgt agctgccgat gacggtgtca tgccgcaaac agttgaagcg attaaccatg 1321 caaaagcagc agaggttcca atcatcgttg ctgtgaataa aatagataaa gaatccgcaa 1381 accctgaccg tgtaatgcaa gaactgacgg aatacggact tgtccctgaa gcttggggcg 1441 gagaaaccat ttttgtgccg ctttccgctt taactggaaa aggcattgat gagctcgttg 1501 aaatgatttt gcttgtcagt gaggtagagg aactgaaagc gaatccgaat cgtcaggcaa 1561 aaggaacggt tattgaagct gaactcgata aaggcagagg gtcagtcgcg acgttgctcg 1621 tacagactgg aacactgcat gtcggtgatc cgatcgtagt cggcaataca tttggccgtg 1681 tccgtgcaat ggtcaacgac attggccgcc gtgtgaaaac tgccggcccg tcaactccgg 1741 ttgaaattac cggtttgaat gatgtccctc aagcgggaga ccaattcctt gtctttaagg 1801 atgaaaaaac agctcgttct gtcggtgaag cccgtgcttc gaaacagctt gaagagcagc 1861 gcagcgataa agcgaagctc agtcttgatg atttatttga gcaaattaag caaggtgatg 1921 taaaagacat caacctcatc gtaaaagctg acgttcaagg atctgctgaa gctttaacgg 1981 ctgcacttca aaaaattgaa gtagaaggcg ttaaagtgaa aatcatccat acaggcgttg 2041 gtgcgattac tgaatcagac attatcttgg catctgcttc caatgcaatt gttatcgggt 2101 ttaatgtgag accggacgga aatgctaaga gtacggctga agctgaaaat gtagatattc 2161 gacttcaccg tatcatttac aaagtaatcg acgagattga agctgccatg aaaggtatgc 2221 ttgatcctga atatgaagaa aaagtaattg gtcaagtaga agtacgccaa acattcaaag 2281 tatctaaaat cggtacaatt gccggcggat atgttactga aggaaccatt acgcgcgaca 2341 gcggcctccg tttaattcgt gacggcgtcg tcatctttga aggcgaagta gatgttctga 2401 aacgctttaa agacgatgtg aaagaagttt cacaaggcta tgaatgtggt attacaatta 2461 agaaatacaa tgacattcgt gaaggtgaca tccttgaagc gtttgtcatg caagaaattg 2521 aaagaacgtg atcggatttg cggagtgtga atgcatcatt tatgatgcag gatcgctaaa 2581 agaaaagcgt gccgttctga agcggatttt aaccagggtt caaaacaagt tcaatgtttc 2641 gatttcggag attggctatc aggacacctg gcaaagaacc agcttcggaa tcgccgctgt 2701 ttcttcctct cgcgttcaaa cagaaaaaga actgcagcgc gtcctagcgt ttatcgat // LOCUS CIBRCRA 1845 bp ds-DNA BCT 14-JUL-1990 DEFINITION Plasmid Colicin Ib-P9 replication control region replicon initiator protein (repZ) gene, complete cds. ACCESSION M34837 KEYWORDS replicon initiator protein. SOURCE Plasmid Colicin Ib-P9 (from E.coli) DNA, clone pCH10-delta-1. ORGANISM Plasmid Colicin Ib-P9 Prokaryota; Bacteria. REFERENCE 1 (bases 1 to 1845) AUTHORS Hama,C., Takizawa,T., Moriwaki,H., Urasaki,Y. and Mizobuchi,K. TITLE Organization of the replication control region of plasmid Co1Ib-P9 JOURNAL J. Bacteriol. 172, 1983-1991 (1990) STANDARD simple staff_entry FEATURES from to/span description pept 455 1486 repZ protein mRNA 157 1762 repZ protein mRNA (alt.) mRNA 158 1762 repZ protein mRNA (alt.) mRNA 363 < 1 (c) inc RNA binding 442 448 ribosomal binding site (put.) signal 377 372 (c) inc -10 region signal 400 395 (c) inc -35 region mut 353 353 c in wild type; t in inc3 mut 374 374 t in wild type; c in inc2 mut 400 400 a in wild type; c in inc1 BASE COUNT 505 a 467 c 490 g 383 t ORIGIN 1 gatctaaagc agaaaaatct gcttttaaaa atagatttta tttttttgac gcaggtcaag 61 attaacttat tagaggtatc agtgaggagg cactggaaga gaagagatcg ttgtaatgct 121 tttcaaatta acgtaaagcg ggtatatttc ggctgttatt agctgtgcag agggtggcac 181 tctgtggagc aaagcggcga aagccggacg gcagaatgcg ccataaggca ttcaggagag 241 atggcatgta cgggcagtaa gtcagaagac tgaagatgtt ccggaagcca taaaaggaaa 301 acccccacta tctttcttac gaacttggcg gaaagacgaa agatagtggg ggcctcacag 361 aatacgggta aagtataatg aaaccgtacc agagattcaa ccctgtgcag tgtataaata 421 cacggcacaa tcgctccgcc ataagcgaca gcttgtggca ggtctgaaga atactccata 481 taacgcagta cactggagtc agttagcacc cgaagagcag atccgtttct gggaagacta 541 tgaagcggga agggcgacca ctttcctggt tgaaccggaa aggaagcgca cgaagcgtcg 601 tcgcggtgag cactccacca aacccaaatg cgaaaatccg tcctggtatc gtcctgagcg 661 ctataaagcg ctgaaagggc agctcgggca cgcctataac cgtctggtga aaaaggaccc 721 ggtgaccggc gagcagagcc tgcgcatgcg catgtcgcgg catccttttt atgtacagaa 781 acggacgttc gtcggtcgca aatacgcttt ccggccggaa aaacaacgtc tcctcgatgc 841 catctggccg gttctggtca gcttcagtga tgccggcaca cacaccgtgg ggatgagcgt 901 cactcggctg gctgaagaaa tcagcccgaa agacagcgag ggacatgtta ttccggaact 961 ggaagtgacg gtctcccgtc tttcccgtct gctggccgaa caggtgcgct ttggtgtgct 1021 gggtgtgtca gaggaaacaa tgtgggaccg tgaacatcgt cagcgtctgc cacgttacgt 1081 ctggataaca ccggcaggct ggcagatgct gggcgtcgac atggtaaaac ttcacgaaca 1141 gcagcagaaa cgactgcgtg aaagtgaaat ccgccagcag ctcattcggg aaggtgttct 1201 gcgtgaggat gaagatatct ccgttcatgc ggccagaaaa cgctggtatc tgcagcgcag 1261 ccaggatgcc ctgaagaagc gccgagaaaa agccgcagcc agcaagcgtg ccaaccgcct 1321 gaagaaatta cctgttgacc agcagattta tgagatggct gagtatctcc ggaagcgtct 1381 gccgccggat gaagcctatt tctgttccga tgaccatctg aagcgactgg ccatcaggga 1441 gttgcgtcag cttgaactga cgctggctgc cccgccaccg cactagacag catcattccc 1501 tcagcactga atcatcgcca gcccctccgg ggctttcggc gcagattccg ctcagcccaa 1561 aatccttagt agtcacctta aatcccctca gaggggcata tctgcccata aaaccacgca 1621 tcagtcatca gaacgtggcc acgttgtttc agttatccac ataaatccgc aaacaaagaa 1681 ttttaagaag ctgcaaacct gaaacagcaa acctgcaata tattcttaac cccatcattt 1741 aatcccctgc gttgcttcgc cgcagggaaa gtctttatct ctgaaaccac tgtgaacaaa 1801 tacaaaaagg ccttcgcttg cagcgggcaa ggccgcgccg gaatt // LOCUS ECOHEMBA 1085 bp ds-DNA BCT 14-JUL-1990 DEFINITION E.coli porphobilinogen synthase (hemB) gene, complete cds. ACCESSION M35121 KEYWORDS 5-aminolevulinic acid dehydratase; hemB gene; porphobilinogen synthase. SOURCE E.coli (strain K12) DNA. ORGANISM Escherichia coli Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Facultatively anaerobic rods; Enterobacteriaceae. REFERENCE 1 (bases 1 to 1085) AUTHORS Echelard,Y., Dymetryszyn,J., Drolet,M. and Sasarman,A. TITLE Nucleotide sequence of the hemB gene of Escherichia coli K12 JOURNAL Mol. Gen. Genet. 214, 503-508 (1988) STANDARD simple staff_entry FEATURES from to/span description pept 80 1054 porphobilinogen synthase (hemB) signal 15 20 hemB -35 region signal 41 46 hemB -10 region BASE COUNT 254 a 277 c 301 g 253 t ORIGIN 1 tcgcgacaac tttcgtaaaa catccctacc ctgcttcagg tatactatgc ccctcgattc 61 cacaaacatc aggcagacca tgacagactt aatccaacgc cctcgtcgcc tgcgcaaatc 121 tcctgcgctg cgcgtatgtt tgaagagaca acacttagcc ttaacgacct ggtgttgccg 181 atctttgttg aagaagaaat tgacggacta caaagccgtt gaagccatgc caggcgtgat 241 gcgcattcca gagaaacatc tggcacgcga aattgaacgc atcgccaacg ccggtattcg 301 ttccgtgatg acttttggca tctctcacca taccgatgaa accggcagcg atgcctggcg 361 ggaagatgga ctggtggcgc gtatgtcgcg catctgcaag cagaccgtgc cagaaatgat 421 cgttatgtca gacacctgct tctgtgaata cacttctcac ggtcactgcg gtgtgctgtg 481 cgagcatggc gtcgacaacg acgcgactct ggaaaattta ggcaagcaag ccgtggttgc 541 agctgctgca ggtgcagact tcatcgcccc ttccgccgcg atggacggcc aggtacaggc 601 gattcgtcag gcgctggacg ctgcgggatt taaagatacg gcgattatgt cgtattcgac 661 caagttcgcc tcctcctttt atggcccgtt ccgtgaagct gccggaagcg cattaaaagg 721 cgaccgcaaa agctatcaga tgaacccaat gaaccctcgt gaggcgattc gtgaatcact 781 gctggatgaa gcccagggcg cagactgcct gatggttaaa cctgctggag cgtacctcga 841 catcgtgcgt gagctgcgtg aacgtactga attgccgatt ggcgcgtatc aggtgagcgg 901 tgagtatgcg atgattaagt tcgccgcgct ggcgggtgct atagatgaag agaaagtcgt 961 gctcgaaagc ttaggttcga ttaagcgtgc gggtgcggat ctgattttca gctactttgc 1021 gctggatttg gctgagaaga agattctgcg ttaattttgt ttcctctcag aaagggggag 1081 ttaac // LOCUS ECOSTMA 1418 bp ds-DNA BCT 14-JUL-1990 DEFINITION E.coli F factor (stm) gene, complete cds. ACCESSION M35124 KEYWORDS F factor; stm gene. SOURCE E.coli DNA, clones pEG[157,159,175,176,220]. ORGANISM Escherichia coli Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Facultatively anaerobic rods; Enterobacteriaceae. REFERENCE 1 (bases 1 to 1418) AUTHORS Golub,E.I. and Panzer,H.A. TITLE The F factor of Escherichia coli carries a locus of stable plasmid inheritance stm, similar to the parB locus of plasmid RI JOURNAL Mol. Gen. Genet. 214, 353-357 (1988) STANDARD simple staff_entry FEATURES from to/span description pept 830 1009 F factor (stm) signal 1013 1039 pot. transcription termination signal signal 1074 1105 pot. transcription termination signal BASE COUNT 279 a 364 c 429 g 345 t 1 others ORIGIN 1 atcgctgtct gacctgcgat ttttcatgcc gtccctgacc gcagaagaac tgcacggcaa 61 ccgtcttcag tggctgtatg gcatcgatgt gctgattgaa acacagggcg aggtctgcct 121 ccttcccctt cccggtgatg cagcagagcg gtcgtttccg tcggtgcggt ttcgtgtcag 181 ggagcgcagt cgccataaat ccgcgctggt catgcagaag tacagccgcc agcaggcacg 241 ggaggcagag cagaaagccc gtgcgtacca ggcgctggtc gctcaggctg aaatcgaact 301 ggcattccat tcgccggaaa cagtcggaag ctggcatgca cgctggagtg accgggtggc 361 agagcatgac ctggaaaccc tgttctggca gtggggcgag cgttttccgt cactggccgg 421 aatggaacgc tggcagtggc aggatatgcc gttcgtggca ggttattgcg gaacgcagtc 481 ttgcagcaag agaggccggt cacgctgtcc gtgagatgag cgtgatgtgc cgaacaaact 541 ccggaggcag cgtgatgcgg gcaacaatca cacagattac ccgtaaacag cctgaatgag 601 cgggttattt tcaggaaaag tatgtgtggt cagcgtgcag ggatatggct atgatgtgcc 661 cggcgcttga ggctttctgc ctcatgacgt gaaggtggtt tgttgccgtg ttgtgtggca 721 gaaggacaaa agccccatag ttaatttttc attaacccac gaggcccctg catgcttaga 781 caacatcagg atagcctctt actgcgcttt cgacgaggag aagaaggcca tgaaactacc 841 acgcagctct cttgtctggt gtgtgttgat cgtgtgtctc acactgttga tattcactta 901 tctgacacga aaatcgctgt gcgagattcg ttacagagac ggatacaggg aggtngcggc 961 cggatacagg gaggtggcgg ctttcatggc ttacgaatcc ggtaagtagc aacctggagg 1021 cgggcgcagg cttttcaggg ctgatgctgg tctgactgca ctgatgcctt tataaagggg 1081 ctgctggttc gccggtggcc cctttctcct tgctggcatt gtacgggcat gaacaaccag 1141 actgccagga aggccgtttc cctctgggaa cggggaagaa cgtgcgcctg cggccgttct 1201 tttcccccgc cttctctgtt tatgacctgt cagaaatcac atcaccttcc tttgattatt 1261 ctctttttct ctgttctgac tctgattact gctgccctga ttcctgtctg ttttgttgtt 1321 ctcctggcag tgtcttttcc gggcgtcctg gcggggcggc tggcgtgaac ggagccagcg 1381 aagctgtctc tggcgggctt ccatccctga cgggaaac // LOCUS HUMGPIIBA 3333 bp ss-mRNA PRI 14-JUL-1990 DEFINITION Human platelet glycoprotein IIb (GPIIb) mRNA, complete cds. ACCESSION M34480 KEYWORDS platelet glycoprotein IIb. SOURCE Human megakaryocytes, cDNA to mRNA, clone IIb[3,4]. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 3333) AUTHORS Frachet,P., Uzan,G., Thevenon,D., Denarier,E., Prandini,M.H. and Marguerie,G. TITLE GPIIb and GPIIIa amino acid sequences deduced from human megakaryocyte cDNAs JOURNAL Mol. Biol. Rep. 14, 27-33 (1990) STANDARD simple staff_entry FEATURES from to/span description pept 33 3152 platelet glycoprotein IIb (GPIIb) mRNA 1 3333 GPIIb mRNA BASE COUNT 626 a 998 c 1040 g 669 t ORIGIN 1 attcctgcct gggaggttgt ggaagaagga agatggccag agctttgtgt ccactgcaag 61 ccctctggct tctggagtgg gtgctgctgc tcttgggagc ttgtgctgcc cctccagcct 121 gggccttgaa cctggaccca gtgcagctca ccttctatgc aggccccaat ggcagccagt 181 ttggattttc actggacttc cacaaggaca gccatgggag agtggccatc gtggtgggcg 241 ccccgcggac cctgggcccc agccaggagg agacgggcgg cgtgttcctg tgcccctgga 301 gggccgaggg cggccagtgc ccctcgctgc tctttgacct ccgtgatgag acccgaaatg 361 taggctccca aactttacaa accttcaagg cccgccaagg actgggggcg tcggtcgtca 421 gctggagcga cgtcattgtg gcctgcgccc cctggcagca ctggaacgtc ctagaaaaga 481 ctgaggaggc tgagaagacg cccgtaggta gctgcttttt ggctcagcca gagagcggcc 541 gccgcgccga gtactccccc tgtcgcggga acaccctgag ccgcatttac gtggaaaatg 601 attttagctg ggacaagcgt tactgtgaag cgggcttcag ctcggtggtc actcaggccg 661 gagagctggt gcttggggct cctggcggct attatttctt aggtctcctg gcccaggctc 721 cagttgcgga tattttctcg agttaccgcc caggcatcct tttgtggcac gtgtcctccc 781 agagcctctc ctttgactcc agcaacccag agtacttcga cggctactgg gggtactcgg 841 tggccgtggg cgagttcgac ggggatctca acactacaga atatgtcgtc ggtgccccca 901 cttggagctg gaccctggga gcggtggaaa ttttggattc ctactaccag aggctgcatc 961 ggctgcgcgc agagcagatg gcgtcgtatt ttgggcattc agtcgctgtc actgacgtca 1021 acggggatgg gaggcatgat ctgctggtgg gcgctccact gtatatggac agccgggcag 1081 accgaaaact ggccgaagtg gggcgtgtgt atttgttcct gcagccgcga ggcccccacg 1141 cgctgggtgc ccccagcctc ctgctgactg gcacacagct ctatgggcga ttcggctctg 1201 ccatcgcacc cctgggcgac ctcgaccggg atggctacaa tgacattgca gtggctgccc 1261 cctacggggg tcccagtggc cggggccaag tgctggtgtt cctgggtcag agtgaggggc 1321 tgaggtcacg tccctcccag gtcctggaca gccccttccc cacaggctct gcctttggct 1381 tctcccttcg aggtgccgta gacatcgatg acaacggata cccagacctg atcgtgggag 1441 cttacggggc caaccaggtg gctgtgtaca gagctcagcc agtggtgaag gcctctgtcc 1501 agctactggt gcaagattca ctgaatcctg ctgtgaagag ctgtgtccta cctcagacca 1561 agacacccgt gagctgcttc aacatccaga tgtgtgttgg agccactggg cacaacattc 1621 ctcagaagct atccctaaat gccgagctgc agctggaccg gcagaagccc cgccagggcc 1681 ggcgggtgct gctgctgggc tctcaacagg caggcaccac cctggacctg gatctgggcg 1741 gaaagcacag ccccatctgc cacaccacca tggccttcct tcgagatgag gcagacttcc 1801 gggacaagct gagccccatt gtgctcagcc tcaatgtgtc cctaccgccc acggaggctg 1861 gaatggcccc tgctgtcgtg ctgcatggag acacccatgt gcaggagcag acacgaatcg 1921 tcctggactg tggggaagat gacgtatgtg tgccccagct tcagctcact gccagcgtga 1981 cgggctcccc gctcctagtt ggggcagata atgtcctgga gctgcagatg gacgcagcca 2041 acgagggcga gggggcctat gaagcagagc tggcggtgca cctgccccag ggcgcccact 2101 acatgcgggc cctaagcaat gtcgagggct ttgagagact catctgtaat cagaagaagg 2161 agaatgagac cagggtggtg ctgtgtgagc tgggcaaccc catgaagaag aacgcccaga 2221 taggaatcgc gatgttggtg agcgtgggga atctggaaga ggctggggag tctgtgtcct 2281 tccagctgca gatacggagc aagaacagcc agaatccaaa cagcaagatt gtgctgctgg 2341 acgtgccggt ccgggcagag gcccaagtgg agctgcgagg gaactccttt ccagcctccc 2401 tggtggtggc agcagaagaa ggtgagaggg agcagaacag cttggacagc tggggaccca 2461 aagtggagca cacctatgag ctccacaaca atggccctgg gactgtgaat ggtcttcacc 2521 tcagcatcca ccttccggga cagtcccagc cctccgacct gctctacatc ctggatatac 2581 agccccaggg gggccttcag tgcttcccac agcctcctgt caaccctctc aaggtggact 2641 gggggctgcc catccccagc ccctccccca ttcacccggc ccatcacaag cgggatcgca 2701 gacagatctt cctgccagag cccgagcagc cctcgaggct tcaggatcca gttctcgtaa 2761 gctgcgactc ggcgccctgt actgtggtgc agtgtgacct gcaggagatg gcgcgcgggc 2821 agcgggccat ggtcacggtg ctggccttcc tgtggctgcc cagcctctac cagaggcctc 2881 tggatcagtt tgtgctgcag tcgcacgcat ggttcaacgt gtcctccctc ccctatgcgg 2941 tgcccccgct cagcctgccc cgaggggaag ctcaggtgtg gacacagctg ctccgggcct 3001 tggaggagag ggccattcca atctggtggg tgctggtggg tgtgctgggt ggcctgctgc 3061 tgctcaccat cctggtcctg gccatgtgga aggtcggctt cttcaagcgg aaccggcaca 3121 ccctggaaga agatgatgaa gagggggagt gatggtgcag cctacactat tctagcagga 3181 gggttgggcg tgctacctgc accgcccctt ctccaacaag ttgcctccaa gctttgggtt 3241 ggagctgttc cattgggtcc tcttggtgtc gtttccctcc caacagagct gggctacccc 3301 ccctcctgct gcctaataaa gagactgagc cct // LOCUS HUMGPIIIAA 4487 bp ss-mRNA PRI 14-JUL-1990 DEFINITION Human platelet glycoprotein IIIa (GPIIIa) mRNA, complete cds. ACCESSION M35999 KEYWORDS platelet glycoprotein IIIa. SOURCE Human megakaryocytes, cDNA to mRNA, clone IIIa[1,2,3]. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 4487) AUTHORS Frachet,P., Uzan,G., Thevenon,D., Denarier,E., Prandini,M.H. and Marguerie,G. TITLE GPIIb and GPIIIa amino acid sequences deduced from human megakaryocyte cDNAs JOURNAL Mol. Biol. Rep. 14, 27-33 (1990) STANDARD simple staff_entry FEATURES from to/span description pept 19 2385 plate glycoprotein IIIa (GPIIIa) mRNA < 1 2603 GPIIIa mRNA BASE COUNT 1046 a 1090 c 1186 g 1165 t ORIGIN 1 ccgcgggagg cggacgagat gcgagcgcgg ccgcggcccc ggccgctctg ggtgactgtg 61 ctggcgctgg gggcgctggc gggcgttggc gtaggagggc ccaacatctg taccacgcga 121 ggtgtgagct cctgccagca gtgcctggct gtgagcccca tgtgtgcctg gtgctctgat 181 gaggccctgc ctctgggctc acctcgctgt gacctgaagg agaatctgct gaaggataac 241 tgtgccccag aatccatcga gttcccagtg agtgaggccc gagtactaga ggacaggccc 301 ctcagcgaca agggctctgg agacagctcc caggtcactc aagtcagtcc ccagaggatt 361 gcactccggc tccggccaga tgattcgaag aatttctcca tccaagtgcg gcaggtggag 421 gattaccctg tggacatcta ctacttgatg gacctgtctt actccatgaa ggatgatctg 481 tggagcatcc agaacctggg taccaagctg gccacccaga tgcgaaagct caccagtaac 541 ctgcggattg gcttcggggc atttgtggac aagcctgtgt caccatacat gtatatctcc 601 ccaccagagg ccctcgaaaa cccctgctat gatatgaaga ccacctgctt gcccatgttt 661 ggctacaaac acgtgctgac gctaactgac caggtgaccc gcttcaatga ggaagtgaag 721 aagcagagtg tgtcacggaa ccgagatgcc ccagagggtg gctttgatgc catcatgcag 781 gctacagtct gtgatgaaaa gattggctgg aggaatgatg catcccactt gctggtgttt 841 accactgatg ccaagactca tatagcattg gacggaaggc tggcaggcat tgtccagcct 901 aatgacgggc agtgtcatgt tggtagtgac aatcattact ctgcctccac taccatggat 961 tatccctctt tggggctgat gactgagaag ctatcccaga aaaacatcaa tttgatcttt 1021 gcagtgactg aaaatgtagt caatctctat cagaactata gtgagctcat cccagggacc 1081 acagttgggg ttctgtccat ggattccagc aatgtcctcc agctcattgt tgatgcttat 1141 gggaaaatcc gttctaaagt cgagctggaa gtgcgtgacc tccctgaaga gttgtctcta 1201 tccttcaatg ccacctgcct caacaatgag gtcatccctg gcctcaagtc ttgtatggga 1261 ctcaagattg gagacacggt gagcttcagc attgaggcca aggtgcgagg ctgtccccag 1321 gagaaggaga agtcctttac cataaagccc gtgggcttca aggacagcct gatcgtccag 1381 gtcacctttg attgtgactg tgcctgccag gcccaagctg aacctaatag ccatcgctgc 1441 aacaatggca atgggacctt tgagtgtggg gtatgccgtt gtgggcctgg ctggctggga 1501 tcccagtgtg agtgctcaga ggaggactat cgcccttccc agcaggacga gtgcagcccc 1561 cgggagggtc agcccgtctg cagccagcgg ggcgagtgcc tctgtggtca atgtgtctgc 1621 cacagcagtg actttggcaa gatcacgggc aagtactgcg agtgtgacga cttctcctgt 1681 gtccgctaca agggggagat gtgctcaggc catggccagt gcagctgtgg ggactgcctg 1741 tgtgactccg actggaccgg ctactactgc aactgtacca cgcgtactga cacctgcatg 1801 tccagcaatg ggctgctgtg cagcggccgc ggcaagtgtg aatgtggcag ctgtgtctgt 1861 atccagccgg gctcctatgg ggacacctgt gagaagtgcc ccacctgccc agatgcctgc 1921 acctttaaga aagaatgtgt ggagtgtaag aagtttgacc ggggagccct acatgacgaa 1981 aatacctgca accgttactg ccgtgacgag attgagtcag tgaaagagct taaggacact 2041 ggcaaggatg cagtgaattg tacctataag aatgaggatg actgtgtcgt cagattccag 2101 tactatgaag attctagtgg aaagtccatc ctgtatgtgg tagaagagcc agagtgtccc 2161 aagggccctg acatcctggt ggtcctgctc tcagtgatgg gggccattct gctcattggc 2221 cttgccgccc tgctcatctg gaaactcctc atcaccatcc acgaccgaaa agaattcgct 2281 aaatttgagg aagaacgcgc cagagcaaaa tgggacacag ccaacaaccc actgtataaa 2341 gaggccacgt ctaccttcac caatatcacg taccggggca cttaatgata agcagtcatc 2401 ctcagatcat tatcagcctg tgccaggatt gcaggagtcc ctgccatcat gtttacagag 2461 gacagtattt gtggggaggg atttcggggc tcagagtggg gtaggttggg agaatgtcag 2521 tatgtggaag tgtgggtctg tgtgtgtgta tgtgggggtc tgtgtgttta tgtgtgtgtg 2581 ttgtgtgtgg gagtgtgtaa tttaaaattg tgatgtgtcc tgataagctg agctccttag 2641 cctttgtccc agaatgcctc ctgcagggat tcttcctgct tagcttgagg gtgactatgg 2701 agctgagcag gtgttcttca ttacctcagt gagaagccag ctttcctcat caggccattg 2761 tccctgaaga gaagggcagg gctgaggcct ctcattccag aggaagggac accaagcctt 2821 ggctctaccc tgagttcata aatttatggt tctcaggcct gactctcagc agctatggta 2881 ggaactgctg gcttggcagc ccgggtcatc tgtacctctg cctcctttcc cctccctcag 2941 gccgaaggag gagtcaggga gagctgaact attagagctg cctgtgcctt ttgccatccc 3001 ctcaacccag ctatggttct ctcgcaaggg aagtccttgc aagctaattc tttgacctgt 3061 tgggagtgag gatgtctggg ccactcaggg gtcattcatg gcctggggga tgtaccagca 3121 tctcccagtt cataatcaca acccttcaaa gatttgcctt attggcagct ctactctgga 3181 ggtttgttta gaagaagtgt gtcaccctta ggccagcacc atctctttac ctcctaattc 3241 cacaccctca ctgctgtaga catttgctat gacctgggga tgtctctcat gaccaaatgc 3301 ttttcctcaa agggagagag tgctattgta gagccagagg tctggcccta tgcttccggc 3361 ctcctgtccc tcatccatag cacctccaca tacctggccc tgagccttgg tgtgctgtat 3421 ccatccatgg ggctgattgt atttaccttc tacctcttgg ctgccttgtg aaggaattat 3481 tcccatgagt tggctgggaa taagtgccag gatggaatga tgggtcagtt gtatcagcac 3541 gtgtggcctg ttcttctatg ggttacaacc tcatttaact cagtctttaa tctgagaggc 3601 cacagtgcaa ttttatttta tttttctcat gatgaggttt tcttaactta aaagaacatg 3661 tatataaaca tgcttgcatt atatttgtaa atttatgtgt atggcaaaga aggagagcat 3721 aggaaaccac acagacttgg gcagggtaca gacactccca cttggcatca ttcacagcaa 3781 gtcactggcc agtggctgga tctgtgaggg gctctctcat gatagaaggc tatggggata 3841 gatgtgtgga cacattggac ctttcctgag gaagagggac tgttcttttg tcccagaaaa 3901 gcagtggctc cattggtgtt gacatacatc caacattaaa agccaccccc aaatgcccaa 3961 gaaaaaaaga aagacttatc aacatttgtt ccatgagcag aaaactggag ctctggcctc 4021 agtgttacag ctaaataatc tttaattaag gcaagtcact ttcttcttct taaagctgtt 4081 tctagtttga gaaatgatgg gattttagca gccagtcttg aaggtctctt tcagtatcaa 4141 cattctaaga tgctgggact tactgtgtca tcaaatgtgc ggttaagatt ctctgggata 4201 ttgatactgt ttgtgttttt agttgggaga tctgagagac ctggctttgg caagagcaga 4261 tgtcattcca tatcaccttt ctcaatgaaa gtctcattct atcctctctc caaacccgtt 4321 ttccaacatt tgttaatagt tacgtctctc ctgatgtagc acttaagctt catttagtta 4381 ttatttcttt cttcactttg cacacatttg catccacata ttagggaagg aataagtagc 4441 tgcaaactat ctattcctgt attattgtgt taacattgag ataaacc // LOCUS MUSCDX1A 1597 bp ss-mRNA ROD 14-JUL-1990 DEFINITION Mouse Cdx-1 homeobox protein mRNA, complete cds. ACCESSION M35125 KEYWORDS Cdx-1 homeobox protein. SOURCE Mouse 8.5 day embryo, cDNA to mRNA, clone BH8. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 1597) AUTHORS Duprey,P., Chowdhury,K., Dressler,G.R., Balling,R., Simon,D., Guenet,J.-L. and Gruss,P. TITLE A mouse gene homologous to the Drosophila gene caudal is expressed in epithelial cells from the embryonic intestine JOURNAL Genes Dev. 2, 1647-1654 (1988) STANDARD simple staff_entry FEATURES from to/span description pept 158 736 Cdx-1 homeobox protein site 389 571 Cdx-1 homeobox BASE COUNT 326 a 523 c 444 g 304 t ORIGIN 1 gccgcgcccg ctgggccctc cgacctatgc gccccctggc ccggcgcccg cgcccgcagt 61 accccgactt cgcgggttac acgcacgtgg agccggcgcc cgcgccctcc gacctgggct 121 gcgcttccct gcgcccaagg acgactgggc agctgctatg gcccgggccc cacggcctca 181 gcccgcagcc cggccccgct ggccttcggg ccctccggac tttagcccgg tgcccgcgcc 241 tcccgggcct ggtcccggca tcctagcgca gtccctcggg gctccgggcg caccgtcctc 301 gccaggagcg ccaaggcgga cgccctacga atggatgcgg cgcagcgtgg cggctgcagg 361 cggctgtggc agcggtaaga cccgaaccaa ggacaagtca cgtgtggtct acacagacca 421 ccaacgccta gagctggaaa aggagtttca ctacagccgg tacatcacta tccggcgcaa 481 gtccgagctg gctgctaacc tggggctcac agagcggcag gtaaagatct ggttccagaa 541 ccgccgggcc aaggagcgca aagtaaacaa gaagaaacag cagcagcagc agcccctgcc 601 tcccacacag ctgcccctgc ccctggatgg cactcccaca ccatcagggc cacccctagg 661 aagtctatgc cctactaatg ctggccttct gggcaccccc tccccagtgc ccgtcaagga 721 ggagtttcta ccctagcccc ttgcagcctg gggtctaggg atctagggac ttgaatgctg 781 ggcagctggc cttttcgggg cccaaggacg tgttctgagt gcagccctgc cttggccctt 841 tggggtcact gtggacaaac tgcctaccta ggacaagtta gcttgccctc ttcctgcctt 901 ccgttggctg ggctgtgtgg tgagcctgtt ggaccaagga gctcttccag ctcttgtgtc 961 ctagggcaca cagggtgatg gggaagccca attagaagtc tcaatcacct ggacatttcc 1021 caaagattca gaggctcaac gagctgtcaa gacaagattg aggcttcagc gtctcctcca 1081 agggttccag agtgaggtgg gaggctggta tctggaccag actggctctg gagagtaggc 1141 ggccattgaa acaaggtggt acaccacagt ctagactccc atgtgaaggg aagatgctat 1201 ctgcccacct gtctcctctt ccagcctcac tttcacctgc cctagcaccc catgctctgg 1261 ccctccccag cctggagcta caaagccatc aggactgaca tgaggtagag gctgggcaat 1321 tggccactgc tcttgagaga gaactcgccc ccaccttagt gtccagggag gcctgcggat 1381 ggagagaagg cactatgagt ctacctacct gcacatgccc accttcccac aggctgccca 1441 tcacacccca tttagccccc acaccatggg ggactcgcag ctgtacaggg cttatgtagg 1501 agtgtaatct cttggatgca gcttcagaaa taagttttcc cccttttaaa caatttataa 1561 aaatcataca aggcattaaa gaatttttaa aaagttt // LOCUS MUSHSP70B 3518 bp ds-DNA ROD 14-JUL-1990 DEFINITION Mouse heat shock protein 70.1 (hsp70.1) gene, complete cds. ACCESSION M35021 KEYWORDS heat shock protein 70.1. SOURCE Mouse (strain AJ) kidney DNA, clone pM[1.2,2.3]. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 3518) AUTHORS Hunt,C. and Calderwood,S. TITLE Characterization and sequence of a mouse hsp70 gene and its expression in mouse cell lines JOURNAL Gene 87, 199-204 (1990) STANDARD simple staff_entry FEATURES from to/span description pept 806 2734 heat shock protein 70.1 (hsp70.1) signal 507 512 inverted CCAAT box signal 547 552 TATA box BASE COUNT 827 a 993 c 1014 g 684 t ORIGIN 1 gatctcttct atttccctat tcaaacctaa aatgaagagg gagggggaga catggacaag 61 caagcattcc acaggcgccc ctgcccaacg ctgtcactca aaccaggacc caatcacaga 121 ctttttagcc aagccttatc ccgcctctct tgagaaactt tctgcgtccg ccatcctgta 181 ggaaggattt gtacacttta aactccctcc ctggtctgag tcccacactc tcaccaccca 241 gcaccttcag gagctgaccc ttaacagctt cacccacagg gaccccgaag ttgcgtcgcc 301 tccgcaacag tgtcaatagc agcaccagca cttccccaca ccctccccct caggaatccg 361 tactctctag cgaaccccag aaacctctgg agagttctgg acaagggcgg aacccacaac 421 tccgattact caagggaggc ggggaagctc caccagacgc gaaactgctg gaagattcct 481 ggccccaagg cctcctccgg ctcgctgatt ggcccagcgg agagtgggcg gggccggtga 541 agactcctta aaggcgcagg gcggcgagca gggcaccaga cgctgacagc tactcagaat 601 caaatctggt tccatccaga gacaagcgaa gacaagagaa gcagagcgag cggcgcgttc 661 ccgatcctcg gccaggacca gccttcccca gagcatccac gccgcggagc gcaaccttcc 721 caggagcatc cctgccgcgg agcgcaactt tccccggagc atccacgccg cggagcgcag 781 ccttccagaa gcagagcgcg gcgccatggc caagaacacg gcgatcggca tcgacctggg 841 caccacctac tcgtgcgtgg gcgtgttcca gcacggcaag gtggagatca tcgccaacga 901 ccagggcaac cgcacgaccc ccagctacgt ggccttcacc gacaccgagc gcctcatcgg 961 ggacgccgcc aagaaccagg tggcgctgaa cccgcagaac accgtgttcg acgcgaagcg 1021 gctgatcggc cgcaagttcg gcgatgcggt ggtgcagtcc gacatgaagc actggccctt 1081 ccaggtggtg aacgacggcg acaagcccaa ggtgcaggtg aactacaagg gcgagagccg 1141 gtcgttcttc ccggaggaga tctcgtccat ggtgctgacg aagatgaagg agatcgctga 1201 ggcgtacctg ggccacccgg tgaccaacgc ggtgatcacg gtgcccgcct acttcaacga 1261 ctctcagcgg caggccacca aggacgcggg cgtgatcgcc ggtctaaacg tgctgcggat 1321 catcaacgag cccacggcgg ccgccatcgc ctacgggctg gaccggaccg gcaagggcga 1381 gcgcaacgtg ctcatcttcg acctgggggg cggcacgttc gacgtgtcca tcctgacgat 1441 cgacgacggc atcttcgagg tgaaggccac ggcgggcgac acgcacctgg gaggggagga 1501 cttcgacaac cggctggtga gccacttcgt ggaggagttc aagaggaagc acaagaagga 1561 catcagccag aacaagcgcg cggtgcggcg gctgcgcacg gcgtgtgaga gggccaagag 1621 gacgctgtcg tccagcaccc aggccagcct ggagatcgac tctctgttcg agggcatcga 1681 cttctacaca tccatcacgc gggcgcggtt cgaagagctg tgctcggacc tgttccgcgg 1741 cacgctggag cccgtggaga aggccctgcg cgacgccaag atggacaagg cgcagatcca 1801 cgacctggtg ctggtgggcg gctcgacgcg catccccaag gtgcagaagc tgctgcagga 1861 cttcttcaac gggcgcgacc tgaacaagag catcaacccg gacgaggcgg tggcctacgg 1921 ggcggcggtg caggcggcca tcctgatggg ggacaagtcg gagaacgtgc aggacctgct 1981 gctgctggac gtggcgccgc tgtcgctggg cctggagact gcgggcggcg tgatgacggc 2041 gctcatcaag cgcaactcca ccatccccac caagcagacg cagaccttca ccacctactc 2101 ggacaaccag cccggggtgc tgatccaggt gtacgagggc gagagggcca tgacgcgcga 2161 caacaacctg ctggggcgct tcgagctgag cggcatcccg ccggcgccca ggggcgtgcc 2221 gcagatcgag gtgaccttcg acatcgacgc caacggcatc ctgaacgtca cggccaccga 2281 caagagcacc ggcaaggcca acaagatcac catcaccaac gacaagggcc gcctgagcaa 2341 ggaggagatc gagcgcatgg tgcaggaggc cgagcgctac aaggccgagg acgaggtgca 2401 gcgcgacagg gtggccgcca agaacgcgct cgagtcctat gccttcaaca tgaagagcgc 2461 cgtggaggac gagggtctca agggcaagct cagcgaggct gacaagaaga aggtgctgga 2521 caagtgccag gaggtcatct cctggctgga ctccaacacg ctggccgaca aggaggagtt 2581 cgtgcacaag cgggaggagc tggagcgggt gtgcagcccc atcatcagtg ggctgtacca 2641 gggtgcgggt gctcctgggg ctgggggctt cggggcccag gcgccgccga aaggagcctc 2701 tggctcagga cccaccatcg aggaggtgga ttagaggcct ctgctggctc tcccggtgtg 2761 gtctagaaaa cagactcttt gcacttgata gctgcttggg caccgattac tgtcaaggtt 2821 atttaaagtc ttcttcatgg ttcagtttaa agttacagtc tttcttaagg taattgcgtt 2881 gactgttaaa ttttgtatgc atatatatat atatatatat atatatatat atattcaaat 2941 atattcaaag taatgttggg agcagcactg tgcactgtac caggggatta tgttttatag 3001 ctaatgatgt gtaaagtcta aagatttttt tgtaattttt atatcagtgt tccagtagcc 3061 tgggaagaca tatagtctag ctgcccagtt ccctggagat ggtcatctct aagacaaagt 3121 gtcttaaaca aacgtcttgg cactgtgtac tacataactt tactcttttg tacttaaaac 3181 tttatctgct tgtccatgtt aaggttttgt ggtataacca gtatgttctt tgcatttaat 3241 ctaagtaggt taaagatggt gtatccttcc tgcatacatg tctacactgc caccctgtgt 3301 acattttttt ctttgcatca ctacaaacta atgaaaaaaa cttttatgac ttaaatattc 3361 aaaataaaag gttacaagta tattttgtct gtttgtatgt tggaagggct aatggattct 3421 gggcttctgt ggatttctta agtttttttt aagatttatt attatatgtg aacacattgt 3481 agctatcttc agacacacca gaaaagggca tcagatct // LOCUS RDVS10 1319 bp ds-RNA VRL 14-JUL-1990 DEFINITION Rice dwarf virus segment S10 mRNA, encoding an ORF, complete cds. ACCESSION D00241 KEYWORDS S10 RNA. SOURCE Rice dwarf virus, cDNA to viral RNA. ORGANISM Rice dwarf virus Viridae; ds-RNA nonenveloped viruses; Reoviridae; Fijivirus. REFERENCE 1 (bases 1 to 1319) AUTHORS Omura,T., Minobe,Y. and Tsuchizaki,T. TITLE Nucleotide sequence of segment S10 of the rice dwarf virus genome JOURNAL J. Gen. Virol. 69, 227-231 (1988) STANDARD simple automatic COMMENT Three contiguous termination codons were detected at the third triplet downstream from the first stop codon. Corresponds to DDBJ (rel3.0) entry (D00241) dated 20-JUN-1988 FEATURES from to/span description pept 27 1088 ORF BASE COUNT 354 a 279 c 284 g 402 t ORIGIN 5' end of the plus-sense strand of segment S10. 1 ggtaaacttg cgcctttctg acgaacatgg aagtagacac tgctacgttt gttcggcttc 61 atcatgagct cctttgtgct cacgaaggac caagtattat ttccaagttt gatgcaatta 121 aaaaagtcaa gcttggtaca cttgctaatc aatctggtgg tgctaataac atcaccgaag 181 ctttcttggc taagcttcga aattttgaaa gaaagtccga agcctatttg gcatcagact 241 tagctgaacg tgaattgact agggatacgc acaaggccat tgtgtttgta accaagtctg 301 tattgttagg agggaaaagt ttgaaagact tgctacctta tggagtgata gtgtgtgctt 361 ttatctttat ccctgagact gcttctgtct tggacaacgt ccctgtcatg atcgggaacc 421 aaaaacgacc attgacagta gctttgataa aatacatcgc taaatctttg aattgcgatt 481 tagttggtga ctcatatgat actttttatt actgcaattc gtctgcttat ggtaagaatc 541 tgatttcagt atcagacaat gatttttcta atccccaaag ggctctcctc tcagtgggag 601 atttatgtta tcaagcagcg cgttcactcc acgttgcagc agctaattat ataaggatat 661 ttgaccgcat gcctcctggc tttcagccgt caaaacacct tttccgtatc ataggtgtac 721 tagatatgga aactttgaag actatggtga catcaaacat tgctcgtgag ccaggcatgt 781 tttgtcatga taatgttaaa gatgtactgc accgtatagg tgtttactct ccgaaccacc 841 acttttccgc agtcatcttg tggagggggt gggcttccac gtatgcatat atgtttaacc 901 aagaacaatt aaatatgcta tcaggaacat ctggtttagc tggagacttt ggcaagtaca 961 aattaacata tggatccact ttcgatgaag gtgttattca tgtgcaatat caatttgtta 1021 ctccagaagt cgtccgcaag cgaaatatat acccggatct gtctgccctt aaaggcggca 1081 gttcctgatc gcactgatga tagtgctttt catttatcaa aatattacgg tggggtcatc 1141 accatcgtgt aatacgtcga cggattgggt gaattatccg tcttgcaact gctgaacaac 1201 ccaaccagtt gatccccatg ttcaaaaact gcccgagctt cccataggtg ggttagagtc 1261 gtgtgcgcac gcacttagtt ctactaatgc gtgcccccac actgcgcagg gattctgat // LOCUS RDVS10A 1320 bp ds-RNA VRL 14-JUL-1990 DEFINITION Rice dwarf virus segment 10 mRNA, encoding an ORF, complete cds. ACCESSION M35118 KEYWORDS . SOURCE Rice dwarf virus, cDNA to viral RNA, clones pRD[545,112,367]. ORGANISM Rice dwarf virus Viridae; ds-RNA nonenveloped viruses; Reoviridae; Fijivirus. REFERENCE 1 (bases 1 to 1320) AUTHORS Uyeda,I., Matsumura,T., Sano,T., Ohshima,K. and Shikata,E. TITLE Nucleotide sequence of rice dwarf virus genome segment 10 JOURNAL Proc. Jpn. Acad. 63, 227-230 (1987) STANDARD simple staff_entry FEATURES from to/span description pept 27 1088 ORF BASE COUNT 357 a 280 c 282 g 401 t ORIGIN 1 ggtaaacttg cgcctttctg acgaacatgg aagtagacac tgctacgttt gttcggcttc 61 atcatgagct cctttctgct cacgaaggac caagtattat ttccaagttt gatgcaatta 121 aaaaagtcaa gcttggtaca cttgctaatc aatctggtgg tgctaataac atcaccgaag 181 ctttcttggc taagcttcga aattttgaaa gaaagtccga agcctatttg gcatcagact 241 tagctgaacg tgaattgact agggatacgc acaaggccat tgtgtttgta accaagtctg 301 tattgttagg agggaaaagt ttgaaagact tgctacctta tggagtgata gtgtgtgctt 361 ttatctttat ccctgaaact gcttctgtct tggacaacgt ccctgtcatg atcgggaacc 421 aaaaacgacc attgacagta gctttgataa aatacatcgc taaatctttg aattgcgatt 481 tagttggtga ctcatatgat actttttatt actgcaattc gtctgcttat ggtaagaatc 541 tgatttcagt atcagacaat gatttttcta atccccaaag ggctctcctc tcagtgggag 601 atttatgtta tcaagcagcg cgttcactcc acgttgcagc agctaattat ataaggatat 661 ttgaccgcat gcctcctggc tttcagccgt caaaacacct tttccgtatc ataggtgtac 721 tagatatgga aactttgaag actatggtga catcaaacat tgctcgtgag ccaggcatgt 781 tttgtcatga taatgttaaa gatgtactgc accgtatagg tgtttactct ccgaaccacc 841 acttttccgc agtcatcttg tggagggggt gggcttccac gtatgcatat atgtttaacc 901 aagaacaatt aaatatgcta tcaggaacat ctggtttagc tggagacttt ggcaagtaca 961 aattaacata tggatccact ttcgatgaag gtgttattca tgtgcaatat caatttgtta 1021 ctccagaagt cgtccgcaag cgaaatatat acccggatct gtctgccctt aaaggcggca 1081 gttcctaatc gcactgatga tagtgcttat catttatcaa aatattacgg tggggtcatc 1141 accatcgtgt aatacgtcga cggattgggt gaattatccg tcttgcaact gctgaacaac 1201 ccaaccagtt gatccccatg ttcaaaaact gcccgagctt cccatagggt gggttagagt 1261 cgtgtgcgca cgccacttag ttctactaat gcgtgccccc acatgcgcag ggattctgat // LOCUS SRVAGSM 3651 bp ss-RNA VRL 14-JUL-1990 DEFINITION Sapporo rat virus mRNA M segment, encoding an ORF, complete cds. ACCESSION M34882 KEYWORDS . SOURCE Sapporo rat virus (strain SR-11), cDNA to viral RNA, passed in Vero E6 cell line C1008. ORGANISM Sapporo rat virus Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses; Bunyaviridae; Hantavirus. REFERENCE 1 (bases 1 to 3651) AUTHORS Arikawa,J., Lapenotiere,H.F., Iacono-Connors,L., Wang,M. and Schmaljohn,C.S. TITLE Coding properties of the S and the M genome segments of Sapporo rat virus: Comparison to other causative agents of hemorrhagic fever with renal syndrome JOURNAL Virology 176, 114-125 (1990) STANDARD simple staff_entry FEATURES from to/span description pept 47 3448 M segment ORF BASE COUNT 1119 a 692 c 755 g 1085 t ORIGIN 1 tagtagtaga ctccgcaaga aacagcagtt aaagaacaat aggatcatgt ggagtttgct 61 attactggcc gctttagttg gccaaggctt tgcattaaaa aatgtatttg acatgagaat 121 tcagttgccc cactcagtca actttgggga aacaagtgtg tcaggctata cagaatttcc 181 cccactctca ttacaggagg cagaacagct agtgccagag agctcatgca acatggacaa 241 ccaccagtca ctctcaacaa taaataaatt aaccaaggtc atatggcgga aaaaagcaaa 301 tcaggaatca gcaaaccaga attcatttga agttgtggaa agtgaagtca gctttaaagg 361 gttgtgtatg ttaaagcata gaatggttga agaatcatat agaaatagga gatcagtaat 421 ctgttatgat ctagcctgta atagtacatt ctgtaaacca actgtttata tgattgttcc 481 tatacatgct tgcaacatga tgaaaagctg tttgattggc cttggcccct acagaatcca 541 ggttgtctat gaaaggacat actgcactac gggtatattg acagaaggaa aatgctttgt 601 ccctgacaag gctgttgtca gtgcattgaa aagaggcatg tatgctatag caagcataga 661 gacaatctgc ttttttattc atcagaaagg gaatacatat aagatagtga ctgccattac 721 atcagcaatg ggctccaaat gtaataatac agatactaaa gttcaaggat attatatctg 781 tattattggt ggaaactccg cccctgtata tgcccctgct ggtgaagact tcagagcaat 841 ggaggttttt tctgggatta ttacatcacc acatggagaa gaccatgacc tacccggcga 901 agaaatcgca acgtaccaga tttcagggca gatagaggca aaaatccctc atacagtgag 961 ctccaaaaac ttaaaattga ctgcttttgc aggtattcca tcatactcat caactagtat 1021 attggctgct tcagaagatg gtcgtttcat atttagtcct ggtttatttc ctaacctaaa 1081 tcagtcagtc tgtgacaaca atgcactccc tttaatctgg aggggcctaa ttgatttaac 1141 gggatactat gaggcagtcc acccttgcaa tgtgttctgt gtcttatcag gaccaggtgc 1201 ttcatgtgag gccttttcag aaggaggtat tttcaatatt acttctccaa tgtgtctggt 1261 gtctaagcaa aatagattta gagcagctga gcagcagatt agctttgtct gccaaagagt 1321 tgatatggat attatagtgt actgtaatgg tcagaaaaaa acaatcctaa caaaaacatt 1381 agttataggc caatgtattt atactattac aagtctcttt tcactgttac caggggttgc 1441 ccattctatt gctattgagt tgtgtgttcc agggtttcat ggctgggcca cagctgcact 1501 tttgattaca ttctgcttcg gctgggtatt gattcctgca tgtacattag ctattctttt 1561 agtccttaag ttctttgcaa atatccttca tacaagcaat caagagaacc gattcaaagc 1621 cattctacgg aaaataaagg aggagtttga aaaaacaaag ggttccatgg tttgtgagat 1681 ctgtaagtat gagtgtgaaa cattaaagga attgaaggca cataacctat catgtgttca 1741 aggagagtgc ccatattgct ttacccactg tgaaccgaca gaaactgcaa ttcaggcaca 1801 ttacaaagtt tgtcaagcca cccaccgatt cagagaagat ttaaaaaaga ctgtaactcc 1861 tcaaaatatt gggccaggct gttaccgaac actaaatctt tttaggtata aaagtaggtg 1921 ttatattctg acaatgtgga ctcttcttct cattattgaa tccatcctct gggcagcaag 1981 tgcagcagaa atcccccttg tccctctctg gacagataat gctcatggcg ttgggagtgt 2041 tcctatgcat acggatcttg aattagactt ctctttgcca tccagttcta agtacacata 2101 caaaagacat ctcacaaacc cagttaatga ccaacagagt gtctcattgc atatagaaat 2161 tgaaagtcaa ggcattggtg ctgctgttca tcatcttgga cattggtatg atgcaagatt 2221 gaatctaaaa acctcatttc attgttatgg tgcctgcaca aaatatcaat acccatggca 2281 cactgcaaaa tgccattttg agaaagatta tgagtatgaa aatagctggg cttgcaaccc 2341 cccagattgc ccaggggttg gtacaggttg tactgcttgt ggattatatc tagatcaatt 2401 gaagccggta ggaacagcct ttaaaattat aagtgtaaga tacagtagaa aagtgtgcgt 2461 gcagtttggt gaagaacacc tttgtaaaac aattgatatg aatgattgct ttgtgactag 2521 gcatgccaaa atatgtataa ttgggactgt atctaagttt tctcaaggtg acactctact 2581 atttctgggg cccatggaag gaggtggtat aatctttaaa cactggtgta catctacctg 2641 tcactttgga gaccctggtg atgtcatggg tccaaaagat aaaccattta tttgccctga 2701 attcccaggg caatttagga aaaaatgtaa ctttgccaca actccagttt gtgaatatga 2761 tggaaacatt atatcaggct ataagaaagt acttgcaaca attgattctt tccaatcatt 2821 taacacaagc aatatacact tcactgatga gagaattgaa tggagagacc ctgatggcat 2881 gcttcgggat catattaata ttgttatttc taaagatatt gattttgaaa atttggctga 2941 gaatccttgt aaagtagggc tccaggcagc aaacatagaa ggtgcctggg gttcaggtgt 3001 cgggtttaca ctcacatgca aggtgtctct cacagaatgc ccaacatttc ttacatcaat 3061 aaaggcctgt gacatggcaa tttgttatgg tgcagaaagt gtgacactct cacgaggaca 3121 aaatactgtc aaaattaccg ggaaaggtgg ccatagtggt tcttcattca aatgctgtca 3181 tgggaaagaa tgttcatcaa ctggcctcca agccagtgca ccacatctgg ataaggtaaa 3241 tggtatctct gagttagaaa acgagaaagt ttatgatgac ggtgcacctg aatgtggcat 3301 tacttgttgg tttaaaaaat caggtgaatg ggttatgggt ataatcaatg ggaactgggt 3361 tgtcctaatt gtcttgtgtg tactgctgct cttttctctt atcctgttga gcatcttgtg 3421 tcctgttaga aagcataaaa aatcataaat cccacctaac aatcttcaca tcatgtatcg 3481 attttcaaac actttatcat ttagaactta acttggcact actatctgat aactgacttt 3541 catttttatt tttatatgga ttaattacta aaaaaaatac tctcttctat ctcccaatct 3601 tttattgatt caccggggtg ctgtcttgac atctggcggc gtctactact a // LOCUS SRVAGSS 1769 bp ss-RNA VRL 14-JUL-1990 DEFINITION Sapporo rat virus mRNA S segment, encoding an ORF, complete cds. ACCESSION M34881 KEYWORDS . SOURCE Sapporo rat virus (strain SR-11), cDNA to viral RNA, passed in Vero E6 cell line C1008. ORGANISM Sapporo rat virus Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses; Bunyaviridae; Hantavirus. REFERENCE 1 (bases 1 to 1769) AUTHORS Arikawa,J., Lapenotiere,H.F., Iacono-Connors,L., Wang,M. and Schmaljohn,C.S. TITLE Coding properties of the S and the M genome segments of Sapporo rat virus: Comparison to other causative agents of hemorrhagic fever with renal syndrome JOURNAL Virology 176, 114-125 (1990) STANDARD simple staff_entry FEATURES from to/span description pept 43 1332 S segment ORF BASE COUNT 569 a 331 c 410 g 459 t ORIGIN 1 tagtagtaga ctccctaaag agctactaca ctaacaagaa aaatggcaac tatggaagaa 61 atccagagag aaatcagtgc tcacgagggg cagcttgtga tagcacgcca gaaggtcaag 121 gatgcagaaa agcagtatga gaaggatcct gatgacttaa acaagagggc actgcatgat 181 cgggagagtg tcgcagcttc aatacaatca aaaattgatg aattgaagcg ccaacttgcc 241 gacagattgc agcagggaag aacatccggg caggaccggg atcctacagg ggtagagcca 301 ggtgatcatc ttaaggaaag atcagcacta agctacggga atacactgga cctgaatagt 361 cttgacattg atgaacctac aggacagaca gctgattggc tgaccataat tgtctatctg 421 acatcattcg tggtcccgat catcttgaag gcactgtaca tgttaacaac acgaggtagg 481 cagacttcaa aggacaacaa ggggatgagg atcagattca aggatgacag ctcatatgag 541 gatgtcaatg gaatcagaaa gcccaaacat ctgtatgtgt caatgccaaa cgcccaatcc 601 agcatgaagg ctgaagagat aacaccagga agattccgca ctgcagtatg tggactatat 661 cctgcacaga taaaggcaag gaatatggta agccctgtca tgagtgtagt tgggttcttg 721 gcactggcaa aagactggac atcgagaatt gaagaatggc tcggtgcacc ctgcaaattc 781 atggcggagt ctcttattgc cgggagttta tctgggaatc ctgtgaatcg tgactatatc 841 agacagagac aaggtgcact tgcagggatg gagccaaagg aatttcaagc cctcaggcaa 901 cattcaaagg atgctggatg tacactagtt gaacatattg agtcaccatc atcaatatgg 961 gtgtttgctg gggcccctga taggtgtcca ccaacatgct tgtttgtcgg agggatggct 1021 gaattaggtg ccttcttttc tatacttcag gatatgagga acacaatcat ggcttcaaaa 1081 actgtgggca cagctgatga aaagcttcga aagaaatcat cattctatca atcatacctc 1141 agacgcacac aatcaatggg aatacaactg gaccagagga taattgttat gtttatggtt 1201 gcctggggaa aggaggcagt ggacaacttt catctcggtg atgacatgga tccagagctt 1261 cgtagcctgg ctcagatctt gattgaccag aaagtgaagg aaatctcaaa ccaggaacct 1321 atgaaattat aagtacataa atatataatc aatactaact ataggttaag aaatactaat 1381 cattagttaa taagaatata gatttattga ataatcatat taaataatta ggtaagttaa 1441 ctagtattta gttaagttag ctaattgatt tatatgattg tcacaattaa atgtaatcat 1501 aagcacaatc actgccatgt ataatcacgg gtatacgggt ggttttcata tggggaacag 1561 ggtgggctta gggccaggtc accttaagtg accttttttt gtatatatgg atgtagattt 1621 caattgatcg aatactaatc ctactgtcct cttttctttt cctttctcct tctttactaa 1681 caacaacaaa ctacctcaca ccttaatata tactacttta ttaagttgtt aagttgtgtc 1741 tttttgggga gtaagggagt ctactacta // LOCUS TRNSTIIA 1315 bp ds-DNA BCT 14-JUL-1990 DEFINITION Transposon Tn4521 (from E.coli) heat-stable toxin II gene, complete cds. ACCESSION M35123 KEYWORDS heat-stable toxin II; insertion sequence; transposase. SOURCE Transposon Tn4521 (from Escherichia coli) DNA. ORGANISM Transposon Tn4521 Prokaryota; Bacteria. REFERENCE 1 (bases 1 to 1315) AUTHORS Hu,S.T. and Lee,C.H. TITLE Characterization of the transposon carrying the STII gene of enterotoxigenic Escherichia coli JOURNAL Mol. Gen. Genet. 214, 490-495 (1988) STANDARD simple staff_entry FEATURES from to/span description pept 879 400 (c) heat-stable toxin II (STII) trns 88 607 insertion sequence IS2 (bases 17-536) trns 608 970 insertion sequence IS2 (bases 969-1327) BASE COUNT 313 a 349 c 320 g 333 t ORIGIN 1 gccgccggca gatggagctg gatattctaa aaaagacccg ggcatcagta tcagtcacct 61 gaacaacaga gaaaaaacaa agcttgcttt ccagacacct gttatcactt aacccattac 121 tggcccgctg ccgtagatat tcccgtggcg agcgataacc cagtgcacta tgcggatgcc 181 attcgttata atgctcgaac gcctctgcaa ggttctttgc tgccgttaac ccgtctggtt 241 tgggcatgat actgatgtaa tcacgcttta tcgttttcac gaagctctct gctatgccgt 301 tactctccgg actccgcacc gccgtgttct tcggttcaag tcccaacatc cgggcgaact 361 ggcgtgtttc attagcccgg tagcatgaac cattatccgt cagccattcc actggcgata 421 tcggaagctc gctgccgaag gggcgtacca ccgctcccag catgacgtcc tgtactgttt 481 cactgtcgaa gccgcccgta gtaaccgccc agtgcagtgc ctcacggtca cagcagtcca 541 gcgcgaacgt gactcgcagt ttttctccgt tatcacagcg gaactcgaac ccgtcagagc 601 accatcgttt tttttgccga gcaggcgctg gagttcttta atctgcttca tggcagcagc 661 aagttcagag gcaggaacga cctgttctcc tgcggccaca gcagtaagac ttcccttctg 721 gtattgcttg cgccagagaa ataactggct ggctgccgca ccgtgttgcc gggtaacaag 781 ggagcccgtc attcccggtt caaagctctg ctgaacaata gcgatctttt cctgtgtggt 841 acgccgtctg cgtttctccg gccctaagac atcactcatc tgctctccaa tgactagtct 901 aaaaactagt attaagacta tcacttaaat aagtgatatt ggttgtctgg agattcaggg 961 ggccagtcta cggcctatgt cggtctggtg tcaaaacagt caggctccgg agataaagtt 1021 cgactactgg gaataagcaa gtgtggtgat acttatctcc gaacattatt tattcatgga 1081 gcaagggcgg cagcattgct ggtcaaagaa ctagggccat ggatacacaa cttaagatac 1141 aacgactgtc tggtgtttca gggggccagc tcaaacattt ggcaaagata tacgtcttct 1201 ttacaatccg gaggtgttaa cggctctact tgtatcagct acgttctggt tatacgtaat 1261 agcataagaa tggtcgtgcg catacacgct ctggaaagca gaactagctg gacga // LOCUS AVOCYP 1500 bp ss-mRNA PLN 14-JUL-1990 DEFINITION Avocado cytochrome P-450LXXIA1 (cyp71A1) mRNA, complete cds. ACCESSION M32885 KEYWORDS cytochrome P-450; cytochrome P-450LXXIA1. SOURCE Avocado (strain Hass) ripe fruit mesocarp, cDNA to mRNA, clone pAVOd8 and pAVOc8, and DNA (bases 1-16). ORGANISM Persea americana Eukaryota; Plantae; Embryobionta; Magnoliophyta; Magnoliopsida; Magnoliidae; Laurales; Lauraceae. REFERENCE 1 (bases 1 to 1500) AUTHORS Bozak,K.R., Yu,H., Sirevag,R. and Christoffersen,R.E. TITLE Sequence analysis of ripening-related cytochrome P-450 cDNAs from avocado fruit JOURNAL Proc. Natl. Acad. Sci. U.S.A. 87, 3904-3908 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by R.E.Christoffersen, 16-MAR-1990, for release after publication. FEATURES from to/span description pept 21 1436 cytochrome P-450LXXIA1 (cyp71A1) BASE COUNT 412 a 343 c 354 g 391 t ORIGIN 1 tcactctaga gtaataatcc atggctatct tagtgtctct gctcttctta gccattgctc 61 tcaccttctt cctcctaaaa ctcaacgaaa aacgagagaa gaaaccgaac ctaccccctt 121 ctcctcccaa ccttcccatc atcggaaacc tccatcagct cggtaatctc ccacaccgct 181 ctcttcgctc ccttgcaaac gaactcggac ccttgatact cctccatctg ggtcacatcc 241 ccactctcat agtctccacc gctgagattg ccgaagagat cttgaaaacc catgatctca 301 tctttgctag ccgaccatcc acaactgcag ctcgacgaat cttctatgac tgcaccgacg 361 tggcgttctc tccctacggc gaatattgga ggcaagtaag gaagatctgt gtactcgagc 421 tccttagcat aaagagagtc aactcctacc gttcgatcag ggaagaagag gtgggcctca 481 tgatggagag gatctctcaa tcttgctcga caggtgaagc tgttaatcta tcagagctgt 541 tgctattgct atcaagcggc acaataacaa gggttgcttt tgggaagaag tacgaaggag 601 aagaagaaag gaagaacaag tttgcggatc ttgcaactga gttgacaact cttatgggag 661 ctttcttcgt gggagactac tttccttcgt ttgcatgggt tgatgttcta acagggatgg 721 atgcgaggtt gaagagaaat catggtgaat tggatgcttt tgtggatcac gtaattgatg 781 accatctcct tagtagaaaa gcgaacggct cagatggagt ggagcagaaa gatttagtgg 841 atgtgctgct acatctgcag aaggattcct cactcggcgt ccatctcaac agaaataacc 901 ttaaagccgt catcttggac atgttctctg gtggaactga tacgacggct gtgaccttag 961 aatgggctat ggcagagctt ataaagcatc ccgatgtgat ggagaaagcc caacaagagg 1021 taagaagagt tgtggggaaa aaagcaaagg tggaagaaga agatcttcat cagttgcact 1081 acttaaaact aatcatcaaa gagactctgc gcttgcatcc tgttgctcca ttattagttc 1141 cacgcgagtc cacaagggat gtagttataa ggggctatca tattcctgca aagacaagag 1201 tctttattaa tgcatgggca ataggaagag accccaagtc atgggagaat gctgaggaat 1261 ttctcccaga gagatttgtc aataattctg ttgatttcaa ggggcaagat tttcaactta 1321 ttccttttgg agcagggagg aggggctgtc cggggattgc atttggcatc agttcagttg 1381 agatttctct tgccaatctc ttgtactggt tcaactggga attacctggg atctaaccaa 1441 agaagatctg gacacgctcg aagccgttgg gataaccgtt cacatgaagt ttcctctgca // LOCUS ECODGTP 2760 bp ds-DNA BCT 14-JUL-1990 DEFINITION E.coli dGTP triphosphohydrolase (dgt) gene, complete cds, and periplasmic protease (htrA) gene, 5'end. ACCESSION M31772 KEYWORDS dGTP triphosphohydrolase; dGTPase; deoxyguanosine triphosphate triphosphohydrolase; dgt gene; htrA gene; periplasmic protease. SOURCE E.coli (strain K12, isolate HR42) DNA. ORGANISM Escherichia coli Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Facultatively anaerobic rods; Enterobacteriaceae. REFERENCE 1 (bases 1 to 2760) AUTHORS Wurgler,S.M. and Richardson,C.C. TITLE Structure and regulation of the gene for dGTP triphosphohydrolase from Escherichia coli JOURNAL Proc. Natl. Acad. Sci. U.S.A. 87, 2740-2744 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence [1] kindly submitted by S.M. Wurgler, 01-FEB-1990, for release after publication. FEATURES from to/span description pept 1066 2583 dGTP triphosphohydrolase (dgt) (EC 3.1.5.1) pept 2713 > 2760 periplasmic protease (hrtA) pept 958 323 (c) ORF signal 991 996 -35 region signal 1014 1019 -10 region site 1025 1046 regulatory signal binding 1054 1065 ribosome binding site signal 2590 2630 pot. transcription termination signal BASE COUNT 711 a 665 c 706 g 678 t ORIGIN 3.6 minutes on K12 map. 1 cagctggtca acctgccgct cggcattacc tccacgccag gcaatcacca gatcgggttt 61 cagcgcgaca atgcgttcca gattcatccc ctgccaggcg gaaacctgct caatcttttg 121 cgcttgtgga ggatagtcgg aatagctgct gaccccaacc ggcgtgatcc cggcggcaaa 181 ggcaagttca gtgttggcgg gagaaagcgt gatgacgcgc ggcgcggcgt tgagccacag 241 tggcgcaaga aaagacatgg ggcgaccagc gccctgaaca gtgacttagc catgtgcaag 301 tttctgcacc agtgactaac catcaggctg gactgtttag cggcaacagc aggaactcat 361 cgaagctaag atgagactgt tgatcggcca cgtcggagat ggcgcgtacg acaacaaacg 421 ggacgttgaa attgtggcag acatgggcga ttgccgtcgc ttccatctct acagcaatgg 481 cctgtgggaa gttgtggcgg attttcgcca gaccaacaga accgttgatg aaagcgtcgc 541 cgctaacaat caggccacgt acagcgttaa gattcagttc ggcaatgcag gcctcagcgg 601 cagcgatcag tttatcgtca gctttaaagc ctgccggaca gcctggtaac tgaccgtatt 661 cataaccaaa tgccgtgaca tccgcgtcgt gataacgtgc ttcgtccgag acaacgatat 721 cgcccacttt caacgttggt gccaggccac cggcagaacc ggtgttaata atcacatctg 781 gcttgcagtg ttccaacagc aaagtggcac ccagcgccgc agcgacttta ccgatgcccg 841 atttcagaag cgcaacctcg gttccattca gttggccggt atagatttcg caaccgccga 901 gactgatagt ttgacggttt tcgattttgt cacgcagcag cgtaacttct tcttccattg 961 caccaatgat gccgattttc atagatttac tcgcgataag cccgatttga aggcatagtt 1021 taccatgcgc ttacggggaa gcgtatttct cacgcgggag aggacatggc acagattgat 1081 ttccgaaaaa aaataaactg gcatcgtcgt taccgttcac cgcagggcgt taaaaccgaa 1141 catgagatcc tgcggatctt cgagagcgat cgcgggcgta tcatcaactc tccggcaatt 1201 cgtcgtctgc aacaaaagac ccaggttttt ccactggagc gcaatgccgc cgtgcgcacg 1261 cgtcttaccc actcgatgga agtccagcag gtggggcgct acatcgccaa agaaatttta 1321 agccgtctga aagagcttaa attactggaa gcatacggcc tggatgaact gaccggtccc 1381 tttgaaagca ttgttgagat gtcatgcctg atgcacgata tcggcaatcc gccgtttggt 1441 cattttggcg aagcggcgat aaatgactgg tttcgccaac gtttgcaccc ggaagatgcc 1501 gaaagccagc ctctgactga cgatcgctgc agcgtggcgg cactacgttt acgggacggg 1561 gaagaaccgc ttaacgagct gcggcgcaag attcgtcagg acttatgtca ttttgagggg 1621 aatgcacaag gcattcgcct ggtgcataca ttgatgcgga tgaatctcac ctgggcacag 1681 gttggcggta ttttaaaata tacccgtccg gcgtggtggc gtggcgaaac gcctgagaca 1741 catcactatt taatgaaaaa gccgggttat tatctttctg aagaagccta tattgcccgg 1801 ttgcgtaaag aacttaattt ggcgctttac agtcgttttc cattaacgtg gattatggaa 1861 gctgccgacg acatctccta ttgtgtggca gaccttgaag atgcggtaga gaaaagaata 1921 tttaccgttg agcagcttta tcatcatttg cacgaagcgt ggggccagca tgagaaaggt 1981 tcgctctttt cgctggtggt tgaaaatgcc tgggaaaaat cacgctcaaa tagtttaagc 2041 cgcagtacgg aagatcagtt ttttatgtat ttacgggtaa acaccctaaa taaactggta 2101 ccctacgcgg cacaacgatt tattgataat ctgcctgcga ttttcgccgg aacgtttaat 2161 catgcattat tggaagatgc cagcgaatgc agcgatcttc ttaagctata taaaaatgtc 2221 gctgtaaaac atgtgtttag ccatccagat gtcgagcggc ttgaattgca gggctatcgg 2281 gtcattagcg gattattaga gatttatcgt cctttattaa gcctgtcgtt atcagacttt 2341 actgaactgg tagaaaaaga acgggtgaaa cgtttcccta ttgaatcgcg cttattccac 2401 aaactctcga cgcgccatcg gctggcctat gtcgaggctg tcagtaaatt accgtcagat 2461 tctcctgagt ttccgctatg ggaatattat taccgttgcc gcctgctgca ggattatatc 2521 agcggtatga ccgacctcta tgcgtgggat gaataccgac gtctgatggc cgtagaacaa 2581 taaccaggct tttgtaaaga cgaacaataa atttttacct tttgcagaaa ctttagttcg 2641 gaacttcagg ctataaaacg aatctgaaga acacagcaat tttgcgttat ctgttaatcg 2701 agactgaaat acatgaaaaa aaccacatta gcactgagtc gactggctct gagtttaggt // LOCUS RATHMGCOA 1994 bp ss-mRNA ROD 14-JUL-1990 DEFINITION Rat mitochondrial 3-hydroxy-3-methylglutaryl-CoA synthase mRNA, complete cds. ACCESSION M33648 KEYWORDS 3-hydroxy-3-methylglutaryl-CoA synthase. SOURCE Rat (strain Sprague-Dawley) adult liver, cDNA to mRNA, clone lambda-cMS1. ORGANISM Rattus norvegicus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 1994) AUTHORS Ayte,J., Gil-Gomez,G., Haro,D., Marrero,P.F. and Hegardt,F.G. TITLE Rat mitochondrial and cytosolic 3-hydroxy-3-methylglutaryl-CoA synthases are encoded by two different genes JOURNAL Proc. Natl. Acad. Sci. U.S.A. 87, 3874-3878 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by F.G.Hegardt, 06-APR-1990, for release after publication. FEATURES from to/span description pept 50 1576 3-hydroxy-3-methylglutaryl-CoA synthase precursor (EC 4.1.3.5) sigp 50 160 3-hydroxy-3-methylglutaryl-CoA synthase signal peptide matp 161 1573 3-hydroxy-3-methylglutaryl-CoA synthase mRNA 1 1994 3-hydroxy-3-methylglutaryl-CoA synthase mRNA BASE COUNT 493 a 563 c 505 g 433 t ORIGIN 1 atctctccca ggggctgtgg actgctggct ttctgttgat accttagaga tgcagcggct 61 tttggctcca gcaaggcggg tcctgcaagt gaagagagtc atgcaggaat cttcgctctc 121 acccgctcac ctgctccccg cagcccagca gaggttttct acaatccctc ctgctcccct 181 ggccaaaact gatacatggc caaaagatgt gggcatcctt gccctggagg tctactttcc 241 agcccaatat gtggaccaaa ctgacctgga gaagttcaac aatgtggaag cagggaagta 301 cacagtgggc ttgggccaga cccgtatggg cttctgttcg gtccaggagg acatcaactc 361 cttgtgcctc acagtggtgc agaggctgat ggaacgcaca aagctgccat gggatgccgt 421 aggccgcctg gaagtgggca cggaaaccat cattgacaag tccaaggctg tcaagacagt 481 gctcatggag ctcttccagg attcaggcaa cactgacatc gagggcatag ataccaccaa 541 cgcctgctat ggtggcactg cctccctctt caacgctgcc aactggatgg agtccagcta 601 ctgggatggt cgctatgccc tggtggtctg tggtgatatc gcagtctacc caagtggtaa 661 cccccgcccc acaggtggtg ccggggctgt ggcaatgctg attgggccca aggccccgct 721 agtcctggaa caagggctga ggggaaccca catggagaac gcctatgact tctacaaacc 781 aaacttggcc tcagagtatc cactggtgga tgggaagctg tctatccagt gctacctgcg 841 ggccttggac cgatgctatg cagcttaccg caggaaaatc cagaatcagt ggaagcaagc 901 tggaaacaac cagcctttca ccctcgatga cgtgcaatat atgatcttcc acacaccctt 961 ttgcaagatg gtccagaaat ccctagctcg gctgatgttc aatgacttcc tgtcatctag 1021 cagtgacaag cagaacaact tatacaaggg tctagaggcc ttcaagggtc taaagctgga 1081 agaaacctac accaacaagg atgttgacaa ggctctgctg aaggcctccc tggacatgtt 1141 caacaagaaa accaaggcct ccctttacct ctccacaaac aatgggaaca tgtacacctc 1201 gtccctctac gggtgcctgg cctcacttct ctcccaccac tctgcccaag aattggccgg 1261 ctccaggatt ggagccttct cctacggctc aggcttagca gcaagtttct tctcatttcg 1321 agtgtccaag gacgcttccc caggttcccc tctggagaag ctggtgtcta gtgtgtcaga 1381 tctgcccaaa cgtctagact cccggagacg catgtcccct gaggaattca cagaaataat 1441 gaatcagaga gagcaatttt accacaaggt gaacttctct ccccctggtg acacaagcaa 1501 cctcttccca ggcacttggt accttgaacg agtggatgag atgcaccgca gaaaatatgc 1561 ccggcgtccc gtctaaggag accaatccat acaaccattc cccggggaaa gaatgtgagc 1621 agagccgtta cccaaacggc ttccacttaa aattccaccc acagcagtga acggtgaata 1681 gacacagcga ccccatagga tctgctccgc ggtgaagggc ctccctctgt ggatcctggg 1741 tgaccctccc tgaagcagtg agcaccacag gttctgctgt ggaccagagc ccccctgtgg 1801 agagggagaa agaaagggga gccgctgacc tgcagggata cagaccttcc ccacagcctg 1861 gcagccgccc gtttgttgca gcttattatc agactgtggg ctatcatagt tcatgctcgt 1921 ttcttaaagt ttcccgagaa tttctaaaat tttgtatcta aacttttaat atggcgatta 1981 aaaggagaga agga // LOCUS ECOSLTOXI 1761 bp ds-DNA BCT 14-JUL-1990 DEFINITION E.coli Shiga-like toxin II (SLTIIva) gene, complete cds. ACCESSION M29153 KEYWORDS Shiga-like toxin II. SOURCE E.coli (serotype O128:B12, isolate H.I.8) DNA. ORGANISM Escherichia coli Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Facultatively anaerobic rods; Enterobacteriaceae. REFERENCE 1 (bases 1 to 1761) AUTHORS Gannon,V.P.J., Teerling,C., Masri,S.A. and Gyles,C.L. TITLE Molecular cloning and nucleotide sequence of another variant of the Escherichia coli Shiga-like toxin II family JOURNAL J. Gen. Microbiol. 136, 1125-1135 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by V.P.J.Gannon, 20-SEP-1989, for release after publication. FEATURES from to/span description pept 254 1213 Shiga-like toxin II (SLTIIva)-A subunit precursor sigp 254 319 Shiga-like toxin II-A subunit signal peptide matp 320 1210 Shiga-like toxin II-A subunit pept 1226 1489 Shiga-like toxin II-B subunit precursor sigp 1226 1282 Shiga-like toxin II-B subunit signal peptide matp 1283 1486 Shiga-like toxin II-B subunit signal 61 66 -35 region signal 82 87 -10 region binding 241 247 A subunit binding site binding 1215 1221 B subunit binding site signal 1659 1265 rho-independent transcription termination signal BASE COUNT 487 a 316 c 396 g 562 t ORIGIN 1 atatgtgaga gattctctat tctataaaaa tataaaccat atacccgtaa tcttaaataa 61 ttgatactat tttccggaag ctatagtgca tcattatctg ctttctgttc ctgataaaca 121 gggaagtaat acattgtgca gtcatttcct gtgaggggat ttgtttgtaa taaatttctg 181 ctaaaaacat tccggacttc ttgcgaggta ttattcctgg cgctgccggt tcagactgat 241 ggagtgttaa tatatgcgac atatattatt aaagctggtg ttgttttttt gtgtttgctt 301 gtcttcagca tcttatgcag atgagtttac tgtggatttc tcttcgcaaa agagctatgt 361 tgattcattg aatagtataa ggtcggcaat atccactcca cttggaaata tatctcaggg 421 tggtgtttct gtttcagtaa ttaatcatgt tctaggcgga aactatatat cattgaatgt 481 tagaggcctt gatccatata gcgagagatt taaccacctc cgtttaataa tggaacggaa 541 taacttatat gttgcaggct ttattaatac tgaaacgaat atcttttaca gattctccga 601 tttctcacat atttcagtgc ctgatgtgat aactgtttcc atgacgacgg acagcagtta 661 ttcatcatta cagcgaatcg cagatctgga acgtacaggg atgcagattg ggcgtcattc 721 actggttggt tcatatctgg atttaatgga gttcagagga cgttccatga cccgcgcatc 781 atccagagct atgctgcgtt ttgtcacagt gatagcagaa gctctgcgat tcagacaaat 841 acagcgggga ttccgaccgg cgctgtctga ggcatctccg ctttatacaa tgacggctca 901 ggatgttgac cttaccctga actggggaag aataagtaat gttcttccag agtacagagg 961 agaggaaggg gtaagaatcg gtaggatatc ttttaatagt ctttctgcga ttctcggaag 1021 tgttgcggtc atccttaatt gccactcaac cggaagttat tcagttcgtt ccgtgagcca 1081 aaaacagaaa acagaatgcc agattgttgg agacagggcg gccattaaag taaataatgt 1141 tttgtgggaa gcgaatacaa tcgctgcttt attaaatcgc aagcctcagg atcttactga 1201 accaaaccaa taacaggggg tgaatatgaa gaagatgatt attgcagttt tattcggtct 1261 cttttctgct aattccatgg cggcggattg tgctgtagga aaaattgagt tttccaagta 1321 taatgaggat gataccttta ctgtgaaggt gtcaggaaga gaatactgga cgaacagatg 1381 gaatttgcag ccattgttac aaagtgctca gctgacaggg atgactgtaa caatcatatc 1441 taatacctgc agttcaggct caggctttgc ccaggtgaag tttaactgag aatctacggt 1501 ttatttatgc gcgtcttttg tttctggacg cagatattat tagtgttgtg gatgctgatt 1561 aaatttggtc agtgttttcg ttaaagtcat ataaatacag gggcgtcacg cccctttttg 1621 gtctgtagtt gggtgaagat catcacttta ctttgctcaa ataactcagc acttgccggt 1681 ttacccttaa ggcaatacct acgttccata aaagaagcat caattaggat gctcgtatac 1741 ttgttatgtg ctttaaatgt t // LOCUS RATCHOL7H 3545 bp ss-mRNA ROD 14-JUL-1990 DEFINITION Rat cholesterol 7-alpha-hydroxylase mRNA, complete cds. ACCESSION J05460 KEYWORDS cholesterol 7-alpha-hydroxylase; cholesterol 7-alpha-monooxygenase. SOURCE Rat (strain Wistar King) male 8 week old liver, cDNA to mRNA, clone p7-alpha-11. ORGANISM Rattus norvegicus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 3545) AUTHORS Noshiro,N., Nishimoto,M. and Okuda,K. TITLE Rat liver cholesterol 7-alpha-hydroxylase: Pretranslational regulation for circadian rhythm JOURNAL J. Biol. Chem. 265, 10036-10041 (1990) STANDARD full staff_review COMMENT Draft entry and printed sequence for [1] kindly submitted by M.Noshiro, 30-MAR-1990, for release after publication. FEATURES from to/span description pept 36 1547 cholesterol 7-alpha-hydroxylase (EC 1.14.13.17) BASE COUNT 1035 a 710 c 697 g 1103 t ORIGIN 1 ggtctcccct ttggaaattt tcctgctttt gcaaaatgat gactatttct ttgatttggg 61 gaattgccgt gttggtgagc tgttgcatat ggtttattgt tggaataagg agaaggaaag 121 ctggtgaacc tcctttggag aacgggttga ttccgtacct gggctgtgct ctgaaatttg 181 gatctaatcc tcttgagttc ctaagagcta atcaaaggaa gcatggtcac gtttttacct 241 gcaaactgat ggggaaatat gtccatttca tcacaaactc cctgtcatac cacaaagtct 301 tatgtcatgg aaaatatttt gactggaaaa aatttcatta cactacttct gcgaaggcat 361 ttggacacag aagcattgac ccaaatgatg gaaataccac ggaaaatata aacaacactt 421 ttaccaaaac cctccaggga gatgctctgt gttcactttc tgaagccatg atgcaaaacc 481 tccaatctgt catgagacct cctggccttc ctaaatcaaa gagcaatgcc tgggtcacgg 541 aagggatgta tgccttctgt taccgagtga tgtttgaagc tggctatcta acactgtttg 601 gcagagatat ttcaaagaca gacacacaaa aagcacttat tctaaacaac cttgacaact 661 tcaaacaatt tgaccaagtc tttccggcac tggtggcagg ccttcctatt cacttgttca 721 agaccgcaca taaagctcgg gaaaagctgg ctgagggatt gaagcacaag aacctgtgtg 781 tgagggacca ggtctctgaa ctgatccgtc tacgtatgtt tctcaatgac acgctctcca 841 cctttgacga catggagaag gccaagacgc acctcgctat tctctgggca tctcaagcaa 901 acaccattcc tgcaaccttt tggagcttat ttcaaatgat caggagtcct gaagcaatga 961 aagcagcctc tgaagaagtg agtggagctt tacagagtgc tggccaagag ctcagctctg 1021 gagggagtgc catttacttg gatcaagtgc aactgaatga cctgccggta ctagacagca 1081 tcatcaagga ggctctgagg ctttccagtg catccttgaa tatccgcaca gctaaggagg 1141 acttcactct ccatcttgag gacggttcct ataacatccg aaaagatgac atgatagctc 1201 tttatccaca gttaatgcac ttggatcctg aaatctaccc agaccctttg actttcaaat 1261 atgaccggta ccttgatgaa agcgggaaag caaagaccac cttctacagt aatggaaaca 1321 agctgaagtg tttctacatg cccttcggat caggcgcgac aatatgtcct ggaagactct 1381 ttgccgtcca agaaatcaag cagtttttga tcctgatgct ctcctgcttt gaactggagt 1441 ttgtggagag ccaagtcaag tgtccccctc tagaccagtc ccgggcaggc ttgggaattt 1501 tgccaccact acatgatatt gagtttaaat ataaactgaa acactgatac gtggttggaa 1561 gaagcgaaca ctggatgatg tcacttggcg gctgagagtc atcactaaac aggccttcgg 1621 gaccaatgct cactgatgcg ccctagcgac tggattagtg ggaagaactt tgttctcgct 1681 gcccacattc ctgggtgttc acatagctgg ggccagagct tcatcacttt cagaaagcaa 1741 tgtcttttgt atttattttc aaaatgaaga tattccaatt ggcaggatat ttttcctaag 1801 gaaattgctt tatattttta tgaaaactac caattaatta tgaaagggct tgaaattcac 1861 gttttagtga aattactgat ttttcactag taaggttctt caggtgtgaa actgtattat 1921 aaaaatgttg taatgggtca cactgtgctt tgcataaagg taaaggaaac tatgtttcag 1981 ccttttctgt gtctatgagc ttcgaaaata atcttactgt tctagaaaca ctggggaggt 2041 ttcgacatgc tctcgctata ttttatttta ctgttgctag aaattttcat tccagttttc 2101 aactacctta tctttccccc attttgacat gcatgccaat gagaagagta ttttttagga 2161 attaacaagg cacctcccag aaccctaccc tgagactttt aagcctttaa tcccagcact 2221 cgagaagtag agccaggcag atctctgagt ctgaggttat tctggtctac atcagctcca 2281 gacaagccag gactacagaa tgggatcttg tctaaaaaat acagctaatc tttatgtcat 2341 aactgattat gaatcaacct aaaagataaa ttttcaatca ggactcagag aaaatgagca 2401 attaaaaaac ttagctctga ggtatgtgga attcattaag tacaagttga cattacatgt 2461 tctttaaaaa tagtttatgt tttatctcta aatgccctgc agatgaagaa taataatgaa 2521 aagttgaata atactgttta aacactaagt gcaataatgc tttggtaatg tactttaaga 2581 gaatcattag ccgtgccagt tttactaaaa tatatttata tgtaaattat atttatcttt 2641 ttcttatacc ataaatataa aaatattgca acatttagta attttaaaat tatatacctt 2701 tcagaaaatg atgtatgaat gtttgtatgt tttttaactt tgaacagaac atttaaatta 2761 ttcatctacg gtgattttta tcttatttat ttctttttgt ctcattcata tcttgaagaa 2821 atccaaaaat atctgaagga atcgctcact caaatgtctc cctatggtta cagaaaaatt 2881 caataccatg tttttgtcct cggggactga agcagggtgt cgtgggtgcc gagcagaggc 2941 tcctgctgca gcgagcttta tccacgggac tccttaaact tttaaaatct tatcactatt 3001 atcatgcatt tattacctaa gtaggatatt tccctttcct ttttcatttc agccgagtcc 3061 cttagcaacc caggctgact gggaccctcc atgtagctta agctgtgaac tcactgtact 3121 tcctgttttc acttatttta ggaagtaatt ttccctatca gaaattttaa ttgtttagat 3181 gatgtataag agtaacacaa ttctgttata tactaatctg tagtaaacta aatttgttct 3241 tagaacaagt ttgatgactc tcaaattgaa tgtatccata catctttcca tggcttcttg 3301 aatgcccatt tctcatacac agaatgatgg gtttcacggt gatgtcttcc tttcatgtct 3361 ttattcttgt gcggtgatgg ttggcaaatg atacccatgg agcaaggtta ctcttcctat 3421 ttctgtgcag cctaagtgtt aagaataatt tttaaatact tggagggaag gcacattttg 3481 tgtcatatgt gaagtgacat gtgacacaca gactagcaaa tccttgagta aaattttatt 3541 gggat // LOCUS HUMCTLA1A 4751 bp ds-DNA PRI 14-JUL-1990 DEFINITION Human granzyme B (CTLA-1) gene, complete cds. ACCESSION M28879 KEYWORDS cytotoxic T-lymphocyte-associated serine esterase 1; granzyme B. SOURCE Human (patient B) blood DNA, clone lambda-B30. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 4751) AUTHORS Haddad,P., Clement,M.-V., Bernard,O., Larsen,C.-J., Degos,L., Sasportes,M. and Mathieu-Mahul,D. TITLE Structural organization of the hCTLA-1 gene encoding human granzyme B JOURNAL Gene 87, 265-271 (1990) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by P.Haddad, 11-MAY-1990, for release after publication. FEATURES from to/span description pept 1161 1215 granzyme B, exon 1 /hgml_locus_uid="LQ0170R" /nomgen="CTLA1" /map="14q11-q13" 2256 2403 granzyme B, exon 2 2857 2992 granzyme B, exon 3 3200 3460 granzyme B, exon 4 4105 4248 granzyme B, exon 5 IVS 1216 2255 CTLA-1 intron A IVS 2404 2856 CTLA-1 intron B IVS 2993 3199 CTLA-1 intron C IVS 3461 4104 CTLA-1 intron D signal 4338 4343 poly-A signal BASE COUNT 1211 a 1292 c 1119 g 1129 t ORIGIN 1 gaattctata ttttgagata taccattcct catagaaaaa tttcctcaca gaaaatataa 61 aggtggaaac aaatcacaag aatcgaacca tgtagagaga cttagttgtc ttttaacaga 121 attgggcacg ggctgttcag aaacaacaat ctttcacatc cattataatg atagcattag 181 tgtagtttgt ttagcaaatg tttactgcga gcctgttatg tgctgagcct gctatgtaag 241 aagtgtggct ctctggacag gagacagaat actaaacaac acaactactg atctttggct 301 gcctggcatg cttcctcact tcatatggta tcagcaattt agcaccacaa acgtccttta 361 gagaccagcc ctttctcatt cttggttcta gtggcttgag tagactgacc ccactaccca 421 agtggatttg actcctagca attcattaat ctagcccata aatgtcaagt acaggacttt 481 attgaagcat tcagaaagag gaatagggga tgttagaatc tctagaaagg aagctatgat 541 aataaatggg ttgctagatg ggtctagtag atggtggcca tgctttgtta ctgccttgtg 601 tattgtgcta ccatagccct ccccaaactg tactctggct cctggcattt ccgtctcttc 661 aaccagatgg tcagctctct aagtgaagga gacacatctc caacatgctt ggttctagca 721 caacagaagg gctcaaacac atacctgcta aagaaactat cctgatggat ttagcagcat 781 ggccatgagg cattggcggt tctatcactg ggaactcagg tttctggtgc tccagtacct 841 ctactggctg ataccacatc ctacagttca cttcataggc ttgggttcct gctctgggct 901 gaataggtgg tccactctga gtcatcagct gtggtgatga tgtggtcact gcatgattct 961 cacacaagca cccagaggac gtcatcaggc agaggcagtg ggggtgggca gcatttacag 1021 aaaatctgtg atgagacacc acaaaaccag aggggaacat gaagtcactg agcctgctcc 1081 acctctttcc tctcccaaga gctaaaagag agcaaggagg aaacaacagc agctccaacc 1141 agggcagcct tcctgagaag atgcaaccaa tcctgcttct gctggccttc ctcctgctgc 1201 ccagggcaga tgcaggtgag tgaccgtctt ccaacctcgg ggcccaaccc atcccacagg 1261 tctcctgccc tttctccaca ttcctgatcc atctatctac caggaatgtt ctgaactcca 1321 gctcccattc taccaagacc ccccaagtgt gatgctggat aagctatcag caggaatggc 1381 agagcagcag gccattctca agaagagcca gtgggtacta tcccttcccc agagcccacc 1441 tttgtcacct ggagagtagg actttcctag aagtaaatgg cagaggatgg gaaactagaa 1501 aagagaaata ttaaattatt ctagagtagg cctggcttct gtttctggga taagacaggt 1561 gcttctctca ctgtacttag gagagaaacc cagagctcag ctgacagcag aattggtaca 1621 atcactgtcc tcagaacact gttaatgtgt ttgctcagtc ccattctcca actctgcttt 1681 tcttccctgg cctttggtgg ctcccctctt tccaaggatg aggcactacg gcaggcccca 1741 gcttccctgc tttctagaat tccaccagca ctgctctacc agccctcatc cagaggctaa 1801 ctggagccag tccatcatgc agccatgaac atttactggg cacccactac atgtcaggct 1861 ctaggaaaca ggatatgaca gtatctagat ccctccactt acaccctggc cattagaaag 1921 cagcactatc ctagacacca caggactcat aagggtcttg gaaactcacc tgaaacaaag 1981 caaagtcagg agaggaatga tcaggagcct ctgggatttc actgtcccta agacaggtat 2041 gctcgccttc aactacatat ggaagaaaga tttacagacc aaagtctgct gttcttccct 2101 ttttcagagc aggaaattga agccccttcc tccaggccac tcccaactcc aggctatccc 2161 aggctcccaa atgcccagga gttctggagc cactaagcag gtgcccaccc agcagattcc 2221 atgggtgccc acaagcagac agacttttcc ttcaggggag atcatcgggg gacatgaggc 2281 caagccccac tcccgcccct acatggctta tcttatgatc tgggatcaga agtctctgaa 2341 gaggtgcggt ggcttcctga tacaagacga cttcgtgctg acagctgctc actgttgggg 2401 aaggtgagga gcagaaaaca gcccacaccc tcctggaaac actccacaga gacccctgcc 2461 ttcttcccaa ggagctccct gggctcctgt gaacacacat gccaggaggt ctccttagag 2521 ggtgagaaaa gggcagttaa gtttgtggag agaggggaag gttggttcca gaggtgctgc 2581 tgaagtaaga aacagcagag tgaccaagcc tgccatattt agaactgggg gcatactttg 2641 gcatagaata caaactgaag caattccacc tgtgtttcta gggggaaccg aaccctgaga 2701 aacctggtgc aattaccaga attccaattc ctggggaccg actgtcctta atttcccctc 2761 agctgcagcc ctgccccagc tgtcacctgc tcttcactgt ctctgggctg tatacactgt 2821 gactccaccc ccatcctcac tctgctctct gtgcagctcc ataaatgtca ccttgggggc 2881 ccacaatatc aaggaacagg agccgaccca gcagtttatc cctgtgaaaa gagccatccc 2941 ccatccagcc tataatccta agaacttctc caatgacatc atgctactgc aggtgaggca 3001 cactcctgcc actcttgctc ttcttggtcc agttggttcc actccccctg gaatgccggc 3061 ccttccctcc tttccatcct ggcctcttgg ttagttccta tgcctcagag gagagaggga 3121 agattgtgca gccccatcac tgtgtcgggg cccagaagtt cgttggctga cctggacttt 3181 cttgcctctt ccccaccagc tggagagaaa ggccaagcgg accagagctg tgcagcccct 3241 caggctacct agcaacaagg cccaggtgaa gccagggcag acatgcagtg tggccggctg 3301 ggggcagacg gcccccctgg gaaaacactc acacacacta caagaggtga agatgacagt 3361 gcaggaagat cgaaagtgcg aatctgactt acgccattat tacgacagta ccattgagtt 3421 gtgcgtgggg gacccagaga ttaaaaagac ttcctttaag gtaagactat gcacctgcct 3481 ggattggctc ttgggagaaa gatgtttggg gaatatctga gacctggaga ctcaagtagt 3541 gggggactcc ttcacccact agactgtgat atttctctct ggaaagagaa aaggggacta 3601 gactgagctg gggagaaatt agggcctctg caaacttacc aagaggcctt atggtggatg 3661 gtgccttctt tggaaggatg aatttgcaac actccaccca ctccaggtca cagatattag 3721 gaaactgtgc ccatgggggt gcagctaatt ataaccaggt gtgtcttcag aggctggtac 3781 ccaacgtggt taatgggctg gtcctccatg gtggacatca gccctccttg cccacttctg 3841 ggtccttaaa cagccaacgg tcccacatac ctccgatctc aggatctggg ggacatgacg 3901 gaggctggcc cctgggatga ggtgaagcag taacaatgtc cagggccaga gcttggcagc 3961 tggggccacc agcggcctgc cctgccctct ggtctcccac atgtaggctg tgcaagttgg 4021 ccttttctaa aagggggctt gagatggaag agagggcagg acccggagga gcatcagctc 4081 agtccttcca ctctctattc acagggggac tctggaggcc ctcttgtgtg taacaaggtg 4141 gcccagggca ttgtctccta tggacgaaac aatggcatgc ctccacgagc ctgcaccaaa 4201 gtctcaagct ttgtacactg gataaagaaa accatgaaac gctactaact acaggaagca 4261 aactaagccc ccgctgtaat gaaacacctt ctctggagcc aagtccagat ttacactggg 4321 agaggtgcca gcaactgaat aaatacctct tagctgagtg gaaagctggt ttcttgttta 4381 ttcattgacc ctcattctca ggcaccacat ctgcgctatg caggccaatg acacaatttt 4441 gctgttttct gctttctcct ctcccctcac cccttgccac ctccccaaac ccccacatga 4501 agctgatact cagctccttc ctatccacac cagtttctcc agggcctgcc ttctgccaag 4561 gctgaagctg agcaccatca ggagacaaca tggaccactt tggtcctggg gctttgggta 4621 aacttcttac ctccttctcc agtgttacat tgacagagaa aaaagggata ataccatggg 4681 acctaactcc tcatcccact ggggctcctc attctcccct gggcttagtt tctctaccct 4741 cctctgagct c //