Path: utzoo!attcan!uunet!cs.utexas.edu!usc!apple!bionet!daemon
From: GenBank-Updates@genbank.bio.net
Newsgroups: bionet.molbio.genbank.updates
Subject: Database Update
Message-ID: <9004131656.AA06728@life.lanl.gov.LANL.GOV>
Date: 13 Apr 90 16:56:32 GMT
Sender: daemon@genbank.BIO.NET
Distribution: bionet
Lines: 883
Approved: lear@genbank.bio.net
Checksum: 33562 55

LOCUS       DROEMCA1      934 bp ds-DNA             INV       07-MAR-1990
DEFINITION  D.melanogaster extramacrochaetae protein gene, exon 1.
ACCESSION   M32636
KEYWORDS    extramacrochaetae protein; helix-loop-helix protein; regulatory
            protein.
SEGMENT     1 of 2
SOURCE      D.melanogaster 8-12 hour embryo, cDNA to mRNA, clones pNB5B, and
            DNA, clones pP7, lambda-B[11,16].
  ORGANISM  Drosophila melanogaster
            Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta;
            Pterygota; Neoptera; Holometabola; Diptera; Brachycera;
            Cyclorrhapha; Schizophora; Drosophiloidea; Drosophilidae.
REFERENCE   1  (bases 1 to 934)
  AUTHORS   Ellis,H.M., Spann,D.R. and Posakony,J.W.
  TITLE     Extramacrochaetae, a negative regulator of sensory organ
            development in Drosophila, defines a new class of
            helix-loop-helix proteins
  JOURNAL   Cell (1990) In press
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly
            submitted by J.W.Posakony, 06-MAR-1990.
FEATURES       from  to/span    description
    pept        347 +   910    extramacrochaetae protein, exon 1
    pre-msg      85 >   934    emc mRNA and introns (minor alt.)
    IVS         911 >   934    emc intron
BASE COUNT      270 a    273 c    244 g    147 t
ORIGIN      Chromosome 3, left arm 61D1,2.
1 cgaacccgag agcgaaagac tccgctatcg gattcggaaa tgccgttcgg ggggtataaa 61 agcgggcgct ctgagcgaga caccactcat tcgttgtctg accgctcgca gcaaaagata 121 tagaaatcga gaacgtaaac gcagaaaaca cgctggatta aagtgtttcg tttcgaacat 181 cgacagagtg agaaaacaaa gtgaagtgag cgcacaaagc cgaaatcaaa tcaaattgaa 241 aagccagtca agagcaatct ttaactgaac ctcgcaagta atccggtacg accgtgtaac 301 aaccgccaag attctactca aggagcgtaa agcaatcatc tccagaatga agtccctgac 361 ggccgtctgc cagacaggtg cctccggaat gccggccttg aatgccagcg ggcgcatcca 421 gcgccatccc acgcaccgcg gcgacgggga gaacgccgag atgaagatgt atctgtccaa 481 actgaaggac ctcgttccgt tcatgcccaa gaacaggaag ctcaccaagc tggagatcat 541 ccagcacgtc atcgactaca tctgcgacct gcagaccgag ctggagacgc accccgagat 601 gggcaacttc gatgcggcag ccgctctgac ggcggtgaac ggactccacg aggacgagga 661 cagcgacatg gaggatgcgg atgccgaggc agaagcggaa gtcgatccag atatcctcgc 721 ccagcgcctg aatgccgagc agccggcgaa agtctctagt cccgccgccc gtctcccgct 781 taccgatcgc caaacgccca acactcttgt ggcgcccgcc catccgcagc agcatcagca 841 gcagcagcaa ctgcaactgc agcagcaaca actgcaatca cagcagcaac tgtccaacag 901 tttagcaacg gtgagttaac aaacaaattg ctag // LOCUS DROEMCA2 1307 bp ds-DNA INV 07-MAR-1990 DEFINITION D.melanogaster extramacrochaetae protein gene, exon 2. ACCESSION M32637 KEYWORDS extramacrochaetae protein; helix-loop-helix protein; regulatory protein. SEGMENT 2 of 2 SOURCE D.melanogaster 8-12 hour embryo, cDNA to mRNA, clones pNB5B, and DNA, clones pP7, lambda-B[11,16]. ORGANISM Drosophila melanogaster Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta; Pterygota; Neoptera; Holometabola; Diptera; Brachycera; Cyclorrhapha; Schizophora; Drosophiloidea; Drosophilidae. REFERENCE 1 (bases 1 to 1307) AUTHORS Ellis,H.M., Spann,D.R. and Posakony,J.W. 
TITLE Extramacrochaetae, a negative regulator of sensory organ development in Drosophila, defines a new class of helix-loop-helix proteins JOURNAL Cell (1990) In press STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by J.W.Posakony, 06-MAR-1990. FEATURES from to/span description pept + 31 66 extramacrochaetae protein (emc), exon 1 pre-msg < 1 1140 emc mRNA and introns IVS < 1 30 emc intron BASE COUNT 499 a 244 c 193 g 371 t ORIGIN Chromosome 3, left arm 61D1,2. 1 taatttgaat cttccgcttc tctcttttag ccacagaatg cggagaaaga cagcagacag 61 tcgtaagcgt ggaaacaccc agccgggaaa acccacggaa cctataagcc tattaactaa 121 atgcataatg aaagtacagt caaaactcca ctcgcaacct tcgtgtgaag atctccgcat 181 aagataacgt gtgaaactct caagaaaacc acagcccagg cgcatctctt caacgctcct 241 tttcccggga aagggaaaac ccgggaaaac gagacctcgc ttttggctcc aacagagttt 301 agtttaaaca tttaataaca agaatacaac aacacaaaac ctagcattag agcagcgtaa 361 gcaagtacac attatgatta ttaattaaac attacaacta ggatgatggc gcttaagaat 421 gaacggaaga aagaaaactt ggcaaagctt taaacaattt aattttcctt taacgcttga 481 aattacactg aagcatatat ttttgttttc ctaattttag ttttgttttg gttaaaacaa 541 cgcaagaaaa aatgttacta atgcaaagca aacagagatt agcaataaaa aaaaattata 601 aaaatgcaca ggcaccaaac tggaattacg ttgaatcacc gtttcctgca ttacgattat 661 atgatttcaa tccaaatgat tttcctatta attttaatca atgaagagtt ttcctattat 721 tgattagcca cagccaataa acaacaagaa caaccaactc agcagcacac acacattcaa 781 attcattaat ttatatcgag agagtacgtt tagagtttaa gttcgagaaa gttcatttcc 841 aggcactaat tttcactcac accactcaca tagaaaaaca caaacacctc attttgtagt 901 tttccgagtc ccttggatcg ttttcctctt ttttaatcga aataacgcat aatgcataat 961 aattattacc agatatacac taaaacctat aaatatatat taatatatgt tctgtgatct 1021 gtagcatact ttgtacaaag tgtttttttt agcataatta tgcgatttta attatgatcc 1081 gcaatcgaaa ttagagattt ttgttctcat ttcgatggta aaaacactct atcatgatgt 1141 aaaaaaaaaa aacaaaagaa aacatttgca tgatgttgcg aaaaaatata ttatctaaaa 1201 cataaagaaa aacaaacact agcctcttaa taaattgcaa aaatttaaaa aacattaaaa 1261 
atcgaatatg tcaatggatt tccgttggga gcaggggatt aagaaac // LOCUS DROFMRFA1 1328 bp ds-DNA INV 08-MAR-1990 DEFINITION D.melanogaster FMRFamide neuropeptide gene, 5' flank, exon 1. ACCESSION M32640 J05415 KEYWORDS FMRFamide neuropeptide. SEGMENT 1 of 2 SOURCE D.melanogaster (strain Canton S) DNA, (library of T.Maniatis). ORGANISM Drosophila melanogaster Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta; Pterygota; Neoptera; Holometabola; Diptera; Brachycera; Cyclorrhapha; Schizophora; Drosophiloidea; Drosophilidae. REFERENCE 1 (bases 1 to 1328) AUTHORS Schneider,L.E. and Taghert,P.H. TITLE Organization and expression of the Drosophila FMRFamide neuropeptide gene JOURNAL J. Biol. Chem. (1990) In press STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by L.E.Schneider, 07-MAR-1990. FEATURES from to/span description pre-msg 1139 > 1328 FMRFamide neuropeptide mRNA and intron IVS 1248 > 1328 FMRFamide neuropeptide intron signal 1066 1071 CAAT box signal 1109 1114 TATA box site 40 51 conserved sequence 1 site 130 140 conserved sequence 2 site 309 326 conserved sequence 3 site 374 388 conserved sequence 4 site 394 414 conserved sequence 5 site 507 545 conserved sequence 6 site 574 602 conserved sequence 7 site 637 649 conserved sequence 8 site 671 721 conserved sequence 9 site 746 766 conserved sequence 10 site 860 872 conserved sequence 11 site 894 909 conserved sequence 12 site 1015 1027 conserved sequence 13 site 1028 1041 conserved sequence 14 site 1053 1061 conserved sequence 15 site 1063 1076 conserved sequence 16 site 1086 1095 conserved sequence 17 site 1108 1175 conserved sequence 18 BASE COUNT 340 a 331 c 319 g 338 t ORIGIN 1 bp upstream of HindIII site. 
1 aagcttctgg ggtgagtcca tagattacag agcagcgcag ttttcttgac cagatgatgg 61 tctagctgaa ctatttttaa ctgggcgatt ggcacacgac ttggcctggt catcaaactc 121 aagccaagat ttgaacttga actagtctcg ccttatcaca ttgccttcta attcgttttc 181 attttcgatt ttcgcatcta cttatgctaa tgaagccatc tgcagacgtg gttttcgaac 241 gtatttatat tgattatggg tgatcgtcaa caagagcagt ggacacccaa taaacctgtc 301 caaaacccga cacatttctg cccagtcatg cgtggtggac aatagccaaa tgccattgat 361 gagactcgtc tccaaaactt tggccttttg ccgggccgta attacagact tccgtctttt 421 gaacagtttt ttcagcccca cccaagagtc gagtcttgaa aagctggctg ggatggggtg 481 gtttcgggtg ctggacgaga tgccagaggc gccacaatgt atcctgttac aggttacagg 541 gccataaagc gccataaacg ccgcgacggc aatggcaaat tataacgcat acggacacgt 601 agtcgatcca ctggctagaa ggctaattgg acgtgcccgg ccaggatgtc cctgctcatc 661 ccacagtcga cttaaaagtg ggcgtaacat aacaattgtg ggcgatttgc atacaatttg 721 gtgctcgagt ggccacctaa ctggcgtcga aatgttgcta cgtcattgtt gctgcccctc 781 gtgcttcttg gctaataagt cagcatgaat gcggtctgtg ttttggcctt ttgtcccgct 841 gacaaccgct gacaagcaaa acaaacatat cagcggctga ccagcgacca tttgccgaca 901 ttgtcaaggt ttttttttca gccccgaata cacctgctaa tctgtggtgg tgactagcgc 961 gctcggaagg cagaaaaaac tgcagcgaaa atcagttcca ttgaatgtcc tctggcaaca 1021 tgttgccgca gcacttgttg cgcgcgtcca ctcgcagagg cacctcaaca tcagcctcct 1081 tgggcgccag cacaaaggac gcaacgcata taaaacagga tgtttcgcca aacatcctca 1141 cagaactgtt ccaccttcga gcgggcaaca agtgtgtgtg cggcccaaaa ggatccccag 1201 accttcgaat tcactctagt ttcctagtaa ggggacaggt ttcagaggtg agtgccatgc 1261 gaaaagtgtg aaaacagtgc tgagccttaa gcagtcgagt tattctttgg aaaagccaat 1321 agccaatt // LOCUS DROFMRFA2 1603 bp ds-DNA INV 08-MAR-1990 DEFINITION D.melanogaster FMRFamide neuropeptide gene, exon 2. ACCESSION M32641 J05415 KEYWORDS FMRFamide neuropeptide. SEGMENT 2 of 2 SOURCE D.melanogaster (strain Canton S) DNA. ORGANISM Drosophila melanogaster Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta; Pterygota; Neoptera; Holometabola; Diptera; Brachycera; Cyclorrhapha; Schizophora; Drosophiloidea; Drosophilidae. 
REFERENCE   1  (bases 1 to 1603)
  AUTHORS   Schneider,L.E. and Taghert,P.H.
  TITLE     Organization and expression of the Drosophila FMRFamide
            neuropeptide gene
  JOURNAL   J. Biol. Chem. (1990) In press
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly
            submitted by L.E.Schneider, 07-MAR-1990.
FEATURES       from  to/span    description
    pept        220    1263    FMRFamide neuropeptide precursor
    sigp        220     291    FMRFamide neuropeptide signal peptide
    matp        527     570    FMRFamide neuropeptide A
    matp        571     663    FMRFamide neuropeptide B
    matp        664     696    FMRFamide neuropeptide C
    matp        697     729    FMRFamide neuropeptide D
    matp        730     762    FMRFamide neuropeptide E
    matp        763     795    FMRFamide neuropeptide F
    matp        796     828    FMRFamide neuropeptide G
    matp        829     861    FMRFamide neuropeptide H
    matp        862     894    FMRFamide neuropeptide I
    matp        895     921    FMRFamide neuropeptide J
    matp        922     954    FMRFamide neuropeptide K
    matp        976    1002    FMRFamide neuropeptide L
    matp       1003    1035    FMRFamide neuropeptide M
    matp       1036    1065    FMRFamide neuropeptide N
    pre-msg <     1    1538    FMRFamide neuropeptide mRNA and introns
    IVS     <     1     219    FMRFamide neuropeptide intron
    signal     1404    1408    poly-A signal
    signal     1482    1486    poly-A signal
    signal     1507    1511    poly-A signal
BASE COUNT      409 a    398 c    431 g    365 t
ORIGIN      Undetermined number of bp after segment 1.
1 gcggctggtt aacttttaac gcgccggccc aacagatctg gccataaata tcatgatggt 61 taattagcag gacactttgt ctgcgcgtcc tggctgccgg tggctactgt taccctaagt 121 aaaacgaacg gtcttaatgt gattaaaagt tgcaatagat aacgcgagat gccgccgcct 181 cataattatc ctgccgtcac tccattctta tccttgcaga tgggcattgc cttgatgttc 241 ctgctggccc tgtaccagat gcagtcggcc atccacagcg agatcatcga tacgcccaac 301 tatgcgggca actcgttgca ggacgctgac tccgaggtga gtccatcgca ggacaatgac 361 ctggtagatg cactgctcgg caacgatcag accgagaggg cggagctgga gttccggcac 421 cccatctctg tgattggcat cgactactcg aagaacgccg tggtgctgca cttccagaaa 481 cacggccgga aaccgcgcta caagtacgat cccgagctgg aggccaagcg aaggtccgtg 541 caggacaact tcatgcactt cggcaagagg caggcggagc agctgccacc ggagggcagc 601 tatgctgaat ccgatgaact ggagggcatg gccaagcgag cagctatgga tcggtatggc 661 agagatccca agcaggactt catgcggttt ggtcgggatc cgaaacagga cttcatgagg 721 tttggcaggg atccaaagca ggacttcatg agattcggtc gggatcccaa gcaggatttc 781 atgagattcg gtcgagatcc caagcaggat ttcatgaggt ttggacgcac tccggctgag 841 gatttcatga ggttcggacg cactccggcg gaggacttca tgaggttcgg acgctccgac 901 aatttcatgc gcttcggacg cagtccccac gaggagcttc gcagtcccaa acaggatttc 961 atgcgattcg gtcgcccgga caacttcatg cgcttcgggc gttccgctcc gcaggatttt 1021 gtgcgctccg ggaagatgga ctcaaacttc attcgattcg gtaagagctt gaagccggcg 1081 gctcccgagt ccaagccagt caagtccaat caaggcaacc caggcgaacg cagtccagtg 1141 gacaaggcca tgacggagct gttcaagaaa caggagctgc aggatcagca ggtgaagaac 1201 ggcgcacagg cgaccaccac gcaggatggg agtgtggaac aggaccagtt cttcggccag 1261 tgaggtagtc ctgcgggacg cctccttgta aatagatatg gacaaatgta cgcaaggatc 1321 taaattgata tacgtatata acccactcct cacacgaact cctgacttat gcctgaacta 1381 tgaattttta atgaatgggc tggattaaaa attcaccgtg ctttgaagtt cttatctata 1441 aatatatcta gtgtaatatt gaagaaattg aaattggcgt gaataaaatc ctgtggcaac 1501 attttaaata aagattgctt tactgtaaat tatggcgcac tcgaatagcc ttatttcatt 1561 cggtcgaggt acatattgta tagccattat tagtttctcc act // LOCUS DRVFMRFRN1 3189 bp ds-DNA INV 08-MAR-1990 DEFINITION D.virilis FMRFamide-related neuropeptide gene, 5' flank, 
            exon 1.
ACCESSION   M32642
KEYWORDS    FMRFamide-related neuropeptide; neuropeptide; secretory granule
            protein.
SEGMENT     1 of 2
SOURCE      D.virilis DNA.
  ORGANISM  Drosophila virilis
            Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta;
            Pterygota; Neoptera; Holometabola; Diptera; Brachycera;
            Cyclorrhapha; Schizophora; Drosophiloidea; Drosophilidae.
REFERENCE   1  (bases 1 to 3189)
  AUTHORS   Taghert,P.H. and Schneider,L.E.
  TITLE     Inter-specific comparison of a Drosophila gene encoding
            FMRFamide-related neuropeptides
  JOURNAL   J. Neurosci. (1990) In press
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly
            submitted by P.H.Taghert, 07-MAR-1990.
FEATURES       from  to/span    description
    pre-msg    2890 >  3189    FMRFamide-related neuropeptide mRNA and introns
    IVS        3059 >  3189    FMRFamide-related neuropeptide intron A
    site       1211    1222    conserved sequence 1
    site       1225    1236    conserved sequence 2
    site       1300    1416    conserved sequence 3
    site       1491    1504    conserved sequence 4
    site       1513    1533    conserved sequence 5
    site       1636    1668    conserved sequence 6
    site       1695    1724    conserved sequence 7
    site       1426    1438    conserved sequence 8
    site       1814    1864    conserved sequence 9
    site       1888    1908    conserved sequence 10
    site       2116    2128    conserved sequence 11
    site       2144    2159    conserved sequence 12
    site       2635    2646    conserved sequence 13
    site       2674    2688    conserved sequence 14
    site       2754    2853    conserved sequence 15
    site       2691    2704    conserved sequence 16
    site       2041    2050    conserved sequence 17
    site       2856    2933    conserved sequence 18
BASE COUNT     1034 a    630 c    656 g    869 t
ORIGIN      1 bp upstream of HindIII site.
1 aagcttctat cagtaaacaa caacaactac cgttattgct caatacctgt agttgttgtc 61 gctgctgctg atttcccatg cagcttagaa catgacgcaa ctgaacagca gcagcagcag 121 cagccgccat cgcacaaata tgttcgttta ccgttttgaa tttatgttta tagcagcagc 181 agcacatgtt gttacgtctt gcaattgttt atggcataaa tctttatagc atacacacac 241 acatacatgc atgtgtacat gcataaattc cgtgtgtgcc cattaattag agagaattgg 301 ccatggataa gctggtcggt tacttattag gttaccacct ggtggactcg aaaacaaata 361 aaactcggca cataaacaaa aaaactaaaa attaagacga aaattcaaca atggcattga 421 catcaaacac attgcctgcg ctttcaaacg gcccatgaca acaagagcta actaactaaa 481 tcgtaaaaaa agagctagta aaatccaatt tttttaacca tctaaaatcg taaagcctat 541 caaaacacgg cttttggcat ttggagcaaa cggttgtcaa gcggagcccc attgagctaa 601 tcaggttctg ctgttgctca ttgacggaca ccaattttat ttatctattt ttattgactc 661 ttctagatgt attccatatc cattgttgag ctgtgatatg cttaaaaaag taacgattaa 721 ctggaaaata ctctttagtg aaatcaacgt aataaaatac ttattggtat ttataataga 781 acaaatgcaa atacaattat atgatcggaa ataaaatttt tttaagtaat gcatttgaaa 841 agaaaagaaa aataaatata tttacaatta ggtttctttt attttggtaa cggatgcata 901 gcctagaaga ggtatatatc gaaaaatatg tctatattta attgtctaat ctgtcaaaaa 961 tgcaaactga aagattatct aaatattagt tgagcagtaa tggattctag tttatttaac 1021 tgatttccaa ttacctaaaa gtaaagtaag aaagaagcag ttgatatatt taaaagattg 1081 cagggcatcg aaaattcgtt gaagtgaact taaagcacgc ccaatcttag tttccgatat 1141 atataattta agttatttat ttcgtatgtg ttactcactg atggaacagc gcgacggagt 1201 ccacgctctc ggtcaagaaa acgctttgaa cttgagcgtt gttagcgtta ggtatttatg 1261 caaatggatt cagcatcatg catatttatc gatgcaaatg tctggctatt gattttcgac 1321 gatcgttaga gaacaaaaaa aaaaaaaaga agctaaaatg aaataagaat tgaagcgaac 1381 tgccagacac acaatggttg acacatttct tgcaagtcat ggtccacatc ttggccggac 1441 gctgggcggg ttcttcaaat tcagacacgc aaaaaacggg ttgaaaacaa aaaacttggc 1501 ctttcaaagt gtggccgtaa ttacagactt ccgcttacga aaccgaaagt gtccgcccag 1561 ccaaatggcc aagaagtatt ggccgaagat ggggttgggg ctgggcgcag tcgggcagac 1621 gttttgggct gctaaaggcg ccacaatgta tcccgttaca ggggccataa aactccataa 1681 gcgactggcg acaaggccaa 
attataccgc atacgcacac gtagccatgg tctaagttgt 1741 ggacccggct aattggacgc gcaccagggg tatggtcagg gggtggtggg gttgtttttg 1801 gattctctat aaacttaaaa gtgggcggca cataacaatt gtgggcgatt tgcatacaat 1861 ttggcgcttg gattaccagc aaagcgcgtc aaaatgttgc tgcgtcatcg cgggccaaaa 1921 accagatgac tttgccatgt gctaaatata aacacatgca catatatata tatatatatg 1981 tgtatatata atatatatag tatatatgca catgcaagcg gctaataaag atattgtatg 2041 gccagcacaa tgcagctgtc tgcgggccat tgtttgtctg acagtccgtc tgacagtttt 2101 tgtctgtcca tcacaaacaa acatatcaca gccaaaatgg caggccgaca ttgtcaaggc 2161 tggcttctgt tcaaagcttg cagtttaacg ataaatgaat tgcacaaaga caccaaaaaa 2221 aaaccaacta tatatgctcc agttgcaagt gcttgactag cagataccct gtagccagcg 2281 acaagctgcc ttgatttgag ttctttgcaa agaaatagtt gatggtcctg ataaataatg 2341 gggctccaat tggattactt taagcctcct tggagcctgg acctgtccaa agccttctcc 2401 gttgtgagag tcgaaaaacc caagagcaag caagtttcta gctcacgaaa cttgcgataa 2461 agaccctctg gtcgggcctc gtagcagcag gagtatttac cagacaggac tcggttgttg 2521 atgctgatca ggaatatata tatatatata tacttaatgg gcagagcttg ctgacacttt 2581 gtatcctctt tcacccgttt acaatgggct tagggtataa agagagaagt cgctgcaact 2641 gttgccagct aacatgttgc ataaacatgc cgcagcaaca cttgttgctg cctcaacaac 2701 acccacacac acacacacac atacacacac acacacacag ctgcacagca acacgcagag 2761 gctttggact gccgcctgcc tattatcctg aacaaaggac accgtcggct gtcggccgtc 2821 ggccggttgt acgttgtcgg ccgccgtcta acggtatata aaaccggatg tttcgccaaa 2881 catttgcaca gaagtgtttc acattgccac ggtcaacaag tgtcctgacc aaaatcaggt 2941 agaagaagaa ccacacgagt ggccagtgaa aagtgtttta aaatttccca tttcccaacg 3001 gaatgtcgat ttgattgcat tgcaatattc gtataatttt aaatatttga aaaaaaaggt 3061 gagtttttca aaattttgaa gccctgagca aagtgatttg agtgaaatac taaagcttag 3121 ttaatgggtg gaagaaagct taaatagtct taaaggcttt aaggcttaaa gaaaaagttt 3181 gaatctaga // LOCUS DRVFMRFRN2 1803 bp ds-DNA INV 08-MAR-1990 DEFINITION D.virilis FMRFamide-related neuropeptide gene, exon 2. ACCESSION M32643 KEYWORDS FMRFamide-related neuropeptide; neuropeptide; secretory granule protein. 
SOURCE D.virilis DNA. ORGANISM Drosophila virilis Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta; Pterygota; Neoptera; Holometabola; Diptera; Brachycera; Cyclorrhapha; Schizophora; Drosophiloidea; Drosophilidae. REFERENCE 1 (bases 1 to 1803) AUTHORS Taghert,P.H. and Schneider,L.E. TITLE Inter-specific comparison of a Drosophila gene encoding FMRFamide-related neuropeptides JOURNAL J. Neurosci. (1990) In press STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by P.H.Taghert 07-MAR-1990. FEATURES from to/span description pept 551 1570 FMRFamide-related neuropeptide precursor pre-msg < 1 > 1570 FMRFamide-related neuropeptide mRNA and intron IVS < 1 539 FMRFamide-related neuropeptide intron A BASE COUNT 471 a 438 c 459 g 435 t ORIGIN 1 tgacggagtg cgaacgttca tatacatcag ctccggcgcc cagcccctgc tcctggcgta 61 gaccatcagc ttagcgcctt taacaagttg tattaaggtt gttgatggcc gtgtgtgtgt 121 gtgtgtgtgt gtgacgggga ggggggcaac tcatgttgct gttgttgctg caggtcgatt 181 gttgttgctg ttgctgttgt tgctgccttt aatgcgcttg gttaactttt tacgcgcaca 241 ttcatatttc gctcttggcc aggcggatta tcgcataaat attgttatgg ttaattagcg 301 ggacactttg tctggccggg caaacaaaag gcgttaagac gctgccaccg gagtagcctg 361 ctgctgcctg gtttttgctg ctgttgccag ttaagtaaaa gtgaatgcgt tcattgcttt 421 atccatgggg gcgttaatgt gattaaaact tgttatagat aatgcggcaa gtgatttagt 481 caacacctca taattatgtt caatttctcc ctctctccct ccctttcact ccatctcagc 541 accttcaatc atgggcatag cactgatgtt cctgctggcc ctctaccaga tgcagtcggc 601 catccatagc gagattattg agacgccatc cagctataat gataactcgc tgctggaggc 661 agctgcggag gagcccaatt cgagagccac cgcgtctgag tcggatctgc tggatggact 721 gatgagcacc gataatccca atccggagca gcaaaccgaa ctggagtttc gctatccgat 781 ctcggccatt ggcattggtt atgccaagaa ttcggtggtg ttgcgtttcc agaagcacgc 841 gcgcaaacag aacttcaagt acgatccgga ctacgagatg aagcgcaagt cgctacagga 901 caatttcatg cactttggca agcggcaggc ggagcagctg cctcaggcaa ctggacccgg 961 ctactatgag tgcatcaagc gctccgccat ggatcgctac ggacgcgatc ccaagcagga 1021 
ctttatgcgt ttcgggcgtg ctccgccttc ggactttatg cgcttcggac gtgcaccctc 1081 ggactttatg cgtttcggac gcgatcccag ccaggatttc atgcgcttcg ggcgctccga 1141 taactttatg cgctttggca ggaacctgaa ctttcacgag gagctgcgca gtcccaaaca 1201 ggatttcatg cgtttcggtc gcccggacaa tttcatgcgc ttcggccgat ccgcgcccac 1261 agaattcgag cgtaatggca aaatggactc gaacttcatg cgctttggca agagatcagg 1321 tgtaatggcc aaattgacca aatcccagct gcagcaaaac aaattgacaa ctgccgatgg 1381 caagcagcag ccagccgagg agggcaatcc cacggacaag gccatctcga tgctattcaa 1441 taagcaccag cagcagcagc agcagcagca ggggcagagg ctgcagcagg aagaccgaca 1501 gcagatgaag agcagcgcag agcagaacaa cttggaggaa gccagcgtgg agcagttcta 1561 tgagccatag attctaccgc gcacagcaca ctgtaaatat aaattggcca aaaaaaccga 1621 acactatgat aattgtactt aacactatgt atgtatgtac ttaaacgcat tatagacatt 1681 gcagaagaaa tttaaaaaag gaaaaaaaaa aacgtaattt gaattttaat ggggctggat 1741 taaaaattca ccacgctttg atctgatcat aagaaaacaa aagaaaaaaa agtagaaaac 1801 tct // LOCUS ECACELS 1224 bp ds-DNA BCT 06-MAR-1990 DEFINITION E.carotovora beta(1,4)-glucan glucanohydrolase (celS) gene, complete cds. ACCESSION M32399 KEYWORDS beta(1,4)-glucan glucanohydrolase; endoglucanase. SOURCE E.carotovora carotovora (strain SCC3193) DNA, clone pHSK25. ORGANISM Erwinia carotovora Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Facultatively anaerobic rods; Enterobacteriaceae. REFERENCE 1 (bases 1 to 1224) AUTHORS Saarilahti,H.T., Henrissat,B. and Palva,E.T. TITLE CelS: A novel endoglucanase identified from Erwinia carotovora subsp carotovora JOURNAL Gene (1900) In press STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence [1] kindly submitted by H.T.Saarilahti, 02-MAR-1990. 
FEATURES from to/span description pept 208 1002 endoglucanase (celS) precursor sigp 208 303 endoglucanase signal peptide matp 304 999 endoglucanase binding 196 200 ribosome binding site BASE COUNT 353 a 278 c 253 g 340 t ORIGIN 1 taaaattaac accctgattt tatttcacat tattcatatt tattaagact ttaatcattt 61 ccccatgaat tggattattc cgacagagtg gaaataacat aaccagttaa actaaccaca 121 tctcagactt gatcatagtc acgtgataac gatgtgttaa ctttcagatt aaaacacctc 181 aagaatatat ttaaaaagag atttattatg cagactgtga atacccagcc acaccgtatt 241 ttccgcgtat tattgcccgc agtattttct tccttgctac tctcatcact aacagtgtct 301 gctgccagtt catcaaacga tgcggataaa ctctatttcg gtaataataa atattaccta 361 ttcaataacg tatggggaaa agatgaaata aaaggatggc agcaaacgat tttctataat 421 agcccaatta gcatggggtg gaactggcac tggccaagca gtacccacag tgttaaagcc 481 tatccttcac tggtgagcgg ttggcactgg acggcgggct atacggaaaa tagcggattg 541 ccgatacaat tatccagtaa taagagtatc accagtaacg ttacttattc tatcaaggct 601 acgggtactt acaacgccgc ttacgacatt tggttccaca ctaccgataa agccaactgg 661 gactcgtcgc ctaccgacga attgatgatt tggctaaatg ataccaatgc aggccctgcg 721 ggtgattata ttgaaaccgt tttcctcggc gacagcagtt ggaacgtatt caaaggctgg 781 atcaacgcgg ataatggtgg aggttggaat gtcttttcct ttgtccatac ttccggcacc 841 aacagtgcat cactcaatat tcgtcacttc accgactatt tggtgcagac aaagcaatgg 901 atgagcgatg aaaaatatat cagcagcgtt gagtttggca ccgagatctt tggcggcgat 961 gggcagatcg acatcaccga atggcgcgta gacgtaaaat agcgcgtctg gcgttgctat 1021 tcctccccgt tcagccgcaa taaacacgtt aacgtagagc tgaaacggat tggtctcatc 1081 gacgttctct tttcattttc agttcagaca gacgttgccc gttctttagg cgggttcgct 1141 tccccccaca acaaaattcg tgagctgtgg cacgtaaatt tcattagcgt tatgataatc 1201 ccctcggcta atcaggaatg tccc // LOCUS ECODCM 2472 bp ds-DNA BCT 26-FEB-1990 DEFINITION E.coli DNA cytosine methylase (dcm) and patch repair protein genes, complete cds. ACCESSION M32307 KEYWORDS DNA cytosine methylase; patch repair protein. SOURCE E.coli (K12) DNA. 
ORGANISM Escherichia coli Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Facultatively anaerobic rods; Enterobacteriaceae. REFERENCE 1 (bases 1 to 2472) AUTHORS Sohail,A., Lieb,M., Dar,M. and Bhagwat,A.S. TITLE Gene required for very short patch repair in Escherichia coli partly overlaps the DNA cytosine methylase gene JOURNAL Unpublished (1990)Wayne St. Univ. Detroit, MI 48202 STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by A.S.Bhagwat, 23-FEB-1990. FEATURES from to/span description pept 250 1668 DNA cytosine methylase (dcm) pept 1649 2119 pot. patch repair protein BASE COUNT 606 a 620 c 695 g 550 t 1 others ORIGIN 43 min on K12 map. 1 gatcatttcc agactaagtt gctgaacgta ccgcaaacca tgcaaaccgc aagaggcaag 61 cacgtggcgc agcacaacgc gcatttttta gtcgagttta tggcgaagct cagtgccgaa 121 ctggcggggg agaatgaagg tgtcgatcac aaggtgatag atgcgttttc atccgctggc 181 tgagcgcgtg gccctaaatg gctgtaatta tgttaacctg tcggccatct cagatggccg 241 gtgaaatcta tgcaggaaaa tatatcagta accgattcat acagcaccgg gaatgccgca 301 caggcaatgc tggagaaact gctgcaaatt tatgatgtta aaacgttggt ggcgcagctt 361 aatggtgtag gtgagaatca ctggagcgcg gcaattttaa aacgtgcgct ggcgaatgac 421 tcggcatggc accgtttaag tgagaaagag ttcgcccatc tgcaaacgtt attacccaaa 481 ccaccggcac atcatccgca ttatgcgttt cgctttatcg atctattcgc cggaattggc 541 ggcatccgtc gcggttttga atcgattggc ggacagtgcg tgtttaccag cgaatggaac 601 aaacatgcgg tacgcactta taaagccaac cattattgcg atccggcgac gcatcatttt 661 aatgaagata tccgcgacat caccctcagc cataaagaag gcgtgagtga tgaggcggcg 721 gcggaacata ttcgtcaaca cattcctgaa cacgatgttt tactggccgg tttcccttgt 781 cagccatttt cgctggctgg cgtatcgaaa aagaactcgc tcgggcgggc gcacggtttt 841 gcctgcgata cccagggcac gctgtttttt gatgtggtac gcattatcga cgcgcgtcgt 901 ccggcgatgt ttgtgctcga aaacgtcaaa aacctgaaaa gtcacgacca gggtaaaacg 961 ttccgcatca tcatgcagac gctggacgaa ctgggctatg acgtggctga tgcagaagat 1021 aatgggccag acgatccgaa aatcatcgac ggcaaacatt ttctgccgca gcaccgtgaa 1081 cgcatcgtgc tggtgggttt tcgtcgcgat 
ctgaatctga aagccgattt taccctgcgt 1141 gatatcagcg aatgtttccc tgcgcagcga gtgacgctgg cgcagctgtt ggacccgatg 1201 gtcgaggcga aatatatcct gacgccggtg ctgtggaagt acctctatcg atatgcgaaa 1261 aaacatcagg cgcgcggtaa cggcttcggt tatggaatgg tttatccgaa caatccgcaa 1321 agcgtcacgc gtacgctgtc tgcgcgttat tacaaagatg gcgcggaaat tttaatcgat 1381 cgcggctggg atatggccac gggtgagaaa gactttgacg atccgctgaa tcagcaacat 1441 cgtccacgtc ggttaacgcc tcgggaatgc gcgcgcttaa tgggttttga agcgccggga 1501 gaagcgaaat tccgtattcc ggtttcggac actcaggcct atcgccagtt cggtaactcg 1561 gtggtcgtgc cggtctttgc cgcggtggca aaactgcttg agccaaaaat caaacaggcg 1621 gtggcgttgc gtcagcaaga ggcacaacat ggccgacgtt cacgataagg ccactcgcag 1681 caaaaatatg cgcgcgattg ccacgcgtga tacggcgata gagaagcgcc tcgccagtct 1741 gttaaccggg cagggcctgg catttcgcgt tcaggacgcc agtctgcccg gacgtccgga 1801 ttttgtcgtt gatgaatatc gctgcgtgat atttacccat ggctgcttct ggcatcatca 1861 tcactgctat ctgtttaaag tgcctgcgac tcgaaccgag ttctggctgg agaagatagg 1921 taaaaatgtt gagcgcgatc gccgcgatat cagtcgcttg caggaactcg gctggcgcgt 1981 attgattgtc tgggagtgcg cgttacgtgg gcgcgagaag ctgacggatg aagcgcttac 2041 cgagcgtctg gaagagtgga tctgcggcga aggtgccagc gcgcagatcg acacgcaggg 2101 gattcattta ctcgcttgat gcgtcctgaa taactggcgc aactacgggt tttgccggga 2161 agagatattt tcccaacgtg accagtacca ccgcgaagac aattacgccg agcgccagcc 2221 attcaatctt cgacagtgtt tctccaccca gtcccgtacc cagcaagacc gcgaccaccg 2281 ggttaacgta agcgtagctg gtggcgagag ccggactgac attacggatt aaatacatat 2341 aagcgttgat ggcgataatc gaaccaaaca gcgccagata gccgaccgca aggaagcctg 2401 aaagggaagg gagcgccgtc agtttttnac ccgcaatcat cgacgcgatc attaacacca 2461 cgcctgccgc cg // LOCUS FCLPOLCYS 3865 bp ss-RNA VRL 06-MAR-1990 DEFINITION Feline calicivirus RNA-dependent RNA polymerase, cystein protease, and 2C protein,. ACCESSION M32296 KEYWORDS RNA polymerase; cysteine protease; nonstructural protein. SOURCE Feline calicivirus (isolate CFI/68 FIV), cDNA to viral RNA, clones pCV[2,8]. 
ORGANISM Feline calicivirus Viridae; ss-RNA nonenveloped viruses; Caliciviridae. REFERENCE 1 (bases 1 to 3865) AUTHORS Neill,J.D. TITLE Nucleotide sequence of a region of the feline calicivirus genome which encodes picornavirus-like RNA-dependent RNA polymerase, cysteine protease and 2C polypeptides JOURNAL Unpublished (1990)Pioneer Hi-Bred International, Johnston, IA 50131 STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence [1] kindly submitted by J.D.Neill, 23-FEB-1990. FEATURES from to/span description pept < 1 3865 polyprotein (AA at 2) matp < 1 451 2C-like protein matp 1802 2242 cysteine protease-like protein matp 2521 3762 RNA-dependent RNA polymerase-like protein BASE COUNT 1057 a 865 c 894 g 1049 t ORIGIN 1 ggttcctgta tgttacattc ttactggacc acctggatgc gggaaaacaa ctgcagcgca 61 ggcattagcc aagaagttat ctgaccaaga accatcggtc atcaaccttg atgtagacca 121 ccatgatacg tacactggca atgaggtgtg cataattgat gaatttgatt cctctgataa 181 ggttgattat gcaaattttg ttattggaat ggttaactcg gcaccaatgg tactaaattg 241 tgatatgctg gaaaataagg gcaagttgtt cacctcaaag tacataataa tgacttccaa 301 ttctgaaaca cctgtcaaac catcctccaa acgcgctggc gcattttatc ggagagttac 361 ctatcatgat gttgctacct tggttgaatc gcacaagcgt gcacggcctg gtactgctgt 421 cccccgtagt tgctacaaga aaaacttttc acatctatcg cttgctaaac gaggcgctga 481 gtgctggtgc aaagagtatg tccttgatcc taagggattg caacaccaga gcacaaaggc 541 tcctccccca accttcctta atattgattc tttggcccaa acaatgaagc aagactttgc 601 tttaaagaac atggcatttg aagctgaggt aggatgtagt gaacatcggt atggatttgt 661 atgtcagcag agtgaagttg aaacagttcg ccgactcctt aatgccatta gaatgaggct 721 taacgctacg tttaccgttt gcgttggcct tgaagcttct aattccgttg ggtgtactgc 781 gcacgtttta acacctgacg aaccattcaa tggaaaaagg ttcgttgtgt cacgctgcaa 841 tgaggcatcg ctatccgcac tagaaggaaa ctgcgttcaa actgcattag gagtgtgcat 901 gtccaataag gacctcactc acctgtgtca cttcattaag gggaaaattg tcaatgatag 961 tgtcaggtta gatgaactac ccgccaatca acatgtggta accgttaatt cggtgtttga 1021 tttggcctgg gctcttcgtc gtcactctac actaactggt cagttccaag 
caatcagagc 1081 cgcatatgat gtgcttcatg tccctgacaa ggtccccgcc atgttgcggc actggatgga 1141 tgaaacctcc ttctctgatg aacacgtggt aacgcagttc attacacctg gtggagttgt 1201 aattcttgag tcgtgtggcg gtgcgcgcat ctgggcactt ggtaacaatg ttatcagggc 1261 tggaggcgtc actgccatac ccactggggg atgtgttagg ttaatgggac tctcagctca 1321 aacaatgcca tggtctgaaa ttctttcgga actcttctct ctgctaggga aaatctggtc 1381 tagtgttaaa gtctctaccc ttattctcac cgctctcagc atgtatgcat ctaggttcag 1441 gcctaaaaca gaagcaaagg gaaaaacaaa atccaaaatt ggtccatata gaggccgagg 1501 tgtggcactt accgatgatg agtacgacga atggaaagaa cacaatgccg ctaggaagtt 1561 ggacctctca gttgaagatt ttctaatgtt gcgtcaccgt gccgcactag gggcggatga 1621 cactgatgcc gtcaaattta ggtcatggtg gaattcaagg tctagattgg ctgatgactt 1681 tgaagacgtc accgtgattg gtaagggtgg cgtaaagcat gagaaaatta gaacaaatac 1741 cctcagagcc gttgaccgtg gctacgatgt cagctttgcc gaggagtcag ggcccggggc 1801 aaaattccac aaaaacgcaa ttgggtcagt cactgacgtc tgtggagaac acaagggcta 1861 ttgtgttcac atgggtcacg gggtctacgc ctcagtagca cacgtggtta agggtgactc 1921 tttctttctg ggtgaaagaa tctttgactt aaaaactaat ggtgagttct gttgcttccg 1981 tagcactaag atcttgccta gtgctgctcc tttcttttct ggaagaccaa cgcgtgaccc 2041 ttggggatct cctgtagcaa ctgactggaa accaaaacct tactcaacca catcggggaa 2101 aattgtaggg tgtttcgcaa caacatcaac tgaaacccac cctggcgact gtggcctccc 2161 gtacattgac gacaatggta gagtcactgg gttgcacaca ggctctgggg gacccaagac 2221 cccaagtgca aaattggttg ttccctacgt acatattgac atgaaaacta aatctgtcac 2281 tgctcaaaag tatgatgtca ctaaaccaga tattagttat aaaggtttag tttgtaaaca 2341 attggatgaa attagaatta tcccaaaagg taccaggcta cacgtatctc ctgcacattt 2401 agaagatttt gaggaatgct ctcaccagcc tgcctctcta ggtagtggtg atcctcggtg 2461 tcccaaatcg ctcactgcaa ttgttgttga ctctcttaag ccctactgtg tggtagttaa 2521 tgggcccccc catgatatct tacatagggt acaaaagatg ctaattgatc atctttctgg 2581 ttttgtcccc atgaacatat catctgatac ctccatgctt tctgctttcc acaaactcaa 2641 ccatgataca tcttgtgggc catatctagg tggtagaaag aaagatcaca tggtcaatgg 2701 agaacctgac aaggcgctat tagatttgtt gtcttcaaaa tggaagttgg caacacaggg 
2761 aattgcactc ccacatgagt acacaattgg tttgaaagat gaactaaggc ccattgaaaa 2821 agtgcaagaa gggaaaagaa ggatgatctg gggctgtgat gtcggcgttg ccactgtatg 2881 tgcggctgca tttaaaggtg ttagtgacgc cattacggca aaccatcaat atggccctat 2941 acaggttggc attaacatgg acagtcccag tgttgaggca cttttccaaa ggatcaaaag 3001 cgcacgaaag gtgtttgctg ttgattactc aaagtgggat tcaacccaat ccccccgcgt 3061 tagtgccgct tcaattgaca tactgcggta cttttctgat cggactccaa ttgttgattc 3121 ggccacaaac acgcttaaaa gcccccccat tgctgtattt aatggagtag ctgtcaaggt 3181 gtcctctggt ctcccatctg gcatgcccct cacatctgta atcaactcgc tcaatcactg 3241 tttatatgtt ggctgtgcaa tcttgcaatc tttggaagct cgtaacgtgc ctgtcacttg 3301 gaatcttttc tcaactttcg acatgatgac ttatggtgat gatggtgttt acatgttccc 3361 cacaatgtat gcaagtatta gtgaccaaat ttttgccaat ctttctgcat atggtcttaa 3421 gcccacaaga gttgacaaat cagttggttc aattgaacca attgatccta attcagttgt 3481 cttcctaaaa agaacaatta cacgaacccc ccaaggaata cgaggcttgc ttgatcgcag 3541 ctcaattctc aggcaattct actacatcaa gggtgagaac acagataatt ggaaagagcc 3601 cccaaaaact atagatccaa tgtcaagagg gcaacaattg tggaatgcat gtctgtatgc 3661 aagtcagcat ggtattgact tttataataa ggtatacaaa ttggccgaaa aagcagtcga 3721 gtacgaagga ttgcatcttg agcccccttc ttacagcaca gccttggagc attacaacag 3781 ccaatttaat ggtgtggagg cgcggactga ccagatcgat acgagtggca tggccgccct 3841 tcactgtgat gtgttcgaag tttga // LOCUS HUMGALOPS 794 bp ds-DNA PRI 06-MAR-1990 DEFINITION Human 1,3-galactosyltransferase pseudogene, 3' end. ACCESSION J05421 KEYWORDS 1,3-galactosyltransferase; pseudogene. SOURCE Human peripheral blood leukocyte DNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 794) AUTHORS Larsen,R.D., Rivera-Marrero,C.A., Ernst,L.K., Cummings,R.D. and Lowe,J.B. TITLE Frameshift and nonsense mutation in a human genomic sequence homologous to a murine UDP-gal-beta-D-gal alpha-1,3 galactosyltransferase cDNA JOURNAL J. Biol. Chem. 
(1990) In press STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence [1] kindly submitted by J.B. Lowe, 02-MAR-1990. FEATURES from to/span description pept.ps / 63 755 pseudo-1,3-galactosyltransferase site 394 394 frameshift mutation site 475 475 frameshift mutation site 427 429 in-frame stop codon site 708 710 in-frame stop codon BASE COUNT 227 a 165 c 181 g 221 t ORIGIN 1 cagcttgtgg tttctttcag gaatcccaga ggataaatgt tttgcttttc ttctttgttt 61 cagatataat gatcattact tggaggagtt cataacatct gctaataggt acttcatggt 121 tggccacaaa gtcatatttt acatcatggt ggatgatgtc tccaagctgc cgtttataga 181 gctgggtcct ctgcattcct tcaaaatgtt tgaggtcaag ccagagaaga ggtggcaaga 241 catcagcatg atgcgtatga agatcactgg ggagcacatc ttggcccaca tccaacacga 301 ggtcgacttc ctcttctgca tggatgtgga ccaggtcttc caagaccatt ttggggtgga 361 gaccctaggc cagtcagtgg ctcagctaca ggctggcggt acaaggcaga tccctatgac 421 tttacctagg agaggtggaa agagtcagca ggatacattc catttggcca ggggattttt 481 attaccatgc agccatttct ggaggaacac ccattcaggt tctcaacatc acccaggagt 541 gctttaaggg aatcctcctg gacaagaaaa atgacataga agccaagtgg catgatgaaa 601 gccacctaaa caagtatttc cttctcaata aaccctctaa aatcttatcc ctaaaatact 661 gctgggatta tcatataggc ctgccttcag atattaaaac tgtcaagtga tcgtggcaga 721 caaaagagta taatttggtt agaaataatg tctgacttca aattgtgcca gtagatttct 781 gaatttaaga gaga // LOCUS MEFTGSRNA 3367 bp ds-DNA BCT 06-MAR-1990 DEFINITION M.fervidus 7S RNA, Ser-tRNA, 16S rRNA and Ala-tRNA genes. ACCESSION M32222 KEYWORDS 16SrRNA; 7S RNA; transfer RNA-Ala; transfer RNA-Ser. SOURCE Methanothermus fervidus. ORGANISM Methanothermus fervidus Prokaryota; Bacteria; Mendosicutes; Archaeobacteria; Methanobacteriales; Methanothermaceae. REFERENCE 1 (bases 1 to 3367) AUTHORS Haas,E.S., Brown,J.W., Daniels,C.J. and Reeve,J.N. 
TITLE Genes encoding the 7S RNA and a Ser-tRNA are linked to one of the two rRNA operons in the genome of the extremely thermophilic archaebacterium Methanothermus fervidus JOURNAL Gene (1990) In press STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by C.J.Daniels, 02-FEB-1990. FEATURES from to/span description pept 132 887 ORF 260 RNA 944 1258 7S RNA tRNA 1273 1359 Ser-tRNA rRNA 1566 3046 16S rRNA tRNA 3105 3178 Ala-tRNA BASE COUNT 898 a 751 c 998 g 720 t ORIGIN 1 ggagacagtg gttcaccagt catacaagat ggtaaggttg tgggtaatat tgtcttcaat 61 atattacaca aatgctacag ctatagcatc ttccctagaa aaatttttag aaaggaagta 121 atgtggtggc catgaggatt ggatttattg gatttggaga agtaagttct actttatcac 181 aattttttaa ggataaggtg gaagtacaaa catgtgttaa aggtaggagt gaaaaaacta 241 aaaaaattgc aaaaaaactt ggagtaaaga tatacaaaga ttataaagac cttgtaaaaa 301 attctgacat tgttatttca gctgttactc cattttcagc tttagatgtg gctaaaaaat 361 atggaaaata tgtaaaaggt atatatgtag atgtgaataa tgtatcacct ctaacaaaac 421 ataaaatttt gaaatatatt gacgaggaaa aatttgttga ttgtgcaata attggaagaa 481 taaaacgtaa atttaaaatg atttgttcag gtaaaaatgc aaataaactt aaaatattag 541 aaaaatttgg agtacctatt gaagttatag gttcaaaggt tggggaagcc tcaacattaa 601 aaatgctcag aagtttatat actaaaagtc ttgctgcaat attattagaa gttttttcgg 661 ttgctaataa attaggactt atcgatgaat tacttgaaat attagaggaa acagaaggga 721 agaaatttgt agatttatgt aagtcaagag tggtaggatc atttattcac tcaaggagaa 781 gatatgaaga aatatgtgaa atagaaaaat ttatattatc tcataatttg aaaccaataa 841 tgatcaaatg tacaaaaaat atgtttaaac atatagagga atgttgacaa agattatatc 901 aaaaaattta aataagattg aaaaataaaa tataaaatgg cagcgaggct aggccggggg 961 gttaggggtc ccctgtaagc gcaaatcccc tatatggcgc ggccgaagcc caggaggcgg 1021 caagaccgcc agacatcggc ctgagggtta aacaatgaag cctcgtccca cagggccacc 1081 ggtggcgagg gtccagctgg agggctggac ctaatcgcct ttgctgcggg aacgggtcag 1141 gcccggaagg gagcagccct accgcagacg gatggtgctt gtgggtcaac ggggtggagt 1201 ctataaccct cagatcaccg gtgtctggtg gtcttgtcca ctcctgggcg tgcctcgctg 1261 
ccaaatgtct tcagtagccg gggtggccca gcctggtagg gcgtcggcct gctaagccga 1321 tgatccgtta aggatcgcgc gggttcaaat cccgtccccg gcgtgcaata ttttctaatt 1381 tttttgtatt gatagtgttg ctaggtatgt aatgccatgg taccggcgtg gattcggagt 1441 gcctatttag gctcgatgaa gaatcgcact ccagccatgg caacaaaatt ggtgagagcc 1501 tgagtgaata actcaggctt gatacatagg tcccacgggt gctggtaaag tcccaaagga 1561 atcccccgcc aactccgttt gatcctggcg gaggccactg ctatgggggt ccgactaagc 1621 catgcaagtc gaacgggcct tgtgcccgtg gcgaacggct cagtaacacg tggacaacct 1681 accctggggt ccgggataac cccgggaaac tggggctaat cccggatagg cgaggtctcc 1741 tggaatgggg cctcgccgaa aggtcttttg accgccccag gatgggtctg cggccgatta 1801 ggtagttggt agggtaacgg cctaccaagc ctacgatcgg tacgggttgt gagagcaaga 1861 gcccggagac ggggcctgag acaaggcccc gggccctacg gggcgcagca ggcgcgaaaa 1921 ctccgcaatg cgcgaaagcg cgacgggggg acccccagtg ccactccgta aggagtggct 1981 tttccggagt gtaaaaagct ccgggaataa gggctgggca agaccggtgc cagccgccgc 2041 ggtaacaccg gcagcccgag tggtggccgc gtttattggg cctaaagcgt ccgtagccgg 2101 tccggtaagt ctccggtgaa agcccgcagc tcaactgcgg gagtagccgg agatactgcc 2161 ggacttgggg ccgggagagg ccggaggtac ccccggggta ggggtgaaat cctgtaatcc 2221 cggggggacc acctgtggcg aaggcgtccg gctggaacgg gcccgacggt gagggacgaa 2281 agccagggga gcgaaccgga ttagataccc gggtagtcct ggccgtaaac gatgcggact 2341 tggtgttggg gcaacctcga gttgccccag tgccgaaggg aagccgttaa gtccgccgcc 2401 tggggagtac ggccgcaagg ctgaaactta aaggaattgg cgggggagca ccacaacgcg 2461 tggagcctgc ggtttaattg gattcaacgc cggacacctc accgggggcg acggcaggat 2521 gatggccagg ttgatgacct tgcctgacga gccgagagga ggtgcatggc cgccgtcagc 2581 tcgtaccgtg aggcgtcctg ttaagtcagg caacgagcga gacccgcgcc cctagttgcc 2641 agcgggtccc gtaagggacg ccgggcacac tagggggacc gccagcgata agctggagga 2701 aggtgcgggc gacggtaggt ccgtatgccc cgaaaccccc gggctacacg cgggctacaa 2761 tggccgggac aatgggtacc gaccccgaaa gggggaggta atcccataaa cccggccgta 2821 gttcggatcg agggctgcaa ctcgccctcg tgaagctgga atgcgtagta atcgcgggtc 2881 actatcccgc ggtgaatacg tccctgctcc ttgcacacac cgcccgtcac gccacccaaa 2941 cggggttcgg 
atgaggccat ggcctctggc catggtcgaa tccgggcccc gtgaggaggg 3001 cgaagtcgta acaaggtagc cgtaggggaa cctgcggctg gatctaacct ccttaaaata 3061 ataatattcc tttgggacac caaaagcacc cgtgggacct tttttatttt tgggcccata 3121 gctcagcctg ggagagcgcc gcccttgcaa ggcggaggcc ccgggttcaa atcccggtgg 3181 gtccatgcaa ctagccatct tatgataggc tagtgaagct agatgatcaa agatgattct 3241 aggtgcatag gtagttcgtg gacaccaact gtagggaact ggtccaaggt ttaagccgcc 3301 cggtggatgg ctcggcttgg gcgccgacga aggccgtggc aagctgcgat aagcccgggt 3361 gaggcgc // LOCUS STASAU3AIM 3360 bp ds-DNA BCT 06-MAR-1990 DEFINITION S.aureus Sau3AI-restriction-enzyme and Sau3AI-modification-enzyme genes, complete cds. ACCESSION M32470 KEYWORDS Sau3AI-modification-enzyme; Sau3AI-restriction-enzyme. SOURCE S.aureus (strain 3AI) DNA. ORGANISM Staphylococcus aureus Prokaryota; Bacteria; Firmicutes; Gram-positive cocci; Micrococcaceae. REFERENCE 1 (bases 1 to 3360) AUTHORS Seeber,S., Kessler,C. and Goetz,F. TITLE Cloning, expression and characterization of the Sau3AI restriction and modification genes in Staphylococcus carnosus TM300 JOURNAL Unpublished (1990) Univ. of Tuebingen, D-7400 Tuebingen 1 STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence [1] kindly submitted by S. Seeber, 02-MAR-1990. 
FEATURES from to/span description pept 448 1917 Sau3AIR protein (ttg start codon) pept 2017 3255 Sau3AIM protein binding 437 441 ribosome binding site binding 2004 2010 ribosome binding site BASE COUNT 1295 a 393 c 638 g 1034 t ORIGIN 1 aagtcgacct tcaccaagac cgaattttcc gcgtataccc gcagtaccaa acgttaattt 61 actttcaaaa ccttctcgct gttcaatgtc agattgctgc tcataaaaat gtttaactaa 121 actatcatta gctctttcta tccataattc tttatccatt gttgctaaac atcctttcaa 181 aatctcagtt agacttaata aaacatgaaa actaaagccc ttacatttat gtaatgaatt 241 ataaagaaat acgccccaaa agtaaaaaaa cacagcccca agacaatact tttcacaagt 301 attatataat agatgtgtat gaaaatgcat ggagtagatg taagagtgat attcaaaatg 361 tgtaaaaaat atggataatt ctatataatt atattattga aattttaaat agcggaaata 421 attatttaat gttaagaggg gataattttg gaaagttatt tgacaaaaca agccgtacat 481 aatcgcgcaa aagaagctgt tggtaaaagt gtattagaat taaatggtgg tgaatcgatt 541 aaacaaagta agagttcagt tggtgatgca tttgaaaatt ggtttggtaa gaaaaaagac 601 agtgatagta aaccagatat ggcagaagct ggggtggaac ttaaggcaac gccatttaaa 661 aagttgaaaa acggaaagta tagctccaaa gaaagattag tattaaatat tataaactat 721 gagaaagtgg caaatgaaaa ttttgaaact agtagttttt tatctaagaa taatactata 781 gaattagctt tctatgaata tatcaaggga acacctagtg ataattggat tattaaagaa 841 gcggtgcttt atgaaatgca taaaaacccg attgattatg aaataattaa acaagattgg 901 gaaataataa atcaatatat taatgaagga aaggcacatg aattgagtga aggtttgaca 961 agttatttag cgccatgtac aaagggtgcg aatgctagtt ctttaagaaa tcagccttat 1021 tcagacataa aagcaaagca aagagcattt tctttgaagt ctgggtatat gacatctatt 1081 ttacgcaaat atgttctagg tgatgaaaaa atagattcaa ttgtcaaaga cccatttgaa 1141 ataaaagaaa aatcaataga ggacatagtc tttgaaaaat ttcagccata tataaactgg 1201 tcaatcgata aattatgcga acatttttct atcaataaag gtgagaaagg tttaaattat 1261 agaatagcct ctgccatttt aaatctaaaa ggtaaaacta ctaaaagtaa accattcccg 1321 gaagttgaag agtttgaaaa atcatctata gtagtcaaaa cagttcattt taataaaaag 1381 aatgtgaata aagaaagtat gtcatttgga gcttttaaat ttgaagaact agctaatgag 1441 gaatgggaag atagtgaagg atatcctagt gcacaatggc gaaacttttt gttagaaaca 1501 
aggtttttat tttttgttgt taaagaagat gaagatggtg tagacatatt caaaggaata 1561 aaatttttta gtatgcctga agaagacata aacggacctg tcaaaagaat gtgggatgat 1621 acagtgaaga aattaaaaga gggtgtcaca ttagaagctg taccggacaa aagtacaaag 1681 gatggttgga gaataaaaaa taattttgta gataaaagtg atgatttaat ttgccatgtt 1741 agaccacaca ctaataacag agactatcgt ggaggaagta atgcagataa gcttcctaaa 1801 aagattaact ggattaatag acctgactca gatgattatt cggatgagtg gatgactaaa 1861 caaagttttt ggataaataa tgactacata aaaaagcaag ttgaagattt attgtagtta 1921 aagtatgtta aaatataaga tattctttta aaatatcgaa cgatcgttcg tattttgtgt 1981 tataataagg ttgaattaag tataggaggt cgcctaatga ataaaattaa agtagtagaa 2041 ttgtttgcgg gtgtaggcgg gtttcgttta ggtttagaaa atacgaaaaa tggtatattt 2101 gacataactt gggcaaatca atgggagccc tcacgaaaaa tccaacatgc atttgattgt 2161 tatagtaaaa gatttaagaa cggcatccat agtaataagg atattgccca ggtatctgat 2221 gaagaaatgg caaatactga agctgatatg attgtaggag gatttccttg ccaagattat 2281 tcagttgcaa ggagtttaaa tggagaatta ggaatacaag gaaaaaaggg cgttttattc 2341 tggcaaatta ttagatatat tcaaaataca tttcctaaat acttgttgct tgaaaatgtt 2401 gatagattat tgaaatcacc ttcgagtcag agagggagag actttgctgt aatgttatca 2461 accttaaatg agttaggcta taatgttgaa tggcgcgtga ttaatgctgc tgattatggc 2521 aatgctcaaa gacgtagaag ggtatttata tttggatata agcaagattt aaactatagc 2581 aaagctatgg aagaaagtcc gttggataaa attatatatc acaatggttt gtttgctgaa 2641 gcttttccga ttgaagatta tgccaataaa aatagagtaa ataggactca tattactcat 2701 gatatagtcg atatttcaga taatttcagt tttcaatttt ataatagtgg aatcatgaaa 2761 aatggagaaa ttttaactat tgacacaata ccaaaatatg aaaaatcagt aaccttagga 2821 gaaattattg aaagtaatgt agatgatggt ttttcattaa atcaagatca aattgataaa 2881 tttaaatatt taagaggacc caaaaagatt aaacgaacta ctaaagatgg tcatgaatat 2941 tatttttcag aaggtggtat gtctgaaaca gattcattag agttacctgc aagaacaatg 3001 cttacaagtg aatcatctat taatagaagt actcattttt taaacgtaga tggtgtttat 3061 agaactttga cacctattga agcagaaagg ttgaatgggt ttccagataa ttggacagaa 3121 ggtatgccaa ttaagatgag atacttttgt atgggcaatg ctcttgttgt gcctttgatt 3181 actagaatag 
gtaatcaaat tgaaaaaatt gatagtatta caaatgatga attcagtcag 3241 ctacgtttat tttaaataaa tcaaatgtag agtgcgtttg atttaacaag tttgtatact 3301 aaagattcat agattattgt atattgacca gtatttatca gcgtattatt ttaatatata // LOCUS TRPTYF1 1321 bp ds-DNA BCT 06-MAR-1990 DEFINITION T.pallidum pallidum antigen TyF1 gene, complete cds. ACCESSION M32401 KEYWORDS antigen. SOURCE T.pallidum pallidum (strain Nichols) DNA. ORGANISM Treponema pallidum Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Spirochetes; Spirochaetales; Spirochaetaceae. REFERENCE 1 (bases 1 to 1321) AUTHORS Noordhoek,G.T., Hermans,P.W.M., Paul,A.N., Schouls,L.M., van der Sluis,J.J. and van Embden,J.D.A. TITLE Treponema pallidum subspecies pallidum (Nichols) and Treponema pallidum subspecies pertenue (CDC 2575) differ in at least one nucleotide: Comparison of two homologous antigens JOURNAL Microb. Pathog. 6, 29-42 (1989) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence [1] kindly submitted by G.T. Noordhoek, 02-MAR-1990. 
FEATURES from to/span description pept 615 1148 antigen TyF1 variant 736 736 g in T.pallidum pallidum; a in T.pallidum pertenue BASE COUNT 349 a 297 c 353 g 322 t ORIGIN 1 aacagcacgc gggcccccgg actacatcgc tcttgtgaac ctcaacggtc gcctttccat 61 actctaagga ataggagcac gagtacgtat cccccggtag tttccccgcc ttccgaacta 121 aaataagagg tattcccatg cgatctgcaa aaggcgcggc aaaaataaag ccacgtgatt 181 cgattgctgc gaccgcggta acgtgctcat cgcggtagaa ttccaccatt tgatcaagac 241 agtaacgaaa tacagccgcg ttcatcagca cgccagtaat gtcgtagtag agaattcctt 301 ttttagggaa atcaatccgc ttacgaattg cgcggtccag cgccgcgtgt ccgtccatag 361 gggcatggta acgtccaata ccacgcacgt caatgatctt accggtttgt tgggaggctt 421 ggtggattga gaattacgtc tcctggaaaa aagatttcgc tgaaacttca cgaaatctcg 481 gtgaaaataa atgattattt taccaatcgg tgaaaaaaag ccgggaaaag tccaaaaaga 541 cagtggttat gctccatttc tttcgatttt ttgttggcat ggtttttgct ttaaagtttg 601 gaggagaaag aacgatgaac atgtgtacag atggaaaaaa ataccacagc accgccacga 661 gcgctgcagt tggagccagc gcccccggtg taccggacgc tcgtgccatt gctgctatct 721 gcgagcaatt gcgccggcat gtagcggatc tgggagtact gtatatcaag ctacataact 781 atcactggca catctacggc attgagttta aacaggtgca tgagctcctt gaagagtatt 841 atgtatcagt tactgaagcc tttgatacga ttgccgagcg gttgttacag ctgggcgcgc 901 aggctcctgc gtctatggct gaataccttg cgttgagtgg aattgcagaa gagacggaga 961 aagagatcac tatcgtctct gcgcttgcgc gcgtaaagcg ggattttgaa tacctaagta 1021 cgcgattcag ccaaacgcaa gtacttgcag ctgaaagtgg ggatgcagtg actgacggca 1081 ttatcacaga catactgagg acgttgggaa aggccatttg gatgcttggt gctaccctga 1141 aagcctaggt agagcaggct gtacgtacaa cacacgtacg gccatgcgct ggaagtcctg 1201 tattttgcac ataaggcctc tctcccgtta cagcatgagg ggagggaggt gttggttgaa 1261 gtgcttgggg aagtgtgcat aatcgtccta cggaaggggg cgttttgtgg aaaaaattgt 1321 t // LOCUS YS5MTII 1257 bp ds-DNA PLN 07-MAR-1990 DEFINITION C.glabrata metallothionein (MTII) gene, complete cds. ACCESSION J05398 KEYWORDS metallothionein. SOURCE C.glabrata (ATCC 2001) DNA. 
ORGANISM Candida glabrata Eukaryota; Plantae; Thallobionta; Basidiomycotina; Deuteromycotina. REFERENCE 1 (bases 1 to 1257) AUTHORS Mehra,R.K., Garey,J.R. and Winge,D.R. TITLE Selective and tandem amplification of a member of the metallothionein gene family in Candida glabrata JOURNAL J. Biol. Chem. (1990) In press STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by J.R.Garey 07-MAR-1990. FEATURES from to/span description pept 631 789 metallothionein (MTII) rpt 1246 > 1257 amplifiable repeat BASE COUNT 420 a 255 c 199 g 383 t ORIGIN 1 ccccgggatc gtttagaccc caagaatcca agatatgttt tagctgacag ggtgcatggc 61 tactccgtga tttaccatat gtccatcgca ttcgtcatgg cctgatgcca atgaccatac 121 tctttagact cggatccgag aagtcatccc tatccatacg atatcttctt ccgagtccat 181 tagataaggc aacgctagat ttagctgatt gattgccctg agaattaaag ggaatcagcg 241 aaaatagaaa aaaaggtttt ggatggatat ttttgttctt tttttgagac attggtgtca 301 ctgccacagc ttaataacat ggcaatcagc agtgtctaga ttcagcggaa tttagtggaa 361 gcgaaatgaa ttcggctgac ttagtgccta tcagtaatat acatatttgc tttttgcttt 421 ggcattgaat ttatcaataa ctgcataccc aaaattaaaa ataaaataga aaaaatagta 481 atatataaag cacccgtaac ttcccatttc tgggaaactt ggaattcatt tctcccattt 541 atcctttctt ctatatatcg aatcaacaca tcaacaatac caacaaactt caacttatac 601 acaacatcta atatttatta tagcttcgaa atgcctgaac aagtcaactg ccaatacgat 661 tgccactgct ccaactgtgc ttgtgaaaat acttgcaact gctgtgccaa gccagcatgt 721 gcttgcacaa actctgcttc caatgaatgc tcctgccaaa cttgcaagtg tcaaacatgc 781 aagtgctaaa cagcattcaa agaagaataa tttctagtat ttttgctatt gttgctttat 841 ttttcatgat tattgataag gtactgggta tcttctgcta atgacgctga actagctcta 901 tatgctagct atatacatta cgtataaaaa aatataaaaa tataaaaata taaaaaaaaa 961 ataaaaaaat cacttcaaaa actataacaa ctaccaagtt tctattttct ctttgactaa 1021 atcaaatcga tctagtaaac attgcagaac aatttgtgta attatcgtct gatctaaggg 1081 ccggcgtaag caattctaac cgaccacact acgttactta acaaccacac ttgctcttga 1141 atacgaaaca gcaagctcta cctatatgaa atggtctaca aaagaagttt tctctagtac 1201 
tctgagaagg ttatattata catcattgaa atgcttaaag ctgatttgca tcccggg //