Path: utzoo!attcan!uunet!zephyr.ens.tek.com!uw-beaver!milton!dali.cs.montana.edu!uakari.primate.wisc.edu!zaphod.mps.ohio-state.edu!usc!snorkelwacker!bionet!root From: GenBank-Updates@genbank.bio.net Newsgroups: bionet.molbio.genbank.updates Subject: Database Update Message-ID: Date: 18 Jul 90 12:00:16 GMT Sender: root@genbank.BIO.NET Distribution: bionet Lines: 3866 Approved: lear@genbank.bio.net Checksum: 44962 229 LOCUS HUMELA2 906 bp ss-mRNA PRI 18-JUL-1990 DEFINITION Human elastase 2 mRNA, complete cds. ACCESSION M16631 KEYWORDS elastase. SOURCE Human pancreas, cDNA to mRNA (library of R.Weiss), clones hpe2-lambda-[4,10]. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 906) AUTHORS Fletcher,T.S., Shen,W.-F. and Largman,C. TITLE Primary structure of human pancreatic elastase 2 determined by sequence analysis of the cloned mRNA JOURNAL Biochemistry 26, 7256-7261 (1987) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly provided by T.S.Fletcher, 05-AUG-1987. A poly-adenylation signal is located at positions 891-896. FEATURES from to/span description pept 22 831 elastase 2 precursor sigp 22 69 elastase 2 signal peptide matp 70 828 elastase 2 BASE COUNT 197 a 274 c 253 g 182 t ORIGIN 161 bp upstream of PstI site. 
1 aaacagtccc agggacacac catgataagg acgctgctgc tgtccacttt ggtggctgga 61 gccctcagtt gtggggaccc cacttaccca ccttatgtga ctagggtggt tggcggtgaa 121 gaagcgaggc ccaacagctg gccctggcag gtctccctgc agtacagctc caatggcaag 181 tggtaccaca cctgcggagg gtccctgata gccaacagct gggtcctgac ggctgcccac 241 tgcatcagct cctccaggac ctaccgcgtg gggctgggcc ggcacaacct ctacgttgcg 301 gagtccggct cgctggcagt cagtgtctct aagattgtgg tgcacaagga ctggaactcc 361 aaccaaatct ccaaagggaa cgacattgcc ctgctcaaac tggctaaccc cgtctccctc 421 accgacaaga tccagctggc ctgcctccct cctgccggca ccattctacc caacaactac 481 ccctgctacg tcacgggctg gggaaggctg cagaccaacg gggctgttcc tgatgtcctg 541 cagcagggcc ggttgctggt tgtggactat gccacctgct ccagctctgc ctggtggggc 601 agcagcgtga aaaccagtat gatctgtgct gggggtgatg gcgtgatctc cagctgcaac 661 ggagactctg gcgggccact gaactgtcag gcgtctgacg gccggtggca ggtgcacggc 721 atcgtcagct tcgggtctcg cctcggctgc aactactacc acaagccctc cgtcttcacg 781 cgggtctcca attacatcga ctggatcaat tcggtgattg caaataacta accaaaagaa 841 gtccctggga ctgtttcaga cttggaaagg tcacagaagg aaaataatat aataaagtga 901 caactc // LOCUS CLOFTHFSA 1965 bp ds-DNA BCT 18-JUL-1990 DEFINITION C.thermoaceticum formyltetrahydrofolate synthetase (FTHFS) gene, complete cds. ACCESSION J02911 KEYWORDS formyltetrahydrofolate synthetase. SOURCE C.thermoaceticum DNA. ORGANISM Clostridium thermoaceticum Prokaryota; Bacteria; Firmicutes; Endospore-forming rods and cocci; Bacillaceae. REFERENCE 1 (bases 1 to 1965) AUTHORS Lovell,C.R., Przybyla,A. and Ljungdahl,L.G. TITLE Primary structure of the thermostable formyltetrahydrofolate synthetase from Clostridium thermoaceticum JOURNAL Biochemistry 29, 5687-5694 (1990) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by C.R.Lovell, 13-APR-1990, for release after publication. FEATURES from to/span description pept 56 1735 formyltetrahydrofolate synthetase (FTHFS) (ttg start codon) (EC 6.3.4.3) mRNA < 1 1760 FTHFS mRNA binding 41 44 ribosomal binding site (put.) 
signal 1761 1785 pot. transcription termination signal BASE COUNT 459 a 576 c 532 g 398 t ORIGIN 1 aatatgtgac gtgcaaccgg atgttcactt tttcgaatga gaggagtgga ttacattgtc 61 caaggtaccc agtgatattg agattgccca ggcagccaaa atgaaaccgg tcatggaact 121 ggcccgggga ctgggcatcc aagaggacga ggtcgagctt tatggtaagt acaaggccaa 181 gatctccctc gatgtctatc gtcgcctcaa agacaagcct gacgggaaac taatcctggt 241 aaccgccatt acccctactc cggccggcga agggaaaact actaccagtg tcggtctcac 301 cgatgccctg gctcgcctgg ggaaaagggt gatggtctgc ctgcgggagc cctccctggg 361 acccagcttt ggtatcaaag gcggtgccgc cggcggtggt tatgcccagg tagtacccat 421 ggaagatatc aacctgcact tcaccggcga tatccacgcc gtcacctatg cccacaacct 481 gctggcggcc atggtggata accacctgca gcagggtaac gtcctgaata ttgatccccg 541 taccatcacc tggcgccggg tcatcgacct taatgaccgg gctctgagga acatagtcat 601 cggcctgggt ggcaaagcca acggcgtacc gcgggagaca gggtttgaca tctccgttgc 661 ctcggaggtt atggcctgcc tgtgcctggc cagcgacctc atggatctca aggaacgttt 721 cagccgcatt gttgtcggct acacctatga cggcaaaccg gtcaccgccg gcgatctgga 781 ggcccagggt tccatggctc ttctcatgaa ggacgccatt aaacccaacc tggtccaaac 841 cctggagaat acgccggcct ttatccacgg tggtcccttc gccaatatcg cccacggttg 901 caacagcatt atcgcaacca agacggccct gaaactggcg gattatgtcg tgacggaagc 961 cggtttcggt gccgacctgg gtgccgagaa gttctatgac gttaaatgcc gttatgccgg 1021 ctttaaaccc gatgccacag tcatcgtggc taccgtccgc gccctcaaga tgcacggcgg 1081 cgtacccaaa tcagacctgg ccactgaaaa cctggaagcc ctgcgggaag gctttgccaa 1141 cctggagaaa cacatcgaaa atatcggcaa gttcggcgta ccggcagtcg tggccatcaa 1201 tgccttcccc accgataccg aggccgagct aaatctcctc tacgagttgt gcgccaaagc 1261 tggggccgaa gttgccctct cggaagtctg ggctaagggc ggcgaaggcg gtctggaact 1321 tgcccggaag gtgttgcaga ccctggagag caggccatcc aacttccatg tcctctacaa 1381 cctggacctg agtattaaag acaaaattgc caaaatcgcc accgagatct acggggccga 1441 cggcgtcaac tatacggccg aagccgacaa agctatccag cgttatgaat ccctgggcta 1501 cggcaacctg ccggtggtca tggccaagac ccaatactcc ttttccgatg acatgaccaa 1561 gctcgggcgg ccgcggaact ttaccatcac cgtgcgcgag gtgcgcctct cggccggagg 1621 
caggcttatc gtccccatca ccggcgccat aatgaccatg cccgggctgc ccaaacgccc 1681 ggcggcctgc aacatcgaca tcgatgccga cggcgtcatt accggtcttt tctagtatgc 1741 ctggagtaaa gcctatccct accccactcc tccagggagt ggggtttttg ctgtctatat 1801 gagaatgata taactaagaa gctggttttc caagccgtac ctatcatagg ggctgcgcca 1861 ccgccatgaa ccacgaaaat gcaaaagata tcattcatgt tagaactctg ttaccgagag 1921 gaacgattat gtctggcatg acgagataga gtataccgta gcgat // LOCUS HUMGLIBSA 176 bp ds-DNA PRI 18-JUL-1990 DEFINITION Human zinc finger protein (GLI) DNA binding region A1. ACCESSION M34816 KEYWORDS GLI protein binding site. SOURCE Human DNA, clone A1. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 176) AUTHORS Kinzler,K.W. and Vogelstein,B. TITLE The GLI gene encodes a nuclear protein which binds specific sequences in the human genome JOURNAL Mol. Cell. Biol. 10, 634-642 (1990) STANDARD simple staff_entry FEATURES from to/span description binding 31 53 human zinc finger protein binding site binding 94 116 human zinc finger protein binding site binding 157 176 human zinc finger protein binding site BASE COUNT 68 a 50 c 36 g 22 t ORIGIN 1 aagaacacag gaaccaactg caagagctcc cgaagaccac ccacaatgat ggttgtatgt 61 cagaaggaca caggaaccaa ctgcaagagc tcccaaagac cacccacaat gatggttgta 121 tgtcagaagg acacaggaac caactgcaag agctcccaaa aaccacccac aatgat // LOCUS HUMGLIBSB 166 bp ds-DNA PRI 18-JUL-1990 DEFINITION Human zinc finger protein (GLI) DNA binding region A2. ACCESSION M34817 KEYWORDS GLI protein binding site. SOURCE Human DNA, clone A2. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 166) AUTHORS Kinzler,K.W. and Vogelstein,B. TITLE The GLI gene encodes a nuclear protein which binds specific sequences in the human genome JOURNAL Mol. Cell. Biol. 
10, 634-642 (1990) STANDARD simple staff_entry FEATURES from to/span description binding 27 49 human zinc finger protein binding site binding 90 112 human zinc finger protein binding site BASE COUNT 60 a 47 c 37 g 22 t ORIGIN 1 acacaggaac caactgcaag agctcccgaa gaccacccac aatgatggtg gtatgttgaa 61 aggacacagg aaccaactgc aagagctccc aaagaccacc cacaatgatg gttgtgtgtc 121 aaaaggacac aggaaccaac tgtaagagct cctgaagacc acccac // LOCUS HUMGLIBSC 313 bp ds-DNA PRI 18-JUL-1990 DEFINITION Human zinc finger protein (GLI) DNA binding region A3. ACCESSION M34818 KEYWORDS GLI protein binding site. SOURCE Human DNA, clone A3. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 313) AUTHORS Kinzler,K.W. and Vogelstein,B. TITLE The GLI gene encodes a nuclear protein which binds specific sequences in the human genome JOURNAL Mol. Cell. Biol. 10, 634-642 (1990) STANDARD simple staff_entry FEATURES from to/span description binding 1 23 human zinc finger protein binding site binding 64 86 human zinc finger protein binding site binding 127 149 human zinc finger protein binding site binding 190 212 human zinc finger protein binding site binding 253 275 human zinc finger protein binding site BASE COUNT 110 a 79 c 75 g 49 t ORIGIN 1 caaagaccac ccacaatgat ggttgtatgt cagaaggaca caggaaccaa ctgcaagagc 61 tcccaaaaac cacccacaat gatggtggta tgtcagatgg acacaggaac caactgcaag 121 agctcccgaa gaccacccac aatgatggtg gtatgttgaa aggacacagg aaccaactgc 181 aagagctccc aaagaccacc cacgatgatg gttgtgtgtc aaaaggacac aggaaccaac 241 tgtaagagct cctgaagacc acccacaatg atggttttat gtcagaagga tacaggaacc 301 agctgcaagg gct // LOCUS HUMGLIBSD 184 bp ds-DNA PRI 18-JUL-1990 DEFINITION Human zinc finger protein (GLI) DNA binding region A4. ACCESSION M34819 KEYWORDS GLI protein binding site. SOURCE Human DNA, clone A4. 
ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 184) AUTHORS Kinzler,K.W. and Vogelstein,B. TITLE The GLI gene encodes a nuclear protein which binds specific sequences in the human genome JOURNAL Mol. Cell. Biol. 10, 634-642 (1990) STANDARD simple staff_entry FEATURES from to/span description binding 32 54 human zinc finger protein binding site binding 95 117 human zinc finger protein binding site binding 158 180 human zinc finger protein binding site BASE COUNT 67 a 46 c 41 g 30 t ORIGIN 1 aaaagacaca ggaaccaact gcaagagctc ccaaagacca cccacaatga tggttgtgtg 61 tcaaaaggac acaggaacca actgtaagag ctcctgaaga ccacccacaa tgatggtttt 121 atgtcagaag gatacaggaa ccagctgcaa gagctcctga agaccaccca caatgatggt 181 tgta // LOCUS HUMGLIBSE 62 bp ds-DNA PRI 18-JUL-1990 DEFINITION Human zinc finger protein (GLI) DNA binding region B1. ACCESSION M34820 KEYWORDS GLI protein binding site. SOURCE Human DNA, clone B1. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 62) AUTHORS Kinzler,K.W. and Vogelstein,B. TITLE The GLI gene encodes a nuclear protein which binds specific sequences in the human genome JOURNAL Mol. Cell. Biol. 10, 634-642 (1990) STANDARD simple staff_entry FEATURES from to/span description binding 11 33 human zinc finger protein binding site BASE COUNT 18 a 26 c 11 g 7 t ORIGIN 1 aaaccagacg cgtggaccac ccaagacgaa attcacactc gaccctagcc ccgccagccc 61 tt // LOCUS HUMGLIBSF 67 bp ds-DNA PRI 18-JUL-1990 DEFINITION Human zinc finger protein (GLI) DNA binding region C1. ACCESSION M34821 KEYWORDS GLI protein binding site. SOURCE Human DNA, clone C1. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 67) AUTHORS Kinzler,K.W. 
and Vogelstein,B. TITLE The GLI gene encodes a nuclear protein which binds specific sequences in the human genome JOURNAL Mol. Cell. Biol. 10, 634-642 (1990) STANDARD simple staff_entry FEATURES from to/span description binding 16 29 human zinc finger protein binding site BASE COUNT 15 a 26 c 15 g 11 t ORIGIN 1 agacccaggc acacagatag accacccagc ttcaggtggg ggaccaccct tgcatcccct 61 ctctgct // LOCUS MLVU3RA 383 bp ss-RNA VRL 18-JUL-1990 DEFINITION Murine leukemia virus CWM-T-15.4 U3 region genomic RNA. ACCESSION M34722 KEYWORDS U3 region; promoter region. SOURCE Murine leukemia virus (strain CWM-T-15.4), cDNA to genomic RNA, from mice (strain CWB). ORGANISM Murine leukemia virus Viridae; ss-RNA enveloped viruses; Positive strand RNA virus; Retroviridae; Oncovirinae; Type C oncovirus group; Mammalian type C oncoviruses; Murine leukemia viruses. REFERENCE 1 (bases 1 to 383) AUTHORS Thomas,C.Y., Coppola,M.A., Holland,C.A. and Massey,A.C. TITLE Oncogenicity and U3 region sequences of class II recombinant MuLVs of CWD mice JOURNAL Virology 176, 166-177 (1990) STANDARD simple staff_entry FEATURES from to/span description signal 112 178 MuLV promoter region binding 131 144 CCAAT-binding transcription factor binding 147 152 Moloney leukemia virus binding protein b binding 156 167 enhancer core binding protein binding 350 362 CCAAT-binding transcription factor binding 363 372 glucocorticoid response element BASE COUNT 127 a 105 c 84 g 67 t ORIGIN 1 aatgaaagac cccttcataa ggcttagcca gctaactgca gtaacgccat cttgcaaggc 61 atggaaaaat accagagctg atgttctcag aaaaacaaga acaaggaagt acagagaggc 121 tggaaagtac cgggactagg gccaaacagg atatctgtgg tcaagcacta gggccctagc 181 taaaacaaca acagtttcaa gagacccaga aactgtctca aggttcccca gatgaccggg 241 gatcaacccc aagcctcatt taaactaacc aatcagctcg cttctcgctt ctgtacccgc 301 gcttattgct gcccagctct ataaaaaggg taaaaacccc acactcggcc ggcccagggc 361 caagaacaga tggtccccag aaa // LOCUS MLVU3RB 348 bp ss-RNA VRL 18-JUL-1990 DEFINITION Murine leukemia virus CWM-T-15 U3 
region genomic RNA. ACCESSION M34723 KEYWORDS U3 region; promoter region. SOURCE Murine leukemia virus (strain CWM-T-15), cDNA to genomic RNA, from mice (strain CWB). ORGANISM Murine leukemia virus Viridae; ss-RNA enveloped viruses; Positive strand RNA virus; Retroviridae; Oncovirinae; Type C oncovirus group; Mammalian type C oncoviruses; Murine leukemia viruses. REFERENCE 1 (bases 1 to 348) AUTHORS Thomas,C.Y., Coppola,M.A., Holland,C.A. and Massey,A.C. TITLE Oncogenicity and U3 region sequences of class II recombinant MuLVs of CWD mice JOURNAL Virology 176, 166-177 (1990) STANDARD simple staff_entry FEATURES from to/span description signal 77 177 MuLV promoter region binding 96 109 CCAAT-binding transcription factor binding 112 117 Moloney leukemia virus binding protein b binding 121 132 enhancer core binding protein binding 142 154 CCAAT-binding transcription factor binding 155 164 glucocorticoid response element BASE COUNT 115 a 97 c 76 g 60 t ORIGIN 1 ctgcagtaac gccatcttgc aaggcatgga aaaataccag agctgatgtt ctcagaaaaa 61 caagaacaag gaagtacaga gaggctggaa agtaccggga ctagggccaa acaggatatc 121 tgtggtcaaa cacctgggcc ccggcccagg gccaagaaca gatggtcccc agaaatagct 181 aaaacaacaa cagtttcaag agacccaaaa actgtctcaa ggttccccag atgaccgggg 241 atcaacccca agcctcattt aaactaacca atcagctcgc ttctcgcttc tgtacccgcg 301 cttattgctg cccagctcta taaaaagggt aaaaacccca cactcggc // LOCUS MLVU3RC 445 bp ss-RNA VRL 18-JUL-1990 DEFINITION Murine leukemia virus CWM-T-25a U3 region genomic RNA. ACCESSION M34724 KEYWORDS U3 region; promoter region. SOURCE Murine leukemia virus (strain CWN-T-25a), cDNA to genomic RNA, from mice (strain CWB). ORGANISM Murine leukemia virus Viridae; ss-RNA enveloped viruses; Positive strand RNA virus; Retroviridae; Oncovirinae; Type C oncovirus group; Mammalian type C oncoviruses; Murine leukemia viruses. REFERENCE 1 (bases 1 to 445) AUTHORS Thomas,C.Y., Coppola,M.A., Holland,C.A. and Massey,A.C. 
TITLE Oncogenicity and U3 region sequences of class II recombinant MuLVs of CWD mice JOURNAL Virology 176, 166-177 (1990) STANDARD simple staff_entry FEATURES from to/span description signal 77 274 MuLV promoter region binding 96 109 CCAAT-binding transcription factor binding 168 179 CCAAT-binding transcription factor binding 238 252 CCAAT-binding transcription factor binding 138 143 Moloney leukemia virus binding protein b binding 209 214 Moloney leukemia virus binding protein b binding 147 158 enhancer core binding protein binding 218 229 enhancer core binding protein binding 180 189 glucocorticoid response element binding 252 263 glucocorticoid response element BASE COUNT 142 a 126 c 104 g 73 t ORIGIN 1 ctgcagtaac gccattttgc aaggcatgaa aaagtaccag agctgaagtt ctcagaaaaa 61 caagaacaag gaagtacaga gaggctaaaa agtacccggc ccagggccaa gaacagatgg 121 tcccagaccg ctaacgacag gatatctgtg gttaagcact agggccccgc ccagggccaa 181 gaacagatgg tccccagacc gctaacgaca ggatatctgt ggttaagcac tagggccccg 241 gcccagggcc aagaacagat ggtccccaga aatagctaaa acaacaacag tttcaagaga 301 cgcagaaact gtctcaaggt tccccagatg accggggatc aaccccaagc ctcatttaaa 361 ctaaccaatc agctcgcttc tcgcttctgt acccgcgctt attgctgccc agctctataa 421 aaagggtaag aaccccacac tcggc // LOCUS MLVU3RD 409 bp ss-RNA VRL 18-JUL-1990 DEFINITION Murine leukemia virus CWM-T-25b U3 region genomic RNA. ACCESSION M34725 KEYWORDS U3 region; promoter region. SOURCE Murine leukemia virus (strain CWN-T-25b), cDNA to genomic RNA, from mice (strain CWB). ORGANISM Murine leukemia virus Viridae; ss-RNA enveloped viruses; Positive strand RNA virus; Retroviridae; Oncovirinae; Type C oncovirus group; Mammalian type C oncoviruses; Murine leukemia viruses. REFERENCE 1 (bases 1 to 409) AUTHORS Thomas,C.Y., Coppola,M.A., Holland,C.A. and Massey,A.C. 
TITLE Oncogenicity and U3 region sequences of class II recombinant MuLVs of CWD mice JOURNAL Virology 176, 166-177 (1990) STANDARD simple staff_entry FEATURES from to/span description signal 112 238 MuLV promoter region binding 131 144 CCAAT-binding transcription factor binding 174 179 Moloney leukemia virus binding protein b binding 183 194 enhancer core binding protein binding 204 215 CCAAT-binding transcription factor binding 216 225 glucocorticoid response element BASE COUNT 133 a 114 c 91 g 71 t ORIGIN 1 aatgaaagac cccttcataa ggcttagcca gctaactgca gtaacgccat tttgcaaggc 61 atgggaaaat accagagctg atgttctcag aaaaacaaga acaaggaagt acagagaggc 121 taaaaggtac ccggcccagg gccaagaaca gatggtcccc agaccgctaa cgacaggata 181 tctgtggtta agcactaggg ccccggccag ggccaagaac agatggtccc cagaaatagc 241 taaaacaaca acagtttcaa gagacccaga aactgtctca aggttcccca gatgaccggg 301 gatcaacccc aagcctcatt taaactaacc aatcagctcg cttctcgctt ctgtacccgc 361 gcttattgct gcccagctct ataaaaaggg taagaacccc acactcggc // LOCUS SIVPBJA 125 bp ss-RNA VRL 18-JUL-1990 DEFINITION Simian immunodeficiency virus upstream enhancer region. ACCESSION M31325 KEYWORDS . SOURCE Simian (mangabey) immunodeficiency virus (isolate PBj14), clone 4.41. ORGANISM Simian immunodeficiency virus Viridae; ss-RNA enveloped viruses; Positive strand RNA virus; Retroviridae; Lentivirinae. REFERENCE 1 (bases 1 to 125) AUTHORS Dewhurst,S., Embretson,J.E., Anderson,D.C., Mullins,J.I. and Fultz,P.N. TITLE Sequence analysis and acute pathogenicity of molecularly cloned SIV JOURNAL Nature 345, 636-640 (1990) STANDARD simple staff_review BASE COUNT 40 a 25 c 38 g 22 t ORIGIN 1 gaagaaaaca agctaagaca gcagggactt tccacaaaga cagcagggac tttccacaaa 61 ggggatgtta cggggaggta ctggggagga gctggctgga acgcccacgt attctctgta 121 taaat // LOCUS CPARRAA 119 bp ss-rRNA RNA 18-JUL-1990 DEFINITION C.paradoxa cellular 5S ribosomal RNA. ACCESSION M33029 KEYWORDS 5S ribosomal rRNA. SOURCE C.paradoxa rRNA. 
ORGANISM Cyanophora paradoxa Eukaryota; Plantae; Thallobionta; Chromophycota; Cryptophyceae; Cryptomonadales; Kathablepharidaceae. REFERENCE 1 (bases 1 to 119) AUTHORS Maxwell,E.S., Liu,J. and Shively,J.M. TITLE Nucleotide sequence of Cyanophora paradoxa cellular and cyanelle- associated 5S ribosomal RNAs JOURNAL Ann. N.Y. Acad. Sci. 503, 559-561 (1987) STANDARD simple staff_entry FEATURES from to/span description rRNA 1 119 5S ribosomal RNA BASE COUNT 25 a 35 c 37 g 22 t ORIGIN 1 gtgtacggct atactaccgg aaaagcgccc gttcccgtcc gattacgaga gcctagcccg 61 gtcaggcccg actagtacta gggtggggga ccacctggga acatcgggtg ctgtacact // LOCUS CPARRAB 118 bp ss-rRNA RNA 18-JUL-1990 DEFINITION C.paradoxa cyanellar 5S ribosomal RNA. ACCESSION M33030 KEYWORDS 5S ribosomal rRNA. SOURCE C.paradoxa cyanellar rRNA. ORGANISM Cyanelle Cyanophora paradoxa Eukaryota; Plantae; Thallobionta; Chromophycota; Cryptophyceae; Cryptomonadales; Kathablepharidaceae; Cyanophora paradoxa. REFERENCE 1 (bases 1 to 118) AUTHORS Maxwell,E.S., Liu,J. and Shively,J.M. TITLE Nucleotide sequence of Cyanophora paradoxa cellular and cyanelle- associated 5S ribosomal RNAs JOURNAL Ann. N.Y. Acad. Sci. 503, 559-561 (1987) STANDARD simple staff_entry FEATURES from to/span description rRNA 1 118 5S ribosomal RNA BASE COUNT 31 a 26 c 31 g 30 t ORIGIN 1 cctggtgctt attgcgtggt ggaaccacac cgatccatac cgaactcggt tgtgaaacat 61 tactgcggcg acaatacttg aagggtaact ttctggaaag atagcttagt gccaggat // LOCUS SNDZSTR 15384 bp ss-RNA VRL 18-JUL-1990 DEFINITION Sendai virus nucleoprotein (NP), phosphoprotein (P), matrix protein (M), F protein, hemagglutinin-neuraminidase (HN), and L protein genes, complete cds. ACCESSION M30202 M30203 M30204 M19661 KEYWORDS F protein; L protein; hemagglutinin-neuraminidase; matrix protein; nucleoprotein; phosphoprotein. SOURCE Sendai virus (Z strain, F1-R mutant, and ts-f1 mutant) RNA. 
ORGANISM Parainfluenza virus type 1 Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses; Paramyxoviridae; Paramyxovirus. REFERENCE 1 (sites) AUTHORS Tashiro,M., Pritzer,E., Khoshnan,M.A., Yamakawa,M., Kuroda,K., Klenk,H.-D., Rott,R. and Seto,J.T. TITLE Characterization of a pantropic variant of Sendai virus derived from a host range mutant JOURNAL Virology 165, 577-583 (1988) STANDARD full staff_review REFERENCE 2 (bases 4866 to 6560; for [1]) AUTHORS Tashiro,M., Pritzer,E., Khoshnan,M.A., Yamakawa,M., Kuroda,K., Klenk,H.-D., Rott,R. and Seto,J.T. JOURNAL Unpublished (1988) STANDARD full staff_review REFERENCE 3 (bases 1 to 15384) AUTHORS Middleton,Y., Tashiro,M., Thai,T., Oh,J., Seymour,J., Pritzer,E., Klenk,H.-D., Rott,R. and Seto,J.T. TITLE Nucleotide sequence analyses of the genes encoding the HN, M, NP, P, and L proteins of two host range mutants of Sendai virus JOURNAL Unpublished (1989) California State University at Los Angeles, Dept. of Microbiology, Los Angeles, CA 90032-8745 STANDARD full staff_review COMMENT Draft entry and computer-readable sequence [1] and [2] kindly submitted by J.T.Seto, 08-JUL-1988, for [3], 27-NOV-1989. 
Author Address: J.T.Seto California State University at Los Angeles Department of Microbiology Los Angeles, CA 90032-8745 FEATURES from to/span description pept 120 1694 nucleoprotein (NP) pept 1844 3550 phosphoprotein (P) pept 3669 4715 matrix protein (M) pept 4866 6563 F protein pept 6693 8420 hemagglutinin-neuraminidase (HN) pept 8556 15242 L protein variant 310 310 c in strain Z and F1-R mutant; t in ts-f1 mutant variant 350 350 c in strain Z and F1-R mutant; t in ts-f1 mutant variant 1580 1580 t in strain Z and F1-R mutant; c in ts-f1 mutant variant 1586 1586 g in strain Z and ts-f1 mutant; a in F1-R mutant variant 2307 2307 g in strain Z; a in F1-R mutant and ts-f1 mutant variant 4020 4020 a in strain Z and F1-R mutant; c in ts-f1 mutant variant 4051 4051 g in strain Z and F1-R mutant; a in ts-f1 mutant variant 4297 4297 c in strain Z and F1-R mutant; t in ts-f1 mutant variant 5053 5053 t in strain Z and F1-R mutant; g in ts-f1 mutant variant 5176 5176 g in strain Z and F1-R mutant; a in ts-f1 mutant variant 5208 5208 c in strain Z and F1-R mutant; t in ts-f1 mutant variant 5212 5212 a in strain Z and F1-R mutant; g in ts-f1 mutant variant 5700 5700 a in strain Z and F1-R mutant; g in ts-f1 mutant variant 5708 5708 t in strain Z and F1-R mutant; c in ts-f1 mutant variant 6530 6530 a in strain Z and F1-R mutant; c in ts-f1 mutant variant 10429 10429 g in strain Z; a in F1-R mutant and ts-f1 mutant variant 15262 15262 c in strain Z and F1-R mutant; a in ts-f1 mutant BASE COUNT 4595 a 3329 c 3756 g 3704 t ORIGIN 1 accaaacaag agaaaaaaca tgtatgggat atgtaatgaa gttatacagg attttagggt 61 caaagtatcc accctgagga gcaggttcca gaccctttgc tttgctgcca aagttcacga 121 tggccgggtt gttgagcacc ttcgatacat ttagctctag gaggagcgaa agtattaata 181 agtcgggagg aggtgctgtt atccccggcc agaggagcac agtctcagtg ttcgtactag 241 gcttaagtgt gactgatgat gcagacaagt tattcattgc aactaccttc ctagctcact 301 cattggacac agataagcag cactctcaga gaggagggtt cctcgtctcc ctgcttgcca 361 tggcttacag tagtccagaa ttgtacttga
caacaaacgg agtaaacgcc gatgtcaaat 421 atgtgatcta caacatagag aaagacccta agaggacgaa gacagacgga ttcattgtga 481 agacgagaga tatggaatat gagaggacca cagaatggct gtttggacct atggtcaaca 541 agagcccact cttccagggt caacgggatg ctgcagaccc tgacacactc cttcaaatct 601 atgggtatcc tgcatgccta ggagcaataa ttgtccaagt ctggattgtg ctggtgaagg 661 ccatcacaag cagcgccggc ttaaggaaag ggttcttcaa caggttagag gcgttcagac 721 aagacggcac cgtgaaaggt gccttagttt tcactgggga gacagttgag gggataggct 781 cggttatgag atctcagcaa agccttgtat ctctcatggt tgagaccctt gtgactatga 841 atactgcaag atctgatctc accacattag agaagaacat ccagatagtt gggaactaca 901 tccgagatgc agggctggct tccttcatga acactattaa atatggggtg gagacaaaga 961 tggcagctct aacgttgtca aacctgaggc ccgatattaa taagcttaga agcctcatag 1021 acacctacct gtcaaaaggc cccagagctc cctttatctg tatcctcaag gaccctgttc 1081 atggtgaatt tgctccaggc aattatcctg cactatggag ttacgccatg ggagtcgccg 1141 tcgtacagaa caaggcaatg cagcagtacg tcacagggag gacatacctt gatatggaaa 1201 tgttcttact aggacaagcc gtggcaaagg atgctgaatc gaagatcacg agtgccttgg 1261 aagatgagtt aggagtgacg gatacagcca aggggaggct cagacatcat ctggcaaact 1321 tgtccggtgg ggatggtgct taccgcaaac caacaggcgg tggtgcaatt gaggtagctc 1381 tagacaatgc cgatatcgac ctagaaacaa aagctcatgc ggaccaggac gctaggggtt 1441 ggggtggaga tagtggtgaa agatgggcac gtcaggtgag tggtggccac tttgtcacac 1501 tacatggggc tgaacggtta gaggaggaaa ccaatgatga ggatgtatca gacatagaga 1561 gaagaatagc catgagactt gcagagagac ggcaagagga ttctgcaacc catggagatg 1621 aaggccgcaa taacggtgtc gatcacgacg aagatgacga taccgcagca gtagctgggg 1681 taggaggaat ctaggatcat acgaggcttc aaggtacttg atccgtagta agaaaaactt 1741 agggtgaaag ttcatccacc gatcggctca ggcaaggcca cacccaaccc caccgaccac 1801 acccagcagt cgagacagcc acggcttcgg ctacacttac cgcatggatc aagatgcctt 1861 cattcttaaa gaagattctg aagttgagag ggaggcgcca ggaggacgag agtcgctctc 1921 ggatgttatc ggattcctcg atgctgtcct gtcgagtgaa ccaactgaca tcggagggga 1981 cagaagctgg ctccacaaca ccatcaacac tccccaagga ccaggctctg ctcatagagc 2041 caaaagtgag ggcgaaggag aagtctcaac accgtcgacc caagataatc 
gatcaggtga 2101 ggagagtaga gtctctggga gaacaagcaa gccagaggca gaagcacatg ctggaaacct 2161 tgataaacaa aatatacacc gggcctttgg gggaagaact ggtacaaact ctgtatctca 2221 ggatctgggc gatggaggag actccggaat ccttgaaaat cctccaaatg agagaggata 2281 tccgagatca ggtattgaag atgaaagcag agagatggct gcgcaccctg ataagagggg 2341 agaagaccaa gctgaaggac ttccagaaga ggtacgagga ggtacatccc tacctgatga 2401 aggagaaggt ggagcaagta ataatggaag aagcatggag cctggcagct cacatagtgc 2461 aagagtaact ggggtcctgg tgattcctag ccccgaactt gaagaggctg tgctacggag 2521 gaacaaaaga agacctacca acagtgggtc caaacctctt actccagcaa ccgtgcctgg 2581 cacccggtcc ccaccgctga atcgttacaa cagcacaggg tcaccaccag gaaaaccccc 2641 atctacacag gatgagcaca tcaactctgg ggacaccccc gccgtcaggg tcaaagaccg 2701 gaaaccacca atagggaccc gctctgtctc agattgtcca gccaacggcc gcccaatcca 2761 cccgggtcta gagaccgact caacaaaaaa gggcatagga gagaacacat catctatgaa 2821 agagatggct acattgttga cgagtcttgg tgtaatccag tctgctcaag aattcgaatc 2881 atcccgagac gcgagttatg tgtttgcaag acgtgcccta aagtctgcaa actatgcaga 2941 gatgacattc aatgtatgcg gcctgatcct ttctgccgag aaatcttccg ctcgtaaggt 3001 agatgagaac aaacaactgc tcaaacagat ccaagagagc gtggagtcat tccgggatat 3061 ttataagaga ttctctgagt atcagaaaga acagaactca ttgctgatgt ccaacctatc 3121 tacacttcat atcatcacag atagaggtgg caagactgac aacacagact cccttacaag 3181 gtccccctcc gtttttgcaa aatcaaaaga gaacaagact aaggctacca ggtttgaccc 3241 atctatggag accctagaag atatgaagta caaaccggac ctaatccgag aggatgaatt 3301 tagagatgag atccgcaacc cggtgtacca agagagggac acagaaccta gggcctcaaa 3361 cgcatcacgt ctcctcccct ccaaagagaa gcccacaatg cactctctca ggctcgtcat 3421 agagagcagt cccctaagca gagctgagaa agcagcatat gtgaaatcat tatccaagtg 3481 caagacagac caagaggtta aggcagtcat ggaactcgta gaagaggaca tagagtcact 3541 gaccaactag atcccgggtg aggcatccta ccatcctcag tcatagagag atccaatcta 3601 ccatcagcat cagccagtaa agattaagaa aaacttaggg tgaaagaaat ttcacctaac 3661 acggcgcaat ggcagatatc tatagattcc ctaagttctc atatgaggat aacggtactg 3721 tggagcccct gcctctgaga actggtccgg ataagaaagc catcccccac atcaggattg 
3781 tcaaggtagg agtccctcct aaacatggag tgagatacct agatttattg ctcttgggtt 3841 tctttgagac accgaaacaa acaaccaatc tagggagcgt atctgacttg acagagccga 3901 ccagctactc aatatgcggc tccgggtcgt tacccatagg tgtggccaaa tactacggga 3961 ctgatcagga actcttaaag gcctgcaccg atctcagaat tacggtgagg aggactgtta 4021 gagcaggaga gatgatcgta tacatggtgg gttcgattgg tgctccactc ctaccatggt 4081 caggcaggct gagacaggga atgatattta atgcaaacaa ggtcgcacta gctccccaat 4141 gcctccctgt ggacaaggac ataagactca gagtggtgtt tgtcaatggg acatctctag 4201 gggcaatcac catagccaag atcccaaaga cccttgcaga ccttgcattg cccaactcta 4261 tatccgttaa tttactggtg acactcaaga ccgggacctc cacagaacaa aagggggtac 4321 tcccagtact tgatgatcaa ggggagaaaa agctcaattt tatggtgcac ctcgggttga 4381 tcaggagaaa ggtcgggaag atatactctg ttgagtactg caagagcaag attgagagaa 4441 tgcggctgat tttctcactt gggttaatcg gcggtataag cttccatgtt caggttattg 4501 ggacactatc taagacattc atgagtcagc tcgcatggaa gagggcagtc tgcttcccat 4561 taatggatgt gaatccccat atgaacatgg tgatttgggc ggcatctgta gaaatcacag 4621 gcgtcgatgc ggtgttccaa ccggccatcc ctcgtgattt ccgctactac cctaatgttg 4681 tggctaagaa catcggaagg atcagaaagc tgtaaatgtg cacccatcag agacctgcga 4741 caatgcccca agcagacacc acctggcagt cggagccacc gggtcactcc ttgtcttaaa 4801 taagaaaaac ttagggataa agtcccttgt gagtgcttgg ttgcaaaact ctccccttgg 4861 gaaacatgac agcatatatc cagagatcac agtgcatctc aacatcacta ctggttgttc 4921 tcaccacatt ggtctcgtgt cagattccca gggataggct ctctaacata ggggtcatag 4981 tcgatgaagg gaaatcactg aagatagctg gatcccacga atcgaggtac atagtactga 5041 gtctagttcc ggtggtagac tttgagaatg ggtgcggaac agcccaggtt atccagtaca 5101 agagcctact gaacaggctg ttaatcccat tgagggatgc cttagatctt caggaggctc 5161 tgataactgt caccagtgat acgacacaaa atgccggtgc tccacagccg aaattcttcg 5221 gtgctgtgat tggtactatc gcacttggag tggcgacatc agcacaaatc accgcaggga 5281 ttgcactagc cgaagcgagg gaggccaaaa gagacatagc gctcatcaaa gaatcgatga 5341 caaaaacaca caagtctata gaactgctgc aaaacgctgt gggggaacaa attcttgctc 5401 taaagacact ccaggatttc gtgaatgatg agatcaaacc cgcaataagc gaattaggct 5461 
gtgagactgc tgccttaaga ctgggtataa aattgacaca gcattactcc gagctgttaa 5521 ctgcgttcgg ctcgaatttc ggaaccatcg gagagaagag cctcacgctg caggcgctgt 5581 cttcacttta ctctgctaac attactgaga ttatgaccac aatcaggaca gggcagtcta 5641 acatctatga tgtcatttat acagaacaga tcaaaggaac ggtgatagat gtggatctaa 5701 agagatatat ggtcaccctg tctgtgaaga tccctattct ttctgaagtc ccaggtgtgc 5761 tcatacacaa ggcatcatct atttcttaca acatagacgg ggaggaatgg tatgtgactg 5821 tccccagcca tatactcagt cgtgcttctt tcttaggggg tgcagacata accgattgtg 5881 ttgagtccag attgacctat atatgcccca gggatcccgc acaactgata cctgacagcc 5941 agcaaaagtg tatcctgggg gacacaacaa ggtgtcctgt cacaaaagtt gtggacagcc 6001 ttatccccaa gtttgctttt gtgaatgggg gcgttgttgc taactgcata gcatccacat 6061 gtacctgcgg gacaggccga agaccaatca gtcaggatcg ctctaaaggt gtagtattcc 6121 taacccatga caactgtggt cttataggtg tcaatggggt agaattgtat gctaaccgga 6181 gagggcacga tgccacttgg ggggtccaga acttgacagt cggtcctgca attgctatca 6241 gacccgttga tatttctctc aaccttgctg atgctacgaa tttcttgcaa gactctaagg 6301 ctgagcttga gaaagcacgg aaaatcctct cggaggtagg tagatggtac aactcaagag 6361 agactgtgat tacgatcata gtagttatgg tcgtaatatt ggtggtcatt atagtgatca 6421 tcatcgtgct ttatagactc agaaggtcaa tgctaatggg taatccagat gaccgtatac 6481 cgagggacac atacacatta gagccgaaga tcagacatat gtacacaaaa ggtgggtttg 6541 atgcaatggc tgagaaaaga tgatcacgac cattatcaga tgtcttgtaa agcaggcatg 6601 gtattcgttg agatctgtat ataataagaa aaacttaggg tgaaagtgag gtcgcgcggt 6661 actttagctt tcacctcaaa caagcacaga tcatggatgg tgataggggc aaacgtgact 6721 cgtactggtc tacttctcct agtggtagca ctacaaaatt agcatcaggt tgggagaggt 6781 caagtaaagt tgacacatgg ttgctgattc tctcattcac ccagtgggct ttgtcaattg 6841 ccacagtgat catctgtatc ataatttctg ctagacaagg gtatagtatg aaagagtact 6901 caatgactgt agaggcattg aacatgagca gcagggaggt gaaagagtca cttaccagtc 6961 taataaggca agaggttata gcaagggctg tcaacattca gagctctgtg caaaccggaa 7021 tcccagtctt gttgaacaaa aacagcaggg atgtcatcca gatgattgat aagtcgtgca 7081 gcagacaaga gctcactcag cactgtgaga gtacgatcgc agtccaccat gccgagggaa 7141 ttgccccact 
tgagccacat agtttctgga gatgccctgt cggagaaccg tatcttagct 7201 cagatcctga aatctcattg ctgcctggtc cgagcttgtt atctggttct acaacgatct 7261 ctggatgtgt taggctccct tcactctcaa ttggcgaggc aatctatgcc tattcatcaa 7321 atctcattac acaaggttgt gctgacatag ggaaatcata tcaggtcctg cagctagggt 7381 acatatcact caattcagat atgttccctg atcttaaccc cgtagtgtcc cacacttatg 7441 acatcaacga caatcggaaa tcatgctctg tggtggcaac cgggactagg ggttatcagc 7501 tttgctccat gccgactgta gacgaaagaa ccgactactc tagtgatggt atcgaggatc 7561 tggtccttga tgtcctggat ctcaaaggga gaactaagtc tcaccggtat cgcaacagcg 7621 aggtagatct tgatcacccg ttctctgcac tataccccag tgtaggcaac ggcattgcaa 7681 cagaaggctc attgatattt cttgggtatg gtggactaac cacccctctg cagggtgata 7741 caaaatgtag gacccaagga tgccaacagg tgtcgcaaga cacatgcaat gaggctctga 7801 aaattacatg gctaggaggg aaacaggtgg tcagcgtgat catccaggtc aatgactatc 7861 tctcagagag gccaaagata agagtcacaa ccattccaat cactgaaaac tatctcgggg 7921 cggaaggtag attattaaaa ttgggtgatc gggtgtacat ctatacaaga tcatcaggct 7981 ggcactctca actgcagata ggagtacttg atgtcagcca ccctttgact atcaactgga 8041 cacctcatga agccttgtct agaccaggaa ataaagagtg caattggtac aataagtgtc 8101 cgaaggaatg catatcaggc gtatacactg atgcttatcc attgtcccct gatgcagcta 8161 acgtcgctac cgtcacgcta tatgccaata catcgcgtgt caacccaaca atcatgtatt 8221 ctaacactac taacattata aatatgttaa ggataaagga tgttcaatta gaggctgcat 8281 ataccacgac atcgtgtatc acgcattttg gtaaaggcta ctgctttcac atcatcgaga 8341 tcaatcagaa gagcctgaat accttacagc cgatgctctt taagactagc atccctaaat 8401 tatgcaaggc cgagtcttaa atttaactga ctagcaggct tgtcggcctt gctgacacta 8461 gagtcatctc cgaacatcca caatatctct cagtctctta cgtctctcac agtattaaga 8521 aaaacccagg gtgaatggga agcttgccat aggtcatgga tgggcaggag tcctcccaaa 8581 acccttctga catactctat ccagaatgcc acctgaactc tcccatagtc agggggaaga 8641 tagcacagtt gcacgtcttg ttagatgtga accagcccta cagactgaag gacgacagca 8701 taataaatat tacaaagcac aaaattagga acggaggatt gtcccctcgt caaattaaga 8761 tcaggtctct gggtaaggct cttcaacgca caataaagga tttagaccga tacacgtttg 8821 aaccgtaccc aacctactct 
cacgaattac ttaggcttga tataccagag atatgtgaca 8881 aaatccgatc cgtcttcgcg gtctcggatc ggctgaccag ggagttatct agtgggttcc 8941 aggatctttg gttgaatatc ttcaagcaac taggcaatat agaaggaaga gaggggtacg 9001 atccgttgca ggatatcggc accatcccgg agataactga taagtacagc aggaatagat 9061 ggtataggcc attcctaact tggttcagca tcaaatatga catgcggtgg atgcagaaga 9121 ccagaccggg gggacccctt gatacctcta attcacataa cctcctagaa tgcaaatcat 9181 acactctagt aacatacgga gatcttgtca tgatactgaa caagttgaca ttgacagggt 9241 atatcctaac ccctgagctg gtcttgatgt attgtgatgt tgtagaagga aggtggaata 9301 tgtctgctgc agggcatcta gataagaagt ccattgggat aacaagcaaa ggtgaggaat 9361 tatgggaact agtggattcc ctcttctcaa gtcttggaga ggaaatatac aatgtcatcg 9421 cactattgga gcccctatca cttgctctca tacaactaaa tgatcctgtt atacctctac 9481 gtggggcatt tatgaggcat gtgttgacag agctacagac tgttttaaca agtagagacg 9541 tgtacacaga tgctgaagca gacactattg tggagtcgtt actcgccatt ttccatggaa 9601 cctctattga tgagaaagca gagatctttt ccttctttag gacatttggc caccccagct 9661 tagaggctgt cactgccgcc gacaaggtaa gggcccatat gtatgcacaa aaggcaataa 9721 agcttaagac cctatacgag tgtcatgcag ttttttgcac tatcatcata aatgggtata 9781 gagagaggca tggcggacag tggcccccct gtgacttccc tgatcacgtg tgtctagaac 9841 taaggaacgc tcaagggtcc aatacggcaa tctcttatga atgtgctgta gacaactata 9901 caagtttcat aggcttcaag tttcggaagt ttatagaacc acaactagat gaagatctca 9961 caatatatat gaaagacaaa gcactatccc ccaggaagga ggcatgggac tctgtatacc 10021 cggatagtaa tctgtactat aaagccccag agtctgaaga gacccggcgg cttattgaag 10081 tgttcataaa tgatgagaat ttcaacccag aagaaattat caattatgtg gagtcaggag 10141 attggttgaa agacgaggag ttcaacatct cgtacagtct caaagagaaa gagatcaagc 10201 aagagggtcg tctattcgca aaaatgactt ataagatgcg agccgtacag gtgctggcag 10261 agacactact ggctaaagga ataggagagc tattcaggga aaatgggatg gttaagggag 10321 agatagacct acttaaaaga ttgactactc tttctgtctc aggcgtcccc aggactgatt 10381 cagtgtacaa taactctaaa tcatcagaga agagaaacga aggcatggga aataagaact 10441 ctggggggta ctgggacgaa aagaagaggt ccagacatga attcaaggca acagattcat 10501 caacagacgg ctatgaaaca 
ttaagttgct tcctcacaac agacctcaag aaatactgct 10561 taaactggag atttgagagt actgcattgt ttggtcagag atgcaacgag atatttggct 10621 tcaagacctt ctttaactgg atgcatccag tccttgaaag gtgtacaata tatgttggag 10681 atccttactg tccagtcgcc gaccggatgc atcgacaact ccaggatcat gcagactctg 10741 gcattttcat acataatcct agggggggca tagaaggtta ctgccagaag ctgtggacct 10801 taatctcaat gagtgcaatc cacctagcag ctgtgagagt gggtgtcagg gtctctgcaa 10861 tggttcaggg tgacaatcaa gctatagccg tgacatcaag agtacctgta gctcagactt 10921 acaagcagaa gaaaaatcat gtctatgagg agatcaccaa atatttcggt gctctaagac 10981 acgtcatgtt tgatgtaggg cacgagctaa aattgaacga gaccatcatt agtagcaaga 11041 tgtttgtcta tagtaaaagg atatactatg atgggaagat tttaccacag tgcctgaaag 11101 ccttgaccaa gtgtgtattc tggtccgaga cactggtaga tgaaaacaga tctgcttgtt 11161 cgaacatctc aacatccata gcaaaagcta tcgaaaatgg gtattctcct atactaggct 11221 actgcattgc gttgtataag acctgtcagc aggtgtgcat atcactaggg atgactataa 11281 atccaactat cagcccgacc gtaagagatc aatactttaa gggtaagaat tggctgagat 11341 gtgcagtgtt gattccagca aatgttggag gattcaacta catgtctaca tctagatgct 11401 ttgttagaaa tattggagac cccgcagtag cagccctagc tgatctcaaa agattcatca 11461 gagcggatct gttagacaag caggtattat acagggtcat gaatcaagaa cccggtgact 11521 ctagttttct agattgggct tcagaccctt attcgtgtaa cctcccgcat tctcagagta 11581 taactacgat tataaagaat atcactgcta gatctgtgct gcaggaatcc ccgaatcctc 11641 tactgtctgg tctcttcacc gagactagtg gagaagagga tctcaacctg gcctcgttcc 11701 ttatggaccg gaaagtcatc ctgccgagag tggctcatga gatcctgggt aattccttaa 11761 ctggagttag ggaggcgatt gcagggatgc ttgatacgac caagtctcta gtgagagcca 11821 gcgttaggaa aggaggatta tcatatggga tattgaggag gcttgtcaat tatgatctat 11881 tgcagtacga gacactgact agaactctca ggaaaccggt gaaagacaac atcgaatatg 11941 agtatatgtg ttcagttgag ctagctgtcg gtctaaggca gaaaatgtgg atccacctga 12001 cttacgggag acccatacat gggctagaaa caccagaccc tttagagctc ttgaggggaa 12061 tatttatcga aggttcagag gtgtgcaagc tttgcaggtc tgaaggagca gaccccatct 12121 atacatggtt ctatcttcct gacaatatag acctggacac gcttacaaac ggaagtccgg 12181 
ctataagaat cccctatttt ggatcagcca ctgatgaaag gtcggaagcc caactcgggt 12241 atgtaagaaa tctaagcaaa cccgcaaagg cggccatccg gatagctatg gtgtatacgt 12301 gggcctacgg gactgatgag atatcgtgga tggaagccgc tcttatagcc caaacaagag 12361 ctaatctgag cttagagaat ctaaagctgc tgactcctgt ttcaacctcc actaatctat 12421 ctcataggtt gaaagatacg gcaacccaga tgaagttctc tagtgcaaca ctagtccgtg 12481 caagtcggtt cataacaata tcaaatgata acatggcact caaagaagca ggggagtcga 12541 aggatactaa tctcgtgtat cagcagatta tgctaactgg gctaagcttg ttcgagttca 12601 atatgagata taagaaaggt tccttaggga agccactgat attgcactta catcttaata 12661 acgggtgctg tataatggag tccccacagg aggcgaatat ccccccaagg tccacattag 12721 atttagagat tacacaagag aacaataaat tgatctatga tcctgatcca ctcaaggatg 12781 tggaccttga gctatttagc aaggtcagag atgttgtaca cacagttgac atgacttatt 12841 ggtcagatga tgaagttatc agagcaacca gtatctgtac tgcaatgacg atagctgata 12901 caatgtctca attagataga gacaacttaa aagagatgat cgcactagta aatgacgatg 12961 atgtcaacag cttgattact gagtttatgg tgattgatgt tcctttattt tgctcaacgt 13021 tcgggggtat tctagtcaat cagtttgcat actcactcta cggcttaaac atcagaggaa 13081 gggaagaaat atggggacat gtagtccgga ttcttaaaga tacctcccac gcagttttaa 13141 aagtcttatc taatgctcta tctcatccca aaatcttcaa acgattctgg aatgcaggtg 13201 tcgtggaacc tgtgtatggg cctaacctct caaatcagga taagatactc ttggccctct 13261 ctgtctgtga atattctgtg gatctattca tgcacgattg gcaagggggt gtaccgcttg 13321 agatctttat ctgtgacaat gacccagatg tggccgacat gaggaggtcc tctttcttgg 13381 caagacatct tgcataccta tgcagcttgg cagagatatc tagggatggg ccaagattag 13441 aatcaatgaa ctctctagag aggctcgagt cactaaagag ttacctggaa ctcacatttc 13501 ttgatgaccc ggtactgagg tacagtcagt tgactggcct agtcatcaaa gtattcccat 13561 ctactttgac ctatatccgg aagtcatcta taaaagtgtt aaggacaaga ggtataggag 13621 tccctgaagt cttagaagat tgggatcccg aggcagataa tgcactgtta gatggtatcg 13681 cggcagaaat acaacagaat attcctttgg gacatcagac tagagcccct ttttgggggt 13741 tgagagtatc caagtcacag gtactgcgtc tccgggggta caaggagatc acaagaggtg 13801 agataggcag atcaggtgtt ggtctgacgt taccattcga tggaagatat 
ctatctcacc 13861 agctgaggct ctttggcatc aacagtacta gctgcttgaa agcacttgaa cttacctacc 13921 tattgagccc cttagttgac aaggataaag ataggctata tttaggggaa ggagctgggg 13981 ccatgctttc ctgttatgac gctactcttg gcccatgcat caactattat aactcagggg 14041 tatactcttg tgatgtcaat gggcagagag agttaaatat atatcctgct gaggtggcac 14101 tagtgggaaa gaaattaaac aatgttacta gtctgggtca aagagttaaa gtgttattca 14161 acgggaatcc tggctcgaca tggattggga atgatgagtg tgaggctttg atttggaatg 14221 aattacagaa tagctcgata ggcctagtcc actgtgacat ggagggagga gatcataagg 14281 atgatcaagt tgtactgcat gagcattaca gtgtaatccg gatcgcgtat ctggtggggg 14341 atcgagacgt tgtgcttata agcaagattg ctcccaggct gggcacggat tggaccaggc 14401 agctcagcct atatctgaga tactgggacg aggttaacct aatagtgctt aaaacatcta 14461 accctgcttc cacagagatg tatctcctat cgaggcaccc caaatctgac attatagagg 14521 acagcaagac agtgttagct agtctcctcc ctttgtcaaa agaagatagc atcaagatag 14581 aaaagtggat cttaatagag aaggcaaagg ctcacgaatg ggttactcgg gaattgagag 14641 aaggaagctc ttcatcaggg atgcttagac cttaccatca agcactgcag acgtttggct 14701 ttgaaccaaa cttgtataaa ttgagcagag atttcttgtc caccatgaac atagctgata 14761 cacacaactg catgatagct ttcaacaggg ttttgaagga tacaatcttc gaatgggcta 14821 gaataactga gtcagataaa aggcttaaac taactggtaa gtatgacctg tatcctgtga 14881 gagattcagg caagttgaag acaatttcta gaagacttgt gctatcttgg atatctttat 14941 ctatgtccac aagattggta actgggtcat tccctgacca gaagtttgaa gcaagacttc 15001 aattgggaat agtttcatta tcatcccgtg aaatcaggaa cctgagggtt atcacaaaaa 15061 ctttattaga caggtttgag gatattatac atagtataac gtatagattc ctcaccaaag 15121 aaataaagat tttgatgaag attttagggg cagtcaagat gttcggggcc aggcaaaatg 15181 aatacacgac cgtgattgat gatggatcac tgggtgatat cgagccatat gacagctcgt 15241 aataattagt ccctatcgtg ccgaacgatc gaagctccgc ggtacctgga agtcttggac 15301 ttgtccatat gacaatagta agaaaaactt acaagaagac aagaaaattt aaaaggatac 15361 atatctctta aactcttgtc tggt // LOCUS HUMKRASPO 111 bp ds-DNA PRI 18-JUL-1990 DEFINITION Human K-ras proto-oncogene, exon 1. ACCESSION M34904 KEYWORDS K-ras proto-oncogene. 
SOURCE Human normal fetal liver DNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 111) AUTHORS Santos,E., Martin-Zanca,D., Reddy,E.P., Pierotti,M.A., Della Porta,G. and Barbacid,M. TITLE Malignant activation of a K-ras oncogene in lung carcinoma but not in normal tissue of the same patient JOURNAL Science 223, 661-664 (1984) STANDARD simple staff_entry FEATURES from to/span description pept 1 / 111 K-ras p21 protein, exon 1 BASE COUNT 35 a 16 c 30 g 30 t ORIGIN 1 atgactgaat ataaacttgt ggtagttgga gctggtggcg taggcaagag tgccttgacg 61 atacagctaa ttcagaatca ttttgtggac gaatatgatc caacaataga g // LOCUS HUMGLYE 498 bp ss-mRNA PRI 18-JUL-1990 DEFINITION Human glycophorin E mRNA, complete cds. ACCESSION M29610 J05183 M29620 KEYWORDS glycophorin. SOURCE Human erythroleukemia proerythroblast cell line K562, cDNA to mRNA, clone pHGpE-7. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 498) AUTHORS Kudo,S. and Fukuda,M. TITLE Identification of a novel human glycophorin, glycophorin E, by isolation of genomic clones and complementary DNA clones utilizing polymerase chain JOURNAL J. Biol. Chem. 265, 1102-1110 (1990) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by S.Kudo 31-OCT-1989. 
FEATURES from to/span description pept 57 293 glycophorin E signal 470 481 polyA signal BASE COUNT 128 a 107 c 94 g 169 t ORIGIN 1 agttgtcttt ggtagttttt ttgcactaac ttcaggagcc agctcgtgat ctcaggatgt 61 atggaaaaat aatctttgta ttactattgt cagaaattgt gagcatatca gcatcaagta 121 ccactggtgt ggcaatgcac acttcaacct cttcttcagt cacaaagagt tacatctcat 181 cacagacaaa tgggataaca ctcattaatt ggtgggcgat ggctcgtgtt atttttgagg 241 tgatgcttgt tgttgttgga atgatcatct taatttctta ctgtattcga tgactgataa 301 aggcatgagg atgtggcctg catgctgcct gatcttgcct agaaccagct gcacctgctg 361 ttctcttgtt atgcaaactg gctgcacctg ctattccttt gcttatgccc caacccttgg 421 ctatcctaac tccctgttct cctgcctatt actgtattct ctacttctaa ataaaaataa 481 aacaaaatac aaattatt // LOCUS BAMSYNAMY 85 bp ds-DNA SYN 18-JUL-1990 DEFINITION Synthetic B.amyloliquefaciens alpha-amylase gene promoter region. ACCESSION M34931 KEYWORDS . SOURCE Synthetic DNA. ORGANISM Artificial gene Artificial sequences; Genes. REFERENCE 1 (bases 1 to 85) AUTHORS Furusato,T., Takano,J.-i., Jigami,Y., Tanaka,H. and Yamane,K. TITLE Two tandemly located promoters, artificially constructed, are active in a Bacillus subtilis alpha-amylase secretion vector JOURNAL J. Biochem. 99, 1181-1190 (1986) STANDARD simple staff_entry FEATURES from to/span description pept 72 > 85 synthetic alpha-amylase BASE COUNT 34 a 8 c 19 g 24 t ORIGIN 1 actcggggtt gttattattt tatcgatatg taaaatataa tttctagaag aaaagaaggt 61 ggagaggaaa catgatccaa aaacg // LOCUS PMP5END 139 bp ss-RNA VRL 18-JUL-1990 DEFINITION Papaya mosaic virus, 5' end. ACCESSION M35054 KEYWORDS . SOURCE Papaya mosaic virus RNA. ORGANISM Papaya mosaic potexvirus Viridae; ss-RNA nonenveloped viruses; Rod-shaped ss-RNA viruses; Potexvirus. REFERENCE 1 (bases 1 to 139) AUTHORS Lok,S. and Abouhaidar,M.G. 
TITLE The nucleotide sequence of the 5' end of papaya mosaic virus RNA: Site of in vitro assembly initiation JOURNAL Virology 153, 289-296 (1986) STANDARD simple staff_entry BASE COUNT 58 a 23 c 30 g 27 t 1 others ORIGIN 1 ggaaaagaaa cacaaagcaa agcaaagcaa agcaactcaa ataaancata tttggtcaag 61 gcacttggta atcaaacggg cacaacccta gattaatgat atggcaaatt tgaggagtgt 121 tttcgaacag ttgaacgag // LOCUS MZEEMA 180 bp ds-DNA PLN 18-JUL-1990 DEFINITION Z.mexicana 180-bp knob-specific repeat region. ACCESSION M32533 KEYWORDS . SOURCE Z.mexicana (Teosinte) 6-day-old seedling DNA, clone pEm1. ORGANISM Zea mexicana Eukaryota; Plantae; Embryobionta; Magnoliophyta; Liliopsida; Commelinidae; Cyperales; Poaceae. REFERENCE 1 (bases 1 to 180) AUTHORS Dennis,E.S. and Peacock,W.J. TITLE Knob heterochromatin homology in maize and its relatives JOURNAL J. Mol. Evol. 20, 341-350 (1984) STANDARD simple staff_review COMMENT Zea mexicana is also known as Zea mays subspecies mexicana. BASE COUNT 58 a 38 c 39 g 45 t ORIGIN 1 ggccacacaa cccccatttt tgtcgaaaat agccatgaat gaccattttc aataatatcg 61 aaggctaaca cctacggatt tttgaccaag aaatggtctc caccagaaat ccaagaatgt 121 gatctatggc aaggaaacat atgtggggtg aggtgtatga gcctctggtc gatgatcaat // LOCUS MZEEMB 180 bp ds-DNA PLN 18-JUL-1990 DEFINITION Z.mexicana 180-bp knob-specific repeat region. ACCESSION M32534 KEYWORDS . SOURCE Z.mexicana (Teosinte) 6-day-old seedling DNA, clone pEm2. ORGANISM Zea mexicana Eukaryota; Plantae; Embryobionta; Magnoliophyta; Liliopsida; Commelinidae; Cyperales; Poaceae. REFERENCE 1 (bases 1 to 180) AUTHORS Dennis,E.S. and Peacock,W.J. TITLE Knob heterochromatin homology in maize and its relatives JOURNAL J. Mol. Evol. 20, 341-350 (1984) STANDARD simple staff_review COMMENT Zea mexicana is also known as Zea mays subspecies mexicana. 
BASE COUNT 60 a 40 c 38 g 42 t ORIGIN 1 ggccacacaa accccatttt tgtcgaaaat agccatgaac gaccattttc aataatatcg 61 aaggctaaca cctacggatt tttgaccaag aaatggtctc caccagaaat ccaagaatgt 121 gatctacgac aaggaaacat atgtggggtg aggtgtacga gcctctggtc gatgatcaat // LOCUS MZEZMA 180 bp ds-DNA PLN 18-JUL-1990 DEFINITION Maize 180-bp knob-specific repeat region. ACCESSION M32521 KEYWORDS . SOURCE Maize 6-day-old seedling DNA, clone pZm4-5. ORGANISM Zea mays Eukaryota; Plantae; Embryobionta; Magnoliophyta; Liliopsida; Commelinidae; Cyperales; Poaceae. REFERENCE 1 (bases 1 to 180) AUTHORS Dennis,E.S. and Peacock,W.J. TITLE Knob heterochromatin homology in maize and its relatives JOURNAL J. Mol. Evol. 20, 341-350 (1984) STANDARD simple staff_review BASE COUNT 58 a 40 c 39 g 43 t ORIGIN 1 ggccacacaa cccccatttt tgtcgaaaat agccatgaac gaccattttc aataataccg 61 aaggctaaca cctacggatt tttgaccaag aaatggtctc caccagaaat ccaagaatgt 121 gatctatggc aaggaaacat atgtggggtg aggtgtatga gcctctggtc gatgatcaat // LOCUS MZEZMB 180 bp ds-DNA PLN 18-JUL-1990 DEFINITION Maize 180-bp knob-specific repeat region. ACCESSION M32522 KEYWORDS . SOURCE Maize 6-day-old seedling DNA, clone pZm4-21. ORGANISM Zea mays Eukaryota; Plantae; Embryobionta; Magnoliophyta; Liliopsida; Commelinidae; Cyperales; Poaceae. REFERENCE 1 (bases 1 to 180) AUTHORS Dennis,E.S. and Peacock,W.J. TITLE Knob heterochromatin homology in maize and its relatives JOURNAL J. Mol. Evol. 20, 341-350 (1984) STANDARD simple staff_review BASE COUNT 58 a 40 c 39 g 43 t ORIGIN 1 ggccacacaa cccccatttt tgtcgaaaat agccatgaac gaccattttc aataataccg 61 aaggctaaca cctacggatt tttgaccaag aaatggtctc caccagaaat ccaagaatgt 121 gatctatggc aaggaaacat atgtggggtg aggtgtatga gcctctggtc gatgatcaat // LOCUS MZEZMC 180 bp ds-DNA PLN 18-JUL-1990 DEFINITION Maize 180-bp knob-specific repeat region. ACCESSION M32523 KEYWORDS . SOURCE Maize 6-day-old seedling DNA, clone pZm4-10. 
ORGANISM Zea mays Eukaryota; Plantae; Embryobionta; Magnoliophyta; Liliopsida; Commelinidae; Cyperales; Poaceae. REFERENCE 1 (bases 1 to 180) AUTHORS Dennis,E.S. and Peacock,W.J. TITLE Knob heterochromatin homology in maize and its relatives JOURNAL J. Mol. Evol. 20, 341-350 (1984) STANDARD simple staff_review BASE COUNT 58 a 41 c 38 g 43 t ORIGIN 1 ggccacacaa cccccatttt tttcgaaaat agccatgaac gaccattttc aataataccg 61 aaggctaaca cctacggatt tttgaccaag aaatggtctc caccagaaat ccaagaatgt 121 gatctatggc aaggaaacat atgtggggtg aggtgtatga gcctctggtc gacgatcaat // LOCUS MZEZMD 180 bp ds-DNA PLN 18-JUL-1990 DEFINITION Maize 180-bp knob-specific repeat region. ACCESSION M32524 KEYWORDS . SOURCE Maize 6-day-old seedling DNA, clone pZm4-25. ORGANISM Zea mays Eukaryota; Plantae; Embryobionta; Magnoliophyta; Liliopsida; Commelinidae; Cyperales; Poaceae. REFERENCE 1 (bases 1 to 180) AUTHORS Dennis,E.S. and Peacock,W.J. TITLE Knob heterochromatin homology in maize and its relatives JOURNAL J. Mol. Evol. 20, 341-350 (1984) STANDARD simple staff_review BASE COUNT 58 a 39 c 40 g 43 t ORIGIN 1 ggccacacaa cccccatttt tgtcgaaaat agccatgaac gaccattttc aataataccg 61 aaggctaaca cctacggatt tttgaccaag aaatggtctc caccagaaat ccaagaatgt 121 gatctatggc aaggaaacat atgtggggtg aggtgtatga gcgtctggtc gatgatcaat // LOCUS MZEZME 180 bp ds-DNA PLN 18-JUL-1990 DEFINITION Maize 180-bp knob-specific repeat region. ACCESSION M32525 KEYWORDS . SOURCE Maize 6-day-old seedling DNA, clone pZm2-3(I). ORGANISM Zea mays Eukaryota; Plantae; Embryobionta; Magnoliophyta; Liliopsida; Commelinidae; Cyperales; Poaceae. REFERENCE 1 (bases 1 to 180) AUTHORS Dennis,E.S. and Peacock,W.J. TITLE Knob heterochromatin homology in maize and its relatives JOURNAL J. Mol. Evol. 
20, 341-350 (1984) STANDARD simple staff_review BASE COUNT 59 a 40 c 38 g 43 t ORIGIN 1 ggccacacaa cccccatttt tgtcgaaaat agccatgaac gaccattttc aataataccg 61 aaggctaaca cctacggatt tttgaccaag aaatggtctc caccagaaat ccaagaatgt 121 gatctatggc aaggaaacat atgtggggtg aggtgtatga gcctctagtc gatgatcaat // LOCUS MZEZMF 180 bp ds-DNA PLN 18-JUL-1990 DEFINITION Maize 180-bp knob-specific repeat region. ACCESSION M32526 KEYWORDS . SOURCE Maize 6-day-old seedling DNA, clone pZm4-7. ORGANISM Zea mays Eukaryota; Plantae; Embryobionta; Magnoliophyta; Liliopsida; Commelinidae; Cyperales; Poaceae. REFERENCE 1 (bases 1 to 180) AUTHORS Dennis,E.S. and Peacock,W.J. TITLE Knob heterochromatin homology in maize and its relatives JOURNAL J. Mol. Evol. 20, 341-350 (1984) STANDARD simple staff_review BASE COUNT 59 a 40 c 40 g 41 t ORIGIN 1 ggccacacaa cccccatttt tgtcgaaaat agccatgaac gaccattttc aaaaataccg 61 aaggctaaca cctacggatt tttgaccaag aaatggtctc caccagaaat ccaagaatgt 121 gatctatggc aaggaaacat atgtggggtg aggtgtatga gcctgtggtc gacgatcaat // LOCUS MZEZMG 179 bp ds-DNA PLN 18-JUL-1990 DEFINITION Maize 180-bp knob-specific repeat region. ACCESSION M32527 KEYWORDS . SOURCE Maize 6-day-old seedling DNA, clone pZm4-26. ORGANISM Zea mays Eukaryota; Plantae; Embryobionta; Magnoliophyta; Liliopsida; Commelinidae; Cyperales; Poaceae. REFERENCE 1 (bases 1 to 179) AUTHORS Dennis,E.S. and Peacock,W.J. TITLE Knob heterochromatin homology in maize and its relatives JOURNAL J. Mol. Evol. 20, 341-350 (1984) STANDARD simple staff_review BASE COUNT 59 a 38 c 38 g 44 t ORIGIN 1 ggccacaaac ccccattttt gtcgaaaata gccatgaacg accattttca ataataccga 61 aggctaacac ctatggattt ttgaccaaga aatggtctcc accagaaatc caagaatgtg 121 atctatggca aggaaacata tgtggggtga ggtgtatgag cctctagtcg atgatcaat // LOCUS MZEZMH 180 bp ds-DNA PLN 18-JUL-1990 DEFINITION Maize 180-bp knob-specific repeat region. ACCESSION M32528 KEYWORDS . SOURCE Maize 6-day-old seedling DNA, clone pZm4-14. 
ORGANISM Zea mays Eukaryota; Plantae; Embryobionta; Magnoliophyta; Liliopsida; Commelinidae; Cyperales; Poaceae. REFERENCE 1 (bases 1 to 180) AUTHORS Dennis,E.S. and Peacock,W.J. TITLE Knob heterochromatin homology in maize and its relatives JOURNAL J. Mol. Evol. 20, 341-350 (1984) STANDARD simple staff_review BASE COUNT 61 a 39 c 38 g 42 t ORIGIN 1 ggccacacaa cccccatttt tgtcgaaaat agccatgaat gaccatttac aataataccg 61 aaggctaaca cctacggatt tttgaccaag aaatggtctc caccagaaat ccaaaaatgt 121 gatctatgga aaggaaacat atgtggggtg aggtgtacga gcctctggtc gatgatcaat // LOCUS MZEZMI 180 bp ds-DNA PLN 18-JUL-1990 DEFINITION Maize 180-bp knob-specific repeat region. ACCESSION M32529 KEYWORDS . SOURCE Maize 6-day-old seedling DNA, clone pZm4-13. ORGANISM Zea mays Eukaryota; Plantae; Embryobionta; Magnoliophyta; Liliopsida; Commelinidae; Cyperales; Poaceae. REFERENCE 1 (bases 1 to 180) AUTHORS Dennis,E.S. and Peacock,W.J. TITLE Knob heterochromatin homology in maize and its relatives JOURNAL J. Mol. Evol. 20, 341-350 (1984) STANDARD simple staff_review BASE COUNT 61 a 40 c 36 g 43 t ORIGIN 1 ggccacacaa cccccatttt tgtcgaaaat aaccatgaac gaccattttc aataatatcg 61 aaggctaaca cctatggatt tttgaccaag aaatggtctc caccagaaat ccaagaatgt 121 aatctacgac aaggaaacat atgtggggtg aggtgtacga gcctctggtc gatgatcaat // LOCUS MZEZMJ 180 bp ds-DNA PLN 18-JUL-1990 DEFINITION Maize 180-bp knob-specific repeat region. ACCESSION M32530 KEYWORDS . SOURCE Maize 6-day-old seedling DNA, clone pZm4-1. ORGANISM Zea mays Eukaryota; Plantae; Embryobionta; Magnoliophyta; Liliopsida; Commelinidae; Cyperales; Poaceae. REFERENCE 1 (bases 1 to 180) AUTHORS Dennis,E.S. and Peacock,W.J. TITLE Knob heterochromatin homology in maize and its relatives JOURNAL J. Mol. Evol. 
20, 341-350 (1984) STANDARD simple staff_review BASE COUNT 59 a 35 c 38 g 48 t ORIGIN 1 ggccacacaa cccgcatttt ttttgaaaat agccatgaat gaccattttc aataatatta 61 gaggctaaca cctacgagtt tttaaccaag aaatggtctc caccagaaat tcaagaatgt 121 gatctatggc aaggaaacat atgtggggtg aggtgtatga gcctctggtc gacgatcaat // LOCUS MZEZMK 179 bp ds-DNA PLN 18-JUL-1990 DEFINITION Maize 180-bp knob-specific repeat region. ACCESSION M32531 KEYWORDS . SOURCE Maize 6-day-old seedling DNA, clone pZm2-1(I). ORGANISM Zea mays Eukaryota; Plantae; Embryobionta; Magnoliophyta; Liliopsida; Commelinidae; Cyperales; Poaceae. REFERENCE 1 (bases 1 to 179) AUTHORS Dennis,E.S. and Peacock,W.J. TITLE Knob heterochromatin homology in maize and its relatives JOURNAL J. Mol. Evol. 20, 341-350 (1984) STANDARD simple staff_review BASE COUNT 62 a 39 c 37 g 41 t ORIGIN 1 ggccacacaa cccccatttt ttcaaaaata gccatgaacg accattttca aaaatacgaa 61 aggctaacac gtacggattt ttgaccatga aatggtctcc accagaaatc cgaaaatgag 121 atctacgaca aggaaacata tgtggggtga ggtgtatgag cctctggtcg atgatcaat // LOCUS MZEZML 180 bp ds-DNA PLN 18-JUL-1990 DEFINITION Maize 180-bp knob-specific repeat region. ACCESSION M32532 KEYWORDS . SOURCE Maize 6-day-old seedling DNA, clone pZm2-1(II). ORGANISM Zea mays Eukaryota; Plantae; Embryobionta; Magnoliophyta; Liliopsida; Commelinidae; Cyperales; Poaceae. REFERENCE 1 (bases 1 to 180) AUTHORS Dennis,E.S. and Peacock,W.J. TITLE Knob heterochromatin homology in maize and its relatives JOURNAL J. Mol. Evol. 20, 341-350 (1984) STANDARD simple staff_review BASE COUNT 59 a 34 c 40 g 47 t ORIGIN 1 ggccacacaa cccccatttt tgtcgaaaat agccatgaat gaccattttc aaaaatacga 61 aaggctaaca cgtacggatt tttgaccatg aaatggtttt caccagaaat tcaagaatgt 121 gatctatggc aaggaaacat atgtggggtg aggtgtatga gcctctggtc gatgatcaat // LOCUS TRD2AA 177 bp ds-DNA PLN 18-JUL-1990 DEFINITION T.dactyloides 180-bp knob-specific repeat region. ACCESSION M32535 KEYWORDS . SOURCE T.dactyloides 6-day-old seedling DNA, clone 2. 
ORGANISM Tripsacum dactyloides Eukaryota; Plantae; Embryobionta; Magnoliophyta; Liliopsida; Commelinidae; Cyperales; Poaceae. REFERENCE 1 (bases 1 to 177) AUTHORS Dennis,E.S. and Peacock,W.J. TITLE Knob heterochromatin homology in maize and its relatives JOURNAL J. Mol. Evol. 20, 341-350 (1984) STANDARD simple staff_review BASE COUNT 60 a 38 c 40 g 39 t ORIGIN 1 ggccacacaa cccccattta tgtcggaaat agcgatgaac gaccattttc aataatacga 61 aggtaacacc tacgaaattt tgaccaagat aggtctcaac cagaaatcca agaatgtgac 121 ctatggcaag gaaacatatg tggggtgagg tgtacgagcc tctggtcgat gatcaat // LOCUS TRD2AB 180 bp ds-DNA PLN 18-JUL-1990 DEFINITION T.dactyloides 180-bp knob-specific repeat region. ACCESSION M32536 KEYWORDS . SOURCE T.dactyloides 6-day-old seedling DNA, clone 4. ORGANISM Tripsacum dactyloides Eukaryota; Plantae; Embryobionta; Magnoliophyta; Liliopsida; Commelinidae; Cyperales; Poaceae. REFERENCE 1 (bases 1 to 180) AUTHORS Dennis,E.S. and Peacock,W.J. TITLE Knob heterochromatin homology in maize and its relatives JOURNAL J. Mol. Evol. 20, 341-350 (1984) STANDARD simple staff_review BASE COUNT 59 a 40 c 40 g 41 t ORIGIN 1 ggccacacaa cccccatttt tgtcgaaaat agcgatgaac gaccattttc aataatacca 61 aaggctaaca cctatggatt ttggaccaag aaatggtctc caccagaaat ccaagaatgt 121 gacctatggc aaggaaacat atgtggggtg aggtgtacga gcctctggtc gatgatcaat // LOCUS TRD2AC 180 bp ds-DNA PLN 18-JUL-1990 DEFINITION T.dactyloides 180-bp knob-specific repeat region. ACCESSION M32537 KEYWORDS . SOURCE T.dactyloides 6-day-old seedling DNA, clone 7. ORGANISM Tripsacum dactyloides Eukaryota; Plantae; Embryobionta; Magnoliophyta; Liliopsida; Commelinidae; Cyperales; Poaceae. REFERENCE 1 (bases 1 to 180) AUTHORS Dennis,E.S. and Peacock,W.J. TITLE Knob heterochromatin homology in maize and its relatives JOURNAL J. Mol. Evol. 
20, 341-350 (1984) STANDARD simple staff_review BASE COUNT 60 a 41 c 37 g 42 t ORIGIN 1 ggccacacaa cccccatttt tgtttaaaat agccaagaac gaccattttc aataataccg 61 aatgctaaca cctatggatt attgaccaag aaatggtctc caccagaaat ccaagaatgt 121 gacctatggc aaggaaacat atgtggggtg aggtgtacga gcctccggtc gatgatcaat // LOCUS TRD2AD 179 bp ds-DNA PLN 18-JUL-1990 DEFINITION T.dactyloides 180-bp knob-specific repeat region. ACCESSION M32538 KEYWORDS . SOURCE T.dactyloides 6-day-old seedling DNA, clone 8. ORGANISM Tripsacum dactyloides Eukaryota; Plantae; Embryobionta; Magnoliophyta; Liliopsida; Commelinidae; Cyperales; Poaceae. REFERENCE 1 (bases 1 to 179) AUTHORS Dennis,E.S. and Peacock,W.J. TITLE Knob heterochromatin homology in maize and its relatives JOURNAL J. Mol. Evol. 20, 341-350 (1984) STANDARD simple staff_review BASE COUNT 58 a 39 c 38 g 44 t ORIGIN 1 ggccacacaa cccccatttt ttttgaaaat agccatgaac gaccattttc aataataccg 61 aaggtaacac ctacggattt ttgaccaaga aatggtctcc accagaaatc caagaatgtg 121 acctatggca aggaaacata tgtggggtga ggtgtatgag cctctggtcg atgatcaat // LOCUS VSVNNJ 1329 bp ss-mRNA VRL 18-JUL-1990 DEFINITION Vesicular stomatitis virus (New Jersey) nucleocapsid protein mRNA, complete cds. ACCESSION K02379 M35062 KEYWORDS nucleocapsid protein; protein N. SOURCE VSV (New Jersey serotype; Ogden strain), cDNA to mRNA, clones 53 and 77. ORGANISM Vesicular stomatitis virus Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses; Rhabdoviridae; Vesiculovirus. REFERENCE 1 (bases 1 to 1329) AUTHORS Banerjee,A.K., Rhodes,D.P. and Gill,D.S. TITLE Complete nucleotide sequence of the mRNA coding for the N protein of vesicular stomatitis virus (New Jersey serotype) JOURNAL Virology 137, 432-438 (1984) STANDARD full staff_review REFERENCE 2 (bases 1234 to 1329) AUTHORS Rhodes,D.P. and Banerjee,A.K. 
TITLE Poly(A)-adjacent sequence of the 14.5 S mRNA of vesicular stomatitis virus (New Jersey serotype) JOURNAL Virology 105, 297-300 (1980) STANDARD simple staff_entry FEATURES from to/span description pept 14 1282 nucleocapsid protein mRNA 1 1329 N mRNA conflict 1264 1264 a in [1]; g in [2] BASE COUNT 443 a 271 c 281 g 334 t ORIGIN 182 bp upstream of BstNI site. 1 aacagatatc aaaatggctc ctacagttaa gagaatcatt aatgactcca taattcagcc 61 taaactaccg gccaatgagg atccggtaga gtatccagct gattacttca aaaataacac 121 caacatagta ttatatgtga gcactaaagt agccctaaat gatttaaggg cctatgtcta 181 ccagggaatc aaatccggta atccatcaat actccacata aatgcttatc tttatgccgc 241 gttaaaggga gtagaaggaa cattagacag agactgggtt agctttggaa gaacgatcgg 301 aaagagggaa gagaatgtca aaatcttcga tctagtcaaa gttgaggaac tgaagacagc 361 tcttcctgat gggaaatcgg atccagatcg atcagctgaa gatgataaat ggcttcctat 421 ttacatccta ggcctatatc gagtgggtag atccaaagtt acagattaca gaaagaaact 481 cctggacggt ctagaaaatc agtgtagagt ggcatcaacc agatttgaaa gccttgttga 541 agatggtctt gacttctttg acatctggga aaatgatccc aacttcacca aaatagttgc 601 tgcagtggat atgttttttc acatgttcaa gaagcatgaa cgtgctccaa tcagatatgg 661 aaccatagtc tcacgattca aagactgtgc agcacttgct acatttggac atctcagcaa 721 ggttagcgga ctttcaattg aagacctaac aacatgggtt ctgaatcgag aagtcgcaga 781 cgaactatgc caaatgatgt acccaggcca agaaattgac aaggcagatt catacatgcc 841 atatatgatt gatttcgggt tatcccaaaa atctccttat tcatcagtga agaatccagc 901 tttccacttc tggggtcaac tggccgcact cttgctaaga tcaactcgag caaaaaatgc 961 tcggcaaccc gatgacattg agtacacttc gctgacttgt gcaagtttac tactgtcatt 1021 tgcagttgga tcttcagcag acattgaaca acaattctac atcggagagg acaaatacac 1081 aacagagaaa gatgacagtc tcaagaaatc tgatgtccca ccgaaaggaa gaaatgttgt 1141 agactggctc ggttggtatg atgacaatgg agggaaaccc acaccggata tgctcaactt 1201 cgcacgaaga gcagtcagtt cgttgcagtc actccgcgag aaaacaattg gcaaatacgc 1261 aaaagtagaa tttgacaaat gaccaggttt gacatatcaa tatcaatatt atattctgtg 1321 ctatatatg // LOCUS TNCPOLY 3660 bp ss-RNA VRL 18-JUL-1990 DEFINITION Tobacco necrosis 
virus RNA-dependent RNA polymerase and coat protein genes, complete cds. ACCESSION M33002 KEYWORDS RNA-dependent RNA polymerase; coat protein; polymerase. SOURCE Tobacco necrosis virus (strain A, isolate TNV-A-FM1B) Wk6 cell line, cDNA to viral RNA, clones pFM2[0,1,2,3]. ORGANISM Tobacco necrosis virus Viridae; ss-RNA nonenveloped viruses. REFERENCE 1 (bases 1 to 3660) AUTHORS Meulewaeter,F., Seurinck,J. and van Emmelo,J. TITLE Genome structure of tobacco necrosis virus strain A JOURNAL Virology (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by F.Meulewaeter, 20-MAR-1990, for release after publication. FEATURES from to/span description pept 60 665 RNA-dependent RNA polymerase (put.) /transl_except=(666..668) 669 2234 RNA-dependent RNA polymerase pept 2218 2436 ORF 3 pept 2440 2610 ORF 4 pept 2613 3443 coat protein pept 3467 3646 ORF 6 site 666 668 readthrough codon BASE COUNT 1034 a 890 c 895 g 841 t ORIGIN 1 agtattcata ccaagaatac caaataggtg caaggcctta ctcagctaaa gagtctaaaa 61 tggagctacc aaaccaacac aagcaaacgg ccgccgaggg tttcgtatct ttcctaaact 121 ggctatgcaa cccatggaga cgacagcgaa cagtcaacgc tgcagttgcg ttccaaaaag 181 atcttctcgc cattgaggat tccgagcatt tggatgacat caatgagtgt ttcgaggagt 241 ctgctggggc acaatctcag cgaactaagg ttgtcgccga cggagcatat gcccccgcaa 301 aatccaacag gacccgccga gttcgtaagc agaagaagca caagtttgta aaatatcttg 361 tcaacgaagc tcgtgccgag tttggattgc ccaaaccaac tgaggcaaac agacttatgg 421 tccaacattt cttgctcaga gtgtgcaagg attggggcgt tgttactgcc cacgtacacg 481 gcaatgttgc actagctttg ccactggtgt tcatcccaac ggaagatgat ctgctatcac 541 gagcattgat gaacacacat gctactagag ccgctgtacg aggcatggac aatgtccaag 601 gggaggggtg gtggaacaat aggttgggga ttgggggcca ggtcggactg gccttccggt 661 ccaaataggg gtgccttgaa aggaggccag gattctccac gtccgtttcg cgtggggaac 721 atcctgatct ggtggtcata ccatcagggc gccctgagaa acagcgtcag ttgttacgct 781 atagtggtat aggcggccat ttattaatcg gcatccacaa caactctctt tccaacctgc 841 gtaggggctt gatggaaaga 
gtattctatg tcgaggggcc caatgggctt caagacgccc 901 ctaagcccgt caagggagct tttcgaaccc ttgataagtt tcgtgatctc tatactaaaa 961 atagttggcg tcatacccct gtaactagtg aacaattcct aatgaattac acgggcagga 1021 aactgactat ttacagagag gcggttgata gtttgtcgca tcaacccctt agctcacgag 1081 atgcgaaact aaagacattc gtgaaggccg aaaaattaaa tctttctaag aagcctgacc 1141 ctgctcccag ggtcatccaa cctagatcgc ctcggtataa cgtttgtttg ggcaggtacc 1201 tccgacatta tgagcatcac gcgtttaaaa ccattgccaa gtgctttggg gaaatcacgg 1261 tcttcaaagg gtttactctg gagcaacaag gggaaatcat gcgctcgaag tggaataaat 1321 atgttaatcc cgtcgcagtc ggactcgacg ccagtcgttt cgaccaacac gtgtctgttg 1381 aagcactcga gtatgagcat gaattttacc tcagagacta cccaaatgat aaacagctaa 1441 aatggctgct aaagcagcaa ttgtgcaacg taggaacggc attcgccagt gacggcatta 1501 taaaatacaa gaagaagggt tgtagaatga gcggagacat gaacacgagc ttgggcaact 1561 gcattctaat gtgcgccatg gtctacgggt tgaaagaaca cttaaacatc aatttgtccc 1621 ttgcaaataa tggggatgac tgcgtcattg tctgtgagaa agcggattta aagaaattga 1681 caagcagcat cgagccatat ttcaagcagt ttggattcaa gatggaagtg gaaaaacccg 1741 tggatatatt tgagcgcata gaattttgcc aaacccaacc tgtgttcgat ggatcccagt 1801 acatcatggt acgcaaacct tctgtggtaa catctaaaga cgtcactagc cttatcccat 1861 gtcaaacgaa agcacaatac gcagaatggc tgcaagctgt aggtgagtgt ggcatgagca 1921 ttaacggtgg gattcctgtc atgcagaatt tctaccaaaa gctccaaact ggcatccgcc 1981 gcacaaaatt caccaagacc ggcgagttcc agacgaacgg attggggtat cactctagat 2041 atatgcatag agtggcccgg gttccttcgc ctgaaacccg tttatccttc tatctagctt 2101 tcggtatcac accagacctc caagaagcat tggagatctt ctatgatacc cacaagcttg 2161 agttggatga tgttatccca actgatacct accaagtgtc aggagagcat ttgatcaatg 2221 gattaccaaa ctgatgtaac ggaggacaat gtgcaaatac gcggtcgggc taggagcgtt 2281 gagggtaaga aacacaatgg ttcgggatta actggcgtta agcgtcacgc ggtgagcgaa 2341 acatctcaga aatcacagca aggtactggc aatggaacta tgaccaatat agccgaagaa 2401 cagaccatta ccgtgacata caactttaac ttttaagtta tggctgcgtg tcgctgttgt 2461 gatacttcac caggtattac actattccct tactttgcaa ttctcatcct tatattggca 2521 atacttgttg tagggactcc caatcaacaa 
tatcaccatt ctccaagcac ttacgagtac 2581 aagactcaac acatttcgat cgcaaaatag acatggcagg aaagaagaac aacaacaacg 2641 gtcagtatat aatactgcgt actccagagc aacaggtgga gatagaccag cgcaacgccc 2701 gtcgtgctca aatgggtcgc atgaagaagg ctagacagcc cgttcagcga tacttacagc 2761 aacacgggtt gcgaaacgga ttgtccggta gagggggcta catagtggct cccacctccg 2821 ggggggttgt cactcgaccc atagtgccga aattctccaa caggggagat tccactatag 2881 tccgtaacac tgagattttg aacaaccaaa tcttagcggc gctaggcgca ttcaatacaa 2941 caaactccgc actgattgca gcagcaccat catggctggc tagcatcgct gatctttaca 3001 gtaaatacag atggctctca tgtgagatca tctacattcc aaaatgcccc accaccacca 3061 gtggatcaat tgccatggct ttcacatacg acagaaatga cgctgcaccc accgcaaggg 3121 ctcagctgtc acaatcttac aaggccatca attttccacc gtatgcggga tacgacggag 3181 cagcatattt gaattcgaac cagggagctg ggtcagccat cgccgttcaa cttgatgtta 3241 ccaagttgga caagccatgg taccccacta tctcctctgc cggcttcggg gcgctcagcg 3301 tcctcgatca gaaccaattc tgccccgcgt cccttgtggt cgctagcgat gggggacccg 3361 ctactgctac tccagcaggg gaccttttca tcaagtacgt gattgagttc attgaaccaa 3421 tcaacccaac aatgaacgtc tagttctttg tactgtaact tggctaatgc ctaaggtgga 3481 gtcacaccat tggagacgga gacggatcct gggaaacagg cttgacgggc ggggggtggt 3541 gcccccgacg acgcatcact ccggatacca atggtacacc actatggcag ggtctgccaa 3601 ggtcttgtgc accaagaacc cctggaaacg ggggggaggg gggtagcaca tatcatccag // LOCUS BLYPROZ 75 bp ss-mRNA PLN 18-JUL-1990 DEFINITION Barley protein Z mRNA, partial cds. ACCESSION M35065 KEYWORDS albumin; protein Z. SOURCE Barley, cDNA to mRNA, clone paz1-1. ORGANISM Hordeum vulgare Eukaryota; Plantae; Embryobionta; Magnoliophyta; Liliopsida; Commelinidae; Cyperales; Poaceae. REFERENCE 1 (bases 1 to 75) AUTHORS Rasmussen,S.K., Hopp,H.E., Brandt,A., Svendsen,I. and Hejgaard,J. TITLE A cDNA clone for protein Z, a major barley endosperm albumin JOURNAL Carlsberg Res. Commun. 
49, 385-390 (1984) STANDARD simple staff_entry FEATURES from to/span description pept < 1 > 75 protein Z (AA at 1) BASE COUNT 12 a 19 c 19 g 25 t ORIGIN 1 cccctgaagg tggatttggt ggattttgtc gcgaatcacc ctttcctctt ccttattcgg 61 gaagacattg ccggt // LOCUS BSURPOF 3078 bp ds-DNA BCT 18-JUL-1990 DEFINITION B.subtilis minor sigma-37 factor of RNA polymerase (rpoF, sigB), complete cds. ACCESSION M34995 KEYWORDS RNA polymerase; minor sigma-37 factor. SOURCE B.subtilis (strains 168 and Marburg), clones lambda-[21,gt11,1,11,21,31]. ORGANISM Bacillus subtilis Prokaryota; Bacteria; Firmicutes; Endospore-forming rods and cocci; Bacillaceae. REFERENCE 1 (bases 368 to 2297) AUTHORS Duncan,M.L., Kalman,S.S., Thomas,S.M. and Price,C.W. TITLE Gene encoding the 37,000-dalton minor sigma factor of Bacillus subtilis RNA polymerase: Isolation, nucleotide sequence, chromosomal locus, and cryptic function JOURNAL J. Bacteriol. 169, 771-778 (1987) STANDARD full staff_review REFERENCE 2 (bases 1 to 3078) AUTHORS Kalman,S., Duncan,M.L., Thomas,S.M. and Price,C.W. TITLE Similar organization of the sigB and spoIIA operons encoding alternate sigma factors of Bacillus subtilis RNA polymerase JOURNAL J. Bacteriol. (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by C.W.Price, 18-FEB-1987. Draft entry and computer-readable sequence for [2] kindly submitted by C.W.Price, 07-JUN-1990. FEATURES from to/span description pept 200 529 ORF V pept 526 1008 ORF W pept 968 1762 37 kd minor sigma factor (rpoF, sigB; ttg start codon) pept 1762 2361 ORF X mRNA 131 2368 sigB operon mRNA signal 131 168 sigB operon promoter signal 2368 2401 sigB transcription termination signal binding 188 194 ORF V ribosomal binding site binding 511 519 ORF W ribosomal binding site binding 954 961 sigB ribosomal binding site binding 1750 1753 ORF X ribosomal binding site BASE COUNT 999 a 575 c 755 g 749 t ORIGIN 40 degrees on B.subtilis map. 
1 ctgcagaagc tcattgagga acatatgtgt tcctctgcgc aggaaatggt caaaaacatt 61 tatgacagcc tcctcaaatt gcaggatttt cagcttcacg atgattttac gttaattgtt 121 ttgcggagaa aggtttaacg tctgtcagac gagggtataa agcaactagt gatttgaagg 181 aaaatttgag gtgatacgaa tgaatataaa tgttgatgtg aagcaaaacg agaatgatat 241 acaagtaaac attgcaggag aaattgatgt atactcagcc ccggtgctta gagagaagct 301 cgttcctctg gcagaacaag gagctgactt aagaatttgc ctgaaagatg tcagctacat 361 ggacagtacc ggattgggcg tttttgtagg gacctttaaa atggtgaaaa aacaaggtgg 421 ttcgctgaaa cttgaaaatc tttctgaacg gctgatccga ctgtttgaca ttacaggctt 481 gaaggacatc attgatattt ctgcaaagtc agaaggtgga gtgcaatgaa gaataatgct 541 gattacatcg aaatgaaagt gccggcccaa cctgaatatg tgggaattat aagactgacg 601 ctgtcagggg tcgcaagcag aatgggctat acgtacgatg aaattgaaga cttgaaaatc 661 gcagtcagtg aggcgtgcac aaatgcggtt cagcacgctt acaaagaaga taaaaatggg 721 gaagtgtcaa tacgattcgg tgtgtttgaa gaccgtttag aggttattgt ggcggatgaa 781 ggagacagct ttgactttga tcaaaagcag caggatctag ggccgtacac accttcgcac 841 acagttgatc aattatcaga aggagggctc ggtctatatt taatggaaac gctcatggat 901 gaagtcagag tgcaaaacca ctccggcgtc accgtagcga tgacaaagta tttaaatggg 961 gagcgagttg atcatgacac aaccatcaaa aactacgaaa ctaactaaag atgaagtcga 1021 tcggctcata agcgattacc aaacaaagca agatgaacaa gcgcaggaaa cgcttgtgcg 1081 ggtgtataca aatctggttg acatgcttgc gaaaaaatac tcaaaaggca aaagcttcca 1141 cgaggatctc cgccaggtcg gcatgatcgg gctgctaggc gcgattaagc gatacgatcc 1201 tgttgtcggc aaatcgtttg aagcttttgc aatcccgaca atcatcggtg aaattaaacg 1261 tttcctcaga gataaaacat ggagcgttca tgtgccgaga cgaattaaag aactcggtcc 1321 aagaatcaaa atggcggttg atcagctgac cactgaaaca caaagatcgc cgaaagtcga 1381 agagattgcc gaattcctcg atgtttctga agaagaggtt cttgaaacga tggaaatggg 1441 caaaagctat caagccttat ccgttgacca cagcattgaa gcggattcgg acggaagcac 1501 tgtcacgatt cttgatatcg tcggatcaca ggaggacgga tatgagcggg tcaaccagca 1561 attgatgctg caaagcgtgc ttcatgtcct ttcagaccgt gagaaacaaa tcatagacct 1621 tacgtatatt caaaacaaaa gccaaaaaga aactggggac attctcggta tatctcaaat 1681 gcacgtctcg cgcttgcaac 
gcaaagctgt gaagaagctc agagaggcct tgattgaaga 1741 tccctcgatg gagttaatgt aatgatccag gttgaagaaa acgagcatat tcaaactcta 1801 gtatatcaac tgaataaaga agggaaatcg atttgcggtg acagtttttt tatgaaagct 1861 gatgataagg agttaatttg cgcggttgct gacggactgg gaagtggatc acttgctaac 1921 gaatcctctg cagccatcaa agacttagtg gaaaactatg cgagtgaaga cgtagaaagc 1981 attatcgaac gctgtaatca ggcgatgaaa aacaaaagag gcgctacagc atctatcctg 2041 aaaatcaatt ttgagcaaag gcagtttacg tactgctctg tcggaaatgt acggtttatt 2101 ctgcattccc cgtctggtga aagcttttat cctctgccga tttcgggcta tttatcaggc 2161 aagccgcaaa aatacaaaac gcacaccgcc acctatgaaa agggttcaaa gttcattata 2221 catacagatg gactcaacgt acctgatatc cgctcccatt tgaaaaaagg ccaatcggta 2281 gaagaaatat cgaattcgct gaaaatgtat acgacatcga gaaaagatga cctgacctat 2341 attctcggac agctgtccta aaaaaccaga aaaagaagct ggacatccgg cttctttttt 2401 ttgcggttga ctattaataa gtgcgcttat tataattaat aagtgaactt atcattctgg 2461 gagcttatgg gatgtggaat tatgaataca cagcgatagc tgatgtggat gtgggaaaac 2521 ctggaactgt tatcaggaaa cagaaacaat ggatcaatgg gagacgggta ttgcatcaga 2581 ttctttgaaa ggacaatttc aaaatggaac ggaaggaaca ttgcttctgc aaggaaaaac 2641 aattctctta tggcttgagc atgtggagaa acatcgggcg ttttcttgtc atggtcaaca 2701 tgccccgcat agcaatgtaa taatggagtt ttttcacgtg cttcaaaagc aaaaaagaag 2761 atttgtgtat cacatatgga attgtattag aaggcgctga agcaaaaaag tggggagaaa 2821 tcattgggac tgaactgtct aaagacatgc cgacggctgt cagcaggctt gtccatcttt 2881 acggaggagt gataaaatga acacaaatta ctggatcggc gttgtgtcag agcaacatgt 2941 attaaaaggc gctgccggcg gttttgcaca gctgtgccat ggaaaaaaag ccccgctcgc 3001 caaaatgaaa gagggagatt ggctgattta ttattctccg agagacgcat atccagacgg 3061 caaactgctg cgaagctt // LOCUS HUMERPR 1624 bp ss-mRNA PRI 18-JUL-1990 DEFINITION Human erythropoietin receptor mRNA, complete cds. ACCESSION M34986 KEYWORDS erythropoietin receptor. SOURCE Human cell line OCI-M1, cDNA to mRNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. 
REFERENCE 1 (bases 1 to 1624) AUTHORS Ehrenman,K.St. and St John,T. TITLE Sequence of the human erythropoietin gene JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by T.St.John, 07-JUN-1990. Author address: T.St.John Hutchinson Cancer Res Center 1124 Columbia St. Seattle, WA Email: TSTJOHN@FHCRCVAX FEATURES from to/span description pept 1 1527 erythropoietin receptor precursor sigp 1 72 erythropoietin receptor signal peptide matp 73 1524 erythropoietin receptor BASE COUNT 276 a 537 c 498 g 313 t ORIGIN 1 atggaccacc tcggggcgtc cctctggccc caggtcggct ccctttgtct cctgctcgct 61 ggggccgcct gggcgccccc gcctaacctc ccggacccca agttcgagag caaagcggcc 121 ttgctggcgg cccgggggcc cgaagagctt ctgtgcttca ccgagcggtt ggaggacttg 181 gtgtgtttct gggaggaagc ggcgagcgct ggggtgggcc cgggcaacta cagcttctcc 241 taccagctcg aggatgagcc atggaagctg tgtcgcctgc accaggctcc cacggctcgt 301 ggtgcggtgc gcttctggtg ttcgctgcct acagccgaca cgtcgagctt cgtgccccta 361 gagttgcgcg tcacagcagc ctccggcgct ccgcgatatc accgtgtcat ccacatcaat 421 gaagtagtgc tcctagacgc ccccgtgggg ctggtggcgc ggttggctga cgagagcggc 481 cacgtagtgt tgcgctggct cccgccgcct gagacaccca tgacgtctca catccgctac 541 gaggtggacg tctcggccgg caacggcgca gggagcgtac agagggtgga gatcctggag 601 ggccgcaccg agtgtgtgct gagcaacctg cggggccgga cgcgctacac cttcgccgtc 661 cgcgcgcgta tggctgagcc gagcttcggc ggcttctgga gcgcctggtc ggagcctgtg 721 tcgctgctga cgcctagcga cctggacccc ctcatcctga cgctctccct catcctcgtg 781 gtcatcctgg tgctgctgac cgtgctcgcg ctgctctccc accgccgggc tctgaagcag 841 aagatctggc ctggcatccc gagcccagag agcgagtttg aaggcctctt caccacccac 901 aagggtaact tccagctgtg gctgtaccag aatgatggct gcctgtggtg gagcccctgc 961 acccccttca cggaggaccc acctgcttcc ctggaagtcc tctcagagcg ctgctggggg 1021 acgatgcagg cagtggagcc ggggacagat gatgagggcc ccctgctgga gccagtgggc 1081 agtgagcatg cccaggatac ctatctggtg ctggacaaat ggttgctgcc ccggaacccg 1141 cccagtgagg acctcccagg gcctggtggc agtgtggaca tagtggccat ggatgaaggc 1201 
tcagaagcat cctcctgctc atctgctttg gcctcgaagc ccagcccaga gggagcctct 1261 gctgccagct ttgagtacac tatcctggac cccagctccc agctcttgcg tccatggaca 1321 ctgtgccctg agctgccccc taccccaccc cacctaaagt acctgtacct tgtggtatct 1381 gactctggca tctcaactga ctacagctca ggggactccc agggagccca agggggctta 1441 tccgatggcc cctactccaa cccttatgag aacagcctta tcccagccgc tgagcctctg 1501 ccccccagct atgtggcttg ctcttaggac accaggctgc agatgatcag ggatccaata 1561 tgactcagag aaccagtgca gactcaagac ttatggaaca gggatggcga ggcctctctc 1621 agga // LOCUS HUMGFP40H 4379 bp ds-DNA PRI 18-JUL-1990 DEFINITION Human P40 T-cell and mast cell growth factor (hP40) gene, complete cds. ACCESSION M30135 KEYWORDS T-cell growth factor; cytokine; mast cell growth factor; megakaryoblastic leukemia cell growth factor. SOURCE Human (cell line CESS) DNA, clone lambda-H40.3a1. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 4379) AUTHORS Renauld,J.C., Goethals,A., Houssiau,F., Merz,H., Van Roost,E. and Van Snick,J. TITLE Human P40/Il9: Expression in activated CD4+ cells, genomic organization, and comparison with the mouse gene JOURNAL J. Immunol. 144, 4235-4241 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by J.Van Snick, 12-DEC-1989. 
FEATURES from to/span description pept 796 909 P40 T-cell and mast cell growth factor precursor, exon 1 1022 1057 P40 T-cell and mast cell growth factor precursor, exon 2 1140 1172 P40 T-cell and mast cell growth factor precursor, exon 3 2461 2592 P40 T-cell and mast cell growth factor precursor, exon 4 4096 4215 P40 T-cell and mast cell growth factor precursor, exon 5 sigp 796 849 P40 T-cell and mast cell growth factor signal peptide matp 850 909 P40 T-cell and mast cell growth factor 1022 1057 P40 T-cell and mast cell growth factor 1140 1172 P40 T-cell and mast cell growth factor 2461 2592 P40 T-cell and mast cell growth factor 4096 4212 P40 T-cell and mast cell growth factor IVS 910 1021 hP40 intron A IVS 1058 1139 hP40 intron B IVS 1173 2460 hP40 intron C IVS 2593 4095 hP40 intron D signal 742 747 TATA box signal 4354 4359 polyA signal rpt 2965 3265 Alu repeat binding 370 376 AP-2 binding site binding 624 629 IRF-1 binding site binding 627 633 AP-1 binding site BASE COUNT 1208 a 921 c 904 g 1346 t ORIGIN 1 ctagtatgta gtaagttctc agtaaatgtt agctactata ctctttcaag tgctgggttt 61 ttacttgatg tcatacagtg ttatataaga tctccaaaga tactgaggag tcctcaaggc 121 caattttaac aagcatggtt gccgcattct tgtgcttata gttgaacatt tcttctttca 181 gacacttgca caaagggata cttctaagat gcatttgcat taggtggcaa acttcatcct 241 gggtatgaaa aacattgaga tttgggaata aagcatagta agactgaggt tgcaattact 301 aaaggaaaac cccaacagag ataagtgaag ttctgcaata tcatgcaccc tcccccaacc 361 cgctctgtct ccccaggccc cccttcgtta gaacacccat gactggctat attatatcag 421 catttcccat aatgtaaaaa gggaaaatac agacctgggc gttcatggaa agtattctaa 481 ctctcacaac cagaatccct gtctttgaat tttttttctt ggtttttaga tctttaactt 541 ttccttcagc atttcagtac tcaacttttt gaaaatcatc ttttctgagg aatgatattt 601 cctggcacag catcatctct gtcaagtgac tcagtttgat ttttttgttt gttagtataa 661 agtggcccca acttacagag aaaaagtggg ctcttggtat cagtttgatg tcagggtttt 721 tccgtgtttg agagggagct ttaaatacca ctcgatttga aggtgtctgc aagcgagctc 781 cagtccgctg tcaagatgct tctggccatg gtccttacct ctgccctgct 
cctgtgctcc 841 gtggcaggcc aggggtgtcc aaccttggcg gggatcctgg acatcaactt cctcatcaac 901 aagatgcagg taggctgcag ggggagccca tgggaaagac agctactgac aaagtgaaat 961 atgtatgagg atgaaaaaac tcggggctga ctaaaggttc ttatctctct atctacttta 1021 ggaagatcca gcttccaagt gccactgcag tgctaatgtg agtgaatgct ctttaagaac 1081 tttccaaatt aattttaatt ttcacatctg gaatcttcac tctgaaattt cccttgcagg 1141 tgaccagttg tctctgtttg ggcattccct ctgtaagtat agtgaaataa cataatgttg 1201 accttggatt tttttggttt gtttttaagt aaaaataagt tgctttattt aatatttaat 1261 gttatacatt gttgcttaat ttaattgtta cagattagta ttccctgtta aaaccacatt 1321 gttacaaatt attccctttt aaaactacga tcttgaaatc ctatattatg aacatttctt 1381 tgtatttaat taactttatg cctcttgaga agtttgaaca cttttcaaca ttaaaaaaag 1441 aatcctgaat atctttttag ataggtggcc atgtgcacaa ttaaataaaa ctggaactaa 1501 ggatataata attgctgtag ctcatatcat attgctttct aactcattta ctgataactc 1561 tagagttgtg aaacaatgta aataaaatga caactcctta tctttcatct gtcatgaatg 1621 atctatgcgc tatacctccc cctccctgcc tcctcccttc ctccccacca ccctgttgtc 1681 tgtctagctg attagagtga ctgttggttt gaatgctgcc ctctgggcag gtagaggatc 1741 tgaggttgtg agtggaagga gggcttccag agggccactg cccactacgg caggaaggat 1801 gggtggcagg aaagttctga ttcctaattc aaactcctgg ttagggtgag gaggaggcac 1861 ttctccaagg tgcagtgctt tattctttct catgcaaggc ctgggagaat ctgaagaatc 1921 tgagcttctt gccctggcta gggtaagaca tcgcacccat cgcggtccat ccattagatg 1981 agaagaggat agagtgcctt ctgggcagga accaggcaga cagcacagcc cctgtccctt 2041 ggagtacagt ccatgttttt agctgctgct gaaataccag ctgcattcaa ttgtcacatc 2101 ccattagctg gtgtgaaaag gcttttcctc actctgcact ttcagactta caagccttga 2161 agccgggaag cacccgttga aaagaacatt cagagccgac tatttcaggg cccagagccc 2221 tcatgtttcc tggatgtaac atacaggaag tctcctccag gggatgtcac tgtggaaaaa 2281 tggcatcccc tttaaatacg ggagatcact tcctacattg gcaagggacc tgtctaaaaa 2341 taatgcaagt ttgagtaatg gtgattaaat aaaaatcatc tctattatat tgctctttgt 2401 gatatatttc caaagctgtc ctcagaatat ttctttgaat aaatccttac tatttaccag 2461 gacaactgca ccagaccatg cttcagtgag agactgtctc agatgaccaa taccaccatg 
2521 caaacaagat acccactgat tttcagtcgg gtgaaaaaat cagttgaagt actaaagaac 2581 aacaagtgtc cagtaagttt gttttcatat gtgatatgtt cctgttggtg atttctatgt 2641 gaatggtgat gccaaccctg tttgaacgca aaaggatgat aaagttggaa ttggtagttc 2701 aaggttgata aaagacatct aagaatttta atcagaagta atataattaa agtgagatcc 2761 actgaaacaa tagaattaaa gtgagataga tcattgttcc tgacgaggcc atttacttct 2821 ctctactatg gaataatgaa agaatccttt ctgagtgtaa ttagaagcta caatctagag 2881 aatcagggat gtagctcaca taatactaaa ttatcctaga gattcaatgt actaactgaa 2941 tggatgttgt taacagggat ttttttttcc tgttggttaa ggaggttttg ttttgttttg 3001 gagacagagt cttgctctgt tgcccaggct ggagtgcagt ggtgccatct gagctcactg 3061 cagcctctgc ctcccgggtt caagtgatta tcctgcctca gcctcccgag tagctggcat 3121 tacaggtgcg tgccaccatg cctggctaat ttttgtattt ttaatagaga tggggtttca 3181 ccatgttggc caggttgctc tccaactcct gaactcaagt gatttgcccg ccttgacctc 3241 ccaaagtgct gggatgacag gtgtgagcca ccatgcctgg cctgcattaa ggaggtattt 3301 aaagggcaat gcacccaggt caaggtggaa gcttgctact catcctgaat gcccatccac 3361 acattctttt cttcagcata taccctagtc cctgacagca gactgggatg gcaagttggg 3421 tagaggtgac ctccctctgt tttttgggta ttagcatctc cacacaagat cctagaaggc 3481 tgaaagccct gagctcagct gtttagctgc atgcgtttct accatcaatg gcatctagtt 3541 ctaagtgctt aatatatgct gtctcactga ataaatacat accttaggga caattattca 3601 atttattact ctcagtgagg ttaactaatt tgcctaaggc tgcatatttg ataagtggca 3661 gagctgagat ttgaactcag gcctatatga cctcagagcc ccactcttag ccattgtact 3721 gtcaaatgac cttggaaaga caacctaaaa ggataatgat acaattttag gcctcaaaga 3781 gtccccagaa aaggctttct ctaatgcaga gatttagggc cacttaatag gggtgtgtgt 3841 gtgtgtgtgt gtgtgtgtgt gtgtgtgtgt gtaaagaccc ctgaaatcca atttgaggtc 3901 aaccacctat gctgtcttta caccacatga gctagcctgg acctgcccac ctatttgctc 3961 tgtgtctcaa gccacttccc ttcccatccc cacaatcctc accaccgact ctggctcttg 4021 gcaggtaggc ttctggggct gcttggctct acatcatttg agtcactctg tccttatcaa 4081 ctttcatccc cacagtattt ttcctgtgaa cagccatgca accaaaccac ggcaggcaac 4141 gcgctgacat ttctgaagag tcttctggaa attttccaga aagaaaagat gagagggatg 4201 
agaggcaaga tatgaagatg aaatattatt tatcctattt attaaattta aaaagctttc 4261 tctttaagtt gctacaattt aaaaatcaag taagctactc taaatcagta tcagttgtga 4321 ttatttgttt aacattgtat gtctttattt tgaaataaat acatatgtgg aaaaaacaa // LOCUS MEFDNABP 1710 bp ds-DNA BCT 18-JUL-1990 DEFINITION M.fervidus DNA binding protein HMfB gene, complete cds. ACCESSION M34778 KEYWORDS DNA binding protein HMfB. SOURCE M.fervidus DNA. ORGANISM Methanothermus fervidus Prokaryota; Bacteria; Mendosicutes; Archaeobacteria; Methanobacteriales; Methanothermaceae. REFERENCE 1 (sites; for [2]) AUTHORS Sandman,K., Krzycki,J.A., Dobrinski,B., Lurz,R. and Reeve,J.N. TITLE DNA binding protein HMf, isolated from the hyperthermophilic archaeal Methanothermus fervidus is most closely related to histones JOURNAL Proc. Natl. Acad. Sci. U.S.A. (1990) In press STANDARD full staff_review REFERENCE 2 (bases 1 to 1710) AUTHORS Sandman,K., Krzycki,J.A., Dobrinski,B., Lurz,R. and Reeve,J.N. JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by K.Sandman, 01-JUN-1990. Author address: K.Sandman Department of Microbiology Ohio State University 484 W. 12th Ave. Columbus, Ohio 43210 E-mail KENDRICK-K@OSU-20.IRCC.OHIO-STATE.EDU FEATURES from to/span description pept 578 787 DNA binding protein HMfB pept 1662 805 (c) ORF285 signal 450 457 promoter signal 523 530 promoter signal 790 814 transcription termination signal BASE COUNT 619 a 263 c 265 g 561 t 2 others ORIGIN 1 bp upstream of HindIII site. 
1 aagctttgag agcaatatat gaatccaatn gaactgccct aancgtcagt gatgaagaaa 61 tacttaaggc tcaaagacta ttagcaaaaa aaagaggtat aggtgttgag ccagcatcag 121 ctgcaagtat tgcagactaa tcaaattaat agatgatggc tatttagatc atggagaaag 181 agttgtatgc gtattgacag ggcatcttct aaaagatcca aaaattgcta taaatgcttg 241 taaaaagcct ttagaagtag aaccagattt taatttacta aaaaagatta ttggataaat 301 agaaaatttt attctataaa aaaacgcctt tgatacaata tctatattat ccatttatat 361 ggctatcaaa gcacaatata aactttttat ggttaataag aatttcacaa ccaccatcaa 421 aataaaacat caacgatgca aacaataaat ttatatagga taaatttgat aatattcttt 481 cgtaagaata gagaggaaaa catcaacgat gcaaacaata aatttatata ggataaattt 541 gataatattc tttcgtaaga atagagaggt ggtaagtatg gaattaccaa tagcaccaat 601 tggtagaatt ataaaagatg caggagccga aagagtcagt gatgatgcaa gaataactct 661 agcaaaaatt ttagaagaaa tgggtagaga cattgcttca gaagctataa aactagcaag 721 acacgccgga aggaaaacaa taaaagcaga ggacattgaa ctagcagttc gaagatttaa 781 gaaataatat ttttcttttt ccttttatat ttttattgca aataaaaaat catacaaaat 841 ttttgctgca ttcatcgctg cggggtcgcc tattctatct gttgcaactt caacaatgtc 901 aatgccaata attttttttc tagctatttt ttcaatgaat ttttccatat ggtagggagt 961 gatacctaca ggcgttggat ttccaacacc tggagcataa ccaggatcta atacatctat 1021 atccactgtt acatagaatg gtccatctaa tttatttatt tcatttaaaa ccatgttaaa 1081 tttttcaatt atgtctcttg atgtatagta tttgatattt tcattaagaa caaattcatg 1141 ttcttctttt gtacaagatc ttacaccaat ttggacaatt ttcttcggat gtaattcata 1201 tattcgtcgc atgactgttg catgcgaaaa ctttttacca gcataagtat ttgccatatc 1261 catatgtgca tcaaagtgta gaatagttac atcacttaga tcaaatgttt tcacaacacc 1321 atatgttatg ctatgttctc caccaaggac aataggaatc attccttttt ttaaaacttc 1381 ttgaactttt tcagaaatat tacgacaagt atttttaaag tcacctagaa ttacctcaat 1441 atcacctaaa tcaatgatag gaacatctag tttttttcta tagtgtaaac tatagttttc 1501 aaaattgtag gatgcttgac gaatcatcaa tggtccaaat cttgaccctg gcttatatgt 1561 ggatgtgcta tcaaaaggta ctcctaacaa tccaaaaact tttttattag aaaggtattt 1621 aagggaggaa gggtccttag aatatgcaaa ttttagaccc atattattta actcttaaaa 1681 gttttgtttt acccattgct 
tctaaatatt // LOCUS MUSINT1A 5607 bp ds-DNA ROD 18-JUL-1990 DEFINITION Mouse mammary proto-oncogene Wnt-1 (int-1), complete cds. ACCESSION K02593 M34750 KEYWORDS Wnt-1 oncogene; int-1 oncogene; oncogene; proto-oncogene. SOURCE Mouse (BALB/c) DNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1098 to 5606) AUTHORS Van Ooyen,A. and Nusse,R. TITLE Structure and nucleotide sequence of the putative mammary oncogene int-1; Proviral insertions leave the protein-encoding domain intact JOURNAL Cell 39, 233-240 (1984) STANDARD full staff_review REFERENCE 2 (bases 1 to 5607) AUTHORS Nusse,R., Theunissen,H., Wagenaar,E., Rijsewijk,F., Gennissen,A., Otte,A., Schuuring,E. and Van Ooyen,A. TITLE The promoter of the Wnt-1 (int-1) oncogene and its mechanism of activation by insertion of proviral DNA of the mouse mammary tumor virus JOURNAL Mol. Cell. Biol. (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [2] kindly submitted by R.Nusse, 31-MAY-1990. FEATURES from to/span description pept 1779 1882 Wnt-1 protein (exon 1) 2452 2705 Wnt-1 protein (exon 2) 3279 3544 Wnt-1 protein (exon 3) 4002 4490 Wnt-1 protein (exon 4) IVS 1883 2451 Wnt-1 cds intron A IVS 2706 3278 Wnt-1 cds intron B IVS 3545 4001 Wnt-1 cds intron C site 1133 1134 MMTV insertion site in tumor 17 [1] site 1394 1395 MMTV insertion site in tumor 35 [1] site 1631 1632 MMTV insertion site in tumor 102 [1] site 4495 4496 MMTV insertion site in tumor 53 [1] mRNA 1428 5398 Wnt-1 mRNA (alt.) mRNA 1595 5398 Wnt-1 mRNA (alt.) signal 1562 1566 TATA box signal 1573 1577 TATA box BASE COUNT 1188 a 1681 c 1519 g 1219 t ORIGIN Chromosome 15. 
1 atgtatgtat gtatgtatgt atgtatgtat acgtgcgtgc acctgtgtgt gcttggtgtc 61 agtggggctc agacatcacc tgattccctg gaactggagt tacaggtggc tataagccac 121 cacttgggtg ctgagaacag agtccgggcc tctggcagag cagtcagtgc ttttagccac 181 tgagccactc tcatcccccc aattatgttc atcttgagtt gggcaggtac ggtggcggaa 241 taggcctgta atcccagcag tcactggacc atcatgggtt ctacatatta aacctttatg 301 ttaggtaggg tcacacagca agatccggtc acaaaaccag caacaacaaa aaccaaaagg 361 agccagcttc ttcccacaag cattctttcc ctcaggtctt cagctccatc tgacagctac 421 tcggctggtg gtcctatcct ttctgagcct agttgccaga gaaacaagcc cggttcatct 481 tcatgactag cacatctaat gataagcaca ggttgactca aggtgccata gagtgacact 541 aggtacccag agcgacagaa tgacacctat gagtgcacgt cgttaatcac aaacacacac 601 acacacacac acacacacac acacacacac tcatgcaccc acctgcaaac acaattgcag 661 ccttctggac gtctcctgtc acagccccac ctccttcctg atacactgcg ttaagtggtg 721 actgtaacaa aatgacttca tgctctccct gtcctgagcc aaattacaca attatttgga 781 aagggctcaa aatgttcttc gttagaagtt tctggataca ccaatacaca ggagcgtgca 841 ccctcagaac acatgtacac tttgacttaa tctcacgggt gacacaccga cgcttacact 901 ccccctagcc cacagaggca aactgctggg cgcttctgag tttctcactg ccaccagctc 961 ggtttgctca gcctaccccc gcaccccgcg cccgggaatc cctgaccaca gctccaccca 1021 tgctctgtct ccttcttttc cttctctgtc cagccgtcgg ggttcctggg tgaggaagtg 1081 tctccacgga gtcgctggct agaaccacaa ctttcatcct gccattcaga atagggaaga 1141 gaagagacca cagcgtaggg gggacagagg agacggactt cgagaggaca gccccaccgg 1201 cgcgtgtggg ggaggcaatc caggctgcaa acaggttgtc cccagcgcat tgtccccgcg 1261 ccccctggcg gatgctggtc cccgacgggc tccggacgcg cagaagagtg aggccggcgc 1321 gcgtgggagg ccatcccaag gggaggggtc ggcggccagt gcagacctgg aggcggggcc 1381 accaggcagg gggcgggggt gagccccgac ggttagcctg tcagctcttt gctcagaccg 1441 gcaagagcca cagcttcgct cgccactcat tgtctgtggc cctgaccagt gcgccctggt 1501 gcttttagtg ccgcccgggc ccggaggggc agcctcttct cactgcagtc agcgccgcaa 1561 ctataagagg cctataagag gcggtgcctc ccgcagtggc tgcttcagcc cagcagccag 1621 gacagcgaac catgctgcct gcggcccgcc tccagactta ttagagccag cctgggaact 1681 cgcatcactg ccctcaccgc 
tgtgtccagt cccaccgtcg cggacagcaa ccacagtcgt 1741 cagaaccgca gcacagaacc agcaaggcca ggcaggccat ggggctctgg gcgctgctgc 1801 ccagctgggt ttctactacg ttgctactgg cactgaccgc tctgcccgca gccctggctg 1861 ccaacagtag tggccgatgg tggtaagtga gctagtacgg ggtccgccac ttgtcctggg 1921 gcaaagagcc aggcacgggc cttacccagc tcccacgctg tggggatcac caacctacag 1981 acccccctcg tgcattgtga cttcacatcc agggtgctca cacctagaac tagctctgct 2041 gaagtggggc acatcattgg catgcagaag cccagataca ccaggctcag agaccattcc 2101 catttaatac gaccccgttt ctgctgagca acaggtccca acctcgctgt ggtgggtgct 2161 caggtgtccc ttaggtcttg aaccaaaaaa aaaaaaaaaa aaaaaaaaaa accagatatt 2221 agctttgagg tgagggagtg gaattcctaa gtttttcaag gtgggcaagg ctgcaggtgg 2281 ggtttctcct cgggggctga cttgaagaaa ggaagagcta aggtagccat gccttttctg 2341 tccactcact agactctgga gctcagggcc aggcaaggat agggtggtac agcctgtatg 2401 gttaggatgc aggtcccctc ccctggactg aacccttatg catcccgcca ggggcatcgt 2461 gaacatagcc tcctccacga acctgttgac ggattccaag agtctgcagc tggtgctcga 2521 gcccagtctg cagctgctga gccgcaagca gcggcgactg atccgacaga acccggggat 2581 cctgcacagc gtgagtggag ggctccagag cgctgtgcga gagtgcaaat ggcaattccg 2641 aaaccgccgc tggaactgcc ccactgctcc ggggccccac ctcttcggca agatcgtcaa 2701 ccgaggtggg tgcccaggaa agcgacgctt ccgggattaa gggaaaagca gggtcatctc 2761 cagggcatag gcgggcgaag gcagggaaga catcccaggg ttatatgtga tcaaactgag 2821 aatcgcctgg tgccggcagt taccgtaggt cagcaccaga ttctttctag ccttgcgttg 2881 tgagcatgat ctttaacgtt gctggccact ggcccacaga aagggaattc cggatcgtgg 2941 gcgctgggcg acagctgttt ttccctagcc ttcctcaaag gtacctggga agctgatctc 3001 tgagggctag ctagggttgt gcttcgcacc cagcaaagtt tgcactgcca atactagtag 3061 cgatcttggc tatgcagatt tgttctactt gggaatctcc ccttggagct gctctgctag 3121 ggctctggag tctcagtaaa gcttagagag gagggcattc catgcttcgc acacatgact 3181 ccaaggatgt tggactgtag ggtaccaagt cttccaaaca gggtgctgag ttggccccac 3241 gccttctctc aactgatgcg gggtcgcttc acccacaggc tgccgagaaa cagcgttcat 3301 cttcgcaatc acctccgccg gggtcacaca ttccgtggcg cgctcctgct ccgaaggctc 3361 catcgagtcc tgcacctgcg actaccggcg 
gcgcggccct gggggccccg actggcactg 3421 ggggggctgc agtgacaaca tcgattttgg tcgcctcttt ggccgagagt tcgtggactc 3481 cggggagaag gggcgggacc tacgcttcct catgaacctt cacaacaacg aggcagggcg 3541 aacggtacgt cggtgtgtcc ggaaccaatg gcaggggaga tgtaagacag gtgcacgggg 3601 acagaggcac agggaggggc ttcccgagag agtgggactc taggagggaa gacagagaag 3661 aggtggtggt tgagggcaaa gaggttcctg agctgatgac agaacagaag agattagcag 3721 gctatcaaca cgtgggatgt attgagatgg ctccatggca cacttttgaa agataaaagt 3781 gacttgctgg cgtggagcag agtctggccg aatgtcccta tctcagcggg ccattttgca 3841 cttcctctct cccgagctta gtcacacctg gaccttggct gaagtttcca cagcatcgac 3901 gtgacccggg tggggtgggg gtggggaagt atgggtggtg gttcgtggga tgttggcttt 3961 gaccttttct tccctcctcc cctcgtcccc tcctccccca gaccgtgttc tctgagatgc 4021 gccaagagtg caaatgccac gggatgtccg gctcctgcac ggtgcgcacg tgttggatgc 4081 ggctgcccac gctgcgcgct gtgggcgacg tgctgcgcga ccgcttcgac ggcgcctccc 4141 gcgtccttta cggcaaccga ggcagcaacc gcgcctcgcg ggcggagctg ctgcgcctgg 4201 agcccgaaga ccccgcgcac aagcctccct cccctcacga cctcgtctac ttcgagaaat 4261 cgcccaactt ctgcacgtac agtggccgcc tgggcacagc tggcacagct ggacgagctt 4321 gcaacagctc gtctcccgcg ctggacggct gtgagctgct gtgctgtggc cgaggccacc 4381 gcacgcgcac gcagcgcgtc acggagcgct gcaactgcac cttccactgg tgctgccacg 4441 tcagctgccg caactgcacg cacacgcgcg ttctgcacga gtgtctatga ggtgccgcgc 4501 ctccgggaac gggaacgctc tcttccagtt ctcagacaca ctcgctggtc ctgatgtttg 4561 cccaccctac cgcgtccagc cacagtccca gggttcatag cgatccatct ctcccacctc 4621 ctacctgggg actcctgaaa ccacttgcct gagtcggctc gaaccctttt gccatcctga 4681 gggccctgac ccagcctacc tccctccctc tttgagggag actccttttg cactgccccc 4741 caatttggcc agagggtgag agaaagattc ttcttctggg gtgggggtgg ggaggtcaac 4801 tcttgaaggt gttgcggttc ctgatgtatt ttgcgctgtg acctctttgg gtattatcac 4861 ctttccttgt ctctcgggtc cctataggtc ccttgagttc tctaaccagc acctctgggc 4921 ttcaaggcct ttcccctccc acctgtagct gaagagtttc cgagttgaaa gggcacggaa 4981 agctaagtgg gaaaggaggt tgctggaccc agcagcaaaa ccctacattc tccttgtctc 5041 tgcctcggag ccattgaaca gctgtgaacc atgcctccct 
cagcctcctc ccaccccttc 5101 ctgtcctgcc tcctcatcac tgtgtaaata atttgcaccg aaatgtggcc gcagagccac 5161 gcgttcggtt atgtaaataa aactatttat tgtgctgggt tccagcctgg gttgcagaga 5221 ccaccctcac cccacctcac tgctcctctg ttctgctcgc cagtcctttt gttatccgac 5281 cttttttctc ttttacccag cttctcatag gcgcccttgc ccaccggatc agtatttcct 5341 tccactgtag ctattagtgg ctcctcgccc ccaccaatgt agtatcttcc tctgaggaat 5401 aaaatatcta tttttatcaa cgactctggt ccttgaatcc agaacacagc atggcttcca 5461 acgtcctctt cccttccaat ggacttgctt ctcttctcat agccaaacaa aagagataga 5521 gttgttgaag atctcttttc cagggcctga gcaaggaccc tgagatcctg acccttggat 5581 gaccctaaat gagaccaact agggatc // LOCUS MUSP40M 3808 bp ds-DNA ROD 18-JUL-1990 DEFINITION Mouse P40 T-cell and mast cell growth factor (mP40) gene, complete cds. ACCESSION M30136 KEYWORDS T-cell growth factor; cytokine; mast cell growth factor. SOURCE Mouse (strain C57B1/6; haplotype H2b) lymphoid T cell line L1210 DNA, clone lambda-S40.1a. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 3808) AUTHORS Renauld,J.C., Goethals,A., Houssiau,F., Merz,H., Van Roost,E. and Van Snick,J. TITLE Human P40/Il9: Expression in activated CD4+ cells, genomic organization, and comparison with the mouse gene JOURNAL J. Immunol. 144, 4235-4241 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by J.Van Snick, 21-NOV-1989. 
FEATURES from to/span description pept 644 757 P40 T-cell and mast cell growth factor precursor, exon 1 846 881 P40 T-cell and mast cell growth factor precursor, exon 2 965 997 P40 T-cell and mast cell growth factor precursor, exon 3 2184 2315 P40 T-cell and mast cell growth factor precursor, exon 4 3393 3512 P40 T-cell and mast cell growth factor precursor, exon 5 sigp 644 697 P40 T-cell and mast cell growth factor signal peptide matp 698 757 P40 T-cell and mast cell growth factor 846 881 P40 T-cell and mast cell growth factor 965 997 P40 T-cell and mast cell growth factor 2184 2315 P40 T-cell and mast cell growth factor 3393 3509 P40 T-cell and mast cell growth factor IVS 758 845 mP40 intron A IVS 882 964 mP40 intron B IVS 998 2183 mP40 intron C IVS 2316 3392 mP40 intron D signal 589 594 TATA box binding 227 233 AP-2 binding site binding 475 480 IRF-1 binding site binding 478 484 AP-1 binding site BASE COUNT 1012 a 903 c 869 g 1024 t ORIGIN Chromosome 13. 1 ggatcctcaa ggccaatgct agcaagcaca gttccagcat ccacgtactt acagaaaaca 61 actcatatac tcgcatagag cgacttctca gaggtgtatg tacgaggtag aacttttgtt 121 cctaggcctg acatactgtg agtatctagc aatcaagtaa actgaggctc caatagccag 181 aggaaaaccc caatgagtga aaggcaagtc ttgctttccc ctgatatccc cagtgtgacc 241 ccttcattac cacccctgta actcactgtc tatcagcatt tctcactaat gtggagggga 301 aaacacagac ctgggctttc atggaaagtg ttttgattct cacaaccaga attcctgctt 361 ttaaaggggg ttggggctag atctttaact tttcctttag tatttcagaa cccgactatt 421 tgaagagcat cttttctgag gaatggtatt tcctggcata agacagcctt tgtcaagtga 481 ctcagactga tttttttttc aatatctcaa ttggcctcaa cttacagaga aaaaagtggg 541 cactgggtat cagtttgatg tcagggtttt tcccggtttg aagagcttta aatacagcta 601 gactggaaga tgctggtaga ctgagttcca gactcccgtc aacatgttgg tgacatacat 661 ccttgcctct gttttgctct tcagttctgt gctgggccag agatgcagca ccacatgggg 721 catcagagac accaattacc ttattgaaaa tctgaaggta gatttagggg actggggaga 781 gtcatctgct ggaggatggc aaagggggct caactcactg ggggttatct ctttgcctac 841 tgtaggatga tccaccgtca aaatgcagct gcagcggcaa 
cgtgagtgaa taatctttag 901 aactttccag acttctctca gtgttcacag ctagaatgct catgctgaga tttctctctt 961 gcaggtgacc agctgcttgt gtctctccgt cccaactgta agtatagtcg gaaaaaggtc 1021 aggggacact gtttacgctt tgaagagaaa tcacgcaact catttgacat tctaaattct 1081 aaattactgt ttaagtggga ctgttaaagt atagacccca cctttaaaac ttatatttat 1141 atatgcatca tacacacaac tatatgttta agctagcatc atgcccttga gaaatctgga 1201 tatttttcag tgtctaaaac gtattgacta tcttctgaga taaatgggca tgtaaatgaa 1261 aaaagaaaag aatattgctt actggttgat ctgtggtaag tccagatttg tgaaacaaac 1321 aaaatagtaa catcaggcag ggatcatcta gatatcacaa acacacacac acacacacac 1381 acacacacac acacacacac acacacacct cctttctctg gcatcttcct actctttccc 1441 cttcccagtc tctatctccc cctccttctg cttctccttg ctggtcttta gtggctccca 1501 aatggttcca actgtcctct gggaagacag tatctaagtt tggtgccgaa tgggacccta 1561 agagagcaac tcccatcctg gcaggaaggg gaaggcagga ggctctggaa ctattctaac 1621 tcctggttat gatgagggag cagtcaccca ggcctgtagc cactgcaggg ctcactcttc 1681 aagttggacc agggaaaagc tgagcttaga accccatgga agtcagaagc cccatttgct 1741 ggagttctat ggctagggga gggtccaatc taccccctcc cttctcgatg cagggttgta 1801 gcttgcggct aacaggagtc ctatgcaaat ggcttgtccc atcagctggc atgagaaaat 1861 gcttttcctt tctctggact ttcaaacact cagccactgg acctggacag ggccaccagg 1921 tgcgagacag atgcagtacc acttttgcta gggcatcagg tctacatttc ctgaggagtc 1981 tcctcaggaa tgtcactggg aaactgacct cacctcaaat tacttaagat agtttcctcg 2041 ggtggccagg aagaagagca tgagtttggg cgacactggt taaattaagt tgctctttaa 2101 aatcctcttg gcgattcttc ctgaaagcag tcctaggcag tttttattat aaatcctttt 2161 tataagtcct tgttatttac caggatgatt gtaccacacc gtgctacagg gagggactgt 2221 tacagctgac caatgccaca cagaaatcaa gactcttgcc tgttttccat cgggtgaaaa 2281 ggatagttga agtcctaaag aacataacgt gtccggtgag ttttttgtct gagtgtgaca 2341 aaagtagggc ctctgggtca gagcctgtgg cagggggcct tgagagcaac aactgtctgc 2401 tcacatccaa gcaaatgaga gaggatgata aagttagagc aggtaggccc aggctggaag 2461 ggggtcaggc aaaaaatgta accaggaaca agatcactgc agtggaacag atctgctctg 2521 gggtctgtgg ttcccctggt taaatggaga ccgttgacac ccaagtggcc 
aatgtcttct 2581 ctccacctgt tggatagtga aagaattcac ctaattgtaa gcaaaagcca gactcaaaag 2641 agtcagactg taactcatgc agcactaaat catcctaaaa attcacttta ctaatcgagg 2701 atgtgggagc ctctgagtca ggtttgtgtg tgtgtgttgg tttgttgaat gactgaaaag 2761 ttttgtaaaa gaaaagacac caaggtctag gtaagtactt ccacctgccc tactgactat 2821 cagcatacac cctggttcct gaggcagtgg cccgtcgctg tccgttctcc ctgtgtgtcc 2881 tttcatagga gaacaccagc agtcccaagt gctgtggttg gccagcctag tcacatatct 2941 ctgcccctag tggcacctgc cttggctatg ttacgtgagc cattaccctg agcacctgcc 3001 ttagaggcca gctttagctc tgttcgccag ctccataggg aggaatgtga tgcccgggga 3061 aagttatttt tctgatgcta ttgtatttta taagagacaa cgctgtgact taaaccctgg 3121 tctatgtgaa tccaaagtct cactcagaac tacgtgtgtg tacgtgcaca gactcctgga 3181 atcgtgaccc caaattaaca cttacatggt cttaccaaat gctagctagc ccaggatcaa 3241 ccttcttatt tccctcccat ctcattcctc tctcccctgc ccttcaactg gccaccaggt 3301 ttgagggaag ctggtcctgg gctgctagga gtgtattgct gcaaagtggg ctgcggatgc 3361 tcaccctgtt ctgcccactt ctctccctgc agtccttttc ctgcgaaaag ccatgcaacc 3421 agaccatggc aggcaacaca ctgtcatttc tgaagagtct cctggggacg ttccagaaga 3481 cagagatgca aaggcagaaa agccgaccat gaagacagat gctatttatt ctatttattg 3541 aatttacaaa acctcccctc cttaactgtt acagtgaaga aataaactaa gctattctag 3601 accaaggccc ttttgtgtcc ttttgcttca cagttgcttt tggaactatg gggagggaga 3661 agtaaaaggt atctggaata aagatccacg gggagggaga gagacaagct gtctccagac 3721 cctttccccg tgcctactcc ttcctgttgc tcagagcaaa tgaagctccc ggtgttgcat 3781 aggtaggatg gagcaaagac ttggtaat // LOCUS PIGMHDQAA 1105 bp ss-mRNA MAM 18-JUL-1990 DEFINITION Pig MHC class II SLA-DQ alpha chain mRNA (haplotype c), complete cds. ACCESSION M29938 KEYWORDS cell surface glycoprotein; class II gene; integral membrane protein; major histocompatibility complex. SOURCE Miniature pig (haplotype c) adult spleen, cDNA to mRNA, clone ZQ30-1. ORGANISM Sus scrofa Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Artiodactyla; Suiformes; Suidae. 
REFERENCE 1 (bases 1 to 1105) AUTHORS Hirsch,F., Sachs,D.H., Gustafsson,K., Pratt,K., Germana,S. and LeGuern,C. TITLE Class II genes of miniature swine: III. Characterization of an expressed pig class II gene homologous to HLA-DQA JOURNAL Immunogenetics 31, 52-56 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by C.LeGuern, 17-NOV-1989. FEATURES from to/span description pept 13 777 MHC SLA-DQ alpha chain precursor sigp 13 81 MHC SLA-DQ alpha chain signal peptide matp 82 774 MHC SLA-DQ alpha chain mRNA < 1 1105 MHC SLA-DQ mRNA BASE COUNT 254 a 290 c 255 g 306 t ORIGIN 1 cttgagaaga gcatggtccc aggccgagtt ctgatgtggg gggccctcgc cctgaccacc 61 gtgatgagcg cctgtggagg tgaagacatt gcggccgacc atgttgcctc ctatggctta 121 aatgtctacc agtcttacgg tcccagaggc tactttaccc atgaatttga tggcgacgag 181 caattctacg tggacctgga gaagaaggag actgtctggc ggctgcctct gtttagtgaa 241 tttacaagtt ttgacccgca gggtgcactg aggaacatag ctacgttaaa acataacttg 301 aacattgtga ctaaacgctc caacaacact gcggctgtca ataaggttcc tgaggtgact 361 gtgttttcca agtctccagt gatactgggt cagcccaaca ccctcatctg tcatgtggac 421 agcatctttc ctcctgtgat caacatcacg tggttgaaga acgggcactc tgtcaaaggt 481 ttttctgaga ccagcttcct ctccaaaaat gatcattcct tcctcaagat cagttatctc 541 accttcctcc cttctgatga tgatttttat gactgcaaag tggagcactg gggcctggat 601 aagccacttc tgaaacactg ggaacctgag attccagccc ccatgtcaga gctgacagag 661 actgtggtct gcgccctggg attgatcgtg ggccttgtgg gcatcgtggt gggcactgtc 721 ttcatcattc aaggcctgcg ctcaggtggt ccctctagac accaagggtc cttgtgagtc 781 acactccaga agggaaggag cactgcccgc ctacaagagc tgaagagtgg atgtgctcaa 841 cgacctagaa ctattttctg gccaaattca tcatatacct tctctcttcc tacattcttc 901 ttctcacctc ttctctggaa cttaaggtgc tgtgtcatct cagaattcat atgcccttgg 961 aatcctcttc ctgaccttct aggtcttttt cttttctcag ttgttacctg ttacgggatc 1021 tctgagatat cccactcggc tacctgatcc ctcagtgacc ctaaactaat atgtacatgg 1081 aaacaataaa tcacttcatg aggtt // LOCUS PIGMHDQAB 1174 bp ss-mRNA MAM 18-JUL-1990 DEFINITION Pig MHC 
class II SLA-DQ alpha chain mRNA (haplotype d), complete cds. ACCESSION M29939 KEYWORDS cell surface glycoprotein; class II gene; integral membrane protein; major histocompatibility complex. SOURCE Miniature pig (haplotype d) adult spleen, cDNA to mRNA, clone 16.2. ORGANISM Sus scrofa Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Artiodactyla; Suiformes; Suidae. REFERENCE 1 (bases 1 to 1174) AUTHORS Hirsch,F., Sachs,D.H., Gustafsson,K., Pratt,K., Germana,S. and LeGuern,C. TITLE Class II genes of miniature swine: III. Characterization of an expressed pig class II gene homologous to HLA-DQA JOURNAL Immunogenetics 31, 52-56 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by C.LeGuern, 17-NOV-1989. FEATURES from to/span description pept 69 836 MHC SLA-DQ alpha chain precursor sigp 69 137 MHC SLA-DQ alpha chain signal peptide matp 138 833 MHC SLA-DQ alpha chain BASE COUNT 273 a 313 c 266 g 322 t ORIGIN 1 ctcagctcca tcctcatcat tgctctacaa ctccgaagag caacagctga gaccaccttg 61 agaagagcat ggtcccaggc cgagttctga tgtggggggc cctcgccctg accgccgtga 121 tgagcgcctg tggaggtgaa gacattgcgg ccgaccatgt tgcctcctat ggcttaaatg 181 tctaccagtc ttacggtccc agcggctatt atacccatga atttgatggc gacgaggaat 241 tctacgtgga cctggggaag aaggagactg tctggcagct gcctctgttt agcaaattta 301 gaagttttga cccacagggt gcactgagga acatagctac ggcaaaacat aatttgaaca 361 tcctgattaa acgttccaac aacaccgcgg ctgtcaatca ggttcctgag gtgactgtgt 421 ttcccaagtc tccagtgatg ctgggtcagc ccaacaccct catctgtcat gtggacaaca 481 tctttcctcc tgtgatcaac atcacgtggt tgaagaacgg gcactctgtc acagaaggtt 541 tttctgagac cagcttcctc tccaaaaatg atcattcctt cctcaagatc agttatctca 601 ccttcctccc ttctgatgat gatttttatg actgcaaagt ggagcactgg ggcctggata 661 agccacttct gaaacactgg gaacctgaga ttccagcccc catgtcagag ctgacagaga 721 cagtggtctg cgccctggga ttgatcgtgg gccttgtggg catcgtggtg ggcactgtct 781 tcatcattca aggcctacgc tcaggtggtc cctctagaca ccaagggtcc ttgtgagtca 841 cactccagaa 
gggaaggtgc actgcccgcc tacaagagct gaagagtgga tgtgctcaac 901 gacctagaac tattttctgg ccaaattcat catatacctt ctctcttcct acattcttct 961 tctcacctct tctctggaac ttaaggtgct gtgtcatctc agaattcata tgcccttgga 1021 atcctcttcc tgaccttcta ggtttttttt cttttctcag ttgttacctg ttatgggatc 1081 tctgagatat cccactcggc tacctgatcc ctcagtgacc ctaaactaat atgtacatgg 1141 aaacaataaa tcacttcatg aggtttttgt actg // LOCUS PIGMHDQBA 212 bp ds-DNA MAM 18-JUL-1990 DEFINITION Pig MHC class II SLA-DO beta chain gene (haplotype c/c), 3' end. ACCESSION M29944 KEYWORDS cell surface glycoprotein; class II gene; integral membrane protein; major histocompatibility complex. SOURCE Pig (haplotype c/c) adult spleen DNA, clone D26. ORGANISM Sus scrofa Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Artiodactyla; Suiformes; Suidae. REFERENCE 1 (bases 1 to 212) AUTHORS Pratt,K., Sachs,D.H., Germana,S., El-Gamil,M., Hirsch,F., Gustafsson,K. and LeGuern,C. TITLE Class II genes of miniature swine: II. Molecular identification and characterization of B (beta) genes from SLA-c haplotype JOURNAL Immunogenetics 31, 1-6 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by C.LeGuern, 17-NOV-1989. FEATURES from to/span description pept < 1 12 MHC SLA-DO beta chain BASE COUNT 38 a 58 c 52 g 64 t ORIGIN 1 cagccacact aaggtcctag gcgaacgtct ccccctggag cctgaagtgg aactgagtgg 61 cctgggccct ggacaaagtc aaggacgttc aggaggtcga tgttctggat gattcctttc 121 cacgagccct tggaggtgtc ctgaactgat tctagttttg tgttctgtga tcattcatct 181 cttacccttc tgcctttctt cctcctactt gt // LOCUS PP2LEND 266 bp ds-DNA PHG 18-JUL-1990 DEFINITION Bacteriophage P2 left end DNA. ACCESSION M34756 KEYWORDS . SOURCE Bacteriophage P2 DNA. ORGANISM Bacteriophage P2 Viridae; ds-DNA nonenveloped viruses; Myoviridae. REFERENCE 1 (bases 1 to 266) AUTHORS Ziermann,R. and Calendar,R. 
TITLE Characterization of the cos sites of Bacteriophages P2 and P4 JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by R.Ziermann, 05-JUN-1990. Author address: R.Ziermann University of California Dept. Mol. Cell Biology Berkeley, CA 94720 Email: calendar@garnet.berkeley.edu FEATURES from to/span description site 1 19 cohesive end of phage DNA site 20 266 left end of phage DNA BASE COUNT 41 a 58 c 99 g 68 t ORIGIN Map position 0.0-0.8%. 1 ggcgaggcgg ggaaagcact gcgcgctgac ggtggtgctg attgtatttt ttcagcgtct 61 cagcgcgtcg tgacggcact tagtctgccc gttgaggcgt tgtgtgtctg cggggtgttt 121 tgtgcggtgg tgagcgtgtg aggggggatg acggggtgta aaaaagccgc ccgcaggcgg 181 cgatgttcag tcgttgtcag tgtccagtga gtagttttta aagcggatga cctcctgacc 241 gagccagccg tttatctcgc ggatcc // LOCUS RCABCHH 314 bp ds-DNA BCT 18-JUL-1990 DEFINITION R.capsulata bacteriochlorophyll biosynthesis protein (bchH and F108) genes, 3' end, and 5' end respectively. ACCESSION M34843 KEYWORDS . SOURCE R.capsulata DNA. ORGANISM Rhodobacter capsulatus Prokaryota; Bacteria; Gracilicutes; Anoxyphotobacteria; Purple nonsulfur bacteria. REFERENCE 1 (bases 1 to 314) AUTHORS Yang,Z. and Bauer,C.E. TITLE Rhodobacter capsulatus genes involved in early steps of the bacteriochlorophyll biosynthetic pathway JOURNAL J. Bacteriol. (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by C.E.Bauer, 04-JUN-1990. FEATURES from to/span description pept < 1 79 bacteriochlorophyll biosynthesis protein (bchH) (AA at 2) pept 108 > 314 bacteriochlorophyll biosynthesis protein (F108) BASE COUNT 59 a 87 c 119 g 49 t ORIGIN 1 bp upstream of BamHI site. 
1 ggatccggcg acgctggcgg ctttgcaagc cgcggcggac gagctcgaag accggatgga 61 aggcgttgcg gcagaatgag atggacgact acgaaaagga gggccgcatg agcccgcgcg 121 acgatattcc cgacctgaag ggcttcgacg gtgacggcga gggctcggtg caggtgcacg 181 acagcgaaga catcggcctc gatgtcggcg gcgcgcgggt gttctcggtc tatggcaagg 241 gggggatcgg caaatcgacg acttcctcga acctctcggc cgcgttctcg cttctgggca 301 agcgcgtgct gcag // LOCUS TREANTAT 999 bp ds-DNA ORG 18-JUL-1990 DEFINITION T.evansi AnTat 3/3 kinetoplast DNA minicircle. ACCESSION M34848 KEYWORDS . SOURCE T.evansi AnTat 3/3 kinetoplast minicircle DNA. ORGANISM Kinetoplast Trypanosoma evansi Eukaryota; Animalia; Protozoa; Sarcomastigophora; Mastigophora; Kinetoplastida; Trypanosomatina; Trypanosomatidae; Trypanosoma evansi. REFERENCE 1 (bases 1 to 999) AUTHORS Bajyana Songa,E., Paindavoine,P., Viseshakul,N., Muyldermans,S., Steinert,M. and Hamers,R. TITLE Evidence for kinetoplast and nuclear DNA homogeneity in T.evansi JOURNAL Mol. Biochem. Parasitol. (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by E.Bajyana Songa, 05-JUN-1990. 
FEATURES from to/span description site 367 516 kinetoplast minicircle common region (ORI) BASE COUNT 391 a 78 c 203 g 327 t ORIGIN 1 gaatcagtgt cttttgaggg aagtaaagta atataataga tagaaacata ataataattt 61 aatttgatag tatatacata tcaacaacga caaagagtca gtgaaattag agataaagtt 121 attgtagtta tataattaaa ttaatctatc tattatttat tttcttttat agaggagagg 181 gaataagagg gaaaattcat ggagatacta gggtgagaga gttaatagag taattgtagt 241 tgggaatatg gagtagttat aattatattg gtgaaaggaa agggctaaaa gtcgtgtagt 301 agaatagagg ttgataggaa taagtggtgg aatttgtaga aggggttggt aaaaatctat 361 aaaaattgtt aaaattggct aaaaatcggg ctgaaaaaac ggaaaatctt atgggcggtg 421 caaattttca ccatacacaa acacgtgcta ttttcggggg tttttaggtc cgaggtactt 481 cgagaggggt tggtgtaata cacacacggt ttttctcagg gtttttgagg caattcgcag 541 ttttcctgag gttctcagta cacttaattt ggatttaatt gattctatag agaaaaatag 601 aataatagat aagtaagtaa ttcatgaata tagatatata taattgtaca tataccaaca 661 aacagaataa ctaatgcaca gtgatgatga tagttaatta attatatata aagttctaat 721 ctatctatta ttatatttaa ttgagtgatg tgagaataag gtgatatttc aatcctaaac 781 aaaagaaatg tgtaatagat agaagataat gagaagttaa ttataaatat atcatacaaa 841 ataacaatga tcacagataa tgagtgaata tagatagaga atttaattat tattattgtg 901 tatattgaat tacatattta ttattttatt ttagtatata ggacgcagaa atagcagtat 961 aaaataatgg ataaaagagt ttataggtga agttgaagt // LOCUS VIBSCRRE 1524 bp ds-DNA BCT 18-JUL-1990 DEFINITION V.alginolyticus Scr repressor protein gene (scrR), complete cds, and sucrose transport (scrA) protein, 5' end. ACCESSION M35009 KEYWORDS Scr repressor protein; sucrose transport protein. SOURCE V.alginolyticus DNA. ORGANISM Vibrio alginolyticus Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Facultatively anaerobic rods; Vibrionaceae. REFERENCE 1 (bases 1 to 1524) AUTHORS Blatch,G.L. and Woods,D.R. 
TITLE Nucleotide sequence and analysis of the Vibrio alginolyticus Scr repressor-encoding gene JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by G.L.Blatch, 10-JUN-1990. Author address: G.L.Blatch University of Cape Town Dept. Microbiology Private Bag Rondebosch 7700 SOUTH AFRICA Email: bltgre01.uctvax@f4.n494.z5.fidonet.org FEATURES from to/span description pept 36 < 1 (c) sucrose transport protein (scrA) pept 271 555 Scr repressor protein (scrR) pept 791 1090 ORF5 binding 264 267 scrR ribosomal binding site binding 780 785 ORF5 ribosomal binding site signal 178 183 scrA -10 region signal 203 208 scrA -35 region signal 190 195 scrR -10 region signal 162 167 scrR -10 region signal 762 767 ORF5 -35 region signal 742 747 ORF5 -35 region site 78 91 repressor operator site (put.) site 229 242 repressor operator site (put.) site 12 71 ScrR helix-turn-helix motif BASE COUNT 466 a 344 c 321 g 393 t ORIGIN 1 agttaacaac tcttttgcta ctgccggata attcatttca taccccattg gattattatc 61 tgagcttgca gaattttcgg gaacgtttgc aaaacagatt tttgtcaata aaaccaaatg 121 atcaaagata aagcagagag aatgtgagaa tgatcgtcaa attctctccc taagagcagt 181 ataggaggtt agaatcgcta cttatcaaag agcctttcat cttctctttg caaacaatcc 241 caaaaacgaa aaacagaatg gataggaaca atggcgagtc tgcatgatgt cgcccgtctg 301 gctggcgtat caaaatcaac agtgtcacgg gttataaacg atgagtatgg cgtaaaagaa 361 gcgacaaaac agaaggtccg ccaagcagta gcggagtgtg gctatgtgcc gaaccaagtc 421 gctaaagatc ttaaagaaga aagctctgcc attattgttc aagacaaaac aggcaaagtg 481 ctgtttgtaa aagaaggcgc acttgagcaa gatgaaattg caaaggtgat cgagctaatt 541 aaacagaaca tctagtctcg ttgcttggct ccttaatatc cgtaatacac aatataagga 601 gccatgcgac caaacttaaa gctgagattt ctgttgcccc tagttgaacg cttagtgatt 661 ccaccaagga gactgtctca gattcggcga catcacatgt gttgacgtaa attttcccac 721 tagccaacaa aagagtgaac ttttactcat tatgggtgtt atattataac agtcggtaag 781 aggagctcag atgtaccacc accagcaaaa gatacggaag cattggcatc gcactgtttt 841 atttttcagt gtcgcgttgc 
tgatcgcttg gaactttgcg gtaatccttc atcaagttga 901 tctgactccc gaacaccaca cacaccatca ttgccagcta ttttctgggg ttcagcacgg 961 catagccaaa gctcaaccga ccctatcgac gccaacattt acgcgcatcc aataccatga 1021 tgtctttcag cgccttgtta atagtgaaga cattcgtggt gcagctcgtg ccccgcctta 1081 ttttgcttaa tttccaaact attccacact caacataatt tgtgcgtatt cctgcacctc 1141 acgtgtcagc gaatcgcacc ttagtaaata caaatctgat ccggtattgc tcgggttagt 1201 gccagcaaaa ttaggaaaca caaaatgtca tcgaaacata ttttagcgct tgtcattggt 1261 ctctctttat cgacagcagc aaatgcggac gaataccgcc agcacggtgc tcatgtgcat 1321 ggtcacgtcg aatttaatat cgcccaagat ggaaaggatc tgttagtgga gattactgct 1381 cccggtgccg atgtggtcgg tttcgagcat gctccagaaa atgaacaaca agagcaagca 1441 cttaaacaag ccattgccac cctagaagac agcaacacac tgtttgcgat taacgcgcaa 1501 gccaattgtg atattgaaga agct // LOCUS XELNRP1A 2431 bp ss-mRNA VRT 18-JUL-1990 DEFINITION X.laevis nervous system-specific RNA-binding protein (nrp-1A) mRNA, complete cds. ACCESSION M34894 KEYWORDS RNA-binding protein; nervous system-specific protein. SOURCE X.laevis stage 24, cDNA to mRNA, clones 24-39. ORGANISM Xenopus laevis Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Amphibia; Lissamphibia; Anura; Archeobatrachia; Pipoidea; Pipidae. REFERENCE 1 (bases 1 to 2431) AUTHORS Richter,K., Good,P.J. and Dawid,I.B. TITLE A developmentally regulated, nervous system-specific gene in Xenopus encodes a putative RNA-binding protein JOURNAL New Biol. (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by P.J.Good, 05-JUN-1990. FEATURES from to/span description pept 74 1117 pot. RNA-binding protein (nrp-1B) mRNA < 1 2431 nrp-1B mRNA signal 2407 2418 poly-A signal site 22 27 RNP2 consensus sequence site 58 68 RNP1 consensus sequence site 111 116 RNP2 consensus sequence site 147 157 RNP1 consensus sequence site 921 941 alternative exon (put.) 
BASE COUNT 663 a 522 c 543 g 703 t ORIGIN 1 gtggctacac cgccccttcc aatcctcttg tccgcgggag ctgcctccga tctcaggagt 61 cgccagagac cgaatggaga cagaagcgcc ccagcccgga ctgtcgcccc cggactccgc 121 gcacgatccc tgcaaaatgt tcattggggg actgagctgg cagacaacac aagagggact 181 gcgggagtac ttcagtcatt ttggggatgt aaaagagtgt cttgtcatga gagatccatt 241 gaccaaaaga tcaaggggtt tcggatttgt aactttcatg gaccaagcgg gagtggacaa 301 agttttggct caatcacgac atgagctgga ctcaaagacg attgatccaa aagtagcatt 361 tcctcgtaga gctcaaccca agatggtaac acggacaaag aagatttttg tagggggctt 421 gtcagtgaat acaacagttg aagatgtgaa acagtatttt gagcaatttg ggaaggtgga 481 tgatgccatg ttaatgtttg ataaaacaac aaacaggcac agaggattcg ggtttgtgac 541 ttttgaagga gaggatattg tggagaaaat ttgcgatatt cacttccatg agatcaacaa 601 taagatggtt gagtgtaaga aggcccaacc caaggaggtg atgtcaccaa caggatctgt 661 gagaggccga tctcgggtca tgccctatgg aatggatgct ttcatgctgg gaataggcat 721 gctaggatac ccgggatttc aggcagctac atatgccagt cgcagctaca caggcattgc 781 tcctgggtat acttatcagt tcccagagtt tcgtgtagag agaacccctc tcccgggcgc 841 cccagtcctc ccagaactca cagctatccc cctgacggca tatggtcctg tagcggcagc 901 agctgcagca gtggtgagag gctccactcc aacccgcaca ggggggtttc ttggaacaag 961 cagcccagga ccaatggcag aactttatgg agcagccaat caagaatcgg cagttagcag 1021 ctatatcagt gcagcaagcc cagcacccag cacaggcttt ggacacagcc ttgggggtcc 1081 tctgattgcc actgctttca ccaacggcta ccactgaaac agcaagtcac tgcagaggct 1141 tgagtctgga agcatctctt gcacgattct gttcctcccc tctctgaata tcacatggct 1201 gtacctcacc ttgtgttcct gcacagtaac tggtgcctcc tgcacaacct tcacctagga 1261 atttcttatt tatttctatt tttgttttgg ggtttttttt tcattctcct tttattagat 1321 ttgtgttttg tccaaattag aaatcaggga gacatttcca taatcttcat ggttattctg 1381 ttctggtaag atgcagagaa tcctcggtac gaaacgattc tgcagagaga gaggggagat 1441 tgtatttcga ctgtctggaa atgtgttttt tttcttcttt tttttttata tttctatgta 1501 aatttcggct cttaaaaaca tgtaaaactg aaatttggag acaacagtgc acaagaacaa 1561 gacaaatgta tattttgcta aatgaaaaag aactatttgt agaacagaaa ttattttaat 1621 tttctcaaag ctggtttgaa gttttttttt ttctgtgtaa aatatgtaaa 
tattgcagac 1681 tgattgttct gccaccggaa aggtctcttt ggcccagatt atcttttttg cctacttggg 1741 cctgtgtttt agcaatattt tcctaaagcc tgaatggatt gtcgggtttt tttatacatc 1801 ttttattatg atggatgtca gtccacagga atatgatgag ttactctgat gcaggggaat 1861 gcagatgcca tgctgcaaaa cttcttactc tttagcctct gataattaac tctttaaggc 1921 tcacttgtcc tttgctcatg gatataatga gagggctata tggagggcta taaggtggga 1981 ctaacctcag agaaccagaa aagctacagt aaagtgctgt tacacactga catacttcct 2041 cagaaccctc agaaaagtaa gtgtatgccc agtcccgatt tatgaactgt ctgccctttg 2101 atctgctctg cactctttat agttttgggt cctataaatt aaatgcagat ctaccttttc 2161 tttatgtgca gtgtgtaagt ccacattttt tgttatgagg aatcttctta acaataaaat 2221 catgtggaat tagcccccaa ccaaaccccc cccacaaact gtctctcacg tgagcaaagg 2281 aaggtgctac actataaaga gtttcttatc actgggccac acacattact gcaatgtaca 2341 atgtactgca acgtcaaagc aataatcttt gttactgcaa ctggactttt ttaaaagata 2401 gttcaaaata aactatgaat tttattcatt t // LOCUS XELNRP1B 2324 bp ss-mRNA VRT 18-JUL-1990 DEFINITION X.laevis nervous system-specific RNA-binding protein (nrp-1B) mRNA, complete cds. ACCESSION M34895 KEYWORDS RNA-binding protein; nervous system-specific protein. SOURCE X.laevis stage 24, cDNA to mRNA. ORGANISM Xenopus laevis Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Amphibia; Lissamphibia; Anura; Archeobatrachia; Pipoidea; Pipidae. REFERENCE 1 (bases 1 to 2324) AUTHORS Richter,K., Good,P.J. and Dawid,I.B. TITLE A developmentally regulated, nervous system-specific gene in Xenopus encodes a putative RNA-binding protein JOURNAL New Biol. (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by P.J.Good, 05-JUN-1990. FEATURES from to/span description pept 29 1051 pot. 
RNA-binding protein (nrp-1B) mRNA < 1 2324 nrp-1B mRNA signal 2306 2311 poly-A signal site 22 27 RNP2 consensus sequence site 58 68 RNP1 consensus sequence site 111 116 RNP2 consensus sequence site 147 157 RNP1 consensus sequence BASE COUNT 647 a 467 c 520 g 690 t ORIGIN 1 tccgatctca ggagtcgcca gagaccgaat ggagacagaa gcgccccagc ccggactgtc 61 gcccccggac tccgcgcacg atccctgcaa aatgttcatt gggggactga gctggcagac 121 aacacaagag ggcctgcggg agtactttag tcattttggg gatgtaaaag agtgtcttgt 181 catgagagat ccattgacca aaagatcaag gggttttgga ttcgtaactt tcatggacca 241 agcaggagtg gacaaagttt tggctcaatc acgacatgag ctggactcaa agacgattga 301 cccaaaagta gcatttcctc ggagagctca acccaagatg gtaacacgga caaagaagat 361 ttttgtaggg ggcttgtcag tgaatacaac agttgaagat gtgaaacaat attttgagca 421 gtttgggaag gtggatgatg ccatgttaat gtttgataaa acaacaaaca ggcacagagg 481 gttcgggttt gtgacttttg aaggagagga tattgtggag aaagtttgtg atatacactt 541 ccatgagatc aacaataaaa tggttgagtg taagaaggcc cagcccaagg aagtgatgtc 601 accaacaggg tctgtgagag gccgatctcg ggtcatgcca tatggaatgg atgccttcat 661 gttgggcata ggcatgctag gatacccggg atttcaggct gctacatatg ccagtcgcag 721 ctacacaggc attgcacctg gatatactta tcagttccca gagtttcgtg tagagagaac 781 ccctctcccg ggcgccccag tcctcccaga actcacagct atccccctga ctgcatatgg 841 tcccgtagca gcagctgctg cagcagtggt gagagggggg tttcttggaa caagcagccc 901 agggccaatg gcagaacttt atggagcagc caatcaagaa tcagcagtta gcagctatat 961 cagtgccgca agcccagcac ccagcacagg ctttggacac agtcttgggg gccctctgat 1021 tgccactgct tttaccaacg gctaccactg aaacagcaag tcactgcaga ggcttgagtg 1081 tggaagcatc tcttgcacga ttctgttcct cccctctctg aatatcagat ggctgtacct 1141 caccttgtgt tcctgcacag taactggtgc ctcctgcaca accttcacct aggaatttat 1201 ttatttcttt tttttttttt tttaatcctt ctttattaga tttgtgtttt gtccaaatta 1261 gaaatcaggg agacatttcc gaaatattca tggtattttg ttttggtaag atgcggagaa 1321 tcctcggtac taaatgattt tgcagaggga gaggggagat tgtatttcgc ctgtctggaa 1381 atgtgttttt tttttttcct tctttcctat tttttttatc ttatattttt atgtaaatct 1441 cggctcttaa aaacatgtaa aactgaaatt 
tgaagacaac agtgcacaag aacaagacaa 1501 atgtatattt tgctaaatga aaaaaaaact atttgtagaa cagaaattat tttaattttc 1561 tcaaagctgg tttgaagttt tttttttttt tctttgtaaa atatgtaaat attgcagact 1621 gattgttctg ccaccggaag gtctctttgg cccagattat cttttgccta cttgggcctg 1681 tgttttagca atattttcct aaagcctgaa tggattgtca ggttttttta tacatctttt 1741 attataatgg atgtcagtcc acaggaatat gatgcagggg aatgcagaac ttcttactct 1801 ttagcctctg ataattaact ctttaagttt cacatgtcct ttgctcatgg atataaggag 1861 agggctgtaa agtggggcta acctcagaga atcagctcca gggcacagga tttcggtgag 1921 ctacaaagct gctgtaaaat accgttacac actgacatac ttaggaatcc tcagcaaagt 1981 aaatgtatga actgtctgct ctttgatccg ctctgcactc tttatagttt tggagtacta 2041 taaattaaat gcagatctag ttcttctata tgtgcagtgt gcaagtccat ttgtgggcac 2101 ccactctcat tatggggcat cttttcaaca ataaaatcat atagaaattc ccatgaagtt 2161 gtcgctcacg tgagataagg aagttgcaac actataaaga gtttcttatc actgggccat 2221 atacaatact gcaatgtaca atgtactgca aattcaaagc aataatcttt gttactgcaa 2281 ctggactttt ttaaagatag ttcaaaataa attgaatttt attc // LOCUS YSJSACDHY 2569 bp ds-DNA PLN 18-JUL-1990 DEFINITION Y.lipolytica saccharopine dehydrogenase (LYS5) gene, complete cds. ACCESSION M34929 KEYWORDS saccharopine dehydrogenase. SOURCE Y.lipolytica (strain W29) DNA. ORGANISM Yarrowia lipolytica Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes; Endomycetales; Saccharomycetaceae. REFERENCE 1 (bases 1 to 2569) AUTHORS Xuan J,-W., Fournier,P.M., Declerck,N., Chasles,M. and Gaillardin,C. TITLE Overlapping reading-frames at the LYS5 locus in the yeast Yarrowia lipolytica JOURNAL Mol. Cell. Biol. (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by P.M.Fournier, 06-JUN-1990. ORF1 is probably not translated, since a lacZ fusion/expression experiment did not produce results. FEATURES from to/span description pept 649 1929 ORF1 pept 1764 655 (c) saccharopine dehydrogenase mRNA 582 2060 ORF1 mRNA (alt.) mRNA 598 2060 ORF1 mRNA (alt.) 
mRNA 599 2060 ORF1 mRNA (alt.) mRNA 602 2060 ORF1 mRNA (alt.) mRNA 606 2060 ORF1 mRNA (alt.) mRNA 609 2060 ORF1 mRNA (alt.) mRNA 1807 639 (c) ORF2 mRNA (alt.) mRNA 1809 639 (c) ORF2 mRNA (alt.) mRNA 1811 639 (c) ORF2 mRNA (alt.) mRNA 1812 639 (c) ORF2 mRNA (alt.) mRNA 1814 639 (c) ORF2 mRNA (alt.) signal 521 528 TATA box signal 565 575 TATA box BASE COUNT 668 a 573 c 666 g 662 t ORIGIN 1 gagtctggaa cggtccctct tctcagaatt gtgaacgaaa ggaacaggtt caacgaacag 61 aacagattcc atagccacac ctttgcctat ggcttcacaa ccgaaggcaa ttcgagaggt 121 cgcgcttatg gaatcgactc gtataaagct gaagggaaag ggagacgttc cgagcgctca 181 gatgcaatag tcgtccagct aatgtggatt caaaaacaac cccaacagta atcttgaaaa 241 tttgaacgga tcaatctgaa cactcttgct ccaggtcatt cttctaacgc acatccccag 301 agtctagagg gagttgtgtt gtgaacatcc taataaacaa tgcaatggat tcgggatatc 361 ttctgtctcg ccccctactc gatgtcgagt aaaccgatca ccaactaaca atactcctcc 421 gcgttctgcc attgactctc aaacagacat cgctatcaac ggaacagcat attttagctt 481 cttaggacaa taaatattga taatgccggc tctccctcgg tatattaagc aatccattca 541 tacactcatt catcaggtta attttatata tataatttgt ctattcaaac accgtaaatt 601 actggtacca tcatctcctc cttttcaaat acacgtctat ttgcattaat gaaattactc 661 gccaattcgc agaacgtgtt tgtcgaacag agccttagct cgggtccaga caggagcagt 721 gtctcgctga ggaagctgca ggagagaagg cagcagagcc tcagaaaagg cctcggaagc 781 ctctcgcggc agaagagagg gcaggtggtc gatcgagcac acggacagct tggggccagc 841 agtggtctca acaggcacgg tgggatggtc gaacgtggta gcaattgtgt acacggggac 901 agggttgtgg gggttggtgg tgtcagcaga gacgtcgaca atgacactga gctttcgggt 961 ctccttgttg agcagatcgt agttgatgaa aggaggaatg ggctgagaca ggtagatgca 1021 gttgatgaag atatccgcgt cagcaatctc ttggaagggt ccacccttct tggtctcgtt 1081 catgtcccat cgaatgatgt tctcttcggg gataccgacc tttcgggcca gatcaatggc 1141 accggatcca catcggccaa gagcaccaat caccaggacg gtagggagct tggagccctt 1201 ctcaacagca gcggcaagat ccttcttaat cttgtcgaca agctcggtct cattgggata 1261 ggcagagaca ccgggcaggt tttcgctgtc ggggtgggtc tgctggaagg cccaagtctc 1321 gacaccgatg gcggcaccgg cgaatccagc gtggaagcca aaggcggcaa 
ctcgtcgtcc 1381 attgtcatcc tccaggaact caaggtcgta cagagttccg tttcctgcgg ggaatcggct 1441 cagaacgtcc ttccatccgc cctgatcctt gtagcagtga gcaaactgga tgtgctcgtg 1501 agacagaggg aaagattcct caggcagctc cttaagacca ataatcattc ggtcctcggg 1561 ggcagagacc caagagccct cctcgacaag agtggctccg acatcgacaa actcctggtc 1621 gtcgaagatt cgaaggggag acttctccac aaagacctcg aatccagcat caagcagctt 1681 tcgggtggta gtaggcgtga gagcagatcg gtgctcgaga ggcttggtct cggctcggag 1741 atggagcttc actggtgcgg tcattgtgtc tgtgttgttg tatgtggtga agtgacaatg 1801 gatggtgtgt cgtatcaggc tggggaacaa ttgtgcttaa gtatgctgca gttgagtaag 1861 agtcatcgat ccaccaaaat aaagtttgcc attagggttg gagagagaga tggtggctgg 1921 aagaattaaa tgacatcaag ctgaggattg tgggtgtgca ataacacatg ttaggggtga 1981 cctgtggctc gaaatctgat aattattttg taactttatg attattctta gattttttaa 2041 tattcctcta tataacacat aagtagctgt cgtctagttg ttcatagcct gactcctgca 2101 atagattagt gcagagtgat tttgtgcaat tgagagccac ggttgagtca agtgactttg 2161 tgtgtgaagt catcttacgt ttcaagtctc acaggttact caattggttg gttgtctgcc 2221 ctttacagat atttacagta cctgagcgta aagtcgttca tccacggaat gactgttcct 2281 gtcacgcagt catgatcatg gatgtggctg gtcaggaacc attttggata ggagacttag 2341 ggattggact attattgaaa aaactgagcc gaatatgata tagttctatt tgaatgcaga 2401 acttctgatg gtcaattcac ttatttcagg catatcggtc atggtggcag ctgccacgat 2461 gttatctcgt tggaaacctc gtccgccatt aaccggaaca tgcttctaga agatgagaat 2521 acagcatgta cttgtacaag attctcggca ttgatggtct caggcatgc // LOCUS MVISAA1A 772 bp ss-mRNA MAM 18-JUL-1990 DEFINITION Mink serum amyloid A1 mRNA, complete cds. ACCESSION M34953 J05445 KEYWORDS serum amyloid A1. SOURCE Mink liver, cDNA to mRNA, clone pmiSAA1. ORGANISM Mustela vison Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Carnivora; Caniformia; Musteloidea; Mustelidae. REFERENCE 1 (bases 1 to 772) AUTHORS Marhaug,G., Husby,G. and Dowton,S.B. TITLE Mink serum amyloid A protein--expression and primary structure based on cDNA sequences JOURNAL J. Biol. Chem. 
(1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by G.O.Marhaug, 07-JUN-1990. FEATURES from to/span description pept 29 418 serum amyloid A1 mRNA < 1 772 serum amyloid A1 BASE COUNT 176 a 205 c 200 g 191 t ORIGIN 1 cagctctgct ccaccagaac tcagcaccat gaagcttttc acgggcctca tcttctgctc 61 cttggtcctg ggggtcagca gtcaatggta ttcattcatt ggtgaggctg ctcaaggagc 121 ttgggacatg tacagagcct actctgacat gatagaagcc aagtacaaaa attcggacaa 181 atacttccat gcccgcggga actatgatgc tgcacaaagg ggccctgggg gtgcgtgggc 241 tgctaaagtc atcagtgacg ccagagagcg ttctcagaga atcacagacc ttatcaagta 301 tggagacagc ggccacggag tggaggactc gaaggctgac caagctgcca acgaatgggg 361 ccggagtggc aaagacccca accacttccg accccctggc ctgcctgaca agtactgagc 421 tttcccttgg ctctgccctg gggagatggg ctgtgaggcc cctgaaggca agaacagttg 481 ctgagttaga gttcctgaat tgtataccct tcctactgaa tactttaaag agcacattaa 541 aatgtttagt aaatgcttgt gaatgggcac ctggatggcc cagttggtta agcggctgtc 601 ttcggctcat gtcatgatcc tagggtcctg ggaccgagtt ctccatgggg ctccttcctg 661 ggcagggagt ctgcttctct gcctgcctct ctgcctctgc ctgccactct gcctgcttgt 721 gtgctctctc tccctctttc tctgacaaat aaataaataa ataaaatctt tt // LOCUS MVISAA2A 552 bp ss-mRNA MAM 18-JUL-1990 DEFINITION Mink serum amyloid A2 mRNA, complete cds. ACCESSION M34954 J05445 KEYWORDS serum amyloid A2. SOURCE Mink liver, cDNA to mRNA, clone pmiSAA2. ORGANISM Mustela vison Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Carnivora; Caniformia; Musteloidea; Mustelidae. REFERENCE 1 (bases 1 to 552) AUTHORS Marhaug,G., Husby,G. and Dowton,S.B. TITLE Mink serum amyloid a protein - expression and primary structure based on cDNA sequences JOURNAL J. Biol. Chem. (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by G.O.Marhaug, 07-JUN-1990. 
FEATURES from to/span description pept 31 420 serum amyloid A2 precursor sigp 31 84 serum amyloid A2 signal peptide matp 85 417 serum amyloid A2 mRNA < 1 552 serum amyloid A2 mRNA BASE COUNT 129 a 144 c 161 g 118 t ORIGIN 1 agcagctctg ctccaccaga actcagcacc atgaagcttt tcacgggcct catcttctgc 61 tccttggtcc tgggggtcag cagtcaatgg tattcattca ttggtgaggc tgttcaaggg 121 gcttgggaca tgtacagagc ctactctgac atgagagaag ccaattacaa aaattcggac 181 aaatacttcc atgcccgtgg gaactatgat gctgcacaaa ggggccctgg gggcgcctgg 241 gctgctaaag tcatcagtga cgccagagag cgttctcaga gagtcacaga ccttttcaag 301 tatggagaca gcggccacgg agtggaggac tcgaaggctg accaggctgc caacgaatgg 361 ggccggagtg gcaaagaccc caaccacttc cgaccctctg gcttgcctga caagtactga 421 gcttcccctt ggctctgtcc tggggggatg ggctgtgagg accctgggga cagggacacc 481 cagtcattga gttctctgtc cccagaggct agtggagggc acctaatagg tgtctaataa 541 atgcttactg ag // LOCUS TFERUBISCO 2172 bp ds-DNA BCT 18-JUL-1990 DEFINITION Pot. Thiobacillus sp. ribulose 1,5 bisphosphate carboxylase (RUBISCO) large and small subunit genes, complete cds. ACCESSION M34536 KEYWORDS ribulose 1,5 bisphosphate carboxylase. SOURCE Pot. Thiobacillus sp. DNA, clones pAH5.0 and pAH196. ORGANISM Thiobacillus sp. Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Colorless sulfur bacteria. REFERENCE 1 (bases 1 to 2172) AUTHORS Stein,J.L., Haygood,M. and Felbeck,H. TITLE Nucleotide sequence and expression of a deep sea ribulose 1,5 bisphosphate carboxylase gene cloned from a chemoautotrophic bacterial endosymbiont JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by J.L.Stein, 23-MAY-1990. Author address: J.L.Stein Scripps Institution of Oceanography UC San Diego, A-002 A-002, Marine Biol Res Div. La Jolla, CA 92093 The taxonomy of this organism could not be determined. It can only live as a symbiont of Alvinoconcha hessleri, a gastropod. 
FEATURES from to/span description pept 201 1616 ribulose 1,5 bisphosphate carboxylase large subunit (EC 4.1.1.39) pept 1685 2050 ribulose 1,5 bisphosphate carboxylase small subunit signal 19 24 pot. -35 region signal 43 49 pot. -10 region binding 188 194 ribosome binding site binding 1673 1676 ribosome binding site BASE COUNT 441 a 741 c 641 g 349 t ORIGIN 1 atccttatcc tgaccctcaa tacataatcg tgagcttatg gctataatca gaaagtctga 61 gttcacttta tagatgacct ctcccatgct cactctcaat cctgagtgat tcactcggat 121 tggacccgat aaccgtgcgg gctgtcggac gctaggcgtc cggtgcccct cgaacaccga 181 gcctctggag aggacaagca atggcgaaga aatacgatgc gggcgtaaaa gactaccgcg 241 aaacctactg gatgccggac tacacgccga aggagaccga cctgctggcg tgcttcaaga 301 tcattccgca gccgggcgtg ccgcgcgagg aagcccgggc cgccgtggcc gccgagtcgt 361 ccaccggcac ctggaccacc gtgtggaccg acctgctgac cgacctcgac cactacaagg 421 gccgtgccta cgccatcgag gacgtgccgg gcgacgagga ggccttctat gccttcatcg 481 cctaccccat cgacctgttc gaggaaggct cggtggtcaa cgtgtttacc tctctggtgg 541 gcaacgtgtt cggcttcaag gccatccgcg ccctgcgtct ggaggacgtg cgcttcccca 601 tcgcatacgt catgacctgc aacggtccgc cgcacggcat ccaggtcgag cgcgacatca 661 tgaacaagta cggccgtccg ctgttgggct gcaccatcaa gcccaagctg ggcctgtcgg 721 ccaagaacta cggtcgcgcc gtgtacgaat gcctgcgcgg cggcctggac ttcaccaagg 781 acgacgagaa cgtcaactcc cagcccttca tgcgctggcg gcacggtttc gacttcgtga 841 tggaggccat cgagaaggcc gagcgcgaga ccggcgagcg caagggtcac tacctgaacg 901 tcaccgcgcc caccccggac gagatgttca agcgtgccga gtacgccaag gagatcggcg 961 cgccgatcat catgcacgac tacatcaccg gcggcttcac cgccaacacc ggcctggcgc 1021 agtggtgccg tgacaacggc gtgctgctgc acatccaccg cgccatgcac gccgtgctcg 1081 accgcaaccc gcaccacggc atccacttcc gcgtgctgac caagatcctg cgcctgtccg 1141 gcggcgacca cctgcacacc ggcaccgtgg tcggcaagct ggagggcgac cgcgaggcga 1201 ccctcggctg gatcgacctg ctgcgcgagt cctacatcaa ggaagaccgt tcgcgcggca 1261 tcttcttcga ccaggactgg ggctccatgc ccggcgtctt tgcggcctgc tccggcggta 1321 tccacgtctg gcacatgccg gcgctggtca ccatcttcgg cgagcatgcc gtgctccagt 1381 tcggtggcgg caccctgggt 
cacccctggg gcaacgccgg cgccgcggcc aaccgggtgg 1441 cgctggaagc ctgcgtcgag gctcgcaacg agggccacga gctggagaaa gagggcaagg 1501 acatcctcat ccaggccgcc aagcacagcc ccgaactcaa gaccgcgatg gagacctgga 1561 aggaaatcaa gttcgagttc gacaccgtgg acaaactgga cgtggcgcac aaataagcgc 1621 cccgccaccc cgggccggtc gtggctggcc gcgccacgca tccccaattt tcgaggtatc 1681 aacaatgagc gaaatccaag actacaactc gagcgtcagc gatccgtcga gccgcaagtt 1741 cgagaccttc tcctacctgc ccgaactcgg cgtggaaaag atccgcaagc aggtcgagta 1801 catcgtcagc aagggctgga acccggccgt cgagcacacc gagccggaga acgccttcga 1861 ccactactgg tacatgtgga agctgccgat gttcggcgaa accgacgtgg acgccatcct 1921 ggccgaggcc gaggcatgcc acaaggcgca tccctcgcat cacgtgcgcc tgatcggcta 1981 cgacaactat gcccagtcgc aaggcactgc catggtgatc ttccgcggcc ccatcagcgc 2041 caagtgctga gccaccgggg ccgtaaacac gcgactgccg aggtctgccg gcccccgtcg 2101 catggcgacc cccgagggca tccgccaggt gcgtcacctc gcgatgttct tccgaacatt 2161 ccaataagcc ag // LOCUS LSVGPCN 3417 bp ss-RNA VRL 18-JUL-1990 DEFINITION Lassa virus glycoprotein (GPC) and nucleoprotein (N) genes, complete cds. ACCESSION M36544 M33879 KEYWORDS glycoprotein; nucleoprotein. SOURCE Lassa virus (strain GA391), cDNA to viral RNA. ORGANISM Lassa virus Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses; Arenaviridae. REFERENCE 1 (bases 1 to 3417) AUTHORS Wilson,S.M. and Clegg,J.C.S. TITLE Sequence analysis of the S RNA of Lassa virus JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable [or printed] sequence for [1] kindly submitted by J.C.S.Clegg, 26-APR-1990. 
Author address:J.C.S.Clegg PHLS Centre for Applied Microbiology and Research Porton Down, Salisbury SP4 OJG UNITED KINGDOM FEATURES from to/span description pept 71 1543 glycoprotein (GPC) pept 3315 1603 (c) nucleoprotein (N) BASE COUNT 944 a 801 c 706 g 966 t ORIGIN 1 gcaccgggga tcctaggcat ttaggattgc gcattttaaa acctcctttt tggaaagtgt 61 cgcaatcagg atgggacaga ttgtgacatt cttccaagaa gttcctcatg ttattgagga 121 agtgatgaat attgtcctta ttgcactatc catcctagca attctgaagg gactatacaa 181 tgttgccacg tgtggcttga tagggcttgt cacattcctt ctactttcag gaaggtcatg 241 ctcactgatc tacaaaggga cttacgagct gcaaaccctt gagttgaata tggagactct 301 taatatgacc atgccgctat cctgcacaaa gaacaacagt catcattata taagggtggg 361 gaatgagaca ggacttgagc tcaccttgac caacaccagc attcttaatc acaaattctg 421 taacctctct gatgcccaca aaaggaatct ttatgatcac agcctcatga gtatcatctc 481 tacctttcat ctgtccattc ccaacttcaa tcaatatgag gcaatgagct gcgatttcaa 541 tggggggaaa atcactgtgc aatacaacct gagtcatagc ttcgcagtgg atgcagcagg 601 tcactgcggc acacttgcaa atggtgtctt acaaacattt atgagaatgg cttggggagg 661 gagttatatt gctcttgatt ctggacgcgg taactgggac tgtataatga ccagttacca 721 atatctaatc attcagaata caacctggga tgaccactgc caattttcca gaccatcacc 781 tattggctac cttgggcttc tctcacaaag aactagagac atatacatta gcagaaggtt 841 gttggggaca ttcacctgga cactatcaga ctcagaggga aatgaaacac cagggggata 901 ctgccttact agatggatgt tgattgaggc cgaattaaag tgctttggaa acactgcagt 961 agccaagtgc aatgagaaac atgatgaaga attttgtgac atgctaaggt tgttcgattt 1021 caacaaacag gccataagga ggctcaaaac agaggcccaa atgagcatac agctgatcaa 1081 caaggctgtc aatgctttaa taaatgatca gctcatcatg aagaaccact tgagagacat 1141 catgggcata ccatattgta actacagcag atattggtac cttaaccaca catcaacagg 1201 aaagacctca ctaccaaggt gttggcttat ctcaaatgga tcatatctca atgagaccaa 1261 gttttcagat gacatcgaac aacaagctga caacatgata acagagatgc tacaaaagga 1321 atacatagat agacagggca aaactccact ggggttagtt gacctatttg tttttagcac 1381 aagtttctat ctgataagca tctttctcca cctggtcaaa ataccaaccc atagacacat 1441 tgtaggtaaa ccttgcccaa aaccccacag 
gctgaaccac atgggcatct gctcctgtgg 1501 tctatacaaa cagccaggtg tgcctgtcag atggaagagg tgaaatccca cagggccccc 1561 gtgacccacc gccaattggc ggtgggtcac gggggcgtcc atctacagga cgactttagg 1621 tgttgaagtt ctgaacacca tgtctctagg gagcacaact ctcaggactt ttgcatcaag 1681 tcctcctgaa actgctgcat caaacataat gcaatccatc agtgcacaat gaggagttat 1741 ttcctcttta ccacctctct tctttttctc cacaactacc ccagtgtgca tgtgacatag 1801 atccttgaat tgatcccaaa cagcattctc aaactttctt gaatctgctt tactaagagt 1861 gatgtcaatc agttttatgt ctctcctccc ttgtgactca aggagttttc tgatatcctc 1921 tgatccttgg caagtgatga ccatgttccg aggaagggct tctatcactg cactggttaa 1981 cccaggttgg gcagcaaaca aatcagtcac atccctacca tgtgaatact ttgcatcttg 2041 tttgaattgc ttcaaatctg ttggctccct aaagaaatgt acatagcaac ccgagctcgg 2101 ttgataaagg gctatctcaa ctgggtcttc tggtcttcct tcaatatcca tccaggtttt 2161 tgcgttggga atcaagttga agcacttgaa atctttgaga gtcatcaact gagagtagat 2221 taatcctgca gcaaagcctg cagactgtaa tgacttgttg gaccccccat ttccaatttt 2281 ctggggcttg ttgtcttgct caaggtccac cacagtattt tcccatgctc ttcccacaat 2341 cgaggtcctt gatgcaatat aggaccatcc gtctcctgag agacagatct tgtataggat 2401 gttctcataa gggttccttt cacccggtgt gtctgataca aacattccca gactcttctt 2461 aactttcaag attgacttca agataccatc catggtctga ggtgaaacct taatagtctc 2521 taacatgtta ccaccatcaa gcatgcaggc ccctgctttg acagcagcac ccaaactgaa 2581 attgtaacca gagatgttga gtgaactttt cttagtgtca atcatattca gaattggatg 2641 actctgagac aatctgtcga gatcagatga gttggggtat ttggctgtgt aaatcagccc 2701 taaatctgtc aaagcttgaa cggcatcatt caggtccact tgcccctgtt tggtcaggca 2761 tgctaaagtt aggcttggca ttgttccgaa ctgattgttg agtaactctg catttttgac 2821 atcccaaact ctcacaatgc cattactggc accctttccc cctccactta ctccagtcat 2881 gccaatcatg ttcaaaaggg ctctcctttg atcaagctgt tgtgaactca aattccccat 2941 ataaactcct gaactcaaag gcctttctgt tctggtgact tttgatttca gtttttctaa 3001 atcagcggcc aggattagta gatcgtctga acttaaggtt ccaactctta agacactttt 3061 ctgctgtgtg gatttgagct caacaagatt gttgactgct tgattgagat ctctcagtcg 3121 ttttaggtcg ccatcatctc ttttctgctt gcgcatcaat 
ctctgaacat tactgacctc 3181 ggagaagtca agaccatgaa ggagagcttg agcgtcttta actacctgca actttatgtt 3241 ggaacagtag ccagatagtt cccttcttag ggattgagtc cacaagcatg acctcacttc 3301 cttggaagca ctcattgtcg tgatggttgt ctgacccttg agtgggtctt gaatgtggtc 3361 actccaaagg tttgattagt gcaaagcgca atccaatagc ctaggatcca ctgtgcg // LOCUS MOPGPCNC 3419 bp ss-RNA VRL 18-JUL-1990 DEFINITION Mopeia virus glycoprotein (GPC) and nucleocapsid protein genes, complete cds. ACCESSION M33879 KEYWORDS glycoprotein; nucleocapsid protein; small RNA segment. SOURCE Mopeia virus (strain 800150), cDNA to viral RNA. ORGANISM Mopeia virus Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses; Arenaviridae. REFERENCE 1 (bases 1 to 3419) AUTHORS Wilson,S.M. and Clegg,J.C.S. TITLE Sequence analysis of the S RNA of the African arenavirus Mopeia: An unusual secondary structure feature in the intergenic region JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable [or printed] sequence for [1] kindly submitted by J.C.S.Clegg, 26-APR-1990. 
Author address: J.C.S.Clegg PHLS Centre for Applied Microbiology and Research Porton Down, Salisbury SP4 OJG, UNITED KINGDOM FEATURES from to/span description pept 47 1516 glycoprotein pept 3352 1640 (c) nucleocapsid protein BASE COUNT 875 a 819 c 736 g 989 t ORIGIN 1 gtggatccta ggctttttgg ttgcgcattt ctagagcatc tcggagatgg ggcagatagt 61 caccttcttt caagaggtgc cacacatcct tgaagaagtg atgaacattg tgctgatgac 121 cctctcaatc ttggccatcc taaagggcat ctacaatgtg atgacctgtg gaatcatcgg 181 tttgataaca tttttgtttt tgtgtgggag atcatgctca agcatctata aggacaacta 241 tgagttcttc tctctcgacc tcgacatgtc ttcactgaat gcaacgatgc ctctctcctg 301 ctcaaagaac aactcccatc actacatcca agttgggaat gagacaggcc tagagctgac 361 attgacaaac actagcataa ttaatcataa gttttgcaac ctgtctgatg ctcacaggag 421 gaatctttat gacaaagcac ttatgtcaat cttgacaaca ttccacctaa gcattccaga 481 ttttaaccaa tatgaggcaa tgtcatgtga ttttaacgga gggaagatct cagttcaata 541 caacctgtcc cactcaaatt atgtggatgc cggcaaccac tgtggcacta ttgcaaatgg 601 cattatggat gtttttagaa gaatgtattg gagcacctcc ctttcggttg cctctgacat 661 aagtgggact caatgcatac agaccgatta taagtatttg attattcaga acacatcatg 721 ggaggatcat tgcatgttct caagaccctc acccatggga tttttaagcc ttctgtcaca 781 gagaaccagg aatttctaca tctcaagaag acttttgggt ctttttacat ggactttgag 841 tgactcggaa gggaacgaca tgccaggtgg ttactgtctc acaaggtcta tgctaatagg 901 gcttgatctg aaatgctttg ggaacactgc cattgcgaag tgtaatcagg cacatgatga 961 agaattctgt gacatgctcc gcctttttga ttttaataaa caagcaatta gcaaactgag 1021 gtctgaagtt cagcagagca taaatttgat aaataaagct gtcaacgccc tcatcaatga 1081 tcaactggtc atgaggaacc atttgagaga tctaatgggt attccttatt gcaactactc 1141 taagttttgg tacttgaatg atactaggac agggagaaca tccctcccaa agtgttggct 1201 tgtgaccaat gggtcatacc taaatgaaac ccagttctca acagaaattg agcaagaagc 1261 caataacatg ttcactgaca tgttgaggaa ggagtatgag aaaaggcaga gcacaacacc 1321 tctggggtta gtagatcttt ttgttttctc cactagcttt tatttgatct ccgtgttcct 1381 ccacctaatc aaaattccaa cacatagaca cataaaaggc aagccttgcc cgaaaccaca 1441 cagactcaat cacatggcaa tctgttcctg tggcttctat 
aagcaaccag gtctccccac 1501 acaatggaaa aggtgaagaa caggggcacc cccgagaccc accgccttcg gcggtgggtc 1561 tcgggggcct ggagtgactg ttccttgatg gcctccacga cccaccgcca gaggcggtgg 1621 gtcgtggagg tgtcccttct tacaggacaa ctctgggagg acctgttcta aacaccaggt 1681 ctctactcaa gactggtctg ggggttggga tttgtggact ccctatgact gctgcttcaa 1741 acatgagaca atccagcaat gcacagtgtg gtgtcacttc ttctttgcca cctctcttct 1801 tcttttgcgt tacaatcccc gtatgcattc tacaaaggtg tttgtattca tcccagatct 1861 tatcctcaaa ttttcttgca tcgtctttct gcatggacac atcaatcagt tttatgtccc 1921 tcctgttctg ggagtcaaga agctttctga tgtcatcagc accttgacac gacaagacca 1981 tgttctttgg caggctctca attacagcac ttgtcaaccc cggctgaact gagaataggt 2041 cttggatgtc aatgccatga gagtgtttgg agtcttgttt gaattgttta atgtctgttg 2101 gttccctgta aaaatgaata tactgaccat tattaggttg atagatagct atctccacgg 2161 ggtcttcagg tctcccttct atgtcaatcc atgtcttagc atttggatcc aattgcaaca 2221 tgctgtcttt gagttccatt gtctgagagt aggtcaaacc agcaggcatg ccttgggcct 2281 gagcgagtct gttggagtta cctttgtcag gctttgcatt ctgttgcaca tcaccactca 2341 gatcaacagt tgtgttatcc caagccctgc cgacgatcga tgttcttgat gctatgtaag 2401 gccaaccctc tccagaaaga catagtttgt agaggagatt ttcataaggg ttcctttccc 2461 ctggcgtgtc agagacaaac attcccaaag aacgcttgac tttcaaggca gccttcaaaa 2521 ttccttccaa attggatttt gttactttga tggtctccag catgtttccg ccgtctatta 2581 gacaagcacc agctttcacc gcagctgaca ggctaaaatt gtaccctgat atgtttatgg 2641 cacttttctt gggctcaatg atccccaaga ttgggtgggt ctgagagagt ttgtctaagt 2701 cagacatgtt tggatacttt gcagtgtaca caagccccaa gtctgacaac gattggatca 2761 catcattcag atctgcctgc ccttgtttgc acatgcaagc aatcgtcagg cttggcattg 2821 tcccaaactg attgtttaaa agctctgggt ttcgaacatc ccacactcta acaatgccat 2881 ctccgagagt gtttcctgcc cggaacccgc cacccattcc gaccatctgt agcaaaaccc 2941 tcctctgttc tagctgctga gctgttaggt tgcccatata gactccagca gcaagaggcc 3001 tctcacctct gatgactttt gctttgagtc tgtccaggtc ggcagcaagg acgaggaggt 3061 catcagaggt tagtgtcccc actctcaaaa cattcttctg ttggactgac tttaactcaa 3121 ctagattgtt cactgcctgg tttaggtccc tcaatctttt caggtcagag 
tcatccctct 3181 tctcctttct catcaacctt tgaacattgg caacttcaga gaagtccagc ccatgaagaa 3241 gtgcttgagc atccttgatg acttggatct ttatgttgga gcagtaccct gagagttctc 3301 tcctcaggct ctgtgtccac aagaaggact tcacctcctt tgaattggac attgttggag 3361 gcttttcggt gcagagctga ttccaaaggc gcaatcaatt agcctaggat ccactgtgc // LOCUS HIV2D194 9399 bp ss-RNA VRL 18-JUL-1990 DEFINITION Human immunodeficiency virus type 2 (HIV-2), complete proviral genome. ACCESSION J04542 KEYWORDS . SOURCE Human immunodeficiency virus type 2, isolate D194. ORGANISM Human immunodeficiency virus type 2 Viridae; ss-RNA enveloped viruses; Positive strand RNA virus; Retroviridae; Lentivirinae. REFERENCE 1 (sites; for [2]) AUTHORS Kuehnel,H., von Briesen,H., Dietrich,U., Adamski,M., Mix,D., Biesert,L., Kreutz,R., Immelmann,A., Henco,K., Meichsner,C., Andreesen,R., Gelderblom,H. and Ruebsamen-Waigmann,H. TITLE Molecular cloning of two West African human immunodeficiency virus type 2 isolates that replicate well in macrophages: A Gambian isolate, from a patient with neurologic acquired immunodeficiency syndrome, and a highly divergent Ghanian isolate JOURNAL Proc. Natl. Acad. Sci. U.S.A. 86, 2383-2387 (1989) STANDARD full staff_review REFERENCE 2 (bases 1 to 9399) AUTHORS Kuehnel,H., von Briesen,H., Dietrich,U., Adamski,M., Mix,D., Biesert,L., Kreutz,R., Immelmann,A., Henco,K., Meichsner,C., Andreesen,R., Gelderblom,H. and Ruebsamen-Waigmann,H. JOURNAL Unpublished (1989) STANDARD full staff_review COMMENT The gag coding sequence appears to be distorted at the 5' end by a frameshift; the env cds has a premature stop codon. The 5' ends of the pol and gag genes have not been determined. 
FEATURES from to/span description pept < 851 2053 gag polyprotein (AA at 851) pept < 1657 4878 pol polyprotein (AA at 1657) pept 4808 5455 vif protein pept 5283 5621 vpX protein pept 5621 5938 vpR protein pept 5784 6079 tat protein, exon 2 (first expressed exon) 8212 8308 tat protein, exon 3 (AA at 8213) pept 6010 6079 rev protein, exon 2 (first expressed exon) 8212 8453 rev protein, exon 3 (AA at 8214) pept 6086 6886 env ORF pept 8471 9244 nef protein LTR 1 239 5' LTR LTR 8856 9399 3' LTR rpt 1 113 R repeat 5' copy binding 243 259 primer (Lys-tRNA) binding site BASE COUNT 3206 a 1924 c 2357 g 1911 t 1 others ORIGIN 1 agtcgctctg cggagaggct ggcagattga gccctgggag gttctctcca gcactagcag 61 gctccacgct tgcttgctta aagacctctt aataaagctg ccagttagaa gcaagttaag 121 tgtgtgttcc catctctcct agtcgccgcc tggtcattcg gtgttcatct gagtaacaag 181 accctggtct gttaggaccc ttcccgcttt gagaatccaa ggcaggaaaa tccctagcag 241 gttggcgccc gaacagggac ttgaaagagg actgagaagc cctggaacac ggctgagtga 301 aggcagtaag ggcggcagga acaaaccacg acggagtgct cctagaaaag cgcgggccga 361 ggtaccgaag cggcgtgtgg agcgggagtg aaagaggcct ccgggtgaag gtaagtacct 421 acaccgaaaa ctgtagccag aaaaggcttg ttatcctacc tttagacagg tagaagattg 481 tgggagatgg gcgcgagaaa ctccgtcttg agagggaaaa aagcagacga attagaaaaa 541 gttaggttac ggcccaacgg aaagaaaaga tacaggttaa aacatgttgt gtgggcagcg 601 aatgaattag acagattcgg attggcagag agcctgttgg aatcaaaaga aggttgccaa 661 aagattctta aagttttaga accattagta ccaacagggt cagaaaattt aaaaagcctt 721 tttaataccg tctgcgtcat ttggtgcttg cacgcagaag agaaagtgaa agatactgaa 781 gaagcaaaga aactagcaca gagacatcta gtggcagaaa caggaactgc agagaaaatg 841 ccaaatataa gtagaccaac agcaccacct agtgggaaag ggaggaaact tccccgtgca 901 acaggcaggc ggcaactata tccatgtgcc gctgagcccc cgaactctaa atgcttgggt 961 aaaattagta gaggaaaaga agttcggggc agaagtagtg ccaggatttc aggcactctc 1021 agaaggctgc acgccctatg atatcaatca aatgcttaat tgtgtgggcg atcaccaagc 1081 agctatgcaa ataatcagag aaattattaa tgaggaagca gcagattggg atgcgcagca 1141 cccaatacca ggccccttac cagcagggca 
gcttagagac ccaagggggt ctgacatagc 1201 aggaacaaca agcacagtag atgaacagat ccagtggatg tataggcaac caaatcccgt 1261 gccggtaggg aacatctaca ggagatggat ccagataggg ctacagaaat gtgtcaggat 1321 gtacaaccca actaacatct tagatgtgaa gcagggacca aaagaatcgt tccagagcta 1381 tgtagacaga ttctacaaaa gcctaagggc agaacaaaca gacccggctg taaaaaattg 1441 gatgacccaa acgctgctaa tacagaatgc caacccagac tgcaagttag tattaaaagg 1501 actagggatg aatcccaccc tagaggagat gctgactgcc tgccagggag taggcggacc 1561 aagccagaaa gccagactaa tggctgaagc cctaaaggag gctttgacgc cagcccctat 1621 cccatttgca gcagcccaac aaagaagggc aattaggtgt tggaattgtg gaaaggaggg 1681 acactcggcg aaacagtgcc gagcacccag aagacagggc tgctggaagt gtggcaagtc 1741 aggacacatc atggcaaact gcccggaaag acaggcaggt tttttaggga tgggcccacg 1801 gggaaagcag ccccgcaact tccccgcggc ccaagctcct caggggctga taccaacagc 1861 acccccaata gatccagcag tggacctgtt ggagaaatat atgcagcaag ggagaaagca 1921 gagagagcag agggagagac catacaagga ggtgacggag gacttactgc acctcgagca 1981 gggagagacg ccccacagag gggcgacaga ggacttgcta cacctcaatt ctctctttgg 2041 aaaagaccag tagtcacagc attcatcgag gatcagccgg tagaagtctt actagacaca 2101 ggagctgatg actcaatagt agcaggaata gagttagggg acaattacac tccaaaaata 2161 gtggggggaa tagggggatt cataaatacc aaagaatata aaaatgtaga aataaaggta 2221 ctaaataaaa gagtaagagc caccataatg acaggagata ccccaatcaa catttttggc 2281 agaaatattc tggcaacctt aggcatgtca ttaaacctac cagtcgccaa gttagaccca 2341 ataaaagtaa cattgaagcc agggaaagat ggaccaaggc tgaaacaatg gcccctaaca 2401 aaagaaaaaa tagaagcact aaaagaaatt tgtgaaaaaa tggaaaggga gggccaacta 2461 gaagaagcac ctccaactaa tccttataat acccccacat ttgcaattaa gaaaaaggac 2521 aagaacaaat ggagaatgct aatagatttt agagaactaa acagggtgac tcaagatttc 2581 acagaaattc agctaggaat tccacacccg gcaggattag ccaaaaagaa aaggattact 2641 gtactagatg taggggatgc ctacttttcc ataccactac atgaagattt taggcaatat 2701 actgcattta ccctaccatc agtaaacaat gcagagccag aaaaaagata tgtatataag 2761 gtcttaccac aaggatggaa aggatcacca gcaatctttc aattcatgat gaggcaaatc 2821 ttagaacctt tcagaaaagc aaacccagac gtcattctca 
tccaatacat ggatgatatc 2881 ttaatagcta gtgacaggac gggtttagag catgacaaag tagtcctgca actaaaagaa 2941 cttctgaatg gcctagggtt ctctacccca gatgagaagt tccaaaagga ccctccgttt 3001 caatggatgg gctatgaatt gtggccaact aaatggaaac tgcagaaaat acaattacct 3061 cagaaagaaa tatggacagt caatgacatc caaaaactag taggagtttt gaactgggcg 3121 gcgcagatct atccagggat aaaaaccaag catttatgta aattgattag aggaaaaatg 3181 acactcacag aggaagtaca gtggacagag ttagcagagg cagaactaga agaaaacaaa 3241 attatcttaa gtcaggaaca agagggatcc tactatcagg aagaagaaga actagaagca 3301 acagtcatca aaagccaaga caatcagtgg gcatacaaaa tacaccaggg agagagggtt 3361 ctaaaagtag gaaagtatgc gaagataaaa aatactcata ccaatggggt cagactacta 3421 gcacaagtag tccaaaaaat aggaaaggaa gcactggtca tttggggacg agtgccaaaa 3481 tttcacctac cggtagagag agacacctgg gagcaatggt gggataacta ctggcaagta 3541 acatgggtcc cagagtggga cttcgtatct accccaccac tggtcaggtt gacatttaac 3601 ttggtaggag atcctatacc aggcacagag accttttaca cagatggatc atgcaataga 3661 cagtcaaaag aaggaaaagc aggatatgta acagatagag ggagagacag ggtaagagta 3721 ttagagcaaa catccaatca gcaagcagaa ctagaagcct ttgcgatggc actggcagac 3781 tcaggtccca aggttaatat catagtagac tcacagtatg taatggggat agtagcaggc 3841 caaccaacag agtcagaaaa tagaatagta aaccaaatca ttgaggacat gataaagaaa 3901 gaagcagtct atgttgcatg ggtcccagcc cataaaggca taggaggaaa ccaggaagta 3961 gaccatttag taagtcaggg catcagacaa gtattattcc tggaaaagat agagcccgct 4021 caagaagaac acgaaaaata tcatagcaat ataaaagaac taacccataa atttggaata 4081 ccccaactag tggcaagaca gatagtaaac acatgtgccc aatgccaaca gaaaggagaa 4141 gccatacatg ggcaagtaaa tgcagaaata ggcgtttggc aaatggactg cacacactta 4201 gaaggaaaaa tcattatagt agcagtgcat gttgcaagtg gattcataga agcagaagtc 4261 atcccacagg aatcaggaag gcagacagca ctcttcctat taaaactggc cagtaggtgg 4321 ccaataacac acttgcacac agacaatggc cccaacttca cttcacagga agtgaagatg 4381 gtggcatggt ggataggtat agagcaatcc tttggagtac cttacaatcc acaaagccag 4441 ggagtagtag aagcaatgaa tcaccaccta aaaaatcaga taagtagaat tagagaacag 4501 gcaaatacaa tagaaacaat agtactaatg gcagttcatt gcatgaattt 
taaaagaagg 4561 ggaggaatag gggatatgac cccagcagaa agactaatta acatgatcac cacagaacaa 4621 gaaatacaat tcctccaaag aaaaaattca aattttaaaa aattccaggt ctattacaga 4681 gaaggcagag atcagctgtg gaaaggacct ggagagctac tgtggaaggg agacggagca 4741 gtcatagtca aggtaggggc ggacataaaa gtagtaccaa gaaggaaggc caagattatc 4801 agggactatg gaggaaggca agaactggat agtagttccc acctggaggg tgccagggag 4861 gatggagagg tggcatagcc ttgtcaagca cctgaagtac agaacaaaag acttagagga 4921 ggtgcgctat gttccccatc acaaggtagg atgggcatgg tggacttgca gcagggtaat 4981 attcccacta gaaggagaaa gtcatctaga gatacaggca tattggaacc taacaccaga 5041 aaaaggatgg ctctcctctc attcagtaag gttaacctgg tatacagaaa agttctggac 5101 agatgttacc ccagactgtg cagactccct aatacacagc acttatttct cttgctttac 5161 ggcaggtgaa gtaagaagag ccatcagagg ggaaaagtta ttgtcctgct gcaactaccc 5221 ccaagctcat aaagcacagg taccatcact tcaataccta gccctagtgg tagtgcaaca 5281 aaatggcaga ccccagagaa agggtgccgc caggaaacag tggagaagag accattggag 5341 aggccttcga gtggctagac aggactatag aagccttaaa cagggaggca gtgaaccatc 5401 tgccccgaga gctcattttc caggtgtggc aaaggtcctg ggcatattgg catgatgaac 5461 aagggatgtc aacaagttac acaaagtata gatatttgtg cataatgcag aaagctgtgt 5521 atatacattt caagaagggg tgcacttgcc tggggagagg acatggcccg ggaggatgga 5581 gaccaggacc tccccctcct ccccctccag gtctagtcta atgactgaag caccaacaga 5641 gtttccccca gaagatggga ccccacggag agagctaggg agtacctggg taatagaaac 5701 tctgaaggaa atcaaggaag aagccttaaa acattttgat ccctgcttgc taattgctct 5761 tggcaactat atctataata gacatggaga cacccttgaa ggagccagag agctcattag 5821 agtcctacaa cgagccctct tcgtgcacat cagagcggga tgtgaccgct caagaaaggg 5881 ccaaacaagg agaagagctc cttgcccagc tgcaccgacc cctagaggca tgcactaact 5941 catgctattg taagcagtgc agttaccatt gccagctgtg tttcttgaaa aaagggctcg 6001 ggatatggta tgcgcgacag ggcagacgaa gaaggactcc aagaaaaact aagactcatc 6061 cgcctcctgc atcagataag taagtatgga gcctggtagg aatcagctgc ttgttgccat 6121 tttattaact agtgcttgct taatatattg caaacaatat gtgactgttt tctatggcat 6181 acccgcgtgg agaaatgcat ctattcccct attttgtgca accaaaaata gagatacttg 
6241 ggggaccatc cagtgcttgc cagacaatga tgattatcag gaaataacct taaatgtgac 6301 agaagctttt gatgcatggg ataatacagt aacagaacaa gcaatagaag atgtctggag 6361 actgtttgag acatcaataa aaccatgtgt caagttgacg cccctatgtg tggcgatgaa 6421 ttgtaatata acttcaggga ctaccgcgac cccgagtcca ccaaacatta caataataga 6481 tgaaaattct acctgtatag gcgacaacaa ctgcacagga ttagggaaag aagaggtggt 6541 tgagtgtgag ttcaatatga cggggctaga acaagataag aaaaggaagt ataatgacgc 6601 atggtactca agagatgtgg tttgtgacaa gacaaacgga acaggcacat gttacatgag 6661 acattgcaac acatcagtca tcaaagagtc atgtgacaag cactattggg atgctatgaa 6721 gtttagatac tgtgcaccac cgggttttgc cctactaaga tgcaatgata ccaactattc 6781 aggctttgaa cctaagtgct ctaaagtagt agctgcttca tgcacaagga tgatggaaac 6841 gcaaacttct acttggtttg gctttaatgg cactagagca gaatagaaca tatatctatt 6901 ggcatggtaa gataatagga ctatcattag cttaaacagt attataatct cacaatgcat 6961 tgtaagagac caggaaataa gacagttgta ccaataacac ttatgtcagg gcgaaggttt 7021 cactctcggc cagtctacaa caaaaaacct gggcaggcat ggtgttggtt tcaaggcaac 7081 tggatagaag ccatgcggga ggtgaagcaa acccttgcaa aacatcccag gtacggagga 7141 acaaatgata caggaaaaat taactttacg aagccaggaa taggttcaga cccagaagtg 7201 acatacatgt ggactaactg cagaggagaa tttctctact gtaatatgac ttggttcctc 7261 aattgggtag aaaataagac gaaccaaaca cacggcaact atgcgccatg ccatataagg 7321 cagataatta acacctggca taaggtaggg acaaatgtat atttgcctcc tagggaaggg 7381 gagttgacct gcaattcaac agtaaccagc ataattgcta acattgactc agatggaaat 7441 cagaccaaca ttacctttag tgcagaagtg gcagaactgt accgattaga attgggggac 7501 tacaaattga tagaagtaac accaattccg ttcgcaccta caaaagagaa aagatattcc 7561 tcggctccag tgaggaacaa aagaggtgtg ttcgtgctag ggttcttggg ttttctcgca 7621 gcagcaggtt ctgcaatggg cggcncgtcc ttgacgctgt cggctcagtc ccggacttta 7681 ctggccggga tagtgcagca acagcaacag ctgttggacg tggtcaagag acaacaagaa 7741 atgttgcgat tgaccgtctg gggaacgaaa aatctccagg caagagtcac tgctatcgag 7801 aaatacttaa aggaccaggc acagctaaat tcatggggat gtgcgtttag gcaggtctgc 7861 cacactactg taccatgggt aaatgactcc ttaacacctg actggaacaa tatgacatgg 7921 
caggaatggg aaaaacgagt ccactaccta gaggcaaata tcagtcaaag tttagaacag 7981 gcacaaattc aacaagaaaa gaatatgtat gaactacaaa aactaaatag ctgggatgtc 8041 tttggcaact ggtttgattt gacctcctgg atcaaatata ttcaatatgg agtttatata 8101 gtagtaggaa taataggttt aagaatagcc atatatatag tgcaattgtt aagtagactt 8161 agaaagggct ataggcctgt tttctcctcc ccccccggtt atctccaaca gatccatatc 8221 cacacggaca ggggacagcc agccaacgaa gaaacagaag aagacgccgg agacgacagt 8281 ggtttcggct tgtggccttg gccactaaac tacatacaat tcctgatcca cctactgact 8341 cgcctcttga ccgggctata caacagctgc aggggcttac tatccaagaa ctccccgacc 8401 cgccgactga tctcccagag tctaacagca atcagggact ggctgagact taaggcggcc 8461 tacctgcaat atgggtgcga gtggatccaa gaagcgttcc gagcattcgc aaggactgcg 8521 agagagacta ttgcgggcgc gtggaggggg ttatgtgaag cagcgcaacg catcgggagg 8581 ggaatcctcg cagtcccaag aaggatcagg cagggagcag aaatcgccct cctgtgaggg 8641 acagcagtat cagcagggag agttcatgaa caccccatgg agaaccccag cagcaatagg 8701 gcagaaaaat tcatataagc agcaaaatat ggatgatgta gattctgatg atgatgacct 8761 agtgggagtt cctgttatgc caagagtacc gctgagagaa atgacctata aactggcaat 8821 agatatgtca cattttataa aagaaaaagg aggactggaa gggatatttt acagtaggga 8881 gagacataga atcctagact tgttcctaga aaaggaggaa gggataatac cagattggca 8941 gaattatact catgggccag gaacaaggta cccaatgtac ttcgggtggc tgtggaaact 9001 agtaccagta gacatctcac aagaggcaga ggaagtagag accaactgct tagtacaccc 9061 agcacaaaca agcagatatg atgacgagca tggggagaca ctagtttggc ggtttgaccc 9121 catgctggcc tatagttaca aggccttcat tctgcaccca gaagaatttg ggcacaagtc 9181 aggattgcca gagaaagagt ggaaggcaaa actgaaagca agagggatac catatagtga 9241 ataacaggaa caaccatact tggtcagggc aggaaatagc tactaagaac agctgagact 9301 gcagggactt tccagaaggg gctgtaacca agggagggac atgggaggag ctggtgtcat 9361 attctctgta taaatgtacc cgcttcttgc attgtattc // LOCUS ANAHETA 2608 bp ds-DNA BCT 18-JUL-1990 DEFINITION Anabaena sp. heterocyst differentiation protein (hetA) gene, complete cds. ACCESSION M31722 KEYWORDS differentiation protein; heterocyst differentiation protein. SOURCE Anabaena sp. 
(strain PCC 7120) DNA. ORGANISM Anabaena sp. Prokaryota; Bacteria; Gracilicutes; Oxyphotobacteria; Cyanobacteria; Nostocales; Nostocaceae. REFERENCE 1 (bases 54 to 2608) AUTHORS Holland,D. and Wolk,C.P. TITLE Identification and characterization of hetA, a gene that acts early in the process of morphological differentiation of heterocysts JOURNAL J. Bacteriol. 172, 3131-3137 (1990) STANDARD full staff_review REFERENCE 2 (bases 1 to 2608) AUTHORS Holland,D. and Wolk,C.P. JOURNAL Unpublished (1990) MSU-DOE Plant Res. Lab., M.S.U., Lansing, MI STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by Wolk,C.P., 30-JAN-1990. FEATURES from to/span description pept 198 2021 heterocyst differentiation protein (hetA) ORF 2077 > 2608 unidentified reading frame (ORF2) rpt 1973 2000 direct repeat rpt 2075 2158 direct repeat BASE COUNT 793 a 510 c 513 g 792 t ORIGIN 1 taaaaatcct atataggact catatttgat ttctgaaaaa ccaaattggc tagaagcttg 61 ctatacagag ctttgtttct cagtatacgt agcaaaattc aagtacgatt cctatagatg 121 aattactaca ctttagttaa aaattaggtt gtcacggttc aaaaaattgt cttagaaaat 181 atacaacgca tttaaaaatg ccaaaatcac cacataaatt atttaaagct aatagcttct 241 ggaaagagaa caatttaata ttgcgagaaa taaaacattt tcgcaaaata gctatattag 301 ccgtaatatt ttcattttta gctgccagtt ttgaaggtgt tagtattggt tttttactgt 361 catttttaca aaaattgact agtcctaatg atccaattca aacaggaatt agctgggttg 421 acatgatttt ggctgctgat gcttggccga ttccccccat atacaggata tctctactaa 481 ttttgttgag tacctggatg cgtgctacct tcaattattt tggcggagta tacactgaat 541 cagcccaact taatttagca gatcgcttac ataagcaaat ttttgagcaa ttacaagctc 601 tgaggttgag ttactttgct caaactcgtt cgggtgaact cattaatacg attactacag 661 aaattgaaag aatcaaacag ggttttagtg ggttagcttt tgtgttaacc agaatcatga 721 ccgtttgcgt ttactttgtt gtgatgtttt ctatatcatg gcaactttcg attatttctg 781 tgctgatatt tttactgtta gcagtgggat tatctacgct gaataagcga gtcagagaaa 841 ctagctttgg catttctcat gctaatgctc aatttacggc tgtagctgta gagttcatca 901 atgggattcg gacaattcaa gcttttggga 
cacaagaatt tgaacggcaa cgtttctata 961 aagcgagtac taatcaactg aatgcagcta tcaaagttgt tttagcttgg acactagtaa 1021 aacctatagc tgaaggtata gctactacgg ttttaattag tttgattgtg atttccttcg 1081 cgacttttac attaccagtt gcctcattat tgacattctt ctttgtttta gtcagggtga 1141 ttcctaatat tcaagatatt aatggtacgg ttgcttttct gagtacttta caagggtcat 1201 cagaaaacat taaaaatatt ttgcagacga ataataaacc ttacttgaaa aatggcaaac 1261 ttcatttcca agggttgaag cgttcaattg atttagtatc cgtagatttt ggttatactg 1321 ctgataattt agtgttgaat aatatcactc tgacaattga gcgtggtaaa acaacagctc 1381 tggtgggtgc atcgggcgct ggtaaaacca cattagctga tttaattccc cgattttacg 1441 atccgacaga gggacagatt ttagtagatg ggcttgatgt acagtacttt gaaatcaatt 1501 ccctccgccg caaaatggct gtagttagtc aagatacatt tattttcaac acttctatta 1561 gagacaatat cgcctacggt acatctgggg cgagtgaagc ggaaattaga gaagtagcgc 1621 ggctagcaaa tgcgttgcaa tttatcgaag aaatgcccga agggtttgat actaagttag 1681 gcgatcgcgg tgtccgttta tctggaggac agagacaacg gattgcgatc gctcgtgcat 1741 tactccgaga tcccgaaatc ctcattcttg acgaagccac cagcgcccta gattcagtct 1801 ccgagcgatt aattcaggag tctatagaaa aactttccgt gggtagaaca gtaattgcga 1861 tcgctcacag actctccaca attgccaaag cagataaggt tgtggtgatg gaacaagggc 1921 gaattgttga gcagggaaat tatcaagaac ttctagaaca acgcggaaag ctctggaaat 1981 atcaccagat gcaacacgaa tcaggacaga ctaattcgta atatcaattc aaaattcaaa 2041 attcaaaatt caaaattagg gaagccgagc agaatcatgg ttttggggta tgtatctgtc 2101 ccattctttt ttcaaatcgg tataactccc caatccccaa tccccaatct ccagtcccca 2161 atccccaatc cccaatcccc aatccccaat ccccagtccc caatccccaa tcccatgaaa 2221 atttccgtca tcatctcgaa ttacaactat gctcgttatc tttctagagc aatcaactct 2281 gttctcgctc aaactcactc agacattgaa atcgttatcg tagatgatgg ttctacagat 2341 aacagccgtg atgttattac ccaactgcaa gaacaagcac cggataaaat caagcccatc 2401 tttcaagcaa atcaaggaca gggaggcgct ttcaatgcgg ggtttgcggc ggcgactggc 2461 gaagtcgtag cttttcttga tgcagacgat gtgtggaaac ctcataaatt acagcgtatt 2521 gttgaggtat ttcagacatc ggatgtagtt ggtgtcatgc accatttgga tatcatagat 2581 ggcaatgaca aaacaattga tcaagctt // LOCUS 
BOVTRLA 85 bp ss-tRNA RNA 18-JUL-1990 DEFINITION Bovine amber suppressor Leu-tRNA-cag. ACCESSION M35066 KEYWORDS transfer RNA-Leu. SOURCE Calf liver tRNA. ORGANISM Bos taurus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Artiodactyla; Ruminantia; Pecora; Bovidae. REFERENCE 1 (bases 1 to 85) AUTHORS Valle,R.P.C., Morch,M.-D. and Haenni,A.-L. TITLE Novel amber suppressor tRNAs of mammalian origin JOURNAL EMBO J. 6, 3049-3055 (1987) STANDARD simple staff_entry FEATURES from to/span description tRNA 1 85 Leu-tRNA anticdn 35 37 Leu-tRNA anticodon cag modified 6 6 m2g modified 10 10 m2g modified 12 12 ac4c modified 17 17 gm modified 19 19 d modified 21 21 p modified 27 27 m22g modified 33 33 um modified 38 38 m1g modified 39 39 p modified 45 45 um modified 47 47 p modified 50 50 m3c modified 57 57 m5c modified 64 64 p modified 67 67 m1a BASE COUNT 14 a 25 c 28 g 18 t ORIGIN 5' end of mature tRNA. 1 gtcaggatgg ccgagcggtc taaggcgctg cgttcaggtc gcagtctccc tggaggcgtg 61 ggttcgaatc ccacttctga cacca // LOCUS BOVTRLB 86 bp ss-tRNA RNA 18-JUL-1990 DEFINITION Bovine amber suppressor Leu-tRNA-caa. ACCESSION M36600 KEYWORDS transfer RNA-Leu. SOURCE Calf liver tRNA. ORGANISM Bos taurus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Artiodactyla; Ruminantia; Pecora; Bovidae. REFERENCE 1 (bases 1 to 86) AUTHORS Valle,R.P.C., Morch,M.-D. and Haenni,A.-L. TITLE Novel amber suppressor tRNAs of mammalian origin JOURNAL EMBO J. 6, 3049-3055 (1987) STANDARD simple staff_entry FEATURES from to/span description tRNA 1 86 Leu-tRNA anticdn 35 37 Leu-tRNA anticodon caa modified 6 6 m2g modified 10 10 m2g modified 12 12 ac4c modified 19 19 d modified 21 21 p modified 27 27 m22g modified 33 33 um modified 38 38 m1g modified 39 39 p modified 45 45 um modified 47 47 p modified 50 50 m3c modified 58 58 m5c modified 65 65 p modified 68 68 m1a BASE COUNT 16 a 24 c 28 g 18 t ORIGIN 5' end of mature tRNA. 
1 gtcaggatgg ccgagcggtc taaggcgctg cgttcaagtc gcagtctcgc atggaggcgt 61 gggttcgaat cccacttctg acacca // LOCUS BRPMBOI 62 bp ds-DNA INV 18-JUL-1990 DEFINITION B.malayi MboI repeat consensus sequence DNA. ACCESSION M34369 KEYWORDS . SOURCE B.malayi (strain sub-periodic) DNA. ORGANISM Brugia malayi Eukaryota; Animalia; Metazoa; Nemata; Secernentea; Spiruria; Spirurida; Spirurina; Filarioidea; Gnathostomatidae. REFERENCE 1 (bases 1 to 62) AUTHORS Natarajan,S., Werner,C., Cameron,M. and Rajan,T.V. TITLE Isolation and characterization of a repetitive DNA element from the genome of the human filarial parasite, Brugia malayi JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable [or printed] sequence for [1] kindly submitted by S.Natarajan, 18-MAY-1990. Author address: S.Natarajan University of Connecticut Dept. of Pathology Farmington, CT 06032 BASE COUNT 24 a 12 c 4 g 22 t ORIGIN 1 ccatttctct acagatataa caatatcact agaagacatt ttgattaatt cattaactca 61 ta // LOCUS DUKARGIN 1568 bp ss-mRNA VRT 18-JUL-1990 DEFINITION Duck argininosuccinate lyase/delta 2-crystallin mRNA, complete cds. ACCESSION M35132 KEYWORDS argininosuccinate lyase/delta 2-crystallin. SOURCE Duck 14-day old embryo lens, cDNA to mRNA, clone DC5-1. ORGANISM Anas platyrhynchos Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Aves; Neornithes; Neognathae; Anseriformes; Anatidae. REFERENCE 1 (bases 1 to 1568) AUTHORS Wistow,G. and Piatigorsky,J. TITLE Gene conversion and splice-site slippage in the argininosuccinate lyase/delta-crystallins of the duck lens. Members of an enzyme superfamily JOURNAL Gene (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by G.J.Wistow, 26-MAR-1990.
FEATURES from to/span description pept 74 1480 argininosuccinate lyase/delta 2-crystallin signal 1538 1543 poly-A signal site 86 91 6 inserted bases at splice site slip BASE COUNT 389 a 398 c 439 g 342 t ORIGIN 1 ccggggctgc actgcggaca cggttgcacc aggtgctggg atcgccgcac acgcgagcaa 61 aacgtcctct gagatggcat ctgaggcgag gggagataaa ctttggggag gaagattcag 121 tggaagcaca gatcccatca tggagaagct caactcttcc attgcctatg accagaggct 181 gtctgaagtt gatatccagg ggagcatggc ttatgccaaa gctctggaga aggctgggat 241 cctaactaag actgagctgg agaagatcct gagtggcctg gaaaagatct ctgaggaatg 301 gtctaaggga gtctttgtgg tgaaacagag cgatgaggat atccacactg ccaacgaacg 361 cagactgaag gagctgattg gtgatatcgc tggaaagctg cacaccggaa gaagcaggaa 421 tgatcaggtt gtgactgact tgaagctgtt catgaagaat tccctctcta tcatctccac 481 tcacctgctg cagctcatca agaccctggt ggaacgtgct gccatagaaa ttgatgttat 541 cttgcctggc tacacccacc tgcagaaggc ccagcccatc agatggagcc agttcttgct 601 cagccatgct gttgccctga cccgtgactc tgagcgcctg ggagaggtga agaagaggat 661 caacgtcctg cccctgggaa gcggtgctct ggctggcaac cccctggaca ttgacagaga 721 gatgctgcgt agcgagctgg agtttgcttc catcagcctg aacagcatgg atgccattag 781 tgagagagac tttgtggtgg aattcctctt ccttgccacc ctgctgatga tccaccttag 841 caagatggct gaagatctca taatctacag caccagcgag tttggcttcc tgaccctttc 901 tgatgccttc agcactggca gcagcctgat gcctcagaag aagaaccctg atagcctgga 961 actgatccgc agcaaggctg gacgtgtgtt tggacggctg gcttcaattc tcatggttct 1021 caaaggactt ccaagcacct acaacaagga cctgcaggag gacaaggagg ctgtctttga 1081 tgttgtggac accctgactg ctgtgctcca ggttgccact ggggtgatct ctaccctcca 1141 gatcagcaag gagaacatgg agaaggctct gacccctgag atgctggcta ctgacctggc 1201 tctctacttg gttcgcaaag gagtgccgtt cagacaagcc cacactgctt cggggaaggc 1261 cgtccacctt gccgagacca aaggcatcac catcaataaa ctcagcctgg aggacctgaa 1321 gagcatcagc ccccagttca gcagcgatgt ctcccaggtc ttcaactttg tcaacagcgt 1381 ggagcaatac actgccctgg cgggtaccgc caagagcagc gtgactaccc agatcgagca 1441 gctgagggag ctgatgaaga agcagaagga acaagcttag agtgtgggga gatttcccgt 1501 ggatgtagtg ttttgcttat 
cacactaatg cagagttaat aaacactgtg gtgtattgta 1561 gttcactc // LOCUS DUKARGINX 1562 bp ss-mRNA VRT 18-JUL-1990 DEFINITION Duck argininosuccinate lyase/delta 1-crystallin mRNA, complete cds. ACCESSION M35133 KEYWORDS argininosuccinate lyase/delta 1-crystallin. SOURCE Duck 14-day old embryo lens, cDNA to mRNA, clone DC6-2. ORGANISM Anas platyrhynchos Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Aves; Neornithes; Neognathae; Anseriformes; Anatidae. REFERENCE 1 (bases 1 to 1562) AUTHORS Wistow,G. and Piatigorsky,J. TITLE Gene conversion and splice-site slippage in the argininosuccinate lyase/delta-crystallins of the duck lens. Members of an enzyme superfamily JOURNAL Gene (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by G.J.Wistow, 26-MAR-1990. FEATURES from to/span description pept 71 1471 argininosuccinate lyase/delta 1-crystallin signal 1529 1534 poly-A signal BASE COUNT 393 a 400 c 427 g 342 t ORIGIN 1 gggctccact gcagacacgg ttgtaccagg tgccgggatc gccgcacacc cgagcaaaac 61 gtcgtctgag atggcatccg agggggataa acttatggga ggaaggtttg tcggaagcac 121 ggatcccatc atgcagatgc tcagcacttc catatccact gaacagaggc tgtctgaggt 181 tgatatccag gcaagcattg cttatgccaa agccctggag aaggctggga tcctaactaa 241 gactgaactg gagaagatcc tgagtggcct ggaaaagatc tctgaggaat tgtctaaggg 301 agtcattgta gtgacccaaa gcgatgagga tatccaaact gccaatgaac gcagactgaa 361 ggagctgatt ggtgatattg ctggaaagct gcacactgga agaagcagga acgaacaggt 421 tgtgactgac ttgaagctgt tcatgaagaa ttccctctct atcatctcca ctcacctgct 481 gcagctcatc aagaccctgg tggaacgtgc tgccatagaa attgatgtta tcttgcctgg 541 ctacacccac ctgcagaagg cccagcccat cagatggagc cagttcttgc tcagccatgc 601 tgttgccctg acccgtgact ctgagcgcct gggagaggtg aagaagagga tcaacgtcct 661 gcccctggga agcggtgctc tggctggcaa ccccctggac attgacagag agatgctgcg 721 tagcgagctg gagtttgctt ccatcagcct gaacagcatg gatgccatta gtgagagaga 781 ctttgtggtg gaattcctct ctgttgccac cctgctgctg atccacctta gcaagatggc 841 tgaagatctc atcatctaca 
gcaccagcga gtttggcttc ctgacccttt ctgatgcctt 901 cagcactggc agcagcctga tgcctcagaa gaagaaccct gatagcctgg aactgatccg 961 cagcaaggct ggacgtgtgt ttggacggct ggcttcaatt ctcatggttc tcaaaggact 1021 tccaagcacc tacaacaagg acctgcagga agacaaggag gctgtcattg atgttgtgga 1081 caccctgact gctgtgctcc aggttgccac tggggtgatc tctaccctcc agatcagcaa 1141 ggagaacatg gagaaggctc tgacccctga gatgctggct actgacctgg ctctctactt 1201 ggttcgcaaa ggaatgccat tcagacaagc ccacactgct tcggggaagg ccgtccacct 1261 tgccgagacc aaaggcattg ccatcaataa tctcaccctg gaggacctga agagcatcag 1321 ccccctgttc agcagcgatg tgtcccaggt cttcaacttt gtcaacagcg tggagcagta 1381 cactgccctg ggcggtaccg ccaagagcag cgtgactacc cagattgagc agctgaggga 1441 gctgatgaag aagcagaagg aacaagctta gagtgtgggg agatttcccg tggatgcagc 1501 attgtgctta tcacactaat gcagagttaa taaacactgt ggtgtattgt agttcactca 1561 aa // LOCUS VACFGENE 6185 bp ds-DNA VRL 18-JUL-1990 DEFINITION Vaccinia virus F1L, F2L, F3L, F4L, F5L, F6L, F7L, F8L, and F9L mRNAs, complete cds. ACCESSION M34368 KEYWORDS . SOURCE Vaccinia virus (strain WR), cDNA to viral mRNA, clone HindIII F. ORGANISM Vaccinia virus Viridae; ds-DNA enveloped viruses; Poxvirinae; Orthopoxvirus. REFERENCE 1 (bases 1 to 6185) AUTHORS Roseman,N.A. and Slabaugh,M.B. TITLE The vaccinia virus HindIII F fragment: Nucleotide sequence of the left 6.2-kb JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by N.A.Roseman, 17-MAY-1990. 
Author address: N.A.Roseman Oregon State University Biochemistry and Biophysics Corvallis, Oregon 97331 FEATURES from to/span description pept 119 757 F9L protein pept 817 1014 F8L protein pept 1162 1404 F7L protein pept 1420 1644 F6L protein pept 1674 2642 F5L protein pept 2674 3633 F4L protein pept 3644 5086 F3L protein pept 5110 5553 F2L protein pept 5565 > 6185 F1L protein BASE COUNT 2175 a 946 c 1138 g 1926 t ORIGIN 1 gaattcatca tgtgtaccaa aacagactgt gataaatata gattaaaggt ttccattctt 61 cacccaatta gtttcttgga aaaatttatt atgagagaca ttttctcaga ctggataaat 121 ggcggaaact aaagagttta aaactttgta taatcttttt atagatagtt atttacaaaa 181 attagctcaa cattctatcc ctactaatgt cacttgtgct attcatatag gagaggttat 241 aggacagttt aaaaattgcg cgctccgaat aactaacaaa tgcatgagta attctcgact 301 tagtttcaca ctcatggttg aatcatttat tgaagtgatt tcattgcttc cggaaaagga 361 tagaagacgt atcgctgaag aaataggaat agatctagac gatgtaccta gtgcggtatc 421 caagctagaa aagaactgta atgcgtatgc ggaggttaat aatattatag atatacagaa 481 attagatatc ggagaatgtt cggctccgcc cggtcaacat atgcttttac agatagttaa 541 tacaggatcc gcggaacgaa attgtggttt acagacaatt gttaagtcct taaataaaat 601 atacgttcca cctattatcg aaaaccgatt gccgtattac gatccgtggt ttctagtggg 661 tgtagcaatt attctagtta tttttactgt agctatttgt tctattagac gaaatctggc 721 tcttaaatac agatacggaa cgtttttata cgtttaatta ataaaaaaat ttaattacaa 781 ggtatataat agtactccat ctacgcaatc gcgataatgg agggatctaa acgcaaacac 841 gacagtcggc gactacaaca agaacaggag cagcttcgtc cacgtacacc gccatcatat 901 gaagaaattg caaaatatgg acactcattt aacgtgaaaa gatttacgaa tgaagaaatg 961 tgtcttaaga atgattatcc acgaattata tcatataatc ctccaccaaa atagagtata 1021 tatatatcat catttcatga tgtatactac tgacatagtt tcaatgtgaa cttttcactt 1081 tcttgccggt tatgaagaat attttttatt ttaatggtca ttactaatcg tatattataa 1141 ttgaaaatgg attagtttaa tatgacgctc gtcatgggat cctgctgtgg tagattctgt 1201 gacgctaaga ataagaataa gaaggaagat gtagaagagg gaagagaagg atgttacaat 1261 tataagaacc ttaatgatct ggatgaatcc gaagcacgtg tagaatttgg accattatat 1321 atgataaatg aagaaaaatc 
agacataaat acattggata taaaaagaag atatagacac 1381 acgatagagt ctgtatattt ctaaaagttt ttataaaaaa tgagtaaaat actcacgttt 1441 gttaaaaata agataattga cttgattaat aatgaccaaa ttaaatattc tagagttata 1501 atgatagaag agtccgatag tcttttaccg gttgatgagg tgcatgctaa ccacggattt 1561 gactgtgtgg agatgataga tgaaaatata agcaatgaga atatcgaaca gtataaaacc 1621 gaatcttttt ttacaataaa ttgaaatcaa aacatttatt aaaccgcatc aagatgggta 1681 cgaacggcgt tagagtattt gtcattttat atttgttggc tgtatgcgga tgtatcgaat 1741 acgacgtaga cgataatgta catatttgta cccacactaa cgtgtcacat attaatcaca 1801 ctagttggta ttataatgat aaggttatag cgctagccac cgaggataaa acttctggtt 1861 atatatcatc attcataaaa cgcgttaata tctcattaac ttgtttaaat atatcgagtt 1921 tgcggtacga agattctggt acatacaaag gagtatcgca tctaaaagat ggagtcatcg 1981 ttacaactac tatgaatata tctgtaaagg ctaatatcat tgacttgact ggtagagtgc 2041 gttatctaac cagaaattat tgcgaagtta aaatacgatg cgaaataaca tctttcgcgc 2101 ttaatggttc tactacacca ccacatatga tattaggaac agtagataaa tggaaatatc 2161 ttccatttcc tacagatgat tatagatacg taggggaact gaaaagatat atatctggaa 2221 acccatatcc aacagagtcg ctagcgttag aaattagctc gacgtttaat cggtttacta 2281 tcgttaaaaa tttgaacgat gacgagtttt cttgttatct gttttcacaa aattatagtt 2341 tccataaaat gttgaatgtg cgtaacattt gtgaatccga atgggaggcg ttaaataata 2401 ataacgataa ttcatcctcc atgcccgctt cccacaacaa tctcgcaaac gatttatcta 2461 gtatgatgtc acaattacaa aatgataatg atgataataa cgattattca gcacccatga 2521 atgtcgataa tttaattatg atagtactaa taacaatgct atcaataata cttgtaatta 2581 ttgtagtgat tgcggcgata tcgatgtaca aaagatccaa gtacaggcat atagataact 2641 gaaaaaaaat ttattgttat tgttaattta gttatggaac ccatccttgc accaaatcca 2701 aatagatttg ttattttccc aatccaatat tatgacatct ggaacatgta taaaaaggca 2761 gaggcatcat tttggacagt ggaagaagta gatatatcta aagatatcaa tgattggaat 2821 aaactaacac cagacgaaaa atattttata aaacatgtat tggcgttttt tgcagccagt 2881 gacggaatag tgaatgaaaa tttggcggaa cgattttgta cagaagtaca gattaccgag 2941 gctagatgtt tctacggatt tcagatggcc attgaaaaca ttcattcgga aatgtatagt 3001 cttttgatcg atacttatgt taaagatagt 
aatgaaaaaa actatctctt taatgccata 3061 gaaacgatgc cttgtgtaaa aaagaaggcc gattgggctc aaaagtggat acatgacagc 3121 gccggttatg gagagagact tattgccttt gctgcagtag aaggaatctt cttttctgga 3181 tcattcgctt ccatattttg gcttaaaaag cgtggcctaa tgcccggact cacgttttcc 3241 aacgaactaa ttagtagaga cgagggtctg cactgcgatt tcgcatgttt gatgtttaaa 3301 catttattgc atccaccgag tgaagaaacc gttagatcta ttataacaga tgcggtatcc 3361 attgaacaag aatttcttac tgcggctctt ccagttaaac ttataggaat gaattgtgaa 3421 atgatgaaaa catatataga attcgtcgcg gatagattga tttctgaatt gggatttaaa 3481 aaaatttata atgttaccaa tccgtttgat ttcatggaaa atatatcatt ggaaggaaaa 3541 actaattttt tcgaaaaacg tgtgggtgaa taccaaaaaa tgggagttat gtctcaagaa 3601 gataatcatt tttctttaga tgttgacttt taaagaaaca taaatgccga tatttgttaa 3661 tactgtgtac tgtaagaata tattagcatt gtctatgact aagaaattca aaacaattat 3721 tgatgctata ggtggcaata taatagtcaa ttctacgata ttgaaaaagt tatctcctta 3781 ctttcgcaca catttacgtc aaaaatacac gaaaaataaa gatccagtta cttgggtttg 3841 tctagacctt gacattcaca gtttaacttc tatagttatt tactcgtata ctggaaaggt 3901 atatatagat agtcataacg tcgtcaattt attacgtgct tctatattaa cctctgtaga 3961 atttatcatc tacacttgta taaactttat cttacgagat tttagaaagg aatattgtgt 4021 cgagtgttac atgatgggta tagaatacgg actatccaat ctcttatgtc atactaaaaa 4081 ctttattgcc aaacactttt tggaactgga agatgacatc atagacaatt ttgattatct 4141 atctatgaaa cttattctag aaagcgatga actaaatgtt ccagatgagg attatgtagt 4201 tgattttgtc attaagtggt atataaagcg aagaaataaa ttaggaaatc tgctactcct 4261 tatcaaaaat gtaatcaggt caaattatct ttctcccaga ggtataaata atgtaaaatg 4321 gatactagac tgtaccaaaa tatttcattg tgataaacaa ccacgcaaat catacaagta 4381 tccattcata gagtatccta tgaacatgga tcaaattata gatatattcc atatgtgtac 4441 aagtactcat gttggagaag tagtatatct catcggtgga tggatgaaca atgaaataca 4501 taacaatgct atagcggtaa attatatatc aaacaattgg attccaattc ctccgatgaa 4561 tagccccaga ctgtatgcta gcgggatacc cgctaacaat aaattatacg tagtaggagg 4621 tctaccaaat cccacatctg ttgagcgttg gttccacggg gatgctgctt gggttaatat 4681 gccgagtctt ctgaaaccta gatgtaatcc agcagtggca 
tccataaaca atgttatata 4741 cgtaatggga ggacattctg aaactgatac aactacagaa tatttgctac ccaatcatga 4801 tcagtggcag tttggaccat ccacttatta tcctcattat aaatcatgcg cgttagtgtt 4861 cggtagaagg ttattcttgg ttggtagaaa tgcggaattt tattgtgaat ccagcaatac 4921 atggactctg atagatgatc ctatttatcc gagggataat ccagaattga tcatagtgga 4981 taataaactg ctattgatag gaggatttta tcgtgaatcg tatatagata ctatagaagt 5041 gtacaatcat cacacttatt catggaatat atgggatggt aaataatttt gaaataaaat 5101 attagtttta tgttcaacat gaatattaac tcaccagtta gatttgttaa ggaaactaac 5161 agagctaaat ctcctactag gcaatcacct tacgccgccg gatatgattt atatagcgct 5221 tacgattata ctatccctcc aggagaacga cagttaatta agacagatat tagtatgtcc 5281 atgcctaagt tctgctatgg tagaatagct cctaggtctg gtctgtccct aaaaggcatt 5341 gatataggag gcggtgtaat agacgaagat tataggggaa acataggagt cattcttatt 5401 aataatggaa aatgtacgtt taatgtaaat actggagata gaatagctca gctaatctat 5461 caacgtatat attatccaga actggaagaa gtacaatctc tagatagtac aaatagagga 5521 gatcaagggt ttggatcaac aggacttaga taataaacaa tagtatgttg tcgatgttta 5581 tgtgtaataa tatcgtagat tatgtagatg atatagataa tggtatagta caggatatag 5641 aagatgaggc tagcaataat gttgatcacg actatgtata tccacttcca gaaaatatgg 5701 tatatagatt tgacaagtcc actaacatac tcgattatct atcaacggaa cgggaccatg 5761 taatgatggc tgttcgatac tatatgagta aacaacgttt agacgacttg tatagacagt 5821 tgcccacaaa gactagatca tatatagata ttatcaacat atattgtgat aaagttagta 5881 atgattataa tagggacatg aatatcatgt atgatatggc atctacaaaa tcatttacag 5941 tttatgacat aaataacgaa gttaatacta tactaatgga taacaagggg ttgggtgtaa 6001 gattggcgac aatttcattt ataaccgaat tgggtagacg atgtatgaac ccagtagaaa 6061 cgataaaaat gtttactcta ctatcgcata ctatatgcga tgattatttt gtagattata 6121 taacggacat ttcaccacca gataatacca tccctaacac tagcacgcgt gaatatctaa 6181 agctt // LOCUS BSUCCCA 1172 bp ds-DNA BCT 18-JUL-1990 DEFINITION B.subtilis 13 kD cytochrome c-550 (cccA) gene, complete cds. ACCESSION J05569 KEYWORDS cytochrome c-550. SOURCE B.subtilis (strain w168) DNA. 
ORGANISM Bacillus subtilis Prokaryota; Bacteria; Firmicutes; Endospore-forming rods and cocci; Bacillaceae. REFERENCE 1 (bases 1 to 1172) AUTHORS von Wachenfeldt,C. and Hederstedt,L. TITLE Bacillus subtilis 13 kDa cytochrome c-550 encoded by cccA, consists of a membrane-anchor and a heme-domain JOURNAL J. Biol. Chem. (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by C.O.S.von Wachenfeldt, 12-JUN-1990. FEATURES from to/span description pept 542 904 cytochrome c-550 (cccA) BASE COUNT 400 a 170 c 264 g 338 t ORIGIN 223 min on B.subtilis map. 1 cagctggaag atgtgcttga tacgttaact gatcgtgaag aaaatgtatt gcgtcttcga 61 ttcggtcttg atgacggccg tacaagaaca ttagaagagg tcggcaaagt atttggagta 121 acgagagagc gtattcgaca aatcgaagcc aaagcgttgc ggaaactaag acatcctagc 181 agaagtaaac gtttgaaaga tttccttgaa taagatggaa cgggtcttga agatccgttc 241 ttcttttttt aaaaagatat atggataata tgcctttatt ttactgaaaa atgatgtcat 301 ttgcaaatga acattgtggt gaaaaatttc aaaatctaat tccatatttt ctattgtaag 361 cgtatacaat acattataca atagaataaa aaggatatta gagatttagg catgtttcta 421 ttcaattttg tataaagtgt gaataaaaaa cttttgtata gcaatccatt tactttttgt 481 aaaaataagt tagaattaga agtgtttaca tagggggaag gatcaaaaag ggggaaggga 541 aatgaaatgg aacccgctta ttccattttt gctgatcgct gttttaggaa ttggtctaac 601 tttcttttta tcagtaaaag gacttgatga ctctcgggag attgcgagcg gaggagaaag 661 caaatctgct gaaaagaaag atgcaaacgc ttcaccagaa gaaatttaca aggcaaactg 721 catcgcatgc cacggtgaaa attatgaagg tgtttcgggt cctagcttaa aaggtgtagg 781 ggataagaaa gatgtcgctg aaattaaaac aaagatcgaa aaaggcggta atggaatgcc 841 ttccggtctt gttccggccg ataagctgga tgatatggca gagtgggtgt caaaaattaa 901 ataaaagaac tatttttctc tttatacatt cacaggccgc cgagatgttc tcgacagcct 961 tttatttttg actgctgcgt gcagacccgc atcaaaagaa ataatttcaa aatcatttgc 1021 ttcagcttgg attccgggct gattttttat atgatagtaa aggaaaaaat gattgagaaa 1081 agttgtgatg acaagtgaac gaattaaaat tatctaaacg attgcaaaca gtagcggagt 1141 acataccgaa tggagcggtg atggccgata tc // LOCUS 
ECOGACAR 1630 bp ds-DNA BCT 18-JUL-1990 DEFINITION E.coli glutamate and aspartate carrier (gltP) gene, complete cds. ACCESSION M32488 KEYWORDS glutamate and aspartate carrier. SOURCE E.coli (strain K12-BK9MDG) DNA. ORGANISM Escherichia coli Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Facultatively anaerobic rods; Enterobacteriaceae. REFERENCE 1 (bases 1 to 1630) AUTHORS Wallace,B., Yang,Y.-J., Hong,J. and Lum,D. TITLE Cloning and sequencing of a gene encoding a glutamate and aspartate carrier of E.coli K-12 JOURNAL J. Bacteriol. 172, 3214-3220 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by Y. Hong, 02-MAR-1990. FEATURES from to/span description pept 151 1338 glutamate and aspartate carrier (gltP) BASE COUNT 330 a 391 c 472 g 437 t ORIGIN 1 caagggttgc gcaacatacc gcgcaaatga tactgatcat aagcgttaaa aaaatctaca 61 aaccaacgca acacaattca tgccctggca gtatgtcacg ttctcgcgtt tctgaacggg 121 gaacggcgct ccattgagga agtcattcat atgaaaaata taaatttcag cctggcctgg 181 cagattctgt ttgctatggt gctgggcatt ctcctgggaa gctacctgca ctaccatagc 241 gacagccgcg actggctggt cgtcaatttg ctctctccgg cgggtgatat cttcatccat 301 ctgattaaaa tgattgttgt gccgattgtg atctccacgc tggtggtggg tatcgcgggt 361 gttggtgatg ccaaacagct cgggcgtatt ggcgcgaaaa ccattatcta cttcgaggtg 421 atcaccaccg tcgccatcat tttgggatca ctctggcgaa ccgtcttcca gcccggtgcc 481 ggggtggata tgtcgcagtt ggcgaccgtc gatatctcga aatatcagag cactacggaa 541 gcggtacaaa gcagttccca cggcattatg ggcacgattt tgtcgctggt gccgacgaac 601 attgtggcgt cgatggcgaa aggcgaaatg ctgccgatca tctttttctc ggtgctgttt 661 ggtctggggc tttcttccct gcccgcgacg catcgtgaac cgctggtgac cgtgttccgc 721 tccatctctg aaaccatgtt taaagtgact cacatggtga tgcgttatgc accggtgggt 781 gtgtttgcgc tgattgcggt gacggtggct aactttggtt tctcgtctct gtggccactg 841 gcgaaactgg tgctgctggt gcatttcgcc attctgttct tcgcgctggt agtgctggga 901 attgtggcgc gcctgtgcgg gttaagcgtc tggatcctga ttcgtattct gaaagatgag 961 ctgattctgg cgtactccac tgccagctct gaaagcgtcg tgccgcgaca 
tatgagaaga 1021 tggaagccta cggagcaccg gtgtcgatca ccagtttcgt ggtgccgacc ggttaacctc 1081 ttttaccttg atggttcgac gctgtatcaa agtattgccg ctatcttcat cgcgcagttg 1141 tatggcattg acctgtccat ctggcaggaa atcattctgg tcgtgacgct gatggtgacc 1201 tcgaaaggga ttgctggcgt gcctggcgtg tcgtttgtgg tgttgctggc aacgctgggt 1261 agcgtaggta tccgctggaa ggtctggcgt ttattgctgg tgttgaccgt atcctcgaca 1321 tggcgcgtac tgccgtgaac gtggtgggta atgcgctggc ggtgctggtg attgccaagt 1381 gggaacacaa atttgaccgt aagaaagcgc tggcttatga gcgtgaagtg ctgggcaaat 1441 ttgataaaac tgcggatcaa taattgaaga ttgccgggga tatccacccg gcaatgtgtg 1501 aatgcctgat gcgacgcttg ccgcgtctta tcaggcctac gccagacagc gcaatagcct 1561 gatttagcgt gattttgtag gtcggataag gcgtttatgc cgcatccgac atcaacgcct 1621 gatgcgacgc // LOCUS ECOLYSU 3034 bp ds-DNA BCT 18-JUL-1990 DEFINITION E.coli lysyl-tRNA synthetase (lysU) gene, complete cds. ACCESSION M30630 KEYWORDS lysyl-tRNA synthetase. SOURCE E.coli (strain K12) isolate CS520 DNA, clone pLC4-5. ORGANISM Escherichia coli Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Facultatively anaerobic rods; Enterobacteriaceae. REFERENCE 1 (bases 1 to 3034) AUTHORS Clark,R.L. and Neidhardt,F.C. TITLE Roles of the two lysyl-tRNA synthetases of Escherichia coli: Analysis of nucleotide sequences and mutant behavior JOURNAL J. Bacteriol. 172, 3237-3243 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by R.L.Clark, 12-DEC-1989. FEATURES from to/span description pept 316 1821 lysyl-tRNA synthetase (lysU) (E.C. 6.1.1.6) signal 194 199 -35 signal signal 216 221 -10 signal BASE COUNT 850 a 654 c 665 g 865 t ORIGIN 93.55 min on K12 map. 
1 cgatcatcgt cgtattggcc tttgctaccc gttttctgtt tagcacgccg acgaatatga 61 tacaggagag caacgattaa tgtcgtaaaa acaattggtt atgaataaat taactttcag 121 tttataacac taacaaatcg agattaataa gtcatttaaa ccattttgat ggttatttat 181 tagtgatatc aacttgaggt aagcgttagt ttcgataaga taaactgagt tactaatagt 241 cgaggcagat aatacagtgt accgatctgt ctctttatct acgctaaatt gaaagctgga 301 tttagaggaa ccaaaatgtc tgaacaagaa acacggggag ccaatgaggc tattgatttt 361 aacgatgaac tgagaaatcg ccgcgaaaaa ctggcggcac tacgtcagca aggtgtggcg 421 tttcccaatg attttcgccg cgaccatacc tctgaccagt tgcacgaaga gtttgatgcg 481 aaggataacc aggaactgga atccttaaac attgaagtct cggttgctgg ccgaatgatg 541 acccgtcgta tcatggggaa agcctccttt gtaacgttgc aggatgtcgg tggccgtatt 601 caactgtacg ttgcaagaga tagcctgcca gaaggtgttt ataacgatca gtttaaaaaa 661 tgggatctgg gtgacattat cggtgccggt acgctgttta agacgcaaac gggtgagctt 721 tccattcact gtactgagct gcgcctgctg actaaagcac tacgtccttt accagataaa 781 ttccatggtc tgcaggatca ggaagtccgt tatcgtcaac gttatctgga cctcatcgct 841 aacgataaat cccgtcaaac gtttgttgtc cgttcaaaaa ttctggccgc tatccgtcaa 901 ttcatggtcg cgcgcggctt tatggaagta gaaaccccga tgatgcaggt aattccaggt 961 ggggcatctg ctcgcccgtt tattacccat cataatgctc tggatttaga tatgtatgct 1021 cgtatcgcgc cggagctgta tctgaaacgt ctggttgtag gcggttttga acgggtattc 1081 gaacacgtaa ctcgtaatga aggtattcgt cgccataatc ctgagttcac aatgatggaa 1141 ctctacatgg cgtatgcgga ttaccacgat ttgattgaac tgacagagtc actgttccgc 1201 accctggcac aagaggttct gggtaccact aaagtcactt atggcgagca tgtgtttgat 1261 ttcggcaaac cgtttgaaaa actcaccatg cgcgaagcaa tcaaaaaata tcgtccagaa 1321 accgatatgg ccgacctgga taattttgat gctcgtaaag cattagctga atctatcggt 1381 attacggtag agaaaagctg ggggttggga cgtagtgtca cagagatctt tgatgaagtg 1441 gtagaaggac atgtgattca gccatccttt attacggaat atccggcaga agtgtccccg 1501 ctggcacgtc gtaatgatgt taacccggaa atcaccgacc gttttgaatt cttcatcggt 1561 ggtcgtgaaa tcggtaatgg ttttagcgaa ttaaacgacg cagaagatca ggctgaacgt 1621 ttccaggaac aggttaatgc taaagctgca ggtgacgacg aagccatgtt ctatgacgaa 1681 gattacgtga ctgcgctgga 
atatggtctg ccgccaaccg ctggtctggg tattggtatc 1741 gaccgaatga ttatgctgtt tactaacagc catactattc gcgacgttat tctcttcccg 1801 gcgatgcgcc cacagaaata aatttcactt taatgaacga agcagtcagg cgactgcttc 1861 gttcatcgtt ccgcaaatcc agttttgagg agagacgctt cgtttcacga taaccagcgg 1921 gtccactcaa ctacagctcc taaaagagag aaaaaagtaa ctcgacaggc agactcattg 1981 gccaggttat tccgaccagg aatgcactaa gtaaacgaat atgtgttttc taatgataga 2041 gataaaccag agtaatcgca ggttattcac gccgaatacg ttcacacatc acataaaaaa 2101 ttaatatttt agagaatagt gccatttttt atgttcttct tatccttcct ggctacttgt 2161 aaaactgacg gaaatagtca tcaactgtca taatgttgag gtcaaaagca atatccggta 2221 agtctaagca tttgagcctg attaaatctt cgatttcact tgcggctaac gatgtcgtta 2281 ttgcgaagga gttttcattc aattgtcgtg gtatcccata ttcatcagcc acgatagagg 2341 agaaattcat atcatacata catcaatgtt cttaatctta agaagagtgg ggaataagat 2401 gatggataat caaaagaaac aacataagta ttgaacttgt ttttacctcc ataatatgac 2461 tcctatcaat attaaacgct ctggggacgc aaaacttatt aaccccgatg atgattctca 2521 tataggagat attctttttt ttgctcagta aaggggatgg acttgggaca aaaaaacgcg 2581 ctctctgccc attctttgtg catcccccta cccttcagat gctctccttg catcacatcg 2641 cggagagaat gaaaaatatc gtgactctgt actctcccat caactgcgaa tatcatcata 2701 ctcccgcgtt ttatcaaatt catgtttcgc aaatgggcat aatgggataa tttttcgttt 2761 ttcccgacgc attttttcca cgactttcgc aaccagctgt taccaatccc cgttgccttt 2821 caggctttca tcgacatcgt atgttcgata actcgctaaa tctccgtcgg tcggcacaaa 2881 gacaatttca gcgatttgct tgccttgttt gtcattaatg taaaatttat tgtggccttc 2941 gcgtatttcc attttaccct cgcttattta tgacggtatt tcagcgcgcc gctcgggcac 3001 gtatcatcac tttaaccaca gtggcgacgt cgac // LOCUS PSEBCD 4007 bp ds-DNA BCT 18-JUL-1990 DEFINITION P.aeruginosa pilin biogenesis protein (pilB, pilC and pilD) genes, complete cds. ACCESSION M32066 KEYWORDS pilin biogenesis protein. SOURCE P.aeruginosa (strain PAK) DNA. ORGANISM Pseudomonas aeruginosa Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Pseudomonadaceae. REFERENCE 1 (bases 1 to 4007) AUTHORS Nunn,D., Bergman,S. and Lory,S. 
TITLE Products of three accessory genes, pilB, pilC and pilD are required for biogenesis of Pseudomonas aeruginosa pili JOURNAL J. Bacteriol. 172, 2911-2919 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by D.N.Nunn, 12-FEB-1990. FEATURES from to/span description pept 199 1899 pilin biogenesis protein (pilB) pept 1903 3123 pilin biogenesis protein (pilC) pept 3127 3999 pilin biogenesis protein (pilD) mRNA 8 < 1 (c) pilA mRNA BASE COUNT 882 a 1244 c 1110 g 771 t ORIGIN 1 cgcctaaccc taccaagcac ttaccatgcc aaataccgtt cacatagcga gcaaggccgt 61 cttatgccct tcccgaggag actcaaactt aaaacggaca ggatgtgaca ctttttgaca 121 gccggactcc cccgatttgg cagaccctgt tatctgcgtt ataaggcact tcaatagccc 181 tctgcgattt ccttctccat gaacgacagc atccaactga gcggtctgtc ccgacagctc 241 gtccaagcca atctactcga cgaaaagacc gccctgcagg cccagacgca ggcgcagcgc 301 aacaagctgt cgctggtgac gcacctggtg cagaacaagc tggtaagcgg cctggccctg 361 gccgaattgt ccgccgagca gttcggcatc gcctattgcg acctgaacag cctggacagg 421 gaaagctttc cgcgggacgc catcagcgaa aagctggtcc gccagcaccg ggtcattccc 481 ctgtggcggc ggggcaacaa gttgttcgta ggcatctccg acgccgccaa ccaccaggcc 541 atcaacgacg tccagttcag cactggcctg acgaccgagg ccattctggt cgaggacgac 601 aagctcggcc ttgccatcga caagctattc gagaatgcaa ccgacggctt ggcaggtctc 661 gatgacgtgg acctggaagg actggatgta ggcgtcaagg aaacgtccgg acaagaagat 721 actggcgcgg aggcagacga cgctcccgtc gtccgcttcg tcaacaagat gctgctggac 781 gctatcaaag gcgggtcttc cgacttgcac ttcgagccct acgagaagat ctaccgggtg 841 cgcttccgta ccgacggcat gcttcacgaa gtggccaagc cgccgatcca gttggccagt 901 cgtatctctg ctcgtctcaa ggtaatggcc ggcctggata tctccgaacg acgcaagccg 961 caggacggac ggatcaagat gcgcgtgtcg aagaccaagt ccatcgactt ccgcgtcaac 1021 accctgccga ccctgtgggg cgagaagatc gtgatgcgga tcctcgactc ctccagcgcg 1081 cagatgggca tcgacgccct gggctacgag gaggaccaga aggaactcta cctggccgca 1141 ctcaagcagc cgcagggcat gatcctggtc accggcccca ccggctcggg caagacggta 1201 tcgctgtaca ccggcctgaa catcctcaat accaccgaca tcaacatttc 
caccgccgaa 1261 gacccggtgg agatcaacct ggaaggcatc aaccaggtca acgtcaatcc gcgccagggc 1321 atggacttct cccaggcgct gcgcgccttc ctgcgccagg acccggacgt gatcatggtc 1381 ggcgagatcc gcgacctgga gaccgccgag atcgccatca aggcggcgca gaccgggcat 1441 atggtgatgt ccaccctgca caccaacagc gccgccgaga ccctgacccg cctgctgaac 1501 atgggcgtac cggcgttcaa cctggcgacc tcggtgaacc tgatcatcgc ccagcgcctt 1561 gcgcgaaaac tctgttcgca ctgcaagaaa gagcacgacg taccgaagga aaccctgctt 1621 cacgaaggtt tcccagaaga gttgatcggc accttcaagc tgtattcacc agtgggctgc 1681 gaccattgca agaacggtta caagggccgt gtcggtattt atgaagtagt taaaaacacc 1741 ccggccctgc agcggattat catggaggaa ggcaactcca tcgagatcgc cgagcaagcc 1801 cgcaaagaag gcttcaacga tctgcgcacc tcaggcctgc tgaaagccat gcaggggatc 1861 accagcctgg aggaagtcaa ccgcgtgacc aaggattaat ccatggcgga caaagcgtta 1921 aaaaccagcg ttttcatctg ggagggcacc gacaagaaag gcgccaaggt caagggcgaa 1981 ctgaccgggc agaatcccat gctggtgaaa gcccatctgc gcaagcaagg catcaatccg 2041 ctcaaggtac gcaagaaagg tatctccctg ctgggcgcag gcaagaaagt gaaacccatg 2101 gacatcgccc tgttcacccg gcagatggcg accatgatgg gcgctggcgt tcccctcctg 2161 caatcgttcg acatcatcgg cgagggcttc gacaacccca acatgcgcaa gcttgtggat 2221 gaaatcaaac aggaagtttc ctcaggtaac agcctagcca actccttgag aaaaaagccc 2281 cagtattttg acgagcttta ttgcaacctg gtagatgcag gggaacagtc tggcgccttg 2341 gaaaaccttc tcgatcgggt ggcaacctat aaagaaaaga cggaatcact gaaagccaag 2401 atcaaaaagg cgatgaccta tcccattgcc gtcatcattg tcgcactgat tgtatctgcg 2461 atcctcctga ttaaagtggt tccacaattt cagtcggtct ttgaaggttt cggcgcggaa 2521 cttcccgcct ttacccagat gattgtcaat ctatcggagt tcatgcagga gtggtggttc 2581 ttcatcatac tggcgatagc gatatttggc tttgcattca aagaattgca taaacgctca 2641 caaaaattcc gtgacacact cgacagaacg atcctcaaac ttcccatttt cggaggcatc 2701 gtctacaaat ctgcggtcgc ccgttatgca cggaccttgt ccacgacctt cgccgcgggt 2761 gttcccctgg tcgatgcgct cgactccgtc tccggagcga ccggcaatat cgtgttcaag 2821 aacgcggtca gcaagatcaa gcaagacgtt tccaccggca tgcagctcaa cttctccatg 2881 cgcaccacca gcgtctttcc caacatggcg atccagatga ccgccatcgg cgaggagtcc 
2941 ggttcgctcg atgagatgct gagcaaagtc gccagctact acgaagagga agtcgacaac 3001 gccgtggaca acctcaccac gctcatggaa ccgatgatca tggccgttct cggcgtactg 3061 gttggcggtc tgatcgtggc catgtacctt ccgatcttcc aactcggcaa cgtcgtcgga 3121 taactgatgc ccctcctcga ctacctggcc agccatccgc tggcctttgt cttgtgcgcc 3181 atcctccttg gcctgctggt cggcagcttc ctcaacgtcg tcgtgcatcg cctgccgaag 3241 atgatggagc gcaactggaa ggccgaagcc cgcgaagcgc tcgggctgga gccggaaccc 3301 aagcaggcga cctacaacct ggtgctgccg aattcggcgt gcccgcgctg cggccacgag 3361 atccgcccct gggaaaacat cccgctggtc agctacctgg cgctgggcgg caagtgctcg 3421 tcctgcaagg ccgccatcgg caaacgctac cccctggtgg aactggccac cgcgctgctt 3481 tccggctacg tcgcctggca tttcggcttc acctggcagg cgggcgcgat gctcctgctg 3541 acctggggtc tgctggcgat gagcctgatc gatgccgatc accaactgct gccggacgtg 3601 ctggtgctgc cgctgctctg gctgggcctg atcgccaacc atttcggcct cttcgccagc 3661 ctcgacgatg cgctgttcgg cgcggtgttc ggctacctga gcctgtggtc ggtgttctgg 3721 ctgttcaagc tggtgaccgg caaggagggc atgggctacg gcgacttcaa gctgctggcc 3781 atgctcggtg cctggggtgg ctggcagatc ctgccgctga ccatcctgct gtcgtcgctg 3841 gtcggggcga tcctcggggt gatcatgctg cgcctgcgca acgccgaaag cggcacgccg 3901 atccccttcg gtccctatct ggctattgcc gggtggattg ctttgctctg gggtgatcaa 3961 ataacccgga cctatctgca attcgccgga ttcaaatgac ccaaccc // LOCUS RADHRPER 955 bp ds-DNA SYN 18-JUL-1990 DEFINITION Synthetic horseradish peroxidase isoenzyme C (HRP-C) subunit alpha-1 DNA, complete cds. ACCESSION J05552 KEYWORDS peroxidase. SOURCE Synthetic DNA. ORGANISM Artificial gene Artificial sequences; Genes. REFERENCE 1 (bases 1 to 955) AUTHORS Smith,A.T., Santama,N., Dacey,S., Edwards,M., Bray,R.C., Thornely,R.N.F. and Burke,J.F. TITLE Expression of a synthetic gene for horseradish peroxidase C in E.coli and folding and activation of the recombinant enzyme with calcium and heme JOURNAL J. Biol. Chem. (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by A.T.Smith, 13-JUN-1990. 
FEATURES from to/span description pept 11 940 synthetic horseradish peroxidase isoenzyme C (HRP-C) subunit alpha-1 (E.C. 1.11.1.7) BASE COUNT 246 a 269 c 221 g 219 t ORIGIN 1 bp upstream of HindIII site. 1 aagcttaacc atgcagttaa cccctacatt ctacgacaat agctgtccca acgtgtccaa 61 catcgttcgc gacacaatcg tcaacgagct cagatccgat cccaggatcg ctgcttcaat 121 attacgtctg cacttccatg actgcttcgt gaatggttgc gacgctagca tattactgga 181 caacaccacc agtttccgca ctgaaaagga tgcattcggg aacgctaaca gcgccagggg 241 ctttccagtg atcgatcgca tgaaggctgc cgttgagtca gcatgcccac gaacagtcag 301 ttgtgcagac ctgctgacta tagctgcgca acagagcgtg actcttgcag gcggaccgtc 361 ctggagagtg ccgctcggtc gacgtgactc cctacaggca ttcctagatc tggccaacgc 421 caacttgcct gctccattct tcaccctgcc ccagctgaag gatagcttta gaaacgtggg 481 tctgaatcgc tcgagtgacc ttgtggctct gtccggagga cacacatttg gaaagaacca 541 gtgtaggttc atcatggata ggctctacaa tttcagcaac actgggttac ctgaccccac 601 gctgaacact acgtatctcc agacactgag aggcttgtgc ccactgaatg gcaacctcag 661 tgcactagtg gactttgatc tgcggacccc aaccatcttc gataacaagt actatgtgaa 721 tctagaggag cagaaaggcc tgatacagag tgatcaagaa ctgtttagca gtccaaacgc 781 cactgacacc atcccactgg tgagaagttt tgctaactct actcaaacct tctttaacgc 841 cttcgtggaa gccatggacc gtatgggtaa cattacccct ctgacgggta cccaaggcca 901 gattcgtctg aactgcagag tggtcaacag caactcttaa taaggatccg aattc // LOCUS RK2KORC 1701 bp ds-DNA BCT 18-JUL-1990 DEFINITION Plasmid RK2 kil regulatory protein (korC) (complete cds), kleA (5' end), and Tn1 bla protein (3' end) genes. ACCESSION M32794 KEYWORDS Tn1 bla protein; kil regulatory protein; kil-kor regulon; kleA protein; korC protein. SOURCE Plasmid RK2 DNA, passed in E.coli MV10. ORGANISM Plasmid RK2 Prokaryota; Bacteria. REFERENCE 1 (bases 1 to 1701) AUTHORS Kornacki,J.A., Burlage,R.S. and Figurski,D.H. TITLE The kil-kor regulon of broad host-range Plasmid RK2: Nucleotide sequence, polypeptide product and expression of regulatory gene korC JOURNAL J. Bacteriol. 
172, 3040-3050 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by D.H.Figurski, 13-MAR-1990. FEATURES from to/span description pept < 1 320 Tn1 bla protein pept 1169 1426 korC protein pept 1652 > 1701 kleA protein pept.ps 468 1146 interrupted ORFX trns < 1 467 Tn1 remnant mRNA 1153 1509 korC mRNA mRNA 1637 > 1701 kleA mRNA binding 1153 1161 korC ribosome binding site binding 1637 1641 kleA ribosome binding site signal 1568 1597 kleA promoter signal 1509 1526 pot. transcription termination signal site 1459 1471 KorB box site 1552 1562 KorA box site 1577 1593 KorC box BASE COUNT 359 a 481 c 523 g 338 t ORIGIN Map position 4.3'-6.0'. 1 ctgcagcaat ggcaacaacg ttgcgcaaac tattaactgg cgaactactt actctagctt 61 cccggcaaca attaatagac tggatggagg cggataaagt tgcaggacca cttctgcgct 121 cggcccttcc ggctggctgg tttattgctg ataaatctgg agccggtgag cgtggatctc 181 gcggtatcat tgcagcactg gggccagatg gtaagccctc ccgtatcgta gttatctaca 241 cgacggggag tcaggcaact atggatgaac gaaatagaca gatcgctgag ataggtgcct 301 cactgattaa gcattggtaa ctgtcagacc aagtttactc atatatactt tagattgatt 361 taaaacttca tttttaattt aaaaggatct aggtgaagat cctttttgat aatctcatga 421 ccaaaatccc ttaacgtgag ttttcgttcc actgagcgtc agaccccttt tatttccgaa 481 acgggctatc ggtcccatta cgaccatgcg cgcggcggca tgacggtgga ccaggtagcg 541 gacggggttt tgcgcgccct gttgcgctcg catcgccggt atctggacgc ccgcgaccag 601 gaccgccttg ccgacgagcc actgccggcg tggctcgcgg gtatcacgcc accgccgcgc 661 cgcgtgcggg ccgtggtcga ggactggcgg aagccggacg aactgccgcc gggcttcgcc 721 tgggttgatg ccgtgctgcc ggcgcaccag gcattcatcg cccgcaaatg ggcggccagc 781 gccaaggcga agctcgcggc cgcgcgtgcg aaagctcagg agccggccgg gcagcgcagg 841 gagccggtta caccggccaa gccggagccg gagccggcca aggacgagga cgcgccggcg 901 tggccggcga cgttctttcc tggcctgcgg tgcgagattg tgagcgtgca tcacccggtt 961 tttgccaagg aaatcgggaa gcacgtcatc attacgaaga tcagtcccga aacgcgacag 1021 gtgtgggcgc acgacgacaa gccgccgcgc tatcgcatca atcgcaacgg tagaaaggtt 1081 tgtgaatatg acccgcgttg catcgagtcg 
tgctacggct acgaccaatt gcgggccgcc 1141 atttgataac gattaggaga aaagcagcat gagcgacgtg aatatccggc ttgagtgcct 1201 gcgcccggcg gaacgctggg tgcagccgac cggcgcagaa atccgggaag tcttgcactt 1261 ggccggcctc accggcggac aggctgcgcg catcttgggc ttgggtgcca agggcgaccg 1321 cacggtgcgg cgttgggttg gcgaggattc gccgatcccc tatgccgcct gggcgatcct 1381 ttgcgatcta gcggggattg gggcgatctg gaaaggccag ggctgacgcc cgcaggactt 1441 tacgccaagg gagaggactt tagcggctaa aaccgccaac cctgatcgtt gcgatatgct 1501 gccagcagcc cggcccgcaa gccgggtttt ttcgtgcgcc cgccgcaaat tgtttagcta 1561 aatttccttg actatctagg gcataatgcc ctaatatagc aatccaaggc cgggcacttc 1621 gcccaggtca gcaaccggag gatcaacccc gatgaaaagc aagatcatgt cttggctcga 1681 tgagctgccc ggcgcggccg c // LOCUS STMSUAC 1380 bp ds-DNA BCT 18-JUL-1990 DEFINITION S.griseolus herbicide-inducible cytochrome P-450-SU1 (suaC) gene, complete cds. ACCESSION M32238 KEYWORDS cytochrome P-450-SU1. SOURCE S.griseolus (strain ATCC 11796) DNA, subclone pCAO302. ORGANISM Streptomyces griseolus Prokaryota; Bacteria; Firmicutes; Streptomycetaceae. REFERENCE 1 (bases 1 to 1380) AUTHORS Omer,C.A., Lenstra,R., Litle,P.J., Dean,C., Tepperman,J.M., Leto,K.J., Romesser,J.A. and O'Keefe,D.P. TITLE Genes for two herbicide-inducible cytochromes P-450 from Streptomyces griseolus JOURNAL J. Bacteriol. 172, 3335-3345 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by C.A.Omer, 23-FEB-1990. 
FEATURES         from  to/span  description
    pept          128     1348  cytochrome P-450-SU1 (suaC)
    binding       117      120  ribosomal binding site
BASE COUNT      236 a    503 c    445 g    196 t
ORIGIN
        1 gcggacaggg ggactcctga agatgtctga tagaggccgt tgcgttctct acgggggcaa
       61 gtctatgctc cgaaatagag aacatggcgt tctttaaagg tgagaattct tgaatcggag
      121 tggaccgatg accgataccg ccacgacgcc ccagaccacg gacgcacccg ccttcccgag
      181 caaccggagc tgtccctacc agttaccgga cggctacgcc cagctccggg acacccccgg
      241 ccccctgcac cgggtgacgc tctacgacgg ccgtcaggcg tgggtggtga ccaagcacga
      301 ggccgcgcgc aaactgctcg gcgacccccg gctgtcctcc aaccggacgg acgacaactt
      361 ccccgccacg tcaccgcgct tcgaggccgt ccgggagagc ccgcaggcgt tcatcggcct
      421 ggacccgccc gagcacggca cccggcggcg gatgacgatc agcgagttca ccgtcaagcg
      481 gatcaagggc atgcgccccg aggtcgagga ggtggtgcac ggcttcctcg acgagatgct
      541 ggccgccggc ccgaccgccg acctggtcag tcagttcgcg ctgccggtgc cctccatggt
      601 gatctgccga ctcctcggcg tgccctacgc cgaccacgag ttcttccagg acgcgagcaa
      661 gcggctggtg cagtccacgg acgcgcagag cgcgctcacc gcgcggaacg acctcgcggg
      721 ttacctggac ggcctcatca cccagttcca gaccgaaccg ggcgcgggcc tggtgggcgc
      781 tctggtcgcc gaccagctgg ccaacggcga gatcgaccgt gaggaactga tctccaccgc
      841 gatgctgctc ctcatcgccg gccacgagac cacggcctcg atgacctccc tcagcgtgat
      901 caccctgctg gaccaccccg agcagtacgc cgccctgcgc gccgaccgca gcctcgtgcc
      961 cggcgcggtg gaggaactgc tccgctacct cgccatcgcc gacatcgcgg gcggccgcgt
     1021 cgccacggcg gacatcgagg tcgaggggca cctcatccgg gccggcgagg gcgtgatcgt
     1081 cgtcaactcg atagccaacc gggacggcac ggtgtacgag gacccggacg ccctcgacat
     1141 ccaccgctcc gcgcgccacc acctcgcctt cggcttcggc gtgcaccagt gcctgggcca
     1201 gaacctcgcc cggctggagc tggaggtcat cctcaacgcc ctcatggacc gcgtcccgac
     1261 gctgcgactg gccgtccccg tcgagcagtt ggtgctgcgg ccgggtacga cgatccaggg
     1321 cgtcaacgaa ctcccggtca cctggtgacg ggggagaggg gcaaggacat gaccatgcgg
//
LOCUS       STMSUBC      1441 bp ds-DNA             BCT       18-JUL-1990
DEFINITION  S.griseolus cytochrome P-450-SU2 (subC) gene, complete cds.
ACCESSION   M32239
KEYWORDS    cytochrome P-450-SU2.
SOURCE      S.griseolus (strain ATCC 11796) DNA, subclone pCAO304.
  ORGANISM  Streptomyces griseolus
            Prokaryota; Bacteria; Firmicutes; Streptomycetaceae.
REFERENCE   1  (bases 1 to 1441)
  AUTHORS   Omer,C.A., Lenstra,R., Litle,P.J., Dean,C., Tepperman,J.M.,
            Leto,K.J., Romesser,J.A. and O'Keefe,D.P.
  TITLE     Genes for two herbicide-inducible cytochromes P-450 from
            Streptomyces griseolus
  JOURNAL   J. Bacteriol. 172, 3335-3345 (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly submitted
            by C.A.Omer, 23-FEB-1990.
FEATURES         from  to/span  description
    pept          195     1406  cytochrome P-450-SU2 (subC)
    binding       182      186  ribosomal binding site
BASE COUNT      207 a    605 c    432 g    197 t
ORIGIN      1 bp upstream of BamHI site.
        1 ggatccggcc accgcccgac ccgtccgcac tccgccccgc cgaccgtcgt ccatccgccc
       61 ctgcggccat gcggtttgag ccaacctcgg tgctgccgcg atctgccctt ccctcccccg
      121 ccgggcctgc gttagcgtga cgacatctta attacctaag ttaggtaatt agctcacgcg
      181 gaaggaccgg ccgcatgacg accgcagaac gcaccgctcc ccccgacgcc ctcaccgtcc
      241 cggccagccg cgcccccggc tgccccttcg accccgcgcc cgacgtcacc gaggcggccc
      301 gcaccgaacc ggtcacccgg gccaccctct gggacggctc ctcctgctgg ctggtgacgc
      361 gccatcagga cgtccgcgcg gtcctcggcg acccgcgctt cagcgccgac gcccaccgca
      421 ccggcttccc cttcctgacc gccggcggcc gcgagatcat cggcaccaac ccgaccttcc
      481 tgcgcatgga cgacccggag cacgcccgac tgcgccggat gctcaccgcc gacttcatcg
      541 tcaagaaggt cgaggcgatg cgccccgagg tgcagcgcct cgccgacgac ctggtcgacc
      601 ggatgaccac cggacgcacc tccgccgacc tggtcaccga gttcgcgctg ccgctgccgt
      661 ccctggtgat ctgcctgctg ctcggcgtcc cctacgagga ccacgcgttc ttccaggagc
      721 gcagccgggt cctgctcacc ctgcggtcca ctcccgagga agtccgggcc gcccaggacg
      781 agttgctgga gtacctcgcc cggctcgccc ggaccaagcg ggagcggccg gacgacgcca
      841 tcatcagccg cctggtcgcc cgcggcgagc tcgacgacac ccagatcgcc accatgggac
      901 gcctgttgct ggtcgccggc cacgagacga ccgccaacat gaccgcgctc tccaccctcg
      961 tgctgctgcg caaccccgac caactcgccc ggctgcgcgc cgaacccgcg ctcgtcaagg
     1021 gcgccgtcga ggagctgctg cgctacctga cgatcgtgca caacggcgtt ccccggatcg
     1081 ccaccgagga cgtgctcatc ggcggccgca ccatcgccgc cggcgagggc gtcctgtgca
     1141 tgatcagctc cgccaaccgg gacgccgagg tgttccccgg cggcgacgac ctcgacgtgg
     1201 cccgcgacgc ccgccgccac gtggccttcg gcttcggcgt ccaccagtgc ctgggacagc
     1261 cgttggccag ggtggagctc cagatcgcca tcgaaacgct gctgcgccgc ctgccggacc
     1321 tgcggctggc cgtgccccac gaggagatcc cgttccgcgg cgacatggcg atctacgggg
     1381 tccactcgct gccgatcgcc tggtagcccg ggcgccccca ccaccgacca ccacgcaccc
     1441 a
//