Path: utzoo!attcan!uunet!ns-mx!iowasp!maverick.ksu.ksu.edu!rutgers!bionet!root
From: GenBank-Updates@genbank.bio.net
Newsgroups: bionet.molbio.genbank.updates
Subject: Database Update
Message-ID:
Date: 28 Jun 90 12:00:26 GMT
Sender: root@genbank.BIO.NET
Distribution: bionet
Lines: 6168
Approved: lear@genbank.bio.net
Checksum: 30640 370

LOCUS HUMSODB 8841 bp ds-DNA PRI 28-JUN-1990 DEFINITION Human ornithine decarboxylase gene, complete cds. ACCESSION M33764 KEYWORDS ornithine decarboxylase. SOURCE Human sultan myeloma cell DNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 8841) AUTHORS Hickok,N.J., Wahlfors,J., Crozat,A., Halmekyto,M., Alhonen,A., Janne,J. and Janne,O.A. TITLE Human ornithine decarboxylase-encoding loci: Nucleotide sequence of the expressed gene and characterization of a pseudogene JOURNAL Gene (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by N.J.Hickok, 18-APR-1990. FEATURES from to/span description pept 3687 3788 ornithine decarboxylase, exon 3 (first expressed exon; E.C.
4.1.17) /nomgen="ODC1" /map="2p25" /hgml_locus_uid="LP0064V" 4072 4245 ornithine decarboxylase, exon 4 4452 4624 ornithine decarboxylase, exon 5 4883 5017 ornithine decarboxylase, exon 6 5148 5229 ornithine decarboxylase, exon 7 5406 5489 ornithine decarboxylase, exon 8 6546 6708 ornithine decarboxylase, exon 9 6791 6903 ornithine decarboxylase, exon 10 6997 7211 ornithine decarboxylase, exon 11 7851 7995 ornithine decarboxylase, exon 12 pre-msg 388 8338 ODC1 mRNA and introns IVS 602 3454 ODC1 intron A IVS 3565 3669 ODC1 intron B IVS 3789 4071 ODC1 intron C IVS 4246 4451 ODC1 intron D IVS 4625 4882 ODC1 intron E IVS 5018 5147 ODC1 intron F IVS 5230 5405 ODC1 intron G IVS 5490 6545 ODC1 intron H IVS 6709 6790 ODC1 intron I IVS 6904 6996 ODC1 intron J IVS 7212 7850 ODC1 intron K signal 359 363 TATA box rpt 2277 2586 Alu-repeat BASE COUNT 2261 a 1884 c 2279 g 2417 t ORIGIN 1 aggctctgcc gttcagctgc cgcgggcggg gccggggcct gcggcgtcgt gcgccgtgcg 61 ggaccagttc caggcgggcg agaccgcgca gggcggggcg gggcgaggcg gccgcagggc 121 ggggagggcg gggagaggcg gccgcagggc ggggagggcg gggcgcgaag ccgggggcgg 181 gggccacgcg tggggcaggc ggtgctcggc tcggctgacg tcggcccgcc ggcgccccac 241 cacgtccgcg cgggcccggg ttgccaccgc cggcccccgc ccctcccccg gcggtgtccc 301 ggccggaacc gatcgtggct ggtttgagct ggtgcgtctc catggcgacc cgccggtgct 361 ataagtaggg agcggcgtgc cgtggggctt tgtcagtccc tcctgtagcc gccgccgccg 421 ccgccgcccg ccgcccctct gccagcagct ccggcgccac ctcgggccgg cgtctccggc 481 gggcgggagc caggcgctga cgggcgcggc gggggcggcc gagcgctcct gcggctgcga 541 ctcaggctcc ggcgtctgcg cttccccatg gggctggcct gcggcgcctg ggcgctctga 601 ggtgagggac tccccggccg cggaggaagg gagggagcga gggcgggagc ggggcgggct 661 gcgggcccgg gcccgggcac gtgtgcggcg cgcctcgccg gcctgcggag acacgtggtc 721 gccgagcggg ccacgacctt gaggcgccgc ttcctcccgg cccggggttc tcccgcggct 781 ggataagggt gatccgggcg cctcgttctg cccccgtctt cacagctcgg ggctggaggg 841 gcctagggga gacccacccg gagaccctgc ggccccgcgc cggcctcttt cccaaccctt 901 cggcggccgc gcgctggccg gggagccgtt ggggaggccc tggcggccgc gcagcaggtg 
961 caggggcgca gagcctgggc tcgccttggt acagacgagc gggccccggc cttggcgcct 1021 tcagtttcct tccagttttt attttcgctg tgtctacaga gcagatgaca ccaatttgga 1081 aacccgcgag agtgggtaga gctaagatag tcttgctgta gtagctgtga tattagatgc 1141 tcggccatga cttagaggtg tttatttaag gactgtgaat gactcggtga tttcggaaaa 1201 gcttggctta gatgaacgga catacacagg ggagacagcc ctaaggtttg cagaaaaggc 1261 tgattgtgct gtttgcgaag tcgaaataat tggtgaaagt gtagaaggca gaacctctca 1321 ggaatgtctg gggaggacaa agaatgtgtt ggctgacttt gtttaaacat aaaattgggc 1381 agactttaat tgatttgtga aatttttttc aaagtttgtt tgaattagcc cctatctctt 1441 ctaacattat cctcttgtgc taattgattg accattttaa ataacttagc tgttacagaa 1501 agaccgaaag gtgttcttca gtaaaatata ttcaagtaag ttacttaagt aacgccttaa 1561 aagatacaga aaagcaaaaa agtattggcg tattaaaaag aaatcaaaac tttccaagtt 1621 taggcctgaa cattgcctta aaaatattta ataaggcctc aaatgaccca gtccgagact 1681 gcatgagcct atttattatt aaattgtaaa tattcttcat ataaacaaaa atatataacc 1741 atgtctgtaa caaaaatggt tttgctagcg ttgttactct cttcccttct ccgaggggtg 1801 atttaggcaa cttcggaggt tgacaatgcc aagcagtcac aatagataga gctttaaagc 1861 aaattctatg catgggtttg gatttatgac aggcccgtca ccctgggcct gtcatagtac 1921 cccatgccag agcaaactgt gtccccgaac cattgcctgg cctctgtgcc cgtaggctgc 1981 tggcactgaa gtgggttgca cagtggaaaa gaagaaagct ctacctggca gaaattttta 2041 aaggttaaaa taaataattt taagaaagct ggttcacaag gtgccacatt tgatgaaagc 2101 aaaatacagt ggcttttatt gttactagag tgatgttctt gcttgttttt cttttttggt 2161 gaagttagcc ccaaattatt ctcatagcta agcaaatacg agagtgactg taaggacagt 2221 tggcattccc ggaattgcta aacttggtag gcaacgctgg tttaagaata ctgagttcta 2281 gccgggcgtg gtggctcacg cctgtaatcc caacactttg ggaggctgag gcaggcggat 2341 cacctgaggt cgggagttgg agaccagcct gactaacatg gagaaacgcc atctccacta 2401 aaaatataaa attagccagg ccccgggtgt ggtggcacat gccggtaatc ccagctactc 2461 gggagactga ggcaggagaa tcgcttgaac ccaggaggcg gaggttgagg tgagccgaga 2521 tcatgccatt gcactccagc ctgggcaaca agagtaaaac tctgtctcaa aaaaaaaaaa 2581 aaaaaatact gaattctgat caggtaacag caactgtaat acaatgtgat aagttgactt 2641 
gaagattaca gtttttaaga agtatatacc cagctaatac atgaaaatta actcgtaaaa 2701 tctcaaatgc tccagacatt tccatgatgc ctgttggtca gtaaaaatca ttctaagact 2761 tagtggaagt aggaaatgtt tgtatggcaa cgtggtgaaa tcctgtctct actaaaaatg 2821 tgtataaagg ctataatgta atcccagcac tttggaagac cgaggcgggt ggatcacctg 2881 gggtcaggag tttgagaccc acctggacca caaaaattag ccgggcatgg tggcaggcgc 2941 ctgtaatccc agctgctggg gaggctgagg caggagaatc gcttgaaccc gggaggcaga 3001 ggttgcagtg agccaagatt gcaccgctgc actccagcct gggtgacagc gtgagactct 3061 gtctcaaaaa aaataaaaaa gtctataatg ctattttaag tttctaagga actgaaactg 3121 ctctgaaata aatcagacca ttataagact tttttccata tcagtgagct aagtgcagat 3181 aagcttctga aacttgcatg ctagattttt ttggtacaaa tatttgaaat gcttagtgtg 3241 ctgccttgga aaaacctggt attttttgtt gtgtccttat actgccaagg tttatggaat 3301 catgtacctt atgcctagta ataattagga tgaccaggcc agtgagtggt tcatatccgg 3361 ggcatgatta gctctgcgtg tgctcagcca gtgccccatc ttcaactcga tgtgttccta 3421 aggtagacag caaattccct attttatttc tcagattgtc actgctgttc caagggcaca 3481 cgcagaggga tttggaattc ctggagagtt gcctttgtga gaagctggaa atatttcttt 3541 caattccatc tcttagtttt ccatgtaagt attcagttta catttatgtt gcaggttaat 3601 cttaagaatt gtattgctaa ggcttctaag tgaatttctc cactctattt gcattttgtt 3661 gcatttcaga ggaacatcaa gaaatcatga acaactttgg taatgaagag tttgactgcc 3721 acttcctcga tgaaggtttt actgccaagg acattctgga ccagaaaatt aatgaagttt 3781 cttcttctgt aagtatatga ggcccatgct ggcagtgcag ctgagagtgc caggcaagtg 3841 gaaaactttg gcaaggtcta aggaagagca atgaggctta catgtcttgt tatggaatgt 3901 agaaattaat tcactggtgg taaattaata gtgataatgg tgatactcat atcagtggct 3961 agactcaaaa gagcaggatt cattgtgact gatgggaatg aaggtcgctg gctattggtg 4021 tggtgtgtgg tgaggctgct agtgagtcac ctgtgaccac tcttgtttca ggatgataag 4081 gatgccttct atgtggcaga cctgggagac attctaaaga aacatctgag gtggttaaaa 4141 gctctccctc gtgtcacccc cttttatgca gtcaaatgta atgatagcaa agccatcgtg 4201 aagacccttg ctgctaccgg gacaggattt gactgtgcta gcaaggtaag cgatagcagc 4261 aggcctcaaa agcgttgtat aaaatgggcc tggtattccc cacgaggcag atacaagttg 4321 tgttttttgg 
gcaataaatg ctcactaaag gcaaatgggg cgggggggta catgacaact 4381 tcccatgctt ttctgtttat tccacgtgtt aagccacata tggatagcat gacaccactc 4441 ttctttttca gactgaaata cagttggtgc agagtctggg ggtgcctcca gagaggatta 4501 tctatgcaaa tccttgtaaa caagtatctc aaattaagta tgctgctaat aatggagtcc 4561 agatgatgac ttttgatagt gaagttgagt tgatgaaagt tgccagagca catcccaaag 4621 caaagtgagt tattccccca tctgagggca agatcgggag cataagatat gtggattctt 4681 atcaaacaaa cttaaatttc tgattattat atttctatac tttagtagaa agtagttgaa 4741 acccccattg agtcatgaag cctgggactc aaactacaga atatatcagc gacagtattt 4801 agaacaggat tgtttttatt ttaattgtgg ctataagtga acatctatca tgagacattt 4861 gctgcacttt ccttgcttgt aggttggttt tgcggattgc cactgatgat tccaaagcag 4921 tctgtcgtct cagtgtgaaa ttcggtgcca cgctcagaac cagcaggctc cttttggaac 4981 gggcgaaaga gctaaatatc gatgttgttg gtgtcaggtg agattttggt gggatagcta 5041 gaggtcaaga cattgaacag tttgagtttt acaggctttc tcctagtgtt tgctattatt 5101 ttaagaaata ctaagacaca gtgtctcgtc tctttatttt accccagctt ccatgtagga 5161 agcggctgta ccgatcctga gaccttcgtg caggcaatct ctgatgcccg ctgtgttttt 5221 gacatggggg tgagtatacg tgaccctgtt agggaagggc gggacacaac tgacaataac 5281 tagtcttaat tctagagtta actttttatg gcagttggtt ctgtattaca tgggtttcag 5341 cctatctgct gcatacattt ttgttattag ctgtggatct ggctgactta ttttcttgat 5401 tctaggctga ggttggtttc agcatgtatc tgcttgatat tggcggtggc tttcctggat 5461 ctgaggatgt gaaacttaaa tttgaagagg taatttagaa caaaactgta atactcagta 5521 gccgttctaa taaattcctt tttggaatat ttcaaaattt aagtgtctta actaatacca 5581 caatgggctg aagtgtcttg gtgtgatatt tttgagtgat ttctttgtgc tgtctgacat 5641 tacacttgat accatttggt tttctaaagt gtgaatcagc tttcccagaa gtcttggata 5701 attggttaca ttggaaatca tggctcacac ctgtaatcca gcacttgggg aggccaaggt 5761 ggtaggatca cttgagccca ggagtttgag accagcctgg gcaacacagt gagaccccat 5821 ctctacaaaa aaaattttaa aattagcctg gtgtggtggc gggcacctgt aatcccagct 5881 acttggaagg ctgaggtggg aggatcactt gagcccagga ggttgaggct gcagtgagcc 5941 atgatcatgc cactgcactc agcctgggct acagagtgag accctgtctc aaaaaaaaaa 6001 aagaaaaagc atgttgctgt 
gggcttccta gagaatatgc tgactgtagc acatcatcac 6061 cccaaatgtg ctttgctaga cctatgcttc ctctccttaa aatacttgaa atgtttagtc 6121 acttaggaag ttaagccatt atattggtgc ttgaatttat aaaatatatc cacatggttt 6181 gttaaaatca tgacgtaggc agaataggat ttttatcctg ttggcatgta tttgttaaaa 6241 tgttttgaca tcttgatgcc ttcctaggta gtagttagtt gcgtactgtt ctttgataaa 6301 aatcataccc ataacatcct aaaggagata gggtgcctgg aggggaatga aaacgagcca 6361 cctgggatat gtagcctggt tttcagggag atgttgatgt ttttttgctt ttgttacttt 6421 aatgataaac ctgtctgttg atgcctggtc tcatgatgtc atgtcacaag gccctgtgat 6481 gttactcccc catgtgaatt tcccacaatg aaggctgctc tttcttttct gtttcactct 6541 cttagatcac cggcgtaatc aacccagcgt tggacaaata ctttccgtca gactctggag 6601 tgagaatcat agctgagccc ggcagatact atgttgcatc agctttcacg cttgcagtta 6661 atatcattgc caagaaaatt gtattaaagg aacagacggg ctctgatggt atgtataaag 6721 gacgaatcac ttcatgtata actgaaagct gatgcaaaaa gtcattaaga ttgttgatct 6781 gcctttctag acgaagatga gtcgagtgag cagaccttta tgtattatgt gaatgatggc 6841 gtctatggat catttaattg catactctat gaccacgcac atgtaaagcc ccttctgcaa 6901 aaggtaattt ctgagcatac tgtataaaac aattaagagg actggtcaca acacgtgtaa 6961 ttaagtagta cttcctctct ccgtctcttt atatagagac ctaaaccaga tgagaagtat 7021 tattcatcca gcatatgggg accaacatgt gatggcctcg atcggattgt tgagcgctgt 7081 gacctgcctg aaatgcatgt gggtgattgg atgctctttg aaaacatggg cgcttacact 7141 gttgctgctg cctctacgtt caatggcttc cagaggccga cgatctacta tgtgatgtca 7201 gggcctgcgt ggtaagtaag ccatgcatgt tgatggtgct gccaagaata ggcaccttct 7261 tggatgtgtg cttcttgtct agacgaataa gaaattgtct tgcctaagat taaatatata 7321 tggatatttt tcctaagaaa agttttagaa aagactgatg agtgtatttc tatgtaattg 7381 gaatatattt aagttcatgc catgtgtctt gtggtttcct tattaccaaa acggtgactg 7441 aagaaacgct tgctttagaa atacattgaa ttggccaggt gtgctggctc acacctgaaa 7501 tcacaacaca ttgggaggcc aaggcagaag gatcacttga gcccaggagt tcgagcctgg 7561 gcaacatagt gagaccctgt ctctacaaaa aattaaaaaa ttagttggcc atggtagtgg 7621 gcgcctgtag tcccagctgc ttggctaagg tgagaggttt gcttgagcct gggaggttga 7681 ggctgcggtg agctatgata gcaccattgt 
attccagcct gagtaacaga gaaagaccct 7741 gtctcagaaa aaaaaaaaat acattgaatt gtttcctgat ggaagtaaat actctcatgc 7801 ccagttagga gtgagtcagg gtttttaata tgccactttt tctttctcag gcaactcatg 7861 cagcaattcc agaaccccga cttcccaccc gaagtagagg aacaggatgc cagcaccctg 7921 cctgtgtctt gtgcctggga gagtgggatg aaacgccaca gagcagcctg tgcttcggct 7981 agtattaatg tgtagatagc actctggtag ctgttaactg caagtttagc ttgaattaag 8041 ggatttgggg ggaccatgta acttaattac tgctagtttt gaaatgtctt tgtaagagta 8101 gggtcgccat gatgcagcca tatggaagac taggatatgg gtcacactta tctgtgttcc 8161 tatggaaact atttgaatat ttgttttata tggattttta ttcactcttc agacacgcta 8221 ctcaagagtg cccctcagct gctgaacaag catttgtagc ttgtacaatg gcagaatggg 8281 ccaaaagctt agtgttgtga cctgttttta aaataaagta tcttgaaata attaggcatt 8341 gggacgtttt tatggtgtgt tcattccaga cagttcacga atcccgtata gctcgctctg 8401 attctcagag aacaatgagt gggtccaccc acacacaggt aggaggacag gtgagacgga 8461 agccccatcc tcccatgtgg acggtgcaca tctgctcagc ccaccccaca tgtccagagt 8521 tggctgcaaa ctccttgtcc agagcctctg gtggtgggac ctacttaagt ctgacggacc 8581 tgtcctgtcc aggccagtgc ccagggaagg tgtgggaggc cctttgagcc tggcctgcag 8641 agaccatccg tgtcccctcc caccttcatg cctgtgagaa gttaggaatg tatacggtac 8701 cacatttggc agtcagctta ttttaataaa ttcagcaaca gcaagtccct accatgttgt 8761 gtatcttcac catcttgtct gaccatgacc actggccttg tgtgttcttt tactcaacgt 8821 gtacccccgc tctcccccaa a // LOCUS XELFIMB1 1341 bp ss-mRNA VRT 28-JUN-1990 DEFINITION X.laevis integumentary mucin B.1 (FIM-B.1) mRNA, complete cds. ACCESSION J02910 KEYWORDS integumentary mucin; mucin. SOURCE X.laevis skin, cDNA to mRNA, clones pREP1218 and pXGL-3'-1. ORGANISM Xenopus laevis Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Amphibia; Lissamphibia; Anura; Archeobatrachia; Pipoidea; Pipidae. REFERENCE 1 (bases 1 to 1341) AUTHORS Probst,J.C., Gertzen,E.-M. and Hoffmann,W. 
TITLE An integumentary mucin (FIM-B.1) from Xenopus laevis homologous with Von Willebrand factor JOURNAL Biochemistry (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by W.Hoffmann, 26-APR-1990. FEATURES from to/span description pept < 1 1197 integumentary mucin (FIM-B.1; AA at 1) mRNA < 1 1341 FIM-B.1 mRNA signal 1319 1324 poly-A signal rpt 1 279 repeat sequence BASE COUNT 472 a 290 c 268 g 311 t ORIGIN 1 cctgcacctt cagaaacaac aggagagtca acacctgcac cttcagaaac aacaggagag 61 tcaacacctg caccttcaga aacaacagga gagtcaacac ctgcaccttc agaaacaaca 121 ggagagtcaa cacctgcacc ttcagaaaca acaggagagt caacacctgc accttcagaa 181 acaacagtac catcaggaga gtcaacacct gcaccttcag aaacaacagt accatcagta 241 ccatcaggag agtcaacacc tgcaccttca gaaacaacag agctgagaat catcccacca 301 gaagtatcta cagttgcagt gccagtaaca actggacaga taactcctgc agtaacaact 361 gaacacagca cagaagaaat actgacactt cctccgccag tggtaggacc agttcttcca 421 gcaaagccaa cagtagatat ctccaaatat acaaacacta caactactaa atcgactgtc 481 cctacaacta ccattcctcc taaagctact tgctgtggat cgtctggaga gtcagtacaa 541 gcaggtcata tgtggcaaac tggttgtgat gtgtgcacat gcaatggaac ctctggaaaa 601 acacaatgtg caccacgcca gtgtgaaaag gagataatct gtaagagcga cgaaaggagg 661 gtactgagaa aacctgggaa atcatgctgt ggatattgcg agccgctgac atgtaaacat 721 aatggaacag aatataagct tggggctacc tttatagaca aatcgaatcc ttgtatcact 781 tatagatgtg acgcatcagg cttgacagtg aatgttaaat catgcccaaa tgaacaagtt 841 tgcagtaaga gtgaaagaac ctatgattca gatggctgtt gtttctcatg tgacacatca 901 tgtaaaccag tccctgcgac agtgggtatc caaggggagt acgattatca gaatgaaaaa 961 acaaactgct ctgctaatat cataatggca aaatgttctg gtcaatgcca acataaattg 1021 acctatgata caattgacaa taaagttgtt accaaatgtc ggtgttgtaa ggcagaccgt 1081 gttgaaccaa gaaaagctca tctggtgtgt gataatggaa agaaaaaaat atataaatac 1141 aaacatatca catcttgcaa atgcacatct tgtaccgctt acaatataag actgtgaaga 1201 ctttcatgca tttgctaact gtcagctttt gcatgtacaa taaaataaac atgggggata 1261 tattggaggc agagatatct gttatggtac ataaagtatt 
ttttgtctgt tattgaaaaa 1321 taaatggttt ttttcagctg t // LOCUS LACISTRAN 808 bp ds-DNA BCT 28-JUN-1990 DEFINITION L.lactis insertion sequence IS946 transposase gene, complete cds. ACCESSION M33868 KEYWORDS transposase. SOURCE L.lactis DNA. ORGANISM Lactococcus lactis Prokaryota; Bacteria; Firmicutes; Regular asporogenous rods; Lactobacillaceae. REFERENCE 1 (bases 1 to 808) AUTHORS Romero,D.A. and Klaenhammer,T.R. TITLE Characterization of gram-positive insertion sequence IS946, an Iso-ISS1 element, isolated from the conjugative Lactococcal plasmid pTR2030 JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by D.A.Romero, 26-APR-1990. Author address: D.A.Romero North Carolina State University Dept. of Food Science Box 7624, Raleigh, NC 27695 FEATURES from to/span description pept 74 754 transposase (put.) rpt 1 18 inverted repeat rpt 791 808 inverted repeat signal 6 35 promoter region binding 62 65 ribosomal binding site BASE COUNT 281 a 137 c 158 g 232 t ORIGIN 1 ggttctgttg caaagttttc tgataagtct attttagtgt aaaatgaata aaaatgacag 61 cgaggatata tcaatgaact attttaaagg taaacaattt caaaaagatg tgattattgt 121 cgctgttggt tactacctgc gttacaatct aagctatcgt gaagttcagg aattgttata 181 tgatcgtgga ataaatgttt gtcatactac gatttatcgt tgggtgcaag aatacagtaa 241 agtcctctat catctctgga aaaagaaaaa tagacagtcc ttctattcgt ggaaaatgga 301 cgaaacctat atcaaaatca aaggtcgttg gcattatctc tatcgtgcaa ttgatgcgga 361 tggcttaacc ttagatatct ggttacgaaa gaaacgggat acgcaagcag cctatgcttt 421 cttaaaacga ctccataaac agtttggtca accaagagta attgtcacgg ataaagcgcc 481 ctctattggt tctgcattta gaaagttaca gagtaacggt ttatatacta agacagagca 541 tcgaaccgtg aagtatctca ataacctcat tgagcaagac catcgaccaa tcaaacgacg 601 caataaattt tatcgaagtc tacgaactgc ctcaaccacg attaagggca tggaaacaat 661 tcgaggaata tacaaaaaga accgaagaaa tggaacgctc ttcggatttt cggtatctac 721 tgagattaag gtcttaatgg gaatattagc ttaagaacaa gaaggattat aaaccttgta 781 tttgattttt aaactttgca acagaacc // 
LOCUS RSSTRXA 464 bp ds-DNA BCT 28-JUN-1990 DEFINITION R.sphaeroides Y thioredoxin (trxA) gene, complete cds. ACCESSION M33806 KEYWORDS thioredoxin Y. SOURCE R.sphaeroides DNA, clone pUTC3. ORGANISM Rhodobacter sphaeroides Prokaryota; Bacteria; Gracilicutes; Anoxyphotobacteria; Purple nonsulfur bacteria. REFERENCE 1 (bases 1 to 464) AUTHORS Pille,S., Chuat,J.-C., Breton,A.M., Clement-Metral,J.D. and Galibert,F. TITLE Cloning, nucleotide sequence, and expression of the Rhodobacter sphaeroides Y thioredoxin gene JOURNAL J. Bacteriol. 172, 1556-1561 (1990) STANDARD simple staff_review FEATURES from to/span description pept 57 377 thioredoxin (trxA) BASE COUNT 84 a 147 c 153 g 80 t ORIGIN 1 ccggcagggg gccatcctta ctttccaagc aagcctacgg atcaggagaa ttgaatatgt 61 ccaccgttcc cgtgacggac gccaccttcg acaccgaggt gcgcaagtcc gacgtgcccg 121 tcgtcgtcga tttctgggcc gaatggtgcg gcccctgccg gcagatcggc ccggcgctcg 181 aggagctctc gaaggaatat gccggcaagg tgaagatcgt gaaggtcaat gtcgacgaga 241 accccgagag cccggcgatg ctgggcgttc gcggcatccc ggcgctgttc ctgttcaaga 301 acggtcaggt cgtgtcgaac aaggtcggcg ctgcgccgaa ggccgcgctg gccacctgga 361 tcgcttcggc gctctgagat cctctgccat agcagagacc ggacggggcg ccttcgggcg 421 cccttttcct tgagcaagcg accgaccgcg acgggcgagg atcc // LOCUS PSECATRA 1070 bp ds-DNA BCT 28-JUN-1990 DEFINITION P.putida catR regulatory protein gene, complete cds, and catBC gene, 5' end. ACCESSION M33817 KEYWORDS regulatory protein. SOURCE P.putida DNA. ORGANISM Pseudomonas putida Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Pseudomonadaceae. REFERENCE 1 (bases 1 to 1070) AUTHORS Rothmel,R.K., Aldrich,T.L., Houghton,J.E., Coco,W.M., Ornston,L.N. and Chakrabarty,A.M. TITLE Nucleotide sequencing and characterization of Pseudomonas putida catR: a positive regulator of the catBC operon is a member of the lysR family JOURNAL J. Bacteriol.
172, 922-931 (1990) STANDARD simple staff_review FEATURES from to/span description pept 135 1004 catR regulatory protein pept 3 < 1 (c) cis,cis-muconate-lactonizing enzyme I (catB) (EC 5.3.3.4) mRNA 61 < 1 (c) catBC mRNA BASE COUNT 199 a 355 c 329 g 187 t ORIGIN 1 cattgttgcc aggtccgttg tttcgggctg ccagccgcgg gcccagtggt gatttacttg 61 ttgttcaagg attgcgcgag accctgatag ccgtccaata tcgaatgaat ctcccaccat 121 accctggagg tctgatggag ctgcgccact tgcgttactt caaggtcctg gccgaaaccc 181 tgaacttcac tcgcgccgcc gagctgctgc acattgccca gccgccgctg agccggcaga 241 tcagccaact cgaggaccag ctcggtacct tgctggtagt gcgcgagcgc ccgctgcggc 301 tgacagaggc gggtcgcttc ttctacgaac agagctgcac cgtgctgcag ctgcagaaca 361 tcagcgacaa cacccgtcgc attggccagg gccagcgcca gtggctgggg atcggcttcg 421 ccccgtcgac cctgtacaag gtgctgccgg agctgatccg cgagctgcgc caggacagcg 481 agctggaact gggcctcaac gagatgacca cgctgcagca ggtggaggcg ctgaaaagcg 541 ggcgcatcga catcgccttc gggcgcattc gcatcgatga cccggcgatt caccaacaag 601 tgctgtgcga ggacccgctg gtggcggtat tgcccaagga tcacccgctg gcgagcagcc 661 cgcttacact tgcccagctg gctggcgagg cgttcatcct ctacccggcc aacccgcggc 721 ccagctatgc cgaccatgtg ctggcactgt tcgcccacca cggcatgagc atccacgtca 781 gccaatgggc caacgaactg cagaccgcca tcggcctggt ggccgtcggc gtgggcgtga 841 ccctggtgcc ggcgtcggtg caacagcagc accgcaccga tatcgaatat gtaagcctgc 901 tcgacagcgg cgccgtcagc ccgatcatcc tcagccggcg caagggcgat gtgagcccga 961 tcgtgcagcg ctgcctgaca ttgattgcac aacaggccga atgagggcgc gctcctggaa 1021 tcaacggtct ggctaaacac ctgccccctg tgggagcggg ttcacccgcg // LOCUS HUMMHDQ1A 1515 bp ds-DNA PRI 28-JUN-1990 DEFINITION Human MHC class II HLA-DQ-alpha gene, regulatory region. ACCESSION M33765 KEYWORDS cell surface antigen; cell surface glycoprotein; class II gene; major histocompatibility complex. SOURCE Human (DR4, DR4) DNA, clone H11A. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. 
REFERENCE 1 (bases 1 to 1515) AUTHORS Ombra,M.N., Del Pozzo,G., Perfetto,C. and Maffe,A. TITLE Effect of the AIR-I locus on the activation of an enhancerless HLA-DQA1 JOURNAL Immunogenetics (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by J.Guardiola, 18-APR-1990. BASE COUNT 441 a 322 c 340 g 412 t ORIGIN Chromosome 6p21.3. 1 atataactga tgacctcatc tatccatgag gctgctcacg aaatgccctc ccctggccag 61 gcacggtggc tcatgcctgt aatccaagca ctttgggagg ctgaggcagg cagatcatga 121 ggtcaggagt tcaagaccag cgtggccaac atagtgaaac tctgtctcta ctaaaaatac 181 agaattagcc gggcatggtg gcagcgctgc agtcaccagc tactcgggag ggtgaggcag 241 gagaatcgct tgaacctggg aggcaaaggt tgcagtgagc cgagaccatg ccattgtacc 301 tcagcctggg tgacagagtg agactacgtc tcgaaaagaa aagaaaaaga aaagaaaaag 361 aaaaaaaaag aaaaatgccc atccctcttg cgaatggcag acatgcacac accagagaag 421 attccaattt agtgtcctcc ctctgttcat agaacaattc ctcaagtcca ctctgagtag 481 aggctgcatc acaacaaggg gattgccctg tctccttcca gggctcttaa tacaaactct 541 tcagctagta actgagatgt caccatgggg gatttttcta attggccaaa acctgacctg 601 gcagggtttg gtttgggtgt cttcagattt ccttgtcttg aggccctcac aattgctcta 661 cagctcagaa cagcaactgc tgaggctgcc ttgggaagag gatgatccta aacaaagctc 721 tggatgctgg gggccctcgc cctgaccacc gtgatgagcc cttgtggagg tgaagacatt 781 gtgggtgagt gcatgagtga ggaaatgttc tctggagctg aaaaacagta aattgaagga 841 aaagagagaa agcgatttgc agagaaattg tagagatttc ctaagaccct ttcactatta 901 agagaaatta aaaattatag ctgttcctcc ttcaggaaac cagagaccca atccactctt 961 tttgttatgt atgcttttgt gttcactaag gatgctattc tgtttatatt atattcagtg 1021 acaacagcct ggaggtctct atgtcgttcc gtcatgattg cctcaaaaat tagtgaagtt 1081 tccatcagtg gaataatttt ttattattaa aaatgtatga agtgtcattc tcaaatttcc 1141 ctgaacaact tttgaagctt ttcgtatgtc tcctgtagta gatcttgggg tcgttccatc 1201 aattatatac tctatagata ttaaaaaagt tgcccgtttc tttctctcag acttactcac 1261 atttccacat gggaactggc acaggtgggg agtgggtaaa ggagtccagc aggctgaatg 1321 ccttcaacaa tcattttacc acatggtcct cacttactct 
cagctgcctc atatgtgtca 1381 cctcacaaaa taatcaaaat aaaaatgggc atgtagctaa gctttgtaaa tagtgaaaac 1441 atggaatgtc aattgttttt acatatttct attacaggta tagcttcaca tttttcttta 1501 gcaaaataag ggatc // LOCUS FSOSTI35X 1856 bp ds-DNA PLN 28-JUN-1990 DEFINITION F.solani STI35 protein gene, complete cds. ACCESSION M33642 KEYWORDS STI31; heat-shock gene. SOURCE F.solani phaseoli mycelial ATCC 60860 isolate DNA, clone cKAES018. ORGANISM Fusarium solani Eukaryota; Plantae; Thallobionta; Eumycota; Ascomycotina; Pyrenomycetes; Hypocreales; Hypocreaceae. REFERENCE 1 (bases 1 to 1856) AUTHORS Choi,G.H., Marek,E.T., Schardl,C.L., Richey,M.G., Chang,S. and Smith,D.A. TITLE STI35, a stress-responsive gene in Fusarium spp JOURNAL J. Bacteriol. (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by C.L.Schardl, 08-APR-1990. FEATURES from to/span description pept 564 1443 STI35 protein, exon 1 1492 1586 STI35 protein, exon 2 IVS 1444 1491 STI35 intron A BASE COUNT 401 a 585 c 445 g 425 t ORIGIN 1 acacactctc tgttgacgca tcctcagact cggaaataca tttcgttggc cgatatcgcg 61 ggttactcca tcatgatatc agccatccat tacatcgtta cccctcatct gcggtataaa 121 agccagccat tgtctcctgc tgcttgatcc attctctctt cttcttcatc aactcatcaa 181 caacttgaac caaaaccctt atcaagactc aaggtttgta ctcgaggtga gaggtgatga 241 actggtctga cacctgtgac atgctagata tcatcacttc cggtacggat cctccgtcga 301 ggaatctgca gccatggagt acctttttgc gggtagttgt ccgaacctga ggttggagcc 361 attcaggtaa agcatcatgc atgagccggt gcccgttctt cccttcttgg ccattgtctg 421 tactcctgtg tacatcaatg tcaagaagag gagagcgtgg ctgagattat acggcgaaac 481 ttgatctgga taataccagc gaaaggatca tgctctcctc cccccattga ccccaatccc 541 ctaacacaac cccaggccca acaatgtctc ctcccgctgc cgtctccccc cctgcccgct 601 ctgctgagct cgcctccgcc cccgccgtca agctccccgt cggcctgagc aagaacagcg 661 ctgctgccac caccgtcgag gagatggagg gcaagtggga cgacttcaag tttgccccca 721 tccgcgagag ccaggtctcc cgggccatga cccgccgcta cttccaggac ctcgacaact 781 acgccgagtc cgacattgtc attgtcggcg 
cgggatcctg cggtcttagc acccgctaca 841 tcctcggcaa gaagaggccc gacctcaaga tcgccatcat cgaggcttct gtgtctcctg 901 gtggtggtgc ctggctcggt ggccagctct tctcggccat ggtgatgcgc aagcccgccg 961 atgccttcct ccgcgaagtc ggcgtccctt acgaggacga gggcaactat gtcgtcgtca 1021 agcacgctgc cctcttcacc tcgaccatca tgtccaaggt gctccagctc cccaactgca 1081 agctcttcaa cgccacctgc gtcgaggacc tcatcacccg accttccaag gagggagtcc 1141 gcatctctgg tgtcgtcacc aactggaccc tcgtgtcgat gcaccacgac gaccagtcct 1201 gcatggaccc caacaccatc aacgcgcccc tcgtcatctc caccaccggc cacgacgccc 1261 ccatgggtgc cttttgcgtc aagcgtctcg tcagcatggg acgcatcgag aagctcggcg 1321 gcatgcgtgg tctcgacatg aacgtggccg aggatgccat cgtcaaggga acccgtgaga 1381 ttgtccctgg tctgattgtc ggaggaatgg agctttccga ggttgacgga gccaaccgaa 1441 tgggtgagtt tctcgacacg agtatcaata actggttgct aactcgacta ggccctactt 1501 tcggtgccat ggttctgagc ggtctcaagg ctgccgagga ggccctcaag gtcattgaca 1561 tccgccagaa gcagaactcg ttctaagcga gttgacgtac aagaagaaca tggactggga 1621 aacacgatga ttcaggaaac atgaattcag gagccaggac ctcgtctctc tttcgaggca 1681 tggatccttg tcgctcatta ggccctttca gagaatgaaa ttggatatca aaactgctga 1741 attgctcata ctttctcgat ataaaacttg acgtacaatg tgcaactcca agaactttac 1801 tcatcctcca tctccttccc ctttggcttc tcgggcggaa acttcatccc cccgtt // LOCUS FSOSTI35Y 1177 bp ss-mRNA PLN 28-JUN-1990 DEFINITION F.oxysporum STI35 protein mRNA, complete cds. ACCESSION M33643 KEYWORDS STI35 protein. SOURCE F.oxysporum cucumerinum (isolate B1-GK) mycelial culture, cDNA to mRNA, clone pKAE5030. ORGANISM Fusarium oxysporum Eukaryota; Plantae; Thallobionta; Eumycota; Ascomycotina; Pyrenomycetes; Hypocreales; Hypocreaceae. REFERENCE 1 (bases 1 to 1177) AUTHORS Choi,G.H., Marek,E.T., Schardl,C.L., Richey,M.G., Chang,S. and Smith,D.A. TITLE sti35, a stress-responsive gene in Fusarium spp JOURNAL J. Bacteriol. (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by C.L.Schardl, 08-APR-1990. 
FEATURES from to/span description pept 29 991 STI35 protein mRNA < 1 1145 STI35 mRNA (alt.) mRNA < 1 1177 STI35 mRNA (alt.) BASE COUNT 264 a 328 c 291 g 294 t ORIGIN 1 aagatatcat aacctgccca agatcaaaat ggcccctccc gctgctgtct cccctccctc 61 tcgctctgca gagcttgcta catctaccaa gctcccagtc atgagcaaga acatcaacac 121 caagaccgtt gaggagatgc tcggtcagtg ggatgatttc aagtttgctc ccatccgcga 181 gagccaggtc tcccgcgcca tgactcgccg ctacttccag gaccttgata actatgccga 241 gtctgacatt gttatcatcg gtgctggttc gtgcggtctg agcgccgcgt acattcttgg 301 caagaagcgt cctgatctca agatcgccat cattgaggct tctgtgtctc ctggtggtgg 361 tgcttggctt ggtggacagc tcttctccgc catgatcatg cgcaagcctg ctgatgcttt 421 cctccgcgag gtcggtgttc cttatgaaga tgagggtaac tacgtcgttg tcaagcacgc 481 cgctctcttc acctcgacca tcatgtccaa ggttcttcag atgcccaaca tcaagctctt 541 caatgccact tgtgttgagg atctcatcac ccgaccttcc gaggagggag tccgcattgc 601 cggtgttgtc accaactgga ctcttgtttc catgcaccac gatgaccagt cttgcatgga 661 ccccaacact attaacgctc ctttgatcat ctccaccacc ggccacgatg gcccgatggg 721 agctttctgt gtcaagcgcc ttgtcagcat gcagcgcatt gagaagctcg gtggtatgcg 781 tggtcttgac atgaacctcg ccgaggatgc cattgtcaag ggaacccgtg agattgttcc 841 tggtcttatt gttggtggaa tggaactttc cgaggttgac ggtgccaacc gcatgggtcc 901 taccttcggt gccatggctc tcagtggtct caaggctgcc gaggaagctc tcaagatctt 961 cgacactcgc aagaagcaga acgatctgta agcgagatca tcaaagaagg tcatagttgg 1021 gaaacggaac atgtgaattc aggatcctgg acctcgtctc tcattcgagg catggaacct 1081 tgtcattcat ttagtctgag cgactcaagt ttggaaaaca tgaacccaaa gtcggatgcc 1141 actcgaaaga gacgccaaat aaagcaatac taaattg // LOCUS STMGLUSYN 1208 bp ds-DNA BCT 28-JUN-1990 DEFINITION S.hygroscopicus glutamine synthetase (GLNB) gene, complete cds. ACCESSION M33783 KEYWORDS glutamine synthetase. SOURCE S.hygroscopicus (strain ATCC 21705) DNA. ORGANISM Streptomyces hygroscopicus Prokaryota; Bacteria; Firmicutes; Streptomycetaceae. REFERENCE 1 (bases 1 to 1208) AUTHORS Kumada,Y., Takano,E., Nagaoka,K. and Thompson,C.J. 
TITLE Streptomyces hygroscopicus has two glutamine synthetase genes JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by C.J.Thompson, 20-APR-1990. Author address: C.J.Thompson Institut Pasteur 28 Rue du Dr. Roux 75015 Paris FRANCE Tel: (33) 145688842 FEATURES from to/span description pept 22 1158 glutamine synthetase (GLNB) precursor sigp 22 84 glutamine synthetase signal peptide matp 85 1155 glutamine synthetase BASE COUNT 198 a 432 c 418 g 160 t ORIGIN 1 gaattccttg gaaacggctg gatgtccggc tcggtaacct gtggttcaca aacgggcaac 61 ggacgagaaa tggcgcgttg cgagagtgcg gaggtacgcc cgcaccgtac ccgcaccgac 121 cgtgaagagg accccgtgag catcaaggcc gagtacatct ggatcgacgg cacgcagccg 181 accgccaagc tccgctccaa gaccaagatc ctgtccgacg gcagccggct tccgcggtgg 241 ggcttcgacg gttccagcac caaccaggcc gaaggccacg cctcggacct cgtactggag 301 ccggtgttca gctgcccgga cccgatccgc ggcggcgacc acctgctggt gctgtgcgag 361 gtgctgcaca ccgacctcac cccgcacccc tccaacaccc gggcgctgct gcgcccggtc 421 gcggagcggt tcgccggcca ggagccgatc ttcggcatcg agcaggagta caccttcctc 481 aagggcgacc gcccgctcgg cttccccgag ggcggcggct acccggcccc gcaggccgac 541 tactactgcg gcgtgggcgc cgacgcgatc ttcggccggg agatcgtcga gaagcacctc 601 gacctgtgcc tggcggccgg tctgggcctg tccggcatca acgccgaggt catgcccggc 661 cagtgggagt tccaggtcgg cgcgctgccg ccgctggagg tctcggacca catgtgggtg 721 gcgcgctggc tgctgcaccg ggtggcggag gagttcggcg tcaccgcgtc gctggacgcc 781 aagccggcca agggcgactg gaacggcgcg ggcgcgcaca ccaacttctc cacccgcgcg 841 atgcgcgagg gctacgaccc gatcatcacc gcctgcgagg cgctgggcca ggacgacaag 901 ccgctggagc acgtccgcca gtacggcacc ggcatcgagg accggctgac cggcgcgcac 961 gagaccgccc cctgggacgc gtactcctac ggcgcctccg accgcggcgc ctcggtgcgc 1021 atcccctggc aggtcgaggt cgagaagaag ggctacatcg aggaccggcg cccgaacgcc 1081 aacgtcgacc cgtacgtggt cacccggctg atggtggaca cctgctgcac ggagctggcg 1141 cggcgcgagc agatctgacg cctgcggtca cgtggccgtg cggtcacgtg accgcacggt 1201 cgcgcgag // LOCUS BLLEBG 2323 bp ds-DNA BCT 
28-JUN-1990 DEFINITION B.lautus endo-beta-1,4-glucanase (celB) gene, complete cds. ACCESSION M33762 KEYWORDS endo-beta-1,4-glucanase. SOURCE B.lautus (strain PL236) DNA. ORGANISM Bacillus lautus Prokaryota; Bacteria; Firmicutes; Endospore-forming rods and cocci; Bacillaceae. REFERENCE 1 (bases 1 to 2323) AUTHORS Joergensen,P.L. and Hansen,C.K. TITLE Multiple endo-beta-1,4-glucanases from Bacillus lautus PL236 and characterization of the celB gene JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by C.K.Hansen, 18-APR-1990. Author address: C.K.Hansen Institut Pasteur 24 rue du Dr. Roux 75724 Paris Cedex 15 FRANCE Email: DEDONDER%PASTEUR.BITNET@CUNYVM.CUNY.EDU FEATURES from to/span description pept 172 1872 endo-beta-1,4-glucanase precursor (celB) (E.C. 3.2.1.4) sigp 172 262 endo-beta-1,4-glucanase signal peptide matp 263 1869 endo-beta-1,4-glucanase signal 46 51 -35 region signal 70 75 -10 region binding 158 164 ribosome binding site signal 1956 1981 transcription terminator BASE COUNT 644 a 508 c 563 g 608 t ORIGIN 1 ggggttttgt tcatttcaag aaagagcacg aataaacatc gctaatttac acatgattta 61 ttctctaatt attatggtgc atgccaattg gaaaatgtag tagattagtc atcgtaacct 121 gctttcatgc caagatatgt aatttttaaa aaagagtaaa ggagaattca gatgaaaaaa 181 cgtagaagca gtaaagttat tctttcgttg gccatcgttg ttgcattatt ggcagccgtc 241 gaacctaatg ccgctttagc agcggctcca ccaagtgcca tgcagtccta tgttgaagcg 301 atgcagcctg gctggaacct tggcaattct ctggatgctg tcggtgcgga tgagacgctg 361 gcacggggca atccgcggat cacgaaagag ctcattcaga acatcgctgc gcaaggctat 421 aagagcatac ggattcctgt tacctgggat tcccatatcg gcgcggcccc aaattatcaa 481 attgaagctg cgtacctcaa tcgagtgcag gaggtcgtac agtgggcttt ggacgcgaac 541 ctctatgtga tgattaatgt ccatcatgat tcctggctat ggatcagcaa aatggagtcg 601 cagcacgatc aagtactggc ccgttataat gcgatttgga cgcaaattgc caacaagttc 661 aagaacagcc cgagcaagct gatgttcgag agcgtgaatg agcctcgctt tacggatggc 721 ggaactacgg atgaagccaa gcagcaaaaa atgctggacg 
agctgaacgt atcctttttc 781 aacatcgtca gaaattccgg cggccagaac gcgactcgcc cgctagttct ttctacgttg 841 gaggcctctc ccacccaaga gagaatgacg gcgctttata atacgatgac caaactgaac 901 gacaagaatc tgatcgcaac cgttcatttt tatggattct ggccgtttag cgtaaatatc 961 gcaggatata cgaaatttga tgcggagacg caaaatgata ttataacgac cttcgataac 1021 gtgtataaca catttgtagc aaagggaatc ccggtggtag tcggcgaata tggccttctt 1081 ggattcgata agaataccgg cgtcattgaa cagggtgaga aattgaaatt tttcgagttt 1141 tttgcccagt atgtgaagca aaaaagcatt tccactatgc tatgggataa cggacagcac 1201 ttcaaccgca cgagcttcaa gtggtctgac ccggatttat tcaatatgat caaggccagt 1261 tggaccggac gttcatccac ggcttccagc gacctgatcc atgtcaagca gggcacggcg 1321 gtaaaagata cttcggttca gctcaatctt aacgggaata cgctaacttc cctttccgta 1381 aatggaacga cactgaaatc aggcacagat tacactttaa acagcagcag attaactttt 1441 aaagcgagcc agttgaccaa gctgacctcc ttgggcaaat tgggggtcaa cgcgacgatc 1501 gtgactaaat tcaatagagg cgccgactgg aagttcaacg tagtcctgta caatacgcct 1561 aagcttagca gtacgacggg gactacttct tcctttgcga ttccaacggc tttcaacggg 1621 gatcagcttg ctacgatgga agcggtctat gtaaacggcg gcaatgccgg tccgcataac 1681 tggacttcct ttaaggaatt cgaaacgacg ttcagccccg cttatagcga ggggaaaatc 1741 aaactgcagc aggcgttctt taatgaagtg aatgatacca cagtcacgct caagttccaa 1801 ttctggagcg gggagatcgt caactacacg attaaaaaga gcggttcgac ggtgacgggt 1861 acggcttcat aagcgagttt ggcaaaaaag gaccgatata ctgcctaatt tggtattgcc 1921 ttagttgaaa gcaattgctc cgaataaaca gaatgaagcc ccggccagct ggccgggact 1981 tatgcgttta ggaagtataa acgaatcatc agcaatttat ttagctcgtc tcagttcagc 2041 aatatcggct tcatgtgaaa cggagcggat gaacaatctt tcgagcaatt tctcatgctc 2101 ctgctgggtt tggagaacgg tttgctgatt agttttaagt acagatatat cctcacggac 2161 ttgattgatt catgtggtcc gttagttctt ctacctttgt atttgtggca gcaacgatat 2221 gaattaattg ttgaatgtgc ccgccatgac tgtttagctg ctcattgtgg ctttgtaact 2281 gttctcggat ttctttgaat tcttggtcgt gctcattaag ctt // LOCUS HLFRRD1 394 bp ss-rRNA BCT 28-JUN-1990 DEFINITION Haloferax 16S ribosomal RNA, partial sequence. ACCESSION M33803 KEYWORDS 16S ribosomal RNA. 
SEGMENT 1 of 3 SOURCE Haloferax sp. (sub-species PHENON K; strain Aa 2.2) ribosomal RNA. ORGANISM Haloferax sp. Prokaryota; Bacteria; Mendosicutes; Archaeobacteria; Halobacteriales; Halobacteriaceae. REFERENCE 1 (bases 1 to 394) AUTHORS Holmes,M.L. and Dyall-Smith,M.L. TITLE A plasmid vector with a selectable marker for halophilic archaebacteria JOURNAL J. Bacteriol. 172, 756-761 (1990) STANDARD full staff_review REFERENCE 2 (bases 1 to 394) AUTHORS Holmes,M.L. and Dyall-Smith,M.L. TITLE A plasmid vector with a selectable marker for halophilic archaebacteria JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [2] kindly submitted by M.L.Dyall-Smith, 19-JUN-1990. FEATURES from to/span description rRNA < 1 > 394 16S rRNA revision 249 249 c in [1]; g in [2] BASE COUNT 102 a 101 c 126 g 65 t ORIGIN 1 gaaaagctca gtaacacgtg gccaaactac cctacagaga acgataacct cgggaaactg 61 aggctaatag ttcatacggg agtcatgctg gaatgccgac tccccgaaac gctcaggcgc 121 tgtaggatgt ggctgcggcc gattaggtag acggtggggt aacggcccac cgtgccgata 181 atcggtacgg gttgtgagag caagagcccg gagacggaat ctgagacaag attccgggcc 241 ctacggggcg cagcaggcgc gaaaccttta cactgcacgc aagtgcgata aggggacccc 301 aagtgcgagg gcatatagtc ctcgcttttc acgactgtaa ggcggtcgtg gaataagagc 361 tgggcaagac cggtgccagc cgccgcggta atac // LOCUS HLFRRD2 254 bp ss-rRNA BCT 28-JUN-1990 DEFINITION Haloferax 16S ribosomal RNA, partial sequence. ACCESSION M33804 KEYWORDS 16S ribosomal RNA. SEGMENT 2 of 3 SOURCE Haloferax sp. (sub-species PHENON K; strain Aa 2.2) ribosomal RNA. ORGANISM Haloferax sp. Prokaryota; Bacteria; Mendosicutes; Archaeobacteria; Halobacteriales; Halobacteriaceae. REFERENCE 1 (bases 1 to 254) AUTHORS Holmes,M.L. and Dyall-Smith,M.L. TITLE A plasmid vector with a selectable marker for halophilic archaebacteria JOURNAL J. Bacteriol. 
172, 756-761 (1990) STANDARD full staff_review FEATURES from to/span description rRNA < 1 > 254 16S rRNA BASE COUNT 67 a 59 c 85 g 43 t ORIGIN About 140 bp after segment 1. 1 tacgtccggg gtaggagtga aatcccgtaa tcctggacgg accaccgatg gcgaaagcac 61 ctcgagaaga cggatccgac ggtgagggac gaaagctagg gtctcgaacc ggattagata 121 cccgggtagt cctagctgta aacgatgctc gctaggtgtg acacaggcta cgagcctgtg 181 ttgtgccgta gggaagccga gaagcgagcc gcctgggaag tacgtccgca aggatgaaac 241 ttaaaggaat tggc // LOCUS HLFRRD3 539 bp ss-rRNA BCT 28-JUN-1990 DEFINITION Haloferax 16S ribosomal RNA, partial sequence. ACCESSION M33805 KEYWORDS 16S ribosomal RNA. SEGMENT 3 of 3 SOURCE Haloferax sp. (sub-species PHENON K; strain Aa 2.2) ribosomal RNA. ORGANISM Haloferax sp. Prokaryota; Bacteria; Mendosicutes; Archaeobacteria; Halobacteriales; Halobacteriaceae. REFERENCE 1 (bases 1 to 539) AUTHORS Holmes,M.L. and Dyall-Smith,M.L. TITLE A plasmid vector with a selectable marker for halophilic archaebacteria JOURNAL J. Bacteriol. 172, 756-761 (1990) STANDARD full staff_review FEATURES from to/span description rRNA < 1 > 539 16S rRNA BASE COUNT 132 a 140 c 159 g 108 t ORIGIN About 45 bp after segment 2. 1 ctcaacgccg gacatctcac cagctccgac tacagtgatg acgatcaggt tgatgacctt 61 atcacgacgc tgtagagagg aggtgcatgg ccgccgtcag ctcgtaccgt gaggcgtcct 121 gttaagtcag gcaacgagcg agacccgcac ttctaattgc cagcagcagt ttcgactggc 181 tgggtacatt agaaggactg ccgctgctaa agcggaggaa ggaacgggca acggtaggtc 241 agtatgcccc gaatgagctg ggctacacgc gggctacaat ggtcgagaca atgggttgct 301 atctcgaaag agaacgctaa tctcctaaac tcgatcgtag ttcggattga gggctgaaac 361 tcgccctcat gaagctggat tcggtagtaa tcgcatttca atagagtgcg gtgaatacgt 421 ccctgctcct tgcacacacc gcccgtcaaa gcacccgagt gaggtccgga tgaggccacc 481 acacggtggt cgaatctggc ttcgcaaggg ggcttaagtc gtaacaaggt agccgtagg // LOCUS ZMOADHA 1365 bp ds-DNA BCT 28-JUN-1990 DEFINITION Z.mobilis alcohol dehydrogenase I (adhA) gene, complete cds. ACCESSION M32100 KEYWORDS alcohol dehydrogenase I. SOURCE Z.mobilis (strain CP4) DNA. 
ORGANISM Zymomonas mobilis Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Facultatively anaerobic rods. REFERENCE 1 (bases 1 to 1365) AUTHORS Keshav,K.F., Yomano,L.P., An,H. and Ingram,L.O. TITLE Cloning of the Zymomonas mobilis structural gene encoding alcohol dehydrogenase I (adhA): Sequence comparison and expression in Escherichia coli JOURNAL J. Bacteriol. 172, 2491-2497 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by K. Keshav, 14-FEB-1990, for release after publication. FEATURES from to/span description pept 218 1231 alcohol dehydrogenase I (adhA) (EC 1.1.1.1) mRNA 108 > 1365 adhA mRNA (alt.) mRNA 137 > 1365 adhA mRNA (alt.) site 1261 1289 adhA transcription termination signal BASE COUNT 370 a 288 c 340 g 367 t ORIGIN 1 tttttgggcg tatttttatc aagaaaattt aaataatcac atttttatta ttttagattt 61 aagtattgat acaagtgata tctataaatg tttttataac tttctggatc gtaatcggct 121 ggcaatcgtt ttccctatat tcgcaagatg tatgtcagcc gcagattttt cgactgacct 181 ctatctctcc gagatatatc aacaaaaggt agtcaccatg aaagcagccg tcataactaa 241 agatcatacg atcgaagtga aagacaccaa attacgccct ctgaaatacg gggaagcgct 301 tttggaaatg gaatattgcg gggtatgtca taccgatctc catgtgaaaa acggggattt 361 cggcgatgaa accggcagaa ttaccggcca tgaaggtatc ggtatcgtca agcaggtcgg 421 ggaaggggtt acttctctga aagccggtga ccgcgccagt gttgcatggt tcttcaaagg 481 ctgcggccat tgcgaatatt gtgtcagcgg gaatgaaacg ctttgccgca acgttgaaaa 541 tgccggttat acggttgacg gcgctatggc agaagaatgc atcgtcgttg ccgattactc 601 ggtcaaggtg ccagatggtc ttgatcctgc ggttgccagc agcatcactt gcgcgggtgt 661 aaccacctat aaagcagtca aagtttctca gatacagccg ggacaatggc tggccatcta 721 tggcttgggc ggtttaggca atctagccct tcaatatgcc aagaatgttt tcaacgccaa 781 agtgatcgcg atcgatgtca atgatgaaca gctcgctttt gccaaagagt tgggcgcaga 841 tatggtcatc aatccgaaaa acgaagatgc tgccaaaatc attcaggaaa aagtcggcgg 901 cgcacatgcg acggtggtga cggctgttgc caaatccgcc tttaactcgg ctgttgaggc 961 tatccgcgcg ggtggccgtg ttgtcgccgt tggtctgcct cctgaaaaaa tggatttgag 1021 
cattcctcgt ttggtgcttg acggtatcga agtcctaggt tccttggtcg gaacgcggga 1081 agatttgaaa gaggccttcc agtttgcagc cgaaggtaag gtcaagccga aagttaccaa 1141 gcgtaaagtc gaagaaatca accaaatctt tgacgaaatg gaacatggta aattcacagg 1201 ccgtatggtt gttgatttta cccatcacta ggttttcgtg aaggcggaag cataaacgga 1261 aaaagccttt ctcttaccag aaaggctttt tctttgtcgt ctgataaaaa ttttcataca 1321 gaattttaac acggcactcg ttgctataag ccgctatcca agctt // LOCUS MUSIGKABG 349 bp ss-mRNA ROD 28-JUN-1990 DEFINITION Mouse Ig rearranged kappa-chain mRNA V32 region, 5' end. ACCESSION M33992 KEYWORDS immunoglobulin light-chain; kappa-immunoglobulin; processed gene; variable region. SOURCE Mouse (strain BXSB) adult spleen, cDNA to mRNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 349) AUTHORS D'Hoostelaere,L.A. and Klinman,D. TITLE Characterization of new mouse Vk groups JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by L.A.D'Hoostelaere, 04-MAY-1990. The frame shift between 178 and 184 could occur anywhere within those boundaries. 
Author address: L.A.D'Hoostelaere Grenzacherstrasse 487 Postfach 4005 Basel, Switzerland E-mail: IN%BASEL@UTOROCI.bitnet FEATURES from to/span description pept 1 178 Ig kappa-chain V32 region precursor 180 > 349 Ig kappa-chain V32 region precursor sigp 1 60 Ig kappa-chain V-region signal peptide matp 61 > 349 Ig kappa-chain V-region site 178 184 frame shift BASE COUNT 99 a 84 c 76 g 90 t ORIGIN 1 atgagggtcc ttgctgagct cctggggctg ctgctgttct gctttttagg tgtgagatgt 61 gacatccaga tgaaccagtc tccatccagt ctgtctgcat cccttggaga cacaataacc 121 atcacttgcc atgccagtca gaaaattaat gtttggttaa gctggtacca gcagaaaaaa 181 aaaaggaaat attcctaaac tattgatcta taggacttcc aacttgcaca caggcgtccc 241 atcaaggttc agtggcagtg gatcaggaac aggtttcaca ttaaccatca gcagcctgca 301 gcctgaagac attgccactt actactgtca acagggtcaa aattatccg // LOCUS MUSIGKABH 349 bp ss-mRNA ROD 28-JUN-1990 DEFINITION Mouse Ig rearranged kappa-chain mRNA V33 region, 5' end. ACCESSION M33993 KEYWORDS immunoglobulin light-chain; kappa-immunoglobulin; processed gene; variable region. SOURCE Mouse (strain BXSB) adult spleen, cDNA to mRNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 349) AUTHORS D'Hoostelaere,L.A. and Klinman,D. TITLE Characterization of new mouse Vk groups JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by L.A.D'Hoostelaere, 04-MAY-1990. The frame shift between 178 and 184 could occur anywhere within those boundaries. 
Author address: L.A.D'Hoostelaere Grenzacherstrasse 487 Postfach 4005 Basel, Switzerland E-mail: IN%BASEL@UTOROCI.bitnet FEATURES from to/span description pept 1 178 Ig kappa-chain V33-region precursor 180 > 348 Ig kappa-chain V33-region precursor sigp 1 60 Ig kappa-chain V-region signal peptide matp 61 > 348 Ig kappa-chain V-region BASE COUNT 99 a 84 c 76 g 90 t ORIGIN 1 atgagggtcc ttgctgagct cctggggctg ctgctgttct gctttttagg tgtgagatgt 61 gacatccaga tgaaccagtc tccatccagt ctgtctgcat cccttggaga cacaataacc 121 atcacttgcc atgccagtca gaaaattaat gtttggttaa gctggtacca gcagaaaaaa 181 aaaaggaaat attcctaaac tattgatcta taggacttcc aacttgcaca caggcgtccc 241 atcaaggttc agtggcagtg gatcaggaac aggtttcaca ttaaccatca gcagcctgca 301 gcctgaagac attgccactt actactgtca acagggtcaa aattatccg // LOCUS HUMCAIX 2785 bp ss-mRNA PRI 28-JUN-1990 DEFINITION Human carbonic anhydrase I (CAI) mRNA, complete cds. ACCESSION M33987 KEYWORDS carbonic anhydrase I. SOURCE Human EBV transformed SH B cell line DNA, and cDNA to mRNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 2785) AUTHORS Lowe,N., Brady,H.J.M., Barlow,J.H., Sowden,J.C., Edwards,M. and Butterworth,P.H.W. TITLE Structure and methylation patterns of the gene encoding human carbonic anhydrase I JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Since no intron sequences were provided this entry is treated as if originating from an mRNA. Draft entry and computer-readable sequence for [1] kindly submitted by N.Lowe, 09-MAY-1990. Author address: N.Lowe Department of Biochemistry University College London Gower Street, London WC1E 6BT, U.K. E-mail:UCBCMAR%EUCLID.UCL.AC.UK@CUNYVM.CUNY.EDU FEATURES from to/span description pept 1048 1833 carbonic anhydrase I (EC 4.2.1.1) /hgml_locus_uid="LS0141U" /nomgen="CA1" /map="8q13-q22" mRNA 902 1939 carbonic anhydrase I mRNA (alt.) 
mRNA 902 2165 carbonic anhydrase I mRNA (alt.) signal 874 879 TATA box BASE COUNT 840 a 603 c 492 g 850 t ORIGIN 1 ctttagccca acagtcaaaa ataattgatg ctaccctaca aatgtccaaa actctagtat 61 atcatatttc taagttacag caaatattag tcctgctaaa ccagggagct ttggcaaaaa 121 tgttttttga cagtaaattt gtccttgatt atatattaac tagtcaaaga ggtgtttgta 181 acattattag agcttcttgt tgtaggtggg ttaacaccac caatcaagag gtcattctaa 241 cagaaagcct ggatcagaaa accatcaccc taaaaaaaca tgccttacat atttaacaca 301 ctctgaaatc cagtcaaaat atgactaaag gcccttgcca tgactgatgt attctcctgg 361 ccaacgccaa acaaatggga gcctggttac gagtcagcct tcagggactt gtcacatttc 421 tacttggttt cttccttgtt attgtcataa taaaatgttt tctatgctgt ttagtgcaac 481 ttaggcccta ttctgtagaa gtctcctcta ctattcaggc cactcaaaca ccccaaataa 541 ttgagttcaa aatcgacatc aagatataaa ggaatcagtg actaaatata tttcatatat 601 ggtattttta ttgattattg tgctgtcttg acctagtatg gaggccttgg ctagaggctg 661 gtcagtttcc tctcttgagc agctgattaa atccacaccc caaccacttc ccttatcagg 721 ttctcacact ctggggccac tatgtaccca ctctaatcac cacagggcca gacatcagac 781 aattaaggac agcgcccatg ccccaaagcc cgccaaaatt atgcaaatta ttcaaaatta 841 ttcaacctag ctaaccccac cctttttgct gtacataagc tgcccattcc ccctccagcc 901 tgtggtaccc agtcctcagg tgcaaccccc tgcgtggtcc tctgtggcag ccttctctca 961 ttcagagctg ttttccacag aggtagtgaa aagaactgga ttttcaagtt cactttgcaa 1021 gagaaaaaga aaactcagta gaagataatg gcaagtccag actggggata tgatgacaaa 1081 aatggtcctg aacaatggag caagctgtat cccattgcca atggaaataa ccaatcccct 1141 gttgatatta aaaccagtga aaccaaacat gacacctctc tgaaacctat tagtgtctcc 1201 tacaacccag ccacagccaa agaaattatc aatgtggggc attctttcca tgtaaatttt 1261 gaggacaacg ataaccgatc agtgctgaaa ggtggtcctt tctctgacag ctacaggctc 1321 tttcagtttc attttcactg gggcagtaca aatgagcatg gttcagaaca tacagtggat 1381 ggagtcaaat attctgccga gcttcacgta gctcactgga attctgcaaa gtactccagc 1441 cttgctgaag ctgcctcaaa ggctgatggt ttggcagtta ttggtgtttt gatgaaggtt 1501 ggtgaggcca acccaaagct gcagaaagta cttgatgccc tccaagcaat taaaaccaag 1561 ggcaaacgag ccccattcac aaattttgac ccctctactc tccttccttc 
atccctggat 1621 ttctggacct accctggctc tctgactcat cctcctcttt atgagagtgt aacttggatc 1681 atctgtaagg agagcatcag tgtcagctca gagcagctgg cacaattccg cagccttcta 1741 tcaaatgttg aaggtgataa cgctgtcccc atgcagcaca acaaccgccc aacccaacct 1801 ctgaagggca gaacagtgag agcttcattt tgatgattct gagaagaaac ttgtccttcc 1861 tcaagaacac agccctgctt ctgacataat ccagttaaaa taataatttt taagaaataa 1921 atttatttca atattagcaa gacagcatgc cttcaaatca atctgtaaaa ctaagaaact 1981 taaattttag ttcttactgc ttaattcaaa taataattag taagctagca aatagtaatc 2041 tgtaagcata agcttatctt aaattcaagt ttagtttgag gaattcttta aaattacaac 2101 taagtgattt gtatgtctat ttttttcagt ttatttgaac caataaaata attttatctc 2161 tttctttctg ttgtgcattc agtttctaaa accattaagt ttctactcca tttacattca 2221 aaaatcttaa atactttact tgcaagagta ttttgcttca aatacaacaa cctaagagca 2281 gctggagatg aaatattggg aaattcattt gcttactcct gaagacaaaa atatagctga 2341 gatgaccact ggatttaata tcgttatgct ggcccaacat tgctaccatt tgtgttgtct 2401 gtgatcaaaa tgattatctt ttatatagga agatgacgct tctggatatt gctttcactt 2461 cttctcccca cgttagcaag gacaatgctt ctctgccatt attacaacta gttagtttgc 2521 atggagaatc tttactttaa aattggaaga aaagtcacaa gtgaatggtt tataaaaatg 2581 ctaaagaagt cattcttgct tagaatcata tagaaacatc atgcaatctt ttagtcagat 2641 gtgcgcttca ccttatgcta tttttatctt taattgacac acaataattg tacatgttta 2701 tggagtatag tgtggtgttt tctgtttgtt tgtttgtttt ttgagacaag gtctcactct 2761 gccagtcagg gtggagtgcg atggt // LOCUS HUMPDHBZ 1408 bp ss-mRNA PRI 28-JUN-1990 DEFINITION Human pyruvate dehydrogenase E1-beta subunit mRNA, 3' end. ACCESSION M34056 J05549 KEYWORDS pyruvate dehydrogenase E1-beta subunit. SOURCE Human liver, cDNA to mRNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 1408) AUTHORS Huh,T.L., Casazza,J.P., Huh,J.W., Chi,Y.T. and Song,B.J. 
TITLE Characterization of two cDNA clones for pyruvate dehydrogenase E1~subunit and its differential regulation in cultured fibroblasts JOURNAL J. Biol. Chem. (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by B.J.Song, 07-MAY-1990. Author address: B.J.Song Laboratory of Metabolism and Molecular Biology National Institute on Alcohol Abuse and Alcoholism 12501 Washington Ave. Rockville, MD 20852 E-mail: landsman@thebard.nlm.nih.gov FEATURES from to/span description pept < 1 1008 pyruvate dehydrogenase E1-beta subunit precursor /hgml_locus_uid="LU0223C" /nomgen="PDHB" /map="unassigned" (AA at 1) sigp 1 18 pyruvate dehydrogenase E1-beta subunit signal peptide matp 19 1005 pyruvate dehydrogenase E1-beta subunit signal 1397 1402 poly-A signal BASE COUNT 404 a 268 c 339 g 397 t ORIGIN Chromosome 3p13-3q23. 1 tggaccgcgc cggctgccgt gcaggtgaca gttcgtgatg ctataaatca gggtatggat 61 gaggagctgg aaagagatga gaaggtattt ctgcttggag aagaagttgc ccagtatgat 121 ggggcataca aggttagtcg agggctgtgg aagaaatatg gagacaagag gattattgac 181 actcccatat cagagatggg ctttgctgga attgctgtag gtgcagctat ggctgggttg 241 cggcccattt gtgaatttat gaccttcaat ttctccatgc aagccattga ccaggttata 301 aactcagctg ccaagaccta ctacatgtct ggtggccttc agcctgtgcc tatagtcttc 361 aggggaccca atggtgcctc agcaggtgta gctgcccagc actcacagtg ctttgctgcc 421 tggtatgggc actgcccagg cttaaaggtg gtcagtccct ggaattcaga ggatgctaaa 481 ggacttatta aatcagccat tcgggataac aatccagtgg tggtgctaga gaatgaattg 541 atgtatgggg ttccttttga atttcctccg gaagctcagt caaaagattt tctgattcct 601 attggaaaag ccaaaataga aaggcaagga acacatataa ctgtggtttc ccattcaaga 661 cctgtgggcc actgcttaga agctgcagca gtgctatcta aagaaggagt tgaatgtgag 721 gtgataaata tgcgtaccat tagaccaatg gacatggaaa ccatagaagc cagtgtcatg 781 aagacaaatc atcttgtaac tgtggaagga ggctggccac agtttggagt aggagctgaa 841 atctgtgcca ggatcatgga aggtcctgcg ttcaatttcc tggatgctcc tgctgttcgt 901 gtcactggtg ctgatgtccc tatgccttat gcaaagattc tagaggacaa ctctatacct 
961 caggtcaaag acatcatatt tgcaataaag aaaacattaa atatttagtt tggacttgaa 1021 tatcaagtcg ttgaaattta tttgaaatac ttgctggcac tgcacctgga tttgtactgc 1081 aagacctgac tattcataaa ggaaaacgat ttctaaagca acagcaggta tttttgtaca 1141 gggaagttta aatgtgtttg tgtatggaaa actctccact ctcctcccct agatgccatg 1201 cttccttttg tctgttacgg ttgccatgtt ctttgaataa caaattatat cacattttat 1261 cctctctcac cacaaggaca aagtatggat gtggcagagt cctgatgaaa gatgtatcca 1321 aacaagataa cttatatgta taaaattaaa gcatataata cacatttact gttagtttgt 1381 tttgataagg aataaaggaa tttctaac // LOCUS HUMPYRUV 1095 bp ss-mRNA PRI 28-JUN-1990 DEFINITION Human pyruvate dehydrogenase E1-beta subunit mRNA, complete cds. ACCESSION M34055 J05549 KEYWORDS pyruvate dehydrogenase E1-beta subunit. SOURCE Human liver, cDNA to mRNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 1095) AUTHORS Huh,T.L., Casazza,J.P., Huh,J.W., Chi,Y.T. and Song,B.J. TITLE Characterization of two cDNA clones for pyruvate dehydrogenase E1~subunit and its differential regulation in cultured fibroblasts JOURNAL J. Biol. Chem. (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by B.J.Song, 07-MAY-1990. Author address: B.J.Song Laboratory of Metabolism and Molecular Biology National Institute on Alcohol Abuse and Alcoholism 12501 Washington Ave. Rockville, MD 20852 E-mail: landsman@thebard.nlm.nih.gov FEATURES from to/span description pept 6 1085 pyruvate dehydrogenase E1-beta subunit precursor /hgml_locus_uid="LU0223C" /nomgen="PDHB" /map="unassigned" sigp 6 95 pyruvate dehydrogenase E1-beta subunit signal peptide matp 96 1082 pyruvate dehydrogenase E1-beta subunit mRNA < 1 1095 pyruvate dehydrogenase E1-beta subunit mRNA signal 1061 1066 poly-A signal BASE COUNT 288 a 221 c 302 g 284 t ORIGIN Chromosome 3p13-3q23. 
1 ccaagatggc ggcggtgtct ggcttggtgc ggagacccct tcgggaggtc tccgggctgc 61 tgaagaggcg ctttcactgg accgcgccgg ctgccgtgca ggtgacagtt cgtgatgcta 121 taaatcaggg tatggatgag gagctggaaa gagatgagaa ggtatttctg cttggagaag 181 aagttgccca gtatgatggg gcatacaagg ttagtcgagg gctgtggaag aaatatggag 241 acaagaggat tattgacact cccatatcag agatgggctt tgctggaatt gctgtaggtg 301 cagctatggc tgggttgcgg cccatttgtg aatttatgac cttcaatttc tccatgcaag 361 ccattgacca ggttataaac tcagctgcca agacctacta catgtctggt ggccttcagc 421 ctgtgcctat agtcttcagg ggacccaatg gtgcctcagc aggtgtagct gcccagcact 481 cacagtgctt tgctgcctgg tatgggcact gcccaggctt aaaggtggtc agtccctgga 541 attcagagga tgctaaagga cttattaaat cagccattcg ggataacaat ccagtggtgg 601 tgctagagaa tgaattgatg tatggggttc cttttgaatt tcctccggaa gctcagtcaa 661 aagattttct gattcctatt ggaaaagcca aaatagaaag gcaaggaaca catataactg 721 tggtttccca ttcaagacct gtgggccact gcttagaagc tgcagcagtg ctatctaaag 781 aaggagttga atgtgaggtg ataaatatgc gtaccattag accaatggac atggaaacca 841 tagaagccag tgtcatgaag acaaatcatc ttgtaactgt ggaaggaggc tggccacagt 901 ttggagtagg agctgaaatc tgtgccagga tcatggaagg tcctgcgttc aatttcctgg 961 atgctcctgc tgttcgtgtc actggtgctg atgtccctat gccttatgca aagattctag 1021 aggacaactc tatacctcag gtcaaagaca tcatatttgc aataaagaaa acattaaata 1081 tttagtttgg acttg // LOCUS ENTOMPX 664 bp ds-DNA BCT 28-JUN-1990 DEFINITION E.cloacae outer membrane protein X (OmpX) gene, complete cds. ACCESSION M33878 KEYWORDS outer membrane protein X. SOURCE E.cloacae (strain 2249-1) DNA. ORGANISM Enterobacter cloacae Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Facultatively anaerobic rods; Enterobacteriaceae. REFERENCE 1 (bases 1 to 664) AUTHORS Stoorvogel,J., van Bussel,M.J.A.W.M., Tommassen,J. and van de Klundert,J.A.M. TITLE Molecular characterization of an Enterobacter cloacae outer membrane protein OmpX JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable [or printed] sequence for [1] kindly submitted by J.Stoorvogel, 26-APR-1990. 
Author address: J.Stoorvogel Dept of Medical Microbiol, Univ Hospital, P.O.Box 9600, 2300 RC Leiden, THE NETHERLANDS FEATURES from to/span description pept 83 601 outer membrane protein X precursor sigp 83 151 outer membrane protein X signal peptide matp 152 598 outer membrane protein X signal 18 25 -35 region signal 43 48 -10 region binding 74 77 ribosome binding site signal 620 649 transcription termination signal BASE COUNT 171 a 156 c 162 g 175 t ORIGIN 1 atcgtgaatg acaaagagtt tcacagattt ttttgatata tttaaaactt acggacttac 61 ttgaagcaca tttgaggtgg ttatgaaaaa aattgcatgt ctttcagcac tggcagctgt 121 tctggctgtt tccgcaggta ccgctgtagc ggcaacttct actgtaactg gtggttacgc 181 tcagagcgat atgcagggcg tgatgaacaa aaccaacggt ttcaacctga agtaccgtta 241 cgagcaagac aacaacccgc tgggtgtgat cggttctttc acttacaccg agaaagatcg 301 tactgaaaat ggctcttaca ataaaggtca gtactacggc atcaccgcgg gtcctgctta 361 ccgcctgaat gactgggcaa gcatctacgg tgttgtaggt gttggctacg gtaaattcca 421 gcagaccgaa aaccagggtc tgaaccgtac tgcaagcaac agcgactacg gtttctccta 481 tggcgcaggt atgcagttca acccaatcga aaacgttgct ctggacttct cctatgagca 541 gagccgtatc cgcaacgttg acgttggcac ctggatcgcg ggcgtaggtt accgcttcta 601 atcacttcgg tgagccagta aaaaatccgc ccaatgtggc ggattttttt tgtttgtacc 661 ctct // LOCUS ACARGA 1212 bp ds-DNA INV 28-JUN-1990 DEFINITION A.castellanii 5S RNA. ACCESSION M34003 KEYWORDS 5S RNA. SOURCE A.castellanii (strain Neff) trophozoite DNA, clone pAc5S.1. ORGANISM Acanthamoeba castellanii Eukaryota; Animalia; Protozoa; Sarcomastigophora; Sarcodina; Rhizopoda; Lobosa; Gymnamoeba; Amoebida; Acanthopodina; Acanthamoebidae. REFERENCE 1 (bases 1 to 1212) AUTHORS Zwick,M.G., Wiggs,M. and Paule,M.R. TITLE Cloning, sequencing and genomic organization of a 5S RNA gene from the eukaryotic protist Acanthamoeba castellanii JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable [or printed] sequence for [1] kindly submitted by M.R.Paule, 04-MAY-1990. 
Author address: M.R.Paule Colorado State University, Dept. Biochemistry, Fort Collins, CO 80523 FEATURES from to/span description RNA 489 610 5S RNA precursor RNA 489 607 5S RNA BASE COUNT 195 a 374 c 329 g 314 t ORIGIN 1 ctgcaggaga ttcgtcgggc tcgtgccgcg atcgtcttga tgatcgagac cgacggcccg 61 ttcctcactc cgcgcacctt gcgccttgcc cgcgccgcaa cgagccgtct tacctcaccc 121 acgtcctaca gagtacttac atcctccaat gtgttgtgtg cgtgtcatgt gcccaatgta 181 tgtggtgtgt gcggtgtgtg cggtgtgtga ggtgtgtggt gtgcgcgtgt catgtctgtt 241 acacgttctg ttgaccaaca ttttttatgg tgtgtacagc ggtcgccgac tgcaccggac 301 gatcggtaga ggaggtggcc ggtgaaacca cgcagaccgc ggtgcgcttc ttcggtctgg 361 accaaatccg taaggagtga atgtagcaca cccctcccct ccatgtcatt gaagcccaag 421 ttagtgtgca atttaaatga tggtgtgacc aagagggatc acatgggcga agtccctggt 481 aggctgatgg atacggccat actgcgcaga aagcaccgct tcccatccga acagcgaagt 541 taagctgcgc caggcggtgt tagtactggg gtgggcgacc acccgggaat ccaccgtgcc 601 gtatcctcct ttttgcttct tcactctcgc cactattgct tcaggtcgcc tcatcgcctg 661 ccacagctga ctacaaactg ggtgagccaa acctgccttt tgcactggtc cagtgtacaa 721 ggctggaaca actttgtttc acacttgctg aggtcaatca atccagcctt gttggttgag 781 gccacaggag ggatctgcca cctttcacac catctcgtcg agaaggctgc ttgacgcttc 841 ttcgctgtcc ttcggggctg atgtctcgac tgtgtgtccg tcttcatcgc gattgtcact 901 tgcgttgttg ttgttgcggc tttcgtccct gtcctcatcg ccgtcgtcgt cttcatcgct 961 gttgtcgcgc gactcactcc gcaaaccgtc tggccgtgtg ctgctcctcc tcttcatcgt 1021 tgctgtcatc tcggtcgctg cggccacgcc tcgaagaaga cagcccgccc tcttcggccg 1081 cctcggtccc gcccctcgca aagcggatct cgcctcgtcg gctgccgctg gccgtcgctc 1141 cgcctggcgt cgttgtcttc cctcgcatcg tcctcaaact tgtcgtgcgt ccacttgtcg 1201 gcaaatgtcg ac // LOCUS HUMCDR34 2412 bp ds-DNA PRI 28-JUN-1990 DEFINITION Human cerebellar-degeneration-related antigen (CDR34) gene, complete cds. ACCESSION M31423 M34243 KEYWORDS cerebellar-degeneration-related antigen. SOURCE Human neuroblastoma BE(2)-88n cell line DNA, clone lambda CDR34. 
ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 2412) AUTHORS Chen,Y.-T., Rettig,W.J., Yenamandra,A.K., Kozak,C.A., Chaganti,R.S.K., Posner,J.B. and Old,L.J. TITLE Cerebellar degeneration-related antigen: A highly conserved neuroectodermal marker mapped to chromosomes X in human and mouse JOURNAL Proc. Natl. Acad. Sci. U.S.A. 87, 3077-3081 (1990) STANDARD full staff_review COMMENT Draft entry and printed sequence for [1] kindly submitted by Y.-T.Chen, 17-JAN-1990 FEATURES from to/span description pept 503 1174 cerebellar-degeneration-related antigen (CDR34) BASE COUNT 743 a 334 c 669 g 666 t ORIGIN 1 atgttggttc ataagatctg gtctataagg aggaatgtcc cattaaatgt ttttgaagct 61 aattcaacta gaagcagaaa tagttgagtt ggaagatttt ctgtagagtg attttaacat 121 gggaaggctc agacagggga agcctagatt tgaaaaggcc tggacctggg gaaaggctgg 181 caagatctgg actatagaac atgttagaat actgatattc gcagacacct ggaagactga 241 atgtcagaag atcagcacac tggagacgtt ggaagacatg gatattgagc cagttgatgg 301 aagactgggt agttgttgga agacatcaag gtgctggaag acacagcagc atgctggaag 361 acctggagat gttggaagac gagcagactc ctggaagccc tggagatgct gcaagacctg 421 gagatatagg aagacactgg actttgttgc gagcttagtt ggaagacata tatttttgga 481 agacgtggat tttctggaag acatggcttg gttggaagac gtggattttc tggaagacgt 541 acctttgttg gaagacatac ctttgttgga agacgtacct ttgttggaag acgtaccttt 601 gttggaagac acaagtaggc tggaagacat taatttgatg gaagacatgg ctttgttgga 661 agacgtggat ttgctggaag acacggattt cctggaagac ctggattttt cggaagctat 721 ggatttgagg gaagacaagg attttctgga agacatggat agtctggaag acatggcttt 781 gttggaagac gtggacttgc tggaagacac ggatttcctg gaagacccgg attttttgga 841 agctatagat ttaagggaag acaaggattt tctggaagac atggatagtc tggaagacct 901 gaggccattg gaagatgtgg attttctgga agacatggct tttttggaag acgtagattt 961 tcaggaagac ccaaattatc cggaagactt ggattgttgg gaagacgtgg attttctgga 1021 agactggagg ttactggaag acatggattt tctggaagac atggattttc tggaagacgt 1081 
ggatcttcag gaagacatat attggctgga agacctggat tttttccgga agatgtggat 1141 tgactggaag acctggattt ggtggaagac gtagattttc tggaagacac tgactgactg 1201 gaagacactg attgactgga agacctggat ttctttctgg aagacactga ttgactggaa 1261 gatctagatt tttctggaag aactagattt actggaagac ttggatttgg tggaagacac 1321 agatttttct ggaagacatg gattagctgg aagatctgta tttgatggaa gaccttgaaa 1381 ttattggaag acatggattt cctggaagac gtggattttc ctggaagatc tggatttggt 1441 ggaagaccag taattgctgg aagactggat ttgctggaag acttgattta ctggaagact 1501 tggagcttct tggaagacat ggattgtccg gaagacatgg attgtctgga agatgtggat 1561 tttctggaag ctcaggatta tctggaagac cttgagatta ttggaacact tgaagtcgct 1621 ggaagacccg agttgttgga agaccttgta cacaggtgcc atcggaactc ctgacattga 1681 aacattgtaa gcacaggata ttgagacatt gcaagccttg attttaagac atggtactct 1741 ggacattgat atttctgagg ccctgaacat tgggatatta atattggaag tcatagacac 1801 tgaaatctct ggaaattaga gatattgtaa gtcctgtacc ttggaactcc taaatactgg 1861 cagatataaa caacagcaga tgtagacatt tataaatcct aaaatgagaa gccctggata 1921 ttgggagaca ttggtaagca tggatacttg acatatttat gtcaaaaaga cagtttggaa 1981 gaattaaatt ttaaagatgc tccatgtcaa gaatactggc agcctggaca atatgagacc 2041 aggatattaa gaggtctatt cattcagaca ttgaggatat tgatgtacct gaaagttctt 2101 gcaggtattt aaagacttga gcattggagg aattggcgat aaaaatacac tgtaaaacta 2161 gaaagtagga gacatttaaa aatgtaaaaa ctgaatgatg taagtgctgg aagacattga 2221 agaatctaga agacctgtat ataggagaca ttggaggatt aggaccatgg ccgacttgta 2281 atttagaact ctggattctg aaagacaaga cctggacttt gaagaagggt tgttggagat 2341 attagaagac ctaaattttt aatgacttga atactgggag tttagaaaac aagggcattt 2401 gagatgctgc ag // LOCUS RATHGF 2485 bp ss-mRNA ROD 28-JUN-1990 DEFINITION Rat hepatocyte growth factor mRNA, complete cds. ACCESSION M32987 KEYWORDS hepatocyte growth factor. SOURCE Rat (strain Wistar) adult liver, clones RBC[1,3] and RAC[1,2]. ORGANISM Rattus norvegicus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. 
REFERENCE 1 (bases 1 to 2485) AUTHORS Tashiro,K., Hagiya,M., Nishizawa,T., Seki,T., Shimonishi,M., Shimizu,S. and Nakamura,T. TITLE Deduced primary structure of rat hepatocyte growth factor and expression of the mRNA in rat tissues JOURNAL Proc. Natl. Acad. Sci. U.S.A. 87, 3200-3204 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable [or printed] sequence for [1] kindly submitted by T.Nakamura, 26-MAR-1990. FEATURES from to/span description pept 143 2329 hepatocyte growth factor BASE COUNT 750 a 533 c 577 g 625 t ORIGIN 1 gtttagtcct agatctttcc agttaatcac acaacaaact tagctcatcg caataaaagc 61 agctcagaac cgaccggctt gcaacaggat tctttcagcc cggcatctcc tgcagaggga 121 tcagcctgct cgaactgcaa gcatgatgtg ggggaccaaa cttctgccgg tcctgttgct 181 gcagcatgtc ctgctgcacc tcctcctgct tcctgtcacc atcccctatg cagaaggaca 241 gaagaagaga agaaatactc ttcatgaatt caaaaagtca gcaaaaacta ctcttaccaa 301 ggaagaccca ttagtgaaga ttaaaaccaa aaaagtgaac tctgcagatg agtgtgccaa 361 caggtgcatc agaaacaagg gctttccatt cacttgcaag gcctttgttt ttgataagtc 421 gagaaaacga tgctactggt atcctttcaa tagtatgtca agtggagtga aaaaagggtt 481 tggccatgaa tttgacctct atgaaaacaa agactatatt agaaattgca tcattggtaa 541 aggaggcagc tataagggga cagtatccat cactaagagt ggcatcaagt gccagccttg 601 gaattccatg atcccccatg aacacagctt tttgccttcg agctatcgcg gtaaagacct 661 acaggaaaac tactgtcgaa atcctcgagg ggaagaaggg ggaccctggt gtttcacaag 721 caatccagag gtacgctacg aagtctgtga cattcctcag tgttcagaag ttgaatgcat 781 gacctgcaac ggtgaaagct acagaggtcc catggatcac acagaatcag gcaagacatg 841 tcagcgctgg gatcagcaga caccacaccg gcacaaattc ttgccggaaa gatatcccga 901 caagggcttt gatgataatt attgccgcaa tcccgatggc aagccgaggc catggtgcta 961 cactcttgac cctgacaccc cttgggagta ttgtgcaatt aaaatgtgcg ctcacagtgc 1021 tgtgaatgag actgatgttc ccatggaaac aactgaatgt ataaaaggcc aaggagaagg 1081 ttacagggga accaccaata ccatttggaa tggaattccg tgtcagcgtt gggattcgca 1141 gtacccccac aagcatgaca tcactcccga gaacttcaaa tgcaaggacc ttagagaaaa 1201 ttattgccgc aatccggatg gggctgaatc accatggtgt 
tttaccactg atccaaacat 1261 ccgagttggt tactgctctc aaattcccaa atgtgacgtg tcaagtggac aagattgtta 1321 tcgtggcaat gggaaaaact acatgggcaa cttatccaaa acaaggtctg gactcacatg 1381 ttccatgtgg gacaagaata tggaggattt acaccgtcat atcttctggg agccagacgc 1441 tagcaagttg actaagaatt actgccggaa ccccgatgac gacgcccatg gaccttggtg 1501 ctacacaggg aatcctctcg ttccttggga ttattgccct atttcccgtt gtgaaggaga 1561 tactacacct acaattgtca atttggacca tcctgtaata tcctgtgcca aaacaaaaca 1621 actgcgagtt gtaaatggca ttccaacaca aacaacagta gggtggatgg ttagtttgaa 1681 atacaggaat aaacacatct gtgggggatc attgataaag gaaagttggg ttcttactgc 1741 aaggcaatgt tttccagcta gaaacaaaga cttgaaagac tatgaagctt ggcttggaat 1801 ccatgatgtc catgagagag gcgaggagaa acgcaaacag atcttaaaca tttcccagct 1861 agtctatgga cctgaaggct cagatttggt tttactgaag cttgctcgcc ctgcaatcct 1921 ggataacttt gtcagtacaa ttgatttacc tagttatggc tgtacaatcc ctgaaaagac 1981 tacttgcagt atttacggct ggggctacac tggattgatc aacgcagatg gtttattacg 2041 agtagctcat ctgtatatta tggggaatga gaaatgcagt cagcaccatc aaggcaaggt 2101 gactttgaat gagtctgaat tatgtgctgg ggctgaaaag attggatcag gaccttgtga 2161 gggagattat ggtggcccac tcatttgtga acaacacaaa atgagaatgg ttcttggtgt 2221 cattgttcct ggtcgtggat gtgccatccc aaatcgtcct ggtatttttg ttcgagtagc 2281 atattatgca aaatggatac acaaagtaat tttgacatac aagttgtaat agccatagaa 2341 gaggccagtg tatttgaagc atccatggat acaggaagat ttccaagact tcaggattaa 2401 aatgtcacct aaaacaatcc taaaacaact acttgagtgt tgtgagtgtt cagatactca 2461 ttaatatatg tggcgttttc tgttg // LOCUS DDIDPYK1A 1090 bp ss-mRNA INV 28-JUN-1990 DEFINITION D.discoideum protein-tyrosine kinase-1 (DPYK1) mRNA, complete cds. ACCESSION M33785 KEYWORDS protein-tyrosine kinase-1. SOURCE D.discoideum (strain AX-3) 4-hour, cDNA to mRNA. ORGANISM Dictyostelium discoideum Eukaryota; Animalia; Protozoa; Sarcomastigophora; Sarcodina; Rhizopoda; Eumycetozoa; Dictyostelia; Dictyosteliida; Dictyosteliidae. REFERENCE 1 (bases 1 to 1090) AUTHORS Tan,J.L. and Spudich,J.A. 
TITLE Developmentally regulated protein-tyrosine kinase genes in Dictyostelium JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by J.L.Tan, 20-APR-1990. Author address: J.L.Tan Stanford University School of Medicine Dept. of Cell Biology Stanford, CA 94303 (415) 723-7408 FEATURES from to/span description pept < 1 1014 protein-tyrosine kinase-1 (DPYK1; AA at 1) BASE COUNT 371 a 198 c 186 g 335 t ORIGIN 1 cgcccatttg gtggttggga aactcaatca tcattatcac atccaccatc acgtccacca 61 ccacctccac caccaccacc acaactacca gttagatcag aatacgagat tgatttcaat 121 gaattagaat ttggtcaaac cattggtaaa ggtttctttg gtgaagtaaa gagaggttat 181 tggagagaga ctgatgttgc cataaaaatc atctatcgtg atcaattcaa aaccaaatca 241 tcattggtta tgtttcaaaa tgaagttgga atactaagta aattaagaca tccaaatgta 301 gttcaatttt tgggtgcatg tactgcagga ggtgaagatc atcattgtat agtaacagaa 361 tggatgggtg gaggtagttt aagacagttc ttgactgatc atttcaattt actcgaacaa 421 aatccacata ttcgtttgaa gttggctttg gatattgcaa aaggaatgaa ttatctacat 481 ggttggactc cacccattct tcatcgtgac ttatcctcaa gaaacatttt attggatcac 541 aacatcgatc caaagaatcc gttagtttcc tcaagacaag atattaaatg taagatctct 601 gattttggtc taagtagatt aaagaaggaa caagcctctc aaatgactca atcggttggt 661 tgtattccct acatggcacc agaggttttc aaaggcgata gtaatagtga aaagagtgat 721 gtttactcct atggcatggt tttgtttgaa ctattaacct ctgatgaacc tcaacaagat 781 atgaaaccaa tgaaaatggc tcacttggct gcttatgaat cttatcgtcc tccaattcca 841 ttaactacct cttccaagtg gaaagaaatt ctaactcaat gttgggattc taatcctgat 901 agtcgtccaa cctttaaaca aatcattgtt catctcaaag aaatggaaga tcaaggtgta 961 tcttcttttg catctgtacc tgttcaaact attgatactg gtgtttatgc ttaatttttt 1021 ttttataatt aaaaaaaaaa aaaacaaaac aaaaaaaaaa aataataata aatataatca 1081 cttcaactcg // LOCUS DDIDPYK2A 1292 bp ss-mRNA INV 28-JUN-1990 DEFINITION D.discoideum protein-tyrosine kinase-2 (DPYK2) mRNA, complete cds. ACCESSION M33784 KEYWORDS protein-tyrosine kinase-2. SOURCE D.discoideum (strain AX-3) 4-hour, cDNA to mRNA. 
ORGANISM Dictyostelium discoideum Eukaryota; Animalia; Protozoa; Sarcomastigophora; Sarcodina; Rhizopoda; Eumycetozoa; Dictyostelia; Dictyosteliida; Dictyosteliidae. REFERENCE 1 (bases 1 to 1292) AUTHORS Tan,J.L. and Spudich,J.A. TITLE Developmentally regulated protein-tyrosine kinase genes in Dictyostelium JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by J.L.Tan, 20-APR-1990. Author address: J.L.Tan Stanford University School of Medicine Dept. of Cell Biology Stanford, CA 94303 (415) 723-7408 FEATURES from to/span description pept < 1 1234 protein-tyrosine kinase-2 (DPYK2; AA at 2) BASE COUNT 491 a 207 c 204 g 390 t ORIGIN 1 cgattctaca agtacaacaa actctactaa agatatcaca tttttagttt gtgataatcc 61 tgattcaact aaagaaaaga gtaacgtttc aaatacttca tcaataattt ccgcttcaaa 121 tttaaataga catataacac caaattctca tatgagacct agaggtagat caatttctga 181 atctttaatt atgtcaccaa ttaataaaga atctttaaat gatattcaaa gagcaattga 241 aagtgaaaaa ataaagaaaa ctaaatttga agaattaaaa tcaatattgg gcgaaagaga 301 atatataatt gatataaatg atattcaatt tatacaaaaa gttggagaag gtgcattcag 361 tgaagtttgg gaaggttggt ggaaaggtat tcatgttgcc ataaaaaagt taaagattat 421 aggagatgaa gaacaattca aagagagatt cattagagag gttcaaaatt tgaaaaaagg 481 aaatcatcaa aacattgtca tgtttattgg tgcatgttat aaaccagcat gtatcataac 541 agagtatatg gcaggtggta gtctttacaa tatacttcat aatccaaata gttccactcc 601 aaaagttaaa tattctttcc cattggtttt gaaaatggca accgacatgg cattgggctt 661 attacatctt cattccatca ccattgtgca tcgtgattta accagtcaaa acattctatt 721 ggatgaattg ggtaatataa agatctctga ttttggttta tctgctgaaa agagtagaga 781 aggttcaatg acaatgacaa atggtggcat ttgcaatcca agatggagac cacccgaatt 841 gacaaagaat ttaggtcact actcggaaaa ggttgatgtc tattgtttct ctctagtagt 901 ttgggaaatt ttaactggcg aaattccttt ctctgattta gatggatctc aacgatccgc 961 tcaagtagct tatgctggtt taagaccacc aataccagag tattgcgatc ctgaattaaa 1021 attactctta actcaatgtt gggaggctga tccaaatgat agacctccct ttacctatat 1081 agtaaacaaa 
ttaaaagaaa tctcttggaa taatccaatt ggtttcgtct ctgatcaatt 1141 ctatcaatat agcgaacctt caactccaag attagcatta tcaaatcaat cttcaaattc 1201 aagtagtatt tctttatcac caactaaatt ataaaaaaaa aaaaaaaaaa aaacaaattt 1261 caaacaccaa acaccaccac tcatcaaaat cg // LOCUS MUSH2A1X 929 bp ds-DNA ROD 28-JUN-1990 DEFINITION Mouse histone H2A.1 gene, complete cds. ACCESSION M33988 KEYWORDS histone H2A.1. SOURCE Mouse (BALB/c) DNA, clone 53. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 929) AUTHORS Gruber,A., Streit,A., Reist,M., Benninger,P., Böhni,R. and Schümperli,D. TITLE Structure of a mouse histone gene cluster JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable [or printed] sequence for [1] kindly submitted by D.Schümperli, 02-MAY-1990. Author address: Dr. Schümperli Zoologisches Institut, Baltzerstrasse 4 CH-3012 Bern SWITZERLAND FEATURES from to/span description pept 164 556 histone H2A.1 mRNA 119 624 histone H2A.1 mRNA BASE COUNT 238 a 247 c 222 g 222 t ORIGIN 1 gagctcaaat tctggctttc tattgggtac gatatattaa ccaatgggag aaacacaaac 61 agaatacctc cagttagtat aaatgcttgc tgttcagttg cagaatttac tatatattct 121 tttcctttct ctgctttgcc tttactgata cttaaacgca tacatgtctg gacgcggaaa 181 gcaagggtgc aaggcccgcg ctaaggccaa gacccgctcc tcccgggccg gcctgcagtt 241 ccccgtgggc cgcgtgcacc ggctgctccg caagggcaac tactcggagc gcgtgggcgc 301 cggcgccccg gtgtacctgg cggctgtgct ggagtacctg acggccgaga tcctggagct 361 ggcgggcaat gcggcccgcg acaacaagaa gacgcgcatc atcccgcgcc acctgcagct 421 ggccatccgc aacgacgagg agctcaacaa gctgctgggc cgcgtgacca tcgcgcaggg 481 cggcgtcctg cccaacatcc aggccgtgct gctgcccaag aagaccgaga gccaccacaa 541 ggccaagggg aagtaatctg gcgattgtct gtactgccca gttgaaagtt aaccaaaaca 601 aaggctcttt tcagagccac ccacatcttt ccataaaatg agctgccacc tcgtgaaacg 661 ttcttccact acagttttta tactacatat gaaaaagtta cgaagtagct ttcaatctta 721 gtaaattgat tttaatactg ttagtccctg 
cgataaatct tacgaccttc cttagtttga 781 gtcaaaagtg tgtaagagat gaaaccttta gaacatacta taaattttta gtagaaattt 841 ggcacccagg tttgtcattc acgtcacgat tgtctagagc ataatggtag taagggctaa 901 gggccattaa atcccacttc catagtttc // LOCUS MUSH2A1XX 693 bp ds-DNA ROD 28-JUN-1990 DEFINITION Mouse histone H3.2 gene, complete cds. ACCESSION M33989 KEYWORDS histone H3.2. SOURCE Mouse (strain BALB/c) DNA, clone 53. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 693) AUTHORS Gruber,A., Streit,A., Reist,M., Benninger,P., Böhni,R. and Schümperli,D. TITLE Structure of a mouse histone gene cluster JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable [or printed] sequence for [1] kindly submitted by D.Schümperli, 02-MAY-1990. Author address: D.Schümperli Zoologisches Institut, Baltzerstrasse 4 CH-3012 Bern SWITZERLAND FEATURES from to/span description pept 151 561 histone H3.2 mRNA 116 611 histone H3.2 mRNA BASE COUNT 159 a 206 c 184 g 144 t ORIGIN 1 caggatttag aagcagaggc tgaccaatcc caacaaagcg cgggcccttt gaatgttctt 61 cggtccaata gcggatagtc tgattgtata aaaggtggac agcgccttgc agctcactat 121 agtgtcagtc tattttccct tgttaaagtc atggctcgta ctaagcagac cgctcgtaag 181 tccaccggtg gcaaggcccc gcgtaagcag ctggccacca aggccgcccg caagagcgcc 241 ccggccaccg gcggcgtgaa gaagcctcac cgctaccgtc ccggcaccgt ggcgctgcgc 301 gagatccggc gctaccagaa gtcgaccgag ctgctgatcc gcaagctgcc gttccagcgc 361 ctggtgcgcg agatcgcgca ggacttcaag accgacctgc gcttccagag ctcggctgtc 421 atggctctgc aggaggcgag cgaggcctac cttgtgggtc tgtttgagga caccaacctg 481 tgcgccatcc acgccaagcg tgtcaccatc atgcccaagg acatccagct cgcccgccgc 541 attcgtgggg agagggctta aacgaaacaa agtgcaaact ggaacccaaa ggctcttttc 601 agagccaccc acagtttctg taaaagcagc tgttactttt ctactgtctt ggaaactttg 661 tttttcaaag atttatctag ttacaaagta tac // LOCUS NEUPSAS 3681 bp ds-DNA PLN 28-JUN-1990 DEFINITION N.crassa arginine-specific carbamoyl 
phosphate synthetase (ARG-2) gene, complete cds. ACCESSION J05512 KEYWORDS carbamoyl phosphate synthetase. SOURCE N.crassa (strain 74-OR23-IVA) asexual phase DNA, clone pAR231, and cDNA to mRNA, clone pAB1. ORGANISM Neurospora crassa Eukaryota; Plantae; Thallobionta; Eumycota; Ascomycotina; Pyrenomycetes; Sordariales; Sordariaceae. REFERENCE 1 (bases 1 to 3681) AUTHORS Orbach,M.J., Sachs,M.S. and Yanofsky,C. TITLE The Neurospora crassa arg-2 locus: Structure and expression of the gene encoding the small subunit of arginine-specific carbamoyl phosphate synthetase JOURNAL J. Biol. Chem. (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable [or printed] sequence for [1] kindly submitted by M.S.Sachs, 07-MAY-1990. FEATURES from to/span description pept 1451 1630 arginine-specific carbamoyl phosphate synthetase, exon 1 (ARG-2) (EC 6.4.5.5) 1687 2868 arginine-specific carbamoyl phosphate synthetase, exon 2 pept 870 902 ORF, exon 1 1270 1282 ORF, exon 2 1359 1387 ORF, exon 3 pre-msg 752 3082 ARG-2 mRNA and introns pre-msg 809 3082 ARG-2 mRNA and introns (alt.) 
IVS 1631 1686 ARG-2 intron A IVS 903 1269 ORF intron A IVS 1283 1358 ORF intron B signal 161 166 TGACTC box signal 568 573 TGACTC box signal 1008 1013 TGACTC box signal 1307 1312 TGACTC box BASE COUNT 760 a 1093 c 863 g 965 t ORIGIN 1 ggatccagac gctatatttg attgttccct cttctgtcag cgtgcggcct ccgttgtccg 61 tgcagccgat tacgacggcc ggctctggcg caacctgggt ttgggagctt cagaagcttg 121 aaagcgaggc gaatcgcgac tgtacaagtc cccgcaactt gagtcaccga aagcgcgggg 181 tatccaccat ctttatctgc ggcggtaggt aggcggggac tttgcggggg ctttgcggga 241 cccgaccact ccaactccga gattcgttgg gggtgccccg ggcaggagat gccgagctgg 301 agtcgcgttg tttgtgaatg tcgtggctgg ggttgtgggc ataggttgcg atgtgtagtg 361 tacgtagtac atagtaggta tgttgaacac cggatgggtg attccagttg tgaggctccg 421 ggactagtag gggtggtaac cgtgatggtc aaagggcatg tggaacttgc agcgtcccaa 481 cgaagtccag gtccccagtc atcccaacgc cgcgatcaat cacccctcgg tccccgtaaa 541 tcggcctttg ggactgccta gcagcgggag tcaccgagtc ctggaccaat caggtcggcg 601 ctgaccctgt cactgcggca gaaaaaagac aaagagctca aaaaattcta ccagatccaa 661 tcaaatgtcg ccctttttct atttgcctca cccactttcg caccctcttt tatctccaag 721 tttctcccct cgacactaca actcactccg tcatcaactc ttgcccatct acaaaacaca 781 caataacccc aatcgcttcc cccgactttc ggactctttc gtccaacctt ctttcttcag 841 ctgaacttgt cttgtcgcaa tctgccctta tgaacgggcg cccgtcagtc ttcacctctc 901 aggtacgata acgcgatagc ccctccatcg tgctggtatc cagccaccat ccacccagtc 961 catcaaagaa gttcccccgg atgagcaatg aagcccccat accagcctga ctcgcgatgc 1021 tgtatcggag tagttgggac gacccgccag taacccacca acctgggtgc ttggcatcat 1081 gccgctctga gttggttttg ggagatgtgg ccagcggtga tccccaaagt tcagctcttc 1141 acggattccg attgcatatc tctcaagatc gcgcagacag ccatcctggg ggagacttct 1201 tctcctcgct tctgccgctt gccgtctgtt tgtagtgtct gatcaaaact aattctgttg 1261 cctttgtagg attacctctc aggtccgcat ccccgccgac tcccgttgac tcaccctttt 1321 ccggctggaa tactgacgat gaattttgtg tgttacagac catctgtgga gagcccttaa 1381 cgcataagag cctctcatca cccagcagcc gtaccaatca ccaccgcacc ccatcaccat 1441 tcaagtcaag atgttctctc gcttggccgc tcgtctcccc aaggcttctg ccctcaatgg 1501 
cgtcgccgcc cgtcaggttc gcaacctgag ccagcccgcc attaccggca gcaagggcag 1561 gaacatgccc gcccgtgagc cgcgcactac tgccgctgcc accggcgccg aggccacctt 1621 cactatcagg gtgcgtacct acaactagtc ttctctcgtg gctaaaccta acattttctc 1681 gctcaggatg gccccgtttt ccagggtacc gccttcggtg ccaacaccaa catttctggt 1741 gaagccgttt tcaccacctc ccttgttggt taccccgagt cgatgaccga tccctcgtac 1801 cgcggccaga ttttggtctt cacccagccc ttgattggca actacggtgt cccttcgaat 1861 gagcgtgacg agttcaacct cctaagtact tcgagtctcc ccacatccag tgcgccggta 1921 tcgttgtctc cgatgtcgct acccagtaca gccactggac tgctgttcag agcttgggcg 1981 agtggtgcgc cagcgagggc attcccgcca ctctccggtg ttgacactcg tgcgattgtc 2041 acctacctcc gtgagcaggg ttcctctctc gccaggatct ccattggcga cgagtacgac 2101 gccgatgagg atgagggctt cattgaccct ggccagatca accttgtcaa gcgcgtgtcc 2161 accaaggctc cctttgtggt taccaacccc aacgccaagt tccacgtcgc tctgatcgac 2221 tgcggtgtca aggagaacat ccttcgtagc ttggtcagcc gtggcgcctc cgtcaccgtc 2281 ttcccctaca attaccccat ccacaaggtt gccgagaact tcgatggtgt cttcatttcc 2341 aacggccctg gtgatcccac ccactgccag gagactgttt acaacctcgc caagcttatg 2401 gagacttccc ctattcccat catgggtatc tgccttggtc accagcttct tgctctcgct 2461 gttggtgcca agaccatcaa gctaagtatg gtaatccgtg cccacaacat cccggctctt 2521 gacttgacca ctggtcagtg ccacatcacc tcgcagaacc acggttacgc cgtcgatatc 2581 agcaccctcc ctagcgactt caaggagtat ttcgtcaacc ttaacgacgg ctccaacgag 2641 ggtatgatgc acaagacccg ccccatcttc tctacccagt tccaccccga ggctaagggt 2701 ggtcccatgg actcttccta cctctttgac aagtacatgg agaatgtcga gcttttcaag 2761 agcaactccc aggtctaccg tgacaacagg cctactcagt tcatgattga cattctcagc 2821 aaggagcgtg tcggcgttga gcccactcct ctttctaacg ctgcttaacg gacgagaatg 2881 tctttgcatt ttcttagtga acggtgtttg tcattggtct tgggttattg ttttccagca 2941 caaaggatat gggttttgta aaagggactt tcttgggcga cgtcctctcg ttattaatgg 3001 tttagggtgt atatgatatc ttttgttact acttttgaat gcagaatcag ataagtttag 3061 tcggcattag gctgatatgg cgcagagagg acaatatcta gttgatacat acccatgtgc 3121 tatcacgtct aaacgtcttc cagtcgttgt attttaccca ttcctggtga ttgccgtatg 3181 tcttccagtc 
cagtacctcc cctaattcgg tgcccttcaa cattcagggt tatgttgaca 3241 gcgtgctgta taagcatggt tgccggggaa catggtgtca ttgactggta agcagggcag 3301 cctctggttc tgtgacacct ggttttccct ggtcttgatt tcaaactatt atcaaacaaa 3361 tcagcgtcgt cttagtagat ggcaggtatc ttagaacaga aagtagttat tcttcgtcgg 3421 atacgactcc aaaatcatat acgaacaagt ggatgcttta tactatgctt ctttttactg 3481 taatcttggt ctcgctcttc gcttagcctt cttcacctgg taaccggtat aggtaccttc 3541 gcaccctcaa ttggaaggat gcatcaaccc ctccatgcgc tgcttatcct gctatctatc 3601 catacatttc tatgttttta cagtgtagaa tatttcttgt gtatcagggt ccgtcccgcg 3661 atgctacccc ctgtcctgca g // LOCUS MUSIMPD 1707 bp ss-mRNA ROD 28-JUN-1990 DEFINITION Mouse IMP dehydrogenase mRNA, complete cds. ACCESSION M33934 KEYWORDS IMP dehydrogenase. SOURCE Mouse, cDNA to mRNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 1707) AUTHORS Tiedemann,A.A. and Smith,J.M. TITLE Isolation and DNA sequence of a cDNA encoding mouse IMP dehydrogenase JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable [or printed] sequence for [1] kindly submitted by J.M.Smith, 27-APR-1990. Author address: Dr. J.M. Smith Seattle Biomedical Research Institute 4 Nickerson St. 
Seattle, WA 98109 FEATURES from to/span description pept 67 1611 IMP dehydrogenase (EC 1.2.1.14) BASE COUNT 457 a 408 c 451 g 391 t ORIGIN 1 cctctgcggc gcggtcctcg gagcggcgcg gttctcggag ccacgcgtct gtcttcctcc 61 gtggtcatgg cggactacct gattagcgga ggcacctctt acgtgccgga cgacgggctc 121 acagcgcagc agctcttcaa ctgcggggac ggcctcacct acaatgattt tctcattctt 181 cctgggtata tcgacttcac tgcagatcag gtggacttga cgtctgctct aactaagaag 241 attacactaa agaccccatt ggtttcctca cccatggaca ctgtcacaga ggctggaatg 301 gccatcgcga tggcgcttac aggaggtatt ggtttcatcc accacaactg cacacctgaa 361 ttccaggcca atgaagttcg gaaagtgaag aaatacgaac agggattcat cactgacccc 421 gtggtcctta gccccaagga tcgtgtacgc gatgtttttg aggccaaagc caggcatggc 481 ttctgtggta tccccatcac agatacaggc cggatgggga gtcgattggt gggcatcatc 541 tcctcaaggg acattgattt cctcaaggag gaagagcatg accggttctt ggaagagatc 601 atgactaaga gggaagattt ggtggtcgcc cctgccggcg tcactctgaa agaggcaaat 661 gagattctgc agcgaagtaa aaagggaaag ttgcccattg tgaatgaaaa tgatgagctg 721 gtagccatca ttgcccggac agacctaaag aagaatcgtg attaccccct ggcctccaaa 781 gatgccaaga agcaactgct gtgtggggca gccattggca ctcatgagga tgacaagtat 841 aggctggact tactggccct tgctggtgtg gatgtagtgg ttttggactc ttcccaggga 901 aactccatct tccaaatcaa tatgatcaaa tacatcaagg agaagtatcc cagtctacag 961 gtcattggag gcaatgtagt cactgctgcg caagccaaga acctcataga tgcaggtgta 1021 gatgctttgc gagtcggcat gggaagtggt tccatctgca tcacccagga agtgttggcc 1081 tgtgggcggc cccaagccac agcagtgtac aaggtctctg agtatgcccg tcgctttggt 1141 gttcctgtta ttgctgatgg aggaatccaa aatgtgggtc atattgccaa agctttggct 1201 cttggggctt ccacagtcat gatgggctcc ctcctggctg ccaccactga ggcccctggc 1261 gagtacttct tctcagatgg gatccggctg aagaaatacc gaggtatggg ttctcttgat 1321 gccatggaca aacatctcag cagccagaac cgatacttca gtgaagctga caaaatcaaa 1381 gtggcccaag gagtttcagg ggcagtgcag gacaaggggt ctatccacaa gttcgttcct 1441 tacctgattg ctggcatcca gcattcctgt caagacattg gtgccaagag tttaacccaa 1501 gtcagagcca tgacgtactc gggggagctt aaatttgaga agaggacatc ctctgctcag 1561 gtggaaggtg gcgtccacag 
cctccattcg tacgagaaac ggcttttctg aaaacagatc 1621 cagtatatgc cttgaatttt tcaataaaag tttgggaaaa aaaaagtgaa aaaaaaaaaa 1681 aaaaaaaaaa aaaaaaaaaa aaaaaaa // LOCUS HUMOSTRO 1425 bp ss-mRNA PRI 28-JUN-1990 DEFINITION Human osteopontin mRNA, complete cds. ACCESSION J04765 KEYWORDS osteopontin; phosphoprotein. SOURCE Human adult osteoblast, cDNA to mRNA, clone HOP-10. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 1425) AUTHORS Young,M.F., Fisher,L.W., McBride,O.W. and Termine,J.D. TITLE cDNA cloning, chromosomal location and RFLP analysis of human osteopontin JOURNAL Genomics (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable [or printed] sequence for [1] kindly submitted by M.F.Young, 04-MAY-1990. FEATURES from to/span description pept 68 1012 osteopontin precursor /hgml_locus_uid="LK0223T" /nomgen="OPN" /map="unassigned" sigp 68 115 osteopontin signal peptide matp 176 1012 osteopontin mRNA < 1 1425 osteopontin mRNA BASE COUNT 449 a 293 c 296 g 387 t ORIGIN Chromosome 4q13. 
1 gaccagactc gtctcaggcc agttgcagcc ttctcagcca aacgccgacc aaggaaaact 61 cactaccatg agaattgcag tgatttgctt ttgcctccta ggcatcacct gtgccatacc 121 agttaaacag gctgattctg gaagttctga ggaaaagcag ctttacaaca aatacccaga 181 tgctgtggcc acatggctaa accctgaccc atctcagaag cagaatctcc tagccccaca 241 gaatgctgtg tcctctgaag aaaccaatga ctttaaacaa gagacccttc caagtaagtc 301 caacgaaagc catgaccaca tggatgatat ggatgatgaa gatgatgatg accatgtgga 361 cagccaggac tccattgact cgaacgactc tgatgatgta gatgacactg atgattctca 421 ccagtctgat gagtctcacc attctgatga atctgatgaa ctggtcactg attttcccac 481 ggacctgcca gcaaccgaag ttttcactcc agttgtcccc acagtagaca catatgatgg 541 ccgaggtgat agtgtggttt atggactgag gtcaaaatct aagaagtttc gcagacctga 601 catccagtac cctgatgcta cagacgagga catcacctca cacatggaaa gcgaggagtt 661 gaatggtgca tacaaggcca tccccgttgc ccaggacctg aacgcgcctt ctgattggga 721 cagccgtggg aaggacagtt atgaaacgag tcagctggat gaccagagtg ctgaaaccca 781 cagccacaag cagtccagat tatataagcg gaaagccaat gatgagagca atgagcattc 841 cgatgtgatt gatagtcagg aactttccaa agtcagccgt gaattccaca gccatgaatt 901 tcacagccat gaagatatgc tggttgtaga ccccaaaagt aaggaagaag ataaacacct 961 gaaatttcgt atttctcatg aattagatag tgcatcttct gaggtcaatt aaaaggagaa 1021 aaaatacaat ttctcacttt gcatttagtc aaaagaaaaa atgctttata gcaaaatgaa 1081 agagaacatg aaatgcttct ttctcagttt attggttgaa tgtgtatcta tttgagtctg 1141 gaaataacta atgtgtttga taattagttt agtttgtggc ttcatggaaa ctccctgtaa 1201 actaaaagct tcagggttat gtctatgttc attctataga agaaatgcaa actatcactg 1261 tattttaata tttgttattc tctcatgaat agaaatttat gtagaagcaa acaaaatact 1321 tttacccact taaaaagaga atataacatt ttatgtcact ataatctttt gttttttaag 1381 ttagtgtata ttttgttgtg attatctttt tgtggtgtga ataag // LOCUS RATALADPS 1612 bp ds-DNA ROD 28-JUN-1990 DEFINITION Rat delta-aminolevulinate dehydratase processed pseudogene, complete cds. ACCESSION J04764 KEYWORDS delta-aminolevulinate dehydratase; pseudogene. SOURCE Rat (strain Sprague-Dawley) adult liver DNA, clone lambda 27D. 
ORGANISM Rattus norvegicus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 1612) AUTHORS Bishop,T.R., Frelin,L.P. and Boyer,S.H. TITLE Isolation and characterization of a rat delta-aminolevulinate dehydratase processed pseudogene JOURNAL Genomics (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable [or printed] sequence for [1] kindly submitted by T.R.Bishop, 20-APR-1990. FEATURES from to/span description pept.ps 271 1264 pseudo-delta-aminolevulinate dehydratase rpt 101 108 8 bp flanking direct repeat rpt 1385 1392 8 bp flanking direct repeat BASE COUNT 404 a 411 c 418 g 379 t ORIGIN 1 tagatgttta ctctcctaca tgagggataa ttgtctataa tgaatgtcac tcctgtatgt 61 gggattcata aagtctatgt gaggagagaa ggacatgtct taaatctcat tcaggggctt 121 gtgtttttag agcgggagag cgcggtgtcc agaatcccgg cttggagcgg tggtgaggca 181 gctgagcagc ttctctgagc tgaggctcag ggccactcca ggggcctaga tgcaatctta 241 gcagactgct gtgttccgcc ggcccccacc atgcatcatc agtctgttct gcacagaggc 301 tactttcacc cactgcttcg ggcctggcaa gaccaccgcc tccaccgtca gtaccaccaa 361 cctcatctat cccatcttta acacggatat ccctgatgat gtccagccca ttgccagcct 421 cccgggagtg gccaggtatg gcctaaacca gctagaggag atgctgagat ctctggtgga 481 agctggcctg cgctgtgtcc tgatctttgg tgtccccagc agagttccca aggatgaaca 541 gggctctctg cagctgactc agaggactcc ccaactattg aggctgtctg tctgctgagg 601 aagaccttcc ctaccctcct agtggcctgt gatgtctgct tgtgccccta cacctcccat 661 ggccactgtg gcctcctgag tgaaaatgga gcgttcctag cagaggaaag ctgacagcgg 721 ttggcagagg tggcactggc ctatgccaag gaaggctgtc aggttgtagc tccatcagat 781 atgatggatg gacgagttga agccatcaag gctaccctgc taaaacatgg acttggcaac 841 agggtctctg tgatgagcta tagtgccaaa tttgcctcct gtttctatgg tcctttccat 901 gatacagctc agtcaagccc agcttttgga gaccgacgct gttatcagct gcctcctgga 961 gccggtgcca ggccctctga gcagtggcct gggacattca agagggagct gacataatca 1021 tggtgaatcc gggattgccc tacctggata tggtgcagga ggtaaaggac aagcacccct 1081 cgctccctct 
cgcagtatac caggtgtctg aagagtttgc catgtgtggc agggagccaa 1141 ggctggggcc tttgatctca gtactgctgt actagagtcc atgacagcct tccgcagagc 1201 cggtgccgac attatcatca cctactatgc accccagctg ttgagtgggc tgaaggaaga 1261 gtgaaggaaa gtgaaggact tggtcttgac agagcgctcc ctgggcctta cagaagggaa 1321 aaagtaaacg cgctgttaga accgaaaaaa caaacaaaca aacaaaaacc ctctttcaga 1381 atcaaggaaa tctcaaccca ataggtcaca aagtcataat tacaagagtg aagagttcta 1441 gtcatctggg ttcctgccct tcaggatcag caggaataga gatgtaactt tacaagtctt 1501 caagaagtag actacatctt atgtacatga aacaccacat ctttctagaa aatgagactg 1561 aggcataaga caccacaaca gaaacactgt gttactgtca tgctgtgatc at // LOCUS MZEZEISP 215 bp ds-DNA PLN 28-JUN-1990 DEFINITION Z.mays zein storage protein gene, 5' flank. ACCESSION M33830 KEYWORDS storage protein; zein. SOURCE Z.mays endosperm DNA, clone pMS1. ORGANISM Zea mays Eukaryota; Plantae; Embryobionta; Magnoliophyta; Liliopsida; Commelinidae; Cyperales; Poaceae. REFERENCE 1 (bases 1 to 215) AUTHORS Grasser,K.D., Maier,U.-G., Haass,M.M. and Feix,G. TITLE Maize high mobility group proteins bind to CCAAT and TATA boxes of a zein gene promoter JOURNAL J. Biol. Chem. 265, 4185-4188 (1990) STANDARD simple staff_review BASE COUNT 84 a 42 c 26 g 63 t ORIGIN 1 gtataaatta taacccaaca aagtaatcac tcaatgtcaa aaccaactag ataccatgtc 61 atctctacct tatcttacta atatcttttt gcaaattcga aattaatctt gcacaagcac 121 aaggactgag atgtgtataa atatctctta gattagtagc taatatatcg cacatattat 181 tgagaccaac tagcaacata gaaagcacaa tagtg // LOCUS CHKARCASR 2392 bp ss-mRNA VRT 28-JUN-1990 DEFINITION Chicken 5-aminoimidazole ribonucleotide (AIR) carboxylase-5- aminoimidazole-4-N-succinocarboxamide ribonucleotide (SAICAR) synthetase, complete cds. ACCESSION M31764 KEYWORDS 5-aminoimidazole ribonucleotide carboxylase; 5-aminoimidazole-4-N-succinocarboxamide ribonucleotide synthetase. SOURCE Chicken liver, cDNA to mRNA, clone pZD1. ORGANISM Gallus gallus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Aves; Neornithes; Neognathae; Galliformes; Phasianidae. 
REFERENCE 1 (bases 1 to 2392) AUTHORS Chen,Z., Dixon,J.E. and Zalkin,H. TITLE Cloning of a chicken liver cDNA encoding 5-aminoimidazole ribonucleotide carboxylase and 5-aminoimidazole-4-n-succinocarboxamide ribonucleotide synthetase by functional complementation of Escherichia coli pur mutants JOURNAL Proc. Natl. Acad. Sci. U.S.A. 87, 3097-3101 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by H.Zalkin, 01-FEB-1990. FEATURES from to/span description pept 59 1339 5-aminoimidazole ribonucleotide (AIR) carboxylase-5- aminoimidazole-4-N-succinocarboxamide ribonucleotide (SAICAR) synthetase matp 59 835 SAICAR synthetase domain matp 836 1336 AIR carboxylase domain site 2341 2347 poly-A signal BASE COUNT 696 a 466 c 527 g 702 t 1 others ORIGIN 1 ggcgctcagt ttctttcctg tgccttaggt gcctgtcgcc gccgccgctg tcgccgccat 61 ggcccccgcc gcctcagaac tgaaacttgg taaaaaagtt aatgagggta aaacgaaaga 121 agtgtatgag ctgccggata tcccgggatg cgttctgatg cagtccaagg accaaataac 181 agcggggaat gccgctagga aggaccggat ggaggggaag gctgccatct ccaacaccac 241 caccagctgc gtgttccagc tgctgcagga agcaggaatc aaaacggctt ttgtcaggaa 301 acagagtgac acagctttca tagcagctca ctgtgaaatg atcccaattg aatgggtctg 361 cagaagaatt gctactggct ctttcctcaa aagaaaccct ggtgtcaaag aaggctataa 421 gttttaccca cctaaaattg agatgtttta caaggatgat gccaataatg atccacagtg 481 gtctgaggag cagctaattg aagcaaaatt ctcttttgct ggacttacta ttggcaagac 541 tgaagtggat attatggctc gttctactca agctattttt gagatcctgg aaaaatcatg 601 gcagccccaa aactgcactc tggtggacct gaagattgaa tttggtgtta atattttgac 661 caaagaaatt gttcttgctg atgttattga taatgattca tggagactgt ggccatcggg 721 agacagaagc cagcagaagg acaaacagtc ctatcgagat ctgaaggaag tgactcctga 781 agcattgcag atggttaaga gaaactttga atgggttgca gaaagagtag agttgcttct 841 gaaaacaaag agccaaggta gagttgtggt attgatggga tctacttctg accttggcca 901 ctgtgagaaa ataaagaagg catgtgcaac ctttggaatt ccttgtgagt taagagtaac 961 ctccgctcac aaagggccag atgaaactct gaggatcaaa gcagaatatg aaggagatgg 
1021 aatcccgact gtgtttgttg cagtagctgg cagaagcaat ggtttagggc cagtaatgtc 1081 tggtaacact gcttaccctg ttgtcaactg tcctcccctc tcatctgact ggggtgctca 1141 ggatgtgtgg tcctctctca gactgcccag tggtcttggc tgtcctacta ctctgtcacc 1201 tgaaggagct gctcagtttg ctgcccagat ttttggttta aacaaccact tggtatgggc 1261 caaactgcga tcaaacatgt taaatacatg gatctctttg aagcaggctg acaaaaaatt 1321 gcgggagtgc accttgtaag tccaaccaac aagtaactcc tcctcgctac acaaagatag 1381 tggcgtgcat atgcatttgt atcaggattt gcagttggat gatgtgagac atcttttctt 1441 acagaaggaa gcatcttcag agcagcagct gagtctgcta tgtcctcatt ccttttcttg 1501 tgtatgtttt tggttttttt agtatagcca cttagtaaaa gatgagaagg taatatttta 1561 gttatccttc tgttgatcca aaatagtgct gttctgctct ttagtgtata ctgttaacta 1621 ctgatgatgg ttatgtcaac acaatgattg atttggtgtg atttaaatag aaaccccctt 1681 ttttttcccc ctgtaacagt aacattcttc catagtaata atctatgtcc tgaagcttat 1741 taactcaatg ggactgctaa ctaaaactag tgctgatgta gtgctgtaat aatggtatgc 1801 tttcttttag caataaaatg gcaacatatg ctgtattaat aagaaaatag gctaagaatg 1861 agtaaaagcc tgaaagaaat aactgcctta tttaaaagaa aaacatatat tttttcatgt 1921 gtatgtgtgt atgtatatac accccatcag taactttacc acatcatttg agtgagttaa 1981 cagttctctt gtttgtgaga agctctaatt ttgaatggaa ccataactag gcttgttgcc 2041 tacttgctgt ataactcttg actggtccag tcttttgtcc ttcaaatagt agacagatct 2101 actgtgtatg tatgtgatgc tgtcaaatta tccttataca catcaatgta tctatattcc 2161 ctcactctgt ttctatggga attaaactga actgaaggtg ggaataaaca gtaaagcaac 2221 agaatacacg ggcaagccta agcatgactg tcttgntctt tgcacacaac attctttaaa 2281 gtaggtactg attccaagaa atatgcaatt actcaatgta tgtacaattt atgcttttat 2341 aattaaataa actatttaat gcaaggaatt cgatatcaag cttatcgata cc // LOCUS ECORAFRO 1289 bp ds-DNA BCT 28-JUN-1990 DEFINITION E.coli raffinose repressor (rafR) gene, complete cds, and alpha-galactosidase (rafA) gene, 5' end. ACCESSION M29849 KEYWORDS alpha-galactosidase; raffinose repressor. SOURCE E.coli (strain K12) DNA. 
ORGANISM Escherichia coli Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Facultatively anaerobic rods; Enterobacteriaceae. REFERENCE 1 (bases 1 to 1289) AUTHORS Aslanidis,C. and Schmitt,R. TITLE Regulatory elements of the raffinose operon: Nucleotide sequences of operator and repressor genes JOURNAL J. Bacteriol. 172, 2178-2180 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by R.Schmitt, 13-NOV-1989. FEATURES from to/span description pept 166 1176 raffinose repressor pept 1270 > 1289 alpha-galactosidase binding 156 161 ribosome binding site binding 1258 1263 ribosome binding site signal 120 126 -35 region signal 144 149 -10 region signal 1212 1216 -35 region signal 1235 1240 -10 region site 1196 1213 raf operator site 1217 1234 raf operator BASE COUNT 317 a 283 c 326 g 363 t ORIGIN 1 aagcttatta tgcttccatc ggaaacaatg atctggcagc ctgagttcac agataaaata 61 tctctccagg aaacccgggg ctgttcatca tgcaagtctg tcgattactg gctttggtga 121 cggaattttc tggatttccg gcttagaacc acagcaggag ataatatgtc acttaaagcg 181 attgccacga cactcggtat ttctgtcacc actgtcagtc gggctcttgg aggcttttca 241 gatgtggctg cttctacccg tgagcgcgtg gaagcggaag cacgtcgacg aggttaccgc 301 cctaatacac aggcaagaag actcaaaacc ggtaaaaccg atgctatcgg tctggtttat 361 cctgaaaatg atgtgccgtt taacagcggt gtttttatgg atatggtcag ttgcatcagc 421 agggaacttg cttatcatga tattgactta ctgctgatcg ctgatgatga gcatgcagac 481 tgccacagct atatgcggct tgttgaaagt cgcagaattg atgctcttat cattgcacat 541 actctggatg acgatccccg tatcacacat cttcataaag caggtattcc gtttctggct 601 cttggacggg taccgcaggg cttgccctgt gcgtggtttg actttgataa tcatgccgga 661 acctggcagg caacccagaa gctgattgct ttgggacata agagtattgc gctgttgagc 721 gagaacactt cacattctta tgttattgca agacgtcagg gatggcttga tgcactgcat 781 gagcatggac tgaaagatcc attgttgcgg ctggtttctc ccacgcgacg agcgggctat 841 ctggctgtga tggagttaat gtcattaccg gcgccaccaa cagctattat tactgacaat 901 gacctgagtg gagatggtgc ggctatggcg ctgcagttga gagggcgtct ttcagggaaa 961 gaagctgtat 
ctctggttgt atatgatggt ttgcctcagg acagcattat tgagctggat 1021 gtggctgctg ttattcagtc aacacgaagt ctcgttggtc gtcagatttc tgacatggtg 1081 tatcagataa tcaatggtgc atcaccagaa tcactgcaga taacctggac accgatattt 1141 taccctggta gcacggttca ttctccttcc ttctgatttt ttatccagat cacacaaccg 1201 aaacgttttg gttgatgttc gaaacgtttc ggatcaacag taagacatac ctgaaagcgg 1261 agatgtctaa tgatttcaaa gtactgcag // LOCUS HSEDILSTD 351 bp ds-DNA VRL 28-JUN-1990 DEFINITION Equine herpesvirus type 1 subfragments from near the left terminus of STD DNA. ACCESSION M34145 KEYWORDS defective interfering particle. SOURCE Equine herpesvirus type 1 (strain Kentucky A) DNA, clone Sma2. ORGANISM Equine herpesvirus type 1 Viridae; ds-DNA enveloped viruses; Herpesviridae; Alphaherpesvirinae. REFERENCE 1 (bases 1 to 351) AUTHORS Yalamanchili,R.R., Raengsakulrach,B., Baumann,R.P. and O'Callaghan,D.J. TITLE Identification of the site of recombination in the generation of the genome of DI particles of Equine herpesvirus type 1 JOURNAL Virology 175, 448-455 (1990) STANDARD simple staff_review BASE COUNT 55 a 117 c 127 g 52 t ORIGIN 1 cgccgctcac gtcacactcc tccccaaaga gggtattcgg ggggtcgctc ggcggctcca 61 ggaaccgctg ctcgtaccgg tcctcggagt tcagtgcccg cagcgtggcg cgcagcggga 121 agctctgctc gggcagcagc tgcaggaggg actgccagca ctcgagtgtg gggcgcgtgt 181 gcgtgtaccc catggcgtaa aaatccagca gcgccctgcg cagggtgcgc gagtcctcgg 241 tcacgtggat cgaggaggtg gatatccccc tgaacaagcg gttgacgtcc cccgcgaggc 301 ggttcacatc ggggcgccac ggcggggcgc agaacgcccc tggcccccgg g // LOCUS HSEDISA 633 bp ds-DNA VRL 28-JUN-1990 DEFINITION Equine herpesvirus type 1 defective interfering particle DNA. ACCESSION M34143 KEYWORDS defective interfering particle. SOURCE Equine herpesvirus type 1 (strain Kentucky A) DNA, clone pHSII. ORGANISM Equine herpesvirus type 1 Viridae; ds-DNA enveloped viruses; Herpesviridae; Alphaherpesvirinae. REFERENCE 1 (bases 1 to 633) AUTHORS Yalamanchili,R.R., Raengsakulrach,B., Baumann,R.P. and O'Callaghan,D.J. 
TITLE Identification of the site of recombination in the generation of the genome of DI particles of Equine herpesvirus type 1 JOURNAL Virology 175, 448-455 (1990) STANDARD simple staff_entry BASE COUNT 124 a 184 c 202 g 123 t ORIGIN 1 ttgtggaata attaaactgt atttattgat gagtaacaca aaacaagttt cctgggaaac 61 acactccaca gtttttttaa aagatttggt tacagtaaaa gtatttgccg tgcaggtaaa 121 ccggaacgag ggtgtaggcc gatacaaggc tgcaggtatc tgccttgcat cgccgcttgt 181 gcgcgtctat cgcctcgagg gttcccgcca gacaggctcc aggtacgtag tcggctagaa 241 cgcgcccgtc gggtcccagt gcgtccctgg acacagtttc ggcgccgctc acgtcacact 301 cctccccaaa gagggtattc ggggggtcgc tcggcggctc caggaaccgc tgctcgtacc 361 ggtcctcgga gttcagtgcc cgcagcgtgg cgcgcagcgg gaagctctgc tcgggcagca 421 gctgcaggag ggactgccag cactcgagtg tggggcgcgt gtgcgtgtac cccatggcgt 481 aaaaatccag cagcgccctg cgcagggtgc gcgagtcctc ggtcacgtgg atcgaggagg 541 tggatatccc cctgaacaag cggttgacgt cccccgcgag gcggttcaca tcggggcgcc 601 acggcggggc gcagaacgcc cctggccccc ggg // LOCUS HSEDISTD 290 bp ds-DNA VRL 28-JUN-1990 DEFINITION Equine herpesvirus type 1 subfragments from the inverted repeat of STD DNA. ACCESSION M34144 KEYWORDS defective interfering particle. SOURCE Equine herpesvirus type 1 (strain Kentucky A) DNA, clone xbaG. ORGANISM Equine herpesvirus type 1 Viridae; ds-DNA enveloped viruses; Herpesviridae; Alphaherpesvirinae. REFERENCE 1 (bases 1 to 290) AUTHORS Yalamanchili,R.R., Raengsakulrach,B., Baumann,R.P. and O'Callaghan,D.J. 
TITLE Identification of the site of recombination in the generation of the genome of DI particles of Equine herpesvirus type 1 JOURNAL Virology 175, 448-455 (1990) STANDARD simple staff_review BASE COUNT 69 a 72 c 77 g 72 t ORIGIN 1 ttgtggaata attaaactgt atttattgat gagtaacaca aaacaagttt cctgggaaac 61 acactccaca gtttttttaa aagatttggt tacagtaaaa gtatttgccg tgcaggtaaa 121 ccggaacgag ggtgtaggcc gatacaaggc tgcaggtatc tgccttgcat cgccgcttgt 181 gcgcgtctat cgcctcgagg gttcccgcca gacaggctcc aggtacgtag tcggctagaa 241 cgcgcccgtc gggtcccagt gcgtccctgg acacagtttc ggcgccgctc // LOCUS HUMAR 1230 bp ss-mRNA PRI 28-JUN-1990 DEFINITION Human amphiregulin (AR) mRNA, complete cds. ACCESSION M30704 KEYWORDS amphiregulin; growth regulator; tumor inhibitory factor. SOURCE Human cell line MCF-7 breast carcinoma, cDNA to mRNA, clones lambda-AR1 and lambda-AR2. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 1230) AUTHORS Plowman,G.D., Green,J.M., McDonald,V.L., Neubauer,M.G., Disteche,C.M., Todaro,G.J. and Shoyab,M. TITLE The amphiregulin gene encodes a novel epidermal growth factor- related protein with tumor-inhibitory activity JOURNAL Mol. Cell. Biol. 10, 1969-1981 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable [or printed] sequence for [1] kindly submitted by G.D.Plowman, 18-DEC-1989. FEATURES from to/span description pept 210 968 amphiregulin precursor /hgml_locus_uid="LW0168B" /nomgen="AREG" /map="4q13-q21" sigp 210 266 amphiregulin signal peptide matp 510 962 amphiregulin (alt.) matp 528 962 amphiregulin (alt.) 
mRNA 1 1230 amphiregulin mRNA BASE COUNT 375 a 273 c 285 g 297 t ORIGIN 1 agacgttcgc acacctgggt gccagcgccc cagaggtccc gggacagccc gaggcgccgc 61 gcccgccgcc ccgagctccc caagccttcg agagcggcgc acactcccgg tctccactcg 121 ctcttccaac acccgctcgt tttgcggcag ctcgtgtccc agagaccgag ttgccccaga 181 gaccgagacg ccgccgctgc gaaggaccaa tgagagcccc gctgctaccg ccggcgccgg 241 tggtgctgtc gctcttgata ctcggctcag gccattatgc tgctggattg gacctcaatg 301 acacctactc tgggaagcgt gaaccatttt ctggggacca cagtgctgat ggatttgagg 361 ttacctcaag aagtgagatg tcttcaggga gtgagatttc ccctgtgagt gaaatgcctt 421 ctagtagtga accgtcctcg ggagccgact atgactactc agaagagtat gataacgaac 481 cacaaatacc tggctatatt gtcgatgatt cagtcagagt tgaacaggta gttaagcccc 541 cccaaaacaa gacggaaagt gaaaatactt cagataaacc caaaagaaag aaaaagggag 601 gcaaaaatgg aaaaaataga agaaacagaa agaagaaaaa tccatgtaat gcagaatttc 661 aaaatttctg cattcacgga gaatgcaaat atatagagca cctggaagca gtaacatgca 721 aatgtcagca agaatatttc ggtgaacggt gtggggaaaa gtccatgaaa actcacagca 781 tgattgacag tagtttatca aaaattgcat tagcagccat agctgccttt atgtctgctg 841 tgatcctcac agctgttgct gttattacag tccagcttag aagacaatac gtcaggaaat 901 atgaaggaga agctgaggaa cgaaagaaac ttcgacaaga gaatggaaat gtacatgcta 961 tagcataact gaagataaaa ttacaggata tcacattgga gtcactgcca agtcatagcc 1021 ataaatgatg agtcggtcct ctttccagtg gatcataaga caatggaccc tttttgttat 1081 gatggtttta aactttcaat tgtcactttt tatgctattt ctgtatataa aggtgcacga 1141 aggtaaaaag tattttttca agttgtaaat aatttattta atatttaatg gaagtgtatt 1201 tattttacag ctcattaaac ttttttaacc // LOCUS HUMAR1 968 bp ds-DNA PRI 28-JUN-1990 DEFINITION Human amphiregulin (AR) gene, exon 1. ACCESSION M30698 KEYWORDS amphiregulin; growth regulator; tumor inhibitory factor. SEGMENT 1 of 6 SOURCE Human breast carcinoma cell line MCF-7 DNA, clones lambda-ARH[6,12]. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. 
REFERENCE 1 (bases 1 to 968) AUTHORS Plowman,G.D., Green,J.M., McDonald,V.L., Neubauer,M.G., Disteche,C.M., Todaro,G.J. and Shoyab,M. TITLE The amphiregulin gene encodes a novel epidermal growth factor- related protein with tumor-inhibitory activity JOURNAL Mol. Cell. Biol. 10, 1969-1981 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable [or printed] sequence for [1] kindly submitted by G.D.Plowman, 18-DEC-1989. FEATURES from to/span description pept 859 + 919 amphiregulin precursor, exon 1 /hgml_locus_uid="LW0168B" /nomgen="AREG" /map="4q13-q21" sigp 859 915 amphiregulin signal peptide pre-msg 649 > 968 amphiregulin mRNA and introns IVS 920 > 968 amphiregulin intron A BASE COUNT 206 a 280 c 257 g 225 t ORIGIN 1 gaattcatat ccacctggct ttgaacatta tcggctgtga gatggtgtag gtaaaatttt 61 aagtgcataa tttggcaata ataaatcatc aataaatatt aatgttgatg aggcccctgg 121 gccacataaa gaaataggga gtgaggggat ttgaaattct ggccacttca cagaaatggg 181 tgggaagggg ctcttgattg agatagaagc ccatcctaca tgaagcaatt cctcattgag 241 ttctctcgtc ctttatcctt gttggaaaca tcaggcaaag tcactcttgg tcttaaagta 301 cttttacatc taaatacgga actcttctat ttaatccctg tctgttgtag atgttaagta 361 tacaaagagg ttgtcagagt ttgaaacatc tggacttctg tcaggtacta gctccggaac 421 tccagtcctg ctcgccctca aaaacggctt gcagctagag gtttaagttc cacttcctct 481 cagcgaatcc ttacgcacga gggaggcggg gcgtgtgtcc tccgcgcgtg gttttcgggt 541 agcaccttct ggggcgccgc ctgcctccac ccacggccgg gccttgacgt catgggctgc 601 ggccccctcc cggctgagcc tataaagcgg caggtgcgcg ccgccctaca gacgttcgca 661 cacctgggtg ccagcgcccc agaggtcccg ggacagcccg aggcgccgcg cccgccgccc 721 cgagctcccc aagccttcga gagcggcgca cactcccggt ctccactcgc tcttccaaca 781 cccgctcgtt ttgcggcagc tcgtgtccca gagaccgagt tgccccagag accgagacgc 841 cgccgctgcg aaggaccaat gagagccccg ctgctaccgc cggcgccggt ggtgctgtcg 901 ctcttgatac tcggctcagg tgaggattca acggcgctga actgctgggc tctcctccca 961 tggcaggt // LOCUS HUMAR2 348 bp ds-DNA PRI 28-JUN-1990 DEFINITION Human amphiregulin (AR) gene, exon 2. 
ACCESSION M30699 KEYWORDS amphiregulin; growth regulator; tumor inhibitory factor. SEGMENT 2 of 6 SOURCE Human breast carcinoma cell line MCF-7 DNA, clones lambda-ARH[6,12]. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 348) AUTHORS Plowman,G.D., Green,J.M., McDonald,V.L., Neubauer,M.G., Disteche,C.M., Todaro,G.J. and Shoyab,M. TITLE The amphiregulin gene encodes a novel epidermal growth factor- related protein with tumor-inhibitory activity JOURNAL Mol. Cell. Biol. 10, 1969-1981 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable [or printed] sequence for [1] kindly submitted by G.D.Plowman, 18-DEC-1989. FEATURES from to/span description pept + 51 + 299 amphiregulin precursor, exon 2 /hgml_locus_uid="LW0168B" /nomgen="AREG" /map="4q13-q21" matp 290 + 299 amphiregulin pre-msg < 1 > 348 amphiregulin mRNA and introns IVS < 1 50 amphiregulin intron A IVS 300 > 348 amphiregulin intron B BASE COUNT 88 a 75 c 86 g 99 t ORIGIN 1 agcaccctac tttacctttt cgttttcttc ctttattccc tcccctgcag gccattatgc 61 tgctggattg gacctcaatg acacctactc tgggaagcgt gaaccatttt ctggggacca 121 cagtgctgat ggatttgagg ttacctcaag aagtgagatg tcttcaggga gtgagatttc 181 ccctgtgagt gaaatgcctt ctagtagtga accgtcctcg ggagccgact atgactactc 241 agaagagtat gataacgaac cacaaatacc tggctatatt gtcgatgatt cagtcagagg 301 tgagtagggg ataaagcaaa aatatggcct gtgagatgtg ggtttata // LOCUS HUMAR3 302 bp ds-DNA PRI 28-JUN-1990 DEFINITION Human amphiregulin (AR) gene, exon 3. ACCESSION M30700 KEYWORDS amphiregulin; growth regulator; tumor inhibitory factor. SEGMENT 3 of 6 SOURCE Human breast carcinoma cell line MCF-7 DNA, clones lambda-ARH[6,12]. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. 
REFERENCE 1 (bases 1 to 302) AUTHORS Plowman,G.D., Green,J.M., McDonald,V.L., Neubauer,M.G., Disteche,C.M., Todaro,G.J. and Shoyab,M. TITLE The amphiregulin gene encodes a novel epidermal growth factor- related protein with tumor-inhibitory activity JOURNAL Mol. Cell. Biol. 10, 1969-1981 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable [or printed] sequence for [1] kindly submitted by G.D.Plowman, 18-DEC-1989. FEATURES from to/span description pept + 52 + 252 amphiregulin precursor, exon 3 /hgml_locus_uid="LW0168B" /nomgen="AREG" /map="4q13-q21" matp + 52 + 252 amphiregulin matp 60 + 252 amphiregulin (alt.) pre-msg < 1 > 302 amphiregulin mRNA and introns IVS < 1 51 amphiregulin intron B IVS 253 > 302 amphiregulin intron C BASE COUNT 125 a 46 c 54 g 77 t ORIGIN 1 aattatattc aagtttgaga gactcttgtc aataaatctt ttctttttta gttgaacagg 61 tagttaagcc cccccaaaac aagacggaaa gtgaaaatac ttcagataaa cccaaaagaa 121 agaaaaaggg aggcaaaaat ggaaaaaata gaagaaacag aaagaagaaa aatccatgta 181 atgcagaatt tcaaaatttc tgcattcacg gagaatgcaa atatatagag cacctggaag 241 cagtaacatg cagtaagttt tcctaaagca tatagatttt tgtatttcta gcaccatgtc 301 tg // LOCUS HUMAR4 254 bp ds-DNA PRI 28-JUN-1990 DEFINITION Human amphiregulin (AR) gene, exon 4. ACCESSION M30701 KEYWORDS amphiregulin; growth regulator; tumor inhibitory factor. SEGMENT 4 of 6 SOURCE Human breast carcinoma cell line MCF-7 DNA, clones lambda-ARH[6,12]. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 254) AUTHORS Plowman,G.D., Green,J.M., McDonald,V.L., Neubauer,M.G., Disteche,C.M., Todaro,G.J. and Shoyab,M. TITLE The amphiregulin gene encodes a novel epidermal growth factor- related protein with tumor-inhibitory activity JOURNAL Mol. Cell. Biol. 
10, 1969-1981 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable [or printed] sequence for [1] kindly submitted by G.D.Plowman, 18-DEC-1989. FEATURES from to/span description pept + 51 + 204 amphiregulin precursor, exon 4 /hgml_locus_uid="LW0168B" /nomgen="AREG" /map="4q13-q21" matp + 51 91 amphiregulin (both alt.) pre-msg < 1 > 254 amphiregulin mRNA and introns IVS < 1 50 amphiregulin intron C IVS 205 > 254 amphiregulin intron D BASE COUNT 81 a 42 c 53 g 78 t ORIGIN 1 cacaccgcac gtgagtgtga ttataatttt taaatgtgaa ttgcttgcag aatgtcagca 61 agaatatttc ggtgaacggt gtggggaaaa gtccatgaaa actcacagca tgattgacag 121 tagtttatca aaaattgcat tagcagccat agctgccttt atgtctgctg tgatcctcac 181 agctgttgct gttattacag tccagtaagt atgacataac ttacaaattc ttaataaaat 241 aatgggaggt taat // LOCUS HUMAR5 217 bp ds-DNA PRI 28-JUN-1990 DEFINITION Human amphiregulin (AR) gene, exon 5. ACCESSION M30702 KEYWORDS amphiregulin; growth regulator; tumor inhibitory factor. SEGMENT 5 of 6 SOURCE Human breast carcinoma cell line MCF-7 DNA, clones lambda-ARH[6,12]. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 217) AUTHORS Plowman,G.D., Green,J.M., McDonald,V.L., Neubauer,M.G., Disteche,C.M., Todaro,G.J. and Shoyab,M. TITLE The amphiregulin gene encodes a novel epidermal growth factor- related protein with tumor-inhibitory activity JOURNAL Mol. Cell. Biol. 10, 1969-1981 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable [or printed] sequence for [1] kindly submitted by G.D.Plowman, 18-DEC-1989. 
FEATURES from to/span description pept + 50 143 amphiregulin precursor, exon 5 /hgml_locus_uid="LW0168B" /nomgen="AREG" /map="4q13-q21" pre-msg < 1 > 217 amphiregulin mRNA and introns IVS < 1 49 amphiregulin intron D BASE COUNT 89 a 26 c 41 g 61 t ORIGIN 1 tatagatgaa tagaaccttg ataacattag aatgccttgt tctctgaagg cttagaagac 61 aatacgtcag gaaatatgaa ggagaagctg aggaacgaaa gaaacttcga caagagaatg 121 gaaatgtaca tgctatagca taactgaaga taaaattaca ggtttgagtt ttaaaatata 181 tctttagatc atatcctata attttgaaaa atttaac // LOCUS HUMAR6 354 bp ds-DNA PRI 28-JUN-1990 DEFINITION Human amphiregulin (AR) gene, 3' flanking region. ACCESSION M30703 KEYWORDS amphiregulin; growth regulator; tumor inhibitory factor. SEGMENT 6 of 6 SOURCE Human breast carcinoma cell line MCF-7 DNA, clones lambda-ARH[6,12]. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 354) AUTHORS Plowman,G.D., Green,J.M., McDonald,V.L., Neubauer,M.G., Disteche,C.M., Todaro,G.J. and Shoyab,M. TITLE The amphiregulin gene encodes a novel epidermal growth factor- related protein with tumor-inhibitory activity JOURNAL Mol. Cell. Biol. 10, 1969-1981 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable [or printed] sequence for [1] kindly submitted by G.D.Plowman, 18-DEC-1989. 
FEATURES from to/span description pre-msg < 1 294 amphiregulin mRNA and intron BASE COUNT 104 a 48 c 52 g 150 t ORIGIN 1 gtaacatttt gttttatttt attattttat tttattttat tttctcacag gatatcacat 61 tggagtcact gccaagtcat agccataaat gatgagtcgg tcctctttcc agtggatcat 121 aagacaatgg accctttttg ttatgatggt tttaaacttt caattgtcac tttttatgct 181 atttctgtat ataaaggtgc acgaaggtaa aaagtatttt ttcaagttgt aaataattta 241 tttaatattt aatggaagtg tatttatttt acagctcatt aaactttttt aaccaaacaa 301 attgagagtt tgaatattag ttctgatatt gcaagactcc agtgtacttt tctc // LOCUS HUMROSSAA 1890 bp ss-mRNA PRI 28-JUN-1990 DEFINITION Human Ro ribonucleoprotein autoantigen (Ro/SS-A), complete cds. ACCESSION M32294 KEYWORDS Ro ribonucleoprotein autoantigen. SOURCE Human T-cell line Jurkat, cDNA to mRNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 1890) AUTHORS McCauliffe,D.P., Lux,F.A., Lieu,T.-S., Sanz,I., Hanke,J., Newkirk,M.M., Bachinski,L.L., Itoh,Y., Siciliano,M.J., Reichlin,M., Sontheimer,R.D. and Capra,J.D. TITLE Molecular cloning, expression and Chromosome 19 localization of a human Ro/SS-A auto antigen JOURNAL J. Clin. Invest. 85, 1379-1391 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by D.P.McCauliffe, 23-FEB-1990. FEATURES from to/span description pept 67 1320 Ro ribonucleoprotein autoantigen (Ro/SS-A) precursor sigp 67 117 Ro ribonucleoprotein autoantigen signal peptide matp 118 1317 Ro ribonucleoprotein autoantigen site 1881 1886 poly-A signal (put.) BASE COUNT 464 a 497 c 520 g 409 t ORIGIN Chromosome 19. 
1 ccgtactgca gagccgctgc cggagggtcg ttttaaaggg ccgcgttgcc gccccctcgg 61 cccgccatgc tgctatccgt gccgctgctg ctcggcctcc tcggcctggc cgtcgccgag 121 cccgccgtct acttcaagga gcagtttctg gacggagacg ggtggacttc ccgctggatc 181 gaatccaaac acaagtcaga ttttggcaaa ttcgttctca gttccggcaa gttctacggt 241 gacgaggaga aagataaagg tttgcagaca agccaggatg cacgctttta tgctctgtcg 301 gccagtttcg agcctttcag caacaaaggc cagacgctgg tggtgcagtt cacggtgaaa 361 catgagcaga acatcgactg tgggggcggc tatgtgaagc tgtttcctaa tagtttggac 421 cagacagaca tgcacggaga ctcagaatac aacatcatgt ttggtcccga catctgtggc 481 cctggcacca agaaggttca tgtcatcttc aactacaagg gcaagaacgt gctgatcaac 541 aaggacatcc gttgcaagga tgatgagttt acacacctgt acacactgat tgtgcggcca 601 gacaacacct atgaggtgaa gattgacaac agccaggtgg agtccggctc cttggaagac 661 gattgggact tcctgccacc caagaagata aaggatcctg atgcttcaaa accggaagac 721 tgggatgagc gggccaagat cgatgatccc acagactcca agcctgagga ctgggacaag 781 cccgagcata tccctgaccc tgatgctaag aagcccgagg actgggatga agagatggac 841 ggagagtggg aacccccagt gattcagaac cctgagtaca agggtgagtg gaagccccgg 901 cagatcgaca acccagatta caagggcact tggatccacc cagaaattga caaccccgag 961 tattctcccg atcccagtat ctatgcctat gataactttg gcgtgctggg cctggacctc 1021 tggcaggtca agtctggcac catctttgac aacttcctca tcaccaacga tgaggcatac 1081 gctgaggagt ttggcaacga gacgtggggc gtaacaaagg cagcagagaa acaaatgaag 1141 gacaaacagg acgaggagca gaggcttaag gaggaggaag aagacaagaa acgcaaagag 1201 gaggaggagg cagaggacaa ggaggatgat gaggacaaag atgaggatga ggaggatgag 1261 gaggacaagg aggaagatga ggaggaagat gtccccggcc aggccaagga cgagctgtag 1321 agaggcctgc ctccagggct ggactgaggc ctgagcgctc ctgccgcaga gcttgccgcg 1381 ccaaataatg tctctgtgag actcgagaac tttcattttt ttccaggctg gttcggattt 1441 ggggtggatt ttggttttgt tcccctcctc cactctcccc caccccctcc ccgccctttt 1501 tttttttttt tttaaactgg tattttatcc tttgattctc cttcagccct cacccctggt 1561 tctcatcttt cttgatcaac atcttttctt gcctctgtgc cccttctctc atctcttagc 1621 tcccctccaa cctggggggc agtggtgtgg agaagccaca ggcctgagat ttcatctgct 1681 ctccttcctg gagcccagag 
gagggcagca gaagggggtg gtgtctccaa ccccccagca 1741 ctgaggaaga acggggctct tctcatttca cccctccctt tctcccctgc ccccaggact 1801 gggccacttc tgggtggggc agtgggtccc agattggctc acactgagaa tgtaagaact 1861 acaaacaaaa tttctattaa attaaatttt // LOCUS PRD1XII 1701 bp ds-DNA PHG 28-JUN-1990 DEFINITION Bacteriophage PRD1 early protein genes XII and XIX, complete cds. ACCESSION M30146 KEYWORDS early protein XII; early protein XIX. SOURCE Bacteriophage PRD1 (Viradea strain) DNA. ORGANISM Bacteriophage PRD1 Viridae; ds-DNA nonenveloped viruses; Tectiviridae. REFERENCE 1 (bases 1 to 1701) AUTHORS Gerendasy,D. and Ito,J. TITLE Nucleotide sequence and transcription of the right early region of bacteriophage PRD1 JOURNAL J. Bacteriol. 172, 1889-1898 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by D.Gerendasy, 27-NOV-1989. FEATURES from to/span description pept 242 724 early protein XII pept 797 955 early protein XIX (put.) mRNA 179 732 early protein XII mRNA mRNA 181 1100 early protein XIX mRNA binding 230 233 ribosome binding site binding 785 788 ribosome binding site rpt 1 111 inverted terminal repeat site 145 172 right early promoter BASE COUNT 419 a 447 c 390 g 445 t ORIGIN 1 gggggatacg tgcccctccc cacctacccg cgcccctaac atttttattt ccgtctgtca 61 ataccccctg catccgatag gcccgaacta tcacaaacgg aaaagcgata ggaataaaat 121 tttgatttat tttcaccata actattgcac gtcctaaaaa tgtgcgtata atttcaatca 181 tcgaaagggc aaacacaaaa cccccgccga taacttccac ttaactttaa aggtaactat 241 catggaaatc gtaagcaagc tgactctgaa aaccattggc gctcaaccca agccacatag 301 cgtaaaagaa aataccgcgc tggcttccat ctatggccgc gttcgcggta agaaagttgg 361 tcaatccacc tttggcgact tcatcaagtt tgaaggtgaa tttgaaggcg tgaatatcgc 421 cactggtgaa gtgttccgtt ccggtgcgct gattctgccg aaggtactgg aaagcctgct 481 tgccggtgcc gtggatggtg aaaacacggt tgattttgcg gttgaaattt gggccaagcc 541 ttccgaaaag ggcaacactg gttatgaata cggtgtcaag ccgctgattg aacccgccgc 601 atcggatgaa ctggccgcgc ttcgcaatca ggttaaggcc gcgctgcctg 
cccctgccgc 661 tgccggtgaa gccgctgccg aagccaagcc cgccgccaag gccaaggcca aggccgaagc 721 ctaaacagcg caccacggcc ccggctgata ccagcgcggg gccattcttt acccttcaaa 781 ttcaaaggta acaaacatgg aaaagcaaac cgaaaatacc cgccctgaat gcccgaaggc 841 attttatttt gtatccattc ccggcgactt tgggcaaacc ccttttgctt catcgcttat 901 gtatggtagc accgcgcttg ccgctgttat caagtcaagg gcgctatccg cgtagtaagc 961 gaacattttg atttgcggtt tgctgataac ggttttacgc ctgctggcgt tacgcaagct 1021 gaatggcttg gcaagctgat tactgaaact ttcggctttc gcttggaact gtttctttaa 1081 accagttttt tatttcttga atttgaaggc cgcatgattc ccttgcggcc ttttcttttg 1141 cgtatgcttc caccaaatcc cgccacgtta taaacggata aacaggcgcg gcgcattcag 1201 ttagcaggct tgacggcgga accaaataaa ccgtttcaat tttcggggct gttgtcgaac 1261 aagccgcgaa catcggcagg cataggacta tcaagaattt gtttagtttc atcgcttacc 1321 cttcccaatc cgctaattgt tcggctaagt ttgccgctga taattgtctg gccttcctga 1381 attgccgcga ttgctttcgc atcgcttgcg cgtactgatt gaagggtttc caccgccttg 1441 gcgtttgcgg tcgccttttc ttcaatggct ttaatttcat tttgaagccc cgttattttt 1501 ttagccgcta aaattgaata tccaaagtta gcaagcatta aaagcgcaac cgcaatagcg 1561 gccacgcttt tcaggttaga caaaacccat ttaatcattg aaggtacgcc ccggcttgtc 1621 attgtcggaa tctatcggcc ccggctgatt gataggggca atcttgcttt gcacaaattg 1681 cgagaaaacc gccttgacaa a // LOCUS PSEALALPQ 3141 bp ds-DNA BCT 28-JUN-1990 DEFINITION P.aeruginosa exopolysaccharide alginate regulatory protein (algP and algQ) genes, complete cds. ACCESSION M32077 KEYWORDS alginate regulatory protein P; alginate regulatory protein Q. SOURCE P.aeruginosa (strain PAO, isolate PA02003) DNA, from patient PA02003. ORGANISM Pseudomonas aeruginosa Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Pseudomonadaceae. REFERENCE 1 (bases 1 to 3141) AUTHORS Konyecsni,W.M. and Deretic,V. TITLE DNA sequence and expression analysis of algP and algQ, components of the multigene system transcriptionally regulating mucoidy in Pseudomonas aeruginosa: algP contains multiple direct repeats JOURNAL J. Bacteriol. 
172, 2511-2520 (1990) STANDARD full staff_review COMMENT draft entry and computer-readable sequence [1] kindly submitted by V.Deretic, 16-FEB-1990. FEATURES from to/span description pept 781 1263 alginate regulatory protein Q pept 2079 3137 alginate regulatory protein P (put.) mRNA 661 1310 algQ mRNA mRNA 2054 3141 algP mRNA BASE COUNT 607 a 1026 c 1009 g 499 t ORIGIN 1 bp upstream of BamHI site; map position 9 min. 1 ggatccgcca tgcccctggc cagcccccgt cagcttttcc ttctcgcgtt cctggcctgc 61 gtcgccatca tgggcggggc gctgtacctg gaacatgtgg ttggcctgga ggcctgcccg 121 ctgtgcgtcg tgcagcggat cttcttcatc ctgatcggcc tgacctgcct tgctggcgcg 181 atccaggggc ccggcctgcg tgggcggcgt atctactccg tgctggtgtt cctgctcgct 241 ctcggcggcg gggccacggc cgcccgccag gtatggttgc agaccgttcc gctggaccaa 301 ctgccggcct gcctgcccag cctcgactac atgatgcagg cgcttccctt ccaggaagtg 361 atccgcctgg tcctgcatgg caccgcggat tgtgcccagg tgagctggac gctattcacc 421 ctgagcattc ccgaatggag cctgctggcg ttcgttgcct atctcggctt ctccatcgtg 481 cagttcctcc gacgtgcctg atgcgggctg tcgcgtggtc tttcgtcagc tctcgagcgt 541 cggccgtatc gatttggcga aaaccagagc ggatgcggga taccgatcaa acatccgtat 601 gaatttttct gtacgggcgt cttgatgggg tcatcccggg ggcataatct ggctttccat 661 agcgccggag tccgcccctt gcacaggcgt tctcccacag agcgcagaga gtgcgtcagc 721 aatgcgtctc ggagcggacg atgattctcc gggacgtccg aagagggaag cgaggacaac 781 atgctcgaaa gctgccgtaa tgcccaagag cggtgggggg gtgtgcacca actgatcgat 841 cgctggctgc atgagcgcca gcaactggtg caggccttcg atgccctgag cggtattcag 901 gcaccggcgc ccaacgccga agaactccag catttctgcc aattgctgct ggactatgta 961 tctgccggac acttcgaggt ctacgagcaa ctgacggcgg aaggcaaggc cttcggcgat 1021 cagcgcggcc tggagctggc caagcagatc ttcccccggc tggaagccat caccgaatcc 1081 gcgctgaact tcaacgaccg ctgcgacaac ggcgattgcc gtgaaggagc ctgcctcatc 1141 gcggagctga aggtcctgcg gcaacagttg cacgaacgct tcgagctgga ggactgcctc 1201 atcgaagtcc tgcacaacgc ccacagccag agcggcgcag agggttcggc agtaccggtc 1261 tgattcaccg gcgagtcgtt acgaaacggc gcgtctccat gcgggagagc gccgtttttg 1321 catttcagcg gaagcccagc aggtcgatct 
cgaagaccag cggagcgtcg ggcgggatca 1381 ggtcgccagc gccttcgtga ccataggctt gcgccgaagg aatcaccact cgccagcgcg 1441 cgccgacagg catcgctcgc aacgctgtcc gccatccctc tatcacactg tccagggcga 1501 accactcggc gctttcgctc tggtcgaaca cttgaccgtc ggcaagcagg ccgcggtaac 1561 ggacgtgtac ctgggttgcc gcgccgatgc cgttaccttg gccacgacgt aattcgctga 1621 cgaggacgcc gccggtcaat tcgcggacgc cgaagcgggc tttttcgttg gccaggaagc 1681 gggcctcggc gggtgtggtc cgcggtgttt ccgtcgtggc gttctcgtgt tgcagcagaa 1741 gttgttcgat gcgttcgggg gggatttcaa gtgcctcgcc gcgataggcc tggcgcagtc 1801 cgagaagtag ctcggatagc tctaggccgg gcatttcctg ctgtaggcgc atacccaggc 1861 gggcgcctac ggcataggcc agttcgtcct tgggaggcgc ggcgaaaagc ggcagcgcag 1921 tagaaaaaga ccgatgaaaa ggcggcgcgg catgcggttt ctccaaacga atcggcgggc 1981 aatcctggcg cacgtcgctt gtagcacatc gaacccgttg gcgagagggg gtgtgcgggt 2041 ctagtatggg cgcaaccacg tccgcctgga ggcacgtcat gtcggccaac aagaagcccg 2101 tcaccacccc cttgcacctg ttgcagcaac tttcccacag ccttgtcgag cacctggaag 2161 atgcgtgcaa acaagcgctg gtcgattcgg aaaagctcct ggccaaactt gaaaagcaac 2221 gtggcaaagc ccaggaaaag ctgcacaagg ctcgcaccaa gctgcaggat gctgccaagg 2281 ccggcaagac caaggcacag gccaaggcgc gcgagaccat cagcgacctg gaagaggcgt 2341 tggataccct gaaggcccgg caggcggaca cccgtaccta catcgtcggc ctcaagcgtg 2401 acgtacagga aagcctcaag ctggcgcagg gtgtcggcaa ggtgaaggaa gctgctggca 2461 aggctctgga gagccgcaag gcgaaacccg cgaccaaacc tgctgcgaag gcggcagcca 2521 agcctgcggt gaaaaccgta gcggcgaagc ctgcggccaa gccggctgcg aagcctgctg 2581 cgaaaccggc ggccaagcct gcgacgaaaa ccgcggcagg caagcccgca gccaagccga 2641 cggcgaagcc tgctgcgaaa ccggcggcca agcccgcggc gaaaaccgca gccgcgaagc 2701 ccgcagccaa gccggcggcg aagcctgtgg cgaaaccggc ggccaagcct gcggcgaaaa 2761 ccgcagccgc gaagcccgcc gccaagccgg cagcgaagcc tgtcgcgaaa ccgacggcca 2821 agcccgcggc gaaaaccgca gccgcgaagc ccgcagccaa gccagctgcg aagcctgcgg 2881 cgaaaccggc ggccaagcct gtggcgaaat ccgcggccgc gaagcctgca gccaagccgg 2941 ctgcgaagcc tgcggcgaaa ccggcggcca agcctgcggc gaaacccgta gccgcgaaac 3001 ctgccgcaac caagcccgcc accgctcctg ctgcgaagcc 
tgcggcgact cccagcgccc 3061 cggcagccgc ctccagcgct gcttcggcaa cgcctgccgc gggcagcaac ggcgccgccc 3121 cgaccagcgc ctcctaaggc g // LOCUS STYAROF 1235 bp ds-DNA BCT 28-JUN-1990 DEFINITION S.typhimurium tyrosine sensitive 3-deoxy-d-arabino-heptulosonate 7-phosphate synthase (aroF) gene, complete cds. ACCESSION M31302 KEYWORDS 3-deoxy-d-arabino-heptulosonate 7-phosphate synthase. SOURCE S.typhimurium (strain LT2) DNA, clone pGM59. ORGANISM Salmonella typhimurium Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Facultatively anaerobic rods; Enterobacteriaceae. REFERENCE 1 (bases 1 to 1235) AUTHORS Muday,G.K. and Herrmann,K.M. JOURNAL Unpublished (1990) STANDARD full staff_review REFERENCE 2 (bases 165 to 1235) AUTHORS Muday,G.K. and Herrmann,K.M. TITLE Regulation of the Salmonella typhimurium aroF gene in Escherichia coli JOURNAL J. Bacteriol. 172, 2259-2266 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by K.M.Herrmann, 13-JAN-1990. Author address: K.M.Herrmann Purdue University Biochemistry department West Lafayette IN, 47907 (317) 494-1653 FEATURES from to/span description pept 165 1235 3-deoxy-d-arabino-heptulosonate 7-phosphate synthase (E.C. 
4.1.2.15) site 1 20 operator aroFo2 site 52 71 operator aroFo1 site 69 113 promoter site 75 94 TyrR box-1 BASE COUNT 309 a 286 c 343 g 297 t ORIGIN 1 gtgtaaagtt tttgatacga aatgagggag attaatcgac gattgcgcat ggtgtaaagt 61 ttattttacg aaatatggat tgaaatcttt actttttgtg gtatggttta aacatcctcg 121 tggaggatca actatcgcaa acaagaaaag acaggatcgc catcatgcaa aaagacgcgc 181 tgaataacgt acgtatcacc gatgaacagg tattaatgac gccggagcag cttaaagcgg 241 cctttccctt gagcctggcg caggaagccc agatagcgca gtcccgggga atcatttctg 301 acattattgc cgggcgcgat ccgcgtctct tggtggtatg cggtccttgt tctattcacg 361 atcctgaaac cgctctggaa tatgcccgtc gatttaaagc ccttgccgca gaggtcagcg 421 atagcctcta tctggtaatg cgcgtctatt ttgaaaagcc gcgaactacc gtcggctgga 481 aagggctgat taacgatcct cacatggatg gctcatttga tgtggaagcc gggttgaaaa 541 tagcgcgtca gctactggtg gaactggtga atatggggtt gccattggcg accgaagcgt 601 tggatccgaa cagcccgcaa tacctgggcg atctgtttag ctggtcggcg ataggggcgc 661 gcacaaccga atcgcaaacc caccgcgaaa tggcgtctgg tctttctatg ccggtcgcgt 721 ttaaaaacgg cacggatggc agcctggcga cagcgattaa cgccatgcgc gccgctgcgc 781 aacctcatcg ttttgttggc attaaccagg ccggtcaggt tgcgttattg caaacccagg 841 gaaatccgca tggccatgtg attctgcgtg gcggcaaagc gccgaactat agcccggcag 901 atgtcgctca gtgtgaaaaa gagatggaac aggcgggact acgtccttcg ctgatggtag 961 attgcagtca tggtaactcc aataaagatt atcgccgcca gccagccgtt gccgaatctg 1021 tggttgcgca gattaaagat ggcaatcgtt caatcattgg cttaatgatt gaaagtaata 1081 ttcatgaggg taaccagtct tccgaacagc cgcgcagcga aatgaagtat ggcgtttccg 1141 tcaccgatgc ttgtattagc tgggagatga ccgatgccct gttacgtgaa attcataaag 1201 atttgagcgg ccagctggcg gtgcgcgtcg cataa // LOCUS TRBANTAT 18598 bp ds-DNA INV 28-JUN-1990 DEFINITION T.brucei expression site variant specific antigen 4 - 7 genes, complete cds. ACCESSION M33720 KEYWORDS expression site variant specific antigen. SOURCE T.brucei (strain EATRO 1125), cDNA to mRNA, clone AnTat 1.3A. 
ORGANISM Trypanosoma brucei Eukaryota; Animalia; Protozoa; Sarcomastigophora; Mastigophora; Kinetoplastida; Trypanosomatina; Trypanosomatidae. REFERENCE 1 (bases 1 to 18598) AUTHORS Pays,E., Tebabi,P., Pays,A., Coquelet,H., Revelard,P., Salmon,D. and Steinert,M. TITLE The genes and transcripts of an antigen gene expression site from T.brucei JOURNAL Cell 57, 835-845 (1989) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by E.Pays, 20-JAN-1989. FEATURES from to/span description pept 1174 2196 expression site variant specific antigen 7 pept 3528 4733 expression site variant specific antigen 6 pept 8519 9772 expression site variant specific antigen 5 pept 14450 18157 expression site variant specific antigen 4 mRNA 986 2676 ESAG-7 mRNA mRNA 3483 5056 ESAG-6 mRNA mRNA 8260 11304 ESAG-5 mRNA trns 5305 5808 rime retroposon BASE COUNT 5298 a 3468 c 4707 g 5125 t ORIGIN 1 gcatgcgaag agcaggggtg caacggaaga gtgaaaggat tagaggaaat agtggaatta 61 gcatgattgg agattgtgag ggttaggaaa aacgtgacga gggaaatggt aactggacat 121 cgtgcgcgag gatggcgaag gcttgaagcg tgtagagagc gtgtgtaaat ggcccaaacg 181 gtgtgggtgg ggtaataacg cagagtgctg aggtgtttga caagttcccg cgaggtgctg 241 gggaggagga agcaagacag taaaatcgtc gtgagacgtg gctgtgctcg ggaatccgta 301 caaatatact gaaaattatt gatggttgat atgcgtggga acgacgcagg atgatgctgc 361 atcgccaaga tgcttcggga gaacaggaaa ataacgacga atcaacactg gtgagttgat 421 actataaaaa tcgcgtgggc tttgctgctt cgctttcata aatgccacct gcaatgaatt 481 tggagggtgg aggtgagctg ccctgtgtat gggtgggtat tgcagatgca gcggcgcttt 541 tgggtgattt ttggtttaca gtttgctgtt gttgggaagg ttggtggtgg caacaaaatt 601 ggagaaaatg atttcatctg gaaacctact tcaaaacccc ataaacaatc ggacctcgaa 661 gtgcaagtgt ctcccagaac gtaaacaagg ccttgaagac gaccagcaga tgaagcacag 721 caacagcgac agaggtggac tgagacgaaa gtataccata tgactatgcc acaagaattg 781 atgggtgatt ctgccaacca cggacgcagt caaaaacata gaaggagctg tagtgaatat 841 aaaaccactc atattttcta caaatattag agctttatta cttaagaacc tgttttagca 901 ccaaggttgc 
gtgtgtattg ccgttgctgt tacttattgt tgagttggta tttgaggaaa 961 atgttttttc aggtgcgtcg ttgttatata cggctgtgag ttggtgatgt ttatttgaat 1021 gatagcttga taggataaga tgaaataata ctaagcatgt attcttctgg cgtgtcgccg 1081 aaatggtaat ctatgttacc catagcacga aaaaattttt ttactcagta acgaagcgga 1141 aagttttaaa aagcataatt taaaaaagtg ataatgatga agttttggtt tgtgctgttg 1201 gctcttttgg gaaaagaaac acatgcatat tatgaaaata aaaggaatgc attaaatgca 1261 accgccgcta ataaagtgtg tgggctatcg acctatctta aaggaatagc gcacagagta 1321 aacagcgaga gtgccgtggt tacggaaaaa ctatcagatt tgaaaatgag aagcatccag 1381 ttgcagttat cagtaatgcg gaacagagtg ccttctggcg agcaggattg taaagacatc 1441 aggacactct tgaaaacagt attgaggaat gagtttacat tccagcagga gttggaggaa 1501 atgaggaacg catccgcgtt agcagcagct gcagctggga tagcagctgg aagactggaa 1561 gagtggattt ttgtatttgc acaggcagca ggcgggtcat cacagttttg cataagtgtg 1621 gggaagcata tcccggctga gcacggcaac ttgcaagaat gttttgatgg aataattgga 1681 cctgaaaccc tttacaaaat tgaggattca cgtgtaaaag agtcggcgca gaaaagcttg 1741 caactccatg aagtgttatc atccatttcc ttcagtagct tgggtgcgga aagtattgtt 1801 gagcaaggag agaacagagg atgtaaccta atgcggacgg ctgacggagg tttgctgaaa 1861 gatatttgtt tgaattgcaa cttcacatgg ggtggtggag tgatgaattt cggatcctgt 1921 gtggcgggga acctaaaaat aaaaggggga gaatacggtg atgtcagttc ccacgatgtg 1981 gtacggtgga ccgaagatcc cagtaaagtg agcatattta aggatgtcat ccgcctgttt 2041 gcgcgcttcc aagaagcaaa aaatgcagtg atgaataaaa taaaaactac tgtggatgaa 2101 ttggcgaaat gcatagggca gaaagaggtt gagctcacta atgatcagct ttacgaggaa 2161 ttcgaggcaa tacaaaagta cctagggtct ctgtgaaatg gagtaaaagc gaattcaact 2221 atactgcaga accggtccgg ggacatttca cgggagcggg gtccaacgcg gcagcagtac 2281 atttgagtgt ttctaccgct gcactttgtt tttcagtttt attgttggga gtgctgtgaa 2341 ggaaggatgc gacagaagct gcgctgctta gcgtgaaaga ttatggtaat ggagggttgt 2401 gaaagattgg gggaacaaaa acctatttct tttatttggg ggaacaaatg ggcaaaagta 2461 acgtaagttt ccagtgggag tggtatgtgt gtgtgtatgg ggctggctaa ggaaagatgt 2521 gagttcggca tgtggtatgt acaagctacg aaaacgtgtg aaacaaaacg agatgtaagg 2581 ggaaaatgta acaaccaact 
atgttaaatt tcaggagact atttttcaaa tttagttaca 2641 acaaagtaaa tgtcaaataa tgccaactat aaaggaaaac tagactaatt ctactgttta 2701 atgtgacgag aattgtggcg tgaaacctag taactgctta tcatggcgtc gaatttcact 2761 tctccagaat ctttatttta gtgcgctata gatgatagtc caaagagaag gaattacaat 2821 tattataacc catccaaagc tatgtcagag actcctcttc tccgagtgat tcttttctat 2881 gtcacttgtg gaaggtcgat cttggaacaa tatcgaacgt gtgaagcctg aaatggggcg 2941 tttttttgct tttttttttg gtggttgtta gagttatcta ccgctatggt atgatgcata 3001 gaggtaacgc taaaaataag tgacaaggtg ctttagttga gcgtatggta ggtggaagcg 3061 gctaatgatt tgtcggcacc accaacagag gaagttaatt gttgacgtag ccgtttgtga 3121 atgagataga aggttattgg gtggaacatt atcttggaaa gagtttttct tccactaaaa 3181 atgtgtagag tggtggtttg gagaaaatgg tttagatatt gaggaggaaa caaggatttg 3241 aggtgcaagt tgatgcgctg taagatttgt attgccgttg ctgttactta ttgttgagtt 3301 ggtatttgag gaaaatgttt tttcaggtgt gtcgttgtta tatacggctg tgagttggtg 3361 atgtttattt gaatgatagc ttgataggat aagatgaaat aatactaagc atgtattctt 3421 ctggcgtgtc gccgaaatgg taatctatgt tacccatagc acgaaaaaat ttttttactc 3481 agtaacgaag cggaaagttt taaaaagcat aatttaaaaa agtgataatg atgaagtttt 3541 ggtttgtgct gttggctctt ttgggaaaag aaacacatgc atattatgaa aataaaagga 3601 atgcattaaa tgcaaccgcc gctaataaag tgtgtgggct atcgacctat cttaaaggaa 3661 tagcgcacag agtaaacagc gagagtgccg tggttacgga aaaactatca gatttgaaaa 3721 tgagaagcat ccagttgcag ttatcagtaa tgcggaacag agtgccttct ggcgagcagg 3781 attgtaaaga catcaggaca ctcttgaaaa cagtattgag gaatgagttt acattccagc 3841 aggagttgga ggaaatgagg aacgcatccg cgttagcagc agctgcagct gggatagcag 3901 ctggaagact ggaagagtgg atttttgtat ttgcacaggc agcaggcagg tcatcacagt 3961 tttgcataag tgtggggaag catatcccgg ctgagcacgg caacttgcaa gaatgttttg 4021 atggaataat tggacctgaa accctttaca aaattgagga ttcacgtgta aaagagtcgg 4081 cgcagaaaag cttgcaactc catgaagtgt tatcatccat ttccttcaat agcttgggtg 4141 cggaaaatat ccgaggagga aatgggaggc atggatgtaa tttggtgaga acagatactg 4201 acggtgtatt ggaagggggg tcggtgagac ggcacaactt aacgtggggt ggtggagtga 4261 tgaattttgg atcctatcaa aacggaagta 
tgtacgttga gggtggtgaa tacggtgatg 4321 ctacagagta cggtgcagta cggtggaccg aagatcccag taaagtgagc atatttaagg 4381 atgtcatccg cctgtttgcg cgcttccaag aagcaaaaaa tgaagtgatg aataaaataa 4441 aaactactgt ggatgaattg gcgaaatgca tagggcagaa agaggttgag ctcactgatg 4501 atcagcttta cgaggaattt atctgggaga ccatacacag attggagctg tcaaagagag 4561 tgagtgaaca actctcttta ggtgaagaag aggagacgat cctgaaatct aactatactg 4621 cagaaccggt ccgtggacct ttcacggtag cggggtccaa cgcggcagca gtacatttga 4681 gtgtttctac cgctgcactt tgtttttcag ttttattgtt gggagtgctg tgaaggaagg 4741 atgcgacaga agctgcgctg cttagcgtga aagattatgg taatggaggg ttgtgaaaga 4801 ttgggggaac aaaaacctat ttcttttatt tgggggaaca aatgggcaaa agtaacgtaa 4861 gtttccagtg ggagtggtat gtgtgtgtgt atggggctgg ctaaggaaag atgtgagttc 4921 ggcatgtggt atgtacaagc tacgaaaacg tgtgaaacaa aacgagatgt aaggggaaaa 4981 tgtaacaacc aactatgtta aatttcagga gactattttt caaatttagt tacaacaaag 5041 taaatgtcaa ataatgccaa ctataaagga aaactagact aattctactg tttaatgtga 5101 cgagaattgt ggcgtgaaac ctagtaactg cttatcatgg cgtcgaattt cacttctcca 5161 gaatctttat tttagtgcgc tatagatgat agtccaaaga gaaggaatta caattattat 5221 aacccatcca aagctatgtc agagactcct cttctccgag tgattctttt ctatgtctct 5281 tgtggaaggt tgatattttt gctaccccct ggcgatgccg gccacctcaa cgtggtgcca 5341 gggtccagtg ccccgtatca ttgggggaag ccaagagcca gcagcgttcc tttcatgggg 5401 aacactgctg tgctccggct acggcatcat acagcacagg gatcagcagc gtcttgctgg 5461 gacaccgttt ttcatttgtc ggtccctggg cacgtgccag cgtgccatca gcagtatcat 5521 ccgcgctaag atgctgctgt ccggtgatgt ggacctccaa aaaaggattg ccatttggca 5581 tcttttggag agtccagggt gggaggcttc tcgccccatc tgctgtattc cgttcaactg 5641 cggagctaca acaaaaaatt atagagggtg tgttaggatg aatgaaaaag ggagactctg 5701 ccacagtcgc cagaccgata gcatctcagg gctctacggt gttggctgat ggccgcgcca 5761 gaggggggga aactctcacg aaggcacgaa gaaaattcaa aaaaaaattt ttgctaccac 5821 aggctgtaag gataacgctg gagatggagc cgactgtggt ttttcttgtg aatgcgtatc 5881 ttcgtaaccg gttgttaaga agggatcgga cgaaggaatg tgtctgagta ggaggtaatg 5941 tcgatgtttt ctttccattg gatgtagtaa taatctccaa 
aatttgttgg aaacaatata 6001 atattattgc aataaatact tgaaaagtta aattattgat ggattgactt gagtatcatg 6061 attccaatga acactctcat gatgcttaga atagcaaacg tatgaacaat caacgtagcg 6121 tcatgtagac atggacattt tttgtaatga tacgactgct attgcacatc tgaaaatttc 6181 tatgacggac cagtgccgtt aatcaccgta caattgaatt caggatgaaa aatgataaat 6241 cagtgctcta atgtatgatg cagctgtcaa ggcaatgttt ctgcacaaaa agagtttaac 6301 acactgccat ttgcatgcga ataaaacaat gagtatatga agcggaggat cgcatggcaa 6361 tgataaaatc gaacactgtg gctgcgagac gcagggccac ccggaagcac cgggtaacaa 6421 agcgagttca ggagtcgtcg aaagtggcaa aagtcagcac caaaccaagt tgctttttta 6481 aaaagatcag caaacgtatt tctcactgac ttatacagca ttcaaaggaa atgaacttag 6541 cagcaggaca agggtgcaaa ctaaggcatc gattctgtat ggctgcaaaa gtgaagacat 6601 caagcaaaca aaggtaactt atttaccctt ataggagcag tggaaaaagc agcggactcg 6661 tatggagata acctgacgaa aatactacca gttttagtca caggaagcga tatcctgtca 6721 aaatgccggc agccactcaa tggcggtcaa aacttgatta aacagttgaa aaaaacagcg 6781 gaatcaaaag gaaccttgcc cgtgctcacg gtggtagata aagcaacgaa aaatgtaaag 6841 gtagaattta acggtaacaa ttcgaacaaa accgtactat cgtgcaaaaa tatttcagca 6901 caaaatcccg acaaactttg cggcgacaac acagcagaca acatggccaa gtgcacgaaa 6961 gcttccaaag cagcttttgg cgctgatggt gtatatttca aaagaaaaaa gggcaaacct 7021 tgactcagca gctataataa agtcagctag aaccggcgaa acagcggaag cgctgagcaa 7081 acttaaaaca ctggacatta tcggtgattt tatattgcgt ttatttgtcg aaaagctcat 7141 ggacatgacc gtggatgagt acacgactca aagagagact aacgacaaaa aacgctgcag 7201 tggcaaaagt agcgtgcctg tacaggggac ggagaacttg ataataatgg taatattaat 7261 cttagaagaa tgttgtgagt atgtgtcact tttacagtga tagttgtgaa atattcggaa 7321 tgacagttgt aatggaggtg aaagaagctg ttaggttgaa tttggggaac catctatgta 7381 tacaaagaag cacaagggtt gtgatagctt atttatatga tggagaagcc atgcatggta 7441 aaaaaaaaca acaagtgata tgtcggtgtt tgtggctaat gtagcataaa gactaacagc 7501 tacagcagtg catcaaccga tggagagacg ctttcagaat tttcaaatgt aagaaagtca 7561 gtaaattaca aagaagtggt tgagttgata ctcagaaaag tgggactaac aaactgttga 7621 tgaatgaagg tgatgccata cgtaccaggt acacaggtca gcaatatgat 
gctcactgaa 7681 aaagagagat ggtgtgctac tggcgggatg tcgctggaaa agtcttagca agtgtagttg 7741 cagcaccgag cacacacact catgggagtt atcaaagtat gaaaacgcaa aggaagctga 7801 tgtgtgtgtg tgtgtttttg tgtgtgtttg caggaaacaa atgtatcagc ttcgacatct 7861 ttccgcaaga agtagaagtt tcagttgtgg tcactgacgc attacgtgga ttgttgggtt 7921 gaaacaacac aaatgtggca accatttagg atgttttggc gctttggcga cggttgtact 7981 gcagtttgta tgccccttca agaaacaagg aatggtgtga aagcgttgtt tttttttaag 8041 tcaagtatgt gcattcaaga aatcatatca cagtgttacg cagtgctttg tcgtcatggg 8101 aggaccttcc tgcattgaga gtgcatgtaa ggaaacggca acaggatgct atggctccga 8161 atttttttta agtatctgct tttcagacca cttgcgttga tcgacttcag ttatgttaca 8221 ctccgcacat cttttttgct atgttttttt ctactttagt tgcacttagt gtagccaaca 8281 ttaacttaaa gtacaaactc tatcctaagc aggattcggt gttatttcta ttcgacgatg 8341 tcattggtat cgagcgcacc cgttttggtg ttcatttccg ttgtaacact tctgtgttgt 8401 atcatcggct ctgggcaacg gtctagtagt ccctttacgg aaccggcact caaaccaggg 8461 ggcattaagg ttgctattca agaggcggct gttacgccat tgttaccagt acttgcgtat 8521 gagttcgaaa ggtttatgga gaatatcacc attcccgagc aaaagggtta accgtgtgag 8581 tgtagaagag acttattttc gtaacgtgac ggtcggaagc gcgacagtta aattcgaaga 8641 accaaataaa atcattttga agttttggaa cgtgagcgct actgtgccgt ttacgcggtt 8701 tgtataccac tcgttttggt gttacctgta tccttgcagt gggtcggcac aggcggaaat 8761 acgcaacggc agtgtggcac tgtggctcga cgtgtccgca ggaagggggg gactactaga 8821 catacatgta gggagttcgg aaattggcat gcgtgaccca ctgataactc ttatcggtga 8881 agggaaatca aagatgccta agtggcttgg cgggagagtt aaagatatgt atgataagga 8941 tgtgcttccc aagcttgggc atcatataat tactgcggtg aatcaaattt tagccaacaa 9001 agcaaaggag atatttcata tgtttccaat tgtctttgtc aacagttcta aaattgaata 9061 tggacaaatg agacttgaac tggtcgtgct tccagatgcc gcagacaaat taatgcttac 9121 ggagaatgtt tttttcccgc ggcacgcttt tccaaacttt cctgttgctg ttgtttcttc 9181 gtttacggct ttgaataata tgcttcgttt gatgattaaa agtggccact tggtggtgcg 9241 tgtgcccttt ccgctgaggt atgttatctc cagcaacgcg ctcagcgtca gttggaccgt 9301 cttttgctct ggatgtgcct ctgaggctac attcgaacta aaaacagccc cgtggttgaa 
9361 gtccctaaat aagaaactgt ttacatttaa ctatcgggat gtcgaagttg ccgtggacct 9421 gctgccaagg ggtggtgccc ccatttcctt gtttagtatg ttaatgaatg tgtctgccca 9481 ggcggcgcat atcgctcttg ttgacggtgc cacccatgcg aatttggatt ctgttgatac 9541 taatgtgagt gtcacatcct cacgtatcga tggcctggac tcaagtacaa tgaacacaaa 9601 aattcgtgat ttgatcaact tgtcttggat caacctgaat gttacatata catttcctgc 9661 tcccttcgac ttgtgtacga aacacgtgaa tattacaagt gaatgctatg tagcgggatt 9721 taaccttgtg cgagctcttg gttcactcag tatactccct cacctccgat gaaggattgc 9781 gacattcctg gtcactttgt aaaacacact tccatgcgct tatgtggaag ggggtaacac 9841 tcaaacctat cttgctttcc cctgtttacg cacaacttca catacgcact ttctctttta 9901 cgagatgttt gcatgccgca acaaccaccc tgcgcaatcg acgagaagct atccgtgttt 9961 acgggtgatc cttcgaatta tggcggagga gtaaaaacct catgatatgt gaagtgctgt 10021 ggctgaagaa ggaaataaga taaaaagggt atttttcatt gacccgacca ttactggttt 10081 tgactttttg cccgaggcga gaaggcaaat ccattgttga ttggctcatg aatgtttccg 10141 tgactatacg cgccaagcct tggatatgtc gtgggctttc ggcatatgtt gataatataa 10201 cgactgccag ttggctgagt gacggctcaa agtgttagag gcgagtgtca atggaagata 10261 tatctattga tttggtaaat aatattggat tcgagattaa ctgttcaagt gtgctacaac 10321 accagatagc atgttcgtag tgtaggggtg gcaatagaag tgatggattt atcacgacta 10381 attgttctat tggcaacgct tgtgtctcgt gactgctatt ggttgattgc tgatgtttcg 10441 atttgagttg tttcgtggcg gcacgaaaag tatcatcatg tgaagtaggt taaggggaaa 10501 agaaacagat atataaagag cctttcgtgt tgctcccaag gaattgtata aatctctgag 10561 cgagatttgt gctcggtaat gggttcaaga gtgtgtatat acaaatacta tattaagagt 10621 ggtgattgag accaagtgga gggtcgaaag agatccactt aaagccatgg aaagtgtgcg 10681 gttgtagagg gtagaaaact aagaaaaggc aaatgataaa tagtattata ggaaatgcag 10741 agtgatagca agagagaggg agatggagta accaatggtt taaaaacaag catgacgatg 10801 ggcagtatta tacccataag gatcaaagag ggtcaaaaaa atacaaaatg ggttcagtgc 10861 acaagagatg tgaagatgct acaactgata cggagaagcc tagaatgtga aaattcagaa 10921 aataatacta tatgaaaata gtgagagtga caactcacaa tacaaaacaa actgatacgg 10981 gagaagatta tgctaaacat aaaagtgaaa aaagctagaa tataagctga 
taaaacacca 11041 taaactacat tgatttgagt ttcatgatat aaaacgtgga tgctctattc agcagaggat 11101 tctagtaaat tttttaatat atcttaaagt aatagtgaac gtagaataaa tttctgttcc 11161 aatacacaga cgagaaaaat aaagaattga acgggtagca aatttaagta acttgcgaca 11221 atatttttgc tgaagctaac ttctaaaaca aaaatgcaaa tacgtgttca aaaatatagt 11281 gaggaggata atcttttaat tacaaacgca tacatcatga taatttcaag gcattgtttg 11341 catatgtaaa cattgcttta ttaactgagt gcagttttct ggtaacctat ttcttttcat 11401 agttttcttt cataaccctc tctaatattt taaggtatat taggaaatga aatttgttac 11461 cttgtgaact ttactattac caattatatc acatttagtg tgtttaagtg caatatacct 11521 tcaagatgaa gaagtaagtg acatgacaaa tatgacacat aatggtacta agaaacccag 11581 agactttgtg gttgtcgttc aaactcatgc aacacaagga tggtgtaggc tcatattgag 11641 tgcgaaagta agccatgtgg aaattattac aattgaaatg ggaggtttgt acagtcatgt 11701 caaaaggcct aaatgggtta tggagttttt agagaaacag aatgcaagtg atgaggatat 11761 tttaatagct aaggatggat ctgatgttat agttggtgat ggaaataagt atgagagttc 11821 tgtggaatat tttgtgtaaa aaagcaccct acagtaagga caaattttat gaagaggata 11881 ttcagaaaaa ggatataggg taatgttatt gatttttgag atggccacaa aagtggagga 11941 aaagaaaagt ctaccaactc tactaaaatg gcctttaact caaaaagcat acttaaatgg 12001 aggtggctta attgccagag tttgggcatt tagaaggttt gaatatgcat ttggagagtt 12061 actaaaggga agtgaggagt ggtggtcaga gcggagtatt tacaaacctt tgttaatatg 12121 gagtgcaatt caggaagaag ctgttggaca gagatttgtt ttgaagagag gaatgattgg 12181 attggattac gaagagagat ttttttctat agaaagtagt ggggtaattg gagaagcccc 12241 attcattcat tttcctggac agcctattgc ttgggaggag aaagcgagac tgattgtgaa 12301 aaatcttagt tgatataaaa ggttgaaagg aagtgaattt aattcgaagg atatggggaa 12361 aagtagaaac ttatgtagta gatggagaga attgggaatt tcgatatgaa agaatttgtg 12421 gggacgcggt caaggagaat gacttgttta aggcaaagaa actgtaaggg atacaaaact 12481 aaagattttc ttgcggctga gtcagtagtc taagcaaatg tgggaccaac acgcatggcc 12541 aagatggccg cacaagtcta tgttgacatg aaaatcctgg ggcagcaaca aagagaagct 12601 ggaagacaaa cctcaggggt accctaacta ttcggctgaa atggtatgtc gtgacggcaa 12661 tatcgatatt ggtttcgggg gtggtaacac 
gatggacgca aaatttagaa gcttctatta 12721 cctctccata aaccatgatc tcaaagctag agaagcacaa aacaataaca caactagtga 12781 cactatgcgt tttttatgca cctgcacaga agaagtaacc gacgcttgga ccgaagatgc 12841 gaatacatat tcggcatctg cagtataaaa ggtaattcag ctgctagaag gaaaaaattc 12901 ctgaagaccc cgctgtgcca gagatatgta aacaactgtt tgaaatacat gcactcaata 12961 tgtcactatc agacaagacc tggcaggaca agttcataaa ggctcaacga agagccactg 13021 cagtggggtg acgaagccaa caaacataag accactaata cggagtggca cgaagtgtgg 13081 cagcgctggg tctaagccag gcaaagacta aaaaataacg aatactgagg gcgataaagt 13141 aaggaaaaat gccgcctcta aatggcaaag ttcaaaagcg gcacaggccc gagtcaagct 13201 aaactctttg ctgatccatg cggaagcctt gaacgctgag ctggaaaaaa agggcaaata 13261 atggctaacg gcaaaaaaca atagaagtgg acgttcttgc agccctctac ggaacgccca 13321 cggcaaaagc agcgtttggg agcagtggct tattgtcatc atcatttttt tgttgttttc 13381 gccactagtg gttttctttg tttgcgtatt tgatatctcc ttcgtcacta ttcttatctt 13441 tatcgtctgg aatatgggtg agatatcatt cccggtgtgt tttgcagtca tatggcgtaa 13501 catccacctg tgttgatatt ctttacagca tatcttacat gcttctcttt ccaacagtgc 13561 ttcctgtagt ggtgtgcttg gctccccttt actcttctcc ggtatgcact tatatctggt 13621 gagccaggtt ttgcttttta tagacctctt ggaagcagag acatttgtat gcttctttta 13681 ctttctcatg ttcaacttcg ctgcttattt tcctgcattt ccgtttgttg tgtcacttca 13741 gtttgctttt ctacgatgtt taaggcgtat atgttcctag tcaactagcc atacgtggca 13801 tactgctttg ggcaacattg ccacttgtag acttttttac cgaattatct tcttctcgct 13861 tccttcctct tttgacgctc tgtccttact actggagttt taggaggctg ttgtggcatg 13921 tgccttgtgc ttcttttatg accaacaagt gttattttta cagtcgtttt tgcactggtc 13981 gtgtgtggcg aaaaatacgc ccccgctcct ccaaccaatt agtggcaaag gggcggctcc 14041 gttgaggcag cactgcttgt cctcttaatg gtacattgtt gttttaccta tcgctggttc 14101 actccaccgc acgcggtgca taccttggga gttctgtact gttttggggg tggttttgtg 14161 cattgtaaca gccgcctctg acacaacaag cacagtttca cgcctgcgtt actggcattt 14221 tatgcctgtg gtggaacaac gtgtgtgttg acgtccgtaa atcttaatat ggcttgttgc 14281 gtgctaccgt ttatattttt cccccttcga tatgggcatc ccgttcttgt cgttacgtgg 14341 ttcccagttt 
cacctaacaa cccttgccct ctcttaccca gcagccgctc cctttcgttc 14401 gttccagtgt aacaaccact ttcacgcgta gactatcaaa atcactgcga tgaatatgct 14461 tcacttgagc gaccgcaatg cctcactcgc accgagtggc ggggaacatt cccttccgac 14521 gggaggagct gtgtgccgtg acgccatgga catacttccc gtgatactga gagctccggt 14581 cgcgttgtta cttctgctcg tcgtcctgcc gcaattgtca gtgggcgccg aggctaatgt 14641 gacagtcaag gttctctccg ccacgtggaa ctggtacatg ccgcggaagt atgtaaccgc 14701 gattaatgct ggcttcaacg cctcactgaa gtcacgaaac tggacagtcg ctggaagcgt 14761 caacgttcag gtagtttacc ccagcaacct tgacctgatg cccgaagatt tcattaaaaa 14821 gcagttggaa ttggaaactg accagaataa gattgttata gtgtatggac ccctcggtga 14881 caaaagtgta atgcactcga ttccacattt gatgaatcac agagtagtgg cctttgggtt 14941 gataactggg tccactttta tccgacaatg gaatccgtac ctttactttc tgcgtgccga 15001 ccccgctgct gaaacattgg ttctcattcg gtattcgcta tgccagttgc gcgtgctgcg 15061 cttgggcttc atgtaccttc agggtgttca ttatggcgac gaggagtatg cattgacagt 15121 gaacgtgatg tctcggatgg gttatgaact gcatggtgtg ttcaccgtga tgagtccaga 15181 cggtaaaccc gctcctgatg ctgagtttaa agaggtgttt gagcgcttcg ccactgccct 15241 tcctcaggcc attattgtct ttggtgcgcc agtggatgat accgcgaagt tcttgatgat 15301 gatggcagtg gatgagcgaa tcgctcggtc gtatattctt agtccttcat ctgtccagtt 15361 gtcactcata gaaatgtggc aactcgctct ggaagctgca ggggcctcat tcgcgcccgg 15421 acagctactt ttcacaggca ccaatccgct tgcgaaggac agccagtaca cagcaatcaa 15481 gcgttttcaa gaagttatga gtgaatacct aaaggctcac gtcggcgaaa ccaacatcac 15541 tgaagcagat tatttcctga cccacgatct tgagggagag ttgatggtgt atgggtggat 15601 ttctggggag gtgttatctc aggcacttag caaccttgaa tggctcaagg accgcgctac 15661 atttgtcagg tcgctatacg accaacggcg ctacgtcatc aatgacattg tgatcggtga 15721 ctatggcggt acgtgtgagg gggacgcggc caagcatggc gccacctgtg agtgtaacca 15781 aggcagcaag gcggtgtatg tgaaagagat gttggaaaat gggcagaaga cgtcggtgcg 15841 aagtggcttc acagttctga aagcttcgct gtgctatacg gattcttcgg agcttcacgg 15901 gccactggat ggactcgtcg ttttcatgaa agacgacgac atagcgtcga aagcagctgc 15961 gctgtggcag aaggggacct cccatcttgt cggcaagggt gatttgggat actcggacag 
16021 gttctttctg cacgcattta atactaccat agctgaggca gcgaacgatc ttcggcgtga 16081 ccaaggtgaa agaatcgtaa ccgctgtgtt tggccccgtg accgaggcga tgttggacac 16141 gccgaatata acctttatcg atcctctgga acttaaacca cggctgaata agttcagaag 16201 gaatgtgatt catctctcgc ccaccctgga acaacaactt tacgtccttt catcgtacct 16261 cgcgggcgcc ggggtcggta atgttgatgc cgtgatctgc agcaacgaag cggatggaat 16321 tgcagatttt ctgcggagtt cactaactga attcgctgtc tcgctacgtt ccgctgtcat 16381 acgcgaagat ggggaagatg ttggtaaata ccttccgatg agtgggaccg tgtttgtcat 16441 tggcctttcc gttccagatg tgaaggaaat cgcgaggaaa cttgaagaac gcaacgattt 16501 gcgtgtgatt gttctcttcg gggaattttc attcctgtat gatctgtttg ccacggccct 16561 caataacacc gcaggcgccg cacgtcttgt gtttgccacc agcttgccac attggggtga 16621 tactgaaacc tcatcgaaga ctgctcaact gttccatgat gttgaaaaag actcacggtt 16681 atggacaccc ctatcagttc tggcattcgc aaccggtcga ttgatgcggg taatacttct 16741 tcatgtagaa gaaatgagtc cagaaacgtt ggtaaatttc ttctataccg attcttccat 16801 cgtctccgat gacatgcgct acggtgtatt cgacgacaca aaatgcgtcg acaccgcaaa 16861 taaactttcg aagaatggtt gtgcctcaaa ctatggtgca acgcagatat cggtgtggtc 16921 tatggcccgt gctctgaacg cttctattcc tccgcttaca aatcccatga caccgtcaat 16981 gactttcaga aactctaacg cgggaaggat ctccggggcg tcgctggtgg gcatcatcat 17041 cggtggtgcc cttgctttgt ttcttgtggt ggctctgggc gtggttccct actttttcct 17101 gcgcaacacc gtgataacaa tctgcaccaa agatgacaga cccgtaacgc taatatttac 17161 tgacattgag agcagtacgg cgttgtgggc tgcacaccct gaggtgatgc ctgatgccgt 17221 tgcgacacat caccgcttaa ttcgcacatt gatttccaag tatgaatgct acgaagtcaa 17281 gactgttgga gattctttta tgatagcgag taagagtcct ttcgctgccg tccaactcgc 17341 acaggaacta cagctgtgtt tcttgcacca tgactgggga acaaatgcga ttgatgaatc 17401 ctaccagcag tttgagcagc agcgtgcgga agatgacagt gattacacac cgccaactgc 17461 tcggttggac cctaaagtgt acagtcgttt gtggaatggc cttcgagtgc gtgttggaat 17521 ccacaccggg ctgtgcgaca tccgacgcga tgaagtaacg aagggatatg actattatgg 17581 acgtacatcc aatatggcag caaggacgga gagtgtagca aatggtggtc aggtgctgat 17641 gacgcatgcg gcgtacatgt cgctgtcagc tgaggagcgt 
cagcaaattg atgtcactgc 17701 acttggtgat gtgccgctac gtggtgtgcc aaagcctgtg gaaatgtacc ggttgaatgc 17761 cgtgcctggt cgtacctttt ccgttctccg tcttgagctt gagcttctta acgatgatga 17821 ggatcagact acaacttcct gtagtgacca cagttcttca cggacagatc tgagtgtagc 17881 tgcacaaact attgctgctt cccttcagtc gctcttgggt accttcaccc ctgctcagcg 17941 ccagaaggca ttaataccct tttgcgaacg gtggcgtgtt ccgcttcctc aaaaggtcgg 18001 gaacgtgtgg gacgatgacg gttgtcagga agttgtacgg cgtgttgctg caaaggtggg 18061 ccgtgttatg gattttggaa cgaggaagcc ctccagcagt gtgacatcgt ggaaaggggt 18121 ggaagtgtct tctcaggtgg aggagcggct gctataatgc ttgcttcatc ctcaagttcc 18181 tcctgcgttg atgggcattg tgttaccgta caattgattg acttggaaaa tgacagcgct 18241 accacctaat attaccctca tatcattgct gtatttgcgc tctgctattg aagtgaatga 18301 atgagttatt ttatatgatg tgtgcaatct tttctttata ttcaattcgt ataaagtgga 18361 cattataact ttccctcccc attcttgtgc ctgaaatgat cttctcaaac ggtaaatgct 18421 ttaaatacat ttgtacatga agatggtttt tgtttctgac atggctactc tccttttgag 18481 aactgtgttt gaagattgta cattttcttg tgtttgatgt gcctttgaat gtgagtatcc 18541 tcgccacttt tgataatctc cctcgtttaa ataatatatt tattgactga ataagctt // LOCUS BABUGU2A 680 bp ds-DNA PRI 28-JUN-1990 DEFINITION Baboon U2 small nuclear RNA gene, complete cds. ACCESSION M33777 KEYWORDS U2 small nuclear RNA. SOURCE Baboon (P.hamadryas) DNA, clone lambda-Pu2. ORGANISM Papio hamadryas Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Cercopithecidae; Cercopithecinae. REFERENCE 1 (bases 1 to 680) AUTHORS Matera,A.G., Weiner,A.M. and Schmid,C.W. TITLE Structure and evolution of the U2 snRNA multigene family in primates: Gene amplification under natural selection JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by A.G.Matera, 20-APR-1990. Author address: A.G.Matera Dept. 
of Chemistry University of California Davis, CA 95616 FEATURES from to/span description uRNA 191 378 U2 small nuclear RNA signal 99 114 upstream control element I signal 136 157 upstream control element II signal 379 420 3' processing box BASE COUNT 148 a 159 c 224 g 149 t ORIGIN 1 ggatccaagc acgggtgtaa gatttccctt ttcaaaggtg gagaataaga aaggagcccg 61 agtgtgtaac ggcgtccata gtggggtgga cgagacagag gggatggggc aaggagcgag 121 ggtggggctc tcaccgcgac ttgaatgtgg atgagagtgg gacggtgacg gcgggcgcga 181 aggcgagcgg atcgcttctc ggccttttgg ctaagatcaa gtgtagtatc tgttcttatc 241 agtttaatat ctgatacgtc ctctatccga ggacaatata ttaaatggat ttttggagca 301 gggagatgga ataggagctt gctccgtcca ctccacgcat cgacctggta ttgcagtacc 361 tccaggaacg gtgcaccccc tccggggata caaccgtttg ctaaaagtag agggaggtga 421 gagacgacaa caccggcggg gcggctcgca cgccaagcgt ctgtgactac gtcttgccgg 481 ctcaactgct tcgttggagt accacggggt tcctgagctg cgggcggcag acggtaggct 541 tcgggcacgc ttccatttac gctgtgactg cagcgctttg ggaaggctac gaccccccgc 601 agccgaccga ctcctttgga ggtcgttagg atctcagctt ggcagtcgag tggtggtgac 661 cttttaaagg aatgggatcc // LOCUS BOVLMCB 326 bp ds-DNA SYN 28-JUN-1990 DEFINITION Synthetic bovine liver microsomal lipase-solubilized cytochrome b-5 gene, complete cds. ACCESSION J02905 KEYWORDS cytochrome b-5. SOURCE Synthetic DNA. ORGANISM Artificial gene Artificial sequences; Genes. REFERENCE 1 (bases 1 to 326) AUTHORS Funk,W.D., Lo,T.P., Mauk,M.R., Brayer,G.D., MacGillivray,R.T.A. and Mauk,A.G. TITLE Mutagenic, electrochemical, and crystallographic investigation of the cytochrome b-5 oxidation-reduction equilibrium: Involvement of asparagine-57, serine-64, and heme propionate-7 JOURNAL Biochemistry (1990) In press STANDARD full staff_review COMMENT Draft entry and printed sequence for [1] kindly submitted by G.Mauk, 16-MAY-1990. 
FEATURES from to/span description pept 27 311 microsomal cytochrome b-5 binding 16 19 ribosome binding site BASE COUNT 106 a 82 c 77 g 61 t ORIGIN 1 gtacggtacc gtaaaggaga acaacaatga gcaaagctgt caaatactac actctggaac 61 aaatccaaaa acacaacaac tcgaaatcga cgtggctgat cctcgactac aaagtatacg 121 acctgactaa attcctggaa gagcacccgg gaggcgaaga agtcctgcgc gaacaggccg 181 gcggtgacgc gactgaaaac tttgaggatg ttggccatag taccgacgct cgtgaactgt 241 cgaaaacgtt catcatcggt gagctgcacc cggacgatcg ttctaaaatc accaaaccgt 301 ccgaatctta atgagaattc gtctga // LOCUS DROEST6P 5424 bp ds-DNA INV 28-JUN-1990 DEFINITION D.melanogaster carboxylesterase 6 and P (Est-6 and Est-P) genes, complete cds. ACCESSION M33780 M33781 KEYWORDS carboxylesterase. SOURCE D.melanogaster (strain Dm145) adult DNA. ORGANISM Drosophila melanogaster Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta; Pterygota; Neoptera; Holometabola; Diptera; Brachycera; Cyclorrhapha; Schizophora; Drosophiloidea; Drosophilidae. REFERENCE 1 (bases 929 to 4027) AUTHORS Collet,C., Nielsen,K.M., Russell,R.J., Karl,M., Oakeshott,J.G. and Richmond,R.C. TITLE Molecular analysis of duplicated esterase genes in Drosophila melanogaster JOURNAL Mol. Biol. Evol. 7, 9-28 (1990) STANDARD full staff_review REFERENCE 2 (bases 1 to 5424) AUTHORS Collet,C., Nielsen,K.M., Russell,R.J., Karl,M., Oakeshott,J.G. and Richmond,R.C. JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by R.C.Richmond, 19-APR-1990. 
FEATURES from to/span description pept 1173 2559 carboxylesterase 6 (Est-6) precursor (EC 3.1.1.1), exon 1 2611 2858 carboxylesterase 6 precursor, exon 2 sigp 1173 1235 carboxylesterase 6 signal peptide matp 1236 2559 carboxylesterase 6 2611 2855 carboxylesterase 6 pept 3052 4438 carboxylesterase P (Est-P) precursor (EC 3.1.1.1), exon 1 4495 4742 carboxylesterase P precursor, exon 2 sigp 3052 3111 carboxylesterase P signal peptide matp 3112 4438 carboxylesterase P 4495 4739 carboxylesterase P pre-msg 1132 2867 Est-6 mRNA and intron (alt.) pre-msg 1132 2998 Est-6 mRNA and intron (alt.) IVS 2560 2610 Est-6 intron A IVS 4439 4494 Est-P intron A signal 1104 1110 TATA box BASE COUNT 1536 a 1117 c 1235 g 1536 t ORIGIN Chromosome 3L 69A1-5. 1 aagcttgcta tatatctatc tgtatatacc tatttgctta gaccgaactt aatgattttc 61 cctttttcca aaacttacac aaatatatac ctaatacctt acactgaaaa aactgttcaa 121 ttcactatgt acttattagt agttcttata aaccttgtta ttactaatta cgtacgaaaa 181 cagaacggga aataaatatc taaaaattta aatagtaaac tgagctgatg tttctacaca 241 aggatgaaca caagtatcaa agtgctcata ggtacagcat ttctagaaca caattacacc 301 agtcttttgc tagcaataaa ttttcactca tctcccactt ttctctagaa ctatcctgaa 361 tttatttatt tgattctatt ttgtcgcaag cacactcaat tcttgatctt tttaataata 421 atcgtagcgg aaagctgtat ctgtacaggc aagaccaccg cccctcaaaa tgccaacgaa 481 ctacagtcgg attcaaaagc gtgcacctgc aagtgcaagt tgtatgatat cttgcaccat 541 tttatttcga tacacgtttt ggtacttcga tttaatatgg ctaattgtta ttcagaaaat 601 aaacacaaaa acttaggaca catgttcaaa aatagaaata tgtacatatt tgttgcaaac 661 cttaaatctt attaattatc tttatctata atcaaaaagc ctactcattt gttaaatggt 721 ttcgttttaa agtgcattat caaattcatc tggataaaca ttcaaacaaa aaattgcatg 781 catgtaaagt aattaatcaa gttgactcga agcgattgtg caatataagt ctcacctgaa 841 gtgggattgg caactgcttg ggatgcatgt cgaaaatcta tattaagccc agtcaaatat 901 tttaagcgta aagtaacaaa cctaatttgt atacggctat cgttttaatt cgcacacgcc 961 tatcaactgg atgatgttca cactagagtt actcccattt gaaagccggg cattggaaaa 1021 ctaatctcat gcgtgcccag atctcaattg agactggttg actggatgtt 
caggtggccg 1081 ggtggcgata agccgatcga tggaataaaa ggggccgcaa ttgccgcatc tcaagatagt 1141 tgcggtctga attcgccgga gtgaggagca acatgaacta cgtgggactg ggacttatca 1201 ttgtgctgag ctgcctttgg ctcggttcga acgcgagtga tacagatgac cctctgttgg 1261 tgcagctgcc ccagggcaag ctacgtggtc gcgataatgg aagctactac agctacgaat 1321 cgattcccta cgccgaaccg cccactggcg atctacgatt cgaggctcca gagccgtaca 1381 aacaaaagtg gtcggatata ttcgatgcca ccaaaacccc ggtggcgtgc ctgcagtggg 1441 atcagttcac gcctggggcc aacaaattgg taggagagga ggattgccta accgtcagcg 1501 tctacaagcc gaagaatagc aagaggaata gctttccggt ggtggcccac attcacggag 1561 gtgcctttat gttcggtgca gcatggcaaa atggacacga gaacgtgatg cgtgagggca 1621 aattcattct ggtgaagata agctatcgcc tggggccatt gggtttcgtg agcaccggcg 1681 atagggatct tcccggaaac tatggactga aagatcaacg gctggctctc aaatggatta 1741 agcagaatat agccagtttt ggtggagaac cgcagaacgt actgttggtt ggtcactccg 1801 ctggaggagc ttcggtccat ctgcagatgc ttcgtgaaga tttcggccag ctggccaggg 1861 cggcattctc gtttagtgga aatgctctag atccatgggt tatacagaag ggagcaagag 1921 gacgagcctt tgaactggga cgcaacgtgg gatgtgaatc ggctgaagac tcgaccagcc 1981 tgaagaaatg cctaaagtca aagccagcca gtgaattagt caccgccgtc cgtaaattcc 2041 ttatattttc ctatgtgccc tttgctccat ttagtcctgt attggagcca tcggatgctc 2101 cagacgccat tatcacccag gatcccaggg atgtcattaa gagcggaaag ttcggacagg 2161 ttccgtgggc tgtttcctat gtcacagagg atggtggcta caatgccgcc ttgcttttga 2221 aggaacggaa atctggaata gttatcgatg atctaaacga gcgttggctt gagttggcac 2281 catatttact attctaccgg gacacgaaga ccaaaaagga tatggacgac tactcgcgga 2341 aaattaagca ggagtatata ggcaatcaga gatttgacat cgaaagctat tcagaattgc 2401 agcggctatt cacggatatt ctcttcaaga atagcacgca ggagtcattg gatcttcatc 2461 gcaaatatgg aaagagtcct gcctacgctt atgtctatga caatccagcc gaaaaaggaa 2521 tcgcacaggt cctggccaat cgaaccgatt atgattttgg taaggaaatc gtacttttaa 2581 atggacttag ttaaatcatt acttttatag gaactgtaca cggtgacgac tactttttga 2641 tattcgaaaa tttcgtacga gatgtggaaa tgcgtccgga tgagcagata atttcgagaa 2701 attttatcaa tatgctggca gattttgctt cgagtgataa tggctctcta aaatatggtg 
2761 aatgcgattt caaagataat gtaggtagtg agaaattcca attattagct atttatattg 2821 atggctgcca gaataggcag catgtggaat ttccgtaagt tacatgaata aaatcaaaaa 2881 tttttcgttc tgtgtaattt ttaattattt aattattctc aactggcttt aaatatcatt 2941 tgtacaaaac gtgtttgtcg ctttatattt tggtttttgt tgtttcttta taaagaataa 3001 taaacgtttg tttacccgca gtttgattag tactttgtgg agttccagat catgagtata 3061 ttcaaacggc tgttgtgcct gactttgctg tggatagcag ctttagaatc tgaagctgat 3121 cccttgattg ttgagataac aaatggaaaa atccgtggca aagataatgg gttgtactac 3181 agctacgaat cgattcccta tgccgagcat ccaactggtg ccctccgttt tgaagcacct 3241 cagccgtata gtcatcattg gactgatgtt ttcaatgcca cgcagtctcc agttgagtgc 3301 atgcagtgga atcagtttat aaacgaaaac aataagctga tgggtgatga ggattgctta 3361 acggtaagca tctataagcc aaagaaaccc aatcggagca gctttcctgt cgtagtactc 3421 ctgcatggag gtgctttcat gttcggtagt ggatccatat atggacacga ctccattatg 3481 cgtgagggaa ctttgcttgt ggtaaaaata agctatcgtc ttggaccatt gggttttgca 3541 agtaccggcg atagacactt gccgggaaac tatggtctaa aggatcaacg tctggcccta 3601 caatggatca agaagaacat tgctcacttt ggtggaatgc cagataatat tgtgctcatt 3661 ggtcactctg caggcggtgc ttcggctcat ttgcagctgt tgcacgagga tttcaaacat 3721 ttggccaaag gagcgatttc ggtgagcggc aatgcattgg atccttgggt catacagcag 3781 ggtggacgac gacgtgcatt tgaactgggt cgtattgtcg gttgtggaca cacaaatgtc 3841 tccgcagaac tcaaggactg cttgaagtct aagccggcta gcgatatagt ctctgctgtc 3901 cgaagcttcc ttgtgttttc ctatgtaccc ttcagtgctt ttggacctgt tgtggagccg 3961 tcagatgcac cagacgcctt tctaaccgag gacccaagag cagtgattaa gagcgggaag 4021 tttgcccaag tcccttgggc tgtgacgtac accactgagg acgggggata caacgctgct 4081 cagctgttgg aaagaaacaa attaactggc gagagttgga ttgacctact caatgatcga 4141 tggtttgatt gggcaccata cttgctcttc tatcgggacg ccaagaaaac catcaaagat 4201 atggatgatc tttcatttga tctcaggcag cagtatctag cagatcggcg attcagtgtg 4261 gaaagttatt ggaacgtgca gcgaatgttt actgatgttc ttttcaagaa tagcgtgcca 4321 agtgcaatag atcttcaccg aaagtatggc aaaagtccgg tttattcttt tgtctacgat 4381 aatcctaccg attccggagt gggtcaattg ctttccaatc gaacagatgt acattttggt 4441 
gggtacatac ttgcgtttca aatacttaaa gctctagctt tcttattatt aaaggtactg 4501 tccacggaga tgactttttc ttgattttca atacagctgc ataccgtatc ggcattcgtc 4561 cggatgaaga agttatttca aaaaagttta taggtatgct ggaggatttc gcactcaacg 4621 ataagggaac attaacattt ggagaatgta atttccaaaa taatgtgaac agcaaggaat 4681 atcaagtgct gcgtatttca cgaaacgctt gtaaaaacga ggaatatgct cggtttccct 4741 aaggaataaa tgcatttaaa aaatgtaaac agtataataa tgtaattaaa gaacaaattt 4801 attacacctt tatgccgcct caactgtcag aggtttttca ataagttcgt cttcttcttc 4861 tgggtcgttt tcctcgccct ccacattccc atacatctca atcttgctgt cgaggaagcg 4921 aaaatccttg aggatatgca acttgttgtg aaaccagcaa tggaaatgaa tttaatggcg 4981 atgtttgcca aacagcagag tagcaggcag tgtgctccac gggaatattg gcttgcagta 5041 ctccatttag agtgtaggat tgcacatgat tgtgctcacc accgattaag acacaatcgt 5101 ccacaaagtc acatagatgc acgcgtccgg ggaagctaaa gacacatgtg ctttccatgg 5161 aacgcaggtg gaaaattgaa gctttgggtc caccaccgca gagcagccaa tcctcgttga 5221 cagccactgc tcctatccat ttgccccaat ccggacgtaa cagatttggg ttcttgtaag 5281 gctccagcat tgaagtgtgc tgctgctgct tggtgctcca cactcgcaca gtgccatcct 5341 cggcgccaga gaagatctgt ccgttggcat tgcccaccac actgtgcacg tagtctgtgt 5401 gtccgcggta ctcgcgctga attc // LOCUS ECOLASNII 1530 bp ds-DNA BCT 28-JUN-1990 DEFINITION E.coli L-asparaginase II (ansB) gene, complete cds. ACCESSION M34277 KEYWORDS L-asparaginase II. SOURCE E.coli (strain K-12) DNA. ORGANISM Escherichia coli Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Facultatively anaerobic rods; Enterobacteriaceae. REFERENCE 1 (bases 1 to 1530) AUTHORS Jennings,M.P. and Beacham,I.R. TITLE Analysis of the Escherichia coli gene encoding L-asparaginase II, ansB, and its regulation by cyclic AMP receptor and FNR proteins JOURNAL J. Bacteriol. 
172, 1491-1498 (1990) STANDARD simple staff_review FEATURES from to/span description pept 385 1431 L-asparaginase II (ansB) precursor sigp 385 450 L-asparaginase II signal peptide matp 451 1428 L-asparaginase II mRNA 323 > 1431 ansB mRNA BASE COUNT 409 a 381 c 382 g 358 t ORIGIN 1 aaatgggcgc gaaagcggtg ctgaaaagcg gcggtaaccc attacagaat gtgctgggaa 61 gcctgggaag cctggggggg ctgcaatcct caatccaaac cgagtggaaa aagcaggaaa 121 aagatttcca gcagtttggc aaagatgttt gtagccgcgt tgtgactctg gaagatagcc 181 gcaaagccct ggtcgggaat ttaaaataat cctctatttt aagacggcat aatacttttt 241 tatgccgttt aattcttcgt tttgttacct gcctctaact ttgtagatct ccaaaatata 301 ttcacgttgt aaattgttta acgtcaaatt tcccatacag agctaaggga taatgcgtag 361 cgttcacgta actggaggaa tgaaatggag tttttcaaaa agacggcact tgccgcactg 421 gttatgggtt ttagtggtgc agcattggca ttacccaata tcaccatttt agcaaccggc 481 gggaccattg ccggtggtgg tgactccgca accaaatcta actacacagt gggtaaagtt 541 ggcgtagaaa atctggttaa tgcggtgccg caactaaaag acattgcgaa cgttaaaggc 601 gagcaggtag tgaatatcgg ctcccaggac atgaacgata atgtctggct gacactggcg 661 aaaaaaatta acaccgactg cgataagacc gacggcttcg tcattaccca cggtaccgac 721 acgatggaag aaactgctta cttcctcgac ctgacggtga aatgcgacaa accggtggtg 781 atggtcggcg caatgcgtcc gtccacgtct atgagcgcag acggtccatt caacctgtat 841 aacgcggtag tgaccgcagc tgataaagcc tccgccaacc gtggcgtgct ggtagtgatg 901 aatgacaccg tgcttgatgg ccgtgacgtc accaaaacca acaccaccga cgtagcgacc 961 ttcaagtctg ttaactacgg tcctctgggt tacattcaca acggtaagat tgactaccag 1021 cgtaccccgg cacgtaagca taccagcgac acgccattcg atgtctctaa gctgaatgaa 1081 ctgccgaaag tcggcattgt ttataactac gctaacgcat ccgatcttcc ggctaaagca 1141 ctggtagatg cgggctatga tggcatcgtt agcgctggtg tgggtaacgg caacctgtat 1201 aaatctgtgt tcgacacgct ggcgaccgcc gcgaaaaccg gtactgcagt cgtgcgttct 1261 tcccgcgtac cgacgggcgc taccactcag gatgccgaag tggatgatgc gaaatacggc 1321 ttcgtcgcct ctggcacgct gaacccgcaa aaagcgcgcg ttctgctgca actggctctg 1381 acgcaaacca aagatccgca gcagatccag cagatcttca atcagtacta atcgcctcgc 1441 cccggtatcg tgccggggct 
ttttcacttc agactcacgt ccattgccaa ttttaattac 1501 cctaatgata atcaccggaa taaattattt // LOCUS HUMPADPRP 2085 bp ds-DNA PRI 28-JUN-1990 DEFINITION Human poly(ADP-ribose) polymerase (PADPRP) gene, exon 1. ACCESSION M34337 KEYWORDS poly(ADP-ribose) polymerase. SOURCE Human peripheral blood leukocyte DNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 2085) AUTHORS Ogura,T., Nyunoya,H., Takahashi-Masutani,M., Miwa,M., Sugimura,T. and Esumi,H. TITLE Characterization of a putative promoter region of the human poly (ADP-ribose) polymerase gene: Structural similarity to that of the DNA polymerase beta gene JOURNAL Biochem. Biophys. Res. Commun. 167, 701-710 (1990) STANDARD simple staff_review FEATURES from to/span description pept 1943 / 2062 poly(ADP-ribose) polymerase /hgml_locus_uid="LZ0154E" /nomgen="PPOL" /map="1q41-q42" pre-msg 1777 > 2085 PPOL mRNA and introns IVS 2063 > 2085 PPOL intron A BASE COUNT 495 a 535 c 550 g 505 t ORIGIN 1 tttagggatg atatagttgt caacccagag atggcatgat catgcctttt gacttggtca 61 ttctctaagt aaaactttta tttgttccat catattttcc acttattctg tttaccttca 121 aaatatcttt tttttttttt tttgagacag ggtcacactg tcacccaggc tagagtccag 181 tggcactatc atggctcacc acagcctcaa ccttcagggc tcaggtgatc ctcccacttc 241 agcctcccga gtagatggga ctacaggcac ctgccaccac ccccagctaa tttttgtaga 301 gacaaggttt tgccatgttg tccaggctgg tcttgaactc ctgggctcaa gggatccggc 361 cacctcagcc tcccaaagtg ctaggattat aggcatgagc cactgtgccc agcctacctt 421 caacgtatct aactggttac taacttttag gattcggcct atgtctcaca accttcttgc 481 ttactcaaca tccttgtctc ttaagccact agcttcttct ctatggttaa cactttttat 541 gagttttatt catctgctta tttttcttat cctctatacc agaattgaat attttcaaat 601 aaagcacact catgttacaa tctttgaaat ggaaaaaaaa aatgcatagg attagaaaag 661 aaaccaattt taataaacta tattttgaag tatagttcta tattaaacaa caagatctag 721 gccaggtgca gtggctcatg cctgtaatcc cagcaatttg ggaagtcgag gtgggaggat 781 tgcttgaggc caggggttca 
agaccagcct gggcaacatg gagagattcc ccatctcttt 841 ctttacacac acacacacac acacacacaa aatatctgat agcaacaggt gcagtcatta 901 ccacaatttc gagtagtgat gagcttaata atatttcgag ttatcaccaa caactgtaaa 961 ctaacatgaa aacgtctgtg atgactattg cccacaaagt cacaggtact gctaatactc 1021 ctggtatttg tagtaaattc ataataaagg aaatgctagg tttcagttgg tattttgtcc 1081 cgacggtctg tggacggcag gttagaacgc ccgtccaagc caggagggtg gacctagcac 1141 tgcagggtcc acctcgggcc aatcaactat attcccgagg cgggggcctg cgcttcccgg 1201 acccagctgc cctcagggga gagaggacac acttaagagt ttggggccgg cgtggtagct 1261 catgcccctg atcccagcac ttcgggaggc tgaggcgtga agatcacttg tagcaggagt 1321 ttgagaccag tctagccaac ttggcgagac cctgtcccta aaaaaaattt ttttttaatt 1381 agccagttgt ggtgagcgcc tgtagtccca gctactcggg aggctgaggt gggaggatcg 1441 ctgggctcag gagttccaga ctgcagtgag ccatgatggc ggcactgcac tccagcgcgg 1501 tgagactcag tctcaaaaat aaaaggggga ggggttgggg gtaaaattag ttgtgaaatc 1561 aagtaagact tcctgggaca gaacaatcaa aggggtggcg ccgggtcctc caaagagcta 1621 ctagctcagc ccaagccccg cctcggcccc cagggcagcg gccgcagagc tccacccggc 1681 aggcgcccgg gaaactccgc cccccggccg gcagggcgcg cgcccgccgg ccccgccccg 1741 tggacgcggg ttccgtggcg ttcccgcggc caggcatcag caatctatca gggaacggcg 1801 gtggccggtg cggcgtgttc ggtgcgctct ggccgctcag ccgtggcggc tgggtgagcg 1861 cacgcgaggc ggcgaggcgg caagcgtgtg tttctaggtc gtggcgtcgg gcttccggag 1921 ctttggcggc agctagggga ggatggcgga gtcttcggat aagctctatc gagtcgagta 1981 cgccaagagc gggcgcgcct cttgcaagaa atgcagcgag agcatcccca aggactcgct 2041 ccggatggcc atcatggtgc aggtgcgggc cgctgtgcgg cgggg // LOCUS HUMREVERB1 295 bp ss-mRNA PRI 28-JUN-1990 DEFINITION Human Rev-ErbA-alpha mRNA, 5' end. ACCESSION M34339 KEYWORDS . SEGMENT 1 of 2 SOURCE Human fetal skeletal muscle, cDNA to mRNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 295) AUTHORS Lazar,M.A., Jones,K.E. and Chin,W.W. 
TITLE Isolation of a cDNA encoding human Rev-ErbA-alpha: Transcription from the noncoding DNA strand of a thyroid hormone receptor gene results in a related protein that does not bind thyroid hormone JOURNAL DNA 9, 77-83 (1990) STANDARD simple staff_review FEATURES from to/span description pept 204 > 295 Rev-ErbA-alpha protein (put.) BASE COUNT 58 a 107 c 74 g 56 t ORIGIN 1 ccgttgcctc aacgtccaac ccttcctgca gggctgcagt ccggccaccc caagcccttg 61 ctgcagggtg cttcggatcc tgatcgtgag tcgcggggtc cactccccgc ccttagccag 121 tgcccagggg gcaacagcgg cgatcgcaac ctctagtttg agtcaaggtc cagtttgaat 181 gaccgctctc agctggtgaa gacatgacca ccctggactc caacaacaac acaggtggcg 241 tcatcaccta cattggctcc agtggctcct ccccaagccg caccagccct gaatc // LOCUS HUMREVERB2 285 bp ss-mRNA PRI 28-JUN-1990 DEFINITION Human Rev-ErbA-alpha mRNA, partial cds. ACCESSION M34340 KEYWORDS . SEGMENT 2 of 2 SOURCE Human fetal skeletal muscle, cDNA to mRNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 285) AUTHORS Lazar,M.A., Jones,K.E. and Chin,W.W. TITLE Isolation of a cDNA encoding human Rev-ErbA-alpha: Transcription from the noncoding DNA strand of a thyroid hormone receptor gene results in a related protein that does not bind thyroid hormone JOURNAL DNA 9, 77-83 (1990) STANDARD simple staff_review FEATURES from to/span description pept < 1 > 285 Rev-ErbA-alpha protein (put.) (AA at 1) BASE COUNT 62 a 79 c 99 g 45 t ORIGIN About 1.7 kb after segment 1. 1 ctcgtaaagg agagagaagt gcagagttcg attctgtaca agggggcagc ggcagaaggc 61 cggccgggcg ggtcactggg cgtccacccg gaaggacagc agcttctcgg aatgcatgtt 121 gttcagggtc cgcaggtccg gcagcttgag cagcagcttg gtgaagcggg aagtctccaa 181 gggccggttc ttcagcacca gagcccgaag agcccgcagc agcgtctcct ggagctgctc 241 caccgaagcg gaattctcca tgcccgagcg gtctgcagag acaag // LOCUS HUMTFEB 1526 bp ss-mRNA PRI 28-JUN-1990 DEFINITION Human TFEB protein mRNA, partial cds. 
ACCESSION M33782 KEYWORDS . SOURCE Human BJAB B cell line, cDNA to mRNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 1526) AUTHORS Carr,C.S. and Sharp,P.A. TITLE A helix-loop-helix protein related to the immunoglobulin E box binding proteins JOURNAL Mol. Cell. Biol. (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by C.S.Carr, 19-APR-1990. FEATURES from to/span description pept < 1 > 1526 TFEB protein (AA at 1) BASE COUNT 324 a 495 c 493 g 214 t ORIGIN 1 ggtggagtga cagccggagc ccgggcgccg gcgcggggct cggtgacagc ggaggcggcg 61 gcccgggcgg gacgagggag cggccgcggc gccgacggcg ggcgggaagg gcggacgggg 121 cgggcctccc tggtggcacg gggccggagc aggccgggag caggtggcgg gcacagcttg 181 cgggccaggc acccgaactt gcgacaagtt gccggagccg gggcgcgggc ggcggacaga 241 ttgaccttca gagcgaggga gccagcgccg gcagccacca tggcgtcacg catagggttg 301 cgcatgcagc tcatgcggga gcaggcgcag caggaggagc agcgggagcg catgcagcaa 361 caggctgtca tgcattacat gcagcagcag cagcagcagc aacagcagca gctcggaggg 421 ccgcccaccc cggccatcaa tacccccgtc cacttccagt cgccaccacc tgtgcctggg 481 gaggtgttga aggtgcagtc ctacctggag aatcccacat cctaccatct gcagcagtcg 541 cagcatcaga aggtgcggga gtacctgtcc gagacctatg ggaacaagtt tgctgctgcc 601 cacatcagcc cagcccaggc tctccgaaac ccccaccagc cgcctcccca ggggtgcgag 661 ctggacacgt gctgtcctcc tccgctggca acagtgctcc caataagccc catggccatg 721 ctgcacattg gctccaaccc tgagagggag ttggatgatg tcattgacaa cattatgcgt 781 ctgacgatgt ctcttggcta catcaatcct gaaatgcaga tgcccaacac gctacccctg 841 tccagcagcc acctgaatgt gtacagcagc gacccccagg tcacagcctc cctggtgggc 901 gtcaccagca gctcctgccc tgcggacctg acccagaagc gagagctcac agatgctgag 961 agcagggccc tggccaagga gcggcagaag aaagacaatc acaacttaat tgaaaggaga 1021 cgaaggttca acatcaatga ccgcatcaag gagttgggaa tgctgatccc caaggccaat 1081 gacctggacg tgcgctggaa caagggcacc atcctcaagg cctctgtgga ttacatccgg 1141 aggatgcaga 
aggacctgca aaagtccagg gagctggaga accactctcg ccgcctggag 1201 atgaccaaca agcagctctg gctccgtatc caggagctgg agatgcaggc tcgagtgcac 1261 ggcctcccta ccacctcccc gtccggcatg aacatggctg agctggccca gcaggtggtg 1321 aagcaggagc tgcctagcga agagggccca ggggaggccc tgatgctggg ggctgaggtc 1381 cctgaccctg agccactgcc agctctgccc ccgcaagccc cgctgcccct gcccacccag 1441 ccaccgtccc cattccatca cctggacttc agccacagcc tgagctttgg gggcagggag 1501 gacgagggtc ccccgggcta ccccga // LOCUS MUSFCERII 2237 bp ss-mRNA ROD 28-JUN-1990 DEFINITION Mouse low affinity IgE receptor (Fc-epsilon-RII) mRNA, complete cds. ACCESSION M34163 KEYWORDS IgE receptor. SOURCE Mouse (strain Balb/c) splenic B cell, cDNA to mRNA, clone p23.18. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 2237) AUTHORS Gollnick,S.O., Trounstine,M.L., Yamashita,L.C., Kehry,M.R. and Moore,K.W. TITLE Isolation, characterization, and expression of cDNA clones encoding the mouse Fc receptor for IgE (Fc-epsilon-RII) JOURNAL J. Immunol. 
144, 1974-1982 (1990) STANDARD simple staff_review FEATURES from to/span description pept 175 1170 low affinity IgE receptor (Fc-epsilon-RII) mRNA < 1 2237 Fc-epsilon-RII mRNA BASE COUNT 592 a 622 c 533 g 490 t ORIGIN 1 ggagaagact actgtcttca acacactagc ctgagctacc ttatccaagt gctccacata 61 ttccagaagg agaaggacag acttcaagtt caaatcactt ccagagctgt gagtgacaag 121 tgccttggca ggtagtgcac gcctcatcac tgaaaggatc caaacaagac tgccatggaa 181 gaaaatgaat actcaggata ctgggaacct cctagaaagc gttgctgctg tgcaagacgt 241 gggacacagc tcatgttggt ggggctgctg agcacagcaa tgtgggctgg cctgctggcc 301 ctgcttcttc tgtggcactg ggaaacggag aagaatctaa aacagctggg agacactgca 361 attcagaatg tctctcatgt taccaaggac ttacaaaaat tccagagtaa tcaattggcc 421 cagaagtccc aggttgttca gatgtcacaa aacttgcaag aactccaagc tgaacagaag 481 caaatgaaag ctcaggactc tcggctctcc cagaacctga ccggactcca ggaggatcta 541 aggaacgccc aatcccagaa ctcaaaactc tcccagaacc tgaacagact ccaagacgat 601 ctagtcaaca tcaaatccct gggcttgaat gagaagcgca cagcctccga ttctctagag 661 aaactccagg aagaggtggc aaagctgtgg atagagatac tgatttcaaa gggaactgca 721 tgcaacatat gtcccaagaa ctggctccat ttccaacaga agtgctacta ttttggcaag 781 ggctccaagc agtggatcca ggccaggttc gcctgcagtg acctgcaagg gcgactagtc 841 agcatccaca gccaaaagga acaggacttc ctgatgcaac acatcaacaa gaaggattcc 901 tggattggcc tccaggatct caatatggag ggagagtttg tatggtcgga cgggagccct 961 gtgggttata gcaactggaa tccaggggag cccaataacg ggggccaggg tgaggactgt 1021 gtgatgatgc ggggatccgg ccagtggaac gacgccttct gccgcagcta cttggatgca 1081 tgggtgtgtg agcagctggc aacatgtgag atatctgccc ccttagcctc tgtgactcca 1141 acaaggccca ccccaaaaag tgaaccctga caaacttctg ctcacactct tctggatttc 1201 tcctctacct ttatcgtgga aacagctggg ccctgaggat acccctatca gggcccaggg 1261 ctctctgtga ccgaaggctt tgattatgtt cccacccata ctgaagcagc tggtggatgc 1321 cagctcctgc cagctaccca gaaaccctct ccagctctcc agctaagctg gccatcccat 1381 tccatctgcc ttcctcaaac ctgggcccca gccttgctag ctccctgact acgggcatgc 1441 atgtgggcag ctgagccaac cagggagctg ctgagaacaa agatttcgaa ggcttctttt 1501 gcagtcccca cctcctatca 
agttccccac tttctccccc tcggcatcag agaacagggg 1561 ttccctttcc ccaggatctg ggatgagtcc tcccatcaag tttgcatcag tggtcccagg 1621 actccgaccc tccttggagg ctaccaggtg tgctcctggt gcgggaggta ttgaaggaac 1681 tctaaacagc tccagcaagg cgagcctggc tctgtctggt aggcctggcc cttctctccc 1741 attccttcta ccttactaaa agctgttaga gaacagtcct aaagctagcc cccaaggtct 1801 attcccttat ttggccactt cctcctcctg aggctgacta caaggtccag ctatccaagt 1861 actgaagtct aacatcaaaa gccccctttg tctcacctaa gtagcaatgc ccaatcaaaa 1921 tacaccatca catcatagcc cagtctaaca gaccgccctt tttctcttca taaaattaca 1981 cctgcaacca ggcgtagtgg tgcaggcctt tagtcccagc acttgggagg cagagacaag 2041 cgaatttctg agttcgaggc cagcctggtc tacaaagtga gttccaggac agccagggct 2101 acacagagaa accctgtctc gaagaaagaa aaaaaaaaaa aattacacct gcgaggtcac 2161 ttgggctgct gtttttctgc ctgagtcaga gggcagccac ttaacttttc ttccctgctt 2221 aataaaggat ctctgtg // LOCUS RATATM1 1585 bp ds-DNA ROD 28-JUN-1990 DEFINITION Rat alpha-tropomyosin (alpha-TM) gene, exon 1b. ACCESSION M34137 KEYWORDS alpha-tropomyosin. SEGMENT 1 of 2 SOURCE Rat DNA. ORGANISM Rattus norvegicus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 1585) AUTHORS Lees-Miller,J.P., Goodwin,L.O. and Helfman,D.M. TITLE Three novel brain tropomyosin isoforms are expressed from the rat alpha-tropomyosin gene through the use of alternative promoters and alternative RNA processing JOURNAL Mol. Cell. Biol. 10, 1729-1742 (1990) STANDARD simple staff_review COMMENT [1] reports isolation of exon 1b which is the first exon coding for the alternatively expressed low-M-r alpha-tropomyosins. 
FEATURES from to/span description pept 1275 / 1406 low-M-r alpha-tropomyosin BASE COUNT 360 a 408 c 449 g 368 t ORIGIN 1 agaggtgttc cagctttgga agataagtga ggcaggctgt tttcttttct acctttgttt 61 ccattctcca cctcccacat ttaagcaagt tcaggactga ctctttgaca gaggtgtagg 121 attaaaaagc tcggagactt gacaggcttt aaaaaaaaca tagcggaaga aggaatgaca 181 gaggcgggtt cttacagact gtgtccatgt ggctttccag ctggtaacca gaggttctgt 241 cacaggtgtt tgtggtcttg cagacccggg ttcctcagat tccagatatc aggctgaatt 301 gcctgaagtt gcacagttcg ttatggggta cagacaaaat cttatggtag cgtttaatta 361 catagtttga gttcttaaat aagtgaggca tatgtaaatg ttttatatca tctgataact 421 tcaacagcct tgtgaagtgt cctgtactat tatactcgtt tttaagagag gaaatccctg 481 agaagtcatt tgtccaaagg cacacagcca gcagagactt aagagccaga ttgaaacctt 541 aaaaagatgt atttagaaca ggagatttaa ccactacgct ttgccagcta gtagtttaca 601 cttggtgaga tagtttgggt gttgggattc ttgtagtgag gatcttgtgt tctgtgttct 661 tcacaaggat ctttaggata ttcctatttc aggctccgag ttccttgttc agactctggg 721 aaatcgggag ttgccaatgc ctttagtatt cggaatgatg gcgttcttta caactcaaag 781 agtttaggga acgttcaagg tctaggttcc tacccttact cagcttcagg ggaatctcga 841 tttcagagcg ctccaaatgg ccagcaggag tcgccatagg ctttagcatc cagttcctct 901 acttggagtc acccaggctc cgcggaatgg atgcaaacgc cctcctgcgg tagcctcgct 961 ctaggcaaag cagcccagag ccagccattc ccgcgcgcac gcgcagtggt tccaagggta 1021 gccagttcct cgggtccacc ggccccgccc gcctgcccgc cgcgcccatc ccctgcgttc 1081 gcgcacccgc ccgcggctct ctagtgcggc cctggggccg tgacttccgg gctgcttctt 1141 agccgcaggg ggcgccggcg ttgcaaagcg ggcctgggcg ggacgaaccg gacctgggct 1201 ccgtaactgc aggagcccag cagtagcagc tgcagagcgc ggaggcagag cacagcctca 1261 gccgccgcct cgccatggcg ggtagcagct cgctggaggc ggtgcggaga aagatccgga 1321 gcctgcagga acaggcagat gccgcggagg agcgtgcggg cagcctgcag cgagagctgg 1381 accaggagcg gaagctgcgg gaaaccgtaa gggatccacc catcacccga ggcccccacg 1441 ccgcacctgg cgcacctagg ccaactggca cccggctact cagcctgccg gtctatctcc 1501 tcggtctaaa gtaaaacctc caagggggag caagagtgct tgctgttgag aaggttctgg 1561 gggcgagcat gaaatgggtc ccggg // LOCUS RATATM2 462 
bp ds-DNA ROD 28-JUN-1990 DEFINITION Rat alpha-tropomyosin (alpha-TM) gene, exon 9c. ACCESSION M34138 KEYWORDS alpha-tropomyosin. SEGMENT 2 of 2 SOURCE Rat DNA. ORGANISM Rattus norvegicus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 462) AUTHORS Lees-Miller,J.P., Goodwin,L.O. and Helfman,D.M. TITLE Three novel brain tropomyosin isoforms are expressed from the rat alpha-tropomyosin gene through the use of alternative promoters and alternative RNA processing JOURNAL Mol. Cell. Biol. 10, 1729-1742 (1990) STANDARD simple staff_review COMMENT [1] reports isolation of exon 9c which is the final exon coding for the alternatively expressed brain alpha-tropomyosins TMBr-1 and TMBr-3. FEATURES from to/span description pept / 375 448 alpha-tropomyosin BASE COUNT 97 a 141 c 100 g 124 t ORIGIN Unknown number of bp after segment 1. 1 ctcgcctaga atctctgctg agctcccaca aagcatgagt gtctggggct gccctccagc 61 ctggccttag tccccaaatc tgttgtccta aactacacat aagtgagtca ctgggtggct 121 ttaatgaggt gcagaaggag cattggttct gcacctcttc gcctctgctc tacaccaagg 181 aaagattctg gcccctcctt tattctgcca gacctccatg cgcctcaagg gcacagggca 241 gagggcctgg ggtcagggta ccttgccgct gtcctttgta ttgtgttctc agtctgttaa 301 ccgtttctca ctgactcttg ctaacctctt ccttctgcct cttcttttct gctaaccctt 361 gctgacctgc ccagatcaac tctaccatca actcgagcaa aaccgccgtc taactaatga 421 actaaagctg gccctgaatg aggattaaaa ccctgggcca ag // LOCUS RATPLPA1 1658 bp ds-DNA ROD 28-JUN-1990 DEFINITION Rat parathyroid hormone-like peptide (PLP) gene, exons 1 and 2. ACCESSION M34108 KEYWORDS parathyroid hormone-like peptide. SEGMENT 1 of 5 SOURCE Rat liver DNA, clone lambda-rPLPg10-2. ORGANISM Rattus norvegicus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 1658) AUTHORS Karaplis,A.C., Yasuda,T., Hendy,G.N., Goltzman,D. and Banville,D. 
TITLE Gene-encoding parathyroid hormone-like peptide: Nucleotide sequence of the rat gene and comparison with the human homologue JOURNAL Mol. Endocrinol. 4, 441-446 (1990) STANDARD simple staff_review FEATURES from to/span description pept 1477 + 1577 parathyroid hormone-like peptide (PLP) precursor, exon 2 (first expressed exon) sigp 1477 + 1577 parathyroid hormone-like signal peptide pre-msg 951 > 1658 PLP mRNA and introns IVS 1259 1452 PLP intron A IVS 1578 > 1658 PLP intron B BASE COUNT 325 a 507 c 466 g 360 t ORIGIN 1 gggaaccagt tacacacaca cacacacaca cacacacaca cacacacaca cacacacact 61 ggtaagaacc agtagcccac aacattctcc ccctctcccc acccacccac ccagcagcgc 121 tgagaagctg aaacgactgg cataaggtta ctcagtgaat ctatagcagg acatagagta 181 ggttatcaga ggaagagtgg ctaagttgga ttctgtgtgc tccaggttct ctgagatcac 241 aaagtgggtc ttcgatcctt gttcacacac cctttctagg atctgctcca gatgtgatgt 301 ccccagctgt tctcaaacag ttaagaccaa aaatcgcctt tgctctcctg ggtgctaccc 361 ttagtcccct ccccagggga aggccatgcc aggcaggaag gctggcttgc aacagtgcga 421 gcttgcaaac gagcttggga ctcttgggat tcgcaggcca cccggagttc ctccttgccg 481 ccggtgggga ccagggctcc tggacctggg ctcctggact caggcggcag caggctggtt 541 caggaggagg agagcgtcag atcggcccac gcggcgactg gggtggggct ccgtggctcc 601 cagctcccct gtgctccccg cccccgccgg cccgctcacc ccagtctgct ccgccggcgc 661 gcgttcctcg ggcgccacct ctttgcgact cgctcacttc tcagcaggtt ggccacagag 721 cgtgtgaacc ttccaaggct gggctcacag ctcctctcca acctgcgccg cccagctggg 781 cgtccgcccc gccgccggaa ccgcctcgcg ccacctggcc ctcgcatcca cgacacgcgc 841 gcctgcaact tgttcaaggg cgttgtggaa tcaactttcc ggaagcaacc agcccaccgg 901 aggaggtaga ccgacagcta tgtatatata cgctctctgc cgcaagtggc tttgcaccca 961 gaggcctggt gcgcaggaca gctgactcct gcgggcacac tcgcgtttga agaggggttc 1021 gccctgcccc gcgacccaga gcgctgccgc caagactaat tagacattgc tatgggagcc 1081 acagcaacgc gtcacgcatc cccgacgcct atgtaaaacg gttcggtctt cgctctttca 1141 gaggaaactc tctgattgct ttttcccttc tcgggtccct tttttttttt tttttttttt 1201 ttttttttgc ctgtgcggtt tgagagaggc acagttggag gcgctgattc ctacaaaagt 
1261 aagtgctcag gggctcggaa acactttttt gctttctttt tcttgttttc ttttttcctt 1321 ataacttgtg agatgccctc gacattggag tggcttgttg ggaagccacg gggttccaaa 1381 agagggagag gggcagtgag aggaagagat agatggagag ccaggactga ccctcgggtt 1441 ccactcttgc aggtcccaga gccagcgagc ggcacgatgc tgcggaggct ggttcagcag 1501 tggagcgtcc tggtgttcct gctcagctac tccgtgccct cccgcgggcg ctcggtggag 1561 gggcttggtc gcaggctgta agtaccggat cttcctcccg ggtgagggat ggaggagacc 1621 agcaccagag gtgccgaact gaggcgggca atgcggaa // LOCUS RATPLPA2 144 bp ds-DNA ROD 28-JUN-1990 DEFINITION Rat parathyroid hormone-like peptide (PLP) gene, intron B (partial). ACCESSION M34109 KEYWORDS parathyroid hormone-like peptide. SEGMENT 2 of 5 SOURCE Rat liver DNA, clones lambda-rPLPg[10-2,9-0]. ORGANISM Rattus norvegicus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 144) AUTHORS Karaplis,A.C., Yasuda,T., Hendy,G.N., Goltzman,D. and Banville,D. TITLE Gene-encoding parathyroid hormone-like peptide: Nucleotide sequence of the rat gene and comparison with the human homologue JOURNAL Mol. Endocrinol. 4, 441-446 (1990) STANDARD simple staff_review FEATURES from to/span description IVS < 1 > 144 parathyroid hormone-like peptide intron B BASE COUNT 42 a 26 c 21 g 55 t ORIGIN 3 kb after segment 1. 1 gcagaacact atgttttgac tgaagtccaa gttgcctctg ataaactttt gcccaatttt 61 tcttttaatt tttttacagt aaccaaagac aaggataaag tataaatcca cttatttttc 121 catgctggca ctttggtttg taat // LOCUS RATPLPA3 145 bp ds-DNA ROD 28-JUN-1990 DEFINITION Rat parathyroid hormone-like peptide (PLP) gene, intron B (partial). ACCESSION M34110 KEYWORDS parathyroid hormone-like peptide. SEGMENT 3 of 5 SOURCE Rat liver DNA, clones lambda-rPLPg[10-2,9-0]. ORGANISM Rattus norvegicus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 145) AUTHORS Karaplis,A.C., Yasuda,T., Hendy,G.N., Goltzman,D. 
and Banville,D. TITLE Gene-encoding parathyroid hormone-like peptide: Nucleotide sequence of the rat gene and comparison with the human homologue JOURNAL Mol. Endocrinol. 4, 441-446 (1990) STANDARD simple staff_review FEATURES from to/span description IVS < 1 > 145 parathyroid hormone-like peptide intron B BASE COUNT 22 a 34 c 34 g 55 t ORIGIN 640 bp after segment 2. 1 agtctccata gggtcaatcc actgtcatgt gcttcttatg ctgtgctgtt tcttgtcccc 61 ttgctgtgac actcacttct gcctttcttg gacttgagaa cacgttgggg acttttgaga 121 atcggagtcc tgttttgtgg acttt // LOCUS RATPLPA4 920 bp ds-DNA ROD 28-JUN-1990 DEFINITION Rat parathyroid hormone-like peptide (PLP) gene, exon 3. ACCESSION M34111 KEYWORDS parathyroid hormone-like peptide. SEGMENT 4 of 5 SOURCE Rat liver DNA, clone lambda-rPLPg9-0. ORGANISM Rattus norvegicus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 920) AUTHORS Karaplis,A.C., Yasuda,T., Hendy,G.N., Goltzman,D. and Banville,D. TITLE Gene-encoding parathyroid hormone-like peptide: Nucleotide sequence of the rat gene and comparison with the human homologue JOURNAL Mol. Endocrinol. 4, 441-446 (1990) STANDARD simple staff_review FEATURES from to/span description pept + 94 + 516 parathyroid hormone-like peptide (PLP) precursor, exon 3 sigp + 94 100 parathyroid hormone-like signal peptide matp 101 + 516 parathyroid hormone-like peptide IVS < 1 93 PLP intron B IVS 517 > 920 PLP intron C BASE COUNT 198 a 353 c 166 g 203 t ORIGIN 1.3 kb after segment 3. 
1 tccctactct agaggtcaac tgttcaccca tctcattcct tcaagtccaa cacgggcagg 61 tcctcagctg acaactgtct tctcccctga cagaaaacgc gcggtgtctg agcaccagct 121 actgcatgac aagggcaagt ccatccaaga cttgcgccgc cgtttcttcc tccaccatct 181 gattgcggag atccacacag ctgaaatcag agctacctcg gaggtgtccc ctaactccaa 241 acctgctccc aacaccaaaa accaccctgt gcggtttggg tcagacgacg agggcagata 301 cctaactcag gaaaccaaca aggtggagac gtacaaagag cagccactca agacgcccgg 361 gaagaagaag aaaggcaagc cggggaaacg cagagaacag gagaaaaaaa agcgaaggac 421 tcggtctgcc tggccaggca caactgggag tggcctgctt gaggaccccc agccccacac 481 ctccccgacc tcgacctccc tggagcccag ctcaaggtat tgtctgtccg ctgtgctcag 541 tagggctccc ctccctccac cggcctgggt cccagagatg cagaatcctc tgtatcttga 601 tctgagacta catcagttag gccacaatct cagtccattc ctggttagcc tgctagacaa 661 tttaatttcc ctacttccct cccaccagcc caatcaacca tccttcctcc ctccctccct 721 ccttccctcc ctcactccct ccctccctcc atccctccct ccctaccatc ctccctccct 781 tccttctccc tttcctttcc atcccctccc ctcttctccc tttcctttcc atcccctccc 841 ctcttctcct cccctctctc tgattcctgt cttcctcact cctccccaca catagggtct 901 tcctcaagcc aaggctgact // LOCUS RATPLPA5 855 bp ds-DNA ROD 28-JUN-1990 DEFINITION Rat parathyroid hormone-like peptide (PLP) gene, exon 4. ACCESSION M34112 KEYWORDS parathyroid hormone-like peptide. SEGMENT 5 of 5 SOURCE Rat liver DNA, clone lambda-rPLPg9-0. ORGANISM Rattus norvegicus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 855) AUTHORS Karaplis,A.C., Yasuda,T., Hendy,G.N., Goltzman,D. and Banville,D. TITLE Gene-encoding parathyroid hormone-like peptide: Nucleotide sequence of the rat gene and comparison with the human homologue JOURNAL Mol. Endocrinol. 
4, 441-446 (1990) STANDARD simple staff_review FEATURES from to/span description pept + 21 30 parathyroid hormone-like peptide (PLP) precursor, exon 4 matp + 21 27 parathyroid hormone-like peptide pre-msg < 1 449 PLP mRNA and introns IVS < 1 20 PLP intron C BASE COUNT 280 a 139 c 157 g 279 t ORIGIN 3.6 kb after segment 4. 1 ccccttcctt gtcttcacag gacgcattga aactttcacc gaagatcttc caagacacgt 61 tacaggattc tgtaatagta aacatatgga aagtattaga catatttatt gcctgtacat 121 actgtaaatg cattgggatc aaactgtctc cccaggaaac tgcacatggg tcatgttacc 181 ataatttatt ttgtcaactg atgtatttat ttgtaaatgt atcttggtgc tgctgactct 241 gtttttttgt aacataatgc actttaggta tacatatcaa gtatgtcgat gaatttaaca 301 cgaaaaagtg tctctatttt gtggttcatt ttgatgagtt ctgaaatata attatctaga 361 ctgatctccc tctgcgcatg taaaaatggc actattttaa atttgtaaag aatgtctaat 421 aaaatataat ctaattacac cgtgacttac aatgttggtt ttaattcttt aaggtttcta 481 atcagagaaa cacagtacaa ttttttttta aattctactt gaataaaatc ctaagcttga 541 tgtgtttgtg tggggatcta ctccaatgtg tccgcattag cttttagtta aaaaccactt 601 aaaatgtatt ttgccaatat ttcagctcaa gtcaatagaa tggctgaaca gcctgaactt 661 caagttgaaa agaaagacaa gaaagggcaa aaaataagca gagaagtcag gagaaagggt 721 caaagagggt tgtccagagt taaaaccaga gaagtgggtc cagacttgaa gactctgaat 781 aaattttgcc aatgtaaagt ttagtttgca agggcttgct ctcgattcat aacaataata 841 aaagatctgc tcggg // LOCUS RATTMBR1A 1050 bp ss-mRNA ROD 28-JUN-1990 DEFINITION Rat brain alpha-tropomyosin (TMBr-1) mRNA, complete cds. ACCESSION M34135 KEYWORDS alpha-tropomyosin. SOURCE Rat brain, cDNA to mRNA, clone pOk4. ORGANISM Rattus norvegicus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 1050) AUTHORS Lees-Miller,J.P., Goodwin,L.O. and Helfman,D.M. TITLE Three novel brain tropomyosin isoforms are expressed from the rat alpha-tropomyosin gene through the use of alternative promoters and alternative RNA processing JOURNAL Mol. Cell. Biol. 
10, 1729-1742 (1990) STANDARD simple staff_review FEATURES from to/span description pept 41 886 brain alpha-tropomyosin (TMBr-1) mRNA < 1 1050 TMBr-1 BASE COUNT 336 a 220 c 306 g 188 t ORIGIN 1 ctctggctcc tctgcctaag ggcccctcgc caccgccacc atggacgcca tcaagaagaa 61 gatgcagatg ctgaagctcg acaaagagaa cgccttggat cgagcagagc aggcggaggc 121 tgacaagaag gctgcggaag accggagcaa gcagctggaa gatgagctgg tgtcactgca 181 aaagaaactc aaggccactg aagatgaact ggacaaatac tccgaggctc tcaaagatgc 241 ccaggagaaa ctggagctgg cggagaaaaa ggccacagat gctgaagctg acgtagcatc 301 tctgaacaga cgcatccagc tggttgagga ggagttggat cgcgctcagg agcgtctggc 361 cacagctcta cagaagctgg aggaggctga gaaggctgca gatgagagtg agagaggcat 421 gaaagtcatt gaaagccgag cccaaaaaga tgaagaaaag atggagattc aggagatcca 481 gctgaaagag gccaagcaca ttgctgaaga tgctgaccga aagtatgaag aggtggcccg 541 taagctggtc atcatcgaga gcgatctgga gcgtgcggag gagagggctg agctctcgga 601 aggcaaatgt gccgagcttg aagaagagtt gaaaacggtg acgaacaact tgaagtcact 661 ggaggctcag gctgagaagt actctcagaa agaagacaag tatgaagagg agatcaaggt 721 tctctctgac aagctgaagg aggctgagac ccgggctgag tttgcagaga gatcagtaac 781 caaattggag aaaagcattg atgacttaga agatcaactc taccatcaac tcgagcaaaa 841 ccgccgtcta actaatgaac taaagctggc cctgaatgag gattaaaacc ctgggccaag 901 ttctaggaac ggagtccatg ttcagaaaat ctatgactct cggggttaaa aaatgataac 961 ttctagagaa atgggaagga aatgcattgg ctggaaggcc agccttgcct atgttttcct 1021 ctgtatatct taattaaatc ttcaataacc // LOCUS RATTMBR2A 1004 bp ss-mRNA ROD 28-JUN-1990 DEFINITION Rat brain alpha-tropomyosin (TMBr-2) mRNA, complete cds. ACCESSION M34134 KEYWORDS alpha-tropomyosin. SOURCE Rat brain, cDNA to mRNA, clone pOk15. ORGANISM Rattus norvegicus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 1004) AUTHORS Lees-Miller,J.P., Goodwin,L.O. and Helfman,D.M. 
TITLE Three novel brain tropomyosin isoforms are expressed from the rat alpha-tropomyosin gene through the use of alternative promoters and alternative RNA processing JOURNAL Mol. Cell. Biol. 10, 1729-1742 (1990) STANDARD simple staff_review FEATURES from to/span description pept 137 892 brain alpha-tropomyosin (TMBr-2) mRNA < 1 1004 TMBr-2 BASE COUNT 265 a 242 c 321 g 176 t ORIGIN 1 cttagccgca gggggcgccg gcgttgcaaa gcgggcctgg gcgggacgaa ccggacctgg 61 ctccgtaact gcaggagccc agcagtagca gctgcagagc gcggaggcag agcacagcct 121 cagccgccgc ctcgccatgg cgggtagcag ctcgctggag gcggtgcgga gaaagatccg 181 gagcctgcag gaacaggcag atgccgcgga ggagcgtgcg ggcagcctgc agcgagagct 241 ggaccaggag cggaagctgc gggaaaccgc tgaagctgac gtagcatctc tgaacagacg 301 catccagctg gttgaggagg agttggatcg cgctcaggag cgtctggcca cagctctaca 361 gaagctggag gaggctgaga aggctgcaga tgagagtgag agaggcatga aagtcattga 421 aagccgagcc caaaaagatg aagaaaagat ggagattcag gagatccagc tgaaagaggc 481 caagcacatt gctgaagatg ctgaccgaaa gtatgaagag gtggcccgta agctggtcat 541 catcgagagc gatctggagc gtgcggagga gagggctgag ctctcggaag gcaaatgtgc 601 cgagcttgaa gaagagttga aaacggtgac gaacaacttg aagtcactgg aggctcaggc 661 tgagaagtac tctcagaaag aagacaagta tgaagaggag atcaaggttc tctctgacaa 721 gctgaaggag gctgagaccc gggctgagtt tgcagagaga tcagtaacca aattggagaa 781 aagcattgat gacttagaag ataagtttct ttgcttctct cctcccaaga ctccttcatc 841 aagccggatg tcccacctct ctgagctctg catctgtctg ctctccagct gacccaggtt 901 tctttctagt gcccacccac cctagggcca ggcacagacc gtgctttcta ttgtacagag 961 gtgatcctcc cagtgtaaaa taaacactgt acgctagtcc tgtt // LOCUS RATTMBR3A 2488 bp ss-mRNA ROD 28-JUN-1990 DEFINITION Rat brain alpha-tropomyosin (TMBr-3) mRNA, 3' end. ACCESSION M34136 KEYWORDS alpha-tropomyosin. SOURCE Rat brain, cDNA to mRNA, clone pOk10. ORGANISM Rattus norvegicus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 2488) AUTHORS Lees-Miller,J.P., Goodwin,L.O. and Helfman,D.M. 
TITLE Three novel brain tropomyosin isoforms are expressed from the rat alpha-tropomyosin gene through the use of alternative promoters and alternative RNA processing JOURNAL Mol. Cell. Biol. 10, 1729-1742 (1990) STANDARD simple staff_review FEATURES from to/span description pept < 1 604 brain alpha-tropomyosin (TMBr-3) mRNA < 1 2488 TMBr-3 BASE COUNT 713 a 531 c 579 g 665 t ORIGIN 1 tgaagctgac gtagcatctc tgaacagacg catccagctg gttgaggagg agttggatcg 61 cgctcaggag cgtctggcca cagctctaca gaagctggag gaggctgaga aggctgcaga 121 tgagagtgag agaggcatga aagtcattga aagccgagcc caaaaagatg aagaaaagat 181 ggagattcag gagatccagc tgaaagaggc caagcacatt gctgaagatg ctgaccgaaa 241 gtatgaagag gtggcccgta agctggtcat catcgagagc gatctggagc gtgcggagga 301 gagggctgag ctctcggaag gcaaatgtgc cgagcttgaa gaagagttga aaacggtgac 361 gaacaacttg aagtcactgg aggctcaggc tgagaagtac tctcagaaag aagacaagta 421 tgaagaggag atcaaggttc tctctgacaa gctgaaggag gctgagaccc gggctgagtt 481 tgcagagaga tcagtaacca aattggagaa aagcattgat gacttagaag atcaactcta 541 ccatcaactc gagcaaaacc gccgtctaac taatgaacta aagctggccc tgaatgagga 601 ttaaaaccct gggccaagtt ctaggaacgg agtccatgtt cagaaaatct atgactctcg 661 gggttaaaaa atgataactt ctagagaaat gggaaggaaa tgcattggct ggaaggccag 721 ccttgcctat gttttcctct gtatatctta attaaatctt caataaccaa gaagttttta 781 tgatacatta tgtatatggg aaaatattta taccaatcaa ttcagctctt cctgcatcag 841 caggccctct ttgaagtgat gataatgagg actcccccac caccaccatc cccccaaaaa 901 aaggaaaaaa ttttctattt cctctaaaca gctgtgagtc ctgctttaaa tatctcttct 961 acatttgtat tcctaacatc ctcctaacat gtatagtgtt tgaaatgtga tccaacgtgt 1021 acctatgctt gggcaaaata gcttttgaaa acaggagctc gtaccacaag tccctggttg 1081 aaaggtatgc tttgttaagt ttaacggtgt tggaatctga ttatcgtgtc atgctgttaa 1141 gcgggatact ataaaaagac taagagaatg tccctaatga agtgtgcatg agatgtgttc 1201 atataaccat ttgcgagccg tagaaataaa tctatttaaa aagtggtcag aaccttcctt 1261 acgtcatctt cgttttctgt gaggtcgtgt tctgacccac ctacccgatc tttaactgcc 1321 tgtaactcag aatcctgtgc tggtattaac gtcttcttcg gtctgtgtaa caatgctgta 
1381 cctgtatttg gtttttgacc ctcccttgtt ccccgtagca cgcagtgggt gagccttcac 1441 caaacacacc ctgcacgtta cgtcagggtg ggttgcaggt gtaggtgcac ccggggcctt 1501 gtgtagatgt gagtgacttc caaagcatgt gtagtgccac tcatctgtgt gtgactggtg 1561 tttgcttact aaatgtacaa accactaccg agtgtgaagc tcagccattc atgactcaat 1621 tttctgatcc catcacagag aaagtggcac atgccaaaga agaaaacctt agtatgcacc 1681 agatgctgga ccagacttta ctggagctaa acaacatgtg aaaccctcct tagctgcggc 1741 cacattctct cgtttccccc tgttgtttta ttttgtcttt aaacacatgc ttaccatgaa 1801 accccttcaa tgcttttttt ttttatatat atatatacct ttaccactgt cactgcaaca 1861 tctgcagaga gccagctagg gcagggggtg gggaaagaca cacagaaagc aggtgtactt 1921 gttatttaaa tgtgccattt cccaggttga cattgccaca cttcacagag tttaggcatg 1981 cagtttgctt agccagtgta ggaagcctca cgaaaacaaa aggattaaca gcctaagtgc 2041 caatggtggg gtcatcagga agactgatgt tggagacaat caggtgcaga ttggtgctaa 2101 tttaaacaaa agaacccctg gtgtcttttg ttcagtctaa tacaatttag acttctgtcc 2161 aacactaatt tatttttgtc ttgagtagca ctgcgagagg atatgggttc catgtgcccg 2221 tgttcctcac agccaagggt ttgaaagcca cgctggtctt gtaagattcc cattccttcc 2281 cagctgacat gcttgtggtc ccttccaact cagtggggag agggctcagt gctgttcgcc 2341 cccccccccc ccaattctcc atgtcagggt aagagatcta ggcactacat atgtaattgg 2401 taaagaaaag gcattttctc gagagttata actatgtaaa cattgtataa tgatatgaaa 2461 taaaatgcac attgtaggac attttcta // LOCUS RSPNIFHD 1375 bp ds-DNA BCT 28-JUN-1990 DEFINITION R.rubrum dinitrogenase reductase (nifH) gene, complete cds; and dinitrogenase alpha subunit (nifD') gene, 5' end. ACCESSION M33774 KEYWORDS dinitrogenase alpha subunit; dinitrogenase reductase. SOURCE R.rubrum DNA. ORGANISM Rhodospirillum rubrum Prokaryota; Bacteria; Gracilicutes; Anoxyphotobacteria; Purple nonsulfur bacteria. REFERENCE 1 (bases 1 to 1375) AUTHORS Lehman,L.J., Fitzmaurice,W.P. and Roberts,G.P. 
TITLE The cloning and functional characterization of the nifH gene of Rhodospirillum rubrum JOURNAL Gene (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by L.J.Lehman, 18-APR-1990. FEATURES from to/span description pept 367 1254 dinitrogenase reductase (nifH) pept 1370 > 1375 dinitrogenase alpha subunit (nifD) binding 355 358 ribosome binding site binding 1357 1361 ribosome binding site signal 145 160 upstream activating sequence signal 175 190 upstream activating sequence signal 280 296 ntr promoter BASE COUNT 287 a 404 c 431 g 253 t ORIGIN 1 ggggatccca aggagggacc ctccgtcctg accaagaact gtgcgttcgt gttttcgccc 61 attcctttat caccgatcgg gcggcgacgg gaagggcgtt ctgggcagta gcgcaccatc 121 gcgcatagcg tcgcgccgcc ccgctgttgg gtccgcgaca gggttttggg gcgttgtcgg 181 gtttgcgaca cggttgaaaa taaccctaca ttccttgtta gatagattta ttcctttgat 241 taaaaacgat aattttcaaa aaaatcgggc aaatcaggtt tggcacgctg ggtgcattaa 301 cggggtgagc gtggcggtgg agcttccccg ccttaggacc ggatcacgaa gaaaaggatt 361 gattccatga gcgcactgcg tcagatcgcg ttctacggca agggcggtat cggcaagtcg 421 acgacgtccc agaacacgct ggccgctctg gtcgaaatgg gccagcggat cctgattgtc 481 ggctgcgatc ccaaggccga ttccacccgc ctgatcctga acaccaagct gcaggacacc 541 gttcttcatc tggccgccga ggccggctcg gtcgaagacc tcgacgtcgc cgatgtggtg 601 aagatcggct acaagggcat caagtgcacc gaatccggtg gtccggagcc cggcgtcggc 661 tgtgccggcc gtggcgtgat caccgccatc aacttccttg aggaaaacgg cgcctacgac 721 gacctcgatt acgtctctta cgacgtgctg ggcgacgtgg tttgcggtgg cttcgccatg 781 ccgatccgcg agaacaaggc tcaggaaatc tacatcgtca tgtcgggcga gatgatggcg 841 ctgtatgccg ccaacaacat cgccaagggc attttgaaat acgcccacac cggtggggtt 901 cgtctgggcg gtctgatctg caacgagcgt cagactgata aggaagtcga actggccgag 961 gccctggccg gccgtttggg ctgccgcctc atccacttcg tgccgcgcga caacggcgtt 1021 cagcacgccg agctgcgccg ccaaacggtg atccaatacg cccccgacag caagcaggcc 1081 ggtgagtatc gcacgctggc gaccaagatc cacaacaact ccggccaggg cgtggtgccg 1141 accccgatca ccatggaaga cctggaagag atgctgatgg 
aattcggcat catgaagtcc 1201 gacgaagagg cgctggccga gctcgaagcc aaggaatccg ccgccgccaa ttaaggccgt 1261 cgccgtgcga cgttcccccc ggccggccgg tcaggcgcag gggcgacgga agcgggcgac 1321 ctccaagaag gcctcttcgc gaggcgatag tcgatcagag gagagggata tgagc // LOCUS YSCMTFDEL 70 bp ds-DNA ORG 28-JUN-1990 DEFINITION S.cerevisiae mitochondrial novel junction sequence. ACCESSION M34150 KEYWORDS . SOURCE S.cerevisiae mitochondrial DNA. ORGANISM Mitochondrion Saccharomyces cerevisiae Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes; Endomycetales; Saccharomycetaceae; Saccharomyces cerevisiae. REFERENCE 1 (bases 1 to 70) AUTHORS Skelly,P.J. and Clark-Walker,G.D. TITLE Conversion at large intergenic regions of mitochondrial DNA in Saccharomyces cerevisiae JOURNAL Mol. Cell. Biol. 10, 1530-1537 (1990) STANDARD simple staff_review FEATURES from to/span description site 33 44 novel junction site BASE COUNT 11 a 25 c 19 g 15 t ORIGIN 1 cttcggagtt cggccccccc ataagggggg gacctcactc cttccccact gcactggatg 61 cgggacttat // LOCUS YSCMTGAP12 457 bp ds-DNA ORG 28-JUN-1990 DEFINITION S.cerevisiae mitochondrial gap 12 sequence. ACCESSION M34149 X15185 KEYWORDS . SOURCE S.cerevisiae mitochondrial DNA. ORGANISM Mitochondrion Saccharomyces cerevisiae Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes; Endomycetales; Saccharomycetaceae; Saccharomyces cerevisiae. REFERENCE 1 (bases 1 to 457) AUTHORS Skelly,P.J. and Clark-Walker,G.D. TITLE Conversion at large intergenic regions of mitochondrial DNA in Saccharomyces cerevisiae JOURNAL Mol. Cell. Biol. 10, 1530-1537 (1990) STANDARD simple staff_review COMMENT The gap 12 sequence is a segment of yeast mitochondrial DNA that had not been determined previously. 
BASE COUNT 161 a 57 c 57 g 182 t ORIGIN 1 tacaatttat aatttaataa agaaggaaat aaataataat aactcctttt ggggttccgg 61 tggggttcac acctttataa ataataaata aagatgttta ctcctcttcg gggttcggtc 121 ccctttttgg gttccggaac taattaatat tttatataat aataataata tattaatata 181 atttcattat taataaatat ctcctgcggg gttcggtccc ccccgtaagg ggggggtcct 241 cactccttcg gagcgtacta ttattataaa taattatata ttataatata attaaaaagt 301 attataattg aaacgaaaat tgtaatttta aatggaataa taattattat atatttaata 361 tatttaataa agttataata tctctttcta ccggactatt ttattttatt ttattttatt 421 tttataaaga aaaatagtat aatattatct tctcctc // LOCUS YSCMTORI1A 70 bp ds-DNA ORG 28-JUN-1990 DEFINITION S.cerevisiae mitochondrial ori1 gene, 5' flank. ACCESSION M34153 KEYWORDS . SOURCE S.cerevisiae mitochondrial DNA. ORGANISM Mitochondrion Saccharomyces cerevisiae Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes; Endomycetales; Saccharomycetaceae; Saccharomyces cerevisiae. REFERENCE 1 (bases 1 to 70) AUTHORS Skelly,P.J. and Clark-Walker,G.D. TITLE Conversion at large intergenic regions of mitochondrial DNA in Saccharomyces cerevisiae JOURNAL Mol. Cell. Biol. 10, 1530-1537 (1990) STANDARD simple staff_review BASE COUNT 19 a 11 c 24 g 16 t ORIGIN 1 tataaatata taagtcccgg tttcttacga aaccgggacc tcggagacgt aataggggag 61 ggggtgggtg // LOCUS YSCMTORI2A 70 bp ds-DNA ORG 28-JUN-1990 DEFINITION S.cerevisiae mitochondrial ori2 gene, 5' flank. ACCESSION M34152 KEYWORDS . SOURCE S.cerevisiae mitochondrial DNA. ORGANISM Mitochondrion Saccharomyces cerevisiae Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes; Endomycetales; Saccharomycetaceae; Saccharomyces cerevisiae. REFERENCE 1 (bases 1 to 70) AUTHORS Skelly,P.J. and Clark-Walker,G.D. TITLE Conversion at large intergenic regions of mitochondrial DNA in Saccharomyces cerevisiae JOURNAL Mol. Cell. Biol. 
10, 1530-1537 (1990) STANDARD simple staff_review FEATURES from to/span description mRNA 32 > 70 ori2 mRNA (+/- 4 bp) BASE COUNT 23 a 0 c 18 g 29 t ORIGIN 1 tttttttata agataatttt tgtaaatata taagtaataa attaagtttt atagggggag 61 ggggtgggtg // LOCUS YSCMTORI7A 70 bp ds-DNA ORG 28-JUN-1990 DEFINITION S.cerevisiae mitochondrial ori7 gene, 5' flank. ACCESSION M34154 KEYWORDS . SOURCE S.cerevisiae mitochondrial DNA. ORGANISM Mitochondrion Saccharomyces cerevisiae Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes; Endomycetales; Saccharomycetaceae; Saccharomyces cerevisiae. REFERENCE 1 (bases 1 to 70) AUTHORS Skelly,P.J. and Clark-Walker,G.D. TITLE Conversion at large intergenic regions of mitochondrial DNA in Saccharomyces cerevisiae JOURNAL Mol. Cell. Biol. 10, 1530-1537 (1990) STANDARD simple staff_review BASE COUNT 19 a 11 c 24 g 16 t ORIGIN 1 tataaatata taagtcccgg tttcttacga aaccgggacc tcggagacgt aataggggag 61 ggggtgggtg // LOCUS CFXCSMA 859 bp ds-DNA BCT 28-JUN-1990 DEFINITION C.aurantiacus 5.7 kd chlorosomal protein (cmsA) gene, complete cds. ACCESSION M33964 KEYWORDS chlorosomal protein. SOURCE C.aurantiacus (strain J-10-fl) DNA. ORGANISM Chloroflexus aurantiacus Prokaryota; Bacteria; Gracilicutes; Anoxyphotobacteria; Multicellular filamentous green bacteria; Chloroflexaceae. REFERENCE 1 (bases 107 to 554) AUTHORS Theroux,S.J., Redlinger,T.E., Fuller,R.C. and Robinson,S.J. TITLE Gene encoding a 5.7 kd chlorosomal protein in Chloroflexus aurantiacus: Regulated message levels and a predicted carboxy terminal protein extension JOURNAL Unpublished (1990) STANDARD full staff_review REFERENCE 2 (bases 1 to 859) AUTHORS Theroux,S.J., Redlinger,T.E., Fuller,R.C. and Robinson,S.J. JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by S.J.Robinson, 01-MAY-1990. 
Author address: S.J.Robinson University of Massachusetts 108 Morrill Science Center Amherst, MA 01003 FEATURES from to/span description pept 210 452 5.7 kd chlorosomal protein (cmsA) mRNA 135 > 452 cmsA mRNA binding 196 204 ribosome binding site rpt 464 495 inverted repeat BASE COUNT 158 a 232 c 283 g 186 t ORIGIN 1 gatcggcgac ttccggcggg tcattacgct gccgaccagt ctggccggcc tggaagccgg 61 tcaggccgag atggagggtg aatggttgat tgtccccttc atggcgccgc aaatggcgca 121 acgctgaggc agcctgaact gtggggtttc tcggaggctt gcctccggtg acctgaataa 181 tggagtccgt ttgataagga ggtgtgtgca tggcgacgag aggctggttc tcggagtcgt 241 cggcgcaggt ggcgcaaatc ggcgacatca tgttccaggg ccactggcaa tgggtctcga 301 atgcgctaca ggccaccgcg gcagcggttg acaacatcaa ccgcaatgct tacccgggcg 361 tgtcccggag cggctcgggc gagggagcgt tcagcagcag cccgagcaac ggcttccgtc 421 cgaagcgcat tcgctcgcgc ttcaaccgct aggctggcgg gcaggcgtga ggcgtgcgtt 481 ctgtcgcctc acgccgatcc ctcgggagcg ttatgattaa cctgctctac tacagtcata 541 ccaccgtacc ggctcatctg gaagagatta cgcgactgga tgtgctcgat cctgacgagc 601 aagtgctggt cgcgatagat ggcgttttgg tcgatcagca gggacagcgg ctgagcggcc 661 caacgctgca cgattattgt ctgatcaccg acttgcgggt gatcctgtgg gcacgagatt 721 atggccggca tttgtgttat gcctttccat tgcatgagct gacgttgatt gatgggcgcg 781 ggattgatcc gatccacggc gcagtgacga tgggctttag cgcgccagat accgatgatc 841 agatttttac gctgactct // LOCUS VVCACTA 3385 bp ds-DNA PLN 28-JUN-1990 DEFINITION Volvox carteri actin gene, complete cds. ACCESSION M33963 KEYWORDS actin. SOURCE Volvox carteri (female strain HK10) DNA, clone lambda-VAc21. ORGANISM Volvox carteri Eukaryota; Plantae; Thallobionta; Chlorophycota; Chlorophyceae; Volvocales; Volvocaceae. REFERENCE 1 (bases 1 to 3385) AUTHORS Cresnar,B., Mages,W., Mueller,K., Salbaum,J.M. and Schmitt,R. TITLE Structure and expression of the single Volvox carteri actin gene JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by W.Mages, 01-MAY-1990. 
Author address: W.Mages Lehrstuhl fuer Genetik Universitaet Regensburg Universitaetsstr.31, D-8400 Regensburg, FRG FEATURES from to/span description pept 450 509 actin, exon 2 (first expressed exon) 613 708 actin, exon 3 780 926 actin, exon 4 1009 1074 actin, exon 5 1150 1197 actin, exon 6 1396 1567 actin, exon 7 1667 1821 actin, exon 8 1959 2142 actin, exon 9 2425 2630 actin, exon 10 pre-msg 195 > 2630 actin mRNA and introns IVS 258 439 actin intron A IVS 510 612 actin intron B IVS 709 779 actin intron C IVS 927 1008 actin intron D IVS 1075 1149 actin intron E IVS 1198 1395 actin intron F IVS 1568 1666 actin intron G IVS 1822 1958 actin intron H IVS 2143 2424 actin intron I signal 120 124 CAAT box signal 164 169 TATA box BASE COUNT 775 a 830 c 928 g 852 t ORIGIN 1 aagcttgaag agaggtctta tggaccctag tcggtcacga tccagacacc aaacgtaacg 61 aaaaaaagga ataattagcc gggcttgttg atgcatattc gcggctttac attatcccgc 121 caatacgttc acgtacggca caggacatcc ttttcactcg agctataaac tatcatcatc 181 cagattgctt gttttacctt gcaccagtta ctatcagcac caaccggccc ttgacttcgc 241 acagccttct aaaaaaggta cgcgcgacgt gtttagcgtt cgggcagccg cagctaggag 301 gtttggtggc tgcgaggccc ttctgcccca ggataagaag gggattaact tccgttgcca 361 taacatgaag tccggggcct tgtccatgtt cgatcgccaa cgatgcaacc tatgctgtct 421 taccgctggg cttatgcagg gcagcaacaa tggctgagga gggcgaggtt tctgctctgg 481 tgtgcgacaa cggctcgggc atggtcaagg tacgttttct ggctgctgct cggttgggcc 541 cggccaggcc caccccgcag ccttgctgag cttttgatgg atcgcatcta acacgtggcc 601 cctgatttgt aggctggctt tgctggtgat gatgccccac gagctgtgtt tccgagcatt 661 gttggtcggc cccgccatac gggtgtgatg gtcggcatgg ggcagaaggt gaatagaatg 721 acggcgatgc aggcacggac tcctcgacat ttttataatg atctttttgt tttcgccagg 781 attcctacgt gggcgacgag gcacagtcta agcgtggtat cttgacgttg cgctacccta 841 ttgagcacgg tatcgttacc aactgggatg acatggagaa gatctggcac cacactttct 901 ttaatgagct tcgtgtggca ccggaggtgc gtctgcgtac tagcttcctt ggctttcagc 961 tatggaccag ttcgacattg acaactgctt tccttcccaa atctgtagga gcacccagtc 1021 cttctcacgg aggcgcctct taatcccaag 
gcgaaccgtg agaagatgac gcaggtacgg 1081 atattgtttt atgcacatac cgttgcgttc gttcccaagc ttgcctcccg ataatctccg 1141 tttgcgcaga tcatgttcga gaccttcaac gtccctgcca tgtatgtggc gatccaggtg 1201 acgattgggg gaggggactt agtaatacct cgattgctgt tgttcgattc acccctctca 1261 tttcgatttg cgcacttggg gtggcatcat gtttcgtgcg tcggggtagt tgttaaccct 1321 gcaaccctag ctactgccac taatgacagt aggaaagtaa tatttaattg ctgtccttca 1381 tgctgacaac tgcaggccgt gctgtcgctg tatgctagcg gtcgtacgac gggtatcgtt 1441 ctagattccg gtgacggtgt aacccacacg gtgcccatct atgaaggcta cgcgctgccg 1501 cacgccatcc tgcgtctgga ccttgccggc cgtgacctga cggactacct gatgaagatt 1561 ctaatgggtg aggcattcgg atagggggac caaaataccg taccgcgaga aatgggttat 1621 gagtcgtgtg ttttgacttg tgttgaacgg cttcccttcg ttacagagcg tggttactca 1681 ttcacgacca ctgctgagcg tgaaatcgtg cgcgacatca aggagaagct ctgctatgtg 1741 gcgctggatt tcgagcagga aatggcaaca gcagcctcca gctcggcgct tgagaagacg 1801 tacgagctgc ctgatggcca ggtgcggccg tagctagacg ccgacgaata gcctaaaact 1861 ttcggtgatg atcagcttag atacatttga ttgtgtcggc attgcacgaa aaaagtcgtt 1921 gtccataatg ttaaaattgc cctcttttct ccctgcagcc aatcacaatt ggcaacgagc 1981 gcttccgttg ccctgaggtg ctgtacaacc ctagcctaat cggcatggag gcggtcggta 2041 tccacgacac tacctttaac agcatcatga agtgcgatgt cgatatccgc aaggatctgt 2101 acaacaacat cgtgctctct ggtggaacga ccatgttccc tggtaagcaa tgcatactac 2161 tgttagactt caggacatgt tgtgatcgca ttagcgatta gaaccgggtt agcgcgggac 2221 agacaggagg tacggtgttt ggggctcggg gaactcttat gaggatgaat aagcggttct 2281 tagcgtctat gtcacggacg gatctatgga attccgtcag tatctggagg cttgcatgtt 2341 gcatcgctcc ccctgaactc catcgtcctc cgtgaaggtc acagtgcctt gacaaatgcg 2401 atctgtgctg ttgaacaatt gcaggcatcg ctgatcgcat gaccaaggag atcacggccc 2461 ttgcgccgtc tgccatgaag atcaaggtcg tcgccccgcc ggagcgcaag tactctgtgt 2521 ggattggtgg ttcaatcctg gcgtccttgt cgaccttcca gcaaatgtgg atcgcgaaga 2581 gcgagtacga cgagagtggt ccgtcgattg tgcaccggaa gtgcttctaa gcataatagc 2641 ttgcggactg ttattcggcc tctttcggtt ataactgacc ggaagcgggt tactcttcag 2701 tcaaggcgga tgcctatgat tttgcaaaaa ttcgagcgga 
gatgtcatct tggtttcaag 2761 ctgcgtgggt gagaaatcaa ctgtgggatc catgtcgaca gcttaacaac agtctctacc 2821 atttcatggt atgcggctgt agccgtagcg agtacgtcgt catgagcagc ggtagtttga 2881 tcgtgcgaag tagtctgata ggggcagggc gcgtggggat ggctaataga acgaagagga 2941 tgtgtgcatt caagcgagtg gatgtcggcg attggtctag gccaagcaaa gtgcgtgtaa 3001 gggaaagctg gatgacacga ccattgctcg tgtgctttcg gaacttccac tcttgaactc 3061 aggcctatgc gctgccttga gtatcctgcc ggataaaagg cgggaggcgg taacgttagt 3121 agacccttcc aggggtataa atggtcaatt acaacaaaac tgtaacataa cgaagcgctg 3181 cacgtagtac gaaatccgtt gcaaacgtga aacccagcta cttgccaata cttaaacgcg 3241 catcagcatt gtgtacaaga acgtcaatga ggcattgcgt aaattacaca tgtgagcact 3301 aagaaaacag ctgtatgttg tcacgtttat tcactagcgc acagcccgca agtaagcggt 3361 ggcaaagccc agctttggta caaag // LOCUS HAMCD44 1089 bp ss-mRNA ROD 28-JUN-1990 DEFINITION Hamster hyaluronate receptor (CD44), complete cds. ACCESSION M33827 KEYWORDS CD44 hyaluronate receptor. SOURCE Hamster, cDNA to mRNA. ORGANISM Cricetulus sp. Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Cricetidae; Cricetinae; Cricetini. REFERENCE 1 (bases 1 to 1089) AUTHORS Aruffo,A., Stamenkovic,I., Melnick,M., Underhill,C.B. and Seed,B. TITLE CD44 is the principal cell surface receptor for hyaluronate JOURNAL Cell (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by B.Seed, 23-APR-1990. 
FEATURES from to/span description pept 1 1089 hyaluronate receptor (CD44) BASE COUNT 299 a 282 c 277 g 231 t ORIGIN 1 atggacaagt tttggtggca cgcagcttgg ggactctgcc tcttgccgct gagcctggcg 61 cacgagcaga tcgatttgaa cataacctgc cgctatgcag gtgtattcca cgtggagaaa 121 aatggccgct acagcatctc acggactgag gcagctgacc tctgccaagc tttcaacagc 181 actctgccca ccatggacca gatggtgatg gccctgagca agggctttga aacatgcagg 241 tatgggttca tagaaggcca cgtggtgatc ccgaggatcc agcccaatgc catctgtgca 301 gccaaccaca ctggggtgta tatcctcaca tccaacacat ctcactacga tacatattgc 361 ttcaatgcct cagcacccct tgaagaagac tgtacatctg tcacagacct gcccaattcc 421 ttcgaaggac cagttaccat aactattgtc aaccgtgatg gtacccgcta cagcaagaag 481 ggcgagtata gaacacacca agaagacatt gatgcctcaa ataccacaga tgatgatgtc 541 agcagcggat cctccagtga gaagagcacc tcagggggct atgttttcca cacctacctt 601 cccactatac actcaactgc agaccaggat gatccctact tcatcgggag caccatggcc 661 accagagacc aagactcatc catggatccc agggggaatt ccctcactgt gactgatgga 721 tccaaattaa ctgaacactc aagtgggaat caagacagtg ggcttaactc aacttctcgt 781 cctggaggaa aacctcgagt tccagaatgg ctcatcgtct tggcatctct cctggcgctg 841 gctctgattc ttgctgtttg cattgctgtc aacagtagga gaaggtgtgg acagaagaaa 901 aagctggtga tcaacagtgg caatggaaag gtggaggaca ggaagccaag tgagctcaac 961 ggggaggcca gcaagtctca ggaaatggtg catttggtga acaaggaacc atcagagact 1021 cctgaccagt ttatgacagc tgatgagacc cggaatctgc agaatgtgga catgaagatt 1081 ggggtgtag // LOCUS DROGNBPSA1 441 bp ds-DNA INV 28-JUN-1990 DEFINITION D.melanogaster guanine nucleotide-binding protein (Gs) alpha subunit gene, exon 1. ACCESSION M33996 KEYWORDS guanine nucleotide-binding protein. SEGMENT 1 of 3 SOURCE D.melanogaster (strain Canton S) embryo DNA, clones lambda-DGs-[4,5,7,10,11,12]. ORGANISM Drosophila melanogaster Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta; Pterygota; Neoptera; Holometabola; Diptera; Brachycera; Cyclorrhapha; Schizophora; Drosophiloidea; Drosophilidae. REFERENCE 1 (bases 1 to 441) AUTHORS Quan,F. and Forte,M.A. 
TITLE Two forms of Drosophila melanogaster Gs-alpha are produced by alternate splicing involving an unusual splice site JOURNAL Mol. Cell. Biol. 10, 910-917 (1990) STANDARD simple staff_entry FEATURES from to/span description pre-msg 362 > 441 guanine nucleotide-binding protein (Gs-alpha) mRNA and introns (alt.) pre-msg 174 > 441 Gs-alpha mRNA and introns (alt.) IVS 426 > 441 intron A signal 152 157 promoter region (put.) variant 365 365 c in DNA, g in cDNA BASE COUNT 140 a 89 c 85 g 127 t ORIGIN 1 bp upstream of EcoRI site. 1 gaattcgcta ttctacatag ctcgggatac tttattttat tattaatatt cgctaaaata 61 gcacatcgtc atttggcata gatgtttttg gcccagtaac atatctatct acatcaccgc 121 ccgcactatc cgataggcag actgactatc gataaaagta tctacttcgt tgcatcccta 181 ggttgataca taaaataaaa aatatataaa aagtgagtta gtactggtat taaaagtaat 241 ttttaagcta caaattgatg aattgcgagc acacagagac taagaacatt gggaatccgg 301 aaaacagtgg ctggcgcgtc gggtgcgata gaattgcctg tttttcagac cccaaaagtc 361 gctcccaacc gagtcgtgca cctatagtct tggcccgaag ttgctcccat gaataaaaat 421 gttcggtaat ttcacggtaa a // LOCUS DROGNBPSA2 897 bp ds-DNA INV 28-JUN-1990 DEFINITION D.melanogaster guanine nucleotide-binding protein (Gs) alpha subunit gene, exons 2,3 and 4. ACCESSION M33997 KEYWORDS guanine nucleotide-binding protein. SEGMENT 2 of 3 SOURCE D.melanogaster (strain Canton S) embryo DNA, clones lambda-DGs-[4,5,7,10,11,12]. ORGANISM Drosophila melanogaster Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta; Pterygota; Neoptera; Holometabola; Diptera; Brachycera; Cyclorrhapha; Schizophora; Drosophiloidea; Drosophilidae. REFERENCE 1 (bases 1 to 897) AUTHORS Quan,F. and Forte,M.A. TITLE Two forms of Drosophila melanogaster Gs-alpha are produced by alternate splicing involving an unusual splice site JOURNAL Mol. Cell. Biol. 
10, 910-917 (1990) STANDARD simple staff_entry FEATURES from to/span description pept 265 540 guanine nucleotide-binding protein (Gs-alpha-S), exon 2 607 703 guanine nucleotide-binding protein (Gs-alpha-S), exon 3 760 + 880 guanine nucleotide-binding protein (Gs-alpha-S), exon 4 pep$ 265 540 guanine nucleotide-binding protein (Gs-alpha-L), exon 2 607 703 guanine nucleotide-binding protein (Gs-alpha-L), exon 3 760 + 880 guanine nucleotide-binding protein (Gs-alpha-L), exon 4 pre-msg < 1 > 897 Gs-alpha mRNA and introns IVS < 1 29 intron A IVS 541 606 intron B IVS 704 759 intron C IVS 881 > 897 intron D BASE COUNT 266 a 225 c 226 g 180 t ORIGIN About 1.4 kbp downstream of segment 1. 1 ccatctacta taccattgta cgaatgcagc ctgctgcaga gatccactga agcgctgagc 61 caacgccaac ggtggtgcca ctgagatcgg agatcggaga tcgcagaccg gagaccgacg 121 agcacacaag ggattgtggt ttgatccgag gaacccaagc actcagctat actggagcag 181 aacggagaaa gcacaagggg tgccgagtcg cccgacgcga accaggaaaa ccgagcgacg 241 tgttggtgga ccccgtgggc tgcgatgggt tgctttgggt cgcccacctc caagcagtcg 301 gacgtgaact cggaggactc gaagagccag aagcgccgga gcgatgcaat atctagacag 361 ttgcagaagg acaaacagct ctacagggcc acacacaggc tgctcctcct gggggcgggc 421 gagtccggca aatcaaccat agtcaagcaa atgcgaatat tgcatgtcga cggattttct 481 gactcggaaa agaaacagaa aattgatgat attaaaaaga atattcgaga cgctatcttg 541 gtgagttcca tcccgccggc catctaagcc caaacacttg cttaactcct tgccaatccc 601 ttacagacta ttacaggagc catgagcaca cttaatccac ctgtagcttt agaaaagaag 661 gaaaatgaac ccagagtgga gtacattcag gattatgcat ctagtaagag cctctgccgc 721 cccaacttaa gttgttatca accattatct gccccgcagg tccggacttt aattatcctc 781 ctgaatttta tgaacataca gaagaactat ggaaagacaa gggcgttctt caaacctatg 841 agaggtcgaa tgagtatcaa ttaatcgatt gtgcgaaata gtaagtaatg cccaccc // LOCUS DROGNBPSA3 1380 bp ds-DNA INV 28-JUN-1990 DEFINITION D.melanogaster guanine nucleotide-binding protein (Gs) alpha subunit gene, exons 5,6,7,8 and 9. ACCESSION M33998 KEYWORDS guanine nucleotide-binding protein. 
SEGMENT 3 of 3 SOURCE D.melanogaster (strain Canton S) embryo DNA, clones lambda-DGs-[4,5,7,10,11,12]. ORGANISM Drosophila melanogaster Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta; Pterygota; Neoptera; Holometabola; Diptera; Brachycera; Cyclorrhapha; Schizophora; Drosophiloidea; Drosophilidae. REFERENCE 1 (bases 1 to 1380) AUTHORS Quan,F. and Forte,M.A. TITLE Two forms of Drosophila melanogaster Gs-alpha are produced by alternate splicing involving an unusual splice site JOURNAL Mol. Cell. Biol. 10, 910-917 (1990) STANDARD simple staff_entry FEATURES from to/span description pept + 4 132 guanine nucleotide-binding protein (Gs-alpha-S), exon 5 206 385 guanine nucleotide-binding protein (Gs-alpha-S), exon 6 445 569 guanine nucleotide-binding protein (Gs-alpha-S), exon 7 746 828 guanine nucleotide-binding protein (Gs-alpha-S), exon 8 909 1055 guanine nucleotide-binding protein (Gs-alpha-S), exon 9 pep$ + 4 132 guanine nucleotide-binding protein (Gs-alpha-L), exon 5 206 385 guanine nucleotide-binding protein (Gs-alpha-L), exon 6 445 569 guanine nucleotide-binding protein (Gs-alpha-L), exon 7 755 828 guanine nucleotide-binding protein (Gs-alpha-L), exon 8 909 1055 guanine nucleotide-binding protein (Gs-alpha-L), exon 9 pre-msg < 1 1115 Gs-alpha mRNA and introns (alt.) pre-msg < 1 1176 Gs-alpha mRNA and introns (alt.) pre-msg < 1 1243 Gs-alpha mRNA and introns (alt.) IVS < 1 3 intron D IVS 133 205 intron E IVS 386 444 intron F IVS 570 745 intron G (for Gs-alpha-S) (no splice consensus) IVS 570 754 intron G' (for Gs-alpha-L) IVS 829 908 intron H variant 780 780 t in DNA, c in cDNA signal 1083 1088 polyA signal signal 1154 1159 polyA signal signal 1221 1226 polyA signal BASE COUNT 441 a 265 c 254 g 420 t ORIGIN About 500 bp downstream of segment 2. 
1 aagcttcctg gaccgagtga gcacaatcaa gaatccaaac tacaccccta atgagcagga 61 tattcttcgg tgccgtgttt tgacttctgg aatatttgaa acaagatttc aagtggacaa 121 agtaaacttt cagtaagtgt aatcccaagt gatatcgcaa tattttagtt tctattccat 181 tctaatgata tgcattcgtg tgcagcatgt tcgatgtcgg tggccagcgg gacgagcgta 241 ggaaatggat tcagtgtttc aatgatgtaa ctgctatcat attcgtaact gcgtgctcaa 301 gttataacat ggttttgcgg gaagatccca cccagaaccg acttcgagaa tctttggatt 361 tgttcaagag tatttggaac aacaggtgaa ccgagcaatt tggcgccatc tcgacaaatg 421 gaaactaacg agttgttctt tcagatggct tcgcacgatt tctattatac tatttttaaa 481 taagcaagat ttgttagcag agaaaattaa ggctggaaaa agtaaattgt cggaatattt 541 ctccgagttt aacaaatacc aaacgccaag taagtaaaat agtataaccc atagtagagg 601 ccaattgagg gcgagggagt catcagatca accgactcag atctggccaa gaaattcgca 661 attccttttc aaatttgcaa accgtgccca agttaactta agaaatttaa tggataatgt 721 attttgtcta atttcaaatg tgctgtcgac acaggtgacg caataatgga atccaatgat 781 gacccagaag taatacgagc aaaatatttc atacgagacg agtttctggt gggttatatc 841 acatctccgt ttcttcctcc acctatctaa ctagccttaa tccttaatgg gatctctttc 901 ttatatagcg tatatctacc gctagcggag acggaaaaca ctactgctat ccacatttca 961 catgcgccgt tgacacagaa aacattaaac gtgtgtttaa tgattgcaga gacattattc 1021 aaaggatgca ccttcgtcaa tatgaattgt tataggttat ccccatcgcc gtaatgcaag 1081 taaataaaaa tattaatgac ttactgtttt aaatataatt atcaaatgta atttgtattt 1141 aaatagtccc ttgattaaaa caaaaaattc gctgcattta gatctggtat gacctttcac 1201 aaaactcaac attaaagaat aataaataaa catgttaata acagacacgc ttgtttcgat 1261 tattttggct attctaccag gatgcacctg ctttattttc tatagtccct tgacatgaag 1321 cattttagag tatttccgct atcagagtcc gtatccatat ccatgggcat ttacattcac // LOCUS SYCSBEFA 850 bp ds-DNA BCT 28-JUN-1990 DEFINITION Synechocystis sp. cytochrome B559 alpha subunit (psbE) and beta subunit (psbF) genes, complete cds. ACCESSION M33897 X06988 KEYWORDS cytochrome B559 alpha subunit; cytochrome B559 beta subunit; photosystem II protein. SOURCE Synechocystis sp. (strain PCC6803) DNA, clone pKW1261. ORGANISM Synechocystis sp. 
Prokaryota; Bacteria; Gracilicutes; Oxyphotobacteria; Cyanobacteria; Chroococcales. REFERENCE 1 (bases 1 to 553) AUTHORS Pakrasi,B.P., Williams,J. and Arntzen,C. TITLE Targeted mutagenesis of the psbE and psbF genes blocks photosynthetic electron transport: Evidence for a functional role of cytochrome B559 in photosystem II JOURNAL EMBO J. 7, 325-332 (1988) STANDARD full staff_entry REFERENCE 2 (bases 1 to 850) AUTHORS Pakrasi,H.B., Nyhus,K.J. and Granok,H. TITLE Targeted deletion mutagenesis of the beta subunit of Cytochrome B559 protein destabilizes the reaction center of photosystem II JOURNAL Z. Naturforsch., C, Biosci. 45, 423-429 (1990) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by B.Himadri, 01-MAY-1990. Washington University Dept. of Biology Campus Box 1137 St. Louis, MO 63130 FEATURES from to/span description pept 121 366 cytochrome B559 alpha-subunit (psbE) pept 403 537 cytochrome B559 beta subunit (psbF) binding 109 112 ribosomal binding site (put.) binding 392 396 ribosomal binding site (put.) 
BASE COUNT 197 a 196 c 197 g 260 t ORIGIN 1 gctaccccag caccgaggtg gcaatggttc cctaggcggc tcacaaaata gtagactaga 61 ctctacttgc tttgcatttg tcagtcaatg ttgttttgaa aaattgaagg agaacacaaa 121 atgtcaggga ctaccggcga gcgtccattt tccgatattg tcaccagcat tcgctactgg 181 gtgatccaca gcatcaccat cccgatgttg tttattgctg gttggttgtt tgtcagcacg 241 ggcttagcct acgatgcttt tggcactccc cgccccgatg aatatttcac ccagacccgt 301 caagagttgc ccattctcca ggaacgctac gacattaatc aggaaattca agagtttaat 361 caataaaaca tttaattgtt cttttttagt tggtaattaa caatggcaac ccaaaatcct 421 aatcaaccgg ttacttatcc catttttacg gtgcgctggc tggcggttca caccctggcg 481 gtgccctctg tcttctttgt cggggcgatc gccgcgatgc aatttattca acgctaggag 541 tttttcatgg acagaaattc aaacccaaac cgccaaccgg tggaattgaa ccgcacttct 601 ttatacctgg gtctattgtt ggtggctgtg ttggggattt tgttctccag ctatttcttt 661 aactaaactt ttttaatacg caatttagga ggcatggtat gttcgcagaa ggcagaatcc 721 ctttgtgggt ggtgggtgta gtggccggta ttggcgccat tggtgttcta ggattatttt 781 tctacggagc ctatgctggt ttaggttctt ccatgtaatc gagggctagc cgccacacaa 841 tatcatggtt // LOCUS ECOPURCA 2060 bp ds-DNA BCT 28-JUN-1990 DEFINITION E.coli 5-phosphoribosyl 5-aminoimidazole 4-N-succinocarboxamide synthetase (SAICAR) and dapX gene, complete cds. ACCESSION M33928 KEYWORDS 5'-P-ribosyl 5-aminoimidazole 4-N-succinocarboxamide synthetase; SAICAR synthetase. SOURCE E.coli (strain K12) DNA. ORGANISM Escherichia coli Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Facultatively anaerobic rods; Enterobacteriaceae. REFERENCE 1 (bases 1 to 2060) AUTHORS Tiedemann,A.A., DeMarini,D.J., Parker,J. and Smith,J.M. TITLE DNA sequence of the purC gene encoding SAICAR synthetase and organization of the dapA-purC region of Escherichia coli K12 JOURNAL Unpublished (1990) STANDARD full staff_entry REFERENCE 2 (sites only) AUTHORS He,B., Shiau,A., Choi,K.Y., Zalkin,H. and Smith,J.M. 
TITLE Genes of the Escherichia coli pur regulon are negatively controlled by a repressor-operator interaction JOURNAL Unpublished (1990) STANDARD full staff_entry COMMENT Authorin sequence for [1] kindly submitted by J.M.Smith, 27-APR-1990. Seattle Biomedical Research Institute 4 Nickerson Street Seattle, WA 98109-1651 e-mail: jms0234@max.acs.washington.edu FEATURES from to/span description pept 1281 1994 5'-phosphoribosyl-5-aminoimidazole-4- N-succinocarboxamide synthetase (EC 6.3.2.6) pept 637 1068 dapX protein signal 1106 1111 -10 region binding 1112 1127 purine repressor protein binding site (PUR box) site 1180 1216 repetitive extragenic palindrome BASE COUNT 501 a 504 c 587 g 468 t ORIGIN 1 gcatgccggt ttgctgtaaa gtttagggag atttgatggc ttactctgtt caaaagtcgc 61 gcctggcaaa ggttgcgggt gtttcgcttg ttttattact cgctgcctgt agttctgact 121 cacgctataa gcgtcaggtc agtggtgatg aagcctacct ggaagcggca ccgcttgcgg 181 agccttcatg ccccggctgg aatgattttg ccggtgacct ccggtgttat gcaatcccgg 241 tgaccaacgg tagtggtgct gtcggtaagg cgctgacatt cgtccaccag cccagccgct 301 ggcactggtt tctggcgcgc gtacccagtt cacgggcgat accgcttcat tgctggtgga 361 aaatggtcgt ggcaatactc tgtggccgca ggtggttagc gtgctgcagg cgaaaaacta 421 caccatcacc caagtgatga tgctggtcag acactgacca ccgattgggt acaatggaac 481 cgtctggacg aagacgagca gtatcgtggt cgttatcaaa tctctgttaa gccgcagggt 541 tatcagcagg cggttacggt taaactgctg aacctggaac aggcgggcaa accggttgca 601 gacgcggctt ccatgcagcg ttacagcacg gagatgatga acgttatttc cgccggtctg 661 gataaatctg ccactgacgc cgcgaaccgt gcgcaaaatc gtgcctccac cactatggac 721 gtacaaagtg cagctgatga caccggttta ccaatgctgg tcgtacgcgg gccgttcaat 781 gtggtttggc aacgtctgcc agcggcgctg gaaaaagtgg gcatgaaagt gaccgacagc 841 acccgttcgc agggcaacat ggccgtaact tataagccgc tgtctgacag cgactggcag 901 gaactgggcg cgagcgatcc aggcctggca tccggtgact ataaactgca ggttggcgat 961 ttagataacc gcagcagcct acagttcatc gatccgaaag gtcatactct gactcagagt 1021 cagaacgacg cgctggtagc tgtcttccag gctgcgttta gcaagtaaaa atacagggct 1081 ggaatcatcc ggcccttttt tctgatatga 
tacgcaaacg tgtgcgtctg caggaaaacg 1141 cgattttagc ggtaattcgc acgaaatttg tttgtcggac gtagttcgga taaggcgttc 1201 acgccgcatc cgacaaaaca tccggcacac cagacagcaa aagattttaa aacgttaatt 1261 cacacccagg agtgataaag atgcaaaagc aagctgagtt gtatcgtggt aaagcgaaaa 1321 ccgtatacag cacggaaaac ccggacctgt tggtgctcga attccgcaat gatacgtcag 1381 caggggatgg cgcgcgcatt gagcagtttg atcgcaaagg tatggtgaac aacaagttca 1441 actacttcat tatgagcaaa ctggctgaag cgggtatccc gactcaaatg gagcgtctgc 1501 tctccgatac cgaatgtctg gtgaaaaagc tggatatggt gccggttgag tgtgtcgtgc 1561 gtaaccgtgc tgctggctct ctggtgaaac gtcttggaat cgaagaaggt attgagctga 1621 acccgccgct gttcgatctg ttcctgaaaa acgacgccat gcacgatccg atggtcaacg 1681 aatcttactg cgaaaccttt ggctgggtga gcaaagagaa cctggcgcgt atgaaagagc 1741 tgacctacaa agcgaacgac gtgctgaaaa aactgttcga tgatgctggt ctgattctgg 1801 tcgacttcaa gctggaattt ggtctgtaca aaggcgaagt ggtactgggt gatgagttct 1861 ccccggacgg tagccgcctg tgggacaaag aaacgctgga gaaaatggac aaagaccgtt 1921 tccgccagag cctcggtggc ctgatcgaag cctatgaagc cgtcgcccgc cgcctgggtg 1981 tacagctgga ctgatttttc tgttcatcat cttgccgtgc agctggcacg gcaagacaac 2041 cgctctcgga atatatgcat // LOCUS PIGSGLT1A 3649 bp ss-mRNA MAM 28-JUN-1990 DEFINITION Pig Na+/glucose cotransporter protein (SGLT1) mRNA, 3' end. ACCESSION M34044 KEYWORDS Na+/glucose cotransporter. SOURCE Pig kidney cell line LLC-PK1, cDNA to mRNA, clone pPSGT-[B1,C2]. ORGANISM Sus scrofa Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Artiodactyla; Suiformes; Suidae. REFERENCE 1 (bases 1 to 3649) AUTHORS Ohta,T., Isselbacher,K.J. and Rhoads,D.B. TITLE Regulation of glucose transporters in LLC-PK1 cells: Effects of glucose and monosaccharides JOURNAL Unpublished (1990) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by T.Ohta 03-MAY-1990. 
Mass General Hospital Building 49 13th Street Charlestown, MA 02129 FEATURES from to/span description pept < 1 1818 Na+/glucose cotransporter (SGLT1) signal 1935 1940 polyA signal (put.) signal 2086 2091 polyA signal (put.) signal 2238 2243 polyA signal (put.) signal 3205 3210 polyA signal (put.) signal 3628 3633 polyA signal (put.) BASE COUNT 905 a 790 c 871 g 1083 t ORIGIN 1 ttcttcctgg ctggacgaag tatggtgtgg tggccggttg gagcttctct gtttgccagt 61 tatatcggaa gcgggcattt tgttgggctg gcaggaactg gagcagctgc aggcattgcc 121 actggtggct ttgaatggaa tgctctgatt tgggtggttg ttctgggctg gctgtttgtc 181 cccatttaca ttaaggcagg ggtggtgacg atgccagagt atctgcggaa gcgatttgga 241 ggcaagcgga tccaggtcta cctctctatc ctgtccctga tgctctacat tttcaccaag 301 atctcggcag acatcttctc tggggccata ttcatcacct tggccttggg cctggatctg 361 tacctggcca tctttctctt actggcaatc actggccttt acacaatcac agggggcctg 421 gctgctgtga tttatacgga taccttgcag acagcaatca tgctggtggg gtcttttatc 481 ctaactgggt ttgcttttca tgaagtggga ggttatgatg ccttcatcga aaaatacatg 541 aatgccattc cgactgtgat ttctgatgga aatatcacca tcaagaaaga atgttatgcc 601 ccgagggctg actccttcca tattttccga gatcctctca agggagacct gccatggcct 661 gggctcacct ttgggctgtc catcctcgcc ctgtggtact ggtgcacaga tcaggtcatt 721 gtgcagcgct gtctctcggc caaaaacatg tcacatgtga aggctggctg cgtcatgtgt 781 gggtacttta agctgctgcc catgtttgtc atagtgatgc cagggatgat cagccgcgtc 841 ctgtacacag aaaaaattgc ctgcaccgtc ccctcggaat gtgagaaata ttgcggcacc 901 aaggttggct gttccaacat tgcctacccg accttggtgg tggaactcat gcccaatgga 961 ttgcgaggcc tgatgctgtc ggtcattttg gcctctctca tgagctccct gacctccatc 1021 ttcaacagcg ccacgactct cttcaccatg gatgtctacg ccaagatccg gaagagagca 1081 tctgagaaag agctcatgat tgcgggaagg ctgttcatcc tggtgctgat tggcatcagc 1141 attgcctggg tgcccattgt gcagtcagca caaagtgggc agctcttcga ttacatccag 1201 tctgtcacca gttacttggg accacccatc gcagctgtct tcctacttgc tattttctgc 1261 aagagagtca atgaggaggg agccttttgg ggactggtca taggatgtat gattgggctg 1321 gcccgtatga ttaccgagtt tgcctatgga accgggagct gcgtggagcc cagcaactgt 1381 
cccacaatta tctgtggtgt gcactacttg tactttgcca tcatcctctt tgtcatttcc 1441 atcatcatcg tcctggtcgt ctccctcttc accaagccca ttccagatgt gcatctctac 1501 cgcctgtgtt ggagcctgcg caacagcaaa gaggagcgta ttgacctgga tgcagaggag 1561 gaggacattc aagaagcccc agaagagacc attgaaatag aagttcctga ggagaagaaa 1621 ggatgcttca ggaggaccta tgacctgttt tgtggcctgg accagcagaa gggccccaaa 1681 atgaccaagg aagaggaggc agccatgaag ctgaagatga cagacacctc agagaagcct 1741 ttgtggcgga cagtagtgaa catcaatggc attatcctgc tgaccgtggc cgtattctgc 1801 catgcatatt ttgcctgaaa tcctgccttc tgccacaggc tgtgccctga aggctggact 1861 ctttactctg tttcctttta gtcccattct gtgctgtcaa agggacacca gccagttgta 1921 aattttgcct agtcaataaa tgtgtacatt tgtaattagc cagaaaaaaa aatcagtgat 1981 ctgttattaa cttatgtatt tgaggccagt gtcatacagt catctgtaca tatcagacct 2041 gcagaaggaa gtccactcag tcggatgtct aggaaaaggc agactaagaa acagaagtcc 2101 cgtgatgtct gatgcaagtt tgtctcaggt agattcagca tgtcagtgtt gtttgtgatc 2161 cttgaatgtt attttaagag ttgtggtcgc cctggttcgg ccacttccta taaacctcac 2221 ctctcataat ttttcaaaag aaaaaatccc ccccctcccg atatcatgtc aagtttttcc 2281 ggcacaagag taacaatttg ataagttgat gaacacattg aagctgagga tggaagtttg 2341 atggagtccc atcctgggtc tttgctctcc aaaggagagg tctgggatgg taatacttcc 2401 cccaaagaga gggacccagg tgccaatctc aggccactag ggaaatcagg taacttgagt 2461 caaggcgatt ttctgcattt cttagaagtg tccattgtga gttccttggg ttgagatttc 2521 attcattttg gcatatcttt tgcctgtact gtcttagtca ctcagcatgt ctgtggacca 2581 aatgaatgat ataacatttt gttttataat gtttgcttta tttttggtta actttttctt 2641 atggctgcac aagtacaacc tctcgtgact ataatgaatt gggtgcattt ccataggcct 2701 gtcttcctca tggtgccaat gctggcatac aggtctagaa gctgactctc tcacataata 2761 gagcaaatgc acctcttgga ggcagccgtc atctcaaagt aaccacactc tcctgaaaca 2821 ctgaaacttt ttggcctgag accctccttc tatgcgctca catgaccata gctcatattt 2881 ttttgaggtg tcttgattga tgatcatcca ctgaccaagg ttagagtggg tggtctctgt 2941 tccaggtggt agcacttgac actcttattt cttgtttcct agggagaatt tttaaaaatt 3001 aaatttaatt aatttaattc ttttaattcc accagtgaaa tattaattat aatataatcc 3061 tgtggaagtc 
atggtgttgg tattgactgg tttcaagcta gcttggaaaa gagaagtgat 3121 ctgagattct gtggagctaa gagcggttcc ttgggtataa aagtggctta acaggtatac 3181 aatggcttgt tcttgtataa ggaaaagaaa attttgatat gccctatatt ccccttaaag 3241 ctcaggcaga gaagatgcca aaagtggaga atttcacatt attcagagta cttgtgggag 3301 tttgtgatgt ttttgagctt tgagaggata aagagggatt atagaaagtt ggggaatgga 3361 acaggcataa gtatctagat tgtcctagat tctgtttccg actttgccaa tgattcactg 3421 tgtatattgc agtaaattat tttgtctcaa ctgtgtattt tcctctttga aaaatggaga 3481 ccttggtgtt atagatctac ctacctcctc tgggtgtgac aaggtcattt tgaaaaggtt 3541 agatataata catgaaagat gccaagcaaa tttatattaa gcttcattaa tgtagtccca 3601 tacattaaat tttactactt tggtaaaaat aaaagacatt aaaaatgat // LOCUS RATSHIIA 3064 bp ss-mRNA ROD 28-JUN-1990 DEFINITION Rat K+ channel protein (KShIIA) mRNA, complete cds. ACCESSION M34052 KEYWORDS K+ channel protein. SOURCE Rat brain, cDNA to mRNA. ORGANISM Rattus rattus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 3064) AUTHORS McCormack,T., Vega-Saenz de Miera,E.V. and Bernardo,R. TITLE Molecular cloning of a member of a novel class of K+ channel genes in mammals JOURNAL Proc. Natl. Acad. Sci. U.S.A. (1990) In press STANDARD full staff_entry COMMENT Draft entry and computer-readable [or printed] sequence for [1] kindly submitted by R.Bernardo, 07-MAY-1990. 
FEATURES from to/span description pept 80 1771 K+ channel protein (KShIIa) BASE COUNT 777 a 718 c 729 g 840 t ORIGIN 1 gccacggtga acatctggcc cacgagagct ttaacttggt gctgtgttcg ccttccctag 61 tcatgtctga gccacagaga tgggcaagat cgagaacaac gagagggtga tcctcaatgt 121 cggaggcacc aggcacgaaa cctaccgcag cactctcaag acccttcctg gaactcgcct 181 ggcccttctc gcctcctctg aacctcaggg cgactgcctg actgctgcgg gtgacaagct 241 gcagccgctg ccccctccgc tgtctccacc gccgcgaccg cctcccttgt cccctgtccc 301 cagcggctgc ttcgagggcg gcgcaggcaa ctgcagttcg cacggtggca atggcagcga 361 ccaccctggg ggaggccgcg aattcttctt cgatcgccac ccaggagtct tcgcctatgt 421 gctcaactac taccgcacgg gcaagctgca ctgccccgcc gacgtgtgtg gaccgctctt 481 cgaggaagag ctggcattct ggggcatcga tgagaccgac gtggagccct gctgctggat 541 gacctacagg cagcaccggg acgcggagga ggccctggat atcttcgaga cacccgacct 601 catcggaggc gaccctggtg atgatgagga cctagggggc aagagactgg gcattgagga 661 tgctgcgggg ctgggaggac ccgatggcaa gtctggccgc tggaggaagc tgcagcctcg 721 catgtgggct ctctttgagg acccctattc atccagagcc gctaggttta ttgcttttgc 781 ttctctgttc ttcattttgg tttccatcac aaccttttgc ctggagacac acgaagcttt 841 caatattgtt aaaaacaaga cagagccagt catcaacggc accagcgctg ttctccagta 901 tgaaatcgaa acggatcctg ccttgacata tgtggaagga gtgtgtgtgg tgtggtttac 961 ttttgaattt ttagtccgta ttgttttctc gcccaataaa cttgagttca tcaaaaatct 1021 attgaacatc attgactttg tggccatcct ccccttctac ttagaggtgg gactcagcgg 1081 gctgtcttcc aaagcggcta aagatgtgct cggctttctc agggtggtta ggtttgtgag 1141 gatcctgaga atcttcaagc ttacccgcca tttcgtaggt ctgagagtgc tcggacacac 1201 tcttcgtgcg agcaccaatg aatttttgtt gctgatcatc tttctggctc tgggagtttt 1261 gatattcgct acgatgatct actacgctga gcgagtaggg gctcaaccta atgatccctc 1321 agcgagtgag cacacacagt tcaaaaacat ccccattggt ttctggtggg ctgtggtgac 1381 catgactacc ttaggctatg gggatatgta cccccaaaca tggtcaggga tgttggtggg 1441 ggccttgtgt gctctggctg gagtgctgac catagctatg cctgtgcccg tcattgtcaa 1501 caattttggg atgtactact ccttggcaat ggcgaagcag aaacttccaa gaaaaagaaa 1561 gaagcacatt cctcctgccc ctctggcaag ctcacctaca 
ttttgcaaga cagaattaaa 1621 catggcttgt aacagtaccc agagtgacac atgtctgggc aaagaaaacc ggcttctgga 1681 acataacaga tcagtgttat caggtgacga cagtacagga agtgagccgc cattatcacc 1741 ttccggaaag gctccccatc agacgctcta gtaccagaga caaaaacaga agaggggaaa 1801 catgtttcct gttgacgaca ggtgattaca cgtgcgcttc tgatggagga atcaggaaag 1861 ataactgcaa agatgttgtc attactggtt acacgcaagc cgaggccaga tctcttactt 1921 aatgacttgg gaaaggcaca aaacatgaaa gaaagtgttg tacagaattt atcatggatt 1981 tttgcctgct gaaaatggga cattgagatt tagccattca aggattgtac tggaaacttc 2041 tgctactgaa tgtgaccgga tgtgaccggt ttgtgttcag aagagttccc gcattctctg 2101 aggcatttaa agcttgtaaa gaaatggtgg ctggaactga aatgggcttt gccaaggaaa 2161 tattctgctt gtcgacagac cagagtccac aaaacaactg taaataccaa cgtgtgcatg 2221 ggtccaccct tctcacagtc tcatagcaca gggagccaca ttcattgtca gcatctgtga 2281 agcttcaaga aatgcccaca aggctctgaa tggctcctct cgggctcatg gtggttttgc 2341 tgcaaattac agagggcagt gaatccaatg tagtataggt ctgttttcat ttcttttatt 2401 tgtacattac gtgttgtatc tacaaacacc gattgcctca ttttcttgca aaccacttat 2461 atatccgctt gtgaactatg ttgtaaactt ttagtgtcta catagaaatc taaccatttc 2521 ttagtcatgt attcatttag ttctggaacg ctgttactat ctagaagaca gtcatcacca 2581 gtattcatcg tataagtctt ttgtcattgt tacctcaatt ataaatatta aaaatataaa 2641 ttctggcaat gagaatattt ttttattaaa tgatcaagga aaatgtcagt atatagtaga 2701 atattatcaa attatatcct aaaatgtcta ttttgcataa aagagatatt cttcaatcga 2761 ttcctttttt tcgtgagctt tgtggctaat gaagcgtgtg tttgtcttca tcactgctgt 2821 agttgaaact gtgtaagagt ttttcatctt gcttaatcaa tatttccaga atctcttagt 2881 tcccctggga ttctgaatat actctatagc ctaattataa acccctgtat cgtgtacctt 2941 ttgtgaccat ctcaaggtgc atgcctaatc gttgtgataa gccaatggac atgttactaa 3001 ctgaaatgaa gaataaaagt caaacgatct ggggataaac ttgaatccta tctgattaaa 3061 tgat // LOCUS ECOHIP 726 bp ds-DNA BCT 28-JUN-1990 DEFINITION E-coli hip gene for integration host factor beta-subunit (IHF beta). ACCESSION X04864 KEYWORDS 30S ribosomal subunit protein S1; integration host factor; integration host factor beta-subunit; rpsA gene. 
SOURCE Escherichia coli K12 DNA. ORGANISM Escherichia coli Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Facultatively anaerobic rods; Enterobacteriaceae. REFERENCE 1 (bases 1 to 597) AUTHORS Weisberg,R.A. JOURNAL Unpublished (1987) NICHD/NIH, Bldg.6 Rm.306, Bethesda MD 20892. STANDARD simple automatic REFERENCE 2 (bases 1 to 597) AUTHORS Flamm,E. and Weisberg,R.A. TITLE Primary structure of the hip gene of Escherichia coli and of its product,the beta-subunit of integration host factor JOURNAL J. Mol. Biol. 183, 117-128 (1985) STANDARD simple automatic REFERENCE 3 (bases 1 to 726; revises [1],[2]) AUTHORS Mengeritsky,G., Giladi,H., Teff,D., Goitein,D., Weisberg,R.A. and Oppenheim,A.B. TITLE Revised DNA sequence of the hip gene of E.coli JOURNAL Unpublished (1990) STANDARD simple automatic COMMENT **map:20'; EMBL features not translated to GenBank features: key from to description INVREP 495 508 inverted repeat B INVREP 510 523 inverted repeat B' TERM 495 523 put. transcription terminator [3] Author address G.Mengeritsky Dept of Mol Genetics, Hebrew University-Hadassah Medical School Jerusalem, Israel GALINA%HUJIMD.BITNET@CUNYVM.CUNY.EDU FEATURES from to/span description pept < 1 29 rpsA pept 190 474 IHF revision 455 459 gcgcc in [3]; gc in [1],[2] rpt 40 48 inverted repeat A rpt 53 61 inverted repeat A' signal 40 61 put. 
transcription terminator signal 76 81 -35 region signal 99 104 -10 region binding 103 116 IHF binding site site 475 483 Lambda-E.coli crossover point mut 335 335 g in wt; a in hip157 (gly is asp) BASE COUNT 204 a 167 c 162 g 193 t ORIGIN 1 aagctttcaa agcagctaaa ggcgagtaat tctctgactc ttcgggattt ttattccgaa 61 gtttgttgag tttacttgac agattgcagg tttcgtcctg taatcaagca ctaagggcgg 121 ctacgggccg cccttaatca atgcagcaac agcagccgct taatttgcct ttaaggaacc 181 ggaggaatca tgaccaagtc agaattgata gaaagacttg ccacccagca atcgcacatt 241 cccgccaaga cggttgaaga tgcagtaaaa gagatgctgg agcatatggc ctcgactctt 301 gcgcagggcg agcgtattga aatccgcggt ttcggcagtt tctctttgca ctaccgcgca 361 ccacgtaccg gacgtaatcc gaagactggc gataaagtag aactggaagg aaaatacgtt 421 cctcacttta aacctggtaa agaactgcgc gatcgcgcca atatttacgg ttaagttttt 481 atactaagtt ggcattataa aaaagcattg cttatcaatt tgttgcaacg aacaggtcac 541 tatcagtcaa aataaaatca ttatttgatt tcaattttgt cccactccct gcctctgtca 601 tcacgatact gtgatgccat ggtgtccgac ttatgcccga gaagatgttg agcaaactta 661 tcgcttatct gcttctcata gagtcttgca gacaaactgc gcaactcgtg aaaggtaggc 721 ggatcc // LOCUS GPIGPC3A 5077 bp ss-mRNA ROD 28-JUN-1990 DEFINITION Guinea pig complement C3 protein (GPC3) mRNA, complete cds. ACCESSION M34054 KEYWORDS complement C3. SOURCE Guinea pig (strain Hartley) liver, cDNA to mRNA. ORGANISM Cavia porcellus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Hystricomorpha; Caviidae; Caviinae. REFERENCE 1 (bases 1 to 5077) AUTHORS Auerbach,H.S., Burger,R., Dodds,A. and Colten,H.R. TITLE Molecular basis of complement C3 deficiency in guinea pigs JOURNAL J. Clin. Invest. (1990) In press STANDARD full staff_entry COMMENT Draft entry and computer-readable [or printed] sequence for [1] kindly submitted by H.S.Auerbach 07-MAY-1990. 
FEATURES from to/span description pept 28 5028 complement C3 protein (GPC3) precursor sigp 28 93 complement C3 protein signal peptide matp 94 2040 complement C3 beta chain matp 2053 5025 complement C3 alpha chain signal 5055 5060 polyA signal (put.) BASE COUNT 1174 a 1537 c 1504 g 862 t ORIGIN 1 cctgggtcgc tcggctctcc ctgcaccatg ggacccgccg caggtcccag cctgctgctg 61 ctgctgctgg ccagcgtctc cctggccctg ggggacccca tgtactcgat catcaccccc 121 aacatcctgc ggctggagaa cgaggagacc gtggtgctgg aggcccacga agtgcaaggg 181 gacatccctg tcacagtcac tgtccacgac tttccggcca agaagaatgt gctgtccagt 241 gagaagaccg tgctgaccag tgccacggga tatctaggca ctgtcaccat caagatcccc 301 gccagcaagg agttcaagtc cgataagggg cgcaagttgg tggtcgtgca ggccgccttt 361 gggggcacgc agctggagaa ggtggtgctg gtgagcctgc agagcgggta cctcttcatc 421 cagacggaca agaccatcta cacgccgggc tccaccgtgc tctaccggat cttcaccgtg 481 gacagcgacc tgctgcccgt gggccggacc atcattgtca ccatcgagac acccgacggc 541 atccccatca agcgggacac cctgtcttcc aacaaccagc atggcatctt gcccttgtcc 601 tggaacatcc ccgagttggt caacatgggg cagtggaaaa tccaggcctt ctacgagaac 661 tcgcccaagc aagtcttctc ggctgagttc gaggtgaaag aatacgtgct gcccagcttc 721 gaggtcctgg tggagcccac ggagaagttc tactacatcg acgatccgaa aggcctggag 781 gtcaacatca tagccaggtt cctctacggg aagaacgtgg acggaacggc ctttgtcatc 841 ttcggggtcc aggacggtga ccagaggatc tccctggccc agtccctcac ccgagtcgtg 901 atcgaggacg gcagcggaga ggtcgtgctg agcaggcagg tgctgctgga cggggtgcag 961 ccctccaggc ccgaggccct ggtggggaag tccctgtacg tgtctgtcac ggtcatcctg 1021 cactcaggca gcgacatggt ggaggcagag cgcagtggga tccccatcgt cacctccccc 1081 taccagatcc acttcaccaa gacgcccaag tacttcaagc cggctatgcc cttcgaaatc 1141 atggtgctgg tgacgaaccc cgacggctct ccggcccccc acgtccccgt ggtgacccag 1201 ggctccaacg tgcagtctct cacccaagct gacggcgtgg ccaggctgag catcaacacc 1261 cccaacaccc ggcagcctct gagcgtcacg gttcaaacaa agaaaggggg catcccggac 1321 gcccggcagg ccatcaacac catgcaggcc ttgccctaca ccaccatgta caactccaac 1381 aactatttac acctctccat gccgcgcacc gagctcaagc ctggcgagac catcaacgtc 1441 aacttccacc 
tgcgaagcga cccgaaccag gaagctaaga tccgctatta cacctacctg 1501 atcatgaaca aggggaagct gctgaaggtg ggtcggcagc ctcgagagcc gggccaggcc 1561 ctggtggtgc tgcccatgcc catcaccaag gagctgattc cctccttccg cctggtggcc 1621 tactacactc tcatcggtgc cagcgcgcag agggaggtgg tggccgactc tgtgtgggca 1681 gacgtcagag actcctgtgt gggcacgctg gtcgtcaaag gcggctccgg gaaagatggc 1741 caggacaagc ggcagcagca tctaccccgg cagcagatga ccctcaggat agagggcaac 1801 cagggggccc gcgtgggcct ggtggccgtg gacaagggtg tgtttgtgct gaacaagaaa 1861 cacaaactga ctcagagtaa gatctgggac gtggtggaga aagcggacat cggctgcacc 1921 ccgggcagcg ggaaggacta cgcgggcgtc ttcacggacg cggggctgtc cttcaagtcc 1981 agcaaggcgg ggctgcagac ggcccagcgg gaaggtctgg actgccccaa gcccgccgcg 2041 cgccgccgcc gctcggtgca gctcatggag aggaggatgg acaaagccgg gaagtacaag 2101 agcaaggagc tgcgcaggtg ctgcgaggac ggcatgcgcg aaaaccccat gcagttttcc 2161 tgccagcgcc gcgcgcgcta cgtgagcctg ggcgaggcgt gcgtcaaggc cttcctggac 2221 tgctgcacct acatggccca gctgcgccag cagcaccggc gcgagcagaa cctgggcctg 2281 gcccggagtg acatggatga agacattata ccagaagagg acatcatttc cagaagccag 2341 tttccagaga gctggctgtg gaccatagag gaactgaagg agccagagag aaacggcatc 2401 tccaccaaga ccatgaacat cttcctcaaa gacagcatca ctacttggga gattctggcc 2461 gtgagcttgt cagataagaa aggcatctgc gtggcggacc ccttcgaggt cacggtgatg 2521 caggatttct tcatcgacct gcggctaccc tactccgtgg tacgcaacga gcaggtggag 2581 atccgcgcgg tcctctacaa ctaccgcgag gcccagagtc tcaaggtgag ggtggagctg 2641 ctgcacaacc ccgccttctg cagcctggcc accgccaaga agcgccacac ccagacggtg 2701 accatcggac ccaagtcctc tgtggcggtg ccctacgtcc tggtgcccct gaagatcggc 2761 ctgcaggagg tagaggtcaa agctgccgtc tacaactact tcatcagcga cggcgtgaag 2821 aagaccctca aggtcgtgcc cgaagggatg agagtcaaca aaacggtggc cattcgcact 2881 ctgaacccag agcagctggg ccaaggcggc gttcagaggg aggaaatccc agctgcagac 2941 ttaagcgacc aagtcccaga caccgattct gagaccaaga ttctcctcca agggaccccg 3001 gtggcccaga tggccgagga cgctgtggac gcggagcggc tgaagcacct gatcatcacg 3061 ccctcgggct gcggagagca gaacatgatc ggcatgacgc ccaccgtgat cgcggtgcac 3121 tacctggacc agacggagca 
gtgggagaag ttcggcctgg agaagcggca ggaggcgctg 3181 aacctcatca acagggggta cacacaacag ctggctttca aacagcccaa ctgggcctac 3241 gcggctttca agaaccgggc gtccagcact tggctgaccg cctacgtggt gaaggtcttc 3301 tcgctggcgg ccaatctcat tggcatcgac tcggaggtcc tctgcggggc ggtgaaatgg 3361 ctgatcctgg agaagcagaa gcccgacgga gtcttccagg aggacgggcc cgtgatacac 3421 caagagatga ttggtggtgt ccggaccgcc caggaggctg atgtgtccct cacggccttc 3481 gtgctcattg cattgcaaga ggctaaagat atctgcagag cacaggtcaa caacctggaa 3541 gcgaacatca acaaagctgg cgattacatc gagtcaagat acgcagacgt gaggagaccg 3601 tacacgctgg ccatcgctgg ctacgccttg gccctgctgg agaggctgaa cggcgcaacc 3661 ctccaaaaat ttctaaacgc agctacagag aagaaccgct gggaggaggc ccgtcagaag 3721 ctgtacagcg tggaagccac gtcctacgcc ctcctggccc tgctcctgct gaaggacttc 3781 gacgcggtgc ctcccgtcgt gcgctggctc aacgaacaga gatactacgg caggggctat 3841 ggctccaccc aggccacctt catggtgttc caagccttgg cccagtacca gacagatgtc 3901 cccgatcaca aggacctgaa catggaggtg gccctccagc tgcccagccg cagctccccg 3961 tccaaatttc gcctcgtctg ggaagccggt agcctcctgc ggtcagaagc gaccaagcaa 4021 aacgagggct tcaaactgac agctaaaggg aagggccagg gcacgttgtc ggtggtggct 4081 gtgtactatg ccaagacgaa acgtaaagtt gtctgcaaga acttcgacct cagggtcact 4141 ctgaagccgg cccccgatac agtcaagaag cctcaggagg ccaagagcac catgatcctt 4201 ggcatctgta ccaggtacct gggagaccag gatgccacta tgtccatcct ggacatatcc 4261 atgatgaccg gtttcattcc tgacacagat gacctcaagc tgctggccac cggcgtggac 4321 agatacatct ctaagtacga gatgaacaaa gacttcagca agaacaccct catcatctac 4381 ctggacaagg tctcacactc cgaggaagaa tgtctgtcct tcaaaatcca ccagttcttc 4441 aatgtggggc tcatccagcc tggctccgtc aaggtctact cctactataa cctagatgaa 4501 acatgtaccc agttctacca cccggagaag gaggacggga tgctgaacaa actctgccac 4561 aaggacttgt gccgttgcgc agaggaaaac tgcttcatac agctgccgga gaagatcacc 4621 ctggatgagc ggctggagaa ggcctgcgag cccggggtgg actacgtgta caagaccaag 4681 ctgctcaaga tggagctgtc ggatgacttc gatgagtaca tcatgacgat cgagcaggtc 4741 atcaagtcag gctccgatga ggtgcaggcc gggaaggagc gcagattcat cagccacatc 4801 aagtgcagag acgccctgca cctgaaggag 
gggaagcact acctcatgtg gggcctctcc 4861 tcggacctgt ggggggagag acccaacatg agttacatca tcgggaagga cacctgggtg 4921 gaagcgtggc ccgaggcgga agaatgccaa gacgaggaga accagcagca atgccaggac 4981 ctgggcactt tcacagagaa catggtggtt tttggctgcc ccaactgacc acgcccgtca 5041 gtcatcctcc ctccaataaa gcttcgggtc tatggca // LOCUS HUMPP14B 8076 bp ds-DNA PRI 28-JUN-1990 DEFINITION Human placental protein 14 (PP14) gene, complete cds. ACCESSION M34046 KEYWORDS placental protein 14. SOURCE Human (cell line GM1416) DNA, clone PP14G. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 8076) AUTHORS Vaisse,C., Atger,M., Potier,B. and Milgrom,E. TITLE Human placental protein 14 gene: Sequence and characterization of a short duplication JOURNAL DNA (1990) In press STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by C.Vaisse, 04-MAY-1990. FEATURES from to/span description pept 3019 3114 placental protein 14 precursor (PP14), exon 1 3510 3649 placental protein 14 precursor (PP14), exon 2 4036 4109 placental protein 14 precursor (PP14), exon 3 5482 5592 placental protein 14 precursor (PP14), exon 4 6644 6748 placental protein 14 precursor (PP14), exon 5 7019 7035 placental protein 14 precursor (PP14), exon 6 sigp 3019 3072 placental protein 14 signal peptide matp 3073 3114 placental protein 14, exon 1 3510 3649 placental protein 14, exon 2 4036 4109 placental protein 14, exon 3 5482 5592 placental protein 14, exon 4 6644 6748 placental protein 14, exon 5 7019 7032 placental protein 14, exon 6 pre-msg 2971 8010 PP14 mRNA and introns IVS 3115 3509 PP14 intron A IVS 3650 4035 PP14 intron B IVS 4110 5481 PP14 intron C IVS 5593 6643 PP14 intron D IVS 6749 7018 PP14 intron E IVS 7036 7786 PP14 intron F signal 2891 2899 CAAT box signal 2942 2948 TATA box signal 7992 7997 polyA signal BASE COUNT 1636 a 2226 c 2203 g 2011 t ORIGIN 1 bp upstream 
of EcoRI site. 1 gaattcttta cctagcccta cgtcctgaag attttctctc acgctttctt ctaaaagttg 61 tatagtttta tgttttactt ttaaactatg agttaatgca tatgtcaggt gtgagtttta 121 gatggaggtt cttcgtttgc atgggatcga ttgcacgtga tgaattgctc cagcaccatt 181 tgttgcaaag actgtccttc ttttagaggg actcccgctt gccaggcctc tggtttaatg 241 aaacatgacc agagtgactc catcttaacg tgaataacta gacactcaca aggcacctat 301 aaggttatat aacgaggcta tgctgctcga tactgactac gacaatttcc tgtttctctg 361 cctacaggac accaccaccc ccatccagag catgatgtgc cagtacctgg gtgggtctca 421 cagcacatga gctcaacgtg ggtgagaggc agcagctact tccatggctg ggaaccctgg 481 ggagctgaca actggcttcc tgtccacctc agtgcctgtg ggctggtttt ttctttcttg 541 gttttttttt tatttgtttg tttgctttgt tttttttgag acagggtctc attctgtcac 601 tcaggctgga gtgcagtggc gtgatctcgg ctcactacaa cctccacctc ccagattcaa 661 gcgattctcg tgcctcagct tcccgagtag ctgggattac aggcgcgagc ttatggtctg 721 aaaatacccg catactaagc tgaccaccaa ttataactgc agaacattta tgcccatacg 781 aggcatctcc caccaagcct ggagaatgta ccgatgacct gggagtgcag ggggttatct 841 ttgctcacaa ataacgtcaa cgagtaggct gaggctgaag ggcaaatggt cattgatcac 901 actaggagcc cctatcttta gcgagtgcat ctgcatgatc caggtttcac tgtagctcat 961 tacagcttct tacaaacaga ggcactcaca gaggacgggc gttcctcctg ctcgctgagg 1021 ttgcccggct ctggcacaga gtcatttccc ataaacttgc tttcactgtg ctctgtgagt 1081 caccttgaat tctttcccgt gtgagatcta agaacccatt cttggggtct ggactgggac 1141 cctcttttcc gacaacactt cccccacgca ctgttcttgc agctttgtta aaggtctgtt 1201 gttgttgtgt gtgagtccat gtttggactc tctatttccc attgatctat gtgtccattt 1261 ctctgccaat accacaatct cttggtgact attgataata gtgcttgtaa acttgggtag 1321 actaactctt tatactttat tcttcttcaa aatggcttta gcaattgtag ctcctttgcc 1381 ttgacatgta aattttagaa aaaaaaattt ctctatatat ctacatatgg tggggtgtgg 1441 tggctcacac ctataattcc agcactttgg gaggccgagg tgagcagatc acttgaggtc 1501 agcagttcga gaccagcttg gccaaaatag tgaaaacccc tctctactaa aaagacaaac 1561 attagccagg tgtggtggtg ggcgcctata atcccagcta ctctggaggc tgaagcagca 1621 ggattacttg aacctgggat gcagaggttg cagtgagtca agatcgtacc actgcactcc 1681 accctgggca 
acagagcaag acttggcctc aaaaaaatat atatatgtat atatgtgtgt 1741 gtgtgtgtgt atacacacac acacacacac atgtctatag ctacctacaa aatatcttgc 1801 tgagaatttg ataggaattg cattaaatct ctaggttgag ttgaggagag ctaacacctt 1861 tgctgtgttg aagcttccag cccatgaaca cattgtgcct gtcctttacc ttaggtcttt 1921 gatttccctt ccttgggttc agagtttaag tcctgtccat atttggtgag atttttgcct 1981 acatgctgtg tgtgtgatta caaatcctga atttccagtt tcgggttctg tctactcctt 2041 gacactgtgt gggaacacgg tggatttttg tgcgattgac cttgctgacc tcactcggtt 2101 ctaggagggg ttttgtagag ttcttgcaat tttgtacctg ggcaatgatg ttatatgcga 2161 atagagacag gtatacctcc ccctctccaa tctgcctgcc tttagttacc atttcttgcc 2221 tagctgctct ggccggaatg acctgctctg atggatgggg gagtgggagt ggacacccgt 2281 gtcttgcctc aggactcaag tgctctccag ggagtgagat gcgggctcga ttgtaagtta 2341 taaagtgatg cttccctgtg ggaaaaagta catccaatat agactgtagg acaaagtctg 2401 aaagtccact gcctcccatt tccacccagt cttgcctgtg agtcagagag aaacagtgct 2461 aacaggcagt gtgcacctgt ccagaggctg gctgtttgag ggacacaggt gtgcacacct 2521 ggggatgtct gagtggaagg tacaggtcag gattatgacc gtgcagtcag tcacccactg 2581 gcatgatgtc tgtggcatcc tggggcagcc atggggctct caggccctct gcctgcccca 2641 caggccatac ccctgccctg gacacagctg tcctcagtgc tggcctctga cccaacattg 2701 tccaggagcc ccaacccaga aggtgctccc gccgctgcca gcctggaccc gacccaggcc 2761 cctcccgcct gaggccctgc caagaactgc ccagcccgga cacagaggag gttccgcgtg 2821 gacgcaggga agagcctccc attgccccag tggaggaagc tgcccagggg ccaaggatga 2881 gtcacaggtt cgaggaatca catggcgagg ctgtgggcgg ggatcttgtc tgccctcctc 2941 ctacataagg ccccctgagc ccacactgcc tcagcatccc tctggctcca gagctcagag 3001 ccacccacag ccgcagccat gctgtgcctc ctgctcaccc tgggcgtggc cctggtctgt 3061 ggtgtcccgg ccatggacat cccccagacc aagcaggacc tggagctccc aaaggtttga 3121 ggctggggga gcgggcactt tactgtggga ggcctggggc gggtgggagc tgcgggcagg 3181 cgggaagcca ggatctcaga aacctacagg aagcacagaa tggacgccat gacgtcagga 3241 agccctcagc cctgctctcc atctttaggg tggcctctct ggtttcccag catcctaggt 3301 gactcattat ttggactttg gaacactcct gagttagcac acactggtca ttttaagtac 3361 aggaaatttc atagcccagg 
atctggtaga tagcagacaa ccatccaatg ctcactgtac 3421 ccatcccagt tagactcagc cccgtctgca ccgggtgcaa cgagagccat ggtggggtgg 3481 gaccgccgtg cagcccaagg ccccctcagt tggcagggac ctggcactcc atggccatgg 3541 cgaccaacaa catctccctc atggcgacac tgaaggcccc tctgagggtc cacatcacct 3601 cactgttgcc cacccccgag gacaacctgg agatcgttct gcacagatgg tgggtttctc 3661 atcattgaga cgggctgggc gggggctcag tctcccccct caggggtcca ggactgggtg 3721 ggttgggcgg agctggactt agccccaggc attttctgac agccaggggc ttcactgtgg 3781 cccttccatg agggtggggt ggaaaaccag ggctccagac gttccctgtc cccttggatc 3841 ccctgcccca ggctctgggc caacagccaa ccacacagtg cagccccagg tcagactgag 3901 gagaaggtct gggcggctgc gggctgcggt gctccttgga cccggggaag ttcccgtggt 3961 gacctgattt taggagtgac agtgaaggca actccaattc aagtggccac tcatcctatt 4021 gtcaccacct ttcagggaga acaacagctg tgttgagaag aaggtccttg gagagaagac 4081 tgagaatcca aagaagttca agatcaactg tgagtgtccc caggccccaa gggctggctc 4141 agtgctggca tgctagccac gctctcccag aggcggctct gctggggcat gagggagtgg 4201 ggcctggcct gtccccactc tctctgcttc agggagtcag agtgtttact ccggtcaacc 4261 tgatgctgac cccagaggca tcttttacct ggagggcagg ggaagcacta attcttggca 4321 tgacatgact ggatgtgggt ctgcactgtg cccaggccaa ggggacaggt gctttgttgc 4381 actgttcact ctggcctcac aaaaggccag ggaggctgca ggcgagcagg tgggcaggtg 4441 ggcaggtggg taggtgggta ggtggatatg tatacaggtg gcaggagggt aggtgaacag 4501 gtgggtaggt gggcaggtgg ctaggtgagt aagtggttag gtgaacaggt gggcaggtga 4561 gcaggtggtt aggtgaacag gtgggcaggt ggataggtga acaggtgggc aggtgggtag 4621 gtgggtaggt atacaggtgg acaggtgggt aggtggacag gtgggcaggt gagtaggcga 4681 acaggtgggt gggtgaacag gtggccaggt gaacaagttg gtaggtgggc aggtgggtag 4741 gtgggtaggt gaacaggtgg gcaggtgggc aggtgggcag gtgggcaagt ggctgctgtt 4801 cccgtgggcc tggctgcctc ctgcgcactc tggggctgca gctctggtct taggctgagc 4861 tcccaggcct ctctggggga agagagaggg gcttacagca tgtccttggt ccactgaatt 4921 cttcctaaca atttgcaaca ttttgttcta ttttgttaat tattattttt ttaaaaagac 4981 agaggtggtc agggtctggg gcctcttatc ccctcatggg cacattttcc cagcaaatac 5041 agtttgcttc tcatgcttgg gacttgcctc 
aggcctttct gaccctgctt gccctcccca 5101 gaatcgagcc actctccaag gtccatttct tctccctcct cccgcccctg tgccctgttc 5161 ctgtgccatc tcccgccatc ctcacccgta cgtgacttct cagttggagt ctctccaggt 5221 cacagcctcc ctgcctgccg tgtctgcctc tccacggcac acctggcctc tcgccctcag 5281 ccggggctcc atggccctcc acattgcctc tcctcccctt tcctccctgg cttccctgat 5341 catggtccac agcaggggcc acgtcccatg gtgtcagtga tgaggaagcc acttagtgtg 5401 gtgggatgtc cacacacctg cacaggactc tgctgagacg gaggcttcat cttccttttg 5461 gttcttctct tctttcccca gatacggtgg cgaacgaggc cacgctgctc gatactgact 5521 acgacaattt cctgtttctc tgcctacagg acaccaccac ccccatccag agcatgatgt 5581 gccagtacct gggtgggtct cacagcacat gagctcaacg tgggtgagag gcagcagcta 5641 cgtccatggc tgggaaccct ggggagctga caactggctt cctgtccacc tcggtgcctg 5701 tgggctgact ttttctttct tggttttttt tatttgtttg ttgtttgctt tgtttttttg 5761 agaaagggtc tcattctgtc actcaggctg gagtgtagtg acgtgatctc ggctcactgc 5821 aacttctgcc tcccagattc aagcgattct cgtgcctcag cctccagagt agctgggatt 5881 acaggcgcac gccaccatgc ccagctaatt tttgtatttt ttggtagaga cggggtttca 5941 ccatgttggc caggctggtc tccaactcct ggatcaagtg atccacccgc cttggcctcc 6001 caaaggctgg gattacaggt catccaccac gcccggccag gctgagtttt tctccagcgg 6061 ttcatcgagt cctctgacaa agcaaggagc tgatataggg ccagtgggac ggtcgccagt 6121 caaggggctg ggcttggtgg atagattaat actcactggg cgtccagtca aaacgccctg 6181 aaacctatga tgctgtcaac caaacgaagg ccaggaatac caaaatagcc acataggcac 6241 agcccttccc catgtttctg agcacagtgt ttcctctggg gtcacacagg tgtcttcttg 6301 atcagcctca gccatgcttg gtgagagccg ggcactggga gagccaggca ctgtgctctc 6361 ctgtgacgct gtagacacca tcctaagctg tgcagacccc agcgctgccc agagcggagc 6421 agagggggcc gggcaaggag tgggagctgg ggtcagggaa cctggaggtg cagtggacag 6481 agccccggag accgccctag ggacctactc cagaccaaac tctgccagac ctcggagcac 6541 tggggcctcc ttctctgccc tccctcctca ggcaaggcct ctggagctcc ccagctctca 6601 tggaagcccc aggggcccag gactgaccca gcctcttcca cagccagagt cctggtggag 6661 gacgatgaga tcatgcaggg attcatcagg gctttcaggc ccctgcccag gcacctatgg 6721 tacttgctgg acttgaaaca gatggaaggt gagctctgcc 
taggacacgc ccagcctcag 6781 ctggaggaga agctgcctct ttcttagccc gagccccctg ctggctctgc aggactcagg 6841 tcactccttt ttggcccctc ccctgttctc ccctggcctt ctggggtgca gagccaccct 6901 gaggtggggt cctgccctct cccaccatcc tttcatccct tctctagccc tggggctgct 6961 gtgtccccag ctgtctcttc tctcgctgac acctccactg tcccatctcc tcccacagag 7021 ccgtgccgtt tctaggtgag ctcctgcctg gtcctgcctc ctgggtaatg tatcagcctc 7081 gcccactgtc tgcggctgcc tctctgggcc cctgggacag accctactgt gtccagttca 7141 gggctgaccc tacaggaatg aactggggtc tggtcttgtg attccagaaa gccaggctgc 7201 tgacgtcccc attcacgagc ccagcctgtg tcttgcagcc attgtattag tcacgggctt 7261 gtgccctata gtcagacctc atgctttctt ttggggttag gggtgttggt tggaaatggt 7321 gggggctata ggaggaggaa ggaggatggt tacatggaag ggcatgagaa gctggggacc 7381 tgcaggtctc ggtcccacgt tctttttttt ttttcttttt ttaagatgga gtctcgctct 7441 gtcaccaggc tggagtgcag tggcacaatc tcagctcact gcaacctcga cctcctgggt 7501 tcaagcgatt ctcctgcctc aaccccccga gttgctggaa ccacaggcgt gtgccaccat 7561 gcccagctaa tttttgtatt tttaatagaa acggggtttc accatgttgg ccacgatggt 7621 ctcaatctct tgacctcatg atccccccgc tttagcctcc caaagtgctg ggatttcagt 7681 gccacattct taagggggtg tgctcaagcc caccacatcc ttccagggct cccccgaaac 7741 accctgctct tcctccctct acttaagtga cctgtaaacc caacagctca cctccgcctc 7801 caggaagacc agactcccac ccttccacac ctccagagca gtgggacttc ctcctgccct 7861 ttcaaagaat aaccacagct cagaagacga tgacgtggtc atctgtgtcg ccatcccctt 7921 cctgctgcac acctgcacca cggccatggg gaggctgctc cctgggggca gagtctctgg 7981 cagaggttat taataaaccc ttggagcatg tcctgtctgg atgcgcagcc actgctgggt 8041 gtgggattca gggacgaggg cctggggtcg gggcag // LOCUS PHAMP1A 1313 bp ss-mRNA PLN 28-JUN-1990 DEFINITION P.chrysosporium Mn-dependent peroxidase (MP-1) mRNA, complete cds. ACCESSION J04980 KEYWORDS Mn-dependent peroxidase. SOURCE P.chrysosporium (strain BKM-F-1767), cDNA to mRNA, clone lambda- MP-1. ORGANISM Phanerochaete chrysosporium Eukaryota; Plantae; Thallobionta; Basidiomycotina; Hymenomycetes; Agaricales; Corticiaceae. 
REFERENCE 1 (bases 1 to 1313) AUTHORS Pease,E.A., Andrawis,A. and Tien,M. TITLE Manganese-dependent peroxidase from Phanerochaete chrysosporium: Primary structure deduced from cDNA sequence JOURNAL J. Biol. Chem. 264, 13531-13535 (1989) STANDARD full staff_review COMMENT Draft entry and printed sequence for [1] kindly submitted by E.A.Pease, 21-JUN-1989, for release after publication. FEATURES from to/span description pept 36 1184 Mn-dependent peroxidase precursor (MP-1) sigp 36 72 Mn-dependent peroxidase signal peptide matp 73 1181 Mn-dependent peroxidase BASE COUNT 251 a 462 c 334 g 266 t ORIGIN 1 tcagctctca aggacatccg cactcgaata tcgcaatggc cttcggttct ctcctcgcct 61 tcgtggctct cgccgccata actcgcgccg ccccgactgc ggagtctgca gtctgtccag 121 acggtacccg cgtcaccaac gcggcgtgct gcgctttcat tccgctcgca caggatttgc 181 aagagactct gttccagggt gactgtggcg aagatgccca cgaagtcatc cgtctgacct 241 tccacgacgc tattgcaatc tcccagagcc taggtcctca ggctggcggc ggtgctgacg 301 gctccatgct gcacttcccg acaatcgagc ccaacttctc cgccaacagc ggcatcgatg 361 actccgtcaa caacttgctt cccttcatgc agaaacacga caccatcagt gccgccgatc 421 ttgtacagtt cgccggtgcg gtcgcgctga gcaactgccc aggtgctcct cgcctcgagt 481 tcatggctgg acgtccgaac actaccatcc ccgcagttga gggcctcatt cctgagcctc 541 aagacagcgt caccaaaatc ctgcagcgct tcgaggacgc cggcaacttc tcgccgttcg 601 aggtcgtctc gctcctggct tcacacaccg ttgctcgtgc ggacaaggtc gacgagacca 661 tcgatgctgc gcccttcgac tcgacaccct tcaccttcga cacccaggtg ttcctcgagg 721 tcctgctcaa gggcacaggc ttcccgggct cgaacaacaa caccggcgag gtgatgtcgc 781 cgctcccact cggcagcggc agcgacacgg gcgagatgcg cctgcagtcc gactttgcgc 841 tcgcgcgcga cgagcgcacg gcgtgcttct ggcagtcgtt cgtcaacgag caggagttca 901 tggcggcgag cttcaaggcc gcgatggcga agcttgcgat cctcggccac agccgcagca 961 gcctcattga ctgcagcgac gtcgtccccg tcccgaagcc cgccgtcaac aagcccgcga 1021 cgttccccgc gacgaagggc cccaaggacc tcgacacgct cacgtgcaag gccctcaagt 1081 tcccgacgct gacctctgac cccggtgcta ccgagaccct catcccccac tgctccaacg 1141 ggggcatgtc ctgccctggt gttcagttcg atggccctgc ctaaaccact 
cacctccggc 1201 aatgcacctt tagtagatgt cgattctatt tggttgatta gaagtactgg gttctcgtgt 1261 catataccac caagttttca gaatgcaagt ctcctttctt tccaaaaaaa aaa // LOCUS PEAHSP177A 772 bp ss-mRNA PLN 28-JUN-1990 DEFINITION Pisum sativum 17.7 kDa heat shock protein (hsp17.7) mRNA, complete cds. ACCESSION M33901 KEYWORDS heat shock protein. SOURCE P.sativum (cv Little Marvel) leaf, cDNA to mRNA. ORGANISM Pisum sativum Eukaryota; Plantae; Embryobionta; Magnoliophyta; Magnoliopsida; Rosidae; Rosales; Fabaceae. REFERENCE 1 (bases 1 to 772) AUTHORS Vierling,E. and Helm,K. JOURNAL Unpublished (1990) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by E.Vierling, 01-MAY-1990. University of Arizona Department of Biochemistry Biological Sciences West Building Tucson, AZ 85721 FEATURES from to/span description pept 65 523 17.7 kDa heat shock protein (hsp17.7) BASE COUNT 254 a 127 c 170 g 221 t ORIGIN 1 caaaaatcaa aacgtgcgac aaacacaaaa tcatcccaca aagaaagcaa tggatttcag 61 gctaatggat ttggattctc cactcttcaa cactctccat catataatgg acctcaccga 121 cgacacaacc gagaagaact taaacgctcc aactcgaaca tatgtccgtg acgcaaaggc 181 aatggctgca actccagcgg acgtgaaaga gcatccaaat tcatacgtgt ttatggtgga 241 catgcctggg gtgaaatctg gtgacataaa ggttcaggtg gaagatgaga atgtgctatt 301 gataagtggc gagaggaaga gagaagaaga gaaagaaggt gttaaatatt tgaagatgga 361 aagaaggatt ggtaagttga tgaggaaatt tgtgttacct gagaatgcga atattgaagc 421 tatctctgct atttctcaag atggtgttct tacggttaca gttaataaat tgcctccacc 481 tgaacctaag aaaccaaaaa ctattcaagt taaggttgct tgatcggtgt acgatttcat 541 gtcaacaaat cagaaggaat gtttgtcttt ttagttggtt tgtgtagcaa tggttttgtg 601 tgttttcgcc tagttggccc tatatatgat gatcatcatg cgatgtaatt tgtaacaata 661 tgacatgaat gaattttaat tacttggttt ttctgcttgt aacattgttg cgttgccccc 721 atgataaaat tgagaaactg aagtattaaa gaaaagaaaa tgtttcattt ac // LOCUS PEAHSP179A 700 bp ss-mRNA PLN 28-JUN-1990 DEFINITION Pisum sativum 17.9 kDa heat shock protein (hsp17.9) mRNA, complete cds. 
ACCESSION M33900 KEYWORDS heat shock protein. SOURCE P.sativum (cv Little Marvel) leaf, cDNA to mRNA. ORGANISM Pisum sativum Eukaryota; Plantae; Embryobionta; Magnoliophyta; Magnoliopsida; Rosidae; Rosales; Fabaceae. REFERENCE 1 (bases 1 to 700) AUTHORS Vierling,E. and Helm,K. JOURNAL Unpublished (1990) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by E.Vierling, 01-MAY-1990. University of Arizona Department of Biochemistry Biological Sciences West Building Tucson, AZ 85721 FEATURES from to/span description pept < 1 469 17.9 kDa heat shock protein (hsp17.9) BASE COUNT 209 a 123 c 170 g 198 t ORIGIN 1 gataattcca agagtcttcg gtactggacg aagaaccaat gcattcgatc cattctcatt 61 agatttatgg gacccattcc agaacttcca actcgcaaga tccgccaccg gaaccaccaa 121 cgagacggca gcttttgcca acgctcacat tgactggaag gaaacaccgg aggctcacgt 181 gttcaaggct gatcttcccg gagtgaagaa ggaagaagtg aaagttgaaa tagaagaaga 241 tcgtgtgctc aagataagcg gagagaggaa aactgaaaag gaagacaaga acgacacctg 301 gcaccgtgtt gagcgtagtc aggggagttt cctccgccgt ttcaggttgc cggaaaatgc 361 taaagttgat caggtgaagg ctgctatgga aaacggtgtt cttaccgtta ctgttcctaa 421 agaggaggtt aagaagcctg aagctaagcc cattcagatt acaggatgag ctcttattct 481 tcctatattt tgatgtttgt gtctcttaat aaaatgttaa aataaaacaa ataataattg 541 tgtgtagtcg agttccagct ttaagagatt gagacatgta tggacttggc tattacttaa 601 gtgtagtagt ttgtgagtat tttgttgggt tatgttagtg tgtatgcaaa taactttttt 661 gagtatgtga aagtttcttt tgattaagct gtatttatcc // LOCUS PEAHSP181A 862 bp ss-mRNA PLN 28-JUN-1990 DEFINITION Pisum sativum 18.1 kDa heat shock protein (hsp18.1) mRNA, complete cds. ACCESSION M33899 KEYWORDS heat shock protein. SOURCE P.sativum (cv Little Marvel) leaf, cDNA to mRNA. ORGANISM Pisum sativum Eukaryota; Plantae; Embryobionta; Magnoliophyta; Magnoliopsida; Rosidae; Rosales; Fabaceae. REFERENCE 1 (bases 1 to 862) AUTHORS Vierling,E. and DeRocher,A.
JOURNAL Unpublished (1990) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by E.Vierling, 01-MAY-1990. University of Arizona Department of Biochemistry Biological Sciences West Building Tucson, AZ 85721 FEATURES from to/span description pept 12 488 18.1 kDa heat shock protein (hsp18.1) BASE COUNT 278 a 128 c 207 g 249 t ORIGIN 1 ctatatcaaa catgtctctg attccaagtt tctttagtgg ccgaaggagc aatgttttcg 61 atcctttctc cctggacgtc tgggatcctt tgaaggactt tccattttca aattcttcac 121 cttccgcttc attccctcgt gagaatcctg cttttgtgag cacacgagtt gactggaagg 181 aaacaccgga agcgcatgtt ttcaaggctg atcttcctgg gctgaaaaag gaggaagtga 241 aagttgaagt tgaagatgat agggttctac agataagcgg agagagaagc gttgagaaag 301 aagataagaa tgatgaatgg catcgcgtgg aacgtagcag tggaaagttc ttaagaaggt 361 tcagattgcc tgagaatgct aaaatggata aagtgaaagc ttccatggag aacggcgttc 421 tgacagtgac cgttccaaaa gaagagataa agaaggctga ggttaagtct attgagattt 481 ctggttaaac ttagaatgag ctatgttact ctgttgcttt tcttggttat aatgttttcc 541 tttttgtggc gtgtgcaaga aataaatggt catgtaattc tgaaatgtta atgtataaat 601 aaataagtaa acagttgttg ttggttattc agaggtgtta tagtattcat attgtaatgt 661 atcagaatga atcttgagaa aagagctgct ataaatagag cttgaagttt taaataaaaa 721 aaaaggttcc agaaaggaat aaaaaactgg taacagctag cagagagaaa aagctcaaac 781 cactgtgtta aggtgaacag cggaagaaaa tgaagagatg ttcatagccc ttcttcttga 841 gtctctccaa gatggagaat tc // LOCUS PEAHSP227A 795 bp ss-mRNA PLN 28-JUN-1990 DEFINITION Pisum sativum 22.7 kDa heat shock protein (hsp22.7) mRNA, complete cds. ACCESSION M33898 KEYWORDS heat shock protein. SOURCE P.sativum (cv Little Marvel) leaf, cDNA to mRNA. ORGANISM Pisum sativum Eukaryota; Plantae; Embryobionta; Magnoliophyta; Magnoliopsida; Rosidae; Rosales; Fabaceae. REFERENCE 1 (bases 1 to 795) AUTHORS Vierling,E. and Helm,K. JOURNAL Unpublished (1990) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by E.Vierling, 01-MAY-1990.
University of Arizona Department of Biochemistry Biological Sciences West Building Tucson, AZ 85721 FEATURES from to/span description pept 44 637 22.7 kDa heat shock protein (hsp22.7) BASE COUNT 261 a 122 c 170 g 242 t ORIGIN 1 ccaagttcca aacctcaaga acaaaaaaca cacatttcta agtatgagtc tgaaacctct 61 aaacatgtta ctcgttccat ttcttctgct tattctcgcg gctgattttc ctttgaaagc 121 aaaagcatca ctactaccat tcatagattc tcccaacact ctcttatcgg atctctggtc 181 tgatcgtttc ccagatccgt ttcgcgtctt agaacaaatt ccctatggag ttgagaaaca 241 cgaaccatcc ataacattgt cacatgctag agtagactgg aaggaaactc cagagggaca 301 tgtgataatg gtggacgtgc ctgggttgaa aaaagatgat ataaagatag aagtggaaga 361 gaatagggtg ctaagagtga gtggtgagag gaagaaagaa gaagataaaa aaggagatca 421 ttggcacaga gttgaaagat cttatggaaa gttctggagg cagtttaaat tacctcaaaa 481 tgttgatttg gattctgtca aagctaaaat ggaaaacggt gttcttactt taactcttca 541 taagttgtcg catgataaga ttaaaggtcc tagaatggtt agtattgtgg aagaggatga 601 caaaccatct aagatcgtca atgatgagtt gaaataatta tgtgatttgt actcataaaa 661 atgaaaaatg ttttttcatt gtgttatttg tgaataaagg aatgttacct atgatattgg 721 ttgtttgttg tatgtcaact aaagagtgct gtaaaggctt gttaatttca tagtgaataa 781 cttgttggct tttgt // LOCUS LACNISC 1222 bp ds-DNA BCT 28-JUN-1990 DEFINITION L.lactis insertion sequence IS981. ACCESSION M33933 KEYWORDS insertion sequence. SOURCE L.lactis (strain LM0230) DNA, clone pKM40. ORGANISM Lactococcus lactis Prokaryota; Bacteria; Firmicutes; Regular asporogenous rods; Lactobacillaceae. REFERENCE 1 (bases 1 to 1222) AUTHORS Polzin,K.M. and McKay,L.L. TITLE Identification, DNA sequence and distribution of IS981, a new, high copy number insertion sequence in the lactococci related to the IS2/IS3 family of IS-elements JOURNAL Unpublished (1990) STANDARD full staff_entry COMMENT Draft entry and computer-readable [or printed] sequence for [1] kindly submitted by K.M.Polzin 27-APR-1990. University of Minnesota Dept. of Food Science 1334 Eckles Avenue, St. 
Paul, MN 55108 e-mail: kayla@molbio.cbs.umn.edu FEATURES from to/span description pept 58 318 ORF1 pept 330 1169 ORF2 binding 44 50 ribosomal binding site (put.) rpt 1 40 5' inverted repeat rpt 1183 1222 3' inverted repeat signal 33 38 -10 region signal 269 274 -35 region signal 294 299 -10 region BASE COUNT 442 a 233 c 206 g 341 t ORIGIN 132 bp upstream of PvuII site. 1 gatgttatcc ttaaatctta gagtcactat tgtataattt agacaaagga caaaaacatg 61 caaaaacgct actcaaaaga atttaaagaa acccttatcg ccttctatca ttctggtcaa 121 tccgtcaccc agctgtctaa agaatatgac gtggcccctg caacaattta taaatggata 181 gacctctact ctaaatctaa tgaaagctcc gtctctaaag ctgattttct agaattaaaa 241 agacaactgg ctaaagttaa ggaagaacga gacatcttaa aaaaagtatt gaccatattc 301 gccgagaaaa agaagtgagt gctgcggata tggctcaaac catacaaact ttagcactca 361 atgtcagact aagctgtcaa ctccttgatg ttcctgaatc aagttattat gaacggatta 421 accgacaccc atctaaaact caattaagga gacaatacct gtcactcaaa atttctcaac 481 tcttcaatgc taaccgagga atctatggtg ctcctaaaat tcatcatctt ctacttaaac 541 aaggggaaaa agtcgggtta aaactggtac agaagctaat gaagcaactt caactcaagt 601 ctgtagtcat taagaaattt aagcctggat actcactaag tgatcacatc aatcgaaaaa 661 atctcataca gactgaacct acaaagaaaa ataaggtttg gtcaaccgac attacttata 721 ttcctactca acaaggatgg gcttatctct caaccattat ggatcgttat actaaaaaag 781 tcattgcttg ggatttgggc aagcgaatga ctgtagaatt agtgcaaaga actttaaata 841 aggccattaa atcacaagac tatccagaag ctgttattct tcattctgac caaggaagcc 901 agtatacgag tctagagtat gaagagttgc ttaagtatta tgggatgact cactctttca 961 gtcgaagggg atacccttat cataatgcca gtcttgaatc ttggcatgga catttaaaaa 1021 gagagtgggt gtaccaattt aaatataaga actttgaaga agcctatcag agtattttct 1081 ggtacatcga agccttttat aattcaaaac gaatccatca aagtttaggg tatcttacac 1141 ctaatcaatt tgaaaaggta agtgcttaaa ataaatagat taaaattcta cgtttgttac 1201 tctaaaaact tgacttaacg tc // LOCUS DRONCDA 2294 bp ss-mRNA INV 28-JUN-1990 DEFINITION D.melanogaster non-claret disjunctional protein (ncd) mRNA, complete cds. ACCESSION M33932 KEYWORDS non-claret disjunctional protein. 
SOURCE D.melanogaster (strain dp cl cn bw) 0-4 hr embryo, cDNA to mRNA, clone pNB40. ORGANISM Drosophila melanogaster Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta; Pterygota; Neoptera; Holometabola; Diptera; Brachycera; Cyclorrhapha; Schizophora; Drosophiloidea; Drosophilidae. REFERENCE 1 (bases 1 to 2294) AUTHORS Mcdonald,H.B. and Goldstein,L.S.B. TITLE Identification and characterization of a gene encoding a kinesin-like protein in Drosophila JOURNAL Cell (1990) In press STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by H.B.Mcdonald, 30-APR-1990. FEATURES from to/span description pept 111 2168 non-claret disjunctional protein site 180 319 alpha helical domain BASE COUNT 622 a 661 c 603 g 408 t ORIGIN 1 bp upstream of EcoRI site; chromosome 3 map position 99BC. 1 gaattgataa aatcggttgc aaggaggcag acgtatcttc taagttaggc acaacacagt 61 tggcgatgga atcccggcta ccgaaaccgt cgggcctgaa gaaaccccaa atgccgatta 121 aaaccgtgct gcccacagat cgaattcgcg caggattggg aggtggagcc gctggagcag 181 gcgccttcaa tgtcaatgcc aaccagacat actgcggcaa cttattgccg cccctctcaa 241 gggacctcaa caatctgccc caggtgctgg agcgtcgcgg aggaggagca cgtgccgcct 301 ccccagagcc catgaagttg ggccaccggg ccaagctgag acgtagccgt agcgcttgcg 361 acatcaacga actgcgtggt aacaagcgca ctgcggctgc tccttcattg cccagcattc 421 ccagcaaagt atcccgcctg ggcggtgcac tcactgtttc cagccagcga ctagtgcgtc 481 ctgcggcgcc ttcgtcaata acagcaacag ctgtcaaaag accaccagta acgcgtcctg 541 ctccacgggc tgcaggagga gcagccgcca agaaaccagc aggaacagga gcagcagctt 601 cgtcaggagc cgcggctgct gctcccaagc gcatcgctcc ctacgacttc aaggcccgct 661 tccacgatct gctagagaag cacaaggtgc ttaagacaaa gtacgaaaag caaacagagg 721 acatgggcga gctggagtcc atgcctcagc aactggagga gacgcagaac aagcttatcg 781 agacggagtc ctcgctgaag aacacccaga gcgacaacga gtgtcttcag aggcaggtga 841 agcagcatac cgccaaaatt gaaacaatca catcgacgct gggcaggacc aaagaggagc 901 tatccgagct gcaagcaata catgagaaag taaaaacgga gcatgctgct ctaagcacag 961 aagtggtgca tctgcgccag cgcaccgagg 
aactcctgcg ctgcaatgag cagcaggccg 1021 ccgagctgga gacctgcaaa gagcagctct tccagtcgaa catggagcgc aaagagctgc 1081 acaacacggt catggacctg cgcggcaaca tccgggtctt ctgtcgaata cgaccgccgc 1141 tggagtccga ggagaaccgt atgtgttgca cctggaccta tcacgacgag tccaccgtgg 1201 agctgcagag cattgacgca caggccaaaa gcaagatggg ccagcagatc ttctcattcg 1261 accaggtctt ccacccgctc tcctcgcagt cggacatctt cgagatggtc tcgccgctca 1321 tccagtcggc cctggatggc tacaatatct gcatctttgc ctacggacag acgggcagtg 1381 gcaagaccta cacaatggac ggagtgccgg agagtgtggg cgtcataccg cgcacggtgg 1441 atctgctctt cgactccatc cggggatatc gcaacttggg ctgggagtac gagatcaagg 1501 ccacctttct ggagatctac aacgaggtgc tctacgatct gctgagcaac gagcagaagg 1561 acatggagat tcgaatggcc aagaacaaca agaacgacat ctacgtgtcc aacataacgg 1621 aggagacggt tctggatcca aatcacctgc gccacctcat gcacacggcc aagatgaacc 1681 gtgccaccgc ctcgacagct ggcaacgagc gctcctctcg ttcccacgcg gttaccaagc 1741 ttgagctcat cggacgccat gccgaaaagc aagagatctc cgtgggttcc ataaacctgg 1801 tggatttggc cggctctgag tctcccaaga cgagcacccg gatgaccgag acaaagaaca 1861 tcaatcgctc gctatcggag ctcaccaacg taatcctggc gctgctgcag aagcaggacc 1921 acatcccgta caggaactcc aagctgacgc acctgctgat gccctcgctg ggcggcaact 1981 cgaaaacgct tatgttcatc aacgtctcgc cgttccaaga ctgtttccaa gagtccgtca 2041 agtcgctgcg cttcgcggcc tccgtaaact cctgcaaaat gaccaaggcc aagcggaatc 2101 gctacctgaa caactcggtg gccaacagca gcacacagag caacaacagc ggcagtttcg 2161 ataaataaag aatgcattct gagcccagtt ttaacaattt tcaaatttct aacctgttat 2221 tgcttaattt atgtgtgttt acttttagtg caaataaact aataaagtgc tggaaaaaaa 2281 aaaaaaaaaa aaaa // LOCUS ECOPHOAJ 3133 bp ds-DNA BCT 28-JUN-1990 DEFINITION E.fergusonii alkaline phosphatase (phoA) gene, complete cds. ACCESSION M33966 KEYWORDS alkaline phosphatase. SOURCE E.fergusonii (ATCC 35469) DNA. ORGANISM Escherichia fergusonii Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Facultatively anaerobic rods; Enterobacteriaceae. REFERENCE 1 (bases 1 to 3133) AUTHORS DuBose,R.F. and Hartl,D.L. 
TITLE The molecular evolution of bacterial alkaline phosphatase: Correlating variation among enteric bacteria with experimental manipulations of the protein JOURNAL Mol. Biol. Evol. (1990) In press STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by R.F.DuBose, 01-MAY-1990. FEATURES from to/span description pept 544 1962 alkaline phosphatase (phoA) (EC 3.1.3.1) site 466 486 pho box regulatory sequence BASE COUNT 874 a 711 c 774 g 774 t ORIGIN 1 aactattaat taagcaattt tttaattcgt ggaattataa tgcgcgagaa acgtttcgcg 61 ctaatttgtg ttaagaatat ttgccaaaag tattcgttat gtcataaaaa tactactttc 121 agactctagg aagttttgag tctaaatgtt aacgtcacca tactttggat aaggaaatgc 181 agttatgaaa aatcttattt ctgagttgtt gattaaactt gcacaaaaag aagaagagtc 241 aaaggagttg gttgcccagg tagaagccct ggagattatc gtcacggcta tgttgcgtaa 301 tatggggcaa cctgaacaaa aacgcgtaat cgaccagata gaaggggctt tggccggggt 361 aaagcctgat gccagcgttc ctgacggcga tacggaaatg ctgcgcgcat atgtaaagaa 421 gctcttaagg catccccggc aataagacac agaaccgaca cgacactgtc ataaagttgt 481 catcctcgct acctatagtc gctttgtttt tattttactg tatttataca tggagataat 541 aaagtgaaac aaagcgctat tgcattggcc ctgttatctt gtctgattac cccggtaagc 601 caggctcaaa cttctcaaaa tatcaatatc ctggaaaatc gtgccgctca gggtgatatt 661 acaatgccag gtggcgctcg ccgtttaagt ggcgatcaaa ctgaggcatt acgtgcatct 721 cttaatgata agccagctaa aaatattatt ttattaattg gcgatggcat gggggattct 781 gaaattactg ctgcacgaaa ttatgcagaa ggtgcaggtg gttattttaa agggatcgat 841 gcgttgccat taaccgggca gtacactcac tatgcattgg acaaaaaaac cggcaaacct 901 gattacgtga cagattctgc tgcatccgca accgcatgga cgacaggcgt caaaacatac 961 aatggtgcgc tgggtgtcga tatccatgaa aatcctcata ccactattct tgagatggca 1021 aaagcagccg gtctggcaac aggcaacgtg tctacggctg agttgcagga tgcgacacct 1081 gctgcgttgg tttctcatgt cacctcacgc aaatgctatg gcccttctgt taccagtgaa 1141 aaatgcccgg gcaatgcact ggaaaagggg gggaaaggtt caatcactga acaactgttg 1201 aatgcccgtg cggatgtgac gttgggtggt ggtgcaaaaa cgtttgctga aacggcaacc 1261 gctggagagt ggcagggtaa 
aacattgcgt gagcaagctc tggctcgcgg ttatcagatt 1321 gtcagcgatg ccgcatccct cgcagcagtg acgcaagcag ggcaggacaa acctctgctc 1381 gggttatttg ctgaaggaaa tatgcctgta cgttggcatg ggccaaaggc gagctatcac 1441 gggaatctcg ataagcccgc agtaacctgc acgccaaatc ctcagcgtaa cgaaactgta 1501 ccgacactgg cacagatgac agataaagct attgaattac ttagcaaaaa tgagagaggt 1561 ttcttcctgc aagtagaagg ggcatctata gataagcaag atcacgcggc taatccatgt 1621 ggacagattg gtgagacggt tgatctcgac gaagctgttc aacgtgctct ggaatttgcc 1681 aaaaaagatg gcaatacact ggtgatcgta actgcagatc atgcgcattc cagccagatt 1741 gtggcaccag ataccaaagc gccaggttta actcaggcgc taaatacgaa agatggtgcg 1801 gtgatggcga taagctacgg taactctgaa gaagactctc aggagcatac cggtagccag 1861 ctgcgcatcg cagcgtatgg gccgaatgcc gctaatgtgg tcggactgac cgatcaaacg 1921 gatcttttct acaccatgaa agccgctctg ggcttgcaat aatccatacc gacggccatt 1981 aattgcgccg tcgggtggtt ttttcaccat cagcagccag acttataaac actatccttc 2041 gggcgtattt tccggcccga aggaaactgg tgaagttgca acctcacaaa aggatggtgc 2101 gatgaaaatg acattactgg ttaccctgct atttgggctg gtttttttaa ccactgtcgg 2161 tgctgccgag agaacattaa cgccgcaaca acagcgtatg accacctgta atcagcaagc 2221 tactgcacag gcgttaaaag gtgatgcccg aaaaacgtat atgagcgatt gcctgaaaaa 2281 cagcaaatct cccccgaatg aaaaaagcct gacaccacaa caacaaaaga tgcgtgaatg 2341 taatacccag gcgacggaac aatcactgaa aggtgacgac cgtaataaat ttatgagtgc 2401 gtgtctgaag aaagctgcat agatccccct tagtgcttac gggtgagaca cgcaaacgcc 2461 tcacccgaaa tatcatactt ctccctcacc ccccctccta taatttggga aatttgtttc 2521 taaatattcc caaaaaaata atgaatgatg aaaacttcct taaaaaagcg gtcgcaccgg 2581 gtgaagaacc accttcttca cctcaaaacg aacatcaacg ttccgggtta cgctttgccc 2641 gtcgtgtgag gctaccaaga gttgtcggtt ttggcggtat gttctcccca tcgcctcggc 2701 gctggtggag caaccgatcg ctggatggtg gtggttgatc cttgttggct ggtcattcgt 2761 ttggccacat ctggcctggc aaatgtccag tcgtgcaatg gaccccttaa accgtgaaat 2821 ttacaaccta aaaacggatg cgattactgc cgggatgtgg gttggcataa tgggtgttaa 2881 cgcactacct tcaacgacga tgctgatgat tatgtgcctt aacctgatgg gcgccggagg 2941 acttaggttg ttcgtcgccg ggctggtatt 
gatggtagtc gcatgtctgg tgacgctaca 3001 gttgaccggc atccgcgtcg cgtttaccag ttcgcccctt gagtggtggc tttcactccc 3061 tgtcattgtg ttgtatccct taattttcgc ctgggtcagc tatcagacag caactaaatt 3121 ggcagagcat aaa // LOCUS SMAPHOAA 2672 bp ds-DNA BCT 28-JUN-1990 DEFINITION S.marcescens alkaline phosphatase (phoA) gene, complete cds. ACCESSION M33965 KEYWORDS alkaline phosphatase. SOURCE S.marcescens (ATCC 13880) DNA. ORGANISM Serratia marcescens Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Facultatively anaerobic rods; Enterobacteriaceae. REFERENCE 1 (bases 1 to 2672) AUTHORS DuBose,R.F. and Hartl,D.L. TITLE The molecular evolution of bacterial alkaline phosphatase: Correlating variation among enteric bacteria with experimental manipulations of the protein JOURNAL Mol. Biol. Evol. (1990) In press STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by R.F.DuBose, 01-MAY-1990. FEATURES from to/span description pept 667 2094 alkaline phosphatase (phoA) (EC 3.1.3.1) site 575 595 pho box regulatory sequence BASE COUNT 588 a 845 c 749 g 490 t ORIGIN 1 ggccagtgaa ttcctggcgc tgaacccgaa cggcaagatc ccggcgatca tcgatcccga 61 tggccccggc ggccggccgc tgccgctgtt cgaatccggc gccattctgc tctacctggc 121 ggaaaagagc ggccgtttcc tgccgcagga cccggcgcag cgttacgaaa ccattcaatg 181 ggtgttcttc cagatggccg ccgtcggccc gatgttcggc cagctcggtt tcttccataa 241 attcgccggg cgcgaatatg aagacaagcg gccgctggag cgctacaaaa acgaatcgaa 301 acgcctgctc ggcgtgctgg aaacgcgact tgaaggccgc gactggatca tgggcgcgga 361 gtacaccatc gccgatatct cgctgctggg ttgggtgcgc aacctgatcg gcttctatga 421 ggcgcgcgaa ctggtggagt tcgacagttt ccctcgcgtc gcgcagtggc tggagcgcgg 481 tctggcgcgc ccggcggtgc agcgtgggct cgctatcccg gcgcgcgcgg cctgagcgtc 541 aaacgggatg gccgccgggc cgtcccgcct tccgctgtca taaaattgtc atccctctgc 601 tctactctcc tttcgtaaca atttgatgtt atttacattg ttgtattacg acaaggagct 661 taccccatgc aacctgctgt atccctgatc gccggcgccg tactgtccgc cctgctctgc 721 tcctccgcca tcgcggcgga aacctcggcc aatgccgatg gcctgaccga 
tcgcgccgcg 781 cgcggcaacc tcgtcgaacc gggcggcgcc cgccgtctgg ccggcgatca gaccaccgcg 841 ctgaaggcat cgctgtcgga caaaacggcg aaaaacgtga tcctgctgat cggcgacggc 901 atgggggatt ccgagatcac cgccgcgcgc aactacgccg aaggcgcagg cggttacttc 961 aagggcatcg acgccctgcc gctgaccgga caatacaccc attactcgct ggacaggaaa 1021 acccacaagc cggactacgt gaccgactcc gccgcctccg ccaccgcctg ggccaccggg 1081 gtgaaaacct ataacggcgc gctcggcgtg gacgtgaacg gcaaggatca accgacgctg 1141 ctggagatcg ccaaggcggc cggcaaggcc accggcaacg tctccaccgc cgagttgcaa 1201 gacgccacgc cggcggcgct ggtctcgcac gtcatctcac gcaagtgcta tggcccggag 1261 gaaaccagcg agaaatgcgc cgctaacgcg ctggaaaatg gcggccgtgg ctccatcacc 1321 gaacaactgc tgaaaacccg cgccgacgtc acgctgggcg gcggcgcgaa atcgttcaac 1381 cagctggcaa aaagcggcga atggcaagga aaatcgctga aggaccaggc ggcggcgcag 1441 ggttatcagt gggtgagcaa cgccgacgaa ctgcaggccg ttaccctcgc caaccagcaa 1501 aaaccgctgc tggggctgtt cgccgacggc aatatgccgg tgcgttggct ggggccgaaa 1561 gccagctacc atggcaacct cgacaaaccg gcagtgacct gcgaaaacaa cccggcgcgc 1621 accgccgcca cgccaacgct ggcggcgatg accgaaaaag cgatcgcgct gttgaaagac 1681 aacccgaacg gcttcttcct gcaggtagag ggcgcgtcga tcgataaaca ggatcacgcc 1741 gccaacccgt gcgggcagat tggcgaaacc gtcgatctgg acgaagcggt acaaaaagcg 1801 ctggccttcg cccgcgccga cggcaacacg ctggtgatcg tcaccgccga ccacgcgcac 1861 gccagccaga tcgtctccgc cgacgccaaa gcgccgggcc tgacccaaaa gctgaccacc 1921 aaggacggcg cgccgatgac cctcagctac ggcaactctg aagaggaatc tcagggacac 1981 accggcaccc agctgcgcgt cgcggcctac ggcccgcacg ccgccaacgt agtgggcctg 2041 accgatcaga ccgatctgtt cttcaccatg cgcgatgcga tgggtatcaa gtaaccgacc 2101 cctgaacgat ggccgcccgc cacggcggcc atttattatt ggttaccgct aggtatttgc 2161 aatttaaccg caattattgt gtgataacga aataacgtct aattacgcgg caattaaccc 2221 atgaaaataa cgcggttata cggcctttct tccctctctg tttattcctg cgacagcaac 2281 cggccgtttt ttgcttatat tcattttttt tagttacaaa agtagttttt aacattctcg 2341 caaacaataa taccccgcca gtaacgtgaa ccgatcaatt tgcatccgga cggcctgcaa 2401 tagttcactt ttcgttaaat accctattaa caaaaaggca ttattgctgc ctatcggctt 2461 
tcgccgggcg gtcggcttca ttatttccag ggtaactccc gctaaattct ttcacctgca 2521 atgagtagaa tttacacaca agagatttaa ttcaatgaga aaattactcg tcctgatttt 2581 cagtctcagc ctgtttggcg ttacgcaaca ggcggcggct gaagagaaga ataaaaccgt 2641 cgacctgcag gcatgcaagc ttggcactgg cc // LOCUS STYMALGA 1232 bp ds-DNA BCT 28-JUN-1990 DEFINITION S.typhimurium inner membrane protein (malG and malF) genes, complete cds, and 3' end. ACCESSION M33921 KEYWORDS . SOURCE S.typhimurium (strain LT-2) DNA. ORGANISM Salmonella typhimurium Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Facultatively anaerobic rods; Enterobacteriaceae. REFERENCE 1 (bases 1 to 1232) AUTHORS Francoz,E., Schneider,E. and Dassa,E. TITLE Sequence of malG in Salmonella typhimurium JOURNAL Unpublished (1990) STANDARD full staff_entry COMMENT malG protein is proposed to form with proteins malK, malF, and malE a hetero-oligomeric protein complex that performs the ATP-dependent translocation of maltose and of maltodextrins through the inner membrane of enterobacteriaceae. Draft entry and computer-readable sequence for [1] kindly submitted by E.Dassa, 30-MAR-1990. Institut Pasteur Unite de Programmation Moleculaire et Toxicologie Genetique Inserm U169, CNRS URA 271 e-mail: hofnung%pasteur.bitnet@cunyvm.cuny.edu FEATURES from to/span description pept < 1 64 inner membrane protein (malF) pept 79 969 inner membrane protein (malG) binding 66 70 ribosomal binding site (put.) signal 1050 1073 rho-independent transcription termination signal (put.) BASE COUNT 241 a 336 c 332 g 323 t ORIGIN 91 minutes on the S.typhimurium map.
1 cttcctgctg gtaggcgcgc tggcaatagt gaacctgaaa gccacgcgta tgaagtttga 61 ttaagggaga taataacaat ggctatggtc caacccaaat ctcagaaact gcgtctcctc 121 atcacgcacc tggggctgct gattttcatc gcagcgatta tgttcccgct gctgatggtc 181 atcgctatct cgctacgtga agggaacttc gccaccggaa gcctgatccc ggataaaatc 241 tcctgggaac actggcggct ggcgctgggt ttcagcgtgg aacatgcgga tggtcgggta 301 acgccgcctc ccttcccggt cctgctgtgg ctgtggaact cggtgaaaat cgccggtatt 361 accgccatcg gcatcgtggc gctctccacc acctgcgcct acgctttcgc acgaatgcgt 421 tttccaggta aagcaacact gctcaaaggg atgctgattt tccaaatgtt tccggcggtt 481 ttgtctctgg tagcgctgta tgcgttattt gatcgcctgg gacaatacat tccatttatc 541 gggctgaata cgcatggcgg cgtgatcttc gcctatcttg gcggcatcgc gctgcatgtt 601 tggacgatta aaggctactt cgaaactatc gacagttcgc tggaagaggc cgccgcgctg 661 gatggcgcaa ccccgtggca ggcgtttcgc ctggtgctgc tgccgctttc cgtaccgatt 721 ctggcggtgg tgtttattct gtcgttcatc gcagcgatta ccgaagttcc ggtcgcctcg 781 ttgttactgc gcgatgtgga cagctatacg ctggccgtgg gtatgcagca atatctcaac 841 ccgcaaaact acctgtgggg cgactttgcc gccgccgccg tcctttcagc tattcctatc 901 accctggtgt tcctgttggc gcaacgttgg ctggtcaacg gcctgacggc agggggtgtg 961 aaaggttaag ttttatcgaa gtacctgccc ggcaacgggc aatgccactg ccctcaagtt 1021 atgttatccc cacttatggt cgtattttgg cgctcttcgg agcgcttttt ttatgcgtat 1081 agacagcggc taatttattg ccacgccata cgtccgttac ggtgtggcga cgcctattct 1141 cgccgtagtc ggttcgaatt acataaccac aagtgataac caacaggagg atcgccgccg 1201 aatagagtaa tacatccatt gctttatgat cc // LOCUS ECOCLPPA 1236 bp ds-DNA BCT 28-JUN-1990 DEFINITION Escherichia coli ATP-dependent clp protease proteolytic component (clpP) gene, complete cds. ACCESSION J05534 KEYWORDS ATP-dependent protease. SOURCE E.coli (strain K12) DNA. ORGANISM Escherichia coli Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Facultatively anaerobic rods; Enterobacteriaceae. REFERENCE 1 (bases 1 to 1236) AUTHORS Maurizi,M.R., Clark,W.P., Katayama,Y., Rudikoff,S., Pumphrey,J., Bowers,B. and Gottesman,S. 
TITLE Sequence and structure of ClpP, the proteolytic component of the ATP-dependent Clp protease of E.coli JOURNAL J. Biol. Chem. (1990) In press STANDARD full staff_entry COMMENT Draft entry and computer-readable [or printed] sequence for [1] kindly submitted by S.Gottesman, 23-APR-1990. FEATURES from to/span description pept 378 1001 ATP-dependent protease (clpP) mRNA 279 > 1236 clpP mRNA pept < 1 133 ORF signal 244 249 -35 region signal 267 272 -10 region signal 165 193 transcription termination signal binding 367 372 ribosome binding site (put.) BASE COUNT 329 a 291 c 318 g 298 t ORIGIN 1 caaaaacaaa gaactgatgg acaacatgcg caatgttgct ctggaagaac aggctgttga 61 agctgtactg gcgaaagcga aagtgactga aaaagaaacc actttcaacg agctgatgaa 121 ccagcaggcg taatttacgc agcataacgc gctaaattcg cacaaaggcc cgtcaccgcc 181 aggtggtggg cttttttttg tcatgaattt tgcatggaac cgtgcgaaaa gcctctttcg 241 gtgttagcgt aacaacaaaa gattgttatg cttgaaatat ggtgatgccg tacccataac 301 acaggactag ctgataatcc gtccataagg ttacaatcgg tacagcaggt tttttcaatt 361 ttatccagga gacggaaatg tcatacagcg gcgaacgaga taactttgca ccccatatgg 421 cgctggtgcc gatggtcatt gaacagacct cacgcggtga gcgctctttt gatatctatt 481 ctcgtctact taaggaacgc gtcatttttc tgactggcca ggttgaagac cacatggcta 541 acctgattgt ggcgcagatg ctgttcctgg aagcggaaaa cccagaaaaa gatatctatc 601 tgtacattaa ctccccaggc ggggtgatca ctgccgggat gtctatctat gacaccatgc 661 agtttatcaa gcctgatgtc agcaccatct gtatgggcca ggcggcctcg atgggcgctt 721 tcttgctgac cgcaggggca aaaggtaaac gtttttgcct gccgaattcg cgcgtgatga 781 ttcaccaacc gttgggcggc taccagggcc aggcgaccga tatcgaaatt catgcccgtg 841 aaattctgaa agttaaaggg cgcatgaatg aacttatggc gcttcatacg ggtcaatcat 901 tagaacagat tgaacgtgat accgagcgcg atcgcttcct ttccgcccct gaagcggtgg 961 aatacggtct ggtcgattcg attctgaccc atcgtaattg atgccagagg cgcaactgtg 1021 ccgctatact tatccagggc ggcacaacgc tgtaagcgct tgcgcctgag aatggcattt 1081 gcgtcgtcgt gtgcggcaca aagaacaaag aagaggtttt gacccatgac agataaacgc 1141 aaagatggct caggcaaatt gctgtattgc tctttttgcg gcaaaagcca gcatgaagtg 
1201 cgcaagctga ttgccggtcc atccgtgtat atctgc // LOCUS PLYORIA 375 bp ds-DNA VRL 28-JUN-1990 DEFINITION Human polyomavirus BK (strain WWT) origin of replication. ACCESSION M34048 KEYWORDS origin of replication. SOURCE Polyomavirus BK (strain WWT) DNA, from human urine. ORGANISM Polyomavirus BK Viridae; ds-DNA nonenveloped viruses; Papovaviridae; Polyomaviruses. REFERENCE 1 (bases 1 to 375) AUTHORS Sundsfjord,A., Johansen,T., Flegstad,T., Moens,U., Villand,P., Subramani,S. and Traavik,T. TITLE At least two types of control regions can be found among naturally occurring BK virus strains JOURNAL Unpublished (1990) STANDARD full staff_entry COMMENT Draft entry and computer-readable [or printed] sequence for [1] kindly submitted by T.Johansen, 07-MAY-1990. University of Tromso P.O. Box 977 N-9001, Tromso, NORWAY e-mail: terje.johansen@esuit.uit.no FEATURES from to/span description orgrpl 97 119 polyomavirus origin of replication site 1 375 regulatory region site 143 210 P block site 211 249 Q block site 250 302 R block BASE COUNT 123 a 86 c 95 g 71 t ORIGIN 1 ttttgcaaaa attgcaaaag aatagggatt tccccaaata gttttgctag gcctcagaaa 61 aagcctccac acccttacta cttgagagaa agggtggagg cagaggcggc ctcggcctct 121 tatatattat aaaaaaaaag gccacaggga ggagctgctt acccatggaa tgcagccaaa 181 ccatgacctc aggaaggaaa gtgcatgact gggcagccag ccagtggcag ttaatagtga 241 aaccccgccc ctgaaattct caaataaaca caagaggaag tggaaactgg ccaaaggagt 301 ggaaagcagc cagacagaca tgttttgcgg gcctaggaat cttggccttg tccccagtta 361 aactggacaa aggcc // LOCUS PLYORIB 424 bp ds-DNA VRL 28-JUN-1990 DEFINITION Human polyomavirus BK (strain UT) origin of replication. ACCESSION M34049 KEYWORDS origin of replication. SOURCE Human polyomavirus BK (strain UT) DNA, from human urine. ORGANISM Polyomavirus BK Viridae; ds-DNA nonenveloped viruses; Papovaviridae; Polyomaviruses. REFERENCE 1 (bases 1 to 424) AUTHORS Sundsfjord,A., Johansen,T., Flegstad,T., Moens,U., Villand,P., Subramani,S. and Traavik,T.
TITLE At least two types of control regions can be found among naturally occurring BK virus strains JOURNAL Unpublished (1990) STANDARD full staff_entry COMMENT Draft entry and computer-readable [or printed] sequence for [1] kindly submitted by T.Johansen, 07-MAY-1990. University of Tromso P.O. Box 977 N-9001, Tromso, NORWAY e-mail: terje.johansen@esuit.uit.no FEATURES from to/span description orgrpl 97 119 polyomavirus origin of replication site 1 424 regulatory region site 143 210 P block site 211 249 Q block site 250 261 R block site 263 315 P block site 316 349 Q block site 350 361 R block BASE COUNT 134 a 103 c 105 g 82 t ORIGIN 1 ttttgcaaaa attgcaaaag aatagggatt tccccaaata gttttgctag gcctcagaaa 61 aagcctccac acccttacta cttgagagaa agggtggagg cagaggcggc ctcggcctct 121 tatatattat aaaaaaaaag gccacaggga ggagctgctt acccatggaa tgcagccaaa 181 ccatgacctc aggaaggaaa gtgcatgact gggcagccag ccagtggcag ttaatagtga 241 aaccccgccc ctaaaattct ctcttaccca tggaatgcag ccaaaccatg acctcaggaa 301 ggaaagtgca tgactgggca gccagccagt ggcagttaat agtgaaaccg gaaagcagcc 361 agacagacat gttttgcgag cctaggaatc ttggccttgt ccccagttaa actggacaaa 421 ggcc // LOCUS ECOUNCGA 85 bp ds-DNA BCT 28-JUN-1990 DEFINITION E.coli H+-ATPase gamma subunit (uncG) gene, 3' end. ACCESSION M34095 KEYWORDS H+-ATPase. SOURCE E.coli (strain KF12) DNA, clone pBWG1. ORGANISM Escherichia coli Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Facultatively anaerobic rods; Enterobacteriaceae. REFERENCE 1 (bases 1 to 85) AUTHORS Iwamoto,A., Miki,J., Maeda,M. and Futai,M. TITLE H+-ATPase gamma-subunit of Escherichia coli: Role of the conserved carboxyl-terminal region JOURNAL J. Biol. Chem.
265, 5043-5048 (1990) STANDARD simple staff_entry FEATURES from to/span description pept < 1 84 H+-ATPase gamma subunit (uncG) (AA at 1) BASE COUNT 21 a 24 c 23 g 17 t ORIGIN 1 ctgcagttgg tatacaacaa agctcgtcag gccagcatta ctcaggaact caccgagatc 61 gtctcggggg ccgccgcggt ttaaa // LOCUS HUMPIM1B 1737 bp ds-DNA PRI 28-JUN-1990 DEFINITION Human protein kinase (Pim-1) promoter DNA. ACCESSION M34228 KEYWORDS protein kinase; proto-oncogene. SOURCE Human cell line K562 DNA, clone pLH/P. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 1737) AUTHORS Meeker,T.C., Loeb,J., Ayres,M. and Sellers,W. TITLE The human Pim-1 gene is selectively transcribed in different hemato-lymphoid cell lines in spite of a G+C-rich housekeeping promoter JOURNAL Mol. Cell. Biol. 10, 1680-1688 (1990) STANDARD simple staff_entry FEATURES from to/span description mRNA 1699 > 1737 Pim-1 mRNA (alt.) mRNA 1700 > 1737 Pim-1 mRNA (alt.) mRNA 1703 > 1737 Pim-1 mRNA (alt.) mRNA 1705 > 1737 Pim-1 mRNA (alt.) mRNA 1706 > 1737 Pim-1 mRNA (alt.) mRNA 1732 > 1737 Pim-1 mRNA (alt.) mRNA 1733 > 1737 Pim-1 mRNA (alt.) mRNA 1735 > 1737 Pim-1 mRNA (alt.) 
binding 891 900 Sp1 binding site binding 1422 1431 Sp1 binding site binding 1433 1442 Sp1 binding site binding 1496 1505 Sp1 binding site binding 1222 1248 AP2 binding site binding 1290 1308 AP2 binding site binding 1662 1685 AP2 binding site binding 1340 1366 PPF-348 binding site BASE COUNT 360 a 546 c 426 g 405 t ORIGIN 1 ctttgaatac tcaggaggtg aatttgggtc atttcatctc ctctcttagc tgctgtatga 61 ccctgtgcca tcctatttat gtttcatttt cctatctgca aaaagggagt aattcttgtt 121 cccttgggtt ttctcctagt cataaaaaat gaatgagttc ggccacagta cacaaacaaa 181 agaaaatatg taatattttg taggataaat tcataaaagt tgtggaggat ctgggcactt 241 ttataaagca agcactgtag aaaactttca aaggaaacat ttaaattcac taatgacagc 301 tatagggatc ctgatttttc ctttaaatgg caggcacttc aaaaattaaa atagaatgta 361 gttagctttc aattacttag gccactgaca attcaattta tggattatat agtattttaa 421 tttactcatt tcacacgtcg ttatcaactc tacatgaagt ttaaaaggac aaaggatgca 481 agtggaattg gtcagtgttc cgggttattt ccaagggagg cagagggtgg ggggtttcct 541 ttgagacaag acttggggtt ggccaataat tgctggtatt gcctgcctgg taataacagg 601 ctgatgaaaa ggtgactaca aaacgtgaaa actggttaaa tcaagcgcac cctcccaccc 661 tagttttaga tgaggaattt tccgccctca cagaaggggc tgaggcagca tctggcatca 721 caacactaac atttgcttcg tgatttcctc tttacccggc cctttgacac acatcccttc 781 ccagaaatca ggattcgctg gtgcttttgc atttctaaaa tgggaatccc gtggctgagc 841 ttttagccgg ccggaacgac tgagggctgc atccctttcg caggagcggc gtcccgcctc 901 cagttctgtt ccaggcctga ctcctccact ccctccgtga ctcatgtctg cggatccttc 961 gcccccgacg cgccccccaa cacacaaacc cccagaatcc gcccccagcc tacagcgcga 1021 cgtcagcccg ccccagccga cttggaggtc tcgggtctga gtcacacaga aagaccaccc 1081 tcgtcggcat ccccacacac agtccgacac ccggcgcgcc ggcctccccg cctgacacac 1141 taacgcccgt cgtctccgcg caacttgtta tgctccggct cgagcccttg acccaaaaac 1201 ctcagcgaaa cggagagccg cagagccggc ctcgggcggc ctttgatggc tttgttattg 1261 tttgggtttg aatcgatacg cccctcccca tccttcctcc ctcgcggccc tacacccagc 1321 tcccgcctcc cctcacgccc cgcgcccctc cccctccatt ttggcgcctt ttccttcccg 1381 ccacgtcgtg gcggcgtaga gaccattctg accgcgagag ctgggcgggg 
cgggggcggg 1441 gcgcgccgag ttatgcagat caatcggcct ctggttggct ggagtagcgc tggcaggggc 1501 ggggccgggg cgcggccaca gagcgcgcgg ggcgggggcc gaggggagtc gcccagtccc 1561 gccgcttccc caccccctct cctccctcgg ccggcccggc agccctgctc cccgccttgg 1621 cctcccggag aggccccgcc ccgtcccccc ccgccgcgcc ctccccgcgc gccctccccg 1681 ccggcgcgct cctccccttt actcctggct gcggggcgag ccgggcgtct gctgcag // LOCUS ONNCG 11835 bp ss-RNA VRL 28-JUN-1990 DEFINITION O'Nyong-nyong virus (Gulu strain) complete genome. ACCESSION M20303 M33999 KEYWORDS nonstructural protein; polyprotein. SOURCE O'Nyong-nyong virus (strain Gulu, isolated from Anopheles gambiae), cDNA to viral RNA, clones 2,4,7,8,9,11,13,14,17,17s,18,22,23,27, passed in BHK-21 cells. ORGANISM O'Nyong-nyong virus Viridae; ss-RNA enveloped viruses; Positive strand RNA virus; Togaviridae; Alphaviridae. REFERENCE 1 (bases 1 to 11835) AUTHORS Levinson,R.S., Strauss,J.H. and Strauss,E.G. TITLE Complete sequence of the genomic RNA of O'Nyong-nyong virus and its use in the construction of alphavirus phylogenetic trees JOURNAL Virology 175, 110-123 (1990) STANDARD simple staff_entry REFERENCE 2 (bases 4079 to 7621) AUTHORS Strauss,E.G., Levinson,R., Rice,C.M., Dalrymple,J. and Strauss,J.H. 
TITLE Nonstructural proteins nsP3 and nsP4 of Ross River and O'Nyong-nyong viruses: Sequence and comparison with those of other alphaviruses JOURNAL Virology 164, 265-274 (1988) STANDARD full staff_entry FEATURES from to/span description pept 80 7624 O'Nyong-nyong polyprotein A matp 80 1684 nonstructural protein P1 matp 1685 4078 nonstructural protein P2 matp 4079 5788 nonstructural protein P3 matp 5789 7621 nonstructural protein P4 pept 7670 11413 O'Nyong-nyong polyprotein B matp 7670 8449 C protein matp 8450 8641 E3 protein matp 8642 9910 E2 protein matp 9911 10093 6K protein matp 10094 11410 E1 protein mRNA 1 11835 O'Nyong-nyong mRNA RNA 7621 7669 26S RNA signal 11817 11835 polyA signal BASE COUNT 3676 a 2860 c 2859 g 2440 t ORIGIN 1 atagctgcgt gatacacaca cgcagcttac gggtttcata ctgctctact ctgcattgca 61 agagattaaa gtacccatca tggattcagt gtatgtagac atagatgctg acagcgcgtt 121 tctgaaggcg ttgcagcaag cataccccat gtttgaggtg gaaccaaagc aggtcacgcc 181 aaatgaccat gcaaacgcta gagcattttc gcatctagca ataaaactga tagagcagga 241 aattgatcca gactcaacca ttctagacat tggtagcgca ccagctagga ggatgatgtc 301 tgatagaaaa taccactgcg tctgcccgat gcgcagcgca gaagaccctg agaggctcgc 361 gaattacgcg agaaaacttg cgtcagccgc tggaaaggtg acagataaaa acatctccgg 421 aaaaattaat gatctacaag ctgtgatggc cgtaccgaat atggaaacat ccacattctg 481 cctacacact gatgctacat gcaaacaaag aggagacgtc gccatttatc aagacgtcta 541 cgccgtccat gcacctacct cgctgtacca tcaggcgatt aaaggagtcc gcgtggcata 601 ctggataggg ttcgatacga cacctttcat gtacaatgca atggctggcg catacccatc 661 atattcaaca aactgggctg atgagcaggt actgaaagct aagaacatag ggctgtgttc 721 aacagaccta tctgagggta gacgaggcaa actatccatc atgagaggca aaaaattgaa 781 gccatgcgac cgagtgctat tctcggtcgg ctcaacactc taccctgaaa gtcgtaaact 841 tctacaaagc tggcatttac catcggtatt tcatctgaag ggtaaactca gcttcacctg 901 ccgctgtgac acgatcgtct catgcgaagg atacgttgtc aagagagtga ccatgagtcc 961 aggcatctac ggaaagacat cggggtatgc tgtaactcat catgccggcg gcttcctgat 1021 gtgcaagacg acagatacag tagacggcga aagggtatca ttctccgtgt 
gtacttacgt 1081 accagctact atctgcgacc agatgactgg aatccttgcc actgaggtaa ccccagaaga 1141 cgcacagaaa ctactggttg ggctaaacca acggatagtg gtcaatggca ggacgcaacg 1201 taatacaaac accatgaaaa actacctgct cccaatagtt gctcaggcct tcagcaagtg 1261 ggccaaagaa tgtcgaaagg acatggagga cgaaaaactc ttgggtgtcc gagagaggac 1321 cttaacgtgc tgttgcctat gggcatttag aaagcacaag acgcatacgg tgtacaaaag 1381 accggatacc cagtcaatcc aaaaggtccc tgccgaattt gacagctttg tgataccaag 1441 tctgtggtcg tcaggtttat caattccgct gagaaccaga atcaagtggc tcttgagcaa 1501 agctccaaaa tacgagcaac taccgcacag cggaaacgcc gaggaagcag cccaggctga 1561 aacagatgcg gtagaagaac aggaggcaga gctaacccga gaagctatgc caccattgca 1621 ggcgacacag gatgacattc aggtagaaat tgatgtagag caacttgaag accgagcagg 1681 agcgggcata gtcgaaacac caagaggagc aatcaaagtc acagcccaac cgtcagacct 1741 tgttgtcgga gagtacttag tactgacacc gcaggcggtc ctgcgcagcc aaaaactcag 1801 tctgattcac gcgcttgcag agcaggtaaa aacgtgcaca catagtgggc gagcaggcag 1861 gtacgcggtt gaagcatacg atgggcgtgt tctagtgccc tcgggctacg cgatacccca 1921 ggaagacttc cagagcttaa gcgaaagtgc caccatggta tttaacgagc gagagtttgt 1981 gaaccggaag ttacaccaca tcgccatgca cggcccagcg ctgaacactg atgaagagtc 2041 atatgaactg gtaagggtag agaaaacaga acacgagtac gtctatgacg ttgatcagaa 2101 gaaatgttgt aagagggagg aagcaacagg actagtgcta gtaggcgact taactagccc 2161 accataccat gagttcgcct acgaaggact aaaaatccgc ccagcatgtc catacaaaac 2221 ggcagttata ggtgtcttcg gagtaccggg ttctggcaag tcggctataa tcaaaaacct 2281 ggtaaccagg caagacttgg tgactagtgg aaaaaaagaa aactgccaag aaatctccaa 2341 tgacgtaatg cggcaaagga aattggagat atctgcacgt acagtcgact cactactcct 2401 gaatggatgt aacaagccag tggaagtact gtacgtggac gaggcattcg cttgtcattc 2461 gggaaccctg ttggcactga tagccatggt tagaccgcgt cagaaggtcg tactttgtgg 2521 cgacccaaag cagtgcggat tcttcaatat gatgcaaatg aaggtcaact ataatcacaa 2581 catctgcaca caggtgtacc ataaaagcat atcaaggcgg tgtacactgc ctgtaacagc 2641 catcgtgtcc tcgttgcatt acgagagcaa gatgcgcact acaaatgagt acaaccagcc 2701 aatcgtagtg gatactacgg gcataacaaa accagaaccc ggggacttag tgttaacgtg 
2761 tttccgggga tgggttaagc agctgcaaat agactaccgt ggaaacgaag tcatgacagc 2821 agctgcttct caggggctga ccagaaaagg tgtttatgca gtaaggcaga aagtcaacga 2881 aaaccctctg tatgcaccaa catcagaaca cgttaacgtg ctattgacac gcacagaggg 2941 caagttgaca tggaagacac tctcaggcga cccatggata aagatactgc agaacccccc 3001 aaaaggggac tttaaggcaa caatcaagga gtgggaagca gaacacgcct ccatcatggc 3061 aggaatatgc aatcaccaga tggcttttga cacatttcag aacaaagcta atgtatgctg 3121 ggctaaatgc ctggtcccta ttcttgacac tgctggaatc aaattaagtg acaggcagtg 3181 gtctcagata gtgcaagctt ttaaagaaga tagggcctac tctccagaag ttgcactgaa 3241 tgaaatatgc actcgcatat atggggtaga cctggacagc ggactattct caaagccact 3301 gatatccgtc tactatgcag acaaccactg ggacaataga ccaggaggaa aaatgttcgg 3361 gttcaaccct gaggtggcac ttatgcttga aaagaaatat ccctttacaa aaggtaagtg 3421 gaacattaac aagcagatat gtataactac cagaaaggtt gacgaattta accccgaaac 3481 caacataata ccggccaacc gcagactgcc gcactcactc gtggctgaac accactcagt 3541 gagaggggaa agaatggaat ggctggtaaa caaaatcagc ggtcaccaca tgttgttggt 3601 tagcggtcat aatcttatat taccaacaaa aagagtcacc tgggtagcac cgttaggcac 3661 ccgaggtgca gactacacat ataacctgga acttggtcta ccagccacac taggcagata 3721 tgacctggta gttatcaata tccatactcc attccgcata catcattacc agcagtgtgt 3781 agatcacgca atgaagctcc agatgctagg gggggactct ctacggctgt taaagccggg 3841 aggttcactt ctgattagag cttacgggta cgccgaccga accagtgaaa gggtcattag 3901 cgtattggga cgcaagttca gatcgtccag ggctctgaaa cctcagtgca tcacgagcaa 3961 tacagaaatg ttcttcctat ttagccgatt cgacaatgga agaaggaact tcaccacaca 4021 tgttatgaac aaccagctga acgcagtgta tgcaggactg gccactagag cgggctgtgc 4081 cccgtcatac cgagtgaaac ggatggacat cgcaaagaac actgaggaat gcgtggtaaa 4141 cgccgccaat ccgcgcggag taccaggcga tggagtatgt aaagccgtgt atagaaaatg 4201 gccagaatca ttcagaaaca gtgcaacacc agtggggact gcaaagacaa tcatgtgcgg 4261 tcaatacccc gtcatccacg cagtaggccc taacttctca aactattctg aggctgaagg 4321 ggatagggaa ttggcttcag tgtatagaga agtggcgaaa gaagtgtcta ggctaggagt 4381 gagcagtgta gccatccctt tgctctcaac cggtgtgtac tcaggaggca aagacagact 4441 
gctgcaatca ctaaaccatc ttttcgcagc gatggattcg acagatgcag acgttgtcat 4501 ctactgcagg gacaaggaat gggagaagaa gatcactgaa gccatatcac taagatccca 4561 ggtagaatta ctagatgatc acatctcagt ggattgcgac attgtacgcg ttcatccaga 4621 cagcagcttg gcaggccgaa aggggtacag cacagtagag ggagcactct actcgtacct 4681 agagggaaca agattccacc aaactgcagt agatatggca gagatatata ccatgtggcc 4741 gaaacaaact gaagccaacg aacaggtctg cctatatgct ctgggggaga gtatagagtc 4801 cgtcaggcaa aaatgtcccg tagacgacgc cgacgcctca ttccctccga aaacagtccc 4861 gtgcctatgc cgttatgcta tgacgcctga acgagttgca cgtctacgca tgaatcatac 4921 caccagcatc atagtgtgct cgtcttttcc gctgccgaaa tacaaaatcg agggcgtgca 4981 aaaagtaaaa tgttcgaaag cactcttgtt tgatcacaac gtaccgtctc gagtgagccc 5041 gagaacgtac aggcctgcgg acgaaatcat acagacacct caaacaccaa ctgaagcgtg 5101 tcaggacgca caactcgtgc agtcaataaa tgatgaagca gtgccagttc cctcagactt 5161 agaggcttgt gacgcaacta tggactggcc ctctatcggc accgtatcaa caagacaaag 5221 acacgactca tctgacagcg agtatagtgg ctccagaagt aacatacaac tagtgacggc 5281 ggacgtgcat gcaccaatgt acgcacattc gctggcgtcc agcggaggtt caatgctgtc 5341 gctgtccagt gaaccagctc agaacggcac aatgatacta cttgactcag aagacacaga 5401 cagtataagc agagtaagca caccgatcgc cccgcccaga agacgtttgg gaaggaccat 5461 aaatgtgacc tgcgacgagc gggaagggaa aatactccct atggccagcg acaggttctt 5521 cactgctaag ccatacactg tcgcactgag cgtatcaaca gcagatatga ctgtgtatcc 5581 catccaggca ccgctaggat tgataccacc acctaccctc gaaccgatca ctttcggaga 5641 cttcgccgaa ggtgaaatag acaacctcct gacaggggca ttgacatttg gggacttcga 5701 gccaggtgaa gtggaagagc tgacggatag cgagtggtca acgtgctcgg acacagacga 5761 agagttacga ctagacagag cagggggtta catattctcc tctgacactg gtcaaggtca 5821 tctacagcaa aaatcagtac gtcaaacgac gctaccggta aacattgttg aagaggtcca 5881 cgaagagaaa tgctatccac ctaaattgga tgagatcaaa gagcaactac tacttaagag 5941 acttcaggag agtgcttcca cggctaaccg gagtaggtac caatctagaa aagtggaaaa 6001 catgaaagcc acgattatcc acagactgaa agagggttgc agactctatt tggcgtcaga 6061 aacaccgagg gtcccatctt accgagtcac atacccggcg cccatctact cgccttcaat 6121 caatatcaaa 
ctgactaacc cagagactgc agtagcagtg tgtaacgagt ttttggccag 6181 aaactatcca actgtggcat cctaccaagt cactgacgag tacgacgcgt acttggatat 6241 ggtagacggg tccgaaagtt gcctagacag agctacattc aacccgtcta aactcaggag 6301 ttacccaaaa caacactctt accacgcacc caccatcaga agtgcagtgc catcaccatt 6361 ccaaaatacg ttgcagaatg tcttggcagc ggccacaaaa agaaactgca acgtaacgca 6421 gatgagggaa ctacccacta tggactccgc agtgtttaac gtggagtgtt ttaagaagta 6481 cgcttgcaac caagagtact ggagagagtt cgcctcaagc cctataaggg taacgacaga 6541 gaatctgaca atgtatgtga cgaaactaaa ggggcctaaa gcggcggcac tcttcgcaaa 6601 aacacacaac ttgctgccgc tacaagaggt accgatggac aggttcacaa tggacatgaa 6661 acgtgatgtg aaagtgacac caggtacaaa gcacaccgag gaaaggccga aagtacaggt 6721 catacaggcg gcagaaccac tggcaacagc atacctgtgt ggcatacaca gagagttggt 6781 gagaagacta aatgcagttc tgctaccgaa tgtccacaca ctgttcgata tgtcagccga 6841 agacttcgat gcaattatag ccacccattt caaaccgggc gatgctgtac tagaaactga 6901 catagcctca tttgataaga gtcaagacga ctcgcttgcg tcgaccgcca tgatgttgct 6961 agaagacctt ggggtagatc aacctatcct ggatctgata gaagcagcat tcggcgaaat 7021 atccagttgt catctaccga cgggtacgcg gttcaagttc ggcgcaatga tgaaatcagg 7081 catgtttcta accctgtttg tcaataccct cctgaacatc accattgcca gtcgggtgtt 7141 agaggagcga ttgactactt cagcctgtgc agctttcatt ggggacgaca acataataca 7201 tggggttgtc tctgacgcac taatggctgc acgttgtgct acgtggatga acatggaagt 7261 gaaaatcatc gatgcagtag tgtcagagaa ggcgccatac ttctgtgggg gatttatttt 7321 acacgacacg gtgacaggca cgtcgtgcag agtagcagac cctttaaaga gactgttcaa 7381 gctaggcaaa cctctggcag ctggagacga acaggatgag gacagaagac gtgctctggc 7441 agatgaggtt actagatggc aaagaaccgg cttagtcaca gaattggaaa aagcagtata 7501 ttcaaggtat gaagtacaag gaataacagc cgtaataaca tcaatggcta cctttgcgaa 7561 tagcaaagaa aactttaaga aactaagagg gcccgtcgta accttgtacg gcggacctaa 7621 ataggtacgc actacagcta cctacttaga gcatacatcg ctaatagcca tggagttcat 7681 accagcacaa acttactaca atagaagata ccagcctaga ccctggactc aacgccctac 7741 tatccaggtg atcaggccaa aaccacgccg aagaaggcct gcaggacaac tcgcacaact 7801 gatatccgca gtcagcagac 
tagcactgcg tacagttccc cagaaaccac gccggacccg 7861 aaaaattaag aagcaaaagc aagtaaagca agaacaacag agtactacga accagaagaa 7921 aaaggcgccg aaacaaaagc agacccaaaa gaaaaagaga ccaggacgaa gggaaaggat 7981 gtgcatgaag attgaaaatg actgcatctt cgaagtcaga catgaaggaa aagtaacggg 8041 gtatgcatgc ctagtaggtg ataaggtaat gaaaccagca cacgtgaaag gaactattga 8101 caacgcagac ctagcgaagt tggcgttcaa aagatcatcc aaatatgatc tagagtgcgc 8161 acagatacca gtgcacatga aatcggacgc ctcaaagttc acccatgaaa aaccagaagg 8221 ctattacaac tggcatcacg gagcagtaca gtattctgga gggaggttca cgatccctac 8281 aggcgcagga aagcctgggg acagcggaag accaatcttt gacaacaagg ggcgtgtcgt 8341 ggctattgtt ctaggcggag caaacgaagg aaccaggaca gcactatctg tagtgacttg 8401 gaataaagac atagtcacaa aaatcacacc agaggggtca gttgaatgga gccttgccct 8461 ccctgtcatg tgcctgttgg caaatacaac cttcccatgt tcccaaccgc cttgcgcgcc 8521 gtgctgctac gaaaagaaac cggaagaaac cttgagaatg ctggaggaca acgtcatgca 8581 accaggatat taccagttac tcgattcagc attggcctgc tcacaacgtc gtcaaaaacg 8641 taatgcaaga gaaaacttca atgtctacaa agtcactagg ccgtacttag cccactgtcc 8701 tgactgcggg gagggacact catgccacag cccaatagca ttagaacgga tcagaagtga 8761 ggcaacagat ggtaccttga aaatccaggt atctctgcaa atcggaataa agacagacga 8821 cagccacgat tggacgaagc tacggtatat ggatagccat acacctgtgg atgcagaccg 8881 atccgggttg tttgtcagaa cgtcagcacc gtgcaccatc acgggaacga tgggacattt 8941 catactagca cgctgtccga aaggagagac gctgacggta ggatttgtag acagtagaag 9001 gatcagtcac acgtgcatgc acccgttccg ccacgagcca ccgctgatag ggagagagaa 9061 gtttcactcc cgcccgcagc atggcaaaga actaccttgc agtacatacg tccataccac 9121 agcggcaact gctgaggaaa tagaagtgca tatgccgcca gatacccctg actacacgct 9181 gatgacacag caagcgggaa acgttaagat cacagttgac ggccagacgg tacgatacaa 9241 gtgcaaatgt gacggctcca atgaaggatt aataaccgct gacaaagtca taaataactg 9301 caaagtagac caatgccaca cagcggttac aaaccacaag aaatggcaat acaattcacc 9361 gctgaccccg cggaactccg aacaaggaga tagaaaaggt aagatccata tcccatttcc 9421 actggtgaac acaacctgca gggtaccaaa agcaagaaat ccgactgtca catacggtaa 9481 aaacagagtc actctgctgt tacatccaga 
ccacccaaca ctcctttcgt accgcgccat 9541 gggaaggatc ccggattacc atgaagagtg gataacaaac aagaaggaaa taagtatcac 9601 agtaccagca gaaggcttag aggttacgtg gggtaataat gacccataca aatattggcc 9661 ccaactgtct acaaatggta ctgcgcacgg gcacccacat gaaataatcc tctattacta 9721 tgagctgtac ccaactacca caattgctgt actagctgct gcttctatcg taataacatc 9781 tttggtaggt ctatcattag gcatgtgcat atgcgcgaga cgcaggtgca tcacgccata 9841 tgagctgact ccaggagcta ccatcccatt cctcctaggt gtactatgct gtgccaggac 9901 tgcaaaagca gcatcgtact acgaagctgc aacatacctc tggaatgagc aacaaccatt 9961 attttggtta cagcttctaa tccctctgtc agctgcaatt gttgtgtgta attgcctaaa 10021 acttttacca tgctgctgca aaacattgac ttttttagcc gtcatgagca tcggtgcccg 10081 cactgtgacc gcgtacgagc acgcaacagt gatcccgaac acggtgggag taccgtgtaa 10141 gactcttgtt agcagaccag ggtacagccc tatggtctta gaaatggagc tacagtcggt 10201 cactctggaa ccagcattat ccttggatta cattacgtgt gagtataaaa caatcacacc 10261 gtccccgtac gtaaaatgct gtggtacagc tgaatgtaag gccaagaacc tgccagatta 10321 taactgcaaa gtattcacag gcgtctaccc atttatgtgg ggaggagcat actgcttctg 10381 tgacgcagag aacacacagc tcagcgaggc acacgttgag aaatcagaat catgcaaaac 10441 tgagtttgca tcagcctaca gagcccacac agcttcagta tcagctaaac tacgtgtctt 10501 ttaccaaggg aataatatca ccgtgtctgc atacgccaat ggtgatcatg cagttacggt 10561 ggaagacgcg aagtttgtca tcggtccact atcgtccgcc tggtcaccat ttgataataa 10621 gatcgtggtg tacaaaggcg aagtctacaa tatggactat ccacctttcg gcgcagggag 10681 gccaggacag ttcggtgaca tccagagccg cacgccagac agcaaggacg tctatgcgaa 10741 tacgcagtta atactgcaaa gaccagcggc aggagcaata cacgtgcctt actcccaggc 10801 accttcgggc tttaagtact ggctcaagga aaaaggggca tcattgcagc atactgcacc 10861 atttggctgt cagatagcaa caaacccggt aagagcagtg aactgtgcag tgggcaacat 10921 accagtctcc attgacatcc cagatgcagc tttcaccagg gtcactgacg ctccttccat 10981 cacagacatg tcctgcgaag tagcttcgtg tacccattca tctgattttg gaggtgccgc 11041 agtcataaag tacacagcta gtaaaaaagg aaaatgcgcc gtgcactctg taacaaatgc 11101 ggtcactatc cgcgaaccta acgtagatgt caagggaaca gcacaattgc aaattgcctt 11161 ctcgaccgca ctagctagtg 
cggaattcaa ggtgcagatc tgctccacac tggtacactg 11221 ctcagcgacg tgccatcctc ctaaagacca tatagtcaat tacccgtcac ctcacaccac 11281 actaggagtg caggacattt caacgacagc tatgtcttgg gtccagaaga ttacaggagg 11341 agtgggactc gtggttgcta tagctgcttt gatcttaatt atagttctct gcgtatcatt 11401 tagcagacac taaagtacac atgccccaaa gtaatatact gtacaggtat acgtgctccc 11461 tgagcagcac gatatatgta ttatccataa aagaaaaaca aaacaaaaat ataaaaatta 11521 taaaatacaa aagtataaaa acaggtattg gtaccccctt agaggtacat tatttaacca 11581 gtgaaaatag gtattggtgc ccccttagag gcacattata gaaccaggta taggtgcccc 11641 cttagaggta caccaacaat aggtataagt gcccccttag tggcacacta accaccacaa 11701 tcggtaatat gaagagacgt aggtatgaag cttcgtaaaa gctgccgaac ttactttaag 11761 atgtaggcgt accgaactct tcgacaattc tccgacgcag ggacgtagga gaagttattt 11821 tgtttttaat atttc // LOCUS PNGE155A 155 bp ds-DNA INV 28-JUN-1990 DEFINITION P.redivivus E155 gene. ACCESSION M33925 KEYWORDS . SOURCE P.redivivus (strain SC) adult DNA, clone E155. ORGANISM Panagrellus redivivus Eukaryota; Animalia; Metazoa; Nemata; Secernentea; Rhabditia; Rhabditida; Rhabditina; Rhabditoidea; Cephalobidae. REFERENCE 1 (bases 1 to 155) AUTHORS de Chastonay,Y., Muller,F. and Tobler,H. TITLE Two highly reiterated DNA sequences in the low C-value genome of Panagrellus redivivus JOURNAL Unpublished (1990) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by Y.De Chastonay, 26-APR-1990. Institute of Zoology Perolles, Fribourg CH-1700 SWITZERLAND e-mail: stocker%cfruni51.bitnet@cunyvm.cuny.edu BASE COUNT 43 a 33 c 30 g 49 t ORIGIN 1 bp upstream of EcoRI site. 1 gaattccaag gtggcccccg gggccatacg gtacttaaca aatttgcata aatgtgtgat 61 ttttgacaca aaacgagctt attaacatct ttggaattgc ccatgaatct tcatttgaat 121 gtggattata gatcttatcg ccccccaagc ttttg // LOCUS PNGE167A 167 bp ds-DNA INV 28-JUN-1990 DEFINITION P.redivivus E167 gene. ACCESSION M33926 KEYWORDS . SOURCE P.redivivus (strain SC) adult DNA, clone E167. 
ORGANISM Panagrellus redivivus Eukaryota; Animalia; Metazoa; Nemata; Secernentea; Rhabditia; Rhabditida; Rhabditina; Rhabditoidea; Cephalobidae. REFERENCE 1 (bases 1 to 167) AUTHORS de Chastonay,Y., Muller,F. and Tobler,H. TITLE Two highly reiterated DNA sequences in the low C-value genome of Panagrellus redivivus JOURNAL Unpublished (1990) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by Y.De Chastonay, 26-APR-1990. Institute of Zoology Perolles, Fribourg CH-1700 SWITZERLAND e-mail: stocker%cfruni51.bitnet@cunyvm.cuny.edu BASE COUNT 54 a 31 c 27 g 55 t ORIGIN 1 bp upstream of EcoRI site. 1 gaattcccga tgttcagatt tgaactttgt atgaatttga gatggcccat ttgatcattg 61 cggattcttc agaatccgaa tctaatggct cagacccaca gaataagctt taaataggct 121 attctaacta tattgattca actaatctaa agtatgcctt aaaacat // LOCUS SNDHNGP 2600 bp ss-RNA VRL 28-JUN-1990 DEFINITION Sendai virus hemagglutinin-neuraminidase (HN) gene, complete cds. ACCESSION M34033 KEYWORDS glycoprotein; hemagglutinin; neuraminidase. SOURCE Parainfluenza virus type 4A (strain Toshiba) from primary monkey kidney cells, cDNA to viral RNA, clone pG4HF1. ORGANISM Parainfluenza virus type 1 Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses; Paramyxoviridae; Paramyxovirus. REFERENCE 1 (bases 1 to 2600) AUTHORS Bando,H., Kondo,K., Kawano,M., Komada,H., Tsurudome,M., Nishio,M. and Ito,Y. 
TITLE Molecular cloning and sequence analysis of human parainfluenza type 4A virus HN gene: Its irregularities on structure and activities JOURNAL Virology 175, 307-312 (1990) STANDARD simple staff_entry FEATURES from to/span description pept 343 2064 hemagglutinin-neuraminidase (HN) mRNA 53 2600 HN mRNA mRNA < 1 52 fusion protein (F) mRNA signal 39 52 consensus polyA signal signal 2587 2600 consensus polyA signal BASE COUNT 955 a 561 c 410 g 674 t ORIGIN 1 acaaaacgac acccaatcaa acaatctcca catttcaatt ttaagaaaaa aactattata 61 ggggggaaca cacttctcag ccctgattgc tcaaggccct tgcatgtgca accgagacac 121 cccccacaag caccggaata agacctgaca acaaagtagc agccaccacg acccaaaaac 181 aaaattaaaa ggatccggta acagcccatc aaccagcaat catagaatcc aacaatccag 241 agagacgtca catcaactca tccacgaatc ttcgaaggga acatcccaga caaaatcaca 301 gcccattccc tgatcacgga taaactgaga aagatcacaa gaatgcaaga ctcacatggt 361 aatacacaaa tactcaacca ggcaaattca atggtgaaaa gaacatggag attactattt 421 cgaattgcaa ccttaatatt acttgtttca atatttgtgt tatcgctcat aattgtatta 481 cagtcaacac cggggaattt gcaaaacgat atcaatataa ttagaaagga gctcaatgaa 541 cttatggaga attttgaaac tacatctaag tcactgttaa gtgtatcaaa tcaaatcact 601 tacgatgtat cagtacttac tcctataaga caagaagcta ttgaaacaaa catcatttca 661 aaaataaaag atcattgcaa agatagagta attaaagaag gaagcacttg cacattgaat 721 cgcagccctt tgcatgatgt ctctttttta aatgggttca ataaattcta tttcacatat 781 aaagataata tgcaaattaa gtttaaatca ttattagatt accccaattt tattccaact 841 gctacaactc cccacggatg cattcgaatt ccatcattct ccttaggtca aacccattgg 901 tgttataccc ataatataaa cctactagga tgtgcagacc ctgcatctag caatcaatat 961 gtatcactag gaaccttaca agtcttaaaa atgggtgacc cttattttaa agtcgagcat 1021 agtcattatt taaatgacgg gaggaatcga aagagttgtt cagtggttgc tgtccccgac 1081 ggatgcctgc ggaattgtgt gaccatgaca aaaaatgaga cagagaattt caaagacctc 1141 aattggcaac acaattactt acatacatat catataatgg taccattaaa gactcgtata 1201 ataaatccac caggatcatc cagagattgg gttcatatcg caccaggggt aggctcgggc 1261 cttttgtatg ccaaattact tatatttcct ttgtatgggg gtctcacgga 
aaaatcagtg 1321 atacataata atcaatcagg gaaatatttt ttccctaatt caactaaatt gcaatgccgt 1381 aacagcacta tggaaaaaat aaaaggagca aaagattcat acacaataac ttacttctca 1441 gggagactta tacagagtgc atttctggtt tgtgatctaa gacaatttct ttctgaagat 1501 tgtgaaatct taattcctag taatgattac atgatggtcg gtgcagaggg tcgattatat 1561 aacattgaga acaacatatt ttattatcag agaggatcca gctggtggcc ttatccgagc 1621 ctctatagaa tcaggttaaa ccttagtaag aaatatccta gaataactga aattaaattt 1681 acaaaaattg aaatcgcccc aagaccaggc aacaaagatt gtccaggaaa taaggcttgc 1741 ccaaaagaat gtataacggg agtctaccaa gatatattgc cactaagtta tcccaatact 1801 gcatttccac acttaaaaca agcgtattat acaggttttt atcttaataa ctcgctcgag 1861 agacgcaatc caacatttta tactgctgac aatctagatt accatcaaca ggaaagatta 1921 ggtaaattca atcttactgc tggatactct actacaactt gttttaaaca gaccactact 1981 gcgaggttat actgtctcta cataattgaa gtgggtgact cagtcattgg ggactttcag 2041 atcacccttt ttttagcagc ttaatagacc agactgttaa ttaatcaaca aagttattct 2101 gtaatataaa ctgatcttat aagtgaaaag atgcctatcc aaggaggttg atagacaaat 2161 agtaaaagta gcaattgtaa caaaactcta aggaaaaagt aattcgagaa atattataga 2221 ctgacttcag agcaaacaca acatcgatcc ataatagtca atataatcaa taatactcta 2281 tgagacctta cctatcaaca gcaaaaaaca cagtccatca agcggaaccc aactcgctcc 2341 atccttaatc atccactgaa agaaaaaata tacgaaggac catcggccac cgggtccaaa 2401 caatctagca caaaaattca aacaaccgcc aaactctgtt cggcctcaac aaacaatccg 2461 ccaagccatc tgtcattcct ataccaacac acaaccatcc cattcctcaa aagcaattca 2521 atccgcgacc caaagaagac tctccacata tccagctaat ccgtcgatcc gacacatcat 2581 cgtatctttt aagaaaaaaa // LOCUS YSCRP026A 2133 bp ds-DNA PLN 28-JUN-1990 DEFINITION S.cerevisiae RNA polymerase II sixth subunit (RP026) gene, complete cds. ACCESSION M33924 KEYWORDS RNA polymerase II; oncogene; transmembrane protein; tyrosine kinase. SOURCE S.cerevisiae single-copy plasmid DNA, clone 2. ORGANISM Saccharomyces cerevisiae Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes; Endomycetales; Saccharomycetaceae. 
REFERENCE 1 (bases 1 to 2133) AUTHORS Archambault,J., Schappert,K.T. and Friesen,J.D. TITLE A suppressor of an RNA polymerase II mutant of Saccharomyces cerevisiae encodes a subunit common to RNA polymerase I, II, and III JOURNAL Unpublished (1990) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by J.D.Friesen, 01-MAY-1990. The Hospital for Sick Children 555 University Avenue Toronto, Ontario, CANADA M5G 1X8 e-mail: jim@sickkids.toronto.edu FEATURES from to/span description pept 1026 1045 RNA polymerase II sixth subunit (RP026), exon 1 1122 1569 RNA polymerase II sixth subunit (RP026), exon 2 IVS 1046 1121 RP026 intron A BASE COUNT 644 a 374 c 408 g 707 t ORIGIN Chromosome XVI. 1 tctagaaaaa gacatatcag gacatatatg acacttccaa tttttggtta ctagtgagtc 61 gtcatgaata atcatgtgca tttgtaaacc attttccccc acacatggtt tgctacaaat 121 aggacatttt aatttaggat gatcattttt tatatgcgat tgcaattgtg accatatccg 181 aaactctttg caacaaccag caaaagtaca ttggtaagga ttttctacct caggatcgtg 241 atgtttagaa atgtgatttc ttagcctata tggcctctga aagcttttat tgcagtgtgg 301 acaggttagt ttatgtaaat gaacagataa aatatgtgcc cttaattgtg gatgcttgta 361 gaatcggagg ttgcatcctt cctctggaca aatgaaagat ttggtatgcg ttacttcgtg 421 tcgcttcagt tgctggcgag tcgtcactcc ttttccacaa taagaacatt ggaatggttt 481 cgtatcagaa tgcgtataca agtgtctctc taagtgactc tttttaacga aggattttgc 541 acactatcac actgaaatgc tcttaaaccc tgatgtacgc ttaattggtg ttcagtcaaa 601 attgaaggtc ttgtaaatgc ctatcacagc atcatagtca cagaaatatg tctttggcct 661 attgctcgat gagcttctag ttgatgttaa actgttcaat gattctgaac tctctgatcg 721 tgatatggga atggtctcct gtttaagttc ggccagtggc attccttcat tatttagaac 781 ctctcctccc atttgacagc gatttcaaca gttactactg tgcttaatgc ccttatttga 841 tagttagttc ttcttataat aaataatcat tgtatattga ttattcgtat agtatgatat 901 ttaattgaaa aaaaaatttt ttttttttta agaatatcat tcaaaggatc aatcacaacc 961 ttgaagaaag gctaaaaaga cacattttgc aggtaacagt gtaaagatta aggctacaag 1021 tgatcatgtc agactacgag gaggcgtatg tgcattcata ttggagaaat tgcgtgcttt 
1081 agttttaatt atatttacta acttctatca ttgtgataca ggtttaacga cggtaatgag 1141 aacttcgaag attttgatgt agagcatttt tctgatgagg agacttatga ggaaaaacct 1201 caattcaagg atggtgaaac aaccgatgcc aacggtaaga ccatcgttac tggtggtaat 1261 ggcccagaag attttcaaca gcatgagcaa ataagaagaa agacacttaa ggaaaaggcc 1321 atcccaaaag accaaagagc aactactcca tacatgacca aatatgaaag ggcaagaatt 1381 ttaggtacaa gagccctaca aatttccatg aatgcgccag tttttgtaga tttagaaggt 1441 gaaactgatc cattgcgtat tgccatgaag gagttggctg aaaagaaaat tcctttggtt 1501 attagaagat atttaccaga tggttccttt gaggactgga gtgtggagga actcattgtg 1561 gatttgtgat tacacctgca ttttccttta tgtgtatttg ctttgtttgt atattattat 1621 aactataaag aaccgattta ttttgtaatg caaataactg gacgagagaa ccacgctata 1681 acctatttgg taataaaaat gtttgtatga ataaaaaagc tgctgactgc tttctttgat 1741 atgtgtatat agaatattta tcggtgattg atctaagaaa agcttacctc tgaaagttct 1801 ctcttcgtcg cttgctgatt tgtagtaaac aatttgaaaa gtttagcaaa ttcttcctca 1861 ggattttcaa aaccggcttc ctttaaagat caatgacctc attcaaggga acgtttaaat 1921 cgtgtcctct tccaatggct tttaaacttt cttccaaatt cttcacgctc ggggaaactg 1981 cgataagttc ttccccatga tagataagaa tatgggaaag cttacacctt cttctccaac 2041 ttccgcggta ctggtgtcat tgtcaggtac cattttggac cattcctcat ctgtcaatgt 2101 cttccccaat gttgcatata tcttggtaag atc // LOCUS MUS25OAS 1412 bp ss-mRNA ROD 28-JUN-1990 DEFINITION Mouse 2'-5' oligo A synthetase mRNA, complete cds. ACCESSION M33863 KEYWORDS 2'-5' oligo A synthetase. SOURCE Mouse cell line L-929, cDNA to mRNA, clone L3. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 1412) AUTHORS Coccia,E.M., Nissim,A. and Chebat,J. TITLE A full-length murine 2-5A synthetase cDNA transfected in NIH-3T3 cells impairs EMCV but not VSV replication JOURNAL Virology (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable [or printed] sequence for [1] kindly submitted by E.M.Coccia, 26-APR-1990. 
FEATURES from to/span description pept 36 1139 2'-5' oligo A synthetase signal 1389 1394 poly-A signal BASE COUNT 340 a 371 c 390 g 311 t ORIGIN 1 ccaggctggg agacccagga agctccagac ttagcatgga gcacggactc aggagcatcc 61 cagcctggac gctggacaag ttcatagagg attacctcct tcccgacacc acctttggtg 121 ctgatgtcaa atcagccgtc aatgtcgtgt gtgatttcct gaaggagaga tgcttccaag 181 gtgctgccca cccagtgagg gtctccaagg tggtgaaggg tggctcctca ggcaaaggca 241 ccacactcaa gggcaggtca gacgctgacc tggtggtgtt ccttaacaat ctcaccagct 301 ttgaggatca gttaaaccga cggggagagt tcatcaagga aattaagaaa cagctgtacg 361 aggttcagca tgagagacgt tttagagtca agtttgaggt ccagagttca tggtggccca 421 acgcccggtc tctgagcttc aagctgagcg ccccccatct gcatcaggag gtggagtttg 481 atgtgctgcc agcctttgat gtcctgggtc atgttaatac ttccagcaag cctgatccca 541 gaatctatgc catcctcatc gaggaatgta cctccctggg gaaggatggc gagttctcta 601 cctgcttcac ggagctccag cggaacttcc tgaagcagcg cccaaccaag ctgaagagtc 661 tcatccgcct ggtcaagcac tggtaccaac tgtgtaagga gaagctgggg aagccattgc 721 ctccacagta cgccctagag ttgctcactg tctttgcctg ggaacaaggg aatggatgtt 781 atgagttcaa cacagcccag ggcttccgga ccgtcttgga actggtcatc aattatcagc 841 atcttcgaat ctactggaca aagtattatg actttcaaca ccaggaggtc tccaaatacc 901 tgcacagaca gctcagaaaa gccaggcctg tgatcctgga cccagctgac ccaacaggga 961 atgtggccgg tgggaaccca gagggctgga ggcggttggc tgaagaggct gatgtgtggc 1021 tatggtaccc atgttttatt aaaaaggatg gttcccgagt gagctcctgg gatgtgccga 1081 cggtggttcc tgtacctttt gagcaggtag aagagaactg gacatgtatc ctgctgtgag 1141 cacagcagca cctgcccagg agactgctgg tcaggggcat ttgctgctct gctgcaggcc 1201 catgacccag tgagggaggg ccccacctgg catcagactc cgtgcttctg atgcctgcca 1261 gccatgtttg actcctgtcc aatcacagcc agccttcctc aacagattca gaaggagagg 1321 aaagaacaca cgcttggtgt ccatctgtcc acctgttgga aggttctgtc tgacaaagtc 1381 tgatcaacaa taaaccacag caggtgccgt ca // LOCUS PSEHUTR 1604 bp ds-DNA BCT 28-JUN-1990 DEFINITION P.putida histidine utilization genes repressor protein (hut) gene, complete cds. 
ACCESSION M33922 KEYWORDS histidine utilization genes repressor protein. SOURCE P.putida (ATCC 12633) DNA. ORGANISM Pseudomonas putida Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Pseudomonadaceae. REFERENCE 1 (bases 1 to 1604) AUTHORS Allison,S.L. and Phillips,A.T. TITLE Nucleotide sequence of the gene encoding the repressor for the histidine utilization genes of Pseudomonas putida JOURNAL J. Bacteriol. (1990) In press STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by A.T.Phillips, 26-APR-1990. FEATURES from to/span description pept 167 913 histidine utilization genes repressor protein (hut) pept 910 1482 protein of unknown function binding 46 65 hut binding site (put.) binding 894 898 ribosomal binding site (put.) site 82 95 operator site signal 94 122 promoter (put.) signal 1516 1543 transcription termination signal BASE COUNT 295 a 484 c 513 g 312 t ORIGIN 1 ggacatggct ggcccagccc gtaggcaaca gagcgcgttc ggcgaagtag gcggacatcg 61 gtcaaatcct gttattgtta acttgtatat acatatacag gcgtttgcct gccgggtaaa 121 ctgcggcaag ctaccgttca ttccctatgc acaaggatcc aacgccgtgc cgacacctcc 181 tgtctccgcg ctggttgccc agatgggcga gggcccggcg ccgctgtatg cccgggtcaa 241 acagatgatc atccagcaga tcgacaacgg cagctggccg ccgcatcacc gggtcccctc 301 ggagagtgaa ctggtcaacg agctaggctt cagccgcatg accatcaacc gtgccctgcg 361 cgaactcacg gccgacggcc tgctggtgcg catgcagggg gtcggcacgt tcgtagccga 421 gccaaagggc cgttcggcgt tgttcgaagt caacaacatt gccgatgaaa ttgccgcgcg 481 cggccatcag catagctgcc aggtgatcac gctcaccgag gaagcagccg gttccgaacg 541 ggccctggcc ctggacatgc gtgaaggcca gcgggtgttc cactcgctga tcgtgcattt 601 cgagaacggc gtgccggtgc agatcgagga ccgctacgtc aacgccgcga tcgcacccga 661 ctacctcaag caggatttca cccggcagac gccatatgcc tacctgtccc aggtagcgcc 721 gctgaccgag ggtgagcacg tggtcgaagc catcctggcc gagccggaag aatgccgcct 781 gctgcagatc gagcggggcg aaccttgcct gctgatccgc cgtcgtactt ggtccggccg 841 ccagccggta accgcggcgc ggctgatcca ccccggttcc cgtcatcgcc tggaaggacg 901 tttcagcaaa 
tgagccagct gcagttgttg cgcgcacagg attacccgcg catgccgtgg 961 aagaacggtg gcggtttcac cgaagagatc acccgcgaca gtggagaggg cctggacggc 1021 tttggctggc gcctgtcgat tgccgatatc gaagagtctg gcggcttttc caccttcgcc 1081 ggttaccagc ggatcatcac cgtgctgcag ggcgatggca tgcgcctgtt ggtcgatggc 1141 cagcccagcc ggccgttgct gccgttcgat gcctttgcct tcagcggcga aagccaggtc 1201 agctgcaagc tgctgggtgg ggcgatccgc gatttcaacc tgatctatgc accgcaacgg 1261 taccgggcga ggttgcagtg gtttgatggc acgagccgtt tgtacagctc ggcgtcgaca 1321 gtgctgttgt ttgctgccag cagtcacgtg gaagtgtcca tggcggggcg tgaggtgcag 1381 cggttggggt tgtatgactg cctgcggctg gagggcaacg atgagttgct tgggctggaa 1441 gttcaggggc ggttttgctt gattgagctc atttctcgct gatgggcttg gcgatacatt 1501 ttcatcgcct gtgagatcga gcgccgcgcg ggcggcgctc gatttgcgcg ccgccgcaaa 1561 actcaagccg gaccgacgct cgcttcaccc ccccaaaaaa aatc // LOCUS MUSCR2 427 bp ss-mRNA ROD 28-JUN-1990 DEFINITION M.musculus complement receptor type 2 (CR2) mRNA, 3' end. ACCESSION M35685 J04153 KEYWORDS complement receptor type 2. SOURCE M.musculus (strains BALB/c, DBA/2, CBA/J and C57B10.D2/OSN C5-deficient) and Swiss nude (strain nu/nu), cDNA to mRNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 427) AUTHORS Fingeroth,J.D., Benedict,M.A., Levy,D.N. and Strominger,J.L. TITLE Identification of murine complement receptor type 2 JOURNAL Proc. Natl. Acad. Sci. U.S.A. 86, 242-246 (1989) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by J.D.Fingeroth, 13-JAN-1989. FEATURES from to/span description pept < 1 285 complement receptor type 2 BASE COUNT 148 a 95 c 76 g 108 t ORIGIN Chromosome 1. 
1 tgtttacaac cgaagatcaa aagagggcaa atattatcta ttttgaaaga tagttattca 61 tataatgaca ctgtggcatt ttcttgtgaa cctggcttca ccttgaaggg caacaggagc 121 attcgatgca atgctcatgg cacatcggag ccaccggtac cagtgtgtga aaaaggaaaa 181 cacagagaaa gcaattatta tacaaagaca agacccaaag aaggagctct tcatttagaa 241 acacgagaag tatattctat tgatccatat aacccagcaa gctgatgaca tgacaaatca 301 agatgtagaa ctctcagcta cctcttcagc accatatctg cttacatgcc accaagctac 361 cctccacgac aataatggac taaacctctg atttgtaagc cagccccaat taaatgtttt 421 tctctat // LOCUS RABCYP2C16 2006 bp ss-mRNA MAM 28-JUN-1990 DEFINITION Rabbit cytochrome P450IIC16 (CYP2C16) mRNA, complete cds. ACCESSION M29968 KEYWORDS cytochrome P450; monooxygenase. SOURCE Rabbit (strain New Zealand White) adult liver, cDNA to mRNA. ORGANISM Oryctolagus sp. Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Lagomorpha; Leporidae. REFERENCE 1 (bases 1 to 2006) AUTHORS Hassett,C. and Omiecinski,C.J. JOURNAL Unpublished (1989) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by C.Hassett, 20-NOV-1989. 
Author Address [1]: C.Hassett University of Washington Department of Environmental Health SC-34 Seattle, WA 98195 FEATURES from to/span description pept 66 1529 cytochrome P450IIC16 (CYP2C16) mRNA < 1 2006 CYP2C16 mRNA signal 1985 1989 poly-A signal BASE COUNT 524 a 474 c 472 g 536 t ORIGIN 1 cggcatcggt accaaataag atagacagtg ctactcagaa atccaagaaa atggtggaag 61 aaataatgga tccagttgtg gtcctggtgt tgggtctctg ctgtttgctt ctcctttcac 121 actggaagca aaattccggg agggggaagc tccctcccgg ccccactcct ttccccatta 181 ttggaaatat tctccagata gatgctaagg acatcagcaa atccctaact aagttctcag 241 aacgctatgg ccccgtgttc actgtgtatc tgggcatgaa gcccgctgta gtgctgcatg 301 gataccaggc agtgaaggag gccctggttg atcttggaga ggagtttgct ggaagaggca 361 gttttcctat gcttgataaa gttagtaagg gactcggaat cgttttcacc aatggaaaga 421 gatggaaaga gatccggcgc ttctcgctca tgaccctgcg gaatttcggg atggggaaga 481 ggagcattga ggaccgagtt caagaggagg cccgctgcct ggtggaggag ctgagaaaaa 541 ccaacgcctc accctgtgat cccaccttta tcctgggctg tgctccctgc aatgtgatct 601 gctccattat tttccataat cgctttgatt ataaagatga ggagtttctt aaactattgg 661 aaaaattcaa tgaaaatgtt aggattctga gttctccatg gttgcaggtc tgcaataatt 721 tccctgctct tattgattac ttaccaggaa gtcataagac cttactaaag aattctgatt 781 atgtgaaaaa ttttattatg gagaaagtga aggaacacca aaaattcctg gatgttaaca 841 atcctcggga ctttatagat tgtttcttga tcaaaatgga gcaggaaaac catttggagt 901 tcactcttga aagcttggta accactgtgt ttgatttgtt tggagctggg actgagacaa 961 cgagcacaac gctgagatac tccctcctgc tcctgctgaa gcaccccgag gtcgcagata 1021 aagtgcagga ggagattgag cgtgtgattg gcaggcaccg gagcccctgc atgcaggaca 1081 ggagccgcat gccttacaca gatgccgtaa tacatgagat ccagagattc attgacctgg 1141 tccccaataa tctgccccac acagtgaccc gtgacattaa attcagaaac tactttatcc 1201 ccaagggtac ggacatcatg acatcactga catccgtgct acatgatgaa aaagcatttc 1261 ctaacccaaa ggtatttgac cctggacact ttctggatga gagtggcaac ttcaagaaga 1321 gtgactactt catgcctttc tcagcaggaa aacggatctg tgtgggagag gccctggccc 1381 gcatggagct gtttttgttc ctgacctcca ttttgcagaa ctttaaactg caatctctgg 1441 ttgagccaaa 
ggacctggac atcactgcag ttctcaatgg atttgtttct gtgccacctt 1501 cgttccagct ctgcttcatt cctgtttgaa aaggagcaga ctggcttcta ctgtgccatc 1561 atttcaaagg cattgcccat caccttactg catttgagac acttctttaa cttttctcac 1621 atcttactat tcccttaaga tctagtgaaa acctaacttc tgtgggtgat cccctgagac 1681 tgcctgccct gaccatgcaa gaggtagaga gggcatggca agccatgctc ctgggaggga 1741 ccccacagcc tggctgctgg caggtggcgg gacccaggca catttctctc cattcctgcc 1801 tgtcaggtaa actgctccta gctgtgtcca aagcccatca agaaagctac cgtaggctat 1861 gtgaccttca agatgattgt aggagcatat cagtaccaat attgcctcta tcctatagaa 1921 ttagtactgc cctgaattag ttacaccctt tctgcctgcc ctttagaaag tgtgcatgct 1981 cattaataaa gtggatgcat tcactg //