Path: utzoo!attcan!uunet!tut.cis.ohio-state.edu!cica!iuvax!bionet!daemon From: GenBank-Updates@genbank.bio.net Newsgroups: bionet.molbio.genbank.updates Subject: Database Update Message-ID: <9004121714.AA25930@life.lanl.gov.LANL.GOV> Date: 12 Apr 90 17:14:11 GMT Sender: daemon@genbank.BIO.NET Distribution: bionet Lines: 1400 Approved: lear@genbank.bio.net Checksum: 17050 90 LOCUS CULESTB1 3105 bp ds-DNA INV 27-FEB-1990 DEFINITION C.pipiens esterase B1 gene, complete cds. ACCESSION M32328 KEYWORDS esterase. SOURCE Mosquito (C.pipiens quinquefasciatus, strain Tem-R) 4th instar larva DNA. ORGANISM Culex pipiens Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta; Pterygota; Neoptera; Holometabola; Diptera; Nematocera; Culicoidea; Culicidae. REFERENCE 1 (bases 1 to 3105) AUTHORS Mouches,C., Pauplin,Y., Agarwal,M., Lemieux,L., Herzog,M., Abadon,M., Beyssat-Arnaouty,V., Hyrien,O., Robert de Saint Vincent,B., Georghiou,G.P. and Pasteur,N. TITLE Characterization of amplification core and esterase B1 gene responsible for insecticide resistnace in Culex JOURNAL Proc. Natl. Acad. Sci. U.S.A. (1990) In press STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence [1] kindly submitted by N.Pasteur, 23-FEB-1990. FEATURES from to/span description pept 406 546 esterase B1, exon 1 1322 1459 esterase B1, exon 2 1512 2393 esterase B1, exon 3 2455 2916 esterase B1, exon 4 pre-msg 268 3016 esterase B1 mRNA and introns IVS 547 1321 esterase B1 intron A IVS 1460 1511 esterase B1 intron B IVS 2394 2454 esterase B1 intron c rpt 155 171 direct repeat copy 1 rpt 176 192 direct repeat copy 2 signal 246 250 pot. TATA box signal 2998 3003 poly-A signal BASE COUNT 904 a 671 c 700 g 830 t ORIGIN Chromosome 2. 1 tacataccgc aataatcaaa atataacaaa aataatgttt tttttcctct caaatatgtt 61 gaacaatccc attttacccc gcatctgcat tcattttaag agagaattgt ctgtgtatag 121 gtagagtagt ctcgcaggta gggtacaaac acgagcgatg aacacatttg catttgcgat 181 gaacacattt gcttgatgcg tgttttaaaa cttagcagtg caaagcagaa taccaccaac 241 atcgcaattt tcagctccac aaatcatcag tacagagtgg gcagccgcac cgagctgttg 301 gtgcaagtca attcagctga gcaaaccgaa aaaaaaaaac ttcgaagagt cacacccagc 361 tgatagcgaa aatttaagca acaaaaaaac tccaatctac gtaggatgag tttggaaagc 421 ttaaccgttc agaccaaata cggcccggtc cggggcaaac ggaacgtatc gttgctggga 481 caggagtacg tcagctttca gggaattccg tacgcccggg caccggaagg ggagctgcgg 541 tttaaggtga gagtggtaaa ttgtttcaag tgctgttcaa attttatgga tgtgcaagtg 601 catttttgtt caaataaaga gcaacgagtg ctgctgatta gcgctgtatc taagagtgtg 661 acctccgcgg cttgatactt caccgcaatg acatgacata tttgttgata aaaataaaat 721 gtaataaata ttttgtactt ggacacgtac aaaccagtca tggtctaggc tggaaataaa 781 ttgaagagtg acaaagtcaa acataaattt ggcagtgact tccattttga taaaagataa 841 gaccataaac ttaatccaaa attgtttgtt ggacaattga taatgaagtc tgtgcaagta 901 gaagttattg aaacattaga ctgtttaaaa tttaatttaa tctatgattt aatctttatt 961 ttgttcaata atctaaattt ttgtaaaaca aaaaaatctt cagaaaacct atcaactgca 1021 tgctataaca ttttcaaatg acttcgaaat attaccgaaa taacattaaa tttgtcaatc 1081 aaatttagta gttttgacat tttgaaaatg ctaaaaaaat gatcaaactt gcaaaaccat 1141 tttttttttt cataaatcct cagcaatttt gcgaacaaat tattaataaa aagaaacgtt 1201 caaatatcac acctcactgt gtaaaatatt ttaaccagca ctgtatttcc cggtgacttg 1261 aacgcaaaac aagcaacgaa caaaactgga aatttaaaaa cataacctta aaacatttca 1321 ggcaccagtt ccaccgcaaa agtggaccga aacgttggac tgcacgcagc aatgcgagcc 1381 ctgctatcac ttcgaccggc gcctccagaa gatcgtcggc tgcgaggaca gtctgaagat 1441 caacgtgttt gcgaaggagg tgagttggtt tcaaagaatc aatttcaact ctgaattcac 1501 gatttctcca gatcaaccct tcaacccctc ttccggtgat gctgtacatc tacggcgggg 1561 gcttcacgga aggaaccagc ggaaccgaac tgtacgggcc ggatttcctg gttcagaagg 1621 atatcgtgtt ggtgtcgttc aattaccgta ttggggcgtt aggttttctg tgttgtcaat 1681 cggagcagga tggcgtaccc ggtaatgccg gactcaaaga tcagaacttg gccattcgtt 1741 gggttctgga gaacattgcc gcctttggag gagacccgaa gcgcgtgacc ctggccggcc 1801 atagcgcagg tgccgcttcg gttcagtatc atctgatttc ggatgcgtcc aaggacttgt 1861 ttcagcggcg tatcgtaatg tctgggagta cgtattccag ttggtctttg accaggcaac 1921 gcaactgggt tgagaagttg gcgaaggcca tcggttggga tggacagggt ggtgagtccg 1981 gagcgttgag attcttgaga cgtgccaaac cggaggacat tgttgctcac caggagaagc 2041 ttctgacgga ccaggacatg caggatgata tctttactcc gtttggacct accgttgaac 2101 cgtacctgac ggaacagtgc ataataccga aggcaccgtt cgagatggct cgaacagctt 2161 ggggtgacaa gattgatatc atgatcggtg gtacttctga agaaggactg ctactgctgc 2221 aaaagatcaa gttgcatccg gaactactgt cccatcctca tctattcctg ggaaatgttc 2281 ctccaaattt gaagatcagc atggaaaaac gaatcgagtt tgctgccaag ctgaaacaac 2341 gttactaccc cgacagcatt ccttcaatgg agaacaacct gggatacgtt catgtaagtc 2401 caaacctaac ctcaatcacc acatcaactt atcaaccctc cacaaaaatt ccagatgatg 2461 tccgaccggg tcttctggca cggcctgcac cgcaccatcc ttgcccgcgc cgctcgatcg 2521 cgcgcccgca ccttcgtgta ccggatctgt ctggattcgg agttttacaa ccactaccgc 2581 atcatgatga tcgacccgaa gctgcgcggc acggcccatg ccgacgagct gtcctatctg 2641 ttttccaact ttacccagca ggtccccggc aaggaaacgt tcgagtaccg cggtctgcaa 2701 acgctggtcg atgtgttcag cgcgttcgtc atcaacgggg atccaaactg tggcatgacg 2761 gcgaagggtg gtgtggtctt tgagccgaac gcgcagacga agcccacgtt caagtgtctg 2821 aacattgcca acgacggggt ggcgttcgtt gactatccgg atgcggaccg gttggacatg 2881 tgggacgcaa tgtacgtgaa tgatgagctg ttttgaggag aaagttttta aaatatccta 2941 ttgagatttt gcaatgcctt atttaatctg ttgttttatt ttaattatgt attgttgaat 3001 aaatttatac tatataacaa aagttttatt cttggagtca ttccgctaac ttttttgttt 3061 gtgttcaaca tattgaagat ctgacaaccc tatcaaaggt tatag // LOCUS DROFASI 15288 bp ds-DNA INV 26-FEB-1990 DEFINITION D.melanogaster fasciclin I (FasI) gene, complete cds. ACCESSION M32311 SOURCE D.melanogaster DNA. ORGANISM Drosophila melanogaster Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta; Pterygota; Neoptera; Holometabola; Diptera; Brachycera; Cyclorrhapha; Schizophora; Drosophiloidea; Drosophilidae. REFERENCE 1 (bases 1 to 15288) AUTHORS McAllistar,L., Zinn,K., Rehm,J. and Goodman,?. TITLE no title given JOURNAL Unpublished (1990)Howard Hughes Med. Inst. Berkeley, CA 94720 STANDARD full staff_review COMMENT Draft entry and computer-readable sequence [1] kindly submitted by L.B. McAllister, 22-FEB-1990 FEATURES from to/span description pept 1593 1697 fasciclin I, exon 2 (first expressed exon) 7090 7222 fasciclin I, exon 3 7304 7470 fasciclin I, exon 4 7545 7775 fasciclin I, exon 4 7843 8011 fasciclin I, exon 5 8089 8249 fasciclin I, exon 6 9814 9822 fasciclin I, exon 7 10989 11143 fasciclin I, exon 8 11210 11300 fasciclin I, exon 9 11364 11584 fasciclin I, exon 10 11644 11806 fasciclin I, exon 11 11871 12018 fasciclin I, exon 12 12428 12633 fasciclin I, exon 13 IVS 824 1449 FasI intron A IVS 1698 7089 FasI intron B IVS 7223 7303 FasI intron C IVS 7471 7544 FasI intron D IVS 7776 7842 FasI intron E IVS 8012 8088 FasI intron F IVS 8250 9813 FasI intron G IVS 9823 10988 FasI intron H IVS 11144 11209 FasI intron I IVS 11301 11363 FasI intron J IVS 11585 11643 FasI intron K IVS 11807 11870 FasI intron L IVS 12019 12427 FasI intron M ORF 1 164 ORF (AA at 3) BASE COUNT 4691 a 3381 c 2990 g 4225 t 1 others ORIGIN Chromosome 3, map position 89e-f. 1 ctcgaggaga ccccgaaggt ccagctggag accccgatgc tggcgttgca gcctctgcag 61 ctcggcagtt gccaactgat ccagctcgct ttggtcagtc atcgttattg gctttctgaa 121 ttcggtcttt gcaatggtat tgtcaaaact gataatgata ttaatttgat ttcaatagca 181 aacaagcgtt gctggtggca agttgcttag taacggcgtg gtttatcgaa gacttttacc 241 atttacttaa agtaagaaga accaaatagg tacactgctt gttaaaattc aatgttatct 301 ccttagaaaa gtaaaaaata agtgcttgaa aattataacc ggttcaattt ggaataaaat 361 ctttcaaaca gtagagactc tgcttagcgg agttatccac cgcattctgt cgttgccaca 421 ttttgcgaaa cggttaaaat gattatttat gcttttaaac gaaaaatgag aaaatttaaa 481 gcagtcttct tgtaataata ataagattgg ataaacagac attattataa atctgacaca 541 gaagagcaac gcagcgctcg atttgtacta atattttacc tctaacaaga ttcggaaaag 601 cataaaaagc acataaataa gagtcaccgc caaagacatg cacaaataaa tataaatgtt 661 aagcgattta ttaacagtgc acagtgttgc gatttgaaac atcgacagct atcgataaca 721 ctcgttatcg tcaccacacg ctcggcttaa acctgttctc cgcttttcgt tttaattcga 781 aaaaaacgtt cgagcgtggc gcacgttgcg attcctccga ctggtacttg ctccccttat 841 attgaaactc tgtggaaaat ctatactgtg catagggcat aactctaaaa tagctgcacg 901 gcccatttcg ctgcccttgc tattgattta ccaccgcgct gcgttcgttt gtgtgtgcta 961 cttaaagaat ttgtgtgttt tccaagtttg gtccgtggac attgcataat ttctgcggcg 1021 gccgttaatg ttaattcctg cagcccgagt ttccgagaat tacgcagaat aaagaccaga 1081 gagaaaacta taaaatcgaa aacagaaaaa agtgccgcag cagcgaaatg caaaggcgca 1141 taataattaa acacacagcg acggaatgaa gaaaaaataa tacacaataa gcgcagcttt 1201 gtttctagtt aaattgcgtt tgtgttgtgc ttgccgattc ctcggtggtc cgtttttcgc 1261 ttgttgttct atgtgacata acggaactct gggcaaaagc gaacaggaag cagcgataac 1321 cttgcaaaaa caaagaaaat accaaggagg acaaaaagca tgccaagcat atatctgtga 1381 aataatattt tcttttccga ggaaatgctg tttgtcgtcg gctaactgtt gtttgccttt 1441 gaattgcaga tcttaatcgt agagcagcac tcacaccagc acacgccccc cgcaaaacag 1501 cacacacagc cacactcaca aacgacgtac actgccagcg cataaacaag tgctgttgtg 1561 gttgttgttg ctgctgctgc tcgtccgcta aaatgctgaa cgctgcagcg ctgcttttgg 1621 cgctgctctg cgccgcgaac gcagccgccg ccgccgattt ggcggacaaa ttgcgcgatg 1681 attcggaact ctctcaggtg agtcagaaga agaagaagaa gcgaataacg agaacgtgga 1741 ggatgaggag gtgaaaggag gtggtagagt aagagcacga agaaaagcag tagcagcaga 1801 aacagttggt gcaaataacc gaaacacttg tccgtagtca attaagcccc cagcaccacc 1861 caccaaacgt cgccccccct ccccccgctc acccttgggg tacagtaggg attgcctgcc 1921 atgaacatta gtaatccttt tctatctttg caaccgatta aattaatgcg aagagttaca 1981 cttcgaatat ttgtcaaaag tatgtttgtt taggtatccg tacgcgcaac atgtaaaatt 2041 tgaacacatt caggggcttt tattagtcta atatatcatt tcgttggaaa agtatttctt 2101 aactgcctac attttataca gtaatcagtc gcattttttc tgtagccagg gaaactgcca 2161 aaattaaaat aacttttgtg tgagcaaaaa acaaactatt tttagcagac atttatgcgt 2221 ttaaataatt ttttcagcct tttgaatatg tttcatatat ttctgtgatg ccttaatttc 2281 ttctcatggt cggatacctt attgcaatga taaaatcaat tttttattat gcctgacttg 2341 gctttggcga agtcaataaa tttaccttgc cacctaatag tggcaactta tcaaagttcg 2401 gctgtaattg tgccaccacc cacaagacgc cgccccaatt cgttggcctt gaaaggaaaa 2461 gccttggtgg agtgtgcgag gaggcggggc agatcgggcg ggaaaaagga gcctgcgcaa 2521 gagggacgca cccgcatttg ggcacaggta gcgtagcctt cgttccggca gcatcgtcct 2581 tgacctttag ccatctcact tgttgttgga ctgcggaaaa tcgatgatgc tgcctaaaat 2641 ccctggcggc agttcaggtc cagccgaccg ctaagctcca cttggccaca tttcgtatgc 2701 atggcggagt tcgttatgta aaaatgaaaa gccacataaa ttaagcccag gtctgctgaa 2761 ttggctctcc ccagcaaaag gcaaatggta gaaaaaacgc cggccgttga cgtccactgc 2821 gcataacagt tttatattta gttttgcctt tttacacctt cacaggccaa ctcacataca 2881 aagtacatag aaaacaaaaa cacattaatg tgtaaaaaac tcagaagcca acaaaggtgc 2941 gcggctgaaa taacaagaac aacatgtgta atgaattcaa aggcgggggt tcagtcacca 3001 gaaggggtat tagcattaca ccaggagaaa tatatatata tttttttaat atgttaaggt 3061 ttatggtagc ccattaagcc acataactta tactcaaaaa atagaattga tttgagtgga 3121 aaatagtaac atagttcgtc tattgtaatt gcttttatct tatatagtag gtgaagtagt 3181 agttgaatac gcgtttttaa gtaactgact tgagtccctg cttaattttc tagtctgcct 3241 taacactgca tttaacaccc gaaatacacg ctgcttcaaa agtaaattag acttccaagt 3301 aaaaagaaag cgtgtgacag actgtaattg ttgcagcaag tgtctcaggg aagtagttgc 3361 atccacacag aaaaggcaac cttttcgagt tacaataggc gaaacagagc gtacaactaa 3421 aaaaaattaa aaacaagaag ggggactcgg cttgctcact tgcacccttg ctttagggca 3481 ttgtaacttt gcccagaagt ttgccacttt cggagcaaag catttccggc ctcatatctg 3541 gagtactgct atacatgagg gtgggaccat ttggtcgcca taaaaaagta agggcggaaa 3601 gtaatcttct ggcatttatg caaattaagt gccgtaaaac taatggatca aaaagtcagg 3661 cttgggaaat cccttttaaa gttacttctt aagactatga aataatcatt ttattaataa 3721 tatgtcgtat tgcaggtagc tgctttcaat ccaatcccct ttttaaattt tgcgatttat 3781 atattttctc agttgtattt aaaatgtaga caagactggt tcaaaagggt atctccatct 3841 tctaaagtcg aagtcgggct tctttactct ggtgcctgca cttctcatta catgctgatg 3901 gcttagataa atgtaggttg cctgttttgg tttgcggctt acggcttacg gcttgtggtt 3961 ttgcatggca aaccaccgag tttaggagag agtttcgaaa ttagcagata attgagccac 4021 gccgatctga gttttatatt caacttcttg acagccgaaa tgaaagtaca ctccgaggaa 4081 cgagctgacc tcctgcgttt gacccccact tcctgtcgtg cgtcagccaa cgtcttgcgc 4141 gttttattaa ttttaatttt taacctttga ctgatgcaat tgcagttggc cgagtggggg 4201 tgtggacagc ttgggttgac aactgcgggg ctggctgaac tctgtctagg aacactcact 4261 cgtatggctc acggcacaga acagtccaca ctcgcagcat catcgaattc aattaaattt 4321 aatttaagat ttcaatgcga tgccaaggcg cgttatttga tatgcgacga gctccattac 4381 aggcgaggcg atgaatgagc cacgactgac cttaatggat ccagatgctc atagacggat 4441 ctctgtccga cggagcactg aaggtgcaca gttttacgat cctaggcgtg ggggcggaat 4501 ggcctcggag tgatgccagt gtagaacccc ccatttcgat ccctttccac tgctgaaccg 4561 ccaataaaag gagaatgctg ttgccccaat ctctcgattc cgaaccggaa tccgaagtcg 4621 ggggttgaga tcgctttcgc ttttgcttct gttttctgtt tctgctgccg ctgcaattat 4681 cgaaaagaaa atatactgaa caaccgacat atgcatacat aatttgtacc aattaaattt 4741 atttatacaa catgcactcg cccagaagga aacaacctaa ccctcctgct ccgatttcag 4801 tccctagtcc aaaatacgaa gcttggctcg ctctcttggc taattagttt taatggctcc 4861 attagatgat ggaaattgaa tttcctccaa acaggcttac cctctcatga ttgttgttgt 4921 ttttggtata ggaattagta caatttttgg ctacaaatcg aaagaaaaca gaacaaattt 4981 gaactgccag cggttctctc tctttctctt tctttctttc tctctctcta tctctttcag 5041 agcactgcca aatccttgag gcgccttatg ccagttaggg taagcagcaa ttccaaaaaa 5101 ccaaacagaa acacacgaac acaattagct attgtaaatt tggaacaaac tcaaaagaca 5161 ttgtaaatgt acaactcaca tgtttttggc ttaatttagt taaccaaccc gtactcatcg 5221 ggccattggc aaattattgt ttgctccgcc gtctgactct cacccctgat gttcgttgtt 5281 cgtctgtggc cccagggtca ttggcttggc actcgggaaa atcaaaatgc gcttttgaat 5341 aaacccaccc tcttgggagc acttttggcg gttgggcggc gagtggtgtt tgcatagtaa 5401 atgagacgcc ccagaccgaa agggaaactg tttatcgatg acagcataat actctaacaa 5461 ttgtatcaaa ttttcgggct tccaaaaaag gactccccgg ggaagagctg gccttatcgc 5521 gatgatgtgc caaaagagcc ggaaactcat taagatctgt tcccttccgg cccggccttt 5581 catcaaatgc aaaacattcg ataacggaaa tttatgaaag taattgcatc taattgtcaa 5641 aggcttccag ccagaaccca agggggttga atttagtaag gatgctgaca gatggccatt 5701 ggaaaatggc ggccaagaga aagtgtcgca aacaatatca ttgcctattc aataaaactt 5761 ctctgtactc tattggccat tatacctgaa cgtatcccat atcaacctgt gacactgtga 5821 catatgccac agccattccc caggacccca cttaacagca gggcactctc ttggccagat 5881 aagcgaaaac atttcgagag cgatatagac accagcaagc agttctgacc aagctggcca 5941 agaagacggt gtaagaacaa agcgcgctgg cgactcgaat gtccacgacc aggacaacaa 6001 caacgctaac ggccccgaca actgacggtc tcaaattacc aaaaagaaaa tgaatgcaga 6061 gcgaggataa atgtgtattt acactcagga aaaggacgaa aattcctcaa aaaaagagta 6121 tttcaatttt aaggtgttaa gttcgttcgt tgatgtttga aaggaattca aaggcatatc 6181 aatatcagta aaagaaatta caagtaagga tcccactggc aatttaaatt taagcatata 6241 tttccaatta tgaattgaat ggattgaatg gacgtgaagc acctatttct aggcctagcc 6301 actttttttt cggtgtacat gcgtgtgtat tggcgatgcc agagcataga tatgtacagc 6361 aggcagggga aatgggaatg gaaaagggaa aggacggact tttaaaccca aattgccatg 6421 actcatgccg ggcagcacac gaaaacacga gagtcacacg accgttttaa aggagggcac 6481 ggaaaaaggt gttgtcctcg gcaggacacg cacccacaca ggcacaccaa gacacagata 6541 tgtccgcagg acctcataaa gccgactggg aatgtgcagc ggggcgtggc ggcagtggtt 6601 ctccacgccg ggaagccgca gagtgtgcgg ttgcatgtgg cctcatatgg caggggcagg 6661 tggctggccc gcccgagcaa atgactctgg gcccggttta gggtcagtgg ctggcgtttt 6721 ggggtggctg gaaaatgcat tttcacatcc cgctctcaat tgaatgacac acaatccact 6781 tgggcggcgt aacccaccca gaggcttacg taattggttc aataaacctt ttttttctag 6841 tacgccgccc tggcatcacc accaggacat gcataaagtt gttcctcatt ttattagttt 6901 gttagacgtt gccatggtgc tggccggttg ccgttttgcc aaaaccgccc agcaaccgtc 6961 tgcgaaccac cccttaaggc acccactcca tcctagatcc accacccact actccatccc 7021 actcggttcg ataagatatt ggcaatagag cgacgcgaat gagtgtaacg aacccgttta 7081 tctttgcagt tctacagcct gctggagagc aatcaaattg ccaactcaac gctttcgctg 7141 cgcagctgca cgatctttgt gcccaccaat gaagccttcc agcgctacaa gagcaaaacc 7201 gcccatgtgc tctatcacat tagtgagttc agtttcaatc aggctgatta aaaataataa 7261 aaatcaaaat aacatgactc taccctctga ttactctttg cagccactga ggcgtacacc 7321 cagaaacgac tgccgaatac cgtgtcatcg gacatggccg gcaatccacc gctgtacatc 7381 acaaagaact cgaatggcga catctttgtg aacaatgccc ggatcatacc ctcgctcagt 7441 gtggagacaa acagcgatgg caagcggcag gtcagtggcc ccacatatgc atatgttgca 7501 tttcatagag tgattgaaca actctcttca ttacttaccg acagatcatg cacatcatcg 7561 acgaggtact ggagccgctc accgtcaagg ctggccattc ggataccccc aacaatccga 7621 atgctctcaa gttcctgaag aacgccgagg agttcaacgt ggacaacatc ggtgtgcgca 7681 cgtaccgcag ccaggtgacg atggccaaga aagagtcggt ctatgatgcc gccggacagc 7741 acacgttcct ggttcccgtc gatgaaggct tcaaggtgct tttctagttt aattgcatta 7801 tttgttacgc attgtaatct cactgttgct ttatgactcc agctctcggc tcgcagcagc 7861 ctcgtggacg gcaaggtcat cgatggccat gtgataccaa acactgtcat cttcactgcc 7921 gctgcccagc atgacgatcc caaggcttcc gccgcttttg aggacttact caaggtcacc 7981 gtcagtttct tcaagcagaa gaacggcaaa agtaagattg cactaaagac ataccatttt 8041 actaaaggtt tacgaatgat tgtatggttg atataatttt acttgcagtg tacgtcaagt 8101 caaacaccat tgtgggtgat gccaaacacc gcgtgggcgt ggttctggcc gagatcgtga 8161 aggcgaacat cccagtgagc aacggagtag tccatctgat ccaccgcccg ctgatgatca 8221 tcgatacgac ggtcacccaa ttcctgcagg tatgtagggt gttgaagact agcatccagt 8281 gcaattgtag acacacagac acgacacaga cagtccaaga aacaaacaca tagaaacaat 8341 gatagtgcct gtggcttgtg ataaggcttg gcattttgat tggctcggca cgcactgaaa 8401 gatagcgctg gcggcagacc gatggactgg gaacataaat cacttgcctg ctgccccagc 8461 gaatgctaca aacacatgag ctaaacacat tgaaacatac acactgataa gcggcttttc 8521 tttgccctaa gtgagcagct cctccaagtg gcgccaaagg gataatattt ccaaggttga 8581 agtttcgtcg gggctcaact gagggcaaag gccaaaacca aataatcttg cacccagaaa 8641 ccactgccta aacacataca gtataaataa ataagatcaa tgcagacact agttttcagt 8701 atcccctaga tatatctaac taatacaaat gaaaacccaa aagactgatt ttatttggaa 8761 ctgaattcgc tcaaaagtct gtattccgat tcggttctat gtaaaatcaa aagcaaaatt 8821 ttattttaaa ataattttta aaatataaat tttattttaa aaaaattttg ggtgtaatgt 8881 aagtattgcc aaaaagctga aaaacaaaaa aaaaaaaaac aaatcttaaa gcaatgtatt 8941 tcatgtaaat ttttctatga atcagatttt cgtatcaata tgcatttcac aagcagttgt 9001 ttgcattgca attgcaaaaa actgactcag tagggcaacg cgctcagaat tggccctcga 9061 aattcaatac gtgatttagt caccaattgg caaacactaa aaaaanttgc ttagcggctg 9121 cgctttgctt gatctgaccg aaatcgacca aaacttcatt cacaaaaact tccattgttc 9181 tatattgaaa tcgaccaaaa cttcattcac aaaaacttcc atttttctat attgaatgta 9241 ttggtacttt aatggcgtca atgaactaag caagtgtata aattattccc atcattgatt 9301 ggctaaagtt cttcgatccc tcgaccgccg tgcatatcca aagatcagtc atctgtacgc 9361 tcgcctacag ccggctagct ttgatcatca ccgtaaccgt agatgtatat gtatacacca 9421 agcataaact gataacaaac cacccacccg ctcgataaat agttaccctc ctcataggca 9481 tatacattag aaacagtgcg tttcgaaact gtgaggcaaa gtgtcgccga acagttttga 9541 gtccgcactg tagaacggct atatcattcg accgctcgaa tcgtaagcgc cttcatatcg 9601 tgctatcgtt ttcgaaattt tcaaactagt tttgttgtaa ttacgtgcga tactctcctt 9661 ttgctaattt actgtatcca tacctctatc ctttgtttcc gtcacatgcc gaataattgt 9721 acttaaaact acaaaacaca atccaaccca accacatatc gctcgaaatt gaatcggtgc 9781 acccaaatat tgaaacaaaa ccaaaacgtt cagtcgttca aggtaaacaa cagactcaaa 9841 atccgatttt aattttaagt tgttcggttt cttttgactt gcattttgtt ttgcatcccc 9901 tgctatttta gctatgcatg agtttattac aaaaaatgtg tagacctttt ttgtaaaatt 9961 agtccattgg ttgttaaagt cgcgatgcct tagggtctaa gaatatttct tctgattttt 10021 attcacaaat agtctttata ggatgtacag cactagttta aatttgcaaa tgaagcaagt 10081 gcacatgata ctttaaagtt aaagagtatt tatcgtatga tgagtttaca gaaattttac 10141 ctaattactt aagataaact tctatttttt gtcggagctt ttatatcctt gttttatttt 10201 tttaaatcat agtaatggaa attattatga ttcgctgtaa attgttgaaa aatgcacgta 10261 actcagtaac attgtacata atggagccaa accgtttgcc tgttctttga aaaatgtcta 10321 gaacaatgaa agattaaata taattccctt gtgcttcatt tcaaacgttc gtttgtggcg 10381 gaacaactcg ggcaatctgc gtggaaccca aatcccatat taaacaatat ttaaattcaa 10441 ttttattcga aacttgtctt caccaaatca cttcgaatga atattgaact atttctgcta 10501 acactccgtt ttaaaactga tttagtttat gaacgtaagt aaacatcaca ccagcagaca 10561 aagaacaaaa accacaacag tcatcagtgc aacgctttgc tatcaaattg ccatccaacc 10621 caaccaccgt gcaaacaatt caatgattcg gtagattttc cgtcattaaa tcgcataaat 10681 tcagctgtgt taagtgagtc attaagcgag cattcccatt gattcacaac ttaattggcc 10741 agccatccat tcccccaccc atatgtgtat aactggctta cattataatc ggatatattt 10801 ccaattgttg tatttttttc ctaattcttt gtactactcc caaaaaaaaa aaaacaattt 10861 ataagcttat gcatacccct cttccattca ccgtttattt atttatatgc attttgtttt 10921 aaggtacttc aaatagcatt atgatatgtt cattggtatt aattttgcaa tctatcttga 10981 aggagaagga gaatgctgag aacggagctc tgcgcaagtt ctacgaagtt ataatggaca 11041 atggtggagc agttctggac gacatcaata gcctgacaga agtgaccatt ttggctccca 11101 gcaatgaggc ttggaactcc tcgaacatca acaatgtttt gcggtaatgt tgctcaaatc 11161 gggaagataa tatttaatta tcataataat aaataaatac atcttacaga gatcggaata 11221 agatgaggca gatcctgaac atgcatatca tcaaggaccg cttaaatgtg gacaagatca 11281 ggcagaaaaa tgcaaatttg gtgagcttta catgagctta taatcaaatg gaaatttttt 11341 acaaacacca ttcttttcaa aagattgccc aggtgcccac tgtcaacaac aacactttcc 11401 tgtacttcaa cgttcgcggt gagggatcgg ataccgtgat aacagttgag ggaggcggcg 11461 tgaatgccac cgttatccag gctgatgtgg cccagactaa tggttatgtt cacatcatcg 11521 accatgtgct gggcgtgcct tacactacag ttcttggcaa acttgaatcc gatcccatga 11581 tgaggtatgt tcaattttaa agctttatta gtacgctgtt taaattttat cgtattcctt 11641 aagtgacacc tataagatgg gaaaattctc gcactttaat gaccagctga acaacacaca 11701 acgccgcttc acctactttg tgcccaggga caagggctgg cagaagaccg agctggatta 11761 cccatcggct cacaagaagc tttttatggc cgacttttcc tatcatgtag gtttttcacc 11821 ctaagcctat taggcttttc tattatactc aaaatgtttt gtaaccgcag tccaagtcca 11881 ttctggagcg tcatttggct atttcggata aggagtacac catgaaggat ctggttaagt 11941 tttcgcaaga atcgggcagc gtaatcctac ccacgttccg cgactctttg agtatccgcg 12001 tggaggagga agctggacgt aagtataaca cattggatac caagggctca gttgcttgtc 12061 ctttcaattc aagtcgattc gtttcaccta atttgattga ttctattggt cttcccatac 12121 cgtgctgcat tactaaccga acctgttttt ttacaactct taaactaaat ttttcgcttt 12181 ctgcttaact aaaactaatc gttgtccaat caaaagatct ccatgatgag tatgctagtc 12241 acgaatggac tggtgagtgc tgccgttttt caaacccaca tcgcaccgcc cacaacgaaa 12301 tccttgacct ggccatgtcc acccatagct cgctcattgg gcaatccact agcattaacc 12361 acccatcgat tgcataccaa ttttctattg tctagtggct attaacaacg attacttcca 12421 tttgcaggct atgtgatcat ttggaactac aagaagatca acgtataccg gcccgatgtt 12481 gagtgcacca acggaattat ccacgtcatc gactacccac tcctggagga aaaggatgtg 12541 gtcgtggccg gaggtagcta tttgccagaa tcaagcattt gcatcatctt ggccaacctc 12601 ataatgataa cagtagcaaa gttcttgaac taaatgcatc cgatatgtaa aaacaaatcc 12661 aatccaaagc aaatgcaaat caaacacaac aacaacagtc gtctacagaa caagaaccaa 12721 caacactcag tatcagacta acttaacatc cacatggatc taaataatca gcaccagttt 12781 gttgataccg atcgaaaacc acaagcaacc caaacagtat ctgtaatata tacgtcacaa 12841 ggaacgatca tcattccaac ccaacatgcc cacgtccacg ccatctacat acccacatac 12901 acacactcag aaaaaaacga tgcagaaaac tgcctgctaa atttgctttt gcatcgacag 12961 cttctcccaa aatagctggc aagagagact aagataaagt cgaagaaacc cctatgttta 13021 agtattaaac tcgattttcc ctcaaacggc cttatgtata atattgaata tgaaactttc 13081 tgtaaaacat tttaatttta tgttcgccct atactggtag cttacgttca tcagctcgct 13141 atgttttaag tttagaccca cactgacatt ggtatcgttc catgtaacaa ctgattgtgt 13201 ttatgatttt aagtgtacat tttttgtgaa ttgcttgttt tagttaatgg tctaatttat 13261 gatttatgaa atgagttatc ttaagtagaa tgcgaatctc attgtacatc tcgaaaacga 13321 agaagctaag cataagcaat aagcgcatca cacactctca tactcgagcg cccacacatt 13381 ccccctattc ctatccataa atacatacaa catgtaacga aacaaagttc attgtaagat 13441 ttgaaagtgt gaaatttgat aataaactga aagtttttca tttgaacctt actattaaga 13501 gcggcagcaa tcttgtctcg aaccaatcgc gtgacccctg taggcctaga actatatcag 13561 tgtcggacaa gaaatgtgca accaacaaac ccaattatat gatatgccat cctgcactta 13621 gttactagcc ccgctatgca tgtgaacccc cagaaaaacc cattcgccca cacacaccag 13681 ccacttggag cgagagtagc cactgcaact aggttagtcc aatgtataag ttccgaaact 13741 gccgacaact acgaaaacta tacagcatac ataaactccc tgtaatctaa atcactcacg 13801 gcacacaact acaactacaa ctacatgtca tcattgaatg gattttgata ccgattttaa 13861 cttgcatata aacaaaaaca agaactagac aacgtgaagg agattttaaa caaaattctc 13921 cctcggtcga gcagttgcat ttcaaacttt gtacgtagtt taaaactagt tttttagtcc 13981 gacgtagaac aacccaattg ctaactatat accaactttc tttctatttc tctctgtctc 14041 tccccctaat gctatgtact tataggttag acattgtaac tattgtaatc aacccagtgc 14101 gttaaacccg agtgttaagt cgaacagtaa cacagaattg tactatcccc aaatgaataa 14161 ctatcagcct accagtacac tgtctcaact ctcaccacca ccaccaccca cttaggaact 14221 cagtcgaact tgaactcgaa atcaaagatc cagttgtggc agtcgcttca cgtagttgct 14281 aattcccaat tcgaaccgat cctttccgaa agtcttatct ttagtatagg tggtttagtt 14341 tcatttggag ccgtgcagtg ccgtagcagc taagtaaaaa tgtatgaaat gaagatgaac 14401 acgagatcga aatcgtacgg aatgatcaga aatcagaaat aaataatgaa tacgctaatg 14461 aattgtacaa gtaagcttta aagaattgct ggaggagcgc ggatcggaga acttagagga 14521 gggagaaccg cattgcaatc gcattgcaat ttgtgtcgta gtcagtagtt acacgttaag 14581 cggcgtctta acgtgtaact agtgccttac taaagataaa cgcattacct taacctttat 14641 acaaatttac tcaaaacata cttgtacccc aagcatacgt tccgcttcga atgataccca 14701 gatatatata cggagttaca ccccaagaat acaagtataa ctacaaatga tattgcgcca 14761 cacgctattt acaccaaata caccaaacaa atcgagaaat gcatattttt catatattta 14821 attgtcagaa taatataacg tatatgtaat atgtagttta tttactgtaa aacgcaagaa 14881 cctaacaagt ggaatttgaa tcacatacaa ttgatgtata ttagctattg agtttctaag 14941 caagcgttag acactgaaat atatgttcaa atatatgatt caaatatata caatatgcga 15001 aaccaagcaa actatggaaa actggagtgc caaaagaata ttattccatt ttattttacg 15061 acaagcgctt tttacaaata aaccgaatcc atttaaatta ctcgtaaaga acagacagat 15121 tatatttagc attagttaaa ctaattatta catgtactag aaaaccgaat gtcaaccgag 15181 aatcttcagc aagcttgagc gaataataaa actttaaaac taactataaa taaatcgacc 15241 gtcctttttt gcacttaatc atgggttatg gttgtggaac ctgaattc // LOCUS DROIAB 2131 bp ds-DNA INV 14-FEB-1990 DEFINITION D.melanogaster abdominal-A (iab) 1.7 and 2.0 kb RNAs, complete cds. ACCESSION M32109 KEYWORDS . SOURCE D.melanogaster (strain Canton S) embryo DNA, and cDNA to mRNA. ORGANISM Drosophila melanogaster Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta; Pterygota; Neoptera; Holometabola; Diptera; Brachycera; Cyclorrhapha; Schizophora; Drosophiloidea; Drosophilidae. REFERENCE 1 (bases 1 to 2131) AUTHORS Cumberledge,S., Zaratzian,A. and Sakonju,S. TITLE Characterization of two RNAs transcribed from the cis-regulatory region of the abd-A domain within the Drosophila bithorax complex JOURNAL Proc. Natl. Acad. Sci. U.S.A. (1900) In press STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence [1] kindly submitted by S.Sakonju, 14-FEB-1990. FEATURES from to/span description RNA 141 1802 1.7 kb RNA RNA 141 2101 2.0 kb RNA BASE COUNT 592 a 468 c 478 g 593 t ORIGIN Chromosome III, bithorax complex. 1 ttagttcgaa gcacgtgcgt gtgagttccg catgaacggt acaccaacac tttcgaacag 61 aaacccctca cagatactcc tcacacagat actctactgt actgggctac gatatctgtg 121 agatacgcac agataccggc aggccgccgg cgtcaaagtc gacgtgctgc cggcgtttga 181 gagaccgagc ggcgtgcaag aaatcgtcgg gcaacggctt caaatgaacg caaaacgttg 241 gaaagcaaac aaccggcaaa agccatcagc agctaaaaat aaaactgcaa ctatcaaagt 301 gaatctgtag agaaagagtg atagctcagt gtgcccagtg tttatttaat tacttattta 361 aatttgattt tcaaaataag ttgtgatttt ggttcgtttg tttgtttccc cattaaatcg 421 catcgcggaa ctggtttgca ttttgtatct tttatcgctg cagttcgttt ctgtgtggtt 481 tcatgtattc cccatctatt ccatctcttt ttgcagatac aagtgaatga aaccctaacg 541 tttaaacgcc accataaaca ccaaataaca aattttctgg caaacatggc atgtcacggt 601 cccatttaac gtggaaaatc ggcccagaaa ttcaacacga cgacaacttt ttcgactgtc 661 ccagaaagca aatatgtaaa gtgtatttgt gtgcgtgcca ttgacattga cttatggtca 721 tgaaggggtt agggggacgg gtctaagggt gggggttgat ggctaaaggg gggtctcgcc 781 gctacgacaa aaacttggat tctctcgttt cttttttgtc tttcgtcttc aacgcacatt 841 tatggtttgt tttgctcaca cacacactca aaaagcctac gcagcggcag agagggcaca 901 agcagcagag aaagagacga aacatgtctc ggatgatgat aaacgcctct cgtctatttt 961 ggcgggtcca tgaatgaaac aattgcctcg aatgactgac tgccagacag atagctagaa 1021 actgccctta cggcgggcaa attggggtat tttgggtttc ggtgactttg agggaccggg 1081 acctgagggc ccggaaagcc gacagcaata aacactgtgc tcagacaaaa gtctgatgag 1141 acagaccatc atgatgctca tcatcatcgg cgataataat gattttcatt tgcacattcc 1201 acccgtttcg ctgttgccgg cttccagcat tcagattcag tttgcattca gtggggcagg 1261 ttatttatgg cacttttcct atcgccactc actggggtaa tcgattttta ttggattaat 1321 attgctttcc gcatgatttg gccttctttt cgttatcact tagcaggttt cttgacgctg 1381 aggagtggac aatgactaaa actcagtcat ctggaaaagc tggaccaagg acatccatat 1441 acaagtgctg gctaaacgtc tgtggttatc gcaatcattt atggcggaat agaaggacac 1501 gtcttatgtg tcgattttcg ggccccatcc ctgttgaaaa atcagccgga cagcagcaac 1561 caccgaacac aagacacagg acgaccggaa gtgaggaaaa cgcttgtaaa atcggtgcac 1621 acgagtgcga gcgagacacg agcgaacggg ccgaaaggaa acgacaatta cgaggacacg 1681 ccggcttgct tttctgcttt tcctgtctgc tcctttcagt tgcggccgtg caatcgtcgg 1741 gtcaacaacg ccgcgtgtcc gatttcaatt tacaaattag aatatacatc tttacatctg 1801 gatttggttt catctttatc tgcatttgga tccgcgacga cgcataattt tcagcacttt 1861 tgaataatgg cgattctgtg cgacctcgta aacgtatact gaatgtatcc tgagtgtatc 1921 ctatccggta taccttcagt atacgtaaca cgagagagca ctagcaacgt gggatgtgaa 1981 acacttttta agacacctta atacacgtag cactttgtta tatggtctta cactttttaa 2041 atatctcgac ttataaatat ataatttaaa tataaatact ttaataatgc gatataagtt 2101 aaatacttcg atttactggc tggaagttca t // LOCUS FVBOPD 1693 bp ds-DNA BCT 05-MAR-1990 DEFINITION Flavobacterium sp. parathion hydrolase gene, complete cds. ACCESSION M29593 KEYWORDS parathion hydrolase. SOURCE Flavobacterium sp. (strain ATCC 27551) DNA, clone pPDL2. ORGANISM Flavobacterium sp. Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Acidiphilium cryptum; Neisseriaceae. REFERENCE 1 (bases 1 to 1693) AUTHORS Mulbry,W.W. and Karns,J.S. TITLE Parathion hydrolase specified by the Flavobacterium opd gene: Relationship between the gene and protein JOURNAL J. Bacteriol. 171, 6740-6746 (1989) STANDARD simple staff_entry FEATURES from to/span description pept 419 1516 parathion hydrolase signal 312 317 -35 region signal 334 339 -10 region binding 408 411 ribosme binding site BASE COUNT 372 a 497 c 477 g 347 t ORIGIN 1 bp upstream of BamHI site. 1 ggatccgagc ggcttaccgg cgccctgctc gaccggttga cccaccatgt ccatatcctc 61 gagatgaacg ggggcagcta tcggcttgcc agcagccgga aacggcaaaa gcacagccac 121 cacgcacagc aagaggaggt gcccccccca tgacctgatc agaaaacccc tcatctgctg 181 tgctgaacgg ccttcgctac gctccagacc gttcagcaca gcagatgaaa gcaccgcctc 241 gacaagaggc tttttgttca atccaactgg tacactctta caccggaatc ttgcacaatt 301 ttaccccggc attgacatct gacgcgtcaa cagtaaaaga aacaaccggt tcagatctgc 361 agcctgactc ggcaccagtc gctgcaagca gagtcgtaag caatcgcaag ggggcagcat 421 gcaaacgaga agggttgtgc tcaagtctgc ggccgccgca ggaactctgc tcggcggcct 481 ggctgggtgc gcgagcgtgg ctggatcgat cggcacaggc gatcggatca ataccgtgcg 541 cggtcctatc acaatctctg aagcgggttt cacactgact cacgagcaca tctgcggcag 601 ctcggcagga ttcttgcgtg cttggccaga gttcttcggt agccgcaaag ctctagcgga 661 aaaggctgtg agaggattgc gccgcgccag agcggctggc gtgcgaacga ttgtcgatgt 721 gtcgactttc gatatcggtc gcgacgtcag tttattggcc gaggtttcgc gggctgccga 781 cgttcatatc gtggcggcga ccggcttgtg gttcgacccg ccactttcga tgcgattgag 841 gagtgtagag gaactcacac agttcttcct gcgtgagatt caatatggca tcgaagacac 901 cggaattagg gcgggcatta tcaaggtcgc gaccacaggc aaggcgaccc cctttcagga 961 gttagtgtta aaggcggccg cccgggccag cttggccacc ggtgttccgg taaccactca 1021 cacggcagca agtcagcgcg atggtgagca gcaggccgcc atttttgagt ccgaaggctt 1081 gagcccctca cgggtttgta ttggtcacag cgatgatact gacgatttga gctatctcac 1141 cgccctcgct gcgcgcggat acctcatcgg tctagaccac atcccgcaca gtgcgattgg 1201 tctagaagat aatgcgagtg catcagccct cctgggcatc cgttcgtggc aaacacgggc 1261 tctcttgatc aaggcgctca tcgaccaagg ctacatgaaa caaatcctcg tttcgaatga 1321 ctggctgttc gggttttcga gctatgtcac caacatcatg gacgtgatgg atcgcgtgaa 1381 ccccgacggg atggccttca ttccactgag agtgatccca ttcctacgag agaagggcgt 1441 cccacaggaa acgctggcag gcatcactgt gactaacccg gcgcggttct tgtcaccgac 1501 cttgcgggcg tcatgacgcc atctggatcc ttccagccag cggccactat tccccgtcaa 1561 gataccgaac gatgaagtcg cgcatcgatc gataggcatc ttcaatttga tcagggctgc 1621 cacctccaaa gccgtggcca cccctgtcga tagtcttgag gacgtagggc acaccgtgct 1681 tttcgaactg cag // LOCUS HUMCSAE 1372 bp ss-mRNA PRI 26-FEB-1990 DEFINITION Human epithelial glycoprotein (EGP) mRNA, complete cds. ACCESSION M32306 KEYWORDS cell surface antigen; cell surface glycoprotein; epithelial glycoprotein; transmembrane protein; tumor-associated antigen. SOURCE Human colon carcinoma cell line HT-29, cDNA to mRNA, (library HT-29/cdm8). ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 1372) AUTHORS Simon,B., Podolsky,D.K., Moldenhauer,G., Isselbacher,K.J., Gattoni-Celli,S. and Brand,S.J. TITLE Epithelial glycoprotein is a member of a family of epithelial cell surface antigens homologous to nidogen, a matrix adhesion protein JOURNAL Proc. Natl. Acad. Sci. U.S.A. (1900) In press STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence [1] kindly submitted by B.Simon, 23-FEB-1990. FEATURES from to/span description pept 22 966 epithelial glycoprotein (EGP) precursor sigp 22 84 epithelial glycoprotein signal peptide matp 85 963 epithelial glycoprotein mRNA < 1 1372 EGP mRNA BASE COUNT 436 a 233 c 317 g 386 t ORIGIN 1 gctctaaagg cgcgcgcagg catggcgccc ccgcaggtcc tcgcgttcgg gcttctgctt 61 gccgcggcga cggcgacttt tgccgcagct caggaagaat gtgtctgtga aaactacaag 121 ctggccgtaa actgctttgt gaataataat cgtcaatgcc agtgtacttc agttggtgca 181 caaaatactg tcatttgctc aaagctggct gccaaatgtt tggtgatgaa ggcagaaatg 241 aatggctcaa aacttgggag aagagcaaaa cctgaagggg ccctccagaa caatgatggg 301 ctttatgatc ctgactgcga tgagagcggg ctctttaagg ccaagcagtg caacggcacc 361 tccatgtgct ggtgtgtgaa cactgctggg gtcagaagaa cagacaagga cactgaaata 421 acctgctctg agcgagtgag aacctactgg atcatcattg aactaaaaca caaagcaaga 481 gaaaaacctt atgatagtaa aagtttgcgg actgcacttc agaaggagat cacaacgcgt 541 tatcaactgg atccaaaatt tatcacgagt attttgtatg agaataatgt tatcactatt 601 gatctggttc aaaattcttc tcaaaaaact cagaatgatg tggacatagc tgatgtggct 661 tattattttg aaaaagatgt taaaggtgaa tccttgtttc attctaagaa aatggacctg 721 acagtaaatg gggaacaact ggatctggat cctggtcaaa ctttaattta ttatgttgat 781 gaaaaagcac ctgaattctc aatgcagggt ctaaaagctg gtgttattgc tgttattgtg 841 gttgtggtga tagcagttgt tgctggaatt gttgtgctgg ttatttccag aaagaagaga 901 atggcaaagt atgagaaggc tgagataaag gagatgggtg agatgcatag ggaactcaat 961 gcataactat ataatttgaa gattatagaa gaagggaaat agcaaatgga cacaaattac 1021 aaatgtgtgt gcgtgggacg aagacatctt tgaaggtcat gagtttgtta gtttaacatc 1081 atatatttgt aatagtaaag cctgtactca aaatataagc agcttgaaac tggctttacc 1141 aatcttgaaa tttgaccaca agtgtcttat atatgcagat ctaatgtaaa atccagaact 1201 tggactccat cgttaaaatt atttatgtgt aacattcaaa tgtgtgcatt aaatatgctt 1261 ccacagtaaa atctgaaaaa ctgatttgtg attgaaagct gcctttctat ttacttgagt 1321 cttgtacata catacttttt tatgagctat gaaataaaac cattttaaac tg // LOCUS HUMNOXF 2206 bp ss-mRNA PRI 15-FEB-1990 DEFINITION Human neutrophil oxidase factor (p67-phox) mRNA, complete cds. ACCESSION M32011 KEYWORDS neutrophil oxidase factor. SOURCE Human promyelocytic leukemia myeloid cell line HL60, cDNA to mRNA, clone 10. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 2206) AUTHORS Leto,T.L., Lomax,K.J., Volpp,B.D., Nunoi,H., Sechler,J.M.G., Nauseef,W.M., Clark,R.A., Gallin,J.I. and Malech,H.L. TITLE Cloning of a 67K neutrophil oxidase factor and its similarity to a noncatalytic region of P60c-src JOURNAL Science (1900) In press STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence [1] kindly submitted by H.L.Malech, 08-FEB-1990. FEATURES from to/span description pept 68 1648 neutrophil oxidase factor (p67-phox) mRNA < 1 2206 p67-phox signal 2030 2035 poly-A signal signal 2181 2186 poly-A signal BASE COUNT 624 a 499 c 573 g 510 t ORIGIN 1 ctagtctttc agccttcagg ctgtttttgg cttgaagctc tcttggcctc ctagtttcta 61 cctaatcatg tccctggtgg aggccatcag cctctggaat gaaggggtgc tggcagcgga 121 caagaaggac tggaagggag ccctggatgc cttcagtgcc gtccaggacc cccactcccg 181 gatttgcttc aacattggct gcatgtacac tatcctgaag aacatgactg aagcagagaa 241 ggcctttacc agaagcatta accgagacaa gcacttggca gtggcttact tccaacgagg 301 gatgctctac taccagacag agaaatatga tttggctatc aaagacctta aagaagcctt 361 gattcagctt cgagggaacc agctgataga ctataagatc ctggggctcc agttcaagct 421 gtttgcctgt gaggtgttat ataacattgc tttcatgtat gccaagaagg aggaatggaa 481 aaaagctgaa gaacagttag cattggccac gagcatgaag tctgagccca gacattccaa 541 aatcgacaag gcgatggagt gtgtctggaa gcagaagcta tatgagccag tggtgatccc 601 tgtgggcaag ctgtttcgac caaatgagag acaagtggct cagctggcca agaaggatta 661 cctaggcaag gcgacggtcg tggcatctgt ggtggatcaa gacagtttct ctgggtttgc 721 ccctctgcaa ccacaggcag ctgagcctcc acccagaccg aaaaccccag agatcttcag 781 ggctctggaa ggggaggctc accgtgtgct atttgggttt gtgcctgaga caaaagaaga 841 gctccaggtc atgccaggga acattgtctt tgtcttgaag aagggcaatg ataactgggc 901 cacggtcatg ttcaacgggc agaaggggct tgttccctgc aactaccttg aaccagttga 961 gttgcggatc caccctcagc agcagcccca ggaggaaagc tctccgcagt ccgacatccc 1021 agctcctcct agttccaaag cccctggaaa accccagctg tcaccaggcc agaaacaaaa 1081 agaagagcct aaggaagtga agctcagtgt tcccatgccc tacacactca aggtgcacta 1141 caagtacacg gtagtcatga agactcagcc cgggctcccc tacagccagg tccgggacat 1201 ggtgtctaag aaactggagc tccggctgga acacactaag ctgagctatc ggcctcggga 1261 cagcaatgag ctggtgcccc tttcagaaga cagcatgaag gatgcctggg gccaggtgaa 1321 aaactactgc ctgactctgt ggtgtgagaa cacagtgggt gaccaaggct ttccagatga 1381 acccaaggaa agtgaaaaag ctgatgctaa taaccagaca acagaacctc agcttaagaa 1441 aggcagccaa gtggaggcac tcttcagtta tgaggctacc caaccagagg acctggagtt 1501 tcaggaaggg gatataatcc tggtgttatc aaaggtgaat gaagaatggc tggaagggga 1561 gtgcaaaggg aaggtgggca ttttccccaa agtttttgtt gaagactgcg caactacaga 1621 tttggaaagc actcggagag aagtctagga tgtttcacaa actacaaagc tgaagaaaat 1681 gaagccctat tacttgtttg taagatttag cacccttctg ctgtatactg tactgagaca 1741 ttacagtttg gaagtgttaa ctatttattc cctgttaaaa tttaacctac tagacaatga 1801 tgtgagtacc caggatgatt tcctggggca cagtgggtga ggagatgggg acaggtgaat 1861 ggaggagtta ggggagagga aaagtggatg gaagtgtctg gaaagggcac gagagagtct 1921 tccaggtact gatcctgttt cttgctctga gtgctagcta gccagctgtg ttcacactgt 1981 aaacattcat caagctgtac atttggtgca cttttctgtg tcataccaca ataaaaaaaa 2041 acctatcatc atcttacaaa aacaagacac ccaagtccag gcccaaggag taagtacaaa 2101 tattcctgtt tctgaaccat tactgtaatt ggctcttaag gcttgaagta accttatagg 2161 ttactcataa ggcatataca aataaacttg tttgttttct tttttc // LOCUS HUMUDPGTA 1855 bp ss-mRNA PRI 26-FEB-1990 DEFINITION Human 3,4-catechol estrogen UDP-glucuronosyltransferase mRNA, complete cds. ACCESSION J05428 KEYWORDS 3,4-catechol estrogen UDP-glucuronosyltransferase. SOURCE Human liver, cDNA to mRNA, clone 63-11. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 1855) AUTHORS Ritter,J.K., Sheen,Y.Y. and Owens,I.S. TITLE Cloning and expression of human liver UDP-glucuronosyltransferase in COS-1 cells: 3,4-catechol estrogens and estriol as primary substrates JOURNAL J. Biol. Chem. (1990) In press STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by I.S.Owens, 22-FEB-1990. FEATURES from to/span description pept 15 1604 UDP-glucuronosyltransferase (EC 2.4.1.17) BASE COUNT 571 a 337 c 394 g 553 t ORIGIN 1 tgcattgcac caggatgtct gtgaaatgga cttcagtaat tttgctaata caactgagct 61 tttgctttag ctctgggaat tgtggaaagg tgctggtgtg ggcagcagaa tacagccatt 121 ggatgaatat aaagacaatc ctggatgagc ttattcagag aggtcatgag gtgactgtac 181 tggcatcttc agcttccatt ctttttgatc ccaacaactc atccgctctt aaaattgaaa 241 tttatcccac atctttaact aaaactgagt tggagaattt catcatgcaa cagattaaga 301 gatggtcaga ccttccaaaa gatacatttt ggttatattt ttcacaagta caggaaatca 361 tgtcaatatt tggtgacata actagaaagt tctgtaaaga tgtagtttca aataagaaat 421 ttatgaaaaa agtacaagag tcaagatttg acgtcatttt tgcagatgct atttttccct 481 gtagtgagct gctggctgag ctatttaaca taccctttgt gtacagtctc agcttctctc 541 ctggctacac ttttgaaaag catagtggag gatttatttt ccctccttcc tacgtacctg 601 ttgttatgtc agaattaact gatcaaatga ctttcatgga gagggtaaaa aatatgatct 661 atgtgcttta ctttgacttt tggttcgaaa tatttgacat gaagaagtgg gatcagtttt 721 atagtgaagt tctaggaaga cccactacgt tatctgagac aatggggaaa gctgacgtat 781 ggcttattcg aaactcctgg aattttcagt ttcctcatcc actcttacca aatgttgatt 841 ttgttggagg actccactgc aaacctgcca aacccctgcc taaggaaatg gaagactttg 901 tacagagctc tggagaaaat ggtgttgtgg tgttttctct ggggtcaatg gtcagtaaca 961 tgacagaaga aagggccaac gtaattgcat cagccctggc ccagatccca caaaaggttc 1021 tgtggagatt tgatgggaat aaaccagata ccttaggtct caatactcgg ctgtataagt 1081 ggatacccca gaatgacctt ctaggtcatc caaagaccag agcttttata actcatggtg 1141 gagccaatgg catctacgag gcaatctacc atgggatccc tatggtgggg attccattgt 1201 ttgccgatca acctgataac attgctcaca tgaaggccag gggagcagct gttagagtgg 1261 acttcaacac aatgtcgagt acagacttgc tgaatgcatt gaagagagta attaatgatc 1321 cttcatataa agagaatgtt atgaaattat caagaattca acatgatcaa ccagtgaagc 1381 ccctggatcg agcagtcttc tggattgaat ttgtcatgcg ccacaaagga gctaaacacc 1441 ttcgggttgc agcccacgac ctcacctggt tccagtacca ctctttggat gtgattgggt 1501 tcctgctggt ctgtgtggca actgtgatat ttatcgtcac aaaatgttgt ctgttttgtt 1561 tctggaagtt tgctagaaaa gcaaagaagg gaaaaaatga ttagttatat ctgagatttg 1621 aagctggaaa acctgatagg tgagactact tcagtttatt ccagcaagaa agattgtgat 1681 gcaagatttc tttcttcctg agacaaaaaa aaaaaaaaga aaaaaaaatc ttttcaaaat 1741 ttactttgtc aaataaaaat ttgtttttca gagatttacc acccagttca tggttagaaa 1801 tattttgtgg caatgaagaa aacactacgg aaaataaaaa ataagataaa gcctt // LOCUS PSERCBC 1060 bp ds-DNA BCT 08-MAR-1990 DEFINITION P.aeruginosa carbenicillinase gene, complete cds. ACCESSION J05162 KEYWORDS carbenicillinase. SOURCE P.aeruginosa (strain Dalgleish) DNA, clone pMON709. ORGANISM Pseudomonas aeruginosa Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Acidiphilium cryptum; Pseudomonadaceae. REFERENCE 1 (bases 1 to 1060) AUTHORS Boissinot,M. and Levesque,R.C. TITLE Nucleotide sequence of the PSE-4 carbenicillinase gene and correlations with the Staphylococcus aureus PC1 beta-lactamase crystal structure JOURNAL J. Biol. Chem. 265, 1225-1230 (1990) STANDARD full staff_entry COMMENT Draft entry and printed copy of sequence kindly provided by authors, 25-JAN-1990. FEATURES from to/span description pept 122 1036 carbenicillinase BASE COUNT 316 a 195 c 246 g 303 t ORIGIN 357 bp upstream of HincII site. 1 tcgggcatcc aagcagcaag cgcgttacgc cgtgggtcga tgtttgatgt tatggagcag 61 caacgatgtt acgcagcagg gcagtcgccc taaaacaaag ttagccatat tatggagcct 121 catgctttta tataaaatgt gtgacaatca aaattatggg gttacttaca tgaagttttt 181 attggcattt tcgcttttaa taccatccgt ggtttttgca agtagttcaa agtttcagca 241 agttgaacaa gacgttaagg caattgaagt ttctctttct gctcgtatag gtgtttccgt 301 tcttgatact caaaatggag aatattggga ttacaatggc aatcagcgct tcccgttaac 361 aagtactttt aaaacaatag cttgcgctaa attactatat gatgctgagc aaggaaaagt 421 taatcccaat agtacagtcg agattaagaa agcagatctt gtgacctatt cccctgtaat 481 agaaaagcaa gtagggcagg caatcacact cgatgatgcg tgcttcgcaa ctatgactac 541 aagtgataat actgcggcaa atatcatcct aagtgctgta ggtggcccca aaggcgttac 601 tgatttttta agacaaattg gggacaaaga gactcgtcta gaccgtattg agcctgattt 661 aaatgaaggt aagctcggtg atttgaggga tacgacaact cctaaggcaa tagccagtac 721 tttgaataaa tttttatttg gttccgcgct atctgaaatg aaccagaaaa aattagagtc 781 ttggatggtg aacaatcaag tcactggtaa tttactacgt tcagtattgc cggcgggatg 841 gaacattgcg gatcgctcag gtgctggcgg atttggtgct cggagtatta cagcagttgt 901 gtggagtgag catcaagccc caattattgt gagcatctat ctagctcaaa cacaggcttc 961 aatggaagag cgaaatgatg cgattgttaa aattggtcat tcaatttttg acgtttatac 1021 atcacagtcg cgctgataag gctaacaagg ccatcaagtt // LOCUS RAT5MDAA 2030 bp ss-mRNA ROD 15-DEC-1988 DEFINITION Rat iodothyronine 5' monodeiodinase (5'MD) mRNA, 3' end. ACCESSION M21018 KEYWORDS iodothyronine 5' monodeiodinase. SOURCE Rat liver, cDNA to mRNA, clone 2302. ORGANISM Rattus norvegicus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 2030) AUTHORS Boado,R.J., Campbell,D.A. and Chopra,I.J. TITLE Nucleotide sequence of rat liver iodothyronine 5'-monodeiodinase (5' MD): Its identity with the protein disulfide isomerase JOURNAL Biochem. Biophys. Res. Commun. 155, 1297-1304 (1988) STANDARD simple staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by D.Campbell, 14-OCT-1988. FEATURES from to/span description pept < 1 1449 iodothyronine 5' monodeiodinase (AA at 1) BASE COUNT 527 a 493 c 532 g 478 t ORIGIN 1 gtcctggtgc tgaagaagag caacttcgca gaggcgctgg cggcgcacaa ctacctgctg 61 gtggagttct atgccccatg gtgtggccac tgcaaagcac tggccccaga gtatgccaaa 121 gctgctgcaa aactgaaggc agaaggctct gagatccgac tagcaaaggt ggacgccaca 181 gaagagtctg acctggccca gcagtatggt gtccgtggct accccacaat caagttcttc 241 aagaatggag acacagcctc cccaaaggaa tatacagctg gcagggaagc tgacgacatt 301 gtgaactggc tgaagaaacg cacaggccca gcagccacaa ccctgtctga cactgcagct 361 gcagagtcct tggtggactc aagcgaagtg acggtcatcg gcttcttcaa ggacgcaggg 421 tcagactccg ccaagcagtt cttgctggca gcagaggctg ttgatgacat accttttgga 481 atcacttcca atagcgatgt gttttccaag taccagctgg acaaggatgg ggtggtcctc 541 tttaagaagt ttgatgaagg ccgcaacaat tttgaaggtg agatcaccaa ggagaagctg 601 ttagacttca tcaagcacaa ccagctgcct ttggtcatcg agttcactga acagacagct 661 ccaaagattt tcggaggtga aatcaaaaca catattctgc tgttcctgcc caagagtgtg 721 tctgactacg atggcaaatt gagcaacttt aagaaagcgg ccgagggctt taagggcaag 781 atcctgttca tcttcatcga tagtgaccac actgacaacc agcgcatact tgagttcttt 841 ggcctgaaga aggaggaatg tccagctgtg cggcttatta ccctggagga agagatgacc 901 aagtacaaac cggagtcaga cgagctgaca gctgagaaga tcacacaatt ttgccaccac 961 ttcctggagg gcaagatcaa gccccacctg atgagccagg aactgcctga agactgggac 1021 aagcagccag tgaaagtgct agttgggaaa aactttgagg aggttgcttt tgatgagaaa 1081 aagaacgtgt ttgttgaatt ctatgctccc tggtgtggtc actgcaagca gctagccccg 1141 atttgggata aactgggaga gacatacaaa gaccatgaga atatcgtcat cgctaagatg 1201 gactcaacag ccaatgaggt ggaagctgtg aaggtgcaca gctttcccac actcaagttc 1261 ttcccagcaa gtgcagacag aacggtcatt gattacaacg gtgagcggac actagatggt 1321 tttaagaaat tcttggagag cggtggccag gatggagcgg gggacaatga cgacctcgac 1381 ctagaagaag ctttagagcc agatatggaa gaagacgacg atcagaaagc cgtgaaggat 1441 gaactgtagt gcagaagcca gatctgggcg cctgaaccca aaacctcggt ggccatgtcc 1501 cagcagccca catctccgga gcctgagcct caccccagga ggagcgccat cagaacccag 1561 ggaatctttc tgaagccaca ctcatctgac acacgtacac ttaaacctgt ccttcttttt 1621 ttgcttttca attttggaaa gggatctctg accaggccag cccatcttga agggctacgt 1681 tttgttttaa ttggtggtgt actttttgta cgtggatttt gtcccaagtg cttgctacca 1741 tatttgggat ttcacactgg taatgtcttt cctgttagag aggtttatgc tatcacttca 1801 gatttcgtct gtgagatgtt tcatcttcct gacatgtctt catgtcgagg tacttgttcc 1861 accacgcaga cctccctgag accccttctg ccctgcgcag gaggcgatgg ttctgggtcg 1921 tatgctctct ctctctccac cttgtactag tgttcgcatg acagcatggc ttttgtagtt 1981 tgcatttaac ctggggattt ctgcatcctg tcagagggtg ggtccccacc // LOCUS RIRANT17KA 539 bp ds-DNA BCT 23-FEB-1990 DEFINITION R.conorii 17K genus-common antigen gene, complete cds. ACCESSION M28480 KEYWORDS antigen. SOURCE R.conorii DNA. ORGANISM Rickettsia conorii Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Rickettsias and Chlamydias; Rickettsiales; Rickettsiaceae. REFERENCE 1 (bases 1 to 539) AUTHORS Anderson,B.E. and Tzianabos,T. TITLE Comparative sequence analysis of a genus-common rickettsial antigen gene JOURNAL J. Bacteriol. 171, 5199-5201 (1989) STANDARD full staff_entry COMMENT Draft entry and printed copy of sequence for [1] kindly submitted by B.Anderson, 27-OCT-1989. FEATURES from to/span description pept 60 539 17 kD antigen precursor sigp 60 119 17 kD antigen signal peptide matp 120 536 17 kD antigen mRNA 38 > 539 17 kD mRNA signal 1 6 -35 region signal 21 27 -10 region BASE COUNT 179 a 93 c 125 g 142 t ORIGIN 1 tttacaaaat tctaaaaacc atatacttat taattatata ttaatttaga gagaattata 61 tgaaactatt atctaaaatt atgattatag ctcttgcaac ttctatgtta caagcctgta 121 acggtccggg cggcatgaat aaacaaggta caggaacact tcttggcggt gctggcggcg 181 cattacttgg ttctcaattc ggtaagggca aaggacagct tgttggagta ggtgtaggtg 241 cattacttgg agcagttctt ggtggacaaa tcggtgcagg tatggatgaa caggatagaa 301 gacttgcaga gcttacctca cagagagctt tagaaacagc tcctagtggt agtaacgtag 361 aatggcgtaa tccggataac ggcaattacg gttacgtaac acctaataaa acttatagaa 421 atagcactgg tcaatattgc cgtgagtaca ctcaaacagt tgtaataggc ggaaaacaac 481 aaaaagcata cggtaatgca tgccgccaac ctgacggaca atggcaagtt gtgaattga // LOCUS RIRANT17KB 535 bp ds-DNA BCT 23-FEB-1990 DEFINITION R.typhi 17K genus-common antigen gene, complete cds. ACCESSION M28481 KEYWORDS antigen. SOURCE R.typhi DNA. ORGANISM Rickettsia typhi Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Rickettsias and Chlamydias; Rickettsiales; Rickettsiaceae. REFERENCE 1 (bases 1 to 535) AUTHORS Anderson,B.E. and Tzianabos,T. TITLE Comparative sequence analysis of a genus-common rickettsial antigen gene JOURNAL J. Bacteriol. 171, 5199-5201 (1989) STANDARD full staff_entry COMMENT Draft entry and printed copy of sequence for [1] kindly submitted by B.Anderson, 27-OCT-1989. FEATURES from to/span description pept 56 535 17 kD antigen precursor sigp 56 115 17 kD antigen signal peptide matp 116 535 17 kD antigen mRNA 37 > 535 17 kD mRNA signal 1 6 -35 region signal 21 27 -10 region BASE COUNT 182 a 93 c 122 g 138 t ORIGIN 1 tttacaaaat tctaaaaacc atatacttat taaataatag attaaagagg attatatgaa 61 actattatct aaagtaatga ttttagctct tgcagcttct atgttacaag catgtaatgg 121 tccgggtggt atgaacaaac aagggactgg aacacttctt ggtggcgccg gcggtgcatt 181 acttggttct caattcggtc acggtaaagg acaacttgtc ggagtaggtg taggcgcatt 241 acttggggca gttcttggtg gacaaatcgg tgcaagtctg gatgagcagg atagaaaact 301 tctagaacta acatcacaaa gagctttaga atctgctcct agcggtagta acatagaatg 361 gcgcaatcca gataatggca atcatggtta cgtaacacct aataaaactt ataggaacag 421 tacaggtcaa tattgccgtg aatacactca aacagttgta atagggggaa aacaacaaac 481 aacatatgga aatgcatgcc gccaacctga cggacaatgg caagttgtga attga // LOCUS RIRANT17KC 532 bp ds-DNA BCT 23-FEB-1990 DEFINITION R.typhi 17K genus-common antigen gene, complete cds. ACCESSION M28482 KEYWORDS antigen. SOURCE R.typhi DNA. ORGANISM Rickettsia typhi Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Rickettsias and Chlamydias; Rickettsiales; Rickettsiaceae. REFERENCE 1 (bases 1 to 532) AUTHORS Anderson,B.E. and Tzianabos,T. TITLE Comparative sequence analysis of a genus-common rickettsial antigen gene JOURNAL J. Bacteriol. 171, 5199-5201 (1989) STANDARD full staff_entry COMMENT Draft entry and printed copy of sequence for [1] kindly submitted by B.Anderson, 27-OCT-1989. FEATURES from to/span description pept 56 532 17 kD antigen precursor sigp 56 115 17 kD antigen signal peptide matp 116 529 17 kD antigen mRNA 37 > 532 17 kD mRNA signal 1 6 -35 region signal 21 27 -10 region BASE COUNT 183 a 88 c 121 g 140 t ORIGIN 1 tttacaaaat tctaaaaacc atatacttat ttaaataata attagagagg attatatgaa 61 actattatct aaaataatga ttatagctct tgcagcttct atgttacaag cttgtaatgg 121 tcagagtggt atgaacaaac aaggtaccgg tacacttctt ggtggcgcag gaggtgcatt 181 acttggttct caattcggtc aaggtaaagg acaacttgtc ggagtaggtg taggcgcatt 241 acttggggca gttcttggtg gacaaatcgg cgcaagtatg gatgagcagg atagaagact 301 tctagaacta acatcacaaa gagctttaga atctgcacct agcggtagta acatagaatg 361 gcggaatcca gataacggca atcatggtta cgtcacacct aataaaactt ataggaacag 421 tgcaggtcaa tattgccgtg aatacactca aacagttata ataggcggaa aacaaaaaac 481 atatggtaat gcatgccgtc aacctgatgg gcaatggcaa gttgtgaatt ga // LOCUS RIRANT17KD 539 bp ds-DNA BCT 23-FEB-1990 DEFINITION R.rickettsia 17K genus-common antigen gene, complete cds. ACCESSION M28479 KEYWORDS antigen. SOURCE R.rickettsia DNA. ORGANISM Rickettsia rickettsia Unclassified. REFERENCE 1 (bases 1 to 539) AUTHORS Anderson,B.E. and Tzianabos,T. TITLE Comparative sequence analysis of a genus-common rickettsial antigen gene JOURNAL J. Bacteriol. 171, 5199-5201 (1989) STANDARD full staff_entry COMMENT Draft entry and printed copy of sequence for [1] kindly submitted by B.Anderson, 27-OCT-1989. FEATURES from to/span description pept 60 539 17 kD antigen precursor sigp 60 119 17 kD antigen signal peptide matp 120 536 17 kD antigen mRNA 37 > 539 17 kD mRNA signal 1 6 -35 region signal 21 27 -10 region BASE COUNT 179 a 92 c 125 g 143 t ORIGIN 1 tttacaaaat tctaaaaacc atatacttat taattatata ttaatttaga gagaattata 61 tgaaactatt atctaaaatt atgattatag ctcttgcaac ttctatgtta caagcctgta 121 acggtccggg cggtatgaat aaacaaggta caggaacact tcttggcggt gctggcggcg 181 cattacttgg ttctcaattc ggtaagggca aaggacagct tgttggagta ggtgtaggtg 241 cattacttgg agcagttctt ggtggacaaa tcggtgcagg tatggatgaa caggatagaa 301 gacttgcaga gcttacctca cagagagctt tagaaacagc tcctagtggt agtaacgtag 361 aatggcgtaa tccggataac ggcaattacg gttacgtaac acctaataaa acttatagaa 421 atagcactgg tcaatattgc cgtgagtaca ctcaaacagt tgtaataggc ggaaaacaac 481 aaaaagcata cggtaatgca tgccgccaac ctgacggaca atggcaagtt gtgaattga // LOCUS ROTNS53 1611 bp ds-RNA VRL 02-MAR-1990 DEFINITION Simian rotavirus SA11 segment 5 nonstructural protein (NS53) gene, complete cds. ACCESSION X14914 KEYWORDS nonstructural protein. SOURCE Simian rotavirus, cDNA to viral RNAi, clone 35. ORGANISM Simian (SA11) rotavirus Viridae; ds-RNA nonenveloped viruses; Reoviridae. REFERENCE 1 (bases 1 to 1611) AUTHORS Mitchell,D.B. and Both,G.W. TITLE Conservation of a potential metal binding motif despite extensive sequence diversity in the rotavirus nonstructural protein NS53 JOURNAL Virology 174, 618-621 (1990) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence [1] kindly submitted by D.B.Mitchell, 03-APR-1989. FEATURES from to/span description pept 31 1518 nonstructural protein BASE COUNT 571 a 214 c 288 g 538 t ORIGIN 1 ggcttttttt tgaaaagtct tgtgttagcc atggctactt ttaaagatgc atgctttcat 61 tatcgtagat taactgcttt aaatcggaga ttatgcaaca ttggtgcaaa ttctatttgc 121 atgccagttc ctgatgagaa aattaagggg tggtgtttag aatgttgtca aatagctgat 181 ttaacccatt gttatggttg ctcattgccg catgtttgca aatggtgtgt tcagaacaga 241 agatgcttcc ttgacaatga acctcatttg cttaagctta gaactgtgaa acatccaatt 301 accaaagaca aattacagtg tatcatagac ttgtacaata taatatttcc aattaatgat 361 aaagtaatta gaaaatttga aagaatgata aagcaaagaa aatgtaggaa tcaatataaa 421 attgaatggt ataatcattt gctgctccca attacattaa atgctgctgc atttaagttt 481 gatgaaaata atctttatta tgtttttggg ttatatgaga aatcagtcag tgatatatat 541 gctccatata gaattgttaa ctttataaat gaatttgata aattattgct tgatgatatt 601 aactttacaa gaatgtccaa tctaccaata gagttgagaa ccattatgca agaatacttc 661 caattatcaa gactgccatc atcaaaacta aagcaaattt acttttcaga ttttactaaa 721 gaaactgtga tttttaatac ttatacaaaa acgccaggaa gatcaatata cagaaatgta 781 actgaattta attggagaga tgaattggag ctttattctg atttaaaaaa tgataagaat 841 aaattaattg ctgcaatgat gacgagtaag tatactcggt tctatgctca tgataataat 901 tttggaaggt tgaaaatgac aatatttgag ttgggacatc attgtcagcc taactacgtg 961 gcatctaatc acccaggcaa tgcttccgat atccagtact gtaaatggtg taatataaaa 1021 tattttctta gtaaaattga ttggcggatt cgtgatatgt ataatttatt gatggaattt 1081 attaaggatt gttataaaag taatgttaac gttggacatt gtagttctgt tgaaaacata 1141 tatcctttaa ttaaaagatt aatttggagt ttgtttacta atcacatgga tcaaacaatt 1201 gaagaagtgt ttaatcacat gtcgccagtg tcagttgaag gtacgaatgt catcatgttg 1261 attcttggat tgaatattag tttgtataat gaaattaagc gcactttgaa tgtagatagc 1321 ataccaatgg tacttaattt aaatgaattc agtagtatag ttaaatcaat tagcagtaaa 1381 tggtataatg ttgatgaatt ggataaattg ccaatgtcaa taaaatcaac ggaggaactg 1441 attgaaatga agaattctgg aactttaact gaagaatttg agctactgat ctccaactca 1501 gaagatgaca atgagtgaaa ttactaatgt cactatctaa ttatacagta tttagccatc 1561 acaagaccgt ccagactaga gtagcgccta gctggcaaaa tactgtgaac c // LOCUS SPINR 3284 bp ss-mRNA PLN 06-MAR-1990 DEFINITION Spinach NADH nitrate reductase mRNA, complete cds. ACCESSION M32600 KEYWORDS NADH nitrate reductase; nitrate reductase. SOURCE Spinach, cDNA to mRNA, clone pIMP4. ORGANISM Spinacia oleracea Eukaryota; Plantae; Embryobionta; Magnoliophyta; Magnoliopsida; Caryophyllidae; Caryophyllales; Chenopodiaceae. REFERENCE 1 (bases 1 to 3284) AUTHORS Prosser,I.M. and Lazarus,C.M. TITLE Complete nucleotide sequence of spinach nitrate reductase cDNA JOURNAL Unpublished (1990) Long Ashton Res. Stn, Long Ashton, Bristol, UK STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by I.M.Prosser, 05-MAR-1990. FEATURES from to/span description pept 116 2896 NADH nitrate reductase BASE COUNT 894 a 642 c 849 g 899 t ORIGIN 1 aaaaaccata acacctctgg aaaaatattg aggaagaacc gtagttcttc tttccttttt 61 ctttttttct ttttaattac atttcaaaaa accctacatt tgatcgattt gagctatggc 121 ggcgtcagtc gatcgtcaat atcatccggc gccgatgagc ggtgtcgttc gtaccccatt 181 tagcaaccac caccgttccg actctccggt tcggaacggt tatactttta gtaacccgcc 241 ttcttcaaat ggcgttgtta aacccggtga aaaaatcaag ctcgtagata ataatagtaa 301 tagtaataat ggtagcaata ataataataa tcgctatgat agcgatagtg aagaagatga 361 tgacgaaaat gagatgaatg tctggaatga gatgatcaaa aaaggtaaca gcgaattgga 421 gccgtcttcc gtcgatagcc gtgatgaagg aacggctgat caatggattg agaggaatcc 481 ttcaatgatc cgtctcacag ggaagcaccc gttcaattcg gagcctccat tgacccgatt 541 gatgcaccac gggtttctca ccccggttcc gcttcattac gttcggaacc acggtccagt 601 ccccaacgct aagtgggagg attggaccgt tgaggtgacc gggttagtca aaagaccaat 661 ccggttcact atggaccaat tggtcaatga cttccaaagc cgtgagtttc cggtgacgct 721 tgtctgcgcc ggtaaccgga ggaaggaaca gaatatgacg aaacaatcga tagggtttaa 781 ctggggttcc gccgccgtgt caacttccgt gtggcgcggt gttccacttc gcgacgtgct 841 taagcggtgt ggggtcatga gttcgttaaa aggggctctc aacgtttgtt ttgagggggc 901 tgaggattta ccaggaggtg gcgggtcgaa atatggtaca agtgttaaaa gggagtttgc 961 aatggatccg gcccgtgaca ttattctagc gtacatgcaa aatggggaga agttgagccc 1021 ggatcatggg tatccggttc ggatgatcat acccgggttt attggtgggc ggatggttaa 1081 gtggttgaag aggattattg tgactacaac agagtctgat aattattacc attacaagga 1141 taatagggtc cttccctctc atgttgatgc tgaacttgcc aattctgaag cttggtggta 1201 caaacaagag tacatcataa atgagctgaa cgttaactcg gtgataacat cgccgtgcca 1261 cgaagaaatc ttgcctatca acgcctggac tacccaacgt ccttacacaa tgaggggtta 1321 cgcttattct gggggaggaa gaaaagtgac ccgagtggaa gtgacaatgg atggtggaga 1381 cacatgggac atatgtgaat tggaccacca agagagggga agcaaatatg gtaaattctg 1441 gtgttggtgt ttctggtctt tagaggtcga agtactagac ttactcggag ccaaagagat 1501 tggggtccgt gcttgggatg aaagcctcaa cacccaacct gaaaagctca tttggaatgt 1561 catgggtatg atgaacaact gctggtttcg tgttaagacc aacgtgtgca agcctcacaa 1621 gggagagatc ggtatcgtgt tcgaacaccc gacccaaccc ggtaacaagt cgggcgggtg 1681 gatggcccgt gaacgccacc tcgagatctc cgattctggc ccaacattga agaggacagc 1741 atccacccca ttcatgaaca ccacctcaaa gatgtactca atgtcagagg tgaagaaaca 1801 caacactgct gactcagctt ggattgtagt ccatggaaat gtctacaacg cgacccggtt 1861 cctaaaggat cacccgggtg ggtcagacag tatcctcatc aatgcaggaa ccgactgcac 1921 cgaagagttc gatgcaattc actcagacaa ggccaagagg cttctagaag atttcaggat 1981 aggtgagttg atctcaacgg gctacacctc tgactcgtcc tcgcccggta actcagtcca 2041 tggcgggtcg gtttactccg ggttagccgg gttagcccca attacagaag cggttccatt 2101 acgtaatgtt gctttaaacc cacgggttaa gatcccatgc aaacttattg agaaagtttc 2161 actttcccat gacgtaagaa ggttccgatt cgggttaccc tctgaagacc aggtgttggg 2221 tttacctgta ggaaaacaca tcttcctttg tgctaatgtt gatgataagc tttgtatgag 2281 agcttatact ccatcaagta ccattgatgt ggttgggtac tttgatcttg tggttaaggt 2341 ttacttcaag gatgttcacc ctaggttccc taacggtggg gtgatgtcac aacatctaga 2401 ctcactttct ttgggctcga ttgttgacgt gaaaggccca ttgggccata ttgagtattt 2461 aggaaagggg aatttcacag tccatggaaa gcccaagttt gctaaaaagc ttgctatgat 2521 atcaggtggg accgggatta ccccgattta tcaggtgatg caggcaatac tgaaagatcc 2581 tgaggataag actgagatgc atgttgtgta tgcaaatcgg acggaggagg atatcttgct 2641 tagagaagag cttgataaat gggctgatga gtttagagat cgggttaaag tatggtacgt 2701 tgttgaaaag gcggaggaag ggtggaaata tgataccggg tttatatctg agaagatact 2761 tagagatcac gtgccggcgg ttggggatga tgtgttggca ctcacgtgcg gtcctcctcc 2821 tatgattcaa tttgcggttc aacctaactt ggataagatg ggttttgata ttaaggaaca 2881 gttgttgatt ttttgaaagt actgttacgt tgggaagtga atttactaca ctacccctga 2941 agatatttgt gttacgtgca cattgcaata tcgacgtgtg taacacatgc gttgtgcttt 3001 cgaatgtata gtagaagttg ggtaaaatgg agtatatgaa cactactata tgtattctat 3061 gtatgtaatg taatttgtgg agctttttct tctttcttta cccggtgaaa ttatgtgaat 3121 ataagtgctc cactagattg tttgttgata agctttttgt tgtccttatg aaaattggtt 3181 gatggaggaa gaagtatgta gaaggaagaa attcaagtag ctaaggttta tgtattgtaa 3241 gagggagatt actatttggt agtctccatt taccatatta ttat // LOCUS SRGMDH 4638 bp ds-DNA PLN 06-MAR-1990 DEFINITION S.vulgare NADP malate dehydrogenase gene, compete cds. ACCESSION M31965 KEYWORDS NADP malate dehydrogenase. SOURCE S.vulgare (haplotype 2n) 5 day old leaf mesophyll cell DNA. ORGANISM Sorghum vulgare Eukaryota; Plantae; Embryobionta; Magnoliophyta; Liliopsida; Commelinidae; Cyperales; Poaceae. REFERENCE 1 (bases 1 to 4638) AUTHORS Luchetta,P., Cretin,C. and Gadal,P. TITLE Structure and characterization of the sorghum vulgare gene encoding NADP-malate dehydrogenase JOURNAL Gene (1900) In press STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence [1] kindly submitted by P.Luchetta, 02-MAR-1990. FEATURES from to/span description mRNA 763 783 NADP-malate dehydrogenase, exon 1 873 963 NADP-malate dehydrogenase, exon 2 1461 1547 NADP-malate dehydrogenase, exon 3 1643 1727 NADP-malate dehydrogenase, exon 4 1984 2165 NADP-malate dehydrogenase, exon 5 2420 2490 NADP-malate dehydrogenase, exon 6 2570 2660 NADP-malate dehydrogenase, exon 7 2749 2823 NADP-malate dehydrogenase, exon 8 2912 3013 NADP-malate dehydrogenase, exon 9 3100 3222 NADP-malate dehydrogenase, exon 10 3303 3371 NADP-malate dehydrogenase, exon 11 3456 3524 NADP-malate dehydrogenase, exon 12 3619 3672 NADP-malate dehydrogenase, exon 13 3765 4638 NADP-malate dehydrogenase, exon 14 pre-msg 603 4638 NADP-malate dehydrogenase mRNA and introns IVS 784 872 NADP-malate dehydrogenase intron A IVS 964 1460 NADP-malate dehydrogenase intron A IVS 1548 1642 NADP-malate dehydrogenase intron A IVS 1728 1983 NADP-malate dehydrogenase intron A IVS 2166 2419 NADP-malate dehydrogenase intron A IVS 2491 2569 NADP-malate dehydrogenase intron A IVS 2661 2748 NADP-malate dehydrogenase intron A IVS 2824 2911 NADP-malate dehydrogenase intron A IVS 3014 3099 NADP-malate dehydrogenase intron A IVS 3223 3302 NADP-malate dehydrogenase intron A IVS 3372 3455 NADP-malate dehydrogenase intron A IVS 3525 3618 NADP-malate dehydrogenase intron A IVS 3673 3764 NADP-malate dehydrogenase intron A BASE COUNT 1255 a 1044 c 993 g 1346 t ORIGIN 1 aggcctagtc aaaaagtaaa tgtgattatt tacaaacatt gtctctacat tttgcatggc 61 ttcaaaacca aagatcgcac acacttttgc caaaattttg ccaccgttgc taaccaatcg 121 atgtgtaacg atgagaagtg ccatcacact ctctgaaaag actcaggccc tggcactggc 181 aggaccgcag gaaaggcact gagcaaagat gatggtcgtg ggagaataaa caaccttttt 241 ttttttttgc ctttgtaata aatgtccccg ccgtctctca cgtagtacgt agtttcggtg 301 tggctttgtt ttaaaaggga accaaatcta ccaatgagtc tttttaaaaa gaagaaaaat 361 actggtcagc aaaactgaga ttctccagca acgaaaatga aaaagacttg gttgtcagtg 421 ttaaccacca ccaggatagc aggcgtgtgg acgtacaaga tgccgaacag tcgctgtcgc 481 gcgacgtgtc ccgcgccggg cggtccgctc gccttgcctg ccgctaatcc aattcctacg 541 cgcgctcctg ctcctgcgta caaccagaga ggctataatc tcctccactc cctcccctgt 601 agctgcagcc cgttccactc ctatccatcc gtgctccagc tccaccatgg gcctctcaac 661 ggcttactcc ccggtcggat cgcacctcgc gccggcgcct ctcgggcacc gtcgctccgc 721 ccagctccac cgcccgcgcc gggccctgct cgccaccgtc cgatgctccg tcgacgccgc 781 caagtgagta gtagtatact acccggacgt ggacatgaat gcgtgccatt tatcaactac 841 taggctgact agccccctat acgttgatgc aggcaggtgc aggatggcgt ggcgacggcg 901 gaggcgccgg cgacgcggaa ggactgcttc ggggtcttct gcaccaccta cgacctcaag 961 gcggtgagtc cctcccttcg cctcaggcgg tggtggtcct ccccggttcc cggtccggcc 1021 tccccgtgtg tgttcgctga aatgctccat cgacttttgc cttttgcttc aaactcaaag 1081 tagaatccct tttccaccta aagctagcaa cgttgctttt ctccaaaaga gaaaaaaaaa 1141 gaaaaagcta gcacgtcgtt ggtgtcatgg tgtggaccgt tggaggtgtg gtgtggatat 1201 atgctacgat actggatagc ttgcactcat atcaacagct ggcgttatgc ctctataaat 1261 aaaaataaat atcatacaga ccaccaaaac caaagcgcag ggtggagtgc cacgcatgag 1321 acgctgcact gcacgccact gctgcttgac ttaatttggt ttctaattaa cttgctgctg 1381 tatatatcgt catatagtag tacagtacta acaatctgtc atcctatgcc attgaatgga 1441 ttgggagttg ggacacacag gaagacaaga ccaagtcgtg gaagaagcta gtgaacattg 1501 ctgtgtcagg cgcggctggg atgatatcga accacctgct gttcaaagta actaaatcgt 1561 ttactgactc tacttgtttc ctatcttgag ttgcagtatt gtaatgctga ccattcattt 1621 tattaacaca aaaaaaattc agcttgcctc tggtgaggtt ttcggacaag accaaccaat 1681 agcactgaag ttacttggtt cagagagatc gtttcaagct ctcgaaggtg aatgactgac 1741 tgcctcttct tctttctgcc ttgctatctg agaagaagaa tcaattcata ttttccactc 1801 actctgttcc aaaagctaaa atgccttata tataatttgg aacaggggga gtacatacgt 1861 atagctttag tttcaagctg cccttacagc ttccagtgtt gtactctctt atatcaatgg 1921 attactgtga cagtgaatat ttatcacttc ttcctcatgt gctctcaaaa actgcaactg 1981 taggtgttgc tatggaactg gaggactcgc tatatccatt gctgagggaa gtcagcattg 2041 gtatagatcc ttatgaggtc tttgaagatg tagattgggc ccttcttatt ggtgctaagc 2101 cccgaggtcc tggcatggag cgagctgcgt tactggatat caatggtcaa atctttgctg 2161 atcaggtttc ttttgcactc tgttcctcct ttcttctcaa gtactatgtc accaaattgt 2221 aagaagtcct ctcaggaaag gtgccaagaa gcattttctg aaacttcaac ctcaaatttt 2281 aaggattcct aagcattaga acctgcaaat tgccaaaacc agaaaccaag tacaaatttg 2341 agtcaaagcc cacctaaata aacgaccaga gtaacctgtt attcttgtta ttatctgaat 2401 tttttttaaa ttgtgtcagg ggaaagcact taatgccgtg gcctcgaaga acgtgaaagt 2461 cttagttgtt ggaaatccct gtaacactaa gtacggacat tgattgttca ccaagattaa 2521 cgattttatg tcatcaccat gtcttagttg tggcttgccc ttgatgcagt gcgttaattt 2581 gcttgaaaaa tgctccagac ataccagcaa aaaattttca tgcactgacg aggttggatg 2641 aaaatagagc aaagtgccag gtgactgacg ttgaacctta ttatgtctcg tccttccatt 2701 ccaaacagga aaacagatta agaaatcaca cacaatctcc ttttgtagct agcactcaaa 2761 gcaggtgtat tttatgacaa agtatcaaac gtgactattt gggggaacca ttcgacaact 2821 caggtgaagt tttgcattat aaattaaatg agcagactat tttgcagaca catttcttcg 2881 ctgaatggtt ggcactaact tctctgtgta ggttcctgat ttcttgaatg ccaaaattga 2941 tgggagacca gtgaaagaag tcattaagga taccaagtgg ttagaagaag agttcaccat 3001 tacggttcaa aaggtatgac aatccatttg gagtgtctgt tactcggata aataaacaag 3061 gttgtgacca taattcgttt cattcttata tctaaacagc gtggaggtgc gctcatccaa 3121 aaatggggca gatcttcagc tgcatcaacc gctgtatcaa tagcggatgc tattaaatcc 3181 ctggtaactc ctaccccaga aggcgactgg ttctccacag gggtaagttt tgatgccttt 3241 gaattgatca cagctttatt tcggcattcc tctaatggaa aaaagtaata aaaaaaacac 3301 aggtttatac gactggaaat ccttatggca tagcagagga tatcgtgttc agcatgccat 3361 gcagatcgaa ggtatgatct actaatcaaa ttgtccttat tatttttgtc ctaagataag 3421 gtaccactcc aatgatcgta aaattcttcc aacagggtga tggtgattac gaactagcta 3481 ctgatgtgtc aatggacgat tttctctggg aacggattaa aaaggtaagg gctataggaa 3541 ctgccatcac tgatatgctg agaatataag cagcactaga gattttaccc taacagttgg 3601 ccctgtacat attttcagag tgaagctgaa ttgcttgctg agaagaaatg cgttgcccat 3661 cttactggag aggtatgtca cagtgacgta ttttatttag cttaaaaaga gagaaaccgt 3721 gccggaaagg attggcttag ctttttctgt ttgtgcaaat gcaggggaat gcatattgtg 3781 atgttccgga ggataccatg ctaccgggag aagtgtagtt tagttacaga gaagcaatag 3841 caggggaatg catttgtcta tactatagct gagactcgag agaaacagct gcttgcaaga 3901 tgcagcaact acttttactt atatatatat attgtttata ttctgtaccc gctcagagcg 3961 ttgtgtgcac acgctgtgtg attccgggac cgtctttacg atcgttttgc cttttttggg 4021 ccataccgct tcgctcccgt gttgcgcttc cattattgag ttattaccac tctgttttat 4081 aatatttatt actgccgaga ttttgtgtag atattacaat cttgtgtttg gttgtgtctc 4141 tgctcatacg caagtgctgc tgcaactctt gtttttattc acatttagtt acaactgtgt 4201 tattttgtag taacgacgtg ttatatgtgc tgttacaatc gtattgtaac tctattcttt 4261 tcctcatttc tccttttact gtacttattt aacagaatta gtcgattttc agtcacagaa 4321 taatattttc tctcacaaca aatcaacatc aatcagtgtc gtcgaatctc tacctcgtgg 4381 agccatggac cgttgtctcg tgggcaatcc caagtgagcc ctatatgagg cagttctttc 4441 gctaatcatt tttaaaacat ataccacaag acccatacac cgattatgtt tgaaactttc 4501 aatgctagca acttcatcga catacagggc tcacacgtta gtaaaaaaat gtgaattaag 4561 cactgtctct gcattttgca tggcttcaaa caaaagatct cgcactttga ctacatttcg 4621 ccactgctgc taaggctt // LOCUS TRBHSRP 2378 bp ds-DNA INV 15-FEB-1990 DEFINITION T.brucei heat shock-related protein gene, complete cds. ACCESSION M32139 KEYWORDS heat shock-related protein. SOURCE T.brucei (strain 427-60) DNA, clone H8. ORGANISM Trypanosoma brucei Eukaryota; Animalia; Protozoa; Sarcomastigophora; Mastigophora; Kinetoplastida; Trypanosomatina; Trypanosomatidae. REFERENCE 1 (bases 1 to 2378) AUTHORS Lee,M.G.-S., Polvere,R.I. and Van der Ploeg,L.H.T. TITLE Evidence for segmental gene conversion between a cognate hsp 70 gene and the temperature sensitively transcribed hsp 70 genes of Trypanosoma brucei JOURNAL Mol. Biochem. Parasitol. (1900) In press STANDARD full staff_entry COMMENT The conserved domains are 100 % homologous to the corresponding regions of Hsp70 genes 2-6 of T.brucei. Draft entry and computer-readable sequence [1] kindly submitted by L.T.H.Van der Ploeg, 16-FEB-1990. FEATURES from to/span description pept 331 2361 heat shock-related protein site 301 463 conserved domain site 532 613 conserved domain site 732 920 conserved domain site 985 1028 conserved domain BASE COUNT 575 a 535 c 717 g 551 t ORIGIN 1 bp upstream of HindIII site. 1 aagcttcctt tattttcttt ttccttttga gctaactcta cctccgtagc cgccagcgtt 61 ctcaatgcag ttttgcgtgt aagtcacttg cctcttggcc aagtagactc ggtatgcaac 121 accaggagcg ccgcacgcat aacaatttgc gtttacgcgt ctacgttagt gtgcagatgt 181 tatgatactt ttgttttaac tttgctccgc gtccttggcg ttggtattgt tgcaatcttt 241 catcggccag tgcctgtata tttcaatatc ctcttgcttt tcatccttgc tactgcctct 301 gaaaacaagg aatatctgcc tctttgaagg atgacatacg aaggcgccat tggtatcgac 361 ctcggtacga cgtactcgtg cgttggtgtg tggcagaacg aacgtgtgga aatcattgca 421 aatgaccagg gtaaccgtac gacgccgtcc tacgttgctt tcgtcaataa tgaggttctt 481 gttggcgatg cggcgaagag ccacgccgct cgtggctcga atggtgttat attcgacgcg 541 aaacggctga tcggacggaa gttctccgac tctgtcgtgc agtccgacat gaagcactgg 601 cccttcaagg tcgaggaggg cgagaaaggt ggtgctgtga tgagggtaga gcatcttgga 661 gaggggatgt tactgcagcc cgagcaaata tctgcccgtg tccttgcgta cctgaagtct 721 tgtgcggagt cgtatcttgg gaagcaggtg gctaaggctg tcgtaacagt gcctgcatac 781 ttcaacgact cgcagcgcca ggcaacgaag gacgcgggga cgattgctgg tctggaggtg 841 ctgcgcatca tcaacgagcc aacggctgct gcaattgcct atggcctgga caaggctgat 901 gaggggaagg aacgcaacgt tttggtgttt gactttggag gtggaacgtt tgacgtctca 961 attatttcag taagcggcgg tgttttcgag gtgaaagcca caaatggtga cacccacctt 1021 ggtggagagg atgtggacgc cgcactgctg gagcacgcac ttgctgatat aaggaaccgt 1081 tatggaatag aacagggctc tctgtcccaa aagatgctat ccaagctgcg ttcacggtgt 1141 gaggaagtga agcgagtact gtcacactcg acagtggggg aaattgcact cgacgggctg 1201 cttcccgacg gcgaagaata cgtgctcaag ctaacgcgag cgaggttgga agagctatgc 1261 acgaaaattt ttgctcgctg tttgagtgtg gtacagaggg cgttgaagga cgcttcaatg 1321 aaggttgagg atattgagga tgttgtgctt gtcgggggaa gttctcggat tcctgctgtg 1381 caggctcagt taagggagct attcagaggt aagcaactct gcagcagtgt gcaccctgac 1441 gaggcagtgg cgtatggggc tgcatggcag gcgcacgtac tttccggtgg atatggggag 1501 agctctagga cagcaggtat agtgcttctg gatgttgtgc ccctttcaat cggcgttgaa 1561 gtggacgacg gtaagtttga tgtgataatt cgtcggaaca caaccattcc ataccttgcg 1621 accaaggagt acagcactgt ggatgacaat cagagtgaag tggagataca ggttttcgaa 1681 ggtgagcgcc ccctcacgcg tcacaaccac aggctaggct cctttgttct tgacggtatt 1741 accccggcaa agcacggtga accgactatc accgttactt ttagtgttga cgccgacggg 1801 atcctgaccg ttacagccgc agaagaactc gggagcgtaa caaaaacact tgttgtggag 1861 aactcagagc gacttaccag tgaggaagta cagaagatga ttgaagttgc acaaaagttt 1921 gccttaacgg acgccaccgc tctggcgagg atggaggcca ccgagaggct cacgcagtgg 1981 tttgaccgac tcgaagctgt aatggagacc gtcccgcaac catactcaga gaagcttcaa 2041 aagcgtattg cctttcttcc gcacgggaaa gagtgggtcg gcacacagct tcatacgtac 2101 actgacgcgg cgtcgataga ggcgaaagta gcgaagattg aacggcttgc aaagagggcc 2161 ctgaaatcgg cgcgtcgtga aggaaaggat ggatgggcac cgggcaatga ggacaatgga 2221 agcggtgatg acaacgacgg tgacgataac agtgatgagg aggacgagct ccaaaggggg 2281 aggggcgtga cagagggatc ggggaggtcc ccgataagga aacgcgaccg catagaggct 2341 atcaacgcaa acacagagta agtcgtagag gtgtctaa // LOCUS YSCALG1 2011 bp ds-DNA PLN 06-MAR-1990 DEFINITION S.cerevisiae beta-mannosyltransferase (alg1) gene, complete cds. ACCESSION J05416 KEYWORDS beta-mannosyltransferase. SOURCE S.cerevisiae DNA. REFERENCE 1 (bases 1 to 2011) AUTHORS Albright,C.F. and Robbins,P.W. TITLE The sequence and transcript heterogeneity of the yeast gene ALG1, an essential mannosyltransferase involved in N-glycosylation JOURNAL J. Biol. Chem. (1990) In press STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by C.F.Albright 06-MAR-1990. FEATURES from to/span description pept 115 339 upstream ORF pept 379 1728 beta-mannosyltransferase (alg1) mRNA 102 1826 alg1 mRNA (alt., both ends approx.) mRNA 203 1826 alg1 mRNA (alt., both ends approx.) BASE COUNT 639 a 341 c 390 g 641 t ORIGIN Chromosome II; 2.1 map units from Lys2. 1 gggcaataaa aaattctaag aatagaatca tatcacagta cacgaaaata taaatacatc 61 ttgaaaatgt ctcaaattgg gtggaaaaga gggtgcttga attcattagg acatatggtt 121 gatgttctca ggttctactt atgtttgctt tgtcgatttt tgcatgcttt gactgtgact 181 ttcctgtctg atatctttgt atggcttgtt gctaaaacaa ggtcaatcca agcagtgatc 241 atcttgcacg ttgcttcaat agaacgcgca tatagcaatc atcaagttaa ttggtcatat 301 atctttcaat cagcaattag caaagccatt agagggtaat tgtaacggca ttttaggggc 361 tatagtcaat tggtgaaaat gtttttggaa attcctcggt ggttacttgc cttaataata 421 ttataccttt ccataccgtt agtggtttat tatgttatac cctacttgtt ttatggcaac 481 aagtcgacca aaaaaaggat catcatattt gtgctgggtg atgtaggaca ctctccaagg 541 atatgctatc acgctataag tttcagtaag ttaggttggc aagtcgagct atgcggttat 601 gtggaggaca ctctacccaa aattatttcc agtgatccaa atatcaccgt ccatcatatg 661 tcaaacttga aaagaaaggg aggcggaaca tcagttatat ttatggtaaa gaaggtgctt 721 tttcaagttt taagtatttt caaattactt tgggaattga gaggaagcga ttacatacta 781 gttcaaaatc caccgagcat acccattctt ccgattgctg tgctatacaa gttgaccggt 841 tgtaaactaa ttattgattg gcacaatcta gcatattcga tattgcaact aaaatttaaa 901 ggaaactttt accatccttt agtgttgata tcttacatgg tagagatgat attcagcaaa 961 tttgctgatt ataacttgac tgttactgaa gcaatgagga aatatttaat tcaaagcttt 1021 cacttgaatc caaagagatg tgctgttctc tacgaccgcc cggcttccca atttcaacct 1081 ttggcaggtg acatttctcg tcaaaaagcc ctaactacca aagcctttat aaagaattat 1141 attcgcgatg attttgatac agaaaaaggc gataaaatta ttgtgacttc aacatcattc 1201 acccctgatg aagatattgg tattttatta ggtgccctaa agatttacga aaactcttat 1261 gtcaaatttg attcaagttt gcctaagatc ttgtgtttta taacgggtaa aggaccacta 1321 aaggagaaat atatgaagca agtagaagaa tatgactgga agcgctgtca aatcgaattt 1381 gtgtggttgt cagcagagga ttacccaaag ttattacaat tatgcgatta cggagtttcc 1441 ctgcatactt caagttcagg gttggacctg ccaatgaaaa ttttagatat gtttggctca 1501 ggtcttcctg ttattgcaat gaactatcca gtgcttgacg aattagtaca acacaatgta 1561 aatgggttaa aatttgttga tagaagggag cttcatgaat ctctgatttt tgctatgaaa 1621 gatgctgatt tataccaaaa attgaagaaa aatgtaacgc aggaagctga gaacagatgg 1681 caatcaaatt gggaacgaac aatgagagat ttgaagctaa ttcattgagt caatggtaac 1741 tcagcctttc ttttttgaaa attactattt tcgactcttt tttatacagt tacatagtac 1801 tacctctaat acacattcat gattaacaat gtttcaaaca atataaagtc ccgataacga 1861 ccttttgaag tggtgacgtt accgctcttc gttgacaaga ttcaagaggg ctgtcagaat 1921 aacagctatc atggtggaaa gtaggtgacc ctttgaacaa ggcagcatat atccatcgac 1981 tatcatgttc ctaaaatgtt atcatagccg g // LOCUS YSCHEM15 2916 bp ds-DNA PLN 07-MAR-1990 DEFINITION S.cerevisiae ferrochelatase (HEM15) gene, complete cds, and Val-tRNA gene. ACCESSION J05395 KEYWORDS delta repetitive element; ferrochelatase; protoheme ferro-lyase; transfer RNA-Val. SOURCE S.cerevisiae (strain FL200) DNA, clones pHEM15-[1,2]. ORGANISM Saccharomyces cerevisiae Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes; Endomycetales; Saccharomycetaceae. REFERENCE 1 (bases 1 to 2916) AUTHORS Labbe-Bois,R. TITLE The ferrochelatase from Saccharomyces cerevisiae: Sequence, disruption and expression of its structural gene HEM15 JOURNAL J. Biol. Chem. (1990) In press STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by R.Lobbe-Bois, 06-MAR-1990. FEATURES from to/span description pept 841 2022 ferrochelatase precursor (EC 4.99.1.1) sigp 841 933 ferrochelatase signal peptide matp 934 2019 ferrochelatase tRNA 2253 2325 Val-tRNA rpt 2435 2765 variant solo delta element rpt 2776 2888 truncated delta-element BASE COUNT 907 a 563 c 527 g 918 t 1 others ORIGIN 1 aaaatagcat tcagtgggaa cgatcccaat aagcagatag catatcgcag tgtaaaacta 61 tcaatcccat agttggtaat tatctttgtt aaaacagcgt ccacaggatt gtacatgttg 121 gaagcgatca gcagcagaat taaatcacca cggtttgtct tggcgtatng gtttgtcttc 181 ttgcttgacc tgtgcctgta ttttgtcctt ggctttaaat ctgtttaata agaacccttt 241 atttagatga ccattgtttt cgacggtttg tggtttcgac gcgcaatgtg aaagatcttc 301 aatgttttgt tgaacagtaa agcaaaacag ctcaacagca ggacaaaaaa agctaaaaaa 361 gaactcttcg cacttttaac aagcagtttt actgaacaat tttagaagta aatttcagga 421 aggttcaaca gctttacatg aaagaggatg ctggccattt ttttttttta aaaaagtttt 481 ttatatacac aaactcagta aaataattac aaatatgtag catgtgtagg atgccttgaa 541 acatctccgc tggcaatagc gttagacatg cccaatcttc aatggtcacc gtaagcctta 601 tatgacgaag gagttgctgc ccttcgtttt ctcatgctgc tttttttcgt aggggatatg 661 ggattgaaga aggaggacgc gaaggaatac ttttggtcgg tcaaggtcac tcttggcttg 721 acaccaaaca agcgtaaatt ctcgtgttcc acttgagtcg gtttacaaga agtacagaaa 781 tatttaacac atacctgcta tttggacaaa tttagcagta aaaaaaagaa cgtttaaaga 841 atgctttcca gaacaatccg tacacaaggt tccttcctaa gaagatcaca actgaccatt 901 acaagatcat tttcggttac attcaacatg cagaatgcac aaaagagatc acccacagga 961 attgttttga tgaacatggg tggcccctct aaagttgagg aaacatatga ttttttgtat 1021 caattatttg ccgataatga cctaattccc attagtgcta agtatcagaa gacaattgct 1081 aaatatattg ctaagtttcg tacccccaag atagagaagc aatataggga aattggtggg 1141 ggctccccaa tccggaaatg gtctgagtat caagccactg aggtctgtaa aatcttagat 1201 aaaacctgtc cagaaacggc gcctcataag ccttacgtgg cgtttcgtta cgcaaagccg 1261 ctaaccgcag aaacttataa acaaatgcta aaagatggcg tgaagaaggc agtggccttt 1321 tctcaatatc ctcatttctc ttattccact accgggtcat ccatcaacga attgtggaga 1381 cagattaagg cattggactc cgagagatct atatcttggt cggttattga tcgttggcct 1441 acaaatgaag gtctaatcaa ggccttctcc gaaaatatca ccaaaaaact acaagagttt 1501 ccgcaacctg tcagagacaa ggttgtttta ttgttttccg cacattctct acccatggat 1561 gttgttaaca ccggtgatgc ctacccagct gaggtagctg cgacggttta caacatcatg 1621 caaaaattaa agtttaaaaa cccttatagg ttggtttggc aatcccaagt tggaccaaaa 1681 ccatggttgg gagcgcagac agctgaaatt gcggaatttt taggccccaa agttgatggc 1741 ctaatgttta ttcctatcgc ctttacctct gatcatattg aaacattgca tgaaattgac 1801 ttaggcgtca ttggggaatc ggaatataag gataaattta agagatgcga atctttaaat 1861 ggcaaccaga cctttattga aggcatggca gatctcgtca aaagccactt acagagtaac 1921 caactctatt ctaatcaact acctcttgat tttgcacttg gcaagtccaa tgatcctgta 1981 aaggaccttt cattggtatt tggcaatcac gaatctactt gataagggcc attcatccca 2041 caatctcaat atcagtatat tctcttattt attaattgag gttgcgtatt tatgttttct 2101 ttgtggtata taatatatca ttatttattg cattttaaaa cttttttttt cattattttt 2161 ctctgttgaa tgcagtcaac atctccctta atattcgata ctcatcttta atactatcag 2221 caaaaagttt aagataaaat attcttaaaa acgatttcgc ccaggatcga actggggacg 2281 ttctgcgtgt taagcagatg ccataaccga ctagaccacg aaaccactat ttgttgaaat 2341 ctttaacaag cgaggaaact ttagtgaaat attaaaagga ctaaaccgga ccggttcttc 2401 tatttctaca tatctaatca cccaatgtag tttttgaata tttgtgaata gttggataat 2461 tgttgggatc ccattgtgca caaaggccat aatattatgt ctacagaata tactagatgt 2521 cctccctata ggatatagta atcctctaaa tggaaccgat atttctacat aataatatta 2581 cgattattcc tccttccgtt ttatatgttt cattatccta gcacactatc aatctttgca 2641 tttcagcttc cattagattt gatgactatt tctcaatctt tatgttatct ccttacgccg 2701 catgtgataa tatactgcta gtatgactac tagttgatag aagatagttg atttttactc 2761 caacaaaagt aacaatatta tttagaacta tagattccat tttgtgcatt cccatattct 2821 cgaggaaaac ttttagtata ttctgtagac ataatattat cgcctttgtg aacaatagaa 2881 tcccaacaat tgtcgcaaat ttaccaattt tctaga // LOCUS YSCOSCP 1451 bp ds-DNA PLN 05-MAR-1990 DEFINITION S.cerevisiae oligomycin sensitivity conferring protein (OSCP) gene, complete cds. ACCESSION M32487 KEYWORDS oligomycin sensitivity conferring protein. SOURCE S.cerevisiae (strain D-273-10B, ATCC 25657) DNA, clone 62-17. ORGANISM Saccharomyces cerevisiae Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes; Endomycetales; Saccharomycetaceae. REFERENCE 1 (bases 1 to 1451) AUTHORS Misook,U., Jones,D. and Mueller,D.M. JOURNAL Unpublished (1990)Univ. of Health Sci., North Chicago, IL 60064 STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence [1] kindly submitted by U.Misook, 02-MAR-1990. FEATURES from to/span description pept 498 1136 oligomycin sensitivity conferring protein (OSCP) mRNA 429 > 1136 OSCP mRNA (alt.) mRNA 430 > 1136 OSCP mRNA (alt.) BASE COUNT 484 a 245 c 273 g 449 t ORIGIN 1 agatctggct gcaaacctgg tctacataaa tgtagtacat tataagggac taccaagaaa 61 tacgaagcgc tcagatatct cttaaagttc atttttttgc cggaataatc ttggtttagt 121 aacttataaa actacttacg tccaataaat cgttatattt tatgccgtcg tcataaagtg 181 gacactttcg agagacggtg ctggccaata agatgctgat atttgaaata tcatttaccc 241 gggcccgggt gggaaaaaca aggaattaaa gataatgaag tgatcttttg gcgagatagt 301 aaaactttca attatctagg aacaatttca aacgccaatt gttaactttg tgtacagtat 361 ttctagtttt tatctcattg aaaacaaaat tcagtcaact tggaaattta agcaggctac 421 tgtggcattg agatattttt ccgcagagta agaaccgatt agcaagttac cgcattattt 481 ttaaaccgtg gacaatcatg tttaatagag tctttaccag gtcatttgca tcaagcttaa 541 gagctgctgc ttccaaagct gctgctcccc caccggtgag attgttcggt gttgagggca 601 cttacgcaac agctttatat caggccgctg caaagaactc ctctattgat gctgcctttc 661 aatctttgca aaaagttgaa agcacagtca aaaagaatcc aaaattaggt cacttattac 721 tgaatcctgc attgtcattg aaggacagaa attctgtcat tgatgccatc gtggagactc 781 acaagaattt ggatggttat gtcgttaacc tattgaaggt tctttccgaa aataacagac 841 tgggatgttt tgaaaaaatt gcgtctgact ttggcgtttt aaacgatgct cacaacgggt 901 tacttaaagg tacagtgact agtgctgaac cattggatcc taaatctttc aagaggattg 961 aaaaagcttt gagcgcctcc aaattggtgg ggcaaggtaa atctttgaag ctggaaaacg 1021 tggttaagcc agagattaag ggtggtttga ttgtagaact tggtgacaag actgttgact 1081 taagcatttc tacaaagatt caaaaactga ataaggtctt agaggacagc atttaacgtt 1141 aataatatag ttggatatgc tcaactcgtt tgatgttact ttctttgcct ccctcctaat 1201 ttttgattct gttcaaatat atattctttt ccgtgaattt ttcctaaaca tcaaaattac 1261 gtacatatac gaataaataa gaaacccctc taaaaagtga atttttttgt tcgagacaac 1321 cattttaatc atatatatgt tcgtgtatcc aggcaaactt atattaatcg caaaaaaaaa 1381 aattataagt gtatatatgg aataaacata aaaaataata aaagaagaaa aaaaaaaata 1441 atgaagaatt c //