Path: utzoo!attcan!uunet!jarthur!usc!snorkelwacker!bionet!root From: GenBank-Updates@genbank.bio.net Newsgroups: bionet.molbio.genbank.updates Subject: Database Update Message-ID: Date: 28 Jul 90 12:00:13 GMT Sender: root@genbank.BIO.NET Distribution: bionet Lines: 3355 Approved: lear@genbank.bio.net Checksum: 39103 194 LOCUS MUSMDR1A 4924 bp ss-mRNA ROD 28-JUL-1990 DEFINITION Mouse P-glycoprotein (mdr1a) mRNA, complete cds. ACCESSION M33581 KEYWORDS P-glycoprotein. SOURCE Mouse (strain BALB/c/NIH) macrophage-like cell line J774.2-vinblastine resistant subline J7.V1-1, cDNA to mRNA, library pUC18-cDNA and pGEM-zf, clones pV1.PRC2, pV1.3, pV1.20, and pV1.10. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 4924) AUTHORS Hsu,S., Cohen,D., Lothstein,L., Kirschner,L.S., Hartstein,M. and Horwitz,S.B. TITLE Structural analysis of the mouse mdr1a (P-glycoprotein) promoter reveals the basis for differential transcript heterogeneity in multidrug-resistant J774.2 cells JOURNAL Mol. Cell. Biol. 10, 3596-3606 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by L.S.Kirschner, 05-APR-1990. Albert Einstein College of Medicine, 1300 Morris Park Ave, Bronx, NY 10461 FEATURES from to/span description pept 137 3967 P-glycoprotein (mdr1a) signal 4315 4320 poly-A signal signal 4898 4903 poly-A signal BASE COUNT 1450 a 1021 c 1210 g 1243 t ORIGIN Chromosome 5. 1 acagtggaac agcggtttcc aggagctgct ggtcccatct tccaaggctc tgctcaactc 61 agagccgctt cttccaaagt ctacatcttg gtggactttg cagaggaaac cgggagtaga 121 gacacgtgag gccgtgatgg aacttgaaga ggaccttaag ggaagagcag acaagaactt 181 ctcaaagatg ggcaaaaaga gtaaaaagga gaagaaagaa aagaaaccag cagtcagtgt 241 gcttacaatg tttcgttatg caggttggct agacaggttg tacatgctgg tgggaactct 301 ggctgctatt atccatggag tggcgctccc acttatgatg ctgatctttg gtgacatgac 361 agatagcttt gcaagtgtag gaaacgtctc taaaaacagt actaatatga gtgaggccga 421 taaaagagcc atgtttgcca aactggagga agaaatgacc acgtacgcct actattacac 481 cgggattggt gctggtgtgc tcatagttgc ctacatccag gtttcatttt ggtgcctggc 541 agctggaaga cagatacaca agatcaggca gaagtttttt catgctataa tgaatcagga 601 gataggctgg tttgatgtgc atgacgttgg ggagctcaac acccggctca cagatgatgt 661 ttccaaaatt aatgaaggaa ttggtgacaa aatcggaatg ttcttccagg caatggcaac 721 attttttggt ggttttataa taggatttac ccgtggctgg aagctaaccc ttgtgatttt 781 ggccatcagc cctgttcttg gactgtcagc tggtatttgg gcaaagatat tgtcttcatt 841 tactgataag gaactccatg cttatgcaaa agctggagca gttgctgaag aagtcttagc 901 agccatcaga actgtgattg cgtttggagg acaaaagaag gaacttgaaa ggtacaataa 961 caacttggaa gaagctaaaa ggctggggat aaagaaagct atcacggcca acatctccat 1021 gggtgcagct tttctcctta tctatgcatc atatgctctg gcattctggt atgggacttc 1081 cttggtcatc tccaaagaat actctattgg acaagtgctc actgtcttct tttccgtgtt 1141 aattggagca ttcagtgttg gacaggcatc tccaaatatt gaagccttcg ccaatgcacg 1201 aggagcagct tatgaagtct tcaaaataat tgataataag cccagtatag acagcttctc 1261 aaagagtggg cacaaaccag acaacataca aggaaatctg gaatttaaga atattcactt 1321 cagttaccca tctcgaaaag aagttcagat cttgaagggc ctcaatctga aggtgaagag 1381 cggacagacg gtggccctgg ttggcaacag tggctgtgga aaaagcacaa ctgtccagct 1441 gatgcaaagg ctctacgacc ccctagatgg catggtcagt atcgacggac aggacatcag 1501 aaccatcaat gtgaggtatc tgagggagat cattggtgtg gtgagtcagg aacctgtgct 1561 gtttgccacc acgatcgccg agaacattcg ctatggccga gaagatgtca ccatggatga 1621 gattgagaaa gctgtcaagg aagccaatgc ctatgacttc atcatgaaac tgccccacca 1681 atttgacacc ctggttggtg agagaggggc gcacgtgagt gggggacaga aacagagaat 1741 cgccattgcc cgggccctgg tccgcaatcc caagatcctt ttgttggacg aggccacctc 1801 agccctggat acagaaagtg aagctgtggt tcaggccgca ctggataagg ctagagaagg 1861 ccggaccacc attgtgatag ctcatcgctt gtctaccgtt cgtaatgctg acgtcattgc 1921 tggttttgat ggtggtgtca ttgtggagca aggaaatcat gatgagctca tgagagaaaa 1981 gggcatttac ttcaaacttg tcatgacaca gacagcagga aatgaaattg aattaggaaa 2041 tgaagcttgt aaatctaagg atgaaattga taatttagac atgtcttcaa aagattcagg 2101 atccagtcta ataagaagaa gatcaactcg caaaagcatc tgtggaccac atgaccaaga 2161 caggaagctt agtaccaaag aggccctgga tgaagatgta cctccagctt ccttttggcg 2221 gatcctgaag ttgaattcaa ctgaatggcc ttattttgtg gttggtatat tctgtgccat 2281 aataaatgga ggcttacagc cagcattctc cgtaatattt tcaaaagttg taggggtttt 2341 tacaaatggt ggcccccctg aaacccagcg gcagaacagc aacttgtttt ccttgttgtt 2401 tctgatcctt gggatcattt ctttcattac attttttctt cagggcttca catttggcaa 2461 agctggagag atcctcacca agcgactccg atacatggtt ttcaaatcca tgctgagaca 2521 ggatgtgagc tggtttgatg accctaaaaa caccaccgga gcactgacca ccaggctcgc 2581 caacgatgct gctcaagtga aaggggctac agggtctagg cttgctgtga ttttccagaa 2641 catagcaaat cttgggacag gaatcatcat atccctaatc tatggctggc aactaacact 2701 tttactctta gcaattgtac ccatcattgc gatagctgga gtggttgaaa tgaaaatgtt 2761 gtctggacaa gcactgaaag ataagaagga actagaaggt tctggaaaga ttgctacgga 2821 agcaattgaa aacttccgca ctgttgtctc tttgactcgg gagcagaagt ttgaaaccat 2881 gtatgcccag agcttgcaga taccatacag aaatgcgatg aagaaagcac acgtgtttgg 2941 gatcacgttc tccttcaccc aggccatgat gtatttttct tatgctgctt gtttccggtt 3001 cggtgcctac ttggtgacac aacaactcat gacttttgaa aatgttctgt tagtattctc 3061 agctattgtc tttggtgcca tggcagtggg gcaggtcagt tcattcgctc ctgactatgc 3121 gaaagcaaca gtgtcagcat cccacatcat caggatcatt gagaaaaccc ccgagattga 3181 cagctacagc acgcaaggcc taaagccgaa tatgttggaa ggaaatgtgc aatttagtgg 3241 agtcgtgttc aactatccca cccgacccag catcccagtg cttcaggggc tgagccttga 3301 ggtgaagaag ggccagacgc tggccctggt gggcagcagt ggctgcggga agagcacagt 3361 ggtccagctg ctcgagcgct tctacgaccc catggctgga tcagtgtttc tagatggcaa 3421 agaaataaag caactgaatg tccagtggct ccgagcacag ctgggcattg tgtcccaaga 3481 gcccattctc tttgactgca gcatcgcaga gaacattgcc tacggagaca acagccgggt 3541 cgtgtcttat gaggagattg tgagggcagc caaggaggcc aacatccacc agttcatcga 3601 ctcgctacct gataaataca acaccagagt aggagacaaa ggcactcagc tgtcgggtgg 3661 gcagaagcag cgcatcgcca tcgcacgcgc cctcgtcaga cagcctcaca ttttacttct 3721 ggacgaagca acatcagctc tggatacaga aagtgaaaag gttgtccagg aagcgctgga 3781 caaagccagg gaaggccgca cctgcattgt gatcgctcac cgcctgtcca ccatccagaa 3841 cgcggacttg atcgtggtga ttcagaacgg caaggtcaag gagcacggca cccaccagca 3901 gctgctggcg cagaagggca tctacttctc aatggtcagt gtgcaggctg gagcaaagcg 3961 ctcatgaact gtgaccatgt aagatgttaa gtatttttat tgtttgtatt catatatggt 4021 gtttaatcca agtcaaaagg aaaacactta ctaaaatagc cagttatcta ttttctgcca 4081 cagtggaaag catttagttt ggtttagagt cttcagaggc tttgtaatta aaaaaacaaa 4141 aatagataca gcatcaaatg gagattaatg ctttaaaatg cactataaaa tttataaaag 4201 ggttaaaagt gaatgtttga taatatatac ttttatttat actttctcat ttgtaactat 4261 aactgatttc tgcttaacaa attatgtatg tatcaaaaat tactgaaatg tttgtataaa 4321 gtatatatag tgaaactgag cattcatatt tttgagttat tttgctcaaa tgcatgcgaa 4381 attatatatt gtcccaactg ggatattgta cataatttta gcctttaaaa aacagtccat 4441 tactgggggg agggggcatc actctatggg caaagtgtta ctcagacatg ggcacctgag 4501 ttcagatccc taccacctaa gtaagcagac aaggtgtggt gtttttgtaa tgccagtgct 4561 agaggcagaa aagacagatc ctgcaggctc agtggctggc caaacagcct agccaacata 4621 gcgcgttcca ggttcagtga gaaaacttgt ctcaaaaatc agagggaaaa gcaaatgagg 4681 tgtcagccat gtgcactcat gcaaatgcca tacatgcaga agtatgtgca cacacacgca 4741 cacattaacc aacgactagc aaggaaaatg aaggtggata agaggggtgg gactgggaca 4801 aaggagggta cctggatgaa tatgactgaa ggacgttatg tacacatatg aaaacgtcgt 4861 actgaaactc actacaatgt atacttaata tattgctaat aaaatatttt taaaagaaaa 4921 aaat // LOCUS MUSMDRXX 2873 bp ds-DNA ROD 28-JUL-1990 DEFINITION Mouse P-glycoprotein (mdr1a) gene, exons 1 and 2. ACCESSION M33580 KEYWORDS P-glycoprotein. SOURCE Mouse (strain BALB/c/NIH) macrophage-like cell line J774.2-vinblastine resistant subline J7.V1-1 DNA, clone pV1.1a. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 2873) AUTHORS Hsu,S., Cohen,D., Lothstein,L., Kirschner,L.S., Hartstein,M. and Horwitz,S.B. TITLE Structural analysis of the mouse mdr1a (P-glycoprotein) promoter reveals the basis for differential transcript heterogeneity in multidrug-resistant J774.2 cells JOURNAL Mol. Cell. Biol. 10, 3596-3606 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by L.S.Kirschner, 05-APR-1990. Albert Einstein C., 1300 Morris Pk, Bronx, 10461. E-mail: kirschne@aecom.yu.edu. FEATURES from to/span description pre-msg 1992 > 2873 P-glycoprotein mRNA and introns IVS 2120 2606 P-glycoprotein intron A IVS 2678 > 2873 P -glycoprotein intron B signal 1904 1912 CAAT box signal 1956 1963 TATA box site 1880 1887 SP-1 site site 1921 1927 SP-1 site site 1937 1944 SP-1 site site 1869 1875 AP-1 site rpt 1 1300 L1Md repetitive element BASE COUNT 860 a 621 c 714 g 678 t ORIGIN Chromosome 5. 1 gaattctcac ctgaggaata ccgaatccag agaaacacct gaaaaaatgt tcaacatcct 61 taatcatcag ggaaatgcaa atcaaaacaa ccctgagatt ccacctcaca ccagtcagaa 121 tggctaagat caaaaattca ggtgacagca gatgctggcg aggatgtgga gaaagaggaa 181 cactcctcca ttgttggtgg gagtgcaggc ttgtacaacc actctggaaa tcagtctggc 241 ggttcctcag aaaactggac atagtactct cggaggatcc agcaatacct ctcctgggca 301 tatatccaga agatgcccca acaggtaaga aggacacatg ctccactatg ttcatagcag 361 ccttatttat aatagccaga agctggaaag aacctagatg cccctcaaca gaggaatgga 421 tacagaaaat gtggtacatc tacacaatgg agtactactc agctattaaa aagaatgaat 481 ttatgaaatt cctagccaaa tggatggacc tggggggcat catcctgagt gaggtaacac 541 attcacaaag aaactcacac aatatgtatt cactgataag tggatattag ccccaaacct 601 aggataccca agatataaga tataatttgc taaacacatg aaactcaagg agaatgaaga 661 ctgaagtgtg gacactatgc ccctccttag atttgggaac aaaacaccca tggaaggagt 721 tacagagacg gagtttggag ctgagatgaa aggatggacc atgtagagac tgccatagcc 781 agggatccac cccataatca gcatccaaac gctgacacca ttgcatacac tagcaagatt 841 ttattgaaag gacgcagatg tagctgtctc ttgtgagact atgccggggc cagcaaacac 901 agaagtggat gctcacagtc agctaatgga tggatcatag ggctcccaat ggaggagcta 961 gagaaagtag ccaaggagct aaagggatct gcaaccctat aggtggaaca acattatgag 1021 ctaaccagta ccccggagct cttgactcta gctgcatata tatcaaaaga tggcctagtc 1081 ggccatcact ggaaagagag gcccattgga cttgcaaact ttatatgccc cagtacaggg 1141 gaataccagg gccaaaaagg gggagtgggt gggcagggga gtgggggtgg gtggatatgg 1201 gggacttttg gtatagcatt ggaaatgtaa atgagttaaa tacctaataa aaaatggaaa 1261 aaaaaataaa ataaaaataa gatgaaactg gaaaaaaaaa gttatgttta ataattccaa 1321 ttgaactgta agaatttcag atgccctgga aaaacatgga cattggttta gtacctaaaa 1381 gttcaaaata ttatatattt ttaaatacca ttttacactg aaatactcca tttatatact 1441 ggggactgtc ctctttctgg tttgctttgt tttgtttaat aaaagaaata aaccaatcta 1501 cctgaggaac tgtgaactat attgaagaaa agcctgcacg ggggttctct taccttttca 1561 agagtgcttc aaagaaggga aatttactga caggcaaggt ctgtacccat tgtttaattg 1621 tctgttagat gttatgcata gaatacgtct tttaacttag ccaaatgcag aaggccaagt 1681 gcactatcta caaacacata actctatata tagacatgtg catggccgtg tagagatgag 1741 actctgcaag tgtgtctcta atgattcggg ggatatgagt ttgtctaatt gacctttgag 1801 agggaaacca gactgcacat ttcatctaca aatccaacct gtttcgcaat ttctccagca 1861 ataatacttg agtcaagctg ggccgggagc tggttaacct ccaggtcaaa ctcactggct 1921 gggcgggact gcgcctgggc gtagattgag catgctaaat ttactctcct gtccacagaa 1981 agcccaggca cagtggaaca gcggtttcca ggagctgctg gtcccatctt ccaaggctct 2041 gctcaactca gagccgcttc ttccaaagtc tacatcttgg tggactttgc agaggaaacc 2101 gggagtagag acacgtgagg taagcatttc ctaggaaggg tcgggtgttc cggataccag 2161 agcctggtcc gggtgtcagc gtaatcgtga gtctgtgggg accaagtggc gacacaagag 2221 tcgctccagg agcacccgca gcatcagctt tcaggacggt gttttccgcg ccaccctgtg 2281 ctgtggatct cgctgcccag ctcgcagcca ggggtggtgg aggagcgcgc cagggcgagg 2341 ggacccagca ggcgggtggc ggacctagag ccgagcaccc ggtccacgca ggtgacacag 2401 cttcccggga ttccccagtg agttacctcc aggccctctc cggcagcatc agggcggggc 2461 tcctcctcac cactgggctc tgcggggcag tgagctttgc ataaactctg gtcccgtgtt 2521 tggctaatga actgtggttt ctccccaggt cgtgatggaa cttgaagagg accttaaggg 2581 aagagcagac aagaacttct ccccaggtcg tgatggaact tgaagaggac cttaagggaa 2641 gagcagacaa gaacttctca aagatgggca aaaagaggta gccagattgt ttcactttcg 2701 tactttactt gtcttgtaca ttcgggcaat tagtttgtag cctccagcac tgtacttgat 2761 tagtgggtgt tatttcagac ttcagaaatg taaaccagcc cttggaagga actcctcgct 2821 tggagcagtc cttcaaatgt gtgtgacaga tcaatcaatg attctgtgaa ttc // LOCUS HUMKSAA 1504 bp ss-mRNA PRI 28-JUL-1990 DEFINITION Human adenocarcinoma-associated antigen (KSA) mRNA, complete cds. ACCESSION M32325 KEYWORDS adenocarcinoma-associated antigen. SOURCE Human cell line UCLA-P3, cDNA to mRNA, clone AG[1,1338,933]. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 1504) AUTHORS Strnad,J., Hamilton,A.E., Beavers,L.S., Gamboa,G.C., Apelgren,L.D., Taber,L.D., Sportsman,J.R., Bumol,T.F., Sharp,J.D. and Gadski,R.A. TITLE Molecular cloning and characterization of a human adenocarcinoma/epithelial cell surface antigen complementary DNA JOURNAL Cancer Res. 49, 314-317 (1989) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by J.D. Sharp, 22-FEB-1990. There are a few base differences between the sequence presented here and that which appears in entry M26481. The difference occurs mostly in the Poly-A signal. FEATURES from to/span description pept 155 1099 adenocarcinoma-associated antigen precursor (KSA) sigp 155 223 adenocarcinoma-associated antigen signal peptide matp 398 1096 adenocarcinoma-associated antigen mRNA < 1 1504 adenocarcinoma-associated antigen mRNA site 1486 1491 polyadenylation site BASE COUNT 442 a 302 c 356 g 404 t ORIGIN 1 gagcgagcac cttcgacgcg gtccggggac cccctcgtcg ctgtcctccc gacgcggacc 61 cgcgtgcccc aggcctcgcg ctgcccggcc ggctcctcgt gtcccactcc cggcgcacgc 121 cctcccgcgc ccctcttctc ggcgcgcgcg cagcatggcg cccccgcagg tcctcgcgtt 181 cgggcttctg cttgccgcgg cgacggcgac ttttgccgca gctcaggaag aatgtgtctg 241 tgaaaactac aagctggccg taaactgctt tgtgaataat aatcgtcaat gccagtgtac 301 ttcagttggt gcacaaaata ctgtcatttg ctcaaagctg gctgccaaat gtttggtgat 361 gaaggcagaa atgaatggct caaaacttgg gagaagagca aaacctgaag gggccctcca 421 gaacaatgat gggctttatg atcctgactg cgatgagagc gggctcttta aggccaagca 481 gtgcaacggc acctccacgt gctggtgtgt gaacactgct ggggtcagaa gaacagacaa 541 ggacactgaa ataacctgct ctgagcgagt gagaacctac tggatcatca ttgaactaaa 601 acacaaagca agagaaaaac cttatgatag taaaagtttg cggactgcac ttcagaagga 661 gatcacaacg cgttatcaac tggatccaaa atttatcacg agtattttgt atgagaataa 721 tgttatcact attgatctgg ttcaaaattc ttctcaaaaa actcagaatg atgtggacat 781 agctgatgtg gcttattatt ttgaaaaaga tgttaaaggt gaatccttgt ttcattctaa 841 gaaaatggac ctgacagtaa atggggaaca actggatctg gatcctggtc aaactttaat 901 ttattatgtt gatgaaaaag cacctgaatt ctcaatgcag ggtctaaaag ctggtgttat 961 tgctgttatt gtggttgtgg tgatggcagt tgttgctgga attgttgtgc tggttatttc 1021 cagaaagaag agaatggcaa agtatgagaa ggctgagata aaggagatgg gtgagatgca 1081 tagggaactc aatgcataac tatataattt gaagattata gaagaaggga aatagcaaat 1141 ggacacaaat tacaaatgtg tgtgcgtggg acgaagacat ctttgaaggt catgagtttg 1201 ttagtttaac atcatatatt tgtaatagtg aaacctgtac tcaaaatata agcagcttga 1261 aactggcttt accaatcttg aaatttgacc acaagtgtct tatatatgca gatctaatgt 1321 aaaatccaga acttggactc catcgttaaa attatttatg tgtaacattc aaatgtgtgc 1381 attaaatatg cttccacagt aaaatctgaa aaactgattt gtgattgaaa gctgcctttc 1441 tatttacttg agtcttgtac atacatactt ttttatgagc tatgaaataa aacattttaa 1501 actg // LOCUS HUMMHDNDRW 1066 bp ss-mRNA PRI 28-JUL-1990 DEFINITION Human MHC class II DN alpha mRNA, complete cds. ACCESSION M26039 M27046 KEYWORDS cell surface glycoprotein; class II gene; integral membrane protein; major histocompatibility complex. SOURCE Human (haplotype DRw8,Dw8.2/DRw8,Dw8.2) cell line SPL, cDNA to mRNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 1066) AUTHORS Jonsson,A.-K. and Rask,L. TITLE Human class II DNA and DOB genes display low sequence variability JOURNAL Immunogenetics 29, 411-413 (1989) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly provided by A.-K.Jonsson, 07-JUL-1989. FEATURES from to/span description pept 26 778 MHC DN alpha protein precursor /nomgen="HLA-DNA" /map="6p21.3" /hgml_locus_uid="LV0117X" sigp 26 106 MHC DN alpha protein signal peptide matp 107 775 MHC DN alpha protein mRNA < 1 1066 MHCDNA-a mRNA BASE COUNT 199 a 337 c 283 g 247 t ORIGIN Chromosome 6p21.3. 1 catttgatta aagcaccaga gtgtaatggc cctcagagca gggctggtcc tggggttcca 61 caccctgatg accctcctga gcccgcagga ggcaggggcc accaaggctg accacatggg 121 ctcctacgga cccgccttct accagtctta cggcgcctcg ggccagttca cccatgaatt 181 tgatgaggaa cagctgttct ctgtggacct gaagaaaagc gaggccgtgt ggcgtctgcc 241 tgagtttggt gactttgccc gctttgaccc gcagggcggg ctggccggca tcgccgcaat 301 caaagcccat ctggacatcc tggtggagcg ctccaaccgc agcagagcca tcaacgtgcc 361 tccacgggtg accgtgctcc ccaagtctcg ggtggagctg ggccagccca acatcctcat 421 ctgcatcgtg gacaacatct tcccccctgt gatcaatatc acctggctgc gcaacggcca 481 aactgtcact gagggagtgg cccagaccag cttctattcc cagcctgacc atttgttccg 541 caagttccac tacctgccct tcgtgccctc agccgaggac gtctatgact gccaggtgga 601 gcactggggc ctggatgcgc cactcctcag gcattgggag ctccaggtgc ctattccacc 661 accagatgcc atggagaccc tggtctgtgc cctgggcctg gccatcggcc tggtgggctt 721 cctcgtgggc accgtcctca tcatcatggg cacatatgtg tccagtgtcc ccaggtaatg 781 atccttctga gagaaatgac ttgtgggaga caccctgcag atcctcatgg gtttgtgaca 841 gaccctgcgt gctcagtgcc ctttaagtgc atcccgctgt gctgactttg agtgggatca 901 acatctgtcc tacgggtccc ctcttttttg gccccagtat tcatggcagg gtttgttgga 961 cacctactag cttcccttcc cattcaacac acacacacat tcttgctcta cccaaagctc 1021 tggctggcag cactaaatgc tttggtggtg tttgcactgt gtcctt // LOCUS HUMMHDOBDR 1293 bp ss-mRNA PRI 28-JUL-1990 DEFINITION Human MHC class II DO beta mRNA, complete cds. ACCESSION M26040 M27047 KEYWORDS cell surface glycoprotein; class II gene; integral membrane protein; major histocompatibility complex. SOURCE Human (haplotype DRw8,Dw8.2/DRw8,Dw8.2) cell line SPL, cDNA to mRNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 1293) AUTHORS Jonsson,A.-K. and Rask,L. TITLE Human class II DNA and DOB genes display low sequence variability JOURNAL Immunogenetics 29, 411-413 (1989) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly provided by A.-K.Jonsson, 07-JUL-1989. FEATURES from to/span description pept 36 857 MHC DO beta protein precursor /nomgen="HLA-DOB" /map="6p21.3" /hgml_locus_uid="LM0050S" sigp 36 125 MHC DO beta protein signal peptide matp 126 854 MHC DO beta protein BASE COUNT 293 a 315 c 330 g 355 t ORIGIN Chromosome 6p21.3. 1 cgattttact gtctcatttt tttcctttct ccagaatggg ttctgggtgg gtcccctggg 61 tggtggctct gctagtgaat ctgacccgac tggattcctc catgactcaa ggcacagact 121 ctccagaaga ttttgtgatt caggcaaagg ctgactgtta cttcaccaac gggacagaaa 181 aggtgcagtt tgtggtcaga ttcatcttta acttggagga gtatgtacgt ttcgacagtg 241 atgtggggat gtttgtggca ttgaccaagc tggggcagcc agatgctgag cagtggaaca 301 gccggctgga tctcttggag aggagcagac aggccgtgga tggggtctgt agacacaact 361 acaggctggg cgcacccttc actgtgggga gaaaagtgca accagaggtg acagtgtacc 421 cagagaggac cccactcctg caccagcata atctgctgca ctgctctgtg acaggcttct 481 atccagggga tatcaagatc aagtggttcc tgaatgggca ggaggagaga gctggggtca 541 tgtccactgg ccctatcagg aatggagact ggacctttca gactgtggtg atgctagaaa 601 tgactcctga acttggacat gtctacacct gccttgtcga tcactccagc ctgctgagcc 661 ctgtttctgt ggagtggaga gctcagtctg aatattcttg gagaaagatg ctgagtggca 721 ttgcagcctt cctacttggg ctaatcttcc ttctggtggg aatcgtcatc cagctaaggg 781 ctcagaaagg atatgtgagg acgcagatgt ctggtaatga ggtctcaaga gctgttctgc 841 tccctcagtc atgctaaggt cctcactgaa gcttctctct ctggagcctg aagtagtgat 901 gagtagtctg ggccctgggt gaggtaaagg acattcatga ggtcaatgtt ctgggaataa 961 ctctcttccc tgatccttgg aggagcccga actgattctg gagctctgtg ttctgagatc 1021 atgcatctcc cacccatctg cccttctccc ttctacgtgt acatcattaa tccccattgc 1081 caagggcatt gtccagaaac tcccctgaga ccttactcct tccagcccca aatcatttac 1141 ttttctgtgg tccagcccta ctcctataag tcatgatctc caaagctttc tgtcttccaa 1201 ctgcagtctc cacagtcttc agaagacaaa tgctcaggta gtcactgttt ccttttcact 1261 gtttttaaaa accttttatt gtcaaataaa atg // LOCUS TRPFLAA 966 bp ds-DNA BCT 28-JUL-1990 DEFINITION T.pallidum endoflagellar sheath protein (flaA) gene, 3' end. ACCESSION M26525 KEYWORDS endoflagellar sheath protein. SOURCE T.pallidum (strain Nichols) DNA. ORGANISM Treponema pallidum Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Spirochetes; Spirochaetales; Spirochaetaceae. REFERENCE 1 (bases 1 to 966) AUTHORS Isaacs,R.D., Hanke,J.H., Guzman-Verduzco,L.-M., Newport,G., Agabian,N., Norgard,M.V., Lukehart,S.A. and Radolf,J.D. TITLE Molecular cloning and DNA sequence analysis of the 37-kilodalton endoflagellar sheath protein of Treponema pallidum JOURNAL Infect. Immun. 57, 3403-3411 (1989) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by J.D.Radolf, 26-JUL-1989. FEATURES from to/span description pept < 1 966 endoflagellar sheath protein (AA at 1) BASE COUNT 199 a 196 c 325 g 246 t ORIGIN Unreported 1 aagctgaacg ctgatatcat ggcggataag agtggaggta tgacgcataa tcggcgtacc 61 gttctggact atgcttctct ggcggatacc tcgtacactg acgagcagaa ggcattgatg 121 agatcttctc ttgcggttgc acagtgggag gttgtgctga attcttccgc gcgtaatcct 181 gtcgcccatg ctgcctctcg cgttattgag gctccggtaa gtgagggagc gaagagtttt 241 gctggtgagc gtgtccttgg tgtgcgcgtg ttgttcccca cgtgggacag taacgcaaac 301 gcaatgataa agccggcgtt cgtaattcct gcgtacgagg tgatggctca ggtggacgat 361 cagggtaatg tacaggcccc cacagaggag gagaaggctt ctggaaaggg gcgttttgaa 421 gatgggtacg gagtggtaaa gaatgtgggt gttcttaagt ccatcgcggt gaacacttac 481 gggatgaatt atcctcatgg tttgtacgtg atgatgcggg atcaggatgg tgaggtgcat 541 cgctacttca tggggtatct cctgttcgac tcctggaagg agttggtgtg gaacaatcct 601 tcgtatatct ctgatgttcg gtcgcgggag gtgcgcttgt atcccgtgta tcccgcgtcg 661 acgccccacg tcgtgtttga aggctttatg gttactaggg acgcggctca tgccggaggg 721 gactatgttg gttatttcaa ggacgtcaag attatctatg ataaggcggt gctgagtacg 781 gtgcgcgatt ttgcggacga ggacctgtgg ggtatccagg cgcggcgtga ggctgagcgt 841 aagagagttg aggttgcgcg tttcgggcag cagcaggtgc tgcgttatat agagcaagag 901 aagcttgcta cagaggttgg ttttacaccc tctgggggtg ctcagcggca ggaagagcag 961 cagtag // LOCUS DROMPP1 3376 bp ds-DNA INV 28-JUL-1990 DEFINITION D.melanogaster membrane protein (patched) gene complete cds. ACCESSION M28418 KEYWORDS transmembrane protein. SEGMENT 1 of 2 SOURCE D.melanogaster (embryo), DNA and cDNA to mRNA. ORGANISM Drosophila melanogaster Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta; Pterygota; Neoptera; Holometabola; Diptera; Brachycera; Cyclorrhapha; Schizophora; Drosophiloidea; Drosophilidae. REFERENCE 1 (bases 1 to 3376) AUTHORS Hooper,J.E. and Scott,M.P. TITLE The Drosophila patched gene encodes a putative membrane protein required for segmental patterning JOURNAL Cell 59, 751-765 (1989) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly provided by J.E.Hooper, 27-SEP-1989. FEATURES from to/span description pept 3161 + 3289 membrane protein exon 1 pre-msg 2389 > 3376 MPR mRNA and intron IVS 3290 > 3376 MPR intron A BASE COUNT 1059 a 731 c 764 g 822 t ORIGIN Chromosome 2, map position 44D-E. 1 gatcctgaat tgagaaatat agattgaaac agaattcatt accatttaag caatcattat 61 ttatgggggc gtaatgcgcc tccgagtagg caatgctttt cttgacattg ttactaagaa 121 ttgtgaatga tatttgggcg tggatcaacg ccgattaaaa gctgcttttg cttccaggcg 181 gccagagaag agatccaaac ttcaactcca gccataaaag caacaacatt tccgtctccc 241 ccttgtagct ccccttcctc cggctcttcc actctccacg aaacggcaaa tgaagctctc 301 aaagcgaact gtgcttcgct ggtggtccat tggcagctgc cgccacacag gcgctgcttt 361 tgtgtgtgtg tgtaatatca atcttgctct ccctctcttt ttatctctct tgggaattgg 421 agctgcatgc gaattgagcg acagcaaaac gaactgcaag tcattgagag gagagcaaaa 481 actcgagcgc aagccaaaga tagcgcaatc tggggagagc gaaataaagc taaaatatgc 541 atgttggaga aaaaatgccg cccatgtcgc caaaatgcgc cacacgcaga gtgagcgggc 601 ggaggtggga gtaatggaaa gggcgatgag ggaacgatta gcttgaagag agagaacaac 661 aaatgaatgt gctgcaacgt tagttcaggt gagcgcgtta gagagagagt tgttgttttt 721 tgattgtaat agctcgcttg gtggtgggtc cacattcaca tctccctctc ccactctttc 781 tccccgaaag agagagcggg agcgaagggg cacgagggga gcacgatgac tatgcagttg 841 cattcaattt gaatttccat ggtgctgatg attcgagcgc caattttttc gaagagttct 901 tatttgttta cttcgttgtt gttgcctcaa ttggaaaggg aaaatgtgga atgcggagaa 961 acaccagaag caaatgcatt tccattcata aatccaaaga agttttaaag ataacatgtc 1021 atttggctta agttcgtggt gcacaaaaaa gatcggtttg cggttgtcgc atgaaatgag 1081 tttattccat tggtatatta ttattcagaa attaaaaaaa aacttgttta gtctattttt 1141 tttttttaaa taaaaaaaaa aaattctttt ataagtcgat tttagagtaa atatttaaag 1201 actacgtcta ataaacatat aatttgttct gtgttttaat ttgccggcaa aaacaaacct 1261 acttgtgtgg tcctcgcaca ctcataaccc ctcgcatatt tgagattcat ggggcaagag 1321 gctgcaaaaa caatggaaag ggaaaagcag aaacatcctg ccgctcataa tttagcatcg 1381 gaacatgcaa aaacagacat catcgcatgg ggcagcagca acagccataa aaccaacacg 1441 agcaatgtaa agctaacaaa tttgccaaca gttcgcggca cggctacaca cacacacatg 1501 catgcgcagc ctgccacgca cgcgcttccc ccaaacaaat acacacacac acactgagac 1561 gaaagctcca ttgggcagcg ctgccgacgc tgaaggccga catcggcaga gctgaacgtt 1621 tgggtagggg accacccaca tcgcttggcg gtttcagttt aatgaaggca gaaacaaatt 1681 tatttttggg tggtccacac tgcagcgaaa ataaactaca gtggcaacaa caaaccagca 1741 gccaaggcac tttgggtggt ccatgcaaaa aaaaaacaaa ttacggcatg cgaataacaa 1801 tagaaattag cgctctcgtg gcggagctat ttgggtatat tagagctaca tattttattt 1861 gtttataaaa agtataaatg taaacaatga gttccaagca ttaagtccgt atgctcaaca 1921 attacattat cattattatt atcacttaaa tatttacaaa ggatatttaa acagtaatag 1981 atatatattt tatttcttaa tttctgttaa catatgtatt tacattggta gttattcttt 2041 attttgcaac aagcattcat aaattttata taacaaactt ggtattttct cggaaaaact 2101 cctgaatcac ccctcggtat tttgtgcgtt gagctatcgt taaagcagcc ctcgcagaga 2161 gcgttctcaa accaaaatgg ccgcacacga aacaagagag cgagtgagag tagggagagc 2221 gtctgtgttg tgtgttgagt gtcgcccacg cacacaggcg caaaacagtg cacacagacg 2281 cccgctgggc aagagagagt gagagagaga aacagcggcg cgcgctcgcc taatgaagtt 2341 gttggcctgg ctggcgtgcc gcatccacga gatacagata catctctcag actgcgtgcg 2401 atcctcgaac gaaacggttg taagtgcgga gcgcgacgac ttgttattcg tatttccgac 2461 tactggcact ctctgtgtgt ggtatactaa caagatagat atcacagaac tcgtggaaaa 2521 gctaagatat tgtacctcac ggatgcgagg cgaagttcat ggattaaatg ccaggcaaca 2581 acaaaagcca gccaaccagc cagtgtttgt gtgtgtgcgt cgccaagtgc aaagtaaagt 2641 aaaggtaaaa gagcgaaagg cgagagagaa aaccgaatac gtgagtcgtc cgactgccgc 2701 ttttccatgt gtaaaagatc tgtgaaaatt ctgtcaaatt cccctgagaa attgtgccca 2761 agataaaacc cgaaaaccgc gttttaatcg tcgaaaaaac ccagcaaaag cgaagccagc 2821 aatcacaaca aaacaacata acgagagctc agatacacag cgtgctcagt gagtgagcga 2881 gagagcgcgg gagagagcgt ctcttgattt aaaatacaaa ataattaaaa ataaaaatgc 2941 ggaatgcagt gcaaaatgca gccaaacaaa atacgagatt ccaataacaa ttaatcgaac 3001 cgaaagtcca cgaacaatcc gcacactgtc tcccaagtct cagttctcag gacgcagacg 3061 aacggcaggc actgtagaaa gaccgattcc gcagcacact cccatctgca catctccgcc 3121 acgcgattcc gtccggaatc tggctataaa cataaccata atggaccgcg acagcctccc 3181 acgcgttccg gacacacacg gcgatgtggt cgatgagaaa ttattctcgg atctttacat 3241 acgcaccagc tgggtggacg cccaagtggc gctcgatcag atagataagg tgagtgccca 3301 actacagtga actttcactg tgaaggatag ccatgtgttg aattcaataa tattcttgat 3361 cgtattcgga ggatcc // LOCUS DROMPP2 5665 bp ds-DNA INV 28-JUL-1990 DEFINITION D.melanogaster membrane protein (patched) gene, complete cds. ACCESSION M28999 KEYWORDS transmembrane protein. SEGMENT 2 of 2 SOURCE D.melanogaster (embryo), DNA and cDNA to mRNA. ORGANISM Drosophila melanogaster Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta; Pterygota; Neoptera; Holometabola; Diptera; Brachycera; Cyclorrhapha; Schizophora; Drosophiloidea; Drosophilidae. REFERENCE 1 (bases 1 to 5665) AUTHORS Hooper,J.E. and Scott,M.P. TITLE The Drosophila patched gene encodes a putative membrane protein required for segmental patterning JOURNAL Cell 59, 751-765 (1989) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly provided by J.E.Hooper, 27-SEP-1989. Mak called J.E.Hooper and requested copy of cds be sent showing introns and exons, 9-OCT-1989. Copy was received and corrections made, 18-OCT-1989. FEATURES from to/span description pept + 95 648 membrane protein exon 2 986 2912 membrane protein exon 3 3051 3258 membrane protein exon 4 3322 3593 membrane protein exon 5 3789 4559 membrane protein exon 6 pre-msg < 1 > 5448 MPR mRNA and introns IVS < 1 94 MPR intron A IVS 649 985 MPR intron B IVS 2913 3050 MPR intron C IVS 3259 3321 MPR intron D IVS 3594 3788 MPR intron E BASE COUNT 1397 a 1537 c 1350 g 1381 t ORIGIN About 9.2kb after segment 1, Chromosome 2, map position 44D-E. 1 aattttaatg cgtattttat ggcagtggag caaggcgggg gaatctaaaa aaaaaactaa 61 acgctaaatt ccgtattttt gttgcatttt tcagggcaaa gcgcgtggca gccgcacggc 121 gatctatctg cgatcagtat tccagtccca cctcgaaacc ctcggcagct ccgtgcaaaa 181 gcacgcgggc aaggtgctat tcgtggctat cctggtgctg agcaccttct gcgtcggcct 241 gaagagcgcc cagatccact ccaaggtgca ccagctgtgg atccaggagg gcggccggct 301 ggaggcggaa ctggcctaca cacagaagac gatcggcgag gacgagtcgg ccacgcatca 361 gctgctcatt cagacgaccc acgacccgaa cgcctccgtc ctgcatccgc aggcgctgct 421 tgcccacctg gaggtcctgg tcaaggccac cgccgtcaag gtgcacctct acgacaccga 481 atgggggctg cgcgacatgt gcaacatgcc gagcacgccc tccttcgagg gcatctacta 541 catcgagcag atcctgcgcc acctcattcc gtgctcgatc atcacgccgc tggactgttt 601 ctgggaggga agccagctgt tgggtccgga atcagcggtc gttataccgt aagtagttaa 661 tatgtagtta atagccacat cttatagatt ctaaagtgaa cgtatccctt atgaccatat 721 ccttttgcat gatctacttt aacccacagt acttctctat tcatattaag gaattaataa 781 agtacttact ttgcgcttac ctttattaaa tacgatagct tatctttata aacttgctat 841 caagtcgaaa gataaacgtg acaagagtat ctttgtactt atcccagttg cttaccatcg 901 taaataatct tcttattaat aaatattcgt aaataaatat tcttaactca acaaatccat 961 ctttattatt gttactcctc tacagaggcc tcaaccaacg actcctgtgg accaccctga 1021 atcccgcctc tgtgatgcag tatatgaaac aaaagatgtc cgaggaaaag atcagcttcg 1081 acttcgagac cgtggagcag tacatgaagc gtgcggccat tggcagtggc tacatggaga 1141 agccctgcct gaacccactg aatcccaatt gcccggacac ggcaccgaac aagaacagca 1201 cccagccgcc ggatgtggga gccatcctgt ccggaggctg ctacggttat gccgcgaagc 1261 acatgcactg gccggaggag ctgattgtgg gcggacggaa gaggaaccgc agcggacact 1321 tgaggaaggc ccaggccctg cagtcggtgg tgcagctgat gaccgagaag gaaatgtacg 1381 accagtggca ggacaactac aaggtgcacc atcttggatg gacgcaggag aaggcagcgg 1441 aggttttgaa cgcctggcag cgcaactttt cgcgggaggt ggaacagctg ctacgtaaac 1501 agtcgagaat tgccaccaac tacgatatct acgtgttcag ctcggctgca ctggatgaca 1561 tcctggccaa gttctcccat cccagcgcct tgtccattgt catcggcgtg gccgtcaccg 1621 ttttgtatgc cttttgcacg ctcctccgct ggagggaccc cgtccgtggc cagagcagtg 1681 tgggcgtggc cggagttctg ctcatgtgct tcagtaccgc cgccggattg ggattgtcag 1741 ccctgctcgg tatcgttttc aatgcgctga ccgctgccta tgcggagagc aatcggcggg 1801 agcagaccaa gctgattctc aagaacgcca gcacccaggt ggttccgttt ttggcccttg 1861 gtctgggcgt cgatcacatc ttcatagtgg gaccgagcat cctgttcagt gcctgcagca 1921 ccgcaggatc cttctttgcg gccgccttta ttccggtgcc ggctttgaag gtattctgtc 1981 tgcaggctgc catcgtaatg tgctccaatt tggcagcggc tctattggtt tttccggcca 2041 tgatttcgtt ggatctacgg agacgtaccg ccggcagggc ggacatcttc tgctgctgtt 2101 ttccggtgtg gaaggaacag ccgaaggtgg cacctccggt gctgccgctg aacaacaaca 2161 acgggcgcgg ggcccggcat ccgaagagct gcaacaacaa cagggtgccg ctgcccgccc 2221 agaatcctct gctggaacag agggcagaca tccctgggag cagtcactca ctggcgtcct 2281 tctccctggc aaccttcgcc tttcagcact acactccctt cctcatgcgc agctgggtga 2341 agttcctgac cgttatgggt ttcctggcgg ccctcatatc cagcttgtat gcctccacgc 2401 gccttcagga tggcctggac attattgatc tggtgcccaa ggacagcaac gagcacaagt 2461 tcctggatgc tcaaactcgg ctctttggct tctacagcat gtatgcggtt acccagggca 2521 actttgaata tcccacccag cagcagttgc tcagggacta ccatgattcc tttgtgcggg 2581 tgccacatgt gatcaagaat gataacggtg gactgccgga cttctggctg ctgctcttca 2641 gcgagtggct gggtaatctg caaaagatat tcgacgagga ataccgcgac ggacggctga 2701 ccaaggagtg ctggttccca aacgccagca gcgatgccat cctggcctac aagctaatcg 2761 tgcaaaccgg ccatgtggac aaccccgtgg acaaggaact ggtgctcacc aatcgcctgg 2821 tcaacagcga tggcatcatc aaccaacgcg ccttctacaa ctatctgtcg gcatgggcca 2881 ccaacgacgt cttcgcctac ggagcttctc aggtgggtct tcttattaaa ttaaattaaa 2941 ttaaattaaa ttagatcgcc ttagttctcc tcatatgtac atacatatta taacttatcg 3001 cactccaaag ttaaagatta ctaaatgtgt gtgtatcttt attcttacag ggcaaattgt 3061 atccggaacc gcgccagtat tttcaccaac ccaacgagta cgatcttaag atacccaaga 3121 gtctgccatt ggtctacgct cagatgccct tttacctcca cggactaaca gatacctcgc 3181 agatcaagac cctgataggt catattcgcg acctgagcgt caagtacgag ggcttcggcc 3241 tgcccaacta tccatcgggt gagtcggaaa tgagtacttc atacatgggg cccaactaac 3301 agtcgattta tttatcgcca ggcattccct tcatcttctg ggagcagtac atgaccctgc 3361 gctcctcact ggccatgatc ctggcctgcg tgctactcgc cgccctggtg ctggtctccc 3421 tgctcctgct ctccgtttgg gccgccgttc tcgtgatcct cagcgttctg gcctcgctgg 3481 cccagatctt tggggccatg actctgctgg gcatcaaact ctcggccatt ccggcagtca 3541 tactcatcct cagcgtgggc atgatgctgt gcttcaatgt gctgatatca ctggtgagtc 3601 ttcatttctg gctggaccat taagagcttc ggagtgagtc ttcatttctg gctggaccat 3661 taagagcttc ggagtgagtc ttcatttctg gctggaccat taagagcttc ggattttcca 3721 gagatatccc aagacttttc attggatcct cttcagcaca cattaattgc ttatctttcc 3781 gattctaggg cttcatgaca tccgttggca accgacagcg ccgcgtccag ctgagcatgc 3841 agatgtccct gggaccactt gtccacggca tgctgacctc cggagtggcc gtgttcatgc 3901 tctccacgtc gccctttgag tttgtgatcc ggcacttctg ctggcttctg ctggtggtct 3961 tatgcgttgg cgcctgcaac agccttttgg tgttccccat cctactgagc atggtgggac 4021 cggaggcgga gctggtgccg ctggagcatc cagaccgcat atccacgccc tctccgctgc 4081 ccgtgcgcag cagcaagaga tcgggcaaat cctatgtggt gcagggatcg cgatcctcgc 4141 gaggcagctg ccagaagtcg catcaccacc accacaaaga ccttaatgat ccatcgctga 4201 cgacgatcac cgaggagccg cagtcgtgga agtccagcaa ctcgtccatc cagatgccca 4261 atgattggac ctaccagccg cgggaacagc gacccgcctc ctacgcggcc ccgccccccg 4321 cctatcacaa ggccgccgcc cagcagcacc accagcatca gggcccgccc acaacgcccc 4381 cgcctccctt cccgacggcc tatccgccgg agctgcagag catcgtggtg cagccggagg 4441 tgacggtgga gacgacgcac tcggacagca acaccaccaa ggtgacggcc acggccaaca 4501 tcaaggtgga gctggccatg cccggcaggg cggtgcgcag ctataacttt acgagttagc 4561 actagcacta gttcctgtag ctattaggac gtatctttag actctagcct aagccgtaac 4621 cctatttgta tctgtaaaat cgatttgtcc agcgggtctg ctgaggattt cgttctcatg 4681 gattctcatg gattctcatg gatgcttaaa tggcatggta attggcaaaa tatcaatttt 4741 tgtgtctcaa aaagatgcat tagcttatgg tttcaagata catttttaaa gagtccgcca 4801 gatatttata taaaaaaaat ccaaaatcga cgtatccatg aaaattgaaa agctaagcag 4861 acccgtatgt atgtatatgt gtatgcatgt tagttaattt cccgaagtcc ggtatttata 4921 gcagctgcct tccgcgcccc ccttcccttg aaatgaacac ccttccagcc acgccccacc 4981 gcccctctgc gtagcagctt tgtatgtatg tagtatgcta gcacctaagg aatacttaaa 5041 cttagagata tttattgtaa cacacgcaaa acacacacaa tgtacttaca tataattcaa 5101 tgcgagattc acccacacaa aaaggaaaca caacaaacta gtaattgtag ctcgtaattt 5161 agtttaaata tgttacataa aacacaagga cttgaaccaa aatagtatcg cttaaacgga 5221 aacgagagaa acgagaaaaa ataactatta cttaatcaac tacaagagag atatccctcc 5281 tcccctaacc gtacttacaa ccaaaataaa acaagagtat aagcataaaa atggaaaacg 5341 aagcgaggaa cgattgtaaa cgcggtcatt tatcctgtac atttgttgcc cgaagactga 5401 ctgtcttttt tttaataaaa atatatatta tacagttttt taaaagcgaa attcatgact 5461 tttttttaac agtgagcaga gaacaaaaga aacggaagtt ttcgctgtat caataaaaag 5521 attccatttt tttaataaat tgtaaaaatc ctaaaaaaaa gaagactaca aaagtttaaa 5581 tttttatacg ttattgataa acttttatac acgaaaatac ttgtacttag ctatgatcaa 5641 ctccttggct taagtctcgg gtaag // LOCUS BLYGEH 1250 bp ss-mRNA PLN 28-JUL-1990 DEFINITION Barley (1->3)-beta-glucan endohydrolase mRNA, complete cds. ACCESSION M23548 X15205 KEYWORDS glucan endohydrolase. SOURCE Barley (2 days into germination) scutellum, cDNA to mRNA, clone lambda-3. ORGANISM Hordeum vulgare Eukaryota; Plantae; Embryobionta; Magnoliophyta; Liliopsida; Commelinidae; Cyperales; Poaceae. REFERENCE 1 (bases 1 to 1250) AUTHORS Hoej,P.B., Hartman,D.J., Morrice,N.A., Doan,D.N.P. and Fincher,G.B. TITLE Purification of (1->3)-beta-glucan endohydrolase isoenzyme II from germinated barley and determination of its primary structure from a cDNA clone JOURNAL Plant Mol. Biol. 13, 31-42 (1989) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly provided by G.W.Fincher, 02-MAY-1989. FEATURES from to/span description pept 48 1052 glucan endohydrolase precursor sigp 48 131 glucan endohydrolase signal peptide matp 132 1049 glucan endohydrolase mRNA < 1 1250 GEH mRNA BASE COUNT 262 a 404 c 362 g 222 t ORIGIN 1 ccagcattgc atagcatttg agcaccagat actccatgtg tgcagcaatg gctagaaaag 61 atgttgcctc catgtttgca gctgctctct tcattggagc gttcgctgct gttcctacga 121 gtgtgcagtc catcggcgtg tgctacggcg tgatcggcaa caacctcccc tcccggagcg 181 acgtggtgca gctctacagg tccaagggca tcaacggcat gcgcatctac ttcgccgacg 241 ggcaggccct ctcggcgctc cgcaactccg gcatcggcct catcctcgac atcggcaacg 301 accagctcgc caacatcgcc gccagcacct ccaacgcggc gtcctgggtc cagaacaacg 361 tgcggcccta ctaccctgcc gtgaacatca agtacatcgc cgccggcaac gaggtgcagg 421 gcggcgccac gcagagcatc ctgccggcca tgcgcaacct caacgcggcc ctctccgcgg 481 cggggctcgg cgccatcaag gtgtccacct ccatccggtt cgacgaggtg gccaactcct 541 tcccgccctc cgccggcgtg ttcaagaacg cctacatgac ggacgtggcc cggctcctcg 601 cgagcaccgg cgcgccgctg ctcgccaacg tctaccccta cttcgcgtac cgtgacaacc 661 ccgggagcat cagcctgaac tacgcgacgt tccagccggg caccaccgtg cgtgaccaga 721 acaacgggct gacctacacg tccctgttcg acgcgatggt ggacgccgtg tacgcggcgc 781 tggagaaggc cggcgcgccg gcggtgaagg tggtggtgtc ggagagcggg tggccgtcgg 841 cgggcgggtt tgcggcgtcg gccggcaatg cgcggacgta caaccagggg ctgatcaacc 901 acgtcggcgg gggcacgccc aagaagcggg aggcgctgga gacgtacatc ttcgccatgt 961 tcaacgagaa ccagaagacc ggggacgcca cggagaggag cttcgggctc ttcaacccgg 1021 acaagtcgcc ggcatacaac atccagttct agtgtagcta cctagctcac atacctacat 1081 ccccagccta aataaataag ctgctcgtac gtacgtaatg cggcatccaa gtgtaacgta 1141 gacacgtaca ttcatccatg gaagagtgca accaagcatg cgttaacttc ctggtgatga 1201 tacatcatca tggtatgaat aaaagatatg gaagatgtta tgaatttgtg // LOCUS ECOPOLBDA 4666 bp ds-DNA BCT 28-JUL-1990 DEFINITION E.coli DNA polymerase (polB) gene, 5' flank. ACCESSION M35371 KEYWORDS DNA polymerase; polB gene. SOURCE E.coli (strain W3110) DNA. ORGANISM Escherichia coli Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Facultatively anaerobic rods; Enterobacteriaceae. REFERENCE 1 (bases 1 to 4666) AUTHORS Chen,H., Sun,Y., Stark,T., Beattie,W. and Moses,R. TITLE Nucleotide sequence and deletion analysis of the polB gene of E.coli JOURNAL Unpublished (1990) STANDARD simple staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by H.Chen, 20-JUN-1990. Author address: H.Chen Baylor College of Medicine Cell Biology and IMG One Baylor Plaza Houston, TX 77030 email: hchen@vulture.bcm.tmc.edu FEATURES from to/span description signal 1209 1214 -35 region signal 1229 1234 -10 region binding 1250 1255 ribosome binding site site 1193 1208 lexA box BASE COUNT 1100 a 1278 c 1257 g 1031 t ORIGIN 1 aagcttgcat gcctgcaggt cgactctaga ggatcctcgc tggtggcgcg caccataccg 61 tcttcagcca tgcactgaac ctcaacgata tgcgccaatt cgccgagatg cacgacattg 121 aaatcacggt gattgataac gacacacgcc tgccagcgtt taaagacgcg ctgcgctgga 181 acgaagtgta ttacgggttt cgtcgctaag tagccgcatc cggtatgtaa cgcctgatgc 241 gacgctgacg cgtcttatct ggcctacacg ctgcgatttt gtaggccgga taagcaaagc 301 gcatccggca ttcaacgcct gatgcgacgc tggcgcgtct tatcaggcct acgcgctgcg 361 attttgtagg ccggataagc aaagcgcatc cggcattcaa cgcctgatgc gacgctggcg 421 cgtcttatca ggcctacacg ctgcgatttt gtaggccgga taagcaaagc gcatccggca 481 cgaaggagtc aacatgttag aagatctcaa acgccaggta ttagaagcca acctggcgct 541 gccaaaacac aacctggtca cgctcacatg gggcaacgtc agcgccgttg atcgcgagcg 601 cggcgtcttt gtgatcaaac cttccggcgt cgattacagc gtcatgaccg ctgacgatat 661 ggtcgtggtt agcatcgaaa ccggtgaagt ggttgaaggt acgaaaaagc cctcctccga 721 cacgccaact caccggctgc tctatcaggc attcccctcc attggcggca ttgtgcatac 781 gcactcgcgc cacgccacca tctgggcgca ggcgggtcag tcgattccag caaccggcac 841 cacccacgcc gactatttct acggcaccat tccctgcacc cgcaaaatga ccgacgcaga 901 aatcaacggc gaatatgagt gggaaaccgg taacgtcatc gtagaaacct ttgaaaaaca 961 gggtatcgat gcagcgcaaa tgcccggcgt tctggtccat tcccacggcc cgtttgcatg 1021 gggcaaaaat gccgaagatg cggtgcataa cgccatcgtg ctggaagagg tcgcttatat 1081 ggggatattc tgccgtcagt tagcgccgca gttaccggat atgcagcaaa cgctgctgga 1141 taaacactat ctgcgtaagc atggcgcgaa ggcatattac gggcagtaat gactgtataa 1201 aaccacagcc aatcaaacga aaccaggcta tactcaagcc tggttttttg atggattttc 1261 agcgtggcgc aggcaggttt tatcttaacc cgacactggc gggacacccc gcaagggaca 1321 gaagtctcct tctggctggc gacggacaac gggccgttgc aggttacgct tgcaccgcaa 1381 gagtccgtgg cgtttattcc cgccgatcag gttccccgcg ctcagcatat tttgcagggt 1441 gaacaaggct ttcgcctgac accgctggcg ttaaaggatt ttcaccgcca gccggtgtat 1501 ggcctttact gtcgcgccca tcgccaattg atgaattacg aaaagcgcct gcgtgaaggt 1561 ggcgttaccg tctacgaggc cgatgtgcgt ccgccagaac gctatctgat ggagcggttt 1621 atcacctcac cggtgtgggt cgagggtgat atgcacaatg gcactatcgt taatgcccgt 1681 ctgaaaccgc atcccgacta tcgtccgccg ctcaagtggg tttctataga tattgaaacc 1741 acccgccacg gtgagctgta ctgcatcggc ctggaagcgt gcgggcagcg catcgtttat 1801 atgctggggc cggagaatgg cgacgcctcc tcgcttgatt tcgaactgga atacgtcgcc 1861 agccgcccgc agttgctgga aaaactcaac gcctggtttg ccaactacga tcctgatgtg 1921 atcatcggtt ggaacgtggt gcagttcgat ctgcgaatgc tgcaaaaaca tgccgagcgt 1981 taccgtcttc cgctgcgtct tgggcgcgat aatagcgagc tggagtggcg cgagcacggc 2041 tttaaaaacg gcgtcttttt tgcccaggct aaaggtcggc taattatcga cggtatcgag 2101 gcgctgaaat ccgcgttctg gaatttctct tcattctcgc tggaaactgt cgctcaggag 2161 ctattaggcg aaggaaaatc tatcgataac ccgtgggatc gaatggacga aattgaccgc 2221 cgtttcgccg aagataaacc tgcgctggca acttataacc tgaaagattg cgagctggtg 2281 acgcagatct tccacaaaac tgaaatcatg ccatttttac tcgaacgggc aacggtgaac 2341 ggcctgccgg tggaccgaca cggcggttcg gtggcggcat ttggtcatct ctattttccg 2401 cgaatgcatc gcgctggtta tgtcgcgcct aatctcggcg aagtgccgcc gcacgccagc 2461 cctggcggct acgtgatgga ttcacggcca gggctttatg attcagtgct ggtgctggac 2521 tataaaagcc tgtacccgtc gatcatccgc acctttctga ttgatcccgt cgggctggtg 2581 gaaggcatgg cgcagcctga tccagagcac agtaccgaag gttttctcga tgcctggttc 2641 tcgcgagaaa aacattgcct gccggagatt gtgactaaca tctggcacgg gcgcgatgaa 2701 gccaaacgcc agggtaacaa accgctgtcg caggcgctga aaatcatcat gaatgccttt 2761 tatggcgtgc tcggcaccac cgcctgccgc ttcttcgatc cgcggctggc atcgtcgatc 2821 accatgcgtg gtcatcagat catgcggcaa accaaagcgt tgattgaagc acagggctac 2881 gacgttatct acggcgatac cgactcaacg tttgtctggc tgaaaggcgc acattcggaa 2941 gaagaagcgg cgaaaatcgg tcgtgcactg gtgcagcacg ttaacgcctg gtgggcggaa 3001 acgctgcaaa aacaacggct gaccagcgca ttagaactgg agtatgaaac ccatttctgc 3061 cgttttctga tgccaaccat tcgcggagcc gataccggca gtaaaaagcg ttatgccgga 3121 ctgattcagg agggcgacaa gcagcggatg gtgtttaaag ggctggaaac cgtgcgcacc 3181 gactggacgc cgctggccca gcagtttcag caggagctat acctgcgcat cttccgcaac 3241 gagccatatc aggaatatgt acgcgaaacc atcgacaaac tgatggcggg tgaactggat 3301 gcgcgactgg tttaccgtaa acgccttcgc cgtccgctga gcgagtatca gcgtaatgtg 3361 ccgcctcatg tacgcgccgc tcgccttgcc gatgaagaaa accaaaagcg tggtcgcccc 3421 ttgcaatatc agaatcgcgg caccattaag tacgtatgga ccaccacagg cccggagccg 3481 cctggactac caacgttcac cactggatta cgaacactat ctgacccgcc agctacaacc 3541 cgtggcggag ggaatactcc cttttattga ggataatttt gctacactta tgaccgggca 3601 acttgggcta ttttgagcaa aaaaaagagt tcgccagata ccattttgat gcgtgacgaa 3661 tgctttgcca tccagtacca tagcgccctt tccattcctg gacctgaata acaccactac 3721 ctcataagca cggtagcggg tggttattgc ctgcaattaa agatatagag ccgaacacat 3781 atgcctttta cacttggtca acgctggatc agcgatacag aaagcgaatt gggacttgga 3841 accgttgtcg cggtggatgc gcgaactgtc actttacttt tcccatctac tggtgaaaac 3901 cgtctgtacg cacgcagtga ttcccccgtg acccgcgtga tgttcaaccc tggtgatacc 3961 attaccagcc atgacggctg gcagatgcaa gtcgaagaag taaaagaaga aaatggcttg 4021 ctgacctata tcggtactcg cctggatact gaagaggtcc ggcgtagccc tgcgtgaagt 4081 tttccttgat agcaaactgg tgttcagcaa accgcaggca ccgtctgttt gccgggcaga 4141 ttgaccgtat ggaccgcttt gcgctgcgtt atcgcgcgcg taaatattcc agcgaacagt 4201 tccgtatgcc gtacagcggc ctgcgcggtc agcgtaccag cctgatccgc atcagctcaa 4261 catcgctcat gatgttggtc gccgccacgc gccgcgcgtc ctgctggctg acgaagtggg 4321 tttagggaaa accattgaag ccgggatgat cctgcatcag caactgctct ctggcgctgc 4381 tgaacgtgtg ctaattatcg tcccggaaac cttacagcat cagtggctgg tagaaatgct 4441 gcgccgtttc aacctgcgct ttgcgctatt tgatgatgag cgttatgccg aagctcagca 4501 cgatgcttac aacccgtttg acaccgtgaa gcggcgcacg aaaaacgcga aagcgtttca 4561 cgataaatgc gaaaacttta gctttcgcgc ttcaaatgaa acagatgtat taattactgc 4621 tttttattca ttacatgggg atccccgggt accgagctcg aattcc // LOCUS HUMBIGFII 1387 bp ss-mRNA PRI 28-JUL-1990 DEFINITION Human insulin-like growth factor binding protein 2 (IGFBP2) mRNA, complete cds. ACCESSION M35410 KEYWORDS insulin-like growth factor binding protein 2. SOURCE Human 67-year old retina, cDNA to mRNA, clone AS200. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 1387) AUTHORS Agarwal,N., Hsieh,C.-L., Sills,D., Swaroop,M., Desai,B., Francke,U. and Swaroop,A. TITLE Sequence analysis, expression and chromosomal localization of a gene, isolated from a subtracted human retina cDNA library, that encodes an insulin-like growth factor binding protein (IGFBP2) JOURNAL Exp. Eye Res. (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by A.Swaroop, 20-JUN-1990. FEATURES from to/span description pept 64 1050 insulin-like growth factor binding protein 2 (IGFBP2) precursor /hgml_locus_uid="LN0220S" /nomgen="IBP1" /map="7p13-p12" sigp 64 180 insulin-like growth factor binding protein 2 signal peptide matp 181 1047 insulin-like growth factor binding protein 2 mRNA < 1 1387 IGFBP2 mRNA signal 1362 1367 poly-A signal binding 175 197 ATP binding site BASE COUNT 232 a 455 c 477 g 223 t ORIGIN 1 gtgccacctg cccgcccgcc cgctcgctcg ctcgcccgcc gcgccgcgct gccgaccgcc 61 agcatgctgc cgagagtggg ctgccccgcg ctgccgctgc cgccgccgcc gctgctgccg 121 ctgctgccgc tgctgctgct gctactgggc gcgagtggcg gcggcggcgg ggcgcgcgcg 181 gaggtgctgt tccgctgccc gccctgcaca cccgagcgcc tggccgcctg cgggcccccg 241 ccggttgcgc cgcccgccgc ggtggccgca gtggccggag gcgcccgcat gccatgcgcg 301 gagctcgtcc gggagccggg ctgcggctgc tgctcggtgt gcgcccggct ggagggcgag 361 gcgtgcggcg tctacacccc gcgctgcggc caggggctgc gctgctatcc ccacccgggc 421 tccgagctgc ccctgcaggc gctggtcatg ggcgagggca cttgtgagaa gcgccgggac 481 gccgagtatg gcgccagccc ggagcaggtt gcagacaatg gcgatgacca ctcagaagga 541 ggcctggtgg agaaccacgt ggacagcacc atgaacatgt tgggcggggg aggcagtgct 601 ggccggaagc ccctcaagtc gggtatgaag gagctggccg tgttccggga gaaggtcact 661 gagcagcacc ggcagatggg caagggtggc aagcatcacc ttggcctgga ggagcccaag 721 aagctgcgac caccccctgc caggactccc tgccaacagg aactggacca ggtcctggag 781 cggatctcca ccatgcgcct tccggatgag cggggccctc tggagcacct ctactccctg 841 cacatcccca actgtgacaa gcatggcctg tacaacctca aacagtgcaa gatgtctctg 901 aacgggcagc gtggggagtg ctggtgtgtg aaccccaaca ccgggaagct gatccaggga 961 gcccccacca tccgggggga ccccgagtgt catctcttct acaatgagca gcaggaggct 1021 cgcggggtgc acacccagcg gatgcagtag accgcagcca gccggtgcct ggcgcccctg 1081 ccccccgccc ctctccaaac accggcagaa aacggagagt gcttgggtgg tgggtgctgg 1141 aggattttcc agttctgaca cacgtattta tatatggaaa gagaccagca ccgagctcgg 1201 cacctccccg gcctctctct tcccagctgc agatgccaca cctgctcctt cttgctttcc 1261 ccgggggagg aagggggttg tggtcgggga gctggggtac aggtttgggg agggggaaga 1321 gaaattttta tttttgaacc cctgtgtccc ttttgcataa gattaaagga aggaaaagta 1381 aagtgtg // LOCUS HUMLBPA 1431 bp ss-mRNA PRI 28-JUL-1990 DEFINITION Human lipopolysaccharide binding protein (LBP) mRNA, complete cds. ACCESSION M35533 KEYWORDS lipopolysaccharide binding protein. SOURCE Human liver, cDNA to mRNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (sites; for [2]) AUTHORS Schumann,R.R., Leong,S.R., Flaggs,G.W., Gray,P.W., Wright,S.D., Mathison,J.C., Tobias,P.S. and Ulevitch,R.J. TITLE Structure and function of lipopolysaccharide binding protein JOURNAL Science (1990) In press STANDARD full staff_review REFERENCE 2 (bases 1 to 1431) AUTHORS Schumann,R.R., Leong,S.R., Flaggs,G.W., Gray,P.W., Wright,S.D., Mathison,J.C., Tobias,P.S. and Ulevitch,R.J. JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by P.S.Tobias, 21-JUN-1990. Author address: P.S.Tobias Department of Immunology, IMM-12 10466 N. Torrey Pines Rd La Jolla, CA 92037 FEATURES from to/span description pept 1 > 1431 lipopolysaccharide binding protein (LBP) precursor sigp 1 75 lipopolysaccharide binding protein signal peptide matp 76 1431 lipopolysaccharide binding protein BASE COUNT 319 a 417 c 359 g 336 t ORIGIN 1 atgggggcct tggcaagagc cctgccgtcc atactgctgg cattgctgct tacgtccacc 61 ccagaggctc tgggtgccaa ccccggcttg gtcgccagga tcaccgacaa gggactgcag 121 tatgcggccc aggaggggct attggctctg cagagtgagc tgctcaggat cacgctgcct 181 gacttcaccg gggacttgag gatcccccac gtcggccgtg ggcgctatga gttccacagc 241 ctgaacatcc acagctgtga gctgcttcac tctgcgctga ggcctgtccc cggccagggc 301 ctgagtctca gcatctccga ctcctccatc cgggtccagg gcaggtggaa ggtgcgcaag 361 tcattcttca aactacaggg ctcctttgat gtcagtgtca agggcatcag catttcggtc 421 aacctcctgt tgggcagcga gtcctccggg aggcccacag gttactgcct cagctgcagc 481 agtgacatcg ctgacgtgga ggtggacatg tcgggagatt cggggtggct cttgaacctc 541 ttccacaacc agattgagtc caagttccag aaagtactgg agagcaggat ttgcgaaatg 601 atccagaaat cagtgtcctc cgatctacag ccttatctcc aaactctgcc agttacaaca 661 gagattgaca gtttcgccga cattgattat agcttagtgg aagcccctcg ggcaacagcc 721 cagatgctgg aggtgatgtt taagggtgaa atctttcatc gtaaccaccg ttctccagtt 781 accctccttg ctgcagctga ggaacacaac aaaatggtct actttgccat ctcggattat 841 gtcttcaaca cggccagcct ggtttatcat gaggaaggat atctgaactt ctccatcaca 901 gatgacatga taccgcctga ctctaatatc cgactgacca ccaagtcctt ccgacccttc 961 gtcccacggt tagccaggct ctaccccaac atgaacctgg aactccaggg atcagtgccc 1021 tctgctccgc tcctgaactt cagccctggg aatctgtctg tggaccccta tatggagata 1081 gatgcctttg tgctcctgcc cagctccagc aaggagcctg tcttccggct cagtgtggcc 1141 actaatgtgt ccgccacctt gaccttcaat accagcaaga tcactgggtt cctgaagcca 1201 ggaaaggtaa aagtggaact gaaagaatcc aaagttggac tattcaatgc agagctgttg 1261 gaagcgctcc tcaactatta catccttaac accctctacc ccaagttcaa tgataagttg 1321 gccgaaggct tcccccttcc tctgctgaag cgtgttcagc tctacgacct tgggctgcag 1381 atccataagg acttcctgtt cttgggtgcc aatgtccaat acatgagagt t // LOCUS HUMPEC12L 2344 bp ds-DNA PRI 28-JUL-1990 DEFINITION Human cell 12-lipoxygenase gene, complete cds. ACCESSION M35418 KEYWORDS lipoxygenase. SOURCE Human platelet/erythroleukemia cell DNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 2344) AUTHORS Funk,C.D., Furci,L. and FitzGerald,G.A. TITLE Molecular cloning, primary structure and expression of the human platelet/erythroleukemia cell 12-lipoxygenase JOURNAL Proc. Natl. Acad. Sci. U.S.A. (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by C.D.Funk, 20-JUN-1990. FEATURES from to/span description pept 42 2033 12-lipoxygenase signal 2324 2329 poly-A signal mRNA < 1 2344 12-lipoxygenase mRNA BASE COUNT 514 a 692 c 642 g 496 t ORIGIN 1 ggaggacccg gctcccctcg cctaagctgc tggggggcgc catgggccgc taccgcatcc 61 gcgtggccac cggggcctgg ctcttctccg ggtcgtacaa ccgcgtgcag ctttggctgg 121 tcgggacgcg cggggaggcg gagctggagc tgcagctgcg gccggcgcgg ggcgaggagg 181 aggagtttga tcatgacgtt gcagaggact tggggctcct gcagttcgtg aggctgcgca 241 agcaccactg gctggtggac gacgcgtggt tctgcgaccg catcacggtg cagggccctg 301 gagcctgcgc ggaggtggcc ttcccgtgct accgctgggt gcagggcgag gacatcctga 361 gcctgcccga gggcaccgcc cgcctgccag gagacaatgc tttggacatg ttccagaagc 421 atcgagagaa ggaactgaaa gacagacagc agatctactg ctgggccacc tggaaggaag 481 ggttacccct gaccatcgct gcagaccgta aggatgatct acctccaaat atgagattcc 541 atgaggagaa gaggctggac tttgaatgga cactgaaggc aggggctctg gagatggccc 601 tcaaacgtgt ttacaccctc ctgagctcct ggaactgcct agaagacttt gatcagatct 661 tctggggcca gaagagtgcc ctggctgaga aggttcgcca gtgctggcag gatgatgagt 721 tgttcagcta ccagttcctc aatggtgcca accccatgct gttgagacgc tcgacctctc 781 tgccctccag gctagtgctg ccctcgggga tggaagagct tcaggctcaa ctggagaaag 841 aacttcagaa tggttccctg tttgaagctg acttcatcct tctggatgga attccagcca 901 acgtgatccg aggagagaag caatacctgg ctgcccccct cgttatgctg aagatggagc 961 ccaatgggaa gctgcagccc atggtcatcc agattcagcc tcccagcccc agctctccaa 1021 ccccaacact gttcctgccc tcagaccccc cacttgcctg gctcctggca aagtcctggg 1081 tccgaaattc agatttccaa ctgcacgaga tccagtatca cttgctgaac actcacctgg 1141 tggctgaggt catcgctgtc gccaccatgc ggtgcctccc aggactgcac cccatcttca 1201 agttcccgat cccccatatc cgctacacca tggaaatcaa cacccgggcc cggacccaac 1261 tcatctcaga tggaggaatt tttgataagg cagtgagcac aggtggaggg ggccatgtac 1321 agttgctccg tcgggcggca gctcagctga cctactgctc cctctgtcct cctgacgacc 1381 tggctgaccg gggcctgctg ggactcccag gtgctctcta tgcccatgat gctttacggc 1441 tctgggagat cattgccagg tatgtggagg ggatcgtcca cctcttctac caaagggatg 1501 acatagtgaa gggggaccct gagctgcagg cctggtgtcg ggagatcacg gaggtggggc 1561 tgtgccaggc ccaggaccga ggtttccctg tctccttcca gtcccagagt caactctgcc 1621 atttcctcac catgtgcgtc ttcacgtgca ctgcccagca tgccgccatc aaccagggcc 1681 agctggactg gtatgcctgg gtccctaatg ctccatgcac aatgcggatg cccccaccca 1741 ccaccaagga agatgtgacg atggccacag tgatggggtc actacctgat gtccggcagg 1801 cctgtcttca aatggccatc tcatggcatc tgagtcgccg ccagccagac atggtgcctc 1861 tggggcacca caaagaaaaa tatttctcag gccccaagcc caaagctgtg ctaaaccaat 1921 tccgaacaga tttggaaaag ctagaaaagg agattacagc ccggaatgag caacttgact 1981 ggccctatga atatctgaag cccagctgca tagagaacag tgtcaccatc tgagccctag 2041 agtgactcta cctgcaagat ttcacatcag ctttaggact gacatttcta tcttgaattt 2101 catgctttcc taaagtctct gctgctaagg ctctatttcc tcccccagtt aaacccctac 2161 attagtatcc cactagccca ggggagcagt aaactttctc tgcaaagact agatcctttt 2221 ttacgctttg cagaccgcat agtcactgtc tcaactactc agctctcctg ctgcagcatg 2281 aaggcagcca cagacaacat ggaaatgagt gtgactatgt tccaataaaa ctttatggac 2341 actg // LOCUS HUMRALBA 1327 bp ss-mRNA PRI 28-JUL-1990 DEFINITION Human GTP-binding protein (RALB) mRNA, complete cds. ACCESSION M35416 KEYWORDS GTP-binding protein. SOURCE Human retina, cDNA to mRNA, clone AS181. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 1327) AUTHORS Hsieh,C.-L., Swaroop,A. and Francke,U. TITLE Chromosomal localization and cDNA sequence of human RALB, a GTP binding protein JOURNAL Somat. Cell Mol. Genet. (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by A.Swaroop, 20-JUN-1990. FEATURES from to/span description pept 171 791 GTP-binding protein (RALB) mRNA < 1 1327 RALB mRNA signal 1303 1308 poly-A signal BASE COUNT 368 a 273 c 373 g 313 t ORIGIN Chromosome cen-q13. 1 gagcccggca gctcaatgac aaatcggtgg aggacggctg gggtccggcc ccgggagggc 61 ccggggcgcg tttaagagct gcgggccggg tgcggacggc ggaggcggcg ggactggtcc 121 ctgctcttca gtgggtcatc tgtgtgtcac agcctcagaa gaccagcgag atggctgcca 181 acaagagtaa gggccagagc tccttggccc tccacaaggt gatcatggtt ggcagcggag 241 gcgttggcaa gtcagccctg acgcttcagt tcatgtatga cgagtttgta gaagactatg 301 aacctaccaa agctgacagt tatagaaaga aagtggttct tgatggggaa gaagttcaga 361 tagatattct ggacaccgct gggcaagagg actacgcagc cattcgagat aactactttc 421 ggagtgggga agggtttctt cttgtgttct caatcacaga acatgaatcc tttacagcaa 481 ctgccgaatt cagggaacag attctccgtg tgaaggctga agaagataaa attccactgc 541 tcgtcgtggg aaacaagtct gacctagagg agcggaggca ggtgcctgtg gaggaggcca 601 ggagtaaagc cgaagagtgg ggcgtgcagt acgtggagac gtcagcgaag acccgggcca 661 acgtggacaa ggtgttcttt gacctaatga gagaaatcag aacaaagaag atgtcagaaa 721 acaaagacaa gaatggcaag aaaagcagca agaacaagaa aagttttaaa gaaagatgtt 781 gcttactatg agtgtcaagg tgacggatga agccagctgc tcctaaggac acagggctgg 841 gttggtaaag agaaggctat ggttgacttc ttgcttgtgc ttcccactct ccccgacttc 901 attcactcaa acttctttaa atggggaaaa atatttgtga ctctgtggct ggcagaagaa 961 ataagcccat gcaagtggaa gggctgcttt gtcaggaggt tgtggaattt ctttcttctc 1021 cccttcttcc ctcccaaaag cttagctatg tataaagtgc cacagatagg aaacagctgt 1081 taattacaaa gagaaagaat tgtcatagca tcttattttg ttcctagttt tataacatta 1141 ccatccttcg ttttgaacta cagatgttgt agtgggtttt ggaggaggga gtggagtaag 1201 atgccctccc acttttatca gtttagtagt agtactgaga aaaatccctt cagctctaag 1261 aacactgaaa aatccaccga ttttttgggt aagcttcttg gcaataccct gtggatctga 1321 aacagct // LOCUS LACLACR 1332 bp ds-DNA BCT 28-JUL-1990 DEFINITION L.lactis lactose phosphotransferase system repressor (lacR) gene, complete cds. ACCESSION M35375 KEYWORDS lactose phosphotransferase system repressor; lactose repressor. SOURCE L.lactis (strain MG1820) DNA. ORGANISM Lactococcus lactis Prokaryota; Bacteria; Firmicutes; Regular asporogenous rods; Lactobacillaceae. REFERENCE 1 (bases 1 to 1332) AUTHORS Van Rooijen,R.J. and Devos,W.M. TITLE Molecular cloning, transcriptional analysis, and nucleotide sequence of LACR, a gene encoding the repressor of the lactose phosphotransferase system of Lactococcus lactis JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by R.J.Van Rooijen, 20-JUN-1990. Author address: R.J.Van Rooijen Netherlands Inst for Dairy Research Kernhemseweg 2 P.O. Box 20 6710 BA EDE THE NETHERLANDS FEATURES from to/span description pept 370 1155 lactose repressor (lacR; alt.) pept 388 1155 lactose repressor (lacR; alt.) mRNA 79 1245 lactose repressor mRNA signal 1215 > 1155 transcription termination signal binding 353 357 ribosomal binding site signal 45 51 -35 region signal 68 74 -10 region BASE COUNT 469 a 207 c 198 g 458 t ORIGIN 1 gatatcaaac attcaaacaa aacgcaacta tttttgttaa ttttttgttt ttttttattt 61 gtttttttaa aaaatagata acaccgttaa attattgttc atttttgttc atttaatcca 121 tcacaaaatg gacgtgaaat atctattcag gtattacaaa agtcttttac tttctataac 181 ttactgatta agaggtccta ctttattttc gtcttataca aaatctgacc taagctaata 241 tacgtcaatc ctctgttctt atttcatcat ctaacgtttg tttttgtttg aaattgtttg 301 ttttaccttg aaaatattat cttttatgat acaattaaaa gagaattatc tttggaaaaa 361 aattacttta tgaaagaaag tcttcatatg aacaaaaaac gacgattaga aaaaatttta 421 gatatgttaa agattgatgg gaccataacc ataaaagaaa taatagatga actagatatt 481 tccgatatga cagcccgtag agaccttgat gctctagaag ctgatggact tttaacacgt 541 actcatggtg gtgcacaatt gctttcctct aaaaagccac ttgaaaagac acatatcgag 601 aagaaaagtc taaatacaaa agaaaaaatt gacattgcta aaaaagcctg ctctttaatc 661 aaagatggcg atactatttt tattggaccc ggaactacac ttgtacaact ggcattagaa 721 ttgaaaggtc gtaaaggtta taaaattcgt gtcattacaa atagtctccc tgtgttcttg 781 attctaaatg atagcgaaac cattgattta ttgcttcttg gcggtgaata tagagaaata 841 actggagctt ttgtaggttc aatggcttcg acaaatttaa aagcaatgag atttgccaaa 901 gcttttgttc gtgcaaatgc tgttacccat aattctattg ctacatatag tgacaaggaa 961 ggtgtgattc aacaacttgc cctaaacaat gctgtagaaa aattcttatt agtagacagt 1021 actaaattcg atcgatacga tttctttaac ttctacaatc tagatcaact cgataccatc 1081 attacagata accagattag ccctcaacac ttagaggaat ttagccagta cactactatt 1141 ttaaaagcgg actagaatta tgacttataa aaatattgga ctactcttaa ataattagac 1201 ataaaaaaag caccgtatga atcaaacaat tctacggtgt ttttttgtta tttctaatgt 1261 atggtttgtc gaaaatatgt acacattatt taactttcca aaaaattgga gttttcttga 1321 taattggata tc // LOCUS MARCMYCA 1391 bp ds-DNA ROD 28-JUL-1990 DEFINITION Woodchuck c-myc protein gene, exon 1. ACCESSION M35498 KEYWORDS c-myc protein. SOURCE Woodchuck (Marmota monax) DNA. ORGANISM Marmota monax Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Sciuromorpha; Sciuridae; Sciurinae; Marmotini. REFERENCE 1 (bases 1 to 1391) AUTHORS Wei,Y., Hsu,T.Y., Tiollais,P., Buendia,M.A. and Etiemble,J. TITLE Evolutionary conservation of target sequences for cis-acting regulation in c-myc exon 1 and its upstream sequences JOURNAL Gene (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by M.A.Buendia, 21-JUN-1990. FEATURES from to/span description pre-msg 818 > 1391 c-myc mRNA and introns IVS 1378 > 1391 c-myc intron 1 site 159 220 P0 promoter site 790 794 TATA box site 964 968 TATA box BASE COUNT 317 a 385 c 383 g 306 t ORIGIN 1 ctcagcgatt agtgcgtctt gcgggaatag ccgcttccca cacccggccg ggtggaagtc 61 tgagcctgct gggcaaaacg agcgatatct gctgttttgg cagcaaacta ggggattcat 121 tctgggtggg aagtgcccaa tctagatagc tgtgcataca taatgcataa tgaattacac 181 tcacacaacc tcaagaaatg taataggtat gtattcataa cactctccaa gtatatgtgg 241 caaggcattg ctgcgttatt ttaattattc cagaaatcat tttcctccct acctcctctg 301 tcatttatcc ctaacactcc atatactgaa tgcgcactca taaatattcc ttctgcccgc 361 ctgtcttcat aagacttatt ttcaaaatgc tgctctttcc ccagccttag ggaggcgccc 421 ggccgcccgg gacgtgcgtg cgcggccgtg ggtacatggt gtattctcag tgttgagggt 481 gagggcagct gttccacctg tgttaattgg aacacgcagg acgagaatgc agtttgtcag 541 agtactgcgc cagaggagca gcagagaaag ggaaaggatt taaacaggag caaaagaaaa 601 tggtaggcgc gcgcagttaa ttcttgctgc gcccttatac tgtttacatc cgatagctgg 661 agtgccgggc tgcggggctg agtctcctcc ccttccctca ctcggcagtg cccctcccag 721 gttcccaaag ccgagggcgg ggagaaagaa aaaaaaaaga ttccgtggaa tccccgccca 781 ccagcccttt ataatacgag ggtctgcgcg cccgaggacc cctgagctgc gcttctcgtg 841 gccgccaaca tcgccgcgcc ccggcggccg ctcttggctc ccctcctgcc tagagaaggg 901 cagggcttct tagaggcttg gcgggaaaaa gaagcgaggg ggagggatcg cgcgtaacag 961 cagtataaaa gtcgttttcg gggctttatc tcactcgctg tagtaattcc agcgagagac 1021 agagggagtg agcgggcgaa cccgtgaggt ggaagaaccg agcagagctg ctccgggcgt 1081 cctgggaagg gaaacccgga gtgaaaggag acttagtctt ctgaccagcg cccccacccc 1141 agccctcccg cggagcccct ccagggtccg caaccgcgaa actttgccct ttgctgcggg 1201 cggacacttt gcactggaac ttaaaatacc cgatcgagga cgcgactctc cggagcgggg 1261 aggctatact gcctatttgg ggacactttt ccccgccttt acccaggacc cgctcctctg 1321 aaagcgctcc tggctgccgt ttgaaggctg gatttccttc gggtagttga aaacccggta 1381 agcaccagat c // LOCUS ONGOSTLE 214 bp ds-DNA INV 28-JUL-1990 DEFINITION O.volvulus recombinant antigen gene, 3' end. ACCESSION M35370 KEYWORDS T-cell epitope; recombinant antigen. SOURCE O.volvulus DNA. ORGANISM Onchocerca volvulus Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta; Pterygota; Neoptera; Holometabola; Diptera; Nematocera; Culicoidea; Similiidae. REFERENCE 1 (bases 1 to 214) AUTHORS Colina,K.F., Perler,F.B., Matsumura,I., Meda,M. and Nutman,T.B. TITLE The identification of an Onchocerca-specific recombinant antigen containing a T cell epitope JOURNAL Unpublished (19900 STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by T.B.Nutman, 19-JUN-1990. Author address: T.B.Nutman Inst. Lab. of Parasitic Diseases NIAID National Institutes of Health Bldg 4, Rm 126 9000 Rockville Pike Bethesda, MD 20892 email: tbn@helix.nih.gov FEATURES from to/span description pept < 1 93 recombinant antigen (AA at 1) site 91 93 nematode splice junction BASE COUNT 81 a 37 c 31 g 65 t ORIGIN 1 gaattcagtg taagaagcag cagaacattt caatcattac gaagatatat atacaacatt 61 tctttcttct tcattcttga gttgcatatg taaattcaaa aataattacg atttaatgaa 121 ttgagcaagc ataacttttc ccagcaagta taacaaagtt ttgcgaggaa cgaactcaga 181 aaactttcac ttatgtaaaa ttgcgcacaa gacc // LOCUS PSEIAAL 2766 bp ds-DNA BCT 28-JUL-1990 DEFINITION P.syringae IAA-lysine synthetase (iaaL) gene, complete cds. ACCESSION M35373 KEYWORDS IAA-lysine synthetase. SOURCE P.syringae savastanoi DNA. ORGANISM Pseudomonas syringae Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Pseudomonadaceae. REFERENCE 1 (bases 1 to 2766) AUTHORS Roberto,F., Klee,H., White,F., Nordeen,R. and Kosuge,T. TITLE Expression and fine structure of the gene encoding IAA-lysine synthetase from Pseudomonas savastanoi JOURNAL Proc. Natl. Acad. Sci. U.S.A. (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by H.J.Klee, 20-JUN-1990. FEATURES from to/span description pept 1100 2287 IAA-lysine synthetase (iaaL) pept 95 1003 ORF1 BASE COUNT 637 a 760 c 754 g 615 t ORIGIN 1 gaattccata gcgtgcgggg cttggaggag cgccgcggcc tgagtatctg tggctaaccc 61 ttgcggcttc ggtgctggtc gctgtcgagc agctatgcgt gcagatcctg cgcagtgcag 121 gcttcggcaa acaggcgatg tggaccctgc tgacggggac ggccgccgtt gccatcgcag 181 atcccctgct tattgtggcg ttcgatctgg gcctggtggg tgccggcatc gctacctgat 241 atcgagcctg gtatcggcct gtctggggtt ttactacgtt caccgagtcg cccatctgac 301 ctgtcgggtc agcctgaaga acctgtcagg tgacatcaga aatatcgggc gaaccgcctt 361 gccagcggtg attggcaacc tggcaactcc agtgggcatg gcctacgtaa tggctgcgat 421 ggcgccgttc ggatctcagg cgctggcgac tatcggggtg atcgacaggg tcattcaggt 481 tgctttttgt gtcgtgttcg ccttgcccgg tgcgctgatc ccgatactgg ggcaaaacct 541 gggcgcaatg aacactgctc gcgtgtctca agccataaag atgacgtacg gattgttgat 601 cggctacggc tcagtgacct cgctgttact cattctgctc gctgagccat tagccagctt 661 gtttcatctc gccgctgaac gccaagtcgt gttcttcgcg ttctgccgat ggggcggcgc 721 tctggacgct catcgggctg caattcattg ccacctcagt cttcctcagt atggggcgac 781 cggcgtacgt cacactgttc ggctggttcc gcgccacctg ggaaccatgc cgttcgtgtg 841 gtatggggca cataaatttg gcagcgtcgg ggtaatgctc gggcagttgc tgggtaacac 901 catagtggcc ttttgtgcct gcgtggctcg cgcatctgct catgaaaaag atgttggaca 961 tcgagatcca ttcaataggg aaccgatccc tccacaggag taactgataa tccacgtttt 1021 gcccaccctt ggctgtcgtc aggtgggcag gatgtccagg atgtccagga aatcaaaaaa 1081 cggactatag aggactcgca tgactgccta cgatatggaa aaggaatgga gtagaatttc 1141 cattactgcc gctaaaatcc accagaacaa cgattttgaa ggattcactt atcaggactt 1201 cagaacccac gtaccgatca tggacaaaga cggcttcgcg gcacagactg aacgctgtct 1261 agagcgcaat gagcgaaact gcctgatcgg ctttaccagt ggcaccagcg gcaacatcaa 1321 acgctgttat tactactacg actgcgaagt cgatgaagac agctccctct ccaacgtctt 1381 ccgcagcaac ggctttattc tgcccggtga tcgctgcgcc aacctgttca cgatcaacct 1441 gttttctgct ctgaacaaca cgattaccat gatggccggt aactgcggtg ctcacgtcgt 1501 gtccgtaggt gacatcaccc tggtgaccaa gagccatttc gaagcgctta actcgatcaa 1561 gctcaacgta ctgctcggcg tgccatccac tatcttgcag ttcatcaatg ccatgcaaca 1621 taacggtgtg cacatcaata tcgagaaggt tgtcttcacc ggcgagagcc tgaaaacttt 1681 ccagaagaaa atcatcaggc aagcctttgg cgaacaagtc tccatcgtcg gtgtgtatgg 1741 cagttccgag ggcggcattc tcggtttcac caacagccct tgccacactg aatacgagtt 1801 tctgtccgac aagtatttca tcgaaaaaga aggcgacagc atcctcatca cctcgctgac 1861 ccgagaaaac tttacgccgc tgctgaggta tcgcctagga gacaccgcaa ccctttcgat 1921 gaaaggcgac aagctctacc tgacagacat ccagcgggag gacatgagct tcaacttcat 1981 gggcaacctc atcgggctgg gcatcattca gcaaacgatt aaacagacac tgggccgatc 2041 gctggaaatc caggttcacc tgtcagtgac cgaagagcgc aaggaactgg tgaccgtttt 2101 cgttcaggcc tctgaagtcg atgaagacga acgcgtcaga atcgaaacag ccatcgccga 2161 tatccccgac atcaaagagg cgtatcagaa aaaccaaggc accgtgtcgg tcctgcgcaa 2221 ggatgccaga gactacgcgg tctcggagcg aggcaaaatg ctctacatca tcgaccgccg 2281 aaactgaatg gctgatgtga acgagtgagt agctgcaccg acggggcctt tggcggtgtc 2341 ggtgcagttt tttagaggat tcggaagcgc cagaggtcag agtccacgaa actggaacga 2401 actgggcagc ctgcggctgc aaattgtggg attttgaaat cggttatcat agccgaaatc 2461 gagtcgatcc ctcctcagca caggcttaca catggcgtca gagaccaaaa aacgtaaacg 2521 ggcgagccgg gcaaaagcca aggcaaagca gacccgtctc caacgcgccg ggcatactac 2581 cttcgtgccc gataccgact tttccttcga tatcgatcct ttcggtgatg tcgatctttg 2641 tagttgctgc cagacaacgt atctgaacga catgtttccc gacgcttctt gcgtaaggct 2701 ttagatgaga gaagggccag gcggattcgc atcaccgccg tcattcacca cgatgaggag 2761 ccgcct // LOCUS RABLPBA 1446 bp ss-mRNA MAM 28-JUL-1990 DEFINITION Rabbit lipopolysaccharide binding protein (LBP) protein mRNA, complete cds. ACCESSION M35534 KEYWORDS lipopolysaccharide binding protein. SOURCE Rabbit liver, cDNA to mRNA. ORGANISM Oryctolagus cuniculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Lagomorpha; Leporidae. REFERENCE 1 (ases 1 to 1446ites; for [2] AUTHORS Schumann,R.R., Leong,S.R., Flaggs,G.W., Gray,P.W., Wright,S.D., Mathison,J.C., Tobias,P.S. and Ulevitch,R.J. TITLE Structure and function of lipopolysaccharide binding protein JOURNAL Science (1990) In press STANDARD full staff_review REFERENCE 2 (bases 1 to 1446) AUTHORS Schumann,R.R., Leong,S.R., Flaggs,G.W., Gray,P.W., Wright,S.D., Mathison,J.C., Tobias,P.S. and Ulevitch,R.J. JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by P.S.Tobias, 21-JUN-1990. Author address: P.S.Tobias Department of Immunology, IMM-12 10466 N. Torrey Pines Rd La Jolla, CA 92037 FEATURES from to/span description pept 1 > 1446 lipopolysaccharide binding protein (LBP) precursor sigp 1 78 lipopolysaccharide binding protein signal peptide matp 79 1446 lipopolysaccharide binding protein BASE COUNT 316 a 454 c 391 g 285 t ORIGIN 1 atggggacct gggccagggc cctgctgggg tccaccctgc tgagcctgct gctcgcagct 61 gccccgggag ctctgggcac caaccccggc ctcatcacca ggatcaccga caaaggcctg 121 gagtacgcgg ccagggaggg gctgctggct ctgcagagaa agctcctgga agtcacgctg 181 ccggattccg atggggactt caggatcaaa catttcgggc gtgcacagta caagttctac 241 agtctgaaaa tccccagatt cgagctgctc cgtggcaccc tgaggcccct ccccggccag 301 ggcctgagtc tcgacatctc cgacgcctac atccacgtgc ggggcagctg gaaggtgcgc 361 aaggcgttcc tgagactgaa gaactccttt gacctgtatg tcaagggcct caccatttcc 421 gtccacctcg tgttgggcag cgagtcctcc gggaggccca cggtcaccac ctccagctgc 481 agcagcgaca tccagaacgt ggagttggac atagaggggg acctggagga gctgctgaac 541 ctcctccaaa gccagatcga tgccaggctg cgcgaagtgc tggagagcaa gatttgcagg 601 cagattgagg aagccgtgac ggcccacctg cagccttatc tacagacact gccagtcaca 661 acgcagatcg acagctttgc cggcattgac tacagcttga tggaggcccc ccgggcaaca 721 gctgggatgt tggatgtgat gtttaagggt gaaattttcc ctctggatca ccgcagccca 781 gtggacttcc ttgctccagc catgaacctc cccgaggctc acagccgaat ggtctacttt 841 tccatctccg attacgtctt caacaccgcc agcctggcct accacaagtc agggtactgg 901 aacttctcca tcacagacgc catggttccg gccgacctca acatccggcg gaccaccaag 961 tccttccgac ccttcgttcc cctgcttgcc aatctctacc ccaacatgaa cttggagctc 1021 caagggacag tgaactcgga acaactggtg aacctcagca ccgagaatct gttagaggaa 1081 cccgagatgg atattgaggc cttggtggtc ctgcccagct ctgccaggga gcctgtcttc 1141 cggctgggtg tggccactaa tgtgtctgcc acactgacct tgaacaccag gaagatcact 1201 gggttcctga agccgggaag gctacaggtg gaactgaaag aatccaaagt cggaggattc 1261 aatgtggagc tgttggaagc tctcctcaac tactacattc tcaacaacct ctaccccaag 1321 gtcaatgaga agttggccca ccgcttcccg ctccctctgc tgaggcacat tcagctctac 1381 gacctgcttc tccagaccca cgagaacttc ctgctcgtgg gcgccaacat ccagtacagg 1441 agagtt // LOCUS RATUKATPA 1484 bp ss-mRNA ROD 28-JUL-1990 DEFINITION R.norvegicus gastric (H+,K+)-ATPase beta-subunit mRNA, complete cds. ACCESSION M35535 KEYWORDS (H+,K+)-ATPase beta-subunit. SOURCE R.norvegicus gastric mucosa oxyntic cell, cDNA to mRNA, clone RG4. ORGANISM Rattus norvegicus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 1484) AUTHORS Canfield,V.A., Okamoto,C.T., Chow,D., Dorfman,J., Gros,P., Forte,J.G. and Levenson,R. TITLE Cloning of the H,K-ATPase beta subunit: Tissue-specific expression, chromosomal assignment, and relationship to Na,K-ATPase beta subunits JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by V.A.Canfield, 21-JUN-1990. Author address: V.A.Canfield Yale University School of Medicine Dept. of Cell Biology P.O. Box 3333 New Haven, CT 06510 email: levenson@YALEMED FEATURES from to/span description pept 176 1060 gastric (H+,K+)-ATPase beta-subunit (E.C. 3.6.1.3) mRNA 1 1484 gastric (H+,K+)-ATPase beta-subunit BASE COUNT 363 a 457 c 362 g 302 t ORIGIN 1 ctgacttctg ggacagtgga ggacagatag cacgcaagcc ccagccctcc cttatgttta 61 tagaggcgat agcggagaac tgatagctgg ttctgatgcc tttggcctca cacagaggag 121 actataagcc ccagaggacg ctccctgggc ccagtccagg caagcaggag aggacatggc 181 agccctgcag gagaagaagt catgcagcca gcgcatggcc gaattccggc aatactgttg 241 gaacccggac actgggcaga tgctgggccg caccccagcc cggtgggtgt ggatcagcct 301 gtactatgca gctttctacg tggtcatgac tgggctcttt gccttgtgca tctatgtgct 361 gatgcagacc attgatccct acacccccga ctaccaggac cagttaaagt caccgggggt 421 aaccttgaga ccggatgtgt atggggaaag agggctgcag atttcctaca acatctctga 481 aaacagctcc tgggctggcc tcacacacac cctccacagc ttcttagcgg gctacacccc 541 agcatcccag caggacagca tcaactgttc gtctgaaaag tacttcttcc aggagacctt 601 ttctgctccg aaccatacca agttctcctg caagttcacg gcggacatgc tacagaattg 661 ctcaggcctg gtggacccca gtttcggctt tgaggaggga aagccctgct tcattattaa 721 aatgaacagg attgtcaagt tcctgcccag caacaacacg gctccccgag tggactgcac 781 cttccaggat gacccccaaa agccccggaa ggacattgaa cccctgcagg tccagtacta 841 tccccccaat ggtaccttca gtctccacta cttcccctac tacggcaaga aagcacagcc 901 ccactacagc aaccctctgg tggcggcaaa gttcctcaac gtccccaaaa acacgcaagt 961 cctcatcgtg tgcaagatca tggcggacca cgtgaccttc gacaaccccc acgaccccta 1021 tgaagggaag gtggagttca agctcacaat acagaagtaa ggagtaggcg tggctgtcca 1081 ccccagagcc tggtggaccc tgagggacca ctcttcctga ctgacatcat cggctggcca 1141 gcatgcacgg ccacttcatg gttcagagct gacaccactg cccatctgcc gacagcagga 1201 agtgctcctt cccagcactc cctgagcacc accagctttg aactgaaacc cgacgtgcgc 1261 acgcacgttt gcaatcccgt gcggttaaca caggaaccca gagtccggct accactaagg 1321 gacaacccat ctgtagggca tttctatcct gtgaccattt gtctgtcctg cactttgata 1381 tgaactatgg gtccacatca gtgtaacact ggtcaccccg gcctccagtt tgtgcttctg 1441 gggccacagc ccctaggtca ttaaaacaaa ctatagtaaa gtta // LOCUS YSCMYO2A 5675 bp ds-DNA PLN 28-JUL-1990 DEFINITION S.cerevisiae myosin-1 isoform (MYO2) gene, complete cds. ACCESSION M35532 KEYWORDS myosin-1. SOURCE S.cerevisiae (strain GRF88) DNA, clone 10-2B. ORGANISM Saccharomyces cerevisiae Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes; Endomycetales; Saccharomycetaceae. REFERENCE 1 (bases 1 to 5675) AUTHORS Johnston,G.C., Prendergast,J.A. and Singer,R.A. TITLE The S.cerevisiae MYO2 gene encodes an essential myosin for vectorial transport of vesicles JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by G.C.Johnston, 21-JUN-1990. Author address: G.C.Johnston Dalhousie University 7E Tupper Medical Bldg. Halifax, N.S. CANADA B3H 4H7 email: JOHNSTON@AC.DAL.CA FEATURES from to/span description pept 581 5305 myosin 1 isoform (MYO2) binding 138 171 ATP-binding site binding 443 523 actin-binding site rpt 926 981 heptad repeat rpt 1010 1086 heptad repeat BASE COUNT 1994 a 986 c 1110 g 1585 t ORIGIN Chromosome 15 right arm. 1 gatcaataaa taaataggct cgaagacgcc tcagaactcc ggtcactggt ttgtcttgtt 61 gatatacgat gtgccaagcg ccgtttctcg atgcttatct ggtttagttt acgctgttaa 121 aaccaaaacc ccaacagatt ttcgacccta acgtatgtag ggctaaaata gatattgagt 181 aggttacaat taattattgg caattgcacc tagtgacaca tttacgaaaa cgtagggcaa 241 aaactattac ccgacccagg gctattttgt gattttttcc ttttttttgt ttatgatcgc 301 gcttctcgaa aagccaaata tcagaaatcc caaacacgcc ttcatttgat acgattcgta 361 gcctgcgttt cagagatcta tcaactttgc aaggccaatc agagaacaaa aaagtctcgc 421 aaagtcattt cacttttctc gcttgaaatt attcgttcga tttctggctg cttgcttgtt 481 ttttgttttc taaggtacta ttcgacacca ttccattgga cagcgatact tataccattg 541 tacatatagg acataaaaac agcagatatt acagcgtata atgtcttttg aagtgggtac 601 acgatgctgg tatccccata aagaattggg ctggattggg gcggaagtaa tcaaaaatga 661 gttcaacgac ggcaagtacc acctggagtt acaattggaa gacgatgaaa tcgtgtccgt 721 ggacacaaaa gacttgaata acgataagga ccaatctcta ccgcttctta gaaaccctcc 781 cattttggaa gcaacggaag atttgacctc tttatcttac ttgaatgagc cagctgtttt 841 acatgccatc aaacagcgct attctcaatt gaatatctac acatactcgg gtattgttct 901 gattgctaca aacccttttg atcgtgtcga ccagctttat acacaagaca tgatccaagc 961 atatgcggga aagcgcagag gtgaactgga acctcacttg tttgccattg ccgaagaagc 1021 gtataggttg atgaaaaatg acaaacaaaa tcaaaccatt gtggtaagtg gtgaatctgg 1081 tgctggaaaa acggtttctg ccaagtatat tatgcgttat tttgcttctg tagaagagga 1141 aaattccgct actgtacaac atcaagtgga aatgtcggaa acagaacaaa agattctagc 1201 tacaaaccct atcatggaag catttggtaa tgctaagact accagaaatg acaattcttc 1261 cagatttggt aagtatctag aaattttatt cgataaggac acatctatta ttggagcaag 1321 gatccgcaca tacttgttgg aacggtccag attagtttac cagccgccaa ttgagagaaa 1381 ctaccacata ttttatcaat taatggctgg attaccagct caaaccaagg aggaattgca 1441 tcttaccgat gcctcagatt acttctacat gaaccaaggc ggtgacacca agatcaacgg 1501 tattgatgat gccaaagaat acaaaattac agtagatgca ttgacattag tcggaatcac 1561 caaggaaact caacaccaaa tatttaagat cttggccgca cttctgcata tcggtaacat 1621 agaaattaaa aaaactagaa atgatgcatc actatcagct gatgagccaa acctgaaact 1681 ggcgtgcgaa ttgctgggaa ttgatgccta caactttgcc aaatgggtca ccaaaaagca 1741 gatcattaca aggtcagaga aaattgtttc gaatctaaat tatagtcaag ctctggttgc 1801 caaagattcc gtggctaagt ttatttattc cgcccttttc gattggcttg tggaaaatat 1861 caacaccgtg ttatgcaacc cggctgtgaa cgaccaaatt agctcattta ttggtgttct 1921 ggatatttat gggtttgaac attttgaaaa aaattcattt gaacaatttt gtattaacta 1981 tgccaacgaa aaactacaac aagagttcaa ccaacatgtt ttcaaattag agcaagaaga 2041 atacgttaaa gaagaaattg aatggtcttt tatagagttt aatgataatc aaccttgtat 2101 tgatctgatt gaaaacaagt tgggtatttt atcactgctt gacgaagaaa gtaggttacc 2161 tgctggttcc gacgaatctt ggacccaaaa actttatcaa actttggata aatctcctac 2221 gaacaaagta ttttctaaac caagattcgg gcaaactaaa tttatcgtga gccattatgc 2281 tctagatgtc gcttatgatg tggaaggatt tattgaaaaa aatagagaca ccgtatctga 2341 cggacatttg gaagtgttga aggcttctac caacgagaca ctaataaata tcttagaggg 2401 attagaaaaa gctgccaaaa aactggaaga agcgaaaaag cttgaattag agcaggctgg 2461 cagtaaaaag ccaggtccga taagaacggt taacaggaaa cccactttag gttccatgtt 2521 taagcaatct ttgattgaac taatgaatac catcaactca actaatgttc attatattcg 2581 ttgtataaag cctaatgcag ataaagaagc ttggcaattt gataatttga tggtgttgtc 2641 tcaactcaga gcctgtggtg ttttggaaac tattagaata tcttgtgctg ggtttccttc 2701 taggtggact tttgaagaat ttgtattaag atattacatc ttgataccac atgagcagtg 2761 ggacctaatc ttcaaaaaaa aggaaactac agaagaagat atcatatcag tggttaaaat 2821 gatcctagat gctactgtaa aggacaaatc caagtaccag attggtaata caaaaatttt 2881 cttcaaagca ggtatgcttg catatctgga aaaacttaga agcaataaga tgcataattc 2941 aattgttatg atccagaaga aaattagagc taaatattac cgtaagcagt atttgcaaat 3001 atctcaggcc atcaagtatt tgcagaacaa catcaaaggt ttcatcattc gtcaacgcgt 3061 taatgatgaa atgaaagtta actgtgcaac tttattacag gccgcttaca ggggtcattc 3121 catccgtgcc aatgtgttca gcgtattgag aacaattaca aatttgcaaa agaaaattag 3181 aaaggaacta aaacaaagac aactgaaaca agaacatgaa tataatgctg cggtaactat 3241 tcaaagtaaa gttaggacct ttgagccgag atcgagattt ttacgcacta aaaaagacac 3301 tgttgttgtc caatctttga tcagaagaag agctgctcaa aggaaattga aacaattgaa 3361 ggcagacgct aaatcagtta atcatctgaa agaagtgagc tataaattag agaataaagt 3421 gattgaactg acgcagaatc tagcatccaa ggtcaaagaa aataaagaaa tgacagaaag 3481 aattaaagaa ctacaggttc aagtggaaga aagtgccaag ttacaagaga cattagaaaa 3541 tatgaaaaaa gagcacttaa tagatattga taatcagaaa tctaaggata tggaattaca 3601 aaaaactatt gagaacaatt tgcaatccac tgaacaaact ctaaaggacg ctcaattaga 3661 gttggaggac atggttaaac aacatgatga attgaaagaa gaatctaaaa agcaacttga 3721 agaattagag caaacaaaga aaacattggt tgaataccag acattaaacg gagacttgca 3781 aaacgaagtt aaatctttaa aggaagaaat tgctaggtta caaactgcca tgtcgctggg 3841 caccgttact actagtgtac tacctcaaac accattaaag gatgtaatgg gaggcggtgc 3901 ttcaaatttc aacaatatga tgcttgagaa ttccgactta tctcctaatg atttgaatct 3961 aaagtctaga tctactccat cgtccggaaa caaccacatt gattcattga gtgtcgatcg 4021 cgaaaatggt gtcaatgcta cacaaatcaa tgaagagtta tacaggttat tggaggacac 4081 tgaaattttg aatcaagaaa tcacggaagg cctgttaaag ggattcgaag taccggatgc 4141 tggtgtagct attcaactaa gtaaaagaga cgttgtttat ccggctagaa tactgattat 4201 agttttaagt gaaatgtgga gatttgggct gaccaagcaa agtgaaagct ttcttgccca 4261 agtattgact acaattcaaa aagttgtcac tcaattgaag ggtaacgatt taattccaag 4321 cggtgtattc tggttagcaa acgttagaga gttatactca tttgtggtgt ttgctctaaa 4381 ctctatttta accgaagaaa cgttcaaaaa cggcatgacc gatgaggagt ataaggagta 4441 tgtttcattg gtcacagaac taaaggatga tttcgaagct ctaagttata atatatataa 4501 catttggctg aagaaattgc agaagcaatt gcaaaaaaag gccatcaatg ctgtggtcat 4561 ctccgaatca ttaccaggtt tcagcgcggg agaaaccagc gggtttttga acaaaatttt 4621 tgctaacact gaagaatata caatggacga cattttgacc tttttcaaca gcatatactg 4681 gtgcatgaaa tcttttcata ttgagaatga agtgttccat gctgtagtca caaccttatt 4741 gaattatgtg gatgcaattt gttttaacga attaatcatg aaacgtaatt tcttgtcgtg 4801 gaaaaggggt cttcaattga actacaacgt tactagatta gaggaatggt gcaagacgca 4861 tggcttgaca gatggtactg agtgcttaca acatttgatt cagaccgcta agctactgca 4921 agtccgtaag tatactatcg aagacattga tatcttaaga ggaatttgtt attcgctaac 4981 acctgcacaa ttgcaaaaat tgatttcaca ataccaggtg gcagactatg agtctccaat 5041 tccacaggaa atcttaagat acgttgctga tatagttaag aaagaagctg cgttatcttc 5101 atcaggtaat gattctaagg gtcacgagca tagcagcagt atatttatca ctccagaaac 5161 aggtccattt actgacccat tcagtttgat aaagacaaga aaatttgacc aagtagaagc 5221 ctatatacca gcgtggttat ccttgccctc aactaagaga atagttgacc ttgttgccca 5281 acaagtcgtt caagacggcc actaaaactg atggcgcgag aaacaaaatt gtacatgaat 5341 gctaaaaaaa gaaatgacaa aaaaagagaa aaaaaaaaat gaaactacat agttaattaa 5401 taatagaagt atttgtcaat agtatgataa tgaaatcgat attatggaag atattaaccg 5461 cgcgccgtat tagtgtacac tatattaaac tacattttgc ttcttactga atttataaat 5521 tatgattata ttattattac tattatgact actgtatata tttttagaat tagatcggga 5581 accgatgagc gttagctgaa atggacgacg ataaggaacg ataattacca ctagtaaaat 5641 aataacaact aagaataaac acattctcat tttta // LOCUS PTUB256 205 bp ds-DNA SYN 28-JUL-1990 DEFINITION Synthetic pTUB256 alpha-amylase gene promoter region. ACCESSION M36663 KEYWORDS alpha-amylase. SOURCE Synthetic DNA. ORGANISM Artificial gene Artificial sequences; Genes. REFERENCE 1 (bases 1 to 205) AUTHORS Furusato,T., Takano,J.-i., Jigami,Y., Tanaka,H. and Yamane,K. TITLE Two tandemly located promoters, artificially constructed, are active in a Bacillus subtilis alpha-amylase secretion vector JOURNAL J. Biochem. 99, 1181-1190 (1986) STANDARD simple staff_entry FEATURES from to/span description pept 164 > 205 synthetic alpha-amylase BASE COUNT 73 a 29 c 44 g 59 t ORIGIN 1 gccaagttgt tttgatagag tgattgtgat aatttaaaat gtaagcgtga acaaaattct 61 ccagtcttca catcagtttg aaaggaggaa gcggaagaat gaagtaagag ggatttttga 121 ctccgaagta agtcttcaaa aaatcaaata aggagtgtca agaatgtttg caaaacgatt 181 caaaacctct ttactgccgt tattc // LOCUS PTUB261 232 bp ds-DNA SYN 28-JUL-1990 DEFINITION Synthetic pTUB261 alpha-amylase gene promoter region. ACCESSION M36664 KEYWORDS alpha-amylase. SOURCE Synthetic DNA. ORGANISM Artificial gene Artificial sequences; Genes. REFERENCE 1 (bases 1 to 232) AUTHORS Furusato,T., Takano,J.-i., Jigami,Y., Tanaka,H. and Yamane,K. TITLE Two tandemly located promoters, artificially constructed, are active in a Bacillus subtilis alpha-amylase secretion vector JOURNAL J. Biochem. 99, 1181-1190 (1986) STANDARD simple staff_entry FEATURES from to/span description pept 191 > 232 synthetic alpha-amylase BASE COUNT 82 a 28 c 52 g 70 t ORIGIN 1 gccaagttgt tttgatagag tgattgtgat aatttaaaat gtaagcgtga acaaaattct 61 ccagtcttca catcagtttg aaaggaggaa gcggaagaat gaagtaagag ggatttttga 121 ctcggggttg ttattatttt atcgatatgt aaaatataat ttctagaaga aaagaaggtg 181 gagaggaaac atgatccaaa aacgattcaa aacctcttta ctgccgttat tc // LOCUS PTUB263 232 bp ds-DNA SYN 28-JUL-1990 DEFINITION Synthetic pTUB263 alpha-amylase gene promoter region. ACCESSION M36665 KEYWORDS alpha-amylase. SOURCE Synthetic DNA. ORGANISM Artificial gene Artificial sequences; Genes. REFERENCE 1 (bases 1 to 232) AUTHORS Furusato,T., Takano,J.-i., Jigami,Y., Tanaka,H. and Yamane,K. TITLE Two tandemly located promoters, artificially constructed, are active in a Bacillus subtilis alpha-amylase secretion vector JOURNAL J. Biochem. 99, 1181-1190 (1986) STANDARD simple staff_entry FEATURES from to/span description pept 191 > 232 synthetic alpha-amylase BASE COUNT 78 a 40 c 49 g 65 t ORIGIN 1 aagcactccc gcgatcgcct atttggcttt tccccaaaat gtaagcgtga acaaaattct 61 ccagtcttca catcagtttg aaaggaggaa gcggaagaat gaagtaagag ggatttttga 121 ctcggggttg ttattatttt atcgatatgt aaaatataat ttctagaaga aaagaaggtg 181 gagaggaaac atgatccaaa aacgattcaa aacctcttta ctgccgttat tc // LOCUS PTUB265 214 bp ds-DNA SYN 28-JUL-1990 DEFINITION Synthetic pTUB265 alpha-amylase gene promoter region. ACCESSION M36666 KEYWORDS alpha-amylase. SOURCE Synthetic DNA. ORGANISM Artificial gene Artificial sequences; Genes. REFERENCE 1 (bases 1 to 214) AUTHORS Furusato,T., Takano,J.-i., Jigami,Y., Tanaka,H. and Yamane,K. TITLE Two tandemly located promoters, artificially constructed, are active in a Bacillus subtilis alpha-amylase secretion vector JOURNAL J. Biochem. 99, 1181-1190 (1986) STANDARD simple staff_entry FEATURES from to/span description pept 173 > 214 synthetic alpha-amylase BASE COUNT 73 a 28 c 50 g 63 t ORIGIN 1 gccaagttgt tttgatagag tgattgtgat aatttaaaat gtaatcgtga acaaaattct 61 ccagtcttca catcagtttg aaaggaggaa gcggaagaat gaagtaagag ggatttttga 121 ctcggggttg ttattatttt atcgctagaa gaaaagaagg tggagaggaa acatgatcca 181 aaaacgattc aaaacctctt tactgccgtt attc // LOCUS RABMEPHA 1653 bp ss-mRNA MAM 28-JUL-1990 DEFINITION Rabbit microsomal epoxide hydrolase. ACCESSION M21496 KEYWORDS microsomal epoxide hydrolase. SOURCE Rabbit (New Zealand White) adult liver cDNA to mRNA, clone pEH. ORGANISM Oryctolagus cuniculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Lagomorpha; Leporidae. REFERENCE 1 (bases 1 to 1653) AUTHORS Hassett,C., Turnblom,S.M., DeAngeles,A. and Omiecinski,C.J. TITLE Rabbit microsomal epoxide hydrolase: Isolation and characterization of the xenobiotic metabolizing enzyme cDNA JOURNAL Arch. Biochem. Biophys. 271, 380-389 (1989) STANDARD full staff_review COMMENT Draft entry and computer readable copy for sequence [1] kindly submitted by C.Hassett 12-JAN-1989. FEATURES from to/span description pept 148 1515 microsomal epoxide hydrolase (EC 3.3.2.3) BASE COUNT 351 a 505 c 465 g 332 t ORIGIN 1 cggcatccgc aaggacctgt acgccaacac ggtgctgtct cgcctctccc gcagctctgc 61 agtgtcgccg tgcgcagagt tccacagctc tgcttcccaa gcaggtgagc agaggctgac 121 aacacagcgc ccttgtggac aggagccatg ttgctggaac tccttctcgc ctcggtgctg 181 ggcttcgtca tctactggtt cgtctctgga gacaaggagg agagtctgcc actggaggat 241 gggtggtggg gcccggggtc gaggcccgta ggcctggagg acgagagcat ccggcccttc 301 aaggtggaga cgtcggacga agagatcaac gacttacacc agaggatcga caggatccgc 361 ttgaccccac ctttggagaa cagccgcttc cactacggct tcaactccaa ctacctgaag 421 aagatcctct cctactggag gcacgaattc gactggaaga agcaagtgga gattctgaac 481 tcataccctc acttcaagac caagatcgaa gggctggaca tccacttcat ccacgtgaag 541 cccccgcagg tgccccctgg ccgcacccca aagcccttgc tgatggtgca tggctggccc 601 ggctccttct tcgagttcta caaaatcatc ccgctgctga ctgaccccaa gagccacggc 661 ctgagcgatg agcacatctt tgaagtcatc tgcccttcca ttccaggcta tggcttctca 721 caggcatctt ccaagaaggg cttcaactcg gtgagcaccg ccaggatctt ctacaagctg 781 atgctgcggc tgggcttcca ggagttctac atccagggcg gggactgggg ggccctggtc 841 tgcacgaaca tggcccagct ggtgcccagc cacgtgaaag gtctgcactt gaacatggct 901 ttgattttaa gaaatcacta cactctgacc ctcctgctgg gacggcgcat cgggggactt 961 cttggctaca ctgagaggga catggagctg ctgtacccct tcaaggagaa ggtgttctac 1021 agtctgatga gggagagcgg ctacatgcac atccgggcca ccaagcccga cactgtgggc 1081 tgtgctctga atgactctcc tgtgggactg gctgcataca ttctagagaa attttccacc 1141 tggaccaact cagaattccg agacctggag gacggaggcc tggagaggaa gttctccctg 1201 caggacctgc tgaccaacat catgatctac tggaccactg gctccatcgt ctcctcccag 1261 cgctactaca aggagaacct gggccagggc ttcatggccc acaagcatga gcggctgaag 1321 gtccacgtgc ccacgggctt cgcagccttc ccgtgtgaga taatgcatgt gccagagaag 1381 tgggtgagga ccaagtaccc gcagctcatc tcctactcct acatgccccg cgggggccac 1441 ttcgccgcct tcgaggagcc ggagctgctg gcccgggaca tctgcaagtt cgtggggctg 1501 gtggagcggc agtgatgctc ccagccttgc ctggggtgag gggtcggctt gcctcctccc 1561 ctggcctgct ggaacccacc tcaggcctcc atactcactg tctcaccccc atggcgtggc 1621 tgataaatga tttgactccc aaaaaaaaaa aaa // LOCUS XELBETA 1138 bp ss-mRNA VRT 28-JUL-1990 DEFINITION X.laevis thyroid hormone receptor beta A1 mRNA, complete cds. ACCESSION M35359 KEYWORDS thyroid hormone receptor beta A1 protein. SOURCE X.laevis, cDNA to mRNA. ORGANISM Xenopus laevis Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Amphibia; Lissamphibia; Anura; Archeobatrachia; Pipoidea; Pipidae. REFERENCE 1 (bases 1 to 1138) AUTHORS Yaoita,Y., Shi,Y.-B. and Brown,D.D. TITLE The Xenopus laevis alpha and beta thyroid hormone receptors JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by Y.Yaoita, 15-JUN-1990. Author address:Yoshio Yaoita Carnegie Inst of Washington 115 West University Parkway Baltimore, MD 21210 FEATURES from to/span description pept 1 1110 thyroid hormone receptor beta A1 protein BASE COUNT 351 a 228 c 283 g 276 t ORIGIN 1 atggaagggt atatacccag ctacttggat aaagatgagc tatgcgtggt gtgtggagac 61 aaggctacag ggtatcatta tagatgtatc acctgcgagg gctgcaaggg cttttttaga 121 agaactattc agaagaacct ccacccaagc tattcttgta aatatgaagg aaaatgtgtt 181 atagacaaag taacaagaaa ccagtgccaa gaatgtcgct tcaaaaagtg catcgctgtt 241 ggaatggcaa cagacttggt tttggatgac aacaaacgtt tggcaaaaag aaagctcata 301 gaagaaaaca gagaaaaaag acggaaagat gagattcaga aatcacttgt tcagaaacct 361 gaacccacac aagaagaatg ggagttgata caagttgtca ctgaagcaca tgtggccacc 421 aacgcacagg gaagccactg gaaacagaaa agaaaatttt tgccagagga cattggacaa 481 gctcccatag ttaatgcgcc cgagggtgga aaagtggact tagaagcctt cagccagttt 541 acaaaaataa tcaccccagc aattacaaga gttgttgatt ttgccaaaaa gctacctatg 601 ttttgtgagc tgccatgtga agaccagatc atccttctta aaggctgttg tatggagatc 661 atgtcgctcc gagcagcagt gcgttatgac cccgaaagtg aaactctaac gttaaatggt 721 gagatggcag tgacaagggg gcagctaaaa aatggaggac ttggagtggt ttcagatgcc 781 atctttgact taggggtatc gctttcttca ttcagtcttg atgataccga agtcgccttg 841 ttgcaggctg tgctgcttat gtcatcagat cggcctggtc ttgctagcgt ggagagaata 901 gaaaagtgcc aggaaggttt cctcttggct tttgaacact acattaatta caggaaacat 961 aacattgcac acttttggcc aaaactgctg atgaaagtca ccgacctccg catgattgga 1021 gcgtgccacg ccagccggtt cctgcacatg aaggtggagt gccccactga actgtttccc 1081 ccactgttct tggaagtgtt tgaggactag aacagactgt gcttctggat tctcagca // LOCUS XELBETA1 259 bp ds-DNA VRT 28-JUL-1990 DEFINITION X.laevis thyroid hormone receptor beta A gene, exon A. ACCESSION M35345 KEYWORDS thyroid hormone receptor beta A protein. SEGMENT 1 of 8 SOURCE X.laevis DNA. ORGANISM Xenopus laevis Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Amphibia; Lissamphibia; Anura; Archeobatrachia; Pipoidea; Pipidae. REFERENCE 1 (bases 1 to 259) AUTHORS Yaoita,Y., Shi,Y.-B. and Brown,D.D. TITLE The Xenopus laevis alpha and beta thyroid hormone receptors JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by Y.Yaoita, 15-JUN-1990. A unique procedure for translation determination reveals that alternate translation initiation occurs at exons G and H of the beta A thyroid hormone receptor protein. Author address:Yoshio Yaoita Carnegie Inst of Washington 115 West University Parkway Baltimore, MD 21210 BASE COUNT 63 a 55 c 88 g 53 t ORIGIN 1 aaattgggat ctatcctggg agagaatgga aatagacgac agcgctttat cctgactgaa 61 ctgaggcagg ggtaacgctg ggagtgactg gcatagcagg ggctgcgggg aggcacttca 121 gtccgtgcca agtccaacat tgtagctagt gacgagaatc gtactacagt gcgggctctc 181 actaagtgac gctcgaattc gggaagaacg acgcggcagc tgttgcatta tggtgcgtct 241 gtaggtcgga gagccggcg // LOCUS XELBETA2 97 bp ds-DNA VRT 28-JUL-1990 DEFINITION X.laevis thyroid hormone receptor beta A gene, exon B. ACCESSION M35346 KEYWORDS thyroid hormone receptor beta A protein. SEGMENT 2 of 8 SOURCE X.laevis DNA. ORGANISM Xenopus laevis Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Amphibia; Lissamphibia; Anura; Archeobatrachia; Pipoidea; Pipidae. REFERENCE 1 (bases 1 to 97) AUTHORS Yaoita,Y., Shi,Y.-B. and Brown,D.D. TITLE The Xenopus laevis alpha and beta thyroid hormone receptors JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by Y.Yaoita, 15-JUN-1990. A unique procedure for translation determination reveals that alternate translation initiation occurs at exons G and H of the beta A thyroid hormone receptor protein. Author address:Yoshio Yaoita Carnegie Inst of Washington 115 West University Parkway Baltimore, MD 21210 BASE COUNT 14 a 32 c 24 g 27 t ORIGIN 1 atttcaggac agcccagcgc cctggtgcac gatcagctgt agatctccct gtctgtgtcg 61 ctgctgccgc tgctacttca gttcctctga ctgtcag // LOCUS XELBETA3 44 bp ds-DNA VRT 28-JUL-1990 DEFINITION X.laevis thyroid hormone receptor beta A gene, exon C. ACCESSION M35347 KEYWORDS thyroid hormone receptor beta A protein. SEGMENT 3 of 8 SOURCE X.laevis DNA. ORGANISM Xenopus laevis Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Amphibia; Lissamphibia; Anura; Archeobatrachia; Pipoidea; Pipidae. REFERENCE 1 (bases 1 to 44) AUTHORS Yaoita,Y., Shi,Y.-B. and Brown,D.D. TITLE The Xenopus laevis alpha and beta thyroid hormone receptors JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by Y.Yaoita, 15-JUN-1990. A unique procedure for translation determination reveals that alternate translation initiation occurs at exons G and H of the beta A thyroid hormone receptor protein. Author address:Yoshio Yaoita Carnegie Inst of Washington 115 West University Parkway Baltimore, MD 21210 BASE COUNT 17 a 5 c 13 g 9 t ORIGIN 1 atgttgaaga ctgattgggg ttaagcaggc acatacaaga aaag // LOCUS XELBETA4 79 bp ds-DNA VRT 28-JUL-1990 DEFINITION X.laevis thyroid hormone receptor beta A gene, exon D. ACCESSION M35348 KEYWORDS thyroid hormone receptor beta A protein. SEGMENT 4 of 8 SOURCE X.laevis DNA. ORGANISM Xenopus laevis Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Amphibia; Lissamphibia; Anura; Archeobatrachia; Pipoidea; Pipidae. REFERENCE 1 (bases 1 to 79) AUTHORS Yaoita,Y., Shi,Y.-B. and Brown,D.D. TITLE The Xenopus laevis alpha and beta thyroid hormone receptors JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by Y.Yaoita, 15-JUN-1990. A unique procedure for translation determination reveals that alternate translation initiation occurs at exons G and H of the beta A thyroid hormone receptor protein. Author address:Yoshio Yaoita Carnegie Inst of Washington 115 West University Parkway Baltimore, MD 21210 BASE COUNT 32 a 12 c 21 g 14 t ORIGIN 1 acagaagccg tgaaccaatg cagaattaca ggaaaggacg aggattgaaa catctgtaca 61 tgagaaggaa tttctgaag // LOCUS XELBETA5 72 bp ds-DNA VRT 28-JUL-1990 DEFINITION X.laevis thyroid hormone receptor beta A gene, exon E. ACCESSION M35349 KEYWORDS thyroid hormone receptor beta A protein. SEGMENT 5 of 8 SOURCE X.laevis DNA. ORGANISM Xenopus laevis Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Amphibia; Lissamphibia; Anura; Archeobatrachia; Pipoidea; Pipidae. REFERENCE 1 (bases 1 to 72) AUTHORS Yaoita,Y., Shi,Y.-B. and Brown,D.D. TITLE The Xenopus laevis alpha and beta thyroid hormone receptors JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by Y.Yaoita, 15-JUN-1990. A unique procedure for translation determination reveals that alternate translation initiation occurs at exons G and H of the beta A thyroid hormone receptor protein. Author address:Yoshio Yaoita Carnegie Inst of Washington 115 West University Parkway Baltimore, MD 21210 BASE COUNT 19 a 15 c 20 g 18 t ORIGIN 1 ttaaagttga agtatttctg gtcaggtgat ctctgaggca gcgcacaggc cctcacaaaa 61 tggtggctca ag // LOCUS XELBETA6 46 bp ds-DNA VRT 28-JUL-1990 DEFINITION X.laevis thyroid hormone receptor beta A gene, exon F. ACCESSION M35350 KEYWORDS thyroid hormone receptor beta A protein. SEGMENT 6 of 8 SOURCE X.laevis DNA. ORGANISM Xenopus laevis Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Amphibia; Lissamphibia; Anura; Archeobatrachia; Pipoidea; Pipidae. REFERENCE 1 (bases 1 to 46) AUTHORS Yaoita,Y., Shi,Y.-B. and Brown,D.D. TITLE The Xenopus laevis alpha and beta thyroid hormone receptors JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by Y.Yaoita, 15-JUN-1990. A unique procedure for translation determination reveals that alternate translation initiation occurs at exons G and H of the beta A thyroid hormone receptor protein. Author address:Yoshio Yaoita Carnegie Inst of Washington 115 West University Parkway Baltimore, MD 21210 BASE COUNT 17 a 11 c 8 g 10 t ORIGIN 1 gttcctctca agcccaggaa caaaaaccgg aaatttttca aatgag // LOCUS XELBETA7 64 bp ds-DNA VRT 28-JUL-1990 DEFINITION X.laevis thyroid hormone receptor beta A gene, exon G. ACCESSION M35351 KEYWORDS thyroid hormone receptor beta A protein. SEGMENT 7 of 8 SOURCE X.laevis DNA. ORGANISM Xenopus laevis Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Amphibia; Lissamphibia; Anura; Archeobatrachia; Pipoidea; Pipidae. REFERENCE 1 (bases 1 to 64) AUTHORS Yaoita,Y., Shi,Y.-B. and Brown,D.D. TITLE The Xenopus laevis alpha and beta thyroid hormone receptors JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by Y.Yaoita, 15-JUN-1990. A unique procedure for translation determination reveals that alternate translation initiation occurs at exons G and H of the beta A thyroid hormone receptor protein. Author address:Yoshio Yaoita Carnegie Inst of Washington 115 West University Parkway Baltimore, MD 21210 FEATURES from to/span description pept 46 > 64 thyroid hormone receptor beta A protein, exon G (first expressed exon) (alt.) BASE COUNT 20 a 13 c 14 g 17 t ORIGIN 1 gctatatgtg attcttagaa gaatgagcgg accttccaat ccataatgcc aagcagtatg 61 tcag // LOCUS XELBETA8 191 bp ds-DNA VRT 28-JUL-1990 DEFINITION X.laevis thyroid hormone receptor beta A gene, exon H. ACCESSION M35352 KEYWORDS thyroid hormone receptor beta A protein. SEGMENT 8 of 8 SOURCE X.laevis DNA. ORGANISM Xenopus laevis Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Amphibia; Lissamphibia; Anura; Archeobatrachia; Pipoidea; Pipidae. REFERENCE 1 (bases 1 to 191) AUTHORS Yaoita,Y., Shi,Y.-B. and Brown,D.D. TITLE The Xenopus laevis alpha and beta thyroid hormone receptors JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by Y.Yaoita, 15-JUN-1990. A unique procedure for translation determination reveals that alternate translation initiation occurs at exons G and H of the beta A thyroid hormone receptor protein. Author address:Yoshio Yaoita Carnegie Inst of Washington 115 West University Parkway Baltimore, MD 21210 FEATURES from to/span description pept 185 > 191 thyroid hormone receptor beta A protein, exon H (first expressed exon) (alt.) BASE COUNT 57 a 31 c 37 g 66 t ORIGIN 1 gcagagtata tggtttagaa gaactaacac agaagttttt tgttggacac tactctccat 61 aatgacaatg agatttccat tgtaacatcc taattgtaac cagtaatcag agatgctgct 121 tggacagtgc ttacagcttt tttaaagaga ttttttattt ttgctttgca tcgaaccgtg 181 tactatggaa g // LOCUS XELBETAB 1150 bp ss-mRNA VRT 28-JUL-1990 DEFINITION X.laevis thyroid hormone receptor beta A5 mRNA, complete cds. ACCESSION M35360 KEYWORDS thyroid hormone receptor beta A5 protein. SOURCE X.laevis, cDNA to mRNA. ORGANISM Xenopus laevis Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Amphibia; Lissamphibia; Anura; Archeobatrachia; Pipoidea; Pipidae. REFERENCE 1 (bases 1 to 1150) AUTHORS Yaoita,Y., Shi,Y.-B. and Brown,D.D. TITLE The Xenopus laevis alpha and beta thyroid hormone receptors JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by Y.Yaoita, 15-JUN-1990. Author address:Yoshio Yaoita Carnegie Inst of Washington 115 West University Parkway Baltimore, MD 21210 FEATURES from to/span description pept 1 1122 thyroid hormone receptor beta A5 protein BASE COUNT 354 a 232 c 285 g 279 t ORIGIN 1 atgccaagca gtatgtcagg gtatataccc agctacttgg ataaagatga gctatgcgtg 61 gtgtgtggag acaaggctac agggtatcat tatagatgta tcacctgcga gggctgcaag 121 ggctttttta gaagaactat tcagaagaac ctccacccaa gctattcttg taaatatgaa 181 ggaaaatgtg ttatagacaa agtaacaaga aaccagtgcc aagaatgtcg cttcaaaaag 241 tgcatcgctg ttggaatggc aacagacttg gttttggatg acaacaaacg tttggcaaaa 301 agaaagctca tagaagaaaa cagagaaaaa agacggaaag atgagattca gaaatcactt 361 gttcagaaac ctgaacccac acaagaagaa tgggagttga tacaagttgt cactgaagca 421 catgtggcca ccaacgcaca gggaagccac tggaaacaga aaagaaaatt tttgccagag 481 gacattggac aagctcccat agttaatgcg cccgagggtg gaaaagtgga cttagaagcc 541 ttcagccagt ttacaaaaat aatcacccca gcaattacaa gagttgttga ttttgccaaa 601 aagctaccta tgttttgtga gctgccatgt gaagaccaga tcatccttct taaaggctgt 661 tgtatggaga tcatgtcgct ccgagcagca gtgcgttatg accccgaaag tgaaactcta 721 acgttaaatg gtgagatggc agtgacaagg gggcagctaa aaaatggagg acttggagtg 781 gtttcagatg ccatctttga cttaggggta tcgctttctt cattcagtct tgatgatacc 841 gaagtcgcct tgttgcaggc tgtgctgctt atgtcatcag atcggcctgg tcttgctagc 901 gtggagagaa tagaaaagtg ccaggaaggt ttcctcttgg cttttgaaca ctacattaat 961 tacaggaaac ataacattgc acacttttgg ccaaaactgc tgatgaaagt caccgacctc 1021 cgcatgattg gagcgtgcca cgccagccgg ttcctgcaca tgaaggtgga gtgccccact 1081 gaactgtttc ccccactgtt cttggaagtg tttgaggact agaacagact gtgcttctgg 1141 attctcagca // LOCUS XELBETAC 1132 bp ss-mRNA VRT 28-JUL-1990 DEFINITION X.laevis thyroid hormone receptor beta B1 mRNA, complete cds. ACCESSION M35361 KEYWORDS thyroid hormone receptor beta B1 protein. SOURCE X.laevis, cDNA to mRNA. ORGANISM Xenopus laevis Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Amphibia; Lissamphibia; Anura; Archeobatrachia; Pipoidea; Pipidae. REFERENCE 1 (bases 1 to 1132) AUTHORS Yaoita,Y., Shi,Y.-B. and Brown,D.D. TITLE The Xenopus laevis alpha and beta thyroid hormone receptors JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by Y.Yaoita, 15-JUN-1990. Author address:Yoshio Yaoita Carnegie Inst of Washington 115 West University Parkway Baltimore, MD 21210 FEATURES from to/span description pept 1 1122 thyroid hormone receptor beta B1 protein BASE COUNT 352 a 233 c 284 g 263 t ORIGIN 1 atgccaagca gtatgtcagg gtacataccc agctacttgg ataaagatga gctatgtgtg 61 gtatgtggag acaaagctac agggtatcac tatagatgta tcacctgcga gggctgcaag 121 ggctttttta gaagaactat tcagaagaac ctccacccaa gctattcctg taaatatgaa 181 ggaaaatgtg ttatagacaa agtaacaagg aaccagtgcc aagaatgtcg cttcaaaaag 241 tgcaaaactg ttggaatggc aacagacttg gttttggatg acagcaaacg tttggcgaaa 301 agaaagctca tagaagaaaa cagagaaaaa agacggaaag acgagataca gaaatcaatt 361 gttcagagac cggaaccaac acaagaagaa tgggagttga tacaagttgt cactgaagca 421 catgtggcca ccaacgcaca gggaagccac tggaaacaga aaagaaaatt tttgccagag 481 gacattggac aagctcccat agttaatgcg cctgaaggtg gaaaagtgga cttagaagcc 541 ttcagccagt ttacaaaaat aatcacccca gcaattacaa gagtggttga ttttgccaaa 601 aagctaccta tgttttgtga gctgccatgt gaagaccaga tcatccttct taaaggctgt 661 tgtatggaga tcatgtccct ccgagcagcc gtgcggtatg accccgaaag tgaaactcta 721 acgctgaatg gggagatggc agtgacaagg gggcagctaa aaaatggagg actcggtgtg 781 gtctcagatg ccatctttga cttgggggtg tcgctttctt cattcagtct tgatgatacc 841 gaagtcgcct tgttgcaggc tgtgctgctt atgtcatcag atcgtcctgg tctctctagt 901 gtggagagaa tagaaaagtg ccaggaaggt ttcctcttgg cttttgaaca ctacattaat 961 tacaggaaac acaacattgc acacttttgg ccaaaactgc tgatgaaagt caccgacctc 1021 cgcatgatcg gagcatgcca cgccagccgg ttcctgcaca tgaaggtgga gtgccccact 1081 gaactgtttc ccccactgtt cttggaagtg tttgaggact agaacagact gt // LOCUS XELBETAD 1255 bp ss-mRNA VRT 28-JUL-1990 DEFINITION X.laevis thyroid hormone receptor beta B2 mRNA, complete cds. ACCESSION M35362 KEYWORDS thyroid hormone receptor beta B2 protein. SOURCE X.laevis, cDNA to mRNA. ORGANISM Xenopus laevis Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Amphibia; Lissamphibia; Anura; Archeobatrachia; Pipoidea; Pipidae. REFERENCE 1 (bases 1 to 1255) AUTHORS Yaoita,Y., Shi,Y.-B. and Brown,D.D. TITLE The Xenopus laevis alpha and beta thyroid hormone receptors JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by Y.Yaoita, 15-JUN-1990. Author address:Yoshio Yaoita Carnegie Inst of Washington 115 West University Parkway Baltimore, MD 21210 FEATURES from to/span description pept 1 1245 thyroid hormone receptor beta B2 protein BASE COUNT 383 a 258 c 323 g 291 t ORIGIN 1 atgccaagca gtatgtcagt tcggcttttc actgcatctg ccgcacaaag aaagaagata 61 caggaagggg attgctgtgt ggtgctcgct ggaaaaaccc agggccggtt tatattgata 121 ggagcagtgg cccgggtatc agggtacata cccagctact tggataaaga tgagctatgt 181 gtggtatgtg gagacaaagc tacagggtat cactatagat gtatcacctg cgagggctgc 241 aagggctttt ttagaagaac tattcagaag aacctccacc caagctattc ctgtaaatat 301 gaaggaaaat gtgttataga caaagtaaca aggaaccagt gccaagaatg tcgcttcaaa 361 aagtgcaaaa ctgttggaat ggcaacagac ttggttttgg atgacagcaa acgtttggcg 421 aaaagaaagc tcatagaaga aaacagagaa aaaagacgga aagacgagat acagaaatca 481 attgttcaga gaccggaacc aacacaagaa gaatgggagt tgatacaagt tgtcactgaa 541 gcacatgtgg ccaccaacgc acagggaagc cactggaaac agaaaagaaa atttttgcca 601 gaggacattg gacaagctcc catagttaat gcgcctgaag gtggaaaagt ggacttagaa 661 gccttcagcc agtttacaaa aataatcacc ccagcaatta caagagtggt tgattttgcc 721 aaaaagctac ctatgttttg tgagctgcca tgtgaagacc agatcatcct tcttaaaggc 781 tgttgtatgg agatcatgtc cctccgagca gccgtgcggt atgaccccga aagtgaaact 841 ctaacgctga atggggagat ggcagtgaca agggggcagc taaaaaatgg aggactcggt 901 gtggtctcag atgccatctt tgacttgggg gtgtcgcttt cttcattcag tcttgatgat 961 accgaagtcg ccttgttgca ggctgtgctg cttatgtcat cagatcgtcc tggtctctct 1021 agtgtggaga gaatagaaaa gtgccaggaa ggtttcctct tggcttttga acactacatt 1081 aattacagga aacacaacat tgcacacttt tggccaaaac tgctgatgaa agtcaccgac 1141 ctccgcatga tcggagcatg ccacgccagc cggttcctgc acatgaaggt ggagtgcccc 1201 actgaactgt ttcccccact gttcttggaa gtgtttgagg actagaacag actgt // LOCUS XELBETB1 226 bp ds-DNA VRT 28-JUL-1990 DEFINITION X.laevis thyroid hormone receptor beta B gene, exon A. ACCESSION M35353 KEYWORDS thyroid hormone receptor beta B protein. SEGMENT 1 of 6 SOURCE X.laevis DNA. ORGANISM Xenopus laevis Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Amphibia; Lissamphibia; Anura; Archeobatrachia; Pipoidea; Pipidae. REFERENCE 1 (bases 1 to 226) AUTHORS Yaoita,Y., Shi,Y.-B. and Brown,D.D. TITLE The Xenopus laevis alpha and beta thyroid hormone receptors JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by Y.Yaoita, 15-JUN-1990. Exons D, E and H do not exist for the beta B thyroid hormone receptor protein. Author address:Yoshio Yaoita Carnegie Inst of Washington 115 West University Parkway Baltimore, MD 21210 BASE COUNT 57 a 50 c 73 g 46 t ORIGIN 1 agcttcatta tcctgactga acacaagcag ggataacgct gggagtgact ggcatagcag 61 gggctgcagg gaggcacttc ataatccgtg ccaaatccaa cgttgtagcg agtgacgaga 121 atcgtagagt gcgcggaaca gtctcacgga cgctggggtt tgggaaggac gacgcggcag 181 ctgttgcact acgttacgtc taactctata ggttggagag ctgacg // LOCUS XELBETB2 65 bp ds-DNA VRT 28-JUL-1990 DEFINITION X.laevis thyroid hormone receptor beta B gene, exon B. ACCESSION M35354 KEYWORDS thyroid hormone receptor beta B protein. SEGMENT 2 of 6 SOURCE X.laevis DNA. ORGANISM Xenopus laevis Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Amphibia; Lissamphibia; Anura; Archeobatrachia; Pipoidea; Pipidae. REFERENCE 1 (bases 1 to 65) AUTHORS Yaoita,Y., Shi,Y.-B. and Brown,D.D. TITLE The Xenopus laevis alpha and beta thyroid hormone receptors JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by Y.Yaoita, 15-JUN-1990. Exons D, E and H do not exist for the beta B thyroid hormone receptor protein. Author address:Yoshio Yaoita Carnegie Inst of Washington 115 West University Parkway Baltimore, MD 21210 BASE COUNT 7 a 19 c 16 g 23 t ORIGIN 1 agctgtagat ctcctgtctg tgttgctgcc actgctgttg ctgctccagt tcctctgact 61 gtcag // LOCUS XELBETB3 50 bp ds-DNA VRT 28-JUL-1990 DEFINITION X.laevis thyroid hormone receptor beta B gene, exon C. ACCESSION M35355 KEYWORDS thyroid hormone receptor beta B protein. SEGMENT 3 of 6 SOURCE X.laevis DNA. ORGANISM Xenopus laevis Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Amphibia; Lissamphibia; Anura; Archeobatrachia; Pipoidea; Pipidae. REFERENCE 1 (bases 1 to 50) AUTHORS Yaoita,Y., Shi,Y.-B. and Brown,D.D. TITLE The Xenopus laevis alpha and beta thyroid hormone receptors JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by Y.Yaoita, 15-JUN-1990. Exons D, E and H do not exist for the beta B thyroid hormone receptor protein. Author address:Yoshio Yaoita Carnegie Inst of Washington 115 West University Parkway Baltimore, MD 21210 BASE COUNT 19 a 5 c 15 g 11 t ORIGIN 1 atgttgaaga gtgattgggg ttaagcaggc acatactgta caagaaaaag // LOCUS XELBETB4 67 bp ds-DNA VRT 28-JUL-1990 DEFINITION X.laevis thyroid hormone receptor beta B gene, exon F. ACCESSION M35356 KEYWORDS thyroid hormone receptor beta B protein. SEGMENT 4 of 6 SOURCE X.laevis DNA. ORGANISM Xenopus laevis Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Amphibia; Lissamphibia; Anura; Archeobatrachia; Pipoidea; Pipidae. REFERENCE 1 (bases 1 to 67) AUTHORS Yaoita,Y., Shi,Y.-B. and Brown,D.D. TITLE The Xenopus laevis alpha and beta thyroid hormone receptors JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by Y.Yaoita, 15-JUN-1990. Exons D, E and H do not exist for the beta B thyroid hormone receptor protein. Author address:Yoshio Yaoita Carnegie Inst of Washington 115 West University Parkway Baltimore, MD 21210 BASE COUNT 27 a 14 c 12 g 14 t ORIGIN 1 ctacaggttt ccctcaagca ccaagaacga aaaccagaaa gaatttgcag agaatttttc 61 aaatgag // LOCUS XELBETB5 64 bp ds-DNA VRT 28-JUL-1990 DEFINITION X.laevis thyroid hormone receptor beta B gene, exon G. ACCESSION M35357 KEYWORDS thyroid hormone receptor beta B protein. SEGMENT 5 of 6 SOURCE X.laevis DNA. ORGANISM Xenopus laevis Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Amphibia; Lissamphibia; Anura; Archeobatrachia; Pipoidea; Pipidae. REFERENCE 1 (bases 1 to 64) AUTHORS Yaoita,Y., Shi,Y.-B. and Brown,D.D. TITLE The Xenopus laevis alpha and beta thyroid hormone receptors JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by Y.Yaoita, 15-JUN-1990. Exons D, E and H do not exist for the beta B thyroid hormone receptor protein. Author address:Yoshio Yaoita Carnegie Inst of Washington 115 West University Parkway Baltimore, MD 21210 FEATURES from to/span description pept 46 > 64 thyroid hormone receptor beta B gene BASE COUNT 21 a 12 c 14 g 17 t ORIGIN 1 gttatatgtg atgcttagaa gaatgagcag accttccaat ccataatgcc aagcagtatg 61 tcag // LOCUS XELBETB6 123 bp ds-DNA VRT 28-JUL-1990 DEFINITION X.laevis thyroid hormone receptor beta B gene, exon H. ACCESSION M35358 KEYWORDS thyroid hormone receptor beta B protein. SEGMENT 6 of 6 SOURCE X.laevis DNA. ORGANISM Xenopus laevis Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Amphibia; Lissamphibia; Anura; Archeobatrachia; Pipoidea; Pipidae. REFERENCE 1 (bases 1 to 123) AUTHORS Yaoita,Y., Shi,Y.-B. and Brown,D.D. TITLE The Xenopus laevis alpha and beta thyroid hormone receptors JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by Y.Yaoita, 15-JUN-1990. Exons D, E and H do not exist for the beta B thyroid hormone receptor protein. Author address:Yoshio Yaoita Carnegie Inst of Washington 115 West University Parkway Baltimore, MD 21210 BASE COUNT 31 a 25 c 39 g 28 t ORIGIN 1 ttcggctttt cactgcatct gccgcacaaa gaaagaagat acaggaaggg gattgctgtg 61 tggtgctcgc tggaaaaacc cagggccggt ttatattgat aggagcagtg gcccgggtat 121 cag // LOCUS XELTHYA 1406 bp ss-mRNA VRT 28-JUL-1990 DEFINITION X.laevis thyroid hormone receptor alpha A mRNA, complete cds. ACCESSION M35343 KEYWORDS thyroid hormone receptor protein. SOURCE X.laevis, cDNA to mRNA. ORGANISM Xenopus laevis Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Amphibia; Lissamphibia; Anura; Archeobatrachia; Pipoidea; Pipidae. REFERENCE 1 (bases 1 to 1406) AUTHORS Yaoita,Y., Shi,Y.-B. and Brown,D.D. TITLE The Xenopus laevis alpha and beta thyroid hormone receptors JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable [or printed] sequence for [1] kindly submitted by Y.Yaoita, 15-JUN-1990. Author address:Yoshio Yaoita Carnegie Inst of Washington 115 West University Parkway Baltimore, MD 21210 FEATURES from to/span description pept 122 1378 thyroid hormone receptor protein BASE COUNT 346 a 350 c 406 g 304 t ORIGIN 1 gtcgacctgt gagaggcgtc cgcccgcctc catgtgaacg ctacgcccca tgatcctcgg 61 ggagctgggg gcggagcccg ccttggtctc ttcggattgg ttctggatgg aattacgttg 121 aatggaccag aatctcagcg ggctggactg cttgtcagag ccagatgaaa aaaggtggcc 181 ggatgggaag cgaaaaagaa agaacagcca atgtatggga aaaagcggca tgtccggtga 241 cagcttggtg tctctgccct ctgcagggta catccccagc tatctggaca aagatgagcc 301 atgcgtggtg tgcagtgata aggccacggg gtaccactac cgctgtatca cttgcgaggg 361 gtgtaagggt ttctttcgcc gcaccatcca gaagaacctg cacccctcct actcgtgcaa 421 gtacgatggc tgctgcatta tcgacaagat cacccgaaat cagtgccagc tctgccgctt 481 caagaaatgc attgccgttg gcatggcaat ggatcttgtc ctggatgatg gcaagcgggt 541 agccaagcga aaactgattg aagagaatcg acagcggcgg cggaaggagg agatgatcaa 601 gactctgcaa cagcgtcccg agccaagcag cgaggagtgg gagttgattc gcattgtaac 661 agaagctcac aggagtacca atgctcaggg cagccactgg aaacagcgta ggaagtttct 721 gccggaagat atcgggcagt ctcccatggc ttccatgccg gatggggata aagttgacct 781 ggaagctttc agtgagttca ccaagataat caccccggca attaccagag tggtggactt 841 tgccaagaag ctgcccatgt tctctgagct gacttgtgaa gaccagatca tcctgttgaa 901 aggatgttgt atggagatca tgtctctccg tgctgctgta cgctacgatc cagacagcga 961 gaccctaacg ctgagcggag agatggctgt gaaacgggag cagcttaaga acggaggtct 1021 gggtgttgtc tctgatgcca tctttgacct cgggaggtcg cttgctgcgt ttaaccttga 1081 cgatacggaa gtggcgctgc tgcaggctgt tttgctaatg tcatcagacc gaactggttt 1141 aatctgcacg gacaagatag agaaatgtca agagacctac cttctcgcct ttgaacacta 1201 catcaaccat cgcaaacaca acattcccca cttctggccc aaactcctaa tgaaggtgac 1261 ggacctgcgc atgatagggg catgccatgc cagccgcttt ctgcacatga aggtcgagtg 1321 ccccaccgag ctctttccac cgctcttcct tgaggtcttt gaggaccagg aagtttgagg 1381 gacagtgcat gtcggtagag aggaaa // LOCUS XELTHYB 1406 bp ss-mRNA VRT 28-JUL-1990 DEFINITION X.laevis thyroid hormone receptor alpha B mRNA, complete cds. ACCESSION M35344 KEYWORDS thyroid hormone receptor protein. SOURCE X.laevis, cDNA to mRNA. ORGANISM Xenopus laevis Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Amphibia; Lissamphibia; Anura; Archeobatrachia; Pipoidea; Pipidae. REFERENCE 1 (bases 1 to 1406) AUTHORS Yaoita,Y., Shi,Y.-B. and Brown,D.D. TITLE The Xenopus laevis alpha and beta thyroid hormone receptors JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable [or printed] sequence for [1] kindly submitted by Y.Yaoita, 15-JUN-1990. Author address:Yoshio Yaoita Carnegie Inst of Washington 115 West University Parkway Baltimore, MD 21210 FEATURES from to/span description pept 122 1378 thyroid hormone receptor protein BASE COUNT 350 a 347 c 402 g 307 t ORIGIN 1 gtcgacctgt gagaggcgcc cgcccgcctc catgtgaaag ccacgcccca tgagccttgg 61 gcagctgggg gcggagccca ccttggtctc ttcggattgg ttctggatgg aattacgttg 121 aatggaccag aatctcagcg ggctggactg cttgtcagag ccagatgaaa aaaggtggcc 181 ggatgggaag cgaaaaagaa agaacagcca atgtatggga aaaagcggca tgtccggtga 241 cagcttggtg tctctgcccc ctgcagggta catccccagc tatctggaca aagatgagcc 301 atgcgtggtg tgcagtgata aggccacggg gtaccactac cgctgtatca cttgcgaggg 361 gtgcaagggt ttcttccgcc gcaccatcca gaagaacctg cacccctcct attcttgcaa 421 gtacgatggc tgctgcatta tcgacaaaat cacccgtaat cagtgccagc tctgccgctt 481 caagaaatgc attgccgttg gcatggcaat ggatcttgac ctggatgata gcaagcgggt 541 agccaagcga aaactgattg aagaggatcg agtgcggcgg cggaaggagg agatgatcaa 601 gactctgcaa cagtgtcccg agccaagcag cgaggagtgg gagttgattc gcattgtaac 661 agaagctcac aggagtacca atgcccaggg cagccattgg aaacagcgta ggaagtttct 721 gccagaagac atcggacagt ctcctatggc ttccatgcca gatggggata aagttgacct 781 ggaagctttc agtgagttca ccaaaataat caccccggca attaccagag tggtggactt 841 tgcgaagaag ctgcccatgt tctctgagct gacttgtgaa gaccagatca tcctgttgaa 901 aggatgttgt atggagatca tgtctcttcg tgctgctgtg cgctacgatc cagacagcga 961 gaccctaacg ctgagcggcg agatggcggt gaaacgggag cagcttaaga acggaggtct 1021 gggtgttgtc tctgatgcca tctttgacct tgggaggtcg cttgctgcgt tcaaccttga 1081 tgatacggaa gtggcactgt tgcaggctgt tttgctaatg tcatcagacc gtactggttt 1141 aatctgcaca gacaagatag agaaatgtca agagacctac cttctcgcct ttgaacacta 1201 catcaaccat cgcaaacaca acattcccca cttctggccc aagctcctaa tgaaggtgac 1261 ggacctgcgc atgatagggg catgccatgc cagctgcttt ctgcacatga aggtcgagtg 1321 ccccaccgag ctctttccac cgctcttcct tgaggtcttt gaggaccagg aagtttgagg 1381 gacagtgcat gtcggtagag aggaaa // LOCUS RATSIMPA1 205 bp ds-DNA ROD 28-JUL-1990 DEFINITION Rat simple sequence DNA, clone 5. ACCESSION M36626 KEYWORDS simple sequence DNA. SEGMENT 1 of 2 SOURCE Rat DNA. ORGANISM Rattus norvegicus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 205) AUTHORS Ivanova,M.N., Frolova,E.I. and Georgiev,G.L. TITLE Simple sequences of the rat genome detected by hybridization with adenovirus DNA JOURNAL Dokl. Biochem. 276, 189-193 (1984) STANDARD simple staff_entry BASE COUNT 52 a 53 c 79 g 21 t ORIGIN 1 cagctctgtc ctgttgtcgc ccttgggcag agttgtgcct cctgctcttc tttccctaag 61 gaggggcagc agcagcagca gcaggaggag caggaggagc agcagcagga gcagcaggag 121 cagcagcagc agcaggagga gcaggagcag cagcaggagc agcagcagca ggagcagcag 181 cagcagcagg agcaggagga gcagc // LOCUS RATSIMPA2 146 bp ds-DNA ROD 28-JUL-1990 DEFINITION Rat simple sequence DNA, clone 5. ACCESSION M32514 KEYWORDS simple sequence DNA. SEGMENT 2 of 2 SOURCE Rat DNA. ORGANISM Rattus norvegicus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 146) AUTHORS Ivanova,M.N., Frolova,E.I. and Georgiev,G.L. TITLE Simple sequences of the rat genome detected by hybridization with adenovirus DNA JOURNAL Dokl. Biochem. 276, 189-193 (1984) STANDARD simple staff_entry BASE COUNT 46 a 36 c 61 g 3 t ORIGIN About 500 base pairs after segment 1. 1 agcagcagca gcaggagcag caggaggagc agcaggagca ggagcagcag gagcagcagc 61 aggagcagga gcaggagcag caggagcagc aggagcagca gcaggagcag cagcagcagc 121 agcagcagca gcggtgcagc tccatg // LOCUS RATSIMPB 380 bp ds-DNA ROD 28-JUL-1990 DEFINITION Rat simple sequence DNA, clone 8. ACCESSION M32515 KEYWORDS simple sequence DNA. SOURCE Rat DNA. ORGANISM Rattus norvegicus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 380) AUTHORS Ivanova,M.N., Frolova,E.I. and Georgiev,G.L. TITLE Simple sequences of the rat genome detected by hybridization with adenovirus DNA JOURNAL Dokl. Biochem. 276, 189-193 (1984) STANDARD simple staff_entry BASE COUNT 131 a 96 c 138 g 15 t ORIGIN 1 tgatcattgc tgcaatccca cagcaggagc agcagcagga gcagcagcag cagcaggagc 61 aggagcagcc acaggaggag cagcaacaag aggcagcagc agcagcagga gcagcagcag 121 caggaggagc agcaacagga gcagcagcaa caggagcagc agcaggaaca gaacaggagc 181 agcagcagca ggaacaagga gtagcagcag cagcagcagg aacaggagaa gcagcagcag 241 cagcagcagc aggagcagga gcagcaggag cagcagcagc agcagtagga gcagcagcag 301 cagcaggagc agcagcagca gcagcaggag gagcagcagc agcagcagca cagcagcagg 361 gtacttggtg atcccttgac // LOCUS RATSIMPC 542 bp ds-DNA ROD 28-JUL-1990 DEFINITION Rat simple sequence DNA, clone 16. ACCESSION M36627 KEYWORDS simple sequence DNA. SOURCE Rat DNA. ORGANISM Rattus norvegicus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 542) AUTHORS Ivanova,M.N., Frolova,E.I. and Georgiev,G.L. TITLE Simple sequences of the rat genome detected by hybridization with adenovirus DNA JOURNAL Dokl. Biochem. 276, 189-193 (1984) STANDARD simple staff_entry BASE COUNT 76 a 255 c 142 g 69 t ORIGIN 1 ggatccaccg cctgagtagc cgccgccaca gctagagccg cctccacctc caccgccgtc 61 ggagtagccg cctccgcagc tggagccacc gccgccgccg ccgccggagt acttgccccc 121 ttcggaccgc cgccgcgacc accgggctgc cgctccagag gagcctccgc agtaggagcc 181 gccgcctcct gattcgtctc ctatagttgg agcctccgcc tccgtcggag tcgccgccgc 241 cgccgtagcc ggagccgccg ccgccgccgc ccgcctccgg agtaccttga cgccgccgcc 301 gccgccgccg gagtacttcg cccctccgga ccgccgccgc gaccagagaa ctgacgcccc 361 ctccggagcc gcctccgccg ccgcagctgg aaccacctcc ataggaacca ccgcctccgc 421 ctccgcctcc gcagccagag cctcctccag atgagccacc tccgcagctg ggagcctcca 481 ccgctaccac caccgctata gtaaccgcca ccgccgcctc ctcctccacc agaggtcttt 541 tc // LOCUS RATPSTIAA 2382 bp ss-mRNA ROD 28-JUL-1990 DEFINITION Rat pancreatic secretory trypsin inhibitor-like protein (PSTI) mRNA, complete cds. ACCESSION M35299 KEYWORDS monitor protein; pancreatic secretory trypsin inhibitor-like protein. SOURCE Rat (strain Wistar) adult pancreas, cDNA to mRNA, clone MP2. ORGANISM Rattus norvegicus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 427; 594 to 1693; 1954 to 2338) AUTHORS Fukuoka,S.-I. and Scheele,G. TITLE Rapid and selective cloning of monitor peptide, a novel CCK-releasing peptide, using minimal amino acid sequence and the polymerase chain reaction (PCR) JOURNAL Pancreas 4, 1-7 (1990) STANDARD full staff_review REFERENCE 2 (bases 1 to 2382) AUTHORS Fukuoka,S.-I. and Scheele,G. JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable [or printed] sequence for [1] kindly submitted by S.-I.Fukuoka, 19-JUN-1990. FEATURES from to/span description pept 10 249 pancreatic secretory trypsin inhibitor-like protein precursor sigp 10 63 pancreatic secretory trypsin inhibitor-like protein signal peptide matp 64 246 pancreatic secretory trypsin inhibitor-like protein signal 1665 1670 poly-A signal signal 2064 2069 poly-A signal signal 2303 2308 poly-A signal BASE COUNT 770 a 423 c 517 g 672 t ORIGIN 1 tctacaacca tgaaggtagc aattatcttt cttctcagtg ctttggccct gctcagttta 61 gcaggtaacc ctccagctga ggtgaatgga aaaacgccta attgccctaa gcaaattatg 121 ggatgtccca ggatttatga ccctgtgtgt gggactaacg gaattactta ccccagtgaa 181 tgcagtctgt gctttgaaaa caggaaattc ggaacatcta tccacattca gaggagaggg 241 acttgctgaa tgtcctgatt ttgaaatctt ttagggctac cataatgttt agcaagaagg 301 tttgctgaat aaatgcatct gaacatattt tgttcttccc aaagcttttg ctcaaaggca 361 tatatgagta tattgagaat agggatctga gaagaaaacc agagtagagc aagctttacc 421 acttagttct tcatgctcat acttcaaaaa ttgcagatga tgacaacaca tagttgagca 481 tgaacatgtg taatgaatag agtttgggtt aggatgaaga aggtagccta tctgtgcaca 541 agaaagaagt agactgactt ggatctttct taggggagtt taccaaagga aagactgcct 601 tgtatatcta cagtgtttca cttgtgagac accacaactc tgcagattta ctcttgttct 661 gtgaggaaac ttagaagagt caaattgttt gactaatagt ccaacataca tgatgccagg 721 gtgttctttt agatcaagct gacctcttcc ttcatccata tgagcactcc ttcttttaac 781 cacaatcttc tcttgtggat catgccttga ctttcttcaa tgggaatcct agataatatt 841 ccctactgta agatcttgca tgtctatatt cagtgataga atatagacgt gatataatag 901 gatataacca aatgaattag aaacaaggaa atattctcaa aagggaaagt atcaacaact 961 acttttaaaa aaggaatcat tttaagatcc tgagtttcta aagaaaatct tagtctaaga 1021 tggaaagaga gtaaagagct aacacaggtg agtctgggca aggaacccta gtacagtggg 1081 gttgggtcag cacctttgcc agaaataacc aagctattca gaaatacact aggaaaggag 1141 agttgcctag taacccactt ctggtcatat tcagtattca tgccttgaac tgaactcttg 1201 ctcctagagg atgctataac taacaaaccg agcaacttaa acagcctgac agctctcacc 1261 aaataccttg ctatctcaag ttatggatgc aagatggctc ccagtgtcta tctgtgattc 1321 tagaggacac ttgaagggca ccaacactta acaaattctg tgggggtaaa tttattttaa 1381 tcactggatg ctggaagaca cacacagaga cacaaacaca caaagagaga cagagagaga 1441 gaaagagaga gagagaggta gagagagaga gagagagaga gagggagaga gagggagaga 1501 gagagtgttt tgggttttgt tgttgttgtt gttgttgatt tggaattata tcaagatata 1561 agataatctc aaatgtatct ttagtagttc tgctccctgg acccatgaga agacaggaat 1621 gaggattctg tgcatgtggt acttacattt caaaaggagt atctaataaa ctggaaactg 1681 cttaaaagaa tgagactatc agcactgata agaatataaa gcttcaagct atgaagagtg 1741 attcaaagaa ggaaaagaat tccctcagaa ctgggaggac cttttaaaaa attctgagtc 1801 cccgtttcta aagtttcacc ttcctaactt catgtatttt ttaatagctc aaagagtcca 1861 attactgctg ctcatatact catgagtgtg acaccatgca ctgttactgc caatatatga 1921 aaggccatac ccctaaagaa aattgactta agaactcctt gtttagggtt gggtacttct 1981 gtgaccctcc cacattcatg ctggaatgtt gactggcttc atttttataa ggcaaaagat 2041 cttcccactc tcttctgaga gagaataaat cagttttgct caatggagtg attctgagta 2101 tactaatcac gatcccagga caggccccat tctcacaagc agttagctaa cacaaataga 2161 actccatatt ttatagcagt ttttatcttt tgttcttggt tttagttctt attttcaaga 2221 cagagaaaaa cacatgaagt tggaagggta gaagtggggg ggggcgtggg tctgggagga 2281 gttgggggat agagaaaaat ataataaaaa tatatgaaat tctcgagaat gaataaatgg 2341 aattcgatat caagcttatc gataccgtcg acctcgaggg gg // LOCUS RATPSTIBA 300 bp ss-mRNA ROD 28-JUL-1990 DEFINITION Rat pancreatic secretory trypsin inhibitor-like protein (PSTI) mRNA, 3' end. ACCESSION M35300 KEYWORDS monitor protein; pancreatic secretory trypsin inhibitor-like protein. SOURCE Rat (strain Wistar) adult pancreas, cDNA to mRNA, clone MP3. ORGANISM Rattus norvegicus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 300) AUTHORS Fukuoka,S.-I. and Scheele,G. TITLE Rapid and selective cloning of monitor peptide, a novel CCK-releasing peptide, using minimal amino acid sequence and the polymerase chain reaction (PCR) JOURNAL Pancreas 4, 1-7 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable [or printed] sequence for [1] kindly submitted by S.-I.Fukuoka, 19-JUN-1990. FEATURES from to/span description pept < 1 116 pancreatic secretory trypsin inhibitor-like protein precursor (AA at 3) matp < 1 113 pancreatic secretory trypsin inhibitor-like protein signal 285 290 poly-A signal BASE COUNT 98 a 57 c 65 g 80 t ORIGIN 1 gtcccaggat ttatgaccct gtgtgtggga ctaacggaat tacttacccc agtgaatgca 61 gtctgtgctt tgaaaacagg aaattcggaa catctatcca cattcagagg agatagagcg 121 tctgcaaaaa cagatcgaac ggcataagaa gaagattaat acctaaagaa tagtgaggca 181 ttgagtgcac acagtcagtc tctcacatag tggcagtatc attcccactc ttatagagat 241 tgttttgaat gattgatgtt tgaccatgtg tgctactaac agataataaa ttatcaccag // LOCUS SYNTRPA 3763 bp ds-DNA circular SYN 28-JUL-1990 DEFINITION Cloning vector pATH3 propagated in E.coli. ACCESSION M33622 KEYWORDS trpE' protein. SOURCE Synthetic DNA, clone pATH3. ORGANISM Cloning vector Artificial sequences; Cloning vehicles. REFERENCE 1 (bases 1 to 3763) AUTHORS Koerner,T.J., Hill,J.E., Myers,A.M. and Tzagoloff,A. TITLE High-expression vectors with multiple cloning sites for construction of trpE-fusion genes: pATH vectors JOURNAL Meth. Enzymol. (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by J.E.Hill, 06-APR-1990. Nucleotides 1-147 are provided as a personal communication from R.P.Gunsalus at the Dept. of Microbiology at UCLA. Construction of pATH3: 1. PvuII-HindIII fragment from the 5' end of the trp operon (through nt 1999 of ECOTGP, which is in trpD cds) was ligated to the HindIII-PvuII fragment of pBR322 containing the bla (= Amp-resistance) gene and origin of replication, but not the rop gene, which encodes a negative regulator of ColE1 replication. In addition, the EcoRI site in the pBR322 backbone was eliminated. This plasmid is pKRS101. (Spindler et al. M. Virol. 49, 132-141 (1984)) 2. The BglII-HindIII fragment (nt 1392 of trpE to the end of the trpD sequence present in pKRS101) was replaced with a BamHI-EcoRI fragment and an EcoRI-HindIII fragment, both from the MCS of M13mp12. This plasmid is pATH1 (see GenBank acc M32985 for more details). 3. The SmaI-SmaI fragment from the MCS of pATH1 was deleted and the remaining plasmid religated. This produced plasmid pATH2 (GenBank acc M33624) 4. An EcoRI linker was inserted at the remaining SmaI site of pATH2 replacing the SmaI site and changing the reading frames of the other sites in the MCS. This plasmid is pATH3. FEATURES from to/span description pept 423 > 1391 trpE' fusion protein BASE COUNT 926 a 942 c 946 g 949 t ORIGIN 1 cagctgtggt gtcatggtcg gtgatcgcta gggtgccgag cgcatctcga ctgcacggtg 61 caccaatgct tctggcgtca ggtagttatt ggaaagctgt ggtatggctg tgcaggtcgt 121 aaatcactgc ataactcgct gctgcctaag gcgcactccc gttctggata atgttttttg 181 cgccgacatc ataacggttc tggcaaatat tctgaaatga gctgttgaca attaatcatc 241 gaactagtta actagtacgc aagttcacgt aaaaagggta tcgacaatga aagcaatttt 301 cgtactgaaa ggttggtggc gcacttcctg aaacgggcag tgtattcacc atgcgtaaag 361 caatcagata cccagcccgc ctaatgagcg ggcttttttt tgaacaaaat tagagaataa 421 caatgcaaac acaaaaaccg actctcgaac tgctaacctg cgaaggcgct tatcgcgaca 481 atcccaccgc gctttttcac cagttgtgtg gggatcgtcc ggcaacgctg ctgctggaat 541 ccgcagatat cgacagcaaa gatgatttaa aaagcctgct gctggtagac agtgcgctgc 601 gcattacagc tttaggtgac actgtcacaa tccaggcact ttccggcaac ggcgaagccc 661 tcctggcact actggataac gccctgcctg cgggtgtgga aagtgaacaa tcaccaaact 721 gccgtgtgct gcgcttcccc cctgtcagtc cactgctgga tgaagacgcc cgcttatgct 781 ccctttcggt ttttgacgct ttccgtttat tgcagaatct gttgaatgta ccgaaggaag 841 aacgagaagc catgttcttc agcggcctgt tctcttatga ccttgtggcg ggatttgaag 901 atttaccgca actgtcagcg gaaaataact gccctgattt ctgtttttat ctcgctgaaa 961 cgctgatggt gattgaccat cagaaaaaaa gcacccgtat tcaggccagc ctgtttgctc 1021 cgaatgaaga agaaaaacaa cgtctcactg ctcgcctgaa cgaactacgt cagcaactga 1081 ccgaagccgc gccgccgctg ccagtggttt ccgtgccgca tatgcgttgt gaatgtaatc 1141 agagcgatga agagttcggt ggcgtagtgc gtttgttgca aaaagcgatt cgcgctggag 1201 aaattttcca ggtggtgcca tctcgccgtt tctctctgcc ctgcccgtca ccgctggcgg 1261 cctattacgt gctgaaaaag agtaatccca gcccgtacat gttttttatg caggataatg 1321 atttcaccct atttggcgcg tcgccggaaa gctcgctcaa gtatgatgcc accagccgcc 1381 agattgagat ccccccgaat tcggggggat cctctagagt cgacctgcag cccaagctta 1441 tcgatgataa gctgtcaaac atgagaatta attcttgaag acgaaagggc ctcgtgatac 1501 gcctattttt ataggttaat gtcatgataa taatggtttc ttagacgtca ggtggcactt 1561 ttcggggaaa tgtgcgcgga acccctattt gtttattttt ctaaatacat tcaaatatgt 1621 atccgctcat gagacaataa ccctgataaa tgcttcaata atattgaaaa aggaagagta 1681 tgagtattca acatttccgt gtcgccctta ttcccttttt tgcggcattt tgccttcctg 1741 tttttgctca cccagaaacg ctggtgaaag taaaagatgc tgaagatcag ttgggtgcac 1801 gagtgggtta catcgaactg gatctcaaca gcggtaagat ccttgagagt tttcgccccg 1861 aagaacgttt tccaatgatg agcactttta aagttctgct atgtggcgcg gtattatccc 1921 gtgttgacgc cgggcaagag caactcggtc gccgcataca ctattctcag aatgacttgg 1981 ttgagtactc accagtcaca gaaaagcatc ttacggatgg catgacagta agagaattat 2041 gcagtgctgc cataaccatg agtgataaca ctgcggccaa cttacttctg acaacgatcg 2101 gaggaccgaa ggagctaacc gcttttttgc acaacatggg ggatcatgta actcgccttg 2161 atcgttggga accggagctg aatgaagcca taccaaacga cgagcgtgac accacgatgc 2221 ctgcagcaat ggcaacaacg ttgcgcaaac tattaactgg cgaactactt actctagctt 2281 cccggcaaca attaatagac tggatggagg cggataaagt tgcaggacca cttctgcgct 2341 cggcccttcc ggctggctgg tttattgctg ataaatctgg agccggtgag cgtgggtctc 2401 gcggtatcat tgcagcactg gggccagatg gtaagccctc ccgtatcgta gttatctaca 2461 cgacggggag tcaggcaact atggatgaac gaaatagaca gatcgctgag ataggtgcct 2521 cactgattaa gcattggtaa ctgtcagacc aagtttactc atatatactt tagattgatt 2581 taaaacttca tttttaattt aaaaggatct aggtgaagat cctttttgat aatctcatga 2641 ccaaaatccc ttaacgtgag ttttcgttcc actgagcgtc agaccccgta gaaaagatca 2701 aaggatcttc ttgagatcct ttttttctgc gcgtaatctg ctgcttgcaa acaaaaaaac 2761 caccgctacc agcggtggtt tgtttgccgg atcaagagct accaactctt tttccgaagg 2821 taactggctt cagcagagcg cagataccaa atactgtcct tctagtgtag ccgtagttag 2881 gccaccactt caagaactct gtagcaccgc ctacatacct cgctctgcta atcctgttac 2941 cagtggctgc tgccagtggc gataagtcgt gtcttaccgg gttggactca agacgatagt 3001 taccggataa ggcgcagcgg tcgggctgaa cggggggttc gtgcacacag cccagcttgg 3061 agcgaacgac ctacaccgaa ctgagatacc tacagcgtga gcattgagaa agcgccacgc 3121 ttcccgaagg gagaaaggcg gacaggtatc cggtaagcgg cagggtcgga acaggagagc 3181 gcacgaggga gcttccaggg ggaaacgcct ggtatcttta tagtcctgtc gggtttcgcc 3241 acctctgact tgagcgtcga tttttgtgat gctcgtcagg ggggcggagc ctatggaaaa 3301 acgccagcaa cgcggccttt ttacggttcc tggccttttg ctggcctttt gctcacatgt 3361 tctttcctgc gttatcccct gattctgtgg ataaccgtat taccgccttt gagtgagctg 3421 ataccgctcg ccgcagccga acgaccgagc gcagcgagtc agtgagcgag gaagcggaag 3481 agcgcctgat gcggtatttt ctccttacgc atctgtgcgg tatttcacac cgcatatggt 3541 gcactctcag tacaatctgc tctgatgccg catagttaag ccagtataca ctccgctatc 3601 gctacgtgac tgggtcatgg ctgcgccccg acacccgcca acacccgctg acgcgccctg 3661 acgggcttgt ctgctcccgg catccgctta cagacaagct gtgaccgtct ccgggagctg 3721 catgtgtcag aggttttcac cgtcatcacc gaaacgcgcg agg // LOCUS SYNTRPB 3771 bp ds-DNA circular SYN 28-JUL-1990 DEFINITION Cloning vector pATH10, propagated in E.coli. ACCESSION M33623 KEYWORDS beta-lactamase; trpE' protein. SOURCE Synthetic DNA, clone pATH10. ORGANISM Cloning vector Artificial sequences; Cloning vehicles. REFERENCE 1 (bases 1 to 3771) AUTHORS Koerner,T.J., Hill,J.E., Myers,A.M. and Tzagoloff,A. TITLE High-expression vectors with multiple cloning sites for construction of trpE-fusion genes path vectors JOURNAL Meth. Enzymol. (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by J.E.Hill 06-APR-1990. Nucleotides 1-147 are provided as a personal communication from R.P.Gunsalus at the Dept. of Microbiology at UCLA. Construction of pATH10: 1. PvuII-HindIII fragment from the 5' end of the trp operon (through nt 1999 of ECOTGP, which is in the trpD cds) was ligated to the HindIII-PvuII fragment of pBR322 containing the bla (= Amp-resistance) gene and origin of replication, but not the rop gene, which encodes a negative regulator of ColE1 replication. In addition, the EcoRI site in the pBR322 backbone was eliminated. This plasmid is pKRS101. (Spindler et al. M. Virol. 49, 132-141 (1984)) 2. The BglII-HindIII fragment (nt 423 of trpE to the end of the trpD sequence present in pKRS101) was replaced with a BamHI-EcoRI fragment and an EcoRI-HindIII fragment, both from the MCS of M13mp12. This plasmid is pATH1 (see GenBank acc M32985 for more details). 3. The SmaI-SmaI fragment from the MCS of pATH1 was deleted and the remaining plasmid religated producing plasmid pATH2 (GenBank acc M33624). 4. An interim vector was constructed by inserting an EcoRI linker at the remaining SmaI site of pATH2. 5. The EcoRI-HindIII fragment of MCS in this interim vector was replaced with the EcoRI-HindIII fragment containing the MCS of M13mp12. 6. Tha AvaII-AvaII fragment that spanned the PstI site in the bla gene of this interim vector was replaced with the corresponding AvaII fragment from pUC8, eliminating this PstI site, making the PstI site in the MCS unique. This is plasmid pATH10. FEATURES from to/span description pept 423 1472 trpE' protein pept 1688 2548 beta-lactamase BASE COUNT 927 a 945 c 948 g 951 t ORIGIN 1 cagctgtggt gtcatggtcg gtgatcgcta gggtgccgag cgcatctcga ctgcacggtg 61 caccaatgct tctggcgtca ggtagttatt ggaaagctgt ggtatggctg tgcaggtcgt 121 aaatcactgc ataactcgct gctgcctaag gcgcactccc gttctggata atgttttttg 181 cgccgacatc ataacggttc tggcaaatat tctgaaatga gctgttgaca attaatcatc 241 gaactagtta actagtacgc aagttcacgt aaaaagggta tcgacaatga aagcaatttt 301 cgtactgaaa ggttggtggc gcacttcctg aaacgggcag tgtattcacc atgcgtaaag 361 caatcagata cccagcccgc ctaatgagcg ggcttttttt tgaacaaaat tagagaataa 421 caatgcaaac acaaaaaccg actctcgaac tgctaacctg cgaaggcgct tatcgcgaca 481 atcccaccgc gctttttcac cagttgtgtg gggatcgtcc ggcaacgctg ctgctggaat 541 ccgcagatat cgacagcaaa gatgatttaa aaagcctgct gctggtagac agtgcgctgc 601 gcattacagc tttaggtgac actgtcacaa tccaggcact ttccggcaac ggcgaagccc 661 tcctggcact actggataac gccctgcctg cgggtgtgga aagtgaacaa tcaccaaact 721 gccgtgtgct gcgcttcccc cctgtcagtc cactgctgga tgaagacgcc cgcttatgct 781 ccctttcggt ttttgacgct ttccgtttat tgcagaatct gttgaatgta ccgaaggaag 841 aacgagaagc catgttcttc agcggcctgt tctcttatga ccttgtggcg ggatttgaag 901 atttaccgca actgtcagcg gaaaataact gccctgattt ctgtttttat ctcgctgaaa 961 cgctgatggt gattgaccat cagaaaaaaa gcacccgtat tcaggccagc ctgtttgctc 1021 cgaatgaaga agaaaaacaa cgtctcactg ctcgcctgaa cgaactacgt cagcaactga 1081 ccgaagccgc gccgccgctg ccagtggttt ccgtgccgca tatgcgttgt gaatgtaatc 1141 agagcgatga agagttcggt ggcgtagtgc gtttgttgca aaaagcgatt cgcgctggag 1201 aaattttcca ggtggtgcca tctcgccgtt tctctctgcc ctgcccgtca ccgctggcgg 1261 cctattacgt gctgaaaaag agtaatccca gcccgtacat gttttttatg caggataatg 1321 atttcaccct atttggcgcg tcgccggaaa gctcgctcaa gtatgatgcc accagccgcc 1381 agattgagat cccccggaat tcgagctcgc ccggggatcc tctagagtcg acctgcagcc 1441 caagcttatc gatgataagc tgtcaaacat gagaattaat tcttgaagac gaaagggcct 1501 cgtgatacgc ctatttttat aggttaatgt catgataata atggtttctt agacgtcagg 1561 tggcactttt cggggaaatg tgcgcggaac ccctatttgt ttatttttct aaatacattc 1621 aaatatgtat ccgctcatga gacaataacc ctgataaatg cttcaataat attgaaaaag 1681 gaagagtatg agtattcaac atttccgtgt cgcccttatt cccttttttg cggcattttg 1741 ccttcctgtt tttgctcacc cagaaacgct ggtgaaagta aaagatgctg aagatcagtt 1801 gggtgcacga gtgggttaca tcgaactgga tctcaacagc ggtaagatcc ttgagagttt 1861 tcgccccgaa gaacgttttc caatgatgag cacttttaaa gttctgctat gtggcgcggt 1921 attatcccgt gttgacgccg ggcaagagca actcggtcgc cgcatacact attctcagaa 1981 tgacttggtt gagtactcac cagtcacaga aaagcatctt acggatggca tgacagtaag 2041 agaattatgc agtgctgcca taaccatgag tgataacact gcggccaact tacttctgac 2101 aacgatcgga ggaccgaagg agctaaccgc ttttttgcac aacatggggg atcatgtaac 2161 tcgccttgat cgttgggaac cggagctgaa tgaagccata ccaaacgacg agcgtgacac 2221 cacgatgcct gtagcaatgg caacaacgtt gcgcaaacta ttaactggcg aactacttac 2281 tctagcttcc cggcaacaat taatagactg gatggaggcg gataaagttg caggaccact 2341 tctgcgctcg gcccttccgg ctggctggtt tattgctgat aaatctggag ccggtgagcg 2401 tgggtctcgc ggtatcattg cagcactggg gccagatggt aagccctccc gtatcgtagt 2461 tatctacacg acggggagtc aggcaactat ggatgaacga aatagacaga tcgctgagat 2521 aggtgcctca ctgattaagc attggtaact gtcagaccaa gtttactcat atatacttta 2581 gattgattta aaacttcatt tttaatttaa aaggatctag gtgaagatcc tttttgataa 2641 tctcatgacc aaaatccctt aacgtgagtt ttcgttccac tgagcgtcag accccgtaga 2701 aaagatcaaa ggatcttctt gagatccttt ttttctgcgc gtaatctgct gcttgcaaac 2761 aaaaaaacca ccgctaccag cggtggtttg tttgccggat caagagctac caactctttt 2821 tccgaaggta actggcttca gcagagcgca gataccaaat actgtccttc tagtgtagcc 2881 gtagttaggc caccacttca agaactctgt agcaccgcct acatacctcg ctctgctaat 2941 cctgttacca gtggctgctg ccagtggcga taagtcgtgt cttaccgggt tggactcaag 3001 acgatagtta ccggataagg cgcagcggtc gggctgaacg gggggttcgt gcacacagcc 3061 cagcttggag cgaacgacct acaccgaact gagataccta cagcgtgagc attgagaaag 3121 cgccacgctt cccgaaggga gaaaggcgga caggtatccg gtaagcggca gggtcggaac 3181 aggagagcgc acgagggagc ttccaggggg aaacgcctgg tatctttata gtcctgtcgg 3241 gtttcgccac ctctgacttg agcgtcgatt tttgtgatgc tcgtcagggg ggcggagcct 3301 atggaaaaac gccagcaacg cggccttttt acggttcctg gccttttgct ggccttttgc 3361 tcacatgttc tttcctgcgt tatcccctga ttctgtggat aaccgtatta ccgcctttga 3421 gtgagctgat accgctcgcc gcagccgaac gaccgagcgc agcgagtcag tgagcgagga 3481 agcggaagag cgcctgatgc ggtattttct ccttacgcat ctgtgcggta tttcacaccg 3541 catatggtgc actctcagta caatctgctc tgatgccgca tagttaagcc agtatacact 3601 ccgctatcgc tacgtgactg ggtcatggct gcgccccgac acccgccaac acccgctgac 3661 gcgccctgac gggcttgtct gctcccggca tccgcttaca gacaagctgt gaccgtctcc 3721 gggagctgca tgtgtcagag gttttcaccg tcatcaccga aacgcgcgag g // LOCUS SYNTRPC 3753 bp ds-DNA SYN 28-JUL-1990 DEFINITION Cloning vector pATH2, propagated in E.coli. ACCESSION M33624 KEYWORDS beta-lactamase; trpE' protein. SOURCE Synthetic DNA, clone pATH2. ORGANISM Cloning vector Artificial sequences; Cloning vehicles. REFERENCE 1 (bases 1 to 3753) AUTHORS Koerner,T.J., Hill,J.E., Myers,A.M. and Tzagoloff,A. TITLE High-expression vectors with multiple cloning sites for construction of trpe-fusion genes path vectors JOURNAL Meth. Enzymol. (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by J.E.Hill 06-APR-1990. Nucleotides 1-147 are provided as a personal communication from R.P.Gunsalus at the Dept. of Microbiology at UCLA. Construction of pATH2: 1. PvuII-HindIII fragment from the 5' end of the trp operon (through nt 1999 of ECOTGP, which is in trpD cds) was ligated to the HindIII-PvuII fragment of pBR322 containing the bla (= Amp-resistance) gene and origin of replication, but not the rop gene, which encodes a negative regulator of ColE1 replication. In addition, the EcoRI site in the pBR322 backbone was eliminated. This plasmid is pKRS101. (Spindler et al. M. Virol. 49, 132-141 (1984)) 2. The BglII-HindIII fragment (nt 1392 of trpE to the end of the trpD sequence present in pKRS101) was replaced with a BamHI-EcoRI fragment and an EcoRI-HindIII fragment, both from the MCS of M13mp12. This plasmid is pATH1 (see GenBank acc M32985 for more details). 3. The SmaI-SmaI fragment from the MCS of pATH1 was deleted and the remaining plasmid religated. This produced plasmid pATH2 FEATURES from to/span description pept 423 1454 trpE' protein pept 1670 2530 beta-lactamase BASE COUNT 924 a 939 c 943 g 947 t ORIGIN 1 cagctgtggt gtcatggtcg gtgatcgcta gggtgccgag cgcatctcga ctgcacggtg 61 caccaatgct tctggcgtca ggtagttatt ggaaagctgt ggtatggctg tgcaggtcgt 121 aaatcactgc ataactcgct gctgcctaag gcgcactccc gttctggata atgttttttg 181 cgccgacatc ataacggttc tggcaaatat tctgaaatga gctgttgaca attaatcatc 241 gaactagtta actagtacgc aagttcacgt aaaaagggta tcgacaatga aagcaatttt 301 cgtactgaaa ggttggtggc gcacttcctg aaacgggcag tgtattcacc atgcgtaaag 361 caatcagata cccagcccgc ctaatgagcg ggcttttttt tgaacaaaat tagagaataa 421 caatgcaaac acaaaaaccg actctcgaac tgctaacctg cgaaggcgct tatcgcgaca 481 atcccaccgc gctttttcac cagttgtgtg gggatcgtcc ggcaacgctg ctgctggaat 541 ccgcagatat cgacagcaaa gatgatttaa aaagcctgct gctggtagac agtgcgctgc 601 gcattacagc tttaggtgac actgtcacaa tccaggcact ttccggcaac ggcgaagccc 661 tcctggcact actggataac gccctgcctg cgggtgtgga aagtgaacaa tcaccaaact 721 gccgtgtgct gcgcttcccc cctgtcagtc cactgctgga tgaagacgcc cgcttatgct 781 ccctttcggt ttttgacgct ttccgtttat tgcagaatct gttgaatgta ccgaaggaag 841 aacgagaagc catgttcttc agcggcctgt tctcttatga ccttgtggcg ggatttgaag 901 atttaccgca actgtcagcg gaaaataact gccctgattt ctgtttttat ctcgctgaaa 961 cgctgatggt gattgaccat cagaaaaaaa gcacccgtat tcaggccagc ctgtttgctc 1021 cgaatgaaga agaaaaacaa cgtctcactg ctcgcctgaa cgaactacgt cagcaactga 1081 ccgaagccgc gccgccgctg ccagtggttt ccgtgccgca tatgcgttgt gaatgtaatc 1141 agagcgatga agagttcggt ggcgtagtgc gtttgttgca aaaagcgatt cgcgctggag 1201 aaattttcca ggtggtgcca tctcgccgtt tctctctgcc ctgcccgtca ccgctggcgg 1261 cctattacgt gctgaaaaag agtaatccca gcccgtacat gttttttatg caggataatg 1321 atttcaccct atttggcgcg tcgccggaaa gctcgctcaa gtatgatgcc accagccgcc 1381 agattgagat ccccggggat cctctagagt cgacctgcag cccaagctta tcgatgataa 1441 gctgtcaaac atgagaatta attcttgaag acgaaagggc ctcgtgatac gcctattttt 1501 ataggttaat gtcatgataa taatggtttc ttagacgtca ggtggcactt ttcggggaaa 1561 tgtgcgcgga acccctattt gtttattttt ctaaatacat tcaaatatgt atccgctcat 1621 gagacaataa ccctgataaa tgcttcaata atattgaaaa aggaagagta tgagtattca 1681 acatttccgt gtcgccctta ttcccttttt tgcggcattt tgccttcctg tttttgctca 1741 cccagaaacg ctggtgaaag taaaagatgc tgaagatcag ttgggtgcac gagtgggtta 1801 catcgaactg gatctcaaca gcggtaagat ccttgagagt tttcgccccg aagaacgttt 1861 tccaatgatg agcactttta aagttctgct atgtggcgcg gtattatccc gtgttgacgc 1921 cgggcaagag caactcggtc gccgcataca ctattctcag aatgacttgg ttgagtactc 1981 accagtcaca gaaaagcatc ttacggatgg catgacagta agagaattat gcagtgctgc 2041 cataaccatg agtgataaca ctgcggccaa cttacttctg acaacgatcg gaggaccgaa 2101 ggagctaacc gcttttttgc acaacatggg ggatcatgta actcgccttg atcgttggga 2161 accggagctg aatgaagcca taccaaacga cgagcgtgac accacgatgc ctgcagcaat 2221 ggcaacaacg ttgcgcaaac tattaactgg cgaactactt actctagctt cccggcaaca 2281 attaatagac tggatggagg cggataaagt tgcaggacca cttctgcgct cggcccttcc 2341 ggctggctgg tttattgctg ataaatctgg agccggtgag cgtgggtctc gcggtatcat 2401 tgcagcactg gggccagatg gtaagccctc ccgtatcgta gttatctaca cgacggggag 2461 tcaggcaact atggatgaac gaaatagaca gatcgctgag ataggtgcct cactgattaa 2521 gcattggtaa ctgtcagacc aagtttactc atatatactt tagattgatt taaaacttca 2581 tttttaattt aaaaggatct aggtgaagat cctttttgat aatctcatga ccaaaatccc 2641 ttaacgtgag ttttcgttcc actgagcgtc agaccccgta gaaaagatca aaggatcttc 2701 ttgagatcct ttttttctgc gcgtaatctg ctgcttgcaa acaaaaaaac caccgctacc 2761 agcggtggtt tgtttgccgg atcaagagct accaactctt tttccgaagg taactggctt 2821 cagcagagcg cagataccaa atactgtcct tctagtgtag ccgtagttag gccaccactt 2881 caagaactct gtagcaccgc ctacatacct cgctctgcta atcctgttac cagtggctgc 2941 tgccagtggc gataagtcgt gtcttaccgg gttggactca agacgatagt taccggataa 3001 ggcgcagcgg tcgggctgaa cggggggttc gtgcacacag cccagcttgg agcgaacgac 3061 ctacaccgaa ctgagatacc tacagcgtga gcattgagaa agcgccacgc ttcccgaagg 3121 gagaaaggcg gacaggtatc cggtaagcgg cagggtcgga acaggagagc gcacgaggga 3181 gcttccaggg ggaaacgcct ggtatcttta tagtcctgtc gggtttcgcc acctctgact 3241 tgagcgtcga tttttgtgat gctcgtcagg ggggcggagc ctatggaaaa acgccagcaa 3301 cgcggccttt ttacggttcc tggccttttg ctggcctttt gctcacatgt tctttcctgc 3361 gttatcccct gattctgtgg ataaccgtat taccgccttt gagtgagctg ataccgctcg 3421 ccgcagccga acgaccgagc gcagcgagtc agtgagcgag gaagcggaag agcgcctgat 3481 gcggtatttt ctccttacgc atctgtgcgg tatttcacac cgcatatggt gcactctcag 3541 tacaatctgc tctgatgccg catagttaag ccagtataca ctccgctatc gctacgtgac 3601 tgggtcatgg ctgcgccccg acacccgcca acacccgctg acgcgccctg acgggcttgt 3661 ctgctcccgg catccgctta cagacaagct gtgaccgtct ccgggagctg catgtgtcag 3721 aggttttcac cgtcatcacc gaaacgcgcg agg // LOCUS SYNTRPD 3772 bp ds-DNA SYN 28-JUL-1990 DEFINITION Cloning vector pATH11, propagated in E.coli. ACCESSION M33625 KEYWORDS beta-lactamase; trpE' protein. SOURCE Synthetic DNA, clone pATH11. ORGANISM Cloning vector Artificial sequences; Cloning vehicles. REFERENCE 1 (bases 1 to 3772) AUTHORS Koerner,T.J., Hill,J.E., Myers,A.M. and Tzagoloff,A. TITLE High-expression vectors with multiple cloning sites for construction of trpe-fusion genes path vectors JOURNAL Meth. Enzymol. (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by J.E.Hill 06-APR-1990. Nucleotides 1-144 are provided as a personal communication from R.P.Gunsalus at the Dept. of Microbiology at UCLA. Construction of pATH11: 1. PvuII-HindIII fragment from the 5' end of the trp operon (through nt 1999 of ECOTGP, which is in the trpD cds) was ligated to the HindIII-PvuII fragment of pBR322 containing the bla (= Amp-resistance) gene and origin of replication, but not the rop gene, which encodes a negative regulator of ColE1 replication. In addition, the EcoRI site in the pBR322 backbone was eliminated. This plasmid is pKRS101. (Spindler et al. M. Virol. 49, 132-141 (1984)) 2. The BglII-HindIII fragment (nt 423 of trpE to the end of the trpD sequence present in pKRS101) was replaced with a BamHI-EcoRI fragment and an EcoRI-HindIII fragment, both from the MCS of M13mp12. This plasmid is pATH1 (see GenBank acc M32985 for more details). 3. The SmaI-SmaI fragment from the MCS of pATH1 was deleted and the remaining plasmid religated. This produced plasmid pATH2 (GenBank acc M33624). 4. An interim vector was constructed by inserting an EcoRI linker at the remaining SmaI site of pATH2. 5. The EcoRI-HindIII fragment of MCS in this interim vector was replaced with the EcoRI-HindIII fragment containing the MCS of M13mp12. 6. Tha AvaII-AvaII fragment that spanned the PstI site in the bla gene of this interim vector was replaced with the corresponding AvaII fragment from pUC8, eliminating this PstI site, making the PstI site in the MCS unique. This is plasmid pATH11. FEATURES from to/span description pept 423 1487 trpE' protein pept 1689 2549 beta-lactamase BASE COUNT 927 a 946 c 948 g 951 t ORIGIN 1 cagctgtggt gtcatggtcg gtgatcgcta gggtgccgag cgcatctcga ctgcacggtg 61 caccaatgct tctggcgtca ggtagttatt ggaaagctgt ggtatggctg tgcaggtcgt 121 aaatcactgc ataactcgct gctgcctaag gcgcactccc gttctggata atgttttttg 181 cgccgacatc ataacggttc tggcaaatat tctgaaatga gctgttgaca attaatcatc 241 gaactagtta actagtacgc aagttcacgt aaaaagggta tcgacaatga aagcaatttt 301 cgtactgaaa ggttggtggc gcacttcctg aaacgggcag tgtattcacc atgcgtaaag 361 caatcagata cccagcccgc ctaatgagcg ggcttttttt tgaacaaaat tagagaataa 421 caatgcaaac acaaaaaccg actctcgaac tgctaacctg cgaaggcgct tatcgcgaca 481 atcccaccgc gctttttcac cagttgtgtg gggatcgtcc ggcaacgctg ctgctggaat 541 ccgcagatat cgacagcaaa gatgatttaa aaagcctgct gctggtagac agtgcgctgc 601 gcattacagc tttaggtgac actgtcacaa tccaggcact ttccggcaac ggcgaagccc 661 tcctggcact actggataac gccctgcctg cgggtgtgga aagtgaacaa tcaccaaact 721 gccgtgtgct gcgcttcccc cctgtcagtc cactgctgga tgaagacgcc cgcttatgct 781 ccctttcggt ttttgacgct ttccgtttat tgcagaatct gttgaatgta ccgaaggaag 841 aacgagaagc catgttcttc agcggcctgt tctcttatga ccttgtggcg ggatttgaag 901 atttaccgca actgtcagcg gaaaataact gccctgattt ctgtttttat ctcgctgaaa 961 cgctgatggt gattgaccat cagaaaaaaa gcacccgtat tcaggccagc ctgtttgctc 1021 cgaatgaaga agaaaaacaa cgtctcactg ctcgcctgaa cgaactacgt cagcaactga 1081 ccgaagccgc gccgccgctg ccagtggttt ccgtgccgca tatgcgttgt gaatgtaatc 1141 agagcgatga agagttcggt ggcgtagtgc gtttgttgca aaaagcgatt cgcgctggag 1201 aaattttcca ggtggtgcca tctcgccgtt tctctctgcc ctgcccgtca ccgctggcgg 1261 cctattacgt gctgaaaaag agtaatccca gcccgtacat gttttttatg caggataatg 1321 atttcaccct atttggcgcg tcgccggaaa gctcgctcaa gtatgatgcc accagccgcc 1381 agattgagat ccccccggaa ttcgagctcg cccggggatc ctctagagtc gacctgcagc 1441 ccaagcttat cgatgataag ctgtcaaaca tgagaattaa ttcttgaaga cgaaagggcc 1501 tcgtgatacg cctattttta taggttaatg tcatgataat aatggtttct tagacgtcag 1561 gtggcacttt tcggggaaat gtgcgcggaa cccctatttg tttatttttc taaatacatt 1621 caaatatgta tccgctcatg agacaataac cctgataaat gcttcaataa tattgaaaaa 1681 ggaagagtat gagtattcaa catttccgtg tcgcccttat tccctttttt gcggcatttt 1741 gccttcctgt ttttgctcac ccagaaacgc tggtgaaagt aaaagatgct gaagatcagt 1801 tgggtgcacg agtgggttac atcgaactgg atctcaacag cggtaagatc cttgagagtt 1861 ttcgccccga agaacgtttt ccaatgatga gcacttttaa agttctgcta tgtggcgcgg 1921 tattatcccg tgttgacgcc gggcaagagc aactcggtcg ccgcatacac tattctcaga 1981 atgacttggt tgagtactca ccagtcacag aaaagcatct tacggatggc atgacagtaa 2041 gagaattatg cagtgctgcc ataaccatga gtgataacac tgcggccaac ttacttctga 2101 caacgatcgg aggaccgaag gagctaaccg cttttttgca caacatgggg gatcatgtaa 2161 ctcgccttga tcgttgggaa ccggagctga atgaagccat accaaacgac gagcgtgaca 2221 ccacgatgcc tgtagcaatg gcaacaacgt tgcgcaaact attaactggc gaactactta 2281 ctctagcttc ccggcaacaa ttaatagact ggatggaggc ggataaagtt gcaggaccac 2341 ttctgcgctc ggcccttccg gctggctggt ttattgctga taaatctgga gccggtgagc 2401 gtgggtctcg cggtatcatt gcagcactgg ggccagatgg taagccctcc cgtatcgtag 2461 ttatctacac gacggggagt caggcaacta tggatgaacg aaatagacag atcgctgaga 2521 taggtgcctc actgattaag cattggtaac tgtcagacca agtttactca tatatacttt 2581 agattgattt aaaacttcat ttttaattta aaaggatcta ggtgaagatc ctttttgata 2641 atctcatgac caaaatccct taacgtgagt tttcgttcca ctgagcgtca gaccccgtag 2701 aaaagatcaa aggatcttct tgagatcctt tttttctgcg cgtaatctgc tgcttgcaaa 2761 caaaaaaacc accgctacca gcggtggttt gtttgccgga tcaagagcta ccaactcttt 2821 ttccgaaggt aactggcttc agcagagcgc agataccaaa tactgtcctt ctagtgtagc 2881 cgtagttagg ccaccacttc aagaactctg tagcaccgcc tacatacctc gctctgctaa 2941 tcctgttacc agtggctgct gccagtggcg ataagtcgtg tcttaccggg ttggactcaa 3001 gacgatagtt accggataag gcgcagcggt cgggctgaac ggggggttcg tgcacacagc 3061 ccagcttgga gcgaacgacc tacaccgaac tgagatacct acagcgtgag cattgagaaa 3121 gcgccacgct tcccgaaggg agaaaggcgg acaggtatcc ggtaagcggc agggtcggaa 3181 caggagagcg cacgagggag cttccagggg gaaacgcctg gtatctttat agtcctgtcg 3241 ggtttcgcca cctctgactt gagcgtcgat ttttgtgatg ctcgtcaggg gggcggagcc 3301 tatggaaaaa cgccagcaac gcggcctttt tacggttcct ggccttttgc tggccttttg 3361 ctcacatgtt ctttcctgcg ttatcccctg attctgtgga taaccgtatt accgcctttg 3421 agtgagctga taccgctcgc cgcagccgaa cgaccgagcg cagcgagtca gtgagcgagg 3481 aagcggaaga gcgcctgatg cggtattttc tccttacgca tctgtgcggt atttcacacc 3541 gcatatggtg cactctcagt acaatctgct ctgatgccgc atagttaagc cagtatacac 3601 tccgctatcg ctacgtgact gggtcatggc tgcgccccga cacccgccaa cacccgctga 3661 cgcgccctga cgggcttgtc tgctcccggc atccgcttac agacaagctg tgaccgtctc 3721 cgggagctgc atgtgtcaga ggttttcacc gtcatcaccg aaacgcgcga gg // LOCUS ECOTRMF 77 bp ss-tRNA RNA 28-JUL-1990 DEFINITION E. coli initiator Met-tRNA-f. ACCESSION K00305 M25117 KEYWORDS transfer RNA; transfer RNA-Met. SOURCE E. coli (strain CA265) tRNA. ORGANISM Escherichia coli Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Facultatively anaerobic rods; Enterobacteriaceae. REFERENCE 1 (bases 1 to 77) AUTHORS Dube,S.K. and Marcker,K.A. TITLE The nucleotide sequence of N-formyl-methionyl-transfer RNA: Partial digestion with pancreatic and T-1 ribonuclease and derivation of the total primary structure JOURNAL Eur. J. Biochem. 8, 256-262 (1969) STANDARD full staff_review REFERENCE 2 (bases 1 to 77) AUTHORS Uemura,H., Imai,M., Ohtsuka,E., Ikehara,M. and Soell,D. TITLE E. coli initiator tRNA analogs with different nucleotides in the discriminator base position JOURNAL Nucleic Acids Res. 10, 6531-6539 (1982) STANDARD full staff_review REFERENCE 3 (sites) AUTHORS Dahlberg,J.E., Kintner,C. and Lund,E. TITLE Specific binding of tRNA-Met-f to 23S rRNA of Escherichia coli JOURNAL Proc. Natl. Acad. Sci. U.S.A. 75, 1071-1075 (1978) STANDARD simple staff_entry COMMENT [1] Contributed on tape April 1983 by M.Sprinzl & D.H.Gauss from their entry 1310 in Nucleic Acids Res. 11, r1-r54 (1983). [1] notes that there may be either another minor Met-tRNA-f or a modification of this sequence, because there is a small amount of an oligonucleotide which shows that base 47 is adenosine instead of m7g. [2] generated all possible substitutions at the fourth base up from the 3' end (position 74); all four variants accepted methionine in in-vitro aminoacylation reactions, implying that the "discriminator hypothesis" is incorrect. FEATURES from to/span description tRNA 1 77 Met-tRNA (NAR: 1310) modified 8 8 s4u modified 21 21 d modified 33 33 cm anticdn 35 37 Met-tRNA-f anticodon cat modified 47 47 m7g modified 55 55 t modified 56 56 f BASE COUNT 14 a 26 c 25 g 12 t ORIGIN 5' end of mature Met-tRNA-f. 1 cgcggggtgg agcagcctgg tagctcgtcg ggctcataac ccgaaggtcg tcggttcaaa 61 tccggccccc gcaacca // LOCUS MCPRNA3A 80 bp ss-RNA VRL 28-JUL-1990 DEFINITION Cowpea mosaic virus M RNA 3' terminal sequence. ACCESSION M25438 KEYWORDS . SOURCE Cowpea mosaic virus RNA. ORGANISM Cowpea mosaic virus Viridae; ss-RNA nonenveloped viruses; Comoviridae. REFERENCE 1 (bases 1 to 80) AUTHORS Davies,J.W., Stanley,J. and Van Kammen,A. TITLE Sequence homology adjacent to the 3' terminal poly(A) of cowpea mosaic virus RNAs JOURNAL Nucleic Acids Res. 7, 493-500 (1979) STANDARD simple staff_entry BASE COUNT 22 a 13 c 13 g 32 t ORIGIN 1 tatgaattta atttcttttg tgagctcctg tttagcaggt cgtcccttca gcaaggacac 61 aaaaagattt taattttatt // LOCUS MCPRNA3B 80 bp ss-RNA VRL 28-JUL-1990 DEFINITION Cowpea mosaic virus B RNA 3' terminal sequence. ACCESSION M25439 KEYWORDS . SOURCE Cowpea mosaic virus RNA. ORGANISM Cowpea mosaic virus Viridae; ss-RNA nonenveloped viruses; Comoviridae. REFERENCE 1 (bases 1 to 80) AUTHORS Davies,J.W., Stanley,J. and Van Kammen,A. TITLE Sequence homology adjacent to the 3' terminal poly(A) of cowpea mosaic virus RNAs JOURNAL Nucleic Acids Res. 7, 493-500 (1979) STANDARD simple staff_entry BASE COUNT 22 a 12 c 13 g 33 t ORIGIN 1 taaataatgc ttatgttttt gtttgctcct gtttagcagg tcgttccttc agcaagaaca 61 acaaaaatat gtgttttatt // LOCUS PPCCGAAA 5306 bp ds-DNA VRL 28-JUL-1990 DEFINITION Hamster papovavirus complete genome. ACCESSION M26281 KEYWORDS complete genome. SOURCE Hamster papovavirus DNA. ORGANISM Hamster papovavirus Viridae; ds-DNA nonenveloped viruses; Papovaviridae; Papillomavirus. REFERENCE 1 (bases 1 to 5306) AUTHORS Delmas,V., Bastien,C., Scherneck,S. and Feunteun,J. TITLE A new member of the polyomavirus family: The hamster papovavirus. Complete nucleotide sequence and transformation properties JOURNAL EMBO J. 4, 1279-1286 (1985) STANDARD simple staff_entry FEATURES from to/span description ORF 192 839 early proteins polyprotein (T antigens) ORF 5083 4046 (c) VP2 ORF 4711 4046 (c) VP2 ORF 4045 2927 (c) VP1 BASE COUNT 1595 a 1124 c 1080 g 1507 t ORIGIN 1 ccccttgcct ccttagctct caagtagaaa aggaagagag gcttttgggg ctttttggct 61 ttaagcctca ttttatgagc aggaggagct tgttgcaact tgagaggcgt tttgaggctt 121 ccaggcagag aatactcaca gaccccacac agtctagacg ctcagaagca tctctagctg 181 caacaagcaa gatggataga attcttacta aagaagaaaa gcaagcctta ataagtttac 241 tagatttgga gccacaatat tggggagact atggacgaat gcagaaatgc tacaagaaaa 301 agtgtcttca actgcatcct gataaaggtg gcaatgaaga gctcatgcaa cagcttaata 361 ccctgtggac caaactaaaa gatggtcttt acagagttag gctgttactt gggcctagtc 421 aggtaagaag acttggaaaa gatcagtgga atttatcttt acagcaaaca ttttctggta 481 cctactttag gaggctctgc agactcccca ttacctgcct aagaaacaag ggaattagta 541 cctgcaattg catactttgt ttgctcagaa aacagcattt tctgctaaag aagtcctgga 601 gagtaccttg cctggtgtta ggagaatgct actgcataga ctgctttgcc ttatggtttg 661 gcctgccagt taccaatatg ctggttccat tatatgcaca atttcttgct ccaatacctg 721 tggattggct tgatctgaat gttcatgagg tctacaatcc ggcctcaggt atgtatgaat 781 atggggggct tatagttgta actgtacaag tttaaaatgt gcttttttca ggaccctaat 841 gcttccacct ccaccagcag acccggagag ttctacaatc ctgacacagg aggatactgg 901 tcctactctt atgggtcagc aggatactct gaccagcaga agaaatactg ggaagagttt 961 ttctctaagt gggatgttaa tgaggacctc acctgccaag aagagttatc atcatcagaa 1021 gatgaattca ccccctggca tcccaatccc cccccctccc ctgtttctat ttccagtgac 1081 agctccagtt cctcctgtga cgaggaatac ccaagaaact caagcagaaa gagaaaacga 1141 gtacatgcca atggctcccc aaatacacct atacagccaa ataagagagc ccacacacca 1201 ggaggaggaa gaaccacaat acgaggagat accgatatac ctagaactcc tgccagagaa 1261 tcccaatcaa catttggctc ttacttcaac agcacggagg agcttgagga ggaaatatca 1321 caaacacaac agtcacatca taacacaacg ccaaagaaac cgcctccgac ggttagtcct 1381 gatgattttc ctactatcct tagggggttt ctttctcacg ctattttttc taataaaacg 1441 caaaatgcat ttataatcta cagtactaag gaaaaatgtg aagtacttta tgaacaaata 1501 gacaaatata atccagacta taaaggtatc ttcattatga aacaaacaga agcatttgta 1561 atgtttatga ctcctggaaa acatagagta gctgcagtta aaagttactg ttgtaaattt 1621 tgtaccgtta gcttcctgct atgcaaagct gttacaaaac cgttagagtt gtataactgt 1681 gtggctaaat gtgatgactt tcaaatttta aaagaaaata agcctggtct atatcatttt 1741 gaattctgtg atgaaaaaaa agaggtgaag caaatagact ggaatttcct aacatctttt 1801 gcagttgaaa atgagttaga tgatcctctt gtaattatgg gacattatct agaatttagt 1861 cagtgtgaaa gctcttgcaa aaagtgtgca gaagctttac caaggatgaa agtccactgg 1921 gctaaccaca gtcagcactt agagaatgct gagcttttct tacactgcaa acaacagaaa 1981 agtatctgtc agcaagcagc agataatgtt ctggcaagga gaagattaaa ggtccttgaa 2041 tcaacaagac aagaattgtt ggcagagaga ctgaacaaac tgttagacca attaaaagat 2101 ttatctcctg tagataagca tttatatctt gctggagtag cctggtacca atgtatgttt 2161 cctgattttg agatgatgtt attagatatt ttaaaattgt ttactgaaaa tgttccaaaa 2221 aaaagaaatg tactttttag aggtcctgta aattcaggga aaactagcct tgctgcagct 2281 atcatgaatc ttgtaggagg agttgccctc aatgttaatt gtcctgcaga taagctcaac 2341 tttgaacttg gtgttgctat agataaattt gcagtagtct ttgaagatgt caaaggacaa 2401 accggagata agagacacct acagtctgga cttggaatta ataaccttga taacctgaga 2461 gattaccttg atggaagtgt aaaggttaat ttagaaaaga agcatgtaaa taagaggtcc 2521 cagatatttc ctccttgtat tgttactgct aatgaatatt tttttcctca aacactctat 2581 gccagattcc ataaagttta taactttgaa gtgaaggatt ttcttgccaa gagccttgag 2641 gaaaacagtt acatggggag acatagagtc tgtcaaagtc cacttacaat gctgatagca 2701 ttgctttgga atgtacccac tgaaaatttt gataagtctc tcaaagagaa ggtggaaaca 2761 gaaaagaagg ttttgtctga tatgtgtaac tttactacat ttgcagaaat gtgtctcaat 2821 attcagaggg gtgctgatcc ccttgaggca ttgtaattga ggaggaaaca ataattgatg 2881 aataaagcat ttattagaag ctctgtgtac agtcattttt caagcattag tttgctggtt 2941 ttgcaggggg tttagtatgc tgttggccat acttgtcaat gaacctattc acatctgggt 3001 caccaggaac agcctctgta ccctcataaa tcctgacttc ttctacctga gcagcttctc 3061 cttccatggg ctggccttca attgttggaa gcatattgtt gtacaaagaa gctagcaagc 3121 ttgtaactgg gtaaggattt ttcacccatc tttttctcaa ggtcacatta aaatatctag 3181 gcagccccct ccaatgccag cctgcactgt tgtattctat gtaccagccc ataacatctg 3241 ctgcactgag ataaagccca tctcctttgc aaagaggccc aaccccattt tcatccagaa 3301 gcacagtagt caaggtatta gtaaactgca tcactggtgg agtaccagta ccacctgtga 3361 ggtacctacc atccttgtcc aattttgctt ttgcagtagg gtccagcacc tggtttgtgg 3421 aagtcattgc tttgccagta acagttttga tactaacaat agctgcctca taatttgcat 3481 tatagttctg cactaggcct tgcaaatcta atggttctcc tcccactgca aacatgtggt 3541 aagttgtacc ctcaactggt ttggaaattc caatatcctt tgtctcactt ctggagccat 3601 atccatgcac atttagaagg gatcccactc caacaacttc agtttttaca gatacagcct 3661 cccacatttg aagggtatca caggtcaaat cttcattcag tgttggaagc tgtattttag 3721 ccatactgta atatggcagt tgattagcct tcacttcatc agcagtaagg gagctattta 3781 ctttaatact ctgggagaac ccataatact ggccatcagt tcctgtgcca ggcttgttct 3841 gacccattct aggattaagg taggcctcaa tttgtgtgat actgtcttct cctgttacaa 3901 gatcaagcac acccacacca ccccgcataa taagcttggg aacattagca ggctttggac 3961 agggtttcca caggggtttg cacatctact ggaagcgccg ctttttcttt ttggggccat 4021 actcaacctc atcaatgtat gtctgccaag taggactaat gtctccgtac aatcctagaa 4081 ttaaaggaag catccaatca ggtgtcactc tttggtgggc tccaccagga gcaaaatacc 4141 tcatgatatt tgcccctgat tcaaaccaac tagaagagtc ttcctgctgt tgacttcttc 4201 tctggacatc aggtctccct agttcagctt ctaatacttg tctgctattg gcatcttcaa 4261 tagaaggtct attactgtat tctaaagctc tttctatttg tcttctttga gctggattaa 4321 ttcctggaag ttctgcatag tagttttgta ggccaccata tattctacta taggcctctc 4381 taggtaaatt agtaacaacc catctactat tttccatcat tctggcaatg gcatctaaaa 4441 attgatgggt ggtctgcaaa cttaagtccc tcacagcaga ttctacagct ccttgagttt 4501 cccttctcaa agtatcccag atatactctc ctacagattg aaataatgag tggccccagc 4561 catgaataac atccaaggca tgggtaaatg actgtacacc agggaataat atatcatagt 4621 agtcagctgg tctcctggga ataagtgcca tgtttctatt cacaatcggt acttcgtgag 4681 caagatagcc gtgtagactt cccaaagaga aggctgctga acctgcaaca gtttgaaaaa 4741 taaatgctgt ctggactgat tctctcacaa actcagtcat tactgttgat gttaactcag 4801 gggcagcttg cataaatata aacatgtctt cacttaggcc aattgaagac aaagctgtct 4861 cagctcctaa aaacccctcc atagttatta atgaagtaac ttgggcatct atcgcggcaa 4921 aggcttctcc actaagtatg gcctctactg aaattccagt aactgatgaa atttcggaga 4981 ggtagctgat catctcaata atcactgaaa tggcagatcc catgttgact tacttgaaca 5041 gtttgaaaat cttctgaact gtttcaggca ggtttttagg ccgaattcta aagaaacaga 5101 aagcaaacac tcagcgccga agagcaggaa atggctgacc actgcacttg ggcgacacga 5161 cacgcctagc gataaggaag tcaccatggc aacataaccg cagcactgct gttgtcacag 5221 ttgcctagca aatgacagac tcagcaacca caggagagga aatgataggg ctagcatttt 5281 ttcaaatgta aaccagaggc tagggg // LOCUS RATGST2YB 500 bp ss-mRNA ROD 28-JUL-1990 DEFINITION Rat liver glutathione S-transferase Ya subunit mRNA. ACCESSION M26874 KEYWORDS S-transferase; glutathione S-transferase; ligandin; transferase. SOURCE Rat liver cDNA to mRNA. ORGANISM Rattus norvegicus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 500) AUTHORS Daniel,V., Sarid,S., Bar-Nun,S. and Litwack,G. TITLE Rat ligandin mRNA molecular cloning and sequencing JOURNAL Arch. Biochem. Biophys. 227, 266-271 (1983) STANDARD simple staff_entry FEATURES from to/span description pept < 1 402 glutathione S-transferase Ya subunit (AA at 1) BASE COUNT 153 a 112 c 109 g 126 t ORIGIN 1 gccctgattg acatgtattc agagggtatt ttagatctga ctgaaatgat tatccaattg 61 gtaatatgtc ccccagacca aagagaagcc aagaccgcct tggcaaaaga caggaccaaa 121 aaccggtact tgcctgcctt tgaaaaggtg ttgaagagcc atggccaaga ctaccttgta 181 ggtaacaggc tgacccgggt agacatccac ctgctggaac ttctcctcta tgttgaagag 241 tttgatgcca gccttctgac ctctttccct ctgctgaagg ccttcaagag cagaatcagc 301 agcctcccca atgtgaagaa gttcctgcag cctggcagtc agagaaagct tcccgtggat 361 gcaaaacaaa tcgaagaagc aaggaagatt ttcaagtttt agcggagctg cactatccaa 421 tttctttatg ttttgcaaaa aatgagaagc aattgttgat cctaggtatt tttgaaataa 481 taaacacgaa aaaatactct // LOCUS CPARBCSL 528 bp ds-DNA PLN 28-JUL-1990 DEFINITION C.paradoxa ribulose-1,5-bisphosphate carboxylase/oxygenase large (rbcL) and small (rbcS) subunits, 3' end and complete cds. ACCESSION M35728 KEYWORDS ribulose-1,5-bisphosphate carboxylase/oxygenase. SOURCE C.paradoxa DNA. ORGANISM Cyanophora paradoxa Eukaryota; Plantae; Thallobionta; Chromophycota; Cryptophyceae; Cryptomonadales; Kathablepharidaceae. REFERENCE 1 (bases 1 to 528) AUTHORS Starnes,S.M., Lambert,D.H., Maxwell,E.S., Stevens,S.E.Jr., Porter,R.D. and Shively,J.M. TITLE Cotranscription of the large and small subunit genes of ribulose- 1,5-bisphosphate carboxylase/oxygenase in Cyanophora paradoxa JOURNAL FEMS Microbiol. Lett. 28, 165-169 (1985) STANDARD simple staff_review FEATURES from to/span description pept < 1 18 ribulose-1,5-bisphosphate carboxylase/oxygenase large subunit (rbcL) pept 124 444 ribulose-1,5-bisphosphate (AA at 1) carboxylase/oxygenase small subunit (rbcS) BASE COUNT 189 a 85 c 60 g 194 t ORIGIN 1 actattgata ctatctaata tcatttaatt tatttaatta tttagagttt aaaactctaa 61 ataattaatc aaaatgatat tacttcaatc tatttttacc ttaaaattcg gaattataaa 121 taaatgcaac ttagagtaga acgtaagttc gaaacttttt cttatttacc accattaaac 181 gaccaacaga ttgcgcgtca attacaatac gcactttcca atggttatag cccagcaatc 241 gaattcagtt ttacaggtaa agctgaagac ttagtatgga ctttatggaa attaccttta 301 tttggtgcac aatctcctga agaagtactt agcgaaattc aagcttgtaa acaacagttc 361 cctaatgctt acattcgtgt tgtagcattt gactctatca gacaagttca aactttaatg 421 ttcttagttt acaaaccatt atagtttaat tgatatctac tctaattgat agatatcaat 481 ttttaattaa tctacaaaac aaaattatct aattattatt aatacttt // LOCUS HUMCFIX 873 bp ss-mRNA PRI 28-JUL-1990 DEFINITION Human coagulation factor IX mRNA, partial cds. ACCESSION M35672 KEYWORDS coagulation factor IX; serine protease. SOURCE Human adult liver, cDNA to mRNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 873) AUTHORS Jagadeeswaran,P., Lavelle,D.E., Kaul,R., Mohandas,T. and Warren,S.T. TITLE Isolation and characteriztion of human factor IX cDNA: Identification of Taq I polymorphism and regional assignment JOURNAL Somat. Cell Mol. Genet. 10, 465-473 (1984) STANDARD simple staff_review FEATURES from to/span description pept < 1 > 873 coagulation factor IX (AA at 1) BASE COUNT 279 a 146 c 205 g 243 t ORIGIN 1 aacgccaaca aaattctgaa tcggccaaag aggtataatt caggtaaatt ggaagagttt 61 gttcaaggga accttgagag agaatgtatg gaagaaaagt gtagttttga agaagcacga 121 gaagtttttg aaaacactga aagaacaact gaattttgga agcagtatgt tgatggagat 181 cagtgtgagt ccaatccatg tttaaatggc ggcagttgca aggatgacat taattcctat 241 gaatgttggt gtccctttgg atttgaagga aagaactgtg aattagatgt aacatgtaac 301 attaagaatg gcagatgcga gcagttttgt aaaaatagtg ctgataacaa ggtggtttgc 361 tcctgtactg agggatatcg acttgcagaa aaccagaagt cctgtgaacc agcagtgcca 421 tttccatgtg gaagagtttc tgtttcacaa acttctaagc tcacccgtgc tgagactgtt 481 tttcctgatg tggactatgt aaattctact gaagctgaaa ccattttgga taacatcact 541 caaagcaccc aatcatttaa tgacttcact cgggttgttg gtggagaaga tgccaaacca 601 ggtcaattcc cttggcaggt tgttttgaat ggtaaagttg atgcattctg tggaggctct 661 atcgttaatg aaaaatggat tgtaactgct gcccactgtg ttgaaactgg tgttaaaatt 721 acagttgtcg caggtgaaca taatattgag gagacagaac atacagagca aaagcgaaat 781 gtgattcgaa ttattcctca ccacaactac aatgcagcta ttaataagta caaccatgac 841 attgcccttc tggaactgga cgaaccctta gtg // LOCUS HUMMHDRBPV 292 bp ds-DNA PRI 28-JUL-1990 DEFINITION Human MHC class II HLA-DR-beta-I allele gene, partial cds. ACCESSION M35651 KEYWORDS cell surface glycoprotein; class II gene; integral membrane protein; major histocompatibility complex. SOURCE Human (Pemphigus vulgaris patient, haplotype DR4 Dw10) blood DNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 292) AUTHORS Scharf,S.J., Long,C.M. and Erlich,H.A. TITLE Sequence analysis of the HLA-Dr-beta and HLA-DQ-beta loci from three Pemphigus vulgaris patients JOURNAL Hum. Immunol. 22, 61-69 (1988) STANDARD simple staff_review FEATURES from to/span description pept / 26 > 292 HLA-DR-beta, exon 2 (AA at 26) BASE COUNT 64 a 74 c 101 g 53 t ORIGIN 1 ccggatcctt cgtgtcccca gaccacgttt cttggagcag gttaaacatg agtgtcattt 61 cttcaacggg acggagcggg tgcggttcct ggacagatac ttctatcacc aagaggagta 121 cgtgcgcttc gacagcgacg tgggggagta ccgggcggtg acggagctgg ggcggcctga 181 tgccgagtac tggaacagcc agaaggacat cctggaagac gagcgggccg cggtggacac 241 ctactgcaga cacaactacg gggttgtgga gagcttcaca gtgcagcggc ga // LOCUS MUSC3B 647 bp ss-mRNA ROD 28-JUL-1990 DEFINITION Mouse complement component 3 (C3) mRNA, partial cds. ACCESSION M35659 KEYWORDS complement component 3. SOURCE Mouse liver, cDNA to mRNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 647) AUTHORS Fey,G., Domdey,H., Wiebauer,K., Whitehead,A.S. and Odink,K. TITLE Structure and expression of the C3 gene JOURNAL Springer Semin. Immunopathol. 6, 119-147 (1983) STANDARD simple staff_review FEATURES from to/span description pept < 1 > 647 complement 3 (AA at 1) BASE COUNT 170 a 171 c 168 g 138 t ORIGIN 1 atccccatgt attccatcat tactcccaat gtcctacggc tggagagcga agagaccatc 61 gtactggagg cccacgatgc tcagggtgac atcccagtca cagtcactgt gcaagacttc 121 ctaaagaggc aagtgctgac cagtgagaag acagtgttga caggagccag tggacatctg 181 agaagcgtct ccatcaagat tccagccagt aaggaattca actcagataa ggaggggcac 241 aagtacgtga cagtggtggc aaacttcggg gaaacggtgg tggagaaagc agtgatggta 301 agcttccaga gtgggtacct cttcatccag acagaccaga ccatctacac ccccggctcc 361 actgtcttat atcggatctt cactgtggac aacaacctac tgcccgtggg caagacagtc 421 gtcatcctca ttgagacccc cgatggcatt cctgtcaaga gagacattct gtcttccaac 481 aaccaacacg gcatcttgcc tttgtcttgg aacattcctg aactggtcaa catggggcag 541 tggaagatcc gagcctttta cgaacatgcg ccgaagcaga tcttctccgc agagtttgag 601 gtgaaggaat acgtgctgcc cagttttgag gtccgggtgg agcccac // LOCUS P30LTA 777 bp ds-DNA BCT 28-JUL-1990 DEFINITION Plasmid P307 (from E.coli) heat-labile enterotoxin subunit A (LTA) gene, complete cds. ACCESSION M35581 KEYWORDS enterotoxin. SOURCE Plasmid P307 (from Escherichia coli) DNA, clone pAT153. ORGANISM Plasmid P307 Unclassified. REFERENCE 1 (bases 1 to 777) AUTHORS Dykes,C.W., Halliday,I.J., Hobden,A.N., Read,M.J. and Harford,S. TITLE A comparison of the nucleotide sequence of the A subunit of heat- labile enterotoxin and cholera toxin JOURNAL FEMS Microbiol. Lett. 26, 171-174 (1985) STANDARD simple staff_review FEATURES from to/span description pept 1 777 heat-labile enterotoxin subunit A (LTA) BASE COUNT 255 a 136 c 164 g 222 t ORIGIN 1 atgaaaaata taactttcat tttttttatt ttattagcat cgccattata tgcaaatggc 61 gacagattat accgtgctga ctctagaccc ccagatgaaa taaaacgttc cggaggtctt 121 atgcccagag ggcataatga gtacttcgat agaggaactc aaatgaatat taatctttat 181 gatcacgcga gaggaacaca aaccggcttt gtcagatatg atgacggata tgtttccact 241 tctcttagtt tgagaagtgc tcacttagca ggacagtcta tattatcagg atattccact 301 tactatatat atgttatagc gacagcacca aatatgttta atgttaatga tgtattaggc 361 gtatacagcc ctcacccata tgaacaggag gtttctgcgt taggtggaat accatattct 421 cagatatatg gatggtatcg tgttaatttt ggtgtgattg atgaacgatt acatcgtaac 481 agggaatata gagaccggta ttacagaaat ctgaatatag ctccggcaga ggatggttac 541 agattagcag gtttcccacc ggatcaccaa gcttggagag aagaaccctg gattcatcat 601 gcaccacaag gttgtggaaa ttcatcaaga acaatcacag gtgatacttg taatgaggag 661 acccagaatc tgagcacaat atatctcagg gaatatcaat caaaagttaa gaggcagata 721 ttttcagact atcagtcaga ggttgacata tataacagaa ttcgggatga attatga // LOCUS PIGFSHB 929 bp ss-mRNA MAM 28-JUL-1990 DEFINITION Pig follicle stimulating hormone (FSH) beta-subunit mRNA, 3" end. ACCESSION M35676 KEYWORDS follicle stimulating hormone. SOURCE Pig anterior pituitary, cDNA to mRNA. ORGANISM Sus scrofa Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Artiodactyla; Suiformes; Suidae. REFERENCE 1 (bases 1 to 929) AUTHORS Kato,Y. TITLE Cloning and DNA sequence analysis of the cDNA for the precursor of porcine follicle stimulating hormone (FSH) beta-subunit JOURNAL Mol. Cell Endocrinol. 55, 107-112 (1988) STANDARD simple staff_review FEATURES from to/span description pept < 1 348 follicle stimulating hormone beta-subunit (AA at 1) BASE COUNT 256 a 230 c 207 g 236 t ORIGIN 1 gccatctgct gcaatagctg tgagctgacc aacatcacca tcacagtgga gaaagaggag 61 tgtaacttct gcataagcat caacaccacg tggtgtgctg gctattgcta cacccgggac 121 ctggtataca aggacccagc caggcccaac atccagaaaa catgtacctt caaggagctg 181 gtgtacgaga ccgtgaaagt acctggctgt gctcaccatg cagactccct gtatacgtat 241 ccagtagcca ctgaatgtca ctgtggcaag tgtgacagtg acagtactga ctgcaccgtg 301 agaggcctgg ggcccagcta ctgctccttc agtgaaatga aagaataaag agcagtggac 361 atttcatgct tcctaccctt gtctgaagga ccaagacgtc caagaagttt gtgtgtacat 421 gtgcccaggc tgcaaaccac tatgagagac cccactgatc cctgctgtcc tgtggaggag 481 gagctccagg aatgcagagt gctagggcct cagtcccatc accactcaac cctgtatttt 541 gggtctggtt ccataagttt tattcggtct ttttttttaa attactcaat gaattttatt 601 acatttataa ttgtacaatg atcatcacaa cccaatttta taggatttcc atcccaaacc 661 cccagcatag acccccatct cccaatctgt ctcatttgga aaccataagt ttttcaaagt 721 ccgtgagtca gtatctactc agtcttatta ccttaaagac atgtgggtgt tttctgttta 781 ataatcttag aaatcctctc aagacaggga tatggaccca gaggaaggaa atgggctaag 841 aatgggtgaa aggactaaat gcagcattct cccactagac acagcagcct acaagagcag 901 ggccagtctc tttgtcatga gtgtggccg //