Path: utzoo!attcan!uunet!jarthur!usc!snorkelwacker!bionet!root From: GenBank-Updates@genbank.bio.net Newsgroups: bionet.molbio.genbank.updates Subject: Database Update Message-ID: Date: 17 Jul 90 12:01:37 GMT Sender: root@genbank.BIO.NET Distribution: bionet Lines: 14214 Approved: lear@genbank.bio.net Checksum: 05670 857 LOCUS BPEP69OMP 3000 bp ds-DNA BCT 17-JUL-1990 DEFINITION B.pertussis omp69A gene for P.69 outer membrane protein, complete cds. ACCESSION J04560 KEYWORDS membrane protein; outer membrane protein. SOURCE B.pertussis (CN2992) DNA. ORGANISM Bordetella pertussis Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Aerobic rods and cocci; Alcaligenaceae. REFERENCE 1 (bases 1 to 3000) AUTHORS Charles,I.G., Dougan,G., Pickard,D., Chatfield,S., Smith,M., Novotny,P., Morrissey,P. and Fairweather,N.F. TITLE Molecular cloning and characterization of protective outer membrane protein P.69 from Bordetella pertussis JOURNAL Proc. Natl. Acad. Sci. U.S.A. 86, 3554-3558 (1989) STANDARD full staff_review COMMENT Draft entry and computer-readable copy of sequence [1] kindly submitted by N.F.Fairweather 26-JUL-1989. FEATURES from to/span description pept 145 2877 P.69 outer membrane protein precursor sigp 145 246 P.69 outer membrane protein signal peptide matp 247 2277 P.69 outer membrane protein BASE COUNT 453 a 1019 c 1096 g 432 t ORIGIN 2 bp upstream of ClaI site. 1 atcgatgata cgtcgctgta acacgacaaa tagcgtgcgt tgcagcggtt ctggatggcg 61 ttattcgtac ttttgctgcg cccattcttc cctgttccat cgcggtgcgg gcatggcggg 121 cgtctgctct ccacctggca tccaatgaac atgtctctgt cacgcattgt caaggcggcg 181 cccctgcgcc gcaccacgct ggccatggcg ctgggcgcgc tgggcgccgc cccggcggcg 241 catgccgact ggaacaacca gtccatcgtc aagaccggtg agcgccagca tggcatccat 301 atccagggct ccgacccggg cggcgtacgg accgccagcg gaaccaccat caaggtaagc 361 ggccgtcagg cccagggcat cctgctagaa aatcccgcgg ccgagctgca gttccggaac 421 ggcagtgtca cgtcgtcggg acagttgtcc gacgatggca tccggcgctt tctgggcacc 481 gtcaccgtca aggccggcaa gctggtcgcc gatcacgcca cgctggccaa cgttggcgac 541 acctgggacg acgacggcat cgcgctctat gtggccggcg aacaggccca ggccagcatc 601 gccgacagca ccctgcaggg cgctggcggc gtgcagatcg agcgcggcgc caatgtcacg 661 gtccaacgca gcgccatcgt cgacgggggc ttgcatatcg gcgccctgca gtcattgcag 721 ccggaagacc ttccgcccag ccgggtggtg ctgcgcgaca ccaacgtgac cgccgtgccc 781 gccagcggcg cgcccgcggc ggtgtctgtg ttgggggcca gtgagcttac gctcgacggc 841 gggcacatca ccggcgggcg ggcagcgggg gtggcggcca tgcaaggggc ggtcgtgcat 901 ctgcagcgcg cgacgatacg gcgcggggac gcgcttgccg gcggtgcggt tcccggcggt 961 gcggttcccg gtggtgcggt tcccggcggc ttcggtcccg gcggcttcgg tcccgtcctc 1021 gacggctggt atggcgtgga cgtatcgggc tccagcgtgg agctcgccca gtcgatcgtc 1081 gaggcgccgg agctgggcgc cgcaatccgg gtgggccgcg gcgccagggt gacggtgccg 1141 ggcggcagct tgtccgcacc gcacggcaat gtcatcgaga ccggcggcgc gcgtcgcttt 1201 gcgcctcaag ccgcgcccct gtcgatcacc ttgcaggccg gcgcgcatgc ccaggggaaa 1261 gcgctgctgt accgggtcct gccggagccc gtgaagctga cgctgaccgg gggcgccgat 1321 gcgcagggcg acatcgtcgc gacggagctg ccctccattc ccggcacgtc gatcgggccg 1381 ctcgacgtgg cgctggccag ccaggcccga tggacgggcg ctacccgcgc ggtcgactcg 1441 ctgtccatcg acaacgccac ctgggtcatg acggacaact cgaacgtcgg tgcgctacgg 1501 ctggccagcg acggcagcgt cgatttccag cagccggccg aagctgggcg gttcaaggtc 1561 ctgacggtca atacgctggc gggttcgggg ctgttccgca tgaatgtctt cgcggacctg 1621 gggctgagcg acaagctggt cgtcatgcag gacgccagcg gccagcacag gctgtgggtc 1681 cgcaacagcg gcagcgagcc ggccagcgcc aacaccctgc tgctggtgca gacgccacta 1741 ggcagcgcgg cgacctttac ccttgccaac aaggacggca aggtcgatat cggtacctat 1801 cgctatcgat tggccgccaa cggcaatggg cagtggagcc tggtgggcgc gaaggcgccg 1861 ccggcgccca agcccgcgcc gcagccgggt ccccagccgc cgcagccgcc gcagccgcag 1921 ccggaagcgc cggcgccgca accgccggcg ggcagggagt tgtccgccgc cgccaacgcg 1981 gcggtcaaca cgggtggggt gggcctggcc agcacgctct ggtacgccga aagcaatgcg 2041 ttgtccaagc gcctgggcga gttgcgcctg aatccggacg ccggcggcgc ctggggccgc 2101 ggcttcgcgc aacgccagca gctggacaac cgcgccgggc ggcgcttcga ccagaaggtg 2161 gccggcttcg agctgggcgc cgaccacgcg gtggcggtgg ccggcggacg ctggcacctg 2221 ggcgggctgg ccggctatac gcgcggcgac cgcggcttca ccggcgacgg cggcggccac 2281 accgacagcg tgcatgtcgg gggctatgcc acatatatcg ccgacagcgg tttctacctg 2341 gacgcgacgc tgcgcgccag ccgcctggag aatgacttca aggtggcggg cagcgacggg 2401 tacgcggtca agggcaagta ccgcacccat ggggtgggcg cctcgctcga ggcgggccgg 2461 cgctttaccc atgccgacgg ctggttcctc gagccgcagg ccgagctggc ggtattccgg 2521 gccggcggcg gtgcgtaccg cgcggccaac ggcctgcggg tgcgcgacga aggcggcagc 2581 tcggtgctgg gtcgcctggg cctggaggtc ggcaagcgca tcgaactggc aggcggcagg 2641 caggtgcagc catacatcaa ggccagcgtg ctgcaggagt tcgacggcgc gggtacggta 2701 cacaccaacg gcatcgcgca ccgcaccgaa ctgcgcggca cgcgcgccga actgggcctg 2761 ggcatggccg ccgcgctggg ccgcggccac agcctgtatg cctcgtacga gtactccaag 2821 ggcccgaagc tggccatgcc gtggaccttc cacgcgggct accggtacag ctggtaaagc 2881 gaggagggtc tatcccccgc ggaggagttt ttcctggagc ttggccggtg ccagtctcca 2941 ggctcaggcg gccagggcct gcgggccggg caggccgcgc tggtgctggc cgaaccattg // LOCUS ATHTRPB 4225 bp ds-DNA PLN 17-JUL-1990 DEFINITION A.thaliana tryptophan synthase beta subunit (trpB) gene, complete cds. ACCESSION M23872 KEYWORDS tryptophan synthase. SOURCE A.thaliana (strain Landsberg erecta) DNA, clone lambda-M1-g, and cDNA to mRNA (strain Columbia), clone lambda-CD-2. ORGANISM Arabidopsis thaliana Eukaryota; Plantae; Embryobionta; Magnoliophyta; Magnoliopsida; Dilleniidae; Capparales; Brassicaceae. REFERENCE 1 (bases 1 to 882) AUTHORS Berlyn,M.B. JOURNAL Unpublished (1989) Yale Univ, 165 Prospect, New Haven CT, 06511. STANDARD full staff_review REFERENCE 2 (bases 883 to 4225) AUTHORS Berlyn,M.B., Last,R.L. and Fink,G.R. TITLE A gene encoding the tryptophan synthase beta subunit of Arabidopsis thaliana JOURNAL Proc. Natl. Acad. Sci. U.S.A. 86, 4604-4608 (1989) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly provided by M.B.Berlyn, 18-APR-1989. FEATURES from to/span description pept 1517 1858 tryptophan synthase beta subunit, exon 1 (EC 4.2.1.20) 2109 2465 tryptophan synthase beta subunit, exon 2 2549 2991 tryptophan synthase beta subunit, exon 3 3084 3178 tryptophan synthase beta subunit, exon 4 3275 3450 tryptophan synthase beta subunit, exon 5 IVS 1859 2108 tryptophan synthase intron A IVS 2466 2548 tryptophan synthase intron B IVS 2992 3083 tryptophan synthase intron C IVS 3179 3274 tryptophan synthase intron D site 3643 3645 poly-A addition site BASE COUNT 1252 a 733 c 817 g 1423 t ORIGIN 1 gaattctttc atatctcctg caaagttctt gatatcaata ctccagcagt aactaagact 61 tagactcttg agcgtaggag agtttgataa caaagactcg gcctctgtga gcttgatcca 121 accaatagag agctttctag gcaatcccga gtttttgaac ttggagggat caagcccaca 181 cgcgtaaatc tttagtgatt cgagatttgt gtttaaaatc cgaattaaaa cctaatcaaa 241 ttaaaactaa accaaaccaa atacaatcca aaattaaact aattttggtt gagtttggtt 301 atagttttac taaatccaaa ttaacagaac ataaccaaac ccgaagattt ttagagtctt 361 tagaatttta aggtgatttt agataaaaga gattaaacac aaatctcgaa aactaaagaa 421 agagtttttg aaaattttta agtgttttca tgtaaagtgg atttctctgt gttttctgca 481 ttctgcggat tataactcct atgttttttt tctccgtcaa ttatatgtgt ttattttctc 541 tattttcttt tatttttatt tttattctct atattagggt ttagtttatg aaaacttttt 601 gttatctata taggcttggg ggatgtattt aaattagaat ttaaagtgat ttgagttctt 661 tgagttttta aataatttta acgattttaa aaaagttcgt atgatttttg taaaatctat 721 taaaatctca ccttaaatca tgggatttgg atttctgtat tttgaactaa gaaaatcctc 781 tcaaatcctc caaaatcatt aaaattcaaa tccacaaatt gttctgaata acagtgaatt 841 ttaaggtgga ttttgaaata attagttcaa taacactgaa tttcatgaga ttttttaaaa 901 tacatgtttg aataacatat gatttataaa ttctacacaa atcttttaaa attctaattt 961 caatacattg tttttgaaag tgttattgac tcttgccaat atagtatccc aattcccaac 1021 ttgtgtttca ttttttcatc tatctaataa acaattagat gaacacaaaa aaatattggt 1081 aggtgatggc tcaattggat atgtttttga aaaccatgtg ttaaaaactt aaaatactat 1141 ccaacttacc ccagtcctac caactttttt tttcttctct tggtctgctt acatgtgtct 1201 gcttatatct ccaaaaggaa atagatatat aaaaattcaa atttaaatat ttgcgatttg 1261 ttaaatttta atcaatattt aatttttgtt tttttttgtt tttttttatg aagacaacaa 1321 ataaccaaat ttatcaaatc tgatcaaagc agatttagga ttttacaaat atattttttt 1381 aatatgaatt ttgtggtcag attttgacca attctctttg aaaaaaaaaa aaatctatct 1441 ataaaaacat gtgttacttt gaaaggatat ttcaaggaga agaatatatt tgactcagag 1501 agagattcat tcagtaatgg cagcctcagg cacctctgct actttcagag cctccgtttc 1561 ttcagctcct tcctcttctt cccaattgac ccatttgaaa tcacccttca aagctgtcaa 1621 atatacgcct ctgccatcgt ctcgctccaa gtcatcatcc ttctccgtct cctgcaccat 1681 cgccaaggac ccgcctgttc tcatggccgc cggatctgac ccggccctgt ggcaacgacc 1741 cgattcgttc ggtcggtttg ggaagtttgg tgggaagtat gtacctgaaa cccttatgca 1801 cgctctatct gagcttgaat ccgctttcta tgctcttgcc accgacgatg atttccaggt 1861 agagaccttg ttcttcgtga ttagataaaa atgagacatt ttgcgttgtg ggtcgttgga 1921 attgtagcgt tatcatttgc gggttgtgat aaagttcgga tcttgttgat gggatatgtt 1981 gattctaggg catttgaatg aaattggttc tgttcattta tgaaatggtt agaaatatgg 2041 ttttggttcc tctggttcaa tgtctcattc atgtcctttg tgtatttgtg tgaatgaatc 2101 tttgttagag agagttggct ggaatcttga aggactatgt gggtagagaa agtcctctgt 2161 attttgcaga gaggcttacg gagcattaca ggcgcgagaa tggcgaaggg cctcttatat 2221 acttgaagag agaagacttg aatcacacag gagctcacaa gattaacaac gctgtggctc 2281 aggctcttct tgctaagcgg ttggggaaga agaggattat tgctgagaca ggagcgggtc 2341 aacatggtgt ggctacagct actgtgtgtg cccgttttgg tttggagtgt attatctata 2401 tgggtgctca agatatggag agacaagcac tcaatgtgtt cagaatgcga cttcttggtg 2461 ccgaggtgat gcttaggatt catttccatc tcttttaaat gtgataaact ttgttactga 2521 accttttttt gggtgtctgg tgctataggt gagaggagtc cactctggaa cagcgacatt 2581 gaaggatgcg acatctgaag cgataagaga ttgggtgaca aatgtggaga ctacccatta 2641 catattggga tctgtggcgg gtcctcatcc ttaccccatg atggtcagag actttcacgc 2701 tgtgattggt aaagaaacaa ggaaacaagc gttggagaaa tggggcggga agccggatgt 2761 cttggtggct tgtgtcggtg gtggttcaaa cgctatggga ctcttccatg aatttgtgaa 2821 tgacacagag gtccggatga tcggtgtgga agcagcggga ttcggattgg acagtggcaa 2881 acacgctgct acattgacaa agggagatgt tggtgtcctc catggagcta tgagttactt 2941 gctgcaagat gatgatggac aaatcattga accacactcc atcagtgcag ggtaagtgaa 3001 attctctatt cagatccggt aaccataata agtctactca gttagccacg gctgcatgaa 3061 catggtttga ttttctctta cagattggac taccctggag tcggacccga gcacagtttc 3121 tttaaagaca tgggacgggc tgaatactat agcataaccg acgaagaagc gttggaaggt 3181 aatgtaccga tgcatagtct ttagttctgt atttgaaaat ctgttctttg tttcataact 3241 tacctttgca ttccctttgg tttgtggtct gtagcgttca agagagtgtc gcggttagag 3301 ggaatcattc cagcactgga gacctcacac gcactagctt acctcgagaa gctatgtccc 3361 acattatcag acgggacgag agtggtcttg aacttcagcg gaagaggaga taaagatgtt 3421 cagacagtgg ctaaatatct tgatgtttga agagagaaac acttatgttt gtttttcagt 3481 tattcagaca gaatgaaatc ctttttgtac ttgtaatctg tttgtacttt acttctctct 3541 ggtcagtacg tttccaacaa aattggaaat agaaaaagat tacgaagaga acgaaatttg 3601 aataaatgta ttatcaataa aataaaaaac atttcatatt ggaaatagtc ttatgttcct 3661 caatctgctt tctttttcgt tttcaaaacg aaatcctact tttgcagata caaacatgag 3721 atatgatctc agaactctaa tagcttattt atctcaaatt tcatacgaga tgttacattg 3781 gaaagaggtc caaattgcta gaagacatat atattcgcat atagtatacc aatctgccaa 3841 agttatgttc cccttgtgaa ttgtgatcct taagatttaa gaaaaaatag aagtggtgac 3901 tttgatgtat gaaaaataat aatataagca aagttgcaaa atccaataga gaaagaatct 3961 tcttttagta tatccttcac caaacatatc tctctgcctc ttctgaaatc ccaagagaca 4021 tgtctctctc tctcttttcc aagtccgaac caaatctaat gtatcaacat tacttcttct 4081 gaagtcccaa gagacattgc tctctctctt ttccaagtct gaaccaaatc taatgcatca 4141 acattacttt tgtaataaaa ttcaataagg attagctgat tactgttgtc actctttggt 4201 ttcttttcct ttgttaatag aattc // LOCUS HS5E1P 2590 bp ds-DNA VRL 17-JUL-1990 DEFINITION Murine cytomegalovirus early transcription unit e1 protein gene, complete cds. ACCESSION M35146 KEYWORDS . SOURCE Murine cytomegalovirus (salivary strand strain Smith; ATCC VR-194) DNA, passed in mouse (BALB/c) embryonal fibroblast cell line B25. ORGANISM Murine cytomegalovirus Viridae; ds-DNA enveloped viruses; Herpesviridae; Betaherpesvirinae. REFERENCE 1 (bases 1 to 2590) AUTHORS Buehler,B., Keil,G.M., Weiland,F. and Koszinowski,U.H. TITLE Characterization of the murine cytomegalovirus early transcription unit e1 that is induced by immediate-early proteins JOURNAL J. Virol. 64, 1907-1919 (1990) STANDARD simple staff_review FEATURES from to/span description pept 163 955 early transcription unit e1, exon 1 1049 1225 early transcription unit e1, exon 2 1552 1574 early transcription unit e1, exon 3 pre-msg 44 2548 e1 mRNA and introns IVS 956 1048 e1 intron A IVS 1226 1551 e1 intron B signal 16 20 TATA signal signal 2548 2553 Poly-A signal BASE COUNT 564 a 732 c 751 g 543 t ORIGIN 1 gcgtgatgca gactttataa atcgcaagcc ggagcggatc ctaggctcat ttcttctgcg 61 atcttcgcat ttagcagacc aaatgctgat agttcctgcg tcgcggtaga ttacgtgccc 121 acttttctcg tcgcgaccgg tgaaaagacc ttcgttcgga ccatggccgc gccagatcga 181 cgcggatcgc ccatcgtttc gagacgacgt tacttcacct tccggaactc gagtcggacg 241 ctgcatcaga atgtcacccg catgttcgac ctccgtcagt ggacctacga gagcgcccgc 301 gttttggact gcacgaacgg cgagggccgc accgagaact gggggcccgg ctggctgtgc 361 gcgaccatca tgcagtctcc ggacagcggc tcgtcgtccg gatcggcggc ccagggatgc 421 atgtctctgg acatcacctc ggacgacgga ccggagaagg tcatgaacat gttccaccgc 481 gggagcatcg tctgtaacaa gaccgtctct tcggtcgcga cgccgccggg gtccgacggc 541 acgggcgcca gcctcttgac tctggtggcg gacggcagct tgctgcaagt catcctggtc 601 gaacactcgc cgggcgccca ccgcgccgaa gccgacgatg tcgccgagag cgtcggcaac 661 gccgcggtgg ccgccgccgt gcagtcggcc acttcgatgg gtagctcgag ctccgtcggc 721 ggcgccggtg gcggcggctt caattcttcg gaagaacgtc gccgcgataa agactcgtcc 781 aaatctcacg atgacgaccg acgtaagaga ttcgaactct gcaacggtac ccttcatagg 841 gagcggacct cgggtagctc cggatcgagc ggcggcggcg gcggcagcgg cagcggaggc 901 agcagcggcc tcagtacaaa gcagaaggag cgcagcagga ggctcgaaga ggaatgtaag 961 tagttctgat cggggtttta ctggtcgctt tccccctcga ccttcggtga accgtcacgg 1021 gactaatgtt gttactctgt tcggaaaggt tctccacgca gcggggggga gcccaagcgc 1081 cagaagaccc accacgatcc gagaccggag agggatctgg aaccgcctag gagcagtacc 1141 actgtcgatg gtaatgcaga cgaagctgga gcggtctcct ttttgaacag ttatgctgct 1201 agttcgctgt ccgccgtcag cgatggtgcg ttaccaccct cctctccggt gactatagag 1261 acaccaaggt ttaatgactt gatagaatcc gaggaggaag acgatggaac tcggtgttcg 1321 tctgccgatg gtacggcgac ggtaaccgag accggggcgt ctgccgagat cgagcaaacg 1381 ttcaccgacg gtgggctagc atgcatgagg tccttaattg actcgcgggc cctcatggat 1441 ctcgccttta cgagccgcat gacggcggtg aagtcgggat ctgagggagc gggcaacacc 1501 tgtcctgggg aacagaggct ttctaactgc aatcttcctc cgaattcaca ggaggacaac 1561 aacccgcgtt ttgaaggcgt ctttaattcc ctgatgagac tcttagacga gtataaagac 1621 aagagcacca gctcgagttc cgccgccgta ccgggatcca gttgtacagg caccacgcag 1681 acagagggga tgcctcacca caggggtatc ttcgacaggg acacatccgc cgctcagcag 1741 aaacctgtat gtgagatacg gcccttcatc gacatagctc gctgtgtcga gccggcggcg 1801 gcgccgaccg cttcgcgctc cagagccacc acggccgccg ctcgcaggcg gggaagaggc 1861 tcgcagcctc ggaggaacag tcgggtcgcg gcgcgcgtcg cccaggtcga tccgaccgac 1921 ccacccgcag ctgctgctgc cgccgccgcc ggtcccacca cgcggggaag gagcaggagt 1981 aggaggggaa gacgaggccc agacctcacg gaggacgggt tggagattgt cgagacgggc 2041 gccaccgccg gcactctggc cgtttccgag gaggaaaccg cgatggccgc cgctatgtta 2101 gaggatatgg tcgacttgga caatgtgttc gatgatctta attgatctca tgaaagatgg 2161 ataaagaaat gatgctctgt aaactctaac tgactgttgt tatcgatcgc ctcgtctcgc 2221 tcgggtcttc tgccgatgct gccgtctgta agcacgcaag caagcactcg ctctatagca 2281 gtagaatatg tatggatgat atacttactg ttgttgttca cttactatta tcacctatat 2341 aaaacttgta ttcatgatac ttaccactga aaaaattgtg tatgtctttt tatacttttg 2401 accaccgtta ttgatgataa gtatactctt ctaactgtgt gtgtctctgt ctaccgggcc 2461 gatggagata ttatccctgg tccccctctg tctgtgctaa caacataaca tccatgtaaa 2521 taaaattaat atttttaata ttttatcaat aaaaaccaca catttgttac aataaaccgc 2581 gctcgccttt // LOCUS HSE14GP 3351 bp ds-DNA VRL 17-JUL-1990 DEFINITION Equine herpesvirus type 1 glycoprotein 14 (gp14) gene, complete cds. ACCESSION M35145 KEYWORDS glycoprotein 14. SOURCE Equine herpesvirus type 1 DNA. ORGANISM Equine herpesvirus type 1 Viridae; ds-DNA enveloped viruses; Herpesviridae; Alphaherpesvirinae. REFERENCE 1 (bases 1 to 3351) AUTHORS Guo,P., Goebel,S., Perkus,M.E., Taylor,J., Norton,E., Allen,G., Languet,B., Desmettre,P. and Paoletti,E. TITLE Coexpression by vaccinia virus recombinants of equine herpesvirus 1 glycoproteins gp13 and gp14 results in potentiated immunity JOURNAL J. Virol. 64, 2399-2406 (1990) STANDARD simple staff_review FEATURES from to/span description pept 300 3242 glycoprotein 14 (gp14) precursor sigp 300 554 glycoprotein 14 (gp14) signal peptide matp 555 3239 glycoprotein 14 (gp14) signal 3251 3256 Poly-A signal BASE COUNT 882 a 892 c 858 g 719 t ORIGIN 1 aacgttgggt tgttaccgca tctcaaggag gaactagctc ggtttatgat tactgcggct 61 aaaggtaatt ggtcaattag cgagtttcaa aggttttatt gctttgaggg agtgacaggt 121 gtgacggcca cgcagcggct ggcgtggaaa tatatcgggg agctcatcct agccgccgca 181 gtattctcct cggttttcca ctgtggagag gtgcgcctcc tgcgcgcaga tcgtacctac 241 ccggactcca gcggcgcaca gcgctgcgtg agcggcattt acataaccta cgaggcgtca 301 tgtcctctgg ttgccgttct gtcggcggct ccacatgggg caattggcgc ggagacggtg 361 gtgatttacg acagcgacgt gttctctctc ctgtatgcag tgctccagca gctggctcct 421 ggatcgggag ccaactaggc aatgttggaa acttactcgc caccccccac ccgctgggaa 481 agccggcatc atcgagggtg ggcacaatag ttctagcctg tttgttgctt tttggaagct 541 gtgttgttag agccgtaccc accacgccaa gccccccaac tagtactccc acttccatgt 601 caacgcactc ccatgggaca gtagacccta cgctgctccc cacagaaacg cccgacccac 661 tcagactggc tgtgcgcgag tccggtatac tcgctgagga tggagacttt tacacctgcc 721 caccgcctac cggatccacc gtcgtacgca tcgaaccacc tagaacttgc cccaagtttg 781 accttgggag aaacttcacg gaggggattg ctgttatttt taaggaaaac atcgctccct 841 acaaattcag ggcaaacgta tactacaagg acatcgttgt aacacgtgtg tggaaaggat 901 acagccatac gtccctgtcc gacagataca atgacagggt tccggtttcg gtggaggaga 961 tcttcggtct catcgacagt aagggaaaat gttcgtcaaa ggccgagtac ctcagagata 1021 acatcatgca ccacgcgtac cacgacgacg aggacgaggt ggagcttgat ttggtgccgt 1081 ccaagtttgc aactccgggg gccagagcct ggcagaccac caacgatact acgtcttacg 1141 tggggtggat gccatggagg cactacacgt caacgtctgt caactgcatc gtcgaggagg 1201 tggaggcgcg gtccgtctac ccctacgact ccttcgccct gtccaccggt gatattgtgt 1261 acgcgtctcc gttttacggc ctgagggctg ccgctcgcat agagcacaat agctacgcgc 1321 aggagcgttt caggcaagtt gaagggtaca ggccccgcga cttagacagt aaactacaag 1381 ccgaagagcc ggttaccaaa aattttatca ctaccccgca tgtcaccgtc agctggaact 1441 ggaccgagaa gaaagtcgag gcgtgtacgc tgaccaaatg gaaagaggtc gacgaactcg 1501 tcagggacga gttccgcggg tcctacagat ttactattcg atccatctcg tctacgttta 1561 tcagtaacac tactcaattt aagttggaaa gtgcccccct tactgaatgt gtatccaaag 1621 aagcaaagga agccatagac tcgatataca aaaagcagta cgagtctacg cacgtcttta 1681 gcggtgatgt ggaatattac ctggcacgcg gggggttctt aattgcattc agacctatgc 1741 tctccaacga actcgccagg ctgtacctga acgagcttgt gagatctaac cgcacctacg 1801 acctaaaaaa tctattgaac cccaatgcaa acaataacaa taacaccacg cgaagacgca 1861 ggtctctcct gtcagtacca gaacctcagc caacccaaga tggtgtgcat agagaacaaa 1921 ttctacatcg cttgcacaaa cgagcagtgg aggcaacggc aggtaccgat tcttccaacg 1981 tcaccgccaa acagctggag ctcatcaaaa ccacgtcgtc tatcgagttt gccatgctac 2041 agtttgcata cgatcacatc caatcccacg tcaatgaaat gctaagtaga atagcaactg 2101 cgtggtgtac cctccaaaac aaagagcgga ccctatggaa cgaaatggtg aagattaacc 2161 cgagcgccat agtctccgca acccttgacg agcgagttgc agcgagggtc ctgggggacg 2221 tgatagctat aacgcactgc gccaaaatag agggcaacgt gtacttgcaa aactccatgc 2281 gctcgatgga cagtaacacg tgctactccc gcccccccgt aacatttaca attactaaga 2341 atgcaaacaa cagagggtcg atagaaggcc agctgggaga ggagaacgag attttcacgg 2401 agcgcaagct gatcgagccg tgcgccctca atcagaagcg ctactttaag tttggcaaag 2461 agtacgttta ctacgagaac tacacgttcg tccgcaaagt gccccccacg gaaatcgagg 2521 ttatcagcac gtacgttgaa ctaaacttga cccttttgga agaccgcgag tttctgcccc 2581 tggaggtgta cacgcgggct gagctggagg acaccggcct gctagactac agcgaaatac 2641 agcgccgcaa ccagctccac gctctcaggt tttacgacat cgacagcgtg gtcaacgtgg 2701 acaataccgc agtgattatg caggggatcg ccagcttttt caagggcctg ggtaaagtgg 2761 gggaggccgt gggaacgctc gttctcggcg ccgccggcgc tgttgtttca accgtatctg 2821 gaatagcttc gtttttaaac aacccatttg gggggctagc catcggcctg ctggtaatcg 2881 ccggcctggt agctgcgttt tttgcttaca gatatgtaat gcagatccgc agtaacccca 2941 tgaaagctct ataccccata acaacaaagg ccttgaaaaa caaagccaaa acttcctacg 3001 gccagaacga ggaggacgat gggagcgact ttgatgaggc caagcttgaa gaggctcgcg 3061 aaatgatcaa atacatgtct atggtttcgg ccctggaaaa gcaggaaaag aaagctataa 3121 agaaaaacag tggggttggc ctgatcgcca gtaacgtctc aaagctggcc ctgcgaaggc 3181 gcggtcccaa atatacccga ctccaacaga acgataccat ggaaaatgaa aaaatggttt 3241 aaacatgttt aataaatatt atgacacgta ctcaaagtgt gacctcatat ttgcataacc 3301 actttctagt tccggcccca aggatattta agcctagtat ctccgccgag g // LOCUS MLV3ENV 232 bp ds-DNA VRL 17-JUL-1990 DEFINITION Murine leukemia virus p15-gag protein gene, complete cds. ACCESSION M35144 KEYWORDS p15-gag protein. SOURCE Murine leukemia virus (strain Emv-3) DNA. ORGANISM Murine leukemia virus Viridae; ss-RNA enveloped viruses; Positive strand RNA virus; Retroviridae; Oncovirinae; Type C oncovirus group; Mammalian type C oncoviruses; Murine leukemia viruses. REFERENCE 1 (bases 1 to 232) AUTHORS Mercer,J.A., Lee,K.H., Nexo,B.A., Jenkins,N.A. and Copeland,N.G. TITLE Mechanism of chemical activation of expression of the endogenous ecotropic murine leukemia provirus Emv-3 JOURNAL J. Virol. 64, 2245-2249 (1990) STANDARD simple staff_review FEATURES from to/span description pept 17 > 232 p15-gag protein BASE COUNT 51 a 57 c 53 g 50 t 21 others ORIGIN 1 gaccgcttct aaaaacatgg gaccgaccgt aaccacccct ctgagtctga ccctagaaca 61 ctgggaagat gtccagcgca tcgcgtccaa tcagtccgta gatgtcaaga agagacgctg 121 ggtcaccttc tgctctgccg agtggccaac tttcggtgta gggtggccac aagatggtac 181 ttttaatnnn nnnnnnnnnn nnnnnnnnaa atctaaggtg ttctctcctg gt // LOCUS CTRTPDC 1740 bp ss-mRNA PLN 17-JUL-1990 DEFINITION C.roseus tryptophan decarboxylase mRNA, complete cds. ACCESSION M25151 J04521 KEYWORDS decarboxylase; tryptophan decarboxylase. SOURCE C.roseus leaf, cDNA to mRNA. ORGANISM Catharanthus roseus Eukaryota; Plantae; Embryobionta; Magnoliophyta; Magnoliopsida; Asteridae; Gentianales; Apocynaceae. REFERENCE 1 (bases 1 to 1740) AUTHORS De Luca,V., Marineau,C. and Brisson,N. TITLE Molecular cloning and analysis of cDNA encoding a plant tryptophan decarboxylase: Comparison with animal dopa decarboxylases JOURNAL Proc. Natl. Acad. Sci. U.S.A. 86, 2582-2586 (1989) STANDARD simple staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by N.Brisson, 01-JUN-1989 FEATURES from to/span description pept 70 1572 tryptophan decarboxylase (EC 4.1.1.28) mRNA 1 1740 tryptophan decarboxylase mRNA BASE COUNT 505 a 355 c 359 g 521 t ORIGIN Unreported. 1 ctctctctct ctctctctaa gactttctct ctctacacat acacctacac cagaaaaaag 61 aaaaaaataa tgggcagcat tgattcaaca aatgtagcca tgtccaattc tccagttgga 121 gaatttaagc cacttgaagc tgaggaattc cgaaaacaag cccatcgtat ggtagatttc 181 atagccgatt attacaaaaa tgtggaaaca tatccggtcc ttagcgaagt cgaacctgga 241 tatctccgaa aacgtatccc cgaaaccgct ccttacctcc ccgaaccact tgacgacatc 301 atgaaagata ttcagaagga tattatccca ggaatgacaa attggatgag ccctaatttt 361 tatgcatttt ttcctgccac tgttagttca gctgcctttt taggagaaat gttgtctact 421 gccctaaatt cagtaggctt tacttgggtt tcttcaccag ccgccaccga attagaaatg 481 attgttatgg attggttggc tcagatcctt aaactcccca aatctttcat gttttcaggt 541 accggtggcg gcgtcatcca aaacaccact agcgagtcca ttctttgtac aatcattgcc 601 gcccgggaaa gggccctgga gaagctcggt cccgatagta ttggaaaact tgtctgttac 661 ggatccgatc aaacccatac catgttcccc aaaacttgca aattggcggg aatttatccg 721 aataatatta ggttaatacc tacgaccgtc gaaacggatt tcggcatctc acctcaagtt 781 ctacgaaaaa tggtcgagga tgacgtggcg gccggatatg taccgctgtt cttatgcgct 841 accctgggta ccacctcgac cacggctacc gatcctgtgg actcactttc tgaaatcgct 901 aacgagtttg gtatttggat ccacgtggat gctgcttatg cgggaagcgc ctgtatatgt 961 cccgagttta gacattactt ggatggaatc gaacgagttg actcactgag tctgagtcca 1021 cacaaatggc tactcgctta cttagattgc acttgcttgt gggtcaagca accacatttg 1081 ttactaaggg cactcactac gaatcctgag tatttaaaaa ataaacagag tgatttagac 1141 aaagttgtgg acttcaaaaa ttggcaaatc gcaacgggac gaaaatttcg gtcgctgaaa 1201 ctttggctca ttttacgtag ctatggagtt gttaatttac agagtcatat tcgttctgac 1261 gtcgcaatgg gcaaaatgtt cgaagaatgg gttagatcag actccagatt cgaaattgtg 1321 gtaccgagaa acttttctct tgtttgtttt agattaaaac ctgacgtttc gagtttacat 1381 gtagaagaag tgaataagaa acttttggac atgcttaact cgacgggacg agtttatatg 1441 actcatacta ttgtgggagg catatacatg ctaagactgg ctgttggctc atcgctaact 1501 gaagaacatc atgtacgccg tgtttgggat ttgattcaaa aattaaccga tgatttgctc 1561 aaagaagctt gatgaataag taagggtttt tttttaattt ttttttaaat tttatatttg 1621 ctgattgttt gaagagttta aaaataaagt gatttgtaaa ggtttattgt actcaaacaa 1681 tcatgcaatt aattatatgt attaattatg acatgagaat aaaatagaat ttgtgtgtgc // LOCUS WHTAMYA 2397 bp ds-DNA PLN 17-JUL-1990 DEFINITION Wheat alpha-amylase gene, complete cds. ACCESSION M16991 KEYWORDS alpha-amylase. SOURCE Wheat (strain Chinese Spring) distal half-grain DNA, clone lambda-Amy3/33. ORGANISM Triticum aestivum Eukaryota; Plantae; Embryobionta; Magnoliophyta; Liliopsida; Commelinidae; Cyperales; Poaceae. REFERENCE 1 (bases 1 to 2397) AUTHORS Baulcombe,D.C., Huttly,A.K., Martienssen,R.A., Barker,R.F. and Jarvis,M.G. TITLE A novel wheat alpha-amylase gene (alpha-Amy3) JOURNAL Mol. Gen. Genet. 209, 33-40 (1987) STANDARD full staff_review COMMENT Draft entry and printed copy of sequence for [1] kindly provided by D.C.Baulcombe, 13-AUG-1987. The region bounded by nucleotides 975 and 1032 might be a signal peptide. FEATURES from to/span description pept 963 1052 alpha-amylase, exon 1 1138 2028 alpha-amylase, exon 2 2133 2393 alpha-amylase, exon 3 pre-msg 908 > 2397 alpha-amylase mRNA and introns (5' end +/- 3 bp) IVS 1053 1137 alpha-amylase intron A IVS 2029 2132 alpha-amylase intron B BASE COUNT 613 a 615 c 599 g 570 t ORIGIN 874 bp upstream of SstI site 1 aacattcaca atgcaaaatc tatatcatca gatccatcat gaaatatatt ttcatattat 61 atattttttg gtattgtagc tgcttttttc cttgaataca cacgagtgtg atatatatat 121 ttttggtatt gtagctgctt taattatata tatatatata tatatataca tatatatata 181 tatatataat taaagaagat gatgggctaa gaagcccttg gtgttgtaga tgttgatatt 241 ttttgatata attttggtca aactttataa agtttggatt ttcagaaatc taacacagag 301 agtaaaaaga aacggaagga gtatataata agagttctag cagccgctcc tccaactgca 361 cccaactgca ccctcgcctg agccggagcg gcgaggggga ggctaggtcg ccgcgccacc 421 atgtaaaaag attctttttt tatctttacc gactctagcc tcaccttgta cacacaaaaa 481 caagccaatc gaggtttttt tctagaatca acatcaaacc atccggtttt atgtgaaccg 541 cccggttcat cggttttttt tcaaaaaacg tctgattcga ccgggttttc ctggttggac 601 tcaatcggtc caccagactt aaagaactgc aggaatcgcc ggttccggtt ttcaaaacta 661 tgagcagagt gcttgtctag tgcagatata catggagagt aattaaacgg cagtaattaa 721 tagacaaaac tttagctcct aatccatttt atctcagaag ccagcggtcc ttatccatgc 781 ggatatcttt ggagaaaaat taaggagaat tgatagtttt tactaggcac ctactaatct 841 ctaagctcca agtacctata aatacaagcg agctcaagca ttgccaaacc aaggagtagt 901 cgtcggtcca ggagtccaga tttcatagga agtagaggcg agctgcactg cagactgtag 961 ctatgggcaa gcactctgct actctctgtg gccttctggt tgttgtgctt tgtctcgcct 1021 ccagcttagc acaggctcaa attcttttcc aggtgagaaa atgcatgcat cttcaagttt 1081 catttgcacc cagaatcttt catatgatga aatgtgcttt ggtgtttcgt tggccagggg 1141 tttaattggg aatcgtggaa gacacaaggc gggtggtaca agttcatgca agggaaggtg 1201 gaggagatcg ccagcaccgg cgccacccac gtctggctcc ccccgccgtc gcaatccgtg 1261 tcgccggagg gctatctgcc ggggcagctc tacaacctca actccaagta cggcagcggg 1321 gccgacctca agtcgctgat ccaggcgttc cgcggcaaga acatctcgtg cgtcgccgac 1381 atcgtcatca accaccgctg cgccgacaag aaggacggcc gcggcgtcta ctgcatcttc 1441 gagggcggga cgtccgacaa ccgcctcgac tggggccccg acgagatctg cagcgacgac 1501 accaagtact ccaacggccg cggccacagg gacaccggcg gtggcttcga cgccgcgccc 1561 gacatcgacc acctcaaccc gcgggtccag agggagctct ccgcctggct caactggctc 1621 aaaaccgacc tcggcttcga cggatggcgc ctcgacttcg ccaagggata ctccgcggcc 1681 atggccaaga tctacgtcga caacagcaag ccggcgttcg tcgttggcga gctctacgac 1741 cgggaccgac agctgctcgc gaactgggtg cggggcgtcg gcgggccggc cacggcgttt 1801 gacttcccca ccaagggcgt tctccaggag gccgtccagg gcgatctggg gaggatgcgc 1861 ggcagcgacg gcaaggcgcc cggtatgatt gggtggatgc cggagaagac cgtcacgttc 1921 atcgacaacc acgacactgg gtcgacgcag aggctgtggc cattcccctc ggacaaggtc 1981 atgcagggct acgcctacat cctcacacac ccgggcatac catgcatcgt aagtagtagc 2041 acactacaca acctcaccat aacatttcgc atcaaacgta ccccacgatg tttgtgatct 2101 gaacttacaa ctacttggtt ttgcgcgcgc agttctacga ccatgtgttc gactggaaac 2161 tgaagcagga gatcaccgca ctggctacgg tcaggtcaag gaacgggatc catccgggta 2221 gcacactgga tatcctcaaa gccgagggcg atctctatgt tgccaagatt gggggcaagg 2281 ttataaccaa gatcgggtca agatacaata ttggcgacaa tgtgatcccc tcgggtttca 2341 agattgcagc taaaggcaac aactattgtg tctgggagaa gagcggcctc tgaattc // LOCUS STAFEMA 3446 bp ds-DNA BCT 17-JUL-1990 DEFINITION S.aureus factor essential for expression of methicillin resistance (femA) gene, complete cds, and trpA gene, 3' end. ACCESSION M23918 KEYWORDS femA protein. SOURCE S.aureus (strain NCTC 8325; organism BB270) DNA. ORGANISM Staphylococcus aureus Prokaryota; Bacteria; Firmicutes; Gram-positive cocci; Micrococcaceae. REFERENCE 1 (bases 1 to 3446) AUTHORS Berger-Baechi,B., Barberis-Maino,L., Straessle,A. and Kayser,F.H. TITLE FemA, a host mediated factor essential for methicillin resistance in Staphylococcus aureus: Molecular cloning and characterization JOURNAL J Antimicrob Chemother 23, 671-673 (1989) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly provided by B.Berger-Baechi, 11-APR-1989. FEATURES from to/span description pept < 1 294 trpA protein (AA at 1) pept 577 1878 FemA protein pept 1897 3156 ORF 419 protein pre-msg 514 > 3446 FemA mRNA rpt 361 390 large inverted repeat, left end rpt 405 434 large inverted repeat, right end rpt 416 423 small inverted repeat, left end rpt 430 437 small inverted repeat, right end BASE COUNT 1281 a 448 c 617 g 1100 t ORIGIN 3 bp upstream of EcoRV site; segment 18 of S.aureus chromosome. 1 gatatcgtat cccatgcgga aggctttatt tatactgtga cgatgaatgc gacaacaggg 61 caaaacggtg cgtttcatcc agaattaaaa cgaaaaattg agtcaattaa agcgatagcc 121 aatgtgccag ttgtcgcagg atttggtata agaacaccac aacatgttgc agatataaaa 181 gaggttgcag atggcattgt cattggtagc gaaatcgtta agcgatttaa atctaacacg 241 cgtgaggaaa tcattaaata tttacaatct atccaacaaa cattgaataa ttaagtttac 301 ttgatttaaa aaaattaggc gaatactgtt tgaaaaagtg aaaaacggtg aattataaaa 361 ttgaatacaa tttcaaaaaa agtaatatga gcaaacccaa acgttcatat tacttttttt 421 gaaattgtat tcaaaaatct aaatattact ataaaagtat acgcaattaa agcgtttatg 481 ttttagtttt aacattaact attgtatact tatttagatt agatttatta tttttgacat 541 ttgcagaggg gaaatagaaa aactgcaaat acggaaatga aattaattaa cgagagacaa 601 ataggagtaa tgataatgaa gtttacaaat ttaacagcta aagagtttgg tgcctttaca 661 gatagcatgc catacagtca tttcacgcaa actgttggcc actatgagtt aaagcttgct 721 gaaggttatg aaacacattt agtgggaata aaaaacaata ataacgaggt cattgcagct 781 tgcttactta ctgctgtacc tgttatgaaa gtgttcaagt atttttattc aaatcgcggt 841 ccagtgattg attatgaaaa tcaagaactc gtacactttt tctttaatga attatcaaaa 901 tatgttaaaa aacatcgttg tctataccta catatcgatc catatttacc atatcaatac 961 ttgaatcatg atggcgagat tacaggtaat gctggtaatg attggttctt tgataaaatg 1021 agtaacttag gatttgaaca tactggattc cataaaggat ttgatcctgt gctacaaatt 1081 cgttatcact cagtgttaga tttaaaagat aaaacagcag atgacatcat taaaaatatg 1141 gatggactta gaaaaagaaa cacgaaaaaa gttaaaaaga atggtgttaa agtaagattt 1201 ttatctgaag aagaactacc aatttttaga tcatttatgg aagatacgtc agaatcaaaa 1261 gcttttgctg atcgtgatga caaattttac tacaatcgct taaaatatta caaagaccgt 1321 gtgttagtac ctttagcgta tatcaacttt gatgaatata ttaaagaact aaacgaagag 1381 cgtgatattt taaataaaga tttaaataaa gcgttaaagg atattgaaaa acgtcctgaa 1441 aataaaaaag cacataacaa gcgagataac ttacaacaac aacttgatgc aaatgagcaa 1501 aagattgaag aaggtaaacg tctacaagaa gaacatggta atgaattacc tatctctgct 1561 ggtttcttct ttatcaatcc atttgaagtt gtttattatg ctggtggtac atcaaatgca 1621 ttccgtcatt ttgccggaag ttatgcagtg caatgggaaa tgattaatta tgcattaaat 1681 catggcattg accgttataa tttctatggt gttagtggta aatttacaga agatgctgaa 1741 gatgctggtg tagttaaatt caaaaaaggt tacaatgctg aaattattga atatgttggt 1801 gactttatta aaccaattaa taaacctgtt tacgcagcat ataccgcact taaaaaagtt 1861 aaagacagaa ttttttagga agggaattat caaaacatga aatttacaga gttaactgtt 1921 accgaatttg acaactttgt acaaaatcca tcattggaaa gtcattattt ccaagtaaaa 1981 gaaaatatag ttacccgtga gaatgatggc tttgaagtag ttttattagg tattaaagac 2041 gacaataaca aagtaattgc agcaagcctt ttctctaaaa ttcctactat gggaagttat 2101 gtttactatt cgaatcgtgg tccagtaatg gatttttcag atttaggatt agttgattat 2161 tatttaaaag agttagataa atatttacag caacatcaat gtttatatgt taaattagat 2221 ccgtattggt tatatcatct atatgataaa gatatcgtgc catttgaagg tcgcgagaaa 2281 aatgatgccc tagtaaactt gtttaaatca catggttacg agcatcatgg ctttacaact 2341 gagtatgata catcgagcca agtacgatgg atgggcgtat taaaccttga aggtaaaaca 2401 cccgaaacat tgaaaaagac atttgatagt caacgtaaac gtaatattaa taaagcgata 2461 aactatggtg ttaaagtcag attccttgaa cgtgatgagt tcaatctttt cttagattta 2521 tatcgtgaaa ctgaagagcg tgctggattt gtatcaaaaa cagatgatta tttttataac 2581 tttattgaca catatggaga taaagtatta gtaccattag catatattga ccttgatgaa 2641 tatgtgttaa agttgcaaca ggaattgaat gacaaagaaa atcgtcgtga tcaaatgatg 2701 gcgaaagaaa acaaatcaga taagcaaatg aagaaaattg cagaattaga taagcaaatt 2761 gatcatgatc agcatgaatt attgaatgca agtgaattga gcaaaacgga cggcccaatt 2821 ctaaaccttg cttctggcgt ttattttgca aatgcatatg aagtgaatta tttctctggt 2881 ggttcatcag aaaaatataa tcaatttatg ggaccataca tgatgcattg gtttatgatt 2941 aactattgct tcgataatgg ctatgatcgt tataatttct atggtttatc aggtgatttt 3001 acggaaaaca gtgaagatta tggcgtatac cgctttaaac gtggatttaa tgtacaaatc 3061 gaagaattaa taggggattt ctataaacca attcataaag tgaaatattg gttgttcaca 3121 acattggata aattacgtaa aaaattaaag aaatagatgt aaatgatgtt agggcttata 3181 gttattgata ctataggctc ttttttatat gttttttaat aaagcctcga ataatgatat 3241 attaaatgac aagtttaaca agtgattgca ataaatataa atatgaattg cacaacctat 3301 ttgtaaatga ttgagttgag tatacttttt aattcattca atatgactca aaacaagcca 3361 atatgtatta tatttttggt gaataaacct tagcataaca atttatgtct gattatatat 3421 gtgtgtgttt gcaagtgtgt gatatc // LOCUS WTVPNS11A 1172 bp ds-RNA VRL 17-JUL-1990 DEFINITION Wound Tumor Virus genomic segment S10 encoding Pns11 nonstructural polypeptide, complete cds. ACCESSION M24114 J04352 KEYWORDS polypeptide Pns11. SOURCE Wound Tumor Virus, cDNA to viral RNA. ORGANISM Wound tumor virus Viridae; ds-RNA nonenveloped viruses; Reoviridae; Phytoreovirus. REFERENCE 1 (bases 1 to 1172) AUTHORS Anzola,J.V., Dall,D.J., Xu,Z. and Nuss,D.L. TITLE Complete nucleotide sequence of wound tumor virus genomic segments encoding nonstructural polypeptides JOURNAL Virology 171, 222-228 (1989) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly provided by D.L.Nuss, 20-APR-1989. FEATURES from to/span description pept 25 1068 polypeptide Pns11 BASE COUNT 359 a 222 c 236 g 355 t ORIGIN 1 ggtatttttg ttggatctgc tgtgatggat gcttctgtcg acaggatcac aaatctacat 61 tttgaaatcc tcgctaaagc gggtggacat gaaatacacc aaaaatatga agcaattagg 121 aagttgaacc ttacgggtga ttcttcaaag agtaatatct ctgtatctgc aaggtcagca 181 atattgaaat gggctgacgc taaacaagga tacatagcca gccaactgga cgatcgagat 241 tatggcgacc tcatcgccaa agctgtaatc tttgtcccga tgtctgttat aacaggaggg 301 aaaaatccga aggatttaat accttacgga gttgtcgctg cagtgttaat ttttgttcct 361 gaaacgttga cactactgga tgagatagtt attaatttaa tgcatgataa gaaacctttg 421 tcatcaattt tattgacaaa gatactcaga gacatgaaaa ttgatgtctg tggtagtaat 481 tttgattcat tttattactg cccaatttcg aggtataata gacatattat taagctagct 541 ggtgcattac ctcaaatgcc gactagcgtt agattatcag ttaatgattt agctcgagta 601 gctatttccg aagtacataa ccagcttatt tcggataaac agatgttttt caaactacca 661 actggttttt cgccaaaagt gcattgttta aaagtacttt gcactactga aatggaaatt 721 tttcagaaat gggttagaac gttcatgtct gatagaccga acgaattcat ctattctgac 781 cagtttaata tattatctag gaccacctac ttttcttcag atgatccttt ttcatttttt 841 actctttgga ggggatggtc aacctacaaa gaaatactct ctcaagatca agcttcaagc 901 tttcttgaag ccataggaag tggaaagccg ttgagaagtt caatcgctac ttttccatca 961 atgtttgatg aaggtgctat ttacatcaga tatgaatgga taactccaaa agattcagca 1021 aattcgaaga aggctggctc ttcagcaccg agcgcgccta aaatgtagaa gatttgtcct 1081 tgagattcgc atctggagca gcccggccta gtcacccgga gccagatcga gctaaaactc 1141 gagttgaatt ccccagctcc aacaaacatg at // LOCUS WTVPNS4A 2565 bp ds-RNA VRL 17-JUL-1990 DEFINITION Wound Tumor Virus genomic segment S4 encoding Pns4 nonstructural polypeptide, complete cds. ACCESSION M24117 J04352 KEYWORDS polypeptide Pns4. SOURCE Wound Tumor Virus, cDNA to viral RNA. ORGANISM Wound tumor virus Viridae; ds-RNA nonenveloped viruses; Reoviridae; Phytoreovirus. REFERENCE 1 (bases 1 to 2565) AUTHORS Anzola,J.V., Dall,D.J., Xu,Z. and Nuss,D.L. TITLE Complete nucleotide sequence of wound tumor virus genomic segments encoding nonstructural polypeptides JOURNAL Virology 171, 222-228 (1989) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly provided by D.L.Nuss, 20-APR-1989. FEATURES from to/span description pept 64 2262 polypeptide Pns4 BASE COUNT 809 a 463 c 625 g 668 t ORIGIN 1 ggtattgatc ccactttcta gatcagtgag ctagtgaagt aaggttcaaa actaatcatc 61 gccatgaagc aagtagccgg gtttattggt atgagtcaaa agaacaaagg aattcagcaa 121 aaccagtggc attccggtcc gccacaggga ctgctttcag gacaaacgaa agctgaacaa 181 ggtaccagca gtcagcaagc tggagtgaat caaggagaga attccaaatc tggggcggta 241 gtccagagga aaatgccaat gaggagagac gttaattctg ccgcacaacg tcaacatgta 301 aggaggttgg tgtacacacc atactttcca aatgaaacat ggaatatatc aacattcaaa 361 aatggagcta aagacgtaga acggtcggtt atatcgacgc tgaacaacgt cgctgagaat 421 agagtcatgg acaactgtac ttccagagtc atttttgaga tgactcaaat acaatttgag 481 tcacttccag atataataag aaatgaattt actagagttg gggatgacgc cttgaaatgg 541 gcggtccctg aagatttgaa atctgcagat cttgaccata tgatggtcgt gaaattatca 601 acagaaggca cgatataccc aacgacgctt atttttccag gtggatgtag tgggatggct 661 aaattgaaat cagtctactc attcttggaa agtcagctag aaagaatagt cacaccgaca 721 ccctcggtgt ctttgaagta tgttacaagc tgggctgagc acttatttga tctgtgctcc 781 ggacagctga tcaattcaca aaacgagcgt gtagacaaac tgcttggata catgatctgg 841 gatattgaaa aagctatcac tttgactgat caagtgatcg catgttacaa ccaccctgaa 901 gttgtgctga ggaggctagg ggcctcggat attgcatgcg ctgtgttagc tggggagtct 961 gtggtgaagc tcacccgcct agcgttatcc aagtctcctg ttgatggctg ctcatgctgt 1021 agaatattgg agctgattct aaatttacca tcacgtaagc ccaacgataa ggtgccgcag 1081 gtaccattgg acatactatt cgcatctgtc tatagatacg tatcagccat gtgcatgggg 1141 cgagtgctga atgggcgcat tgacgcatcg gggatacagt cgactgatca tgcgactgct 1201 tcgatcaagc tgaatgatat aattgtaaat gatcttgagt taaggagtat gggtgttgat 1261 aaaacgtcat cattcagagg aacgcaatct atgagggcgt tctacgttcc tgagaattta 1321 gcaggctcta tactagatag gattaacgtg ttggtcatgc gccattttgg aatcctgcat 1381 atgtggggtt ttaatggtgt agtacttcaa aatcaagagg ggtactgtga ctatcacatt 1441 ataacaggat taaaccactt aacaactata acgcatacta actccatggt tgcagtacat 1501 tggggtactg aatcaaggat ggataatata tttgaaataa aagcaagaac tcttccaacc 1561 gcatccgaga cgatgataac attgatcgaa aatgccttga aggagcaact gacatcaata 1621 gtgaaagacg gcctcaggaa gggagtaagt ttttcagtga aaagaaatat aaatgactct 1681 cgctttggat ttgagactaa ttcatcccca gcgatatttc tgaaactcag agacatgcta 1741 aaaagagcta agcctttctc agacctgtta tccttggcat tgtcaaaggt aataaaaaaa 1801 gagaacgcta tgatacaacg gtcaatcacc acagtggaag tggccgttgc cattaaaatg 1861 aaagtctatg ggttggatga gtatgtcagc ctgatgaaag ttgagaagaa ggaagtggaa 1921 agtggaagcc ttccacttca agaatttctg aaattaaaat caaatgcagc aggggcacag 1981 tcatcgactg ttgcagttaa gatgaaagaa gaagaagtca atagcaaagc ctattgcctg 2041 atatcagaaa ctattgttgt aaacatggac gcagtgagaa gtgcatgtgg agtagtgcaa 2101 tcagagaatc tggtcataaa gtcagaactt tcaggaccag aattgagtga aagtgtaact 2161 tcagggttga tggagttact aggaagaaat gccggaccaa gtaaatcttg ggctgaccaa 2221 gttgaagaag cggaaaatga agaggaaaaa caaaaggagt gagtgtgctc catttctgga 2281 tcggagtgtc ttgcgaggct cgggtctata gatttataaa catgtattat aattaatatt 2341 attagtggta atatattgca tttgtaaata gtgtgcttat ctcattaggc ctgccattag 2401 ggcggggtcg tgatgaaccc tcaggactat accacgtccc attttagctc gctatagcaa 2461 atttgagtat caacataagt gactggagtt taaaccctct tagctttatt cttgagcgaa 2521 gctataatga cgatggttgg tctagggctg atggatcatc atgat // LOCUS WTVPNS7A 1700 bp ds-RNA VRL 17-JUL-1990 DEFINITION Wound Tumor Virus genomic segment S6 encoding Pns7 nonstructural polypeptide, complete cds. ACCESSION M24116 J04352 KEYWORDS polypeptide Pns7. SOURCE Wound Tumor Virus, cDNA to viral RNA. ORGANISM Wound tumor virus Viridae; ds-RNA nonenveloped viruses; Reoviridae; Phytoreovirus. REFERENCE 1 (bases 1 to 1700) AUTHORS Anzola,J.V., Dall,D.J., Xu,Z. and Nuss,D.L. TITLE Complete nucleotide sequence of wound tumor virus genomic segments encoding nonstructural polypeptides JOURNAL Virology 171, 222-228 (1989) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly provided by D.L.Nuss, 20-APR-1989. FEATURES from to/span description pept 45 1607 polypeptide Pns7. BASE COUNT 551 a 298 c 338 g 513 t ORIGIN 1 ggtattttct cctgaaatac ttgacctggc tagggttcgg tgaaatgaag atttccgact 61 tctgtttcgc ctctgcaaat gatggatcat acactttaaa agcattttcg gaacttaatg 121 aatacaagga tgtagtgaaa cttgtctctg atgaaaagat tggagttgga tttcactgct 181 acaatttagg tctgatgaat atagttgaag atttttccgg aaatcttgac aacgaatctt 241 acttgaccag taaggtaggc aaaagaatgg cttccgagct agtcactgca tactcaaaat 301 ttgggtctac tagcagccgg accttacata gctcattaaa tctaccagtc gtaaatataa 361 cttcattacc aacatcccaa gctaaagatc tcaaaccgaa tcattcattg gatgataagg 421 gatctatgct acggactcaa atacattcaa ttctgactgg aaacggtccg ctaactataa 481 aacgaaggat agatgcgttt tattatagcg cctcaagtat attcactaga cacatgactt 541 cgaaatatgc taatcctgga tcaaatgtgc ctcaaaggtt ttcatttata ccagattgtg 601 caatgaataa gaaaccaacg ctatttctag agaatagaga taacgaactc caagattcca 661 tgaccatcat gttaatgttg ggacaggtct tttcggatgc tttaacatat tatctaaatg 721 cctcaatact atacggtatc ctaggtagaa ttgaatccaa agtgcaggtt gatttaccag 781 ccatcactct tgaatccgtt catgttacaa ataatttgga aatctctccg gccgcatttg 841 ctctaatagc gtcagtctgg cttgataaag ctgaaatatt atcaaaatta aatgcgatag 901 acttcatagt gtctcctgaa gataatgaag atagaatatc aaatttactg aaactgatgc 961 tgccagttca gtctaataat ataacggttg aaaaatctga cactaggttc tcggtaaccc 1021 attcagatgg atttatgagg tattatatgt gtttttctaa acacgaattt gattatggag 1081 accaccttga gtcttttggc atcccagtgc tgcgtgtaag actaggtaaa ccaatatcaa 1141 atgaactgaa caagccgatg ctggtaatgt tcaagaagca tgaatcaata tctagcataa 1201 atgttagata tcaagtaaga ggtggaagta tacctaaatt caggacctca gaatttagaa 1261 gagatatagg aatgttagtt gccaattctc ggtttatggc gactgatatt actttgatat 1321 tgtctacatt ttatccattc actcaggaaa cggataaatt atttattgaa caacacataa 1381 aagaaatttt tcttgatatg tatccatgga ttgataaact aacttcagcg gacgctaaat 1441 ctgaggttaa tatatcatat gggaatttag tgctatactc atatggagag ttagtcaaaa 1501 atagtatatt tattgcaatg atggacaatt gcaaggacgc ccggaatagt ttttctcggg 1561 ctgatatgag agagatccaa gcatttgtag cagcgtttac acagtgagta agattttttc 1621 atgaacctgg agcttgcgag gccaccagca gctaaaaccc gtaggtggag gttgtataca 1681 tggtgagtgg ggagcatgat // LOCUS ATUVIRD 3662 bp ds-DNA BCT 17-JUL-1990 DEFINITION A.tumefaciens virD1/D2 protein genes, complete cds. ACCESSION M33673 KEYWORDS virulence protein. SOURCE A.tumefaciens plasmid pTiC58 DNA, from Nicotiana tabacum. ORGANISM Agrobacterium tumefaciens Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Rhizobiaceae. REFERENCE 1 (bases 1 to 3662) AUTHORS Wang,K., Herrera-Estrella,A.H. and Van Montagu,M. TITLE Overexpression of virD1/D2 genes in A.tumefaciens enhances T-complex formation and plant transformation JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer readable sequence for [1] kindly submitted by A.Herrera-Estrella, 11-APR-1990. Author address [1]: A.Herrera-Estrella Rijksuniversiteit Gent Department of Genetics Ledeganckstraat 35, Ghent, East Flanders Belgium B-9000 e-mail [1]: jecop@gengenp.rug.ac.be FEATURES from to/span description pept 648 1091 unknown protein 1 pept 1125 2468 unknown protein 2 pept 2483 3079 unknown protein 3 ORF 3118 > 3662 unknown protein 4 BASE COUNT 870 a 975 c 1042 g 774 t 1 others ORIGIN 1 gtcgacaatg cttcatcgat atcgagcgga gttaacatgg tcgggatcag aagcaggttt 61 gagctggcaa tgatcgtgtt gttgagttcg ctcgaaccac catgcgtatc ggccagcgca 121 taatcaaatc cttggagttc ggcgtcctca taggccgcct caaggagtgc catttcctcg 181 gcggcgtaga cttcgcagaa ggagccctag gtattgctgc gaagggcgtt ttctttccat 241 cgcgttagtg gtcggttttc atcagcgtcg aagagagcca atcgtttgcc gtcacttgca 301 aaggcggcgc aaagacccat gagtgccgtg gttttgccgg ctcctccctt gaaggagcaa 361 aatgtcagaa gtttcatgtc cttatcctgt cgattttgtg aagcggaagt gtgtctgtac 421 ttttatttgt gtgtatgatt ttgcgataat tcataagtaa tgtagtaatt acctgatttt 481 atatttcaat tttattgtaa tataatttca attgtaataa tataaaaata aatatccctt 541 atgtgttctt gatttcgttt tgtatatggc tagattccca tctgccacga cgaggaaatg 601 ctacggcggg gcaagttcag atctttccgt cttctatgga ggaagctatg tcgcaaggca 661 gtaggcccac ctcaagtgac attgccgtca accagcgcga atgcgtgaag gttgaaggct 721 tcaaggtcgt cagtacccga ttaagatcgg ccgaatatga gagtttttct catcaggcac 781 gcttgctggg cctctccgac agcatggcca tacgggttgc ggtgcgccgc attggtggct 841 ttcttgaaat cgacgcagag actcgtcata ggatggaggc catactacaa tccataggaa 901 cactctcaag caacattgcc gcgctgctat ctgcctatgc cgaaaatccg acaatggatt 961 tggaggcttt gcgagctgaa cgtatcgcct tcggtaaatc tttcgctgac ctcgacggct 1021 tgctccgttc cattttgtcc gtatcacggc ggcggatcga cggttgctcg ctgctgaaag 1081 acgccttgta gcactgacgt agcacttggc ggggaacata ttcgatgccc gatcgagctc 1141 aagttatcat tcgcattgtg ccgggaggtg gcaccaagac ccttcaacaa attatcaatc 1201 agttggagta tctatcccgg aagggcaggc tggagctgca gcgttcagcc cgacatctcg 1261 atattcccct gccaccggat caaatccacg aacttgcccg aagctgggtt caagagactg 1321 gaacttatga cgaaagtcag ccagacgagg aaaggcaaca ggagttgacc acccatatta 1381 ttgtaagctt ccccgccggt acaagccagg tagcggctta tgcggcgagc cgggagtggg 1441 cagccgagat gtttgggtca ggcgcagggg ggggccgata caactatctt acggccttcc 1501 acatcgatcg cgaccaccca catctgcatg tcgtcgtcaa tcggcgcgaa cttttaggac 1561 acggctggct gaagatatct cggcgccatc cccaactgaa ttacgacgcc ctgcgcataa 1621 agatggccga gatttcactt cgtcatggca ttgccctcga tgcgagccga cgagcagaac 1681 gtggcatcac cgagcggccg atcacttatg cccaatatcg gcgccttgag cgggagcagg 1741 ctcgccaaat ccgtttcgaa gacgcggatt tggaacagtc gtcgccgcaa ggagatcatc 1801 cagaattcag ccaacctttc gatacatccc catttgaagc atccgcgggc ggaccggagg 1861 acatgcctcg gcccaacaat cggcagaatg agtcgcaagt tcatctccag gagccagctg 1921 gtgtcagcaa cgaagccggt gtccttgtgc gggttgcatt ggagacggag cgccttgctc 1981 aaccattcgt ttccgaaacc attctcgcgg acgacatagg gagcggctct tcgcgtgttg 2041 ccgagggccg tgtggagagc gcaaaccgca ctcccgatat tcctcgcgca gcaactgaag 2101 ctgccacgca cacgacacac gaccggcagc ggcgtgcaaa gcgtcctcat gatgacgacg 2161 gagggccgag tggagcaaaa cgtgtgacat tggaaggcat cgcggttggg ccccagcgaa 2221 cggccggcga acaggatggc agtagtggcc ccttagaacg gcaagctgga acgtctcggc 2281 catctccacc gacggccacg acgcgggcca gcaccgcaac cgattcattg tctgctacag 2341 cccacctcca gcaacggaga ggtgtccttt caaagcgtcc gcgtgaagat gatgatggag 2401 aaccgagtga acgcaaacgc gagagagatg agcgcagcaa ggacgggcgt gggggaaata 2461 ggagatagag aaggaccgag tgatggcaaa tgaagagttc accagacact atgcgtggcc 2521 cgttcctgtg gcttcgaatg atgaagggcg tgggaccgcg cggatcccca tccaggcaca 2581 atcaatcgtt gctggagaag acggtcggga cacttcggtc ccaacggctt tgtcgcgacc 2641 gccaattgaa gatatgccgc acggcgtcca agaaacatcg gcgagtggcg gacgactggg 2701 agcggcccgt ctgcgggatt ccgtaatccc gccaggaata tccgaagccc gcacggacct 2761 atccgcaatt ttgcggaaaa aaagcggttc tttccgcacc ggtatgcagt atctgcgtgg 2821 gcttgaacgg gaaaattttg ataaacaaga cagggaagcc agtgcgttgc cagatttaag 2881 tgcaaggggc ataaagcgac cgcgcgaaat tgagtatccc ggcaatgcaa gcggattaac 2941 cataaagagg caagacggct taggcataga gatcaatact atctcggcat cttcgcccgt 3001 gaaccggggc cgcgcattcg tcgaactggc aaggcgcgcc ggaaccgggc gtgtacaatg 3061 ttcagccatc ggcagataga gcacagaact ctgcgcagga aagttcgacg tttcccgatg 3121 gcacttctgt ttccgctctc tattcaggcc cactcgcaga atggttcgag agggatactg 3181 gcagcgagac gaccaggaat tcaggcaaca ccatttcgtc accgctccga gggctggagg 3241 agttcggcga ttccgcggac agccggtatc ttgggcgcga agctcagagt ctttcagtta 3301 ccgtaacaac gcctaattcg aatgctgagg caagttctca tagcgcacac actgaaactc 3361 tcgacgatgt cagcagtgac cgctcgagcg aacaggggag gggtcccctt ggcgccgcaa 3421 tccttggatc ccatcatgac cttttcnccg cgtgcgcaga agttatcaca aacaaaccgc 3481 gattcccctg aactgaccga cgccgaccta gcaaaagtcg atgcggtgtt cgaatctctc 3541 tccaaggggc ccccggcggg ggagagcgcc gcaccggact ttcgtgagcg cggaccaggc 3601 agcgctttcc agaaagaagg cgtgtccgac agggcgaacg gggtgcccac taactgggag 3661 gt // LOCUS CHKFIBAB 2730 bp ss-mRNA VRT 17-JUL-1990 DEFINITION Chicken alpha-fibrinogen mRNA, 3' end, and ORF, complete cds. ACCESSION M34096 KEYWORDS alpha-fibrinogen; fibrinogen. SOURCE Chicken (strain White Leghorn) liver parenchymal cell line, cDNA to mRNA. ORGANISM Gallus gallus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Aves; Neornithes; Neognathae; Galliformes; Phasianidae. REFERENCE 1 (bases 1 to 2730) AUTHORS Weissbach,L. and Grieninger,G. TITLE Bipartite mRNA for chicken alpha-fibrinogen potentially encodes an amino acid sequence homologous to beta- and gamma-fibrinogens JOURNAL Proc. Natl. Acad. Sci. U.S.A. 87, 5198-5202 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by G.Grieninger, 08-MAY-1990. FEATURES from to/span description pept < 1 1518 alpha-fibrinogen (AA at 1) ORF 1830 2582 ORF mRNA < 1 2730 alpha-fibrinogen mRNA BASE COUNT 825 a 574 c 678 g 653 t ORIGIN 1 acgatactct gtgtgttgct ctgcctcaat ttagcttggg ctcaggatgg gaagaccacc 61 tttgaaaagg agggcggagg cgggcgtggc cccaggatcc tagagaacat gcacgagtcg 121 tcctgcaagt atgagaagaa ctggcctatc tgtgtggatg atgactgggg tacaaaatgt 181 ccatcatgct gcagaatgca aggaataatt gacgacacag accagaatta cagtcaaaga 241 atagacaata ttaggcagca actcgcagac agtcaaaaca agtataaaac atccaatcgg 301 gtaattgtgg aaaccataaa tatactaaag cctggtctgg agggtgccca gcagctcgat 361 gagaattacg gtcatgtgtc aacagaactg aggaggagaa ttgtgacatt aaagcagcga 421 gttgccactc aagtgaacag aattaaagct ctgcagaaca gcatccagga gcaggtggtg 481 gaaatgaagc gcttggaggt ggacattgat attaagatac gagcttgcaa agggagctgt 541 gctagaagtt ttgattacca ggtggacaaa gaaggctatg acaacatcca gaagcatctt 601 acccaggcca gctccattga tatgcaccca gactttcaaa caaccacctt gagcacactg 661 aaaatgaggc ctcttaagga ctcgaatgtt cctgaacatt ttaagctcaa gccttcacca 721 gaaatgcaag ctatgagtgc atttaataac atcaaacaga tgcaagtggt attagaaaga 781 ccagaaacag accacgtggc agaagcaagg ggggattctt caccttccca cactggcaaa 841 ttaattactt cttctcacag gagagagagc cctagcctgg tagacaaaac ctcctctgcc 901 tcctctgtcc atagatgcac cagaactgtc accaagaaag tcattagtgg ccctgatggt 961 cctagagaag aaatagttga aaaaatggtt tcctctgatg gctcagactg ctcccatcta 1021 caaggaggac gagaaggaag cacgtaccat tttagtggga caggtgactt tcacaagcta 1081 gacaggcttc tgccagatct agagtcattc ttcacccatg actctgtatc cactagtagt 1141 aggcactcaa ttggctcaag caccagcagc catgtgactg gggcaggcag cagccaccta 1201 ggcactggag ggaaagacaa attcacagat ttaggggagg aggaagaaga tgactttgga 1261 ggacttcagc catctggatt tgctgctggc agtgcgagtc actccaagac tgtactgacc 1321 agctcctctt ctagtttcaa caagggaggc tccacttttg aaaccaagtc actaaagacc 1381 cgtgaaacat ctgagcagct aggtggggtg caacatgatc agagtgcaga ggacacccca 1441 gactttaagg cgcgcagctt cagaccagca gcaatgagta caaggagatc ctacaatggg 1501 aaaggtacac agaagtagta attgaggtag tggagccaaa ttcaattcat aaccaaactg 1561 acagaatgtt tttagatact gcagtataat agtgtgatga taaaatgcca gaactgtgtc 1621 tgtttgttgc cactttggag ataaaagaga actatatcac taatagtcag gtattgcata 1681 gacaattagc aatataagat ctgtgtaagg ttttcctctc tcaattcttt ttaatgttgc 1741 ctgtcactaa accttatgtt gtccaagaca catttaacaa attttcctca gatctattgc 1801 tgcttggatt cctctgaagt ttttattaag ctttactgat aacttctgtc aaaccagatc 1861 aacttttttt tttagactgc gatgatatcc gccagaaaca cacttctggt gccaaaagtg 1921 gcattttcaa aatcaagcca gagggatcca ataaggtttt gtcagtttat tgcgaccaag 1981 agaccacttt gggaggatgg ctattgatcc aacagagaat ggatggatca gtgaatttta 2041 accgtacgtg gcaagactac aggagaggtt tcggcagcgt ggatggcaaa gggcaaggag 2101 agttgtggct gggcaatgag aatatccact tgctgactca gaacgatact ctgcttcgag 2161 tagagttaga ggactgggat ggaaatgctg catatgcaga gtacatcgtg caggtaggga 2221 ctgaagcaga aggctatgcc ctgactgtgt cctcctatga ggggactgct ggggatgctc 2281 tggttgctgg ctggctggag gagggctctg agtacacatc ccatgcccag atgcagttca 2341 gcacttttga ccgggaccag gaccactggg aggagagctg tgcagaggtg tacgggggtg 2401 gctggtggta caacagctgc caggcagcca acctcaatgg catttactat ccagggggcc 2461 attatgaccc caggtacaat gttccatatg agatcgagaa tggagtagtc tggataccat 2521 ttagagcctc cgattattcc ctcaaagttg ttagaatgaa aatcagaccc ctggaaaccc 2581 tgtagaagga cgggcattta atatatgtgt cacaactaaa agttggaaac atcttttata 2641 tacctgtgtg cacgatgtac ctttacccta tgaatttgaa ggcagcatgg cacatctatg 2701 gcatacaaaa taaacggatt aattattaac // LOCUS ECOHYA 6023 bp ds-DNA BCT 17-JUL-1990 DEFINITION E.coli hya operon encoding hydrogenase isozyme 1 large and small subunit (hyaB and A) genes and hyaC-F protein genes, complete cds. ACCESSION M34825 KEYWORDS hyaA gene; hyaB gen; hyaC gene; hyaD gene; hyaE gene; hyaF gene; hydrogenase. SOURCE E.coli (strain LE392) DNA. ORGANISM Escherichia coli Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Facultatively anaerobic rods; Enterobacteriaceae. REFERENCE 1 (bases 1 to 6023) AUTHORS Menon,N.K., Robbins,J., Peck,H.D.Jr., Chatelus,C.Y., Choi,E.-S. and Przybyla,A.E. TITLE Cloning and sequencing of a putative Escherichia coli [NiFe] hydrogenase-1 operon containing six open reading frames JOURNAL J. Bacteriol. 172, 1969-1977 (1990) STANDARD simple staff_review FEATURES from to/span description pept 188 1306 hydrogenase-1 small subunit (hyaA) precursor sigp 188 322 hydrogenase-1 small subunit (hyaA) signal peptide matp 323 1303 hydrogenase-1 small subunit (hyaA) pept 1303 3096 hydrogenase-1 large subunit (hyaB) pept 3115 3822 hyaC protein pept 3819 4406 hyaD protein pept 4403 4801 hyaE protein pept 4798 5655 hyaF protein signal 5657 5683 pot. transcription termination signal BASE COUNT 1334 a 1607 c 1720 g 1362 t ORIGIN 1 ttttgtgcaa aagtttcact acgctttatt aacaatactt tctggcgacg tgcgccagtg 61 cagaaggatg agctttcgtt ttcagcatct cacgtgaagc gatggtttgc cttgctacag 121 ggacgtcgct tgccgaccat aagcgcccgg tgtcctgccg gtgtcgcaag gaggagagac 181 gtgcgatatg aataacgagg aaacatttta ccaggccatg cggcgtcagg gcgttacccg 241 gcgcagcttt ctcaaatatt gtagtctggc tgccacgtcg ctgggattag gcgcgggaat 301 ggcaccaaag attgcctggg cgctggagaa caaaccgcgc attccggtgg tatggatcca 361 cggtctggaa tgcacctgct gtaccgaatc ttttatccgc tccgctcacc cactggcgaa 421 ggacgtcatc ctttccctga tttccctcga ttacgacgat actttgatgg ctgccgccgg 481 aacccaggcg gaagaagtct ttgaagacat catcacgcaa tacaatggca aatatatcct 541 cgcagtagaa ggtaatccgc cgctgggcga gcaggggatg ttctgtatca gcagcggtcg 601 accgtttatt gagaaactca aacgtgccgc tgccggagcc agcgcgatta tcgcctgggg 661 aacctgcgcg tcctggggct gcgtgcaggc cgcgcgaccc aatccgacgc aggcaacgcc 721 tatcgacaaa gtcatcaccg acaaacccat tatcaaagta cctggctgcc cgccgatccc 781 ggatgtgatg agcgccatca ttacttacat ggtgaccttt gatcgcttgc cagatgtcga 841 cagaatgggc cgtccgctga tgttctatgg tcagcgaatc cacgataaat gctatcgccg 901 cgcccacttc gacgccggag agttcgtcca gagttgggat gatgacgctg cccgcaaagg 961 ttactgcctg tacaaaatgg gctgcaaagg gcctaccacc tataacgcct gttcctccac 1021 acgctggaat gatggcgttt ctttcccaat ccagtctggt cacggctgcc tgggctgtgc 1081 ggaaaatggt ttctgggatc gcggttcgtt ctacagccgc gtggtcgata ttccgcaaat 1141 gggtactcat tccaccgccg ataccgtcgg tttaaccgcg cttggcgtgg tggcagcggc 1201 tgttggtgtg cacgcagtcg ccagcgccgt tgaccagcgc agacgtcata accagcaacc 1261 tacagaaacc gaacatcagc caggcaatga ggataaacag gcatgagcac tcagtacgaa 1321 actcagggat acaccatcaa taatgccgga cgccgcctgg tggtcgaccc gattacgcgc 1381 atcgaaggcc acatgcgctg cgaagtgaat attaacgatc agaatgtgat caccaatgcc 1441 gtctcctgcg gcaccatctt tcgcgggctg gagatcatcc tacaagggcg cgacccgcgc 1501 gatgcgcggg cgttcgttga acgtatctgc ggcgtctgta ctggcgtaca cgccctggct 1561 tcggtttacg ccatcgaaga tgctatcggt attaaagtgc cggacaacgc caatatcatc 1621 cgcaacatta tgctggcaac gctctggtgc cacgatcatc tggtgcactt ctatcagctt 1681 gccgggatgg actggatcga tgtgttagat gcgctgaaag ccgacccgcg gaaaacctcc 1741 gaactggcgc aaagtctctc ctcttggccg aaatcatccc ctggctattt cttcgacgta 1801 caaaaccgcc tgaaaaaatt tgttgaaggc gggcagttgg ggatcttccg caatggctac 1861 tgggggcacc cgcagtacaa actgccgcca gaagctaacc tgatgggctt tgcccactat 1921 ctcgaagctc tcgatttcca gcgtgaaatt gtcaaaatcc acgcggtctt tggcggtaaa 1981 aacccgcatc caaactggat tgtcggcggg atgccttgcg ccatcaacat tgacgaaagc 2041 ggcgcggtcg gggcagtcaa tatggaacgc ctgaacctgg tgcagtcaat tatcacccgc 2101 acggcggact tcattaacaa cgtgatgatc cccgacgcct tagccatcgg tcagttcaac 2161 aaaccgtgga gcgaaatcgg cactggtctt tctgataaat gcgttctcag ctacggcgca 2221 ttcccggata ttgccaacga ctttggcgag aaaagtctgc tgatgcctgg cggcgcggtg 2281 attaacggcg acttcaacaa tgtgctgcca gtggatttgg ttgatccgca gcaggtgcag 2341 gagtttgtcg accacgcctg gtatcgatat cccaacgatc aggtcgggcg tcatccgttc 2401 gatggcatca ccgacccgtg gtacaacccc ggcgatgtca aaggcagcga taccaacatt 2461 cagcagctga atgaacagga acgctactcg tggatcaaag cgccacgctg gcgcggtaac 2521 gcgatggaag tggggccgct ggcgcgcacg ttaatcgctt atcacaaagg cgatgctgcg 2581 accgttgagt cggtcgatcg catgatgtcg gcgttgaacc tgccgctttc cggtatccag 2641 tcaacgttag gccgcatttt gtgccgcgcg cacgaagcgc agtgggccgc aggtaagttg 2701 cagtatttct tcgacaagct gatgaccaac ctgaaaaacg gcaatctcgc cactgcttcc 2761 acggaaaaat gggaacctgc aacctggccg acagagtgcc gtggtgtcgg ttttaccgaa 2821 gcgccgcgcg gggcgttagg ccactgggcc gccattcgcg atggcaagat tgatctctac 2881 cagtgcgtgg tgccgaccac ctggaacgcc agcccgcgcg atcccaaagg gcagattggc 2941 gcttatgaag cggcgctgat gaacaccaaa atggcgatcc ccgagcaacc gctggagatc 3001 ctgcgtactc tgcacagctt tgacccgtgc ctcgcctgtt caacacacgt gctgggcgac 3061 gacggtagcg agctgatctc cgtgcaggtg cgttaacagc gaaggagaat catcatgcaa 3121 cagaaaagcg acaacgttgt cagccactat gtctttgaag cgccagtgcg catctggcac 3181 tggttgacgg tgttatgcat ggcggtgttg atggtcaccg gatactttat cggcaagccg 3241 ctaccttccg tcagcggcga ggcgacgtat ctgttctata tgggctacat caggttaatt 3301 cacttcagcg ccgggatggt ttttaccgtg gttttgctga tgcggatcta ctgggctttt 3361 gttggcaatc gatactcccg cgagctgttt atcgtgccgg tatggcgtaa aagctggtgg 3421 cagggcgtgt ggtatgaaat ccgctggtat ctgtttctgg caaaacgtcc gagtgccgat 3481 ataggccata atcccatcgc ccaggcggcg atgttcggct atttcctgat gtcggtcttt 3541 atgatcatca ctggttttgc gctgtacagc gaacacagcc agtacgctat ttttgcgccg 3601 ttccgttatg tggtggaatt tttctactgg acgggtggca actcaatgga cattcacagc 3661 tggcatcggc tggggatgtg gctgattggc gcgtttgtga tcggtcatgt ctacatggcg 3721 ctgcgtgaag acatcatgtc cgacgacacg gtgatctcca ccatggtcaa cggctaccgt 3781 agccacaaat ttggcaaaat aagtaacaag gagcgttcat gagcgagcaa cgcgtggtgg 3841 tcatggggct gggcaacctg ctgtgggccg atgaaggctt cggcgtgcgg gtggcggaac 3901 ggctgtatgc ccattaccac tggcccgagt atgtggagat tgtcgatggc ggtactcagg 3961 gactgaactt gctggggtat gtcgaaagcg ccagccatct gttgattctc gatgccattg 4021 actacgggct ggaacctgga acgctgcgaa cctatgccgg agaacgcatt ccggcttatc 4081 tcagcgcgaa gaaaatgagc ctgcatcaga acagtttctc cgaagtgttg gcgctggcgg 4141 atatccgcgg acatctgcca gcacatattg ccctcgtcgg tctgcaaccc gcaatgctcg 4201 acgactacgg cggtagcctg agcgaactgg cacgggagca actgcccgct gcggaacagg 4261 cggcgctggc gcagcttgct gcgtggggaa ttgtgccgca accggctaat gaatcgcgct 4321 gtctcaatta tgactgtctg tcgatggaaa attacgaagg cgttcgcttg cgccagtacc 4381 ggatgacaca ggaggagcag ggatgagcaa cgacacgcca tttgatgcgt tgtggcaacg 4441 aatgctggcg cgcggctgga cgccagtcag tgaatcccgt cttgacgact ggcttacgca 4501 agcgccagac ggcgtggtgt tattaagcag tgacccgaaa cgcacgccag aggtcagcga 4561 taatccggta atgattggcg aattactgcg cgagtttccc gactatacat ggcaggtggc 4621 gattgctgac cttgagcaga gcgaagccat cggcgatcgc tttggcgtct ttcgctttcc 4681 tgccacttta gtgtttaccg gcggaaacta tcgcggcgtg ctgaatggta ttcacccgtg 4741 ggcggaactg ataaacctga tgcgcgggct tgtcgaaccg cagcaggagc gtgcctcatg 4801 agcgaaactt ttttccatct gctggggcca ggaacgcaac cgaacgatga cagtttcagc 4861 atgaatccac tgccgatcac ctgtcaggtg aatgatgaac cgagtatggc ggccctggag 4921 caatgtgctc acagcccgca ggtgattgcg ctgttaaacg agttacaaca tcaactaagc 4981 gaacgccaac cgccgttggg cgaggtgctg gcagtcgatc tgttaaatct caacgccgac 5041 gatcgtcact ttatcaatac gcttctcggg gaaggggaag tgtcagtgcg cattcagcag 5101 gctgacgaca gtgaaagtga aatacaggag gcgatcttct gcggattatg gcgggtgcgc 5161 agacgtcgcg gcgaaaagtt gctggaggac aaactggagg ctggctgcgc gccgctggcg 5221 ttgtggcagg cggcaacgca aaatctcttg ccgacagatt cgctgttacc gccgcccatt 5281 gatggcctga tgaatggcct accgttggcg catgagttac tggcacatgt acgtaacccc 5341 gacgcgcagc cgcacagcat taatctgacg caattaccca tcagcgaggc tgatcggctt 5401 tttctctcac gtctctgtgg gccgggaaat attcagattc gtaccattgg ctatggcgag 5461 agctatatca acgccacggg gttacgccat gtctggcatt tacgctgtac ggacacctta 5521 aaaggcccgt tactggaaag ttatgaaatc tgcccaatac cggaagtggt gctggcagcg 5581 ccagaagatt tggtcgactc tgcgcagcgg cttagcgagg tatgtcagtg gctggcggaa 5641 gctgcaccga cgtaaaaaga cggtaagtat cgctttcagt cttatgaata tcgcaatcgg 5701 cgaatacctc tggtcgtaga gtttcaggat aaagagggag atctaccatt atcgggttat 5761 ttttctctct tcgcctacag gagtgcgcat gtgggatgtc attgatttat cgcgctggca 5821 gtttgctctg accgcgctgt atcacttttt atttgtaccc cttaccctgg gctgattttt 5881 ttgctggcta ttatggaaac catttacgtg gtcaccggca aaaacaatct accggcaaaa 5941 caatctaccg cgatatgacg cgcttctggg gtaagctctt cggtatcaat tttgctcttg 6001 gcgtggctac ccggcctaac cat // LOCUS HUMFDXC 1174 bp ss-mRNA PRI 17-JUL-1990 DEFINITION Human ferredoxin mRNA, complete cds. ACCESSION M34788 KEYWORDS ferredoxin. SOURCE Human, cDNA to mRNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 1174) AUTHORS Chang,C.-Y., Wu,D.-A., Mohandas,T.K. and Chung,B.-C. TITLE Structure, sequence, chromosomal location, and evolution of the human ferredoxin gene family JOURNAL DNA Cell Biol. 9, 205-212 (1990) STANDARD simple staff_review FEATURES from to/span description pept 19 573 ferredoxin BASE COUNT 337 a 211 c 267 g 359 t ORIGIN Chromosome 11q13-qter. 1 agttcccgac cgcgggcgat ggctgccgct gggggcgccc ggctgctgcg cgccgcttct 61 gctgtcctcg gcggcccggc cggccggtgg ctgcaccacg ctgggtcccg cgctggatcc 121 agcggcctgc tgaggaaccg ggggccgggc ggtagcgcgg aggcgagccg gtcgctgagc 181 gtgtcggcgc gggcccggag cagctcagaa gataaaataa cagtccactt tataaaccgt 241 gatggtgaaa cattaacaac caaaggaaaa gttggtgatt ctctgctaga tgttgtggtt 301 gaaaataatc tagatattga tggctttggt gcatgtgagg gaaccctggc ttgttcaacc 361 tgtcacctca tctttgaaga tcacatatat gagaagttag atgcaatcac tgatgaggag 421 aatgacatgc tcgatctggc atatggacta acagacagat cacggttggg ctgccaaatc 481 tgtttgacaa aatctatgga caatatgact gttcgagtgc ctgaaacagt ggctgatgcc 541 agacaatcca ttgatgtggg caagacctcc tgaactagaa caaataggaa tattttcatg 601 gaattttacc tatttttata attattattt cttaaagtga ttaaatgaga acatggatga 661 gtggacttca tattatgact agctttacta ttttaattca ccttgcataa ctactgaatt 721 ttgtcattct tgaaagtatg caatttttat tttggttata ttacaaaaat gtcaatcaaa 781 tattaaaaaa tagttaatgt gatagaaaaa ccttacatat ttttttctta tgtttgttta 841 gcgactttag caaaatgttt tcatataatc tcatctgttt acctagaaga taggttaagg 901 aaatatatta ttattcctgt ttgatgtggg tgaaggcaga gatctaacct ggcttgttta 961 gggccatacc actaattaga aaatctgtgc tagaacctgt gtcttattcc tataagctat 1021 gtgttcagac tgaaactgga gaaattatga ctattttatt tatagtagta gttaaatctg 1081 aatgtgtatg gacaaaaata tttaattgct gagtaaactg cttaacttca aagatagtta 1141 ttgaccttat aaataaatat ttcaaaattt tgat // LOCUS HUMFDXPSA 676 bp ds-DNA PRI 17-JUL-1990 DEFINITION Human ferredoxin pseudogene, complete cds. ACCESSION M34787 KEYWORDS ferredoxin; pseudogene. SOURCE Human DNA, clone h2. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 676) AUTHORS Chang,C.-Y., Wu,D.-A., Mohandas,T.K. and Chung,B.-C. TITLE Structure, sequence, chromosomal location, and evolution of the human ferredoxin gene family JOURNAL DNA Cell Biol. 9, 205-212 (1990) STANDARD simple staff_review FEATURES from to/span description pept.ps 19 569 pseudo-ferredoxin BASE COUNT 171 a 149 c 192 g 164 t ORIGIN Chromosome 20. 1 agttcccgac cgcgggcgat ggctgccgct gggggcgccc ggctgctgcg cgccgcttct 61 gctgtcctcg gcgacccggc cggccggtgg ctgcaccacg ccgggtcccg cgctggagcc 121 agcggcctgc tgaggagccg gggaccgggc cggagcgcgg aggcaagccg gccgctgagc 181 gtgtcggcgg gggcgcggag cagctcagaa gataaagcga cagtccactt tataaactgt 241 gatggtgaaa cattaacaac caaaggaaaa gttggtgatt ctctgctaga cgttgtggtt 301 gaaaataatc cagatattga tggctttggt gcatgtgagg gaactctaac ttgtttaacc 361 tgtcatctca tctttgaaga tcacatatat gagaagttag atgcaatcac tgatgaggag 421 aatcacatgc tcgatctggc atatggacta acagatcaca gttgggctgc caaatctgtt 481 tgacaaaatc tatggacaat atgactgttc gagtgcctga aacagtggct gatgccagac 541 aatccattga tgtgggcaag acctcctgaa ctagaacaaa taggaatatt ttcatggaat 601 tttacctatt tttataatta ttatttaagt gataaaatga gtacatggat gagtggattc 661 tgtattatga ctagct // LOCUS HUMFDXPSB 976 bp ds-DNA PRI 17-JUL-1990 DEFINITION Human ferredoxin pseudogene, complete cds. ACCESSION M34789 KEYWORDS ferredoxin; pseudogene. SOURCE Human DNA, clone h3. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 976) AUTHORS Chang,C.-Y., Wu,D.-A., Mohandas,T.K. and Chung,B.-C. TITLE Structure, sequence, chromosomal location, and evolution of the human ferredoxin gene family JOURNAL DNA Cell Biol. 9, 205-212 (1990) STANDARD simple staff_review FEATURES from to/span description pept.ps < 1 386 pseudo-ferredoxin BASE COUNT 340 a 156 c 168 g 312 t ORIGIN Chromosome 21. 1 ttcagtgaag ggcccagagc agctcggaag ataaaataac agtccacttt ataaaccaac 61 agtgaaacat taacaaacaa aggaaaagtt gatgactctt tgctagatgt tgtggttgaa 121 aataatgtag atattgatgg gtttgttgca tggcagggaa ccttggctct gcttgtcacc 181 tcatttttga aaagcacata tttgagaagt tagacacaat cactgacaag gagattgaca 241 tgcttgatct ggcatatgga ctgacagaca ggtcagagtt gtattgccaa atctgtttga 301 caaaatctat gggccatatg actgtctgag tacctgatgg agtggccagt gccagacaat 361 ccactgatat gggcaagacc acctaagcag aataaatagc aatgctttca ctaaatttta 421 cctattttat aattatttct tatataatta aataaaaaca tacatgaatg aacttattat 481 tatgaccaac tttactgttt taattcacct tgtataacta ctgaatttta tagttctgaa 541 aatacgcaat ttttatattt gttacattag aaaaaggtca gttaaatatt agaaaataat 601 taatatgaca ataccttaca aattttgcct tacgtttgtt tagcaacttt tagcaaaatg 661 ttttcacatg gccttatgtc tgcttaccta gaaagtaaat tttttaaaaa agatattatt 721 atccctgttt tatgtgagtg aaggcaaaga cctaaaatgg cttgttaagg gccatccaac 781 taattagaaa atatgtacac acctgcatct tgttactata aggtatatgt tagactgaaa 841 ctggagaaat tatgactatc ttacagcagc aattagatct gactgcatat ggacaaaaac 901 attaattgct cagtgaactg cttaactgac agttattaac cttacacata aagatttcaa 961 aattttgatt cagtga // LOCUS HUMFRDXA1 350 bp ds-DNA PRI 17-JUL-1990 DEFINITION Human ferredoxin gene, exon 3. ACCESSION M34783 KEYWORDS ferredoxin. SEGMENT 1 of 2 SOURCE Human DNA, clone h4. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 350) AUTHORS Chang,C.-Y., Wu,D.-A., Mohandas,T.K. and Chung,B.-C. TITLE Structure, sequence, chromosomal location, and evolution of the human ferredoxin gene family JOURNAL DNA Cell Biol. 9, 205-212 (1990) STANDARD simple staff_review FEATURES from to/span description pept / 45 + 174 ferredoxin, exon 3 IVS < 1 44 ferredoxin intron B IVS 175 > 350 ferredoxin intron C BASE COUNT 99 a 56 c 78 g 117 t ORIGIN Chromosome 11q13-qter. 1 ctgatgtaga agggactatg ttcagtgttt gttgcttttg tcaggtgcat gtgagggaac 61 cctggcttgt tcaacctgtc acctcatctt tgaagatcac atatatgaga agttagatgc 121 aatcactgat gaggagaatg acatgctcga tctggcatat ggactaacag acaggtaaga 181 tttttggact gcttcaattg taataataat ctgggaacat agatgttgta ttatgttgtc 241 tatgtaagac cagacccata aatgtaatag tgttctacca ggttaaataa cagtcacaga 301 ttttgagtat cagatatgtg gcatttctct ttgttttcac tgttctacag // LOCUS HUMFRDXA2 1260 bp ds-DNA PRI 17-JUL-1990 DEFINITION Human ferredoxin gene, exon 4. ACCESSION M34785 KEYWORDS ferredoxin. SEGMENT 2 of 2 SOURCE Human DNA, clone h4. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 1260) AUTHORS Chang,C.-Y., Wu,D.-A., Mohandas,T.K. and Chung,B.-C. TITLE Structure, sequence, chromosomal location, and evolution of the human ferredoxin gene family JOURNAL DNA Cell Biol. 9, 205-212 (1990) STANDARD simple staff_review FEATURES from to/span description pept + 33 147 ferredoxin, exon 4 IVS < 1 32 ferredoxin intron C BASE COUNT 410 a 196 c 241 g 413 t ORIGIN About 5.3 kb after segment 1; chromosome 11q13-qter. 1 actaaaccat accttccccc ttttccatac agatcacggt tgggctgcca aatctgtttg 61 acaaaatcta tggacaatat gactgttcga gtgcctgaaa cagtggctga tgccagacaa 121 tccattgatg tgggcaagac ctcctgaact agaacaaata ggaatatttt catggaattt 181 tacctatttt tataattatt atttcttaaa gtgattaaat gagaacatgg atgagtggac 241 ttcatattat gactagcttt actattttaa ttcaccttgc ataactactg aattttgtca 301 ttcttgaaag tatgcaattt ttattttggt tatattacaa aaatgtcaat caaatattaa 361 aaaatagtta atgtgataga aaaaccttac atattttttt cttatgtttg tttagcgact 421 ttagcaaaat gttttcatat aatctcatct gtttacctag aagataggtt aaggaaatat 481 attattattc ctgtttgatg tgggtgaagg cagagatcta acctggcttg tttagggcca 541 taccactaat tagaaaatct gtactagaac ctgtgtctta ttcctacaag ctatgtgttc 601 agactgaaac tggagaaatt atgactattt tatttatagt agtagttaaa tctgaatgtg 661 tatggacaaa aatatttaat tgctcagtaa actgcttaac ttcaaagata gttattaacc 721 ttataaataa atatttcaaa attttgattc ggaagactaa gtctggacgt agacattata 781 atgctatcaa agaagtttga tctctgtttt gactaaacta gaggaaaaat gattggatgt 841 gtttattctt ttctaagcag aatggtttaa ctttgtactc tttgaaaaat aatgctgatt 901 tataaatctc tgcctataac agaatggaaa ccttatgaat gaattgtgtt tctctgtcct 961 gagctggaga agggaatgag caggctgaca cgttgcacag ccccaggtgg cgccattctc 1021 tcacgcaagg atggggctgc agggtgagca gcgtgggctg cagtgtgtca gtcccaggag 1081 tgagggagtg gcaagcacca cagattacca cgtatgtgtg gaagacattc gtacccttat 1141 ctttactata aataaattca taaaagttaa caaaggggta cacagtatgg tctttggaaa 1201 tataataaaa catcaactaa cttggactaa ttgtgaggaa gagcagaaca aattagtaga // LOCUS HUMFRDXB1 350 bp ds-DNA PRI 17-JUL-1990 DEFINITION Human ferredoxin gene, exon 3. ACCESSION M34784 KEYWORDS ferredoxin. SEGMENT 1 of 2 SOURCE Human DNA, clone h1. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 350) AUTHORS Chang,C.-Y., Wu,D.-A., Mohandas,T.K. and Chung,B.-C. TITLE Structure, sequence, chromosomal location, and evolution of the human ferredoxin gene family JOURNAL DNA Cell Biol. 9, 205-212 (1990) STANDARD simple staff_review FEATURES from to/span description pept / 45 + 174 ferredoxin, exon 3 IVS < 1 44 ferredoxin intron B IVS 175 > 350 ferredoxin intron C BASE COUNT 99 a 56 c 78 g 117 t ORIGIN Chromosome 11q13-qter. 1 ctgatgtaga agggactatg ttcagtgttt gttgcttttg tcaggtgcat gtgagggaac 61 cctggcttgt tcaacctgtc acctcatctt tgaagatcac atatatgaga agttagatgc 121 aatcactgat gaggagaatg acatgctcga tctggcatat ggactaacag acaggtaaga 181 tttttggact gcttcaattg taataataat ctgggaacat agatgttgta ttatgttgtc 241 tatgtaagac cagacccata aatgtaatag tgttctacca ggttaaataa cagtcacaga 301 ttttgagtat cagatatgtg gcatttctct ttgttttcac tgttctacag // LOCUS HUMFRDXB2 1269 bp ds-DNA PRI 17-JUL-1990 DEFINITION Human ferredoxin gene, exon 4. ACCESSION M34786 KEYWORDS ferredoxin. SEGMENT 2 of 2 SOURCE Human DNA, clone h1. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 1269) AUTHORS Chang,C.-Y., Wu,D.-A., Mohandas,T.K. and Chung,B.-C. TITLE Structure, sequence, chromosomal location, and evolution of the human ferredoxin gene family JOURNAL DNA Cell Biol. 9, 205-212 (1990) STANDARD simple staff_review FEATURES from to/span description pept + 32 146 ferredoxin, exon 4 IVS < 1 31 ferredoxin intron C BASE COUNT 407 a 197 c 246 g 419 t ORIGIN About 5.3 kb after segment 1; chromosome 11q13-qter. 1 actaaaccat accttcccct tttccataca gatcacggtt gggctgccaa atctgtttga 61 caaaatctat ggacaatatg actgttcgag tgcctgaaac agtggctgat gccagacaat 121 ccattgatgt gggcaagacc tcctgaacta gaacaaatag gaatattttc atggaatttt 181 acctattttt ataattatta tttcttaaag tgattaaatg agaacatgga tgagtggact 241 tcatattatg actagcttta ctattttaat tcaccttgca taactactga attttgtcat 301 tcttgaaagt atgcaatttt tattttggtt atattacaaa aatgtcaatc aaatattaaa 361 aaatagttaa tgtgatagaa aaaccttaca tatttttttc ttatgtttgt ttagcgactt 421 tagcaaaatg ttttcatata atctcatctg tttacctaga agataggtta aggaaatata 481 ttattattcc tgtttgatgt gggtgaaggc agagatctaa cctggcttgt ttagggccat 541 accactaatt agaaaatctg tgctagaacc tgtgtcttat tcctataagc tatgtgttca 601 gactgaaact ggagaaatta tgactatttt atttatagta gtagttaaat ctgaatgtgt 661 atggacaaaa atatttaatt gctgagtaaa ctgcttaact tcaaagatag ttattgacct 721 tataaataaa tatttcaaaa ttttgattcg gaagactaag tctggacgta gacattataa 781 tgctatcaaa gaagtttgat ctctgttttg actaaactag aggaaaaatg attggatgtg 841 tttattcttt tctaagcaga atggtttaac tttgtactct ttgaaaaata atgctgattt 901 ataaatctct gcctataaca gaatggaaac cttatgaatt gtttgtgttt ctctgtcctg 961 agctcggtac ccctggagaa gggaatgagc aggctgacac gttgcacagc cccaggtggc 1021 gccattctct cacgcaagga tggggctgca gggtgagcag cgtgggctgc agtgtgtcag 1081 tcccaggagt gagggagtgg caagcaccac agattaccac gtatgtgtgg aagacattcg 1141 tactcttatc tttactataa ataaattcat aaaagttaac aaaggggtac acagtatggt 1201 ctttggaaat ataataaaac atcaactaac ttggactaat tgtgaggaag agcagaacaa 1261 attagtaga // LOCUS RATC1H4SY 3146 bp ss-mRNA ROD 17-JUL-1990 DEFINITION Rat cytoplasmic C-1-tetrahydrofolate synthase mRNA, complete cds. ACCESSION J05519 KEYWORDS C-1-tetrahydrofolate synthase. SOURCE Rat adult liver, cDNA to mRNA. ORGANISM Rattus norvegicus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 3146) AUTHORS Thigpen,A.E., West,M.G. and Appling,D.R. TITLE Rat C-1-tetrahydrofolate synthase: cDNA isolation, tissue-specific levels of the mRNA, and expression of the protein in yeast JOURNAL J. Biol. Chem. 265, 7907-7913 (1990) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by D.R.Appling, 18-MAY-1990. FEATURES from to/span description pept 121 2928 C-1-tetrahydrofolate synthase mRNA < 1 3146 C-1-tetrahydrofolate synthase mRNA signal 3138 3143 poly-A signal BASE COUNT 845 a 759 c 845 g 697 t ORIGIN 1 cgcactgagt ttccccggga gatccctgag cacgtgggtt gtgctgtccg gtaagcgcac 61 ggagggactg gaacctgaat tttgctggtg ttcttcgtgg gtcgcagact agctaaggcc 121 atggcaccag cgggaatcct gaatggaaaa gtggtctccg cgcaaatcag gaatctactg 181 aagacccagt ttactcagat gcaggagcag gtacctggct tcacccctgg cctggcaatc 241 ctgcaggttg gcgacagaga tgattccaat ctttatataa atgtgaagct gaaggctgct 301 caagagatcg ggatcaaagc cactcacatt aaattaccaa gaacttccac agagtcggag 361 gtgttaaaat acgtcatctc cctgaatgag gatgccactg tgcacggatt catagtacag 421 ctgcctttag actccgagaa ttccattaac acagaggcag tcatcaatgc cattgcccct 481 gagaaggatg tggacgggtt gacgagcatc aatgccggga agcttgccag aggtgatcta 541 aaggactgct tcattccgtg cacacccaaa ggatgcttgg aactcatcaa agagacagga 601 gtgcagattg caggaaggca cgccgtggtg gtcgggcgaa gtaaaatagt tggtgcaccc 661 atgcacgacc tgcttctgtg gaacaatgcc acggtgacca cctgccactc caagactgcc 721 gatctggaca aggaggtaaa taaaggagac attctggtgg ttgcaacagg acagcctgag 781 atggtgaaag gggagtggat caagcctgga gctgtggtca tagactgtgg gatcaattat 841 gttccagatg atacaaaacc aaatggaagg aaagttgtgg gtgacgtagc atatgacgag 901 gccaaggaga aggcgagctt tatcacgcct gtccccggtg gcgtggggcc catgaccgtg 961 gcaatgctga tgcagagcac agtagagagc gcacagcgct tcctgaagaa atttaagcca 1021 gggaagtgga caattcagta taacaagctg aacctcaaga cgcctgtacc aagtgacatt 1081 gctatatcac gatcttgcaa acccaagctc atcggtaacc tggcccgaga aattgggcta 1141 ctcactgagg aggtggaatt gtatggagaa acaaaggcca aggtcttact gtcagcacta 1201 gatcgcctga agcatcagcc agatgggaaa tacgttgtgg tgactggaat tactccaaca 1261 cccctgggag aagggaagag cacaaccacc atcgggcttg tgcaagccct tggcgcccac 1321 ctgcatcaga atgtttttgc gtgtgtgcga cagccttctc agggccccac ctttgggata 1381 aaaggtggcg ctgcaggagg cggctattcc caggtcattc ctatggaaga gtttaatctc 1441 cacctcactg gtgacatcca cgccatcact gccgctaata accttgtggc tgctgctatt 1501 gacgctcgga tatttcatga gctgacccag acagacaagg ctctctttaa tcgtctggta 1561 ccctcagtaa atggagtaag aaagttctct gacatccaaa tccgcaggtt acggaggcta 1621 ggcattgaga agactgaccc cgctgcgctg acagatgatg agataaacag atttgcaaga 1681 ctagatattg atccagaaac cataacatgg cagagagtgc tggatactaa cgatagattc 1741 ctgaggaaga tcaccattgg acaggctcca accgagaaag gccacacacg cacggcccag 1801 tttgatatct ctgtggccag tgaaatcatg gcagtcctgg ccctcactag ttctctggaa 1861 gacatgagag caaggctggg caaaatggtg gtagcgtcca gtaagaaagg ggagcctatc 1921 agctgtgaag acctgggcgt gagcggggcg ctgacggtgc tcatgaagga tgcgatcaag 1981 cccaatctca tgcagacctt agagggcacg ccggtatttg tccatgctgg gccttttgcc 2041 aacattgcac atgggaattc ctccatcatt gcagaccgga ttgcactcaa gctggttggc 2101 cctgagggct ttgtagtgac agaagcagga ttcggagcag acataggaat ggaaaagttc 2161 ttcaacatca agtgccggta ttctggtctc cagcctcatg tggtggttct tgttgccact 2221 gtcagggctc ttaagatgca cgggggtggc cccacggtca ccgctggact gcctcttccc 2281 aaggcttaca cagaagagga cctggacctg gtggaaaagg gcttcagtaa cttgaggaaa 2341 cagatagaaa atgctagaat gtttggagtg cctgtcgtgg tggccatgaa tgcattcaag 2401 acagatacag atactgagct ggacctcatc ggccgcctct ccagagagca tggggctttc 2461 gatgctgtca agtgcaccca ctgggcagaa gggggccagg gagccttagc cctggctcag 2521 gctgtccaga gagcttcaca ggcccccagc agcttccagc tcctctatga cctcaagctc 2581 tcagttgaag ataaaatcag gattattgca cagaagatct acggggccga tgacatcgaa 2641 ttgctccctg aagcacaaaa caaagcagaa atctacacaa agcagggctt tgggaatcta 2701 cccatctgca tggccaaaac acacttgtct ttatctcaca acccagagca aaaaggggtg 2761 cctactggct tcgttctgcc catccgggac atccgggcca gcgttggggc tggtttcctg 2821 taccctttag taggaacgat gagcacaatg cctggactcc ctactcgacc ctgtttttat 2881 gatatcgatt tggaccctga aactgaacaa gtgaatggat tgttttaaac agatcttcca 2941 tctccaagag gctactctgt ctggccagtg tctgttcagg cccactgagg aagtgtgcag 3001 agaagtcttg aaagtctgtg cctgccctga agagcttcag aaatagtgga agtttctcta 3061 aagcctttca tagccttaat tcagatcatg tataaattaa cataaatcat gcctatttac 3121 atagtaaaag tccacagaat aaatga // LOCUS NGOARGF 1260 bp ds-DNA BCT 17-JUL-1990 DEFINITION N.gonorrhoeae ornithine transcarbamoylase (argF) gene, complete cds. ACCESSION M34930 KEYWORDS ornithine transcarbamoylase. SOURCE N.gonorrhoeae (strain CDC 50) DNA. ORGANISM Neisseria gonorrhoeae Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Neisseriaceae. REFERENCE 1 (bases 1 to 1260) AUTHORS Martin,P.R., Cooperider,J.W. and Mulks,M.H. TITLE Sequence of the argF gene encoding ornithine transcarbamoylase from Neisseria gonorrhoeae JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by M.H.Mulks, 06-JUN-1990. Author address [1]: M.H.Hulks Michigan State University Department of Microbiology and Public Health 367 Giltner Hall, East Lansing, MI 43324 e-mail [1]: 21667CFM@MSU.bitnet FEATURES from to/span description pept 141 1136 ornithine transcarbamoylase (argF) (EC 2.1.3.3) binding 128 133 ribosome binding site signal 1172 1204 transcription termination signal site 39 48 DNA uptake sequence site 54 63 DNA uptake sequence site 713 722 DNA uptake sequence site 1175 1184 DNA uptake sequence site 1192 1201 DNA uptake sequence BASE COUNT 350 a 350 c 319 g 241 t ORIGIN 1 tccccgtcat ccgtatggag taagggattg accgcaatgc cgtctgaaca accttcagac 61 ggcattgcaa cattccgcta acccttcttt ccgcaaacgc tgcaaatacg gcgttcacgc 121 cccacataaa ggaaacgaca gtgaacctga aaaaccgcca ttttctgaaa cttttggact 181 tcacgccgga agaaatcacc gcctacctcg accttgccgc cgagttgaaa gacgccaaaa 241 aggcagggcg cgagattcag cggatgaaag ggaaaaacat cgccctgatt tttgaaaaaa 301 catccacgcg cacacgctgt gcgtttgaag tcgccgcacg cgaccaaggc gcggatcgaa 361 cctatctgga accgtccgcc agccaaatcg ggcacaaaga aagcatcaaa gacacggcgc 421 gcgtcttagg cagaatgtac gatgccatcg aatatcgcgg cttcgctcag gaaactgtcg 481 aagaattggc aaaatatgcg ggcgtacccg tgttcaacgg gctgaccaac gagttccatc 541 ccacacaaat gcttgccgac gcactgacta tgcgcgaaca cagcggcaaa cctttgaacc 601 aaaccgcgtt tgcctacgtc ggcgacgcgc gttacaacat gggcaattcc ctgctgattt 661 taggggcaaa attggggatg gacgtgcgta tcggcgcacc tcaaagcctg tggccgtctg 721 aaggcattat cgccgccgca cacgccgccg ccaaagaaac cggtgcaaaa attaccctga 781 ccgaaaacgc gcatgaagcc gtcaaaggtg tcggtttcat tcatactgac gtatgggtca 841 gcatgggcga gccgaaagaa gtctggcagg aacgcatcga tttgctgaaa gattaccgcg 901 ttacgcccga actgatggcg gcatcgggca atccgcaagt caaattcatg cactgcctgc 961 ccgccttcca caaccgcgaa accaaagtcg gcgaatggat ttacgaaacc ttcgggctga 1021 acggtgtgga agttacagaa gaagtattcg aaagtccggc cggcatcgtg ttcgatcagg 1081 cggaaaaccg tatgcacacg attaaagcgg taatggtcgc ggctctgggc gactgacaga 1141 actgtgcctg tttaaattca tccgcaacac agataccgtc tgaacacgat gttcagacgg 1201 tatccatata acaaactgcc tacacgatgt gtaggcagtc ccgtttgaaa acaatcagtt // LOCUS PFACSPA 1218 bp ds-DNA INV 17-JUL-1990 DEFINITION P.falciparum CS gene encoding circumsporozoite protein, complete cds. ACCESSION M22982 J04650 KEYWORDS circumsporozoite protein. SOURCE P.falciparum (strain NF54) DNA, clone 1.3A. ORGANISM Plasmodium falciparum Eukaryota; Animalia; Protozoa; Microspora; Microsporea; Microsporida; Haemosporina; Plasmodiidae. REFERENCE 1 (bases 1 to 1218) AUTHORS Caspers,P., Gentz,R., Matile,H., Pink,J.R. and Sinigaglia,F. TITLE The circumsporozoite protein gene from NF54, a Plasmodium falciparum isolate used in malaria vaccine trials JOURNAL Mol. Biochem. Parasitol. 35, 185-190 (1989) STANDARD full staff_review COMMENT Draft entry and computer-readable copy of sequence [1] kindly provided by P.Caspers, 14-MAR-1989. FEATURES from to/span description pept 1 1218 circumsporozoite protein BASE COUNT 531 a 259 c 168 g 260 t ORIGIN 1 atgatgagaa aattagctat tttatctgtt tcttcctttt tatttgttga ggccttattc 61 caggaatacc agtgctatgg aagttcgtca aacacaaggg ttctaaatga attaaattat 121 gataatgcag gcactaattt atataatgaa ttagaaatga attattatgg gaaacaggaa 181 aattggtata gtcttaaaaa aaatagtaga tcacttggag aaaatgatga tggaaataac 241 gaagacaacg agaaattaag gaaaccaaaa cataaaaaat taaagcaacc agcggatggt 301 aatcctgatc caaatgcaaa cccaaatgta gatcccaatg ccaacccaaa tgtagatcca 361 aatgcaaacc caaatgtaga tccaaatgca aacccaaatg caaacccaaa tgcaaaccca 421 aatgcaaacc caaatgcaaa cccaaatgca aacccaaatg caaacccaaa tgcaaaccca 481 aatgcaaacc caaatgcaaa cccaaatgca aacccaaatg caaacccaaa tgcaaaccca 541 aatgcaaacc caaatgcaaa cccaaatgca aaccccaatg caaatcctaa tgcaaaccca 601 aatgcaaacc caaacgtaga tcctaatgca aatccaaatg caaacccaaa cgcaaacccc 661 aatgcaaatc ctaatgcaaa ccccaatgca aatcctaatg caaatcctaa tgccaatcca 721 aatgcaaatc caaatgcaaa cccaaacgca aaccccaatg caaatcctaa tgccaatcca 781 aatgcaaatc caaatgcaaa cccaaatgca aacccaaatg caaaccccaa tgcaaatcct 841 aataaaaaca atcaaggtaa tggacaaggt cacaatatgc caaatgaccc aaaccgaaat 901 gtagatgaaa atgctaatgc caacagtgct gtaaaaaata ataataacga agaaccaagt 961 gataagcaca taaaagaata tttaaacaaa atacaaaatt ctctttcaac tgaatggtcc 1021 ccatgtagtg taacttgtgg aaatggtatt caagttagaa taaagcctgg ctctgctaat 1081 aaacctaaag acgaattaga ttatgcaaat gatattgaaa aaaaaatttg taaaatggaa 1141 aaatgttcca gtgtgtttaa tgtcgtaaat agttcaatag gattaataat ggtattatcc 1201 ttcttgttcc ttaattag // LOCUS NPA25K 900 bp ds-DNA VRL 17-JUL-1990 DEFINITION Autographa californica nuclear polyhedrosis virus 25K protein gene, complete cds. ACCESSION M22537 M23428 KEYWORDS 25K protein. SOURCE Autographa californica nuclear polyhedrosis virus DNA, clone HindIII-I. ORGANISM Autographa californica nuclear polyhedrosis virus Viridae; ds-DNA enveloped viruses; Baculoviridae; Baculovirus; Nuclear polyhedrosis viruses. REFERENCE 1 (bases 1 to 900) AUTHORS Beames,B. and Summers,M.D. TITLE Location and nucleotide sequence of the 25K protein missing from baculovirus few polyhedra (FP) mutants JOURNAL Virology 168, 344-353 (1989) STANDARD simple staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by B.D.Beames, 10-FEB-1989. The 25K gene product is not needed for replication of the virus in tissue culture. It is believed to be important in the viral occlusion process but this has not yet been experimentally shown. Disruption of the gene leads to few polyhedra (FP) phenotype. FEATURES from to/span description pept 115 753 25K protein recomb 63 64 IFP1.6 insertion site recomb 586 587 IFP2.2 insertion site BASE COUNT 334 a 151 c 165 g 250 t ORIGIN Map position 36.6 to 37.5. 1 aagaaaacat gattacgtct gtacacgaac gcgtattaac gcagagtgca aagtataaga 61 gggttaaaaa atatatttta cgcaccatat acgcatcggg ttgatatagt taatatggat 121 caatttgaac agttgattaa cgtgtctctg ctcaagtcgt tgatcaaaac gcaaatcgac 181 gaaaatgtgt cggacaatat caagtcgatg agcgaaaaac taaaaaggct agaatacgac 241 aatctcacag acagcgttga gatatacggt attcacgaca gcaggctgaa taataaaaaa 301 attagaaact attatttaaa aaagatttgc gctttactcg acctaaactt taaacacgtc 361 atagaatctt cgtttgacaa aaaccacatt gtggccaagc tgtgtgacgc gacgcgcgct 421 aaagaatggc aaaccaagtc gcgcgagcgt cgacttaaaa attttaattt aaacattaat 481 tacgacgggc ccgtaaaaat atttgtggcc gccacagcgg agcaaaagct gttacttaaa 541 aaaactcgcg acgctctgtt gccgttttac aaatacattt ccatttgtaa aaacggtgtt 601 atggtgagac gcgacgaaaa gagtcgcgtg tttattgtta aaaatgagca aaatatagaa 661 tatctcaaag ccaacaaata ttacgctttt catagcgata gcgtcgataa ttttgagtct 721 gaaaacgata gcgaaaaaat gcttcaaatt taattaaatt taattaataa ttaattttag 781 tatcaagttg aatctcgtaa tgtaagaaaa aaaacactaa tattggttgt gattcttgcc 841 agacgtccat tttatgtgat aaataaaaca caaacacaaa gtttttatat tgtcttttat // LOCUS SHPHSKB3C 413 bp ds-DNA MAM 17-JUL-1990 DEFINITION Sheep BIIIB4 high sulfur keratin gene, 3' end. ACCESSION M21101 J03062 KEYWORDS BIIIB high-sulphur wool keratin. SOURCE Sheep (Dorset horn strain) DNA, clone pSWK31. ORGANISM Ovis aries Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Artiodactyla; Ruminantia; Pecora; Bovidae. REFERENCE 1 (bases 1 to 413) AUTHORS Frenkel,M.J., Powell,B.C., Ward,K.A., Sleigh,M.J. and Rogers,G.E. TITLE The keratin BIIIB gene family: Isolation of cDNA clones and structure of a gene and a related pseudogene JOURNAL Genomics 4, 182-191 (1989) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence [1] kindly submitted by M.J.Frenkel, 24-OCT-1989. FEATURES from to/span description pept < 1 51 BIIIB4 high-sulfur keratin (AA at 1) BASE COUNT 112 a 99 c 84 g 118 t ORIGIN 15 bp upstream of PvuII site. 1 actacctaca ctcagcccag ctgtgagccc tgcatcccaa gctgctgctg accgacggct 61 acctcaccca gtgcctgaca gagtcaaccc agaagcttta gtgctcacct gtctcagtac 121 ctgcaactaa ttatgtctcc gctttcaaag ttggaacaag gcattatcac agacaaccct 181 cacaaaaaac aaaccaagag actttcgatg gccgtgtagt ggacatcagt gaacaaggac 241 agctggagta ggtagatgcc tacaggtttc ccagagttgt tcagttcgtt agtgttaaat 301 tgtatctttc tttggtgctt tgggaaattc gtttccagtc ttgaatggta tctttctgga 361 aattaaggaa cttcttcatg attattcaat aaagtttaca tctctggtat aat // LOCUS SHPHSKB3D 1080 bp ss-mRNA MAM 17-JUL-1990 DEFINITION Sheep BIIIB3 high-sulfur keratin pseudogene mRNA. ACCESSION M21102 J03062 KEYWORDS BIIIB high-sulphur wool keratin. SOURCE Sheep (Dorset horn strain), cDNA to mRNA, clone psi-BIIIB3A. ORGANISM Ovis aries Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Artiodactyla; Ruminantia; Pecora; Bovidae. REFERENCE 1 (bases 1 to 1080) AUTHORS Frenkel,M.J., Powell,B.C., Ward,K.A., Sleigh,M.J. and Rogers,G.E. TITLE The keratin BIIIB gene family: Isolation of cDNA clones and structure of a gene and a related pseudogene JOURNAL Genomics 4, 182-191 (1989) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence [1] kindly submitted by M.J.Frenkel, 24-OCT-1989. FEATURES from to/span description pept.ps / 191 459 BIIIB3 high-sulfur keratin BASE COUNT 310 a 226 c 195 g 349 t ORIGIN 74 bp upstream of AvaII site. 1 aaatgtcaaa ttgattataa aaattttagt ggaaaacggt ggcataaata aaaaagagag 61 gaaaactttt ggtggaccaa cacctgatga agcggatata taaagagccc caaagtgaga 121 ggagacattt cgtcctggat agttcgtctt tcactccaag atgccaaaag aaaccaaggt 181 tcccggtgcc acggcttgct gagctcccct ctgctgcagt gttcccacca gccccgccac 241 tactatctgc tcctctgaca aattctgggc atgagtctgc ctgcccagct cctgctcaca 301 tatggtttgg ttactggaga caacctgctg tgacagctgc ccaaagcctg cacattcctc 361 aaccctatgt gccaacctgc ttcctgctca actctaccca gcccaccctg gaccaggaaa 421 ccatcagcct cacagccctg tatcccaagc tgctgctaac tgatgtttga ctcactcagt 481 gcctggcaat aacacagaag ctatctgttt ggcattcact tgctcagtat tttatcacat 541 attgaggtag acccaatggc atagatatgg aaggcctact ttttatctta atggaaagaa 601 aataaataaa ctttatgctt atttggctaa caattttttc atttgggcag gtgaatgtca 661 tctatgtcaa aatagtatta aagtctataa gacttcagac cctattttat tggtcatatt 721 gcttcctgga ccccatttat tgtattcggg attttcatag aggagaaata attttgatgg 781 ttttcctaat aagctatttc tgtggcaaaa aagagaaaga gatgtgaaat tttgttgagt 841 gctttttctg tatctatgga agtaattatt tagttttttt tatttaatct tttacatttt 901 aaagtttata actatgttca ttttgattga ttttaaaaat gttaaaccaa ccttcttttc 961 ctacgaaacc cagattgata taatgtattt tccctttcat acatgattga ctttgatttg 1021 ctacattaaa aacatttatt tatttggcta cactggatct tagttgcagc agatgagatc // LOCUS SHPHSKB3E 1047 bp ss-mRNA MAM 17-JUL-1990 DEFINITION Sheep BIIIB4 high-sulfur keratin mRNA, complete cds. ACCESSION M21103 J03062 KEYWORDS BIIIB high-sulphur wool keratin. SOURCE Sheep (Dorset horn strain), cDNA to mRNA, clone BIIIB4. ORGANISM Ovis aries Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Artiodactyla; Ruminantia; Pecora; Bovidae. REFERENCE 1 (bases 1 to 1047) AUTHORS Frenkel,M.J., Powell,B.C., Ward,K.A., Sleigh,M.J. and Rogers,G.E. TITLE The keratin BIIIB gene family: Isolation of cDNA clones and structure of a gene and a related pseudogene JOURNAL Genomics 4, 182-191 (1989) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence [1] kindly submitted by M.J.Frenkel, 24-OCT-1989. FEATURES from to/span description pept 289 585 BIIIB4 high-sulfur keratin BASE COUNT 298 a 294 c 193 g 262 t ORIGIN 1 bp upstream of HindIII site. 1 aagcttgtgt tccagtgaca aatagcaaga gtaatttgtt cagatttagc agtgaactca 61 aacaaaggtt aattaggaaa ataaagactc tttacaagca tctaaatgtc aaaccagtca 121 aaggataatg ggtacactaa cataaacaca caggaaagaa gccttccggt tggaccaaca 181 ccactggaga gggtatataa gagcctcaga gcagaaagag agattctcac gccaggagac 241 tcgtcttctt taaaaccaaa tcaataaaaa acacagcttc ccaacaccat ggcttgctgt 301 gcccgcctct gctgcagcgt ccccaccacc cccgccacca ccatctgctc ctctgacaaa 361 ttctgcagat gtggagtctg tctgcccagc acctgcccac acacagtctg gttcctggag 421 ccaacctgct gtgacaaccg ccccccacct tgccacattc ctcagccctc tgtgcccacc 481 tgcttcctgc tcaactcttc ccagcccacc ccaggcctgg aaagcatcaa cctcacaacc 541 tacactcagc ccagctgtga gccctgcatc ccaagctgct gctgaccgac ggctgcctca 601 cccactgcct gacagagtca acccagaagc tttagtgctc acctgtctca gtacctgcaa 661 ctaattatgt ctccgctttc aaagttggaa caaggcatta tcacagacaa ccctcacaaa 721 aaacaaacca agagactttc gatggccgtg tagtggacat cagtgaacaa ggacagctgg 781 agtaggtaga tgcctacagg tttcccagcg ttgttcagtt ccttcgtgtt aaattgtatc 841 tttctttggt gctttgggaa ttctgtttcc agtcttgaat cgtatctttc tggaaattga 901 ggagcttctt catgattatt ctaataaagt ttccatctct ggcataacat aaatgtctat 961 aggtatttcc atttattttt gttaacacat caaatctata tcttaactcc aaaggacacc 1021 attttaacaa actgagaaca tctgcag // LOCUS HUMRARG2 250 bp ss-mRNA PRI 17-JUL-1990 DEFINITION Human retinoic acid receptor gamma 2 mRNA, 5' end. ACCESSION M32074 KEYWORDS retinoic acid receptor gamma. SOURCE Human fetal skin, cDNA to mRNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 250) AUTHORS Kastner,P., Krust,A., Mendelsohn,C., Garnier,J.M., Zelent,A., Leroy,P., Staub,A. and Chambon,P. TITLE Murine isoforms of retinoic acid receptor gamma with specific pattern of expression JOURNAL Proc. Natl. Acad. Sci. U.S.A. 87, 2700-2704 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence [1] kindly submitted by Chambon,P., 13-FEB-1990, for release after publication. FEATURES from to/span description pept 62 > 250 retinoic acid receptor gamma 2 mRNA 1 > 250 retinoic acid receptor gamma 2 mRNA BASE COUNT 38 a 77 c 87 g 48 t ORIGIN Chromosome 12. 1 ggggaatcgg ctgcgggtcc ttggtgtttc cagcacccag tttcccttaa gccgggtcgc 61 gatgtacgac tgtatggaaa cgtttgcccc gggtccgcga cggctgtacg gggcggccgg 121 gcccggggcc ggcttgctgc gcagagccac cggcggctcc tgtttcgccg gacttgaatc 181 ttttgcctgg ccgcaacccg ccagcctgca atcggtggag acacagagca ccagctcaga 241 ggagatggtg // LOCUS MUSRARG1 620 bp ss-mRNA ROD 17-JUL-1990 DEFINITION Mouse retinoic acid receptor gamma 1 mRNA, 5' end. ACCESSION M32068 KEYWORDS retinoic acid receptor gamma. SOURCE Mouse F9 cell, cDNA to mRNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 620) AUTHORS Kastner,P., Krust,A., Mendelsohn,C., Garnier,J.M., Zelent,A., Leroy,P., Staub,A. and Chambon,P. TITLE Murine isoforms of retinoic acid receptor gamma with specific pattern of expression JOURNAL Proc. Natl. Acad. Sci. U.S.A. 87, 2700-2704 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence [1] kindly submitted by Chambon,P., 13-FEB-1990, for release after publication. FEATURES from to/span description pept 399 > 620 retinoic acid receptor gamma 1 mRNA 1 > 620 retinoic acid receptor gamma 1 mRNA BASE COUNT 108 a 217 c 192 g 102 t 1 others ORIGIN Chromosome 15. 1 attttggggc ggtcacgtgg gcgggcgggc tccgagaggc cccgggattg tcccagccta 61 gagccgtgcc cccctgaagc cccccattac cgcgagtcac taacaccgcg gtgctccatc 121 cccgagaccg cccgacgccg ggacctgggg ctctgcggcc ttccttcccc cgccctcccc 181 tccagcagtt tccaccaggt ccctcacctc agcctggccc agtatgtagg agggactctc 241 tgcangaggc cagagggatc cttggaaacc cactggacag accaggcagg gtgggcacgg 301 agcctcccag gccagggcag tgggcatggg cgggggctgt agctgaagac cacccccgcc 361 tgctgcagag tccaagggat tcccacgccg cagctaccat ggccaccaat aaggagagac 421 tctttgcgcc cggtgccctg gggcctggat ctggttaccc aggagcaggc ttcccattcg 481 ccttcccagg tgcactcaga gggtcgccac catttgagat gctgagccct agcttccggg 541 gcctgggcca gcctgacctc cccaaggaga tggcttctct ctcggtggag acacagagca 601 ccagctcgga ggagatggta // LOCUS MUSRARG2 479 bp ss-mRNA ROD 17-JUL-1990 DEFINITION Mouse retinoic acid receptor gamma 2 mRNA, 5' end. ACCESSION M32069 KEYWORDS retinoic acid receptor gamma. SOURCE Mouse F9 L cell, cDNA to mRNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 479) AUTHORS Kastner,P., Krust,A., Mendelsohn,C., Garnier,J.M., Zelent,A., Leroy,P., Staub,A. and Chambon,P. TITLE Murine isoforms of retinoic acid receptor gamma with specific pattern of expression JOURNAL Proc. Natl. Acad. Sci. U.S.A. 87, 2700-2704 (1990) STANDARD full staff_review COMMENT Draft entry and printed sequence for [1] kindly submitted by P.Chambon, 13-FEB-1990, for release after publication. FEATURES from to/span description pept 291 > 479 retinoic acid receptor gamma 2 mRNA 1 > 479 retinoic acid receptor gamma 2 mRNA BASE COUNT 75 a 163 c 143 g 98 t ORIGIN Chromosome 15. 1 tctaggcttt gcttttacaa aaggtctcca cagagctagc tgcctaggca ccctaagtga 61 ggcagactga ggcagaggct acagcttcgc cttcgcgccc cgcccagttc ttttcccttg 121 ctggggatcc ccctcctccc cgcccatccc ttacccccca tgcatctcag cgccctacgc 181 tagccctccc cctcccccct ccaggagcgg ggcgctgcgg ggggaggagg gggaatgggc 241 tgcgggtcct tggtgttcta gcacccagtt tctctccaag ccgggtcgcg atgtacgact 301 gcatggaatc gtttgtcccg ggtccgcgac ggctgtatgg ggcggccggg cccggggccg 361 gcttactacg cagagccact ggcagctcct gcttcgccgg acttgagtct tttgcctggg 421 cacagcctgc cagtctacaa tcggtggaga cacagagcac cagctcggag gagatggta // LOCUS MUSRARG3 646 bp ss-mRNA ROD 17-JUL-1990 DEFINITION Mouse retinoic acid receptor gamma 3 mRNA, 5' end. ACCESSION M32070 KEYWORDS retinoic acid receptor gamma. SOURCE Mouse 11.5 day embryo F9 L cell, cDNA to mRNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 646) AUTHORS Kastner,P., Krust,A., Mendelsohn,C., Garnier,J.M., Zelent,A., Leroy,P., Staub,A. and Chambon,P. TITLE Murine isoforms of retinoic acid receptor gamma with specific pattern of expression JOURNAL Proc. Natl. Acad. Sci. U.S.A. 87, 2700-2704 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence [1] kindly submitted by P.Chambon, 13-FEB-1990, for release after publication. FEATURES from to/span description pept 496 > 646 retinoic acid receptor gamma 3 mRNA < 1 > 646 retinoic acid receptor gamma 3 mRNA BASE COUNT 151 a 150 c 184 g 161 t ORIGIN Chromosome 15. 1 tccccacccc caccgccacc aactgcttcc agggaactct gggattacta gagtcaggag 61 tgagccctaa ccttttcagt tctatgcccc tccccgcccc ctttaaaaat gtgtatggtg 121 ttctgtctat atgtattttc gcgtgccatt tttgtgcctg gtgcccatgg aagatggaag 181 agggaaccgg ttctgttaga actggcgatt acaaatggtt gtaaactacc atgtagatgc 241 tgggaatgga accctggtcc tctgggagag cagccactgc tcttacctgc tgagtcccaa 301 ccaatcttca actttatgga gcagaagcag agaagttaac tccttggcct ggaatttgca 361 gaagtgaacg ttaatgtaga agagttggct ttatgggggt ggggatgggg tagggggcag 421 tggtggggcc tgaaatccca acaagctaca aagagtggtg gtctgggctt tccagggagt 481 acctgttaag ggcttatgca caagggtgac aacagcggtc accagcaggt cccaagaaag 541 agaggccatg ggatgagggt gcttctgctc agcttctgct tatcttctca tgctgctttt 601 cattcagcgg tggagacaca gagcaccagc tcggaggaga tggtac // LOCUS MUSRARG4 309 bp ss-mRNA ROD 17-JUL-1990 DEFINITION Mouse retinoic acid receptor gamma 4 mRNA, 5' end. ACCESSION M32071 KEYWORDS retinoic acid receptor gamma. SOURCE Mouse 9.5 day embryo, cDNA to mRNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 309) AUTHORS Kastner,P., Krust,A., Mendelsohn,C., Garnier,J.M., Zelent,A., Leroy,P., Staub,A. and Chambon,P. TITLE Murine isoforms of retinoic acid receptor gamma with specific pattern of expression JOURNAL Proc. Natl. Acad. Sci. U.S.A. 87, 2700-2704 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence [1] kindly submitted by P.Chambon, 13-FEB-1990, for release after publication. FEATURES from to/span description pept 300 > 309 retinoic acid receptor gamma 4 mRNA < 1 > 309 retinoic acid receptor gamma 4 mRNA BASE COUNT 77 a 81 c 100 g 51 t ORIGIN Chromosome 15. 1 gatgacaatc gtaaggcccc caaatatcca tcttccccct ctctgtaatg ggagatgcca 61 ggcttggggc ccaggaaccc ctttccagag cggcaataaa aagctgggga tggagaggca 121 gtaaaaataa agtccagatg agaagcggct ttgaagcctg ggccacagtg cctggcgggg 181 gtgggctgtt ggaaggggct gagagaggag cctgggacac cccccccccc aaatttactg 241 ggggttggca ccgcactcgc agtctagcgg tggagacaca gagcaccagc tcggaggaga 301 tggtaccca // LOCUS MUSRARG5 111 bp ss-mRNA ROD 17-JUL-1990 DEFINITION Mouse retinoic acid receptor gamma 5 mRNA, 5' end. ACCESSION M32072 KEYWORDS retinoic acid receptor gamma. SOURCE Mouse F9 cell, cDNA to mRNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 111) AUTHORS Kastner,P., Krust,A., Mendelsohn,C., Garnier,J.M., Zelent,A., Leroy,P., Staub,A. and Chambon,P. TITLE Murine isoforms of retinoic acid receptor gamma with specific pattern of expression JOURNAL Proc. Natl. Acad. Sci. U.S.A. 87, 2700-2704 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence [1] kindly submitted by Chambon,P., 13-FEB-1990, for release after publication. FEATURES from to/span description pept 102 > 111 retinoic acid receptor gamma 5 mRNA 1 > 111 retinoic acid receptor gamma 5 mRNA BASE COUNT 23 a 26 c 37 g 25 t ORIGIN Chromosome 15. 1 tgaagcttct ccttgggggc tggattttgg attgaagagt cagcccctgg caccttgggg 61 atatccttgc ggtggagaca cagagcacca gctcggagga gatggtaccc a // LOCUS MUSRARG6 420 bp ss-mRNA ROD 17-JUL-1990 DEFINITION Mouse retinoic acid receptor gamma 6 mRNA, 5' end. ACCESSION M32073 KEYWORDS retinoic acid receptor gamma. SOURCE Mouse 9.5 day embryo F9, P19, and L cells, cDNA to mRNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 420) AUTHORS Kastner,P., Krust,A., Mendelsohn,C., Garnier,J.M., Zelent,A., Leroy,P., Staub,A. and Chambon,P. TITLE Murine isoforms of retinoic acid receptor gamma with specific pattern of expression JOURNAL Proc. Natl. Acad. Sci. U.S.A. 87, 2700-2704 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence [1] kindly submitted by Chambon,P., 13-FEB-1990, for release after publication. FEATURES from to/span description pept 407 > 420 retinoic acid receptor gamma 6 mRNA 1 > 420 retinoic acid receptor gamma 6 mRNA BASE COUNT 84 a 121 c 128 g 87 t ORIGIN Chromosome 15. 1 tgaggcagcg ctctgtgggt agggagggcg tcagtgcagg agagagtacc acccacgctc 61 aggcctgtgg ggaccccagg atgggctgaa agtgagggcc ggaaaggcct tccaggcttc 121 cccaaacctc ccagcaccta ccattcaggc aaccccaccc ccagtcttga ataaactccc 181 tgcacccttc cgcccccttt cttttgaggg ggaatctaac tccagcagga ttcttatgct 241 aattgggtgc gtggggggtg gggtgggtgg agaaggcttc cctcttgtaa ggtggggagc 301 tggtctggaa cccccaacct ataggctctt ctgtcctctc actaccttgg gtctcagtat 361 ggacttgtga ccagcggtgg agacacagag caccagctcg gaggagatgg tacccagctc // LOCUS PMGMTF1 190 bp ds-DNA ORG 17-JUL-1990 DEFINITION P.magellanicus mitochondrial fragment I DNA. ACCESSION M34468 KEYWORDS . SEGMENT 1 of 3 SOURCE P.magellanicus mitochondrial DNA. ORGANISM Mitochondrion Placopecten magellanicus Eukaryota; Animalia; Eumetazoa; Mollusca; Bivalvia; Pteriomorphia; Eupteriomorphia; Ostreoida; Pectinina; Pectinacea; Pectinidae; Placopecten magellanicus. REFERENCE 1 (bases 1 to 190) AUTHORS La Roche,J., Snyder,M., Cook,D.I., Fuller,K. and Zouros,E. TITLE Molecular characterization of a repeat element causing large-scale size variation in the mitochondrial DNA of the sea scallop Placopecten magellanicus JOURNAL Mol. Biol. Evol. 7, 45-64 (1990) STANDARD simple staff_review BASE COUNT 66 a 63 c 14 g 47 t ORIGIN 1 ctttcaaaca caaaaaacac aaaaagctcg ccaataaact ttccaatttt ttaacactac 61 agctaagttt tccccattct ttcacctcaa gcgaagggaa atctatatcc ccactcccaa 121 caatctactg gcatactact tatactacca aagaaactca tctacccccc atccacccca 181 cattggtacc // LOCUS PMGMTF2 1449 bp ds-DNA ORG 17-JUL-1990 DEFINITION P.magellanicus mitochondrial fragment II DNA. ACCESSION M34466 KEYWORDS . SEGMENT 2 of 3 SOURCE P.magellanicus mitochondrial DNA. ORGANISM Mitochondrion Placopecten magellanicus Eukaryota; Animalia; Eumetazoa; Mollusca; Bivalvia; Pteriomorphia; Eupteriomorphia; Ostreoida; Pectinina; Pectinacea; Pectinidae; Placopecten magellanicus. REFERENCE 1 (bases 1 to 1449) AUTHORS La Roche,J., Snyder,M., Cook,D.I., Fuller,K. and Zouros,E. TITLE Molecular characterization of a repeat element causing large-scale size variation in the mitochondrial DNA of the sea scallop Placopecten magellanicus JOURNAL Mol. Biol. Evol. 7, 45-64 (1990) STANDARD simple staff_review BASE COUNT 405 a 403 c 193 g 448 t ORIGIN About 500 bp after segment 1. 1 ctgcagcttt ttttttcctt ccccgaaatt ttacccccca ataaaaatgg ataaaaaccc 61 cctccaaaaa ataagaaaaa accggaaaac gcaattttcg tgtaaaaaaa aggtgttttt 121 cgaaaaaaaa aaataaaatc tgtacctcgc cgaccaaaaa actttttcat tttttcaaca 181 aatttttcaa ctttttaatc atcaaaccga gcacttttta taaacataaa ccacgaccaa 241 ctgaagggaa aaaaaacact ttattttggg cacttttctc aaaaaagctt cagctagaat 301 tgccagaaat ccttgcccaa attcgacaaa ttttaggtac ctttgttttt actgcgaagg 361 taaagttaaa cttcgcccaa tcccctctac cagcgagggg ggaataatcc ccactttcag 421 gctataagcc taacgttctt actacactac ctcgaccccc cattcctact gcccagcccc 481 ttctctcact aaaccatcca ccccccactt tccaattttt ttaacactac agctaagttt 541 tccccattct ttcacctcaa gcgagaaggg aaatctatat ccccactccc aacaatctac 601 tggcatacta cttatactac caaaagaaac tcatctaccc cccatccacc ccacattggt 661 accagaacgt tcctacaaac cctttcttcg ctataggggg aaggaacctt cttccttccc 721 ctcggacccc ttcctcgccc cctcccccct cccatcccca acccagctat cctttcccct 781 cccccctagc ccccctacac gcttaactgg tagtcccccc ccttcctccc cccctccctg 841 tgcgggtcaa cgaacctgtc tctactccgc tcttgatcaa acctcgaaaa aaaaaatctt 901 tttttttttt tttttttcgg tctccaagga ctccgtggcc gagaccagga ttctaccctg 961 tagggaaagt ggagggtaag gggttaaata tttatcccgg gcggggggaa cccggaatta 1021 tcctttataa ccaatttcaa cccaatttat ggctccgtac caaaacgcta tttgccccac 1081 ccccctctga ggctttttga ttcagaaaac acgagattta gtgatatttt tcataatatg 1141 gcacaagcaa ttcactgaag gcctaaattt ccgaccctgg aacaattttt ttcaaaaatt 1201 tgacatctaa aagtcacctg aatttacagc cttttttttg attttcaaat ttttgccgtt 1261 tttggacccc gttttttttg actttttttt tgtgtttttt tgtgtttttt tttcaccctt 1321 ttttttgcca aaaattacca tttttgcttg gctacaataa gtttacaaag ccggctataa 1381 aaaattttta gaaaaacccg aaaatatcag ttttgagttt tgaaatgaac tttttttttt 1441 tgactgcag // LOCUS PMGMTF3 746 bp ds-DNA ORG 17-JUL-1990 DEFINITION P.magellanicus mitochondrial fragment III DNA. ACCESSION M34467 KEYWORDS . SEGMENT 3 of 3 SOURCE P.magellanicus mitochondrial DNA. ORGANISM Mitochondrion Placopecten magellanicus Eukaryota; Animalia; Eumetazoa; Mollusca; Bivalvia; Pteriomorphia; Eupteriomorphia; Ostreoida; Pectinina; Pectinacea; Pectinidae; Placopecten magellanicus. REFERENCE 1 (bases 1 to 746) AUTHORS La Roche,J., Snyder,M., Cook,D.I., Fuller,K. and Zouros,E. TITLE Molecular characterization of a repeat element causing large-scale size variation in the mitochondrial DNA of the sea scallop Placopecten magellanicus JOURNAL Mol. Biol. Evol. 7, 45-64 (1990) STANDARD simple staff_review BASE COUNT 260 a 213 c 99 g 174 t ORIGIN About 500 bp after segment 2. 1 ctgcagcttt ttttttcctt ccccgaaatt ttacccccca ataaaaatgg ataaaaaccc 61 cctccaaaaa ataagaaaaa accggaaaac gcaattttcg tgtaaaaaaa aggtgttttt 121 cgaaaaaaaa aaataaaatc tgtacctcgc cgaccaaaaa actttttcat tttttcaaca 181 aatttttcaa ctttttaatc atcaaaccga gcacttttta taaacataaa ccacgaccaa 241 ctgaagggaa aaaaaacact ttattttggg cacttttctc aaaaaagctt cagctagaat 301 tgccagaaat ccttgcccaa attcgacaaa ttttaggtac ctttgttttt actgcgaagg 361 taaagttaaa cttcgcccaa tcccctctac cagcgagggg ggaataatcc ccactttcag 421 gctataagcc taacgttctt actacactac ctcgaccccc cattcctact gcccagcccc 481 ttctctcact aaaccatcca ccccccactt tcctccggga aaaaaaaaaa aaagaaggga 541 tctcatataa ccccatctta ccgctccggc acaaccaata gacgattcac aagtagagcc 601 cagccctcct tgcgacccta accctgaaag taccgaagag acccgcaacc atcacaaaca 661 aaaacatccg gcctaaagga accaaacccc aacaccagtc cgaggggtta tcgtttcaat 721 accgcatgca tagaacaaac ctgcag // LOCUS HUMCREB 1026 bp ss-mRNA PRI 17-JUL-1990 DEFINITION Human active transcription factor CREB mRNA, complete cds. ACCESSION M34356 KEYWORDS active transcription factor CREB; alternative splicing. SOURCE Human T-cell, cDNA to mRNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 1026) AUTHORS Berkowitz,L.A. and Gilman,M.Z. TITLE Two distinct forms of active transcription factor CREB JOURNAL Proc. Natl. Acad. Sci. U.S.A. (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by M.Z.Gilman, 16-MAY-1990. FEATURES from to/span description pept 1 1026 active transcription factor CREB-B pept 1 261 active transcription factor CREB-A 304 1026 active transcription factor CREB-A BASE COUNT 332 a 249 c 233 g 212 t ORIGIN 1 atgaccatgg aatctggagc cgagaaccag cagagtggag atgcagctgt aacagaagct 61 gaaaaccaac aaatgacagt tcaagcccag ccacagattg ccacattagc ccaggtatct 121 atgccagcag ctcatgcaac atcatctgct cccaccgtaa ctctagtgca gctgcccaat 181 gggcagacag ttcaagtcca tggagtcatt caggcggccc agccatcagt tattcagtct 241 ccacaagtcc aaacagttca gtcttcctgt aaggacttaa aaagactttt ctccggaaca 301 cagatttcaa ctattgcaga aagtgaagat tcacaggagt cagtggatag tgtaactgat 361 tcccaaaagc gaagggaaat tctttcaagg aggccttcct acaggaaaat tttgaatgac 421 ttatcttctg atgcaccagg agtgccaagg attgaagaag agaagtctga agaggagact 481 tcagcacctg ccatcaccac tgtaacggtg ccaactccaa tttaccaaac tagcagtgga 541 cagtatattg ccattaccca gggaggagca atacagctgg ctaacaatgg taccgatggg 601 gtacagggcc tgcaaacatt aaccatgacc aatgcagcag ccactcagcc gggtactacc 661 attctacagt atgcacagac cactgatgga cagcagatct tagtgcccag caaccaagtt 721 gttgttcaag ctgcctctgg agacgtacaa acataccaga ttcgcacagc acccactagc 781 actattgccc ctggagttgt tatggcatcc tccccagcac ttcctacaca gcctgctgaa 841 gaagcagcac gaaagagaga ggtccgtcta atgaagaaca gggaagcagc tcgagagtgt 901 cgtagaaaga agaaagaata tgtgaaatgt ttagaaaaca gagtggcagt gcttgaaaat 961 caaaacaaga cattgattga ggagctaaaa gcacttaagg acctttactg ccacaaatca 1021 gattaa // LOCUS PFAATUBII 3706 bp ds-DNA INV 17-JUL-1990 DEFINITION P.falciparum alpha-tubulin II gene, complete cds. ACCESSION M34390 KEYWORDS alpha-tubulin II. SOURCE P.falciparum haploid intraerythrocytic cell line K1 (Thailand) DNA. ORGANISM Plasmodium falciparum Eukaryota; Animalia; Protozoa; Microspora; Microsporea; Microsporida; Haemosporina; Plasmodiidae. REFERENCE 1 (bases 1 to 2779) AUTHORS Holloway,S.P., Gerousis,M., Delves,C.J., Sims,P.F.G., Scaife,J.G. and Hyde,J.E. TITLE The tubulin genes of the human malaria parasite Plasmodium falciparum: Their chromosomal location and sequence analysis of the alpha-tubulin II gene JOURNAL Mol. Biochem. Parasitol. (1990) In press STANDARD full staff_review REFERENCE 2 (bases 2780 to 3706) AUTHORS Holloway,S.P., Gerousis,M., Delves,C.J., Sims,P.F.G., Scaife,J.G. and Hyde,J.E. JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by J.E.Hyde, 22-MAY-1990. FEATURES from to/span description pept 749 810 alpha-tubulin II, exon 1 1022 1142 alpha-tubulin II, exon 2 1262 2431 alpha-tubulin II, exon 3 IVS 811 1021 alpha-tubulin II intron A IVS 1143 1261 alpha-tubulin II intron B BASE COUNT 1390 a 347 c 502 g 1467 t ORIGIN Chromosome 4. 1 gaattcctta tatatgtact cgttttatat ttttttccta taatttcaac atattaaaaa 61 gtactgaata taacattata tatttacata ttcaagttat atttattttc tttttctttt 121 ttcttgatat attatattat ttgattattt ttttttcttt taatatgtgg cttatattta 181 caaaaatagt aatatatata tgttaatatg taaagaaaca gaaatgccat tgtatgaaat 241 ataaaagttt caattttttt aaaaaatata aaaaaagaaa gaaaataaca aataaaaata 301 aatatattat atatatgtgt ttatattttt atttttatat ataattatat tttttcttta 361 tgatatatat tattatatct ttataaagaa gacataaaag aaacagattg attgaagttt 421 ttcttgcttt aatataagaa agtaatttca taatgaaaat aatatatata tatgtatatg 481 atattttaat ttataattat aatgagtttg taaaaatata ttttatataa gtttatatgt 541 attttttata tatatatata tatatatata tggctttgga taatataata tatatatata 601 aataaataaa tatatatata tatatatata tatatatgta tatttattta catatttatt 661 taatataggt acaaacatag acctacatat ttttatattt atttatttta ttttatatta 721 acatttttta tctaacgttt tacgaacgat gagagaagtc attagtattc atgttggaca 781 ggctggtatt caaataggaa atgcttgctg gtaaaaaaaa aaaaaaaaaa aaaaaaaaaa 841 aaaaaatata tatatatata tataaatgca tgtgtgttgt atatttttct tcttcatttt 901 tatttttaaa gagatacata atgtttaatg tattattcaa tgtgttgtgg taaaaacatg 961 tgtataaatt ataattaaaa atatacatat atatatatat atatatattt tttttttata 1021 gggaattgtt ttgccttgaa catggaattc aaccggatgg gcagatgcca agtgaccaag 1081 tcgttgctgg tggtgatgat gcctttaata catttttctc agaaacggga gctggaaaac 1141 atgtaataat aaaaaaaaaa aaaaaaaaat atatacatat atatatatat atatatatat 1201 atttatttat ttatttattt acttatacat gtgtgcttct ttttgtcgtt actcatttta 1261 ggtaccacgt tgtgtgttcg ttgatttaga acccaccgtc gttgacgaag ttcgaacagg 1321 aacgtatcgt cagctgtttc accctgaaca actaatatct ggaaaagagg atgcagcaaa 1381 taatttcgca aggggacatt ataccatagg aaaagaaatt gttgatgtat gtttggatag 1441 ggttcgaaag ttggctgata attgcactgg attacaagga tttttgatgt ttaatgcagt 1501 aggtggaggt acaggtagtg gtcttggttg tttattatta gaaaggttgg caatagatta 1561 tggaaagaaa tcaaaattaa atttttgttc gtggccatct cctcaagtat cgacagctgt 1621 tgtagagcct tataattctg tattatcaac acattcattg ttagaacata cagatgtggc 1681 aattatgctc gataacgaag caatatatga tatatgtaag aaaaatttag atatagaaag 1741 gccaacctat actaacttga atagattgat tgctcaagtt atctcttcat taacagcatc 1801 tttaagattt gatggtgctt tgaatgttga tgtaacagaa tttcagacta atttagtacc 1861 atatcctaga attcacttta tgttatcatc atatgctcca atcataagtg ctgagaaggc 1921 atatcacgag caattgtcgg tttctgaaat aacgaattct gcctttgagc ctgcatctat 1981 gatggcaaag tgtgatccca gacatggaaa atatatggct tgttgtttaa tgtatagagg 2041 agatgtagta ccaaaggatg ttaatgctgc cgtcgcaact attaagacta agagatctat 2101 acaattcgtt gattggtgtc ctacgggatt taaatgtgga atcaattatc agccccctac 2161 cgtggttcca ggaggagatt tagcaaaagt tatgagagct gtttgcatga tcagcaactc 2221 aacagcaatt gccgaagtat tctcacgaat ggaccaaaaa tttgatttga tgtatgcgaa 2281 aagggctttc gttcattggt atgtaggtga aggtatggaa gaaggagaat ttagtgaagc 2341 tagagaggat ttggctgcct tggaaaagga ttatgaagag gtaggaatag aatcgaatga 2401 tggagaggga gaagatgagg gatatgaatg aacataaata taaatggata aataaataaa 2461 tatatatata tatatatata tatatatata tatatatatg tatatctact tataattatt 2521 tttttttttt atctcatacg tcctatattc ccaaaatatt atatatatgt tcataataat 2581 tatatggaac atatatatcc tcttatcata ttgtatattt attcgttgta aaaaataaaa 2641 aaaaatatac aagaatatta atattttttt ctatgtattt acaccttcat tagtttaata 2701 ttcaagaaaa ataatactta ctagaaataa ataaaatata tatatatata tatatgtatg 2761 tatatatgtg tgtatgtacc attaattagt aagtagtaat ttattatgca atttatttat 2821 tatatatata tatatatata tatatatata gtatcaaatt aaatatacga caattcgtaa 2881 gtacattctt gaaaaaataa atgtatagat ataatattat tttttttata caaatttata 2941 tttgtagaat atatattttt ttccctttat tattatgcat tatataataa tatatacatg 3001 ataatatgca tacatataca tattatatat atatatgtat gtatgtatat atgtcgtgtg 3061 ataatttatc tttatgctaa taatacatta cacatataaa tacttattta tatctatata 3121 tatttttttt atatacgaca ataatttatg taaacattaa gttacatatt aaagttcgat 3181 taataaatat ataataattt tttttatttt attatatcta tttttccaat gtataaataa 3241 tatatgaaat atactatatg ctatataata tgagcacaca taagatttgg tgttatataa 3301 atgtagaagt aatattaata gtatcagacg aaaagaacat tttttttttt tttttttttt 3361 tttttttggt ccttttaaat tgatataaaa aactaaactt tagagtttct ttaaaatgag 3421 acatttttta caacttaata taatttaatt tttttttttt tttttttcat atttattaga 3481 aatgaatata taggttaaca catatatatt atatatatat atattaacat gaaaaggtta 3541 aaaaaaaata tggaaaatac aaacatataa ttatatacat atatattata tatatgtatg 3601 tatctttttt atgttttaat ttggtcacgt ggttaatttt atatgatgct aaaaaaattt 3661 tcttccattt aagaaacagg aattaacaaa aaatttaagg gaattc // LOCUS XELFIMB1 1341 bp ss-mRNA VRT 17-JUL-1990 DEFINITION X.laevis integumentary mucin B.1 (FIM-B.1) mRNA, complete cds. ACCESSION J02910 KEYWORDS integumentary mucin; mucin. SOURCE X.laevis skin, cDNA to mRNA, clones pREP1218 and pXGL-3'-1. ORGANISM Xenopus laevis Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Amphibia; Lissamphibia; Anura; Archeobatrachia; Pipoidea; Pipidae. REFERENCE 1 (bases 1 to 1341) AUTHORS Probst,J.C., Gertzen,E.-M. and Hoffmann,W. TITLE An integumentary mucin (FIM-B.1) from Xenopus laevis homologous with von Willebrand factor JOURNAL Biochemistry 29, 6240-6244 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by W.Hoffmann, 26-APR-1990. FEATURES from to/span description pept < 1 1197 integumentary mucin (FIM-B.1; AA at 1) mRNA < 1 1341 FIM-B.1 mRNA signal 1319 1324 poly-A signal rpt 1 279 repeat sequence BASE COUNT 472 a 290 c 268 g 311 t ORIGIN 1 cctgcacctt cagaaacaac aggagagtca acacctgcac cttcagaaac aacaggagag 61 tcaacacctg caccttcaga aacaacagga gagtcaacac ctgcaccttc agaaacaaca 121 ggagagtcaa cacctgcacc ttcagaaaca acaggagagt caacacctgc accttcagaa 181 acaacagtac catcaggaga gtcaacacct gcaccttcag aaacaacagt accatcagta 241 ccatcaggag agtcaacacc tgcaccttca gaaacaacag agctgagaat catcccacca 301 gaagtatcta cagttgcagt gccagtaaca actggacaga taactcctgc agtaacaact 361 gaacacagca cagaagaaat actgacactt cctccgccag tggtaggacc agttcttcca 421 gcaaagccaa cagtagatat ctccaaatat acaaacacta caactactaa atcgactgtc 481 cctacaacta ccattcctcc taaagctact tgctgtggat cgtctggaga gtcagtacaa 541 gcaggtcata tgtggcaaac tggttgtgat gtgtgcacat gcaatggaac ctctggaaaa 601 acacaatgtg caccacgcca gtgtgaaaag gagataatct gtaagagcga cgaaaggagg 661 gtactgagaa aacctgggaa atcatgctgt ggatattgcg agccgctgac atgtaaacat 721 aatggaacag aatataagct tggggctacc tttatagaca aatcgaatcc ttgtatcact 781 tatagatgtg acgcatcagg cttgacagtg aatgttaaat catgcccaaa tgaacaagtt 841 tgcagtaaga gtgaaagaac ctatgattca gatggctgtt gtttctcatg tgacacatca 901 tgtaaaccag tccctgcgac agtgggtatc caaggggagt acgattatca gaatgaaaaa 961 acaaactgct ctgctaatat cataatggca aaatgttctg gtcaatgcca acataaattg 1021 acctatgata caattgacaa taaagttgtt accaaatgtc ggtgttgtaa ggcagaccgt 1081 gttgaaccaa gaaaagctca tctggtgtgt gataatggaa agaaaaaaat atataaatac 1141 aaacatatca catcttgcaa atgcacatct tgtaccgctt acaatataag actgtgaaga 1201 ctttcatgca tttgctaact gtcagctttt gcatgtacaa taaaataaac atgggggata 1261 tattggaggc agagatatct gttatggtac ataaagtatt ttttgtctgt tattgaaaaa 1321 taaatggttt ttttcagctg t // LOCUS YSCANBI 2156 bp ds-DNA PLN 17-JUL-1990 DEFINITION S.cerevisiae ANB1 locus encoding protein synthesis initiation factor (eIF-4D) gene, complete cds. ACCESSION J05455 KEYWORDS protein synthesis initiation factor eIF-4D. SOURCE S.cerevisiae DNA. ORGANISM Saccharomyces cerevisiae Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes; Endomycetales; Saccharomycetaceae. REFERENCE 1 (bases 1 to 2156) AUTHORS Mehta,K.D., Leung,D., Lefebvre,L. and Smith,M. TITLE The ANB1 locus of Saccharomyces cerevisiae encodes the protein synthesis initiation factor eIF-4D JOURNAL J. Biol. Chem. (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by Dr. K.D.Mehta, 16-MAR-1990. FEATURES from to/span description pept 569 1042 protein synthesis initiation factor (eIF-4D) mRNA 516 > 1042 protein synthesis initiation factor mRNA (alt.) mRNA 526 > 1042 protein synthesis initiation factor mRNA (alt.) mRNA 540 > 1042 protein synthesis initiation factor mRNA (alt.) signal 395 400 TATA box BASE COUNT 602 a 422 c 369 g 763 t ORIGIN 3 bp upstream of SmaI site. 1 cccgggaatt ttagattcag gtaggaaatt gattacataa atactgttac cctgaatcat 61 attcgacgat gtcgtctcac acggaaatat aattcatttc ttggttttcc aaaaaaattt 121 tcattttttt tcactttttt gtttcgtcct cctttttttt tttttttatt ttttttcctg 181 tgttcacctt tttttttttt ttttcagttt acaactttct gcattctttt ctgtgttttt 241 tttttttcgt ttttccattg ttcgttcgtt gcctgttttt ttgccctatt gttctcgagc 301 ctaaaaattt tttcctttcc tgctttcctt tcttcgttca aagtttccta ttccattgtt 361 ctcttcggta aactcattgt tgtcggaact cagatatatt caggtcaatt tactgtactt 421 caattgactt ttttcttgaa atttcaactt gccttttcaa cttgttcttc ttttttaatc 481 ttattctaca ctttagttcc cttaccttgt tcctaattat tgtctagcaa aaagaaaaca 541 tacacctatt tcattcacac actaaaacat gtctgacgaa gaacacacct ttgaaaatgc 601 tgacgctggt gcctccgcca cctacccaat gcaatgttct gccttgagaa agaatggttt 661 cgttgtcatc aaaggtagac catgtaagat tgtcgacatg tccacttcca agaccggtaa 721 gcacggtcac gccaaggtcc atttggttac ccttgatatc ttcactggta agaagttgga 781 agatttgtct ccatccactc acaacttgga agttccattt gtcaagagaa gcgaatacca 841 attgttggat attgatgatg gctacttgtc cttgatgacc atggacggtg aaaccaaaga 901 cgatgtcaag gcaccagaag gtgaattggg tgacagcatg caagctgctt tcgatgaagg 961 taaggatttg atggtcacta tcatttctgc catgggtgaa gaagctgcca tctccttcaa 1021 ggaagctcca agatctgatt agatatttta gatttctata gcttacaaaa tacgcgatgt 1081 ttttttgatc tagaaatctt atttatacat tttatataat attataataa tcataatcat 1141 aattattcat cctaactaac ttcgttctgt ttcagctacg aagttgcctt cattatcggt 1201 acgttcgtgt taattggtgt ttgataccat agtcacttat gacgacttac agaacttttt 1261 tttttgacgg atattttccc ctcaatatgc cgaggatgat aactttataa attgccttct 1321 ttcggttcct tttttctttt taatttagct gacgggaaca tgtgtatata ccccagaaga 1381 atttaaacag aaattttttt gagctatttt acgaaaatat taaaaactag ggctacagat 1441 atcagagaaa aaattactta tatgatgaca ataacttaat atcacagtca ctatcgttct 1501 cgcctttata tgcgcaatga cagcattcga cttttcctta gaagggatcc gatcaacgaa 1561 acttcaaaaa gagtagaagt cgaaatattt tttgcacgac gagggtgaca aaccgtgcta 1621 tgtccacaaa tgaggccctg ctttcccaag ataaattgcg taaacatata attgttacaa 1681 ggatccgatc aacgaaactt caaaaagagt agaagtcgaa atattttttg cacgacgagg 1741 gtgacaaacc gtgctatgtc cacaaatgag gccctgcttt cccaagataa attgcgtaaa 1801 catataattg ttacaaatcc aaggctctgt gatttcttca ccaagtgtag ttgtgatgta 1861 tagtgtgtag gataggtaat gaccggtaaa tcaagctggt aaaaagtcct aataacaagc 1921 ccgaatctgg aataatggta atattcccat gccaggtgga cagtgttgtc cttacacaaa 1981 ctcaagagat taaaggcaac ttggccgagt ggttaaggcg aaagattaga aatcttttgg 2041 gctttgcccg cgcaggttcg agtcctgcag ttgtcgttat tttttatttc cctttctttt 2101 cgtacataca caatgcacag tcttcaaagt aaaataccaa cgtatgtatc aagatc // LOCUS BSUBMRX 1402 bp ds-DNA BCT 17-JUL-1990 DEFINITION B.subtilis multidrug-resistant protein (BMR) gene, complete cds. ACCESSION M33768 KEYWORDS multidrug-resistance. SOURCE B.subtilis DNA. ORGANISM Bacillus subtilis Prokaryota; Bacteria; Firmicutes; Endospore-forming rods and cocci; Bacillaceae. REFERENCE 1 (bases 1 to 1402) AUTHORS Neyfakh,A.A., Bidnenko,V.E. and Chen,L.B. TITLE Efflux-mediated multidrug resistance in bacteria: Similarities and dissimilarities with mammalian system JOURNAL Cell (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by A.A.Neyfahh, 20-APR-1990. FEATURES from to/span description pept 195 1364 multidrug-resistance protein (BMR) signal 130 135 -35 promoter signal 155 160 -10 promoter BASE COUNT 353 a 300 c 323 g 426 t ORIGIN About 216 min. on B.subtilis map. 1 ggcggatatg gatggagaag tttatacaca tacacccgca gtcattcagg tcctgccaca 61 gcatatcgac atgcttgttc ccgcaaatga ataactgtca taaggcttta gaaagatttt 121 gcaaatccgt tgactctccc ctaggaggag gtcttacagt ataagggata cacacttgaa 181 aaaggggaag tcatatggag aagaaaaata ttaccttaac tatattatta accaatttat 241 ttattgcttt tttggggatc gggcttgtga ttccagtaac gccgaccatt atgaatgaat 301 tgcatttatc ggggaccgcg gtcggctata tggttgcctg cttcgctatt acacagctca 361 ttgtctcacc aatagccgga cgatgggttg atcgcttcgg gcgcaagatc atgatcgtaa 421 tcggcctgtt gttctttagt gtgtcggagt ttttgttcgg cattggaaaa acagttgaga 481 tgttatttat cacccgtatg ctgggcggta tcagcgcacc gttcattatg cccggggtca 541 cggcttttat tgcagatatc acgaccatta aaacacggcc aaaagcgctc ggttatatgt 601 cagccgctat ttcaacagga tttattatcg gccccggcat cgggggattt ttagcagaag 661 tccattcccg gctgcctttt ttctttgcgg cagcttttgc actgttagca gccattttat 721 caatcctcac gctgcgcgag ccggaacgaa accctgaaaa tcaggaaata aaaggacaga 781 agacaggctt taaacgaatt tttgccccca tgtatttcat agcttttctc attatcttaa 841 tttcgtcttt tggtttagca tcatttgaat ctttatttgc attattcgtg gatcataaat 901 tcggatttac ggccagcgac attgccatta tgattacagg aggagcgatt gttggcgcca 961 ttacgcaagt cgtcttattc gaccgcttca caagatggtt tggcgaaatt catttaattc 1021 ggtacagctt aattctctcg acgagtctgg tattcttgct gacaacggta cattcatatg 1081 ttgcgattct gctggtgaca gtcaccgtat ttgtcggatt tgatctcatg cggcctgcgg 1141 taacgactta cctgtcaaag attgcgggaa atgaacaggg gtttgccggc ggtatgaatt 1201 caatgtttac aagtatcggc aatgtattcg ggcctattat cggcggaatg ctgttcgata 1261 tagatgtaaa ctatcctttc tactttgcaa cggtcacctt agccataggg attgcactga 1321 ccattgcttg gaaagcgcct gcacatctta aagccagcac gtgataagaa gcgcattctt 1381 tgtgtactgc aaagaatgcg ct // LOCUS ECOMOR 1500 bp ds-DNA BCT 17-JUL-1990 DEFINITION E.coli morphology and auto-aggregation control protein (mor) gene, complete cds. ACCESSION M34102 KEYWORDS auto-aggregation control protein; morphology control protein. SOURCE E.coli (strain K12) DNA, clones pSRW[220,226]. ORGANISM Escherichia coli Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Facultatively anaerobic rods; Enterobacteriaceae. REFERENCE 1 (bases 1 to 1500) AUTHORS Warne,S.R., Varley,J.M., Boulnois,G.J. and Norton,M.G. TITLE Identification and characterization of a gene that controls colony morphology and auto-aggregation in Escherichia coli K12 JOURNAL J. Gen. Microbiol. 136, 455-462 (1990) STANDARD simple staff_review FEATURES from to/span description pept 203 1120 morphology control protein (put.) signal 133 138 -35 region signal 157 162 -10 region rpt 1135 1164 region of dyad symmetry rpt 1167 1211 region of dyad symmetry BASE COUNT 338 a 364 c 428 g 370 t ORIGIN 1 bp upstream of BamHI site. 1 ggatcctgga gatccgcaaa agttcacgtt ggctttagtt attcgagttg agaaactctc 61 gaaacgggca gtgacttcaa gggttaaaag aggtgccgct ccgtttctgt gagcaattat 121 cagtcagaat gcttgatagg gataatcgtt cattgctatt ctacctatcg ccatgaacta 181 tcgtggcgat ggaggatgga taatgaatat tcgtgatctt gagtacctgg tggcattggc 241 tgaacaccgc cattttcggc gtgcggcaga ttcctgccac gttagccagc cgacgcttag 301 cgggcaaatt cgtaagctgg aagatgagct gggcgtgatg ttgctggagc ggaccagccg 361 taaagtgttg ttcacccagg cgggaatgct gctggtggat caggcgcgta ccgtgctgcg 421 tgaggtgaaa gtccttaaag agatggcaag ccagcagggc gagacgatgt ccggaccgct 481 gcacattggt ttgattccca cagttggacc gtacctgcta ccgcatatta tccctatgct 541 gcaccagacc tttccaaagc tggaaatgta tctgcatgaa gcacagaccc accagttact 601 ggcgcaactg gacagcggca aactcgattg cgtgatcctc gcgctggtga aagagagcga 661 agcattcatt gaagtgccgt tgtttgatga gccaatgttg ctggctatct atgaagatca 721 cccgtgggcg aaccgcgaat gcgtaccgat ggccgatctg gcaggggaaa aactgctgat 781 gctggaagat ggtcactgtt tgcgcgatca ggcaatgggt ttctgttttg aagccggggc 841 ggatgaagat acacacttcc gcgcgaccag cctggaaact ctgcgcaaca tggtggcggc 901 aggtagcggg atcactttac tgccagcgct ggctgtgccg ccggaggcca aacgcgatgg 961 ggttgtttat ctgccgtgca ttaagccgga accacgccgc actattggcc tggtttatcg 1021 tcctggctca ccgctgcgca gccgctatga gcagctggca gaggccatcc gcgcaagaat 1081 ggatggccat ttcgataaag ttttaaaaca ggcggtttaa accgtttaac gcagctaccc 1141 gatagcttcg ccatcgtcgg gtagttaaag gtggtgttga cgaagtactc aatagtgttg 1201 ccgccacctt tctgttccat aatcgcctga ccgatatgaa taatttcggc agcgcgctcg 1261 ccaaagcagt gaatacccag aatctctttt gtttcccgat ggaacaaaat tttcagcgtg 1321 cccacgttca tgccgacgat ttgtgcgcgt gccagatgtt taaactgggc gcggcccact 1381 tcatatggca ctttcattgc ggtcagctgc tgttcggttt tgcccacaga gctgatttcc 1441 gggatggtgt aaataccggt agggatatct tcaatcagat gtgcggtggc ttcgcctttt // LOCUS ECOTGMETY 800 bp ds-DNA BCT 17-JUL-1990 DEFINITION E.coli Met-tRNA-f2 (metY) gene, 5' end. ACCESSION M28401 KEYWORDS transfer RNA-Met. SOURCE E.coli (strain K12) DNA, clones p50L and p50. ORGANISM Escherichia coli Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Facultatively anaerobic rods; Enterobacteriaceae. REFERENCE 1 (bases 1 to 800) AUTHORS Granston,A.E., Tompson,D.L. and Friedman,D.I. TITLE Identification of a second promoter for the metY-nusA-infB operon of Escherichia coli JOURNAL J. Bacteriol. 172, 2336-2342 (1990) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by A.E.Granston, 27-SEP-1989. FEATURES from to/span description tRNA 682 758 Met-tRNA-f2 (metY) anticdn 716 718 Met-tRNA anticodon atg signal 560 565 -35 region promoter P1 signal 584 589 -10 region promoter P1 signal 638 643 -35 region promoter P0 signal 661 666 -10 region promoter P0 mRNA 596 > 800 P1-associated mRNA BASE COUNT 210 a 191 c 190 g 209 t ORIGIN 5 bp upstream of KpnI site. 1 ggtaccagtc acggcgcggc cagcgcgtcg ttttgaaata ggtcaggccg ccggtggtgt 61 tatgaaatgc gccacactga atagcggcaa taccttcggc caccagttgt ttgcggcagt 121 cgatcagacg tgcgttctcc gcgccgtatt ccatggcacg acgagggatc gcatcataat 181 cctcttcgtc tggctggccc aggtttgcag tatatgcata aggaaccgct cccttttgtc 241 gcatccacag cagtgcggca ctggtgtcca gaccgccaga aaaagcgata ccaatacgtt 301 gacctaccgg gagatgcttg agaatcgtcg tcataaaata acaccctgct taattaactg 361 atgatgagcc tggatttccg ctctcactga atttttatgc aaaataaatg agttttcatt 421 taatcatctt ttatcggaga caggaagagt ttagtgtgtt ttttgtaaaa taatgcgctt 481 aagggagagc aggagaaggc aaaagtattc aacaaatgaa agtgaactgg atattcattc 541 acatgattag caataaacgt tgacaaaatg tggcgtggat cactataatg cctgcagatt 601 ttacgtcccg tctcggtaca ccaaatccca gcagtatttg cattttttac ccaaaacgag 661 tagaatttgc cacgtttcag gcgcggggtg gagcagcctg gtagctcgtc gggctcataa 721 cccgaagatc gtcggttcaa atccggcccc cgcaaccact ttcccttaga gtcctttttc 781 aaatatactg tgaagacttc // LOCUS RAT5HT1A 2314 bp ds-DNA ROD 17-JUL-1990 DEFINITION Rat 5-hydroxytryptamine-1a receptor (5-HT-1a) gene, complete cds. ACCESSION J05276 KEYWORDS 5-hydroxytryptamine-1a receptor. SOURCE Rat DNA, clone D. ORGANISM Rattus norvegicus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 2314) AUTHORS Albert,P.R., Zhou,Q.-Y., Van Tol,H.H.M., Bunzow,J.R. and Civelli,O. TITLE Cloning, functional expression, and mRNA tissue distribution of the rat 5-hydroxytryptamine-1a receptor gene JOURNAL J. Biol. Chem. 265, 5825-5832 (1990) STANDARD simple staff_review FEATURES from to/span description pept 123 1391 5-hydroxytryptamine-1a receptor (15-HT-1a) BASE COUNT 528 a 617 c 591 g 578 t ORIGIN 1 bp upstream of BamHI site. 1 ggatcccctc cctctgaaac tccccagaaa agaaggaaga ccctaggggg aggggcgagg 61 gagatcccct tgcagttctt tcccccctcc cttcccccac cccagcggga gtgcaggcag 121 gcatggatgt gttcagtttt ggccagggca acaacaccac agcgtcccag gagccctttg 181 ggaccggcgg caacgttact agcatctccg acgtgacctt cagctaccaa gtgatcacct 241 ctctgttgct gggtactctc attttctgcg cggtgctcgg caatgcctgc gtggttgctg 301 ccatcgccct ggagcgctcc ctccagaatg tggccaacta tctcatcggc tccttggcgg 361 ttactgatct catggtatca gtgctggtgc tgcccatggc tgctctgtac caggtgctca 421 acaagtggac cctgggccag gtcacctgcg acctgtttat cgccctggat gtactgtgct 481 gcacctcgtc catcctgcac ctgtgcgcca tcgcgctaga caggtactgg gctatcaccg 541 accctataga ctatgtgaac aaaaggacgc cccggcgcgc cgctgcgctg atctcgctca 601 cttggctcat tggctttctc atctccatcc cgcccatgct gggctggcgc accccggaag 661 accgatcgga ccccgacgcg tgcaccatca gcaaggacca cggctacacc atctactcca 721 ctttcggcgc tttctatatc ccgctgttgc tcatgctggt tctctacggg cgcatcttca 781 gagccgcacg cttccgaatc cgcaagactg tcaggaaggt ggaaaagaag ggagcaggca 841 ccagcttagg aacttcgtcg gcaccgcccc ccaagaagag cctgaacgga cagccaggta 901 gtggggactg gaggcgctgt gctgagaaca gggcggtggg gactccgtgc actaatgggg 961 cagtgaggca gggtgacgac gaggccaccc tggaggtgat cgaggtgcac cgagtgggca 1021 actccaaaga gcaccttcct ctgcccagcg agtcaggatc taactcctat gcccccgcct 1081 gcttggagag aaaaaatgag cgcaatgctg aagcaaagcg caagatggcc ttggcccgtg 1141 aaaggaagac ggtgaagact ctgggcatca tcatgggcac cttcatcctc tgctggctgc 1201 cgtttttcat tgtggctctg gtcctgcctt tctgtgaaag cagctgccac atgcctgcgt 1261 tgttgggtgc cataattaac tggctaggct actccaactc cctgctcaat ccggttattt 1321 atgcttattt caacaaagac tttcaaaacg cttttaagaa gataatcaag tgcaagttct 1381 gccgccgatg atgatgatga tggtggtggt aatagtggtg actatggaga aggaggagga 1441 ggaggaagag gaggaggaag aagaggaaga ggaggaagag tgtagggttt acaggaccct 1501 ccccattcac tcagtaccct agccctggaa gcaacaccta agataacttg cttttcctag 1561 aacatgcagc ttctcagcac atggctttca gaccttgttc tctctatatc tcgtttcaag 1621 gggacagtac tctttgcaga aatcttagag aaccactagt atctgtgtga aaagttattg 1681 gcttggtctt tggccctagg atcagttttg gtagttgtaa taacatcctc cctattcaaa 1741 atcccaattc ttcacgatgg aattttaaac cttggtgatt caaggtagaa agccgaaaca 1801 gaaaatctaa actgtcctga catctgccag gcccactaga actgtcactc tctgccctac 1861 ttctggaact ttgaggatca agactgctct gtctctcact ccctcttgct ctagaaacat 1921 ctgttccctt tgggtgtcaa tctacactgt tgaactggtc tccatgctgg tctggaaaag 1981 cagttggaag taactactct gggttatgtg ctctggaaat ttttgaatag caaaagatcc 2041 cccccacccc tgcacaacac ccactttctg tatttttttt aaattttttt ttaatttttg 2101 aactcaaaat caagtggcaa aagtgtctgt tggagtggac ttataagtag gtctggctcg 2161 taggaagagc caggaaggtc tgtggatggt ttcttgtcct ggtgttaaca ctcaccaaag 2221 aaactgacaa tcagactatt tggaaggaag agttttattc ctacagtttg ttggcctttc 2281 tgtaagttct ggcaaggctc tgtgtcttct gcag // LOCUS RATFCGRE 1341 bp ss-mRNA ROD 17-JUL-1990 DEFINITION Rat Fc-gamma receptor mRNA, complete cds. ACCESSION M32062 KEYWORDS Fc-gamma receptor. SOURCE Rat (strain Fisher 344) natural killer cell, cDNA to mRNA, clone 6a6. ORGANISM Rattus norvegicus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 1341) AUTHORS Zeger,D.L., Hogarth,P.M. and Sears,D.W. TITLE Characterization and expression of an Fc-gamma receptor cDNA cloned from rat natural killer cells JOURNAL Proc. Natl. Acad. Sci. U.S.A. 87, 3425-3429 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by D.W. Sears, 12-FEB-1990. FEATURES from to/span description pept 50 853 Fc-gamma receptor BASE COUNT 374 a 346 c 319 g 302 t ORIGIN 1 ccagcaattt catctcctag acctcatcag actctgaccc agttcttgaa tgactttgga 61 gacccagatg tttcagaatg cacattctgg aagccaatgg ctactcccac cactgacaat 121 gttgctgctg tttgcttttg cagacaggca gacgggagat cttctgaagg ctgtggtgaa 181 acgtgatccc ccatggatcc aggtgctcaa ggacgacact gtgacgctga cgtgcgaagg 241 gacccacaat cctggaaact cttctaccca gtggttccac aaccagagct ccacctgggg 301 ccaggtccaa gccagctaca cgtttaaggc cacagtcaat gacagtggag aataccggtg 361 ccgaatggcg cacaccagcc tcagcgaccc catacatctg gaagtgattt ctgactggct 421 gctgctccag acccctcaac tggtgtttga ggaaggggaa accatcacat taaggtgcca 481 tagctggaag aacaaacagc tgactaaggt cttgttgttc cagaatggaa aacctgtgag 541 gtattattat caaagcagca atttttcaat ccccaaagcc aaccacagtc acagtgggaa 601 ctactactgc aaagcatatc taggaaggac aatgcatgtg tccaagcctg tcaccatcac 661 tgtccaaggt tcagcaaccg cgtccaccag ctctctagtg tggttccatg ccgctttctg 721 cctagtgatg tgcctcctgt ttgcagtgga caccggcctg tatttctgtg tacggagaaa 781 tcttcaaacc tcgggggagg actggaggaa atccctgtca gtcggaaagt acaaggctcc 841 acaggacaaa tgacatccca tcgtatggct aaacagcggc agcttctttt cagccacacc 901 gccgcttcat ctacagcctt cctttgaaag caacttacca ccaggctgga tatttggtcc 961 tgcaatcaca actgctagga ggctgaggca ggaggatcac caaaggctgc ccgggtttta 1021 gagaagagag agtgcaagtc tatcctggat aacccaataa gaccctgggt ttagatggct 1081 tatcaggaaa gagcacccat tgctaagctc acagacaaga ccacagattg atccctaagg 1141 acacagggca aagggggaga acaaattcct gaaagttgtc ctctgacctc cacacgtcca 1201 ccatggcaga tgcacacaat aaacaaatta aaatgtaata tatatatttt aaacaagagg 1261 tggggaaagg ctgggggttg gtggcacagt ttttatccag gacacatgat gctctggatt 1321 ttgacctcct ataataaaac g // LOCUS YEPYOPYER 1527 bp ds-DNA BCT 17-JUL-1990 DEFINITION Y.pestis virulence determinant (yopE) and yopE regulator (yerA) genes, complete cds. ACCESSION M34279 KEYWORDS virulence detrminant; yopE regulator. SOURCE Y.pestis DNA, clone pYV019. ORGANISM Yersinia pestis Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Facultatively anaerobic rods; Enterobacteriaceae. REFERENCE 1 (bases 1 to 1527) AUTHORS Forsberg,A. and Wolf-Watz,H. TITLE Genetic analysis of the yopE region of Yersinia spp.: Identification of a novel conserved locus, yerA, regulating yopE expression JOURNAL J. Bacteriol. 172, 1547-1555 (1990) STANDARD simple staff_review FEATURES from to/span description pept 476 84 (c) yopE regulator (yerA) pept 670 1329 virulence determinant (yopE) BASE COUNT 426 a 329 c 339 g 433 t ORIGIN 1 bp upstream of BamHI site. 1 ggatccgaca ttactaagtg agcgttgtat ctaatcctgg gggcaggtca ttccgtataa 61 taaggcaaca accaaaaatc tactcaacta aatgaccgtg gtggtgagat tagtgatgag 121 gtttgtagcc gttcagcccc ctgcaccagc atctcaagct gagtatatag tgagttatta 181 tccaggctgt tcaatggttg tcgattccat aacactgggt gccccccaac ctcgtcccag 241 gataagatgg gttttaatat atcttgactg aatatattat ggctaagtaa ggtttccttt 301 tcatcattat tgtcaagaga aggtagggta aacattaata tttgcccgac aggatgctct 361 gttatatggc aggcgaattc cccaactttg acaccgataa ccggttcaat agtatctgga 421 atagacaacg aaagttgttg aaataattga gtgatagctt gttcaaatga atacattatg 481 atctcataat agttagataa aatatcaact taaccaaagc actctcggca gaccatcaat 541 tttagcctat aatttttagt ttttgttttg tctaatataa caacaaaaac agcagcgatt 601 ttttatatag ccatcggcta ttttcccact aagataacct tgttttaata gccaaggtaa 661 taaatagtca tgaaaatatc atcatttatt tctacatcac tgcccctgcc gacatctgtg 721 tcaggatcta gcagcgtagg agaaatgtct gggcgctcag tctcacagca aacaagtgat 781 caatatgcaa acaatctggc cgggcgcact gaaagccctc agggttccag cttagccagc 841 cgtatcattg agaggttatc atcagtggcc cactctgtga ttgggtttat ccaacgcatg 901 ttctcggagg ggagccataa accggtggtg acaccagcac ccacacctgc acaaatgcca 961 agtcctacgt ctttcagtga cagtatcaag caacttgctg ctgagacgct gccaaaatac 1021 atgcagcagt tgaatagctt ggatgcagag atgctgcaga aaaatcatga tcagttcgct 1081 acgggcagcg gccctcttcg tggcagtatc actcaatgcc aagggctgat gcagttttgt 1141 ggtggggaat tgcaagctga ggccagtgcc atcttaaaca cgcctgtttg tggtattccc 1201 ttctcgcagt ggggaactat tggtggggcg gccagcgcgt acgtcgccag tggcgttgat 1261 ctaacgcagg cagcaaatga gatcaaaggg ctggcgcaac agatgcagaa attactgtca 1321 ttgatgtgat atggataaaa acaaggggat agtgtttccc cctttttcta tcaatattgc 1381 gaatatcttc gtccctgatc tttcaggggc gaatcgtttt ttagcatgct cattgttaga 1441 atttctgact tatctctctt ctgtattact actcatgctc tggaaaatcc tgaacatgta 1501 tatctatgga ttgatgcagc actcgag // LOCUS YEPYOPYER1 1762 bp ds-DNA BCT 17-JUL-1990 DEFINITION Y.enterocolitica virulence determinant (yopE) and yopE regulator (yerA) genes, complete cds. ACCESSION M34278 KEYWORDS virulence determinant; yopE regulator. SEGMENT 1 of 2 SOURCE Y.enterocolitica DNA, clone pYV8081. ORGANISM Yersinia enterocolitica Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Facultatively anaerobic rods; Enterobacteriaceae. REFERENCE 1 (bases 1 to 1762) AUTHORS Forsberg,A. and Wolf-Watz,H. TITLE Genetic analysis of the yopE region of Yersinia spp.: Identification of a novel conserved locus, yerA, regulating yopE expression JOURNAL J. Bacteriol. 172, 1547-1555 (1990) STANDARD simple staff_review FEATURES from to/span description pept 609 217 (c) yopE regulator (yerA) pept 803 1462 virulence determinant (yopE) rpt 1468 > 1762 insertion sequence rpt 1464 1467 5' insertion target sequence BASE COUNT 497 a 395 c 382 g 488 t ORIGIN 1 ccggtactcc tcattaccaa tgaggaggcg gtgattccac caagtgaatg ggcttaatac 61 aagtctttta cattaaaaat tgttaaaaaa cgggagataa aagtcaacac tccaacttgg 121 tgttaacttt tactgagcga aatctgatat tgctggcacc acaaatttat aggttatcgc 181 tatttccgta taataaggca acaaccaaaa atctactcaa ctaaatgacc gtggtggtga 241 gattagtgat gaggtttgta gccgttcagc cccctgtacc agcatttcaa gctgagtata 301 tagcgagtta ttatccaggt tgttcaatgg ttggcgattc cataacactg ggtgcccccc 361 gacctcgtcc caagataaga tgggttttaa tatatcttga ctgaatatat tatggctaag 421 taaggtttcc ttttcattat tattgtcaag agaaggcagg gtaaacatta atatttgccc 481 gacaggatgc tctgttatat ggcaggcgaa ttccccaact ttgacaccga taaccggttc 541 aatagtatct ggaatagaca gcgaaagttg ttgaaataat tgagtgatag cttgttcaaa 601 tgaatacatt atgatctcct aatagttaga taaaatatca agttaaccaa agcactctcg 661 gcagaccatc aatttgagcc tataattttt agtttttatt ttgtctaata taacaacaaa 721 aacagcagcg attttttata tagccaccgg ctattttccc actaagataa ccttgtttta 781 atagccaagg gaataaatag tcatgaaaat atcatcattt atttctacat cactgcccct 841 gccgacatca gtgtcaggat ctagcagcgt aggagaaatg tctgggcgct cagtctcaca 901 gcaaaaaagt gagcaatatg caaacaatct ggctgggcgc actgaaagcc ctcagggttc 961 cagcttagcc agccgtatca ctgagaagtt atcatcaatg gcccactctg cgattgaatt 1021 tatcaaacgc atgttctcgg aggggagcca taaaccggtg gtgacaccag cacccacgcc 1081 tgcacaaatg ccaagtccta cgtctttcag tgatagtatc aagcaacttg ctgctgagac 1141 gctgccaaaa tacatgcagc agttgagtag cttggatgca gagacgctgc agaaaaatca 1201 tgatcagttc gccacgggca gcggccctct tcgtggcagt atcactcaat gccaagggct 1261 gatgcagttt tgtggtgggg aattgcaagc tgaggccagt gccatcttaa acacgcctgt 1321 ttgtggtatt cccttctcgc agtggggaac tattggtggg gcggccagcg cgtacgtcgc 1381 cagtggcgtt gatctaacgc aggcagcaaa tgagctcaaa gggctggcgc aacagatgca 1441 tcaattactg tcattgatgt gatatggtga ctagtcctgc tataggttga cacttttcag 1501 ccttaaaacg cctgatgaac ttcatcgggc gttttgtatt ttaacgacag atgcggtcgc 1561 cgcgtgttgt agatttccac tgactccctg accatctttc ttgcctggcc aatatcttcc 1621 ggcttaacca gcagatattc catttttaat atcccattga cccgctccgc cagcgcattc 1681 tgataacaat catatccatc cgtcatcgaa cagatgacac cgtgacgttg atgtaacgcc 1741 tgatattctg ttgaacagta ct // LOCUS YEPYOPYER2 546 bp ds-DNA BCT 17-JUL-1990 DEFINITION Y.enterocolitica DNA with insertion element, 3' end. ACCESSION M34280 KEYWORDS insertion element. SEGMENT 2 of 2 SOURCE Y.enterocolitica DNA, clone pYV8081. ORGANISM Yersinia enterocolitica Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Facultatively anaerobic rods; Enterobacteriaceae. REFERENCE 1 (bases 1 to 546) AUTHORS Forsberg,A. and Wolf-Watz,H. TITLE Genetic analysis of the yopE region of Yersinia spp.: Identification of a novel conserved locus, yerA, regulating yopE expression JOURNAL J. Bacteriol. 172, 1547-1555 (1990) STANDARD simple staff_review FEATURES from to/span description rpt < 1 242 insertion element rpt 243 246 3' insertion target sequence BASE COUNT 144 a 119 c 109 g 174 t ORIGIN About 650 bp after segment 1. 1 gatccgtgac ccagaaagac aaatccgtca ttaacttgcg tagccacttc aggacggtag 61 aacggccttg aataccgttg cggtcttgcg ctgtcggtaa gtcatttccc ttttttcgac 121 ctgttcgaca acggccaatt taaaggatag aggataatgc cgttgcgtgc gtttaacata 181 ttggttcatc acattttcct aagtacatga gttaaatgtg tcaacgctat ttaggacggg 241 tcatggataa aaacaagggg atagtgtttc cccctttttc tctcaatatt gcgaatatct 301 tcgtccctga tctttcaggg gcgaatcgtt ttttagcatg ctcattgtta gaatttctga 361 cttatctctc ttctgtatta ctactcatgc tctggaaaat cctgggcgtc tatattctat 421 ggattgatgc aacactcgag aaatcaaaat atcattgcta agcgttatat agtatatacc 481 gtgcttttta tatgaaaacg gcgcatgtca gagcaaatcc agttacactc agcccctaac 541 tctaga // LOCUS HUMCDR34 2412 bp ds-DNA PRI 17-JUL-1990 DEFINITION Human cerebellar-degeneration-related antigen (CDR34) gene, complete cds. ACCESSION M31423 KEYWORDS cerebellar-degeneration-related antigen. SOURCE Human neuroblastoma BE(2)-88n cell line DNA, clone lambda CDR34. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 2412) AUTHORS Chen,Y.-T., Rettig,W.J., Yenamandra,A.K., Kozak,C.A., Chaganti,R.S.K., Posner,J.B. and Old,L.J. TITLE Cerebellar degeneration-related antigen: A highly conserved neuroectodermal marker mapped to chromosomes X in human and mouse JOURNAL Proc. Natl. Acad. Sci. U.S.A. 87, 3077-3081 (1990) STANDARD full staff_review COMMENT Draft entry and printed sequence for [1] kindly submitted by Y.-T.Chen, 17-JAN-1990 FEATURES from to/span description pept 503 1174 cerebellar-degeneration-related antigen (CDR34) BASE COUNT 743 a 334 c 669 g 666 t ORIGIN 1 atgttggttc ataagatctg gtctataagg aggaatgtcc cattaaatgt ttttgaagct 61 aattcaacta gaagcagaaa tagttgagtt ggaagatttt ctgtagagtg attttaacat 121 gggaaggctc agacagggga agcctagatt tgaaaaggcc tggacctggg gaaaggctgg 181 caagatctgg actatagaac atgttagaat actgatattc gcagacacct ggaagactga 241 atgtcagaag atcagcacac tggagacgtt ggaagacatg gatattgagc cagttgatgg 301 aagactgggt agttgttgga agacatcaag gtgctggaag acacagcagc atgctggaag 361 acctggagat gttggaagac gagcagactc ctggaagccc tggagatgct gcaagacctg 421 gagatatagg aagacactgg actttgttgc gagcttagtt ggaagacata tatttttgga 481 agacgtggat tttctggaag acatggcttg gttggaagac gtggattttc tggaagacgt 541 acctttgttg gaagacatac ctttgttgga agacgtacct ttgttggaag acgtaccttt 601 gttggaagac acaagtaggc tggaagacat taatttgatg gaagacatgg ctttgttgga 661 agacgtggat ttgctggaag acacggattt cctggaagac ctggattttt cggaagctat 721 ggatttgagg gaagacaagg attttctgga agacatggat agtctggaag acatggcttt 781 gttggaagac gtggacttgc tggaagacac ggatttcctg gaagacccgg attttttgga 841 agctatagat ttaagggaag acaaggattt tctggaagac atggatagtc tggaagacct 901 gaggccattg gaagatgtgg attttctgga agacatggct tttttggaag acgtagattt 961 tcaggaagac ccaaattatc cggaagactt ggattgttgg gaagacgtgg attttctgga 1021 agactggagg ttactggaag acatggattt tctggaagac atggattttc tggaagacgt 1081 ggatcttcag gaagacatat attggctgga agacctggat tttttccgga agatgtggat 1141 tgactggaag acctggattt ggtggaagac gtagattttc tggaagacac tgactgactg 1201 gaagacactg attgactgga agacctggat ttctttctgg aagacactga ttgactggaa 1261 gatctagatt tttctggaag aactagattt actggaagac ttggatttgg tggaagacac 1321 agatttttct ggaagacatg gattagctgg aagatctgta tttgatggaa gaccttgaaa 1381 ttattggaag acatggattt cctggaagac gtggattttc ctggaagatc tggatttggt 1441 ggaagaccag taattgctgg aagactggat ttgctggaag acttgattta ctggaagact 1501 tggagcttct tggaagacat ggattgtccg gaagacatgg attgtctgga agatgtggat 1561 tttctggaag ctcaggatta tctggaagac cttgagatta ttggaacact tgaagtcgct 1621 ggaagacccg agttgttgga agaccttgta cacaggtgcc atcggaactc ctgacattga 1681 aacattgtaa gcacaggata ttgagacatt gcaagccttg attttaagac atggtactct 1741 ggacattgat atttctgagg ccctgaacat tgggatatta atattggaag tcatagacac 1801 tgaaatctct ggaaattaga gatattgtaa gtcctgtacc ttggaactcc taaatactgg 1861 cagatataaa caacagcaga tgtagacatt tataaatcct aaaatgagaa gccctggata 1921 ttgggagaca ttggtaagca tggatacttg acatatttat gtcaaaaaga cagtttggaa 1981 gaattaaatt ttaaagatgc tccatgtcaa gaatactggc agcctggaca atatgagacc 2041 aggatattaa gaggtctatt cattcagaca ttgaggatat tgatgtacct gaaagttctt 2101 gcaggtattt aaagacttga gcattggagg aattggcgat aaaaatacac tgtaaaacta 2161 gaaagtagga gacatttaaa aatgtaaaaa ctgaatgatg taagtgctgg aagacattga 2221 agaatctaga agacctgtat ataggagaca ttggaggatt aggaccatgg ccgacttgta 2281 atttagaact ctggattctg aaagacaaga cctggacttt gaagaagggt tgttggagat 2341 attagaagac ctaaattttt aatgacttga atactgggag tttagaaaac aagggcattt 2401 gagatgctgc ag // LOCUS RATHGF 2485 bp ss-mRNA ROD 17-JUL-1990 DEFINITION Rat hepatocyte growth factor mRNA, complete cds. ACCESSION M32987 KEYWORDS hepatocyte growth factor. SOURCE Rat (strain Wistar) adult liver, clones RBC[1,3] and RAC[1,2]. ORGANISM Rattus norvegicus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 2485) AUTHORS Tashiro,K., Hagiya,M., Nishizawa,T., Seki,T., Shimonishi,M., Shimizu,S. and Nakamura,T. TITLE Deduced primary structure of rat hepatocyte growth factor and expression of the mRNA in rat tissues JOURNAL Proc. Natl. Acad. Sci. U.S.A. 87, 3200-3204 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by T.Nakamura, 26-MAR-1990. FEATURES from to/span description pept 143 2329 hepatocyte growth factor BASE COUNT 750 a 533 c 577 g 625 t ORIGIN 1 gtttagtcct agatctttcc agttaatcac acaacaaact tagctcatcg caataaaagc 61 agctcagaac cgaccggctt gcaacaggat tctttcagcc cggcatctcc tgcagaggga 121 tcagcctgct cgaactgcaa gcatgatgtg ggggaccaaa cttctgccgg tcctgttgct 181 gcagcatgtc ctgctgcacc tcctcctgct tcctgtcacc atcccctatg cagaaggaca 241 gaagaagaga agaaatactc ttcatgaatt caaaaagtca gcaaaaacta ctcttaccaa 301 ggaagaccca ttagtgaaga ttaaaaccaa aaaagtgaac tctgcagatg agtgtgccaa 361 caggtgcatc agaaacaagg gctttccatt cacttgcaag gcctttgttt ttgataagtc 421 gagaaaacga tgctactggt atcctttcaa tagtatgtca agtggagtga aaaaagggtt 481 tggccatgaa tttgacctct atgaaaacaa agactatatt agaaattgca tcattggtaa 541 aggaggcagc tataagggga cagtatccat cactaagagt ggcatcaagt gccagccttg 601 gaattccatg atcccccatg aacacagctt tttgccttcg agctatcgcg gtaaagacct 661 acaggaaaac tactgtcgaa atcctcgagg ggaagaaggg ggaccctggt gtttcacaag 721 caatccagag gtacgctacg aagtctgtga cattcctcag tgttcagaag ttgaatgcat 781 gacctgcaac ggtgaaagct acagaggtcc catggatcac acagaatcag gcaagacatg 841 tcagcgctgg gatcagcaga caccacaccg gcacaaattc ttgccggaaa gatatcccga 901 caagggcttt gatgataatt attgccgcaa tcccgatggc aagccgaggc catggtgcta 961 cactcttgac cctgacaccc cttgggagta ttgtgcaatt aaaatgtgcg ctcacagtgc 1021 tgtgaatgag actgatgttc ccatggaaac aactgaatgt ataaaaggcc aaggagaagg 1081 ttacagggga accaccaata ccatttggaa tggaattccg tgtcagcgtt gggattcgca 1141 gtacccccac aagcatgaca tcactcccga gaacttcaaa tgcaaggacc ttagagaaaa 1201 ttattgccgc aatccggatg gggctgaatc accatggtgt tttaccactg atccaaacat 1261 ccgagttggt tactgctctc aaattcccaa atgtgacgtg tcaagtggac aagattgtta 1321 tcgtggcaat gggaaaaact acatgggcaa cttatccaaa acaaggtctg gactcacatg 1381 ttccatgtgg gacaagaata tggaggattt acaccgtcat atcttctggg agccagacgc 1441 tagcaagttg actaagaatt actgccggaa ccccgatgac gacgcccatg gaccttggtg 1501 ctacacaggg aatcctctcg ttccttggga ttattgccct atttcccgtt gtgaaggaga 1561 tactacacct acaattgtca atttggacca tcctgtaata tcctgtgcca aaacaaaaca 1621 actgcgagtt gtaaatggca ttccaacaca aacaacagta gggtggatgg ttagtttgaa 1681 atacaggaat aaacacatct gtgggggatc attgataaag gaaagttggg ttcttactgc 1741 aaggcaatgt tttccagcta gaaacaaaga cttgaaagac tatgaagctt ggcttggaat 1801 ccatgatgtc catgagagag gcgaggagaa acgcaaacag atcttaaaca tttcccagct 1861 agtctatgga cctgaaggct cagatttggt tttactgaag cttgctcgcc ctgcaatcct 1921 ggataacttt gtcagtacaa ttgatttacc tagttatggc tgtacaatcc ctgaaaagac 1981 tacttgcagt atttacggct ggggctacac tggattgatc aacgcagatg gtttattacg 2041 agtagctcat ctgtatatta tggggaatga gaaatgcagt cagcaccatc aaggcaaggt 2101 gactttgaat gagtctgaat tatgtgctgg ggctgaaaag attggatcag gaccttgtga 2161 gggagattat ggtggcccac tcatttgtga acaacacaaa atgagaatgg ttcttggtgt 2221 cattgttcct ggtcgtggat gtgccatccc aaatcgtcct ggtatttttg ttcgagtagc 2281 atattatgca aaatggatac acaaagtaat tttgacatac aagttgtaat agccatagaa 2341 gaggccagtg tatttgaagc atccatggat acaggaagat ttccaagact tcaggattaa 2401 aatgtcacct aaaacaatcc taaaacaact acttgagtgt tgtgagtgtt cagatactca 2461 ttaatatatg tggcgttttc tgttg // LOCUS MUSIMPD 1707 bp ss-mRNA ROD 17-JUL-1990 DEFINITION Mouse IMP dehydrogenase mRNA, complete cds. ACCESSION M33934 KEYWORDS IMP dehydrogenase. SOURCE Mouse, cDNA to mRNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 1707) AUTHORS Tiedemann,A.A. and Smith,J.M. TITLE Isolation and DNA sequence of a cDNA encoding mouse IMP dehydrogenase JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by J.M.Smith 27-APR-1990. Author address: Dr. J.M. Smith Seattle Biomedical Research Institute 4 Nickerson St. Seattle, WA 98109 FEATURES from to/span description pept 67 1611 IMP dehydrogenase (EC 1.2.1.14) BASE COUNT 457 a 408 c 451 g 391 t ORIGIN 1 cctctgcggc gcggtcctcg gagcggcgcg gttctcggag ccacgcgtct gtcttcctcc 61 gtggtcatgg cggactacct gattagcgga ggcacctctt acgtgccgga cgacgggctc 121 acagcgcagc agctcttcaa ctgcggggac ggcctcacct acaatgattt tctcattctt 181 cctgggtata tcgacttcac tgcagatcag gtggacttga cgtctgctct aactaagaag 241 attacactaa agaccccatt ggtttcctca cccatggaca ctgtcacaga ggctggaatg 301 gccatcgcga tggcgcttac aggaggtatt ggtttcatcc accacaactg cacacctgaa 361 ttccaggcca atgaagttcg gaaagtgaag aaatacgaac agggattcat cactgacccc 421 gtggtcctta gccccaagga tcgtgtacgc gatgtttttg aggccaaagc caggcatggc 481 ttctgtggta tccccatcac agatacaggc cggatgggga gtcgattggt gggcatcatc 541 tcctcaaggg acattgattt cctcaaggag gaagagcatg accggttctt ggaagagatc 601 atgactaaga gggaagattt ggtggtcgcc cctgccggcg tcactctgaa agaggcaaat 661 gagattctgc agcgaagtaa aaagggaaag ttgcccattg tgaatgaaaa tgatgagctg 721 gtagccatca ttgcccggac agacctaaag aagaatcgtg attaccccct ggcctccaaa 781 gatgccaaga agcaactgct gtgtggggca gccattggca ctcatgagga tgacaagtat 841 aggctggact tactggccct tgctggtgtg gatgtagtgg ttttggactc ttcccaggga 901 aactccatct tccaaatcaa tatgatcaaa tacatcaagg agaagtatcc cagtctacag 961 gtcattggag gcaatgtagt cactgctgcg caagccaaga acctcataga tgcaggtgta 1021 gatgctttgc gagtcggcat gggaagtggt tccatctgca tcacccagga agtgttggcc 1081 tgtgggcggc cccaagccac agcagtgtac aaggtctctg agtatgcccg tcgctttggt 1141 gttcctgtta ttgctgatgg aggaatccaa aatgtgggtc atattgccaa agctttggct 1201 cttggggctt ccacagtcat gatgggctcc ctcctggctg ccaccactga ggcccctggc 1261 gagtacttct tctcagatgg gatccggctg aagaaatacc gaggtatggg ttctcttgat 1321 gccatggaca aacatctcag cagccagaac cgatacttca gtgaagctga caaaatcaaa 1381 gtggcccaag gagtttcagg ggcagtgcag gacaaggggt ctatccacaa gttcgttcct 1441 tacctgattg ctggcatcca gcattcctgt caagacattg gtgccaagag tttaacccaa 1501 gtcagagcca tgacgtactc gggggagctt aaatttgaga agaggacatc ctctgctcag 1561 gtggaaggtg gcgtccacag cctccattcg tacgagaaac ggcttttctg aaaacagatc 1621 cagtatatgc cttgaatttt tcaataaaag tttgggaaaa aaaaagtgaa aaaaaaaaaa 1681 aaaaaaaaaa aaaaaaaaaa aaaaaaa // LOCUS NEUPSAS 3681 bp ds-DNA PLN 17-JUL-1990 DEFINITION N.crassa arginine-specific carbamoyl phosphate synthetase (arg-2) small subunit gene, complete cds. ACCESSION J05512 KEYWORDS carbamoyl phosphate synthetase. SOURCE N.crassa (strain 74-OR23-IVA)asexual phase DNA, clone pAR231, and cDNA to mRNA, clone pAB1. ORGANISM Neurospora crassa Eukaryota; Plantae; Thallobionta; Eumycota; Ascomycotina; Pyrenomycetes; Sordariales; Sordariaceae. REFERENCE 1 (bases 1 to 3681) AUTHORS Orbach,M.J., Sachs,M.S. and Yanofsky,C. TITLE The Neurospora crassa arg-2 locus: Structure and expression of the gene encoding the small subunit of arginine-specific carbamoyl phosphate synthetase JOURNAL J. Biol. Chem. 265, 10981-10987 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by M.S.Sachs, 07-MAY-1990. FEATURES from to/span description pept 1451 1630 arginine-specific carbamoyl phosphate synthetase small subunit, exon 1 (arg-2) (EC 6.4.5.5) 1687 2868 arginine-specific carbamoyl phosphate synthetase small subunit, exon 2 pept 870 902 ORF, exon 1 1270 1282 ORF, exon 2 1359 1387 ORF, exon 3 pre-msg 752 3082 arg-2 mRNA and introns pre-msg 809 3082 arg-2 mRNA and introns (alt.) IVS 1631 1686 arg-2 intron A IVS 903 1269 ORF intron A IVS 1283 1358 ORF intron B signal 161 166 TGACTC box signal 568 573 TGACTC box signal 1008 1013 TGACTC box signal 1307 1312 TGACTC box BASE COUNT 760 a 1093 c 863 g 965 t ORIGIN 1 ggatccagac gctatatttg attgttccct cttctgtcag cgtgcggcct ccgttgtccg 61 tgcagccgat tacgacggcc ggctctggcg caacctgggt ttgggagctt cagaagcttg 121 aaagcgaggc gaatcgcgac tgtacaagtc cccgcaactt gagtcaccga aagcgcgggg 181 tatccaccat ctttatctgc ggcggtaggt aggcggggac tttgcggggg ctttgcggga 241 cccgaccact ccaactccga gattcgttgg gggtgccccg ggcaggagat gccgagctgg 301 agtcgcgttg tttgtgaatg tcgtggctgg ggttgtgggc ataggttgcg atgtgtagtg 361 tacgtagtac atagtaggta tgttgaacac cggatgggtg attccagttg tgaggctccg 421 ggactagtag gggtggtaac cgtgatggtc aaagggcatg tggaacttgc agcgtcccaa 481 cgaagtccag gtccccagtc atcccaacgc cgcgatcaat cacccctcgg tccccgtaaa 541 tcggcctttg ggactgccta gcagcgggag tcaccgagtc ctggaccaat caggtcggcg 601 ctgaccctgt cactgcggca gaaaaaagac aaagagctca aaaaattcta ccagatccaa 661 tcaaatgtcg ccctttttct atttgcctca cccactttcg caccctcttt tatctccaag 721 tttctcccct cgacactaca actcactccg tcatcaactc ttgcccatct acaaaacaca 781 caataacccc aatcgcttcc cccgactttc ggactctttc gtccaacctt ctttcttcag 841 ctgaacttgt cttgtcgcaa tctgccctta tgaacgggcg cccgtcagtc ttcacctctc 901 aggtacgata acgcgatagc ccctccatcg tgctggtatc cagccaccat ccacccagtc 961 catcaaagaa gttcccccgg atgagcaatg aagcccccat accagcctga ctcgcgatgc 1021 tgtatcggag tagttgggac gacccgccag taacccacca acctgggtgc ttggcatcat 1081 gccgctctga gttggttttg ggagatgtgg ccagcggtga tccccaaagt tcagctcttc 1141 acggattccg attgcatatc tctcaagatc gcgcagacag ccatcctggg ggagacttct 1201 tctcctcgct tctgccgctt gccgtctgtt tgtagtgtct gatcaaaact aattctgttg 1261 cctttgtagg attacctctc aggtccgcat ccccgccgac tcccgttgac tcaccctttt 1321 ccggctggaa tactgacgat gaattttgtg tgttacagac catctgtgga gagcccttaa 1381 cgcataagag cctctcatca cccagcagcc gtaccaatca ccaccgcacc ccatcaccat 1441 tcaagtcaag atgttctctc gcttggccgc tcgtctcccc aaggcttctg ccctcaatgg 1501 cgtcgccgcc cgtcaggttc gcaacctgag ccagcccgcc attaccggca gcaagggcag 1561 gaacatgccc gcccgtgagc cgcgcactac tgccgctgcc accggcgccg aggccacctt 1621 cactatcagg gtgcgtacct acaactagtc ttctctcgtg gctaaaccta acattttctc 1681 gctcaggatg gccccgtttt ccagggtacc gccttcggtg ccaacaccaa catttctggt 1741 gaagccgttt tcaccacctc ccttgttggt taccccgagt cgatgaccga tccctcgtac 1801 cgcggccaga ttttggtctt cacccagccc ttgattggca actacggtgt cccttcgaat 1861 gagcgtgacg agttcaacct cctaagtact tcgagtctcc ccacatccag tgcgccggta 1921 tcgttgtctc cgatgtcgct acccagtaca gccactggac tgctgttcag agcttgggcg 1981 agtggtgcgc cagcgagggc attcccgcca ctctccggtg ttgacactcg tgcgattgtc 2041 acctacctcc gtgagcaggg ttcctctctc gccaggatct ccattggcga cgagtacgac 2101 gccgatgagg atgagggctt cattgaccct ggccagatca accttgtcaa gcgcgtgtcc 2161 accaaggctc cctttgtggt taccaacccc aacgccaagt tccacgtcgc tctgatcgac 2221 tgcggtgtca aggagaacat ccttcgtagc ttggtcagcc gtggcgcctc cgtcaccgtc 2281 ttcccctaca attaccccat ccacaaggtt gccgagaact tcgatggtgt cttcatttcc 2341 aacggccctg gtgatcccac ccactgccag gagactgttt acaacctcgc caagcttatg 2401 gagacttccc ctattcccat catgggtatc tgccttggtc accagcttct tgctctcgct 2461 gttggtgcca agaccatcaa gctaagtatg gtaatccgtg cccacaacat cccggctctt 2521 gacttgacca ctggtcagtg ccacatcacc tcgcagaacc acggttacgc cgtcgatatc 2581 agcaccctcc ctagcgactt caaggagtat ttcgtcaacc ttaacgacgg ctccaacgag 2641 ggtatgatgc acaagacccg ccccatcttc tctacccagt tccaccccga ggctaagggt 2701 ggtcccatgg actcttccta cctctttgac aagtacatgg agaatgtcga gcttttcaag 2761 agcaactccc aggtctaccg tgacaacagg cctactcagt tcatgattga cattctcagc 2821 aaggagcgtg tcggcgttga gcccactcct ctttctaacg ctgcttaacg gacgagaatg 2881 tctttgcatt ttcttagtga acggtgtttg tcattggtct tgggttattg ttttccagca 2941 caaaggatat gggttttgta aaagggactt tcttgggcga cgtcctctcg ttattaatgg 3001 tttagggtgt atatgatatc ttttgttact acttttgaat gcagaatcag ataagtttag 3061 tcggcattag gctgatatgg cgcagagagg acaatatcta gttgatacat acccatgtgc 3121 tatcacgtct aaacgtcttc cagtcgttgt attttaccca ttcctggtga ttgccgtatg 3181 tcttccagtc cagtacctcc cctaattcgg tgcccttcaa cattcagggt tatgttgaca 3241 gcgtgctgta taagcatggt tgccggggaa catggtgtca ttgactggta agcagggcag 3301 cctctggttc tgtgacacct ggttttccct ggtcttgatt tcaaactatt atcaaacaaa 3361 tcagcgtcgt cttagtagat ggcaggtatc ttagaacaga aagtagttat tcttcgtcgg 3421 atacgactcc aaaatcatat acgaacaagt ggatgcttta tactatgctt ctttttactg 3481 taatcttggt ctcgctcttc gcttagcctt cttcacctgg taaccggtat aggtaccttc 3541 gcaccctcaa ttggaaggat gcatcaaccc ctccatgcgc tgcttatcct gctatctatc 3601 catacatttc tatgttttta cagtgtagaa tatttcttgt gtatcagggt ccgtcccgcg 3661 atgctacccc ctgtcctgca g // LOCUS HUMOSTRO 1425 bp ss-mRNA PRI 17-JUL-1990 DEFINITION Human osteopontin mRNA, complete cds. ACCESSION J04765 KEYWORDS osteopontin; phosphoprotein. SOURCE Human adult osteoblast, cDNA to mRNA, clone HOP-10. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 1425) AUTHORS Young,M.F., Fisher,L.W., McBride,O.W. and Termine,J.D. TITLE cDNA cloning, chromosomal location and RFLP analysis of human osteopontin JOURNAL Genomics (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by M.F.Young, 04-MAY-1990. FEATURES from to/span description pept 68 1012 osteopontin precursor /hgml_locus_uid="LK0223T" /nomgen="OPN" /map="unassigned" sigp 68 115 osteopontin signal peptide matp 176 1012 osteopontin mRNA < 1 1425 osteopontin mRNA BASE COUNT 449 a 293 c 296 g 387 t ORIGIN Chromosome 4q13. 1 gaccagactc gtctcaggcc agttgcagcc ttctcagcca aacgccgacc aaggaaaact 61 cactaccatg agaattgcag tgatttgctt ttgcctccta ggcatcacct gtgccatacc 121 agttaaacag gctgattctg gaagttctga ggaaaagcag ctttacaaca aatacccaga 181 tgctgtggcc acatggctaa accctgaccc atctcagaag cagaatctcc tagccccaca 241 gaatgctgtg tcctctgaag aaaccaatga ctttaaacaa gagacccttc caagtaagtc 301 caacgaaagc catgaccaca tggatgatat ggatgatgaa gatgatgatg accatgtgga 361 cagccaggac tccattgact cgaacgactc tgatgatgta gatgacactg atgattctca 421 ccagtctgat gagtctcacc attctgatga atctgatgaa ctggtcactg attttcccac 481 ggacctgcca gcaaccgaag ttttcactcc agttgtcccc acagtagaca catatgatgg 541 ccgaggtgat agtgtggttt atggactgag gtcaaaatct aagaagtttc gcagacctga 601 catccagtac cctgatgcta cagacgagga catcacctca cacatggaaa gcgaggagtt 661 gaatggtgca tacaaggcca tccccgttgc ccaggacctg aacgcgcctt ctgattggga 721 cagccgtggg aaggacagtt atgaaacgag tcagctggat gaccagagtg ctgaaaccca 781 cagccacaag cagtccagat tatataagcg gaaagccaat gatgagagca atgagcattc 841 cgatgtgatt gatagtcagg aactttccaa agtcagccgt gaattccaca gccatgaatt 901 tcacagccat gaagatatgc tggttgtaga ccccaaaagt aaggaagaag ataaacacct 961 gaaatttcgt atttctcatg aattagatag tgcatcttct gaggtcaatt aaaaggagaa 1021 aaaatacaat ttctcacttt gcatttagtc aaaagaaaaa atgctttata gcaaaatgaa 1081 agagaacatg aaatgcttct ttctcagttt attggttgaa tgtgtatcta tttgagtctg 1141 gaaataacta atgtgtttga taattagttt agtttgtggc ttcatggaaa ctccctgtaa 1201 actaaaagct tcagggttat gtctatgttc attctataga agaaatgcaa actatcactg 1261 tattttaata tttgttattc tctcatgaat agaaatttat gtagaagcaa acaaaatact 1321 tttacccact taaaaagaga atataacatt ttatgtcact ataatctttt gttttttaag 1381 ttagtgtata ttttgttgtg attatctttt tgtggtgtga ataag // LOCUS MUS25OAS 1412 bp ss-mRNA ROD 17-JUL-1990 DEFINITION Mouse 2'-5' oligo A synthetase mRNA, complete cds. ACCESSION M33863 KEYWORDS 2'-5' oligo A synthetase. SOURCE Mouse cell line L-929, cDNA to mRNA, clone L3. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 1412) AUTHORS Coccia,E.M., Nissim,A. and Chebat,J. TITLE A full-length murine 2-5A synthetase cDNA transfected in NIH-3T3 cells impairs EMCV but not VSV replication JOURNAL Virology (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by E.M.Coccia, 26-APR-1990. FEATURES from to/span description pept 36 1139 2'-5' oligo A synthetase signal 1389 1394 poly-A signal BASE COUNT 340 a 371 c 390 g 311 t ORIGIN 1 ccaggctggg agacccagga agctccagac ttagcatgga gcacggactc aggagcatcc 61 cagcctggac gctggacaag ttcatagagg attacctcct tcccgacacc acctttggtg 121 ctgatgtcaa atcagccgtc aatgtcgtgt gtgatttcct gaaggagaga tgcttccaag 181 gtgctgccca cccagtgagg gtctccaagg tggtgaaggg tggctcctca ggcaaaggca 241 ccacactcaa gggcaggtca gacgctgacc tggtggtgtt ccttaacaat ctcaccagct 301 ttgaggatca gttaaaccga cggggagagt tcatcaagga aattaagaaa cagctgtacg 361 aggttcagca tgagagacgt tttagagtca agtttgaggt ccagagttca tggtggccca 421 acgcccggtc tctgagcttc aagctgagcg ccccccatct gcatcaggag gtggagtttg 481 atgtgctgcc agcctttgat gtcctgggtc atgttaatac ttccagcaag cctgatccca 541 gaatctatgc catcctcatc gaggaatgta cctccctggg gaaggatggc gagttctcta 601 cctgcttcac ggagctccag cggaacttcc tgaagcagcg cccaaccaag ctgaagagtc 661 tcatccgcct ggtcaagcac tggtaccaac tgtgtaagga gaagctgggg aagccattgc 721 ctccacagta cgccctagag ttgctcactg tctttgcctg ggaacaaggg aatggatgtt 781 atgagttcaa cacagcccag ggcttccgga ccgtcttgga actggtcatc aattatcagc 841 atcttcgaat ctactggaca aagtattatg actttcaaca ccaggaggtc tccaaatacc 901 tgcacagaca gctcagaaaa gccaggcctg tgatcctgga cccagctgac ccaacaggga 961 atgtggccgg tgggaaccca gagggctgga ggcggttggc tgaagaggct gatgtgtggc 1021 tatggtaccc atgttttatt aaaaaggatg gttcccgagt gagctcctgg gatgtgccga 1081 cggtggttcc tgtacctttt gagcaggtag aagagaactg gacatgtatc ctgctgtgag 1141 cacagcagca cctgcccagg agactgctgg tcaggggcat ttgctgctct gctgcaggcc 1201 catgacccag tgagggaggg ccccacctgg catcagactc cgtgcttctg atgcctgcca 1261 gccatgtttg actcctgtcc aatcacagcc agccttcctc aacagattca gaaggagagg 1321 aaagaacaca cgcttggtgt ccatctgtcc acctgttgga aggttctgtc tgacaaagtc 1381 tgatcaacaa taaaccacag caggtgccgt ca // LOCUS ACARGA 1212 bp ds-DNA INV 17-JUL-1990 DEFINITION A.castellanii 5S RNA. ACCESSION M34003 KEYWORDS 5S RNA. SOURCE A.castellanii (strain Neff) trophozoite DNA, clone pAc5S.1. ORGANISM Acanthamoeba castellanii Eukaryota; Animalia; Protozoa; Sarcomastigophora; Sarcodina; Rhizopoda; Lobosa; Gymnamoeba; Amoebida; Acanthopodina; Acanthamoebidae. REFERENCE 1 (bases 1 to 1212) AUTHORS Zwick,M.G., Wiggs,M. and Paule,M.R. TITLE Cloning, sequencing and genomic organization of a 5S RNA gene from the eukaryotic protist Acanthamoeba castellanii JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by M.R.Paule, 04-MAY-1990. Author address: M.R.Paule Colorado State University, Dept. Biochemistry, Fort Collins, CO 80523 FEATURES from to/span description RNA 489 610 5S RNA precursor RNA 489 607 5S RNA BASE COUNT 195 a 374 c 329 g 314 t ORIGIN 1 ctgcaggaga ttcgtcgggc tcgtgccgcg atcgtcttga tgatcgagac cgacggcccg 61 ttcctcactc cgcgcacctt gcgccttgcc cgcgccgcaa cgagccgtct tacctcaccc 121 acgtcctaca gagtacttac atcctccaat gtgttgtgtg cgtgtcatgt gcccaatgta 181 tgtggtgtgt gcggtgtgtg cggtgtgtga ggtgtgtggt gtgcgcgtgt catgtctgtt 241 acacgttctg ttgaccaaca ttttttatgg tgtgtacagc ggtcgccgac tgcaccggac 301 gatcggtaga ggaggtggcc ggtgaaacca cgcagaccgc ggtgcgcttc ttcggtctgg 361 accaaatccg taaggagtga atgtagcaca cccctcccct ccatgtcatt gaagcccaag 421 ttagtgtgca atttaaatga tggtgtgacc aagagggatc acatgggcga agtccctggt 481 aggctgatgg atacggccat actgcgcaga aagcaccgct tcccatccga acagcgaagt 541 taagctgcgc caggcggtgt tagtactggg gtgggcgacc acccgggaat ccaccgtgcc 601 gtatcctcct ttttgcttct tcactctcgc cactattgct tcaggtcgcc tcatcgcctg 661 ccacagctga ctacaaactg ggtgagccaa acctgccttt tgcactggtc cagtgtacaa 721 ggctggaaca actttgtttc acacttgctg aggtcaatca atccagcctt gttggttgag 781 gccacaggag ggatctgcca cctttcacac catctcgtcg agaaggctgc ttgacgcttc 841 ttcgctgtcc ttcggggctg atgtctcgac tgtgtgtccg tcttcatcgc gattgtcact 901 tgcgttgttg ttgttgcggc tttcgtccct gtcctcatcg ccgtcgtcgt cttcatcgct 961 gttgtcgcgc gactcactcc gcaaaccgtc tggccgtgtg ctgctcctcc tcttcatcgt 1021 tgctgtcatc tcggtcgctg cggccacgcc tcgaagaaga cagcccgccc tcttcggccg 1081 cctcggtccc gcccctcgca aagcggatct cgcctcgtcg gctgccgctg gccgtcgctc 1141 cgcctggcgt cgttgtcttc cctcgcatcg tcctcaaact tgtcgtgcgt ccacttgtcg 1201 gcaaatgtcg ac // LOCUS ENTOMPX 664 bp ds-DNA BCT 17-JUL-1990 DEFINITION E.cloacae outer membrane protein X (OmpX) gene, complete cds. ACCESSION M33878 KEYWORDS outer membrane protein X. SOURCE E.cloacae (strain 2249-1) DNA. ORGANISM Enterobacter cloacae Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Facultatively anaerobic rods; Enterobacteriaceae. REFERENCE 1 (bases 1 to 664) AUTHORS Stoorvogel,J., van Bussel,M.J.A.W.M., Tommassen,J. and van de Klundert,J.A.M. TITLE Molecular characterization of an Enterobacter cloacae outer membrane protein OmpX JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by J.Stoorvogel, 26-APR-1990. Author address: J.Stoorvogel Dept of Medical Microbiol, Univ Hospital, P.O.Box 9600, 2300 RC Leiden, THE NETHERLANDS FEATURES from to/span description pept 83 601 outer membrane protein X precursor sigp 83 151 outer membrane protein X signal peptide matp 152 598 outer membrane protein X signal 18 25 -35 region signal 43 48 -10 region binding 74 77 ribosome binding site signal 620 649 transcription termination signal BASE COUNT 171 a 156 c 162 g 175 t ORIGIN 1 atcgtgaatg acaaagagtt tcacagattt ttttgatata tttaaaactt acggacttac 61 ttgaagcaca tttgaggtgg ttatgaaaaa aattgcatgt ctttcagcac tggcagctgt 121 tctggctgtt tccgcaggta ccgctgtagc ggcaacttct actgtaactg gtggttacgc 181 tcagagcgat atgcagggcg tgatgaacaa aaccaacggt ttcaacctga agtaccgtta 241 cgagcaagac aacaacccgc tgggtgtgat cggttctttc acttacaccg agaaagatcg 301 tactgaaaat ggctcttaca ataaaggtca gtactacggc atcaccgcgg gtcctgctta 361 ccgcctgaat gactgggcaa gcatctacgg tgttgtaggt gttggctacg gtaaattcca 421 gcagaccgaa aaccagggtc tgaaccgtac tgcaagcaac agcgactacg gtttctccta 481 tggcgcaggt atgcagttca acccaatcga aaacgttgct ctggacttct cctatgagca 541 gagccgtatc cgcaacgttg acgttggcac ctggatcgcg ggcgtaggtt accgcttcta 601 atcacttcgg tgagccagta aaaaatccgc ccaatgtggc ggattttttt tgtttgtacc 661 ctct // LOCUS NEUMPPX 2038 bp ss-mRNA PLN 17-JUL-1990 DEFINITION N.crassa matrix processing peptidase (MPP) mRNA, complete cds. ACCESSION J05484 KEYWORDS matrix processing peptidase. SOURCE N.crassa, cDNA to mRNA. ORGANISM Neurospora crassa Eukaryota; Plantae; Thallobionta; Eumycota; Ascomycotina; Pyrenomycetes; Sordariales; Sordariaceae. REFERENCE 1 (bases 1 to 2038) AUTHORS Schneider,H., Arretz,M., Wachter,E. and Neupert,W. TITLE Matrix processing peptidase of mitochondria: Structure-function relationships JOURNAL J. Biol. Chem. (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by W.Neupert, 17-APR-1990. FEATURES from to/span description pept 41 1774 matrix processing peptidase BASE COUNT 439 a 626 c 552 g 421 t ORIGIN 1 cccacattac gctgccgcat cacaattcct tgttgcagcc atgctgaatc gcttccggcc 61 agcgcggcta gtagcccaat cctccagatg cttgcccttg acgagggcgc gggcaggtcc 121 cttgcccgtt aacaatgcca ggactttggc tacgagagcc gctgctgtca acaccaagga 181 accgaccgaa cgcgacaaca tcaccactct ctccaatggt gtccgtgtcg cttccgagga 241 ccttcccgat gccttctccg gtgtaggtgt ctacatcgac gcggggtccc gatatgagaa 301 cgactatgtc cggggtgcca gtcacatcat ggaccggcta gccttcaagt ctacaagtgc 361 gaggactgcg gacgaaatgc tcgaaactgt tgagaagctc ggtggtaaca ttcagtgcgc 421 ttcttcgcgc gagtctatga tgtaccaggc ggccaccttc aacaaggcta ttcccaccgc 481 tgttgagctc atggccgaga ccatccgcga tcccaagctt acggacgagg agctggaggg 541 acagatcatg acggcgcaat atgaggtcaa cgagatctgg tccaaggccg aactgatcct 601 gcccgagttg gtgcacatgg ctgccttcaa ggacaacact cttggcaacc cgttgctttg 661 tcccaaggag aggttggatt acatcaaccg ggatgtcatc caaacatacc gcgacgcttt 721 ctacaggccc gagcgccttg ttgttgcctt tgctggtgtg cctcatgaga gggccgtcaa 781 gctcgcagag aagtactttg gtgatatgaa ggcctccgat gctcccggtc tctcgaggac 841 aggttccgaa acctccgtcg actcgctagt gtccgagtcc agcgaggcct cgagtgaatc 901 ttcatcatcc tcctcggact cttccgagtc gagtggcggg ctgctctcca agcttttctc 961 tcccaaggcc aagaaagcca cccccaaccc cttcctcacc cgggtaccta ttagcaccga 1021 agacttgact cggcctgctc actacacagg cggtttcctc accctcccat cacagccccc 1081 accgctcaac cccaaccttc ccacatttac tcacatacag ctcgccttcg agggcctcgc 1141 catctcggac gacgacatct acgccctcgc caccctgcag accctcctcg gcggcggcgg 1201 ctccttctct gccggcggtc ccggcaaggg catgtactcg cgtctctaca ctaacgttct 1261 caaccagcac ggctgggttg agtcctgcgt ggccttcaac cactcataca cggactcggg 1321 tctcttcggc atcgccgcct cgtgctaccc gggtcgcacc ctgcccatgc tccaggtcat 1381 gtgccgcgag ctgcacgccc tcaccaccga ccatggctac tcggccctgg gcgagctcga 1441 ggtttcgcgc gccaagaacc agctccgcag cagcctcctg atgaacctcg agagccgcat 1501 ggtcgagctc gaggatctgg gccgccaagt tcaggttcac ggtcgcaaga tcccggtccg 1561 cgagatgacg cgccgtatca acgagctgac ggtcaaggac ctccgaaggg tcgctaagcg 1621 cgtggttggt ggcatggcga ataacgccgg ccagggaagc ggtgcgccga cggtggtgct 1681 gcaggaggcg acggtgcaag gactcaagac tacggagctg gggtgggatc agatccagga 1741 tacaattgct cagtggaagc tcggtagacg gtaaacgttt gtcaagggga aaaaaagagt 1801 agggcgtgga gaagttatgt aagaggagcg ctgtattgaa cttggcgaca cgcacacacc 1861 ggaacgataa aggcgtttta ggttccccac gagcataggg aagaggctag atggttgctc 1921 tgtacaatcg caacttttct tggtgagtta tacaagatgt gtccaggtac atctttgcct 1981 taccatactg tacgatagca atgaagattt tctgatatat caaaagtcaa aagtcaaa // LOCUS XELXOTCH 9163 bp ss-mRNA VRT 17-JUL-1990 DEFINITION X.laevis Xotch protein mRNA, complete cds. ACCESSION M33874 KEYWORDS Xotch. SOURCE X.laevis, cDNA to mRNA. ORGANISM Xenopus laevis Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Amphibia; Lissamphibia; Anura; Archeobatrachia; Pipoidea; Pipidae. REFERENCE 1 (bases 1 to 9163) AUTHORS Coffman,C., Harris,W. and Kintner,C. TITLE Xotch, the Xenopus homolog of the Drosophila notch gene JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by C.R.Coffman, 26-APR-1990. Author address: C.R.Coffman University of California Dept. of Biology-B022, La Jolla, CA 92093 FEATURES from to/span description pept 346 7917 Xotch protein mRNA < 1 9163 Xotch mRNA signal 9146 9151 poly-A signal rpt 409 4632 EGF-like repeats rpt 4684 5025 lin-12/Notch repeats rpt 5953 6591 cdc-10/SWI-6 repeats BASE COUNT 2499 a 2192 c 2188 g 2284 t ORIGIN 1 gaattccgga cgcctggtgg aactaaacag actggaatgc actggaacac ggaagcctat 61 gcggggatac attgtagcgc tgagcccttc cgctgtaaca ttgtagcgct gagccgctca 121 tacactgtag cgctgtaact tcaggctgtg cgcaaactga gtgcgctatt ctctctccaa 181 tctggaaact ctgtcgcttt tggaccaggg attgcactga cttggattat cttcaccttt 241 ctctaaggga tttatagatt tttgccttgg gacctttatc ctccccttgt cctgggacag 301 ggaccttcat caggatctat gttcctatag tgagactgca ggaaaatgga tcggataggg 361 ctggctgtgc tgctctgctc cctgcctgtc ctcacacaag gtcttcgatg cactcagact 421 gcagaaatgt gcctgaacgg tgggaggtgc gaaatgaccc ctggtggaac tggggtgtgc 481 ctatgcggta atttgtattt cggtgaaagg tgccagttcc ccaatccctg caccataaag 541 aatcagtgta tgaactttgg aacctgcgag cctgtgctac aggggaacgc catagacttc 601 atctgccact gccccgtggg tttcaccgac aaagtgtgcc tgactcccgt ggataacgcc 661 tgcgtcaaca acccgtgccg gaatggagga acctgtgaac ttttgaatag cgtgactgaa 721 tacaaatgca ggtgtccccc aggatggaca ggggattcgt gccagcaggc agatccatgt 781 gcctctaatc catgtgccaa tggtgggaag tgtttgccct ttgagatcca gtatatctgc 841 aagtgtcccc ctggattcca tggagctacc tgcaagcagg acattaatga gtgcagccaa 901 aatccctgca agaatggcgg ccagtgcatc aatgagtttg gatcataccg ctgcacctgt 961 cagaacaggt ttactggaag aaactgtgat gagccatatg tcccctgcaa cccatcgccg 1021 tgcttgaatg gagggacctg ccgccaaact gatgacacct cctatgattg cacttgtctt 1081 ccaggcttta gtggacaaaa ttgtgaagag aacattgacg actgcccgag caacaactgc 1141 cgcaatggtg ggacctgtgt ggatggagtc aacacctata actgccagtg tccacctgac 1201 tggacaggcc aatactgcac ggaggatgtg gacgaatgcc aactgatgcc aaacgcttgc 1261 caaaatgggg gaacctgcca caacacctac ggtggataca actgtgtttg tgtcaacggc 1321 tggaccggag aggactgcag cgagaacatt gacgattgtg ccaacgccgc ctgccatagt 1381 ggtgccacct gtcatgatcg cgtggcatct ttctactgcg agtgcccaca cggccggaca 1441 ggtctccttt gccacctgga taatgcctgt atcagcaatc cctgcaacga aggctccaac 1501 tgtgacacca acccagtcaa cgggaaagcc atttgcacct gcccacctgg gtacactggt 1561 cctgcttgca acaatgatgt ggatgagtgc tctctgggtg ccaatccatg tgagcatggt 1621 gggagatgta ccaacaccct gggttccttc cagtgcaact gcccccaggg atatgcagga 1681 ccccgctgcg agattgatgt caatgaatgt ctttccaacc cgtgccaaaa tgattccaca 1741 tgcctggacc agattggaga gtttcagtgt atatgtatgc caggttacga aggcctttac 1801 tgtgaaacca acatagatga atgtgccagc aacccctgcc tacataatgg caagtgcatt 1861 gataaaatca acgagttccg ctgcgactgt cccacaggct tcagtggaaa cctctgccag 1921 catgactttg atgagtgtac cagcactccg tgcaaaaatg gagccaaatg cttggatggt 1981 ccaaattctt acacctgcca atgtacagaa ggtttcaccg ggcgccactg cgaacaagac 2041 atcaatgaat gcattcctga tccgtgccac tacggcacct gcaaagacgg cattgctacc 2101 ttcacatgcc tgtgccggcc tgggtacacg gggcgtctgt gtgataatga catcaatgag 2161 tgcctgagca agccgtgcct aaacgggggg cagtgtacag accgagagaa tggatacata 2221 tgcacttgtc cgaaaggaac cacaggggtg aattgcgaaa ccaaaataga tgactgtgcc 2281 agtaacctgt gcgataatgg aaagtgcatt gacaagattg atggctacga gtgcacctgc 2341 gagcctggtt acacaggcaa gttgtgcaat attaacatta acgaatgtga cagcaatcca 2401 tgccgcaatg gagggacttg caaagaccag atcaatggct tcacatgtgt gtgcccagat 2461 ggctaccatg accacatgtg cctgtctgaa gtaaatgaat gcaacagcaa tccatgtatc 2521 cacggggcgt gccatgatgg agtcaatggg tataaatgtg actgtgaagc aggatggagc 2581 ggtagcaatt gtgacatcaa taacaatgag tgtgaatcca acccgtgtat gaatggtggt 2641 acctgcaaag acatgaccgg agcatacatc tgtacttgca aagcaggatt tagtgggcca 2701 aactgccaaa ctaacatcaa cgaatgttcc tctaacccct gcttgaacca tggaacgtgt 2761 attgatgatg ttgctggtta caaatgcaac tgcatgttac cttatacagg tgccatttgt 2821 gaggcagttt tggctccgtg tgcaggcagt ccttgtaaaa atggtggcag atgtaaagaa 2881 tcagaagact ttgaaacctt ctcttgtgaa tgtccacccg ggtggcaagg gcaaacatgt 2941 gagattgaca tgaatgaatg cgtaaatcgc ccttgccgta acggtgcaac gtgtcaaaac 3001 accaatggca gctacaagtg caactgcaaa cctggttaca ctgggcggaa ctgcgagatg 3061 gacattgatg actgccaacc aaatccttgt cacaatggag gatcctgttc tgacggcatt 3121 aatatgtttt tctgtaactg cccggccggg ttccgggggc ctaagtgcga agaggacatt 3181 aacgagtgtg ccagtaaccc gtgcaaaaat ggagccaact gcactgattg tgtaaacagc 3241 tacacatgca cctgccagcc tggatttagt ggcatccact gtgaaagcaa cacccctgac 3301 tgcaccgaga gctcgtgctt taatggtggc acctgcattg atgggatcaa tacgtttaca 3361 tgccagtgcc cacctggatt taccggaagc tactgccaac atgatatcaa tgaatgtgac 3421 tccaagcctt gtttgaacgg aggcacctgc caggacagtt acgggacata caagtgcacc 3481 tgtccgcaag gatacactgg gctgaactgc cagaacctgg tgcgttggtg tgactcttct 3541 ccctgtaaaa atggaggaaa gtgctggcag acaaacaact tttatcgttg tgagtgcaag 3601 agtggctgga ctggtgttta ctgtgatgtc cccagtgtct cctgcgaggt tgctgctaag 3661 cagcaaggtg tggatatcgt tcatctgtgt cgcaattctg gcatgtgtgt ggacactgga 3721 aacacacatt tctgccgctg tcaggctggc tacacaggca gttactgcga ggaacaggtg 3781 gacgaatgct cgccgaatcc ttgtcagaac ggtgccacat gcacagacta cttaggtggc 3841 tactcttgcg agtgtgttgc cggctaccat ggagtaaact gctccgagga gatcaatgag 3901 tgtttgtccc acccgtgcca gaatggagga acctgtattg atctcattaa cacctacaaa 3961 tgctcctgcc ccagaggaac tcaaggtgtg cactgtgaaa ttaatgtgga tgattgcacg 4021 cccttctacg actccttcac cttggaaccc aaatgcttca acaatggcaa atgtatagac 4081 cgcgtgggtg gctataactg catctgccct ccaggctttg taggggaacg ctgtgaaggc 4141 gatgtgaacg agtgcctgtc taatccctgt gactcccgtg gcacccagaa ctgcattcaa 4201 ctggtgaatg attaccggtg tgagtgccgg cagggattca caggaaggcg ctgtgaatct 4261 gttgtggatg gttgtaaggg aatgccctgc cgaaacggtg gaacatgtgc tgtcgccagc 4321 aatacagaac gtggatttat ttgcaaatgc cctcctgggt ttgacggagc cacttgcgag 4381 tatgattcac gaacctgcag caaccttcgc tgccagaatg gcggcacttg catttcggtg 4441 ctgacgagtt ccaaatgcgt gtgctcggaa ggctatactg gcgccacatg tcagtaccct 4501 gttatcagtc cttgtgcttc ccacccttgt tacaatggag gaacctgcca gttctttgct 4561 gaggaacctt tcttccagtg cttctgtcca aaaaacttta acgggctctt ctgccatatc 4621 ttggattatg agtttcccgg gggactaggc aagaatatta ctcctcctga taatgacgat 4681 atctgcgaaa acgagcagtg ctctgagctg gctgacaaca aagtctgcaa cgccaactgc 4741 aacaaccatg cttgcggatg ggatggagga gactgctctc tcaacttcaa tgacccatgg 4801 aagaactgca cccagtcgct gcagtgctgg aaatatttca atgatggcaa atgcgattct 4861 cagtgcaaca acactgggtg tctgtacgac gggtttgact gccagaaagt tgaggttcag 4921 tgcaaccctt tgtatgacca gtactgcaag gatcactttc aagatggcca ctgcgaccaa 4981 ggctgtaaca atgcagagtg cgaatgggat ggcctggact gcgccaacat gccagagaac 5041 ctggctgaag gcaccctggt gttggtcgtc ctcatgcccc ctgaaaggct taagaacaac 5101 tctgtcaatt tcctccgtga gctgagccga gtgctccaca caaatgtggt gttcaagaaa 5161 gacagcaaag gcgaatacaa gatctaccct tactatggca atgaggaaga acttaaaaaa 5221 catcacatca agaggtcaac tgattactgg tcggatgctc cttctgccat cttcagcaca 5281 atgaaagaat ccattttact aggaaggcat cgtagggagt tggatgaaat ggaagttagg 5341 gggtcaatcg tatacttgga aatagacaac cgccagtgtt acaaatcctc ctctcagtgc 5401 ttcaacagtg ctacagatgt tgccgcattc cttggggctt tggcctccct tgggagtctg 5461 gacacccttt cctataagat tgaggcagtg aaaagtgaga acatggagac ccccaagccc 5521 tctactctct atcccatgtt atccatgttg gtcattccgt tattaatcat cttcgtcttc 5581 atgatggtca tcgttaataa gaagcgtcgc cgtgaacacg acagctttgg ttccccgacg 5641 gctttattcc aaaagaaccc agcaaaaaga aacggcgaga cgccttggga ggattctgtt 5701 ggcttaaagc ccattaagaa tatgacagat ggctctttta tggacgataa tcagaatgaa 5761 tggggagatg aggagaccct ggaaaacaag aggttcaggt ttgaagagca agtgattctc 5821 ccagaacttg ttgacgacaa aactgacccc cgacagtgga cacggcagca ccttgacgcc 5881 gctgatctgc gcatttcatc catggcccca acgccgccgc agggagagat tgaggccgac 5941 tgtatggatg tcaatgtccg tggccctgat ggcttcaccc cacttatgat tgcctcctgt 6001 agcggaggtg gattagagac tggaaacagc gaagaggaag aggatgcttc agctaatatg 6061 atttccgact tcattgggca gggtgcccaa ctgcataacc aaaccgaccg cactggcgag 6121 acagcacttc atctggccgc aagatacgca cgtgctgatg cagccaagcg cctgttggaa 6181 tcgagtgcgg acgccaacgt ccaggataac atgggcagga ctcctctcca tgcagcagtg 6241 gcggctgatg ctcagggcgt attccagatt ctcattcgga accgagcgac agacttagac 6301 gcccgcatgt ttgatggcac tacccctctg atcctggccg ctcggctggc cgtggaaggg 6361 atggtggagg agcttatcaa tgctcatgca gatgtcaacg ctgttgatga atttggaaaa 6421 tctgctttgc attgggcagc ggctgtgaat aacgttgatg ctgcagctgt gcttctcaag 6481 aatagtgcaa ataaggacat gcaaaacaac aaggaagaga catccctgtt cttggccgca 6541 agagaaggca gctacgaaac tgccaaagtc cttttggatc actacgccaa ccgtgacatc 6601 acagaccaca tggatcggct gcctcgtgac atcgcccaag aacgcatgca ccacgacatt 6661 gttcacctgc tggatgaata taaccttgtg aagagcccaa cgctgcacaa tggtccgttg 6721 ggagcaacga cattatcacc tcccatctgc tcccctaatg gttacatggg gaacatgaag 6781 ccttctgttc agagcaagaa agcccgcaag cccagtatca aaggtaatgg ctgcaaagag 6841 gccaaagagc tgaaagccag aaggaaaaaa tctcaagatg ggaaaacaac tctcttggat 6901 tctggcagtt ctggagtgtt gtccccagtg gactccctgg agtcaacaca tggatactta 6961 tcagatgtgt cttctcctcc gctgatgacc tctccgtttc agcagtctcc atccatgcct 7021 ctgaaccact tgacaagcat gccagagtcc cagcttggca tgaatcacat aaacatggcc 7081 accaagcagg aaatggcagc aggttccaac agaatggctt ttgatgccat ggtgccacgt 7141 ctgacccatc tcaatgcctc aagccctaat accatcatga gcaatggatc catgcatttc 7201 actgtgggag gagctccgac tatgaacagc caatgtgact ggttagctag gctgcagaat 7261 gggatggtcc agaatcagta tgacccaatc agaaatggca tccaacaagg caatgctcaa 7321 caagctcaag ctcttcagca tggccttatg acctcgctcc ataatggtct gccagcaaca 7381 actctctccc aaatgatgac ctatcaggcc atgcccaaca caaggctagc caatcagcca 7441 catctaatgc aagcccagca aatgcaacag cagcaaaact tgcagttgca ccagagcatg 7501 cagcaacaac atcacaattc cagcacgacc tctactcaca tcaactcacc attctgcagc 7561 agtgacataa gccagacgga cctgcagcaa atgtcaagca acaacattca ttcagtaatg 7621 ccccaggaca ctcagatatt tgctgcatct ctgccttcca atcttacgca gtctatgaca 7681 actgcacaat ttttaacccc cccttcccag catagctact cctccccaat ggacaataca 7741 ccaagccatc aactacaagt accagaccac ccgttcctga cgccttctcc tgagtcacct 7801 gaccagtggt caagctcctc ccctcattcc aatatgtctg actggtcaga aggaatatca 7861 agtcctccca cgagtatgca gcctcagcgc acccacatac ctgaagcttt caagtaaaaa 7921 aaaaaaaagt ttaaaaaaat gtaaaatatt tttaaagaca ctgagagaga ctttaagaga 7981 ctgaaggaaa tttttatatg gtttttatac ttaaaataac agaacatttg aattttctag 8041 tatttattta tatatacgtt tgacctaaaa cactgccctt ttatttataa gctttttttc 8101 atttttagga caaaatattt taacttcttg ccttgaaagt ttttcagttc taaatcttat 8161 gaaattggtt cctgcctggt attgaaaacg gcaatgtatt tattttttat ttacctgaat 8221 agtatacagg aacaaaccac tggggtgggg gggttatcgg gatgtgtatt tagcagaaaa 8281 aagattttct ataaaatgaa atctttcagg ttttcattta tagcactaaa aagattccag 8341 tattaatttt aaattaaatc atgaagaaga tgctccgaat acctctcgct gtcagggagg 8401 tgcccaatag gttttttttt tttgtttttt ttttggaaaa cctgaaactc ttggtaagta 8461 cagaaagaaa gcataaaata caggagggca cgcctaggtc tgatgcattc tcaatgggaa 8521 taaaatcctg aaaagtgact cattttaaaa aattatattt gttagaaaaa acaaactgcc 8581 attttgaatc ccttgtgtcg catggggtat tggtcagaaa atgcatgcaa tttttggtac 8641 aagcccgtgg catattttct acagtttttt tttttttttt tttttttttg taaatatcta 8701 tggtggattt tgtttcaaaa atctatttgt atatcctgat ttatattgct tatccgagta 8761 ggagtgcagg ctggatggaa ccatgccgaa gacaaatcgc agattggaca ttgtttttta 8821 tttttttgcc tacagagact atttgcacat gaacaacaac aaaaaaaaga agatatattt 8881 aaatataaaa ttttatacaa acgaaaaaac aaaaaacaaa aactgtcgat ttgcctctag 8941 ttccacttct tctgtgatac gatgcagttg ggcagttcat tccttatgcc gtactgtaca 9001 aagatctgta gaatttagtt gtttttcctg aacaaataaa tatgtaatgt atcgagcaga 9061 aatgtagttt acaaaaaaaa ctatttgttg ttgttgctgt taacacttct gtaaacaaat 9121 tttttttggt ttgtttattt gttgtaataa aattgtacaa aat // LOCUS TRCTC26 638 bp ss-mRNA INV 17-JUL-1990 DEFINITION T.cruzi AMP-inducible protein (TC26) mRNA, partial cds. ACCESSION M34062 KEYWORDS AMP-inducible protein. SOURCE T.cruzi (strain X10/4) trypomastigote, cDNA to mRNA, clone G26. ORGANISM Trypanosoma cruzi Eukaryota; Animalia; Protozoa; Sarcomastigophora; Mastigophora; Kinetoplastida; Trypanosomatina; Trypanosomatidae. REFERENCE 1 (bases 1 to 638) AUTHORS Heath,S., Hieny,S. and Sher,A. TITLE A cyclic AMP inducible gene expressed during the development of infective stages of Trypanosoma cruzi JOURNAL Mol. Biochem. Parasitol. (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by S.Heath, 08-MAY-1990. FEATURES from to/span description pept < 1 > 638 AMP-inducible protein (TC26) (AA at 1) BASE COUNT 154 a 174 c 168 g 142 t ORIGIN 1 accccaccag accaactggt aatggtagcg accggcgctc agcttggaat tccgtgcgct 61 gcttctagcc ctagagaagc tgatgatccc ccgtatccga cacaggcgta caaccctgct 121 tgtggttacg gacagtcagt ctcttctagc ggctctaaac aagggcccgc tcaatcagac 181 agactggacg gaggatcaga tctggcggcg tctcttgaca ctgacgtgtg ctggctggtc 241 gtgcacctgc agttttgtta cggacattgt ggagtacatg ctaacgagct tgcagatcag 301 tatgcgagga actatggaaa gtggacaata cacggagcaa ggaatcgcac ctttatggca 361 tacggatctg ctgacgtgtt ttactaccca gctcaccaac aagtggcgta ctaccattcg 421 tcaagacact catcgctacc tgctttgcgg cacaaggcca tcagatctcg cggtaaggac 481 ctgatcactc aggaagttct acaccgtcag aactggttca cctcgcaagg gcaaggtgcg 541 gggaatctga gctctggggc cgactatctg ggccgtgaga gattgcacga accaatgccg 601 tttctgcaac atctcaccgg aacagtctgc atatatat // LOCUS DROANNIX 1104 bp ss-mRNA INV 17-JUL-1990 DEFINITION D.melanogaster annexin IX mRNA, 3' end. ACCESSION M34068 J05501 KEYWORDS annexin IX. SOURCE D.melanogaster adult head, cDNA to mRNA, clone pD3-6. ORGANISM Drosophila melanogaster Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta; Pterygota; Neoptera; Holometabola; Diptera; Brachycera; Cyclorrhapha; Schizophora; Drosophiloidea; Drosophilidae. REFERENCE 1 (bases 1 to 1104) AUTHORS Johnston,P.A., Perin,M.S., Reynolds,G.A., Wasserman,S.A. and Suedhof,T.C. TITLE Two novel annexins from Drosophila melanogaster: Cloning, characterization and differential expression in development JOURNAL J. Biol. Chem. 265, 11382-11388 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by T.C.Suedhof, 04-MAY-1990. FEATURES from to/span description pept < 1 891 annexin IX (AA at 1) signal 1091 1096 poly-A signal BASE COUNT 281 a 300 c 306 g 217 t ORIGIN 1 attctgcgca aggcgatgaa gggcttcggc accgacgaga aggccatcat cgagatcctg 61 gccaggcgtg gcatcgtcca gcgtttggag atcgctgagg cgttcaagac ctcgtacggc 121 aaggatctga tctcggacct caagtccgag ctgggcggca agttcgagga tgttatcctg 181 gctctgatga cgccgctgcc ccagttctat gcccaggagc tgcacgacgc catctcggga 241 ctgggaaccg acgaggaggc catcatcgag atcctctgca cgctgtccaa ctacggcatt 301 aagaccattg cccagttcta cgagcagagc ttcggcaagt ccctagagtc cgacctaaag 361 ggcgacacca gtggccactt caagcggctg tgcgtctcgc tcgtccaggg caaccgggat 421 gagaaccagg gcgtggacga ggccgcggcc atcgccgatg cccaggctct gcacgacgcc 481 ggtgagggac agtggggcac agatgagtcc accttcaact cgatcctgat cacccgctcc 541 taccagcagc tgcgccagat cttcctcgaa tacgagaatc tgtcgggcaa cgacatcgag 601 aaggccatca agcgggagtt tagcggctcc gtggagaagg gtttcctggc catcgtcaag 661 tgctgcaagt ccaagatcga ctacttttcg gagcgcctgc acgactccat ggccggcttg 721 ggcaccaagg acaagacgct gatccgcatc atcgtcagcc ggtcggagat cgatctgggt 781 gacatcaagg aggcattcca gaacaagtac ggcaagagct tggagtcctg gatcaaggag 841 gatgccgaga ccgatattgg atacgtcctg gtcactctta cggcttggta gacggaagca 901 gccggaatat ccgaatatct atgagcaata ccccactgtt caagtagaaa atgccaaaaa 961 aaaaaacgtt gcatttcccc aaaaaaaagt ataacaaaag cgaagaacaa atggagttgg 1021 tctatataca gtagttgtga tgtgttctaa aaatccaatc tacaaaacgc ttagtatttt 1081 ccctctgtgc aataatcgga attc // LOCUS DROANNX 1192 bp ss-mRNA INV 17-JUL-1990 DEFINITION D.melanogaster annexin X mRNA, complete cds. ACCESSION M34069 J05501 KEYWORDS annexin X. SOURCE D.melanogaster adult head, cDNA to mRNA, clone pD3-16. ORGANISM Drosophila melanogaster Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta; Pterygota; Neoptera; Holometabola; Diptera; Brachycera; Cyclorrhapha; Schizophora; Drosophiloidea; Drosophilidae. REFERENCE 1 (bases 1 to 1192) AUTHORS Johnston,P.A., Perin,M.S., Reynolds,G.A., Wasserman,S.A. and Suedhof,T.C. TITLE Two novel annexins from Drosophila melanogaster: Cloning, characterization and differential expression in development JOURNAL J. Biol. Chem. 265, 11382-11388 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by T.C.Suedhof , 04-MAY-1990. FEATURES from to/span description pept 91 1056 annexin X signal 1175 1184 poly-A signal BASE COUNT 271 a 348 c 363 g 210 t ORIGIN Chromosome 93B or 19A-4,7. 1 gaattccaaa agtcccagga gaaagactga ttcgtgtgaa gtcgtctact gaagagccac 61 aaggaaccca aggaatcttc cagctgcata atggaataca aacccgtgcc cacggttaag 121 gacgcagctc ccttcgacgc ctcccaggac gcccaggtgc tgcgggcggc gatgaaggga 181 ttcggcaccg acgagcagga aatcatcgac gtgctcgtcg gcaggagcaa ccagcagagg 241 cagacgatca aggcggttta cgaagcggag ttcgagcgcg acctggtgga cgatcttaag 301 gacgagctgg gaggcaagtt cgaggacgtg atcgtgggtc taatgatgcc accagtggag 361 tacctgtgca agcaactgca cgccgccatg gcgggcatcg gaaccgagga ggccacgctc 421 gtcgagatcc tgtgcaccaa gaccaacgag gagatggccc agatcgtggc cgtctacgag 481 gagcgctacc agcgcccgct ggccgagcag atgtgcagcg agacctccgg ctttttccgc 541 cgcctgctca cgctgatcgt gaccggagta cgtgacggac tggacacgcc cgtcgacgtc 601 ggtcaggcca aggagcaggc cgcccagctc tactcggccg gcgaggccaa gctgggaacg 661 gacgaggagg tcttcaaccg gatcatgtcg cacgccagct tcccgcagct gcgacttgtc 721 ttcgaggagt acaaggtgct ctccgggcag accatcgagc aggccatcaa gcacgagatg 781 tccgacgagc tgcacgaggc catgatggcc atagttgagt gcgtccagtc accggcggcc 841 ttcttcgcca accgcctcta caaggccatg aatggcgccg gcaccgatga cgccacgctc 901 atccgcatca tcgtcagccg ctcggagatc gacctggaga ccattaagca ggagttcgag 961 cggatctaca accgtacgct gcacagcgcc gtggtggacg cggagacctc tggtgactac 1021 aagcgggccc tgacagccct acttggatcc gcctaggccc gaggatgtgg cagctggtcc 1081 gcccaatatt ttattcgtgt taatagcttt gatcgtagtg tgccttttag gaaaatcgct 1141 tttaatgtcg tctgcgcatg cgcacactgt tggcaataaa taaacggaat tc // LOCUS ECODKSA 1273 bp ds-DNA BCT 17-JUL-1990 DEFINITION E.coli dnaK suppressor (dksA) gene, complete cds. ACCESSION M34945 KEYWORDS dnaK suppressor. SOURCE E.coli DNA. ORGANISM Escherichia coli Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Facultatively anaerobic rods; Enterobacteriaceae. REFERENCE 1 (bases 1 to 1273) AUTHORS Kang,P.J. and Craig,E.A. TITLE Identification and characterization of a new Escherichia coli gene that is a dosage-dependent suppressor of a dnaK deletion mutation JOURNAL J. Bacteriol. 172, 2055-2064 (1990) STANDARD simple staff_review FEATURES from to/span description pept 229 441 ORF 1 pept 619 1074 dnaK suppressor (dksA) BASE COUNT 343 a 301 c 333 g 296 t ORIGIN 1 gacgaaagag gctatcctta atgaatcaat ttcagaactg tcaggctata gctcgctgaa 61 aagcgaagta aaatacggcg cagaacgcag ccgtattgac tttatgttgc aggcggattc 121 gcgtccagac tgctatattg aagtgaaatc ggttacgtta gcggagaacg aacagggata 181 ttttcccgat gcggtcactg aacgaggtca gaaacacttc gggagttgat gagcgtagcg 241 gctgaaggcc agcgtgcggt tatctttttc gccgtgctgc attcagccat tacacggttt 301 tcacccgcgc gccacatcga tgagaaatac gcgcaactat tgtcagaagc tcaacagagg 361 ggggtagaaa ttctggctta caaagcggaa atttctgctg aaggcatggc tcttaaaaaa 421 tcactgccgg ttacattgta gtaaagtaag taactggtta atttacattc tggtcgcgtg 481 cgcaaatacg cttttcctca cacagttgtc aagtgttacg tttagataat tgctatccgg 541 aaaagcatct gctatttata gcggcctcat ttttcccccg aacatgggga tcgatagtgc 601 gtgttaagga gaagcaacat gcaagaaggg caaaaccgta aaacatcgtc cctgagtatt 661 ctcgccatcg ctggggtgga accatatcag gagaagccgg gcgaagagta tatgaatgaa 721 gcccagctgg cgcacttccg tcgtattctg gaagcatggc gtaatcaact cagggatgaa 781 gtcgatcgca ccgttacaca tatgcaggat gaagcagcca acttcccgga cccggtagac 841 cgtgcagccc aggaagaaga gttcagcctc gaactgcgta accgcgatcg cgagcgtaac 901 gtgatcaaaa agatcgagaa gacgctgaaa aaagtggaag acgaagattt cggctactgc 961 gaatcctgcg gtgttgaaat tggtattcgc cgtctggaag cgcgcccgac agccgatctg 1021 tgcatcgact gcaaaacgct ggctgaaatt cgcgaaaaac agatggctgg ctaattacag 1081 ccgttccatc acgtttacca cacggggaaa tcgtcccgcc ttattttttg ttcaaagaga 1141 tgacagacac acagtatatt ggcctgtcgc ccctctcttc cggcgagctt cattttggct 1201 ctctgatcgc tacgctcggc agctatttgc acgtcgcgcc cggcaaggtc gctggctggt 1261 acgcatagaa gat // LOCUS STFCYCLI 2180 bp ss-mRNA INV 17-JUL-1990 DEFINITION Starfish (A.pectinifera) cyclin B (CYC) mRNA, complete cds. ACCESSION M33880 KEYWORDS cyclin B. SOURCE Starfish (A.pectinifera) egg, cDNA to mRNA, clone lambda-gt10-cyc10. ORGANISM Asterina pectinifera Eukaryota; Animalia; Eumetazoa; Echinodermata; Asterozoa; Stelleroidea; Asteroidea; Spinulosida; Asterinidae. REFERENCE 1 (bases 1 to 2180) AUTHORS Tachibana,K., Ishiura,M., Uchida,T. and Kishimoto,T. TITLE The starfish egg mRNA responsible for meiosis reinitiation encodes cyclin JOURNAL Dev. Biol. (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by T.Kishimoto, 11-MAY-1990. FEATURES from to/span description pept 126 1313 cyclin B (CYC) mRNA < 126 2180 cyclin B mRNA BASE COUNT 650 a 440 c 482 g 608 t ORIGIN 1 ttattatgtt gctcagttct gacctcttta gcaacgtaca tgacgtacat gaagtacacg 61 tatgacgtac atcgtagcga ctgtctgaat ttttcttcga tgactaaaat tcatctggga 121 aaacaatgca gacagcttgt tctggcaatt tgtgtgggta tcaactgatg ttcagtttgt 181 ctactgttgt aactgtatgc agatcactcc gatcccgcaa ccgccactgg tttttgaagc 241 ttttgaggtg tacgtttaac gatcgcatga gatgcgctct ggagaacatc agcaatgtag 301 caaagaacaa tgtacaagct gcagctaaga aggagatcaa acaaaagaga ggaatgacca 361 aatccaaagc tacaagttct ctacagtcgg tcattggtct ccatgtagaa cctgtggaaa 421 aggtccagtc gccagagccc atggacatga gtgaagtcag caatgctctg gaggctttct 481 cacagaacat tcttgagatg ggcgtcgatg acattgacaa agatgaccat gaaaatccac 541 agctgtgcag cgagtacgtc aacgacatct atctatacat gagacatctg gagcgtgagt 601 tcaaagtgag gacagattac atggcaatgc aagagatcac tgagcgtatg agaacgatcc 661 tgattgactg gctggtccaa gtacatctta gattccatct tctacaggaa acactgttcc 721 ttaccatcca gatcctcgac agatacctag agggtgcaag cgtatccaag accaaactcc 781 agctggtcgg tgtgacctcc atgctgattg ctgcctatga agagatgtac gcagagattg 841 gagactttgt ctacatcacg gacaacgctt acagcaaggc acagatccgc gccatggagt 901 gtaacattct ccggaaacta gacttcaatc tgggcaagcc actctgcatt cacttcctca 961 gacgttgctc aaaggctggt ggggttgatg gtcacaagca cacactgtcc aagtacatca 1021 tggagttgac gttacagagt acagctttgt caagtatgac catcgagatt gctgctgcag 1081 ccttgctatc acaagattct gggatgagga tatgtggaat gggaacaaaa tccctggttc 1141 actacagtgc ctacagtgaa ggccacctgg gaccaattgt gcagaagatg gccgtgctat 1201 ctcaacaatc gcacccaagt ccaaattcca ggcttgatca ggaagaagat atggccagca 1261 gcaagttcat gagcgatcag caagctaccc aagaactgaa atcaatcagg tagtcaactg 1321 aatcttgccg acgagaactg ctgagcttcc atccgcccag atgaatggtc atgtaatagt 1381 agtaaatagt agtgtattat agtctttaat taaataacac cccttcagaa gttgacaggt 1441 ttcaacttag tgcatgattt aagcaactcg aggaggtact ccgatttttt ccccccttgg 1501 ttgtcatttt ttaagttggc aagtgcagtt gaatctattt taatcttgta tagatagcaa 1561 tgcttgtact gccatggagg ccaaaggcgt agatagaatt gtgcatgaaa gtacaatgtt 1621 gttgaaatcg ggtggagtgg gattatttga atgatacgct acattttgtg caatgacaga 1681 cgcactacag catgatcgag gtttcaaagt aaaattatgg ctatctaaca ttttgtaagc 1741 attgcatgta taatagcttt ctgcaagtgc aatcagattt ctgatcagag gttcaatgca 1801 taacgtgtca cgaaagccca tctgatcaag cgtaatgtaa aatgaaaagg ggaaattgac 1861 ttctgcaatt tattatgctt ctagaatttt tactcgtcca actttttgtc tgtcgttcat 1921 gacttttgcg ctagatatcc gagaccaatt catttctcca aagaaaaaaa taaacatgag 1981 gttgtttgtc atgaagtttc ccacacaact tcagatgaac agctcatcaa gttgtcagat 2041 ttgcttgttc aaaagttaaa acgaaaaaaa tcatgtctta atgttttatt atttaatatg 2101 taaaattgaa tgattcgtgt tgcagtattt gtacctaaat gcttttgtct gtcagtgttt 2161 gtaataaagt taatggaaat // LOCUS MUSH2A1X 929 bp ds-DNA ROD 17-JUL-1990 DEFINITION Mouse histone H2A.1 gene, complete cds. ACCESSION M33988 KEYWORDS histone H2A.1. SOURCE Mouse (BALB/c) DNA, clone 53. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 929) AUTHORS Gruber,A., Streit,A., Reist,M., Benninger,P., Bvhni,R. and Sch|mperli,D. TITLE Structure of a mouse histone gene cluster JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by D.Sch|mperli, 02-MAY-1990. Author address: Dr.Schmperli Zoologisches Institut, Baltzerstrasse 4 CH-3012 Bern SWITZERLAND FEATURES from to/span description pept 164 556 histone H2A.1 mRNA 119 624 histone H2A.1 mRNA BASE COUNT 238 a 247 c 222 g 222 t ORIGIN 1 gagctcaaat tctggctttc tattgggtac gatatattaa ccaatgggag aaacacaaac 61 agaatacctc cagttagtat aaatgcttgc tgttcagttg cagaatttac tatatattct 121 tttcctttct ctgctttgcc tttactgata cttaaacgca tacatgtctg gacgcggaaa 181 gcaagggtgc aaggcccgcg ctaaggccaa gacccgctcc tcccgggccg gcctgcagtt 241 ccccgtgggc cgcgtgcacc ggctgctccg caagggcaac tactcggagc gcgtgggcgc 301 cggcgccccg gtgtacctgg cggctgtgct ggagtacctg acggccgaga tcctggagct 361 ggcgggcaat gcggcccgcg acaacaagaa gacgcgcatc atcccgcgcc acctgcagct 421 ggccatccgc aacgacgagg agctcaacaa gctgctgggc cgcgtgacca tcgcgcaggg 481 cggcgtcctg cccaacatcc aggccgtgct gctgcccaag aagaccgaga gccaccacaa 541 ggccaagggg aagtaatctg gcgattgtct gtactgccca gttgaaagtt aaccaaaaca 601 aaggctcttt tcagagccac ccacatcttt ccataaaatg agctgccacc tcgtgaaacg 661 ttcttccact acagttttta tactacatat gaaaaagtta cgaagtagct ttcaatctta 721 gtaaattgat tttaatactg ttagtccctg cgataaatct tacgaccttc cttagtttga 781 gtcaaaagtg tgtaagagat gaaaccttta gaacatacta taaattttta gtagaaattt 841 ggcacccagg tttgtcattc acgtcacgat tgtctagagc ataatggtag taagggctaa 901 gggccattaa atcccacttc catagtttc // LOCUS MUSH2A1XX 693 bp ds-DNA ROD 17-JUL-1990 DEFINITION Mouse histone H3.2 gene, complete cds. ACCESSION M33989 KEYWORDS histone H3.2. SOURCE Mouse (strain BALB/c) DNA, clone 53. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 693) AUTHORS Gruber,A., Streit,A., Reist,M., Benninger,P., Bvhni,R. and Sch|mperli,D. TITLE Structure of a mouse histone gene cluster JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by D.Sch|mperil, 02-MAY-1990. Author address: D.Schlmperil Zoologisches Institut, Baltzerstrasse 4 CH-3012 Bern SWITZERLAND FEATURES from to/span description pept 151 561 histone H3.2 mRNA 116 611 histone H3.2 mRNA BASE COUNT 159 a 206 c 184 g 144 t ORIGIN 1 caggatttag aagcagaggc tgaccaatcc caacaaagcg cgggcccttt gaatgttctt 61 cggtccaata gcggatagtc tgattgtata aaaggtggac agcgccttgc agctcactat 121 agtgtcagtc tattttccct tgttaaagtc atggctcgta ctaagcagac cgctcgtaag 181 tccaccggtg gcaaggcccc gcgtaagcag ctggccacca aggccgcccg caagagcgcc 241 ccggccaccg gcggcgtgaa gaagcctcac cgctaccgtc ccggcaccgt ggcgctgcgc 301 gagatccggc gctaccagaa gtcgaccgag ctgctgatcc gcaagctgcc gttccagcgc 361 ctggtgcgcg agatcgcgca ggacttcaag accgacctgc gcttccagag ctcggctgtc 421 atggctctgc aggaggcgag cgaggcctac cttgtgggtc tgtttgagga caccaacctg 481 tgcgccatcc acgccaagcg tgtcaccatc atgcccaagg acatccagct cgcccgccgc 541 attcgtgggg agagggctta aacgaaacaa agtgcaaact ggaacccaaa ggctcttttc 601 agagccaccc acagtttctg taaaagcagc tgttactttt ctactgtctt ggaaactttg 661 tttttcaaag atttatctag ttacaaagta tac // LOCUS RATGGSLEC 1358 bp ss-mRNA ROD 17-JUL-1990 DEFINITION Rat Gal/GalNAc-specific lectin mRNA, complete cds. ACCESSION J05495 KEYWORDS Gal/GalNAc-specific lectin. SOURCE Rat (strain Wistar) adult peritoneal macrophage, cDNA to mRNA. ORGANISM Rattus norvegicus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 1358) AUTHORS Kurata,M.Ii., Itoh,H., Yamashina,N. and Kawasaki,I.T. TITLE Molecular cloning and sequence analysis of cDNA encoding the macrophage lectin specific for galactose and N-acetylgalactosamine JOURNAL J. Biol. Chem. 265, 11295-11298 (1990) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by T.Kawasaki, 20-APR-1990, for release after publication. FEATURES from to/span description pept 172 1092 Gal/GalNAc-specific lectin precursor sigp 172 195 Gal/GalNAc-specific lectin signal peptide matp 196 1089 Gal/GalNAc-specific lectin signal 1330 1335 poly-A signal BASE COUNT 340 a 325 c 365 g 328 t ORIGIN 1 ccacctctgg ccactgctgc acagggaagc cacgatttca cacagttcca ggacttacct 61 cagcttcaca ctgcttctga cactctcctt ggtcccagct cagtatctgt ctggattctg 121 gtgtctcggt ttcagttcag atagccctag aggccttgtg tttgtgcaag catgacaatg 181 gcatatgaaa acttccagaa cttggggagc gaagagaaaa accaagaggc tggtaaagct 241 cctccccagt ccttcctgtg caatatcctc tcttggaccc acctcctcct gttctccctg 301 ggccttagcc tcctgctgct ggtggttatc tccgtgattg gatcccaaaa ttcccagtta 361 aggagggacc tagaaaccct gagaaccact ttagacaaca ccacctccaa caccaaggct 421 gaactacagg ccctggcctc caggggtgac agcttgcaaa caggaatcaa ttctctgaaa 481 gtggaggtgg atgatcatgg gcaggaactg caggcaggcc gaggcttgag ccagaaggtg 541 gcttctctgg agagcacagt ggagaagaag gagcagactc tcagaacaga tctatctgaa 601 ataaccgatc gtgtgcaaca gctggggaag gacttgaaga ccctgacatg tcagctggcc 661 agcctcaaga acaacggctc agcagtggcc tgctgccccc ttcactggat ggagcatgaa 721 ggcagctgct actggttctc tcagtctggg aagccgtggc ctgaagctga caagtactgc 781 cagctggaga attctaacct agtggtggtc aactccctgg cggagcagaa ttttctacag 841 actcacatgg gcagtgtggt cacttggatt ggcctaacgg accaaaatgg gccctggcga 901 tgggtggacg ggaccgacta tgagaaaggc tttacgcact gggccccaaa gcagccagat 961 aactggtatg gacacgggct gggaggaggc gaggactgtg cacacttcac ctcagatggt 1021 cgctggaatg atgacgtttg ccagaggccc taccgctggg tctgtgagat gaagctggcc 1081 aaggacagct aggagcactc tccaagtaat ttatttcttc agtggtttta gctggggaag 1141 cctggagttt ggactcctcc tatcccattt caatgttatt tgggactttt ctttttgtca 1201 aaaaatttaa gtcaaggaga aaggattgtg tttgagatgg tagtgtggtg tccggaagaa 1261 tggggttatt gaaatctgtg gtattttgta cagagtgccg cttattgtag gactttttaa 1321 tgtaacaaaa ataaaatact gttggtttta aaaaaaac // LOCUS MLUDNAA 4171 bp ds-DNA BCT 17-JUL-1990 DEFINITION M.luteus ribonuclease P (rnpA), 50S ribosomal subunit protein L34 (rpmH), DNA biosynthesis initiation protein (dnaA), and DNA polymerase III beta subunit (dnaN) genes, complete cds. ACCESSION M34006 KEYWORDS 50S ribosomal subunit protein L34; DNA biosynthesis initiation protein; DNA polymerase III beta subunit; ribonuclease P. SOURCE M.luteus (strain IF03333) DNA. ORGANISM Micrococcus luteus Prokaryota; Bacteria; Firmicutes; Gram-positive cocci; Micrococcaceae. REFERENCE 1 (bases 1 to 4171) AUTHORS Fujita,M.Q., Yoshikawa,H. and Ogasawara,N. TITLE Structure of the dnaA region of Micrococcus luteus: Conservation and variations among eubacteria JOURNAL Gene (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by N.Ogasawara, 05-MAY-1990. FEATURES from to/span description pept 435 37 (c) ribonuclease P (rnpA) pept 601 464 (c) 50S ribosomal subunit protein L34 (rpmH) pept 944 2491 DNA biosynthesis initiation protein (dnaA) pept 3242 > 4171 DNA polymerase III beta subunit (dnaN) signal 843 848 dnaA -35 signal signal 866 871 dnaA -10 signal BASE COUNT 672 a 1558 c 1361 g 580 t ORIGIN 1 ctcgagcacg acgaagggcg agggcgtgac ggtcatctac gcgtgctccg aggcggcggg 61 caggtgcggc ttcagggcct tgccgagcgc gccgacggtc tcgcggcgca gcagcgcgta 121 gtccgcctcg gcggcggcag gcagcgcgcg gacctgcacc agaaccggca ggtcgcgcag 181 cgggggcagg cgcatctgct ccgcgacgac ggcgcgcagc cgccgcttca cccggttgcg 241 ggtcaccgcg ttccccacgg ccttggacac gacgaacccg gcccgcggcc gaggggcgct 301 cggcgacgtg gaccgggtct gatcggggtc cgtggcgacg ctcaccacga cggtgcgccg 361 acccgcgcgg gtgccggtgc gaccgaggtg acggaactcc gcgggcgtgc ggacgcgacg 421 gtcacggggc agcacggggc gcgccgtccc tggaccgagg tcctcaggcg gacagctcgg 481 cgcggttctt gccgcggcga gcggacagga tggcgcggcc ggcgcgggtg cgcatgcggg 541 cacggaagcc gtgcttgcgg gcacgacggc ggttgttggg ctggaacgtg cgcttggtca 601 ctgtgatctc tccacgacgt gttgtgtgcc ttcgaccgtc gacctgagca ggtggaccag 661 ggggaaggac cggtgaggct ctgacatgaa atccccggcg ggcacgatca accgacaccc 721 gggggaccag acaactctag tggccggggt acgcaggggt caaacgggtg cgcgcggatc 781 ccggacgccc gtgggcgccc gggacggtcg tccacaggat gataggggtt gtccaccgtc 841 tgtgcacaac cctgtggaca accgttagac tccgagcacc gtcgtccgtc tcgcgcagcc 901 gcccccgtcc ccggggacgt ctgccccagg aaaggaagcc tccgtggtgg cagaccaggc 961 cgtgctcagc tcgtggcgct ccgtcgtggg ctccctcgag gacgacgcgc gggtcagcgc 1021 ccgcctcatg ggcttcgtct acctggccca gccgcagggc ctcatcggca atacgctcct 1081 gctggccgtg ccgaacgaga ccacccgcga gacgctccag ggcacccagg tggccgacgc 1141 cctcacggac gccctgaccc aggagttccg cgaggagatc ctgctggcca tctcgatcga 1201 cgcgaacctg cagccgccgc ggaccccctc ctccgaggcc cgccgctcct ccctcgccgg 1261 cgggccgtcc ggcgccgcgg cgccggacgt cgagctgccc ccggccgcga ccgccgccac 1321 ctcccgccgc gccgtcgccg aggagctgcc gggtttccgc atcgagccgc cggccgacgt 1381 cgtgcccgcc gcgaacgcgg ccccgaacgg gaacggcaag ccgacccccg cgccgccgtc 1441 gacgtccgcg gagaccagcc gcctcaacga ccgctaccac ttcgagacgt tcgtgatcgg 1501 ctcgtccaac cggttcgcgc atgcggccgc gaacgcggtg gccgaggcgc cggcgaaggc 1561 gtacaacccg ctgttcatct acggcgagtc cggcctgggc aagacgcacc tgctgcacgc 1621 gatcgggcac tacgcccgcc gcctctaccc gggcttgcgg gtgcggtacg tgaactcgga 1681 ggagttcacc aacgacttca tcaactccat ccgccacgac gagggcgcct cgttcaagca 1741 ggtctaccgc aacgtggaca tcctgctgat cgacgacatc cagttcctgg cggacaagga 1801 ggcgacggtc gaggagttct tccacacctt caacacgctc tacaacaaca acaagcaggt 1861 ggtcatcacc tcggacctgc cgcccaagca gctctccggg ttcgaggacc ggctgcgctc 1921 ccgcttcgag tgggggctga tcacggacat ccagccgccg gacctcgaga cgcgcatcgc 1981 gatcctccgg aagaaggcgg aggccgaggg gctcgtggcc ccgccggagg cgctggagta 2041 catcgcctcg cgcatctcca cgaacatccg cgagctcgag ggtgcgctga tccgcgtgac 2101 cgcgttcgcc tcgctcaacc ggcagaccgt ggacatcgag ctggccgagc atgtgctcaa 2161 ggacctgatc acggacgaga cggcgcacga gatcacgccg gagctgatcc tgcacgccac 2221 gggggagtac ttcaatctca ccctcgagga gctgaccagc aagtcccgca cccgcacgct 2281 ggtgacggcg cggcagatcg ccatgtacct gctgcgcgag ctgaccgaga tgtcgctgcc 2341 caagatcgga caggtcctgg gcggccgcga ccacaccacc gtcatccacg cggaccgcaa 2401 gatccgcgag ctgatggccg agcggcggac gatctacaac caggtcaccg agctcaccaa 2461 cgagatcaag cggaaacagc gcggcgcctg agccacccgc cgtcccaccc tcgagtcccg 2521 gttcgaacct tcgaccggga ctcgagcctt cgtccacatt gtccaccggt gtggacaact 2581 ctgtggagaa cccccgtcgc tgtggacaac ctccgcccct gtggatggcg gaatcaggcc 2641 gctcacgccg cgtccacacc cctgggggcg acgacggtgc acaggctggt gaatcacacg 2701 gatgccattc ggcggtccac acgtatccac agatctccac agccgaaaac cgcatgatct 2761 cgcgggttcc gacgggtggg gacgcaccct gtggacagga atgtgggcaa ctggggggtg 2821 gatccgggga cgagtctgga caactccggt cgaacctgtg cagagcaggg ggaggacctc 2881 cggggctcgt ccacaccgga tgcacacccc gaacgagcgg gcgtccacac ggttccccca 2941 accggatccg catgctgacg cggcagaacg gcgttctcca caggatccac acccgctacc 3001 actccttctg atcccccttc agagattccg ccaagcaaca tcatctccgc cgcctcgcac 3061 cgagctgagg gccccgagag cccgggctgc gcccgagcac tcagaatcct tggaatcaca 3121 cgcttgtcat tcgcttccga cctgacaccc gccgcgcgga ccgcctatgc tgaggcccgc 3181 gtgtcgccgt tcccccgggc ccgcaacccc cgtcatcacc cccagagagg cagtgcagct 3241 cgtgaagttc accgtcgaac gcgacatcct caccgacgcc gtctcctggg ccgcccgctc 3301 cctgtccccg cgcccgccgg tgccggtgct ctcgggtctg ctgatcacgg ccgaggccgg 3361 cgtcgtgtcc atcgcgagct tcgactacga gacctccgcg cgcctggaga tcgaggcgga 3421 cgtcgagacc gcgggccagg tgctcgtctc cggccgtctg ctgaacgaca tcgtccgctc 3481 cctgccgcag gcccaggtca cggtggagct cgacggcggc aaggtgctcg tcacctgccg 3541 atcctcccgc ttctccctgg ccacgatgcc cgtgggcgac tacccggccc tgcccgagct 3601 gccggccccc gcaggcaccg tggacggggc cgcgttcgcc cacgccgtcg cccaggtcac 3661 cgtcgccgcc tccaaggacg acaccctgcc gatcctcacc gccgtgaagg tggagatcga 3721 gggcgacacc ataaccttcc tggccacgga ccgctaccgg ctggccatga aggagatccg 3781 ctggaccccg gcggacccgt ccatctcgac gtcgctgctc atcaaggcgc gcacgctcac 3841 cgaggtcgcc aagtccctcg gctccggcgg cgacctggag atcctcctcg gtcagaccgc 3901 cgacctcgtg ggcttcgcct ccggtggccg ccgcaccacc tccgtgctcg tggacggcga 3961 gtaccccaag atccgctcgc tcttccccga gtccagcccc atccaggccg tggtggacac 4021 cgccgccctg gtcgaggcgt cccgccgcgt ggccctcgtg gccgagcgca acaccgccct 4081 gcgcatggtg ttcacggagg gccaggtcac cctggacgcc ggcaccggcg atgacgcgag 4141 cgccaacgag tccgtgccgt gcacgctcga g // LOCUS HUMLYAM1 1192 bp ds-DNA PRI 17-JUL-1990 DEFINITION Human leukocyte adhesion molecule-1 (LAM-1), exon 1. ACCESSION M32406 KEYWORDS leukocyte adhesion molecule-1. SEGMENT 1 of 9 SOURCE Human mature lymphocyte DNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 1192) AUTHORS Ord,D.C., Ernst,T.J., Zhou,L.-J., Rambaldi,A., Spertini,O., Griffin,J. and Tedder,T.F. TITLE Structure of the gene encoding the human leukocyte adhesion molecule-1 (TQ1, Leu-8) of lymphocytes and neutrophils JOURNAL J. Biol. Chem. 265, 7760-7767 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by T.F.Tedder, 01-MAR-1990. FEATURES from to/span description pept 1040 + 1042 leukocyte adhesion molecule-1 (LAM-1), exon 1 mRNA 931 > 1192 leukocyte adhesion molecule-1 mRNA IVS 1043 > 1192 leukocyte adhesion molecule-1 intron A BASE COUNT 410 a 185 c 326 g 271 t ORIGIN Chromosome 1q22-25. 1 agagagctgt tataaagatt aaacaatata ataaatatgg cgcgtgagct tcagagtttt 61 tgttgttgtt attattattt tcctaaaaat gcaaatctga tttgcatttg actcattgac 121 tcacatcagt gggtcttcct ttttattgtc cttcatcata tgggtcctaa tttcacatgc 181 agtcttataa aaccatctca ttttatagtc caagaatatt aaaggtactt gtaggctccc 241 aaacctacac ggtgaaaagc tagagagcat gggctctctt caggggttaa cttcaggaag 301 tgccactaac aaggacgtcc actaggtggt gagcaaggaa agacggaggt gaaggaaccg 361 aaacgagtca agtccactgc ttagctctac tgaagttttg caaacatcat aaatatgtct 421 gaaatgcagt tttgatttgt agtatttgca atttccaagg gccatttacc acaggtagcc 481 aagagttagt ttagcattta tgaaaaagat aggggagggt ggtggttaag aaggaggtgg 541 aggagagagt gaaggaggaa gaggagaaca agaaccaaac aaaaacaaga acaagaacaa 601 gtagaagaag aggagcaggg aggaaaaaga agaggaagaa gaacagcaac aacaatgagt 661 gaaggaggag gagggtaagg aaagatgcat aggagaatgg aaggaaggat agaaaggagg 721 gaaggaagag agaatctagt cacattactt tctgatcagc agttcatttt tgtctcagtg 781 ggaggcaata gaggccagtc taggaaaggg gtggggaaag aggaaagaga agtgcaggag 841 gaaggggagg cccaagggga ggaggaggag gatgtgagac tgggttagag aaatgaaaga 901 aagcaaggct ttctgttgac attcagtgca gtctacctgc agcacagcac actccctttg 961 ggcaaggacc tgagaccctt gtgctaagtc aagaggctca atgggctgca gaagaactag 1021 agaaggacca agcaaagcca tggtgagcct ttcagcctaa aagacgttta gatgctcaga 1081 tagaaactct tggggttgta gaggcaggtg gcaaggatag gaatcacccc atttcaattc 1141 tggttttaaa taatatagaa actaaacatt ttctcagacc ctcaaaaaaa gt // LOCUS HUMLYAM2 363 bp ds-DNA PRI 17-JUL-1990 DEFINITION Human leukocyte adhesion molecule-1 (LAM-1), exon 2. ACCESSION M32407 KEYWORDS leukocyte adhesion molecule-1. SEGMENT 2 of 9 SOURCE Human mature lymphocyte DNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 363) AUTHORS Ord,D.C., Ernst,T.J., Zhou,L.J., Rambaldi,A., Spertini,O., Griffin,J. and Tedder,T.F. TITLE Structure of the gene encoding the human leukocyte adhesion molecule-1 (TQ1, Leu-8) of lymphocytes and neutrophils JOURNAL J. Biol. Chem. 265, 7760-7767 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by T.F.Tedder, 01-MAR-1990. FEATURES from to/span description pept + 229 + 310 leukocyte adhesion molecule-1 (LAM-1), exon 2 mRNA < 1 > 363 leukocyte adhesion molecule-1 mRNA IVS < 1 228 leukocyte adhesion molecule-1 intron A IVS 311 > 363 leukocyte adhesion molecule-1 intron B BASE COUNT 115 a 51 c 69 g 128 t ORIGIN Chromosome 1q22-25. 1 cactgagact aagcgtaaaa taaatagaac aaacaaactg tgcatcagtt ctgatgtaaa 61 tttgaagtaa ttttcatcta tgtctgagaa acctgttacc tcagacaggg ttagtagaca 121 tatgtgtttt attctgatta ttaagaaagt tgtaagcacc acctcaaagg ctataaatgt 181 gtggtttaag ggtatacatc taaatataat tttgtatttc atttgcagat atttccatgg 241 aaatgtcaga gcacccagag ggacttatgg aacatcttca agttgtgggg gtggacaatg 301 ctctgttgtg gtatgttatg atatttatat atcactaagt ctattttact tatattcatt 361 ttt // LOCUS HUMLYAM3 531 bp ds-DNA PRI 17-JUL-1990 DEFINITION Human leukocyte adhesion molecule-1 (LAM-1), exon 3. ACCESSION M32408 KEYWORDS leukocyte adhesion molecule-1. SEGMENT 3 of 9 SOURCE Human mature lymphocyte DNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 531) AUTHORS Ord,D.C., Ernst,T.J., Zhou,L.J., Rambaldi,A., Spertini,O., Griffin,J. and Tedder,T.F. TITLE Structure of the gene encoding the human leukocyte adhesion molecule-1 (TQ1, Leu-8) of lymphocytes and neutrophils JOURNAL J. Biol. Chem. 265, 7760-7767 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by T.F.Tedder, 01-MAR-1990. FEATURES from to/span description pept + 73 + 459 leukocyte adhesion molecule-1 (LAM-1), exon 3 mRNA < 1 > 531 leukocyte adhesion molecule-1 mRNA IVS < 1 72 leukocyte adhesion molecule-1 intron B IVS 460 > 531 leukocyte adhesion molecule-1 intron C BASE COUNT 172 a 106 c 138 g 115 t ORIGIN Chromosome 1q22-25. 1 ctggagtagt gctaggttct ttttagctgt aacattatgt aagtctgcat aggtcacact 61 gatgtcttgc agatttcctg gcacatcatg gaaccgactg ctggacttac cattattctg 121 aaaaacccat gaactggcaa agggctagaa gattctgccg agacaattac acagatttag 181 ttgccataca aaacaaggcg gaaattgagt atctggagaa gactctgcct ttcagtcgtt 241 cttactactg gataggaatc cggaagatag gaggaatatg gacgtgggtg ggaaccaaca 301 aatctctcac tgaagaagca gagaactggg gagatggtga gcccaacaac aagaagaaca 361 aggaggactg cgtggagatc tatatcaaga gaaacaaaga tgcaggcaaa tggaacgatg 421 acgcctgcca caaactaaag gcagccctct gttacacagg tagggagtga caagacggct 481 atgctgcctc agactcagga agggccacgg ttaagagaat actcagattt a // LOCUS HUMLYAM4 832 bp ds-DNA PRI 17-JUL-1990 DEFINITION Human leukocyte adhesion molecule-1 (LAM-1), exon 4. ACCESSION M32409 KEYWORDS leukocyte adhesion molecule-1. SEGMENT 4 of 9 SOURCE Human mature lymphocyte DNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 832) AUTHORS Ord,D.C., Ernst,T.J., Zhou,L.J., Rambaldi,A., Spertini,O., Griffin,J. and Tedder,T.F. TITLE Structure of the gene encoding the human leukocyte adhesion molecule-1 (TQ1, Leu-8) of lymphocytes and neutrophils JOURNAL J. Biol. Chem. 265, 7760-7767 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by T.F.Tedder, 01-MAR-1990. FEATURES from to/span description pept + 161 + 268 leukocyte adhesion molecule-1 (LAM-1), exon 4 mRNA < 1 > 832 leukocyte adhesion molecule-1 mRNA IVS < 1 160 leukocyte adhesion molecule-1 intron C IVS 269 > 832 leukocyte adhesion molecule-1 intron D BASE COUNT 241 a 163 c 150 g 278 t ORIGIN Chromosome 1q22-25. 1 aaaattttag ccatatgatt tttatgctat gaatttacca aataaacctt tcctgattat 61 ttaaatcatc tcagacaaaa ggttatctat gtctaaagaa atgactttga gtactaaaat 121 gtaatcacat taaaatattt tttttctgac ctccttaaag cttcttgcca gccctggtca 181 tgcagtggcc atggagaatg tgtagaaatc atcaataatt acacctgcaa ctgtgatgtg 241 gggtactatg ggccccagtg tcagtttggt aagtctcttt cctttctttg cttcttctta 301 ggtaaagtca caggaatcat tatagcttat catgaagctg gttggaacaa aatgatacta 361 gccactctga gaaatgggaa gttttgatca gaaagctctg ctttcacaat attgttacct 421 ttccgtaaag atttcataag tcagcatgaa gtttcgattc acttctcaac aagtcttttt 481 gagtaccaca agaagcacag tgttgggata aagctgtcag ggttacaata aggaattagc 541 atggtagatt cccgctctca agaagctcac gatctaatga gcttgttaga ttaattagaa 601 ctctaaggtc tggaagaaac tatgccattt atcattagga ggctgagtta cccagaaagt 661 atcttgcttt ttccttctag tagttccttt ccttcttgca gttctccaca cttaacacat 721 gtgctctgta gcacactgac tttgctggtg gccttctctc tcattttgca catggccaaa 781 aaacatgtca tctttaagac attgttcaaa gacagtttct tctaggaagc tt // LOCUS HUMLYAM5 712 bp ds-DNA PRI 17-JUL-1990 DEFINITION Human leukocyte adhesion molecule-1 (LAM-1), exon 5. ACCESSION M32410 KEYWORDS leukocyte adhesion molecule-1. SEGMENT 5 of 9 SOURCE Human mature lymphocyte DNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 712) AUTHORS Ord,D.C., Ernst,T.J., Zhou,L.J., Rambaldi,A., Spertini,O., Griffin,J. and Tedder,T.F. TITLE Structure of the gene encoding the human leukocyte adhesion molecule-1 (TQ1, Leu-8) of lymphocytes and neutrophils JOURNAL J. Biol. Chem. 265, 7760-7767 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by T.F.Tedder, 01-MAR-1990. FEATURES from to/span description pept + 377 + 562 leukocyte adhesion molecule-1 (LAM-1), exon 5 mRNA < 1 > 712 leukocyte adhesion molecule-1 mRNA IVS < 1 376 leukocyte adhesion molecule-1 intron D IVS 563 > 712 leukocyte adhesion molecule-1 intron E BASE COUNT 200 a 145 c 139 g 228 t ORIGIN Chromosome 1q22-25. 1 ctctgatgtg atagttattt cccgactaag ctggtcattc ccagttacac ctatttggct 61 ttaaggattc tcactacaga taatactgaa gataataata tgaagactag ctaatgttta 121 cttagaattt ctgatgagtc aggctttgtt ctaacgtcct tgacttatgc taattgaatt 181 acatttagtt tccatatcaa tttgataaag ataacacaat ttcattattc ctcttatata 241 gatgaagaaa ctgaagttgg agggttcaag taaccttgtt taaaggcaca tggttatcaa 301 gtggcagggc taggattcaa atccaggcgt cagttcctct taactcttcc ccatactgtt 361 tctttcccta ttgaagtgat tcagtgtgag cctttggagg ccccagagct gggtaccatg 421 gactgtactc accctttggg aaacttcagc ttcagctcac agtgtgcctt cagctgctct 481 gaaggaacaa acttaactgg gattgaagaa accacctgtg gaccatttgg aaactggtca 541 tctccagaac caacctgtca aggtgagtaa cttcagacta gaggttttgt catgcaatcc 601 tgggcttaca gtcagaacat tcagtagaag tttgctgaga agtcaaactt aggatcctaa 661 tttaacctaa cttttgttta acctactgtg atgtttctca aaggacttat tc // LOCUS HUMLYAM6 451 bp ds-DNA PRI 17-JUL-1990 DEFINITION Human leukocyte adhesion molecule-1 (LAM-1), exon 6. ACCESSION M32411 KEYWORDS leukocyte adhesion molecule-1. SEGMENT 6 of 9 SOURCE Human mature lymphocyte DNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 451) AUTHORS Ord,D.C., Ernst,T.J., Zhou,L.J., Rambaldi,A., Spertini,O., Griffin,J. and Tedder,T.F. TITLE Structure of the gene encoding the human leukocyte adhesion molecule-1 (TQ1, Leu-8) of lymphocytes and neutrophils JOURNAL J. Biol. Chem. 265, 7760-7767 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by T.F.Tedder, 01-MAR-1990. FEATURES from to/span description pept + 240 + 425 leukocyte adhesion molecule-1 (LAM-1), exon 6 mRNA < 1 > 451 leukocyte adhesion molecule-1 mRNA IVS < 1 239 leukocyte adhesion molecule-1 intron E IVS 426 > 451 leukocyte adhesion molecule-1 intron F BASE COUNT 121 a 107 c 89 g 134 t ORIGIN Chromosome 1q22-25. 1 gagggtcacc ttagctaggg cagcagcctg gagtagctac tcctctcccc acagctttca 61 atgcttcctt gccttcatct ctcattcacc acccaccatc attctcaaga aaataaagcc 121 tggaagcaat atcacaagta atgtagtcag gcagctttgg ctaaaaatcc aaagctcaag 181 ggagggtctc tactcagaaa tactgttttg tctttttttt tttttctttt tcattgaagt 241 gattcagtgt gagcctctat cagcaccaga tttggggatc atgaactgta gccatcccct 301 ggccagcttc agctttacct ctgcatgtac cttcatctgc tcagaaggaa ctgagttaat 361 tgggaagaag aaaaccattt gtgaatcatc tggaatctgg tcaaatccta gtccaatatg 421 tcaaagtgag taagtttgtc ctggaactga a // LOCUS HUMLYAM7 543 bp ds-DNA PRI 17-JUL-1990 DEFINITION Human leukocyte adhesion molecule-1 (LAM-1), exon 7. ACCESSION M32412 KEYWORDS leukocyte adhesion molecule-1. SEGMENT 7 of 9 SOURCE Human mature lymphocyte DNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 543) AUTHORS Ord,D.C., Ernst,T.J., Zhou,L.J., Rambaldi,A., Spertini,O., Griffin,J. and Tedder,T.F. TITLE Structure of the gene encoding the human leukocyte adhesion molecule-1 (TQ1, Leu-8) of lymphocytes and neutrophils JOURNAL J. Biol. Chem. 265, 7760-7767 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by T.F.Tedder, 01-MAR-1990. FEATURES from to/span description pept + 300 + 428 leukocyte adhesion molecule-1 (LAM-1), exon 7 mRNA < 1 > 543 leukocyte adhesion molecule-1 mRNA IVS < 1 299 leukocyte adhesion molecule-1 intron F IVS 429 > 543 leukocyte adhesion molecule-1 intron G BASE COUNT 174 a 99 c 122 g 148 t ORIGIN Chromosome 1q22-25. 1 tatcagaact aagaaagctt gggctgcagg tcgactctag gtgcattttc aggaactcta 61 tgaaccacaa atctgggcat tgagattctg taggcattag actagcaagg ctggtcagtc 121 tttgcctatg ctgtagactc atcaggggcc ttcccatgcc agtttcctca tctgtcaaat 181 ggcatcattt gggctactac tgggagatgt aaggaggaaa aaagtcaaat atcatgagat 241 agactaagga aataatgctg gtggtctcat gctatgtgcc ttactgattt ctctttcaga 301 attggacaaa agtttctcaa tgattaagga gggtgattat aaccccctct tcattccagt 361 ggcagtcatg gttactgcat tctctgggtt ggcatttatc atttggctgg caaggagatt 421 aaaaaaaggt atgtgagttt aacttcacat gaaaagaaca caactttaaa gtgaaaaaga 481 aaaaaaaaag aaacccacag gaaattaaat gtgatagatc aacacaagca ggatgccaag 541 ctt // LOCUS HUMLYAM8 524 bp ds-DNA PRI 17-JUL-1990 DEFINITION Human leukocyte adhesion molecule-1 (LAM-1), exon 8. ACCESSION M32413 KEYWORDS leukocyte adhesion molecule-1. SEGMENT 8 of 9 SOURCE Human mature lymphocyte DNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 524) AUTHORS Ord,D.C., Ernst,T.J., Zhou,L.J., Rambaldi,A., Spertini,O., Griffin,J. and Tedder,T.F. TITLE Structure of the gene encoding the human leukocyte adhesion molecule-1 (TQ1, Leu-8) of lymphocytes and neutrophils JOURNAL J. Biol. Chem. 265, 7760-7767 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by T.F.Tedder, 01-MAR-1990. FEATURES from to/span description pept + 380 + 398 leukocyte adhesion molecule-1 (LAM-1), exon 8 mRNA < 1 > 524 leukocyte adhesion molecule-1 mRNA IVS < 1 379 leukocyte adhesion molecule-1 intron G IVS 399 > 524 leukocyte adhesion molecule-1 intron H BASE COUNT 146 a 104 c 103 g 171 t ORIGIN Chromosome 1q22-25. 1 tagtttacag tattagcagc tgtccctcaa ggaagaatct gcaggtagat gagatgcaga 61 ttgggtggga taaacacttg aatgacatat tgggtcttgc caccaggcaa tttagcaatt 121 ctgtcttctt gagtagcacg gagatggaat ggaacctcag gaggcatctg catcaacatg 181 tctgttctgt attagtgtct accactgttt attaagccag ttcctcaaat ctcctttgac 241 acagataggg tccacctaac aaatacctaa tatacttcaa aagacagttt tgagagtggg 301 agtcttcctt ctcccttact tgaaaaactt taaattgtct aatttttgct aatgcctttt 361 tctctatttt ctatttcagg caagaaatcc aagagaaggt aagttttatt agtggcgagg 421 agtttccaca tctgctgatt cattctctac ttcttaagtt acttctgctc tagctagaca 481 catacccata gtagttatta ctgggtctat caatgacaga tagg // LOCUS HUMLYAM9 1696 bp ds-DNA PRI 17-JUL-1990 DEFINITION Human leukocyte adhesion molecule-1 (LAM-1), exon 9. ACCESSION M32414 KEYWORDS leukocyte adhesion molecule-1. SEGMENT 9 of 9 SOURCE Human mature lymphocyte DNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 1696) AUTHORS Ord,D.C., Ernst,T.J., Zhou,L.J., Rambaldi,A., Spertini,O., Griffin,J. and Tedder,T.F. TITLE Structure of the gene encoding the human leukocyte adhesion molecule-1 (TQ1, Leu-8) of lymphocytes and neutrophils JOURNAL J. Biol. Chem. 265, 7760-7767 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by T.F.Tedder, 01-MAR-1990. FEATURES from to/span description pept + 310 328 leukocyte adhesion molecule-1 (LAM-1), exon 9 mRNA < 1 1414 leukocyte adhesion molecule-1 mRNA IVS < 1 309 leukocyte adhesion molecule-1 intron H BASE COUNT 443 a 387 c 292 g 574 t ORIGIN Chromosome 1q22-25. 1 ataagcatca ctaaagagct tgttaggggt gcagaatctc aggctccact cagacctact 61 gaatcagagt ctgcatttta acaccatctc tgagtggtaa ggacatgaaa atctgagaag 121 tgctgctact agggtttgct tacatttgtt catcttcaga ggttcctaaa gcctggcctc 181 ttgtctgaga tttccagctg aaagcatttc cttgctcctc ttctcatctc taatgaatat 241 ttacctttac tactaacact ccaagttttg caatttttaa actcttatta tcttttgttt 301 ttctttcagt atgaatgacc catattaaat cgcccttggt gaaagaaaat tcttggaata 361 ctaaaaatca tgagatcctt taaatccttc catgaaacgt tttgtgtggt ggcacctcct 421 acgtcaaaca tgaagtgtgt ttccttcagt gcatctggga agatttctac ctgaccaaca 481 gttccttcag cttccatttc acccctcatt tatccctcaa cccccagccc acaggtgttt 541 atacagctca gctttttgtc ttttctgagg agaaacaaat aagaccataa agggaaagga 601 ttcatgtgga atataaagat ggctgacttt gctctttctt gactcttgtt ttcagtttca 661 attcagtgct gtacttgatg acagacactt ctaaatgaag tgcaaatttg atacatatgt 721 gaatatggac tcagttttct tgcagatcaa atttcgcgtc gtcttctgta tacgtggagg 781 tacactctat gaagtcaaaa gtctacgctc tcctttcttt ctaactccag tgaagtaatg 841 gggtcctgct caagttgaaa gagtcctatt tgcactgtag cctcgccgtc tgtgaattgg 901 accatcctat ttaactggct tcagcctccc caccttcttc agccacctct ctttttcagt 961 tggctgactt ccacacctag catctcatga gtgccaagca aaaggagaga agagagaaat 1021 agcctgcgct gttttttagt ttgggggttt tgctgtttcc ttttatgaga cccattccta 1081 tttcttatag tcaatgtttc ttttatcacg atattattag taagaaaaca tcactgaaat 1141 gctagctgca actgacatct ctttgatgtc atatggaaga gttaaaacag gtggagaaat 1201 tccttgattc acaatgaaat gctctccttt cccctgcccc cagacctttt atccacttac 1261 ctagattcta catattcttt aaatttcatc tcaggcctcc ctcaacccca ccacttcttt 1321 tataactagt cctttactaa tccaacccat gatgagctcc tcttcctggc ttcttactga 1381 aaggttaccc tgtaacatgc aattttgcat ttgaataaag cctgcttttt aagtgttaac 1441 tagtttgcct agtttgttat tttgaaaatt gatcatatgt tttgttttct ccccagtgag 1501 ttacatgctc cttcagggca gagtttgtgt cagatccctg gagtatctag tgcattactt 1561 gacactcaat aaatgaatgt tcaaataaat cagaaagagc atacagtgca ctgctgatat 1621 aagtttcagc atccctcttt ctctatggca tctgatgacc tgggtcagat atcacctaat 1681 gtcaacagct gaattc // LOCUS RATALADPS 1612 bp ds-DNA ROD 17-JUL-1990 DEFINITION Rat delta-aminolevulinate dehydratase processed pseudogene, complete cds. ACCESSION J04764 KEYWORDS delta-aminolevulinate dehydratase; pseudogene. SOURCE Rat (strain Sprague-Dawley) adult liver DNA, clone lambda 27D. ORGANISM Rattus norvegicus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 1612) AUTHORS Bishop,T.R., Frelin,L.P. and Boyer,S.H. TITLE Isolation and characterization of a rat delta-aminolevulinate dehydratase processed pseudogene JOURNAL Genomics (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by T.R.Bishop, 20-APR-1990. FEATURES from to/span description pept.ps 271 1264 pseudo-delta-aminolevulinate dehydratase rpt 101 108 8 bp flanking direct repeat rpt 1385 1392 8 bp flanking direct repeat BASE COUNT 404 a 411 c 418 g 379 t ORIGIN 1 tagatgttta ctctcctaca tgagggataa ttgtctataa tgaatgtcac tcctgtatgt 61 gggattcata aagtctatgt gaggagagaa ggacatgtct taaatctcat tcaggggctt 121 gtgtttttag agcgggagag cgcggtgtcc agaatcccgg cttggagcgg tggtgaggca 181 gctgagcagc ttctctgagc tgaggctcag ggccactcca ggggcctaga tgcaatctta 241 gcagactgct gtgttccgcc ggcccccacc atgcatcatc agtctgttct gcacagaggc 301 tactttcacc cactgcttcg ggcctggcaa gaccaccgcc tccaccgtca gtaccaccaa 361 cctcatctat cccatcttta acacggatat ccctgatgat gtccagccca ttgccagcct 421 cccgggagtg gccaggtatg gcctaaacca gctagaggag atgctgagat ctctggtgga 481 agctggcctg cgctgtgtcc tgatctttgg tgtccccagc agagttccca aggatgaaca 541 gggctctctg cagctgactc agaggactcc ccaactattg aggctgtctg tctgctgagg 601 aagaccttcc ctaccctcct agtggcctgt gatgtctgct tgtgccccta cacctcccat 661 ggccactgtg gcctcctgag tgaaaatgga gcgttcctag cagaggaaag ctgacagcgg 721 ttggcagagg tggcactggc ctatgccaag gaaggctgtc aggttgtagc tccatcagat 781 atgatggatg gacgagttga agccatcaag gctaccctgc taaaacatgg acttggcaac 841 agggtctctg tgatgagcta tagtgccaaa tttgcctcct gtttctatgg tcctttccat 901 gatacagctc agtcaagccc agcttttgga gaccgacgct gttatcagct gcctcctgga 961 gccggtgcca ggccctctga gcagtggcct gggacattca agagggagct gacataatca 1021 tggtgaatcc gggattgccc tacctggata tggtgcagga ggtaaaggac aagcacccct 1081 cgctccctct cgcagtatac caggtgtctg aagagtttgc catgtgtggc agggagccaa 1141 ggctggggcc tttgatctca gtactgctgt actagagtcc atgacagcct tccgcagagc 1201 cggtgccgac attatcatca cctactatgc accccagctg ttgagtgggc tgaaggaaga 1261 gtgaaggaaa gtgaaggact tggtcttgac agagcgctcc ctgggcctta cagaagggaa 1321 aaagtaaacg cgctgttaga accgaaaaaa caaacaaaca aacaaaaacc ctctttcaga 1381 atcaaggaaa tctcaaccca ataggtcaca aagtcataat tacaagagtg aagagttcta 1441 gtcatctggg ttcctgccct tcaggatcag caggaataga gatgtaactt tacaagtctt 1501 caagaagtag actacatctt atgtacatga aacaccacat ctttctagaa aatgagactg 1561 aggcataaga caccacaaca gaaacactgt gttactgtca tgctgtgatc at // LOCUS BOVPDEAP 585 bp ss-mRNA MAM 17-JUL-1990 DEFINITION Bovine cone photoreceptor cyclic nucleotide phosphodiesterase alpha'-subunit (PDE), partial cds. ACCESSION M33140 M29465 KEYWORDS cone photoreceptor cyclic nucleotide phosphodiesterase. SOURCE Bovine dark-adapted frozen retina, cDNA to mRNA, clone BC-alpha-1. ORGANISM Bos taurus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Artiodactyla; Ruminantia; Pecora; Bovidae. REFERENCE 1 (bases 1 to 585) AUTHORS Charbonneau,H., Prusti,R.K., LeTrong,H., Sonnenburg,W.K., Mullaney,P.J., Walsh,K.A. and Beavo,J.A. TITLE Identification of a noncatalytic cGMP-binding domain conserved in both the cGMP-stimulated and photoreceptor cyclic nucleotide phosphodiesterases JOURNAL Proc. Natl. Acad. Sci. U.S.A. 87, 288-292 (1990) STANDARD simple staff_review FEATURES from to/span description pept < 1 > 585 cone photoreceptor cyclic nucleotide phosphodiesterase alpha'-subunit (AA at 1) BASE COUNT 198 a 115 c 126 g 146 t ORIGIN 1 agagaagtca tcttttataa aatcatcgat tacattttac atggaaaaga agagatcaaa 61 gtcattccga cacctcccat ggaccactgg actctcatta gtgggttgcc aacatatgtt 121 gctgaaaatg gatttatctg caacatgctg aacgccccgg cggatgaata cttcacgttt 181 cagaaaggac ctgtagatga aactggctgg gtcattaaaa atgtcttgtc cctgcctatt 241 gtcaacaaaa aggaagacat cgtgggcgta gctacatttt acaacaggaa ggatggaaag 301 ccttttgatg aatatgatga gcacatcgct gagactctca cacagtttct tggatggtct 361 ctcttaaata ctgacaccta tgagaaaatg aataagctgg agaacagaaa ggacatagcc 421 caggaaatgc tcatgaacca caccaaggct acacctgatg agatcaagtc tattttgaaa 481 tttaaagaga agttaaatat agatgtaatt gaagactgtg aagaaaaaca gcttgtcaca 541 attttgaagg aggacctgcc agacccacgg actgcagacc tgtat // LOCUS CHKG1CLSE 240 bp ds-DNA VRT 17-JUL-1990 DEFINITION Chicken delta-1-crystallin gene, intron 3 lens-specific enhancer cor segments B3 and B4. ACCESSION M33954 KEYWORDS delta-1-crystallin. SOURCE Chicken DNA. ORGANISM Gallus gallus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Aves; Neornithes; Neognathae; Galliformes; Phasianidae. REFERENCE 1 (bases 1 to 240) AUTHORS Goto,K., Okada,T.S. and Kondoh,H. TITLE Functional cooperation of lens-specific and nonspecific elements in the delta-1-crystallin enhancer JOURNAL Mol. Cell. Biol. 10, 958-964 (1990) STANDARD simple staff_review FEATURES from to/span description site 17 116 core segment B3 site 112 235 core segment B4 BASE COUNT 61 a 52 c 53 g 74 t ORIGIN 1 gtcagtgagg tgtgctcagc atgacctgcc ctcccaccct cttcagactg aacattcctg 61 aggaattgtt tcagtatgaa ttaggaatat tctttttcca atggcacttg ggatcccttt 121 gtgtctggct gcctgagtta gtagaagaca atgcacaata ttgtataggg gtgaagaaga 181 gtcagccact aagcactttt tctgaaatat tcattgttgt tgctcaccta ccatggacaa // LOCUS CHKOVAL 9206 bp ds-DNA VRT 17-JUL-1990 DEFINITION Chicken ovalbumin gene, complete cds. ACCESSION J00895 KEYWORDS ovalbumin. SOURCE Chicken oviduct DNA. ORGANISM Gallus gallus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Aves; Neornithes; Neognathae; Galliformes; Phasianidae. REFERENCE 1 (bases 1343 to 8906) AUTHORS Woo,S.L.C., Beattie,W.G., Catterall,J.F., Dugaiczyk,A., Staden,R., Brownlee,G.G. and O'Malley,B.W. TITLE Complete nucleotide sequence of the chicken chromosomal ovalbumin gene and its biological significance JOURNAL Biochemistry 20, 6437-6446 (1981) STANDARD full staff_review REFERENCE 2 (bases 1043 to 1562; 2675 to 4732; 8885 to 9206) AUTHORS Benoist,C., O'Hare,K., Breathnach,R. and Chambon,P. TITLE The ovalbumin gene-sequence of putative control regions JOURNAL Nucleic Acids Res. 8, 127-142 (1980) STANDARD full staff_review REFERENCE 3 (bases 1357 to 1389; 2941 to 3052; and ivs junctions) AUTHORS Breathnach,R., Benoist,C., O'Hare,K., Gannon,F. and Chambon,P. TITLE Ovalbumin gene: evidence for a leader sequence in mRNA and DNA sequences at the exon-intron boundaries JOURNAL Proc. Natl. Acad. Sci. U.S.A. 75, 4853-4857 (1978) STANDARD full staff_review REFERENCE 4 (bases 1282 to 1420; 2952 to 2999) AUTHORS Gannon,F., O'Hare,K., Perrin,F., LePennec,J.P., Benoist,C., Cochet,M., Breathnach,R., Royal,A., Garapin,A., Cami,B. and Chambon,P. TITLE Organisation and sequences at the 5' end of a cloned complete ovalbumin gene JOURNAL Nature 278, 428-434 (1979) STANDARD full staff_review REFERENCE 5 (bases 1343 to 8906; exons only) AUTHORS McReynolds,L., O'Malley,B.W., Nisbet,A.D., Fothergill,J.E., Givol,D., Fields,S., Robertson,M. and Brownlee,G.G. TITLE Sequence of chicken ovalbumin mRNA JOURNAL Nature 273, 723-728 (1978) STANDARD full staff_review REFERENCE 6 (bases 2675 to 5042) AUTHORS Robertson,M.a., Staden,R., Tanaka,Y., Catterall,J.F., O'Malley,B.W. and Brownlee,G.G. TITLE Sequence of three introns in the chick ovalbumin gene JOURNAL Nature 278, 370-372 (1979) STANDARD full staff_review REFERENCE 7 (bases 1 to 1042) AUTHORS Heilig,R., Muraskowsky,R. and Mandel,J.L. TITLE The ovalbumin gene family: The 5' end region of the X and Y genes JOURNAL J. Mol. Biol. 156, 1-19 (1982) STANDARD full staff_review REFERENCE 8 (bases 5576 to 5624) AUTHORS Lai,E.C., Woo,S.L.C., Dugaiczyk,A. and O'Malley,B.W. TITLE The ovalbumin gene: Alleles created by mutations in the intervening sequences of the natural gene JOURNAL Cell 16, 201-211 (1979) STANDARD simple staff_entry REFERENCE 9 (bases 443 to 611) AUTHORS Schweers,L.A., Frank,D.E., Weigel,N.L. and Sanders,M.M. TITLE The steroid-dependent regulatory element in the ovalbumin gene does not function as a typical steroid response element JOURNAL J. Biol. Chem. 265, 7590-7595 (1990) STANDARD simple staff_entry COMMENT Eight exons reported. Sequence homologies with adenovirus early 1a, adenovirus major late, chicken conalbumin, chkx and chky genes noted for 5' flanking sequence. OV1.3 is identical to OV1.8 except that intron E is shorter by 522 nucleotides [8]. FEATURES from to/span description pept 2996 3163 ovalbumin, exon 1 3415 3465 ovalbumin, exon 2 4047 4175 ovalbumin, exon 3 4576 4693 ovalbumin, exon 4 5652 5794 ovalbumin, exon 5 6126 6281 ovalbumin, exon 6 7864 8259 ovalbumin, exon 7 pre-msg 1343 8906 oval mRNA and introns IVS 1390 2978 oval intron A IVS 3164 3414 oval intron B IVS 3466 4046 oval intron C IVS 4176 4575 oval intron D IVS 4694 5651 oval intron E IVS 5795 6125 oval intron F IVS 6282 7863 oval intron G allele 1282 1282 g may be c [1] allele 1309 1309 g may be a [1] allele 1376 1376 g may be c [1],[7] allele 1385 1385 g may be a [1],[7] allele 1393 1393 a may be g [1] allele 3010 3010 t may be c [1] allele 3154 3154 a may be g [1] allele 5747 5747 g may be a [1] allele 8032 8032 can be g [5] conflict 1471 1471 c in [1]; t in [2],[7] conflict 1523 1523 a in [1]; t in [2],[7] conflict 1538 1539 at in [1]; a in [2] conflict 2681 2681 t in [1],[6]; g in [2] conflict 2696 2696 a in [1],[6]; g in [2] conflict 3552 3552 a in [1],[6]; g in [2] conflict 3683 3683 c in [1],[6]; t in [2] conflict 3934 3934 a in [1],[6]; c in [2] conflict 3960 3960 a in [1],[6]; g in [2] conflict 3994 3994 t in [1],[6]; c in [2] conflict 4207 4208 tt in [1],[6]; ttt in [2] conflict 4396 4396 t in [1],[6]; c in [2] conflict 5636 5636 c in [1]; g in [3] conflict 6100 6100 t in [1]; tc in [3] conflict 6119 6119 t in [1]; tg in [3] conflict 8492 8505 gactcacagtactg in [1]; g in [5] site 443 611 steroid-dependent regulatory element [9] BASE COUNT 2994 a 1749 c 1721 g 2742 t ORIGIN 5 bp upstream of PstI site. 1 ctgcagactg acatgcattt cataggtaga gataacattt actgggaagc acatctatca 61 tcataaaaag caggcaagat tttcagactt tcttagtggc tgaaatagaa gcaaaagacg 121 tgattaaaaa caaaatgaaa caaaaaaaat cagttgatac ctgtggtgta gacatccagc 181 aaaaaaatat tatttgcact accatcttgt cttaagtcct cagacttggc aaggagaatg 241 tagatttcta cagtatatat gttttcacaa aaggaaggag agaaacaaaa gaaaatggca 301 ctgactaaac ttcagctagt ggtataggaa agtaattctg cttaacagag attgcagtga 361 tctctatgta tgtcctgaag aattatgttg tacttttttc ccccattttt aaatcaaaca 421 gtgctttaca gaggtcagaa tggtttcttt actgtttgtc aattctatta tttcaataca 481 gaacaatagc ttctataact gaaatatatt tgctattgta tattatgatt gtccctcgaa 541 ccatgaacac tcctccagct gaatttcaca attcctctgt catctgccag gccattaagt 601 tattcatgga agatctttga ggaacactgc aagttcatat cataaacaca tttgaaattg 661 agtattgttt tgcattgtat ggagctatgt tttgctgtat cctcagaaaa aaagtttgtt 721 ataaagcatt cacacccata aaaagataga tttaaatatt ccagctatag gaaagaaagt 781 gcgtctgctc ttcactctag tctcagttgg ctccttcaca tgcatgcttc tttatttctc 841 ctattttgtc aagaaaataa taggtcacgt cttgttctca cttatgtcct gcctagcatg 901 gctcagatgc acgttgtaga tacaagaagg atcaaatgaa acagacttct ggtctgttac 961 tacaaccata gtaataagca cactaactaa taattgctaa ttatgttttc catctctaag 1021 gttcccacat ttttctgttt tcttaaagat cccattatct ggttgtaact gaagctcaat 1081 ggaacatgag caatatttcc cagtcttctc tcccatccaa cagtcctgat ggattagcag 1141 aacaggcaga aaacacattg ttacccagaa ttaaaaacta atatttgctc tccattcaat 1201 ccaaaatgga cctattgaaa ctaaaatcta acccaatccc attaaatgat ttctatggcg 1261 tcaaaggtca aacttctgaa gggaacctgt gggtgggtca caattcaggc tatatattcc 1321 ccagggctca gccagtgtct gtacatacag ctagaaagct gtattgcctt tagcagtcaa 1381 gctcgaaagg taagcaactc tctggaatta ccttctctct atattagctc ttacttgcac 1441 ctaaacttta aaaaattaac aattattgtg ctatgtgttg tatctttaag ggtgaagtac 1501 ctgcgtgata ccccctataa aaacttctca cctgtgtatg cattctgcac tattttatta 1561 tgtgtaaaag ctttgtgttt gttttcagga ggcttattct ttgtgcttaa aatatgtttt 1621 taatttcaga acatcttatc ctgtcgttca ctatctgata tgctttgcag tttgcttgat 1681 taacttctag ccctacagag tgcacagaga gcaaaatcat ggtgttcagt gaattctggg 1741 gagttatttt aatgtgaaaa ttctctagaa gtttaattcc tgcaaagtgc agctgctgat 1801 cactacacaa gataaaaatg tggggggtgc ataaacgtat attcttacaa taatagatac 1861 atgtgaactt atatacagaa aagaaaatga gaaaaatgtg tgtgtgtata ctcacacacg 1921 tggtcagtaa aaacttttga ggggtttaat acagaaaatc caatcctgag gccccagcac 1981 tcagtacgca tataaagggc tgggctctga aggacttctg actttcacag attatataaa 2041 tctcaggaaa gcaactagat tcatgctggc tccaaaagct gtgctttata taagcacact 2101 ggctatacaa tagttgtaca gttcagctct ttataataga aacagacaga acaagtataa 2161 atcttctatt ggtctatgtc atgaacaaga attcattcag tggctctgtt ttatagtaaa 2221 cattgctatt ttatcatgtc tgcatttctc ttctgtctga atgtcaccac taaaatttaa 2281 ctccacagaa agtttatact acagtacaca tgcatatctt tgagcaaagc aaaccatacc 2341 tgaaagtgca atagagcaga atatgaatta catgcgtgtc tttctcctag actacatgac 2401 cccatataaa ttacattact tatctattct gccatcacca aaacaaaggt aaaaatactt 2461 ttgaagatct actcatagca agtagtgtgc aacaaacaga tatttctcta catttatttt 2521 tagggaataa aaataagaaa taaaatagtc agcaagcctc tgctttctca tatatctgtc 2581 caaacctaaa gtttactgaa atttgctctt tgaatttcca gttttgcaag cctatcagat 2641 tgtgttttaa tcagaggtac tgaaaagtat caatgaattc tagctttcac tgaacaaaaa 2701 tatgtagagg caactggctt ctgggacagt ttgctaccca aaagacaact gaatgcaaat 2761 acataaatag atttatgaat atggttttga acatgcacat gagaggtgga tatagcaaca 2821 gacacattac cacagaatta ctttaaaact acttgttaac atttaattgc ctaaaaactg 2881 ctcgtaattt actgttgtag cctaccatag agtaccctgc atggtactat gtacagcatt 2941 ccatccttac attttcactg ttctgctgtt tgctctagac aactcagagt tcaccatggg 3001 ctccatcggt gcagcaagca tggaattttg ttttgatgta ttcaaggagc tcaaagtcca 3061 ccatgccaat gagaacatct tctactgccc cattgccatc atgtcagctc tagccatggt 3121 atacctgggt gcaaaagaca gcaccaggac acaaataaat aaggtgagcc tacagttaaa 3181 gattaaaacc tttgccctgc tcaatggagc cacagcactt aattgtatga taatgtccct 3241 tggaaactgc atagctcaga ggctgaaaat ctgaaaccag agttatctaa aagtgtggcc 3301 acctccaact cccagagtgt tacccaaatg cactagctag aaatcttgaa actggattgc 3361 ataacttctt tttgtcataa ccattatttc agctactatt attttcaatt acaggttgtt 3421 cgctttgata aacttccagg attcggagac agtattgaag ctcaggtaca gaaataattt 3481 cacctccttc tctatgtccc tttcctctgg aagcaaaata cagcagatga agcaatctct 3541 tagctgttcc aagccctctc tgatgagcag ctagtgctct gcatccagca gttgggagaa 3601 cactgttcat aagaacagag aaaaagaagg aagtaacagg ggattcagaa caaacagaag 3661 ataaaactca ggacaaaaat accgtgtgaa tgaggaaact tgtggatatt tgtacgctta 3721 agcaagacag ctagatgatt ctggataaat gggtctggtt ggaaaagaag gaaagcctgg 3781 ctgatctgct ggagctagat tattgcagca ggtaggcagg agttccctag agaaaagtat 3841 gagggaatta cagaagaaaa acagcacaaa attgtaaata ttggaaaagg accacatcag 3901 tgtagttact agcagtaaga cagacaggat gaaaaatagt tttgtaaaca gaagtatcta 3961 actactttac tctgttcata cactacgtaa aacttactaa gtaataaaac tagaataaca 4021 acatctttct ttctctttgt attcagtgtg gcacatctgt aaacgttcac tcttcactta 4081 gagacatcct caaccaaatc accaaaccaa atgatgttta ttcgttcagc cttgccagta 4141 gactttatgc tgaagagaga tacccaatcc tgccagtaag ttgctctaaa atctgatctg 4201 agtgtattcc atgccaaagc tctaccattc tgtaatgcaa aaacagtcag agttccacat 4261 gtttcactaa gaaaatttct ttttctcttg tttttacaaa tgaaagagag gacaaataac 4321 atttctctat caccgacctg aaactctaca gtcttcagag aatgaatggc ttgctaaaag 4381 aatgtcaaat cttactatac agctatttca tattacacta ctaaatacac tataaggcat 4441 agcatgtagt aatacagtgt aaaatagctt tttacactac tatattatta atatctgtta 4501 attccagtct tgcatttcac atttgcaaaa cgttttgaaa ttcgtatctg aaagctgaat 4561 actcttgctt tacaggaata cttgcagtgt gtgaaggaac tgtatagagg aggcttggaa 4621 cctatcaact ttcaaacagc tgcagatcaa gccagagagc tcatcaattc ctgggtagaa 4681 agtcagacaa atggtaaggt agaacatgct ttgtacatag tgagagttgg ttcaccctaa 4741 tactgagaac ttggatatag ctcagccagc gtgctttgcg ttcaagctta ccagagctgt 4801 tgtatgcctg ttaagcaggg catacagtca tgaggctctt gaaaaatctt aacagacaaa 4861 gggcaatgga aaatcggagt taagggatgg tagggataaa atgcatagaa agaggtacca 4921 caattttgat ttttgcccta atgcctctct gcgtggttcc tcaatttttc tacttcattc 4981 ctcatctcct cagagcattc ctttccctca tgcttgaaac acagatgaaa gactgtgaat 5041 tctaactgag atgaaaacat ccacaaccac acaacctctg gtgtggagtc acattctgtg 5101 aaggcaaaaa ctaggccacg taatctatgc gtgcaagcta cgcgtaagct atgtgtgtga 5161 caggacaatg tgaggaacat actatgtgca caaggactgc agaataaaca ggagcaaagt 5221 ttttgaagaa aacagagtaa aatcctgttt tcctcttttg ttacattctt tacatatatc 5281 tcaaatttcc tctttggtta gaagcaagta atatttatgt ttcttggtac tgtttgggtt 5341 gaagaccatt ctgggataag agaaattcca gtggttcttc ccctaatcat aaaatgtcag 5401 gtttagtttt tttgtaacac agaaatctct tcatctttta tcttttgttg tgattcttga 5461 tagagagaga aacaagactt actgacaata gcagcaagaa aatcaatctt ggaagaacaa 5521 gattgcaatt gcaaaaacaa accaatgtcc ttgcccctac atcctcttcc ccataaattc 5581 tacattctct atctaccttg tgcttgccaa catgatatac gtaaactctc ttttcctatt 5641 cattcttaaa ggaattatca gaaatgtcct tcagccaagc tccgtggatt ctcaaactgc 5701 aatggttctg gttaatgcca ttgtcttcaa aggactgtgg gagaaagcat ttaaggatga 5761 agacacacaa gcaatgcctt tcagagtgac tgaggtatat gggcatacct tagagatgta 5821 atctagaatt tatgaagaga gtagacatgt tgttatatga acactgcatt agcgtatctg 5881 ctcatttgtc tgcatctctt tcagacactg tgttaaaagc agggaatttt ccttatgtct 5941 ctctcgtcac aatattcctg acattgcaaa gctcctgaga aataacttca gattccactt 6001 ttcctaggaa ggcttctgga tgagaactaa tcatcttaac tgtaactaga catttctgca 6061 tccaagaata atctttgtta aaactatatt ctctctctct tttttttttt tttttggttc 6121 tccagcaaga aagcaaacct gtgcagatga tgtaccagat tggtttattt agagtggcat 6181 caatggcttc tgagaaaatg aagatcctgg agcttccatt tgccagtggg acaatgagca 6241 tgttggtgct gttgcctgat gaagtctcag gccttgagca ggtatggccc tagaagttgg 6301 cttcagaata ttaaaaacac atggaaattt agctgttgta aagctctttt caacacagtt 6361 atcctaaaac atttaaccag cacaaatttc atcatgattc aatatgtgat tgttgcatag 6421 aagtgtagat ttgtcccact gggtcctgca atagcccatg ctgagcatgg cttgctgaaa 6481 gaactgcttt agagggtgaa aagtttgaca cagcagacaa gatgattctc acctaagcag 6541 ctgttactgt agtggcttga actctaaagg tcttgtatct ccattcctgt gcactgagga 6601 gcttcttgga aagttcatat aaggtttact agttctaact attatctcat ttggtggcac 6661 tcaatgtgct ttgttcacgt cttcataaat taatctatct aaaaattgga tgtggttaaa 6721 gcaatttcag aaataacatg tacataatgt acaattattg atatgaacag aacacaggca 6781 tagcatattg taattaggag gactgtagtt attttgaata ggaaacacaa tgtaataaat 6841 gagaattcat tgaaatgtta gtatgctaac tcaatctaaa ttataaagat aaagaggcat 6901 ttaatcacag ctagatttcc atcacttgtg acagacaggc atatgaatga ttatgtacag 6961 ctctaggaaa aaaagtatgt aggaaaacta gtacattttg attagaaagt ctgaaaatga 7021 ggtgccttga tcaaagagaa tacgtgtgtt tgagaaaaaa aaagtttgga tagaggtggt 7081 aagagagaat atattgaaat ggtgtttcta caaactgcca tggccagatt tgtgtaagag 7141 acattcagta agtaggcaag gaaagaaata ttactaggta caaagcaaca tcagtaatac 7201 caaaagaaac caattattcc agatgccaat ctcgtaatag ggttaagaga tttccacccc 7261 tctagtggtc accagtgcaa ccagtaactt tgctaattta cattttcttt ttttaaatgg 7321 cagatatagc tttgaactga gtgatcatga actggtactg tgtaatagat gaagacatac 7381 ttgacgacta aacttctgat ttttaaaaac tcaaattctc ttgaaagatc agttcccagt 7441 ctagtaacag ctgatagttt aagtatcagt aattggctac cattaacaac tggctcctga 7501 gaggtcttaa atgtagagac agctttaaac tcaaaagcac agagtgattt ttagaataga 7561 tttcccaagc aaagaaaata aacagggagg agctttaagg gagtagccat ctcattatta 7621 ttattattta aagaaatggc agcaagccta caaaagaaaa ataagacaga gcagagaaga 7681 aagagtcatg gtatgctttt ctatcttagc aaaattaatc tctacatgcc taggaaaaag 7741 ccatgacaag agcaatcagt tcaaaaggtg tatgcaaaaa accacataat agtaactagt 7801 actgcattgc caggaaggaa gttatgtcgc cattccatgg atctcattct catttccttg 7861 cagcttgaga gtataatcaa ctttgaaaaa ctgactgaat ggaccagttc taatgttatg 7921 gaagagagga agatcaaagt gtacttacct cgcatgaaga tggaggaaaa atacaacctc 7981 acatctgtct taatggctat gggcattact gacgtgttta gctcttcagc caatctgtct 8041 ggcatctcct cagcagagag cctgaagata tctcaagctg tccatgcagc acatgcagaa 8101 atcaatgaag caggcagaga ggtggtaggg tcagcagagg ctggagtgga tgctgcaagc 8161 gtctctgaag aatttagggc tgaccatcca ttcctcttct gtatcaagca catcgcaacc 8221 aacgccgttc tcttctttgg cagatgtgtt tccccttaaa aagaagaaag ctgaaaaact 8281 ctgtcccttc caacaagacc cagagcactg tagtatcagg ggtaaaatga aaagtatgtt 8341 atctgctgca tccagacttc ataaaagctg gagcttaatc tagaaaaaaa atcagaaaga 8401 aattacactg tgagaacagg tgcaattcac ttttccttta cacagagtaa tactggtaac 8461 tcatggatga aggcttaagg gaatgaaatt ggactcacag tactgagtca tcacactgaa 8521 aaatgcaacc tgatacatca gcagaaggtt tatgggggaa aaatgcagcc ttccaattaa 8581 gccagatatc tgtatgacca agctgctcca gaattagtca ctcaaaatct ctcagattaa 8641 attatcaact gtcaccaacc attcctatgc tgacaaggca attgcttgtt ctctgtgttc 8701 ctgatactac aaggctcttc ctgacttcct aaagatgcat tataaaaatc ttataattca 8761 catttctccc taaactttga ctcaatcatg gtatgttggc aaatatggta tattactatt 8821 caaattgttt tccttgtacc catatgtaat gggtcttgtg aatgtgctct tttgttcctt 8881 taatcataat aaaaacatgt ttaagcaaac acttttcact tgtagtattt gaagtacagc 8941 aaggttgtgt agcagggaaa gaatgacatg cagaggaata agtatggaca cacaggctag 9001 cagcgactgt agaacaagta ctagtgggtg agaagttgaa caagagtccc ctacaagcaa 9061 cttaatctaa taagctagtg gtctacatca gctaaaagag catagtgagg gatgaaattg 9121 gttctccttt ctaagcatca cctgggacaa ctcatctgga gcagtgtgtc caatctgccg 9181 ctgccctgat ctcggctggg gtgatg // LOCUS PMUCEN 150 bp ds-DNA PHG 17-JUL-1990 DEFINITION Bacteriophage Mu wild type DNA fragment with a gyrase cleavage site. ACCESSION M32302 KEYWORDS . SOURCE Bacteriophage Mu (wild type) DNA. ORGANISM Bacteriophage mu Viridae; ds-DNA nonenveloped viruses; Myoviridae. REFERENCE 1 (bases 1 to 150) AUTHORS Pato,M., Howe,M. and Higgins,P. TITLE DNA gyrase binds to a centrally located replication enhancer (CEN) in the bacteriophage Mu genome JOURNAL Unpublished (1990) In Press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by M.L.Pato, 23-FEB-1990. FEATURES from to/span description site 49 50 DNA gyrase cleavage site in complimentary strand mut 44 44 a in wt; g in Mu nuB103 mut 50 50 g in wt; c in Mu nuB1 BASE COUNT 40 a 38 c 27 g 45 t ORIGIN Map position at 18.0 kb. 1 acgcgtcagc gccgctctga ggcaataaac agaatcaggc ataaaatcag ccgcacagat 61 tttttaaaac gcgccacggg atttttaaac cggtatttaa cggtgtatga atcccgtttt 121 atcttccttt cactttcttt ctccagtact // LOCUS RATRNRTR 2577 bp ds-DNA ROD 17-JUL-1990 DEFINITION Rat snRNP-associated polypeptide N, complete cds. ACCESSION J05497 KEYWORDS snRNP-associated polypeptide N. SOURCE Rat male adult (Fisher) DNA, clones rgV and rgIII2. ORGANISM Rattus rattus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 2577) AUTHORS Schmauss,C. and Lerner,M.R. TITLE The closely related small nuclear ribonucleoprotein polypeptides N and B/B' are distinguishable by antibodies as well as by differences in their mRNAs and gene structures JOURNAL J. Biol. Chem. 265, 10733-10739 (1990) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by C.Schmauss, 13-APR-1990. FEATURES from to/span description pept 1314 2036 snRNP-associated polypeptide N signal 774 779 TATA box signal 698 702 CAAT box BASE COUNT 822 a 537 c 581 g 637 t ORIGIN 1 taactagaga actgagaaca gaatccctgt tagaggaatt agagaccaaa ttaaaagagg 61 tgaaggggct tgcaacccca ttagaacaac aatgccaacc aaccagagct cccagggact 121 aaaccactac ccaaagacta cacatggctc caactgcata tgtagcagag gatggcctag 181 ttgggcacaa tggaaagaga agcccttgga cctgcccagg ctggacccac cagtgtagag 241 taatgtctga ggggtagaag caggtggttg ggaatgggaa tacccttata tgtgaaggga 301 agcaggatga ggtagggaac ttatgttcgt aaaccaggaa agggaataac gtttgaaatg 361 taaataaaaa tatatccagt gaaaaaaaaa ctgaagtcta taataaaagc ttttaatcct 421 ctcagccctt aataaaagtt aattatatta cttatgttaa aaaaaacata aaacagcatg 481 gtattgtact tttttttttc agacaaaagg tctatggcac acagccaaat cagtgacctc 541 tggggcacaa tttccagaaa tcaacatcct agagttgacc tctggcttcc atgtatacgt 601 gcatgcacac acacatatgc atatacatac aaaattacat atatgcagtt gtctaaatca 661 tatgaagaat ttcaagttgt tttaagttta atatcagcaa atacatgcaa atgtgattat 721 aagaagctgg atggaatcct gagttgttga ctaaagagct aagaaggggc aattataaaa 781 caaaaatgac acatgaaatt ccacccgagg ttagaaataa ttaaagaagg ccattgcggc 841 aagtctagca cagagagtag agggtgctgg aggatgacag acggttggtt ctgaggaggg 901 attttgcaac gaatggagcg aggaagggat cgtttacact tgagaagaac tactgaacag 961 cacgtcccag agattgaggt ccaggtcaaa cgtagaagga cagcctcact gagcaaccaa 1021 gagtgtcact tgtacccacg gcattctcag caacagcaaa ttcctgtggt ggatttccag 1081 gcagaactga gacaggcgtt cttagctgag acaccaagag gtggttaaag cagtattgga 1141 acttcaaggt ggtggaagtc aacaaacaca ggacctatcc actgattgtg aaactttggt 1201 caagcttaca ctgtgttaat aaccctgcat caaaccttta tttattgccc ttccccaagt 1261 tttaaggatc ttgtaatttt agtgttgaca actgctattg tggaacagca atcatgactg 1321 tgggtaagag tagcaagatg ctgcagcata ttgactatag aatgagatgt atcctgcaag 1381 atggaagatt cttcattggc acctttaagg cttttgacaa gcatatgaat ttgatcctct 1441 gtgattgtga tgagttcagg aagatcaagc caaagaatgc aaaacagcca gaacgtgaag 1501 aaaaacgggt tttgggtctg gtcttgctac gtggagagaa cttggtttcc atgacagtgg 1561 agggtccacc tcctaaagat actggcattg ctcgtgtgcc acttgctagt gctgcaggtg 1621 gccctggtgt tggaagagca gctggcagag gagtaccagc aggtgtacct attccccaag 1681 ctcctgctgg attagcaggc cctgtccgag gagtgggagg cccatcccag caggtcatga 1741 ccccacaggg aagaggcact gttgcagctg ctgctgttgc tgctactgct agcattgcag 1801 gagccccaac ccagtacccg ccaggacggg gaactccacc tccacctgta ggcagagcaa 1861 ccccacctcc aggcattatg gctcctccac ctggaatgag accacccatg ggcccaccaa 1921 ttggacttcc ccctgctcaa gggagaccta taggcatgcc ccctccagga atgagactcc 1981 ctcctccagg aattagaggc ccacctcccc caggaatgcg tccaccaaga ccctaagata 2041 cagttgataa atctcagccc ttctctttcc ctacaatgct tcttgtgaaa ttgtgtcgcc 2101 tgcaagcttt tgacccctct tactgcatta actatagata ataaatacat agcgcaattg 2161 aattgaaaaa aaaagaaata attaaagaaa gtaagtcaca atgactattt gctattgaca 2221 ttttttttaa atgcccgaat gagagccagt ggagacgata gaaagtccag aagaagctaa 2281 gataatttca aaacacataa tgtcagtaga acgagggaag gtaagaaccc acagaacaca 2341 agaaaccact catgaaactc ctcacacaca ggaagaaaag gaagaatgta atttttaaaa 2401 aaaaagttat agtcaagtta aactatattt tctcattggt ttttttttgt gactttgtat 2461 ttatttttat gtttctttgt gtatattgta catgtctcag tcaaaggcca acggtgagtg 2521 ttttcctcta aaaaacctta ttgtttaaga cagggtctct tcctgagctc agaattc // LOCUS WUCSSP 1323 bp ds-DNA INV 17-JUL-1990 DEFINITION W.bancrofti species specific DNA fragment. ACCESSION M27140 KEYWORDS . SOURCE W.bancrofti DNA, clone IWb35. ORGANISM Wuchereria bancrofti Eukaryota; Animalia; Metazoa; Nemata; Secernentea; Spiruria; Spirurida; Spirurina; Filarioidea; Filariidae. REFERENCE 1 (bases 1 to 1323) AUTHORS Dissanayake,S. and Piessens,W.F. TITLE Cloning and characterization of a Wuchereria bancrofti-specific DNA sequence JOURNAL Mol. Biochem. Parasitol. 39, 147-150 (1990) STANDARD simple staff_entry BASE COUNT 399 a 204 c 251 g 469 t ORIGIN 1288 bp upstream of SacI site. 1 gatctctgtt tcattatacc gagtaaatat tggagaaaag aaaaatttgt tcaacgtgtt 61 aaagattaac ttgctttcta tataatggaa acattttgca tattggatta gtcagtaaat 121 taataatgga caattgtgat aagtaaaact aaaaagacat cgtcactctc ttccttatta 181 tagcatttcc ttgcttaaaa ccacttgcga cgtcactttt tgttataaat catatggtga 241 atacttttcc tcatttaaga tcgtttatta gcttttgcat tacaaattgt tcattttagt 301 tgtgaacgca ttttgtacat ttaaatgctt gctttagaat tttaggtttc aactggtacg 361 tttatgccgt ttatatgaaa ttatgggata acaaagaaaa ataaagataa agaagtaaaa 421 attcgaatga ttaaatgaat tattagtacc ctgattgcta tagccctttt ctacgttttg 481 gcaagaagtc ccaaattggt tctcactttt cagaatgaaa atttttagtt gtttatagcg 541 ccaaaagaaa tgattaacag cagtttggct ttgtggacgg aatgatatgc ttttctgcat 601 acctttcata aattggaaaa aacaaaataa tttggctaag agtgaatgga gtattcgttc 661 gtttgtgata ttttcaatgt ttgttgatgt atattcgaag cgtctctgct cactactgtc 721 aaaccctttt taagaacgtt gcttctacgg tcactgggca gctactacgt attgagtgag 781 cgatatgaaa agaatataca gtatctaatg actgccaatg tcaaataaat ttttgtatcg 841 tcactcagcg gtcacaaatg tttcataaat atttcacatg cattctattt taggttcaaa 901 tatgctttta aaattctgct aaatttgcaa actaacgaga ttttgtttgg cagctcttct 961 tatgataacg cagttcaatc ctggtggtga agaatttgcc acagtcttcg cattttggat 1021 gaggttcatg cgtttgtttg tgtttgtgaa atgttgattt atggtcgaat gtccgcccac 1081 aaccgggtac tttgcattca tagatgaaag gctgaccgtg tgtttcctat gtgttatata 1141 ttcgttgtaa ttgttcgtga tcaataggaa acaactggca ggatggcaga ttttaataca 1201 accatatcaa taattatatt aaatgtaaat gttctagctg ggtagagtgg cgtgcatctg 1261 tagtctcggc cacttggaag actgagctca ggaagattac ttgcacccag gagcttgagg 1321 agc // LOCUS YSCHXT2 2890 bp ds-DNA PLN 17-JUL-1990 DEFINITION S.cerevisiae high affinity hexose transporter-2 (HXT2) gene, complete cds. ACCESSION M33270 KEYWORDS high affinity hexose transporter-2. SOURCE S.cerevisiae (isogenic strain to S288C) DNA. ORGANISM Saccharomyces cerevisiae Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes; Endomycetales; Saccharomycetaceae. REFERENCE 1 (bases 1 to 2890) AUTHORS Kruckeberg,A.L. and Bisson,L.F. TITLE The HXT2 gene of Saccharomyces cerevisiae is required for high affinity glucose transport JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by A.L.Kruckeberg, 26-MAR-1990. FEATURES from to/span description pept 818 2443 high affinity hexose transporter-2 BASE COUNT 808 a 571 c 535 g 976 t ORIGIN 1 aaaaagaaat attattcatt actatcaaga taccgtagaa aagaaaaaga accggggatg 61 aataataaca aaacgggctg ctttttcttt ttctctttct ttttcatttg gtccctctcc 121 actctttctc cacgtggctt tgcttcccgt atttttcttc gtcagagaga ctacatgata 181 gtccaaagaa aagaaacagg ggggacgaag aagaggagag gaaaaaccaa aatataattt 241 tccgtgaaat agattctttt tctccactgc acgacttctt ctcctcccac aaaaaatgac 301 gcctcataga cagccccgca gcttcacttt taagtttctt tttctcctca cggcgcaacc 361 gctaacttaa gctaatcctt atgaatccgg agaaaagcgg ggtcttttaa ctcaataaaa 421 ttttccgaaa tcctttttcc tacgcgtttt cttcgggaac tagataggtg gctcttccac 481 ctgtttttcc atcattttag tttttcgcaa gccatgcgtg ccttttcgtt tttgcgatgg 541 cgaacgaggg ctggaaaaat taacggtacg ccgcctaacg atagtaatag gccacgcaac 601 tggcgtggac gacaacaata agtcgcccat tttttatgtt ttcaaaacct agcaaccccc 661 accaaacttg tcatcgttcc cggattcaca aatgatataa aaagcgatta caattctaca 721 ttctaaccag atttgagatt tcctctttct caattcctct tatattagat tataagaaca 781 acaaattaaa ttacaaaaag acttataaag caacataatg tctgaattcg ctactagccg 841 cgttgaaagt ggctctcaac aaacttctat ccactctact ccgatagtgc agaaattaga 901 gacggatgaa tctcctattc aaaccaaatc tgaatacact aacgctgaac tcccagcaaa 961 gccaatcgcc gcatattgga ctgttatctg tttatgtcta atgattgcat ttggtgggtt 1021 tgtctttggt tgggatactg gtaccatctc tggttttgtt aatcaaaccg atttcaaaag 1081 aagatttggt caaatgaaat ctgatggtac ctattatctt tcggacgtcc ggactggttt 1141 gatcgttggt atcttcaata ttggttgtgc ctttggtggg ttaaccttag gacgtctggg 1201 tgatatgtat ggacgtagaa ttggtttgat gtgcgtcgtt ctggtataca tcgttggtat 1261 tgtgattcaa attgcttcta gtgacaaatg gtaccaatat ttcattggta gaattatctc 1321 tggtatgggt gtcggtggta ttgctgtcct atctccaact ttgatttccg aaacagcacc 1381 aaaacacatt agaggtacct gtgtttcttt ctatcagtta atgatcactc taggtatttt 1441 cttaggttac tgtaccaact atggtactaa agactactcc aattcagttc aatggagagt 1501 gcctttgggt ttgaactttg ccttcgctat tttcatgatc gctggtatgc taatggttcc 1561 agaatctcca agattcttag tcgaaaaagg cagatacgaa gacgctaaac gttctttggc 1621 aaaatctaac aaagtcacca ttgaagatcc aagtattgtt gctgaaatgg atacaattat 1681 ggccaacgtt gaaactgaaa gattagccgg taacgcttct tggggtgagt tattctccaa 1741 caaaggtgct attttacctc gtgtgattat gggtattatg attcaatcct tacaacaatt 1801 aactggtaac aattacttct tctattatgg tactactatt ttcaacgccg tcggtatgaa 1861 agattctttc caaacttcca tcgttttagg tatagtcaac ttcgcatcca ctttcgtggc 1921 cttatacact gttgataaat ttggtcgtcg taagtgtcta ttgggtggtt ctgcttccat 1981 ggccatttgt tttgttatct tctctactgt cggtgtcaca agcttatatc caaatggtaa 2041 agatcaacca tcttccaagg ctgccggtaa cgtcatgatt gtctttacct gtttattcat 2101 tttcttcttc gctattagtt gggccccaat tgcctacgtt attgttgccg aatcctatcc 2161 tttgcgtgtc aaaaatcgtg ctatggctat tgctgttggt gccaactgga tttggggttt 2221 cttgattggt ttcttcactc ccttcattac aagtgcaatt ggattttcat acgggtatgt 2281 cttcatgggc tgtttggtat tttcattctt ctacgtgttt ttctttgtct gtgaaaccaa 2341 gggcttaaca ttagaggaag ttaatgaaat gtatgttgaa ggtgtcaaac catggaaatc 2401 tggtagctgg atctcaaaag aaaaaagagt ttccgaggaa taagagatta tacttaaact 2461 agcactgatt tttttaaggc taatggctac taatacttta atagatgatc ttcatacttt 2521 tttatttaac gatttttaat gatgttttta tttgtaccac tcatttatct agattttttt 2581 aatactgatc aaatcttacg gactcgacgt taaaaagttc ctacatacgt ctggtacttg 2641 aaacgctgct tcgaggtatt gacactataa gaatacgatc caaatactta caccgcatgt 2701 aaaaatatgc cgacaatatg aatacttgtt gatgaatgat atttgatttt aatccggcaa 2761 tttacctcct ttatataatc caataattgt tgataattag tggttaggtt gcagtactaa 2821 taagaattaa gacaaatatt cttctactat ataaaaggtg caaacaaaac acacgccgat 2881 cggccatact // LOCUS CHKARCASR 2392 bp ss-mRNA VRT 17-JUL-1990 DEFINITION Chicken 5-aminoimidazole ribonucleotide (AIR) carboxylase-5- aminoimidazole-4-N-succinocarboxamide ribonucleotide (SAICAR) synthetase, complete cds. ACCESSION M31764 KEYWORDS 5-aminoimidazole ribonucleotide carboxylase; 5-aminoimidazole-4-N-succinocarboxamide ribonucleotide synthetase. SOURCE Chicken liver, cDNA to mRNA, clone pZD1. ORGANISM Gallus gallus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Aves; Neornithes; Neognathae; Galliformes; Phasianidae. REFERENCE 1 (bases 1 to 2392) AUTHORS Chen,Z., Dixon,J.E. and Zalkin,H. TITLE Cloning of a chicken liver cDNA encoding 5-aminoimidazole ribonucleotide carboxylase and 5-aminoimidazole-4-n-succinocarboxamide ribonucleotide synthetase by functional complementation of Escherichia coli pur mutants JOURNAL Proc. Natl. Acad. Sci. U.S.A. 87, 3097-3101 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by H.Zalkin, 01-FEB-1990. FEATURES from to/span description pept 59 1339 5-aminoimidazole ribonucleotide (AIR) carboxylase-5- aminoimidazole-4-N-succinocarboxamide ribonucleotide (SAICAR) synthetase matp 59 835 SAICAR synthetase domain matp 836 1336 AIR carboxylase domain site 2341 2347 poly-A signal BASE COUNT 696 a 466 c 527 g 702 t 1 others ORIGIN 1 ggcgctcagt ttctttcctg tgccttaggt gcctgtcgcc gccgccgctg tcgccgccat 61 ggcccccgcc gcctcagaac tgaaacttgg taaaaaagtt aatgagggta aaacgaaaga 121 agtgtatgag ctgccggata tcccgggatg cgttctgatg cagtccaagg accaaataac 181 agcggggaat gccgctagga aggaccggat ggaggggaag gctgccatct ccaacaccac 241 caccagctgc gtgttccagc tgctgcagga agcaggaatc aaaacggctt ttgtcaggaa 301 acagagtgac acagctttca tagcagctca ctgtgaaatg atcccaattg aatgggtctg 361 cagaagaatt gctactggct ctttcctcaa aagaaaccct ggtgtcaaag aaggctataa 421 gttttaccca cctaaaattg agatgtttta caaggatgat gccaataatg atccacagtg 481 gtctgaggag cagctaattg aagcaaaatt ctcttttgct ggacttacta ttggcaagac 541 tgaagtggat attatggctc gttctactca agctattttt gagatcctgg aaaaatcatg 601 gcagccccaa aactgcactc tggtggacct gaagattgaa tttggtgtta atattttgac 661 caaagaaatt gttcttgctg atgttattga taatgattca tggagactgt ggccatcggg 721 agacagaagc cagcagaagg acaaacagtc ctatcgagat ctgaaggaag tgactcctga 781 agcattgcag atggttaaga gaaactttga atgggttgca gaaagagtag agttgcttct 841 gaaaacaaag agccaaggta gagttgtggt attgatggga tctacttctg accttggcca 901 ctgtgagaaa ataaagaagg catgtgcaac ctttggaatt ccttgtgagt taagagtaac 961 ctccgctcac aaagggccag atgaaactct gaggatcaaa gcagaatatg aaggagatgg 1021 aatcccgact gtgtttgttg cagtagctgg cagaagcaat ggtttagggc cagtaatgtc 1081 tggtaacact gcttaccctg ttgtcaactg tcctcccctc tcatctgact ggggtgctca 1141 ggatgtgtgg tcctctctca gactgcccag tggtcttggc tgtcctacta ctctgtcacc 1201 tgaaggagct gctcagtttg ctgcccagat ttttggttta aacaaccact tggtatgggc 1261 caaactgcga tcaaacatgt taaatacatg gatctctttg aagcaggctg acaaaaaatt 1321 gcgggagtgc accttgtaag tccaaccaac aagtaactcc tcctcgctac acaaagatag 1381 tggcgtgcat atgcatttgt atcaggattt gcagttggat gatgtgagac atcttttctt 1441 acagaaggaa gcatcttcag agcagcagct gagtctgcta tgtcctcatt ccttttcttg 1501 tgtatgtttt tggttttttt agtatagcca cttagtaaaa gatgagaagg taatatttta 1561 gttatccttc tgttgatcca aaatagtgct gttctgctct ttagtgtata ctgttaacta 1621 ctgatgatgg ttatgtcaac acaatgattg atttggtgtg atttaaatag aaaccccctt 1681 ttttttcccc ctgtaacagt aacattcttc catagtaata atctatgtcc tgaagcttat 1741 taactcaatg ggactgctaa ctaaaactag tgctgatgta gtgctgtaat aatggtatgc 1801 tttcttttag caataaaatg gcaacatatg ctgtattaat aagaaaatag gctaagaatg 1861 agtaaaagcc tgaaagaaat aactgcctta tttaaaagaa aaacatatat tttttcatgt 1921 gtatgtgtgt atgtatatac accccatcag taactttacc acatcatttg agtgagttaa 1981 cagttctctt gtttgtgaga agctctaatt ttgaatggaa ccataactag gcttgttgcc 2041 tacttgctgt ataactcttg actggtccag tcttttgtcc ttcaaatagt agacagatct 2101 actgtgtatg tatgtgatgc tgtcaaatta tccttataca catcaatgta tctatattcc 2161 ctcactctgt ttctatggga attaaactga actgaaggtg ggaataaaca gtaaagcaac 2221 agaatacacg ggcaagccta agcatgactg tcttgntctt tgcacacaac attctttaaa 2281 gtaggtactg attccaagaa atatgcaatt actcaatgta tgtacaattt atgcttttat 2341 aattaaataa actatttaat gcaaggaatt cgatatcaag cttatcgata cc // LOCUS ECORAFRO 1289 bp ds-DNA BCT 17-JUL-1990 DEFINITION E.coli raffinose repressor (rafR) gene, complete cds, and alpha-galactosidase (rafA) gene, 5' end. ACCESSION M29849 KEYWORDS alpha-galactosidase; raffinose repressor. SOURCE E.coli (strain K12) DNA. ORGANISM Escherichia coli Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Facultatively anaerobic rods; Enterobacteriaceae. REFERENCE 1 (bases 1 to 1289) AUTHORS Aslanidis,C. and Schmitt,R. TITLE Regulatory elements of the raffinose operon: Nucleotide sequences of operator and repressor genes JOURNAL J. Bacteriol. 172, 2178-2180 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by R.Schmitt, 13-NOV-1989. FEATURES from to/span description pept 166 1176 raffinose repressor pept 1270 > 1289 alpha-galactosidase binding 156 161 ribosome binding site binding 1258 1263 ribosome binding site signal 120 126 -35 region signal 144 149 -10 region signal 1212 1216 -35 region signal 1235 1240 -10 region site 1196 1213 raf operator site 1217 1234 raf operator BASE COUNT 317 a 283 c 326 g 363 t ORIGIN 1 aagcttatta tgcttccatc ggaaacaatg atctggcagc ctgagttcac agataaaata 61 tctctccagg aaacccgggg ctgttcatca tgcaagtctg tcgattactg gctttggtga 121 cggaattttc tggatttccg gcttagaacc acagcaggag ataatatgtc acttaaagcg 181 attgccacga cactcggtat ttctgtcacc actgtcagtc gggctcttgg aggcttttca 241 gatgtggctg cttctacccg tgagcgcgtg gaagcggaag cacgtcgacg aggttaccgc 301 cctaatacac aggcaagaag actcaaaacc ggtaaaaccg atgctatcgg tctggtttat 361 cctgaaaatg atgtgccgtt taacagcggt gtttttatgg atatggtcag ttgcatcagc 421 agggaacttg cttatcatga tattgactta ctgctgatcg ctgatgatga gcatgcagac 481 tgccacagct atatgcggct tgttgaaagt cgcagaattg atgctcttat cattgcacat 541 actctggatg acgatccccg tatcacacat cttcataaag caggtattcc gtttctggct 601 cttggacggg taccgcaggg cttgccctgt gcgtggtttg actttgataa tcatgccgga 661 acctggcagg caacccagaa gctgattgct ttgggacata agagtattgc gctgttgagc 721 gagaacactt cacattctta tgttattgca agacgtcagg gatggcttga tgcactgcat 781 gagcatggac tgaaagatcc attgttgcgg ctggtttctc ccacgcgacg agcgggctat 841 ctggctgtga tggagttaat gtcattaccg gcgccaccaa cagctattat tactgacaat 901 gacctgagtg gagatggtgc ggctatggcg ctgcagttga gagggcgtct ttcagggaaa 961 gaagctgtat ctctggttgt atatgatggt ttgcctcagg acagcattat tgagctggat 1021 gtggctgctg ttattcagtc aacacgaagt ctcgttggtc gtcagatttc tgacatggtg 1081 tatcagataa tcaatggtgc atcaccagaa tcactgcaga taacctggac accgatattt 1141 taccctggta gcacggttca ttctccttcc ttctgatttt ttatccagat cacacaaccg 1201 aaacgttttg gttgatgttc gaaacgtttc ggatcaacag taagacatac ctgaaagcgg 1261 agatgtctaa tgatttcaaa gtactgcag // LOCUS HSEDILSTD 351 bp ds-DNA VRL 17-JUL-1990 DEFINITION Equine herpesvirus type 1 subfragments from near the left terminus of STD DNA. ACCESSION M34145 KEYWORDS defective interfering particle. SOURCE Equine herpesvirus type 1 (strain Kentucky A) DNA, clone Sma2. ORGANISM Equine herpesvirus type 1 Viridae; ds-DNA enveloped viruses; Herpesviridae; Alphaherpesvirinae. REFERENCE 1 (bases 1 to 351) AUTHORS Yalamanchili,R.R., Raengsakulrach,B., Baumann,R.P. and O'Callaghan,D.J. TITLE Identification of the site of recombination in the generation of the genome of DI particles of Equine herpesvirus type 1 JOURNAL Virology 175, 448-455 (1990) STANDARD simple staff_review BASE COUNT 55 a 117 c 127 g 52 t ORIGIN 1 cgccgctcac gtcacactcc tccccaaaga gggtattcgg ggggtcgctc ggcggctcca 61 ggaaccgctg ctcgtaccgg tcctcggagt tcagtgcccg cagcgtggcg cgcagcggga 121 agctctgctc gggcagcagc tgcaggaggg actgccagca ctcgagtgtg gggcgcgtgt 181 gcgtgtaccc catggcgtaa aaatccagca gcgccctgcg cagggtgcgc gagtcctcgg 241 tcacgtggat cgaggaggtg gatatccccc tgaacaagcg gttgacgtcc cccgcgaggc 301 ggttcacatc ggggcgccac ggcggggcgc agaacgcccc tggcccccgg g // LOCUS HSEDISA 633 bp ds-DNA VRL 17-JUL-1990 DEFINITION Equine herpesvirus type 1 defective interfering particle DNA. ACCESSION M34143 KEYWORDS defective interfering particle. SOURCE Equine herpesvirus type 1 (strain Kentucky A) DNA, clone pHSII. ORGANISM Equine herpesvirus type 1 Viridae; ds-DNA enveloped viruses; Herpesviridae; Alphaherpesvirinae. REFERENCE 1 (bases 1 to 633) AUTHORS Yalamanchili,R.R., Raengsakulrach,B., Baumann,R.P. and O'Callaghan,D.J. TITLE Identification of the site of recombination in the generation of the genome of DI particles of Equine herpesvirus type 1 JOURNAL Virology 175, 448-455 (1990) STANDARD simple staff_entry BASE COUNT 124 a 184 c 202 g 123 t ORIGIN 1 ttgtggaata attaaactgt atttattgat gagtaacaca aaacaagttt cctgggaaac 61 acactccaca gtttttttaa aagatttggt tacagtaaaa gtatttgccg tgcaggtaaa 121 ccggaacgag ggtgtaggcc gatacaaggc tgcaggtatc tgccttgcat cgccgcttgt 181 gcgcgtctat cgcctcgagg gttcccgcca gacaggctcc aggtacgtag tcggctagaa 241 cgcgcccgtc gggtcccagt gcgtccctgg acacagtttc ggcgccgctc acgtcacact 301 cctccccaaa gagggtattc ggggggtcgc tcggcggctc caggaaccgc tgctcgtacc 361 ggtcctcgga gttcagtgcc cgcagcgtgg cgcgcagcgg gaagctctgc tcgggcagca 421 gctgcaggag ggactgccag cactcgagtg tggggcgcgt gtgcgtgtac cccatggcgt 481 aaaaatccag cagcgccctg cgcagggtgc gcgagtcctc ggtcacgtgg atcgaggagg 541 tggatatccc cctgaacaag cggttgacgt cccccgcgag gcggttcaca tcggggcgcc 601 acggcggggc gcagaacgcc cctggccccc ggg // LOCUS HSEDISTD 290 bp ds-DNA VRL 17-JUL-1990 DEFINITION Equine herpesvirus type 1 subfragments from the inverted repeat of STD DNA. ACCESSION M34144 KEYWORDS defective interfering particle. SOURCE Equine herpesvirus type 1 (strain Kentucky A) DNA, clone xbaG.. ORGANISM Equine herpesvirus type 1 Viridae; ds-DNA enveloped viruses; Herpesviridae; Alphaherpesvirinae. REFERENCE 1 (bases 1 to 290) AUTHORS Yalamanchili,R.R., Raengsakulrach,B., Baumann,R.P. and O'Callaghan,D.J. TITLE Identification of the site of recombination in the generation of the genome of DI particles of Equine herpesvirus type 1 JOURNAL Virology 175, 448-455 (1990) STANDARD simple staff_review BASE COUNT 69 a 72 c 77 g 72 t ORIGIN 1 ttgtggaata attaaactgt atttattgat gagtaacaca aaacaagttt cctgggaaac 61 acactccaca gtttttttaa aagatttggt tacagtaaaa gtatttgccg tgcaggtaaa 121 ccggaacgag ggtgtaggcc gatacaaggc tgcaggtatc tgccttgcat cgccgcttgt 181 gcgcgtctat cgcctcgagg gttcccgcca gacaggctcc aggtacgtag tcggctagaa 241 cgcgcccgtc gggtcccagt gcgtccctgg acacagtttc ggcgccgctc // LOCUS HUMAR 1230 bp ss-mRNA PRI 17-JUL-1990 DEFINITION Human amphiregulin (AR) mRNA, complete cds. ACCESSION M30704 KEYWORDS amphiregulin; growth regulator; tumor inhibitory factor. SOURCE Human cell line MCF-7 breast carcinoma, cDNA to mRNA, clones lambda-AR1 and lambda-AR2. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 1230) AUTHORS Plowman,G.D., Green,J.M., McDonald,V.L., Neubauer,M.G., Disteche,C.M., Todaro,G.J. and Shoyab,M. TITLE The amphiregulin gene encodes a novel epidermal growth factor- related protein with tumor-inhibitory activity JOURNAL Mol. Cell. Biol. 10, 1969-1981 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by G.D.Plowman, 18-DEC-1989. FEATURES from to/span description pept 210 968 amphiregulin precursor /hgml_locus_uid="LW0168B" /nomgen="AREG" /map="4q13-q21" sigp 210 266 amphiregulin signal peptide matp 510 962 amphiregulin (alt.) matp 528 962 amphiregulin (alt.) mRNA 1 1230 amphiregulin mRNA BASE COUNT 375 a 273 c 285 g 297 t ORIGIN 1 agacgttcgc acacctgggt gccagcgccc cagaggtccc gggacagccc gaggcgccgc 61 gcccgccgcc ccgagctccc caagccttcg agagcggcgc acactcccgg tctccactcg 121 ctcttccaac acccgctcgt tttgcggcag ctcgtgtccc agagaccgag ttgccccaga 181 gaccgagacg ccgccgctgc gaaggaccaa tgagagcccc gctgctaccg ccggcgccgg 241 tggtgctgtc gctcttgata ctcggctcag gccattatgc tgctggattg gacctcaatg 301 acacctactc tgggaagcgt gaaccatttt ctggggacca cagtgctgat ggatttgagg 361 ttacctcaag aagtgagatg tcttcaggga gtgagatttc ccctgtgagt gaaatgcctt 421 ctagtagtga accgtcctcg ggagccgact atgactactc agaagagtat gataacgaac 481 cacaaatacc tggctatatt gtcgatgatt cagtcagagt tgaacaggta gttaagcccc 541 cccaaaacaa gacggaaagt gaaaatactt cagataaacc caaaagaaag aaaaagggag 601 gcaaaaatgg aaaaaataga agaaacagaa agaagaaaaa tccatgtaat gcagaatttc 661 aaaatttctg cattcacgga gaatgcaaat atatagagca cctggaagca gtaacatgca 721 aatgtcagca agaatatttc ggtgaacggt gtggggaaaa gtccatgaaa actcacagca 781 tgattgacag tagtttatca aaaattgcat tagcagccat agctgccttt atgtctgctg 841 tgatcctcac agctgttgct gttattacag tccagcttag aagacaatac gtcaggaaat 901 atgaaggaga agctgaggaa cgaaagaaac ttcgacaaga gaatggaaat gtacatgcta 961 tagcataact gaagataaaa ttacaggata tcacattgga gtcactgcca agtcatagcc 1021 ataaatgatg agtcggtcct ctttccagtg gatcataaga caatggaccc tttttgttat 1081 gatggtttta aactttcaat tgtcactttt tatgctattt ctgtatataa aggtgcacga 1141 aggtaaaaag tattttttca agttgtaaat aatttattta atatttaatg gaagtgtatt 1201 tattttacag ctcattaaac ttttttaacc // LOCUS HUMAR1 968 bp ds-DNA PRI 17-JUL-1990 DEFINITION Human amphiregulin (AR) gene, exon 1. ACCESSION M30698 KEYWORDS amphiregulin; growth regulator; tumor inhibitory factor. SEGMENT 1 of 6 SOURCE Human breast carcinoma cell line MCF-7 DNA, clones lambda-ARH[6,12]. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 968) AUTHORS Plowman,G.D., Green,J.M., McDonald,V.L., Neubauer,M.G., Disteche,C.M., Todaro,G.J. and Shoyab,M. TITLE The amphiregulin gene encodes a novel epidermal growth factor- related protein with tumor-inhibitory activity JOURNAL Mol. Cell. Biol. 10, 1969-1981 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by G.D.Plowman, 18-DEC-1989. FEATURES from to/span description pept 859 + 919 amphiregulin precursor, exon 1 /hgml_locus_uid="LW0168B" /nomgen="AREG" /map="4q13-q21" sigp 859 915 amphiregulin signal peptide pre-msg 649 > 968 amphiregulin mRNA and introns IVS 920 > 968 amphiregulin intron A BASE COUNT 206 a 280 c 257 g 225 t ORIGIN 1 gaattcatat ccacctggct ttgaacatta tcggctgtga gatggtgtag gtaaaatttt 61 aagtgcataa tttggcaata ataaatcatc aataaatatt aatgttgatg aggcccctgg 121 gccacataaa gaaataggga gtgaggggat ttgaaattct ggccacttca cagaaatggg 181 tgggaagggg ctcttgattg agatagaagc ccatcctaca tgaagcaatt cctcattgag 241 ttctctcgtc ctttatcctt gttggaaaca tcaggcaaag tcactcttgg tcttaaagta 301 cttttacatc taaatacgga actcttctat ttaatccctg tctgttgtag atgttaagta 361 tacaaagagg ttgtcagagt ttgaaacatc tggacttctg tcaggtacta gctccggaac 421 tccagtcctg ctcgccctca aaaacggctt gcagctagag gtttaagttc cacttcctct 481 cagcgaatcc ttacgcacga gggaggcggg gcgtgtgtcc tccgcgcgtg gttttcgggt 541 agcaccttct ggggcgccgc ctgcctccac ccacggccgg gccttgacgt catgggctgc 601 ggccccctcc cggctgagcc tataaagcgg caggtgcgcg ccgccctaca gacgttcgca 661 cacctgggtg ccagcgcccc agaggtcccg ggacagcccg aggcgccgcg cccgccgccc 721 cgagctcccc aagccttcga gagcggcgca cactcccggt ctccactcgc tcttccaaca 781 cccgctcgtt ttgcggcagc tcgtgtccca gagaccgagt tgccccagag accgagacgc 841 cgccgctgcg aaggaccaat gagagccccg ctgctaccgc cggcgccggt ggtgctgtcg 901 ctcttgatac tcggctcagg tgaggattca acggcgctga actgctgggc tctcctccca 961 tggcaggt // LOCUS HUMAR2 348 bp ds-DNA PRI 17-JUL-1990 DEFINITION Human amphiregulin (AR) gene, exon 2. ACCESSION M30699 KEYWORDS amphiregulin; growth regulator; tumor inhibitory factor. SEGMENT 2 of 6 SOURCE Human breast carcinoma cell line MCF-7 DNA, clones lambda-ARH[6,12]. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 348) AUTHORS Plowman,G.D., Green,J.M., McDonald,V.L., Neubauer,M.G., Disteche,C.M., Todaro,G.J. and Shoyab,M. TITLE The amphiregulin gene encodes a novel epidermal growth factor- related protein with tumor-inhibitory activity JOURNAL Mol. Cell. Biol. 10, 1969-1981 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by G.D.Plowman, 18-DEC-1989. FEATURES from to/span description pept + 51 + 299 amphiregulin precursor, exon 2 /hgml_locus_uid="LW0168B" /nomgen="AREG" /map="4q13-q21" matp 290 + 299 amphiregulin pre-msg < 1 > 348 amphiregulin mRNA and introns IVS < 1 50 amphiregulin intron A IVS 300 > 348 amphiregulin intron B BASE COUNT 88 a 75 c 86 g 99 t ORIGIN 1 agcaccctac tttacctttt cgttttcttc ctttattccc tcccctgcag gccattatgc 61 tgctggattg gacctcaatg acacctactc tgggaagcgt gaaccatttt ctggggacca 121 cagtgctgat ggatttgagg ttacctcaag aagtgagatg tcttcaggga gtgagatttc 181 ccctgtgagt gaaatgcctt ctagtagtga accgtcctcg ggagccgact atgactactc 241 agaagagtat gataacgaac cacaaatacc tggctatatt gtcgatgatt cagtcagagg 301 tgagtagggg ataaagcaaa aatatggcct gtgagatgtg ggtttata // LOCUS HUMAR3 302 bp ds-DNA PRI 17-JUL-1990 DEFINITION Human amphiregulin (AR) gene, exon 3. ACCESSION M30700 KEYWORDS amphiregulin; growth regulator; tumor inhibitory factor. SEGMENT 3 of 6 SOURCE Human breast carcinoma cell line MCF-7 DNA, clones lambda-ARH[6,12]. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 302) AUTHORS Plowman,G.D., Green,J.M., McDonald,V.L., Neubauer,M.G., Disteche,C.M., Todaro,G.J. and Shoyab,M. TITLE The amphiregulin gene encodes a novel epidermal growth factor- related protein with tumor-inhibitory activity JOURNAL Mol. Cell. Biol. 10, 1969-1981 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by G.D.Plowman, 18-DEC-1989. FEATURES from to/span description pept + 52 + 252 amphiregulin precursor, exon 3 /hgml_locus_uid="LW0168B" /nomgen="AREG" /map="4q13-q21" matp + 52 + 252 amphiregulin matp 60 + 252 amphiregulin (alt.) pre-msg < 1 > 302 amphiregulin mRNA and introns IVS < 1 51 amphiregulin intron B IVS 253 > 302 amphiregulin intron C BASE COUNT 125 a 46 c 54 g 77 t ORIGIN 1 aattatattc aagtttgaga gactcttgtc aataaatctt ttctttttta gttgaacagg 61 tagttaagcc cccccaaaac aagacggaaa gtgaaaatac ttcagataaa cccaaaagaa 121 agaaaaaggg aggcaaaaat ggaaaaaata gaagaaacag aaagaagaaa aatccatgta 181 atgcagaatt tcaaaatttc tgcattcacg gagaatgcaa atatatagag cacctggaag 241 cagtaacatg cagtaagttt tcctaaagca tatagatttt tgtatttcta gcaccatgtc 301 tg // LOCUS HUMAR4 254 bp ds-DNA PRI 17-JUL-1990 DEFINITION Human amphiregulin (AR) gene, exon 4. ACCESSION M30701 KEYWORDS amphiregulin; growth regulator; tumor inhibitory factor. SEGMENT 4 of 6 SOURCE Human breast carcinoma cell line MCF-7 DNA, clones lambda-ARH[6,12]. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 254) AUTHORS Plowman,G.D., Green,J.M., McDonald,V.L., Neubauer,M.G., Disteche,C.M., Todaro,G.J. and Shoyab,M. TITLE The amphiregulin gene encodes a novel epidermal growth factor- related protein with tumor-inhibitory activity JOURNAL Mol. Cell. Biol. 10, 1969-1981 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by G.D.Plowman, 18-DEC-1989. FEATURES from to/span description pept + 51 + 204 amphiregulin precursor, exon 4 /hgml_locus_uid="LW0168B" /nomgen="AREG" /map="4q13-q21" matp + 51 91 amphiregulin (both alt.) pre-msg < 1 > 254 amphiregulin mRNA and introns IVS < 1 50 amphiregulin intron C IVS 205 > 254 amphiregulin intron D BASE COUNT 81 a 42 c 53 g 78 t ORIGIN 1 cacaccgcac gtgagtgtga ttataatttt taaatgtgaa ttgcttgcag aatgtcagca 61 agaatatttc ggtgaacggt gtggggaaaa gtccatgaaa actcacagca tgattgacag 121 tagtttatca aaaattgcat tagcagccat agctgccttt atgtctgctg tgatcctcac 181 agctgttgct gttattacag tccagtaagt atgacataac ttacaaattc ttaataaaat 241 aatgggaggt taat // LOCUS HUMAR5 217 bp ds-DNA PRI 17-JUL-1990 DEFINITION Human amphiregulin (AR) gene, exon 5. ACCESSION M30702 KEYWORDS amphiregulin; growth regulator; tumor inhibitory factor. SEGMENT 5 of 6 SOURCE Human breast carcinoma cell line MCF-7 DNA, clones lambda-ARH[6,12]. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 217) AUTHORS Plowman,G.D., Green,J.M., McDonald,V.L., Neubauer,M.G., Disteche,C.M., Todaro,G.J. and Shoyab,M. TITLE The amphiregulin gene encodes a novel epidermal growth factor- related protein with tumor-inhibitory activity JOURNAL Mol. Cell. Biol. 10, 1969-1981 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by G.D.Plowman, 18-DEC-1989. FEATURES from to/span description pept + 50 143 amphiregulin precursor, exon 5 /hgml_locus_uid="LW0168B" /nomgen="AREG" /map="4q13-q21" pre-msg < 1 > 217 amphiregulin mRNA and introns IVS < 1 49 amphiregulin intron D BASE COUNT 89 a 26 c 41 g 61 t ORIGIN 1 tatagatgaa tagaaccttg ataacattag aatgccttgt tctctgaagg cttagaagac 61 aatacgtcag gaaatatgaa ggagaagctg aggaacgaaa gaaacttcga caagagaatg 121 gaaatgtaca tgctatagca taactgaaga taaaattaca ggtttgagtt ttaaaatata 181 tctttagatc atatcctata attttgaaaa atttaac // LOCUS HUMAR6 354 bp ds-DNA PRI 17-JUL-1990 DEFINITION Human amphiregulin (AR) gene, 3' flanking region. ACCESSION M30703 KEYWORDS amphiregulin; growth regulator; tumor inhibitory factor. SEGMENT 6 of 6 SOURCE Human breast carcinoma cell line MCF-7 DNA, clones lambda-ARH[6,12]. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 354) AUTHORS Plowman,G.D., Green,J.M., McDonald,V.L., Neubauer,M.G., Disteche,C.M., Todaro,G.J. and Shoyab,M. TITLE The amphiregulin gene encodes a novel epidermal growth factor- related protein with tumor-inhibitory activity JOURNAL Mol. Cell. Biol. 10, 1969-1981 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by G.D.Plowman, 18-DEC-1989. FEATURES from to/span description pre-msg < 1 294 amphiregulon mRNA and intron BASE COUNT 104 a 48 c 52 g 150 t ORIGIN 1 gtaacatttt gttttatttt attattttat tttattttat tttctcacag gatatcacat 61 tggagtcact gccaagtcat agccataaat gatgagtcgg tcctctttcc agtggatcat 121 aagacaatgg accctttttg ttatgatggt tttaaacttt caattgtcac tttttatgct 181 atttctgtat ataaaggtgc acgaaggtaa aaagtatttt ttcaagttgt aaataattta 241 tttaatattt aatggaagtg tatttatttt acagctcatt aaactttttt aaccaaacaa 301 attgagagtt tgaatattag ttctgatatt gcaagactcc agtgtacttt tctc // LOCUS HUMROSSAA 1890 bp ss-mRNA PRI 17-JUL-1990 DEFINITION Human Ro ribonucleoprotein autoantigen (Ro/SS-A), complete cds. ACCESSION M32294 KEYWORDS Ro ribonucleoprotein autoantigen. SOURCE Human T-cell line Jurkat, cDNA to mRNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 1890) AUTHORS McCauliffe,D.P., Lux,F.A., Lieu,T.-S., Sanz,I., Hanke,J., Newkirk,M.M., Bachinski,L.L., Itoh,Y., Siciliano,M.J., Reichlin,M., Sontheimer,R.D. and Capra,J.D. TITLE Molecular cloning, expression and Chromosome 19 localization of a human Ro/SS-A auto antigen JOURNAL J. Clin. Invest. 85, 1379-1391 (1900) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence [1] kindly submitted by D.P.McCauliffe, 23-FEB-1990. FEATURES from to/span description pept 67 1320 Ro ribonucleoprotein autoantigen (Ro/SS-A) precursor sigp 67 117 Ro ribonucleoprotein autoantigen signal peptide matp 118 1317 Ro ribonucleoprotein autoantigen site 1881 1886 poly-A signal (put.) BASE COUNT 464 a 497 c 520 g 409 t ORIGIN Chromosome 19. 1 ccgtactgca gagccgctgc cggagggtcg ttttaaaggg ccgcgttgcc gccccctcgg 61 cccgccatgc tgctatccgt gccgctgctg ctcggcctcc tcggcctggc cgtcgccgag 121 cccgccgtct acttcaagga gcagtttctg gacggagacg ggtggacttc ccgctggatc 181 gaatccaaac acaagtcaga ttttggcaaa ttcgttctca gttccggcaa gttctacggt 241 gacgaggaga aagataaagg tttgcagaca agccaggatg cacgctttta tgctctgtcg 301 gccagtttcg agcctttcag caacaaaggc cagacgctgg tggtgcagtt cacggtgaaa 361 catgagcaga acatcgactg tgggggcggc tatgtgaagc tgtttcctaa tagtttggac 421 cagacagaca tgcacggaga ctcagaatac aacatcatgt ttggtcccga catctgtggc 481 cctggcacca agaaggttca tgtcatcttc aactacaagg gcaagaacgt gctgatcaac 541 aaggacatcc gttgcaagga tgatgagttt acacacctgt acacactgat tgtgcggcca 601 gacaacacct atgaggtgaa gattgacaac agccaggtgg agtccggctc cttggaagac 661 gattgggact tcctgccacc caagaagata aaggatcctg atgcttcaaa accggaagac 721 tgggatgagc gggccaagat cgatgatccc acagactcca agcctgagga ctgggacaag 781 cccgagcata tccctgaccc tgatgctaag aagcccgagg actgggatga agagatggac 841 ggagagtggg aacccccagt gattcagaac cctgagtaca agggtgagtg gaagccccgg 901 cagatcgaca acccagatta caagggcact tggatccacc cagaaattga caaccccgag 961 tattctcccg atcccagtat ctatgcctat gataactttg gcgtgctggg cctggacctc 1021 tggcaggtca agtctggcac catctttgac aacttcctca tcaccaacga tgaggcatac 1081 gctgaggagt ttggcaacga gacgtggggc gtaacaaagg cagcagagaa acaaatgaag 1141 gacaaacagg acgaggagca gaggcttaag gaggaggaag aagacaagaa acgcaaagag 1201 gaggaggagg cagaggacaa ggaggatgat gaggacaaag atgaggatga ggaggatgag 1261 gaggacaagg aggaagatga ggaggaagat gtccccggcc aggccaagga cgagctgtag 1321 agaggcctgc ctccagggct ggactgaggc ctgagcgctc ctgccgcaga gcttgccgcg 1381 ccaaataatg tctctgtgag actcgagaac tttcattttt ttccaggctg gttcggattt 1441 ggggtggatt ttggttttgt tcccctcctc cactctcccc caccccctcc ccgccctttt 1501 tttttttttt tttaaactgg tattttatcc tttgattctc cttcagccct cacccctggt 1561 tctcatcttt cttgatcaac atcttttctt gcctctgtgc cccttctctc atctcttagc 1621 tcccctccaa cctggggggc agtggtgtgg agaagccaca ggcctgagat ttcatctgct 1681 ctccttcctg gagcccagag gagggcagca gaagggggtg gtgtctccaa ccccccagca 1741 ctgaggaaga acggggctct tctcatttca cccctccctt tctcccctgc ccccaggact 1801 gggccacttc tgggtggggc agtgggtccc agattggctc acactgagaa tgtaagaact 1861 acaaacaaaa tttctattaa attaaatttt // LOCUS PRD1XII 1701 bp ds-DNA PHG 17-JUL-1990 DEFINITION Bacteriophage PRD1 early protein genes XII and XIX, complete cds. ACCESSION M30146 KEYWORDS early protein XII; early protein XIX. SOURCE Bacteriophage PRD1 (Viradea strain) DNA. ORGANISM Bacteriophage PRD1 Viridae; ds-DNA nonenveloped viruses; Tectiviridae. REFERENCE 1 (bases 1 to 1701) AUTHORS Gerendasy,D. and Ito,J. TITLE Nucleotide sequence and transcription of the right early region of bacteriophage PRD1 JOURNAL J. Bacteriol. 172, 1889-1898 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by D.Gerendasy, 27-NOV-1989. FEATURES from to/span description pept 242 724 early protein XII pept 797 955 early protein XIX (put.) mRNA 179 732 early protein XII mRNA mRNA 181 1100 early protein XIX mRNA binding 230 233 ribosome binding site binding 785 788 ribosome binding site rpt 1 111 inverted terminal repeat site 145 172 right early promoter BASE COUNT 419 a 447 c 390 g 445 t ORIGIN 1 gggggatacg tgcccctccc cacctacccg cgcccctaac atttttattt ccgtctgtca 61 ataccccctg catccgatag gcccgaacta tcacaaacgg aaaagcgata ggaataaaat 121 tttgatttat tttcaccata actattgcac gtcctaaaaa tgtgcgtata atttcaatca 181 tcgaaagggc aaacacaaaa cccccgccga taacttccac ttaactttaa aggtaactat 241 catggaaatc gtaagcaagc tgactctgaa aaccattggc gctcaaccca agccacatag 301 cgtaaaagaa aataccgcgc tggcttccat ctatggccgc gttcgcggta agaaagttgg 361 tcaatccacc tttggcgact tcatcaagtt tgaaggtgaa tttgaaggcg tgaatatcgc 421 cactggtgaa gtgttccgtt ccggtgcgct gattctgccg aaggtactgg aaagcctgct 481 tgccggtgcc gtggatggtg aaaacacggt tgattttgcg gttgaaattt gggccaagcc 541 ttccgaaaag ggcaacactg gttatgaata cggtgtcaag ccgctgattg aacccgccgc 601 atcggatgaa ctggccgcgc ttcgcaatca ggttaaggcc gcgctgcctg cccctgccgc 661 tgccggtgaa gccgctgccg aagccaagcc cgccgccaag gccaaggcca aggccgaagc 721 ctaaacagcg caccacggcc ccggctgata ccagcgcggg gccattcttt acccttcaaa 781 ttcaaaggta acaaacatgg aaaagcaaac cgaaaatacc cgccctgaat gcccgaaggc 841 attttatttt gtatccattc ccggcgactt tgggcaaacc ccttttgctt catcgcttat 901 gtatggtagc accgcgcttg ccgctgttat caagtcaagg gcgctatccg cgtagtaagc 961 gaacattttg atttgcggtt tgctgataac ggttttacgc ctgctggcgt tacgcaagct 1021 gaatggcttg gcaagctgat tactgaaact ttcggctttc gcttggaact gtttctttaa 1081 accagttttt tatttcttga atttgaaggc cgcatgattc ccttgcggcc ttttcttttg 1141 cgtatgcttc caccaaatcc cgccacgtta taaacggata aacaggcgcg gcgcattcag 1201 ttagcaggct tgacggcgga accaaataaa ccgtttcaat tttcggggct gttgtcgaac 1261 aagccgcgaa catcggcagg cataggacta tcaagaattt gtttagtttc atcgcttacc 1321 cttcccaatc cgctaattgt tcggctaagt ttgccgctga taattgtctg gccttcctga 1381 attgccgcga ttgctttcgc atcgcttgcg cgtactgatt gaagggtttc caccgccttg 1441 gcgtttgcgg tcgccttttc ttcaatggct ttaatttcat tttgaagccc cgttattttt 1501 ttagccgcta aaattgaata tccaaagtta gcaagcatta aaagcgcaac cgcaatagcg 1561 gccacgcttt tcaggttaga caaaacccat ttaatcattg aaggtacgcc ccggcttgtc 1621 attgtcggaa tctatcggcc ccggctgatt gataggggca atcttgcttt gcacaaattg 1681 cgagaaaacc gccttgacaa a // LOCUS PSEALALPQ 3141 bp ds-DNA BCT 17-JUL-1990 DEFINITION P.aeruginosa exopolysaccharide alginate regulatory protein (algP and algQ) genes, complete cds. ACCESSION M32077 KEYWORDS alginate regulatory protein P; alginate regulatory protein Q. SOURCE P.aeruginosa (strain PAO, isolate PA02003) DNA, from patient PA02003. ORGANISM Pseudomonas aeruginosa Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Pseudomonadaceae. REFERENCE 1 (bases 1 to 3141) AUTHORS Konyecsni,W.M. and Deretic,V. TITLE DNA sequence and expression analysis of algP and algQ, components of the multigene system transcriptionally regulating mucoidy in Pseudomonas aeruginosa: algP contains multiple direct repeats JOURNAL J. Bacteriol. 172, 2511-2520 (1990) STANDARD full staff_review COMMENT draft entry and computer-readable sequence [1] kindly submitted by V.Deretic, 16-FEB-1990. FEATURES from to/span description pept 781 1263 alginate regulatory protein Q pept 2079 3137 alginate regulatory protein P (put.) mRNA 661 1310 algQ mRNA mRNA 2054 3141 algP mRNA BASE COUNT 607 a 1026 c 1009 g 499 t ORIGIN 1 bp upstream of BamHI site; map position 9 min. 1 ggatccgcca tgcccctggc cagcccccgt cagcttttcc ttctcgcgtt cctggcctgc 61 gtcgccatca tgggcggggc gctgtacctg gaacatgtgg ttggcctgga ggcctgcccg 121 ctgtgcgtcg tgcagcggat cttcttcatc ctgatcggcc tgacctgcct tgctggcgcg 181 atccaggggc ccggcctgcg tgggcggcgt atctactccg tgctggtgtt cctgctcgct 241 ctcggcggcg gggccacggc cgcccgccag gtatggttgc agaccgttcc gctggaccaa 301 ctgccggcct gcctgcccag cctcgactac atgatgcagg cgcttccctt ccaggaagtg 361 atccgcctgg tcctgcatgg caccgcggat tgtgcccagg tgagctggac gctattcacc 421 ctgagcattc ccgaatggag cctgctggcg ttcgttgcct atctcggctt ctccatcgtg 481 cagttcctcc gacgtgcctg atgcgggctg tcgcgtggtc tttcgtcagc tctcgagcgt 541 cggccgtatc gatttggcga aaaccagagc ggatgcggga taccgatcaa acatccgtat 601 gaatttttct gtacgggcgt cttgatgggg tcatcccggg ggcataatct ggctttccat 661 agcgccggag tccgcccctt gcacaggcgt tctcccacag agcgcagaga gtgcgtcagc 721 aatgcgtctc ggagcggacg atgattctcc gggacgtccg aagagggaag cgaggacaac 781 atgctcgaaa gctgccgtaa tgcccaagag cggtgggggg gtgtgcacca actgatcgat 841 cgctggctgc atgagcgcca gcaactggtg caggccttcg atgccctgag cggtattcag 901 gcaccggcgc ccaacgccga agaactccag catttctgcc aattgctgct ggactatgta 961 tctgccggac acttcgaggt ctacgagcaa ctgacggcgg aaggcaaggc cttcggcgat 1021 cagcgcggcc tggagctggc caagcagatc ttcccccggc tggaagccat caccgaatcc 1081 gcgctgaact tcaacgaccg ctgcgacaac ggcgattgcc gtgaaggagc ctgcctcatc 1141 gcggagctga aggtcctgcg gcaacagttg cacgaacgct tcgagctgga ggactgcctc 1201 atcgaagtcc tgcacaacgc ccacagccag agcggcgcag agggttcggc agtaccggtc 1261 tgattcaccg gcgagtcgtt acgaaacggc gcgtctccat gcgggagagc gccgtttttg 1321 catttcagcg gaagcccagc aggtcgatct cgaagaccag cggagcgtcg ggcgggatca 1381 ggtcgccagc gccttcgtga ccataggctt gcgccgaagg aatcaccact cgccagcgcg 1441 cgccgacagg catcgctcgc aacgctgtcc gccatccctc tatcacactg tccagggcga 1501 accactcggc gctttcgctc tggtcgaaca cttgaccgtc ggcaagcagg ccgcggtaac 1561 ggacgtgtac ctgggttgcc gcgccgatgc cgttaccttg gccacgacgt aattcgctga 1621 cgaggacgcc gccggtcaat tcgcggacgc cgaagcgggc tttttcgttg gccaggaagc 1681 gggcctcggc gggtgtggtc cgcggtgttt ccgtcgtggc gttctcgtgt tgcagcagaa 1741 gttgttcgat gcgttcgggg gggatttcaa gtgcctcgcc gcgataggcc tggcgcagtc 1801 cgagaagtag ctcggatagc tctaggccgg gcatttcctg ctgtaggcgc atacccaggc 1861 gggcgcctac ggcataggcc agttcgtcct tgggaggcgc ggcgaaaagc ggcagcgcag 1921 tagaaaaaga ccgatgaaaa ggcggcgcgg catgcggttt ctccaaacga atcggcgggc 1981 aatcctggcg cacgtcgctt gtagcacatc gaacccgttg gcgagagggg gtgtgcgggt 2041 ctagtatggg cgcaaccacg tccgcctgga ggcacgtcat gtcggccaac aagaagcccg 2101 tcaccacccc cttgcacctg ttgcagcaac tttcccacag ccttgtcgag cacctggaag 2161 atgcgtgcaa acaagcgctg gtcgattcgg aaaagctcct ggccaaactt gaaaagcaac 2221 gtggcaaagc ccaggaaaag ctgcacaagg ctcgcaccaa gctgcaggat gctgccaagg 2281 ccggcaagac caaggcacag gccaaggcgc gcgagaccat cagcgacctg gaagaggcgt 2341 tggataccct gaaggcccgg caggcggaca cccgtaccta catcgtcggc ctcaagcgtg 2401 acgtacagga aagcctcaag ctggcgcagg gtgtcggcaa ggtgaaggaa gctgctggca 2461 aggctctgga gagccgcaag gcgaaacccg cgaccaaacc tgctgcgaag gcggcagcca 2521 agcctgcggt gaaaaccgta gcggcgaagc ctgcggccaa gccggctgcg aagcctgctg 2581 cgaaaccggc ggccaagcct gcgacgaaaa ccgcggcagg caagcccgca gccaagccga 2641 cggcgaagcc tgctgcgaaa ccggcggcca agcccgcggc gaaaaccgca gccgcgaagc 2701 ccgcagccaa gccggcggcg aagcctgtgg cgaaaccggc ggccaagcct gcggcgaaaa 2761 ccgcagccgc gaagcccgcc gccaagccgg cagcgaagcc tgtcgcgaaa ccgacggcca 2821 agcccgcggc gaaaaccgca gccgcgaagc ccgcagccaa gccagctgcg aagcctgcgg 2881 cgaaaccggc ggccaagcct gtggcgaaat ccgcggccgc gaagcctgca gccaagccgg 2941 ctgcgaagcc tgcggcgaaa ccggcggcca agcctgcggc gaaacccgta gccgcgaaac 3001 ctgccgcaac caagcccgcc accgctcctg ctgcgaagcc tgcggcgact cccagcgccc 3061 cggcagccgc ctccagcgct gcttcggcaa cgcctgccgc gggcagcaac ggcgccgccc 3121 cgaccagcgc ctcctaaggc g // LOCUS STYAROF 1235 bp ds-DNA BCT 17-JUL-1990 DEFINITION S.typhimurium tyrosine sensitive 3-deoxy-d-arabino-heptulosonate 7-phosphate synthase (aroF) gene, complete cds. ACCESSION M31302 KEYWORDS 3-deoxy-d-arabino-heptulosonate 7-phosphate synthase. SOURCE S.typhimurium (strain LT2) DNA, clone pGM59. ORGANISM Salmonella typhimurium Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Facultatively anaerobic rods; Enterobacteriaceae. REFERENCE 1 (bases 1 to 1235) AUTHORS Muday,G.K. and Herrmann,K.M. JOURNAL Unpublished (1990) STANDARD full staff_review REFERENCE 2 (bases 165 to 1235) AUTHORS Muday,G.K. and Herrmann,K.M. TITLE Regulation of the Salmonella typhimurium aroF gene in Escherichia coli JOURNAL J. Bacteriol. 172, 2259-2266 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by K.M.Herrmann, 13-JAN-1990. Author address: K.M.Herrmann Purdue University Biochemistry department West Lafayette IN, 47907 (317) 494-1653 FEATURES from to/span description pept 165 1235 3-deoxy-d-arabino-heptulosonate 7-phosphate synthase (E.C. 4.1.2.15) site 1 20 operator aroFo2 site 52 71 operator aroFo1 site 69 113 promoter site 75 94 TyrR box-1 BASE COUNT 309 a 286 c 343 g 297 t ORIGIN 1 gtgtaaagtt tttgatacga aatgagggag attaatcgac gattgcgcat ggtgtaaagt 61 ttattttacg aaatatggat tgaaatcttt actttttgtg gtatggttta aacatcctcg 121 tggaggatca actatcgcaa acaagaaaag acaggatcgc catcatgcaa aaagacgcgc 181 tgaataacgt acgtatcacc gatgaacagg tattaatgac gccggagcag cttaaagcgg 241 cctttccctt gagcctggcg caggaagccc agatagcgca gtcccgggga atcatttctg 301 acattattgc cgggcgcgat ccgcgtctct tggtggtatg cggtccttgt tctattcacg 361 atcctgaaac cgctctggaa tatgcccgtc gatttaaagc ccttgccgca gaggtcagcg 421 atagcctcta tctggtaatg cgcgtctatt ttgaaaagcc gcgaactacc gtcggctgga 481 aagggctgat taacgatcct cacatggatg gctcatttga tgtggaagcc gggttgaaaa 541 tagcgcgtca gctactggtg gaactggtga atatggggtt gccattggcg accgaagcgt 601 tggatccgaa cagcccgcaa tacctgggcg atctgtttag ctggtcggcg ataggggcgc 661 gcacaaccga atcgcaaacc caccgcgaaa tggcgtctgg tctttctatg ccggtcgcgt 721 ttaaaaacgg cacggatggc agcctggcga cagcgattaa cgccatgcgc gccgctgcgc 781 aacctcatcg ttttgttggc attaaccagg ccggtcaggt tgcgttattg caaacccagg 841 gaaatccgca tggccatgtg attctgcgtg gcggcaaagc gccgaactat agcccggcag 901 atgtcgctca gtgtgaaaaa gagatggaac aggcgggact acgtccttcg ctgatggtag 961 attgcagtca tggtaactcc aataaagatt atcgccgcca gccagccgtt gccgaatctg 1021 tggttgcgca gattaaagat ggcaatcgtt caatcattgg cttaatgatt gaaagtaata 1081 ttcatgaggg taaccagtct tccgaacagc cgcgcagcga aatgaagtat ggcgtttccg 1141 tcaccgatgc ttgtattagc tgggagatga ccgatgccct gttacgtgaa attcataaag 1201 atttgagcgg ccagctggcg gtgcgcgtcg cataa // LOCUS TRBANTAT 18598 bp ds-DNA INV 17-JUL-1990 DEFINITION T.brucei expression site variant specific antigen 4 - 7 genes, complete cds. ACCESSION M33720 KEYWORDS expression site variant specific antigen. SOURCE T.brucei (strain EATRO 1125), cDNA to mRNA, clone AnTat 1.3A. ORGANISM Trypanosoma brucei Eukaryota; Animalia; Protozoa; Sarcomastigophora; Mastigophora; Kinetoplastida; Trypanosomatina; Trypanosomatidae. REFERENCE 1 (bases 1 to 18598) AUTHORS Pays,E., Tebabi,P., Pays,A., Coquelet,H., Revelard,P., Salmon,D. and Steinert,M. TITLE The genes and transcripts of an antigen gene expression site from T.brucei JOURNAL Cell 57, 835-845 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by E.Pays, 20-JAN-1989. FEATURES from to/span description pept 1174 2196 expression site variant specific antigen 7 pept 3528 4733 expression site variant specific antigen 6 pept 8519 9772 expression site variant specific antigen 5 pept 14450 18157 expression site variant specific antigen 4 mRNA 986 2676 ESAG-7 mRNA mRNA 3483 5056 ESAG-6 mRNA mRNA 8260 11304 ESAG-5 mRNA trns 5305 5808 rime retroposon BASE COUNT 5298 a 3468 c 4707 g 5125 t ORIGIN 1 gcatgcgaag agcaggggtg caacggaaga gtgaaaggat tagaggaaat agtggaatta 61 gcatgattgg agattgtgag ggttaggaaa aacgtgacga gggaaatggt aactggacat 121 cgtgcgcgag gatggcgaag gcttgaagcg tgtagagagc gtgtgtaaat ggcccaaacg 181 gtgtgggtgg ggtaataacg cagagtgctg aggtgtttga caagttcccg cgaggtgctg 241 gggaggagga agcaagacag taaaatcgtc gtgagacgtg gctgtgctcg ggaatccgta 301 caaatatact gaaaattatt gatggttgat atgcgtggga acgacgcagg atgatgctgc 361 atcgccaaga tgcttcggga gaacaggaaa ataacgacga atcaacactg gtgagttgat 421 actataaaaa tcgcgtgggc tttgctgctt cgctttcata aatgccacct gcaatgaatt 481 tggagggtgg aggtgagctg ccctgtgtat gggtgggtat tgcagatgca gcggcgcttt 541 tgggtgattt ttggtttaca gtttgctgtt gttgggaagg ttggtggtgg caacaaaatt 601 ggagaaaatg atttcatctg gaaacctact tcaaaacccc ataaacaatc ggacctcgaa 661 gtgcaagtgt ctcccagaac gtaaacaagg ccttgaagac gaccagcaga tgaagcacag 721 caacagcgac agaggtggac tgagacgaaa gtataccata tgactatgcc acaagaattg 781 atgggtgatt ctgccaacca cggacgcagt caaaaacata gaaggagctg tagtgaatat 841 aaaaccactc atattttcta caaatattag agctttatta cttaagaacc tgttttagca 901 ccaaggttgc gtgtgtattg ccgttgctgt tacttattgt tgagttggta tttgaggaaa 961 atgttttttc aggtgcgtcg ttgttatata cggctgtgag ttggtgatgt ttatttgaat 1021 gatagcttga taggataaga tgaaataata ctaagcatgt attcttctgg cgtgtcgccg 1081 aaatggtaat ctatgttacc catagcacga aaaaattttt ttactcagta acgaagcgga 1141 aagttttaaa aagcataatt taaaaaagtg ataatgatga agttttggtt tgtgctgttg 1201 gctcttttgg gaaaagaaac acatgcatat tatgaaaata aaaggaatgc attaaatgca 1261 accgccgcta ataaagtgtg tgggctatcg acctatctta aaggaatagc gcacagagta 1321 aacagcgaga gtgccgtggt tacggaaaaa ctatcagatt tgaaaatgag aagcatccag 1381 ttgcagttat cagtaatgcg gaacagagtg ccttctggcg agcaggattg taaagacatc 1441 aggacactct tgaaaacagt attgaggaat gagtttacat tccagcagga gttggaggaa 1501 atgaggaacg catccgcgtt agcagcagct gcagctggga tagcagctgg aagactggaa 1561 gagtggattt ttgtatttgc acaggcagca ggcgggtcat cacagttttg cataagtgtg 1621 gggaagcata tcccggctga gcacggcaac ttgcaagaat gttttgatgg aataattgga 1681 cctgaaaccc tttacaaaat tgaggattca cgtgtaaaag agtcggcgca gaaaagcttg 1741 caactccatg aagtgttatc atccatttcc ttcagtagct tgggtgcgga aagtattgtt 1801 gagcaaggag agaacagagg atgtaaccta atgcggacgg ctgacggagg tttgctgaaa 1861 gatatttgtt tgaattgcaa cttcacatgg ggtggtggag tgatgaattt cggatcctgt 1921 gtggcgggga acctaaaaat aaaaggggga gaatacggtg atgtcagttc ccacgatgtg 1981 gtacggtgga ccgaagatcc cagtaaagtg agcatattta aggatgtcat ccgcctgttt 2041 gcgcgcttcc aagaagcaaa aaatgcagtg atgaataaaa taaaaactac tgtggatgaa 2101 ttggcgaaat gcatagggca gaaagaggtt gagctcacta atgatcagct ttacgaggaa 2161 ttcgaggcaa tacaaaagta cctagggtct ctgtgaaatg gagtaaaagc gaattcaact 2221 atactgcaga accggtccgg ggacatttca cgggagcggg gtccaacgcg gcagcagtac 2281 atttgagtgt ttctaccgct gcactttgtt tttcagtttt attgttggga gtgctgtgaa 2341 ggaaggatgc gacagaagct gcgctgctta gcgtgaaaga ttatggtaat ggagggttgt 2401 gaaagattgg gggaacaaaa acctatttct tttatttggg ggaacaaatg ggcaaaagta 2461 acgtaagttt ccagtgggag tggtatgtgt gtgtgtatgg ggctggctaa ggaaagatgt 2521 gagttcggca tgtggtatgt acaagctacg aaaacgtgtg aaacaaaacg agatgtaagg 2581 ggaaaatgta acaaccaact atgttaaatt tcaggagact atttttcaaa tttagttaca 2641 acaaagtaaa tgtcaaataa tgccaactat aaaggaaaac tagactaatt ctactgttta 2701 atgtgacgag aattgtggcg tgaaacctag taactgctta tcatggcgtc gaatttcact 2761 tctccagaat ctttatttta gtgcgctata gatgatagtc caaagagaag gaattacaat 2821 tattataacc catccaaagc tatgtcagag actcctcttc tccgagtgat tcttttctat 2881 gtcacttgtg gaaggtcgat cttggaacaa tatcgaacgt gtgaagcctg aaatggggcg 2941 tttttttgct tttttttttg gtggttgtta gagttatcta ccgctatggt atgatgcata 3001 gaggtaacgc taaaaataag tgacaaggtg ctttagttga gcgtatggta ggtggaagcg 3061 gctaatgatt tgtcggcacc accaacagag gaagttaatt gttgacgtag ccgtttgtga 3121 atgagataga aggttattgg gtggaacatt atcttggaaa gagtttttct tccactaaaa 3181 atgtgtagag tggtggtttg gagaaaatgg tttagatatt gaggaggaaa caaggatttg 3241 aggtgcaagt tgatgcgctg taagatttgt attgccgttg ctgttactta ttgttgagtt 3301 ggtatttgag gaaaatgttt tttcaggtgt gtcgttgtta tatacggctg tgagttggtg 3361 atgtttattt gaatgatagc ttgataggat aagatgaaat aatactaagc atgtattctt 3421 ctggcgtgtc gccgaaatgg taatctatgt tacccatagc acgaaaaaat ttttttactc 3481 agtaacgaag cggaaagttt taaaaagcat aatttaaaaa agtgataatg atgaagtttt 3541 ggtttgtgct gttggctctt ttgggaaaag aaacacatgc atattatgaa aataaaagga 3601 atgcattaaa tgcaaccgcc gctaataaag tgtgtgggct atcgacctat cttaaaggaa 3661 tagcgcacag agtaaacagc gagagtgccg tggttacgga aaaactatca gatttgaaaa 3721 tgagaagcat ccagttgcag ttatcagtaa tgcggaacag agtgccttct ggcgagcagg 3781 attgtaaaga catcaggaca ctcttgaaaa cagtattgag gaatgagttt acattccagc 3841 aggagttgga ggaaatgagg aacgcatccg cgttagcagc agctgcagct gggatagcag 3901 ctggaagact ggaagagtgg atttttgtat ttgcacaggc agcaggcagg tcatcacagt 3961 tttgcataag tgtggggaag catatcccgg ctgagcacgg caacttgcaa gaatgttttg 4021 atggaataat tggacctgaa accctttaca aaattgagga ttcacgtgta aaagagtcgg 4081 cgcagaaaag cttgcaactc catgaagtgt tatcatccat ttccttcaat agcttgggtg 4141 cggaaaatat ccgaggagga aatgggaggc atggatgtaa tttggtgaga acagatactg 4201 acggtgtatt ggaagggggg tcggtgagac ggcacaactt aacgtggggt ggtggagtga 4261 tgaattttgg atcctatcaa aacggaagta tgtacgttga gggtggtgaa tacggtgatg 4321 ctacagagta cggtgcagta cggtggaccg aagatcccag taaagtgagc atatttaagg 4381 atgtcatccg cctgtttgcg cgcttccaag aagcaaaaaa tgaagtgatg aataaaataa 4441 aaactactgt ggatgaattg gcgaaatgca tagggcagaa agaggttgag ctcactgatg 4501 atcagcttta cgaggaattt atctgggaga ccatacacag attggagctg tcaaagagag 4561 tgagtgaaca actctcttta ggtgaagaag aggagacgat cctgaaatct aactatactg 4621 cagaaccggt ccgtggacct ttcacggtag cggggtccaa cgcggcagca gtacatttga 4681 gtgtttctac cgctgcactt tgtttttcag ttttattgtt gggagtgctg tgaaggaagg 4741 atgcgacaga agctgcgctg cttagcgtga aagattatgg taatggaggg ttgtgaaaga 4801 ttgggggaac aaaaacctat ttcttttatt tgggggaaca aatgggcaaa agtaacgtaa 4861 gtttccagtg ggagtggtat gtgtgtgtgt atggggctgg ctaaggaaag atgtgagttc 4921 ggcatgtggt atgtacaagc tacgaaaacg tgtgaaacaa aacgagatgt aaggggaaaa 4981 tgtaacaacc aactatgtta aatttcagga gactattttt caaatttagt tacaacaaag 5041 taaatgtcaa ataatgccaa ctataaagga aaactagact aattctactg tttaatgtga 5101 cgagaattgt ggcgtgaaac ctagtaactg cttatcatgg cgtcgaattt cacttctcca 5161 gaatctttat tttagtgcgc tatagatgat agtccaaaga gaaggaatta caattattat 5221 aacccatcca aagctatgtc agagactcct cttctccgag tgattctttt ctatgtctct 5281 tgtggaaggt tgatattttt gctaccccct ggcgatgccg gccacctcaa cgtggtgcca 5341 gggtccagtg ccccgtatca ttgggggaag ccaagagcca gcagcgttcc tttcatgggg 5401 aacactgctg tgctccggct acggcatcat acagcacagg gatcagcagc gtcttgctgg 5461 gacaccgttt ttcatttgtc ggtccctggg cacgtgccag cgtgccatca gcagtatcat 5521 ccgcgctaag atgctgctgt ccggtgatgt ggacctccaa aaaaggattg ccatttggca 5581 tcttttggag agtccagggt gggaggcttc tcgccccatc tgctgtattc cgttcaactg 5641 cggagctaca acaaaaaatt atagagggtg tgttaggatg aatgaaaaag ggagactctg 5701 ccacagtcgc cagaccgata gcatctcagg gctctacggt gttggctgat ggccgcgcca 5761 gaggggggga aactctcacg aaggcacgaa gaaaattcaa aaaaaaattt ttgctaccac 5821 aggctgtaag gataacgctg gagatggagc cgactgtggt ttttcttgtg aatgcgtatc 5881 ttcgtaaccg gttgttaaga agggatcgga cgaaggaatg tgtctgagta ggaggtaatg 5941 tcgatgtttt ctttccattg gatgtagtaa taatctccaa aatttgttgg aaacaatata 6001 atattattgc aataaatact tgaaaagtta aattattgat ggattgactt gagtatcatg 6061 attccaatga acactctcat gatgcttaga atagcaaacg tatgaacaat caacgtagcg 6121 tcatgtagac atggacattt tttgtaatga tacgactgct attgcacatc tgaaaatttc 6181 tatgacggac cagtgccgtt aatcaccgta caattgaatt caggatgaaa aatgataaat 6241 cagtgctcta atgtatgatg cagctgtcaa ggcaatgttt ctgcacaaaa agagtttaac 6301 acactgccat ttgcatgcga ataaaacaat gagtatatga agcggaggat cgcatggcaa 6361 tgataaaatc gaacactgtg gctgcgagac gcagggccac ccggaagcac cgggtaacaa 6421 agcgagttca ggagtcgtcg aaagtggcaa aagtcagcac caaaccaagt tgctttttta 6481 aaaagatcag caaacgtatt tctcactgac ttatacagca ttcaaaggaa atgaacttag 6541 cagcaggaca agggtgcaaa ctaaggcatc gattctgtat ggctgcaaaa gtgaagacat 6601 caagcaaaca aaggtaactt atttaccctt ataggagcag tggaaaaagc agcggactcg 6661 tatggagata acctgacgaa aatactacca gttttagtca caggaagcga tatcctgtca 6721 aaatgccggc agccactcaa tggcggtcaa aacttgatta aacagttgaa aaaaacagcg 6781 gaatcaaaag gaaccttgcc cgtgctcacg gtggtagata aagcaacgaa aaatgtaaag 6841 gtagaattta acggtaacaa ttcgaacaaa accgtactat cgtgcaaaaa tatttcagca 6901 caaaatcccg acaaactttg cggcgacaac acagcagaca acatggccaa gtgcacgaaa 6961 gcttccaaag cagcttttgg cgctgatggt gtatatttca aaagaaaaaa gggcaaacct 7021 tgactcagca gctataataa agtcagctag aaccggcgaa acagcggaag cgctgagcaa 7081 acttaaaaca ctggacatta tcggtgattt tatattgcgt ttatttgtcg aaaagctcat 7141 ggacatgacc gtggatgagt acacgactca aagagagact aacgacaaaa aacgctgcag 7201 tggcaaaagt agcgtgcctg tacaggggac ggagaacttg ataataatgg taatattaat 7261 cttagaagaa tgttgtgagt atgtgtcact tttacagtga tagttgtgaa atattcggaa 7321 tgacagttgt aatggaggtg aaagaagctg ttaggttgaa tttggggaac catctatgta 7381 tacaaagaag cacaagggtt gtgatagctt atttatatga tggagaagcc atgcatggta 7441 aaaaaaaaca acaagtgata tgtcggtgtt tgtggctaat gtagcataaa gactaacagc 7501 tacagcagtg catcaaccga tggagagacg ctttcagaat tttcaaatgt aagaaagtca 7561 gtaaattaca aagaagtggt tgagttgata ctcagaaaag tgggactaac aaactgttga 7621 tgaatgaagg tgatgccata cgtaccaggt acacaggtca gcaatatgat gctcactgaa 7681 aaagagagat ggtgtgctac tggcgggatg tcgctggaaa agtcttagca agtgtagttg 7741 cagcaccgag cacacacact catgggagtt atcaaagtat gaaaacgcaa aggaagctga 7801 tgtgtgtgtg tgtgtttttg tgtgtgtttg caggaaacaa atgtatcagc ttcgacatct 7861 ttccgcaaga agtagaagtt tcagttgtgg tcactgacgc attacgtgga ttgttgggtt 7921 gaaacaacac aaatgtggca accatttagg atgttttggc gctttggcga cggttgtact 7981 gcagtttgta tgccccttca agaaacaagg aatggtgtga aagcgttgtt tttttttaag 8041 tcaagtatgt gcattcaaga aatcatatca cagtgttacg cagtgctttg tcgtcatggg 8101 aggaccttcc tgcattgaga gtgcatgtaa ggaaacggca acaggatgct atggctccga 8161 atttttttta agtatctgct tttcagacca cttgcgttga tcgacttcag ttatgttaca 8221 ctccgcacat cttttttgct atgttttttt ctactttagt tgcacttagt gtagccaaca 8281 ttaacttaaa gtacaaactc tatcctaagc aggattcggt gttatttcta ttcgacgatg 8341 tcattggtat cgagcgcacc cgttttggtg ttcatttccg ttgtaacact tctgtgttgt 8401 atcatcggct ctgggcaacg gtctagtagt ccctttacgg aaccggcact caaaccaggg 8461 ggcattaagg ttgctattca agaggcggct gttacgccat tgttaccagt acttgcgtat 8521 gagttcgaaa ggtttatgga gaatatcacc attcccgagc aaaagggtta accgtgtgag 8581 tgtagaagag acttattttc gtaacgtgac ggtcggaagc gcgacagtta aattcgaaga 8641 accaaataaa atcattttga agttttggaa cgtgagcgct actgtgccgt ttacgcggtt 8701 tgtataccac tcgttttggt gttacctgta tccttgcagt gggtcggcac aggcggaaat 8761 acgcaacggc agtgtggcac tgtggctcga cgtgtccgca ggaagggggg gactactaga 8821 catacatgta gggagttcgg aaattggcat gcgtgaccca ctgataactc ttatcggtga 8881 agggaaatca aagatgccta agtggcttgg cgggagagtt aaagatatgt atgataagga 8941 tgtgcttccc aagcttgggc atcatataat tactgcggtg aatcaaattt tagccaacaa 9001 agcaaaggag atatttcata tgtttccaat tgtctttgtc aacagttcta aaattgaata 9061 tggacaaatg agacttgaac tggtcgtgct tccagatgcc gcagacaaat taatgcttac 9121 ggagaatgtt tttttcccgc ggcacgcttt tccaaacttt cctgttgctg ttgtttcttc 9181 gtttacggct ttgaataata tgcttcgttt gatgattaaa agtggccact tggtggtgcg 9241 tgtgcccttt ccgctgaggt atgttatctc cagcaacgcg ctcagcgtca gttggaccgt 9301 cttttgctct ggatgtgcct ctgaggctac attcgaacta aaaacagccc cgtggttgaa 9361 gtccctaaat aagaaactgt ttacatttaa ctatcgggat gtcgaagttg ccgtggacct 9421 gctgccaagg ggtggtgccc ccatttcctt gtttagtatg ttaatgaatg tgtctgccca 9481 ggcggcgcat atcgctcttg ttgacggtgc cacccatgcg aatttggatt ctgttgatac 9541 taatgtgagt gtcacatcct cacgtatcga tggcctggac tcaagtacaa tgaacacaaa 9601 aattcgtgat ttgatcaact tgtcttggat caacctgaat gttacatata catttcctgc 9661 tcccttcgac ttgtgtacga aacacgtgaa tattacaagt gaatgctatg tagcgggatt 9721 taaccttgtg cgagctcttg gttcactcag tatactccct cacctccgat gaaggattgc 9781 gacattcctg gtcactttgt aaaacacact tccatgcgct tatgtggaag ggggtaacac 9841 tcaaacctat cttgctttcc cctgtttacg cacaacttca catacgcact ttctctttta 9901 cgagatgttt gcatgccgca acaaccaccc tgcgcaatcg acgagaagct atccgtgttt 9961 acgggtgatc cttcgaatta tggcggagga gtaaaaacct catgatatgt gaagtgctgt 10021 ggctgaagaa ggaaataaga taaaaagggt atttttcatt gacccgacca ttactggttt 10081 tgactttttg cccgaggcga gaaggcaaat ccattgttga ttggctcatg aatgtttccg 10141 tgactatacg cgccaagcct tggatatgtc gtgggctttc ggcatatgtt gataatataa 10201 cgactgccag ttggctgagt gacggctcaa agtgttagag gcgagtgtca atggaagata 10261 tatctattga tttggtaaat aatattggat tcgagattaa ctgttcaagt gtgctacaac 10321 accagatagc atgttcgtag tgtaggggtg gcaatagaag tgatggattt atcacgacta 10381 attgttctat tggcaacgct tgtgtctcgt gactgctatt ggttgattgc tgatgtttcg 10441 atttgagttg tttcgtggcg gcacgaaaag tatcatcatg tgaagtaggt taaggggaaa 10501 agaaacagat atataaagag cctttcgtgt tgctcccaag gaattgtata aatctctgag 10561 cgagatttgt gctcggtaat gggttcaaga gtgtgtatat acaaatacta tattaagagt 10621 ggtgattgag accaagtgga gggtcgaaag agatccactt aaagccatgg aaagtgtgcg 10681 gttgtagagg gtagaaaact aagaaaaggc aaatgataaa tagtattata ggaaatgcag 10741 agtgatagca agagagaggg agatggagta accaatggtt taaaaacaag catgacgatg 10801 ggcagtatta tacccataag gatcaaagag ggtcaaaaaa atacaaaatg ggttcagtgc 10861 acaagagatg tgaagatgct acaactgata cggagaagcc tagaatgtga aaattcagaa 10921 aataatacta tatgaaaata gtgagagtga caactcacaa tacaaaacaa actgatacgg 10981 gagaagatta tgctaaacat aaaagtgaaa aaagctagaa tataagctga taaaacacca 11041 taaactacat tgatttgagt ttcatgatat aaaacgtgga tgctctattc agcagaggat 11101 tctagtaaat tttttaatat atcttaaagt aatagtgaac gtagaataaa tttctgttcc 11161 aatacacaga cgagaaaaat aaagaattga acgggtagca aatttaagta acttgcgaca 11221 atatttttgc tgaagctaac ttctaaaaca aaaatgcaaa tacgtgttca aaaatatagt 11281 gaggaggata atcttttaat tacaaacgca tacatcatga taatttcaag gcattgtttg 11341 catatgtaaa cattgcttta ttaactgagt gcagttttct ggtaacctat ttcttttcat 11401 agttttcttt cataaccctc tctaatattt taaggtatat taggaaatga aatttgttac 11461 cttgtgaact ttactattac caattatatc acatttagtg tgtttaagtg caatatacct 11521 tcaagatgaa gaagtaagtg acatgacaaa tatgacacat aatggtacta agaaacccag 11581 agactttgtg gttgtcgttc aaactcatgc aacacaagga tggtgtaggc tcatattgag 11641 tgcgaaagta agccatgtgg aaattattac aattgaaatg ggaggtttgt acagtcatgt 11701 caaaaggcct aaatgggtta tggagttttt agagaaacag aatgcaagtg atgaggatat 11761 tttaatagct aaggatggat ctgatgttat agttggtgat ggaaataagt atgagagttc 11821 tgtggaatat tttgtgtaaa aaagcaccct acagtaagga caaattttat gaagaggata 11881 ttcagaaaaa ggatataggg taatgttatt gatttttgag atggccacaa aagtggagga 11941 aaagaaaagt ctaccaactc tactaaaatg gcctttaact caaaaagcat acttaaatgg 12001 aggtggctta attgccagag tttgggcatt tagaaggttt gaatatgcat ttggagagtt 12061 actaaaggga agtgaggagt ggtggtcaga gcggagtatt tacaaacctt tgttaatatg 12121 gagtgcaatt caggaagaag ctgttggaca gagatttgtt ttgaagagag gaatgattgg 12181 attggattac gaagagagat ttttttctat agaaagtagt ggggtaattg gagaagcccc 12241 attcattcat tttcctggac agcctattgc ttgggaggag aaagcgagac tgattgtgaa 12301 aaatcttagt tgatataaaa ggttgaaagg aagtgaattt aattcgaagg atatggggaa 12361 aagtagaaac ttatgtagta gatggagaga attgggaatt tcgatatgaa agaatttgtg 12421 gggacgcggt caaggagaat gacttgttta aggcaaagaa actgtaaggg atacaaaact 12481 aaagattttc ttgcggctga gtcagtagtc taagcaaatg tgggaccaac acgcatggcc 12541 aagatggccg cacaagtcta tgttgacatg aaaatcctgg ggcagcaaca aagagaagct 12601 ggaagacaaa cctcaggggt accctaacta ttcggctgaa atggtatgtc gtgacggcaa 12661 tatcgatatt ggtttcgggg gtggtaacac gatggacgca aaatttagaa gcttctatta 12721 cctctccata aaccatgatc tcaaagctag agaagcacaa aacaataaca caactagtga 12781 cactatgcgt tttttatgca cctgcacaga agaagtaacc gacgcttgga ccgaagatgc 12841 gaatacatat tcggcatctg cagtataaaa ggtaattcag ctgctagaag gaaaaaattc 12901 ctgaagaccc cgctgtgcca gagatatgta aacaactgtt tgaaatacat gcactcaata 12961 tgtcactatc agacaagacc tggcaggaca agttcataaa ggctcaacga agagccactg 13021 cagtggggtg acgaagccaa caaacataag accactaata cggagtggca cgaagtgtgg 13081 cagcgctggg tctaagccag gcaaagacta aaaaataacg aatactgagg gcgataaagt 13141 aaggaaaaat gccgcctcta aatggcaaag ttcaaaagcg gcacaggccc gagtcaagct 13201 aaactctttg ctgatccatg cggaagcctt gaacgctgag ctggaaaaaa agggcaaata 13261 atggctaacg gcaaaaaaca atagaagtgg acgttcttgc agccctctac ggaacgccca 13321 cggcaaaagc agcgtttggg agcagtggct tattgtcatc atcatttttt tgttgttttc 13381 gccactagtg gttttctttg tttgcgtatt tgatatctcc ttcgtcacta ttcttatctt 13441 tatcgtctgg aatatgggtg agatatcatt cccggtgtgt tttgcagtca tatggcgtaa 13501 catccacctg tgttgatatt ctttacagca tatcttacat gcttctcttt ccaacagtgc 13561 ttcctgtagt ggtgtgcttg gctccccttt actcttctcc ggtatgcact tatatctggt 13621 gagccaggtt ttgcttttta tagacctctt ggaagcagag acatttgtat gcttctttta 13681 ctttctcatg ttcaacttcg ctgcttattt tcctgcattt ccgtttgttg tgtcacttca 13741 gtttgctttt ctacgatgtt taaggcgtat atgttcctag tcaactagcc atacgtggca 13801 tactgctttg ggcaacattg ccacttgtag acttttttac cgaattatct tcttctcgct 13861 tccttcctct tttgacgctc tgtccttact actggagttt taggaggctg ttgtggcatg 13921 tgccttgtgc ttcttttatg accaacaagt gttattttta cagtcgtttt tgcactggtc 13981 gtgtgtggcg aaaaatacgc ccccgctcct ccaaccaatt agtggcaaag gggcggctcc 14041 gttgaggcag cactgcttgt cctcttaatg gtacattgtt gttttaccta tcgctggttc 14101 actccaccgc acgcggtgca taccttggga gttctgtact gttttggggg tggttttgtg 14161 cattgtaaca gccgcctctg acacaacaag cacagtttca cgcctgcgtt actggcattt 14221 tatgcctgtg gtggaacaac gtgtgtgttg acgtccgtaa atcttaatat ggcttgttgc 14281 gtgctaccgt ttatattttt cccccttcga tatgggcatc ccgttcttgt cgttacgtgg 14341 ttcccagttt cacctaacaa cccttgccct ctcttaccca gcagccgctc cctttcgttc 14401 gttccagtgt aacaaccact ttcacgcgta gactatcaaa atcactgcga tgaatatgct 14461 tcacttgagc gaccgcaatg cctcactcgc accgagtggc ggggaacatt cccttccgac 14521 gggaggagct gtgtgccgtg acgccatgga catacttccc gtgatactga gagctccggt 14581 cgcgttgtta cttctgctcg tcgtcctgcc gcaattgtca gtgggcgccg aggctaatgt 14641 gacagtcaag gttctctccg ccacgtggaa ctggtacatg ccgcggaagt atgtaaccgc 14701 gattaatgct ggcttcaacg cctcactgaa gtcacgaaac tggacagtcg ctggaagcgt 14761 caacgttcag gtagtttacc ccagcaacct tgacctgatg cccgaagatt tcattaaaaa 14821 gcagttggaa ttggaaactg accagaataa gattgttata gtgtatggac ccctcggtga 14881 caaaagtgta atgcactcga ttccacattt gatgaatcac agagtagtgg cctttgggtt 14941 gataactggg tccactttta tccgacaatg gaatccgtac ctttactttc tgcgtgccga 15001 ccccgctgct gaaacattgg ttctcattcg gtattcgcta tgccagttgc gcgtgctgcg 15061 cttgggcttc atgtaccttc agggtgttca ttatggcgac gaggagtatg cattgacagt 15121 gaacgtgatg tctcggatgg gttatgaact gcatggtgtg ttcaccgtga tgagtccaga 15181 cggtaaaccc gctcctgatg ctgagtttaa agaggtgttt gagcgcttcg ccactgccct 15241 tcctcaggcc attattgtct ttggtgcgcc agtggatgat accgcgaagt tcttgatgat 15301 gatggcagtg gatgagcgaa tcgctcggtc gtatattctt agtccttcat ctgtccagtt 15361 gtcactcata gaaatgtggc aactcgctct ggaagctgca ggggcctcat tcgcgcccgg 15421 acagctactt ttcacaggca ccaatccgct tgcgaaggac agccagtaca cagcaatcaa 15481 gcgttttcaa gaagttatga gtgaatacct aaaggctcac gtcggcgaaa ccaacatcac 15541 tgaagcagat tatttcctga cccacgatct tgagggagag ttgatggtgt atgggtggat 15601 ttctggggag gtgttatctc aggcacttag caaccttgaa tggctcaagg accgcgctac 15661 atttgtcagg tcgctatacg accaacggcg ctacgtcatc aatgacattg tgatcggtga 15721 ctatggcggt acgtgtgagg gggacgcggc caagcatggc gccacctgtg agtgtaacca 15781 aggcagcaag gcggtgtatg tgaaagagat gttggaaaat gggcagaaga cgtcggtgcg 15841 aagtggcttc acagttctga aagcttcgct gtgctatacg gattcttcgg agcttcacgg 15901 gccactggat ggactcgtcg ttttcatgaa agacgacgac atagcgtcga aagcagctgc 15961 gctgtggcag aaggggacct cccatcttgt cggcaagggt gatttgggat actcggacag 16021 gttctttctg cacgcattta atactaccat agctgaggca gcgaacgatc ttcggcgtga 16081 ccaaggtgaa agaatcgtaa ccgctgtgtt tggccccgtg accgaggcga tgttggacac 16141 gccgaatata acctttatcg atcctctgga acttaaacca cggctgaata agttcagaag 16201 gaatgtgatt catctctcgc ccaccctgga acaacaactt tacgtccttt catcgtacct 16261 cgcgggcgcc ggggtcggta atgttgatgc cgtgatctgc agcaacgaag cggatggaat 16321 tgcagatttt ctgcggagtt cactaactga attcgctgtc tcgctacgtt ccgctgtcat 16381 acgcgaagat ggggaagatg ttggtaaata ccttccgatg agtgggaccg tgtttgtcat 16441 tggcctttcc gttccagatg tgaaggaaat cgcgaggaaa cttgaagaac gcaacgattt 16501 gcgtgtgatt gttctcttcg gggaattttc attcctgtat gatctgtttg ccacggccct 16561 caataacacc gcaggcgccg cacgtcttgt gtttgccacc agcttgccac attggggtga 16621 tactgaaacc tcatcgaaga ctgctcaact gttccatgat gttgaaaaag actcacggtt 16681 atggacaccc ctatcagttc tggcattcgc aaccggtcga ttgatgcggg taatacttct 16741 tcatgtagaa gaaatgagtc cagaaacgtt ggtaaatttc ttctataccg attcttccat 16801 cgtctccgat gacatgcgct acggtgtatt cgacgacaca aaatgcgtcg acaccgcaaa 16861 taaactttcg aagaatggtt gtgcctcaaa ctatggtgca acgcagatat cggtgtggtc 16921 tatggcccgt gctctgaacg cttctattcc tccgcttaca aatcccatga caccgtcaat 16981 gactttcaga aactctaacg cgggaaggat ctccggggcg tcgctggtgg gcatcatcat 17041 cggtggtgcc cttgctttgt ttcttgtggt ggctctgggc gtggttccct actttttcct 17101 gcgcaacacc gtgataacaa tctgcaccaa agatgacaga cccgtaacgc taatatttac 17161 tgacattgag agcagtacgg cgttgtgggc tgcacaccct gaggtgatgc ctgatgccgt 17221 tgcgacacat caccgcttaa ttcgcacatt gatttccaag tatgaatgct acgaagtcaa 17281 gactgttgga gattctttta tgatagcgag taagagtcct ttcgctgccg tccaactcgc 17341 acaggaacta cagctgtgtt tcttgcacca tgactgggga acaaatgcga ttgatgaatc 17401 ctaccagcag tttgagcagc agcgtgcgga agatgacagt gattacacac cgccaactgc 17461 tcggttggac cctaaagtgt acagtcgttt gtggaatggc cttcgagtgc gtgttggaat 17521 ccacaccggg ctgtgcgaca tccgacgcga tgaagtaacg aagggatatg actattatgg 17581 acgtacatcc aatatggcag caaggacgga gagtgtagca aatggtggtc aggtgctgat 17641 gacgcatgcg gcgtacatgt cgctgtcagc tgaggagcgt cagcaaattg atgtcactgc 17701 acttggtgat gtgccgctac gtggtgtgcc aaagcctgtg gaaatgtacc ggttgaatgc 17761 cgtgcctggt cgtacctttt ccgttctccg tcttgagctt gagcttctta acgatgatga 17821 ggatcagact acaacttcct gtagtgacca cagttcttca cggacagatc tgagtgtagc 17881 tgcacaaact attgctgctt cccttcagtc gctcttgggt accttcaccc ctgctcagcg 17941 ccagaaggca ttaataccct tttgcgaacg gtggcgtgtt ccgcttcctc aaaaggtcgg 18001 gaacgtgtgg gacgatgacg gttgtcagga agttgtacgg cgtgttgctg caaaggtggg 18061 ccgtgttatg gattttggaa cgaggaagcc ctccagcagt gtgacatcgt ggaaaggggt 18121 ggaagtgtct tctcaggtgg aggagcggct gctataatgc ttgcttcatc ctcaagttcc 18181 tcctgcgttg atgggcattg tgttaccgta caattgattg acttggaaaa tgacagcgct 18241 accacctaat attaccctca tatcattgct gtatttgcgc tctgctattg aagtgaatga 18301 atgagttatt ttatatgatg tgtgcaatct tttctttata ttcaattcgt ataaagtgga 18361 cattataact ttccctcccc attcttgtgc ctgaaatgat cttctcaaac ggtaaatgct 18421 ttaaatacat ttgtacatga agatggtttt tgtttctgac atggctactc tccttttgag 18481 aactgtgttt gaagattgta cattttcttg tgtttgatgt gcctttgaat gtgagtatcc 18541 tcgccacttt tgataatctc cctcgtttaa ataatatatt tattgactga ataagctt // LOCUS JS3PRSBS 81 bp ds-DNA BCT 17-JUL-1990 DEFINITION Plasmid pJS37 promoter region DNA. ACCESSION M34567 KEYWORDS . SOURCE Plasmid pJS37 DNA, from B.subtilis (strain MB11). ORGANISM Plasmid pJS37 Prokaryota; Bacteria. REFERENCE 1 (bases 1 to 81) AUTHORS Ballester,S., Alonso,J.C., Lopez,P. and Espinosa,M. TITLE Comparative expression of the pC194 cat gene in Streptococcus pneumoniae, Bacillus subtilis and Escherichia coli JOURNAL Gene 86, 71-79 (1990) STANDARD simple staff_review FEATURES from to/span description pept 61 81 leader peptide (put.) mRNA 37 > 81 cat mRNA mRNA 64 > 81 cat mRNA binding 46 51 ribosomal binding site (put.) signal 34 62 promoter region 1 signal 5 34 promoter region 2 signal 57 62 P1 -10 region signal 34 39 P1 -35 region signal 29 34 P2 -10 region signal 5 10 P2 -35 region BASE COUNT 36 a 5 c 11 g 29 t ORIGIN 1 atttttaaaa tatatattta tgttacagta atattgactt ttaaaaaagg attgattcta 61 atgaagaaag cagacaagta a // LOCUS JS3PRSEC 62 bp ds-DNA BCT 17-JUL-1990 DEFINITION Plasmid pJS37 promoter region DNA. ACCESSION M34566 KEYWORDS . SOURCE Plasmid pJS37 DNA, from E.coli (strain C600). ORGANISM Plasmid pJS37 Prokaryota; Bacteria. REFERENCE 1 (bases 1 to 62) AUTHORS Ballester,S., Alonso,J.C., Lopez,P. and Espinosa,M. TITLE Comparative expression of the pC194 cat gene in Streptococcus pneumoniae, Bacillus subtilis and Escherichia coli JOURNAL Gene 86, 71-79 (1990) STANDARD simple staff_review FEATURES from to/span description mRNA 52 > 62 cat mRNA signal 21 43 promoter region PT signal 43 48 PT -10 region signal 21 26 PT -35 region BASE COUNT 23 a 3 c 14 g 22 t ORIGIN 1 ccgatgatga agaaaagaat ttgaaactta gtttatatgt ggtaaaatgt tttaattggg 61 at // LOCUS JS3PRSSP 109 bp ds-DNA BCT 17-JUL-1990 DEFINITION Plasmid pJS37 promoter region DNA. ACCESSION M34565 KEYWORDS . SOURCE Plasmid pJS37 DNA, from S.pneumoniae (strain 708). ORGANISM Plasmid pJS37 Prokaryota; Bacteria. REFERENCE 1 (bases 1 to 109) AUTHORS Ballester,S., Alonso,J.C., Lopez,P. and Espinosa,M. TITLE Comparative expression of the pC194 cat gene in Streptococcus pneumoniae, Bacillus subtilis and Escherichia coli JOURNAL Gene 86, 71-79 (1990) STANDARD simple staff_review FEATURES from to/span description pept 33 53 leader peptide (put.) signal < 1 6 promoter region 1 signal 6 34 promoter region 2 binding 18 23 ribosomal binding site (put.) BASE COUNT 47 a 13 c 16 g 33 t ORIGIN 1 taatattgac ttttaaaaaa ggattgattc taatgaagaa agcagacaag taagcctcct 61 aaattcactt tagataaaaa tttaggaggc atatcaaatg aactttaat // LOCUS HUMTNC1 1085 bp ds-DNA PRI 17-JUL-1990 DEFINITION Human fast skeletal muscle troponin C gene, exon 1. ACCESSION M33771 J05502 KEYWORDS troponin C. SEGMENT 1 of 2 SOURCE Human HeLa cell DNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 1085) AUTHORS Gahlmann,R. and Kedes,L. TITLE Cloning, structural analysis and expression of the human fast-twitch skeletal muscle troponin C gene JOURNAL J. Biol. Chem. (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by R.Gahlmann, 20-APR-1990. FEATURES from to/span description pept 1033 + 1035 troponin C, exon 1 pre-msg 968 > 1085 troponin C mRNA and introns (alt.) pre-msg 969 > 1085 troponin C mRNA and introns (alt.) IVS 1036 > 1085 troponin C intron A BASE COUNT 266 a 276 c 306 g 237 t ORIGIN 1 bp upstream of BamHI site. 1 ggatcccctg agcctaggag ttcgagacca gcctgggcaa catggtgaaa ccccatctct 61 ataaaaatac aaaaaaaaaa aaagccaggg gtggtggtgc acctctagtc ccagctactc 121 gggaggctga agtgggagga tcacttgagc ccaggagata gaggctgcag tgagctataa 181 tagcaccact gcacatcagc cttggtgaca gagtgagtat agacctttat aatttacaaa 241 gctctgcagt caagtgggga taatagttgc tacctcatag gaggaacact agatgagatc 301 atgttaaaat gcttagcaca gtgtgtggca cgtcatgggg ttcagtaaac agtggcctac 361 ctgtcacttt ctcctcctat cccctgcccc ttaccatggt tcaaagacgg gacattcctc 421 ttgccaagca ccaacgctgt gtacccagca caagctaggt gacttatacc cgggttccca 481 tttaccatgg tctctctgtg agggagaccc cacagacagg tccaatccca cagatgaaga 541 aatggctcag agaggcactg ctaccggccc aaggtcacac aactcgcaga ggctggcctt 601 ggatcttggg ttttggactt caaggcttgt actctcccca ttttaccatg ctgcctcacc 661 ctttactact ctgatcatcc ttaccatctt ggagccagtg tgcttcctcg gtcgggggga 721 ctccaaggca gtggaacagg gcaggccttc ctcacaagaa tctaggacgt caaggcctgc 781 cacctgcttg gaggcttaaa tttctctgca agggcccttg gctaaattag gtaatgggtt 841 cagactgtgg gaggggtggg actcgctgac cccaggatct gattgggcag ggtctccagt 901 gctggggagc agggaggtgg gaggggaggg tgcccctaca aatcccgggg gctagagcag 961 gccaggtcat ctttgggtgg tggagtgcaa aggaggcgac ctgcaacaga ggagtcccgg 1021 tcaccagcaa ccatggtaag gacaagaggg gactttgtgc ctttactcac gggagcactg 1081 tggac // LOCUS HUMTNC2 1773 bp ds-DNA PRI 17-JUL-1990 DEFINITION Human fast skeletal muscle troponin C gene, exons 2-6. ACCESSION M33772 J05502 KEYWORDS troponin C. SEGMENT 2 of 2 SOURCE Human HeLa cell DNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 1773) AUTHORS Gahlmann,R. and Kedes,L. TITLE Cloning, structural analysis and expression of the human fast-twitch skeletal muscle troponin C gene JOURNAL J. Biol. Chem. (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by R.Gahlmann, 20-APR-1990. FEATURES from to/span description pept + 51 102 troponin C, exon 2 247 390 troponin C, exon 3 479 593 troponin C, exon 4 758 894 troponin C, exon 5 1506 1537 troponin C, exon 6 pre-msg < 1 1668 troponin C mRNA and introns IVS < 1 50 troponin C intron A IVS 103 246 troponin C intron B IVS 391 478 troponin C intron C IVS 594 757 troponin C intron D IVS 895 1505 troponin C intron E BASE COUNT 358 a 455 c 628 g 331 t 1 others ORIGIN About 2.5 kb after segment 1. 1 ctgctctggg gctgcctctt cagctcagac tttctctgtg tctcctgcag acggaccagc 61 aggctgaggc caggtcctac ctcagcgaag agatgatcgc tggtgagtga ggcgcggcag 121 gctgggggct gggtggatgt atgctgggga catgttggcc tctaggcagg gtgggggttg 181 ggggggggta ggtgtgaggc tgacagtcca gcaggcctca cctctgccct ctgccctccc 241 tcgcagagtt caaggctgcc tttgacatgt ttgatgctga tggtggtggg gacatcagcg 301 tcaaggagtt gggcacggtg atgaggatgc tgggccagac acccaccaag gaggagctgg 361 acgccatcat cgaggaggtg gatgaggacg gtgagcgggt gtccctcgga ggcaggggat 421 ggtggggaga ggcggcagcg gctgggctca ggctcagtct gccacctgct gcccgcaggc 481 agcggcacca tcgacttcga ggagttcttg gtcatgatgg tgcgccagat gaaagaggac 541 gcgaaaggga agagcgagga ggagctggcc gagtgcttcc gcatcttcga caggtgcgct 601 gggggcccgg gagccgaggg aggggcttag cagtcagaac ccggccggca gcgctggaag 661 cttccccgtg ttgggagtgc ggaagcgggg cgtcgagtgt ggtggccagg gcagctcgcc 721 ccggccctga gccctgccct gtccctcgga cccgaaggaa tgcagacggc tacatcgacc 781 cggaggagct ggctgagatt ttcagggcct ccggggagca cgtgacggac gaggagatcg 841 aatctctgat gaaagacggc gacaagaaca acgacggccg cattgacttc gacggtgagg 901 gccaacggga gcttgggaag agcgggtggg agccccagag gacggcgggc actggtgcct 961 ggctccggtg cagtgtcctt ggccttgtgg gctgggctgg gcgtaacttc accaagttcc 1021 ccatagtctc ctgggggcag gtatagaagc gctccacgac tctgtgcagt ttatcaatta 1081 atgaccaggc aggggggcgt ggtggctcag gactgtaaaa tcctagaact ttgggaggat 1141 gaggtgggag gattgcttga ggccaagagt tcgagaccag cctgggcaac atagtgagac 1201 cctgtcttta caaaaaataa aataaaataa aaaggcaggt gtggtggcac acctgtggtc 1261 ccagttgctc cgggggctga ggtaagagga tcgcttgagc ctaggagatc aaggctgcaa 1321 tgagttatga ttgtgccact gcactctacc ctgggtgaca gaccttggcg accttggcaa 1381 cagagtgaga ctctgtctca acaaaaacaa aaacaaaaag tcatcccctc tgtgtggctg 1441 gagccctggc ctggcctctg cgtccccagt gggacccctg accctctccc tctcctttcc 1501 ctcagagttc ctgaagatga tggagggcgt gcagtaagga gtggacagtc gcctctacca 1561 agatcgcgtg tccctagggt gtgggagact ccgccctgcc gggtctccac cagggaggcg 1621 cggccccttg tgggtctttg tctggaagga ataaaagcaa atgttccaaa actcgtggcc 1681 tgaatgaggg nagagccgag gacagcaggt cgcaagggct gtggtccagg gcgccatccc 1741 ggctggcacc acccggagca cccttgggga tcc // LOCUS HUMMHDQ3L 967 bp ds-DNA PRI 17-JUL-1990 DEFINITION Human MHC class II HAL-DQ-LTR3 (DQ,w8) DNA fragment, long terminal repeat region. ACCESSION M33841 KEYWORDS major histocompatibility complex. SOURCE Human (pot. haplotype DQ,w8) lung carcinoma DNA, clone LC14. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 967) AUTHORS Kambhu,S., Falldorf,P. and Lee,J.S. TITLE Endogenous retroviral long terminal repeats (LTR) within the HLA DQ locus JOURNAL Proc. Natl. Acad. Sci. U.S.A. (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by J.S.Lee, 25-APR-1990. FEATURES from to/span description rpt 1 7 inverted repeat A rpt 961 967 inverted repeat B rpt 560 810 R region signal 792 797 poly-A signal site 74 79 1/2 GRE site 80 87 enhancer core site 530 536 TATA box BASE COUNT 254 a 235 c 222 g 256 t ORIGIN Chromosome 6p21.3. 1 tgtggggaaa agcaagagag atcagattgt tactgtgtct gtgtagaaag aagtagacat 61 agagactcca ttttgttatg tactaagaga aattcttctg ccttgagatt ctgttaatct 121 ataaccttac ccccaacccc gtgctctctg aaacatgtgc tgtgtcaact cagagttgaa 181 tggattaagg gcggtgcaag atgtgctttg ttaaacagat gcttgaaggc agcatgctcc 241 ttaagagtca tcaccactcc ctaatctcaa gtacccaggg acacaaaaac tgcggaaggc 301 cgcagggacc tctgcctagg aaagccaggt attgtccaag gtttctcccc atgtgagagt 361 ctgaaatatg gcctcgtggg aagggaaaga cctgaccatc ccccagcccg acacccgtaa 421 agggtctgtg ctgaggagga ttagtaaaag aggaaggaat gcctctttca gttgagacaa 481 gaggaaggca tctgtctcct gcctgtccct gggcaatgga atgtctctgt ataaaacccg 541 attgtatgct ccatctactg agatagggaa aaactgcctt agggctggag gtgggacctg 601 cgggcagcaa tactgctttg taaagcattg agatgtttat gtgtatgcat atctaaaagc 661 acagcactta atcctttaca ttgtctatga tgcaaagacc tttgttcaca tgtttgtctg 721 ctgaccctct ccccacaatt gtcttgtgac cctgacacat ccccctcttc gagaaacacc 781 cacaaatgat caataaatac taagggaact cagaggctgg cgggatcctc catatgctga 841 acgctggttc cccgggtccc cttatttctt tctctatact ttgtctctgt gtctttttct 901 ttcctaagtc tctcgttcca ccttacgaga aacacccaca ggtgtggagg ggcaacccac 961 ccctaca // LOCUS HUMMHDQ5L 960 bp ds-DNA PRI 17-JUL-1990 DEFINITION Human MHC class II HAL-DQ-LTR5 (DQ,w8) DNA fragment, long terminal repeat region. ACCESSION M33842 KEYWORDS major histocompatibility complex. SOURCE Human (pot. haplotype DQ,w8) lung carcinoma DNA, clone LC14. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 960) AUTHORS Kambhu,S., Falldorf,P. and Lee,J.S. TITLE Endogenous retroviral long terminal repeats (LTR) within the HLA DQ locus JOURNAL Proc. Natl. Acad. Sci. U.S.A. (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by J.S.Lee, 25-APR-1990. FEATURES from to/span description rpt 1 7 inverted repeat A rpt 954 960 inverted repeat B rpt 555 803 R region signal 785 790 poly-A signal site 74 79 1/2 GRE site 80 87 enhancer core site 525 531 TATA box BASE COUNT 250 a 232 c 219 g 259 t ORIGIN Chromosome 6p21.3. 1 tgtggggaaa agaaagagag atcagattgt tactgtgtct gtgtagaaag aagtagacat 61 agagactcca ttttgttctg tactaagaca aattcttctg ccttgggatg ctgttaatct 121 ataaccttac ccccaaccct gtgctctctg aaacatgtgc tgtgtcaact cggggttaaa 181 tggattaagg gcggtgcaag atgtgctttg ttaaacagat gcttgaaggc agcatgctcc 241 ttaagagtca tcaccactcc ctaatctcaa gtacccaggg acacaaacag aaggccgcag 301 ggacctctgc ctaggaaagc caggtattgt ccaaggtttc tccccatgtg acagtctgaa 361 atatggcctc gtgggaaggg aaagacctga ccgtccccca gcctgacacc cgtaaagggt 421 ctgtgctgag gaggattagt ataagaggaa ggcatgcctc ttgcagttga gacaagagaa 481 aggcatctct ctcctgtccg tccctgggca atggaatgtc tcggtataaa acccgattgt 541 atgttccatc tactgagata aggaaaaccg ccttagggct ggaggtggga catgtgggca 601 acaatactgc tctgtaaggc attgagatgt ttatgtgtat gcatatctaa agcacagcac 661 ttaatccttt accttgtcta tgatgcagag agctttgttc acgtgtttat ctgctgacct 721 tctctccact attatcttat gaccctgcca catccccctc tctgagaaac acccaaaaat 781 gatcaataaa tactaaggga actcagaggc tagcgggatc ctccatatgc tgaatgctgg 841 tcccctgggc ccccttattt ctttctctat actttgtctc tgtgtctttt tcttttctaa 901 gtctctcatt ccacctaacg agaaacaccc acaggtgtgg aggggcaacc caccccttca // LOCUS SYNLACZA 6476 bp ds-DNA SYN 17-JUL-1990 DEFINITION Cloning vector pPD1.27. ACCESSION M34296 KEYWORDS lacZ. SOURCE Cloning vector pPD1.27. ORGANISM Cloning vector Artificial sequences; Cloning vehicles. REFERENCE 1 (bases 1 to 6476) AUTHORS Fire,A.Z., Harrison,S. and Dixon,D. TITLE A modular set of lac-Z fusion vectors for studying gene expression in C.elegans JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by A.Z.Fire, 11-MAY-1990. Author address: A.Z.Fire Carnegie Inst of Washington Dept Embryology 115 West Univ Parkway Baltimore, MD 21210 email: AZF@JHUIGF.BITNET FEATURES from to/span description recomb 57 58 pUC19 end/synthetic start recomb 102 103 synthetic end/E.coli trpS start recomb 186 187 E.coli trpS end/synthetic start recomb 190 191 synthetic end/E.coli lacZ start recomb 3500 3501 E.coli lacZ end/SV40 start recomb 3634 3635 SV40 end/synthetic start recomb 3657 3658 synthetic end/pUC19 start recomb 4076 4077 pUC19 end/C.elegans sup-7 start recomb 4442 4443 C.elegans sup-7 end/pUC19 start BASE COUNT 1560 a 1646 c 1676 g 1594 t ORIGIN 1 atgaccatga ttacgccaag cttgcatgcc tgcaggtcga ctctagagga tccccgggta 61 ccgagctcag aaaaaatgac tgctccaaag aagaagcgta aggtaccggt gggtgaagac 121 cagaaacagc acctcgaact gagccgcgat attgcccagc gtttcaacgc gctgtatggc 181 gagatcgatc ccgtcgtttt acaacgtcgt gactgggaaa accctggcgt tacccaactt 241 aatcgccttg cagcacatcc ccctttcgcc agctggcgta atagcgaaga ggcccgcacc 301 gatcgccctt cccaacagtt gcgcagcctg aatggcgaat ggcgctttgc ctggtttccg 361 gcaccagaag cggtgccgga aagctggctg gagtgcgatc ttcctgaggc cgatactgtc 421 gtcgtcccct caaactggca gatgcacggt tacgatgcgc ccatctacac caacgtaacc 481 tatcccatta cggtcaatcc gccgtttgtt cccacggaga atccgacggg ttgttactcg 541 ctcacattta atgttgatga aagctggcta caggaaggcc agacgcgaat tatttttgat 601 ggcgttaact cggcgtttca tctgtggtgc aacgggcgct gggtcggtta cggccaggac 661 agtcgtttgc cgtctgaatt tgacctgagc gcatttttac gcgccggaga aaaccgcctc 721 gcggtgatgg tgctgcgttg gagtgacggc agttatctgg aagatcagga tatgtggcgg 781 atgagcggca ttttccgtga cgtctcgttg ctgcataaac cgactacaca aatcagcgat 841 ttccatgttg ccactcgctt taatgatgat ttcagccgcg ctgtactgga ggctgaagtt 901 cagatgtgcg gcgagttgcg tgactaccta cgggtaacag tttctttatg gcagggtgaa 961 acgcaggtcg ccagcggcac cgcgcctttc ggcggtgaaa ttatcgatga gcgtggtggt 1021 tatgccgatc gcgtcacact acgtctgaac gtcgaaaacc cgaaactgtg gagcgccgaa 1081 atcccgaatc tctatcgtgc ggtggttgaa ctgcacaccg ccgacggcac gctgattgaa 1141 gcagaagcct gcgatgtcgg tttccgcgag gtgcggattg aaaatggtct gctgctgctg 1201 aacggcaagc cgttgctgat tcgaggcgtt aaccgtcacg agcatcatcc tctgcatggt 1261 caggtcatgg atgagcagac gatggtgcag gatatcctgc tgatgaagca gaacaacttt 1321 aacgccgtgc gctgttcgca ttatccgaac catccgctgt ggtacacgct gtgcgaccgc 1381 tacggcctgt atgtggtgga tgaagccaat attgaaaccc acggcatggt gccaatgaat 1441 cgtctgaccg atgatccgcg ctggctaccg gcgatgagcg aacgcgtaac gcgaatggtg 1501 cagcgcgatc gtaatcaccc gagtgtgatc atctggtcgc tggggaatga atcaggccac 1561 ggcgctaatc acgacgcgct gtatcgctgg atcaaatctg tcgatccttc ccgcccggtg 1621 cagtatgaag gcggcggagc cgacaccacg gccaccgata ttatttgccc gatgtacgcg 1681 cgcgtggatg aagaccagcc cttcccggct gtgccgaaat ggtccatcaa aaaatggctt 1741 tcgctacctg gagagacgcg cccgctgatc ctttgcgaat acgcccacgc gatgggtaac 1801 agtcttggcg gtttcgctaa atactggcag gcgtttcgtc agtatccccg tttacagggc 1861 ggcttcgtct gggactgggt ggatcagtcg ctgattaaat atgatgaaaa cggcaacccg 1921 tggtcggctt acggcggtga ttttggcgat acgccgaacg atcgccagtt ctgtatgaac 1981 ggtctggtct ttgccgaccg cacgccgcat ccagcgctga cggaagcaaa acaccagcag 2041 cagtttttcc agttccgttt atccgggcaa accatcgaag tgaccagcga atacctgttc 2101 cgtcatagcg ataacgagct cctgcactgg atggtggcgc tggatggtaa gccgctggca 2161 agcggtgaag tgcctctgga tgtcgctcca caaggtaaac agttgattga actgcctgaa 2221 ctaccgcagc cggagagcgc cgggcaactc tggctcacag tacgcgtagt gcaaccgaac 2281 gcgaccgcat ggtcagaagc cgggcacatc agcgcctggc agcagtggcg tctggcggaa 2341 aacctcagtg tgacgctccc cgccgcgtcc cacgccatcc cgcatctgac caccagcgaa 2401 atggattttt gcatcgagct gggtaataag cgttggcaat ttaaccgcca gtcaggcttt 2461 ctttcacaga tgtggattgg cgataaaaaa caactgctga cgccgctgcg cgatcagttc 2521 acccgtgcac cgctggataa cgacattggc gtaagtgaag cgacccgcat tgaccctaac 2581 gcctgggtcg aacgctggaa ggcggcgggc cattaccagg ccgaagcagc gttgttgcag 2641 tgcacggcag atacacttgc tgatgcggtg ctgattacga ccgctcacgc gtggcagcat 2701 caggggaaaa ccttatttat cagccggaaa acctaccgga ttgatggtag tggtcaaatg 2761 gcgattaccg ttgatgttga agtggcgagc gatacaccgc atccggcgcg gattggcctg 2821 aactgccagc tggcgcaggt agcagagcgg gtaaactggc tcggattagg gccgcaagaa 2881 aactatcccg accgccttac tgccgcctgt tttgaccgct gggatctgcc attgtcagac 2941 atgtataccc cgtacgtctt cccgagcgaa aacggtctgc gctgcgggac gcgcgaattg 3001 aattatggcc cacaccagtg gcgcggcgac ttccagttca acatcagccg ctacagtcaa 3061 cagcaactga tggaaaccag ccatcgccat ctgctgcacg cggaagaagg cacatggctg 3121 aatatcgacg gtttccatat ggggattggt ggcgacgact cctggagccc gtcagtatcg 3181 gcggaattcc agctgagcgc cggtcgctac cattaccagt tggtctggtg tcaaaaataa 3241 taataaccgg gcaggccatg tctgcccgta tttcgcgtaa ggaaatccat tatgtactat 3301 ttaaaaaaca caaacttttg gatgttcggt ttattctttt tcttttactt ttttatcatg 3361 ggagcctact tcccgttttt cccgatttgg ctacatgaca tcaaccatat cagcaaaagt 3421 gatacgggta ttatttttgc cgctatttct ctgttctcgc tattattcca accgctgttt 3481 ggtctgcttt ctgacaaact cggaacttgt ttattgcagc ttataatggt tacaaataaa 3541 gcaatagcat cacaaatttc acaaataaag catttttttc actgcattct agttgtggtt 3601 tgtccaaact catcaatgta tcttatcatg tctggatcga caaagtcaaa gcggccgcct 3661 gatgcggtat tttctcctta cgcatctgtg cggtatttca caccgcatat ggtgcactct 3721 cagtacaatc tgctctgatg ccgcatagtt aagccagccc cgacacccgc caacacccgc 3781 tgacgcgccc tgacgggctt gtctgctccc ggcatccgct tacagacaag ctgtgaccgt 3841 ctccgggagc tgcatgtgtc agaggttttc accgtcatca ccgaaacgcg cgagacgaaa 3901 gggcctcgtg atacgcctat ttttataggt taatgtcatg ataataatgg tttcttagac 3961 gtcaggtggc acttttcggg gaaatgtgcg cggaacccct atttgtttat ttttctaaat 4021 acattcaaat atgtatccgc tcatgagaca ataaccctga taaatgcttc aataatacaa 4081 ttttcagaat acgttttttg tgggcttggg tatattgttt ttaatgttat acttgcagtc 4141 gtgaaatttg attttcaaat ttgtagaaaa atcaagaaaa taattgcaac attcgcttgt 4201 gtcaaaaacc aatttcaaca aattttcgtg tgagaaatac attaccagaa ggcatttttt 4261 cacacgatta gcattttgga ctactttatt aaatttttgc gtgtaatttt gaattaaatt 4321 gtattatatt actacttaaa aaacaaaaaa tttgaccact gagcggatcg aacgcccaac 4381 ctttcgatct agagtcgaac gcgctaccat tgcgccaagc agtcatgtta ttctctcttg 4441 tcattgaaaa aggaagagta tgagtattca acatttccgt gtcgccctta ttcccttttt 4501 tgcggcattt tgccttcctg tttttgctca cccagaaacg ctggtgaaag taaaagatgc 4561 tgaagatcag ttgggtgcac gagtgggtta catcgaactg gatctcaaca gcggtaagat 4621 ccttgagagt tttcgccccg aagaacgttt tccaatgatg agcactttta aagttctgct 4681 atgtggcgcg gtattatccc gtattgacgc cgggcaagag caactcggtc gccgcataca 4741 ctattctcag aatgacttgg ttgagtactc accagtcaca gaaaagcatc ttacggatgg 4801 catgacagta agagaattat gcagtgctgc cataaccatg agtgataaca ctgcggccaa 4861 cttacttctg acaacgatcg gaggaccgaa ggagctaacc gcttttttgc acaacatggg 4921 ggatcatgta actcgccttg atcgttggga accggagctg aatgaagcca taccaaacga 4981 cgagcgtgac accacgatgc ctgtagcaat ggcaacaacg ttgcgcaaac tattaactgg 5041 cgaactactt actctagctt cccggcaaca attaatagac tggatggagg cggataaagt 5101 tgcaggacca cttctgcgct cggcccttcc ggctggctgg tttattgctg ataaatctgg 5161 agccggtgag cgtgggtctc gcggtatcat tgcagcactg gggccagatg gtaagccctc 5221 ccgtatcgta gttatctaca cgacggggag tcaggcaact atggatgaac gaaatagaca 5281 gatcgctgag ataggtgcct cactgattaa gcattggtaa ctgtcagacc aagtttactc 5341 atatatactt tagattgatt taaaacttca tttttaattt aaaaggatct aggtgaagat 5401 cctttttgat aatctcatga ccaaaatccc ttaacgtgag ttttcgttcc actgagcgtc 5461 agaccccgta gaaaagatca aaggatcttc ttgagatcct ttttttctgc gcgtaatctg 5521 ctgcttgcaa acaaaaaaac caccgctacc agcggtggtt tgtttgccgg atcaagagct 5581 accaactctt tttccgaagg taactggctt cagcagagcg cagataccaa atactgtcct 5641 tctagtgtag ccgtagttag gccaccactt caagaactct gtagcaccgc ctacatacct 5701 cgctctgcta atcctgttac cagtggctgc tgccagtggc gataagtcgt gtcttaccgg 5761 gttggactca agacgatagt taccggataa ggcgcagcgg tcgggctgaa cggggggttc 5821 gtgcacacag cccagcttgg agcgaacgac ctacaccgaa ctgagatacc tacagcgtga 5881 gctatgagaa agcgccacgc ttcccgaagg gagaaaggcg gacaggtatc cggtaagcgg 5941 cagggtcgga acaggagagc gcacgaggga gcttccaggg ggaaacgcct ggtatcttta 6001 tagtcctgtc gggtttcgcc acctctgact tgagcgtcga tttttgtgat gctcgtcagg 6061 ggggcggagc ctatggaaaa acgccagcaa cgcggccttt ttacggttcc tggccttttg 6121 ctggcctttt gctcacatgt tctttcctgc gttatcccct gattctgtgg ataaccgtat 6181 taccgccttt gagtgagctg ataccgctcg ccgcagccga acgaccgagc gcagcgagtc 6241 agtgagcgag gaagcggaag agcgcccaat acgcaaaccg cctctccccg cgcgttggcc 6301 gattcattaa tgcagctggc acgacaggtt tcccgactgg aaagcgggca gtgagcgcaa 6361 cgcaattaat gtgagttagc tcactcatta ggcaccccag gctttacact ttatgcttcc 6421 ggctcgtatg ttgtgtggaa ttgtgagcgg ataacaattt cacacaggaa acagct // LOCUS SYNLACZB 6096 bp ds-DNA SYN 17-JUL-1990 DEFINITION Cloning vector pPD8.02. ACCESSION M34297 KEYWORDS lacZ. SOURCE Cloning vector pPD8.02. ORGANISM Cloning vector Artificial sequences; Cloning vehicles. REFERENCE 1 (bases 1 to 6096) AUTHORS Fire,A.Z., Harrison,S. and Dixon,D. TITLE A modular set of lac-Z fusion vectors for studying gene expression in C.elegans JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by A.Z.Fire, 11-MAY-1990. Author address: A.Z.Fire Carnegie Inst of Washington Dept Embryology 115 West Univ Parkway Baltimore, MD 21210 email: AZF@JHUIGF.BITNET FEATURES from to/span description recomb 57 58 pUC19 end/synthetic start recomb 102 103 synthetic end/E.coli trpS start recomb 186 187 E.coli trpS end/synthetic start recomb 190 191 synthetic end/E.coli lacZ start recomb 3184 3185 E.coli lacZ end/synthetic start recomb 3275 3276 synthetic end/unknown DNA start recomb 3696 3697 unknown DNA end/C.elegans sup-7 end recomb 4062 4063 C.elegans sup-7 end/pUC19 start BASE COUNT 1458 a 1571 c 1616 g 1451 t ORIGIN 1 atgaccatga ttacgccaag cttgcatgcc tgcaggtcga ctctagagga tccccgggta 61 ccgagctcag aaaaaatgac tgctccaaag aagaagcgta aggtaccggt gggtgaagac 121 cagaaacagc acctcgaact gagccgcgat attgcccagc gtttcaacgc gctgtatggc 181 gagatcgatc ccgtcgtttt acaacgtcgt gactgggaaa accctggcgt tacccaactt 241 aatcgccttg cagcacatcc ccctttcgcc agctggcgta atagcgaaga ggcccgcacc 301 gatcgccctt cccaacagtt gcgcagcctg aatggcgaat ggcgctttgc ctggtttccg 361 gcaccagaag cggtgccgga aagctggctg gagtgcgatc ttcctgaggc cgatactgtc 421 gtcgtcccct caaactggca gatgcacggt tacgatgcgc ccatctacac caacgtaacc 481 tatcccatta cggtcaatcc gccgtttgtt cccacggaga atccgacggg ttgttactcg 541 ctcacattta atgttgatga aagctggcta caggaaggcc agacgcgaat tatttttgat 601 ggcgttaact cggcgtttca tctgtggtgc aacgggcgct gggtcggtta cggccaggac 661 agtcgtttgc cgtctgaatt tgacctgagc gcatttttac gcgccggaga aaaccgcctc 721 gcggtgatgg tgctgcgttg gagtgacggc agttatctgg aagatcagga tatgtggcgg 781 atgagcggca ttttccgtga cgtctcgttg ctgcataaac cgactacaca aatcagcgat 841 ttccatgttg ccactcgctt taatgatgat ttcagccgcg ctgtactgga ggctgaagtt 901 cagatgtgcg gcgagttgcg tgactaccta cgggtaacag tttctttatg gcagggtgaa 961 acgcaggtcg ccagcggcac cgcgcctttc ggcggtgaaa ttatcgatga gcgtggtggt 1021 tatgccgatc gcgtcacact acgtctgaac gtcgaaaacc cgaaactgtg gagcgccgaa 1081 atcccgaatc tctatcgtgc ggtggttgaa ctgcacaccg ccgacggcac gctgattgaa 1141 gcagaagcct gcgatgtcgg tttccgcgag gtgcggattg aaaatggtct gctgctgctg 1201 aacggcaagc cgttgctgat tcgaggcgtt aaccgtcacg agcatcatcc tctgcatggt 1261 caggtcatgg atgagcagac gatggtgcag gatatcctgc tgatgaagca gaacaacttt 1321 aacgccgtgc gctgttcgca ttatccgaac catccgctgt ggtacacgct gtgcgaccgc 1381 tacggcctgt atgtggtgga tgaagccaat attgaaaccc acggcatggt gccaatgaat 1441 cgtctgaccg atgatccgcg ctggctaccg gcgatgagcg aacgcgtaac gcgaatggtg 1501 cagcgcgatc gtaatcaccc gagtgtgatc atctggtcgc tggggaatga atcaggccac 1561 ggcgctaatc acgacgcgct gtatcgctgg atcaaatctg tcgatccttc ccgcccggtg 1621 cagtatgaag gcggcggagc cgacaccacg gccaccgata ttatttgccc gatgtacgcg 1681 cgcgtggatg aagaccagcc cttcccggct gtgccgaaat ggtccatcaa aaaatggctt 1741 tcgctacctg gagagacgcg cccgctgatc ctttgcgaat acgcccacgc gatgggtaac 1801 agtcttggcg gtttcgctaa atactggcag gcgtttcgtc agtatccccg tttacagggc 1861 ggcttcgtct gggactgggt ggatcagtcg ctgattaaat atgatgaaaa cggcaacccg 1921 tggtcggctt acggcggtga ttttggcgat acgccgaacg atcgccagtt ctgtatgaac 1981 ggtctggtct ttgccgaccg cacgccgcat ccagcgctga cggaagcaaa acaccagcag 2041 cagtttttcc agttccgttt atccgggcaa accatcgaag tgaccagcga atacctgttc 2101 cgtcatagcg ataacgagct cctgcactgg atggtggcgc tggatggtaa gccgctggca 2161 agcggtgaag tgcctctgga tgtcgctcca caaggtaaac agttgattga actgcctgaa 2221 ctaccgcagc cggagagcgc cgggcaactc tggctcacag tacgcgtagt gcaaccgaac 2281 gcgaccgcat ggtcagaagc cgggcacatc agcgcctggc agcagtggcg tctggcggaa 2341 aacctcagtg tgacgctccc cgccgcgtcc cacgccatcc cgcatctgac caccagcgaa 2401 atggattttt gcatcgagct gggtaataag cgttggcaat ttaaccgcca gtcaggcttt 2461 ctttcacaga tgtggattgg cgataaaaaa caactgctga cgccgctgcg cgatcagttc 2521 acccgtgcac cgctggataa cgacattggc gtaagtgaag cgacccgcat tgaccctaac 2581 gcctgggtcg aacgctggaa ggcggcgggc cattaccagg ccgaagcagc gttgttgcag 2641 tgcacggcag atacacttgc tgatgcggtg ctgattacga ccgctcacgc gtggcagcat 2701 caggggaaaa ccttatttat cagccggaaa acctaccgga ttgatggtag tggtcaaatg 2761 gcgattaccg ttgatgttga agtggcgagc gatacaccgc atccggcgcg gattggcctg 2821 aactgccagc tggcgcaggt agcagagcgg gtaaactggc tcggattagg gccgcaagaa 2881 aactatcccg accgccttac tgccgcctgt tttgaccgct gggatctgcc attgtcagac 2941 atgtataccc cgtacgtctt cccgagcgaa aacggtctgc gctgcgggac gcgcgaattg 3001 aattatggcc cacaccagtg gcgcggcgac ttccagttca acatcagccg ctacagtcaa 3061 cagcaactga tggaaaccag ccatcgccat ctgctgcacg cggaagaagg cacatggctg 3121 aatatcgacg gtttccatat ggggattggt ggcgacgact cctggagccc gtcagtatcg 3181 gcggaattcc aactgagcgc cggtcgctac cattaccaac ttgtctggtg tcaaaaataa 3241 taggcctact agtcggccgt acgggccctt aaggccgcct gatgcggtat tttctcctta 3301 cgcatctgtg cggtatttca caccgcatat ggtgcactct cagtacaatc tgctctgatg 3361 ccgcatagtt aagccagccc cgacacccgc caacacccgc tgacgcgccc tgacgggctt 3421 gtctgctccc ggcatccgct tacagacaag ctgtgaccgt ctccgggagc tgcatgtgtc 3481 agaggttttc accgtcatca ccgaaacgcg cgagacgaaa gggcctcgtg atacgcctat 3541 ttttataggt taatgtcatg ataataatgg tttcttagac gtcaggtggc acttttcggg 3601 gaaatgtgcg cggaacccct atttgtttat ttttctaaat acattcaaat atgtatccgc 3661 tcatgagaca ataaccctga taaatgcttc aataatacaa ttttcagaat acgttttttg 3721 tgggcttggg tatattgttt ttaatgttat acttgcagtc gtgaaatttg attttcaaat 3781 ttgtagaaaa atcaagaaaa taattgcaac attcgcttgt gtcaaaaacc aatttcaaca 3841 aattttcgtg tgagaaatac attaccagaa ggcatttttt cacacgatta gcattttgga 3901 ctactttatt aaatttttgc gtgtaatttt gaattaaatt gtattatatt actacttaaa 3961 aaacaaaaaa tttgaccact gagcggatcg aacgcccaac ctttcgatct agagtcgaac 4021 gcgctaccat tgcgccaagc agtcatgtta ttctctcttg tcattgaaaa aggaagagta 4081 tgagtattca acatttccgt gtcgccctta ttcccttttt tgcggcattt tgccttcctg 4141 tttttgctca cccagaaacg ctggtgaaag taaaagatgc tgaagatcag ttgggtgcac 4201 gagtgggtta catcgaactg gatctcaaca gcggtaagat ccttgagagt tttcgccccg 4261 aagaacgttt tccaatgatg agcactttta aagttctgct atgtggcgcg gtattatccc 4321 gtattgacgc cgggcaagag caactcggtc gccgcataca ctattctcag aatgacttgg 4381 ttgagtactc accagtcaca gaaaagcatc ttacggatgg catgacagta agagaattat 4441 gcagtgctgc cataaccatg agtgataaca ctgcggccaa cttacttctg acaacgatcg 4501 gaggaccgaa ggagctaacc gcttttttgc acaacatggg ggatcatgta actcgccttg 4561 atcgttggga accggagctg aatgaagcca taccaaacga cgagcgtgac accacgatgc 4621 ctgtagcaat ggcaacaacg ttgcgcaaac tattaactgg cgaactactt actctagctt 4681 cccggcaaca attaatagac tggatggagg cggataaagt tgcaggacca cttctgcgct 4741 cggcccttcc ggctggctgg tttattgctg ataaatctgg agccggtgag cgtgggtctc 4801 gcggtatcat tgcagcactg gggccagatg gtaagccctc ccgtatcgta gttatctaca 4861 cgacggggag tcaggcaact atggatgaac gaaatagaca gatcgctgag ataggtgcct 4921 cactgattaa gcattggtaa ctgtcagacc aagtttactc atatatactt tagattgatt 4981 taaaacttca tttttaattt aaaaggatct aggtgaagat cctttttgat aatctcatga 5041 ccaaaatccc ttaacgtgag ttttcgttcc actgagcgtc agaccccgta gaaaagatca 5101 aaggatcttc ttgagatcct ttttttctgc gcgtaatctg ctgcttgcaa acaaaaaaac 5161 caccgctacc agcggtggtt tgtttgccgg atcaagagct accaactctt tttccgaagg 5221 taactggctt cagcagagcg cagataccaa atactgtcct tctagtgtag ccgtagttag 5281 gccaccactt caagaactct gtagcaccgc ctacatacct cgctctgcta atcctgttac 5341 cagtggctgc tgccagtggc gataagtcgt gtcttaccgg gttggactca agacgatagt 5401 taccggataa ggcgcagcgg tcgggctgaa cggggggttc gtgcacacag cccagcttgg 5461 agcgaacgac ctacaccgaa ctgagatacc tacagcgtga gctatgagaa agcgccacgc 5521 ttcccgaagg gagaaaggcg gacaggtatc cggtaagcgg cagggtcgga acaggagagc 5581 gcacgaggga gcttccaggg ggaaacgcct ggtatcttta tagtcctgtc gggtttcgcc 5641 acctctgact tgagcgtcga tttttgtgat gctcgtcagg ggggcggagc ctatggaaaa 5701 acgccagcaa cgcggccttt ttacggttcc tggccttttg ctggcctttt gctcacatgt 5761 tctttcctgc gttatcccct gattctgtgg ataaccgtat taccgccttt gagtgagctg 5821 ataccgctcg ccgcagccga acgaccgagc gcagcgagtc agtgagcgag gaagcggaag 5881 agcgcccaat acgcaaaccg cctctccccg cgcgttggcc gattcattaa tgcagctggc 5941 acgacaggtt tcccgactgg aaagcgggca gtgagcgcaa cgcaattaat gtgagttagc 6001 tcactcatta ggcaccccag gctttacact ttatgcttcc ggctcgtatg ttgtgtggaa 6061 ttgtgagcgg ataacaattt cacacaggaa acagct // LOCUS SYNLACZC 7376 bp ds-DNA SYN 17-JUL-1990 DEFINITION Cloning vector pPD8.33. ACCESSION M34298 KEYWORDS lacZ. SOURCE Cloning vector pPD8.33. ORGANISM Cloning vector Artificial sequences; Cloning vehicles. REFERENCE 1 (bases 1 to 7376) AUTHORS Fire,A.Z., Harrison,S. and Dixon,D. TITLE A modular set of lac-Z fusion vectors for studying gene expression in C.elegans JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by A.Z.Fire, 11-MAY-1990. Author address: A.Z.Fire Carnegie Inst of Washington Dept Embryology 115 West Univ Parkway Baltimore, MD 21210 email: AZF@JHUIGF.BITNET FEATURES from to/span description recomb 57 58 pUC19 end/synthetic start recomb 102 103 synthetic end/E.coli trpS start recomb 186 187 E.coli trpS end/synthetic start recomb 190 191 synthetic end/E.coli lacZ start recomb 3184 3185 E.coli lacZ end/synthetic start recomb 3244 3245 synthetic end/unknown DNA start recomb 3524 3425 unknown DNA end/C.elegans sup-7 end recomb 4555 4556 synthetic end/pUC19 start recomb 4976 4977 pUC19 end/C.elegans sup-7 start recomb 5342 5343 C.elegans sup-7 end/pUC19 start BASE COUNT 1863 a 1793 c 1814 g 1906 t ORIGIN 1 atgaccatga ttacgccaag cttgcatgcc tgcaggtcga ctctagagga tccccgggta 61 ccgagctcag aaaaaatgac tgctccaaag aagaagcgta aggtaccggt gggtgaagac 121 cagaaacagc acctcgaact gagccgcgat attgcccagc gtttcaacgc gctgtatggc 181 gagatcgatc ccgtcgtttt acaacgtcgt gactgggaaa accctggcgt tacccaactt 241 aatcgccttg cagcacatcc ccctttcgcc agctggcgta atagcgaaga ggcccgcacc 301 gatcgccctt cccaacagtt gcgcagcctg aatggcgaat ggcgctttgc ctggtttccg 361 gcaccagaag cggtgccgga aagctggctg gagtgcgatc ttcctgaggc cgatactgtc 421 gtcgtcccct caaactggca gatgcacggt tacgatgcgc ccatctacac caacgtaacc 481 tatcccatta cggtcaatcc gccgtttgtt cccacggaga atccgacggg ttgttactcg 541 ctcacattta atgttgatga aagctggcta caggaaggcc agacgcgaat tatttttgat 601 ggcgttaact cggcgtttca tctgtggtgc aacgggcgct gggtcggtta cggccaggac 661 agtcgtttgc cgtctgaatt tgacctgagc gcatttttac gcgccggaga aaaccgcctc 721 gcggtgatgg tgctgcgttg gagtgacggc agttatctgg aagatcagga tatgtggcgg 781 atgagcggca ttttccgtga cgtctcgttg ctgcataaac cgactacaca aatcagcgat 841 ttccatgttg ccactcgctt taatgatgat ttcagccgcg ctgtactgga ggctgaagtt 901 cagatgtgcg gcgagttgcg tgactaccta cgggtaacag tttctttatg gcagggtgaa 961 acgcaggtcg ccagcggcac cgcgcctttc ggcggtgaaa ttatcgatga gcgtggtggt 1021 tatgccgatc gcgtcacact acgtctgaac gtcgaaaacc cgaaactgtg gagcgccgaa 1081 atcccgaatc tctatcgtgc ggtggttgaa ctgcacaccg ccgacggcac gctgattgaa 1141 gcagaagcct gcgatgtcgg tttccgcgag gtgcggattg aaaatggtct gctgctgctg 1201 aacggcaagc cgttgctgat tcgaggcgtt aaccgtcacg agcatcatcc tctgcatggt 1261 caggtcatgg atgagcagac gatggtgcag gatatcctgc tgatgaagca gaacaacttt 1321 aacgccgtgc gctgttcgca ttatccgaac catccgctgt ggtacacgct gtgcgaccgc 1381 tacggcctgt atgtggtgga tgaagccaat attgaaaccc acggcatggt gccaatgaat 1441 cgtctgaccg atgatccgcg ctggctaccg gcgatgagcg aacgcgtaac gcgaatggtg 1501 cagcgcgatc gtaatcaccc gagtgtgatc atctggtcgc tggggaatga atcaggccac 1561 ggcgctaatc acgacgcgct gtatcgctgg atcaaatctg tcgatccttc ccgcccggtg 1621 cagtatgaag gcggcggagc cgacaccacg gccaccgata ttatttgccc gatgtacgcg 1681 cgcgtggatg aagaccagcc cttcccggct gtgccgaaat ggtccatcaa aaaatggctt 1741 tcgctacctg gagagacgcg cccgctgatc ctttgcgaat acgcccacgc gatgggtaac 1801 agtcttggcg gtttcgctaa atactggcag gcgtttcgtc agtatccccg tttacagggc 1861 ggcttcgtct gggactgggt ggatcagtcg ctgattaaat atgatgaaaa cggcaacccg 1921 tggtcggctt acggcggtga ttttggcgat acgccgaacg atcgccagtt ctgtatgaac 1981 ggtctggtct ttgccgaccg cacgccgcat ccagcgctga cggaagcaaa acaccagcag 2041 cagtttttcc agttccgttt atccgggcaa accatcgaag tgaccagcga atacctgttc 2101 cgtcatagcg ataacgagct cctgcactgg atggtggcgc tggatggtaa gccgctggca 2161 agcggtgaag tgcctctgga tgtcgctcca caaggtaaac agttgattga actgcctgaa 2221 ctaccgcagc cggagagcgc cgggcaactc tggctcacag tacgcgtagt gcaaccgaac 2281 gcgaccgcat ggtcagaagc cgggcacatc agcgcctggc agcagtggcg tctggcggaa 2341 aacctcagtg tgacgctccc cgccgcgtcc cacgccatcc cgcatctgac caccagcgaa 2401 atggattttt gcatcgagct gggtaataag cgttggcaat ttaaccgcca gtcaggcttt 2461 ctttcacaga tgtggattgg cgataaaaaa caactgctga cgccgctgcg cgatcagttc 2521 acccgtgcac cgctggataa cgacattggc gtaagtgaag cgacccgcat tgaccctaac 2581 gcctgggtcg aacgctggaa ggcggcgggc cattaccagg ccgaagcagc gttgttgcag 2641 tgcacggcag atacacttgc tgatgcggtg ctgattacga ccgctcacgc gtggcagcat 2701 caggggaaaa ccttatttat cagccggaaa acctaccgga ttgatggtag tggtcaaatg 2761 gcgattaccg ttgatgttga agtggcgagc gatacaccgc atccggcgcg gattggcctg 2821 aactgccagc tggcgcaggt agcagagcgg gtaaactggc tcggattagg gccgcaagaa 2881 aactatcccg accgccttac tgccgcctgt tttgaccgct gggatctgcc attgtcagac 2941 atgtataccc cgtacgtctt cccgagcgaa aacggtctgc gctgcgggac gcgcgaattg 3001 aattatggcc cacaccagtg gcgcggcgac ttccagttca acatcagccg ctacagtcaa 3061 cagcaactga tggaaaccag ccatcgccat ctgctgcacg cggaagaagg cacatggctg 3121 aatatcgacg gtttccatat ggggattggt ggcgacgact cctggagccc gtcagtatcg 3181 gcggaattcc aactgagcgc cggtcgctac cattaccaac ttgtctggtg tcaaaaataa 3241 taggggccgc tgtcatcaga tcgccatctc gcgcccgtgc ctctgacttc taagtccaat 3301 tactcttcaa catccctaca tgctctttct ccctgtgctc ccacccccta tttttgttat 3361 tatcaaaaaa acttcttctt aatttctttg ttttttagct tcttttaagt cacctctaac 3421 aatgaaattg tgtagattca aaaatagaat taattcgtaa taaaaagtcg aaaaaaattg 3481 tgctccctcc ccccattaat aataattcta tcccaaaatc tacacaatgt tctgtgtaca 3541 cttcttatgt tttttttact tctgataaat tttttttgaa acatcataga aaaaaccgca 3601 cacaaaatac cttatcatat gttacgtttc agtttatgac cgcaattttt atttcttcgc 3661 acgtctgggc ctctcatgac gtcaaatcat gctcatcgtg aaaaagtttt ggagtatttt 3721 tggaattttt caatcaagtg aaagtttatg aaattaattt tcctgctttt gctttttggg 3781 ggtttcccct attgtttgtc aagagtttcg aggacggcgt ttttcttgct aaaatcacaa 3841 gtattgatga gcacgatgca agaaagatcg gaagaaggtt tgggtttgag gctcagtgga 3901 aggtgagtag aagttgataa tttgaaagtg gagtagtgtc tatggggttt ttgccttaaa 3961 tgacagaata cattcccaat ataccaaaca taactgttta aaattaaaca tttttctaaa 4021 ttttatatga tttcttttaa atttgcaaaa attacttaaa tttgaattcc cgcgcaaatg 4081 agtgacttca ttttctgcat tattgtgttt tccggctata ttaataggta tttgtttgtg 4141 tttttcttta ttttatgatt cgaactccaa tttgtaaatt ttcgaacata tttccctaaa 4201 gaaaaaatat gattaatctg gaaaaattgg aaaattattt ttcaaataaa aaacaaagaa 4261 aaaaatgaag aaaaacctat tagtttggcc ataaaacgca aaaatgtcga aaatgacgtc 4321 actcatctgc gcgggaaatc aagaataatt cggccttttt tatttttttg gaaaatcgta 4381 aaacatttag aaaaattttt taatagttat agtgggactg tattctgtca tttagggcaa 4441 aagccagaga cgctactcca ccgttaacat gaattatgaa tattattgcg acaagaccca 4501 aacattgata aaccgcaaat ctagcctact agtcggccgt acgggccctt aaggccgcct 4561 gatgcggtat tttctcctta cgcatctgtg cggtatttca caccgcatat ggtgcactct 4621 cagtacaatc tgctctgatg ccgcatagtt aagccagccc cgacacccgc caacacccgc 4681 tgacgcgccc tgacgggctt gtctgctccc ggcatccgct tacagacaag ctgtgaccgt 4741 ctccgggagc tgcatgtgtc agaggttttc accgtcatca ccgaaacgcg cgagacgaaa 4801 gggcctcgtg atacgcctat ttttataggt taatgtcatg ataataatgg tttcttagac 4861 gtcaggtggc acttttcggg gaaatgtgcg cggaacccct atttgtttat ttttctaaat 4921 acattcaaat atgtatccgc tcatgagaca ataaccctga taaatgcttc aataatacaa 4981 ttttcagaat acgttttttg tgggcttggg tatattgttt ttaatgttat acttgcagtc 5041 gtgaaatttg attttcaaat ttgtagaaaa atcaagaaaa taattgcaac attcgcttgt 5101 gtcaaaaacc aatttcaaca aattttcgtg tgagaaatac attaccagaa ggcatttttt 5161 cacacgatta gcattttgga ctactttatt aaatttttgc gtgtaatttt gaattaaatt 5221 gtattatatt actacttaaa aaacaaaaaa tttgaccact gagcggatcg aacgcccaac 5281 ctttcgatct agagtcgaac gcgctaccat tgcgccaagc agtcatgtta ttctctcttg 5341 tcattgaaaa aggaagagta tgagtattca acatttccgt gtcgccctta ttcccttttt 5401 tgcggcattt tgccttcctg tttttgctca cccagaaacg ctggtgaaag taaaagatgc 5461 tgaagatcag ttgggtgcac gagtgggtta catcgaactg gatctcaaca gcggtaagat 5521 ccttgagagt tttcgccccg aagaacgttt tccaatgatg agcactttta aagttctgct 5581 atgtggcgcg gtattatccc gtattgacgc cgggcaagag caactcggtc gccgcataca 5641 ctattctcag aatgacttgg ttgagtactc accagtcaca gaaaagcatc ttacggatgg 5701 catgacagta agagaattat gcagtgctgc cataaccatg agtgataaca ctgcggccaa 5761 cttacttctg acaacgatcg gaggaccgaa ggagctaacc gcttttttgc acaacatggg 5821 ggatcatgta actcgccttg atcgttggga accggagctg aatgaagcca taccaaacga 5881 cgagcgtgac accacgatgc ctgtagcaat ggcaacaacg ttgcgcaaac tattaactgg 5941 cgaactactt actctagctt cccggcaaca attaatagac tggatggagg cggataaagt 6001 tgcaggacca cttctgcgct cggcccttcc ggctggctgg tttattgctg ataaatctgg 6061 agccggtgag cgtgggtctc gcggtatcat tgcagcactg gggccagatg gtaagccctc 6121 ccgtatcgta gttatctaca cgacggggag tcaggcaact atggatgaac gaaatagaca 6181 gatcgctgag ataggtgcct cactgattaa gcattggtaa ctgtcagacc aagtttactc 6241 atatatactt tagattgatt taaaacttca tttttaattt aaaaggatct aggtgaagat 6301 cctttttgat aatctcatga ccaaaatccc ttaacgtgag ttttcgttcc actgagcgtc 6361 agaccccgta gaaaagatca aaggatcttc ttgagatcct ttttttctgc gcgtaatctg 6421 ctgcttgcaa acaaaaaaac caccgctacc agcggtggtt tgtttgccgg atcaagagct 6481 accaactctt tttccgaagg taactggctt cagcagagcg cagataccaa atactgtcct 6541 tctagtgtag ccgtagttag gccaccactt caagaactct gtagcaccgc ctacatacct 6601 cgctctgcta atcctgttac cagtggctgc tgccagtggc gataagtcgt gtcttaccgg 6661 gttggactca agacgatagt taccggataa ggcgcagcgg tcgggctgaa cggggggttc 6721 gtgcacacag cccagcttgg agcgaacgac ctacaccgaa ctgagatacc tacagcgtga 6781 gctatgagaa agcgccacgc ttcccgaagg gagaaaggcg gacaggtatc cggtaagcgg 6841 cagggtcgga acaggagagc gcacgaggga gcttccaggg ggaaacgcct ggtatcttta 6901 tagtcctgtc gggtttcgcc acctctgact tgagcgtcga tttttgtgat gctcgtcagg 6961 ggggcggagc ctatggaaaa acgccagcaa cgcggccttt ttacggttcc tggccttttg 7021 ctggcctttt gctcacatgt tctttcctgc gttatcccct gattctgtgg ataaccgtat 7081 taccgccttt gagtgagctg ataccgctcg ccgcagccga acgaccgagc gcagcgagtc 7141 agtgagcgag gaagcggaag agcgcccaat acgcaaaccg cctctccccg cgcgttggcc 7201 gattcattaa tgcagctggc acgacaggtt tcccgactgg aaagcgggca gtgagcgcaa 7261 cgcaattaat gtgagttagc tcactcatta ggcaccccag gctttacact ttatgcttcc 7321 ggctcgtatg ttgtgtggaa ttgtgagcgg ataacaattt cacacaggaa acagct // LOCUS SYNLACZD 5730 bp ds-DNA SYN 17-JUL-1990 DEFINITION Cloning vector pPD16.43. ACCESSION M34299 KEYWORDS lacZ. SOURCE Cloning vector pPD16.43. ORGANISM Cloning vector Artificial sequences; Cloning vehicles. REFERENCE 1 (bases 1 to 5730) AUTHORS Fire,A.Z., Harrison,S. and Dixon,D. TITLE A modular set of lac-Z fusion vectors for studying gene expression in C.elegans JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by A.Z.Fire, 11-MAY-1990. Author address: A.Z.Fire Carnegie Inst of Washington Dept Embryology 115 West Univ Parkway Baltimore, MD 21210 email: AZF@JHUIGF.BITNET FEATURES from to/span description recomb 57 58 pUC19 end/synthetic start recomb 102 103 synthetic end/E.coli trpS start recomb 186 187 E.coli trpS end/synthetic start recomb 190 191 synthetic end/E.coli lacZ start recomb 3184 3185 E.coli lacZ end/synthetic start recomb 3264 3265 synthetic end/unknown DNA start recomb 3513 3514 unknown DNA end/synthetic start recomb 3524 3525 synthetic end/pUC19 start BASE COUNT 1348 a 1488 c 1580 g 1314 t ORIGIN 1 atgaccatga ttacgccaag cttgcatgcc tgcaggtcga ctctagagga tccccgggta 61 ccgagctcag aaaaaatgac tgctccaaag aagaagcgta aggtaccggt gggtgaagac 121 cagaaacagc acctcgaact gagccgcgat attgcccagc gtttcaacgc gctgtatggc 181 gagatcgatc ccgtcgtttt acaacgtcgt gactgggaaa accctggcgt tacccaactt 241 aatcgccttg cagcacatcc ccctttcgcc agctggcgta atagcgaaga ggcccgcacc 301 gatcgccctt cccaacagtt gcgcagcctg aatggcgaat ggcgctttgc ctggtttccg 361 gcaccagaag cggtgccgga aagctggctg gagtgcgatc ttcctgaggc cgatactgtc 421 gtcgtcccct caaactggca gatgcacggt tacgatgcgc ccatctacac caacgtaacc 481 tatcccatta cggtcaatcc gccgtttgtt cccacggaga atccgacggg ttgttactcg 541 ctcacattta atgttgatga aagctggcta caggaaggcc agacgcgaat tatttttgat 601 ggcgttaact cggcgtttca tctgtggtgc aacgggcgct gggtcggtta cggccaggac 661 agtcgtttgc cgtctgaatt tgacctgagc gcatttttac gcgccggaga aaaccgcctc 721 gcggtgatgg tgctgcgttg gagtgacggc agttatctgg aagatcagga tatgtggcgg 781 atgagcggca ttttccgtga cgtctcgttg ctgcataaac cgactacaca aatcagcgat 841 ttccatgttg ccactcgctt taatgatgat ttcagccgcg ctgtactgga ggctgaagtt 901 cagatgtgcg gcgagttgcg tgactaccta cgggtaacag tttctttatg gcagggtgaa 961 acgcaggtcg ccagcggcac cgcgcctttc ggcggtgaaa ttatcgatga gcgtggtggt 1021 tatgccgatc gcgtcacact acgtctgaac gtcgaaaacc cgaaactgtg gagcgccgaa 1081 atcccgaatc tctatcgtgc ggtggttgaa ctgcacaccg ccgacggcac gctgattgaa 1141 gcagaagcct gcgatgtcgg tttccgcgag gtgcggattg aaaatggtct gctgctgctg 1201 aacggcaagc cgttgctgat tcgaggcgtt aaccgtcacg agcatcatcc tctgcatggt 1261 caggtcatgg atgagcagac gatggtgcag gatatcctgc tgatgaagca gaacaacttt 1321 aacgccgtgc gctgttcgca ttatccgaac catccgctgt ggtacacgct gtgcgaccgc 1381 tacggcctgt atgtggtgga tgaagccaat attgaaaccc acggcatggt gccaatgaat 1441 cgtctgaccg atgatccgcg ctggctaccg gcgatgagcg aacgcgtaac gcgaatggtg 1501 cagcgcgatc gtaatcaccc gagtgtgatc atctggtcgc tggggaatga atcaggccac 1561 ggcgctaatc acgacgcgct gtatcgctgg atcaaatctg tcgatccttc ccgcccggtg 1621 cagtatgaag gcggcggagc cgacaccacg gccaccgata ttatttgccc gatgtacgcg 1681 cgcgtggatg aagaccagcc cttcccggct gtgccgaaat ggtccatcaa aaaatggctt 1741 tcgctacctg gagagacgcg cccgctgatc ctttgcgaat acgcccacgc gatgggtaac 1801 agtcttggcg gtttcgctaa atactggcag gcgtttcgtc agtatccccg tttacagggc 1861 ggcttcgtct gggactgggt ggatcagtcg ctgattaaat atgatgaaaa cggcaacccg 1921 tggtcggctt acggcggtga ttttggcgat acgccgaacg atcgccagtt ctgtatgaac 1981 ggtctggtct ttgccgaccg cacgccgcat ccagcgctga cggaagcaaa acaccagcag 2041 cagtttttcc agttccgttt atccgggcaa accatcgaag tgaccagcga atacctgttc 2101 cgtcatagcg ataacgagct cctgcactgg atggtggcgc tggatggtaa gccgctggca 2161 agcggtgaag tgcctctgga tgtcgctcca caaggtaaac agttgattga actgcctgaa 2221 ctaccgcagc cggagagcgc cgggcaactc tggctcacag tacgcgtagt gcaaccgaac 2281 gcgaccgcat ggtcagaagc cgggcacatc agcgcctggc agcagtggcg tctggcggaa 2341 aacctcagtg tgacgctccc cgccgcgtcc cacgccatcc cgcatctgac caccagcgaa 2401 atggattttt gcatcgagct gggtaataag cgttggcaat ttaaccgcca gtcaggcttt 2461 ctttcacaga tgtggattgg cgataaaaaa caactgctga cgccgctgcg cgatcagttc 2521 acccgtgcac cgctggataa cgacattggc gtaagtgaag cgacccgcat tgaccctaac 2581 gcctgggtcg aacgctggaa ggcggcgggc cattaccagg ccgaagcagc gttgttgcag 2641 tgcacggcag atacacttgc tgatgcggtg ctgattacga ccgctcacgc gtggcagcat 2701 caggggaaaa ccttatttat cagccggaaa acctaccgga ttgatggtag tggtcaaatg 2761 gcgattaccg ttgatgttga agtggcgagc gatacaccgc atccggcgcg gattggcctg 2821 aactgccagc tggcgcaggt agcagagcgg gtaaactggc tcggattagg gccgcaagaa 2881 aactatcccg accgccttac tgccgcctgt tttgaccgct gggatctgcc attgtcagac 2941 atgtataccc cgtacgtctt cccgagcgaa aacggtctgc gctgcgggac gcgcgaattg 3001 aattatggcc cacaccagtg gcgcggcgac ttccagttca acatcagccg ctacagtcaa 3061 cagcaactga tggaaaccag ccatcgccat ctgctgcacg cggaagaagg cacatggctg 3121 aatatcgacg gtttccatat ggggattggt ggcgacgact cctggagccc gtcagtatcg 3181 gcggaattcc aactgagcgc cggtcgctac cattaccaac ttgtctggtg tcaaaaataa 3241 taggcctact agtcggccgt acgggccctt tcgtctcgcg cgtttcggtg atgacggtga 3301 aaacctctga cacatgcagc tcccggagac ggtcacagct tgtctgtaag cggatgccgg 3361 gagcagacaa gcccgtcagg gcgcgtcagc gggtgttggc gggtgtcggg gctggcttaa 3421 ctatgcggca tcagagcaga ttgtactgag agtgcaccat atgcggtgtg aaataccgca 3481 cagatgcgta aggagaaaat accgcatcag gcggccttaa gggcctcgtg atacgcctat 3541 ttttataggt taatgtcatg ataataatgg tttcttagac gtcaggtggc acttttcggg 3601 gaaatgtgcg cggaacccct atttgtttat ttttctaaat acattcaaat atgtatccgc 3661 tcatgagaca ataaccctga taaatgcttc aataatattg aaaaaggaag agtatgagta 3721 ttcaacattt ccgtgtcgcc cttattccct tttttgcggc attttgcctt cctgtttttg 3781 ctcacccaga aacgctggtg aaagtaaaag atgctgaaga tcagttgggt gcacgagtgg 3841 gttacatcga actggatctc aacagcggta agatccttga gagttttcgc cccgaagaac 3901 gttttccaat gatgagcact tttaaagttc tgctatgtgg cgcggtatta tcccgtattg 3961 acgccgggca agagcaactc ggtcgccgca tacactattc tcagaatgac ttggttgagt 4021 actcaccagt cacagaaaag catcttacgg atggcatgac agtaagagaa ttatgcagtg 4081 ctgccataac catgagtgat aacactgcgg ccaacttact tctgacaacg atcggaggac 4141 cgaaggagct aaccgctttt ttgcacaaca tgggggatca tgtaactcgc cttgatcgtt 4201 gggaaccgga gctgaatgaa gccataccaa acgacgagcg tgacaccacg atgcctgtag 4261 caatggcaac aacgttgcgc aaactattaa ctggcgaact acttactcta gcttcccggc 4321 aacaattaat agactggatg gaggcggata aagttgcagg accacttctg cgctcggccc 4381 ttccggctgg ctggtttatt gctgataaat ctggagccgg tgagcgtggg tctcgcggta 4441 tcattgcagc actggggcca gatggtaagc cctcccgtat cgtagttatc tacacgacgg 4501 ggagtcaggc aactatggat gaacgaaata gacagatcgc tgagataggt gcctcactga 4561 ttaagcattg gtaactgtca gaccaagttt actcatatat actttagatt gatttaaaac 4621 ttcattttta atttaaaagg atctaggtga agatcctttt tgataatctc atgaccaaaa 4681 tcccttaacg tgagttttcg ttccactgag cgtcagaccc cgtagaaaag atcaaaggat 4741 cttcttgaga tccttttttt ctgcgcgtaa tctgctgctt gcaaacaaaa aaaccaccgc 4801 taccagcggt ggtttgtttg ccggatcaag agctaccaac tctttttccg aaggtaactg 4861 gcttcagcag agcgcagata ccaaatactg tccttctagt gtagccgtag ttaggccacc 4921 acttcaagaa ctctgtagca ccgcctacat acctcgctct gctaatcctg ttaccagtgg 4981 ctgctgccag tggcgataag tcgtgtctta ccgggttgga ctcaagacga tagttaccgg 5041 ataaggcgca gcggtcgggc tgaacggggg gttcgtgcac acagcccagc ttggagcgaa 5101 cgacctacac cgaactgaga tacctacagc gtgagcattg agaaagcgcc acgcttcccg 5161 aagggagaaa ggcggacagg tatccggtaa gcggcagggt cggaacagga gagcgcacga 5221 gggagcttcc agggggaaac gcctggtatc tttatagtcc tgtcgggttt cgccacctct 5281 gacttgagcg tcgatttttg tgatgctcgt caggggggcg gagcctatgg aaaaacgcca 5341 gcaacgcggc ctttttacgg ttcctggcct tttgctggcc ttttgctcac atgttctttc 5401 ctgcgttatc ccctgattct gtggataacc gtattaccgc ctttgagtga gctgataccg 5461 ctcgccgcag ccgaacgacc gagcgcagcg agtcagtgag cgaggaagcg gaagagcgcc 5521 caatacgcaa accgcctctc cccgcgcgtt ggccgattca ttaatgcagc tggcacgaca 5581 ggtttcccga ctggaaagcg ggcagtgagc gcaacgcaat taatgtgagt tagctcactc 5641 attaggcacc ccaggcttta cactttatgc ttccggctcg tatgttgtgt ggaattgtga 5701 gcggataaca atttcacaca ggaaacagct // LOCUS SYNLACZE 7010 bp ds-DNA SYN 17-JUL-1990 DEFINITION Cloning vector pPD16.51. ACCESSION M34300 KEYWORDS lacZ. SOURCE Cloning vector pPD16.51. ORGANISM Cloning vector Artificial sequences; Cloning vehicles. REFERENCE 1 (bases 1 to 7010) AUTHORS Fire,A.Z., Harrison,S. and Dixon,D. TITLE A modular set of lac-Z fusion vectors for studying gene expression in C.elegans JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by A.Z.Fire, 11-MAY-1990. Author address: A.Z.Fire Carnegie Inst of Washington Dept Embryology 115 West Univ Parkway Baltimore, MD 21210 email: AZF@JHUIGF.BITNET FEATURES from to/span description recomb 57 58 pUC19 end/synthetic start recomb 102 103 synthetic end/E.coli trpS start recomb 186 187 E.coli trpS end/synthetic start recomb 190 191 synthetic end/E.coli lacZ start recomb 3184 3185 E.coli lacZ end/synthetic start recomb 4544 4545 synthetic end/pUC19 start recomb 4793 4794 pUC19 end/synthetic start recomb 3244 3245 synthetic end/unknown DNA start recomb 4524 4525 unknown DNA end/synthetic start recomb 4804 4805 synthetic end/pUC19 start BASE COUNT 1753 a 1710 c 1778 g 1769 t ORIGIN 1 atgaccatga ttacgccaag cttgcatgcc tgcaggtcga ctctagagga tccccgggta 61 ccgagctcag aaaaaatgac tgctccaaag aagaagcgta aggtaccggt gggtgaagac 121 cagaaacagc acctcgaact gagccgcgat attgcccagc gtttcaacgc gctgtatggc 181 gagatcgatc ccgtcgtttt acaacgtcgt gactgggaaa accctggcgt tacccaactt 241 aatcgccttg cagcacatcc ccctttcgcc agctggcgta atagcgaaga ggcccgcacc 301 gatcgccctt cccaacagtt gcgcagcctg aatggcgaat ggcgctttgc ctggtttccg 361 gcaccagaag cggtgccgga aagctggctg gagtgcgatc ttcctgaggc cgatactgtc 421 gtcgtcccct caaactggca gatgcacggt tacgatgcgc ccatctacac caacgtaacc 481 tatcccatta cggtcaatcc gccgtttgtt cccacggaga atccgacggg ttgttactcg 541 ctcacattta atgttgatga aagctggcta caggaaggcc agacgcgaat tatttttgat 601 ggcgttaact cggcgtttca tctgtggtgc aacgggcgct gggtcggtta cggccaggac 661 agtcgtttgc cgtctgaatt tgacctgagc gcatttttac gcgccggaga aaaccgcctc 721 gcggtgatgg tgctgcgttg gagtgacggc agttatctgg aagatcagga tatgtggcgg 781 atgagcggca ttttccgtga cgtctcgttg ctgcataaac cgactacaca aatcagcgat 841 ttccatgttg ccactcgctt taatgatgat ttcagccgcg ctgtactgga ggctgaagtt 901 cagatgtgcg gcgagttgcg tgactaccta cgggtaacag tttctttatg gcagggtgaa 961 acgcaggtcg ccagcggcac cgcgcctttc ggcggtgaaa ttatcgatga gcgtggtggt 1021 tatgccgatc gcgtcacact acgtctgaac gtcgaaaacc cgaaactgtg gagcgccgaa 1081 atcccgaatc tctatcgtgc ggtggttgaa ctgcacaccg ccgacggcac gctgattgaa 1141 gcagaagcct gcgatgtcgg tttccgcgag gtgcggattg aaaatggtct gctgctgctg 1201 aacggcaagc cgttgctgat tcgaggcgtt aaccgtcacg agcatcatcc tctgcatggt 1261 caggtcatgg atgagcagac gatggtgcag gatatcctgc tgatgaagca gaacaacttt 1321 aacgccgtgc gctgttcgca ttatccgaac catccgctgt ggtacacgct gtgcgaccgc 1381 tacggcctgt atgtggtgga tgaagccaat attgaaaccc acggcatggt gccaatgaat 1441 cgtctgaccg atgatccgcg ctggctaccg gcgatgagcg aacgcgtaac gcgaatggtg 1501 cagcgcgatc gtaatcaccc gagtgtgatc atctggtcgc tggggaatga atcaggccac 1561 ggcgctaatc acgacgcgct gtatcgctgg atcaaatctg tcgatccttc ccgcccggtg 1621 cagtatgaag gcggcggagc cgacaccacg gccaccgata ttatttgccc gatgtacgcg 1681 cgcgtggatg aagaccagcc cttcccggct gtgccgaaat ggtccatcaa aaaatggctt 1741 tcgctacctg gagagacgcg cccgctgatc ctttgcgaat acgcccacgc gatgggtaac 1801 agtcttggcg gtttcgctaa atactggcag gcgtttcgtc agtatccccg tttacagggc 1861 ggcttcgtct gggactgggt ggatcagtcg ctgattaaat atgatgaaaa cggcaacccg 1921 tggtcggctt acggcggtga ttttggcgat acgccgaacg atcgccagtt ctgtatgaac 1981 ggtctggtct ttgccgaccg cacgccgcat ccagcgctga cggaagcaaa acaccagcag 2041 cagtttttcc agttccgttt atccgggcaa accatcgaag tgaccagcga atacctgttc 2101 cgtcatagcg ataacgagct cctgcactgg atggtggcgc tggatggtaa gccgctggca 2161 agcggtgaag tgcctctgga tgtcgctcca caaggtaaac agttgattga actgcctgaa 2221 ctaccgcagc cggagagcgc cgggcaactc tggctcacag tacgcgtagt gcaaccgaac 2281 gcgaccgcat ggtcagaagc cgggcacatc agcgcctggc agcagtggcg tctggcggaa 2341 aacctcagtg tgacgctccc cgccgcgtcc cacgccatcc cgcatctgac caccagcgaa 2401 atggattttt gcatcgagct gggtaataag cgttggcaat ttaaccgcca gtcaggcttt 2461 ctttcacaga tgtggattgg cgataaaaaa caactgctga cgccgctgcg cgatcagttc 2521 acccgtgcac cgctggataa cgacattggc gtaagtgaag cgacccgcat tgaccctaac 2581 gcctgggtcg aacgctggaa ggcggcgggc cattaccagg ccgaagcagc gttgttgcag 2641 tgcacggcag atacacttgc tgatgcggtg ctgattacga ccgctcacgc gtggcagcat 2701 caggggaaaa ccttatttat cagccggaaa acctaccgga ttgatggtag tggtcaaatg 2761 gcgattaccg ttgatgttga agtggcgagc gatacaccgc atccggcgcg gattggcctg 2821 aactgccagc tggcgcaggt agcagagcgg gtaaactggc tcggattagg gccgcaagaa 2881 aactatcccg accgccttac tgccgcctgt tttgaccgct gggatctgcc attgtcagac 2941 atgtataccc cgtacgtctt cccgagcgaa aacggtctgc gctgcgggac gcgcgaattg 3001 aattatggcc cacaccagtg gcgcggcgac ttccagttca acatcagccg ctacagtcaa 3061 cagcaactga tggaaaccag ccatcgccat ctgctgcacg cggaagaagg cacatggctg 3121 aatatcgacg gtttccatat ggggattggt ggcgacgact cctggagccc gtcagtatcg 3181 gcggaattcc aactgagcgc cggtcgctac cattaccaac ttgtctggtg tcaaaaataa 3241 taggggccgc tgtcatcaga tcgccatctc gcgcccgtgc ctctgacttc taagtccaat 3301 tactcttcaa catccctaca tgctctttct ccctgtgctc ccacccccta tttttgttat 3361 tatcaaaaaa acttcttctt aatttctttg ttttttagct tcttttaagt cacctctaac 3421 aatgaaattg tgtagattca aaaatagaat taattcgtaa taaaaagtcg aaaaaaattg 3481 tgctccctcc ccccattaat aataattcta tcccaaaatc tacacaatgt tctgtgtaca 3541 cttcttatgt tttttttact tctgataaat tttttttgaa acatcataga aaaaaccgca 3601 cacaaaatac cttatcatat gttacgtttc agtttatgac cgcaattttt atttcttcgc 3661 acgtctgggc ctctcatgac gtcaaatcat gctcatcgtg aaaaagtttt ggagtatttt 3721 tggaattttt caatcaagtg aaagtttatg aaattaattt tcctgctttt gctttttggg 3781 ggtttcccct attgtttgtc aagagtttcg aggacggcgt ttttcttgct aaaatcacaa 3841 gtattgatga gcacgatgca agaaagatcg gaagaaggtt tgggtttgag gctcagtgga 3901 aggtgagtag aagttgataa tttgaaagtg gagtagtgtc tatggggttt ttgccttaaa 3961 tgacagaata cattcccaat ataccaaaca taactgttta aaattaaaca tttttctaaa 4021 ttttatatga tttcttttaa atttgcaaaa attacttaaa tttgaattcc cgcgcaaatg 4081 agtgacttca ttttctgcat tattgtgttt tccggctata ttaataggta tttgtttgtg 4141 tttttcttta ttttatgatt cgaactccaa tttgtaaatt ttcgaacata tttccctaaa 4201 gaaaaaatat gattaatctg gaaaaattgg aaaattattt ttcaaataaa aaacaaagaa 4261 aaaaatgaag aaaaacctat tagtttggcc ataaaacgca aaaatgtcga aaatgacgtc 4321 actcatctgc gcgggaaatc aagaataatt cggccttttt tatttttttg gaaaatcgta 4381 aaacatttag aaaaattttt taatagttat agtgggactg tattctgtca tttagggcaa 4441 aagccagaga cgctactcca ccgttaacat gaattatgaa tattattgcg acaagaccca 4501 aacattgata aaccgcaaat ctagcctact agtcggccgt acgggccctt tcgtctcgcg 4561 cgtttcggtg atgacggtga aaacctctga cacatgcagc tcccggagac ggtcacagct 4621 tgtctgtaag cggatgccgg gagcagacaa gcccgtcagg gcgcgtcagc gggtgttggc 4681 gggtgtcggg gctggcttaa ctatgcggca tcagagcaga ttgtactgag agtgcaccat 4741 atgcggtgtg aaataccgca cagatgcgta aggagaaaat accgcatcag gcggccttaa 4801 gggcctcgtg atacgcctat ttttataggt taatgtcatg ataataatgg tttcttagac 4861 gtcaggtggc acttttcggg gaaatgtgcg cggaacccct atttgtttat ttttctaaat 4921 acattcaaat atgtatccgc tcatgagaca ataaccctga taaatgcttc aataatattg 4981 aaaaaggaag agtatgagta ttcaacattt ccgtgtcgcc cttattccct tttttgcggc 5041 attttgcctt cctgtttttg ctcacccaga aacgctggtg aaagtaaaag atgctgaaga 5101 tcagttgggt gcacgagtgg gttacatcga actggatctc aacagcggta agatccttga 5161 gagttttcgc cccgaagaac gttttccaat gatgagcact tttaaagttc tgctatgtgg 5221 cgcggtatta tcccgtattg acgccgggca agagcaactc ggtcgccgca tacactattc 5281 tcagaatgac ttggttgagt actcaccagt cacagaaaag catcttacgg atggcatgac 5341 agtaagagaa ttatgcagtg ctgccataac catgagtgat aacactgcgg ccaacttact 5401 tctgacaacg atcggaggac cgaaggagct aaccgctttt ttgcacaaca tgggggatca 5461 tgtaactcgc cttgatcgtt gggaaccgga gctgaatgaa gccataccaa acgacgagcg 5521 tgacaccacg atgcctgtag caatggcaac aacgttgcgc aaactattaa ctggcgaact 5581 acttactcta gcttcccggc aacaattaat agactggatg gaggcggata aagttgcagg 5641 accacttctg cgctcggccc ttccggctgg ctggtttatt gctgataaat ctggagccgg 5701 tgagcgtggg tctcgcggta tcattgcagc actggggcca gatggtaagc cctcccgtat 5761 cgtagttatc tacacgacgg ggagtcaggc aactatggat gaacgaaata gacagatcgc 5821 tgagataggt gcctcactga ttaagcattg gtaactgtca gaccaagttt actcatatat 5881 actttagatt gatttaaaac ttcattttta atttaaaagg atctaggtga agatcctttt 5941 tgataatctc atgaccaaaa tcccttaacg tgagttttcg ttccactgag cgtcagaccc 6001 cgtagaaaag atcaaaggat cttcttgaga tccttttttt ctgcgcgtaa tctgctgctt 6061 gcaaacaaaa aaaccaccgc taccagcggt ggtttgtttg ccggatcaag agctaccaac 6121 tctttttccg aaggtaactg gcttcagcag agcgcagata ccaaatactg tccttctagt 6181 gtagccgtag ttaggccacc acttcaagaa ctctgtagca ccgcctacat acctcgctct 6241 gctaatcctg ttaccagtgg ctgctgccag tggcgataag tcgtgtctta ccgggttgga 6301 ctcaagacga tagttaccgg ataaggcgca gcggtcgggc tgaacggggg gttcgtgcac 6361 acagcccagc ttggagcgaa cgacctacac cgaactgaga tacctacagc gtgagcattg 6421 agaaagcgcc acgcttcccg aagggagaaa ggcggacagg tatccggtaa gcggcagggt 6481 cggaacagga gagcgcacga gggagcttcc agggggaaac gcctggtatc tttatagtcc 6541 tgtcgggttt cgccacctct gacttgagcg tcgatttttg tgatgctcgt caggggggcg 6601 gagcctatgg aaaaacgcca gcaacgcggc ctttttacgg ttcctggcct tttgctggcc 6661 ttttgctcac atgttctttc ctgcgttatc ccctgattct gtggataacc gtattaccgc 6721 ctttgagtga gctgataccg ctcgccgcag ccgaacgacc gagcgcagcg agtcagtgag 6781 cgaggaagcg gaagagcgcc caatacgcaa accgcctctc cccgcgcgtt ggccgattca 6841 ttaatgcagc tggcacgaca ggtttcccga ctggaaagcg ggcagtgagc gcaacgcaat 6901 taatgtgagt tagctcactc attaggcacc ccaggcttta cactttatgc ttccggctcg 6961 tatgttgtgt ggaattgtga gcggataaca atttcacaca ggaaacagct // LOCUS SYNLACZF 7088 bp ds-DNA SYN 17-JUL-1990 DEFINITION Cloning vector pPD18.32. ACCESSION M34301 KEYWORDS lacZ. SOURCE Cloning vector pPD18.32. ORGANISM Cloning vector Artificial sequences; Cloning vehicles. REFERENCE 1 (bases 1 to 7088) AUTHORS Fire,A.Z., Harrison,S. and Dixon,D. TITLE A modular set of lac-Z fusion vectors for studying gene expression in C.elegans JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by A.Z.Fire, 11-MAY-1990. Author address: A.Z.Fire Carnegie Inst of Washington Dept Embryology 115 West Univ Parkway Baltimore, MD 21210 email: AZF@JHUIGF.BITNET FEATURES from to/span description recomb 57 58 pUC19 end/synthetic start recomb 180 181 synthetic end/E.coli trpS start recomb 264 265 E.coli trpS end/synthetic start recomb 268 269 synthetic end/E.coli lacZ start recomb 3262 3263 E.coli lacZ end/synthetic start recomb 4622 4623 synthetic end/pUC19 start recomb 4871 4872 pUC19 end/synthetic start recomb 3322 3323 synthetic end/unknown DNA start recomb 4602 4603 unknown DNA end/synthetic start recomb 4882 4883 synthetic end/pUC19 start BASE COUNT 1780 a 1724 c 1797 g 1787 t ORIGIN 1 atgaccatga ttacgccaag cttgcatgcc tgcaggtcga ctctagagga tccccgggat 61 tggccaaagg acccaaaggt atgtttcgaa tgatactaac ataacataga acattttcag 121 gaggaccctt ggagggtacc gagctcagaa aaaatgactg ctccaaagaa gaagcgtaag 181 gtaccggtgg gtgaagacca gaaacagcac ctcgaactga gccgcgatat tgcccagcgt 241 ttcaacgcgc tgtatggcga gatcgatccc gtcgttttac aacgtcgtga ctgggaaaac 301 cctggcgtta cccaacttaa tcgccttgca gcacatcccc ctttcgccag ctggcgtaat 361 agcgaagagg cccgcaccga tcgcccttcc caacagttgc gcagcctgaa tggcgaatgg 421 cgctttgcct ggtttccggc accagaagcg gtgccggaaa gctggctgga gtgcgatctt 481 cctgaggccg atactgtcgt cgtcccctca aactggcaga tgcacggtta cgatgcgccc 541 atctacacca acgtaaccta tcccattacg gtcaatccgc cgtttgttcc cacggagaat 601 ccgacgggtt gttactcgct cacatttaat gttgatgaaa gctggctaca ggaaggccag 661 acgcgaatta tttttgatgg cgttaactcg gcgtttcatc tgtggtgcaa cgggcgctgg 721 gtcggttacg gccaggacag tcgtttgccg tctgaatttg acctgagcgc atttttacgc 781 gccggagaaa accgcctcgc ggtgatggtg ctgcgttgga gtgacggcag ttatctggaa 841 gatcaggata tgtggcggat gagcggcatt ttccgtgacg tctcgttgct gcataaaccg 901 actacacaaa tcagcgattt ccatgttgcc actcgcttta atgatgattt cagccgcgct 961 gtactggagg ctgaagttca gatgtgcggc gagttgcgtg actacctacg ggtaacagtt 1021 tctttatggc agggtgaaac gcaggtcgcc agcggcaccg cgcctttcgg cggtgaaatt 1081 atcgatgagc gtggtggtta tgccgatcgc gtcacactac gtctgaacgt cgaaaacccg 1141 aaactgtgga gcgccgaaat cccgaatctc tatcgtgcgg tggttgaact gcacaccgcc 1201 gacggcacgc tgattgaagc agaagcctgc gatgtcggtt tccgcgaggt gcggattgaa 1261 aatggtctgc tgctgctgaa cggcaagccg ttgctgattc gaggcgttaa ccgtcacgag 1321 catcatcctc tgcatggtca ggtcatggat gagcagacga tggtgcagga tatcctgctg 1381 atgaagcaga acaactttaa cgccgtgcgc tgttcgcatt atccgaacca tccgctgtgg 1441 tacacgctgt gcgaccgcta cggcctgtat gtggtggatg aagccaatat tgaaacccac 1501 ggcatggtgc caatgaatcg tctgaccgat gatccgcgct ggctaccggc gatgagcgaa 1561 cgcgtaacgc gaatggtgca gcgcgatcgt aatcacccga gtgtgatcat ctggtcgctg 1621 gggaatgaat caggccacgg cgctaatcac gacgcgctgt atcgctggat caaatctgtc 1681 gatccttccc gcccggtgca gtatgaaggc ggcggagccg acaccacggc caccgatatt 1741 atttgcccga tgtacgcgcg cgtggatgaa gaccagccct tcccggctgt gccgaaatgg 1801 tccatcaaaa aatggctttc gctacctgga gagacgcgcc cgctgatcct ttgcgaatac 1861 gcccacgcga tgggtaacag tcttggcggt ttcgctaaat actggcaggc gtttcgtcag 1921 tatccccgtt tacagggcgg cttcgtctgg gactgggtgg atcagtcgct gattaaatat 1981 gatgaaaacg gcaacccgtg gtcggcttac ggcggtgatt ttggcgatac gccgaacgat 2041 cgccagttct gtatgaacgg tctggtcttt gccgaccgca cgccgcatcc agcgctgacg 2101 gaagcaaaac accagcagca gtttttccag ttccgtttat ccgggcaaac catcgaagtg 2161 accagcgaat acctgttccg tcatagcgat aacgagctcc tgcactggat ggtggcgctg 2221 gatggtaagc cgctggcaag cggtgaagtg cctctggatg tcgctccaca aggtaaacag 2281 ttgattgaac tgcctgaact accgcagccg gagagcgccg ggcaactctg gctcacagta 2341 cgcgtagtgc aaccgaacgc gaccgcatgg tcagaagccg ggcacatcag cgcctggcag 2401 cagtggcgtc tggcggaaaa cctcagtgtg acgctccccg ccgcgtccca cgccatcccg 2461 catctgacca ccagcgaaat ggatttttgc atcgagctgg gtaataagcg ttggcaattt 2521 aaccgccagt caggctttct ttcacagatg tggattggcg ataaaaaaca actgctgacg 2581 ccgctgcgcg atcagttcac ccgtgcaccg ctggataacg acattggcgt aagtgaagcg 2641 acccgcattg accctaacgc ctgggtcgaa cgctggaagg cggcgggcca ttaccaggcc 2701 gaagcagcgt tgttgcagtg cacggcagat acacttgctg atgcggtgct gattacgacc 2761 gctcacgcgt ggcagcatca ggggaaaacc ttatttatca gccggaaaac ctaccggatt 2821 gatggtagtg gtcaaatggc gattaccgtt gatgttgaag tggcgagcga tacaccgcat 2881 ccggcgcgga ttggcctgaa ctgccagctg gcgcaggtag cagagcgggt aaactggctc 2941 ggattagggc cgcaagaaaa ctatcccgac cgccttactg ccgcctgttt tgaccgctgg 3001 gatctgccat tgtcagacat gtataccccg tacgtcttcc cgagcgaaaa cggtctgcgc 3061 tgcgggacgc gcgaattgaa ttatggccca caccagtggc gcggcgactt ccagttcaac 3121 atcagccgct acagtcaaca gcaactgatg gaaaccagcc atcgccatct gctgcacgcg 3181 gaagaaggca catggctgaa tatcgacggt ttccatatgg ggattggtgg cgacgactcc 3241 tggagcccgt cagtatcggc ggaattccaa ctgagcgccg gtcgctacca ttaccaactt 3301 gtctggtgtc aaaaataata ggggccgctg tcatcagatc gccatctcgc gcccgtgcct 3361 ctgacttcta agtccaatta ctcttcaaca tccctacatg ctctttctcc ctgtgctccc 3421 accccctatt tttgttatta tcaaaaaaac ttcttcttaa tttctttgtt ttttagcttc 3481 ttttaagtca cctctaacaa tgaaattgtg tagattcaaa aatagaatta attcgtaata 3541 aaaagtcgaa aaaaattgtg ctccctcccc ccattaataa taattctatc ccaaaatcta 3601 cacaatgttc tgtgtacact tcttatgttt tttttacttc tgataaattt tttttgaaac 3661 atcatagaaa aaaccgcaca caaaatacct tatcatatgt tacgtttcag tttatgaccg 3721 caatttttat ttcttcgcac gtctgggcct ctcatgacgt caaatcatgc tcatcgtgaa 3781 aaagttttgg agtatttttg gaatttttca atcaagtgaa agtttatgaa attaattttc 3841 ctgcttttgc tttttggggg tttcccctat tgtttgtcaa gagtttcgag gacggcgttt 3901 ttcttgctaa aatcacaagt attgatgagc acgatgcaag aaagatcgga agaaggtttg 3961 ggtttgaggc tcagtggaag gtgagtagaa gttgataatt tgaaagtgga gtagtgtcta 4021 tggggttttt gccttaaatg acagaataca ttcccaatat accaaacata actgtttaaa 4081 attaaacatt tttctaaatt ttatatgatt tcttttaaat ttgcaaaaat tacttaaatt 4141 tgaattcccg cgcaaatgag tgacttcatt ttctgcatta ttgtgttttc cggctatatt 4201 aataggtatt tgtttgtgtt tttctttatt ttatgattcg aactccaatt tgtaaatttt 4261 cgaacatatt tccctaaaga aaaaatatga ttaatctgga aaaattggaa aattattttt 4321 caaataaaaa acaaagaaaa aaatgaagaa aaacctatta gtttggccat aaaacgcaaa 4381 aatgtcgaaa atgacgtcac tcatctgcgc gggaaatcaa gaataattcg gcctttttta 4441 tttttttgga aaatcgtaaa acatttagaa aaatttttta atagttatag tgggactgta 4501 ttctgtcatt tagggcaaaa gccagagacg ctactccacc gttaacatga attatgaata 4561 ttattgcgac aagacccaaa cattgataaa ccgcaaatct agcctactag tcggccgtac 4621 gggccctttc gtctcgcgcg tttcggtgat gacggtgaaa acctctgaca catgcagctc 4681 ccggagacgg tcacagcttg tctgtaagcg gatgccggga gcagacaagc ccgtcagggc 4741 gcgtcagcgg gtgttggcgg gtgtcggggc tggcttaact atgcggcatc agagcagatt 4801 gtactgagag tgcaccatat gcggtgtgaa ataccgcaca gatgcgtaag gagaaaatac 4861 cgcatcaggc ggccttaagg gcctcgtgat acgcctattt ttataggtta atgtcatgat 4921 aataatggtt tcttagacgt caggtggcac ttttcgggga aatgtgcgcg gaacccctat 4981 ttgtttattt ttctaaatac attcaaatat gtatccgctc atgagacaat aaccctgata 5041 aatgcttcaa taatattgaa aaaggaagag tatgagtatt caacatttcc gtgtcgccct 5101 tattcccttt tttgcggcat tttgccttcc tgtttttgct cacccagaaa cgctggtgaa 5161 agtaaaagat gctgaagatc agttgggtgc acgagtgggt tacatcgaac tggatctcaa 5221 cagcggtaag atccttgaga gttttcgccc cgaagaacgt tttccaatga tgagcacttt 5281 taaagttctg ctatgtggcg cggtattatc ccgtattgac gccgggcaag agcaactcgg 5341 tcgccgcata cactattctc agaatgactt ggttgagtac tcaccagtca cagaaaagca 5401 tcttacggat ggcatgacag taagagaatt atgcagtgct gccataacca tgagtgataa 5461 cactgcggcc aacttacttc tgacaacgat cggaggaccg aaggagctaa ccgctttttt 5521 gcacaacatg ggggatcatg taactcgcct tgatcgttgg gaaccggagc tgaatgaagc 5581 cataccaaac gacgagcgtg acaccacgat gcctgtagca atggcaacaa cgttgcgcaa 5641 actattaact ggcgaactac ttactctagc ttcccggcaa caattaatag actggatgga 5701 ggcggataaa gttgcaggac cacttctgcg ctcggccctt ccggctggct ggtttattgc 5761 tgataaatct ggagccggtg agcgtgggtc tcgcggtatc attgcagcac tggggccaga 5821 tggtaagccc tcccgtatcg tagttatcta cacgacgggg agtcaggcaa ctatggatga 5881 acgaaataga cagatcgctg agataggtgc ctcactgatt aagcattggt aactgtcaga 5941 ccaagtttac tcatatatac tttagattga tttaaaactt catttttaat ttaaaaggat 6001 ctaggtgaag atcctttttg ataatctcat gaccaaaatc ccttaacgtg agttttcgtt 6061 ccactgagcg tcagaccccg tagaaaagat caaaggatct tcttgagatc ctttttttct 6121 gcgcgtaatc tgctgcttgc aaacaaaaaa accaccgcta ccagcggtgg tttgtttgcc 6181 ggatcaagag ctaccaactc tttttccgaa ggtaactggc ttcagcagag cgcagatacc 6241 aaatactgtc cttctagtgt agccgtagtt aggccaccac ttcaagaact ctgtagcacc 6301 gcctacatac ctcgctctgc taatcctgtt accagtggct gctgccagtg gcgataagtc 6361 gtgtcttacc gggttggact caagacgata gttaccggat aaggcgcagc ggtcgggctg 6421 aacggggggt tcgtgcacac agcccagctt ggagcgaacg acctacaccg aactgagata 6481 cctacagcgt gagcattgag aaagcgccac gcttcccgaa gggagaaagg cggacaggta 6541 tccggtaagc ggcagggtcg gaacaggaga gcgcacgagg gagcttccag ggggaaacgc 6601 ctggtatctt tatagtcctg tcgggtttcg ccacctctga cttgagcgtc gatttttgtg 6661 atgctcgtca ggggggcgga gcctatggaa aaacgccagc aacgcggcct ttttacggtt 6721 cctggccttt tgctggcctt ttgctcacat gttctttcct gcgttatccc ctgattctgt 6781 ggataaccgt attaccgcct ttgagtgagc tgataccgct cgccgcagcc gaacgaccga 6841 gcgcagcgag tcagtgagcg aggaagcgga agagcgccca atacgcaaac cgcctctccc 6901 cgcgcgttgg ccgattcatt aatgcagctg gcacgacagg tttcccgact ggaaagcggg 6961 cagtgagcgc aacgcaatta atgtgagtta gctcactcat taggcacccc aggctttaca 7021 ctttatgctt ccggctcgta tgttgtgtgg aattgtgagc ggataacaat ttcacacagg 7081 aaacagct // LOCUS SYNLACZG 6563 bp ds-DNA SYN 17-JUL-1990 DEFINITION Cloning vector pPD21.28. ACCESSION M34302 KEYWORDS lacZ. SOURCE Cloning vector pPD21.28. ORGANISM Cloning vector Artificial sequences; Cloning vehicles. REFERENCE 1 (bases 1 to 6563) AUTHORS Fire,A.Z., Harrison,S. and Dixon,D. TITLE A modular set of lac-Z fusion vectors for studying gene expression in C.elegans JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by A.Z.Fire, 11-MAY-1990. Author address: A.Z.Fire Carnegie Inst of Washington Dept Embryology 115 West Univ Parkway Baltimore, MD 21210 email: AZF@JHUIGF.BITNET FEATURES from to/span description recomb 57 58 pUC19 end/synthetic start recomb 180 181 synthetic end/E.coli trpS start recomb 264 265 E.coli trpS end/synthetic start recomb 268 269 synthetic end/E.coli lacZ start recomb 3262 3263 E.coli lacZ end/synthetic start recomb 4097 4098 synthetic end/pUC19 start recomb 4346 4347 pUC19 end/synthetic start recomb 3322 3323 synthetic end/unknown DNA start recomb 4077 4078 unknown DNA end/synthetic start recomb 4357 4358 synthetic end/pUC19 start BASE COUNT 1587 a 1650 c 1723 g 1603 t ORIGIN 1 atgaccatga ttacgccaag cttgcatgcc tgcaggtcga ctctagagga tccccgggat 61 tggccaaagg acccaaaggt atgtttcgaa tgatactaac ataacataga acattttcag 121 gaggaccctt ggagggtacc gagctcagaa aaaatgactg ctccaaagaa gaagcgtaag 181 gtaccggtgg gtgaagacca gaaacagcac ctcgaactga gccgcgatat tgcccagcgt 241 ttcaacgcgc tgtatggcga gatcgatccc gtcgttttac aacgtcgtga ctgggaaaac 301 cctggcgtta cccaacttaa tcgccttgca gcacatcccc ctttcgccag ctggcgtaat 361 agcgaagagg cccgcaccga tcgcccttcc caacagttgc gcagcctgaa tggcgaatgg 421 cgctttgcct ggtttccggc accagaagcg gtgccggaaa gctggctgga gtgcgatctt 481 cctgaggccg atactgtcgt cgtcccctca aactggcaga tgcacggtta cgatgcgccc 541 atctacacca acgtaaccta tcccattacg gtcaatccgc cgtttgttcc cacggagaat 601 ccgacgggtt gttactcgct cacatttaat gttgatgaaa gctggctaca ggaaggccag 661 acgcgaatta tttttgatgg cgttaactcg gcgtttcatc tgtggtgcaa cgggcgctgg 721 gtcggttacg gccaggacag tcgtttgccg tctgaatttg acctgagcgc atttttacgc 781 gccggagaaa accgcctcgc ggtgatggtg ctgcgttgga gtgacggcag ttatctggaa 841 gatcaggata tgtggcggat gagcggcatt ttccgtgacg tctcgttgct gcataaaccg 901 actacacaaa tcagcgattt ccatgttgcc actcgcttta atgatgattt cagccgcgct 961 gtactggagg ctgaagttca gatgtgcggc gagttgcgtg actacctacg ggtaacagtt 1021 tctttatggc agggtgaaac gcaggtcgcc agcggcaccg cgcctttcgg cggtgaaatt 1081 atcgatgagc gtggtggtta tgccgatcgc gtcacactac gtctgaacgt cgaaaacccg 1141 aaactgtgga gcgccgaaat cccgaatctc tatcgtgcgg tggttgaact gcacaccgcc 1201 gacggcacgc tgattgaagc agaagcctgc gatgtcggtt tccgcgaggt gcggattgaa 1261 aatggtctgc tgctgctgaa cggcaagccg ttgctgattc gaggcgttaa ccgtcacgag 1321 catcatcctc tgcatggtca ggtcatggat gagcagacga tggtgcagga tatcctgctg 1381 atgaagcaga acaactttaa cgccgtgcgc tgttcgcatt atccgaacca tccgctgtgg 1441 tacacgctgt gcgaccgcta cggcctgtat gtggtggatg aagccaatat tgaaacccac 1501 ggcatggtgc caatgaatcg tctgaccgat gatccgcgct ggctaccggc gatgagcgaa 1561 cgcgtaacgc gaatggtgca gcgcgatcgt aatcacccga gtgtgatcat ctggtcgctg 1621 gggaatgaat caggccacgg cgctaatcac gacgcgctgt atcgctggat caaatctgtc 1681 gatccttccc gcccggtgca gtatgaaggc ggcggagccg acaccacggc caccgatatt 1741 atttgcccga tgtacgcgcg cgtggatgaa gaccagccct tcccggctgt gccgaaatgg 1801 tccatcaaaa aatggctttc gctacctgga gagacgcgcc cgctgatcct ttgcgaatac 1861 gcccacgcga tgggtaacag tcttggcggt ttcgctaaat actggcaggc gtttcgtcag 1921 tatccccgtt tacagggcgg cttcgtctgg gactgggtgg atcagtcgct gattaaatat 1981 gatgaaaacg gcaacccgtg gtcggcttac ggcggtgatt ttggcgatac gccgaacgat 2041 cgccagttct gtatgaacgg tctggtcttt gccgaccgca cgccgcatcc agcgctgacg 2101 gaagcaaaac accagcagca gtttttccag ttccgtttat ccgggcaaac catcgaagtg 2161 accagcgaat acctgttccg tcatagcgat aacgagctcc tgcactggat ggtggcgctg 2221 gatggtaagc cgctggcaag cggtgaagtg cctctggatg tcgctccaca aggtaaacag 2281 ttgattgaac tgcctgaact accgcagccg gagagcgccg ggcaactctg gctcacagta 2341 cgcgtagtgc aaccgaacgc gaccgcatgg tcagaagccg ggcacatcag cgcctggcag 2401 cagtggcgtc tggcggaaaa cctcagtgtg acgctccccg ccgcgtccca cgccatcccg 2461 catctgacca ccagcgaaat ggatttttgc atcgagctgg gtaataagcg ttggcaattt 2521 aaccgccagt caggctttct ttcacagatg tggattggcg ataaaaaaca actgctgacg 2581 ccgctgcgcg atcagttcac ccgtgcaccg ctggataacg acattggcgt aagtgaagcg 2641 acccgcattg accctaacgc ctgggtcgaa cgctggaagg cggcgggcca ttaccaggcc 2701 gaagcagcgt tgttgcagtg cacggcagat acacttgctg atgcggtgct gattacgacc 2761 gctcacgcgt ggcagcatca ggggaaaacc ttatttatca gccggaaaac ctaccggatt 2821 gatggtagtg gtcaaatggc gattaccgtt gatgttgaag tggcgagcga tacaccgcat 2881 ccggcgcgga ttggcctgaa ctgccagctg gcgcaggtag cagagcgggt aaactggctc 2941 ggattagggc cgcaagaaaa ctatcccgac cgccttactg ccgcctgttt tgaccgctgg 3001 gatctgccat tgtcagacat gtataccccg tacgtcttcc cgagcgaaaa cggtctgcgc 3061 tgcgggacgc gcgaattgaa ttatggccca caccagtggc gcggcgactt ccagttcaac 3121 atcagccgct acagtcaaca gcaactgatg gaaaccagcc atcgccatct gctgcacgcg 3181 gaagaaggca catggctgaa tatcgacggt ttccatatgg ggattggtgg cgacgactcc 3241 tggagcccgt cagtatcggc ggaattccaa ctgagcgccg gtcgctacca ttaccaactt 3301 gtctggtgtc aaaaataata ggggccgctg tcatcagatc gccatctcgc gcccgtgcct 3361 ctgacttcta agtccaatta ctcttcaaca tccctacatg ctctttctcc ctgtgctccc 3421 accccctatt tttgttatta tcaaaaaaac ttcttcttaa tttctttgtt ttttagcttc 3481 ttttaagtca cctctaacaa tgaaattgtg tagattcaaa aatagaatta attcgtaata 3541 aaaagtcgaa aaaaattgtg ctccctcccc ccattaataa taattctatc ccaaaatcta 3601 cacaatgttc tgtgtacact tcttatgttt tttttacttc tgataaattt tttttgaaac 3661 atcatagaaa aaaccgcaca caaaatacct tatcatatgt tacgtttcag tttatgaccg 3721 caatttttat ttcttcgcac gtctgggcct ctcatgacgt caaatcatgc tcatcgtgaa 3781 aaagttttgg agtatttttg gaatttttca atcaagtgaa agtttatgaa attaattttc 3841 ctgcttttgc tttttggggg tttcccctat tgtttgtcaa gagtttcgag gacggcgttt 3901 ttcttgctaa aatcacaagt attgatgagc acgatgcaag aaagatcgga agaaggtttg 3961 ggtttgaggc tcagtggaag gtgagtagaa gttgataatt tgaaagtgga gtagtgtcta 4021 tggggttttt gccttaaatg acagaataca ttcccaatat accaaacata actgtttcct 4081 actagtcggc cgtacgggcc ctttcgtctc gcgcgtttcg gtgatgacgg tgaaaacctc 4141 tgacacatgc agctcccgga gacggtcaca gcttgtctgt aagcggatgc cgggagcaga 4201 caagcccgtc agggcgcgtc agcgggtgtt ggcgggtgtc ggggctggct taactatgcg 4261 gcatcagagc agattgtact gagagtgcac catatgcggt gtgaaatacc gcacagatgc 4321 gtaaggagaa aataccgcat caggcggcct taagggcctc gtgatacgcc tatttttata 4381 ggttaatgtc atgataataa tggtttctta gacgtcaggt ggcacttttc ggggaaatgt 4441 gcgcggaacc cctatttgtt tatttttcta aatacattca aatatgtatc cgctcatgag 4501 acaataaccc tgataaatgc ttcaataata ttgaaaaagg aagagtatga gtattcaaca 4561 tttccgtgtc gcccttattc ccttttttgc ggcattttgc cttcctgttt ttgctcaccc 4621 agaaacgctg gtgaaagtaa aagatgctga agatcagttg ggtgcacgag tgggttacat 4681 cgaactggat ctcaacagcg gtaagatcct tgagagtttt cgccccgaag aacgttttcc 4741 aatgatgagc acttttaaag ttctgctatg tggcgcggta ttatcccgta ttgacgccgg 4801 gcaagagcaa ctcggtcgcc gcatacacta ttctcagaat gacttggttg agtactcacc 4861 agtcacagaa aagcatctta cggatggcat gacagtaaga gaattatgca gtgctgccat 4921 aaccatgagt gataacactg cggccaactt acttctgaca acgatcggag gaccgaagga 4981 gctaaccgct tttttgcaca acatggggga tcatgtaact cgccttgatc gttgggaacc 5041 ggagctgaat gaagccatac caaacgacga gcgtgacacc acgatgcctg tagcaatggc 5101 aacaacgttg cgcaaactat taactggcga actacttact ctagcttccc ggcaacaatt 5161 aatagactgg atggaggcgg ataaagttgc aggaccactt ctgcgctcgg cccttccggc 5221 tggctggttt attgctgata aatctggagc cggtgagcgt gggtctcgcg gtatcattgc 5281 agcactgggg ccagatggta agccctcccg tatcgtagtt atctacacga cggggagtca 5341 ggcaactatg gatgaacgaa atagacagat cgctgagata ggtgcctcac tgattaagca 5401 ttggtaactg tcagaccaag tttactcata tatactttag attgatttaa aacttcattt 5461 ttaatttaaa aggatctagg tgaagatcct ttttgataat ctcatgacca aaatccctta 5521 acgtgagttt tcgttccact gagcgtcaga ccccgtagaa aagatcaaag gatcttcttg 5581 agatcctttt tttctgcgcg taatctgctg cttgcaaaca aaaaaaccac cgctaccagc 5641 ggtggtttgt ttgccggatc aagagctacc aactcttttt ccgaaggtaa ctggcttcag 5701 cagagcgcag ataccaaata ctgtccttct agtgtagccg tagttaggcc accacttcaa 5761 gaactctgta gcaccgccta catacctcgc tctgctaatc ctgttaccag tggctgctgc 5821 cagtggcgat aagtcgtgtc ttaccgggtt ggactcaaga cgatagttac cggataaggc 5881 gcagcggtcg ggctgaacgg ggggttcgtg cacacagccc agcttggagc gaacgaccta 5941 caccgaactg agatacctac agcgtgagca ttgagaaagc gccacgcttc ccgaagggag 6001 aaaggcggac aggtatccgg taagcggcag ggtcggaaca ggagagcgca cgagggagct 6061 tccaggggga aacgcctggt atctttatag tcctgtcggg tttcgccacc tctgacttga 6121 gcgtcgattt ttgtgatgct cgtcaggggg gcggagccta tggaaaaacg ccagcaacgc 6181 ggccttttta cggttcctgg ccttttgctg gccttttgct cacatgttct ttcctgcgtt 6241 atcccctgat tctgtggata accgtattac cgcctttgag tgagctgata ccgctcgccg 6301 cagccgaacg accgagcgca gcgagtcagt gagcgaggaa gcggaagagc gcccaatacg 6361 caaaccgcct ctccccgcgc gttggccgat tcattaatgc agctggcacg acaggtttcc 6421 cgactggaaa gcgggcagtg agcgcaacgc aattaatgtg agttagctca ctcattaggc 6481 accccaggct ttacacttta tgcttccggc tcgtatgttg tgtggaattg tgagcggata 6541 acaatttcac acaggaaaca gct // LOCUS SYNLACZH 6562 bp ds-DNA SYN 17-JUL-1990 DEFINITION Cloning vector pPD22.04. ACCESSION M34303 KEYWORDS lacZ. SOURCE Cloning vector pPD22.04. ORGANISM Cloning vector Artificial sequences; Cloning vehicles. REFERENCE 1 (bases 1 to 6562) AUTHORS Fire,A.Z., Harrison,S. and Dixon,D. TITLE A modular set of lac-Z fusion vectors for studying gene expression in C.elegans JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by A.Z.Fire, 11-MAY-1990. Author address: A.Z.Fire Carnegie Inst of Washington Dept Embryology 115 West Univ Parkway Baltimore, MD 21210 email: AZF@JHUIGF.BITNET FEATURES from to/span description recomb 57 58 pUC19 end/synthetic start recomb 179 180 synthetic end/E.coli trpS start recomb 263 264 E.coli trpS end/synthetic start recomb 267 268 synthetic end/E.coli lacZ start recomb 3261 3262 E.coli lacZ end/synthetic start recomb 4096 4097 synthetic end/pUC19 start recomb 4345 4346 pUC19 end/synthetic start recomb 3321 3322 synthetic end/unknown DNA start recomb 4076 4077 unknown DNA end/synthetic start recomb 4356 4357 synthetic end/pUC19 start BASE COUNT 1587 a 1650 c 1722 g 1603 t ORIGIN 1 atgaccatga ttacgccaag cttgcatgcc tgcaggtcga ctctagagga tccccgggat 61 tggccaaagg acccaaaggt atgtttcgaa tgatactaac ataacataga acattttcag 121 gaggaccctt gagggtaccg agctcagaaa aaatgactgc tccaaagaag aagcgtaagg 181 taccggtggg tgaagaccag aaacagcacc tcgaactgag ccgcgatatt gcccagcgtt 241 tcaacgcgct gtatggcgag atcgatcccg tcgttttaca acgtcgtgac tgggaaaacc 301 ctggcgttac ccaacttaat cgccttgcag cacatccccc tttcgccagc tggcgtaata 361 gcgaagaggc ccgcaccgat cgcccttccc aacagttgcg cagcctgaat ggcgaatggc 421 gctttgcctg gtttccggca ccagaagcgg tgccggaaag ctggctggag tgcgatcttc 481 ctgaggccga tactgtcgtc gtcccctcaa actggcagat gcacggttac gatgcgccca 541 tctacaccaa cgtaacctat cccattacgg tcaatccgcc gtttgttccc acggagaatc 601 cgacgggttg ttactcgctc acatttaatg ttgatgaaag ctggctacag gaaggccaga 661 cgcgaattat ttttgatggc gttaactcgg cgtttcatct gtggtgcaac gggcgctggg 721 tcggttacgg ccaggacagt cgtttgccgt ctgaatttga cctgagcgca tttttacgcg 781 ccggagaaaa ccgcctcgcg gtgatggtgc tgcgttggag tgacggcagt tatctggaag 841 atcaggatat gtggcggatg agcggcattt tccgtgacgt ctcgttgctg cataaaccga 901 ctacacaaat cagcgatttc catgttgcca ctcgctttaa tgatgatttc agccgcgctg 961 tactggaggc tgaagttcag atgtgcggcg agttgcgtga ctacctacgg gtaacagttt 1021 ctttatggca gggtgaaacg caggtcgcca gcggcaccgc gcctttcggc ggtgaaatta 1081 tcgatgagcg tggtggttat gccgatcgcg tcacactacg tctgaacgtc gaaaacccga 1141 aactgtggag cgccgaaatc ccgaatctct atcgtgcggt ggttgaactg cacaccgccg 1201 acggcacgct gattgaagca gaagcctgcg atgtcggttt ccgcgaggtg cggattgaaa 1261 atggtctgct gctgctgaac ggcaagccgt tgctgattcg aggcgttaac cgtcacgagc 1321 atcatcctct gcatggtcag gtcatggatg agcagacgat ggtgcaggat atcctgctga 1381 tgaagcagaa caactttaac gccgtgcgct gttcgcatta tccgaaccat ccgctgtggt 1441 acacgctgtg cgaccgctac ggcctgtatg tggtggatga agccaatatt gaaacccacg 1501 gcatggtgcc aatgaatcgt ctgaccgatg atccgcgctg gctaccggcg atgagcgaac 1561 gcgtaacgcg aatggtgcag cgcgatcgta atcacccgag tgtgatcatc tggtcgctgg 1621 ggaatgaatc aggccacggc gctaatcacg acgcgctgta tcgctggatc aaatctgtcg 1681 atccttcccg cccggtgcag tatgaaggcg gcggagccga caccacggcc accgatatta 1741 tttgcccgat gtacgcgcgc gtggatgaag accagccctt cccggctgtg ccgaaatggt 1801 ccatcaaaaa atggctttcg ctacctggag agacgcgccc gctgatcctt tgcgaatacg 1861 cccacgcgat gggtaacagt cttggcggtt tcgctaaata ctggcaggcg tttcgtcagt 1921 atccccgttt acagggcggc ttcgtctggg actgggtgga tcagtcgctg attaaatatg 1981 atgaaaacgg caacccgtgg tcggcttacg gcggtgattt tggcgatacg ccgaacgatc 2041 gccagttctg tatgaacggt ctggtctttg ccgaccgcac gccgcatcca gcgctgacgg 2101 aagcaaaaca ccagcagcag tttttccagt tccgtttatc cgggcaaacc atcgaagtga 2161 ccagcgaata cctgttccgt catagcgata acgagctcct gcactggatg gtggcgctgg 2221 atggtaagcc gctggcaagc ggtgaagtgc ctctggatgt cgctccacaa ggtaaacagt 2281 tgattgaact gcctgaacta ccgcagccgg agagcgccgg gcaactctgg ctcacagtac 2341 gcgtagtgca accgaacgcg accgcatggt cagaagccgg gcacatcagc gcctggcagc 2401 agtggcgtct ggcggaaaac ctcagtgtga cgctccccgc cgcgtcccac gccatcccgc 2461 atctgaccac cagcgaaatg gatttttgca tcgagctggg taataagcgt tggcaattta 2521 accgccagtc aggctttctt tcacagatgt ggattggcga taaaaaacaa ctgctgacgc 2581 cgctgcgcga tcagttcacc cgtgcaccgc tggataacga cattggcgta agtgaagcga 2641 cccgcattga ccctaacgcc tgggtcgaac gctggaaggc ggcgggccat taccaggccg 2701 aagcagcgtt gttgcagtgc acggcagata cacttgctga tgcggtgctg attacgaccg 2761 ctcacgcgtg gcagcatcag gggaaaacct tatttatcag ccggaaaacc taccggattg 2821 atggtagtgg tcaaatggcg attaccgttg atgttgaagt ggcgagcgat acaccgcatc 2881 cggcgcggat tggcctgaac tgccagctgg cgcaggtagc agagcgggta aactggctcg 2941 gattagggcc gcaagaaaac tatcccgacc gccttactgc cgcctgtttt gaccgctggg 3001 atctgccatt gtcagacatg tataccccgt acgtcttccc gagcgaaaac ggtctgcgct 3061 gcgggacgcg cgaattgaat tatggcccac accagtggcg cggcgacttc cagttcaaca 3121 tcagccgcta cagtcaacag caactgatgg aaaccagcca tcgccatctg ctgcacgcgg 3181 aagaaggcac atggctgaat atcgacggtt tccatatggg gattggtggc gacgactcct 3241 ggagcccgtc agtatcggcg gaattccaac tgagcgccgg tcgctaccat taccaacttg 3301 tctggtgtca aaaataatag gggccgctgt catcagatcg ccatctcgcg cccgtgcctc 3361 tgacttctaa gtccaattac tcttcaacat ccctacatgc tctttctccc tgtgctccca 3421 ccccctattt ttgttattat caaaaaaact tcttcttaat ttctttgttt tttagcttct 3481 tttaagtcac ctctaacaat gaaattgtgt agattcaaaa atagaattaa ttcgtaataa 3541 aaagtcgaaa aaaattgtgc tccctccccc cattaataat aattctatcc caaaatctac 3601 acaatgttct gtgtacactt cttatgtttt ttttacttct gataaatttt ttttgaaaca 3661 tcatagaaaa aaccgcacac aaaatacctt atcatatgtt acgtttcagt ttatgaccgc 3721 aatttttatt tcttcgcacg tctgggcctc tcatgacgtc aaatcatgct catcgtgaaa 3781 aagttttgga gtatttttgg aatttttcaa tcaagtgaaa gtttatgaaa ttaattttcc 3841 tgcttttgct ttttgggggt ttcccctatt gtttgtcaag agtttcgagg acggcgtttt 3901 tcttgctaaa atcacaagta ttgatgagca cgatgcaaga aagatcggaa gaaggtttgg 3961 gtttgaggct cagtggaagg tgagtagaag ttgataattt gaaagtggag tagtgtctat 4021 ggggtttttg ccttaaatga cagaatacat tcccaatata ccaaacataa ctgtttccta 4081 ctagtcggcc gtacgggccc tttcgtctcg cgcgtttcgg tgatgacggt gaaaacctct 4141 gacacatgca gctcccggag acggtcacag cttgtctgta agcggatgcc gggagcagac 4201 aagcccgtca gggcgcgtca gcgggtgttg gcgggtgtcg gggctggctt aactatgcgg 4261 catcagagca gattgtactg agagtgcacc atatgcggtg tgaaataccg cacagatgcg 4321 taaggagaaa ataccgcatc aggcggcctt aagggcctcg tgatacgcct atttttatag 4381 gttaatgtca tgataataat ggtttcttag acgtcaggtg gcacttttcg gggaaatgtg 4441 cgcggaaccc ctatttgttt atttttctaa atacattcaa atatgtatcc gctcatgaga 4501 caataaccct gataaatgct tcaataatat tgaaaaagga agagtatgag tattcaacat 4561 ttccgtgtcg cccttattcc cttttttgcg gcattttgcc ttcctgtttt tgctcaccca 4621 gaaacgctgg tgaaagtaaa agatgctgaa gatcagttgg gtgcacgagt gggttacatc 4681 gaactggatc tcaacagcgg taagatcctt gagagttttc gccccgaaga acgttttcca 4741 atgatgagca cttttaaagt tctgctatgt ggcgcggtat tatcccgtat tgacgccggg 4801 caagagcaac tcggtcgccg catacactat tctcagaatg acttggttga gtactcacca 4861 gtcacagaaa agcatcttac ggatggcatg acagtaagag aattatgcag tgctgccata 4921 accatgagtg ataacactgc ggccaactta cttctgacaa cgatcggagg accgaaggag 4981 ctaaccgctt ttttgcacaa catgggggat catgtaactc gccttgatcg ttgggaaccg 5041 gagctgaatg aagccatacc aaacgacgag cgtgacacca cgatgcctgt agcaatggca 5101 acaacgttgc gcaaactatt aactggcgaa ctacttactc tagcttcccg gcaacaatta 5161 atagactgga tggaggcgga taaagttgca ggaccacttc tgcgctcggc ccttccggct 5221 ggctggttta ttgctgataa atctggagcc ggtgagcgtg ggtctcgcgg tatcattgca 5281 gcactggggc cagatggtaa gccctcccgt atcgtagtta tctacacgac ggggagtcag 5341 gcaactatgg atgaacgaaa tagacagatc gctgagatag gtgcctcact gattaagcat 5401 tggtaactgt cagaccaagt ttactcatat atactttaga ttgatttaaa acttcatttt 5461 taatttaaaa ggatctaggt gaagatcctt tttgataatc tcatgaccaa aatcccttaa 5521 cgtgagtttt cgttccactg agcgtcagac cccgtagaaa agatcaaagg atcttcttga 5581 gatccttttt ttctgcgcgt aatctgctgc ttgcaaacaa aaaaaccacc gctaccagcg 5641 gtggtttgtt tgccggatca agagctacca actctttttc cgaaggtaac tggcttcagc 5701 agagcgcaga taccaaatac tgtccttcta gtgtagccgt agttaggcca ccacttcaag 5761 aactctgtag caccgcctac atacctcgct ctgctaatcc tgttaccagt ggctgctgcc 5821 agtggcgata agtcgtgtct taccgggttg gactcaagac gatagttacc ggataaggcg 5881 cagcggtcgg gctgaacggg gggttcgtgc acacagccca gcttggagcg aacgacctac 5941 accgaactga gatacctaca gcgtgagcat tgagaaagcg ccacgcttcc cgaagggaga 6001 aaggcggaca ggtatccggt aagcggcagg gtcggaacag gagagcgcac gagggagctt 6061 ccagggggaa acgcctggta tctttatagt cctgtcgggt ttcgccacct ctgacttgag 6121 cgtcgatttt tgtgatgctc gtcagggggg cggagcctat ggaaaaacgc cagcaacgcg 6181 gcctttttac ggttcctggc cttttgctgg ccttttgctc acatgttctt tcctgcgtta 6241 tcccctgatt ctgtggataa ccgtattacc gcctttgagt gagctgatac cgctcgccgc 6301 agccgaacga ccgagcgcag cgagtcagtg agcgaggaag cggaagagcg cccaatacgc 6361 aaaccgcctc tccccgcgcg ttggccgatt cattaatgca gctggcacga caggtttccc 6421 gactggaaag cgggcagtga gcgcaacgca attaatgtga gttagctcac tcattaggca 6481 ccccaggctt tacactttat gcttccggct cgtatgttgt gtggaattgt gagcggataa 6541 caatttcaca caggaaacag ct // LOCUS SYNLACZI 6567 bp ds-DNA SYN 17-JUL-1990 DEFINITION Cloning vector pPD22.11. ACCESSION M34304 KEYWORDS lacZ. SOURCE Cloning vector pPD22.11. ORGANISM Cloning vector Artificial sequences; Cloning vehicles. REFERENCE 1 (bases 1 to 6567) AUTHORS Fire,A.Z., Harrison,S. and Dixon,D. TITLE A modular set of lac-Z fusion vectors for studying gene expression in C.elegans JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by A.Z.Fire, 11-MAY-1990. Author address: A.Z.Fire Carnegie Inst of Washington Dept Embryology 115 West Univ Parkway Baltimore, MD 21210 email: AZF@JHUIGF.BITNET FEATURES from to/span description recomb 57 58 pUC19 end/synthetic start recomb 184 185 synthetic end/E.coli trpS start recomb 268 269 E.coli trpS end/synthetic start recomb 272 273 synthetic end/E.coli lacZ start recomb 3266 3267 E.coli lacZ end/synthetic start recomb 4101 4102 synthetic end/pUC19 start recomb 4350 4351 pUC19 end/synthetic start recomb 3326 3327 synthetic end/unknown DNA start recomb 4081 4082 unknown DNA end/synthetic start recomb 4361 4362 synthetic end/pUC19 start BASE COUNT 1587 a 1651 c 1724 g 1605 t ORIGIN 1 atgaccatga ttacgccaag cttgcatgcc tgcaggtcga ctctagagga tccccgggat 61 tggccaaagg acccaaaggt atgtttcgaa tgatactaac ataacataga acattttcag 121 gaggaccctt gcttggaggg taccgagctc agaaaaaatg actgctccaa agaagaagcg 181 taaggtaccg gtgggtgaag accagaaaca gcacctcgaa ctgagccgcg atattgccca 241 gcgtttcaac gcgctgtatg gcgagatcga tcccgtcgtt ttacaacgtc gtgactggga 301 aaaccctggc gttacccaac ttaatcgcct tgcagcacat ccccctttcg ccagctggcg 361 taatagcgaa gaggcccgca ccgatcgccc ttcccaacag ttgcgcagcc tgaatggcga 421 atggcgcttt gcctggtttc cggcaccaga agcggtgccg gaaagctggc tggagtgcga 481 tcttcctgag gccgatactg tcgtcgtccc ctcaaactgg cagatgcacg gttacgatgc 541 gcccatctac accaacgtaa cctatcccat tacggtcaat ccgccgtttg ttcccacgga 601 gaatccgacg ggttgttact cgctcacatt taatgttgat gaaagctggc tacaggaagg 661 ccagacgcga attatttttg atggcgttaa ctcggcgttt catctgtggt gcaacgggcg 721 ctgggtcggt tacggccagg acagtcgttt gccgtctgaa tttgacctga gcgcattttt 781 acgcgccgga gaaaaccgcc tcgcggtgat ggtgctgcgt tggagtgacg gcagttatct 841 ggaagatcag gatatgtggc ggatgagcgg cattttccgt gacgtctcgt tgctgcataa 901 accgactaca caaatcagcg atttccatgt tgccactcgc tttaatgatg atttcagccg 961 cgctgtactg gaggctgaag ttcagatgtg cggcgagttg cgtgactacc tacgggtaac 1021 agtttcttta tggcagggtg aaacgcaggt cgccagcggc accgcgcctt tcggcggtga 1081 aattatcgat gagcgtggtg gttatgccga tcgcgtcaca ctacgtctga acgtcgaaaa 1141 cccgaaactg tggagcgccg aaatcccgaa tctctatcgt gcggtggttg aactgcacac 1201 cgccgacggc acgctgattg aagcagaagc ctgcgatgtc ggtttccgcg aggtgcggat 1261 tgaaaatggt ctgctgctgc tgaacggcaa gccgttgctg attcgaggcg ttaaccgtca 1321 cgagcatcat cctctgcatg gtcaggtcat ggatgagcag acgatggtgc aggatatcct 1381 gctgatgaag cagaacaact ttaacgccgt gcgctgttcg cattatccga accatccgct 1441 gtggtacacg ctgtgcgacc gctacggcct gtatgtggtg gatgaagcca atattgaaac 1501 ccacggcatg gtgccaatga atcgtctgac cgatgatccg cgctggctac cggcgatgag 1561 cgaacgcgta acgcgaatgg tgcagcgcga tcgtaatcac ccgagtgtga tcatctggtc 1621 gctggggaat gaatcaggcc acggcgctaa tcacgacgcg ctgtatcgct ggatcaaatc 1681 tgtcgatcct tcccgcccgg tgcagtatga aggcggcgga gccgacacca cggccaccga 1741 tattatttgc ccgatgtacg cgcgcgtgga tgaagaccag cccttcccgg ctgtgccgaa 1801 atggtccatc aaaaaatggc tttcgctacc tggagagacg cgcccgctga tcctttgcga 1861 atacgcccac gcgatgggta acagtcttgg cggtttcgct aaatactggc aggcgtttcg 1921 tcagtatccc cgtttacagg gcggcttcgt ctgggactgg gtggatcagt cgctgattaa 1981 atatgatgaa aacggcaacc cgtggtcggc ttacggcggt gattttggcg atacgccgaa 2041 cgatcgccag ttctgtatga acggtctggt ctttgccgac cgcacgccgc atccagcgct 2101 gacggaagca aaacaccagc agcagttttt ccagttccgt ttatccgggc aaaccatcga 2161 agtgaccagc gaatacctgt tccgtcatag cgataacgag ctcctgcact ggatggtggc 2221 gctggatggt aagccgctgg caagcggtga agtgcctctg gatgtcgctc cacaaggtaa 2281 acagttgatt gaactgcctg aactaccgca gccggagagc gccgggcaac tctggctcac 2341 agtacgcgta gtgcaaccga acgcgaccgc atggtcagaa gccgggcaca tcagcgcctg 2401 gcagcagtgg cgtctggcgg aaaacctcag tgtgacgctc cccgccgcgt cccacgccat 2461 cccgcatctg accaccagcg aaatggattt ttgcatcgag ctgggtaata agcgttggca 2521 atttaaccgc cagtcaggct ttctttcaca gatgtggatt ggcgataaaa aacaactgct 2581 gacgccgctg cgcgatcagt tcacccgtgc accgctggat aacgacattg gcgtaagtga 2641 agcgacccgc attgacccta acgcctgggt cgaacgctgg aaggcggcgg gccattacca 2701 ggccgaagca gcgttgttgc agtgcacggc agatacactt gctgatgcgg tgctgattac 2761 gaccgctcac gcgtggcagc atcaggggaa aaccttattt atcagccgga aaacctaccg 2821 gattgatggt agtggtcaaa tggcgattac cgttgatgtt gaagtggcga gcgatacacc 2881 gcatccggcg cggattggcc tgaactgcca gctggcgcag gtagcagagc gggtaaactg 2941 gctcggatta gggccgcaag aaaactatcc cgaccgcctt actgccgcct gttttgaccg 3001 ctgggatctg ccattgtcag acatgtatac cccgtacgtc ttcccgagcg aaaacggtct 3061 gcgctgcggg acgcgcgaat tgaattatgg cccacaccag tggcgcggcg acttccagtt 3121 caacatcagc cgctacagtc aacagcaact gatggaaacc agccatcgcc atctgctgca 3181 cgcggaagaa ggcacatggc tgaatatcga cggtttccat atggggattg gtggcgacga 3241 ctcctggagc ccgtcagtat cggcggaatt ccaactgagc gccggtcgct accattacca 3301 acttgtctgg tgtcaaaaat aataggggcc gctgtcatca gatcgccatc tcgcgcccgt 3361 gcctctgact tctaagtcca attactcttc aacatcccta catgctcttt ctccctgtgc 3421 tcccaccccc tatttttgtt attatcaaaa aaacttcttc ttaatttctt tgttttttag 3481 cttcttttaa gtcacctcta acaatgaaat tgtgtagatt caaaaataga attaattcgt 3541 aataaaaagt cgaaaaaaat tgtgctccct ccccccatta ataataattc tatcccaaaa 3601 tctacacaat gttctgtgta cacttcttat gtttttttta cttctgataa attttttttg 3661 aaacatcata gaaaaaaccg cacacaaaat accttatcat atgttacgtt tcagtttatg 3721 accgcaattt ttatttcttc gcacgtctgg gcctctcatg acgtcaaatc atgctcatcg 3781 tgaaaaagtt ttggagtatt tttggaattt ttcaatcaag tgaaagttta tgaaattaat 3841 tttcctgctt ttgctttttg ggggtttccc ctattgtttg tcaagagttt cgaggacggc 3901 gtttttcttg ctaaaatcac aagtattgat gagcacgatg caagaaagat cggaagaagg 3961 tttgggtttg aggctcagtg gaaggtgagt agaagttgat aatttgaaag tggagtagtg 4021 tctatggggt ttttgcctta aatgacagaa tacattccca atataccaaa cataactgtt 4081 tcctactagt cggccgtacg ggccctttcg tctcgcgcgt ttcggtgatg acggtgaaaa 4141 cctctgacac atgcagctcc cggagacggt cacagcttgt ctgtaagcgg atgccgggag 4201 cagacaagcc cgtcagggcg cgtcagcggg tgttggcggg tgtcggggct ggcttaacta 4261 tgcggcatca gagcagattg tactgagagt gcaccatatg cggtgtgaaa taccgcacag 4321 atgcgtaagg agaaaatacc gcatcaggcg gccttaaggg cctcgtgata cgcctatttt 4381 tataggttaa tgtcatgata ataatggttt cttagacgtc aggtggcact tttcggggaa 4441 atgtgcgcgg aacccctatt tgtttatttt tctaaataca ttcaaatatg tatccgctca 4501 tgagacaata accctgataa atgcttcaat aatattgaaa aaggaagagt atgagtattc 4561 aacatttccg tgtcgccctt attccctttt ttgcggcatt ttgccttcct gtttttgctc 4621 acccagaaac gctggtgaaa gtaaaagatg ctgaagatca gttgggtgca cgagtgggtt 4681 acatcgaact ggatctcaac agcggtaaga tccttgagag ttttcgcccc gaagaacgtt 4741 ttccaatgat gagcactttt aaagttctgc tatgtggcgc ggtattatcc cgtattgacg 4801 ccgggcaaga gcaactcggt cgccgcatac actattctca gaatgacttg gttgagtact 4861 caccagtcac agaaaagcat cttacggatg gcatgacagt aagagaatta tgcagtgctg 4921 ccataaccat gagtgataac actgcggcca acttacttct gacaacgatc ggaggaccga 4981 aggagctaac cgcttttttg cacaacatgg gggatcatgt aactcgcctt gatcgttggg 5041 aaccggagct gaatgaagcc ataccaaacg acgagcgtga caccacgatg cctgtagcaa 5101 tggcaacaac gttgcgcaaa ctattaactg gcgaactact tactctagct tcccggcaac 5161 aattaataga ctggatggag gcggataaag ttgcaggacc acttctgcgc tcggcccttc 5221 cggctggctg gtttattgct gataaatctg gagccggtga gcgtgggtct cgcggtatca 5281 ttgcagcact ggggccagat ggtaagccct cccgtatcgt agttatctac acgacgggga 5341 gtcaggcaac tatggatgaa cgaaatagac agatcgctga gataggtgcc tcactgatta 5401 agcattggta actgtcagac caagtttact catatatact ttagattgat ttaaaacttc 5461 atttttaatt taaaaggatc taggtgaaga tcctttttga taatctcatg accaaaatcc 5521 cttaacgtga gttttcgttc cactgagcgt cagaccccgt agaaaagatc aaaggatctt 5581 cttgagatcc tttttttctg cgcgtaatct gctgcttgca aacaaaaaaa ccaccgctac 5641 cagcggtggt ttgtttgccg gatcaagagc taccaactct ttttccgaag gtaactggct 5701 tcagcagagc gcagatacca aatactgtcc ttctagtgta gccgtagtta ggccaccact 5761 tcaagaactc tgtagcaccg cctacatacc tcgctctgct aatcctgtta ccagtggctg 5821 ctgccagtgg cgataagtcg tgtcttaccg ggttggactc aagacgatag ttaccggata 5881 aggcgcagcg gtcgggctga acggggggtt cgtgcacaca gcccagcttg gagcgaacga 5941 cctacaccga actgagatac ctacagcgtg agcattgaga aagcgccacg cttcccgaag 6001 ggagaaaggc ggacaggtat ccggtaagcg gcagggtcgg aacaggagag cgcacgaggg 6061 agcttccagg gggaaacgcc tggtatcttt atagtcctgt cgggtttcgc cacctctgac 6121 ttgagcgtcg atttttgtga tgctcgtcag gggggcggag cctatggaaa aacgccagca 6181 acgcggcctt tttacggttc ctggcctttt gctggccttt tgctcacatg ttctttcctg 6241 cgttatcccc tgattctgtg gataaccgta ttaccgcctt tgagtgagct gataccgctc 6301 gccgcagccg aacgaccgag cgcagcgagt cagtgagcga ggaagcggaa gagcgcccaa 6361 tacgcaaacc gcctctcccc gcgcgttggc cgattcatta atgcagctgg cacgacaggt 6421 ttcccgactg gaaagcgggc agtgagcgca acgcaattaa tgtgagttag ctcactcatt 6481 aggcacccca ggctttacac tttatgcttc cggctcgtat gttgtgtgga attgtgagcg 6541 gataacaatt tcacacagga aacagct // LOCUS SYNLACZJ 7242 bp ds-DNA SYN 17-JUL-1990 DEFINITION Cloning vector pPD26.77. ACCESSION M34305 KEYWORDS lacZ. SOURCE Cloning vector pPD26.77. ORGANISM Cloning vector Artificial sequences; Cloning vehicles. REFERENCE 1 (bases 1 to 7242) AUTHORS Fire,A.Z., Harrison,S. and Dixon,D. TITLE A modular set of lac-Z fusion vectors for studying gene expression in C.elegans JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by A.Z.Fire, 11-MAY-1990. Author address: A.Z.Fire Carnegie Inst of Washington Dept Embryology 115 West Univ Parkway Baltimore, MD 21210 email: AZF@JHUIGF.BITNET FEATURES from to/span description recomb 57 58 pUC19 end/synthetic start recomb 102 103 synthetic end/E.coli trpS start recomb 186 187 E.coli trpS end/synthetic start recomb 190 191 synthetic end/E.coli lacZ start recomb 3184 3185 E.coli lacZ end/synthetic start recomb 4776 4777 synthetic end/pUC19 start recomb 5025 5026 pUC19 end/synthetic start recomb 3244 3245 synthetic end/unknown DNA start recomb 4756 4757 unknown DNA end/synthetic start recomb 5036 5037 synthetic end/pUC19 start BASE COUNT 1780 a 1801 c 1846 g 1815 t ORIGIN 1 atgaccatga ttacgccaag cttgcatgcc tgcaggtcga ctctagagga tccccgggta 61 ccgagctcag aaaaaatgac tgctccaaag aagaagcgta aggtaccggt gggtgaagac 121 cagaaacagc acctcgaact gagccgcgat attgcccagc gtttcaacgc gctgtatggc 181 gagatcgatc ccgtcgtttt acaacgtcgt gactgggaaa accctggcgt tacccaactt 241 aatcgccttg cagcacatcc ccctttcgcc agctggcgta atagcgaaga ggcccgcacc 301 gatcgccctt cccaacagtt gcgcagcctg aatggcgaat ggcgctttgc ctggtttccg 361 gcaccagaag cggtgccgga aagctggctg gagtgcgatc ttcctgaggc cgatactgtc 421 gtcgtcccct caaactggca gatgcacggt tacgatgcgc ccatctacac caacgtaacc 481 tatcccatta cggtcaatcc gccgtttgtt cccacggaga atccgacggg ttgttactcg 541 ctcacattta atgttgatga aagctggcta caggaaggcc agacgcgaat tatttttgat 601 ggcgttaact cggcgtttca tctgtggtgc aacgggcgct gggtcggtta cggccaggac 661 agtcgtttgc cgtctgaatt tgacctgagc gcatttttac gcgccggaga aaaccgcctc 721 gcggtgatgg tgctgcgttg gagtgacggc agttatctgg aagatcagga tatgtggcgg 781 atgagcggca ttttccgtga cgtctcgttg ctgcataaac cgactacaca aatcagcgat 841 ttccatgttg ccactcgctt taatgatgat ttcagccgcg ctgtactgga ggctgaagtt 901 cagatgtgcg gcgagttgcg tgactaccta cgggtaacag tttctttatg gcagggtgaa 961 acgcaggtcg ccagcggcac cgcgcctttc ggcggtgaaa ttatcgatga gcgtggtggt 1021 tatgccgatc gcgtcacact acgtctgaac gtcgaaaacc cgaaactgtg gagcgccgaa 1081 atcccgaatc tctatcgtgc ggtggttgaa ctgcacaccg ccgacggcac gctgattgaa 1141 gcagaagcct gcgatgtcgg tttccgcgag gtgcggattg aaaatggtct gctgctgctg 1201 aacggcaagc cgttgctgat tcgaggcgtt aaccgtcacg agcatcatcc tctgcatggt 1261 caggtcatgg atgagcagac gatggtgcag gatatcctgc tgatgaagca gaacaacttt 1321 aacgccgtgc gctgttcgca ttatccgaac catccgctgt ggtacacgct gtgcgaccgc 1381 tacggcctgt atgtggtgga tgaagccaat attgaaaccc acggcatggt gccaatgaat 1441 cgtctgaccg atgatccgcg ctggctaccg gcgatgagcg aacgcgtaac gcgaatggtg 1501 cagcgcgatc gtaatcaccc gagtgtgatc atctggtcgc tggggaatga atcaggccac 1561 ggcgctaatc acgacgcgct gtatcgctgg atcaaatctg tcgatccttc ccgcccggtg 1621 cagtatgaag gcggcggagc cgacaccacg gccaccgata ttatttgccc gatgtacgcg 1681 cgcgtggatg aagaccagcc cttcccggct gtgccgaaat ggtccatcaa aaaatggctt 1741 tcgctacctg gagagacgcg cccgctgatc ctttgcgaat acgcccacgc gatgggtaac 1801 agtcttggcg gtttcgctaa atactggcag gcgtttcgtc agtatccccg tttacagggc 1861 ggcttcgtct gggactgggt ggatcagtcg ctgattaaat atgatgaaaa cggcaacccg 1921 tggtcggctt acggcggtga ttttggcgat acgccgaacg atcgccagtt ctgtatgaac 1981 ggtctggtct ttgccgaccg cacgccgcat ccagcgctga cggaagcaaa acaccagcag 2041 cagtttttcc agttccgttt atccgggcaa accatcgaag tgaccagcga atacctgttc 2101 cgtcatagcg ataacgagct cctgcactgg atggtggcgc tggatggtaa gccgctggca 2161 agcggtgaag tgcctctgga tgtcgctcca caaggtaaac agttgattga actgcctgaa 2221 ctaccgcagc cggagagcgc cgggcaactc tggctcacag tacgcgtagt gcaaccgaac 2281 gcgaccgcat ggtcagaagc cgggcacatc agcgcctggc agcagtggcg tctggcggaa 2341 aacctcagtg tgacgctccc cgccgcgtcc cacgccatcc cgcatctgac caccagcgaa 2401 atggattttt gcatcgagct gggtaataag cgttggcaat ttaaccgcca gtcaggcttt 2461 ctttcacaga tgtggattgg cgataaaaaa caactgctga cgccgctgcg cgatcagttc 2521 acccgtgcac cgctggataa cgacattggc gtaagtgaag cgacccgcat tgaccctaac 2581 gcctgggtcg aacgctggaa ggcggcgggc cattaccagg ccgaagcagc gttgttgcag 2641 tgcacggcag atacacttgc tgatgcggtg ctgattacga ccgctcacgc gtggcagcat 2701 caggggaaaa ccttatttat cagccggaaa acctaccgga ttgatggtag tggtcaaatg 2761 gcgattaccg ttgatgttga agtggcgagc gatacaccgc atccggcgcg gattggcctg 2821 aactgccagc tggcgcaggt agcagagcgg gtaaactggc tcggattagg gccgcaagaa 2881 aactatcccg accgccttac tgccgcctgt tttgaccgct gggatctgcc attgtcagac 2941 atgtataccc cgtacgtctt cccgagcgaa aacggtctgc gctgcgggac gcgcgaattg 3001 aattatggcc cacaccagtg gcgcggcgac ttccagttca acatcagccg ctacagtcaa 3061 cagcaactga tggaaaccag ccatcgccat ctgctgcacg cggaagaagg cacatggctg 3121 aatatcgacg gtttccatat ggggattggt ggcgacgact cctggagccc gtcagtatcg 3181 gcggaattcc aactgagcgc cggtcgctac cattaccaac ttgtctggtg tcaaaaataa 3241 taggcgaaac aaatcatctg acaccaccac cgtctgatgg atcgttctca tctccgtctc 3301 cacattatta tccgacgact acatcgacac cgaatcgaat ggaaacaagt ccggagtaca 3361 tgtttaacca tgaaatggtg ggtagatgat tattaaaatg tttaagaaaa ttaaataatt 3421 tgttttaggc accaccggtc aatgcgatgt ggtatactac accacctcct tatcaagatc 3481 caaactatcg tcatgtgcct ccaaatactg catttcaaaa tgcagagcaa atgaatggct 3541 ccttctactg ttaatctatt taattcatta atttttcatt tattgactgt atcccggatg 3601 tttcttgtcc tcccaacata tctcctaact gctcggttca ttttaaatat gctcatctca 3661 ctacatcacc cagacactgg tccccacaga gttttttgta tactatttcg ggtcattttt 3721 cttattctag actaatattg taagctataa gttgtagaat aattattgat ccaaatcaga 3781 ttaagagtat aagctttgtt ttttctcctt ttctttataa cttgttacaa tttttgaaat 3841 tccctttttt gacaggcttt tattacactg taactgtgtt tcttatcttg caaacattta 3901 atgaattgta attctttagt atcttgaggg ctttttgttt ttcgaattat tgaagctcaa 3961 agttccagtt ttactacgat ccagcgaatt ctcctcattt cgatccgatg caattgactt 4021 cagatcaata ttggttgcct gaaagaaata attgtgagca tttttgtcaa aaaacagaga 4081 actcaccatt ctcgaggctc ccgttccagg agcagtactt ggtgatggac acgtagattg 4141 attaaaccaa accaaaggtt ctttcagagt caacttacag cctcgagcgt agtccgtgat 4201 agcttctcgc agaacactga aaattggaaa tttattggaa taaaaacttt ttctgcactt 4261 tatagaataa aaaaatcatg aatttacccg aatttaacct ccgaatcgta gaccaaattg 4321 tccaagtaga tggaaatcac cttgaacatc ggatgttttt catatgctga aaataaatta 4381 atgaatttat gtaatttttt aaataattac ttttcaattt ggtgaacaat tcctgcttct 4441 ttgcataggc atctggacga gtgagtcctt tccaatcaat caatgtggtg tcgacctcga 4501 gggggggccc ggtacccagc ttttgttccc tttagtgagg gttaattccg agcttggcgt 4561 aatcatggtc atagctgttt cctgtgtgaa attgttatcc gctcacaatt ccacacaaca 4621 taggagccgg aagcataaag tgtaaagcct ggggtgccta atgagtgagg taactcacat 4681 taattgcgtt gcgctcactg cccgctttcc agtcgggaaa cctgtcgtgc cagctgcatt 4741 aatgaatcgg ccaacgccta ctagtcggcc gtacgggccc tttcgtctcg cgcgtttcgg 4801 tgatgacggt gaaaacctct gacacatgca gctcccggag acggtcacag cttgtctgta 4861 agcggatgcc gggagcagac aagcccgtca gggcgcgtca gcgggtgttg gcgggtgtcg 4921 gggctggctt aactatgcgg catcagagca gattgtactg agagtgcacc atatgcggtg 4981 tgaaataccg cacagatgcg taaggagaaa ataccgcatc aggcggcctt aagggcctcg 5041 tgatacgcct atttttatag gttaatgtca tgataataat ggtttcttag acgtcaggtg 5101 gcacttttcg gggaaatgtg cgcggaaccc ctatttgttt atttttctaa atacattcaa 5161 atatgtatcc gctcatgaga caataaccct gataaatgct tcaataatat tgaaaaagga 5221 agagtatgag tattcaacat ttccgtgtcg cccttattcc cttttttgcg gcattttgcc 5281 ttcctgtttt tgctcaccca gaaacgctgg tgaaagtaaa agatgctgaa gatcagttgg 5341 gtgcacgagt gggttacatc gaactggatc tcaacagcgg taagatcctt gagagttttc 5401 gccccgaaga acgttttcca atgatgagca cttttaaagt tctgctatgt ggcgcggtat 5461 tatcccgtat tgacgccggg caagagcaac tcggtcgccg catacactat tctcagaatg 5521 acttggttga gtactcacca gtcacagaaa agcatcttac ggatggcatg acagtaagag 5581 aattatgcag tgctgccata accatgagtg ataacactgc ggccaactta cttctgacaa 5641 cgatcggagg accgaaggag ctaaccgctt ttttgcacaa catgggggat catgtaactc 5701 gccttgatcg ttgggaaccg gagctgaatg aagccatacc aaacgacgag cgtgacacca 5761 cgatgcctgt agcaatggca acaacgttgc gcaaactatt aactggcgaa ctacttactc 5821 tagcttcccg gcaacaatta atagactgga tggaggcgga taaagttgca ggaccacttc 5881 tgcgctcggc ccttccggct ggctggttta ttgctgataa atctggagcc ggtgagcgtg 5941 ggtctcgcgg tatcattgca gcactggggc cagatggtaa gccctcccgt atcgtagtta 6001 tctacacgac ggggagtcag gcaactatgg atgaacgaaa tagacagatc gctgagatag 6061 gtgcctcact gattaagcat tggtaactgt cagaccaagt ttactcatat atactttaga 6121 ttgatttaaa acttcatttt taatttaaaa ggatctaggt gaagatcctt tttgataatc 6181 tcatgaccaa aatcccttaa cgtgagtttt cgttccactg agcgtcagac cccgtagaaa 6241 agatcaaagg atcttcttga gatccttttt ttctgcgcgt aatctgctgc ttgcaaacaa 6301 aaaaaccacc gctaccagcg gtggtttgtt tgccggatca agagctacca actctttttc 6361 cgaaggtaac tggcttcagc agagcgcaga taccaaatac tgtccttcta gtgtagccgt 6421 agttaggcca ccacttcaag aactctgtag caccgcctac atacctcgct ctgctaatcc 6481 tgttaccagt ggctgctgcc agtggcgata agtcgtgtct taccgggttg gactcaagac 6541 gatagttacc ggataaggcg cagcggtcgg gctgaacggg gggttcgtgc acacagccca 6601 gcttggagcg aacgacctac accgaactga gatacctaca gcgtgagcat tgagaaagcg 6661 ccacgcttcc cgaagggaga aaggcggaca ggtatccggt aagcggcagg gtcggaacag 6721 gagagcgcac gagggagctt ccagggggaa acgcctggta tctttatagt cctgtcgggt 6781 ttcgccacct ctgacttgag cgtcgatttt tgtgatgctc gtcagggggg cggagcctat 6841 ggaaaaacgc cagcaacgcg gcctttttac ggttcctggc cttttgctgg ccttttgctc 6901 acatgttctt tcctgcgtta tcccctgatt ctgtggataa ccgtattacc gcctttgagt 6961 gagctgatac cgctcgccgc agccgaacga ccgagcgcag cgagtcagtg agcgaggaag 7021 cggaagagcg cccaatacgc aaaccgcctc tccccgcgcg ttggccgatt cattaatgca 7081 gctggcacga caggtttccc gactggaaag cgggcagtga gcgcaacgca attaatgtga 7141 gttagctcac tcattaggca ccccaggctt tacactttat gcttccggct cgtatgttgt 7201 gtggaattgt gagcggataa caatttcaca caggaaacag ct // LOCUS SYNLACZK 6620 bp ds-DNA SYN 17-JUL-1990 DEFINITION Cloning vector pPD34.110. ACCESSION M34306 KEYWORDS lacZ. SOURCE Cloning vector pPD34.110. ORGANISM Cloning vector Artificial sequences; Cloning vehicles. REFERENCE 1 (bases 1 to 6620) AUTHORS Fire,A.Z., Harrison,S. and Dixon,D. TITLE A modular set of lac-Z fusion vectors for studying gene expression in C.elegans JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by A.Z.Fire, 11-MAY-1990. Author address: A.Z.Fire Carnegie Inst of Washington Dept Embryology 115 West Univ Parkway Baltimore, MD 21210 email: AZF@JHUIGF.BITNET FEATURES from to/span description recomb 57 58 pUC19 end/synthetic start recomb 237 238 synthetic end/E.coli trpS start recomb 321 322 E.coli trpS end/synthetic start recomb 325 326 synthetic end/E.coli lacZ start recomb 3319 3320 E.coli lacZ end/synthetic start recomb 4154 4155 synthetic end/pUC19 start recomb 4403 4404 pUC19 end/synthetic start recomb 3379 3380 synthetic end/unknown DNA start recomb 4134 4135 unknown DNA end/synthetic start recomb 4414 4415 synthetic end/pUC19 start BASE COUNT 1592 a 1665 c 1731 g 1632 t ORIGIN 1 atgaccatga ttacgccaag cttgcatgcc tgcaggtcga ctctagagga tccccgggat 61 tggccaaagg acccaaaggt atgtttcgaa tgatactaac ataacataga acattttcag 121 gaggaccctt ggagggtacc tcgagaaagc tggcaaaggg ctcttgtcct gctaatcgta 181 ctactcttca tcgtcatctt cgttattact gttttgttcg tcataagatc taacaaggta 241 ccggtgggtg aagaccagaa acagcacctc gaactgagcc gcgatattgc ccagcgtttc 301 aacgcgctgt atggcgagat cgatcccgtc gttttacaac gtcgtgactg ggaaaaccct 361 ggcgttaccc aacttaatcg ccttgcagca catccccctt tcgccagctg gcgtaatagc 421 gaagaggccc gcaccgatcg cccttcccaa cagttgcgca gcctgaatgg cgaatggcgc 481 tttgcctggt ttccggcacc agaagcggtg ccggaaagct ggctggagtg cgatcttcct 541 gaggccgata ctgtcgtcgt cccctcaaac tggcagatgc acggttacga tgcgcccatc 601 tacaccaacg taacctatcc cattacggtc aatccgccgt ttgttcccac ggagaatccg 661 acgggttgtt actcgctcac atttaatgtt gatgaaagct ggctacagga aggccagacg 721 cgaattattt ttgatggcgt taactcggcg tttcatctgt ggtgcaacgg gcgctgggtc 781 ggttacggcc aggacagtcg tttgccgtct gaatttgacc tgagcgcatt tttacgcgcc 841 ggagaaaacc gcctcgcggt gatggtgctg cgttggagtg acggcagtta tctggaagat 901 caggatatgt ggcggatgag cggcattttc cgtgacgtct cgttgctgca taaaccgact 961 acacaaatca gcgatttcca tgttgccact cgctttaatg atgatttcag ccgcgctgta 1021 ctggaggctg aagttcagat gtgcggcgag ttgcgtgact acctacgggt aacagtttct 1081 ttatggcagg gtgaaacgca ggtcgccagc ggcaccgcgc ctttcggcgg tgaaattatc 1141 gatgagcgtg gtggttatgc cgatcgcgtc acactacgtc tgaacgtcga aaacccgaaa 1201 ctgtggagcg ccgaaatccc gaatctctat cgtgcggtgg ttgaactgca caccgccgac 1261 ggcacgctga ttgaagcaga agcctgcgat gtcggtttcc gcgaggtgcg gattgaaaat 1321 ggtctgctgc tgctgaacgg caagccgttg ctgattcgag gcgttaaccg tcacgagcat 1381 catcctctgc atggtcaggt catggatgag cagacgatgg tgcaggatat cctgctgatg 1441 aagcagaaca actttaacgc cgtgcgctgt tcgcattatc cgaaccatcc gctgtggtac 1501 acgctgtgcg accgctacgg cctgtatgtg gtggatgaag ccaatattga aacccacggc 1561 atggtgccaa tgaatcgtct gaccgatgat ccgcgctggc taccggcgat gagcgaacgc 1621 gtaacgcgaa tggtgcagcg cgatcgtaat cacccgagtg tgatcatctg gtcgctgggg 1681 aatgaatcag gccacggcgc taatcacgac gcgctgtatc gctggatcaa atctgtcgat 1741 ccttcccgcc cggtgcagta tgaaggcggc ggagccgaca ccacggccac cgatattatt 1801 tgcccgatgt acgcgcgcgt ggatgaagac cagcccttcc cggctgtgcc gaaatggtcc 1861 atcaaaaaat ggctttcgct acctggagag acgcgcccgc tgatcctttg cgaatacgcc 1921 cacgcgatgg gtaacagtct tggcggtttc gctaaatact ggcaggcgtt tcgtcagtat 1981 ccccgtttac agggcggctt cgtctgggac tgggtggatc agtcgctgat taaatatgat 2041 gaaaacggca acccgtggtc ggcttacggc ggtgattttg gcgatacgcc gaacgatcgc 2101 cagttctgta tgaacggtct ggtctttgcc gaccgcacgc cgcatccagc gctgacggaa 2161 gcaaaacacc agcagcagtt tttccagttc cgtttatccg ggcaaaccat cgaagtgacc 2221 agcgaatacc tgttccgtca tagcgataac gagctcctgc actggatggt ggcgctggat 2281 ggtaagccgc tggcaagcgg tgaagtgcct ctggatgtcg ctccacaagg taaacagttg 2341 attgaactgc ctgaactacc gcagccggag agcgccgggc aactctggct cacagtacgc 2401 gtagtgcaac cgaacgcgac cgcatggtca gaagccgggc acatcagcgc ctggcagcag 2461 tggcgtctgg cggaaaacct cagtgtgacg ctccccgccg cgtcccacgc catcccgcat 2521 ctgaccacca gcgaaatgga tttttgcatc gagctgggta ataagcgttg gcaatttaac 2581 cgccagtcag gctttctttc acagatgtgg attggcgata aaaaacaact gctgacgccg 2641 ctgcgcgatc agttcacccg tgcaccgctg gataacgaca ttggcgtaag tgaagcgacc 2701 cgcattgacc ctaacgcctg ggtcgaacgc tggaaggcgg cgggccatta ccaggccgaa 2761 gcagcgttgt tgcagtgcac ggcagataca cttgctgatg cggtgctgat tacgaccgct 2821 cacgcgtggc agcatcaggg gaaaacctta tttatcagcc ggaaaaccta ccggattgat 2881 ggtagtggtc aaatggcgat taccgttgat gttgaagtgg cgagcgatac accgcatccg 2941 gcgcggattg gcctgaactg ccagctggcg caggtagcag agcgggtaaa ctggctcgga 3001 ttagggccgc aagaaaacta tcccgaccgc cttactgccg cctgttttga ccgctgggat 3061 ctgccattgt cagacatgta taccccgtac gtcttcccga gcgaaaacgg tctgcgctgc 3121 gggacgcgcg aattgaatta tggcccacac cagtggcgcg gcgacttcca gttcaacatc 3181 agccgctaca gtcaacagca actgatggaa accagccatc gccatctgct gcacgcggaa 3241 gaaggcacat ggctgaatat cgacggtttc catatgggga ttggtggcga cgactcctgg 3301 agcccgtcag tatcggcgga attccaactg agcgccggtc gctaccatta ccaacttgtc 3361 tggtgtcaaa aataataggg gccgctgtca tcagatcgcc atctcgcgcc cgtgcctctg 3421 acttctaagt ccaattactc ttcaacatcc ctacatgctc tttctccctg tgctcccacc 3481 ccctattttt gttattatca aaaaaacttc ttcttaattt ctttgttttt tagcttcttt 3541 taagtcacct ctaacaatga aattgtgtag attcaaaaat agaattaatt cgtaataaaa 3601 agtcgaaaaa aattgtgctc cctcccccca ttaataataa ttctatccca aaatctacac 3661 aatgttctgt gtacacttct tatgtttttt ttacttctga taaatttttt ttgaaacatc 3721 atagaaaaaa ccgcacacaa aataccttat catatgttac gtttcagttt atgaccgcaa 3781 tttttatttc ttcgcacgtc tgggcctctc atgacgtcaa atcatgctca tcgtgaaaaa 3841 gttttggagt atttttggaa tttttcaatc aagtgaaagt ttatgaaatt aattttcctg 3901 cttttgcttt ttgggggttt cccctattgt ttgtcaagag tttcgaggac ggcgtttttc 3961 ttgctaaaat cacaagtatt gatgagcacg atgcaagaaa gatcggaaga aggtttgggt 4021 ttgaggctca gtggaaggtg agtagaagtt gataatttga aagtggagta gtgtctatgg 4081 ggtttttgcc ttaaatgaca gaatacattc ccaatatacc aaacataact gtttcctact 4141 agtcggccgt acgggccctt tcgtctcgcg cgtttcggtg atgacggtga aaacctctga 4201 cacatgcagc tcccggagac ggtcacagct tgtctgtaag cggatgccgg gagcagacaa 4261 gcccgtcagg gcgcgtcagc gggtgttggc gggtgtcggg gctggcttaa ctatgcggca 4321 tcagagcaga ttgtactgag agtgcaccat atgcggtgtg aaataccgca cagatgcgta 4381 aggagaaaat accgcatcag gcggccttaa gggcctcgtg atacgcctat ttttataggt 4441 taatgtcatg ataataatgg tttcttagac gtcaggtggc acttttcggg gaaatgtgcg 4501 cggaacccct atttgtttat ttttctaaat acattcaaat atgtatccgc tcatgagaca 4561 ataaccctga taaatgcttc aataatattg aaaaaggaag agtatgagta ttcaacattt 4621 ccgtgtcgcc cttattccct tttttgcggc attttgcctt cctgtttttg ctcacccaga 4681 aacgctggtg aaagtaaaag atgctgaaga tcagttgggt gcacgagtgg gttacatcga 4741 actggatctc aacagcggta agatccttga gagttttcgc cccgaagaac gttttccaat 4801 gatgagcact tttaaagttc tgctatgtgg cgcggtatta tcccgtattg acgccgggca 4861 agagcaactc ggtcgccgca tacactattc tcagaatgac ttggttgagt actcaccagt 4921 cacagaaaag catcttacgg atggcatgac agtaagagaa ttatgcagtg ctgccataac 4981 catgagtgat aacactgcgg ccaacttact tctgacaacg atcggaggac cgaaggagct 5041 aaccgctttt ttgcacaaca tgggggatca tgtaactcgc cttgatcgtt gggaaccgga 5101 gctgaatgaa gccataccaa acgacgagcg tgacaccacg atgcctgtag caatggcaac 5161 aacgttgcgc aaactattaa ctggcgaact acttactcta gcttcccggc aacaattaat 5221 agactggatg gaggcggata aagttgcagg accacttctg cgctcggccc ttccggctgg 5281 ctggtttatt gctgataaat ctggagccgg tgagcgtggg tctcgcggta tcattgcagc 5341 actggggcca gatggtaagc cctcccgtat cgtagttatc tacacgacgg ggagtcaggc 5401 aactatggat gaacgaaata gacagatcgc tgagataggt gcctcactga ttaagcattg 5461 gtaactgtca gaccaagttt actcatatat actttagatt gatttaaaac ttcattttta 5521 atttaaaagg atctaggtga agatcctttt tgataatctc atgaccaaaa tcccttaacg 5581 tgagttttcg ttccactgag cgtcagaccc cgtagaaaag atcaaaggat cttcttgaga 5641 tccttttttt ctgcgcgtaa tctgctgctt gcaaacaaaa aaaccaccgc taccagcggt 5701 ggtttgtttg ccggatcaag agctaccaac tctttttccg aaggtaactg gcttcagcag 5761 agcgcagata ccaaatactg tccttctagt gtagccgtag ttaggccacc acttcaagaa 5821 ctctgtagca ccgcctacat acctcgctct gctaatcctg ttaccagtgg ctgctgccag 5881 tggcgataag tcgtgtctta ccgggttgga ctcaagacga tagttaccgg ataaggcgca 5941 gcggtcgggc tgaacggggg gttcgtgcac acagcccagc ttggagcgaa cgacctacac 6001 cgaactgaga tacctacagc gtgagcattg agaaagcgcc acgcttcccg aagggagaaa 6061 ggcggacagg tatccggtaa gcggcagggt cggaacagga gagcgcacga gggagcttcc 6121 agggggaaac gcctggtatc tttatagtcc tgtcgggttt cgccacctct gacttgagcg 6181 tcgatttttg tgatgctcgt caggggggcg gagcctatgg aaaaacgcca gcaacgcggc 6241 ctttttacgg ttcctggcct tttgctggcc ttttgctcac atgttctttc ctgcgttatc 6301 ccctgattct gtggataacc gtattaccgc ctttgagtga gctgataccg ctcgccgcag 6361 ccgaacgacc gagcgcagcg agtcagtgag cgaggaagcg gaagagcgcc caatacgcaa 6421 accgcctctc cccgcgcgtt ggccgattca ttaatgcagc tggcacgaca ggtttcccga 6481 ctggaaagcg ggcagtgagc gcaacgcaat taatgtgagt tagctcactc attaggcacc 6541 ccaggcttta cactttatgc ttccggctcg tatgttgtgt ggaattgtga gcggataaca 6601 atttcacaca ggaaacagct // LOCUS SYNLACZL 5808 bp ds-DNA SYN 17-JUL-1990 DEFINITION Cloning vector pPD16.01. ACCESSION M34307 KEYWORDS lacZ. SOURCE Cloning vector pPD16.01. ORGANISM Cloning vector Artificial sequences; Cloning vehicles. REFERENCE 1 (bases 1 to 5808) AUTHORS Fire,A.Z., Harrison,S. and Dixon,D. TITLE A modular set of lac-Z fusion vectors for studying gene expression in C.elegans JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by A.Z.Fire, 11-MAY-1990. Author address: A.Z.Fire Carnegie Inst of Washington Dept Embryology 115 West Univ Parkway Baltimore, MD 21210 email: AZF@JHUIGF.BITNET FEATURES from to/span description recomb 57 58 pUC19 end/synthetic start recomb 180 181 synthetic end/E.coli trpS start recomb 264 265 E.coli trpS end/synthetic start recomb 268 269 synthetic end/E.coli lacZ start recomb 3262 3263 E.coli lacZ end/synthetic start recomb 3342 3343 synthetic end/unknown DNA start recomb 3591 3592 unknown DNA end/synthetic start recomb 3602 3603 synthetic end/pUC19 start BASE COUNT 1375 a 1502 c 1599 g 1332 t ORIGIN 1 atgaccatga ttacgccaag cttgcatgcc tgcaggtcga ctctagagga tccccgggat 61 tggccaaagg acccaaaggt atgtttcgaa tgatactaac ataacataga acattttcag 121 gaggaccctt ggagggtacc gagctcagaa aaaatgactg ctccaaagaa gaagcgtaag 181 gtaccggtgg gtgaagacca gaaacagcac ctcgaactga gccgcgatat tgcccagcgt 241 ttcaacgcgc tgtatggcga gatcgatccc gtcgttttac aacgtcgtga ctgggaaaac 301 cctggcgtta cccaacttaa tcgccttgca gcacatcccc ctttcgccag ctggcgtaat 361 agcgaagagg cccgcaccga tcgcccttcc caacagttgc gcagcctgaa tggcgaatgg 421 cgctttgcct ggtttccggc accagaagcg gtgccggaaa gctggctgga gtgcgatctt 481 cctgaggccg atactgtcgt cgtcccctca aactggcaga tgcacggtta cgatgcgccc 541 atctacacca acgtaaccta tcccattacg gtcaatccgc cgtttgttcc cacggagaat 601 ccgacgggtt gttactcgct cacatttaat gttgatgaaa gctggctaca ggaaggccag 661 acgcgaatta tttttgatgg cgttaactcg gcgtttcatc tgtggtgcaa cgggcgctgg 721 gtcggttacg gccaggacag tcgtttgccg tctgaatttg acctgagcgc atttttacgc 781 gccggagaaa accgcctcgc ggtgatggtg ctgcgttgga gtgacggcag ttatctggaa 841 gatcaggata tgtggcggat gagcggcatt ttccgtgacg tctcgttgct gcataaaccg 901 actacacaaa tcagcgattt ccatgttgcc actcgcttta atgatgattt cagccgcgct 961 gtactggagg ctgaagttca gatgtgcggc gagttgcgtg actacctacg ggtaacagtt 1021 tctttatggc agggtgaaac gcaggtcgcc agcggcaccg cgcctttcgg cggtgaaatt 1081 atcgatgagc gtggtggtta tgccgatcgc gtcacactac gtctgaacgt cgaaaacccg 1141 aaactgtgga gcgccgaaat cccgaatctc tatcgtgcgg tggttgaact gcacaccgcc 1201 gacggcacgc tgattgaagc agaagcctgc gatgtcggtt tccgcgaggt gcggattgaa 1261 aatggtctgc tgctgctgaa cggcaagccg ttgctgattc gaggcgttaa ccgtcacgag 1321 catcatcctc tgcatggtca ggtcatggat gagcagacga tggtgcagga tatcctgctg 1381 atgaagcaga acaactttaa cgccgtgcgc tgttcgcatt atccgaacca tccgctgtgg 1441 tacacgctgt gcgaccgcta cggcctgtat gtggtggatg aagccaatat tgaaacccac 1501 ggcatggtgc caatgaatcg tctgaccgat gatccgcgct ggctaccggc gatgagcgaa 1561 cgcgtaacgc gaatggtgca gcgcgatcgt aatcacccga gtgtgatcat ctggtcgctg 1621 gggaatgaat caggccacgg cgctaatcac gacgcgctgt atcgctggat caaatctgtc 1681 gatccttccc gcccggtgca gtatgaaggc ggcggagccg acaccacggc caccgatatt 1741 atttgcccga tgtacgcgcg cgtggatgaa gaccagccct tcccggctgt gccgaaatgg 1801 tccatcaaaa aatggctttc gctacctgga gagacgcgcc cgctgatcct ttgcgaatac 1861 gcccacgcga tgggtaacag tcttggcggt ttcgctaaat actggcaggc gtttcgtcag 1921 tatccccgtt tacagggcgg cttcgtctgg gactgggtgg atcagtcgct gattaaatat 1981 gatgaaaacg gcaacccgtg gtcggcttac ggcggtgatt ttggcgatac gccgaacgat 2041 cgccagttct gtatgaacgg tctggtcttt gccgaccgca cgccgcatcc agcgctgacg 2101 gaagcaaaac accagcagca gtttttccag ttccgtttat ccgggcaaac catcgaagtg 2161 accagcgaat acctgttccg tcatagcgat aacgagctcc tgcactggat ggtggcgctg 2221 gatggtaagc cgctggcaag cggtgaagtg cctctggatg tcgctccaca aggtaaacag 2281 ttgattgaac tgcctgaact accgcagccg gagagcgccg ggcaactctg gctcacagta 2341 cgcgtagtgc aaccgaacgc gaccgcatgg tcagaagccg ggcacatcag cgcctggcag 2401 cagtggcgtc tggcggaaaa cctcagtgtg acgctccccg ccgcgtccca cgccatcccg 2461 catctgacca ccagcgaaat ggatttttgc atcgagctgg gtaataagcg ttggcaattt 2521 aaccgccagt caggctttct ttcacagatg tggattggcg ataaaaaaca actgctgacg 2581 ccgctgcgcg atcagttcac ccgtgcaccg ctggataacg acattggcgt aagtgaagcg 2641 acccgcattg accctaacgc ctgggtcgaa cgctggaagg cggcgggcca ttaccaggcc 2701 gaagcagcgt tgttgcagtg cacggcagat acacttgctg atgcggtgct gattacgacc 2761 gctcacgcgt ggcagcatca ggggaaaacc ttatttatca gccggaaaac ctaccggatt 2821 gatggtagtg gtcaaatggc gattaccgtt gatgttgaag tggcgagcga tacaccgcat 2881 ccggcgcgga ttggcctgaa ctgccagctg gcgcaggtag cagagcgggt aaactggctc 2941 ggattagggc cgcaagaaaa ctatcccgac cgccttactg ccgcctgttt tgaccgctgg 3001 gatctgccat tgtcagacat gtataccccg tacgtcttcc cgagcgaaaa cggtctgcgc 3061 tgcgggacgc gcgaattgaa ttatggccca caccagtggc gcggcgactt ccagttcaac 3121 atcagccgct acagtcaaca gcaactgatg gaaaccagcc atcgccatct gctgcacgcg 3181 gaagaaggca catggctgaa tatcgacggt ttccatatgg ggattggtgg cgacgactcc 3241 tggagcccgt cagtatcggc ggaattccaa ctgagcgccg gtcgctacca ttaccaactt 3301 gtctggtgtc aaaaataata ggcctactag tcggccgtac gggccctttc gtctcgcgcg 3361 tttcggtgat gacggtgaaa acctctgaca catgcagctc ccggagacgg tcacagcttg 3421 tctgtaagcg gatgccggga gcagacaagc ccgtcagggc gcgtcagcgg gtgttggcgg 3481 gtgtcggggc tggcttaact atgcggcatc agagcagatt gtactgagag tgcaccatat 3541 gcggtgtgaa ataccgcaca gatgcgtaag gagaaaatac cgcatcaggc ggccttaagg 3601 gcctcgtgat acgcctattt ttataggtta atgtcatgat aataatggtt tcttagacgt 3661 caggtggcac ttttcgggga aatgtgcgcg gaacccctat ttgtttattt ttctaaatac 3721 attcaaatat gtatccgctc atgagacaat aaccctgata aatgcttcaa taatattgaa 3781 aaaggaagag tatgagtatt caacatttcc gtgtcgccct tattcccttt tttgcggcat 3841 tttgccttcc tgtttttgct cacccagaaa cgctggtgaa agtaaaagat gctgaagatc 3901 agttgggtgc acgagtgggt tacatcgaac tggatctcaa cagcggtaag atccttgaga 3961 gttttcgccc cgaagaacgt tttccaatga tgagcacttt taaagttctg ctatgtggcg 4021 cggtattatc ccgtattgac gccgggcaag agcaactcgg tcgccgcata cactattctc 4081 agaatgactt ggttgagtac tcaccagtca cagaaaagca tcttacggat ggcatgacag 4141 taagagaatt atgcagtgct gccataacca tgagtgataa cactgcggcc aacttacttc 4201 tgacaacgat cggaggaccg aaggagctaa ccgctttttt gcacaacatg ggggatcatg 4261 taactcgcct tgatcgttgg gaaccggagc tgaatgaagc cataccaaac gacgagcgtg 4321 acaccacgat gcctgtagca atggcaacaa cgttgcgcaa actattaact ggcgaactac 4381 ttactctagc ttcccggcaa caattaatag actggatgga ggcggataaa gttgcaggac 4441 cacttctgcg ctcggccctt ccggctggct ggtttattgc tgataaatct ggagccggtg 4501 agcgtgggtc tcgcggtatc attgcagcac tggggccaga tggtaagccc tcccgtatcg 4561 tagttatcta cacgacgggg agtcaggcaa ctatggatga acgaaataga cagatcgctg 4621 agataggtgc ctcactgatt aagcattggt aactgtcaga ccaagtttac tcatatatac 4681 tttagattga tttaaaactt catttttaat ttaaaaggat ctaggtgaag atcctttttg 4741 ataatctcat gaccaaaatc ccttaacgtg agttttcgtt ccactgagcg tcagaccccg 4801 tagaaaagat caaaggatct tcttgagatc ctttttttct gcgcgtaatc tgctgcttgc 4861 aaacaaaaaa accaccgcta ccagcggtgg tttgtttgcc ggatcaagag ctaccaactc 4921 tttttccgaa ggtaactggc ttcagcagag cgcagatacc aaatactgtc cttctagtgt 4981 agccgtagtt aggccaccac ttcaagaact ctgtagcacc gcctacatac ctcgctctgc 5041 taatcctgtt accagtggct gctgccagtg gcgataagtc gtgtcttacc gggttggact 5101 caagacgata gttaccggat aaggcgcagc ggtcgggctg aacggggggt tcgtgcacac 5161 agcccagctt ggagcgaacg acctacaccg aactgagata cctacagcgt gagcattgag 5221 aaagcgccac gcttcccgaa gggagaaagg cggacaggta tccggtaagc ggcagggtcg 5281 gaacaggaga gcgcacgagg gagcttccag ggggaaacgc ctggtatctt tatagtcctg 5341 tcgggtttcg ccacctctga cttgagcgtc gatttttgtg atgctcgtca ggggggcgga 5401 gcctatggaa aaacgccagc aacgcggcct ttttacggtt cctggccttt tgctggcctt 5461 ttgctcacat gttctttcct gcgttatccc ctgattctgt ggataaccgt attaccgcct 5521 ttgagtgagc tgataccgct cgccgcagcc gaacgaccga gcgcagcgag tcagtgagcg 5581 aggaagcgga agagcgccca atacgcaaac cgcctctccc cgcgcgttgg ccgattcatt 5641 aatgcagctg gcacgacagg tttcccgact ggaaagcggg cagtgagcgc aacgcaatta 5701 atgtgagtta gctcactcat taggcacccc aggctttaca ctttatgctt ccggctcgta 5761 tgttgtgtgg aattgtgagc ggataacaat ttcacacagg aaacagct // LOCUS MUSTCAXL 331 bp ss-mRNA ROD 17-JUL-1990 DEFINITION Mouse T-cell receptor active alpha-chain mRNA V-region, partial cds, from hybridoma LD1. ACCESSION M34194 KEYWORDS T-cell receptor; T-cell receptor alpha-chain; variable region. SOURCE Mouse (strain BALB/c) T-cell hybridoma LD1, cDNA to mRNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 331) AUTHORS Taylor,A.H., Haberman,A.M., Gerhard,W. and Caton,A.J. TITLE Structurally diverse T cells can recognize an influenza antigen/MHC complex in the same common orientation JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by A.H.Taylor, 16-MAY-1990. Author address: A.H.Taylor Wistar Institute of Anatomy and Biology 3601 Spruce Street Philadelphia, PA 19104 Tel: (215) 898-3839 FEATURES from to/span description pept < 1 > 331 T-cell receptor alpha-chain (AA at 1) BASE COUNT 71 a 86 c 89 g 85 t ORIGIN 1 cagtcagtga cgcagcccga tgctcgtgtc actgtctctg aaggagcctc tctgcagctg 61 agatgcaagt attcctcctc tgtgacacct tatctgttct ggtatgtcct gtacccgcgg 121 caggggctgc agctgctcct caagtactat tccggagacc cagtggttca aggagtgaat 181 ggctttgagg ctgagttcag caagagtaac tcttccttcc acctgcggaa agcctccgtg 241 cactggagcg actcggctgt gtacttctgt gctgtgagca tggatggaaa tgagaaaata 301 acttttgggg ctggaaccaa actcaccatt a // LOCUS MUSTCAXM 334 bp ss-mRNA ROD 17-JUL-1990 DEFINITION Mouse T-cell receptor active alpha-chain mRNA V-region, partial cds, from hybridoma LD3. ACCESSION M34196 KEYWORDS T-cell receptor; T-cell receptor alpha-chain; variable region. SOURCE Mouse (strain BALB/c) T-cell hybridoma LD3, cDNA to mRNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 334) AUTHORS Taylor,A.H., Haberman,A.M., Gerhard,W. and Caton,A.J. TITLE Structurally diverse T cells can recognize an influenza antigen/MHC complex in the same common orientation JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by A.H.Taylor, 16-MAY-1990. Author address: A.H.Taylor Wistar Institute of Anatomy and Biology 3601 Spruce Street Philadelphia, PA 19104 Tel: (215) 898-3839 FEATURES from to/span description pept < 1 > 334 T-cell receptor alpha-chain (AA at 1) BASE COUNT 97 a 84 c 75 g 78 t ORIGIN 1 cagcaggtga gacaaagtcc ccaatctctg acagtctggg aaggagagac agcaattctg 61 aactgcagtt atgaggacag cacttttgac tacttcccat ggtaccgtct gttccctggg 121 gaaagccctg cactcctgat agccatacgt ccagtgtcca ataaaaagga agatggacga 181 ttcacaatct tcttcaataa aagggagaaa aagctctcct tgcacatcac agactctcag 241 cctggagact cagctaccta cttctgtgca gcaagaagta caggctttgc aagtgcgctg 301 acatttggat ctggcacaaa agtcattgtt ctac // LOCUS MUSTCAXN 327 bp ss-mRNA ROD 17-JUL-1990 DEFINITION Mouse T-cell receptor active alpha-chain mRNA V-region, partial cds, from hybridoma MT1-14. ACCESSION M34198 KEYWORDS T-cell receptor; T-cell receptor alpha-chain; variable region. SOURCE Mouse (strain BALB/c) T-cell hybridoma MT1-14, cDNA to mRNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 327) AUTHORS Taylor,A.H., Haberman,A.M., Gerhard,W. and Caton,A.J. TITLE Structurally diverse T cells can recognize an influenza antigen/MHC complex in the same common orientation JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by A.H.Taylor, 16-MAY-1990. Author address: A.H.Taylor Wistar Institute of Anatomy and Biology 3601 Spruce Street Philadelphia, PA 19104 Tel: (215) 898-3839 FEATURES from to/span description pept < 1 > 327 T-cell receptor alpha-chain (AA at 1) BASE COUNT 92 a 80 c 85 g 70 t ORIGIN 1 gactcagtga ctcagacgga aggtcaagtg gccctctcag aagaggactt tcttacgata 61 cactgcaact actcagcctc agggtaccca gctctgttct ggtatgtgca gtatcccgga 121 gaagggccac agttcctctt tagagcctca agggacaaag agaaaggaag cagcagaggg 181 tttgaagcca catacaataa agaagccacc tccttccact tgcagaaagc ctcagtgcaa 241 gagtcagact cggctgtgta ctactgtgct ctgagtgatc agcgggggaa gcttatcttt 301 ggacagggaa ccaagttatc tatcaag // LOCUS MUSTCAXO 324 bp ss-mRNA ROD 17-JUL-1990 DEFINITION Mouse T-cell receptor active alpha-chain mRNA V-region, partial cds, from hybridoma MT1-27. ACCESSION M34200 KEYWORDS T-cell receptor; T-cell receptor alpha-chain; variable region. SOURCE Mouse (strain BALB/c) T-cell hybridoma MT1-27, cDNA to mRNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 324) AUTHORS Taylor,A.H., Haberman,A.M., Gerhard,W. and Caton,A.J. TITLE Structurally diverse T cells can recognize an influenza antigen/MHC complex in the same common orientation JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by A.H.Taylor, 16-MAY-1990. Author address: A.H.Taylor Wistar Institute of Anatomy and Biology 3601 Spruce Street Philadelphia, PA 19104 Tel: (215) 898-3839 FEATURES from to/span description pept < 1 > 324 T-cell receptor alpha-chain (AA at 1) BASE COUNT 92 a 80 c 83 g 69 t ORIGIN 1 gactcagtga ctcagacgga aggtcaagtg gccctctcag aagaggactt tcttacgata 61 cactgcaact actcagcctc agggtaccca gctctgttct ggtatgtgca gtatcccgga 121 gaagggccac agttcctctt tagagcctca agggacaaag agaaaggaag cagcagaggg 181 tttgaagcca catacaataa agaagccacc tccttccact tgcagaaagc ctcagtgcaa 241 gagtcagact cggctgtgta ctactgtgct ctgaggagca actatcagtt gatctggggc 301 tctgggacca agctaattat aaag // LOCUS MUSTCAXP 297 bp ss-mRNA ROD 17-JUL-1990 DEFINITION Mouse T-cell receptor active alpha-chain mRNA V-region, partial cds, from hybridoma MT1-6. ACCESSION M34202 KEYWORDS T-cell receptor; T-cell receptor alpha-chain; variable region. SOURCE Mouse (strain BALB/c) T-cell hybridoma MT1-6, cDNA to mRNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 297) AUTHORS Taylor,A.H., Haberman,A.M., Gerhard,W. and Caton,A.J. TITLE Structurally diverse T cells can recognize an influenza antigen/MHC complex in the same common orientation JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by A.H.Taylor, 16-MAY-1990. Author address: A.H.Taylor Wistar Institute of Anatomy and Biology 3601 Spruce Street Philadelphia, PA 19104 Tel: (215) 898-3839 FEATURES from to/span description pept < 1 > 297 T-cell receptor alpha-chain (AA at 1) BASE COUNT 85 a 73 c 74 g 65 t ORIGIN 1 gtgacattat ctgaaggaac ttctctgact gtgaactgtt cctatgaaac caaacagtac 61 ccaaccctgt tctggtatgt gcagtatccc ggagaaggtc cacagctcct ctttaaagtc 121 ccaaaggcca acgagaaggg aagcagcaga gggtttgaag ccacatacaa taaagaagcc 181 acctccttcc acttgcagaa agcctcagtg caagagtcag actcggctgt gtactactgt 241 gctctgagtg atcgggggac caatacaggc aaattaacct ttggggatgg gaccgtg // LOCUS MUSTCAXQ 193 bp ss-mRNA ROD 17-JUL-1990 DEFINITION Mouse T-cell receptor active alpha-chain mRNA V-region, partial cds, T-cell clone V2.1. ACCESSION M34204 KEYWORDS T-cell receptor; T-cell receptor alpha-chain; variable region. SOURCE Mouse (strain BALB/c) T-cell lymphoid clone V2.1, cDNA to mRNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 193) AUTHORS Taylor,A.H., Haberman,A.M., Gerhard,W. and Caton,A.J. TITLE Structurally diverse T cells can recognize an influenza antigen/MHC complex in the same common orientation JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by A.H.Taylor, 16-MAY-1990. Author address: A.H.Taylor Wistar Institute of Anatomy and Biology 3601 Spruce Street Philadelphia, PA 19104 Tel: (215) 898-3839 FEATURES from to/span description pept < 1 > 193 T-cell receptor alpha-chain (AA at 2) BASE COUNT 56 a 45 c 49 g 43 t ORIGIN 1 cctctttaaa gtcccaaagg ccaacgagaa gggaagcagc agagggtttg aagccacata 61 caataaagaa gccacctcct tccacttgca gaaagcctca gtgcaagagt cagactcggc 121 tgtgtactac tgtgctctga gtggaggcaa taataagctg acttttggtc aaggaaccgt 181 tctgagtgtt ctg // LOCUS MUSTCAXR 333 bp ss-mRNA ROD 17-JUL-1990 DEFINITION Mouse T-cell receptor active alpha-chain mRNA V-region, partial cds, from hybridoma MT1-33. ACCESSION M34206 KEYWORDS T-cell receptor; T-cell receptor alpha-chain; variable region. SOURCE Mouse (strain BALB/c) T-cell hybridoma MT1-33, cDNA to mRNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 333) AUTHORS Taylor,A.H., Haberman,A.M., Gerhard,W. and Caton,A.J. TITLE Structurally diverse T cells can recognize an influenza antigen/MHC complex in the same common orientation JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by A.H.Taylor, 16-MAY-1990. Author address: A.H.Taylor Wistar Institute of Anatomy and Biology 3601 Spruce Street Philadelphia, PA 19104 Tel: (215) 898-3839 FEATURES from to/span description pept < 1 > 333 T-cell receptor alpha-chain (AA at 1) BASE COUNT 84 a 96 c 78 g 74 t 1 others ORIGIN 1 gactccgtga cccagacaga aggcctggtc actgtcaccg aggggttgcc tgtgaagctg 61 aactgcacct atcagactac ttatttaact attgcctttt tctggtatgt gcaatatctc 121 aacgaagccc ctcaggtact cctgcggagc tccacagaca acaagaggac cgagcaccaa 181 gggttccacg ccactctcna taagagcagc agctccttcc atctgcagaa gtcctcagcg 241 cagctgtcag actctgccct gtactactgt gctctgagga atacaggagg tgcagataga 301 ctcacctttg ggaaaggaac tcagctgatc atc // LOCUS MUSTCAXS 339 bp ss-mRNA ROD 17-JUL-1990 DEFINITION Mouse T-cell receptor active alpha-chain mRNA V-region, partial cds, from hybridoma MT1-7. ACCESSION M34208 KEYWORDS T-cell receptor; T-cell receptor alpha-chain; variable region. SOURCE Mouse (strain BALB/c) T-cell hybridoma MT1-7, cDNA to mRNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 339) AUTHORS Taylor,A.H., Haberman,A.M., Gerhard,W. and Caton,A.J. TITLE Structurally diverse T cells can recognize an influenza antigen/MHC complex in the same common orientation JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by A.H.Taylor, 16-MAY-1990. Author address: A.H.Taylor Wistar Institute of Anatomy and Biology 3601 Spruce Street Philadelphia, PA 19104 Tel: (215) 898-3839 FEATURES from to/span description pept < 1 > 339 T-cell receptor alpha-chain (AA at 1) BASE COUNT 86 a 103 c 75 g 75 t ORIGIN 1 gactcagtga cccagacaga aggcctggtc actctcaccg aggggttgcc tgtgatgctg 61 aactgcacct atcagactgc ttactcaact ttccttttct ggtatgtgca acatctcaat 121 gaagccccta aactactcct gaagagctcc acagacaaca agaggaccga gcaccaaggg 181 ttccacgcca ctctccataa gagcagcagc tccttccatc tgcagaagtc ctcagcgcag 241 ctgtcagact ctgccctgta ctactgtgct ctgagtgata agactggagc taacactgga 301 aagctcacgt ttggacacgg caccatcctt agggtccat // LOCUS MUSTCAXT 342 bp ss-mRNA ROD 17-JUL-1990 DEFINITION Mouse T-cell receptor active alpha-chain mRNA V-region, partial cds, from hybridoma P1F12C4. ACCESSION M34210 KEYWORDS T-cell receptor; T-cell receptor alpha-chain; variable region. SOURCE Mouse (strain BALB/c) T-cell hybridoma P1F12C4, cDNA to mRNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 342) AUTHORS Taylor,A.H., Haberman,A.M., Gerhard,W. and Caton,A.J. TITLE Structurally diverse T cells can recognize an influenza antigen/MHC complex in the same common orientation JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by A.H.Taylor, 16-MAY-1990. Author address: A.H.Taylor Wistar Institute of Anatomy and Biology 3601 Spruce Street Philadelphia, PA 19104 Tel: (215) 898-3839 FEATURES from to/span description pept < 1 > 342 T-cell receptor alpha-chain (AA at 1) BASE COUNT 80 a 102 c 79 g 81 t ORIGIN 1 gactccgtga cccagacaga aggcctggtc actctcaacg aggggttgcc tgtgatgctg 61 aactgcacct atcagactat ttactcaaat gctttccttt tctggtatgt gcactatctc 121 aatgaatccc cttggctact cctgcggagc tccacagaca acaagaggac cgagcaccaa 181 gggttccacg ccactctcca taagagcagc agctccttcc atctgcagaa gtcctcagcg 241 cagctgtcag actctgccct gtactactgt gctttgagtg agaggtctgg agctaacact 301 ggaaagctca cgtttggaca cggcaccatc cttagggtcc at // LOCUS MUSTCAXU 324 bp ss-mRNA ROD 17-JUL-1990 DEFINITION Mouse T-cell receptor active alpha-chain mRNA V-region, partial cds, from hybridoma P1D3A6. ACCESSION M34212 KEYWORDS T-cell receptor; T-cell receptor alpha-chain; variable region. SOURCE Mouse (strain BALB/c) T-cell hybridoma P1D3A6, cDNA to mRNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 324) AUTHORS Taylor,A.H., Haberman,A.M., Gerhard,W. and Caton,A.J. TITLE Structurally diverse T cells can recognize an influenza antigen/MHC complex in the same common orientation JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by A.H.Taylor, 16-MAY-1990. Author address: A.H.Taylor Wistar Institute of Anatomy and Biology 3601 Spruce Street Philadelphia, PA 19104 Tel: (215) 898-3839 FEATURES from to/span description pept < 1 > 324 T-cell receptor alpha-chain (AA at 1) BASE COUNT 82 a 80 c 81 g 81 t ORIGIN 1 cagcaagtgc agcagagccc cgcgtccttg gttctgcagg agggggagaa tgcagagctg 61 cagtgtaact tttccacatc tttgaacagt atgcagtggt tttaccaacg tcctgaggga 121 agtctcgtca gcctgttcta caatccttct gggacaaagc agagtgggag actgacatcc 181 acaacagtca tcaaagaacg tcgcagctct ttgcacattt cctcctccca gatcacagac 241 tcaggcactt atctctgtgc tatggaggct actggaggca ataataagct gacttttggt 301 caaggaaccg ttctgagtgt tata // LOCUS MUSTCAXV 210 bp ss-mRNA ROD 17-JUL-1990 DEFINITION Mouse T-cell receptor active alpha-chain mRNA V-region, partial cds, from hybridoma 1E1O. ACCESSION M34214 KEYWORDS T-cell receptor; T-cell receptor alpha-chain; variable region. SOURCE Mouse (strain BALB/c) T-cell hybridoma 1E1O, cDNA to mRNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 210) AUTHORS Taylor,A.H., Haberman,A.M., Gerhard,W. and Caton,A.J. TITLE Structurally diverse T cells can recognize an influenza antigen/MHC complex in the same common orientation JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by A.H.Taylor, 16-MAY-1990. Author address: A.H.Taylor Wistar Institute of Anatomy and Biology 3601 Spruce Street Philadelphia, PA 19104 Tel: (215) 898-3839 FEATURES from to/span description pept < 1 > 210 T-cell receptor alpha-chain (AA at 1) BASE COUNT 54 a 53 c 50 g 53 t ORIGIN 1 gggggaagtc tcgtcagcct gttctacaat ccttctggga caaagcagag tgggagactg 61 acatccacta cagtcatcaa agaacgtcgc agctctttgc acatttcctc ctcccagaca 121 acagactcag gcacttatct ctgtgctatg gcggctactg gaggcaataa taagctgact 181 tttggtcaag gaaccgttct gagtgttata // LOCUS MUSTCAXW 234 bp ss-mRNA ROD 17-JUL-1990 DEFINITION Mouse T-cell receptor active alpha-chain mRNA V-region, partial cds, from hybridoma 7/6AH1. ACCESSION M34216 KEYWORDS T-cell receptor; T-cell receptor alpha-chain; variable region. SOURCE Mouse (strain BALB/c) T-cell hybridoma 7/6AH1, cDNA to mRNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 234) AUTHORS Taylor,A.H., Haberman,A.M., Gerhard,W. and Caton,A.J. TITLE Structurally diverse T cells can recognize an influenza antigen/MHC complex in the same common orientation JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by A.H.Taylor, 16-MAY-1990. Author address: A.H.Taylor Wistar Institute of Anatomy and Biology 3601 Spruce Street Philadelphia, PA 19104 Tel: (215) 898-3839 FEATURES from to/span description pept < 1 > 234 T-cell receptor alpha-chain (AA at 1) BASE COUNT 55 a 59 c 58 g 62 t ORIGIN 1 atgcagtggt tttatcaacg tcctggggga agtctcgtca gcctgttcta caatccttct 61 gggacaaagc agagtgggag actgacatcc actacagtca tcaaagaacg tcgcagctct 121 ttgcacattt cctcctccca gacaacagac tcaggcactt atctctgtgc tatgggtgta 181 tctggtagct tcaataagtt gacctttgga gcagggacca gactggctgt gtgc // LOCUS MUSTCAXX 312 bp ss-mRNA ROD 17-JUL-1990 DEFINITION Mouse T-cell receptor active alpha-chain mRNA V-region, partial cds, from hybridoma 2B11. ACCESSION M34218 KEYWORDS T-cell receptor; T-cell receptor alpha-chain; variable region. SOURCE Mouse (strain BALB/c) T-cell hybridoma 2B11, cDNA to mRNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 312) AUTHORS Taylor,A.H., Haberman,A.M., Gerhard,W. and Caton,A.J. TITLE Structurally diverse T cells can recognize an influenza antigen/MHC complex in the same common orientation JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by A.H.Taylor, 16-MAY-1990. Author address: A.H.Taylor Wistar Institute of Anatomy and Biology 3601 Spruce Street Philadelphia, PA 19104 Tel: (215) 898-3839 FEATURES from to/span description pept < 1 > 312 T-cell receptor alpha-chain (AA at 1) BASE COUNT 89 a 79 c 79 g 64 t 1 others ORIGIN 1 aatccgtggg ccctgagngt ccacgagggt gaaagtgtca cggtgaattg tagttacaag 61 acatccataa ctgccctaca gtggtacaga cagaagtcag gcgaaggccc tgcccagcta 121 atcttaatac gttcaaatga gagagagaag cgcaatggaa gactcagagc cacccttgac 181 acctccagcc agagcagctc cttgtccatc actgctactc ggtgtgaaga caccgctgtg 241 tacttctgtg ctactgagac aggcaatact agaaaacaca tctttgggct ggggacaact 301 ttgcaagtgc aa // LOCUS MUSTCBYAO 153 bp ss-mRNA ROD 17-JUL-1990 DEFINITION Mouse T-cell receptor active beta-chain mRNA V-J-region, partial cds, from hybridoma LD1. ACCESSION M34195 KEYWORDS T-cell receptor; T-cell receptor beta-chain; joining exon; variable region. SOURCE Mouse (strain BALB/c) T-cell hybridoma LD1, cDNA to mRNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 153) AUTHORS Taylor,A.H., Haberman,A.M., Gerhard,W. and Caton,A.J. TITLE Structurally diverse T cells can recognize an influenza antigen/MHC complex in the same common orientation JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by A.H.Taylor, 16-MAY-1990. Author address: A.H.Taylor Wistar Institute of Anatomy and Biology 3601 Spruce Street Philadelphia, PA 19104 Tel: (215) 898-3839 FEATURES from to/span description pept < 1 > 153 T-cell receptor beta-chain (AA at 1) recomb 122 123 V-region end/J-region start BASE COUNT 39 a 37 c 36 g 41 t ORIGIN 1 caaataggag atgtccctga tgggtacaag gccaccagaa caacgcaaga agacttcttc 61 ctcctgctgg aattggcttc tccctctcag acatctttgt acttctgtgc cagcagtgta 121 ggttctggaa atacgctcta ttttggagaa gga // LOCUS MUSTCBYAP 111 bp ss-mRNA ROD 17-JUL-1990 DEFINITION Mouse T-cell receptor active beta-chain mRNA V-J-region, partial cds, from hybridoma LD3. ACCESSION M34197 KEYWORDS T-cell receptor; T-cell receptor beta-chain; joining exon; variable region. SOURCE Mouse (strain BALB/c) T-cell hybridoma LD3, cDNA to mRNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 111) AUTHORS Taylor,A.H., Haberman,A.M., Gerhard,W. and Caton,A.J. TITLE Structurally diverse T cells can recognize an influenza antigen/MHC complex in the same common orientation JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by A.H.Taylor, 16-MAY-1990. Author address: A.H.Taylor Wistar Institute of Anatomy and Biology 3601 Spruce Street Philadelphia, PA 19104 Tel: (215) 898-3839 FEATURES from to/span description pept < 1 > 111 T-cell receptor beta-chain (AA at 1) recomb 73 74 V-region end/J-region start BASE COUNT 18 a 37 c 25 g 31 t ORIGIN 1 ttcctcctgc tggaattggc ttctccctct cagacatctt tgtacttctg tgccgcgtcc 61 ccgacaggga acaccgacta caccttcggc tcagggacca ggcttttggt a // LOCUS MUSTCBYAQ 321 bp ss-mRNA ROD 17-JUL-1990 DEFINITION Mouse T-cell receptor active beta-chain mRNA V-J-region, partial cds, from hybridoma MT1-14. ACCESSION M34199 KEYWORDS T-cell receptor; T-cell receptor beta-chain; joining exon; variable region. SOURCE Mouse (strain BALB/c) T-cell hybridoma MT1-14, cDNA to mRNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 321) AUTHORS Taylor,A.H., Haberman,A.M., Gerhard,W. and Caton,A.J. TITLE Structurally diverse T cells can recognize an influenza antigen/MHC complex in the same common orientation JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by A.H.Taylor, 16-MAY-1990. Author address: A.H.Taylor Wistar Institute of Anatomy and Biology 3601 Spruce Street Philadelphia, PA 19104 Tel: (215) 898-3839 FEATURES from to/span description pept < 1 > 321 T-cell receptor beta-chain (AA at 1) recomb 292 293 V-region end/J-region start BASE COUNT 106 a 81 c 61 g 73 t ORIGIN 1 gacccgaaaa ttatccagaa accaaaatat ctggtggcag tcacagggag cgaaaaaatc 61 ctgatatgcg aacagtatct aggccacaat gctatgtatt ggtatagaca aagtgctaag 121 aagcctctag agttcatgtt ttcctacagc tatcaaaaac ttatggacaa tcagactgcc 181 tcaagtcgct tccaacctca aagttcaaag aaaaaccatt tagaccttca gatcacagct 241 ctaaagcctg atgactcggc cacatacttc tgtgccagca gccccaagac acgtcaaaac 301 accttgtact ttggtgcggg c // LOCUS MUSTCBYAR 210 bp ss-mRNA ROD 17-JUL-1990 DEFINITION Mouse T-cell receptor active beta-chain mRNA V-J-region, partial cds, from hybridoma MT1-27. ACCESSION M34201 KEYWORDS T-cell receptor; T-cell receptor beta-chain; joining exon; variable region. SOURCE Mouse (strain BALB/c) T-cell hybridoma MT1-27, cDNA to mRNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 210) AUTHORS Taylor,A.H., Haberman,A.M., Gerhard,W. and Caton,A.J. TITLE Structurally diverse T cells can recognize an influenza antigen/MHC complex in the same common orientation JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by A.H.Taylor, 16-MAY-1990. Author address: A.H.Taylor Wistar Institute of Anatomy and Biology 3601 Spruce Street Philadelphia, PA 19104 Tel: (215) 898-3839 FEATURES from to/span description pept < 1 > 210 T-cell receptor beta-chain (AA at 1) recomb 167 168 V-region end/J-region start BASE COUNT 57 a 56 c 43 g 54 t ORIGIN 1 aagattatgt ttagctacaa taataagcaa ctcattgtaa acgaaacagt tccaaggcgc 61 ttctcacctc agtcttcaga taaagctcat ttgaatcttc gaatcaagtc tgtagagccg 121 gaggactctg ctgtgtatct ctgtgccagc agctatcgga caccccccta tgctgagcag 181 ttcttcggac cagggacacg actcaccgtc // LOCUS MUSTCBYAS 255 bp ss-mRNA ROD 17-JUL-1990 DEFINITION Mouse T-cell receptor active beta-chain mRNA V-J-region, partial cds, from hybridoma MT1-6. ACCESSION M34203 KEYWORDS T-cell receptor; T-cell receptor beta-chain; joining exon; variable region. SOURCE Mouse (strain BALB/c) T-cell hybridoma MT1-6, cDNA to mRNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 255) AUTHORS Taylor,A.H., Haberman,A.M., Gerhard,W. and Caton,A.J. TITLE Structurally diverse T cells can recognize an influenza antigen/MHC complex in the same common orientation JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by A.H.Taylor, 16-MAY-1990. Author address: A.H.Taylor Wistar Institute of Anatomy and Biology 3601 Spruce Street Philadelphia, PA 19104 Tel: (215) 898-3839 FEATURES from to/span description pept < 1 > 255 T-cell receptor beta-chain (AA at 1) recomb 219 220 V-region end/J-region start BASE COUNT 57 a 68 c 77 g 53 t ORIGIN 1 cagtatccct ggatgagctg gtatcagcag gatctccaaa agcaactaca gtggctgttc 61 actctgcgga gtcctgggga caaagaggtc aaatctcttc ccggtgctga ttacctggcc 121 acacgggtca ctgatacgga gctgaggctg caagtggcca acatgagcca gggcagaacc 181 ttgtactgca cctgcagtgc ggggactggg ggggctacta acaccttgta ctttggtgcg 241 ggcacccgac tatcg // LOCUS MUSTCBYAT 213 bp ss-mRNA ROD 17-JUL-1990 DEFINITION Mouse T-cell receptor active beta-chain mRNA V-J-region, partial cds, T-cell clone V2.1. ACCESSION M34205 KEYWORDS T-cell receptor; T-cell receptor beta-chain; joining exon; variable region. SOURCE Mouse (strain BALB/c) T-cell lymphoid clone V2.1, cDNA to mRNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 213) AUTHORS Taylor,A.H., Haberman,A.M., Gerhard,W. and Caton,A.J. TITLE Structurally diverse T cells can recognize an influenza antigen/MHC complex in the same common orientation JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by A.H.Taylor, 16-MAY-1990. Author address: A.H.Taylor Wistar Institute of Anatomy and Biology 3601 Spruce Street Philadelphia, PA 19104 Tel: (215) 898-3839 FEATURES from to/span description pept < 1 > 213 T-cell receptor beta-chain (AA at 1) recomb 171 172 V-region end/J-region start BASE COUNT 48 a 59 c 58 g 48 t ORIGIN 1 ctgaggctga tccattattc atatggtgct ggcagcactg agaaaggaga tatccctgat 61 ggatacaagg cctccagacc aagccaagag aacttctccc tcattctgga gttggctacc 121 ccctctcaga catcagtgta cttctgtgcc agcggtggcg gccgggggag ttatgctgag 181 cagttcttcg gaccagggac acgactcacc gtc // LOCUS MUSTCBYAU 207 bp ss-mRNA ROD 17-JUL-1990 DEFINITION Mouse T-cell receptor active beta-chain mRNA V-J-region, partial cds, from hybridoma MT1-33. ACCESSION M34207 KEYWORDS T-cell receptor; T-cell receptor beta-chain; joining exon; variable region. SOURCE Mouse (strain BALB/c) T-cell hybridoma MT1-33, cDNA to mRNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 207) AUTHORS Taylor,A.H., Haberman,A.M., Gerhard,W. and Caton,A.J. TITLE Structurally diverse T cells can recognize an influenza antigen/MHC complex in the same common orientation JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by A.H.Taylor, 16-MAY-1990. Author address: A.H.Taylor Wistar Institute of Anatomy and Biology 3601 Spruce Street Philadelphia, PA 19104 Tel: (215) 898-3839 FEATURES from to/span description pept < 1 > 207 T-cell receptor beta-chain (AA at 1) recomb 163 164 V-region end/J-region start BASE COUNT 65 a 54 c 36 g 52 t ORIGIN 1 ctagagttca tgttttccta cagctatcaa aaacttatgg acaatcagac tgcctcaagt 61 cgcttccaac ctcaaagttc aaagaaaaac catttagacc ttcagatcac agctctaaag 121 cctgatgact cggccacata cttctgtgcc agcagcaaaa gggccaacga aagattattt 181 ttcggtcatg gaaccaagct gtctgtc // LOCUS MUSTCBYAV 156 bp ss-mRNA ROD 17-JUL-1990 DEFINITION Mouse T-cell receptor active beta-chain mRNA V-J-region, partial cds, from hybridoma MT1-7. ACCESSION M34209 KEYWORDS T-cell receptor; T-cell receptor beta-chain; joining exon; variable region. SOURCE Mouse (strain BALB/c) T-cell hybridoma MT1-7, cDNA to mRNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 156) AUTHORS Taylor,A.H., Haberman,A.M., Gerhard,W. and Caton,A.J. TITLE Structurally diverse T cells can recognize an influenza antigen/MHC complex in the same common orientation JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by A.H.Taylor, 16-MAY-1990. Author address: A.H.Taylor Wistar Institute of Anatomy and Biology 3601 Spruce Street Philadelphia, PA 19104 Tel: (215) 898-3839 FEATURES from to/span description pept < 1 > 156 T-cell receptor beta-chain (AA at 1) recomb 118 119 V-region end/J-region start BASE COUNT 30 a 48 c 40 g 38 t ORIGIN 1 cctgatgggt acaaggccac cagaacaacg caagaagact tcttcctcct gctggaattg 61 gcttctccct ctcagacatc tttgtacttc tgtgccagca gtgtccgggt ctgggggcct 121 gaacagtact tcggtcccgg caccaggctc acggtt // LOCUS MUSTCBYAW 132 bp ss-mRNA ROD 17-JUL-1990 DEFINITION Mouse T-cell receptor active beta-chain mRNA V-J-region, partial cds, from hybridoma P1F12C4. ACCESSION M34211 KEYWORDS T-cell receptor; T-cell receptor beta-chain; joining exon; variable region. SOURCE Mouse (strain BALB/c) T-cell hybridoma P1F12C4, cDNA to mRNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 132) AUTHORS Taylor,A.H., Haberman,A.M., Gerhard,W. and Caton,A.J. TITLE Structurally diverse T cells can recognize an influenza antigen/MHC complex in the same common orientation JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by A.H.Taylor, 16-MAY-1990. Author address: A.H.Taylor Wistar Institute of Anatomy and Biology 3601 Spruce Street Philadelphia, PA 19104 Tel: (215) 898-3839 FEATURES from to/span description pept < 1 > 132 T-cell receptor beta-chain (AA at 1) recomb 94 95 V-region end/J-region start BASE COUNT 35 a 39 c 28 g 30 t ORIGIN 1 ccaagccaag agaacttctc cctcattctg gagttggcta ccccctctca gacatcagtg 61 tacttctgtg ccagcggtgc cagacaggca aacacagaag tcttctttgg taaaggaacc 121 agactcacag tt // LOCUS MUSTCBYAX 303 bp ss-mRNA ROD 17-JUL-1990 DEFINITION Mouse T-cell receptor active beta-chain mRNA V-J-region, partial cds, from hybridoma P1D3A6. ACCESSION M34213 KEYWORDS T-cell receptor; T-cell receptor beta-chain; joining exon; variable region. SOURCE Mouse (strain BALB/c) T-cell hybridoma P1D3A6, cDNA to mRNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 303) AUTHORS Taylor,A.H., Haberman,A.M., Gerhard,W. and Caton,A.J. TITLE Structurally diverse T cells can recognize an influenza antigen/MHC complex in the same common orientation JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by A.H.Taylor, 16-MAY-1990. Author address: A.H.Taylor Wistar Institute of Anatomy and Biology 3601 Spruce Street Philadelphia, PA 19104 Tel: (215) 898-3839 FEATURES from to/span description pept < 1 > 303 T-cell receptor beta-chain (AA at 1) recomb 259 260 V-region end/J-region start BASE COUNT 80 a 72 c 79 g 72 t ORIGIN 1 aaggtgacag taacaggagg aaacgtgaca ttgagctgtc gccagactaa tagccacaac 61 tacatgtact ggtatcggca ggacactggg catgggctga ggctgatcca ttactcatat 121 ggtgctggca accttcaaat aggagatgtc cctgatgggt acaaggccac cagaacaacg 181 caagaagact tcttcctcct gctggaattg gcttctccct ctcagacatc tttgtacttc 241 tgtgccagca gtgcaggagc tggaaatacg ctctattttg gagaaggaag ccggctcatt 301 gtt // LOCUS MUSTCBYAY 159 bp ss-mRNA ROD 17-JUL-1990 DEFINITION Mouse T-cell receptor active beta-chain mRNA V-J-region, partial cds, from hybridoma 1E1O. ACCESSION M34215 KEYWORDS T-cell receptor; T-cell receptor beta-chain; joining exon; variable region. SOURCE Mouse (strain BALB/c) T-cell hybridoma 1E1O, cDNA to mRNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 159) AUTHORS Taylor,A.H., Haberman,A.M., Gerhard,W. and Caton,A.J. TITLE Structurally diverse T cells can recognize an influenza antigen/MHC complex in the same common orientation JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by A.H.Taylor, 16-MAY-1990. Author address: A.H.Taylor Wistar Institute of Anatomy and Biology 3601 Spruce Street Philadelphia, PA 19104 Tel: (215) 898-3839 FEATURES from to/span description pept < 1 > 159 T-cell receptor beta-chain (AA at 1) recomb 113 114 V-region end/J-region start BASE COUNT 35 a 40 c 39 g 45 t ORIGIN 1 gatgtccctg atgggtacaa ggccaccaga acaacgcaag aagacttctt cctcctgctg 61 gaattggctt ctccctctca gacatctttg tacttctgtg ccagcagtgt gggttctgga 121 aatacgctct attttggaga aggaagccgg ctcattgtt // LOCUS MUSTCBYAZ 321 bp ss-mRNA ROD 17-JUL-1990 DEFINITION Mouse T-cell receptor active beta-chain mRNA V-J-region, partial cds, from hybridoma 7/6AH1. ACCESSION M34217 KEYWORDS T-cell receptor; T-cell receptor beta-chain; joining exon; variable region. SOURCE Mouse (strain BALB/c) T-cell hybridoma 7/6AH1, cDNA to mRNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 321) AUTHORS Taylor,A.H., Haberman,A.M., Gerhard,W. and Caton,A.J. TITLE Structurally diverse T cells can recognize an influenza antigen/MHC complex in the same common orientation JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by A.H.Taylor, 16-MAY-1990. Author address: A.H.Taylor Wistar Institute of Anatomy and Biology 3601 Spruce Street Philadelphia, PA 19104 Tel: (215) 898-3839 FEATURES from to/span description pept < 1 > 321 T-cell receptor beta-chain (AA at 1) recomb 275 276 V-region end/J-region start BASE COUNT 86 a 78 c 81 g 76 t ORIGIN 1 acccaaagcc ctagaaacaa ggtgacagta acaggaggaa acgtgacatt gagctgtcgc 61 cagactaata gccacaacta catgtactgg tatcggcagg acactgggca tgggctgagg 121 ctgatccatt actcatatgg tgctggcaac cttcaaatag gagatgtccc tgatgggtac 181 aaggccacca gaacaacgca agaagacttc ttcctcctgc tggaattggc ttctccctct 241 cagacatctt tgtacttctg tgccagcagt gtgggttctg gaaatacgct ctattttgga 301 gaaggaagcc ggctcattgt t // LOCUS MUSTCBYBA 339 bp ss-mRNA ROD 17-JUL-1990 DEFINITION Mouse T-cell receptor active beta-chain mRNA V-J-region, partial cds, from hybridoma 2B11. ACCESSION M34219 KEYWORDS T-cell receptor; T-cell receptor beta-chain; joining exon; variable region. SOURCE Mouse (strain BALB/c) T-cell hybridoma 2B11, cDNA to mRNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 339) AUTHORS Taylor,A.H., Haberman,A.M., Gerhard,W. and Caton,A.J. TITLE Structurally diverse T cells can recognize an influenza antigen/MHC complex in the same common orientation JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by A.H.Taylor, 16-MAY-1990. Author address: A.H.Taylor Wistar Institute of Anatomy and Biology 3601 Spruce Street Philadelphia, PA 19104 Tel: (215) 898-3839 FEATURES from to/span description pept < 1 > 339 T-cell receptor beta-chain (AA at 1) recomb 294 295 V-region end/J-region start BASE COUNT 90 a 88 c 88 g 73 t ORIGIN 1 gaggctgcag tcacccaaag ccctagaaac aaggtgacag taacaggagg aaacgtgaca 61 ttgagctgtc gccagactaa tagccacaac tacatgtact ggtatcggca ggacactggg 121 catgggctga ggctgatcca ttactcatat ggtgctggca accttcaaat aggagatgtc 181 cctgatgggt acaaggccac cagaacaacg caagaagact tcttcctcct gctggaattg 241 gcttctccct ctcagacatc tttgtacttc tgtgccagca ggagacaggg gcctagtcaa 301 aacaccttgt actttggtgc gggcacccga ctatcggtg // LOCUS CEACAEVA 264 bp ds-DNA VRL 17-JUL-1990 DEFINITION Caprine arthritis-encephalitis lentivirus tat protein gene, complete cds. ACCESSION M34092 KEYWORDS tat protein. SOURCE Caprine arthritis-encephalitis lentivirus (strain Cork) DNA, from goat synovial membrane, clone pCol.9. ORGANISM Caprine arthritis encephalitis virus Viridae; ss-RNA enveloped viruses; Positive strand RNA virus; Retroviridae; Lentivirinae. REFERENCE 1 (bases 1 to 264) AUTHORS Jackson,M.K., Knowles,D.P., Stem,T.A., Harwood,W.G., Robinson,M.M. and Cheevers,W.P. TITLE Genetic structure of the pol-env region of the Caprine arthritis- encephalitis lentivirus genome: Possible role in trans-activation of the viral long terminal repeat JOURNAL Unpublished (1990) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by W.P.Cheevers, 08-MAY-1990. Author address: W.P.Cheevers Washington State University Dept Veterinary Microbiology Pullman, WA 94164-7040 email: b384@wsuvmsl.bitnet FEATURES from to/span description pept 1 264 tat protein BASE COUNT 97 a 40 c 80 g 47 t ORIGIN 1 atgagtgaag aactgcctca aagaagggag acacatccag aagaacttgt aaggaacgta 61 cgggaaagag aaagggatac atggcaatgg acaagcatca gagtacctga ggaaatactg 121 caaagatggc ttgctatgct taggtcaggc agaaatagaa agaaagtgta tagagaaatg 181 caaaaatgga tgtggataca tcccaagggg cctgtgatta gggcctgtgg atgcagacta 241 tgtaacccgg ggtggggaac ataa // LOCUS CEACAEVB 264 bp ds-DNA VRL 17-JUL-1990 DEFINITION Caprine arthritis-encephalitis lentivirus tat protein gene, complete cds. ACCESSION M34093 KEYWORDS tat protein. SOURCE Caprine arthritis-encephalitis lentivirus (strain G63) DNA, from goat synovial membrane, clone pC63-49. ORGANISM Caprine arthritis encephalitis virus Viridae; ss-RNA enveloped viruses; Positive strand RNA virus; Retroviridae; Lentivirinae. REFERENCE 1 (bases 1 to 264) AUTHORS Jackson,M.K., Knowles,D.P., Stem,T.A., Harwood,W.G., Robinson,M.M. and Cheevers,W.P. TITLE Genetic structure of the pol-env region of the Caprine arthritis- encephalitis lentivirus genome: Possible role in trans-activation of the viral long terminal repeat JOURNAL Unpublished (1990) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by W.P.Cheevers, 08-MAY-1990. Author address: W.P.Cheevers Washington State University Dept Veterinary Microbiology Pullman, WA 94164-7040 email: b384@wsuvmsl.bitnet FEATURES from to/span description pept 1 264 tat protein BASE COUNT 99 a 43 c 75 g 47 t ORIGIN 1 atgagtgaaa gactgcctca aagaagggaa gtacatccag aggaacgtgt aaggaacata 61 tgggaaagag aaagggacac atggcaatgg acaagcatca gagtacctga agaaatactg 121 caaagatggc tcgctatgct taggtcaggc agaaatagaa acaaagtgta tagagaaatg 181 caaaaatgga tgtcgataca tcccaaggcg cctgtgatta ggccttgtgg atgcagacta 241 tgtaacccgg ggtgggaaac ataa // LOCUS FLAPR834HA 540 bp ss-RNA VRL 17-JUL-1990 DEFINITION Influenza A/PR/8/34, hemagglutinin (seg 4) gene. partial cds. ACCESSION M34335 KEYWORDS glycoprotein; hemagglutinin. SOURCE Influenza A/PR/8/34 RNA, passed in bovine MBDK cells, originally from human. ORGANISM Influenza virus type A Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses; Orthomyxoviridae; Influenzavirus; Influenza A viruses. REFERENCE 1 (bases 1 to 540) AUTHORS Bressoud,A., Whitcomb,J., Pourzand,C., Haller,O. and Cerutti,P. TITLE Rapid detection of influenza virus H1 by the polymerase chain reaction JOURNAL Biochem. Biophys. Res. Commun. 167, 425-430 (1990) STANDARD simple staff_review COMMENT Sequence reported is + strand. FEATURES from to/span description pept < 1 > 540 hemagglutinin (AA at 2) BASE COUNT 179 a 121 c 123 g 117 t ORIGIN 1 cctactggtc ctgttatgtg cacttgcagc tgcagatgca gacacaatat gtataggcta 61 ccatgcgaac aattcaaccg acactgttga cacagtactc gagaagaatg tgacagtgac 121 acactctgtt aacctgctcg aagacagcca caacggaaaa ctatgtagat taaaaggaat 181 agccccacta caattgggga aatgtaacat cgccggatgg ctcttgggaa acccagaatg 241 cgacccactg cttccagtga gatcatggtc ctacattgta gaaacaccaa actctgagaa 301 tggaatatgt tatccaggag atttcatcga ctatgaggag ctgagggagc aattgagctc 361 agtgtcatca ttcgaaagat tcgaaatatt tcccaaagaa agctcatggc ccaaccacaa 421 cacaaacgga gtaacggcag catgctccca tgaggggaaa agcagttttt acagaaattt 481 gctatggctg acggagaagg agggctcata cccaaagctg aaaaattctt atgtgaacaa // LOCUS HUMGPPSBAA 355 bp ds-DNA PRI 17-JUL-1990 DEFINITION Human pregnancy-specific beta-1 glycoprotein C-D gene, intron C1. ACCESSION M34422 KEYWORDS beta-1 glycoprotein. SOURCE Human placenta, clone PS-beta-G C. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 355) AUTHORS Streydio,C., Swillens,S., Georges,M., Szpirer,C. and Vassart,G. TITLE structure, evolution and chromosomal localization of the human pregnancy-specific beta-1 glycoprotein gene family JOURNAL Genomics 6, 579-592 (1990) STANDARD simple staff_review FEATURES from to/span description IVS 1 355 pregnancy-specific beta-1 glycoprotein intron C1 BASE COUNT 110 a 64 c 62 g 119 t ORIGIN Chromosome 19. 1 gtaagtggat cccagcatcg ttggcaatag ggttttaggt ggagtctatc tggcattcag 61 agaagagtca ggaaaacaat tgtattccca gcctgtgtcc catgggcaca agcaaatccc 121 aaattctcct cctgaaccct ccaaatttgt ctaagaactt cgaaaacttt aacaaacagg 181 ctgatatctt cataatattc ccagcctaga ccaagcagga agaacattga tttcattgaa 241 ataattgata ataatgaaga taatgttttt atgattttta tttgaaaatt tgctgattct 301 ttaaatggtt tgttttctac attgatggaa tttttctctt ttaatctatc tacag // LOCUS HUMGPPSBD 1418 bp ss-mRNA PRI 17-JUL-1990 DEFINITION Human pregnancy-specific beta-1 glycoprotein mRNA, complete cds. ACCESSION M34421 KEYWORDS beta-1 glycoprotein. SOURCE Human placenta, cDNA to mRNA, clone PS-beta-G B. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 1418) AUTHORS Streydio,C., Swillens,S., Georges,M., Szpirer,C. and Vassart,G. TITLE Structure, evolution and chromosomal localization of the human pregnancy-specific beta-1 glycoprotein gene family JOURNAL Genomics 6, 579-592 (1990) STANDARD simple staff_review FEATURES from to/span description pept 79 1359 pregnancy-specific beta-1 glycoprotein precursor /hgml_locus_uid="LG0073P" /nomgen="PSBG1" /map="19q13.1-q13.2" sigp 79 180 pregnancy-specific beta-1 glycoprotein signal peptide matp 181 1356 pregnancy-specific beta-1 glycoprotein mRNA 1 1418 pregnancy-specific beta-1 glycoprotein mRNA BASE COUNT 408 a 398 c 296 g 316 t ORIGIN Chromosome 19. 1 cagctgacag ccgtgctcag acagcttctg gatcctaggc tcatctccac agaggagaac 61 acgcaggcag cagagaccat ggggcccctc ccagcccctt cctgcacaca gcgcatcacc 121 tggaaggggc tcctgctcac agcatcactt ttaaacttct ggaacccgcc caccactgcc 181 gaagtcacga ttgaagccca gccacccaaa gtttctgagg ggaaggatgt tcttctactt 241 gtccacaatt tgccccagaa tcttcctggc tacttctggt acaaagggga aatgacggac 301 ctctaccatt acattatatc gtatatagtt gatggtaaaa taattatata tgggcctgca 361 tacagtggaa gagaaacagt atattccaac gcatccctgc tgatccagaa tgtcacccgg 421 aaggatgcag gaacctacac cttacacatc ataaagcgag gtgatgagac tagagaagaa 481 attcgacatt tcaccttcac cttatacttg gagactccca agccctacat ctccagcagc 541 aacttaaacc ccagggaggc catggaggct gtgcgcttaa tctgtgatcc tgagactctg 601 gacgcaagct acctatggtg gatgaatggt cagagcctcc ctgtgactca caggttgcag 661 ctgtccaaaa ccaacaggac cctctatcta tttggtgtca caaagtatat tgcaggaccc 721 tatgaatgtg aaatacggaa cccagtgagt gccagtcgca gtgacccagt caccctgaat 781 ctcctcccga agctgcccat cccctacatc accatcaaca acttaaaccc cagggagaat 841 aaggatgtct tagccttcac ctgtgaacct aagagtgaga actacaccta catttggtgg 901 ctaaacggtc agagcctccc cgtcagtccc ggggtaaagc gacccattga aaacaggata 961 ctcattctac ccagtgtcac gagaaatgaa acaggaccct atcaatgtga aatacgggac 1021 cgatatggtg gcctccgcag taacccagtc atcctaaatg tcctctatgg tccagacctc 1081 cccagaattt acccttcatt cacctattac cgttcaggag aaaacctcga cttgtcctgc 1141 ttcacggaat ctaacccacc ggcagagtat ttttggacaa ttaatgggaa gtttcagcaa 1201 tcaggacaaa agctctttat cccccaaatt actagaaatc atagcgggct ctatgcttgc 1261 tctgttcata actcagccac tggcaaggaa atctccaaat ccatgacagt caaagtctct 1321 ggtccctgcc atggagacct gacagagtct cagtcatgac tgcaacaact gagacactga 1381 gaaaaagaac aggctgatac cttcatgaaa ttcaagac // LOCUS HUMGPPSBE 1856 bp ss-mRNA PRI 17-JUL-1990 DEFINITION Human pregnancy-specific beta-1 glycoprotein mRNA, complete cds. ACCESSION M34420 KEYWORDS beta-1 glycoprotein. SOURCE Human placenta, cDNA to mRNA, clone PS-beta-G A. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 1856) AUTHORS Streydio,C., Swillens,S., Georges,M., Szpirer,C. and Vassart,G. TITLE Structure, evolution and chromosomal localization of the human pregnancy-specific beta-1 glycoprotein gene family JOURNAL Genomics 6, 579-592 (1990) STANDARD simple staff_review FEATURES from to/span description pept 82 1368 pregnancy-specific beta-1 glycoprotein precursor /hgml_locus_uid="LG0073P" /nomgen="PSBG1" /map="19q13.1-q13.2" sigp 82 183 pregnancy-specific beta-1 glycoprotein signal peptide matp 184 1365 pregnancy-specific beta-1 glycoprotein mRNA 1 1856 pregnancy-specific beta-1 glycoprotein mRNA BASE COUNT 544 a 476 c 364 g 472 t ORIGIN Chromosome 19. 1 gcacagctga gagccatgct caggaagttt ctggatccta ggctcagctc cacagaggag 61 aacacgcagg cagcagagac catggggccc ctctcagccc ctccctgcac acagcgcatc 121 acctggaagg ggctcctgct cacagcatca cttttaaact tctggaaccc gcctaccact 181 gcccaagtca cgattgaagc cgagccaacc aaagtttcca aggggaagga cgttcttcta 241 cttgtccaca atttgcccca gaatcttgct ggctacatct ggtacaaagg gcaaatgaag 301 gacctctacc attacattac atcatacgta gtagatggtc aaataattat atatgggcct 361 gcatacagtg gacgagaaac agtatattcc aatgcatccc tgctgatcca gaatgtcacc 421 cgggaggacg caggatccta caccttacac atcgtaaagc gaggtgatgg gactagagga 481 gaaactggac atttcacctt caccttatac ctggagactc ccaagccctc catctccagc 541 agcaacttat accccaggga ggacatggag gctgtgagct taacctgtga tcctgagact 601 ccggacgcaa gctacctgtg gtggatgaat ggtcagagcc tccctatgac tcacagcttg 661 cagttgtcca aaaacaaaag gaccctcttt ctatttggtg tcacaaagta cactgcagga 721 ccctatgaat gtgaaatacg gaacccagtg agtgccagcc gcagtgaccc agtcaccctg 781 aatctcctcc cgaagctgcc caagccctac atcaccatca acaacttaaa ccccagggag 841 aataaggatg tcttagcctt cacctgtgaa cctaagagtg agaactacac ctacatttgg 901 tggctaaatg gtcagagcct cccggtcagt cccagggtaa agcgacccat tgaaaacagg 961 atcctcattc tacccagtgt cacgagaaat gaaacaggac cctatcaatg tgaaatacag 1021 gaccgatatg gtggcatccg cagttaccca gtcaccctga atgtcctcta tggtccagac 1081 ctccccagaa tttacccttc attcacctat taccattcag gagaaaacct ctacttgtcc 1141 tgcttcgcgg actctaaccc accagcagaa tattcttgga caattaatgg gaagtttcag 1201 ctatcaggac aaaagctctt tatcccccag attactacaa agcatagcgg gctctatgct 1261 tgctctgttc gtaactcagc cactggcatg gaaagctcca aatccatgac agtcaaagtc 1321 tctgctcctt caggaacagg acatcttcct ggccttaatc cattatagca gccgtgatgt 1381 catttctgta tttcaggaag actggcagac agttgctttc attcttcctc aaagtattta 1441 ccatcagcta cagtccaaaa ttgctttttg ttcaaggaga tttatgaaaa gactctgaca 1501 aggactcttg aatacaagtt cctgataact tcaagatcat accactggac taagaacttt 1561 caaaatttta atgaacaggc tgatacttca tgaaattcaa gacaaagaaa aaaacccaat 1621 tttattggac taaatagtca aaacaatgtt ttcataattt tctatttgaa aatgtgctga 1681 ttctttgaat gttttattct ccagatttat gcactttttt tcttcagcaa ttggtaaagt 1741 atacttttgt aaacaaaaat tgaaacattt gcttttgctc cctaagtgcc ccagaattgg 1801 gaaactattc aggagtattc atatgtttat ggtaataaag ttatctgcac aagttc // LOCUS HUMGPPSBF 2004 bp ss-mRNA PRI 17-JUL-1990 DEFINITION Human pregnancy-specific beta-1 glycoprotein mRNA, complete cds. ACCESSION M23575 KEYWORDS beta-1 glycoprotein. SOURCE Human placenta, cDNA to mRNA, clone pSP1-i. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 2004) AUTHORS Rooney,B.C., Horne,C.H.W. and Hardman,N. TITLE Molecular cloning of a cDNA for human pregnancy-specific beta-1-glycoprotein: Homology with human carcinoembryonic antigen and related proteins JOURNAL Gene 71, 439-449 (1988) STANDARD simple staff_review FEATURES from to/span description pept 232 1518 pregnancy-specific beta-1 glycoprotein precursor /hgml_locus_uid="LG0073P" /nomgen="PSBG1" /map="19q13.1-q13.2" sigp 232 333 pregnancy-specific beta-1 glycoprotein signal peptide matp 334 1515 pregnancy-specific beta-1 glycoprotein BASE COUNT 579 a 510 c 406 g 509 t ORIGIN 1 gggcgggaca actggtctga gtactatggc tgattttcgc tgtctggcat tgagaagcca 61 cacgcccctt ttgcttagga ggcctctctg ctggaggatg acgatggcat ggtttatcta 121 aggccactga caagtcatca atataggaca gcacagctga gagccatgct caggaagttt 181 ctggatccta ggctcagctc cacagaggag aacacgcagg cagcagagac catggggccc 241 ctctcagccc ctccctgcac acagcgcatc acctggaagg ggctcctgct cacagcatca 301 cttttaaact tctggaaccc gcctaccact gcccaagtca cgattgaagc cgagccaacc 361 aaagtttcca aggggaagga cgttcttcta cttgtccaca atttgcccca gaatcttgct 421 ggctacatct ggtacaaagg gcaaatgaag gacctctacc attacattac atcatacgta 481 gtagatggtc aaataattat atatgggcct gcatacagtg gacgagaaac agtatattcc 541 aatgcatccc tgctgatcca gaatgtcacc cgggaggacg caggatccta caccttacac 601 atcgtaaagc gaggtgatgg gactagagga gaaactggac atttcacctt caccttatac 661 ctggagactc ccaagccctc catctccagc agcaacttat accccaggga ggacatggag 721 gctgtgagct taacctgtga tcctgagact ccggacgcaa gctacctgtg gtggatgaat 781 ggtcagagcc tccctatgac tcacagcttg cagttgtcca aaaacaaaag gaccctcttt 841 ctatttggtg tcacaaagta cactgcagga ccctatgaat gtgaaatacg gaacccagtg 901 agtgccagcc gcagtgaccc agtcaccctg aatctcctcc cgaagctgcc caagccctac 961 atcaccatca acaacttaaa ccccagggag aataaggatg tcttagcctt cacctgtgaa 1021 cctaagagtg agaactacac ctacatttgg tggctaaatg gtcagagcct cccggtcagt 1081 cccagggtaa agcgacccat tgaaaacagg atcctcattc tacccagtgt cacgagaaat 1141 gaaacaggac cctatcaatg tgaaatacag gaccgatatg gtggcatccg cagttaccca 1201 gtcaccctga atgtcctcta tggtccagac ctccccagaa tttacccttc attcacctat 1261 taccattcag gagaaaacct ctacttgtcc tgcttcgcgg actctaaccc accagcagaa 1321 tattcttgga caattaatgg gaagtttcag ctatcaggac aaaagctctt tatcccccag 1381 attactacaa agcatagcgg gctctatgct tgctctgttc gtaactcagc cactggcatg 1441 gaaagctcca aatccatgac agtcaaagtc tctgctcctt caggaacagg acatcttcct 1501 ggccttaatc cattatagca gccgtgatgt catttctgta tttcaggaag actggcagac 1561 agttgctttc attcttcctc aaagtattta ccatcagcta cagtccaaaa ttgctttttg 1621 ttcaaggaga tttatgaaaa gactctgaca aggactcttg aatacaagtt cctgataact 1681 tcaagatcat acatggacta agaactttca aaattttaat gaacaggctg atacttcatg 1741 aaattcaaga caaagaaaaa aacccaattt tattggacta aatagtcaaa acaatgtttt 1801 cataattttc tatttgaaaa tgtgctgatt ctttgaatgt tttattctcc agatttatgc 1861 actttttttc ttcagcaatt ggtaaagtat acttttgtaa acaaaaattg aaacatttgc 1921 ttttgctccc taagtgcccc agaattggga aactattcag gagtattcat atgtttatgg 1981 taataaagtt atctgcacaa accc // LOCUS HUMLEUELA 920 bp ss-mRNA PRI 17-JUL-1990 DEFINITION Human elastase/medullasin mRNA, complete cds. ACCESSION M34379 KEYWORDS elastase; medullasin. SOURCE Human leukemic cell line ML3, cDNA to mRNA, clone pSRHLE. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 920) AUTHORS Okano,K., Aoki,Y., Shimizu,H. and Naruto,M. TITLE Functional expression of human leukocyte elastase (HLE)/medullasin in eukaryotic cells JOURNAL Biochem. Biophys. Res. Commun. 167, 1326-1332 (1990) STANDARD full staff_review FEATURES from to/span description pept 39 842 elastase/medullasin precursor (EC 3.4.21.37) sigp 39 119 elastase/medullasin signal peptide matp 126 839 elastase/medullasin BASE COUNT 141 a 329 c 287 g 163 t ORIGIN 1 gcacggaggg gcagagaccc cggagcccca gccccaccat gaccctcggc cgccgactcg 61 cgtgtctttt cctcgcctgt gtcctgccgg ccttgctgct ggggggcacc gcgctggcct 121 cggagattgt ggggggccgg cgagcgcggc cccacgcgtg gcccttcatg gtgtccctgc 181 agctgcgcgg aggccacttc tgcggcgcca ccctgattgc gcccaacttc gtcatgtcgg 241 ccgcgcactg cgtggcgaat gtaaacgtcc gcgcggtgcg ggtggtcctg ggagcccata 301 acctctcgcg gcgggagccc acccggcagg tgttcgccgt gcagcgcatc ttcgaaaacg 361 gctacgaccc cgtaaacttg ctcaacgaca tcgtgattct ccagctcaac gggtcggcca 421 ccatcaacgc caacgtgcag gtggcccagc tgccggctca gggacgccgc ctgggcaacg 481 gggtgcagtg cctggccatg ggctggggcc ttctgggcag gaaccgtggg atcgccagcg 541 tcctgcagga gctcaacgtg acggtggtga cgtccctctg ccgtcgcagc aacgtctgca 601 ctctcgtgag gggccggcag gccggcgtct gtttcgggga ctccggcagc cccttggtct 661 gcaacgggct aatccacgga attgcctcct tcgtccgggg aggctgcgcc tcagggctct 721 accccgatgc ctttgccccg gtggcacagt ttgtaaactg gatcgactct atcatccaac 781 gctccgagga caacccctgt ccccaccccc gggacccgga cccggccagc aggacccact 841 gagaagggct gcccgggtca cctcagctgc ccacacccac actctccagc atctggcaca 901 ataaacattc tctgttttgt // LOCUS MSGIS6110 1360 bp ds-DNA BCT 17-JUL-1990 DEFINITION M.tuberculosis-50 complex IS6110 insertion sequence-like element. ACCESSION M29899 KEYWORDS insertion sequence. SOURCE M.tuberculosis (strain H37RV) DNA (cosmid library pHC79), clone I21. ORGANISM Mycobacterium tuberculosis Prokaryota; Bacteria; Firmicutes; Mycobacteria; Mycobacteriaceae. REFERENCE 1 (bases 1 to 1360) AUTHORS Thierry,D., Cave,M.D., Eisenach,K.D., Crawford,J.T., Bates,J.H., Gicquel,B. and Guesdon,J.L. TITLE IS6110 an IS-like element of Mycobacterium tuberculosis-50 complex JOURNAL Nucleic Acids Res. 18, 188-188 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by D.Thierry 15-NOV-1989. BASE COUNT 269 a 439 c 432 g 220 t ORIGIN 1 cgatgaaccg ccccggcatg tccggagact ccagttcttg gaaaggatgg ggtcatgtca 61 ggtggttcat cgaggaggta cccgccggag ctgcgtgagc gggcggtgcg gatggtcgca 121 gagatccgcg gtcagcacga ttcggagtgg gcagcgatca gtgaggtcgc ccgtctactt 181 ggtgttggct gcgcggagac ggtgcgtaag tgggtgcgcc aggcgcaggt cgatgccggc 241 gcacggcccg ggaccacgac cgaagaatcc gctgagctga agcgcttagc ggcgggacaa 301 cgccgaattg cgaagggcga acgcgatttt aaagaccgcg tcggctttct tcgcggccga 361 gctcgaccgg ccagcacgct aattaacggt tcatcgccga tcatcagggc caccgcgagg 421 gccccgatgg tttgcggtgg ggtgtcgagt cgatctgcac acagctgacc gagctgggtg 481 tgccgatcgc cccatcgacc tactacgacc acatcaaccg ggagcccagc cgccgcgagc 541 tgcgcgatgg cgaactcaag gagcacatca gccgcgtcca cgccgccaac tacggtgttt 601 acggtgcccg caaagtgtgg ctaaccctga accgtgaggg catcgaggtg gccagatgca 661 ccgtcgaacg gctgatgacc aaactcggcc tgtccgggac cacccgcggc aaagcccgca 721 ggaccacgat cgctgatccg gccacagccc gtcccgccga tctcgtccag cgccgcttcg 781 gaccaccagc acctaaccgg ctgtgggtag cagacctcac ctatgtgtcg acctgggcag 841 ggttcgccta cgtggccttt gtcaccgacg cctacgtcgc aggatcctgg gctggcgggt 901 cgcttccacg atggccacct ccatggtcct cgacgcgatc gagcaagcca tctggacccg 961 ccaacaagaa ggcgtactcg acctgaaaga cgttatccac catacggata ggggatctca 1021 gtacacatcg atccggttca gcgagcggct cgccgaggca ggcatccaac cgtcggtcgg 1081 agcggtcgga agctcctatg acaatgcact agccgagacg atcaacggcc tatacaagac 1141 cgagctgatc aaacccggca agccctggcg gtccatcgag gatgtcgagt tggccaccgc 1201 gcgctgggtc gactggttca accatcgccg cctctaccag tactgcggcg acgtcccgcc 1261 ggtcgaactc gaggctgcct actacgctca acgccagaga ccagccgccg gctgaggtct 1321 cagatcagag agtctccgga ctcaccgggg cggttcacga // LOCUS MUSIGHAAT 348 bp ss-mRNA ROD 17-JUL-1990 DEFINITION Mouse Ig J558 family active H-chain mRNA V-J3 region from hybridoma CE5, partial cds. ACCESSION M34119 KEYWORDS diversity exon; immunoglobulin heavy chain; processed gene. SOURCE Mouse (Balb/c) hybridoma CE5, cDNA to mRNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 348) AUTHORS Caton,A.J., Herlyn,D., Ross,A.H. and Koprowski,H. TITLE Identical D region sequences expressed by murine monoclonal antibodies specific for a human tumor-associated antigen JOURNAL J. Immunol. 144, 1965-1968 (1990) STANDARD full staff_review FEATURES from to/span description pept < 1 > 348 Ig H-chain V-J3 region (AA at 1) recomb 294 295 J-region end/D-region start recomb 303 304 D-region end/J-region start BASE COUNT 89 a 82 c 97 g 80 t ORIGIN Chromosome 12. 1 caggttcagc tgcagcagtc tggagctgaa ctgatgaagc ctggggcctc agtgaagata 61 tcctgcaagg ctactggcta cacattcagt aagtactgga tagagtgggt aaagcagagg 121 cctggacatg gccttgagtg gattggagag attttacctg gaagtggtag tactaaccat 181 gatgagaagt tcaagggcaa ggccacattc actgcagata catcctccaa cacagcctac 241 atgcaactca gcagcctgac atctgaggac tctgccgtct attactgtgc aagagacggt 301 ccctggtttg cttactgggg ccaagggact ctggtcactg tctctgca // LOCUS MUSIGKCSR 321 bp ss-mRNA ROD 17-JUL-1990 DEFINITION Mouse Ig active kappa-chain mRNA V-region from hybridoma GA733, partial cds. ACCESSION M34120 KEYWORDS immunoglobulin light chain; kappa-immunoglobulin; processed gene. SOURCE Mouse (Balb/c) hybridoma GA733, cDNA to mRNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 321) AUTHORS Caton,A.J., Herlyn,D., Ross,A.H. and Koprowski,H. TITLE Identical D region sequences expressed by murine monoclonal antibodies specific for a human tumor-associated antigen JOURNAL J. Immunol. 144, 1965-1968 (1990) STANDARD full staff_review FEATURES from to/span description pept < 1 > 321 Ig kappa-chain (AA at 1) BASE COUNT 87 a 83 c 74 g 77 t ORIGIN Chromosome 6. 1 gacattgtga tgacccagtc tcacaaattc atgtccacat cagtaggaga cagtgtcagc 61 atcacctgca aggccagtca ggatgtgagt actgctgtag cctggtatca acagaaacca 121 ggacaatctc ctaaactact gatttactcg gcatccgacc ggtacactgg agtccctgat 181 cgcttcactg gcagtggatc tgggacggat ttcactttca ccatcagcag tgtgcaggct 241 gaagacctgg cagtttatta ctgtcaccaa cattatatta ctcctcggac gttcggtgga 301 ggcaccaaac tggaaatcaa a // LOCUS MUSIGKCSS 321 bp ss-mRNA ROD 17-JUL-1990 DEFINITION Mouse Ig active kappa-chain mRNA V-region from hybridoma C017-1A, partial cds. ACCESSION M34121 KEYWORDS immunoglobulin light chain; kappa-immunoglobulin; processed gene. SOURCE Mouse (Balb/c) hybridoma C017-1A, cDNA to mRNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 321) AUTHORS Caton,A.J., Herlyn,D., Ross,A.H. and Koprowski,H. TITLE Identical D region sequences expressed by murine monoclonal antibodies specific for a human tumor-associated antigen JOURNAL J. Immunol. 144, 1965-1968 (1990) STANDARD full staff_review FEATURES from to/span description pept < 1 > 321 Ig kappa-chain (AA at 1) BASE COUNT 89 a 77 c 78 g 77 t ORIGIN Chromosome 6. 1 aacattgtaa tgacccaatc tcccaaatcc atgtccatgt cagtaggaga gagggtcacc 61 ttgacctgca aggccagtga gaatgtggtt acttatgttt cctggtatca acagaaacca 121 gagcaatctc ctaaactctt gatttacggg gcctccaacc ggtacactgg ggtccccgat 181 cgcttcacag gtagtggatc tgcaacagat ttcactctga ccattagtag tgtgcaagct 241 gaagaccttg cagattatca ctgtggacag ggttacagct atccgtacac gttcggaggg 301 gggaccaagc tggaaataaa a // LOCUS MUSIGKCST 318 bp ss-mRNA ROD 17-JUL-1990 DEFINITION Mouse Ig active kappa-chain mRNA V-region from hybridoma CE5, partial cds. ACCESSION M34122 KEYWORDS immunoglobulin light chain; kappa-immunoglobulin; processed gene. SOURCE Mouse (Balb/c) hybridoma CE5, cDNA to mRNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 318) AUTHORS Caton,A.J., Herlyn,D., Ross,A.H. and Koprowski,H. TITLE Identical D region sequences expressed by murine monoclonal antibodies specific for a human tumor-associated antigen JOURNAL J. Immunol. 144, 1965-1968 (1990) STANDARD full staff_review FEATURES from to/span description pept < 1 > 318 Ig kappa-chain (AA at 1) BASE COUNT 84 a 78 c 80 g 76 t ORIGIN Chromosome 6. 1 gacattgtga tgacccagtc tcagaaattc atgtccacat cagtaggaga cagggtcggc 61 atcacctgca aggccagtca ggatgtgagt actgctgtag cctggtatca acagaaatca 121 ggacaatctc ctaaactact gatttactcg gcatcctacc ggtacactgg agtccctgag 181 cgcttcgctg gcagtggatc tgggacggat ttcactttca ccatcagcag tgtgcaggct 241 gaagacctgg cagtttatta ctgtcatcaa cattatagta ctcggacgtt cggtggaggc 301 accaagctgg aaatcaaa // LOCUS PSERRSAA 1517 bp ss-rRNA RNA 17-JUL-1990 DEFINITION P.aeruginosa 16S ribosomal RNA. ACCESSION M34133 KEYWORDS 16S ribosomal RNA; ribosomal RNA; small subunit ribosomal RNA. SOURCE P.aeruginosa (strain 25330) ribosomal RNA. ORGANISM Pseudomonas aeruginosa Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Pseudomonadaceae. REFERENCE 1 (bases 1 to 1517) AUTHORS Woese,C.R. JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by C.R.Woese, 09-MAY-1990. Author address: C.R.Woese University of Illinois Dept of Microbiology 407 S. Goodwin Avenue Urbana, IL 61801 email: carl@ninja.life.uiuc.edu FEATURES from to/span description rRNA 1 1517 16S rRNA BASE COUNT 379 a 334 c 470 g 301 t 33 others ORIGIN 1 ntactgaaga gtttgatcat ggctcagatt gaacgctggc ggcaggccta acacatgcaa 61 gtcgagcgga tgaagggagc ttgctcctgg attcagcggc ggacgggtga gtaatgccta 121 ggaatctgcc tgatagtggg ggataacgtc cggaaacggg cgctaatacc gcatacgtcc 181 tgagggagaa agggggggat cctcggacct cacgctatcn gatgagccta ggtcggatta 241 gctagttggt ggggtaaagg cctaccaagg cgacgatccg taactggtct gagaggacga 301 tcagtcacac tggaactgag acacggtcca gactcctacg ggaggcagca gtggggaata 361 ttggacaatg ggcgaaagcc ngatccagcc atgccgcgtg tgtgaagaag gtcttcggat 421 tgtaaagcac tttaagttgg gaggaagggc agtaagttaa taccttgctg ttttgacgtt 481 accaacagaa taagcaccgg ctaacttcgt gccagcagcc gcggtaatac gaagggtgcg 541 agcgttaatc ggaattactg ggcgtaaagc gcgcgtaggt ggttcagcaa gttggatgtg 601 aaatccccgg gctcaacctg ggaactgcat ccnaaactac tgagctagag tacggtagag 661 ggtggtggaa tttcctgtgt agcggtgaaa tgcgtagata taggaaggaa caccagtggc 721 gaaggcgacc acctggactg atactgacac tgaggtgcga aagcgtgggg agcaaacagg 781 attagatacc ctggtagtcc acgccgtaaa cgatgtcgac tagccgttgg gatccttgag 841 atcttagtgg cgcagctaac gcgataagtc gaccgcctgg ggagtacggc cgcaaggtta 901 aaactcaaat gaattgacgg gggcnngcac aagcggtgga gcatgtggtt taattcgaag 961 caacgcgaag aaccttacct ggccttgaca tgctgagaac tttccagaga tggattggtg 1021 ccttcgggaa ctcagacaca ggtgctgcat ggctgtcgtc agctcgtgtc gtgagatgtt 1081 gggttaagtc ccgtaacgag cgcaaccctt gtccttagtt accagcacct cgggtgggca 1141 ctctaaggag actgccggtg acaaaccgga ggaaggtggg gatgacgtca agtcatcatg 1201 gcccttacgg cnagggctac acacgtgcta caatggtcgg tacaaagggt tgcgaagccg 1261 cgaggtggag ctaatcccat aaaaccgatc gtagtccgga tcgcagtctg caactcgact 1321 gcgtgaagtc ggaatcgcta gtaatcgtga atcagaatgt cacggtgaat acgttcccgg 1381 gccttgtaca caccgcccgt cacaccatgg gagtgggttg ctccagaagt agctagtcta 1441 accgcaaggg ggacggttac cacggagtga ttcatgnnnn nnnnnnnnnn gtaacaagnn 1501 nnnnnnnnnn gaacctg // LOCUS RATNESTIN 5946 bp ss-mRNA ROD 17-JUL-1990 DEFINITION Rat nestin mRNA, complete cds. ACCESSION M34384 KEYWORDS intermediate filament protein; nestin. SOURCE Rat (strain E15) embryo central nervous system, cDNA to mRNA. ORGANISM Rattus norvegicus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 5946) AUTHORS Lendahl,U., Zimmerman,L.B. and McKay,R.D.G. TITLE CNS stem cells express a new class of intermediate filament protein JOURNAL Cell 60, 585-595 (1990) STANDARD simple staff_review FEATURES from to/span description pept 128 5545 nestin (128 could be 161) mRNA 1 5495 nestin mRNA BASE COUNT 1600 a 1401 c 1871 g 1074 t ORIGIN 1 tgctggagtt ctccgcttcc gctgggtcac tgtcgccgct acttcttttc aacccctaaa 61 agctccacgg gccactccct tctctagtgc tccacgtccg cttgccctcg ggggccagac 121 cagcgacatg gagggttgcg tcggggaaga atcttttcag atgtgggagc tcaatcgacg 181 cctggaggcc tacctgaccc gggtcaagac gctagaagag caaaaccagc tgctcagcgc 241 cgagcttggg ggactccggg cgcagtccgg agacacctcc tggagagccc gagccgatga 301 cgagctggca tccctgcgga tcctcgtcga tcagcgctgg cgggagaagc tcgaggctga 361 ggtgcagcgc gacaaccttg cggaagagct ggagagcgtg gcgggccggt gccagcaggt 421 gcggcttgct cgggagcgga ccgtccagga ggccgcctgc agccggcgcg cactcgaggc 481 ggagaagaat gcgcggggct ggctgagcac ccaggcggcc gagctggagc gggagttaga 541 ggctctgcga gccgcgcacg aggaggagcg cgcacacctg aacgcccagg ccgcctgtgc 601 gcctcgccgg ccccccgcac cgccccaccg gatccccggt ccggcccccg aagtcgagga 661 tctggccagg cgactaggcg aagtgtggcg cggggcggtg cgtgactacc aggagcgcgt 721 ggctcacatg gaaagctcgc tgggtcaggc acgcgagcgg ctgagccaag ccgtgcgggg 781 cgctcgggag tgtcgcttag aggtgcaaca gctgcaggct gatcgcgaca gcctccagga 841 gcgcagagaa gcgctggaac agagattgga aggccgctgg caggaccggc tgcaggccac 901 tgataagttc cagctggctg tggaagccct ggagcaggag aagcaaggtc tacagagtca 961 gatcgctcag atcctggaag gtgggcagca actggcacac ctcaagatgt cccttagtct 1021 ggaggtggct acatacagga ctctgctgga ggctgagaac tctcggttgc agacacctgg 1081 acgaggttcc caggcttctc ttggctttct ggaccccaag ctgaagccga atttccttgg 1141 gataccagag gaccagtacc tgggatctgt gctccctgcc ctcagcccca catccttccc 1201 ttcccccttg cctaataccc ttgagactcc tgtgacagcc ttcctgaaga ctcaggagtt 1261 ccttcaggcc agaaccccca ccttggccag cactcccatc ccacctatat ctgaggctcc 1321 ctgtcctcca aatgcagagg tgagagccca ggaggtccct ctttctctgc tccagacaca 1381 ggctccagag cccctttggc tgaaggccac agtgcctagt tcttctgcta tcctcccaga 1441 actagaggaa cctgggggca agcagcaggg tcacttccct gatgatctga cctccttagc 1501 cacaaacctc aaccctcacc accctacttt agaggctaaa gatggagaat ccagtgagtc 1561 tagagtttct agcatattcc aggaagatga ggggcaaatc tgggaactgg tagagaaaga 1621 agcagatata gaggtaaaag tagaaaacag ctcagcccag aaaacacaag aaagtggtct 1681 ggacacagaa gaaacccagg attcccaggg acctttgcag aaggaaacac tgaaggctct 1741 aggagaggag ccactgatgt ctctgaaaat ccagaactat gagacagcag ggaaagagaa 1801 ttgcaattct tctacagaag gccacctggg aacactagaa ggcccagaaa aagaaaagca 1861 aataccacta aagtctttag aagaaaagaa tgtagagtca gagaaaactc tagaaaatgg 1921 ggttcctgta ctatctgagc ttttaggaaa agaagacaca agaacagagg atcaagaatt 1981 aatgtctcct aaaggtacac taaagagatt ttcatctcta ggaaaggaaa gtcaagaagt 2041 agtgaggcct tcaaaagagg ggaacctaga atcatggaca gcttttaaag aggagagcca 2101 acacccactg ggatttccag gagctgagga ccagatgctt gagagactgg tagagaaaga 2161 ggatcagagc ttcccaaggt ctccagagga agaggaccag gaggcatgta gacctctgca 2221 gaaagagaat caggaaccac tagggtatga agaagcagag ggccagatac ttgagagact 2281 gatagaaaaa gagagtcagg agtccctgag gtctccagaa gaagaggacc aggaggcagg 2341 tagatctctg cagaaagaga atcaggagcc actagggtat gaagaagcag aggaccagat 2401 gcttgagaga ctgatagaaa aagagagtca ggagtccctg aagtctccag aagaaaacca 2461 gaggattggg aagcctctag aaagagagaa tcagaaatct ctgaggtatc ttgaagaaaa 2521 ccaggagact tttgtaccac tagaaagcag gaaccagagg ccactgagat ctctagaagt 2581 agaagaggag gagcagagaa ttgtgaaacc tctagaaaaa gtgagtcagg attccctcgg 2641 atctctagca gaagagaatg tgcagccact gaggtatctg gaagaagatg actgcataaa 2701 taagagcctt ctagaagaca agactcacaa gtccttgggg tctcttgaag atagaaatgg 2761 ggatagcatt attataccac aagaaagtga gacccaggtt tcattgaggc ctccagaaga 2821 ggaggaccag aggattgtga accatctaga aaaagaaagt caggagttct cgaggtcttc 2881 agaagaagaa gagcaggtga tggagagatc tctagaagga gagaaccatg aatcactgag 2941 ttctgtagaa aaagaggacc agatggttga gagccaacta gagaaagaga gtcaggactc 3001 agggaagtct cttgaagatg agagccagga gacctttgga cctctggaaa aagagaatgc 3061 agagtccctg agatctctag caggacagga ccaagaggaa cagaagcttg aacaagagac 3121 ccaacaaaca ctgagggctg tagggaatga gcagatggca gtgagcccac cagaaaaggt 3181 ggatccagag ttaccgaagc ctcttggaaa tgaccaggaa atagctagat ctcttggaaa 3241 agagaatcaa gagtcactag tgtcactgaa agaaaaaggt atagagacag tgaagtcttt 3301 agaaacagag atcatagaac cactggagac tgcagaagag gacctggaaa gaaggaagtc 3361 tatagatact caggagccat tgtggtctac tgaagtggct agagagacag tagaacctcc 3421 agaagatgag cccccaggat cgctagggtc tgtggatgag aaccgagaga cactgacatc 3481 ccttgaaaag gagagtcaag aactgagctc tctgggcaag tggaacgtag agaccagggt 3541 agaggacagt cagcagtgcc tgcaagtaga agagggtctg caggaggaac agcaccaaga 3601 gtctctgaga gaggtgaagc aggagctgcc tagctctgga aatcaacagc ggtgggagga 3661 tgtggtggag ggcaaagcag tgggtcagga agcacctctg gcaaccacag gagtgggaac 3721 tgaggataag gcagagttgc atctgagggg gcaaggtgga gaggaagaag ctgcagcaga 3781 gggagagctg ttgcaggata ttgtggggga ggcctggagt ctggggagct ctgagcccaa 3841 ggagcagagg gtccctgctg aggccctcga caacctggaa ggaggggcct tagaggtccc 3901 agttgctcag tcaatgccag aggtgacaga gcgagatgag gatagagccc aagcaggtga 3961 acaagactcc atagaggtga cccttgggtt agaggctgcc agaactggac tggaactcga 4021 gcaggaagtg gtagggctag aggacccaag gcattttgcc agggaggagg ccattccccc 4081 atccctgggg gaggaaagtg tgaaggcaaa gatagctcag ggcttggaag ggcctggaaa 4141 ggaaccaaaa gaggcaggtg ctctggactc ggggatcctt gaattgccca agactagcag 4201 cgaggctctg gaatgccagg gccatgaaga gtctgagtcc atggagggct gggaagaaga 4261 ggaggcctca ctggagactt cagatcatga gggcagtgat gcccctcagc ccaggccccc 4321 agaaacagaa gaagatgagg gtgcacaggc agcactgaca gcccctggtc ccaagctctt 4381 ggaaccctgt tcacccatcc caatcctgac agatgcccat gagctgcagc cccaggctga 4441 ggggatccag gaggctggct ggcagccaga agctgggtct gaagcactag aaagggtaga 4501 aaatgagcca gagtttggtc ttggggagat cccggagggc ctccaggatt gggaagaggg 4561 cagagaagaa agcgaggcag atgatctagg ggaaactctc cctgactcta ctcccctggg 4621 cctctacctg aggtcccctg cttctccaaa gtgggatctg gctggagaac agaggctttc 4681 ccctcaaggg gatgccggga aggaagactg gggtcctgct gtccccgctg cccagggcct 4741 cagtggtcca ccggaagagg aggaggagca aggccatggc tctgacctat catctgagga 4801 gtttgaggac ctagggactg aggcctctct tcttccaggg gttcccaagg aggtggcaga 4861 tcacgtgggc caagtgcccc cggtactgca gcctgcatgc tgggatcagg gtggggaatc 4921 tgatgggttt gctgatgagg aagaaagtgg ggaggaggga gaggaagaag atgctgatga 4981 ggaaggagca gagtcaggag ctcagtggtg ggggtcaggg gcctctggtg gaggctgcaa 5041 ggtccaggat attgcccaaa gaggagaccc ggtacaggag tctgtgggtg tcagtggtct 5101 ctgggatgat ggcttgagag gtgctgcagc taatgttcct gccctagaga tggtatctca 5161 ggacagtgct gagccttctg ggtcagagga gtctgagtct gcttccttgg agggggagga 5221 aggtcaagtg actgaccatt tagatgctcc ccaggaggtg accagcatgg tcccgggggt 5281 aggagatgcc tttgacattg gtggccagag ccccaacttg gactcagaac aagtgaatgg 5341 gaaaatggag aatggactag aacaggctga ggggcaggtg gtcctggatg gggacgagga 5401 tcaagaactc ctattacagg gacaggaggt gggtgctcta aaggttcctt tggtagcatc 5461 tcctgtgcat ctaggcccaa gccagcccct gaagttcact ctgagtgggg tagatgggga 5521 ttcctggtcc tcaggggaag actagaaact gcccctctgg ctctgaggat gtactggtgg 5581 ggatgtccct ccctgctctg ggtgaccact cttagctttg ataacttgac ccatggtatt 5641 tgtcctggag agttgtggct gggctgagca agggaggtga gatcctcctg aaggctcagg 5701 agttccaggc ctatagttct accccctctt tcttctgtgg ctcacctgct ggaagaggcc 5761 tgggcccaga gctttcccac aaggctgttc tggccacagc ttgctagcct tgcctaccac 5821 ctgcacaagg tctggtctgg tgtatgacca ggggagctga gggcagcatt tatctgaccc 5881 ttcatctcag cctgctgaga gcttgttcct ctcttcctcc ctgaataaag ccgtatccct 5941 acctac // LOCUS CLLRRE 1860 bp ss-rRNA RNA 17-JUL-1990 DEFINITION C.sapidus 18S rRNA, 3' end. ACCESSION M34360 KEYWORDS 18S ribosomal RNA; ribosomal RNA. SOURCE C.sapidus rRNA. ORGANISM Callinectes sapidus Eukaryota; Animalia; Metazoa; Arthropoda; Crustacea; Malacostraca; Eucarida; Decapoda; Pleocyemata; Brachyura; Brachyrhyncha; Portunoidea; Portunidae. REFERENCE 1 (bases 1 to 1860) AUTHORS Kim,W. and Abele,L.G. TITLE Molecular phylogeny of selected decapod crustraceans based on 18S rRNA nucleotide sequences JOURNAL J. Crust. Biol. 10, 1-113 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by L.G.Abele, 19-MAY-1990. FEATURES from to/span description rRNA 1 1860 18S ribosoma RNA BASE COUNT 359 a 375 c 421 g 367 t 338 others ORIGIN 1 nncctggtng atcctgccag nagtcntnng cttgtctcaa annttaagcc nngcatgtct 61 nagtacaagc cgaatnaagg cgaaaccgcg aatggctnnn taaatcagct atgattcatt 121 nnatctgtac ccncncnnac ttggataact gtggtaattc tanagctaat acatgcatta 181 cgtctctgac cgcaagggaa gagngctttt attagttcaa aaccggtcgg gcctcggtcc 241 gnnnccccac tgtgttgaat ctgaataact ttttgctgag cgcacggtct cngcncgcgc 301 ngcctctttc aagtgtctgc cttatcagct ttcgattgta ggttatacgc ctacnatggc 361 tntnacgggt nacggggaat gagggttcga ttccggagag ngagcctgag aaacggctac 421 cacntctnag gnnggcagca ggcacgcnna ttacccactc cggcncgggg aggtagtgac 481 naaaaataac gatgcgagac tcatccgngg cctcgnnatc ggaatgagtn cactttaaat 541 cctttnacga ggatctattg gagggcnagt ctggtgccng cagccncggt nattccagct 601 gcaatanngt atattaaagt tgttgcggtt annaaagctc gtagttnnat ttcagttctg 661 gactgacggt tnccgcnngg tgcacactgt cacnctccga acagccacaa caccgctggc 721 cnnnggggtg ctcttcnccn ggtgtccnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 781 nnnnnnnnnn nnnnnnnnnn nnnnnnnncc tgaatgccta tgcantggaa taatggaata 841 ggacctcggn nctnttttgt cggttttctg aacccgaggt aatgactaat aggaacnggc 901 gggggcnttc gtattgcgac gctagaggtg aaattcttgg accgtcgcna gacgaactnc 961 tgcgaaagca tttgccnagg atgtttcntt natcnagaan gaaagttaga ggttcgaagg 1021 cgatcagata ccgcnnnnnn nnnaaccnta aacgatgctg accagcgatc cgccggnntt 1081 attnncatga cccggccncc agcttccggg aaaccaaagt ctttgggttc cgggggaagt 1141 atggttgcaa agctgaaact caaaggaatt gacggnnnnn nnnnnnnnnn nnnnnnnnnn 1201 nnnnnnnnnn nnnnnnnnac acggggaacc tcaccaggcc cagacaccgg aagganngac 1261 agattgagag ctcnntctca ttnggtgggt ggtngtgcat nncgngttct tagttggtgg 1321 agcgnnnnnn nnnnnnnnnn ncgatnacga acgagannnn nnnnnnnnnn nnnnnnnnnn 1381 nnnnnnnnnn nnnnnngtgt ccagttcgca gcttcttctt agagggataa cggcaattct 1441 agccgcacga gattgagcaa taacaagtct gtgatgccct tagatgttct gggcgcacgc 1501 gcgctacact gaagggatca acgtgtcctc ccnctccgag aggagcgggn nncccgttga 1561 aatccnttca tgatagggat tggggtttgc aattgtctcc catgaannng gaattcccag 1621 taagcgcaag tcatgagctt gcgntgattn ngtccctncc nnttgtacac accnnnnntc 1681 gctactaccg attgaatgat ttagtgaggc ttcggactgg cgctcttgga tgccggnccc 1741 gagnggttcn ncgccggnnc ncggcgcctc gagctgacgg aaagatgtcc aaacttgatn 1801 nnnnnnnnnn nnnnnaagtc gtaacaaggt nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn // LOCUS HUMNCSRC 138 bp ds-DNA PRI 17-JUL-1990 DEFINITION Human membrane-associated tyrosine protein kinase (C-SRC) gene, exons 3, 4, NI, and NII, partial cds. ACCESSION M34469 KEYWORDS membrane-associated tryosine protein kinase. SOURCE Human adult brain DNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 138) AUTHORS Pyper,J.M. and Bolen,J.B. TITLE Identification of a novel neuronal C-SRC exon expressed in human brain JOURNAL Mol. Cell. Biol. 10, 2035-2040 (1990) STANDARD simple staff_review FEATURES from to/span description pept < 1 29 membrane-associated tyrosine protein kinase (C-SRC), exon 3 (AA at 1) 30 47 membrane-associated tyrosine protein kinase, exon NI 48 80 membrane-associated tyrosine protein kinase, exon NII 81 > 136 membrane-associated tyrosine protein kinase, exon 4 variant 117 117 a or g variant 135 135 c or t BASE COUNT 36 a 40 c 40 g 22 t ORIGIN 1 cggctccagc tccagattgt caacaacacg aggaaggtgg atgtcagcca gacctggttc 61 acattcagat ggctgcaaag agagggagac tggtggctgg cccactcgct cagcacagga 121 cagacaggct acatcccc // LOCUS MHVAPEPA 1000 bp ss-RNA VRL 17-JUL-1990 DEFINITION Murine coronavirus peplomer (S) protein gene. ACCESSION M34435 KEYWORDS peplomer protein. SOURCE Murine (strain JHM-DL) RNA. ORGANISM Murine coronavirus Viridae; ss-RNA enveloped viruses; Positive strand RNA virus; Coronaviridae. REFERENCE 1 (bases 1 to 1000) AUTHORS Banner,L.R., Keck,J.G. and Lai,M.M.C. TITLE A clustering of RNA recombination sites adjacent to a hypervariable region of the peplomer gene of Murine coronavirus JOURNAL Virology 175, 548-555 (1990) STANDARD simple staff_review BASE COUNT 258 a 192 c 227 g 317 t 6 others ORIGIN 1 tgagtctttg tcgtgtaata atattgatgc gtccaaagtg tatggtatgt gctttggtag 61 tgtctcagtt gataagtttg ctttcccccg aagccgtcaa attgattttc aaattggcaa 121 ctccggattt ttgcaaacgg ctaattataa gattgatatc gctgccacat catgtcagct 181 gtattacagt cttcctaaga ataatgttac cattaataac tataacccct cgtcttggaa 241 taggaggtat ggttttaatg atgctggtgt gtttggcaaa agtaaacatg atgttgccta 301 cgcccagnna tgttttnttg tgcgacctag ctattgtccg tgtgcacaac cggaaatagt 361 tagtgcttgc actagtcaga ccaaacccat gtctgcttat tgccccacag gcacaattca 421 tcgtgagtgt tctctttgga atgggcccca tttgcgctcg gcacgtgtag gttccggcac 481 gtacacgtgt gagtgcactt gtaaacccaa tccatttgat acgtatgatc tccgctgtgg 541 gcaaattaaa actattgtta atgtgggcga tcattgtgaa ggtctgggtg ttttagaaga 601 taaaggtggc aatagcgatc cacataaggg ctgttcttgt gccaatgatt cttttatcgg 661 atggtcacat gacacttgtt tagtaaatga tcgctgccca atttttgcta acatattgtt 721 aaatggcatt aatagtggga ctacgtgttc cacagattta caattgccta atactgaagt 781 ggccactggc gtttgcgtca gatatgacct ctatggtatt actggtcnag gtgtttttaa 841 agaggtcaag gcagnntatt ataatagctg gcaggcccta ttatatgatg ttaatggtaa 901 cttaaacggg ttccgtgacc ttaccactaa caagacttat acgataagga gctgttatag 961 tggccgtgtt tctgctgcat atcataaaga agcacccgaa // LOCUS MHVAPEPB 843 bp ss-RNA VRL 17-JUL-1990 DEFINITION Murine coronavirus peplomer (S) protein gene. ACCESSION M34436 KEYWORDS peplomer protein. SOURCE Murine (strain A59) RNA. ORGANISM Murine coronavirus Viridae; ss-RNA enveloped viruses; Positive strand RNA virus; Coronaviridae. REFERENCE 1 (bases 1 to 843) AUTHORS Banner,L.R., Keck,J.G. and Lai,M.M.C. TITLE A clustering of RNA recombination sites adjacent to a hypervariable region of the peplomer gene of Murine coronavirus JOURNAL Virology 175, 548-555 (1990) STANDARD simple staff_review FEATURES from to/span description site 400 401 deletion site BASE COUNT 227 a 153 c 183 g 280 t ORIGIN 1 tgagagtttg ttttgtaata atatcgatgc ttccaaagtg tatggcaggt gctttggtag 61 tatttcagtt gataagtttg ctgtaccccg aagtaggcaa gttgatttac agcttggtaa 121 ctctggattt ctgcagactg ctaattataa gattgataca gctgccactt cgtgtcagct 181 gcattacacc ttgcctaaga ataatgtcac cataaacaac cataacccct cgtcttggaa 241 taggaggtat ggctttaatg atgctggcgt ctttggcaaa aaccaacatg acgttgttta 301 cgctcagcaa tgttttactg taagatctag ttattgcccg tgtgctcaac cggacatagt 361 tagcccttgc actactcaga ctaagcctaa gtctgctttt ttaatgtggg tgaccattgt 421 gaaggcttag gtgttttaga agataattgt ggcaatgctg atccacataa gggttgtatc 481 tgtgccaaca attcatttat tggatggtca catgatacct gccttgttaa tgatcgctgc 541 caaatttttg ctaatatatt gttaaatggc attaatagtg gtaccacatg ttccacagat 601 ttgcagttgc ctaatactga agtggttact ggcatttgtg tcaaatatga cctctacggt 661 attactggac aaggtgtttt taaagaggtt aaggcagact attataatag ctggcaaacc 721 cttctgtatg atgttaatgg taatttgaat ggttttcgtg atcttaccac taacaagact 781 tatacgataa ggagctgtta tagtggccgt gtttctgctg catttcataa agatgcaccc 841 gaa // LOCUS MHVSP 3780 bp ss-mRNA VRL 17-JUL-1990 DEFINITION Mouse hepatitis virus surface protein S mRNA, complete cds. ACCESSION X04797 M34437 KEYWORDS glycoprotein; membrane glycoprotein; surface glycoprotein; surface projection glycoprotein. SOURCE Murine hepatitis virus (strain JHM). cDNA to viral RNA, clones pJMS1010, pJS112, and pJS92. ORGANISM Murine hepatitis virus A59 Unclassified. REFERENCE 1 (bases 1 to 3780) AUTHORS Schmidt,I., Skinner,M. and Siddell,S. TITLE Nucleotide Sequence of the Gene Encoding the Surface Projection Glycoprotein of Coronavirus MHV-JHM JOURNAL J. Gen. Virol. 68, 47-56 (1987) STANDARD simple automatic COMMENT EMBL features not translated to GenBank features: key from to description SITE 24 30 put. region of homology preceeding mRNA 5' initiation codons SITE 121 129 pot. N-glycosylation site SITE 208 216 pot. N-glycosylation site SITE 430 438 pot. N-glycosylation site SITE 604 612 pot. N-glycosylation site SITE 1099 1107 pot. N-glycosylation site SITE 1333 1341 pot. N-glycosylation site SITE 1636 1644 pot. N-glycosylation site SITE 1732 1740 pot. N-glycosylation site SITE 1756 1764 pot. N-glycosylation site SITE 1825 1833 pot. N-glycosylation site SITE 1900 1914 put. proteolytic cleavage site SITE 1972 1980 pot. N-glycosylation site SITE 2023 2031 pot. N-glycosylation site SITE 2293 2301 pot. N-glycosylation site SITE 2440 2448 pot. N-glycosylation site SITE 3139 3147 pot. N-glycosylation site SITE 3301 3309 pot. N-glycosylation site SITE 3331 3339 pot. N-glycosylation site SITE 3388 3396 pot. N-glycosylation site SITE 3436 3444 pot. N-glycosylation site SITE 3499 3507 pot. N-glycosylation site SITE 3553 3654 pot. transmembrane domain SITE 3622 3675 cysteine-rich region SITE 3655 3660 charge cluster SITE 3715 3723 pot. N-glycosylation site FEATURES from to/span description pept 31 3738 surface protein S precursor sigp 31 60 surface protein S signal peptide matp 61 3736 surface protein S BASE COUNT 1030 a 718 c 800 g 1232 t ORIGIN 1 cttgtagttt aaatctaatc taatctaaac atgctgttcg tctttatttt actattaccc 61 tcttgtttag ggtatattgg tgattttaga tgtatccaga ccgtgaatta taacggcaat 121 aatgcttctg cgcctagcat tagcaccgaa gcagtcgatg tttccaaagg tcggggcact 181 tactatgttt tagatcgtgt ttacttaaat gccacgttat tgcttactgg ttattatcct 241 gtggacggtt ccaattatcg gaatctcgcg cttacaggca ctaatacctt aagccttacg 301 tggtttaaac caccctttct aagtgagttt aatgatggta tatttgctaa ggtccagaac 361 ctcaagacaa atacgccaac aggtgcaacc tcatattttc ccactatagt tataggtagt 421 ttgtttggta acacttccta taccgtagtt ttagagccat ataataatat tataatggct 481 tctgtttgta catataccat ttgtcaatta ccttacacac cctgtaagcc taataccaat 541 ggtaatcgtg ttattggatt ttggcacaca gatgtcaaac cgccgatttg tcttttaaag 601 cgtaatttta cgtttaatgt taatgcccct tggctttatt tccattttta tcagcagggt 661 ggtacttttt atgcgtacta tgcggataaa ccttccgcta ctacgttttt gtttagtgtg 721 tatattggcg acattttaac acagtatttt gtgttacctt ttatttgtac tccaacagct 781 ggtagcactt tagctccgct ctattgggtt acacctttac ttaagcgcca atatttgttt 841 aattttaatg aaaagggtgt cattactagt gctgttgatt gcgccagcag ctacattagt 901 gaaataaaat gtaagaccca aagtctctta ccgagtactg gtgtctatga tctatccggt 961 tacacggtcc aacctgttgg agttgtgtac cggcgtgttc ctaacctacc tgattgtaaa 1021 atagaggaat ggctcactgc taaatctgtg ccgtcacctc tcaattggga gcgtaggact 1081 ttccaaaatt gtaattttaa tttaagcagc ctgctacgtt atgtccaggc tgagtctttg 1141 tcgtgtaata atattgatgc gtccaaagtg tatggtatgt gctttggtag tgtctcagtt 1201 gataagtttg ctatcccccg aagccgtcaa attgatttac aaattggcaa ctccggattt 1261 ttgcaaacgg ctaattataa gattgatacc gctgccacat catgtcagct gtattacagt 1321 cttcctaaga ataatgttac cataaataac tataacccct cgtcttggaa taggaggtat 1381 ggttttaaag taaatgatcg ctgccaaatt tttgctaaca tattgttaaa tggcattaat 1441 agtgggacta cgtgttccac agatttacaa ttgcctaata ctgaagtggc cactggcgtt 1501 tgcgtcagat atgacctcta tggtattact ggtcaaggtg tttttaaaga ggtcaaggct 1561 gactattata atagctggca ggccctatta tatgatgtta atggtaactt aaacgggttc 1621 cgtgacctta ccactaacaa gacttatacg ataaggagct gttatagtgg ccgtgtttct 1681 gctgcatatc ataaagaagc acccgaaccg gctctgctct atcgtaatat aaattgtagt 1741 tatgttttta ctaataatat ttcccgtgag gaaaaccccc ttaactattt tgatagttat 1801 ttgggttgtg ttgttaatgc tgataaccgc acggatgagg cgcttcctaa ttgcaatctc 1861 cgtatgggtg ctggactatg cgtagattat tcaaagtcac gcagagcccg ccgatcagtt 1921 tctactggct atcgattaac cacattcgag ccatacatgc cgatgttagt caatgatagc 1981 gttcaatccg taggtggatt atatgagatg caaataccaa ccaattttac tattggtcat 2041 catgaggaat tcatccagat aagggctccc aaggtgacta tagattgtgc tgcatttgtt 2101 tgtggtgata acgctgcatg cagacagcag ttggttgagt atggctcttt ttgtgataat 2161 gttaatgcca ttcttaatga ggttaataac ctcttggata atatgcaatt acaagttgct 2221 agtgcattaa tgcagggtgt tactataagt tcgaggctgc cagatggcat ctccggccct 2281 atagatgaca ttaatttcag tcctctactt ggatgcatag gttcaacatg tgctgaagac 2341 ggcaatggac ctagtgcgat acgggggcgt tcagctatag aggatttatt atttgacaag 2401 gtcaaactat ctgacgttgg ctttgtcgag gcttataaca attgcactgg tggtcaagaa 2461 gttcgcgacc tcctttgcgt acagtctttt aatggcatca aagtattacc tcccgtgttg 2521 tctgagagtc aaatctctgg ctacacagcg ggtgctactg cggcagctat gttcccacct 2581 tggactgcag ctgctggtgt gccattcagt ttaaatgttc aatataggat taatggttta 2641 ggtgtcacta tgaatgttct tagtgagaac caaaagatga ttgctagtgc ttttaacaac 2701 gcgctcggtg ctattcagga agggttcgat gcaaccaatt ctgctctagg taagatccag 2761 tccgttgtta atgcaaacgc tgaagcactt aataatttat taaaccaact ttctaatagg 2821 tttggtgcta ttagtgcttc tttacaagaa attctaacgc ggcttgacgc tgtagaagca 2881 aaggcccaga tagatcgtct tattaatggc aggttaactg cacttaatgc gtatatatcc 2941 aagcaactca gtgatagtac gcttattaaa tttagtgctg ctcaggccat cgaaaaggtc 3001 aatgagtgcg ttaagagcca aactacgcgc attaatttct gtggcaatgg taatcacata 3061 ttatcacttg tccagaatgc gccttatggc ttatgtttta ttcatttcag ctacgtgcca 3121 acatccttta aaacggcaaa tgtgagtcct ggactatgca tttctggtga tagaggattg 3181 gcacctaaag ctggatattt tgttcaagat aatggagagt ggaagttcac aggcagtaat 3241 tattactacc ctgaacccat tacagataaa aatagtgttg ccatgatcag ttgcgctgtg 3301 aattacacaa aagcgcctga agttttcttg aacaactcaa taccaaatct acccgacttt 3361 aaggaggagt tagataaatg gtttaagaat cagacgtcta ttgcgcctga tttatccctc 3421 gatttcgaga agttaaatgt tactttcctg gacctgactt atgagatgaa caggattcag 3481 gatgcaatta agaagttaaa tgagagctac atcaacctca aggaagttgg cacatatgaa 3541 atgtatgtga aatggccttg gtatgtttgg ttgctaattg gtttagctgg tgtagctgtt 3601 tgtgtgttat tattctttat atgttgctgc acaggttgcg gctcatgttg ttttagaaaa 3661 tgcggaagtt gttgtgatga gtatggagga caccaggaca gtattgtgat acataatatt 3721 tcagcccatg aggattgact atcacagcct ctcctggaaa gacagaaaat ctaaacaatt // LOCUS MUSIGLAZ 713 bp ss-mRNA ROD 17-JUL-1990 DEFINITION Mouse Ig active lambda-chain mRNA Vx-J2-C2-region, complete cds. ACCESSION M34598 M29013 J03562 KEYWORDS constant region; immunoglobulin; immunoglobulin light chain; joining exon; lambda-immunoglobulin; processed gene; variable region. SOURCE Mouse (strain Balb/c AnPt) liver hybridoma B6, cDNA to mRNA, clone Y31. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 713) AUTHORS Sanchez,P., Marche,P.N., Le Guern,C. and Cazenave,P.-A. TITLE Structure of a third murine immunoglobulin lambda light chain variable region that is expressed in laboratory mice JOURNAL Proc. Natl. Acad. Sci. U.S.A. 84, 9185-9188 (1987) STANDARD simple staff_entry REFERENCE 2 (bases 2 to 450) AUTHORS Sanchez,P., Marche,P.N., Rueff-Juy,D. and Cazenave,P.-A. TITLE Mouse V-lambda-x gene sequence generates no junctonal diversity and is conserved in mammalian species JOURNAL J. Immunol. 144, 2816-2820 (1990) STANDARD simple staff_review REFERENCE 3 (bases 266 to 429) AUTHORS Sanchez,P. and Cazenave,P.-A. TITLE A new variable region in mouse immunoglobulin lambda light chains JOURNAL J. Exp. Med. 166, 265-270 (1987) STANDARD simple staff_entry FEATURES from to/span description pept 12 > 713 Ig lambda chain precursor V-x,J-2,C-2 region sigp 12 68 Ig lambda chain signal peptide matp 69 > 713 Ig lambda chain recomb 379 380 V-region end/J2-region start recomb 414 415 J2-region end/C2-region start BASE COUNT 175 a 190 c 159 g 189 t ORIGIN 1 gtacctgcat tatggcctgg actcctctct tcttcttctt tgttcttcat tgctcaggtt 61 ctttctccca acttgtgctc actcagtcat cttcagcctc tttctccctg ggagcctcag 121 caaaactcac gtgcaccttg agtagtcagc acagtacgta caccattgaa tggtatcagc 181 aacagccact caagcctcct aagtatgtga tggagcttaa gaaagatgga agccacagca 241 caggtgatgg gattcctgat cgcttctctg gatccagctc tggtgctgat cgctacctta 301 gcatttccaa catccagcct gaagatgaag caatatacat ctgtggtgtg ggtgatacaa 361 ttaaggaaca atttgtgtat gttttcggcg gtggaaccaa ggtcactgtc ctaggtcagc 421 ccaagtccac tcccactctc accgtgtttc caccttcctc tgaggagctc aaggaaaaca 481 aagccacact ggtgtgtctg atttccaact tttccccgag tggtgtgaca gtggcctgga 541 aggcaaatgg tacacctatc acccagggtg tggacacttc aaatcccacc aaagagggca 601 acaagttcat ggccagcagc ttcctacatt tgacatcgga ccagtggaga tctcacaaca 661 gttttacctg tcaagttaca catgaagggg acactgtgga gaagagtctg tct // LOCUS MUSIGLVD 681 bp ds-DNA ROD 17-JUL-1990 DEFINITION Mouse Ig germline lambda-chain gene Vx-J2-C2-region, complete cds. ACCESSION M34597 KEYWORDS constant region; germline; immunoglobulin light chain; joining exon; lambda-immunoglobulin; variable region. SOURCE Mouse (strain Balb/c AnPt) liver DNA, clone 30X2. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 681) AUTHORS Sanchez,P., Marche,P.N., Rueff-Juy,D. and Cazenave,P.-A. TITLE Mouse V-lambda-x gene sequence generates no junctonal diversity and is conserved in mammalian species JOURNAL J. Immunol. 144, 2816-2820 (1990) STANDARD simple staff_review FEATURES from to/span description pept 59 104 Ig lambda-chain precursor Vx-J2-C2 region, exon 1 209 531 Ig lambda-chain precursor Vx-J2-C2 region, exon 2 sigp 59 104 Ig lambda-chain signal peptide 209 219 Ig lambda-chain signal peptide matp 220 528 Ig lambda-chain IVS 105 208 Ig lambda-chain Vx-J2-C2 region intron A recomb 530 531 Vx-region end/J2-region start recomb 565 566 J2-region end/C2-region start BASE COUNT 179 a 156 c 136 g 210 t ORIGIN Chromosome 16. 1 tgaaccatag agagaactac aacctgctgt ctcagcagag atcagtagta cctgcattat 61 ggcctggact cctctcttct tcttctttgt tcttcattgc tcaggtcagg agaaccattt 121 gtaccctgaa cctcagttca tctgagaggc agatacattc tatatctgtc tgtaaatgtc 181 aggaaataaa cagtttctct attttcaggt tctttctccc aacttgtgct cactcagtca 241 tcttcagcct ctttctccct gggagcctca gcaaaactca cgtgcacctt gagtagtcag 301 cacagtacgt acaccattga atggtatcag caacagccac tcaagcctcc taagtatgtg 361 atggagctta agaaagatgg aagccacagc acaggtgatg ggattcctga tcgcttctct 421 ggatccagct ctggtgctga tcgctacctt agcatttcca acatccagcc tgaagatgaa 481 gcaatataca tctgtggtgt gggtgataca attaaggaac aatttgtgta accacagtaa 541 cggagataaa ggaggaagca ggacagaaac tttttttttt ctcttcaaag gtcttttcta 601 ccagaatcat tggttttttt ttttcttttt tgcttattaa taaagtagat agtctagcaa 661 tcctcttgga cttcgtaggg c // LOCUS PAERRE 1877 bp ss-rRNA RNA 17-JUL-1990 DEFINITION P.kadiakensis 18S rRNA, 3' end. ACCESSION M34359 KEYWORDS 18S ribosomal RNA; ribosomal RNA. SOURCE P.kadiakensis rRNA. ORGANISM Palaemonetes kadiakensis Eukaryota; Animalia; Metazoa; Arthropoda; Crustacea; Malacostraca; Eucarida; Decapoda; Dendrobranchiata; Caridea; Palaemonoidea; Palaemonidae. REFERENCE 1 (bases 1 to 1877) AUTHORS Kim,W. and Abele,L.G. TITLE Molecular phylogeny of selected decapod crustraceans based on 18S rRNA nucleotide sequences JOURNAL J. Crust. Biol. 10, 1-113 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by L.G.Abele, 19-MAY-1990. FEATURES from to/span description rRNA 1 1877 18S ribosoma RNA BASE COUNT 339 a 321 c 399 g 313 t 505 others ORIGIN 1 nncctggtng atcctgccag nagtcntnng cttgtctcaa annttaagcc angcatgtgt 61 cagtacaggc cgctctaagg cgaaaccgcg aatggctnnn taaatcagtt atcattcatt 121 tnatctaaaa cnnnnnnnnn nnnnggnnaa nnnnggnaan ncnanagcnn nanacgtgac 181 ttgtnaacnc cgacnggaag ggaggagngc ttntattagt tgaaaaccaa gcgggccncg 241 gtccgnnnnn nnnnctgtga tgactctgaa tnactttgtg cagagagcac ggnctnngca 301 ccggctccgt atctttcgag tttctgcctt atcatgctgt ggattgtagg ccatgcgcct 361 ncngtngctg ttncgggtga cggagaatca ggnntcgatt ccggagaggg agcctgagna 421 acggctacca catccaaggn nggcagcagg cacnnnnatt acccaatccc agctctggga 481 ggtagtgacn aaaaataaca atgcgggact cttccgagtc tgcgtaattg gaatgagcac 541 actttaaatc ctttagcaac naccnattgg agggcaagtc tggtgccagc agccgcggtn 601 attcnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 661 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 721 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 781 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnatgtncct tgcatggaac 841 tgatggaaga ctgatctcgg ttccacnttc ttggtggtgg gagccagagg taatgatcna 901 gagggnctgt cnnnnnnntc cgtactacga cgcgagaggt gaaattcagt gaccgtcgta 961 ggacgaacca cagcgaaagc atttgccnag aatgtcttcg ttgatcnaga angaaagtta 1021 gaggatcgaa ggcgatcaga tacnnnnnan gaaagaaccn taaacgatgc tgactngcaa 1081 ttcgcngnng ttnttcccat gacgtgcgag acgcccccgg gaaacctcaa gtctttgagt 1141 tccgggggaa gtatggttgc aaaactgaaa ctcaaaggaa ttgacggnnn nnnnnnnnnn 1201 nnnnnnnnnn nnnnnnnnnn nnnnnnnnna acacgggaaa cctcaccagg cccggacacc 1261 agaagganng acagatnnag agctctttct cgatttggtg ggtnnnnntg catggcngtt 1321 cttagttggt ggagtgannn nnnnnnnnnc gatnacgaac gagannnnnn nnnnnnnnnn 1381 nnnnnnnnnn nnnnnnnnnn nnnnccccng ttcgannnng tcttcttnga gggatgagcn 1441 gcgagtntag ctgcaggaga ttgagcaata acangtctgt gatgccctta gatgtcctgg 1501 gcgcacgcgc gctacactga atgggttagc gggttgtcct tctccgagag gagcgggnna 1561 tcgcgtgaaa accattcgtg atngggattg gggcttgcaa ttgtttcccn atgaangagg 1621 aattcccagt aagcgcaagt catcagcttg cgntgattnn gtccctnccc nttgtacaca 1681 cngnnnntcg ctactaccga ttgaatgatt agtgaggctt cggactggcg gtcctggact 1741 gggtcggcgg gtcncnccca gcnntgggnt tccgccnnct cgcctggacg ggccggaaag 1801 atgtccaaac ttgatnnnnn nnnnnnnnnn naagtcgtaa caaggtnnnn nnnnnnnnnn 1861 nnnnnnnnnn nnnnnnn // LOCUS PBESVBRA 584 bp ds-DNA INV 17-JUL-1990 DEFINITION P.berghei telomeric repeat region subfragment alpha DNA. ACCESSION M34601 KEYWORDS . SOURCE P.berghei DNA, clone pTel.1. ORGANISM Plasmodium berghei Eukaryota; Animalia; Protozoa; Microspora; Microsporea; Microsporida; Haemosporina; Plasmodiidae. REFERENCE 1 (bases 1 to 584) AUTHORS Dore,E., Pace,T., Ponzi,M., Picci,L. and Frontali,C. TITLE Organization of subtelomeric repeats in Plasmodium berghei JOURNAL Mol. Cell. Biol. 10, 2423-2427 (1990) STANDARD simple staff_review FEATURES from to/span description rpt 461 541 27 bp repeats BASE COUNT 205 a 41 c 85 g 251 t 2 others ORIGIN 1 tcgacaanta caacattatc tataaaagat gttttataca tctaacattt ttagtaatac 61 ataaaaaata cactatatat atgtgtataa taaattcata aattataaat atatataatc 121 atcacttttt taatttcaat aatttacatt tatgttaaaa ttataattta tattgatata 181 aatagttctc tatatattaa tttatttact ataaaggtat aataatatat taatcactat 241 taatttataa atttgatagt tttgaggtat aaataaatta tattttaaat agttaaatat 301 aatatataat aaatgtaatg tcatattttc tataatactt ataaacaatt cgtatataaa 361 attagcgtta ttgtactaat atatataata ttgtatcaat gactaaaact gaaatatgtt 421 aatttggttt agggtttatg gttcaggttt aggtttntgg tttagggttc aggtttatgg 481 ttcagggttt agggttcagg tttatggttc agggtttagg gttcaggttt atggttcagg 541 gtttagggtt tgtggtttag ggtttatggt ctatggttgt tcga // LOCUS PBESVBRB 593 bp ds-DNA INV 17-JUL-1990 DEFINITION P.berghei telomeric repeat region subfragment a DNA. ACCESSION M34602 KEYWORDS . SOURCE P.berghei DNA, clone pTel.1. ORGANISM Plasmodium berghei Eukaryota; Animalia; Protozoa; Microspora; Microsporea; Microsporida; Haemosporina; Plasmodiidae. REFERENCE 1 (bases 1 to 593) AUTHORS Dore,E., Pace,T., Ponzi,M., Picci,L. and Frontali,C. TITLE Organization of subtelomeric repeats in Plasmodium berghei JOURNAL Mol. Cell. Biol. 10, 2423-2427 (1990) STANDARD simple staff_review FEATURES from to/span description rpt 484 510 27 bp repeat motif BASE COUNT 209 a 40 c 97 g 247 t ORIGIN 1 tcgacaatac aacattatct ataaaagatg ttttatacat ctaacatttt tagtaataca 61 taaaaaatac actatatata tgtgtataat aaattcataa attataaata tatataatac 121 tcactttttt aatttcaata atttacattt atgttaaaat tataatttat attgatataa 181 atagttctct atatattaat ttatttacta taaaggtata ataatatatt aatcactatt 241 aatttataaa tttgatagtt ttgaggtata aataaattat attttaaata gttaaaatat 301 aaatatataa ataaaatgta atgtcatatt tttctataat acttataaac aattcggtat 361 ataaaattag cgttattgta ctaatatata taatattgta tcaatgacta aaactgaaat 421 atgttaattt gggtttaggg gtttatggtt cagggtttag ggtttgtggt ttagggtttg 481 tggtttaggg ttcaggttta tggttcaggg tttagggttc agggttcagg tttagggttt 541 agggtttagg gttcagggtt cagggttcag ggtttagggt ttagggttta ggg // LOCUS PEURRE 1902 bp ss-rRNA RNA 17-JUL-1990 DEFINITION P.aztecus 18S rRNA, 3' end. ACCESSION M34362 KEYWORDS 18S ribosomal RNA; ribosomal RNA. SOURCE P.aztecus rRNA. ORGANISM Penaeus aztecus Eukaryota; Animalia; Metazoa; Arthropoda; Crustacea; Malacostraca; Eucarida; Decapoda; Dendrobranchiata; Penaeoidea; Penaeidae. REFERENCE 1 (bases 1 to 1902) AUTHORS Kim,W. and Abele,L.G. TITLE Molecular phylogeny of selected decapod crustraceans based on 18S rRNA nucleotide sequences JOURNAL J. Crust. Biol. 10, 1-13 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by L.G.Abele, 19-MAY-1990. FEATURES from to/span description rRNA 1 1902 18S ribosoma RNA BASE COUNT 340 a 369 c 427 g 332 t 434 others ORIGIN 1 nncctggtng atcctgccag nngtcntnng cttgtctcaa agattaagcc nngcatgtgt 61 aagtacaggc cgacnnaagg cgaaaccgcg gacggcnnnn taaatcagat ataactcatt 121 nnatctctgc tgaacnncnt nnnnnnttgg ataactgtgg taattctaga nnnnnacatg 181 cctttgtann ctccgaccgc gagggaggag ngcttttatt agaccaaaac cctcggcagc 241 nnnntcccgc aagggncnag cagcacacat cttggtgaat cagaataact tttgccgagg 301 cacgacccct ccgtaacnng ggntgggncg gcgccgcgtc ctgcaggcgt ctgccttatc 361 agctctcgat tgtaggttaa acgcctacaa tggctatnnn gggtnacggg gaatnnnnnn 421 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnggcag 481 cangcgcnnn nattacccnc tcggcncggg gagnnagtga cnaaaaatac tgttngnnga 541 ccccgngncc tcgcnattgg aatgagtnca ctttaaatcc ttgtacgagg atcgagtgga 601 gggcaannnn nnnnnnagcn gccgcgnnna ttccagctcc actagcgtat attaaagttg 661 ttgcggttga aacgctcgta gtttgacttc tgctcggacg gcggncttnn cngctactgc 721 cgnnttccga gctgtgtccc cngccggcgc acatggggnt nnnntgcctt aannncgggn 781 gtcccctnnn nnnnnnnccg ttactttgaa aaaattagag ngcnnagagc aggcnngnnn 841 nnnnnnncag cccgaatggt cgtgcatgga atgatggaac aggacctcgg ntctattttg 901 tcggtttttc ggaacccgag gnnatgattn atagaagcag acgggggnnt tcgtactgcg 961 acgctagagg tgaaattctt agaccgtcgc atgacgacct nctgcgaaag catctgccna 1021 ggatgttttc attgatcaag aangaaagtt agaggttcga aggcgatcag atacngcncn 1081 ngttctaacc ttaaacgatg ctgactagcg atccgccgca gttattnnca tgacccggcg 1141 nnnagcttcc gggaaaccaa agtctttggg ttccggggga agtatggttg caaagctgaa 1201 actcaaagga attgacggnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 1261 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 1321 nnnnnnnnnn nnnnngtgca tgggtcgngt tcttagttgg tggagtgatc nnnnnnnnnn 1381 nnnnncgatn acgaacgaga nnnnnnnnnn nnnnnnnnnn nnnnnggcgc cggnaacngg 1441 cgntcntcgc ngtcttcttc ttagagggat aagcggcagc naaaaatata ctagccgcac 1501 gagagtttga gccataacan gtctgtgatg cccttagatg ttctgggcgc acgcgcgcta 1561 caatggagag ttcagcgagc tngncccnct ccgagaggag cgggnncctg cgtgaaagct 1621 gtccttaaag gggattgggg cttgcaaatg ttcccnatga nnnnggaatt cccagtagcg 1681 caattcncca gattgcgcgg atttagtccc tacccnttgt acacaccgcc nntcgctact 1741 accgattgaa tggtctagtg agggnnccgg actngcgccc ntggagccct accctcngcg 1801 ncngcgccct cgggtcgacg gaaaggtgtc caagctgggt nnnnnnnnnn nnnnnnaagt 1861 cgtaacaagg tnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nn // LOCUS POCRRE 1874 bp ss-rRNA RNA 17-JUL-1990 DEFINITION P.ascensionis 18S rRNA, 3' end. ACCESSION M34358 KEYWORDS 18S ribosomal RNA; ribosomal RNA. SOURCE P.ascensionis rRNA. ORGANISM Procaris ascensionis Eukaryota; Animalia; Metazoa; Arthropoda; Crustacea; Malacostraca; Eucarida; Decapoda; Dendrobranchiata; Caridea; Procaridoidea; Procarididae. REFERENCE 1 (bases 1 to 1874) AUTHORS Kim,W. and Abele,L.G. TITLE Molecular phylogeny of selected decapod crustraceans based on 18S rRNA nucleotide sequences JOURNAL J. Crust. Biol. 10, 1-13 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by L.G.Abele, 19-MAY-1990. FEATURES from to/span description rRNA 1 1874 18S ribosoma RNA BASE COUNT 298 a 291 c 331 g 266 t 688 others ORIGIN 1 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nngcatgtct 61 aagcacaggc cgaactaagg ctaagccgcg aatggcnnnn taaatcagtt atggttcatt 121 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnac 181 ccacgctccg accgcgaggg aggagngctt ttattagttg aaaaccaacc gggccncggt 241 ccgcnaaaga canctgtggt gaagctgaat aactttgtgc cgagcgcacn gncnnnncac 301 cggcgccgat tccttcgagt gtctcgctta tcaggcngtc gattgtaggt tatgtgccnn 361 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 421 nnnnnnnnnn nnnnnnnnnn nnnnnnnngg cagcaggcan nnnnattacc cactcccggc 481 ttggggaggt agtgacnaaa aataacgatg cgggactcat ccgaggccnc gcaattggaa 541 tgagtacact ttaantcctt taacgaggac ccannnnnnn nnnnnnnnnn nnnnnnnnnn 601 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 661 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 721 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnagtt 781 taccttgaac aaatcagagt gctcagagca ggctaattna atggcccgct atgtttcctg 841 catggaatga tggaagatga cctcggttcc attttgtttg ttttcggaac ccgaggnnat 901 gatgaataga gacggacggg ggcatccgnn ctgcgacgtg agaggtgaaa ttcttggaat 961 gtcgnnagac gaacgacagc gaaagcattt gccaagtatg tcttcgttaa tcaagaanga 1021 aagttagagg ttcgaaggcg atcagatacc gcccnngttc taaccataaa cgatgctgac 1081 cagcgatccg ccggcgttat tcccatgacg cggcggnnag ctactccggg aaaccaaagt 1141 cnntgagttc cgggggtann nnnnnnnnnn nnnnnaaact caaaggaatt gacggnnnnn 1201 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 1261 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnntgca 1321 tnnnngttct tagttggtgg agcgatttgt ctggttaatt ccgataacga angagactnt 1381 ggcctactaa ctagtcgacg ggtctccagc nnttggtgcc cagttcgcaa catcttctta 1441 gagggataag cggcaattct agccgcacga gattgagcaa taacaagtct gtgatgccct 1501 tagatgtcct gggcncacgc gcgctacact gaagggggca gcgggnntcc nctccgagag 1561 gagcgggnaa ccncttgaaa acctntcatg atagggactg gggcntgtaa ttgnttccca 1621 tgaacgagga anncccagta agcgcaagtg nnnnnnntgc gctgattnng tcccnnccnn 1681 ttgtacacac cnnnnntcgc tactaccgat tgaatgattt agtgaggctt cggactggcg 1741 ctcctngaac gaccccatcc ganngggnnc ccnggnnctc ctcgagtcga cgganngatg 1801 tccaaacttg annnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 1861 nnnnnnnnnn nnnn // LOCUS PRARRE 1869 bp ss-rRNA RNA 17-JUL-1990 DEFINITION P.leonensis 18S rRNA, 3' end. ACCESSION M34363 KEYWORDS 18S ribosomal RNA; ribosomal RNA. SOURCE P.leonensis rRNA. ORGANISM Procambarus leonensis Eukaryota; Animalia; Metazoa; Arthropoda; Crustacea; Malacostraca; Eucarida; Decapoda; Pleocyemata; Astacidea; Astacoidea; Cambaridae. REFERENCE 1 (bases 1 to 1869) AUTHORS Kim,W. and Abele,L.G. TITLE Molecular phylogeny of selected decapod crustraceans based on 18S rRNA nucleotide sequences JOURNAL J. Crust. Biol. 10, 1-13 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by L.G.Abele, 19-MAY-1990. FEATURES from to/span description rRNA 1 1869 18S ribosoma RNA BASE COUNT 409 a 412 c 459 g 419 t 170 others ORIGIN 1 nncctggttg atcctgccag nagtcatnng cttgtctcaa anattaagcc nngcatgtgt 61 aagtacaagc cgagttaagg cgaaaccgcg aatggcncnn taaatcagct atgtttcatt 121 ggatctgtaa acnnncnnnn acttggataa ctgtggtaat tctagagctn atacatgcat 181 cacgtctctg accgcaaggg aagagcgctt ttattagttc aaaactggtc gggcctcggt 241 ccgttnaccc acccgtggtg aatctgaata actttttgct gagcgcacgg nctccgcacc 301 ggcgccgcat ccttcaagtg tctgccttat cagctttcga ttgtaggtta tgcgcctaca 361 atggctataa cgggtaacgg ggaatcaggn ttcnattccg gagagggagc ctgagaaacg 421 gctaccacat ctaaggcagg cagcaggcac gcnnattacc cactcccggc acggggaggt 481 agtgacnaaa aataacgatg cgagactcat ccgaggcctc gcaatcggaa tgagtacact 541 ttaaancctt taacgaggat ctattggagg gcnagtctgg tgccagcagc cgcggtaatt 601 ccagctccaa tanngtatat taaagttgtt gcggttnnaa agctcgtagt tggatctcag 661 ttccggactg acggtacacg cnnggtgctt actgtcacgc tccgaacagc taactagccc 721 cgccggccag tggggtgctc ttcatcgagt gtcccgagtg gccggnncgt ttactttgnn 781 nnnattagag tgctcagagc nggcnncnnn natggcctga atgtctatgc actggaataa 841 tggaatagga cctcggttct attttgttgg ttttcggaac ctgaggtaat gactaatagg 901 aacaggcggg ggcattcgta ttgcgacgct agaggtgaaa ttcttggacc gtcgcnagac 961 gaactactgc gaaagcattt gccaaggatg ttttcattaa tcaagaanga aagttagagg 1021 ttcgaaggcg atcagatacc gcncnngttn naaccataaa cgatgccaac tagcgatccg 1081 ccggcgttat tcccatgacc cggcngncag cttccgggaa accaaagtct ttgggttccg 1141 ggggaagtat ggttgcaaag ctgaaactca aaggaattga cggnnnnnnn nnnnnnnnnn 1201 nnnnnnnnnn nnnnnnnnnn nnnnnaacac ggggaacctc accaggccca gacaccggaa 1261 ggatngacag attgagagct ctttctcgat tcggtgggtg gtngtgcatg gccgttctta 1321 gttggtggag cgatttgtct ggttaattcc gatnnnnnnn gagactctgg cctattaact 1381 agtcgacgga tctccagcnn ttggtgtcca gttcgcaact tcttcttaga gggattacgg 1441 caattctagc cgcacgagat tgagcaataa caggtctgtg atgcccttag atgttctggg 1501 cgcacgcgcg ctacactgaa gagatcaacg tgttctcccc ctccgagagg agcgggnaac 1561 ccgttcaatc cccttcatga tagggattgg ggcttgcaat tgtttcccat gaacgaggaa 1621 ttcccagtaa gtgcaagtca tcacgttgcg ctgattnngt ccctgcccnt tgtacacacn 1681 nnnnntcgct actaccgatt gaatgattta gtgaggcttc ggactggcgc tcttggatgt 1741 tctacccctc gcgtctcggc gcaaggnnnt ctcgcctcga gctgacggaa agatgtccaa 1801 acttgatnnn nnnnnnnnnn nnnaagtcgt aacaaggtnn nnnnnnnnnn nnnnnnnnnn 1861 nnnnnnnnn // LOCUS PVIC1RPTA 711 bp ds-DNA INV 17-JUL-1990 DEFINITION P.vivax circumsporozoite protein gene, partial cds. ACCESSION M28745 M25758 KEYWORDS circumsporozoite protein. SOURCE P.vivax sporozoite (isolate VK247) sporozoite DNA. ORGANISM Plasmodium vivax Eukaryota; Animalia; Protozoa; Microspora; Microsporea; Microsporida; Haemosporina; Plasmodiidae. REFERENCE 1 (bases 1 to 711) AUTHORS Rosenberg,R., Wirtz,R.A., Lanar,D.E., Sattabongkot,J., Hall,T., Waters,A.P. and Prasittisuk,C. TITLE Circumsporozoite protein heterogeneity in the human malaria parasite Plasmodium vivax JOURNAL Science 245, 973-976 (1989) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by D.E.Lanar, 25-JUN-1989. FEATURES from to/span description pept < 1 > 711 circumsporozoite protein (AA at 1) site 1 15 region 1 site 697 711 region 2 site 502 597 post repeat variable region site 598 696 post repeat constant region BASE COUNT 261 a 151 c 226 g 73 t ORIGIN 1 aagctgaaac aaccagaaga tggggcaggc aatcaaccag gagcaaatgg agcaggcaat 61 caaccaggag caaatggggc aggcaatcaa ccaggagcaa atggggcagg caatcaacca 121 ggagcaaatg gggctggcaa tcaaccagga gcaaatgggg ctggcaatca accaggagca 181 aatggggctg gcaatcaacc aggagcaaat ggggctggca atcaaccagg agcaaatgga 241 gcaggcaatc aaccaggagc aaatggggca ggcaatcaac caggagcaaa tggggctggc 301 aatcaaccag gagcaaatgg agcaggcaat caaccaggag caaatggggc tggcaatcaa 361 ccaggagcaa atggagcagg caatcaacca ggagcaaatg gggcgggcaa tcaaccagga 421 gcaaatgggg ccggcaatca accaggagca aatggggcag gcaatcaacc aggagcaaat 481 ggggctggca atcaaccagg agcaaatggg gcaggtaatc aaccaggagc aaatggtgca 541 ggtggacagg cagcaggagg aaatgctgca aacaaaaagg caggagacgc aggagcagga 601 cagggacaaa ataatgaagg tgcgaatgcc ccaaatgaaa agtctgtgaa agaataccta 661 gataaagtta gagctaccgt tggcaccgaa tggactccat gcagtgtaac c // LOCUS PVIC1RPTB 657 bp ds-DNA INV 17-JUL-1990 DEFINITION P.vivax circumsporozoite protein gene, partial cds. ACCESSION M28746 M25759 KEYWORDS circumsporozoite protein. SOURCE P.vivax sporozoite (isolate VK210) DNA. ORGANISM Plasmodium vivax Eukaryota; Animalia; Protozoa; Microspora; Microsporea; Microsporida; Haemosporina; Plasmodiidae. REFERENCE 1 (bases 1 to 657) AUTHORS Rosenberg,R., Wirtz,R.A., Lanar,D.E., Sattabongkot,J., Hall,T., Waters,A.P. and Prasittisuk,C. TITLE Circumsporozoite protein heterogeneity in the human malaria parasite Plasmodium vivax JOURNAL Science 245, 973-976 (1989) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by D.E.Lanar, 25-JUN-1989. FEATURES from to/span description pept < 1 > 657 circumsporozoite protein (AA at 1) site 1 15 region 1 site 643 657 region 2 rpt 16 477 27 bp repeats site 478 544 3' post repeat variable region site 545 642 3' post repeat constant region BASE COUNT 229 a 135 c 224 g 69 t ORIGIN 1 aagctgaaac aaccagcagg tgatagagca gatggacagc cagcaggtga tagagcagat 61 ggacagccag caggtgatag agcagatgga caaccagcag gtgatagagc agctggacaa 121 ccagcaggtg atagagcaga tggacagcca gcaggcgata gagcagctgg acaaccagca 181 ggtgatagag cagatggaca gccagcagga gatagagcag ctggacagcc agcaggcgat 241 agagcagatg gacagccagc aggtgataga gcagctggac aaccagcagg tgatagagca 301 gctggacaac cagcaggtga tagagcagat ggacagccag caggcgatag agcagctgga 361 caaccagcag gtgatagagc agatggacaa ccagcaggag atagagcagc tggacagcca 421 gcaggagata gagcagctgg acagccagca ggagatagag cagctggaca gccagcagga 481 aatggtgcag gtggacaggc cgcaggagga aacgcaggag gaaacgcagg aggaaacgca 541 ggaggacagg gacaaaataa tgaaggtgcg aatgccccaa atgaaaagtc tgtgaaagaa 601 tacctagata aagttagagc taccgttggc accgaatgga ctccatgcag tgtaacc // LOCUS SHV2A 554 bp ss-RNA VRL 17-JUL-1990 DEFINITION Simian hepatitis A virus segment 2A-encoded protein mRNA, partial cds. ACCESSION M34085 KEYWORDS . SOURCE Simian hepatitis A virus (strain PA21), cDNA to viral RNA. ORGANISM Simian hepatitis A virus Viridae; ss-RNA nonenveloped viruses; Isometric ss-RNA viruses; Picornaviridae. REFERENCE 1 (bases 1 to 554) AUTHORS Brown,E.A., Jansen,R.W. and Lemon,S.M. TITLE Characterization of a Simian hepatitis A virus (HAV): Antigenic and genetic comparison with human HAV JOURNAL Unpublished (1989) STANDARD simple staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by E.A.Brown, 04-MAY-1990. Author address: E.A.Brown 547 Burnett-Womack CB# 7030 Chapel Hill, NC 27599 FEATURES from to/span description pept < 1 > 554 segment 2A-encoded protein (AA at 1) BASE COUNT 184 a 71 c 138 g 161 t ORIGIN Segment 2A; map position 3108-3662. 1 agtcatattg aaaagtggaa accttataaa gagttaagat tggaggtagg taagcaaagg 61 ctaaagtatg ctcaggaaga gttgtcaaat gaagtgttgc ctcctcctcg taaaattaag 121 ggtgtgtttt cacaagcaaa aatctcattg ttttacacag aagatcatga aattatgaaa 181 ttttcctgga aaggaattac tgctgacact agagctttga ggagatttgg cttttcattg 241 gctgctggta ggagtgtgtg gacattggaa atggatgctg gagttttgac tggcaggctg 301 gtgagggtca atgatgaaaa atggacagaa atgaaagatg acaaaatagt ttctttggtg 361 gagaaattta ctagtaataa acactggtcc aaagttaatt ttcctcatgg aatgctagat 421 ttggaagaaa ttgctgcaaa tgcaaaagaa tttccaaata tgtcagaaac tgatttgtgt 481 ttcttgttgc attggctgaa ccccaaaaag ataaacttgg cagatagaat gttgggtctg 541 tcaggaatac agga // LOCUS SHVVP1CP 2373 bp ss-RNA VRL 17-JUL-1990 DEFINITION Simian hepatitis A virus capsid protein VP1 mRNA, partial cds. ACCESSION M34084 KEYWORDS capsid protein VP1. SOURCE Simian hepatitis A virus (strain PA21), cDNA to viral RNA, passed in cwll line BS-C-1. ORGANISM Simian hepatitis A virus Viridae; ss-RNA nonenveloped viruses; Isometric ss-RNA viruses; Picornaviridae. REFERENCE 1 (bases 1 to 2373) AUTHORS Brown,E.A., Jansen,R.W. and Lemon,S.M. TITLE Characterization of a Simian hepatitis A virus (HAV): Antigenic and genetic comparison with human HAV JOURNAL J. Virol. 63, 4932-4937 (1989) STANDARD simple staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by E.A.Brown, 04-MAY-1990. FEATURES from to/span description pept < 1 > 2373 capsid protein VP1 (AA at 1) BASE COUNT 661 a 447 c 491 g 774 t ORIGIN Segment P1; map position 735-3107. 1 atgaatatgt ccaggcaagg tattttccag actgttggga gtggccttga ccacattctg 61 tctttggcag atgtggagga ggaacaaatg attcagtctg tggatcgtac cgcagttact 121 ggggcttcat atttcacttc tgtggatcaa tcttctgttc atacagctga agttggctca 181 caccaacctg aacctttgaa aacctctgtt gacaaaccag gctctaagag gacacaagga 241 gagaaatttt tccttgttca ttctgctgac tggttgacga cacatgcttt gtttcatgaa 301 gttgcaaaat tggatgtggt caaactgttg tacaatgagc aatttgctgt tcagggtctg 361 ttgaggtatc acacttatgc aagatttgga attgagatac aagttcagat caatcctaca 421 ccattccagc aaggtggttt gatatgtgcc atggtgccag gagatcagag ctatggatct 481 atagcttctt tgacagttta tcctcatggt ttgttgaatt gtaatatcaa caatgtggtc 541 agaattaagg ttccttttat ttatacaaga ggagcttatc actttaagga ccctcaatat 601 cccgtttggg agttgactat tagagtttgg tctgagctaa acattggaac tggtacctct 661 gcttacacat cactgaatgt gctggctaga tttactgatt tggaactcca tgggctaaca 721 cccctgtcta cacagatgat gagaaatgaa tttagagtca gtacaacaga aaatgtagtt 781 aatttgtcca attatgaaga tgctagagca aaaatgtctt ttgctcttga tcaggaagat 841 tggaaatctg atgcctctca agggggagga attaaaatta cacattttac aacctggaca 901 tcaattccta ctttggctgc tcagtttcca ttcaatgcct ctgattcagt tgggcaacag 961 atcaaggtta ttccagttga tccatatttc ttccaaatga ctaacacaaa tcctgaacaa 1021 aaatgtataa ctgcattggc ttcaatatgt caaatgttct gtttttggag aggagacttg 1081 gtttttgact tccaggtttt tcctacaaaa tatcactcag ggagattatt attttgtttt 1141 gttcctggaa atgaactgat tgatgtttcc cacataacat tgaaacaagc cactactgcc 1201 ccttgtgctg tgatggatat tactggagta cagtcaactt taagatttcg tgttccttgg 1261 atttcagata ctccttatag agttaataga tataccaaat cgtcacatca gaaaggagag 1321 tatactgcca taggaaagtt gattgtttat tgttacaaca gactgacttc tccctccaat 1381 gtggcttctc atgttagagt taatgtttat ctctcagcta ttaatttgga atgttttgct 1441 ccactctatc atgctatgga tgtcacaact caggttgggg atgattctgg aggcttctct 1501 accactgttt caacaaaaca gaatgttcca gaccctcaag ttggcattac aacagtgaag 1561 gatcttaaag gtagagcaaa ccaagggaaa atggatgttt cgggtatcca agctcctgta 1621 ggagctatca ctaccattga ggatccagtt ttggcaaaga aagtgcctga gaccttccca 1681 gaattgaagc ctggagagtc aagacatact tctgatcata tgtctattta caaatttatg 1741 ggcagatctc atttcttatg tacatttaca tttaattcta ataacaaaga gtacactttt 1801 cctatcactt tgtcatcaac ttctaatcct cctcatggat tgccttcaac tctgagatgg 1861 ttttttaacc tttttcagct ttataggggt cccttggatt tgacaataat tataactggg 1921 gctactgatg ttgatggaat ggcttggttt actcccgttg ggttagcagt agatacccca 1981 tgggttgaga aggagtctgc tctttctatt gattacaaga cagctcttgg tgctgttagg 2041 tttaatacta gaagaacagg aaacattcag attaggttgc cctggtactc ctatctttat 2101 gctgtctcag gggcactgga tgggcttgga gacaaaacag attcaacttt tggacttgtc 2161 tccattcaaa ttgcaaatta caatcactca gatgaatatt tgtcttttag ttgttacttg 2221 tctgtgactg aacagtctga gttttatttt cctagagcac ctttgaatac caatgctatg 2281 atgtcatcag aaacaatgat ggatagaatt gctcttggtg atcttgaatc ctcagttgat 2341 gatcctcgaa ctgaagagga tcgtaaattt gaa // LOCUS STNRRE 1885 bp ss-rRNA RNA 17-JUL-1990 DEFINITION S.hispidus 18S rRNA, 3' end. ACCESSION M34361 KEYWORDS 18S ribosomal RNA; ribosomal RNA. SOURCE S.hispidus rRNA. ORGANISM Stenopus hispidus Eukaryota; Animalia; Metazoa; Arthropoda; Crustacea; Malacostraca; Eucarida; Decapoda; Pleocyemata; Stenopodidea; Stenopodidae. REFERENCE 1 (bases 1 to 1885) AUTHORS Kim,W. and Abele,L.G. TITLE Molecular phylogeny of selected decapod crustraceans based on 18S rRNA nucleotide sequences JOURNAL J. Crust. Biol. 10, 1-13 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by L.G.Abele, 19-MAY-1990. FEATURES from to/span description rRNA 1 1885 18S ribosoma RNA BASE COUNT 327 a 336 c 430 g 368 t 424 others ORIGIN 1 nncctggtng atcctgccag nngtcntnng cttgtctcaa annttnagcc nngcatgtgt 61 gagtacaagc ccaaggaagg tgaaaccgcg aatggcnnnn taaatcagct atggtttact 121 ggacctgtac tncnntnnnn nnnnnnnnnn nnnnggtaat tctagagctn anncnngccn 181 cgagcncnga cgcgggagcg ggaagagcgc nnnannagta cnaaaaccng ngtctgtgta 241 tcggcttagg tcgttgcata gncnnnnnnn tgtggtgact ctgaataact tttggctgag 301 cgcatggtct ccgcacctgg cgccgcatct ttcaagtgtc tgccttatca gctgtcgatt 361 gtaggttatg cgcctnnnat ggcgatnnng ggtnacgggg aatcngggtt nnnttccgga 421 ganngngcct gagnnncggc tnccnnntnt nnnnnnnnnn nnnnggcngn aggcnnnnnn 481 attacccntt ccggcncggg gaggtagtga cnaaaaataa cgatgcgaga ctcatccgag 541 gcctcgcnat cggaatgaga acactttaaa tcctttntcg aggatcgatt ggagggcaag 601 tctngtgcca gcagccncgg tnattccagc tccaatagng tatattaaag ttgctgcggn 661 tnnaaagctc gtagttnnat ctcagttcgg acggccgncn tccnnngtgc nttttgcggc 721 ttgatccgaa cactnctgtt gtgggcgcgc agggggtgct cttgatcgag tgtgcnnnnn 781 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnccctg 841 aatgactttg catggaataa tggaatagga cctcggttct attttgctgg ttttgtctgg 901 aacccgaggt aatgactaat agaaacnggc gggggnnttc gtactgcgac gctagaggtg 961 aaattcttgg accgtcgcna gacgaactna tgcgaaagca tctgccnagg atgttttcnt 1021 tnatcnagaa ngaaagttag aggttcgaag gcgatcagat acnnnnnnng ttctaaccgt 1081 aaacgatgct naccagcnat ccgcccgcgt tnttcccatg accgggcnnn nngcttcggg 1141 gaaaccaaag tctttgagtt ccgggggaag tatggttgca aannngaaac tcaaaggaat 1201 tgacggnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 1261 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 1321 nntgcatggt nnnnnnnnnn nnnnggtgga gcgattgctg gttaattccg atnacgaacg 1381 agactcggac ctactaacta gtcgacggat cttcgtccga cggtgtccag ttcgtnaagt 1441 cttcttagag ggataacggc aagtgtagcc gcaggagatc gagcaataac angtctgtga 1501 tgcccttaga tgttctgggc gcacgcgcgc tacactgaag tgttcaacgt gttgtcccng 1561 tccgagagga tcgggnnncc cgctgaaagc ntttcttgat ngggatgggg gcttgcaatt 1621 gttcccnntg aannnggaat tcccagtaag cgcaagtcaa tagcttgcgn tgatnnngtc 1681 cctncnnntt gtncncnccn nnnntcgcta ctaccgattg aatgatttag tgaggcttcg 1741 gactggcgcc ctgggtctga tgcangttgg ccttagtgcc ttgtgtatcg cctagggncg 1801 acggaaagat gtccaaactt gatnnnnnnn nnnnnnnnna agtcgtaaca aggtnnnnnn 1861 nnnnnnnnnn nnnnnnnnnn nnnnn // LOCUS SIVGAGFA1 541 bp ss-RNA VRL 17-JUL-1990 DEFINITION Simian immunodeficiency virus gag protein (smmgagFA1) gene, partial cds. ACCESSION M33754 KEYWORDS gag protein. SOURCE Simian immunodeficiency virus (isolate smm) RNA, from Cercocebus atys peripheral blood mononuclear cells. ORGANISM Simian immunodeficiency virus Viridae; ss-RNA enveloped viruses; Positive strand RNA virus; Retroviridae; Lentivirinae. REFERENCE 1 (bases 1 to 541) AUTHORS Villinger,F., Powell,J.D., Jehuda-Cohen,T., Neckelmann,N., Vuchetich,M., De,B.K., Folks,T.M., McClure,H.M. and Ansari,A.A. TITLE Detection of occult SIVsmm infection in asymptomatic seronegative nonhuman primates and evidence for variation in SIV gag sequence between endogenous and in vitro propagated virus JOURNAL Unpublished (1990) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by F.J.Villinger, 18-APR-1990. Author address: F.J.Villinger Emory University, Department of Biology Winship, Cancer Center Atlanta GA, 30322 Ph: (404) 248-4555. FEATURES from to/span description pept < 1 > 541 gag protein (AA at 1) BASE COUNT 191 a 109 c 126 g 115 t ORIGIN 1 tttcaagcgc tatcagaagg ctgcactccc tatgacatca atcaaatgct aaattgtgta 61 ggagaacatc aggcagccat gcaaattatt agagagatta taaatgaaga agctgctgat 121 tgggatctac aacacccgca gccaggtcca ataccagcag tacaactcag agagccaaga 181 ggatcagata ttgcaggaac cactagtaca gtggacgagc aaattcagtg gatgtatagg 241 caacaaaatc ctgtaccagt gggaaatatt tacaggagat ggatccaact tggattacaa 301 aagtgtgtca ggatgtataa tccaacaaac attctggacg taaaacaggg gccgaaagag 361 ccattccaga gctatgtaga caggttctac aaaagcttaa gagcagaaca gacagatcca 421 gcagtgaaga attggatgac ccaaacactg ctgattcaaa atgctaaccc agattgtaaa 481 ttggtgctca agggtctggg tatgaatccc actttagaag aaatgctgac agcctgtcag 541 g // LOCUS SIVGAGFA2 335 bp ss-RNA VRL 17-JUL-1990 DEFINITION Simian immunodeficiency virus gag protein (smmgagFA2) gene, partial cds. ACCESSION M33755 KEYWORDS gag protein. SOURCE Simian immunodeficiency virus (isolate smm) RNA, from Cercocebus atys peripheral blood mononuclear cells. ORGANISM Simian immunodeficiency virus Viridae; ss-RNA enveloped viruses; Positive strand RNA virus; Retroviridae; Lentivirinae. REFERENCE 1 (bases 1 to 335) AUTHORS Villinger,F., Powell,J.D., Jehuda-Cohen,T., Neckelmann,N., Vuchetich,M., De,B.K., Folks,T.M., McClure,H.M. and Ansari,A.A. TITLE Detection of occult SIVsmm infection in asymptomatic seronegative nonhuman primates and evidence for variation in SIV gag sequence between endogenous and in vitro propagated virus JOURNAL Unpublished (1990) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by F.J.Villinger, 18-APR-1990. Author address: F.J.Villinger Emory University, Department of Biology Winship, Cancer Center Atlanta GA, 30322 FEATURES from to/span description pept < 1 > 335 gag protein (AA at 1) BASE COUNT 121 a 70 c 72 g 72 t ORIGIN 1 tttcaagcgc tatcagaagg ctgcactccc tatgacatca atcaaatgct aaattgtgta 61 ggagaacatc aggcagccat gcaaattatt agagagatta taaataaaga agctgctgat 121 tgggatttac aacacccgca gccaggtcca ataccagcag gacaactcag agagccaaga 181 ggatcagata ttgcaggaac cactagtaca gtggacgagc aaattcagtg gatgtatagg 241 caacaaaatc ctgtaccagt cggaaacatt tacaggagat ggatccaact tgcattacaa 301 aagtgtgtca ggatgtacaa tccaacaaac attct // LOCUS NEUAMTR 5928 bp ds-DNA PLN 17-JUL-1990 DEFINITION N.crassa mating type protein gene, complete cds. ACCESSION M33876 KEYWORDS mating type protein. SOURCE N.crassa (strain 74-ORS-A) DNA. ORGANISM Neurospora crassa Eukaryota; Plantae; Thallobionta; Eumycota; Ascomycotina; Pyrenomycetes; Sordariales; Sordariaceae. REFERENCE 1 (bases 1 to 5928) AUTHORS Grotelueschen,J., Metzenberg,R.L. and Glass,N.L. TITLE The Neurospora crassa A mating type region JOURNAL Proc. Natl. Acad. Sci. U.S.A. (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by N.L.Glass, 26-APR-1990. FEATURES from to/span description pept 4121 4283 mating type protein, exon 1 4343 5046 mating type protein, exon 2 IVS 4284 4342 mating type protein intron A signal 3858 3906 GC signal signal 3945 3949 CAAT box signal 5319 5326 poly-A signal BASE COUNT 1470 a 1459 c 1526 g 1473 t ORIGIN 1 cgagaccgtt gttgcttgct gtatccatgc cggtgtcaaa gtcttgtcgt cgtatgcagg 61 agtctgaaaa ccaacccgag aagggtgggg caagtgactc tgcagtgatg tcaagactga 121 ggtccagctg ctgattgaaa tggctgatag aacagacgga ccaagactga aactgatgct 181 gagctgcgct gatggaatgt gccaagagaa tgaagctgtc cactgccggc gagcacgcct 241 agtgtgctgt gatttgagga cgggactccc tactcgtagg attgacgaga gattgaacag 301 agagccatcg acttatttgt gatgtcttgg ttgttgatca actgttgccg gctctccaaa 361 tgcgaagtcg gcgagtacga gcgttagtcc gtgaatgtgg gacagcggta gtgaatgaga 421 catgatctgg atcaatgtag tcgcaagcgt gagtaaagaa tcaggacgcc tgcttgagaa 481 ggaatcgcat ggagtcgtcc tcagtcatca tgaagtccgg gtcctggcgc cacggacagg 541 tcggtcgaca tgtcgataat gtcgataatg tggttaggct cctcccactc gaagtcgggg 601 aaagcgccga cctcctcggc ttgttgaggt tgaacaacat cgttaagtgc ggcttcttcg 661 gcagccaact gccgttcgac tcttcccaga caaagtcgag catccggcaa aattcgttgc 721 cgatcttcgg gcgagattac gcaaggatat ctcatgcgag gaggaacggg atcggtggtg 781 ggaaatcgcg gctggtcggg gtaaaggtga ggattctcag ctcgatgacg atgaacctcc 841 atttccgcca gttccctgaa gcgagctttt gccgcgggtg ctcgctgttc cacaagcccg 901 caacaatttg agctgtatag atatggttaa taaatgtcat gcaggacagg cagtttgttc 961 acatcaatat tgcgagctga aagactcgga tcttcggaga acagagtatc caacagccat 1021 tggtagtaga gaacgaattg gttgcgcggg cggctagtac cattgctttg agttaccgaa 1081 ctctgttcaa aaaagttgct gggcagagtc atgtcgatgg tatgagagct ttgctcttgc 1141 tgctcaacat tagcacctcc ttggtttgag atgcccaaga gctctcgctt ggaatggtgg 1201 gagttcgcgc tggaatgtct ggttagcttg agcaatgggg gcccaatgtt tggtgaactt 1261 acaagggggc gaaactgcga gtatgtccca gtttccccat tccatcatca tgagcccaaa 1321 tgtgatcgtg cagatcgcga tgctggactc gtcgggggca accatgagca aggcctcttc 1381 gccaaacacg acactgaaag gagtcagcta tgagctataa gagaaacttt cctcgggcca 1441 acactcacac tgagttgtcc atcgcataga caagatcctc ttcagcaaat tcggccagat 1501 gggacctgaa cagcattacc tggatcctgc catagtgaat tgcagtcaca gggctgagac 1561 cgggtgcgat gtcgctgatt gaatcaacgt ctaaggcaga cattgtgata gaggggtgca 1621 gacggcgact acaggtgtgc ttggatgtgg ttatggaatg gatgggacag acgaagtgta 1681 agaagattga cgtatatgaa gatgaatgac aacgaggacc ggtagttggt ggaaaacgga 1741 attgtcgagt gttgagtttg gaggaaggaa gagggggtat ttgcgagaat ttgagccggt 1801 atttgtaggt gatacgacaa tctgctctgc gtgggttaat gtcaaggtga atgcaggaaa 1861 ggcccaatac ctcccgcagc tcgtcctcct attgttcgcg ggaaagggta cgcattttac 1921 tattgtttct gtggcttgcc agctggcgca ccttatgtga ttggtcaaat tgacgtttgc 1981 cctaaggtcg gccgggagaa caataggaag gacttgggat gaaatttggc atacgatgcc 2041 cctcaaatcg gcgagtgacc ttggctgatt ctcacaggag aacaatagga ataacttggg 2101 atgaatctca gcatgcagtg cccctcgtca agtaatctcc acctcaagtt tcacaggaga 2161 acaataggaa ggacctggat tggaaacctg ccaggcaatg tccctcgaaa gatattttgg 2221 aaccctgtgt ctttgttggt tcacttcttc gaaactccgt gtcaacaaaa cttctctcca 2281 tacttagcag tcgcatggca gctttctcaa gcgttcattg ttgaggtttc cttttcgtca 2341 gctgtcgaca tgaatcttct caacatgcaa cctaaaaggt cagagcaacc agctatgttc 2401 gaagaaaacc gtgcctctag ccaggaaggc caggatctcg aagtgatgta caaggtagca 2461 attcttctga cccggaaaca ctcgcttgct tgtcgctaat ggattggtca gaaactccat 2521 cagctacagg ctaggctttc ccgttcagtt ctttcagagg caatcaagga gttcgaagag 2581 aacttcggtg tcttttccat gaagccaagc tcttgctatg ctcaacgagt tcgaagtatc 2641 gccaaagctg gttcgggtct agcaacgagt tcggatctag cgacgagaga agaatcatca 2701 agacatcatg ctgcatcatt gagtcgacaa acacaattct taacttcctc tcatttcttg 2761 agaagaatcg aggattgcca ttcggtggag atcaaagact ccaacaagct gcctacaaag 2821 gccagcagtt tgcgttccgc ctccttcgct cacttacact tcacaaagct gctcaggagg 2881 ttccgggaaa ggactttggc ttggtctacg gaaaagatgt gtacgtactg aatggacata 2941 ttttgcacag gtcgaagcaa gagatcgtgg ggcaggcggg aggaagaaac tggcatgtcg 3001 accataccct ccatcctttg aggcgcgttc caggcacccc atggcacaag ttctttggca 3061 atcttgaagt tggcgacgac aagcaacttc gcctcttcga tgatgatgcg gccgtcgaca 3121 gttaccgagt cggtcctcag aagttctttg tggttattcc ggaaactgct gaatttattt 3181 tggacgaagt cagcagcgag catcagagag tcgctacaat tcacacagag gtaagtactt 3241 gaacgtgtct gaaaactaca aaatttgcac gactgactga aggtagaatg gacatgtcca 3301 gccgccagca ccgacatcca ttcagcaaga agtaagttct cctatctcga tttaatgtag 3361 gtaatcatca ctgacatcac ggcaggctct cctcaggaag ttggactttg ccatgacaac 3421 atcattgcct ggttatgttg tagaaggaca acctgagatt gtgtttcatc atgaacgtta 3481 cgccaggttc gtatgatcct gcttactttt cacggatgat gatgtgctaa caaccgatca 3541 acagatcccc gttgactaca gtcaggagcg cccacttagc attctctccc atgttttcac 3601 tcgacccgca ctttggggag agggtttgga gcttgctgat cacttcgacc cgcgagacgg 3661 tgtgcagcaa gaggagcaca tctattacat ttgatggata tggtagaatc cgtggctgca 3721 caaacaatgc tacttttaat ttaagaaaag tattattcga tcagagtggc tttacttttt 3781 tcttagaagt tcaacaaagc tgttatgtgt tatgtaatcc aagccctcgc tgaaagttgt 3841 gcccccaagg cagcaagccc cccccccccc cccccccccc ccccaccccc ctccctcctc 3901 tcccccgcgg tcgtcaagtg aagggagaga gaagccgctc cacccaaatt aaccaaccaa 3961 ccccatgtct cctatttaag aaagcccagt tcatcttttc caccttcacc caaacttccc 4021 accatctttc cccgaacatc aacttcgcaa ccaaaatctc ggcagcacta cctcacgtgt 4081 tcagtgctct ccaatcaata atccatccac cagaaacacg atgtcgggtg tcgatcaaat 4141 cgtcaagacg ttcgccgacc tcgctgagga cgaccgtgaa gcggcaatga gagctttctc 4201 aaggatgatg cgtagaggta ccgaacctgt tcgccgaatc cccgcggcaa agaagaaggt 4261 caacggcttc atgggtttca gatgtgagtc aaatctgaat caacattgtc gttgatccat 4321 ggctgattgc tcttcatttc agcgtactat tccccgctct tctctcagct cccgcaaaag 4381 gagagatcgc ccttcatgac tattctctgg cagcatgatc ccttccacaa tgagtgggat 4441 ttcatgtgct cggtgtattc gtcaatccgg acctaccttg agcaggagaa ggttactctg 4501 caactctgga ttcactatgc tgtcggccat ctgggagtga ttatccgcga caactacatg 4561 gcatcctttg gctggaacct cgtccgtttt cccaacggca ctcacgacct cgagcgcacg 4621 gctcttcctt tggttcagca caatctccag cccatgaacg gcttatgcct gctcaccaag 4681 tgcctcgaga gcggattgcc tcttgccaat cctcactctg tcatcgccaa gctttcagat 4741 cctagctacg acatgatctg gttcaacaag cgtcctcacc gtcagcaggg acacgccgtt 4801 caaactgatg aatctgaagt tggagtttcg gcgatgttcc ctcgcaatca cacggtcgct 4861 gcagaggtag atggcatcat caatcttcct ctctcccatt ggattcagca gggagaattc 4921 ggtaccgagt ctggatactc agctcagttt gagaccttgt tggattcaat tctcgagaat 4981 ggacacgcct ccagcaatga cccttacaac atggctctgg ctatcgatgt tcccatgatg 5041 ggttagtgga agatgaggta ccatcttgca aaactttacc cgtgtgctaa ccgattaaca 5101 ggatttaacg gaggagcata gaagcacggc gcagtcaccg ttttctttcc ttgtcacatc 5161 tggatttcgt gttacgggca tacaaagcga gggcgaaaag ggtctagtta ggtttctttg 5221 tgcatacatt gggcaatcat gagacttcag aatcgacggg gtggaatggg caattacacg 5281 gcaaggagac aggtacgcct agaaggcgaa agagtatcaa ataaaatcaa atcagcggcg 5341 tccaccatct gatccgggat ggccttcact actcgggggt tgcggttcgc ttttgtatgg 5401 ggagaggggg gaaaaagttt ggccagccaa aagcgacccg aatggaaccc tagtcaatca 5461 atacctatga acgcaagcgt ctgcggtgtc attgccggat ttgacatgtc gttgagataa 5521 agaaacaggc ccgccgctga cggcaacgct tatgcatgca accccgctgc gctgaatgct 5581 tcagccgcaa aactggggca atgcgggagc tgtggccccc gttcatgcta gtgtacaggg 5641 ttgctctgct tctaagatcc tgataagggt ccgctgatgt ttgtacatac tacatatcag 5701 tccctgtaag tttgctagtc tggttcctgc cccatatttt cttccaaggg ggtaatatgg 5761 ggactgtaag gcggactggt ctatctacga gtccgggtcc ccgcaggaac tgtacccttc 5821 agtgggtccc ggtcacgtat cctgcacgtt ccgtctcggc caggaatggc agctttcccc 5881 gttgattttc ggtttatcat cacataaagg ttttggttgc ttgtcgac // LOCUS HUMNCADH 3451 bp ss-mRNA PRI 17-JUL-1990 DEFINITION Human N-cadherin mRNA, complete cds. ACCESSION M34064 KEYWORDS N-cadherin; cell adhesion molecule; transmembrane protein. SOURCE Human muscle, cDNA to mRNA, clones lambda-[4-10,1-5,13,14]. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 3451) AUTHORS Walsh,F.S., Barton,C.H., Putt,W., Moore,S.E., Kesell,D., Spurr,N. and Goodfellow,P.N. TITLE The N-cadherin gene maps to human Chromosome 18 and is not linked to the E-cadherin gene JOURNAL J. Neurochem. (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by C.H.Barton, 08-MAY-1990. FEATURES from to/span description pept < 1 2247 N-cadherin (AA at 1) BASE COUNT 1041 a 703 c 724 g 983 t ORIGIN Chromosome 18. 1 gactgggtca tccctccaat caacttgcca gaaaactcca ggggaccttt tcctcaagag 61 cttgtcagga tcaggtctga tagagataaa aacctttcac tgcggatacg tgtaactggg 121 ccaggagctg accagcctcc aactggtatc ttcattctca accccatctc gggtcagctg 181 tcggtgacaa agcccctgga tcgccagcag aatgcccggt ttcatttagg ggcacatgca 241 gtagatatta atggaaatca agtggagacc cccattgaca ttgtcatcaa tgttattgac 301 atgaatgaca acagacctga gttcttacac caggtttgga atgggacagt tcctgaggga 361 tcaaagcctg gaacatatgt gatgaccgta acagcaattg atgctgacga tcccaatgcc 421 ctcaatggga tgttgaggta cagaatcgtg tctcaggctc caagcacccc ttcacccaac 481 atgtttacaa tcaacaatga gactggtgac atcatcacag tggcagctgg acttgatcga 541 gaaaaagtgc aacagtatac gttaataatt caagctacag acatggaagg caatcccaca 601 tatggccttt caaacacagc cacggccgtc atcacagtga cagatgtcaa tgacaatcct 661 ccagagttta ctgccatgac gttttatggt gaagttcctg agaacagggt agacatcata 721 gtagctaatc taactgtgac cgataaggat caaccccata caccagcctg gaacgcagtg 781 tacagaatca gtggcggaga tcctactgga cggttcgcca tccagaccga cccaaacagc 841 aacgacgggt tagtcaccgt ggtcaaacca atcgactttg aaacaaatag gatgtttgtc 901 cttactgttg ctgcagaaaa tcaagtgcca ttagccaagg gaattcagca cccgcctcag 961 tcaactgcaa ccgtgtctgt tacagttatt gacgtaaatg aaaaccctta ttttgccccc 1021 aatcctaaga tcattcgcca agaagaaggg cttcatgccg gtaccatgtt gacaacattc 1081 actgctcagg acccagatcg atatatgcag caaaaatatt taagatacac taaattatct 1141 gatcctgcca attggctaaa aatagatcct gtgaatggac aaataactac aattgctgtt 1201 ttggaccgag aatcaccaaa tgtgaaaaac aatatatata atgctacttt ccttgcttct 1261 gacaatggaa ttcctcctat gagtggaaca ggaacgctgc agatctattt acttgatatt 1321 aatgacaatg cccctcaagt gttacctcaa gaggcagaga cttgcgaaac tccagacccc 1381 aattcaatta atattacagc acttgattat gacattgatc caaatgctgg accatttgct 1441 tttgatcttc ctttatctcc agtgactatt aagagaaatt ggaccatcac tcggcttaat 1501 ggtgattttg ctcagcttaa tttaaagata aaatttcttg aagctggtat ctatgaagtt 1561 cccatcataa tcacagattc gggtaatcct cccaaatcaa atatttccat cctgcgcgtg 1621 aaggtttgcc agtgtgactc caacggggac tgcacagatg tggacaggat tgtgggtgcg 1681 gggcttggca ccggtgccat cattgccatc ctgctctgca tcatcatcct gcttatcctt 1741 gtgctgatgt ttgtggtatg gatgaaacgc cgggataaag aacgccaggc caaacaactt 1801 ttaattgatc cagaagatga tgtaagagat aacattttaa aatatgatga agaaggtgga 1861 ggagaagaag accaggacta tgacttgagc cagctgcagc agcctgacac tgtggagcct 1921 gatgccatca agcctgtggg aatccgacga atggatgaaa gacccatcca cgccgagccc 1981 cagtatccgg tccgatctgc agccccacac cctggagaca ttggggactt cattaatgag 2041 ggccttaaag cggctgacaa tgaccccaca gctccaccat atgactccct gttagtgttt 2101 gactatgaag gcagtggctc cactgctggg tccttgagct cccttaattc ctcaagtagt 2161 ggtggtgagc aggactatga ttacctgaac gactgggggc cacggttcaa gaaacttgct 2221 gacatgtatg gtggaggtga tgactgaact tcagggtgaa cttggttttt ggacaagtac 2281 aaacaatttc aactgatatt cccaaaaagc attcagaagc taggctttaa ctttgtagtc 2341 tactagcaca gtgcctgctg gaggctttgg cataggctgc aaaccaattt gggctcagag 2401 ggaatatcag tgatccatac tgtttggaaa aacactgagc tcagttacac ttgaatttta 2461 cagtacagaa gcactgggat tttatgtgcc tttttgtacc tttttcagat tggaattagt 2521 tttctgttta aggctttaat ggtactgatt tctgaaacga taagtaaaag acaaaatatt 2581 ttgtggtggg agcagtaagt taaaccatga tatgcttcaa cacgcttttg ttacattgca 2641 tttgctttta ttaaaataca aaattaaaca aacaaaaaaa ctcatggagc gattttatta 2701 tcttggggga tgagaccatg agattggaaa atgtacatta cttctagttt tagactttag 2761 tttgtttttt ttttttttca ctaaaatctt aaaacttact cagctggttg caaataaagg 2821 gagttttcat atcaccaatt tgtagcaaaa ttgaattttt tcataaacta gaatgttaga 2881 cacattttgg tcttaatcca tgtacacctt tttatttctg tatttttcca cttcactgta 2941 aaaatagtat gtgtacataa tgttttattg gcatacgtct atggagaagt gcagaaactt 3001 cagaacatgt gtatgtatta tttggactat ggattcaggt tttttgcatg tttatatctt 3061 tcgttatgga taaagtattt acaaaacagt gacatttgat tcaattgttg agctgtagtt 3121 agaatactca atttttaatt tttttaattt ttttattttt tattttcttt ttggtttggg 3181 gagggagaaa agttcttagc acaaatgttt tacataattt gtaccaaaaa aaaaaaaaaa 3241 ggaaaggaaa gaaaggggtg gcctgacact ggtggcacta ctaagtgtgt gtttttttaa 3301 aaaaaaaatg gaaaaaaaaa agcctttaaa ctggagagac ttctgacaac agctttgcct 3361 ctgtattgtg taccagaata taaatgatac acctctgacc ccagcgttct gaataaaatg 3421 ctaattttgg ataacaaaaa aaggggaatt c // LOCUS HUMFGF2H 3365 bp ss-mRNA PRI 17-JUL-1990 DEFINITION Human fibroblast growth factor receptor (FGFr) transmembrane form mRNA, complete cds. ACCESSION M34185 KEYWORDS FGF receptor; fibroblast growth factor receptor; transmembrane tyrosine kinase. SOURCE Human umbilical vein endothelial cell line HUVEC, cDNA to mRNA, clone h2. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 3365) AUTHORS Johnson,D.E., Lee,P.L., Lu,J. and Williams,L.T. TITLE Diverse forms of a receptor for acidic and basic fibroblast growth factors JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by D.E.Johnson, 10-MAY-1990. Author address: D.E.Johnson University of California San Francisco 4th and Parnassus Howard Hughes Medical Institute San Francisco, CA 94143 (415) 476-4297 FEATURES from to/span description pept 256 2457 fibroblast growth factor receptor (FGFr) transmembrane form BASE COUNT 786 a 962 c 917 g 700 t ORIGIN 1 gcaccgagcg ccgccgggag tcgagcgccg gccgcggagc tcttgcgacc ccgccaggac 61 ccgaacagag cccgggggcg gcgggccgga gccggggacg cgggcacacg cccgctcgca 121 caagccacgg cggactctcc cgaggcggaa cctccacgcc gagcgagggt cagtttgaaa 181 aggaggatcg agctcactgt ggagtatcca tggagatgtg gagccttgtc accaacctct 241 aactgcagaa ctgggatgtg gagctggaag tgcctcctct tctgggctgt gctggtcaca 301 gccacactct gcaccgctag gccgtccccg accttgcctg aacaagatgc tctcccctcc 361 tcggaggatg atgatgatga tgatgactcc tcttcagagg agaaagaaac agataacacc 421 aaaccaaacc gtatgcccgt agctccatat tggacatccc cagaaaagat ggaaaagaaa 481 ttgcatgcag tgccggctgc caagacagtg aagttcaaat gcccttccag tgggacccca 541 aaccccacac tgcgctggtt gaaaaatggc aaagaattca aacctgacca cagaattgga 601 ggctacaagg tccgttatgc cacctggagc atcataatgg actctgtggt gccctctgac 661 aagggcaact acacctgcat tgtggagaat gagtacggca gcatcaacca cacataccag 721 ctggatgtcg tggagcggtc ccctcaccgg cccatcctgc aagcagggtt gcccgccaac 781 aaaacagtgg ccctgggtag caacgtggag ttcatgtgta aggtgtacag tgacccgcag 841 ccgcacatcc agtggctaaa gcacatcgag gtgaatggga gcaagattgg cccagacaac 901 ctgccttatg tccagatctt gaagactgct ggagttaata ccaccgacaa agagatggag 961 gtgcttcact taagaaatgt ctcctttgag gacgcagggg agtatacgtg cttggcgggt 1021 aactctatcg gactctccca tcactctgca tggttgaccg ttctggaagc cctggaagag 1081 aggccggcag tgatgacctc gcccctgtac ctggagatca tcatctattg cacaggggcc 1141 ttcctcatct cctgcatggt ggggtcggtc atcgtctaca agatgaagag tggtaccaag 1201 aagagtgact tccacagcca gatggctgtg cacaagctgg ccaagagcat ccctctgcgc 1261 agacaggtaa cagtgtctgc tgactccagt gcatccatga actctggggt tcttctggtt 1321 cggccatcac ggctctcctc cagtgggact cccatgctag caggggtctc tgagtatgag 1381 cttcccgaag accctcgctg ggagctgcct cgggacagac tggtcttagg caaacccctg 1441 ggagagggct gctttgggca ggtggtgttg gcagaggcta tcgggctgga caaggacaaa 1501 cccaaccgtg tgaccaaagt ggctgtgaag atgttgaagt cggacgcaac agagaaagac 1561 ttgtcagacc tgatctcaga aatggagatg atgaagatga tcgggaagca taagaatatc 1621 atcaacctgc tgggggcctg cacgcaggat ggtcccttgt atgtcatcgt ggagtatgcc 1681 tccaagggca acctgcggga gtacctgcag gcccggaggc ccccagggct ggaatactgc 1741 tacaacccca gccacaaccc agaggagcag ctctcctcca aggacctggt gtcctgcgcc 1801 taccaggtgg cccgaggcat ggagtatctg gcctccaaga agtgcataca ccgagacctg 1861 gcagccagga atgtcctggt gacagaggac aatgtgatga agatagcaga ctttggcctc 1921 gcacgggaca ttcaccacat cgactactat aaaaagacaa ccaacggccg actgcctgtg 1981 aagtggatgg cacccgaggc attatttgac cggatctaca cccaccagag tgatgtgtgg 2041 tctttcgggg tgctcctgtg ggagatcttc actctgggcg gctccccata ccccggtgtg 2101 cctgtggagg aacttttcaa gctgctgaag gagggtcacc gcatggacaa gcccagtaac 2161 tgcaccaacg agctgtacat gatgatgcgg gactgctggc atgcagtgcc ctcacagaga 2221 cccaccttca agcagctggt ggaagacctg gaccgcatcg tggccttgac ctccaaccag 2281 gagtacctgg acctgtccat gcccctggac cagtactccc ccagctttcc cgacacccgg 2341 agctctacgt gctcctcagg ggaggattcc gtcttctctc atgagccgct gcccgaggag 2401 ccctgcctgc cccgacaccc agcccagctt gccaatggcg gactcaaacg ccgctgactg 2461 ccacccacac gccctcccca gactccaccg tcagctgtaa ccctcaccca cagcccctgc 2521 tgggcccacc acctgtccgt ccctgtcccc tttcctgctg gcaggagccg gctgcctacc 2581 aggggccttc ctgtgtggcc tgccttcacc ccactcagct cacctctccc tccacctcct 2641 ctccacctgc tggtgagagg tggcaaagag gcagatcttt gctgccagcc acttcatccc 2701 ctcccagatg ttggaccaac acccctccct gccaccaggc actgcctgga gggcagggag 2761 tgggagccaa tgaacaggca tgcaagtgag agcttcctga gctttctcct gtcggtttgg 2821 tctgttttgc cttcacccat aagcccctcg cactctggtg gcaggtgcct tgtcctcagg 2881 gctacagcag tagggaggtc agtgcttcgt gcctcgattg aaggtgacct ctgccccaga 2941 taggtggtgc cagtggctta ttaattccga tactagtttg ctttgctgac caaatgcctg 3001 gtaccagagg atggtgaggc gaaggccagg ttgggggcag tgttgtggcc ctggggccca 3061 gccccaaact gggggctctg tatatagcta tgaagaaaac acaaagtgta taaatctgag 3121 tatatattta catgtctttt taaaagggtc gttaccagag atttacccat cgggtaagat 3181 gctcctggtg gctgggaggc atcagttgct atatattaaa aacaaaaaag aaaaaaaagg 3241 aaaacgtttt taaaaaggtc atatattttt tgctactttt gctgttttat ttttttaaat 3301 tatgttctaa acctattttc agtttaggtc cctcaataaa aattgctgct gcttcaaaaa 3361 aaaaa // LOCUS HUMFGF3H 3503 bp ss-mRNA PRI 17-JUL-1990 DEFINITION Human fibroblast growth factor receptor (FGFr) transmembrane form mRNA, complete cds. ACCESSION M34186 KEYWORDS FGF receptor; fibroblast growth factor receptor; transmembrane tyrosine kinase. SOURCE Human umbilical vein endothelial cell line HUVEC, cDNA to mRNA, clone h3. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 3503) AUTHORS Johnson,D.E., Lee,P.L., Lu,J. and Williams,L.T. TITLE Diverse forms of a receptor for acidic and basic fibroblast growth factors JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by D.E.Johnson, 10-MAY-1990. Author address: D.E.Johnson University of California San Francisco 4th and Parnassus Howard Hughes Medical Institute San Francisco, CA 94143 (415) 476-4297 FEATURES from to/span description pept 527 2722 fibroblast growth factor receptor (FGFr) transmembrane form BASE COUNT 777 a 1044 c 1005 g 677 t ORIGIN 1 gcggaaccca aggacttttc tccggtccga gctcggggcg ccccgcaccg ggacggtacc 61 cgtgctgcag tcgggcacgc cgcgggcccg ccgggggcct ccgcagggcg atggagccgg 121 tctgcaagga aagtgaggcg ccgccgctgc gttctggagg aggggggcac aaggtctgga 181 gaccccgggt ggcggacggg agccctcccc ccgccccgcc tccggggcac cagctccggc 241 tccattgttc ccgcccgggc tggaggcgcc gagcaccgag cgccgccggg agtcgagcgc 301 cggccgcgga gtcttgcgac cccgccagga cccgaacaga gcccgggggc ggcgggccgg 361 agccggggac gcgggcacac gcccgctcgc acaagccacg gcggactctc ccgaggcgga 421 acctccacgc cgagcgaggg tcagtttgaa aaggaggatc gagctcactg tggagtatcc 481 atggagatgt ggagccttgt caccaacctc taactgcaga actgggatgt ggagctggaa 541 gtgcctcctc ttctgggctg tgctggtcac agccacactc tgcaccgcta ggccgtcccc 601 gaccttgcct gaacaagatg ctctcccctc ctcggaggat gatgatgatg atgatgactc 661 ctcttcagag gagaaagaaa cagataacac caaaccaaac cccgtagctc catattggac 721 atccccagaa aagatggaaa agaaattgca tgcagtgccg gctgccaaga cagtgaagtt 781 caaatgccct tccagtggga ccccaaaccc cacactgcgc tggttggaaa atggcaaaga 841 attcaaacct gaccacagaa ttggaggcta caaggtccgt tatgccacct ggagcatcat 901 aatggactct gtggtgccct ctgacaaggg caactacacc tgcattgtgg agaatgagta 961 cggcagcatc aaccacacat accagctgga tgtcgtggag cggtcccctc accggcccat 1021 cctgcaagca gggttgcccg ccaacaaaac agtggccctg ggtagcaacg tggagttcat 1081 gtgtaaggtg tacagtgacc cgcagccgca catccagtgg ctaaagcaca tcgaggtgaa 1141 tgggagcaag attggcccag acaacctgcc ttatgtccag atcttgaaga ctgctggagt 1201 taataccacc gacaaagaga tggaggtgct tcacttaaga aatgtctcct ttgaggacgc 1261 aggggagtat acgtgcttgg cgggtaactc tatcggactc tcccatcact ctgcatggtt 1321 gaccgttctg gaagccctgg aagagaggcc ggcagtgatg acctcgcccc tgtacctgga 1381 gatcatcatc tattgcacag gggccttcct catctcctgc atggtggggt cggtcatcgt 1441 ctacaagatg aagagtggta ccaagaagag tgacttccac agccagatgg ctgtgcacaa 1501 gctggccaag agcatccctc tgcgcagaca ggtaacagtg tctgctgact ccagtgcatc 1561 catgaactct ggggttcttc tggttcggcc atcacggctc tcctccagtg ggactcccat 1621 gctagcaggg gtctctgagt atgagcttcc cgaagaccct cgctgggagc tgcctcggga 1681 cagactggtc ttaggcaaac ccctgggaga gggctgcttt gggcaggtgg tgttggcaga 1741 ggctatcggg ctggacaagg acaaacccaa ccgtgtgacc aaagtggctg tgaagatgtt 1801 gaagtcggac gcaacagaga aagacttgtc agacctgatc tcagaaatgg agatgatgaa 1861 gatgatcggg aagcataaga atatcatcaa cctgctgggg gcctgcacgc aggatggtcc 1921 cttgtatgtc atcgtggagt atgcctccaa gggcaacctg cgggagtacc tgcaggcccg 1981 gaggccccca gggctggaat actgctacaa ccccagccac aacccagagg agcagctctc 2041 ctccaaggac ctggtgtcct gcgcctacca ggtggcccga ggcatggagt atctggcctc 2101 caagaagtgc atacaccgag acctggcagc caggaatgtc ctggtgacag aggacaatgt 2161 gatgaagata gcagactttg gcctcgcacg ggacattcac cacatcgact actataaaaa 2221 gacaaccaac ggccgactgc ctgtgaagtg gatggcaccc gaggcattat ttgaccggat 2281 ctacacccac cagagtgatg tgtggtcttt cggggtgctc ctgtgggaga tcttcactct 2341 gggcggctcc ccataccccg gtgtgcctgt ggaggaactt ttcaagctgc tgaaggaggg 2401 tcaccgcatg gacaagccca gtaactgcac caacgagctg tacatgatga tgcgggactg 2461 ctggcatgca gtgccctcac agagacccac cttcaagcag ctggtggaag acctggaccg 2521 catcgtggcc ttgacctcca accaggagta cctggacctg tccatgcccc tggaccagta 2581 ctcccccagc tttcccgaca cccggagctc tacgtgctcc tcaggggagg attccgtctt 2641 ctctcatgag ccgctgcccg aggagccctg cctgccccga cacccagccc agcttgccaa 2701 tggcggactc aaacgccgct gactgccacc cacacgccct ccccagactc caccgtcagc 2761 tgtaaccctc acccacagcc cctgctgggc ccaccacctg tccgtccctg tcccctttcc 2821 tgctggcagg agccggctgc ctaccagggg ccttcctgtg tggcctgcct tcaccccact 2881 cagctcacct ctccctccac ctcctctcca cctgctggtg agaggtggca aagaggcaga 2941 tcttttcact gccagccact tcatcccctc ccagatgttg gaccaacacc cctccctgcc 3001 accaggcact gcctggaggg cagggagtgg gagccaatga acaggcatgc aagtgagagc 3061 ttcctgagct ttctcctgtc ggtttggtct gttttgcctt cacccataag cccctcgcac 3121 tctggtggca ggtgccttgt cctcagggct acagcagtag ggaggtcagt gcttcgtgcc 3181 tcgattgaag gtgacctctg ccccagatag gtggtgccag tggcttatta attccgatac 3241 tagtttgctt tgctgaccaa atgcctggta ccagaggatg gtgaggcgaa ggccaggttg 3301 ggggcagtgt tgtggccctg gggcccagcc ccaaactggg ggctctgtat atagctatga 3361 agaaaacaca aagtgtataa atctgagtat atatttacat gtctttttaa aagggtcgtt 3421 accagagatt tacccatcgg gtaagatgct cctggtggct gggaggcatc agttgctata 3481 tattaaaaac aaaaaaaaaa aaa // LOCUS HUMFGF4H 2283 bp ss-mRNA PRI 17-JUL-1990 DEFINITION Human fibroblast growth factor receptor (FGFr) transmembrane form mRNA, complete cds. ACCESSION M34187 KEYWORDS FGF receptor; fibroblast growth factor receptor; transmembrane tyrosine kinase. SOURCE Human umbilical vein endothelial cell line HUVEC, cDNA to mRNA, clone h4. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 2283) AUTHORS Johnson,D.E., Lee,P.L., Lu,J. and Williams,L.T. TITLE Diverse forms of a receptor for acidic and basic fibroblast growth factors JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by D.E.Johnson, 10-MAY-1990. Author address: D.E.Johnson University of California San Francisco 4th and Parnassus Howard Hughes Medical Institute San Francisco, CA 94143 (415) 476-4297 FEATURES from to/span description pept 417 1325 fibroblast growth factor receptor (FGFr) transmembrane form BASE COUNT 566 a 621 c 612 g 484 t ORIGIN 1 ggagcccggt ctgcaaggaa agtgaggcgc cgccgctgcg ttctggagga ggggggcaca 61 aggtctggag accccgggtg gcggacggga gccctccccc cgccccgcct ccggggcacc 121 agctccggct ccattgttcc cgcccgggct ggaggcgccg agcaccgagc gccgccggga 181 gtcgagcgcc ggccgcggag ctcttgcgac cccgccagga cccgaacaga gcccgggggc 241 ggcgggccgg agccggggac gcgggcacac gcccgctcgc acaagccacg gcggactctc 301 ccgaggcgga acctccacgc cgagcgaggg tcagtttgaa aaggaggatc gagctcactg 361 tggagtatcc atggagatgt ggagccttgt caccaacctc taactgcaga actgggatgt 421 ggagctggaa gtgcctcctc ttctgggctg tgctggtcac agccacactc tgcaccgcta 481 ggccgtcccc gaccttgcct gaacaagatg ctctcccctc ctcggaggat gatgatgatg 541 atgatgactc ctcttcagag gagaaagaaa cagataacac caaaccaaac cgtatgcccg 601 tagctccata ttggacatcc ccagaaaaga tggaaaagaa attgcatgca gtgccggctg 661 ccaagacagt gaagttcaaa tgcccttcca gtgggacccc aaaccccaca ctgcgctggt 721 tgaaaaatgg caaagaattc aaacctgacc acagaattgg aggctacaag gtccgttatg 781 ccacctggag catcataatg gactctgtgg tgccctctga caagggcaac tacacctgca 841 ttgtggagaa tgagtacggc agcatcaacc acacatacca gctggatgtc gtggagcggt 901 cccctcaccg gcccatcctg caagcagggt tgcccgccaa caaaacagtg gccctgggta 961 gcaacgtgga gttcatgtgt aaggtgtaca gtgacccgca gccgcacatc cagtggctaa 1021 agcacatcga ggtgaatggg agcaagattg gcccagacaa cctgccttat gtccagatct 1081 tgaaggtaat catggcacca gtcttcgtgg gccagtctac tgggaaggag accactgtct 1141 cgggggctca agttcctgtg ggcaggctca gttgcccccg aatgggatca ttcctcacgc 1201 ttcaggcaca cacactccat ctcagtaggg atctagccac atcccccagg actagtaaca 1261 gaggtcacaa agtggaggtg agctgggaac agagggctgc agggatgggt ggtgctggtc 1321 tgtaataagc tttgagagca acgtcactgg ggctttgggg tcagctacac aaggaaggca 1381 tttggacccc tgccttttca ttgcccgaaa ccagagcctt tccaccaagc gtttcccagt 1441 cttagccctg tgttctgagt tacgtacgat ctttctggca aatggggtgc atgataagag 1501 catctcttac gaagagttgg aaaaacaaat gccatatata aattctaagc catatgagga 1561 cgaggagtaa tggcattttc ttcctttttc ctctcactcc cagacattca ttgtccctga 1621 atgctccatt aatccaggga aggtaattgc ctaaatctcc agtggatctc gcaacaggaa 1681 ggaaccagaa gctgggaaag ttgtttacct ctttgtccca gagttagacc tcatcctccc 1741 ctagcttagc tgtctcagag atatactggc cctcccttct cttctctttg ctgctggtgc 1801 taaaactgct ctgtaggtca ttggccactg tctccactca caacccctgc tccagtcctg 1861 gagggagtgg gttaaacaca aatagaacat tccatttgaa gcagtgattc tttttttttt 1921 tttttttttt taatcaaatg ctttggactt ttgaagtcca cttgttctgt acttgtaaaa 1981 gggaaagaag gccgggcgca gtcgtcacgc ctgtaatccc agcactttag atcacttgag 2041 gtcaggagtt tgagaccagc ccggccaaca tggtgaaacc ccatctctac taaaaataca 2101 aaaattagct gtgcatagtg gttggcacct gtagtcccag ctactcagga ggctgaggca 2161 agctaactgc ttgaacccag aaggcagagg ttgcagtgag ctgagatcac gccactgcac 2221 tccagcctgg gtgacagagt gagtgagact ctgcgttaaa aaaaaaaaaa aaaaaaaaaa 2281 aaa // LOCUS HUMFGF5H 1625 bp ss-mRNA PRI 17-JUL-1990 DEFINITION Human fibroblast growth factor receptor (FGFr) secreted form mRNA, complete cds. ACCESSION M34188 KEYWORDS FGF receptor; fibroblast growth factor receptor. SOURCE Human female placenta endothelial cell line HUVEC, cDNA to mRNA, clone h5. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 1625) AUTHORS Johnson,D.E., Lee,P.L., Lu,J. and Williams,L.T. TITLE Diverse forms of a receptor for acidic and basic fibroblast growth factors JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by D.E.Johnson, 10-MAY-1990. Author address: D.E.Johnson University of California San Francisco 4th and Parnassus Howard Hughes Medical Institute San Francisco, CA 94143 (415) 476-4297 FEATURES from to/span description pept 523 1425 fibroblast growth factor receptor (FGFr) transmembrane form BASE COUNT 368 a 480 c 489 g 288 t ORIGIN 1 cggaacccaa ggacttttct ccggtccgag ctcggggcgc cccgcaggcg acggtacccg 61 tgctgcagtc gggcacgccg cgggcccggg gcctccgcag ggcgatggag cccggtctgc 121 aaggaaagtg aggcgccgcc gctgcgttct ggaggagggg ggcacaaggt ctggagaccc 181 cgggtggcgg acgggagccc tccccccgcc ccgcctccgg ggcaccagct ccggctccat 241 tgttcccgcc cgggctggag gcgccgagca ccgagcgccg ccgggagtcg agcgccggcc 301 gcggagctct tgcgaccccg ccaggacccg aacagagccc gggggcggcg ggccggagcc 361 ggggacgcgg gcacacgccc gctcgcacaa gccacggcgg actctcccga ggcggaacct 421 ccacgccgag cgagggtcag tttgaaaagg aggatcgagc tcactgtgga gtatccatgg 481 agatgtggag ccttgtcacc aacctctaac tgcagaactg ggatgtggag ctggaagtgc 541 ctcctcttct gggctgtgct ggtcacagcc acactctgca ccgctaggcc gtccccgacc 601 ttgcctgaac aagatgctct cccctcctcg gaggatgatg atgatgatga tgactcctct 661 tcagaggaga aagaaacaga taacaccaaa ccaaaccccg tagctccata ttggacatcc 721 ccagaaaaga tggaaaagaa attgcatgca gtgccggctg ccaagacagt gaagttcaaa 781 tgcccttcca gtgggacccc aaaccccaca ctgcgctggt tgaaaaatgg caaagaattc 841 aaacctgacc acagaattgg aggctacaag gtccgttatg ccacctggag catcataatg 901 gactctgtgg tgccctctga caagggcaac tacacctgca ttgtggagaa tgagtacggc 961 agcatcaacc acacatacca gctggatgtc gtggagcggt cccctcaccg gcccatcctg 1021 caagcagggt tgcccgccaa caaaacagtg gccctgggta gcaacgtgga gttcatgtgt 1081 aaggtgtaca gtgacccgca gccgcacatc cagtggctaa agcacatcga ggtgaatggg 1141 agcaagattg gcccagacaa cctgccttat gtccagatct tgaaggtaat catggcacca 1201 gtcttcgtgg gccagtctac tgggaaggag accactgtct cgggggctca agttcctgtg 1261 ggcaggctca gttgcccccg aatgggatca ttcctcacgc ttcaggcaca cacactccat 1321 ctcagtaggg atctagccac atcccccagg actagtaaca gaggtcacaa agtggaggtg 1381 agctgggaac agagggctgc agggatgggt ggtgctggtc tgtaataagc tttgagagca 1441 acgtcactgg ggctttgggg tcagctacac aaggaaggca tttggacccc tgccttttca 1501 ttgcccgaaa ccagagcctt tccaccaagc gtttcccagt cttagccctg tgtcctgagt 1561 tacgtacgat ctttctggca aatggggtgc atgataagag catctcttac gaagagttgg 1621 aaaaa // LOCUS HUMINTB1A 1146 bp ss-mRNA PRI 17-JUL-1990 DEFINITION Human integrin beta-1 subunit mRNA, 3' end (cytoplasmic domain). ACCESSION M34189 KEYWORDS integrin; integrin beta-1 subunit cytoplasmic domain. SOURCE Human placenta, cDNA to mRNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 1146) AUTHORS Altruda,F., Cervella,P., Tarone,G., Botta,C., Balzac,F., Stefanuto,G. and Silengo,L. TITLE A human integrin beta-1 subunit with a unique cytoplasmic domain JOURNAL Gene (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by F.Altruda, 10-MAY-1990. FEATURES from to/span description pept < 1 117 integrin beta-1 subunit (AA at 1) (cytoplasmic domain) BASE COUNT 373 a 184 c 222 g 367 t ORIGIN 1 aagcttttaa tgataattca tgacagaagg gagtttgcta aatttgaaaa ggagaaaatg 61 aatgccaaat gggacacggt aagttacaaa acatccaaaa agcaaagtgg cttataaagt 121 aaatgtaata ctcctaagac ttatgtatta gctgtcaggc tgattattaa agtcctttct 181 aagtatttta ttcccccaaa agtttcttac tcaaggaatt tgcatttagt gaaaaacaga 241 aagcatccta aatatatccc attgaaacaa aacattgatt ataagcatgt atattctggt 301 tcatgtggcc gatattttta tttctttaat gattttgatc ctaaatctgc cttttcatct 361 aatgtgaagt agaatcctaa ataatgttat ctgtgtagca agctattcaa tgggaaagct 421 gcttctttct ttaaaacaaa caaacaaaaa aaaccttcag tggaaagcca aattccaaaa 481 ggttatatac caagcttgtc caactcgcag ctcgtcggcc aggacatgca gcccagaata 541 gctttgaatg tggccccaac acaaatttgt aaactttctt agaaattgta attattatta 601 ttattttttt ttggtaactt tttttaaagc tcatcagcta tcgttagtgt attttatgtg 661 tggcccaaga cagttcttct tcttgccagt gtggcccagg gaagccaaaa gattggacac 721 ccctgctata tactatatga ttccatttag aggacattct ggaaaagcaa aactgtaggg 781 gcaaaaatca gtggttgcta ggggctggaa tgggggaaag tgttgaccac agaggggcgt 841 aagggatctt ccttgggatg acttgattgt gggtggattt atgtatttga aaactcacag 901 aactatgtac tttaaaaaga tgtatgttcc tctatgaaaa ttatatctca gtaaactttg 961 gcttataaaa atcttaaaag ccctaagtga ccgaaaggtt atgttagcat tgagtgcttt 1021 gaaatatgga gtcagagggt ggggtaacca aatgttggcc tttgtgtatt catcttttga 1081 tacaagaaag caatgccaat cttcagtatt tttaaattgt aaatgaattt tgtagttccc 1141 gaattc // LOCUS RATIRF1A 2078 bp ss-mRNA ROD 17-JUL-1990 DEFINITION Rat interferon regulatory factor 1 (IRF-1) mRNA, complete cds. ACCESSION M34253 KEYWORDS interferon regulatory factor 1; transcription factor. SOURCE Rat cell line Nb2-11c T-cell, cDNA to mRNA, clones 25,4b. ORGANISM Rattus rattus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 2078) AUTHORS Yu-Lee,L.-Y., Hrachovy,J.A., Stevens,A.M. and Schwarz,L.A. TITLE Interferon regulatory factor 1 is an immediate-early gene under transcriptional regulation by prolactin in Nb2 T cells JOURNAL Mol. Cell. Biol. 3087, 3094 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by L.-Y.Yu-Lee, 11-MAY-1990. FEATURES from to/span description pept 198 1184 interferon regulatory factor 1 (IRF-1) BASE COUNT 542 a 544 c 540 g 452 t ORIGIN 1 ctcgacgaag gagtaggacg agctctcact gtctgagcca aaccgaaccg ggccgagctg 61 agccgaggtc agcggtggcc agaggaaccc agcatctcgg gcatcattcg ctccgtgcac 121 gcatcgtgta cctacaccgc aactccgtgc ctcattcccg ggtaccctct gtgactcgct 181 cctgcagcaa agccaccatg cctatcactc ggatgcgaat gagaccctgg ctagagatgc 241 agattaattc caaccaaatt ccagggctga gctggatcaa taaagaagag atgatcttcc 301 agatcccatg gaagcatgct gccttgcacg gttgggatat caacaaggat gcctgtctgt 361 tccggagctg ggccattcac acaggccgat acaaagctgg ggaaaaagag ccagatccca 421 agacttggaa ggcaaacttc cggtgtgcca tgaactccct accagacatc gaggaagtga 481 aggaccagag caggaacaag ggcagctctg ctgtacgcgt gtaccggatg ctgccacccc 541 tcaccaagaa ccagaggaaa gagagaaagt ccaagtccag ccgtgacact aagagcaaaa 601 ccaagaggaa gctgtgcgga gattctagcc ctgacacctt atctgacgga ctgagcagct 661 ctactctgcc tgatgaccac agcagttaca cagctcaggg atacctgggt caggacttgg 721 acatggacag ggacattacc ccagctctgt caccgtgcgt cgtcagcagc agtctctctg 781 agtggcatat gcagatggac atcatgccag acagcaccac tgatctgtac aacttgcagg 841 tgtcgcccat gccctccacc tctgaagctg caacagatga ggatgaggaa gggaagttac 901 ctgaggacat catgaagctc tttgaacagt ctgagtggca gccgacgcac gtggatggca 961 agggatactt gctcaatgaa ccaggagccc aactctctac tgtctatgga gacttcagct 1021 gcaaggagga accagagatc gacagccctg gaggggacat cgagataggc atacagcgtg 1081 tcttcacaga gatgaagaat atggaccccg tcatgtggat ggacaccctg ctgggcaact 1141 ctaccaggcc gccctccatt caggctattc cttgtgcacc ataatttggg tccctgaccc 1201 gttcttgccc tcctgagtga gctaggtcca gcatcatggt ggctgtgata caacataaag 1261 ctaaacttcc gtggacccct tgatgtggca aaacataatc ccattgccaa gcagggaagg 1321 gaccaaacca tcctccttgg gtcagtggac tgactcttca gagcttagga ggcagggtct 1381 aagtttttca agctggtcct gactcctagg aagatggatt ggcgttctga ggttagtgtg 1441 aggcagagga cctggacgga agttaccttc tagctctttg aaagcttcat tgcttagaga 1501 gggtctcacc actgggctgg cctgggggat agaccagcgc ccacagaaga gcattgcact 1561 ggccttaggg ctggctccac actgggagac aattgcacta agtcctattc ccaaagaact 1621 gctgcccttc ccaaccgagc cctgggatgg ttctagagcc agtgaaatgt gaaggaaaaa 1681 atggggtcct gtgagggttg tctcccttag cctcagaggg attctgcctc actccctgct 1741 ccagctgtgg ggctcaggaa aaaaaaatgg cactttctct gtggactttg ccacatttct 1801 gatcagaagt gtacactaac atttctccca agtcttggcc tttgcattta tttatatagt 1861 gccttgccct gtgcctgctg tctctcctca ggcctcagca gtcctcagca ggcccaggga 1921 gggggttgtg agcgccttgg cgtgactctg aacattggaa acgccaccta actactaagt 1981 tgtgtctgat ctcgtgtgga tctgtgtaaa tatgtatatt catcttttta taaaaaccta 2041 agttgtttaa aaaaaaaaaa aaaaaaaaaa aaaaaaaa // LOCUS ACCRRSAA 1536 bp ss-rRNA RNA 17-JUL-1990 DEFINITION A.calcoaceticus 16S ribosomal RNA. ACCESSION M34139 KEYWORDS 16S ribosomal RNA. SOURCE A.calcoaceticus (strain 33604) ribosomal RNA. ORGANISM Acinetobacter calcoaceticus Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Neisseriaceae. REFERENCE 1 (bases 1 to 1536) AUTHORS Woese,C.R. JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by C.R.Woese, 09-MAY-1990. Author address: C.R.Woese University of Illinois Dept. Microbiology 131 Burrill Hall 407 S. Goodwin Ave. Urbana, IL 61801 (217) 333-9369 FEATURES from to/span description rRNA 1 1536 16S ribosomal RNA BASE COUNT 387 a 329 c 460 g 330 t 30 others ORIGIN 1 ttaactgaag agtttgatca tggctcagat tgaacgctgg cggcaggctt aacacatgca 61 agtcgagcgg ggaaggttgc ttcggtaact gactagcggc ggacgggtga gtaatgctta 121 ggaatctgcc atttagtggg ggacaacatt ccgaanggaa tgctaatacc gcatacgtcc 181 tacaggagaa agcaggggat ctccggacct tgcgctaaat gatgagccta agtcggatta 241 gctagttggt ggggtaaagg cctaccaagg cgacgatctg tagcgggtct gagaggatga 301 tccgccacac tgggactgag acacggccca gactcctacg ggaggcagca gtggggaata 361 ttggacaatg ggcgcaagcc ngatccagcc atgccgcgtg tgtgaagaag gccttttggt 421 tgtaaagcac tttaagcgag gaggaggctc tcttagttaa tacctaagat gagtggacgt 481 tactcgcaga ataagcaccg gctaactctg tgccagcagc cgcggtaata cagagngtgc 541 gagcgttaat cggatttact gggcgtaaag cgtgcgtagg cggcttttta agtcggatgt 601 gaaatccccg agcttaactt gggaattgca ttcgatactg ggaagctaga gtatgggaga 661 ggatggtaga attccaggtg tagcggtgaa atgcgtagag atctggagga ataccgatgg 721 cgaaggcagc catctggcct aatactgacg ctgaggtacg naagcatggg gagcaaacag 781 gattagatac cctggtagtc catgccgtaa acgatgtcta ctagccgttg gggcctttga 841 ggctttagtg gcgcagctaa cgcgataagt agactgcctg gggagtacgg tcgcaagact 901 aaaactcaaa tgaattgacg ggggcncgca caagcggtgg agcatgtggt ttaattcgat 961 gcaacgcgaa gaaccttacc tggccttgac atactagaaa ctttccagag atggattggt 1021 gccttcggga atctagatac aggtgctgca tggctgtcgt cagctcgtgt cgtgagatgt 1081 tgggttaagt cccgcaacga gcgcaaccct tttccttact tgccagcatt tcggatggga 1141 actttaagga tactgccagt gacaaactgg aggaaggcgg ggacgacgtc aagtcatcat 1201 ggcccttacg gctagggcta cacacgtgct acaatggtcg gtacaaaggg ttgctaccta 1261 gcgataggat gctaatctca aaaagccgat cgtagttcgg attggagtct gcaactcgac 1321 tccatgaagt cggaatcgct agtaatcgcg gatcagaatg ccgcggtgaa tacgttcccg 1381 ggccttgtac acaccgcccg tcacaccatg ggagtttgtt gcaccagaag tagctagcct 1441 aactgcaaag agggcggtta ccacggtgtg gccgatgact agggnnnnnn ngtaacaagn 1501 nnnnnnnnnn ngaacctgnn nnnngatcac ctcctt // LOCUS BDERRSAA 1553 bp ss-rRNA RNA 17-JUL-1990 DEFINITION B.stolpii 16S ribosomal RNA. ACCESSION M34125 KEYWORDS 16S ribosomal RNA. SOURCE B.stolpii (strain uki-2) ribosomal RNA. ORGANISM Bdellovibrio stolpii Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Aerobic/microaerophilic, motile, helical/vibrioid bacteria. REFERENCE 1 (bases 1 to 1553) AUTHORS Woese,C.R. JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by C.R.Woese, 09-MAY-1990. Author address: C.R.Woese University of Illinois Dept Microbiology, 131 131 Burrill Hall 407 S. Goodwin Ave. Urbana, IL 61801 (217) 333-9369 FEATURES from to/span description rRNA 1 1553 16S ribosomal RNA BASE COUNT 381 a 320 c 429 g 316 t 107 others ORIGIN 1 agcatnnaga gtttgatcct ggctcagaac gaacgctggc ggcgtgccta acacatgcaa 61 gtcgaacgtg aaagtccttc gggatgagta aagtggcgca cgggtgagta acacgtaggt 121 gacctgcctt ttagagggga ataaccagaa gaaattttgg ctaatgccgc atacgaagca 181 cggttttaag actgtgcttg aaagaatgcc tctgcatatg ngcattcgct attagatggg 241 cctgcgggac attagctagt tggtggggta aaggcctacc aaggcgacga tgtctatccg 301 gtctgagagg atgatcggac acactggaac tgagacacgg tccagactcc tacgggaggc 361 agcagtgggg aatattgcgc aatgggggaa accctgacgc agcaacgccg cgtgagtgag 421 gaaggacttc ggtctgtaaa gctctgttaa tgtggaaaaa tggcagttgg tctaataggc 481 cnattgtttg atggtacaca tagaggaagc accggctaac ttcgtgccag cagccgcggt 541 aatacgaagg gtgcnagcgt tgttcggatt tattgggcgt aaagcgcgcg taggcggacc 601 tgcaagtcag atgtgaaatc tcggggctca acctcgaaac tgcgtctgaa actacaggtc 661 tagaatctcg gagggggaag gggaatatcg catgtagggg taaaatccgt agatatgcga 721 tggaacacca gaggcgaagg cgccttcctg gacgagtatt gacgctgagg cncnnaagcg 781 tggggatcaa acaggattag ataccctggt agtccacgct gtaaacgatg aacactagat 841 attggaggat ttgacccctt cagtgtcgta gctaacgcgt caagtgttcc gcctgggaag 901 tacggtcgca agactaaaac tcaaaggaat tgnnnnnnnn nngcacaagn nnnngattat 961 gnngtttaat tcgnngcaac gcgcagaacc ttacctaggc ttgaaatcct acgaatccct 1021 tttaaacgag ggagtgctct tcggagaatg tagtgacagg cgctgcatgg ctgtcgtcag 1081 ctcgtgtcgt gagatgttgg gttaagtctc gcaacgagcg caacccccat ttttagttgc 1141 cagcattaag ttgggcactc tagaaagact gcntgggcta accaggagga aggtggggat 1201 gacgtcaagt cctcatggcc cttatgtcta gggctacaca cgtaatacaa tggtcggtac 1261 aaagggatgc gaactcgcga gggggagcca atctcaaaaa accgatctca gtccggattg 1321 gagtctgcaa ctcgactcca tgaagttgga atcgcgagta atcgcggatc agcacgccgc 1381 ggtgaatacg ttcccgggcc ttgtacacac cgcccgtcac accatgggag ttgtttttac 1441 ctgaagnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 1501 nnnnnnngta acaagnnnnn nnnnnnngaa cctgnnnnnn gatcacctcc ttt // LOCUS PLTRRSAA 1525 bp ss-rRNA RNA 17-JUL-1990 DEFINITION P.staleyi 16S ribosomal RNA. ACCESSION M34126 KEYWORDS 16S ribosomal RNA. SOURCE P.staleyi (strain ATCC 27377) ribosomal RNA. ORGANISM Planctomyces staleyi Prokaryota; Bacteria; Eubacteriomycetes; Eubacteriales. REFERENCE 1 (bases 1 to 1525) AUTHORS Woese,C.R. and Oyalzu,H. JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by C.R.Woese, 15-MAY-1990. Author address: C.R.Woese University of Illinois Dept Microbiology 131 Burrill Hall 407 S. Goodwin Ave. Urbana, IL 61801 (217) 333-9369 FEATURES from to/span description rRNA 1 1525 16S ribosomal RNA BASE COUNT 376 a 348 c 482 g 315 t 4 others ORIGIN 1 caattgaaga gtttgatcct ggctcagaat gaacgttggc ggcatggatt aggcatgcaa 61 gtcgtgcgcg atatgtagca atacatggag agcggcgaaa gggagagtaa tacgtaggaa 121 cctaccttcg ggtctgggat agcggcggga aactgccggt aataccagat gatgtttccg 181 aaccaaaggt gtgattccgc ctgaagaggg gcctacgtcg tattagctag ttggtagggt 241 aatggcctac caaggcaaag atgcgtatgg ggtgtgagag catgccccca ctcactggga 301 ctgagacact gcccagacac ctacgggtgg ctgcagtcga gaatcttcgg caatgggcga 361 aagcctgacc gagcgatgcc gcgtgcggga tgaaggcctt cgggttgtaa accgctgtcg 421 taggggatga agtgctaggg ggttctccct ctagtttgag ctgaacctag gaggaagggc 481 cggctaatct cgtgccanna gccgcggtaa tacgagaggc ccaaacgtta ttcggattta 541 ctgggcttaa agagttcgta ggcggtcttg taagtggggt gtgaaatccc tcggctcaac 601 cgaggaactg cgctccaaac tacaagactt gagggggata gaggtaagcg gaactgatgg 661 tggagcggtg aaatgcgttg atatcatcag gaacaccgga ggcgaaggcg gcttactggg 721 tcctttctga cgctgaggaa cgaaagctag gggagcaaac gggattagat accccggtag 781 tcctagccgt aaacgatgag cactggaccg gagctctgca cagggtttcg gtcgtagcga 841 aagtgttaag tgctccgcct ggggagtatg gtcgcaaggc tgaaactcaa aggaattgac 901 gggggctcac acaagcggtg gaggatgtgg cttaattcga ggctacgcga agaaccttat 961 cctagtcttg acatgcttag gaatcttcct gaaagggagg agtgctcgca agagagcctt 1021 tgcacaggtg ctgcatggct gtcgtcagct cgtgtcgtga gatgtcgggt taagtccctt 1081 aacgagcgaa acccttgtcc ttagttacca gcgcgtcatg gcggggactc taaggagact 1141 gccggtgtta aaccggagga aggtggggat gacgtcaagt cctcatggcc tttatgatta 1201 gggctgcaca cgtcctacaa tggtgcacac aaagcgacgc aaactcgtga gagccagcta 1261 atcgcaaaaa atgtacctca gttcggattg caggctgcaa ctcgcctgca tgaagctgga 1321 atcgctagta atcgcgggtc agcataccgc ggtgaatntg ttcctgagcc ttgtacacac 1381 cgcccntcaa gccacgaaag tgggggggac ccaacagcgc tgccgtaacc gcaaggaaca 1441 aggcgcctaa ggtcaactcc gtgattggga ctaagtcgta acaaggtagc cgtaggggaa 1501 cctgcggctg gatcacctcc tttct // LOCUS RDCRRSAA 1478 bp ss-rRNA BCT 17-JUL-1990 DEFINITION R.purpureus 16S ribosomal RNA. ACCESSION M34132 KEYWORDS 16S ribosomal RNA. SOURCE R.purpureus (strain 6770) ribosomal RNA. ORGANISM Rhodocyclus purpureus Prokaryota; Bacteria; Gracilicutes; Anoxyphotobacteria; Purple nonsulfur bacteria. REFERENCE 1 (bases 1 to 1478) AUTHORS Woese,C.R. JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by C.R.Woese, 09-MAY-1990. Author address: C.R.Woese University of Illinois Dept. Microbiology 131 Burrill Hall 407 S. Goodwin Ave. Urbana, IL 61801 (217) 333-9369 FEATURES from to/span description rRNA 1 1478 16S ribosomal RNA BASE COUNT 367 a 345 c 470 g 289 t 7 others ORIGIN 1 tgaactgaag agtttgatcc tggctcagat tgaacgctgg cggcatgcct tacacatgca 61 agtcgaacgg taacgggncc ttcgggcgcc gaacgagtgg cgaacgggtg agtaatgcat 121 cggaacatgc cctgaagtgg gggataacgt agcgaaagtt acgctaatac cgcatattct 181 gtgagcagga aagcagggga ccttcgggcc ttgcgctttg ggagtggccg atgtcggatt 241 agctagttgg tggggtaaaa gcctaccaag gcaacgatcc gtagcgggtc tgagaggatg 301 atccgccaca ctgggactga gacacggccc agactcctac gggaggcagc agtggggaat 361 tttggacaat gggcgaaagc ctgatccagc catgccgcgt gagtgaagaa ggccttcggg 421 ttgtaaagct ctttcggcgg ggaagaaatc gggtttccta atacggaacc cggatgacgg 481 tacccgaaga agaagcaccg gctaactacg tgccagcagc cgcggtaata cgtagggtgc 541 nagcgttaat cggaattact gggcgtaaag cgtgcgcagg cggttgtgta agacagacgt 601 gaaatccccg ggctcaacct gggaactgcg tttgtgactg cacagctaga gtacggcaga 661 ggggggtgga attccacgtg tagcagtgaa atgcgtagag atgtggagga acaccgatgg 721 cgaaggcagc cccctgggcc aatactgacg ctcatgcacg naagcgtggg gagcaaacag 781 gattagatac cctggtagtc cacgccctaa acgatgtcaa ctaggtgttg gtggggttaa 841 acccattagt gccgtagcta acgcgtgaag ttgaccgcct ggggagtacg gcggcaaggt 901 taaaactcaa aggaattgac gggganccgc acaagcggtg gatgatgtgg attaattcga 961 tgcaacgcga aaaaccttac ctacccttga catgtcagga atcctgagga gactcgggag 1021 tgcccgaaag ggnacctgaa cacaggtgct gcatggcngt cgtcagctcg tgtcgtgaga 1081 tgttgggtta agtcccgcaa cgagcgcaac ccttgtcatt aattgccatc attcagttgg 1141 gcactttaat gaaactgccg gtgacaaacc ggaggaaggt ggggatgacg tcaagtcctc 1201 atggccctta tgggtagggc ttcacacgtc atacaatggt cggtccatag ggttgcnaac 1261 ccgcgagggg gagctaatcc cagaaagccg atcgtagtcc ggattgcagt ctgcaactcg 1321 actgcatgaa gtcggaatcg ctagtaatcg cggatcagca tgtcgcggtg aatacgttcc 1381 cgggtcttgt acacaccgcc cgtcacacca tgggagcggg ttctgccaga agtagttagc 1441 ctaaccgcaa ggagggcgat taccacggca gcgttcgt // LOCUS RATSVPIIA 4161 bp ds-DNA ROD 17-JUL-1990 DEFINITION Rat seminal vesicle secretion II protein (SVS II) gene, complete cds. ACCESSION J05443 KEYWORDS seminal vesicle secretion II protein. SOURCE Rat (strain CHARLES RIVER) male seminal vesicle epithelial cell DNA. ORGANISM Rattus rattus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 4161) AUTHORS Harris,S.E., Harris,M.A., Johnson,C.M., Bean,M.F., Dodd,J.G., Matusik,R.J., Carr,S.A. and Crabb,J.W. TITLE Structural characterization of the rat seminal vesicle secretion II protein and gene JOURNAL J. Biol. Chem. (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by S.E.Harris, 25-APR-1990. FEATURES from to/span description pept 2065 2137 seminal vesicle secretion II protein (SVS II) precursor, exon 1 2377 3548 seminal vesicle secretion II protein precursor, exon 2 sigp 2065 2130 seminal vesicle secretion II protein signal peptide matp 2131 3545 seminal vesicle secretion II protein signal 1934 1946 CAAT box signal 2013 2020 TATA box BASE COUNT 1243 a 902 c 822 g 1194 t ORIGIN 1 tttcgatcca atgtgtggat tactcaccaa gtgtctgtct ttctttcttt ctctctttct 61 ctctttcttc ctcccttcct tccttccttc cttccttcct tccttccttc cttccttcct 121 ttctgttcaa ttgctcgttt ctcccttcat ctctcgccag tataccgcac actcaaactt 181 aaattttcat ttcaatgcgt tctcttctgg cacgtgcagc ataattacac tcatgattgt 241 caactccgtg atctgtttgc acaccttacc ccccccccca aggttttatc tgcatttaaa 301 aaaaagggat tatcaagaaa tttattctta attcagaaat gtgatcaaag ttgtcagatc 361 cgttctttac tgcctcctgt tggaaaaaaa aaatatccag ttcctggatt tttctaaaac 421 acagaaaaga gacctgggac aggggtatag gattgagcag gcatggtgag caattttata 481 ctgaatagat tcattgtgac ttaccggtct cctgagggaa ataatcactt ttcccaggta 541 gagagcagcc tagcaagaga tcagagtgca agcataaaac ccatgtgctt tataagtgta 601 tttattttat gcattttctg tttataagga catgagtgga ctttttattt gtcccttcca 661 tacaggacta cctagactat tgggatggga tgactgaaaa tatgttttca agtagacttc 721 cttccggaac taccttcata tggttctgaa ggcaaagtgg aacactgcac gggtgtcctc 781 ttctcccaag aacttggcca tggcgtcgtc gttttgagtc tatgtctgag ccacgaatgc 841 cataacagcc cttcctgtta ctctcacagt ggcacagagc tgtttctaaa caagaaggaa 901 gtcttccatc ttgtgtcagg atgctaatga cgtcaccaat ggcagtaagt gttcaccaca 961 gcccgttgct aaggcaatta tgttatccct cctgtcagag tttcctgtat taaaatatac 1021 tgagtttaat tttatgtcgg attccatgac atacattcag caaggaaacc aacagtatct 1081 tttgttcttt caacagtgat ttcctgtcac catttaactg ttgtctcgcc cccattcttt 1141 aaaatgtctc tgcacctcac cttgcctccc agatacactc ccaaactcat ttccctggac 1201 acacttgaaa tgttgctgct agcaagccac agctaccacg tcttctctgt cagggttcta 1261 gacaactcat ctaaagcagc accaggtctc tcttagaaat cagacatcgg atgtcatggt 1321 catagtatac ctcacagcta ctttggacat tcatgggccc agtattattt tccagggctg 1381 aggtttaact caagagcctc atgctcacat ggctggtggt ctggccacac agctatgact 1441 cgtctccatt tattcttcaa acttttattc ggagctccgt tgtgttctgt tgtctcctgt 1501 gcctttctat atgtgtgact gctcctttgc ctgtaaatga gaagctatgt caaattcaac 1561 gtaaaaaagg caacttcatg ggcttctgtg agatagcatg ctaaacagtg tcagctccac 1621 tacactgtga ccaggaaaat ttgatcaggc cctggttact ctcggagcat aaaagaagaa 1681 aaaaaaatct cttccccgct ctactctgga ttttgtttga aaataaaagg tccaatctgt 1741 ccttataaaa catgcataga ataaatatac tagaaaacac actttgtttg caaagggtat 1801 gtgataaagt cagagggttg ataaagattt gctgaggctt atgacataga aaaggtccct 1861 gacattgcat ccctgtgcaa agtacctggg aacattacca atgtccccaa ctgtgcagag 1921 gggaggaagt tgacatttag agataatttt tttaaaaaag caggcagtgc ttttgtagtg 1981 tcagttatat ctgtaataca tccagctaga gatatataaa tgtgaaagtc agctcagctc 2041 tcagtgaagg tccttcttga caagatgaag tcctctgtct tcattctatc tctgttcctc 2101 cttctggaaa gacaggcagc tgtggttgga cagtatggtg agtagggaga tggtgactag 2161 agggaaagtc actcagggag aatgttttta agggtgctct gggagtagca gatcctttca 2221 taggggaatt tttttttaaa tgagacctaa ttcttctcta ctgaaaacca aaacccttgt 2281 gggaacatca atggttttat gaggaaattt tggaaatgag acttggaagg actgtgcaga 2341 tcatgtaact taaaccttcc tcctctcaat taccaggtgg gacaaaaggt cacttccaga 2401 gcagctcatc agggtttatg cttggtcaga aaggccacct caattttggg ctcaaaggag 2461 gaagtgagga agcagctgaa gaaagcattt tcatgcaatc acaacaccag atgttcggcc 2521 aggatggtgg tgacatggcg cagacaagtg tttcacaaga gcatacaggt gtaaaggggg 2581 ccgcgatttg tcgtaaagga caagtatccc aattgaaatc ccaagaatcc caaataaaat 2641 cctttagaca agtaaaatcc agtggacagc tgaaatctgg aggatcccaa ttaaaatcct 2701 ttggacaagt gaaatccagt gagtcccaat taaaatcctt tggccaagtg aaagccagtg 2761 ggtcccaatt aaaatccttc ggacaagtga aagccagtgg gtcccaatta aaatcctatg 2821 gacaaatgaa atccagtggg tcccaagtga aatcctttgg acaaatgaaa tccagtgggt 2881 cccaagtaaa atcctttgga caaatgaaag ccagtgagtc ccaaataaaa tcctttggac 2941 aaagaaaatc ccaaggtggt caactacaat cctatggcca aatgaaatcc tatgggcaga 3001 cgaaatccct agaatcccag gccaaatcct tcggacaagt aaagtcccaa agtggccaaa 3061 tgaaatcctc ctatggtcag agaaaatcct atggtgaaga gactcaactg aagtctttcg 3121 accaagatgc ccaactaaaa tcctatggtc aacaaaaatc ccaaaaacaa tcctccttta 3181 gccaagtaaa atctcaaagt gcccaactaa agtcctttgg ccaacaaaaa tccctcaaag 3241 ggttttctca acaaactcaa cagaaaggat ttgccatgga tgaagatttg tcacaagtgc 3301 ggaaacaatt tgacgatgat gacctctctg tacaacagaa gtctacccaa cagatgaaaa 3361 cagaggaaga cttatcccaa tttggacaac aacgacaatt tggacaagaa cgctcccaat 3421 cctataaagg atatcttgca caatacagaa agaaattaca ggaacaacaa caacagaaaa 3481 attttaatca ggataacttt tttacaaagg gaggggcagg cctatatcag gctcaactta 3541 agggataaca tattcactga gcaactgaag accaagatca atgtcaaggt atgttccacc 3601 aagtaggaag atattatcca aatttacttg tggtatatag gaatcctgga tccattatgg 3661 attgataccc atttgttact atcagtagaa gtattgttac acacttttag aaggatgaag 3721 aacagaccct ggtaaaatga gtccttgtag agtaaaggca gagtaagcaa gctaagttaa 3781 caattggtcc tgaattacta cattcaggga gcacttttca gtgcttctct gagcacagac 3841 agtttatatt attaaatgtg taccacctat gcaatcatat ttaacatttc atgatggaat 3901 cttatttgtt cttacacttt gacttgataa aaaaaaaaga ttggtttctt gcttatattg 3961 gtataaggtg gtattgcagc tgagctcttt ctctacacca gtgcgtgttc ttgagtcccc 4021 tgggacctct gctttccatc acaatccatg gggttaagga ttagctgctt ttccatcaga 4081 tggaagattt ggttacaaag atctctgcct ggagcagaca ctatattcag ttgtatgtcc 4141 aatggtgacc ctgttgaatt c // LOCUS CREAPCYN 577 bp ss-mRNA PLN 17-JUL-1990 DEFINITION C.reinhardtii apoplastocyanin (PC6-2) mRNA, complete cds. ACCESSION J05524 KEYWORDS apoplastocyanin. SOURCE C.reinhardtii (strain 2137) vegetative cell, cDNA to mRNA, clone PC6-2. ORGANISM Chlamydomonas reinhardtii Eukaryota; Plantae; Thallobionta; Chlorophycota; Chlorophyceae; Volvocales; Chlamydomonadaceae. REFERENCE 1 (bases 1 to 577) AUTHORS Merchant,S., Hill,K., Kim,J.H., Thompson,J., Zaitlin,D. and Bogorad,L. TITLE Isolation and characterization of a complementary DNA clone for an algal pre-apoplastocyanin JOURNAL J. Biol. Chem. (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by K.Hill, 11-MAY-1990. FEATURES from to/span description pept 22 459 apoplastocyanin (PC6-2) precursor sigp 22 162 apoplastocyanin signal peptide matp 163 456 apoplastocyanin BASE COUNT 91 a 189 c 171 g 126 t ORIGIN 1 bp upstream of EcoRI site. 1 gaattccgta tcactttaaa aatgaaggct actctgcgtg cccccgcttc ccgcgccagc 61 gctgtgcgcc ccgtcgccag cctgaaggcc gctgctcagc gcgtggcctc ggtcgccggt 121 gtgtcggttg cctctctggc cctgaccctg gctgcccacg ccgacgccac cgtcaagctg 181 ggcgctgact ctggtgctct ggagttcgtc cccaagaccc tgaccatcaa gtccggcgag 241 accgtgaact tcgtgaacaa cgctggcttc ccccacaaca tcgtcttcga cgaggatgcc 301 atcccctccg gcgtgaacgc tgatgccatc tcccgcgatg actacctgaa cgcccccggc 361 gagacctact cggtgaagct gaccgctgcc ggcgagtacg gctactactg cgagccccac 421 cagggcgctg gcatggtcgg caagatcatt gtccagtaaa ttgctggcgg ctgccttcat 481 tttgtgaccg tgtgtgtttc ggggtgtggg gtcgggggtt tttgcggcgt ccggatggac 541 gcagagagcg tgtagctctg taactttttc ggaattc // LOCUS MHVNSGII 870 bp ss-RNA VRL 17-JUL-1990 DEFINITION Murine hepatitis virus non-structural protein gene-2 (NS2). ACCESSION M34035 KEYWORDS non structural protein. SOURCE Murine hepatitis virus (strain MHV-JHM), cDNA to viral RNA. ORGANISM Murine hepatitis virus A59 Unclassified. REFERENCE 1 (bases 1 to 870) AUTHORS Schwarz,B., Routledge,E. and Siddell,S.G. TITLE The coronavirus MHV 30 kDa non-structural protein NS2 is not essential for virus replication in transformed murine cells JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by S.G.Siddell, 04-MAY-1990. Author address: S.G.Siddell Inst of Virology Univ of Wuerzburg Versbacherstrasse 7 8700 Wuerzburg FEATURES from to/span description pept 40 837 non-structural protein-2 (NS2) BASE COUNT 269 a 147 c 187 g 267 t ORIGIN 1 gcgatagcct agtaaatgtt aaataaatct atacttgtca tggctgcgag aatggccttt 61 gctgacaagc ctaatcattt tataaacttt cctctagccc aatttagtgg ctttatgggt 121 aagtatttaa agcttcagtc tcaacttgtg gaaatgggtt tggactgtaa attacaaaag 181 gtaccacatg ttagtattac cctgcttgac attaaagcag accaatacaa acaggtggaa 241 tttgcaatac aagaaataat agatgatctg gcggcatatg agggagatat tgtctttgac 301 aaccctcata tgcttggcag atgtcttgtt cttgatgtta aaggatttga agagttgcat 361 gaagatattg ttgaaattct ccgcagaagg ggttgcactg cagatcaatc cagacaatgg 421 attccgcact gcactgtggc ccaatttgat gaagaaaaag aaataaaaga aatgcaattc 481 tattttaaat tgcccttcta tctcaagcat aacaacctac ttacggatgc taggcttgag 541 cttgtgaaga taggttcttc caaagtaggt gggttttatt gtagtgaact aagtatttgg 601 tgtggtgaga gactttgtta caagccccca acccccaaat tcagtgatat atttggctat 661 tgctgcatag ataaaatacg tggtgattta gaaataggag acctaccgcc agatgatgag 721 gaagcgtggg ccgagctaag ttaccactat caaagaaaca cctacttctt cagacatgtg 781 cacgataata gtatctattt tcgtaccgta tgtagaatga agggttgtat gtgttgattt 841 gtttttacac tattagtgta ataaacttat // LOCUS HUMFGF1A 3343 bp ss-mRNA PRI 17-JUL-1990 DEFINITION Human fibroblast growth factor (FGF) receptor-1 mRNA, complete cds. ACCESSION M34641 KEYWORDS FGF receptor-1; fibroblast growth factor receptor; transmembrane glycoprotein; tyrosine kinase. SOURCE Human teratocarcinoma cell line Tera-2, clone 13, cDNA to mRNA, clone 1:1. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 3343) AUTHORS Wennstroem,S., Sandstroem,C. and Claesson-Welsh,L. TITLE cDNA cloning and expression of a human FGF receptor which binds acidic and basic FGF JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by L.Claesson-Welsh, 25-MAY-1990. Author address: L.Claesson-Welsh Ludwig Institute for Cancer Research Biomedical Center Box 595 S-751 24 Uppsala SWEDEN FEATURES from to/span description pept 10 2472 FGF receptor-1 precursor sigp 10 72 FGF receptor-1 signal peptide matp 73 2469 FGF receptor-1 BASE COUNT 766 a 957 c 911 g 709 t ORIGIN 1 bp upstream of EcoRI site. 1 gaattcggga tgtggagctg gaagtgcctc ctcttctggg ctgtgctggt cacagccaca 61 ctctgcaccg ctaggccgtc cccgaccttg cctgaacaag cccagccctg gggagcccct 121 gtggaagtgg agtccttcct ggtccacccc ggtgacctgc tgcagcttcg ctgtcggctg 181 cgggacgatg tgcagagcat caactggctg cgggacgggg tgcagctggc ggaaagcaac 241 cgcacccgca tcacagggga ggaggtggag gtgcaggact ccgtgcccgc agactccggc 301 ctctatgctt gcgtaaccag cagcccctcg ggcagtgaca ccacctactt ctccgtcaat 361 gtttcagatg ctctcccctc ctcggaggat gatgatgatg atgatgactc ctcttcagag 421 gagaaagaaa cagataacac caaaccaaac cccgtagctc catattggac atccccagaa 481 aagatggaaa agaaattgca tgcagtgccg gctgccaaga cagtgaagtt caaatgccct 541 tccagtggga ccccaaaccc cacactgcgc tggttgaaaa atagcaaaga attcaaacct 601 gaccacagaa ttggaggcta caaggtccgt tatgccacct ggagcatcat aatggactct 661 gtggtgccct ctgacaaggg caactacacc tgcattgtgg agaatgagta cggcagcatc 721 aaccacacat accagctgga tgtcgtggag cggtcccctc accggcccat cctgcaagca 781 gggttgcccg ccaacaaaac agtggccctg ggtagcaacg tggagttcat gtgtaaggtg 841 tacagtgacc cgcagccgca catccagtgg ctaaagcaca tcgaggtgaa tgggagcaag 901 attggcccag acaacctgcc ttatgtccag atcttgaaga ctgctggagt taataccacc 961 gacaaagaga tggaggtgct tcacttaaga aatgtctcct ttgaggacgc aggggagtat 1021 acgtgcttgg cgggtaactc tatcggactc tcccatcact ctgcatggtt gaccgttctg 1081 gaagccctgg aagagaggcc ggcagtgatg acctcgcccc tgtacctgga gatcatcatc 1141 tattgcacag gggccttcct catctcctgc atggtggggt cggtcatcgt ctacaagatg 1201 aagagtggta ccaagaagag tgacttccac agccagatgg ctgtgcacaa gctggccaag 1261 agcatccctc tgcgcagaca ggtaacagtg tctgctgact ccagtgcatc catgaactct 1321 ggggttcttc tggttcggcc atcacggctc tcctccagtg ggactcccat gctagcaggg 1381 gtctctgagt atgagcttcc cgaagaccct cgctgggagc tgcctcggga cagactggtc 1441 ttaggcaaac ccctgggaga gggctgcttt gggcaggtgg tgttggcaga ggctatcggg 1501 ctggacaagg acaaacccaa ccgtgtgacc aaagtggctg tgaagatgtt gaagtcggac 1561 gcaacagaga aagacttgtc agacctgatc tcagaaatgg agatgatgaa gatgatcggg 1621 aagcataaga atatcatcaa cctgctgggg gcctgcacgc aggatggtcc cttgtatgtc 1681 atcgtggagt atgcctccaa gggcaacctg cgggagtacc tgcaggcccg gaggccccca 1741 gggctggaat actgctacaa ccccagccac aacccagagg agcagctctc ctccaaggac 1801 ctggtgtcct gcgcctacca ggtggcccga ggcatggagt atctggcctc caagaagtgc 1861 atacaccgag acctggcagc caggaatgtc ctggtgacag aggacaatgt gatgaagata 1921 gcagactttg gcctcgcacg ggacattcac cacatcgact actataaaaa gacaaccaac 1981 ggccgactgc ctgtgaagtg gatggcaccc gaggcattat ttgaccggat ctacacccac 2041 cagagtgatg tgtggtcttt cggggtgctc ctgtgggaga tcttcactct gggcggctcc 2101 ccataccccg gtgtgcctgt ggaggaactt ttcaagctgc tgaaggaggg tcaccgcatg 2161 gacaagccca gtaactgcac caacgagctg tacatgatga tgcgggactg ctggcatgca 2221 gtgccctcac agagacccac cttcaagcag ctggtggaag acctggaccg catcgtggcc 2281 ttgacctcca accaggagta cctggacctg tccatgcccc tggaccagta ctcccccagc 2341 tttcccgaca cccggagctc tacgtgctcc tcaggggagg attccgtctt ctctcatgag 2401 ccgctgcccg aggagccctg cctgccccga cacccagccc agcttgccaa tggcggactc 2461 aaacgccgct gactgccacc cacacgccct ccccagactc caccgtcagc tgtaaccctc 2521 acccacagcc cctgctgggc ccaccacctg tccgtccctg tcccctttcc tgctggcagg 2581 agccggctgc ctaccagggg ccttcctgtg tggcctgcct tcaccccact cagctcacct 2641 ctccctccac ctcctctcca cctgctggtg agaggtgcaa agaggcagat ctttgctgcc 2701 agccacttca tcccctccca gatgttggac caacacccct ccctgccaca gcatcgcctg 2761 gagggcaggg agtgggagcc aatgaacagg catgcaagtg agagcttcct gagctttctc 2821 tgtcggtttg gtctgttttg ccttcaccca taagcccctc gcactctggt ggcaggtgcc 2881 ttgtcctcag ggctacagca gtagggaggt cagtgcttcg tgcctcgatt gaaggtgacc 2941 tctgccccag ataggtggtg cagtggctta ttaattccga tactagtttg ctttgctgac 3001 caaatgcctg gtaccagagg atggtgaggc gaaggccagg ttgggggcag tgttgtggcc 3061 ctggggccag ccccaaactg ggggctctgt atatagctat gaagaaaaca caaagtgtat 3121 aaatctgagt atatatttac atgtcttttt aaaagggtcg ttaccagaga tttacccatc 3181 gggtaagatg ctcctggtgg ctgggaggca tcagttgcta tatattaaaa acaaaaaaga 3241 aaaaaaagga aaatgttttt aaaaaggtca tatatttttt gctacttttg ctgttttatt 3301 tttttaaatt atgttctaaa ctcgtgccgc tcgtgccgaa ttc // LOCUS ECOFADA 1594 bp ds-DNA BCT 17-JUL-1990 DEFINITION E.coli 3-ketoacyl-coenzyme A thiolase (fadA) DNA. ACCESSION J05498 KEYWORDS 3-ketoacyl-coenzyme A thiolase. SOURCE E.coli (strain K12) DNA, clone pK52. ORGANISM Escherichia coli Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Facultatively anaerobic rods; Enterobacteriaceae. REFERENCE 1 (bases 1 to 1594) AUTHORS Yang,S.-Y., Yang,X.-Y.H., Healy-Louie,G., Schulz,H. and Elzinga,M. TITLE Nucleotide sequence of the fadA gene: Primary structure of 3- ketoacyl-coenzyme A thiolase from Escherichia coli and the structural organization of the fadAB operon JOURNAL J. Biol. Chem. 265, 10424-10429 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by S.-Y.Yang, 11-MAY-1990. FEATURES from to/span description pept 110 1276 3-ketoacyl-coenzyme A thiolase (fadA) precursor (E.C. 2.3.1.16) sigp 110 139 3-ketoacyl-coenzyme A thiolase signal peptide matp 140 1273 3-ketoacyl-coenzyme A thiolase BASE COUNT 329 a 425 c 489 g 351 t ORIGIN 86 min on K12 map. 1 tgaagtgccg gaaggtctgc gtaataaagc gcgtcataac gaaccgtact atcctccggt 61 tgagccagcc cgtccggttg gcgacctgaa aacggcttaa ggagtcacaa tggaacaggt 121 tgtcattgtc gatgcaattc gcaccccgat gggccgttcg aagggcggtg cttttcgtaa 181 cgtgcgtgca gaagatctct ccgctcattt aatgcgtagc ttgctggcgc gtaacccggc 241 gctggaagcg gcggccctcg acgatattta ctggggttgt gtgcagcaga cgctggagca 301 gggttttaat atcgcccgta acgcggcgct gctggcagaa gtaccacact ctgtcccggc 361 ggttaccgtt aatcgcttgt gtggttcatc catgcaggca ctgcatgacg cagcacgaat 421 gatcatgact ggcgatgcgc aggcatgtct ggttggcggc gtggagcata tgggccatgt 481 gccgatgagt cacggcgtcg attttcaccc cggcctgagc cgcaatgtcg ccaaagcggc 541 gggcatgatg ggcttaacgg cagaaatgct ggcgcgtatg cacggtatca gccgtgaaat 601 gcaggatgcc tttgccgcgc ggtcacacgc ccgcgcctgg gccgccacgc agtcggccgc 661 atttaaaaat gaaatcatcc cgaccggtgg tcacgatgcc gacggcgtcc tgaagcagtt 721 taattacgac gaagtgattc gcccggaaac caccgtggaa gccctcgcca cgctgcgtcc 781 ggcgtttgat ccagtaaacg gtatggtaac ggcgggcaca tcttctgcac tttccgatgg 841 cgcagctgcc atgctggtga tgagtgaaag ccgcgcccat gaattaggtc ttaagccgcg 901 cgctcgtgtg cgttcgatgg cggtcgttgg ttgtgaccca tcgattatgg gttacggccc 961 ggttccggcc tcgaaactgg cgctgaaaaa agcggggctt tctgccagcg atatcggcgt 1021 gtttgaaatg aacgaagcct ttgccgcgca gatcctgcca tgtattaaag atctgggact 1081 aattgagcag attgacgaga agatcaacct caacggtggc gcgatcgcgc tgggtcatcc 1141 gctgggttgt tccggtgcgc gtatcagcac cacgctgctg aatctgatgg aacgcaaaga 1201 cgttcagttt ggtctggcgg acggatgtgt atcgggtctg ggtcagggta ttgcgacggt 1261 gtttgagcgg gtttaaccaa cttgccggat tgcggccgtt gaacggcctt atccggtcta 1321 ccgatccggc accaatgtag gcctgataag acgcgccagc gtcgcatcag gcatcatgca 1381 ccaattgccg gatgcggcgt aaacgcctta tccggctacg gttcggacta tttgtaggca 1441 cgataagacg cgacagcgtc gcatcaggca tcatgcacca attgccggat gcggcgtaaa 1501 cgccttatcc ggcctacggt tcggactatt tgtaggcacg ataagacgcg acagcgtcgc 1561 atcgggcatg gtttgccaga tacgtcatag atct // LOCUS HSSCLS 1645 bp ds-DNA VRL 17-JUL-1990 DEFINITION Herpesvirus saimiri collagen-like protein mRNA, complete cds. ACCESSION M31964 KEYWORDS collagen-like protein. SOURCE Herpesvirus saimiri (strain 484-77), cDNA to mRNA. ORGANISM Herpesvirus saimiri Viridae; ds-DNA enveloped viruses; Herpesviridae; Gammaherpesviridae. REFERENCE 1 (bases 1 to 1645) AUTHORS Geck,P., Whitaker,S.A., Medveczky,M.M. and Medveczky,P.G. TITLE Expression of collagen-like sequences by a tumor virus, herpesvirus saimiri JOURNAL J. Virol. 64, 3509-3515 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence [1] kindly submitted by P. Medveczky, 07-FEB-1990. FEATURES from to/span description pept 101 400 collagen-like protein pept 580 1224 ORF2 mRNA 56 1248 collagen-like protein mRNA BASE COUNT 545 a 363 c 393 g 344 t ORIGIN Map position 98-100. 1 gagtttccaa aatgtagtaa gctaacatat aaatatcagc agaaaataga gtaacacaag 61 aaacagctaa caagagcaac acaaacacgc ttgaaagaaa atggcaagcg agccaaacct 121 aagatatcca actgaagaaa caggagatcg aggacctcaa ggacctccag gacctccagg 181 accccaagga cctccaggac cccaaggacc tccaggaccc caaggacctc caggacccca 241 aggacctcca ggaccccaag gacctccagg accccaagga cctccaggac ccccaggacc 301 tccaggacct tcagactcca ggattgtttg taacaaactt attgcttgga atcataattt 361 tactcttatt aattatagtt gcgatcttac tggtgtctaa attagtagta aactaagaac 421 agagcaagca gtatagaaga gagtaattat gctagcttaa tttcatctag tagtcagttg 481 tgtagaactt tttattagta aatctagaag tgttaagaaa aaggtataga aagtagtgac 541 ctttattagc tgcatcttca tatctctaaa ttagtaagta tggaaaacca gagagaagaa 601 atagaactaa cagaaattcc agagactgag aaaaaaagaa cagctgaaga aaaattatta 661 tcatgctctg ctgaaactgc tgaagaaaag gtatcattat gctctgaaga aactacagac 721 acgtcaagca gttctagcag tgagcaaaca cctgctccta tagaagtaaa cgttaacatt 781 caaacgagca catacttgcc tcagaacgcg gcaacaaact tgaacagttt gtacacaagt 841 tttgaagacg caagggcaca aggaaaagga ctggtgcgac acaattcgga cgatttaaaa 901 agttttttag aaaaataccc accagattat agaaaaccta aacgggactt gtcagaatca 961 tgggatccag gtatgccgaa accaactttg cctcctaggc ctgcgaacct aggtgcaagc 1021 caggcaagta ctgtaagacg gcatgtaaga gaacaaaatt ttaaacagct tagagaacgt 1081 aaagctaatg aagggaaaat agtaaaagat ttgaaaagac ttgagtacaa agttaatata 1141 attctttgtt tagtagttgt tatattagct ataatattat tactaactgg gctaagtata 1201 ctgttcataa gaataaaaag ttaatttaac aaagaattta acaaatatgt gtgtgcatgg 1261 tcattaaaag aagcgggccc caagcagtgg gcgctgggca gctggaaaaa tgtcctgggc 1321 gtggctagag cgagctcggg gggcaaagcg agcagcgagc ggctgcggag ggagggggaa 1381 gggattgagc tgccccgccc ctctgggctg agattccaga gcccctgttg tgtccccagg 1441 ccaccgcagg cccggggctg tgcccctgtg cttggagcca gtccccagag ccctggctag 1501 gccgaggcgc tgctagagta agggtcccac aactgttcag gcccccccag ggggagagcc 1561 cagcgcagcg ctctgccggg tagggggggg gtagctggct ccccccccac ctctgcgtga 1621 agcagagtgc atgcagcccc tgcag // LOCUS MUSAGP1A 777 bp ss-mRNA ROD 17-JUL-1990 DEFINITION M.caroli alpha-1-acid glycoprotein (AGP) mRNA. ACCESSION M34648 J05488 KEYWORDS acute phase serum protein; alpha-1-acid glycoprotein; non-specific immunosuppressant; transmembrane protein; tyrosine kinase. SOURCE M.caroli adult hepatocyte, cDNA to mRNA, clone lambda-1A. ORGANISM Mus caroli Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 777) AUTHORS Prowse,K.R. and Baumann,H. TITLE Molecular characterization and acute-phase expression of the multiple Mus caroli alpha-1-acid glycoprotein (AGP) genes: Differences in glucocorticoid stimulation and regulatory elements between the rat and mouse AGP genes JOURNAL J. Biol. Chem. 265, 10201-10209 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by K.R.Prowse, 18-MAY-1990. FEATURES from to/span description pept 39 662 alpha-1-acid glycoprotein (AGP) precursor sigp 39 92 alpha-1-acid glycoprotein (AGP) signal peptide matp 93 659 alpha-1-acid glycoprotein (AGP) signal 747 752 AGP poly-A signal BASE COUNT 225 a 189 c 186 g 177 t ORIGIN Chromosome 4. 1 atcttttcca agccctggtg cctctgagtg tcctaagcat ggcactgcac atgattcttg 61 tcatgttgag cctcctgccg ctgttggaag ctcagaaccc agaacatgtc aacatcacta 121 taggcgaacc tatcaccaat gagaccctga gctggctctc tgacaaatgg tttttcattg 181 gtgcggctgt cctaaaccct gattaccggc aggaaattca aaagatgcag atggtatttt 241 ttaacattac ccccaacttg ataaatgaca cgatggagct tcgagagtat cacaccatag 301 atgaccactg tgtctataac tctacacatc taggaatcca gagagagaat gggaccctct 361 ccaagtatgt aggaggagta aaaatctttg cagacctgat agtgctgagg aaacatgggg 421 ccttcatgct tgcctttgac ctgaaggatg agaagaaacg gggactgtcc ctcaatgcca 481 aaaggccaga tatcaccccg gagctgcggg aagtattcca gaaggctgtc aaacacgtgg 541 gcatggatga atcagaaatc atatttgtcg actggaaaaa ggataagtgt ggtcagcagg 601 agaagaagca gcttgagctg gagaaggaga ccaagaaaga tcctgaggaa ggccaggcat 661 gaactcagct ctctgaactc catgggctgt ccccaggctc accaaacccc accccctcct 721 gtgcactttg attctgtctc tgcaacaata aaggtttgct gacacagtca aaaaccg // LOCUS MUSAGP1X 778 bp ss-mRNA ROD 17-JUL-1990 DEFINITION M.caroli alpha-1-acid glycoprotein (AGP) mRNA. ACCESSION M34647 J05488 KEYWORDS acute phase serum protein; alpha-1-acid glycoprotein; non-specific immunosuppressant; transmembrane protein; tyrosine kinase. SOURCE M.caroli adult hepatocyte, cDNA to mRNA, clone lambda-1X. ORGANISM Mus caroli Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 778) AUTHORS Prowse,K.R. and Baumann,H. TITLE Molecular characterization and acute-phase expression of the multiple Mus caroli alpha-1-acid glycoprotein (AGP) genes: Differences in glucocorticoid stimulation and regulatory elements between the rat and mouse AGP genes JOURNAL J. Biol. Chem. 265, 10201-10209 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by K.R.Prowse, 18-MAY-1990. FEATURES from to/span description pept 39 662 alpha-1-acid glycoprotein (AGP) precursor sigp 39 92 alpha-1-acid glycoprotein (AGP) signal peptide matp 93 659 alpha-1-acid glycoprotein (AGP) signal 747 752 AGP poly-A signal BASE COUNT 224 a 191 c 187 g 176 t ORIGIN Chromosome 4. 1 atctcttcca ggccctggtg cctctgagtg tcctaagcat ggcactgcac atgattcttg 61 tcatgttgag cctcctgccg ctgttggaag ctcagaaccc agaacatgtc aacatcacta 121 taggcgaacc tatcaccaat gagaccctag gctggctatc tgacaaatgg tttttcattg 181 gtgcggctgt cctaaaccct gattaccggc aggaaattca aaagatgcag atggtatttt 241 ttaacattac ccccaacttg ataaatgaca cgatggagct tcgagagtat cacaccatag 301 atgaccactg tgtctataac tccacacatc taggaatcca gagagagaat gggaccctct 361 ccaagtatgt aggaggagta aaaatctttg cagacctgat agtgctgagg aaacatgggg 421 ccttcatgct tgcctttgac ctgaaggatg agaagaaacg gggactgtcc ctcaatgcca 481 aaaggccaga tatcaccccg gagctgcggg aagtattcca gaaggctgtc aaacacgtgg 541 gcatggatga atcagaaatc atatttgtcg actggaaaaa ggataagtgt ggtcagcagg 601 agaagaagca gcttgagctg gagaaggaga ccaagaaaga tcctgaggaa ggccaggcat 661 gaactcagct ctctgaactc catgggctgt ccccaggctc accaaacccc accccctcct 721 gtgcactttg attctgtctc tgcaacaata aaggtttgct gacactgtca aaaacccg // LOCUS MUSAGP2AX 765 bp ss-mRNA ROD 17-JUL-1990 DEFINITION M.caroli alpha-1-acid glycoprotein (AGP) mRNA. ACCESSION M34649 J05488 KEYWORDS acute phase serum protein; alpha-1-acid glycoprotein; non-specific immunosuppressant; transmembrane protein; tyrosine kinase. SOURCE M.caroli adult hepatocyte, cDNA to mRNA, clones lambda-[2A,2X]. ORGANISM Mus caroli Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 765) AUTHORS Prowse,K.R. and Baumann,H. TITLE Molecular characterization and acute-phase expression of the multiple Mus caroli alpha-1-acid glycoprotein (AGP) genes: Differences in glucocorticoid stimulation and regulatory elements between the rat and mouse AGP genes JOURNAL J. Biol. Chem. 265, 10201-10209 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by K.R.Prowse, 18-MAY-1990. FEATURES from to/span description pept 29 652 alpha-1-acid glycoprotein (AGP) precursor sigp 29 82 alpha-1-acid glycoprotein (AGP) signal peptide matp 83 649 alpha-1-acid glycoprotein (AGP) signal 735 740 AGP poly-A signal BASE COUNT 226 a 185 c 185 g 169 t ORIGIN Chromosome 4. 1 gggcctggtg cctctgagtg ccctcagcat ggcactgcac acggttctta tcatgttgag 61 cctcctgccg atgttggaag ctcagaaccc agaacatgcc aacatcacta taggcgaacc 121 tatcaccaat gagaccctag gctggctctc tgacaaatgg tttttcatgg gcgcagcttt 181 cagaaaactc gagtacaggc aggcaattca aatgatgcag actgaatttt tttaccttac 241 caccaacttg ataaacgaca caatagagct tcgggagtct caaacaatag gtgaccagtg 301 tgtctataac tccacccatc taggattcca gagagaaaat gggaccttct ccaagtatga 361 aggaggagta gaaacctttg cccacctgat agtgctaagg aaacatgggg ccttcatgct 421 tgcctttgat ctcaatgatg agaagaaacg gggactgtcc ctctatgcca aaaggccaga 481 tatgaccctg gagctacggg aagtattcca gaaggctgtc aaacacgtgg gcatggatga 541 atcagaaatc atatttgtcg actggaaaaa ggataagtgt ggtcagcagg agaagaagca 601 gcttgagttg gggaaggaga ccaagaaaga tcctgaggaa ggccaggcat gaactcagct 661 ctgaactcca tgggctatcc ccaggctcac caaaccccac cccctcctgt gcactttgat 721 tctgtctctg caacaataaa gatttgctga aagagtcaaa aaccg // LOCUS MUSAGP3X 766 bp ss-mRNA ROD 17-JUL-1990 DEFINITION M.caroli alpha-1-acid glycoprotein (AGP) mRNA. ACCESSION M34646 J05488 KEYWORDS acute phase serum protein; alpha-1-acid glycoprotein; non-specific immunosuppressant; transmembrane protein; tyrosine kinase. SOURCE M.caroli adult hepatocyte, cDNA to mRNA, clone lambda-3X. ORGANISM Mus caroli Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 766) AUTHORS Prowse,K.R. and Baumann,H. TITLE Molecular characterization and acute-phase expression of the multiple Mus caroli alpha-1-acid glycoprotein (AGP) genes: Differences in glucocorticoid stimulation and regulatory elements between the rat and mouse AGP genes JOURNAL J. Biol. Chem. 265, 10201-10209 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by K.R.Prowse, 18-MAY-1990. FEATURES from to/span description pept 31 654 alpha-1-acid glycoprotein (AGP) precursor sigp 31 84 alpha-1-acid glycoprotein (AGP) signal peptide matp 85 651 alpha-1-acid glycoprotein (AGP) signal 737 742 AGP poly-A signal BASE COUNT 228 a 186 c 185 g 167 t ORIGIN Chromosome 4. 1 cggggcctgg tgcctctgag tgccctcagc atggcactgc acacggttct tatcatgttg 61 agcctcctgc cgatgttgga agctcagaac ccagaacatg ccaacatcac tataggcgaa 121 cctatcacca atgagaccct aggctggctc tctgacaaat ggtttttcat gggcgcacgt 181 ttcagaaaac tcgagtacag gcaggcaatt caaatgatgc agactgaatt tttttacctt 241 accaccaact tgataaacga cacaatagag cttcgggagt ctcaaacaat aggtgaccag 301 tgtgtctata actccaccca tctaggattc cagagagaaa atgggacctt ctccaagtat 361 gaaggaggag tagaaacctt tgcccacctg atagtgctaa ggaaacatgg ggccttcatg 421 cttgcctttg acctcaatga tgagaagaaa cggggactgt ccctctatgc caaaaggcca 481 gatatgaccc tggagctacg ggaagtattc cagaaggctg tcaaacacgt gggcatggat 541 gaatcagaaa tcatatttgt cgactggaaa aaggataagt gtggtcagca ggagaagaag 601 cagcttgagt tggggaagga gaccaagaaa gatcctgagg aaggccaggc atgaactcag 661 ctctgaactc catgggctat ccccaggctc accaaacccc accccctcct gtgcactttg 721 attctgtctc tgcaacaata aagatttgct gaaagaaaaa aaaccg // LOCUS MUSAGP7A 1348 bp ds-DNA ROD 17-JUL-1990 DEFINITION M.caroli alpha-1-acid glycoprotein (AGP) gene 7, 5' flank and exon 1. ACCESSION M34645 J05488 KEYWORDS alpha-1-acid glycoprotein; transmembrane protein; tyrosine kinase. SOURCE M.caroli adult hepatocyte DNA, clones lambda-[11B,27A]. ORGANISM Mus caroli Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 1348) AUTHORS Prowse,K.R. and Baumann,H. TITLE Molecular characterization and acute-phase expression of the multiple Mus caroli alpha-1-acid glycoprotein (AGP) genes: Differences in glucocorticoid stimulation and regulatory elements between the rat and mouse AGP genes JOURNAL J. Biol. Chem. 265, 10201-10209 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by K.R.Prowse, 18-MAY-1990. FEATURES from to/span description pept 1232 / 1348 alpha-1-acid glycoprotein (AGP), exon 1 mRNA 1194 > 1348 AGP mRNA signal 1161 1166 TATA box BASE COUNT 312 a 316 c 356 g 364 t ORIGIN Chromosome 4. 1 gaattcacac aggccttgct catctggcct ctgcaggcaa aggacaggag aagcagggga 61 ggcacatgtg agcagaagcc tgggcagact ctggaatggt atgcagcaag taccctgggg 121 agaccttagc tggatgtcca cttccctgcc tgcacccttg cccttagaat ccacaggctc 181 tggatacaca gcacagccac aggaagcttt ggggatttct tttccctatc ttaagtttat 241 tgttttcttc atttggttgt ataacaaaaa gagattttta aatttgctaa tgggtgcaag 301 tgtgagtgtg tgtgtgtgtg tgtgtgtgtt tctgtgtgtg tgtgtgtgtg tgtgtgtgtg 361 tgtgtgtgtg tgtgtgtgtg agagagagag agagagagga agggacagag tgaaatagat 421 catgacactg tgagcatgtt ctaacacgtt tgctgtattg atgttttgaa gcagctcact 481 gtgatctcaa gctcccagca atgtctctgt gcctcagcgg gagtcaccag gcctgtccct 541 gcaatacttc tagagattgt tgattatgta gacaaagatc aggtaggaac tagaaccaga 601 tattgactgg atgctgggca cagaccctgg gttgggcata cactgtccta ggcagatcct 661 ttcctgctgt agatactggg agctttgctg aactcgatgg tcaactcaga ttcacccctt 721 tttgggcact tggctgcctt caggctgggt aggagtccgc aggaacatca cactcctttg 781 aaaaccatcc atctttattc ttggccctca acttgagccc ctaagtgtct tctatgttca 841 ctatgaaccc tgacctggga ccccttctta tcacgcttcg ggggggtgga gttgatgtag 901 tgtaggtttc actcctgtta gacagcttca tgggataaga gagaaagggg accgctgtct 961 gggacctaag tatcatcagg ctagcccttt acctaccttg accatgaatc agccactctg 1021 gtgtaggcag aagccgactg agggagctgc acaaacctgg cttgagggaa catttggggc 1081 aagacattta tcaagtgctg gtgactttgt ggcactgctc taacgccctg ctgcagtcac 1141 atgccctccc cacatcctgt tataaaagtc actgcaccct ccaggcacca gttatctctt 1201 cccagcactg gtgcctctga gtgccgtcag catggagcta cacacagttc ttgtgatgtt 1261 gagcctcctg ctgctgttgg aagctcagaa cccagaacat gccaacatca acataggcaa 1321 acctatcaac aatgagaccc tgagctgg // LOCUS MUSAGP8A 2094 bp ds-DNA ROD 17-JUL-1990 DEFINITION M.caroli alpha-1-acid glycoprotein (AGP) gene 8, 5' flank and exon 1. ACCESSION M34644 J05488 KEYWORDS alpha-1-acid glycoprotein; transmembrane protein; tyrosine kinase. SOURCE M.caroli adult hepatocyte DNA, clones lambda-[15A,11B]. ORGANISM Mus caroli Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 2094) AUTHORS Prowse,K.R. and Baumann,H. TITLE Molecular characterization and acute-phase expression of the multiple Mus caroli alpha-1-acid glycoprotein (AGP) genes: Differences in glucocorticoid stimulation and regulatory elements between the rat and mouse AGP genes JOURNAL J. Biol. Chem. 265, 10201-10209 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by K.R.Prowse, 18-MAY-1990. FEATURES from to/span description pept 1978 / 2094 alpha-1-acid glycoprotein (AGP), exon 1 mRNA 1940 > 2094 AGP mRNA signal 1907 1912 TATA box BASE COUNT 554 a 502 c 464 g 574 t ORIGIN Chromosome 4. 1 gaattcacac aggccttgct catctggcct ctgcaggcaa aggacaggag aagcagggga 61 ggcacatgtg agcagaagcc tgggcagact ctggaatggt atgcagcaag taccctgggg 121 agaccttagc tggtgtccac tccctgcctg cacccttgcc cttagaatcc acaggctctg 181 gatgcacagc acagccacag gaagctttgg agatatctaa tccgtttctt aagtttatta 241 ttttccatca tttgctttta taacaaaaaa tagatatttt atatctgcca gtgtgtgcaa 301 atatgtttat gtgcacgtgt acgtgtgcgc ctgtgtgtaa gagacataac agtgacaaag 361 aggcacaaac agagagatta tgacttctgg gcactgctaa cttgggacag aacatatatc 421 attggtgctt ttgggctcct catggttatg ctgtttctcc tctgggaaga gagaaacttt 481 cttaagactg tataaataca acaaaacaca tatgttttgg gtctccagct cagcaacttc 541 tcctaggccc tgtacagccc actccatggt cagggggctc tcaggaccat ctgtggcccc 601 ttccttttgc cctaggtggt cactgtcact gacctgatta atatcattac aattagtctg 661 atcctctaga acgccctgta aacaggctca cacaacccca cctctgggcg atctgctggc 721 aataatgttg tcgagctgca tttgtatggt caaatctatc agtctttcat ttctctctgt 781 tgctaagcaa caccatgctt gcatgaaatg tcacaaccca tttctcattt tcctggtggt 841 aggcacttaa gagcttggag tttgtggcta ctttgaataa tgccattaga acatttacac 901 acatgatact ggaggtgact catgttttgt tttcctttct gttggtagac taatttttaa 961 aggcaatatg ttgctctgac acttataaag aggggaagca taaaatacaa aatgccacag 1021 gataaaagaa ggttctcgtg aaattagaac acagtaagtg aatgacaatt atttatcatt 1081 aaaacaaaaa ccctgaatac acagaaacat gagaaattgt ggaggacaca ggaagtagat 1141 tactcaactg tgagcgtgtt ctcatatgtt tggtgctttt gatgttttga aagagctcac 1201 ggtgatctca agctcctagc aatgactccg agcctcagtg agagtcacca ggcctgtccc 1261 tagcaataca tttcgagagt gttgattatg tagacaaaaa tcaggctata tctatcaagc 1321 aggaaccaga accagatatt gactggttgc tgtgcacaga ccctggattg ggcacacact 1381 attctagaca gatcctttcc tgctgtagat acagggagct ttgctgaact acattttcaa 1441 ctcagattca tccctctttt ttgggcattt ggctgcctct agcgtgcgta ggggtcccca 1501 gaaacatcac actcctttgg aaactaatcc atctttgtcc ttggccctta acttgagccc 1561 ctaaatgact tctaagttca ctatgaaccc tgaccaggac cccttcatag tccccttggt 1621 gataaataga ataggtctca ctcctgctag gcagcttcat gggataagac agatatggga 1681 ccatgactgg gatctaagta ttatcaggct agcccagtat ccaccttgac catgaatcag 1741 ccactctggt gtaggtcagg agcctgtgtc ggggccggct ggctgaggga gctgcacaaa 1801 gctggcttga gggaacattt tgcgcaagac atttcccaag tgctggggag attgtgccag 1861 ggctctagag gccctgctgc agtcccatgc cctccccaca ttctgttata aaagccactg 1921 caccctccat ccaccaatta tctcttccaa gccctggtgc ctctgagtgt cctaagcatg 1981 gcactgcaca tgattcttgt catgttgagc ctcctgccgc tgttggaagc tcagaaccca 2041 gaacatgtca acatcactat aggcgaacct atcaccaatg agaccctagg ctgg // LOCUS MUSMK1 933 bp ss-mRNA ROD 17-JUL-1990 DEFINITION Mouse retinoic acid-responsive protein (MK-1) mRNA, complete cds. ACCESSION M19662 M34326 J05473 KEYWORDS DNA-binding protein; regulatory protein; retanoic acid-responsive protein; secreted protein. SOURCE Mouse (strain 129/Sv) HM-1 embryonal carcinoma, cDNA to mRNA, clone pO1-2. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 33 to 932) AUTHORS Kadomatsu,K., Tomomura,M. and Muramatsu,T. TITLE cDNA cloning and sequencing of a new gene intensely expressed in early differentiation stages of embryonal carcinoma cells and in mid-gestation period of mouse embryogenesis JOURNAL Biochem. Biophys. Res. Commun. 151, 1312-1318 (1988) STANDARD full staff_entry REFERENCE 2 (bases 1 to 933) AUTHORS Tomomura,M., Kadomatsu,K., Matsubara,S. and Muramatsu,T. TITLE A retinoic acid responsive gene, MK, found in the teratocarcinoma system: Heterogeneity of the transcript and the nature of the translation JOURNAL J. Biol. Chem. 265, 10765-10770 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by K.Kadomatsu, 08-JUL-1988. Draft entry and computer-readable sequence for [2] kindly submitted by M.Tomomura and T.Muramatsu, 16-MAY-1990. The expression of MK-1 mRNA increases in the early stages of retinoic acid-induced differentiation of embryonal carcinoma cells. The putative peptide product may act as a DNA-binding protein regulating gene expression. FEATURES from to/span description pept 249 671 retanoic acid-responsive protein (MK-1) precursor sigp 249 314 retanoic acid-responsive protein signal peptide matp 315 668 retanoic acid-responsive protein mRNA < 1 910 MK-1 mRNA BASE COUNT 216 a 272 c 270 g 175 t ORIGIN 1 gagtgagcgc acgcagtggc tgtggcccca gtcccttcag gcggctgctc tgccaccaag 61 ggggctgagg tgggggtggg ggtacgctga gacatcggtt ccaagtcctc cctccgtctc 121 ccccttgtcg gtccgacgtt ttgggcctgg aaagtgggac aagtcagtca agggtgggag 181 gtccttcccg cggttcctag cggagaagag actaggcgag aaactctaac ccaggtttta 241 cccctaggat gcagcaccga ggcttcttcc ttctcgccct tcttgccctc ttggtggtca 301 cgtccgcggt ggccaaaaaa aaagagaagg tgaagaaggg cagcgagtgt tcggagtgga 361 cctgggggcc ctgcaccccc agcagcaagg actgcggcat gggcttccgc gagggtacct 421 gtggggccca gacccagcgc gtccattgca aggtgccctg caactggaag aaggaatttg 481 gagccgactg caaatacaag tttgagagct ggggggcgtg tgatgggagc actggcacca 541 aagcccgcca agggaccctg aagaaggcgc ggtacaatgc ccagtgccag gagaccatcc 601 gcgtgactaa gccctgcacc tccaagacca agtcaaagac caaagccaag aaaggaaaag 661 gaaaggacta agtcaggagg ccagagagcc tccggcctcg cctggagcct gaacggagcc 721 ctcctctccc acaggcccaa gatataaccc accagtgcct tttgtcttcc tgtcagctct 781 gtcaatcacg cctgtcctct cacgcccaca ccaagtgccc aaagtgggga gggacaagag 841 attctggaaa gtgagcctcc ccataccctc ttttgttctc cccaccctga tacttgttat 901 taagaaatga ataaaataaa ctcacttttt tcc // LOCUS MUSMK2 730 bp ss-mRNA ROD 17-JUL-1990 DEFINITION Mouse retinoic acid-responsive protein (MK-2) mRNA, complete cds. ACCESSION M34327 J05473 KEYWORDS DNA-binding protein; regulatory protein; retanoic acid-responsive protein; secreted protein. SOURCE Mouse HM-1 embryonal carcinoma, cDNA to mRNA, clone pB2-25. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 730) AUTHORS Tomomura,M., Kadomatsu,K., Matsubara,S. and Muramatsu,T. TITLE A retinoic acid responsive gene, MK, found in the teratocarcinoma system: Heterogeneity of the transcript and the nature of the translation JOURNAL J. Biol. Chem. 265, 10765-10770 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by M.Tomomura and T.Muramatsu, 16-MAY-1990. FEATURES from to/span description pept 44 466 retanoic acid-responsive protein (MK-2) precursor sigp 44 109 retanoic acid-responsive protein signal peptide matp 110 463 retanoic acid-responsive protein mRNA < 1 705 MK-2 mRNA BASE COUNT 186 a 211 c 206 g 127 t ORIGIN 1 caggccggag cgggagggag cgaagcatcg agcagtgagc gagatgcagc accgaggctt 61 cttccttctc gcccttcttg ccctcttggt ggtcacgtcc gcggtggcca aaaaaaaaga 121 gaaggtgaag aagggcagcg agtgttcgga gtggacctgg gggccctgca cccccagcag 181 caaggactgc ggcatgggct tccgcgaggg tacctgtggg gcccagaccc agcgcgtcca 241 ttgcaaggtg ccctgcaact ggaagaagga atttggagcc gactgcaaat acaagtttga 301 gagctggggg gcgtgtgatg ggagcactgg caccaaagcc cgccaaggga ccctgaagaa 361 ggcgcggtac aatgcccagt gccaggagac catccgcgtg actaagccct gcacctccaa 421 gaccaagtca aagaccaaag ccaagaaagg aaaaggaaag gactaagtca ggaggccaga 481 gagcctccgg cctcgcctgg agcctgaacg gagccctcct ctcccacagg cccaagatat 541 aacccaccag tgccttttgt cttcctgtca gctctgtcaa tcacgcctgt cctctcacgc 601 ccacaccaag tgcccaaagt ggggagggac aagagattct ggaaagtgag cctccccata 661 ccctcttttg ttctccccac cctgatactt gttattaaga aatgaataaa ataaactcac 721 ttttttccaa // LOCUS MUSMK3 1039 bp ss-mRNA ROD 17-JUL-1990 DEFINITION Mouse retinoic acid-responsive protein (MK-3) mRNA, complete cds. ACCESSION M34328 J05473 KEYWORDS DNA-binding protein; regulatory protein; retanoic acid-responsive protein; secreted protein. SOURCE Mouse HM-1 embryonal carcinoma, cDNA to mRNA, clones pB3-30 and FK-3. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 1039) AUTHORS Tomomura,M., Kadomatsu,K., Matsubara,S. and Muramatsu,T. TITLE A retinoic acid responsive gene, MK, found in the teratocarcinoma system: Heterogeneity of the transcript and the nature of the translation JOURNAL J. Biol. Chem. 265, 10765-10770 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by M.Tomomura and T.Muramatsu, 16-MAY-1990. FEATURES from to/span description pept 355 777 retanoic acid-responsive protein (MK-3) precursor sigp 355 420 retanoic acid-responsive protein signal peptide matp 421 774 retanoic acid-responsive protein mRNA < 1 1016 MK-3 mRNA BASE COUNT 278 a 292 c 302 g 167 t ORIGIN 1 cccaggtacc cagctccctg ccacatcaga gacccttctt gcactctgag tgaactgatt 61 aaaaaaaaaa aaaaaaaaaa aaaaccaagc cggaggtgag ccgggcctcg aagggaaggt 121 tcgcgggtgc ggtggccgcc ccgagcctgt gacaccagga catactcccg gggcccgcgg 181 tgggcaagcg aagtggtgac ctgagagctg acaggctgcg agagggaaaa gtatagacag 241 gcctagacca ggggaagggg aggggataga gagctgggcc tgctacgagg ggacctgagc 301 cagaagcgca ctggtaaaac cgaactccag gaccagagac ccagagatca gaggatgcag 361 caccgaggct tcttccttct cgcccttctt gccctcttgg tggtcacgtc cgcggtggcc 421 aaaaaaaaag agaaggtgaa gaagggcagc gagtgttcgg agtggacctg ggggccctgc 481 acccccagca gcaaggactg cggcatgggc ttccgcgagg gtacctgtgg ggcccagacc 541 cagcgcgtcc attgcaaggt gccctgcaac tggaagaagg aatttggagc cgactgcaaa 601 tacaagtttg agagctgggg ggcgtgtgat gggagcactg gcaccaaagc ccgccaaggg 661 accctgaaga aggcgcggta caatgcccag tgccaggaga ccatccgcgt gactaagccc 721 tgcacctcca agaccaagtc aaagaccaaa gccaagaaag gaaaaggaaa ggactaagtc 781 aggaggccag agagcctccg gcctcgcctg gagcctgaac ggagccctcc tctcccacag 841 gcccaagata taacccacca gtgccttttg tcttcctgtc agctctgtca atcacgcctg 901 tcctctcacg cccacaccaa gtgcccaaag tggggaggga caagagattc tggaaagtga 961 gcctccccat accctctttt gttctcccca ccctgatact tgttattaag aaatgaataa 1021 aataaactca cttttttcc // LOCUS YSCVPS33P 2695 bp ds-DNA PLN 17-JUL-1990 DEFINITION S.cerevisiae vacuolar biogenesis and vacuolar protein sorting protein (VPS33/SLP1), complete cds. ACCESSION M34638 KEYWORDS vacular biogenesis protein; vacuolar protein sorting protein. SOURCE S.cerevisiae (strain GRF88) DNA. ORGANISM Saccharomyces cerevisiae Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes; Endomycetales; Saccharomycetaceae. REFERENCE 1 (bases 1 to 2695) AUTHORS Banta,L.M., Vida,T.A., Herman,P.K. and Emr,S.D. TITLE Characterization of the yeast Vps33p, a protein required for vacuolar protein sorting and vacuole biogenesis JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by L.M.Banta, 18-MAY-1990. Author address: L.M.Banta California Insitute of Technology 147-75 Div of Biology Pasadena, CA 91125 FEATURES from to/span description pept 401 2476 Vps33p protein signal 245 252 TATA box binding 480 498 ATP-binding domain A (put.) binding 661 679 ATP-binding domain B (put.) BASE COUNT 916 a 445 c 522 g 812 t ORIGIN 1 gtcaagaagc gggcttggct gccataatgg aagatttaga tgtcgcaaaa gtggaattac 61 gtcattttga gaaagctttt aaaggaattg ctaggggcat tactccagaa atgctctctt 121 attatgaaga gtttgctcta agaagcggtt catcttcgta agcttgttca tagtcaattc 181 ttttcctttg tgtgctcaat aatagtagat agaaattata ctgaactccg gtcattttgt 241 ataatatatt aatcacttca cacgaacata cataaataaa atatcataaa ggttagcaaa 301 ttggaactag ttatatgtta attagttaaa agatagaaaa ttcgagaaag gaagaaaaag 361 ctgatattgc ccatctccaa ctttatcaaa tcatttcacg atgaatagat tttggaatac 421 taagaaattt tcattaacaa atgccgatgg actatgtgct accttaaatg agatatctca 481 aaatgatgaa gttcttgtgg ttcaaccaag tgtattgcca gtactcaata gtttgctaac 541 tttccaagat ttgactcaat caactcctgt aaggaaaatt acgttactcg atgatcagct 601 aagtgacgat ttaccgagtg ccttaggcag cgttccgcaa atggatctta tttttcttat 661 tgatgtcaga acatctctcc gactccctcc acaactgctt gatgctgctc aaaagcacaa 721 tttatcatct ttgcatataa tatactgtcg atggaaaccg tctttccaaa atactttgga 781 ggatacagag caatggcaaa aggatggttt cgatttgaat tcaaaaaaaa cacatttccc 841 taacgtcatt gaatctcagt taaaggagct atcgaacgaa tatacccttt acccttggga 901 tctcttgccc ttcccacaga ttgatgaaaa tgttctattg actcattccc tttataacat 961 ggaaaatgta aacatgtatt atcccaactt acgttctttg cagagtgcca cagagtcaat 1021 actggttgat gatatggtca attcgttgca gagcttgatt tttgaaacta atagtatcat 1081 aacaaatgtt gtgtcgatag gtaatctgtc taagagatgt agccatcttt tgaagaaacg 1141 aatcgatgag catcaaacag agaatgattt attcatcaag ggtacgcttt atggtgaacg 1201 aaccaactgt ggactagaaa tggacttgat tatcttggaa aggaataccg atcctataac 1261 gccattgttg acacaactta cgtatgcagg aatactagat gatctatatg aattcaattc 1321 tggcataaag ataaaggaga aagacatgaa cttcaattat aaggaagata aaatatggaa 1381 tgatttgaaa tttttaaatt ttgggtcgat tgggccgcag ttaaataaat tggcaaagga 1441 actacaaacg caatatgata caaggcataa agccgagagc gtacatgaaa tcaaagaatt 1501 cgttgattcc ttaggttctt tgcaacaaag gcaagctttt ttgaaaaatc acacaacctt 1561 atcatccgac gttttgaaag tggtagagac tgaagagtac ggatctttca ataaaatctt 1621 agagttagag ctggaaattt tgatgggaaa tacacttaat aacgacattg aagatattat 1681 actcgagttg cagtaccagt acgaggttga tcaaaagaag attctcagat taatctgttt 1741 attgtctctt tgtaaaaatt cacttcgaga aaaggattat gaatatctaa gaacctttat 1801 gatcgactct tggggcattg aaaaatgctt tcaacttgaa tcattggctg agttaggatt 1861 tttcactagc aaaacgggaa aaactgattt gcatattaca acaagtaagt caacaagatt 1921 acagaaagaa taccgttata tttcacaatg gttcaataca gtacccatag aagacgagca 1981 tgctgccgat aaaatcacaa atgagaacga tgacttctcg gaagccactt ttgcttacag 2041 tggtgtagtg cccttgacaa tgagactggt tcagatgtta tatgataggt ctatcttgtt 2101 ccataattat tcctcgcagc agccttttat actgtcaaga gaacctagag tttctcaaac 2161 ggaggattta attgaacagt tatatggaga ctcacatgcg atcgaagaga gtatatgggt 2221 cccgggaacc attacaaaaa agatcaatgc aagcatcaag agcaataata gacggtccat 2281 agacggatct aatgggacat ttcatgctgc agaggatatt gcactcgtag tattcctcgg 2341 aggtgtaaca atgggtgaaa tagctataat gaagcatttg caaaaaatac taggtaaaaa 2401 aggtatcaat aaaaggttta tcatcatcgc cgatggcttg atcaatggca caaggatcat 2461 gaactctata tcttaattat tatatgatag atttgttaat tttttgtata tgcaaatgtg 2521 ctttttttca ccaaacggtt tgcaccaatc atacgagaga agtgttcggt gtttacggaa 2581 aagctagggg actaagaaaa attgaaaata aaggctgaca gcagtagaaa ccattgtgct 2641 ggcttagtga tttataagaa tggttaatta gttttgtatc ctttaatttt ctaga // LOCUS BOVCASA 1123 bp ss-mRNA MAM 17-JUL-1990 DEFINITION Bovine alpha-s1-casein mRNA, complete cds. ACCESSION M33123 KEYWORDS alpha-s1-casein. SOURCE Bovine (strain Holstein) lactating mammary gland, cDNA to mRNA, clone p-alpha-s1 C228. ORGANISM Bos taurus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Artiodactyla; Ruminantia; Pecora; Bovidae. REFERENCE 1 (bases 1 to 1123) AUTHORS Nagao,M., Maki,M., Sasaki,R. and Chiba,R. TITLE Isolation and sequence analysis of bovine alpha-s1-casein cDNA clone JOURNAL Agric. Biol. Chem. 48, 1663-1667 (1984) STANDARD simple staff_review FEATURES from to/span description pept 64 708 alpha-s1-casein precursor sigp 64 108 alpha-s1-casein signal peptide matp 109 705 alpha-s1-casein mRNA < 1 1123 alpha-s1-casein mRNA BASE COUNT 331 a 247 c 223 g 322 t ORIGIN 1 tcacttcgac catcaaccca gcttgctgtt cttcccagtc ttgggttcaa gatcttgaca 61 accatgaaac ttctcatcct tacctgtctt gtggctgttg ctcttgccag gcccaaacat 121 cctatcaagc accaaggact ccctcaagaa gtcctcaatg aaaatttact caggtttttt 181 gtggcacctt ttccagaagt gtttggaaag gagaaggtca atgaactgag caaggatatt 241 gggagtgaat caactgagga tcaagccatg gaagatatta agcaaatgga agctgaaagc 301 atttcgtcaa gtgaggaaat tgttcccaat agtgttgagc agaagcacat tcaaaaggaa 361 gatgtgccct ctgagcgtta cctgggttat ctggaacagc ttctcagact gaaaaaatac 421 aaagtacccc agctggaaat tgttcccaat agtgctgagg aacgacttca cagtatgaaa 481 gagggaatcc atgcccaaca gaaagaacct atgataggag tgaatcagga actggcctac 541 ttctaccctg agcttttcag acaattctac cagctggatg cctatccatc tggtgcttgg 601 tattacgttc cactaggcac acaatacact gatgccccat cattctctga catccctaat 661 cccattggct ctgagaacag tgaaaagact actatgccac tgtggtgaag agtcaagtga 721 attctgaggg actccacagt tatggtcttt gatgggtctg aaaattccat gctctacatg 781 tcgcctcatc tacatgtcaa accattcatc caaaggcttc aactgctgtt ttagaacagg 841 gcaatctcaa actgaggcac tccttgatgc tctactgtat tttagatagt gtaacatcct 901 taagtgaaat tgtcctaaca gcttgttacc taaattccag tagtatcatg ctggtataaa 961 ggccactgag tcaaagggaa ttaaagtctt cattaaattt ctgtatggaa aatgttttaa 1021 aagcctttga atcacttctc ctgtaagtgc catcatatca aataattgtg tgcattaact 1081 gagattttgt ctttcttctt ttcaataaat tacattttaa ggc // LOCUS BPHINTXIS 1741 bp ds-DNA PHG 17-JUL-1990 DEFINITION Bacteriophage phi-11 integrase (int) and excisionase (xis) genes, complete cds. ACCESSION M34832 KEYWORDS excisionase; integrase. SOURCE Bacteriophage phi-11 DNA. ORGANISM Bacteriophage phi-11 Viridae; Nonclassified viruses. REFERENCE 1 (bases 1 to 1741) AUTHORS Ye,Z.-H., Buranen,S.L. and Lee,C.Y. TITLE Sequence analysis and comparison of int and xis genes from Staphylococcal bacteriophages l54a and phi-11 JOURNAL J. Bacteriol. 172, 2568-2575 (1990) STANDARD simple staff_review FEATURES from to/span description pept 267 67 (c) excisionase (xis) pept 379 1425 integrase (int) BASE COUNT 650 a 264 c 263 g 564 t ORIGIN 1 cctatgccag caccagtgaa actctattat gcatggtatt aaaatcgaag agtacaattc 61 gataattcaa acattatttg acgaaatagc taagctgtct aatgtatata agtctcttaa 121 taaacagtaa gcaaaatcgg attcttcatt acataccgaa tattcatcat aaacactgac 181 tgcatcttct aagacatttt ttaaaattct aatgtcttca ttcgttaaaa ctaattcatt 241 gaaattatga ttgtttttaa atgtcataac atcacctact ttttatttta ttatatcaca 301 tttagtacct agtactaaat ttcgggtagc ccgcctaccc ttattatttt ttgccaattt 361 tgaggaggga gaagcaaaat gccagtatat aaggatgata atacaggtaa atggtatttt 421 tccattagat ataaagatgt atacggtaat aacaaacgaa aaatgaagcg tgggtttgaa 481 cgtaagaaag atgccaaact agctgaaagc gaatttatac aaaatgttaa atatggatac 541 tcggacaatc aaccctttga atatatattt tttgatcgtt taaaaaatga aaatctttct 601 gcacgctcaa tagaaaagcg aactacagaa tataatactc acataaaaga aaggttcgga 661 aatatcccta ttggcaaaat cactactacg caatgtactg ctttcaggaa ttatttgtta 721 aacgatgcag gtctttctgt tgactatgca cgatctgtgt gggcaggttt taaagcagtt 781 atcaattacg ccaaaaagca ttacaagctc ttatacgacc ccacattatc ggtaactcct 841 attcccagaa caaaaccaca agctaaattt atcactcgtg aagaatttga tgaaaaagta 901 gaacaaatca caaatgatac ttctcgtcag ctaactagac tgttatttta ttctggtctt 961 agaataggag aagctttagc tttgcagtgg aaagattacg ataaaataaa aggcgaaatt 1021 gacgtaaata agaaaatcaa tttaagtaat agaaaaattg aatataatct aaaaaaagaa 1081 agctctaaag ggataatacc tgtaccaaat ttaattagag agatgcttaa aaacatgtat 1141 aatgaatctt ctaaaagata taaatatttt gacgaaaact attttatatt cgggggttta 1201 gaacctatta gatacgttac ttattcgtat cattttaaat ctgtattccc gaatctaaaa 1261 atacaccatt taagacactc gtacgctagc tatttaatta ataatggtgt agatatgtat 1321 ttattaatgg aattaatgag gcattctaac attacagaaa caattcaaac gtactctcat 1381 ttatatactg ataaaaaaca tcaagctatg agcatatttg attaaacggt atcaaattgg 1441 tatcaaataa caattaagga gtttataaaa tgcgtaataa caagcctaaa ataagtattc 1501 aaaacgaccc atgggaagtg aaatttatat acatttaaat ttcatgagac aataaacgtt 1561 gatttaatgc gtttttttgc cttttttatt ttccttattt tttctgtttt acaacaaaat 1621 ggtatcaaaa atggtatcat ttgtagttat tttagcttca catattaaaa caaccacact 1681 cctaaattaa taggtggtgt ggttttgttg gttgtgtggg gataaaaata accgcatcag 1741 t // LOCUS BSTNPRAS 3510 bp ds-DNA BCT 17-JUL-1990 DEFINITION B.stearothermophilus neutral protease (nprS), and transcriptional activator (nprA) genes, complete cds. ACCESSION M34237 KEYWORDS neutral protease; transcriptional activator. SOURCE B.stearothermophilus (strain TELNE) DNA, clone pSP53. ORGANISM Bacillus stearothermophilus Prokaryota; Bacteria; Firmicutes; Endospore-forming rods and cocci; Bacillaceae. REFERENCE 1 (bases 1 to 3510) AUTHORS Nishiya,Y. and Imanaka,T. TITLE Cloning and nucleotide sequences of the neutral protease gene and its transcriptional activator gene from Bacillus stearothermophilus JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by Y.Nishiya, 11-MAY-1990. FEATURES from to/span description pept 181 1401 transcriptional activator (nprA) pept 1750 3405 neutral protease (nprS) precursor sigp 1570 2274 neutral protease (nprS) signal and propeptide matp 2275 3219 neutral protease (nprS) BASE COUNT 1177 a 581 c 737 g 1015 t ORIGIN 1 tacggtcttc agacatttct attcctatag cccaaatgag tagttccttt tggaggagaa 61 aatgtgtata atttttagta aatttatatt agtaaaaaat taagaaggag taggtattat 121 ttgaagattg gtgatcgctt aaaattttcc cgtatcaaac ataagttaac gcaagaggaa 181 gtggctgacg gaattatttc cgtatcatat ttatcaaaaa ttgaaaacaa tcaagtggtt 241 ccaagtgaag aagtgcttcg cctcctttgt caacggttgg gaatcaacaa tatcctgaaa 301 aatagacaag atgaattaac aagtaaattg ttattatggt acaaaacgat tacggataaa 361 aaccgacagg aagcagcccg gatgtacgag gaaatcaaac gaactttcga tgacgtccag 421 ggggcggaat ccatcgctta ctttctgttg tttgaaatgc gctatcactt gttattaaaa 481 gatattcata ctgtcgaagc gttgttgatc aaattaaggg aattgtatga cacctttgat 541 gatgtgatga agtattatta ttataaattt ttaggtctac tttactattg caaggaaaaa 601 tatgaagatg ctttggaata ttataaaaag gcggagcagc gatttcgaag ccaatcattt 661 gaaaaatggg aagaagctga tttgcattat ttactagcgc ttgtttatag ccggctctgg 721 agaatattag gctgtattaa ctatgcgcag catgctttag cgatttacca atccgaatac 781 gatttaaagc gaagcgctga atgccacatt ttacttggta tttgttacag aaggtacgga 841 gaagtagatc aagcgatcga atgctattca ttggcccata aaattgccca aatcattaat 901 gataccgaat tattaggtac gattgagcat aacctaggct acttaatgtc aatgaaacat 961 gagcattatg aagccattca gcattataag aagagtttgc tgtataagcg aaactcttca 1021 ttacaagcta gatttattac gttgttttct ctcatcaaag aatattatgt ttccaaaaac 1081 tataaaaaag cattagccaa tgtagaggaa agtttgcagc ttctcaagag ggaaaaagat 1141 gggatgacaa cgtattatga atattatctt catttcacag tttatcaata tttactatca 1201 gaagatattt cggaaaatga atttgaaaca tttatgaaag atcgagtgct cccttatttt 1261 caaaggttta aaaaatatga agatgttgca caatacgctg aatacttggc aatctattac 1321 gagaaacgtc ataagtataa actagcaagc aaattctata aaatgagtta tcaatttcta 1381 aaaaatatga taaatattta ggagggattt ttttgaaaaa gcttttatta ggaatcatga 1441 cgtttggtat tatgagttta cttgttctca ttggtagtga ccaagaacca aaatatgtgg 1501 caaaagacga acatccgcct ccaaccatca tcattgcagc gaaagatgaa catccaccag 1561 caacgattat ttgaagagga ataagcaaaa agacagctag ttttctagct gtcttttttc 1621 atgcatagga aaatgtgaaa aaaacgtagg gaattatcaa ctatatcaga ctctattttt 1681 cccaatacaa aatactgtaa aatattgtgt ttaatattct aaatacaaag aataaaggag 1741 gatgaaaaaa tgaaaaggaa aatgaaaatg aaattagtac gttttggtct tgcagcagga 1801 ctagcggccc aagtattttt tttaccttac aatgcgctgg cttcaacgga acacgttaca 1861 tggaaccaac aatttcaaac ccctcaattc atctccggtg atctgctgaa agtgaatggc 1921 acatccccag aagaactcgt ctatcaatat gttgaaaaaa acgaaaacaa gtttaaattt 1981 catgaaaacg ctaaggatac tctacaattg aaagaaaaga aaaatgataa ccttggtttt 2041 acgtttatgc gcttccaaca aacgtataaa gggattcctg tgtttggagc agtagtaact 2101 gcgcacgtga aagatggcac gctgacggcg ctatcaggga cactgattcc gaatttggac 2161 acgaaaggat ccttaaaaag cgggaagaaa ttgagtgaga aacaagcgcg tgacattgct 2221 gaaaaagatt tagtggcaaa tgtaacaaag gaagtaccgg aatatgaaca gggaaaagac 2281 accgagtttg ttgtttatgt caatggggac gaggcttctt tagcgtacgt tgtcaattta 2341 aactttttaa ctcctgaacc aggaaactgg ctgtatatca ttgatgccgt agacggaaaa 2401 attttaaata aatttaacca acttgacgcc gcaaaaccag gtgatgtgaa gtcgataaca 2461 ggaacatcaa ctgtcggagt gggaagagga gtacttggtg atcaaaaaaa tattaataca 2521 acctactcta cgtactacta tttacaagat aatacgcgtg gaaatgggat tttcacgtat 2581 gatgcgaaat accgtacgac attgccggga agcttatggg cagatgcaga taaccaattt 2641 tttgcgagct atgatgctcc agcggttgat gctcattatt acgctggtgt gacatatgac 2701 tactataaaa atgttcataa ccgtctcagt tacgacggaa ataatgcagc tattagatca 2761 tccgttcatt atagccaagg ctataataac gcattttgga acggttcgca aatggtgtat 2821 ggcgatggtg atggtcaaac atttattcca ctttctggtg gtattgatgt ggtcgcacat 2881 gagttaacgc atgcggtaac cgattataca gccggactca tttatcaaaa cgaatctggt 2941 gcaattaatg aggcaatatc tgatattttt ggaacgttag tcgaatttta cgctaacaaa 3001 aatccagatt gggaaattgg agaggatgtg tatacacctg gtatttcagg ggattcgctc 3061 cgttcgatgt ccgatccggc aaagtatggt gatccagatc actattcaaa gcgctataca 3121 ggcacgcaag ataatggcgg ggttcatatc aatagcggaa ttatcaacaa agccgcttat 3181 ttgattagcc aaggcggtac gcattacggt gtgagtgttg tcggaatcgg acgcgataaa 3241 ttggggaaaa ttttctatcg tgcattaacg caatatttaa caccaacgtc caactttagc 3301 caacttcgtg ctgccgctgt tcaatcagcc actgacttgt acggttcgac aagccaggaa 3361 gtcgcttctg tgaagcaggc ctttgatgcg gtaggggtga aataaagtgg tatctcatca 3421 gtgggggatt ttttcctcca ctgatgtttt gtttgtgatc ttttaatgat gtattggggt 3481 gcaaaatgcc caaaggctta taatgttgat // LOCUS HSEGP14 3347 bp ds-DNA VRL 17-JUL-1990 DEFINITION Equine herpesvirus type 1 glycoprotein 14 (gp14) gene, complete cds. ACCESSION M34861 KEYWORDS glycoprotein 14. SOURCE Equine herpesvirus type 1 DNA. ORGANISM Equine herpesvirus type 1 Viridae; ds-DNA enveloped viruses; Herpesviridae; Alphaherpesvirinae. REFERENCE 1 (bases 1 to 3347) AUTHORS Guo,P. TITLE Characterization of the gene and an antigenic determinant of equine herpesvirus type-1 glycoprotein 14 with homology to gB-equivalent glycoproteins of other herpesviruses JOURNAL Gene 87, 249-255 (1990) STANDARD simple staff_review FEATURES from to/span description pept 300 3239 glycoprotein 14 (gp14) BASE COUNT 885 a 891 c 851 g 720 t ORIGIN 1 tacaacggtt gaaacgtggt gtacgcatct caagagacta gctcgtttat gataactgcg 61 gctaaaggtg aattggtcaa ttagcgaagt ttcaaaggtt ttattgcttt gaagggagtg 121 acaggtgtga cggccacgca gcggctggcg tgaaatatat cggggagctc atcctagccg 181 ccgcagtatt ctcctcggtt ttccactgtg gagaggtgcc tcctgcgcgc agatcgtacc 241 tacccggact ccgcgccaca gtgctgcgtg agcggcattt acataaccta cgaggcgtca 301 tgtcctctgg ttgccgttct gtcggcggct ccacatgggg caattggcgc ggagacggtg 361 gtgatttacg acagcgacgt gttctctctc ctgtatgcag tgctccagca gctggctcct 421 ggatcgggag ccaactaggc aatgttggaa acttactcgc caccccccac ccgctgggaa 481 agccggcatc atcgagggtg ggcacaatag ttctagcctg tttgttgctt tttggaagct 541 gtgttgttag agccgtaccc accacgccaa gccccccaac tagtactccc acttccatgt 601 caacgcactc ccatgggaca gtagacccta cgctgctccc cacagaaacg cccgacccac 661 tcagactggc tgtgcgcgag tccggtatac tcgctgagga tggagacttt tacacctgcc 721 caccgcctac cggatccacc gtcgtacgca tcgaaccacc tagaacttgc cccaagtttg 781 accttgggag aaacttcacg gaggggattg ctgttatttt taaggaaaac atcgctccct 841 acaaattcag ggcaaacgta tactacaagg acatcgttgt aacacgtgtg tggaaaggat 901 acagccatac gtccctgtcc gacagataca atgacagggt tccggtttcg gtggaggaga 961 tcttcggtct catcgacagt aagggaaaat gttcgtcaaa ggccgagtac ctcagagata 1021 acatcatgca ccacgcgtac cacgacgacg aggacgaggt ggagcttgat ttggtgccgt 1081 ccaagtttgc aactccgggg gccagagcct ggcagaccac caacgatact acgtcttacg 1141 tggggtggat gccatggagg cactacacgt caacgtctgt caactgcatc gtcgaggagg 1201 tggaggcgcg gtccgtctac ccctacgact ccttcgccct gtccaccggt gatattgtgt 1261 acgcgtctcc gttttacggc ctgagggctg ccgctcgcat agagcacaat agctacgcgc 1321 aggagcgttt caggcaagtt gaagggtaca ggccccgcga cttagacagt aaactacaag 1381 ccgaagagcc ggttaccaaa aattttatca ctaccccgca tgtcaccgtc agctggaact 1441 ggaccgagaa gaaagtcgag gcgtgtacgc tgaccaaatg gaaagaggtc gacgaactcg 1501 tcagggacga gttccgcggg tcctacagat ttactattcg atccatctcg tcttacttta 1561 tcagtaacac tactcaattt aagttggaaa gtgcccccct tactgaatgt gtatccaaag 1621 aagcaaagga agccatagac tcgatataca aaaagcagta cgagtctacg cacgtcttta 1681 gcggtgatgt ggaatattac ctggcacgcg gggggttctt aattgcattc agacctatgc 1741 tctccaacga actcgccagg ctgtacctga acgagcttgt gagatctaac cgcacctacg 1801 acctaaaaaa tctattgaac cccaatgcaa acaataacaa taacaccacg cgaagacgca 1861 ggtctctcct gtcagtacca gaacctcagc caacccaaga tggtgtgcat agagaacaaa 1921 ttctacatcg cttgcacaaa cgagcagtgg aggcaacggc aggtaccgat tcttccaacg 1981 tcaccgccaa acagctggag ctcatcaaaa ccacgtcgtc tatcgagttt gccatgctac 2041 agtttgcata cgatcacatc caatcccacg tcaatgaaat gctaagtaga atagcaactg 2101 cgtggtgtcc cctccaaaac aaagagcggc ccctatggaa cgaaatggtg aagattaccc 2161 cgagcgccat agtctccgca acccttgacg agcgagttgc agcgagggtc ctgggggacg 2221 tgatagctat aacgcactgc gccaaaatag agggcaacgt gtacttgcaa aactccatgc 2281 gctcgatgga cagtaacact tgctactccc gcccccccgt aacatttaca attactaaga 2341 atgcaaacaa cagagggtcg atagaaggcc agctgggaga ggagaacgag attttcacgg 2401 agcgcaagct gatcgagccg tgcgccctca atcagaagcg ctactttaag tttggcaaag 2461 agtacgttta ctacgagaac tacacgttcg tccgcaaagt gccccccacg gaaatcgagg 2521 ttatcagcac gtacgttgaa ctaaacttga cccttttgga agaccgcgag tttctgcccc 2581 tggaggtgta cacgcgggct gagctggagg acaccggcct gctagactac agcgaaatac 2641 agcgccgcaa ccagctccac gctctcaggt tttacgacat cgacagcgtg gtcaacgtgg 2701 acaataccgc agtgattatc aggggatcgc cagctttttc aagggcctgg gtaaagtggg 2761 ggaggccgtg ggaacgctcg ttctcggcgc gcggcgctgt tgtttcaacc gtatctggaa 2821 tagcttgctt tttaaacaac ccatttgggg ggctagccat cggcctgctg gtaatcgccg 2881 gcctggtagc tgcgtttttt gcttacagat atgtaatgca gatccgcagt aaccccatga 2941 aagctctata ccccataaca acaaaggcct tgaaaaacaa agccaaaact tcctacggcc 3001 agaacgagga ggacgatggg agcgactttg atgaggccaa gcttgaagag gctcgcgaaa 3061 tgatcaaata catgtctatg gtttcggccc tggaaaagca ggaaaagaaa gctataaaga 3121 aaaacagtgg ggttggcctg atcgccagta acgtctcaaa gctggccctg cgaaggcgcg 3181 gtcccaaata tacccgactc caacagaacg ataccatgga aaatgaaaaa atggtttaaa 3241 catgtttaat aaatattatg acacgtactc aaagtgtgac ctcatatttg cataaccact 3301 tctagttccg gcccaaggat atttaagcct agtatctccg ccgaagg // LOCUS HUMHBGAA 1227 bp ds-DNA PRI 17-JUL-1990 DEFINITION Human A-gamma-globin gene, 3' end. ACCESSION M33200 KEYWORDS A-gamma-globin. SOURCE Human (hereditary persistence of fetal hemoglobin individual II-1) DNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 1227) AUTHORS Gelinas,R.E., Rixon,M., Magis,W. and Stamatoyannopoulos,G. TITLE Gamma gene promoter and enhancer structure in Seattle variant of hereditary persistence of fetal hemoglobin JOURNAL Blood 71, 1108-1112 (1988) STANDARD simple staff_review FEATURES from to/span description pept < 1 3 A-gamma-globin (AA at 1) /hgml_locus_uid="LK0092S" /nomgen="HBG1" /map="11p15.5" mut 794 794 t in wt; c in mutant mut 970 970 c in wt; a in mutant mut 1186 1186 a in wt; g in mutant BASE COUNT 366 a 204 c 294 g 363 t ORIGIN 1 tgagcctctt gcccatgatt cagagctttc aaggataggc tttattctgc aagcaataca 61 aataataaat ctattctgct gagagatcac acatgatttt cttcagctct tttttttaca 121 tctttttaaa tatatgagcc acaaagggtt tatattgagg gaagtgtgta tgtgtatttc 181 tgcatgcctg tttgtgtttg tggtgtgtgc atgctcctca tttattttta tatgagatgt 241 gcattttgtt gagcaaataa aagcagtaaa gacacttgta cacgggagtt ctgcaagtgg 301 gagtaaatgg tgtaggagaa atccggtggg aagaaagacc tctataggac aggacttctc 361 agaaacagat gttttggaag agatgggaaa aggttcagtg aagacctggg ggctggattg 421 attgcagctg agtagcaagg atggttctta atgaagggaa agtgttccaa gctttaggaa 481 ttcaaggttt agtcaggtgt agcaattcta ttttattagg aggaatacta tttctaatgg 541 cacttagctt ttcacagccc ttgtggatgc ctaagaaagt gaaattaatc ccatgccctc 601 aagtgtgcag attggtcaca gcatttcaag ggagagacct cattgtaaga ctctggggga 661 ggtggggact taggtgtaag aaatgaatca gcagaggctc acaagtcagc atgagcatgt 721 tatgtctgag aaacagacca gcactgtgag atcaaaatgt agtgggaaga atttgtacaa 781 cattaattgg aaggtttact taatggaatt tttgtatagt tggatgttag tgcatctcta 841 taagtaagag tttaatatga tggtgttacg gacctaatgt ttgtgtctcc tcaaaattca 901 catgctgaat ccccaactcc caactgacct tatctgtggg ggaggctttt gaaaagtaat 961 taggtttagc tgagctcata agagcagatc cccatcataa aattattttc cttatcagaa 1021 gcagagagac aagccatttc tctttcctcc cggtgaggac acagtgagaa gtccgccatc 1081 tgcaatccag gaagagaacc ctgaccacga gtcagccttc agaaatgtga gaaaaaactc 1141 tgttgttgaa gccacccagt cttttgtatt ttgttatagc accttacact gagtaaggca 1201 gatgaagaag gagaaaaaaa taagctt // LOCUS HUMHBQ1A 1114 bp ds-DNA PRI 17-JUL-1990 DEFINITION Human theta-1-globin gene, complete cds. ACCESSION M33022 KEYWORDS theta-1-globin. SOURCE Human black female with alpha-thal-2 heterozygosity white cell DNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 1114) AUTHORS Gonzalez-Redondo,J.M., Han,I.S., Gu,Y.-C. and Huisman,T.H.J. TITLE Nucleotide sequence of the human theta-1-globin gene JOURNAL Biochem. Genet. 26, 207-211 (1988) STANDARD simple staff_review FEATURES from to/span description pept 359 453 theta-1-globin, exon 1 /hgml_locus_uid="LV0155X" /nomgen="HBQ1" /map="16p13.3" 538 742 theta-1-globin, exon 2 852 980 theta-1-globin, exon 3 IVS 454 537 theta-1-globin intron A IVS 743 851 theta-1-globin intron B BASE COUNT 166 a 386 c 393 g 169 t ORIGIN 1 atcccagtta ctcgggaggc tgaggcagga gaatcgtttg aacccgggag gcggaggttg 61 cagtgagccg gaatggcgcc actgcactca ccgcacccgg ccaatttttg tgtttttagt 121 agagactaaa taccatatag tgaacaccta agacgggggg ccttggatcc agggcgattc 181 agagggcccc ggtcggagct gtcggagatt gagcgcgcgc ggtcccggga tctccgacga 241 ggccctggac ccccgggcgg cgaagctgcg gcgcggcgcc ccctggaggc cgcgggaccc 301 ctggccggtc cgcgcaggcg cagcggggtc gcagggcgcg gcgggttcca gcggggggat 361 ggcgctgtcc gcggaggacc gggcgctggt gcgcgccctg tggaagaagc tgggcagcaa 421 cgtcggcgtc tacacgacag aggccctgga aaggtgcggc aggctgggcg cccccgcccc 481 caggggccct ccctccccaa gccccccgga cgcgcctcac ccacgttcct ctcgcaggac 541 cttcctggct ttccccgcca cgaagaccta cttctcccac ctggacctga gccccggctc 601 ctcacaagtc agagcccacg gccagaaggt ggcggacgcg ctgagcctcg ccgtggagcg 661 cctggacgac ctaccccacg cgctgtccgc gctgagccac ctgcacgcgt gccagctgcg 721 agtggacccg gccagcttcc aggtgagcgg ctgccgtgct gggcccctgt ccccgggagg 781 gccccggcgg ggtgggtgcg gggggcgtgc ggggcgggtg caggcgagtg agccttgagc 841 gctcgccgca gctcctgggc cactgcctgc tggtaaccct cgcccggcac taccccggag 901 acttcagccc cgcgctgcag gcgtcgctgg acaagttcct gagccacgtt atctcggcgc 961 tggtttccga gtaccgctga actgtgggtg ggtggccgcg ggatccccag gcgaccttcc 1021 ccgtgtttga gtaaagcctc tcccaggagc agccttcttg ccgtgctctc tcgaggtcag 1081 gacgcgagag gaaggcgccg cccctcccca agga // LOCUS HUMITIH1A 1149 bp ss-mRNA PRI 17-JUL-1990 DEFINITION Human inter-alpha-trypsin inhibitor heavy chain mRNA, partial cds. ACCESSION M33033 KEYWORDS inter-alpha-trypsin inhibitor heavy chain. SOURCE Human, cDNA to mRNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 1149) AUTHORS Salier,J.-P., Diarra-Mehrpour,M., Sesbouee,R., Bourguignon,J. and Martin,J.-P. TITLE Human inter-alpha-trypsin inhibitor: Isolation and characterization of heavy (H) chain cDNA clones coding for a 383 amino-acid sequence of the H chain JOURNAL Biol. Chem. Hoppe-Seyler 369, 15-18 (1988) STANDARD simple staff_review FEATURES from to/span description pept < 1 > 1149 inter-alpha-trypsin inhibitor heavy chain (AA at 1) /hgml_locus_uid="LE0221G" /nomgen="ITIH1" /map="3p21.2-p21.1" BASE COUNT 332 a 292 c 253 g 272 t ORIGIN 1 ggaggcacaa acatcaacga agcactccta cgggcaatct tcattttgaa tgaagccaat 61 aacttgggac tgttagaccc caactccgtc tcgctgatca ttttggtttc tgatggagat 121 ccaacagtgg gcgaactaaa actgtcaaaa attcagaaaa acgttaagga gaacatccaa 181 gacaatatct ccttgttcag tttgggcatg ggatttgatg tggactatga ttttttgaag 241 agactgtcca atgaaaacca tggaattgca caaaggattt atggaaacca ggacacgtct 301 tcccagctta agaaattcta caaccaggtc tccactccat tgctccggaa tgttcagttc 361 aactatcccc atacatcagt cacggacgtc actcaaaaca atttccataa ctactttgga 421 ggctcagaga ttgtggtggc aggaaaattt gaccctgcta aattggatca aatagagagc 481 gttatcacgg cgacttcggc taacacgcag ttagtcttgg agaccctggc ccagatggac 541 gacttgcagg attttctatc gaaagacaag catgcagatc ccgatttcac caggaaactg 601 tgggcctatc taaccatcaa ccaactgcta gctgaacgaa gcctggctcc tacagctgcc 661 gccaagagaa gaattacaag atcgatcctg cagatgtctc tagaccacca cattgtgact 721 ccgctgacct cgctggtgat cgagaacgag gctggggatg agcgcatgct ggcggatgcc 781 ccaccgcagg atccctcctg ctgctcaggg gccctgtatt acggcagcaa agtggttcca 841 gattccaccc cgtcttgggc caatccttca gcaacgcccg tgatctccat gctggcacaa 901 ggatctcagg tgctagagtc cacgccaccc ccacatgtga tgagagttga aaatgaccca 961 cattccatca tttatctacc aaaaagccaa aagaacattt gtttcaatat tgactcagaa 1021 cctggaaaaa tcctcgacct ggcttctgac ccagaatcag gaattgtagt caacggtcag 1081 cttgttggtg ccaagaagcc caacaatgga aaactaagca cctattttgg aaaactggga 1141 ttttatttc // LOCUS HUMPTHROM 327 bp ss-mRNA PRI 17-JUL-1990 DEFINITION Human thrombin mRNA, 5'end. ACCESSION M33031 KEYWORDS serine protease; thrombin. SOURCE Human, cDNA to mRNA, clone pIIH13. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 327) AUTHORS MacGillivray,R.T.A., Irwin,D.M., Guinto,E.R. and Stone,J.C. TITLE Recombinant genetic approaches to functional mapping of thrombin JOURNAL Ann. N.Y. Acad. Sci. 485, 73-79 (1986) STANDARD simple staff_review FEATURES from to/span description pept 28 > 327 thrombin precursor /hgml_locus_uid="LD0134L" /nomgen="F2" /map="11p11-q12" sigp 28 156 thrombin signal peptide matp 157 > 327 prothrombin BASE COUNT 60 a 97 c 109 g 61 t ORIGIN 1 ccgtagtgac ccaggagctg acacactatg gcccgcatcc gaggcttgca gctgcctggc 61 tgcctggccc tggctgccct gtgtagcctt gtgcacagcc agcatgtgtt cctggctcct 121 cagcaagcac ggtcgctgct ccagcgggtc cggcgagcca acaccttctt ggaggaggtg 181 cgcaagggca acctggagcg agagtgcgtg gaggagacgt gcagctacga ggaggccttc 241 gaggctctgg agtcctccac ggctacggat gtgttctggg ccaagtacac agcttgtgag 301 acagcgagga cgcctcgaga taagctt // LOCUS MUSCC3A 312 bp ss-mRNA ROD 17-JUL-1990 DEFINITION Mouse complement component C3 mRNA, partial cds. ACCESSION M33032 KEYWORDS complement component C3. SOURCE Mouse liver, cDNA to mRNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 312) AUTHORS Fey,G.H., Wiebauer,K. and Domdey,H. TITLE Amino acid sequences of mouse complement C3 derived from nucleotide sequences of cloned cDNA JOURNAL Ann. N.Y. Acad. Sci. 421, 307-312 (1983) STANDARD simple staff_review FEATURES from to/span description pept < 1 > 312 complement component C3 precursor (AA at 1) matp < 1 27 complement component C3-beta subunit (AA at 1) matp 40 273 complement component C3-alpha subunit matp 274 > 312 complement component C3-alpha' subunit BASE COUNT 90 a 77 c 90 g 55 t ORIGIN 1 gatcttgagt gcaccaagcc agcagcccgc cgccgtcgct cagtacagtt gatggaaaga 61 aggatggaca aagctggtca gtacactgac aagggtcttc ggaagtgttg tgaggatggt 121 atgcgggata tccctatgag atacagctgc cagcgccggg cacgcctcat cacccagggc 181 gagaactgca taaaggcctt catagactgc tgcaaccaca tcaccaagct gcgtgaacaa 241 cacagaagag accacgtgct gggcctggcc aggagtgaat tggaggaaga cataattcca 301 gaagaagata tt // LOCUS MUSN038A 1260 bp ss-mRNA ROD 17-JUL-1990 DEFINITION Mouse nucleolar protein N038 mRNA, complete cds. ACCESSION M33212 KEYWORDS nucleolar protein N038. SOURCE Mouse teratocarcinoma stem cell line F9, cDNA to mRNA, clone lambda-FML-185.19. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 1260) AUTHORS Schmidt-Zachmann,M.S. and Franke,W.W. TITLE DNA cloning and amino acid sequence determination of a major constituent protein of mammalian nucleoli: Correspondence of the nucleoplasmin-related protein N038 to mammalian protein B23 JOURNAL Chromosoma 96, 417-426 (1988) STANDARD simple staff_review FEATURES from to/span description pept 79 957 nucleolar protein N038 mRNA < 1 1260 nucleolar protein N038 mRNA BASE COUNT 419 a 214 c 306 g 321 t ORIGIN 1 ggcgcgtctg ttctgtggaa caggaggcag ttgttttccg tccggcttct cccacaccga 61 agtgcgcgcc tccacctcat ggaagactcg atggatatgg acatgagtcc tcttaggcct 121 cagaactacc ttttcggctg tgaactaaag gctgacaaag actatcactt taaagtggat 181 aatgatgaaa atgagcacca gttgtcatta agaacggtca gtttaggagc aggggcaaaa 241 gatgagttac acatcgtaga ggcagaagca atgaactatg aaggcagtcc aattaaagta 301 acactggcaa ctttgaaaat gtctgtacaa ccaacagttt ccctaggggg ctttgaaatt 361 acaccacctg tggtcttacg gttgaagtgt ggttcagggc ctgtgcacat tagtggacag 421 catctagtag ctgtagagga agatgcagag tctgaagatg aagatgagga ggacgtaaaa 481 ctcttaggca tgtctggaaa gcgatctgct cctggaggtg gtaacaaggt tccacagaaa 541 aaagtaaaac ttgatgaaga tgatgaggac gatgatgagg acgatgagga tgatgaggat 601 gatgatgatg atgattttga tgaagaggaa actgaagaaa aggtcccagt gaagaaatct 661 gtacgagata ccccagccaa aaatgcacaa aaatcaaacc aaaatggaaa agacttaaaa 721 ccatcaacac cgagatcaaa gggtcaagag tccttcaaaa aacaggaaaa gactcctaaa 781 acaccaaaag gacctagttc tgtagaagac attaaggcaa aaatgcaagc aagtatagaa 841 aaaggcggtt ctcttcccaa agtggaagcc aagttcatta attatgtgaa gaattgtttc 901 cggatgactg accaggaggc tattcaagat ctctggcagt ggaggaaatc tctttaagaa 961 aagggtttaa acagtttgaa atattctgtc ttcatttctg taatagttaa tatctggctg 1021 tcctttttat aatgcaaagt gagaactttc cctactgtgt ttgataaatg ttgtccaggt 1081 tcacttgcca agaatgtgtt gtctaaaatg cctgtttagt tttcaaggat ggaactccac 1141 cctttacttg gttttaagta tgtatggaat gttatgatag gacatagtaa tagtggtcag 1201 atgtggaaat ggtagggaga caaatataca tgtgaaataa actcagtatt ttaataaagt // LOCUS RATPOS 1804 bp ss-mRNA ROD 17-JUL-1990 DEFINITION Rat type-2A protein phosphatase catalytic subunit mRNA, complete cds. ACCESSION M33114 KEYWORDS type-2A protein phosphatase catalytic subunit. SOURCE Rat liver, cDNA to mRNA. ORGANISM Rattus norvegicus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 1804) AUTHORS Kitagawa,Y., Tahira,T., Ikeda,I., Kikuchi,K., Tsuiki,S., Sugimura,T. and Nagao,M. TITLE Molecular cloning of cDNA for the catalytic subunit of rat liver type 2A protein phosphatase, and detection of high levels of expression of the gene in normal and cancer cells JOURNAL Biochim. Biophys. Acta 951, 123-129 (1988) STANDARD simple staff_review FEATURES from to/span description pept 114 1043 type-2A protein phosphatase catalytic subunit mRNA < 1 1804 type-2A protein phosphatase catalytic subunit mRNA BASE COUNT 482 a 391 c 437 g 494 t ORIGIN 1 ctggggccgc aggaagcacc ccggggagcg gcggcggcgt gtgcgtgtgg cccgggtgcg 61 ggcggcggcg cgggagcagc gcagagcggc agccggttcg ggcgggcggc atcatggacg 121 agaagttgtt caccaaggag ctggaccagt ggatcgagca gctgaacgag tgcaagcagc 181 tctccgagtc ccaggtcaag agcctctgcg agaaggctaa agaaatcctg acaaaagaat 241 ctaatgttca ggaggttcga tgtccagtca ctgtgtgtgg agatgtgcat gggcaatttc 301 atgacctcat ggaactcttt agaattggtg gtaaatcacc agatacaaat tacttgttta 361 tgggagacta tgtggacaga ggatattact cagttgaaac agttacactg cttgtagctc 421 ttaaggttcg ttaccgagag cgtatcacca tactccgagg gaatcacgag agcagacaga 481 tcacacaagt ttatggtttc tacgatgagt gtttaaggaa atacggaaat gcaaatgttt 541 ggaaatactt cacagacctt tttgactacc ttcctctcac tgccttggtg gatgggcaga 601 tcttctgtct acatggtggt ctttcaccat ccatagacac actggatcac atccgagcac 661 ttgatcgcct acaagaagtt cctcatgagg gtccaatgtg tgacttgctg tggtcagatc 721 cagatgaccg tggtggctgg gggatatctc ctcggggagc tggttatacc tttggccaag 781 atatttctga gacatttaat catgccaatg gcctcacgtt ggtgtccaga gctcaccagc 841 tggtgatgga gggatataac tggtgccatg accggaatgt agtaacaatt ttcagtgctc 901 caaactattg ctatcgttgt ggtaaccaag ctgcaatcat ggaacttgat gacactctta 961 agtattcttt cttgcagttc gatccagcac ctcgtagagg cgagccacat gtcactcgtc 1021 gtaccccaga ctacttcctg taatgaaagt ttaaccttgt acagtattgc catgaacacc 1081 gtctgttgac ctaatggaat cgggaagagc agcagtaact ccaaagtgtc agaaatagtt 1141 aacattcaaa cttgtttcca cacggaccaa aagatgtgcc atataaaata caaagcctct 1201 tgtcatcaac agccgtgacc actttagaat gaaccagttc attgcatgct gacgcgacat 1261 tgttggtcaa gaatccagtt tctggcatag cgctatttgt agttactttt gctttcttga 1321 gagactgcag atctaggatg taacattaac acctgtgagt ccagttgact tccacttagc 1381 tgtagcttac tcagcatgac tgtagatgag gatagcaaac aatcattgga gcttaatgaa 1441 catttttaaa tgagtaccaa ggcctcccct cttgttgtgt tctttcaggg atactattaa 1501 tttaattgta tgatttctct gcactcagtt tctcccttct caaatctcgg ccccgcgttg 1561 ttctttgtta ctgtcagaaa acctggtgag ttgttttgaa cagaactgtc tccctcctgt 1621 aagatgatgt actgcacaag tcaccgcagt gttttcataa taaacttgag aactgagaaa 1681 gtcaggtttg aattgtatca gtgggcacga ctggtgctgt ttattaaaca agataaatct 1741 attgatcaat ttcagaattt gtagaattcc aggtaaagaa aaataaagat caaggccact 1801 atat // LOCUS RATSCP2 1409 bp ss-mRNA ROD 17-JUL-1990 DEFINITION Rat sterol carrier protein-2 (SCP-2)mRNA, complete cds. ACCESSION M34728 KEYWORDS sterol carrier protein-2. SOURCE Rat liver, cDNA to mRNA, clone SP43. ORGANISM Rattus norvegicus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 1409) AUTHORS Billheimer,J.T., Strehl,L.L., Davis,G.L., Strauss,J.F.III. and Davis,L.G. TITLE Characterization of a cDNA encoding rat sterol carrier protein-2 JOURNAL DNA Cell Biol. 9, 159-165 (1990) STANDARD simple staff_review FEATURES from to/span description pept 307 1128 sterol carrier protein-2 (SCP-2) mRNA < 1 1409 sterol carrier protein-2 mRNA BASE COUNT 387 a 300 c 390 g 332 t ORIGIN 2 bp upstream of EcoRI site. 1 ggaattccga acaaaggttg aacactttgc aaaaattgga tggaaaaatc ataaacactc 61 agttaataac ccgtattccc agttccaaga tgaatacagc ttagatgaga taatgaaatc 121 aaggccagtt ttcgattttc tgactgtctt acaatgctgt cccacctcag atggtgccgc 181 agcagcaatt gtgtctagtg aggagtttgt gcagaagcat ggcctgcagt ccaaagctgt 241 ggaaattgtg gcacaggaga tggtgactga catgcccagt acatttgaag aaaaagtgtt 301 attaaaatgg ttggctatga tatgagtaaa gaagctgcca ggaagtgcta tgagaagtcc 361 ggcctgggtc ccagtgatgt cgacgtgata gagcttcacg attgcttctc taccaatgaa 421 ctcctgactt atgaagcact ggggctctgt ccagaaggac aaggtggagc actggtggac 481 agaggggaca acacttacgg aggaaagtgg gtcataaacc ctagtggagg cctcatctcc 541 aagggacacc cactgggtgc cacaggtctg gctcagtgcg cggagctctg ctggcagctg 601 agaggcgaag ccggaaagag gcaggttcct ggggcaaagg tggctctgca gcacaattta 661 ggccttggag gagctgctgt tgtcaccctc tacagaatgg gttttcccga agctgccagc 721 tccttcagaa cgcaccagat ttcagctgct cccaccagct ctgcagggga tggattcaag 781 gcaaatctca tttttaagga aatcgagaag aagcttgaag aggaagggga agagttcgtg 841 aagaaaatcg gtggcatttt tgccttcaaa gtgaaggatg gccccggggg caaagaagct 901 acgtgggtgg tggacgtgaa gaacggcaaa ggatcggtgc ttccggattc agataagaag 961 gctgactgca caatcaccat ggctgactca gacttgctgg ctttgatgac tggtaaaatg 1021 aaccctcagt cggccttctt tcaaggtaaa ctgaaaattg ccggtaacat gggcctggcc 1081 atgaaactgc aaagcctgca gcttcagccg gacaaagcta agctgtgaag agtccctttg 1141 gcaacctcag gacatcaaga tgagatgtgt ggatacgtag aaatccacgt ctccctgtca 1201 ggacttagac tgacacttcc tgaatagcat gagatagatt tcttgctagg tggctatggc 1261 caattgtatt tcccccaagc tgggggtgca aagggcctcc caggctacac tgctgctttg 1321 aggacttgca ttctactgtg cttcatgaag ctactatgtt aatgatggtt tggggtaaac 1381 ttgagtttca gaataaagtt cagaatagt // LOCUS SYNPSBAII 556 bp ds-DNA BCT 17-JUL-1990 DEFINITION Synechococcus sp. photosystem II D1 protein (psbAII) gene, 5' end. ACCESSION M34833 KEYWORDS D1 protein; photosystem II. SOURCE Synechococcus (strain PCC 7942) DNA. ORGANISM Synechococcus sp. Prokaryota; Bacteria; Gracilicutes; Oxyphotobacteria; Cyanobacteria; Chroococcales. REFERENCE 1 (bases 1 to 556) AUTHORS Bustos,S.A., Schaefer,M.R. and Golden,S.S. TITLE Different and rapid responses of four cyanobacterial psbA transcripts to changes in light intensity JOURNAL J. Bacteriol. 172, 1998-2004 (1990) STANDARD simple staff_review FEATURES from to/span description pept 81 425 ORF1 pept 527 > 556 photosystem II D1 protein (psbAII) mRNA 59 > 556 psbAII mRNA (alt.) mRNA 478 > 556 psbAII mRNA (alt.) BASE COUNT 136 a 145 c 142 g 133 t ORIGIN 1 ttccgtgacg gctactgcca gcatgccgag cctgatgtgt gacacctaag atcactccag 61 ttctctttgg aaactggctg atgagtgaag acaccatctt tggcaagatc atccggcgcg 121 agattccagc agacattgtt tatgaagatg atctctgtct ggcttttcga gatgtggcac 181 cccaagcgcc ggttcacatt ctggtgattc ccaagcaacc aattgccaac cttttggaag 241 cgacagcaga acatcaagcg ctgctgggtc atttgttgct gactgtaaag gcgatcgcgg 301 cccaagaagg actcaccgag ggctaccgca ccgtgattaa cacgggccct gcgggtgggc 361 aaaccgttta ccacctgcat attcacttac tgggcgggcg atcgctggct tggccgcccg 421 gctgagaaaa gtctgaaagt tctttacaaa actcaatctg cttgttagat tttactcacg 481 aggctattaa gtctcgtaaa tagttcaact aaggactcat cgcaaaatga cgactgcatt 541 gcagcggcgc gagagc // LOCUS ACCPCACGH 2391 bp ds-DNA BCT 17-JUL-1990 DEFINITION A.calcoaceticus gamma-carboxymuconolactone decarboxylase (pcaC) and protocatechuate 3,4-dioxygenase alpha and beta subunit (pcaGH) genes, complete cds. ACCESSION M33798 KEYWORDS gamma-carboxymuconolactone decarboxylase; protocatechuate 3,4-dioxygenase. SOURCE A.calcoaceticus DNA, clone pZR2. ORGANISM Acinetobacter calcoaceticus Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Neisseriaceae. REFERENCE 1 (bases 1 to 2391) AUTHORS Hartnett,C., Neidle,E.L., Ngai,K.-L. and Ornston,N. TITLE DNA sequences of genes encoding Acinetobacter calcoaceticus protocatechuate 3,4-dioxygenase: Evidence indicating shuffling of genes and of DNA sequences within genes during their evolutionary divergence JOURNAL J. Bacteriol. 172, 956-966 (1990) STANDARD simple staff_entry FEATURES from to/span description pept 324 728 gamma-carboxymuconolactone decarboxylase (pcaC) (EC 4.1.1.44) pept 750 1463 protocatechuate 3,4-dioxygenase, alpha subunit (pcaH) (EC 1.13.11.3) pept 1491 2120 protocatechuate 3,4-dioxygenase, beta subunit (pcaG) (EC 1.13.11.3) ORF < 1 203 ORF (AA at 3) binding 312 315 ribosomal binding site (put.) binding 737 740 ribosomal binding site (put.) binding 1478 1482 ribosomal binding site (put.) BASE COUNT 670 a 463 c 533 g 725 t ORIGIN 1 bp upstream of HindIII site. 1 aagcttatca aacccaacgt tacttgctct gtttcatttt atgtgcaggt attgccgtca 61 atggtgctca atcttcaatg ccagtattga gtgcgcgttt ctatccaacg caatgtcgtg 121 caacaggtgt ggcatggatg tctggaattg gccgttttgg tcgccgtatt tggtgcatgg 181 attggtgcag ttttattagg taataactgg tcatttacca tgatcctgag catgttgata 241 tcctgctcgt gccgctgcaa ttgcaatttt tgttaaatcc cttgttggca catacagatg 301 caacttgatg tgaggtttct cttatgaatg atgaacaacg ttataaacaa gggctggaag 361 tccgtacaga agttctgggt gaaaagcatg ttaaccgttc acttgagaac ctaaacgact 421 tcaatcagga ttttcaaaac tttattagcc gttttgcatg gggagaggtt tggtctcgtc 481 ctggacttcc tcgtcatacg cgtagtttgg tcacaattgc agtgttgttg gcgcttgggc 541 gtgaagatga gctgcgtatc gatttacgtg cctgtttcaa caatggtgtg accaaagatg 601 aattaaaaga gttgattcta cattgttcac tttatgcagg cttacccgct tcaaatgctg 661 cgatgcatat ggcagaagag gtctttaagg acttgggaat tgcacctgaa aaagtaaaca 721 aggattaatg gatttagagg gagtagaaaa tgtctcaaat tatttgggga gcttacgctc 781 agtcgaatac cgaagatcat ccgccagctt atcgaccagg ttataaaacc agtgtgttac 841 gctcgcctaa gaatcgactc atttctattg cagaaacttt aagtgaagtc actgcaccac 901 attttagtgc tgataaattt ggtccaaaag acaatgactt aatcttgaac tatgccaaag 961 atggtttgcc gattggtgaa cgtgtcattg tgcatggtta tgtacgtgac cagtttggtc 1021 gacctgtaaa aaatgcactt gttgaagtat ggcaagccaa tgcatctggt cgttatcgtc 1081 atccaaatga tcaatatatc ggtgccatgg atcctaattt cggtggttgt ggccgtatgt 1141 tgactgatga taatggttat tatgtattcc gtacgattaa accaggtcca tacccatggc 1201 gtaaccgtat caatgaatgg cgtcctgctc acattcactt ctctttaatt gcagatggtt 1261 gggcacaacg tttaatttcg cagttctatt ttgaaggcga tacattgatt gattcttgcc 1321 caatcctgaa aaccattcct tctgagcaac aacgtcgtgc gctaattgca ttagaagaca 1381 agagcaattt cattgaagct gacagccgtt gttatcgctt tgacattacg cttcgtggtc 1441 gcgcggactt acttcgaaaa tgacttaact taatggatgg agtaaaaaac atgaatggtt 1501 ggaattttca ggaactcaaa gaaaccccat ctcaaacagg tggtccatat gtccatattg 1561 gtcttttgcc aaaacaagcc aatatcgaag tatttgaaca caatttaggt aacaaccttg 1621 tacaggacaa tacgcaaggt caacgcatcc gattagaagg tcaagtgttt gatgggctta 1681 gtttacctct acgtgacgta ctgattgaaa tctggcaagc agataccaat ggcgtttacc 1741 caagtcaagc tgatactcaa ggtaaacaag tcgatccgaa ctttttgggt tggggccgta 1801 caggtgcaga ttttggtaca ggcttctgga gttttaatac cattaagcca ggtgctgttc 1861 caggccgtaa aggttcaact caagcaccac atatttcact gattattttt gcgcgtggta 1921 tcaacattgg tcttcacact cgtgtctatt ttgatgacga agcagaagca aatgcaaaag 1981 atcctgtttt aaacagtatc gagtgggcga cacgtcgtca aacactggtt gccaaacgtg 2041 aagagcgtga tggtgaagtg gtttatcgtt ttgatattcg tattcaaggt gaaaacgaaa 2101 cggtattttt cgatatctaa atttttgact atggaacggg ccgtatatct ttcggtatac 2161 agcctccata attttacaag gaaacgtaaa tgttcaatac aaacaaactg gttaaacaac 2221 tttgtctaag tggtgcgttg ttgatgtcat tgcctatatt gtcaacgacc tatgcggctg 2281 aaaatactgt accagcatct aaatcaactt atgttgtaaa aaatttaaac attggtgatt 2341 tacctgttaa aaccctcgtg ccaatcactg ctaaaacaag agagcaagct t // LOCUS HAMSAA3A 530 bp ss-mRNA ROD 17-JUL-1990 DEFINITION M.auratus serum amyloid A (SAA3) mRNA, complete cds. ACCESSION M33431 KEYWORDS serum amyloid A protein. SOURCE M.auratus (strain LVG) embryo fibroblast, cDNA to mRNA, clone pAA825. ORGANISM Mesocricetus auratus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Cricetidae; Cricetinae; Cricetini. REFERENCE 1 (bases 1 to 530) AUTHORS Gervais,C. and Suh,M. TITLE Serum amyloid A protein-related mRNA expression in herpes simplex virus type 2 transformed hamster cells JOURNAL Mol. Cell. Biol. (1990) In press STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by M.Suh, 30-MAR-1990. FEATURES from to/span description pept 42 410 serum amyloid A signal 510 515 polyA signal BASE COUNT 147 a 122 c 140 g 121 t ORIGIN 1 ggggacctct agagtcactc accaggcatc tcgaaagcag gatgaagcct ttccttgcca 61 tcattttttg cttcttgatc ctgggagttg acagccaaag gtggttccag ttcatgaaag 121 aagctggcca agggagtaca gacatgtgga gagcttactc tgacatgaga gaggccaact 181 ggaaaaactc agataaatac ttccatgccc gggggaacta tgatgctgcc aaaaggggac 241 caggaggagc ctgggctgct aaagtgatca gtgatgccag agagggtatt cagaggttca 301 caggacgtgg agcagcggac tcaagagctg accagtttgc caacaaatgg ggccggagtg 361 gtaaagaccc caaccacttt cgacctgctg gcctgcctag taaatactga gtttcatctt 421 catgctattc ccagccatgc accccaaggt tgcatttact agattaagtt agttcctgag 481 ctctggacgt gtaagcaccc gaaatttaga ataaatacct gtgaaatgct // LOCUS HEIBEXBA 795 bp ds-DNA BCT 17-JUL-1990 DEFINITION H.influenza (strain RM153) capsulation protein (bexB) gene, 5' end. ACCESSION M33787 KEYWORDS capsulation protein. SOURCE H.influenza (type b, strain RM153) DNA. ORGANISM Haemophilus influenzae Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Facultatively anaerobic rods; Pasteurellaceae. REFERENCE 1 (bases 1 to 795) AUTHORS Kroll,J.S. and Moxon,E.R. TITLE Capsulation in distantly related strains of Haemophilus influenzae type b: genetic drift and gene transfer at the capsulation locus JOURNAL J. Bacteriol. 172, 1374-1379 (1990) STANDARD simple staff_entry FEATURES from to/span description pept 1 > 795 capsulation protein (bexB) BASE COUNT 188 a 133 c 177 g 297 t ORIGIN 1 atgcaatatg gtgataaaac aacatttaaa cagtcattag ctattcaagg acgggtgatt 61 aacgcgttgc ttatgcgtga aataatcact cgttatgggc gtcaaaatat tggctttttt 121 tggctatttg ttgagccttt gctcatgact ttttttattg taatgatgtg gaaatttatc 181 cgtgctgata aattttctac tttaaatatg attgcctttg tgatgacggg gtatccaatg 241 gcaatgatgt ggcgaaatgc ttcaaaccgt gcgattggtt ccatttctgc aaacttgagt 301 ttactttatc accgaaatgt tcgagtgcta gataccattt ttactcgcgt gttgcttgag 361 gttgctggtg cttctattgc tcaaattctt tttatggcaa tcttagtcat gattgattgg 421 attgatgcac cgcatgatgt gttttatatg ctcattgcat ggttcctcat ggcaatgttt 481 gcttttggct tggggttaat catttgtgcg attgctcagc aatttgatgt gtttggtaaa 541 atttggggaa cactcagttt cgttttatta ccaatttctg gcgcattctt cttcgtccat 601 aaccttccag cacaggcaca atctattgca ctttggtttc cgatgattca tggtacagaa 661 atgttccgcc atggctattt tggcgataca gtggttactt acgaaagtat aggctttttg 721 gttgtgagtg atctagcctt gttgttactg ggcttagtga tggttaaaaa ctttagtaaa 781 ggagttgagc cacaa // LOCUS HEIBEXBB 795 bp ds-DNA BCT 17-JUL-1990 DEFINITION H.influenza (strain RM926) capsulation protein (bexB) gene, 5' end. ACCESSION M33788 KEYWORDS capsulation protein. SOURCE H.influenza (type b, strain RM926 ATCC 8468) DNA, clones lambda-926.[8,11,14]. ORGANISM Haemophilus influenzae Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Facultatively anaerobic rods; Pasteurellaceae. REFERENCE 1 (bases 1 to 795) AUTHORS Kroll,J.S. and Moxon,E.R. TITLE Capsulation in distantly related strains of Haemophilus influenzae type b: genetic drift and gene transfer at the capsulation locus JOURNAL J. Bacteriol. 172, 1374-1379 (1990) STANDARD simple staff_entry FEATURES from to/span description pept 1 > 795 bexB protein BASE COUNT 188 a 128 c 178 g 301 t ORIGIN 1 atgcaatatg gtgatcaaac aacgtttaaa cagtcattag cgattcaagg aagagtgatt 61 aatgcattac ttatgcgtga aataatcact cgttatgggc gcaaaaatat tggctttcta 121 tggctatttg ttgagccttt actcatgact ttttttattg tgatgatgtg gaaatttatt 181 cgtgctgata aattttcaac tttaaatatg attgcttttg tcatgaccgg gtatcctatg 241 gcgatgatgt ggcgaaatgc ttctaatcga gctattggtt cgatttctgc aaacttgagt 301 ttactttatc accgaaatgt tcgagtgcta gatactatct ttactcgagt attgcttgag 361 gttgcaggtg cttcgattgc tcaaatcctt tttatggcag ttttagtgct tattggttgg 421 attgatgcac cacgtgatgt gttttatatg cttatggcat ggtttctcat ggctatgttt 481 gcttttgcct taggccttat tatttgtgcg gtagctcaac aatttgatgt gttcggtaaa 541 atttggggaa ccttaagttt cgtattgtta ccaatttctg gtgcattttt ctttgttcac 601 aaccttccat cacaggctca atctatcgcg ctttggctcc caatgattca tggtacggaa 661 atgtttcgtc atggctactt tggggatacg gtcgttactt atgagagtat cggattttta 721 gtcgtaagtg atttggcctt attgctaatg ggcttggtga tggttaaaaa ctttagtaag 781 gggattgagc cacaa // LOCUS PSEARCDA 1502 bp ds-DNA BCT 17-JUL-1990 DEFINITION P.aeruginosa membrane protein arcD (complete cds) and arcA (5' end) genes. ACCESSION M33223 KEYWORDS membrane protein. SOURCE P.aeruginosa (strain PAO1) DNA, clone pME183. ORGANISM Pseudomonas aeruginosa Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Pseudomonadaceae. REFERENCE 1 (bases 1 to 1502) AUTHORS Luethi,E., Baur,H., Gamper,M., Brunner,F., Villeval,D., Mercenier,A. and Haas,D. TITLE The arc operon for anaerobic arginine catabolism in Pseudomonas aeruginosa contains an additional gene, arcD, encoding a membrane protein JOURNAL Gene 87, 37-43 (1990) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by D.Haas, 26-MAR-1990. FEATURES from to/span description pept 24 1472 membrane protein (arcD) pept 1494 > 1502 membrane protein (arcA) binding 8 15 ribosome binding site (put.) BASE COUNT 247 a 507 c 443 g 305 t ORIGIN 1 attcctatag ggagattccg aaaatgtccc aagaatccag ccaaaaactc cgactaggag 61 cgctaacggc tctggttgtc ggctcgatga tcggaggggg catcttctcc ctcccgcaaa 121 acatggcagc cagcgccgat gtcggcgccg tcctgatcgg atgggcgatc accgccgtcg 181 gcatgctcac cctcgctttc gtcttccaga ccctcgccaa ccgcaagccc gaactcgacg 241 gcggcgtgta cgcctacgcc aaggccggct tcggcgacta catgggtttc tcctcggcct 301 ggggctactg gatcagcgcc tggctgggca acgtcggcta cttcgtcctg ttgttcagca 361 ccctcggcta cttcttcccg atcttcggca agggcgacac cgtcgcggcg atcgtctgcg 421 cctcggtcct gctctgggcc ctgcacttcc tggtgctgcg cgggatcaag gaggcggcgt 481 tcatcaacac cgtcaccacc gtggccaagg tcgtgccgct gttcctgttc atcctgatct 541 gcctgttcgc cttcaagctg gacatcttca ccgccgacat ctggggcaag agcaacccgg 601 acctgggcag cgtgatgaac caggtgcgca acatgatgct ggtcaccgtc tgggtgttca 661 tcggcatcga gggcgcgagc atcttctcct cccgcgcgga aaaacgttcc gacgtcggca 721 aggccaccgt gatcggcttc atcaccgtcc tgctcctgct ggtgctggtc aacgtgctgt 781 ccatgggcgt gatgacccag ccggaactgg ccaagctgca gaacccgtcg atggcgctgg 841 tactcgagca tgtggtcggc cactggggcg ccgtgctgat cagcgtcggc ctgctgatct 901 cgctgctggg cgcgctgctc tcctgggtgc tgctgtgcgc cgagatcatg ttcgccgccg 961 ccaaggacca caccatgccg gagttcctgc gccgcgagaa cgccaaccag gtgccggcca 1021 acgccctgtg gctgaccaac atctgcgtac aggtgttcct ggtggtggtg ttcttcacct 1081 cgggcgaccc ggacggcatg gacccgtaca ccaagatgct gctcctggcc acctcgatga 1141 tcctgatccc gtacttctgg tccgccgcct atggcctgct gctgaccctg aagggcgaga 1201 cctacgagaa cgatgcccgc gaacgcagca aggacctggt catcgccggc atcgccgtgg 1261 cctacgcggt ctggctgctc tacgcggggg gcctgaagta cctgctgctg tccgccctgc 1321 tgtatgcgcc cggcgcgatc cttttcgcca aggccaagca cgaggtcggc cagccgatct 1381 tcaccggaat cgaaaaactg atcttcgccg cagtcgtcat tggcgccctg gtggctgcct 1441 acggcctcta cgacggtttc ctcaccctct gattactcca ctggagacgt gaaatgagca 1501 cg // LOCUS TOMCD4A 5429 bp ds-DNA PLN 17-JUL-1990 DEFINITION Tomato ATP-dependent protease (CD4A) gene, complete cds. ACCESSION M32603 KEYWORDS ATP-dependent protease. SOURCE Tomato (L.esculentum) DNA. ORGANISM Lycopersicon esculentum Eukaryota; Plantae; Embryobionta; Magnoliophyta; Magnoliopsida; Asteridae; Solanales; Solanaceae. REFERENCE 1 (bases 1 to 5429) AUTHORS Gottesman,S. and Pichersky,E. TITLE A conserved regulatory subunit for ATP-dependent protease in prokaryotes and eukaryotes JOURNAL Proc. Natl. Acad. Sci. U.S.A. (1990) In press STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by Pichersky,E., 08-MAR-1990. FEATURES from to/span description pept 787 1357 ATP-dependent protease (CD4A), exon 1 1461 1576 ATP-dependent protease (CD4A), exon 2 1828 1950 ATP-dependent protease (CD4A), exon 3 2034 2219 ATP-dependent protease (CD4A), exon 4 2853 3068 ATP-dependent protease (CD4A), exon 5 3248 3465 ATP-dependent protease (CD4A), exon 6 3768 3843 ATP-dependent protease (CD4A), exon 7 3926 4015 ATP-dependent protease (CD4A), exon 8 4125 5309 ATP-dependent protease (CD4A), exon 9 pre-msg 1 5429 CD4A mRNA and introns IVS 1358 1460 CD4A intron A IVS 1577 1827 CD4A intron B (no splice consensus) IVS 1951 2033 CD4A intron C IVS 2220 2852 CD4A intron D IVS 3069 3247 CD4A intron E IVS 3466 3767 CD4A intron F IVS 3844 3925 CD4A intron G (no splice consensus) IVS 4016 4124 CD4A intron H BASE COUNT 1521 a 972 c 1221 g 1715 t ORIGIN 1 tcgaagttgt aaaggttggt cttttgcttc tacttgtttg gactagtttc tttccaagta 61 atggggttat taagttggta ttgttgtgga ttttgacagg atatgggggt atgatgtagc 121 tctcatgttt atattttcaa ttaagcaatg tagttctggt tgactagttg tggtgctaga 181 attcccaatt aacctcttca ctgaattact cctagtgctg aataaaactg ccttatctca 241 agggtgtggg ttgttgttgt attctagagg gtttgtttat ataatcccat tggaaccaag 301 ttaacctatg caataggaat tatatattca agattctgct agtttagaaa ctaaagtcct 361 tgcagcttgg ctgctttatc ctgtatgaaa tagttaccat cctgtggttg catagttttc 421 caggtcttct tgttttgagg ctttattatt cattttactg atgatataac ttttatgatg 481 aaatttgaat ttagttttaa gcttatgcgc taggtgtatg ttgctcaaca tgcattcgca 541 gagttttttg aagtatggca ttctcataat gatcctattt gtctacatta tgtcatctaa 601 agaaattaac cttcaacggt taatggttta caatggtcca aaaaaagtag ccttcaatgg 661 ttattagtga tggttttgtt gatgtctgct aactatatgt atatgttatg taatatgatc 721 cttggtactt cttccattgc cactaatgat cttttcttat atgctatcat gtcaggttgc 781 aggtccatga tggctagagc tttagttcag tcaacaaaca ttctaccttc agttgctggt 841 gaaagggctg gacaattcaa tggatcccgg aaagatcaaa gaactgttag aatgctatgt 901 aatgtaaaat gttgttcctc aaggctgaac aattttgcag gactgcgagg gtgcaacgca 961 ttagatacac tactagtaaa atctggagaa actctccatt cgaaagtggc agctgcaact 1021 tttgtcagac ggccacgagg gtgccgattt gtcccaaaag caatgtttga gcggttcact 1081 gagaaagcaa taaaagtcat tatgcttgcc caagaagagg ccagacgact aggtcacaac 1141 tttgttggga cagaacagat cctgctgggt cttattggtg agggaactgg tattgctgct 1201 aaggttctta aatccatggg aattaatttg aaagatgctc gtgtggaagt agagaagatc 1261 attggcaggg gtagtggatt tattgcagtg gaaatccctt ttactcctcg tgcaaagcgt 1321 gttctagaac tatctctgga ggaagcccgc caactaggta cccttatcta tcatttcagc 1381 cttttgttgg atggcctaac tgtcatgcga actcaaaatt catttctttg ttgcatctgt 1441 tgattggttt tgtgatgcag ggcataatta tattggctcc gagcacttgt tacttggact 1501 gttgcgcgaa ggtgaaggtg tggctgcccg tgttcttgaa aacttgggtg ctgaccccac 1561 taacattcgc acacaggcaa gtgattttcc tttctcaaac tggtttgaaa tgcttttctt 1621 gaagccaagg gtctaatcag aaacagcctc tctacctcca caagtagggg taaggtctgc 1681 atacatctta ccttcccaga tgccacctgt gggattacac tgggtatgtt gttgttgttg 1741 tttacttctg atgggggtga gggtatttta aattgcttga cttcattgta atcagtttct 1801 cttttttctt gtaaaataaa taaataggtg atacggatgg ttggcgagag tagtgaggct 1861 gttggtgcta gtgttggagg tggaacttct ggcctaaaga tgcctacatt ggaggagtac 1921 ggcaccaatt tgacaaagtt agctgaagag gtacatagaa agataagtcc gtcataatgt 1981 ggcaagtgat tggctctctt caattatttg atatggtcat attatatttt caggggaaac 2041 tagatcctgt agttggaaga caggctcaaa tcgagcgtgt tactcaaatc ttgggtcgcc 2101 gtaccaaaaa caacccttgc cttattggag aaccaggtgt tggaaaaact gctattgccg 2161 aagggctagc tcaaagaatt gctaatggtg atgtccctga aacaatagaa ggaaagaagg 2221 tcacaggggt gtatgataaa ttgcataaca tgttttatgc ttgagctttt tttctcttta 2281 tcaatttcat ttcaactgtg tggcagatcc agaagctgtg ctacattctt agatagaaag 2341 acttcaaatc atcattttta taatgtctta gaatttgatt ttatacagag ggtaattttt 2401 taaattggaa gtaaatattc tgcagacaaa tatatataca caaatattgt tatctgatcc 2461 cccatataac acccccaccc cccaccccaa aagcctctag gccaaaaatt tacgcgattc 2521 taagagtatc caatcaatat ttttggtagg cgtgccaaaa atttacagaa acatttgaat 2581 gtcccactaa cttgtacgaa caggttttca aatttcaacc gccttgagcc actacatatt 2641 cttcattgtc tgatatatgg actcaaggca tatatcaggt ttagttccct cttacttcaa 2701 ttctagtcta actgaagtca tagggaagat gaacctattt atctaaatct ttttatatgc 2761 aactattttg atgaggtata ccctcctttt tcaaaaaatg tgtatatcct tgtcattata 2821 cgagtgtatg acttgctttg ggtgatggtc aggtcataac acttgatatg ggattgcttg 2881 ttgctgggac aaaataccgt ggagagtttg aggaaagact gaagaaacta atggaggaaa 2941 ttaaacaaag tgatgaaata atactgttta ttgatgaagt acacacattg attggagctg 3001 gagctgcaga gggagcaatt gatgctgcaa acatcttgaa acctgccctg gctagaggcg 3061 agctacaggt tcgttatcgt ttcgacctgt agtatgttgt tccttggaat ttcaatgagc 3121 actatgaaat ctaccttaaa tttggtgttt aatcattatc ttggtccgag tacatcttgg 3181 taaaactaca aaagaatttc aaagttggtg ttaattaaga agtattttgc ttggtttcac 3241 ttgtcagtgt attggagcca caaccctgga tgagtacaga aagcacattg agaaagaccc 3301 tgcactagag agaaggttcc agccagttaa agtccctgaa ccttctgtgg atgaaaccat 3361 acagatcttg aaagggcttc gggagaggta tgagattcat cacaagctcc attacactga 3421 tgaggcaata gaggccgctg ccaagctttc tcaccagtac atcaggtatt attattaatc 3481 atatattcat tttccaaagt atcttccgta gtcaagttgt tggtacattt tgactttggg 3541 atttcacggc aaactaattt attcttgtat gatttaagat atatccttct caataataaa 3601 actcttagga taagcggctt ttcttttgta tcagtgtctg aataggctga cgcacggttt 3661 atctcctgca aattgcacat ttatctccac tccaaaacaa tcatcttttt ccgtgattta 3721 tctgaagtaa agcattaata tgtttttgag ttgaaccttt attccagtga ccgctttctg 3781 cctgataaag caattgactt gattgatgaa gctggttccc gtgttcgact tcgtcatgca 3841 caggcataga gtactctttt tgtcttactc tgtcctgcca tttcttccta agggagtcat 3901 gacccgttat attgatacta tctagctccc tgaggaagca agagagctgg aaaaggagct 3961 tcgccagatt acaaaggaga aaaatgaagc tgtccgcggt caagattttg agaaggtact 4021 ggtttcctct cgtcctgtaa aaaaattcct ctggactgat tatttcttag aaaacattga 4081 gattttcagg tgtgcaattt taatcctctt tatgcattgg acaggctggg gagttgcgtg 4141 atagagaaat ggatcttaag gcacaaatct cagccttgat agacaaaaac aaagagaaga 4201 gcaaggcaga atctgaggca ggagatgcag caggtcccat agtgacagag gcagatattc 4261 agcacattgt ctcttcctgg accggcattc ctgtagagaa ggtctcaact gacgaatctg 4321 atcgcctcct aaaaatggaa gaaacacttc atacccgagt cattggccag gatgaagcag 4381 taaaagccat tagtcgcgct atcagacgtg cccgtgttgg cctcaagaat cccaaccgac 4441 ctattgccag tttcatcttt tctggtccaa ctggtgttgg gaagtcagaa ctggcaaagt 4501 ctttagcaac gtactacttt ggttctgaag aagcaatgat tcggcttgat atgagtgagt 4561 ttatggaaag acacactgtg tctaaactca tcggttcacc ccctgggtat gttggttaca 4621 ctgaaggtgg tcaattaact gaagctgtta ggcgtcgacc ttatacagtt gtgctctttg 4681 atgagattga gaaggctcat cctgatgtct tcaacatgat gcttcaaatt cttgaagatg 4741 gaagattgac agacagcaaa ggcagaactg tcgacttcaa gaatacactc ctcatcatga 4801 catcgaatgt tggaagcagt gtgatagaga aaggaggtcg tcgtataggt tttgatctag 4861 attttgacga gaaggatagt agttacaacc gtatcaagag cttggtgact gaggagttga 4921 aacagtactt caggccagag tttttaaaca gattgagtga gatgattgta ttccgtcagc 4981 tcactaagtt ggaggtgaag gagattgccg atatcatgct taaggaggtc tttgtgaggt 5041 tgaagaataa ggagatagaa ctccaagtga cagagaggtt tagagatagg gtagttgatg 5101 aaggatataa cccaagctat ggagctagac cattgaggag agctattatg agactgctag 5161 aggatagtat ggccgagaag atgcttgcag gtgagatcaa agaaggtgat tcagtaattg 5221 tggatgtgga ttcagatggc aacgtcaccg ttctcaatgg cactagtggt gctccctcag 5281 attcagctcc tgagcctatc cttgtgtaga tcaattcttc atttagctct acaaatttgt 5341 tcttcgtaat gttgcttttg gatgtcttgg ctaagcaaca tgtgagtaat tttttgggtg 5401 acaggctgat atattcttcc aagttgact // LOCUS TOMCD4B 5846 bp ds-DNA PLN 17-JUL-1990 DEFINITION Tomato ATP-dependent protease (CD4B) gene, complete cds. ACCESSION M32604 KEYWORDS ATP-dependent protease. SOURCE Tomato (L.esculentum) DNA. ORGANISM Lycopersicon esculentum Eukaryota; Plantae; Embryobionta; Magnoliophyta; Magnoliopsida; Asteridae; Solanales; Solanaceae. REFERENCE 1 (bases 1 to 5846) AUTHORS Gottesman,S. and Pichersky,E. TITLE A conserved regulatory subunit for ATP-dependent protease in prokaryotes and eukaryotes JOURNAL Proc. Natl. Acad. Sci. U.S.A. (1990) In press STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by Pichersky,E., 08-MAR-1990. FEATURES from to/span description pept 1111 1675 ATP-dependent protease (CD4B), exon 1 1896 2011 ATP-dependent protease (CD4B), exon 2 2133 2255 ATP-dependent protease (CD4B), exon 3 2326 2511 ATP-dependent protease (CD4B), exon 4 3020 3235 ATP-dependent protease (CD4B), exon 5 3445 3662 ATP-dependent protease (CD4B), exon 6 3978 4053 ATP-dependent protease (CD4B), exon 7 4138 4227 ATP-dependent protease (CD4B), exon 8 4354 5535 ATP-dependent protease (CD4B), exon 9 pre-msg 1 5846 CD4B mRNA and introns IVS 1676 1895 CD4B intron A IVS 2012 2132 CD4B intron B (no splice consensus) IVS 2256 2325 CD4B intron C IVS 2512 3019 CD4B intron D IVS 3236 3444 CD4B intron E IVS 3663 3977 CD4B intron F IVS 4054 4137 CD4B intron G (no splice consensus) IVS 4228 4353 CD4B intron H BASE COUNT 1592 a 1067 c 1283 g 1904 t ORIGIN 1 aagcttctgt agcccaaagt tttgtccttt cacttcttgt ctactcctca tactgtctct 61 tccaaattca acagttgcca ctgtcgattt ttcagcagta aatttacctt gctcaatatc 121 aactccccat agcccaccca aacttctctt catttttata ataatgacgt tgcatgtttc 181 atacctcgcc taggaatgtg ttcatcgaat ttccgtcatt gtcacatcat atctgttctt 241 gtcctttact ctttttcctt ttctttcctg tactctcgtt cgtgttgtta ggatttgtac 301 tgaaaccgtt gggttatcag ttttacagac ctttaagcag tctctctagg gtttactaag 361 agctggaaat ttgaataatc tgatcaaaat tgctgaggtt cttttaatcc ttctaaagat 421 ctctgtagca gagtggtgaa aagcggaaca gggggaatac tcggtacaaa tggtgttgtg 481 ttcaattgca aagaccaaac ctcccattct aatgtgttgc aacaaactat taggggagtt 541 ttaaggaaga gaaaatgggg gataatgcta cacaatgcta acctattata atcatctatt 601 tagtttgcaa accccacata tctctgtgtg tattgtccta tttgttcaat ctattcatga 661 aaccttggct actcaatcct ggattgcgca cgtactactc agcagttgcg tagtgcccta 721 gttcttatat cttggagggt agactgaatt tggtttatgt caatgaatca tgtttcacac 781 agatggttac atattttatt acatagaatc tatatgtcat gcatcgactc ctgtcgagta 841 tggatacttc ctacttcttc ttggaactct ctgattgagt tatctttctg ctaatggact 901 gtgcctcact atttttatct tcctctggat tggaaaatat ggacattcac ttgttatcaa 961 tttttgaatt aataatttgt tgatatgttg gctgattgat gtgtcattga cttcatatac 1021 tgttatgggt ttgtcatgtg gaagatttga tattttcgat tatctaagga gagctttttt 1081 gattattttc ttatcctggg caggtccatc atggctagag ctctagttca gtcaacaagc 1141 atcccatcgt cagttgctgg tgaaaggaca accaaattca atggatctgg gaaaacaaaa 1201 agagctgtta caatgctatg caatgcacaa tcatcttcac ttacactgag ggattttaca 1261 ggattgcgag gatgcaacgc aatagataca ctagttagat ctggagaaac tctccaatcc 1321 aaagtagctg ctgcaactta tgtcagacgg ccacgaggtt gccgatttgt accaaaagca 1381 atgtttgagc gcttcacaga gaaggcgata aaagtcatta tgcttgcaca agaagaggcc 1441 agacgacttg gtcacaattt tgttggcact gagcagatct tgttgggtct aattggtgag 1501 ggaactggta ttgctgccaa ggttcttaaa tcgatgggga tcaatttgaa agatgctcgt 1561 gtggaagtgg aaaagataat tggaaggggt agtgggtttg ttgctgttga gatccctttt 1621 acccctcgtg ccaagcgtgt tctggaactc tctctggagg aagcccgcca gctaggtatc 1681 cttttttgat tgtctgtatt tgttgtatta cctttgggac atgtctaaaa gtggttctcc 1741 ggttgagtct tattctcatg ttgctcgttg acacctttgg tttttcccaa aattcattga 1801 ttttctttct ctttgcttgc ctttctggtc ttctacatgc accagcatga ttatttgtct 1861 aaatatttat tccatttata aactggttgg ttcagggcat aactatattg gttcggagca 1921 cttgctactt ggattgctac gtgaaggtga aggtgtggct gcccgtgttc ttgaaaactt 1981 gggtgctgac cccagcaaca tccgcactca ggcaagtagt agatctcaca tgcttcctct 2041 gggtagtcta atattgtgct tttgtgggaa gttggatttt ttggtttcat catgaccagt 2101 gttcattggt tttattcttc ttctaaaaat aggtgatccg gatggttggt gagagtaatg 2161 aggctgttgg tgctagcgtt ggaggtggaa cttctggcca aaaaatgcca acactggagg 2221 agtatggaac aaatttgaca aaattggcgg aagaggtaat ctaaaaaact atatttgtga 2281 tgatatgtga acagttatga ataattgtcc atttttggtt tgtagggaaa attggacccc 2341 gttgttggaa gacagccgca aattgaacgg gtcactcaaa tcttgggtcg gcggactaag 2401 aacaaccctt gtcttattgg tgaaccaggt gttggcaaaa cagctattgc tgagggtttg 2461 gcacaaagaa ttgcaaatgg tgatgtccct gaaacaattg aggggaagaa ggtacgtcat 2521 gtggatagct tgcagaatgt gatgtttacc tgtgcccttt cttctcccat ttgttgtatc 2581 atatggtgaa cccagaagct gagcagggtt atgaggtgca gtctagtggt attttcgatt 2641 cacaaggatt gggttcaaat gaatgtgcat tctatctcca ctaaattgaa atatgtcaaa 2701 ttatattttc agtgtctgtg ttctcttgaa tttttttata tataaattct tccgtgctac 2761 ttgttctctt ggcatatttc tttttagtag attgatttgt cgatctatgc ttataactaa 2821 ttgagaaaaa acgattagta tcccttgttg aacttaattt cttagctcaa actatcaaac 2881 tgaagtgatt tcaagtgaga aaggtccacc acctattctt gtgaatctaa tccatctgaa 2941 actttgtcaa atgtttcgtg ctatgatata gctacctttc tcgttgttcc taagagtcgc 3001 tttttttggt gataattagg tgataactct tgatatggga ttgcttgttg ctggaacaaa 3061 ataccgtgga gagtttgagg aaaggctaaa aaagttgatg gaggaaatta agcagagtga 3121 tgaaataata ctatttatcg atgaagtgca cacattgatt ggagctggag cagcagaggg 3181 ggccatcgat gctgcaaaca tcttgaaacc tgccctagct cgaggtgaac tacaggtaag 3241 ttcctccttc gcttgttctt tcctttctgt tctttctaga aattaaaaat gagaactaag 3301 aatatttctg ttaacaactc aatgttttag aagatatatc attgttttaa catgcatttt 3361 accgcctcat aagaagttaa aatgactgct cagttaaagc aggagaaaaa acttctacat 3421 gtgtatttac ttggcttctt gcagtgtatc ggagctacta cactggatga atacagaaag 3481 catattgaga aagatcctgc actagagagg agattccaac cagttaaggt ccctgaacct 3541 actgttgatg aaactataca gattctgaaa gggcttcgtg agaggtatga aattcatcac 3601 aaacttcgtt acactgacga agacttagtg gctgctgctc agctctcata ccagtacatc 3661 aggtatctaa tctctaccat tttcctatgt cttcttgtac gtttttactc ttttctcgtc 3721 ctgaattgtt gatgcattgg taccttgaag aaatatgttt gctttaagac ggtagacctg 3781 aagttcaact tattttatga tagttaagtg tgataataga agttggttta cttttgttta 3841 tcagcatcta attaggtttt tatgcttatc tctagaaaag attgtcctag attccacgat 3901 atagattttc gtataaagtt tttttttcct tgttctgcta agtgatacat cttttcactt 3961 gcgttttcac atttcagtga ccgatttctg cctgacaaag caattgattt gattgatgaa 4021 gctggttctc gtgttcgact tcgccatgct caggcatgaa atgctctctt agtctctctt 4081 ttaatctgtc attgccctac tatttgttga gtgacaatgc tctgttgttg cctccagctc 4141 cccgaggaag caaaagagct cgagaaagaa cttcgtcaga taacaaagga gaagaatgaa 4201 gctgttcgtg gtcaagattt cgaaaaggta ctgagtttgt catgtacatc taaagaaccc 4261 ctcaaaaatg gttgttatat agcccaacta ttttcttatc aagtattata attaaagaat 4321 tttctaatga catggttttg gattaatgga caggctgggg aattacgtga tagagaaatg 4381 gatctgaagg cacagatcac agccctcata gacaaaaaca aagaggtgag caaggctgaa 4441 agtgaggctg cagatacagg tccacttgtg acagaagcag atattcagca catcgtctct 4501 tcttggactg gtatccctgt tgagaaggtc tccactgacg aatctgaccg tctcctaaaa 4561 atggaagaaa cacttcacac acgaatcatt ggccaggatg aagctgtcaa agccattagt 4621 cgcgctattc gacgtgcacg agttgggctc aagaatccca acagacctat tgctagtttc 4681 atcttttccg gtcccactgg tgttgggaaa tcagaactgg caaaggcact ggctgcatat 4741 tactttggtt ctgaagaagc aatgatccgg cttgatatga gtgagttcat ggaaagacac 4801 actgtctcta aactcattgg atccccccct ggttatgttg gttacacaga gggtggtcaa 4861 ttgactgaag ctgttaggcg tcgaccttat acagttgtgc tctttgatga gattgagaag 4921 gctcatcctg atgtcttcaa catgatgctt caaattcttg aagatggaag attgacagac 4981 agcaagggta gaactgtaga tttcaagaac acacttctca tcatgacatc aaatgttgga 5041 agtagtgtaa ttgagaaagg tggccgtcgt ataggttttg atctcgacct tgatgagaag 5101 gatagcagtt acaatcgcat caagagctta gtgacagaag aattgaagca gtacttcagg 5161 ccagagtttt tgaacagatt ggatgagatg attgtattcc gtcagctcac taagttagag 5221 gttaaggaga tagctgatat catgcttaag gaggtctttg agaggttgaa agttaaggag 5281 atagaacttc aagtgacaga gaggtttaga gacagggtgg ttgacgaggg atacaaccct 5341 agctatggag cacgacctct gagaagagct attatgagac ttttagagga cagcatggct 5401 gagaaaatgc ttgcaaatga gattaaagaa ggtgattcag ttatcgtgga cgttgattca 5461 gatggcaatg tgactgtcct caatggaagt agtggtactc cctcggatcc ggctcctgag 5521 cctatccccg tttagatcag gtgatcacat tttagctcaa cagtctctgt gtggtctttt 5581 aagagttcat ttgtgttggc cactaagctt tgtggggctt atgaagcaac ttttgagttt 5641 atctctggga atgctcaggt aaatcttgcc acgtcgagaa ggcagcatcc tttctatgtt 5701 gatcttaatg attttacaga agtatactgg attaagattg tagttggtca caagaaagta 5761 ttgtattaac aagacactcc ttatcaaaaa catatctagt ttattgtagg cgacagttga 5821 atcgttaatc ctcgtattgt ctatcc // LOCUS PLYORIA 375 bp ds-DNA VRL 17-JUL-1990 DEFINITION Human polyomavirus BK (strain WWT) origin of replication. ACCESSION M34048 KEYWORDS origin of replication. SOURCE Polyomavirus BK (strain WWT) DNA, from human urine. ORGANISM Polyomavirus BK Unclassified. REFERENCE 1 (bases 1 to 375) AUTHORS Sundsfjord,A., Johansen,T., Flegstad,T., Moens,U., Villand,P., Subramani,S. and Traavik,T. TITLE At least two type of control regions can be found among naturally occurring BK virus strains JOURNAL Unpublished (1990) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by T.Johansen, 07-MAY-1990. University of Tromso P.O. Box 977 N-9001, Tromso, NORWAY e-mail: terje.johansen@esuit.uit.no FEATURES from to/span description orgrpl 97 119 polyomavirus origin of replication site 1 375 regulatory region site 143 210 P block site 211 249 Q block site 250 302 R block BASE COUNT 123 a 86 c 95 g 71 t ORIGIN 1 ttttgcaaaa attgcaaaag aatagggatt tccccaaata gttttgctag gcctcagaaa 61 aagcctccac acccttacta cttgagagaa agggtggagg cagaggcggc ctcggcctct 121 tatatattat aaaaaaaaag gccacaggga ggagctgctt acccatggaa tgcagccaaa 181 ccatgacctc aggaaggaaa gtgcatgact gggcagccag ccagtggcag ttaatagtga 241 aaccccgccc ctgaaattct caaataaaca caagaggaag tggaaactgg ccaaaggagt 301 ggaaagcagc cagacagaca tgttttgcgg gcctaggaat cttggccttg tccccagtta 361 aactggacaa aggcc // LOCUS PLYORIB 424 bp ds-DNA VRL 17-JUL-1990 DEFINITION Human polyomavirus BK (strain UT) origin of replication. ACCESSION M34049 KEYWORDS origin of replication. SOURCE Human polyomavirus BK (strain UT) DNA, from human urine. ORGANISM Polyomavirus BK Unclassified. REFERENCE 1 (bases 1 to 424) AUTHORS Sundsfjord,A., Johansen,T., Flegstad,T., Moens,U., Villand,P., Subramani,S. and Traavik,T. TITLE At least two type of control regions can be found among naturally occurring BK virus strains JOURNAL Unpublished (1990) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by T.Johansen, 07-MAY-1990. University of Tromso P.O. Box 977 N-9001, Tromso, NORWAY e-mail: terje.johansen@esuit.uit.no FEATURES from to/span description orgrpl 97 119 polyomavirus origin of replication site 1 424 regulatory region site 143 210 P block site 211 249 Q block site 250 261 R block site 263 315 P block site 316 349 Q block site 350 361 R block BASE COUNT 134 a 103 c 105 g 82 t ORIGIN 1 ttttgcaaaa attgcaaaag aatagggatt tccccaaata gttttgctag gcctcagaaa 61 aagcctccac acccttacta cttgagagaa agggtggagg cagaggcggc ctcggcctct 121 tatatattat aaaaaaaaag gccacaggga ggagctgctt acccatggaa tgcagccaaa 181 ccatgacctc aggaaggaaa gtgcatgact gggcagccag ccagtggcag ttaatagtga 241 aaccccgccc ctaaaattct ctcttaccca tggaatgcag ccaaaccatg acctcaggaa 301 ggaaagtgca tgactgggca gccagccagt ggcagttaat agtgaaaccg gaaagcagcc 361 agacagacat gttttgcgag cctaggaatc ttggccttgt ccccagttaa actggacaaa 421 ggcc // LOCUS ECOCLPPA 1236 bp ds-DNA BCT 17-JUL-1990 DEFINITION Escherichia coli ATP-dependent clp protease proteolytic component (clpP) gene, complete cds. ACCESSION J05534 KEYWORDS ATP-dependent protease. SOURCE E.coli (strain K12) DNA. ORGANISM Escherichia coli Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Facultatively anaerobic rods; Enterobacteriaceae. REFERENCE 1 (bases 1 to 1236) AUTHORS Maurizi,M.R., Clark,W.P., Katayama,Y., Rudikoff,S., Pumphrey,J., Bowers,B. and Gottesman,S. TITLE Sequence and structure of ClpP, the proteolytic component of the ATP-dependent Clp protease of E.coli JOURNAL J. Biol. Chem. (1990) In press STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by S.Gottesman, 23-APR-1990. FEATURES from to/span description pept 378 1001 ATP-dependent protease (clpP) mRNA 279 > 1236 clpP mRNA pept < 1 133 ORF signal 244 249 -35 region signal 267 272 -10 region signal 165 193 transcription termination signal binding 367 372 ribosome binding site (put.) BASE COUNT 329 a 291 c 318 g 298 t ORIGIN 1 caaaaacaaa gaactgatgg acaacatgcg caatgttgct ctggaagaac aggctgttga 61 agctgtactg gcgaaagcga aagtgactga aaaagaaacc actttcaacg agctgatgaa 121 ccagcaggcg taatttacgc agcataacgc gctaaattcg cacaaaggcc cgtcaccgcc 181 aggtggtggg cttttttttg tcatgaattt tgcatggaac cgtgcgaaaa gcctctttcg 241 gtgttagcgt aacaacaaaa gattgttatg cttgaaatat ggtgatgccg tacccataac 301 acaggactag ctgataatcc gtccataagg ttacaatcgg tacagcaggt tttttcaatt 361 ttatccagga gacggaaatg tcatacagcg gcgaacgaga taactttgca ccccatatgg 421 cgctggtgcc gatggtcatt gaacagacct cacgcggtga gcgctctttt gatatctatt 481 ctcgtctact taaggaacgc gtcatttttc tgactggcca ggttgaagac cacatggcta 541 acctgattgt ggcgcagatg ctgttcctgg aagcggaaaa cccagaaaaa gatatctatc 601 tgtacattaa ctccccaggc ggggtgatca ctgccgggat gtctatctat gacaccatgc 661 agtttatcaa gcctgatgtc agcaccatct gtatgggcca ggcggcctcg atgggcgctt 721 tcttgctgac cgcaggggca aaaggtaaac gtttttgcct gccgaattcg cgcgtgatga 781 ttcaccaacc gttgggcggc taccagggcc aggcgaccga tatcgaaatt catgcccgtg 841 aaattctgaa agttaaaggg cgcatgaatg aacttatggc gcttcatacg ggtcaatcat 901 tagaacagat tgaacgtgat accgagcgcg atcgcttcct ttccgcccct gaagcggtgg 961 aatacggtct ggtcgattcg attctgaccc atcgtaattg atgccagagg cgcaactgtg 1021 ccgctatact tatccagggc ggcacaacgc tgtaagcgct tgcgcctgag aatggcattt 1081 gcgtcgtcgt gtgcggcaca aagaacaaag aagaggtttt gacccatgac agataaacgc 1141 aaagatggct caggcaaatt gctgtattgc tctttttgcg gcaaaagcca gcatgaagtg 1201 cgcaagctga ttgccggtcc atccgtgtat atctgc // LOCUS LACNISC 1222 bp ds-DNA BCT 17-JUL-1990 DEFINITION L.lactis insertion sequence IS981. ACCESSION M33933 KEYWORDS insertion sequence. SOURCE L.lactis (strain LM0230) DNA, clone pKM40. ORGANISM Lactococcus lactis Prokaryota; Bacteria; Firmicutes; Regular asporogenous rods; Lactobacillaceae. REFERENCE 1 (bases 1 to 1222) AUTHORS Polzin,K.M. and McKay,L.L. TITLE Identification, DNA sequence and distribution of IS981, a new, high copy number insertion sequence in the lactococci related to the IS2/IS3 family of IS-elements JOURNAL Unpublished (1990) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by K.M.Polzin 27-APR-1990. University of Minnesota Dept. of Food Science 1334 Eckles Avenue, St. Paul, MN 55108 e-mail: kayla@molbio.cbs.umn.edu FEATURES from to/span description pept 58 318 ORF1 pept 330 1169 ORF2 binding 44 50 ribosomal binding site (put.) rpt 1 40 5' inverted repeat rpt 1183 1222 3' inverted repeat signal 33 38 -10 region signal 269 274 -35 region signal 294 299 -10 region BASE COUNT 442 a 233 c 206 g 341 t ORIGIN 132 bp upstream of PvuII site. 1 gatgttatcc ttaaatctta gagtcactat tgtataattt agacaaagga caaaaacatg 61 caaaaacgct actcaaaaga atttaaagaa acccttatcg ccttctatca ttctggtcaa 121 tccgtcaccc agctgtctaa agaatatgac gtggcccctg caacaattta taaatggata 181 gacctctact ctaaatctaa tgaaagctcc gtctctaaag ctgattttct agaattaaaa 241 agacaactgg ctaaagttaa ggaagaacga gacatcttaa aaaaagtatt gaccatattc 301 gccgagaaaa agaagtgagt gctgcggata tggctcaaac catacaaact ttagcactca 361 atgtcagact aagctgtcaa ctccttgatg ttcctgaatc aagttattat gaacggatta 421 accgacaccc atctaaaact caattaagga gacaatacct gtcactcaaa atttctcaac 481 tcttcaatgc taaccgagga atctatggtg ctcctaaaat tcatcatctt ctacttaaac 541 aaggggaaaa agtcgggtta aaactggtac agaagctaat gaagcaactt caactcaagt 601 ctgtagtcat taagaaattt aagcctggat actcactaag tgatcacatc aatcgaaaaa 661 atctcataca gactgaacct acaaagaaaa ataaggtttg gtcaaccgac attacttata 721 ttcctactca acaaggatgg gcttatctct caaccattat ggatcgttat actaaaaaag 781 tcattgcttg ggatttgggc aagcgaatga ctgtagaatt agtgcaaaga actttaaata 841 aggccattaa atcacaagac tatccagaag ctgttattct tcattctgac caaggaagcc 901 agtatacgag tctagagtat gaagagttgc ttaagtatta tgggatgact cactctttca 961 gtcgaagggg atacccttat cataatgcca gtcttgaatc ttggcatgga catttaaaaa 1021 gagagtgggt gtaccaattt aaatataaga actttgaaga agcctatcag agtattttct 1081 ggtacatcga agccttttat aattcaaaac gaatccatca aagtttaggg tatcttacac 1141 ctaatcaatt tgaaaaggta agtgcttaaa ataaatagat taaaattcta cgtttgttac 1201 tctaaaaact tgacttaacg tc // LOCUS RATSHIIA 3064 bp ss-mRNA ROD 17-JUL-1990 DEFINITION Rat K+ channel protein (KShIIA) mRNA, complete cds. ACCESSION M34052 KEYWORDS K+ channel protein. SOURCE Rat brain, cDNA to mRNA. ORGANISM Rattus rattus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 3064) AUTHORS McCormack,T., Vega-Saenz de Miera,E.V. and Bernardo,R. TITLE Molecular cloning of a member of a novel class of K+ channel genes in mammals JOURNAL Proc. Natl. Acad. Sci. U.S.A. (1990) In press STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by R.Bernardo, 07-MAY-1990. FEATURES from to/span description pept 80 1771 K+ channel protein (KShIIa) BASE COUNT 777 a 718 c 729 g 840 t ORIGIN 1 gccacggtga acatctggcc cacgagagct ttaacttggt gctgtgttcg ccttccctag 61 tcatgtctga gccacagaga tgggcaagat cgagaacaac gagagggtga tcctcaatgt 121 cggaggcacc aggcacgaaa cctaccgcag cactctcaag acccttcctg gaactcgcct 181 ggcccttctc gcctcctctg aacctcaggg cgactgcctg actgctgcgg gtgacaagct 241 gcagccgctg ccccctccgc tgtctccacc gccgcgaccg cctcccttgt cccctgtccc 301 cagcggctgc ttcgagggcg gcgcaggcaa ctgcagttcg cacggtggca atggcagcga 361 ccaccctggg ggaggccgcg aattcttctt cgatcgccac ccaggagtct tcgcctatgt 421 gctcaactac taccgcacgg gcaagctgca ctgccccgcc gacgtgtgtg gaccgctctt 481 cgaggaagag ctggcattct ggggcatcga tgagaccgac gtggagccct gctgctggat 541 gacctacagg cagcaccggg acgcggagga ggccctggat atcttcgaga cacccgacct 601 catcggaggc gaccctggtg atgatgagga cctagggggc aagagactgg gcattgagga 661 tgctgcgggg ctgggaggac ccgatggcaa gtctggccgc tggaggaagc tgcagcctcg 721 catgtgggct ctctttgagg acccctattc atccagagcc gctaggttta ttgcttttgc 781 ttctctgttc ttcattttgg tttccatcac aaccttttgc ctggagacac acgaagcttt 841 caatattgtt aaaaacaaga cagagccagt catcaacggc accagcgctg ttctccagta 901 tgaaatcgaa acggatcctg ccttgacata tgtggaagga gtgtgtgtgg tgtggtttac 961 ttttgaattt ttagtccgta ttgttttctc gcccaataaa cttgagttca tcaaaaatct 1021 attgaacatc attgactttg tggccatcct ccccttctac ttagaggtgg gactcagcgg 1081 gctgtcttcc aaagcggcta aagatgtgct cggctttctc agggtggtta ggtttgtgag 1141 gatcctgaga atcttcaagc ttacccgcca tttcgtaggt ctgagagtgc tcggacacac 1201 tcttcgtgcg agcaccaatg aatttttgtt gctgatcatc tttctggctc tgggagtttt 1261 gatattcgct acgatgatct actacgctga gcgagtaggg gctcaaccta atgatccctc 1321 agcgagtgag cacacacagt tcaaaaacat ccccattggt ttctggtggg ctgtggtgac 1381 catgactacc ttaggctatg gggatatgta cccccaaaca tggtcaggga tgttggtggg 1441 ggccttgtgt gctctggctg gagtgctgac catagctatg cctgtgcccg tcattgtcaa 1501 caattttggg atgtactact ccttggcaat ggcgaagcag aaacttccaa gaaaaagaaa 1561 gaagcacatt cctcctgccc ctctggcaag ctcacctaca ttttgcaaga cagaattaaa 1621 catggcttgt aacagtaccc agagtgacac atgtctgggc aaagaaaacc ggcttctgga 1681 acataacaga tcagtgttat caggtgacga cagtacagga agtgagccgc cattatcacc 1741 ttccggaaag gctccccatc agacgctcta gtaccagaga caaaaacaga agaggggaaa 1801 catgtttcct gttgacgaca ggtgattaca cgtgcgcttc tgatggagga atcaggaaag 1861 ataactgcaa agatgttgtc attactggtt acacgcaagc cgaggccaga tctcttactt 1921 aatgacttgg gaaaggcaca aaacatgaaa gaaagtgttg tacagaattt atcatggatt 1981 tttgcctgct gaaaatggga cattgagatt tagccattca aggattgtac tggaaacttc 2041 tgctactgaa tgtgaccgga tgtgaccggt ttgtgttcag aagagttccc gcattctctg 2101 aggcatttaa agcttgtaaa gaaatggtgg ctggaactga aatgggcttt gccaaggaaa 2161 tattctgctt gtcgacagac cagagtccac aaaacaactg taaataccaa cgtgtgcatg 2221 ggtccaccct tctcacagtc tcatagcaca gggagccaca ttcattgtca gcatctgtga 2281 agcttcaaga aatgcccaca aggctctgaa tggctcctct cgggctcatg gtggttttgc 2341 tgcaaattac agagggcagt gaatccaatg tagtataggt ctgttttcat ttcttttatt 2401 tgtacattac gtgttgtatc tacaaacacc gattgcctca ttttcttgca aaccacttat 2461 atatccgctt gtgaactatg ttgtaaactt ttagtgtcta catagaaatc taaccatttc 2521 ttagtcatgt attcatttag ttctggaacg ctgttactat ctagaagaca gtcatcacca 2581 gtattcatcg tataagtctt ttgtcattgt tacctcaatt ataaatatta aaaatataaa 2641 ttctggcaat gagaatattt ttttattaaa tgatcaagga aaatgtcagt atatagtaga 2701 atattatcaa attatatcct aaaatgtcta ttttgcataa aagagatatt cttcaatcga 2761 ttcctttttt tcgtgagctt tgtggctaat gaagcgtgtg tttgtcttca tcactgctgt 2821 agttgaaact gtgtaagagt ttttcatctt gcttaatcaa tatttccaga atctcttagt 2881 tcccctggga ttctgaatat actctatagc ctaattataa acccctgtat cgtgtacctt 2941 ttgtgaccat ctcaaggtgc atgcctaatc gttgtgataa gccaatggac atgttactaa 3001 ctgaaatgaa gaataaaagt caaacgatct ggggataaac ttgaatccta tctgattaaa 3061 tgat // LOCUS HUMROS1 7375 bp ss-mRNA PRI 17-JUL-1990 DEFINITION Human transmembrane tyrosine-specific protein kinase (ROS1) mRNA, complete cds. ACCESSION M34353 KEYWORDS oncogene; transmembrane tyrosine-specific protein kinase. SOURCE Human glioblastoma cell line SW-1088, cDNA to mRNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 7375) AUTHORS Birchmeier,C., O'Neil,K., Riggs,M. and Wigler,M. TITLE Characterization of ROS1 cDNA from a human glioblastoma cell line JOURNAL Proc. Natl. Acad. Sci. U.S.A. 87, 4799-4803 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by M.H.Wigler, 15-MAY-1990. FEATURES from to/span description pept 207 7250 transmembrane tyrosine-specific protein kinase precursor /hgml_locus_uid="LN0124W" /nomgen="ROS1" /map="6q21-q22" sigp 207 314 transmembrane tyrosine-specific protein kinase signal peptide matp 208 7247 transmembrane tyrosine-specific protein kinase BASE COUNT 2203 a 1496 c 1605 g 2071 t ORIGIN 1 ccgcattcaa gctttcaagc attcaaaggt ctaaatgaaa aaggctaagt attatttcaa 61 aaggcaagta tatcctaata tagcaaaaca aacaaagcaa aatccatcag ctactcctcc 121 aattgaagtg atgaagccca aataattcat atagcaaaat ggagaaaatt agaccggcca 181 tctaaaaatc tgccattggt gaagtgatga agaacattta ctgtcttatt ccgaagcttg 241 tcaattttgc aactcttggc tgcctatgga tttctgtggt gcagtgtaca gttttaaata 301 gctgcctaaa gtcgtgtgta actaatctgg gccagcagct tgaccttggc acaccacata 361 atctgagtga accgtgtatc caaggatgtc acttttggaa ctctgtagat cagaaaaact 421 gtgctttaaa gtgtcgggag tcgtgtgagg ttggctgtag cagcgcggaa ggtgcatatg 481 aagaggaagt actggaaaat gcagacctac caactgctcc ctttgcttct tccattggaa 541 gccacaatat gacattacga tggaaatctg caaacttctc tggagtaaaa tacatcattc 601 agtggaaata tgcacaactt ctgggaagct ggacttatac taagactgtg tccagaccgt 661 cctatgtggt caagcccctg caccccttca ctgagtacat tttccgagtg gtttggatct 721 tcacagcgca gctgcagctc tactcccctc caagtcccag ttacaggact catcctcatg 781 gagttcctga aactgcacct ttgattagga atattgagag ctcaagtccc gacactgtgg 841 aagtcagctg ggatccacct caattcccag gtggacctat tttgggttat aacttaaggc 901 tgatcagcaa aaatcaaaaa ttagatgcag ggacacagag aaccagtttc cagttttact 961 ccactttacc aaatactatc tacaggtttt ctattgcagc agtaaatgaa gttggtgagg 1021 gtccagaagc agaatctagt attaccactt catcttcagc agttcaacaa gaggaacagt 1081 ggctcttttt atccagaaaa acttctctaa gaaagagatc tttaaaacat ttagtagatg 1141 aagcacattg ccttcggttg gatgctatat accataatat tacaggaata tctgttgatg 1201 tccaccagca aattgtttat ttctctgaag gaactctcat atgggcgaag aaggctgcca 1261 acatgtctga tgtatctgac ctgagaattt tttacagagg ttcaggatta atttcttcta 1321 tctccataga ttggctttat caaagaatgt atttcatcat ggatgaactg gtatgtgtct 1381 gtgatttaga gaactgctca aacatcgagg aaattactcc accctctatt agtgcacctc 1441 aaaaaattgt ggctgattca tacaatgggt atgtctttta cctcctgaga gatggcattt 1501 atagagcaga ccttcctgta ccatctggcc ggtgtgcaga agctgtgcgt attgtggaga 1561 gttgcacgtt aaaggacttt gcaatcaagc cacaagccaa gcgaatcatt tacttcaatg 1621 acactgccca agtcttcatg tcaacatttc tggatggctc tgcttcccat ctcatcctac 1681 ctcgcatccc ctttgctgat gtgaaaagtt ttgcttgtga aaacaatgac tttcttgtca 1741 cagatggcaa ggtcattttc caacaggatg ctttgtcttt taatgaattc atcgtgggat 1801 gtgacctgag tcacatagaa gaatttgggt ttggtaactt ggtcatcttt ggctcatcct 1861 cccagctgca ccctctgcca ggccgcccgc aggagctttc ggtgctgttt ggctctcacc 1921 aggctcttgt tcaatggaag cctcctgccc ttgccatagg agccaatgtc atcctgatca 1981 gtgatattat tgaactcttt gaattaggcc cttctgcctg gcagaactgg acctatgagg 2041 tgaaagtatc cacccaagac cctcctgaag tcactcatat tttcttgaac ataagtggaa 2101 ccatgctgaa tgtacctgag ctgcagagtg ctatgaaata caaggtttct gtgagagcaa 2161 gttctccaaa gaggccaggc ccctggtcag agccctcagt gggtactacc ctggtgccag 2221 ctagtgaacc accatttatc atggctgtga aagaagatgg gctttggagt aaaccattaa 2281 atagctttgg cccaggagag ttcttatcct ctgatatagg aaatgtgtca gacatggatt 2341 ggtataacaa cagcctctac tacagtgaca cgaaaggcga cgtttttgtg tggctgctga 2401 atgggacgga tatctcagag aattatcacc tacccagcat tgcaggagca ggggctttag 2461 cttttgagtg gctgggtcac tttctctact gggctggaaa gacatatgtg atacaaaggc 2521 agtctgtgtt gacgggacac acagacattg ttacccacgt gaagctattg gtgaatgaca 2581 tggtggtgga ttcagttggt ggatatctct actggaccac actctattca gtggaaagca 2641 ccagactaaa tggggaaagt tcccttgtac tacagacaca gccttggttt tctgggaaaa 2701 aggtaattgc tctaacttta gacctcagtg atgggctcct gtattggttg gttcaagaca 2761 gtcaatgtat tcacctgtac acagctgttc ttcggggaca gagcactggg gataccacca 2821 tcacagaatt tgcagcctgg agtacttctg aaatttccca gaatgcactg atgtactata 2881 gtggtcggct gttctggatc aatggcttta ggattatcac aactcaagaa ataggtcaga 2941 aaaccagtgt ctctgttttg gaaccagcca gatttaatca gttcacaatt attcagacat 3001 cccttaagcc cctgccaggg aacttttcct ttacccctaa ggttattcca gattctgttc 3061 aagagtcttc atttaggatt gaaggaaatg cttcaagttt tcaaatcctg tggaatggtc 3121 cccctgcggt agactggggt gtagttttct acagtgtaga atttagtgct cattctaagt 3181 tcttggctag tgaacaacac tctttacctg tatttactgt ggaaggactg gaaccttatg 3241 ccttatttaa tctttctgtc actccttata cctactgggg aaagggcccc aaaacatctc 3301 tgtcacttcg agcacctgaa acagttccat cagcaccaga gaaccccaga atatttatat 3361 taccaagtgg aaaatgctgc aacaagaatg aagttgtggt ggaatttagg tggaacaaac 3421 ctaagcatga aaatggggtg ttaacaaaat ttgaaatttt ctacaatata tccaatcaaa 3481 gtattacaaa caaaacatgt gaagactgga ttgctgtcaa tgtcactccc tcagtgatgt 3541 cttttcaact tgaaggcatg agtcccagat gctttattgc cttccaggtt agggccttta 3601 catctaaggg gccaggacca tatgctgacg ttgtaaagtc tacaacatca gaaatcaacc 3661 catttcctca cctcataact cttcttggta acaagatagt ttttttagat atggatcaaa 3721 atcaagttgt gtggacgttt tcagcagaaa gagttatcag tgccgtttgc tacacagctg 3781 ataatgagat gggatattat gctgaagggg actcactctt tcttctgcac ttgcacaatc 3841 gctctagctc tgagcttttc caagattcac tggtttttga tatcacagtt attacaattg 3901 actggatttc aaggcacctc tactttgcac tgaaagaatc acaaaatgga atgcaagtat 3961 ttgatgttga tcttgaacac aaggtgaaat atcccagaga ggtgaagatt cacaatagga 4021 attcaacaat aatttctttt tctgtatatc ctcttttaag tcgcttgtat tggacagaag 4081 tttccaattt tggctaccag atgttctact acagtattat cagtcacacc ttgcaccgaa 4141 ttctgcaacc cacagctaca aaccaacaaa acaaaaggaa tcaatgttct tgtaatgtga 4201 ctgaatttga gttaagtgga gcaatggcta ttgatacctc taacctagag aaaccattga 4261 tatactttgc caaagcacaa gagatctggg caatggatct ggaaggctgt cagtgttgga 4321 gagttatcac agtacctgct atgctcgcag gaaaaaccct tgttagctta actgtggatg 4381 gagatcttat atactggatc atcacagcaa aggacagcac acagatttat caggcaaaga 4441 aaggaaatgg ggccatcgtt tcccaggtga aggccctaag gagtaggcat atcttggctt 4501 acagttcagt tatgcagcct tttccagata aagcgtttct gtctctagct tcagacactg 4561 tggaaccaac tatacttaat gccactaaca ctagcctcac aatcagatta cctctggcca 4621 agacaaacct cacatggtat ggcatcacca gccctactcc aacatacctg gtttattatg 4681 cagaagttaa tgacaggaaa aacagctctg acttgaaata tagaattctg gaatttcagg 4741 acagtatagc tcttattgaa gatttacaac cattttcaac atacatgata cagatagctg 4801 taaaaaatta ttattcagat cctttggaac atttaccacc aggaaaagag atttggggaa 4861 aaactaaaaa tggagtacca gaggcagtgc agctcattaa tacaactgtg cggtcagaca 4921 ccagcctcat tatatcttgg agagaatctc acaagccaaa tggacctaaa gaatcagtcc 4981 gttatcagtt ggcaatctca cacctggccc taattcctga aactcctcta agacaaagtg 5041 aatttccaaa tggaaggctc actctccttg ttactagact gtctggtgga aatatttatg 5101 tgttaaaggt tcttgcctgc cactctgagg aaatgtggtg tacagagagt catcctgtca 5161 ctgtggaaat gtttaacaca ccagagaaac cttattcctt ggttccagag aacactagtt 5221 tgcaatttaa ttggaaggct ccattgaatg ttaacctcat cagattttgg gttgagctac 5281 agaagtggaa atacaatgag ttttaccatg ttaaaacttc atgcagccaa ggtcctgctt 5341 atgtctgtaa tatcacaaat ctacaacctt atacttcata taatgtcaga gtagtggtgg 5401 tttataagac gggagaaaat agcacctcac ttccagaaag ctttaagaca aaagctggag 5461 tcccaaataa accaggcatt cccaaattac tagaagggag taaaaattca atacagtggg 5521 agaaagctga agataatgga tgtagaatta catactatat ccttgagata agaaagagca 5581 cttcaaataa tttacagaac cagaatttaa ggtggaagat gacatttaat ggatcctgca 5641 gtagtgtttg cacatggaag tccaaaaacc tgaaaggaat atttcagttc agagtagtag 5701 ctgcaaataa tctagggttt ggtgaatata gtggaatcag tgagaatatt atattagttg 5761 gagatgattt ttggatacca gaaacaagtt tcatacttac tattatagtt ggaatatttc 5821 tggttgttac aatcccactg acctttgtct ggcatagaag attaaagaat caaaaaagtg 5881 ccaaggaagg ggtgacagtg cttataaacg aagacaaaga gttggctgag ctgcgaggtc 5941 tggcagccgg agtaggcctg gctaatgcct gctatgcaat acatactctt ccaacccaag 6001 aggagattga aaatcttcct gccttccctc gggaaaaact gactctgcgt ctcttgctgg 6061 gaagtggagc ctttggagaa gtgtatgaag gaacagcagt ggacatctta ggagttggaa 6121 gtggagaaat caaagtagca gtgaagactt tgaagaaggg ttccacagac caggagaaga 6181 ttgaattcct gaaggaggca catctgatga gcaaatttaa tcatcccaac attctgaagc 6241 agcttggagt ttgtctgctg aatgaacccc aatacattat cctggaactg atggagggag 6301 gagaccttct tacttatttg cgtaaagccc ggatggcaac gttttatggt cctttactca 6361 ccttggttga ccttgtagac ctgtgtgtag atatttcaaa aggctgtgtc tacttggaac 6421 ggatgcattt cattcacagg gatctggcag ctcgaaattg ccttgtttcc gtgaaagact 6481 ataccagtcc acggatagtg aagattggag actttggact cgccagagac atctataaaa 6541 atgattacta tagaaagaga ggggaaggcc tgctcccagt tcggtggatg gctccagaaa 6601 gtttgatgga tggaatcttc actactcaat ctgatgtatg gtcttttgga attctgattt 6661 gggagatttt aactcttggt catcagcctt atccagctca ttccaacctt gatgtgttaa 6721 actatgtgca aacaggaggg agactggagc caccaagaaa ttgtcctgat gatctgtgga 6781 atttaatgac ccagtgctgg gctcaagaac ccgaccaaag acctactttt catagaattc 6841 agaaccaact tcagttattc agaaattttt tcttaaatag catttatcag tgcagagatg 6901 aagcaaacaa cagtggagtc ataaatgaaa gctttgaagg tgaagatggc gatgtgattt 6961 gtttgaattc agatgacatt atgccagttg ttttaatgga aacgaagaac cgagaagggt 7021 taaactatat ggtacttgct acagaatgtg gccaaggtga agaaaagtct gagggtcctc 7081 taggctccca ggaatctgaa tcttgtggtc tgaggaaaga agagaaggaa ccacatgcag 7141 acaaagattt ctgccaagaa aaacaagtgg cttactgccc ttctggcaag cctgaaggcc 7201 tgaactatgc ctgtctcact cacagtggat atggagatgg gtctgattaa tagcgttgtt 7261 tgggaaatag agagttgaga taaacactct cattcagtag ttactgaaag aaaactctgc 7321 tagaatgata aatgtcatgg tggtctataa ctccaaataa acaatgcaac gttcc // LOCUS MBOPIVML 969 bp ds-DNA BCT 17-JUL-1990 DEFINITION M.lacunata pilin gene inverting protein (PivML) gene, complete cds. ACCESSION M34367 KEYWORDS pilin gene inverting protein. SOURCE M.lacunata (ATCC 17956) DNA, clone pMxL1. ORGANISM Moraxella lacunata Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Neisseriaceae. REFERENCE 1 (bases 1 to 969) AUTHORS Marrs,C.F., Rozsa,F.W., Hackel,M., Stevens,S.P. and Glasgow,A.C. TITLE Identification, cloning and sequencing of Piv, a new gene involved in inverting the pilin gene of M.lacunata JOURNAL J. Bacteriol. (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by C.F.Marrs, 17-MAY-1990. FEATURES from to/span description pept 1 969 pilin gene inverting protein (PivML) BASE COUNT 318 a 236 c 165 g 250 t ORIGIN 1 atgtctaaaa cttacattgg gattgatatc gccaaaaaca catttgatgc ctgctttatt 61 acccataaca cttggcaaaa ctgtacattc acaaataacc aacagggttt tatagagcta 121 acattgtgga ttcaagcaca tcattataac acatctacct tacatttgat tattgaggca 181 acgggggctt attgggaaaa attagctcat tgggcaatct ctcatcatca taaggtaagt 241 attgtcaacc ccttatatat tcatgcatac gccaaaagcc taggcatacg caccaagaca 301 gataagcagg atgccattct cttggcacgc tatggtgcaa aagaaaatcc acctttatgg 361 cagcctaaat cagacaacga aataaaactg actgcccttc taaaacaacg tgaacaccac 421 aaacggcaac tcatcaaaga acgcactcga caggaagctc tgtctattta tgtgaaatcg 481 tatactgatg acaacattcg tcattggagt gacagcatca cacagctcga tcatcaaatt 541 tggcaactca tcaattgcac gcccgagcta aattaccgtg ccagcctact tgccacaata 601 cctggaattg gcaaaaagac actgccacat ctactggttg tcattggaga tggttcatca 661 tttcaaagtg ccaaacatct agcaagctat gctgggcttg cccctcgtca tcatcaatca 721 ggcattagta ttcacaagca gtcttccatt ggatttagtg gacaaaaaga actacgttca 781 gcccttttta tgcccgctgt catagtcagc tttggtcgat atcccgcttt tcaaaagttc 841 gtcaaacgca tggaacaaaa aggcaaaacc aaaaaacaaa ttattatcgc catcatgcgc 901 aaattactca ccatcagcta tgccgttatt cgccaaaatc gcccttttga caaacgcatt 961 catgaataa // LOCUS STYMKAD 915 bp ds-DNA BCT 17-JUL-1990 DEFINITION Plasmid pEX102 mouse killing agent D (mkaD) gene, complete cds. ACCESSION M34355 KEYWORDS mouse killing agent D. SOURCE Plasmid pEX102 (from Salmonella typhimurium, strain TML R66) DNA. ORGANISM Plasmid pEX102 Prokaryota; Bacteria. REFERENCE 1 (bases 1 to 915) AUTHORS Taira,S. and Rhen,M. TITLE Nucleotide sequence of mkaD, a virulence associated gene of Salmonella typhimurium containing variable and constant region JOURNAL Gene (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by S.P.Taira, 16-MAY-1990. FEATURES from to/span description pept 132 857 mouse killing agent D (mkaD) BASE COUNT 273 a 181 c 213 g 248 t ORIGIN 1 ggaccgaaat atcatttttt cgcctgataa aaaatgaggt tttctggata actaatcgtt 61 ttattaaaaa aaaactgaga atttatatct aataatatgg cgatatatcc atatcgcaaa 121 ggagatttcc catgcccata aataggccta atctaaatct aaacatccct cctttgaata 181 ttgtagctgc ttatgatggg gcggaaatac catctacaaa taagcacctg aaaaataatt 241 tcaactcctt gcacaaccaa atgcggaaga tgccggtatc ccactttaaa gaggcgctgg 301 atgtgcctga ctattcaggg atgcgccaga gtggtttctt tgctatgagc caaggttttc 361 agctgaataa ccatggttac gatgttttca tccatgctcg tcgagaatca cctcagtctc 421 agggcaaatt tgccggtgac aagttccaca tcagtgtgct cagggatatg gtgccacaag 481 catttcaagc gctgtccgga ttgctgtttt cagaggacag tccggtagat aagtggaaag 541 tgaccgatat ggagaaggtc gttcaacaag cccgtgttag cctgggcgct cagttcacgt 601 tgtatataaa accagaccag gaaaattcgc agtacagtgc gtcgtttctc cacaagacac 661 ggcaatttat agagtgtctg gaatccagac tatccgaaaa tggggttatt tcaggacagt 721 gtcctgagtc agacgttcat cctgaaaatt ggaaatatct cagttatcgt aatgaactac 781 gaagtgggcg tgatggtggc gaaatgcaga gacaggcttt acgtgaggaa ccgttttatc 841 gtttgatgac agagtaagta tgggtttggg gagcaacgga acagtaaacg ccgttaaaca 901 gctattttaa atgct // LOCUS TOBEXT 3994 bp ds-DNA PLN 17-JUL-1990 DEFINITION N.plumbaginifolia extensin (ext) gene, complete cds. ACCESSION M34371 KEYWORDS cell wall protein; extensin. SOURCE N.plumbaginifolia leaf DNA. ORGANISM Nicotiana plumbaginifolia Eukaryota; Plantae; Embryobionta; Magnoliophyta; Magnoliopsida; Asteridae; Solanales; Solanaceae. REFERENCE 1 (bases 1 to 3994) AUTHORS De Loose,M.R.R., Gheyson,G., Tire,C., Gielen,J., Villaroel,R., Genetello,C., Van Montagu,M., Depicker,A. and Inze,D. TITLE The extensin signal peptide allows secretion of heterologous proteins from protoplasts JOURNAL Unpublished (1990) STANDARD unannotated staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by M.R.R.De Loose, 18-MAY-1990. Author address: M.R.R.De Loose Laboratorium voor Genetika, Wetenschappen, K.L.Ledeganckstraat, Gent, Oost-Vlaanderen, BELGIUM FEATURES from to/span description pept 1590 2840 extensin precursor sigp 1590 1652 extensin signal peptide (could end at 1667) matp 1653 2837 extensin (could start at 1668) mRNA 1560 3342 extensin mRNA rpt 1740 1824 tandem 85 bp repeats rpt 1740 1758 19 bp repeats signal 1532 1537 TATA box signal 3323 3329 poly-A signal BASE COUNT 1264 a 992 c 550 g 1188 t ORIGIN 1 cgcccgggga tcctcctgga tgccattttt cctttgcttt tggatacctt ttcgcaactt 61 tagtccattg cttgaacatg gttttttcta cctcttgtct gttcccatga tgataaacta 121 tgatagctaa cattttcagt aatattggat tgaattagat atactttgaa ttattgagct 181 acaaaattcg ttgaatattt tgccctatcc cccacccaaa agaaaataat tcataccttt 241 atttggtaat aaaagtgggt cacatgcaca gtttttctcg tcttctctag attaaactct 301 ttggaaatga cctactgaaa atactacaca taaaattccc cactcccctc caagttgggg 361 gagggtgggg gttggtgggt tttttttccc cttgaatcca ttcagccggc ttatgcctcc 421 cgaggtggtg ggaggtttaa tttgaaccct taccctatag ttaatatcga aatagataat 481 acaaaggagg ggaacataaa accaaaacct ccgacaaaac atcaaagtgt aggagttcat 541 gatatacgct tgattgaaag tgcacaatgg tctttgcata ctatcaaagt atcaattggt 601 tcttgaatta tatctcttaa tataatgtat tgtgtttaat ttctctacta tcccctattt 661 tataagacta aaagatcctg acatgcttct tgaacacatg tgaaggttag ttaactggtc 721 agaagtacac aagaattaat ttgtacacct atgtttctca ccagaatgcc cataaattac 781 atcaattctt caacatcaag agatcgaaaa acttaacaat tgttctaata gttgaactga 841 gtcgtccgat ccatatccga ttcttcacta gaagcattaa tcatatagag caatttaact 901 taatttacta tattggtaat catttacata gttaagttat aaaaccttgg agcgacacaa 961 ggatttacac taatcatgat tgaatattaa ctttactcaa tttatcaatg tctgatacaa 1021 gtgaattaat ttctctcttt gtgatttcgg tagaaaatta atgtcaagtt tcaaagtttt 1081 tctttttgaa attagtcata catgtgaata gaacattaat ttaagttaaa gaatcatata 1141 aaaaagttag ccgctttgat ttctgtaact gaaatcgtgc agagatgagg catccacact 1201 tgtttttcaa agcttcagta ctatttaaaa acaaagacag taaaaaggag actttttctt 1261 tgacattatt gcatcagaaa tagtatagct gccataatag tttattcctt tgcttagctt 1321 gcagcctcta tcaaacaaaa aaatacacca gctcaagtca atttgagccg acaacatgac 1381 aaaaccaaat caaatatgca tttttttttt tttttttttt tttttttttt tttttttttt 1441 tttacacttt ggtaggtgtt aagtaatcta gtgagacttt taccttcatt tatgaaaatc 1501 ttgaaaaggg taattgtcta attgaaagct atataaaggg gtcgaagtga agcttaagag 1561 gacaacaact tttctcattt gtttcaaaga tgggaaaaat ggcttctcta tttgccacat 1621 ttttagtggt tttagtgtca cttagcttag cttctgaaag ctcagcaaat tatcaatact 1681 catctccacc accacctaag aagccatacc acccttcacc aacaccatat tatcctgcac 1741 cagtttataa atctccacca ccaccaattc cggtttataa gtcaccacca ccacccaaga 1801 agccatacta ccctccacac accccagttt acaaatctcc accaccacca actccagttt 1861 acaagtcgcc accaccaccc aagaagccac actaccctcc acacacaccc gtttacaagt 1921 ctccaccacc acctactcca gtttacaagt caccaccatc acccaagaag ccacactatc 1981 ctccacacac tccggtttac aagtcgccac caccaccaac tcccgtttac aagtcgccac 2041 caccacccaa gaagccacac tatccaccac acaccccggt ttacaagtcg ccaccaccac 2101 caactcccgt ttacaagtcg ccaccaccac caaagaagcc acactatcct ccacacacac 2161 cagtttacaa gtcaccacca ccacccaaga agccatacta tccaccacac accccagttt 2221 acaagtcgcc accaccacca actccagttt acaagtcacc accaccacct aagaagccat 2281 accacccttc accaacaccg tatcatcctt caccagttta caagtcccct ccaccaccaa 2341 ctccagttta caagtcacca ccaccaccta agaagccata ccacccttca ccaacaccgt 2401 atcatccatc accagtttac aagtcccctc caccaccaac tccagtttac aagtcaccac 2461 caccacctaa gaagccatac cacccttcac caacaccgta tcatccatca ccagtttaca 2521 agtcccctcc accaccaact ccagtttaca agtcaccacc accacctaag aagccatacc 2581 acccttcacc aacaccgtat catcctgcac cagtttacaa gtccccacca ccaccaactc 2641 cagtttacaa gtcaccaccg ccaccagtga agccatacca tccttcacca acaccatacc 2701 atcctgcacc agtttataag tctccaccac caccaactcc tgtttacaag tcacctccac 2761 caccaactcc agtctacaag tctccaccac cacaccaccc ctatgtttac gcttctcctc 2821 ctcctcccta ccattactaa gaagtgacat cacaaagctg aaggtaatta aggcaaatta 2881 attaaaacat tttcaaaatg atatgtcagg aatgcaaaga aaattgcata actttgttta 2941 tgcaaaacat taatataata taaagaactg cgctctctgc ttattaacta tttttttagc 3001 ttcatttatt agttactagc tgttcagaat tttcagtatc ttttgatatt acaaagaacc 3061 taatcagaca atgtatattc ttatgcagga aaagcagaat gttgagctaa aagaaaggct 3121 ttttccattt tcgagaggca acgagaagaa aagaagaaga aaaaatagag taaataataa 3181 gccccacggg aggcgaagtt cttttgtagc ttcatgttgt ctaagctatt gatattgttt 3241 gtaccctata ttttatttct ctgtctttgt gtatgttttg ttcaatttcg atctccttgc 3301 aaaatgcaga gattatgagc tgaataaagt aagttaatat tattatacgt gttaattttc 3361 tactcctctc tctagccagc cttttgtttt ctctttttct tatttcattt tctttatatc 3421 aaaccatttt aggagagggc caaggagtga tccagtaaaa cacgaagatt agaagaaact 3481 tcccttcttt ttttttatct gaaaaaaatt taaagtcgag atttatctct ttttgtaatg 3541 gaatcatttc cacagttatg actattttca aattgatatc gccagaccta acaaattaaa 3601 ttactccttc cgctcatctg gccgtacact tctgctccat ctggccgtac agttctgctc 3661 catctggcag tgtacggaga atggataact taatattcag tatgaatttg gacataaaat 3721 cttctaagat tgctagtgaa ttttgtgagc gttattgtgt taaaacaaaa gacagaaaat 3781 tattagaaga attaaagaaa tatgaaaata gtatagaatc agaaatattt cgagtccaca 3841 agttttttgt gtgttcttaa ggaatgtttt aacccctcac acgttgccaa ggtaatggat 3901 taaatcctcc caagataaaa cggaataaac cttcctgcaa cagtggcaat acaaactgca 3961 ggtcgactct agggatcccc gggcgagcga attc // LOCUS MUSIGHABN 336 bp ss-mRNA ROD 17-JUL-1990 DEFINITION Mouse Ig heavy-chain mRNA V-D-J region, partial cds. ACCESSION M34574 KEYWORDS diversity region; immunoglobulin heavy-chain; joining exon; processed gene; variable region. SOURCE Mouse (strain B10.H-2aH-4bp/Wts) from CH lymphoma B cell line, cDNA to mRNA, clone CH1. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 336) AUTHORS Pennell,C.A., Arnold,L.W., Haughton,G. and Clarke,S.H. TITLE Restricted Ig variable region gene expression among Ly-1+ B cell lymphomas JOURNAL J. Immunol. 141, 2788-2796 (1988) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by C.A.Pennell, 20-APR-1990. FEATURES from to/span description pept < 1 > 336 Ig heavy-chain V-D-J region (AA at 1) recomb 276 277 V-region DNA end/D-region DNA start recomb 285 286 D-region DNA end/J-region DNA start BASE COUNT 86 a 76 c 95 g 79 t ORIGIN Chromosome 12. 1 actggaggag gcttggtgca acctgggggg tcacggggac tctcttgtga aggctcaggg 61 ttcactttta gtggcttctg gatgagctgg gttcgacaga cacctgggaa gaccctggag 121 tggattggag acattaattc tgatggcagt gcaataaact acgcaccatc cataaaggat 181 cgattcacta tcttcagaga caatgacaag agcaccctgt acctgcagat gagcaatgtg 241 cgatcggagg acacagccac gtatttctgt atgagatata gtaactactg gtacttcgat 301 gtctggggca caaggaccac ggtcaccgtc tcctca // LOCUS MUSIGHABO 357 bp ss-mRNA ROD 17-JUL-1990 DEFINITION Mouse Ig heavy-chain mRNA V-D-J region, partial cds. ACCESSION M34575 KEYWORDS diversity region; immunoglobulin heavy-chain; joining exon; processed gene; variable region. SOURCE Mouse (strain B10.H-2aH-4bp/Wts) from CH lymphoma B cell line, cDNA to mRNA, clone CH9. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 357) AUTHORS Pennell,C.A., Arnold,L.W., Haughton,G. and Clarke,S.H. TITLE Restricted Ig variable region gene expression among Ly-1+ B cell lymphomas JOURNAL J. Immunol. 141, 2788-2796 (1988) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by C.A.Pennell, 20-APR-1990. FEATURES from to/span description pept < 1 > 357 Ig heavy-chain V-D-J-region (AA at 1) recomb 294 295 V-region DNA end/D-region DNA start recomb 306 307 D-region DNA end/J-region DNA start BASE COUNT 89 a 77 c 106 g 85 t ORIGIN Chromosome 12. 1 gaagtgcagc tgttggagac tggaggaggc ttggtgcaac ctggggggtc acggggactc 61 tcttgtgaag gctcagggtt cacttttagt ggcttctgga tgagctgggt tcgacagaca 121 cctgggaaga ccctggagtg gattggagac attaattctg atggcagtgc aataaactac 181 gcagcatcca taaaggatcg attcactatc ttcagagaca atgacaagag caccctgtac 241 ctgcagatga gcaatgtgcg atcggaggac acagccacgt atttctgtat gagatatgat 301 ggttactact ggtacttcga tgtctggggc acaaggacca cggtcaccgt ctcctca // LOCUS MUSIGHABP 354 bp ss-mRNA ROD 17-JUL-1990 DEFINITION Mouse Ig heavy-chain mRNA V-D-J region, partial cds. ACCESSION M34576 KEYWORDS diversity region; immunoglobulin heavy-chain; joining exon; processed gene; variable region. SOURCE Mouse (strain B10.H-2aH-4bp/Wts) from CH lymphoma B cell line, cDNA to mRNA, clone CH34. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 354) AUTHORS Pennell,C.A., Arnold,L.W., Haughton,G. and Clarke,S.H. TITLE Restricted Ig variable region gene expression among Ly-1+ B cell lymphomas JOURNAL J. Immunol. 141, 2788-2796 (1988) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by C.A.Pennell, 20-APR-1990. FEATURES from to/span description pept < 1 > 354 Ig heavy-chain V-D-J-region (AA at 1) recomb 294 295 V-region DNA end/D-region DNA start recomb 303 304 D-region DNA end/J-region DNA start BASE COUNT 88 a 78 c 104 g 83 t 1 others ORIGIN Chromosome 12. 1 gaantgcagc tgttggagac tggaggaggc ttggtgcaac ctggggggtc acggggactc 61 tcttgtgaag gctcagggtt cacttttagt ggcttctgga tgagctgggt tcgacagaca 121 cctgggaaga ccctggagtg gattggagac attaattctg atggcagtgc aataaactac 181 gcaccatcca taaaggatcg attcactatc ttcagagaca atgacaagag caccctgtac 241 ctgcagatga gcaatgtgcg atcggaggac acagccacgt atttctgtat gagatatggt 301 aactactggt acttcgatgt ctggggcaca gggaccacgg tcaccgtctc ctca // LOCUS MUSIGHABQ 366 bp ss-mRNA ROD 17-JUL-1990 DEFINITION Mouse Ig heavy-chain mRNA V-D-J region, partial cds. ACCESSION M34577 KEYWORDS diversity region; immunoglobulin heavy-chain; joining exon; processed gene; variable region. SOURCE Mouse (strain B10.H-2aH-4bp/Wts) from CH lymphoma B cell line, cDNA to mRNA, clone CH2. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 366) AUTHORS Pennell,C.A., Arnold,L.W., Haughton,G. and Clarke,S.H. TITLE Restricted Ig variable region gene expression among Ly-1+ B cell lymphomas JOURNAL J. Immunol. 141, 2788-2796 (1988) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by C.A.Pennell, 20-APR-1990. FEATURES from to/span description pept < 1 > 366 Ig heavy chain V-D-J-region (AA at 1) recomb 300 301 V-region DNA end/D-region DNA start recomb 318 319 D-region DNA end/J-region DNA start BASE COUNT 89 a 84 c 97 g 96 t ORIGIN Chromosome 12. 1 gaggtgaagc tggtggagtc tggaggaggc ttggtacagc ctgggggttc tctgagtctc 61 tcctgtgcag cttctggatt caccttcact gattactaca tgagctgggt ccgccagcct 121 ccagggaagg cacttgagtg gttgggtttt attagaaaca aagctaatgg ttacacaaca 181 gagtacagtg catctgtgaa gggtcggttc accatctcca gagataattc ccaaagcatc 241 ctctatcttc aaatgaatgc cctgagagct gaggacagtg ccacttatta ctgtgcaaga 301 tattactacg gtagtagcta tgctatggac tactggggtc aaggaacctc agtcaccgtc 361 tcctca // LOCUS MUSIGHABR 360 bp ss-mRNA ROD 17-JUL-1990 DEFINITION Mouse Ig heavy-chain mRNA V-D-J region, partial cds. ACCESSION M34578 KEYWORDS diversity region; immunoglobulin heavy-chain; joining exon; processed gene; variable region. SOURCE Mouse (strain B10.H-2aH-4bp/Wts) from CH lymphoma B cell line, cDNA to mRNA, clone CH5. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 360) AUTHORS Pennell,C.A., Arnold,L.W., Haughton,G. and Clarke,S.H. TITLE Restricted Ig variable region gene expression among Ly-1+ B cell lymphomas JOURNAL J. Immunol. 141, 2788-2796 (1988) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by C.A.Pennell, 20-APR-1990. FEATURES from to/span description pept < 1 > 360 Ig heavy chain V-D-J-region (AA at 1) recomb 300 301 V-region DNA end/D-region DNA start recomb 315 316 D-region DNA end/J-region DNA start BASE COUNT 85 a 87 c 93 g 95 t ORIGIN Chromosome 12. 1 gaggtgaagc tggtggagtc tggaggaggc ttggtacagc ctgggggttc tctgagtctc 61 tcctgtgcag cttctggatt caccttcact gattactaca tgagctgggt ccgccagcct 121 ccagggaagg cacttgagtg gttgggtttt attagaaaca aagctaatgg ttacacaaca 181 gagtacagtg catctgtgaa gggtcggttc accatctcca gagataattc ccaaagcatc 241 ctctatcttc aaatgaatgc cctgagagct gaggacagtg ccacttatta ctgtgcaaga 301 tccctctatg atggttactt tgactactgg ggccaaggca ccactctcac agtctcctca // LOCUS MUSIGHABS 360 bp ss-mRNA ROD 17-JUL-1990 DEFINITION Mouse Ig heavy-chain mRNA V-D-J region, partial cds. ACCESSION M34579 KEYWORDS diversity region; immunoglobulin heavy-chain; joining exon; processed gene; variable region. SOURCE Mouse (strain B10.H-2aH-4bp/Wts) from CH lymphoma B cell line, cDNA to mRNA, clone CH10. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 360) AUTHORS Pennell,C.A., Arnold,L.W., Haughton,G. and Clarke,S.H. TITLE Restricted Ig variable region gene expression among Ly-1+ B cell lymphomas JOURNAL J. Immunol. 141, 2788-2796 (1988) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by C.A.Pennell, 20-APR-1990. FEATURES from to/span description pept < 1 > 360 Ig heavy chain V-D-J-region (AA at 1) recomb 294 295 V-region DNA end/D-region DNA start recomb 309 310 D-region DNA end/J-region DNA start BASE COUNT 90 a 93 c 93 g 83 t 1 others ORIGIN Chromosome 12. 1 caggtccaac tgcagcngcc tggggctgag ctggtgaggc ctgggtcttc agtgaagctg 61 tcctgcaagg cttctggcta caccttcacc agctactgga tgcattgggt gaagcagagg 121 cctatacaag gccttgaatg gattggtaac attgaccctt ctgatagtga aactcactac 181 aatcaaaagt tcaaggacaa ggccacattg actgtagaca aatcctccag cacagcctac 241 atgcagctca gcagcctgac atctgaggac tctgcggtct attactgtgc aagagggatt 301 tatagtaact actggtactt cgatgtctgg ggcacaggga ccacggtcac cgtctcctca // LOCUS MUSIGHABT 357 bp ss-mRNA ROD 17-JUL-1990 DEFINITION Mouse Ig heavy-chain mRNA V-D-J region, partial cds. ACCESSION M34580 KEYWORDS diversity region; immunoglobulin heavy-chain; joining exon; processed gene; variable region. SOURCE Mouse (strain B10.H-2aH-4bp/Wts) from CH lymphoma B cell line, cDNA to mRNA, clone CH31. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 357) AUTHORS Pennell,C.A., Arnold,L.W., Haughton,G. and Clarke,S.H. TITLE Restricted Ig variable region gene expression among Ly-1+ B cell lymphomas JOURNAL J. Immunol. 141, 2788-2796 (1988) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by C.A.Pennell, 20-APR-1990. FEATURES from to/span description pept < 1 > 357 Ig heavy chain V-D-J-region (AA at 1) recomb 294 295 V-region DNA end/D-region DNA start recomb 306 307 D-region DNA end/J-region DNA start BASE COUNT 90 a 94 c 89 g 82 t 2 others ORIGIN Chromosome 12. 1 caggtccaac tgcancancc tggggctgag ctggtgaggc ctgggtcttc agtgaagctg 61 tcctgcaagg cttctggcta caccttcacc agctactgga tgcattgggt gaagcagagg 121 cctatacaag gccttgaatg gattggtaac attgaccctt ctgatagtga aactcactac 181 aatcaaaagt tcaaggacaa ggccacattg actgtagaca aatcctccag cacagcctac 241 atgcagctca gcagcctgac atctgaggac tctgcggtct attactgtgc aagatactat 301 ggtaactact ggtacttcga tgtctggggc acagggacca cggtcaccgt ctcctca // LOCUS MUSIGHABU 366 bp ss-mRNA ROD 17-JUL-1990 DEFINITION Mouse Ig heavy-chain mRNA V-D-J region, partial cds. ACCESSION M34581 KEYWORDS diversity region; immunoglobulin heavy-chain; joining exon; processed gene; variable region. SOURCE Mouse (strain B10.H-2aH-4bp/Wts) from CH lymphoma B cell line, cDNA to mRNA, clone CH12. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 366) AUTHORS Pennell,C.A., Arnold,L.W., Haughton,G. and Clarke,S.H. TITLE Restricted Ig variable region gene expression among Ly-1+ B cell lymphomas JOURNAL J. Immunol. 141, 2788-2796 (1988) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by C.A.Pennell, 20-APR-1990. FEATURES from to/span description pept < 1 > 366 Ig heavy chain V-D-J-region (AA at 1) recomb 294 295 V-region DNA end/D-region DNA start recomb 321 322 D-region DNA end/J-region DNA start BASE COUNT 95 a 93 c 96 g 81 t 1 others ORIGIN Chromosome 12. 1 caggtccaac tgcagcagcc tgggactgaa ctggtnaagc ctggggcttc agtgaagctg 61 tcctgcaagg cttctggcta caccttcacc agctactgga tgcactgggt gaagcagagg 121 cctggacaag gccttgagtg gattggaaat attaatccta gcaatggtgg tactaactac 181 aatgagaagt tcaagagcaa ggccacactg actgtagaca aatcctccag cacagcctac 241 atgcagctca gcagcctgac atctgaggac tctgcggtct attattgtgc aagagattac 301 tacggtagta gctgggggta ctactttgac tactggggcc aaggcaccac tctcacagtc 361 tcctca // LOCUS MUSIGHABV 321 bp ss-mRNA ROD 17-JUL-1990 DEFINITION Mouse Ig heavy-chain mRNA V-D-J region, partial cds. ACCESSION M34582 KEYWORDS diversity region; immunoglobulin heavy-chain; joining exon; processed gene; variable region. SOURCE Mouse (strain B10.H-2aH-4bp/Wts) from CH lymphoma B cell line, cDNA to mRNA, clone CH27. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 321) AUTHORS Pennell,C.A., Arnold,L.W., Haughton,G. and Clarke,S.H. TITLE Restricted Ig variable region gene expression among Ly-1+ B cell lymphomas JOURNAL J. Immunol. 141, 2788-2796 (1988) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by C.A.Pennell, 20-APR-1990. FEATURES from to/span description pept < 1 > 321 Ig heavy chain V-D-J-region (AA at 1) recomb 258 259 V-region DNA end/D-region DNA start recomb 270 271 D-region DNA end/J-region DNA start BASE COUNT 81 a 90 c 63 g 81 t 6 others ORIGIN Chromosome 12. 1 aaaccctnac agtnactctt nnttacctgc tctattactg ntttccccat caccagtggt 61 tactactgga tctggatccg tcagtcacct gggaaacccc tagaatggat ggggtacatc 121 actcatagtg gggaaacttt ctacaaccca tctctccaga gccccatctc cattactaga 181 gaaacgtcaa agaaccagtt cttcctccaa ttgaactctg tgaccacaga ggacacagcc 241 atgtattact gtncaggaga ctatgatggt tactggtact tcgatgtctg gggcacaggg 301 accacggtca ccgtctcctc a // LOCUS MUSIGHABW 336 bp ss-mRNA ROD 17-JUL-1990 DEFINITION Mouse Ig heavy-chain mRNA V-D-J region, partial cds. ACCESSION M34583 KEYWORDS diversity region; immunoglobulin heavy-chain; joining exon; processed gene; variable region. SOURCE Mouse (strain B10.H-2aH-4bp/Wts) from CH lymphoma B cell line, cDNA to mRNA, clone CH32. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 336) AUTHORS Pennell,C.A., Arnold,L.W., Haughton,G. and Clarke,S.H. TITLE Restricted Ig variable region gene expression among Ly-1+ B cell lymphomas JOURNAL J. Immunol. 141, 2788-2796 (1988) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by C.A.Pennell, 20-APR-1990. FEATURES from to/span description pept < 1 > 336 Ig heavy chain V-D-J-region (AA at 1) recomb 273 277 V-region DNA end/D-region DNA start recomb 285 286 D-region DNA end/J-region DNA start BASE COUNT 82 a 99 c 71 g 82 t 2 others ORIGIN Chromosome 12. 1 ggacctggcc tggtgaaacc ctcacagtna ctcttcctta cctgctctat tactggtttc 61 cccatcacca gtggttacta ctggatctgg atccgtcagt cacctgggaa acccctagaa 121 tggatggggt acatcactca tagtggggaa actttctaca acccatctct ccagagcccc 181 atctccatta ctagagaaac gtcaaagaac cagttcttcc tccaattgaa ctctgtgacc 241 acagaggaca cagccatgta ttactgtnca ggagacaccg atggttactg gtacttcgat 301 gtctggggca cagggaccac ggtcaccgtc tcctca // LOCUS MUSIGKABQ 306 bp ss-mRNA ROD 17-JUL-1990 DEFINITION Mouse Ig kappa-chain mRNA V-J region, partial cds. ACCESSION M34586 KEYWORDS immunoglobulin heavy-chain; joining exon; kappa-immunoglobulin; processed gene; variable region. SOURCE Mouse (strain B10.H-2aH-4bp/Wts) from CH lymphoma B cell line, cDNA to mRNA, clone CH5. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 306) AUTHORS Pennell,C.A., Arnold,L.W., Haughton,G. and Clarke,S.H. TITLE Restricted Ig variable region gene expression among Ly-1+ B cell lymphomas JOURNAL J. Immunol. 141, 2788-2796 (1988) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by C.A.Pennell, 20-APR-1990. FEATURES from to/span description pept < 1 > 306 Ig kappa-chain V-J-region (AA at 1) recomb 284 285 V-region DNA end/J-region DNA start BASE COUNT 75 a 85 c 77 g 69 t ORIGIN Chromosome 6. 1 caaattgttc tcacccagtc tccagcaatc atgtctgcat ctccagggga gaaggtcacc 61 atgacctgca gtgccagctc aagtgtaagt tacatgcact ggtaccagca gaagtcaggc 121 acctccccca aaagatggat ttatgacaca tccaaactgg cttctggagt ccctgctcgc 181 ttcagtggca gtgggtctgg gacctcttac tctctcacaa tcagcagcat ggaggctgaa 241 gatgctgcca cttattactg ccagcagtgg agtagtaacc cgtggacgtt cggtggaggc 301 accaag // LOCUS MUSIGKABR 339 bp ss-mRNA ROD 17-JUL-1990 DEFINITION Mouse Ig kappa-chain mRNA V-J region, partial cds. ACCESSION M34588 KEYWORDS immunoglobulin heavy-chain; joining exon; kappa-immunoglobulin: processed gene; variable region. SOURCE Mouse (strain B10.H-2aH-4bp/Wts) from CH lymphoma B cell line, cDNA to mRNA, clone CH10. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 339) AUTHORS Pennell,C.A., Arnold,L.W., Haughton,G. and Clarke,S.H. TITLE Restricted Ig variable region gene expression among Ly-1+ B cell lymphomas JOURNAL J. Immunol. 141, 2788-2796 (1988) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by C.A.Pennell, 20-APR-1990. FEATURES from to/span description pept < 1 > 339 Ig kappa-chain V-J-region (AA at 1) recomb 302 303 V-region DNA end/J-region DNA start BASE COUNT 89 a 81 c 81 g 86 t 2 others ORIGIN Chromosome 6. 1 gatgttttga tgacccaaac tccactctcc ctncctgtca gtcttggaga tcaagcctcc 61 atctcttgca gatctagtca gagcattgta catagtaatg gaaacaccta tttagaatgg 121 tacctgcaga aaccaggcca gtctccaaag ctcctnatct acaaagtttc caaccgattt 181 tctggggtcc cagacaggtt cagtggcagt ggatcaggga cagatttcac actcaagatc 241 agcagagtgg aggctgagga tctgggagtt tattactgct ttcaaggttc acatgttccg 301 tggacgttcg gtggaggcac caagctggaa atcaaacgg // LOCUS MUSIGKABS 339 bp ss-mRNA ROD 17-JUL-1990 DEFINITION Mouse Ig kappa-chain mRNA V-J region, partial cds. ACCESSION M34589 KEYWORDS immunoglobulin heavy-chain; joining exon; kappa-immunoglobulin; processed gene; variable region. SOURCE Mouse (strain B10.H-2aH-4bp/Wts) from CH lymphoma B cell line, cDNA to mRNA, clone CH31. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 339) AUTHORS Pennell,C.A., Arnold,L.W., Haughton,G. and Clarke,S.H. TITLE Restricted Ig variable region gene expression among Ly-1+ B cell lymphomas JOURNAL J. Immunol. 141, 2788-2796 (1988) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by C.A.Pennell, 20-APR-1990. FEATURES from to/span description pept < 1 > 339 Ig kappa-chain V-J-region (AA at 1) recomb 302 303 V-region DNA end/J-region DNA start BASE COUNT 89 a 79 c 82 g 86 t 3 others ORIGIN Chromosome 6. 1 gatgttttga tgacncaaac tccactctcc ctgcctgtca gtcttggaga tcaagcctcc 61 atctcttgca gatctagtca gagcattgta catagtaatg gaaacaccta tttagaatgg 121 tacctgcaga aaccaggcca gtctccaaag ctcctnatct acaaagtttc caaccgattt 181 tctggggtcc caganaggtt cagtggcagt ggatcaggga cagatttcac actcaagatc 241 agcagagtgg aggctgagga tctgggagtt tattactgct ttcaaggttc acatgttccg 301 tggacgttcg gtggaggcac caagctggaa atcaaacgg // LOCUS MUSIGKABT 323 bp ss-mRNA ROD 17-JUL-1990 DEFINITION Mouse Ig kappa-chain mRNA V-J region, partial cds. ACCESSION M34590 KEYWORDS immunoglobulin heavy-chain; joining exon; kappa-immunoglobulin; processed gene; variable region. SOURCE Mouse (strain B10.H-2aH-4bp/Wts) from CH lymphoma B cell line, cDNA to mRNA, clone CH12. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 323) AUTHORS Pennell,C.A., Arnold,L.W., Haughton,G. and Clarke,S.H. TITLE Restricted Ig variable region gene expression among Ly-1+ B cell lymphomas JOURNAL J. Immunol. 141, 2788-2796 (1988) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by C.A.Pennell, 20-APR-1990. FEATURES from to/span description pept < 1 > 323 Ig kappa-chain V-J-region (AA at 1) recomb 288 289 V-region DNA end/J-region DNA start BASE COUNT 96 a 81 c 70 g 76 t ORIGIN Chromosome 6. 1 gatatccaga tgacacagac tacatcctcc ctgtctgcct ctctgggaga cagagtcacc 61 atcagttgca gggcaagtca ggacattagc aattatttaa actggtatca gcagaaacca 121 gatggaactg ttaaactcct gatctactac acatcaagat tacactcagg agtcccatca 181 aggttcagtg gcagtgggtc tggaacagat tattctctca ccattagcaa cctggagcaa 241 gaagatattg ccacttactt ttgccaacag ggtaatacgc ttcctccgac gttcggtgga 301 ggcaccaagc tggaaatcaa acg // LOCUS MUSIGKABU 339 bp ss-mRNA ROD 17-JUL-1990 DEFINITION Mouse Ig kappa-chain mRNA V-J region, partial cds. ACCESSION M34591 KEYWORDS immunoglobulin heavy-chain; joining exon; kappa-immunoglobulin; processed gene; variable region. SOURCE Mouse (strain B10.H-2aH-4bp/Wts) from CH lymphoma B cell line, cDNA to mRNA, clone CH27. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 339) AUTHORS Pennell,C.A., Arnold,L.W., Haughton,G. and Clarke,S.H. TITLE Restricted Ig variable region gene expression among Ly-1+ B cell lymphomas JOURNAL J. Immunol. 141, 2788-2796 (1988) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by C.A.Pennell, 20-APR-1990. FEATURES from to/span description pept < 1 > 339 Ig kappa-chain V-J-region (AA at 1) recomb 290 291 V-region DNA end/J-region DNA start BASE COUNT 84 a 91 c 81 g 82 t 1 others ORIGIN Chromosome 6. 1 gagaatgtgc tcacccagtc tccaaccacc atggctgcat ctcccggnga gaagatcact 61 atcacctgca gtgccagctc aagtataagt tccaattact tgcattggta tcagcagaag 121 ccaggattct cccctaaact cttgatttat aggacatcca atctggcttc tggagtccca 181 gctcgcttca gtggcagtgg gtctgggacc tcttactctc tcacaattgg caccatggag 241 gctgaagatg ttgccactta ctactgccag cagggtagta gtataccacg cacgttcggc 301 tcggggacaa agttggaaat aaaacgggct gatgctgca // LOCUS MUSIGKABV 327 bp ss-mRNA ROD 17-JUL-1990 DEFINITION Mouse Ig kappa-chain mRNA V-J region, partial cds. ACCESSION M34592 KEYWORDS immunoglobulin heavy-chain; joining exon; kappa-immunoglobulin; processed gene; variable region. SOURCE Mouse (strain B10.H-2aH-4bp/Wts) from CH lymphoma B cell line, cDNA to mRNA, clone CH32. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 327) AUTHORS Pennell,C.A., Arnold,L.W., Haughton,G. and Clarke,S.H. TITLE Restricted Ig variable region gene expression among Ly-1+ B cell lymphomas JOURNAL J. Immunol. 141, 2788-2796 (1988) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by C.A.Pennell, 20-APR-1990. FEATURES from to/span description pept < 1 > 327 Ig kappa-chain V-J-region (AA at 1) recomb 290 291 V-region DNA end/J-region DNA start BASE COUNT 82 a 87 c 80 g 78 t ORIGIN Chromosome 6. 1 gagaatgtgc tcacccagtc tccaaccacc atggctgcat ctcccgggga gaagatcact 61 atcacctgca gtgccagctc aagtataagt tccaattact tgcattggta tcagcagaag 121 ccaggattct cccctaaact cttgatttat aggacatcca atctggcttc tggagtccca 181 gctcgcttca gtggcagtgg gtctgggacc tcttactctc tcacaattgg caccatggag 241 gctgaagatg ttgccactta ctactgccag cagggtagta gtataccgta cacgttcgga 301 ggggggacca agctggaaat aaaacgg // LOCUS MUSIGKABW 324 bp ss-mRNA ROD 17-JUL-1990 DEFINITION Mouse Ig kappa-chain mRNA V-J region, partial cds. ACCESSION M34593 KEYWORDS immunoglobulin heavy-chain; joining exon; kappa-immunoglobulin; processed gene; variable region. SOURCE Mouse (strain B10.H-2aH-4bp/Wts) from CH lymphoma B cell line, cDNA to mRNA, clone CH34. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 324) AUTHORS Pennell,C.A., Arnold,L.W., Haughton,G. and Clarke,S.H. TITLE Restricted Ig variable region gene expression among Ly-1+ B cell lymphomas JOURNAL J. Immunol. 141, 2788-2796 (1988) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by C.A.Pennell, 20-APR-1990. FEATURES from to/span description pept < 1 > 324 Ig kappa-chain V-J-region (AA at 1) recomb 287 288 V-region DNA end/J-region DNA start BASE COUNT 96 a 76 c 80 g 72 t ORIGIN Chromosome 6. 1 gacatcaaga tgacccagtc tccatcctcc atgtatgcat cgctgggaga gagagtcact 61 atcacttgca aggcgagtca ggacattaaa agctatttaa gctggtacca gcagaaacca 121 tggaaatctc ctaagaccct gatctattat gcaacaagct tggcagatgg ggtcccatca 181 agattcagtg gcagtggatc tgggcaagat tattctctaa ccatcagcag cctggagtct 241 gacgatacag caacttatta ctgtctacag catggtgaga gcccgtacac gttcggaggg 301 gggaccaagc tggaaataaa acgt // LOCUS MUSIGLBLP 349 bp ss-mRNA ROD 17-JUL-1990 DEFINITION Mouse Ig lambda-chain mRNA V-J region, partial cds. ACCESSION M34587 KEYWORDS immunoglobulin heavy-chain; joining exon; lambda-immunoglobulin; processed gene; variable region. SOURCE Mouse (strain B10.H-2aH-4bp/Wts) from CH lymphoma B cell line, cDNA to mRNA, clone CH9. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 349) AUTHORS Pennell,C.A., Arnold,L.W., Haughton,G. and Clarke,S.H. TITLE Restricted Ig variable region gene expression among Ly-1+ B cell lymphomas JOURNAL J. Immunol. 141, 2788-2796 (1988) STANDARD full staff_review REFERENCE 2 (bases 1 to 349; revises [1]) AUTHORS Pennell,C.A., Arnold,L.W., Haughton,G. and Clarke,S.H. JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [2] kindly submitted by C.A.Pennell, 20-APR-1990. FEATURES from to/span description pept < 1 > 349 Ig lambda-chain V-J-region (AA at 1) recomb 293 294 V-region DNA end/J-region DNA start BASE COUNT 87 a 94 c 81 g 86 t 1 others ORIGIN Chromosome 16. 1 caggctgttg tgactcagga atctgcactc accacatcac ctggtggaac agtcatactc 61 acttgtcgct caagtactgg ggctgttaca actagtaact atgccaactg ggtccaagaa 121 aaaccagatc atttattcac tggtctaata ggtggtacca gcaaccgagc tccaggtgtt 181 cctgtcagat tctcaggctc cctgattgga gacaaggctg ccctcaccat cacaggggca 241 cagactgagg atgatgcaat gtatttctgt gctctatggt acagcaccca ttatgttttc 301 ggcggtggaa ccaaggtcac tgtcctaggt cagcccaagt ccantccca // LOCUS MUSIGLBN 330 bp ss-mRNA ROD 17-JUL-1990 DEFINITION Mouse Ig lambda-chain mRNA V-J region, partial cds. ACCESSION M34584 KEYWORDS immunoglobulin light-chain; joining exon; lambda-immunoglobulin; processed gene; variable region. SOURCE Mouse (strain B10.H-2aH-4bp/Wts) from CH lymphoma B cell line, cDNA to mRNA, clone CH1. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 330) AUTHORS Pennell,C.A., Arnold,L.W., Haughton,G. and Clarke,S.H. TITLE Restricted Ig variable region gene expression among Ly-1+ B cell lymphomas JOURNAL J. Immunol. 141, 2788-2796 (1990) STANDARD full staff_review REFERENCE 2 (bases 1 to 330; revises [1]) AUTHORS Pennell,C.A., Arnold,L.W., Haughton,G. and Clarke,S.H. JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [2] kindly submitted by C.A.Pennell, 20-APR-1990. FEATURES from to/span description pept < 1 > 330 Ig lambda-chain V-J-region (AA at 1) recomb 291 292 V-region DNA end/J-region DNA start BASE COUNT 87 a 87 c 79 g 77 t ORIGIN Chromosome 16. 1 caggctgttg tgactcagga atctgcactc accacatcac ctggtgaaac agtcacactc 61 acttgtcgct caagtactgg ggctgttaca actagtaact atgccaactg ggtccaagaa 121 aaaccagatc atttattcac tggtctaata ggtggtacca acaaccgagc tccaggtgtt 181 cctgccagat tctcaggctc cctgattgga gacaaggctg ccctcaccat cacaggggca 241 cagactgagg atgaggcaat atatttctgt gctctatggt acagcaacca ctgggtgttc 301 ggtggaggaa ccaaactgac tgtcctaggc // LOCUS MUSIGLBO 330 bp ss-mRNA ROD 17-JUL-1990 DEFINITION Mouse Ig lambda-chain mRNA V-J region, partial cds. ACCESSION M34585 KEYWORDS immunoglobulin light-chain; joining exon; lambda-immunoglobulin; processed gene; variable region. SOURCE Mouse (strain B10.H-2aH-4bp/Wts) from CH lymphoma B cell line, cDNA to mRNA, clone CH2. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 330) AUTHORS Pennell,C.A., Arnold,L.W., Haughton,G. and Clarke,S.H. TITLE Restricted Ig variable region gene expression among Ly-1+ B cell lymphomas JOURNAL J. Immunol. 141, 2788-2796 (1988) STANDARD full staff_review REFERENCE 2 (bases 1 to 330; revises [1]) AUTHORS Pennell,C.A., Arnold,L.W., Haughton,G. and Clarke,S.H. JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [2] kindly submitted by C.A.Pennell, 20-APR-1990. FEATURES from to/span description pept < 1 > 330 Ig lambda-chain V-J-region (AA at 1) recomb 291 292 V-region DNA end/J-region DNA start BASE COUNT 87 a 86 c 78 g 79 t ORIGIN Chromosome 16. 1 caggctgttg tgactcagga atctgcactc accacatcac ctggtgaaac agtcacactc 61 acttgtcgct caagtactgg ggctgttaca actagtaact atgccaactg ggtccaagaa 121 aaaccagatc atttattcac tggtctaata ggtggtacca acaaccgagc tccaggtgtt 181 cctgccagat tctcaggctc cctgattgga gacaaggctg ccctcaccat cacaggggca 241 cagactgagg atgaggcaat atatttctgt gctctatggt acagcaacca tttggtgttc 301 ggtggaggaa ccaaactgac tgtcctaggc //