Path: utzoo!attcan!uunet!bu.edu!mit-eddie!snorkelwacker!bionet!root From: GenBank-Updates@genbank.bio.net Newsgroups: bionet.molbio.genbank.updates Subject: Database Update Message-ID: Date: 3 Aug 90 12:00:21 GMT Sender: root@genbank.BIO.NET Distribution: bionet Lines: 5504 Approved: lear@genbank.bio.net Checksum: 02213 315 LOCUS CLLRRE 1861 bp ss-rRNA RNA 03-AUG-1990 DEFINITION C.sapidus 18S rRNA. ACCESSION M34360 KEYWORDS 18S ribosomal RNA; ribosomal RNA. SOURCE C.sapidus rRNA. ORGANISM Callinectes sapidus Eukaryota; Animalia; Metazoa; Arthropoda; Crustacea; Malacostraca; Eucarida; Decapoda; Pleocyemata; Brachyura; Brachyrhyncha; Portunoidea; Portunidae. REFERENCE 1 (bases 1 to 1861) AUTHORS Kim,W. and Abele,L.G. TITLE Molecular phylogeny of selected decapod crustaceans based on 18S rRNA nucleotide sequences JOURNAL J. Crust. Biol. 10, 1-13 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by L.G.Abele, 19-MAY-1990. FEATURES from to/span description rRNA 1 1860 18S ribosomal RNA BASE COUNT 359 a 375 c 421 g 367 t 339 others ORIGIN 1 nncctggtng atcctgccag nagtcntnng cttgtctcaa annttaagcc nngcatgtct 61 nagtacaagc cgaatnaagg cgaaaccgcg aatggctnnn taaatcagct atgattcatt 121 nnatctgtac ccncncnnac ttggataact gtggtaattc tanagctaat acatgcatta 181 cgtctctgac cgcaagggaa gagngctttt attagttcaa aaccggtcgg gcctcggtcc 241 gnnnccccac tgtgttgaat ctgaataact ttttgctgag cgcacggtct cngcncgcgc 301 ngcctctttc aagtgtctgc cttatcagct ttcgattgta ggttatacgc ctacnatggc 361 tntnacgggt nacggggaat gagggttcga ttccggagag ngagcctgag aaacggctac 421 cacntctnag gnnggcagca ggcacgcnna ttacccactc cggcncgggg aggtagtgac 481 naaaaataac gatgcgagac tcatccgngg cctcgnnatc ggaatgagtn cactttaaat 541 cctttnacga ggatctattg gagggcnagt ctggtgccng cagccncggt nattccagct 601 gcaatanngt atattaaagt tgttgcggtt annaaagctc gtagttnnat ttcagttctg 661 gactgacggt tnccgcnngg tgcacactgt cacnctccga acagccacaa caccgctggc 721 cnnnggggtg ctcttcnccn ggtgtccnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 781 nnnnnnnnnn nnnnnnnnnn nnnnnnnncc tgaatgccta tgcantggaa taatggaata 841 ggacctcggn nctnttttgt cggttttctg aacccgaggt aatgactaat aggaacnggc 901 gggggcnttc gtattgcgac gctagaggtg aaattcttgg accgtcgcna gacgaactnc 961 tgcgaaagca tttgccnagg atgtttcntt natcnagaan gaaagttaga ggttcgaagg 1021 cgatcagata ccgcnnnnnn nnnaaccnta aacgatgctg accagcgatc cgccggnntt 1081 attnncatga cccggccncc agcttccggg aaaccaaagt ctttgggttc cgggggaagt 1141 atggttgcaa agctgaaact caaaggaatt gacggnnnnn nnnnnnnnnn nnnnnnnnnn 1201 nnnnnnnnnn nnnnnnnnac acggggaacc tcaccaggcc cagacaccgg aagganngac 1261 agattgagag ctcnntctca ttnggtgggt ggtngtgcat nncgngttct tagttggtgg 1321 agcgnnnnnn nnnnnnnnnn ncgatnacga acgagannnn nnnnnnnnnn nnnnnnnnnn 1381 nnnnnnnnnn nnnnnngtgt ccagttcgca gcttcttctt agagggataa cggcaattct 1441 agccgcacga gattgagcaa taacaagtct gtgatgccct tagatgttct gggcgcacgc 1501 gcgctacact gaagggatca acgtgtcctc ccnctccgag aggagcgggn nncccgttga 1561 aatccnttca tgatagggat tggggtttgc aattgtctcc catgaannng gaattcccag 1621 taagcgcaag tcatgagctt gcgntgattn ngtccctncc nnttgtacac accnnnnntc 1681 gctactaccg attgaatgat ttagtgaggc ttcggactgg cgctcttgga tgccggnccc 1741 gagnggttcn ncgccggnnc ncggcgcctc gagctgacgg aaagatgtcc aaacttgatn 1801 nnnnnnnnnn nnnnnaagtc gtaacaaggt nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 1861 n // LOCUS HUMNCSRC 138 bp ds-DNA PRI 03-AUG-1990 DEFINITION Human membrane-associated tyrosine protein kinase (C-SRC) gene, exons 3, 4, NI, and NII, partial cds. ACCESSION M34469 KEYWORDS membrane-associated tryosine protein kinase. SOURCE Human adult brain DNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 138) AUTHORS Pyper,J.M. and Bolen,J.B. TITLE Identification of a novel neuronal C-SRC exon expressed in human brain JOURNAL Mol. Cell. Biol. 10, 2035-2040 (1990) STANDARD simple staff_review FEATURES from to/span description pept < 1 29 membrane-associated tyrosine protein kinase (C-SRC), exon 3 (AA at 1) 30 47 membrane-associated tyrosine protein kinase, exon NI 48 80 membrane-associated tyrosine protein kinase, exon NII 81 > 136 membrane-associated tyrosine protein kinase, exon 4 variant 117 117 a or g variant 135 135 c or t BASE COUNT 36 a 40 c 40 g 22 t ORIGIN 1 cggctccagc tccagattgt caacaacacg aggaaggtgg atgtcagcca gacctggttc 61 acattcagat ggctgcaaag agagggagac tggtggctgg cccactcgct cagcacagga 121 cagacaggct acatcccc // LOCUS MHVAPEPA 1000 bp ss-RNA VRL 03-AUG-1990 DEFINITION Murine coronavirus peplomer (S) protein gene. ACCESSION M34435 KEYWORDS peplomer protein. SOURCE Murine (strain JHM-DL) RNA. ORGANISM Murine hepatitis virus Viridae; ss-RNA enveloped viruses; Positive strand RNA virus; Coronaviridae. REFERENCE 1 (bases 1 to 1000) AUTHORS Banner,L.R., Keck,J.G. and Lai,M.M.C. TITLE A clustering of RNA recombination sites adjacent to a hypervariable region of the peplomer gene of Murine coronavirus JOURNAL Virology 175, 548-555 (1990) STANDARD simple staff_review BASE COUNT 258 a 192 c 227 g 317 t 6 others ORIGIN 1 tgagtctttg tcgtgtaata atattgatgc gtccaaagtg tatggtatgt gctttggtag 61 tgtctcagtt gataagtttg ctttcccccg aagccgtcaa attgattttc aaattggcaa 121 ctccggattt ttgcaaacgg ctaattataa gattgatatc gctgccacat catgtcagct 181 gtattacagt cttcctaaga ataatgttac cattaataac tataacccct cgtcttggaa 241 taggaggtat ggttttaatg atgctggtgt gtttggcaaa agtaaacatg atgttgccta 301 cgcccagnna tgttttnttg tgcgacctag ctattgtccg tgtgcacaac cggaaatagt 361 tagtgcttgc actagtcaga ccaaacccat gtctgcttat tgccccacag gcacaattca 421 tcgtgagtgt tctctttgga atgggcccca tttgcgctcg gcacgtgtag gttccggcac 481 gtacacgtgt gagtgcactt gtaaacccaa tccatttgat acgtatgatc tccgctgtgg 541 gcaaattaaa actattgtta atgtgggcga tcattgtgaa ggtctgggtg ttttagaaga 601 taaaggtggc aatagcgatc cacataaggg ctgttcttgt gccaatgatt cttttatcgg 661 atggtcacat gacacttgtt tagtaaatga tcgctgccca atttttgcta acatattgtt 721 aaatggcatt aatagtggga ctacgtgttc cacagattta caattgccta atactgaagt 781 ggccactggc gtttgcgtca gatatgacct ctatggtatt actggtcnag gtgtttttaa 841 agaggtcaag gcagnntatt ataatagctg gcaggcccta ttatatgatg ttaatggtaa 901 cttaaacggg ttccgtgacc ttaccactaa caagacttat acgataagga gctgttatag 961 tggccgtgtt tctgctgcat atcataaaga agcacccgaa // LOCUS MHVAPEPB 843 bp ss-RNA VRL 03-AUG-1990 DEFINITION Murine coronavirus peplomer (S) protein gene. ACCESSION M34436 KEYWORDS peplomer protein. SOURCE Murine (strain A59) RNA. ORGANISM Murine hepatitis virus Viridae; ss-RNA enveloped viruses; Positive strand RNA virus; Coronaviridae. REFERENCE 1 (bases 1 to 843) AUTHORS Banner,L.R., Keck,J.G. and Lai,M.M.C. TITLE A clustering of RNA recombination sites adjacent to a hypervariable region of the peplomer gene of Murine coronavirus JOURNAL Virology 175, 548-555 (1990) STANDARD simple staff_review FEATURES from to/span description site 400 401 deletion site BASE COUNT 227 a 153 c 183 g 280 t ORIGIN 1 tgagagtttg ttttgtaata atatcgatgc ttccaaagtg tatggcaggt gctttggtag 61 tatttcagtt gataagtttg ctgtaccccg aagtaggcaa gttgatttac agcttggtaa 121 ctctggattt ctgcagactg ctaattataa gattgataca gctgccactt cgtgtcagct 181 gcattacacc ttgcctaaga ataatgtcac cataaacaac cataacccct cgtcttggaa 241 taggaggtat ggctttaatg atgctggcgt ctttggcaaa aaccaacatg acgttgttta 301 cgctcagcaa tgttttactg taagatctag ttattgcccg tgtgctcaac cggacatagt 361 tagcccttgc actactcaga ctaagcctaa gtctgctttt ttaatgtggg tgaccattgt 421 gaaggcttag gtgttttaga agataattgt ggcaatgctg atccacataa gggttgtatc 481 tgtgccaaca attcatttat tggatggtca catgatacct gccttgttaa tgatcgctgc 541 caaatttttg ctaatatatt gttaaatggc attaatagtg gtaccacatg ttccacagat 601 ttgcagttgc ctaatactga agtggttact ggcatttgtg tcaaatatga cctctacggt 661 attactggac aaggtgtttt taaagaggtt aaggcagact attataatag ctggcaaacc 721 cttctgtatg atgttaatgg taatttgaat ggttttcgtg atcttaccac taacaagact 781 tatacgataa ggagctgtta tagtggccgt gtttctgctg catttcataa agatgcaccc 841 gaa // LOCUS MHVSP 3780 bp ss-mRNA VRL 03-AUG-1990 DEFINITION Mouse hepatitis virus surface protein S mRNA, complete cds. ACCESSION X04797 M34437 KEYWORDS glycoprotein; membrane glycoprotein; surface glycoprotein; surface projection glycoprotein. SOURCE Murine hepatitis virus (strain JHM). cDNA to viral RNA, clones pJMS1010, pJS112, and pJS92. ORGANISM Murine hepatitis virus Viridae; ss-RNA enveloped viruses; Positive strand RNA virus; Coronaviridae. REFERENCE 1 (bases 1 to 3780) AUTHORS Schmidt,I., Skinner,M. and Siddell,S. TITLE Nucleotide Sequence of the Gene Encoding the Surface Projection Glycoprotein of Coronavirus MHV-JHM JOURNAL J. Gen. Virol. 68, 47-56 (1987) STANDARD simple automatic COMMENT EMBL features not translated to GenBank features: key from to description SITE 24 30 put. region of homology preceeding mRNA 5' initiation codons SITE 121 129 pot. N-glycosylation site SITE 208 216 pot. N-glycosylation site SITE 430 438 pot. N-glycosylation site SITE 604 612 pot. N-glycosylation site SITE 1099 1107 pot. N-glycosylation site SITE 1333 1341 pot. N-glycosylation site SITE 1636 1644 pot. N-glycosylation site SITE 1732 1740 pot. N-glycosylation site SITE 1756 1764 pot. N-glycosylation site SITE 1825 1833 pot. N-glycosylation site SITE 1900 1914 put. proteolytic cleavage site SITE 1972 1980 pot. N-glycosylation site SITE 2023 2031 pot. N-glycosylation site SITE 2293 2301 pot. N-glycosylation site SITE 2440 2448 pot. N-glycosylation site SITE 3139 3147 pot. N-glycosylation site SITE 3301 3309 pot. N-glycosylation site SITE 3331 3339 pot. N-glycosylation site SITE 3388 3396 pot. N-glycosylation site SITE 3436 3444 pot. N-glycosylation site SITE 3499 3507 pot. N-glycosylation site SITE 3553 3654 pot. transmembrane domain SITE 3622 3675 cysteine-rich region SITE 3655 3660 charge cluster SITE 3715 3723 pot. N-glycosylation site FEATURES from to/span description pept 31 3738 surface protein S precursor sigp 31 60 surface protein S signal peptide matp 61 3736 surface protein S BASE COUNT 1030 a 718 c 800 g 1232 t ORIGIN 1 cttgtagttt aaatctaatc taatctaaac atgctgttcg tctttatttt actattaccc 61 tcttgtttag ggtatattgg tgattttaga tgtatccaga ccgtgaatta taacggcaat 121 aatgcttctg cgcctagcat tagcaccgaa gcagtcgatg tttccaaagg tcggggcact 181 tactatgttt tagatcgtgt ttacttaaat gccacgttat tgcttactgg ttattatcct 241 gtggacggtt ccaattatcg gaatctcgcg cttacaggca ctaatacctt aagccttacg 301 tggtttaaac caccctttct aagtgagttt aatgatggta tatttgctaa ggtccagaac 361 ctcaagacaa atacgccaac aggtgcaacc tcatattttc ccactatagt tataggtagt 421 ttgtttggta acacttccta taccgtagtt ttagagccat ataataatat tataatggct 481 tctgtttgta catataccat ttgtcaatta ccttacacac cctgtaagcc taataccaat 541 ggtaatcgtg ttattggatt ttggcacaca gatgtcaaac cgccgatttg tcttttaaag 601 cgtaatttta cgtttaatgt taatgcccct tggctttatt tccattttta tcagcagggt 661 ggtacttttt atgcgtacta tgcggataaa ccttccgcta ctacgttttt gtttagtgtg 721 tatattggcg acattttaac acagtatttt gtgttacctt ttatttgtac tccaacagct 781 ggtagcactt tagctccgct ctattgggtt acacctttac ttaagcgcca atatttgttt 841 aattttaatg aaaagggtgt cattactagt gctgttgatt gcgccagcag ctacattagt 901 gaaataaaat gtaagaccca aagtctctta ccgagtactg gtgtctatga tctatccggt 961 tacacggtcc aacctgttgg agttgtgtac cggcgtgttc ctaacctacc tgattgtaaa 1021 atagaggaat ggctcactgc taaatctgtg ccgtcacctc tcaattggga gcgtaggact 1081 ttccaaaatt gtaattttaa tttaagcagc ctgctacgtt atgtccaggc tgagtctttg 1141 tcgtgtaata atattgatgc gtccaaagtg tatggtatgt gctttggtag tgtctcagtt 1201 gataagtttg ctatcccccg aagccgtcaa attgatttac aaattggcaa ctccggattt 1261 ttgcaaacgg ctaattataa gattgatacc gctgccacat catgtcagct gtattacagt 1321 cttcctaaga ataatgttac cataaataac tataacccct cgtcttggaa taggaggtat 1381 ggttttaaag taaatgatcg ctgccaaatt tttgctaaca tattgttaaa tggcattaat 1441 agtgggacta cgtgttccac agatttacaa ttgcctaata ctgaagtggc cactggcgtt 1501 tgcgtcagat atgacctcta tggtattact ggtcaaggtg tttttaaaga ggtcaaggct 1561 gactattata atagctggca ggccctatta tatgatgtta atggtaactt aaacgggttc 1621 cgtgacctta ccactaacaa gacttatacg ataaggagct gttatagtgg ccgtgtttct 1681 gctgcatatc ataaagaagc acccgaaccg gctctgctct atcgtaatat aaattgtagt 1741 tatgttttta ctaataatat ttcccgtgag gaaaaccccc ttaactattt tgatagttat 1801 ttgggttgtg ttgttaatgc tgataaccgc acggatgagg cgcttcctaa ttgcaatctc 1861 cgtatgggtg ctggactatg cgtagattat tcaaagtcac gcagagcccg ccgatcagtt 1921 tctactggct atcgattaac cacattcgag ccatacatgc cgatgttagt caatgatagc 1981 gttcaatccg taggtggatt atatgagatg caaataccaa ccaattttac tattggtcat 2041 catgaggaat tcatccagat aagggctccc aaggtgacta tagattgtgc tgcatttgtt 2101 tgtggtgata acgctgcatg cagacagcag ttggttgagt atggctcttt ttgtgataat 2161 gttaatgcca ttcttaatga ggttaataac ctcttggata atatgcaatt acaagttgct 2221 agtgcattaa tgcagggtgt tactataagt tcgaggctgc cagatggcat ctccggccct 2281 atagatgaca ttaatttcag tcctctactt ggatgcatag gttcaacatg tgctgaagac 2341 ggcaatggac ctagtgcgat acgggggcgt tcagctatag aggatttatt atttgacaag 2401 gtcaaactat ctgacgttgg ctttgtcgag gcttataaca attgcactgg tggtcaagaa 2461 gttcgcgacc tcctttgcgt acagtctttt aatggcatca aagtattacc tcccgtgttg 2521 tctgagagtc aaatctctgg ctacacagcg ggtgctactg cggcagctat gttcccacct 2581 tggactgcag ctgctggtgt gccattcagt ttaaatgttc aatataggat taatggttta 2641 ggtgtcacta tgaatgttct tagtgagaac caaaagatga ttgctagtgc ttttaacaac 2701 gcgctcggtg ctattcagga agggttcgat gcaaccaatt ctgctctagg taagatccag 2761 tccgttgtta atgcaaacgc tgaagcactt aataatttat taaaccaact ttctaatagg 2821 tttggtgcta ttagtgcttc tttacaagaa attctaacgc ggcttgacgc tgtagaagca 2881 aaggcccaga tagatcgtct tattaatggc aggttaactg cacttaatgc gtatatatcc 2941 aagcaactca gtgatagtac gcttattaaa tttagtgctg ctcaggccat cgaaaaggtc 3001 aatgagtgcg ttaagagcca aactacgcgc attaatttct gtggcaatgg taatcacata 3061 ttatcacttg tccagaatgc gccttatggc ttatgtttta ttcatttcag ctacgtgcca 3121 acatccttta aaacggcaaa tgtgagtcct ggactatgca tttctggtga tagaggattg 3181 gcacctaaag ctggatattt tgttcaagat aatggagagt ggaagttcac aggcagtaat 3241 tattactacc ctgaacccat tacagataaa aatagtgttg ccatgatcag ttgcgctgtg 3301 aattacacaa aagcgcctga agttttcttg aacaactcaa taccaaatct acccgacttt 3361 aaggaggagt tagataaatg gtttaagaat cagacgtcta ttgcgcctga tttatccctc 3421 gatttcgaga agttaaatgt tactttcctg gacctgactt atgagatgaa caggattcag 3481 gatgcaatta agaagttaaa tgagagctac atcaacctca aggaagttgg cacatatgaa 3541 atgtatgtga aatggccttg gtatgtttgg ttgctaattg gtttagctgg tgtagctgtt 3601 tgtgtgttat tattctttat atgttgctgc acaggttgcg gctcatgttg ttttagaaaa 3661 tgcggaagtt gttgtgatga gtatggagga caccaggaca gtattgtgat acataatatt 3721 tcagcccatg aggattgact atcacagcct ctcctggaaa gacagaaaat ctaaacaatt // LOCUS MUSIGLAZ 713 bp ss-mRNA ROD 03-AUG-1990 DEFINITION Mouse Ig active lambda-chain mRNA Vx-J2-C2-region, complete cds. ACCESSION M34598 M29013 J03562 KEYWORDS constant region; immunoglobulin; immunoglobulin light chain; joining exon; lambda-immunoglobulin; processed gene; variable region. SOURCE Mouse (strain Balb/c AnPt) liver hybridoma B6, cDNA to mRNA, clone Y31. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 713) AUTHORS Sanchez,P., Marche,P.N., Le Guern,C. and Cazenave,P.-A. TITLE Structure of a third murine immunoglobulin lambda light chain variable region that is expressed in laboratory mice JOURNAL Proc. Natl. Acad. Sci. U.S.A. 84, 9185-9188 (1987) STANDARD simple staff_entry REFERENCE 2 (bases 2 to 450) AUTHORS Sanchez,P., Marche,P.N., Rueff-Juy,D. and Cazenave,P.-A. TITLE Mouse V-lambda-x gene sequence generates no junctonal diversity and is conserved in mammalian species JOURNAL J. Immunol. 144, 2816-2820 (1990) STANDARD simple staff_review REFERENCE 3 (bases 266 to 429) AUTHORS Sanchez,P. and Cazenave,P.-A. TITLE A new variable region in mouse immunoglobulin lambda light chains JOURNAL J. Exp. Med. 166, 265-270 (1987) STANDARD simple staff_entry FEATURES from to/span description pept 12 > 713 Ig lambda chain precursor V-x,J-2,C-2 region sigp 12 68 Ig lambda chain signal peptide matp 69 > 713 Ig lambda chain recomb 379 380 V-region end/J2-region start recomb 414 415 J2-region end/C2-region start BASE COUNT 175 a 190 c 159 g 189 t ORIGIN 1 gtacctgcat tatggcctgg actcctctct tcttcttctt tgttcttcat tgctcaggtt 61 ctttctccca acttgtgctc actcagtcat cttcagcctc tttctccctg ggagcctcag 121 caaaactcac gtgcaccttg agtagtcagc acagtacgta caccattgaa tggtatcagc 181 aacagccact caagcctcct aagtatgtga tggagcttaa gaaagatgga agccacagca 241 caggtgatgg gattcctgat cgcttctctg gatccagctc tggtgctgat cgctacctta 301 gcatttccaa catccagcct gaagatgaag caatatacat ctgtggtgtg ggtgatacaa 361 ttaaggaaca atttgtgtat gttttcggcg gtggaaccaa ggtcactgtc ctaggtcagc 421 ccaagtccac tcccactctc accgtgtttc caccttcctc tgaggagctc aaggaaaaca 481 aagccacact ggtgtgtctg atttccaact tttccccgag tggtgtgaca gtggcctgga 541 aggcaaatgg tacacctatc acccagggtg tggacacttc aaatcccacc aaagagggca 601 acaagttcat ggccagcagc ttcctacatt tgacatcgga ccagtggaga tctcacaaca 661 gttttacctg tcaagttaca catgaagggg acactgtgga gaagagtctg tct // LOCUS MUSIGLVD 681 bp ds-DNA ROD 03-AUG-1990 DEFINITION Mouse Ig germline lambda-chain gene Vx-J2-C2-region, complete cds. ACCESSION M34597 KEYWORDS constant region; germline; immunoglobulin light chain; joining exon; lambda-immunoglobulin; variable region. SOURCE Mouse (strain Balb/c AnPt) liver DNA, clone 30X2. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 681) AUTHORS Sanchez,P., Marche,P.N., Rueff-Juy,D. and Cazenave,P.-A. TITLE Mouse V-lambda-x gene sequence generates no junctonal diversity and is conserved in mammalian species JOURNAL J. Immunol. 144, 2816-2820 (1990) STANDARD simple staff_review FEATURES from to/span description pept 59 104 Ig lambda-chain precursor Vx-J2-C2 region, exon 1 209 531 Ig lambda-chain precursor Vx-J2-C2 region, exon 2 sigp 59 104 Ig lambda-chain signal peptide 209 219 Ig lambda-chain signal peptide matp 220 528 Ig lambda-chain IVS 105 208 Ig lambda-chain Vx-J2-C2 region intron A recomb 530 531 Vx-region end/J2-region start recomb 565 566 J2-region end/C2-region start BASE COUNT 179 a 156 c 136 g 210 t ORIGIN Chromosome 16. 1 tgaaccatag agagaactac aacctgctgt ctcagcagag atcagtagta cctgcattat 61 ggcctggact cctctcttct tcttctttgt tcttcattgc tcaggtcagg agaaccattt 121 gtaccctgaa cctcagttca tctgagaggc agatacattc tatatctgtc tgtaaatgtc 181 aggaaataaa cagtttctct attttcaggt tctttctccc aacttgtgct cactcagtca 241 tcttcagcct ctttctccct gggagcctca gcaaaactca cgtgcacctt gagtagtcag 301 cacagtacgt acaccattga atggtatcag caacagccac tcaagcctcc taagtatgtg 361 atggagctta agaaagatgg aagccacagc acaggtgatg ggattcctga tcgcttctct 421 ggatccagct ctggtgctga tcgctacctt agcatttcca acatccagcc tgaagatgaa 481 gcaatataca tctgtggtgt gggtgataca attaaggaac aatttgtgta accacagtaa 541 cggagataaa ggaggaagca ggacagaaac tttttttttt ctcttcaaag gtcttttcta 601 ccagaatcat tggttttttt ttttcttttt tgcttattaa taaagtagat agtctagcaa 661 tcctcttgga cttcgtaggg c // LOCUS PAERRE 1877 bp ss-rRNA RNA 03-AUG-1990 DEFINITION P.kadiakensis 18S rRNA. ACCESSION M34359 KEYWORDS 18S ribosomal RNA; ribosomal RNA. SOURCE P.kadiakensis rRNA. ORGANISM Palaemonetes kadiakensis Eukaryota; Animalia; Metazoa; Arthropoda; Crustacea; Malacostraca; Eucarida; Decapoda; Pleocyemata; Caridea; Palaemonoidea; Palaemonidae. REFERENCE 1 (bases 1 to 1877) AUTHORS Kim,W. and Abele,L.G. TITLE Molecular phylogeny of selected decapod crustaceans based on 18S rRNA nucleotide sequences JOURNAL J. Crust. Biol. 10, 1-13 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by L.G.Abele, 19-MAY-1990. FEATURES from to/span description rRNA 1 1877 18S ribosomal RNA BASE COUNT 339 a 321 c 399 g 313 t 505 others ORIGIN 1 nncctggtng atcctgccag nagtcntnng cttgtctcaa annttaagcc angcatgtgt 61 cagtacaggc cgctctaagg cgaaaccgcg aatggctnnn taaatcagtt atcattcatt 121 tnatctaaaa cnnnnnnnnn nnnnggnnaa nnnnggnaan ncnanagcnn nanacgtgac 181 ttgtnaacnc cgacnggaag ggaggagngc ttntattagt tgaaaaccaa gcgggccncg 241 gtccgnnnnn nnnnctgtga tgactctgaa tnactttgtg cagagagcac ggnctnngca 301 ccggctccgt atctttcgag tttctgcctt atcatgctgt ggattgtagg ccatgcgcct 361 ncngtngctg ttncgggtga cggagaatca ggnntcgatt ccggagaggg agcctgagna 421 acggctacca catccaaggn nggcagcagg cacnnnnatt acccaatccc agctctggga 481 ggtagtgacn aaaaataaca atgcgggact cttccgagtc tgcgtaattg gaatgagcac 541 actttaaatc ctttagcaac naccnattgg agggcaagtc tggtgccagc agccgcggtn 601 attcnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 661 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 721 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 781 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnatgtncct tgcatggaac 841 tgatggaaga ctgatctcgg ttccacnttc ttggtggtgg gagccagagg taatgatcna 901 gagggnctgt cnnnnnnntc cgtactacga cgcgagaggt gaaattcagt gaccgtcgta 961 ggacgaacca cagcgaaagc atttgccnag aatgtcttcg ttgatcnaga angaaagtta 1021 gaggatcgaa ggcgatcaga tacnnnnnan gaaagaaccn taaacgatgc tgactngcaa 1081 ttcgcngnng ttnttcccat gacgtgcgag acgcccccgg gaaacctcaa gtctttgagt 1141 tccgggggaa gtatggttgc aaaactgaaa ctcaaaggaa ttgacggnnn nnnnnnnnnn 1201 nnnnnnnnnn nnnnnnnnnn nnnnnnnnna acacgggaaa cctcaccagg cccggacacc 1261 agaagganng acagatnnag agctctttct cgatttggtg ggtnnnnntg catggcngtt 1321 cttagttggt ggagtgannn nnnnnnnnnc gatnacgaac gagannnnnn nnnnnnnnnn 1381 nnnnnnnnnn nnnnnnnnnn nnnnccccng ttcgannnng tcttcttnga gggatgagcn 1441 gcgagtntag ctgcaggaga ttgagcaata acangtctgt gatgccctta gatgtcctgg 1501 gcgcacgcgc gctacactga atgggttagc gggttgtcct tctccgagag gagcgggnna 1561 tcgcgtgaaa accattcgtg atngggattg gggcttgcaa ttgtttcccn atgaangagg 1621 aattcccagt aagcgcaagt catcagcttg cgntgattnn gtccctnccc nttgtacaca 1681 cngnnnntcg ctactaccga ttgaatgatt agtgaggctt cggactggcg gtcctggact 1741 gggtcggcgg gtcncnccca gcnntgggnt tccgccnnct cgcctggacg ggccggaaag 1801 atgtccaaac ttgatnnnnn nnnnnnnnnn naagtcgtaa caaggtnnnn nnnnnnnnnn 1861 nnnnnnnnnn nnnnnnn // LOCUS PBESVBRA 584 bp ds-DNA INV 03-AUG-1990 DEFINITION P.berghei telomeric repeat region subfragment alpha DNA. ACCESSION M34601 KEYWORDS . SOURCE P.berghei DNA, clone pTel.1. ORGANISM Plasmodium berghei Eukaryota; Animalia; Protozoa; Microspora; Microsporea; Microsporida; Haemosporina; Plasmodiidae. REFERENCE 1 (bases 1 to 584) AUTHORS Dore,E., Pace,T., Ponzi,M., Picci,L. and Frontali,C. TITLE Organization of subtelomeric repeats in Plasmodium berghei JOURNAL Mol. Cell. Biol. 10, 2423-2427 (1990) STANDARD simple staff_review FEATURES from to/span description rpt 461 541 27 bp repeats BASE COUNT 205 a 41 c 85 g 251 t 2 others ORIGIN 1 tcgacaanta caacattatc tataaaagat gttttataca tctaacattt ttagtaatac 61 ataaaaaata cactatatat atgtgtataa taaattcata aattataaat atatataatc 121 atcacttttt taatttcaat aatttacatt tatgttaaaa ttataattta tattgatata 181 aatagttctc tatatattaa tttatttact ataaaggtat aataatatat taatcactat 241 taatttataa atttgatagt tttgaggtat aaataaatta tattttaaat agttaaatat 301 aatatataat aaatgtaatg tcatattttc tataatactt ataaacaatt cgtatataaa 361 attagcgtta ttgtactaat atatataata ttgtatcaat gactaaaact gaaatatgtt 421 aatttggttt agggtttatg gttcaggttt aggtttntgg tttagggttc aggtttatgg 481 ttcagggttt agggttcagg tttatggttc agggtttagg gttcaggttt atggttcagg 541 gtttagggtt tgtggtttag ggtttatggt ctatggttgt tcga // LOCUS PBESVBRB 593 bp ds-DNA INV 03-AUG-1990 DEFINITION P.berghei telomeric repeat region subfragment a DNA. ACCESSION M34602 KEYWORDS . SOURCE P.berghei DNA, clone pTel.1. ORGANISM Plasmodium berghei Eukaryota; Animalia; Protozoa; Microspora; Microsporea; Microsporida; Haemosporina; Plasmodiidae. REFERENCE 1 (bases 1 to 593) AUTHORS Dore,E., Pace,T., Ponzi,M., Picci,L. and Frontali,C. TITLE Organization of subtelomeric repeats in Plasmodium berghei JOURNAL Mol. Cell. Biol. 10, 2423-2427 (1990) STANDARD simple staff_review FEATURES from to/span description rpt 484 510 27 bp repeat motif BASE COUNT 209 a 40 c 97 g 247 t ORIGIN 1 tcgacaatac aacattatct ataaaagatg ttttatacat ctaacatttt tagtaataca 61 taaaaaatac actatatata tgtgtataat aaattcataa attataaata tatataatac 121 tcactttttt aatttcaata atttacattt atgttaaaat tataatttat attgatataa 181 atagttctct atatattaat ttatttacta taaaggtata ataatatatt aatcactatt 241 aatttataaa tttgatagtt ttgaggtata aataaattat attttaaata gttaaaatat 301 aaatatataa ataaaatgta atgtcatatt tttctataat acttataaac aattcggtat 361 ataaaattag cgttattgta ctaatatata taatattgta tcaatgacta aaactgaaat 421 atgttaattt gggtttaggg gtttatggtt cagggtttag ggtttgtggt ttagggtttg 481 tggtttaggg ttcaggttta tggttcaggg tttagggttc agggttcagg tttagggttt 541 agggtttagg gttcagggtt cagggttcag ggtttagggt ttagggttta ggg // LOCUS PEURRE 1902 bp ss-rRNA RNA 03-AUG-1990 DEFINITION P.aztecus 18S rRNA. ACCESSION M34362 KEYWORDS 18S ribosomal RNA; ribosomal RNA. SOURCE P.aztecus rRNA. ORGANISM Penaeus aztecus Eukaryota; Animalia; Metazoa; Arthropoda; Crustacea; Malacostraca; Eucarida; Decapoda; Dendrobranchiata; Penaeoidea; Penaeidae. REFERENCE 1 (bases 1 to 1902) AUTHORS Kim,W. and Abele,L.G. TITLE Molecular phylogeny of selected decapod crustaceans based on 18S rRNA nucleotide sequences JOURNAL J. Crust. Biol. 10, 1-13 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by L.G.Abele, 19-MAY-1990. FEATURES from to/span description rRNA 1 1902 18S ribosomal RNA BASE COUNT 340 a 369 c 427 g 332 t 434 others ORIGIN 1 nncctggtng atcctgccag nngtcntnng cttgtctcaa agattaagcc nngcatgtgt 61 aagtacaggc cgacnnaagg cgaaaccgcg gacggcnnnn taaatcagat ataactcatt 121 nnatctctgc tgaacnncnt nnnnnnttgg ataactgtgg taattctaga nnnnnacatg 181 cctttgtann ctccgaccgc gagggaggag ngcttttatt agaccaaaac cctcggcagc 241 nnnntcccgc aagggncnag cagcacacat cttggtgaat cagaataact tttgccgagg 301 cacgacccct ccgtaacnng ggntgggncg gcgccgcgtc ctgcaggcgt ctgccttatc 361 agctctcgat tgtaggttaa acgcctacaa tggctatnnn gggtnacggg gaatnnnnnn 421 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnggcag 481 cangcgcnnn nattacccnc tcggcncggg gagnnagtga cnaaaaatac tgttngnnga 541 ccccgngncc tcgcnattgg aatgagtnca ctttaaatcc ttgtacgagg atcgagtgga 601 gggcaannnn nnnnnnagcn gccgcgnnna ttccagctcc actagcgtat attaaagttg 661 ttgcggttga aacgctcgta gtttgacttc tgctcggacg gcggncttnn cngctactgc 721 cgnnttccga gctgtgtccc cngccggcgc acatggggnt nnnntgcctt aannncgggn 781 gtcccctnnn nnnnnnnccg ttactttgaa aaaattagag ngcnnagagc aggcnngnnn 841 nnnnnnncag cccgaatggt cgtgcatgga atgatggaac aggacctcgg ntctattttg 901 tcggtttttc ggaacccgag gnnatgattn atagaagcag acgggggnnt tcgtactgcg 961 acgctagagg tgaaattctt agaccgtcgc atgacgacct nctgcgaaag catctgccna 1021 ggatgttttc attgatcaag aangaaagtt agaggttcga aggcgatcag atacngcncn 1081 ngttctaacc ttaaacgatg ctgactagcg atccgccgca gttattnnca tgacccggcg 1141 nnnagcttcc gggaaaccaa agtctttggg ttccggggga agtatggttg caaagctgaa 1201 actcaaagga attgacggnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 1261 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 1321 nnnnnnnnnn nnnnngtgca tgggtcgngt tcttagttgg tggagtgatc nnnnnnnnnn 1381 nnnnncgatn acgaacgaga nnnnnnnnnn nnnnnnnnnn nnnnnggcgc cggnaacngg 1441 cgntcntcgc ngtcttcttc ttagagggat aagcggcagc naaaaatata ctagccgcac 1501 gagagtttga gccataacan gtctgtgatg cccttagatg ttctgggcgc acgcgcgcta 1561 caatggagag ttcagcgagc tngncccnct ccgagaggag cgggnncctg cgtgaaagct 1621 gtccttaaag gggattgggg cttgcaaatg ttcccnatga nnnnggaatt cccagtagcg 1681 caattcncca gattgcgcgg atttagtccc tacccnttgt acacaccgcc nntcgctact 1741 accgattgaa tggtctagtg agggnnccgg actngcgccc ntggagccct accctcngcg 1801 ncngcgccct cgggtcgacg gaaaggtgtc caagctgggt nnnnnnnnnn nnnnnnaagt 1861 cgtaacaagg tnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nn // LOCUS POCRRE 1874 bp ss-rRNA RNA 03-AUG-1990 DEFINITION P.ascensionis 18S rRNA. ACCESSION M34358 KEYWORDS 18S ribosomal RNA; ribosomal RNA. SOURCE P.ascensionis rRNA. ORGANISM Procaris ascensionis Eukaryota; Animalia; Metazoa; Arthropoda; Crustacea; Malacostraca; Eucarida; Decapoda; Pleocyemata; Caridea; Procaridoidea; Procarididae. REFERENCE 1 (bases 1 to 1874) AUTHORS Kim,W. and Abele,L.G. TITLE Molecular phylogeny of selected decapod crustaceans based on 18S rRNA nucleotide sequences JOURNAL J. Crust. Biol. 10, 1-13 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by L.G.Abele, 19-MAY-1990. FEATURES from to/span description rRNA 1 1874 18S ribosomal RNA BASE COUNT 298 a 291 c 331 g 266 t 688 others ORIGIN 1 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nngcatgtct 61 aagcacaggc cgaactaagg ctaagccgcg aatggcnnnn taaatcagtt atggttcatt 121 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnac 181 ccacgctccg accgcgaggg aggagngctt ttattagttg aaaaccaacc gggccncggt 241 ccgcnaaaga canctgtggt gaagctgaat aactttgtgc cgagcgcacn gncnnnncac 301 cggcgccgat tccttcgagt gtctcgctta tcaggcngtc gattgtaggt tatgtgccnn 361 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 421 nnnnnnnnnn nnnnnnnnnn nnnnnnnngg cagcaggcan nnnnattacc cactcccggc 481 ttggggaggt agtgacnaaa aataacgatg cgggactcat ccgaggccnc gcaattggaa 541 tgagtacact ttaantcctt taacgaggac ccannnnnnn nnnnnnnnnn nnnnnnnnnn 601 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 661 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 721 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnagtt 781 taccttgaac aaatcagagt gctcagagca ggctaattna atggcccgct atgtttcctg 841 catggaatga tggaagatga cctcggttcc attttgtttg ttttcggaac ccgaggnnat 901 gatgaataga gacggacggg ggcatccgnn ctgcgacgtg agaggtgaaa ttcttggaat 961 gtcgnnagac gaacgacagc gaaagcattt gccaagtatg tcttcgttaa tcaagaanga 1021 aagttagagg ttcgaaggcg atcagatacc gcccnngttc taaccataaa cgatgctgac 1081 cagcgatccg ccggcgttat tcccatgacg cggcggnnag ctactccggg aaaccaaagt 1141 cnntgagttc cgggggtann nnnnnnnnnn nnnnnaaact caaaggaatt gacggnnnnn 1201 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 1261 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnntgca 1321 tnnnngttct tagttggtgg agcgatttgt ctggttaatt ccgataacga angagactnt 1381 ggcctactaa ctagtcgacg ggtctccagc nnttggtgcc cagttcgcaa catcttctta 1441 gagggataag cggcaattct agccgcacga gattgagcaa taacaagtct gtgatgccct 1501 tagatgtcct gggcncacgc gcgctacact gaagggggca gcgggnntcc nctccgagag 1561 gagcgggnaa ccncttgaaa acctntcatg atagggactg gggcntgtaa ttgnttccca 1621 tgaacgagga anncccagta agcgcaagtg nnnnnnntgc gctgattnng tcccnnccnn 1681 ttgtacacac cnnnnntcgc tactaccgat tgaatgattt agtgaggctt cggactggcg 1741 ctcctngaac gaccccatcc ganngggnnc ccnggnnctc ctcgagtcga cgganngatg 1801 tccaaacttg annnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 1861 nnnnnnnnnn nnnn // LOCUS PRARRE 1869 bp ss-rRNA RNA 03-AUG-1990 DEFINITION P.leonensis 18S rRNA. ACCESSION M34363 KEYWORDS 18S ribosomal RNA; ribosomal RNA. SOURCE P.leonensis rRNA. ORGANISM Procambarus leonensis Eukaryota; Animalia; Metazoa; Arthropoda; Crustacea; Malacostraca; Eucarida; Decapoda; Pleocyemata; Astacidea; Astacoidea; Cambaridae. REFERENCE 1 (bases 1 to 1869) AUTHORS Kim,W. and Abele,L.G. TITLE Molecular phylogeny of selected decapod crustaceans based on 18S rRNA nucleotide sequences JOURNAL J. Crust. Biol. 10, 1-13 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by L.G.Abele, 19-MAY-1990. FEATURES from to/span description rRNA 1 1869 18S ribosomal RNA BASE COUNT 409 a 412 c 459 g 419 t 170 others ORIGIN 1 nncctggttg atcctgccag nagtcatnng cttgtctcaa anattaagcc nngcatgtgt 61 aagtacaagc cgagttaagg cgaaaccgcg aatggcncnn taaatcagct atgtttcatt 121 ggatctgtaa acnnncnnnn acttggataa ctgtggtaat tctagagctn atacatgcat 181 cacgtctctg accgcaaggg aagagcgctt ttattagttc aaaactggtc gggcctcggt 241 ccgttnaccc acccgtggtg aatctgaata actttttgct gagcgcacgg nctccgcacc 301 ggcgccgcat ccttcaagtg tctgccttat cagctttcga ttgtaggtta tgcgcctaca 361 atggctataa cgggtaacgg ggaatcaggn ttcnattccg gagagggagc ctgagaaacg 421 gctaccacat ctaaggcagg cagcaggcac gcnnattacc cactcccggc acggggaggt 481 agtgacnaaa aataacgatg cgagactcat ccgaggcctc gcaatcggaa tgagtacact 541 ttaaancctt taacgaggat ctattggagg gcnagtctgg tgccagcagc cgcggtaatt 601 ccagctccaa tanngtatat taaagttgtt gcggttnnaa agctcgtagt tggatctcag 661 ttccggactg acggtacacg cnnggtgctt actgtcacgc tccgaacagc taactagccc 721 cgccggccag tggggtgctc ttcatcgagt gtcccgagtg gccggnncgt ttactttgnn 781 nnnattagag tgctcagagc nggcnncnnn natggcctga atgtctatgc actggaataa 841 tggaatagga cctcggttct attttgttgg ttttcggaac ctgaggtaat gactaatagg 901 aacaggcggg ggcattcgta ttgcgacgct agaggtgaaa ttcttggacc gtcgcnagac 961 gaactactgc gaaagcattt gccaaggatg ttttcattaa tcaagaanga aagttagagg 1021 ttcgaaggcg atcagatacc gcncnngttn naaccataaa cgatgccaac tagcgatccg 1081 ccggcgttat tcccatgacc cggcngncag cttccgggaa accaaagtct ttgggttccg 1141 ggggaagtat ggttgcaaag ctgaaactca aaggaattga cggnnnnnnn nnnnnnnnnn 1201 nnnnnnnnnn nnnnnnnnnn nnnnnaacac ggggaacctc accaggccca gacaccggaa 1261 ggatngacag attgagagct ctttctcgat tcggtgggtg gtngtgcatg gccgttctta 1321 gttggtggag cgatttgtct ggttaattcc gatnnnnnnn gagactctgg cctattaact 1381 agtcgacgga tctccagcnn ttggtgtcca gttcgcaact tcttcttaga gggattacgg 1441 caattctagc cgcacgagat tgagcaataa caggtctgtg atgcccttag atgttctggg 1501 cgcacgcgcg ctacactgaa gagatcaacg tgttctcccc ctccgagagg agcgggnaac 1561 ccgttcaatc cccttcatga tagggattgg ggcttgcaat tgtttcccat gaacgaggaa 1621 ttcccagtaa gtgcaagtca tcacgttgcg ctgattnngt ccctgcccnt tgtacacacn 1681 nnnnntcgct actaccgatt gaatgattta gtgaggcttc ggactggcgc tcttggatgt 1741 tctacccctc gcgtctcggc gcaaggnnnt ctcgcctcga gctgacggaa agatgtccaa 1801 acttgatnnn nnnnnnnnnn nnnaagtcgt aacaaggtnn nnnnnnnnnn nnnnnnnnnn 1861 nnnnnnnnn // LOCUS PVIC1RPTA 711 bp ds-DNA INV 03-AUG-1990 DEFINITION P.vivax circumsporozoite protein gene, partial cds. ACCESSION M28745 M25758 KEYWORDS circumsporozoite protein. SOURCE P.vivax sporozoite (isolate VK247) sporozoite DNA. ORGANISM Plasmodium vivax Eukaryota; Animalia; Protozoa; Microspora; Microsporea; Microsporida; Haemosporina; Plasmodiidae. REFERENCE 1 (bases 1 to 711) AUTHORS Rosenberg,R., Wirtz,R.A., Lanar,D.E., Sattabongkot,J., Hall,T., Waters,A.P. and Prasittisuk,C. TITLE Circumsporozoite protein heterogeneity in the human malaria parasite Plasmodium vivax JOURNAL Science 245, 973-976 (1989) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by D.E.Lanar, 25-JUN-1989. FEATURES from to/span description pept < 1 > 711 circumsporozoite protein (AA at 1) site 1 15 region 1 site 697 711 region 2 site 502 597 post repeat variable region site 598 696 post repeat constant region BASE COUNT 261 a 151 c 226 g 73 t ORIGIN 1 aagctgaaac aaccagaaga tggggcaggc aatcaaccag gagcaaatgg agcaggcaat 61 caaccaggag caaatggggc aggcaatcaa ccaggagcaa atggggcagg caatcaacca 121 ggagcaaatg gggctggcaa tcaaccagga gcaaatgggg ctggcaatca accaggagca 181 aatggggctg gcaatcaacc aggagcaaat ggggctggca atcaaccagg agcaaatgga 241 gcaggcaatc aaccaggagc aaatggggca ggcaatcaac caggagcaaa tggggctggc 301 aatcaaccag gagcaaatgg agcaggcaat caaccaggag caaatggggc tggcaatcaa 361 ccaggagcaa atggagcagg caatcaacca ggagcaaatg gggcgggcaa tcaaccagga 421 gcaaatgggg ccggcaatca accaggagca aatggggcag gcaatcaacc aggagcaaat 481 ggggctggca atcaaccagg agcaaatggg gcaggtaatc aaccaggagc aaatggtgca 541 ggtggacagg cagcaggagg aaatgctgca aacaaaaagg caggagacgc aggagcagga 601 cagggacaaa ataatgaagg tgcgaatgcc ccaaatgaaa agtctgtgaa agaataccta 661 gataaagtta gagctaccgt tggcaccgaa tggactccat gcagtgtaac c // LOCUS PVIC1RPTB 657 bp ds-DNA INV 03-AUG-1990 DEFINITION P.vivax circumsporozoite protein gene, partial cds. ACCESSION M28746 M25759 KEYWORDS circumsporozoite protein. SOURCE P.vivax sporozoite (isolate VK210) DNA. ORGANISM Plasmodium vivax Eukaryota; Animalia; Protozoa; Microspora; Microsporea; Microsporida; Haemosporina; Plasmodiidae. REFERENCE 1 (bases 1 to 657) AUTHORS Rosenberg,R., Wirtz,R.A., Lanar,D.E., Sattabongkot,J., Hall,T., Waters,A.P. and Prasittisuk,C. TITLE Circumsporozoite protein heterogeneity in the human malaria parasite Plasmodium vivax JOURNAL Science 245, 973-976 (1989) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by D.E.Lanar, 25-JUN-1989. FEATURES from to/span description pept < 1 > 657 circumsporozoite protein (AA at 1) site 1 15 region 1 site 643 657 region 2 rpt 16 477 27 bp repeats site 478 544 3' post repeat variable region site 545 642 3' post repeat constant region BASE COUNT 229 a 135 c 224 g 69 t ORIGIN 1 aagctgaaac aaccagcagg tgatagagca gatggacagc cagcaggtga tagagcagat 61 ggacagccag caggtgatag agcagatgga caaccagcag gtgatagagc agctggacaa 121 ccagcaggtg atagagcaga tggacagcca gcaggcgata gagcagctgg acaaccagca 181 ggtgatagag cagatggaca gccagcagga gatagagcag ctggacagcc agcaggcgat 241 agagcagatg gacagccagc aggtgataga gcagctggac aaccagcagg tgatagagca 301 gctggacaac cagcaggtga tagagcagat ggacagccag caggcgatag agcagctgga 361 caaccagcag gtgatagagc agatggacaa ccagcaggag atagagcagc tggacagcca 421 gcaggagata gagcagctgg acagccagca ggagatagag cagctggaca gccagcagga 481 aatggtgcag gtggacaggc cgcaggagga aacgcaggag gaaacgcagg aggaaacgca 541 ggaggacagg gacaaaataa tgaaggtgcg aatgccccaa atgaaaagtc tgtgaaagaa 601 tacctagata aagttagagc taccgttggc accgaatgga ctccatgcag tgtaacc // LOCUS SHV2A 554 bp ss-RNA VRL 03-AUG-1990 DEFINITION Simian hepatitis A virus segment 2A-encoded protein mRNA, partial cds. ACCESSION M34085 KEYWORDS . SOURCE Simian hepatitis A virus (strain PA21), cDNA to viral RNA. ORGANISM Simian hepatitis A virus Viridae; ss-RNA nonenveloped viruses; Isometric ss-RNA viruses; Picornaviridae. REFERENCE 1 (bases 1 to 554) AUTHORS Brown,E.A., Jansen,R.W. and Lemon,S.M. TITLE Characterization of a Simian hepatitis A virus (HAV): Antigenic and genetic comparison with human HAV JOURNAL Unpublished (1989) STANDARD simple staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by E.A.Brown, 04-MAY-1990. Author address: E.A.Brown 547 Burnett-Womack CB# 7030 Chapel Hill, NC 27599 FEATURES from to/span description pept < 1 > 554 segment 2A-encoded protein (AA at 1) BASE COUNT 184 a 71 c 138 g 161 t ORIGIN Segment 2A; map position 3108-3662. 1 agtcatattg aaaagtggaa accttataaa gagttaagat tggaggtagg taagcaaagg 61 ctaaagtatg ctcaggaaga gttgtcaaat gaagtgttgc ctcctcctcg taaaattaag 121 ggtgtgtttt cacaagcaaa aatctcattg ttttacacag aagatcatga aattatgaaa 181 ttttcctgga aaggaattac tgctgacact agagctttga ggagatttgg cttttcattg 241 gctgctggta ggagtgtgtg gacattggaa atggatgctg gagttttgac tggcaggctg 301 gtgagggtca atgatgaaaa atggacagaa atgaaagatg acaaaatagt ttctttggtg 361 gagaaattta ctagtaataa acactggtcc aaagttaatt ttcctcatgg aatgctagat 421 ttggaagaaa ttgctgcaaa tgcaaaagaa tttccaaata tgtcagaaac tgatttgtgt 481 ttcttgttgc attggctgaa ccccaaaaag ataaacttgg cagatagaat gttgggtctg 541 tcaggaatac agga // LOCUS SHVVP1CP 2373 bp ss-RNA VRL 03-AUG-1990 DEFINITION Simian hepatitis A virus capsid protein VP1 mRNA, partial cds. ACCESSION M34084 KEYWORDS capsid protein VP1. SOURCE Simian hepatitis A virus (strain PA21), cDNA to viral RNA, passed in cwll line BS-C-1. ORGANISM Simian hepatitis A virus Viridae; ss-RNA nonenveloped viruses; Isometric ss-RNA viruses; Picornaviridae. REFERENCE 1 (bases 1 to 2373) AUTHORS Brown,E.A., Jansen,R.W. and Lemon,S.M. TITLE Characterization of a Simian hepatitis A virus (HAV): Antigenic and genetic comparison with human HAV JOURNAL J. Virol. 63, 4932-4937 (1989) STANDARD simple staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by E.A.Brown, 04-MAY-1990. FEATURES from to/span description pept < 1 > 2373 capsid protein VP1 (AA at 1) BASE COUNT 661 a 447 c 491 g 774 t ORIGIN Segment P1; map position 735-3107. 1 atgaatatgt ccaggcaagg tattttccag actgttggga gtggccttga ccacattctg 61 tctttggcag atgtggagga ggaacaaatg attcagtctg tggatcgtac cgcagttact 121 ggggcttcat atttcacttc tgtggatcaa tcttctgttc atacagctga agttggctca 181 caccaacctg aacctttgaa aacctctgtt gacaaaccag gctctaagag gacacaagga 241 gagaaatttt tccttgttca ttctgctgac tggttgacga cacatgcttt gtttcatgaa 301 gttgcaaaat tggatgtggt caaactgttg tacaatgagc aatttgctgt tcagggtctg 361 ttgaggtatc acacttatgc aagatttgga attgagatac aagttcagat caatcctaca 421 ccattccagc aaggtggttt gatatgtgcc atggtgccag gagatcagag ctatggatct 481 atagcttctt tgacagttta tcctcatggt ttgttgaatt gtaatatcaa caatgtggtc 541 agaattaagg ttccttttat ttatacaaga ggagcttatc actttaagga ccctcaatat 601 cccgtttggg agttgactat tagagtttgg tctgagctaa acattggaac tggtacctct 661 gcttacacat cactgaatgt gctggctaga tttactgatt tggaactcca tgggctaaca 721 cccctgtcta cacagatgat gagaaatgaa tttagagtca gtacaacaga aaatgtagtt 781 aatttgtcca attatgaaga tgctagagca aaaatgtctt ttgctcttga tcaggaagat 841 tggaaatctg atgcctctca agggggagga attaaaatta cacattttac aacctggaca 901 tcaattccta ctttggctgc tcagtttcca ttcaatgcct ctgattcagt tgggcaacag 961 atcaaggtta ttccagttga tccatatttc ttccaaatga ctaacacaaa tcctgaacaa 1021 aaatgtataa ctgcattggc ttcaatatgt caaatgttct gtttttggag aggagacttg 1081 gtttttgact tccaggtttt tcctacaaaa tatcactcag ggagattatt attttgtttt 1141 gttcctggaa atgaactgat tgatgtttcc cacataacat tgaaacaagc cactactgcc 1201 ccttgtgctg tgatggatat tactggagta cagtcaactt taagatttcg tgttccttgg 1261 atttcagata ctccttatag agttaataga tataccaaat cgtcacatca gaaaggagag 1321 tatactgcca taggaaagtt gattgtttat tgttacaaca gactgacttc tccctccaat 1381 gtggcttctc atgttagagt taatgtttat ctctcagcta ttaatttgga atgttttgct 1441 ccactctatc atgctatgga tgtcacaact caggttgggg atgattctgg aggcttctct 1501 accactgttt caacaaaaca gaatgttcca gaccctcaag ttggcattac aacagtgaag 1561 gatcttaaag gtagagcaaa ccaagggaaa atggatgttt cgggtatcca agctcctgta 1621 ggagctatca ctaccattga ggatccagtt ttggcaaaga aagtgcctga gaccttccca 1681 gaattgaagc ctggagagtc aagacatact tctgatcata tgtctattta caaatttatg 1741 ggcagatctc atttcttatg tacatttaca tttaattcta ataacaaaga gtacactttt 1801 cctatcactt tgtcatcaac ttctaatcct cctcatggat tgccttcaac tctgagatgg 1861 ttttttaacc tttttcagct ttataggggt cccttggatt tgacaataat tataactggg 1921 gctactgatg ttgatggaat ggcttggttt actcccgttg ggttagcagt agatacccca 1981 tgggttgaga aggagtctgc tctttctatt gattacaaga cagctcttgg tgctgttagg 2041 tttaatacta gaagaacagg aaacattcag attaggttgc cctggtactc ctatctttat 2101 gctgtctcag gggcactgga tgggcttgga gacaaaacag attcaacttt tggacttgtc 2161 tccattcaaa ttgcaaatta caatcactca gatgaatatt tgtcttttag ttgttacttg 2221 tctgtgactg aacagtctga gttttatttt cctagagcac ctttgaatac caatgctatg 2281 atgtcatcag aaacaatgat ggatagaatt gctcttggtg atcttgaatc ctcagttgat 2341 gatcctcgaa ctgaagagga tcgtaaattt gaa // LOCUS STNRRE 1885 bp ss-rRNA RNA 03-AUG-1990 DEFINITION S.hispidus 18S rRNA. ACCESSION M34361 KEYWORDS 18S ribosomal RNA; ribosomal RNA. SOURCE S.hispidus rRNA. ORGANISM Stenopus hispidus Eukaryota; Animalia; Metazoa; Arthropoda; Crustacea; Malacostraca; Eucarida; Decapoda; Pleocyemata; Stenopodidea; Stenopodidae. REFERENCE 1 (bases 1 to 1885) AUTHORS Kim,W. and Abele,L.G. TITLE Molecular phylogeny of selected decapod crustaceans based on 18S rRNA nucleotide sequences JOURNAL J. Crust. Biol. 10, 1-13 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by L.G.Abele, 19-MAY-1990. FEATURES from to/span description rRNA 1 1885 18S ribosomal RNA BASE COUNT 327 a 336 c 430 g 368 t 424 others ORIGIN 1 nncctggtng atcctgccag nngtcntnng cttgtctcaa annttnagcc nngcatgtgt 61 gagtacaagc ccaaggaagg tgaaaccgcg aatggcnnnn taaatcagct atggtttact 121 ggacctgtac tncnntnnnn nnnnnnnnnn nnnnggtaat tctagagctn anncnngccn 181 cgagcncnga cgcgggagcg ggaagagcgc nnnannagta cnaaaaccng ngtctgtgta 241 tcggcttagg tcgttgcata gncnnnnnnn tgtggtgact ctgaataact tttggctgag 301 cgcatggtct ccgcacctgg cgccgcatct ttcaagtgtc tgccttatca gctgtcgatt 361 gtaggttatg cgcctnnnat ggcgatnnng ggtnacgggg aatcngggtt nnnttccgga 421 ganngngcct gagnnncggc tnccnnntnt nnnnnnnnnn nnnnggcngn aggcnnnnnn 481 attacccntt ccggcncggg gaggtagtga cnaaaaataa cgatgcgaga ctcatccgag 541 gcctcgcnat cggaatgaga acactttaaa tcctttntcg aggatcgatt ggagggcaag 601 tctngtgcca gcagccncgg tnattccagc tccaatagng tatattaaag ttgctgcggn 661 tnnaaagctc gtagttnnat ctcagttcgg acggccgncn tccnnngtgc nttttgcggc 721 ttgatccgaa cactnctgtt gtgggcgcgc agggggtgct cttgatcgag tgtgcnnnnn 781 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnccctg 841 aatgactttg catggaataa tggaatagga cctcggttct attttgctgg ttttgtctgg 901 aacccgaggt aatgactaat agaaacnggc gggggnnttc gtactgcgac gctagaggtg 961 aaattcttgg accgtcgcna gacgaactna tgcgaaagca tctgccnagg atgttttcnt 1021 tnatcnagaa ngaaagttag aggttcgaag gcgatcagat acnnnnnnng ttctaaccgt 1081 aaacgatgct naccagcnat ccgcccgcgt tnttcccatg accgggcnnn nngcttcggg 1141 gaaaccaaag tctttgagtt ccgggggaag tatggttgca aannngaaac tcaaaggaat 1201 tgacggnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 1261 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 1321 nntgcatggt nnnnnnnnnn nnnnggtgga gcgattgctg gttaattccg atnacgaacg 1381 agactcggac ctactaacta gtcgacggat cttcgtccga cggtgtccag ttcgtnaagt 1441 cttcttagag ggataacggc aagtgtagcc gcaggagatc gagcaataac angtctgtga 1501 tgcccttaga tgttctgggc gcacgcgcgc tacactgaag tgttcaacgt gttgtcccng 1561 tccgagagga tcgggnnncc cgctgaaagc ntttcttgat ngggatgggg gcttgcaatt 1621 gttcccnntg aannnggaat tcccagtaag cgcaagtcaa tagcttgcgn tgatnnngtc 1681 cctncnnntt gtncncnccn nnnntcgcta ctaccgattg aatgatttag tgaggcttcg 1741 gactggcgcc ctgggtctga tgcangttgg ccttagtgcc ttgtgtatcg cctagggncg 1801 acggaaagat gtccaaactt gatnnnnnnn nnnnnnnnna agtcgtaaca aggtnnnnnn 1861 nnnnnnnnnn nnnnnnnnnn nnnnn // LOCUS SV4CG 5243 bp ds-DNA circular VRL 03-AUG-1990 DEFINITION Simian Virus 40 complete genome. ACCESSION J02400 J02402 J02406 J02407 J02408 J02409 J02410 V01380 J02403 J04139 M24914 KEYWORDS T-antigen; agnoprotein; alternate splicing; antigen; auxiliary region; circular; complete genome; core region; early leader protein; enhancer; large T-antigen; major structural protein VP1; minor structural protein VP2; minor structural protein VP3; origin of replication; palindrome; small T-antigen; tandem repeat; topoisomerase. SOURCE Simian Virus 40 (SV40) DNA, cDNA, mRNA and RNA from wild type strains 777 and 776. (When the material is not simply DNA, such is indicated on the reference line.). ORGANISM Rhesus macaque polyomavirus Viridae; ds-DNA nonenveloped viruses; Papovaviridae; Polyomaviruses. REFERENCE 1 (bases 2463 to 2640; mRNA) AUTHORS Dhar,R., Zain,S., Weissman,S.M., Pan,J. and Subramanian,K. TITLE Nucleotide sequences of RNA transcribed in infected cells and by Escherichia coli RNA polymerase from a segment of Simian Virus 40 DNA JOURNAL Proc. Natl. Acad. Sci. U.S.A. 71, 371-375 (1974) STANDARD full staff_review REFERENCE 2 (bases 2623 to 2699; mRNA) AUTHORS Dhar,R., Weissman,S.M., Zain,B.S., Pan,J. and Lewis,A.M.Jr. TITLE The nucleotide sequence preceding an RNA polymerase initiation site on SV40 DNA. part 2. the sequence of the early strand transcript JOURNAL Nucleic Acids Res. 1, 595-613 (1974) STANDARD full staff_review REFERENCE 3 (bases 2623 to 2699; mRNA) AUTHORS Dhar,R., Subramanian,K., Zain,B.S., Pan,J. and Weissman,S.M. TITLE Nucleotide sequence about the 3' terminus of SV40 DNA transcripts and the region where DNA synthesis is initiated JOURNAL Cold Spring Harb. Symp. Quant. Biol. 39, 153-160 (1975) STANDARD full staff_review REFERENCE 4 (sites; cds start for VP1) AUTHORS van de Voorde,A., Contreras,R., Rogiers,R. and Fiers,W. TITLE The initiation region of the SV40 VP1 gene JOURNAL Cell 9, 117-120 (1976) STANDARD full staff_review REFERENCE 5 (bases 470 to 524) AUTHORS Ysebaert,M., Thys,F., van de Voorde,A. and Fiers,W. TITLE Nucleotide sequence of the restriction fragments Hind L and Hind M of SV40 DNA JOURNAL Nucleic Acids Res. 3, 3409-3421 (1976) STANDARD full staff_review REFERENCE 6 (bases 32 to 236; direct tandem repeats) AUTHORS Subramanian,K.N., Reddy,V.B. and Weissman,S.M. TITLE Occurrence of reiterated sequences in an untranslated region of Simian Virus 40 DNA determined by nucleotide sequence analysis JOURNAL Cell 10, 497-507 (1977) STANDARD full staff_review REFERENCE 7 (bases 4592 to 4661; mRNA and DNA) AUTHORS Thimmappaya,B. and Weissman,S.M. TITLE The early region of SV40 DNA may have more than one gene JOURNAL Cell 11, 837-843 (1977) STANDARD full staff_review REFERENCE 8 (bases 1046 to 1620; overlap between VP1, VP2 and VP3 genes) AUTHORS Contreras,R., Rogiers,R., van de Voorde,A. and Fiers,W. TITLE Overlapping of the VP2-VP3 gene and the VP1 gene in the SV40 genome JOURNAL Cell 12, 529-538 (1977) STANDARD full staff_review REFERENCE 9 (bases 1 to 164; 5092 to 5243; origin of replication) AUTHORS Subramanian,K.N., Dhar,R. and Weissman,S.M. TITLE Nucleotide sequence of a fragment of SV40 DNA that contains the origin of DNA replication and specifies the 5' ends of "early" and "late" viral RNA: III. construction of the total sequence of EcorII-G fragment of SV40 DNA JOURNAL J. Biol. Chem. 252, 355-367 (1977) STANDARD full staff_review REFERENCE 10 (bases 1 to 585; 4857 to 5243) AUTHORS Dhar,R., Subramanian,K.N., Pan,J. and Weissman,S.M. TITLE Structure of a large segment of the genome of Simian Virus 40 that does not encode known proteins JOURNAL Proc. Natl. Acad. Sci. U.S.A. 74, 827-831 (1977) STANDARD full staff_review REFERENCE 11 (bases 3733 to 4007) AUTHORS Volckaert,G., Contreras,R., Soeda,E., van de Voorde,A. and Fiers,W. TITLE Nucleotide sequence of Simian Virus 40 Hind H restriction fragment JOURNAL J. Mol. Biol. 110, 467-510 (1977) STANDARD full staff_review REFERENCE 12 (bases 1427 to 2593; VP1 gene) AUTHORS Pan,J., Reddy,V.B., Thimmappaya,B. and Weissman,S.M. TITLE Nucleotide sequence of the gene for the major structural protein of SV40 virus JOURNAL Nucleic Acids Res. 4, 2539-2548 (1977) STANDARD full staff_review REFERENCE 13 (bases 323 to 565; mRNA) AUTHORS Celma,M.L., Dhar,R., Pan,J. and Weissman,S.M. TITLE Comparison of the nucleotide sequence of the messenger RNA for the major structural protein of SV40 with the DNA sequence encoding the amino acids of the protein JOURNAL Nucleic Acids Res. 4, 2549-2559 (1977) STANDARD full staff_review REFERENCE 14 (bases 347 to 472) AUTHORS van Heuverswyn,H., van de Voorde,A. and Fiers,W. TITLE Nucleotide sequence of the SV40 DNA restriction fragment Hind C-Hap 2 JOURNAL Nucleic Acids Res. 4, 1015-1024 (1977) STANDARD full staff_review REFERENCE 15 (bases 1709 to 1786; RNA and DNA) AUTHORS Contreras,R., Volckaert,G., Thys,F., van de Voorde,A. and Fiers,W. TITLE Nucleotide sequence of the restriction fragment Hind F-Eco RI2 of SV40 DNA JOURNAL Nucleic Acids Res. 4, 1001-1014 (1977) STANDARD full staff_review REFERENCE 16 (bases 1 to 163; 181 to 5243) AUTHORS Reddy,V.B., Thimmappaya,B., Dhar,R., Subramanian,K.N., Zain,S., Pan,J., Ghosh,P.K., Celma,M.L. and Weissman,S.M. TITLE The genome of Simian Virus 40 JOURNAL Science 200, 494-502 (1978) STANDARD full staff_review REFERENCE 17 (bases 1 to 163; 181 to 5243) AUTHORS Fiers,W., Contreras,R., Haegeman,G., Rogiers,R., van de Voorde,A., van Heuverswyn,H., van Herreweghe,J., Volckaert,G. and Ysebaert,M. TITLE Complete nucleotide sequence of SV40 DNA JOURNAL Nature 273, 113-120 (1978) STANDARD full staff_review REFERENCE 18 (bases 2633 to 2933; RNA and DNA) AUTHORS Thimmappaya,B., Zain,B.S., Dhar,R. and Weissman,S.M. TITLE Nucleotide sequence of DNA template for the 3' ends of SV40 mRNA: II. the sequence of the DNA fragment EcorII-F and a part of EcorII-H JOURNAL J. Biol. Chem. 253, 1613-1618 (1978) STANDARD full staff_review REFERENCE 19 (bases 325 to 526; 1463 to 1540; sites-late 16S mRNA and RNA splicing) AUTHORS Ghosh,P.K., Reddy,V.B., Swinscoe,J., Choudary,P.V., Lebowitz,P. and Weissman,S.M. TITLE The 5'-terminal leader sequence of late 16 S mRNA from cells infected with Simian Virus 40 JOURNAL J. Biol. Chem. 253, 3643-3647 (1978) STANDARD full staff_review REFERENCE 20 (bases 322 to 536; 1450 to 1541; sites-late 16S mRNA and RNA splicing) AUTHORS Haegeman,G. and Fiers,W. TITLE Evidence for 'splicing' of SV40 16S mRNA JOURNAL Nature 273, 70-73 (1978) STANDARD full staff_review REFERENCE 21 (bases 4568 to 5175; small t antigen gene) AUTHORS Volckaert,G., van de Voorde,A. and Fiers,W. TITLE Nucleotide sequence of the Simian Virus 40 small-t gene JOURNAL Proc. Natl. Acad. Sci. U.S.A. 75, 2160-2164 (1978) STANDARD full staff_review REFERENCE 22 (bases 5109 to 5243; 1 to 63; RNA) AUTHORS Woodworth-Gutai,M. and Nathans,D. TITLE Evolutionary variants of Simian Virus 40: nucleotide sequence of a conserved SV40 DNA segment containing the origin of viral DNA replication as an inverted repetition JOURNAL J. Mol. Biol. 126, 259-274 (1978) STANDARD full staff_review REFERENCE 23 (bases 1 to 74; 5171 to 5243; T antigen binding sites) AUTHORS Tjian,R. TITLE The binding site on SV40 DNA for a T antigen-related protein JOURNAL Cell 13, 165-179 (1978) STANDARD full staff_review REFERENCE 24 (sites; 5' and 3' termini of 19s late mRNAs) AUTHORS Ghosh,P.K., Reddy,V.B., Swinscoe,J., Lebowitz,P. and Weissman,S.M. TITLE Heterogeneity and 5'-terminal structures of the late RNAs of Simian Virus 40 JOURNAL J. Mol. Biol. 126, 813-846 (1978) STANDARD full staff_review REFERENCE 25 (bases 562 to 1620; VP2 and VP3 genes) AUTHORS Reddy,V.B., Dhar,R. and Weissman,S.M. TITLE Nucleotide sequence of the genes for the Simian Virus 40 proteins VP2 and VP3 JOURNAL J. Biol. Chem. 253, 621-630 (1978) STANDARD full staff_review REFERENCE 26 (bases 358 to 914; sites-5' termini of late mRNAs) AUTHORS Dhar,R., Reddy,V.B. and Weissman,S.M. TITLE Nucleotide sequence of the DNA encoding the 5'-terminal sequences of Simian Virus 40 late mRNA JOURNAL J. Biol. Chem. 253, 612-620 (1978) STANDARD full staff_review REFERENCE 27 (bases 2300 to 2668; sites-mRNA 3' termini) AUTHORS Zain,B.S., Thimmappaya,B., Dhar,R. and Weissman,S.M. TITLE Nucleotide sequences of DNA encoding the 3' ends of SV40 mRNA I. the sequence of the DNA fragment HindII,III-G JOURNAL J. Biol. Chem. 253, 1606-1612 (1978) STANDARD full staff_review REFERENCE 28 (bases 2669 to 2901) AUTHORS van Heuverswyn,H., van de Voorde,A. and Fiers,W. TITLE Nucleotide sequence of the Simian-Virus-40 DNA region coding for the carboxyl-terminal part of the T antigen JOURNAL Eur. J. Biochem. 86, 335-344 (1978) STANDARD full staff_review REFERENCE 29 (bases 264 to 526; 1463 to 1516; 2582 to 2602; 2663 to 2676; sites-cDNA, 16S leader segments) AUTHORS Reddy,V.B., Ghosh,P.K., Lebowitz,P. and Weissman,S.M. TITLE Gaps and duplicated sequences in the leaders of SV40 16S RNA JOURNAL Nucleic Acids Res. 5, 4195-4213 (1978) STANDARD full staff_review REFERENCE 30 (sites; cap site for 16s and 19s late mRNAs) AUTHORS Haegeman,G. and Fiers,W. TITLE Localization of the 5' terminus of late SV40 mRNA JOURNAL Nucleic Acids Res. 5, 2359-2371 (1978) STANDARD full staff_review REFERENCE 31 (sites; mutations in region of origin of replication) AUTHORS Subramanian,K.N. and Shenk,T. TITLE Definition of the boundaries of the origin of DNA replication in Simian Virus 40 JOURNAL Nucleic Acids Res. 5, 3635-3642 (1978) STANDARD full staff_review REFERENCE 32 (sites; splice sites in late mRNAs) AUTHORS Lai,C.-J., Dhar,R. and Khoury,G. TITLE Mapping the spliced and unspliced late lytic SV40 RNAs JOURNAL Cell 14, 971-982 (1978) STANDARD full staff_review REFERENCE 33 (sites; early mRNAs) AUTHORS May,E., Kress,M. and May,P. TITLE Characterization of two SV40 early mRNAs and evidence for a nuclear "prespliced" RNA species JOURNAL Nucleic Acids Res. 5, 3083-3099 (1978) STANDARD full staff_review REFERENCE 34 (sites; early mRNAs and RNA splicing) AUTHORS Berk,A.J. and Sharp,P.A. TITLE Spliced early mRNAs of Simian Virus 40 JOURNAL Proc. Natl. Acad. Sci. U.S.A. 75, 1274-1278 (1978) STANDARD full staff_review REFERENCE 35 (review; bases 1 to 163; 181 to 5243) AUTHORS Lebowitz,P. and Weissman,S.M. TITLE Organization and transcription of the Simian Virus 40 genome JOURNAL Curr. Top. Microbiol. Immunol. 87, 43-172 (1979) STANDARD full staff_review REFERENCE 36 (bases 1 to 472; 5174 to 5243) AUTHORS van Heuverswyn,H. and Fiers,W. TITLE Nucleotide sequence of the Hind-C fragment of Simian Virus 40 DNA: comparison of the 5'-untranslated region of wild-type virus and of some deletion mutants JOURNAL Eur. J. Biochem. 100, 51-60 (1979) STANDARD full staff_review REFERENCE 37 (bases 1276 to 1539; deletion mutant analysis) AUTHORS Contreras,R., Cole,C., Berg,P. and Fiers,W. TITLE Nucleotide sequence analysis of two Simian Virus 40 mutants with deletions in the late region of the genome JOURNAL J. Virol. 29, 789-793 (1979) STANDARD full staff_review REFERENCE 38 (bases 325 to 562; 1403 to 1582; cDNA) AUTHORS Bina-Stein,M., Thoren,M., Salzman,N. and Thompson,J.A. TITLE Rapid sequence determination of late Simian Virus 40 16S mRNA leader by using inhibitors of reverse transcriptase JOURNAL Proc. Natl. Acad. Sci. U.S.A. 76, 731-735 (1979) STANDARD full staff_review REFERENCE 39 (bases 2691 to 4571; 4638 to 5163; sites-5' termini of early mRNAs) AUTHORS Reddy,V.B., Ghosh,P.K., Lebowitz,P., Piatak,M. and Weissman,S.M. TITLE Simian Virus 40 early mRNA's: I. genomic localization of 3' and 5' termini and two major splices in mRNA from transformed and lytically infected cells JOURNAL J. Virol. 30, 279-296 (1979) STANDARD full staff_review REFERENCE 40 (sites; splice sites for large T antigen mRNA) AUTHORS Mark,D.F. and Berg,P. TITLE A third splice site in SV40 early mRNA JOURNAL Cold Spring Harb. Symp. Quant. Biol. 44, 55-62 (1980) STANDARD full staff_review REFERENCE 41 (bases 4963 to 5179; deletion mutant analysis) AUTHORS Pipas,J.M., Adler,S.P., Peden,K.W.C. and Nathans,D. TITLE Deletion mutants of SV40 that affect the structure of viral tumor antigens JOURNAL Cold Spring Harb. Symp. Quant. Biol. 44, 285-291 (1980) STANDARD full staff_review REFERENCE 42 (bases 2667 to 2840; deletion mutant analysis) AUTHORS van Heuverswyn,H., Cole,C., Berg,P. and Fiers,W. TITLE Nucleotide sequence analysis of two Simian Virus 40 mutants with deletions in the region coding for the carboxyl terminus of the T antigen JOURNAL J. Virol. 30, 936-941 (1979) STANDARD full staff_review REFERENCE 43 (bases 263 to 532; deletion mutant analysis) AUTHORS Subramanian,K.N. TITLE Segments of Simian Virus 40 DNA spanning most of the leader sequence of the major late viral messenger RNA are dispensable JOURNAL Proc. Natl. Acad. Sci. U.S.A. 76, 2556-2560 (1979) STANDARD full staff_review REFERENCE 44 (bases 4548 to 4923; deletion mutant analysis) AUTHORS Volckaert,G., Feunteun,J., Crawford,L.V., Berg,P. and Fiers,W. TITLE Nucleotide sequence deletions within the coding region for small-t antigen of Simian Virus 40 JOURNAL J. Virol. 30, 674-682 (1979) STANDARD full staff_review REFERENCE 45 (bases 250 to 304; 345 to 399; deletion mutant analysis) AUTHORS Haegeman,G., van Heuverswyn,H., Gheysen,D. and Fiers,W. TITLE Heterogeneity of the 5' terminus of late mRNA induced by a viable Simian Virus 40 deletion mutant JOURNAL J. Virol. 31, 484-493 (1979) STANDARD full staff_review REFERENCE 46 (bases 4566 to 5225; cDNA) AUTHORS Thompson,J.A., Radonovich,M.F. and Salzman,N.P. TITLE Characterization of the 5'-terminal structure of Simian Virus 40 early mRNA's JOURNAL J. Virol. 31, 437-446 (1979) STANDARD full staff_review REFERENCE 47 (sites; deletion mutant analysis over bases 4553 to 5172) AUTHORS Thimmappaya,B. and Shenk,T. TITLE Nucleotide sequence analysis of viable deletion mutants lacking segments of the Simian Virus 40 genome coding for small t antigen JOURNAL J. Virol. 30, 668-673 (1979) STANDARD full staff_review REFERENCE 48 (sites; deletion mutant analysis of major late cap site) AUTHORS Haegeman,G., Iserentant,D., Gheysen,D. and Fiers,W. TITLE Characterization of the major altered leader sequence of late mRNA induced by SV40 deletion mutant dl-1811 JOURNAL Nucleic Acids Res. 7, 1799-1814 (1979) STANDARD full staff_review REFERENCE 49 (sites; splice sites in early mRNAs) AUTHORS Khoury,G., Gruss,P., Dhar,R. and Lai,C.-J. TITLE Processing and expression of early SV40 mRNA: a role for RNA conformation in splicing JOURNAL Cell 18, 85-92 (1979) STANDARD full staff_review REFERENCE 50 (review; bases 1 to 163; 181 to 5243) AUTHORS Buchman,A.R., Burnett,L. and Berg,P. TITLE Appendix A: The SV40 nucleotide sequence JOURNAL (in) Tooze,J. (Eds.); DNA Tumor Viruses - Second Edition Revised: 799-841; Cold Spring Harbor Laboratory, Cold Spring Harbor (1981). STANDARD full staff_review REFERENCE 51 (bases 2663 to 2921; deletion mutant analysis) AUTHORS Alwine,J.C. and Khoury,G. TITLE Simian Virus 40-associated small RNA: mapping on the Simian Virus 40 genome and characterization of its synthesis JOURNAL J. Virol. 36, 701-708 (1980) STANDARD full staff_review REFERENCE 52 (bases 4007 to 4574) AUTHORS Volckaert,G., van de Voorde,A. and Fiers,W. TITLE Nucleotide sequence of the Simian Virus 40 HindII+III restriction fragment A: (second part of the T antigen gene) JOURNAL Eur. J. Biochem. 106, 169-177 (1980) STANDARD full staff_review REFERENCE 53 (bases 3476 to 3735; RNA and DNA) AUTHORS van Herreweghe,J., van de Voorde,A. and Fiers,W. TITLE Nucleotide sequence of the Simian Virus 40 HindII+III restriction fragment I: (fourth part of the T antigen gene) JOURNAL Eur. J. Biochem. 106, 179-192 (1980) STANDARD full staff_review REFERENCE 54 (bases 3146 to 3481) AUTHORS van Herreweghe,J., van de Voorde,A. and Fiers,W. TITLE Nucleotide sequence of the Hind-I-proximal part of Simian Virus 40 HindII+III restriction fragment B: (fifth part of the T antigen gene JOURNAL Eur. J. Biochem. 106, 193-198 (1980) STANDARD full staff_review REFERENCE 55 (bases 2669 to 3251) AUTHORS van Heuverswyn,H., van de Voorde,A., van Herreweghe,J., Volckaert,G., de Winne,P. and Fiers,W. TITLE Nucleotide sequence of Simian Virus 40 DNA: structure of the middle segment of the HindII+III restriction fragment B: (sixth part of the T antigen gene) and codon usage JOURNAL Eur. J. Biochem. 106, 199-209 (1980) STANDARD full staff_review REFERENCE 56 (bases 4572 to 4917) AUTHORS Seif,I., Khoury,G. and Dhar,R. TITLE A rapid enzymatic DNA sequencing technique: determination of sequence alterations in early Simian Virus 40 temperature sensitive and deletion mutants JOURNAL Nucleic Acids Res. 8, 2225-2240 (1980) STANDARD full staff_review REFERENCE 57 (bases 1 to 74; 5161 to 5243; deletion and point mutants over the replication origin) AUTHORS DiMaio,D. and Nathans,D. TITLE Cold-sensitive regulatory mutants of Simian Virus 40 JOURNAL J. Mol. Biol. 140, 129-142 (1980) STANDARD full staff_review REFERENCE 58 (bases 13 to 65; 5155 to 5227; deletion mutants over the early promoter) AUTHORS Benoist,C. and Chambon,P. TITLE Deletions covering the putative promoter region of early mRNAs of Simian Virus 40 do not abolish T-antigen expression JOURNAL Proc. Natl. Acad. Sci. U.S.A. 77, 3865-3869 (1980) STANDARD full staff_review REFERENCE 59 (sites; T antigen binding sites) AUTHORS Shalloway,D., Kleinberger,T. and Livingston,D.M. TITLE Mapping of SV40 DNA replication origin region binding sites for the SV40 T antigen by protection against exonuclease III digestion JOURNAL Cell 20, 411-422 (1980) STANDARD full staff_review REFERENCE 60 (bases 1 to 33; 212 to 236; 5161 to 5243; origin-defective deletion mutants) AUTHORS Gluzman,Y., Sambrook,J.F. and Frisque,R.J. TITLE Expression of early genes of origin-defective mutants of Simian Virus 40 JOURNAL Proc. Natl. Acad. Sci. U.S.A. 77, 3898-3902 (1980) STANDARD full staff_review REFERENCE 61 (bases 5213 to 5236; early mRNA initiation sites) AUTHORS Haegeman,G. and Fiers,W. TITLE Characterization of the 5'-terminal cap structures of early Simian Virus 40 mRNA JOURNAL J. Virol. 35, 955-961 (1980) STANDARD full staff_review REFERENCE 62 (sites; early mRNA initiation sites and repression by T antigen) AUTHORS Hansen,U., Tenen,D.G., Livingston,D.M. and Sharp,P.A. TITLE T antigen repression of SV40 early transcription from two promoters JOURNAL Cell 27, 603-612 (1981) STANDARD full staff_review REFERENCE 63 (sites; poly-A at 2662; deletion mutants over 2533-2775) AUTHORS Fitzgerald,M. and Shenk,T. TITLE The sequence 5'-AAUAAA-3' forms part of the recognition site for polyadenylation of late SV40 mRNAs JOURNAL Cell 24, 251-260 (1981) STANDARD full staff_review REFERENCE 64 (bases 306 to 428; deletion mutant analysis) AUTHORS Woodworth-Gutai,M. TITLE Recombination in SV40-infected cells: viral DNA sequences at sites of circularization of transfecting linear DNA JOURNAL Virology 109, 353-365 (1981) STANDARD full staff_review REFERENCE 65 (bases 101 to 178; 251 to 280; early promoter analysis) AUTHORS Gruss,P., Dhar,R. and Khoury,G. TITLE Simian Virus 40 tandem repeated sequences as an element of the early promoter JOURNAL Proc. Natl. Acad. Sci. U.S.A. 78, 943-947 (1981) STANDARD full staff_review REFERENCE 66 (bases 110 to 178; 251 to 601; deletion mutant analysis) AUTHORS Barkan,A. and Mertz,J.E. TITLE DNA sequence analysis of Simian Virus 40 mutants with deletions mapping in the leader region of the late viral mRNA's: mutants with deletions similar in size and position exhibit varied phenotypes JOURNAL J. Virol. 37, 730-737 (1981) STANDARD full staff_review REFERENCE 67 (bases 1 to 54; 5148 to 5243; cDNAs over this region) AUTHORS Ghosh,P.K., Lebowitz,P., Frisque,R.J. and Gluzman,Y. TITLE Identification of a promoter component involved in positioning the 5' termini of Simian Virus 40 early mRNAs JOURNAL Proc. Natl. Acad. Sci. U.S.A. 78, 100-104 (1981) STANDARD full staff_review REFERENCE 68 (bases 1 to 173; 182 to 280; 5216 to 5243; early promoter analysis) AUTHORS Benoist,C. and Chambon,P. TITLE In vivo sequence requirements of the SV40 early promoter region JOURNAL Nature 290, 304-310 (1981) STANDARD full staff_review REFERENCE 69 (sites; promoter for early mRNAs) AUTHORS Mathis,D.J. and Chambon,P. TITLE The SV40 early region TATA box is required for accurate in vitro initiation of transcription JOURNAL Nature 290, 310-315 (1981) STANDARD full staff_review REFERENCE 70 (bases 1 to 177; 5171 to 5243; autoregulation by T antigen) AUTHORS Myers,R.M., Rio,D.C., Robbins,A.K. and Tjian,R. TITLE SV40 gene expression is modulated by the cooperative binding of T antigen to DNA JOURNAL Cell 25, 373-384 (1981) STANDARD full staff_review REFERENCE 71 (bases 1 to 102; 5148 to 5243; upstream shift of early mRNA initiation site) AUTHORS Ghosh,P.K. and Lebowitz,P. TITLE Simian Virus 40 early mRNA's contain multiple 5'termini upstream and downstream from a Hogness-Goldberg sequence; a shift in 5' termini during the lytic cycle is mediated by large T antigen JOURNAL J. Virol. 40, 224-240 (1981) STANDARD full staff_review REFERENCE 72 (bases 2644 to 2923; deletion mutant analysis) AUTHORS Feunteun,J., Carmichael,G., Nicolas,J.C. and Kress,M. TITLE Mutant carrying deletions in the two Simian Virus 40 early genes JOURNAL J. Virol. 40, 625-634 (1981) STANDARD full staff_review REFERENCE 73 (sites; cds for agnoprotein) AUTHORS Jackson,V. and Chalkley,R. TITLE Use of whole-cell fixation to visualize replicating and maturing Simian Virus 40: identification of new viral gene product JOURNAL Proc. Natl. Acad. Sci. U.S.A. 78, 6081-6085 (1981) STANDARD full staff_review REFERENCE 74 (bases 1 to 32; 212 to 236; 5161 to 5243; T antigen binding to a deletion mutant) AUTHORS McKay,R. and DiMaio,D. TITLE Binding of an SV40 T antigen-related protein to the DNA of SV40 regulatory mutants JOURNAL Nature 289, 810-813 (1981) STANDARD full staff_review REFERENCE 75 (sites; transcriptional enhancement by the 72 bp repeats) AUTHORS Moreau,V., Hen,R., Wasylyk,B., Everett,R., Gaub,M.P. and Chambon,P. TITLE The SV40 72 base repair repeat has a striking effect on gene expression both in SV40 and other chimeric recombinants JOURNAL Nucleic Acids Res. 9, 6047-6068 (1981) STANDARD full staff_review REFERENCE 76 (bases 325 to 622; 1463 to 1511; cDNAs; late mRNA synthesis by deletion mutants) AUTHORS Piatak,M., Subramanian,K.N., Roy,P. and Weissman,S.M. TITLE Late messenger RNA production by viable Simian Virus 40 mutants with deletions in the leader region JOURNAL J. Mol. Biol. 153, 589-618 (1981) STANDARD full staff_review REFERENCE 77 (sites; agnoprotein coding sequence) AUTHORS Jay,G., Nomura,S., Anderson,C.W. and Khoury,G. TITLE Identification of the SV40 agnogene product: a DNA binding protein JOURNAL Nature 291, 346-349 (1981) STANDARD full staff_review REFERENCE 78 (sites; mapping of the origin of replication) AUTHORS Bergsma,D.J., Olive,D.M., Hartzell,S.W. and Subramanian,K.N. TITLE Territorial limits and functional anatomy of the Simian Virus 40 replication origin JOURNAL Proc. Natl. Acad. Sci. U.S.A. 79, 381-385 (1982) STANDARD full staff_review REFERENCE 79 (sites; topoisomerase cleavage sites) AUTHORS Edwards,K.A., Halligan,B.D., Davis,J.L., Nivera,N.L. and Liu,L.F. TITLE Recognition sites of eukaryotic DNA topoisomerase I: DNA nucleotide sequencing analysis of topo I cleavage sites on SV40 DNA JOURNAL Nucleic Acids Res. 10, 2565-2576 (1982) STANDARD full staff_review REFERENCE 80 (bases 2619 to 2920; deletion mutant analysis) AUTHORS Polvino-Bodnar,M. and Cole,C.N. TITLE Construction and characterization of viable deletion mutants of Simian Virus 40 lacking sequences near the 3' end of the early region JOURNAL J. Virol. 43, 489-502 (1982) STANDARD full staff_review REFERENCE 81 (bases 255 to 2535; only joints of deletion mutants) AUTHORS Johnson,A.D., Barkan,A. and Mertz,J.E. TITLE Nucleotide sequence analysis of the recombinant joints in 16 naturally arising deletion mutants of Simian Virus 40 JOURNAL Virology 123, 464-469 (1982) STANDARD full staff_review REFERENCE 82 (bases 260 to 422; RNA) AUTHORS Hay,N., Skolnik-David,H. and Aloni,Y. TITLE Attenuation in the control of SV40 gene expression JOURNAL Cell 29, 183-193 (1982) STANDARD full staff_review REFERENCE 83 (sites; late mRNA initiation sites) AUTHORS Alwine,J.C. TITLE Evidence for Simian Virus 40 late transcriptional control: mixed infections of wild-type Simian Virus 40 and a late leader deletion mutant exhibit trans effects on late viral RNA synthesis JOURNAL J. Virol. 42, 798-803 (1982) STANDARD full staff_review REFERENCE 84 (bases 284 to 326; late promoter analysis) AUTHORS Brady,J., Radonovich,M., Vodkin,M., Natarajan,V., Thoren,M., Das,G., Janik,J. and Salzman,N.P. TITLE Site-specific base substitution and deletion mutations that enhance or suppress transcription of the SV40 major late RNA JOURNAL Cell 31, 625-633 (1982) STANDARD full staff_review REFERENCE 85 (bases 1 to 16; 5227 to 5243; mutant analysis over origin of replication) AUTHORS Bergsma,D.J., Olive,D.M., Hartzell,S.W., Byrne,B.J. and Subramanian,K.N. TITLE Cyclization of linear chimeric plasmids in vivo by a novel end-to-end joining reaction or by intramolecular recombination: one of the products contains a 147-bp perfect palindrome stable in Escherichia coli JOURNAL Gene 20, 157-167 (1982) STANDARD full staff_review REFERENCE 86 (bases 5164 to 5243; 1 to 48; T antigen binding to a deletion mutant) AUTHORS Tenen,D.G., Haines,L.L. and Livingston,D.M. TITLE Binding of an analog of the Simian Virus 40 T antigen to wild-type and mutant viral replication origins JOURNAL J. Mol. Biol. 157, 473-492 (1982) STANDARD full staff_review REFERENCE 87 (sites; T antigen binding sites) AUTHORS DiMaio,D. and Nathans,D. TITLE Regulatory mutants of Simian Virus 40: effect of mutations at a T antigen binding site on DNA replication and expression of viral genes JOURNAL J. Mol. Biol. 156, 531-548 (1982) STANDARD full staff_review REFERENCE 88 (sites; mapping of early region promoter) AUTHORS Fromm,M. and Berg,P. TITLE Deletion mapping of DNA regions required for SV40 early promoter function in vivo JOURNAL J. Mol. Appl. Genet. 1, 457-481 (1982) STANDARD full staff_review REFERENCE 89 (sites; SV40 enhancement of beta-globin gene expression) AUTHORS Banerji,J., Rusconi,S. and Schaffner,W. TITLE Expression of a beta-globin gene is enhanced by remote SV40 DNA sequences JOURNAL Cell 27, 299-308 (1981) STANDARD full staff_review REFERENCE 90 (bases 1 to 129; 5228 to 5243; mapping of the early promoter) AUTHORS Byrne,B.J., Davis,M.S., Yamaguchi,J., Bergsma,D.J. and Subramanian,K.N. TITLE Definition of the Simian Virus 40 early promoter region and demonstration of a host range bias in the enhancement effect of the Simian Virus 40 72-base-pair repeat JOURNAL Proc. Natl. Acad. Sci. U.S.A. 80, 721-725 (1983) STANDARD full staff_review REFERENCE 91 (sites; agnogene mutants) AUTHORS Mertz,J.E., Murphy,A. and Barkan,A. TITLE Mutants deleted in the agnogene of Simian Virus 40 define a new complementation group JOURNAL J. Virol. 45, 36-46 (1983) STANDARD full staff_review REFERENCE 92 (bases 335 to 394; 1706 to 1753; mutant analysis) AUTHORS Margolskee,R.F. and Nathans,D. TITLE Suppression of a VP1 mutant of Simian Virus 40 by missense mutations in serine codons of the viral agnogene JOURNAL J. Virol. 48, 405-409 (1983) STANDARD full staff_review REFERENCE 93 (bases 3512 to 3565; 4154 to 4311; mutant analysis over these regions) AUTHORS Pipas,J.M., Peden,K.W.C. and Nathans,D. TITLE Mutational analysis of Simian Virus 40 T antigen: isolation and characterization of mutants with deletions in the T-antigen gene JOURNAL Mol. Cell. Biol. 3, 203-213 (1983) STANDARD full staff_review REFERENCE 94 (bases 1709 to 1786; topoisomerase II) AUTHORS Liu,L.F., Rowe,T.C., Yang,L., Tewey,K.M. and Chen,G.L. TITLE Cleavage of DNA by mammalian DNA topoisomerase II JOURNAL J. Biol. Chem. 258, 15365-15370 (1983) STANDARD full staff_review REFERENCE 95 (bases 2773 to 4406; only joints of deletion mutants) AUTHORS Tornow,J. and Cole,C.N. TITLE Intracistronic complementation in the Simian Virus 40 A gene JOURNAL Proc. Natl. Acad. Sci. U.S.A. 80, 6312-6316 (1983) STANDARD full staff_review REFERENCE 96 (bases 21 to 118; transcription factor binding to early promoter) AUTHORS Dynan,W.S. and Tjian,R. TITLE The promoter-specific transcription factor Sp1 binds to upstream sequences in the SV40 early promoter JOURNAL Cell 35, 79-87 (1983) STANDARD full staff_review REFERENCE 97 (sites; 7S-K RNA of transformed cells) AUTHORS Sohn,U., Szyszko,J., Coombs,D. and Krause,M. TITLE 7S-K nuclear RNA from Simian Virus 40-transformed cells has sequence homology to the viral early promoter JOURNAL Proc. Natl. Acad. Sci. U.S.A. 80, 7090-7094 (1983) STANDARD full staff_review REFERENCE 98 (bases 1 to 49; 5181 to 5243; T antigen binding analysis) AUTHORS Tenen,D.G., Livingston,D.M., Wang,S.-S. and Martin,R.G. TITLE Effect of a stem-loop structure within the SV40 replication origin upon SV40 T antigen binding to origin region sequences JOURNAL Cell 34, 629-639 (1983) STANDARD full staff_review REFERENCE 99 (bases 253 to 482; cDNA; late mRNA initiation) AUTHORS Piatak,M., Ghosh,P.K., Norkin,L.C. and Weissman,S.M. TITLE Sequences locating the 5' ends of the major Simian Virus 40 late mRNA forms JOURNAL J. Virol. 48, 503-520 (1983) STANDARD full staff_review REFERENCE 100(bases 1 to 14; 5210 to 5243; autoregulation by T antigen) AUTHORS Rio,D.C. and Tjian,R. TITLE Sv40 T antigen binding site mutations that affect autoregulation JOURNAL Cell 32, 1227-1240 (1983) STANDARD full staff_review REFERENCE 101(bases 1 to 134; 5216 to 5243; early promoter analysis) AUTHORS Everett,R.D., Baty,D. and Chambon,P. TITLE The repeated GC-rich motifs upstream from the TATA box are important elements of the SV40 early promoter JOURNAL Nucleic Acids Res. 11, 2447-2464 (1983) STANDARD full staff_review REFERENCE 102(bases 5165 to 5243) AUTHORS Tenen,D.G., Taylor,T.S., Haines,L.L., Bradley,M.K., Martin,R.G. and Livingston,D.M. TITLE Binding of simian virus 40 large T antigen from virus-infected monkey cells to wild-type and mutant viral replication origins JOURNAL J. Mol. Biol. 168, 791-808 (1983) STANDARD full staff_review REFERENCE 103(bases 1 to 26; 5170 to 5243; T antigen binding sites) AUTHORS Jones,K.A. and Tjian,R. TITLE Essential contact residues within SV40 large T antigen binding sites I and II identified by alkylation-interference JOURNAL Cell 36, 155-162 (1984) STANDARD full staff_review REFERENCE 104(sites; sites-5' termini of late mRNAs) AUTHORS Lycan,D.E. and Danna,K.J. TITLE S1 mapping of purified nascent transcripts of Simian Virus 40 JOURNAL Mol. Cell. Biol. 4, 625-633 (1984) STANDARD full staff_review REFERENCE 105(bases 1 to 33; 5213 to 5243; T antigen binding analysis) AUTHORS Lewton,B.A., Delucia,A.L. and Tegtmeyer,P. TITLE Binding of Simian Virus 40 A protein to DNA with deletions at the origin of replication JOURNAL J. Virol. 49, 9-13 (1984) STANDARD full staff_review REFERENCE 106(sites; T antigen binding sites) AUTHORS Pomerantz,B.J. and Hassell,J.A. TITLE Polyomavirus and Simian Virus 40 large T antigens bind to common DNA sequences JOURNAL J. Virol. 49, 925-937 (1984) STANDARD full staff_review REFERENCE 107(bases 1 to 118; 5186 to 5243; primase initiation sites) AUTHORS Tseng,B.Y. and Ahlem,C.N. TITLE Mouse primase initiation sites in the origin region of Simian Virus 40 JOURNAL Proc. Natl. Acad. Sci. U.S.A. 81, 2342-2346 (1984) STANDARD full staff_review REFERENCE 108(bases 1 to 294; 297 to 379; transcriptional enhancer analysis) AUTHORS Weber,F., de Villiers,J. and Schaffner,W. TITLE An SV40 'enhancer trap' incorporates exogenous enhancers or generates enhancers from its own sequences JOURNAL Cell 36, 983-992 (1984) STANDARD full staff_review REFERENCE 109(bases 1 to 33; 5213 to 5243; mutant analysis at the replication origin) AUTHORS Cohen,G.L., Wright,P.J., Delucia,A.L., Lewton,B.A., Anderson,M.E. and Tegtmeyer,P. TITLE Critical spatial requirement within the origin of Simian Virus 40 DNA replication JOURNAL J. Virol. 51, 91-96 (1984) STANDARD full staff_review REFERENCE 110(sites; mRNA polyadenylation signal sequences) AUTHORS Sadofsky,M. and Alwine,J.C. TITLE Sequences on the 3' side of hexanucleotide AAUAAA affect efficiency of cleavage at the polyadenyation site JOURNAL Mol. Cell. Biol. 4, 1460-1468 (1984) STANDARD full staff_review REFERENCE 111(bases 1582 to 1635; 2524 to 2652; chromatin structure at origin of replication; mutational analysis) AUTHORS Innis,J.W. and Scott,W.A. TITLE DNA replication and chromatin structure of Simian Virus 40 insertion mutants JOURNAL Mol. Cell. Biol. 4, 1499-1507 (1984) STANDARD full staff_review REFERENCE 112(sites; bidirectional promoter element) AUTHORS Hartzell,S.W., Byrne,B.J. and Subramanian,K.N. TITLE Mapping of the late promoter of Simian virus 40 JOURNAL Proc. Natl. Acad. Sci. U.S.A. 81, 23-27 (1984) STANDARD full staff_review REFERENCE 113(sites; late promoter sequence requirements in the presence of T antigen) AUTHORS Hartzell,S.W., Byrne,B.J. and Subramanian,K.N. TITLE The simian virus 40 minimal origin and the 72-base-pair repeat are required simultaneously for efficient induction of late gene expression with large tumor antigen JOURNAL Proc. Natl. Acad. Sci. U.S.A. 81, 6335-6339 (1984) STANDARD full staff_review REFERENCE 114(bases 304 to 528) AUTHORS Hay,N., Kessler,M. and Aloni,Y. TITLE SV40 deletion mutant (d1861) with agnoprotein shortened by four amino acids JOURNAL Virology 137, 160-170 (1984) STANDARD full staff_review REFERENCE 115(bases 1 to 366; 5230 to 5243) AUTHORS Swimmer,C. and Shenk,T. TITLE A viable simian virus 40 variant that carries a newly generated sequence reiteration in place of the normal duplicated enhancer element JOURNAL Proc. Natl. Acad. Sci. U.S.A. 81, 6652-6656 (1984) STANDARD full staff_review REFERENCE 116(sites; Okazaki fragment sequences) AUTHORS Hay,R.T., Hendrickson,E.A. and DePamphilis,M.L. TITLE Sequence specificity for the initiation of RNA-primed simian virus 40 DNA synthesis in vivo JOURNAL J. Mol. Biol. 175, 131-157 (1984) STANDARD full staff_review REFERENCE 117(sites; tsA3900 point mutant sequence) AUTHORS Hutchinson,N.I., Chang,L.-S., Pater,M.M., Bouck,N., Shenk,T.E. and Di Mayorca,G. TITLE Characterization of a new simian virus 40 mutant, tsA3900, isolated from deletion mutant tsA1499 JOURNAL J. Virol. 53, 814-821 (1985) STANDARD full staff_review REFERENCE 118(sites; large t-antigen binding specificity) AUTHORS Scheller,A. and Prives,C. TITLE Simian virus 40 and polyomavirus large tumor antigens have different requirements for high-affinity sequence-specific DNA binding JOURNAL J. Virol. 54, 532-545 (1985) STANDARD full staff_review REFERENCE 119(bases 1499 to 2590; temperature-sensitive mutants) AUTHORS Ng,S.-C., Behm,M. and Bina,M. TITLE DNA sequence alterations responsible for the synthesis of thermosensitive VP1 in temperature-sensitive BC mutants of simian virus 40 JOURNAL J. Virol. 54, 646-649 (1985) STANDARD full staff_review REFERENCE 120(sites; recombination in poly(GT)-sequences) AUTHORS Stringer,J.R. TITLE Recombination between poly[d(GT)-d(CA)] sequences in simian virus 40-infected cultured cells JOURNAL Mol. Cell. Biol. 5, 1247-1259 (1985) STANDARD full staff_review REFERENCE 121(sites; large T-antigen binding sites) AUTHORS Barrera-Saldana,H., Takahashi,K., Vigneron,M., Wildeman,A., Davidson,I. and Chambon,P. TITLE All six GC-motifs of the SV40 early upstream element contribute to promoter activity in vivo and in vitro JOURNAL EMBO J. 4, 3839-3849 (1985) STANDARD full staff_review REFERENCE 122(bases 4556 to 4657; 4907 to 4937) AUTHORS Noble,J.C., Prives,C. and Manley,J.L. TITLE In vitro splicing of simian virus 40 early pre mRNA JOURNAL Nucleic Acids Res. 14, 1219-1235 (1986) STANDARD full staff_review REFERENCE 123(sites; SV40 early leader protein (SELP) boundaries) AUTHORS Khalili,K., Brady,J. and Khoury,G. TITLE Translational regulation of SV40 early mRNA defines a new viral protein JOURNAL Cell 48, 639-645 (1987) STANDARD full staff_review REFERENCE 124(bases 1494 to 1708) AUTHORS Rogiers,R., van de Voorde,A., Soeda,E. and Fiers,W. TITLE Nucleotide sequence of the Simian virus 40 Hind-K restriction fragment JOURNAL Eur. J. Biochem. 85, 205-224 (1978) STANDARD full staff_entry REFERENCE 125(bases 662 to 1051) AUTHORS Ysebaert,M., van de Voorde,A. and Fiers,W. TITLE Nucleotide sequence of the Simian virus 40 HindII+III restriction fragment D and the total amino acid sequence of the late proteins VP2 and VP3 JOURNAL Eur. J. Biochem. 91, 431-439 (1978) STANDARD full staff_entry REFERENCE 126(bases 2060 to 2299) AUTHORS van Heuverswyn,H., van de Voorde,A. and Fiers,W. TITLE Nucleotide sequence of the Simian virus 40 HindII+III restriction fragment J and the total amino acid sequence of the major structural protein VP1 JOURNAL Eur. J. Biochem. 91, 415-430 (1978) STANDARD full staff_entry REFERENCE 127(bases 1 to 107; 5146 to 5243) AUTHORS Hay,R.T. and DePamphilis,M.L. TITLE Initiation of SV40 DNA replication in vivo: Location and structure of 5' ends of DNA synthesized in the ori region JOURNAL Cell 28, 767-779 (1982) STANDARD full staff_entry REFERENCE 128(bases 80 to 350) AUTHORS Contreras,R., Gheysen,D., Knowland,J., van de Voorde,A. and Fiers,W. TITLE Evidence for the direct involvement of DNA replication origin in synthesis of late SV40 RNA JOURNAL Nature 300, 500-505 (1982) STANDARD full staff_entry REFERENCE 129(sites; early SV40 transcription) AUTHORS Lebowitz,P. and Ghosh,P. TITLE Initiation and regulation of Simian virus 40 early transcription in vitro JOURNAL J. Virol. 41, 449-461 (1982) STANDARD full staff_entry REFERENCE 130(sites; temperature sensitive mutants in the VP1 gene) AUTHORS Behm,M., Lowman,H., Ng,S.-C. and Bina,M. TITLE Analysis of temperature-sensitive mutations in the simian virus 40 gene encoding virion protein 1 JOURNAL Proc. Natl. Acad. Sci. U.S.A. 85, 9421-9425 (1988) STANDARD full staff_review REFERENCE 131(sites; ends of "early" and "late" mRNA) AUTHORS Subramanian,K.N., Ghosh,P.K., Dhar,R., Thimmappaya,B., Zain,S.B., Pan,J. and Weissman,S.M. TITLE The primary structure of regions of SV40 DNA encoding the ends of mRNA JOURNAL Prog. Nucleic Acid Res. Mol. Biol. 19, 157-164 (1976) STANDARD simple staff_review REFERENCE 132(bases 1 to 164; 5092 to 5243) AUTHORS Weissman,S.M., Subramanian,K.N., Dhar,R., Zain,B. and Pan,J. TITLE SV40 DNA sequences as an example of the structure of genes functioning in animal cell nuclei JOURNAL Med. Pediatr. Oncol. 2, 259-263 (1976) STANDARD full staff_entry REFERENCE 133(bases 1787 to 2059) AUTHORS Contreras,R., Van De Voorde,A. and Fiers,W. TITLE Nucleotide sequence of the restriction fragment Hind-F-EcoRI-1 of simian-virus-40 DNA (part of the VP1 gene) JOURNAL Eur. J. Biochem. 86, 317-324 (1978) STANDARD simple staff_review COMMENT Kindly reviewed by K. N. Subramanian. Draft entry and computer- readable sequence for [130] kindly submitted by M.Bina, 16-FEB-1989. The E or minus strand of SV40 strain 776 is shown as it is reported by [16] and [17] using the numbering system by [50], with exception of of the addition of 17 bases (between nucleotides 164 and 165) to their sequence [36]. Differences between the complete sequences as published by [16] and [17] and this sequence are limited to the 17 bp change, the revision at 328 below, and the numbering. All references tend to agree upon an independent coordinate system, where map units run from 0.00 to 1.00 and the origin is defined by the sole EcoRI site, which is found at base 1782 below. In keeping with [50], map units for this entry have been calculated by SV + 3461 mu = ----------- 5243 If the result is > 1, subtract 1 for the value. The origin of this sequence, then, is at 0.66. Identical units are used for Polyoma virus. DNA replication is bidirectional from an origin which has been narrowed by mutational analysis to bases 5193 to 34 [31]. Given that early mRNA transcription also arises from this region, the two processes will be, to some extent, simultaneously regulated. In addition to the 21-bp and 72-bp repeats concerned with early transcription (see below), the origin includes a 17-bp A + T-rich sequence; further demarcation of a "core" region and an "auxiliary" region makes the former comprise a 17-bp palindrome (5193-5209), a 15-bp palindrome (5213-5227), a 27-bp palindrome (5230-13) and the A + T-rich region (15-31), and the latter the three 21-bp repeats [85]. Thus the "core" overlaps the T-antigen binding sites I and II and the "auxiliary region" (which is said to enhance replication efficiency) overlaps the T-antigen binding site III [85]. Transcription of early mRNAs (18s and 19s) is leftward off the E, or minus, strand and transcription of late mRNAs (16s and 19s) is rightward off the L, or plus, strand. In the former case, the annotation will show the symbols "(c)" and "comp strand" for the transcripts and products, and for the regulatory elements which govern them. In some references, the E strand has been labeled "Late" to indicate that it has the polarity of late mRNA, and the L strand "Early" to indicate that it has the polarity of early mRNA. (Early mRNAs can be produced throughout the lytic cycle.) The characterization of the mRNAs as 18s and 19s early and 16s and 19s late is a simplification: heterogeneity with regard to 5' and 3' termini, splice sites, and abundance as a function of time in the lytic cycle and regulatory events, have all been observed [24] [35],[67],[71]. For the purposes of this entry, only the simplest major mRNA species have been entered in the Sites. The promoter for the early mRNAs is considered to include at least bases 5185 to 250 on the E strand: the T antigen binding sites (for which the minimal spans are given [62]), the "TATA" box at position 21, the 21-bp repeats, and the 72-bp repeats are all known to play a role in early transcription. One of the effects of regulation at these sites is to shift the cap site to one of at least seven positions between bases 95 and 32 [71]. Bases 55 to 97 are also of interest because they permit base-pairing homology with the small nuclear RNA 7S-K [97] and because they bind the cell factor SP-1 [96]. For late promoter induction to occur at high efficiency two domains are required simultaneously: the minimal replication origin and the 72 bp repeat [113]. The origin-proximal 22 bp portion of the 72-bp repeat is sufficient for induction, while the origin-distal portion is dispensable [113]. If T antigen is present, the 21 bp repeat is dispensable for induction of late promoter activity in vivo [113]. The regulatory elements for early transcription have been annotated for the "comp strand". The late region induction signals are to be found in the same region as the early regulatory elements [113]. The efficacy of the 72-bp repeat in promoting transcription has been demonstrated for both orientations [75]. The SAS-RNA is a small viral associated RNA whose function is unknown. It appears to arise through processing from some of the longer early transcripts; mutants in the SAS-RNA coding region remain viable [51]. Seven proteins -- the small t antigen, the large T antigen, the three structural proteins, the early leader protein [123], and the agnoprotein (which is thought to be involved with structural development [92])-- are known to be translated from the multiplicity of mRNAs. The existence of a middle T antigen, known to exist in Polyoma, has been suggested [40]. For a summary of other possible coding sequences, see [50]. Missing data project [124-129], [131]. EMBL features not translated to GenBank features: key from to description PRM 21 15 (C) TATA-box [3] MSG 120 2674 major 16S RNA FEATURES from to/span description pept 16 1 (c) early leader protein (SELP; [123]) 5243 5188 (c) early leader protein (SELP; [123]) pept 335 523 agnoprotein (lp-1, VCP) pept 562 1620 VP2 minor structural protein pept 916 1620 VP3 minor structural protein pept 1499 2593 VP1 major structural protein pept 5163 4918 (c) large T antigen exon 1 4571 2691 (c) large T antigen exon 2 pept 5163 4639 (c) small t antigen RNA 2907 2842 (c) SV40-associated small (SAS) RNA; 0.21 [51] pre-msg 264 2676 minor late 19s mRNA [29],[62],[71] pre-msg 325 2676 major late 19s mRNA [10],[19],[29],[30],[38] pre-msg 2587 31 (c) early mRNA (alt.) [3],[34],[39],[62],[71] pre-msg 2599 31 (c) early mRNA (alt.) [28],[62],[71] pre-msg 5225 2587 (c) early mRNA (alt.) [3],[34],[39],[46],[61] pre-msg 5230 2587 (c) early mRNA (alt.) [3],[34],[39],[68] pre-msg 5236 2587 (c) early mRNA (alt.) [3],[6],[34],[39],[61] pre-msg 5225 2599 (c) early mRNA (alt.) [28],[46],[61] pre-msg 5230 2599 (c) early mRNA (alt.) [28],[68] pre-msg 5236 2599 (c) early mRNA (alt.) [6],[28],[39],[61] IVS 295 434 late 19s intron [24],[29] IVS 527 1462 late 16s intron [19],[29],[32] IVS 4636 4572 (c) early 19s mRNA (small t antigen) intron [24],[29] IVS 4917 4572 (c) large T antigen (18s) intron [24],[29] rpt 60 40 (c) 21 bp non-tandem repeat I [90],[96],[97] rpt 82 62 (c) 21 bp tandem repeat II [90],[96],[97] rpt 103 83 (c) 21 bp tandem repeat III [90],[96],[97] rpt 178 107 (c) 72 bp tandem repeat enhancer sequence A [65],[68],[75],[90] rpt 250 179 (c) 72 bp tandem repeat enhancer sequence B [65],[68],[75],[90] binding 5231 13 (c) T antigen binding site II [23],[62],[74] binding 61 36 (c) T antigen binding site III 3' [23],[62],[74] binding 5209 5184 (c) T antigen binding site I [23],[62],[74] signal 27 21 (c) early mRNA promoter element; 0.66 [58],[68],[69] orgrpl 5191 31 replication origin core region; 0.67 [85] orgrpl 32 83 replication origin auxiliary region; 0.67 [85] variant 164 182 gggactttccacacctggt in strain 776 [36]; gt in strain 777; 0.69 [16],[17] revision 328 331 gtgc in [36],[50]; gc in [16],[17] signal 2613 2608 (c) early mRNA polyadenyation signal on the comp strand; 0.16 [28],[110] signal 2642 2637 (c) early mRNA polyadenyation signal on the comp strand; 0.16 [28],[110] signal 2657 2662 late mRNA polyadenyation signal; 0.17 [28],[63],[110] mut 1626 1626 g in wild-type; a in temperature-sensitive mutant tsC260 mut 1667 1667 c in wild-type; a in temperature-sensitive mutant tsB228 mut 1680 1680 c in wild-type; g in temperature-sensitive mutant tsB218 mut 1718 1718 g in wild-type; a in temperature-sensitive mutant tsB[204,211,265] mut 1719 1719 c in wild-type; t in temperature-sensitive mutant tsB8 mut 1756 1756 a in wild-type; c in temperature-sensitive mutant tsB8 mut 1995 1995 g in wild-type; c in temperature-sensitive mutant tsB221 mut 2003 2003 g in wild-type; a in temperature-sensitive mutant tsB201 mut 2084 2084 c in wild-type; t in temperature-sensitive mutant tsBC223 mut 2091 2091 c in wild-type; c in temperature-sensitive mutant tsB4 mut 2141 2141 c in wild-type; t in temperature-sensitive mutant tsC219 mut 2237 2237 c in wild-type; a in temperature-sensitive mutant tsC240 mut 2262 2262 c in wild-type; t in temperature-sensitive mutant tsC260 mut 2354 2354 c in wild-type; t in temperature-sensitive mutant tsBC[208,214,216,217,248,274] mut 2367 2367 a in wild-type; c in temperature-sensitive mutant tsBC11 BASE COUNT 1518 a 1100 c 1039 g 1586 t ORIGIN BglI site on the E (minus) strand of the genome; mu 0.66 [50] 1 gcctcggcct ctgcataaat aaaaaaaatt agtcagccat ggggcggaga atgggcggaa 61 ctgggcggag ttaggggcgg gatgggcgga gttaggggcg ggactatggt tgctgactaa 121 ttgagatgca tgctttgcat acttctgcct gctggggagc ctggggactt tccacacctg 181 gttgctgact aattgagatg catgctttgc atacttctgc ctgctgggga gcctggggac 241 tttccacacc ctaactgaca cacattccac agctggttct ttccgcctca gaaggtacct 301 aaccaagttc ctctttcaga ggttatttca ggccatggtg ctgcgccggc tgtcacgcca 361 ggcctccgtt aaggttcgta ggtcatggac tgaaagtaaa aaaacagctc aacgcctttt 421 tgtgtttgtt ttagagcttt tgctgcaatt ttgtgaaggg gaagatactg ttgacgggaa 481 acgcaaaaaa ccagaaaggt taactgaaaa accagaaagt taactggtaa gtttagtctt 541 tttgtctttt atttcaggtc catgggtgct gctttaacac tgttggggga cctaattgct 601 actgtgtctg aagctgctgc tgctactgga ttttcagtag ctgaaattgc tgctggagag 661 gccgctgctg caattgaagt gcaacttgca tctgttgcta ctgttgaagg cctaacaacc 721 tctgaggcaa ttgctgctat aggcctcact ccacaggcct atgctgtgat atctggggct 781 cctgctgcta tagctggatt tgcagcttta ctgcaaactg tgactggtgt gagcgctgtt 841 gctcaagtgg ggtatagatt ttttagtgac tgggatcaca aagtttctac tgttggttta 901 tatcaacaac caggaatggc tgtagatttg tataggccag atgattacta tgatatttta 961 tttcctggag tacaaacctt tgttcacagt gttcagtatc ttgaccccag acattggggt 1021 ccaacacttt ttaatgccat ttctcaagct ttttggcgtg taatacaaaa tgacattcct 1081 aggctcacct cacaggagct tgaaagaaga acccaaagat atttaaggga cagtttggca 1141 aggtttttag aggaaactac ttggacagta attaatgctc ctgttaattg gtataactct 1201 ttacaagatt actactctac tttgtctccc attaggccta caatggtgag acaagtagcc 1261 aacagggaag ggttgcaaat atcatttggg cacacctatg ataatattga tgaagcagac 1321 agtattcagc aagtaactga gaggtgggaa gctcaaagcc aaagtcctaa tgtgcagtca 1381 ggtgaattta ttgaaaaatt tgaggctcct ggtggtgcaa atcaaagaac tgctcctcag 1441 tggatgttgc ctttacttct aggcctgtac ggaagtgtta cttctgctct aaaagcttat 1501 gaagatggcc ccaacaaaaa gaaaaggaag ttgtccaggg gcagctccca aaaaaccaaa 1561 ggaaccagtg caagtgccaa agctcgtcat aaaaggagga atagaagttc taggagttaa 1621 aactggagta gacagcttca ctgaggtgga gtgcttttta aatcctcaaa tgggcaatcc 1681 tgatgaacat caaaaaggct taagtaaaag cttagcagct gaaaaacagt ttacagatga 1741 ctctccagac aaagaacaac tgccttgcta cagtgtggct agaattcctt tgcctaattt 1801 aaatgaggac ttaacctgtg gaaatatttt gatgtgggaa gctgttactg ttaaaactga 1861 ggttattggg gtaactgcta tgttaaactt gcattcaggg acacaaaaaa ctcatgaaaa 1921 tggtgctgga aaacccattc aagggtcaaa ttttcatttt tttgctgttg gtggggaacc 1981 tttggagctg cagggtgtgt tagcaaacta caggaccaaa tatcctgctc aaactgtaac 2041 cccaaaaaat gctacagttg acagtcagca gatgaacact gaccacaagg ctgttttgga 2101 taaggataat gcttatccag tggagtgctg ggttcctgat ccaagtaaaa atgaaaacac 2161 tagatatttt ggaacctaca caggtgggga aaatgtgcct cctgttttgc acattactaa 2221 cacagcaacc acagtgcttc ttgatgagca gggtgttggg cccttgtgca aagctgacag 2281 cttgtatgtt tctgctgttg acatttgtgg gctgtttacc aacacttctg gaacacagca 2341 gtggaaggga cttcccagat attttaaaat tacccttaga aagcggtctg tgaaaaaccc 2401 ctacccaatt tcctttttgt taagtgacct aattaacagg aggacacaga gggtggatgg 2461 gcagcctatg attggaatgt cctctcaagt agaggaggtt agggtttatg aggacacaga 2521 ggagcttcct ggggatccag acatgataag atacattgat gagtttggac aaaccacaac 2581 tagaatgcag tgaaaaaaat gctttatttg tgaaatttgt gatgctattg ctttatttgt 2641 aaccattata agctgcaata aacaagttaa caacaacaat tgcattcatt ttatgtttca 2701 ggttcagggg gaggtgtggg aggtttttta aagcaagtaa aacctctaca aatgtggtat 2761 ggctgattat gatcatgaac agactgtgag gactgagggg cctgaaatga gccttgggac 2821 tgtgaatcaa tgcctgtttc atgccctgag tcttccatgt tcttctcccc accatcttca 2881 tttttatcag cattttcctg gctgtcttca tcatcatcat cactgtttct tagccaatct 2941 aaaactccaa ttcccatagc cacattaaac ttcatttttt gatacactga caaactaaac 3001 tctttgtcca atctctcttt ccactccaca attctgctct gaatactttg agcaaactca 3061 gccacaggtc tgtaccaaat taacataaga agcaaagcaa tgccactttg aattattctc 3121 ttttctaaca aaaactcact gcgttccagg caatgcttta aataatcttt gggcctaaaa 3181 tctatttgtt ttacaaatct ggcctgcagt gttttaggca cactgtactc attcatggtg 3241 actattccag ggggaaatat ttgagttctt ttatttaggt gtttcttttc taagtttacc 3301 ttaacactgc catccaaata atcccttaaa ttgtccaggt tattaattcc ctgacctgaa 3361 ggcaaatctc tggactcccc tccagtgccc tttacatcct caaaaactac taaaaactgg 3421 tcaatagcta ctcctagctc aaagttcagc ctgtccaagg gcaaattaac atttaaagct 3481 ttccccccac ataattcaag caaagcagct gctaatgtag ttttaccact atcaattggt 3541 cctttaaaca gccagtatct ttttttagga atgttgtaca ccatgcattt taaaaagtca 3601 tacaccactg aatccatttt gggcaacaaa cagtgtagcc aagcaactcc agccatccat 3661 tcttctatgt cagcagagcc tgtagaacca aacattatat ccatcctatc caaaagatca 3721 ttaaatctgt ttgttaacat ttgttctcta gttaattgta ggctatcaac ccgcttttta 3781 gctaaaacag tatcaacagc ctgttggcat atggtttttt ggtttttgct gtcagcaaat 3841 atagcagcat ttgcataatg cttttcatgg tacttatagt ggctgggctg ttctttttta 3901 atacatttta aacacatttc aaaactgtac tgaaattcca agtacatccc aagcaataac 3961 aacacatcat cacattttgt ttccattgca tactctgtta caagcttcca ggacacttgt 4021 ttagtttcct ctgcttcttc tggattaaaa tcatgctcct ttaacccacc tggcaaactt 4081 tcctcaataa cagaaaatgg atctctagtc aaggcactat acatcaaata ttccttatta 4141 acccctttac aaattaaaaa gctaaaggta cacaattttt gagcatagtt attaatagca 4201 gacactctat gcctgtgtgg agtaagaaaa aacagtatgt tatgattata actgttatgc 4261 ctacttataa aggttacaga atatttttcc ataattttct tgtatagcag tgcagctttt 4321 tcctttgtgg tgtaaatagc aaagcaagca agagttctat tactaaacac agcatgactc 4381 aaaaaactta gcaattctga aggaaagtcc ttggggtctt ctacctttct cttctttttt 4441 ggaggagtag aatgttgaga gtcagcagta gcctcatcat cactagatgg catttcttct 4501 gagcaaaaca ggttttcctc attaaaggca ttccaccact gctcccattc atcagttcca 4561 taggttggaa tctaaaatac acaaacaatt agaatcagta gtttaacaca ttatacactt 4621 aaaaatttta tatttacctt agagctttaa atctctgtag gtagtttgtc caattatgtc 4681 acaccacaga agtaaggttc cttcacaaag atcaagtcca aaccacattc taaagcaatc 4741 gaagcagtag caatcaaccc acacaagtgg atctttcctg tataattttc tattttcatg 4801 cttcatcctc agtaagcaca gcaagcatat gcagttagca gacattttct ttgcacactc 4861 aggccattgt ttgcagtaca ttgcatcaac accaggattt aaggaagaag caaatacctc 4921 agttgcatcc cagaagcctc caaagtcagg ttgatgagca tattttactc catcttccat 4981 tttcttgtac agagtattca ttttcttcat tttttcttca tctcctcctt tatcaggatg 5041 aaactccttg cattttttta aatatgcctt tctcatcaga ggaatattcc cccaggcact 5101 cctttcaaga cctagaaggt ccattagctg caaagattcc tctctgttta aaactttatc 5161 catctttgca aagctttttg caaaagccta ggcctccaaa aaagcctcct cactacttct 5221 ggaatagctc agaggccgag gcg // LOCUS BOVPROB1 257 bp ss-mRNA MAM 03-AUG-1990 DEFINITION Cattle prolactin mRNA, 5' end. ACCESSION M36873 KEYWORDS prolactin. SEGMENT 1 of 2 SOURCE Cattle hypophysis, cDNA to mRNA. ORGANISM Bos taurus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Artiodactyla; Ruminantia; Pecora; Bovidae. REFERENCE 1 (bases 1 to 257) AUTHORS Rubtsov,P.M., Oganesyan,R.G., Gorbulev,V.G., Skryabin,K.G. and Baev,A.A. TITLE Genetic engineering of peptide hormones: II. Possible polymorphism of preprolactin in cattle. Data of molecular cloning JOURNAL Mol. Biol. 22, 117-121 (1988) STANDARD simple staff_review FEATURES from to/span description pept 51 > 257 prolactin precursor sigp 51 143 prolactin signal peptide matp 144 > 257 prolactin BASE COUNT 51 a 76 c 66 g 64 t ORIGIN 1 ataggacgag agcttcctgg tgaagtgtgt ttcttgaaat catcaccacc atggacagca 61 aaggttcgtc gcagaaagca gggtcccgcc tgctcctgct gctggtggtg tcaaatctac 121 tcttgtgcca gggtgtggtc tccacccccg tctgtcccaa tgggcctggc aactgccagg 181 tatcccttcg agacctgttt gaccgggcag tcatggtgtc ccactacatc catgacctct 241 cctcggaaat gttcaac // LOCUS BOVPROB2 551 bp ss-mRNA MAM 03-AUG-1990 DEFINITION Cattle prolactin mRNA, 3' end. ACCESSION M36874 KEYWORDS prolactin. SEGMENT 2 of 2 SOURCE Cattle hypophysis, cDNA to mRNA. ORGANISM Bos taurus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Artiodactyla; Ruminantia; Pecora; Bovidae. REFERENCE 1 (bases 1 to 551) AUTHORS Rubtsov,P.M., Oganesyan,R.G., Gorbulev,V.G., Skryabin,K.G. and Baev,A.A. TITLE Genetic engineering of peptide hormones: II. Possible polymorphism of preprolactin in cattle. Data of molecular cloning JOURNAL Mol. Biol. 22, 117-121 (1988) STANDARD simple staff_review FEATURES from to/span description pept < 1 405 prolactin BASE COUNT 153 a 135 c 119 g 144 t ORIGIN 1 ccggaagata aagaacaagc ccaacagacc catcatgaag tccttatgag cttgattctt 61 gggttgctgc gctcctggaa tgaccctctc tatcacctag tcaccgaggt gcggggtatg 121 aaaggagccc cagatgctat cctatcgagg gccatagaga ttgaggaaga aaacaaacga 181 cttctggaag gcatggagat gatatttggc caggttattc ctggagccaa agagactgag 241 ccctaccctg tgtggtcagg actcccgtcc ctgcaaacta aggatgaaga tgcacgttat 301 tctgcttttt ataacctgct ccactgcctg cgcagggatt caagcaagat tgacacttac 361 cttaagctcc tgaattgcag aatcatctac aacaacaact gctaagccca cattccatcc 421 tatccatttc tgagatggtt cttaatgatc cattccctgg caaacttctc tgagctttat 481 agctttgtaa tgcatgcttg gctctaatgg gtttcatctt aaataaaaac agactctgta 541 gcgatgtcaa a // LOCUS CHKBABA1 4051 bp ds-DNA VRT 03-AUG-1990 DEFINITION G.gallus B-creatine kinase (B-CK) gene, exons 1-4. ACCESSION M33711 J05496 KEYWORDS B-creatine kinase. SEGMENT 1 of 4 SOURCE G.gallus (strain White Leghorn) adult blood cell DNA, clones T[56,62]. ORGANISM Gallus gallus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Aves; Neornithes; Neognathae; Galliformes; Phasianidae. REFERENCE 1 (bases 1 to 4051) AUTHORS Wirz,T., Braendle,U., Soldati,T., Hossle,J.P. and Perriard,J.-C. TITLE A unique chicken B-creatine kinase gene gives rise to two B-creatine kinase isoproteins with distinct N-termini by alternative splicing JOURNAL J. Biol. Chem. 265, 11656-11666 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by T.Wirz, 13-APR-1990. FEATURES from to/span description pept 1201 1393 B-creatine kinase, exon 2b (EC 2.7.3.2) (first expressed exon) 2449 2603 B-creatine kinase, exon 3 3464 + 3596 B-creatine kinase, exon 4 pep$ 1800 1977 B-creatine kinase, exon 2a 2449 2603 B-creatine kinase, exon 3 3464 + 3596 B-creatine kinase, exon 4 pre-msg 411 > 4051 B-creatine kinase mRNA and introns IVS 1394 1789 B-creatine kinase intron A IVS 1978 2498 B-creatine kinase intron B (no splice consensus) IVS 2604 3463 B-creatine kinase intron C IVS 3597 > 4051 B-creatine kinase intron D signal 384 390 TATA box signal 251 256 GC box signal 310 315 GC box signal 328 332 CAT box signal 346 351 TATA box signal 358 362 GC box signal 384 389 TATA box BASE COUNT 943 a 985 c 1128 g 995 t ORIGIN 1 tttgcacacg accagaagag accttcagcg cgttacacaa taaacctatg ttcaaatctg 61 aggtaaaagg agccagctag caaggatgga agcaaaaagc cgcttgtgcc cgccttggtg 121 cctaacggct gccgtattcg ggctgccggt ggccgctggg acggatgtac gtgcacgtct 181 gcgttcacga ggtcacgacg gctaccgctc caggctcgag gcggacgggc cacgaaccca 241 aagccaccgg ccgcccaacc gccgcgcaaa atggcggcac gccttcactg gccaggagga 301 gccgctgctc cgcccctcac ggctcgacca atggaatgaa tgggctataa ataggagcca 361 atgaggcggc gggtcgcgct ctatataaaa ggggcgcggc ggctgactcg gggctccact 421 ccggcttggg cagcgggtgg gtgcggagcg gccgctgagg tccttacgtt ggctgtggag 481 ccctcggtaa gtcccggggc agccgagccg cctgcagctt aaagaaaaga agaaggaaac 541 gatttaactt ttataagaaa aatgtatttt tgatggtgtg ggttggagcc ggtccggccg 601 ggaaagcgtc ggggggagcg ggatgcggcg gggcggcgcg gggcaggtgc cgggtgcgta 661 ccgggtgcgg ccgcagcttg agccggagcc ccgctctgaa agcagctcac cccggtattt 721 agcagggtag ttacatacgg tcgccccagc cgcgtccccc gagggagccg tccgttgtgt 781 aacggctcgc tgagcactgc ccactcgccc cggtccccgg ctcgcgctgc cggtaccggc 841 accggccgcg gcgctcccca acggccgctc cgcggggctc cgccggcggc tgctccgccg 901 cctccccgcg ccctggccga gccccggagc gtccggaccg ggatgctgaa ggatggggcg 961 aggtttgcgg agctggagaa agtttccttt tttttttttc tcggtttttt tttctctttt 1021 tttttttttt tttgggccct ccctccttgc ctggccttga ggaaggtgtt gtggcaacga 1081 aggtgcggaa ggcacgggtg gaagcgaagc caaggtgacc ttgcttggcg ttggcgcctc 1141 ccccgggccc ctggtgatag gcttccctct cttccctcag tgactgctgt agggacagcc 1201 atgcccttct caaacagcca caacctcctg aagatgaagt actccgttga cgatgagtac 1261 cccgacctga gcgtgcacaa caatcacatg gccaaggtgc tgaccctgga cctgtacaag 1321 aagttgaggg acaggcagac ttccagtgga tttacgctgg atgatgtcat tcagaccggg 1381 gttgacaacc caggtaacgg cggcaccgtt aggactgcag ggctctctgc tctccctcct 1441 tcctggaagt gctccttgac agcaaagcaa accccgcacg ctccgaggga cggctgtgct 1501 cctccgtgca atctgacaca gcaattccca ggcgctccca ttatgtaact gctctgctaa 1561 atacttcaaa tcccaaccaa aaaaaaaaaa aaaaagcaaa cgctcaaatg tgttctcatt 1621 tttatgcaac cggggggaaa aaaattctct gagctgtgtg ggcttacccg gctgttctga 1681 gctcgtctgt tccaaagtgg ttatgtgatg ttctgtagca cttctatgaa agcaaggggg 1741 agcggatgcc gtatacgtgt taacattaca gtatttattt ttactctagc aatatcaaaa 1801 tggcccaact aaataatcag aggctgcctc ccgaggagga gtacccggac ctgagcaccc 1861 acaacaacca catggccaaa gtgctaaccc tggatttgta caagaaactg agagacagag 1921 tcacgcccag tggcttcacc ctggatgatg tcattcagac tggggttgat aatcccggta 1981 aaatgcgctg agatattctg tgggagccag catgaagtac ccggtgcttt tcagtatggt 2041 gacttaagct aagctcgtgg aggtggtgtt tatgagcaga ctatgaaaat gcggttctgt 2101 ctgtagcaga gcgaataaag cgatgcatgc ttgctcacag caaagcatgc taaagtgtag 2161 ttctatctac gggtttgcat tagctagtaa aattggatgt gctgtgcgaa attgccctaa 2221 accaaagctt gacaagagtg tggcttaacg gcaactgctg tggtttccca tagcactgca 2281 tttgactaac tggatgaaat gtaaaatgta gggtctccca actgctctta ggccagtgca 2341 tttctaagtg agtagatctg ctgtgaagaa acttgcctgg ttagtgctgt gctggagtac 2401 agctgagcat cacccaactt agctcttctg cctttgttac cactccaggc caccccttca 2461 taatgacagt aggatgcgta gctggcgacg aagaatccta tgaagtgttt aaggaactct 2521 ttgatccggt tattgaggac aggcacggtg gctacaaacc aactgatgag cacaagaccg 2581 acctgaatgc tgataacctg caggtatacg tttttgttga gctactgata cccactgatc 2641 tgaggcctgg ggaaagctgt gagcggctgc ctaaacggat tgtttgggct ctgtcctctc 2701 tctctcatct ctcgctgccc gctgcagtct tacttaagag cctgcagcag taggttgtca 2761 gtaactgggt agaaccaggc agctgacaag caggacttgc ttcatgtgct gctgcaatgt 2821 aagtcatacc aaacacgttg agcacgtatt tctttgggct gtgctagttg tgtgaaacaa 2881 cccagtccca gggctgttag aaatcttaac tcttcctact gtatgatgtt gtaggctaaa 2941 aatcagccag tggtctcgtg ttatagaata gccattgtta cggctggaag tagaggtttc 3001 tgcccatctt acagacttca aatgttattg cagtctctgg gaattccaac tgctatcctg 3061 gttggcctgc cataggacat ctcaaaaatg agaacgcttt gacaataatg agtaactctc 3121 atttgcagga gtaaacacct acctaagcaa ggatttgtct cgtgttgata cctctagatt 3181 gcagagtgaa aggagtaaca gaagcctggg atttgggtct gagatgggtg taaaactgat 3241 ggtatcccta tcttctagca catagttggt agtcgtagtc ttagcctcaa aacaaggagt 3301 ttgatactct catacttgtt ggaaagctgg aaaaaatggt cctgaagtac atcgtattac 3361 aggacaacta gtcagtctca aatatgaatg cacgtgacct gtgtgtattt agggtaagca 3421 tcttagcaac aaggcagtac tgactaattt tctgatcttg tagggtggtg acgacttgga 3481 tcctaattac gtgctgagct ctcgtgtgag aactggcagg agcatccgtg gattctgcct 3541 tccccctcat tgcagcaggg gagagagacg ggctattgaa aagctctctg ttgaaggtaa 3601 agtgttgaat caggcaatga ggggaaactg ctgacctcgt ctaggcaagg aggtgctctg 3661 tgcagttgga gaatggaggt ggttggcaga agctgccttc ctcaaggctt aacttgagtg 3721 tggaaccttc cctgaaaaag gcagcttgga ttatggttgc gtcacgttgc agtcggtact 3781 cagagcagga acacatgtca gggaagttgg actgtattaa gttgaccttt gctgatcttg 3841 gaagcttctt tctgtgtgtg gggaaatagg ttggttcatc acggctcaaa cctgagatgt 3901 aataggtatg gagcaaccct agggcttgag tcatggttgc agtattttgt ggcttctttt 3961 aattgacttt tattaatgtt cagcttgctt agaggtgtag tccagctctg agagagctct 4021 agttctcagt ggaactcctc agctgcagtt c // LOCUS CHKBABA2 444 bp ds-DNA VRT 03-AUG-1990 DEFINITION G.gallus B-creatine kinase (B-CK) gene, exon 5. ACCESSION M33712 J05496 KEYWORDS B-creatine kinase. SEGMENT 2 of 4 SOURCE G.gallus (strain White Leghorn) adult blood cell DNA. ORGANISM Gallus gallus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Aves; Neornithes; Neognathae; Galliformes; Phasianidae. REFERENCE 1 (bases 1 to 444) AUTHORS Wirz,T., Braendle,U., Soldati,T., Hossle,J.P. and Perriard,J.-C. TITLE A unique chicken B-creatine kinase gene gives rise to two B-creatine kinase isoproteins with distinct N-termini by alternative splicing JOURNAL J. Biol. Chem. 265, 11656-11666 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by T.Wirz, 13-APR-1990. FEATURES from to/span description pept + 45 + 216 B-creatine kinase (B-CK), exon 5 pep$ + 45 + 216 B-creatine kinase (B-CK), exon 5 pre-msg 1 > 444 B-creatine kinase mRNA and introns IVS < 1 44 B-creatine kinase intron E IVS 217 > 444 B-creatine kinase intron F BASE COUNT 108 a 95 c 120 g 121 t ORIGIN 1 tgtttatagt ctctgctccc atcctaaaag cacctctctt tcagctctgg gcagtctggg 61 aggtgatctc aaggggaagt actatgccct aaggaacatg actgatgcag agcagcagca 121 gctgattgat gatcacttct tgtttgacaa acctgtttct cctcttctgt tggcatctgg 181 gatggcacga gattggcctg atgccagggg tatctggtga gtatatgccc agcagggtga 241 gttgtgtgaa atccctgcaa caaaaagcct tgaaattttg gttgtggctg atataattgg 301 acaaaacagc acgttgtgaa cttacgctgc tgaatctaat gaggaacttc tgcgtggtat 361 cccctgggaa gcgatatctg aaagcagaac ttgccccaag tgctagtgtg cactgcattg 421 ggctggtagg aagaagccag cttg // LOCUS CHKBABA3 1586 bp ds-DNA VRT 03-AUG-1990 DEFINITION G.gallus B-creatine kinase (B-CK) gene, exons 6 and 7. ACCESSION M33713 J05496 KEYWORDS B-creatine kinase. SEGMENT 3 of 4 SOURCE G.gallus (strain White Leghorn) adult blood cell DNA, clone T8. ORGANISM Gallus gallus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Aves; Neornithes; Neognathae; Galliformes; Phasianidae. REFERENCE 1 (bases 1 to 1586) AUTHORS Wirz,T., Braendle,U., Soldati,T., Hossle,J.P. and Perriard,J.-C. TITLE A unique chicken B-creatine kinase gene gives rise to two B-creatine kinase isoproteins with distinct N-termini by alternative splicing JOURNAL J. Biol. Chem. 265, 11656-11666 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by T.Wirz, 13-APR-1990. FEATURES from to/span description pept + 308 431 B-creatine kinase, exon 6 1170 + 1359 B-creatine kinase, exon 7 pep$ + 308 431 B-creatine kinase, exon 6 1170 + 1359 B-creatine kinase, exon 7 pre-msg 1 > 1586 B-creatine kinase mRNA and introns IVS < 1 307 B-creatine kinase intron G IVS 432 1171 B-creatine kinase intron H (no splice consensus) IVS 1360 > 1586 B-creatine kinase intron I BASE COUNT 435 a 336 c 385 g 430 t ORIGIN 1 tggactgaac ataaatgagg tgagcaatga ctgaactctt aagttttgat gctgttaaca 61 ctggctaatg tcctgcaatt actttctgac atggatagat acaggccact tccaaatacc 121 tcttgactag agtagggtaa aagtacaagc ttaattttgt tagtgcttag atggagaaca 181 gggagataaa tttctttgga gaagcaatta aacgagcctc ttggaagcta cagcagtgag 241 taacttggtg ctactgaatc tgtcttgaaa acgtcaagtg agacaacagg tctgtcttgt 301 ctcctaggca caatgacaac aagaccttcc ttgtttggat caatgaggag gatcacctca 361 gggttatttc catgcagaaa ggtggcaaca tgaaggaagt gtttactcgc ttctgtacag 421 ggctaacaca ggtaaatcaa atactgcttg agctctgggt ctaaatatta atctggttgg 481 agctgaccat acaggatctt ccaggctggg agcatgtaca caggaggtga cagccttctc 541 ctttgttctg cagggtgctt ctttgtagtt tgggtttctg ggaacatcct gatgagatcc 601 tctgctttga gggctctgaa atgcctgtac tggcaaagct ttatggctgt gaaatctgga 661 ataaaacctc taatgtgaca cctttaccaa actgtactta aaggatctct gtggttctat 721 aaaatgtatg tagagaccct gggtaggagg aaaaaattgc ttaaatgtga atgggtgaca 781 ttggacagaa gtttgtaggc tgcatgcttt tctcagcttc tggtcttgat atcagcttgt 841 gctttaccag ccccttccct aatcctaaca gatgcaccag tttgacatgt actcaatcaa 901 gcttcccagt agatagcaac ttcagacttt acagtgtagc ttgaacttag gctgtctcgc 961 tgctgggagc aagagtaagg ccttcaaaag cccctcattg gtgaccctct tctacaaatg 1021 gagggggccc tatgtaactg acttaaataa tctgctcaaa ctgttgtggc tgttgctatg 1081 caactgaagc tgtaaatgag cacttagtgg atgacctaca tcgttcaggc agagaacaat 1141 ctctaaacat ccattctctc tatttataga tagaaactct cttcaagtcc aaaaactacg 1201 agttcatgtg gaacccacac ttgggctaca tcctgacctg cccatccaac cttggaacgg 1261 ggctccgtgc tggtgtgcac atcaagctac caaaccttgg gaagcatgag aagtttggag 1321 aagtcctcaa gaggcttcgg ctgcagaaac gaggcacagg tgagaggtgg tgatacagtg 1381 atgccccaag caggctgcct ctgaccagac agcagtgaga gcctggctgc agcgaagcta 1441 atcttggttc atgctgacag tgcctgacag accatccaga agggaccctt ctgaagcgtg 1501 gggggggaca accaagcatt aaaagtgtag cttagaagag gtggagatgc tggtactttg 1561 cttaagcctt attatatcca gcttct // LOCUS CHKBABA4 650 bp ds-DNA VRT 03-AUG-1990 DEFINITION G.gallus B-creatine kinase (B-CK) gene, exon 8. ACCESSION M33714 J05496 KEYWORDS B-creatine kinase. SEGMENT 4 of 4 SOURCE G.gallus (strain White Leghorn) adult blood cell DNA, clone T8. ORGANISM Gallus gallus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Aves; Neornithes; Neognathae; Galliformes; Phasianidae. REFERENCE 1 (bases 1 to 650) AUTHORS Wirz,T., Braendle,U., Soldati,T., Hossle,J.P. and Perriard,J.-C. TITLE A unique chicken B-creatine kinase gene gives rise to two B-creatine kinase isoproteins with distinct N-termini by alternative splicing JOURNAL J. Biol. Chem. 265, 11656-11666 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by T.Wirz, 13-APR-1990. FEATURES from to/span description pept + 49 227 B-creatine kinase, exon 8 pep$ + 49 227 B-creatine kinase, exon 8 pre-msg 1 > 396 B-creatine kinase mRNA and introns IVS < 1 48 B-creatine kinase intron J signal 391 396 poly-A signal BASE COUNT 161 a 129 c 161 g 199 t ORIGIN 1 aactggctca ggcagcggtc agcgtactaa ttaaatctct tcgtccaggt ggtgtggaca 61 cagctgcagt cggaggggtg tttgatgtat ccaatgctga tcgtcttggc ttctctgagg 121 tagagctggt gcagatggtg gtagatggtg tgaagctgct cattgaaatg gaaaaacgcc 181 ttgagaaagg ccagtccatt gatgacctca tgccagctca gaaataaagc actttattct 241 catgcttcct aacttattgg atgaataata aaatgtcact ccaatttcaa accccttggg 301 tcagagccca cttagttaca ctgtagagaa gtcttccatc catctgtgtt agagtttatt 361 ttttgatggc tgaaatgttg ttgaaaatga aataaactgt tgttttggcc tgacctgcct 421 tagatgtgtt aactgagtgt ctttgaggtc taagtgactt gctgttcctg gtaagctgaa 481 ctgattggtt ttaggtgcta gcacatcatc ctggctgtgc tgctgtagtc tgcttggtgg 541 gcttagaaat gtgcttatag cagtgtcaaa agcaagcctc taatgtgatt gcagctgccg 601 taaacctgag aactggtgct aaacatgagt cactttaaac ctgccctacc // LOCUS HUMSATIII1 132 bp ds-DNA PRI 03-AUG-1990 DEFINITION Human satellite III DNA fragment, 5' end. ACCESSION M25431 KEYWORDS satellite repetitive sequence. SEGMENT 1 of 3 SOURCE Human DNA, clone lambda-HS3. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 132) AUTHORS Cooke,H.J. and Hindley,J. TITLE Cloning of human satellite III DNA: Different components are on different chromosomes JOURNAL Nucleic Acids Res. 6, 3177-3197 (1979) STANDARD simple staff_review BASE COUNT 39 a 27 c 14 g 52 t ORIGIN 1 aattcatttg aagacaattc cattcaatac caattgatga tggttatttt tgattccatt 61 tgatgatgat tacattccat ttcatcataa ttccattcga ttccactcga gattccattc 121 gattccattc aa // LOCUS HUMSATIII2 75 bp ds-DNA PRI 03-AUG-1990 DEFINITION Human satellite III DNA fragment. ACCESSION M25432 KEYWORDS satellite repetitive sequence. SEGMENT 2 of 2 SOURCE Human satellite DNA, clone lambda-HS3. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 75) AUTHORS Cooke,H.J. and Hindley,J. TITLE Cloning of human satellite III DNA: Different components are on different chromosomes JOURNAL Nucleic Acids Res. 6, 3177-3197 (1979) STANDARD simple staff_review BASE COUNT 19 a 15 c 11 g 30 t ORIGIN 1 cgaatgaatg agtccatcca tttcaatttc atgataattc cattcgtttc aattcgatgg 61 tgtttccatt cgatt // LOCUS HUMSATIII3 93 bp ds-DNA PRI 03-AUG-1990 DEFINITION Human satellite III DNA fragment, 3' end. ACCESSION M25433 KEYWORDS satellite repetitive sequence. SEGMENT 3 of 3 SOURCE Human satellite DNA, clone lambda-HS3. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 93) AUTHORS Cooke,H.J. and Hindley,J. TITLE Cloning of human satellite III DNA: Different components are on different chromosomes JOURNAL Nucleic Acids Res. 6, 3177-3197 (1979) STANDARD simple staff_review BASE COUNT 21 a 20 c 16 g 36 t ORIGIN 1 ttcattcgat tcatttgatg atgattcatg cgcgattcat tagatgatga cccctttcat 61 ttccattcaa tggaggattc cattcggttc cat // LOCUS MS2AAR 218 bp ss-RNA PHG 03-AUG-1990 DEFINITION Bacteriophage MS2 A protein mRNA, 3' end, and coat protein mRNA, complete cds. ACCESSION M25187 KEYWORDS A protein; coat protein. SOURCE Bacteriophage MS2 RNA. ORGANISM Bacteriophage MS2 Viridae; ss-RNA nonenveloped viruses; Isometric ss-RNA viruses; Leviviridae. REFERENCE 1 (bases 1 to 218) AUTHORS Contreras,R., Ysebaert,M., Min Jou,W. and Fiers,W. TITLE Bacteriophage MS2 RNA: Nucleotide sequence of the end of the A protein gene and the intercistronic region JOURNAL Nature New Biol. 241, 99-101 (1973) STANDARD simple staff_review FEATURES from to/span description pept < 1 137 A protein (AA at 3) pept 161 > 218 coat protein BASE COUNT 54 a 58 c 52 g 54 t ORIGIN 1 ctaaggccca aatctcagcc atgcatcggg gagtacaatc cgtatggcca acaactggcg 61 cgtacgtaaa gtctcctttc tcgatggtcc ataccttaga tgcgttagca ttaatcaggc 121 aacggctctc tagatagagc cctcaaccgg agtttgaagc atggcttcta actttactca 181 gttcgttctc gtcgacaatg gcggaactgg cgacgtga // LOCUS MUSCMDH1 567 bp ds-DNA ROD 03-AUG-1990 DEFINITION Mouse cytosolic malate dehydrogenase (cMDHase) gene, exon 1. ACCESSION M36076 KEYWORDS cytosolic malate dehydrogenase. SEGMENT 1 of 9 SOURCE Mouse (strain C3H/He) male liver DNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 567) AUTHORS Setoyama,C., Joh,T., Tsuzuki,T. and Shimada,K. TITLE Structural organization of the mouse cytosolic malate dehydrogenase gene: Comparison with that of the mouse mitochondrial malate dehydrogenase gene JOURNAL J. Mol. Biol. 202, 355-364 (1988) STANDARD simple staff_review FEATURES from to/span description pept 540 + 542 cytosolic malate dehydrogenase, exon 1 (cMDHase) (EC 1.1.1.37) pre-msg 458 > 567 cytosolic malate dehydrogenase mRNA and introns (alt.) pre-msg 459 > 567 cytosolic malate dehydrogenase mRNA and introns (alt.) IVS 543 > 567 cytosolic malate dehydrogenase intron A BASE COUNT 109 a 156 c 169 g 133 t ORIGIN 1 gttcttccgc aagcgtcaat tcctcccgcc tctgagagag tttttaaaag gtttgtttcc 61 gggtcgagcg ggctttaagc aacggaaggt ctcttagccg cccttgttta gtcttgggga 121 ggatagattc tcgtggagcg acgtgtgtgt cgctcagggg tcggtttctc ctccctcgag 181 ttaacgcctc ctctcctgcc aattgctgag cgccatcagg caggcgcctc actcaaagca 241 ccaaccctct gctcacagac gcgctccaat caccgaggct cagcccggga ctactttgca 301 gcgaggcgcg taggaagaag gggtttgggg gaattgtagt ttagcactgt caggaatgca 361 cgaaggtggg cgccagaggt cgcggaagaa ctacacttcc cagaaagggc cgtgtctcca 421 gccgaagcct ttgcgggcca gccccggttc tctcccagag tctgttccgc tgtagaggtg 481 acctgactgc tggagactgc cttttgcagg tgcagagatc ggccttgcag tttgcaataa 541 tggtgaggtg ggctctggaa ctcacac // LOCUS MUSCMDH2 149 bp ds-DNA ROD 03-AUG-1990 DEFINITION Mouse cytosolic malate dehydrogenase (cMDHase) gene, exon 2. ACCESSION M36077 KEYWORDS cytosolic malate dehydrogenase. SEGMENT 2 of 9 SOURCE Mouse (strain C3H/He) male liver DNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 149) AUTHORS Setoyama,C., Joh,T., Tsuzuki,T. and Shimada,K. TITLE Structural organization of the mouse cytosolic malate dehydrogenase gene: Comparison with that of the mouse mitochondrial malate dehydrogenase gene JOURNAL J. Mol. Biol. 202, 355-364 (1988) STANDARD simple staff_review FEATURES from to/span description pept + 27 + 125 cytosolic malate dehydrogenase, exon 2 (cMDHase) pre-msg < 1 > 149 cytosolic malate dehydrogenase mRNA and introns IVS < 1 26 cytosolic malate dehydrogenase intron A IVS 126 > 149 cytosolic malate dehydrogenase intron B BASE COUNT 41 a 25 c 36 g 47 t ORIGIN About 4.8 kb after segment 1. 1 taaactagtg gtctttgtca ttacagtctg aaccaatcag agtccttgtg actggagcag 61 ctggtcaaat tgcatattca ctgttgtaca gtattggaaa tggatctgtc tttgggaaag 121 accaggtagg ggcatgttct tataaatac // LOCUS MUSCMDH3 147 bp ds-DNA ROD 03-AUG-1990 DEFINITION Mouse cytosolic malate dehydrogenase (cMDHase) gene, exon 3. ACCESSION M36078 KEYWORDS cytosolic malate dehydrogenase. SEGMENT 3 of 9 SOURCE Mouse (strain C3H/He) male liver DNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 147) AUTHORS Setoyama,C., Joh,T., Tsuzuki,T. and Shimada,K. TITLE Structural organization of the mouse cytosolic malate dehydrogenase gene: Comparison with that of the mouse mitochondrial malate dehydrogenase gene JOURNAL J. Mol. Biol. 202, 355-364 (1988) STANDARD simple staff_review FEATURES from to/span description pept + 26 + 122 cytosolic malate dehydrogenase exon 3 (cMDHase) pre-msg < 1 > 147 cytosolic malate dehydrogenase mRNA and introns IVS < 1 25 cytosolic malate dehydrogenase intron B IVS 123 > 147 cytosolic malate dehydrogenase intron C BASE COUNT 27 a 39 c 40 g 41 t ORIGIN About 0.8 kb after segment 2. 1 gcctgctgtc cttgctcttt ggcagcccat cattcttgtg ctgttggaca tcacccccat 61 gatgggtgtt ctggacggtg tcctgatgga actgcaagac tgtgcccttc cccttctgca 121 gggtgagttg gaagtcaaag aaaacag // LOCUS MUSCMDH4 226 bp ds-DNA ROD 03-AUG-1990 DEFINITION Mouse cytosolic malate dehydrogenase (cMDHase) gene, exon 4. ACCESSION M36079 KEYWORDS cytosolic malate dehydrogenase. SEGMENT 4 of 9 SOURCE Mouse (strain C3H/He) male liver DNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 226) AUTHORS Setoyama,C., Joh,T., Tsuzuki,T. and Shimada,K. TITLE Structural organization of the mouse cytosolic malate dehydrogenase gene: Comparison with that of the mouse mitochondrial malate dehydrogenase gene JOURNAL J. Mol. Biol. 202, 355-364 (1988) STANDARD simple staff_review FEATURES from to/span description pept + 26 + 201 cytosolic malate dehydrogenase, exon 4 pre-msg < 1 > 226 cytosolic malate dehydrogenase mRNA and introns IVS < 1 25 cytosolic malate dehydrogenase intron C IVS 202 > 226 cytosolic malate dehydrogenase intron D BASE COUNT 69 a 45 c 62 g 50 t ORIGIN About 1.6 kb after segment 3. 1 tgtgtgttgt ttgccatgtc catagatgtc attgcaacgg acaaagaaga gattgccttc 61 aaagacctgg atgtggctgt cctagtgggc tccatgccaa gaagggaagg catggagagg 121 aaggacctac tgaaagccaa tgtgaaaatc ttcaaatccc agggcacagc cttggagaaa 181 tacgccaaga aatcagttaa ggtgactcac acagatttca tggggt // LOCUS MUSCMDH5 173 bp ds-DNA ROD 03-AUG-1990 DEFINITION Mouse cytosolic malate dehydrogenase (cMDHase) gene, exon 5. ACCESSION M36080 KEYWORDS cytosolic malate dehydrogenase. SEGMENT 5 of 9 SOURCE Mouse (strain C3H/He) male liver DNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 173) AUTHORS Setoyama,C., Joh,T., Tsuzuki,T. and Shimada,K. TITLE Structural organization of the mouse cytosolic malate dehydrogenase gene: Comparison with that of the mouse mitochondrial malate dehydrogenase gene JOURNAL J. Mol. Biol. 202, 355-364 (1988) STANDARD simple staff_review FEATURES from to/span description pept + 26 + 148 cytosolic malate dehydrogenase, exon 5 pre-msg < 1 > 173 cytosolic malate dehydrogenase mRNA and introns IVS < 1 25 cytosolic malate dehydrogenase intron D IVS 149 > 173 cytosolic malate dehydrogenase intron E BASE COUNT 48 a 49 c 35 g 41 t ORIGIN About 0.9 kb after segment 4. 1 tctgctctgt gcctccacca tctaggtcat tgttgtggga aacccagcca atacgaactg 61 cctgacagcc tccaagtcag cgccatcgat ccccaaggag aatttcagtt gcctgactcg 121 cttggaccac aaccgagcaa aatctcaagt aagaaaaatg gcttttagat tta // LOCUS MUSCMDH6 227 bp ds-DNA ROD 03-AUG-1990 DEFINITION Mouse cytosolic malate dehydrogenase (cMDHase) gene, exon 6. ACCESSION M36081 KEYWORDS cytosolic malate dehydrogenase. SEGMENT 6 of 9 SOURCE Mouse (strain C3H/He) male liver DNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 227) AUTHORS Setoyama,C., Joh,T., Tsuzuki,T. and Shimada,K. TITLE Structural organization of the mouse cytosolic malate dehydrogenase gene: Comparison with that of the mouse mitochondrial malate dehydrogenase gene JOURNAL J. Mol. Biol. 202, 355-364 (1988) STANDARD simple staff_review FEATURES from to/span description pept + 26 + 202 cytosolic malate dehydrogenase, exon 6 pre-msg < 1 > 227 cytosolic malate dehydrogenase mRNA and introns IVS < 1 25 cytosolic malate dehydrogenase intron E IVS 203 > 227 cytosolic malate dehydrogenase intron F BASE COUNT 71 a 40 c 60 g 56 t ORIGIN About 3.4 kb after segment 5. 1 tgatatgatg ttttacatga actagattgc tcttaaactc ggtgtaaccg ctgatgatgt 61 aaagaatgtc attatctggg gaaatcattc atcgacccag tatccagatg tcaatcatgc 121 caaggtgaaa ctgcaaggaa aggaagtcgg tgtgtatgaa gccctgaaag acgacagctg 181 gctgaaggga gagttcatca cggtaagaag gatgtgaacc ctctgag // LOCUS MUSCMDH7 164 bp ds-DNA ROD 03-AUG-1990 DEFINITION Mouse cytosolic malate dehydrogenase (cMDHase) gene, exon 7. ACCESSION M36082 KEYWORDS cytosolic malate dehydrogenase. SEGMENT 7 of 9 SOURCE Mouse (strain C3H/He) male liver DNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 164) AUTHORS Setoyama,C., Joh,T., Tsuzuki,T. and Shimada,K. TITLE Structural organization of the mouse cytosolic malate dehydrogenase gene: Comparison with that of the mouse mitochondrial malate dehydrogenase gene JOURNAL J. Mol. Biol. 202, 355-364 (1988) STANDARD simple staff_review FEATURES from to/span description pept + 26 + 139 cytosolic malate dehydrogenase, exon 7 pre-msg < 1 > 164 cytosolic malate dehydrogenase mRNA and introns IVS < 1 25 cytosolic malate dehydrogenase intron F IVS 140 > 164 cytosolic malate dehydrogenase intron G BASE COUNT 34 a 46 c 44 g 40 t ORIGIN About 0.7 kb after segment 6. 1 actgtctctc tgttgtccca cccagactgt gcaacagcgt ggtgctgctg tcatcaaggc 61 tcggaagctg tccagtgcaa tgtctgctgc gaaagccatc gcagaccaca tcagagacat 121 ctggtttgga accccagagg tgagggttct catttgtact ggcc // LOCUS MUSCMDH8 131 bp ds-DNA ROD 03-AUG-1990 DEFINITION Mouse cytosolic malate dehydrogenase (cMDHase) gene, exon 8. ACCESSION M36083 KEYWORDS cytosolic malate dehydrogenase. SEGMENT 8 of 9 SOURCE Mouse (strain C3H/He) male liver DNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 131) AUTHORS Setoyama,C., Joh,T., Tsuzuki,T. and Shimada,K. TITLE Structural organization of the mouse cytosolic malate dehydrogenase gene: Comparison with that of the mouse mitochondrial malate dehydrogenase gene JOURNAL J. Mol. Biol. 202, 355-364 (1988) STANDARD simple staff_review FEATURES from to/span description pept + 26 + 115 cytosolic malate dehydrogenase, exon 8 pre-msg < 1 > 131 cytosolic malate dehydrogenase mRNA and introns IVS < 1 25 cytosolic malate dehydrogenase intron G IVS 116 > 131 cytosolic malate dehydrogenase intron H BASE COUNT 27 a 27 c 36 g 41 t ORIGIN About 0.3 kb after segment 7. 1 gctatgataa tgtaaacttt ttcagggaga gttcgtgtcg atgggtgtta tctctgatgg 61 caactcctat ggtgtccctg atgacctgct ctactcactc cctgtcgtga tcaaggtggg 121 tacatggaga g // LOCUS MUSCMDH9 528 bp ds-DNA ROD 03-AUG-1990 DEFINITION Mouse cytosolic malate dehydrogenase (cMDHase) gene, exon 9. ACCESSION M36084 KEYWORDS cytosolic malate dehydrogenase. SEGMENT 9 of 9 SOURCE Mouse (strain C3H/He) male liver DNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 528) AUTHORS Setoyama,C., Joh,T., Tsuzuki,T. and Shimada,K. TITLE Structural organization of the mouse cytosolic malate dehydrogenase gene: Comparison with that of the mouse mitochondrial malate dehydrogenase gene JOURNAL J. Mol. Biol. 202, 355-364 (1988) STANDARD simple staff_review FEATURES from to/span description pept + 26 151 cytosolic malate dehydrogenase, exon 9 (cMDHase) pre-msg < 1 > 353 cytosolic malate dehydrogenase mRNA and introns IVS < 1 25 cytosolic malate dehydrogenase intron H signal 329 334 poly-A signal signal 449 454 poly-A signal BASE COUNT 175 a 98 c 100 g 155 t ORIGIN About 1.4 kb after segment 8. 1 agctctcgcc cttgtcccct gacagaataa gacctggaag tttgttgaag gcctccccat 61 taatgacttc tcccgtgaaa agatggacct gacagcaaag gagctgaccg aggaaaagga 121 gaccgctttt gagtttctct cctctgcgtg actagacact cgttttgaca tcagcagaca 181 gccgaaggct gaggaatcaa aatgtcgtct ttgagcctag taccaaacag taataatgct 241 acattcaaat tgtgaacagc aaaatatttt aaatagtgtg tgctttatga tttgtgaaag 301 tctatcatgt tgttagtgct gcaatctaaa taaaagtata ttcaagtgaa aatctctcag 361 actctgtttc tactttatat ttagtatctt caggaaaaca agtttggccc aatagattat 421 aattttactt ttttaattga ctaaaagaaa taaagatgga aaatattatg aagtaaagca 481 ttagtctcta acataaacaa ggaagcccaa tcaatttcag agggatcc // LOCUS MUSMHHLABA 158 bp ds-DNA ROD 03-AUG-1990 DEFINITION Mouse MHC class I HLA-B7 gene, 5' flank. ACCESSION M35971 KEYWORDS cell surface antigen; cell surface glycoprotein; class I gene; integral membrane protein; major histocompatibility complex. SOURCE Mouse L cell line DNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 158) AUTHORS Schmidt,H., Gekeler,V., Haas,H., Engler-Blum,G., Steiert,I., Probst,H. and Mueller,C.A. TITLE Differential regulation of HLA class I genes by interferon JOURNAL Immunogenetics 31, 245-252 (1990) STANDARD simple staff_review BASE COUNT 21 a 53 c 43 g 41 t ORIGIN 1 ggtctcaggg tctcaggctc cgagggccgg gtctgcaatg gggaggcgca cgttggggat 61 tccccactcc cctgagtttc acttcttctc ccaacttgtg tcgggtcctt cttccaggat 121 actcgtgacg cgtccccact tcccactccc attgggta // LOCUS MUSMHHLABB 160 bp ds-DNA ROD 03-AUG-1990 DEFINITION Mouse MHC class I HLA-Bw64 gene, 5' flank. ACCESSION M35972 KEYWORDS cell surface antigen; cell surface glycoprotein; class I gene; integral membrane protein; major histocompatibility complex. SOURCE Mouse L cell line DNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 160) AUTHORS Schmidt,H., Gekeler,V., Haas,H., Engler-Blum,G., Steiert,I., Probst,H. and Mueller,C.A. TITLE Differential regulation of HLA class I genes by interferon JOURNAL Immunogenetics 31, 245-252 (1990) STANDARD simple staff_review BASE COUNT 21 a 55 c 43 g 41 t ORIGIN 1 ggctctcagg gtctcaggct ccgagggccg cgtctgcaat ggggaggcgc agcgttgggg 61 attccccact cccctgagtt tcacttcttc tcccaacttg tgtcgggtcc ttcttccagg 121 atactcgtga cgcatcccca cttcccactc ccattgggtg // LOCUS MUSMHHLABC 159 bp ds-DNA ROD 03-AUG-1990 DEFINITION Mouse MHC class I HLA-B38 gene, 5' flank. ACCESSION M35973 KEYWORDS cell surface antigen; cell surface glycoprotein; class I gene; integral membrane protein; major histocompatibility complex. SOURCE Mouse L cell line DNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 159) AUTHORS Schmidt,H., Gekeler,V., Haas,H., Engler-Blum,G., Steiert,I., Probst,H. and Mueller,C.A. TITLE Differential regulation of HLA class I genes by interferon JOURNAL Immunogenetics 31, 245-252 (1990) STANDARD simple staff_review BASE COUNT 21 a 52 c 44 g 42 t ORIGIN 1 ggctctcagg gtctcaggct cgagggccgc gtctgtaatg gggaggcgga gcgttgggga 61 ttccccactc ccctgagttt cacttcttct cccaacttgt gtcgggtcct tcttccagga 121 tactcgtgac gcatccccac ttcccactcc cattgggtg // LOCUS MUSMHHLABD 159 bp ds-DNA ROD 03-AUG-1990 DEFINITION Mouse MHC class I HLA-B39 gene, 5' flank. ACCESSION M35974 KEYWORDS cell surface antigen; cell surface glycoprotein; class I gene; integral membrane protein; major histocompatibility complex. SOURCE Mouse L cell line DNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 159) AUTHORS Schmidt,H., Gekeler,V., Haas,H., Engler-Blum,G., Steiert,I., Probst,H. and Mueller,C.A. TITLE Differential regulation of HLA class I genes by interferon JOURNAL Immunogenetics 31, 245-252 (1990) STANDARD simple staff_review BASE COUNT 21 a 52 c 44 g 42 t ORIGIN 1 ggctctcagg gtctcaggct cgagggccgc gtctgtaatg gggaggcgga gcgttgggga 61 ttccccactc ccctgagttt cacttcttct cccaacttgt gtcgggtcct tcttccagga 121 tactcgtgac gcatccccac ttcccactcc cattgggtg // LOCUS MUSTCEAEA 396 bp ss-mRNA ROD 03-AUG-1990 DEFINITION Mouse T-cell receptor alpha-chain rearranged mRNA V-D-J region. ACCESSION M35486 KEYWORDS T-cell receptor; constant region; diversity exon; joining exon; processed gene; variable region. SOURCE Mouse (strain SJL/J) spleen, cDNA to mRNA, hybridoma PL23.1. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 396) AUTHORS Hood,L., Kumar,V., Osman,G., Beall,S.S., Gomez,C., Funkhouser,W., Kono,D.H., Nickerson,D., Zaller,D.M. and Urban,J.L. TITLE Autoimmune disease and T-cell immunologic recognition JOURNAL Cold Spring Harb. Symp. Quant. Biol. 54, 859-874 (1989) STANDARD simple staff_review FEATURES from to/span description pept < 1 > 396 T-cell receptor alpha-chain precursor (AA at 1) sigp < 1 33 T-cell receptor alpha-chain signal peptide matp 34 > 396 T-cell receptor alpha-chain recomb 327 328 V-region end/J-region start recomb 385 386 J-region end/C-region start BASE COUNT 113 a 101 c 95 g 87 t ORIGIN 1 ctaggccttc acctagctgg ggtgagtggc cagcaggaga aacgtgacca gcagcaggtg 61 agacaaagtc cccaatctct gacagtctgg gaaggagaga ccgcaattct gaactgcagt 121 tatgagaaca gtgcttttga ctacttccca tggtaccagc agttccctgg ggaaggtccc 181 gctctcctga tatccatact ttcagtgtcc gataaaaagg aagatggacg attcacaatc 241 ttcttcaata aaagggagaa aaagctctcc ttgcacattg cagactctca gcctggagac 301 tcagccacct acttctgtgc agcaagtgca aattctggga cttaccagag gtttggaact 361 gggacaaaac tccaagtcgt tccaaacatc cagaac // LOCUS MUSTCEAEB 384 bp ss-mRNA ROD 03-AUG-1990 DEFINITION Mouse T-cell receptor alpha-chain rearranged mRNA V-D-J region. ACCESSION M35487 KEYWORDS T-cell receptor; constant region; diversity exon; joining exon; processed gene; variable region. SOURCE Mouse (strain SJL/J) spleen, cDNA to mRNA, hybridoma PL51.1.1. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 384) AUTHORS Hood,L., Kumar,V., Osman,G., Beall,S.S., Gomez,C., Funkhouser,W., Kono,D.H., Nickerson,D., Zaller,D.M. and Urban,J.L. TITLE Autoimmune disease and T-cell immunologic recognition JOURNAL Cold Spring Harb. Symp. Quant. Biol. 54, 859-874 (1989) STANDARD simple staff_review FEATURES from to/span description pept < 1 > 384 T-cell receptor alpha-chain precursor (AA at 1) sigp < 1 39 T-cell receptor alpha-chain signal peptide matp 40 > 384 T-cell receptor alpha-chain recomb 315 316 V-region end/J-region start recomb 373 374 J-region end/C-region start BASE COUNT 111 a 96 c 93 g 84 t ORIGIN 1 ctagctgggg tgagtggcca gcaggagaaa cgtgaccagc agcaggtgag acaaagtccc 61 caatctctga cagtctggga aggagagacc gcaattctga actgcagtta tgagaacagt 121 gcttttgact acttcccatg gtaccagcag ttccctgggg aaggtcccgc tctcctgata 181 tccatacttt cagtgtccga taaaaaggaa gatggacgat tcacaatctt cttcaataaa 241 agggagaaaa agctctcctt gcacattgca gactctcagc ctggagactc agccacctac 301 ttctgtgcag caagtgcaaa ttctgggact taccagaggt ttggaactgg gacaaaactc 361 caagtcgttc caaacatcca gaac // LOCUS MUSTCEAEC 388 bp ss-mRNA ROD 03-AUG-1990 DEFINITION Mouse T-cell receptor alpha-chain rearranged mRNA V-D-J region. ACCESSION M35488 KEYWORDS T-cell receptor; constant region; diversity exon; joining exon; processed gene; variable region. SOURCE Mouse (strain SJL/J) spleen, cDNA to mRNA, hybridoma PL83.12. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 388) AUTHORS Hood,L., Kumar,V., Osman,G., Beall,S.S., Gomez,C., Funkhouser,W., Kono,D.H., Nickerson,D., Zaller,D.M. and Urban,J.L. TITLE Autoimmune disease and T-cell immunologic recognition JOURNAL Cold Spring Harb. Symp. Quant. Biol. 54, 859-874 (1989) STANDARD simple staff_review FEATURES from to/span description pept < 1 > 388 T-cell receptor alpha-chain precursor (AA at 2) sigp < 1 43 T-cell receptor alpha-chain signal peptide matp 44 > 388 T-cell receptor alpha-chain recomb 319 320 V-region end/J-region start recomb 377 378 J-region end/C-region start BASE COUNT 112 a 98 c 93 g 85 t ORIGIN 1 tcacctagct ggggtgagtg gccagcagga gaaacgtgac cagcagcagg tgagacaaag 61 tccccaatct ctgacagtct gggaaggaga gaccgcaatt ctgaactgca gttatgagaa 121 cagtgctttt gactacttcc catggtacca gcagttccct ggggaaggtc ccgctctcct 181 gatatccata ctttcagtgt ccgataaaaa ggaagatgga cgattcacaa tcttcttcaa 241 taaaagggag aaaaagctct ccttgcacat tgcagactct cagcctggag actcagccac 301 ctacttctgt gcagcaagtg caaattctgg gacttaccag aggtttggaa ctgggacaaa 361 actccaagtc gttccaaaca tccagaac // LOCUS MUSTCEAED 359 bp ss-mRNA ROD 03-AUG-1990 DEFINITION Mouse T-cell receptor alpha-chain rearranged mRNA V-D-J region. ACCESSION M35489 KEYWORDS T-cell receptor; constant region; diversity exon; joining exon; processed gene; variable region. SOURCE Mouse (strain SJL/J) spleen, cDNA to mRNA, hybridoma PL127.6. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 359) AUTHORS Hood,L., Kumar,V., Osman,G., Beall,S.S., Gomez,C., Funkhouser,W., Kono,D.H., Nickerson,D., Zaller,D.M. and Urban,J.L. TITLE Autoimmune disease and T-cell immunologic recognition JOURNAL Cold Spring Harb. Symp. Quant. Biol. 54, 859-874 (1989) STANDARD simple staff_review FEATURES from to/span description pept < 1 > 359 T-cell receptor alpha-chain precursor (AA at 3) sigp < 1 14 T-cell receptor alpha-chain signal peptide matp 15 > 359 T-cell receptor alpha-chain recomb 290 291 V-region end/J-region start recomb 348 349 J-region end/C-region start BASE COUNT 107 a 91 c 81 g 80 t ORIGIN 1 agaaacgtga ccagcagcag gtgagacaaa gtccccaatc tctgacagtc tgggaaggag 61 agaccgcaat tctgaactgc agttatgaga acagtgcttt tgactacttc ccatggtacc 121 agcagttccc tggggaaggt cccgctctcc tgatatccat actttcagtg tccgataaaa 181 aggaagatgg acgattcaca atcttcttca ataaaaggga gaaaaagctc tccttgcaca 241 ttgcagactc tcagcctgga gactcagcca cctacttctg tgcagcaagt gcaaattctg 301 ggacttacca gaggtttgga actgggacaa aactccaagt cgttccaaac atccagaac // LOCUS MUSTCEAEE 396 bp ss-mRNA ROD 03-AUG-1990 DEFINITION Mouse T-cell receptor alpha-chain rearranged mRNA V-D-J region. ACCESSION M35490 KEYWORDS T-cell receptor; constant region; diversity exon; joining exon; processed gene; variable region. SOURCE Mouse (strain SJL/J) spleen, cDNA to mRNA, hybridoma PL172.10. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 396) AUTHORS Hood,L., Kumar,V., Osman,G., Beall,S.S., Gomez,C., Funkhouser,W., Kono,D.H., Nickerson,D., Zaller,D.M. and Urban,J.L. TITLE Autoimmune disease and T-cell immunologic recognition JOURNAL Cold Spring Harb. Symp. Quant. Biol. 54, 859-874 (1989) STANDARD simple staff_review FEATURES from to/span description pept < 1 > 396 T-cell receptor alpha-chain precursor (AA at 1) sigp < 1 51 T-cell receptor alpha-chain signal peptide matp 52 > 396 T-cell receptor alpha-chain recomb 327 328 V-region end/J-region start recomb 385 386 J-region end/C-region start BASE COUNT 113 a 101 c 95 g 87 t ORIGIN 1 ctaggccttc acctagctgg ggtgagtggc cagcaggaga aacgtgacca gcagcaggtg 61 agacaaagtc cccaatctct gacagtctgg gaaggagaga ccgcaattct gaactgcagt 121 tatgagaaca gtgcttttga ctacttccca tggtaccagc agttccctgg ggaaggtccc 181 gctctcctga tatccatact ttcagtgtcc gataaaaagg aagatggacg attcacaatc 241 ttcttcaata aaagggagaa aaagctctcc ttgcacattg cagactctca gcctggagac 301 tcagccacct acttctgtgc agcaagtgca aattctggga cttaccagag gtttggaact 361 gggacaaaac tccaagtcgt tccaaacatc cagaac // LOCUS MUSTCEAEF 390 bp ss-mRNA ROD 03-AUG-1990 DEFINITION Mouse T-cell receptor alpha-chain rearranged mRNA V-D-J region. ACCESSION M35491 KEYWORDS T-cell receptor; constant region; diversity exon; joining exon; processed gene; variable region. SOURCE Mouse (strain SJL/J) spleen, cDNA to mRNA, hybridoma PL183.2. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 390) AUTHORS Hood,L., Kumar,V., Osman,G., Beall,S.S., Gomez,C., Funkhouser,W., Kono,D.H., Nickerson,D., Zaller,D.M. and Urban,J.L. TITLE Autoimmune disease and T-cell immunologic recognition JOURNAL Cold Spring Harb. Symp. Quant. Biol. 54, 859-874 (1989) STANDARD simple staff_review FEATURES from to/span description pept < 1 > 390 T-cell receptor alpha-chain precursor (AA at 1) sigp < 1 45 T-cell receptor alpha-chain signal peptide matp 46 > 390 T-cell receptor alpha-chain recomb 321 322 V-region end/J-region start recomb 379 380 J-region end/C-region start BASE COUNT 112 a 99 c 93 g 86 t ORIGIN 1 cttcacctag ctggggtgag tggccagcag gagaaacgtg accagcagca ggtgagacaa 61 agtccccaat ctctgacagt ctgggaagga gagaccgcaa ttctgaactg cagttatgag 121 aacagtgctt ttgactactt cccatggtac cagcagttcc ctggggaagg tcccgctctc 181 ctgatatcca tactttcagt gtccgataaa aaggaagatg gacgattcac aatcttcttc 241 aataaaaggg agaaaaagct ctccttgcac attgcagact ctcagcctgg agactcagcc 301 acctacttct gtgcagcaag tgcaaattct gggacttacc agaggtttgg aactgggaca 361 aaactccaag tcgttccaaa catccagaac // LOCUS MUSTCEAEG 395 bp ss-mRNA ROD 03-AUG-1990 DEFINITION Mouse T-cell receptor alpha-chain rearranged mRNA V-D-J region. ACCESSION M35492 KEYWORDS T-cell receptor; constant region; diversity exon; joining exon; processed gene; variable region. SOURCE Mouse (strain SJL/J) spleen, cDNA to mRNA, hybridoma PL214.12. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 395) AUTHORS Hood,L., Kumar,V., Osman,G., Beall,S.S., Gomez,C., Funkhouser,W., Kono,D.H., Nickerson,D., Zaller,D.M. and Urban,J.L. TITLE Autoimmune disease and T-cell immunologic recognition JOURNAL Cold Spring Harb. Symp. Quant. Biol. 54, 859-874 (1989) STANDARD simple staff_review FEATURES from to/span description pept < 1 > 395 T-cell receptor alpha-chain precursor (AA at 3) sigp < 1 50 T-cell receptor alpha-chain signal peptide matp 51 > 395 T-cell receptor alpha-chain recomb 326 327 V-region end/J-region start recomb 384 385 J-region end/C-region start BASE COUNT 108 a 101 c 98 g 88 t ORIGIN 1 ctaggccttc acatgattgt gatgctcctc atattcacaa gggcccatgg agactcagtg 61 actcagacgg aaggtcaagt ggccctctca gaagaggact ttcttacgat acactgcaac 121 tactcagcct agggtaccca gctctgttct ggtacgagca gtatcccggg gaaggtccac 181 agttcctctt tagagcctca agggacaaag agaaaggaag cagcagaggt tttgaagcta 241 catatgataa agggaccacc tccttccact tgcggaaagc ctcagtgcaa gagtcagact 301 cggctgtgta ctactgtgct ctgggtccaa attctgggac ttaccagagg tttggaactg 361 ggacaaaact ccaagtcgtt ccaaacatcc agaac // LOCUS RATIDA 198 bp ds-DNA ROD 03-AUG-1990 DEFINITION Rat ID-A element in the S-100 beta-subunit gene. ACCESSION M35983 KEYWORDS . SOURCE Rat brain DNA. ORGANISM Rattus norvegicus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 198) AUTHORS Usui,H., Kuwano,R., Maeda,T., Araki,K., Sakimura,K., Kushiya,E. and Takahashi,Y. TITLE ID sequences in the genes of three brain-specific proteins JOURNAL Biochem. Int. 15, 809-816 (1987) STANDARD simple staff_review FEATURES from to/span description rpt 5 19 direct repeat rpt 184 198 direct repeat BASE COUNT 94 a 26 c 58 g 20 t ORIGIN 1 agctaaaaac gaactccgtg gggttgggga tttagctcag tggtagagcg cttgcctagc 61 aagagcaagg ccctgggttc ggtccccagc tccgaaaaaa agaaaaaaag aaaaaagaaa 121 aagaagaaga agaagaagaa gaagaagaag aagaagaaga agaagaagaa gaagaagaag 181 aagaaaaaag aaacgcgt // LOCUS RATIDB 141 bp ds-DNA ROD 03-AUG-1990 DEFINITION Rat ID-B element in the S-100 beta-subunit gene. ACCESSION M35984 KEYWORDS . SOURCE Rat brain DNA. ORGANISM Rattus norvegicus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 141) AUTHORS Usui,H., Kuwano,R., Maeda,T., Araki,K., Sakimura,K., Kushiya,E. and Takahashi,Y. TITLE ID sequences in the genes of three brain-specific proteins JOURNAL Biochem. Int. 15, 809-816 (1987) STANDARD simple staff_review FEATURES from to/span description rpt 5 19 direct repeat rpt 121 135 direct repeat BASE COUNT 52 a 26 c 43 g 20 t ORIGIN 1 gtccaaagaa tggggcagag aggctgggga tttagctcag tggtagagcg cttacctagg 61 aagcgacagg ccctgggttc ggtccccagc tccgaaagaa agaaccaaaa aaaaaaaaaa 121 aaagaatggg gcagaataga t // LOCUS RATIDC 141 bp ds-DNA ROD 03-AUG-1990 DEFINITION Rat ID-C element in the S-100 beta-subunit gene. ACCESSION M35985 KEYWORDS . SOURCE Rat brain DNA. ORGANISM Rattus norvegicus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 141) AUTHORS Usui,H., Kuwano,R., Maeda,T., Araki,K., Sakimura,K., Kushiya,E. and Takahashi,Y. TITLE ID sequences in the genes of three brain-specific proteins JOURNAL Biochem. Int. 15, 809-816 (1987) STANDARD simple staff_review FEATURES from to/span description rpt 7 19 direct repeat rpt 113 125 direct repeat BASE COUNT 39 a 32 c 48 g 22 t ORIGIN 1 cccttagagc ccaagggcga gggttgggga tttagctcag tggtagagcg cttgcctggc 61 aagtgcaagg ccctgggttc ggtccccagc tccggaaaaa aaaaaaaaaa aggagcccaa 121 gggtgggtca aaggcagcta g // LOCUS RSFGCA1 71 bp ds-DNA BCT 03-AUG-1990 DEFINITION Plasmid RSF1010 transposon Tn951 5' inverted terminal repeat. ACCESSION M25019 KEYWORDS inverted terminal repeat. SEGMENT 1 of 3 SOURCE Plasmid RSF1010 DNA. ORGANISM Plasmid RSF1010 Prokaryota; Bacteria. REFERENCE 1 (bases 1 to 71) AUTHORS Cornelis,G., Sommer,H. and Saedler,H. TITLE Transposon Tn951 (TnLac) is defective and related to Tn3 JOURNAL Mol. Gen. Genet. 184, 241-248 (1981) STANDARD simple staff_review FEATURES from to/span description rpt 16 56 5' inverted terminal repeat rpt 11 15 5' insertion target sequence BASE COUNT 23 a 13 c 19 g 16 t ORIGIN 1 ttgtcaaact gactaggggt ctgacgctca gtggaacgaa aactcacgtt aagcaattgt 61 ggctgagaaa a // LOCUS RSFGCA2 109 bp ds-DNA BCT 03-AUG-1990 DEFINITION Plasmid RSF1010 transposon Tn951 DNA fragment. ACCESSION M25020 KEYWORDS inverted terminal repeat. SEGMENT 2 of 3 SOURCE Plasmid RSF1010 DNA. ORGANISM Plasmid RSF1010 Prokaryota; Bacteria. REFERENCE 1 (bases 1 to 109) AUTHORS Cornelis,G., Sommer,H. and Saedler,H. TITLE Transposon Tn951 (TnLac) is defective and related to Tn3 JOURNAL Mol. Gen. Genet. 184, 241-248 (1981) STANDARD simple staff_review BASE COUNT 29 a 27 c 23 g 30 t ORIGIN 1 gaatcggcat gagtgcaacg gcataagcct gcttaaaaaa caggcttaac gtgggatatt 61 ttccgttttc caagcggacc cctatcaata tgctcggcca ttattcctt // LOCUS RSFGCA3 105 bp ds-DNA BCT 03-AUG-1990 DEFINITION Plasmid RSF1010 transposon Tn951 3' inverted terminal repeat. ACCESSION M25021 KEYWORDS inverted terminal repeat. SEGMENT 3 of 3 SOURCE Plasmid RSF1010 DNA. ORGANISM Plasmid RSF1010 Prokaryota; Bacteria. REFERENCE 1 (bases 1 to 105) AUTHORS Cornelis,G., Sommer,H. and Saedler,H. TITLE Transposon Tn951 (TnLac) is defective and related to Tn3 JOURNAL Mol. Gen. Genet. 184, 241-248 (1981) STANDARD simple staff_review FEATURES from to/span description rpt 50 90 3' inverted terminal repeat rpt 91 95 3' insertion target sequence BASE COUNT 31 a 27 c 26 g 21 t ORIGIN 1 accaaaggac atctgagacc attaaaagag gcgtcagagg tagaaaacgt tgcttaacgt 61 gagttttcgt tccactgagc gtcagacccc gactacgtcc acgcg // LOCUS SYNPROA 808 bp ss-mRNA SYN 03-AUG-1990 DEFINITION Manipulation-derived rearrangement of cattle prolactin mRNA. ACCESSION M35587 KEYWORDS prolactin. SOURCE Synthetic, cDNA to mRNA. ORGANISM Artificial gene Artificial sequences; Genes. REFERENCE 1 (bases 1 to 808) AUTHORS Rubtsov,P.M., Oganesyan,R.G., Gorbulev,V.G., Skryabin,K.G. and Baev,A.A. TITLE Genetic engineering of peptide hormones: II. Possible polymorphism of preprolactin in cattle. Data of molecular cloning JOURNAL Mol. Biol. 22, 117-121 (1988) STANDARD simple staff_review BASE COUNT 217 a 201 c 195 g 195 t ORIGIN 1 ttgaacattt ccgaggagag gtcatggatg tagtgggaca ccatgactgc ccggtcaaac 61 aggtctcgaa gggatacctg gcagttgcca ggcccattgg gacagacggg ggtggagacc 121 acaccctggc acaagagtag atttgacacc accagcagca ggagcaggcg ggaccctgct 181 ttctgcgacg aacctttgct gtccatggtg gtgatgattt caagaaacac acttcaccag 241 gaagctctcg tcctatcccg gaagataaag aacaagccca acagacccat catgaagtcc 301 ttatgagctt gattcttggg ttgctgcgct cctggaatga ccctctctat cacctagtca 361 ccgaggtgcg gggtatgaaa ggagccccag atgctatcct atcgagggcc atagagattg 421 aggaagaaaa caaacgactt ctggaaggca tggagatgat atttggccag gttattcctg 481 gagccaaaga gactgagccc taccctgtgt ggtcaggact cccgtccctg caaactaagg 541 atgaagatgc acgttattct gctttttata acctgctcca ctgcctgcgc agggattcaa 601 gcaagattga cacttacctt aagctcctga attgcagaat catctacaac aacaactgct 661 aagcccacat tccatcctat ccatttctga gatggttctt aatgatccat tccctggcaa 721 acttctctga ggtttatagc tttgtaatgc atgcttggct ctaatgggtt tcatcttaaa 781 taaaaacaga ctctgtagcg atgtcaaa // LOCUS FLANSHB 856 bp ss-RNA VRL 03-AUG-1990 DEFINITION Influenza A/Houston/24269/85 (H3N2) nonstructural protein RNA (seg 8). ACCESSION M17699 KEYWORDS nonstructural protein. SOURCE Influenza A/Houston/24269/85 RNA. ORGANISM Influenza virus type A Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses; Orthomyxoviridae; Influenzavirus; Influenza A viruses. REFERENCE 1 (bases 1 to 856) AUTHORS Buonagurio,D.A., Nakada,S., Parvin,J.D., Krystal,M., Palese,P. and Fitch,W.M. TITLE Evolution of human influenza A viruses over 50 years: Rapid, uniform rate of change in NS gene JOURNAL Science 232, 980-982 (1986) STANDARD full staff_review COMMENT Computer-readable sequence for [1] kindly provided by P.Palese, 12-OCT-1987. FEATURES from to/span description BASE COUNT 280 a 173 c 198 g 205 t ORIGIN Unreported. 1 caacactgtg tcaatcttcc aggtagattg ctttctttgg catgtccaca aacaagttgt 61 agaccaagaa ctgagtgatg ccccattcct tgatcggctt cgccgagatc agaggtccct 121 aaggggaaga ggcagcaccc tcggtctaga catcaaagca gccacccatg ttggaaagca 181 gatagtagaa aagattctga aggaagaatc tgatgaggca cttaaaatga ccatggcctc 241 cacacctgct tcgcgataca taactgacat gactattgag gaattgtcaa gaaactggtt 301 catgctaatg cccaagcaga aagtggaagg acctctttgc atcagaatgg accaggcaat 361 catggagaaa aacatcatgt tgaaagcgaa tttcagtgtg atctttgacc gactagagac 421 cctagtatta ctaagggctt tcaccgaaga gggagcaatt gttggcgaaa tctcaccatt 481 gccttctttt ccaggacata ctattgagga tgtcaaaaat gcaattgggg tcctcatcgg 541 aggacttgaa tggaatgata acacagttcg agtctctaaa actctacaga gattcgcttg 601 gggaagcagt aatgaggatg ggggacctcc acttactcca aaacagaaac ggaaaatggc 661 gagaacagct aggtcaaaag tttgaagaga taagatggct gattgaagaa gtgagacaca 721 gactaaaaac aactgagaat agttttgagc aaataacatt catgcaagcc ttacagctgc 781 tgtttgaagt ggaacaggag ataagaactt tctcatttca gcttatttaa tgataaaaaa 841 cacccttgtt tctact // LOCUS VACSPI1A 3043 bp ds-DNA VRL 03-AUG-1990 DEFINITION Vaccinia virus serine protease inhibitor superfamily gene SPI-1. ACCESSION M24217 KEYWORDS serine protease inhibitor. SOURCE Vaccinia virus DNA. ORGANISM Vaccinia virus Viridae; ds-DNA enveloped viruses; Poxvirinae; Orthopoxvirus. REFERENCE 1 (bases 1 to 3043) AUTHORS Kotwal,G.J. and Moss,B. TITLE Vaccinia Virus Encodes Two Proteins That Are Structurally Related to Members of the Plasma Serine Protease Inhibitor Superfamily JOURNAL J. Virol. 63, 600-606 (1989) STANDARD full staff_entry COMMENT Draft entry and computer readable copy of sequence for [1] kindly submitted by B.Moss, 15-FEB-1990. FEATURES from to/span description pept < 1 243 unspecified ORF pept 255 659 16 kDa ORF pept 927 1988 SPI-1 (40 kDa ORF) pept 2150 2722 21.5 ORF BASE COUNT 1100 a 459 c 487 g 997 t ORIGIN 1 tcacataatc tatttagaga tcgagtcatg cacgattata taagtaatac atatattgat 61 cttgagtgtt tagatattat tagatcgttg gatggattcg atatcaatgg ttactttgaa 121 ggacgtacac cacttcattg cgctatacaa cataacttca ctcagattgc taagtactta 181 ttagatcgag gagctgatat agtcgtaccc aacacattga ttatacatca gtacatacag 241 taaatagcat agatatggag gaggatacaa atatttcaaa taaagttata aggtacaaca 301 ctgtcaataa tatatgggaa acattaccta acttctggac tggaactata aatccaggcg 361 tggtctcgca taaagatgat atatatgttg tatgcgacat caaagatgaa aaaaatgtta 421 aaacttgtat atttagatat aacacgaata cgtataacgg atgggaattg gtcacgacga 481 cagaaagcag attatcagct ctgcatacta ttctttataa caataccata atgatgttac 541 attgttatga atcgtatatg ttacaagata catttaatgt gtacactcgc gaatggaatc 601 atatgtgtca tcaacattcg aatagttata tcatgtacaa tatactaccc atctactaaa 661 tataatagaa taaaataaat gagtatgatc attttagata acgattgatt ttatcattac 721 cgcttcattc ttatattctt tgcttacgga acctatattt agaaacatct actaacgatt 781 ttttatgctt gcattattaa tggtatgtaa tatgattgat tgtgtacgca ataccaattt 841 gttaagtatg aatacggggt acaaacataa actgaagttt aacattattt atttatgata 901 tatatcgtta ttgtttggtc tataccatgg atatctttaa agaactaatc ttaaaacaca 961 cggatgaaaa tgttttgatt tctccagttt ctattttatc tactttatct attctaaatc 1021 atggagcagc tggttctaca gctgaacaac tatcaaaata tatagagaat atgaatgaga 1081 atacacccga tgacaataat gacatggacg tagatattcc gtattgtgcg acactagcta 1141 ccgcaaataa aatatacggt agcgatagta tcgagttcca cgcctccttc ctacaaaaaa 1201 taaaagacga ttttcaaact gtaaacttta ataatgctaa ccaaacaaag gaactaatca 1261 acgaatgggt taagacaatg acaaatggta aaattaattc cttattgact agtccgctat 1321 ccattaatac tcgtatgaca gttgttagcg ccgtccattt taaagcaatg tggaaatatc 1381 cattttctaa acatcttaca tatacagaca agttttatat ttctaagaat atagttacca 1441 gtgttgatat gatggtgagc actgagaata acttgcaata tgtacatatt aatgaattat 1501 tcggaggatt ctctattatc gatattccat acgagggaaa ctctagtatg gtaattatac 1561 taccggacga catagaaggt atatataaca tagaaaaaaa tataacagat gaaaaattta 1621 aaaaatggtg tggtatgtta tctactaaaa gtatagactt gtatatgcca aagtttaaag 1681 tggaaatgac agaaccgtat aatctggtac cgattttaga aaatttagga cttactaata 1741 tattcggata ttatgcagat tttagcaaga tgtgtaatga aactatcact gtagaaaaat 1801 ttctacatac gacgtttata gatgttaatg aggagtatac agaagcatcg gccgttacag 1861 gagtatttat gactaacttt tcgatggtat atcgtacgaa ggtctacata aaccatccat 1921 tcatgtacat gattaaagac aacacaggac gtatactttt tatagggaaa tactgctatc 1981 cgcaataaat ataaacaaat agacttttat cacgtttatc tatgtctaaa tattacaaat 2041 agtaatagta taaactaaag ctgataatac ttaaaaaaat aataatatca tttacaatta 2101 atagtataaa ctaaaaatta aacaaatcgt tattataagt aatatcaaaa tgatgatata 2161 cggattaata gcgtgtctta tattcgtgac ttcatccatc gctagtccac tttatattcc 2221 cgttattcca cccatttcgg aagataaatc gttcaatagt gtagaggtat tagtttcctt 2281 gtttagagat gaccaaaaag actatacggt aacttctcag ttcaataact acactatcga 2341 taccaaagac tggactatcg gcgtactatc cacacctgat ggtttggata taccattgac 2401 taatataact tattggtcac ggtttactat aggtcgtgca ttgttcaaat cagagtctga 2461 ggatattttc caaaagaaaa tgagtattct aggtgtttct atagaatgta agaagtcgtc 2521 gacattactt acttttttga ccgtgcgtaa aatgactcga gtatttaata aatttccaga 2581 tatggcttat tatcgaggag actgtttaaa agccgtttat gtaacaatga cttataaaaa 2641 tactaaaact ggagagactg attacacgta cctctctaat ggggggttgc ctgcatacta 2701 tcgtaatggg gtcgatggtt gattattgat tagtatattc cttattcttt ttattcacac 2761 aaaaagaaca tttttataaa catgaaacca ctgtctaaat gtaattatga tcttgattta 2821 tagatgaaga tcagccttta gaggatttta accagtatgt ttaatatgaa aaaaataaac 2881 ataacatatt ttgagattaa gcgctattgt gcaagattat attagaatca aattaatctt 2941 tcatacgaga aaaataacga catacgtcgt caacaaatta aactttttat ttattagtta 3001 actagcttat agaacttgct cattgttatg tttctaaaac ggg // LOCUS VACSPI2A 1475 bp ds-DNA VRL 03-AUG-1990 DEFINITION Vaccinia virus serine protease inhibitor superfamily gene SPI-2. ACCESSION M24218 KEYWORDS serine protease inhibitor. SOURCE Vaccinia virus DNA. ORGANISM Vaccinia virus Viridae; ds-DNA enveloped viruses; Poxvirinae; Orthopoxvirus. REFERENCE 1 (bases 1 to 1475) AUTHORS Kotwal,G.J. and Moss,B. TITLE Vaccinia Virus Encodes Two Proteins That Are Structurally Related to Members of the Plasma Serine Protease Inhibitor Superfamily JOURNAL J. Virol. 63, 600-606 (1989) STANDARD full staff_entry REFERENCE 2 (bases 1 to 1475) AUTHORS Kotwal,G.J. and Moss,B. JOURNAL Unpublished (1990) STANDARD full staff_entry COMMENT Draft entry and computer readable copy of sequence for [2] kindly submitted by B.Moss, 15-FEB-1990. FEATURES from to/span description pept 295 1332 SPI-2 BASE COUNT 487 a 245 c 286 g 457 t ORIGIN 1 tccatggaaa aacgaaagta gtataaaagt aataaaacaa aaaaaagaat ataaaaaatt 61 tatagctact ttctttgagg actgttttcc tgaaggaaat gaacctctgg aattagttag 121 atatatagaa ttagtataca cgttagatta ttctcaaact cctaattatg acagactacg 181 taaactgttt atacaagatt gaaattatat tctttttttt atagagtgtg gtagtgttac 241 ggatatttaa tattagacta tctctatcgc gctacacgac caatatcgat tactatggat 301 atcttcaggg aaatcgcatc ttctatgaaa ggagagaatg tattcatttc tccagcgtca 361 atctcgtcag tattgacaat actgtattat ggagctaatg gatccactgc tgaacagcta 421 tcaaaatatg tagaaaagga ggagaacatg gataaggtta gcgctcaaaa tatctcattc 481 aaatccataa ataaagtata tgggcgatat tctgccgtgt ttaaagattc ctttttgaga 541 aaaattggcg ataagtttca aactgttgac ttcactgatt gtcgcactat agatgcaatc 601 aacaagtgtg tagatatctt tactgagggg aaaatcaatc cactattgga tgaaccattg 661 tctcctgata cctgtctcct agcaattagt gccgtatact ttaaagcaaa atggttgacg 721 ccattcgaaa aggaatttac cagtgattat cccttttacg tatctccgac ggaaatggta 781 gatgtaagta tgatgtctat gtacggcaag gcatttaatc acgcatctgt aaaggaatca 841 ttcggcaact tttcaatcat agaactgcca tatgttggag atactagtat gatggtcatt 901 cttccagaca agattgatgg attagaatcc atagaacaaa atctaacaga tacaaatttt 961 aagaaatggt gtaactctct ggaagctacg tttatcgatg ttcacattcc caagtttaag 1021 gtaacaggct cgtataatct ggtggatact ctagtaaagt caggactgac agaggtgttc 1081 ggttcaactg gagattatag caatatgtgt aattcagatg tgagtgtcga cgctatgatc 1141 cacaaaacgt atatagatgt caatgaagag tatacagaag cagctgcagc aacttgtgca 1201 ctggtgtcag actgtgcatc aacaattaca aatgagttct gtgtagatca tccgttcatc 1261 tatgtgatta ggcatgttga tggaaaaatt cttttcgttg gtagatattg ctctccgaca 1321 actaattgtt aaccattttt tttaaaaaat agaaaaaaca tgtggtatta gtgcaggtcg 1381 ttattcttcc aattgcaatt ggtaagatga cggccaactt tagtacccac gtcttttcac 1441 cacagcactg tggatgtgac agactgacca gtatt // LOCUS YSCARSX 1539 bp ds-DNA PLN 03-AUG-1990 DEFINITION Yeast (S.cerevisiae) chromosome III L terminal region DNA. ACCESSION M12922 KEYWORDS . SOURCE S.cerevisiae DNA. ORGANISM Saccharomyces cerevisiae Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes; Endomycetales; Saccharomycetaceae. REFERENCE 1 (bases 1 to 1539) AUTHORS Button,L.L. and Astell,C.R. TITLE The Saccharomyces cerevisiae chromosome III left telomere has a type X, but not a type Y', ARS region JOURNAL Mol. Cell. Biol. 6, 1352-1356 (1986) STANDARD simple staff_review REFERENCE 2 (sites) AUTHORS Longtine,M.S., Wilson,N.M., Petracek,M.E. and Berman,J. TITLE A yeast telomere binding activity binds to two related telomere sequence motifs and is indistinguishable from RAP1 JOURNAL Curr. Genet. 16, 225-239 (1989) STANDARD full staff_entry FEATURES from to/span description site 232 236 an unusual yeast telomere repeat motif (CCCCA) BASE COUNT 514 a 551 c 130 g 344 t ORIGIN 1 cccacacacc acacccacac cacacccaca caccacacac accacaccca cacacccaca 61 ccacaccaca cccacaccac acccacacac ccacacccac acaccacacc cacacacacc 121 acacccacac acaccacacc cacacaccac acccacacac acaccacacc cacacacacc 181 acaccacacc cacaccacac ccacacccac acaccacacc acaccacacc cacaccacac 241 accacaccca cacacaccac accacacaca ccacaccaca cccacacacc acacccacac 301 acccacaccc acacacacca cacccacacc acacccacac ccacacaccc acaccctaac 361 actaccctaa cactacccta ttctaaccct gattttacct gtctcccaac ttactctcca 421 ttaccctacc tctccactcg ttaccctgtc tgattcaacc gtaccactcc caaccaccat 481 ccatctctct acttactacc accaacccac cgtccaccat aaccgttacc ctccaactac 541 ccgtatccaa ctccactacc gcttacccta ccatcgacca tgtcctactc actgtactgt 601 tgttcaccca ccatattgaa acgtctacaa atgatcgtaa ataatacaca tatacttatc 661 ctaccactct aatcccacta ccacatgcca tactcacctt cacttgtatt ctgatcggtc 721 atacgcacac ggatgctaca gtatatacca tctcaaactt acctactttc atattcatca 781 tcacccatct ctcaccatca gtaccaatgc actcgcatca ttatgcacgg cttgcctcag 841 cggtctatac cctgtgccat ttacgcataa cgcccatcat tatccacatt ttaatatcta 901 tatctcattc ggcgacacca aatattgtat aactgccctt aatacatacg ttataccact 961 tttacaccat atactaacca ctcaatttat acacacttat gtcaatataa ccacaaaatc 1021 accactaaaa tcacctaaac ataaaaatat tctactcttc aacaataata cataaacaca 1081 ctcaattgcg tatctatacc accatgacgt cattaacgta aaagttcctt aatattgcaa 1141 attgcttgaa cggatgccat ttcatgttga atgtggtaac ccaatagcat gatatgagta 1201 atgctttagt attgtttcag agttgtttca gtaatgtttt agacaaggag aacatatagt 1261 agcaaacctc taatccggta gtacttaaga aactacagtt tctatgtacg aaagcagtaa 1321 ctatgtaatt attacattta catgacatat aggaaggtcc aataaactta ctacattatg 1381 acctataagc tagatcgtaa ttcattacgt caacaggtta tgagccctag agcaatgctt 1441 ctgagaacat aatcaatcta tctagcccca acaattataa acagtggctg tacggtatcg 1501 agaccgctgc tgaatatgct aacgaatata tgaacgaat // LOCUS VAC35K 840 bp ds-DNA VRL 03-AUG-1990 DEFINITION Vaccinia virus gene for 35K complement control protein. ACCESSION X13166 KEYWORDS complement protein; regulatory protein; secretory protein. SOURCE Vaccinia virus DNA. ORGANISM Vaccinia virus Viridae; ds-DNA enveloped viruses; Poxvirinae; Orthopoxvirus. REFERENCE 1 (bases 1 to 840) AUTHORS Kotwal,G.J. and Moss,B. TITLE Vaccinia virus encodes a secretory polypeptide structurally related to complement control proteins JOURNAL Nature 335, 176-178 (1988) STANDARD simple automatic COMMENT *source: library=HindIII in M13; strain=wildtype. EMBL features not translated to GenBank features: key from to description SITE 43 54 late transcription start site OPR 543 549 early transcription stop signal FEATURES from to/span description pept 52 > 840 35K preprotein sigp 52 108 signal peptide (AA -19 to -1) matp 109 > 840 mature 35K protein (AA 1-244) BASE COUNT 257 a 152 c 179 g 252 t ORIGIN 1 tttttattat ttgtacgatg tccaggataa catttttacg gataaataaa tatgaaggtg 61 gagagcgtga cgttcctgac attgttggga ataggatgcg ttctatcatg ctgtactatt 121 ccgtcacgac ccattaatat gaaatttaag aatagtgtgg agactgatgc taatgctaat 181 tacaacatag gagacactat agaatatcta tgtctacctg gatacagaaa gcaaaaaatg 241 ggacctatat atgctaaatg tacaggtact ggatggacac tctttaatca atgtattaaa 301 cggagatgcc catcgcctcg agatatcgat aatggccaac ttgatattgg tggagtagac 361 tttggctcta gtataacgta ctcttgtaat agcggatatc atttgatcgg tgaatctaaa 421 tcgtattgtg aattaggatc tactggatct atggtatgga atcccgaggc acctatttgt 481 gaatctgtta aatgccaatc ccctccatct atatccaacg gaagacataa cggatacgag 541 gatttttata ccgatgggag cgttgtaact tatagttgca atagtggata ttcgttgatt 601 ggtaactctg gtgtcctgtg ttcaggagga gaatggtccg atccacccac gtgtcagatt 661 gttaaatgtc cacatcctac aatatcaaac ggatacttgt ctagcgggtt taaaagatca 721 tactcataca acgacaatgt agactttaag tgcaagtacg gatataaact atctggttcc 781 tcatcatcta cttgctctcc aggaaataca tggaagccgg aacttccaaa atgtgtacgc // LOCUS BSUCISAB 2246 bp ds-DNA BCT 03-AUG-1990 DEFINITION B.subtilis site-specific DNA recombinase (cisA) and RNA polymerase sigma-K subunit (cisB; spoIVCB) genes, complete cds. ACCESSION M29040 KEYWORDS DNA recombinase; RNA polymerase sigma-K subunit. SOURCE B.subtilis (strain JH642) DNA. ORGANISM Bacillus subtilis Prokaryota; Bacteria; Firmicutes; Endospore-forming rods and cocci; Bacillaceae. REFERENCE 1 (bases 1 to 2246) AUTHORS Sato,T., Samori,Y. and Kobayashi,Y. TITLE The cisA cistron of Bacillus subtilis sporulation gene spoIVC encodes a protein homologous to a site-specific recombinase JOURNAL J. Bacteriol. 172, 1092-1098 (1990) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by Y.Kobayashi, 14-OCT-1989. FEATURES from to/span description pept 134 1636 DNA recombinase (cisA) (gtg start codon) pept 2014 1544 (c) RNA polymerase sigma-K subunit (cisB) binding 119 127 ribosome binding site signal 1635 1660 pot. transcription termination signal BASE COUNT 776 a 405 c 475 g 590 t ORIGIN 3 bp upstream of PvuII site. 1 cagctgttac atagcattac ccaaggggtg atgcatttta tgaaagtgat aatcatcgag 61 ggaccgcaag ctgacaaatg cattaacgat tgctatcatt atttaataaa actttatagg 121 aaggagattc agggtgatag caatatatgt aagggtatcg accgaggaac aagcgatcaa 181 gggatcgagc atcgacagcc aaatcgaggc ctgtataaag aaagcaggga ctaaagatgt 241 gctgaagtat gcagatgaag gattttcagg agagctttta gaacgtccgg ctttgaatcg 301 cttgagggag gatgcaagca agggacttat aagtcaagtc atttgttacg atcctgaccg 361 tctttctcgg aaattaatga atcagctaat cattgatgac gaattgcgaa agcgaaacat 421 acctttgatt tttgtaaatg gtgaatacgc caattctcca gaaggtcaat tgtttttcgc 481 aatgcgcggg gcaatctcag aatttgaaaa agccaaaatc aaagaacgga catcaagcgg 541 ccgacttcaa aaaatgaaaa aaggcatgat cattaaagat tctaaactat atggctataa 601 atttgttaaa gagaaaagaa ctcttgagat attagaagag gaagcaaaaa tcattcggat 661 gatttttaac tatttcaccg atcataaaag cccttttttc ggcagagtaa atggtattgc 721 tctacattta actcagatgg gggttaaaac aaaaaaaggc gccaaagtat ggcacaggca 781 ggttgttcgg caaatattaa tgaactcttc ctataagggt gaacatagac agtataaata 841 tgatacagag ggttcctatg tttcaaagca ggcagggaac aaatctataa ttaaaataag 901 gcctgaagaa gaacaaatca ctgtgacaat tccagcaatt gttccagctg aacaatggga 961 ttatgctcaa gaactcttag gtcaaagtaa aagaaaacac ttgagtatca gccctcacaa 1021 ttacttgtta tcgggtttgg ttagatgcgg aaaatgcgga aataccatga cagggaagaa 1081 aagaaaatca catggtaaag actactatgt atatacttgc cggaaaaatt attctggcgc 1141 aaaggaccgc ggctgcggaa aagaaatgtc tgagaataaa ttgaaccggc atgtatgggg 1201 tgaaattttt aaattcatca caaatcctca aaagtatgtt tcttttaaag aggctgaaca 1261 atcaaatcac ctgtctgatg aattagaact tattgaaaaa gagatagaga aaacaaaaaa 1321 aggccgcaag cgtcttttaa cgctaatcag cctaagcgat gacgatgatt tagacataga 1381 tgaaatcaaa gcacaaatta ttgaactgca aaaaaagcaa aatcagctta ctgaaaagtg 1441 taacagaatc cagtcaaaaa tgaaagtcct agatgatacg agctcaagtg aaaatgctct 1501 aaaaagagcc atcgactatt ttcaatcaat cggtgcagat aacttaactc ttgaagataa 1561 aaaaacaatt gttaacttta tcgtgaaaga agttaccatt gtggattctg acaccatata 1621 tattgaaacg tattaaagag gggtgtatgc accccccttt tgtaattaca atctcatttt 1681 caatacacct cgctgcatac gtcgccacct ttgtcccttt tccagcggaa tagctttcaa 1741 ttcctttaat aagcccgatc gttccgatgg agattaagtc ctctgcatcc tcacctgtat 1801 tttcgaactt tttcacaata tgggcgacca agcgaagatt atgctcaatc agcatgtttc 1861 tggcatgttc atccccttta gccatgagct ctaagtattt tttttcttcg ctgcttgaga 1921 gcggttgtgg aaaggcattg tttttcacgt aagatactaa aaagacaagc tctttaacaa 1981 caaagccgag cgctgcgaaa acacctgtca ccatcgtcac ctccacaaaa gtatgtaggc 2041 aaaagcctat atgtaaatgt atgtgaccgg gaggctgtct gtgtctgtac cgggaaattt 2101 ctcgcggggg agtccggctt tggtaagcag catactaact tcaaatgtta aaaatattct 2161 tcatcaagcg cccatacatt gaaatgaaca aaggtatggg gggatgggga tgaaaaaatg 2221 gatggcaggc ctgtttcttg ctgcag // LOCUS BSUSIG43 5127 bp ds-DNA BCT 03-AUG-1990 DEFINITION Bacillus subtilis sigma 43 operon with P23-dnaE-rpoD genes (dnaE for DNA primase, rpoD for RNA polymerase). ACCESSION X03897 KEYWORDS DNA primase; RNA polymerase; polymerase; primase; unidentified reading frame. SOURCE Bacillus subtilis. ORGANISM Bacillus subtilis Prokaryota; Bacteria; Firmicutes; Endospore-forming rods and cocci; Bacillaceae. REFERENCE 1 (bases 1 to 5127; enum. 1 to 5127) AUTHORS Wang,L.-F. and Doi,R.H. TITLE Nucleotide sequence and organization of Bacillus subtilis RNA polymerase major sigma (sigma 43) operon JOURNAL Nucleic Acids Res. 14, 4293-4307 (1986) STANDARD simple automatic COMMENT EMBL features not translated to GenBank features: key from to description PRM 728 733 -35 region P1 PRM 750 755 -10 region P1 PRM 754 759 -35 region P2 PRM 778 783 -10 region P2 RBS 848 851 pot. rRNA binding site PRM 942 949 -35 region P3 PRM 963 971 -10 region P3 RBS 1462 1472 pot. rRNA binding site RBS 3462 3474 pot. rRNA binding site OPR 4609 4631 palindrome pot. transcription terminator FEATURES from to/span description pept 855 1445 P23 unidentified reading frame pept 1479 3290 dnaE pept 3489 4604 rpoD BASE COUNT 1677 a 904 c 1177 g 1369 t ORIGIN 1 gaattcttat gttgaagata taggcactat caatgaagtg atttcacttg caaaggcaga 61 cggcggcatt atctgtttta cactcgtggt gccggaaatc agagaatatt tgatagccga 121 agcggaaaaa gcaaatgttt tatattatga tattatcggc ccgttgattg ataaaatgga 181 aacagcctac ggtttaacag cgaaatacga accggggcgg gtgcgccagc ttgatgaaga 241 ttatttcaaa aaagtggagg ccatcgagtt tgcagttaaa tacgatgatg gacgtgatcc 301 aagagggatt ttaaaagctg atatcgtttt gatcggcgtg tcaagaacgt ctaaaacacc 361 gctgtctcaa tatctcgcac acaaacgcct gaaggttgcc aatgttccga ttgtaccgga 421 ggttgatccg ccggaagaac tctttaacgt tgatccgaaa aaatgcatcg gtttaaagat 481 tagccctgat aaactgaatc atatcagaaa agaacgttta aaatcactcg ggcttaatga 541 taaagcgatt tatgcaaata tcaacagaat caaagaggaa ctcgagtatt tcgaaaagat 601 tgtggatcgg atcggctgcc aggttgttga tgtttcaaat aaagcggttg aggaaacagc 661 aaatattatc catcatctca aaacaaaaaa catataactc aggacgctct atcctgggtt 721 tttggctgtg ccaaaaggga ataatgaaaa acaatagcat ctttgtgaag tttgtattat 781 aataaaaaat tgtgataaaa tgattaattt taggtttaag gatcgtgtga tacgaataaa 841 ctattatggg taagatgtca agaatttctc ccggaaattt ttcgacaaat tcatatacat 901 ccacaataat aaaggatgtg cgattttgct tgcttttatg caggagttta atggagggat 961 ggagaattac tcttcttaat gaacaagaaa agacgatttt tgtcgatgct gatgcttgtc 1021 cggtaaaaga tgaaatttta caaacagcat ccgagtatga agttcaagtt ctttttgtcg 1081 cttcatttga acattatcag ctttccagaa gcaatgaaga aaaatggaag tatgttgatc 1141 ctcataaaga agctgctgat ttatatatcg caaatcacgt gaaaccggga gatattgttg 1201 tgacgcagga catcggatta gcatctctgc tgttgaacag aaatgtctct gttatgtcgg 1261 aaagaggtcg tctttacaag gaagacacga ttgattttgc cctagagggc cgtcattttt 1321 ccggcaaaca aagaagaaaa ggcgtatatg ccaaagggcc taaaaaattg aataaagaag 1381 atcgagaacg atttattaca ctgctgcaaa aaatcctgtc gaacgatgaa gggattttgc 1441 actaaagcat cgaataatgt acgacggagt gttataagat gggaaatcgg ataccagatg 1501 aaattgtgga tcaggtgcaa aagtcggcag atatcgttga agtcataggt gattatgttc 1561 aattaaagaa gcaaggccga aactactttg gactctgtcc ttttcatgga gaaagcacac 1621 cttcgttttc cgtatcgccc gacaaacaga tttttcattg ctttggctgc ggagcgggcg 1681 gcaatgtttt ctctttttta aggcagatgg aaggctattc ttttgccgag tcggtttctc 1741 accttgctga caaataccaa attgattttc cagatgatat aacagtccat tccggagccc 1801 ggccagagtc ttctggagaa caaaaaatgg ctgaggcaca tgagctcctg aagaaatttt 1861 accatcattt gttaataaat acaaaagaag gtcaagaggc actggattat ctgctttcta 1921 ggggctttac gaaagagctg attaatgaat ttcagattgg ctatgctctt gattcttggg 1981 actttatcac gaaattcctt gtaaagaggg gatttagtga ggcgcaaatg gaaaaagcgg 2041 gtctcctgat cagacgcgaa gacggaagcg gatatttcga ccgcttcaga aaccgtgtca 2101 tgtttccgat ccatgatcat cacggggctg ttgttgcttt ctcaggcagg gctcttggca 2161 gccagcagcc taagtatatg aacagtcctg aaaccccgct ctttcataaa agcaaactgc 2221 tttacaattt ttataaggcc cgccttcata tcagaaagca ggaaagagca gtcttatttg 2281 aagggtttgc tgatgtctat acggccgtaa gctcggatgt aaaggaaagc atagccacga 2341 tgggaacgtc tcttacagat gatcatgtca agatcctgag aagaaacgtc gaagaaatca 2401 ttctttgcta tgactctgat aaagccggtt atgaagccac cttaaaagct tcggagcttc 2461 tgcaaaaaaa aggctgcaaa gtcagagttg caatgattcc tgacggattg gaccctgatg 2521 attacatcaa aaaattcggc ggggaaaaat ttaaaaacga cattattgac gcaagtgtca 2581 ccgtaatggc gttcaaaatg caatatttcc gaaaaggaaa gaacctgtcc gatgaaggcg 2641 accgcctagc ttacattaaa gacgtactga aagaaatcag cacgctttca gggtctctag 2701 agcaggaagt ctatgtaaag cagcttgctt cagagttttc gctttcacag gagtctttaa 2761 ctgagcagct gtctgttttc agcaagcaaa acaaacctgc tgacaatagc ggtgaaacta 2821 aaacgcggcg agcgcatctg acgacaaaag caaggcaaaa acgtttgcgt ccggcgtatg 2881 aaaatgcaga aaggctgtta ctcgctcaca tgcttcgaga tcggagcgtc atcaaaaaag 2941 tgattgaccg ggtagggttt caatttaata ttgatgagca ccgggcatta gccgcttatc 3001 tttatgcttt ttatgaagag ggagccgagc tgacgcctca gcatctgatg gccagggtga 3061 cggatgatca tataagccag ctcttgtccg atatattaat gcttcaggtt aatcaagagc 3121 ttagcgaagc cgagttatca gattatgtaa aaaaagtgtt gaatcaaaga aattggtcaa 3181 tgataaaaga aaaagaggcg gaaagagccg aagcagaaag gcaaaaagat tttttaagag 3241 ctgcttcttt ggctcaagaa atcgttacat tgaaccgatc tttaaaataa ctggagaact 3301 gatgaggagc atttattggc aatgattcct tgcggaggag caaatagatc gcttaacctc 3361 atcatgaatt gtcatttcat tattcgcaca ttgttaaagg cagttcacat agaaaacgcc 3421 tgaatggacc gaataagaat cataccgctt atagaattcg ttgcaagctt tggaaggagg 3481 gatccataat ggctgataaa caaacccacg agacagaatt aacattcgac caagtaaaag 3541 agcaattaac agagtctggt aaaaaacgtg gcgttttgac atatgaagaa attgctgagc 3601 gtatgtccag ctttgaaatt gaatcagacc aaatggatga gtattatgaa tttttaggtg 3661 aacaaggtgt tgaattaatt agtgagaatg aagaaacaga agatcctaat attcagcagc 3721 ttgccaaagc cgaagaagaa tttgacctta atgacctaag tgtaccgcct ggcgttaaaa 3781 tcaatgaccc agttcgtatg tatttaaagg aaatcggtcg ggttaacctt ctttctgcaa 3841 aagaagaaat cgcctacgct caaaagattg aagaaggtga cgaagaatct aaacgcagat 3901 tggctgaagc gaacctgcgg cttgttgtca gtatcgcaaa acggtatgtc ggacgcggta 3961 tgctgttcct tgatctgatc catgaaggaa acatgggcct gatgaaagcc gttgaaaaat 4021 ttgattatcg caaaggttat aaattcagta cgtatgctac gtggtggatc agacaggcga 4081 ttacacgcgc cattgccgat caggcgagaa cgatccggat tcccgttcat atggttgaaa 4141 ccattaataa attaatccgt gtgcagcgtc aattactgca agacttaggc agagaaccaa 4201 cacctgaaga aattgcggaa gatatggatt taacgcctga aaaagtacgc gaaatcttaa 4261 agattgctca agagccggta tctctggaaa caccgatcgg tgaagaggat gactcgcacc 4321 ttggtgattt cattgaagac caagaagcaa cttcaccttc tgaccacgcc gcatacgagc 4381 tattgaaaga gcagctggaa gatgtgcttg atacgttaac tgatcgtgaa gaaaatgtat 4441 tgcgtcttcg attcggtctt gatgacggcc gtacaagaac attagaagag gtcggcaaag 4501 tatttggagt aacgagagag cgtattcgac aaatcgaagc caaagcgttg cggaaactaa 4561 gacatcctag cagaagtaaa cgtttgaaag atttccttga ataagatgga acgggtcttg 4621 aagatccgtt cttctttttt taaaaagata tatggataat atgcctttat tttactgaaa 4681 aatgatgtca tttgcaaatg aacattgtgg tgaaaaattt caaaatctaa ttccatattt 4741 tctattgtaa gcgtatacaa tacattatac aatagaataa aaaggatatt agagatttag 4801 gcatgtttct atttcaattt tgtataaagt gtgaataaaa aacttttgta tagcaatcca 4861 tttacttttt gtaaaaataa gttagaatta gaagtgttta cataggggga aggatcaaaa 4921 agggggaagg gaaatgaaat ggaacccgct tattccattt ttgctgatcg ctgttttagg 4981 aatgggtcta actttctttt tatcagtaaa aggacttgat gactctcggg aggttgcgag 5041 cggaggagaa agcaaatctg ctgaaaagaa agatgcaaac gcttcaccag aagaaattta 5101 caaggcaaat cgcatcgcat cgcatgc // LOCUS CHKMHBFVB 4790 bp ds-DNA VRT 03-AUG-1990 DEFINITION Chicken MHC class I B-FIV-B12 alpha-chain gene, complete cds. ACCESSION M31012 KEYWORDS cell surface antigen; cell surface glycoprotein; class I gene; integral membrane protein; major histocompatibility complex. SOURCE Chicken (strain CB; haplotype B12), clone cBF23. ORGANISM Gallus gallus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Aves; Neornithes; Neognathae; Galliformes; Phasianidae. REFERENCE 1 (bases 1 to 4790) AUTHORS Kroemer,G., Zoorob,R. and Auffray,C. TITLE Structure and expression of a chicken MHC class I gene JOURNAL Immunogenetics 31, 405-409 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by G.Kroemer, 21-DEC-1990. FEATURES from to/span description pept 1405 1471 MHC B12 alpha-chain, exon 1 1589 1852 MHC B12 alpha-chain, exon 2 2081 2353 MHC B12 alpha-chain, exon 3 2457 2729 MHC B12 alpha-chain, exon 4 2803 2910 MHC B12 alpha-chain, exon 5 3021 3053 MHC B12 alpha-chain, exon 6 3213 3245 MHC B12 alpha-chain, exon 7 (put.) 3400 3419 MHC B12 alpha-chain, exon 8 IVS 1472 1588 MHC B12 alpha-chain intron A IVS 1853 2080 MHC B12 alpha-chain intron B IVS 2354 2456 MHC B12 alpha-chain intron C IVS 2730 2802 MHC B12 alpha-chain intron D IVS 2911 3020 MHC B12 alpha-chain intron E IVS 3054 3212 MHC B12 alpha-chain intron F IVS 3246 3399 MHC B12 alpha-chain intron G pre-msg 1347 3587 MHC B12 alpha-chain and introns (alt.) pre-msg 1379 3587 MHC B12 alpha-chain and introns (alt.) binding 1328 1337 Sp1 binding site site 1219 1228 interferon responsive element site 1263 1270 cAMP responsive element site 1238 1244 S box (B-L-beta) site 1247 1255 X box (B-L-beta) site 1288 1294 Y box (B-L-beta) site 1313 1317 CAAT box BASE COUNT 812 a 1637 c 1460 g 881 t ORIGIN Microchromosome 16. 1 acctataagg gacatgggaa cacagcagga ggtgtttggg ttctggggac atggggggaa 61 ctctggggaa ggagatgtga ggtgacatgg gtgacatggc aggaggggac ccaagggtgc 121 acacatccct gtgtgccaca ggggtgtccc caaccccctc accccagagg acatccccca 181 tgtccccata tcccccggtg tccccatacc tcaatttgta ggagctggtt gaagttgttg 241 tggggtgtga gggcaaatgg gactgggagg gtgcaggccc agccccacgg tccccctcag 301 cccatgcagt caccgtcccg gtcaccggcc ccacagctgc cagcaacagc ccatgggggt 361 ccccaaggcc acaggggcgg ggggccacca ggacccacct gagggaacac tgcttgagcc 421 accagcccca taactgctcc ataagccctc ccagatccca taagcatcca gacacccctt 481 aggtccatag gaatcccccc agctctggcc ccataacccc acccaaccac acagaaatct 541 cccttagtcc ccccagactc catcacaccc caatcctata gcagccccca gccccaccag 601 tgccccatag tccctcatag gcactgcaga acattcatag tcccatagct accccacaaa 661 caccccatag ccccatactc agggtcctgg gagctggccc tcattaaggg agtgaggaag 721 aggaggagga agagctgaag ccagtgaggc ccatggctcc ggcagtgtgg ggcagatggg 781 agcagtggcg cagggacagc ggggttgggg tcaatcatta acttgggggg tggacattgg 841 ggtccacctg cagaccctga gggcatcgac ctcagatgga ccacgtggaa tgtggggaag 901 cggtgtgggg cggcgctgtg gggaaatggc actgatccca aaggaagccc tgggggacac 961 cactgtcacc accccctccc ccccggacac gagcctctaa ccctatggat gtgacccccc 1021 ccccaaacag ttccttgact ggaatggaga atgactcctc ctcttctcac ggcccccgtc 1081 ctcccggcag agaccctcgg ccccctcccc cggggtaagg acgcggcccc gtgcgaacac 1141 agcccgggtg cccccccgtg ctcgaagggc cgggggttcc cacaccacgc ccatcccctc 1201 ccccgctccg cctttcgctt tcgcttcaca acctgaggga gcgcattctg cctggcgccc 1261 gatgacgtca cataaaactc caactaccat tggcggagag gcgacggagg agccaatggg 1321 ggcgcggggc ggggcggagg agtaggaaaa gctgaaggag ctgcgctggg tgcggcggac 1381 ttgagagtgc agcggtgcga ggcgatgggg ccgtgcgggg cgctgggcct ggggctgctg 1441 ctcgccgccg tgtgcggggc ggcggccccc ggtgagtgcg gccggaccgg gacccctccc 1501 ccgcccgtaa ccccaccccg gggctgtgtc cgtgggatcc tgagacccac acccgcggct 1561 cacggcccgc tgtgctccgt ccccgcagag ctccataccc tgcggtacat ccaaacggcg 1621 atgacggatc ccggccccgg gcagccgtgg ttcgtgactg tggggtacgt ggacggggaa 1681 ctcttcgtgc actacaacag caccgcgcgg aggtacgtgc cccgcaccga gtggatagcg 1741 gccaaggcgg accagcagta ctgggatgga cagacgcaga tcggacaggg caatgagcag 1801 attgaccgcg agaacctggg catactgcag cggcgctaca accagaccgg cggtgagcac 1861 ggccggggcc gcggctccgt gggtgtggga tgggctccat gcgcagtgcc gcccacaccc 1921 cccaggcctg gccctgcccg gcggcaccgt cccggggctg cccgtcacag ccccaccgcg 1981 ctcggggtgc cgcgtcccgg ggggacccca acccatcccc gctgcagtgg gagccccgga 2041 gccggagggg cccctcaccc cctgcccggc tgtgtttcag ggtctcacac ggtgcagtgg 2101 atgtacggct gtgacatcct cgagggcggc cccatccggg ggtattatca gatggcctac 2161 gatgggagag acttcactgc cttcgacaaa ggcacgatga cgttcactgc ggcagttcca 2221 gaggcagttc ccaccaagag gaaatgggag gaagagagtg aacctgagag gtggaagaat 2281 tacctggagg aaacctgcgt ggagtggctg cggagatacg tggaatacgg gaaggctgag 2341 ctgggcagga gaggtgagcg gggtgggggg gggagcggct gcagtgtggg gctggacgtg 2401 gggcgggggc tcagcgtggg gagctcagcc cggccctcac tgccgcccgc ccgcagagcg 2461 gcccgaggtg cgagtgtggg ggaaggaggc cgacgggatc ctgaccttgt cctgccgcgc 2521 tcacggcttc tacccgcggc ccatcgttgt cagctggctg aaggacggcg cggtgcgggg 2581 ccaggacgcc cactcggggg gcatcgtgcc caacggcgac ggcacctacc acacctgggt 2641 caccatcgat gcgcagccgg gggacgggga caagtaccag tgccgcgtgg agcacgccag 2701 cctgccccag cccggcctct actcgtgggg tgagtgaggg gatgtggggc tggggggctg 2761 cgggctgccc cttcccctgc tgatggcccc gctctccccc agagccgcca cagcccaacc 2821 tggtgcccat cgtggcgggg gtggccgtcg ccattgtggc cattgccatc atggttggtg 2881 ttggattcat catctacaga cgccatgcag gtaaaagcag aggggtgcag gcgggcagtg 2941 gggggatctg ggtcccccct tgggagccct catgcctggc tgtgatgtga acctgtgctg 3001 aagcatctct ctgtctgcag ggaagaaggg gaagggctac aacatcgcgc ccggtgagtg 3061 atgagggcag cgctgtcccc cacctctgcc cagtgccagg gcggtcctgg ggtctgcact 3121 ttctcccagg gtacccattc ctggtgcttg gggctgctcc acgccccata gggagcacag 3181 ggctggatct cacagctgtt cctcccttat agacagggaa ggtggatcca gcagctcgag 3241 cacaggtgcg tgtggggctg tgggttggga ggggtccgtg tgctctctgt ggtactgccc 3301 agggctgggc tatgctgggg ctctgcgggg agacccccgg agcagagggt tgggatgtga 3361 acctggcccc gtgggacatc atcccttctc atcccacagg gagcaacccc gccatctgag 3421 tgctgtgctt cagcctgcaa ggagccaaca gtccacacca gcatttgggg tcagtgatgg 3481 gcacagcccc atcctcttga cctctcacat ctcattctgc ttcctatgct gactgttatg 3541 ctttgcctgc actgcttcct gtgaaataaa atgatgggcc attctgtggc tcagcttgcc 3601 tgcagtctgc actgtgctgt ggttggggat ggggtggatg aggggaccgt gtcagtttgg 3661 ctgctcaggg tgcagatgtg gccctgtgct gagtacccac tgccctcccc tcttctatct 3721 gcctgctgct cactcccccc tgtgtacccc catcccttct cacctctcct ctgtgacccc 3781 atgctggtgg ttgcttgctc cctgtcctgg cagaactctc attttccaat ggcatccctg 3841 tttgttggga tgtggtctcc ttggtcctcc ccccagcagt cactgcacat atccacccca 3901 cttccccccc aggttgctgt cccacagcac tcctatttcc ttctctgccc tccccccccg 3961 cccatccagc tgcctctgca atcctcaccc acaaccttgc ccactccacc tccctcatcc 4021 cgcccttccc ccagctgtcc tgtccctgct gggccccctc cccccacatt gtaccctaca 4081 cccaaataaa tacgtttgtt ctgctgccct ccagccgtct tctggtttat ttcccccccg 4141 atttgttgtt gtggcgtccg ctcttctcct ggggggaagg ggctctgggg gtccctcatt 4201 ctccctgctc ttcttacagc accgggactc cccgcgctga gatcccaaca cacccgggta 4261 caaacatgcg gcttttattc ccagttctgt gtcccacccc cggccctggt ggcactcagt 4321 ggcactgcag tccatgcagt ggccgttgtg tgtcgtagag cagcggtacc cgagcgcgcc 4381 ccggctcggc atccatgtgc ccacggcaca gctcttgtgg tcccttgtcg ctgccccggt 4441 gtccccacct ccaccctcag tgtccccaac gtgcagtgtc cccatcccac atcccaccac 4501 cccctgccct ccccatccat aactcccgct atccctctcc ctctgtcccc ctcccccggt 4561 gctccctgtt gtccccagtc ccgcagaagg ctgccggggg cagcacctcg tgggggggtc 4621 cctgctgcca cacctctccc ccgtccaaca ccactaccca ttgtaccagg gctgcccgcc 4681 ccgtcaccat cagcactgca catccttacc ctctggctag ctggatccaa cttgtggaca 4741 ggcttctgat gccctgtgta tgacaaggtt tctgatcctc gcgcctgcag // LOCUS DOGMHDL9A 1499 bp ss-mRNA MAM 03-AUG-1990 DEFINITION Dog MHC class I DLA-A9/A9 alpha-chain mRNA, complete cds. ACCESSION M32283 KEYWORDS cell surface antigen; cell surface glycoprotein; class I gene; integral membrane protein; major histocompatibility complex. SOURCE Dog (#C624; haplotype DLA-A9/A9 homozygote) peripheral blood leukocyte, cDNA to mRNA, clone pBT-I16. ORGANISM Canis familiaris Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Carnivora; Caniformia; Canoidea; Canidae. REFERENCE 1 (bases 1 to 1499) AUTHORS Sarmiento,U.M. and Storb,R. TITLE Nucleotide sequence of a dog class I cDNA clone JOURNAL Immunogenetics 31, 400-404 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence [1] kindly submitted by U.M. Sarmiento, 22-FEB-1990. FEATURES from to/span description pept 1 1089 MHC DLA-A9/A9 alpha-chain precursor sigp 1 72 MHC DLA-A9/A9 alpha-chain signal peptide matp 73 1086 MHC DLA-A9/A9 alpha-chain mRNA 1 1499 MHC DLA-A9/A9 alpha-chain mRNA BASE COUNT 296 a 425 c 489 g 289 t ORIGIN 1 atggaggtgg tgatgccgcg agccctcctc gtgctgctgt cggcggccct ggccctgacc 61 ccgacccggg cgggctccca ctccctgagg tatttctaca cctccgtgtc ccggcccggc 121 gcgggggacc cccgcttcat cgccgtcggc tacgtggacg acacgcagtt cgtgcggttc 181 gacagcgacg cggccactgg gaggatggag ccgcgggcgc cgtgggtgga gcaggagggg 241 ccggagtatt gggaccggca gacgcggacc atcaaggaga ccgcacggac tttccgagtg 301 gacctggaca ccctgcgcgg ctactacaac cagagcgagg ccgggtctca cacccgccag 361 accatgtacg gctgtgacct ggggcccgac gggcgccttc tccgcgggta cagtcaggac 421 gcctacgacg gcgccgatta catcgccctg aacgaggacc tgcgctcctg gaccgcggcg 481 gacacggcgg cgcagatcac ccagcgcaag tgggaagcgg caggtgtagc agagctacaa 541 tggaggaact acctggagac gacgtgcgtg gagtggctgc ggaggtacct ggagatgggg 601 aaggagacgc tgctgcgcgc agacccccca agcacacgtg tgacccacca ccccgtctct 661 gaccatgagg tcaccctgag gtgctgggcg ctgggcttct accctgcgga gatcaccctg 721 acctggcagc gggatgggga ggaccagacc caggacacag aggttgtgga cacaaggcct 781 gcaggagatg ggaccttcca gaagtgggcg gccgtggtgg tgccttctgg acaggagcag 841 agatacacgt gccacgtcca gcatgagggg ctgccggagc ctatcacgcg gagatgggag 901 ccttcccctc tgtccaccat tgtcatcgtc agcattgctg ctctggttct cctcgtggtc 961 gctggggtga ttggagctgt gatctggagg aagcagcgct caggaggaaa aggaccaggc 1021 tactctcatg ctgcacgtga tgacagtgcc cagggctctg atgtgtctct gacagctcct 1081 agagtgtgag accagctgcc tgtgggactg acggatgcaa gatgtgttca catctcacgt 1141 gatgacatca acaaccctgg cttgtctctg caaacagtgt caggatgtgc ctgtgtccct 1201 aggagcataa tgtgaggagg tggggagatt ggcccaccct gcccaccatg acctgtccct 1261 aatctgatgt gcgctctcct ctctgatgtg ctttcctgtc caggagaggc agggctggac 1321 catctccatc cctgtctttg tttcatgttg agtactaatc tcttactatc cgattgaaaa 1381 taagaatcca gatatgagtt tgtgtttcct gagtcttggg atgtggggct gatgaggtaa 1441 taaaaggaga tttgtgaagt tgagagagca aataaatgga agccctgaga accttccag // LOCUS DOGMHDRBA 1165 bp ss-mRNA MAM 03-AUG-1990 DEFINITION C.familiaris MHC class II DLA-DR-beta-1 gene, complete cds. ACCESSION M29611 KEYWORDS cell surface glycoprotein; class II gene; integral membrane protein; major histocompatibility complex. SOURCE C.familiaris (dog #C624, haplotype DLA-Dw3/Dw3) leukocyte, cDNA to mRNA, clone pBT-DR5. ORGANISM Canis familiaris Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Carnivora; Caniformia; Canoidea; Canidae. REFERENCE 1 (bases 1 to 1165) AUTHORS Sarmiento,U.M. and Storb,R. TITLE Nucleotide sequence of a dog DRB cDNA clone JOURNAL Immunogenetics 31, 396-399 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by U.M.Sarmiento, 31-OCT-1989. FEATURES from to/span description pept 40 840 MHC DLA-DR-beta-1 precursor sigp 40 126 MHC DLA-DR-beta-1 signal peptide matp 127 837 MHC DLA-DR-beta-1 mRNA 1 1165 MHC DLAB1 mRNA BASE COUNT 237 a 313 c 319 g 296 t ORIGIN 1 cgactgctcc cctgcaccct gtcctttctg ttttccagca tggtgtgtct gtgttttctt 61 ggaggctcct ggatgacagc tctgatgctg atactgatgg tgctgaaccc tcccttcgct 121 tgggccaggg acaccccacc acatttcttg gaggtggcaa agtccgagtg ctatttcacc 181 aacgggacgg agcgggtgcg gttcgtggaa agatacatcc ataaccggga ggagttcgtg 241 cgcttcgaca gcgacgtggg ggagttccgg gcggtcacgg agctcgggcg gcccgtcgct 301 gagtcctgga acgggcagaa ggagatcttg gagcaggagc gggcaacggt ggacacctac 361 tgcagacaca actacggggt gattgagagc ttcacggtgc agcggcgagt cgagcctaca 421 gtgactgtgt atcctacgaa gactcagacc ttgcagcatc acaacctcct ggtctgctct 481 gtgaatggtt tctatccagg ccacattgaa gtcaggtggc tccggaatgg ccaggaagag 541 gaagctgggg ttgtgtccac aggcctaatc cgtaatggag actggacctt ccagatcctg 601 gtgatgctgg agatagttcc tcagagcgga gaggtctaca cctgccaagt ggagcaccca 661 agtttgacaa gccctgtcac cgtggaatgg agggcacagt ctgattctgc acagagcaag 721 atgctgagtg gaatcggggg ctttgtcctg ggtctgctct tccttgcagt ggggctgttc 781 atctacttca gaaatcagaa gggacactct ggacttcagc caacaggact cctgagctga 841 agtggagatg gtgaccctca agaaaaaacc ttcctttcca gcttcttctc agagtccaaa 901 ggtttccagc ttggctctta ttcttctaca aatacagcgc ctttctcagg atctggtttg 961 ccctggcttc agtgaccctg cacagcttgt gttccccaat ggctccctct gccgtgtcct 1021 tgacctggaa cccccagtgc tgactgcagt accttatctg cattctctcc ttcctcttcc 1081 tatgcagccc ttatgtttcc cttgcatggg aactcacctt ctgctcatgt ttctttataa 1141 acttttctca aataaatgtg gattg // LOCUS LACTHYSYN 957 bp ds-DNA BCT 03-AUG-1990 DEFINITION L.lactis thymidylate synthase (thyA) gene, complete cds. ACCESSION M33770 KEYWORDS thymidylate synthase. SOURCE L.lactis lactis (strain 712) DNA. ORGANISM Lactococcus lactis Prokaryota; Bacteria; Firmicutes; Regular asporogenous rods; Lactobacillaceae. REFERENCE 1 (bases 1 to 957) AUTHORS Ross,P., O'Gara,F. and Condon,S. TITLE Cloning and characterization of the thymidylate synthase gene from Lactococcus lactis subsp. lactis JOURNAL Appl. Environ. Microbiol. 56, 2156-2163 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly provided by P.R.Rose, 04-APR-1990. FEATURES from to/span description pept 87 926 thymidylate synthase (EC 2.1.1.45) binding 77 80 ribosome binding site site 28 33 -35 region site 51 56 TATA box BASE COUNT 329 a 139 c 192 g 297 t ORIGIN 1 tctgagaggt tattttggga aatactattg aaccatatcg aggtgtgtgg tataatgaag 61 ggaattaaaa aagataggaa aatttcatga cttacgcaga tcaagttttt aaacaaaata 121 tccaaaatat cctagataat ggtgtttttt cagaaaatgc aagaccaaag tataaggatg 181 gtcaaatggc gaatagcaaa tatgtcactg gttcattcgt tacttatgat ttgcaaaagg 241 gggagtttcc aattaccact ttgcgtccaa ttccaatcaa atctgctatt aaagaattga 301 tgtggatata ccaagaccaa acaagtgaac tttctgttct cgaagagaag tatggagtca 361 aatactgggg agaatgggga attggtgatg gtacgattgg gcaacgttat ggtgcaacag 421 tcaaaaaata taatatcatt ggtaaattat tagaaggctt ggccaaaaat ccatggaatc 481 gtcgtaatat catcaacctt tggcagtatg aagattttga ggaaacagaa ggtcttttac 541 catgtgcttt ccaaacgatg tttgatgtcc gtcgagaaaa agatggtcag atttatttgg 601 atgccacact gattcaacgt tcaaacgata tgcttgtagc ccaccatatc aatgcgatgc 661 aatatgttgc tttgcaaatg atgattgcaa aacatttttc ttggaaagtt gggaaattct 721 tttattttgt aaataattta catatttatg ataatcagtt tgagcaggca aatgaattaa 781 tgaagcgaac agcttctgaa aaagaacctc gtttggtcct taatgttcct gatggtacaa 841 actttttcga tattaaacct gaagattttg aacttgtgga ctatgagcca gtaaaacctc 901 aattgaaatt tgatttagca atttaaatta atctataagt tactgacaaa actgtca // LOCUS MUSLY48A 121 bp ss-mRNA ROD 03-AUG-1990 DEFINITION Mouse leukosialin (Ly48) mRNA, partial cds. ACCESSION M30693 KEYWORDS leukosialin. SOURCE Mouse (strain C57BL/6J) adult thymus EL-4 tumor, cDNA to mRNA, clone ML18. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 121) AUTHORS Baecher,C.M., Dorfman,K.S., Mattei,M.G. and Frelinger,J.G. TITLE cDNA cloning and localization of the mouse leukosialin gene (Ly48) to chromosome 7 JOURNAL Immunogenetics 31, 307-314 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by J.G.Frelinger, 14-DEC-1989. FEATURES from to/span description pept < 3 > 119 leukosialin (Ly48; AA at 3) BASE COUNT 28 a 28 c 43 g 22 t ORIGIN Chromosome 7 put. 7F3/7F4 bands. 1 gccagggctc tttagtacta gaagagctga agcctgggtc gggtcccaac ctgaaggggg 61 aggaagagcc gcttgtgggc agtgaggatg aagctgtgga aaccccaact tctgacggtc 121 c // LOCUS MUSUBILP 2912 bp ds-DNA ROD 03-AUG-1990 DEFINITION Mouse housekeeping DXS253E (P3) and DXS254E (GdX) genes, 3' end, and complete cds. ACCESSION J04761 KEYWORDS housekeeping protein. SOURCE Mouse (strain Balb/c) liver DNA, clone lambda-M7P. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 2912) AUTHORS Filippi,M., Tribioli,C. and Toniolo,D. TITLE Linkage and sequence conservation of the X linked genes DXS253E (P3) and DXS254E (GdX) in mouse and man JOURNAL Genomics 7, 453-457 (1990) STANDARD full staff_review COMMENT Draft entry and computer readable sequence for [1] kindly submitted by D.Toniolo, 10-APR-1990. FEATURES from to/span description pept < 1 550 housekeeping protein DXS253E (P) (AA at 2) pept 1536 1583 housekeeping protein DXS254E (GdX), exon 1 1740 1845 housekeeping protein DXS254E (GdX), exon 2 2089 2297 housekeeping protein DXS254E (GdX), exon 3 2413 2523 housekeeping protein DXS254E (GdX), exon 4 pre-msg < 1 801 DXS253E mRNA pre-msg 1533 > 2912 DXS254E mRNA and introns IVS 1584 1739 DXS254E intron A IVS 1846 2088 DXS254E intron B IVS 2298 2412 DXS254E intron C BASE COUNT 649 a 822 c 765 g 676 t ORIGIN X chromosome. 1 cactggtttc ctaccactct catcagccat ctacagctac cttctcagca tccatgaaac 61 actccacgtg cccatctcca agatattggg gaccctgctg tttatcgcca tccccatagc 121 agcgggtgtg gtgatcaaat ctaagctccc caagttctct gaactactgt tacaggtcat 181 caagcccttc agctttatac ttctcctggg tggcctgttc ctggcctacc acatgggggt 241 cttcatccta gtgggagtca ggttacccat tgtactggtg ggtttcacag tgcctcttgt 301 tggcctcttg gtgggctaca gcctggccat ctgcctgaag ctgccagtgg ctcagcgacg 361 aacagtcagc attgaggtag gggtgcaaaa cagcctgcta gccttagcca tgctacagct 421 gtctctgcgc cgccttcaag cagactatgc ctctcaggcc cctttcattg tggcactgag 481 tggtacctcc gagatgttgg ctttggttat tggccagttc atctatagca tcttgtttcc 541 tgttccctga gccctctgga tcaagtttta taacttccag cccccatact cctctcttcc 601 cacagttctt gtgtaccgta ccaaaggcct ttagttctca tgcactatgc actcaagaca 661 aatccaggct tattttttta ctccttttcg ccccagcttt cagtgccaaa gaggccatgc 721 tgagttagat agttgggtat tgcccagaaa tatatttcaa taaaaaaaaa ccaaaaaaac 781 aaaagcaagc tcgggtccct tccagtctgt cccttggagc ctgtggattg gggtaagaag 841 agtgtgtctg ggcggatatg ggccagaaca gtgaaccagc ttggtgaccc tgcacagaac 901 ccactttctc ctcctcactc tgttttgcac atggtagaca tgtacctttg agctgtatcc 961 tcagatctac aaaggctttg ctctaacagc cctacctaac agggcagacc tctgcactat 1021 aaggagtgaa gagagcacta cttccgagct tcagtcagga tatttttgca gccagactag 1081 aaaggaagta gaaagccatt aagctatgac aggcaggact ggaccatttc tgtatctgag 1141 agacaccacc acgtgtttgc attgacttgg tccaccagca gggtgggaat tagcacaggc 1201 cgggcctgga gtgaggctga cattctggtc cttcctttca agccacagct cccagactaa 1261 gcaagcctgg ccccacaggt acctcacttg gatcccagtg ggcggggcat ccccacgtcc 1321 ccgcgccatt aagcccaggt ggtggactct acccaaggca cggccggggc agaagcgaag 1381 gccgggaggc gggactgaag aatgcgcagg cgggagccac gagcggcgcg cccctttcac 1441 ggcttccggc ggcggcgggc ggttccgggg cggcggcgcg cggcagggga ccgttggtgt 1501 ttgcgttggc cgtagtggac tgggccgtgg acaccatgca gctgaccgtg aaggcgctcc 1561 agggccggga atgtagccta caggtaaagt caccccgcag gctgctgcac tcactcctgc 1621 cgctggggcc gagcgcgcgt gggggaggcc gggagagctg acctcaggag ggggtgcgag 1681 ggcttcaccg cccagggaat tcgggctgag cccttgcctc tccacctcgg ccggatcagg 1741 tggcggagga cgagctagtg tctacactga agcacctggt ctcggataag ctgaatgtcc 1801 ctgtgcgcca gcaacgtctg ctgttcaagg gcaaggccct agcaggtaat cagggaaagg 1861 agccatccag ggagctttac agaaagctgg gggacgggat gtacaggcca gtgtagtgga 1921 tcggggcact ggctgccact ggacctcaga aaagtagctg agccatcacc accccaccac 1981 cccatagcca gatgctgcag tgcttcactc tggctgtctg tggcaggatt gcttcccagc 2041 cacctccaca tcagtgagga aggaaccaca ccaatctctc tttggcagat gaaaaacgac 2101 tgtcagatta caacattggg cccaattcta agctcaacct agttgttaag cctttggaga 2161 aggtgctact ggaagaaggg tctgcccaca gactggtcga ctccccagcc acccccatct 2221 ggcagctgat ctccaaagtc ctggcccgtc acttcagtgt agcagatgcc agcagggtcc 2281 tggaacaact acagagggtg agaagagtaa tgctggccct ccattcttaa tgtttctacc 2341 cgtcccctac ctaatgcagc atacgccttc cccactggcc tctctgaata ccactcggtc 2401 ttctctcctc aggattatga caggtccttg agccgcctaa cactggatga catcgaacgt 2461 ttggccagcc gctttctaca ccctgaagtg actgaggcta tggaaaaagg gttctgcaaa 2521 tagcattctg ggattgtggg gagaaatccc aggtcaggcc acagctgcat gttgcattaa 2581 atgtgttctc atgtcgcagt ttggctcata gtgataataa tagctggtat gtatccagct 2641 cttgctaggt gctaggtact caaagtacat ggcctgagtt ctcttctgcc tcccacagca 2701 aagttgaggc tgcactgtag agacacctag caatggcttg ggacagacag ggtccaagtc 2761 tagccagcct atagaagtgc ctctgtagga gaaagagtgt gctgttttca gagatgggtc 2821 ttagtattcc catttagccc actcagcata caccctcaaa agactccctc aggaaaccac 2881 tagttcctgt cacctgccct ctggtgtgtc cc // LOCUS YEPLCREA 3316 bp ds-DNA BCT 03-AUG-1990 DEFINITION Virulence plasmid pYVO3 (from Y.enterocolitica) LcrE protein mRNA, complete cds. ACCESSION M32097 KEYWORDS . SOURCE Virulence plasmid pYV03 (from Yersinia enterocolitica, strain 6471/76, serotype 0:3) DNA, clones pY[MS4146,MS4073,P6]. ORGANISM Plasmid pYV03 Prokaryota; Bacteria. REFERENCE 1 (bases 2 to 1081) AUTHORS Viitanen,A.-M., Toivanen,P. and Skurnik,M. TITLE The lcrE gene is part of an operon in the lcr region of Yersinia enterocolitica O:3 JOURNAL J. Bacteriol. 172, 3152-3162 (1990) STANDARD full staff_review REFERENCE 2 (bases 1 to 3316) AUTHORS Viitanen,A.-M., Toivanen,P. and Skurnik,M. JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [2] kindly submitted by M.Skurnik, 14-FEB-1990. FEATURES from to/span description pept 649 1530 lcrE product ORF 1511 1789 ORF1 ORF 1776 2147 ORF2 (gtg start codon) ORF 2509 2853 ORF4 ORF 2840 > 3315 ORF5 ORF 451 119 (c) ORF6 ORF 173 < 1 (c) ORF7 mRNA 1020 < 1 (c) lcrB locus mRNA BASE COUNT 892 a 751 c 791 g 882 t ORIGIN 1 aggatcccca aagaaagtgg tgtggtaata agttttcggc tcatcggagc tggggcatcc 61 tgataaactg ggtaccaagc cgccggttca gggagatgcc ccccatcgaa aggctgccct 121 aaaccatcca acacctgccc cagcagatgt tcacccaccc caacctgatg cattgtcctg 181 tcgggctaac ttcagtatta gaagatatcc cgtacatttc accaagtgga ataagtaatg 241 cttgatgttg ggcaaaacct atgacttcag cctgtaaaga caggctgttg tctgggttac 301 gtaagtaaca taactcaccg atgcgcacac caggcactac cgcttttaat agcgttcctg 361 tcacttgagt gacacgtcct ctaatttgga ttaggcggct acctacaatg ccatgacgaa 421 tatgatgagg tatctgatct agtgagagca taaatccata atggttgaaa tattaaccac 481 tattttagtg actaaaaacg ctaaaaaatt gtagcgggag ccgcgagttt ttagaaaaat 541 agccaagcag cactaaaatt tctcggctga ttttggcatc gataagcaag aactattttt 601 ataatcgcgg taattgcaat tataaaccgt tcatcttagg gagtagttat gacgacgctt 661 cataacctat cttatggcaa taccccgctg cgtaatgagc atccagagat tgccagtagt 721 cagatcgtaa atcagactct gggtcaattt cggggagaat ctgtgcagat agtcagcggc 781 actctgcagt ctatagctga tatggcagaa gaggtaacat ttgtcttctc cgagcgtaag 841 gagctctccc tcgacaaacg caaattaagt gacagccaag ctcgagttag cgacgttgag 901 gagcaggtta atcaatacct tagcaaagtt ccagagttga aacaaaaaca gaatgtgagt 961 gagctgctca gtctgttgag taacagcccc aatataagct tgtcccagtt aaaggcttat 1021 ctggagggga aatcagaaga accgagtgag caattcaaaa tgctctgcgg cttgcgtgat 1081 gccctgaaag ggcgccctga attagcacat cttttgcatt tggttgaaca agctctggtc 1141 agcatggttg aagagcaaga agaagccatt gtattgggtg ccagaataac cccggaagcg 1201 tacagagaat cccagtcggg tgttaatcca ctgcagccgc tccgtgatac ctaccgcgat 1261 gcagtgatgg gttatcaagg gattaatgcg atctggagtg atttacaaaa acgttttccc 1321 aatggggata tagactcggt gatattattc ctgcaaaagg cgcttagtgc agatctacaa 1381 agtcaacaaa gcgggtctga acgggaaaaa ttagaaatag ttattagtga cttacagaag 1441 ctaaaggagt ttcgtagcgt gagtgaccag gttaaaggat tttggcaact tttttcagag 1501 ggtataacta atggcttacg acctttctga gtttatggga gatattgtcg cactggttga 1561 caagcgctgg gcggggattc atgacattga acatcttgcc aacgcctttt cccttcctac 1621 gcctgaaatc aaagtgcgtt tctatcaaga tttaaaaaga atgtttcgtc ttttccctct 1681 gggggtattt agcgatgagg agcaacggca aaatttattg caaatgtgtc aaaatgcgat 1741 cgatatggct attgagagtg aagaggaaga attgagtgag ttggattgaa cccatcattt 1801 cccatttctg ccaggatctg ggagtgccaa catctagccc cctttcgcct cttattcaat 1861 tagagatggc tcaatctggc acgctgcaac tggaacaaca tggtgcgaca ctgacactgt 1921 ggttagcgcg ttctctcgcc tggcaccagt gcgaagatgc tatggtcaaa gcgctaacgc 1981 tcacggcggc ccaaaagagt ggcgctttac cgctgcgagc ggggtggtta ggggaaaatc 2041 aactggtgtt atttgtctcg cttgatgagc gttccttaac cttgcccctt ttacatcaag 2101 ctttcgaaca gttactgcga ttgcagcaag aggtgcttgc gccgtgagtc gcataataac 2161 tgccccccat attggcatcg aaaaactgtc ggcgattagc ctggaagagc tatcctgtgg 2221 cttgcctgat cgttatgcct tgccgcctga tgggcatcca gtcgaaccac atttagagcg 2281 cctttaccct acagcacaaa gcaagcgtag cctatgggac tttgcttctc ccggctatac 2341 atttcatgga ttacatcgag ctcaagatta tcggcgcgaa ctggatacct tgcagtcact 2401 gctaaccacc agtcagtcct cagagctaca agctgccgcg gcgctgctga aatgccaaca 2461 agatgatgat cggttactgc aaataatcct taacctgttg cacaaagtat gaatattact 2521 ttaaccaaac gacaacagga gttcttgctg ctcaacggtt ggttacaact acaatgtggc 2581 catgcagagc gcgcatgtat tctattggac gccttgctga cgttaaatcc tgagcattta 2641 gccggtcggc gttgccgatt agtcgcgcta cttaataata accagggaga acgtgccgaa 2701 aaagaagcgc aatggctaat atcacatgac cctttacagg ctggaaattg gctctgcttg 2761 agccgtgccc aacaactgaa cggcgatctt gataaggctc gccatgctta tcaacattat 2821 ttggagttga aagatcataa tgaatcccca tgatcttgag tggctaaatc gtattggcga 2881 gcgtaaagat atcatgctgg cagtgctgct gttagctgtg gtattcatga tggtcttacc 2941 actccccccc ctcgtgttgg acattctgat tgctgttaac atgaccatct cagtggtgtt 3001 gttaatgata gcgatctata tcaactctcc tttacaattt tcagctttcc cagcggtgct 3061 actcgttacc acgttatttc gtctcgcact ttcagttagc accacccgca tgatcctgct 3121 acaagctgat gcggggcaga ttgtttacac ctttggtaat ttcgtcgttg gcggtaacct 3181 catcgtcggg attgtcatct tcctgatcat cactattgtg caatttttag tgataacgaa 3241 aggctcagaa cgtgtagcag aagttagtgc cagattctct cttgatgcga tgccgggtaa 3301 acagatgagt atcgat // LOCUS FSOFARS1A 170 bp ds-DNA PLN 03-AUG-1990 DEFINITION F.oxysporum autonomous replicating sequence (ARS). ACCESSION M29451 KEYWORDS autonomous replicating sequence. SOURCE F.oxysporum lycopersici (strain 73) ARS DNA, clone pFOLT4R4. ORGANISM Fusarium oxysporum Eukaryota; Plantae; Thallobionta; Eumycota; Ascomycotina; Pyrenomycetes; Hypocreales; Hypocreaceae. REFERENCE 1 (bases 1 to 170) AUTHORS Powell,W.A. and Kistler,H.C. TITLE In vivo rearrangement of foreign DNA by Fusarium oxysporum produces linear self-replicating plasmids JOURNAL J. Bacteriol. 172, 3163-3171 (1990) STANDARD simple staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by W.A.Powell, 27-OCT-1989. FEATURES from to/span description site 20 28 DNA homologous to yeast autonomously replicating consensus sequence site 26 31 telomere repeat unit site 130 136 ligation point of telomeres -> ApaI site BASE COUNT 35 a 18 c 66 g 51 t ORIGIN 116 bp upstream of ApaI site. 1 catgccatcc gtaagatgct ttggtttagg gttagggtta gggttagggt tagggttagg 61 gttagggtta gggttagggt tagggttagg gttagggtta gggttagggt tagggttagg 121 gttagggtta gggccctaac cctaaaccaa agcatcttac ggatggcatg // LOCUS ARURRUN 50 bp ss-rRNA RNA 03-AUG-1990 DEFINITION A.gigantea ribosomal RNA. ACCESSION M37170 M35661 KEYWORDS ribosomal RNA. SOURCE A.gigantea rRNA. ORGANISM Arundinaria gigantea Eukaryota; Plantae; Embryobionta; Magnoliophyta; Liliopsida; Commelinidae; Cyperales; Poaceae. REFERENCE 1 (bases 1 to 50) AUTHORS Hamby,R.K. and Zimmer,E.A. TITLE Ribosomal RNA sequences for inferring phylogeny within the grass family (Poaceae) JOURNAL Pl. Syst. Evol. 160, 29-37 (1988) STANDARD simple staff_review COMMENT The ribosomal RNA sequences given are either 26S or 18S, but it is not specifically delineated in the article. FEATURES from to/span description rRNA < 1 > 50 ribosomal RNA BASE COUNT 11 a 18 c 12 g 9 t ORIGIN 1 acgagacctc agcctgctaa ctagctacgt ggaggcatcc ctccgcagct // LOCUS ASTRRUN 50 bp ss-rRNA RNA 03-AUG-1990 DEFINITION A.sativa ribosomal RNA. ACCESSION M37171 M35661 KEYWORDS ribosomal RNA. SOURCE A.sativa rRNA. ORGANISM Avena sativa Eukaryota; Plantae; Embryobionta; Magnoliophyta; Liliopsida; Commelinidae; Cyperales; Poaceae. REFERENCE 1 (bases 1 to 50) AUTHORS Hamby,R.K. and Zimmer,E.A. TITLE Ribosomal RNA sequences for inferring phylogeny within the grass family (Poaceae) JOURNAL Pl. Syst. Evol. 160, 29-37 (1988) STANDARD simple staff_review COMMENT The ribosomal RNA sequences given are either 26S or 18S, but it is not specifically delineated in the article. FEATURES from to/span description rRNA < 1 > 50 ribosomal RNA BASE COUNT 11 a 19 c 11 g 9 t ORIGIN 1 acgagacctc agcctgctaa ctagctatgc ggagccatcc ctccgcagct // LOCUS BLYRRUN 50 bp ss-rRNA RNA 03-AUG-1990 DEFINITION H.vulgare ribosomal RNA. ACCESSION M37172 M35661 KEYWORDS ribosomal RNA. SOURCE H.vulgare ribosomal RNA. ORGANISM Hordeum vulgare Eukaryota; Plantae; Embryobionta; Magnoliophyta; Liliopsida; Commelinidae; Cyperales; Poaceae. REFERENCE 1 (bases 1 to 50) AUTHORS Hamby,R.K. and Zimmer,E.A. TITLE Ribosomal RNA sequences for inferring phylogeny within the grass family (Poaceae) JOURNAL Pl. Syst. Evol. 160, 29-37 (1988) STANDARD simple staff_review COMMENT The ribosomal RNA sequences given are either 26S or 18S, but it is not specifically delineated in the article. FEATURES from to/span description rRNA < 1 > 50 ribosomal RNA BASE COUNT 11 a 19 c 11 g 9 t ORIGIN 1 acgagacctc agcctgctaa ctagctacgt ggagccatcc ctccgcagct // LOCUS MUSELF2PS 2183 bp ds-DNA ROD 03-AUG-1990 DEFINITION Mouse elongation factor 2 processed pseudogene, complete cds. ACCESSION X15335 J04749 KEYWORDS elongation factor 2; pseudogene. SOURCE Mouse cell line Ltk- DNA, clone cMER1. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 2183) AUTHORS Koide,T., Ishiura,M., Hazumi,N., Shiroishi,T., Okada,Y. and Uchida,T. TITLE Amplification of a long sequence that includes a processed pseudogene for elongation factor 2 in the mouse JOURNAL Genomics 6, 80-88 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by T.Koide, 31-AUG-1989. FEATURES from to/span description pept.ps 319 2038 pseudo-elongation factor 2 rpt 319 330 repeat copy A rpt 2027 2038 repeat copy B BASE COUNT 594 a 543 c 585 g 461 t ORIGIN 1 aaaaaggaac caggtgtgtg ggttcatacc caccatctca gcactcagga ggtagaaaca 61 gaagataaga agttcaaggt tgtcttcaga tgcacagcag attccaggcc agcccgggct 121 acacaatacc ttgtctcata caaaacaaaa caaaacaaaa caaaacaaaa caaaaatgta 181 gtttgaaaaa gggagcttgc ctgcctatca ttttagtgcc tgtgaggtgg agatgtgagg 241 atcagtaagt agttcaaggt tattcttggc tgtttagtga gtgttggcta tcctaggcta 301 cataagtccc tgtttcaaga aaacaaacca ggccactgta ggcgcgcttc cctgttcacc 361 tctgactctg agaatctgtc gccatccacc accatggtga acttcacagt agttcagatc 421 cgtgccatca cggacaggaa agccaacatt cagaacatgt cagtcatctc catgtggacc 481 acggcaagtc cacgctgaac gactcccttg tgtgcaaggc tggcatcatt gcctccgccg 541 gagccgggga gacacgtttc actgacatgc gcaaagatga gcaggagcgc tgcatcacaa 601 tcaagtccac cgccatctcc ctcttctacg agctctctaa gaacgacctg aaccgcatta 661 agcagagcaa ggatggctcg ggcttcctca tcaacctcat cgactctccc ggccatgtgg 721 acttctcttc agaggtgaca gctgccttgc gtgtcaccga tggagctctg gtggtggtgg 781 actgtgtgtc tggcgtgtgt gtgcagacag aaattgtgct gcgccaggcc atcgctgagc 841 gcatcaagcc cgtcctgatg atgaacaaga tggaccgggc ccctgctgga gctgcagctg 901 gaacccgagg agctctacca gaccttccag cgcatcgtgg aaaacgtcaa cgtcatcatc 961 tctaccgatg gcgagggtga gagtgggccc atgggcaata tcatgattga ccccgtcctg 1021 ggtgccgtag gctttggttc tggcctgcat ggctgggcct tcaccctgaa gcagtttgcg 1081 gagatgtacg tggccaagtt tgcagccaag ggcgagggcc agctgggtgc aactgagcgt 1141 gccaagaaag tagagggcat gatgaagaag ctgtggggag accagtactt tgatctgccc 1201 tcatggagcc tatctatctg gtggagatcc agtgtcctga gcaagtggtg ggtggcatct 1261 atggtgtcct gaacaggaag cgtggccata tgtcccaggt ggccggtacc cccatgtttg 1321 tggtcaaggc atacctgcct gtcaatgaat cctttggctt cactgctgat ctgcgatcca 1381 acactagcag ccaggccttc ccccagtgcg agtttgacca ctggcagatc ctgcctgggg 1441 atccttttgt caacagcagc cgccccagcc aagtggtagc tgagacctgc aagcacaagg 1501 gcctgaaaga gggctggaca acttcctgga caaactgtag gcagcctgat actgccacat 1561 gttgcacagt gcctgcccat cagaagacac cttgagactg tccccacagt gctcctctgg 1621 aggctgttgg ggccaccctg acatcactca gcactcactt ggctaccaat tctatttatt 1681 tcagaattac aaaatagcgg gaatctctct gcaggctgga ctggcaggcc gtggggtagg 1741 cggacatggc tcttaacatt ttcagaggga aacacgcaga tgtccaaaag tctaaataaa 1801 tgcattcaga ggtttatgag gtccatggcc aagtggagct cccccagagg gggaggtggg 1861 gtaagtacct ccaggaaggc aggcagcctg ccttagactt gcaacccggc tgtgcgaatc 1921 attggagtaa taaactacag tggttgatct gaaaaaaaaa aaaaaaaaaa aaaaaaaaaa 1981 aaaaaagaaa agaaaagaaa agaaagaaag aaagaaagaa agaaaggaaa acaaaccaaa 2041 agaaaaagag tttccatgat gtccaggatg atgcaggtgg gaagggctag ctcaggttag 2101 aagggctagc tcaggtctct cttaccctgc tctaactcag cctgcttctg gaggagagac 2161 aagtcaaaga gcagcaggag ctt // LOCUS MUSELF2PSA 2199 bp ds-DNA ROD 03-AUG-1990 DEFINITION Mouse elongation factor 2 processed pseudogene, complete cds. ACCESSION X15336 J04749 KEYWORDS elongation factor 2; pseudogene. SOURCE Mouse cell line Ltk- DNA, clone cMER9. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 2199) AUTHORS Koide,T., Ishiura,M., Hazumi,N., Shiroishi,T., Okada,Y. and Uchida,T. TITLE Amplification of a long sequence that includes a processed pseudogene for elongation factor 2 in the mouse JOURNAL Genomics 6, 80-88 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by T.Koide, 31-AUG-1989. FEATURES from to/span description pept.ps 324 2016 pseudo-elongation factor 2 rpt 324 335 repeat copy A rpt 2005 2016 repeat copy B BASE COUNT 602 a 546 c 590 g 461 t ORIGIN 1 aaaaaggaac caggtgtgtg ggttcatacc caccatctca gcactcagga ggtagaaaca 61 gaagataaga agttcaaggt tgtcttcaga tgcacagcag attccaggcc agcccgggct 121 acacaatacc ttgtctcata caaaacaaaa caaaacaaaa caaaacaaaa caaaacaaaa 181 caaaacaaaa atgtagtttg aaaaagggag cttgcctgcc tatcatttta gtgcctgtga 241 ggtggagatg tgaggatcag taagtagttc aaggttattc ttggctgttt agtgagtgtt 301 ggctatccta ggctacataa gtccctgttt caagaaaaca aaccaggcca ctgtaggcgc 361 gcttccctgt tcacctctga ctctgagaat ctgtcgccat ccaccaccat ggtgaacttc 421 acagtagttc agatccgtgc catcacggac aggaaagcca acattcagaa catgtcagtc 481 atctccatgt ggaccacggc aagtccacgc tgaacgactc ccttgtgtgc aaggctggca 541 tcattgcctc cgccggagcc ggggagacac gtttcactga catgcgcaaa gatgagcagg 601 agcgctgcat cacaatcaag tccaccgcca tctccctctt ctacgagctc tctaagaacg 661 acctgaaccg cattaagcag agcaaggatg gctcgggctt cctcatcaac ctcatcgact 721 ctcccggcca tgtggacttc tcttcagagg tgacagctgc cttgcgtgtc accgatggag 781 ctctggtggt ggtggactgt gtgtctggcg tgtgtgtgca gacagaaatt gtgctgcgcc 841 aggccatcgc tgagcgcatc aagcccgtcc tgatgatgaa caagatggac cgggcccctg 901 ctggagctgc agctggaacc cgaggagctc taccagacct tccagcgcat cgtggaaaac 961 gtcaacgtca tcatctctac cgatggcgag ggtgagagtg ggcccatggg caatatcatg 1021 attgaccccg tcctgggtgc cgtaggcttt ggttctggcc tgcatggctg ggccttcacc 1081 ctgaagcagt ttgcggagat gtacgtggcc aagtttgcag ccaagggcga gggccagctg 1141 ggtgcaactg agcgtgccaa gaaagtagag ggcatgatga agaagctgtg gggagaccag 1201 tactttgatc tgccctcatg gagcctatct atctggtgga gatccagtgt cctgagcaag 1261 tggtgggtgg catctatggt gtcctgaaca ggaagcgtgg ccatatgtcc caggtggccg 1321 gtacccccat gtttgtggtc aaggcatacc tgcctgtcaa tgaatccttt ggcttcactg 1381 ctgatctgcg atccaacact agcagccagg ccttccccca gtgcgagttt gaccactggc 1441 agatcctgcc tggggatcct tttgtcaaca gcagccgccc cagccaagtg gtagctgaga 1501 cctgcaagca caagggcctg aaagagggct ggacaacttc ctggacaaac tgtaggcagc 1561 ctgatactgc cacatgttgc acagtgcctg cccatcagaa gacaccttga gactgtcccc 1621 acagtgctcc tctggaggct gttggggcca ccctgacatc actcagcact cacttggcta 1681 ccaattctat ttatttcaga attacaaaat agcgggaatc tctctgcagg ctggactggc 1741 aggccgtggg gtaggcggac atggctctta acattttcag agggaaacac gcagatgtcc 1801 aaaagtctaa ataaatgcat tcagaggttt atgaggtcca tggccaagtg gagctccccc 1861 agagggggag gtggggtaag tacctccagg aaggcaggca gcctgcctta gacttgcaac 1921 ccggctgtgc gaatcattgg agtaataaac tacagtggtt gatctgaaaa aaaaaaaaaa 1981 gaaaagaaag aaagaaagaa agaaagaaag aaagaaagaa agaaagaaag aaagaaagaa 2041 aagaaaacaa accaaaagaa aaagagtttc catgatgtcc aggatgatgc aggtgggaag 2101 ggctagctca ggttagaagg gctagctcag gtctctctta ccctgctcta actcagcctg 2161 cttctggagg agagacaagt caaagagcag caggagctt // LOCUS MUSELF2XX 2161 bp ds-DNA ROD 03-AUG-1990 DEFINITION Mouse elongation factor 2 related sequence. ACCESSION X15337 J04749 KEYWORDS elongation factor 2. SOURCE Mouse cell line Ltk- DNA, clone MER7. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 2161) AUTHORS Koide,T., Ishiura,M., Hazumi,N., Shiroishi,T., Okada,Y. and Uchida,T. TITLE Amplification of a long sequence that includes a processed pseudogene for elongation factor 2 in the mouse JOURNAL Genomics 6, 80-88 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by T.Koide, 31-AUG-1989. FEATURES from to/span description pept.ps 334 2054 elongation factor 2 related sequence rpt 334 345 tandem repeat rpt 2043 2054 tandem repeat BASE COUNT 575 a 544 c 581 g 461 t ORIGIN 1 aaaaaggaac caggtgtgtg ggttcatacc caccatctca gcactcagga ggtagaaaca 61 gaagataaga agttcaaggt tgtcttcaga tgcacagcag attccaggcc agcccgggct 121 acacaatacc ttgtctcata caaaacaaaa caaaacaaaa caaaacaaaa caaaacaaaa 181 atgtagtttg aaaaagggag cttgcctgcc tatcatttta gtgcctgtga ggtggagatg 241 tgaggatcag taagtagttc aaggttattc ttggctgttt agtgagtgtt ggctatccta 301 ggctacataa gtccctgttt caagaaaaca aaccaggcca ctgtaggcgc gcttccctgt 361 tcacctctgg ctctgagaat ctgtcgccat ccaccaccat ggtgaacttc acagtagttc 421 agatccgtgc catcacggac aggaaagcca acattcagaa catgtcagtc atctccatgt 481 ggaccacggc aagtccacgc tgaacgactc ccttgtgtgc aaggctggca tcattgcctc 541 cgccggagcc ggggagacac gtttcactga catgcgcaaa gatgagcagg agcgctgcat 601 cacaatcaag tccaccgcca tctccctctt ctacgagctc tctaagaacg acctgaaccg 661 cattaagcag agcaaggatg gctcgggctt cctcatcaac ctcatcgact ctcccggcca 721 tgtggacttc tcttcagagg tgacagctgc cttgcgtgtc accgatggag ctctggtggt 781 ggtggactgt gtgtctggcg tgtgtgtgca gacagaaatt gtgctgcgcc aggccatcgc 841 tgagcacatc aagcccgtcc tgatgatgaa caagatggac cgggcccctg ctggagctgc 901 agctggaacc cgaggagctc taccagacct tccagcgcat cgtggaaaac gtcaacgtca 961 tcatctctac cgatggcgag ggtgagagtg ggcccatggg caatatcatg attgaccccg 1021 tcctgggtac cgtaggcttt ggttctggcc tgcatggctg ggccttcacc ctgaagcagt 1081 ttgcggagat gtacgtggcc aagtttgcag ccaagggcga gggccagctg ggtgcaactg 1141 agcgtgccaa gaaagtagag ggcatgatga agaagctgtg gggagaccag tactttgatc 1201 tgccctcatg gagcctatct atctggtgga gatccagtgt cctgagcaag tggtgggtgg 1261 catctatggt gtcctgaaca ggaagcgtgg ccatatgtcc caggtggccg gtacccccat 1321 gtttgtggtc aaggcatacc tgcctgtcaa tgaatccttt ggcttcactg ctgatctgcg 1381 atccaacact agcagccagg ccttccccca gtgcgagttt gaccactggc agatcctgcc 1441 tggggatcct tttgtcaaca gcagccgccc cagccaagtg gtagctgaga cctgcaagca 1501 caagggcctg aaagagggct ggacaacttc ctggacaaac tgtaggcagc ctgatactgc 1561 cacatgttgc acagtgcctg cccatcagaa gacaccttga gactgtcccc acagtgctcc 1621 tctggaggct gttggggcca ccctgacatc actcagcact cacttggcta ccaattctat 1681 ttatttcaga attacaaaat agcgggaatc tctctgcagg ctggactggc aggccgtggg 1741 gtaggcggac atggctctta acattttcag agggaaacac gcagatgtcc aaaagtctaa 1801 ataaatgcat tcagaggttt atgaggtcca tggccaagtg gagctccccc agagggggag 1861 gtggggtaag tacctccagg aaggcaggca gcctgcctta gacttgcaac ccggctgtgc 1921 gaatcattgg agtaataaac tacagtggtt gatctgaaaa aaaaaaaaaa aaaagaaaag 1981 aaagaaagaa agaaagaaag aaaagaaaac aaaccaaaag aaaaagagtt tccatgatgt 2041 ccaggatgat gcaggtggga agggctagct caggttagaa gggctagctc aggtctctct 2101 taccctgctc taactcagcc tgcttctgga ggagagacaa gtcaaagagc agcaggagct 2161 t // LOCUS MUSTHBS1 4915 bp ds-DNA ROD 03-AUG-1990 DEFINITION Mouse thrombospondin (THBS1) gene, exons 1-7. ACCESSION J05605 KEYWORDS thrombospondin. SEGMENT 1 of 2 SOURCE Mouse (strain BALB/C) DNA, clone phi-42. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 4915) AUTHORS Bornstein,P., Alfi,D., Devarayalu,S., Framson,P. and Li,P. TITLE Characterization of the mouse thrombospondin gene and evaluation of the role of the first intron in human gene expression JOURNAL J. Biol. Chem. (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by P.Bornstein, 28-JUN-1990. FEATURES from to/span description pept 1222 1288 thrombospondin, exon 2 (first expressed exon) 1580 2139 thrombospondin, exon 3 2947 3022 thrombospondin, exon 4 3267 3466 thrombospondin, exon 5 3561 3683 thrombospondin, exon 6 4439 + 4532 thrombospondin, exon 7 pre-msg 484 > 4915 thrombospondin mRNA and introns IVS 667 1193 thrombospondin intron A IVS 1289 1579 thrombospondin intron B IVS 2140 2946 thrombospondin intron C IVS 3023 3266 thrombospondin intron D IVS 3467 3560 thrombospondin intron E IVS 3684 4438 thrombospondin intron F IVS 4533 > 4915 thrombospondin intron G BASE COUNT 1188 a 1342 c 1100 g 1282 t 3 others ORIGIN Chromosome 2, region F. 1 gggatccccc acccaccagg attgactcag gaggcagctt cctttaaggg gtccaacctt 61 ttgccctctt ttatttttaa accagactgt ctcgatcccc aagcgtctct gtcatggaat 121 tattcaagga gatgtgcttt aatgaaagcc tccctaaagg gtcttaggtg gtccccaaag 181 aagcatcgcg tctggggact ctcatgcaga acgtctccag ttcacatggc gcaagatcct 241 aagcgctaaa ggctgagtac gccaaggctg cgtgggcgga gacctatttt ttctgacaag 301 ttccaggggc tcctgtgcgg gatcggagtc tcccccttca ctttcagccc gagagctgtg 361 cgccaagcag cagggggcgg agaaatgaac ccgtccgtcc ccgtcccccg cccccgcccc 421 cagaaccctc ccccctccct gcttcctgcc tctttaaaag gtcctgggct cctcagtcaa 481 gccagccact gcctggagtc agccagcctc atcggacttc tgcaggcaat cgcgaagctg 541 ctatccagtt ctgccacggt ctctcccggc gcaccggcag tctcagcgtc ttcaccggac 601 tcagcgtcct tgtccttcac ttcacctttg ccacctctcc gggttactga gccccggtgc 661 acacaggtaa acctccgcct ctgcggagca cctggccggc cacctcaaag ccccaggctg 721 cagctgcaga tgatcagcgc ttctcctcat ctcctactct tccccttctg gttttcctaa 781 ctcctttatc ctgccccgtc tatcccctgg aaagaacttt ccggagctgg ttctcttcag 841 ccttcgggtt cacaaactat acctccaaga caccccaaga ttttctcccc agtagtctcc 901 aggttacccc tgactgtgcc atccaaaata cattctgtaa tctgtctaaa ggggcggggg 961 ggggagggac ggacccaagt cactttatac cctttatcct ggaactccaa caatccccct 1021 gcctctggcg ctctttttat actggatacc gtttggaaag taaacccttc tttttcagca 1081 catatggggc aaaagaagaa acacacacac acacacacac acacacacac acctctgccc 1141 ttccaccttg cttcctggtc tctgactctg gcctcctgtg cttcttgcta caggctccgt 1201 gttgggcaca aaggctccac catggagctc ctgcggggac taggtgtcct gttcctgttg 1261 catatgtgtg gaagcaaccg cattccaggt gagtttgtga tatatcattt tcctccccct 1321 tctcctctct ctctctctct ctttctctct ctctctctct gctgctgctc tctctctctc 1381 tctctctctc tctctctctc tctctcgaag ctcctgtgtt tccttggagt ccttatctcc 1441 tgataaggag gcagcttccc tcgctcagct ttccataagt gccattattg tcacgagttc 1501 ccaccttggc tcctcacagc cagaacttta cccccttata cctagcctag aaagccgatt 1561 gtcttctctc tccttacaga gtctggggga gataacggtg tgtttgacat ctttgaactc 1621 attggaggtg cacgaagggg ccccggtcgc cgactggtga agggccaaga tctatccagc 1681 cccgccttcc gcattgagaa tgccaacctg atccccgctg tgccggatga caagttccaa 1741 gacctactgg acgctgtgtg ggccgacaaa ggcttcatct tcttggcttc cttgaggcag 1801 atgaagaaga cccggggcac actcctggct gtggaacgga aagacaacac tggccagatc 1861 ttcagtgtgg tctccaacgg caaagctggc accctggacc tgagcctgag cctgccaggg 1921 aagcaacaag tggtgtcagt ggaggaagct ctcctggcca ctggccagtg gaagagcatc 1981 acgctgtttg ttcaagagga ccgggctcaa ctctacatag actgtgataa gatggagagc 2041 gcggagctgg atgtacccat ccagagcatc ttcaccaggg atctggccag cgttgccagg 2101 ctccgagttg caaagggaga tgtcaatgac aattttcagg taaatcttct tacctgtcac 2161 aagggctatg tggaattaat atcggtagac taaaaatgat tctcaaatgn taaagaggaa 2221 cagacacccc cagaaggttg gttaaatcat attacagatg acccaaaact ccagggnttt 2281 cagatttgat agggacccga gaaattttnt tcctaataag aaataagttg ctaatggtgc 2341 caggacctgg gcacccatgc tttcaaagct atcaggagag ataagaaatg gttttcctat 2401 tactacagcc tgtttcagaa tggggtactc cctgtgccct taggaaagta acttgtatca 2461 aatgtaccta gcagttttaa tagggcatga aaaggatact ttgtcagtat ttgagtaaat 2521 gggcagggtg ttctgggtgc tttcagtatg gatctaaggc aagacatgaa taacagtcca 2581 gtcatgtgcc tctgggactc cttacttgcc ttaactttca gatatttttt tttcttacaa 2641 gcgtggcttt gaacgcccac attaactctt tgtcactgga gagtatcatg tcctccagac 2701 ttgggatcaa attccatttc ctgtgtaccc caagcagcct gaaaactgga ttctttgtaa 2761 aatgctccag ttggcagaag acacaattaa gattttgaag ttgttcggca atttatgtct 2821 ctgtgatagt ttggcaggat gatcttaagt tcttttaaac tttctgagcg ttcaccacaa 2881 gcaaactcag actggccgcc ccacccccac gcccagcatt aaaccacgtc tgtcctctgc 2941 ccacaggggg tgctgcagaa tgtgaggttt gtctttggaa ccaccccaga agacattctc 3001 aggaacaaag gctgctccag ctgtgagtac ccctgtgttt tttaagattt ggagaattct 3061 ggggaagtcc accgaaagca ccctgattaa tgtaagcagt agggtgcata cacaatatct 3121 cagggggagc agagttggct tgcacccact ctgtcatgcc aaaggaaatg tcttttataa 3181 cccagaaccc gaagggtagg ctggtaggat cagctgttga aaccttgaat cagctatcga 3241 atacgttact tcttatttct tagcagctac caacgtcctt cttacccttg acaacaacgt 3301 ggtgaacggt tccagccctg ctatccgcac caactacatc ggccacaaaa caaaggacct 3361 ccaagctatc tgtggcctct cctgtgatga actatccagc atggtcctgg aactgaaggg 3421 cctgcgcacc atcgtgacca ctctgcagga cagcatccga aaagtggtca gtggcctccc 3481 ctcccatttg ccagctcttg gaaatcacca caggtggtcc cacgggagcc tggacactca 3541 tgagccttct cccctttcag acggaagaga acagagagct ggtcagtgag ctgaagcggc 3601 ctcccctctg ctttcacaat ggagtccagt acaagaacaa cgaggagtgg actgtagaca 3661 gttgcacaga gtgtcactgc caggtaagag acactcacag actgcaataa gacttggaga 3721 catatgtctc gggagatacg taaatgacta cacacatttt taaatgtgct ctggcctgct 3781 accaaagtga ttttttttct ttaatatgta atgacatctg caaattcctc tttaaatatc 3841 agaaggttta tcttaataat accccaggct gagcagctaa gagaacaaat ctatctaaac 3901 gttggaatga cttacctgct ttagttcaaa gaccaagtat tgagtgtcag atacataaaa 3961 cccgcttaaa gcattcggtt ttggccagcc ctaatttgta ttcagctctg gcaggcagat 4021 tcttctaaca ctgttcggat ctctggctca ccgggaaaag tttaaatgca aatcactggt 4081 tactgaattt gagacacaac ttcaattaag ttcccgtctc ccttcaatat gtgataaaaa 4141 cccttatcta cctttgagat gcttactgat acctgcaatg tccgttgctt tcccttatgc 4201 aaatgtgttg ttctgtaact tccacccctg actgagtagc agttaaaaag aactttccca 4261 aagaatggag tgagaattag ccctggaact gtatacctct ggggactaca gatccatgag 4321 acatccctcc tataaaagcc acccctcctg tgtcctggct tataatggca atgtcacaga 4381 aagctcctct ctgcaatacg cttggtaaca tcgttcctct ccttctcttg ccttgcagaa 4441 ctcggttacc atctgcaaaa aggtgtcctg tcccatcatg ccctgctcca acgccacagt 4501 tcctgatggt gaatgctgcc cacggtgctg gcgtaagttc ctaaaatggt gtagccatcc 4561 ctcagtggct agctcagtct gggggcatcc tgacagctgg gcacaacacc agagcccagc 4621 ccctcccagc tccatcagac gttccagtca gatctgggtt atggcatagg caatggatga 4681 tcttagcaga gcttatggat tctggtttta tgctataagg agagagatgg atacagcagt 4741 agacttaagc tggttgcttt tagcaattga tctttgttga aaaaaatgat ctactggaga 4801 acggacagaa actgtgtaag cattgggctg aggcttgaaa agccacctgt tttatctgca 4861 gtgagaggag gcctaaggct tagcatctgt tcattcaggc agtaaagaaa tcagc // LOCUS MUSTHBS12 1145 bp ds-DNA ROD 03-AUG-1990 DEFINITION Mouse thrombospondin (THBS1) gene, exons 8 and 9. ACCESSION J05606 KEYWORDS thrombospondin. SEGMENT 2 of 2 SOURCE Mouse (strain BALB/C) DNA, clone phi-42. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 1145) AUTHORS Bornstein,P., Alfi,D., Devarayalu,S., Framson,P. and Li,P. TITLE Characterization of the mouse thrombospondin gene and evaluation of the role of the first intron in human gene expression JOURNAL J. Biol. Chem. (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable [or printed] sequence for [1] kindly submitted by P.Bornstein, 28-JUN-1990. FEATURES from to/span description pept + 297 470 thrombospondin, exon 8 969 > 1145 thrombospondin, exon 9 pre-msg < 1 > 1145 thrombospondin mRNA and introns IVS < 1 296 thrombospondin intron G IVS 471 968 thrombospondin intron H BASE COUNT 302 a 294 c 260 g 289 t ORIGIN 820 bp after segment 1. 1 tgaccgtagt gctagattgc tgtgctgtga gagagatgtg tgatgctcca ttatcattat 61 cagtcagact cttgatagcc aagcgtctga gagtacataa atgtacacaa tctctagttg 121 ttaatcatgc ttccattaaa ctaagagtat gaaaaaagaa ggactttttt taaaaaaaga 181 aacctattgc tttaccacct ctggctataa agggccataa gcatataagt aaataaactg 241 cctaagaaca gtttatctta gacatgagat gctaacgacc tcatttctcc ccacagccag 301 cgactctgct gacgatggct ggtctccctg gtctgagtgg acctcctgct ctgccacatg 361 tggcaatgga attcagcaac gtggtcgttc ctgtgacagc ctcaacaaca gatgcgaggg 421 ctcttcggta cagacgagga cctgccacat tcaggagtgt gacaaaagat gtaagcatca 481 tggctgccaa gggtggcaat ctcccttgcc cagtgcttcc ctcacaaaga gcagctccgg 541 agcggggagc ctgagaatgc aggttctgag cctgcggctc tcagtctcaa tcacacctct 601 aacctttgca ctcctgggct tgaccttcct tccctgaggc tcctcctcat ctgtaacgag 661 agaacaataa ccattcctga taaagttact ttagccattt ctctagataa ttcatatgaa 721 acaatcagaa tggatctggg ctcctgacca cgaacctata aatattaggt tgttgacagc 781 aactcacaca cacagacaaa taccatcttc cctatagttt ctccactgaa gtgtggcagt 841 agccagtttc tacaatacac tagttgaagc tgcttttggt gggctgctgg aatgattgcc 901 tgctgagact tgctgtgtgg ccccctccaa ggcagcaccc attggtgaca atcaactttg 961 cccttcagtt aaacaggatg gtggctggag tcactggtct ccatggtcgt cctgttctgt 1021 gacctgtggt gacggtgtga tcacaaggat ccgtctctgc aactccccca gcccccagat 1081 gaacgggaag ccctgtgaag gtgaagcccg ggagaccaaa gcctgcaaga aagacgcctg 1141 cccaa // LOCUS MZERRUN 50 bp ss-rRNA RNA 03-AUG-1990 DEFINITION Z.mays ribosomal RNA. ACCESSION M37178 M35661 KEYWORDS ribosomal RNA. SOURCE Z.mays rRNA. ORGANISM Zea mays Eukaryota; Plantae; Embryobionta; Magnoliophyta; Liliopsida; Commelinidae; Cyperales; Poaceae. REFERENCE 1 (bases 1 to 50) AUTHORS Hamby,R.K. and Zimmer,E.A. TITLE Ribosomal RNA sequences for inferring phylogeny within the grass family (Poaceae) JOURNAL Pl. Syst. Evol. 160, 29-37 (1988) STANDARD simple staff_review COMMENT The ribosomal RNA sequences given are either 26S or 18S, but it is not specifically delineated in the article. FEATURES from to/span description rRNA < 1 > 50 ribosomal RNA BASE COUNT 12 a 17 c 10 g 11 t ORIGIN 1 acgagacctc agcctgctaa ctagctatac ggagccatcc ctccgtagtt // LOCUS PNURRUN 50 bp ss-rRNA RNA 03-AUG-1990 DEFINITION P.nudum ribosomal RNA. ACCESSION M37179 M35661 KEYWORDS ribosomal RNA. SOURCE P.nudum rRNA. ORGANISM Psilotum nudum Eukaryota; Plantae; Thallobionta; Psilotophyta; Psilotopsida; Psilotales; Psilotaceae. REFERENCE 1 (bases 1 to 50) AUTHORS Hamby,R.K. and Zimmer,E.A. TITLE Ribosomal RNA sequences for inferring phylogeny within the grass family (Poaceae) JOURNAL Pl. Syst. Evol. 160, 29-37 (1988) STANDARD simple staff_review COMMENT The ribosomal RNA sequences given are either 26S or 18S, but it is not specifically delineated in the article. FEATURES from to/span description rRNA < 1 > 50 ribosomal RNA BASE COUNT 11 a 16 c 12 g 11 t ORIGIN 1 acgagacctc agcctgctaa ctagttacgc gaaggatcct cttcgtggcc // LOCUS RATRGK5 1883 bp ds-DNA ROD 03-AUG-1990 DEFINITION Rat voltage-gated potassium channel protein (RGK5) gene, complete cds. ACCESSION M30312 KEYWORDS voltage-gated potassium channel protein. SOURCE Rat (Sprague Dawley) adult liver DNA, clone lambda RGK5. ORGANISM Rattus norvegicus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 1883) AUTHORS Douglass,J., Osborne,P.B., Cai,Y.-C., Wilkinson,M., Christie,M.J. and Adelman,J.P. TITLE Characterization and functional expression of a rat genomic DNA clone encoding a lymphocyte potassium channel JOURNAL J. Immunol. 144, 4841-4850 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by J.O.Douglass 01-DEC-1989. FEATURES from to/span description pept 35 1612 voltage-gated potassium channel protein BASE COUNT 380 a 567 c 489 g 447 t ORIGIN 1 ctgcagagcc aggctacgcg agctgccgcc agacatgacc gtggtgcccg gggaccacct 61 gctggagcca gaagcggcgg gaggcggcgg cggggacccg cctcagggag gctgtgtcag 121 tggcggcggc tgcgaccgct acgaaccgct gccgcccgcg ctgcccgccg cgggcgagca 181 ggattgctgc ggggagcgcg tggtcatcaa catctccggg ctgcgcttcg agacacagct 241 caagaccctc tgccagttcc ctgagacgct gctaggcgac cccaagcggc gcatgcgata 301 cttcgacccg ctccgcaatg agtacttctt cgaccgcaac agacccagct tcgacgccat 361 cctctactac taccagtccg ggggccgcat ccgccggccg gtcaacgtgc ccatcgacat 421 cttctccgag gagatccgct tctaccaact gggtgaggag gctatggaga agttccgtga 481 ggacgagggc ttcctgcggg aggaggagcg acccctgccc cgccgtgact tccagcgcca 541 ggtgtggctg ctcttcgaat accccgagag ctcgcggccg gcccggggca ttgccatcgt 601 gtcagtgctg gtcattctca tctccattgt catcttctgc ttggagacac tacccgagtt 661 tcgcgacgag aaggactatc ccgcctctcc gtcgcaggac gtgtttgagg ctgccaacaa 721 cagcacgtcg ggggcctcct ctggagcctc cagcttctcg gaccccttct tcgtagtgga 781 gaccctgtgc atcatctggt tctcctttga gctgctggtg cgattctttg cttgccccag 841 taaagccacc ttctccagaa atatcatgaa cctgatagac attgtagcca tcatccctta 901 ttttattact ctgggcactg agctggctga gcgacagggt aatgggcagc aggctatgtc 961 actggccatc ctgagggtca tccgcctagt aagggtcttc cgcatcttca agctctcccg 1021 ccattctaag gggctgcaga tcctgggaca gacactgaag gcttccatgc gagagctggg 1081 gctgctcatt ttcttccttt tcattggggt catccttttc tccagtgcag tctactttgc 1141 tgaggcagac gacccttctt cgggttttaa cagtatcccg gatgccttct ggtgggccgt 1201 ggtaaccatg acaactgtcg gttatggtga tatgcaccca gtgaccatag gaggcaagat 1261 tgtgggctct ctttgtgcca tcgcaggtgt cttgaccatt gcattgccgg ttcctgtgat 1321 tctttccaac ttcaattact tctaccaccg ggagacagaa ggggaagagc aagcccagta 1381 catgcacgtg ggaagctgcc agcacctctc ctcttcagca gaggagctcc gaaaagcccg 1441 gagtaactcc actctgagta agtcggagta tatggtgatc gaagaggggg gtatgaacca 1501 cagcgccttc ccccagaccc ccttcaaaac gggcaactcc actgccactt gcaccaccaa 1561 caataatccc aactcctgtg tcaacatcaa aaagatattc actgatgtct aatagatgat 1621 acgattgcca ttctgtgccc agtattgtgt ggaacatgcc cccttggtct gtgtatgccc 1681 ttgatttata catttccaga ccattcatca aggaaagtac atgaagaagt ggaaaagcac 1741 acttcattct ccctctccct attgcttcat actgaaacag gtgcctggtt ttgcaagtgg 1801 gctcattctc tcagctcttt ttctctctcc ctctctctct ccctgtttct taattttgtg 1861 aacaacaaac ttacattaag ctt // LOCUS RICRRUN 50 bp ss-rRNA RNA 03-AUG-1990 DEFINITION O.sativa ribosomal RNA. ACCESSION M37173 M35661 KEYWORDS ribosomal RNA. SOURCE O.sativa rRNA. ORGANISM Oryza sativa Eukaryota; Plantae; Embryobionta; Magnoliophyta; Liliopsida; Commelinidae; Cyperales; Poaceae. REFERENCE 1 (bases 1 to 50) AUTHORS Hamby,R.K. and Zimmer,E.A. TITLE Ribosomal RNA sequences for inferring phylogeny within the grass family (Poaceae) JOURNAL Pl. Syst. Evol. 160, 29-37 (1988) STANDARD simple staff_review COMMENT The ribosomal RNA sequences given are either 26S or 18S, but it is not specifically delineated in the article. FEATURES from to/span description rRNA < 1 > 50 ribosomal RNA BASE COUNT 11 a 19 c 11 g 9 t ORIGIN 1 acgagacctc agcctgctaa ctagctatgc ggagccatcc ctccgcagct // LOCUS SAHRRUN 50 bp ss-rRNA RNA 03-AUG-1990 DEFINITION S.officinarum ribosomal RNA. ACCESSION M37174 M35661 KEYWORDS ribosomal RNA. SOURCE S.officinarum rRNA. ORGANISM Saccharum officinarum Eukaryota; Plantae; Embryobionta; Magnoliophyta; Liliopsida; Commelinidae; Cyperales; Poaceae. REFERENCE 1 (bases 1 to 50) AUTHORS Hamby,R.K. and Zimmer,E.A. TITLE Ribosomal RNA sequences for inferring phylogeny within the grass family (Poaceae) JOURNAL Pl. Syst. Evol. 160, 29-37 (1988) STANDARD simple staff_review COMMENT The ribosomal RNA sequences given are either 26S or 18S, but it is not specifically delineated in the article. FEATURES from to/span description rRNA < 1 > 50 ribosomal RNA BASE COUNT 11 a 19 c 11 g 9 t ORIGIN 1 acgagacctc agcctgctaa ctagctatgc ggagccatcc ctccgcagct // LOCUS SRGRRUN 50 bp ss-rRNA RNA 03-AUG-1990 DEFINITION S.bicolor ribosomal RNA. ACCESSION M37175 M35661 KEYWORDS ribosomal RNA. SOURCE S.bicolor rRNA. ORGANISM Sorghum bicolor Eukaryota; Plantae; Embryobionta; Magnoliophyta; Liliopsida; Commelinidae; Cyperales; Poaceae. REFERENCE 1 (bases 1 to 50) AUTHORS Hamby,R.K. and Zimmer,E.A. TITLE Ribosomal RNA sequences for inferring phylogeny within the grass family (Poaceae) JOURNAL Pl. Syst. Evol. 160, 29-37 (1988) STANDARD simple staff_review COMMENT The ribosomal RNA sequences given are either 26S or 18S, but it is not specifically delineated in the article. FEATURES from to/span description rRNA < 1 > 50 ribosomal RNA BASE COUNT 11 a 19 c 11 g 9 t ORIGIN 1 acgagacctc agcctgctaa ctagctatgc ggagccatcc ctccgcagct // LOCUS TRDRRUN 50 bp ss-rRNA RNA 03-AUG-1990 DEFINITION T.dactyloides ribosomal RNA. ACCESSION M37176 M35661 KEYWORDS ribosomal RNA. SOURCE T.dactyloides rRNA. ORGANISM Tripsacum dactyloides Eukaryota; Plantae; Embryobionta; Magnoliophyta; Liliopsida; Commelinidae; Cyperales; Poaceae. REFERENCE 1 (bases 1 to 50) AUTHORS Hamby,R.K. and Zimmer,E.A. TITLE Ribosomal RNA sequences for inferring phylogeny within the grass family (Poaceae) JOURNAL Pl. Syst. Evol. 160, 29-37 (1988) STANDARD simple staff_review COMMENT The ribosomal RNA sequences given are either 26S or 18S, but it is not specifically delineated in the article. FEATURES from to/span description rRNA < 1 > 50 ribosomal RNA BASE COUNT 11 a 17 c 11 g 11 t ORIGIN 1 acgagacctc agcctgctaa ctagctatgc ggagccatcc ctccgtagtt // LOCUS WHTRRUN 50 bp ss-rRNA RNA 03-AUG-1990 DEFINITION T.aestivum ribosomal RNA. ACCESSION M37177 M35661 KEYWORDS ribosomal RNA. SOURCE T.aestivum rRNA. ORGANISM Triticum aestivum Eukaryota; Plantae; Embryobionta; Magnoliophyta; Liliopsida; Commelinidae; Cyperales; Poaceae. REFERENCE 1 (bases 1 to 50) AUTHORS Hamby,R.K. and Zimmer,E.A. TITLE Ribosomal RNA sequences for inferring phylogeny within the grass family (Poaceae) JOURNAL Pl. Syst. Evol. 160, 29-37 (1988) STANDARD simple staff_review COMMENT The ribosomal RNA sequences given are either 26S or 18S, but it is not specifically delineated in the article. FEATURES from to/span description rRNA < 1 > 50 ribosomal RNA BASE COUNT 11 a 19 c 11 g 9 t ORIGIN 1 acgagacctc agcctgctaa ctagctatgc ggagccatcc ctccgcagct // LOCUS SUSFCK 3985 bp ss-mRNA INV 03-AUG-1990 DEFINITION S.purpuratus flagellar creatine kinase mRNA, complete cds. ACCESSION M33763 KEYWORDS creatine kinase; flagellar creatine kinase. SOURCE S.purpuratus sperm, cDNA to mRNA, clone WD2.2. ORGANISM Strongylocentrotus purpuratus Eukaryota; Animalia; Eumetazoa; Echinodermata; Echinozoa; Echinoidea; Echinacea; Echinoida; Strongylocentrotidae. REFERENCE 1 (bases 1 to 3985) AUTHORS Wothe,D.D., Charbonneau,H. and Shapiro,B.M. TITLE The phosphocreatine shuttle of sea urchin sperm: Flagellar creatine kinase resulted from a gene triplication JOURNAL Proc. Natl. Acad. Sci. U.S.A. 87, 5203-5207 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by D.D.Wothe, 18-APR-1990. FEATURES from to/span description pept 95 3619 flagellar creatine kinase (E.C. 2.7.3.2) signal 3961 3966 poly-A signal BASE COUNT 1184 a 923 c 910 g 968 t ORIGIN 1 ccctaaactt caatcacact tctaatcaag aaaggaacaa atttgtatta attatttcat 61 tttctacata tgcattgtaa tcagtgaccc agcaatggga tgcgcagcat cgtcacaaca 121 aacgacggcc acaggtggtc aaccagctgc tggtgaaaag gcaaacccgg cccccgctaa 181 taacaatcca aacgcagcaa acaaagcaga aacgacagga gcagctgaag aactcactaa 241 agaatctgaa ccatttgtgg aaccggatcc caattaccct gacctaagca agcacaacaa 301 ttacctggca gaatctttaa cgccttccat ctacaacaaa atctgtaacc ttcgcacact 361 ttctggttac tctgtggatg gttgcatgca gaccggcgtg gacaatccag gacatccttt 421 catcaagact gtcggattgg tggctgggga tgaagagtgt tacgatctct ttgctgacct 481 ctttgatcca actattgaca aaagacacaa cggatatccc cgcaatgcaa agcacacaac 541 tgatctcaac cctgatcatc tcaagggtgg tgatgacttt gatccaaagt atgtcctttc 601 atgcagagtt cgtacaggcc gctgcattag agggtacggc ctcccaccac attgcacgag 661 ggctgaacgt cgagacgttg agaaagtttg caaagacgca ctggcaactc tagatgggcc 721 actcaaaggc acgtattacc cactaacggg gatgacagag gaaatgcagg acaaactaat 781 tgcagatcat ttcctattcg ataagcccgt ttctcctttg cttatgtcag ctcgcatggc 841 ccgtgattgg cctgatggac gtgggatctg gcacaatgcc gataaaaact tcctggtttg 901 gataaatgag gaagatcaca ctcgggttat ttccatggaa acaagtggca acatgaagaa 961 tgtctttaaa cgattctgta atggtctcaa caaagtcgag aatgctctca aagcaaaagg 1021 gtatgaattc tcatggaatg agcatcttgg gtatgtgctc acttgtccat ccaaccttgg 1081 tacaggtgta cgtgcaggag tccacatcaa gattcctctt ttcagcaagc acgctggatt 1141 tgaatccatt ctgaagcatt atcgtctcca gaaacgtggc acaggaggcg tggatacggc 1201 atccactgac ggcacgttcg acatttcaaa cctagatcgt cttggcactt ctgaagtaca 1261 acaggttcaa tccgttgttg atggtgtcaa gaagctgatt gagttggaaa aagccttaga 1321 aaagggatca gatatcagcg gccagattcc ccgggaccca gcaatagttc gtgcagaaca 1381 agtgaaagaa ggatatccag acctatccaa acacaacaat catctggccc actgtctgac 1441 atacgacatt tggaaaagtc tcaaggacaa aaagacgcca agtggcttta ctctggatgg 1501 ctgcatccaa acgggagtca tgaaccctgg tcatcctcac atcatgaccg ttggaatggt 1561 tgcaggagat gaggaatcat atgacgtctt cgctgatatc tttgacccgg tcattgatgc 1621 acgacacgga ggttatccca aagatgccgt ccatgtcacc aacataaacc atgcagatct 1681 taaaggagga gataacctgg atcccaaata tgttttgtct tgccgtgttc gcacaggacg 1741 cagcatcatc gggtattctc tgcccccgca ttgcaccgtg gaagagagag ctgccgtaga 1801 gacgatcacg atcggagctc ttgataaatt tgatggagac ttgcaaggaa aatactaccc 1861 actggaagga atgtctgatg aaacacaaac acagctgata gatgaccatt tcctcttcga 1921 taagcctgtt tctcctcttc ttaccgccgc aagaatgcat cgtgattggc cacagggaag 1981 aggaatttgg cataacgaaa acaagaactt cttggtgtgg gtcaatgaag aagatcatat 2041 ccgtgtcatc tctatggaga aggatgggaa tatgcgagcg gtcttcaagc gattctgtga 2101 aggtttgcag aagttcgagc agatgatcaa aaaggatggc aaggagttca tgtggaacaa 2161 acatctgggt tacgtactta cttgtccgtc taacctggga accggcctac gcgctggcgt 2221 acatgtgaaa ctgcctctgc tgagcaagta tccgagattc gaccagatcc taagagccct 2281 ccgacttcag aagagaggta caggtggagt agatactgcc tcaactgatg gaaccttcga 2341 catctcgaac ctcgacaggc ttggatcatc agaagtacag caagttcagt ttgttgtcga 2401 tggcgttgaa cttctcgtcc aaatggagaa gaaacttgaa aagggcgagg acatttttga 2461 tattctacca cagcaatgcc gtcccaaacc accaataaag cctttcagtt atgactaccc 2521 cgatttctct ctgcacaaca actggatgtc taaatgtatg acagaagaaa tctacaacaa 2581 actttgcaac ttgaagacaa aaggtggagt caccctgaat gactgcattc aaactggcat 2641 tgacaatcct ggacatccat acatcatgac tgttggtttg gtagctggag atgaagagtg 2701 ctacgaggtt ttcgctcctc tcttcgaccc ggtgataagt gccaggcacg gcggctacgc 2761 tcttgatgcc aaacacccta ccaacctcaa tgcagcagaa ctcaaaggcg gagatgattt 2821 ggaccctgag ttcgtcctat cttgtcgtgt acgtactggt cgttgcatcc gcggcttggc 2881 tctcccgcca tgttgcacca gagcagaacg tgcggaagta gagaagatta caacagaagc 2941 attatctacc ctaagtggac ctctgaaggg aaagtactac ccactaaccg gtatgacaga 3001 cgaagagcaa gagaagctca tcgaagacca cttcctcttt gacaagcccg tatccccgct 3061 gttgctttgt gccaatatgg cacgtgactg gccccagggt cgtggcatat ggcacaatga 3121 tgagaagaat ttccttgtct gggtcaacga ggaagatcac actcgagtca tctccatgga 3181 gaagagcggg aacatgaagc gagtgtttga gaggttctgc gacggtctca aaaaggtcga 3241 agactccatc aagtccaagg gttaccagtt catgtggaat gagcatcttg gttatgtact 3301 gacatgtcca tccaacctcg ggacgggact gcgtgcaggc gtccatgtca aggttccttt 3361 gttaagtcaa cagaaaatct ttgactctat ccttgatcac atgcgtctgc aaaagcgagg 3421 aacaggaggt gtagacacag cctctactga cggcaccttt gatatctcca actcggatcg 3481 catcggtttc tccgaagtcc atcttgtcca acaactcgta gacggtgtca agctcctcgt 3541 caacctcgaa aaggctctca tgaagggaga agacatcaac agtctcctgc cagagaagct 3601 aagggaagat tcctcataaa gccaggatcc acagcttgca gaatagtggc cgttgtcgcg 3661 caagtgttat aaatgccacc tttagaagat gtagcctaca cattaaaaaa aaatacaatt 3721 agaattgaaa tctataagga cgatttgaat ataccataaa gatttgccca tgattttgaa 3781 tcaaaataac ttcatgctga acttgtaaat gaaaaagtaa tgttagaaaa ctctgtaaat 3841 tgggaagata taaaatttca tggccgtttt tctttctgat gaagtacact aattatatta 3901 atttctctta ttaatatgta tggaaacttt taattatgta aaaacaagga aacgaactga 3961 aatacaattc aaactacttt tcatt // LOCUS BSUSACP 3518 bp ds-DNA BCT 03-AUG-1990 DEFINITION B.subtilis sacT gene, complete cds, enzyme II (sacP) gene (of the sucrose phosphotransferase system), complete cds, and sucrase gene, 5' end. ACCESSION J03006 M33761 KEYWORDS enzyme II; sucrase; sucrose transferase system. SOURCE B.subtilis (strain Marburg 168) DNA, clone pBSG8-10. ORGANISM Bacillus subtilis Prokaryota; Bacteria; Firmicutes; Endospore-forming rods and cocci; Bacillaceae. REFERENCE 1 (bases 1987 to 3518) AUTHORS Fouet,A., Arnaud,M., Klier,A. and Rapoport,G. TITLE Bacillus subtilis sucrose-specific enzyme II of the phosphotransferase system: Expression in Escherichia coli and homology to enzymes II from enteric bacteria enzymes II from enteric bacteria JOURNAL Proc. Natl. Acad. Sci. U.S.A. 84, 8773-8777 (1987) STANDARD full staff_review REFERENCE 2 (bases 1 to 2070) AUTHORS Debarbouille,M., Arnaud,M., Fouet,A., Klier,A. and Rapoport,G. TITLE The sacT gene regulating the sacPA operon in Bacillus subtilis shares strong homology with transcriptional antiterminators JOURNAL J. Bacteriol. 172, 3966-3973 (1990) STANDARD full staff_review COMMENT Draft entry and printed copy of sequence for [1] kindly provided by A.Fouet, 16-SEP-1987. Draft entry and computer-readable sequence for [2] kindly submitted by M.Debarbouille 18-APR-1990. FEATURES from to/span description pept 136 897 sacT protein pept 2008 3390 enzyme II (sacP) pept 3387 > 3518 sucrase BASE COUNT 1013 a 712 c 867 g 926 t ORIGIN 1 ctttacatgg cggtcaaaag gaggtaaaat cccagtcaaa agcaaattgg cgggagagat 61 aacctcttga aaatctataa agtattaaac aacaatgcgg ctttaataaa agaggatgat 121 caggaaaaaa tcgtgatggg accgggaatc gcttttcaaa aaaagaaaaa tgatctcatc 181 cctatgaata aagtggaaaa gatctttgtc gtgcgcgacg agaatgaaaa gttcaaacaa 241 atcctgcaaa cactgccgga ggagcatata gaaatagctg aggatatcat cagctatgcg 301 gaaggagagc tcgcggcgcc gctgagcgat cacattcata tcgcgctttc cgaccatttg 361 tcctttgcga tcgaaaggat tcaaaatggg ctgcttgtgc agaataaatt gctgcatgag 421 ataaaggcgc tctataaaaa agaatatgag atcggcctgt gggcgatcgg acatgtaaaa 481 gagacattgg gcgtgtctct gcctgaagat gaagcgggtt atattgccct tcacatccat 541 acggcgaaga tggatgcgga gagcatgtat tcagcgctga agcatacgac catgatcaaa 601 gaaatgatag agaaaataaa acaatacttt aatcgaaagg tggatgaaaa cagcatttcc 661 tatcaacgcc ttgtcacgca tttgcgatac gcggtcagcc ggttggaatc aaatgaagcg 721 ctccatcgta tggacgagga gatgctttat ttcatccaaa aaaagtattc attcgcctat 781 caatgtgcgc tggagctggc ggagtttttg aaaaatgaat atcaattaca tttgccggaa 841 tccgaggccg gctatatcac gctgcatgtc cagcgtcttc aagatctctc ggaataaccg 901 ctttgacttg cagggagtga tctctggaag tttttttatt gatcaggatt cagtgtgtag 961 tacatccagc ccatcataac tattcccgcg gtcagattgc cgagcgtaac ggggattaag 1021 tttctgactg ctcccatcag tgtcactgta tcagggtgct cgatcaaaag cgagatggcg 1081 aatgtgcaca tattggcaat gctgtgttca aagccggaaa taaagaagca gaaaacgaaa 1141 agcatcatgg taaaaagctt tgctccttcc cctttgagag acattggaat gaaaaaggcg 1201 aggcacacaa gccaattgca cagcattcct ctgaaaaaca attcggaagc cggcggctcc 1261 attttgtgct ctgccaaatg aatcaaaaag gaatgaacag aaggctcctc aaaaagtccc 1321 gtcgcgctga tcaggatggc aaacagaatg gcgccgatta aattgccggc atagcttgac 1381 atccacaagt atagggtgtc gcgccagctg atttttttcc ggagcgcggt ataggtgaaa 1441 taaaaggtgt tgccggtaaa taaatctccg cctccgtaag caatcatcag aatagcggcc 1501 ccgaaagtga cagcggctgc cggaaaggca aacggagaat cagccataaa gaaatagctg 1561 cctgttttgc ttgcggctgt aatgccaaaa ccaataaata tactggccaa aatggaccga 1621 agcacataac ggattttgct tgaagcgaat atgttttgtt ttttcaaagc atactgttca 1681 accttttgta atgcttgagt ttccattgtc ttttcctgct ttctttcatg attgagcata 1741 gtatggcacg atattgtgaa gtatttcaca attaattttt acgaattttt gtctacatgt 1801 tatatcaaac tatttaacta aaaaaatagt tgacgaaaac gctatcatga tttatgatga 1861 aagcgtattc ttaatctgaa taagcgggat tgtgactggt aaagcaggca agacctaaaa 1921 tttgcgtaaa tgaaaaagga tcgctgtgtc ctttattcgt tggcgaattt taggtctttt 1981 ttgtttaaat aaaaggggga gaaatagatg gattacaaag agactgcaaa acgcctcatt 2041 gagcttctcg gagggaaaga aaatattatc agcgcggctc attgtgcaac aagactgcgt 2101 ttagtgatga aagatgaatc aaagatagat caagcacaag tagaagagct tgatggcgtt 2161 aaaggggctt tcagcagctc tggccagtac cagatcattt tcggaacagg ccttgtgaat 2221 aaagtattcg atgccttttc taaagaagct gatatcgagc gtgaagaaca cgtcaatcat 2281 caggatgcgg caaaagaaaa gctgaatccc gctgcgagat ttgcgaaaac gctttctaat 2341 atttttgttc caatcatccc agctattgta gccagcggcc tattaatggg attgctgggt 2401 atgatcaatg cgtttcattg gatgagcaag gattctgcgc ttttgcagct gcttgatatg 2461 ttttcaagtg cagcattcat tttcttgccg attctaatcg gggtcagcgc ttcaaaagag 2521 tttggcagca atccatactt gggagcggtc atcgggggaa tcatgattca tccgaatctt 2581 ttaaatccat ggggattggc ggaagaacaa ctgattacat gcatcttttc ggatttgata 2641 tcgctcttct cggctacagg gaactgttat ccctgtcctg cttgcggtgt atgtgatgag 2701 caaggtagag aaatggacga gaaaagtggt tccacatgcg gtgatttgct tgttacacca 2761 tttgttactg tgattgtcac cgggtttgtt gcttttatcg cgattggccc tctaggcaga 2821 gcgcttggct ccggcattac ggttgcttta acttatgtat atgaccatgc cggttttgtc 2881 gcaggtctga ttttcggggg cacgtattca ctcatcgtgc tgacaggcgt tcatcacagc 2941 ttccatgcga ttgaggcagg gctaattgca gatataggca aaaactactt gctgccgatc 3001 tggtcaatgg cgaatgtcgc acagggcggg gcaggtcttg ccgtcttctt tatggcgaaa 3061 aaggcaaaga caaaagaaat cgcgcttccg gcagcatttt ccgcttttct cggcattact 3121 gagcccgtca tattcggagt caatctgcgc taccgaaaac cgtttatcgc cgcgatgatc 3181 ggaggcgcat tgggtggggc atatgtcgtc tttacacatg tagctgcaaa cgcttacggg 3241 ttaacgggaa ttccgatgat cgccattgcg gctccgtttg gcttcagcaa cttgattcac 3301 tatctgattg gcatggcgat tgcggcagtg tcagcgttca ttgcagcatt tgtcatgaag 3361 attaatgagg atgaggagag gaaaaaatga cagcacatga ccaggagctt cgtcgccggg 3421 cttatgaaga agtggagaaa aaagagccca tcgctaacag cgatccgcac cgccagcatt 3481 ttcatatcat gccgccggtt gggctgctga atgacccg // LOCUS BOVACTH 1084 bp ss-mRNA MAM 03-AUG-1990 DEFINITION Bovine corticotropin-beta-lipotropin mRNA, complete cds. ACCESSION M25587 KEYWORDS adrenocorticotropic hormone; corticotropin-beta-lipotropin; lipotropin; melanocortin. SOURCE Bovine, cDNA to mRNA, clone pSNAC20. ORGANISM Bos taurus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Artiodactyla; Ruminantia; Pecora; Bovidae. REFERENCE 1 (bases 1 to 1084) AUTHORS Cohen,S.N., Chang,A.C.Y., Nakanishi,S., Inoue,A., Kita,T., Nakamura,M. and Numa,S. TITLE Studies of cloned DNA encoding the structure for the bovine corticotropin-beta-lipotropin precursor protein JOURNAL Ann. N.Y. Acad. Sci. 343, 415-425 (1980) STANDARD simple staff_review FEATURES from to/span description pept 114 911 corticotropin-beta-lipoprotein precursor sigp 114 191 corticotropin-beta-lipoprotein signal peptide matp 342 377 gamma-melanocortin (gamma-MSH) matp 507 548 adrenocorticotropic hormone (ACTH) matp 630 908 beta-lipotropin matp 756 809 beta-melanocortin (beta-MSH) matp 816 908 beta-endorphin mRNA < 1 1084 corticotropin-beta-lipoprotein mRNA BASE COUNT 220 a 337 c 370 g 157 t ORIGIN 1 aagagaacga agggaagaaa agtgaccgag acgccgctga acatcctcgc cccggcgcag 61 cgggagccgc ccgagccagc ttccccgtga cagagcctca gcctgcctgg aagatgccga 121 gactgtgcag cagtcgttcg gccgccctgc tgctggcctt gctgcttcag gcctccatgg 181 aagtgcgtgg ttggtgcctg gagagcagcc agtgtcagga cctcaccacg gaaagtaacc 241 tgctggcgtg catccgggcc tgcaagcccg acctctccgc cgagacgccg gtgttccccg 301 gcaacggcga tgagcagccg ctgactgaga acccccggaa gtacgtcatg ggccatttcc 361 gctgggaccg cttcggccgt cggaatggta gcagcagcag cggagttggg ggcgcggccc 421 agaagcgcga ggaggaagtg gcggtgggcg aaggccccgg gccccgcggc gatgacgccg 481 agacgggtcc gcgcgaggac aagcgttctt actccatgga acacttcccc tggggcaagc 541 cggtgggcaa gaagcggcgc ccggtgaagg tgtaccccaa cggcgccgag gacgagtcgg 601 cccaggcctt tcccctcgaa ttcaagaggg agctgaccgg ggagaggctc gagcaggcgc 661 gcggccccga ggcccaggct gagagtgcgg ccgcccggcc tgagctggag tatggcctgg 721 tggcggaggc ggaggctgag gcggccgaga agaaggactc ggggccctat aagatggaac 781 acttccgctg gggcagcccg cccaaggaca agcgctacgg cgggttcatg acctccgaga 841 agagccaaac gccccttgtc acgctgttca aaaacgccat catcaagaac gcccacaaga 901 agggccagtg agggcgcagc gggcaggggc ctctctccgc ggaaagttga ccctgaaggc 961 ctctcttctg ccctcctacc gcctcgcagc ctgggtgagg attcgcccag gcagtgatgg 1021 cgccaggtat cccgactctt aaagctgtct gtagttaaga aataaaacct ttcaagtttc 1081 acga // LOCUS ECOK88ACF 861 bp ds-DNA BCT 03-AUG-1990 DEFINITION E.coli K88ac fimbriae subunit gene, 5' end. ACCESSION M35954 KEYWORDS K88ac fimbriae subunit; antigen. SOURCE E.coli (K12, strain MC1000) DNA. ORGANISM Escherichia coli Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Facultatively anaerobic rods; Enterobacteriaceae. REFERENCE 1 (bases 1 to 861) AUTHORS Josephsen,J., Hansen,F., de Graaf,F.K. and Gaastra,W. TITLE The nucleotide sequence of the protein subunit of the K88ac fimbriae of porcine enterotoxigenic Escherichia coli JOURNAL FEMS Microbiol. Lett. 25, 301-306 (1984) STANDARD simple staff_review FEATURES from to/span description pept 13 > 861 K88ac fimbriae subunit precursor sigp 13 75 K88ac fimbriae subunit signal peptide matp 76 > 861 K88ac fimbriae subunit BASE COUNT 227 a 154 c 230 g 250 t ORIGIN 1 gggtttaatc ttatgaaaaa gactctgatt gcactggcaa ttgctgcatc tgctgcatct 61 ggtatggcac atgcctggat gactggtgat ttcaatggtt cggtcgatat cggtggtagt 121 atcactgcag atgattatcg tcagaaatgg gaatggaaag ttggtacagg tcttaatgga 181 tttggtaatg tattgaatga cctgaccaat ggtggaacca aactgaccat tactgttact 241 ggtaataagc caattttgtt aggccgaacc aaagaagcat ttgctacgcc agtaactggt 301 ggtgtagatg gaattcctca tattgcattt actgactatg aaggagcttc tgtagtactc 361 agaaaccctg atggtgaaac taataaaaaa ggtttagcat attttgttct gccgatgaaa 421 aatgcagagg gcactaaagt tggttcagtg aaagtgaatg catcttatgc cggtgtgtta 481 gggagaggtg gggttacttc tgcggacggg gagctgcttt cgctttttgc cgacgggttg 541 agcgctatct tttatggtgg tttgccgagg ggttctgaat cgtcggctgg gagtgccgca 601 gcggcgcgca caaagttgtt tggaagtcta tcaagaaatg atattctcgg acagattcaa 661 agagtaaacg caaatattac ttctcttgtt gacgtcgcag gttcttacag ggaaaacatg 721 gagtacactg atggaactgt tgtttctgct gcctatgcac tgggtattgc aaacggtcag 781 actattgagg caacttttaa tcaggctgta actaccagca ctcagtggag cgctccgctg 841 aacgtagcaa taacttatta c // LOCUS LBARNII 400 bp ds-DNA BCT 03-AUG-1990 DEFINITION L.acidophilus RNA polymerase (RNAI and RNAIIa) genes, 5' promoter region. ACCESSION M35739 KEYWORDS RNA polymerase. SOURCE L.acidophilus DNA, clone pRNL5. ORGANISM Lactobacillus acidophilus Prokaryota; Bacteria; Firmicutes; Regular asporogenous rods; Lactobacillaceae. REFERENCE 1 (bases 1 to 400) AUTHORS Natori,Y., Kano,Y. and Imamoto,F. TITLE Characterization and promoter selectivity of Lactobacillus acidophilus RNA polymerase JOURNAL Biochimie 70, 1765-1774 (1988) STANDARD simple staff_review FEATURES from to/span description mRNA 143 > 400 RNAI mRNA mRNA 209 < 1 (c) RNAIIa mRNA BASE COUNT 149 a 69 c 67 g 115 t ORIGIN 1 tgcgcggggt tcgagctacg gccaaaagaa ccccgcgttg gaacatctgg cacaaacttt 61 ttctcatttt taactttttt ttaatttaat ttagacaaaa aaagaagcct taactggctt 121 cgatctgtgg tataattttt ttaaagtaaa caaaaaaagt gctaactaaa acaacaacaa 181 cagcactttt tcaatcaact tttcaacttc gaatttatta tagcataaaa ctagtaaaaa 241 taaatacaaa aaaaaattgc ctgtattatc ttcgaagctg ggaagttgtc ccactatcgg 301 aggttttttt tatgggaaaa atcgtaactt taccacaagg ataggctagt cagcctaagt 361 ctaaaaagaa aaaaagcaag aagagttcag gcagtgcaag // LOCUS LEIKPGRA 44 bp ss-RNA RNA 03-AUG-1990 DEFINITION L.tarentolae kinetoplast minicircle guide RNA (COIII gRNA-I). ACCESSION M33870 KEYWORDS guide RNA. SOURCE L.tarentolae (strain UC) kinetoplast minicircle D-12-1, guide RNA. ORGANISM Kinetoplast Leishmania tarentolae Eukaryota; Animalia; Protozoa; Sarcomastigophora; Mastigophora; Kinetoplastida; Trypanosomatina; Trypanosomatidae; Leishmania tarentolae. REFERENCE 1 (bases 1 to 44) AUTHORS Sturm,N. and Simpson,L. TITLE Kinetoplast DNA minicircles encode guide RNAs for editing of cytochrome oxidase subunit III mRNA JOURNAL Cell 61, 879-884 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by L.Simpson 13-APR-1990. BASE COUNT 17 a 7 c 9 g 11 t ORIGIN 1 ctatctttac gaggtaaaga cagagagatg aaaacactat tcgt // LOCUS LEIKPGRB 46 bp ss-RNA RNA 03-AUG-1990 DEFINITION L.tarentolae mitochondrial minicircle guide RNA (4E-2 gRNA). ACCESSION M33871 KEYWORDS guide RNA. SOURCE L.tarentolae mitochondrial minicircle Lt154, guide RNA. ORGANISM Mitochondrion Leishmania tarentolae Eukaryota; Animalia; Protozoa; Sarcomastigophora; Mastigophora; Kinetoplastida; Trypanosomatina; Trypanosomatidae; Leishmania tarentolae. REFERENCE 1 (bases 1 to 46) AUTHORS Sturm,N. and Simpson,L. TITLE Kinetoplast DNA minicircles encode guide RNAs for editing of cytochrome oxidase subunit III mRNA JOURNAL Cell 61, 879-884 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by L.Simpson 13-APR-1990. BASE COUNT 27 a 5 c 5 g 9 t ORIGIN 1 ataaaacaca acaaaaaaca tagagaaatc atagagtgtt aaatat // LOCUS MUSIFNRES 508 bp ds-DNA ROD 03-AUG-1990 DEFINITION Mouse MHC class II-associated invariant chain gene, exon 1. ACCESSION M35872 KEYWORDS MHC class II-associated invariant chain. SOURCE Mouse DNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 508) AUTHORS Eades,A.-M., Litfin,M. and Rahmsdorf,H.J. TITLE The IFN-gamma response of the murine invariant chain gene is mediated by a complex enhancer that includes several MHC class II consensus elements JOURNAL J. Immunol. 144, 4399-4409 (1990) STANDARD simple staff_review FEATURES from to/span description pept 432 > 508 invariant chain protein, exon 1 mRNA 348 > 508 invariant chain protein mRNA site 87 158 IFN-gamma response element signal 267 272 SP1 binding site signal 255 258 CAAT box signal 315 319 TATA box BASE COUNT 133 a 131 c 140 g 104 t ORIGIN 1 gggtgggggt gggtagacaa aggcactttc tgcttcaaaa ataattcctt ccctggggaa 61 agtgcctgct tctagaaccg tccaaagagc ctgatgaatc cagaagtctg cctagaaaca 121 agtgatgata gccctggcca gccaatggga tcatgcaggc ctttctacct gtttagggaa 181 ctcccccttc atcctgccca gggaggcagc tttgagtgag tggggaattt ccagatttgt 241 ggctttcagt tccacatcta ccatgtgggc ggagtgacct gctgtgggcg aatcagattc 301 cttccagtat cagctttaag aggtgatctt ggggctcaag ggtcccagac acacagcagc 361 agcagcagca gcagcagcag caacagcagc agcagcagca gcgcctgtgg gaaaaaatag 421 aggctagagc catggatgac caacgcgacc tcatctctaa ccatgaacag ttgcccatac 481 tgggcaaccg ccctagagag ccagaaag // LOCUS SOYBCONGL 794 bp ss-mRNA PLN 03-AUG-1990 DEFINITION Soybean beta-conglycinin-alpha subunit mRNA, 3' end. ACCESSION M26128 KEYWORDS beta-conglycinin. SOURCE Soybean, cDNA to mRNA, clone pB36. ORGANISM Glycine max Eukaryota; Plantae; Embryobionta; Magnoliophyta; Magnoliopsida; Rosidae; Rosales; Fabaceaea. REFERENCE 1 (bases 1 to 794) AUTHORS Coates,J.B., Medeiros,J.S., Thanh,V.H. and Nielsen,N.C. TITLE Characterization of the subunits of beta-conglycinin JOURNAL Arch. Biochem. Biophys. 243, 184-194 (1985) STANDARD simple staff_review FEATURES from to/span description pept < 1 659 beta-conglycinin-alpha subunit (AA at 3) mRNA < 1 794 beta-conglycinin-alpha subunit BASE COUNT 253 a 153 c 177 g 211 t ORIGIN 1 tgagcaaacg tgccaaatct agttcaagga aaaccatttc ttctgaagat aaacctttta 61 acttgggaag ccgcgacccc atctactcca agaagcttgg caagttcttt gagatcaccc 121 cagagaaaaa cccccagctt cgggacttgg atatcttcct cagtattgtg gatatgaacg 181 agggagctct tcttctacca cacttcaatt caaaggcgat agtgatactg gtaattaatg 241 aaggagatgc aaacattgaa cttgttggcc taaaagaaca acaacaggag cagcaacagg 301 aagagcaacc tttggaagtg cggaaatata gagccgaatt gtctgaacaa gatatatttg 361 taatcccagc aggttatcca gttgtggtca acgctacctc aaatctgaat ttctttgcta 421 ttggtattaa tgccgagaac aaccagagga acttcctcgc aggttcgcaa gacaatgtga 481 taagccagat acctagtcaa gtgcaggagc ttgcattccc tgggtctgca caagctgttg 541 agaagctatt aaagaaccaa agagaatcct actttgtgga tgctcagcct aatgaaaaag 601 aggagggtaa taagggaaga aagggtcctt tgtcttcaat tttgagggct ttttactgaa 661 taagtatgta ctaaaatgta tgctgtaata gctcatagtg agcgaggaaa gtatcgggct 721 atgtaactat gactagagct tcaactatga ataaataaat cgacagcata tgatgctttt 781 gttttgtgtt cttc // LOCUS SOYGLYB4 911 bp ss-mRNA PLN 03-AUG-1990 DEFINITION Soybean glycinin A3-B4 subunit mRNA, 3' end. ACCESSION M35671 KEYWORDS glycinin; glycinin-B4 subunit. SOURCE Soybean (var. Early-10) seed, cDNA to mRNA, clones pSPG[204,673]. ORGANISM Glycine max Eukaryota; Plantae; Embryobionta; Magnoliophyta; Magnoliopsida; Rosidae; Rosales; Fabaceaea. REFERENCE 1 (bases 1 to 911) AUTHORS Epishin,S.M., Kotenko,S.V., Zakharova,E.S., Kiselev,S.L. and Vinetskii,Y.P. TITLE Cloning and structural analysis of cDNA encoding a portion of the A-3-B-4 glycinin subunit JOURNAL Genetika 22, 810-815 (1986) STANDARD simple staff_review FEATURES from to/span description pept < 1 723 glycinin precursor matp 202 720 glycinin-B4 subunit mRNA < 1 911 glycinin mRNA BASE COUNT 280 a 220 c 203 g 208 t ORIGIN 1 caagaagacg aagacgaaga tgaagacgaa gaatatgaac aaactccctc ttatcctcca 61 cgacgaccaa gccatggaaa gcatgaagat gacgaggacg aggacgaaga agaacatcaa 121 cctcgtcctg atcaccctcc acagcgacca agcaggcccg aacaacaaga accacgtgga 181 agaggatgtc agactagaaa tggggttgag gaaaatattt gcaccatgaa gcttcacgag 241 aacattgctc gcccttcacg tgctgacttc tacaacccaa aagctggtcg cattagcacc 301 ctcaacagtc tcaccctccc agccctccgc caattcggac tcagtgccca atatcttgtc 361 ctctacagga atggaattta ctctccacat tggaacttga acgcgaacag tgtgatctat 421 gtgactcgag ggaaaggaag agttagagtg gtgaactgcc aagggaatcc agtgttcgac 481 ggtgacctaa ccaggggaca attgctactg gtgccgcaga actttgtggt cgctgaccaa 541 gggggaaaac aaggattgga atacgtagtg ttcaagacac agcacaacgc cgtgagcagc 601 tacattaagg atctgtttag ggcaatccct tcggaggttc tttccaattc ttacaacctt 661 ggccagagtc aagtgcgtca gctcaagtat caaggaaact ccggcccttt gctcaaccca 721 taaataacaa caagcatata tcaaggtgtg gtgaggccat cttatatgaa taatatgaaa 781 atatattttg tgtaataata aaactatggc ctatgtattt accaccctcc gacccagcct 841 atgttaatat ctgagtggcg ttgtaccttt gaatcgcctt aataaaatgt cagtcttcaa 901 gttttgtctt t // LOCUS TLARGN 133 bp ds-DNA PLN 03-AUG-1990 DEFINITION T.lanuginosus 26S ribosomal RNA gene, partial. ACCESSION M35561 KEYWORDS 26S ribosomal RNA. SOURCE T.lanuginosus DNA. ORGANISM Thermomyces lanuginosus Eukaryota; Plantae; Thallobionta; Basidiomycotina; Deuteromycotina. REFERENCE 1 (bases 1 to 133) AUTHORS Nazar,R.N. and Wong,W.M. TITLE Is the 5S RNA a primitive ribosomal RNA sequence? JOURNAL Proc. Natl. Acad. Sci. U.S.A. 82, 5608-5611 (1985) STANDARD simple staff_review FEATURES from to/span description rRNA < 1 > 133 23S rRNA BASE COUNT 39 a 23 c 39 g 32 t ORIGIN 1 gtagcaaata ttcaaatgag aactttgaag actgaagtgg ggaaaggttc cacgtcaaca 61 gcagttggac gtgggttagt cgatcctaag agatggggaa gctccgtttc aaaggcctga 121 ttttatgcag gcc // LOCUS YSCRGNA 252 bp ds-DNA PLN 03-AUG-1990 DEFINITION S.cerevisiae 26S ribosomal RNA gene, partial. ACCESSION M35562 KEYWORDS 26S ribosomal RNA. SOURCE S.cerevisiae DNA. ORGANISM Saccharomyces cerevisiae Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes; Endomycetales; Saccharomycetaceae. REFERENCE 1 (bases 1 to 252) AUTHORS Nazar,R.N. and Wong,W.M. TITLE Is the 5S RNA a primitive ribosomal RNA sequence? JOURNAL Proc. Natl. Acad. Sci. U.S.A. 82, 5608-5611 (1985) STANDARD simple staff_review FEATURES from to/span description rRNA < 1 > 252 23S rRNA BASE COUNT 62 a 60 c 75 g 55 t ORIGIN 1 gtagcaaata ctcaaatgag aactttgagg actgaagtgg ggaaaggttc catgtgaaca 61 gcagttggac atgggttagt cgatcctaag acacagggaa gttccgtttg aaaggcgccc 121 tggtgcgccg ttgacatgca acagctggga ttcgctggtg gtcacccacc caactactaa 181 ccagccggcg tgtggcttaa gtacggctga gcggacggga agccctgttt tccacaccct 241 atggtcgcat gt // LOCUS YSCSRP 413 bp ds-DNA PLN 03-AUG-1990 DEFINITION S.cerevisiae serine rich protein (srp) gene, 5' end. ACCESSION M35738 KEYWORDS serine rich protein. SOURCE S.cerevisiae DNA. ORGANISM Saccharomyces cerevisiae Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes; Endomycetales; Saccharomycetaceae. REFERENCE 1 (bases 1 to 413) AUTHORS Marguet,D. and Lauquin,G.J.-M. TITLE The yeast srp gene: Positive modulation by glucose of its transcriptional expression JOURNAL Biochem. Biophys. Res. Commun. 138, 297-303 (1986) STANDARD simple staff_review FEATURES from to/span description pept 408 > 413 serine rich protein (srp) mRNA 347 > 413 SRP mRNA (alt.) mRNA 355 > 413 SRP mRNA (alt.) mRNA 404 > 413 SRP mRNA (alt.) BASE COUNT 123 a 83 c 71 g 136 t ORIGIN 1 gatccatctt tcttcgcgcg tttattttca gcaccatgtt ttaggttttt acagcatcgt 61 ttaaggaacc caacaataca atagcgggaa gaatgcactt tctcgttcca taaagggtct 121 ctttcaccta tacggttggt acagatttcc agtgaatgcc agtcagccac ggcattacgt 181 cgtttgcttc tattttcttc gttttgaact gcgtttgtat gcaactgtcc cttgacagag 241 aaaaaagtga agcaaaatga cagacaaaga aatcttttgt ataaaaggtc ggttgaatct 301 tgttgttagc ttgaaatcag cttgcttttc tcctctaaat tacataaaaa accaagaaaa 361 tatcagactt tttcattcgc tttcaacaag tactacaata attaaaaatg gct // LOCUS AVIVNFDG 6557 bp ds-DNA BCT 03-AUG-1990 DEFINITION A.vinelandii nitrogenase (vnfHDGK) genes, complete cds. ACCESSION M32371 KEYWORDS nitrogenase 2; vanadium nitrogenase. SOURCE A.vinelandii (strain CA) DNA. ORGANISM Azotobacter vinelandii Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Aerobic rods and cocci; Azotobacteraceae. REFERENCE 1 (bases 1 to 6557) AUTHORS Joerger,R.D., Loveless,T.M., Pau,R.N., Mitchenall,L.A., Simon,B.H. and Bishop,P.E. TITLE Nucleotide sequence and mutational analysis of the structural genes for nitrogenase 2 of Azotobacter vinelandii JOURNAL J. Bacteriol. 172, 3400-3408 (1990) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by R.N.Pau, 28-FEB-1990. FEATURES from to/span description pept 238 1110 nitrogenase (vnfH) pept 2415 3839 nitrogenase (vnfD) pept 3839 4180 nitrogenase (vnfG) pept 4237 5664 nitrogenase (vnfK) pept 1227 1424 ferredoxin-like ORF rpt 1172 1192 inverted repeat rpt 4188 4219 inverted repeat rpt 5737 5760 inverted repeat binding 1214 1218 pot. ribosome binding site binding 2400 2406 pot. ribosome binding site binding 4222 4228 pot. ribosome binding site site 2311 2327 pot. NTRA-dependent promoter BASE COUNT 1359 a 2270 c 1874 g 1054 t ORIGIN 1 gatcgcttcc cggctgtacc tgcgggtacg tcgacggagc gcacagcatc ctggcctgga 61 tttatggagt ccaataaacc tgcaaaaatt aaaataattc actaattaaa tatgtttttt 121 tgattttata ttccaaaaaa taggcaatca tcgacttatc gatccttgtg gcaccgccct 181 tgctcaactc tcgtcggcac aaatcaaaac gccaacgaat caacggaggt tcctaagatg 241 gcattgcgtc agtgtgcaat ttacggcaag ggtggcatcg gcaagtccac caccacccag 301 aacctggtcg ccgccctcgc cgaagccggc aagaaagtga tgatcgtcgg ttgtgacccg 361 aaagccgact ccacccgcct gatcctgcac tccaaggccc agggcaccgt catggaaatg 421 gccgcgtccg ccggctcggt cgaagacctg gagctggaag acgtgctgca gatcggcttc 481 ggcggcgtca agtgcgtcga atccggtggc ccggagccgg gcgtcgcttg cgccggccgt 541 ggcgtgatca ccgcgatcaa cttcctggaa gaagaaggcg cctacagcga cgacctggac 601 ttcgtgttct atgacgtgct gggcgacgtg gtatgcggcg gcttcgccat gccgatccgc 661 gagaacaagg cccaggaaat ctacatcgtc tgctccggcg agatgatggc catgtacgcc 721 gccaacaaca tcgccaaggg catcgtgaaa tacgcccact ccggcagcgt gcgtctgggc 781 ggcctgatct gcaacagccg caagaccgac cgcgaagacg agctgatcat ggccctggcc 841 gcgaagatcg gcacccagat gatccacttc gtgccgcgcg acaacgtcgt gcaacacgcc 901 gaaatccgcc gcatgaccgt gatcgaatac gatccgaagg ccggacaggc cgacgagtac 961 cgtgccctgg ctcgcaaaat cgtcgacaac aagctgctgg tcatcccgaa cccggcctcc 1021 atggaagaac tcgaagagct gctaatggaa ttcggcatca tggaagtcga agacgagtcc 1081 gtcgtcggca aggccgccgc cgaaggctga ttcacccagc acagcgtttg cggaggagcg 1141 tgcgccgcgg gctttcggaa tggcttctcg cggccggcgg cgcacgccgc cctcccttcg 1201 aacaaccgac ctcaggagct gacaccatgg ccatggccat cgacggctac gaatgcactg 1261 tctgcggcga ctgcgagccg gtctgcccga ccggttcgat cgtcttcagg gacgatcact 1321 acgcgatcga agccgacagt tgcaacgaat gcaccgacgt gggcgagccg cgctgtctcg 1381 gcgtctgccc cgtcgacttc tgcatccagc cgctcgatga ctgaacactg aacgactccg 1441 caccccgttg ccggcggcag gacattccgc gccgtcctgc cgccggaccc agaacggcga 1501 tcgcttttcc tcaggggcga tcgccgtttt acttttcccc gctccgctag cgcccgcgga 1561 caacagccgt ccgtatccgc gccgttcgcc gtcctcgacc ggccacgatc gcggcacagc 1621 tcgccaagct acagcccgct ccacaagctg accatcggca tccagacctt cgccaagatc 1681 cgcgaagact tcttacgtcg acaagaccgg cttcgcccgg cgaccgatcg aacgggacac 1741 actacttcct ctcccggcca tgccgcttcg gcaagagccc gttcctcgat accctggccg 1801 aaccgttcgc cggcgacgaa ccttcgttcc gcaggctgca gatgcatgac cgatgggact 1861 ggcgccggcg ccatccggca ttgcggatcg gcttcggcgg cgcgccgatc cgggacggcg 1921 gataagctcg ccgagccgct gctcccaaag ccacgcccac ctccacgtcc cggcgcggaa 1981 gtggcgatcc ccagccgcac gtccccatgg tttccgggtc tggaaccgtc ccccggactt 2041 cccgtacgga cgcccggaag ccaccgcccc gacggccacg cacagccacg tacagacccc 2101 ccgctctccg tacttggccg aacgaaaaac cttttacgta acccgccggc cgtccacccg 2161 gttgtcccga acaggaaaaa aaagcccgga aaaggcttgc cccggcgctt ttccaaaacc 2221 tcgaaaatgc gcacaaataa ttgattcgaa aggattaatc tgagacagcg gcgatgccgc 2281 gaaaaattcc tgcaagcgct gcaaggacat atggcacgca tcctgccctt acctcttcgc 2341 caaccggttt ttcggtcccg gtcgagtgcc gtcaggggac tcgatccacg catagagccg 2401 aggagacttc aatcatgcca atggtattgc tggaatgtga caaggacata cccgagcgcc 2461 agaaacacat ctatctgaag gcgcccaacg aggacacccg cgagttcctg ccgatcgcca 2521 acgcggcgac catccccggc accctgtccg aacgcggctg cgccttctgc ggcgccaagc 2581 tggtgatcgg cggtgtgctc aaagacacca tccagatgat tcacggcccg ctcggctgtg 2641 cctacgacac ctggcacacc aagcgctacc cgaccgacaa cggccacttc aacatgaagt 2701 acgtctggtc gaccgacatg aaggaaagcc atgtggtctt cggcggcgag aaacgcctcg 2761 agaagagcat gcacgaagcc ttcgacgaaa tgcccgacat caagcggatg atcgtctaca 2821 cgacctgccc gaccgcgctg atcggcgacg acatcaaggc cgtggccaag aaggtgatga 2881 aggaccgtcc ggacgtggac gtcttcaccg tcgaatgccc cggcttctcc ggtgtgtccc 2941 agtccaaggg ccaccacgtc ctgaacatcg gctggatcaa cgagaaagtc gagacgatgg 3001 agaaggaaat caccagcgaa tacaccatga acttcatcgg tgacttcaat attcaaggtg 3061 atacccaact gctgcaaacc tactgggacc gcctgggcat ccaggtcgtc gcccacttca 3121 ccggcaacgg cacctacgac gacctgcgct gcatgcacca ggcccagctc aacgtggtga 3181 actgcgcccg ttcctccggc tacatcgcca acgagctgaa gaagcgctac ggcatcccgc 3241 gtctggacat cgactcctgg ggcttcaact acatggccga gggcatccgc aagatctgcg 3301 ccttcttcgg catcgaggag aagggcgagg agctgatcgc cgaggaatac gccaagtgga 3361 agccgaagct cgactggtac aaggagcgtc tgcaaggcaa gaaaatggcg atctggaccg 3421 gcggcccgcg cctgtggcac tggaccaagt cggtcgagga cgacctgggc gtgcaagtgg 3481 tggccatgtc ctccaagttc ggccatgagg aagacttcga gaaggtcatc gcccgcggca 3541 aggaaggcac ctactacatc gacgacggca acgagctgga attcttcgag atcatcgacc 3601 tggtcaagcc ggacgtgatc ttcaccggcc cgcgcgtcgg cgaactggtc aagaagctgc 3661 acatccccta cgtcaacggc cacggctacc acaacggccc gtacatgggc ttcgaaggct 3721 tcgtcaacct ggcccgcgac atgtacaacg ccgtgcacaa cccgctgcgc cacctggccg 3781 ccgtggatat ccgcgacaag tcgcagacta ccccggtcat cgtgcggggg gccgcctgat 3841 gagccagtcc catctcgacg atctgttcgc ctatgtcgag gagcgctgcc tgtggcagtt 3901 cttctcgcgc acctgggacc gcgaggaaaa catcgagggc gtgctcaatc aggtcggccg 3961 cctgctgacc ggccaggagc cgctgcgcgg caccccgcag gagcgcctgt tctacgccga 4021 cgccctggcc atggccaacg atgtccgcga gcgtttcccc tgggcttcgc aggtcaacaa 4081 ggaagagatc gagttcctgc tcgacggtct caagtcccgt ctggtcgacg tgaccatcac 4141 ccgctcgacc aaccgcgaac tcaaccacca cctctactga gtcatcggcg cggagacggc 4201 cggaacgccg tctgcgcgcg cgaaggagcc attttcatga gcaattgcga actgaccgtg 4261 ctgaagccgg cagaagtcaa gctgagcccg cgtgaccgcg agggcatcat caacccgatg 4321 tacgactgcc agccggccgg cgcccagtac gccggcatcg gcatcaagga ctgcatcccg 4381 ctggtccacg gcggccaggg ctgcacgatg ttcgtccgcc tgctgttcgc ccagcacttc 4441 aaggaaaact tcgacgtcgc ctccacctcg ctgcacgagg agtcggcggt gttcggcggc 4501 gccaagcgcg tcgaggaagg cgtgctggtc ctcgcccgcc gctacccgaa cctgcgcgtc 4561 atcccgatca tcaccacctg ctccaccgaa gtcatcggcg acgacatcga gggcagcatc 4621 cgggtctgca accgggcact cgaagccgaa ttcccggatc gcaagatcta cctggcgccg 4681 gtacacaccc cgagcttcaa gggcagccac gtcaccggct acgccgagtg cgtgaagtcg 4741 gtgttcaaga ccatcaccga cgcgcacggc aagggccagc cgagcggcaa gctcaacgtg 4801 ttcccgggct gggtcaaccc cggcgacgtg gtgctgctca agcgctactt caaggaaatg 4861 gacgtcgaag ccaacatcta catggacacc gaggacttcg actcgccgat gctgccgaac 4921 aagagcatcg agacccacgg ccgcaccacc gtcgaggaca tcgccgacag cgccaacgcc 4981 ctggccaccc tgtccctggc ccgctacgag ggcaacacca ccggcgagtt gctgcagaag 5041 accttcgcgg tgccgaatgc cctggtcaac accccctacg gcatcaagaa caccgacgac 5101 atgctgcgca agatcgccga agtcaccggc aaggagatcc ccgagtcgct ggtccgcgag 5161 cgcggcatcg ccctcgacgc cctggccgac ctggcgcaca tgttcttcgc caacaagaaa 5221 gtggcgatct tcggccaccc ggacctggtg ctcggcctgg cccagttctg catggaagtc 5281 gagctggaac ccgtactgct gctgatcggc gacgaccagg gcaacaagta caagaaggac 5341 ccgcgcatcg aggagctgaa gaacaccgcg cacttcgaca tcgagatcgt ccacaacgcc 5401 gacctctggg aactggagaa gcgcatcaac gccggcctcc agctcgacct gatcatgggt 5461 cactcgaagg gccgctacgt cgccatcgag gccaacatcc cgatggtccg cgtcggcttc 5521 ccgaccttcg accgcgccgg tctctaccgc aagccctcga tcggctacca gggcgccatg 5581 gaactgggcg agatgatcgc caacgccatg ttcgcccaca tggaatacac ccgtaacaag 5641 gagtggatcc tcaatacgtg gtgagttgag gtgccggagc ggtttccacg cactccggct 5701 gtcgagccga ccgagatgac ggcatgcacg gatctcccct cgcccatgca tgggcgaggg 5761 agttgcgagc cgtcacgccc gaggtcgttc cgacggcagg ccgatgcacc ggaaaaccgc 5821 tcccgccccc ggcggctccc ggctccggcc ggggcctccg ccccgtattc cagatccggc 5881 cgccatgccg cattcccggg agtgatgcca tgaaacagcg acaggaaatg gtcgcccact 5941 accgcgcctg cttcggcgag ctgtgcgccc gaccggaaca ccgtcctatc gaaccctaca 6001 cccgcccccg gcgcctgagc ttcgcggagc cggaaagcgc gaccgcccgt caggtgccgg 6061 cgcgcctggt actggccctg accagtgcct acgccctgct cgccgactgg caggaatccc 6121 gcgacccgtc gctggccgac ctgggcagtt ggcaacgcta cctcgccctg ccccgccgca 6181 ccccggcgga aaagctgatg gccgagatct tccgcatcct gcgcgtgttc cgcgccgcgg 6241 cgatccagca gaacggcgcc atcgaaatcc gcgacgacgg cctgatccgc gccagttgca 6301 cctacaaccg ctgcgcgctg agcctgctga tcacccaggc cggcctcgaa ctgctcaccg 6361 cctgcgtcgc ctgctacctc gaatccttcg accagcccta cagcgatgcc tacgaggaac 6421 tgctgctcgg ccagtactac gccgacatcg tcgccgagat ccgctccttc gccgacgacg 6481 accgggtgct gttccagttc cgccagaaag cctggttcaa ccgccatgtc cgcctggact 6541 gcgacaaccc gcgcctg // LOCUS ECOCYS 5755 bp ds-DNA BCT 03-AUG-1990 DEFINITION E.coli thiosulfate binding protein (cysP), sulfate permease (cysT, cysW, cysA) and o-acetylserine (thiol)-lyase-B (cysM) genes, complete cds. ACCESSION M32101 KEYWORDS cysA gene; cysM gene; cysP gene; cysT gene; cysW gene; o-acetylserine (thiol)-lyase-B; sulfate permease; thiosulfate binding protein. SOURCE E.coli K12 DNA. ORGANISM Escherichia coli Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Facultatively anaerobic rods; Enterobacteriaceae. REFERENCE 1 (bases 1292 to 5755) AUTHORS Sirko,A., Hryniewicz,M., Hulamicka,D. and Boeck,A. TITLE Sulfate and thiosulfate transport in E.coli K12: Nucleotide sequence and expression of the cysTWAM gene cluster JOURNAL J. Bacteriol. 172, 3351-3357 (1990) STANDARD full staff_entry REFERENCE 2 (bases 1 to 1291) AUTHORS Hryniewicz,M., Sirko,A., Palucha,A., Boeck,A. and Hulamicka,D. TITLE Sulfate and thiosulfate transport in E.coli K12: Identification of a gene encoding a novel protein involved in thiosulfate binding JOURNAL J. Bacteriol. (1990) In press STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence [1] kindly submitted by D.Halanicka, 14-FEB-1990. FEATURES from to/span description pept 559 1575 thiosulfate binding protein (cysP) pept 1575 2408 sulfate permease (cysT) pept 2408 3283 sulfate permease (cysW) pept 3273 4370 sulfate permease (cysA) pept 4505 5416 o-acetylserine (thiol)-lyase-B (cysM; gtg start codon; EC 4.2.99.8) signal 516 521 -10 region site 5737 5755 right end of mu BASE COUNT 1259 a 1477 c 1682 g 1337 t ORIGIN 52 min on K12 map. 1 gttaacgcca tttgcccggg atacgtgcgc acaccaatgg cggaaagcat tgcccgccag 61 tcgaacccgg aagatccaga gtcggtgctg actgaaatgg cgaaagcaat cccgatgcgt 121 cgcctcgccg atccgctgga agtcggcgaa ctggcggcct tcctcgcatc ggatgaatcc 181 agctatttaa ccggtacaca gaatgtgatt gatggcggca gcacactgcc ggagacggtt 241 agcgtcggta tctgattcac ctctgtttcc tccctgcatt tgtggggagg atttcgtctt 301 gaactaagtt caccaggcta ttttatttgt cattttggcc ccgggcagtg ctcgaaatcc 361 tcacgtacta tgtgtacgct ccggtttctc cgcgctgttc gtgtccaaac tgactgcaac 421 aattacgcct gttgaaccaa gttcttattc ccttttcaac ttccaaatca ccaaacggta 481 tataaaaccg ttactccttt cacgtccgtt ataaatatga tggctattag aaagtcatta 541 aatttataag ggtgcgcaat ggccgttaac ttactgaaaa agaactcact cgcgctggtc 601 gcttctctgc tgctggcggg ccatgtacag gcaacggaac tgctgaacag ttcttatgac 661 gtctcccgcg agctgtttgc cgccctgaat ccgccgtttg agcaacaatg ggcaaaagat 721 aacggcggcg acaaactgac gataaaacaa tctcatgccg ggtcatcaaa acaggcgctg 781 gcgattttac agggcttaaa agccgacgtt gtcacttata accaggtgac cgacgtacaa 841 atcctgcacg ataaaggcaa gctgatcccg gccgactggc agtcgcgcct gccgaataat 901 agctcgccgt tctactccac catgggcttc ctggtgcgta agggtaaccc gaagaatatc 961 cacgattgga acgacctggt gcgctccgac gtgaagctga ttttcccgaa cccgaaaacg 1021 tcgggtaacg cgcgttatac ctatctggcg gcatggggcg cagcggataa agctgacggt 1081 ggtgacaaag gcaaaaccga acagtttatg acccagttcc tgaaaaacgt tgaagtgttc 1141 gatactggcg gtcgtggcgc gaccaccact tttgccgagc gcggcctggg cgatgtgctg 1201 attagcttcg aatcggaagt gaacaacatc cgtaaacagt atgaagcgca gggctttgaa 1261 gtggtgattc cgaaaaccaa cattctggcg gaattcccgg tggcgtgggt tgataaaaac 1321 gtgcaggcca acggtacgga aaaagccgcc aaagcctatc tgaactggct ctatagcccg 1381 caggcgcaaa ccatcatcac cgactattac taccgcgtga ataacccgga ggtgatggac 1441 aaactgaaag acaaattccc gcagaccgag ctgttccgcg tggaagacaa atttggctcc 1501 tggccggaag tgatgaaaac ccacttcacc agcggcggcg agttagacaa gctgttagcg 1561 gcggggcgta actgatgttt gctgtctcct ccagacgcgt gctgccgggc tttaccttaa 1621 gcctcggcac cagtctgctg tttgtgtgcc tgattttgct gctgccgctc tccgcgctgg 1681 tgatgcaact ggcccagatg agctgggcgc agtactggga ggtgatcacc aacccgcagg 1741 tggtcgcggc ctacaaagta acgctgctgt cggcgtttgt ggcatcgatt tttaacggcg 1801 ttttcggtct gctgatggcg tggatcctaa cccgctatcg cttcccaggc cgcacgctgc 1861 ttgatgcgct gatggattta ccctttgcgc tgccaacggc tgtcgccggt ttaacgctgg 1921 cctcgctctt ttccgtaaac ggtttttacg gtgaatggct ggcgaagttt gatatcaaag 1981 tcacctatac atggctgggg attgcggtgg ctatggcctt taccagcatt ccgtttgtgg 2041 tgcgtaccgt gcagccggtg ctggaagagt taggcccgga atatgaagaa gcggcggaaa 2101 cgcttggtgc aacgcgctgg cagagtttct gcaaagtggt gctgccggag ctttctccgg 2161 cgctggtggc gggcgtggcg ctgtcgttta cccgtagtct tggtgaattt ggcgcggtga 2221 tttttatcgc cggaaatatc gcgtggaaga cggaagtgac gtcgctgatg atttttgtgc 2281 gcttacagga gtttgattac ccggcagcga gcgcgattgc ttcggtgatc ctcgcggcat 2341 ctctgctgct gctgttctca attaacactc tgcaaagtcg ctttggtcgg cgtgtggtag 2401 gtcattaatg gcggaagtta cccaattgaa gcgttatgac gcgcgcccga ttaactgggg 2461 caaatggttt ctgattggca tcgggatgct ggtttcggcg ttcatcctgc tggtgccgat 2521 gatttacatc ttcgtgcagg cattcagcaa ggggctgatg ccggttttac agaatctggc 2581 cgatccggac atgctgcacg ccatctggct gacggtgatg atcgcgctga ttgccgtacc 2641 ggtaaacctg gtgttcggca ttctgctggc ctggctggtg acgcgcttta acttccctgg 2701 acgccagtta ctgctgacgc tactggacat tccgtttgcc gtatcgccgg tggttgccgg 2761 tctggtgtat ttgctgttct acggctctaa cggcccgctc ggcggttggc tcgacgagca 2821 taacctgcaa attatgttct cctggccggg aatggtgctg gtcaccatct tcgtgacgtg 2881 tccgtttgtg gtgcgcgaac tggtgccggt gatgttaagc cagggcagcc aggaagacga 2941 agcggcgatt ttgcttggcg cgtccggctg gcagatgttc cgtcgcgtca cattaccgaa 3001 catccgctgg gcgctgcttt atggcgtggt gttgaccaac gcccgcgcaa ttggcgagtt 3061 tggcgcggtg tcggtggttt ccggctcgat tcgcggcgaa accctgtcgc tgccgttaca 3121 gattgaattg ctggagcagg actacaacac cgtcggctcc tttaccgctg cggcgctgtt 3181 aacgctgatg gcgattatca ccctgttttt aaaaagtatg ttgcagtggc gcctggagaa 3241 tcaggaaaaa cgcgcacagc aggaggaaca tcatgagcat tgagattgcc aatattaaga 3301 agtcgtttgg tcgcacccag gtgctgaacg atatctcact ggatattcct tcaggtcaga 3361 tggtcgcgtt gctggggccg tccggttccg ggaaaaccac gctgctgcgc attatcgccg 3421 ggctggagca tcaaaccagc gggcatattc gcttccacgg caccgacgtg agccgcctgc 3481 acgcacgtga tcgtaaagtc ggtttcgtgt tccagcatta cgcgctgttc cgccatatga 3541 cggtgttcga caatatcgct tttggcctga cggtgctgcc gcgtcgcgag cgcccgaatg 3601 ccgcagccat caaagcgaaa gtgacaaaat tgctggaaat ggtccagctt gcccatctgg 3661 cggatcgtta tccggcgcac gtttccggcg gccagaaaca gcgcgtggcg ctggcgcgcg 3721 cgctggctgt ggaaccgcaa attctgctgc ttgatgaacc gtttggcgcg ctggatgcgc 3781 aggtgcgtaa agagctgcgt cgctggctgc gtcaactcca tgaagaacta aaattcacca 3841 gcgtttttgt gacccacgat caggaagaag cgaccgaagt agctgatcgt gtagttgtga 3901 tgagccaggg caatattgaa caggctgacg cgccggatca ggtatggcgc gaaccggcga 3961 cccgttttgt gctcgaattt atgggcgaag tgaaccgcct gcagggaacc attcgcggcg 4021 ggcagttcca tgttggcgcg catcgctggc cgctgggcta cacacctgcg tatcaggggc 4081 cggtggatct cttcctgcgc ccttgggaag tggatatcag ccgccgtacc agcctcgatt 4141 cgccgctgcc ggtacaggta ctggaagcca gcccgaaagg tcactacacc caattagtgg 4201 tgcagccgct ggggtggtac aacgaaccgc tgacggtcgt gatgcatggc gacgatgccc 4261 cgcagcgtgg cgagcgttta ttcgttggtc tgcaacatgc gcggctgtat aacggcgacg 4321 agcgtatcga aacccgcgat gaggaacttg ctctcgcaca aagcgcctga taggttgagt 4381 gaatgttaaa cgcccggagg cgcttcccgc gagtccgggc tttttaatgg caaggtttgt 4441 aacctgtaga cctgataaga cgcgcaagcg tcgcatcagg caacaccacg tatggataga 4501 gatcgtgagt acattagaac aaacaatagg caatacgcct ctggtgaagt tgcagcgaat 4561 ggggccggat aacggcagtg aagtgtggtt aaaactggaa ggcaataacc cggcaggttc 4621 ggtgaaagat cgtgcggcac tttcgatgat cgtcgaggcg gaaaagcgcg gggaaattaa 4681 accgggtgat gtcttaatcg aagccaccag tggtaacacc ggcattgcgc tggcaatgat 4741 tgccgcgctg aaaggctatc gcatgaaatt gctgatgccc gacaacatga gccaggaacg 4801 ccgtgcggcg atgcgtgctt atggtgcgga actgattctt gtcaccaaag agcagggcat 4861 ggaaggtgcg cgcgatctgg cgctggagat ggcgaatcgt ggcgaaggaa agctgctcga 4921 tcagttcaat aatcccgata acccttatgc gcattacacc accactgggc cggaaatctg 4981 gcagcaaacc ggcgggcgca tcactcattt tgtctccagc atggggacga ccggcactat 5041 caccggcgtc tcacgcttta tgcgcgaaca atccaaaccg gtgaccattg tcggcctgca 5101 accggaagag ggcagcagca ttcccggcat tcgccgctgg cctacggaat atctgccggg 5161 gattttcaac gcttctctgg tggatgaggt gctggatatt catcagcgcg atgcggaaaa 5221 caccatgcgc gaactggcgg tgcgggaagg aatattctgt ggcgtcagct ccggcggcgc 5281 ggttgccgga gcactgcggg tggcaaaagc taaccctgac gcggtggtgg tggcgatcat 5341 ctgcgatcgt ggcgatcgct acctttctac cggggtgttt ggggaagagc attttagcca 5401 gggggcgggg atttaaggat taatagcatc ggagactgat gacaaacgca aaactgcctg 5461 atgcgctacg cttatcaggc ctacaaggtt tctgcaatat attgaattag cacgattttg 5521 taggccggat aaggcgttta cgccgcatcc ggcataaaca aagcgcactt ttttaacagt 5581 tgttgctgcc gacaaatgca gtatttaatt ttcgtgagga aacgccgtaa ggtcattgaa 5641 gcggcgcacg aaaaaccgaa agcgtttcac gataaatgcg aaaactttac gtttcgcgct 5701 tcaaatgaaa cagatgtatt aattactact ttttattcat tacatgggga tccag // LOCUS TTHTRPBA 2586 bp ds-DNA BCT 03-AUG-1990 DEFINITION T.thermophilus tryptophan synthetase (trpAB) genes, complete cds. ACCESSION M32108 KEYWORDS tryptophan synthetase. SOURCE T.thermophilus (strain HB27) DNA. ORGANISM Thermus thermophilus Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Neisseriaceae. REFERENCE 1 (bases 1 to 2586) AUTHORS Koyama,Y. and Furukawa,K. TITLE Cloning and sequence analysis of tryptophan synthetase genes of an extreme thermophile, Thermus thermophilus HB27: Plasmid transfer from replica-plated E.coli recombinant colonies to competent T.thermophilus cells JOURNAL J. Bacteriol. 172, 3490-3495 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence [1] kindly submitted by Y.Koyama, 14-FEB-1990. FEATURES from to/span description pept 353 1567 tryptophan synthetase B (EC 4.2.1.20) pept 1564 2379 tryptophan synthetase A (gtg start codon; EC 4.2.1.20) BASE COUNT 396 a 879 c 905 g 406 t ORIGIN 1 bp upstream of BglII site. 1 agatctcgtt gacgctcgcc gtgcggtggc ggaaccactg cctcgccacg aagatggggg 61 ccttgacgtg aaacttgaag accaccatct caaaggggct cgtgtggcgg tggcgcatga 121 ggtagtcaat gagggcggcg tcctcccgca ccgtcttcgt cccttccccg taggagaccc 181 gggccgcctg gacgatggcg cggtcgtccc ccatctggtc caccaggcgg acgaagccct 241 tgtccaagac cgggatcgtg agaggccctt ccatcccccc gagtttaccg ggaggcccct 301 ccggggtagg atgggagttg tcttggcgcg aggcgccttt agggagcgaa gcatgctgac 361 cctacccgac tttcccttgc ccgacgcgag ggggcggttc ggcccctacg gggggcggta 421 cgtgcccgag accctgatcc ccgccctgga ggagttggag gccgcctacc gggaggccaa 481 gaaggatccg gccttcctgg aggagctgga ccactacctc aggcagtttg ccggcaggcc 541 cacgcccctt taccacgcca agcggctttc cgagtactgg ggtggggccc aggtcttcct 601 caagcgggag gacctcctcc acaccggggc ccacaagatc aacaacaccc tgggccaagc 661 cctcctggcg aggcgcatgg gcaagaggcg ggtcatcgcc gaaacggggg cggggcagca 721 cggggtgagc gtggccacgg tggcggccct cttcggcctg gagtgcgtgg tctacatggg 781 ggaggaggac gtcaggcgcc aggccttaaa cgtcttccgc atgaagctcc tcggggccga 841 ggtccggccc gtggccgcgg ggagccgcac cctcaaggac gccaccaacg aggccatccg 901 cgactggatc acgaacgtgc gcaccacctt ctacatcctg ggctccgtgg tgggccccca 961 cccctacccc atgatggtcc gggacttcca gagcgtcatc ggggaggagg tgaagcggca 1021 aagcctggag cttttcggcc gcctgcccga cgccctcatc gccgcggtgg ggggcggttc 1081 caacgccatc ggcctcttcg cccccttcgc ctacctgcct gagggcaggc ccaagctcat 1141 cggggtggag gccgccgggg aggggctttc cacgggcagg cacgccgcga gcatcggggc 1201 ggggaagcgg ggggtgctgc acgggagcta catgtacctc ctctacgacc acgatgggca 1261 gatcaccccg gcccactccg tctccgcagg gctggactac cccggggtgg ggccggagca 1321 cagctactac gccgacgccg gggtggcgga gtacgccagc gtcacggacg aggaggccct 1381 cgagggcttc aagctcctgg cgcggcttga gggcatcatc cccgccttgg agtcggccca 1441 cgccatcgcc tacgccgcca aggtggtgcc ggagatggac aaggaccagg tggtggtcat 1501 caacctctcc ggccgggggg acaaggacgt gaccgaggtg atgcgccttc tgggagggga 1561 gctgtgacca ccctcgaggc cttcgccaag gcccggtccg agggccgggc cgccctcatc 1621 ccttacctca ccgcgggctt ccccagccgg gagggctttt tgcaggcggt ggaggaggtc 1681 ctcccctatg ccgaccttct ggagatcggc ctcccctact ccgaccccct gggggacggc 1741 cccgtgatcc agcgggcaag cgaactggcc ctgaggaagg ggatgagcgt ccaaggcgct 1801 ttggagctcg tccgcgaggt gcgcgccctc acggagaaac ccctcttcct catgacctac 1861 ctgaaccccg tcctggcctg ggggccggag cgcttcttcg gcctcttcaa gcaggcgggg 1921 gccacggggg tgatccttcc cgaccttccc cccgacgagg acccaggcct cgtgcgcctc 1981 gcccaggaga tcggcctgga gacggtcttc cttctcgccc ccacctccac ggacgcccgc 2041 atcgccaccg tggtccgcca cgccacgggc ttctcgtacg ccgtctccgt caccggggtc 2101 acggggatgc gggaaaggct tcccgaggag gtcaaggacc tggtgcggcg catcaaggcc 2161 aggaccgccc tccccgtggc cgtgggcttc ggggtttccg gcaaggccac ggccgcgcag 2221 gcggcggtgg cggacggggt ggtggtggga agcgccttgg tccgggcctt ggaggagggg 2281 aggtcccttg cgccccttct ccaggagatc cgccagggcc tccagcgcct cgaggccaac 2341 cccggcctga aggaaagctc aaagaagccg ttgtcctgag ccttggaggg taaacccatg 2401 cggcagtggc cttttttagt tctggcaggc ggcctcctcc tcggggcctg tgggctcctt 2461 ccggggtttg gggggcctaa ggtctccgga gagttccagg gggactggtc cggcgtggcc 2521 cagggcctgc gcctggccct ggtgggcctc acgacggagg ggcaggtgaa ctacgacaac 2581 cagctg //