Path: utzoo!attcan!uunet!bionet!root
From: GenBank-Updates@genbank.bio.net
Newsgroups: bionet.molbio.genbank.updates
Subject: Database Update
Message-ID:
Date: 26 May 90 12:00:09 GMT
Sender: root@genbank.BIO.NET
Distribution: bionet
Lines: 1832
Approved: lear@genbank.bio.net
Checksum: 31567 110

LOCUS       ECOSPEA      3236 bp ds-DNA             BCT       26-MAY-1990
DEFINITION  E.coli arginine decarboxylase (speA) gene, complete cds,
            agmatinase (speB) and methionine adenosyltransferase (metK)
            genes, 5' end.
ACCESSION   M31770
KEYWORDS    agmatinase; arginine decarboxylase; metK gene;
            methionine adenosyltransferase; speA gene; speB gene.
SOURCE      E.coli (strain K12) DNA, clones pLC2-5 and lambda-[1H10,23G45].
  ORGANISM  Escherichia coli
            Prokaryota; Bacteria; Gracilicutes; Scotobacteria;
            Facultatively anaerobic rods; Enterobacteriaceae.
REFERENCE   1  (bases 1 to 3236)
  AUTHORS   Moore,R.C. and Boyle,S.M.
  TITLE     Nucleotide sequence and analysis of the speA gene encoding
            arginine decarboxylase of Escherichia coli
  JOURNAL   J. Bacteriol. (1990) In press
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence [1] kindly submitted
            by R.C.Moore, 02-FEB-1990.
FEATURES       from  to/span    description
    pept        192 <    1 (c)  methionine adenosyltransferase (metK)
    pept        987   2963      arginine decarboxylase (speA) (EC 4.1.1.19)
    pept       3101 > 3236      agmatinase (speB)
    signal     3030   3067      rho-independent transcription terminator
    signal      811    816      -35 region
    signal      839    844      -10 region
    binding     977    980      ribosome binding site
    site       1878   1886      pyridoxal phosphate binding site (put.)
BASE COUNT      743 a    784 c    860 g    849 t
ORIGIN      62.9 min on K12 map.
1 tacccaaggt cgctggtggt gatttcgccg ccaactaaaa ccaatgccgg tttttacgta 61 ggtttcgcaa gcaacgcgtg ctttcggatc ctgttcgagg atcgcgtcta aaacggcatc 121 agaaatttgg tcagcaattt tgtcaggatg cccttcagag acggactcgg acgtaaaaag 181 gtgttttgcc atatttaata tcacctaaag agaatttggt tagctcaaac tgttgtgtgg 241 attttctgtg gtagcggatc ctaccacgac tctgcaggtt aaaaacactg gcagtctgag 301 tgttaatcgg tatggatgga ttaacatctg gatggctatt ttaggtcaat tcttcaccct 361 atttccactt ttttttgaat cgtgtctcat tctgttaaaa acgtggctgg aaatttttcc 421 tgacaatgcc ggcattctgc gtatttatct tttgcaattt tctgccattg tggggtataa 481 aacgcggcgc gcggcttaaa taaaaagcac acgacgtttc tttcgtgttg ccacttccag 541 ccgggttcaa atcagagttt tggcttgtgg gttcgtctta acaggcggcc gtggaggtga 601 tacgaaataa tgaaccgttg tctgctgctt aacctgtctc accgttctgg tgaagattcg 661 ttccccgcac tctgcatctc tgctttgcat acctgccgat gttataccca tctcggcgct 721 tctcaggatt caagagctgg ttacagttac tgaggactga acaagggcgc tcttgtaaaa 781 acaagagttt tctcgtggtt tcgccgaact ttcacactta cgttcggtta tgtgcttaat 841 aatgttatga aaaagaaacc ggttgcgcag ttggagcgtc agcattcact gctggaaaat 901 ccatgtgctt atgggttgtt atcgcagttc caggctgcga tagtcgttaa ctgttttaca 961 cttaataaaa taatttgagg ttcgctatgt ctgacgacat gtctatgggt ttgccttcgt 1021 cagcgggcga acacggtgta ctacgctcca tgcaggaggt tgcaatgagc tcccaggaag 1081 ccagcaagat gctgcgtact tacaatattg cctggtgggg caataactac tatgacgtta 1141 acgagctggg ccacattagc gtgtgcccgg acccggacgt cccggaagct cgcgtcgatc 1201 tcgcgcagtt agtgaaaact cgtgaagcac agggccagcg tctgcctgca ctgttctgtt 1261 tcccacagat cctgcagcac cgtttgcgtt ccattaacgc cgcgttcaaa cgtgcgaggg 1321 aatcctacgg ctataacggc gattacttcc ttgtttatcc gatcaaagtt aaccagcacc 1381 gccgcgtgat tgagtccctg attcattcgg gcgaaccgct gggtctggaa gccggttcca 1441 aagccgagtt gatggcagta ctggcacatg ctggcatgac ccgtagcgtc atcgtctgca 1501 acggttataa agaccgcgaa tatatccgcc tggcattaat tggcgagaag atggggcaca 1561 aggtctatct ggtcattgag aagatgtcag aaatcgccat tgtgctggat gaagcagaac 1621 gtctgaatgt cgttcctcgt ctgggcgtgc gtgcacgtct gcgttcgcag ggttcgggta 1681 aatggcagtc ctccggcggg 
gaaaaatcga agttcggcct ggctgcgact caggtactgc 1741 aactggttga aaccctgcgt gaagccgggc gtctcgacag cctgcaacta ctgcacttcc 1801 acctcggttc gcagatggcg aatattcgcg atatcgcgac aggcgttcgt gaatccgcgc 1861 gtttctatgt ggaactgcac aagctgggcg tcaatattca gtgcttcgac gtcggcggcg 1921 gtctgggcgt ggattatgaa ggtactcgtt cgcagtccga ctgttcggtg aactacggcc 1981 tcaatgaata cgccaacaac attatctggg cgattggcga tgcgtgtgaa gaaaacggtc 2041 tgccgcatcc gacggtaatc accgaatcgg gtcgtgcggt gactgcgcat cacaccgtgc 2101 tggtgtctaa tatcatcggc gtggaacgta acgaatacac ggtgccgacc gcgcctgcag 2161 aagatgcgcc gcgcgcgctg caaagcatgt gggaaacctg gcaggagatg cacgaaccgg 2221 gaactcgccg ttctctgcgt gaatggttac acgacagtca gatggatctg cacgacattc 2281 atatcggcta ctcttccggc atctttagcc tgcaagaacg tgcatgggct gagcagcttt 2341 atttgagcat gtgccatgaa gtgcaaaagc agctggatcc gcaaaaccgt gctcatcgtc 2401 cgattatcga cgagctgcag gaacgtatgg cggacaaaat gtacgtcaac ttctcgctgt 2461 tccagtcgat gccggacgca tgggggatcg accagttgtt cccggttctg ccgctggaag 2521 ggctggatca agtgccggaa cgtcgcgctg tgctgctgga tattacctgt gactctgacg 2581 gtgctatcga ccactatatt gatggtgacg gtattgccac gacaatgcca atgccggagt 2641 acgatccaga gaatccgccg atgctcggtt tctttatggt cggcgcatat caggagatcc 2701 tcggcaacat gcacaacctg ttcggtgata ccgaagcggt tgacgtgttc gtcttccctg 2761 acggtagcgt agaagtagaa ctgtctgacg aaggcgatac cgtggcggac atgctgcaat 2821 atgtacagct cgatccgaaa acgctgttaa cccagttccg cgatcaagtg aagaaaaccg 2881 atcttgatgc tgaactgcaa caacagttcc ttgaagagtt cgaggcaggt ttgtacggtt 2941 atacttatct tgaagatgag taagtcctgt gttacttgaa tccgcttaat ttagcggtga 3001 taatccgcca caatttattg tgacaaatcc aacccttcct cgtcgggcct aacgacgcgg 3061 aagggttttt ttatatcgac tttgtaatag gagtccatcc atgagcacct taggtcatca 3121 atacgataac tcactggttt ccaatgcctt tggtttttta cgcctgccga tgaacttcca 3181 gccgtatgac agcgatgcag actgggtgat tactggcgtg ccgttcgata tggcca // LOCUS FIBEGASE 2310 bp ds-DNA BCT 26-MAY-1990 DEFINITION F.succinogenes endoglucanase 3 (cel3) gene, complete cds. ACCESSION M29047 M29681 KEYWORDS cellobiosidase; endoglucanase. 
SOURCE F.succinogenes (strain S85, ATCC 19169) DNA. ORGANISM Fibrobacter succinogenes Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Sulfate- or sulfur-reducing dissimilatory bacteria. REFERENCE 1 (bases 1 to 2310) AUTHORS McGavin,M.J., Forsberg,C.W., Crosby,B., Bell,A.W., Dignard,D. and Thomas,D.Y. TITLE Structure of the cel-3 gene from Fibrobacter succinogenes S85 and characteristics of the encoded gene product, endoglucanase 3 JOURNAL J. Bacteriol. 171, 5587-5595 (1989) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by D.Dignard, 14-OCT-1989. FEATURES from to/span description pept 177 2153 endoglucanase 3 precursor sigp 177 245 endoglucanase 3 signal peptide A (alt.) sigp 177 251 endoglucanase 3 signal peptide A' (alt.) matp 246 2150 endoglucanase 3 A (alt.) matp 252 2150 endoglucanase 3 A' (alt.) site 167 172 ribosome binding site site 2172 2213 region of dyad symmetry BASE COUNT 649 a 653 c 529 g 479 t ORIGIN 1 ggatccgggt gcgtcagtta aataaaatat tttttaacgt ttttcgtaca gaaagtggac 61 ttttagacca aaacacttat tacacttttt attccgatat atcattttac atagcataaa 121 accgaccccc aaatatatct ttggtaaaaa agaaaaaatc accttaagag ggttttatgc 181 aactcaagaa tttctatccc aaaatgagcg ttctcggtat cgcaaccgtg atggcactta 241 ccgcctgtgg cgatgaaaat acccaggcac tgttcgccaa caatccggtt ccgggtgccg 301 aaaatcaggt tccggtttct agcagcgaca tgagcccgac ctctagcgac gctgtcattg 361 acccgacctc cagctctgcc gcagtggtcg acccgtctac gctccctgca gaaggtccta 421 ttaccatgcc ggaaggtctc ggcactttgg tcgatgactt tgaagatggc gataacttga 481 gcaaaatcgg tgattactgg tacacctaca acgataacga caacggtggt gcatccatca 541 tcacgactcc gctaaacgaa gaagaaaaca tcatcccggg ccgcgtcaac aacggttcca 601 actacgcctt gcaagtcaac tacacgcttg atagaggcga ttacgaatac gatccgtacg 661 taggctgggg cgtgcaggtc gcaccggacg aagccaacgg acatttcggc ggccttacct 721 actggtacaa gggcggcgca cacgaagtac atatcgaaat caccgacgtc gaagactacg 781 acgtgcatct cgccaagttc ccggcatccc gcacatggaa gcaggctgtc gtccgcttca 841 aggacctcgt tcaaggtggc 
tggggcaagg aaattccgtt cgacgccaag cacatcatgg 901 caatcagctt ccaggccaag ggaaacaaga gcaagctcgt gaccgactcc ctcttcatcg 961 acaacatcta cctgcaggat tcttccgaag ttgaaaagga ccagccggat atggaaatca 1021 aggacccggt cattccggtc gttgaattta ccgaagctga aatcactgtg acgaacccgt 1081 tgcaggaaaa ggccatgaag tacctcaaca agggtgtcaa ctttaccaac tggctcgaaa 1141 acgcagatgg caagttcaag tcctttgaat tgggcgaaag cgacgtcaag attcttgccg 1201 acaacggatt caagagcctc cgcttgccga ttgaccttga cctctatgcc acaaaccgtg 1261 acgcattcat cgcaggcacc gacacagaac tcaagttcga tgacgacacc ttgttcctgg 1321 ttctcgactc cttcgtagaa tggaccgcca agtacaacat gtctttcgtg attgactacc 1381 atgaatatga caacagctac aacaccacca gcgctaagga ccccaactac atcaagatga 1441 tggcagaaac gtggaagcat gttgcagccc actacgccga aagcccccgc gaagacttgt 1501 tcttcgaact cttgaacgaa ccggacatga gcgatggtaa ggtcactgca gcaacatgga 1561 ccaccgcagc ccaggccatg attgacgcca tccgcacggt tgataccaag cacaccatcc 1621 tcttcggtga tgcccagtgg tactccatca cgctcctcgc caagcgcact ccgttcaccg 1681 atgacaacat catctacgtg atccacacct acgaaccgtt cgccttcacg catcagggcg 1741 gttcctggac ggactacgcc accatccacg atattccgtt cccctacgat ccggcaaagt 1801 ggtctacggt ttctggcgac ttcggtgtca acaagagcac aaagtcctac gtgaaaacca 1861 acatcaagaa ctactacaag accggcagca aggaagccat cttggaacag attctcaagg 1921 ccaagaagtg ggccgccacc aacaacgtac cggtgatcat caacgaattc ggcgcattga 1981 acctccgctc taccgctgaa tcccgcctca actacctcac ggccatgcgc gaaatctgcg 2041 ataccctcca gattccttgg acgcactggg gctacaccgg caacttctcc gtgatcgaaa 2101 acggcaagtt gattgaaggc ctcgacaagg cactcggcgt cggtagcaaa taagtctctc 2161 cttaaaaccc cctcaaaaaa aggtcacgca gaaatgcgtg gcttttttag taggaagtag 2221 acggtaggaa gttggaagtt agaagtagga agtaacagga atggcgcaat ggatacagtt 2281 gacacagata cattacaaaa ccccggatcc // LOCUS SFSSA 1747 bp ss-RNA VRL 26-MAY-1990 DEFINITION Sandfly fever sicilian virus S RNA encoding N protein, complete cds, and NS-s protein, complete cds. ACCESSION J04418 KEYWORDS N protein; NS-s protein. SOURCE Sandfly fever Sicilian virus, cDNA to viral RNA. 
ORGANISM Sandfly fever Sicilian virus Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses; Bunyaviridae; Uukuvirus. REFERENCE 1 (bases 1 to 1747) AUTHORS Marriott,A.C., Ward,V.K. and Nuttall,P.A. TITLE The S RNA segment of Sandfly fever sicilian virus: Evidence for an ambisense genome JOURNAL Virology 169, 341-345 (1989) STANDARD full staff_review REFERENCE 2 (bases 693 to 695) AUTHORS Marriott,A.C., Ward,V.K. and Nuttall,P.A. JOURNAL Unpublished (1990) Oxford, UK STANDARD full staff_review COMMENT Draft entry and computer-readable sequence [1] kindly submitted by A.C.Marriott 12-JAN-1989. FEATURES from to/span description pept 42 782 N protein pept 1727 924 (c) NS-s protein revision 693 695 ttc in [2]; tc in [1] BASE COUNT 483 a 358 c 459 g 447 t ORIGIN Unreported. 1 acacaaaggt ccctagttaa tctgagtgag ctaagtttga aatggacgag taccagaaaa 61 ttgctgttga gtttggagag caggctattg atgagactgt gatccaggat tggctacaag 121 catttgcgta tcaaggattt gatgccagaa caattataca caaccttgtg cagcttggag 181 ggaagagttg ggaagaggat gccaagaaga tgatcatcct atccctaact cgtggcaaca 241 agcccaagaa gatggttgag agaatgtctc cagagggagc aagagaagtt aagagcctgg 301 ttgcaaagta taagatagta gagggcagac caggcaggaa tggaattacc ctgtcaaggg 361 tgctgcagcc ctggctgggt ggacagtcca agctgtggaa gtggttgaaa acttcttacc 421 agtcccaggg agcacaatgg accgcattgt gtggacaaac ataccccagg cagatgatgc 481 atccaagctt tgccggtctc attgacccaa gcctcgacca ggaggatttt aatgcagtat 541 tggatgctca caaacttttc ttgttcatgt tttccaaaac aatcaatgtc agcctccgcg 601 gtgcgcagaa gagagacatt gaggaatcat tctctcaacc aatgcttgct gctatcaata 661 gctcattcat tgacaacact cagaggaggg cattcttgac taagtttggg atcctaactt 721 ctggagcaag agctacagca gttgtaaaga agattgcaga agtttacagg aaactagagt 781 aagctgctgc tagtgtgggg tgggatgggg attctgggtt gggggttctg gggtggaggg 841 tggctaggtg gggggtggca agggtggatt cggtttgggt tggggtcatg gggaggggtg 901 ggtctggggc tgggcagcgg agatcaaaag tcagagtcag acgagctctc atcattttca 961 tccacatgac tgtgtattgg ggtccaaaga gaattgccat actcggtgag gccagtagaa 1021 gggtcacttg
ctctatagga tctaatcact gttcttacat caagtgcctc cccagaggag 1081 gcagtgtcaa aaggctctgc attgataagt ctgagacaaa ccagagatcc tatctctcta 1141 aatagatcgt atccattgta atgctcatca ctaagaccca acctcctagc ttcttgtagt 1201 atctttttgt gtgcctgaac tatgcactca tccaagctat gtgaatcccc cattctcaga 1261 atgtaagaca ttagctgatc ccttgtttgt agccctctca caaatctatc actgcatatg 1321 ctaaagatct cacaatcagg gatacctagt ggccagctaa gagccttcag gacatttggc 1381 agcccctttc tagagaaact tgtgaggtca aacctggaga ggtcacttgc cataccttgg 1441 aaggtataca tcataggctt gacagaacta aaatagcatg ctgggcccca agaagctggc 1501 aactctccaa gggaataaaa gtcagccagt gagtttctgc gtccaaaccc aagtcttaac 1561 ttctctagtg gtatttcaca atgctcataa gttgaaacgt catgagtgtg aaatttattg 1621 taggcaacat aagacacact ggagaggagt ctatgacacc tcacatcaat gttaattgcc 1681 gggtagtcaa acatgtactg gctgttcatc atgttgttgt tgatcattga ctagggggtc 1741 tttgtgt // LOCUS RSSB800AB 437 bp ds-DNA BCT 26-MAY-1990 DEFINITION R.sphaeroides B800-850 alpha and beta subunits of major light-harvesting complex. ACCESSION X05200 KEYWORDS light-harvesting complex. SOURCE Rhodobacter sphaeroides. ORGANISM Rhodobacter sphaeroides Prokaryota; Bacteria; Gracilicutes; Anoxyphotobacteria; Purple nonsulfur bacteria. REFERENCE 1 (bases 1 to 437; no enum.) AUTHORS Ashby,M.K., Coomber,S.A. and Hunter,C.N. TITLE Cloning,nucleotide sequence and transfer of genes for the B800-850 light harvesting complex of Rhodobacter sphaeroides JOURNAL FEBS Lett. 
213, 245-248 (1987) STANDARD simple automatic FEATURES from to/span description pept 40 195 B800-850 beta subunit (AA 1-51) pept 210 374 B800-850 alpha subunit (AA 1-54) BASE COUNT 78 a 156 c 124 g 79 t ORIGIN 1 gccctagcgc acaccgtcga tttaccattg gagacgcaca tgactgacga tctcaacaaa 61 gtctggccga gcggcctcac cgttgccgaa gccgaagaag ttcataagca actcatcctc 121 ggcacccgcg tcttcggtgg catggctctg ctcgcgcact tcctcgccgc cgctgcgacc 181 ccctggctcg gctgatatga gagactgaca tgaccaacgg caaaatctgg ctcgtggtga 241 aaccgaccgt cggcgttccg ctgttcctca gcgctgccgt catcgcctcc gtcgttatcc 301 acgctgctgt gctgacgacc accacctggc tgcccgccta ctaccaaggc tcggctgcgg 361 tcgcggccga gtaatgctgc gcaagcgcgg gcctgcgggc ccacgccagc cagtccgtga 421 gtccgagcag gccggga // LOCUS RSSPETA 316 bp ds-DNA BCT 26-MAY-1990 DEFINITION R.sphaeroides Rieske Fe-S protein cytochrome b (petA) gene, 5' end. ACCESSION M18577 KEYWORDS cytochrome b. SOURCE R.sphaeroides (strain GA) DNA. ORGANISM Rhodobacter sphaeroides Prokaryota; Bacteria; Gracilicutes; Anoxyphotobacteria; Purple nonsulfur bacteria. REFERENCE 1 (bases 1 to 316) AUTHORS Davidson,E. and Daldal,F. TITLE fbc operon, encoding the Rieske Fe-S protein cytochrome b, and cytochrome c1 apoproteins previously described from Rhodopseudomonas sphaeroides, is from Rhodopseudomonas capsulata JOURNAL J. Mol. Biol. 195, 25-29 (1987) STANDARD full staff_entry FEATURES from to/span description pept 32 > 316 Rieske Fe-S protein cytochrome b (gtg start codon) BASE COUNT 54 a 111 c 99 g 52 t ORIGIN Unreported. 1 ctgcagcggc ccgaggaagg gagaagttct cgtgtccaac gcagaagatc acgcaggcac 61 tcgcagggat ttcctgtatt acgccacggc cggagccggg gcggtggcca ccggggccgc 121 cgtctggccg ctgatcaacc aaatgaatcc gtcggccgac gtgcaggccc tcgcctccat 181 cttcgtcgat gtgagctcgg tcgagccggg tgtccagctg accgtcaagt tcctcggcaa 241 accgatcttc atccgccgcc gcaccgaggc cgacatcgag ctcggccgct ccgtccagct 301 cggccagctg gtcgac // LOCUS HUMERCC3A 2751 bp ss-mRNA PRI 26-MAY-1990 DEFINITION Human DNA repair helicase (ERCC3) mRNA, complete cds. 
ACCESSION M31899 KEYWORDS Cockayne's syndrome; DNA repair; excision repair; helicase. SOURCE Human lymphoid cell line K562 cDNA to mRNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 2751) AUTHORS Weeda,G., Van Ham,R.C.A., Vermeulen,W., Bootsma,D., Van der Eb,A.J. and Hoeijmakers,J.H.J. TITLE Identification of the molecular defect involving the human repair disorders xeroderma pigmentosum and Cockayne's syndrome in the ERCC-3-encoding, a presumed DNA repair helicase JOURNAL Mol. Cell. Biol. 10, 2570-2581 (1990) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by G. Weeda, 07-FEB-1990, for release after publication. FEATURES from to/span description pept 96 2444 DNA repair helicase /hgml_locus_uid="LF0034Q" /map="2q21" /nomgen="ERCC3" BASE COUNT 727 a 668 c 726 g 630 t ORIGIN 1 gggagcttcc ggattgagcc ggaagtcccc ccagagcgga tgccgcggcg ggcctgtggg 61 agcggggtca tcttctctct gctgctgtag ctgccatggg caaaagagac cgagcggacc 121 gcgacaagaa gaaatccagg aagcggcact atgaggatga agaggatgat gaagaggacg 181 ccccggggaa cgaccctcag gaagcggttc cctcggcggc ggggaagcag gtggatgagt 241 caggcaccaa agtggatgaa tatggagcca aggactacag gctgcaaatg ccgctgaagg 301 acgaccacac ctccaggccc ctctgggtgg ctcccgatgg ccatatcttc ttggaagcct 361 tctctccagt ttacaaatat gcccaagact tcttggtggc tattgcagag ccagtgtgcc 421 gaccaaccca tgtgcatgag tacaaactaa ctgcctactc cttgtatgca gctgtcagcg 481 ttgggctgca aaccagtgac atcaccgagt acctcaggaa gctcagcaag actggagtcc 541 ctgatggaat tatgcagttt attaagttgt gtactgtcag ctatggaaaa gtcaagctgg 601 tcttgaagca caacagatac ttcgttgaaa gttgccaccc tgatgtaatc cagcatcttc 661 tccaggaccc cgtgatccga gaatgccgct taagaaactc tgaaggggag gccactgagc 721 tcatcacaga gactttcaca agcaaatctg ccatttctaa gactgctgaa agcagtggtg 781 ggccctccac ttcccgagtg acagatccac agggtaaatc tgacatcccc atggacctgt 841 ttgacttcta tgagcaaatg gacaaggatg aagaagaaga agaagagaca 
cagacagtgt 901 cttttgaagt caagcaggaa atgattgagg aactccagaa acgttgcatc cacctggagt 961 accctctgtt ggcagaatat gacttccgga atgattctgt caaccctgat atcaacattg 1021 acctaaagcc cacagctgtc ctcagaccct atcaggagaa gagcttgcga aagatgtttg 1081 gaaacgggcg tgcacgttcg ggggtcattg ttcttccctg cggtgctgga aagtccctgg 1141 ttggtgtgac tgctgcatgc actgtcagaa aacgctgtct ggtgctgggc aactcagctg 1201 tttctgtgga gcagtggaaa gcccagttca agatgtggtc caccattgac gacagccaga 1261 tctgccggtt cacctccgat gccaaggaca agcccatcgg ctgctccgtt gccattagca 1321 cctactccat gctgggccac accaccaaaa ggtcctggga ggccgagcga gtcatggagt 1381 ggctcaagac ccaggagtgg ggcctcatga tcctggatga agtgcacacc ataccagcca 1441 agatgttccg aagggtgctc accatcgtgc aggcccactg taagctgggt ttgactgcga 1501 ccctcgtccg cgaagatgac aaaattgtgg atttaaattt tctgattggg cctaagctct 1561 acgaagccaa ctggatggag ctgcagaata atggctacat cgccaaagtc cagtgtgctg 1621 aggtctggtg ccctatgtct cctgaatttt accgggaata tgtggcaatc aaaaccaaga 1681 aacgaatctt gctgtacacc atgaacccca acaaatttag agcttgccag tttctgatca 1741 agtttcatga aaggaggaat gacaagatta ttgtctttgc tgacaatgtg tttgccctaa 1801 aggaatatgc cattcgactg aacaaaccct atatctacgg acctacgtct cagggggaaa 1861 ggatgcaaat tctccagaat ttcaagcaca accccaaaat taacaccatc ttcatatcca 1921 aggtaggtga cacttcgttt gatctgccgg aagcaaatgt cctcattcag atctcatccc 1981 atggtggctc caggcgtcag gaagcccaaa ggctagggcg ggtgcttcga gctaaaaaag 2041 ggatggttgc agaagagtac aatgcctttt tctactcact ggtatcccag gacacacagg 2101 aaatggctta ctcaaccaag cggcagagat tcttggtaga tcaaggttat agcttcaagg 2161 tgatcacgaa actcgctggc atggaggagg aagacttggc gttttcgaca aaagaagagc 2221 aacagcagct cttacagaaa gtcctggcag ccactgacct ggatgccgag gaggaggtgg 2281 tggctgggga atttggctcc agatccagcc aggcatctcg gcgctttggc accatgagtt 2341 ctatgtctgg ggccgacgac actgtgtaca tggagtacca ctcatcgcgg agcaaggcgc 2401 ccagcaaaca tgtacacccg ctcttcaagc gctttaggaa atgatgctta ggcagggtac 2461 ttcgttcaag accggcgctt ggcacccttg ttggaaaggg attttcagca taacattttc 2521 cttccacctc tttgaccttc cctccagcgt tggccaaatt gtgctgagga agatgcatca 
2581 agggcttggc tgtgccttca taggtcatct agggttttat aaaggaggag gagacaatat 2641 tttttcaaac tttttgggga gtggggtcat ttctgtatat aaaaaatgtt aatatttaag 2701 gtgtatttat gttaccgttc tgaataaaca gaatggacca ttgaaccagt a // LOCUS BOLREPA 182 bp ds-DNA PLN 26-MAY-1990 DEFINITION B.campestris tandemly repeated DNA. ACCESSION M30962 KEYWORDS repetitive DNA. SOURCE B.campestris (strain Var B-85) seedling DNA. ORGANISM Brassica campestris Eukaryota; Plantae; Embryobionta; Magnoliophyta; Magnoliopsida; Dilleniidae; Capparales; Brassicaceae. REFERENCE 1 (bases 1 to 182) AUTHORS Das Gupta,J. and Mandal,R.K. JOURNAL Unpublished (1990) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by R.K.Mandal, 21-DEC-1989. Bose Institute, Dept. Biochemistry, Centenary Building, P 1/12 CIT Scheme VIIM, Calcutta 700 054 INDIA. FEATURES from to/span description BASE COUNT 54 a 39 c 29 g 60 t ORIGIN 1 aagcttctta catcgtgatt catcctggtt tgattagaat gacaaagaag ctgtccaatt 61 cccaaacagg aaaactggga tcacctgatt tgaaagtggg ttagcttctt catcctaact 121 cctatgagat ttcttcaact tcctagtgat tctccattac tttaagtatc aaaatcaagc 181 tt // LOCUS BOLREPB 182 bp ds-DNA PLN 26-MAY-1990 DEFINITION B.juncea tandemly repeated DNA. ACCESSION M30963 KEYWORDS repetitive DNA. SOURCE B.juncea (strain Var B-9) DNA. ORGANISM Brassica juncea Unclassified. REFERENCE 1 (bases 1 to 182) AUTHORS Das Gupta,J. and Mandal,R.K. JOURNAL Unpublished (1990) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by R.K.Mandal, 21-DEC-1989. Bose Institute, Dept. Biochemistry, Centenary Building, P 1/12 CIT Scheme VIIM, Calcutta 700 054 INDIA. 
FEATURES from to/span description BASE COUNT 53 a 42 c 30 g 57 t ORIGIN 1 aagcttctta cagagtcatt tatcctggtt tgattggaac accgaagaag ctgtcctatt 61 cccaaactgg gaaactggaa tcacctgatt agaaagtggg ataacttctt catcccaact 121 cctatgagat ttattcaact tcctggtgat tctccaacac tttatgtatc caaatcaagc 181 tt // LOCUS HUMHPV16A1 336 bp ds-DNA PRI 26-MAY-1990 DEFINITION Human DNA/HPV-16 insertion site, 5' flank, clone H022. ACCESSION M33610 KEYWORDS insertion site. SEGMENT 1 of 2 SOURCE Human cervical cancer DNA, clone H022. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 336) AUTHORS Wagatsuma,M., Hashimoto,K. and Matsukura,T. TITLE Analysis of integrated human papillomavirus type 16 DNA in cervical cancers: Amplification of viral sequences together with cellular flanking sequences JOURNAL J. Virol. 64, 813-821 (1990) STANDARD simple staff_entry FEATURES from to/span description site 325 326 Human DNA end/HPV-16 DNA start BASE COUNT 114 a 61 c 56 g 105 t ORIGIN 1 aggtatataa atggccaagg tagaagatat caaaatgagg tggatttgat ttctcatgtg 61 agactcatag ctaatttaaa tgaaaattta aataagattt atttgacatg attgggaaca 121 attcaattca actttacaaa cactgattaa atgtctacca tctggatggc accgtgctaa 181 gtgagtctcc aaacctgaac tgtgattata aagggcattt ataaactttc cctcaaagat 241 aggacatttg cccatgtaat catgccatct ttaaaagcat cactctaaat tatttaggtg 301 acttctaact ttgcccagta ctctgtccca cagcta // LOCUS HUMHPV16A2 1002 bp ds-DNA PRI 26-MAY-1990 DEFINITION Human DNA/HPV-16 insertion site, 3' flank, clone H022. ACCESSION M33611 KEYWORDS insertion site. SEGMENT 2 of 2 SOURCE Human cervical cancer DNA, clone H022. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 1002) AUTHORS Wagatsuma,M., Hashimoto,K. and Matsukura,T. 
TITLE Analysis of integrated human papillomavirus type 16 DNA in cervical cancers: Amplification of viral sequences together with cellular flanking sequences JOURNAL J. Virol. 64, 813-821 (1990) STANDARD simple staff_entry FEATURES from to/span description site 9 10 HPV-16 DNA end/Human DNA start BASE COUNT 284 a 190 c 206 g 322 t ORIGIN About 3187 bp after segment 1. 1 gaagtggaat aaagtgaaag cctcactctt ctctagccta agttttagag tccagtgaag 61 cattgcaagc ataggctttg tagtcagaaa accctgagat caaatcctgg ttctaccact 121 tgctatagcg atcttgggca aggggtcaga tctctctaag cctgtttcct catctgtaag 181 gaagggtatt atatcacata aggttactgt gaggactaaa ttagactaag tatgcaatag 241 gaatacaggg tccagttttc tttggatgta atgggcctgg aaaattcctt aaaatccttt 301 tcacctacaa aatcttatga agttctgcct attttctgct taaaaacttt aaaaaattaa 361 tagaaataaa agagaattct actagagaga taggttgacg ttacttcttc cttgcttttt 421 ccttaaagtg gaatgttaaa aactaggata tgcctggaaa gtgttctatc tacaaaaaag 481 gaagttagca gccgctgaaa agtaactaca gatggctatt cactttactc tgaaagcatt 541 tgctgttgat ataatcacac cacaggaaaa catcataatg ttggctgaaa gaaatctgaa 601 atgacacagc aataatgctt catcatgtag aagttggttt caagtttttt tttttttttc 661 ggtctggata gtgtgattgc aagaagggag gctatgctag cttggttata agcagggaag 721 ttggctgtga ggagataaac agagatctca caggaattct ggggtagaaa tcactggacc 781 ggaactgaag ggctatctcc cagcttctgt ttctgccttt tcattcagtc attccctcgt 841 ttactcaaca gttccctctg ctttggtggc agtttctgct ccttctcaag gctgacttgc 901 acatggctct gacttgctgt ggcctcctct ccatcattct ctgcatcagg tgctttcaac 961 cttgatttta ttgtttatat atacttatga acttttctgc ag // LOCUS HUMHPV16B 871 bp ds-DNA PRI 26-MAY-1990 DEFINITION Human DNA/HPV-16 insertion site, 5' flank, clone H404. ACCESSION M33612 KEYWORDS insertion site. SOURCE Human cervical cancer DNA, clone H404. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 871) AUTHORS Wagatsuma,M., Hashimoto,K. and Matsukura,T. 
TITLE Analysis of integrated human papillomavirus type 16 DNA in cervical cancers: Amplification of viral sequences together with cellular flanking sequences JOURNAL J. Virol. 64, 813-821 (1990) STANDARD simple staff_entry FEATURES from to/span description site 861 862 Human DNA end/HPV-16 DNA start BASE COUNT 291 a 147 c 164 g 269 t ORIGIN 1 atactctgag taaacaagta aaacatttgg taaaataact ggaaggatat ataccatagt 61 aaatgattct ttttcaaatt ttctattata tagctatata aggtatgaat ctagtagtta 121 ccctcaaatt agggtaaaca atttcctcag cagtttgagc agctcatctc ataatacttt 181 gcaaagatag ccacacaagg gaatgggctg cttgatttga acacaggtgg ggatggatta 241 atagaactgg ggatcaggga acattgggca ggactaataa gaattaggca gtcagaaaaa 301 gatttacaaa aaagactgta taacgagtct aaagataaat tctacctatt taacatttct 361 gcctgagttt ggagaaggca agaaaacatt cttctcttcc tcttacgtac acagacaatt 421 agggaagcca caatgagata atttatgcta tgttagtgag taacacataa ttttccttca 481 cagctgatat aacttgatta ctggagtggc agtggaaggg catggagacc caggccatgg 541 tcacttttct aggtgctcct acgactcaat ttctctcttc tgtcttgatt cctttgggag 601 attcctggat tttagaaaat cagatgagta agttgttatc atctgaaaaa tgccctctta 661 ccacacaatt atctattaga ggaaagttta ggaacagttg gtttaactga gagaaataaa 721 gataatctct atctcccttg cctgctctta ggataagggt tctgagatcc tatataatct 781 tatatcattt aacataaaca caatttctta ctttgcttga aaagttgtat taaagattcc 841 agggtgcagt taaatacact tcacaatata c // LOCUS HUMHPV16C1 1130 bp ds-DNA PRI 26-MAY-1990 DEFINITION Human DNA/HPV-16 insertion site, 5' flank, clone H705. ACCESSION M33613 KEYWORDS insertion site. SEGMENT 1 of 2 SOURCE Human cervical cancer DNA, clone H705. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 1130) AUTHORS Wagatsuma,M., Hashimoto,K. and Matsukura,T. TITLE Analysis of integrated human papillomavirus type 16 DNA in cervical cancers: Amplification of viral sequences together with cellular flanking sequences JOURNAL J. Virol. 
64, 813-821 (1990) STANDARD simple staff_entry FEATURES from to/span description site 1120 1121 Human DNA end/HPV-16 DNA start BASE COUNT 321 a 222 c 244 g 343 t ORIGIN 1 tgccatcatt aatgcagctg gcacgacagg ttcccgactg aaacggcagt gagcgcaacg 61 caattaatgt gagttagctc actcattagg caccccaggc tttacacttt atgcttccgg 121 ctcgtatgtt gtgtggaatt gtgagcggat aacaatttca cacaggaaac agctatgaca 181 tgattacaat tcgagctcgg taccagcaca atgaggaatg catgctagca caagtgaaac 241 tcatagatgt ccattgtgct atgcattttt tccttgggcc tgatccattt atccatttac 301 tggtttcctg tctgtaaatt tagaaaagat acaggctctc tgaaaagtaa tttctgtctc 361 ttacaagtga agggttaatc aaccaatcca cataattttc tccagtactg agagatcatt 421 tgttttaata aatgcaaata aggtttctta tagttaaagg taattggctt ttcattgtaa 481 ttcttgatgc tggtcatttt gtgtctgagt tgttcctaat tgctttggtt cagagtctga 541 gaaatgaaat agccccttga ctataactgt aactacaatt ataacaattt atttatttaa 601 atcagcaatc cctgcaaagt catttacagt ttgtttattt cagtatgttt tacaaggtgc 661 aacaaaagca gcctcatcac atagcaaatc tttcttacag gattaaaagt taatgggtaa 721 ggtaagtctg gcataggcat taaagtggaa gcattgtttc ttcttgactg gtcaacttta 781 gagacaactt ttcccattcg aagttatcta tcctctaaaa tatacagaga ttgaggccag 841 gtgggatggc tcacccctgt aatcccagaa ctttgggagg ccaagatgag tggattgctt 901 gagctcagaa gtttgagacc ttggtaacat ggcgggatgc cggtggtgcc atgcctgtgg 961 tcccagcttc ttgggggctg aggtgggagg accttctgag cctggtggca aagttgcagt 1021 aagctgtgtt ggtatcactg cactccagcc tgcactcctg cacaaagcaa gaccctgtct 1081 caaaacaata aataaattaa aatatagaga gactttgcat tgcaaaggca // LOCUS HUMHPV16C2 148 bp ds-DNA PRI 26-MAY-1990 DEFINITION Human DNA/HPV-16 insertion site, 3' flank, clone H705. ACCESSION M33614 KEYWORDS insertion site. SEGMENT 2 of 2 SOURCE Human cervical cancer DNA, clone H705. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 148) AUTHORS Wagatsuma,M., Hashimoto,K. and Matsukura,T. 
TITLE Analysis of integrated human papillomavirus type 16 DNA in cervical cancers: Amplification of viral sequences together with cellular flanking sequences JOURNAL J. Virol. 64, 813-821 (1990) STANDARD simple staff_entry FEATURES from to/span description site 28 29 HPV-16 DNA end/Human DNA start BASE COUNT 45 a 24 c 28 g 51 t ORIGIN About 489 bases after segment 1. 1 attatcacag atggtacaat gggcctactg atgcagtgat aatagtactg agatgtacta 61 ttatcccaca tttagttaag ttaggattga tcctagattc acatgttgtc agtgtgatgc 121 cttaaatatc aagtttccaa ttaagctt // LOCUS HUMHPV16D2 510 bp ds-DNA PRI 26-MAY-1990 DEFINITION Human DNA/HPV-16 insertion site, 3' flank, clone H901. ACCESSION M33616 KEYWORDS insertion site. SEGMENT 2 of 2 SOURCE Human cervical cancer DNA, clone H022. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 510) AUTHORS Wagatsuma,M., Hashimoto,K. and Matsukura,T. TITLE Analysis of integrated human papillomavirus type 16 DNA in cervical cancers: Amplification of viral sequences together with cellular flanking sequences JOURNAL J. Virol. 64, 813-821 (1990) STANDARD simple staff_entry FEATURES from to/span description site 10 11 HPV-16 DNA end/Human DNA start BASE COUNT 140 a 133 c 94 g 143 t ORIGIN About 3994 bp after segment 1.
1 acattattat ggaaacagat ctgtgagtac caagaaaaga ggataaagat tcatcccatc 61 caccagtcat tcccatgcac ctctacccgc catcccctgt atccaggaca acccccttct 121 gacaccaaaa tgcatttcac cattggctgc tgtcggtaga taatacctgc tcagcatttg 181 ggacaagttc cagacataac ttcctcttag tgaatgatcc tgacaggaga aagaattgag 241 cttaatttat gccatctaat aacctcagtg cagctacttg ggaagttagc cctccagagt 301 ttcccccaaa gttttctcca gtgaattaca gtgccatata ttctcattgc taccagcgct 361 gctcccaaaa tctatctgct gtttaatagt ttttaccttt caaaaatgca agctggctgg 421 gcgtggattt ttgaaagcat tcctcctgcc ttggcctctc aaagtgctgg attagagggt 481 gccttctaat cccagcaatc agcattggaa // LOCUS HUMHPV1D1 510 bp ds-DNA PRI 26-MAY-1990 DEFINITION Human DNA/HPV-16 insertion site, 5' flank, clone H901. ACCESSION M33615 KEYWORDS insertion site. SEGMENT 1 of 2 SOURCE Human cervical cancer DNA, clone H022. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 510) AUTHORS Wagatsuma,M., Hashimoto,K. and Matsukura,T. TITLE Analysis of integrated human papillomavirus type 16 DNA in cervical cancers: Amplification of viral sequences together with cellular flanking sequences JOURNAL J. Virol. 
64, 813-821 (1990) STANDARD simple staff_entry FEATURES from to/span description site 500 501 Human DNA end/HPV-16 DNA start BASE COUNT 159 a 82 c 101 g 168 t ORIGIN 1 attcgagctc ggtacccaac atctcaaaat tttgttcttc agtctgtaaa atgggatgat 61 aaatctctca ggtttggtgt aagaaaaaaa taatatgctc acctaataga ccttcaatta 121 ctggtagttt ccatcatctt aatgaggatt atatctttat agtgagcacc cattagatgg 181 tgttgataaa tacatcaatg agtattttag gcagaaagca gagtaaagca gaagtactgg 241 cattctttgc tgtactcagt tttattaact gattttatat tgatcacgtt ctttgttaca 301 tgtcagtatt atagtggcag ttgaaggtgg taatattttt agtctccgtt agtgaaatga 361 caggcattga gctctcagtc atacctttgt aggccttcgt tgaggtgaat acctacctct 421 taactagaaa aagatggaga atttcttgct tggaaggaaa ttaatgcaat gtccaggtca 481 tctcctaaaa agcctgaagg aaacaaagta // LOCUS HUMMHDQBH 1104 bp ss-mRNA PRI 26-MAY-1990 DEFINITION Human MHC HLA-DQ beta mRNA, complete cds. ACCESSION M32577 KEYWORDS cell surface glycoprotein; class II gene; integral membrane protein; major histocompatibility complex. SOURCE Human (DR4-Dw14), cDNA to mRNA, LS40 homozygous cell line. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 1104) AUTHORS Hilden,J.M., Curtsinger,J.M., Cairns,J.S. and Bach,F.H. TITLE DQ beta sequences in HLA-DR4 haplotypes JOURNAL Hum. Immunol. 
18, 261-264 (1987) STANDARD simple staff_entry FEATURES from to/span description pept < 1 754 MHC HLA-DQ beta precursor (AA at 2) /nomgen="LS0098W" /map="6p21.3" /hgml_locus_uid="HLA-DQB1" sigp < 1 64 MHC HLA-DQ beta signal peptide (AA at 2) matp 65 751 MHC HLA-DQ beta BASE COUNT 231 a 324 c 303 g 246 t ORIGIN 1 aggccttcgg gtagcaactg tgaccttgat gctggcgatg ctgagcaccc cggtggctga 61 gggcagagac tctcccgagg atttcgtgta ccagtttaag ggcatgtgct acttcaccaa 121 cgggacggag cgcgtgcgtc ttgtgaccag atacatctat aaccgagagg agtacgcacg 181 cttcgacagc gacgtggggg tgtatcgggc ggtgacgccg ctggggccgc ctgccgccga 241 gtactggaac agccagaagg aagtcctgga gaggacccgg gcggagttgg acacggtgtg 301 cagacacaac taccagttgg agctccgcac gaccttgcag cggcgagtgg agcccacagt 361 gaccatctcc ccatccagga cagaggccct caaccaccac aacctgctgg tctgctcagt 421 gacagatttc tatccagccc agatcaaagt ccggtggttt cggaatgacc aggaggagac 481 aactggcgtt gtgtccaccc cccttattag gaacggtgac tggaccttcc agatcctggt 541 gatgctggaa atgactcccc agcgtggaga cgtctacacc tgccacgtgg agcaccccag 601 cctccagaac cccatcatcg tggagtggcg ggctcagtct gaatctgccc agagcaagat 661 gctgagtggc attggaggct tcgtgctggg gctgatcttc ctcgggctgg gccttattat 721 ccatcacagg agtcagaaag ggctcctgca ctgactcctg agactatttt aactgggatt 781 ggttatcact tttctgtaac gcctgcttgt ccctgcccag aattcccagc tgcctgtgtc 841 agcctgtccc cctgagatca gagtcctaca gtggctgtca cgcagccacc aggtcatctc 901 ctttcatccc cacctcgagg ctgatggctg tgaccctgct tcctgcactt acccagagcc 961 tctgcctgtg cacggccagc tgcgtctact gaggccccaa ggggtttctg tttctattct 1021 ctcctcagac tgctcaagag aagcacatga aaaccattac ctgactttag agctttttta 1081 cataattaaa catgatcctg agtt // LOCUS HUMMHDR1C 1191 bp ss-mRNA PRI 26-MAY-1990 DEFINITION Human class II HLA-DRB1-BON mRNA, complete cds. ACCESSION M33600 KEYWORDS cell surface glycoprotein; class II gene; integral membrane protein; major histocompatibility complex. SOURCE Human (haplotype DRB1-BON) DNA. 
ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 1191) AUTHORS Coppin,H.L., Avoustin,P., Fabron,J., Huchenq,A., Garnier,J.M., Thomsen,M. and De Preval,C. TITLE Evolution of the HLA-DR1 gene family: Structural and functional analysis of the new allele "DR-BON" JOURNAL J. Immunol. 144, 984-989 (1990) STANDARD simple staff_entry FEATURES from to/span description pept 71 871 MHC HLA-DR1-BON precursor sigp 71 157 MHC HLA-DR1-BON signal peptide matp 158 868 MHC HLA-DR1-BON BASE COUNT 258 a 312 c 344 g 277 t ORIGIN 1 gcccaagtat caagagggag agtgagactt gcctgcttct ctggcccctg gtcctgtcct 61 gttctccagc atggtgtgtc tgaagctccc tggaggctcc tgcatgacag cgctgacagt 121 gacactgatg gtgctgagct ccccactggc tttggctggg gacacccgac cacgtttctt 181 gtggcagctt aagtttgaat gtcatttctt caatgggacg gagcgggtgc ggttgctgga 241 aagatgcatc tataaccaag aggagtccgt gcgcttcgac agcgacgtgg gggagtaccg 301 ggcggtgacg gagctggggc ggcctgatgc cgagtactgg aacagccaga aggacatcct 361 ggaagacgag cgggccgcgg tggacaccta ctgcagacac aactacgggg ttggtgagag 421 cttcacagtg cagcggcgag ttgagcctaa ggtgactgtg tatccttcaa agacccagcc 481 cctgcagcac cacaacctcc tggtctgctc tgtgagtggt ttctatccag gcagcattga 541 agtcaggtgg ttccggaacg gccaggaaga gaaggctggg gtggtgtcca caggcctgat 601 ccagaatgga gattggacct tccagaccct ggtgatgctg gaaacagttc ctcggagtgg 661 agaggtttac acctgccaag tggagcaccc aagtgtgacg agccctctca cagtggaatg 721 gagagcacgg tctgaatctg cacagagcaa gatgctgagt ggagtcgggg gcttcgtgct 781 gggcctgctc ttccttgggg ccgggctgtt catctacttc aggaatcaga aaggacactc 841 tggacttcag ccaacaggat tcctgagctg aaatgcagat gaccacattc aaggaagaac 901 cttctgtccc agctttgcag aatgaaaagc tttcctgctt ggcagttatt cttccacaag 961 agagggcttt ctcaggacct ggttgctact ggttcggcaa ctgcagaaaa tgtcctccct 1021 tgtggcttcc tcagctcctg cccttggcct gaagtcccag cattgatgac agcgcctcat 1081 cttcaacttt tgtgctcccc tttgcctaaa ccgtatggcc tcccgtgcat ctgtacctca 1141 ccctgtacga caaacacatt 
acattattaa atgtttctca aagatggagt t // LOCUS HUMMHDRBBB 1216 bp ss-mRNA PRI 26-MAY-1990 DEFINITION Human MHC class II HLA-DR beta-1 mRNA (DR2.3), 5'end. ACCESSION M32578 KEYWORDS cell surface glycoprotein; class II gene; integral membrane protein; major histocompatibility complex. SOURCE Human type I diabetic (Dw4/LD MN2), cDNA to mRNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 1216) AUTHORS Freeman,S.M., Saunders,T.L., Madden,M., Segall,M., Bach,F.H. and Wu,S. TITLE Comparison of DR beta-1 alleles from diabetic and normal individuals JOURNAL Hum. Immunol. 19, 1-6 (1987) STANDARD simple staff_entry FEATURES from to/span description pept 62 862 MHC HLA-DR beta-1 precursor /nomgen="LV0063D" /map="6p21.3" /hgml_locus_uid="HLA-DRB1" sigp 62 148 MHC HLA-DR beta-1 signal peptide matp 149 859 MHC HLA-DR beta-1 BASE COUNT 265 a 331 c 341 g 279 t ORIGIN 1 agttctccct gagtgagact tgcctgctcc tctggcccct ggtcctgtcc tgttctccag 61 catggtgtgt ctgaagctcc ctggaggttc ctacatggca gtgctgacag tgacactgat 121 ggtgctgagc tccccactgg ctttggctgg ggacacccga ccatgtttct tgcagcagga 181 taagtatgag tgtcatttct tcaacgggac ggagcgggtg cggttcctgc acagaggcat 241 ctataaccaa caggagaacg tgcgcttcga cagcgacgtg ggggagtacc gggcggtgac 301 ggagctgggg cggcctgacg ctgagtactg gaacagccag aaggacatcc tggagcaggc 361 gcgggccgcg gtggacacct actgcagaca caactacggg gctgtggaga gcttcacagt 421 gcagcggcga gttgagccta aggtgactgt gtatcctgca aggacccaga ccctgcagca 481 ccacaacctc ctggtctgct ctgtgaatgg tttctatcca ggcagcattg aagtcaggtg 541 gttccggaac ggccaggaag agaaggctgg ggtggtgtcc acaggcctga ttcagaatgg 601 agactggacc ttccagattc tggtgatgct ggaaacagtt cctcggagtg gagaggttta 661 cacctgccaa gtggagcacc caagcgtgac gagccctctc acagtggaat ggagagcaca 721 gtctgaatct gcacagagca agatgctgag tggaatcggg ggctttgtgc tgggcctgct 781 cttccttggg gccgggctat tcatctactt caagaatcag aaagggcact ctggacttca 841 cccaacagga ctcgtgagct gaagtgcaga 
tgaccacatt caagggggaa ccttctgccc 901 cagctttgca tgatgaaaag ctttcctgct tggctcttat tcttccacaa gagaggactt 961 tctcaggccc tggttgctac cggttcagca actctgcaga aaatgtccat ccttgtggct 1021 tcctcagctc ctgcccttgg cctgaagtcc cagcattgat ggcagtgcct catcttcaac 1081 tttagtgctc ccctttacct aaccctacgg cctcccatgc atctgtactc cccctgtgcc 1141 acaaatggac tacgttatta aatttttctg aagcccagag ttaaaaatca tctgtccacc 1201 tggcaccaaa gacaaa // LOCUS J05239 162 bp ds-DNA BAD 26-MAY-1990 DEFINITION Figure 1. Sequence of the 166-bp restriction fragment. ACCESSION J05239 REFERENCE 1 (bases 1 to 162) AUTHORS Jones,B.K. and Yeung,A.T. TITLE DNA base composition determines the specificity of UvrABC endonuclease incision of a psoralen cross-link JOURNAL J. Biol. Chem. 265, 3489-3496 (1990) STANDARD unannotated staff_entry COMMENT Bad entry: secondary reference to PNASU 75, 5314-5318 (1978): lac promoter sequence. FEATURES from to/span description BASE COUNT 40 a 43 c 40 g 39 t ORIGIN 1 cctccgttga gccatctgga tcggcagcgt tgtcttcatc aaccggaacg agcatgccgg 61 agagcagctc actcattagg caccccaggc tttacacttt atgcttccgg ctcgtataat 121 gtgtggaatt gtgagcggat aacaatttca cacaggaaac ag // LOCUS MLVENVB 2002 bp ss-RNA VRL 26-MAY-1990 DEFINITION Murine leukemia virus 10A1 derivative env gene, complete cds. ACCESSION M33470 KEYWORDS envelope protein. SOURCE Murine leukemia virus 10A1 derivative viral RNA, clone 10A1. ORGANISM Murine leukemia virus Viridae; ss-RNA enveloped viruses; Positive strand RNA virus; Retroviridae; Oncovirinae; Type C oncovirus group; Mammalian type C oncoviruses; Murine leukemia viruses. REFERENCE 1 (bases 1 to 2002) AUTHORS Ott,D., Friedrich,R. and Rein,A. TITLE Sequence analysis of amphotropic and 10A1 murine leukemia viruses: Close relationship to mink cell focus-inducing viruses JOURNAL J. Virol.
64, 757-766 (1990) STANDARD simple staff_entry FEATURES from to/span description pept 65 2002 env protein BASE COUNT 528 a 553 c 482 g 439 t ORIGIN 1 ggatccacgc cgctcacgta aaggcggcga caacccctcc ggccggaaca gcatcaggac 61 cgacatggaa ggtccagcgt tctcaaaacc ccttaaagat aagattaacc cgtggaagtc 121 cttaatggtc atgggggtct atttaagagt agggatggca gagagccccc atcaggtctt 181 taatgtaacc tggagagtca ccaacctgat gactgggcgt accgccaatg ccacctccct 241 tttaggaact gtacaagatg ccttcccaag attatatttt gatctatgtg atctggtcgg 301 agaagagtgg gacccttcag accaggaacc atatgtcggg tatggctgca aataccccgg 361 agggagaaag cggacccgga cttttgactt ttacgtgtgc cctgggcata ccgtaaaatc 421 ggggtgtggg gggccaagag agggctactg tggtgaatgg ggttgtgaaa ccaccggaca 481 ggcttactgg aagcccacat catcatggga cctaatctcc cttaagcgcg gtaacacccc 541 ctgggacacg ggatgctcca aaatggcttg tggcccctgc tacgacctct ccaaagtatc 601 caattccttc caaggggcta ctcgaggggg cagatgcaac cctctagtcc tagaattcac 661 tgatgcagga aaaaaggcta attgggacgg gcccaaatcg tggggactga gactgtaccg 721 gacaggaaca gatcctatta ccatgttctc cctgacccgc caggtcctca atatagggcc 781 ccgcatcccc attgggccta atcccgtgat cactggtcaa ctacccccct cccgacccgt 841 gcagatcagg ctccccaggc ctcctcagcc tcctcctaca ggcgcagcct ctatagtccc 901 tgagactgcc ccaccttctc aacaacctgg gacgggagac aggctgctaa acctggtaga 961 aggagcctat caggcgctta acctcaccaa tcccgacaag acccaagaat gttggctgtg 1021 cttagtgtcg ggacctcctt attacgaagg agtagcggtc gtgggcactt ataccaatca 1081 ttctaccgcc ccggccagct gtacggccac ttcccaacat aagcttaccc tatctgaagt 1141 gacaggacag ggcctatgca tgggagcact acctaaaact caccaggcct tatgtaacac 1201 cacccaaagt gccggctcag gatcctacta ccttgcagca cccgctggaa caatgtgggc 1261 ttgtagcact ggattgactc cctgcttgtc caccacgatg ctcaatctaa ccacagacta 1321 ttgtgtatta gttgagctct ggcccagaat aatttaccac tcccccgatt atatgtatgg 1381 tcagcttgaa cagcgtacca aatataagag ggagccagta tcgttgaccc tggcccttct 1441 gctaggagga ttaaccatgg gagggattgc agctggaata gggacgggga ccactgccct 1501 aatcaaaacc cagcagtttg agcagcttca cgccgctatc cagacagacc tcaacgaagt 1561 cgaaaaatca 
attaccaacc tagaaaagtc actgacctcg ttgtctgaag tagtcctaca 1621 gaaccgaaga ggcctagatt tgctcttcct aaaagaggga ggtctctgcg cagccctaaa 1681 agaagaatgt tgtttttatg cagaccacac gggactagtg agagacagca tggccaaact 1741 aagggaaagg cttaatcaga gacaaaaact atttgagtca ggccaaggtt ggttcgaagg 1801 gcagtttaat agatccccct ggtttaccac cttaatctcc accatcatgg gacctctaat 1861 agtactctta ctgatcttac tctttggacc ctgcattctc aatcgattgg tccaatttgt 1921 taaagacagg atctcagtgg tccaggctct ggttttgact caacaatatc accagctaaa 1981 acctatagag tacgagccat ga // LOCUS MLVENVC 2001 bp ss-RNA VRL 26-MAY-1990 DEFINITION Murine leukemia virus env gene, complete cds. ACCESSION M33469 KEYWORDS envelope protein. SOURCE Murine leukemia virus viral RNA, clone 4070A. ORGANISM Murine leukemia virus Viridae; ss-RNA enveloped viruses; Positive strand RNA virus; Retroviridae; Oncovirinae; Type C oncovirus group; Mammalian type C oncoviruses; Murine leukemia viruses. REFERENCE 1 (bases 1 to 2001) AUTHORS Ott,D., Friedrich,R. and Rein,A. TITLE Sequence analysis of amphotropic and 10A1 murine leukemia viruses: Close relationship to mink cell focus-inducing viruses JOURNAL J. Virol.
64, 757-766 (1990) STANDARD simple staff_entry FEATURES from to/span description pept 37 2001 env protein BASE COUNT 532 a 560 c 472 g 437 t ORIGIN 1 ggccgacacc cagagtggac catcctctgg acggacatgg cgcgttcaac gctctcaaaa 61 ccccctcaag ataagattaa cccgtggaag cccttaatag tcatgggagt cctgttagga 121 gtagggatgg cagagagccc ccatcaggtc tttaatgtaa cctggagagt caccaacctg 181 atgactgggc gtaccgccaa tgccacctcc ctcctgggaa ctgtacaaga tgccttccca 241 aaattatatt ttgatctatg tgatctggtc ggagaggagt gggacccttc agaccaggaa 301 ccgtatgtcg ggtatggctg caagtacccc gcagggagac agcggacccg gacttttgac 361 ttttacgtgt gccctgggca taccgtaaag tcggggtgtg ggggaccagg agagggctac 421 tgtggtaaat gggggtgtga aaccaccgga caggcttact ggaagcccac atcatcgtgg 481 gacctaatct cccttaagcg cggtaacacc ccctgggaca cgggatgctc taaagttgcc 541 tgtggcccct gctacgacct ctccaaagta tccaattcct tccaaggggc tactcgaggg 601 ggcagatgca accctctagt cctagaattc actgatgcag gaaaaaaggc taactgggac 661 gggcccaaat cgtggggact gagactgtac cggacaggaa cagatcctat taccatgttc 721 tccctgaccc ggcaggtcct taatgtggga ccccgagtcc ccatagggcc caacccagta 781 ttacccgacc aaagactccc ttcctcacca atagagattg taccggctcc acagccacct 841 agccccctca ataccagtta ccccccttcc actaccagta caccctcaac ctcccctaca 901 agtccaagtg tcccacagcc acccccagga actggagata gactactagc tctagtcaaa 961 ggagcctatc aggcgcttaa cctcaccaat cccgacaaga cccaagaatg ttggctgtgc 1021 ttagtgtcgg gacctcctta ttacgaagga gtagcggtcg tgggcactta taccaatcat 1081 tccaccgctc cggccaactg tacggccact tcccaacata agcttaccct atctgaagtg 1141 acaggacagg gcctatgcat gggggcagta cctaaaactc accaggcctt atgtaacacc 1201 acccaaagcg ccggctcagg atcctactac cttgcagcac ccgccggaac aatgtgggct 1261 tgcagcactg gattgactcc ctgcttgtcc accacggtgc tcaatctaac cacagattat 1321 tgtgtattag ttgaactctg gcccagagta atttaccact cccccgatta tatgtatggt 1381 cagcttgaac agcgtaccaa atataaaaga gagccagtat cattgaccct ggcccttcta 1441 ctaggaggat taaccatggg agggattgca gctggaatag ggacggggac cactgcctta 1501 attaaaaccc agcagtttga gcagcttcat gccgctatcc agacagacct caacgaagtc 1561 gaaaagtcaa 
ttaccaacct agaaaagtca ctgacctcgt tgtctgaagt agtcctacag 1621 aaccgcagag gcctagattt gctattccta aaggagggag gtctctgcgc agccctaaaa 1681 gaagaatgtt gtttttatgc agaccacacg gggctagtga gagacagcat ggccaaatta 1741 agagaaaggc ttaatcagag acaaaaacta tttgagacag gccaaggatg gttcgaaggg 1801 ctgtttaata gatccccctg gtttaccacc ttaatctcca ccatcatggg acctctaata 1861 gtactcttac tgatcttact ctttggacct tgcattctca atcgattggt ccaatttgtt 1921 aaagacagga tctcagtggt ccaggctctg gttttgactc agcaatatca ccagctaaaa 1981 cccatagagt acgagccatg a // LOCUS MTYRPVP 6331 bp ss-RNA VRL 26-MAY-1990 DEFINITION Eggplant mosaic virus genome. ACCESSION J04374 KEYWORDS replicase protein; virion protein. SOURCE Eggplant mosaic tymovirus viral RNA. ORGANISM Eggplant mosaic virus Viridae; ss-RNA nonenveloped viruses; Isometric ss-RNA viruses; Tymovirus. REFERENCE 1 (bases 1 to 6331) AUTHORS Osorio-Keese,M.E., Keese,P. and Gibbs,A. TITLE Nucleotide sequence of the genome of eggplant mosaic tymovirus JOURNAL Virology 172, 547-554 (1989) STANDARD simple staff_entry FEATURES from to/span description pept 102 2051 overlapping out-of-phase protein pept 109 5628 replicase protein (putative) pept 5633 6199 virion protein BASE COUNT 1337 a 2441 c 987 g 1566 t ORIGIN 1 gtaatcagaa ccagaactaa ccctgttatc agccttagtt cttttacttt cctgtccaaa 61 tttctgaacc gactagtgcc ttcctagaac ccactacgtc aatgcctcat ggcctttcag 121 tctgctctcg aagctctcaa ctcaactact cacagagatg cttctacaaa tccaattctg 181 aactccgtcg tggaacctct ccgcgactct ctatccctat atccctggct ccttcccaaa 241 gaagccgttc cccaccttct atcctggggc atcccgaact ccggcctcgg agtcactccc 301 cacccccacc caatccacaa aacagtcgag acttttctcc tgttcaatca ctggcatgct 361 ctcgctcgcc tgccttcaac tgtgatgttc atgaaaccgt ccaagtttca aaaacttgcg 421 gctctaaacc caaaattcca agagttgatc aactttcgac tcactgccgc cgacaccact 481 cgctacccct ccacctcact cacttttcca agcaattcaa tttgcttcat gcacgatgct 541 ctgatgtact tttctccagc tcagatcgtc gatctcttca ctcagtctcc cgcactcgag 601 accctgtact gcagtctcat agtgcctcca gagtctcatt tcacagatct ctctctcttc 661 
cccgagatct acacttacaa gatctcaggt cagactctcc actacatccc ggagaatcac 721 cactccggct cgtacaatca gcccctccaa gccccatctt ggctgaagat ttcctccatc 781 ctctcgcctt ccctcgcttt gtctgtgacc aagctggaat cttggggccc agtccactcc 841 atattgatcc agcgaggcct accaccaaag ccctctctct ctgcacgccc ccccgtcctg 901 ccaaatcaac ctccccgtgc aacaactccc aactcccaaa accaactgct gcatcagaca 961 agccagctat tcttccaact gcagcagcct caactcagcc tggtctcctt ccgaattcca 1021 gactgcgtag aactgccaca agccaccttt ctgcgccaac ctctccgcca ccggctagtg 1081 ccaacaagcg tttacaacgc tctcttcacc tacactcgcg cagtccgcac tcttcgcact 1141 tccgacccag ccggatttgt gcgaactcaa agcaacaaac ccgagcacgc ttgggtcact 1201 ccaaacgcgt gggacaatct gcagaccttg tctgtcaatg ccccccaccg cccccaagta 1261 tgctaccact tcttctcctc ccccgtggca aggttaaagc tccacttcgc ccaacactgg 1321 cgagcctatc ttttggctct caccccattc cttaccacgt cacctcttct cctcccctta 1381 ttcaatttca acaccccttt ccccctccct cggctacttt ctctgtttcg ccgctcggtg 1441 tcctcaccac ggcttttgca ctcaatccta cccagtcagc tgagaggagc tgcgatcccg 1501 aatcgcccac tcccactctg ggtcacaaaa ctacatcact ttctcgactc ccactccctc 1561 ctccccactc cccccattcg gcccaggata gagcttcagc gcttgccact gatgtctcta 1621 attccgaaac caaaaattgt ccttccccta ctgtccctcc tcctttcctc cccaaccatc 1681 tacatccact tcttccaggc acagaccccc caacaactcc acgacaatta tcaccttcac 1741 cttcatccct ctcgcttcga actttcctgg actctgcagt catatcatgt gactcaagcc 1801 cagtccttcc tccctctcct tctcccagct cccactcaag ctcaagcttc caatcctgca 1861 cctcgccccc ccgctttcca tgctatcccc ctcccccctc agccctcgac ctcctcttct 1921 cctccactcc aggaaccgac cctttccccc cacctgatac acccccccct cacaagagaa 1981 ccatcgccct tgaacggctg cgcctgcgac agtgcgctac tcccttccac agctgcgatg 2041 acgtctgctg aacatcccac tccactcaac ccccccacac ctagcccaac accagacgtc 2101 cctcctcccg actcacccgg taacccatca cttttgaagc aagtccctcc cgaagcgaac 2161 ttgcatccta tccacaaccc agacctcccc tcttccacca ctcttccttc tggggccctg 2221 acactggtcc cagccaaaac tccttccatc tacgccaatc ccaccccccc cagttcccat 2281 ccgttcaccc cactggctga tgaccccact gctgtgggtc cttgcctacc gttccacgtt 2341 ctccacccgg 
ctgactactt tcctctttca gccgagtttc tcacacggac ccggcatgtc 2401 cccccctctt ctctctcaca tccaaaactc aattgcctac tcacctgctt ttctgaactt 2461 tcaggacact ctgagtcaga tctttggttg tccctgcaat caatacttcc tgactcccaa 2521 ctccaaaatc ctgaagtctc gacacttggc ctgtccactg acattctcac agctctctgc 2581 ttcatctacc attcatctgt gactctccat gccccctcag gagtttatca ctacggcata 2641 gcctcctctt ctaccgtcta tgtcatccac tatcaaccag gccctcctcc tcatttttct 2701 ctctccccta gacttgccgc ttctgctcct cgctgcaacc ccaccaacag cagattggtc 2761 agacaagctc tgcggtttaa attgaacggc gagtttctcc ccttcaccca ggcttacgcg 2821 catgaatctt ccatcaccca tgccaaaaac ctcatctcca acatgaagaa tggttttgat 2881 ggaatcatgt cttctctcac tgactcctct aagggtccct ccccccgtga aaaactgacc 2941 actctcgact ctctcataga tgtcgctgcc cctcgcgaag tttctctcat ccacatcgcc 3001 ggcttcgcag gctgcggcaa gacccacccc atccaaaaac tcctccaaac ttcccctttt 3061 cacgacttcc gaatctcatg ccccactaat gaactccgat ccgaatggaa gcgtgatatg 3121 caaccaacag ctgaaaatgt ttggaggttc tccacatggg aatccagcct gctcaaacat 3181 tccgagatcc tcgtaatcga cgagatttac aagctccctc gtggctacct agatctctcc 3241 atccttgctg atccaactct ctccttggtc atcatccttg gtgaccctct ccaaggagag 3301 tatcactcga cctctcctca cagctccaat cactttcttc caagtgaggt ccaccgcttc 3361 aagtcttaca tcgactgcta ctgtttttgg tcccaccgca ttccaaagca gatagcatcc 3421 ttgttcggcg tagtatgcca caacacgaac gaaggtttcg tgagagccct cacatctcat 3481 ccccccaatt ccaaaaacct caccaatgcg accaacactg ctctcagtct ccaacagatg 3541 ggccaccacg ctatcaccat cagcgccaga agggtcacct tcaccgaggc ccatacaatt 3601 ctgcttgatc gtcataccaa ccttctctcc cccaacaact gtcttgttgc cctcacccgc 3661 agccgcactg gcgtctactt cgtcggcaat ctgcacctgg catcaaacag ctttggcaca 3721 aactacatgt tctctcaagc tctctgccaa ggcacaatcg acctaaacaa cgtgttcccc 3781 cacatcatgc ctcacctccc gaaaatgtat gaacccatcc gctcccggtc caaccgtttt 3841 gtgtctgggt ccctcaattt tcgaccaacc accaattccc gcctcctttc cagtctcact 3901 aagccaaccc acctcccccc tcacatccct accaaccact ccctggatgt cctagtttcc 3961 aaccctgtgc tccttggtga gaccctcgac cctcgattgg aggtcctcca cctcccccca 4021 actcgcctcc cattgcatct 
ggacctcctg cccacagtac cttcctcttc cagcttctcc 4081 tcagtcgacc atcttttccc aacccccatc tcccccgcta tctgcggcta caccttcgaa 4141 aatttggccg cattcttcct cccagctcat gacccggacc taaaggaggt gctcatcaat 4201 gaccaaaaga gcaaccagtt cccatacttg gacgcccctt ttgagctttc gtgccaaccc 4261 tcctcactgt tggcaccaat tcacaagccg gcctcggatc caacccttct ccctggctcc 4321 atcaagaaac gcctcagatt ccgcgcttct tcctccccat attccatcac tccatctgat 4381 caacttcttg gtcaacacct cttctcttct ttgtgcctgg cttatgggcg caaccccaat 4441 tctgtcctcc ccttccaacc tgagctcttc agtgagtgca tatgcattaa tgattacgct 4501 caactctcct ccaagactca agccaccatc gtggccaatc atcaaaggtc tgatcctgac 4561 tggcgcctaa ctgctgtccg catctttgcc aaggctcaac acaaagtaaa cgacgcttcc 4621 atcttttccg ggtggaaggc ttgccaaact ctagccctga tgcacggtta catcattctc 4681 gtactcggcc cagtcaagaa ataccaacgc atttttgatt ccaaggacag acctccccac 4741 atctactacc actgcggtaa aactccctcc cagctctccc aatggtgcca aactcacctt 4801 tctggctctt cctacatcgc caacgactac actgcctttg atcagtccca acacggcgag 4861 gctgtggtcc tggaatgttt gaagatgcgc cgcctctcca tcccggactc tctcattcag 4921 ctccactccc acctcaagtg ttccgtcgac acccagttcg gccccctcac ctgcatgcgc 4981 ctcactggcg agccgggcac ttatgatgac aactctgact acaacctagc tgtcatctac 5041 tcccaatact ccctcaatgg ccaccccatt ctgatctcag gcgatgactc cgtcctttgc 5101 ggcacaccgc ccccttctcc actttggccc actctcaaga aaatgcttca tctccgtttc 5161 aagatcgaac ggacctccca ccccctcttc tgcgggtatt acgtctcccc tcatggcgct 5221 gcccgcaacc cgtatgctct cttcgccaag ctcatgatct gcgttgatga caagagcctc 5281 catgacaaga agttgtccta tctctctgaa ttctccactg gccatctggc tggcgacctg 5341 gtcacctcca ttctcccttc ccacctactt ccctatcagt ccgccgtgca cgacttcttc 5401 tgccggaatt gcacgcccgc ggaaaaaatt ctcctgtctc tggacccaat ccctgagtcc 5461 aaaatcctcc agctcattct caaagttcgc tgggcttctc aagctttctt ttcctacctg 5521 cctcaaaaag ctcgcgaact ccttgtggca cgctcttctc tcccgtccct ctattccaat 5581 cccaaagtct ctcaactgga gtctgaattg cttcccttct ctcaatagat caatggaaga 5641 cacagcaatc atcagaagcc ctcagccctc cataaacgca ccaggcttcc atctgccacc 5701 caccgactca caacaatcct ctgctattga 
actccccttc cagtttcagg ccaccacttt 5761 tggcgcgact gaaacagctg ctcaaatcag tctggcctcc gccaacgcta ttaccaagct 5821 cgcgtctctc taccgccatg tgcggctcac gcagtgcgct gccaccatca ctccgacagc 5881 ggccgccatt gccaatcctc tcactgtcaa catcgtctgg gtgtctgaca attccactgc 5941 caagcccacc gagattctca atgtctttgg tggatcttcc tacacgtttg gcggcgccct 6001 caatgccacc aagcccctta ccatccctct ccccatgaac tcggtcaact gtatgctcaa 6061 ggactctgtt ctttacacag attgcccaaa gctcctggcc tactcagctg ctcccagctc 6121 tccctccaaa accccaaccg ccactatcca aatccatggc aagctccgct tgtcctcccc 6181 cctcctccaa gccaattaac tctctctccc tcagccacca cctcgctcct cccccatctc 6241 ctatggtaat tgcggacagt tccgctccct ctagcacaca gaggtccatt tgggtgcgac 6301 tcccccccct cccgtgggtc aacgggaacc a // LOCUS RATRGHA 542 bp ds-DNA ROD 26-MAY-1990 DEFINITION Rat growth hormone (rGH) gene, intron B repetitive DNA. ACCESSION M32696 KEYWORDS repetitive DNA. SOURCE Rat (strain Sprague-Dawley) DNA. ORGANISM Rattus norvegicus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 542) AUTHORS Guitierrez-Hartmann,A., Lieberburg,I., Gardner,D., Baxter,J.D. and Cathala,G.G. TITLE Transcription of two classes of rat growth hormone gene-associated repetitive DNA: Differences in activity and effects of tandem repeat structure JOURNAL Nucleic Acids Res. 
12, 7153-7173 (1984) STANDARD simple staff_entry BASE COUNT 199 a 114 c 118 g 111 t ORIGIN 1 aacagtaatg acagagaggg ctggagagat ggctcagtgg ttaagagcac ccgactgctc 61 ttccaaaggt cctgagttca attccagcaa ccacatggtg gctcacaacc atctgtaaag 121 agatccgatg ccctcttctg gtgtgtctga agacagctac agtgtactta tataataaac 181 aaataaatct ttaaaaaaaa aaacaaaaac ggggctggag agatggctca gcggttaaga 241 gcgcccgact gctcttccag aggtcatgag ttcaattcca gcaaccacat ggtggctcac 301 aaccatctgt aaagagatct gatgccctct tctggtgtat ctgaagacag ctacagtgta 361 cttatatata ataaataaat aaatctttaa aaaaaaaaca aaacaggggc tggggattta 421 gctcagtggt agagcgctta cctaggaagc gcaaggccct gggttcggtc cccagctccg 481 aaaaaaagaa ccaaaaaaaa aaaaaaaaac caaaacaaaa acaaaacagt aatgacagag 541 ag // LOCUS ALRVSRC 1801 bp ss-RNA VRL 26-MAY-1990 DEFINITION Rous sarcoma virus (Schmidt-Ruppin D strain) v-src gene, complete cds. ACCESSION M33292 KEYWORDS oncogene; pp60v-src; src gene; tyrosine kinase. SOURCE Rous sarcoma virus (strain Schmidt-Ruppin D) RNA, clone psrc1. ORGANISM Rous sarcoma virus Viridae; ss-RNA enveloped viruses; Positive strand RNA virus; Retroviridae; Oncovirinae; Type C oncovirus group; Avian sarcoma viruses. REFERENCE 1 (bases 1 to 1801) AUTHORS Reddy,S., Mazzu,D., Mahan,D. and Shalloway,D. TITLE Sequence and functional differences between Schmidt-Ruppin D and Schmidt-Ruppin A strains of pp60v-src JOURNAL Unpublished (1990) 406 S. Frear Bldg, University Park, PA 16802 STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by D.I.Shalloway, 26-MAR-1990. 
FEATURES from to/span description pept 65 1645 pp60v-src protein BASE COUNT 392 a 537 c 551 g 321 t ORIGIN 1 actctgctgg tggcctcgcg taccactgtg gccaagcggt agctggaacg tgcagccgac 61 caccatgggg agtagcaaga gcaagcctaa ggaccccagc cagcgccggc gcagcctgga 121 gccacccgac agcacccacc acgggggatt cccagcctcg cagaccccca acaagacagc 181 agcccccgac acgcaccgca cccccagccg ctccttcggg accgtggcca ccgagcccaa 241 gctcttcgag gacttcaaca cttctgacac cgttacgtcg ccgcagcgtg ccggggcact 301 ggctggcggc gtcaccactt tcgtggctct ctacgactac gagtcctgga ttgaaacgga 361 cttgtccttc aagaaaggag aacgcctgca gattgtcaac aacacggaag gtaactggtg 421 gctggctcat tccgtgacta caggacagac gggctacatc cccagtaact atgtcgcgcc 481 ctcagactcc atccaggctg aagagtggta ctttgggaag atcactcgtc gggagtccga 541 gcggctgctg ctcaaccccg aaaacccccg gggaaccttc ttggtccggg agagcgagac 601 gacaaaaggt gcctattgcc tctccgtttc tgactttgac aacgccaagg ggctcaatgt 661 gaagcactac aagatccgca agctggacag cggcggcttc tacatcacct cacgcacaca 721 gttcagcagc ctgcagcagc tggtggccta ctactccaaa catgctgatg gcttgtgcca 781 ccgcctgacc aacgtctgcc ccacgtccaa gccccagacc cagggactcg ccaaggacgc 841 gtgggaaatc ccccgggagt cgctgcggct ggaggtgaag ctggggcagg gctgctttgg 901 agaggtctgg atggggacct ggaacggcac caccagagtg gccataaaga ctctgaagcc 961 cggcaccatg tccccggagg ccttcctgca ggaagcccaa gtgatgaaga agctccagca 1021 tgagaagctg gttcaactgt acgcagtcgt gtcggaagag cccatctaca tcgtcattga 1081 gtacatgagc aaggggagcc tcctggattt cctgaaggga gagatgggca agtacctgcg 1141 gctgccacag ctcgttgata tggctgatca gattgcatcc ggcatggcct atgtggagag 1201 gatgaactac gtgcaccgag acctgcgggc ggccaacatc ctggtggggg agaacctggt 1261 gtgcaaggtg gctgactttg ggctggcacg cctcatcgag gacaacgagt acacagcacg 1321 gcaaggtgcc aagttcccca tcaagtggac agcccccgag gcagccctct atggccggtt 1381 caccatcaag tcggatgtct ggtccttcgg catcctgctg actgagctga ccaccaaggg 1441 ccggatgcca tacccaggga tgggcaacgg ggaggtgctg gaccgggtgg agaggggcta 1501 ccgcatgccc tgcccgcccg agtgccccga gtcgctgcat gaccttatgt gccagtgctg 1561 gcggagggac cctgaggagc ggcccacttt tgagtacctg caggcccagc 
tgctccctgc 1621 ttgtgtgttg gaggtcgctg agtagtgcgc gagcaaaatt taagctacaa caaggcaagg 1681 cttggccgac aattgcatga agaatctgct tagggttagg cgttttgcgc tgcttcgcga 1741 tgtacgggcc agatatacgc gtatctgagg ggactagggt gtgtttaggc gaaaagcggg 1801 g // LOCUS AVIH2AA 3800 bp ds-DNA BCT 26-MAY-1990 DEFINITION A.vinelandii H2 uptake hydrogenase (hoxK), complete cds, and H2 uptake hydrogenase (hoxG), complete cds. ACCESSION M33152 KEYWORDS H2 uptake hydrogenase. SOURCE A.vinelandii (strain OP) DNA, clone pALM21. ORGANISM Azotobacter vinelandii Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Aerobic rods and cocci; Azotobacteraceae. REFERENCE 1 (bases 1 to 3800) AUTHORS Menon,A.L., Stultz,L.W., Robson,R.L. and Mortenson,L.E. TITLE Cloning, nucleotide sequence and characterization of the (NiFe) hydrogenase structural genes and hoxG from Azotobacter vinelandii JOURNAL Unpublished (1990) U of Georgia, Dep Biochemistry, Athens, GA 30602 STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by R.L.Robson, 22-MAR-1990. FEATURES from to/span description pept 149 1225 H2 uptake hydrogenase (hoxK) precursor sigp 149 283 H2 uptake hydrogenase signal peptide (put.) 
matp 284 1222 H2 uptake hydrogenase pept 1222 3030 H2 uptake hydrogenase (hoxG) ORF 3047 3769 ORF3 BASE COUNT 686 a 1318 c 1213 g 583 t ORIGIN 1 tgtatcaagc catgacaaaa acatggcatt ggcgcattat tcgtgcggtt ttcattcagc 61 aaccgtgggc catacaaccg gcgcgccgtc atagccgaag gacggtgcgc aggggcgccg 121 ataacgacct ggccacaagg gtaacggcat gtctcgactc gaaactttct atgacgtgat 181 gcggcgtcag ggcatcacgc gccgcagctt tctcaaatat tgcagcctga ccgccgcggc 241 cctgggcctc ggcccggcct tcgccccgcg gatcgcccac gcgatggaaa ccaagccgcg 301 cactccggtg ctctggctgc acggcctgga gtgcacctgc tgctccgagt cgttcatccg 361 ttcggcccac ccgctggtca aggacgtggt gctgtcgatg atctcgctgg actacgacga 421 caccctgatg gccgccgccg gccaccaggc cgaggccgcc ctcgaagaga ccatgcgcaa 481 gtacaagggc gagtacatcc tcgccgtgga gggcaacccg ccgctcaacg aggacggcat 541 gttctgcatc gtcggcggca agccgttcat cgagcagctc aggcatgtgg cgaaggacgc 601 caaggcggtg atcgcctggg gcagttgcgc cagttggggc tgcgtgcagg cggcccggcc 661 caacccgacc caggcggtgc cgatccacaa ggtcatcacc gacaagccga tcgtcaaggt 721 gcccggctgc ccgccgatcg ccgaggtgat gaccggggtg atcacctaca tgctgacctt 781 cggcaagctg cccgagctgg accgccaggg gcggccgaag atgttctacg gccagcgcat 841 ccacgacaag tgctaccgcc gcccgcactt cgacgccggc cagttcgtcg agcactggga 901 cgacgagggc gcgcgcaagg gctactgcct gtacaaggtc ggctgcaagg gcccgaccag 961 ctacaacgcc tgctcgacgg tgcgctggaa cgagggcact tccttcccga tccaggccgg 1021 ccacggctgc atcggctgct cggaggacgg tttctgggac aagggctcgt tctatgaacg 1081 cctgaccacc attccgcagt tcggcatcga gaagaacgcc gacgaaatcg gcgccgccgt 1141 cgccggcggg gtcggcgcgg ccatcgccgc gcatgccgcg gtcaccgcca tcaagcgcct 1201 gcagaacaag ggggatcgcc catgagcagc ctgccgaacg ccagccaact ggacaagtcc 1261 ggcaggcgca tcgtcgtcga cccggtgacc cgcatcgagg gccacatgcg ctgcgaggtc 1321 aacgtcgacg ccagcaacgt gatcaccaac gccgtctcca ccggcaccat gtggcgcggc 1381 ctggaggtca tcctcaaggg ccgcgacccg cgcgacgcct gggccttcgt cgagcgcatc 1441 tgcggcgtct gcaccggcac ccatgcgctg acctcggtgc gcgcggtgga ggatgccctg 1501 gacatccgca tcccctacaa cgcccacctg atccgcaacc tgatggacaa gacgctgcag 1561 gtgcacgacc acatcgtgca 
cttctaccac ctgcacgcgc tggactgggt caacccggtc 1621 aacgccctga aggccgatcc caaggctacc tccgccctgc agcaggcggt ttcgccggcc 1681 catgccaagt ccagccccgg ctacttccgc gacgtgcaga cgcgcctgaa gaagttcgtc 1741 gagagcggcc agctcggcct gttctccaac ggctactggg acaatccggc ctacaagctg 1801 ccgcccgagg cggacctgat ggccgtggcc cactacctgg aggcgctgga cctgcagaag 1861 gacatcgtca agatccatac catcttcggc ggcaagaacc cgcatccgaa ctacatggtc 1921 ggcggcgtgg cctgcgccat caacctggac gacgtcggcg ccgccggcgc gccggtcaac 1981 atgaccagcc tgaacttcgt cctcgaacgc atccacgagg cccgcgagtt caccaggaac 2041 gtctacctgc cggacgtgct ggcggtcgcc gggatctaca aggactggct gtacggcggc 2101 ggtctggccg cgcacaacct gctgtcctac ggcaccttca ccaaggtgcc ctacgacaag 2161 tccagcgacc tgttgccggc cggcgccatc gtcggcggca attgggacga ggtgctgccg 2221 gtcgacgtgc gcgatcccga ggagatccag gagttcgtca gccactcctg gtacagctac 2281 gccgacgaaa ccaaggggct gcatccctgg gacggcgtca ccgagccgaa attcgagctc 2341 ggcccgaaca ccaagggcag ccgcacccac atccaggaaa tcgacgaggc gcacaagtac 2401 agctggatca aggcgccgcg ctggcgcggc cacgctatgg aggtcggccc gctggcacgt 2461 tacatcatcg cctacgcttc gggccgcgaa tacgtgaagg aacaggtcga ccgctcgctg 2521 gccgccttca accagagcac cggcctgaac ctcggcctca agcagttcct gccctcgacc 2581 ctcggccgca ccctggcgcg cgccctggag tgcgagctgg cggtggacag catgctcgac 2641 gactggcagg ccctggtcgg caacatcaag gccggcgacc gcgccaccgc caacgtcgag 2701 aagtgggacc cgagcacctg gccgaaggag gccaagggcg tgggcatcaa cgaggcgccg 2761 cgcggcgccc tgggccactg gatcaggatc aaggacggca agatcgagaa ctaccaggcg 2821 atcgtgccga ccacctggaa cggcaccccg cgcgaccatc tgggcaacat cggcgcctac 2881 gaggccgcgc tgctcaacac caggatggag cgcccggacg agccggtgga gatcctgcgc 2941 accctgcaca gcttcgaccc ctgcctggcc tgttcgaccc acgtgatgtc gccggacggc 3001 caggagctga cccgggtgaa ggtccgctga accggaggat tgcgcgatgg cactggaaaa 3061 atccctggaa accggcgacg gccaggagaa ggtccgcaag cagaccgcgg tgtacgtcta 3121 cgaggcgccg ctgcgcctct ggcactgggt cacggcgctg tccatcgtcg tgctcggcgt 3181 gaccggctac ttcatcggcg cgccgctgcc gacgatgccc ggcgaggcga tggacaacta 3241 cctgatgggc tacatccgct tcgcccactt 
cgccgccggc tacgtgctgg cgatcggctt 3301 cctcggccgg gtctactggg ccttcgtcgg caaccaccac gcccgcgagc tgttcctcgt 3361 gccggtgcac cgcaaggcct ggtggaagga gctgtggcac gaggtgcgct ggtacctgtt 3421 cctggaaaag accccgaaga agtacatcgg ccacaacccc ctgggccagt tggcgatgtt 3481 ctgcttcttc gtggtcggcg cggtgttcat gagcgtcacc ggcttcgccc tctacgccga 3541 ggggctgggg cgggacagct gggccgaccg gctgttcggc tgggtgatcc cgctgttcgg 3601 ccagagccag gacgtgcaca cctggcacca cctgggcatg tggtacctcg tcgtcttcgt 3661 catggtgcat gtctacctgg ccgtgcgcga agacatcgtt tcccggcagt cgctgatctc 3721 caccatggtc ggcggctggc ggatgttcaa ggacgaccgg ccggattgag ccccgtgtcg 3781 tcccttccgt ccgggccggt // LOCUS RABIGHAS 402 bp ss-mRNA MAM 26-MAY-1990 DEFINITION Rabbit Ig rearranged mu-chain mRNA V-D-J region, clone 1-1. ACCESSION M29412 KEYWORDS diversity exon; immunoglobulin heavy chain; joining exon; mu-immunoglobulin; processed gene; variable region. SOURCE Rabbit (haplotype b) adult spleen, cDNA to mRNA, clone 1-1. ORGANISM Oryctolagus sp. Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Lagomorpha; Leporidae. REFERENCE 1 (bases 1 to 402) AUTHORS DiPietro,L.A. and Knight,K.L. TITLE Restricted utilization of germ-line VH gene and diversity of D regions in rabbit splenic Ig mRNA JOURNAL J. Immunol. 144, 1969-1973 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by L.A.DiPietro, 25-OCT-1989. 
FEATURES from to/span description pept 1 > 402 Ig mu-chain V-D-J precursor sigp 1 57 Ig mu-chain signal peptide matp 58 > 402 Ig mu-chain recomb 339 340 V-region end/D-region start recomb 355 356 D-region end/J-region start BASE COUNT 86 a 108 c 118 g 90 t ORIGIN 1 atggagactg ggctgcgctg gcttctcctg gtcgctgtgc tcaaaggtgt ccagtgtcag 61 tcggtggagg agtccggggg tcgcctggtc acgcctggga cacccctgac actcacctgc 121 acagcctctg gattctccct cagtagttac tacatgcaat gggtccgcca ggctccaggg 181 aaggggctgg aatggatcgg aatcattggt agtagtggta gcacatacta cgcgagctgg 241 gtgaagggcc gattcaccat ctccaaaacc tcgaccacgg tggatctgaa aatgaccagt 301 ctgacaaccg aggacacggc cacctatttc tgtgccagag catatattag taatactgat 361 ggttctggct ttaacttgtg gggccaaggc accctggtca cc // LOCUS RABIGHAT 399 bp ss-mRNA MAM 26-MAY-1990 DEFINITION Rabbit Ig rearranged mu-chain mRNA V-D-J region, clone 1-3-1. ACCESSION M29413 KEYWORDS diversity exon; immunoglobulin heavy chain; joining exon; mu-immunoglobulin; processed gene; variable region. SOURCE Rabbit (haplotype b) adult spleen, cDNA to mRNA, clone 1-3-1. ORGANISM Oryctolagus sp. Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Lagomorpha; Leporidae. REFERENCE 1 (bases 1 to 399) AUTHORS DiPietro,L.A. and Knight,K.L. TITLE Restricted utilization of germ-line VH gene and diversity of D regions in rabbit splenic Ig mRNA JOURNAL J. Immunol. 144, 1969-1973 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by L.A.DiPietro, 25-OCT-1989. 
FEATURES from to/span description pept 1 > 399 Ig mu-chain V-D-J precursor sigp 1 57 Ig mu-chain signal peptide matp 58 > 399 Ig mu-chain recomb 342 343 V-region end/D-region start recomb 361 362 D-region end/J-region start BASE COUNT 88 a 114 c 111 g 86 t ORIGIN 1 atggagactg ggctgcgctg gcttctcctg gtcgctgtgc tcaaaggtgt ccagtgtcag 61 tcggtggagg agtccggcgg tcgcctggta aagcctgacg aatccctgac actcacctgc 121 acagcctctg gattctccct cagtacctac aacatgatct gggtccgcca ggctccagga 181 aaggggctgg aatacatcgg ccacattagt tttggtggta gcacatacta cgcgagctgg 241 gcgaaaggtc gatgcaccat atccaaaacc tcgaccacgg tggatctgaa aatgaccagt 301 ctgacaaccg aggacacggc cacctatttc tgtgccaggg gatggactcc taaaagtctt 361 tcagccttta acttgtgggg cccaggcacc ctggtcacc // LOCUS RABIGHAU 390 bp ss-mRNA MAM 26-MAY-1990 DEFINITION Rabbit Ig rearranged mu-chain mRNA V-D-J region, clone 1-5. ACCESSION M29414 KEYWORDS diversity exon; immunoglobulin heavy chain; joining exon; mu-immunoglobulin; processed gene; variable region. SOURCE Rabbit (haplotype b) adult spleen, cDNA to mRNA, clone 1-5. ORGANISM Oryctolagus sp. Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Lagomorpha; Leporidae. REFERENCE 1 (bases 1 to 390) AUTHORS DiPietro,L.A. and Knight,K.L. TITLE Restricted utilization of germ-line VH gene and diversity of D regions in rabbit splenic Ig mRNA JOURNAL J. Immunol. 144, 1969-1973 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by L.A.DiPietro, 25-OCT-1989. 
FEATURES from to/span description pept 1 > 390 Ig mu-chain V-D-J precursor sigp 1 57 Ig mu-chain signal peptide matp 58 > 390 Ig mu-chain recomb 339 340 V-region end/D-region start recomb 368 369 D-region end/J-region start BASE COUNT 86 a 100 c 119 g 85 t ORIGIN 1 atggagactg ggctgcgctg gcttctcctg gtcgctgtgc tcaaaggtgt ccagtgtcag 61 tcggtggagg agtccgggga tcgcctggtc acgcctggga cacccctgac actcacatgc 121 acagtctctg gattctccct caatagttat gtagtgggct gggtccgcca ggctccagag 181 aagggactgg aatacatcgg aaccatttgg gtcgatggta agacatacta cgcgagctgg 241 acgaagggcc gattcaccat ctctaaaacc tcgaccacgg tggatctgaa aatgaccagt 301 ctgacaaccg aggacacggc cacatatttc tgtgccagat atggtagtag tggtgattta 361 ggcgtgtggg gccaagggac cctggtcacc // LOCUS RABIGHAV 351 bp ss-mRNA MAM 26-MAY-1990 DEFINITION Rabbit Ig rearranged mu-chain mRNA V-D-J region, clone 2-1. ACCESSION M29415 KEYWORDS diversity exon; immunoglobulin heavy chain; joining exon; mu-immunoglobulin; processed gene; variable region. SOURCE Rabbit (haplotype b) adult spleen, cDNA to mRNA, clone 2-1. ORGANISM Oryctolagus sp. Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Lagomorpha; Leporidae. REFERENCE 1 (bases 1 to 351) AUTHORS DiPietro,L.A. and Knight,K.L. TITLE Restricted utilization of germ-line VH gene and diversity of D regions in rabbit splenic Ig mRNA JOURNAL J. Immunol. 144, 1969-1973 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by L.A.DiPietro, 25-OCT-1989. 
FEATURES from to/span description pept < 1 > 351 Ig mu-chain V-D-J precursor (AA at 1) sigp < 1 21 Ig mu-chain signal peptide matp 22 > 351 Ig mu-chain recomb 306 307 V-region end/D-region start recomb 339 340 D-region end/J-region start BASE COUNT 77 a 101 c 100 g 73 t ORIGIN 1 gtgctcaaag gtgtccagtg tcagtcgctg gaggagtccg ggggtcgcct ggtcacgcct 61 gggacacccc tgacactcac ctgcacagcc tctggattct ccctcagtag ctactggatg 121 acctgggtcc gccaggctcc agggaagggg ctggaatgga tcggaatcat tgttcatggt 181 gatagcgcat actacgcgag ctgggcgaaa ggccgattca ccatctccag aacctcgacc 241 acggtggatc tgaaaatcac cagtccgaca accgaggaca cggccaccta tttctgtgcc 301 agagaatatt atggtactat taacttgtgg ggcccaggca ccctggttac c // LOCUS RABIGHAW 408 bp ss-mRNA MAM 26-MAY-1990 DEFINITION Rabbit Ig rearranged gamma-chain mRNA V-D-J region, clone 3-2. ACCESSION M29416 KEYWORDS diversity exon; gamma-immunoglobulin; immunoglobulin heavy chain; joining exon; processed gene; variable region. SOURCE Rabbit (haplotype b) adult spleen, cDNA to mRNA, clone 3-2. ORGANISM Oryctolagus sp. Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Lagomorpha; Leporidae. REFERENCE 1 (bases 1 to 408) AUTHORS DiPietro,L.A. and Knight,K.L. TITLE Restricted utilization of germ-line VH gene and diversity of D regions in rabbit splenic Ig mRNA JOURNAL J. Immunol. 144, 1969-1973 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by L.A.DiPietro, 25-OCT-1989. 
FEATURES from to/span description pept 1 > 408 Ig gamma-chain V-D-J precursor sigp 1 57 Ig gamma-chain signal peptide matp 58 > 408 Ig gamma-chain recomb 342 343 V-region end/D-region start recomb 369 370 D-region end/J-region start BASE COUNT 77 a 112 c 127 g 92 t ORIGIN 1 atggagactg ggctgcgctg gcttctcctg gtcgctgtgc tcaaaggtgt ccagtgtcag 61 tcgctggagg agtccggggg tcgcctggtc acgcctggga catccctgac actcacctgc 121 acagtctctg gattctccct cagtactagt gcaatggcct gggtccgcca ggctccaggg 181 aaggggctgg aatatgtcgg agtcattagt ggaagtggtg gcacatacta cgcgagctgg 241 gcgagcggcc ggttcaccat ttccaaagcc tcgtcgacca cggtggatct gaaaatgacc 301 agtctgacaa ccgaggacac ggccacctat ttctgtgcca gagtcaggga tagtcatggt 361 tatattggtg atgcttttga tccctggggc ccaggcaccc tggtcacc // LOCUS RABIGHAX 390 bp ss-mRNA MAM 26-MAY-1990 DEFINITION Rabbit Ig rearranged gamma-chain mRNA V-D-J region, clone 3-3-1. ACCESSION M29417 KEYWORDS diversity exon; gamma-immunoglobulin; immunoglobulin heavy chain; joining exon; processed gene; variable region. SOURCE Rabbit (haplotype b) adult spleen, cDNA to mRNA, clone 3-3-1. ORGANISM Oryctolagus sp. Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Lagomorpha; Leporidae. REFERENCE 1 (bases 1 to 390) AUTHORS DiPietro,L.A. and Knight,K.L. TITLE Restricted utilization of germ-line VH gene and diversity of D regions in rabbit splenic Ig mRNA JOURNAL J. Immunol. 144, 1969-1973 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by L.A.DiPietro, 25-OCT-1989. 
FEATURES from to/span description pept 1 > 390 Ig gamma-chain V-D-J precursor sigp 1 57 Ig gamma-chain signal peptide matp 58 > 390 Ig gamma-chain recomb 339 340 V-region end/D-region start recomb 356 357 D-region end/J-region start BASE COUNT 77 a 110 c 120 g 83 t ORIGIN 1 atggagactg ggctgcgctg gcttctcctg gtcgctgtgc tcaaaggtgt ccagtgtcag 61 tcgctggagg agtccggggg tcgcctggtc acgcctggga cacccctgac actcacctgc 121 acagtctctg gattctccct cagtagtcgc tggatgagct gggtccgcca ggctccaggg 181 gaggggctgg aatccatcgg agccattgat actggtggta gcgcatacta cgcgaactgg 241 gtgaaaggcc gactcaccat ctccaaaacc tcgtcgacca cggtggattt gaaaatgacc 301 agtctgacaa ccgaggacac ggccacctat ttctgtgcca gagattatag tggtggactt 361 gacttgtggg gcacaggcac cctggtcacc // LOCUS RABIGHAY 399 bp ss-mRNA MAM 26-MAY-1990 DEFINITION Rabbit Ig rearranged gamma-chain mRNA V-D-J region, clone 3-4. ACCESSION M29418 KEYWORDS diversity exon; gamma-immunoglobulin; immunoglobulin heavy chain; joining exon; processed gene; variable region. SOURCE Rabbit (haplotype b) adult spleen, cDNA to mRNA, clone 3-4. ORGANISM Oryctolagus sp. Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Lagomorpha; Leporidae. REFERENCE 1 (bases 1 to 399) AUTHORS DiPietro,L.A. and Knight,K.L. TITLE Restricted utilization of germ-line VH gene and diversity of D regions in rabbit splenic Ig mRNA JOURNAL J. Immunol. 144, 1969-1973 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by L.A.DiPietro, 25-OCT-1989. 
FEATURES from to/span description pept 1 > 399 Ig gamma-chain V-D-J precursor sigp 1 57 Ig gamma-chain signal peptide matp 58 > 399 Ig gamma-chain recomb 339 340 V-region end/D-region start recomb 364 365 D-region end/J-region start BASE COUNT 83 a 116 c 116 g 84 t ORIGIN 1 atggagactg ggctgcgctg gcttctcctg gtcgctgtgc tcaaaggtgt ccagtgtcag 61 tcgctggagg agtccggggg tcgcctggtc acgcctggga cacccctgac actcacctgc 121 acagcctctg gattcaccat cagtagctac cacatgatct gggtccgcca ggctccaggg 181 gaggggctgg aatacatcgg atggattagt actggtggta gcgcatacta cgcgaactgg 241 gcaaaaggcc gattcaccat ctccagaacc tcgaccacgg tggatctgaa aatgaccagt 301 ctgacaaccg aggacacggc cacctatttc tgttgcagaa ctcctgctgt tagtaaatgg 361 gacttgtggg gcccgggcac cctagtcacc gtctcctca // LOCUS RABIGHAZ 384 bp ss-mRNA MAM 26-MAY-1990 DEFINITION Rabbit Ig rearranged mu-chain mRNA V-D-J region, clone 4-1. ACCESSION M29419 KEYWORDS diversity exon; immunoglobulin heavy chain; joining exon; mu-immunoglobulin; processed gene; variable region. SOURCE Rabbit (haplotype b) adult spleen, cDNA to mRNA, clone 4-1. ORGANISM Oryctolagus sp. Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Lagomorpha; Leporidae. REFERENCE 1 (bases 1 to 384) AUTHORS DiPietro,L.A. and Knight,K.L. TITLE Restricted utilization of germ-line VH gene and diversity of D regions in rabbit splenic Ig mRNA JOURNAL J. Immunol. 144, 1969-1973 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by L.A.DiPietro, 25-OCT-1989. 
FEATURES from to/span description pept 1 > 384 Ig mu-chain V-D-J precursor sigp 1 57 Ig mu-chain signal peptide matp 58 > 384 Ig mu-chain recomb 339 340 V-region end/D-region start recomb 365 366 D-region end/J-region start BASE COUNT 82 a 114 c 113 g 75 t ORIGIN 1 atggagactg ggctgcgctg gcttctcctg gtcgctgtgc tcaaaggtgt ccagtgtcag 61 tcggtggagg agtccggggg tcgcctggtc acgcctggga cacccctgac actcacctgc 121 acagtctctg gaatcgacct cagtggctac cacatgagct gggtccgcca ggctccaggg 181 gaggggctgg aatggatcgg aaccatgagt actactgata acacatatta cgcgagctgg 241 gcaaaaggcc gattcaccat ctccaaaacc tcgaccacgg tggatctgaa aatgaccagt 301 ctgacagccg cggacacggc cacctatttc tgtgccagag gacaggcaac ttttattccc 361 tggggcccag gcaccctggt cacc // LOCUS RABIGHBA 393 bp ss-mRNA MAM 26-MAY-1990 DEFINITION Rabbit Ig rearranged mu-chain mRNA V-D-J region, clone 5-2. ACCESSION M29420 KEYWORDS diversity exon; immunoglobulin heavy chain; joining exon; mu-immunoglobulin; processed gene; variable region. SOURCE Rabbit (haplotype b) adult spleen, cDNA to mRNA, clone 5-2. ORGANISM Oryctolagus sp. Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Lagomorpha; Leporidae. REFERENCE 1 (bases 1 to 393) AUTHORS DiPietro,L.A. and Knight,K.L. TITLE Restricted utilization of germ-line VH gene and diversity of D regions in rabbit splenic Ig mRNA JOURNAL J. Immunol. 144, 1969-1973 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by L.A.DiPietro, 25-OCT-1989. 
FEATURES from to/span description pept 1 > 393 Ig mu-chain V-D-J precursor sigp 1 57 Ig mu-chain signal peptide matp 58 > 393 Ig mu-chain recomb 339 340 V-region end/D-region start recomb 364 365 D-region end/J-region start BASE COUNT 80 a 103 c 118 g 92 t ORIGIN 1 atggagactg ggctgcgctg gcttctcctg gtcgctgtgc tcaaaggtgt ccagtgtcag 61 tcggtggagg agtccggggg tcgcctggtc acgcctggga cacccctgac actcacctgc 121 acagtctctg gaatcgacct cagtagcttt gcaatggcct gggttcgcca ggctccaggg 181 aaggggctgg agtggatcgg aatcattaat ggttatggta ctacatacta cgcgagctgg 241 gtgaatggcc gattcaccat ctccaaaacc tcgacctcgg tggatctgaa aatgaccagt 301 ctgacaaccg aggacacggc cacctatttc tgtgtcagat atcttagtga tggttggtat 361 ctagacttgt ggggccaagg caccctggtc acc // LOCUS RABIGHBB 375 bp ss-mRNA MAM 26-MAY-1990 DEFINITION Rabbit Ig rearranged mu-chain mRNA V-D-J region, clone 7-2. ACCESSION M29421 KEYWORDS diversity exon; immunoglobulin heavy chain; joining exon; mu-immunoglobulin; processed gene; variable region. SOURCE Rabbit (haplotype b) adult spleen, cDNA to mRNA, clone 7-2. ORGANISM Oryctolagus sp. Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Lagomorpha; Leporidae. REFERENCE 1 (bases 1 to 375) AUTHORS DiPietro,L.A. and Knight,K.L. TITLE Restricted utilization of germ-line VH gene and diversity of D regions in rabbit splenic Ig mRNA JOURNAL J. Immunol. 144, 1969-1973 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by L.A.DiPietro, 25-OCT-1989. 
FEATURES from to/span description pept < 1 > 375 Ig mu-chain V-D-J precursor (AA at 1) sigp 1 57 Ig mu-chain signal peptide matp 58 > 375 Ig mu-chain recomb 339 340 V-region end/D-region start recomb 361 362 D-region end/J-region start BASE COUNT 82 a 98 c 112 g 83 t ORIGIN 1 gtgctcaaag gtgtccagtg tcagtcggtg gaggagtccg ggggtcgcct ggtcacgcct 61 gggacacccc tgacactcac ctgcacagtc tctggattct ccctcaataa ttatgcaatg 121 ggctgggtcc gccaggctcc agggaagggg ctagaatgga tcggaaccat tggtactggt 181 ggtagcgtat actacgcgaa ctgggcaaaa ggccgattca ccatctccag aacctcgacc 241 acggtggatc tgaaaatgac cagtctgaca accgaagaag gacacgccac ctatttctgt 301 gccagagtgg ctggtggtac tgtttttggc tatgtggggt actttaactt gtggggccaa 361 ggcaccctgg tcacc // LOCUS PHVARCA 902 bp ss-mRNA PLN 26-MAY-1990 DEFINITION P.vulgaris arcelin 2 mRNA, complete cds. ACCESSION M28470 KEYWORDS arcelin. SOURCE P.vulgaris, cDNA to mRNA, clone pARC2-11 and pARC2-191. ORGANISM Phaseolus vulgaris Eukaryota; Plantae; Embryobionta; Magnoliophyta; Magnoliopsida; Rosidae; Rosales; Fabaceae. REFERENCE 1 (bases 1 to 902) AUTHORS John,M.E. and Long,C.M. TITLE Sequence analysis of arcelin 2: A lectin-like plant protein JOURNAL Gene 86, 171-176 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable copy of sequence [1] kindly submitted by M.E.John, 29-SEP-1989. 
FEATURES from to/span description pept 1 798 arcelin 2 BASE COUNT 249 a 282 c 164 g 207 t ORIGIN 1 atggcttcct ccaacttact caccctagcc ctcttccttg tgcttctcac ccacgcaaac 61 tcaagcaacg acgcctcctt caacgtcgag acgttcaaca aaaccaacct catcctccaa 121 ggcgatgcca ccgtctcatc cgaaggccac ttactactaa ccaatgttaa aggcaacgaa 181 gaggactcta tgggccgcgc cttctactcc gcccccatcc aaatcaatga cagaaccatc 241 gacaacctcg ccagcttctc caccaacttc acattccgta tcaacgctaa gaacaatgaa 301 aattccgcct atggccttgc ctttgctctc gtccccgtcg gctctcggcc caaacttaaa 361 ggccgttatc taggtctttt caacacagcc aactacgacc gcgacgccca tactgtggct 421 gtggtgttcg acaccgtcag caaccgtatt gaaatcgacg tgaactccat ccggcctatc 481 gcaacggagt cttgcaattt cggccacaac aacggagaaa aggccgaggt tcggatcacc 541 tattactccc ccaagaacga cttgagggtt tctctgcttt acccttcttc ggaagaaaag 601 tgccacgtct ctgccacagt gccgctggag aaagaagttg aggactgggt gagcgttggg 661 ttctctgcca cctcagggtc gaaaaaagag accactgaaa cgcacaacgt cctctcttgg 721 tctttttctt ccaacttcat caattttgag ggcaaaaaat ctgaacgttc caacatcctc 781 ctcaacaaga tcctctagac tcccaaagcc agcttcactg tgacagtaaa accttcctta 841 tacgctaata atgttcatct gtcacacaaa ctacaataaa taaaatggga gcaataaata 901 aa // LOCUS DROGOALA 2204 bp ss-mRNA INV 26-MAY-1990 DEFINITION Drosophila melanogaster G-o-alpha-like protein, clone lambda-DGo59. ACCESSION M29731 J05089 KEYWORDS G protein; guanine nucleotide-binding protein. SOURCE D.melanogaster adult head cDNA to mRNA, clone lambda-DGo59. ORGANISM Drosophila melanogaster Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta; Pterygota; Neoptera; Holometabola; Diptera; Brachycera; Cyclorrhapha; Schizophora; Drosophiloidea; Drosophilidae. REFERENCE 1 (bases 1 to 2204) AUTHORS Thambi,N.C., Quan,F., Wolfgang,W.J., Spiegel,A. and Forte,M. TITLE Immunological and molecular characterization of G-o-alpha-like proteins in the Drosophila central nervous system JOURNAL J. Biol. Chem. 
264, 18552-18560 (1989) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by N.Thambi 04-OCT-1989. FEATURES from to/span description pept 166 1230 G-o-alpha-like protein BASE COUNT 757 a 493 c 459 g 495 t ORIGIN 1 gaattccgtg ctcggcaagt gcaacgttga aatcgttaaa ctgtacataa gcaaataaga 61 cataaagaaa aaagtccagg aaaattggaa aacaaaagcc cgaaaaccga aaagccccgt 121 gtaaatccga atccgaatcc aaatcagtat ccaaacccaa ccacaatggg ctgcaccaca 181 tccgccgaag aacgcgccgc catccagcga tccaaacaga tcgagaagaa tctaaaggag 241 gatggaatcc aggcggccaa ggacatcaag ctcctgctgc tgggtgccgg tgagtcgggc 301 aagagcacaa tagtcaaaca gatgaaaatc attcacgaga gcggcttcac tgcggaggac 361 tttaaacaat atcgaccggt tgtctacagc aacacaatac aatcattagt tgcaatattg 421 cgcgcgatgc caaccctaag tattcagtac agcaataacg agcgggagag cgatgccaag 481 atggtgttcg acgtatgcca acgcatgcac gacaccgagc ccttctcgga ggagctgctg 541 gccgccatga aacgcctctg gcaggacgcc ggtgtccagg agtgcttctc gcgcagcaac 601 gaataccaac taaacgattc cgcaaaatat ttcctggacg atttggatcg gttaggcgcc 661 aaggattacc agccaactga acaagatatc ttgcgcactc gcgtcaagac cactggcatc 721 gttgaggtac acttctcctt caaaaacctc aactttaaat tgtttgacgt gggcggtcag 781 cgctcggaac gtaagaaatg gatacactgc ttcgaagatg tcacggcgat cattttctgc 841 gtggccatgt ccgagtacga tcaagtcttg catgaggatg aaaccacgaa ccgcatgcaa 901 gagtcgctga aactgtttga ctcgatctgt aacaacaaat ggttcacgga cacctcgatt 961 attctatttc tgaacaagaa ggatttgttc gaggagaaga ttcgcaagag tcccctgacg 1021 atttgcttcc ccgaatacac aggtggacag gagtacggcg aggcggctgc ttacattcag 1081 gctcaatttg aagcgaaaaa caaatcaacc tcaaaagaaa tctactgcca catgacgtgt 1141 gccacagata ccaataacat tcagtttgta ttcgatgctg tcaccgatgt catcatagca 1201 aacaacctgc gcggctgtgg actgtactaa gatggattcc aggccggatc ccgacgatgt 1261 cgacgtccga gtcgatattg atgacgatga cgattatgtg gagcagaatg ggggcgttac 1321 gagggaacac cgtaacggta ttaaagagca gcgcggagca caacaaccca ccagcattga 1381 tcaaaaaacc aaacaattta ggagcagatg atagaaccaa ccaacaaacc aaccgcaaac 1441 cacacagaaa acataggaca ctgaacaagc aaagcccaaa 
gaacttttat ttgtttaaca 1501 aaaaaacggc ggacggacgg aaatcccgaa tggatgttat agggaaaatg agcgacaagt 1561 acattacata atatcgataa tattgaagca gatgcagatg caaatacaca caatgctaat 1621 gatgatcagg gcgactatga ctaaatgagg cagcaggcaa ctgacactgg gacacgcgat 1681 taaagtcaca tctgaaaaaa ggcagttgat tgaaaggcat ttctatatac aaacatatac 1741 aaacacatac atatgcatta tgcaaagcca catgtacgac atgacactaa cacactcaca 1801 cgacaaacac aagcgccaac attgcataca gttgttgttt ggtctgaata atttttatag 1861 aatttcataa tttatgtgta gtttagtttc ctcatgtatt tattaaaaca aaaaccaaac 1921 gagcgtatat ctacatatac cgcatatata tatatacata cacttctata catatatata 1981 tatatatata catatatata aatattatat attaaatgtt tcctgttgca atctctcttt 2041 aaaattattc atgccatcaa cgctctgcat ttgtcatgct tgtttagact taagttcgaa 2101 agtttcaaca aaatccagcg tcaaaggaaa tatcaatatt catttgattg agtgtcagcg 2161 tgtggtctaa agtaaatata taaaataaca aaccaaaaaa aaaa // LOCUS DROGOALB 2558 bp ss-mRNA INV 26-MAY-1990 DEFINITION Drosophila melanogaster G-o-alpha-like protein, clone lambda-DGo21. ACCESSION M29732 J05089 KEYWORDS G protein; guanine nucleotide-binding protein. SOURCE D.melanogaster adult head cDNA to mRNA, clone lambda-DGo21. ORGANISM Drosophila melanogaster Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta; Pterygota; Neoptera; Holometabola; Diptera; Brachycera; Cyclorrhapha; Schizophora; Drosophiloidea; Drosophilidae. REFERENCE 1 (bases 1 to 2558) AUTHORS Thambi,N.C., Quan,F., Wolfgang,W.J., Spiegel,A. and Forte,M. TITLE Immunological and molecular characterization of G-o-alpha-like proteins in the Drosophila central nervous system JOURNAL J. Biol. Chem. 264, 18552-18560 (1989) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by N.Thambi 04-OCT-1989. 
FEATURES from to/span description pept 520 1584 G-o-alpha-like protein BASE COUNT 867 a 557 c 530 g 604 t ORIGIN 1 gaattccggt tgcctatttc tctcgcttac ctatttattt agcatacatt ttccaagcat 61 cctgtgaaaa aaccatcaca agttttcctt cgaacggaat gccaagtgca ttctggaagg 121 aaatcgttgt acatctacat aatgccaata aagaaaatgt aactaaagta aaaaaaaaaa 181 aaaagagcta aaccgttaaa ttaaagtttt aaagttaaaa aaacgctgaa taagtgttaa 241 atatatataa caaaaatatt gttgaattga agaaaaccaa agttcaaaaa cctgaaaaaa 301 ccataaagaa gtgattgaaa aatcagttga agtgccgtac tgaaaattaa agtccagtga 361 cacgatcgaa tccctcggat agcggagtta gtttagcccc ccgaattcga gtccccgcac 421 gttgtacacc tggtttttct cgctggcaac gtagtcggcc attgagttgg ccgataccaa 481 acgaccttca aaacgttttg cgtcgaggca atacgcacca tgggctgcgc acagtctgcc 541 gaggagcgag ccgcagccgc caggagtcgc ctcatcgagc gcaatctaaa ggaggatgga 601 atccaggcgg ccaaggacat caagctcctg ctgctgggtg ccggtgagtc gggcaagagc 661 acaatagtca aacagatgaa aatcattcac gagagcggct tcactgcgga ggactttaaa 721 caatatcgac cggttgtcta cagcaacaca atacaatcat tagttgcaat attgcgcgcg 781 atgccaaccc taagtattca gtacagcaat aacgagcggg agagcgatgc caagatggtg 841 ttcgacgtat gccaacgcat gcacgacacc gagcccttct cggaggagct gctggccgcc 901 atgaaacgcc tctggcagga cgccggtgtc caggagtgct tctcgcgcag caacgaatac 961 caactaaacg attccgcaaa atatttcctg gacgatttgg atcggttagg cgccaaggat 1021 taccagccaa ctgaacaaga tatcttgcgc actcgcgtca agaccactgg catcgttgag 1081 gtacacttct ccttcaaaaa cctcaacttt aaattgtttg acgtgggcgg tcagcgctcg 1141 gaacgtaaga aatggataca ctgcttcgaa gatgtcacgg cgatcatttt ctgcgtggcc 1201 atgtccgagt acgatcaagt cttgcatgag gatgaaacca cgaaccgcat gcaagagtcg 1261 ctgaaactgt ttgactcgat ctgtaacaac aaatggttca cggacacctc gattattcta 1321 tttctgaaca agaaggattt gttcgaggag aagattcgca agagtcccct gacgatttgc 1381 ttccccgaat acacaggtgg acaggagtac ggcgaggcgg ctgcttacat tcaggctcaa 1441 tttgaagcga aaaacaaatc aacctcaaaa gaaatctact gccacatgac gtgtgccaca 1501 gataccaata acattcagtt tgtattcgat gctgtcaccg atgtcatcat agcaaacaac 1561 ctgcgcggct gtggactgta ctaagatgga ttccaggccg gatcccgacg 
atgtcgacgt 1621 ccgagtcgat attgatgacg atgacgatta tgtggagcag aatgggggcg ttacgaggga 1681 acaccgtaac ggtattaaag agcagcgcgg agcacaacaa cccaccagca ttgatcaaaa 1741 aaccaaacaa tttaggagca gatgatagaa ccaaccaaca aaccaaccgc aaaccacaca 1801 gaaaacatag gacactgaac aagcaaagcc caaagaactt ttatttgttt aacaaaaaaa 1861 cggcggacgg acggaaatcc cgaatggatg ttatagggaa aatgagcgac aagtacatta 1921 cataatatcg ataatattga agcagatgca gatgcaaata cacacaatgc taatgatgat 1981 cagggcgact atgactaaat gaggcagcag gcaactgaca ctgggacacg cgattaaagt 2041 cacatctgaa aaaaggcagt tgattgaaag gcatttctat atacaaacat atacaaacac 2101 atacatatgc attatgcaaa gccacatgta cgacatgaca ctaacacact cacacgacaa 2161 acacaagcgc caacattgca tacagttgtt gtttggtctg aataattttt atagaatttc 2221 ataatttatg tgtagtttag tttcctcatg tatttattaa aacaaaaacc aaacgagcgt 2281 atatctacat ataccgcata tatatatata catacacttc tatacatata tatatatata 2341 tatacatata tataaatatt atatattaaa tgtttcctgt tgcaatctct ctttaaaatt 2401 attcatgcca tcaacgctct gcatttgtca tgcttgttta gacttaagtt cgaaagtttc 2461 aacaaaatcc agcgtcaaag gaaatatcaa tattcatttg attgagtgtc agcgtgtggt 2521 ctaaagtaaa tatataaaat aacaaaccaa aaaaaaaa // LOCUS MTYCLCGA 6319 bp ss-RNA VRL 26-MAY-1990 DEFINITION Turnip yellow mosaic virus Club Lake isolate, complete genome. ACCESSION J04373 KEYWORDS complete genome; nucleotide binding protein; replicase; virion protein. SOURCE Turnip yellow mosaic virus Club Lake isolate cDNA to viral RNA. ORGANISM Turnip yellow mosaic virus Viridae; ss-RNA nonenveloped viruses; Isometric ss-RNA viruses; Tymovirus. REFERENCE 1 (bases 1 to 6319) AUTHORS Keese,P., Mackenzie,A. and Gibbs,A. TITLE Nucleotide sequence of the genome of an Australian isolate of turnip yellow mosaic tymovirus JOURNAL Virology 172, 536-546 (1989) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by A.Gibbs, 04-AUG-1989. 
FEATURES from to/span description pept 96 5630 replicase polyprotein pept 89 1975 Unknown protein pept 5645 6214 virion protein BASE COUNT 1461 a 2426 c 1061 g 1371 t ORIGIN 1 gtaatcaact accaattcca gctctctttt gacaactggt cttataccaa ctttccgtac 61 acttgcaacc ctcgtaagac aattgcaaat gagtaatggc cttccaatta gcattggacg 121 cccttgcacc cacgactcac agagatccct ctctgcatcc gattctcgaa tccacagtag 181 attcgattcg ctcctcgata cagacctacc catggtccat tccgaaggaa cttctgcccc 241 tactcaactc ctacggcatc ccaacatctg gtttgggaac atcccaccac ccccacgccg 301 cccacaagac aatcgagact tttctccttt gcacccactg gtctttccag gccaccactc 361 ccagctccgt catgttcatg aaacccagca agttcaacaa acttgcccag gtgaactcaa 421 actttcggga attgaagaac taccgcctgc accccaacga cagcactcgt taccccttca 481 catcaccaga ccttcccgtt ttccccacca ttttcatgca cgacgccctg atgtactacc 541 atccctccca gatcatggac ctgttcttgc agaaaccaaa cctcgaacgt ctgtacgcca 601 gcctcgtagt accacccgag gcccatcttt ccgaccaatc cttcttcccg aagttgtaca 661 cgtacacgac gacccgccac actcttcact acgtcccgga aggtcacgaa gccggcagct 721 acaaccaacc atccgacgcc cactcttggc tccgaatcaa ttcaattcgc ctcggcaacc 781 accacctctc agtgacgatc ctggaatcct ggggccctgt ccactcgctc ctaattcaac 841 gagggacccc cccccccgac ccatcactcc aggccccttc aacacccatg gcgtccgacc 901 tctttcggtc ttaccaagag ccccgcctcg acgtggtctc cttccgaatc ccagacgcca 961 tcgaacttcc acaggccaca ttccttcaac aaccgcttcg agaccgactg gtcccccgag 1021 ccgtctacaa cgccctgttc acctacacca gagcggtccg cacactccgg acttcagacc 1081 cagcggcatt cgtaaggatg cattcctcca aaccggacca cgattgggtc acctcgaacg 1141 cctgggacaa tctgcagacc ttcgcacttc tgaacgtacc ccttcgacca aacgtcgtct 1201 accacgtcct tcagagccca attgcctccc tagctcttta cctgaggcaa cattggcgcc 1261 gtcttaccgc caccgccgtt cccatcctct ccttcctaac cctcctgcag cgcttcctcc 1321 cattgcctat acctctggca gaggtaaaat ccatcacagc cttccgaagg gagctctacc 1381 gaaagaaggc cccccaccac cccctcgacg tcttccatct ccagcaacac ctccgcaatc 1441 accactccgc gatctcggcc gtacgcccag cttccccacc ccaccaaaga cttccacacg 1501 cgctccagaa agctgcattg ctgctcctcc gaccgatatc gcccctcttg acagcgaccc 
1561 cgttctttcg gtccgaacag aagtccatgc tcccgaacgc cgaactttca tggaccctga 1621 agcgcttcgc gctgccttgg caggcctccc tagtcctcct ctctctgtcg gaatcatccg 1681 tactgcttca caaactgttc tccccaccaa ctctccaagc ccaacacgac acctaccacc 1741 gacatcttca ccctggatcc tacagtctcc agtgggagag gacgccattg tcgattccga 1801 ggacgacagc atttcttcct ttcactccca cgacttcaac agcccctccg gaccactccg 1861 aagccagtct ccctcccgct ttcgcctcca cctccgttcc ccgtccacct ccagtggcat 1921 cgagccttgg agcccagcct cctacgacta cggcagcgcc cccgacaccg attgaaccca 1981 cccagcgcgc tcatcaaaat tctgacctca cgcttgaaag ttcaacccca attgaacccc 2041 ccccaccccc catccaatcc tccgacatcc cgccttccgc ccccgttctt ttcccagaaa 2101 tcaactcacc gcatcgtttt tcccccaaac ttcccaccac acccgatttc gaacccaccc 2161 gcacttcacc ccctccttcc acttcgcatc aagattcgac tgaccccgcg gaccccctga 2221 tgggctccca ccttctgcac cattcactac ctgcacctcc cacccacccg cttcaatctt 2281 cacagctctt gcccgcacct ttgacaaacg accccaccgc gatcggcccg gtactcccct 2341 ttgaagaact ccacccacgc aggtaccccg aaaacaccgc cactttcctc acgaggctcc 2401 gttcacttcc ttcaaaccat ctaccacaac ccaccctgaa ttgtctcctc tctgctgtct 2461 ccgaccaaac caaggtttcc gaggatcacc tctgggagtc cctacagaca attctcccag 2521 acagccaact caggaacgaa gagatcaact ctctcgggct ttcaactgaa cacctcactg 2581 cgttggccca tctttacaac ttccaggcaa ccatctactc cgatcgtggt cccatcctct 2641 tcggcccatc cgacaccatt aagagaatcg acatcaccca caccaccgga ccgccatccc 2701 acttttcacc cggcaaaaga cttttaggca gccaaccctc agctaagggc catccctccg 2761 actcactcat cagagccatg aagtctttca aagtatccgg caactacctt cccttctctg 2821 aggcccacaa ccatcccacc tccatctcac atgccaagaa cttggtttca aacatgaaga 2881 atggattcga cggcatcctc tcccttctcg acgtctccac aggccaacga accggaccca 2941 cccccaaaga cgcgatcatt cagatagacc actacctcga caccaacccc ggcaaaacca 3001 cccctgtggt gcattttgct ggtttcgctg gctgtggaaa gacatatccg atccaacagc 3061 tccttaaaac taaactgttc aaagactttc gggtctcctg ccccaccaca gaactcagaa 3121 ccgaatggaa gactgcgatg gaacttcatg gctcccagtc atggcgcttt aacacttggg 3181 agtcttccat tctcaagtca tccagaattc tggtcatcga tgaaatctac aaaatgccaa 3241 
gagggtacct cgacctttcc attctcgctg accccgccct cgaactcgtc ataattctcg 3301 gtgatcctct ccagggcgag taccactctc aatccaaaga ctcatccaat caccgccttc 3361 cctccgaaac tctcaggctg ctaccataca ttgacatgta ctgctggtgg agttatcgca 3421 ttccccaatg tatcgcccga ctcttccaaa ttcacagctt caatgcctgg cagggaatca 3481 tcggctccgt ttcaactccc caggatcaat cccccgttct caccaacagt catgcctcat 3541 ctctcacctt caacagcctg ggatatcgct cctgcacgat cagctctagc caaggcctca 3601 cattctgcga ccctgccatc atcgtcctgg acaactacac caagtggctc tcctcggcca 3661 acggcctcgt cgccctcacc cgatccagat caggtgtcca attcatgggc ccctcttcct 3721 atgtcggggg aaccaacggc tcttctgcca tgttttctga cgccttcaac aacagcctca 3781 tcatcatgga tcgctacttc ccatccctgt tcccacaact caagctcatc acctcccccc 3841 tcacaactcg cagccccaaa ctcaacgggg ccacccccag cgcatctccc acccatcgct 3901 cgccaaactt ccacctcccc ccacacattc ccctctctta tgatcgtgat ttcgtcacgg 3961 tcaacccaac tctccctgat cagggacccg aaacaagact cgacacccac ttcctcccac 4021 cttctcggct cccgcttcat ttcgatctcc caccagctat cacccccccc ccgatttcca 4081 caagcgtcga cccgccacaa gctaaagcta gccccgtcta tccaggcgag ttcttcgatt 4141 ctctggcggc gttcttctta ccagcacacg acccatcaac aagggaagta ctccacaaag 4201 atcaatctag caaccagttc ccttggttcg accgaccctt cagcttgtcc tgccagccct 4261 caagtttaat ttctgccaag catgcaccca accacgatcc gacccttctg cctgcctcca 4321 tcaataaacg cttgcgattc agacccagtg aagcaccgca ccaaatcacc gcagacgacg 4381 tggtcctagg cctgcaactc ttccactctc tctgccgcgc ctactcacgt caacccaaca 4441 tcaccgttcc attcaaccct gaacttttcg cagaatgtat ctctctgaat gaatacgcgc 4501 agctcagttc caaaacccaa tccaccatag tggccaacgc ttcacgctcc gacccagact 4561 ggcgacacac caccgtcaag atttttgcga aagctcaaca caaagtcaac gacggctcca 4621 tcttcggttc atggaaggcc tgccaaactc tcgcactcat gcatgattac gtaattctgg 4681 ttcttggacc cgtcaagaaa tatcaaagaa tcttcgacaa cgttgatcgg ccatctcaca 4741 tctactcaca ctgcggcaag acacccaacc aacttcgaga ttggtgccag gaacatctca 4801 ctcattccac cccaaaaatc gcaaacgact acaccgcctt cgaccaatcc cagcatggag 4861 aatccgtggt tcttgaagcc ctcaaaatga agagactgaa cattccgagc catttgattc 4921 agctccatgt 
ccacctcaag accaacgtct ccacccagtt cggccccctc acatgcatgc 4981 gcctgaccgg ggaacccgga acctacgacg acaacactga ctacaacctc gcagtcatct 5041 actctcagta tgacgttggt tcctgcccca tcatggtctc tggcgacgac tcactcatag 5101 accaccctct tcccactcgc cacgactggc cctctgttct caaacgcctc cacctccgct 5161 ttaaacttga actcacttct catcccctct tttgtggcta ctacgtcggt ccagcaggct 5221 gcatccgcaa ccccttggcc cttttctgca agctcatgat cgcagtggac gatgacgccc 5281 tcgacgaccg acgactcagc tacctcaccg agttcaccac cggacacctc cttggcgaat 5341 cactatggca cctcctccct gaaacccacg tccagtatca gtcagcttgc tttgacttct 5401 tctgcagacg ttgcccaaaa cacgagaaga tgctcctcga tgattccaca cccacactca 5461 gcctcctcga acgaatcact tcttcaccga ggtggctcac caagaacgcc atgtacctcc 5521 tccccgccaa gctcagactg gctatcacct ctctgtctca aacgcaatct ttcccagaat 5581 ccattgaggt ttcccacgct gagtctgaat tgcttcacta tgtccaatag caatcagccc 5641 cgacatggaa atcgacaaag aactcgcccc ccaagaccgc accgtcaccg tcgccaccgt 5701 tttaccgact gtccccggcc cctcaccttt caccatcaaa caaccgttcc agtctgaagt 5761 tctgtttgct gggaccaaag atgccgaggc ctctctcacc atcgccaaca tcgacagcgt 5821 ttccaccctc accaccttct atcgtcatgc ctctctggaa tcactctggg tcaccatcca 5881 tcctaccttg caagccccag ctttcccgac cacggttggc gtttgctggg tacccgccaa 5941 ctccccagtc actcccaccc aaatcaccaa gacctacggc ggccagatct tctgcattgg 6001 aggcgccatc aacactctct cacccctcat tgtcaagtgc ccacttgaaa tgatgaaccc 6061 ccgggtcaaa gattcaattc aataccttga ctcgcccaaa ctcctcatct ccatcaccgc 6121 tcaacccacc gctccccccg catcgacctg cataataact gtatcaggaa ctctctcgat 6181 gcattctccg ctcatcacgg acacttccac ctaagttctc gatctttaaa atcgttagct 6241 cgccagttag cgaggtctgt ccccacacga cagataatcg ggtgcaactc ccgccccttt 6301 tccgagggtc atcggaacc // LOCUS RATTH2BAA 181 bp ds-DNA ROD 26-MAY-1990 DEFINITION Rat TH2B gene promoter region. ACCESSION M33578 KEYWORDS H2B histone; histone; transcription regulatory element. SOURCE Rat DNA. ORGANISM Rattus rattus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. 
REFERENCE 1 (bases 1 to 181) AUTHORS Hwang,I., Lim,K. and Chae,C.-B. TITLE Characterization of the S-phase-specific transcription regulatory elements in a DNA replication-independent testis-specific H2B (TH2B) histone gene JOURNAL Mol. Cell. Biol. 10, 585-592 (1990) STANDARD simpl staff_entry FEATURES from to/span description mRNA 161 > 181 H2B histone mRNA signal 61 68 octamer signal signal 110 115 hexamer signal BASE COUNT 45 a 47 c 32 g 57 t ORIGIN 1 acctgattgg ctgattggtg atgaattaac caatcagaaa gcaccacttg aattcccctt 61 atttgcatac aaggaacatt tattgtccaa tcatctttcg cgtgctcata cgtcatccaa 121 ggcccacgcc tataaatacc tctcttcttg gccttcaagc ggtgtgtttt ctcagcagtt 181 g // LOCUS TCVDIGAA 347 bp ss-RNA VRL 26-MAY-1990 DEFINITION Turnip crinkle virus defective interfering RNA. ACCESSION M29290 KEYWORDS defective interfering RNA. SOURCE Turnip crinkle virus cDNA to RNA. ORGANISM Turnip crinkle virus Viridae; ss-RNA nonenveloped viruses; Isometric ss-RNA viruses; Tombusvirus. REFERENCE 1 (bases 1 to 347) AUTHORS Li,X.H., Heaton,L.A., Morris,T.J. and Simon,A.E. TITLE Turnip crinkle virus defective interfering RNAs intensify viral symptoms and are generated de novo JOURNAL Proc. Natl. Acad. Sci. U.S.A. 86, 9173-9177 (1989) STANDARD full staff_entry COMMENT Draft entry and printed sequence for [1] kindly submitted by A.E.Simon, 20-OCT-1989. FEATURES from to/span description RNA 1 347 defective interfering RNA BASE COUNT 90 a 98 c 86 g 73 t ORIGIN 1 gggataaaaa aggaggctta ccaaccttct ctctattcac gatgcctctt ctacacacac 61 tcaaaacagc gctcgcagtg ggactccttg gagccaggta ctaccccgaa ggttcaaaac 121 caagaccccc aagtcgcttt actttgagat gtgttagaaa gccccaaggt cattttactt 181 tgacctgtgt tagagaccca aaacggtggc agcactgtct agctgcgggc attagactgg 241 aaaactagtg ctctctgggt aaccactaaa atcccgaaag ggtgggctag tggcgaccct 301 ccgaactaaa agacagcctc cctcctcgcg gggggggggg cctgccc //