Path: utzoo!attcan!uunet!jarthur!usc!snorkelwacker!bionet!root
From: GenBank-Updates@genbank.bio.net
Newsgroups: bionet.molbio.genbank.updates
Subject: Database Update
Message-ID:
Date: 26 Jul 90 12:00:34 GMT
Sender: root@genbank.BIO.NET
Distribution: bionet
Lines: 6340
Approved: lear@genbank.bio.net

Checksum: 12487 375

LOCUS       ATUNPSS      6425 bp ds-DNA             SYN       26-JUL-1990
DEFINITION  A.tumefaciens neomycin phosphotransferase and
            streptomycin/spectinomycin adenyltransferase, complete cds.
ACCESSION   M35007
KEYWORDS    neomycin phosphotransferase; streptomycin/spectinomycin
            adenyltransferase.
SOURCE      N.tabacum T-DNA inserts in A.tumefaciens DNA.
  ORGANISM  Cloning vector
            Artificial sequences; Cloning vehicles.
REFERENCE   1  (bases 1 to 6425)
  AUTHORS   Gheysen,G.D.R., Herman,L., Breyne,P., Gielen,J., Van Montagu,M.
            and Depicker,A.
  TITLE     Cloning and sequence analysis of truncated T-DNA inserts from
            Nicotiana tabacum
  JOURNAL   Gene (1990) In press
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly
            submitted by G.D.R.Gheysen, 01-JUN-1990.
FEATURES from to/span description pept 2782 1985 (c) neomycin phosphotransferase pept 4983 4042 (c) streptomycin/spectinomycin adenyltransferase recomb 24 25 T-DNA end/plant DNA start recomb 40 41 plant DNA end/T-DNA start recomb 1094 1095 plant DNA end/T-DNA start recomb 2786 2787 T-DNA end/plant DNA start recomb 3044 3045 T-DNA end/plant DNA start recomb 3354 3355 plant DNA end/T-DNA start recomb 5585 5586 T-DNA end/plant DNA start recomb 6389 6390 T-DNA end/plant DNA start site 1 24 left T-DNA border site 372 395 24 bp border-like sequence site 1569 1592 24 bp border-like sequence site 1669 1692 24 bp border-like sequence site 1779 1756 (c) 24 bp border-like sequence site 2128 2105 (c) 24 bp border-like sequence site 2449 2472 24 bp border-like sequence site 2485 2462 (c) 24 bp border-like sequence site 3660 3683 24 bp border-like sequence site 3875 3898 24 bp border-like sequence site 4359 4336 (c) 24 bp border-like sequence site 5868 5891 24 bp border-like sequence BASE COUNT 1509 a 1754 c 1710 g 1452 t ORIGIN 1 cggcaggata tattcaattg taaatggctt catgtccggg aaatctacat ggatcagcaa 61 tgagtatgat ggtcaatatg gagaaaaaga aagagtaatt accaattttt tttcaattca 121 aaaatgtaga tgtccgcagc gttattataa aatgaaagta cattttgata aaacgacaaa 181 ttacgatccg tcgtatttat aggcgaaagc aataaacaaa ttattctaat tcggaaatct 241 ttatttcgac gtgtctacat tcacgtccaa atgggggctt agatgagaaa cttcacgatc 301 gatgccttga tttcgccatt cccagatacc catttcatct tcagattggt ctgagattat 361 gcgaaaatat acactcatat acataaatac tgacagtttg agctaccaat tcagtgtagc 421 ccattacctc acataattca ctcaaatgct aggcagtctg tcaactcggc gtcaatttgt 481 cggccactat acgatagttg cgcaaatttt caaagtcctg gcctaacatc acacctctgt 541 cggcggcggg tcccatttgt gataaatcca ccatcacaat agatagtcta atggacgaaa 601 aaggcgaata tttcgatgct gagattcgac gcaattaatt cgagaaaaat cccgtgattg 661 atgctgttga gttaccaata atatgggcag cgaaggccat ttaattataa gatcctgcaa 721 gcctcgtcgt cctggccgga ccacgctatc tgtgcaaggt ccccggcccc ggacgcgcgc 781 tccatgagca gagcgcccgc cgccgaggcg aagagtcggg cggcgccctg 
cccgtcccac 841 caggtcaaca ggcggtaacc ggcctcttca tcgggaatgc gcgcgacctt cagcatcgcc 901 ggcatgtccc cctggcggac gggaagtatc cagctcgacc aaagcggcca tcgtgcctcc 961 ccactcctgc agttcggggg catggatgcg cggatagccg ctgctggttt cctggatgcc 1021 gacggatttg cactgccggt agaactccgc gaggtcgtcc agcctcaggc agcagctgaa 1081 ccaactcgcg aggggatcga gcccctgctg agcctcgaca tgttgtcgca aaattcgccc 1141 tggacccgcc caacgatttg tcgtcactgt caaggtttga cctgcacttc atttggggcc 1201 cacatacacc aaaaaaatgc tgcataattc tcggggcagc aagtcggtta cccggccgcc 1261 gtgctggacc gggttgaatg gtgcccgtaa ctttcggtag agcggacggc caatactcaa 1321 cttcaaggaa tctcacccat gcgcgccggc ggggaaccgg agttcccttc agtgaacgtt 1381 attagttcgc cgctcggtgt gtcgtagata ctagcccctg gggccttttg aaatttgaat 1441 aagatttatg taatcagtct tttaggtttg accggttctg ccgctttttt taaaattgga 1501 tttgtaataa taaaacgcaa ttgtttgtta ttgtggcgct ctatcataga tgtcgctata 1561 aacctattca gcacaatata ttgttttcat tttaatattg tacatataag tagtagggta 1621 caatcagtaa attgaacgga gaatattatt cataaaaata cgatagtaac gggtgatata 1681 ttcattagaa tgaaccgaaa ccggcggtaa ggatctgagc tacacatgct caggtttttt 1741 acaacgtgca caacagaatt gaaagcaaat atcatgcgat cataggcgtc tcgcatatct 1801 cattaaagca gggggtgggc gaagaactcc agcatgagat ccccgcgctg gaggatcatc 1861 cagccggcgt cccggaaaac gattccgaag cccaaccttt catagaaggc ggcggtggaa 1921 tcgaaatctc gtgatggcag gttgggcgtc gcttggtcgg tcatttcgaa ccccagagtc 1981 ccgctcagaa gaactcgtca agaaggcgat agaaggcgat gcgctgcgaa tcgggagcgg 2041 cgataccgta aagcacgagg aagcggtcag cccattcgcc gccaagctct tcagcaatat 2101 cacgggtagc caacgctatg tcctgatagc ggtccgccac acccagccgg ccacagtcga 2161 tgaatccaga aaagcggcca ttttccacca tgatattcgg caagcaggca tcgccatggg 2221 tcacgacgag atcctcgccg tcgggcatgc gcgccttgag cctggcgaac agttcggctg 2281 gcgcgagccc ctgatgctct tcgtccagat catcctgatc gacaagaccg gcttccatcc 2341 gagtacgtgc tcgctcgatg cgatgtttcg cttggtggtc gaatgggcag gtagccggat 2401 caagcgtatg cagccgccgc attgcatcag ccatgatgga tactttctcg gcaggagcaa 2461 ggtgagatga caggagatcc tgccccggca cttcgcccaa tagcagccag tcccttcccg 
2521 cttcagtgac aacgtcgagc acagctgcgc aaggaacgcc cgtcgtggcc agccacgata 2581 gccgcgctgc ctcgtcctgc agttcattca gggcaccgga caggtcggtc ttgacaaaaa 2641 gaaccgggcg cccctgcgct gacagccgga acacggcggc atcagagcag ccgattgtct 2701 gttgtgccca gtcatagccg aatagcctct ccacccaagc ggccggagaa cctgcgtgca 2761 atccatcttg ttcaatccac atgatcagat ctctaggcgc gtgggtgcgg acgtagtcag 2821 cgccattgcc gatcgcgtga agttccgccg caaggccgct ggacccagat cctttacagg 2881 aaggccaacg gtggcgccca agaaggattt ccgcgacacc gagaccaata gcggaagccc 2941 caacgccgac ttcagctttt gaaggttcga cagcacgtgc agcgatgttt ccggtgcggg 3001 gctcaagaaa aatcccatcc ccggatcgag gatgagccgg tcggcagcga ccccgctccg 3061 tcgcaaggcg gaaacccgcg cctcgaagaa ccgcacaatc tcgtcgagcg cgtcttcggg 3121 tcgaaggtga ccggtgcggg tggcgatgcc atcccctgcg ctgagtgcat aaccaccagc 3181 ctgcagtccg cctcagcaat atcgggatag agcgcagggt caggaaatcc ttggatatcg 3241 ttcaggtagc ccacgccgcg cttgagcgct agcgcgggtt tccggttgga agctgtcgat 3301 tgaaacacgg tgcatctgat cggacagggc gtctaagagc ggcgcaatac gtctgatctc 3361 atcggccggc gatacaggcc tcgcgtccgg atggctggcg gccggtccga catccacgac 3421 gtctgatccg actcgcagca tttcgaccgc cgcggtgaca gcgttggtgg ggtctagcag 3481 tacgtcaatc gaagaaggag tcctcggtga gattcagaat gccgaacacc gtcaccatgg 3541 cgtcggcctc cgcagcgact tccacgatgg ggatcgggcg agcaaaaagg cagcaattat 3601 gagccccata cctacaaagc cccacgcatc aagcttttga ccctgaagca actaggcaat 3661 ggctgtaatt atgacgacgc cgagtcccga accagactgc ataagcaaca accgacagaa 3721 tggatttcga aaccagagaa agaaaataaa tgcgatgcca taaccgatta tgaacaacgg 3781 cggaaggggc aagcttagta aatgcctcgc tagattttaa tgcggatgtt gcgattactt 3841 cgccaactat tgcgataaca agaaaaagcc agcctttcat gatatatctc ccaatttgtg 3901 tagggcttat tatgcacgct taaaaataat aaaagcagac ttgacctgat agtttggctg 3961 tgagcaatta tgtgcttagt gcatctaatc gcttgagtta acgccggcga agcggcgtcg 4021 gcttgaacga attgttagac attatttgcc gactaccttg gtgatctcgc ctttcacgta 4081 gtggacaaat tcttccaact gatctgcgcg cgaggccaag cgatcttctt cttgtccaag 4141 ataagcctgt ctagcttcaa gtatgacggg ctgatactgg gccggcaggc gctccattgc 4201 
ccagtcggca gcgacatcct tcggcgcgat tttgccggtt actgcgctgt accaaatgcg 4261 ggacaacgta agcactacat ttcgctcatc gccagcccag tcgggcggcg agttccatag 4321 cgttaaggtt tcatttagcg cctcaaatag atcctgttca ggaaccggat caaagagttc 4381 ctccgccgct ggacctacca aggcaacgct atgttctctt gcttttgtca gcaagatagc 4441 cagatcaatg tcgatcgtgg ctggctcgaa gatacctgca agaatgtcat tgcgctgcca 4501 ttctccaaat tgcagttcgc gcttagctgg ataacgccac ggaatgatgt cgtcgtgcac 4561 aacaatggtg acttctacag cgcggagaat ctcgctctct ccaggggaag ccgaagtttc 4621 caaaaggtcg ttgatcaaag ctcgccgcgt tgtttcatca agccttacgg tcaccgtaac 4681 cagcaaatca atatcactgt gtggcttcag gccgccatcc actgcggagc cgtacaaatg 4741 tacggccagc aacgtcggtt cgagatggcg ctcgatgacg ccaactacct ctgatagttg 4801 agtcgatact tcggcgatca ccgcttccct catgatgttt aactttgttt tagggcgact 4861 gccctgctgc gtaacatcgt tgctgctcca taacatcaaa catcgaccca cggcgtaacg 4921 cgcttgctgc ttggatgccc gaggcataga ctgtacccca aaaaaacagt cataacaagc 4981 catgaaaacc gccactgcgc cgttaccacc gctgcgttcg gtcaaggttc tggaccagtt 5041 gcgtgaggcc atacgctact tgcattacag cttacgaacc gaacaggctt atgtccactg 5101 ggttcgtgcc ttcatccgtt tccacggtgt gcgtcacccg gcaaccttgg gcagcagcga 5161 agtcgaggca tttctgtcct ggctggcgaa cgagcgcaag gtttcggtct ccacgcatcg 5221 tcaggcattg gcggccttgc tgttcttcta cggcaagtgc tgtgcacgga tctgccctgg 5281 cttcaggaga tcggaagacc tcggccgtcc gggcgcttgc cggtggtgct gaccccggat 5341 gaagtggttc gcatcctcgg ttttctggaa ggcgagcatc gtttgttcgc ccagcttctg 5401 tatggaacgg gcatgcggat cagtgagggt ttgcaactgc gggtcaagga ctggatttcg 5461 atcacggcac gatcatcgtg cgggagggca agggctccaa ggatcgggcc ttgatgttac 5521 ccgagagctt ggcacccagc ctgcgcgagc agctgtctcg tgcacgggca tggtggctga 5581 aggactaggc cgagggccgc agcggcgttg cgcttcccga cgcccttgag cggaagtatc 5641 cgcgcgccgg gcattcctgg ccgtggttct gggtttttgc gcagcacacg cattcgaccg 5701 atccacggag cggtgtcgtg cgtcgccatc acatgtatga ccagaccttt cagcgcgcct 5761 tcaaacgtgc cgtagaacaa gcaggcatca cgaagcccgc cacaccgcac accctccgcc 5821 actcgttcgc gacggccttg ctccgcagcg gttacgacat tcgaaccgtg caggatctgc 5881 tcggccattc 
cgacgtctct acgacgatga tttacacgca tgtgctgaaa gttggcggtg 5941 ccggagtgcg ctcaccgctt gatgcctgcc gcccctcact gtgagaggta gggcagcgca 6001 agtcaatcct agcggattca ctacccctgc gcgaaggcca tcggtgccgc atcgaacggc 6061 cggttgcgga aagtcctccc tgcgtccgct gatggccggc agcagcccgt cgttgaagga 6121 tccctgaaag cgacgttgga tgttaacatc tacaaattgc cttttcttac gaccatgtac 6181 gtaagcgctt acgtttttgg tggacccttg aggaaactgg tagctgttgt gggcctgtgg 6241 tctcaagatg gatcattaat ttccaccttc acctacgatg gggggcatcg caccggtgag 6301 taatattgta cggctaagag cgaatttggc ctgtagacct caattgcgag ctttctaatt 6361 tcaaactatt cgggcctaac ttttggtgtg atgatgctga ctggcaggat atataccgtt 6421 gtaat // LOCUS TOBNPTII 200 bp ds-DNA PLN 26-JUL-1990 DEFINITION N.tabacum nptII gene, complete cds. ACCESSION M34757 KEYWORDS nptII protein. SOURCE N.tabacum (strain SR1) DNA. ORGANISM Nicotiana tabacum Eukaryota; Plantae; Embryobionta; Magnoliophyta; Magnoliopsida; Asteridae; Solanales; Solanaceae. REFERENCE 1 (bases 1 to 200) AUTHORS Gheysen,G.D.R., Herman,L., Breyne,P., Gielen,J., Van Montagu,M. and Depicker,A. TITLE Cloning and sequence analysis of truncated T-DNA inserts from Nicotiana tabacum JOURNAL Gene (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by G.D.R.Gheysen, 01-JUN-1990. FEATURES from to/span description pept 198 > 200 nptII protein mRNA 133 > 200 nptII protein mRNA (5' end +/- 2 bp) recomb 193 194 T-DNA end/plant DNA start signal 25 30 CAAT box signal 47 55 CAAT box signal 101 107 TATA box BASE COUNT 64 a 53 c 32 g 51 t ORIGIN 1 caagcctcgc tagtcaaaag tgtaccaaac aacgctttac agcaagaacg gaaatgcgcg 61 tgacgctcgc ggtgacgcca tttcgccttt tcagaaatgg ataaatagcc ttgcttccta 121 ttatatcttc ccaaattacc aatacattac actagcatct gaatttcata accaatctcg 181 atacaccaaa tcggatcatg // LOCUS BOVANDRE 2461 bp ss-mRNA MAM 26-JUL-1990 DEFINITION Cow alpha-1C-adrenergic receptor mRNA, complete cds. ACCESSION J05426 KEYWORDS alpha-1C-adrenergic receptor. 
SOURCE Cow adult brain cortex, cDNA to mRNA, clone B12. ORGANISM Bos taurus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Artiodactyla; Ruminantia; Pecora; Bovidae. REFERENCE 1 (bases 1 to 2461) AUTHORS Schwinn,D.A., Lomasney,J.W., Lorenz,W., Szklut,P.J., Fremeau,R.T.Jr., Yang-Feng,T.L., Caron,M.G., Lefkowitz,R.J. and Cotecchia,S. TITLE Molecular cloning and expression of the cDNA for a novel alpha-1-adrenergic receptor subtype JOURNAL J. Biol. Chem. 265, 8183-8189 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [2] kindly submitted by D.Schwinn, 29-MAY-1990. FEATURES from to/span description pept 97 1497 alpha-1C-adrenergic receptor BASE COUNT 551 a 667 c 647 g 596 t ORIGIN 1 tgactccccg ctccctcgct cccctcctcc tcacccgccg aggggtggcc ctcaagagcc 61 ggactttgcc ggccccggcc ccggggggct gggaccatgg tgtttctctc cggaaatgcc 121 tccgacagct ccaactgcac ccacccgccg ccaccggtga acatttccaa ggccattctg 181 ctcggggtga tcttgggggg cctcatcctt ttcggggtac tggggaacat cctcgtgatc 241 ctttccgtgg cctgccaccg gcacctgcac tcggtcacac actactacat cgtcaacctg 301 gcggtggccg accttctcct cacttccacg gtgctgccct tctccgctat cttcgagatc 361 ttgggctact gggccttcgg cagggtcttc tgcaatgtct gggcggcggt ggacgtcctg 421 tgctgcacgg cttccatcat gggactctgc atcatctcca tcgaccgcta catcggcgtg 481 agctatcctc tgcgctaccc caccatcgtc acccagaaga ggggcctcat ggccctgctc 541 tgcgtctggg cgctctcttt ggtcatctcc atcgggcccc tcttcggctg gaggcagccg 601 gccccggagg acgagaccat ctgccagatc aacgaggagc cgggctacgt gctcttctcg 661 gctctgggct ccttctacgt gccgctgacc atcatcctgg tcatgtactg ccgggtctac 721 gtcgtggcca agagggagag ccggggcctc aagtcgggcc ttaagaccga caagtcagac 781 tcggagcagg tgacgctccg catccatcgc aaaaacgccc aggtaggagg cagcggggtg 841 accagcgcca agaacaagac gcacttctcc gtgagactgc tcaaattttc ccgcgagaag 901 aaagcggcca aaacgctggg catcgtggtc ggctgcttcg tcctctgctg gctgcctttt 961 ttcttagtga tgcccattgg gtctttcttt cctgatttca ggccctcaga aaccgttttt 1021 aaaatagcat tttggctcgg ttacctaaac agctgcatca 
accccattat atacccatgc 1081 tccagtcaag agtttaaaaa ggcctttcag aatgtcttga gaatccagtg tctgcgacga 1141 aagcagtcct ccaaacacac cctgggctac acgctgcacg cacccagcca cgtcctggag 1201 ggacagcaca aggacctggt tcgcattccg gtgggatctg cagagacctt ctataagatc 1261 tccaagacgg atggggtctg tgaatggaaa attttctctt ccctaccccg cggatctgcc 1321 aggatggcgg tggccagaga cccatcagcc tgcaccactg cccgggtgag aagtaaaagc 1381 tttttgcaag tgtgctgttg cctggggccc tcgaccccca gtcatggaga gaatcatcag 1441 attccgacca ttaagatcca caccatctcc ctcagtgaaa atggggagga agtctaaagg 1501 acaggaaagg tcagaaggat gggagggtga tcttaggtac ccactctcca cttccttctg 1561 ggaaggccag ttcacgttcc gtggatgctg agacacagcc agtaaaccag ggaccatctg 1621 ggaatgggct ggggaggaga gctgactctg gggcagaggt agggcttaga gacgagagag 1681 gatgtcctac caccatccag ttcactatga tgagaaacag catttccttg aggctaatgc 1741 tctctgggtc attctctgag cctgctttct acgcctgtcc ctttcaacga caaacaccat 1801 gggaaacaga atttcataca caatccaaaa gacgataaat ataggattat gatttcatca 1861 tgaatatttt gagcatgcac tctaagtttg gagctatttc ttgatggagt gaggggattt 1921 tatttccagg ctaaacttgc tgaaagccac gttggatttt tatggagaga aggcctggag 1981 aggaagagcc ttaagatggt ggccaatatc cagacgcatt atttttagag caagttttac 2041 agtccaccct ttctcagttt gggtgaaact tgacagtgag attttattta ccttttgctg 2101 ctgcttgaca ggatactgct cccaattccc taaggatgag ggtgaggggt actcattatg 2161 ccaatggtca tctgcacttg ggtatagaga gtgttgaaag aaccagttgg gaaaaggatg 2221 gcttttcctg gtggaagaca gtaaggatga gagtcagttc ttcaaattct atggacagaa 2281 ttccattaag tggttccaag atcaggtgga ggaaggcttc ttgtgtaaca tatttaaaga 2341 tcaagagttt ggggtggggt gggtgctact ttcaagctaa gatagaggct gcaaaattac 2401 tccacagcct tttcaacatg gcatagaaag gcttttcttg gcaaatcact taccttttcc 2461 a // LOCUS CHKANCC2A 1229 bp ss-mRNA VRT 26-JUL-1990 DEFINITION Chicken anchorin CII mRNA, 3' end. ACCESSION M30971 J03194 KEYWORDS anchorin; collagen-binding protein. SOURCE Chicken cartilage (sternum) and bone, cDNA to mRNA, clones A[1,4,6,7,14,15,22,23]. 
ORGANISM Gallus gallus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Aves; Neornithes; Neognathae; Galliformes; Phasianidae. REFERENCE 1 (bases 1 to 1229) AUTHORS Fernandez,M.P., Selmin,O., Martin,G.R., Yamada,Y., Pfaeffle,M., Deutzmann,R., Mollenhauer,J. and von der Mark,K. TITLE The structure of anchorin CII, a collagen binding protein isolated from chondrocyte membrane JOURNAL J. Biol. Chem. 263, 5921-5925 (1988) STANDARD simple staff_entry REFERENCE 2 (bases 373 to 504) AUTHORS Fernandez,M.P., Selmin,O., Martin,G.R., Yamada,Y., Pfaeffle,M., Deutzmann,R., Mollenhauer,J. and von der Mark,K. TITLE The structure of anchorin CII, a collagen binding protein isolated from chondrocyte membrane JOURNAL J. Biol. Chem. 265, 8344-8344 (1990) STANDARD simple staff_entry FEATURES from to/span description pept 19 984 anchorin CII BASE COUNT 353 a 230 c 310 g 336 t ORIGIN 28 bp upstream of AccI site. 1 cccggcgaac cggggaagat ggcgaagtat acaagaggca ccgtgacagc attctctcct 61 tttgatgcca gagctgatgc agaagccctt cgcaaggcca tgaagggaat ggggactgat 121 gaagagacaa ttctgaagat ccttaccagc agaaataatg ctcaacgtca agaaattgca 181 tctgctttta aaacactgtt tggcagggat cttgtggatg acctgaaatc agaacttact 241 ggcaagtttg aaacactgat ggtatctttg atgagaccag cacgtatttt tgatgcgcat 301 gcactgaagc atgcaatcaa gggagcagga accaatgaga aagtgttgac tgaaattctt 361 gcctccagaa cacctgctga agtgcagaat attaaacagg tttatatgca agagtatgag 421 gccaacttgg aggataagat cacaggagag acatcaggcc attttcagag actgctggtg 481 gtcctgctgc aggcaaatag agatcctgat ggcagagttg acgaggctct tgttgagaag 541 gatgctcagg tcttgtttag agctggggag ctaaaatggg gaacagatga agaaacattc 601 atcaccatct tgggaactcg aagtgtttct catttgagga gggtgtttga caaatacatg 661 actatttctg gctttcaaat tgaagaaacc attgaccgtg aaacctctgg tgatttggag 721 aagttgcttt tggcagttgt gaagtgcatc cgaagtgtgc ctgcttattt tgctgaaact 781 ttgtattatt ctatgaaagg ggctggcact gatgatgata ccctgatcag agtcatggtt 841 tcaagaagtg aaatcgacct gttggatatt agacatgaat tcagaaagaa ttttgcgaaa 901 tcgttgtatc agatgattca gaaagataca 
tctggggact acaggaaggc actcctgctc 961 ctctgtggtg gagatgatga gtaatggtgg cagcgacgtg aaggatttct tgtaatccag 1021 ctttgcagcc cttcagttag catgcctagc taagattttg catcttaatg ctttatggct 1081 gttcgaattt atattcatat cacacttatt aaacacaaac atgttactac tagctgataa 1141 acagtccctc ctcctcagac gtcctgactc tgggaatttc agtgccttct gagtgtatgc 1201 aaagtctctc atggagtaga gtagtatcg // LOCUS ECOHLYCA 633 bp ds-DNA BCT 26-JUL-1990 DEFINITION E.coli hly plasmid hemolysin (hlyC) gene, complete cds. ACCESSION M35668 KEYWORDS hemolysin. SOURCE E.coli hly plasmid pHly152 DNA, clone pANN202-419. ORGANISM Escherichia coli Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Facultatively anaerobic rods; Enterobacteriaceae. REFERENCE 1 (bases 1 to 633) AUTHORS Goebel,W., Hacker,J., Knapp,S., Then,I., Wagner,W., Hughes,C. and Juarez,A. TITLE Structure, function, and regulation of the plasmid-encoded hemolysin determinant of Escherichia coli JOURNAL Basic Life Sci. 30, 791-805 (1985) STANDARD simple staff_entry FEATURES from to/span description pept 121 633 hemolysin (hlyC) BASE COUNT 219 a 99 c 115 g 200 t ORIGIN 1 tagtcacgca ataaaacgtt ctttaatatt aatgcagtta tgacattaaa ggcaagaaac 61 ataaaggcat atttttgcca caatatttaa tcatataatt taagttgtag tgagtttatt 121 atgaatataa acaaaccatt agagattctt gggcatgtat cctggctatg ggccagttct 181 ccactacaca gaaactggcc agtatctttg tttgcaataa atgtattacc cgcaatacag 241 gctaaccaat atgttttatt aacccgggat gattaccctg tcgcgtattg tagttgggct 301 aatttaagtt tagaaaatga aattaaatat cttaatgatg ttacctcatt agttgcagaa 361 gactggactt caggtgatcg taaatggttc attgactgga ttgctccttt cggggataac 421 ggtgccctgt acaaatatat gcgaaaaaaa ttccctgatg aactattcag agccatcagg 481 gtggatccca aaactcatgt tggtaaagta tcagaatttc atggaggtaa aattgataaa 541 cagttagcga ataaaatttt taaacaatat caccacgagt taataactga agtaaaaaga 601 aagtcagatt ttaatttttc attaactggt taa // LOCUS ECOTRMX4 77 bp ss-tRNA RNA 26-JUL-1990 DEFINITION E.coli f-Met-tRNA. ACCESSION M35184 KEYWORDS transfer RNA-f-Met. SOURCE E.coli tRNA. 
ORGANISM Escherichia coli Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Facultatively anaerobic rods; Enterobacteriaceae. REFERENCE 1 (bases 1 to 77) AUTHORS Dube,S.K., Marcker,K.A., Clark,B.F.C. and Cory,S. TITLE Nucleotide sequence of N-formyl-methionyl-transfer RNA JOURNAL Nature 218, 232-233 (1968) STANDARD simple staff_review FEATURES from to/span description tRNA 1 77 f-Met-tRNA anticdn 35 37 f-Met-tRNA anticodon cat modified 21 21 d modified 33 33 2'Ome modified 47 47 m7g modified 56 56 p BASE COUNT 14 a 26 c 25 g 12 t ORIGIN 1 cgcggggtgg agcagcctgg tagctcgtcg ggctcataac ccgaaggtcg tcggttcaaa 61 tccggccccc gcaacca // LOCUS HECDA8 1435 bp ss-rRNA BCT 26-JUL-1990 DEFINITION H.mustelae 16S ribosomal RNA. ACCESSION M35048 KEYWORDS 16S ribosomal RNA. SOURCE H.mustelae (strain ATCC 43772) ribosomal RNA. ORGANISM Helicobacter mustelae Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Aerobic/microaerophilic, motile, helical/vibrioid bacteria. REFERENCE 1 (bases 9 to 1435) AUTHORS Paster,B.J., Lee,A., Dewhirst,F.E., Fox,J.G., Tordoff,L.A. and Ferrero,R. TITLE The phylogeny of Helicobacter felis sp. nov., a spiral-shaped bacterium isolated from the gastric mucosa of the cat, Helicobacter mustelae, and related bacteria JOURNAL Unpublished (1990) STANDARD full staff_review REFERENCE 2 (bases 1 to 1435) AUTHORS Paster,B.J., Lee,A., Dewhirst,F.E., Fox,J.G., Tordoff,L.A. and Ferrero,R. JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [2] kindly submitted by B.J.Paster, 06-JUN-1990. 
Author address:B.J.Paster Forsyth Dental Center 140 Fenway Boston, MA 02115 FEATURES from to/span description rRNA 1 > 1435 16S ribosomal RNA BASE COUNT 375 a 301 c 412 g 326 t 21 others ORIGIN 1 attatggaga gtttnatcct ggctcagagt gaacgctggc ggcgtgccta atacatgcaa 61 gtcgaacgat gaagcttcta gcttgctaga agtggattag tggcgcacgg gtgagtaacg 121 cataggttat gtgccccata gtctgggata gccactggaa acggtgatta atactggata 181 ctcctacggg ggnaaagntn ttcgctatgg gatcagccta tgtcctatca gcttgttggt 241 gaggtaatgg ctcacnnagg ctatgacggg tatccggcct nagagggtga tcggacacac 301 tggaactgag acacggtcca gactcctacg ggaggcagca gtagggaata ttgctcaatg 361 ggcgaaagcc tgaagcagca acgccgcgtg gaggatgaag gttttaggat tgtaaactcc 421 ttttctaaga gaagataatg acggtatctt aggaataagc accggcnnac tccgtgccag 481 cagccgcggn antacggagg gtgcnagcgt tactcggaat cactgggcgt naagagcgcg 541 taggcggagt aataagtcag atgtgaaatc ctgtagctta actacagaac tgcatttgaa 601 actgttattc tagagtgtgg gagaggtagg tggaattctt ggtgtagggg tnaaatccgt 661 agagatcaag aggaatactc attgcgaagg cgacctactg gaacattact gacgctgatg 721 cgcgaaagcg tggggagcaa acaggattag ataccctggt agtccacgcc ctaaacgatg 781 aatgctagtt gttggggtgc ttgtcactcc agtaatgcag ttaacacatt aagcattccg 841 cctggggagt acggtcgcaa gattaaaact caaaggaata gacggggacc cgcacaagcg 901 gtggagcatg tggtttaatt cgannntacg cgaagaacct tacctaggct tgacattgat 961 agaatctgct agaaatagcg gagtgtctag tttactagac cttgaaaaca ggtgctgcac 1021 ggctgtcgtc agctcgtgtc gtgagatgtt gggttaagtc ccgcaacgag cgcaaccctc 1081 gttcttagtt gctagcagtt cggctgagca ctctaagaag actgccttcg tnaggaggag 1141 gaaggtgagg acgacgttaa gtcatcatgg cccttacgcc tagggctaca cacgtgctac 1201 aatggggtgc acaaagagac gcaataccgc gaggtggagc aaatctcaaa aacatctctc 1261 agttcggatt gtagtctgca actcgactac atgaagctgg aatcgctagt aatcgtgaat 1321 cagccatgtc acggtgaata cgttcccggg tcttgtactc accgnccgtc acaccatggg 1381 agttgtattc gccttaagcc gggatgctaa attggctacc gtccanggcg gatnc // LOCUS HECRDA 1446 bp ss-rRNA BCT 26-JUL-1990 DEFINITION H.felis 16S ribosomal RNA. 
ACCESSION M35047 KEYWORDS 16S ribosomal RNA. SOURCE H.felis (ATCC 49179) ribosomal RNA. ORGANISM Helicobacter felis Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Aerobic/microaerophilic, motile, helical/vibrioid bacteria. REFERENCE 1 (bases 9 to 1446) AUTHORS Paster,B.J., Lee,A., Dewhirst,F.E., Fox,J.G., Tordoff,L.A. and Ferrero,R. TITLE The phylogeny of Helicobacter felis sp. nov., a spiral-shaped bacterium isolated from the gastric mucosa of the cat, Helicobacter mustelae, and related bacteria JOURNAL Unpublished (1990) STANDARD full staff_review REFERENCE 2 (bases 1 to 1446) AUTHORS Paster,B.J., Lee,A., Dewhirst,F.E., Fox,J.G., Tordoff,L.A. and Ferrero,R. JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [2] kindly submitted by B.J.Paster, 06-JUN-1990. Author address:B.J.Paster Forsyth Dental Center 140 Fenway Boston, MA 02115 FEATURES from to/span description rRNA 1 > 1446 16S ribosomal RNA BASE COUNT 354 a 299 c 402 g 310 t 81 others ORIGIN 1 tttatggaga gtttgatcct ggctcagagt gaacgctggc ggcgtgccta atacatgcaa 61 gtcgaacgat gaagcctagc ttgctaggcg gattagtggc gcacgggtga gtaacgcata 121 gatgacatgc cctttagttt gggatagcca ctagaaatgg tgattaatac caaatactac 181 ctacggggga aagatttatc gctaaaggat tggtctatgt cctatcagct tgttggtgag 241 gtaaaggctc acnnaggcta tgacgggtat ccggcctgag agggtgaacg gacacactgg 301 aactgagaca cggtccagac tccnncggga ggcagcagta gggaatattg ctcaatgggc 361 gcaagcctga agcagcaacg ccgcgtggag gatgaaggtt ttaggattgt aaactccttt 421 tgtcagagaa gataatgacg gtatctgacg aataagcacc ggctanctcc gtgccagcag 481 ccgcggtaat acggagggtg cnagcgttac tcggaatcnc tgggcgtaaa gagtgcgtag 541 gcggggttgt aagtcagatg tgaaatccta tggcttaacc atagaactgc atttgaaact 601 acaactctgg agtgtgggag aggtaggtgg aattcttggt gtaggggtaa aatccgtaga 661 gatcaagagg aatactcatt gcgaaggcga cctgctggaa caatactgac gctgattgcn 721 cgaaagcgtg gggagcaaac aggattagat accctggtag tccacgccct aaacgatgga 781 tgctagttgt tggggggctt tgtcctccca gtaatgcagc taacgcctta 
agcatcccgc 841 ctggggagta cggtcgcaag annnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 901 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnc gaagaacctt acctaggctt gacattgaan 961 gaatctgcta gaaatatgtg agtgtctagc ttgctagacc ctgaaaacag gtgctgcacg 1021 gctgtcgtca gctcgtgtcg tgagatgttg ggttaagtcc cgcaacgagc gcaaccctct 1081 ttcttagttg ctaacaggta gtgctgagct ctctaagaat actgcctgcg taagcaggag 1141 gaaggtgagg acgacgtcaa gtcatcatgg cccttacgcc tagggctaca cacgtgctac 1201 aatggggtgc acaaagagat gcaatgccgc gaggttgagc caatcttaaa aacnnctctc 1261 agttcggatt gcaggctgca actcgcctgc atgaagctgg aatcgctagt aatcgcaaat 1321 cagccatgtt gcggtgaata cgttcccggg tcttgtactc accgnncgtc acaccatggg 1381 agttgtgttt gccttaagtc aggatgctaa ggtagctact gcccacggca cacacagcga 1441 ctgggg // LOCUS HUMHPBS 821 bp ss-mRNA PRI 26-JUL-1990 DEFINITION Human peripheral benzodiazepine receptor (hpbs) mRNA, complete cds. ACCESSION M36035 KEYWORDS peripheral benzodiazepine receptor. SOURCE Human histiocytic lymphoma monocyte-like cell line U937, cDNA to mRNA, clone p-hPBS11. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 821) AUTHORS Riond,J., Mattei,M.G., Kaghad,M., Dumont,X., Guillemot,J.C., Le Fur,G., Caput,D. and Ferrara,P. TITLE Molecular cloning and chromosomal localization of a human peripheral-type benzodiazepine receptor JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by J.Riond, 27-JUN-1990. Author address:J.Riond SANOFI ELF BIO RECHERCHES BP137 31328 LABEGE CEDEX FRANCE FEATURES from to/span description pept 62 571 peripheral benzodiazepine receptor mRNA < 1 811 peripheral benzodiazepine receptor mRNA site 800 805 polyadenylation site BASE COUNT 118 a 271 c 260 g 171 t 1 others ORIGIN Chromosome 22, map position q13.3.
1 agtgcccttc ccggagcgtg ccctcgccgc tgagctcccc tgaacagcag ctgcagcagc 61 catggccccg ccctgggtgc ccgccatggg cttcacgctg gcgcccagcc tggggtgctt 121 cgtgggctcc cgctttgtcc acggcgaggg tctccgctgg tacgccggcc tgcagaagcc 181 ctcgtggcac ccgccccact gggtgctggg ccctgtctgg ggcacgctct actcagccat 241 ggggtacggc tcctacctgg tctggaaaga gctgggaggc ttcacagaga aggctgtggt 301 tcccctgggc ctctacactg ggcagctggc cctgaactgg gcatggcccc ccatcttctt 361 tggtgcccga caaatgggct gggccttggt ggatctcctg ctggtcagtg gggcggcggc 421 ngccactacc gtggcctggt accaggtgag cccgctggcc gcccgcctgc tctaccccta 481 cctggcctgg ctggccttcg cgaccacact caactactgc gtatggcggg acaaccatgg 541 ctggcatggg ggacggcggc tgccagagtg agtgcccggc ccaccaggga ctgcagctgc 601 accagcaggt gccatcacgc ttgtgatgtg gtggccgtca cgctttcatg accactgggc 661 ctgctagtct gtcagggcct tggcccaggg gtcagcagag cttcagaggt tgccccacct 721 gagcccccac ccgggagcag tgtcctgtgc tttctgcatg cttagagcat gttcttggaa 781 catggaattt tataagctga ataaagtttt tgacttcctt t // LOCUS XELAAA 121 bp ss-rRNA VRT 26-JUL-1990 DEFINITION X.laevis 5S ribosomal RNA. ACCESSION M35175 KEYWORDS 5S ribosomal RNA. SOURCE X.laevis somatic cell ribosomal RNA. ORGANISM Xenopus laevis Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Amphibia; Lissamphibia; Anura; Archeobatrachia; Pipoidea; Pipidae. REFERENCE 1 (bases 1 to 121) AUTHORS Wegnez,M. and Denis,H. TITLE Expression des genes ribosomiques 5 S chez le Xenope JOURNAL Arch. Int. Physiol. Biochim. 81, 211-213 (1973) STANDARD simple staff_review FEATURES from to/span description rRNA 1 121 5S ribosomal RNA BASE COUNT 24 a 34 c 38 g 25 t ORIGIN 1 gcctacggcc acaccaccct gaaagtgccc gatctcgtct gatctcggaa gccaagcagg 61 gtcgggcctg gttagtactt ggatgggaga ccgcctggga ataccaggtg tcgtaggctt 121 t // LOCUS XELAAB 121 bp ss-rRNA VRT 26-JUL-1990 DEFINITION X.laevis 5S ribosomal RNA. ACCESSION M35176 KEYWORDS 5S ribosomal RNA. SOURCE X.laevis oocyte ribosomal RNA. 
ORGANISM Xenopus laevis Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Amphibia; Lissamphibia; Anura; Archeobatrachia; Pipoidea; Pipidae. REFERENCE 1 (bases 1 to 121) AUTHORS Wegnez,M. and Denis,H. TITLE Expression des genes ribosomiques 5 S chez le Xenope JOURNAL Arch. Int. Physiol. Biochim. 81, 211-213 (1973) STANDARD simple staff_review FEATURES from to/span description rRNA 1 121 5S ribosomal RNA BASE COUNT 25 a 33 c 37 g 26 t ORIGIN 1 gcctacggcc acaccaccct gaaagtgcct gatctcgtct gatctcagaa gcgatacagg 61 gtcgggcctg gttagtactc ggatgggaga ccgcctggga ataccaggtg tcgtaggctt 121 t // LOCUS ECORR50L1A 165 bp ds-DNA RNA 26-JUL-1990 DEFINITION E.coli 50S rRNA protein L1-associated RNA. ACCESSION M24864 KEYWORDS 50S ribosomal RNA. SOURCE E.coli 50S ribosomal RNA. ORGANISM Escherichia coli Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Facultatively anaerobic rods; Enterobacteriaceae. REFERENCE 1 (bases 1 to 165) AUTHORS Branlant,C., Krol,A., Sriwidada,J. and Brimacombe,R. TITLE RNA sequences associated with proteins L1, L9, and L5, L18, L25, in ribonucleoprotein fragments isolated from the 50-S subunit of Escherichia coli ribosomes JOURNAL Eur. J. Biochem. 70, 483-492 (1976) STANDARD simple staff_entry FEATURES from to/span description modified 13 13 7-methyluridine unsure 61 61 u could be a unsure 141 141 c could be g BASE COUNT 36 a 35 c 47 g 47 t ORIGIN 1 taacctttac tatggcgaca ctgaacattg agccttgatg tgtaggatag gtgggagctt 61 tgaagtggac gtgccagtct gcatggagcc gaccttgaaa taccctttac aatgtttgat 121 gttctaacgt ggacccgctt cgggttgcat cgcggacagt gtctg // LOCUS BMOSP1 1512 bp ds-DNA INV 26-JUL-1990 DEFINITION Silkworm (B.mori) storage protein 2 (SP2) gene, exon 1. ACCESSION M24371 J04829 KEYWORDS arylphorin-type storage protein; storage protein; storage protein 2. SEGMENT 1 of 2 SOURCE Silkworm (strain Tokai x Asahi; 5th larval instar) DNA and cDNA to mRNA. 
ORGANISM Bombyx mori Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta; Pterygota; Neoptera; Holometabola; Lepidoptera; Ditrysia; Bombycoidea; Bombycidae. REFERENCE 1 (bases 1 to 983) AUTHORS Fujii,T., Sakurai,H., Izumi,S. and Tomino,S. TITLE Structure of the gene for the arylphorin-type storage protein, Sp 2 of Bombyx mori JOURNAL J. Biol. Chem. 264, 11020-11025 (1989) STANDARD simple staff_review REFERENCE 2 (bases 984 to 1512) AUTHORS Fujii,T., Sakurai,H., Izumi,S. and Tomino,S. JOURNAL Unpublished (1989) 2-1-1 Fukazawa, Setagaya-ku, Tokyo 158, Japan STANDARD simple staff_review COMMENT Draft entry and sequence for [1],[2] kindly submitted by S.Tomino, 28-APR-1989. FEATURES from to/span description pept 796 + 883 storage protein SP2, exon 1 pre-msg 771 > 1512 SP2 mRNA and intron IVS 884 > 1512 SP2 intron A binding 436 441 glucocorticoid-receptor binding site site 447 457 SV40 enhancer core conflict 199 199 c in [1]; g in [2] BASE COUNT 475 a 266 c 247 g 524 t ORIGIN Unreported. 1 aagcttttta aaaaaagaac tttatttaat tttaataatt aaaacatttg aaattaacaa 61 ttgaaattaa ttggcgcaag tgtcaccggg agcgcggtta gaattgaact gcgtgatcta 121 tcggtaacct aactaagctg cattacgtcg tgcaccttac attgcacatt tatgtacatt 181 aaaaatatat aacagtaccc aataaaaaag cattatttcg tcttgtaaca gtcggttgaa 241 aaattgaaag taattaacga catgcttaga gtttcgatcg tagtaaaagc tacgttttgt 301 ctatcatatt agaaagatat agtaacttct tttgtctctc tttattcttt aaattttact 361 taatcaggtg aatagctttt actactttac tcaatgtttt catcatactc ctggctaagt 421 cttcgctagc ccgcctgtcc tagtaagccg tggaaaggct ccgggacacc agcaaacctt 481 caatcataaa aaaaattgct ttcatcatgt tttcgtttac agttttacaa atatttcata 541 attttccatt cctttttttt gaattatata ataataacaa gaaaaaaact ttatatctat 601 ttgtttatca tcatcgttga aatttatatt cagtaattca aattatgaga ccggtgaaaa 661 ggtcagtaga ttacgttgat aatgaaagca taacacttgt tgctaatgag tgcatgtttc 721 gggagaagat aaagtgtggg tataaatatt cgaaaacgga ttgcagaagc acagtttgct 781 tctaggctgg aaaccatgaa gtctgtcttg attctggctg ggcttgtagc cgtcgcgctc 841 agcagtgcag taccaaaacc gagcaccata
aagtcaaaaa atggtaagcg ttaaatagta 901 gtgctctatt ttaatacgct tttattatta ttattattat tattaattct ttatttcagt 961 tttgtttttt aaaaccataa cattttgtta gtagtaatta cttatatcta tgttagtgac 1021 ttaaaaaatc taacacataa ctctcattat atatatacat tttataccat tacatttttt 1081 attttatttt tttctccttc caagtgccta ctgcaaaggc tattgatcag cagtccctcg 1141 atcttgctcg atatgattct caaaagactg ttgccactgt cacgaactcg acgcaataac 1201 gatgcacttc tcttccgcat tattgcaaag aagtcatcgg tgtgagatgt cgcaaacatt 1261 gtggatgcac tacaaaagcg cggcagtgac aacatcatcc taaacgcatt attatattga 1321 acgcgtaggg cattgtaagc tctccgcgtg tatgtggtcc acagactact ggcgtaaaaa 1381 ttctggcaat aagctttaaa aattgtaatt tgacatacta tcgcaaccag taaatctgcg 1441 ggccagcata ttgcatctta ctatcaatta ttattattat tttttttatt gcttagatgt 1501 gtggacgagc tc // LOCUS BMOSP2 3876 bp ds-DNA INV 26-JUL-1990 DEFINITION Silkworm (B.mori) storage protein 2 (SP2) gene, exons 2,3,4, and 5. ACCESSION M24370 J04829 KEYWORDS arylphorin-type storage protein; storage protein; storage protein 2. SEGMENT 2 of 2 SOURCE Silkworm (strain Tokai x Asahi; 5th larval instar) DNA and cDNA to mRNA. ORGANISM Bombyx mori Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta; Pterygota; Neoptera; Holometabola; Lepidoptera; Ditrysia; Bombycoidea; Bombycidae. REFERENCE 1 (bases 1288 to 3876) AUTHORS Fujii,T., Sakurai,H., Izumi,S. and Tomino,S. TITLE Structure of the gene for the arylphorin-type storage protein, Sp 2 of Bombyx mori JOURNAL J. Biol. Chem. 264, 11020-11025 (1989) STANDARD simple staff_review REFERENCE 2 (bases 1 to 1287) AUTHORS Tomino,S. JOURNAL Unpublished (1989) 2-1-1 Fukazawa, Setagaya-ku, Tokyo 158, Japan STANDARD simple staff_review COMMENT Draft entry and sequence for [1],[2] kindly submitted by S.Tomino, 28-APR-1989. 
FEATURES from to/span description pept + 1388 1521 storage protein SP2, exon 2 1614 2444 storage protein SP2, exon 3 2526 2692 storage protein SP2, exon 4 2850 3744 storage protein SP2, exon 5 pre-msg < 1 3846 SP2 mRNA and introns IVS < 1 1387 SP2 intron A IVS 1522 1613 SP2 intron B IVS 2445 2525 SP2 intron C IVS 2693 2849 SP2 intron D BASE COUNT 1204 a 734 c 696 g 1242 t ORIGIN Unknown number of bp after segment 1. 1 ttgctagccc ttcttcttta tgttttggag aaggttctca attcaaaatg tacgttttca 61 ttatagcctt attacgaaag cttatacgaa cgttatatct ttaactatgc atacagccgt 121 ctattgaatc attgttgtta taaattgttt tacaattgct ataggctcac atctctttct 181 gaggcgtgat ttagaaaagg atgcacgatg cgtgatccaa tttggaattt gatagctcgg 241 cctcatctcc tgcctcatag caaggccgat tttgtgaggc ctcctatcta aactaaaaag 301 aacaaaaccg cacttacccc gcagcggccg actaggttgc actgttgcta taccatcatt 361 tgtatgttgg tatattatta ccgctgtaat gtataggtac attaccgcca gtattgcata 421 tgttgcacga tgaacatgtt caatatatgt aaaatttaca atttaaatac gtcaccgttt 481 caacacaaaa ctatttgcaa atggattcat cattcatcat ctaaactcgt cgtggcctaa 541 aggataagac gtccggtgca ttcgtgttga gcgatgcacc ggtgctcgaa tcccaagcgg 601 gtaccaattt ttctaatgga atacgtactc aacaaatgtt catgattgac ttccacggta 661 aaggaataac atctatacta atattataaa gaggaaagat ttgtttgttt gtttgtttcg 721 aataggctcc gaaactactg gaccgatttg aaaaattctt tttccattag aagccaacat 781 tgtccctgat gaacataggc tacatttttt aatttttttt tttttttttg tttcatgtgt 841 gttttaatgt ttccgaagcg aagcgagggc gggtcgctag tcgtgtaata aaaatcaaag 901 ccgcaaaaat tataatttgc gtaattacta gtggtaggac ctcttgtgac gcaagggtag 961 gtacttgaga ccttagaatt tatatctcaa ggtgggtggt gcatatacgt tgtaaatgtc 1021 tatggggtct agtaaccgct taacaccagg tggactagtt cagccaccta agcaataaaa 1081 ataaaaatca tcaaaataga aaatcaacca ttgtaggttt ataccgtatt gactaagtaa 1141 taaagaaaag caggtttttt ttacaaacaa ccaaattatg taataaaagt aaatatagta 1201 agctatgaac gaccgattag tggtaacata tcggcgctga aagttcctaa tgtgctttga 1261 tgccaatatt tatctcagaa ttgaagttat tcaatacttt ccagataatg atgacatcta 1321 agtgatatcg cttattcgta aatacttctt 
tataaaatat ttacatatat ttttttactt 1381 tattcagtgg atgccgtatt tgttgaaaag caaaagaaaa ttctgtcctt cttccaagat 1441 gtgagccaac taaacactga tgatgaatat tataaaattg gcaaagacta tgatatcgaa 1501 atgaatatgg acaactacac tgtaagtact aataattaat atcaatttaa atttaacgtg 1561 aatttgtttg tttctttctt tctttattga aaaccatgtt tcatatttaa cagaacaaga 1621 aagctgttga agaatttctg aagatgtaca ggactggttt tatgcctaag aatttagagt 1681 tctccgtttt ttatgacaag atgagggatg aagctattgc tctattggat ttattctatt 1741 acgctaagga ctttgaaacg ttctacaaga gtgcctgttt tgcgcgtgtg catctcaatc 1801 aaggtcaatt cttgtatgcc ttctacatcg ctgttatcca gcgccctgat tgccacggtt 1861 tcgttgttcc tgctccgtat gaagtatacc ctaaaatgtt tatgaatatg gaagtgctgc 1921 aaaaaattta cgtaacaaag atgcaacatg gcctcattaa tcctgaagcc gcagctaagt 1981 atggcattca caaggaaaac gactacttcg tttacaaagc caattattct aacgccgttt 2041 tatacaataa tgaagaacaa aggctgacat acttcactga ggatattggc atgaacgctt 2101 actactacta cttccactct catttaccgt tctggtggac atcagaaaaa tacggagccc 2161 ttaaagagcg tcgtggagag gtttacttct acttctacca gcaattattg gctcgttact 2221 actttgagcg tcttaccaat ggacttggta agattcccga attctcatgg tactctccga 2281 taaagactgg atactatcca ttgatgctaa ctaagtttac acccttcgca caaagacctg 2341 actactacaa cttgcacacc gaagaaaact atgaaagagt aagattcctt gacacttatg 2401 agaagacatt cgttcagttc ctccaaaagg accactttga agccgtaagt tcgaacatta 2461 agtgtctaat cttattggtt tatttctaaa aatgtatgaa tttaatagat tttttcattt 2521 tatagttcgg acaaaaaatt gattttcacg acccgaaagc cattaacttc gtcggcaact 2581 actggcaaga taatgcagat ctgtatggag aagaagtcac aaaagattac caacgttctt 2641 acgaagtatt tgcgcgccgt gtgctaggtg ctgcgcctat gccattcgac aagtacgttt 2701 aaaaaatatt ttcaaaactt aatttttact aagcaatgac gacaactctt ttctacgtta 2761 tatccaagtc aaccgtaatc cggatttgtc tttgtacgtt tgcaaaaaaa ttaatagtaa 2821 tacatagttt cttcatgcta ctttttcagg tacactttca tgcctagtgc aatggacttt 2881 taccagactt ctcttcgtga tcctgctttc tatcagctct acaacagaat tgtggaatac 2941 atcgttgagt tcaagcaata cttgaagcct tacactcaag acaaacttta ctttgatggt 3001 gtcaagataa ctgatgttaa agtcgacaaa ttgacaacat 
tcttcgagaa ctttgaattc 3061 gacgccagca acagcgtgta ctttagtaag gaggagatta agaacaatca cgtccatgag 3121 ttaaggtgcg ccacacgatt gaaccacagc cccttcaacg ttaacattga ggttgattct 3181 aatgtcgcca gtgacgctgt tgtcaaaatg ttgctggccc ccaaatacga tgacaacgga 3241 atacctctca cattagagga caactggatg aaattcttcg agttggactg gttcacaact 3301 aaactcaccg ctggtcagaa caagattatc cgcaattcga atgaatttgt catatttaaa 3361 gaagactccg tgccaatgac tgaaattatg aagatgctcg acgaaggaaa agtacctttt 3421 gatatgtcgg aagagttctg ttacatgcct aaaagactca tgctgcctag aggtactgaa 3481 ggtggattcc cattccagct ctttgttttc gtctatccat tcgacaacaa aggcaaggac 3541 ttggctcctt tcgaatcttt tgttcttgac aataacctct tggcttccct ctggatcgcc 3601 ccgttgttga tgcattattc aaggttccta acatgtattt caaggatatt ttcatttacc 3661 acgagggtga acggttccct tacaaattca atcttccttc gtatgacaca catgataatg 3721 ttgttccaaa aaattaaatt ttaataaact gatgaatttt gcatccgtaa tatccaaaga 3781 aaatgtaaaa actttaagta gaactgttat gatttagaaa aaataaaatc aagtaggtaa 3841 aattataatt atgtattttt attgcatgca ttttta // LOCUS HUMG6PA 1464 bp ss-mRNA PRI 26-JUL-1990 DEFINITION Human glucose-6-phosphate dehydrogenase, complete cds. ACCESSION M24470 M27958 KEYWORDS glucose-6-phosphate dehydrogenase. SOURCE Human, cDNA to mRNA, clone NG6PD 1. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 1464) AUTHORS Kanno,H., Huang,I.-Y., Kan,Y.W. and Yoshida,A. TITLE Two structural genes on different chromosomes are required for encoding the major subunit of human red cell glucose-6-phosphate dehydrogenase JOURNAL Cell 58, 595-606 (1989) STANDARD simple staff_review COMMENT Draft entry and sequence for [1] kindly submitted by A.Yoshida, 02-MAY-1989. 
FEATURES from to/span description pept 72 1109 glucose-6-phosphate dehydrogenase /nomgen="G6PD" /map="Xq28" /hgml_locus_uid="LH0033J" mRNA < 1 1464 glucose-6-phosphate dehydrogenase mRNA BASE COUNT 331 a 404 c 389 g 340 t ORIGIN 1 ctccccgcgc cgccccgcgc aggcgccccc gccccgccgt cgccgccgcc gcagccagga 61 gccgctgcac catgccccgc atagatgcgg acctcaagct cgacttcaag gacgtcctgc 121 tccgacctaa gcggagcagc ctcaagagcc gagccgaggt ggatcttgaa cgcaccttca 181 cgtttcgaaa ttcaaagcag acctactcag ggattcccat catcgtggcc aacatggaca 241 ctgtgggcac gtttgagatg gcagccgtga tgtcacagca ctccatgttt acagcaattc 301 ataagcatta ctccctggat gactggaagc tctttgccac aaatcaccca gaatgcctgc 361 agaatgtagc cgtgagttca ggcagtgggc agaatgatct ggaaaagatg accagcatcc 421 tggaagctgt gccacaggtt aagtttattt gcctggatgt ggccaatggg tattcagaac 481 attttgtgga attcgtgaaa cttgtccgtg ccaaatttcc tgaacacacc attatggcag 541 ggaacgtggt gacaggagaa atggtagaag agcttattct ttccggagca gatatcatca 601 aagtgggagt tggaccaggt tctgtgtgca ccacccgcac caagacggga gtggggtacc 661 cccagctgag tgccgtcatt gagtgtgccg actctgccca cggcctgaag ggccacatca 721 tctctgatgg aggctgtacg tgtccagggg atgtcgccaa agcctttgga actggagcag 781 attttgtcat gctgggagga atgttttcgg gtcatacgga gtgtgctgga gaagtgattg 841 agaggaacgg acggaagctc aagctcttct acgggatgag ctctgacacc gccatgaaca 901 agcacgcagg aggagttgct gagtacagag cctctgaggg taagactgtg gaagttcctt 961 acaaaggaga tgtggaaaac actatcctgg atattctcgg gggactgagg tccacgtgca 1021 cctacgtggg ggccgccaaa ctcaaggagc tcagcaggag ggcaacattc atccgggtga 1081 cccagcagca caacaccgtg ttcagctaac cctggggaca aagcagcgtc tggctcgatg 1141 gaagcgtcca aacctgcttt tcccatctcc ccccaagtct gttccgtcag agcttctggc 1201 tgctcctgaa tggtggaatg cctgtgtcct ctcttctgtc tcctgccgcc tggaggcttc 1261 ggggctctcc cgcctgcctt ctcggggccc agacgcaagg caccgattgg gccaacatca 1321 gagccctgct gcccagaact cataacctca ttgttcaaac caacacttgc acctttctct 1381 ttttctcttt ctctctccct ttctttgttt ttctttcttt tttaaaagaa gatggtttca 1441 gctttaatat aatgctatta tctt // LOCUS MUSGT1A 2544 bp ss-mRNA ROD 26-JUL-1990 
DEFINITION Mouse glucose transporter 1 mRNA, complete cds. ACCESSION M23384 J04557 KEYWORDS glucose transporter 1. SOURCE Mouse adipocyte cell line 3T3-L1, cDNA to mRNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (sites) AUTHORS Kaestner,K.H., Christy,R.J., McLenithan,J.C., Braiterman,L.T., Cornelius,P., Pekala,P.H. and Lane,M.D. TITLE Sequence, tissue distribution, and differential expression of mRNA for a putative insulin-responsive glucose transporter in mouse 3T3-L1 adipocytes JOURNAL Proc. Natl. Acad. Sci. U.S.A. 86, 3150-3154 (1989) STANDARD simple staff_entry REFERENCE 2 (bases 1 to 2544; for [1]) AUTHORS Kaestner,K.H., Christy,R.J., McLenithan,J.C., Braiterman,L.T., Cornelius,P., Pekala,P.H. and Lane,M.D. JOURNAL Unpublished (1989) STANDARD simple staff_entry COMMENT Draft entry and computer-readable sequence for [2] kindly provided by M.Lane, 28-MAR-1989. FEATURES from to/span description pept 190 1668 glucose transporter 1 BASE COUNT 514 a 719 c 679 g 632 t ORIGIN 1 ttggtcctat aaaaaggcag ctccgcgcgc tctcccccaa gagcagaggc ttgcttgtag 61 agtgacgatc tgagctacgg ggtcttaagt gcgtcagggc gtggaggtct ggcgggagac 121 gcatagttac agcgcgtccg ttctccgtct cgcagccggc acagctagag cttcgagcgc 181 agcgcggcca tggatcccag cagcaagaag gtgacgggcc gcctcatgtt ggctgtggga 241 ggagcagtgc tcggatcact gcagttcggc tataacactg gtgtcatcaa cgccccccag 301 aaggttattg aggagttcta caatcaaaca tggaaccacc gcatcggaga gcccatccca 361 tccaccacac tcaccacgct ttggtctctc tccgtggcca tcttctctgt cgggggcatg 421 attggttcct tctctgtcgg cctctttgtt aatcgctttg gcaggcggaa ctccatgctg 481 atgatgaacc tgttggcctt tgtggctgct gtgcttatgg gcttctccaa actgggcaag 541 tcctttgaga tgctgatcct gggccgcttc atcatcggtg tgtactgcgg cctgactact 601 ggctttgtgc ccatgtatgt gggagaggtg tcacctacag ctctacgtgg agccctaggc 661 acactgcacc agctgggaat cgtcgttggc atccttattg cccaggtgtt tggcttagac 721 tccatcatgg gcaatgcaga cttgtggcct ctgctgctca gtgtcgtctt 
cgtcccagcc 781 ctgctacagt gtatcctgtt gcccttctgc cccgagagcc cccgcttcct gctcatcaat 841 cgtaacgagg agaaccgggc caagagtgtg ctgaagaagc ttcgagggac agccgatgtg 901 acccgagacc tgcaggagat gaaagaagag ggtcggcaga tgatgcggga gaagaaggtc 961 accatcttgg agctgttccg ctcacccgcc taccgccagc ccatcctcat cgctgtggtg 1021 ctgcagctgt cccagcagct gtcgggtatc aatgctgtgt tctactactc aacgagcatc 1081 ttcgagaagg caggtgtgca gcagcctgtg tacgccacca tcggctccgg tatcgtcaac 1141 acggccttca ctgtggtgtc gctgtttgtt gtagagcgag ctggacgacg gaccctgcac 1201 ctcattggcc tggctggcat ggcaggctgt gctgtgctca tgaccatcgc cctggccttg 1261 ctggaacggc tgccttggat gtcctatctg agcatcgtgg ccatctttgg ctttgtggcc 1321 ttctttgaag taggccctgg tcctattcca tggttcattg tggccgagct gttcagccag 1381 gggccccgtc ctgctcgtat tgctgtggct ggcttctcca actggacctc aaacttcatt 1441 gtgggcatgt gcttccagta tgtggagcaa ctgtgcggcc cctacgtctt catcatcttc 1501 acggtgctcc tcgtgctctt cttcatcttc acctacttca aagtccctga gaccaaaggc 1561 cgaaccttcg atgagatcgc ttccggcttc cggcaggggg gtgccagcca aagtgacaag 1621 acacccgagg agctcttcca ccctctgggg gcggactccc aagtgtgagg agccccacac 1681 ccagcccggc ctgctccctg cagcccaagg atctctctgg agcacaggca gctagatgag 1741 acctcttccg aaccgacaga tctcgggcaa gccgggcctg ggcgcctttc ctcagccagc 1801 agtgaagtcc aggaggatat tcaggacttt gatggctcca gaatttttaa tgaaagcaag 1861 actgctgctc agatctattc agataagcag caggttttat aattttttta ttactgattt 1921 tgttattttt tttttttatc agccactctc ctatctccac actgtagtct tcaccttgat 1981 tggcccagtg cctgagggtg gggaccacgc cctgtccaga cacttgcctt ctttgccaag 2041 ctaatctgta gggctggacc tatggccaag gacacactaa taccgaactc tgagctagga 2101 ggctttacgc tggaggcggt agctgccacc cacttccgca ggcctggacc tcggcaccat 2161 aggggtccgg actccatttt aggattcgcc cattcctgtc tcttcctacc caaccactca 2221 attaatcttt ccttgcctga gaccagttgg aagcactgga gtgcagggag gagagggaag 2281 ggccaggctg ggctgccagg ttctagtctc ctgtgcactg agggccacac aaacaccatg 2341 agaaggacct cggaggctga gaacttaact gctgaagaca cggacactcc tgccctgctg 2401 tgtatagatg gaagatattt atatactggt tgtcaatatt aaatacagac actaagttat 2461 
agtatatctg gacaaaccca cttgtaaata caccaacaaa ctcctgtaac tttacctaag 2521 cagatataaa tggctggttt ttag // LOCUS MUSMS6HM 321 bp ds-DNA ROD 26-JUL-1990 DEFINITION M.musculus Ms6-hm locus, repeat elements. ACCESSION J04743 KEYWORDS dispersed repetitive element; minisatellite sequence. SOURCE M.musculus (strain C57BL/6J) DNA, clone pMm3-1. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 321) AUTHORS Kelly,R., Bulfield,G., Collick,A., Gibbs,M. and Jeffreys,A.J. TITLE Characterization of a highly unstable mouse minisatellite locus: Evidence for somatic mutation during early development JOURNAL Genomics 5, 844-856 (1989) STANDARD full staff_entry COMMENT Printed sequence for [1] kindly submitted by R.Kelly, 08-AUG-1989. FEATURES from to/span description rpt 37 168 MT dispersed repetitive element rpt 168 263 tandem repeated element rpt 263 > 321 MT dispersed repetitive element BASE COUNT 84 a 66 c 109 g 62 t ORIGIN 1 gatccccagt gatgtaaacc agactatatg gctaactgtt ttagttagag tttctagttg 61 ctgtgaccaa caccatgacc aaaaagcaag ttggggagga aaggatttat ttgacttaca 121 cttccatata actgttcatc atcaaaagaa atcaggacag aaacccgggg gcagggcagg 181 gcagggcagg gcagggcagg gcagggcagg gcagggcagg gcagggcagg gcagggcagg 241 gcagggcagg gcagggcagg gcagggctga tgtagcgtca ctgaggagtc ctgcttccta 301 ctttgcttcc atgggtggat c // LOCUS RABCYP4A6 1790 bp ss-mRNA MAM 26-JUL-1990 DEFINITION Rabbit cytochrome P450IVA6 (CYP4A6) mRNA, complete cds. ACCESSION M28656 KEYWORDS cytochrome P450; lauric acid omega-hydroxylase. SOURCE Rabbit (strain New Zealand White, adult) kidney, cDNA to mRNA, clone KdA6. ORGANISM Oryctolagus cuniculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Lagomorpha; Leporidae. REFERENCE 1 (bases 1 to 1790) AUTHORS Johnson,E.F., Walker,D.L., Griffin,K.J., Clark,J.E., Okita,R.T., Muerhoff,A.S. and Masters,B.S. 
TITLE Cloning and expression of three rabbit kidney cDNAs encoding lauric acid omega-hydroxylases JOURNAL Biochemistry 29, 873-879 (1989) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by E.F.Johnson, 04-OCT-1989. FEATURES from to/span description pept 14 1546 lauric acid omega-hydroxylase BASE COUNT 341 a 610 c 479 g 360 t ORIGIN 1 gggccgctgc accatgagcg tgtctgcact gaaccccacc cggctcccgg gcagcctctc 61 cgggctcctc caagtggcgg gcctgctggg cctgctcctg ctgctgctca aggcagctca 121 gctctacctg caccgccagt ggctgctcag agccctccag cagttcccgt gcccaccctt 181 ccactggctc ctggggcaca gccgagagtt ccaaaatggc catgagttac aagtgatgct 241 gaaatgggtg gagaaattcc caagtgcttg tcctcgctgg ctatggggga gcagagccca 301 cctcctgatc tatgaccctg actacatgaa ggtgattctg gggagatcag acccaaaagc 361 tcaaggttcc tacagattcc tggctccctg gattgggtat ggtttgctcc tgctgaatgg 421 gcagacgtgg ttccagcacc ggcgcatgct caccccagcc ttccactacg acatcctgaa 481 gccctacgtg gggctcatgg cggactccgt ccaaatcatg ctggacaaat gggagcagct 541 ggtcagccag gactcctccc tggaggtctt ccaagacatc tccctgatga ccctggacac 601 catcatgaag tgtgccttca gccaccaggg cagcgtccag ttggacagga attcccagtc 661 ctacatccag gctgttgggg acctgaacaa cctgttcttt tcccgagtga ggaacgtctt 721 tcatcagagt gacaccatct acaggctgag ccctgaaggc cgcttgtccc accgtgcctg 781 ccagctcgcc cacgagcaca cagaccgagt gatccagcag aggaaggctc agctgcagca 841 ggagggggag ctggagaagg tcaggaggaa gaggcgcttg gacttcctgg acgtcctcct 901 ctttgccaag atggagaacg ggagcagcct gtccgaccag gacctccgcg ccgaggtgga 961 cacgttcatg ttcgagggcc acgacaccac ggccagcggc atctcctgga tcttctatgc 1021 cctggccacg caccccgagc atcagcaccg gtgccgcgag gagatccagg gcctcctggg 1081 ggacggagcc tccatcacct gggagcacct ggaccagatg ccctacacca ccatgtgcat 1141 caaggaggcg ctgagactct acccaccagt gccaggtgtc ggcagacagc tcagctcacc 1201 tgtcaccttc cctgatggac gctccctccc caagggtgtc atagtcacgc tctccatcta 1261 cgcccttcac cacaacccga aggtgtggcc aaacccagag gtgtttgacc ctttcccgtt 1321 cgcaccgggt tctgctcgcc acagccacgc tttcctgccc ttctcaggag 
gaccacggaa 1381 ctgcatcggg aagcaatttg ccatgaatga gctgaaggtg gccgtggccc tgaccctcgt 1441 gcgcttcgag ctgctgccag atcccaaaag agtcccggac caaaaaccac gtcttgtgct 1501 gaagtccagc aacgggatcc acctgcgtct gaggaagctc cgctaaccct ggtggggaca 1561 agagcaggct ctggggcctt ctgccaggcg tcctggcttc ctgtcacctg cccatgcccc 1621 ctgcctgtct gcccacatcc tgctttctat ccaccagcac ttcttccacc tgtctgcctt 1681 gctgcctctt ggcctccagg ctgtctgtcc tctcgcacct tcctctgggc cactgacctg 1741 tctgtctact gtccgcttcc tgccagcatc tctgaccgtg cacctaaccc // LOCUS RABCYP4A7 1694 bp ss-mRNA MAM 26-JUL-1990 DEFINITION Rabbit cytochrome P450IVA7 (CYP4A7) mRNA, complete cds. ACCESSION M28657 KEYWORDS cytochrome P450; lauric acid omega-hydroxylase. SOURCE Rabbit (strain New Zealand White, adult) kidney, cDNA to mRNA, clone KdB18. ORGANISM Oryctolagus cuniculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Lagomorpha; Leporidae. REFERENCE 1 (bases 1 to 1694) AUTHORS Johnson,E.F., Walker,D.L., Griffin,K.J., Clark,J.E., Okita,R.T., Muerhoff,A.S. and Masters,B.S. TITLE Cloning and expression of three rabbit kidney cDNAs encoding lauric acid omega-hydroxylases JOURNAL Biochemistry 29, 873-879 (1989) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by E.F.Johnson, 04-OCT-1989. 
FEATURES from to/span description pept 25 1560 lauric acid omega-hydroxylase BASE COUNT 330 a 564 c 463 g 337 t ORIGIN 1 ggcagatcca gaagctgctg caccatgagc gtgtctgcgc tgagctccac ccggctcccg 61 ggcagcttct ccgggttcct ccaagcggcg gccctgctgg gcctactcct gctgctgctc 121 aaggcagctc agctctacct gcgccgccag tggctgctca gagccctcca gcagttcccg 181 tgcccaccct cccactggct cctggggcac agccgagagt ttccaataga ctcggagctg 241 cagcaggtgc tgaagcgagt ggagaaattc ccaagcgcct gtcctcgctg gctgtggggg 301 agtgagctgt ttctcatttg ctacgaccct gactacatga agacgattct ggggcgatca 361 gacccaaagg ctcgtgtttc ctacagcttc ctggctccct ggattgggta tggcttgctg 421 cttttggaag ggcagacgtg gttccagcac cggcgcatgc tcaccccagc cttccactac 481 gacatcctga agccctacgt ggggctcatg gtggactccg tccaagtgat gctggacaaa 541 ctggagaagc tcgcccgcaa ggacgcgcct ctggagatat acgaacacgt ctccctgatg 601 accctggaaa ccatcatgaa gtgcgccttc agccaccagg gcagcgtcca gctggaaagc 661 aggacctcca aatcctacat ccaggctgtc agggagctca gcgacttggc attgcagcgg 721 gtgaggaacg tctttcacca gagcgacttc ctctacaggc tgagccctga gggccgcttg 781 tcccaccgtg cctgccagct cgcccacgag cacacagacc gagtgatcca gcagaggaag 841 gctcagctgc agcaggaggg ggagctggag aaggtcagga ggaagaggcg cttggacttc 901 ctggacgtcc tcctctttgc caagatggag aacgggagca gcctgtccga ccaggacctc 961 cgcgccgagg tggacacgtt catgttcgag ggccacgaca ccacggccag cggcatctcc 1021 tggatcttct atgccctggc cacgcacccc gagcatcagc accggtgccg cgaggagatc 1081 cagggtctcc tgggggacgg agcctccatc acctgggagc acctggacaa gatgccctac 1141 accaccatgt gcatcaagga ggcgctgaga ctctacccac cggtgccagg tgtcggcagc 1201 aagctcagct cacctgtcac cttccctgat ggacgctccc tccccaaggg catcataatc 1261 acactctcca tctatggcct gcatcacaac ccgaaggtgt ggccaaaccc agaggtgttt 1321 gacccttccc gcttcgcacc gggttctgct cgccacagcc acgctttcct gcccttctca 1381 ggaggatcga ggaactgcat cgggaaacaa tttgccatga acgagctgaa ggtggccgtg 1441 gccctgaccc tcgtgcgctt cgagctgctg ccggatccca ccagagtccc catccccata 1501 acaagacttg tgctgaagtc taagaatggg attcacctac gtctcaggaa gctccactaa 1561 ccctgctgga aacaagaatg gtctgccagg cgtcctctct 
tcctgtcacc tgcccgtgtc 1621 ccgcactctg tctgtatctt gctttctctc tacctacctg cccttcttcc acctgcctcc 1681 gattcggcct tttg // LOCUS HUMRGIT 1095 bp ds-DNA PRI 26-JUL-1990 DEFINITION Human rRNA gene internal transcribed spacer 1 (ITS1). ACCESSION M36624 KEYWORDS internal transcribed spacer. SOURCE Human DNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 1095) AUTHORS Gonzalez,I.L., Sylvester,J.E., Smith,T.F., Stambolian,D. and Schmickel,R.D. TITLE Ribosomal RNA gene sequences and hominoid phylogeny JOURNAL Mol. Biol. Evol. 7, 203-219 (1990) STANDARD simple staff_entry FEATURES from to/span description BASE COUNT 83 a 429 c 441 g 139 t 3 others ORIGIN 1 acggagcccg gagggcgagg cccgcggcgg cgccgccgcc gccgcgcgct tccctccgca 61 cacccacccc cccaccgcga cgcggcgcgt gcgcgggcgg ggcccgcgtg cccgttcgtt 121 cgctcgctcg ttcgttcgcc gcccggcccc gccgccgcga gagccgagaa ctcgggaggg 181 agacgggggg gagagagaga gagagagaga gagagagaga gagagagaga gaaagaaggg 241 cgtgtcgttg gtgtgcgcgt gtcgtggggc cggcgggcgg cggggagcgg tccccggccg 301 cggccccgac grcgtgggtg tcggcgggcg cgggggcggt tctcggcggc gtcgcggcgg 361 gtctgggggg gtctcggtgc cctcctcccc gccggggccc gtcgtccggc cccgccgcgc 421 cggctccccg tcttcggggc cggccggatt cccgtcgcct ccgccgcgcc gctccgcgcc 481 gccgggcacg gccccgctcg ctctccccgg ccttcccgct agggcgtctc gagggtcggg 541 ggccggacgc cggtcccctc ccccgcctcc tcgtccgccc ccccgccgtc caggtaccta 601 gcgcgttccg gcgcggaggt ttaaagaccc cttgggggga tcgcccgtcc gcccgtgggt 661 cgggggcggt ggtgggcccg cgggggagtc ccgtcgggag gggcccggcc cctcccgcgc 721 ctccaccgcg gactccgctc cccggccggg gccgcgccgc cgccgmcgcc gcggcggccg 781 tcgggtgggg gctttacccg gcggccgtcg cgcgcctgcc gcgcgtgtgg cgtgcgcccc 841 gcgccgtggg ggcgggaacc cccgggcgcc tgtggggtgg tgtccgcgct cgcccccgcg 901 tgggcggcgc gcgcctcccc gtggtgtgaa accttccgac ccctctccgg agtccggtcc 961 cgtttgctgt ctcgtctggc cggcctgagg caaccccctc tcctcttggg cggggggggs 1021 ggggggacgt gccgcgccag gaagggcctc 
ctcccggtgc gtcgtcggga gcgccctcgc 1081 caaatcgacc tcgta // LOCUS MHV1NP 1670 bp ss-RNA VRL 26-JUL-1990 DEFINITION Mouse hepatitis virus nucleocapsid (N-MHV1) RNA, complete cds. ACCESSION M35253 KEYWORDS N protein; RNA binding viral structural protein; nucleocapsid protein. SOURCE Mouse hepatitis virus (strain 1), cDNA to viral RNA. ORGANISM Mouse hepatitis virus Viridae; ss-RNA enveloped viruses; Positive strand RNA virus; Coronaviridae. REFERENCE 1 (bases 1 to 1670) AUTHORS Parker,M.M. and Masters,P.S. TITLE Sequence comparison of the N genes of five strains of the coronavirus mouse hepatitis virus suggests a three domain structure for the nucleocapsid protein JOURNAL Virology (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by P.S.Masters, 19-JUN-1990. FEATURES from to/span description pept 1 1368 hepatitis virus nucleocapsid (N-MHV1) ORF 1 pept 65 688 hepatitis virus nucleocapsid (N-MHV1) ORF 2 BASE COUNT 497 a 364 c 426 g 383 t ORIGIN 1 atgtcttttg ttcctgggca agaaaatgcc ggtagcagaa gctcctctgt aaaccgcgct 61 ggtaatggaa tcctcaagaa gaccacttgg gctgaccaaa ccgagcgtgg accaaataat 121 caaaatagag gcagaaggaa tcagccaaag cagactgcaa ctactcaacc caattccggg 181 agtgtggttc cccattactc ttggttttcg ggcattaccc aatttcagaa gggaaaagag 241 tttcagtttg cacaaggaca gggagtgcct attgccaacg gaatcccagc ttcagagcaa 301 aagggatatt ggtatagaca caaccgacgg tcttttaaaa cacctgatgg ccagcagaag 361 cagctactgc ccagatggta tttttactat cttggaacag ggccccatgc tggcgcagag 421 tatggcgacg atatcgacgg agttgtctgg gtcgcaagcc aacaggccga cactaagacc 481 actgccgata ttgttgaaag ggacccaagt agccatgagg ctattcctac taggtttgcg 541 cccggtacgg tattgcctca aggtttttat gttgaaggct caggaaggtc tgcacctgct 601 agtcgatctg gttcgcggtc acaatcccgt gggccaaata atcgcgctag aagcagctcc 661 aaccagcgcc agcctgcctc tactgtaaaa cctgatatgg ccgaagaaat tgctgctctt 721 gttttggcta agctcggtaa agatgccggc cagcccaagc aagtaacaaa gcaaagcgcc 781 aaagaagtca ggcagaaaat tttaaacaag cctcgtcaaa agaggactcc aaacaagcag 
841 tgccctgtgc agcagtgttt tggaaagaga ggccccaatc agaattttgg aggctctgaa 901 atgttaaaac ttggaactag tgatccacag ttccccattc ttgcagagtt ggccccaaca 961 cctagtgcct tcttctttgg atctaaatta gaattggtca aaaagaactc tggtggtgct 1021 gatgacccca ccaaagatgt gtatgaattg cagtattcag gtgcaattag atttgatagt 1081 actctcccag gatttgagac tatcatgaaa gtgttgaatg agaatttgga tgcctaccag 1141 gatcaagctg gtggtgcaga tgtagtgagc ccaaagcccc aaagaaagag agggacaaaa 1201 caaaaggctc tgaaaggtga agtagataat gtaagcgttg caaagcccaa aagctctgtg 1261 cagcgaaatg taagtagaga attaacccct gaggatcgta gtctgttggc tcagatcctt 1321 gatgatggcg ttgtgcctga tgggttagaa gatgactcta atgtgtaaag agaatgaatc 1381 ctatgtcggc actcggtggt aacccctcgc gagaaagtcg ggataggaca ctctctatca 1441 gaatggatgt cttgctgtca taacagatag agaaggttgt ggcagaccct gtatcaatta 1501 gttgaaagag attgcaaaat agagaatgtg tgagagaagt tagcaaggtc ctacgtctaa 1561 ccataagaac ggcgataggc gcccccctgg gaagagctca catcagggta ctattcctgc 1621 aatgccctag taaatgaatg aagttgatca tggccaattg gaagaatcac // LOCUS MHV3NP 1666 bp ss-RNA VRL 26-JUL-1990 DEFINITION Mouse hepatitis virus nucleocapsid (N-MHV3) RNA, complete cds. ACCESSION M35254 KEYWORDS N protein; RNA binding viral structural protein; nucleocapsid protein. SOURCE Mouse hepatitis virus (strain 3), cDNA to viral RNA. ORGANISM Mouse hepatitis virus Viridae; ss-RNA enveloped viruses; Positive strand RNA virus; Coronaviridae. REFERENCE 1 (bases 1 to 1666) AUTHORS Parker,M.M. and Masters,P.S. TITLE Sequence comparison of the N genes of five strains of the coronavirus mouse hepatitis virus suggests a three domain structure for the nucleocapsid protein JOURNAL Virology (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by P.S.Masters, 19-JUN-1990. 
FEATURES from to/span description pept 1 1365 hepatitis virus nucleocapsid (N-MHV3) ORF 1 pept 65 688 hepatitis virus nucleocapsid (N-MHV3) ORF 2 BASE COUNT 494 a 358 c 432 g 382 t ORIGIN 1 atgtcttttg ttcctgggca agaaaatgcc ggtggcagaa gctcctctgg aaaccgcgct 61 ggtaatggaa tcctcaagaa gaccacttgg gctgaccaaa ccgagcgtgg accaaataat 121 caaaatagag gcagaaggaa tcagccaaag cagactgcaa ctactcaacc caactccggg 181 agtgtggttc cccattactc ctggttttct ggcattaccc agttccaaaa gggaaaggag 241 tttcagtttg cagaaggaca aggagtgcct attgccaatg gaatccccgc ttcagagcaa 301 aagggatatt ggtatagaca caaccgccgt tcttttaaaa cacctgatgg gcagcagaag 361 caattactgc ccagatggta tttttactat cttggcacag ggccccatgc tggagccagt 421 tatggagaca gcattgaagg agtcttctgg gttgcaaaca gccaagcgga caccaatacc 481 cgctctgata ttgtcgaaag ggacccaagc agtcatgagg ctattcctac taggtttgcg 541 cccggcacgg tattgcctca gggcttttat gttgaaggct ctggaaggtc tgcacctgct 601 agccgatctg gttcgcggtc acaatcccgt gggccaaata atcgcgctag aagcagttcc 661 aaccagcgcc agcctgcctc tactgtaaaa cctgatatgg ccgaagaaat tgctgctctt 721 gttttggcta agctcggtaa agatgccggc cagcccaagc aagtaacgaa gcaaagtgcc 781 aaagaagtca ggcagaaaat tttaaacaag cctcgccaaa agaggactcc aaacaagcag 841 tgcccagtgc agcagtgttt tggaaagaga ggccccaatc agaattttgg aggctctgaa 901 atgttaaaac ttggaactag tgatccacag ttccccattc ttgcagagtt ggctccaaca 961 gttggtgcct tcttctttgg atctaaatta gaattggtca aaaagaattc tggtggtgct 1021 gatgaaccca ccaaagatgt gtatgagctg caatattcag gtgcagttag atttgatagt 1081 actctacctg gttttgagac tatcatgaaa gtgttgaatg agaatttgaa tgcctaccag 1141 aaggatggtg gtgcagatgt ggtgagccca aagccccaaa gaaaagggcg tagacaggct 1201 caggaaaaga aagatgaagt agataatgta agcgttgcaa agcccaaaag ctctgtgcag 1261 cgaaatgtaa gtagagaatt aaccccagag gatagaagtc tgttggctca gatccttgat 1321 gatggcgtag tgccagatgg gttagaagat gactctaatg tgtaaagaga atgaatccta 1381 tgtcggcgct cggtggtaac ccctcgcgag aaagtcggga taggacactc tctatcagaa 1441 tggatgtctt gctgtcataa cagatagaga aggttgtggc agaccctgta tcaattagtt 1501 gaaagagatt gcaaaataga gaatgtgtga gagaagttag 
caaggtccta cgtctaacca 1561 taagaacggc gataggcgcc ccctgggaag agctcacatc agggtactat tcctgcaatg 1621 ccctagtaaa tgaatgaagt tgatcatggc caattggaag aatcgc // LOCUS MHVA59NP 1666 bp ss-RNA VRL 26-JUL-1990 DEFINITION Mouse hepatitis virus nucleocapsid (N-MHVA59) RNA, complete cds. ACCESSION M35256 KEYWORDS N protein; RNA binding viral structural protein; nucleocapsid protein. SOURCE Mouse hepatitis virus (strain A59), cDNA to viral RNA. ORGANISM Mouse hepatitis virus Viridae; ss-RNA enveloped viruses; Positive strand RNA virus; Coronaviridae. REFERENCE 1 (bases 1 to 1666) AUTHORS Parker,M.M. and Masters,P.S. TITLE Sequence comparison of the N genes of five strains of the coronavirus mouse hepatitis virus suggests a three domain structure for the nucleocapsid protein JOURNAL Virology (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by P.S.Masters, 19-JUN-1990. FEATURES from to/span description pept 1 1365 hepatitis virus nucleocapsid (N-MHVA59) ORF 1 pept 65 688 hepatitis virus nucleocapsid (N-MHVA59) ORF 2 BASE COUNT 497 a 355 c 433 g 381 t ORIGIN 1 atgtcttttg ttcctgggca agaaaatgcc ggtagcagaa gctcctctgg aagccgctct 61 ggtaatggaa tcctcaagaa gaccacttgg gctgaccaaa ccgagcgcgc tggaaataat 121 ggaaatagag gcagaaggaa tcagccaaag cagactgcaa ctactcaacc caattccggg 181 agtgtggttc cccattactc ttggttttcg ggcattaccc aattccagaa gggaaaagag 241 tttcagtttg tacaaggaca gggagtgcct attgccaatg gaatcccagc ttcagagcaa 301 aagggatatt ggtatagaca caaccgacgt tcttttaaaa cacctgatgg ccagcagaag 361 cagctactgc ccagatggta tttttactat ctcggaacag ggccccatgc tggcgcagag 421 tatggcgacg atatcgaagg agttgtctgg gtcgcaagcc aacaggccga cactaagacc 481 actgccgata ttgttgaaag ggacccaagt agccatgagg ctattcctac taggtttgcg 541 cccggtacgg ttttgcctca gggtttttat gttgaaggct caggaaggtc tgcacctgct 601 agccgatctg gttcgcggtc acaatcccgt gggccaaata atcgcgctag aagcagctcc 661 aaccagcgcc agcctgcctc tactgtaaaa cctgatatgg ccgaagaaat tgctgctctt 721 gttttggcta agctcggtaa agatgccggt 
cagcccaagc aagtaacaaa gcaaagtgcc 781 aaagaagtca ggcagaaaat tttaaacaag cctcgtcaaa agaggactcc aaacaagcag 841 tgcccagtgc agcaatgttt tggaaagaga ggccccaatc agaattttgg aggctctgaa 901 atgcttaaac ttggaactag tgatccacag ttccccattc ttgcagagtt ggccccaaca 961 gctggtgcct tcttctttgg atctaaatta gaattggtca aaaagaactc tggtggtgct 1021 gatgaaccca ccaaagatgt gtatgagctg caatattcag gtgcagttag atttgatagt 1081 actctacctg gttttgagac tatcatgaaa gtgttgaatg agaatttgaa tgcctaccag 1141 aaggatggtg gtgcagatgt agtgagccca aagccccaaa gaaaagggcg tagacaggct 1201 caggaaaaga aagatgaagt agataatgta agcgttgcaa agcccaaaag ctctgtgcag 1261 cgaaatgtaa gtagagaatt aaccccagag gatagaagtc tgttggctca gatcctagat 1321 gatggcgtag tgccagatgg gttagaagat gactctaatg tgtaaagaga atgaatccta 1381 tgtcggcgct cggtggtaac ccctcgcgag aaagtcggga taggacactc tctatcagaa 1441 tggatgtctt gctgtcataa cagatagaga aggttgtggc agaccctgta tcaattagtt 1501 gaaagagatt gcaaaataga gaatgtgtga gagaagttag caaggtccta cgtctaacca 1561 taagaacggc gataggcgcc ccctgggaag agctcacatc agggtactat tcttgcaatg 1621 ccctagtaaa tgaatgaagt tgatcatggc caattggaag aatcac // LOCUS MHVSHV 1666 bp ss-RNA VRL 26-JUL-1990 DEFINITION Mouse hepatitis virus nucleocapsid (N-MHVS) RNA, complete cds. ACCESSION M35255 KEYWORDS N protein; RNA binding viral structural protein; nucleocapsid protein. SOURCE Mouse hepatitis virus (strain S), cDNA to viral RNA. ORGANISM Mouse hepatitis virus Viridae; ss-RNA enveloped viruses; Positive strand RNA virus; Coronaviridae. REFERENCE 1 (bases 1 to 1666) AUTHORS Parker,M.M. and Masters,P.S. TITLE Sequence comparison of the N genes of five strains of the coronavirus mouse hepatitis virus suggests a three domain structure for the nucleocapsid protein JOURNAL Virology (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by P.S.Masters, 19-JUN-1990. 
FEATURES from to/span description pept 1 1365 hepatitis virus nucleocapsid (N-MHVS) ORF 1 pept 65 688 hepatitis virus nucleocapsid (N-MHVS) ORF 2 BASE COUNT 494 a 357 c 430 g 385 t ORIGIN 1 atgtcttttg ttcctgggca agaaaatgcc ggtggcagaa gctcctctgt aaaccgcgct 61 ggtaatggaa tcctcaagaa gaccacttgg gctgaccaaa ccgagcgtgg accaaataat 121 caaaatagag gcagaaggaa tcagccaaag cagactgcaa ctactcaacc caactccggg 181 agtgtggttc cccattactc ctggttttct ggcattaccc agttccaaaa gggaaaggag 241 tttcagtttg cagaaggaca aggagtgcct attgccaatg gaatccccgc ttcagagcaa 301 aagggatatt ggtatagaca caaccgccgt tcttttaaaa cacctgatgg gcagcagaag 361 caattactgc ccagatggta tttttactat cttggcacag ggccccatgc tggagccagt 421 tatggagaca gcattgaagg tgtcttctgg gttgcaaaca gccaagcgga caccaatacc 481 cgctctgata ttgtcgaaag ggacccaagc agtcatgagg ctattcctac taggtttgcg 541 cccggcacgg tattgcctca gggcttttat gttgaaggct ctggaaggtc tgcacctgct 601 agccgatctg gttcgcggtc acaatcccgt gggccaaata atcgcgctag aagcagttcc 661 aaccagcgcc agcctgcctc tactgtaaaa cctgatatgg ccgaagaaat tgctgctctt 721 gttttggcta agctcggtaa agatgccggc cagcccaagc aagtaacgaa gcaaagtgcc 781 aaagaagtca ggcagaaaat tttaaacaag cctcgccaaa agaggactcc aaacaagcag 841 tgcccagtgc agcagtgttt tggaaagaga ggccccaatc agaattttgg aggctctgaa 901 atgttaaaac ttggaactag tgatccacag ttccccattc ttgcagagtt ggctccaaca 961 gttggtgcct tcttctttgg atctaaatta gaattggtca aaaagaattc tggtggtgct 1021 gatgaaccca ccaaagatgt gtatgagctg caatattcag gtgcagttag atttgatagt 1081 actctacctg gttttgagac tatcatgaaa gtgttgaatg agaatttgaa tgcctaccag 1141 aaggatggtg gtgcagatgt ggtgagccca aagccccaaa gaaaagggcg tagacaggct 1201 caggaaaaga aagatgaagt agataatgta agcgttgcaa agcccaaaag ctctgtgcag 1261 cgaaatgtaa gtagagaatt aaccccagag gatagaagtc tgttggctca gatccttgat 1321 gatggcgtag tgccagatgg gttagaagat gactctaatg tgtaaagaga atgaatccta 1381 tgtcggcgct cggtggtaac ccctcgcgag aaagtcggga taggacactc tctatcagaa 1441 tggatgtctt gctgtcataa cagatagaga aggttgtggc agaccctgta tcaattagtt 1501 gaaagagatt gcaaaataga gaatgtgtga gagaagttag 
caaggtccta cgtctaacca 1561 taagaacggc gataggcgcc ccctgggaag agctcacatc agggtactat tcttgcaatg 1621 ccctagtaaa tgaatgaagt tgatcatggc caattggaag aatcac // LOCUS MXAFRZGF 2999 bp ds-DNA BCT 26-JUL-1990 DEFINITION M.xanthus frzG and frzF genes, complete cds. ACCESSION M35200 KEYWORDS FrzF protein; FrzG protein; methylesterase; methyltransferase. SOURCE M.xanthus (strain DZF1) DNA. ORGANISM Myxococcus xanthus Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Myxobacteria; Myxococcaceae. REFERENCE 1 (bases 1 to 2999) AUTHORS McCleary,W.R., McBride,M.J. and Zusman,D.R. TITLE Developmental sensory transduction in Myxococcus xanthus involves methylation and demethylation of FrzCD JOURNAL J. Bacteriol. (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by W.R.McCleary, 15-JUN-1990. FEATURES from to/span description pept 120 1124 FrzG protein pept / 1117 2899 FrzF protein (AA at 2) BASE COUNT 383 a 990 c 1134 g 492 t ORIGIN 1 ggatgccggc gcggacgcgt acctcgtcaa gggcgagctg ggcgtggagg ttctcgcgca 61 ggccatcgac cggctgacct gaggagccag gcttgggcgg tggcgcggta gtcgcaggaa 121 tggcgtttcg ggtgctcatg gtgggcaagg ggctgcgtgc gctcgcggcc cggggcctgt 181 tcgatgggga atccctggtg cccgtggggc cggcggaggt ggacttcgcc ggcgccctgg 241 tggccgtgca gcggcacttc ccggacgtgg tgctggtgga cctgagcgcg ctggacgcgc 301 tgcccgccat cgagcacgtc atggtggagc ggcccgtgcc ggtgctggcg ttgcaccccg 361 gcgtgttgtc cggccaggag gccttccagg cgatggtggc gggcgcgctg gacgtgctgg 421 agcgtccggc gaaccccggg cccgagttct ggacgcacgt gtcgcgcaag ctggtgctgc 481 tggcgcaggt gaaggcggtg cggcaggtgc agacgcggcc gccaccgcaa gcggcgcgtg 541 aggcgaagcc gcctcctccg tatccgctgg tggccatcgc cgcgtccctg ggtggcccca 601 aggcggtggc gcaggtgctg cggatgattc cgcgcgcctt cccggcgccc atcgcctact 661 gccagcacat cagcgacggt ttcacggaag ggctggcgca ctggttgtcc aatgaaacgg 721 cgctgcgcgt gctggaggcc gagcatgacg tgctcatggc gccgggcacg gtgtacatcg 781 ctccgtcggg cagtcacctc ttggtccgac ccgagggcag gttggagctg gacgcgggcc 841 ccgcgcttcg cggtttccgg 
ccgtcctgtg acatgctgct gacttcagcg ggtgagtcgt 901 tcggcccgcg ctgcatcggg gtcatcctga cgggcatggg gcgcgacggg gcgcgagggt 961 tgaaggagat tcgagagcgc ggcggtcgga ccattgccca ggacgaagcg tcgagcgtcg 1021 tctggggcat gccgcgcgag gcggtgttga tgggcgcggc gcacgaggtg ctgccactga 1081 gccggattgg cgcggcgctg atgcagtggg tggatgtgtg ttgacggcga gccagaaagt 1141 cttgcaacaa ctcgcggcgc tgctgctgga gcgcgcgggg ctgaaaatca cgccggatgg 1201 cttccacagc ctccgactgg cgctgtccgc gcggatgccc gtgctggggc tggaagagcc 1261 cgagcactac atccagcgac tgacgggcgc cggtggcgaa gaggagctgc gctcgctgtt 1321 gccgctggtg acggtggggc acacggagtt cttccgcgac gcgaagcagt tccgcgcgct 1381 ggagaagagc gtgctgccgg acctggtgtc ccgttcgcgg cgcgagatgc gcaaggtgtc 1441 catctggtcc gcgggctgcg cgacggggga ggagccctac agcctggcca tggtgctggc 1501 ggagctgggc gcgctgtcgc tggaggtgga cctgtgggcc accgacctca acctggccgc 1561 ggtggaggcc gcgaagcagg ggcgcttcac ctcgcggcgg gccatcagca tcaaccaggc 1621 gcggctgacg cgcttcttca agcccgtgga agagggctat gaggcgctgc ccgcgctgcg 1681 tgagtacatc cgcttcgatg gacagaacct ggcggttccc gtcttcgaca aggtggccct 1741 gtcgtcgctg gacctcatcc tctgccgcaa cgtcatcatc tacttcgacc tgcccaccat 1801 ccgcgggttg atggaccgct tcctcgccgc gctgcggccg ggcgggctgt tgttcctggg 1861 gtactcggag agcctcttca aggtctacga ccgcttcgag atgatcgaag tcgatggggc 1921 gttcgtgtac cgccgcccgc tgaacgacaa gagcatgcgg gcgccgccgc tgcgcatcac 1981 cccgtatcct ggcgagcccg atgtcgccgc gcgcaggccc gtgcctgcgg acgcgttcac 2041 cgcggacctg cgcaagcgga tgctgcccga ggacgtcccg ttgacgacgc ggctgcccgc 2101 ggtgtcagcg tcgtcggtgg cggcgcctgg ctcgcccagc gtgacgctgc cggcgctggg 2161 ggcctcttcg agtccgcgtt ccgtggtgcc ggggcggctg cccgcggtgt cgcctcactc 2221 gccgctgccg gccatcgccg cgcgctcgcg tgtcaccgcg gagttgccca cggtgggaag 2281 cgtggactcc gcccgtccgc gcatcaccac cgagctgccg gccgtggcca ccacgccgcg 2341 cgcgcccacc gtggaggtgc ccgcctggcc cacgctgctg cctccggcgg agcggctggc 2401 catggcggtg cggaagatgg cgcaggggga tttctcggcg gccatcgctg gcgtgcagcg 2461 gctgctcgcg gacgagccca gtgacttgga tgggctgctg acgctgggca acctgttctc 2521 gctcaccggc cgcatccccg aggcgcgcga 
ggccttcgcg caggccattc agcgcgagcc 2581 gctgtgcgtg gaggcgcggg tgttcggcgg ggtcgccgcg ttgcaagcag gggagttgag 2641 cgaggcgcgc tccgagctga gcaaggccct gttcctggag cccacgctgg ccattggcca 2701 ctacctgctg gcgcaggtgc acgagcgcac gcaggaccat gaggcggccc gccgcagcta 2761 ccgcaacgcc attgcccagc ttcgcttccc gcagcgtccc ctcgcggggc actacccgga 2821 gatgccggac tcggcggatg ccatctctcg cgcggcgcgt tacgccctgg ccgcgctgga 2881 ggagcagccc ctgcgctgag gcaggggccg cgtcccaggc ttcacgtcag tccaggctgc 2941 tcttcacctg gtccaggctc ttgctcgggt cgagcacgga gccgaacttc ttctgcagg // LOCUS ECOSFIM 762 bp ds-DNA BCT 26-JUL-1990 DEFINITION E.coli S-fimbrial protein (sfaA) gene, complete cds. ACCESSION M35273 KEYWORDS S-fimbrial protein. SOURCE E.coli (strain 536) DNA. ORGANISM Escherichia coli Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Facultatively anaerobic rods; Enterobacteriaceae. REFERENCE 1 (bases 1 to 762) AUTHORS Schmoll,T., Hacker,J. and Goebel,W. TITLE Nucleotide sequence of the sfaA gene coding for the S-fimbrial protein subunit of Escherichia coli JOURNAL FEMS Microbiol. Lett. 
41, 229-235 (1987) STANDARD simple staff_review FEATURES from to/span description pept 166 708 S-fimbrial protein precursor sigp 166 237 S-fimbrial protein signal peptide matp 238 705 S-fimbrial protein BASE COUNT 218 a 140 c 172 g 232 t ORIGIN 1 gaaaatatta tcggagataa tgtcataaat gctgcctgag tgtatttctc acattgcatt 61 tatgaagttc tcctgaaaaa agattcccgt cgttcgggat attgattgtg tctgttgtga 121 tgacagatac ggtgtgcgta gttcaattaa aaacaggaat taaatatgaa gttaaaattc 181 atctccatgg ctgtattttc agccctgacc ttgggtgttg cgacaaatgc gtctgctgtc 241 accacggtta atggtggtac agttcatttt aagggggaag ttgttgatgc tgcatgtgct 301 gtaaacacta attcagcaaa tcaaacgttt tctgggcaag ttcgttcagc taagttggcg 361 aatgatggag agaagagttc ccctgttgga tttagtattg aacttaatga ctgtagttct 421 gcaactgccg ggcatgcatc aattatcttt gcaggaaatg ttattgctac acacaatgat 481 gtgctgtctc tacagaatag tgctgcaggt agtgcaacaa atgtaggtat tcagatattg 541 gatcatacag gtactgcagt tcaatttgac ggagtgactg catctacaca atttacatta 601 acagatggca ccaataaaat tcctttccag gcagtttatt atgcaacagg taagtcaacg 661 cctggtattg ccaacgccga cgccaccttt aaagttcagt accagtaata tcagaacagt 721 gtaacgatat atacccggcc aggagggctg tttttatcat gc // LOCUS ECOSRNB 655 bp ds-DNA BCT 26-JUL-1990 DEFINITION F plasmid (from E.coli) stable RNA degradation promoter (srnB) gene, complete cds. ACCESSION M35279 KEYWORDS . SOURCE F plasmid (from E.coli) DNA. ORGANISM Plasmid F Prokaryota; Bacteria. REFERENCE 1 (bases 1 to 655) AUTHORS Akimoto,S., Ono,K., Ono,T. and Ohnishi,Y. TITLE Nucleotide sequence of the F plasmid gene srnB that promotes degradation of stable RNA in Escherichia coli JOURNAL FEMS Microbiol. Lett. 33, 241-245 (1986) STANDARD simple staff_review FEATURES from to/span description pept 251 457 stable RNA degradation promoter (srnB) signal 127 132 -35 region signal 150 155 -10 region signal 466 485 transcription termination signal (put.) 
binding 242 245 ribosome binding site BASE COUNT 163 a 156 c 173 g 163 t ORIGIN 1 aattcccatt ctggaccagc gggagcatac gaacaataat ttacggtttc gcgctatagc 61 tggctcaagt taggttggac cctgaatctc cagacaacca atatctgatc gcgccagtgg 121 tggcagttat taagcaacag ggaatgtggt attatcgcgg cgggtgtctg agcctttctg 181 gttcaggcaa gacgcaggta ccagaaatgc gaagacccca cttgttaatc cattaactcg 241 tgaggtctgc atgaagtacc ttaacactac tgattgtagc ctcttccttg cagagaggtc 301 aaagtttatg acgaaatatg cccttatcgg gttgctcgcc gtgtgcgcta cggtgttgtg 361 tttttcactg atattcaggg aacggttatg tgagctgaat attcacaggg gaaatacagt 421 ggtgcaggta actctggcct acgaagcacg gaagtaagct gccgggcggg gacggaagtc 481 cccgctttcc ggaagtgtga ggtatttcag gggcagacac ccgacatgcc agaaacagcc 541 ggtcccgccc ggggccggca cccaggttca ggcatttcct gcttttcagt catttcatta 601 tcaaaatcac attaaacggt cgtaatcaga catgatttgt gcgccaacac agatc // LOCUS HUMTIMP2 1062 bp ss-mRNA PRI 26-JUL-1990 DEFINITION Human metalloproteinase-2 inhibitor (TIMP-2) mRNA, complete cds. ACCESSION J05593 KEYWORDS metalloproteinase-2 inhibitor. SOURCE Human melanoma cell line A2058, cDNA to mRNA, clone pT2-M01. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 1062) AUTHORS Stetler-Stevenson,W.G., Brown,P.D., Onisto,M., Levy,A.T. and Liotta,L.A. TITLE Tissue inhibitor of metalloproteinases-2 (TIMP-2) mRNA expression in tumor cell lines and human tumor tissues JOURNAL J. Biol. Chem. (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable [or printed] sequence for [1] kindly submitted by W.G.Stetler-Stevenson, 19-JUN-1990. 
FEATURES from to/span description pept 271 933 metalloproteinase-2 inhibitor precursor sigp 271 348 metalloproteinase-2 inhibitor signal peptide matp 349 930 metalloproteinase-2 inhibitor BASE COUNT 222 a 370 c 316 g 154 t ORIGIN 1 ggggccgccg agagccgcag cgccgctcgc ccgccgcccc ccaccccgcc gccccgcccg 61 gcgaattgcg ccccgcgccc tcccctcgcg cccccgagac aaagaggaga gaaagtttgc 121 gcggccgagc gggcaggtga ggagggtgag ccgcgcggag gggcccgcct cggccccggc 181 tcagcccccg cccgcgcccc cagcccgccg ccgcgagcag cgcccggacc ccccagcggc 241 ggccccgccc gcccagcccc ccggcccgcc atgggcgccg cggcccgcac cctgcggctg 301 gcgctcggcc tcctgctgct ggcgacgctg cttcgcccgg ccgacgcctg cagctgctcc 361 ccggtgcacc cgcaacaggc gttttgcaat gcagatgtag tgatcagggc caaagcggtc 421 agtgagaagg aagtggactc tggaaacgac atttatggca accctatcaa gaggatccag 481 tatgagatca agcagataaa gatgttcaaa gggcctgaga aggatataga gtttatctac 541 acggccccct cctcggcagt gtgtggggtc tcgctggacg ttggaggaaa gaaggaatat 601 ctcattgcag gaaaggccga gggggacggc aagatgcaca tcaccctctg tgacttcatc 661 gtgccctggg acaccctgag caccacccag aagaagagcc tgaaccacag gtaccagatg 721 ggctgcgagt gcaagatcac gcgctgcccc atgatcccgt gctacatctc ctccccggac 781 gagtgcctct ggatggactg ggtcacagag aagaacatca acgggcacca ggccaagttc 841 ttcgcctgca tcaagagaag tgacggctcc tgtgcgtggt accgcggcgc ggcgcccccc 901 aagcaggagt ttctcgacat cgaggaccca taagcaggcc tccaacgccc ctgtggccaa 961 ctgcaaaaaa agcctccaag ggtttcgact ggtccagctc tgacatccct tcctggaaac 1021 agcatgaata aaacactcat cccatgggtc caaattaata tg // LOCUS ALREV1 717 bp ss-RNA VRL 26-JUL-1990 DEFINITION Rous sarcoma defective endogenous virus ev-1 locus gag polyprotein RNA, 5' end. ACCESSION M30517 KEYWORDS gag polyprotein. SOURCE Rous sarcoma defective endogenous virus (strain Prague C), cDNA to viral RNA, clone pGD27. ORGANISM Rous sarcoma virus Viridae; ss-RNA enveloped viruses; Positive strand RNA virus; Retroviridae; Oncovirinae; Type C oncovirus group; Avian sarcoma viruses. REFERENCE 1 (bases 1 to 717) AUTHORS Vogt,V.M., Pepinsky,R.B. and Southard,L.E. 
TITLE Primary structure of p19 species of avian sarcoma and leukemia viruses JOURNAL J. Virol. 56, 31-39 (1990) STANDARD full staff_review FEATURES from to/span description pept 1 > 717 gag polyprotein matp 1 465 p19 protein matp 466 531 p2 protein matp 532 717 pp10 protein BASE COUNT 161 a 173 c 246 g 137 t ORIGIN 1 atggaagccg tcataaaggt gatttcgtcc gcgtgtaaaa cctattgcgg gaaaacctct 61 ccttctaaga aggaaatagg ggccatgttg tccctgttac aaaaggaagg gttgcttatg 121 tctccctcag acttatattc cccggggtcc tgggatccca ttaccgcggc gctctcccag 181 cgggcaatgg tacttgggaa atcgggagag ttaaaaacct ggggattggt tttgggggca 241 ttgaaggcgg ctcgagagga acaggttaca tctgagcaag caaagttttg gttgggatta 301 gggggaggga gggtctctcc cccaggtccg gagtgcatcg agaaaccagc aacggagcgg 361 cgaatcgaca aaggggagga agtgggagaa acaactgcgc agcgagatgc gaagatggcg 421 ccggagaaaa tggccacacc taaaaccgtt ggcacatcct gctatcagtg cggaacagct 481 actggctgta attgcgccac agcctcggcc cctcctcctc cttatgtggg gagtggtttg 541 tatccttccc tggcgggggt gggagagcag cagggccagg ggggtgacac accttggggg 601 gcggaacagc caagggcgga gccagggcac gcgggtctgg cccctgggcc ggccctgact 661 gactgggcaa ggatcaggga ggagcttgcg agtactggtc cgcccgtggt ggccatg // LOCUS ALREV2 564 bp ss-RNA VRL 26-JUL-1990 DEFINITION Rous sarcoma endogenous virus ev-2 locus gag polyprotein RNA, partial cds. ACCESSION M30518 KEYWORDS gag polyprotein. SOURCE Rous sarcoma endogenous virus (strain Prague C), cDNA to viral RNA, clone pAS2. ORGANISM Rous sarcoma virus Viridae; ss-RNA enveloped viruses; Positive strand RNA virus; Retroviridae; Oncovirinae; Type C oncovirus group; Avian sarcoma viruses. REFERENCE 1 (bases 1 to 564) AUTHORS Vogt,V.M., Pepinsky,R.B. and Southard,L.E. TITLE Primary structure of p19 species of avian sarcoma and leukemia viruses JOURNAL J. Virol. 
56, 31-39 (1990) STANDARD full staff_review FEATURES from to/span description pept < 1 > 564 gag polyprotein (AA at 1) matp < 1 312 p19 protein matp 313 378 p2 protein matp 379 564 pp10 protein BASE COUNT 123 a 138 c 207 g 96 t ORIGIN 1 gatcccatta ccgcggcgct ctcccagcgg gcaatggtac ttgggaaatc gggagagtta 61 aaaacctggg gattggtttt gggggcattg aaggcggctc gagaggaaca ggttacatct 121 gagcaagcaa agttttggtt gggattaggg ggagggaggg tctctccccc aggtccggag 181 tgcatcgaga aaccagcaac ggagcggcga atcgacaaag gggaggaagt gggagaaaca 241 actgtgcagc gagatgcgaa gatggcgccg gaggaaacgg ccacacctaa aaccgttggc 301 acatcctgct atcattgcgg aacagctatt ggctgtaatt gcgccacagc ctcggcccct 361 cctcctcctt atgtggggag tggtttgtat ccttccctgg cgggggtggg agagcagcag 421 ggccaggggg gtgacacacc tcggggggcg gaacagccaa gggcggagcc agggcacgcg 481 ggtctggccc ctgggccggc cctgactgac tgggcaagga tcagggagga gcttgcgagt 541 acaggtccgc ccgtggtggc catg // LOCUS HAMCHO1 1953 bp ss-mRNA ROD 26-JUL-1990 DEFINITION C.griseus intracisternal A-particle retrovirus like sequences. ACCESSION M34949 KEYWORDS p27 protein; pseudogene. SOURCE C.griseus adult ovary, cDNA to mRNA, clone CHIAP.SW2. ORGANISM Cricetulus griseus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Cricetidae; Cricetinae; Cricetini. REFERENCE 1 (bases 1 to 1953) AUTHORS Anderson,K.P., Lie,Y.S., Low,M.-A.L., Williams,S.R., Fennie,E.H., Nguyen,T.P. and Wurm,F.M. TITLE Presence and transcription of intracisternal A-particle-related sequences in CHO cells JOURNAL J. Virol. 64, 2021-2032 (1990) STANDARD simple staff_review FEATURES from to/span description pept.ps 2 277 IAP p27 homologue 277 564 IAP p27 homologue mRNA < 1 1953 p27 (pot.) 
mRNA BASE COUNT 485 a 473 c 509 g 486 t ORIGIN 1 ctttactctt acacaattgg atagacttgc cctaaatgcc ttgacgccat ctgactggca 61 gatggtcaca aaagctgcgc ttgtcagcat gggccaatac atggagtgga aagcactctg 121 gcatgaggcc gcccaagagc aggccagagc taacgcgacg gccttaactc ctgagcaaca 181 actatggaca ttcgacctgt taacgggcca gggtcgtttt gcagctgatc aaacaaatta 241 tcattggggc gcttatccac aaatcgacaa cgcggcatta gggcctgaaa ggtgctctcc 301 aagaaaggag gggttgacaa tcagcttact aaaatcattc aaggaaccca ggagactttc 361 tccgattttg tagcaaggat gacagaggca gcgggatgga tctttggcga tcctgagcag 421 gccgcacctc ttgttgagca acttatcttt gaacaggcct cccaagaatg tcgcgcagct 481 atagccccga gaaaaaacaa aggattacaa gattggctta gggtctgtag agaacttggg 541 ggacccctta ctaatgcagg gttagctact gccatcctac agtctcaaaa gcgccccctt 601 aaggggccag ataaaagaac ttgctttaga tgtggaacaa ttggacatat tatggcagat 661 ggcccaacta ggctgtgagc agaagctccc cggcctatat gtcacctcca tccaatatga 721 aaattttacc aaagcagcta atttgtctaa aagcctttct cagttcatgt tacagaattg 781 gacctccaaa tttgagcaaa cgcttcggga gttgagagcc gctattatcc agattaactc 841 cacgcgcctt gacctgtcct tgacggaggg attgtcatca tggatcgctt cgactgtctc 901 ctattttaag gaatgggtgg gggtgggatt gtttggtgca gccgtttgct gcggattggt 961 gttgcttcta tggctggtct gtaggctcag ggctcaaact aagagagaca aggtggttat 1021 cgcccaagcg cttgtagctt tggaacaagg ggcttccact gacatttggt taacaatact 1081 taagcaatag gcgctggcca gacagctctt gcacacccgg agcctaggct cattgcacag 1141 ggtagagtgt ctggcttgag cagcccatga gggaatgtgg agcaaggcat cgcacagaag 1201 agttgcccag tatgcaggct tctctgggag gcatgttgtc ctgcataagg gttgcctgcc 1261 ctagtctccc tttcccagaa aacggcagag gacaggtcga gagcgcttcg ggtcaagcta 1321 acagcctaat ggcgactctc gtacacagtc ttaatgtttg attgggaagg tacaacctct 1381 gcctctatcc ctcaacatat gggtgaccta tttgcttgta aaaatatgta agccttatca 1441 ttaattaata aaaaagggga gatgtaggga gccgtccctg cattctctat tacaagatgg 1501 cgcctgcatc cggcaggcac cgaatggtaa acaagttaat gcgcaggtgc tgggtaactt 1561 tccatccctt ggtctctgcc tctcccgtgg cgtcatatgg tccgatgagc tgcagccagt 1621 cagggggtga cacgtccgag gcggtggttg ccagcctata 
taagggatgg gtttttggga 1681 gttcggggtc tctgctctgt aagcttatgc tctccctctc aagatgcatt aaagctttac 1741 tacagaagga tcctgaatgt cctgcgtcat tcttgctggc gagacggtag cgcgggacag 1801 atggtgacag ccggtgcaga aagtgtcaac ctcagcttcc ttctccagga agacttcagc 1861 ctgggactgc tcctctacag agccccctac caagattatc taacctgcct gccttcttgt 1921 tgagctgtgt gtaataaact cattgagttt ccc // LOCUS HAMCHO2 1570 bp ss-mRNA ROD 26-JUL-1990 DEFINITION C.griseus intracisternal A-particle retrovirus like sequences. ACCESSION M34950 KEYWORDS p27 protein; pseudogene. SOURCE C.griseus adult ovary, cDNA to mRNA, clone CHIAP.LY6. ORGANISM Cricetulus griseus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Cricetidae; Cricetinae; Cricetini. REFERENCE 1 (bases 1 to 1570) AUTHORS Anderson,K.P., Lie,Y.S., Low,M.-A.L., Williams,S.R., Fennie,E.H., Nguyen,T.P. and Wurm,F.M. TITLE Presence and transcription of intracisternal A-particle-related sequences in CHO cells JOURNAL J. Virol. 64, 2021-2032 (1990) STANDARD full staff_review FEATURES from to/span description pept.ps 26 694 IAP p27 homologue BASE COUNT 418 a 368 c 406 g 378 t ORIGIN 1 aaaaagaaag ctgggcctcg cttttcccat ctttgagggc attgagggag agtgtatgca 61 tgcacccatg gagtataatc agataaaaga attggcagaa tcagtcagga aatatggagt 121 cacagccaac tttactctta cacaattgga tagacttgcc ctaaatgcct tgacgccatc 181 tgactggcag atggtcacaa aagctgcgct tgtcagcatg ggccaataca tggagtggaa 241 agcactctgg catgaggccg cccaagagca ggccagagct aacgcgacgg ccttaactcc 301 tgagcaacaa ctatggacat tcgacctgtt aacgggccag ggtcgttttg cagctgatca 361 aacaaattat cattggggcg cttatccaca aatcgacaac gcggccatta gggcctgaaa 421 ggtgctctcc aagaaaggag gggttgacaa tcagcttact aaaatcattc aaggaaccca 481 ggagactttc tccgattttg tagcaaggat gacagaggca gcgggatgga tctttggcga 541 tcctgagcag gccgcacctc ttgttgagca acttatcttt gaacaggcct cccaagaatg 601 tcgcgcagct atagccccga gaaaaaacaa aggattacaa gattggctta gggtctgtag 661 agaacttggg ggacccctta ctaatgcagg gttagctact gccatcctac agtctcaaaa 721 gcgccccctt 
aaggggccag ataaaagaac ttgctttaga tgtggaacaa ttggacatat 781 tatggcagat ggcccaacta ggctgtgagc agaagctccc cggcctatat gtcacctcca 841 tccaatatga aaattttacc aaagcagcta atttgtctaa aagcctttct cagttcatgt 901 tacagaattg gacctccaaa tttgagcaaa cgcttcggga gttgagagcc gctattatcc 961 agattaactc cacgcgcctt gacctgtcct tgacggaggg attgtcatca tggatcgctt 1021 cagctgtctc ctattttaag gaatgggtgg gggtgggatt gtttggtgca gccgtttgct 1081 gcggattggt gttgcttcta tggctggtct gtaggctcag ggctcaaact aagagagaca 1141 aggtggttat cgcccaagcg cttgtagctt tggaacaagg ggcttccact gacatttggt 1201 taacaatact taagcaatag gccgctggcc agacagctct tgcacacccg gagcctaggc 1261 tcattgcaca gggtagagtg tctggcttga gcagcccatg agggatgtgg agcaaggcat 1321 cgcacagaag agttgcccag tatgcaggct tctctgggag gcatgttgtc ctgcataagg 1381 gttgcctgcc ctagtctccc tttcccagaa aaacggcaga ggacaggtcg agagcgcttc 1441 gggtcaagct aacagcctaa tggcgactct cgtacacagt cttaatgttt gattgggaag 1501 gtacaacctc tgcctctatc cctcaacata tgggtgacct atttgcttgt aaaaatatga 1561 agccttatca // LOCUS HAMCHO3 2186 bp ss-mRNA ROD 26-JUL-1990 DEFINITION C.griseus intracisternal A-particle retrovirus like sequences. ACCESSION M34951 KEYWORDS protease; pseudogene. SOURCE C.griseus adult ovary, cDNA to mRNA, clone CHIAP.YL[7,9]. ORGANISM Cricetulus griseus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Cricetidae; Cricetinae; Cricetini. REFERENCE 1 (bases 1 to 2186) AUTHORS Anderson,K.P., Lie,Y.S., Low,M.-A.L., Williams,S.R., Fennie,E.H., Nguyen,T.P. and Wurm,F.M. TITLE Presence and transcription of intracisternal A-particle-related sequences in CHO cells JOURNAL J. Virol. 
64, 2021-2032 (1990) STANDARD simple staff_review FEATURES from to/span description pept.ps 553 1281 IAP protease homologue BASE COUNT 649 a 431 c 518 g 588 t ORIGIN 1 gcaataactc catataaggg taaaggcctt gaagtctgga tgaaagtctg tagggagtta 61 gggggtccgc tgactaatgc tggactagca gctgctgtgt tgcaattaac taagaaaggt 121 ggaggttcag gagcttgctt taaatgcggc aagcaagggc atttgaaaaa gcaatgcccc 181 gagggaggaa acactaaagt caataaactt tgctccgcgc cctaagcaac ctggcttatg 241 tcctagatgt agaaaaggaa atcattgggc taaggattgt agatcagtaa aagacatcag 301 tggacagcct cttgttcagg ggtatggagg agcccgttca aaaaacggac gacggggccc 361 acgaccccag ggcccacaaa tatatggggc catggaggat cagaaccagg agcagagtcc 421 cgaaacctgg ccctctcttc gtcatccgag ggaccgagga gagccactac aggctccgcg 481 gggctggact tacgctccac caccagactc gtattaactc ccagaatggg ggtccagctt 541 gttgacaccg attttaaggg accccttgag cctggcacag taggtttgct tataggaaga 601 tcatctgcag cattgaaagg tttacgagta catcctggag ttatagatcc tgattacatg 661 ggtgtagtaa agatcatggt agaatctcct agagggatta cggccatttc tcctggagac 721 aggatagcac agttactgct tttgccaagc ttgcatgaca agtttccagc acaagccaga 781 gagagaggag agggaaactt tggctccact ggatcaaact taactttcct agctttagac 841 cttgatcaac gtccaaccct tgagttaata gtgaatggta agaaaatctt aggcttacta 901 gattctggag ctgataagag catcatagcc actaaagatt ggccctctgg ctggcctata 961 caggtttctt ctcaaagttt acaaggttta ggctatgcta aggctcctga tatgagtgct 1021 agacaattgc cttggaaaga tcaggaaggg cattcaggga ccatgcaacc ttatgtgtta 1081 gacttaccaa tttcattatg gggaagagat ttgttaaagg atatgggttt taaactcaca 1141 aatgaatact cagaaacatc tcaaggtatc atgaaacgaa tgggatacag tcccaggcca 1201 ggcctcggga aacatctgca gggtcgtacc agtcctatta attccacaat tgagaccaaa 1261 gaatctaggt ctgggttttt cctagggcca ctgaggaggt attcctatta cctggaaaac 1321 agaggagccg gtatgggttc ctcagtggcc actttcctct gagaaactgg aagctgctaa 1381 gactctagtg cgggagcagc tggatctggg gcatataaaa tcctctgtat ctccatggaa 1441 tactcctatt tttgtcatta agaaaaaatc tggtaaatgg agactgcttc acgatcttag 1501 agctattaat caacagatgc aaattatggg ccctgtacaa cgtggtcttc cacttttaac 1561 
ttctttacct gcatcatggc ctatcatctc tatagatatt aaagattgct tcttttccat 1621 acctttgtgt gccaaggatt cagggcgttt tgcgtttacg ctgccctctt gtaatcatga 1681 acaacctgat ttaaggtatg aatgggatag tgttggccac aggggatggc caatagtcct 1741 actatgtgtc agttgtttgt agcagaagca attgctcctt ttgagagtgg actttcccaa 1801 agattagatg tgttcattat atggatgata ttttattggc tgccaaagat gataaaacgc 1861 ttaataaggc atatacaaaa ttggtaaaat tgcttgagat gcataattta gtcatagcct 1921 cagaaaaggt acaaaaggac actgttgtta actatctagg ggctaagatt ctccctcata 1981 caattattcc acaaaagata gagattagaa aagataattt aaaaactctt aatgattttc 2041 aaaagttgtt gggagatata aattggataa gatgttattt aaaattacca aattatgagt 2101 tgaagccatt gtataatatt ctcaatggtg attcagcatt agattcacct aggcagttaa 2161 ctgctgaagc cagagaagct ttaaag // LOCUS HUMCHRM 2098 bp ds-DNA PRI 26-JUL-1990 DEFINITION Human muscarinic acetylcholine receptor gene, complete cds. ACCESSION M35128 Y00508 KEYWORDS muscarinic acetylcholine receptor. SOURCE Human DNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 2098) AUTHORS Allard,W.J., Sigal,I.S. and Dixon,R.A.F. TITLE Sequence of the gene encoding the human M1 muscarinic acetylcholine receptor JOURNAL Nucleic Acids Res. 
15, 10604-10604 (1987) STANDARD simple staff_review FEATURES from to/span description pept 449 1831 muscarinic acetylcholine receptor BASE COUNT 458 a 662 c 570 g 408 t ORIGIN 1 agtatagctt ataagtggat gaatgcttga gaagttgcag attatacaaa gtagttccca 61 actcctgcaa cccagtatgt aagatagaat tgtagttaat ttcccagtaa gaaaatgagc 121 ctgagtctga aaggtaaaac tgaatgaagt attcaaaccc tggatcccaa agccactcca 181 cgctgctggc aaatccactt atggctggga aagtgccact gcataaatga ccatgagtgg 241 gcaccggtaa gggagggtga tgctatctgg tctgaagctc tgaagggcaa gaattacatc 301 ccatgcatct tccaataagg tctatcagaa atgtccagtg gcccaaccaa agcccatgtc 361 ctctctttta ggtgatgact ttcccctgag gaagccctgt agcgtgcctg gaggaagggg 421 tctccaaccc cagccccacc tagccaccat gaacacttca gccccacctg ctgtcagccc 481 caacatcacc gtcctggcac caggaaaggg gccctggcaa gtggccttca ttgggatcac 541 cacgggcctc ctgtcgctag ccacagtgac aggcaacctg ctggtactca tctccttcaa 601 ggtcaacacg gagctcaaga cagtcaataa ctacttcctg ctgagcctgg cctgtgctga 661 cctcatcatc ggtaccttct ccatgaacct ctataccacg tacctgctca tgggccactg 721 ggctctgggc acgctggctt gtgacctctg gctggccctg gactatgtgg ccagcaatgc 781 ctccgtcatg aatctgctgc tcatcagctt tgaccgctac ttctccgtga ctcggcccct 841 gagctaccgt gccaagcgca caccccgccg ggcagctctg atgatcggcc tggcctggct 901 ggtttccttt gtgctctggg ccccagccat cctcttctgg cagtacctgg taggggagcg 961 gacagtgcta gctgggcagt gctacatcca gttcctctcc cagcccatca tcacctttgg 1021 cacagccatg gctgccttct acctccctgt cacagtcatg tgcacgctct actggcgcat 1081 ctaccgggag acagagaacc gagcacggga gctggcagcc cttcagggct ccgagacgcc 1141 aggcaaaggg ggtggcagca gcagcagctc agagaggtct cagccagggg ctgagggctc 1201 accagagact cctccaggcc gctgctgccg ctgctgccgg gcccccaggc tgctgcaggc 1261 ctacagctgg aaggaagaag aggaagagga cgaaggctcc atggagtccc tcacatcctc 1321 agagggagag gagcctggct ccgaagtggt gatcaagatg ccaatggtgg accccgaggc 1381 acaggccccc accaagcagc ccccacggag ctccccaaat acagtcaaga ggccgactaa 1441 gaaagggcgt gatcgagctg gcaagggcca gaagccccgt ggaaaggagc agctggccaa 1501 gcggaagacc ttctcgctgg tcaaggagaa gaaggcggct cggaccctga 
gtgccatcct 1561 cctggccttc atcctcacct ggacaccgta caacatcatg gtgctggtgt ccacgttctg 1621 caaggactgt gttcccgaga ccctgtggga gctgggctac tggctgtgct acgtcaacag 1681 caccatcaac cccatgtgct acgcactctg caacaaagcc ttccgggaca cctttcgcct 1741 gctgctgctt tgccgctggg acaagagacg ctggcgcaag atccccaagc gccctggctc 1801 cgtgcaccgc actccctccc gccaatgctg atagtcccct ctcctgcatc cctccacccc 1861 agtccccggg aaaaggccgg tcggaagagg gcaggggctg catcctcagc cccagggccc 1921 tgctcaggcc tcacctggct tcccaggacc ctgggtcacc ttcctgggca gcccagagag 1981 acctgccaac tttccagact tcgctattcc caggcaggga gggaaacccg gggaactggt 2041 ttttctgttc cctgctgggt gggaatgcgc tcttcacagg aagaaggccc gggaggag // LOCUS MVOTRPBA 2874 bp ds-DNA BCT 26-JUL-1990 DEFINITION M.voltae tryptophan synthase operon (trp) genes, complete cds. ACCESSION M35130 KEYWORDS tryptophan synthase. SOURCE M.voltae (PS DSM 1537) DNA. ORGANISM Methanococcus voltae Prokaryota; Bacteria; Mendosicutes; Archaeobacteria; Methanococcales; Methanococcaceae. REFERENCE 1 (bases 1 to 2874) AUTHORS Sibold,L. and Henriquet,M. TITLE Cloning of the trp genes from the archaebacterium Methanococcus voltae: Nucleotide sequence of the trpBA genes JOURNAL Mol. Gen. Genet. 
214, 439-450 (1988) STANDARD simple staff_review FEATURES from to/span description pept < 1 206 tryptophan synthase F (AA at 3) pept 304 1533 tryptophan synthase B pept 1571 2425 tryptophan synthase A pept 2460 2600 ORF 46 pept > 2874 2666 (c) ORF 68 (AA at 2872) BASE COUNT 1046 a 381 c 569 g 878 t ORIGIN 1 gggttgcgga aactcatgac catagagtta gcgaaattat ctccaaaaaa tttgatgtcg 61 tacttgcagg cggtataact tttgaaaacg tgagaaaaat tgtaaattcc gtaaaacccg 121 ttggaattga tgtttctagt ggcgttgagt taaacaacag aaaaaacgaa ttattaataa 181 aaaagatttg tcataatttg atttaattag aattaattag aattaatcga attttaacta 241 attaaaaatt ataggttatt aaattatgac taaatacagt atatgtaaaa ttaaggtgaa 301 attatgaaat gtaatacaaa atgtgacaaa aatggatatt ttggggaatt tgggggtcaa 361 tatatacctg aagttttaaa accggctgtt gaagagctta aagaagccta taaagagtta 421 aaagatgacg aagactttca aaatgagctt gcatactatt taaaacatta tgcaggacgt 481 gaaactcccc tatattatgc aaaaaacttg actgaaaaac ttggtggtgc caaaatctac 541 ctaaaaagag aggacttatt gcatggtggt gcccataaaa ccaataacac tattggtcaa 601 gcacttcttg ctaaaaaaat gggtaaaaca agaataattg ctgagacggg tgcgggtcaa 661 catggtgttg gcacgtctat ggcaggagca ctttttggtc tcgaaacaga gatttttatg 721 ggtagggtag atacagaacg acaacaacct aacgtagcac gtatgaaatt attgggtgca 781 aaagttacgc cagtcgatac aggttccaaa gttttaaaag acgctgtaaa tgaagctatg 841 agaaattgga ctgctacttt tgaaaatact cactatttac ttggcactgt gatgggtcca 901 cacccattcc caactatggt gagagatttt cagtcagtaa ttgggaaaga agttaaaaaa 961 caaataatgg agcaggaaga aagacttcct gattatttag ttgcctgtat tggagggggt 1021 agcaatgcaa tgggtttatt tcatccattt ttaagtaata atatcagtac tggcaatgat 1081 gatgccaaaa atgttaaaat gataggaata gaggctgcag gtaaggggct taacactagc 1141 cttcacggtg catccataac taaaggtgaa aaaggggtac ttcacggtat gctttcgtat 1201 ttcttacaag acgaggatgg acaaatagaa gaagcttata gtatttctgc cggattggat 1261 tacccaggga taggtccaga gcatgcttat ttacataacc ttgggcgtgt gcagtatgct 1321 tcagcaactg ataaacaggc cttaaaagca tttatggaac ttacgagaac cgaaggaatt 1381 atcccggctc tagaatcgtc tcacgcgatt gcttatgcca ttgaaaatgc aggaaatatg 1441 
gataaggacg atataatggt aataaacctt tcaggacgtg gggataaaga tttaaacaca 1501 gtaataaatg cagtacataa attgggttgt taaaattaat taaaattaat taaaatatcg 1561 aggaatttaa atgaaaaact tagaaaattt agaaaaagat ttgaaaaatg acttaaaaaa 1621 agatttgaaa aaagaaaaac caattttagt tagtttttta gtatcagggg acccaaatat 1681 tgaagctaca ctaaaattta tgaatgcact agacgaatat tgcggagtta tagaactagg 1741 tataccattt agtgacccga tagcagatgg ttcaactatt caagaggcaa atgtacgttc 1801 cttatcaaat ggttataaaa tacatcaatc ttttgacgta ttacgggaat ttaggaaatt 1861 ttcagatacg ccagttgtac ttatgacgta ttacaatcca atatataaaa gaggtattga 1921 aaattttgta attcaagcaa aagaagcagg ggcaaatggg cttataattg tagatttacc 1981 cctagatgaa gcagaacagt atagggcaat atgtaaaaag catgatatgg gaacagtatt 2041 ccttgtagcc ccaaatacac ctgatgagag gttgatgtat tctgatgagg ctagtacact 2101 gtttttatac gtaatttcga catttggtat tactggagct aggggttcat ttgaaaagat 2161 gacttttgaa tttatagctc gtgcaaaaaa tctttgcgat aaaaataagc tgtatgtagg 2221 ttttggaatt tcaaacggtg aacatgctga aaaaataatt gaaaatggtg ctgacggtgt 2281 tattgtaggg agtgcttttg tagatattat taaggaatac ggggattcta atgaaactat 2341 ttataaatta aaagaattag ctcgggaatt aagcgaaggg attcataaag gttatgttaa 2401 atacaatgaa aagaataaat attaaataat ataatttatt ttaaattttg ggtggagata 2461 tgaatttaaa agataatata ctttataaat caatcaaatg gttttttgcg gttaaatcgg 2521 agaaacctaa aaattacgat actgaagtaa aacctatatt gtatgagcaa gagcgacgtg 2581 gtagacgccg tatattataa taaattctaa tttaaaaaaa taaaaaaaga aattatatta 2641 ttgtagtatt taattaatta ttcatttaat tcttttttga attcaaaaag tttttggcaa 2701 tgtccattat attttcagat attatatatt ttgaattttc ttttaaaacg ctatttgcaa 2761 cgtcaagtga tttataaact tgtgcttcac ctttaaagta catttgtgcc gcttctgcaa 2821 ctgtttttat tgctttagcc tgcccctcag cttcaattct aatactttct gcag // LOCUS MYCSD1XX 425 bp ds-DNA BCT 26-JUL-1990 DEFINITION M.pneumoniae SDC1 repetitive sequence. ACCESSION M35024 KEYWORDS . SOURCE M.pneumoniae (strain M-129) DNA. ORGANISM Mycoplasma pneumoniae Prokaryota; Bacteria; Tenericutes; Mollicutes; Mycoplasmas; Mycoplasmatales; Mycoplasmataceae. 
REFERENCE 1 (bases 1 to 425) AUTHORS Colman,S.D., Hu,P.-c. and Bott,K.F. TITLE Prevalence of novel repeat sequences in and around the P1 operon in the genome of Mycoplasma pneumoniae JOURNAL Gene 87, 91-96 (1990) STANDARD simple staff_review BASE COUNT 100 a 118 c 120 g 87 t ORIGIN 1 aattcgaatt tgaaggccca aggcctcacc caacccgcct acctcatcgc cggtcttgac 61 gttgtggccg accacctcgt ctttgcggcc tttaaagcgg gcgcggtggg gtatgatatg 121 acgactgatt cgagcgcttc gacctacaac caagcactcg cctggtcgac cacggccggg 181 ttggacagtg atggggggta caaggccttg gtggaaaaca cggccgggct caacggcccg 241 attaatggct tgtttaccct gctcgacacc tttgcgtatg tgacccccgt gagtgggatg 301 aaagggggga gtcagaataa tgaagaagtg caaacgactt acccggtcaa gtccgaccaa 361 aaggccaccg ccaaaattgc ctccttaatt aatgccagcc cactcaacag ttatggggat 421 gatgg // LOCUS MYCSDC1 425 bp ds-DNA BCT 26-JUL-1990 DEFINITION M.pneumoniae SDC1 repetitive sequence. ACCESSION M35022 KEYWORDS . SOURCE M.pneumoniae (strain M-129) DNA, clone MP135. ORGANISM Mycoplasma pneumoniae Prokaryota; Bacteria; Tenericutes; Mollicutes; Mycoplasmas; Mycoplasmatales; Mycoplasmataceae. REFERENCE 1 (bases 1 to 425) AUTHORS Colman,S.D., Hu,P.-c. and Bott,K.F. TITLE Prevalence of novel repeat sequences in and around the P1 operon in the genome of Mycoplasma pneumoniae JOURNAL Gene 87, 91-96 (1990) STANDARD simple staff_review BASE COUNT 102 a 110 c 123 g 90 t ORIGIN 1 aattcgaatt tgaaggctca aggcctcacc caacccgcct acctcatcgc cggtcttgac 61 gttgtggccg accacctcgt ctttgcggcc tttaaagcgg gcgcggtggg gtatgatatg 121 agcacggaaa acagtgctgc caccaaagac caagcactcg cctggtcgac cacggccggg 181 ttggacagtg ctggggggta caaggccttg gtggaaaaca cggccgggct caacggtccg 241 attaatggct tgtttaccct gctcgacagc tttgcctatg tgaccccggt gagtggcatg 301 aaagggggta gtcagaataa cgaagaagtg cagaccaagt atcccgttaa ggatgatagt 361 aaggcttccg ccaaaattgc gtccttaatt aatgccagcc cactcaacag ttatggggat 421 gatgg // LOCUS MYCSDC1X 425 bp ds-DNA BCT 26-JUL-1990 DEFINITION M.pneumoniae SDC1 repetitive sequence. ACCESSION M35023 KEYWORDS . 
SOURCE M.pneumoniae (strain M-129) DNA, clone MP46. ORGANISM Mycoplasma pneumoniae Prokaryota; Bacteria; Tenericutes; Mollicutes; Mycoplasmas; Mycoplasmatales; Mycoplasmataceae. REFERENCE 1 (bases 1 to 425) AUTHORS Colman,S.D., Hu,P.-c. and Bott,K.F. TITLE Prevalence of novel repeat sequences in and around the P1 operon in the genome of Mycoplasma pneumoniae JOURNAL Gene 87, 91-96 (1990) STANDARD simple staff_review BASE COUNT 105 a 114 c 117 g 89 t ORIGIN 1 aattcgaatt tgaagaccca aggcctcacc caacccgcct acctcatcgc cggtcttgac 61 gttgtggccg accacctcgt ctttgcggca tttaaagcgg gcgcggtggg gtatgatatg 121 acgactgatt cgaacgcttc gacctacaac caagcactcg tctggtcgac cacggccggg 181 ttggacagtg atggggggac aaggctttgg tagaaaacac aggccgggct caacggcccg 241 attaatggtt tgtttaccct gctcgacacc tttgcgtatg tgacccccgt gagtgggatg 301 aaagggggga gtcagaataa tgaagaagtg caaacgactt acccggtcaa gtccgaccaa 361 aaggccaccg ccaaaattgc ctccttaatt aatgccagcc cactcaacag ttatggggat 421 gatgg // LOCUS MZECAT1 2065 bp ss-mRNA PLN 26-JUL-1990 DEFINITION Z.mays catalase isozyme 1 (CAT-1) mRNA, complete cds. ACCESSION M33104 KEYWORDS catalase isozyme 1. SOURCE Z.mays seedling, cDNA to mRNA. ORGANISM Zea mays Eukaryota; Plantae; Embryobionta; Magnoliophyta; Liliopsida; Commelinidae; Cyperales; Poaceae. REFERENCE 1 (bases 1 to 2065) AUTHORS Redinbaugh,M.G., Wadsworth,G.J. and Scandalios,J.G. TITLE Characterization of catalase transcripts and their differential expression in maize JOURNAL Biochim. Biophys. 
Acta 951, 104-116 (1988) STANDARD simple staff_review FEATURES from to/span description pept 169 1647 catalase isozyme 1 (EC 1.11.1.6) mRNA 1 2065 catalase isozyme 1 mRNA BASE COUNT 502 a 558 c 487 g 518 t ORIGIN 1 gaaaaaaaag gggaaatcgg cttcctactc cccgtcctta tcgccagccg aaccgacatg 61 ttttctcccc ccttctcgcc ttctccttct ccccctagtc tagaggcgtt tgctccccaa 121 ctccttcggc ccgtccgccc gcccactcga ctgatcccac cggcagccat ggatccatac 181 aagcaccgcc cgtctagtgg gagcaactcc agcttctgga ccaccaactc cggcgccccc 241 gtctggaaca acaactctgc cctcaccgtc ggacagcgag gtccaatcct ccttgaggat 301 tatcatctaa tcgaaaagct tgctcagttc gacagagaac gtatccctga acgtgttgtg 361 catgcacggg gagccagtgc caagggtttc tttgaggtca ctcatgatgt ctctcacctt 421 acatgtgctg attttctccg tgctcctggg gtccagacac ctgttattgt ccgtttctct 481 acagttgtgc atgagcgtgg aagccctgag accttgaggg atccacgtgg ttttgctgtc 541 aagttctaca ccagagaggg taactttgac ctcgtgggta acaacatgcc tgtgtttttc 601 atacgagatg ggatgaaatt ccctgacatg gtccacgctt tcaagccgaa tccaaagacc 661 aatttgcagg agaactggag aatagtagat ttcttctctc accacccaga gagcctacac 721 atgttcacct tcctctttga cgatgttggc atcccactca actacaggca catggagggc 781 tttggtgtca atacctactc cttgatcaac agggatggaa agcctcacct tgtgaaattc 841 cattggaagc ctacttgtgg tgtgaaatgc ttgctcgaca atgaagctgt gactgttgga 901 ggcacctgcc acagccatgc gacgaaggat ctatatgatt ccatcgcagc tgggaattac 961 cctgaatgga agctctacat ccagactatt gatcttgacc atgaggataa gtttgacttt 1021 gacccgctcg atgtcaccaa gacctggcct gaggatatca tcccgctgca gcccgttgga 1081 cggatggtcc tgaacaagaa cgtcgacaac ttctttgcag agaatgaaca gattgctttc 1141 tgcccagcga ttagtgttcc tgcaattcac tattctgatg ataagctgct ccagacgaga 1201 atcttctcct atgctgatac ccagaggcac cgccttggtc caaactatct gatgcttcct 1261 gtgaatgcac caaaatgtgc ccaccacaat aaccaccatg atgggttcat gaacttcatg 1321 cacagggacg aagaggtgaa ctacttccct tcgaggtttg atcccgcccg tcacgcggag 1381 aaggtcccca ttcctccccg tgttctaaca cgctgtcgtg agaagtgcat cattcagaag 1441 gagaacaact tcaagcaggc tggcgagaga tatcgttcct tcgaccctgc aaggcaagac 1501 cggttcatcc agcgatgggt 
tgacgcactg acacaccctc gcgtgaccca tgaacaccgt 1561 accatttgga tctcctactg gtcccagtgc gacgccgctc ttggccagaa gctgccttct 1621 aggctgaacc tgaagccgag catgtaagga tcgacgagga agaaagcagg caccggtggc 1681 caaggatgca acgcaacatg gagcgtgtga tgtttacacc aatataattg aataaacagg 1741 ggatgtgcgc gttgtcgtac ttatgctgat gctgatggtc ggtggtcgat tatatatact 1801 ggaacttctg gtgtatgctc ttctcttctg gggagacgta atctaacgaa gaagaatgtg 1861 tgtcattgtg gcctgtgcta caaaccctgc tgtatgggcc tgtctataag aaaacacgga 1921 tggagttgtg acgttatgtt ctgacagttt atttactaat gagcacatac tttgatctaa 1981 ctagaacgaa gagaagttca cggaactgtc ggacacatgc agcaaggatc ctcattataa 2041 tacgaatcac tcttcgtttg cattc // LOCUS MZECAT3 1790 bp ss-mRNA PLN 26-JUL-1990 DEFINITION Z.mays catalase isozyme 3 (CAT-3) mRNA, complete cds. ACCESSION M33103 KEYWORDS catalase isozyme 3. SOURCE Z.mays seedling, cDNA to mRNA. ORGANISM Zea mays Eukaryota; Plantae; Embryobionta; Magnoliophyta; Liliopsida; Commelinidae; Cyperales; Poaceae. REFERENCE 1 (bases 1 to 1790) AUTHORS Redinbaugh,M.G., Wadsworth,G.J. and Scandalios,J.G. TITLE Characterization of catalase transcripts and their differential expression in maize JOURNAL Biochim. Biophys. 
Acta 951, 104-116 (1988) STANDARD simple staff_review FEATURES from to/span description pept 22 1509 catalase isozyme 3 (EC 1.11.1.6) mRNA 1 1790 catalase isozyme 3 mRNA BASE COUNT 386 a 550 c 550 g 304 t ORIGIN 1 cgtgggtagc tagctaggtg aatgacaatg gatcctacca agttccgtcc gtccagcagc 61 cacgacacga cggtgacgac gacgaacgct ggcgctcctg tgtggaacga caacgaggcg 121 ctgactgtgg ggcctcgcgg tcccatcctg ctggaggact accacctgat cgagaaggtg 181 gcgcacttcg accgcgagcg catcccggag agggtggtgc acgcgcgtgg cgcgtccgcc 241 aagggcttct tcgagtcgac ccacgacgtg acgtcgctga cgtgcgccga cttcctgcgc 301 gcgcccggcg tgcggacgcc cgtgatcgtg cgcttctcgc aggtgatccc agagccgggg 361 tccggacgga cgatccgaga cgcgcgcggg ttcgccgtga agttctacac ccgcgagggc 421 aactgggacc tgctgggcaa caacttcccc gtcttcttca tccgcgacgg catcaagttc 481 cccgacgtga tccacgcgtt caagcccaac ccgcggtcgc acgtgcagga gtactggcgg 541 gtgttcgact tcctgtcgca cctccccgag agcctgcaca ccttcttctt cctcttcgac 601 cacgtgggcg tgccgtccga ctaccgccac atggaagggt tcggcgtgaa cacgtacacg 661 ttcgtgagcg cggcggggaa ggcgcagtac gtgaagttcc actggaagcc gacgtgcggc 721 gagcggtcca tcctgacgga cgaggaggcg cgcgtcgggg gacggaacca cagccacacg 781 caggacctgt acgactccat cgcggcggag gggagcttcc cggagtggac gctgtacgtg 841 caggtgatgg acccggcaca gcaggagcag tacgacttcg acccgctgga cgacaccaag 901 acgtggccgg aggacctgtt gccgctccgc cccgtgggga ggctggtgct ggacaggaac 961 gtggacaact tcttgaacga gaacgagcag ctggcgttcg ggccggggct ggtggtgcca 1021 gggatctact actcggacga caagatgctg cagtgccggg tgttcgccta cgccgacacg 1081 cagcgctaca ggctgggtcc caactacctg atgctgcccg tcaacgcgcc gcgctgcggc 1141 acccacaaca accactacga cggcgccatg aacttgatgc accgcgacga ggaggtggac 1201 tactacccgt ccaggcacgc gcgccgctgc ggcagggcgg cgcccacgcc actgccgccc 1261 aggccggtcg cggggaggag ggagaaggca accatacgca agcccaacga cttcaagcag 1321 ccaggggaga ggtaccgctc ctgggacgcc gaccgacagg accgattcgt gaaggcgatt 1381 cgccgactcg ctcggacacc caaacgtcag ccagagctca ggtccatctg gatagacctc 1441 ctcgccaagg tcgacgcgtc gctggggatg aagattgcca cccggctcaa catgaaggca 1501 aacatgtgat gcttgtgctg 
aatagaataa taatgaagac gcatgcatgt cgtcgccagg 1561 aacaagagaa ataataacaa gaccaccacg catgggcata ctccatatat atatgtatag 1621 cccgtgcccg tgtccgcctt tgtaccaata caagccaaga ctagtggatg tattattatt 1681 attattattg cgctatcaca tacatgtacc cctgctacct gaagatggat attgtatcca 1741 gttatcaaat taagacacct gcagcaaaaa aactatatat gttgcataag // LOCUS BRVRNASA 197 bp ss-mRNA VRL 26-JUL-1990 DEFINITION Berne virus ORF5 mRNA, 5'end. ACCESSION M33503 M33501 KEYWORDS core protein. SOURCE Berne virus (strain P138/72) viral RNA. ORGANISM Berne virus Viridae; ss-RNA enveloped viruses; Positive strand RNA virus; Toroviridae. REFERENCE 1 (bases 1 to 197) AUTHORS Snijder,E.J., Horzinek,M.C. and Spaan,W.J.M. TITLE A 3'-coterminal nested set of independently transcribed mRNAs is generated during Berne virus replication JOURNAL J. Virol. 64, 331-338 (1990) STANDARD simple staff_review FEATURES from to/span description pept 137 > 197 ORF5 mRNA 113 > 197 RNA5 BASE COUNT 50 a 27 c 38 g 82 t ORIGIN 1 ttatttcttc ttcctacttt gtggctactt gggttttgtt ggtggtggtt attattttag 61 tatttataat tataagtttt tgtattagta attaagtagg ttagtgagag acactatctt 121 tagagaaaga gccaagatga attctatgct taatccaaat gctgtgccat ttcaaccatc 181 acctcaggtt gttgcat // LOCUS BRVRNASB 179 bp ss-RNA VRL 26-JUL-1990 DEFINITION Berne virus ORF3 mRNA, 5' end. ACCESSION M33502 KEYWORDS core protein. SOURCE Berne virus (strain P138/72) viral RNA. ORGANISM Berne virus Viridae; ss-RNA enveloped viruses; Positive strand RNA virus; Toroviridae. REFERENCE 1 (bases 1 to 179) AUTHORS Snijder,E.J., Horzinek,M.C. and Spaan,W.J.M. TITLE A 3'-coterminal nested set of independently transcribed mRNAs is generated during Berne virus replication JOURNAL J. Virol. 
64, 331-338 (1990) STANDARD simple staff_review FEATURES from to/span description pept 153 > 179 ORF3 BASE COUNT 52 a 17 c 34 g 76 t ORIGIN 1 ttataatctt cttcctactt ggattacatg gcttacttta ggttttagtt tgtttagtat 61 agtaataagt ggtattaata ttattttgtt ttttgaaatg aatggtaagg tgaagaaaag 121 ttagtcactt tctttagaag aaggttgcca aaatgtttga gaccaattat tggccattt // LOCUS CHKGLOBA 1204 bp ds-DNA VRT 26-JUL-1990 DEFINITION Chicken pie-alpha-globin gene, fragment H3/H4. ACCESSION M30485 KEYWORDS pie-alpha-globin. SOURCE Chicken AEV transformed erythroblast DNA, fragment H3/H4. ORGANISM Gallus gallus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Aves; Neornithes; Neognathae; Galliformes; Phasianidae. REFERENCE 1 (bases 1 to 1204) AUTHORS Broders,F., Zahraoui,A. and Scherrer,K. TITLE The chicken alpha-globin gene domain is transcribed into a 17-kilobase polycistronic RNA JOURNAL Proc. Natl. Acad. Sci. U.S.A. 87, 503-507 (1990) STANDARD simple staff_review FEATURES from to/span description mRNA 91 > 1204 pie-alpha-globin mRNA fragment H3/H4 (put.) 
BASE COUNT 282 a 252 c 263 g 407 t ORIGIN 1 ggatctatct agttgctgca gtcgtttgta tgaaggttgg atccatcctg ttttgtactg 61 gatgactgcc ttcaattcac tggcaatcta ggatcaaatg tgtcctagag aacattcaat 121 atcgcttttt ttctaagctg ttgcaagcca gaatggttac ttttgagctg atctcggtgg 181 agcagttgag ttgttgtaag ttatttctta atggctccag aaaattacat catttaggtg 241 ctataactct ccatttccat cttgtatgcg taattgcatt tcttgaatac ttcagacatt 301 aatttcccgt cctacctgca ggttactggt gtgtattggc tatacagatt acttttccac 361 agatgtaacc ctaggtcttt tgaatataga tcccatctat tgtctgctta gagaccccga 421 taaccctccc gataaatcag agtccatgtt ttttgacagt atatcggtgt gaacatctgg 481 attttagtgc aatatgctag tagcaatctg agtccccgtt tctaagacag agtcatttag 541 tccgagaatg gctgtttaag actccaaatg gcagtcttga gtcttttagt gactgtactc 601 gttcctctac tgagggcagt cttgagtgtt ttagtgactg taccctgtct cttaacttga 661 ccggtctgat agatcttaaa tgacagtcgt ggccgcaatt tcaaatggaa gagctaggag 721 tctcaggaac cgtcgccctt gtttactctt atgtttaccc gttaagccgt catgaaaagg 781 atttttctgt agagaacggt tatatgagtt gtattccatc tagggtcacg gcccctagac 841 caaccaacga cgagtcgatt tgttgtctgg cactttctgt gacttcaagt tttgtggctt 901 tctctattaa ctttccccac aacgtaactg tctaacttag atgttggcgc gagaactaca 961 gtctgaggga cttgtcaaga gctggcacac tcgcctttat gttaaagtgt gtcctttgtc 1021 gatactggta ctaatgctta agctcgagcg ggcccctaga ccaacgacga gtcgatttgt 1081 tgtctggctc tttctgtgac ttcaagtttt gtggctttct ctattaactt tcccacaacg 1141 taactgtcta attagatgtt ggcgcgagaa tacagtctga gggattgtca agagtggact 1201 ggtt // LOCUS CHKGLOBB 582 bp ds-DNA VRT 26-JUL-1990 DEFINITION Chicken pie-alpha-globin gene, fragment H10. ACCESSION M30486 KEYWORDS pie-alpha-globin. SOURCE Chicken AEV transformed erythroblast DNA, fragment H10. ORGANISM Gallus gallus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Aves; Neornithes; Neognathae; Galliformes; Phasianidae. REFERENCE 1 (bases 1 to 582) AUTHORS Broders,F., Zahraoui,A. and Scherrer,K. TITLE The chicken alpha-globin gene domain is transcribed into a 17-kilobase polycistronic RNA JOURNAL Proc. Natl. Acad. Sci. 
U.S.A. 87, 503-507 (1990) STANDARD simple staff_review FEATURES from to/span description mRNA 241 > 582 pie-alpha-globin fragment mRNA H10 BASE COUNT 171 a 128 c 108 g 175 t ORIGIN 1 tccaaaaaac ttactctgct tgtaaatgtc gtctcctttt tcggagacaa aaacttgata 61 ccttcttgcc ttgtccgaag tcactttatc ggttatagga cccaagtttt gggccttgct 121 agaaggatac aattccctat gaccgccgta ttttggggta ctcgcattcg cccgacatcg 181 agtggacctc ctttttttct cttgtcgttc gtagaggtta tcgaggtccc cccatatata 241 ataaccctat cgtgagttta gacttcctac aaaaacttct gtcgtttaat gttttcgtac 301 cgtcacggtg actgtccagt aatcaaagtt gtcactgtct aaaaagattc gacaacttcg 361 tcttaccaat gcgaaaactc gactagagac actcgtcaac tcacacattc aataaagaat 421 taccgaggtc ttttaatgta gtgaaatcac gatattgaga ggtaaaggta gaaacatacg 481 cattaaccta aagaacttat gaagtctgta attaaaggac cacaagcaat acgaaagaca 541 atgtatttct tctaacgtcg gataagtatt aggatggacg tc // LOCUS ECOPHOAA 600 bp ds-DNA BCT 26-JUL-1990 DEFINITION E.coli alkaline phosphatase (phoA) gene, 5' end. ACCESSION M33536 KEYWORDS alkaline phosphatase. SOURCE E.coli (strain K-12) cell line BW7710 DNA. ORGANISM Escherichia coli Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Facultatively anaerobic rods; Enterobacteriaceae. REFERENCE 1 (bases 1 to 600) AUTHORS Agrawal,D.K. and Wanner,B.L. TITLE A phoA structural gene mutation that conditionally affects formation of the enzyme bacterial alkaline phosphatase JOURNAL J. Bacteriol. 172, 3180-3190 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by D.K.Agrawal, 03-APR-1990. The phoA503 mutation does not interfere with export of active enzyme but does interfere with assembly. FEATURES from to/span description pept 283 > 600 alkaline phosphatase precursor (phoA) (EC 3.1.3.1) sigp 283 345 alkaline phosphatase signal peptide matp 346 > 600 alkaline phosphatase variant 413 413 c in wild type; t in phoA503 mutation BASE COUNT 159 a 130 c 151 g 160 t ORIGIN Map position 8.7 minutes; 1 bp upstream of HindIII site. 
1 aagctttgga gattatcgtc actgcaatgc ttcgcaatat ggcgcaaaat gaccaacagc 61 ggttgattga tcaggtagag ggggcgctgt acgaggtaaa gcccgatgcc agcattcctg 121 acgacgatac ggagctgctg cgcgattacg taaagaagtt attgaagcat cctcgtcagt 181 aaaaagttaa tcttttcaac agctgtcata aagttgtcac ggccgagact tatagtcgct 241 ttgtttttat tttttaatgt atttgtacat ggagaaaata aagtgaaaca aagcactatt 301 gcactggcac tcttaccgtt actgtttacc cctgtgacaa aagcccggac accagaaatg 361 cctgttctgg aaaaccgggc tgctcagggc gatattactg cacccggcgg tgctcgccgt 421 ttaacgggtg atcagactgc cgctctgcgt gattctctta gcgataaacc tgcaaaaaat 481 attattttgc tgattggcga tgggatgggg gactcggaaa ttactgccgc acgtaattat 541 gccgaaggtg cgggcggctt ttttaaaggt atagatgcct taccgcttac cgggcaatac // LOCUS GCOEARA 1771 bp ds-DNA PLN 26-JUL-1990 DEFINITION G.tikvahiae McLachlan 18S ribosomal RNA gene. ACCESSION M33640 KEYWORDS 18S ribosomal RNA. SOURCE G.tikvahiae McLachlan (isolate Pomquet Harbour-Nova Scotia) DNA. ORGANISM Gracilaria tikvahiae McLachlan Eukaryota; Plantae; Thallobionta; Rhodophycota; Rhodophyceae; Florideophycideae; Gigartinales; Gracilariaceae. REFERENCE 1 (bases 1 to 1771) AUTHORS Liu,Q.-Y., Bird,C.J., Rice,E.L., Murphy,C.A. and Ragan,M.A. TITLE Nucleotide sequence of the 18S ribosomal RNA gene from the red alga Gracilaria tikvahiae mclachlan JOURNAL Unpublished (1990) See COMMENT for author address STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by A.Ragan 08-APR-1990. 
Atlantic Research Lab, National Research Council of Canada, 1411 Oxford Street, Halifax, Nova Scotia CANADA B3H 3Z1 FEATURES from to/span description rRNA 1 1771 18S ribosomal RNA BASE COUNT 445 a 371 c 501 g 454 t ORIGIN 1 ccacctggtt gatcctgcca gtggtatatg cttgtttaaa ggactaagcc atgcaagtgc 61 aagtatgagt gaattgtaca acgaaactgc gaatggctcg gtaaaacagc tataatttct 121 tcggtgctaa atactactcg gatacccgta gtaattctag agctaatacg tgcctccata 181 acgacgcaag tcgtggtaca aattagagat acaagccaac ttgttggtga ttctagattt 241 tttttctgat cgcactcgtt gcgacgcacc gttcaaattt ctgacctatc aactttggat 301 ggtaaggtat tggcttacca tggttgtgac gggtaacgga ccgtgggtgc gggattccgg 361 agagggagcc tgagagacgg ctaccacatc caaggaaggc agcaggcgcg caacttaccc 421 aatccggaca ccgggaggta gtgacaagaa atatcaatag agggcccgat gggttttcta 481 attggaatga gaacaaggta aacagcttat cgaggagcca gcagagggca agtctggtgc 541 cagcagccgc ggtaattcca gctctgtaag cgtataccaa agttgttgca gttaaaacgc 601 tcgtagtcgg attttggcgt ctgacttggg tcgtcctcgc ggacgctctc aggttgggcg 661 cctttgtgga tgggagtcag gtggtgcttc actggatcgc ttggctgccg ccaccgttta 721 ctgtgaaaaa attagagtgt tcaaagcagg cgattgccct gaatacatta gcatggaata 781 atagaatagg acccggtcct attttgttgg tttgtttgaa tcgggtaatg attaagaggg 841 acggttgggg gcattcgtat tccgacgtca gaggtgaaat tcttggattg tcggaagacg 901 aacagctgcg aaagcgtctg ccaaggacgt tttcattgat caagaacgaa agtaagggga 961 tcgaagacga tcagataccg tcgtagtctt tactataaac gatgaggact ggagatcgga 1021 taagactgat atatggctta tccggcatcc ttcgagaaat caaagtgttt gctttctggg 1081 gggagtatgg tcgcaaggct gaaacttaaa ggaattgacg gaagggcatc accgggtgtg 1141 gagcctgcgg cttaatttga ctcaacacgg gaaaacttac caggtcagga catagtaagg 1201 attgacagat tgagagctct ttcttgattc tatggttggt ggtgcatggc cgttcttagt 1261 tggtggagtg atctgtctgg ttaattccgt taacgagcga gacctgggcg tgctagctag 1321 gcgccgttac tatttttggt agcgaggctt gccttcctag acggactgtg ggcgtctagc 1381 ccacggaagc tccaggcaat aacaggtctg agatgccctt agatgtcctg ggccgcacgc 1441 gtgctacact gaacgggtca acgagttagg atatgcgaaa gcatttccca atctctaaat 1501 ccgttcgtga 
tggggatcga cggttgcaat tttccgtcgt caacgaggaa taccttgtaa 1561 gcgcgggtca tcatcccgcg ctgaatacgt ccctgccctt tgtacacacc gcccgtcgct 1621 cctaccgatt gagtggtccg gtgaggcctt gggagagcta gatgaactga ttattcagat 1681 cttttggctt gaacttggtc aaaccttatc acttagagga aggagaagtc gtaacaaggt 1741 ttccgtaggt gaacctgcag aaggatcaag c // LOCUS HS6MCP 4440 bp ds-DNA VRL 26-JUL-1990 DEFINITION Human herpesvirus type 6 major capsid protein (MCP) gene, complete cds. ACCESSION M33515 KEYWORDS major capsid protein. SOURCE Human herpesvirus type 6 DNA. ORGANISM Human herpesvirus type 6 Viridae; ds-DNA enveloped viruses; Herpesviridae; Alphaherpesvirinae. REFERENCE 1 (bases 1 to 4440) AUTHORS Littler,E., Lawrence,G., Liu,M.-Y., Barrell,B.G. and Arrand,J.R. TITLE Identification, cloning, and expression of the major capsid protein gene of human herpesvirus 6 JOURNAL J. Virol. 64, 714-722 (1990) STANDARD simple staff_review FEATURES from to/span description pept 235 4272 major capsid protein (MCP) BASE COUNT 1422 a 1169 c 785 g 1064 t ORIGIN 1 tatcgtgaac gatatttggc ccggacgttt gaaaaatttt ctctatgatt gactcgatct 61 tttccagaac tacaggcatg gatcgcgcta aacgagtttc ctcgtcgcga gacacttcag 121 cggtcagatc acacgaatct ataaaaactg gaatcgaccg tgcacaagtg gaaccaaaac 181 atgaattaac tattaaagtt tcacaattac cggtgtgctg cataacgccg aaacatggaa 241 aattggcagg cgaccgaaat tttacctaag atcgaagcac ctctaaatat tttcaatgac 301 attaaaacat acacagccga acaacttttt gacaatttgc gaatttattt cggtgacgat 361 ccgagccgtt acaacatcag ttttgaagcc ttactcggaa tctactgcaa caaaatagaa 421 tggattaact ttttcaccac gccgatcgcc gttgcagcga acgtaatccg cttcaatgat 481 gtgagtcgaa tgaccctcgg gaaggttctc ttctttattc aattacctag agtcgctaca 541 ggaaacgacg taactgcttc aaaagaaacc accatcatgg tagccaaaca ctcagaaaaa 601 caccccataa acatatcgtt cgatttgagc gctgcctgtc tggaacatct ggaaaacaca 661 tttaaaaaca cagtcatcga tcagatttta aacatcaatg cgttacatac agtcttaaga 721 tctttaaaga attcagccga ttcgctcgag cgaggtttga ttcacgcatt catgcaaacc 781 ttattgagaa aatctccccc gcaatttatc gtcctgacca tgaatgagaa caaagtacat 841 
aataaacaag ctctgagccg agtacagcgc agcaacatgt ttcagagcct gaagaacaga 901 ttgttaacgt cattattttt tttgaacagg aataataata tttcatatat ctatagaatt 961 ctaaacgaca tgatggaatc ggtcacggaa agcattctaa atgatacgaa caactacact 1021 tccaaagaaa acgtccccct agatggtgtt ttattaggac cgatcggctc tatccaaaaa 1081 ctcaccagca tactctccca gtacatctcc acacaagtcg tctccgcccc aatctcatat 1141 ggtcacttta ttatgggcaa agaaaacgca gtgactgcga ttgcataccg tgcaatcatg 1201 gccgatttta ctcaattcac cgtgaacgcc gggacagaac aacaagacac taacaacaaa 1261 tcagaaatct tcgacaaaag ccgcgcgtac gccgacctaa agctgaacac gttgaaattg 1321 ggagataaat tagtcgcatt cgaccaccta cacaaagttt acaaaaacac agacgtcaac 1381 gatccgctag aacagagctt acaactaaca ttctttttcc ctttgggtat ctacataccg 1441 agcgagaccg gtttcagtac aatggaaaca cgtgtgaaat taaacgacac catggaaaac 1501 aacctaccca ccagcgtttt tttccacaat aaagaccaag tcgtgcagcg aattgatttt 1561 gccgacatat taccgtcggt ttgccatccc attgtccacg actcgaccat cgtcgaacga 1621 ctcatgaaaa gcgaaccatt gcctaccggc caccgctttt cccaactatg tcaactaaaa 1681 attacccgag aaaacccagc caggatctta cagaccttat acaacttata cgaaagtcga 1741 caagaagtac ccaaaaacac caacgtctta aaaaacgaat taaacattga agatttttac 1801 aaaccggaca atccaacact gccgaccgaa agacacccct tcttcgatct cacgtatatc 1861 cagaaaaacc gagccacaga agtactctgc acaccaagaa taatgatagg caacatacct 1921 ttaccgttag ctccagtctc tttccacgaa gcccgtacaa atcaaatact ggaacatgca 1981 aagacgaact gccaaaagta cgacttcacc ctcaaaattg tcaccgaaag cttgacgagt 2041 ggctcgtacc cagaattggc ttacgttatc gagaccttag tgcatggaaa caagcatgct 2101 tttatgatcc taaaacaagt aattagccag tgtatttctt attggtttaa catgaaacat 2161 atacttcttt tttgcaacag cttcgagatg atcatgctaa tctctaacca catgggcgac 2221 gaactgatcc cgggagcagc tttcgctcac tacagaaatc ttgtgtcgct aattcgccta 2281 gtgaagagaa caatctctat ctccaacctc aacgagcaac tttgcggcga acctctggtg 2341 aatttcgcca acgcgttgtt cgacggacgt ctgttctgcc cgttcgtcca taccatgccc 2401 agaaacgaca cgaatgcaaa aataacagcg gatgatacac cactgacaca gaacaccgta 2461 agagttagaa attacgaaat atccgatgtg caaagaatga atctaataga ttcaagcgtc 2521 gtctttaccg 
acaatgacag accatcgaac gaaaccacca tcctgagcga gatattttac 2581 ttctgcgtac tcccggcact atcaaataac aaggcctgtg gcgctggcgt caacgtaaag 2641 gaactagttc tagacttatt ctacacggaa ccgttcatca gtccagatga ttatttccag 2701 gagaatccga ttaccagcga cgttctaatg tctctgatcc gagaaggtat gggccctggc 2761 tacaccgtag ccaacacatc ctgtatcgca aaacagttgt ttaaatcgct aatctacatt 2821 aatgaaaata cgaaaatatt ggaagtggaa gtctccttag atcccgcgca gcgacacggc 2881 aactccgttc attttcaatc actacaacac attctataca acgggctttg cctgatctca 2941 ccgatcacca ccctaagacg gtactatcaa ccaatcccat ttcatcgatt cttctccgac 3001 ccgggaatct gcggcaccat gaatgctgat atccaagttt tcctaaatac atttcctcac 3061 tgtcaaagaa acgacggcgg ttttcctctc ccgcccccat tagcattaga attttataat 3121 tggcaacgaa caccgttttc cgtgtactca gccttctgcc ccaattccct gttgagcatt 3181 atgacgcttg ccgccatgca ctcaaaattg tctcccgttg ccatagcgat ccaaagcaaa 3241 aacaaaatcc atccgggctt tgcggccaca ctagtccgga cggataattt cgacgtcgag 3301 tgcctattat acagttccag agcagccaca tctataattt tagacgatcc cacggtcacc 3361 gcggaagcta aagatatcgc aaccacttac aacttcaccc agcacctaag ttttgtagat 3421 atgggcttag gttttagctc taccaccgcc actgccaatc ttaagcgaat taaatcagat 3481 atggggagca agatacaaaa ccttttctcc gccttcccga tacacgcgtt taccaacgcg 3541 gacataaata cgtggattcg acatcacgtc gggatagaaa aacctaatcc ctccgagagc 3601 gaagcactaa acatcataac gttcggcgga attaacaaaa acccaccctc catactactg 3661 catggtcaac aagctatctg cgaagttata ctgaccccgg ttacgacaaa cattaacttt 3721 ttcaaatcgc cccacaaccc aagaggcagg gaatcatgta tgatgggaac ggacccgcac 3781 aacgaagagg cggctagaaa agcattgtac gaccacaccc aaacagacag cgatacattc 3841 gccgcaacca caaacccttg ggcatctcta ccaggctcct taggcgatat tctatacaac 3901 acggcacaca gagaacaact atgttacaac cccaagacat acagtcccaa cgctcaattt 3961 tttaccgaat ctgacatctt aaaaacaaac aagatgatgt acaaagtgat aagcgaatac 4021 tgcatgaaat cgaactcgtg tttaaacagc gatagcgaaa tacaatactc gtgctctgag 4081 ggcacggata gcttcgtaag cagaccatgc cagttcttac aaaacgctct gcctcttcac 4141 tgttcatcca accaagctct attagagagt cggtctaaaa ccggcaatac gcagatcagc 4201 gaaacccatt attgtaatta 
cgccatagga gaaaccatac ctttccaact cattatcgaa 4261 tcatccatat aaaatggaaa ccgtctactg cactttcgat cacaaactgt cactttccga 4321 tatcagcacc ctatgcaagc tcatgaacat cgtcataccg atcccagctc accaccatct 4381 aataggtagc ggcaatttag gtctttatcc catcgtctcc tccaacaaag attacgtcca // LOCUS HUMSEXREPB 916 bp ds-DNA PRI 26-JUL-1990 DEFINITION Human sex chromosome repeat, clone pDP330. ACCESSION M33524 KEYWORDS sex chromosome repeat. SOURCE Human cell line OXENII DNA, clone pDP320. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 916) AUTHORS Fisher,E.M.C., Alitalo,T., Luoh S,-W., de la Chapelle,A. and Page,D.C. TITLE Human sex-chromosome-specific repeats within a region of pseudoautosomal/Yq homology JOURNAL Genomics (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by E.M.C.Fisher, 03-APR-1990. FEATURES from to/span description site 1 388 low copy flanking sequence rpt 389 916 sex chromosome repeat BASE COUNT 206 a 228 c 229 g 253 t ORIGIN Chromosome Yp. 
1 gaattcaggc ctcagtgtat gtctgtaaca caacagacag ggtctgcagg ggtcgaagta 61 ttttgtcatc aaagaggaag gaatgatcat tcatcataaa aggcaagaca tctttggtgc 121 aaggaaaact caagaaaaat accgcagacc atgcaatgag gcactggtcg atggagtgtt 181 gtaaacccgt cttcccagag tggcatgcac atggatccct cagcacatgg gtgacacaca 241 gactatgctt cagcaggtct gtctgggccc aagacacatt gtttctcatc agctcccagg 301 ggatgtcaag gctgcagatc catggatctc actttgcagg acagagactt ggtaatggct 361 tcccagagtt gttacaaaga aatcccaaag actgggcccc ttaaacaaca accttgattc 421 tcacagtcct tgaggctaga agtctgagat caagctatgg ccagggctgg ttcctcctga 481 ggcctctctc cttgggttgt agatgctgtc ttctccctgt gtcctcacag ggttgtccct 541 ctgtgtgtgt ctgtgtcctc atctcctctt cttatgaggt gtcttagtcc atttcaggct 601 gctgtcacag catgccgtag actgggtggc ttatcagcaa cagacattga ttctcccaca 661 gtcctggaag ctggacgtct gagatcaggg tatgggcagg gctgcttcct cctgaggcct 721 ctgtcctggg cttgtagatg ctgtcttctc catgtgtccc catgtggtca tccctctgtg 781 ggtgtgtctg tttcctcatc tgctcttcta atgagatgtc ttagtccatt gcaggctgct 841 atcacagaat accataggct gggtggctta taaaccacag agttttattc ttccacagtc 901 ctggaggctg gaattc // LOCUS HUMSEXRPA 918 bp ds-DNA PRI 26-JUL-1990 DEFINITION Human sex chromosome repeat, clone pDP316. ACCESSION M33523 KEYWORDS sex chromosome repeat. SOURCE Human cell line OXENII DNA, clone pDP316. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 918) AUTHORS Fisher,E.M.C., Alitalo,T., Luoh S,-W., de la Chapelle,A. and Page,D.C. TITLE Human sex-chromosome-specific repeats within a region of pseudoautosomal/Yq homology JOURNAL Genomics (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by E.M.C.Fisher, 03-APR-1990. FEATURES from to/span description site 1 388 low copy flanking sequence rpt 389 918 sex chromosome repeat BASE COUNT 207 a 242 c 226 g 243 t ORIGIN Chromosome Yp. 
1 gaattcaggc ctcagtgtct gtctgtaacc caacagacgg tgtctgcaga gatcgaagta 61 ttttgtcgtc gaagaggaag gaatgatcat tcatcacaaa aagcaagaca tctttggtgc 121 aaggaaaact cgaggaaaat accgcagacc atgcaatgag gcactggttg acggtgtgtt 181 ataaacccgt cttcccagag tggcatgcac acggatccct caggacatgg gtgacacaca 241 gactatgctt cagcaggtct gtctgggccc aagacacagt gtttctcatc agctcccagg 301 ggatgtcaag gctgcagatc catggatctc actttgcagg acagagactt ggtaatggct 361 tcccagagtt gttacaatgc aatcccaaag actgggcagc ttaaacaaca accttgattc 421 tcccacagtc ctggaagctg gaagtctgag atcaaggtgt gggcagggcg gttcctcctg 481 agtcctctct cctgggcttg tagatgccgt cttctccctg agtccccacg tggtcatccc 541 tctgtgtgcg tctgtgtcct catctcctct tcttatgagg tgtcttagtc catttcaggc 601 tgctgtcaca gcataccata gactgggtgg cttataagca acagacattg attctcccac 661 agccctggag gctggacgtc ttgagatcag gatatgggca aggctgtttc ctcctgaggc 721 ctctgtcctg ggcttgtaga caccatcttc tccctgtgtc cccacgtggt catccctcta 781 tgtgcatgtc tgtgtcctca tctgctcttc ttatgagatg tcttagtcca ttgcaggctg 841 ctatcacaga ataccatagg ctgggtggct tacaaaccac agacttttat tctcccacag 901 tcctggaggc tggaattc // LOCUS IRICAP 2461 bp ds-DNA VRL 26-JUL-1990 DEFINITION Iridescent virus type 1 capsid protein gene, complete cds. ACCESSION M33542 KEYWORDS capsid protein. SOURCE Iridescent virus type 1 DNA. ORGANISM Iridescent virus type 1 Viridae; ds-DNA nonenveloped viruses; Iridoviridae. REFERENCE 1 (bases 1 to 2461) AUTHORS Tajbakhsh,S., Lee,P.E., Watson,D.C. and Seligy,V.L. TITLE Molecular cloning, characterization, and expression of the Tipula iridescent virus capsid gene JOURNAL J. Virol. 
64, 125-136 (1990) STANDARD simple staff_review FEATURES from to/span description pept 601 1995 capsid protein mRNA 587 > 2461 capsid protein mRNA ( 5' end +/- 5 bp) BASE COUNT 717 a 462 c 443 g 839 t ORIGIN 1 gaaggtgttg aaagatctac tgaaataggc ttcattagca tttttatttt gtccacaaat 61 tcattatttt taataggctg ttcttcacct ttattcgcat attcaaagta atcgattaaa 121 tttttttgaa tatggacgat atcatccatg aacataaacc aaacttcata atatatagta 181 tggagtaacg ggttaattaa accattgatt ccttttaatt gttttggatt aatgaggttt 241 aaatcatcat aaattttttc tatttttttt aaattttttc gagcaatttt taaatttgat 301 ttaaccaaac aaacttcctc tactttaatt gttacggttg gtacttttaa accattaatt 361 ttatttttag aggaagaaca acgctttatt aaagcgttgg aatccattaa tcgcttgttt 421 tatcataggt tattttttaa ctataaaaaa ataactaaat tactacagtt accaatatgt 481 cggcattagt tctccttcat attttcgtat tttataccct taaatttaac ctaatcaatt 541 tctacattta tttttgggtt caaaattttt agccgaaata ttgctactaa taaattaaac 601 atgtctatgt cctcatcgaa tataacctca gggtttatcg atatcgccac ttttgacgaa 661 atcgaaaaat atatgtatgg cggcccaaca gcaacagcat actttgttag agaaattaga 721 aagtcgactt ggttcactca agtaccagtt ccactatcta gaaatactgg taatgcggct 781 tttggacaag aatggtcggt atctatatca cgtgctggag attatttgtt gcagacctgg 841 ttacgagtca atatcccacc agttactctt agtggtctac ttggtaacac ttactcttta 901 agatggacca aaaatttaat gcataacttg attcgtgaag ccaccattac ctttaatgat 961 ttggttgcag ctcgatttga taactatcat ttggatttct ggtctgcttt caccgtacct 1021 gccagcaaac gcaatgggta tgataacatg attggtaatg tctcttcttt aattaatcca 1081 gttgctccgg gtggtacttt gggtagcgta ggtggtatta accttaatct tccacttcca 1141 tttttcttct ctcgagatac tggtgtagca ctaccaacag ctgctctacc ttacaatgag 1201 atgcaaatca actttaattt cagagattgg catgagcttt tgattttgac taacagtgct 1261 ctagtaccac cagcaagtcc atatgttcca attgttgtag gtactcatat ttcagctgct 1321 ccagttttag gaccagttca agtatgggct aactatgcca tcgtctccaa cgaagaacgt 1381 cgtagaatgg gttgtgccat tcgagacatt ttgattgaac aggttcaaac ggcaccacgt 1441 caaaattatg tacctttgac caatgctagt ccaacatttg atattcgttt ctctcatgca 1501 atcaaagcat tattctttgc 
tgtacgaaat aaaacatctg cagcagaatg gtcaaattat 1561 gctacttctt ctccagttgt tactggtgca acggttaact acgaaccaac aggttctttt 1621 gaccctattg ccaatacaac attgatttat gagaacacta atcgtttggg tgccatggga 1681 tcagattact tctctttgat taatccattc tatcatgctc caactattcc atcattcatt 1741 ggatatcatt tgtactcata ttctcttcac ttttatgact tggatccgat gggttctacc 1801 aattacggta aactcactaa tgtgtctgtt gtaccccaag ctagtccggc agcaattgcg 1861 gcagcaggag gtactggtgg tcaagcaggt tcagattacc ctcaaaatta tgaatttgtc 1921 atattagctg tcaataataa tattgtcaga atatcaggtg gagaaacacc acaaaattac 1981 atagcagttt gttaaggtaa tttgtaacgc tccacaacag gcggaagtgg tctcgtgaga 2041 gaccgatatt gaggttttat caaccttaat ttgaatcatg aattaacatg atactttggt 2101 accgtctagt cggcttatat gtcgggctaa tggtcttttt tgatcatcaa gtggctataa 2161 gtggtacgtc gacgacagtc gacacctagt ggtttaataa aggtttttta cccaaattaa 2221 actggaacag gcaaggttga tgaaaacggt caaaattcag atagtctcgg gggctatttt 2281 ggacaagacc gtcggtgcag ctaatgcgta agcatcagtg atatcgctat cgactgggtc 2341 atcaatcggt tgtcctatct gactttttaa agtctcagga tggctcaatg tacagtcagc 2401 ccgcagtaag gtgtattccg agctgtcttt gaggataaaa gtaaacttga aaaagaagct 2461 t // LOCUS MUSIGHAAR 363 bp ss-mRNA ROD 26-JUL-1990 DEFINITION Mouse Ig rearranged H-chain mRNA V-D-J region, partial cds. ACCESSION M33679 KEYWORDS diversity exon; immunoglobulin heavy chain; joining exon; processed gene; variable region. SOURCE Mouse (strain A/J) hybridoma cell line 45-49, cDNA to mRNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 363) AUTHORS Parhami-Seren,B., Wysocki,L.J., Margolies,M.N. and Sharon,J. 
TITLE Clustered heavy chain somatic mutations shared by anti p azophenylarsonate antibodies confer enhanced affinity and ablate the cross-reactive idiotype JOURNAL Unpublished (1990) See COMMENT for author address STANDARD full staff_review COMMENT Draft entry and computer readable sequence for [1] kindly submitted by B.Parhami-Seren, 11-APR-1990. Massachusetts General Hospital, Jackson 1402, Blossom Street Receiving, Boston, MA 02114 FEATURES from to/span description pept < 1 > 363 Ig heavy chain V-D-J region (AA at 1) BASE COUNT 98 a 83 c 89 g 93 t ORIGIN 1 gaggttcagc ttcagcagtc tggagctgag ttgatgaggc ctgggtcctc agtgacgatg 61 tcctgcaagg cttccggata tgcaatcaca agctacggtt taaactgggt gaaacagagg 121 cctggacagg gcctggaatg ggttggatat attcatcctg gaaaaggtta tattcactac 181 aatgaaaaat tcaagggcaa gaccacactg actgtagaca aatcctccaa tacagcctac 241 atgcaggtca gaagcctgac atctgaggac tctgcagtct atttctgtgc aagatcgttt 301 tttgacattt acatgtatta ctttgactac tggggccagg gcaccactct cacagtctcc 361 tca // LOCUS MUSIGKABF 324 bp ss-mRNA ROD 26-JUL-1990 DEFINITION Mouse Ig rearranged L-chain mRNA V-J region, partial cds. ACCESSION M33678 KEYWORDS immunoglobulin light chain; joining exon; processed gene; variable region. SOURCE Mouse (strain A/J) hybridoma cell line 45-49, cDNA to mRNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 324) AUTHORS Parhami-Seren,B., Wysocki,L.J., Margolies,M.N. and Sharon,J. TITLE Clustered heavy chain somatic mutations shared by anti p azophenylarsonate antibodies confer enhanced affinity and ablate the cross-reactive idiotype JOURNAL Unpublished (1990) See COMMENT for author address STANDARD full staff_review COMMENT Draft entry and computer readable sequence for [1] kindly submitted by B.Parhami-Seren, 11-APR-1990. 
Massachusetts General Hospital, Jackson 1402, Blossom Street Receiving, Boston, MA 02114 FEATURES from to/span description pept < 1 > 324 Ig light-chain V-J region (AA at 1) BASE COUNT 96 a 77 c 73 g 77 t 1 others ORIGIN 1 gatatccaga tgacacagac tacatcctcc ctgtctgcct ctctgggaga cagagtcacc 61 atcagntgca gggcaagtca ggacattagc aattatttaa actggtatca gcagaaacca 121 gatggaactg ttaaactcct gatctactac acatcaaaat taaagtcagg agtcccatca 181 aggttcagtg gcagtgggtc tggaacagat tattctctca ccattagtga cctggagcat 241 gaagacattg ccacttactt ttgccaacag ggtaatacgc ttcctcggac gttcggtgga 301 ggcaccaagt tggaaatcaa acgg // LOCUS MUSTCVYAN 2567 bp ds-DNA ROD 26-JUL-1990 DEFINITION Mouse T cell receptor rearranged beta-chain gene, V-2 region, 5' end. ACCESSION M33500 KEYWORDS T cell receptor; beta-chain; processed gene; variable region. SOURCE Mouse (strain BALB/c) DNA, hybridoma B.1.1. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 2567) AUTHORS Ratanavongsiri,J., Igarashi,S., Mangal,S., Kilgannon,P., Fu,A. and Fotedar,A. TITLE Transcription of the T cell receptor beta-chain gene is controlled by multiple regulatory elements JOURNAL J. Immunol. 
144, 1111-1119 (1990) STANDARD simple staff_review FEATURES from to/span description pept 2544 > 2567 T cell receptor beta-chain V-2 region precursor sigp 2544 > 2567 T cell receptor beta-chain signal peptide mRNA 2478 > 2567 T cell receptor beta-chain mRNA BASE COUNT 708 a 560 c 583 g 716 t ORIGIN 1 ctaaagttct tggctactgt tgtgtgcact ttgagtaatg attaagatgc attgggacag 61 ggggtggaga aatgtcccaa ggaggtagcc atgacctcca acactggtcc tgtggaggcc 121 ccgaggagct agctagccat ctgatctgga aacaagaggc ttaacctggc tcagtactga 181 aagctggtca agataagagg gggcaggcag atacctggag gcactgacct tgggaggcag 241 gaaggttagc aagggagata actggagtgt gagagacatt ctgatcccaa tcttgttaga 301 ggattaggct gaagagggtt cagtgtgaag ctcagtaaac tgagaagggc ctaggtttcc 361 ttctcctgga gtctgcttgg ctggacagag cacactgtcc ttagaaaagc aacagagctc 421 tcctggagga gctaggagcc actgacttca gacccaggga atatcttctc taccctcttc 481 cttctggctc ttaaggaggc tcacagggag cttatttagc tttttaagga gatttataga 541 ggctggagga acttgttttt tcaaaagtaa atgctctaga aaaatgaagg ttgaaggtgt 601 tatcaaactt gtgggtcaaa gctaaatgaa aaaaaaaatc aaaagaagga catgtctatt 661 cccaacataa gcagaagact tttattataa atatggtggg agaccatagt cagagacaga 721 gacagctggg aaaggccagc atgaacttga ccctgagcct ggacatctga ggacttgggg 781 gagcaggtgg gaagaaagaa gagagaaaag agagaagagg ggagaccagg agagtaaaga 841 gtagacaaaa ggacagcata gcaaaaatag ctggatttat aggggaaggt agctggggaa 901 aaggcagccc atcccctggg ctggagaagt ttagattaga gggtctgtat tctggccata 961 tcatatacta ggtaggacta aggaatgctg agtgaagctg gcatccaggt ccacaatgac 1021 atgttaaata agaacttcag ttagccattt gctttgggat tgaggcataa taaacgccag 1081 taccccaagc cagctctgtc cacttgtcct cagtaagtga acttaaacag ccaaaccagt 1141 aatctaaata actaactaac taactaacta aatcaatcaa tcaatcaatc aataaaagta 1201 gaaaagattt tttcagtgta aacacattgg taacatggaa aaagatccag agatccagta 1261 aactccctgt gtcagtcttg gggacctgca ggcaagatgg aagtttagag ggccaaggat 1321 aagcaatcta gctcaaagta tggtcctgcc ctgcattgac ccattgccta ggcttgttaa 1381 agctgtgtga aatctctttc caggagatac attcccactc tcgctggtgc ctttcctttc 1441 ttccatgttt 
tcctggggaa atttctcttt ctttggggtc acttttatca atagcctgct 1501 gttcagattg aaagactgtc tctttagaat gtctttattt ctgccaggtc agttatagaa 1561 agtggcatgt tttcctttat tcaggacaaa actcccattt tgattttctg cttgcattcc 1621 tggagtcaga cagatgagta ttcactgcat acagcctcgt ataaccctgc aaccacctcc 1681 acatgttcac ttaaatggag acattttact ctcttgcaag agcttgaaac tcaaactcag 1741 atctgtgaaa ctataaatcc agtttccttc catccctgct cctggagtga tgaccctgag 1801 actaattatc aataaatgcc tagagcataa gctccagcta gttctctgac ttgctctcaa 1861 cttattatgc cttttattct aacccagctt tagctacatg gctggtttcc tctccttgtc 1921 ttcttacttc agtctcctca gcattacagc tcgaatctct gttctatttc tcaagttcct 1981 ctacctgctg gattatgtcc ttttcctcag tgttccaggc aatctctact tttattctat 2041 cttgagtgac tagttacttc tgctcagctc ccatgattct gacctcctgt gttttgcagg 2101 caaatcttcc atgccctctc ctactatttc ccagaattct ctctattcct gctggatgtc 2161 ccacctactt cctgcatcag ctcattggcc ataagctttt ttattgacag gtgatactta 2221 acacatatca cttccaggaa tatctgttca ccactgagaa gatgcagggg cccagtcact 2281 gcactcagtt ctgtagtgag tgtacaatgt gcatgagtgt ggatgagaga gcattgctca 2341 gaccacagga aagggtgcaa accttcagtt tgaggttttc actttagagg aaagcttagt 2401 cagtttcctg aggaagtcac accctttgga acctcagccc caagacttaa gtttctcgtt 2461 accaccttac tggtttggat tctcttctct tgcctgatgc cctgcatgcc ccacagagat 2521 agagagaacc tgaggtctca gagatgtggc agttttgcat tctgtgc // LOCUS R751TRA 578 bp ds-DNA BCT 26-JUL-1990 DEFINITION Plasmid R751 traJ and traK genes, 5'end. ACCESSION M25422 KEYWORDS inverted repeat; transfer origin region. SOURCE Plasmid R751 (strain HB101, Inc P-beta) DNA. ORGANISM Plasmid R751 Prokaryota; Bacteria. REFERENCE 1 (bases 1 to 578) AUTHORS Lanka,E. and Euerste,J.P. TITLE Conjugative transfer of promiscuous IncP plasmids: Interaction of plasmid-encoded products with the transfer origin JOURNAL Proc. Natl. Acad. Sci. U.S.A. 86, 1771-1775 (1989) STANDARD full staff_review COMMENT Draft entry and printed sequence for [1] kindly submitted by E.Lanka, 17-JUL-1989. 
FEATURES from to/span description pept 48 < 1 (c) traJ protein pept 403 > 578 traK protein signal 243 211 promoter PL signal 266 294 promoter PR rpt 49 64 inverted repeat rpt 118 157 inverted repeat rpt 296 331 inverted repeat BASE COUNT 141 a 168 c 163 g 106 t ORIGIN 1 cggccgtgtt ccttttcgtc gttctccatg cctcgcctcg tctctcatgc cggcggtagc 61 cggctgcctc gcagagcagg atgacccgtt gagcgccccc ggcgcgaata agggacagtg 121 aagatagata accggctcgc cggttagcta acttcacaca tcctgcccgc cttacggcgt 181 taataacacc aaggaaagtc tacaccagcc attacgattt atccgcaact atcgcgctat 241 caggccgcaa aagcagcaac ggatatagcg aaacccgcca caatggccca taatgccgct 301 atcgaagcgt gccaatgcac gccgatagcg gactttttgc gtttccgtag cgccgcttag 361 tagcgttaca tttgcgatga gaggattaga tggacgaaca cgatgccaaa gacctacccc 421 gaagagctgg ctgaatgggt gaagggacgg gaagccaaga agccgcgcca ggacaagcac 481 gtggtcgcgt tcctggccgt caagagcgac gttcaagcgg cgctcgatgc gggctatgcg 541 atgaaaacga tctgggagca catgaaggaa accggccg // LOCUS RP4TRAB 571 bp ds-DNA BCT 26-JUL-1990 DEFINITION Plasmid RP4 traJ and traK genes, 5' end. ACCESSION M25423 KEYWORDS inverted repeat; transfer origin region. SOURCE Plasmid RP4 (strain HB101, IncP-alpha) DNA. ORGANISM Plasmid RP4 Prokaryota; Bacteria. REFERENCE 1 (bases 1 to 571) AUTHORS Lanka,E. and Euerste,J.P. TITLE Conjugative transfer of promiscuous IncP plasmids: Interaction of plasmid-encoded products with the transfer origin JOURNAL Proc. Natl. Acad. Sci. U.S.A. 86, 1771-1775 (1989) STANDARD full staff_review COMMENT Draft entry and printed sequence for [1] kindly submitted by E.Lanka, 17-JUL-1989. 
FEATURES from to/span description pept 26 < 1 (c) traJ protein pept 394 > 571 traK protein rpt 48 63 inverted repeat rpt 118 157 inverted repeat signal 219 192 promoter PL rpt 281 318 inverted repeat signal 246 272 inverted repeat BASE COUNT 135 a 148 c 181 g 107 t ORIGIN 1 ctggttggct tggtttcatc agccatccgc ttgccctcat ctgttacgcc ggcggtagcc 61 ggccagcctc gcagagcagg attcccgttg agcaccgcca ggtgcgaata agggacagtg 121 aagaaggaac acccgctcgc gggtgggcct acttcaccta tcctgcccgg ctgacgccgt 181 tggatacacc aaggaaagtc tacacgaacc ctttggcaaa atcctgtata tcgtgcgaaa 241 aaggatggat ataccgaaaa aatcgctata atgaccccga agcagggtta tgcagcggaa 301 aagcgctgct tccctgctgt tttgtggaat atctaccgac tggaaacagg caaatgcagg 361 aaattactga actgagggga caggcgagag acgatgccaa agagctacac cgacgagctg 421 gccgagtggg ttgaatcccg cgcggccaag aagcgccggc gtgatgaggc tgcggttgcg 481 ttcctggcgg tgagggcgga tgtcgaggcg gcgttagcgt ccggctatgc gctcgtcacc 541 atttgggagc acatgcggga aacggggaag g // LOCUS STAREPEBR 2389 bp ds-DNA BCT 26-JUL-1990 DEFINITION S.aureus ethidium resistance (ebr) and replication protein (repA) genes, complete cds. ACCESSION M33479 KEYWORDS ethidium resistance protein; replication protein. SOURCE S.aureus plasmid DNA. ORGANISM Staphylococcus aureus Prokaryota; Bacteria; Firmicutes; Gram-positive cocci; Micrococcaceae. REFERENCE 1 (bases 1 to 2389) AUTHORS Liao,J., C,-H., Moghazeh,S.L. and Projan,S.J. TITLE Genetic mapping and nucleotide sequence of pWBG32, an ethidium bromide resistance plasmid naturally occurring in Staphylococcus aureus JOURNAL Unpublished (1990) See COMMENT for author address STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by S.J.Projan, 30-MAR-1990. 
Public Health Res Inst, 455 First Avenue, RM 1166, New York, NY 10016 FEATURES from to/span description pept 1153 1476 ethidium resistance protein (ebr) BASE COUNT 796 a 403 c 290 g 900 t ORIGIN 1 ggtcaatatc tttaagataa tctaaatcgc cattttttaa tttatttctt gcgtctttaa 61 ataatccaga ataaacaaga atttgtttcc ctttaagaga tttataaaat gcgtcgaaca 121 ctttctgatt aattaaatag tcactatcct taccagaata tttagccatt tcatataatt 181 ctttattgct attttgctta attttttgaa catgaacttg cgtaatttca gaaattcctg 241 ttacatctcg ccataaattt aaccattctt tttgactaat ataagctttt gtatctttaa 301 aatatgattt attaacggcc atcaaaacat gaaaatgcgg attataatca tcacgctttg 361 agttatacgt tatctctaat tttcttacat aacctttagt gatcgcattt acttttttgc 421 gtttaaacat cttttgaaag gcatgattat aattcttaat ttcactttct aaatgctcat 481 ctgtaacgtt tggtgtcgta agtgtcaaaa agataaattg cttatcttct tcttgcttaa 541 tatattgcat cattaacgat aatcctaatg catcttttct tgctttacgc cacgcacata 601 ccggacaaaa tcgattctta caaggattcg atttatataa tttctttttt tcaaattttt 661 tatccgtcac aaaagacaaa aatgtattac aatttttaac caaatccatt tgatctcccc 721 gatatgacgt tcaataaaat ttttaaatac ttgatttctt tgctttttct cagtatactt 781 ttccatacga taatacacaa aaacaactta gttttctcaa aaactatgca taaaaaagtt 841 gcttttttct ccttttcttt ttttttcgtt tggattagac acctaaaacg atacaatagt 901 atgctagaaa aagcaacttt ttttgtgctt caaaccagtt ataccaatga attgaaaggg 961 ttatacatcg ccgggaatag ttacccttat tatcaagaca agaagaaact cgttttcaac 1021 tcgtttcaaa aacctttcaa aaaccatcaa tccacaaaaa taccacgcga atgacactca 1081 aaatacaaga ctacaattaa aaaatactta gaataaaatt aaataaaata cgaaaattaa 1141 aaggagttaa aaatgcctta tatttattta ataatagcca taagtactga agttattgga 1201 agtgcatttc ttaaatcttc agaaggcttt tcaaaattta taccatcctt aggaacaata 1261 atttcatttg gaatttgttt ctatttttta agtaaaacaa tgcaacacct accactaaat 1321 ataacttatg caacttgggc gggactaggt ttagtcttaa caaccgtagt ctcaataatt 1381 attttcaaag aacaaataaa tctaataact atagtatcta tagttttaat catagtcggc 1441 gtagtttcgt taaacatttt cggaacatcg cattaattgc tttattccaa ttgctttatt 1501 gacgttgagc ctcggaaccc ttaacaatcc caaaacttgt 
cgaatggtcg gcttaatagc 1561 tcacgctatg ccgacattcg tctgcaagtt tagttaaggg ttcttctcaa catcaataaa 1621 ttttctcggc ataaatgcca tgctataata gatacacgtc ttctcttagc gtttcatagt 1681 attatcctcg tttattatac ttataattat aggggaaggc ttagagctat cattttgata 1741 gctctttatt tttgttcaaa catttattca aaatcagaat gcctttattt tttaatttta 1801 aggggtattt tgaagaatta agggttattt atatagtttt atacctaaaa acttatatcg 1861 gctcttaaaa cgcaaataag agccgaataa aaataattgc ttttcacaaa caaaaatttg 1921 agcaaaacca gtgttgaatt ttttagacac tgcccatcta catgcaaatt taaaaattgg 1981 cataaaaaat gggcaaccat gctggttgaa cgctatagtt cctgcagggg caaaaaagca 2041 taaaaaaacg ctagctttga tgagctaacg ttagttataa aattcagtaa tatgcttttg 2101 taattcaata gattctcttt cttttttagc ttgtcttttt ttaaaacctt ctgaatttct 2161 agaagcctta tatatatcca ttattttttt ataatcaatg tcgtaaccat atttttgtaa 2221 ctcttctaca aaaaacttat cgcaatttaa tatcattttt cttcctcgat ttcgtttatc 2281 atttgatgat ttattttttc tttttcttgt tcagttaaat cataaatttc acttgctaag 2341 tattcttttt gattccaaat ataaaaaatt tgataaatat attcagtcg // LOCUS XANAVR 2100 bp ds-DNA BCT 26-JUL-1990 DEFINITION X.campestris avirulence protein (avrBs1) gene, complete cds. ACCESSION M32142 J03672 KEYWORDS avirulence protein. SOURCE X.campestris (strain E3, race 2, pv. vesicatoria) DNA. ORGANISM Xanthomonas campestris Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Pseudomonadaceae. REFERENCE 1 (bases 1 to 2100) AUTHORS Ronald,P.C. and Staskawicz,B.J. TITLE The avirulence gene avrBs-1 from Xanthomonas campestris pv. vesicatoria encodes a 50-kD protein JOURNAL Mol. Plant Microb. Interact. 1, 191-198 (1988) STANDARD full staff_review COMMENT Draft entry and printed sequence for [1] kindly submitted by P.Ronald, 15-FEB-1990. 
FEATURES from to/span description pept 308 622 ORF1 pept 713 2050 ORF2 BASE COUNT 656 a 423 c 505 g 516 t ORIGIN 1 ccattgtcgg cggttatccg ggtacttggc gtacaccaaa caactggggc aatgctggca 61 aatcacgtga cgaagccttg gcagacgagc aacagaggat tcaagcgctt aaatcgcaag 121 agacggtaca tatcttccat cgcaaagatg tcaagagcga acccgcaacc cacgcggggc 181 gacgttaagt aagccactga tttttagcga agaagagctt gtgagagctg cgggcgccaa 241 atatgtacgt ttgacagtga cagatcatct ttcaccacgg gcggacgata ttgatgcgtt 301 tattgcaatg gagcgggaga tggcccatga tgagagactg catgtacatt gtggtatggg 361 cctaggccgt acgacaatat ttattgtcat gcatgacata ctaagaaatg ctgcaatgtt 421 atcgtttgat gatatcatcg aacggcaacg taaatttaat ccagggcgaa gcttggataa 481 taataaagac gtttctgaca aggggcgctc agaatttcgt aatgaacggt cagagttcct 541 tcctctattc tacgagtacg ccaagcaaaa tccaaagggc cagccattgt tatggtccga 601 atggctcgac cacaatgcat aaatcgcaag tacattttcg gctatgacgg acttgtgctc 661 gatgcgctgg cggctttctc gataaatatc aattaatata aatatcgaac taatgtccga 721 catgaaagtt aatttctctt caaaaataat agattcaaca cccagtgaag aggaggtcgc 781 cactcagcaa gatagttata cgaaatctgg actggtggcg ccatcgctcg attcacaagc 841 cttgaaaaaa gcacctagaa aaagagtaat aaaagaaaat atagctgctt tgcacacctc 901 atcgttagag cgagttcatc aaaagaaggt attagttcag aatttagcgc agttgcagag 961 agggttggct aagataaatg gtagagtcga actcgaagag ctaattgatg gattttcagt 1021 caaggaattg ctaataaaaa gaaatccaaa gattgctgaa gagtatggag aaggaaatcc 1081 tttaatgatt cgatctctaa gattttcaaa cccccaagag gtgactagta agcttggggc 1141 ggaaggaaaa acgccagcca aaagagaggt tgatacgatt tgcaataaat ccacgctgca 1201 tgacattgtc atgacgcccg cctcccttgt aaaaaaggaa gtgcggatga acctgatatc 1261 tgaagtccca agggcgaagg ataaacaaaa atacagaggt cttccttcag tcgtatatgg 1321 ccaaagcagc cgccgtagtg aatcagacta tctaacgtct cgaaatggtt tcggcgacgt 1381 gcactctttg aaatccaata acgcatttaa ttccgactac gaaaaaatat gtgggtcgct 1441 tagccatgcc gaaaagttgg ggttaattga aaggaatctt actcccttta taaggcatga 1501 tccagataga atctccaccg actttgttca ctctattgaa gaattggctg aacaccagat 1561 gctattgcaa tcaagaaaac ctgccagtgc tttgcggcat aatgaatatt 
gcaccaagct 1621 tgaactgtgg gatgctaaag ctatagcagt tggtgaatct cgtgccttgg cggtcgctac 1681 cctgattgaa tttaatttgg agatgttgtc gatagcacaa gagatagatg atgatgggca 1741 caagagtaaa atggtcgccg attttatcga gcgccaacta tcatggcttg gcccacaaac 1801 cgcacttgac agcaagtcaa cgcttgaaag ggtttcagcg gtgaccatac aagaaaggga 1861 atttatcgct aatgagatta gccgatcgtt gcgtcaaggt gtttcacttt gcacttacga 1921 taaagatgaa gcaggaagtc atatccgtga aatgagtttg ttggatttta gggttgaaga 1981 aatcatagag gggataagta tttttatttc ctccaagctt ttacatgtta caaatgcagg 2041 agaagcgtaa gagaagaagt atccgccaca atcgtgcgac ggaccgacgt cctaacgccc // LOCUS YSCSCD25 5055 bp ds-DNA PLN 26-JUL-1990 DEFINITION S.cerevisiae SCD25 gene, complete cds. ACCESSION M26647 M31771 KEYWORDS Ras protein; SCD25 gene; cell division cycle. SOURCE S.cerevisiae (strain OL136) DNA. ORGANISM Saccharomyces cerevisiae Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes; Endomycetales; Saccharomycetaceae. REFERENCE 1 (bases 2129 to 5055) AUTHORS Boy-Marcotte,E., Damak,F., Camonis,J., Garreau,H. and Jacquet,M. TITLE The C-terminal part of a gene partially homologous to CDC25 gene suppresses the CDC25-5 mutation in Saccharomyces cerevisiae JOURNAL Gene 77, 21-30 (1989) STANDARD full staff_review REFERENCE 2 (bases 1 to 3880) AUTHORS Damak,F., Boy-Marcotte,E., Le-Roscouet,D., Guilbaud,R. and Jacquet,M. TITLE SCD25, a CDC25 like gene, which contains a RAS activating domain is a dispensable gene of Saccharomyces cerevisiae JOURNAL Unpublished (1990) See COMMENT for author address STANDARD full staff_review COMMENT Draft entry and computer-readable copy of sequence [1] kindly submitted by E.Boy-Marcotte, 02-AUG-1989, for [2] by F.Damak, 01-FEB-1990. 
Laboratoire IGD, Groupe des laboratoires de biologie cellulaire, Centre universitaire d'Orsay, 91405-Orsay Cedex FEATURES from to/span description pept 128 3880 SCD25 protein pept 4319 > 5055 ORF X BASE COUNT 1638 a 973 c 900 g 1544 t ORIGIN 1 ctgcaggctc gcaaaattta aggttccctt ctacaatagt agtcaaaatt gcttttttgc 61 atataacaaa gtgaaaaaaa aaaatatgag agacatatct aaaagacata tataatctgc 121 caccataatg agttgcactg cgtcatatgc cggcatgaca actccggtga aagataagga 181 aggccacggg attccatgct tacaacctat cgatgtagtg gaatgtacct atcaatattt 241 tacaaaatca cggaataaac tgtctttaag ggtaggcgat ttgatttacg tactcactaa 301 aggttctaat ggctggtggg atggtgttct tatcagacac agcgctaata ataataataa 361 taattcgttg atactagaca gaggttggtt ccccccttct tttacacggt ccattctaaa 421 cgaactacac ggggtgcctg acatcggtaa tgaattggaa atatttcaag cgggtcttaa 481 tcttaaactg gaattatcaa gcaacccagt gatcttatca ttggaagact ttttagactg 541 ctgtcgcgat attgaattca aggaacaact ggcttggtca cctactcccg tccacgaaag 601 gaaaggctgc tgtgagctgc tgtactataa ccaggattta gatgtttatt gtcgcacgtt 661 accatattta ccacaaaatc aagttgaaac cgtgaacgac tattcgtctt ttcctgcaat 721 atcgaagatt gctggtaaaa agatgcctat aacgtcaagc cccgatctgt tctatctcaa 781 tgattgtgat gtcgtctatt ggtatgacct cactcgctta gtgtgtcatt atgttaattt 841 aacagagcgc gacctattgg caaatgaacg ggaaaagttt ctaacttcct tggatttatt 901 aacagctcaa ataacctatg tttatatgct tttcaggaat ctccgtttag ttgaagatag 961 tttcaaaaaa accctcaaaa aactaattta caccttgtct aggttttcaa taaatgcaaa 1021 tatttggttt cattccacat cgtttgaaga aagagaagcc atagcctccc agaaggatcc 1081 agaaagaaga tcccctcttc tacagtcaat cctaggaacc ttccaaaaat ttcattttct 1141 actgcgtcta ctacatttcc tctcaaatcc taacgaactt acaatactgc ctcaattgac 1201 tcctcgattt ttcaaggatt ctttcaatac aatttcatgg aataacccgt ttttgcgtac 1261 agtcttcaac cagcatatgt ccatgacctt accgagacag atgattaaag ccgttgctgg 1321 cgcttcagga attgttgcgg aaaatattga tgaaattcca gcttccaaac agggcacttt 1381 catctcgtca gaaacgtctc accattcacc atcagccccg tttcaaagaa ggagaagagg 1441 taccattttc tctaatgtgt caggaagttc cgatgagtct gacaccatat ggtccaaaag 
1501 gaaaaaacca tacccgctaa atgaagaaac tctaagcctt gtaagggcca ggaagaagca 1561 gcttgatggt aaactaaaac aaatgatcaa aagtgctaat gaatatctca gtaacacggc 1621 taatttcaaa atgttgaatt ttgaaatgaa cttcaaaacc tacgaagaag taagcggaac 1681 aattcctata attgatattc tggaaaacct agatttaact atttttctaa acttgagaga 1741 gttgggagat gagaatagag tttttgacga agatgtcttt gacgaagatg tcgctattgg 1801 tgatgaagat aaagagtttt tgaaacactc tttatcatcc ctatcgtata tcttatccga 1861 ctattttaat atgaagcaat attttcatga attgtcgccc acgcatttga cattagagga 1921 tcctttcgtt ttctcgccaa tgcaaaacga cttgcctacc ggttattatg aaccaatgaa 1981 accttcatcc ttgaatttag ataatgccaa ggataagaag aatgggagcc aaaatactga 2041 tatccaagag gaggaagatg aatatgagcc agacccggat agtcttattc tcttccacaa 2101 cctcatcaat caagattctg atttcaatga tctaaagttt tttaatctcg cccacgtttt 2161 taaaaaatcc tgtgatgact attttgatgt gcttaaacta gccattgagt tcgtgaatca 2221 attaattcta gaaagagaga atttgttaaa ttatgctgct agaatgatga aaaacaatat 2281 cacggaattg ctattgcgcg gggaagaagg ctatgggtcc tatgacggcg gtgaaactga 2341 aaaaagtgac acgaatgctg tttatgcaga ttcagatact aaagacaatg acgaatggcg 2401 tgacagccaa gtcaaattac cgaggtattt gcagcgcgag tatgacagtg aactgatttg 2461 gggctctaac aataggatta aaggtggttc taaacacgca ctgatctctt acttgacaga 2521 taatgaaaag aaggacctat ttttcaatat tactttttta atcactttca gaagcatctt 2581 tactacaacg gagtttttaa gctacttgat ctcgcaatat aatttggatc caccagagga 2641 tttgtgcttt gaagaataca atgaatgggt gacgaaaaag cttataccgg ttaaatgtag 2701 ggtggttgag attatgacaa cctttttcaa gcaatattgg ttcccgggct atgatgagcc 2761 cgatcttgcg accctaaatc tggattattt tgcgcaagta gcaatcaagg aaaatataac 2821 aggatctgtg gaattactaa aggaggtcaa tcagaagttt aaactaggta atatacaaga 2881 agcgactgca ccaatgaaaa cgttagatca acagatctgc caggaccatt actcgggcac 2941 tttatactct accacggaat ccattttggc cgtcgatcca gttttatttg ccactcaatt 3001 aacgatacta gagcatgaaa tttattgtga gataaccact tttgattgtt tgcaaaaaat 3061 ttggaagaac aagtatacaa aatcgtatgg ggcttcaccg ggtttgaacg agtttatcag 3121 ttttgccaat aaactgacaa atttcatatc ctactctgtt gtaaaggagg ctgataaaag 3181 
taagcgcgcc aagctactct ctcattttat ttttatcgca gaatattgta ggaaattcaa 3241 taacttttct tccatgactg acatcatttc agcattatat tcttcaccaa tttatcgttt 3301 agagaaaacc tggcaggcag ttattcctca aacgagagat ctattgcagt cactgaacaa 3361 gttgatggat cccaagaaaa atttcataaa ttacagaaac gagctgaagt ctttacatag 3421 cgctccctgc gtaccgtttt tcggcgttta tttatctgat ctaaccttta ctgattccgg 3481 aaatccggat tatcttgtct tggaacatgg tttaaagggt gtccatgatg agaagaaata 3541 tataaacttc aacaaaagga gcagacttgt tgatatctta caagagatca tatatttcaa 3601 gaaaacacat tatgatttca ctaaagatcg gacggtaatt gaatgtatat caaattcatt 3661 ggaaaacatc ccccatattg agaaacaata ccaattatca ttaattattg aaccaaaacc 3721 aagaaagaaa gtcgttccga attccaattc gaataataaa tcacaagaaa aatccaggga 3781 tgaccaaacc gatgaaggaa aaacatccac taagaaagac agatttccaa aatttcaatt 3841 acataagaca aagaaaaaag ctcccaaggt ttctaagtaa cggcgccgta tgttcgattt 3901 ccttctctcg gtggattaat tattttgttt gttttctcct gttatattat ttattgatca 3961 ctatagtaaa ctatgtccgt catcaagccc gacggctgct atcccacaat gttgatcgta 4021 ttgtttgcct agtttattat atatttgctt atttatagca taccataata tttaaatgcc 4081 ctcaaatttt tggccgtagc gacatcgcga taattccaat tccctttaaa aaattgcgcc 4141 tgagtataag ttaattcagc cagttctcca aattaaaatc gcatactcct gaacctatca 4201 acagattgtc ctcgcatact tttctatacc aaggtctctt ctgaacatat attagcagtg 4261 gttaatttta aagagatcat aaagaaaatt ttgtctaaaa aagattaata taaagacaat 4321 gtcttcacta gaagtggtag atgggtgccc ctatggatac cgaccatatc cagatagtgg 4381 cacaaatgca ttaaatccat gttttatatc agtaatatcc gcctggcaag ccgtcttttt 4441 cctattgatt ggtagctatc aattgtggaa actttataag aacaataaag taccacccag 4501 atttaagaac tttcctacat taccaagtaa aatcaacagt cgacatctaa cgcatttgac 4561 caatgtttgc tttcagtcca cgcttataat ttgtgaactg gccttggtat cccaatctag 4621 cgatagggtt tatccattta tactaaagaa ggctctgtac ttgaatctcc ttttcaattt 4681 gggtatttct ctccctactc aatacttagc ttattttaaa agtacatttt caatgggcaa 4741 ccagcttttc tattacatgt ttcaaattct tctacagctc ttcttgatat tgcagaggta 4801 ctatcatggt tctagtaacg aaaggcttac tgttattagc ggacaaactg ctatgatttt 4861 agaagtgctc 
cttcttttca attctgtggc aatttttatt tatgatctat gcatttttga 4921 gccaattaac gaattatctg aatactacaa gaaaaatggg tggtatcccc ccgttcatgt 4981 actatcctat attacattta tctggatgaa caaactgatt gtggaaactt accgtaacaa 5041 gaaaatcaaa gatct // LOCUS ADBMLPA 101 bp ds-DNA VRL 26-JUL-1990 DEFINITION Mastadenovirus 2 R1, R2 and R3 binding sites. ACCESSION M33540 KEYWORDS . SOURCE Mastadenovirus 2 viral DNA. ORGANISM Mastadenovirus 2 Viridae; ds-DNA nonenveloped viruses; Adenoviridae. REFERENCE 1 (bases 1 to 101) AUTHORS Leong,K., Lee,W. and Berk,A.J. TITLE High-level transcription from the adenovirus major late promoter requires downstream binding sites for late-phase-specific factors JOURNAL J. Virol. 64, 51-60 (1990) STANDARD simple staff_review COMMENT Sequence-specific binding proteins are induced during the late phase of infection. These proteins interact with three regions in the first intron of the major late promoter (MLP). BASE COUNT 24 a 25 c 26 g 26 t ORIGIN 1 ccagctgttg gggtgagtac tccctctcaa aagcgggcat gacttctgcg ctaagattgt 61 cagtttccaa aaacgaggag gatttgatat tcacctggcc c // LOCUS LB3HDCBA 804 bp ds-DNA BCT 26-JUL-1990 DEFINITION Lactobacillus 30a histidine decarboxylase-B (hdcB) gene, complete cds. ACCESSION X13099 KEYWORDS histidine decarboxylase. SOURCE Lactobacillus 30a DNA. ORGANISM Lactobacillus 30a Prokaryota; Bacteria; Firmicutes; Regular asporogenous rods; Lactobacillaceae. REFERENCE 1 (bases 1 to 804) AUTHORS Copeland,W.C., Domena,J.D. and Robertus,J.D. 
TITLE The molecular cloning, sequence and expression of the hdcB gene from Lactobacillus 30a JOURNAL Gene 85, 259-265 (1989) STANDARD simple staff_review FEATURES from to/span description pept 85 609 histidine decarboxylase-B (hdcB) BASE COUNT 277 a 140 c 152 g 235 t ORIGIN 1 actaatccac aggacatagt ttgaggaaga gatggtgttt actacctctt cctttaatat 61 tttgtaagtt aaggattgat tgcaatgagc aacagtaact accaagttag tttagaacga 121 attaaaaaag ttgtccctga agaactctta accaatgcat tgttagcagc tattgacaat 181 tctggtgaaa ggatgtcaca aataatagtc gataaaaaag ataacggcaa cgactattac 241 ctcaccatcc atagattctt cgtttatagc aacgaagaat tcaccgcttt tgataaagaa 301 gatgttgcag atgtcgaatt cgttaatggt acgccagatg gtgaagtaat cattacttta 361 aaggacggca aagtgttgca cccgtctcac atttgttacg gccgagcttt tgactttatc 421 caagatgtca agccaaaagt aattacaatg gcgggatatg acagcacaat tcgaggcgaa 481 tttccacaat tattagatcc agatcatgcg gaagagattg atcgattacg tcgctggatg 541 caagatggaa atattagcca ttacgaatac gatgatgcaa atccagctta tccaaaagca 601 ggaaaataaa aaaacatatt gacatatcat cagatatagg ttatgttaca atcaagcatc 661 ttaataggta atgcgcaatt tatatctttg aatatagttc cattatttat ttataaatag 721 ttactccgaa aaggactacg tacctactat acttttaaat aaatatattt cgtgatgggg 781 agcgttatta ccccggctgt cgac // LOCUS LBPREPA 3547 bp ds-DNA BCT 26-JUL-1990 DEFINITION L.plantarum repA, repB and repC genes, complete cds. ACCESSION M33531 KEYWORDS rep protein. SOURCE L.plantarum DNA. ORGANISM Lactobacillus plantarum Prokaryota; Bacteria; Firmicutes; Regular asporogenous rods; Lactobacillaceae. REFERENCE 1 (bases 1 to 3547) AUTHORS Bates,E.E.M. and Gilbert,H.J. 
TITLE Characterization of a cryptic plasmid from Lactobacillus plantarum JOURNAL Gene 85, 253-258 (1989) STANDARD simple staff_review FEATURES from to/span description pept 2191 2349 repA protein pept 2406 3062 repB protein pept 570 1655 repC protein BASE COUNT 1189 a 589 c 758 g 1011 t ORIGIN 1 gatatctggt taactttgat cacattagtg atcaaattca tttctttagc cccatcaaac 61 gatcagtttg ctttatgaaa gtgaccgctt gatggggctt tttcgtttac cttttgtcaa 121 aggtaaggtg tgacgggctt gactttgggt ggcgttgtgc ggaagcgcaa tcgacacgat 181 tttgactttg aggggagtta agaggggaag cgtagcgccc cttcttacaa gtgtaaagtg 241 tggacaagag agcgtagcga tattgtctac actttacccc aattgtcatg cgactttaaa 301 tagaattatt gattaataaa agccccctga caaaagtcga agggggactt ttattttagt 361 ttgaggtttg catacctact taaaaaagta gggcagcaaa acgtcaaaca ggtatcagct 421 aatcatccga tagggtgcgc tgatacggtc ctcaaaagag agccgacaga gccgtctgca 481 agacccctcg gcggaggccc acctttacga agtaagatat agtgggttat actttacttg 541 gaagataact ccgaaatgag gtgcatacaa tgagttttgc agtggctaga atgacgaaat 601 taaaagctga taatttagtc ggcattggca atcatgacca acggaaaacg actaatcaca 661 gcaacgaaga tattgatgtt tcccgctctc acctgaatta tgatttagtg gctgggcgca 721 ctgataactt taaaacggat attgaagcct atatcaacga aaacaaagcg agtaagcggg 781 cagttcgcaa agacgctgtt ttagtcaatg agtggattat aaccagtgac aaagactttt 841 ttgagcaatt agacgaagcc gaaacccgta aatattttga aacagccaaa caatattttg 901 cagataacta tggtgacgaa aatattcgct atgcagttgt tcatatggac gagaagaccc 961 ctcacatgca tatgggcatt gtgccctttg atgatgataa aaagctctca gctaagcgta 1021 tattcaatcg tgaagcctta cagcacattc aagaggaatt accacagtac ctcaaagaaa 1081 atggctttga tgttcaacgt ggtaacaaaa ataaagagcg taagaattta tcagtacccg 1141 aatacaaagc tatgcgggaa gaattgaaaa aaatagagac cgaaaaacaa gagacacaag 1201 caaagcttgc agatacaaaa aaacagcttg atgagatcaa accacgggat accaagaaaa 1261 ttgctagtaa acccaccttg atgaataaaa ataaagtcac ggttgataaa tctgatctcg 1321 ctgatttgga acaaagggcg gtgactagcg acgcttataa ctttgaaaaa attcatctgg 1381 aagtaggaaa tcatagttta cgtaatgatt tgagtgaagc caagggccgc aactatgaac 1441 tgagaaaaga 
aaatgagcga ttgcaaaaac tagtaggaac gcttcaaggc attatacgaa 1501 atgttgatga gtttctacac aaaaaactag gtattaattt acctgaaaag tggctagagc 1561 gtgcaggact aaaagaaccg tctaaaaaag cccctgaaag ctcacaggaa ctcgacagac 1621 ataaatctga tgaattaggc ggtccacatc tttaaatcgc ttatacgagc ttaaaatggc 1681 gtttaagagc ttaatttacc atctcgctag attgaacgta gttaactttg tgtccgtcaa 1741 cggtaaatcg acgtaggcgt tttatagccg ctgggctatt agacgcccta ggaggcttta 1801 aggagttgat agactagcgg ataaaacact tttgcacatg caaagaaaag cacccctgct 1861 ttttttgcct gccccacggc gagtgcgggg tgagtttagc gggtgctccc gtcatttatg 1921 gggtcaagct gacacagctt gcgggtttgg gcagagccca tattttggtt tggtttgagt 1981 gggataaaaa aattgggcga aaaacatggg ggtactacga caccccccca tgtgtccatt 2041 gtccattaaa cagaacactt ttttcaagaa accttttagg ttaggggttt tcgggggggt 2101 ttgagatttt ataaaaaatg ttgtatttct aacgtatgta taatataatg atggaataga 2161 gataaaaata gtaagaaaga aggttttttg atggttgaag ttgaaaagaa aaaaattact 2221 ttgtctatac ctgttgaaac taatggaaag ctggaagaat tggcccagaa atatggcatg 2281 actaaatctg gattggttaa ttttttggtt aatcaggttg cagaagctgg aactatttat 2341 aggcaataaa aaaagcgccc tgtgcatagg acgcaatcta aaagtctgtg aggtaattat 2401 aacatatgaa aagtgaatct aaaatcgatt ggacggtacc tcgtccaaat aaaaatccca 2461 aaacaaaaca gccttataaa cgtggtcgta attggggtat tgttgtttat cctgaaagtc 2521 ttcctgaaaa ttggaaagat attatcaggc aagagcctat tgctgtcagt cccttacatg 2581 ataaagatgt taaccctgat ggagaaaaga aaaaatctca ctatcatctt gttttgaact 2641 ataaagggaa caaatctttt gaacaaattg atgaaattgc taggtcttta agggcgcctg 2701 ctcctcaaag aattagtagt ttaactggcg ctgttaggta cttgacacat atggataatc 2761 ctgaaaaata tcagtatgat aatgctgata ttgagacctt tggaggcttt gatttagaga 2821 gttgcttagc tctttctact ggcgataagc gccaagcctt acgtgacatg ttggctttta 2881 tttctgaaaa tgaaattatg catttaaaag actttgcaga ttattgcatg tctgaggaag 2941 caccagctgg ctggttcgaa cttctaactg aaaggaatac gctttttatt aaagaatata 3001 tcaagtcaaa ttggcagaaa caacagtatg ctagtaaaaa catcaataaa atgtcggatt 3061 aaaattttat tgatgttgtt gctatattat tagtgaaagg atggtttact ttatgccaac 3121 aagaaaaaat attttagatg 
atattcaaga acatattgac aatgaagaac gtgttttggt 3181 tactaattca agcaaaatta actagcacca cgcgtataga gtgatttaaa ataactaaca 3241 tcgtttttat ttgaatttag aagggaagag atttttatta aaaatatagg ttttaactca 3301 aattatttta aaacctggta tttttggcta ggcatattaa cggtagtggg attaatcggt 3361 gatcccattt tacactatca ttcttcaact agtccgtggt tacaaatact tattgctatt 3421 ttattatttg tagcagcatt taccaaaaaa ataaataata actgacttaa atcgcaattc 3481 actctaaact tttaacaaat ttgttatcat aattgggtaa ggtgtttgca agttaagtat 3541 ttttccc // LOCUS RATUD2A01 1088 bp ds-DNA ROD 26-JUL-1990 DEFINITION Rat UDP glucuronosyltransferase-2 (UDPGTr-2) gene, exon 1. ACCESSION M35202 J05482 KEYWORDS UDP glucuronosyltransferase-2. SEGMENT 1 of 10 SOURCE Rat (strain Sprague-Dawley) adult liver, cDNA to mRNA, clone pUDPGTr-2. ORGANISM Rattus norvegicus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 1088) AUTHORS Mackenzie,P.I. and Rodbourn,L. TITLE Organization of the rat UDP-glucuronosyltransferase, UDPGTr-2, gene and characterization of its promoter JOURNAL J. Biol. Chem. 265, 11328-11332 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by P.I.Mackenzie, 13-APR-1990. 
FEATURES from to/span description pept 349 + 1072 UDP glucuronosyltransferase-2 (UDPGTr-2) pre-msg 315 > 1088 UDP glucuronosyltransferase-2 mRNA and intron IVS 1073 > 1088 UDP glucuronosyltransferase-2 intron A signal 288 293 CAT box BASE COUNT 329 a 194 c 216 g 349 t ORIGIN 1 ctgcagtcaa cggatcttca ctgctatgta agaacattta agaaataaga gctttcatct 61 gtgattttta catgactcta acacgttata atcaacagat gatgtttgca catgagaagt 121 gattcaattt tggctgaata gaatcaggga caaaaaagac aaataaactc tgttaacctt 181 gagctcatgt tccatgcttg tatttacaca tggcgtaaca tcattgcact catctaatcg 241 gtgatggttt aaaagttata tattaatttc ttgggtgact gaactttcat aaaaaacatg 301 aatatctaca atgaacgaca gatatcaaaa gcattccatt tctgcaagat gtctatgaaa 361 cagacttcag tgtttctgtt gatacagctc atatgctact ttagacctgg agcctgtgga 421 aaagtgctag tgtggcccac agaatacagc cactggatta atataaagat aattctgaat 481 gaacttgccc agagaggtca tgaagtcacg gttcttgtat cttcggcttc cattctcatt 541 gagcctacca aggaatcttc tattaatttt gagatttact ctgtaccttt gagtaaaagt 601 gatcttgaat atagttttgc aaaatggata gatgaatgga cacgtgattt tgaaacactc 661 tcgatttgga catattattc aaaaatgcaa aaagtcttca atgaatattc tgatgtcgtt 721 gaaaatttat gcaaagcact catttggaac aagagtctta tgaaaaaact ccaaggatct 781 caatttgatg tcattctcgc agatgctgtg ggtccctgtg gtgagctgct agcagaactg 841 cttaagacac ctttagtgta cagtctccgc ttctgtcctg gatacagatg tgaaaagttc 901 agtgggggac ttccactgcc tccttcctat gtgcctgttg ttctttcaga attaagtgac 961 cgcatgacat ttgtggaaag agtgaagaat atgttgcaga tgctgtattt tgacttttgg 1021 tttcaaccat ttaaagagaa gtcctggagt cagttttaca gtgatgttct aggtaaactg 1081 tgcctttc // LOCUS RATUD2A02 373 bp ds-DNA ROD 26-JUL-1990 DEFINITION Rat UDP glucuronosyltransferase-2 (UDPGTr-2) gene, intron A. ACCESSION M35078 J05482 KEYWORDS UDP glucuronosyltransferase-2. SEGMENT 2 of 10 SOURCE Rat (strain Sprague-Dawley) adult liver, cDNA to mRNA, clone pUDPGTr-2. ORGANISM Rattus norvegicus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. 
REFERENCE 1 (bases 1 to 373) AUTHORS Mackenzie,P.I. and Rodbourn,L. TITLE Organization of the rat UDP glucuronosyltransferase, UDPGTr-2 gene and characterization of its promoter JOURNAL J. Biol. Chem. 265, 11328-11332 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by P.I.Mackenzie, 13-APR-1990. FEATURES from to/span description pre-msg < 1 > 373 UDP glucuronosyltransferase-2 mRNA and intron IVS < 1 > 373 UDP glucuronosyltransferase-2 intron A BASE COUNT 137 a 75 c 56 g 105 t ORIGIN About 0.1 kb after segment 1. 1 aaaatgctat agagtaactg agcagaacac tccaaaaatt actatccatg taaactgaga 61 caaagatttc tcttagtaat cactagatct actctaagtt tgtcttagta aaagaaactc 121 caagtttctc gaatgcttta atgactgtag atgcgaacac taaagagtca ttatatacca 181 ccacaactat ctgtgtagca cagaaggaaa catgttccct tatacaaatt actcacttgc 241 aaatgatgaa aaaactccaa ggagctaagt ttgatgttat cacctagaat atcacgacag 301 gttttctcac aattaaatca tatcactaga accagaaaca gtcaaggcat cttagtttct 361 tcgagttcag ctg // LOCUS RATUD2A03 380 bp ds-DNA ROD 26-JUL-1990 DEFINITION Rat UDP glucuronosyltransferase-2 (UDPGTr-2) gene, intron A. ACCESSION M35079 J05482 KEYWORDS UDP glucuronosyltransferase-2. SEGMENT 3 of 10 SOURCE Rat (strain Sprague-Dawley) adult liver, cDNA to mRNA, clone pUDPGTr-2. ORGANISM Rattus norvegicus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 380) AUTHORS Mackenzie,P.I. and Rodbourn,L. TITLE Organization of the rat UDP glucuronosyltransferase, UDPGTr-2 gene and characterization of its promoter JOURNAL J. Biol. Chem. 265, 11328-11332 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by P.I.Mackenzie, 13-APR-1990. 
FEATURES from to/span description pre-msg < 1 > 380 UDP glucuronosyltransferase-2 mRNA and intron IVS < 1 > 380 UDP glucuronosyltransferase-2 intron A BASE COUNT 154 a 49 c 55 g 122 t ORIGIN About 0.1 kb after segment 2. 1 tcaaataaaa tagtacctaa attaatagga gaaagaattt aaaggttaac tatttgtgga 61 aatatccagg tgtaactttg acatatacaa ctaagttagt attacttgtc tcttctaata 121 ggcacagcac agtagtgata aaaagaaact tagtcataaa ctgcagatta tcacagtgca 181 tttcaagaat cagaaatcaa aagaatagct actaaaatgt ataaagtaga tgaaatattc 241 tacaaaagtt gatttttcta aggcattttc aagctttttt gcaaggaaca aatgttccaa 301 attcattggt gtaactttag aaaacatgta attgacaaca ttgatattat gttatacatt 361 atatcataat caaatgactt // LOCUS RATUD2A04 1435 bp ds-DNA ROD 26-JUL-1990 DEFINITION Rat UDP glucuronosyltransferase-2 (UDPGTr-2) gene, exon 2. ACCESSION M35080 J05482 KEYWORDS UDP glucuronosyltransferase-2. SEGMENT 4 of 10 SOURCE Rat (strain Sprague-Dawley) adult liver, cDNA to mRNA, clone pUDPGTr-2. ORGANISM Rattus norvegicus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 1435) AUTHORS Mackenzie,P.I. and Rodbourn,L. TITLE Organization of the rat UDP glucuronosyltransferase, UDPGTr-2 gene and characterization of its promoter JOURNAL J. Biol. Chem. 265, 11328-11332 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by P.I.Mackenzie, 13-APR-1990. FEATURES from to/span description pre-msg < 1 > 1435 UDP glucuronosyltransferase-2 mRNA and intron pept + 51 + 199 UDP glucuronosyltransferase-2, exon 2 IVS < 1 50 UDP glucuronosyltransferase-2 intron A IVS 200 > 1435 UDP glucuronosyltransferase-2 intron B BASE COUNT 375 a 301 c 268 g 485 t 6 others ORIGIN About 0.2 kb after segment 3. 
1 ccacaaaacc tcttttcacc attgagtatt tttatctgtt ttggatgcag gtagacccac 61 aacattaact gagatgatgg ggaaggcaga tatatggctc attcgaacct tctgggactt 121 ggaatttcca cacccattct tacctaattt tgactttgtt ggaggactac attgcaaacc 181 agccaaacca ctgcctaggg taacattgga ttgttttcct tgataaactg ttcgttcctt 241 tatcattctt tatttgtttt tacaaagagg atagtttatt ttaattatta atatttatct 301 ttaatctttt tttacagtcc agtaattatc cccttctgga ccaccctcgt tccatcctcc 361 tcctcccttg ctccaagagt atgtatgcca ggagcctcct gcgatggaga ggatagtgtc 421 aggggtgcag gagggaacaa agtaagactc tggtgtggct ttaaagctga cggtctcctg 481 acattctaac tctctacctg ttcagaaaca ctgatgataa cttctagaaa atcatacaaa 541 ctttcttgct ctttctcatg ataaaaggct gctggcttgg gaatcagtac ctgtaactta 601 acaacagagg attgagcaat gtggccttgg tcctatatag taggaactgt gtggctctaa 661 ctttcagcct gctagtcaga anngcagaag ggatctttcc acatgatgtc tcctccttct 721 tcttcttgta gtcctcctct actctcctgg attctcaact gggatcagac gccctgccct 781 cttctcttct gcccagctga tcgattcttt attaactaat caaggatgat ctaaattatt 841 ttatacataa cattgagacc agtgatgctt gactgtgcca aattttggac tgcaaccaga 901 tatctgggca taaaaattag cacatgaata cacagtgtaa aaaaaaaacc gtcccctaac 961 actcacctat tgttttctgc atgtgggtga gtctacatgt gtctgatggg aggcctgtgc 1021 atgtttcttt ttacaactag gtcccttnnc tggtatataa gtttcattac taggaagtgt 1081 tagcatttaa tggtaatttt gttagatgga tgggattgtg aatttaaaac ttgccttgaa 1141 gtagattttg agtgacatag cacattttta aattttattt tgtgtttttt taaagaggac 1201 atctctctat agcttanntg tccttaacct catagcagtc cttctgcctc agtctcccat 1261 gtgctgagat tagaccagtc ttaatacctc ttctgaaaca tgatgtgtaa tatcagtgat 1321 ggagatctta ctgtgcacag ctttagatca tgatgtttag cagattgtaa cttccattca 1381 tgagaagaaa ctgcacaaac catctcattc ctgtcttact ttattgattg gaagc // LOCUS RATUD2A05 769 bp ds-DNA ROD 26-JUL-1990 DEFINITION Rat UDP glucuronosyltransferase-2 (UDPGTr-2) gene, intron B. ACCESSION M35081 J05482 KEYWORDS UDP glucuronosyltransferase-2. SEGMENT 5 of 10 SOURCE Rat (strain Sprague-Dawley) adult liver, cDNA to mRNA, clone pUDPGTr-2. 
ORGANISM Rattus norvegicus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 769) AUTHORS Mackenzie,P.I. and Rodbourn,L. TITLE Organization of the rat UDP glucuronosyltransferase, UDPGTr-2 gene and characterization of its promoter JOURNAL J. Biol. Chem. 265, 11328-11332 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by P.I.Mackenzie, 13-APR-1990. FEATURES from to/span description pre-msg < 1 > 769 UDP glucuronosyltransferase-2 mRNA and intron IVS < 1 > 769 UDP glucuronosyltransferase-2 intron B BASE COUNT 293 a 116 c 117 g 243 t ORIGIN About 0.1 kb after segment 4. 1 aaatgctact tcatttgatc ttgaaggtgt gtgagctgtc attatttaat tggtacggta 61 tttctttcaa ataaacaatt aaaatagtgt tcttttcttt aaaaaaataa agaaaaaaga 121 gatcataaag aaaaaaagaa gttgcagaaa gaaaagggga caccttgaaa agtgattata 181 gcacttatta ctaagttgta aaaggtttcc tatgaaaact atctaagaag ataagtagaa 241 aagtcctaat gagggaaagg aaaaaaaaat tcttctcctt ctcatcattt tgtcctcagt 301 acttacacat cttttcagaa tacatgacca caagttaaaa gtcataacaa aaaattaaat 361 aataaattta agtagaagtt tacaagaaaa aaatgcttac atgcatatcc attaggagta 421 atttctggct aaacaccatt cacatggctc cacaggttca tagaaggttg aaaaccataa 481 ttaaaattat tagtgaagtt ttgtattgat gaacccagtc catattttat cttctgtctt 541 agcacctata ataaatttta gttccctttt tacgaccttt agttaagtgt tttacaacct 601 cttggattgt gctctgagaa gaagaaagtc tggttgctat ctaagaacaa ttaactggtg 661 acacatagga gactgataca gttctcattg cacttttcac tatcagaaaa ggaactaaaa 721 taattccact ataaaagagc ttaataatca ctgatatact tagatctct // LOCUS RATUD2A06 359 bp ds-DNA ROD 26-JUL-1990 DEFINITION Rat UDP glucuronosyltransferase-2 (UDPGTr-2) gene, exon 3. ACCESSION M35082 J05482 KEYWORDS UDP glucuronosyltransferase-2. SEGMENT 6 of 10 SOURCE Rat (strain Sprague-Dawley) adult liver, cDNA to mRNA, clone pUDPGTr-2. 
ORGANISM Rattus norvegicus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 359) AUTHORS Mackenzie,P.I. and Rodbourn,L. TITLE Organization of the rat UDP glucuronosyltransferase, UDPGTr-2 gene and characterization of its promoter JOURNAL J. Biol. Chem. 265, 11328-11332 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by P.I.Mackenzie, 13-APR-1990. FEATURES from to/span description pre-msg < 1 > 359 UDP glucuronosyltransferase-2 mRNA and intron pept + 175 + 306 UDP glucuronosyltransferase-2, exon 3 IVS < 1 174 UDP glucuronosyltransferase-2 intron B IVS 307 > 359 UDP glucuronosyltransferase-2 intron C BASE COUNT 114 a 69 c 75 g 101 t ORIGIN About 4.0 kb after segment 5. 1 gtagtatagt acaaatgcac acttaatgaa cactgggtac cgaggcaatg gatacactgg 61 tctcccaaaa taattccagg aattacataa tttcctctgg taagtttgtc tcggtagttg 121 agacaatgct tcccatgcaa ccattcatct gtgatgtcat aaccatcttc ataggaaatg 181 gaagaatttg ttcagagctc tggagaacat ggtgtagtgg tgttttctct gggatcaatg 241 gttaaaaacc tgactgaaga aaaagccaat gtagttgctt ctgctcttgc ccaaattcca 301 cagaaggtaa gataaaatgt ccacagagat ggcaaatgta ttataagtca tctgaaccc // LOCUS RATUD2A07 609 bp ds-DNA ROD 26-JUL-1990 DEFINITION Rat UDP glucuronosyltransferase-2 (UDPGTr-2) gene, exons 4 and 5. ACCESSION M35083 J05482 KEYWORDS UDP glucuronosyltransferase-2. SEGMENT 7 of 10 SOURCE Rat (strain Sprague-Dawley) adult liver, cDNA to mRNA, clone pUDPGTr-2. ORGANISM Rattus norvegicus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 609) AUTHORS Mackenzie,P.I. and Rodbourn,L. TITLE Organization of the rat UDP glucuronosyltransferase, UDPGTr-2 gene and characterization of its promoter JOURNAL J. Biol. Chem. 
265, 11328-11332 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by P.I.Mackenzie, 13-APR-1990. FEATURES from to/span description pept + 69 156 UDP glucuronosyltransferase-2, exon 4 292 + 511 UDP glucuronosyltransferase-2, exon 5 pre-msg < 1 > 609 UDP glucuronosyltransferase-2 mRNA and intron IVS < 1 68 UDP glucuronosyltransferase-2 intron C IVS 157 291 UDP glucuronosyltransferase-2 intron D IVS 512 > 609 UDP glucuronosyltransferase-2 intron E BASE COUNT 170 a 119 c 115 g 205 t ORIGIN About 0.6 kb after segment 6. 1 ccaggaacaa attttaccaa agccttggaa tttctgtaat taaataaggc attgtctgtg 61 tgtaacaggt tgtatggaga tttgatggta agaaaccaga taccttagga tctaacactc 121 ggctgtacaa gtggatcccc cagaatgacc ttcttggtaa ggcaaagttt aactacaagt 181 ttgtggctat agtaacacac tttcttgaga atagcacact tctgagtctt catattttcc 241 tctcttaaat attattcggt caataattat gtcaacttct tctcattgca ggtcatccaa 301 aaaccaaagc ttttgtagct catggtggaa caaatggcat ctatgaggca atctaccatg 361 gcattcctat tgttggtatt cccttgtttg cagatcaacc ggataacatt aatcacatgg 421 tagccaaagg agctgctgtt agagttgact tcagcatact gtcaactaca ggccttctca 481 ctgccttgaa gattgtcatg aatgaccctt cgtgagtctg tttgtttgtt gaagttgttt 541 tttccaagga aggctgtttc tttttctttt ttgaaacata atttttacta tataactaca 601 agagctgcc // LOCUS RATUD2A08 316 bp ds-DNA ROD 26-JUL-1990 DEFINITION Rat UDP glucuronosyltransferase-2 (UDPGTr-2) gene, intron E. ACCESSION M35084 J05482 KEYWORDS UDP glucuronosyltransferase-2. SEGMENT 8 of 10 SOURCE Rat (strain Sprague-Dawley) adult liver, cDNA to mRNA, clone pUDPGTr-2. ORGANISM Rattus norvegicus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 316) AUTHORS Mackenzie,P.I. and Rodbourn,L. TITLE Organization of the rat UDP glucuronosyltransferase, UDPGTr-2 gene and characterization of its promoter JOURNAL J. Biol. Chem. 
265, 11328-11332 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by P.I.Mackenzie, 13-APR-1990. FEATURES from to/span description pre-msg < 1 > 316 UDP glucuronosyltransferase-2 mRNA and intron IVS < 1 > 316 UDP glucuronosyltransferase-2 intron E BASE COUNT 112 a 39 c 42 g 123 t ORIGIN About 0.1 kb after segment 7. 1 ctcatagata tttgcttgct tcagcctcct gggtgctggg attagaaata tctgaattta 61 tatttgctgt gaataactat tattttaaaa atattgacag attcagatga tcatcagatt 121 gattttatcc tatttgaagg agggagaata atttcgaaaa attatgtttt tgcatatctg 181 aaatatgtgc ttttttaaca ataaagttac tctaaatttc taattgaatc aattagacat 241 gattattctc aaactattct atataaagaa ataatattac aaatatttat ctattataac 301 aaaggacaca ttttct // LOCUS RATUD2A09 487 bp ds-DNA ROD 26-JUL-1990 DEFINITION Rat UDP glucuronosyltransferase-2 (UDPGTr-2) gene, intron E. ACCESSION M35085 J05482 KEYWORDS UDP glucuronosyltransferase-2. SEGMENT 9 of 10 SOURCE Rat (strain Sprague-Dawley) adult liver, cDNA to mRNA, clone pUDPGTr-2. ORGANISM Rattus norvegicus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 487) AUTHORS Mackenzie,P.I. and Rodbourn,L. TITLE Organization of the rat UDP glucuronosyltransferase, UDPGTr-2 gene and characterization of its promoter JOURNAL J. Biol. Chem. 265, 11328-11332 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by P.I.Mackenzie, 13-APR-1990. FEATURES from to/span description pre-msg < 1 > 487 UDP glucuronosyltransferase-2 mRNA and intron IVS < 1 > 487 UDP glucuronosyltransferase-2 intron E BASE COUNT 196 a 89 c 80 g 122 t ORIGIN About 0.05 kb after segment 8. 
1 gaataagaga cagtattaaa ttcatacaaa tacctggaga acactattgt aatttcaagg 61 tttgctagaa gacaaatgta cctaatgaga aggtcctgag tcaaaaataa ctggagaaag 121 tgctgttcgt tcctacatac acagtcttct agtccaggaa cagaattaaa ttgttttcat 181 tgtggtgaat tcttgtggaa ctgttgtaca aagaagagtc ataaacaaca aagtgttttt 241 agaagaagaa cctagttata aacagataca taggagagga aaaaaaacta gagaggagat 301 atcgaacatg acatatgacc tggaaaaagt tctatggcta cttcccttct tggtcttata 361 tcatgagtta catgttacac aaaaacacac acacacaaac aaacacacac aaacatacac 421 acacacaaac atacacacac acaaacatac aaacacatac acacaagttt gtgtgtctta 481 ctagttt // LOCUS RATUD2A10 895 bp ds-DNA ROD 26-JUL-1990 DEFINITION Rat UDP glucuronosyltransferase-2 (UDPGTr-2) gene, exon 6. ACCESSION M35086 J05482 KEYWORDS UDP glucuronosyltransferase-2. SEGMENT 10 of 10 SOURCE Rat (strain Sprague-Dawley) adult liver, cDNA to mRNA, clone pUDPGTr-2. ORGANISM Rattus norvegicus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 895) AUTHORS Mackenzie,P.I. and Rodbourn,L. TITLE Organization of the rat UDP glucuronosyltransferase, UDPGTr-2 gene and characterization of its promoter JOURNAL J. Biol. Chem. 265, 11328-11332 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by P.I.Mackenzie, 13-APR-1990. FEATURES from to/span description pept + 212 488 UDP glucuronosyltransferase-2, exon 6 pre-msg < 1 836 UDP glucuronosyltransferase-2 mRNA and intron IVS < 1 211 UDP glucuronosyltransferase-2 intron E signal 808 812 poly-A signal BASE COUNT 248 a 206 c 163 g 278 t ORIGIN About 0.15 kb after segment 9. 
1 gttcataatt accctgtgct aaacaagact gtttcactgt ctttcctgtc actcaactct 61 cctctgccac cacctgaaac aaaacacttg agtgggaagt atacatgatt tattttaagt 121 tgcttgtgag acttttccct aaaacaacaa atgttgttaa gtcatcaaat tgcctcctct 181 ttaatcttag ttgtatacat tgtcccttca gctataagga gaatgccatg agattatcca 241 gaatccacca tgatcagcca gtgaagcccc tggaccgagc cgtcttctgg atcgagtatg 301 tcatgcgtca caaaggagcc aagcacctcc gctcaactct gcatgacctt agctggttcc 361 agtaccactc tctggatgtc attgggttcc tattgctctg tgtggtaggt gtggtattca 421 tcatcacaaa attctgcctc ttttgttgcc gtaagactgc taacatggga aagaagaaga 481 aagagtagca tcataaaggc tgaagcagag ccctgagaga tgagcctctg ccagctgctt 541 ccagcaggaa cctgttgtca tgccagtgcc ttccctctaa aagaagacag cgttgggacc 601 tcattgaaca tggctccaat gaattcacta tgttctgaag acatgcaaga tttcatgcca 661 aatatatatt cagtgctaaa aaaacaaaat cctgtgttca gtttagaatg ttttgatgta 721 gctgagaagc tttgcccaac aacaataact gaagctactg tagttcataa agttcacatg 781 gctttatagc ctttgcaaaa catatctata aatcaattac tttttgaaaa tacccagcct 841 gctttgtctt catttagtag actatttttc tctccttctt tcttttttct tcttt // LOCUS RATUDPA 1858 bp ss-mRNA ROD 26-JUL-1990 DEFINITION Rat UDP glucuronosyltransferase-5 (UDPGTr-5) mRNA, complete cds. ACCESSION M33746 J05440 KEYWORDS UDP glucuronosyltransferase-5. SOURCE Rat (strain Sprague-Dawley) adult liver, cDNA to mRNA, clone UDPGTr-5. ORGANISM Rattus norvegicus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 1858) AUTHORS Mackenzie,P.I. TITLE The cDNA sequence and expression of a variant 17B-hydroxysteroid UDP glucuronosyltransferase JOURNAL J. Biol. Chem. (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by P.I.Mackenzie, 13-APR-1990. FEATURES from to/span description pept 43 1635 UDP glucuronosyltransferase-5 precursor (EC 2.4.1.17) sigp 43 111 UDP glucuronosyltransferase-5 signal peptide (put.) 
matp 112 1632 UDP glucuronosyltransferase-5 mRNA < 1 1858 UDP glucuronosyltransferase-5 mRNA BASE COUNT 559 a 365 c 387 g 547 t ORIGIN 1 agaaaggaac acagtgtgaa cagaaggatt ttgattttca aaatgccagg aaaatggatt 61 tttgctctgc tcttgctgca gataagtttc tgcctcagat ctgcgaaatg tgggaaggtg 121 ttggtgtggc cgatggaatt cagtcactgg atgaatataa aaacaatact ggatgaactt 181 gtacagaggg gccatgaagt cactgttctg aaaccttcag cttactatgt tcttgatcca 241 aaaaaatcgc cagaccttaa gtttgaaact tttcctacat ctgtcagtaa agatgaactg 301 gaaaaatatt tcataaaact tgctgatgcg tggacttatg agttgcaaag agatacatgt 361 ttgtcttttt ctcctttact acaaaatatg atggatgaat tttctgatta ttatctaagt 421 gtttgtaaag acgccgtttc aaacaagcag ctcatggcaa aactacagga atccaagttt 481 gatgttcttt tgtcagatcc tgtggctgcc tgtggggagc tgatagccga agtgctccac 541 attccttttc tgtacagtct tcgtgcctct ccaggccata aaattgaaaa gtccagtgga 601 agatttatac tacctccctc ttatgtgcct gtaattttgt caggattggg tggccaaatg 661 acattcatag acagggttaa aaatatgata tgtatgcttt attttgactt ttggttccat 721 atgtttaatg ccaagaattg ggatccattt tatactgaga ttttgggaag gcccaccacc 781 ttagctgaga caatgggcaa agcagaaatg tggctcatta gatcctactg ggatttggag 841 tttccccacc caacattacc aaatgttgac tacattggag gactccaatg caaacctgct 901 aaacccttgc ccaaggatat agaagacttt gtccagagct ctggagagca tggtgtggtg 961 gtgttttctc tggggtcaat ggtcagcagc atgacagaag aaaaggccaa cgcaattgca 1021 tgggcccttg cccagattcc acaaaaggtt ctttggaaat ttgatggcaa aatcccagca 1081 actttaggac ccaataccag agtctacaag tggcttcccc agaatgacct ccttggtcat 1141 ccaaaaacca aagcctttgt aactcatggt ggagccaatg gtgtctatga ggccatctat 1201 catggaatcc ctatgattgg cattcctatg tttggagaac aacatgataa cattgcccac 1261 atggtggcca aaggagcagc tgttacactg aatatcagga caatgtcaaa gtcagatttg 1321 ttcaatgcac ttaaggaagt aataaacaat cctttctata aaaaaaatgc tatgtggctg 1381 tcaaccattc accatgacca acctatgaaa cccctggaca aggctatctt ctggattgag 1441 tatgtcatgc gccacaaaag agccaagcac ctgagaccac ttggacataa ccttccctgg 1501 taccagtacc actctctgga tgtgattgga ttcctgctag cctgtttggc agtcattgca 1561 gcccttgctg taaaatgctt cttgttcatt 
taccgattct ttgcaaagaa gcaaaagaaa 1621 atgaagaatg agtagagctc gttgacaatg cactacagga atgaaattta agcctcattc 1681 taatttatga atcactttct taacacttcc tgattttttt ttgtggaggc agatcatcat 1741 tgtaagaaga catatagctc tgtgaatatt gatatgttat caaaatttta aaatcactta 1801 atgtaaaaaa gttgcattgt agaaaaattg aggaaaataa agtttacttg atagtctt // LOCUS RATUDPB 2216 bp ss-mRNA ROD 26-JUL-1990 DEFINITION Rat UDP glucuronosyltransferase-21 (UDPGTr-21) mRNA, 3' end. ACCESSION M33747 J05440 KEYWORDS UDP glucuronosyltransferase-21. SOURCE Rat (strain Sprague-Dawley) adult liver, cDNA to mRNA, clone UDPGTr-21. ORGANISM Rattus norvegicus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 2216) AUTHORS Mackenzie,P.I. TITLE The cDNA sequence and expression of a variant 17B-hydroxysteroid UDP glucuronosyltransferase JOURNAL J. Biol. Chem. (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by P.I.Mackenzie, 13-APR-1990. 
FEATURES from to/span description pept < 1 463 UDP glucuronosyltransferase-21 (AA at 2) (EC 2.4.1.17) mRNA < 1 2216 UDP glucuronosyltransferase-21 mRNA BASE COUNT 738 a 386 c 386 g 706 t ORIGIN 1 agccaatggt gtctatgagg ccatctatca tggaatccct atgattggca ttcctatgtt 61 gggagaacaa catgataaca ttgcccacat ggtggccaaa ggagcagctg ttacactgaa 121 tatcaggaca atgtcaaagt cagatttgtt caatgcactt aaggaagtaa taaacaatcc 181 tttctataaa aaaaatgcta cgtggctgtc aaccattcac catgaccaac ctatgaaacc 241 cctggacaag gctatcttct ggattgagta tgtcatgcgc cacaaaagag ccaagcacct 301 gagaccactt ggacataacc ttccctggta ccagtaccac tctctggatg tgattggatt 361 cctgctagcc tgtttggcag tcattgcagc ccttgctgta aaatgcttct tgttcattta 421 ccgattcttt gcaaagaagc aaaagaaaat gaagaatgag tagagctcgt tgacaatgca 481 ctacaggaat gaaatttaag cctcattcta atttatgaat cactttctta acatttcctg 541 attttttttt gtggaggcag atcatcattg taagaagaca tatagctctg tgaatattga 601 tatgttatca aaattttaaa atcacttaat gtaaaaaagt tgcattgtag aaaaattgag 661 gaaaataaag tttacttgat agtcttaaaa atcacagtat taaccttaca atatttgaat 721 attgtccatt gacctctttc tctgagactg aatctgtagc tttcatacaa ataagtagct 781 aacttgtata ctataaatat ggacatataa atagtttttt ctgtaatagt cttaattatt 841 tgtagtcggg gataaagtgt ggtttggttt ggatattcat ttcaaagggt aggaatctgt 901 tggctatttt gttcctgtaa caaaatgtgc tgaccaaaag catctccagg gaaaagcaga 961 gcagtttatt ttgagttgtg cttacagatc ctgagaacgc aggatagata ggaaggcagg 1021 gcagcagtca gccagatgac aaaactctct cattacatct taaccacaca tagaaagcac 1081 aaagagtgag caaaaagtgt gactatggtg tgaactttca aagcttgctc cagtgatata 1141 tttcctccaa aaagatttaa cccctttaaa taatattcct gtacccctgg agttgggagt 1201 ttagctcagt ggtagagcat ttgcctacca aacacaaggc tctgtgttca gtcctcagct 1261 ccgggggaaa aaaaaaagaa agattccata acctcaaaca gcattacaaa ttttggaaaa 1321 tgtgctaaaa ttcatcagcc tatctgaaac attttacatt gaatccataa caggaaataa 1381 acctgtttct taattcttat tttttagcat accattctaa tactccaagt tctaacacag 1441 cacttgtacc tcttcaatgt aatttaacta tgatcatgag gcataatgtt cattggaaat 1501 gaagcatatg aacaggaaac aaataaaagt cctaactaaa 
gtaaacttag ctttgagatt 1561 ggctattaca agtctggttg taattccact aatgctgcca tatgctgtga ggaatgttat 1621 aaaagagcta tgtaactatt atgacagttg tagcttttag cattgaaata catagatatt 1681 aatataaaag taagtgtata atatgatgct taaatgtgta acctaatatt ttagaataaa 1741 tttaattagt ggaaacattc tagacaggaa cagtaaatat atccaacatc attattcttt 1801 gatttaaaaa atgcaatttg gaggttcttc cctgcaaaag actatctctt tcccactctt 1861 aacattactt aggtgcttat tacagtttta tgttgagttg gggaaagggt aaaattgacc 1921 cctttccata ttagcatgaa tattggtatc atctttattg agatcttgtt taggaaccca 1981 ttatgagact tcaggagtat aactttcata atgtttgtaa tagatgcaac tttacagcag 2041 acaacttgat ccttctggcc tcttaaatct ttccatcctc tattatgtaa tgttttgttg 2101 atagttactt cagtatttga cacaagattc aataatttta tgcctatggg ttccatcaaa 2161 catcatgact ctatatatat gtaaatccaa aataagaaat aaaaaatagt gtatct // LOCUS BCEHEMOL 280 bp ds-DNA BCT 26-JUL-1990 DEFINITION B.cereus hemolysin gene, partial cds. ACCESSION M35411 KEYWORDS hemolysin. SOURCE B.cereus DNA. ORGANISM Bacillus cereus Prokaryota; Bacteria; Firmicutes; Endospore-forming rods and cocci; Bacillaceae. REFERENCE 1 (bases 1 to 280) AUTHORS Gilmore,M.S., Gilmore,K.S. and Goebel,W. TITLE A new strategy for ordered DNA sequencing based on a novel method for the rapid purification of near-milligram quantities of a cloned restriction fragment JOURNAL Gene Anal. Tech. 2, 108-114 (1985) STANDARD simple staff_review FEATURES from to/span description pept < 1 > 280 hemolysin (AA at 2) BASE COUNT 99 a 43 c 51 g 87 t ORIGIN 1 gaattctcat ttatggattg taaaccgtgc aattgatatt atgtctcgta atacaacact 61 tgtaaaacaa gatcgagttg cactattaaa tgaatggcgt actgagttag agaacggtat 121 ttatgctgct gactatgaaa atccttatta tgataatagc acatttgctt cacatttcta 181 tgaccctgac aatgggaaaa cttatattcc gtatgcaaag caggcaaagg aaactggagc 241 taaatatttt aaattagctg gtgagtctta caaaaataaa // LOCUS BPEFHAA 164 bp ds-DNA BCT 26-JUL-1990 DEFINITION B.pertussis filamentous hemagglutinin antigen gene, partial cds. ACCESSION M35274 KEYWORDS filamentous hemagglutinin antigen. 
SOURCE B.pertussis DNA, clone lambda-FHA15. ORGANISM Bordetella pertussis Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Aerobic rods and cocci; Alcaligenaceae. REFERENCE 1 (bases 1 to 164) AUTHORS Mattei,D., Pichot,F., Bellalou,J., Mercereau-Puijalon,O. and Ullmann,A. TITLE Molecular cloning of a coding sequence of Bordetella pertussis filamentous hemagglutinin gene JOURNAL FEMS Microbiol. Lett. 37, 73-77 (1986) STANDARD simple staff_review FEATURES from to/span description pept < 1 > 164 filamentous hemagglutinin antigen (AA at 1) BASE COUNT 38 a 45 c 60 g 21 t ORIGIN 1 gaattccaca tgcacctgga tgcgccgcgc atcgagaaca ccgcgaaact gacgcggcga 61 ggtgcaacgc aaaggcgtgc aggacgtcgg gggaggcgag cacggccgct ggacgtatcg 121 gctatgtcaa ctactggttg cgcgcgcatg gaagaaggcg ggca // LOCUS BPEFHAB 165 bp ds-DNA BCT 26-JUL-1990 DEFINITION B.pertussis filamentous hemagglutinin antigen gene, partial cds. ACCESSION M35275 KEYWORDS filamentous hemagglutinin antigen. SOURCE B.pertussis DNA, clone lambda-FHA15. ORGANISM Bordetella pertussis Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Aerobic rods and cocci; Alcaligenaceae. REFERENCE 1 (bases 1 to 165) AUTHORS Mattei,D., Pichot,F., Bellalou,J., Mercereau-Puijalon,O. and Ullmann,A. TITLE Molecular cloning of a coding sequence of Bordetella pertussis filamentous hemagglutinin gene JOURNAL FEMS Microbiol. Lett. 37, 73-77 (1986) STANDARD simple staff_review FEATURES from to/span description pept < 1 > 165 filamentous hemagglutinin antigen (AA at 1) BASE COUNT 30 a 54 c 55 g 26 t ORIGIN 1 gaattcggac cagcctggcc cgagcgctgc atgccgcgcg ggaaggccca cacagttggt 61 cccgacactg ccactttccg agtcccatcg caacgggcgg tgatccactc gtcgttggcg 121 cgtgatagac agcgcgtgca tgcgagagcg catgcagcag gctgg // LOCUS CRECYCA 662 bp ss-mRNA PLN 26-JUL-1990 DEFINITION C.reinhardtii mitochondrial apocytochrome c (cyc) mRNA, complete cds. ACCESSION M35173 KEYWORDS apocytochrome c; cytochrome c apoprotein. SOURCE C.reinhardtii, cDNA to mRNA, clone C321. 
ORGANISM Chlamydomonas reinhardtii Eukaryota; Plantae; Thallobionta; Chlorophycota; Chlorophyceae; Volvocales; Chlamydomonadaceae. REFERENCE 1 (bases 1 to 662) AUTHORS Amati,B.B., Goldschmidt-Clermont,M., Wallace,C.J.A. and Rochaix,J.-D. TITLE cDNA and deduced amino acid sequences of cytochrome c from Chlamydomonas reinhardtii: Unexpected functional and phylogenetic implications JOURNAL J. Mol. Evol. 28, 151-160 (1988) STANDARD simple staff_review FEATURES from to/span description pept 42 380 apocytochrome c (cyc) BASE COUNT 147 a 176 c 200 g 139 t ORIGIN 1 ccgaaccaaa acctttcctg tgacccttct atctgcttaa aatgtcgacc ttcgctgagg 61 cccccgctgg cgaccttgct cgcggcgaga agattttcaa gaccaagtgc gcgcaatgcc 121 acgttgctga gaagggcggc ggccacaagc agggccccaa cctgggcggt ctgttcggcc 181 gtgtctcggg cactgctgcc ggcttcgcat actcgaaggc gaacaaggag gctgccgtga 241 cctggggcga gagcactctc tacgagtacc tgctgaaccc caagaagtac atgcctggca 301 acaagatggt gttcgctggc ctgaagaagc ccgaggagcg cgccgatctg attgcctacc 361 tgaagcaggc gactgcttaa actgcgcgcg gcttagcaag cggcttcatt cattaggcag 421 aagcgggtct caagagcggg atagggttgc atctgggcgc ggcgtgtgtt cgcttcagaa 481 cgtcccacca gatgcaacag gcggatgtgt tacgagtgtc gagtgtgtac tgatgatggt 541 gtgcatgtgt aacggcgaca tacggatgga atagacatat cgtcttgaag actgtctcat 601 aggcagagac atctgctcac aggcaactta ttatgtctgc catgggcggt cgtaaagaat 661 tc // LOCUS ECOABC 1993 bp ds-DNA SYN 26-JUL-1990 DEFINITION Synthetic plasmid (for E.coli) DNA. ACCESSION M34519 KEYWORDS b-galactosidase; b-lactamase; bla gene; lacZ gene; promoter. SOURCE Synthetic DNA. ORGANISM Cloning vector Artificial sequences; Cloning vehicles. REFERENCE 1 (bases 1 to 1993) AUTHORS Hayden,M.A., Shallcross,M.A., Stotland,E. and Mandecki,W. TITLE A totally synthetic plasmid for general cloning, gene expression and mutagenesis in Escherichia coli JOURNAL Unpublished (1990) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by W.Mandecki, 22-MAY-1990. 
Author address:W.Mandecki Abbott Laboratories Corporate Molecular Biology D93D Abbott Park, IL 60064 FEATURES from to/span description pept 119 301 b-galactosidase pept 438 1298 b-lactamase site 81 327 lacZ fragment mRNA site 136 137 SmaI site for introduction of multicloning signal 301 336 trpA terminator signal 337 402 bla promoter P3 mRNA 403 1329 bla mRNA signal 1299 1342 phage fd terminator signal 1343 1408 RNAII promoter mRNA 1409 1962 RNAII mRNA signal 1415 1440 RNAI terminator signal 1518 1555 RNAI promoter mRNA 1422 1517 RNAI mRNA site 1961 1962 RNaseH cleavage site BASE COUNT 524 a 475 c 500 g 494 t ORIGIN 1 gaattgatta atgtgagtta gctcactcat taggcacccc aggctttaca ctttatgctg 61 ccggctcgta tgttgtgtgg aattgtgagc ggataacaat ttcacacagg aaacagctat 121 gactatgatt acgcccgggc ttgccgtcgt tttacagcga cgagactggg aaaatcctgg 181 cgttacccaa cttaatcgcc ttgccgcaca cccccctttc gccagttggc gtaatagcga 241 agaagcccgc accgaccgcc cttcccaaca gttgcgtagt ctgaatggcg aatggcgtta 301 aactagtagc ccgcctaatg agcgggcttt tttttaattc ccctatttgt ttatttttct 361 aaatacattc aaatatgtat ccgctcatga gacaataacc ctgataaatg cttcaataat 421 attgaaaaag gaagagtatg agtattcaac atttccgtgt cgcccttatt cccttttttg 481 cggcattttg ccttcctgtt tttgctcacc cagaaacgct cgtgaaagta aaagacgcag 541 aggaccaatt gggggcacga gtgggataca tagaactgga cttgaatagc ggtaaaatcc 601 ttgagagttt tcgccctgaa gagcgttttc caatgatgag cactttcaaa gttctgctat 661 gtggagcagt attatcccgt gtagatgcgg ggcaagagca actcggacga cgaatacact 721 attcgcagaa tgacttggtt gaatactccc cagtgacaga aaagcacctt acggacggaa 781 tgacggtaag agaattatgt agtgccgcca taacgatgag tgataacact gcggcgaact 841 tacttctgac aaccatcggt ggaccgaagg aattaaccgc ttttttgcac aatatgggag 901 accatgtaac tcgccttgac cgttgggaac cagaactgaa tgaagccata ccaaacgacg 961 agcgagacac cacaatgcct gcggcaatgg caacaacatt acgcaaacta ttaactggcg 1021 aactacttac tctggcttca cggcaacaat taatagactg gcttgaagcg gataaagttg 1081 caggaccact actgcgttcg gcacttcctg ctggctggtt tattgctgat aaatctgggg 1141 caggagagcg tggttcacgg ggtatcattg 
ccgcacttgg accagatggt aagccttccc 1201 gtatcgtagt tatctacacg acgggtagtc aggcaactat ggacgaacga aatagacaga 1261 ttgctgaaat aggggcttca ctgattaagc attggtaaac cgatacaatt aaaggctcct 1321 tttggagcct ttttttttgg acggaccgag tagaaaagat caaaggatct tcttgagatc 1381 ctttttttct gcgcgtaatc tgctgcttgc aaacaaaaaa accaccgcta ccagcggtgg 1441 tttgtttgcc ggatcaagag ctaccaactc tttttccgaa ggtaactggc ttcagcagag 1501 cgcagatacc aaatactgtt cttctagtgt agccgtagtt aggccaccac ttcaagaact 1561 ctgtagcacc gcctacatac ctcgctctgc taatcctgtt accagtggct gctgccagtg 1621 gcgataagtc gtgtcttacc gggttggact caagacgata gttaccggat aaggcgcagc 1681 ggtcgggctg aacggggggt tcgtgcacac agcccagctt ggagcgaacg acctacaccg 1741 aactgagata cctacagcgt gagctatgag aaagcgccac gcttcccgaa gggacaaagg 1801 cggacaggta tccggtaagc ggcagggtcg gaacaggaga gcgcacgagg gagcttccag 1861 ggggaaacgc ctggtatctt tatagtcctg tcgggtttcg ccacctctga cttgagcgtc 1921 gatttttgtg atgctcgtca ggggggcgga gcctatggaa aaacgccagc aacgcggcct 1981 ttttacggtt cct // LOCUS HS1PROM 591 bp ds-DNA VRL 26-JUL-1990 DEFINITION Herpes simplex virus type 1 joint promoter. ACCESSION M34532 KEYWORDS promoter. SOURCE Herpes simplex virus type 1 (strain KOS) DNA, clone pRAB6. ORGANISM Herpes simplex virus type 1 Viridae; ds-DNA enveloped viruses; Herpesviridae; Alphaherpesvirinae. REFERENCE 1 (bases 1 to 591) AUTHORS Bohenzky,R.A., Papavassiliou,A.P., Gelman,I.H. and Silverstein,S. TITLE Identification of novel transcripts mapping to the joint region of Herpes simplex virus type 1 JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by S.Silverstein, 23-MAY-1990. Author address:S.Silverstein Dept. of Microbiology Columbia University 701 W. 168th ST. N.Y., N.Y. 
10032
FEATURES       from to/span     description
    binding     134    139      CTF binding site
    binding     290    295      Sp1 binding site
    binding     508    513      Sp1 binding site
    binding     387    392      Sp1 binding site
    binding     480    485      Sp1 binding site
    binding     492    497      Sp1 binding site
    site        298    306      Oct1/a-TIF site
    site        327    330      a4 enhancer
    site        366    372      E4TF1 site
    signal      528    533      TATA box
BASE COUNT  103 a 193 c 204 g 91 t
ORIGIN      Map position 0.794-0.798.
        1 gcatgcccct cccgccgacg caacaggggc ttggcctgcg tcggtgcccc ggggcttccc
       61 gccttcccga agaaactcat taccataccc ggaaccccag gggaccaatg cgggttcatt
      121 gagcgacccg cgggccaatg cgcgaggggc cgtgtgttcc gccaaaaaag caattaacat
      181 aacccggaac cccaggggag tggttacgcg cggcgcggga ggcggggaat accggggttg
      241 cccattaagg gccgcgggaa ttgccggaag cgggtaatgt cggccggggc cgcccattaa
      301 tgagtttcta attaccatac cgggaagcgg aacaaggcct ctgcaagttt ttaattacca
      361 taccgggaag tgggcgcccg cccagtgggc gggagttacc gcccagtggg ccggcccgac
      421 gactcggcgg acgctggttg gccgggcccc gccgcgctgg cggccgccga ttggccagtc
      481 ccgccctccg agggcggccc gcctcggggg cgggccggct ccaagcgtat atatgcgcgg
      541 ctcctgccat cgtctctccg gagagcggct tggtgcggac ctgcagccaa g
//
LOCUS       MZEHETRO     184 bp ds-DNA              PLN       26-JUL-1990
DEFINITION  Corn heterochromatin repetitive DNA.
ACCESSION   M35408
KEYWORDS    .
SOURCE      Corn knob heterochromatin DNA, clone pZm4.25.
  ORGANISM  Zea mays
            Eukaryota; Plantae; Embryobionta; Magnoliophyta; Liliopsida;
            Commelinidae; Cyperales; Poaceae.
REFERENCE   1  (bases 1 to 184)
  AUTHORS   Peacock,W.J., Dennis,E.S., Rhoades,M.M. and Pryor,A.J.
  TITLE     Highly repeated DNA sequence limited to knob heterochromatin in
            maize
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 78, 4490-4494 (1981)
  STANDARD  simple staff_review
BASE COUNT  58 a 41 c 42 g 43 t
ORIGIN
        1 ggccacacaa cccccatttt tgtcgaaaat agccatgaac gaccattttc aataataccg
       61 aaggctaaca cctacggatt tttgaccaag aaatggtctc caccagaaat ccaagaatgt
      121 gatctatggc aaggaaacat atgtggggtg aggtgtatga gcgtctggtc gatgatcaat
      181 ggcc
//
LOCUS       RATRSB1      170 bp ds-DNA              ROD       26-JUL-1990
DEFINITION  Rat B1 repetitive sequence.
ACCESSION   M35409
KEYWORDS    B1 repetitive sequence.
SOURCE      Rat DNA.
  ORGANISM  Rattus norvegicus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 170)
  AUTHORS   Blin,N., Weber,T. and Alonso,A.
  TITLE     Cross-reaction of snRNA and an Alu I-like sequence from rat with
            DNAs from different eucaryotic species
  JOURNAL   Nucleic Acids Res. 11, 1375-1388 (1983)
  STANDARD  simple staff_entry
BASE COUNT  57 a 29 c 44 g 40 t
ORIGIN
        1 aaaaaaaagc aaatgacagc tgtgtgtggt ttcatatgtg tttaatccag cactcaggag
       61 gcagaggtaa atggatctct gtgagttcga gtccagtctg gctacaaagc aagttctaga
      121 gcagccaggg ctgttacaca gagaaactct gtcttggaag ataaaaaaga
//
LOCUS       SHFINV       261 bp ds-DNA              BCT       26-JUL-1990
DEFINITION  Plasmid pINV (from S.flexneri) RepA gene, 5' end.
ACCESSION   M35403
KEYWORDS    .
SOURCE      Plasmid pWR110 (from S.flexneri 5) DNA.
  ORGANISM  Shigella flexneri
            Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Facultatively
            anaerobic rods; Enterobacteriaceae.
REFERENCE   1  (bases 1 to 261)
  AUTHORS   Silva,R.M., Saadi,S. and Maas,W.K.
  TITLE     A basic replicon of virulence-associated plasmids of Shigella spp.
            and enteroinvasive Escherichia coli is homologous with a basic
            replicon in plasmids of IncF groups
  JOURNAL   Infect. Immun. 56, 836-842 (1988)
  STANDARD  simple staff_review
FEATURES       from to/span     description
    pept        256 >  261      repA protein
    mRNA        171     82 (c)  inc mRNA
BASE COUNT  70 a 54 c 68 g 69 t
ORIGIN
        1 gatcgtttaa ggaattttat ggctggccac gccttaaggt ggcagggaac tggttctgat
       61 gtggatgtac aggagccaga aaagcaaaaa ccccgataat cttctttaac tttggcgagt
      121 cagaaagatt accggggccc acttaaaccg tatagccaac aatcaagcta tgcggggagt
      181 atagttatat gcccggaaaa gttcaagact tctttctgtg ctcgctcctt ctgcgcattg
      241 taagtgcagg atggtgtgac t
//
LOCUS       YSYPSKLA     598 bp ds-DNA              PLN       26-JUL-1990
DEFINITION  S.kluyveri plasmid pSKL left-end inverted terminal repeat.
ACCESSION   M35319
KEYWORDS    .
SOURCE      S.kluyveri plasmid pSKL DNA.
  ORGANISM  Saccharomyces kluyveri
            Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes;
            Endomycetales; Saccharomycetaceae.
REFERENCE   1  (bases 1 to 598)
  AUTHORS   Kitada,K. and Hishinuma,F.
  TITLE     A new linear DNA plasmid isolated from the yeast Saccharomyces
            kluyveri
  JOURNAL   Mol. Gen. Genet. 206, 377-381 (1987)
  STANDARD  simple staff_review
FEATURES       from to/span     description
BASE COUNT  247 a 18 c 206 g 127 t
ORIGIN
        1 aaaaggtata gatatagata tattttttat gggtttggaa gggggaagtg gaagaatgta
       61 tcgtgtaaaa aaagagcaaa aaaaaaatta gatgagagaa ggggaaaaga ggggagtgta
      121 tcatgtgaaa aaacgcgtca aaatgaagag aagggaaaaa ggggagagtg tatcgtgggg
      181 aaagtgaatt ttgaagaaga gaaggggaaa agaggggagt gtatcgtcta agaagggggt
      241 attataagag aaggggatat tggtagagtg tattgaatgt ggcttagcaa aaatagaaaa
      301 agggtaaaaa atgggggata aaaaaaagaa aaaaacggta ttaaggggag aaggggaaaa
      361 gggtagagtg tatcgtgcaa aaagtgagtt caaaatgaag agaaggggaa aagggtagag
      421 tgtatcgtgg gggaaagtga gtttaaatga agagaagggg aaaagggtag agtgtatcgt
      481 gggggaaagt gagtttaaat gaagagaagg gaaaaagggg gagtgtatcg tataaaaagt
      541 gaatatattt tatttgatgg gattaagtat tgaaaatgga aatggatgat aggttgtt
//
LOCUS       YSYPSKLB     117 bp ds-DNA              PLN       26-JUL-1990
DEFINITION  S.kluyveri plasmid pSKL right-end DNA.
ACCESSION   M35320
KEYWORDS    .
SOURCE      S.kluyveri plasmid pSKL DNA.
  ORGANISM  Saccharomyces kluyveri
            Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes;
            Endomycetales; Saccharomycetaceae.
REFERENCE   1  (bases 1 to 117)
  AUTHORS   Kitada,K. and Hishinuma,F.
  TITLE     A new linear DNA plasmid isolated from the yeast Saccharomyces
            kluyveri
  JOURNAL   Mol. Gen. Genet. 206, 377-381 (1987)
  STANDARD  simple staff_review
BASE COUNT  35 a 5 c 18 g 59 t
ORIGIN
        1 caaaaagtga gattaggggg agaatatatt tattatgtta aatataaggt agttttttta
       61 taatttattt aatttatttt gtttgtattt tagcttcttt aattagtctg tattctt
//
LOCUS       XELTRH      1442 bp ss-mRNA             VRT       26-JUL-1990
DEFINITION  X.laevis thyrotropin releasing hormone (TRH) mRNA, complete cds.
ACCESSION   M34699 K00931 J05514
KEYWORDS    thyrotropin releasing hormone.
SOURCE X.laevis skin, cDNA to mRNA, clone L4 and 8/136. ORGANISM Xenopus laevis Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Amphibia; Lissamphibia; Anura; Archeobatrachia; Pipoidea; Pipidae. REFERENCE 1 (bases 1 to 478) AUTHORS Richter,K., Kawashima,E., Egger,R. and Kreil,G. TITLE Biosynthesis of thyrotropin releasing hormone in the skin of Xenopus laevis: Partial sequence of the precursor deduced from cloned cDNA JOURNAL EMBO J. 3, 617-621 (1984) STANDARD full staff_review REFERENCE 2 (bases 15 to 1442) AUTHORS Kuchler,K., Richter,K., Trnovsky,J., Egger,R. and Kreil,G. TITLE Two precursors of thyrotropin releasing hormone from skin of Xenopus laevis: Each contains seven copies of end-product JOURNAL J. Biol. Chem. (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [2] kindly submitted by G.Kreil, 18-MAY-1990. FEATURES from to/span description pept 110 793 thyrotropin releasing hormone precursor matp 332 340 thyrotropin releasing hormone copy 1 matp 374 382 thyrotropin releasing hormone copy 2 matp 428 436 thyrotropin releasing hormone copy 3 matp 470 478 thyrotropin releasing hormone copy 4 mRNA < 1 1442 TRH mRNA conflict 139 139 t in [2]; c in [1] conflict 214 216 tct in [2]; ctc in [1] conflict 319 319 g in [2]; t in [1] BASE COUNT 460 a 286 c 334 g 362 t ORIGIN 1 agcacagagc agcacaagga cacactctgc atattgtgct gccggacaag gaggtgacag 61 ccagtcaggc tgagacaaag gaacttccag acctctgaca gcaggaaaga tggtgtctgt 121 ctggtggttg ctgcttcttg gtacaaccgt atctcacatg gtgcacacac aagagcagcc 181 tttactggag gaggacacag caccattaga tgatctggat gttcttgaga aagccaaagg 241 tatcctgatc cgcagtatcc tggagggatt tcaagaaggg caacaaaaca atagagatct 301 accagatgca atggaaatga tatctaagcg ccagcaccca gggaaacgat tccaggagga 361 gatagaaaag agacaacacc ctggaaagag ggatctggaa gatctgaatc tagagctttc 421 caaaaggcaa caccccggaa gaagatttgt ggatgatgta gagaagaggc aacatccagg 481 aaagagagaa gagggtgact ggagtaggag gtatctgaca gatgactcac gttatttgga 541 cctcctttct gatgtttcca 
ggagacagca cccaggcaaa agagttccag ccccattgtt 601 tacaaaacgt caacacccag gtaagagagt gacagaagaa gagggtgata ctgaatttga 661 aaactcgaag gaagtgggga agcgccagca tccaggaaag agatatgacc cttgtgaagg 721 ccctaatgcc tacaactgta actcaggaaa cattctaccg gattctgtag aagaattgag 781 ttttgggctt taagctgccc agccccttta ttagttccat ctgaccctaa atgattccca 841 atgaacacaa ctttctataa ttgttaaata acattgtatt aagtatcata catttctgga 901 aagcaagcag ctcttagaac acttcttcgc tttaaaaggc acctggggca taagagtatt 961 aagcttcaga cagtaacctg cccaccacag ggagggattc aacaatcaca attggctgag 1021 tgttcctttc ccttgtttgg cagtgagatc agataataaa tataagatgg ccaggaaagt 1081 ggactctttc ttttctgaaa atttgcaagt aacaccaaaa tataataatt tgcacactca 1141 gtagtattaa cgtgaagatc tcaagaaggt tataaattct tggtgatctg ctcaaagcat 1201 ttaattcata gttgcttcca tggtttgatg gggaatgcac attctaaatt gcttattgct 1261 aattagcgct tgccacacag ttctggtggt agatcttgat gaggcatatt caataaaagt 1321 agagcccata gtaaaatttg tgccccgtca gctttaagga tcctctgtaa gcaatatgtg 1381 ttgtgagggc cacttgtttc taaagtaata ttttcatttt aataaatatg tctactcaaa 1441 tg // LOCUS XELTRHA 2955 bp ss-mRNA VRT 26-JUL-1990 DEFINITION X.laevis thyrotropin releasing hormone mRNA. ACCESSION M34698 J05514 KEYWORDS thyrotropin releasing hormone. SOURCE X.laevis, cDNA to mRNA, clone C6. ORGANISM Xenopus laevis Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Amphibia; Lissamphibia; Anura; Archeobatrachia; Pipoidea; Pipidae. REFERENCE 1 (bases 1 to 2955) AUTHORS Kuchler,K., Richter,K., Trnovsky,J., Egger,R. and Kreil,G. TITLE Two precursors of thyrotropin releasing hormone from skin of Xenopus laevis: Each contains seven copies of end-product JOURNAL J. Biol. Chem. (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by G.Kreil, 18-MAY-1990. 
FEATURES from to/span description pept 157 831 thyrotropin releasing hormone BASE COUNT 927 a 597 c 604 g 827 t ORIGIN 1 catgcagttt attagatata cagtacaatg aagtcagtta tgagaaatag caattgcagc 61 acaaggacac actctgcata ttgtgctgcc ggacaaggag gtgacagcca gtcaggctga 121 gacaaaggaa cttccagacc tctgacagca ggaaagatgg tgtctgtctg gtggttgctg 181 cttcttggta caaccgtatc tcacatggtg cacacacaag agcagccttt actggaggag 241 gacacagcac cattagatga tctggatgtt cttgagaaag ccaaaggtat cctgatccgc 301 agtatcctgg agggatttca agaagggcaa caaaacaata gagatctacc agatgcaatg 361 gaaatgatat ctaagcgcca gcacccaggg aaacgattcc aggaggagat agaaaagaga 421 caacaccctg gaaagaggga tctggaagat ctgaatctag agctttccaa aaggcaacac 481 cccggaagaa gatttgtgga tgatgtagag aagaggcaac atccaggaaa gagagaagag 541 ggtgactgga gtaggaggta tctgacagat gactcacgtt atttggacct cctttctgat 601 gtttccagga gacagcaccc aggcaaaaga gttccagccc cattgtttac aaaacgtcaa 661 cacccaggta agagagtgac agaagaagag ggtgatactg aatttgaaaa ctcgaaggaa 721 gtggggaagc gccagcatcc aggaaagaga tatgaccctt gtgaaggccc taatgcctac 781 aactgtaact caggaaacat tctaccggaa gaattgagtt ttgggcttta agctgcccag 841 cccctttatt agttccatct gaccctaaat gattcccaat gaacacaact ttctataatt 901 gttaaataac attgtattaa gtatcataca tttctggaaa gcaagcagct cttagaacac 961 ttcttcgctt taaaaggcac ctggggcata agagtattaa gcttcagaca gtaacctgcc 1021 caccacaggg agggattcaa caatcacaat tggctgagtg ttcctttccc ttgtttggca 1081 gtgagatcag ataaataaat ataagatggc caggaaagtg gactctttct tttctgaaaa 1141 tttgcaagta acaccaaaat ataataattt tgcactctgc agtgtattaa cgtgaagatc 1201 tcaagaaggt tataaattag gttataaatt cttggtgatc tgctcaaagc atttaattca 1261 tagttgcttc catggtttga tggggaatgc acattctaaa ttgcttattg ctaattagcg 1321 cttgccacac agttctggtg gtagatcttg atgaggcata ttcaataaaa gtagagccca 1381 tagtaaaatt tgtgccccgt cagctttaag gatcctctgt aagcaatatg tgttgtgagg 1441 gccacttgtt tctaaagtaa tattttcatt ttaataaata tgtctactca aatgacaaaa 1501 acattcatta tttcactaca ttatactcct tcccacagca attatgtacc tatgaatcct 1561 gatagaagac tgcagttttc ctcttatatc ctccatgttg 
gattcaccat aagtcaccaa 1621 aatatatcta tagggaagca cactatacac aatagcagtg acccccatcc agtggcttgt 1681 gggcaacaag ctactcacca acccccttgg ctgttgctcc cagtggccct aaagtaaggt 1741 gcataaaaaa accagatgaa cttgtcaaaa agagcctccc ttagactgcc ttgttccaca 1801 tagaggctac catatagcca atcacagccc ttatttggca cccccgggaa cttttttcat 1861 gcttgagttg ctccccaaat ctttttacag ttgaatatgt ctcatggcta aaaaaacgtg 1921 aggaccccgg cgtaatatag tataatatac acacactcac tttggaaaac tctatggaga 1981 tcaataagca cttttgggtt aaactatttt tttgatacaa tttgagcact ttatatatgg 2041 attttaaaga tattccgctt tagtagtctg tggtgcgctg ccccataaat atattggtga 2101 attattcacc acctactctt aacaattctg ctcaattcat ctagatgtta acataataca 2161 tcaccagtat cacaatggca gcgggaagca aagacattct gtagtgtcct gagaccagct 2221 aaagcctaga ggtggaccat aaataatgtc tattgcaggg tcagtacaaa caaaaacacc 2281 aaggctgctt tatacaaggc atatctaatt tgcaggtatt ttgctgaact attactccac 2341 acacaaagct tgagggacac agactaataa tctgctgaag gtttgcagga tggacagttg 2401 gacactgctt tgcttcaact ttattctagg cttgtgctct gatgtatgca gcgtcaaata 2461 ccagctgttg tttgactaca actcccagaa gcctcagcat actgagggtg gtatgcttga 2521 atgcttgaat gcttgaatac cgaaggctgt ctgtcctcca acacctcccg ttgatctccc 2581 gctccagctc ttattgtcat tccattgtat attttgtttt taaatgtata aagaaataaa 2641 aaaaaagtat gatatattca cccttcttct tctgagtata aaaagattta aatgaatgtg 2701 aaaataatat ttttatagac aacaatcttt gtgcagtgtt ggtaaataca tgtttattct 2761 gtatatagct attttaatat gcatactgaa agaatatata tatataataa gaagcatgaa 2821 catctcattg cctgggtatg aaacaataaa gattgcatct gataatgaag caaattcgct 2881 ctgtggcgca gtattatgtt gacctgatga tgaagttagg tctggtgcgc ttctcaatgt 2941 tcgtggcgct ggccc // LOCUS MUSIGCS 302 bp ds-DNA ROD 26-JUL-1990 DEFINITION Mouse Ig heavy-chain gene enhancer region. ACCESSION M35179 KEYWORDS constant region; germline; immunoglobulin heavy-chain. SOURCE Mouse (strain BXXB:SB/Le) DNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. 
REFERENCE   1  (bases 1 to 302)
  AUTHORS   Theofilopoulos,A.N., Kofler,R., Noonan,D., Singer,P. and
            Dixon,F.J.
  TITLE     Molecular aspects of murine systemic lupus erythematosus
  JOURNAL   Springer Semin. Immunopathol. 9, 121-142 (1986)
  STANDARD  simple staff_review
BASE COUNT  90 a 59 c 68 g 85 t
ORIGIN
        1 ctgcagcagc tggcaggaag caggtcatgt ggcaaggcta tttggggaag ggaaaataaa
       61 accactaggt aaacttgtag ctgtggtttg aagaagtggt tttgaaacac tctgtccagc
      121 cccaccaaac cgaaagtcca ggctgagcaa aacaccacct gggtaatttg catttctaaa
      181 ataagttgag gattcagccg aaactggaga ggtcctcttt taacttattg agttcaacct
      241 tttaatttta gcttgagtag ttctagtttc cccaaactta agtttatcga cttctaaaat
      301 gt
//
LOCUS       MUSIGCT      313 bp ds-DNA              ROD       26-JUL-1990
DEFINITION  Mouse Ig heavy-chain gene enhancer region.
ACCESSION   M35180
KEYWORDS    constant region; germline; immunoglobulin heavy-chain.
SOURCE      Mouse (lupus erythematosus strain MRL/I) DNA.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 313)
  AUTHORS   Theofilopoulos,A.N., Kofler,R., Noonan,D., Singer,P. and
            Dixon,F.J.
  TITLE     Molecular aspects of murine systemic lupus erythematosus
  JOURNAL   Springer Semin. Immunopathol. 9, 121-142 (1986)
  STANDARD  simple staff_review
BASE COUNT  94 a 59 c 69 g 91 t
ORIGIN
        1 ctgcagcagc tggcaggaag caggtcatgt ggcaaggcta tttggggaag ggaaaataaa
       61 accactaggt aaacttgtag ctgtggtttg aagaagtggt tttgaaacac tctgtccagc
      121 cccaccaaac cgaaagtcta ggctgagcaa aacaccacct gggtaatttg catttctaaa
      181 ataagttgag gattcagccg aaactggaga ggtcctcttt taacttattg agttcaacct
      241 tttaatttta gcttgagtag ttctagtttc cccaaactta agtttatcga cttctaaaat
      301 gtatttagaa ttc
//
LOCUS       MUSTCBYBB    459 bp ds-DNA              ROD       26-JUL-1990
DEFINITION  Mouse T-cell receptor C beta-1/2 recombinant chain, exon 1.
ACCESSION   M35181
KEYWORDS    T-cell receptor beta chain; constant region; germline.
SOURCE      Mouse (strain NZW) liver DNA.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 459)
  AUTHORS   Theofilopoulos,A.N., Kofler,R., Noonan,D., Singer,P. and
            Dixon,F.J.
  TITLE     Molecular aspects of murine systemic lupus erythematosus
  JOURNAL   Springer Semin. Immunopathol. 9, 121-142 (1986)
  STANDARD  simple staff_review
FEATURES       from to/span     description
    pre-msg  <    1 >  459      TCR C-beta-1/2 recombinant chain, exon 1
    IVS         436 >  459      TCR C-beta-1/2 intron A (no splice consensus)
BASE COUNT  117 a 124 c 126 g 92 t
ORIGIN
        1 ttacaagatc aaggcagatc cagatagctc tcagaccatt cgtactctct ttactttcca
       61 gaggatctga gaaatgtgac tccacccaag gtctccttgt ttgagccatc aaaagcagag
      121 attgcaaaca aacaaaaggc taccctcgtg tgcttggcca ggggcttctt ccctgaccac
      181 gtggagctga gctggtgggt gaatggcagg gaggtccaca gtggggtcag cacggaccct
      241 caggcctaca aggagagcaa ttatagctac tgcctgagca gccggctgag ggtctctgct
      301 accttctggc acaatcctcg aaaccacttc cgctgccaag tgcagttcca tgggctttca
      361 gaggaggaca agtggccaga gggctcaccc aaacctgtca cacagaacat cagtgcagag
      421 gcctggggcc gagcaggtaa gtgcggacct catgaggaa
//
LOCUS       HAMSCARPB    537 bp ss-mRNA             ROD       26-JUL-1990
DEFINITION  Hamster alpha-crystallin B chain mRNA, 5' end.
ACCESSION   J03849
KEYWORDS    alpha-crystallin B chain.
SOURCE      Hamster scrapie infected brain, cDNA to mRNA.
  ORGANISM  Mesocricetus sp.
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
            Theria; Eutheria; Rodentia; Myomorpha; Cricetidae; Cricetinae;
            Cricetini.
REFERENCE   1  (bases 1 to 537)
  AUTHORS   Duguid,J.R., Rohwer,R.G. and Seed,B.
  TITLE     Isolation of cDNAs of scrapie-modulated RNAs by subtractive
            hybridization of a cDNA library
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 85, 5738-5742 (1988)
  STANDARD  full staff_review
COMMENT     Draft entry and printed sequence for [1] kindly submitted by
            J.Duguid, 25-OCT-1990.
FEATURES       from to/span     description
    pept         21 >  537      alpha-crystallin B chain
BASE COUNT  113 a 177 c 127 g 120 t
ORIGIN
        1 catacattca cctagccacc atggacatcg ccatccacca cccctggatc cgccgtccct
       61 ttttcccttt ccactccccc agccgcctct ttgaccagtt cttcggagag cacctgttgg
      121 agtctgacct cttctcaact gccacttctc tgagtccctt ctacctgcgg ccaccttcct
      181 tccttcgggc acccagctgg attgacactg gactctcaga gatgcggatg gagaaggaca
      241 gattctccgt caacctggat gtgaagcact tctccccgga agagctgaaa gtcaaggtgc
      301 tgggggacgt ggttgaagtg catggcaagc acgaagagcg ccaggacgaa cacggcttca
      361 tctctaggga gttccatagg aagtaccgga tcccagctga tgtggatcct ctgaccatta
      421 cttcatccct gtcatctgac ggcgtcctca ctgtgaatgg accaaggaaa caggcctctg
      481 gccccgagcg taccattccc atcacccgtg aagagaagcc tgctgtcact gcagccc
//
LOCUS       HAMSCRAP     282 bp ss-mRNA             ROD       26-JUL-1990
DEFINITION  Hamster glial fibrillary acidic protein mRNA, partial cds.
ACCESSION   J03847
KEYWORDS    glial fibrillary acidic protein.
SOURCE      Hamster scrapie infected brain, cDNA to mRNA.
  ORGANISM  Mesocricetus sp.
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
            Theria; Eutheria; Rodentia; Myomorpha; Cricetidae; Cricetinae;
            Cricetini.
REFERENCE   1  (bases 1 to 282)
  AUTHORS   Duguid,J.R., Rohwer,R.G. and Seed,B.
  TITLE     Isolation of cDNAs of scrapie-modulated RNAs by subtractive
            hybridization of a cDNA library
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 85, 5738-5742 (1988)
  STANDARD  full staff_review
COMMENT     Draft entry and printed sequence for [1] kindly submitted by
            J.Duguid, 25-OCT-1990.
FEATURES       from to/span     description
    pept     <    1 >  282      glial fibrillary acidic protein (AA at 1)
BASE COUNT  90 a 69 c 76 g 47 t
ORIGIN
        1 gagggccaaa gcctcaagga ggagatggct cgccacctgc aggagtatca agatctactc
       61 aatgtcaagc tagccctgga catcgagatt gccacctata ggaaattgct agaaggcgag
      121 gaaaaccgca tcaccatccc tgtacaaact ttctccaacc tgcaaatccg agaaaccagc
      181 ctggacacca agtccgtgtc agaaggacac ctcaagagga acatcgtggt aaagacagtg
      241 gagatgaggg atggtgaggt cattaaggag tccaagcagg ag
//
LOCUS       HAMSCRAPA    327 bp ss-mRNA             ROD       26-JUL-1990
DEFINITION  Hamster metallothionein II mRNA, complete cds.
ACCESSION   J03848
KEYWORDS    metallothionein II.
SOURCE      Hamster scrapie infected brain, cDNA to mRNA.
  ORGANISM  Mesocricetus sp.
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
            Theria; Eutheria; Rodentia; Myomorpha; Cricetidae; Cricetinae;
            Cricetini.
REFERENCE   1  (bases 1 to 327)
  AUTHORS   Duguid,J.R., Rohwer,R.G. and Seed,B.
  TITLE     Isolation of cDNAs of scrapie-modulated RNAs by subtractive
            hybridization of a cDNA library
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 85, 5738-5742 (1988)
  STANDARD  full staff_review
COMMENT     Draft entry and printed sequence for [1] kindly submitted by
            J.Duguid, 25-OCT-1990.
FEATURES       from to/span     description
    pept         62    247      metallothionein II
BASE COUNT  65 a 100 c 81 g 81 t
ORIGIN
        1 cactcaagtt tcgacttttc ctcggtcctc agccggtctt caaccgccgc cttcactcgc
       61 catggacccc aactgctcct gtgccacaga tggatcctgc tcctgctctg ggtcttgcaa
      121 atgcaaagag tgcaaatgca ccacgtgcaa gaaaagctgc tgctcctgct gcccggtggg
      181 ctgtgcgaag tgctcccagg gctgcgtctg caaagaggct tcggagaagt gcagctgctg
      241 cgcctgaagc ggattcccct cagctgtctg taaatagagc aatgtgtaga aacgtattgg
      301 tttttttaca accccgtcct attctcc
//
LOCUS       ASOTAAG1    2935 bp ds-DNA              PLN       26-JUL-1990
DEFINITION  A.oryzae Taka-amylase A (Taa-G1) gene, complete cds.
ACCESSION   M33218
KEYWORDS    Taka-amylase A.
SOURCE      A.oryzae (strain JCM02239) DNA.
  ORGANISM  Aspergillus oryzae
            Eukaryota; Plantae; Thallobionta; Eumycota; Ascomycotina;
            Plectomycetes; Eurotiales; Trichocomaceae.
REFERENCE 1 (bases 1 to 2935) AUTHORS Tsukagoshi,N., Furukawa,M., Nagaba,H., Kirita,N., Tsuboi,A. and Udaka,S. TITLE Isolation of a cDNA encoding Aspergillus oryzae Taka-amylase A: Evidence for multiple related genes JOURNAL Gene 84, 319-327 (1989) STANDARD simple staff_entry FEATURES from to/span description pept 607 772 Taka-amylase A (Taa-G1) precursor, exon 1 828 868 Taka-amylase A precursor, exon 2 955 1070 Taka-amylase A precursor, exon 3 1140 1248 Taka-amylase A precursor, exon 4 1317 1545 Taka-amylase A precursor, exon 5 1603 1765 Taka-amylase A precursor, exon 6 1830 1976 Taka-amylase A precursor, exon 7 2041 2281 Taka-amylase A precursor, exon 8 2360 2647 Taka-amylase A precursor, exon 9 sigp 607 669 Taka-amylase A signal peptide matp 670 772 Taka-amylase A 828 868 Taka-amylase A 955 1070 Taka-amylase A 1140 1248 Taka-amylase A 1317 1545 Taka-amylase A 1603 1765 Taka-amylase A 1830 1976 Taka-amylase A 2041 2281 Taka-amylase A 2360 2644 Taka-amylase A pre-msg 543 > 2789 Taa-G1 mRNA and introns IVS 773 827 Taa-G1 intron A (no splice consensus) IVS 869 954 Taa-G1 intron B IVS 1071 1139 Taa-G1 intron C IVS 1249 1316 Taa-G1 intron D IVS 1546 1602 Taa-G1 intron E IVS 1766 1829 Taa-G1 intron F IVS 1977 2040 Taa-G1 intron G IVS 2282 2359 Taa-G1 intron H signal 2784 2789 poly-A signal BASE COUNT 818 a 752 c 657 g 708 t ORIGIN 1 ccagtgaatt catggtgttt tgatcatttt aaatttttat atggcgggtg gtgggcaact 61 cgcttaccga ttacgttagg gctgatattt acgtaaaaat cgtcaaggga tcgaagacca 121 aagtagtaaa accccggagt caacagcatc caagcccaag tccttcacgg agaaacccca 181 gcgtccacat cacgagcgaa ggaccacctc tacgcatcgg acgcaccatc caaatagaag 241 cagcaaagcg aaacagccca agaaaaaggt cggcccgtcg gccttttctg caacgctgat 301 cacgggcagc gatccaacca acaccctcca gagtgactag gggcggaaat ttaaagggat 361 taatttccac tcaaccacaa atcacagtcg tccccggcta ttgtcctgca gaatgcaatt 421 gaaactcttc tgcgaatcgc ttgattcccc gcccctggcc gtagagctta aagtatgtcc 481 cttgtcgatg cgatgtatca caaccatata aatactagca agggatgcca tgcttggagg 541 atagcaaccg 
acaacatcac atcaagctct cccttctctg aacaataaac cccacagaag 601 gcatttatga tggtcgcgtg gtggtctcta tttctgtacg gccttcaggt cgcggcacct 661 gctttggctg caacgcctgc ggactggcga tcgcaatcca tttatttcct tctcacggat 721 cgatttgcaa ggacggatgg gtcgacgact gcgacttgta atactgcgga tcgggtgtgt 781 tgttacctac tagctttcag aaagaggaat gtaaactgac ttgatataga aatactgtgg 841 tggaacatgg cagggcatca tcgacaaggt aaattgcccc tttatcaaaa aaaaagaagg 901 aaaagcagaa gaaaaaataa aataaaaaga actctagtcc taaccatcac atagttggac 961 tatatccagg gaatgggctt cacagccatc tggatcaccc ccgttacagc ccagctgccc 1021 cagaccaccg catatggaga tgcctaccat ggctactggc agcaggatat gtaagtcgat 1081 ttctttaaat atctacctgt catcttttac atcaatatga actaacttga tggttttaga 1141 tactctctga acgaaaacta cggcactgca gatgacttga aggcgctctc ttcggccctt 1201 catgagaggg ggatgtatct tatggtcgat gtggttgcta accatatggt tcgtggtcct 1261 ttgcaactga cttcgcggat atggttcatt tcagtactga caatgagtaa tatcagggct 1321 atgatggagc gggtagctca gtcgattaca gtgtgtttaa accgttcagt tcccaagact 1381 acttccaccc gttctgtctc attcaaaact atgaagatca gactcaggtt gaggattgct 1441 ggctaggaga taacactgtc tccttgcctg atctcgatac caccaaggat gtggtcaaga 1501 atgaatggta cgactgggtg ggatcattgg tatcgaacta ctccagtaag atatttctcc 1561 ctcattctac aacttggctg atcgatgatc ttacgaaatc agttgacggc ctccgtatcg 1621 acacagtaaa acacgtccag aaggacttct ggcccgggta caacaaagcc gcaggcgtgt 1681 actgtatcgg cgaggtgctc gacggtgatc cggcctacac ttgtccctac cagaacgtca 1741 tggacggcgt actgaactat cccatgtatg gttcctccaa ccatgagcct tcttgcaagt 1801 ctcatctcct aacgaaacgc taaaaccagt tactatccac tcctcaacgc cttcaagtca 1861 acctccggca gcatgcacga cctctacaac atgatcaaca ccgtcaaatc cgactgtcca 1921 gactcaacac tcctgggcac attcgtcgag aaccacgaca acccacggtt cgcttcgtaa 1981 gtcttccctt ttattttcgt tcccaatttc cacacagaac cccacctaac aagagcaaag 2041 ttacaccaac gacatagccc tcgccaagaa cgtcgcagca ttcatcatcc tcaacgacgg 2101 aatccccatc atctacgccg gccaagaaca gcactacgcc ggcggaaacg accccgcgaa 2161 ccgcgaagca acctgggctt cgggctaccc gaccgacagc gagctgtaca agttaattgc 2221 ctccgcgaac gcaatccgga 
actatgccat tagcaaagat acaggattcg tgacctacaa 2281 ggtaagcaca acctctaagc ataccctaat ggcctatcct tcagagtatc tgacacaaga 2341 ctaatcactg gcaatacaga actggcccat ctacaaagac gacacaacga tcgccatgcg 2401 caagggcaca gatgggtcgc agatcgtgac tatcttgtcc aacaagggtg cttcgggtga 2461 ttcgtatacc ctctccttga gtggtgcggg ttacacagcc ggccagcaat tgacggaggt 2521 cattggctgc acgaccgtga cggttggttc ggatggaaat gtgcctgttc ctatggcagg 2581 tgggctacct agggtattgt atccgactga gaagttggca ggtagcaaga tctgtagtag 2641 ctcgtgaagg gtggagagta tatgatggta ctgctattca atctggcatt ggacagtgag 2701 tttgagtttg atgtaacttg tctattctat gatgtatggt ctttttgttc tatagttgga 2761 aatcggaatg atctcaaatc ttgaataaat ataaaaagga taatactcac atccatcaca 2821 accttacaag gttaattccg agctatattc caccgacaca caaataggca gattcttctc 2881 tcgccaggaa tcgcgatatt attggcatgc aaataacgat aactgtctca gaagg // LOCUS ASOTAAG2A1 197 bp ds-DNA PLN 26-JUL-1990 DEFINITION A.oryzae Taka-amylase A (Taa-G2) gene, 5' end. ACCESSION M33220 KEYWORDS Taka-amylase A. SOURCE A.oryzae (strain JCM02239) DNA. ORGANISM Aspergillus oryzae Eukaryota; Plantae; Thallobionta; Eumycota; Ascomycotina; Plectomycetes; Eurotiales; Trichocomaceae. REFERENCE 1 (bases 1 to 197) AUTHORS Tsukagoshi,N., Furukawa,M., Nagaba,H., Kirita,N., Tsuboi,A. and Udaka,S. TITLE Isolation of a cDNA encoding Aspergillus oryzae Taka-amylase A: Evidence for multiple related genes JOURNAL Gene 84, 319-327 (1989) STANDARD simple staff_entry FEATURES from to/span description pept 195 > 197 Taka-amylase A (Taa-G2) precursor pre-msg 131 > 197 Taa-G2 mRNA and introns signal 2 11 CAAT box signal 95 100 TATA box BASE COUNT 59 a 52 c 36 g 50 t ORIGIN 1 aatgcaattt aaactcttct gcgaatcgct tgattccccg cccttggccg tagagcttaa 61 agtatgtccc ttgtcgatgc gatgtatcac aacatataaa tactagcaag ggatgccatg 121 cttggaggat agcaaccgac aacatcacat caagctctcc cttctctgaa caataaaccc 181 cacagaaggc atttatg // LOCUS ASOTAAG2A2 198 bp ds-DNA PLN 26-JUL-1990 DEFINITION A.oryzae Taka-amylase A (Taa-G2) gene, 3' end. 
ACCESSION   M33222
KEYWORDS    Taka-amylase A.
SOURCE      A.oryzae (strain JCM02239) DNA.
  ORGANISM  Aspergillus oryzae
            Eukaryota; Plantae; Thallobionta; Eumycota; Ascomycotina;
            Plectomycetes; Eurotiales; Trichocomaceae.
REFERENCE   1  (bases 1 to 198)
  AUTHORS   Tsukagoshi,N., Furukawa,M., Nagaba,H., Kirita,N., Tsuboi,A. and
            Udaka,S.
  TITLE     Isolation of a cDNA encoding Aspergillus oryzae Taka-amylase A:
            Evidence for multiple related genes
  JOURNAL   Gene 84, 319-327 (1989)
  STANDARD  simple staff_entry
FEATURES       from to/span     description
    pept     <    1      9      Taka-amylase A (Taa-G2) (AA at 1)
    pre-msg  <    1    151      Taa-G2 mRNA and introns (alt.)
    pre-msg  <    1    156      Taa-G2 mRNA and introns (alt.)
BASE COUNT  45 a 48 c 53 g 52 t
ORIGIN      About 2.1 kb after segment 1.
        1 agctcgtgaa gggtggagag tatatgatgg tactgctatt caatctggca ttggacagtg
       61 agtttgagtt tgatgtacag tataaatcta gtgtactttg cacccaccac gcaatgaaac
      121 ggcaccgggc cccgtctgag agcccgtctc gaatccctgt tggtcatctt ccatcgcttc
      181 gtcctccaga ggcgagga
//
LOCUS       ASOTTAM1     191 bp ss-mRNA             PLN       26-JUL-1990
DEFINITION  A.oryzae Taka-amylase A (Taa) mRNA, 3' end.
ACCESSION   M33219
KEYWORDS    Taka-amylase A.
SEGMENT     1 of 2
SOURCE      A.oryzae (strain JCM02239), cDNA to mRNA, clones lambda-T[1-4].
  ORGANISM  Aspergillus oryzae
            Eukaryota; Plantae; Thallobionta; Eumycota; Ascomycotina;
            Plectomycetes; Eurotiales; Trichocomaceae.
REFERENCE   1  (bases 1 to 191)
  AUTHORS   Tsukagoshi,N., Furukawa,M., Nagaba,H., Kirita,N., Tsuboi,A. and
            Udaka,S.
  TITLE     Isolation of a cDNA encoding Aspergillus oryzae Taka-amylase A:
            Evidence for multiple related genes
  JOURNAL   Gene 84, 319-327 (1989)
  STANDARD  simple staff_entry
FEATURES       from to/span     description
    pept        189 >  191      Taka-amylase A (Taa)
    mRNA        125 >  191      Taa mRNA
BASE COUNT  56 a 52 c 32 g 51 t
ORIGIN
        1 ttccggccat ataaatggtt cattgttcat tactctataa tgctaatgtt tagattagca
       61 caactatgac tgggcaaatg ccgccggcca tagatagatc atctcctctc ggacgcttgt
      121 ccgaagcaac cgacaacatc acatcaagct ctcccttctc tgaacaataa accccacaga
      181 aggcatttat g
//
LOCUS       ASOTTAM2     156 bp ss-mRNA             PLN       26-JUL-1990
DEFINITION  A.oryzae Taka-amylase A (Taa) mRNA, 5' end.
ACCESSION   M33221
KEYWORDS    Taka-amylase A.
SEGMENT     2 of 2
SOURCE      A.oryzae (strain JCM02239), cDNA to mRNA, clones lambda-T[1-4].
  ORGANISM  Aspergillus oryzae
            Eukaryota; Plantae; Thallobionta; Eumycota; Ascomycotina;
            Plectomycetes; Eurotiales; Trichocomaceae.
REFERENCE   1  (bases 1 to 156)
  AUTHORS   Tsukagoshi,N., Furukawa,M., Nagaba,H., Kirita,N., Tsuboi,A. and
            Udaka,S.
  TITLE     Isolation of a cDNA encoding Aspergillus oryzae Taka-amylase A:
            Evidence for multiple related genes
  JOURNAL   Gene 84, 319-327 (1989)
  STANDARD  simple staff_entry
FEATURES       from to/span     description
    pept     <    1      9      Taka-amylase A (Taa) (AA at 1)
    mRNA     <    1    151      Taa mRNA (alt.)
    mRNA     <    1    156      Taa mRNA (alt.)
BASE COUNT  40 a 35 c 42 g 39 t
ORIGIN      About 1.5 kb after segment 1.
        1 agctcgtgaa gggtggagag tatatgatgg tactgctatt caatctggca ttggacagtg
       61 agtttgagtt tgatgtacag tataaatcta gtgtactttg cacccaccac gcaatgaaac
      121 ggcaccgggc cccgactgag agcccgtctc gaatcc
//
LOCUS       CIPNADGAPD  1354 bp ss-mRNA             PLN       26-JUL-1990
DEFINITION  M.crystallinum glyceraldehyde-3-phosphate dehydrogenase
            (NAD-GAPDH) mRNA, complete cds.
ACCESSION   J05223
KEYWORDS    D-glyceraldehyde-3-phosphate:NAD+ oxidoreductase;
            glyceraldehyde-3-phosphate dehydrogenase.
SOURCE      M.crystallinum, cDNA to mRNA.
  ORGANISM  Mesembryanthemum crystallinum
            Eukaryota; Plantae; Embryobionta; Magnoliophyta; Magnoliopsida;
            Caryophyllidae; Caryophyllales; Aizoaceae.
REFERENCE 1 (bases 1 to 1354) AUTHORS Ostrem,J.A., Vernon,D.M. and Bohnert,H.J. TITLE Increased expression of a gene coding for NAD:glyceraldehyde-3- phosphate dehydrogenase during the transition from C-3 photosynthesis to crassulacean acid metabolism in Mesembryanthemum crystallinum JOURNAL J. Biol. Chem. 265, 3497-3502 (1990) STANDARD simple staff_entry FEATURES from to/span description pept 55 1068 glyceraldehyde-3-phosphate dehydrogenase (EC 1.2.1.12) BASE COUNT 315 a 308 c 345 g 386 t ORIGIN 1 tctcacttct ctcttcttcc cctcgatctc tcaatctctc tctctcttcc tacaatggct 61 aaggttaagg tcggaatcaa cggttttgga aggatcgggc gtttggtcgc cagagtgatc 121 ctccagaggg atgactgtga gctcgtcgct gtcaacgacc ccttcatctc caccgattac 181 atgacataca tgttcaagta cgacagtgtc cacggtcagt gcaagagcca tgagatcaag 241 ttgaaggacg agaagaccct tctcttcggt gagaccccgg tcgccgtctt cggatgcagg 301 aacccagagg aaatcccatg gggtcaggct ggagccgact tcgttgtcga atccaccgga 361 gtcttcaccg acaaggacaa ggctgctgct catttgaagg gtggtgctaa gaaggtcgtt 421 atctcagctc ctagcaagga tgctcctatg tttgttgttg gtgttaacga gcacgagtac 481 aagtcagacc tcaacatcgt ttctaatgcc agttgtacca caaactgtct tgctcccttg 541 gccaaggtta tcaacgacag gtttggcatc gttgagggtc ttatgacaac tgtccacgcc 601 atgactgcta cccaaaagac cgttgatggt ccatcaatga aggactggag aggtggaagg 661 gctgcttcat tcaacatcat ccctagcagc actggagcag ctaaggctgt cggcaaggtt 721 ttgcctgctt tgaacgggaa attgacagga atggctttcc gtgttccaac ttgtgatgtg 781 tccgtggttg acctcacagt cagaattgag aaggctgcta gctacgagca gatcaaggct 841 gccatcaagg aggaatctga gggcaagctg aagggtattt tgggatacac cgaggatgat 901 cttgtttcca ccgactttat tggtgacaac aggtcaagca tctttgatgc caaggccgga 961 atctcattga acgacaactt cgtcaagctt gtctcgtggt acgacaacga atggggttac 1021 agtacccgtg ttgttgactt gatcatgcac atctcaaagt gccagtaagc tatttgctga 1081 aggttggctg agtgtgcgtt gatgcagtgt ttttcccttg tctatcatga gatggctatc 1141 gtcatcatca tttgaataaa gcgggatttt gagaaaaacc ggagctttgt ctttccgttt 1201 agtttcctag gtttggtata taggggtgat tgtttctccc ccctttgtgt tttgttatta 1261 tttagtgaaa gaacttgcag 
tctatatcgg agttatttga ctttccggtg gcacttatcc 1321 agcatttatg aaacattgct gtgagctttt gagt // LOCUS ECOPRIA 2658 bp ds-DNA BCT 26-JUL-1990 DEFINITION E.coli primosomal protein n' (priA) gene, complete cds, and cytR gene, 5' end. ACCESSION M33293 KEYWORDS cytR gene; priA gene; primosomal protein. SOURCE E.coli (strain W3110) DNA, clone pEL042. ORGANISM Escherichia coli Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Facultatively anaerobic rods; Enterobacteriaceae. REFERENCE 1 (bases 1 to 2658) AUTHORS Lee,E.H., Masai,H., Allen,G.C.Jr. and Kornberg,A. TITLE The priA gene encoding the primosomal replicative n' protein of Escherichia coli JOURNAL Proc. Natl. Acad. Sci. U.S.A. 87, 4620-4624 (1990) STANDARD full automatic COMMENT Authorin sequence for [1] kindly submitted by G.C.Allen Jr., 26-MAR-1990. FEATURES from to/span description pept 64 2262 primosomal protein n' pept 2418 > 2658 cytR gene product signal 27 32 -10 region signal 5 10 -35 region binding 53 59 ribosome binding site BASE COUNT 578 a 738 c 756 g 586 t ORIGIN 1 gatccgcact cttctgcgac aatgtgtata ctaacccacc gaatttcaag tcaggatgat 61 gctatgcccg ttgcccacgt tgccttgccc gttccgcttc ctcgtacctt tgactatctg 121 ctgccagaag gcatgacggt taaagctggg tgtcgcgtgc gcgtgccgtt tggcaaacag 181 caggagcgca tcgggattgt ggtatcagtt agcgatgcca gcgaactgcc gctcaatgag 241 ctaaaagcgg tagtcgaagt gctggatagt gagccggtgt ttactcactc cgtctggcga 301 ttgctgctat gggcggcaga ttactatcat catccgattg gcgatgtgct gtttcatgcc 361 ttgccgattt tactacgcca ggggcggcct gcggcgaacg cgccgatgtg gtactggttt 421 gccactgaac aaggccaggc ggtggatctg aacagcctga aacgctcccc caagcaacaa 481 caggcgctgg cggcgttacg gcaaggcaaa atctggcgcg accaggtccg cacgctcgaa 541 tttaatgatg ccgcgttgca ggcgctacgc aaaaaaggtc tgtgtgattt agcaagtgaa 601 acaccagagt ttagcgactg gcgaacgaac tatgccgttt ctggtgagcg gttgcgattg 661 aataccgaac aggccaccgc cgttggcgca attcatagcg cggcagatac tttttctgcc 721 tggctgctgg cgggcgttac cggttccggt aaaacggagg tttatctcag cgtactggaa 781 aacgtgctcg ctcagggcaa acaggcgctg gtgatggtgc 
cggaaatcgg cctgacaccg 841 caaactatcg cccgttttcg tgaacgtttt aatgcccccg tggaagttct gcattccggc 901 ctgaacgaca gcgagcgtct ttcggcgtgg ctgaaagcga aaaatggtga ggcggcgatt 961 gtgatcggca cccgctccgc gctgtttacg ccgtttaaaa atctcggcgt gattgtcatt 1021 gatgaagagc acgacagctc ctacaagcag caggaaggct ggcgctatca tgcccgcgac 1081 ctggcggtgt atcgtgcgca cagcgagcaa atcccgatta ttcttggctc cgcaacgccc 1141 gcgctggaaa cgttatgcaa cgtccagcag aaaaaatacc gcctgctgcg cctgacccgt 1201 cgggcaggga atgcgcgtcc ggcaattcaa catgtgctgg atttaaaagg tcagaaggtg 1261 caggcaggtc tggctccggc gttaatcact cgtatgcgcc agcatttaca ggctgataac 1321 caggtcattc tctttcttaa ccgccgtggc tttgcgcctg cactgctgtg ccacgactgt 1381 ggctggattg ccgaatgccc acgttgcgat cactactaca cgctgcatca ggcgcagcac 1441 catctgcgct gccaccactg tgacagtcag cgtccggtgc cgcgccagtg cccttcctgc 1501 ggttccacgc acctggtccc cgtggggctg ggcaccgaac agcttgaaca gacgctcgcg 1561 ccgttgttcc ccggcgtgcc catttctcgt atcgaccgcg ataccaccag ccgcaaaggg 1621 gcgctggaac agcaactggc agaagtacat cgcggcggcg cgcggatttt gattggtaca 1681 caaatgctgg cgaaaggtca ccatttcccg gatgtgacgc tggttgcatt actggacgtg 1741 gacggcgcgc tgttttctgc cgattttcgc tcggcagagc gtttcgctca gctttacacc 1801 caggtcgccg gtcgtgccgg gcgtgcgggt aaacagggcg aagtggtgct gcaaacgcac 1861 catccggaac atcctctgtt gcaaacgttg ctctataaag gctacgacgc ctttgccgaa 1921 cagcggctgg ctgagcggcg aatgatgcag ctaccgccgt ggaccagcca tgtgattgtg 1981 cgtgcggaag atcataacaa tcagcacgcg ccattgttcc tgcaacaact gcgtaatctg 2041 atcctctcca gcccactggc agacgagaaa ctgtgggttc tcggtccggt tccggctctg 2101 gcacctaaac gtggcggtcg ctggcgctgg cagatattgt tgcagcaccc ttcccgcgtg 2161 cgcttgcaac acatcattaa cggtacgctg gcgctcatca atacaatacc ggattcccgt 2221 aaggtgaaat gggtgctgga tgttgatccg attgagggtt aaaccgctca cgatgcgagg 2281 cggatcgaaa aattcaatat tcatcacact tttcatgaaa attctgtaac cgttttcacg 2341 cgctatctgc taaaaatgtt gccgatgtga agtaaacatg gatgtagtac gcctgacgtg 2401 ccaggcgagg agtgagtgtg aaagcgaaga agcaggaaac tgccgcgacc atgaaagacg 2461 ttgccctcaa ggcaaaagtc tctacagcga ccgtctcccg agcattaatg 
aatcccgata 2521 aagtctccca ggccacccgt aatcgggttg aaaaagcggc ccgggaagtg ggttatttac 2581 cgcagcctat ggggcgcaac gtcaagcgta atgaatcccg caccattctg gtgattgtcc 2641 cggatatctg cgatcccc // LOCUS EWCTELRNA 657 bp ds-DNA INV 26-JUL-1990 DEFINITION E.crassus telomerase RNA component gene, complete cds. ACCESSION M33461 KEYWORDS telomerase RNA. SOURCE E.crassus DNA. ORGANISM Euplotes crassus Eukaryota; Animalia; Metazoa; Ciliophora; Polyhymenophora; Spirotricha; Heterotrichida; Clevelandellina. REFERENCE 1 (bases 1 to 657) AUTHORS Shippen-Lentz,D. and Blackburn,E.H. TITLE Functional evidence for an RNA template in telomerase JOURNAL Science 247, 546-552 (1990) STANDARD simple staff_entry FEATURES from to/span description RNA 152 342 telomerase RNA component site 186 197 functional telomeric template BASE COUNT 202 a 122 c 111 g 222 t ORIGIN 1 aaaaccccaa aaccccaaaa ccccaaatct gataaaatta ttacgaatag aattttaaga 61 cctgcttatt gttttcgcgt aatttttgac ccataataat taacagaagt aatgactagt 121 tgtttataac ctaataggag gatatagggt agttctccat tgactaatcc gtcaaatctg 181 tcaaacaaaa ccccaaaacc gatcaatagg tgcgtttagc ttgattacac ctcttaaatg 241 aaatcttgca attctggaga gcttgagagg tgaaaccccc acagttaggt caaacatagt 301 ttgagatttg tatctcatat gctctagctg tcctctcatc tttttgacat tagctagacg 361 agacagctcc tcttgctatt tacttgcctt agtccgatca ctccgctaat atttttgatt 421 tttaaatttg gcggaatttc ttgttcacta atcttgaaat ttttacagaa attgttagat 481 ttaataagct aataatctat gtcagagcct ttagccaatt agaggctttc ctaagtacga 541 aagaggtata tatcattaca ttttgaatcc ctgacctcca tttttaagga atagagatac 601 cctccattat attcaatttg ggaaggattg aaaggggttt tggggttttg gggtttt // LOCUS HS4DWXJ 160 bp ds-DNA VRL 26-JUL-1990 DEFINITION Epstein-Barr virus defective WZhet junction. ACCESSION M33474 KEYWORDS . SOURCE Epstein-Barr virus (strain HR-1, clinical sample 9) DNA. ORGANISM Epstein-Barr virus Viridae; ds-DNA enveloped viruses; Herpesviridae; Gammaherpesviridae. REFERENCE 1 (bases 1 to 160) AUTHORS Patton,D.F., Shirley,P., Raab-Traub,N., Resnick,L. 
and Sixbey,J.W. TITLE Defective viral DNA in Epstein-Barr virus-associated oral hairy leukoplakia JOURNAL J. Virol. 64, 397-400 (1990) STANDARD simple staff_entry FEATURES from to/span description recomb 87 90 WZhet junction BASE COUNT 41 a 45 c 41 g 33 t ORIGIN 1 aatagacagc ccagttgaaa tatgcatggc atgcagcaga cactcctggc gctctgatgc 61 gaccagaaat agctgcagga ccactttata ccaggggcag tggtccccct ccctagaact 121 gacaattggc tgctgtctgg cttacgtaaa cgcgctggac // LOCUS HS4WXJ 181 bp ds-DNA VRL 26-JUL-1990 DEFINITION Epstein-Barr virus WZhet junction, HR-1 clone 5. ACCESSION M33473 KEYWORDS . SOURCE Epstein-Barr virus (strain HR-1, het+ allotype) DNA, clone 5. ORGANISM Epstein-Barr virus Viridae; ds-DNA enveloped viruses; Herpesviridae; Gammaherpesviridae. REFERENCE 1 (bases 1 to 181) AUTHORS Patton,D.F., Shirley,P., Raab-Traub,N., Resnick,L. and Sixbey,J.W. TITLE Defective viral DNA in Epstein-Barr virus-associated oral hairy leukoplakia JOURNAL J. Virol. 64, 397-400 (1990) STANDARD simple staff_entry FEATURES from to/span description recomb 108 111 WZhet junction BASE COUNT 50 a 42 c 46 g 43 t ORIGIN 1 aatagacagc ccagttgaaa tatgcatggc atgcagcaga cattcatcat ttagaaatgt 61 atccaagatt tcattaagtt cgggggtcag gggggagtcc agattcaaat accaggggca 121 gtggtccccc tccctagaac tgacaattgc ctgctgtctg gcttacgtaa acgcgctgga 181 c // LOCUS HUMREGA 4251 bp ds-DNA PRI 26-JUL-1990 DEFINITION Human regenerating protein (reg) gene, complete cds. ACCESSION J05412 KEYWORDS pancreatic stone protein; pancreatic thread protein; regenerating protein. SOURCE Human leukocyte DNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 4251) AUTHORS Watanabe,T., Yonekura,H., Terazono,K., Yamamoto,H. and Okamoto,H. 
TITLE Complete nucleotide sequence of the human reg gene and its expression in normal and tumoral tissues: The reg protein, pancreatic stone protein, and pancreatic thread protein are one and the same product of the gene JOURNAL J. Biol. Chem. 265, 7432-7439 (1990) STANDARD full staff_entry COMMENT Draft entry and printed sequence for [1] kindly submitted by H.Okamoto, 23-FEB-1990. FEATURES from to/span description pept 1571 1634 regenerating protein (reg), exon 2 (first expressed exon) 2270 2388 regenerating protein, exon 3 2696 2833 regenerating protein, exon 4 3549 3660 regenerating protein, exon 5 3856 3923 regenerating protein, exon 6 pre-msg 1196 4116 reg mRNA and introns IVS 1224 1524 reg intron A IVS 1635 2269 reg intron B IVS 2389 2695 reg intron C IVS 2834 3548 reg intron D IVS 3661 3855 reg intron E signal 1169 1174 TATA box BASE COUNT 1161 a 927 c 869 g 1294 t ORIGIN 1 gaattcctgg gctcaagtga tcctctcatg tcagtctccc aaagtgctgg gatgacaggc 61 ttgagccacc acaccaggcc catcatcagt ttttatataa agaaaaaaaa accttaaaat 121 tgttaggcaa atactatgac aaattgtaat atatattctt acatttcaga tttttatttt 181 ttaaactgta taagaattga ttaataaata aaatttagta ttaatctgtc ttttaaaacc 241 atatataaag tttatcaaat agcttataac ttcttgcaac tgaatttttg tattcaatgt 301 tatggctttg atactagtcc aagttgaaat atagatatct actttattcg atttaaattc 361 tgtttagtat tttattatat tttgttaatc catttgtccc aattcatata cttatctctc 421 tttctgtgaa tattcaggtt agttttttct tcctaatttt gcattctgat tggcttttat 481 tccctgaatt ataaatgact attctatgat gattctggta aatactcaat ttcaccacac 541 aatctttgac ttcatactaa caaacagttg acttcaaatg gacaatttca atgaaggctg 601 acttcatatt tagctccttt aagcttcctt aggcatcagc tctctacaat tctcacattg 661 agaatatgtg tattttgtta gctcaaacct tgttagacat gttaaatgtt tagaaatata 721 aatttaacct accccttgag gtaggtcttg agaggtttgt gagcctaaaa agacatggag 781 gaaccactta ttgccacaag cacattgttc taaattattt ggaatcagtt aattcttccc 841 catctcctac ccatgcctga caccaaagag gagcctctaa atttacaggg aatacaagga 901 agtctactgt tctctgctcc tctctgggtt attagggcac atgggagccc 
tcagttgttt 961 tctgctgagc aagagcaaag tccaccttgg acttagacag cttgccaaat tttttgccag 1021 aaggggacct gagttgtgac cactcccagt gtgtgccggg aaaaggctca tactggtgcc 1081 agaatctctt actgtcaatg ctcccaaaac tcaccgcttg cccccacccc ttttgcttaa 1141 atgacgtggt tcttatctca gatcctgata taaagctcct acagctacct ggcctgagaa 1201 gccaactcag actcagccaa caggtaagtg ggcattacag gagaagggcg tctctaacat 1261 gcactgtaga tctaaaatct tcgggaagat acagcatgag tttctgtcca agaggtttta 1321 gctgtaagga agcctcagtg ggatccaaag ttgtttttca gttactgagt ctgtataatc 1381 cccactctca agagaaacat ttgaaggtgt gggtgtctca gaggaccttc ctggtctcag 1441 aaattctgag aggaggtttt aaggaaggta ataggtgctt tgctctccat ctctcagaac 1501 ccccttctct gtgttctcct atagagattg ttgatttgcc tcttaagcaa gagattcatt 1561 gcagctcagc atggctcaga ccagctcata cttcatgctg atctcctgcc tgatgtttct 1621 gtctcagagc caaggtaaga tctcttttcc accaaccaac tctttctagc cctgaagact 1681 tcactctatc cccaagcata cgggtctact tgaaaaaaaa aaaaaagcag agtcactgtt 1741 aagggttgtt ttgtggtgtt tagtgatctt tattgcttat ctcttcacat ttatatacat 1801 ccacacctca ttaaggagtt ggagctagaa tttaaaatga ccccttataa gcaactgctg 1861 cagctggcat gagtttatct gattaaattt atacgtgatg gtggatttgg ggatgtctgt 1921 gtgtagacag tcactaatgg ggtggagaac tgaagagagc cttgtgttca gggaaaccaa 1981 gtcaggcttg agaaagtaga aggctgagtc cttcaaggta gaagagcctg agctccagac 2041 ataaaaggga aactggagac ttgtttcttt ggcctattca ttctgttttt tttcccctga 2101 tcaaagaaac caaagacaga agatgtagga tgcaggagca atagtgagca gtcatcccat 2161 aatagactgg attcttctgt ttctataaag gaacctcaga agctcttacc tcaccttcaa 2221 gccttttcct taccctgaga gcctccttta attgtctctt ctttttcagg ccaagaggcc 2281 cagacagagt tgccccaggc ccggatcagc tgcccagaag gcaccaatgc ctatcgctcc 2341 tactgctact actttaatga agaccgcgag acctgggttg atgcagatgt gagtgaggag 2401 agcagtgtgg gaagggagac tcatgaaggg aggggaagct gccactctcc agtgtgttca 2461 gtggctgcaa tgagatgaga ctgaacccct tgctatacta tcatcagccc caaactttcc 2521 aatctacttt atcccattat tcagcacatt cccagcacaa agaacctggt ggtcagtgac 2581 agcatcatca cggacattac tctgctgtcc tttttctgac ccgtcctctt ggaggactca 
2641 gtatatccgt cacaacttcc tcctccactg agtgctccat tttcttctgc aacagctcta 2701 ttgccagaac atgaattcgg gcaacctggt gtctgtgctc acccaggccg agggtgcctt 2761 tgtggcctca ctgattaagg agagtggcac tgatgacttc aatgtctgga ttgccctcca 2821 tgaccccaaa aaggtaggct gcagccttct ttatctccta atgatcaggt ttgagaagta 2881 agaaggaggt tcaagttctg gtctcttaag taccagcttt tatcgctttc cagaaatcag 2941 gctgtttaca gatcctctaa tgtcctgtgt agcaaggtgc actgtagatg attggagata 3001 taagtggaag gctgaatttc ctaggtgttc ttgtcattca tgaataaact tattctgttt 3061 tcagtcaaca aagcatcttt atgcaccaac ttcttaccta ttttgttact gtcagagtca 3121 caagagagac tagattgccg actatataag aaaggagact tgtggtaaaa atctgctgct 3181 gtactgctgg catttgggaa cctggtagta tactaaataa tataatatat caacaactaa 3241 tggtcagcca atgctatgct ggatatgagg gtcctgggcc acaaagacaa aaaatcagga 3301 accacttttt aagtgagata ctttgggtct ctgtcaaatt cataacactt atttcttggt 3361 ggaatacagt taatgagttg gacagttcag gaaagaagtt tagagcaata gcaaaggaaa 3421 ggaaacaata tttagcaagg tttattcttc ctttgtgtct tagcatgttt ctgagtgtgc 3481 acacaggccc agtgattcca tgtatttttg agtgaccact gcctctgttc tggcccttcc 3541 ccatctagaa ccgccgctgg cactggagca gtgggtccct ggtctcctac aagtcctggg 3601 gcattggagc cccaagcagt gttaatcctg gctactgtgt gagcctgacc tcaagcacag 3661 gtgagaggca gagaatccat ccacctgttt ctgttctctc ctgcttagct ccagggatgg 3721 aactgggact gggatagagg aaaggtgaac tcctcattaa ggaaatggat gtttggtttt 3781 tgtcctgagt cctaaagcca ggagggtcat actctttcgg gtctcccagt tgtaactctt 3841 ctcattgact tataggattc cagaaatgga aggatgtgcc ttgtgaagac aagttctcct 3901 ttgtctgcaa gttcaaaaac tagaggcagc tggaaaatac atgtctagaa ctgatccagc 3961 aattacaacg gagtcaaaaa ttaaaccgga ccatctctcc aactcaactc aacctggaca 4021 ctctcttctc tgctgagttt gccttgttaa tcttcaatag ttttacctac cccagtcttt 4081 ggaaccctaa ataataaaaa taaacatgtt tccactattg tgctgtctta ctgtgtctgc 4141 tatttccaca gctgatgcct gggtggttga gatgagagtg attacaacaa agcttgctct 4201 ggcctatcca cttcttaaaa gtccatccgc ataccatgca tattggaatt c // LOCUS HUMREGRELA 1524 bp ds-DNA PRI 26-JUL-1990 DEFINITION Human reg-related sequence, 
complete cds. ACCESSION J05413 KEYWORDS pancreatic stone protein; pancreatic thread protein; regenerating protein. SOURCE Human esophageal mucosa DNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 1524) AUTHORS Watanabe,T., Yonekura,H., Terazono,K., Yamamoto,H. and Okamoto,H. TITLE Complete nucleotide sequence of the human reg gene and its expression in normal and tumoral tissues: The reg protein, pancreatic stone protein, and pancreatic thread protein are one and the same product of the gene JOURNAL J. Biol. Chem. 265, 7432-7439 (1990) STANDARD full staff_entry COMMENT Draft entry and printed sequence for [1] kindly submitted by H.Okamoto, 23-FEB-1990. BASE COUNT 382 a 368 c 345 g 429 t ORIGIN 1 atctcagagg accttcctgc tgtcaggaat tcagaggagg aaataaggaa ggtaataggt 61 gctctgctct cattctctca aaccctcttc cctgtgtttt cctatagaga ttgctgattt 121 gctccttaag caagagattc actgctgctc agcatggctc agaccaactc atgcttcatg 181 ctgatctcct gcctgatgtt cctgtctctg agccaaggtg agattgtttt ccccacacat 241 acctcccaca accccagccc tgaagccctc actctatcct catgcatatg agttcacttg 301 agaaaaagca gagtcaagtt caggggttgt tttgtgttgt tcagtgatat ttattgctga 361 tctcatccca ttcaaaaaca tcctgacctc cctaaggagt tagagatgga acttagcata 421 accctttatc agtgaccact gcagttggca ttggtttgtc atattaacac tactcatgat 481 gggggtgttg aggatgtctg tttgtagaca gtcattagtg gaatggggaa ctgaggggag 541 ctttgtgtgt agagaaactg gacaggcttg agaaagaagc ctcagtcctt caaggaagaa 601 aaagccataa gtaaaaggga caatggggac acttttcatg agcctattca ttgtgtgctc 661 ttgtcttgag caaagacatc ttgagagcct ataggtaaga tgcagaaggg cagaagtgac 721 caatcgcttc gtgacctata ggatccttct attcctataa agaatcctca gaagctccta 781 cctcatattt tagcctttac cttgccctga gggtctttct taattgtctc tcttttccca 841 ggacaggagg cccatgctga gttgcccaag gcccagatca gctgcccaga aggcaccagt 901 gcctaaggct cccactgcta ctactttaat gaagagcatg agacctgggt ttatgcagat 961 gtgagtgagg agagcagtgt gggaagggag gctcacgaag 
ggaggggaag ctgccactct 1021 ccagtgtgtt cagtggctga tatgagatga gactaatccc ctccctatcc aatcatcagc 1081 ccaaaacttt ccaatctact ttatcccatc attcagcaca gagatgctgg tggtcagtga 1141 cagcatcatc agggacattt ctgtgctgtc ctttttctgt tacatcctct gggagggctc 1201 aatatgtctc ccacactttc ctccttcact gagtgctcca ttttcttctc caacagctct 1261 actgccagaa catgaattca ggtaacctgg tgtctgtgct cacccaggct gagggtgcct 1321 ttgtggcttc gctgattaaa gagagtggca ccaaggatag caatgtctgg attggcctcc 1381 atgaccccca ccggatcagt ctgctgcatc ttctacctcc tgattatcag gttccagagg 1441 gtctgatgtc tggcacctca agcatcagtt tttactatat tatgataaaa gcaacctctc 1501 tataaatcat ataatgtaaa ggat // LOCUS MDPCGA 4801 bp ss-RNA VRL 26-JUL-1990 DEFINITION Aleutian mink disease parvovirus complete genome. ACCESSION M20036 KEYWORDS complete genome. SOURCE Aleutian mink disease parvovirus (strain ADV-G), clone pXVB-4. ORGANISM Aleutian mink disease parvovirus Viridae; ss-DNA nonenveloped viruses; Parvoviridae; Parvovirus. REFERENCE 1 (bases 1 to 4592) AUTHORS Bloom,M.E., Alexandersen,S., Perryman,S., Lechner,D. and Wolfinbarger,J.B. TITLE Nucleotide sequence and genomic organization of Aleutian mink disease parvovirus (ADV): Sequence comparisons between a nonpathogenic and pathogenic strain of ADV JOURNAL J. Virol. 62, 2903-2915 (1988) STANDARD full staff_entry REFERENCE 2 (bases 4593 to 4801) AUTHORS Bloom,M.E., Alexandersen,S., Garon,C.F., Mori,S., Wei,W., Perryman,S. and Wolfinbarger,J.B. TITLE Nucleotide sequence of the 5'-terminal palindrome of Aleutian mink disease parvovirus (ADV) and construction of an infectious molecular clone JOURNAL J. Virol. 64, 3551-3556 (1990) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1],[2] kindly submitted by M.E.Bloom, 18-MAR-1990. Sequence reported below is (+) strand. 
FEATURES from to/span description ORF 116 1978 Left ORF ORF 1983 2207 Middle ORF2 ORF 1993 2211 Middle ORF1 ORF 2241 4399 Right ORF BASE COUNT 1740 a 912 c 943 g 1206 t ORIGIN 1 attaattctc aaccaatatt cgttagcaac caacaccagc tcgcttcgct cgcgcacctt 61 cggcgctggt gttgggcgct tcgcgcttgc taacttcata ttggttgaga attaatccgt 121 gtctttcctg tggaatgagg aagtagtgtg gtatataagc agaggttgct tggagcaaag 181 cacagaccgg ttacagcaaa gtaacatggc tcaggctcaa attgatgagc agaggagact 241 gcaggacctg tatgtgcagt tgaagaagga gattaacgac ggtgaaggag ttgcctggtt 301 gttccaacaa aagacctaca ccgacaagga caacaaacca accaaagcaa caccgccact 361 gaggacaacc tcttctgacc taaggttagc ttttgactct attgaagaga atttaacagc 421 ttctaatgaa cacttaacta acaatgagat aaacttttgt aaactaacct tggggaagac 481 gttgctgtta attgataagc atgtaaaaag ccacagatgg gatagtaaca aagttaactt 541 aatttggcaa atagaaaaag gaaaaactca gcaatttcat attcactgtt gcttaggtta 601 ctttgataag aatgaagatc ctaaggatgt tcaaaaatcc ttaggttggt ttatgaaaag 661 actaaataaa gacctagcag ttatctatag taaccatcat tgtgacatac aagatattaa 721 ggatcctgaa gatagagcta agaacctaaa agtgtggatt gaagatggac ctactaagcc 781 ttacaaatat tttaacaaac aaaccaaaca agactacaat aaaccagttc acttgagaga 841 ctatacattc atatacctgt ttaacaaaga taagataaat acagatagta tggatggtta 901 ctttgctgct ggtaacggtg gcattgttga caacctaact aacaaagaac gaaaaacttt 961 aagaaaaatg tacttagatg agcagagttc agatataatg gatgctaata tagactggga 1021 agatggccaa gacgcgccaa aagtaactga ccaaactgac tcagcaacca caaaaacagg 1081 aactagtttg atttggaaat catgtgctac taaagtaacc tcaaaaaaag aagttgctaa 1141 tccagttcag caaccttcta aaaaactgta ctcagctcaa agtactttag atgcattgtt 1201 taacgttggt tgctttactc cagaagatat gattataaag caaagtgaca aataccttga 1261 actatcttta gaaccaaacg ggcctcaaaa aattaacact ttacttcaca tgaaccaagt 1321 aaagacatca accatgatta ctgcttttga ttgtattata aaatttaatg aagaggaaga 1381 tgacaaacct ttgctagcaa ctataaaaga catgggactt aatgaacaat accttaagaa 1441 ggtactatgt accatcctaa ccaagcaagg tggaaagaga ggttgtattt ggttctatgg 1501 accggggggc actggaaaaa ccttgctagc atctttaata tgtaaagcaa 
cagtaaacta 1561 tggtatggtt actacaagca atccaaactt tccatggact gactgtggca atagaaacat 1621 catttgggct gaagagtgtg gtaactttgg taactgggtt gaagacttta aagccattac 1681 tggaggtggt gatgtaaaag tagacaccaa gaacaagcaa cctcaatcta ttaaaggctg 1741 tgtgattgta acaagcaaca ccaacataac caaagtaact gttggatgtg tggaaacaaa 1801 cgctcacgca gagccactta aacagaggat gattaagata cgttgcatga aaaccatcaa 1861 ccctaaaact aaaataacac caggcatgtt aaaaagatgg ctaaatacct gggatagaca 1921 accaattcaa ctaagccatg agatgcctga actgtactta ggtaagtgcc gttggtaagt 1981 aacacatttt aaatgccaac tttaaaccaa catcaattta tgaggttact ttactttaca 2041 gagactactg gaccaaactc gagtgccaca actgccacga agaatactgg caactcacaa 2101 cctactactg caaagagtgc agaaagtgtg aacacggaaa actgcgacac accaaaaagg 2161 agtgcgagca gtgtgcctgc aaagcagcac aagagacctc ggcatgagta aaagtaaata 2221 acctacttaa agtaacctaa caccataaca ctttactttc cttgtactta tgttacttta 2281 ctttagttcc tcagcactat cctgggaaaa agagaagtgc tccaagacac gtgtttattc 2341 agcaagcaaa aaagaagaag caaactaacc ctgcggtcta ccacggagag gacaccatag 2401 aggaaatgga ttctactgaa gctgaacaaa tggacactga gcaagcaact aaccaaactg 2461 ctgaagctgg tggtgggggg ggtgggggtg gtgggggtgg tggtggtggt ggtggggttg 2521 gtaacagcac tggcggcttt aataacacaa cagaattcaa agtaataaac aatgaagtgt 2581 atattacttg tcacgctact agaatggtac acattaacca agctgacaca gacgaatact 2641 tgatatttaa tgctggtaga actactgata ccaaaacaca tcagcaaaaa ctaaacttag 2701 aattttttgt atatgatgat tttcaccaac aagtaatgac accttggtat atagtagata 2761 gcaacgcttg gggtgtatgg atgagtccta aagactttca acaaatgaaa acactgtgta 2821 gtgaaattag tttggttact ttggaacaag aaatagacaa tgtaaccata aaaactgtaa 2881 cagaaaccaa ccaaggtaac gcatctacca agcaattcaa caatgactta actgcgtcgt 2941 tacaggttgc tttagatact aacaacatac tgccatatac tccagctgcg ccgttggggg 3001 aaacactggg ctttgttcct tggagagcaa ccaaaccaac ccaatatagg tattatcatc 3061 catgttacat ttacaacaga tatcctaaca ttcaaaaagt tgcaacagaa acactaacct 3121 gggatgcagt acaagatgat taccttagtg tggatgaaca gtactttaac tttattacta 3181 tagagaacaa catacctatt aacattctca gaacgggaga taactttcat acaggcttgt 
3241 atgagtttaa cagtaaacca tgtaaactaa ccttaagcta tcaaagtaca cgttgcttgg 3301 ggctacctcc tctctgcaaa ccaaagacag atacaacaca caaagtaacc tcaaaagaaa 3361 acggagctga cctaatttac atacaaggac aagataatac cagactaggt cacttttggg 3421 gtgaggaaag aggtaagaaa aacgcagaga tgaacagaat tagaccttac aacataggtt 3481 accaatatcc tgaatggata ataccagcag ggttacaggg tagttacttt gctggaggac 3541 caagacagtg gagtgacaca accaaaggtg caggtacaca cagtcaacac ttacaacaga 3601 actttagtac taggtacatc tatgacagaa accacggtgg agacaacgag gtagacctat 3661 tagatggaat acccattcat gaaagaagta actactactc agacaatgag atagagcaac 3721 atacagcaaa gcaaccaaag ttacgtacac cacccattca ccactcaaaa atagactcgt 3781 gggaagaaga aggttggcct gctgcttcag gcacacactt tgaagatgag gttatatacc 3841 tagactactt taactttagt ggtgaacagg agctaaactt tccacatgaa gtattagatg 3901 atgctgctca gatgaaaaag ctacttaact cataccaacc aacagttgct caagacaacg 3961 ttggtcctgt atacccgtgg ggacagatat gggacaagaa acctcatatg gatcacaaac 4021 ctagcatgaa caacaacgct ccatttgtat gtaaaaacaa ccctccaggt caactctttg 4081 ttaaactaac agaaaacctc actgatacat ttaactatga tgaaaatcca gacagaataa 4141 aaacctatgg ttactttact tggagaggca agcttgtact aaaaggcaaa ctaagccaag 4201 taacatgctg gaatcctgtt aagagagaac tcataggaga acctggtgta tttactaaag 4261 acaagtatca caaacagata ccaaacaaca aaggtaactt tgaaataggg ttacaatatg 4321 gaagaagtac tatcaaatat atctactaaa gtaacctgtg tactatgtta ctatgttact 4381 atgataatat ctcaataaaa gttacatgaa tagtgaacaa cctaaatact gtgtacttcc 4441 ttattttacc agaaagtggc ggattaaaat aaacctacat tctatactat ctatatacta 4501 ctaactaacc tataggttac tttgctttga tatactgatg taggaataca ggatactaac 4561 atttatatat atactaacat ctatactact aacctaacta tggcctaatg tatgcagtgt 4621 cggcgtcgcc gacaactaca ttatattatt aggcatagtt aggttagtag tatagatgtt 4681 agtatatata taaatgttag tatcctgtgt tcctacttca gtatataaag aaagtttcct 4741 ataggtgggt ttgcggtcta tctagagttg tggtccgtat tggtttctgt aaaggacctg 4801 a // LOCUS MDPUPS 3454 bp ss-RNA VRL 26-JUL-1990 DEFINITION Aleutian mink disease parvovirus (ADV-Utah 1 strain) RNA, partial sequence. 
ACCESSION M32981 KEYWORDS . SOURCE Aleutian mink disease parvovirus (strain ADV-Utah 1) RNA. ORGANISM Aleutian mink disease parvovirus Viridae; ss-DNA nonenveloped viruses; Parvoviridae; Parvovirus. REFERENCE 1 (sites) AUTHORS Bloom,M.E., Alexandersen,S., Perryman,S., Lechner,D. and Wolfinbarger,J.B. TITLE Nucleotide sequence and genomic organization of Aleutian mink disease parvovirus (ADV): Sequence comparisons between a nonpathogenic and pathogenic strain of ADV JOURNAL J. Virol. 62, 2903-2915 (1988) STANDARD full staff_entry REFERENCE 2 (bases 1 to 3454, for [1]) AUTHORS Bloom,M.E., Alexandersen,S., Perryman,S., Lechner,D. and Wolfinbarger,J.B. JOURNAL Unpublished (1990) Rocky Mountain Labs, Hamilton, MT 59840 STANDARD full staff_entry COMMENT Draft entry and computer readable sequence for [1] kindly submitted by M.E.Bloom 18-MAR-1990. Sequence reported below is (+) strand. BASE COUNT 1268 a 673 c 698 g 815 t ORIGIN 1 ggatcctgaa gatagagcta agaacctaaa agtgtgggtt gaagatggac ctactaagcc 61 ttacaaatat tttaacaaac aaaccaacaa gactacaaca aaccagttca cttgagagac 121 tatacattca tatacctgtt taacaaagat aagataaata cagatagtat ggatggttac 181 tttgctgctg gtaacggtgg cattgttgac aacctaacta acaaagaacg aaaaacttta 241 agaaaaatgt acttagatga gcagagttca gatataatgg atgctaatat agactgggaa 301 gatggccaag acgcgccaaa agtaactgac caaactgact cagcaaccac aaaaacagga 361 actagtttga tttggaaatc atgtgctact aaagtaacct caaaaaaaga agttgctaat 421 ccagttcagc aaccttctaa aaaactgtac tcagctcaaa atactttaga tgcattgttt 481 aacgttggtt gctttactcc agaagatatg attataaagc aaagtgacaa ataccttgaa 541 ctatctttag aaccaaacgg gcctcaaaaa attaacactt tacttcacat gaaccaagta 601 aagacatcaa ccatgatgac tgcttttgat tgtattataa aatttaatga agaggaagat 661 gacaaacctt tgctagcaac tataaaagac atgggactta atgaacaata ccttaagaag 721 gtactatgta ccatcctaac caagcaaggt ggaaagagag gttgtatttg gttctatgga 781 ccggggggca ctggaaaaac cttgctagca tctttaatat gtaaagcaac agtaaactat 841 ggtatggtta ctacaagcaa tccaaacttt ccatggactg actgtggcaa tagaaacatc 901 atttgggctg 
aagagtgtgg taaccttggt aactgggttg aagactttaa agccattact 961 ggaggtggtg atgtaaaagt agataccaag aacaagcaac ctcaatctat taaaggctgt 1021 gtgattgtaa caagcaacac caacataacc aaagtaactg ttggatgtgt ggaaacaaac 1081 gctcacgcag agccacttaa acagaggatg attaagatac gttgcatgaa aaccatcaac 1141 cctaaaacta aaataacacc aggcatgtta aaaagatggc taaatacctg ggatagacaa 1201 ccaattcaac taagccatga gatgcctgaa ctgtacttag gtaagtgccg ttggtaagta 1261 acacatttta aatgccaact ttaaaccaac atcaatttat gaggttactt tactttacag 1321 agactactgg accaaactcg agtgccacaa ctgccacgaa gaatactggc aactcacaac 1381 ctactactgc aaagagtgca gaaagtgtga acacggaaaa ctgcgacaca ccaaaaaggg 1441 gtgcgagcag tgtgcctccg aagcagcaca agagacctcg gcatgagtag aagtaagtaa 1501 cctacttaaa gtaacctaac accatgacac tttactttac ttgtacttat gttactttac 1561 tttagttcct cagcactatc ctgggaaaaa gagaagtgct ccaagacacg tatttattca 1621 gcaagcaaaa aagaagaagc aaactaaccc tgcggtgtac cacggagaag acacaataga 1681 ggaaatggat tctgctgaac ctgaacagat ggacactgag caagcaacta accaaactgc 1741 tgaagctggg ggtggagggg gtgggagtgg gggtggtggt ggtgggggtg gtggggttgg 1801 taacagcact ggcggcttta ataacacaac agaattcaaa gtaataaaca atgaagtgta 1861 tattacttgt cacgctacta gaatggtgca catcaaccaa gctgacacag atgaatactt 1921 gatatttaat gctgatagaa ctactgatac caaaacagct caaaaaaaac taaacttaga 1981 attttttgta tatgatgatt ttcaccaaca agtaatgaca ccttggttta tagtagatag 2041 caacgcttgg ggtgtgtgga tgagtcctaa agactttcaa caaatgaaaa cactgtgtag 2101 tgagattagt ttggttactt tggaacaaga gatagacaat gtaaccataa agactgtaac 2161 agaaaccaac caaggtaacg catccaccaa gcaattcaac aatgacttaa ctgcgtcgtt 2221 acaggttgct ttagatacta acaacatact gccatatact ccagctgcgc cgttggggga 2281 aacactgggc tttgttcctt ggagagcaac caaaccaacc caatataggt attatcatcc 2341 atgttacatt tacaacagat atcctaacat tcaaaagctg gggcaggagc aattagaatg 2401 gactggtaca caagatgatt acctgagtgt ggatgagcag tactttaact ttatcactat 2461 agagaacaac atacctatta acattctcag aacgggagat aactttcata caggcttgta 2521 tgagtttaac agtaaaccat gtaaactaac cttaagctat caaagtacac gttgcttggg 2581 gctacctcct ctctgcaaac 
caaagacaga tacaacacac aaagtaacct caaaagaaaa 2641 cggagctgac ctaatttaca tacaaggaca agataatacc agactaggtc acttttgggg 2701 tgaggaaaga ggtaagaaaa acgcagagat gaacagagtt agaccttaca acataggtta 2761 ccaatatcct gaatggataa taccagcagg gttacagggt agttactttg ctggaggacc 2821 aagacagtgg agtgacacaa ccaaaggtgc aggtacacac agtcaacagt tacaacagaa 2881 ctttagtact aggtacatct atgacagaaa ccacggtgga gacaacgagg tagacctatt 2941 agatggaata cccattcatg aaagaagtaa ctactactca gaccatgaga tagagcaaca 3001 tacagcaaag caaccaaagt tacgtacacc acccattcac cactcaaaaa tagactcgtg 3061 ggaagaagaa ggttggcctg ctgcttcagg cacacacttt gaagatgagg ttatatacct 3121 agactacttt aactttagtg gtgaacaaga attagagttt ccacatgaag tattagatga 3181 tgctgctcaa atgaaaaagc tacttaactc ataccaacca acagttgctc aagacaacgt 3241 tggtcctgta tacccatggg gacagatatg ggacaagaaa cctgatatgg atcacaaacc 3301 tagcatgaac aacaacgctc catttgtatg taaaaacaac cctccaggtc aactctttgt 3361 taaactaaca gaaaacctca ctgatacatt taactatgat gaaaatccag acagaataaa 3421 aacctatggt tactttactt ggagaggcaa gctt // LOCUS MUSAA2DEL 300 bp ds-DNA ROD 26-JUL-1990 DEFINITION Mouse dilute prenatal lethal Aa2 deletion breakpoint fusion fragment. ACCESSION M33468 KEYWORDS deletion mutant. SOURCE Mouse DNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 300) AUTHORS Strobel,M.C., Seperack,P.K., Copeland,N.G. and Jenkins,N.A. TITLE Molecular analysis of two mouse dilute locus deletion mutations: Spontaneous dilute lethal-20J and radiation-induced dilute prenatal lethal Aa2 alleles JOURNAL Mol. Cell. Biol. 
10, 501-509 (1990) STANDARD simple staff_entry FEATURES from to/span description recomb 130 133 deletion breakpoint BASE COUNT 82 a 45 c 76 g 97 t ORIGIN 1 agaggctgca cagcgcagac atgttggtag gtaacgtgat agtttagaat tggagtcact 61 gggaatgtga ttatgaaggc ccaagggtac ctgttatctg tagagtaccc agtgtggtgt 121 ggtaagactt ctgcaccttg atagggacgg cttctgagtc agaaaatgtt cttcaaaagt 181 tatgttttac tctctttgct gatatgacta acaatgctgt tgatgattaa ttgataaata 241 tgtggaataa tactgactga tcagtgtaca gattctttgc ttctgagtga ttgccttaaa // LOCUS MUSSL20JA 300 bp ds-DNA ROD 26-JUL-1990 DEFINITION Mouse dilute lethal-20J (d-l20J) deletion breakpoint fusion fragment. ACCESSION M33467 KEYWORDS deletion mutant. SOURCE Mouse (C57BL/6J-d-l120J/d-v-se allotype) DNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 300) AUTHORS Strobel,M.C., Seperack,P.K., Copeland,N.G. and Jenkins,N.A. TITLE Molecular analysis of two mouse dilute locus deletion mutations: Spontaneous dilute lethal-20J and radiation-induced dilute prenatal lethal Aa2 alleles JOURNAL Mol. Cell. Biol. 10, 501-509 (1990) STANDARD simple staff_entry FEATURES from to/span description pept.ps / 42 122 dilute gene, exon 3 179 > 300 dilute gene, exon 5 recomb 129 130 deletion breakpoint BASE COUNT 86 a 51 c 61 g 102 t ORIGIN 1 gtgtcctttt gtgttttgca ttgtgtttct ttacacggaa gatcatctac tatggattac 61 caggagttga atgaggatgg agagctctgg atggtttatg aagggttaaa acaagccaac 121 aggttatatc ttgctcaaag acacaagaaa caaatatcca ttgtacctgt tttttcagta 181 ttttgaggaa ttatatgcag atgaccctaa gaagtatcaa tcctatcgga tttcacttta 241 caaaaggatg attgtatgta aaacacagtg cttttctgtt gtcctctgct acttctagcc // LOCUS PRVVP4 2359 bp ds-RNA VRL 26-JUL-1990 DEFINITION Porcine rotavirus capsid protein VP4 gene, complete cds. ACCESSION M33516 KEYWORDS capsid protein VP4. SOURCE Porcine rotavirus (Gottfried strain; serotype 4) DNA. 
ORGANISM Porcine rotavirus Viridae; ds-RNA nonenveloped viruses; Reoviridae. REFERENCE 1 (bases 1 to 2359) AUTHORS Gorziglia,M., Nishikawa,K., Hoshino,Y. and Taniguchi,K. TITLE Similarity of the outer capsid protein VP4 of the Gottfried strain of porcine rotavirus to that of asymptomatic human rotavirus strains JOURNAL J. Virol. 64, 414-418 (1990) STANDARD simple staff_entry FEATURES from to/span description pept 10 2337 capsid protein VP4 BASE COUNT 812 a 406 c 442 g 699 t ORIGIN 1 ggctataaaa tggcttcgct catttataga cagctgctca ctaattcata cacagttgaa 61 ttatctgatg aaattaaaac aattggatca gaaaagagtc agaatgtaac aattaatccg 121 ggtccgtttg ctcaaacgac ctatgcacca gtcacttgga gacatggaga agtaaacgat 181 tctacaacgg tagaaccagt acttgacggt ccatatcagc caacgagttt caaaccgcca 241 aatgactatt ggatattgtt aaacccgatt aataagggag ttgtattcaa gggtactaac 301 aggactgatg tttgggttgc aatactactc attgaacaac gcgtacctag tcaagatcga 361 caatatacat tatttggaga agtgaagcaa atcactgtag agaatagttc cgacaaatgg 421 aaattctttg aaatgtttag aaacaacgct aacattgatt ttcagcttca acgtccttta 481 acatcagata caaaattagc tggctttcta acacatggtg gacgtgtttg gacatttaat 541 ggtgaaacgc cgcatgctac aactgattac tcaacaactt caaacttacc tgatgtagaa 601 gtagtaatac atactgaatt ctacataata ccaagatctc aagaatctaa atgcaatgag 661 tatattaata ctgggttacc accaatgcaa aacacaagga atgtggttcc agtagcatta 721 tcatctagat ctataactta tcaacgtgca caagttaacg aagatatcat tatatcaaag 781 acttcattgt ggaaagaaat gcaatacaat agagacatta caataagatt taaattcggt 841 aatagcatag taaagcttgg tggattaggt tataaatggt cagaagtctc attcaaagca 901 gcaaattatc agtataatta tttaagggat ggagaacagg tgacagccca cactacttgt 961 tcagttaacg gagtaaataa ttttagttat aatggaggat cactgccaac tgattttagc 1021 gtatctagat atgaattaat aaaagagaat tcatatgttt atatcgatta ctgggatgac 1081 tcacaagcat tcaaaaacat ggtatatgtt agatcacttg cagcaaattt aaattcagtg 1141 aaatgtagtg gaggtaacta taactttaaa attccagttg gtgcatggcc agtaatgagt 1201 ggtggtgcag tatctctaca tttcgcggga gttacattat ctactcaatt tactaatttc 1261 gtatcactca attcactaag attcagattc 
agtttaactg ttgaggaacc atccttttca 1321 attttgcgta cacgtgtatc aggattgtac ggattaccag cagctaatcc gaataatgga 1381 aatgaatact atgaaatagc gggaagattt tctctcattt tattggtacc atctaatgac 1441 gactatcaaa ctccaattat gaattcagtc accgtacgac aagatttaga acgccaattg 1501 ggcgatttga gagaagaatt taattcactg tcacaagaaa tagctatgac tcaattaata 1561 gacttggctt tattgccgtt agatatgttt tccatgttct caggtattaa aagtacaatt 1621 gatgtggcta aatcaatggc cacaaatgtt atgaaaaagt ttaaaaagtc aggactagct 1681 acatctatat cagaactgac tggatcattg ccgagtgctg catcgtcagt ttcaaggagc 1741 tcttctatta gatctaacat ttcatctatt tcagtgtgga cggatgtttc tgaacaaata 1801 gcagatgcat caaattctgt tagaagtatt tcaacgcaga cgtcagctat tagtaaaaga 1861 cttagattac gtgagatcac tactcagact gaagggatga attttgacga tatttccgct 1921 gctgttctca aaacgcccct agataagtca acacatataa gccctgatac gctgccagat 1981 ataataactg aatcgtctga aaaatttata ccaaaacgcg cttatagagt tttaaagaat 2041 gatgaagtta tggaggctga tgtagatggg aaatttttcg catacagagt tgatactttc 2101 gaagaagtgc catttgatgt ggataaattt gttaatctgg ccactgcttc ccctgtgata 2161 tcagctataa ttgattttaa aacactgaaa aacctgaatg acaactatgg tataacacgc 2221 tctcaagcgc tagatttgat tagatctgat cccagggttc tacgtgattt tatcaatcaa 2281 aacaatccaa ttattaaaaa tagaatagaa caattaatac tgcaatgtag attgtgagag 2341 ctctatagag gatgtgacc // LOCUS RATSTAA 1000 bp ss-mRNA ROD 26-JUL-1990 DEFINITION Rat hydroxysteroid sulfotransferase a (STa) mRNA, complete cds. ACCESSION M33329 KEYWORDS hydroxysteroid sulfotransferase a. SOURCE Rat (strain Sprague-Dawley) female liver, cDNA to mRNA. ORGANISM Rattus norvegicus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 1000) AUTHORS Ogura,K., Kajita,J., Narihata,H., Watabe,T., Ozawa,S., Nagata,K., Yamazoe,Y. and Kato,R. TITLE cDNA cloning of the hydroxysteroid sulfotransferase STa sharing a strong homology in amino acid sequence with the senescence marker protein SMP-2 in rat livers JOURNAL Biochem. Biophys. Res. Commun. 
166, 1494-1500 (1990) STANDARD simple staff_entry FEATURES from to/span description pept 38 892 hydroxysteroid sulfotransferase a (STa) mRNA < 1 1000 STa mRNA signal 977 982 poly-A signal BASE COUNT 299 a 187 c 211 g 303 t ORIGIN 1 ctggaatcct aacaggacct acacagagct atttataatg ccagactata cttggtttga 61 aggaatacct tttcctgcct ttgggattcc aaaagaaact ttgcaaaatg tttgtaataa 121 gtttgtggtg aaagaagaag atttgatctt attgacttat cccaagtcag gaacaaactg 181 gctgattgaa attgtctgct tgattcagac caagggagat cccaagtgga tccaatctgt 241 gaccatctgg gatcgctcac cctggataga gactgattta ggatatgata tgttaatcaa 301 aaagaaagga ccacgactca taacctccca tcttcccatg catcttttct ccaagtctct 361 cttcagttcc aaggccaagg tgatctatct catcagaaat cccagagatg ttcttgtttc 421 tggttattat ttctggggta agacaactct tgcgaagaag ccagactcac tgggaacgta 481 tgttgaatgg ttcctcaaag gatatgttcc gtatggatca tggtttgagc acatccgtgc 541 ctggctgtct atgcgagaat tagacaactt cttgttactg tactatgaag acatgaaaaa 601 ggatacaatg ggaaccataa agaagatatg tgacttccta gggaaaaaat tagagccaga 661 tgagctggat ttggtcctca agtacagttc cttccaagtc atgaaagaaa acaacatgtc 721 caattataat ctcatggaga aggaactgat tcttcctggt tttactttca tgagaaacgg 781 cactactggg gactggaaga atcacttcac tgtagcccaa gctgaagcct ttgataaagt 841 gtttcaggag aaaatggccg gtttccctcc agggatgttc ccatgggatt aaaatttcaa 901 aagttttaaa tattttatga acattgattt ttatgtttct gttgttctat gtctgaataa 961 gtgaatgtgg tcattgaata aattctattc tggcattgtg // LOCUS SMFPOLENV 3534 bp ss-RNA VRL 26-JUL-1990 DEFINITION Simian foamy virus type 1 polymerase (pol) gene, 3' end; and envelope (env) gene, complete cds. ACCESSION M33561 KEYWORDS envelope protein; polymerase. SOURCE Simian foamy virus type 1, cDNA to viral RNA. ORGANISM Simian foamy virus Viridae; ss-RNA enveloped viruses; Positive strand RNA virus; Spumavirinae. REFERENCE 1 (bases 1 to 3534) AUTHORS Mergia,A., Shaw,K.E.S., Lackner,J.E. and Luciw,P.A. TITLE Relationship of the env genes and the endonuclease domain of the pol genes of simian foamy virus type 1 and human foamy virus JOURNAL J. 
Virol. 64, 406-410 (1990) STANDARD simple staff_review FEATURES from to/span description pept < 1 582 polymerase (AA at 1) pept 524 3481 envelope protein BASE COUNT 1164 a 648 c 674 g 1048 t ORIGIN 1 gaattcagta ctccttacca cccccaaagt agtggtaaag tggaaaggaa aaatagtgac 61 attaaacgac ttttaactaa actgctaatt gggagacctg ctaagtggta tgatctacta 121 cctgttgtac aattggcctt aaataattct tatagtccct cttctaaata tactcctcat 181 caactcttgt ttggtgtaga ttccaacaca ccgtttgcaa attctgatac acttgactta 241 tccagagaag aggaactgtc tcttttacag gaaattagat cttctctaca ccagccaacc 301 tcccctcctg cctcctctcg ttcctggtct ccttctgttg gccaactagt ccaggagagg 361 gtagctcgcc ctgcttcact tcgaccacgc tggcataagc ctacagctat tttggaggtc 421 gtgaatcctc ggacagtgat aattttggac catcttggca acagacgtac tgtaagtgtt 481 gacaacctta agttaacagc ttatcaggat aatggcacct ccaatgactc tggaacaatg 541 gctcttatgg aagaagatga gtcaagcaca tcaagcactt gaaaatgtaa ccaccttgac 601 tgaggaacag aagcaacaag ttataataga cattcagcat gaagatgttg ttcctactag 661 gatggacaaa ttgaaatatc tggcctattc atgctgcgct actagcacac gtgtattgtg 721 ctggatagtg ttagtttgcg tcttgctatt agttgtattt atatcctgct ttgtgacaat 781 gtccaggata caatggaata aggatattgc tgtttttggt ccagtcattg actggaatgt 841 tagccaacaa gctgtgattc aacaaataag agctaaaaga ttagcaagat caattagggt 901 ggaacatgct actgagacat atgtagaggt caatatgacc agtatacctc aaggggtgtt 961 atatgtgcct catccagaac caataattct caaggagagg gttcttggtt tatctcaggt 1021 cataatgata aactctgaaa atattgctaa tactgctaac cttactcaag aaactaaggt 1081 actgttagca gacatgatta atgaagagat gaatgattta gctaatcaaa tgatagattt 1141 tgaaatccca ttaggagatc ccagagatca aaaacaatac cagcatcaaa aatgttttca 1201 agaatttgca cattgttatt tagtaaaata taaaactact aaaggatggc ctagttctac 1261 tgttatagca gatcaatgcc ctttgcctgg taaccatcct acagtacaat atgcacatca 1321 aaatatatgg gattattatg tcccctttga acaaattcgg ccagaaggat ggaactcaaa 1381 aagttattat gaagatgcta gaataggagg gttttatata ccaaaatggt tacgaaataa 1441 ttcctatacc catgtcttat tttgttctga tcaaatttat ggaaaatggt ataatattga 1501 tctcacagcc caggagaggg aaaatttatt 
agtccaaaaa ttaattaatt tagctaaagg 1561 aaattcatca caattaaagg atagagctat gccagctgaa tgggataaac aaggaaaagc 1621 tgatctattt agacaaatta atactttaga tgtttgtaat agaccagaaa tggtattttt 1681 gttaaattcc tcatattatg aattttccct atgggaagga gattgtggtt ttaccagaca 1741 gaatgttaca caggctaatt ccttatgtaa agatttctat aataactcaa aatggcaaaa 1801 attacatcca tattcgtgta gattttggag atataaacaa gagaaagaag aaactaaatg 1861 tagtaatggt gaaaagaaaa aatgtcttta ttacccacaa tgggatactc ctgaagcttt 1921 atatgacttt gggttcctag catatttaaa ttcttttcct tctccaatct gtataaaaaa 1981 tcagactata agggaacctg agtatgaaat ctcttcttta tacctagaat gcatgaatgc 2041 ttcagacaga catggtatag atagtgcttt attagctttg aagacatttt taaactttac 2101 tggtcagtct gtaaacgaaa tgccattagc tagagccttt gtaggcctta ctgaccctaa 2161 atttccacca acatatccca acattacaag ggaatcttct ggttgtaata ataacaaaag 2221 aaaaaggaga agtgttaata attatgaaag acttagatct atgggatatg ctttaactgg 2281 agctgttcaa actttatctc aaatatctga tattaatgat gagaggctgc aacacggagt 2341 atatttactc cgggatcatg tggtaaccct gatggaagct gcccttcatg atgtttcgat 2401 tatggaagga atgttagcaa ttcaacatgt gcatactcat ctcaatcatc tcaagaccat 2461 acttttgatg agaaagattg attggacatt catcagaagt gactggattc aacagcaatt 2521 acagaagaca gatgatgaaa tgaaattgat acgaagaact gcacgaagtc tagtctacta 2581 tgtcacacaa acctccagtt ctcctacagc tacttcctgg gagattggaa tatattatga 2641 aatagtaatt cctaaacata tatatttaaa taattggcaa gtaatcaatg taggtcattt 2701 attggagtca gctggtcatc tgactcatgt aaaggttaag catccttatg aaataattaa 2761 taaggaatgt agtgacactc aatatttaca tcttgaggaa tgcattagag aggattatgt 2821 gatttgtgac atagtacaaa tagttcaacc atgtggaaat gcaacagaat tgagtgattg 2881 tccagtagca gcattaaagg tgaagactcc atatattcaa gtgtctcccc tgaagaatgg 2941 aagttattta gttttatcta gtactaagga ttgttctata cctgcatatg tacctagtgt 3001 ggtcacagtc aatgaaacag ttaagtgctt tggagtagag tttcacaaac cactttatgc 3061 tgaaacaaaa accagctatg aaccacaagt tccgcatttg aagcttcgtt taccccactt 3121 gactgggatt attgccagct tgcaatcact ggaaatagaa gttacttcta cacaagagaa 3181 tataaaagac cagatcgaaa gggccaaagc acagcttctc 
cggctggaca ttcacgaagg 3241 agactttcct gactggctga aacaagtcgc ctctgcaacc agggacgttt ggcctgctgc 3301 agcttccttt atacaaggag taggtaactt cttatctaat actgcccagg ggatattcgg 3361 ctcagcggta agcctcctat cctatgcaaa acctattttg attggaatag gagttatact 3421 gcttattgcc cttcttttta agataatatc atggcttcct gggaagctca agaagaattg 3481 agagaacttc tacatcatct accagaggac gatccaccag cagatctaac tcat // LOCUS MUSC5DPROA 5401 bp ss-mRNA ROD 26-JUL-1990 DEFINITION Mouse complement component C5D (pro-C5D) mRNA, complete cds. ACCESSION M35526 J05234 KEYWORDS complement component C5D. SOURCE Mouse (strain B10.D2/oSnJ) liver, cDNA to mRNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 5401) AUTHORS Wetsel,R.A., Fleischer,D.T. and Haviland,D.L. TITLE Deficiency of the murine fifth complement component (C5): A 2- base pair gene deletion in a 5'-exon JOURNAL J. Biol. Chem. 265, 2435-2440 (1990) STANDARD full staff_review COMMENT This coding sequence is translated into a truncated protein of the fifth complement component C5, for the C5S sequence see accession # M35525. FEATURES from to/span description pept 14 664 complement component C5D BASE COUNT 1636 a 1212 c 1221 g 1332 t ORIGIN Chromosome 2. 
1 gccgctacca gccatgggtc tttggggaat actttgtctt ttaattttcc tggacaaaac 61 ttggggacag gaacaaacct acgtcatttc agcacccaaa atcctccggg tcggctcgtc 121 tgaaaatgtg gtaattcaag tccatggcta cactgaagca tttgatgcaa ctctttctct 181 aaaaagctat cctgacaaaa aagtcacctt ctcttcaggc tatgttaatt tgtccccgga 241 aaacaaattc caaaacgcgg cactgttgac actacagccc aatcaagttc ctagagaaga 301 aagcccagtc tctcacgtgt atctggaagt tgtgtcaaaa cacttttcaa aatcaaagaa 361 aataccaatt acctataaca atggaattct cttcatccat acagacaaac ctgtttacac 421 gccggaccag tcagtaaaga tcagagtcta ttctctgggt gacgacttga agccagccaa 481 acgggagact gtcttaactt tcatagaccc cgaaggatca gaagttgaca ttgtagaaga 541 aaatgattac accggaatta tctcttttcc tgacttcaag attccatcta atcccaagta 601 tggtgtttgg acaattaaag ctaactataa gaaggatttt acaacaactg gaactgcact 661 ttgaaattaa agaatatgtc ttgccacgat tctctgtttc aatagaacta gaaagaacct 721 tcattggcta taaaaacttt aagaactttg aaatcactgt gaaagcaaga tatttttata 781 ataaagtggt acctgatgct gaagtgtatg ccttttttgg attgagagag gacataaaag 841 atgaggagaa gcagatgatg cacaaagcca cacaagccgc aaagttggtt gacggagttg 901 ctcagatctc ttttgattct gaaacagcag ttaaagagct gtcctacaac agtctagaag 961 acttaaacaa caagtacctt tatattgcag taacagtcac agaatcttca ggtggatttt 1021 cagaagaggc agaaatccct ggagtcaaat atgtcctctc tccctacaca ctgaatttgg 1081 tcgctactcc tcttttcgtg aagcccggga ttccattttc catcaaggca caggttaaag 1141 attcactcga gcaggcggta ggaggggtcc cagtaactct gatggcacaa acagtcgatg 1201 tgaatcaaga gacatctgac ttggaaacaa agaggagcat cactcacgac actgatggag 1261 tagctgtgtt tgtgctgaac ctcccatcaa acgtgacggt gctaaagttt gagatcagaa 1321 ctgatgaccc agaacttccc gaagaaaatc aagccagcaa agagtacgaa gcagttgcgt 1381 actcgtctct cagccaaagt tacatttaca tcgcttggac tgaaaactac aagcccatgc 1441 ttgtgggaga atacctgaat attatggtta cccccaagag cccatatatc gacaaaataa 1501 ctcactataa ttacttgatt ttatccaaag gcaaaattgt acagtacggc acaagagaga 1561 aacttttctc ctcaacttat caaaatataa atattccagt gacacagaac atggttcctt 1621 cagcacgact cctggtctat tacatagtca caggggagca aacagcagaa ttagtggctg 1681 acgcagtctg gataaatatt 
gaggagaagt gtggcaacca gctccaggtc catctgtctc 1741 cagatgaata tgtgtattct ccaggccaaa ctgtgtccct tgacatggtg actgaagcag 1801 actcatgggt agcactatca gcagtggaca gagctgtgta taaagtccag ggaaacgcca 1861 aaagggccat gcaaagagtc tttcgagctt tggatgaaaa gagtgacctg ggctgtgggg 1921 caggtggtgg ccatgacaat gcagatgtat tccatctagc tgggctcacc ttcctcacca 1981 acgcaaacgc agatgactcc cattatcgtg atgactcttg taaagaaatt ctcaggtcaa 2041 agagaaatct gcatctccta aggcagaaaa tagaagaaca agctgctaag tacaaacata 2101 gtgtgctaaa gaaatgctgc tatgacggag cccgagtgaa cttctatgaa acctgtgagg 2161 agcgagtggc ccgggttacc ataggccctc tctgcatcag ggccttcaac gagtgctgta 2221 ctattgcgaa caagatccga aaagaaagcc cccataaacc tgtccaactg ggaaggatcc 2281 acattaagac cctgttacca gtgatgaagg cagatatccg aagctacttt ccagagagct 2341 ggctatggga aattcatcgc gttcccaaaa gaaaacagct gcaggtcacg ctgcctgact 2401 cactaacgac ttgggaaatt caaggcattg gcatttcaga caatggtata tgtgttgctg 2461 atacactcaa ggcaaaggtg ttcaaagaag tcttcctgga gatgaacata ccatattctg 2521 ttgtgcgagg agaacagatc caattgaaag gaactgttta caactatatg acctcaggga 2581 caaagttctg tgttaaaatg tctgctgtgg agggaatctg cacttcggga agctcagctg 2641 ctagccttca cacctccagg ccctccagat gtgtgttcca gaggatagag ggctcgtcca 2701 gtcacttggt gaccttcacc ctgcttcctc tggaaattgg ccttcactcc ataaacttct 2761 cactagagac ctcatttggg aaagacatct tagtaaagac attacgggta gtgccagaag 2821 gagtcaagag ggaaagctat gccggcgtga ttctggaccc taagggaatt cgtggtattg 2881 ttaacagacg aaaggaattc ccatacagga tcccattaga tttggtcccc aagaccaaag 2941 ttgaaaggat tttgagtgtc aaaggactgc ttgtagggga gttcttgtcc acggttctga 3001 gtaaggaagg catcgacatc ctaacccacc tccccaaggg cagtgcagag gcagagctca 3061 tgagcatagc tccggtgttc tatgttttcc actacctgga agcaggaaac cattggaata 3121 ttttctatcc tgatacactg agtaaaagac agagcctgga gaaaaaaata aaacaagggg 3181 tggtgagcgt catgtcctac agaaacgctg actattccta cagcatgtgg aagggggcga 3241 gcgctagtac ctggctgaca gcttttgctc tgagagtgct tggacaggtg gccaagtatg 3301 taaaacagga tgaaaactca atttgtaact ctttgctatg gctggttgag aagtgtcagc 3361 tggaaaacgg ctctttcaag gaaaattccc 
aatatctacc aataaaatta cagggtactt 3421 tgcctgctga agcccaagag aaaactttgt atcttacagc cttttctgtg attggaatta 3481 gaaaggcagt tgacatatgc cccaccatga aaatccacac agcgctagat aaagccgact 3541 ccttcctgct tgaaaacacc ctgccatcca agagcacctt cacactggcc attgtagcct 3601 atgctctttc cctaggagac agaacccacc cgaggtttcg tctaattgtg tcggccctga 3661 ggaaggaagc ttttgttaaa ggtgatccgc ccatttaccg ttactggaga gataccctca 3721 aacgtccaga cagctctgtg cccagcagcg gcacagcagg tatggttgaa accacagcct 3781 atgctttgct cgccagcctg aaactgaagg atatgaatta cgccaacccc atcatcaagt 3841 ggctatctga agagcagagg tatggaggcg gcttttattc cacccaggat acgattaatg 3901 ccatcgaggg cctgacagaa tattcactcc tgttaaaaca aattcatttg gatatggaca 3961 tcaatgtcgc ctacaaacac gaaggtgact tccacaagta taaggtgaca gagaagcatt 4021 tcctggggag gccagtggag gtatctctca atgatgacct tgttgtcagc acaggctaca 4081 gcagtggctt ggccacagta tatgtaaaaa ctgtggttca caaaattagt gtctctgagg 4141 aattttgcag cttttacttg aaaattgata cccaagatat tgaagcatcc agccacttca 4201 ggctcagtga ctctggattc aagcgcataa tagcatgtgc cagctacaag cccagcaagg 4261 aggagtcaac atccgggtcc tcccatgcag taatggatat atcactgccg actggaatcg 4321 gagcaaacga ggaagattta cgggctcttg tggaaggagt ggatcaacta ctaactgatt 4381 accagatcaa agatggccat gtcattctgc aactgaattc gatcccctcc agagatttcc 4441 tctgtgtccg gttccggata tttgaacttt tccaagttgg gtttctgaat cctgctacct 4501 tcacggtgta cgagtatcac agaccagata agcagtgcac catgatttat agcatttctg 4561 acaccaggct tcagaaagtc tgtgaaggag cagcttgcac atgtgtggaa gctgactgtg 4621 cgcaactgca ggcagaagtg gacctagcca tctctgcaga ctccagaaaa gagaaagcct 4681 gtaaaccaga gactgcatat gcttataaag tcaggatcac atcagccact gaagaaaatg 4741 tttttgtcaa gtacactgcg actcttctgg tcacttacaa aacaggggaa gctgctgatg 4801 agaattcgga ggtcaccttc attaaaaaga tgagctgtac caatgccaac ctggtgaaag 4861 ggaagcagta tttaatcatg ggcaaagagg ttctgcagat caaacacaat ttcagtttca 4921 agtatatata ccctctagat tcctccacct ggattgaata ttggcccaca gacacaacgt 4981 gtccatcctg tcaagcattt gtagagaatt tgaataactt tgctgaagac ctctttttaa 5041 acagctgtga atgaaaagtt ctgctgcacg aagattcctc 
ctgcggcggg gggatttctc 5101 ctcctctggc ttggaaacct agcctagaat cagatacact ttctttagag taaagcacaa 5161 gctgatgagt tacgactttg tgaaatggat agccttgagg ggaggcgaaa acaggtcccc 5221 caaggctatc agacgtcagt gccaatagac tgaaacaagt ctgtaaagtt agcagtcagg 5281 ggtgttggtt ggggccggaa gaagagaccc actgaaactg tagcccctta tcaaaacata 5341 tccttgcttg aaagaaaaat accaaggaca gaaaatgcca taaaatcttg actttgcact 5401 c // LOCUS MUSC5PRO 5403 bp ss-mRNA ROD 26-JUL-1990 DEFINITION Mouse complement component C5S (pro-C5) mRNA, complete cds. ACCESSION M35525 M15079 J05234 KEYWORDS clotting factor; complement component C5; complement protein. SOURCE Mouse (strain B10.D2/nSnJ) liver, cDNA to mRNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 141 to 5403) AUTHORS Wetsel,R.A., Ogata,R.T. and Tack,B.F. TITLE Primary structure of the fifth component of murine complement JOURNAL Biochemistry 26, 737-743 (1987) STANDARD full staff_review REFERENCE 2 (bases 1 to 5403) AUTHORS Wetsel,R.A., Fleischer,D.T. and Haviland,D.L. TITLE Deficiency of the murine fifth complement component (C5): A 2- base pair gene deletion in a 5'-exon JOURNAL J. Biol. Chem. 265, 2435-2440 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by D.Noack, 02-APR-1987; for [2] by R.A.Wetsel, 01-DEC-1989. For the C5D sequence see accession # M35526. FEATURES from to/span description pept 14 5056 complement component C5S precursor sigp 14 67 complement component C5S signal peptide matp 68 2035 complement component pro-C5S beta-chain matp 2048 5053 complement component pro-C5S alpha-chain mut 659 660 2 bp deletion in C5D BASE COUNT 1640 a 1212 c 1219 g 1332 t ORIGIN Chromosome 2. 
1 gccgctacca gccatgggtc tttggggaat actttgtctt ttaattttcc tggacaaaac 61 ttggggacag gaacaaacct acgtcatttc agcacccaaa atcctccggg tcggctcgtc 121 tgaaaatgtg gtaattcaag tccatggcta cactgaagca tttgatgcaa ctctttctct 181 aaaaagctat cctgacaaaa aagtcacctt ctcttcaggc tatgttaatt tgtccccgga 241 aaacaaattc caaaacgcgg cactgttgac actacagccc aatcaagttc ctagagaaga 301 aagcccagtc tctcacgtgt atctggaagt tgtgtcaaaa cacttttcaa aatcaaagaa 361 aataccaatt acctataaca atggaattct cttcatccat acagacaaac ctgtttacac 421 gccggaccag tcagtaaaga tcagagtcta ttctctgggt gacgacttga agccagccaa 481 acgggagact gtcttaactt tcatagaccc cgaaggatca gaagttgaca ttgtagaaga 541 aaatgattac accggaatta tctcttttcc tgacttcaag attccatcta atcccaagta 601 tggtgtttgg acaattaaag ctaactataa gaaggatttt acaacaactg gaactgcata 661 ctttgaaatt aaagaatatg tcttgccacg attctctgtt tcaatagaac tagaaagaac 721 cttcattggc tataaaaact ttaagaactt tgaaatcact gtgaaagcaa gatattttta 781 taataaagtg gtacctgatg ctgaagtgta tgcctttttt ggattgagag aggacataaa 841 agatgaggag aagcagatga tgcacaaagc cacacaagcc gcaaagttgg ttgacggagt 901 tgctcagatc tcttttgatt ctgaaacagc agttaaagag ctgtcctaca acagtctaga 961 agacttaaac aacaagtacc tttatattgc agtaacagtc acagaatctt caggtggatt 1021 ttcagaagag gcagaaatcc ctggagtcaa atatgtcctc tctccctaca cactgaattt 1081 ggtcgctact cctcttttcg tgaagcccgg gattccattt tccatcaagg cacaggttaa 1141 agattcactc gagcaggcgg taggaggggt cccagtaact ctgatggcac aaacagtcga 1201 tgtgaatcaa gagacatctg acttggaaac aaagaggagc atcactcatg acactgatgg 1261 agtagctgtg tttgtgctga acctcccatc aaatgtgacg gtgctaaagt ttgagatcag 1321 aactgatgac ccagaacttc ccgaagaaaa tcaagccagc aaagagtacg aagcagttgc 1381 gtactcgtct ctcagccaaa gttacattta catcgcttgg actgaaaact acaagcccat 1441 gcttgtggga gaatacctga atattatggt tacccccaag agcccatata tcgacaaaat 1501 aactcactat aattacttga ttttatccaa aggcaaaatt gtacagtacg gcacaagaga 1561 gaaacttttc tcctcaactt atcaaaatat aaatattcca gtgacacaga acatggttcc 1621 ttcagcacga ctcctggtct attacatagt cacaggggag caaacagcag aattagtggc 1681 tgacgcagtc tggataaata 
ttgaggagaa gtgtggcaac cagctccagg tccatctgtc 1741 tccagatgaa tatgtgtatt ctccaggcca aactgtgtcc cttgacatgg tgactgaagc 1801 agactcatgg gtagcactat cagcagtgga cagagctgtg tataaagtcc agggaaacgc 1861 caaaagggcc atgcaaagag tctttcaagc tttggatgaa aagagtgacc tgggctgtgg 1921 ggcaggtggt ggccatgaca atgcagatgt attccatcta gctgggctca ccttcctcac 1981 caacgcaaac gcagatgact cccattatcg tgatgactct tgtaaagaaa ttctcaggtc 2041 aaagagaaac ctgcatctcc taaggcagaa aatagaagaa caagctgcta agtacaaaca 2101 tagtgtgcca aagaaatgct gctatgacgg agcccgagtg aacttctacg aaacctgtga 2161 ggagcgagtg gcccgggtta ccataggccc tctctgcatc agggccttca acgagtgctg 2221 tactattgcg aacaagatcc gaaaagaaag cccccataaa cctgtccaac tgggaaggat 2281 ccacattaag accctgttac cagtgatgaa ggcagatatc cgaagctact ttccagagag 2341 ctggctatgg gaaattcatc gcgttcccaa aagaaaacag ctgcaggtca cgctgcctga 2401 ctcactaacg acttgggaaa ttcaaggcat tggcatttca gacaatggta tatgtgttgc 2461 tgatacactc aaggcaaagg tgttcaaaga agtcttcctg gagatgaaca taccatattc 2521 tgttgtgcga ggagaacaga tccaattgaa aggaactgtt tacaactata tgacctcagg 2581 gacaaagttc tgtgttaaaa tgtctgctgt ggaggggatc tgcacttcag gaagctcagc 2641 tgctagcctt cacacctcca ggccctccag atgtgtgttc cagaggatag agggctcgtc 2701 cagtcacttg gtgaccttca ccctgcttcc tctggaaatt ggccttcact ccataaactt 2761 ctcactagag acctcatttg ggaaagacat cttagtaaag acattacggg tagtgccaga 2821 aggagtcaag agggaaagct atgccggcgt gattctggac cctaagggaa ttcgtggtat 2881 tgttaacaga cgaaaggaat tcccatacag gatcccatta gatttggtcc ccaagaccaa 2941 agttgaaagg attttgagtg tcaaaggact gcttgtaggg gagttcttgt ccacggttct 3001 gagtaaggaa ggcatcaaca tcctaaccca cctccccaag ggcagtgcag aggcagagct 3061 catgagcata gctccggtgt tctatgtttt ccactacctg gaagcaggaa accattggaa 3121 tattttctat cctgatacac tgagtaaaag acagagcctg gagaaaaaaa taaaacaagg 3181 ggtggtgagc gtcatgtcct acagaaacgc tgactattcc tacagcatgt ggaagggggc 3241 gagcgctagt acctggctga cagcttttgc tctgagagtg cttggacagg tggccaagta 3301 tgtaaaacag gatgaaaact caatttgtaa ctctttgcta tggctggttg agaagtgtca 3361 gctggaaaac ggctctttca aggaaaattc 
ccaatatcta ccaataaaat tacagggtac 3421 tttgcctgct gaagcccaag agaaaacttt gtatcttaca gccttttctg tgattggaat 3481 tagaaaggca gttgacatat gccccaccat gaaaatccac acagcgctag ataaagccga 3541 ctccttcctg cttgaaaaca ccctgccatc caagagcacc ttcacactgg ccattgtagc 3601 ctatgctctt tccctaggag acagaaccca cccgaggttt cgtctaattg tgtcggccct 3661 gaggaaggaa gcttttgtta aaggtgatcc gcccatttac cgttactgga gagataccct 3721 caaacgtcca gacagctctg tgcccagcag cggcacagca ggtatggttg aaaccacagc 3781 ctatgctttg ctcgccagcc tgaaactgaa ggatatgaat tacgccaacc ccatcatcaa 3841 gtggctatct gaagagcaga ggtatggagg cggcttttat tccacccagg atacgattaa 3901 tgccatcgag ggcctgacag aatattcact cctgttaaaa caaattcatt tggatatgga 3961 catcaatgtc gcctacaaac acgaaggtga cttccacaag tataaggtga cagagaagca 4021 tttcctgggg aggccagtgg aggtatctct caatgatgac cttgttgtca gcacaggcta 4081 cagcagtggc ttggccacag tatatgtaaa aactgtggtt cacaaaatta gtgtctctga 4141 ggaattttgc agcttttact tgaaaattga tacccaagat attgaagcat ccagccactt 4201 caggctcagt gactctggat tcaagcgcat aatagcatgt gccagctaca agcccagcaa 4261 ggaggagtca acatccgggt cctcccatgc agtaatggat atatcactgc cgactggaat 4321 cggagcaaac gaggaagatt tacgggctct tgtggaagga gtggatcaac tactaactga 4381 ttaccagatc aaagatggcc atgtcattct gcaactgaat tcgatcccct ccagagattt 4441 cctctgtgtc cggttccgga tatttgaact tttccaagtt gggtttctga atcctgctac 4501 cttcacggtg tacgagtatc acagaccaga taagcagtgc accatgattt atagcatttc 4561 tgacaccagg cttcagaaag tctgtgaagg agcagcttgc acatgtgtgg aagctgactg 4621 tgcgcaactg caggcagaag tagacctagc catctctgca gactccagaa aagagaaagc 4681 ctgtaaacca gagactgcat atgcttataa agtcaggatc acatcagcca ctgaagaaaa 4741 tgtttttgtc aagtacactg cgactcttct ggtcacttac aaaacagggg aagctgctga 4801 tgagaattcg gaggtcacct tcattaaaaa gatgagctgt accaatgcca acctggtgaa 4861 agggaagcag tatttaatca tgggcaaaga ggttctgcag atcaaacaca atttcagttt 4921 caagtatata taccctctag attcctccac ctggattgaa tattggccca cagacacaac 4981 gtgtccatcc tgtcaagcat ttgtagagaa tttgaataac tttgctgaag acctcttttt 5041 aaacagctgt gaatgaaaag ttctgctgca cgaagattcc 
tcctgcggcg gggggattgc 5101 tcctcctctg gcttggaaac ctagcctaga atcagataca ctttctttag agtaaagcac 5161 aagctgatga gttacgactt tgtgaaatgg atagccttga ggggaggcga aaacaggtcc 5221 cccaaggcta tcagatgtca gtgccaatag actgaaacaa gtctgtaaag ttagcagtca 5281 ggggtgttgg ttggggccgg aagaagagac ccactgaaac tgtagcccct tatcaaaaca 5341 tatccttgct tgaaagaaaa ataccaagga cagaaaatgc cataaaatct tgactttgca 5401 ctc // LOCUS HUMENN 1592 bp ss-mRNA PRI 26-JUL-1990 DEFINITION Human endonexin II mRNA, complete cds. ACCESSION J03745 KEYWORDS Ca2+ -dependent phospholipid binding protein; endonexin. SOURCE Human placenta, cDNA to mRNA, (library of Clonetech Laboratories Inc.). ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 1592) AUTHORS Kaplan,R., Jaye,M., Burgess,W.H., Schlaepfer,D.D. and Haigler,H.T. TITLE Cloning and expression of cDNA for human endonexin II, a Ca2+ and phospholipid binding protein JOURNAL J. Biol. Chem. 263, 8037-8043 (1988) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly provided by H.T.Haigler, 06-APR-1988 FEATURES from to/span description pept 160 1122 endonexin II /hgml_locus_uid="LS0217S" /nomgen="ENX2" /map="4q28-q32" mRNA < 1 1592 endonexin II mRNA BASE COUNT 434 a 337 c 366 g 455 t ORIGIN 284 bp upstream of HincII site. 
1 ttggatcagt ctaggtgcag ctgccggatc cttcagcgtc tgcatctcgg cgtcgcccgc 61 gtaccgtcgc ccggctctcc gccgctctcc cggggtttcg gggcacttgg gtcccacagt 121 ctggtcctgc ttcaccttcc cctgacctga gtagtcgcca tggcacaggt tctcagaggc 181 actgtgactg acttccctgg atttgatgag cgggctgatg cagaaactct tcggaaggct 241 atgaaaggct tgggcacaga tgaggagagc atcctgactc tgttgacatc ccgaagtaat 301 gctcagcgcc aggaaatctc tgcagctttt aagactctgt ttggcaggga tcttctggat 361 gacctgaaat cagaactaac tggaaaattt gaaaaattaa ttgtggctct gatgaaaccc 421 tctcggcttt atgatgctta tgaactgaaa catgccttga agggagctgg aacaaatgaa 481 aaagtactga cagaaattat tgcttcaagg acacctgaag aactgagagc catcaaacaa 541 gtttatgaag aagaatatgg ctcaagcctg gaagatgacg tggtggggga cacttcaggg 601 tactaccagc ggatgttggt ggttctcctt caggctaaca gagaccctga tgctggaatt 661 gatgaagctc aagttgaaca agatgctcag gctttatttc aggctggaga acttaaatgg 721 gggacagatg aagaaaagtt tatcaccatc tttggaacac gaagtgtgtc tcatttgaga 781 aaggtgtttg acaagtacat gactatatca ggatttcaaa ttgaggaaac cattgaccgc 841 gagacttctg gcaatttaga gcaactactc cttgctgttg tgaaatctat tcgaagtata 901 cctgcctacc ttgcagagac cctctattat gctatgaagg gagctgggac agatgatcat 961 accctcatca gagtcatggt ttccaggagt gagattgatc tgtttaacat caggaaggag 1021 tttaggaaga attttgccac ctctctttat tccatgatta agggagatac atctggggac 1081 tataagaaag ctcttctgct gctctgtgga gaagatgact aacgtgtcac ggggaagagc 1141 tccctgctgt gtgcctgcac caccccactg ccttccttca gcacctttag ctgcatttgt 1201 atgccagtgc ttaacacatt gccttattca tactagcatg ctcatgacca acacatacac 1261 gtcatagaat gaaaatagtg gtgcttcttt ctgatctcta gtggagatct ctttgactgc 1321 tgtagtacta aagtgtactt aatgttacta agtttaatgc ctggccattt tccatttata 1381 tatatttttt aagaggctag agtgctttta gcctttttta aaaactccat ttatattaca 1441 tttgtaacca tgatacttta atcagaagct tagccttgaa attgtgaact cttggaaatg 1501 ttattagtga agttcgcaac taaactaaac ctgtaaaatt atgatgattg tattcaaaag 1561 attaatgaaa aataaacatt tctgtccccc tg // LOCUS CPAFPRFA 1277 bp ds-DNA ORG 26-JUL-1990 DEFINITION C.paradoxa cyanelle ferredoxin (petF) and ribosomal protein S10 (rps10; 
rpsJ) genes, complete cds, and elongation factor Tu (tufA) gene, 5' end. ACCESSION M35206 KEYWORDS elongation factor Tu; ferredoxin; ribosomal protein S10. SOURCE C.paradoxa (isolate UTEX LB555) cyanelle DNA, clone pCpcGP1.3. ORGANISM Cyanelle Cyanophora paradoxa Eukaryota; Plantae; Thallobionta; Chromophycota; Cryptophyceae; Cryptomonadales; Kathablepharidaceae; Cyanophora paradoxa. REFERENCE 1 (bases 1 to 1277) AUTHORS Bryant,D.A., Schluchter,W.M. and Stirewalt,V.L. TITLE Ferredoxin and ribosomal protein S10 are encoded on the cyanelle genome of Cyanophora paradoxa JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by D.A.Bryant, 14-JUN-1990. Author address: D.A.Bryant s-101 Frear Bldg. Dept. of Mol. and Cell Biol. Pennsylvania State University University Park, PA 16802 email: DAB14@PSUVM FEATURES from to/span description pept 338 637 ferredoxin (petF) pept 1027 710 (c) ribosomal protein S10 (rps10; rpsJ) pept > 1277 1114 (c) elongation factor Tu (tufA; AA at 1275) binding 313 318 ribosome binding site rpt 643 694 inverted repeat rpt 1065 1097 inverted repeat BASE COUNT 477 a 170 c 163 g 467 t ORIGIN 1 agatcttatc taagatatgt aaataaataa aaatatatat ctatatttat agtatatatt 61 aatttttttt aaaaatcgat actaaattta aattttcctt ttttttcttt ataaaaattt 121 aattttaaat agaaaaaatt aagtttttcg aaaaaagcaa ttaaaacata ttaaaaaaaa 181 attaataaac atggtaaact ttaaatataa atttataatt aactgaaaaa ataataaaaa 241 taaatttata tatatatata ttttagatta aaataattta aattaaatta ttaaaagttc 301 taccttgtaa ctataattat ttaggagata gtattttatg gcagtatata aagttcgtct 361 tatttgtgaa gaacaaggtt tagataccac tattgaatgt ccagatgatg agtacattct 421 tgatgcagca gaagaacaag gtattgattt accatactcc tgtcgtgcag gtgcatgttc 481 tacttgtgca ggtaaagtgg tagaaggaac tgtagatcaa tctgatcaat ctttcttaga 541 tgacgctcaa ttagcagctg gttatgtatt aacttgtgta gcatacccat cttctgactg 601 tacagttaaa actcaccaag aagaatctct ttactaaaaa ataaaaaatc taaataataa 661 aatagaaatc tctattttat 
tatttagatt ttcttaattc aaaaaaaaac taaagtttaa 721 cttccacatc aacacctgct ggtaaatcta aacgagttaa agtatcaatt gttttggaag 781 atggtaaata taaatcaatt attctgcgat gaactctaat ttcgaaatgt tctcgtgaat 841 ctttatctac atgtggggaa cgtaaaacgc aataaatttt cttttttgtt ggtaaaggaa 901 taggtcctac tgcggtagca tcagttcgtt ttgcagcttc aataatttgt tcacatgagt 961 tttctaataa tgaagagtca taagaacgta gttgaatacg aatttttaat tgttgattac 1021 tggccataat ttttaatttt taatttttat tttttaaatt aaaaagagag aaataaatac 1081 attttctatt tctctctaaa atttagattt taattatttt aaaatcttag aaactacacc 1141 tgcaccaatt gtacgaccac cttcacgaat cgcgaaacgc ataccttgtt caatcgcaat 1201 tggatgtact aaacttactg tcattttaat acgatctcct ggcataacca tttctgcatt 1261 actaccatca tctgcag // LOCUS MUSIGHZSA 333 bp ds-DNA ROD 26-JUL-1990 DEFINITION Mouse Ig germline H-chain gene, D region. ACCESSION M35332 KEYWORDS diversity exon; germline; immunoglobulin heavy chain. SOURCE Mouse liver DNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 333) AUTHORS Landolfi,N.F., Capra,J.D. and Tucker,P.W. TITLE Germ-line sequence of the D-H segment employed in Ars-A antibodies: Implications for the generation of junctional diversity JOURNAL J. Immunol. 
137, 362-365 (1986) STANDARD simple staff_review FEATURES from to/span description pept / 107 / 129 Ig heavy chain D region (AA at 107; 107 could be 109) iDNA < 1 106 V-D intervening DNA iDNA 130 > 333 D-J intervening DNA BASE COUNT 95 a 98 c 62 g 78 t ORIGIN 1 tgacaactga aactcaaccg tgctgcctgg cccccaatgc tctctacacc tgcaaaacca 61 gagaccatac tggccagtgc tttttgtgaa gggatctact actgtgttta ttactatggt 121 ggtagctacc acagtgctat atccatcagc aaaaacccat tgtgcccagc agactcttga 181 gctcgaaaaa ctgagtctag aaaagctggc atcacggggt ttatatcccg agtcttgacc 241 actgacccat taatactatc caacacagag ctctccgtct gcccacaaag aaatccaacc 301 accctaaagt cagatcctct agagtcgacc tgc // LOCUS WHTREPTA 295 bp ds-DNA PLN 26-JUL-1990 DEFINITION T.monococcum aegilopoides repetitive DNA sequence, clone pTbUCD1. ACCESSION M35329 KEYWORDS repetitive DNA. SOURCE T.monococcum aegilopoides leaf DNA, clone pTbUCD1. ORGANISM Triticum monococcum Eukaryota; Plantae; Embryobionta; Magnoliophyta; Liliopsida; Commelinidae; Cyperales; Poaceae. REFERENCE 1 (bases 1 to 295) AUTHORS Dvorak,J., McGuire,P.E. and Cassidy,B. TITLE Apparent sources of the A genomes of wheats inferred from polymorphism in abundance and restriction fragment length of repeated nucleotide sequences JOURNAL Genome 30, 680-689 (1988) STANDARD simple staff_review BASE COUNT 72 a 58 c 66 g 99 t ORIGIN 1 tccagacttg ggtaacaggg tgtgccttag aatcccagtt gatagtgggc agtcctgaca 61 gaagatagtg cactgagcca aacttgaatg tgtcaagtgc ttcattcgga atctccttgt 121 acatgttgaa catagagttg tggtccatct ttttcttggc ataaatgtcc aagtcatctg 181 cttgctcctc tggggcattg atcattataa gtaatagtct tttcttcaac tttaataggt 241 gcagctactt ttacttctat gggaggatga tatttaaacc acttctcctt gggga // LOCUS WHTREPTB 273 bp ds-DNA PLN 26-JUL-1990 DEFINITION T.monococcum aegilopoides repetitive DNA sequence, clone pTbUCD2. ACCESSION M35330 KEYWORDS repetitive DNA. SOURCE T.monococcum aegilopoides leaf DNA, clone pTbUCD2. 
ORGANISM Triticum monococcum Eukaryota; Plantae; Embryobionta; Magnoliophyta; Liliopsida; Commelinidae; Cyperales; Poaceae. REFERENCE 1 (bases 1 to 273) AUTHORS Dvorak,J., McGuire,P.E. and Cassidy,B. TITLE Apparent sources of the A genomes of wheats inferred from polymorphism in abundance and restriction fragment length of repeated nucleotide sequences JOURNAL Genome 30, 680-689 (1988) STANDARD simple staff_review BASE COUNT 58 a 44 c 74 g 97 t ORIGIN 1 ctggccatgg agggcctatg tagatagaca ggcttcgaga agcttctttc tttctagtgt 61 ctgtactcag accggttgct tccgcatgtg cttgtatgag tgtatgactt gagtgtcggg 121 tcatgtgacc cctatctgta tgaacatgtt atgtatggct ctctagagcc tttaaataaa 181 gtacttgagt tgtagagtat tgttgtgatg ccatgttgta tgtactcata tcgggcatat 241 tgtgtgtatg attgaaatgc ttggtatgag tgg // LOCUS WHTREPTC 229 bp ds-DNA PLN 26-JUL-1990 DEFINITION T.monococcum aegilopoides repetitive DNA sequence, clone pTbUCD3. ACCESSION M35331 KEYWORDS repetitive DNA. SOURCE T.monococcum aegilopoides leaf DNA, clone pTbUCD3. ORGANISM Triticum monococcum Eukaryota; Plantae; Embryobionta; Magnoliophyta; Liliopsida; Commelinidae; Cyperales; Poaceae. REFERENCE 1 (bases 1 to 229) AUTHORS Dvorak,J., McGuire,P.E. and Cassidy,B. TITLE Apparent sources of the A genomes of wheats inferred from polymorphism in abundance and restriction fragment length of repeated nucleotide sequences JOURNAL Genome 30, 680-689 (1988) STANDARD simple staff_review BASE COUNT 85 a 55 c 37 g 52 t ORIGIN 1 caaattagct actccagtat gtaaaaacct gtttgtccaa cacttagcag atttcactct 61 tgatagatca ctagcaatag ctcccgcaaa atcgcaaaag agttcatgat ctgcccaaaa 121 caacaactat gcaaaagttg agctcgattg agtcaaccta gggtgctcca acataacaag 181 taaagacatg gatggattaa gcacaacaag catgacaaac cactcttac // LOCUS RATMTXXX 169 bp ds-DNA ORG 26-JUL-1990 DEFINITION Rat mitochondrial HindIII fragment. ACCESSION M35251 KEYWORDS . SOURCE Rat mitochondrial DNA. 
ORGANISM Mitochondrion Rattus norvegicus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae; Rattus norvegicus. REFERENCE 1 (bases 1 to 169) AUTHORS Brown,G.G., Castora,F.J., Frantz,S.C. and Simpson,M.V. TITLE Mitochondrial DNA polymorphism: Evolutionary studies on the genus Rattus JOURNAL Ann. N.Y. Acad. Sci. 361, 135-153 (1981) STANDARD simple staff_review FEATURES from to/span description ORF < 1 > 169 ORF allele 56 56 a in type A; g in type B allele 80 80 a in type A; g in type B allele 122 122 t in type A; g in type B BASE COUNT 44 a 24 c 43 g 58 t ORIGIN 1 agcttgctaa tagtcatcat gttgctatca atggaaagat tatttgtaat cctcgagcta 61 taattatagt tcggctgtga attcgttcgt agttggtgtt tgctaggcag aataagagtg 121 atgaggttaa gccgtgggcg attattagta ttgtagctcc catgaagct // LOCUS MUSCRABP 868 bp ss-mRNA ROD 26-JUL-1990 DEFINITION Mouse cellular retinoic acid-binding protein (CRABP-II) mRNA, complete cds. ACCESSION M35523 KEYWORDS cellular retinoic acid-binding protein. SOURCE Mouse 12.5 day old embryo, cDNA to mRNA, clone lambda-mE2.1. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 868) AUTHORS Giguere,V., Lyn,S., Yip,P., Siu,C.-H. and Amin,S. TITLE Molecular cloning of a novel cellular retinoic acid-binding protein expressed during mouse embryogenesis and in adult skin JOURNAL Proc. Natl. Acad. Sci. U.S.A. (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable [or printed] sequence for [1] kindly submitted by V.Giguere, 22-JUN-1990. FEATURES from to/span description pept 116 532 cellular retinoic acid-binding protein BASE COUNT 226 a 218 c 220 g 204 t ORIGIN Chromosome 2. 
1 gaattccggg gaggatctgt tctgcaaagg agacagcaaa gtatctttag cctaaaggac 61 tcagcgtcca gtgttctagt tgaagatcta aagagaaagc caccttgctg ccactatgcc 121 taacttttct ggcaactgga agatcatccg atcggaaaac tttgaggaaa tgctaaaagc 181 tctgggggtg aacatgatga tgaggaagat cgctgtggct gcagcctcca agccagcagt 241 cgagatcaaa caggagaatg acactttcta catcaaaacc tccaccactg tgcgaaccac 301 ggagattaac ttcaagatcg gggaggaatt tgaggagcag accgtggatg ggagaccctg 361 taagagtttg gtgaaatggg agagtggaaa caaaatggtg tgcgagcaga ggcttctgaa 421 gggggagggc cccaagacct cctggagccg agaactgacc aatgatggag agctgatcct 481 gacaatgaca gcagatgacg ttgtgtgcac cagggtctac gtccgagagt gagtgcctac 541 gggtccaaga actgcctgag acgacttctg tgcccgctac aggacacaaa cctccctccc 601 acgtccatct tacaaactag ctctcccctt actcctgagg gttactgctt cctccaaggc 661 cttttgttct ttgccttctc tacgccagag aggggcagaa gctcagaacc ctcccaccgc 721 catttgcccc tcccaggtca gcagtcccag ctccatacca gggtccttcc tggaagagac 781 tgtctctctg gcctctactc cttatccttg tagtctgtgt gatttagaat atttattggt 841 taattttatt aaaatgtttc cggaattc // LOCUS BTHCRYIA 4320 bp ds-DNA BCT 26-JUL-1990 DEFINITION B.thuringiensis delta-endotoxin gene, complete cds. ACCESSION M35524 KEYWORDS delta-endotoxin. SOURCE B.thuringiensis kenyae (strain HD588) DNA. ORGANISM Bacillus thuringiensis Prokaryota; Bacteria; Firmicutes; Endospore-forming rods and cocci; Bacillaceae. REFERENCE 1 (bases 1 to 4320) AUTHORS Von Tersch,M.A., Loidl,R.H., Jany,C.S. and Johnson,T.B. TITLE Insecticidal toxin genes from Bacillus thuringiensis variety kenyae: Cloning characterization and comparative studies JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable [or printed] sequence for [1] kindly submitted by M.A.Von Tersch, 19-JUN-1990. Author address: M.A.Von Tersch Ecogen Inc. 2005 Cabot Blvd. 
West Langhorne PA 19047 FEATURES from to/span description pept 239 3772 delta-endotoxin binding 228 232 ribosome binding site BASE COUNT 1392 a 724 c 909 g 1295 t ORIGIN 1 gttaacggaa tacaaaccct taatgcattg gttaaacatt gtaaagtcta aagcatggat 61 aatgggcgag aagtaagtag attgttaaca ccctgggtca aaaattgata tttagtaaaa 121 ttagttgcac tttgtgcatt ttttcataag atgagtcata tgttttaaat tgtagtaatg 181 aaaaacagta ttatatcata atgaattggt atcttaataa aagagatgga ggtaacttat 241 ggataacaat ccgaacatca atgaatgcat tccttataat tgtttaagta accctgaagt 301 agaagtatta ggtggagaaa gaatagaaac tggttacacc ccaatcgata tttccttgtc 361 gctaacgcaa tttcttttga gtgaatttgt tcccggtgct ggatttgtgt taggactagt 421 tgatataata tggggaattt ttggtccctc tcaatgggac gcatttcttg tacaaattga 481 acagttaatt aaccaaagaa tagaagaatt cgctaggaac caagccattt ctagattaga 541 aggactaagc aatctttatc aaatttacgc agaatctttt agagagtggg aagcagatcc 601 tactaatcca gcattaagag aagagatgcg tattcaattc aatgacatga acagtgccct 661 tacaaccgct attcctcttt tggcagttca aaattatcaa gttcctcttt tatcagtata 721 tgttcaagct gcaaatttac atttatcagt tttgagagat gtttcagtgt ttggacaaag 781 gtggggattt gatgccgcga ctatcaatag tcgttataat gatttaacta ggcttattgg 841 caactataca gatcatgctg tacgctggta caatacggga ttagaacgtg tatggggacc 901 ggattctaga gattgggtaa ggtataatca atttagaaga gaattaacac taactgtatt 961 agatatcgtt gctctgttcc cgaattatga tagtagaaga tatccaattc gaacagtttc 1021 ccaattaaca agagaaattt atacaaaccc agtattagaa aattttgatg gtagttttcg 1081 aggctcggct cagggcatag aaagaagtat taggagtcca catttgatgg atatacttaa 1141 cagtataacc atctatacgg atgctcatag gggttattat tattggtcag ggcatcaaat 1201 aatggcttct cctgtcggtt tttcggggcc agaattcacg tttccgctat atggaaccat 1261 gggaaatgca gctccacaac aacgtattgt tgctcaacta ggtcagggcg tgtatagaac 1321 attatcctct actttttata gaagaccttt taatataggg ataaataatc aacaactatc 1381 tgttcttgac gggacagaat ttgcttatgg aacctcctca aatttgccat ccgctgtata 1441 cagaaaaagc ggaacggtag attcgctgga tgaaatacca ccacagaata acaacgtgcc 1501 acctaggcaa ggatttagtc atcgattaag ccatgtttca atgtttcgtt caggctctag 
1561 tagtagtgta agtataataa gagctcctat gttctcttgg atacatcgta gtgctgaatt 1621 taataatata attgcatcgg atagtattac tcaaatccct gcagtgaagg gaaactttct 1681 ttttaatggt tctgtaattt caggaccagg atttactggt ggggacttag ttagattaaa 1741 tagtagtgga aataacattc agaatagagg gtatattgaa gttccaattc acttcccatc 1801 gacatctacc agatatcgag ttcgtgtacg gtatgcttct gtaaccccga ttcacctcaa 1861 cgttaattgg ggtaattcat ccattttttc caatacagta ccagctacag ctacgtcatt 1921 agataatcta caatcaagtg attttggtta ttttgaaagt gccaatgctt ttacatcttc 1981 attaggtaat atagtaggtg ttagaaattt tagtgggact gcaggagtga taatagacag 2041 atttgaattt attccagtta ctgcaacact cgaggctgaa tataatctgg aaagagcgca 2101 gaaggcggtg aatgcgctgt ttacgtctac aaaccaacta gggctaaaaa caaatgtaac 2161 ggattatcat attgatcaag tgtccaattt agttacgtgt ttatcggatg aattttgtct 2221 ggatgaaaag cgagaattgt ccgagaaagt caaacatgcg aagcgactca gtgatgaacg 2281 caatttactc caagattcaa atttcaaaga cattaatagg caaccagaac gtgggtgggg 2341 cggaagtaca gggattacca tccaaggagg ggatgacgta tttaaagaaa attacgtcac 2401 actatcaggt acctttgatg agtgctatcc aacatatttg tatcaaaaaa tcgatgaatc 2461 aaaattaaaa gcctttaccc gttatcaatt aagagggtat atcgaagata gtcaagactt 2521 agaaatctat ttaattcgct acaatgcaaa acatgaaaca gtaaatgtgc caggtacggg 2581 ttccttatgg ccgctttcag cccaaagtcc aatcggaaag tgtggagagc cgaatcgatt 2641 cgcgccacac cttgaatgga atcctgactt agattgttcg tgtagggatg gagaaaagtg 2701 tgcccatcat tcgcatcatt tctccttaga cattgatgta ggatgtacag acttaaatga 2761 ggacctaggt gtatgggtga tctttaagat taagacgcaa gatgggcacg caagactagg 2821 gaatctagag tttctcgaag agaaaccatt agtaggagaa gcgctagctc gtgtgaaaag 2881 agcggagaaa aaatggagag acaaacgtga aaaattggaa tgggaaacaa atatcgttta 2941 taaagaggca aaagaatctg tagatgcttt atttgtaaac tctcaatatg atcaattaca 3001 agcggatacg aatattgcca tgattcatgc ggcagataaa cgtgttcata gcattcgaga 3061 agcttatctg cctgagctgt ctgtgattcc gggtgtcaat gcggctattt ttgaagaatt 3121 agaagggcgt attttcactg cattctccct atatgatgcg agaaatgtca ttaaaaatgg 3181 tgattttaat aatggcttat cctgctggaa cgtgaaaggg catgtagatg tagaagaaca 3241 
aaacaaccaa cgttcggtcc ttgttgttcc ggaatgggaa gcagaagtgt cacaagaagt 3301 tcgtgtctgt ccgggtcgtg gctatatcct tcgtgtcaca gcgtacaagg agggatatgg 3361 agaaggttgc gtaaccattc atgagatcga gaacaataca gacgaactga agtttagcaa 3421 ctgcgtagaa gaggaaatct atccaaataa cacggtaacg tgtaatgatt atactgtaaa 3481 tcaagaagaa tacggaggtg cgtacacttc tcgtaatcga ggatataacg aagctccttc 3541 cgtaccagct gattatgcgt cagtctatga agaaaaatcg tatacagatg gacgaagaga 3601 gaatccttgt gaatttaaca gagggtatag ggattacacg ccactaccag ttggttatgt 3661 gacaaaagaa ttagaatact tcccagaaac cgataaggta tggattgaga ttggagaaac 3721 ggaaggaaca tttatcgtgg acagcgtgga attactcctt atggaggaat agtctcatgc 3781 aaactcaggt ttaaatatcg ttttcaaatc aattgtccaa gagcagcatt acaaatagat 3841 aagtaatttg ttgtaatgaa aaacggacat cacctccatt gaaacggagt gatgtccgtt 3901 ttactatgtt attttctagt aatacatatg tatagagcaa cttaatcaag cagagatatt 3961 ttcacctatc gatgaaaata tctctgcttt ttcttttttt atttggtata tgctttactt 4021 gtaatcgaaa ataaagcact aatagggtgt ttttgcccat cccttcggga aatcaagact 4081 aaaatgaaaa ataaacagaa aatataaggc tcttactttg tggatatgac cacaaagtaa 4141 gagccttatt tcattaaatt tgttcataca tttttccttg tagtcttttg ttttcatcct 4201 ttaatcgcct attctcgtac tctacttcct tgattcgatc ccgtaataat tgaatcattg 4261 catctttatt ttcatcactc attttccgtt tttcgaattt tggagataca gctcgttgct // LOCUS HUMHBLOD 3373 bp ss-mRNA PRI 26-JUL-1990 DEFINITION Human GDP-L-fucose:beta-D-galactoside 2-alpha-l-fucosyltransferase mRNA, complete cds. ACCESSION M35531 KEYWORDS GDP-L-fucose:beta-D-galactoside 2-alpha-l-fucosyltransferase. SOURCE Human epidermal carcinoma cell line A431, cDNA to mRNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 3373) AUTHORS Larsen,R.D., Ernst,L.K., Nair,R.P. and Lowe,J.B. TITLE Molecular cloning, sequence and expression of a human GDP-L-fucose: Beta-D-galactoside 2-alpha-l-fucosyltransferase cDNA that can be from the H blood group antigen JOURNAL Proc. Natl. 
Acad. Sci. U.S.A. (1990) In press STANDARD full staff_review staff_entry COMMENT Draft entry and computer-readable [or printed] sequence for [1] kindly submitted by J.B.Lowe, 22-JUN-1990. FEATURES from to/span description pept 104 1201 GDP-L-fucose:beta-D-galactoside 2-alpha-L-fucosyltransferase site 1744 2385 Alu sequence homologue BASE COUNT 687 a 925 c 905 g 856 t ORIGIN 1 gcctggcgtt ccaggggcgg ccggatgtgg cctgcctttg cggagggtgc gctccggcca 61 cgaaaagcgg actgtggatc tgccacctgc aagcagctcg gccatgtggc tccggagcca 121 tcgtcagctc tgcctggcct tcctgctagt ctgtgtcctc tctgtaatct tcttcctcca 181 tatccatcaa gacagctttc cacatggcct aggcctgtcg atcctgtgtc cagaccgccg 241 cctggtgaca cccccagtgg ccatcttctg cctgccgggt actgcgatgg gccccaacgc 301 ctcctcttcc tgtccccagc accctgcttc cctctccggc acctggactg tctaccccaa 361 tggccggttt ggtaatcaga tgggacagta tgccacgctg ctggctctgg cccagctcaa 421 cggccgccgg gcctttatcc tgcctgccat gcatgccgcc ctggccccgg tattccgcat 481 caccctgccc gtgctggccc cagaagtgga cagccgcacg ccgtggcggg agctgcagct 541 tcacgactgg atgtcggagg agtacgcgga cttgagagat cctttcctga agctctctgg 601 cttcccctgc tcttggactt tcttccacca tctccgggaa cagatccgca gagagttcac 661 cctgcacgac caccttcggg aagaggcgca gagtgtgctg ggtcagctcc gcctgggccg 721 cacaggggac cgcccgcgca cctttgtcgg cgtccacgtg cgccgtgggg actatctgca 781 ggttatgcct cagcgctgga agggtgtggt gggcgacagc gcctacctcc ggcaggccat 841 ggactggttc cgggcacggc acgaagcccc cgttttcgtg gtcaccagca acggcatgga 901 gtggtgtaaa gaaaacatcg acacctccca gggcgatgtg acgtttgctg gcgatggaca 961 ggaggctaca ccgtggaaag actttgccct gctcacacag tgcaaccaca ccattatgac 1021 cattggcacc ttcggcttct gggctgccta cctggctggc ggagacactg tctacctggc 1081 caacttcacc ctgccagact ctgagttcct gaagatcttt aagccggagg cggccttcct 1141 gcccgagtgg gtgggcatta atgcagactt gtctccactc tggacattgg ctaagccttg 1201 agagccaggg agactttctg aagtagcctg atctttctag agccagcagt acgtggcttc 1261 agaggcctgg catcttctgg agaagcttgt ggtgttcctg aagcaaatgg gtgcccgtat 1321 ccagagtgat tctagttggg agagttggag agaaggggga cgtttctgga actgtctgaa 1381 
tattctagaa ctagcaaaac atcttttcct gatggctggc aggcagttct agaagccaca 1441 gtgcccacct gctcttccca gcccatatct acagtacttc cagatggctg cccccaggaa 1501 tggggaactc tccctctggt ctactctaga agaggggtta cttctcccct gggtcctcca 1561 aagactgaag gagcatatga ttgctccaga gcaagcattc accaagtccc cttctgtgtt 1621 tctggagtga ttctagaggg agacttgttc tagagaggac caggtttgat gcctgtgaag 1681 aaccctgcag ggcccttatg gacaggatgg ggttctggaa atccagataa ctaaggtgaa 1741 gaatcttttt agtttttttt tttttttttt ggagacaggg tctcgctctg ttgcccaggc 1801 tggagtgcag tggcgtgatc ttggctcact gcaacttccg cctcctgtgt tcaagcgatt 1861 ctcctgtctc agcctcctga gtagatggga ctacaggcac aggccattat gcctggctaa 1921 tttttgtatt tttagtagag acagggtttc accatgttgg ccgggatggt ctcgatctcc 1981 tgaccttgtc atccacctgt cttggcctcc caaagtgctg ggattactgg catgagccac 2041 tgtgcccagc ccggatattt ttttttaatt atttatttat ttatttattt attgagacgg 2101 agtcttgctc tgtagcccag gccagagtgc agtggcgcga tctcagctca ctgcaagctc 2161 tgcctcccgg gttcatgcca ttctgcctca gcctcctgag tagctgggac tacaggcgcc 2221 cgccaccacg cccggctaat tttttttgta tttttagtag agacggggtt tcatcgtgtt 2281 aaccaggatg gtctcgatct cctgacctcg tgatctgccc acctcggcct cccacagtgc 2341 tgggattacc ggcgtgagcc accatgcctg gcccggataa ttttttttaa tttttgtaga 2401 gacgaggtct tgtgatattg cccaggctgt tcttcaactc ctgggctcaa gcagtcctcc 2461 caccttggcc tcccagaatg ctgggtttat agatgtgagc cagcacaccg ggccaagtga 2521 agaatctaat gaatgtgcaa cctaattgta gcatctaatg aatgttccac cattgctgga 2581 aaaattgaga tggaaaacaa accatctcta gttggccagc gtcttgctct gttcacagtc 2641 tctggaaaag ctggggtagt tggtgagcag agcgggactc tgtccaacaa gccccacagc 2701 ccctcaaaga cttttttttg tttgttttga gcagacaggc taaaatgtga acgtggggtg 2761 agggatcact gccaaaatgg tacagcttct ggagcagaac tttccaggga tccagggaca 2821 ctttttttta aagctcataa actgccaaga gctccatata ttgggtgtga gttcaggttg 2881 cctctcacaa tgaaggaagt tggtctttgt ctgcaggtgg gctgctgagg gtctgggatc 2941 tgttttctgg aagtgtgcag gtataaacac accctctgtg cttgtgacaa actggcaggt 3001 accgtgctca ttgctaacca ctgtctgtcc ctgaactccc agaaccacta catctggctt 3061 tgggcaggtc 
tgagataaaa cgatctaaag gtaggcagac cctggaccca gcctcagatc 3121 caggcaggag cacgaggtct ggccaaggtg gacggggttg tcgagatctc aggagcccct 3181 tgctgttttt tggagggtga aagaagaaac cttaaacata gtcagctctg atcacatccc 3241 ctgtctactc atccagaccc catgcctgta ggcttatcag ggagttacag ttacaattgt 3301 tacagtactg ttcccaactc agctgccacg ggtgagagag caggaggtat gaattaaaag 3361 tctacagcac taa // LOCUS MUSCRABPA 868 bp ss-mRNA ROD 26-JUL-1990 DEFINITION Mouse cellular retinoic acid-binding protein (CRABP-II) mRNA, complete cds. ACCESSION M35523 KEYWORDS cellular retinoic acid-binding protein. SOURCE Mouse 12.5 day old embryo, cDNA to mRNA, clone lambda-mE2.1. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 868) AUTHORS Giguere,V., Lyn,S., Yip,P., Siu,C.-H. and Amin,S. TITLE Molecular cloning of a novel cellular retinoic acid-binding protein expressed during mouse embryogenesis and in adult skin JOURNAL Proc. Natl. Acad. Sci. U.S.A. (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable [or printed] sequence for [1] kindly submitted by V.Giguere, 22-JUN-1990. FEATURES from to/span description pept 116 532 cellular retinoic acid-binding protein BASE COUNT 226 a 218 c 220 g 204 t ORIGIN Chromosome 2. 
1 gaattccggg gaggatctgt tctgcaaagg agacagcaaa gtatctttag cctaaaggac 61 tcagcgtcca gtgttctagt tgaagatcta aagagaaagc caccttgctg ccactatgcc 121 taacttttct ggcaactgga agatcatccg atcggaaaac tttgaggaaa tgctaaaagc 181 tctgggggtg aacatgatga tgaggaagat cgctgtggct gcagcctcca agccagcagt 241 cgagatcaaa caggagaatg acactttcta catcaaaacc tccaccactg tgcgaaccac 301 ggagattaac ttcaagatcg gggaggaatt tgaggagcag accgtggatg ggagaccctg 361 taagagtttg gtgaaatggg agagtggaaa caaaatggtg tgcgagcaga ggcttctgaa 421 gggggagggc cccaagacct cctggagccg agaactgacc aatgatggag agctgatcct 481 gacaatgaca gcagatgacg ttgtgtgcac cagggtctac gtccgagagt gagtgcctac 541 gggtccaaga actgcctgag acgacttctg tgcccgctac aggacacaaa cctccctccc 601 acgtccatct tacaaactag ctctcccctt actcctgagg gttactgctt cctccaaggc 661 cttttgttct ttgccttctc tacgccagag aggggcagaa gctcagaacc ctcccaccgc 721 catttgcccc tcccaggtca gcagtcccag ctccatacca gggtccttcc tggaagagac 781 tgtctctctg gcctctactc cttatccttg tagtctgtgt gatttagaat atttattggt 841 taattttatt aaaatgtttc cggaattc // LOCUS YSCGLN3 3021 bp ds-DNA PLN 26-JUL-1990 DEFINITION S.cerevisiae nitrogen regulatory protein (GLN3) gene, complete cds. ACCESSION M35267 KEYWORDS nitrogen regulatory protein. SOURCE S.cerevisiae (strain S288C) DNA. ORGANISM Saccharomyces cerevisiae Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes; Endomycetales; Saccharomycetaceae. REFERENCE 1 (bases 1 to 3021) AUTHORS Minehart,P.L. and Magasanik,B. TITLE Sequence and expression of GLN3, a positive nitrogen regulatory gene JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable [or printed] sequence for [1] kindly submitted by P.Minehart, 19-JUN-1990. Author address: P.Minehart MIT, 56-428 MIT 77 Mass Ave. Cambridge, MA 02139 FEATURES from to/span description pept 730 2922 GLN3 protein signal 509 514 TATA box site 1 140 acidic activation region BASE COUNT 1020 a 704 c 530 g 767 t ORIGIN Chromosome VL, map position 43cm distal to GCN4. 
1 gacgtcaact ccatagaagt gacttttccg ccaaagaaga ggacctcgcc ataagcaatg 61 agaatgatcg tcagattctt gaaaattgtg tagatgggca cggcaaggta ttgtaagctc 121 tttgacgacg tataaatcat caatacgagc agcaaagaaa ttggaaacca gttttttaca 181 tctgtcctgt tcaaagatca aaaattagca acgcctacaa ttcgtaggat acatagcgtc 241 acagtgcaca ccagtgattg tacaaacaac atcacaaagt tcatgttaaa gttgtccagg 301 ttaaccacga atttgttcgt tactgtcatc aaaatcgagg acgcgcagta agataagatt 361 gaagccggcc cagagttggc cactgattcc gtccattcat gcttatgctt gctcataatt 421 accacacctt cttgatctct ttacagcttt tcaaccttcc attcttgtac tctatctcta 481 cctggccctt taaacattct taatatgata tattcacatt ttttgctcta ttacccggcg 541 gacaggttcc cgaaagaaag tgacatggca atgctgagag agtggaaaga gtcatcttgc 601 aagacagaga aagatgttca agagtggtaa gctaatgtca gcgcagtagc ccatcccaca 661 ataacagagt gtgtaagaaa gagagacgag agagagcaca gggccccctt ttcccccacc 721 aacaaacaaa tgcaagacga ccccgaaaat tcgaagctgt acgacctgct gaatagtcat 781 ctggacgtgc atggtcgaag taatgaagag ccgagacaaa ctggtgacag taggagccag 841 agtagtggca acaccggtga aaacgaggag gatatagcat ttgccagtgg attaaacggc 901 ggcacattcg actcaatgct ggaggcactg cccgatgatt tatattttac ggacttcgtg 961 tctcctttta cagcagctgc cacgaccagc gtgactacta agacggtcaa ggacaccaca 1021 ccagctacca atcatatgga tgatgatatt gcgatgtttg attcacttgc cacaactcag 1081 cccatcgaca tagccgcatc caaccaacaa aatggtgaaa ttgcacaact ttgggacttt 1141 aacgtggacc aattcaacat gacgcccagc aactcgagcg gttcagctac tattagtgct 1201 cctaacagct ttacttccga cataccgcaa tacaaccacg gttccctcgg caacagcgtc 1261 tccaaatcct cactgttccc gtataattcc agcacgtcca acagcaacat caaccagcca 1321 tctatcaata acaactcaaa tactaatgcg cagtcccacc attccttcaa catctacaaa 1381 ctacaaaaca acaactcatc ttcatccgct atgaacatta ccaataataa taatagcaac 1441 aatagtaata tccagcatcc ttttctgaag aagagcgatt cgataggatt atcttcatcc 1501 aacacaacaa attctgtaag aaaaaactca cttatcaagc caatgtcgtc cacgtccctg 1561 gccaatttca aaagagctgc ctcagtatct tccagtatat ccaatatgga accatcagga 1621 caaaataaaa aacctctgat acaatgtttc aattgtaaaa ctttcaagac accgctttgg 1681 aggagaagcc cagaggggaa 
tactctttgc aatgcctgcg gtcttttcca gaaattacat 1741 ggtaccatga ggccattatc cttaaaatcg gacgttatca aaaagaggat ttcaaagaag 1801 agagccaaac aaacggaccc aaacattgca caaaatactc caagtgcacc tgcaactgcc 1861 tcaacttcag taaccactac aaatgctaaa cccatacgat cgaggaaaaa atcactacaa 1921 caaaactctt tatctagagt gatacctgaa gaaatcatta gagacaacat cggtaatact 1981 aataatatcc ttaatgtaaa taggggaggc tataacttca actcagtccc ctccccggtc 2041 ctcatgaaca gccaatcgta taatagtagt aacgcaaatt ttaatggagc aagcaatgca 2101 aatttgaatt ctaataactt aatgcgtcac aattcgaaca ctgttactgg taattttaga 2161 aggtcttcaa gacgaagtag tacttcatcg aacacctcaa gttccagtaa atcttcatcc 2221 agatctgttg ttccgatatt accaaaacct tcacctaata gcgctaattc acagcagttc 2281 aacatgaaca tgaacctaat gaacacaaca aataatgtaa gtgcaggaaa tagtgtcgca 2341 tcctcaccaa gaattatatc gtccgcaaac tttaactcaa atagtcctct acagcagaat 2401 ctattatcaa attctttcca acgtcaagga atgaatatac caagaagaaa gatgtcgcgc 2461 aatgcatcgt actcctcatc gtttatggct gcgtctttgc aacaactgca cgaacagcaa 2521 caagtggacg tgaattccaa cacaaacacg aattcgaata gacagaattg gaattcaagc 2581 aatagcgttt caacaaattc aagatcatca aattttgtct ctcaaaagcc aaattttgat 2641 atttttaata ctcctgtaga ttcaccgagt gtctcaagac cttcttcaag aaaatcacat 2701 acctcattgt tatcacaaca attgcagaac tcggagtcga attcgtttat ctcaaatcac 2761 aaatttaaca atagattatc aagtgactct acttcaccta taaaatatga agcagatgtg 2821 agtgcaggcg gaaagatcag tgaggataat tccacaaaag gatcttctaa agaaagttca 2881 gcaattgctg acgaattgga ttggttaaaa tttggtatat gaccgcgtat tatcattatc 2941 attattctta ttatgttaat aattactgaa cggttgcatt gatagatttt cattacctct 3001 gaccacaatc ctgagcattg g // LOCUS BLYHISH3PA 505 bp ss-mRNA PLN 26-JUL-1990 DEFINITION Barley histone H3 mRNA, 3' end. ACCESSION M34928 KEYWORDS histone H3 protein. SOURCE Barley (strain Nudinka) seed scutella 2 days after germination, cDNA to mRNA. ORGANISM Hordeum vulgare Eukaryota; Plantae; Embryobionta; Magnoliophyta; Liliopsida; Commelinidae; Cyperales; Poaceae. REFERENCE 1 (bases 1 to 505) AUTHORS Chojecki,J. 
TITLE Identification and characterization of a cDNA clone for histone H3 in barley JOURNAL Carlsberg Res. Commun. 51, 211-217 (1986) STANDARD simple staff_entry FEATURES from to/span description pept < 1 243 histone H3 protein mRNA < 1 505 histone H3 mRNA BASE COUNT 95 a 138 c 146 g 126 t ORIGIN 1 aagagcaccg agctgctgat ccgcaagctc ccgttccagc gcctggtgag ggagatcgcg 61 caggacttca agaccgacct caggttccag tcccacgccg tgctggccct ccaggaggcc 121 gccgaggcgt acctcgtcgg gctgttcgag gacaccaacc tgtgcgccat ccacgccaag 181 cgcgtcacca tcatgcccaa ggacatccag ctcgcccgcc gcatccgcgg ggagcgcgcc 241 taagccaccc agagcgctgc attcgggagc gatgacaccg ttcgccagca ttagtgtagt 301 tgattggctt tccttgtcca gatatgcgtc ttgtggttcg ttgtagaaac cctggttggt 361 tggttcccgt agttacagag acttttctgc ttaagtggtt ttggtttgcg gtgttgcaaa 421 ccgatgctta ctgtgatgca aattgttggt taatgtagtg ttgattgaca attatcgatg 481 gatgaacttg tggtgttgcg tagtt // LOCUS BMOFIBA 324 bp ss-mRNA INV 26-JUL-1990 DEFINITION B.mori silk fibroin mRNA, partial cds. ACCESSION M35378 KEYWORDS fibroin. SOURCE B.mori (Kinryu x Showa) posterior silk gland, cDNA to mRNA. ORGANISM Bombyx mori Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta; Pterygota; Neoptera; Holometabola; Lepidoptera; Ditrysia; Bombycoidea; Bombycidae. REFERENCE 1 (bases 1 to 324) AUTHORS Mita,K., Ichimura,S., Zama,M. and James,T.C. TITLE Specific codon usage pattern and its implications on the secondary structure of silk fibroin mRNA JOURNAL J. Mol. Biol. 
203, 917-925 (1988) STANDARD simple staff_entry FEATURES from to/span description pept < 1 > 324 silk fibroin (AA at 1) BASE COUNT 35 a 60 c 144 g 85 t ORIGIN 1 ggatacggag caggagctgg aagcggagct gcctctggtg ccggtgccgg ttcaggtgct 61 ggtgctggtt caggagctgg tgctggttca ggtgctggtg ctggttcagg tgctggtgct 121 ggttcaggtg ctggtgctgg ttcaggagct ggtgctggtt caggtgctgg tgctggttca 181 ggagctggtg ctggatacgg agcaggagct ggcgttggat acggagcagg agctgggagc 241 ggagctgcct ctggtgctgg tgctggttca ggtgctggtg ctggttcagg tgctggtgct 301 ggttcaggtg ctggtgctgg ttca // LOCUS DROMETA 338 bp ss-mRNA INV 26-JUL-1990 DEFINITION D.melanogaster metallothionein (MT) mRNA, complete cds. ACCESSION M35390 KEYWORDS metallothionein. SOURCE D.melanogaster larva, cDNA to mRNA. ORGANISM Drosophila melanogaster Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta; Pterygota; Neoptera; Holometabola; Diptera; Brachycera; Cyclorrhapha; Schizophora; Drosophiloidea; Drosophilidae. REFERENCE 1 (bases 1 to 338) AUTHORS Maroni,G., Lastowski-Perry,D., Otto,E. and Watson,D. TITLE Effects of heavy metals on Drosophila larvae and a metallothionein cDNA JOURNAL Environ. Health Perspect. 65, 107-116 (1986) STANDARD simple staff_entry FEATURES from to/span description pept 124 246 metallothionein mRNA < 1 338 metallothionein mRNA signal 308 313 polyA signal BASE COUNT 101 a 88 c 77 g 72 t ORIGIN 1 gatcagttgt ggtcagcagc aaaatcaagt gaatcatctc agtgcaacta aaggcctaaa 61 tagcccatac ctaccttttt tgtaaacaag tgaacaagtt cgaggaaata caactcaatc 121 aagatgcctt gcccatgcgg aagcggatgc aaatgcgcca gccaggccac caagggatcc 181 tgcaactgcg gatctgactg caagtgcggc ggcgacaaga aatccgcctg cggctgctcc 241 gagtgagctt tcccccaaaa aagatctgga gtagaggcgc tgcatcttgt ctctctacac 301 accctgcaat aaatgtccaa ttaaagtaat tgatgcct // LOCUS HUMVPREBA 503 bp ds-DNA PRI 26-JUL-1990 DEFINITION Human pre-B lymphocyte VpreB gene, 5' end. ACCESSION M34927 KEYWORDS . SOURCE Human myeloid cell line U937 DNA, clone pHVPB-6. 
ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 503) AUTHORS Bauer,S.R., Kudo,A. and Melchers,F. TITLE Structure and pre-B lymphocyte restricted expression of the VpreB gene in humans and conservation of its structure in other mammalian species JOURNAL EMBO J. 7, 111-116 (1988) STANDARD simple staff_entry FEATURES from to/span description pept 1 46 VpreB protein precursor, exon 1 133 > 503 VpreB protein precursor, exon 2 sigp 1 46 VpreB protein signal peptide 133 143 VpreB protein signal peptide matp 144 > 503 VpreB protein IVS 47 132 VpreB intron A BASE COUNT 104 a 160 c 140 g 99 t ORIGIN 1 atgtcctggg ctcctgtcct gctcatgcac tttgtctact gcacaggtga gggaaccccc 61 agatcccaaa gactcctgcc ccttccttca tcctgccctg cccccacggg ccacatgcat 121 ctgtgtcacc aggttgtggt cctcagccgg tgctacatca gccgccggcc atgtcctcgg 181 cccttggaac cacaatccgc ctcacctgca ccctgaggaa cgaccatgac atcggtgtgt 241 acagcgtcta ctggtaccag cagaggccgg gccaccctcc caggttcctg ctgagatatt 301 tctcacaatc agacaagagc cagggccccc aggtcccccc tcgcttctct ggatccaaag 361 atgtggccag gaacaggggg tatttgagca tctctgagct gcagcctgag gacgaggcta 421 tgtattactg tgctatgggg gcccgcagct cggagaagga ggagagggag agggagtggg 481 aggaagaaat ggaacccact gca // LOCUS MUSNGF 1176 bp ss-mRNA ROD 26-JUL-1990 DEFINITION Mouse nerve growth factor (NGF) precursor mRNA, complete cds. ACCESSION M35075 J00608 KEYWORDS nerve growth factor. SOURCE Mouse male submaxillary gland, cDNA to mRNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 1176) AUTHORS Scott,J., Selby,M., Urdea,M., Quiroga,M., Bell,G.I. and Rutter,W.J. 
TITLE Isolation and nucleotide sequence of a cDNA encoding the precursor of mouse nerve growth factor JOURNAL Nature 302, 538-540 (1983) STANDARD simple staff_review REFERENCE 2 (bases 3 to 226) AUTHORS Edwards,R.H., Selby,M.J. and Rutter,W.J. TITLE Differential RNA splicing predicts two distinct nerve growth factor precursors JOURNAL Nature 319, 784-787 (1986) STANDARD simple staff_entry FEATURES from to/span description pept 96 1019 nerve growth factor precursor sigp 96 656 nerve growth factor signal peptide matp 657 1010 nerve growth factor mRNA 1 1176 NGF mRNA BASE COUNT 283 a 330 c 295 g 268 t ORIGIN 1 gagcgcctgg agccggaggg gagcgcatcg agtgactttg gagctggcct tatatttgga 61 tctcccgggc agctttttgg aaactcctag tgaacatgct gtgcctcaag ccagtgaaat 121 taggctccct ggaggtggga cacgggcagc atggtggagt tttggcctgt ggtcgtgcag 181 tccagggggc tggatggcat gctggaccca agctcacctc agtgtctggg cccaataaag 241 gttttgccaa ggacgcagct ttctatactg gccgcagtga ggtgcatagc gtaatgtcca 301 tgttgttcta cactctgatc actgcgtttt tgatcggcgt acaggcagaa ccgtacacag 361 atagcaatgt cccagaagga gactctgtcc ctgaagccca ctggactaaa cttcagcatt 421 cccttgacac agccctccgc agagcccgca gtgcccctac tgcaccaata gctgcccgag 481 tgacagggca gacccgcaac atcactgtag accccagact gtttaagaaa cggagactcc 541 actcaccccg tgtgctgttc agcacccagc ctccacccac ctcttcagac actctggatc 601 tagacttcca ggcccatggt acaatccctt tcaacaggac tcaccggagc aagcgctcat 661 ccacccaccc agtcttccac atgggggagt tctcagtgtg tgacagtgtc agtgtgtggg 721 ttggagataa gaccacagcc acagacatca agggcaagga ggtgacagtg ctggccgagg 781 tgaacattaa caacagtgta ttcagacagt acttttttga gaccaagtgc cgagcctcca 841 atcctgttga gagtgggtgc cggggcatcg actccaaaca ctggaactca tactgcacca 901 cgactcacac cttcgtcaag gcgttgacaa cagatgagaa gcaggctgcc tggaggttca 961 tccggataga cacagcctgt gtgtgtgtgc tcagcaggaa ggctacaaga agaggctgac 1021 ttgcctgcag cccccttccc cacctgcccc ctccacactc tcttgggccc ctccctacct 1081 cagcctgtaa attattttaa attataagga ctgcatgata atttatcgtt tatacaattt 1141 taaagacatt atttattaaa ttttcaaagc atcctg // LOCUS 
RATXDHA 4162 bp ss-mRNA ROD 26-JUL-1990 DEFINITION Rat xanthine dehydrogenase mRNA, complete cds. ACCESSION J05579 KEYWORDS xanthine dehydrogenase. SOURCE Rat (strain Wistar) liver, cDNA to mRNA, clones lambda-RXD[7,32,42,51]. ORGANISM Rattus norvegicus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 4162) AUTHORS Amaya,Y., Yamazaki,K.-i., Sato,M., Noda,K., Nishino,T. and Nishino,T. TITLE Proteolytic conversion of xanthine dehydrogenase from the NAD-dependent type to the oxygen-dependent type: Amino acid sequence of rat liver xanthine dehydrogenase and identification of the cleavage sites of the enzyme protein during irreversible conversion by trypsin JOURNAL J. Biol. Chem. (1990) In press STANDARD full staff_entry COMMENT Draft entry and printed sequence for [1] kindly submitted by Y.Amaya, 22-JUN-1990. FEATURES from to/span description pept 27 3986 xanthine dehydrogenase (EC 1.1.1.204) signal 4125 4130 polyA signal (put.) signal 4146 4151 polyA signal (put.) 
BASE COUNT 1055 a 1046 c 1121 g 940 t ORIGIN 1 agagctcagt gactccagca gccacgatga ctgcggatga gttggtcttc tttgtgaatg 61 gcaaaaaggt ggtggagaaa aatgcggacc ctgaaacaac acttctggtc tacctgagaa 121 gaaagttggg gctatgtggg accaagcttg gctgtggaga aggtggctgt ggggcatgca 181 ccgtgatgat ctccaagtat gaccgtcttc agaacaagat tgttcatttt tctgtcaatg 241 cctgcttggc tcccatctgc tccttgcacc atgttgctgt gaccaccgtg gaaggcatag 301 gaaacaccca gaagctgcat cctgtacagg agagaattgc cagaagccat ggttcccagt 361 gtgggttctg cactcctggc attgtcatga gtatgtacac actgctccgg aaccagcctg 421 agcctactgt tgaggagatc gagaatgcct tccaaggaaa cctctgtcgc tgtacaggct 481 acagacccat cctccaggga ttccggacct ttgccaagga tggtgggtgc tgtggaggga 541 gtggaaacaa cccaaactgc tgtatgaacc agacgaaaga ccaaacggtt tctctctcac 601 cttctttatt caacccagag gatttcaaac ctttagatcc cacgcaagag cccatcttcc 661 ccccagagtt gctgaggctg aaagacactc cccagaagaa gctgcgtttt gaaggggaac 721 gtgtgacctg gatccaggct tcaactatgg aggagctgct tgacctgaaa gctcagcacc 781 ctgatgccaa gctggtggtg ggaaacacag agataggcat tgaaatgaaa tttaagaata 841 tgctatttcc tctgatcgtc tgcccagcct ggatccctga actgaattca gtggtgcatg 901 ggcctgaggg aatctccttc ggagcttctt gcccccttag cttggtggaa agtgtcctgg 961 cggaggagat tgctaaactt ccagagcaaa agacagaggt gttcagaggc gtgatggagc 1021 agctgcgctg gtttgccggc aagcaggtca agtccgtggc gtccatcgga gggaacatca 1081 tcactgccag ccccatctct gacctcaacc ctgtgttcat ggccagtgga gccaagctga 1141 ctctggtgtc tagaggtacc aggagaactg ttcggatgga tcataccttc ttccctggct 1201 acagaaagac tctgctcaga ccagaggaga tattgctgtc catcgagatc ccctatagca 1261 aggagggaga gtttttctca gccttcaagc aggcctccag gagggaagat gacattgcca 1321 aggtgactag tggcatgaga gtcctgttca aaccggggac cattgaagtg caggaactgt 1381 ccctttgctt cggagggatg gccgacagaa ctatctcagc cctcaagacc actccgaagc 1441 agctatcgaa gtcctggaat gaggagctgc agctggcccc cgatgcccct ggtggtatgg 1501 tggaattccg gcgcaccctc accctcagct tcttcttcaa gttctacctg acagtgctcc 1561 agaagctggg cagagcggac cttgaggata tgtgtggtaa actggacccc acctttgcca 1621 gtgccaccct gctctttcag aaggaccctc cagctaatgt 
ccagcttttc caagaggtgc 1681 caaaggatca gtctgaggag gacatggtgg gccggcccct gcctcacctg gcggcaaaca 1741 tgcaggcatc gggagaggcc gtgtactgtg atgacattcc ccgctatgag aatgagctct 1801 ctctcaggct ggtcaccagc acccgggcgc atgctaaaat cacgtccatc gacacttcag 1861 aagccaagaa ggtgccaggg tttgtttgct tcctcaccgc agaggatgtc cctaatagta 1921 atgcaaccgg ccttttcaat gatgaaactg tctttgcgaa ggatgaggtt acttgtgttg 1981 ggcacatcat tggtgctgtg gtcgctgaca ccccagaaca cgcacagaga gctgcgagag 2041 gggtgaaaat cacctatgaa gatcttccag ccattatcac aatccaggat gctataaaca 2101 acaactcctt ttatggctct gagataaaaa ttgagaaagg agatctcaag aaaggctttt 2161 cagaagctga caatgttgtc tcaggagagt tgtatatcgg tggccaggag cacttctacc 2221 tggagaccaa ctgcaccatt gccgtgccaa aaggcgaggc aggcgagatg gagctgttcg 2281 tgagcacaca gaacaccatg aaaacccaga gctttgttgc aaaaatgttg ggcgttccgg 2341 acaacagaat cgtagtccga gtgaagagga tgggtggagg ctttggaggg aaggagaccc 2401 ggagcactgt ggtgtccaca gcactggcct tggctgcaca caagactggc cggcccgtac 2461 gttgcatgtt ggaccgagat gaggacatgc tgataactgg tggcagacat cccttcctgg 2521 ctaaatacaa ggttggcttc atgaagactg ggactgtagt ggctctcgag gtggctcact 2581 tcagcaatgg tggtaacact gaggatctct ctcggagtat aatggaacga gctttgttcc 2641 acatggataa cgcctataag atccccaaca ttcgaggcac tgggaggatt tgcaagacta 2701 atctgccctc caacacagcc ttcagaggtt ttgggggtcc tcaggggatg ctaatcgcag 2761 aatactggat gagcgaggtc gccataacct gtgggctgcc tgcagaggag gtacggagga 2821 aaaacatgta caaagaaggg gacctgactc acttcaacca gaagctggag gggttcacct 2881 tgcccaggtg ctgggatgaa tgcatcgcca gctctcagta tcttgctcgc aagagggaag 2941 tggagaaatt caacagggag aattgttgga aaaagagagg gctgtgtata atcccaacta 3001 agtttggaat aagctttaca cttccttttc tgaaccaggg aggcgctctg gttcacgtgt 3061 acactgatgg ttcggtgctg ttgacccatg gagggactga gatgggccaa ggccttcaca 3121 ccaagatggt tcaggtggcc agcagagctc tgaaaatccc cacctccaag attcatataa 3181 gtgagacaag cactaacacc gtccccaaca cttctcccac agctgcctct gccagtgctg 3241 acctcaatgg acagggtgtt tatgaagcat gccagaccat actgaaaagg ctggaacctt 3301 tcaagaagaa gaaacccacc ggcccctggg aggcatgggt gatggacgcc 
tatacgagcg 3361 cagtgagttt gtccgcaact ggattttata agacacccaa ccttggctac agctttgaga 3421 caaactccgg aaatcccttc cactatttca gttatggggt ggcttgctct gaagtagaaa 3481 ttgactgctt aacaggggat cataagaatc tccgtacgga tatcgtcatg gatgttggtt 3541 ccagcttgaa tcctgccatt gatattggac aagtagaggg ggcatttgtc cagggccttg 3601 gtctcttcac tatggaggag ctgcactact cccctgaggg gagcctgcat actcgtggcc 3661 ccagtaccta caaaatccct gcatttggta gcatccccat tgagttcaga gtatccctac 3721 tccgggactg ccccaacaag agggccatct atgcatccaa ggctgttggg gagccacctc 3781 ttttcctggc ttcctctatc ttctttgcca tcaaagatgc cattcgtgca gctcgagctc 3841 agcacggaga taacgcaaaa caacttttcc agctagacag ccctgccact ccggagaaga 3901 tccgaaacgc ctgtgtggac cagttcacca ccctgtgtgt cactggagta ccagaaaact 3961 gtaaatcctg gtctgtgagg atctgaagag aaggtctcca ccattggttt gtaccgcacc 4021 aggattcctt ggagccacaa gcacatcctg tagtatccag atttccgcat gccgcgtggg 4081 actcagcagg atgacatttt caggaagatg gacattttga cccaaataag agctgcaaac 4141 aaaccaataa gcaaatgggg ag // LOCUS RICHISH2AA 321 bp ds-DNA PLN 26-JUL-1990 DEFINITION Rice histone H2A gene, 5' end. ACCESSION M35379 KEYWORDS histone. SOURCE Rice DNA, clone pIR22. ORGANISM Oryza sativa Eukaryota; Plantae; Embryobionta; Magnoliophyta; Liliopsida; Commelinidae; Cyperales; Poaceae. REFERENCE 1 (bases 1 to 321) AUTHORS Thomas,G. and Padayatty,J.D. 
TITLE Restriction map and partial sequence of a rice DNA fragment carrying histone genes H2A, H2B and H4 JOURNAL Indian J Biochem Biophys 21, 1-6 (1984) STANDARD simple staff_entry FEATURES from to/span description pept 260 > 321 histone H2A protein mRNA 186 > 321 histone H2A mRNA signal 36 40 CAAT box signal 74 77 GATCC motif signal 138 145 TATA box BASE COUNT 74 a 70 c 60 g 73 t 44 others ORIGIN 1 caaaggacnt gttcccgctg atgtgagcaa ttgtcacaat gccctcccaa acngttttca 61 gatngtngat gtggatcnnn antttnttgc gnntnnanac ctggctctcg ttttttcgca 121 angtcccgaa cnnnnngtat aaatagcgtg tggacccgta ncgtgagaac tcgtgatctn 181 atttcatctg gaacgactcn nggaatnttc cgaaaannnn nnnnnnnnng ccgaaagcct 241 tttggaactt ttcnnccaaa tgcacaccaa aggcctcngg aagnnttttc ancgcaaaaa 301 gatatcaccc gcagggatca c // LOCUS TEYMT14SRR 169 bp ds-DNA ORG 26-JUL-1990 DEFINITION T.pyriformis mitochondrial 14S rRNA. ACCESSION M35376 KEYWORDS 14S ribosomal RNA. SOURCE T.pyriformis (strain ST) linear mitochondrial DNA. ORGANISM Mitochondrion Tetrahymena pyriformis Eukaryota; Animalia; Metazoa; Ciliophora; Oligohymenophora; Hymenostomata; Hymenostomatida; Tetrahymenina; Tetrahymenidae; Tetrahymena pyriformis. REFERENCE 1 (bases 1 to 169) AUTHORS Suyama,Y., Fukuhara,H. and Sor,F. TITLE A fine restriction map of the linear mitochondrial DNA of Tetrahymena pyriformis: Genome size, map locations of rRNA and tRNA genes, terminal inversion repeat, and restriction site polymorphism JOURNAL Curr. Genet. 9, 479-493 (1985) STANDARD simple staff_entry FEATURES from to/span description rRNA < 1 > 169 14S rRNA site 31 144 conserved U5 region BASE COUNT 54 a 26 c 37 g 52 t ORIGIN 1 gaattcagaa tagctaacgc aaagtattct gcttggggag tattatcgca agattaaaac 61 ttaactgaat tggcgggaat ttgttcgaac ggtggaacat gtggtttaat gcgataatcc 121 acgcaaaatc ttaccaacgt tttaggcttt atctgataat atggttaac // LOCUS YSCPET122 2862 bp ds-DNA PLN 26-JUL-1990 DEFINITION Yeast PET122 encoded protein gene, complete cds. ACCESSION X07558 KEYWORDS PET122 encoded protein. 
SOURCE Yeast (S.cerevisiae, strain AB320) DNA. ORGANISM Saccharomyces cerevisiae Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes; Endomycetales; Saccharomycetaceae. REFERENCE 1 (bases 1 to 2862) AUTHORS Ohmen,J.D., Burke,K.A. and McEwen,J.E. TITLE Divergent overlapping transcripts at the PET122 locus in Saccharomyces cerevisiae JOURNAL Mol. Cell. Biol. 10, 3027-3035 (1990) STANDARD simple staff_entry REFERENCE 2 (bases 953 to 2862) AUTHORS Ohmen,J.D., Kloeckener-Gruissem,B. and McEwen,J.E. TITLE Molecular cloning and nucleotide sequence of the nuclear PET122 gene required for expression of the mitochondrial COX3 gene in S.cerevisiae JOURNAL Nucleic Acids Res. 16, 10783-10862 (1988) STANDARD simple staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by J.D.Ohmen, 11-JUN-1990. Draft entry and computer-readable sequence for [1] submitted to EMBL by J.D.Ohmen, 09-JUN-1989. EMBL features not translated to GenBank features: key from to description SITE 746 1096 similarity to E.coli alanyl tRNA-synthetase (AA 116-232) [1] Author address: McEwen J.E. 
Department of Microbiology College of Letters and Science University of California 405 Hilgard Avenue Los Angeles, CA 90024-1489 FEATURES from to/span description pept 1139 < 1 (c) ORF3 pept 1354 2118 PET122 protein pept 2171 > 2862 ORF2 mRNA 1144 < 1 (c) ORF3 mRNA mRNA 1152 < 1 (c) ORF3 mRNA mRNA 1157 < 1 (c) ORF3 mRNA mRNA 1159 < 1 (c) ORF3 mRNA mRNA 1870 < 1 (c) ORF3 mRNA mRNA 1875 < 1 (c) ORF3 mRNA mRNA 1882 < 1 (c) ORF3 mRNA mRNA 1883 < 1 (c) ORF3 mRNA mRNA 1887 < 1 (c) ORF3 mRNA mRNA 1895 < 1 (c) ORF3 mRNA mRNA 1907 < 1 (c) ORF3 mRNA mRNA 1343 > 2119 PET122 mRNA mRNA 1348 > 2119 PET122 mRNA mRNA 1354 > 2119 PET122 mRNA mRNA 2140 > 2862 ORF2 mRNA mRNA 2143 > 2862 ORF2 mRNA mRNA 2147 > 2862 ORF2 mRNA mRNA 2150 > 2862 ORF2 mRNA BASE COUNT 748 a 652 c 795 g 667 t ORIGIN 1 aagctttctt gtaacttctt ctcattatct tgcatcaatt gccttctttc cgcctgatct 61 cttgcctttt gaatgttatg ttttaatgat tggaagatgc ccatgttctc tgtgggggaa 121 gcgccagcga taggagtcct tggtttagct acttctgtta tcttcagttt cgaacgaacc 181 catttgtttc tcaaaatcat tgtctgtagg acggagaagg caccattaaa ggcaaagtag 241 aggaccacag cggacgataa gttcattgtg gccggtatag aaatgatcgg tagaatagtg 301 aaaagacgct tcatgggaga actgaattgt tgagcaccag tctcaccccc cagccttgta 361 aatgagatga acacagcggc agtgattact tgcaaaccta agtaagggtc tgcttgagtc 421 aagtctgtaa accaagcgac accttgatta gcgaacccat ctactgggta gttagccatg 481 tgtctcaatg cgttgaaaaa cccaagggcg attggaattt gtagcatggg tgcggccagc 541 catctgttct taatgccgtg cgaggagagc agttttttcc tttgcatggc gactagctga 601 ccttgttgca aatctgtagt ggacattagc ttattattca aggcgtccag ctcgggcttg 661 atatgggaat ttctagcaac agtatcagag gacttgacat agaggggaaa catcaggcat 721 cgaatgagga tggtggtggc cgcgatagtt ccccaccaag gcaacccaga gtaaacatga 781 acggcctcca agacgtgttg gataatgtcc gagggccagt accaggtttg ggccaggcca 841 atgctattta agtaccctat atgggaggac aactcgccca ctgtttgggt cgtgttagcg 901 ataaggtccg aagtagaagc ggaaagagaa ggagctgaag aggttaattc atcgatggaa 961 ggcaactggg tttggatttc cgagacatcg ttggcatttg ggcccgtcga attaaatctt 1021 ttggcctgaa aagagatcca 
tgacggatgg ggccggggca atactatggt tcgagcggtg 1081 gccagtctgg aagaggcagc aaaccttgac gtgacgagtc gagaggtgag tttgaacatc 1141 gtcggggagg ttattctgtg gctccgcttg tacgtgaaca gatacgtata gagggcgagc 1201 cactggttaa atttttcatg gctcggatta cttccgtact gctggctaaa atcgaaatct 1261 cggcctgctg agagtgtttt gagcaatcaa gggaacatct gaacgtggaa gagcagacga 1321 ggcattagct cgaacataag aacggaacac gtcatgttga ctatcacgaa aagactggtg 1381 accaccgatg tgcggtcgcg aatactgtta agcagtttaa acgggaaaat gtccgatgca 1441 ctggcgctgc tgcgtcagca gcagcagacc agcgtggatg tggagctgct gcacacgatg 1501 ctagcgcgag ccgctgcgct tgcccatgcc gacactatag catacatgtg gtatcagcat 1561 gtgatgccac gccggttgcc agtagagggc cgcctgctat gtgaaatggc tggcgtagca 1621 ttgtaccagg acaggctctt cttacccgcg cagttcctcc agcactacca ggcgatgaat 1681 cgcgatcgtc gcaccagccc agaagatgaa ctgattgagt atgagcttag acggattaaa 1741 gtcgaagcgt ttgcgcgtgg cacaatgcac tccacggcgc tcagggaaaa gtggaaggta 1801 ttcttgcagg agatggatac gctaccaggg cagccgccat taaggctgcg cgacttcccg 1861 caaatgacca aggctatggg catagcattg atgcagcaag atgagcaagc agctgccctg 1921 gcgttgtttg gacgacagcc cctagtgata aagaacgaat ggtcactacc gctactactg 1981 gctggtgtcc tttggcatgt tcccggccca gcgcaggcgc gacgtgtgct ggcggagttc 2041 cgtcaaagtt atcgcgggct gccgctgctg gatgccgaac tagtgataaa gagaagagga 2101 tttgaaatca acacataaat ctgggtggag catcgctgta acaaggaaca acgcgtgcta 2161 gcaagcggta atgaaataca aggaaatcaa tttcttcaag ggccatccga gctcgaggtt 2221 gctgcctcga gaagcagtaa ttcaagcgac tgcggctata ttggggcccg agaccaggga 2281 gtacgataac gacccctata acaggcatcc gctgacgtac ggttcggacg aaggtgccct 2341 gtgggtgcga gagcagattt gtacgtttct gaatgatcag ctgtttaagt tcgaaaatgg 2401 ggctcggagc aggacacggg cagactattt gaatctgaat agcggcgctt cgtatggcat 2461 gctgaacatc cttctgcaaa caaccttgcc acataacggg tataccaggc aggcgttcat 2521 catcacgcca acatatttct tgatcaacaa ttgcttcaca gatgcgggat tcaaggggaa 2581 aatgaccgcc atcaacgagc agggccacga ctcgattgat ttcgagtcgt tgatttctgc 2641 ccttgagcag cacgaggcgg agccgcagcc ccatagtacc acagagatga ttcaggggcc 2701 aaagttgacc aagaaggtct acaggtacgt 
tatgtactgc atcccgacgt ttgcaaaccc 2761 atcgggaaac acatactcgc ttgagaccag acgcagactt atcgacatcg ctcggaagta 2821 cgacatgctg ataatcactg atgacgtgta cgatattcta ga // LOCUS ECO987P 954 bp ds-DNA BCT 26-JUL-1990 DEFINITION E.coli fimbriae 987P subunit gene, complete cds. ACCESSION M35257 KEYWORDS fimbriae. SOURCE E.coli (strain K12) DNA. ORGANISM Escherichia coli Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Facultatively anaerobic rods; Enterobacteriaceae. REFERENCE 1 (bases 1 to 954) AUTHORS De Graaf,F.K. and Klaasen,P. TITLE Nucleotide sequence of the gene encoding the 987P fimbrial subunit of Escherichia coli JOURNAL FEMS Microbiol. Lett. 42, 253-258 (1987) STANDARD simple staff_review FEATURES from to/span description pept 259 843 fimbriae 987P subunit precursor sigp 259 328 fimbriae 987P subunit signal peptide matp 329 840 fimbriae 987P subunit BASE COUNT 309 a 170 c 180 g 295 t ORIGIN 1 aaatttagaa aagtgcatta tgcttatcac tagataagaa aataaaacac gaaatatagc 61 gagccatata gcctgttgtg tttgtaatag ataaaaaaca cgcaattgat tatttatgta 121 tctttttgtt tgtatttttt tattaaaaaa agcacacaat tactgcgtgc atcgaaatga 181 gttgaagtgg atgcatatat gcatgaaatg cttttaactt gaaagtctta atgtttctat 241 taattaagat aaggtaatat gagaatgaaa aaatccgcat taacattagc agtgctttcc 301 tctctgttca gtggttactc gctcgcagcg cccgctgaaa acaacaccag ccaggcaaat 361 ttagacttta ctggtaaagt tactgccagt ctatgccaag tggatacttc taatctgtcg 421 caaaccatag atcttggaga gttgtctact tctgctctta aagctactgg caaggggcct 481 gccaagtcat ttgcagttaa tcttatcaac tgcgatacaa cattgaattc tattaaatac 541 actattgctg gtaataataa tacaggaagt gatactaaat atttagttcc agcctccaat 601 gatactagtg catcaggagt tggcgtatac attcaggaca acaacgccca ggctgtggaa 661 attggtactg aaaaaactgt acctgtggta tcaaatggcg gattagctct ttcagaccaa 721 agtattccac tgcaagcata catcggaacc accacaggga atcctgatac aaacggtgga 781 gttacggccg gtactgtcac tgctagtgca gtaatgacta ttcgttcagc aggtacaccg 841 taattagata acaattttta tacaacaaaa caggaaggat tttgaactaa tccttcctgt 901 tattggagat tgaaatgtct aagtttgtaa tatttcttgt 
gtttttgttt atat //