Path: utzoo!attcan!uunet!samsung!usc!apple!bionet!root From: GenBank-Updates@genbank.bio.net Newsgroups: bionet.molbio.genbank.updates Subject: Database Update Message-ID: Date: 25 Jul 90 12:00:18 GMT Sender: root@genbank.BIO.NET Distribution: bionet Lines: 3910 Approved: lear@genbank.bio.net Checksum: 07698 223 LOCUS DOGRAB2A 656 bp ss-mRNA MAM 25-JUL-1990 DEFINITION C.familiaris GTP-binding protein (rab2) mRNA, complete cds. ACCESSION M35521 KEYWORDS GTP-binding protein. SOURCE C.familiaris (strain Madin-Darby; Cocker spaniel) kidney, cDNA to mRNA, clone II. ORGANISM Canis familiaris Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Carnivora; Caniformia; Canoidea; Canidae. REFERENCE 1 (bases 1 to 656) AUTHORS Chavrier,P., Parton,R.G., Hauri,H.P., Simons,K. and Zerial,M. TITLE Localization of low-molecular weight GTP binding proteins to exocytic and endocytic compartments JOURNAL Cell (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by P.Chavrier, 22-JUN-1990. Base-pairs 508 to 564 form a synthetic peptide used to raise antibodies. FEATURES from to/span description pept 7 645 GTP-binding protein (rab2) BASE COUNT 209 a 124 c 158 g 165 t ORIGIN 1 gcggccatgg cgtacgctta tctcttcaag tacatcatca tcggcgacac aggtgttggt 61 aaatcatgct tattgctaca gtttacagac aagaggtttc agccagtgca tgacctgact 121 atcggtgtag agtttggtgc tcgaatgata actattgatg ggaaacagat aaaacttcag 181 atatgggata cggcagggca agagtccttt cgttccatca caaggtcata ttacagaggt 241 gcagcagggg ctttactagt gtatgatatt acaaggagag atacattcaa ccacttgaca 301 acctggttag aagatgcccg ccagcattcc aattccaaca tggtcattat gcttattgga 361 aataaaagtg atttagaatc aagaagagaa gtaaaaaaag aagaaggtga agcttttgca 421 cgagaacatg gacttatctt catggaaact tctgctaaga ctgcttccaa tgtagaagag 481 gcatttatta atacagcaaa agaaatttat gagaaaatcc aagaaggagt ctttgacatt 541 aataatgagg caaacggcat taaaattggc cctcagcacg ctgctactaa tgccacacac 601 gcgggcaatc agggaggaca gcaggccggg ggaggctgct gttgagtccg tttttt // LOCUS DOGRAB5A 796 bp ss-mRNA MAM 25-JUL-1990 DEFINITION C.familiaris GTP-binding protein (rab5) mRNA, complete cds. ACCESSION M35520 KEYWORDS GTP-binding protein. SOURCE C.familiaris (strain Madin-Darby; Cocker spaniel) kidney, cDNA to mRNA, clone II. ORGANISM Canis familiaris Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Carnivora; Caniformia; Canoidea; Canidae. REFERENCE 1 (bases 1 to 796) AUTHORS Chavrier,P., Parton,R.G., Hauri,H.P., Simons,K. and Zerial,M. TITLE Localization of low-molecular weight GTP binding proteins to exocytic and endocytic compartments JOURNAL Cell (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by P.Chavrier, 22-JUN-1990. Base-pairs 664 to 711 form a synthetic peptide used to raise antibodies. FEATURES from to/span description pept 121 768 GTP-binding protein (rab5) BASE COUNT 267 a 163 c 174 g 192 t ORIGIN 1 ccgcggctcc tcgtgctgcg gcctcaggtt tctgtatatc cagaaagaaa aaatttgaca 61 ccttgcatcc tggaagttca tttaagagac tgaaattagg gacttctttc aaatttggac 121 atggctaatc gaggagcaac aagacccaac gggccaaata ctggaaataa aatatgccag 181 ttcaaactag tacttctggg agagtctgct gttggcaaat caagcctagt gcttcgtttt 241 gtgaagggcc aatttcatga atttcaagag agtaccatag gggctgcttt tctaacccaa 301 actgtgtgtc ttgatgatac aacagtaaag tttgaaatat gggatacagc tggtcaagaa 361 cgataccata gcttagcacc aatgtactac agaggagcac aagcagccat agttgtatat 421 gatatcacaa atgaggagtc ctttgccaga gccaaaaact gggttaaaga acttcagagg 481 caagccagtc ctaacattgt aatagcttta tcaggaaaca aggctgatct tgcaaataaa 541 agagctgtcg atttccagga agcacagtcc tatgcagatg acaacagttt attattcatg 601 gagacatcag ctaaaacatc gatgaacgta aatgaaatat tcatggcaat agctaaaaag 661 ttgccaaaga acgaaccaca gaatccagga gcaaattctg ccagaggaag aggagtagac 721 cttactgaac ccacgcagcc aaccaggagt cagtgttgta gtaactaaac ctccagtttg 781 aacttcctgg aatatc // LOCUS DOGRAB7A 811 bp ss-mRNA MAM 25-JUL-1990 DEFINITION C.familiaris GTP-binding protein (rab7) mRNA, complete cds. ACCESSION M35522 KEYWORDS GTP-binding protein. SOURCE C.familiaris (strain Madin-Darby; Cocker spaniel) kidney, cDNA to mRNA, clone II. ORGANISM Canis familiaris Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Carnivora; Caniformia; Canoidea; Canidae. REFERENCE 1 (bases 1 to 811) AUTHORS Chavrier,P., Parton,R.G., Hauri,H.P., Simons,K. and Zerial,M. TITLE Localization of low-molecular weight GTP binding proteins to exocytic and endocytic compartments JOURNAL Cell (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by P.Chavrier, 22-JUN-1990. Base-pairs 542 to 592 form a synthetic peptide used to raise antibodies. FEATURES from to/span description pept 20 643 GTP-binding protein (rab7) BASE COUNT 251 a 206 c 194 g 160 t ORIGIN 1 gagcggctgc gtttgaagga tgacctctag gaagaaagtg ttgctgaagg ttatcatcct 61 gggagattct ggagttggta agacatcact catgaaccag tatgtgaaca agaaattcag 121 taatcagtac aaagctacaa taggagcaga ctttctgaca aaggaggtga tggtggatga 181 cagactagtt acaatgcaga tctgggacac agcaggccag gaacggttcc agtcccttgg 241 tgtggccttc tacagaggtg cagactgctg cgttctggta tttgacgtta ctgcccccaa 301 cacattcaaa accctcgata gctggagaga tgagtttctc atccaggcca gtccccggga 361 tcctgaaaac ttccctttcg ttgtgttggg aaacaagatt gacctcgaaa acagacaagt 421 ggccacaaag cgggcacagg cctggtgcta cagcaaaaac aacattccct acttcgagac 481 cagtgccaag gaggccatca atgtggagca ggcgttccag acgattgcaa ggaatgcact 541 taaacaggaa acagaggtgg agctgtacaa tgaattccct gaacccatca aactggacaa 601 gaacgaccgg gccaagacct cagcggaaag ctgcagttgc tgaaggggca gtgagagcag 661 agcacagagt ccttcacaaa caaagaacac acttaggcct tccaacacga gcccccttct 721 tctcttccaa acaaaacata aagtcatctc tcgaatccag ctgccaaaag accctaccaa 781 acacttcacc ctgacacaca catacacaca c // LOCUS HUMU7AA 649 bp ds-DNA PRI 25-JUL-1990 DEFINITION Human U7 small nuclear RNA pseudogene, fragment 32sm. ACCESSION M35537 KEYWORDS U7 small nuclear RNA; pseudogene. SOURCE Human liver DNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 649) AUTHORS Soldati,D. and Schimperli,D. TITLE Structures of four human pseudogenes for U7 small nuclear RNA JOURNAL Gene (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable [or printed] sequence for [1] kindly submitted by D.Schimperli, 22-JUN-1990. FEATURES from to/span description uRNA.ps 293 352 pseudo-U7 uRNA BASE COUNT 217 a 107 c 115 g 199 t 11 others ORIGIN 1 attatggcag agtacatgta acatatagtt tgctattcna actgattttt gacaaagata 61 caacagcana tcaatggagg aacaatagcn tttttaacaa atggtgttgg cacaactgga 121 caactgtaag nnaaagaaaa tgaanttcaa tctanatctc anaccgtatt aaaaaaaact 181 caaagtgggc cacagactta gatataaaat gtaaaactat aacactttta gaaaanatat 241 aggagaanat ctatgggatt tagggcaaaa gcatgattca aaaaaggaaa gtcagtgtta 301 cagccctttt agaatttgtc tagcaggttt tctggttttc cagaaaacct ccacataaaa 361 aggaaaaaga aaaaaaggaa aaagtaataa attagtatga attgagcatt ttaatgattc 421 tattttattg cctttgttgg cttattaaat ataactctct gttttgttat tttagtggtt 481 gctttaggtt ttatagtaat acatctttaa cctgttacag tccaccttct ttttgtttgt 541 ttgttttgga agcagggtct cactctgtca ccaaggctag agtgcagtgg cactatcacg 601 gctcactgca acctcaacct cccaggctcc agngttcctc ctgctgcag // LOCUS HUMU7AB 521 bp ds-DNA PRI 25-JUL-1990 DEFINITION Human U7 small nuclear RNA pseudogene, fragment 32BG. ACCESSION M35538 KEYWORDS U7 small nuclear RNA; pseudogene. SOURCE Human liver DNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 521) AUTHORS Soldati,D. and Schimperli,D. TITLE Structures of four human pseudogenes for U7 small nuclear RNA JOURNAL Gene (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable [or printed] sequence for [1] kindly submitted by D.Schimperli, 22-JUN-1990. FEATURES from to/span description uRNA.ps 295 344 pseudo-U7 uRNA BASE COUNT 186 a 102 c 89 g 144 t ORIGIN 1 tttcttcttt ttccacctct tgtctattca ggccctcagt gaattggatc atgctcaccc 61 acatcagggc aggcaatcta cttattgagt tcactgattc aaatgataac ctcacctgga 121 aaaatcctca cagacccaga aataatgttt aatctaagca cccatggcca gtcaagttga 181 gacataaaat tagccatcac agtacaggca tacctgggaa atgacgcagg ttcagttcca 241 gaccatcaca ataaagcaaa tattgcaata aagtgagtca caaaaagaaa aagtcagtgt 301 tacagctttt agaatttgtc tagcaggttt tctggaaaac cttcacaaaa aaaggagaaa 361 gagtgcatat aaaatgctta tgttgatacc atactgtagt ctattaagtg tgcaatagca 421 ttatgtctat aaaacaatgt acatacttta aaaatatttt attgttaaaa catgctatca 481 cagagacaca aagtgagcac atgctgttgg aaaaatggta c // LOCUS HUMU7AC 513 bp ds-DNA PRI 25-JUL-1990 DEFINITION Human U7 small nuclear RNA pseudogene. ACCESSION M35539 KEYWORDS U7 small nuclear RNA; pseudogene. SOURCE Human liver DNA, clone 25H. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 513) AUTHORS Soldati,D. and Schimperli,D. TITLE Structures of four human pseudogenes for U7 small nuclear RNA JOURNAL Gene (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable [or printed] sequence for [1] kindly submitted by D.Schimperli, 22-JUN-1990. FEATURES from to/span description uRNA.ps 204 264 pseudo-U7 uRNA BASE COUNT 127 a 83 c 81 g 222 t ORIGIN 1 aattgtctgt ctttcatatt tttgtcattc tcgtgagtgt gaagtggtat ctcattgtgg 61 ttttgatttg catttcccta atgactaatg gtgttgaata tcttttcata tgcttataag 121 ccatttatat gtctttggag aaattctttt caaatctctt gctcatttta aaattaggtt 181 gtcattttat tacggagttg cattagtgtt acagctcttt tagaatttgt ctagcaggtt 241 ttctgatttt tacccggaac ccctccccag ccaaaagtaa aagaaaaaaa aagctgcaat 301 agttctttat atagtttaga tacaaggccc ttatcagata tttgattttc aaatattgtc 361 tcccattctg tgagttgttt tttcactctc ttgatggtgt catatgaagc acaaattttt 421 ttttttattt tgataatgtc ccatttatct atgtattttt tcttttcatt tgtgcttttg 481 gtgtcgtacc taagaaactg ctgcttaact caa // LOCUS HUMU7AD 418 bp ds-DNA PRI 25-JUL-1990 DEFINITION Human U7 small nuclear RNA pseudogene, fragment 36h. ACCESSION M35540 KEYWORDS U7 small nuclear RNA; pseudogene. SOURCE Human liver DNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 418) AUTHORS Soldati,D. and Schimperli,D. TITLE Structures of four human pseudogenes for U7 small nuclear RNA JOURNAL Gene (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable [or printed] sequence for [1] kindly submitted by D.Schimperli, 22-JUN-1990. FEATURES from to/span description uRNA.ps 229 286 pseudo-U7 uRNA BASE COUNT 128 a 58 c 106 g 122 t 4 others ORIGIN 1 agaggcacat gtcaagatga agctctggtg aagaattgat caaaaatagt ggcggagtga 61 gatggagatt taaatccaag ggctgattta tgaaggcttc aaagattttt tttttttaaa 121 gaaagaacat agattagttg tttctgaggg ctggagggga cagagataga ggcggcgacg 181 gaaggatcct tcaggtttct tcttgaggtg attaaacgtt ctgaaatcgc gtgttacagc 241 tcttttggaa tttgtctagc aggttttctg gttttcactg caaaacccca cagtnnnaaa 301 acagaaaaaa aaawttatcc taaaattggg ctgtggtaat ggttgcgcat atgctgtgaa 361 taggcttcca aatattgaaa tgtccacttc aaacgagtga actgtatggt atgtgaat // LOCUS SCMPMYA1 3156 bp ss-mRNA INV 25-JUL-1990 DEFINITION S.mansoni paramyosin mRNA, complete cds. ACCESSION M35499 KEYWORDS paramyosin. SEGMENT 1 of 2 SOURCE S.mansoni (strain Puerto Rican) adult worm, cDNA to mRNA, clones Pmy[1,8,11,15]. ORGANISM Schistosoma mansoni Eukaryota; Animalia; Eumetazoa; Platyhelminthes; Trematoda; Digenea; Strigeidida; Strigeata; Schistosomatoidea; Schistosomatidae. REFERENCE 1 (bases 1 to 3156) AUTHORS Laclette,J.P., Landa,A., Arcos,L., Willms,K., Davis,A.E. and Shoemaker,C.B. TITLE Paramyosin is the Schistosoma mansoni (trematoda) homologue of antigen B from Taenia solium (cestoda) JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by J.P.Laclette, 22-JUN-1990. Author address: J.P.Laclette Department of Tropical Public Health Harvard School of Public Health 665 Huntington Avenue Boston, MA 02115 Email: zehm%hscvax%harvunxwxw.edu FEATURES from to/span description pept 47 2647 paramyosin mRNA < 1 > 3156 paramyosin mRNA BASE COUNT 1279 a 435 c 516 g 923 t 3 others ORIGIN 1 tctttcacta atattaaaaa gaaaaattta aaaaaaaaga ggaaaaatga tgaatcatga 61 tacagaatct catgtgaaaa tatcaagaac tatttatcga ggagtatcac caagtacaac 121 aagacttgag agtcgagtac gggaattaga agatcttttg gatttagaac gtgatgcaag 181 agttcgagct gaacgacatg ctgctgattt aggttttcaa gtggatgcat tatcagaacg 241 tttagatgaa gctggaggtt ctacaacaca aactcaagaa ttattaaaac gtcgtgaaat 301 ggaaatcaat aaactacgta aagatttaga aaatgctaat gcatcacttg aactagctga 361 aacatcaatg agacgtcgac atcaaacagc attgaatgaa ttagctttgg aagttgaaaa 421 tttacaaaaa caaaaaggaa aggctgaaaa agacaaaagt catttgatta tggaagtgga 481 taatgttcta ggacaattag atggtgcatt aaaagctaag caatcagctg aatcaaaatt 541 agaaggatta gatagtcaat taaatcgttt aaaatcatta accgacgatt tacaaagaca 601 attaactgaa ttaaataatg ctaaatcaag attaacatca gaaaattttg aattattaca 661 tataaatcaa gattatgaag cacaaatatt aaattattct aaagctaaat catcacttga 721 aagtcaagta gatgatttaa aaagatcatt agatgatgaa gctaaaaatc gttttaatct 781 tcaagctcaa cttacatcac ttcaaatgga ttatgataat ttacaagcta aatatgatga 841 agaaagtgaa gaagctagta atttacgtag tcaagtatct aaatttaacg ctgatattgc 901 tgcattaaaa tcgaaatttg aacgtgaact tatgagtaaa acagaagaat tcgaagaaat 961 gaagaggaaa ttcactatga gaattaccga acttgaagat actgctgaaa gagaacgatt 1021 aaaagcggta tcattagaaa aacttaaaac aaaattaaca ttagaaatta aagatttaca 1081 atctgaaata gaaagtcttt cattagaaaa tagtgaatta attcgtcgtg ctaaagctgc 1141 tgaatcatta gcttctgatt tacaacgtcg tgttgatgaa ttaacaattg aagtgaatac 1201 attaacatca caaaatagtc aattagaaag tgaaaatcta cgtttaaaaa gtttagttaa 1261 tgatttaacg gataaaaata atttattaga acgtgaaaat cgtcaaatga atgatcaagt 1321 caaagaatta aaaagttcac ttcgtgatgc taatcgtcgt cttactgatt tagaagcatt 1381 aagatcgcaa ttagaggctg aaagagataa tcttgcatca gctttacatg atgctgaaga 1441 agcattacat gatatggatc aaaagtatca agcatcacaa gctgcattaa atcatttgaa 1501 atctgaaatg gaacaaaggc ttagagaaag agatgaagaa ttagaaagtt taagaaaaag 1561 tactactaga acaattgaag aattaactgt tacaataact gaaatggaag ttaaatataa 1621 atcagaatta tcacgtttaa aaaaacgtta tgaatcaaat attgctgatt tagaaattca 1681 acttgataca gctaataaag ctaatgcaaa tcttatgaaa gagaataaaa atttatcaca 1741 acgtgttaaa gatttagaaa catttttaga tgaagaacgt cgtcttcgtg aagcagctga 1801 aaataattta caaattactg aacataaacg tttacaatta gcaaatgaaa ttgaagaaat 1861 acgtagtaca ttagaaaatt tagaacgttt acgtaaacat gctgaaacag aacttgaaga 1921 agctcaatca cgtgttagtg aattaactat tcaagttaat acattaacta atgataaacg 1981 tcgtcttgaa ggtgatattg gtgtaatgca ggctgatatg gatgatgcta ttaatgctaa 2041 acaagcttct gaagatcgag caattagatt aaataatgaa gtattacgtt tagctgatga 2101 attacgtcaa gaacaaggaa attataaaca tgctgaagca ttaagaaaac aattagaaat 2161 tgaaatacgt gaaattacag ttaaattaga agaagctgaa gcatctgcta cacgtgaagg 2221 tcgtcgtatg gtacaaaaat tacaggctcg tgtacgtgaa cttgaatcag aattcgatgg 2281 tgaatcaaga agatgtaaag atgcattagc tcaagcacgt aaatttgaac gtcaatataa 2341 agaattacaa acacaagctg aagatgatcg tcgtatggta ttagaacttc aagatttatt 2401 agataaaact caaatgaaaa tgaaagccta taaacgtcaa ttggaagaaa tggaagaagt 2461 atctcaaatt acaatgaata aatatcgtaa agcccaacaa caaattgaag aagctgaaca 2521 tcgtgcagat atggctgaac gtacagtcac tgtacgtcgt gttggtccag gtggacgtgc 2581 tgtttctgta gcacgtgaat tatctgtcac atcaaataga ggaatgagag caacaagtat 2641 gatgtaaagc acttaaataa taataataat agtgatacta tacacatata caaacgccta 2701 tatctttctt tctctctttg tttcgttttc ctcatcttcg ctttttttta gtcatgatat 2761 tcatctaaat gaggaaatta tcaataatga cctattatta ttcaatgtgc tttactttac 2821 ttcccaccct aaatctcctc ggtatatcgt ttcccttttt ttttcttttt ttttctaaaa 2881 acaaaaaatt ctaaaagtga aagacgaaaa aaaaaaannn cagaaatttg tttcctcctc 2941 tcatattttc tctttgttct ttttattcat ttcatttatt gtattattaa tattgctatt 3001 attattattg ttattactac ctaaccgatg gtttcaacga cagcaatctc ccatatttct 3061 acacacacac acacacaaca cacacaacac acaaaagtat ctgtgcaatc gtaatagata 3121 atctttattt attgattaaa aaaaaaaaaa aaaaaa // LOCUS SCMPMYA2 217 bp ss-mRNA INV 25-JUL-1990 DEFINITION S.mansoni paramyosin mRNA, 3' flank. ACCESSION M36871 KEYWORDS paramyosin. SEGMENT 2 of 2 SOURCE S.mansoni (strain Puerto Rican) adult worm, cDNA to mRNA, clones Pmy[1,8,11,15]. ORGANISM Schistosoma mansoni Eukaryota; Animalia; Eumetazoa; Platyhelminthes; Trematoda; Digenea; Strigeidida; Strigeata; Schistosomatoidea; Schistosomatidae. REFERENCE 1 (bases 1 to 217) AUTHORS Laclette,J.P., Landa,A., Arcos,L., Willms,K., Davis,A.E. and Shoemaker,C.B. TITLE Paramyosin is the Schistosoma mansoni (trematoda) homologue of antigen B from Taenia solium (cestoda) JOURNAL Unpublished (1990) Harvard 665 Huntington Avenue, Boston, MA 02115 STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by J.P.Laclette, 22-JUN-1990. Author address: J.P.Laclette Department of Tropical Public Health Harvard School of Public Health 665 Huntington Avenue Boston, MA 02115 Email: zehm%hscvax%harvunxwxw.edu FEATURES from to/span description mRNA < 1 217 paramyosin mRNA BASE COUNT 66 a 46 c 17 g 88 t ORIGIN About 1 kb after segment 1. 1 cagaaatttg tttcctcctc tcatattttc tctttgttct ttttattcat ttcatttatt 61 gtattattaa tattgctatt attattattg ttattactac ctaaccgatg gtttcaacga 121 cagcaatctc ccatatttct acacacacac acacacaaca cacacaacac acaaaagtat 181 ctgtgcaatc gtaatagata atctttattt attgatt // LOCUS ECAPNL 420 bp ds-DNA BCT 25-JUL-1990 DEFINITION E.carotovora pectin lyase (PNL) gene, 5' end. ACCESSION M35271 KEYWORDS pectin lyase. SOURCE E.carotovora DNA, clone pTN2159. ORGANISM Erwinia carotovora Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Facultatively anaerobic rods; Enterobacteriaceae. REFERENCE 1 (bases 1 to 420) AUTHORS Nishida,T., Suzuki,T., Ito,K., Kamio,Y. and Izaki,K. TITLE Cloning and expression of pectin lyase gene from Erwinia carotovora in Escherichia coli JOURNAL Biochem. Biophys. Res. Commun. 168, 801-808 (1990) STANDARD simple staff_entry FEATURES from to/span description pept 284 > 420 pectin lyase (EC 4.2.2.10) BASE COUNT 127 a 66 c 94 g 133 t ORIGIN 1 cctatcagtc tgatgaagtt gaacaggctg cgaaccgtat ttttaatggc ggcgggtaaa 61 aggctggtga tgataatcgt agcgctgcca ttttactaaa agatggcggc gtattaattg 121 ggtattgaat tattcgcaag gttgtttttt tattaaactc gattaataag cgtaatgaaa 181 tcctttctat acaattttta attgtcggag gcgtattatt tagtctcaat taaataatac 241 gctggaagac attattattc actcattgta aaaaggaaaa cttatggctt atccaacaac 301 aaatcttact gggcttattg gttttgcaaa agcggcaaaa gttaccggag gaacgggcgg 361 taaagtcgtt acggtaaatt ctttggccga ttttaaatca gcggtgacgg ttccgcaaaa // LOCUS ECOUXEX 318 bp ds-DNA BCT 25-JUL-1990 DEFINITION E.coli uxaCT-exuT intercistronic region. ACCESSION M35280 KEYWORDS catabolite receptor protein. SOURCE E.coli (strain K-12) DNA. ORGANISM Escherichia coli Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Facultatively anaerobic rods; Enterobacteriaceae. REFERENCE 1 (bases 1 to 318) AUTHORS Blanco,C. and Mata-Gilsinger,M. TITLE Identification of cyclic AMP-CRP binding sites in the intercistronic regulatory uxaCA-exuT region of Escherichia coli JOURNAL FEMS Microbiol. Lett. 33, 205-209 (1986) STANDARD simple staff_entry FEATURES from to/span description site 46 71 catabolite receptor protein binding site 1 site 165 193 catabolite receptor protein binding site 2 BASE COUNT 98 a 62 c 70 g 88 t ORIGIN 1 gtcgacttat gatttgcgac ggcagaaaga taacttgtca tacaacttta aaaggtgaga 61 gccatcacaa atgtgggaat atttgtaggg acattacctg acgacagcaa ggccagtact 121 ggcgcggcct gcagcgagat ttaccacttt gagagtaatt tttttaacta cgtttattga 181 tctaactcac gaaaatatct tcggactctg gaaattggtg tgataacttt gtcagcatcg 241 caccataagc aagctagctc actcgttcga agaggaagac gaaaataact ccgtttatga 301 ctgaagatta tcctgtta // LOCUS HUMSYNIFA 144 bp ds-DNA SYN 25-JUL-1990 DEFINITION Human synthetic interferon alpha-2 gene, 3' end. ACCESSION M35281 KEYWORDS interferon. SOURCE Synthetic DNA. ORGANISM Artificial gene Artificial sequences; Genes. REFERENCE 1 (bases 1 to 144) AUTHORS Rossi,J.J., Kierzek,R., Huang,T., Walker,P.A. and Itakura,K. TITLE An alternate method for synthesis of double-stranded DNA segments JOURNAL J. Biol. Chem. 257, 9226-9229 (1982) STANDARD simple staff_entry FEATURES from to/span description pept < 13 135 interferon alpha-2 BASE COUNT 37 a 32 c 32 g 43 t ORIGIN 1 caagaattca tgatcactct gtacctgaag gaaaagaaat actctccgtg tgcttgggaa 61 gttgtacgtg ctgaaatcat gcgttctttc tccctgtcta ctaaccttca ggagtctctg 121 cgttctaaag aatagctgca gtgg // LOCUS RATMAL5 1104 bp ds-DNA ROD 25-JUL-1990 DEFINITION Rat malic enzyme (ME) gene, 5' end. ACCESSION M35258 M21619 KEYWORDS malic enzyme. SOURCE Rat (Sprague-Dawley, female) liver, clone lambda-g-ME-29. ORGANISM Rattus norvegicus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 1104) AUTHORS Morioka,H., Tennyson,G.E. and Nikodem,V.M. TITLE Structural and functional analysis of the rat malic enzyme gene promoter JOURNAL Mol. Cell. Biol. 8, 3542-3545 (1988) STANDARD simple staff_review REFERENCE 2 (bases 427 to 925; revises [1]) AUTHORS Petty,K.J., Desvergne,B., Mitsuhashi,T. and Nikodem,V.M. TITLE Identification of a thyroid hormone response element in the malic enzyme gene JOURNAL J. Biol. Chem. 265, 7395-7400 (1990) STANDARD simple staff_entry FEATURES from to/span description pept 918 > 1104 malic enzyme (EC 1.1.1.40) mRNA 883 > 1104 malic enzyme mRNA rpt 814 823 direct repeat rpt 827 836 direct repeat BASE COUNT 220 a 376 c 290 g 218 t ORIGIN 1 bp upstream of EcoRI site. 1 gaattcgcat agcccagaag ctatagctgt actgatgggc tcaagtaaaa taattagaaa 61 ttatttctca ggtatctagg caatatttaa cccccaaatt gttccgcagt gtctagatga 121 acaccataga atttggccgt gcgacttaac tgaaaagaaa gggctttgtt gtctgaaggc 181 tgcttggctg tattgttttg ttttaatcag acatccttgg gagacatagg atttatttct 241 ccagtccttg gatcttcaag tataaatatc aataatacaa ccactgggtt tcagtactgg 301 aagacctgtt attctgaccc tctgtcatca gagaagaaac catacatcat cttgcaaaaa 361 ttaacatctt ggtttccaga acgctcagga aaattgttct taagctcaat aggactggcc 421 actggacctg tgccctctaa cacctttttc ttaccacgtt cgaacacaat tccctcagat 481 actattcaga aacaggcgag gagtcgcccg ccctatcgcc cagtgccatc gaggcctggg 541 cattctgggt caaagttgat cccctcctgc atcaggcccc tggggcatgg ctggcatcca 601 ggacgttggg gttaggggag gacagtggac gagcggagga agcgaggcgg cccgcccctc 661 acccgtcggt gcccaggtcg cacgctcggc gctcaccagc ttggccggcg ccccgccccc 721 gcctcctcgc acggcggctc ggccgatgcc gccgtgactc agcgcttctc gcgggccgcc 781 cgcgcggccg cggctaggcc gggctcctcc cgcctcgcca ccccctctcg ccacccacgc 841 ccgcccccgg ccgcggggcc ttccgtcgca cggccgccgc ccgccgcact cccgtccgcc 901 ccgccacggt gctggccatg gatccccgag ccccccgccg ccgacacacc caccagcgcg 961 gctacctgct gacgcgggac ccgcatctca acaaggtgag ccccgccccg agagccgccc 1021 tgggcccgcc gctgggctcg ggcacccgcg tcccaccgag gggacggtcc cacccgggag 1081 gccactgcgg agccggcgcc aacg // LOCUS RATSPA 1595 bp ss-mRNA ROD 25-JUL-1990 DEFINITION Rat serine pyruvate aminotransferase mRNA, complete cds. ACCESSION M35270 X06357 KEYWORDS serine pyruvate aminotransferase. SOURCE Rat (strain Wistar) liver, clones pRspt910,321]. ORGANISM Rattus rattus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 63 to 1595) AUTHORS Oda,T., Miyajima,H., Suzuki,Y. and Ichiyama,A. TITLE Nucleotide sequence of the cDNA encoding the precursor for mitochondrial serine:pyruvate aminotransferase of rat liver JOURNAL Eur. J. Biochem. 168, 537-542 (1987) STANDARD simple automatic REFERENCE 2 (bases 1 to 198) AUTHORS Oda,T., Funai,T. and Ichiyama,A. TITLE Generation from a single gene of two mRNAs that encode the mitochondrial and peroxisomal serine:pyruvate aminotransferase of rat liver JOURNAL J. Biol. Chem. 265, 7513-7519 (1990) STANDARD simple staff_entry FEATURES from to/span description pept 109 1353 peroxisomal serine:pyruvate aminotransferase precursor (EC 2.6.1.51; pSPT) sigp 109 118 serine:pyruvate aminotransferase signal peptide matp 119 1350 serine:pyruvate aminotransferase pept 175 1353 mitochondrial serine:pyruvate aminotransferase (mSPT) mRNA 61 > 1520 pSPT mRNA (alt.) mRNA 62 > 1520 pSPT mRNA (alt.) mRNA 127 > 1520 mSPT mRNA (alt.) mRNA 129 > 1520 mSPT mRNA (alt.) mRNA 130 > 1520 mSPT mRNA (alt.) signal 1515 1520 polyA signal BASE COUNT 376 a 437 c 455 g 327 t ORIGIN 1 aggacaaaca tcgatcaggg tcaaattgac aataaaaggg ctggagcaag caacagggac 61 tcaccaacca ggcctcgcct ctgagttcag cccagagcta gctgggaaat gttccggatg 121 ttggccaagg ccagtgtgac gctgggctcc agggcagcaa gttgggtacg gaacatgggc 181 tcgcaccagc tgctggtgcc acccccagag gccctgagca agcccctgtc aattcctaag 241 aggctcctgt tgggtccggg accctccaac ctggctcctc gtgtgctagc agctggaagt 301 ctgaggatga ttggccacat gcaaaaagag atgtttcaga tcatggatga gatcaagcag 361 ggcatccagt atgtgttcca gaccaggaac cccctcacac tggttgtcag cggctcagga 421 cattgtgcca tggagactgc cctgttcaac ctcctggagc ctggggactc ctttcttgtg 481 ggaaccaatg gcatctgggg gatacgggct gcagagatcg ctgagcggat tggagcccgt 541 gtgcaccaga tgatcaagaa gcctggagaa cattacacac tgcaggaggt ggaggagggc 601 ctggctcagc ataaaccagt gttgctgttc ctgacccacg gggagtcatc cactggtgtg 661 ctgcagcccc tggatggttt cggggagctc tgccacaggt atcagtgcct actcctggtg 721 gactcggtgg catcattggg cggagtccct atctacatgg accaacaagg catcgacatc 781 ttgtactctg gctctcagaa ggtcctgaat gccccaccag ggatctccct catctccttc 841 aacgacaagg ccaaatccaa agtctactcc cggaagacaa agccagtctc cttctacaca 901 gacatcactt atttgtccaa gttgtggggc tgtgagggca agaccagagt aattcatcat 961 acgttgcctg tcatcagctt atactgcctg agggagagcc tagcactcat ttcagagcag 1021 ggcctggaga attcctggcg gcgtcacagg gaggctacag cacatctgca caagtgcctg 1081 cgggagttgg gcttaaagtt ctttgtgaag gacccggaaa tccggctacc tacaatcacc 1141 accgtgaccg tgcctgccgg ctacaactgg agggacatcg tcagctacgt gctggaccac 1201 ttcaacattg aaatctctgg tggtcttggg ccctctgagg ataaggtgct gcggattggc 1261 ctcctgggct acaacgccac cacagagaat gcggaccgtg tagcggaggc cctgagggag 1321 gccctgcaac attgtcctaa gaataaattg tgagcatcgt ctcaccagac tgtgccctcc 1381 tggaggggct gggaatatag caggaacgag aagactgtgc aagccctcca gccagcaaag 1441 gctgccgatg taaccaggcg ggaagggtca gggcgaagct gcccctctcc ccacagatgg 1501 agccctgtgg tcacatgatg ctaatcacct tccgatgaag ctgcattctg caggccactg 1561 gacttcggga atattcaata aagtacttgc cagac // LOCUS YSCCOX9A 180 bp ds-DNA PLN 25-JUL-1990 DEFINITION S.cerevisiae cytochrome c oxidase subunit VIIa (COX9) gene, complete cds. ACCESSION M35260 KEYWORDS cytochrome c oxidase. SOURCE S.cerevisiae DNA. ORGANISM Saccharomyces cerevisiae Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes; Endomycetales; Saccharomycetaceae. REFERENCE 1 (bases 1 to 180) AUTHORS Duhl,D.M., Powell,T. and Poyton,R.O. TITLE Mitochondrial import of cytochrome c oxidase subunit VIIa in Saccharomyces cerevisiae: Identification of sequences required for mitochondrial localization in vivo JOURNAL J. Biol. Chem. 265, 7273-7277 (1990) STANDARD simple staff_entry FEATURES from to/span description pept 1 180 cytochrome c oxidase subunit VIIa BASE COUNT 53 a 35 c 51 g 41 t ORIGIN 1 atgactattg ctccaattac tggtacgatc aagagaagag tcatcatgga catcgtcctc 61 gggttctccc tcgggggtgt catggcctct tactggtggt ggggattcca catggataag 121 attaacaaga gagagaagtt ctacgcagag ctagctgaga ggaaaaagca agagaactga // LOCUS DROTNCOPIA 276 bp ds-DNA INV 25-JUL-1990 DEFINITION D.melanogaster transposable element copia DNA in omega-aLTR1. ACCESSION M35053 KEYWORDS copia transposon; transposable element. SOURCE D.melanogaster (strain w-a-1A) DNA. ORGANISM Drosophila melanogaster Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta; Pterygota; Neoptera; Holometabola; Diptera; Brachycera; Cyclorrhapha; Schizophora; Drosophiloidea; Drosophilidae. REFERENCE 1 (bases 1 to 276) AUTHORS Zachar,Z., Davison,D., Garza,D. and Bingham,P.M. TITLE A detailed developmental and structural study of the transcriptional effects of insertion of the copia transposon into the white locus of Drosophila melanogaster JOURNAL Genetics 111, 495-515 (1985) STANDARD simple staff_entry BASE COUNT 100 a 42 c 34 g 100 t ORIGIN 1 tgttggaata tactattcaa cctacaaaaa taacgttaaa caacactact ttatatttga 61 tatgaatggc cacacctttt atgccataaa acatattgta agagaatacc actcttttta 121 ttccttcttt ccttcttgta cgttttttgc tgtgagtagg tcgtggtgct ggtgttgcag 181 ttgaaataac ttaaaatata aatcataaaa ctcaaacata aacttgacta tttatttatt 241 tattaagaaa ggaaatataa attataaatt acaaca // LOCUS ECOMETBJI 82 bp ds-DNA BCT 25-JUL-1990 DEFINITION E.coli metB-metJ intercistronic DNA region. ACCESSION M34899 KEYWORDS . SOURCE E.coli (strain K12) DNA, clone pAA110. ORGANISM Escherichia coli Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Facultatively anaerobic rods; Enterobacteriaceae. REFERENCE 1 (bases 1 to 82) AUTHORS Smith,A.A., Greene,R.C., Kirby,T.W. and Hindenach,B.R. TITLE Isolation and characterization of the product of the methionine- regulatory gene metJ of Escherichia coli K-12 JOURNAL Proc. Natl. Acad. Sci. U.S.A. 82, 6104-6108 (1985) STANDARD simple staff_entry BASE COUNT 25 a 13 c 15 g 29 t ORIGIN 1 tataatttta acggctattt gggatttgct catctatacg caaagaagtt tagatgtcca 61 gatgtattga cgtccattaa ca // LOCUS MNICPRRKA 103 bp ds-DNA RNA 25-JUL-1990 DEFINITION M.rugicum 4.5S ribosomal RNA. ACCESSION M35056 KEYWORDS 4.5S ribosomal RNA. SOURCE M.rugicum chloroplast DNA. ORGANISM Chloroplast Mnium rugicum Eukaryota; Plantae; Embryobionta; Bryophyta; Bryopsida; Bryidae; Bryales; Mniaceae; Mnium rugicum. REFERENCE 1 (bases 1 to 103) AUTHORS Troitsky,A.V., Bobrova,V.K., Ponomarev,A.G. and Antonov,A.S. TITLE The nucleotide sequence of chloroplast 4.5 S rRNA from Mnium rugicum (Bryophyta): Mosses also posses this type of RNA JOURNAL FEBS Lett. 176, 105-109 (1984) STANDARD simple staff_entry FEATURES from to/span description rRNA 1 103 4.5S ribosomal RNA BASE COUNT 33 a 20 c 28 g 22 t ORIGIN 1 taaggtgacg gcaagactag ccgtttatca tcacgatagg tgccaagtgg aagtgcagta 61 atgtatgcag ctgaggcatc ctaacagacc gagagattta aac // LOCUS MUSCABLA 125 bp ss-mRNA ROD 25-JUL-1990 DEFINITION Mouse tyrosine kinase (c-abl) mRNA, 3' terminus. ACCESSION M34905 KEYWORDS tyrosine kinase. SOURCE Mouse (strain NIH Swiss) testis, cDNA to mRNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 125) AUTHORS Meijer,D., Hermans,A., von Lindern,M., van Agthoven,T., de Klein,A., Mackenbach,P., Grootegoed,A., Talarico,D., Valle,G.D. and Grosveld,G. TITLE Molecular characterization of the testis specific c-abl mRNA in mouse JOURNAL EMBO J. 6, 4041-4048 (1987) STANDARD simple staff_entry FEATURES from to/span description mRNA < 1 44 tyrosine kinase (c-abl) mRNA (alt.) mRNA < 1 125 tyrosine kinase (c-abl) mRNA (alt.) BASE COUNT 26 a 35 c 26 g 38 t ORIGIN 1 gcttactgta cctgcacctt tgatgcttac aaactgtccc cgagagcctg tgctcactgt 61 gttttcattg gaaggaagct gcttactgta cctgcacctt tgatgcttac aaactgtccc 121 cgaga // LOCUS SOPMPDNA 111 bp ds-DNA SYN 25-JUL-1990 DEFINITION Synthetic ovalbumin pre-message selfprimer DNA. ACCESSION M35058 KEYWORDS ovalbumin. SOURCE Synthetic DNA. ORGANISM Artificial gene Artificial sequences; Genes. REFERENCE 1 (bases 1 to 111) AUTHORS Oyama,F., Kikuchi,R. and Uchida,T. TITLE A synthetic, partial pre-mRNA for ovalbumin primes its own complementary DNA with reverse transcriptase JOURNAL J. Biochem. 104, 403-408 (1988) STANDARD simple staff_entry FEATURES from to/span description site 3 3 cDNA start with primer site 36 36 cDNA start without primer site 69 84 primer-independent cDNA BASE COUNT 44 a 15 c 20 g 32 t ORIGIN 1 atcctggaag tttatcaaag cgaacaacct gtaattgaaa ataatagtag ctgaaataat 61 ggttatgaca aaaagaagtt atgcaatcca gtttcaagat ttctagctag t // LOCUS XELRRAA 121 bp ss-RNA RNA 25-JUL-1990 DEFINITION X.laevis 5S RNA. ACCESSION M35055 KEYWORDS 5S ribosomal RNA. SOURCE X.laevis kidney ribosomal RNA. ORGANISM Xenopus laevis Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Amphibia; Lissamphibia; Anura; Archeobatrachia; Pipoidea; Pipidae. REFERENCE 1 (bases 1 to 121) AUTHORS Brownlee,G.G., Cartwright,E., McShane,T. and Williamson,R. TITLE The nucleotide sequence of somatic 5 S RNA from Xenopus laevis JOURNAL FEBS Lett. 25, 8-12 (1972) STANDARD simple staff_entry FEATURES from to/span description rRNA 1 121 5S ribosomal RNA BASE COUNT 24 a 34 c 38 g 25 t ORIGIN 1 gcctacggcc acaccaccct gaaagtgccc gatctcgtct gatctcggaa gccaagcagg 61 gtcgggcctg gttagtactt ggatgggaga ccgcctggga ataccaggtg tcgtaggctt 121 t // LOCUS YSCTRR2 76 bp ss-tRNA RNA 25-JUL-1990 DEFINITION Yeast (S.cerevisiae, Brewer's) Arg-tRNA-II. ACCESSION K00157 M34900 KEYWORDS transfer RNA; transfer RNA-Arg. SOURCE Yeast (Saccharomyces cerevisiae, Brewer's) tRNA. ORGANISM Saccharomyces cerevisiae Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes; Endomycetales; Saccharomycetaceae. REFERENCE 1 (bases 1 to 76) AUTHORS Weissenbach,J., Martin,R. and Dirheimer,G. TITLE Nucleotide sequence of tRNA-Arg-II from Brewer's yeast JOURNAL FEBS Lett. 28, 353-355 (1972) STANDARD simple staff_entry REFERENCE 2 (bases 1 to 76) AUTHORS Weissenbach,J., Martin,R. and Dirheimer,G. TITLE The primary structure of Arg-tRNA-II from brewer's yeast: Partial digestion with ribonuclease T-1 and derivation of the complete sequence JOURNAL Eur. J. Biochem. 56, 527-532 (1975) STANDARD full staff_review COMMENT Contributed on tape April 1983 by M.Sprinzl & D.H.Gauss; from their entry 0130 in Nucleic Acids Res. 11, r1-r54 (1983). FEATURES from to/span description tRNA 1 76 Arg-tRNA-II (NAR: 0130) anticdn 34 36 Arg-tRNA-II anticodon gcg modified 1 1 f = pseudouridine modified 9 9 m1g = 1-methylguanosine modified 10 10 m2g = 2-methylguanosine modified 16 16 d = dihydrouridine modified 19 19 d = dihydrouridine modified 26 26 m22g = 2,2-dimethylguanosine modified 27 27 f = pseudouridine modified 34 34 i = inosine modified 47 47 d = dihydrouridine modified 49 49 m5c = 5-methylcytidine modified 54 54 t = 5-methyluridine modified 55 55 f = pseudouridine modified 58 58 m1a = 1-methyladenosine BASE COUNT 15 a 22 c 24 g 15 t ORIGIN 5' end of mature tRNA. 1 ttcctcgtgg cccaatggtc acggcgtctg gctgcgaacc agaagattcc aggttcaagt 61 cctggcgggg aagcca // LOCUS YSCTRT1A 76 bp ss-tRNA RNA 25-JUL-1990 DEFINITION Yeast (S.cerevisiae, brewer's) Thr-tRNA-1a. ACCESSION K00278 M34898 KEYWORDS transfer RNA; transfer RNA-Thr. SOURCE Yeast (Saccharomyces cerevisiae, brewer's) tRNA. ORGANISM Saccharomyces cerevisiae Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes; Endomycetales; Saccharomycetaceae. REFERENCE 1 (bases 1 to 76) AUTHORS Weissenbach,J., Kirarly,I. and Dirheimer,G. TITLE The nucleotide sequences of two threonine tRNAs from Brewer's yeast JOURNAL FEBS Lett. 71, 6-8 (1976) STANDARD simple staff_entry REFERENCE 2 (bases 1 to 76) AUTHORS Weissenbach,J., Kiraly,I. and Dirheimer,G. TITLE Structure primaire des Thr-tRNA-1a-et-b de levure de biere JOURNAL Biochimie 59, 381-391 (1977) STANDARD full staff_review COMMENT Contributed on tape April 1983 by M.Sprinzl & D.H.Gauss; from their entry 1760 in Nucleic Acids Res. 11, r1-r54 (1983). Brewer's yeast Thr-tRNA-1 is 50% Thr-tRNA-1a and 50% Thr-tRNA-1b [1]. FEATURES from to/span description tRNA 1 76 Thr-tRNA-1a (NAR: 1760) anticdn 34 36 Thr-tRNA-1a anticodon ggt modified 10 10 m2g modified 16 16 d modified 17 17 d modified 20 20 d modified 26 26 m22g modified 32 32 m3c modified 34 34 i modified 37 37 t6a modified 39 39 f modified 47 47 d modified 48 48 m5c modified 54 54 t modified 55 55 f modified 58 58 m1a BASE COUNT 20 a 17 c 21 g 18 t ORIGIN 5' end of mature tRNA. 1 gcttctatgg ccaagttggt aaggcgccac actggtaatg tggagatcat cggttcaaat 61 ccgattggaa gcacca // LOCUS YSCTRT1B 76 bp ss-tRNA RNA 25-JUL-1990 DEFINITION Yeast (S.cerevisiae, brewer's) Thr-tRNA-1b. ACCESSION K00279 M34898 KEYWORDS transfer RNA; transfer RNA-Thr. SOURCE Yeast (Saccharomyces cerevisiae, brewer's) tRNA. ORGANISM Saccharomyces cerevisiae Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes; Endomycetales; Saccharomycetaceae. REFERENCE 1 (bases 1 to 76) AUTHORS Weissenbach,J., Kirarly,I. and Dirheimer,G. TITLE The nucleotide sequences of two threonine tRNAs from Brewer's yeast JOURNAL FEBS Lett. 71, 6-8 (1976) STANDARD simple staff_entry REFERENCE 2 (bases 1 to 76) AUTHORS Weissenbach,J., Kiraly,I. and Dirheimer,G. TITLE Structure primaire des Thr-tRNA-1a-et-b de levure de biere JOURNAL Biochimie 59, 381-391 (1977) STANDARD full staff_review COMMENT Contributed on tape April 1983 by M.Sprinzl & D.H.Gauss; from their entry 1760 in Nucleic Acids Res. 11, r1-r54 (1983). Brewer's yeast Thr-tRNA-1 is 50% Thr-tRNA-1b and 50% Thr-tRNA-1a [1]. FEATURES from to/span description tRNA 1 76 Thr-tRNA-1b (NAR: 1760) modified 10 10 m2g modified 16 16 d modified 17 17 d modified 20 20 d modified 26 26 m22g modified 32 32 m3c modified 34 34 i anticdn 34 36 Thr-tRNA-1b anticodon ggt modified 37 37 t6a modified 39 39 f modified 47 47 d modified 48 48 m5c modified 54 54 t modified 55 55 f modified 58 58 m1a BASE COUNT 19 a 18 c 22 g 17 t ORIGIN 5' end of mature tRNA. 1 gcttctatgg ccaagttggt aaggcgccac actggtaatg tggagatcgt cggttcaaat 61 ccgactggaa gcacca // LOCUS BSTGLGBA 2735 bp ds-DNA BCT 25-JUL-1990 DEFINITION B.stearothermophilus branching enzyme (glgB) gene, complete cds. ACCESSION M35089 KEYWORDS branching enzyme. SOURCE B.stearothermophilus (strain 1503-4R, variant 4) DNA, clone pKVS1. ORGANISM Bacillus stearothermophilus Prokaryota; Bacteria; Firmicutes; Endospore-forming rods and cocci; Bacillaceae. REFERENCE 1 (bases 1 to 2735) AUTHORS Kiel,J.A.K.W., Boels,J.M., Beldman,G. and Venema,G. TITLE Molecular cloning and nucleotide sequence of the branching enzyme gene (glgB) from Bacillus stearothermophilus, expression in E.coli and B.subtilis JOURNAL Unpublished (1990) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by J.A.K.W.Kiel, 12-JUN-1990. Dept of Genetics Center of Biological Sciences Kerklaan 30, NL 9751 NN Haren, THE NETHERLANDS FEATURES from to/span description pept 522 2441 branching enzyme (glgB) (EC 2.4.1.18) pept 325 < 1 (c) unidentified ORF2 binding 337 330 (c) ORF2 ribosomal binding site (put.) binding 504 516 glgB ribosomal binding site (put.) signal 370 365 (c) ORF2 -10 region (put.) signal 394 389 (c) ORF2 -35 region (put.) signal 446 458 glgB -35 region (put.) signal 469 479 glgB -10 region (put.) BASE COUNT 835 a 492 c 634 g 774 t ORIGIN 1 gaattccaat ggaaataatg gctaacgtaa ggccgtttaa aaaggacgta ataatttcaa 61 agcgcaaata accgaatgta aatcgatgat ttggcggacg catggcaaga taaagagcga 121 tcatgctaag cccaagcgcc aatacgtcag atgccatatg ggcagagtcg gaaagcaaag 181 ctaaggaatt ggataatagc cccccaacaa tttccacaat cgtaaaaaac aatgttaaaa 241 cgagagtgat ccaaagcgtt tttttcgatt gattttgcgt ttttacatga ggaagatggt 301 gataatcgta ttgaattggt gacatgacac acctcttatt tagaattatt tttaatttat 361 atacattata atatagtttt ttataattgt gcaaaaaaat tttttgttta tttatcgaaa 421 aatgtaaaaa aaatacaatt tttttatcaa ggaatttatg gaatcgctgt ggaatataag 481 taacaacggt aagaaacttt aaggaaagga tgcgatacag attgatcgcc gtcggtccca 541 ctgatttaga aatctattta tttcatgaag gcagcttata taaaagttat gaattgtttg 601 gtgcacatgt gataaagaaa aatggcatgg tcggaacccg gttttgtgta tgggcacccc 661 atgcgcggga agtgcgatta gtcggcagtt ttaatgaatg gaacggaact aattttaacc 721 ttatgaaagt aagtaatcaa ggcgtatgga tgatttttat tcctgaaaac ttagaagggc 781 atttatataa atacgaaatt acgacgaacg atgggaatgt tctgttaaaa tcggatccat 841 acgcgtttta ctccgagttg cgtccccata ctgcttccat tgtctacaac ataaaaggat 901 atcaatggaa tgaccagaca tggcgacgga agaaacagcg aaagcgaatt tatgaccagc 961 ctttgttcat ttatgaactt cactttggtt cgtggaaaaa gaaagaggac ggcagttttt 1021 atacatatca agagatggca gaggagctaa tcccttatgt tctcgaacat gggtttactc 1081 atattgagct gctcccactc gtcgagcatc cgttcgatcg ttcttgggga tatcagggaa 1141 taggttatta ttcagcaaca agccgctacg gaacaccgca tgatttgatg tattttattg 1201 accgctgtca ccaagctgga ataggcgtca ttctcgattg ggttcctggc cacttttgta 1261 aagattccca tgggttatat atgtttgatg gcgcaccggc atatgaatat gccaacatgc 1321 aagaccggga aaattacgta tggggaacgg caaactttga ccttggcaag ccggaagtcc 1381 gcagcttttt gatttccaat gcgttatttt ggatggaata tttccatgtg gacgggtttc 1441 gtgtagatgc tgttgccaat atgttatatt ggccaaacag cgacgtacta tacaaaaata 1501 cgtatgccgt ggagttcttg caaaaattaa atgaaacggt attcgcctat gatccgaaca 1561 tattaatgat tgccgaagat tcgacagact ggccgcgcgt cactgctcca acatacgacg 1621 gaggattagg atttaactat aaatggaaca tgggatggat gaacgatatt ttaacttata 1681 tggaaacgcc gcctgaacat cgaaaatacg tgcacaataa agtaacattt tccctcttgt 1741 atgcgtattc ggaaaatttc attttacctt tttcccatga cgaggtcgta catggaaaaa 1801 aatcgctgtt aagtaaaatg ccggggacat atgaggaaaa gtttgcgcaa ttaaggttgc 1861 tgtatggata tttgttgacg catcctggta agaaattatt gtttatgggc ggcgaatttg 1921 gccagtttga tgaatggaaa gatttagagc agctggattg gatgcttttt gattttgata 1981 tgcatcggaa tatgaatatg tatgtgaaag aattgttgaa atgttataag cgctataaac 2041 cgctttatga gttagaccac tctccagatg gattcgagtg gattgatgtt cataacgccg 2101 aacaaagtat tttctcattc attcgcagag gaaaaaaaga ggatgatttg cttattgttg 2161 tgtgtaattt cacaaataaa gtataccacg gttataaagt tggtgttccg ttatttacaa 2221 gatatcggga agtaatcaat agcgatgcaa tccaattcgg cggctttggg aatatcaatc 2281 caaaaccgat tgcggcgatg gaagggccgt ttcacggaaa gccatatcat attcagatga 2341 cgatcccgcc gtttggcatt tctattttaa gaccagtaaa aaaaggtagc gtcaaaagtt 2401 ttatgaaaac tccacatccg ccatcccatg gagcatcgta aggcatcctt ggagccggat 2461 tcgcccttga ccaacacccg ccaaaggtgt gaaagggacg tcaagggcga cggggacaaa 2521 aaagagggca taggaaagcc gcccttgccc ttaccgaatt ttacctttga cgaggttcgg 2581 ttggtcaagg gttcgcttcg ccgaatccgg ctgttcttct gatccatggg ctccggcgga 2641 caaaaaagtt aggctgcctc ttgttggagg aaatcttgag ccatggcgat cagcttcgtc 2701 caccgggccg gcatatgggg cagatcggcg agctc // LOCUS HUMETMAGA 3343 bp ds-DNA PRI 25-JUL-1990 DEFINITION Human secreted epithelial tumor mucin antigen (H23Ag) gene, complete cds. ACCESSION M35093 KEYWORDS cell surface antigen; tumor mucin antigen. SOURCE Human breast tumor cell line MCF7 DNA, clone lambda-gtWES. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 3343) AUTHORS Tsarfaty,I., Hareuveni,M., Horev,J., Zaretsky,J., Weiss,M., Jeltsch,J.M., Garnier,J.M., Lathe,R., Keydar,I. and Wreschner,D.H. TITLE Isolation and characterization of an expressed hypervariable gene coding for a breast cancer associated antigen JOURNAL Gene (1990) In press STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by I.Tsarfaty, 12-JUN-1990. FEATURES from to/span description pept 785 842 secreted epithelial tumor mucin antigen precursor, exon 1 (H23Ag) 1342 2207 secreted epithelial tumor mucin antigen precursor, exon 2 (H23Ag) sigp 785 805 secreted epithelial tumor mucin antigen signal peptide matp 806 842 secreted epithelial tumor mucin antigen 1342 2207 secreted epithelial tumor mucin antigen pre-msg 777 > 842 H23Ag mRNA and introns IVS 843 1341 H23Ag intron A signal 384 397 H23Ag ERE signal 633 644 H23Ag CACCT motifs signal 689 692 H23Ag TATA box site 1063 1090 put. enhancer rpt 1670 1729 repeat unit BASE COUNT 679 a 986 c 981 g 697 t ORIGIN Chromosome 1q21-q24. 1 gagctcctgg ccagtggtgg agagtggcaa ggaaggaccc tagggttcat cggagcccag 61 gtttactccc ttaagtggaa atttcttccc ccactcccct ccttggcttt ctccaaggag 121 ggaaccccag gctgctggaa agtccggctg gggcggggac tgtgggtttc agggtagaac 181 tgcgtgtgga acgggacagg gagcggttag aagggtgggg ctattccggg aagtggtggt 241 ggggggaggg agcccaaaac tagcacctag tccactcatt atccagccct cttatttctc 301 ggccgcctct gcttcagtgg acccggggag ggcggggaag tggagtggga gacctagggg 361 tgggcttccc gaccttgctg tacaggacct cgacctagct ggctttgttc cccatcccca 421 gttagttgtt gccctgaggc taaaactaga gcccaggggc cccaagttcc agactgcccc 481 tcccccctcc cccggagcca gggagtggtt ggtgaaaggg ggaggccagc tggagaagaa 541 acgggtagtc aggggttgca gcattagagc ccttgtagcc ctagcccagg aatggttgga 601 gagagaagag tagagtaggg aggggggttt gtcacctgtc acctgctcgg ctgtgcctag 661 ggcgggcggg ggggagtggg gggaccggta taaagcggta ggcgcctgtg cccgctccac 721 ctctcaagca gccagcgcct gcctgaatct gttctgcccc ctccccaccc atttcaccac 781 caccatgaca ccgggcaccc agtctccttt cttcctgctg ctgctcctca cagtgcttac 841 aggtgagggg cacgaggtgg ggagtgggct gccctgctta ggtggtcttc gtggtctttc 901 tgtgggtttt gctccctggc agatggcacc agaagttaag gtaagaattg cagacagagg 961 ctgccctgtc tgtgccagaa ggagggagag gctaaggaca ggctgagaag agttgccccc 1021 aaccctgaga gtgggtacca ggggcaagca aatgtcctgt agagaagtct agggggaaga 1081 gagtagggag agggaaggct taagagggga agaaatgcag gggccatgag ccaaggccta 1141 tgggcagaga gaaggaggct gctgcaggaa ggaggcggcc aacccagggg ttactgaggc 1201 tgcccactcc ccagtcctcc tggtattatt tctctggtgg ccaggcttat attttcttct 1261 tgctcttatt tttccttcat aaagacccaa ccctatgact ttaacttctt acagctacca 1321 cagcccctgg gcccgcaaca gttgttacag gttctggtca tgcaagctct accccaggtg 1381 gagaaaagga gacttcggct acccagagaa gttcagtgcc cagctctact gagaagaatg 1441 ctgtgagtat gaccagcagc gtactctcca gccacagccc cggttcaggc tcctccacca 1501 ctcagggaca ggatgtcact ctggccccgg ccacggaacc agcttcaggt tcagctgcca 1561 cctggggaca ggatgtcacc tcggtcccag tcaccaggcc agccctgggc tccaccaccc 1621 cgccagccca cgatgtcacc tcagccccgg acaacaagcc agccccgggc tccaccgccc 1681 ccccagccca gggtgtcacc tcggccccgg agaccaggcc gcccccgggc tccaccgccc 1741 ccccagccca tggtgtcacc tcggcgccgg acaacaggcc cgccttggcg tccaccgccc 1801 ctccagtcca caatgtcacc tcggcctcag gctctgcatc aggctcagct tctactctgg 1861 tgcacaacgg cacctctgcc agggctacca caaccccagc cagcaagagc actccattct 1921 caattcccag ccaccactct gatactccta ccacccttgc cagccatagc accaagactg 1981 atgccagtag cactcaccat agcacggtac ctcctctcac ctcctccaat cacagcactt 2041 ctccccagtt gtctactggg gtctctttct ttttcctgtc ttttcacatt tcaaacctcc 2101 agtttaattc ctctctggaa gatcccagca ccgactacta ccaagagctg cagagagaca 2161 tttctgaaat ggtgagtatc ggcctttcct tccccatgct cccctgaagc agccatcaga 2221 actgtccaca ccctttgcat caagcctgag tcctttccct ctcaccccag tttttgcaga 2281 tttataaaca agggggtttt ctgggcctct ccaatattaa gttcaggtac agttctgggt 2341 gtggacccag tgtggtggtt ggaggggtgg gtggtggtca tgagccgtag ggagggactg 2401 gtgcacttaa ggttggggga agagtgctga gccagagctg ggacccgtgg ctgaagtgcc 2461 catttccctg tgaccaggcc aggatctgtg gtggtacaat tgactctggc cttccgagaa 2521 ggtaccatca atgtccacga cgtggagaca cagttcaatc agtataaaac ggaagcagcc 2581 tctcgatata acctgacgat ctcaagacgt cagcggtgag gctacttccc tgctgcagcc 2641 agcaccatgc cggggcccct ctccttccag tgtctgggtc cccgctcttt ccttagtgct 2701 ggcagcggga ggggcgcctc ctctgggaga ctgccctgac cactgctttt ccttttagtg 2761 agtgatgtgc catttccttt ctctgaccag tctggggctg gggtgccagg ctggggcatc 2821 gcgctgctgg tgctggtctg tgttctggtt gcgctggcca ttgtctatct cattgccttg 2881 gtgagtgcag tccctggccc tgatcagagc cccccggtag aaggcactcc atggcctgcc 2941 ataacctcct atctccccag gctgtctgtc agtgccgccg aaagaactac gggcagctgg 3001 acatctttcc agcccgggat acctaccatc ctatgagcga gtaccccacc taccacaccc 3061 atgggcgcta tgtgccccta gcagtaccga tcgtagcccc tatgagaagg tgagattggg 3121 ccccacaggc aggggaagca gagggtttgg ctgggcaagg attctgaagg gggtacttgg 3181 aaaacccaaa gagcttggaa gaggtgagaa gtggcgtgaa gtgagcaggg gagggctggc 3241 aaggatgagg ggcagaggtc agaggagttt tgggggacag gcctgggagg agactatgga 3301 agaaaggggc ccctcaaaag ggagtgcccc actgccagaa ttc // LOCUS MPMVPIA 1155 bp ds-DNA VRL 25-JUL-1990 DEFINITION Mouse polyomavirus major structural protein (VP1) gene, complete cds. ACCESSION M34958 KEYWORDS major structural protein. SOURCE Mouse polyomavirus (strain RA) DNA. ORGANISM Mouse polyomavirus Viridae; ds-DNA nonenveloped viruses; Papovaviridae; Polyomaviruses. REFERENCE 1 (bases 1 to 1155) AUTHORS Freund,R., Garcea,R.L., Sahli,R. and Benjamin,T.L. TITLE A specific amino acid substitution in polyoma virus VP1 correlates with plaque size and hemagglutination behavior JOURNAL Unpublished (1990) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by R.Freund, 08-JUN-1990. Author address: R.Freund Bldg C2 RM 129A Dept of Pathology Harvard Medical School 200 Longwood Avenue Boston, MA 02115 FEATURES from to/span description pept 1 1155 VPI protein (VPI) BASE COUNT 367 a 274 c 285 g 229 t ORIGIN 1 atggccccca aaagaaaaag cggcgtctct aaatgcgaga caaaatgtac aaaggcctgt 61 ccaagacccg cacccgttcc caaactgctt attaaagggg gtatggaggt gctggacctt 121 gtgacagggc cagacagtgt gacagaaata gaagcttttc tgaaccccag aatggggcag 181 ccacccaccc ctgaaagcct aacagaggga gggcaatact atggttggag cagagggatt 241 aatttggcta catcagatac agaggattcc ccaggaaata atacacttcc cacatggagt 301 atggcaaagc tccagcttcc catgctcaat gaggacctca cctgtgacac cctacaaatg 361 tgggaggcag tctcagtgaa aaccgaggtg gtgggctctg gctcactgtt agatgtgcat 421 gggttcaaca aacccacaga tacagtaaac acaaaaggaa tttccactcc agtggaaggc 481 agccaatatc atgtgtttgc tgtgggcggg gaaccgcttg acctccaggg acttgtgaca 541 gatgccagaa caaaatacaa ggaagaaggg gtagtaacaa tcaaaacaat cacaaagaag 601 gacatggtca acaaagacca agtcctgaat ccaattagca aggccaagct ggataaggac 661 ggaatgtatc cagttgaaat ctggcatcca gatccagcaa aaaatgagaa cacaaggtac 721 tttggcaatt acactggagg cacaacaact ccacccgtcc tgcagttcac aaacaccctg 781 acaactgtgc tcctagatga aaatggagtt gggcccctct gtaaaggaga gggcctatac 841 ctctcctgtg tagatataat gggctggaga gttacaagaa actatgatgt ccatcactgg 901 agagggcttc ccagatattt caaaatcacc ctgagaaaaa gatgggtcaa aaatccctat 961 cccatggcct ccctcataag ttcccttttc aacaacatgc tcccccaagt gcagggccaa 1021 cccatggaag gggagaacac ccaggtagag gaggttagag tgtatgatgg gactgaacct 1081 gtaccggggg accctgatat gacgcgctat gttgaccgct ttggaaaaac aaagactgta 1141 tttcctggaa attaa // LOCUS MYCP115A 3082 bp ss-mRNA BCT 25-JUL-1990 DEFINITION M.hyorhinis 115 kDa protein (p115) gene, complete cds. ACCESSION M34956 KEYWORDS . SOURCE M.hyorhinis (strain GDL) DNA, clone MhrG27. ORGANISM Mycoplasma hyorhinis Prokaryota; Bacteria; Tenericutes; Mollicutes; Mycoplasmas; Mycoplasmatales; Mycoplasmataceae. REFERENCE 1 (bases 1 to 3082) AUTHORS Notarnicola,S.M., McIntoch,M.A. and Wise,K.S. JOURNAL Unpublished (1990) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by K.S.Wise, 08-JUN-1990. University of Missouri-Columbia Dept of Mol Microbiol and Immunol School of Medicine-M653 Columbia, MO 65212 FEATURES from to/span description pept 70 3009 115 kDa protein BASE COUNT 1324 a 410 c 443 g 905 t ORIGIN 1 gaattctttt ttaataattt ttttacttta aaattctagt taaaactcta caaaaaaaca 61 aggacaaata tgttaaagct tattaaaatt gaaatcgaag gttttaaatc gttcgccgat 121 ccgatcagca taaatttcga tggttctgtt gtaggaatag ttggaccaaa tggttcagga 181 aaatctaata ttaatgacgc aattagatga gtattaggtg aacaatcagc aaaacaactt 241 cgtggactaa atatggatga tgttatcttt gcaggttcca aaactgtcaa acctcaagaa 301 aaagcaatgg taaaattaac cttcaaaaat gaagatgcaa ttgaagaaac gaaacaaatt 361 tttactattt ctcgtttact taaaagaggt caaggaacta atgaatattt ttacaatgat 421 caacctgtta gatataaaga tattaaaaat ttagctgttg aatctggaat ttctaaatct 481 tcacttgcaa ttatttccca aggtactata tctgaaattg cagaagcaac gcctgaacaa 541 agaaaagcag ttattgaaga agctgctgga acttcaaaat acaaattaga caaagaagaa 601 gcacaaaaga aacttattag aacaaatgat gcaattgata aattacaagg tgcaatcaaa 661 gagttagaac gtcaagtaaa ctcgcttgat aaacaagctt ctaaagcaaa aatttattta 721 gaaaaaagta aagctcttga atcagttgaa gtaggtttaa ttgttaatga tctaaacttt 781 ttcaatgaaa aattaaataa tttaaatact tcactattag aagtagaaca acaaagaaat 841 gatcttgaac tcaacattca aacttatgaa tccagtattt cacaaactgt tcattttaaa 901 acagaagttg aatcttcaat ccaagaaatt acttcaaaat tagacaattt aaaaaacgca 961 ctttccgaaa tcaaccttca agaagctaga attgaagaac gtagaaaatt aattatcagt 1021 ggtgaaattg tagttgatca aaaaacaaaa attgaagaaa ttaaaaaaca agttgaatca 1081 ctcaaaatac aaataaatgc ttcaaaacaa agagaaattg aactagacca acaacttaca 1141 agactaaatg caaaagctaa ttctttaaaa ttgcaagaaa atgatattaa taaagaaatt 1201 ggtgtattac ttgaaaaaaa atcagctgct gcagcaaata ttaatatatt aaaacaacaa 1261 tttgaaaata aaagttttct ttctaaagga attaaaacta ttaaagataa ctcattttta 1321 tttgatggtt acattggatt agcttctgaa ttatttaaag tagaatccga atttagttta 1381 gcaattgaaa ctgttttagg tgctgcttta aatcaaatag taatgaaaac atctgaagat 1441 gtacttcaag ctattgactt tttaaagaaa aatctttcag gtaaagcaac ttttattcct 1501 ttaacatcta ttaaagaaag agaagtaaga gaagatcatt tacttgtttt aaaaggacaa 1561 aaaggatttt taggtgttgc aaaagaacta attgaatttg atactcaatt taacaaactc 1621 tttggatttt tacttggaaa catcttagtg gttgataatg tagacaatgc aaatagaata 1681 gctaaaatat tagatcataa atacactata gtttctttag aaggtgattt attcagacca 1741 ggcggaacca ttactggagg ttcaaaacta gaaagaactt ctattttaaa ttacgatatc 1801 aaaataaaag aacacacaaa tacacttaaa tttgctgaag atcaaattca tgatttaaaa 1861 attaaacagc aaacaatata taacgaaatt gaaacagtca attcaacaat ccaacaagta 1921 aaaattgaag ctaattcaat aaattcaaaa cttaatatct taaacgaaga attaaataac 1981 ttaaaactaa acgcaagcga aattttcaaa gaacaacaag aagaccaaga gagtttaaat 2041 ttaagttttg attctgaaaa attgaacata gaaaaacaaa tttctactct aacaattgaa 2101 ttaaattcta aaaaagatcg actaacaaat ttaattagtg agcaaggaaa aggagaaacc 2161 aagaaacaag aattagatgc caaactaaga aaattaaaca ctcaacactc agatagtatc 2221 actgaacaaa acagagcaaa attcttggta gagcaaaatc aaaaaagact ttctgagcac 2281 tacaaattaa ctttagaagc tgctagtgaa caatattctt tagatttaga cattgaacaa 2341 gcaagacatt ttgttgatag ccttaaaaaa gagttaaaag aattaggaaa cgttaattta 2401 gaagcaatta ctgaatttga agaagtaaat caacgttacc aagagaaaaa acaatacatc 2461 gaagaactaa ccactgctaa atccaaaatt gaagaagcaa tttctgattt agataaaatt 2521 attatcaata aaacaacaga aattgttaac ttagtaaata atgaatttaa tatggtattt 2581 caaaaaatgt ttggtggtgg aaaagcagaa attcacttca cagacaaaaa tgatatttta 2641 aattctggtg ttgaaatatc tgcacaacca cctggtaaaa caattaaaaa cttacgactt 2701 ttttcaggtg gagaaaaagc tattattgca atttcacttc tttttgctat tttaaaagca 2761 agaccaattc cattgtgtat tttagacgaa gttgaagctg cacttgatga atctaatgtt 2821 attcgttatg tagaattttt aaaattacta aaagaaaata ctcaattctt aattattact 2881 caccgttcag gaacaatgtc aagagtagat cagttacttg gagttactat gcaaaaacgt 2941 ggagttactt ccattttctc agttgaacta agcaaagcaa aagagatgct aaaagacgaa 3001 ttaaaataat acaaataaaa ataaaaaaaa cagaagtttg aagtgaggtg ataccctttt 3061 cttgaaaaaa ttttttgagt gt // LOCUS PPHVLCRA 314 bp ds-DNA VRL 25-JUL-1990 DEFINITION Human papillomavirus type 6 long control region DNA. ACCESSION M35091 KEYWORDS . SOURCE Human papillomavirus type 6 (patient specimen X020) DNA. ORGANISM Human papillomavirus Viridae; ds-DNA nonenveloped viruses; Papovaviridae; Papillomavirus. REFERENCE 1 (bases 1 to 314) AUTHORS Hrisomalos,T.F., Boggs,D.L. and Fife,K.H. TITLE The human papillomavirus type 6 long control region and human cellular DNA contain related sequences JOURNAL Unpublished (1990) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by K.H.Fife, 12-JUN-1990. AUTHOR address: K.H.Fife Emerson Hall 435 Indiana University School of Medicine 545 Barnhill Dr. Indianapolis, IN 46202-5124 FEATURES from to/span description pept < 1 21 L1 open reading frame (AA at 1) signal 231 236 polyA signal site 81 175 insert (as compared to prototype sequence) site 245 259 insert (as compared to prototype sequence) BASE COUNT 75 a 29 c 72 g 138 t ORIGIN Mapped between nucleotides 7271 to 7476. 1 cgcgccaaaa ccaaaaggta atatatgtgt atatgtactg ttatatatat gtgtgtatgt 61 actgttatgt atatgtgttt atgtactgtt atatgtatgt gtgttgtata tatgtgtgta 121 tatatgtgta tgtgtgtata tgtatatgta tgtgttgtgt atatatatgt gtgtgtgtgt 181 tatgtgtgta atgtaattta tttgtgtaat gtgtatgtgt gtttatgtgc aataaacaat 241 taactacatt attgtatatc ttgttacacc ctgtgactca gtggctgttg cacgcgtttt 301 ggtttgcacg cgcc // LOCUS PPHVLCRB 300 bp ds-DNA VRL 25-JUL-1990 DEFINITION Human papillomavirus type 6 long control region DNA. ACCESSION M35092 KEYWORDS . SOURCE Human papillomavirus type 6 (patient specimen X019) DNA. ORGANISM Human papillomavirus Viridae; ds-DNA nonenveloped viruses; Papovaviridae; Papillomavirus. REFERENCE 1 (bases 1 to 300) AUTHORS Hrisomalos,T.F., Boggs,D.L. and Fife,K.H. TITLE The human papillomavirus type 6 long control region and human cellular DNA contain related sequences JOURNAL Unpublished (1990) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by K.H.Fife, 12-JUN-1990. Emerson Hall 435 Indiana Univ School of Medicine 545 Barnhill Dr. Indianapolis, IN 46202-5124 FEATURES from to/span description pept < 1 21 L1 open reading frame (AA at 1) signal 231 236 polyA signal site 81 175 insert (as compared to prototype sequence) BASE COUNT 68 a 29 c 72 g 131 t ORIGIN Mapped between nucleotides 7271 to 7476. 1 cgcgccaaaa ctaaaaggta atatatgtgt atatgtactg ttatatatat gtgtgtatgt 61 actgttatgt atatgtgtgt atgtactgtt atatgtatgt gtgttgtata tatgtgtgta 121 tatatgtgta tgtgtgtata tgtatatgta tgtgttgtgt atatatatgt gtgtgtgtgt 181 tctgtgtgta atgtaattta tttgtgtaat gtgtatgtgt gtttatgtgc aataaacaat 241 tacctcttgt tacaccctgt gactcagtgg ctgttgcacg cgttttggtt tgcacgcgcc // LOCUS TRHTCSA 1010 bp ss-mRNA PLN 25-JUL-1990 DEFINITION T.kirilowii trichosanthin (TCS) mRNA, complete cds. ACCESSION M34858 KEYWORDS ribosome inactivating protein; trichosanthin. SOURCE T.kirilowii maximowicz, cDNA to mRNA. ORGANISM Trichosanthes kirilowii Eukaryota; Plantae; Embryobionta; Magnoliophyta; Magnoliopsida; Dilleniidae; Violales; Cucurbitaceae. REFERENCE 1 (bases 1 to 1010) AUTHORS Shaw,P.-C., Yung,M.-H., Zhu,R.-H., Ho,W.K.-K., Ng,T.-B. and Yeung,H.-W. TITLE Molecular cloning of trichosanthin cDNA and its expression in Escherichia coli JOURNAL Unpublished (1990) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by P.-C.Shaw, 06-JUN-1990. Author address: P.-C.Shaw Department of Biochemistry Chinese University of Hong Kong Shatin, NT, HONG KONG FEATURES from to/span description pept 10 879 trichosanthin precursor sigp 10 78 trichosanthin signal peptide matp 79 819 trichosanthin variant 196 196 t in wild type; a in allele variant 197 197 c in wild type; g in allele variant 468 468 t in wild type; a in allele BASE COUNT 290 a 218 c 205 g 297 t ORIGIN 1 gtcaaaaaga tgatcagatt cttagtcctc tctttgctaa ttctcaccct cttcctaaca 61 actcctgctg tggagggcga tgttagcttc cgtttatcag gtgcaacaag cagttcctat 121 ggagttttca tttcaaatct gagaaaagct cttccaaatg aaaggaaact gtacgatatc 181 cctctgttac gttcctctct tccaggttct caacgctacg cattgatcca tctcacaaat 241 tacgccgatg aaaccatttc agtggccata gacgtaacga acgtctatat tatgggatat 301 cgcgctggcg atacatccta ttttttcaac gaggcttctg caacagaagc tgcaaaatat 361 gtattcaaag acgctatgcg aaaagttacg cttccatatt ctggcaatta cgaaaggctt 421 caaactgctg caggcaaaat aagggaaaat attccgcttg gactccctgc tttggacagt 481 gccattacca ctttgtttta ctacaacgcc aattctgctg cgtcggcact tatggtactc 541 attcagtcga cgtctgaggc tgcgaggtat aaatttattg agcaacaaat tgggaagcgt 601 gttgacaaaa ccttcctacc aagtttagca attataagtt tggaaaatag ttggtctgct 661 ctctccaagc aaattcagat agcgagtact aataatggac agtttgaaag tcctgttgtg 721 cttataaatg ctcaaaacca acgagtcacg ataaccaatg ttgatgctgg agttgtaacc 781 tccaacatcg cgttgctgct gaatagaaac aatatggcag ccatggatga cgatgttcct 841 atgacacaga gctttggatg tggaagttat gctatttagt gtaacttcaa gctacgtacg 901 agtacaaact cccacttgaa gaatctatta tcgtttgaga gtttaatcta cttgtagaaa 961 taataaagca tgttcgtgtg accgacctac gtggatgctc tgtatgtgtg // LOCUS CIBABI 1989 bp ds-DNA BCT 25-JUL-1990 DEFINITION Plasmid ColIB abortive infection protein (abi) gene, complete cds. ACCESSION J03314 KEYWORDS abortive infection protein. SOURCE Plasmid ColIB DNA, clone pTP64, isolated from E.coli K-12 strain W3110. ORGANISM Plasmid Colicin Ib Prokaryota; Bacteria. REFERENCE 1 (bases 1 to 1989) AUTHORS Gupta,S.K. and McCorquodale,D.J. TITLE Nucleotide sequence of a DNA fragment that contains the Abi gene of the ColIb plasmid JOURNAL Plasmid 20, 194-206 (1988) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by D.J.McCorquodale, 21-NOV-1988. FEATURES from to/span description pept 1306 1575 abortive infection protein binding 1028 1050 LexA binding site binding 1100 1121 LexA binding site binding 1215 1234 LexA binding site binding 1232 1252 LexA binding site BASE COUNT 476 a 533 c 531 g 449 t ORIGIN 5bp upstream of PstI site. 1 ctgcaggtcc gtgccgacca ggtgcttaag gggtggaaaa atatcccgcg cgggatctcc 61 ctgaccttct ccctgtttgc cgagatcgcc ggccgggaca gggaaaccat cgaccaggcc 121 tggaaaaata tcttctactc gcaactgagg gaaaaaaaac accgctttta ccaaagatat 181 cgaggccatc cgcgccctga aaaaactgcc tgccctcacc ggcgacagct ggcgcgggat 241 ggcatcacgg tgcgtatcta ccgcccggaa aattacgccc gcggcgatgg cggcttacac 301 tgagcctccc ggaaaattac gccacccaga tgtggaacat cccgttcccg gagcttgaat 361 accgcctctt taccgccgat ccgggctaca gcgccctgat cagcgccgaa cccgacaggt 421 gggacaaggc cttccgtttt gtggacgggg tgtgcgagct tcacctttac accaacggtg 481 tggaaggaag atcacaatcc caccccgctc ggggatgtcg ctcaggcgct gatcaacgtg 541 gtggaagaaa acctgctgta acggacccgg atgctgcggg cacaactgca tcatcaggag 601 gatgcaatga aaggacgaca gagccgctat gttaccggcg gagagagttt cgcggagatt 661 gcccgtctcc cttcaggggc ggtggtgagg ctctgtctga acaccggtct tgaggatgcg 721 ctgcgggagg cctccaaatc gctcaagtca gccttcaccc gttccgggcg aaaatgccgg 781 ctgtcagcgg gtacggcgca ggggccgttt accggacgcc ggcaggcgtg gccacacatc 841 tcttcgtctc ggtactctga gggggcaggg ggcaaaaaaa gtaaaaatgt attcgccagg 901 ttgcccggag gtgaaggaaa atagacatac agcagaacga cggatagcac tttttgctaa 961 atggacatca gtattactat gctatagttg ctttaatgga taagtgcgcc ttgacaaagg 1021 cggtgatttc tgttaacatt actctcatag tattgttccg tcccgctcca ccccaacaag 1081 atccgtttat ttcccgccag actggttatc accattcagg cccggatttt tttggatttt 1141 tttccgggga gcccccggac gagcttaaaa tcggtatgac aaacaggagg atgcgaatga 1201 acacatcata acagagctga aagataaaac attctgtacg gcattaacag cgttcacgtg 1261 tgtgaggcgc cgggtgcctt ttgacttaaa aacgaggtta ttgagatgac caaaatcaag 1321 acagttactt ttgtaaatac ttacccggga gggtctatga aaaacttgtt agacaccgag 1381 ggaacggttc tattcccatt ccagactgaa atccatttta tttggacgat tttctccacc 1441 gttaaacgcc tggttatcgg aaccagggac catatttgcc agaagcaata ctggagcgcc 1501 tgtctctgta ttttgcttct tatggcctat gtgggtctct gtgctgcggt ggtctggttt 1561 gtagtgccct gctgaaggcc tttatagtgt cgaaatttgc ggtttcggca ctatgggtca 1621 cgccagtaaa gcgcggacta ctctggggta tcggtaaagt ggttaccgcc acttgccgaa 1681 gatttactct gctaaagtaa gtagccgcaa cgctacacga actgatggtg aatgtcaaca 1741 gatactcacc atctccttac ggcggtggtc cctgtgacca ctggcctttc gcgtgggtgc 1801 aacacggcaa aactcctctg tacaacaggc tcccgccgtc attttccggc acaggtgagg 1861 ccggaattcg gactaaaacg taaaccgcgg gccagtccgg tagcgttcac tatcggccag 1921 cattctctca accagagaga aatccttttc accgcagaac acgtacgtct ccgcgaactc 1981 cacctgcag // LOCUS HUMET3 2223 bp ss-mRNA PRI 25-JUL-1990 DEFINITION Human endothelin 3 (EDN3) mRNA, complete cds. ACCESSION J05081 KEYWORDS endothelin. SOURCE Human adult hypothalamus, cDNA to mRNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 2223) AUTHORS Bloch,K.D., Eddy,R.L., Shows,T.B. and Quertermous,T. TITLE cDNA cloning and chromosomal assignment of the gene encoding endothelin 3 JOURNAL J. Biol. Chem. 264, 18156-18161 (1989) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly provided by K.D.Bloch, 06-OCT-1989. FEATURES from to/span description pept 194 910 endothelin 3 precursor /hgml_locus_uid="LU0066V" /map="unassigned" /nomgen="EDN1" sigp 194 268 endothelin 3 signal peptide (put.) matp 484 544 endothelin 3 matp 670 712 endothelin-like protein BASE COUNT 575 a 535 c 583 g 530 t ORIGIN 1 cgggtagcgc gctctgaaag tttatgaccg ccgcagccaa ctcctggccg gagctggaga 61 cgcagcgagc gatcggccgg cctcgaaccc ccacagctgg agggcgaggc cagctgtacc 121 cggccccagt gccctttcgc ggccacaagc ggccgtcctc ctggtccggt gctccggcgc 181 ctgatctagg ttcatggagc cggggctgtg gctccttttc gggctcacag tgacctccgc 241 cgcaggattc gtgccttgct cccagtctgg ggatgctggc aggcgcggcg tgtcccaggc 301 ccccactgca gccagatctg agggggactg tgaagagact gtggctggcc ctggcgagga 361 gactgtggct ggccctggcg aggggactgt ggccccgaca gcactgcagg gtccaagccc 421 tggaagccct gggcaggagc aggcggccga gggggcccct gagcaccacc gatccaggcg 481 ctgcacgtgc ttcacctaca aggacaagga gtgtgtctac tattgccacc tggacatcat 541 ttggatcaac actcccgaac agacggtgcc ctatggactg tccaactaca gaggaagctt 601 ccggggcaag aggtctgcgg ggccacttcc agggaatctg cagctctcac atcggccaca 661 cttgcgctgc gcttgtgtgg ggagatatga caaggcctgc ctgcactttt gcacccaaac 721 tctggacgtc agcagtaatt caaggacggc agaaaaaaca gacaaagaag aggaagggaa 781 ggttgaagtc aaggaccaac aaagcaagca ggctttagac ctccaccatc caaagctcat 841 gcccggcagt ggactcgccc tcgctccatc tacctgcccc cgctgcctct ttcaggaagg 901 agccccttag gaggacaggc ctgcagctcc aatttcatgc aggaaattgg ttttggagag 961 ttttggcaag ttggaaagcc acttactggc ttttgacatg acttctcttg gagaataagt 1021 ggactccaag ctaactcttt gcaaatgtaa acacatgtcc atcttgttaa taaatgcaaa 1081 atgcccgtgc agcagaagca tgcgactttc atatccttgc ctagaatagg ctgcatggtg 1141 tatgtcagtg agggccacga ggcgtcggct ttagacacag atcatagctc tacaggagtt 1201 tatgaatttg aagcttatgg gattttggca gagaaatttt cagctgtgct tgatacccac 1261 caaaagaatg tatctcgaaa gaatgaagga agaagaaaaa aggatccttg atgtttgtga 1321 caagaaaatg agaaagttag tatctgcaat acagagcttg ttcctgttca gtgactgacc 1381 ctctgtattc tgtatagaca ccaggccgat acacagtgga gttcccaggc cttgtttgca 1441 ggaagccgac tgtaaagaca gccccagctc aaggctatta ggttgaatat ttgctttcat 1501 gagtaaatgt ggatctttgg ggaatggctt caaaataagt cacgaacaca aattctttgt 1561 aaattatgta aattcctgtt tatataaatt ggcaacaact tataccgtct gacagttcaa 1621 aatctctttc agctgcgctc ttcccaccga gccgagctta ctgtgagtgt ggagatgtta 1681 tcccaccatg taaagtcgcc tgcgcagggg agggctgccc atctccccaa cccagtcaca 1741 gagagatagg aaacggcatt tgagtgggtg tccagggccc cgtagagaga catttaagat 1801 ggtgtatgac agagcattgg ccttgaccaa atgttaaatc ctctgtgtgt atttcataag 1861 ttattacagg tataaaagtg atgacctatc atgaggaaat gaaagtggct gatttgctgg 1921 taggattttg tacagtttag agaagcgatt atttattgtg aaactgttct ccactccaac 1981 tcctttatgt ggatctgttc aaagtagtca ctgtatatac gtatagagag gtagataggt 2041 aggtagattt taaattgcat tctgaataca aactcatact ccttagagct tgaattacat 2101 ttttaaaatg catatgtgct gtttggcacc gtggcaagat ggtatcagag agaaacccat 2161 caattgctca aatactcaga aagtactgtc aaaagcctaa taaaaaacct aaagtttgct 2221 ctg // LOCUS HUMSATAA 293 bp ds-DNA PRI 25-JUL-1990 DEFINITION Human alpha satellite DNA, clone pC1.8. ACCESSION M26918 J04744 KEYWORDS alpha satellite DNA; satellite DNA. SOURCE Human (cell line HHW423) DNA, clone pC1.8. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 293) AUTHORS Baldini,A., Smith,D.I., Rocchi,M., Miller,O.J. and Miller,D.A. TITLE A human alphoid DNA clone from the EcoRI dimeric family: Genomic and internal organization and chromosomal assignment JOURNAL Genomics 5, 822-828 (1989) STANDARD full staff_review COMMENT Draft entry and computer-readable copy of sequence [1] kindly submitted by A.Baldini, 08-AUG-1989. FEATURES from to/span description rpt 1 293 alpha-satellite BASE COUNT 84 a 55 c 63 g 91 t ORIGIN Chromosomes 1, 5, and 19; centromere. 1 gatcctttac acagagcaga cttgaaacac tctttttgtg gaatttgcag tggagatttc 61 aagcgctttg aggccaatgg cagaaaagga aatacttcga tataaaaact agacagaatc 121 attctcagaa actgctctgc gatgtgtcgg ttcaactctc agagtttaac ttttcttttc 181 attcagcagt ttggaaacac tctgtttgta aagtctgcaa cgtggatatt tgaccactta 241 gaggccttcg ttggaaacgg gtttttttcc tgtaaggcta gacagaagaa ttc // LOCUS HUMSATAB 344 bp ds-DNA PRI 25-JUL-1990 DEFINITION Human alpha satellite DNA, clone pC1.8. ACCESSION M26919 J04744 KEYWORDS alpha satellite DNA; satellite DNA. SOURCE Human (cell line HHW423) DNA, pC1.8. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 344) AUTHORS Baldini,A., Smith,D.I., Rocchi,M., Miller,O.J. and Miller,D.A. TITLE A human alphoid DNA clone from the EcoRI dimeric family: Genomic and internal organization and chromosomal assignment JOURNAL Genomics 5, 822-828 (1989) STANDARD full staff_review COMMENT Draft entry and computer-readable copy of sequence [1] kindly submitted by A.Baldini, 08-AUG-1989. FEATURES from to/span description rpt 1 344 alpha-satellite BASE COUNT 98 a 67 c 68 g 111 t ORIGIN Chromosomes 1, 5 and 9; centromere. 1 gaattcccag tagcttcctt gtgttgtgaa cattcaactc acagagttga acgttccctt 61 agacagagca gatttgaaca ctctttttgt gcaattggca agtggagatt tcaagcgctt 121 taaggtcaat ggcagaaaag gaaatatctt cgtttcaaaa ctagacagaa tcattcccac 181 aaactgcgtt gtgatgtgtt cattcaactc acacagttta acctttcttt tcatagagca 241 gttaggaaac agtctgtttg taaattctct aagtggatat tctgacatct tgtggccttc 301 gttggaaacg ggatttcttc atattctgct agacagaaga attc // LOCUS HUMSATAC 1049 bp ds-DNA PRI 25-JUL-1990 DEFINITION Human alpha satellite DNA, clone pC1.8. ACCESSION M26920 J04744 KEYWORDS alpha satellite DNA; satellite DNA. SOURCE Human (cell line HHW423) DNA, clone pC1.8. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 1049) AUTHORS Baldini,A., Smith,D.I., Rocchi,M., Miller,O.J. and Miller,D.A. TITLE A human alphoid DNA clone from the EcoRI dimeric family: Genomic and internal organization and chromosomal assignment JOURNAL Genomics 5, 822-828 (1989) STANDARD full staff_review COMMENT Draft entry and computer-readable copy of sequence [1] kindly submitted by A.Baldini 08-AUG-1989. FEATURES from to/span description rpt 1 1049 alpha-satellite BASE COUNT 295 a 198 c 221 g 335 t ORIGIN Chromosomes 1, 5 and 19; centromere. 1 aaattttctt ttcatacagc agagtttgga aacactctgt ttgtaaagtc tgcacgtgga 61 taagttgtcc acttagaggc attcgttgga aacgggtttt tttcatgtaa ggctacacag 121 aagaattccc agtaacttcc ttgtgttgtg tgtatcaact caaagagttg aacgatcctt 181 tacacagagc agacttctaa cactcttttt gtggaatttg caagtggaga tttcagccgc 241 tttgaagtca aaggtagaaa aggaaatatc ttcctataaa aactagacag aatgattctc 301 agaaactcct ttgtgatgtg tgcgttcaac tcacagagtt taacctttct tttcatagag 361 cagttaggaa acactctgtt tgtaaagtct gcaagtggat attcagacct ctttgaggcc 421 ttcgtggaac gggttttcat ataaggctag gcagagaatt cccagtaact tccttgtgtt 481 gtgtgtgtca actcacagag ttgactttca tttacacaga gcagacttga aacactcttt 541 ttgtaattgc aagtggagat ttcaagcgct ttgagcaagg ccgaaaagga aatatcttcg 601 tataaaaact agacagaatc attctcagaa actgctctgc gatgtgtgcg ttcaactctc 661 agagtttaac ttttcttttc atcagcagtt tggaaacact ctgtttgtaa agtctgcacg 721 tggatatttt gaccacttag aggccttcgt tggaaacggg tttttttcct gtaaggctag 781 acagaagaat tccctgtagc ttccttgtgt tgtgtacatt caacgcacag agttgaacgt 841 tcccttagac agagcagatt tgaaacactc tttttgtgca attggcaagt ggagatttca 901 ggcgctttaa ggtcaatggc agaaaaggaa atatcttcgt ttcaaaacta gacagaatca 961 ttcccacaaa ctgcgtggtg atgtgttcgt tcaactcaca gagtttaacc tttcctttca 1021 tagagcagtt aggaaacagt ctgtttttt // LOCUS PVYCPA 1122 bp ss-RNA VRL 25-JUL-1990 DEFINITION Potato virus Y coat protein gene, 3' end. ACCESSION M22470 KEYWORDS coat protein. SOURCE Potato virus Y (necrotic strain; isolate New Zealand; N-PVY), passed in Nicotiana tabacum cv. Burley 21, cDNA to viral RNA, clone PVYN 27. ORGANISM Potato virus Y Viridae; ss-RNA nonenveloped viruses; Rod-shaped ss-RNA viruses; Potyvirus. REFERENCE 1 (bases 1 to 1122) AUTHORS Hay,J.M., Fellowes,A.P. and Timmerman,G.M. TITLE Nucleotide sequence of the coat protein gene of a necrotic strain of potato virus Y from New Zealand JOURNAL Arch. Virol. 107, 111-122 (1989) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly provided by J.M.Hay, 09-FEB-1989. FEATURES from to/span description pept < 1 796 coat protein (AA at 2) BASE COUNT 351 a 196 c 263 g 312 t ORIGIN 6 bp upstream of TaqI site. 1 cacaatcgat gcaggaggaa gcactaaaaa ggatgcaaaa caagagcaag gtagcattca 61 accaaatttc aacaaggaaa aggaaaagga cgtgaatgtt ggaacatctg gaactcatac 121 tgtgccacga attaaagcta tcacgtccaa aatgagaatg cccaagagta aaggtgcaat 181 tgcattaaat ttggaacact tactcgagta tgctccacag caaattgaca tctcaaatac 241 tcgagcaact caatcacagt ttgatacgtg gtatgaagca gtacaacttg catacgacat 301 aggagaaact gaaatgccaa ctgtgatgaa tgggcttatg gtttggtgca ttgaaaatgg 361 aacctcgcca aacatcaacg gagtttgggt tatgatggat ggagatgaac aagtcgaata 421 cccactaaaa ccaatcgttg agaatgcaaa accaacactt aggcaaatca tggcacattt 481 ctcagatgtt gcagaagcgt atatagaaat gcgcaacaaa aaggaaccat atatgccacg 541 atatggttta gttcgtaatc tgcgcgatgg aagtttggct cgctatgctt ttgactttta 601 tgaagttaca tcacggacac cagtgagggc tagagaggca cacattcaaa tgaaggccgc 661 agctttaaaa tcagctcaat ctcgactttt cggattggat ggtggcatta gtacacaaga 721 ggaaaacaca gagaggcaca ccaccgagga tgtttctcca agtatgcata ctctacttgg 781 agtgaagaac atgtgattgt agtgtctttc cggacgatat atagatattt atgtttgcag 841 taagtatttt ggcttttcct gtactacttt tatcgaaatt aataatcgtt tgaatattac 901 tggcagatag gggtggtata gcgattccgt cgttgtagtg accttagctg tcgtttctgt 961 attattatgt ttgtataaaa gtgccgggtt gttgttgttg tggctgatct atcgattagt 1021 tgatgttgcg atttgtcgta gcagtgacta tgtctggatt tagttagttg ggtgatgctg 1081 tgattctgtc atagcagtga ctgtaaactt caatcaggag ac // LOCUS SRAAFPG 2420 bp ds-DNA VRT 25-JUL-1990 DEFINITION Sea raven (H.americanus) antifreeze protein type II gene, complete cds. ACCESSION J05100 KEYWORDS antifreeze protein. SOURCE Sea raven (adult) testes DNA, clone lambda SR7. ORGANISM Hemitripterus americanus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Osteichthyes; Actinopterygii; Scorpaeniformes; Cottoidei; Cottidae. REFERENCE 1 (bases 1 to 2420) AUTHORS Hayes,P.H., Scott,G.K., Ng,N.F.L., Hew,C.L. and Davies,P.L. TITLE Cystine-rich type II antifreeze protein precursor is initiated from the third AUG codon of its mRNA JOURNAL J. Biol. Chem. 264, 18761-18767 (1989) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by P.L.Davies, 19-OCT-1989. FEATURES from to/span description pept 434 494 antifreeze protein, exon 2 (first expressed exon) 1246 1382 antifreeze protein, exon 3 1488 1604 antifreeze protein, exon 4 1697 1805 antifreeze protein, exon 5 2045 2112 antifreeze protein, exon 6 pre-msg 226 2112 antifreeze protein mRNA and introns IVS 284 389 antifreeze protein intron A IVS 495 1245 antifreeze protein intron B IVS 1383 1487 antifreeze protein intron C IVS 1605 1696 antifreeze protein intron D IVS 1806 2044 antifreeze protein intron E rpt 74 94 repeat copy A rpt 95 114 repeat copy B rpt 115 135 repeat copy C signal 74 135 antifreeze protein regulatory sequence (put.) site 161 164 antifreeze protein CAAT box site 195 198 antifreeze protein TATA box signal 2368 2373 antifreeze protein polyA signal BASE COUNT 684 a 475 c 496 g 765 t ORIGIN 1 bp upstream of HindIII site. 1 aagcttcaga aattcactcc tttttctaat attaacttta aagccacagt gtgcgatttg 61 gagccctttg atttgttgtt ttcaaagttc aaactgttgt ttcaaaattc aaactgttgt 121 tttcaaagtt caaactgatg ccagtgtcca taataaaaat caatgtatga ataatattgt 181 gaaatgtaat tgactatata agagctggtc tttctctagt tcagcacatg aatgcagagg 241 caacaggctg acactgaaac aagagaagat atttctacag caggtttgct ctcagcctct 301 tcttcgtcct gccgagcccc acaggcactg tgctgccctg ctgtctttgt aattcattgc 361 aactcttgtg tttttctctt ctgatgcagg gctatcaatc atcttcatcg tctgcaccat 421 ctctaccacg aggatgctga ctgtgtctct actggtttgt gccatgatgg ctctgactca 481 agctaatgat gacagtgagt ctcagtctta cattctgtgt gtaggatact atactgtctg 541 taaatatatt caattgtaga cctattaaga tgctgtgaat attaatatta ggtaatattt 601 agtttattta tatatgtata tatatttgac agtaataaca aaaaactagg atagattgca 661 atccgacttt ttgttatctt tattgttaac aatattaaag acataattcc atagaattat 721 ataatttaca tagaaacagc aaatacaact gtcagagaaa gacttgacag ctaaagcagg 781 agagatcaag tgtagaaggg agatttgatc tcgtctcaac tgaagctaga actgaatgta 841 ctaacttatt tttggtgaaa caaccgaata attaattcat ttttccccca caaaactaaa 901 cgagacgcag accaagctaa gtgtgtgcta acagtaatca gcattcgttt agcaaagtat 961 tagtaactgc catcacagct tttgactcta gtggaattca tgaaatttgg cagaacaaag 1021 gagacctgtg cacatctgat tccaatgaga atacaatgtg cttcacagaa aagcacttca 1081 ccaatcctgt acacattcat aaagccacag aaaaaaagag agctgattaa tcgtcgttcc 1141 ctctgctctg acaataaaag gattataaac tccagatttc tgataaacag actcggtggc 1201 ttacctgtga tcagacatgt tacccactct tctgtttgtc ctcagaaata ctcaaaggca 1261 cggctacaga ggctggaccg gtctctcaga gagccggacc aaactgtccc gctggttggc 1321 aacctcttgg tgaccgctgt atctattatg agacaacagc gatgacttgg gctctggctg 1381 aggtagtcag gatatgatta tgattcagat tgcttctaaa ctggtctggt ggtattgcct 1441 tacatgctcg gttaattgag catgagcttg actcatttcc actgcagaca aactgtatga 1501 aattgggtgg acaccttgca tccatccaca gccaggagga gcatagtttc attcagacct 1561 tgaatgctgg tgttgtatgg atcggaggct ccgcttgcct ccaggtaaaa cattgcatta 1621 caatggtggc agaaagaaag gatttttatt acatgctatc ttactatacg tatattcttt 1681 cctttctgtt ttctaggcag gtgcttggac ctggtctgat ggtacaccta tgaattttcg 1741 ttcctggtgt tctaccaaac ctgatgatgt actggccgcg tgctgtatgc agatgactgc 1801 tgcaggtaaa tcacaacaca ttagagcata gtattaaatg actgaaggca gtagtgttgt 1861 ttagtacatt tggttcatct tgagatcaat actctcagaa tttcactttt gaatcacttt 1921 tgttcttcag ttcatgtgta gctttggcct cgttatccgt gtctttgtct gtctagtgat 1981 gaagacagtt tcaggttagg ttggtatggc gctgactcac ttcttgtgtt tttgatgttt 2041 acagctgacc aatgctggga tgacttgcct tgtccggcgt cccacaaatc agtctgcgcc 2101 atgacattct aagctaacac agaggccatc catcacacaa acactttagt gggtgtttga 2161 ttgtgtgtgt tcgcatactc atctgtgttc gtgtcaacag cctcatgctg aacctgaagg 2221 ttcaaaatct catatgacat ctttaattct ttgctattgt tggagctgcc tgaaaggatg 2281 agacgacaag agctggaaag catctgaggg attttaggaa gaaagtgaat ggttatgaaa 2341 atgatggtct ttttatgtat tatgtcaaat taaaaggctg acacgttgaa acaaactctt 2401 ctgtgagttt ggcagaattc // LOCUS YSCTFIIDA 2439 bp ds-DNA PLN 25-JUL-1990 DEFINITION Yeast (S.cerevisiae) TATA-binding protein (TFIID) gene, complete cds. ACCESSION M27135 KEYWORDS DNA binding protein; TATA-binding protein; transcription factor. SOURCE Yeast (S.cerevisiae, strain S288C) DNA. ORGANISM Saccharomyces cerevisiae Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes; Endomycetales; Saccharomycetaceae. REFERENCE 1 (bases 1 to 2439) AUTHORS Hahn,S., Buratowski,S., Sharp,P.A. and Guarente,L. TITLE Isolation of the gene encoding the yeast TATA binding protein TFIID: A gene identical to the SPT15 suppressor of Ty element insertions JOURNAL Cell 58, 1173-1181 (1989) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by S.Hahn, 10-AUG-1989. FEATURES from to/span description pept 1237 1959 TATA-binding protein BASE COUNT 737 a 481 c 472 g 749 t ORIGIN 1 bp upstream of EcoRI site; chromosome 5 right arm after TRP1. 1 gaattcgttc aagtggtccg taatattccc gtctttacaa agctggatta ccatctctaa 61 tgccaacttc catgcatata gctcaggccc caccgtgtgc agctccgtgc ttcgcagctc 121 ctgcagagca tcctcgggga ttgggaacct ctcatttagc aagtaattca cataacacag 181 atttagaaac catttccatt gtgacttttc ccgacattgc gagagtagcc catgaaaact 241 cgtcttcacc ctgcggtgct gtttcagctt aatgcaaagc atcacgccga catactggaa 301 tacggatgcc caattttgat acaactcatc ctgcaaattt accatgtact ggactaattc 361 attgcaattt cttagtgcaa tcttatagtg gaacttactg tctctcataa gtggcaagtc 421 atgtaacagc agaaactcgc aacgcatgat ctcttctacc aaatctgtgt cgctctggtg 481 cgtttgtaac cgttctttca aactggaaat gtaaagctct gctaggtcaa aattatacgt 541 ctcctgtatc aataactcca ccatctcaaa cgtgacctta ctatcctcca gaactgaaag 601 cgtacatttc gttttcaata gctgaaacat ctggatagac atgttcatga ggccataata 661 ctgcttcaac ccttcctcag aaccgatttt attcgcaatt gatatgcatg gtctctgtat 721 tcctgtgcta agtggtatac ttgtgaaata ctaagtttgt cgccaagatt ttccatgaat 781 ttgtacttct ttcgaaatcg ttcaatttct accaatactg attcccctct gatagctgag 841 atgtcgggat tccctttgct gatagatcta actcatctct ttacgtattt taattgtgaa 901 gccgtaaata gttatcttcc aagtttctct tacgcgagct ttttgggaaa agaaaaaaat 961 ttgaagatct acatataaaa catggcttca aaggattact aatgactttt tttaccttga 1021 taggtattct tgatggtaag agtaaacaag ggacgtgaaa attacagtag ttactgtttt 1081 ttttggacta taagatcggg ggaaagataa cacataagaa ataaaacgac tactagttag 1141 actgctctgc ggaagaagca aggaagtaaa ggctgcattt tatttttctt ttctagtcca 1201 acataaacag gtgtatcaag agaaactttt ttaattatgg ccgatgagga acgtttaaag 1261 gagtttaaag aggcaaacaa gatagtgttt gatccaaata ccagacaagt atgggaaaac 1321 cagaatcgag atggtacaaa accagcaact actttccaga gtgaagagga cataaaaaga 1381 gctgccccag aatctgaaaa agacacctcc gccacatcag gtattgttcc aacactacaa 1441 aacattgtgg caactgtgac tttggggtgc aggttagatc tgaaaacagt tgcgctacat 1501 gcccgtaatg cagaatataa ccccaagcgt tttgctgctg tcatcatgcg tattagagag 1561 ccaaaaacta cagctttaat ttttgcctca gggaaaatgg ttgttaccgg tgcaaaaagt 1621 gaggatgact caaagctggc cagtagaaaa tatgcaagaa ttatccaaaa aatcgggttt 1681 gctgctaaat tcacagactt caaaatacaa aatattgtcg gttcgtgtga cgttaaattc 1741 cctatacgtc tagaagggtt agcattcagt catggtactt tctcctccta tgagccagaa 1801 ttgtttcctg gtttgatcta tagaatggtg aagccgaaaa ttgtgttgtt aatttttgtt 1861 tcaggaaaga ttgttcttac tggtgcaaag caaagggaag aaatttacca agcttttgaa 1921 gctatatacc ctgtgctaag tgaatttaga aaaatgtgat ggggaaggag tagacgaaaa 1981 gaaaaaaagg ttttctattt gttccatttt ctcaattatt aatggtcctc aaagaaataa 2041 aagaaaagga agaagaagta attgtaatat caaacggttt tttatagtat attcttctta 2101 ttctatattt atatatcaat gttttataat aagatgttta ttcatagcat atctggtgga 2161 tcgtctctat taagcgccag cgaggtgttt gcctctgcat ttttcagcaa agcaagctcc 2221 ctttccagct tgaatctatg ttcacgctca tccgacaatt ctttttcata ctttctttgt 2281 gtactcgtaa gcactttttt aaactcactt gtcattattg aaagtgaacg tgatccagaa 2341 ccgcttgtgg ggcttcctac agaggaaggt gaacttggat cccaagtcac tggcgaactc 2401 gctggtgatg acatgccgaa attatgtctg cttgaattc // LOCUS ECOPUTC 730 bp ds-DNA BCT 25-JUL-1990 DEFINITION E.coli putC region encoding proline uptake protein (putP) and proline oxidase (putA) genes, 5'ends. ACCESSION M35174 KEYWORDS putA protein; putC region; putP protein. SOURCE E.coli DNA. ORGANISM Escherichia coli Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Facultatively anaerobic rods; Enterobacteriaceae. REFERENCE 1 (bases 1 to 730) AUTHORS Nakao,T., Yamato,I. and Anraku,Y. TITLE Nucleotide sequence of putC, the regulatory region for the put regulon of Escherichia coli K 12 JOURNAL Mol. Gen. Genet. 210, 364-368 (1987) STANDARD simple staff_review FEATURES from to/span description pept 129 < 1 (c) proline uptake protein (putP) pept 549 > 730 proline oxidase (putA) pept 209 544 ORF mRNA 266 < 1 (c) putP mRNA (alt.) mRNA 255 < 1 (c) putP mRNA (alt.) mRNA 249 < 1 (c) putP mRNA (alt.) mRNA 224 < 1 (c) putP mRNA (alt.) mRNA 142 < 1 (c) putP mRNA (alt.) mRNA 506 > 730 putA mRNA BASE COUNT 205 a 173 c 144 g 208 t ORIGIN 1 cccaagacta cgaccgccca gaatatagtc gtcaaagttt ttcgttgatc gccaggcgat 61 aaacccaatc aatatcatgc caaagatata gacacaaaat gtcaccaaca tcggtgtgct 121 aatagccatc taaagtctcc aaaaaattat tatcggcaat gtcgaaactt gccgttatat 181 ctgccaccgg aacggggtaa cagagtttat gttttaccag ggcgaccgta tcctgccgga 241 agcgctggtt attcacaatc gatttaacac accatttaca ttaaatttta gtgctcagcg 301 acactatttt tcatcaggtt gcactctctc acattttttg cggttgcacc tttcaaaaat 361 gttaactgcc gcagagaaaa agtctgagtt atttttttcc ctgtcatatc gatttctttt 421 attaacattt cattcatttt taagcttgct acgcatgtca catttaacat ggttgcacaa 481 agttgcaaca tcatggatat ttcacgataa cgttaagttg cacctttctg aacaacagga 541 gtaatggcat gggaaccacc accatggggg ttaagctgga cgacgcgacg cgtgagcgta 601 ttaagttcgc cgcgacacgt atcgatcgca caccacactg gttaattaag caggcgattt 661 tttcttatgc tcgaacaact ggaaaacagc gatactctgc cggagctacc tgcgctgctt 721 tctggcgcgg // LOCUS FIBGLUC 1426 bp ds-DNA BCT 25-JUL-1990 DEFINITION F.succinogenes 1,3-1,4-beta-D-glucan 4-glucanohydrolase gene, complete cds. ACCESSION M33676 M33311 KEYWORDS 1,3-1,4-beta-D-glucan 4-glucanohydrolase; beta-glucanase. SOURCE F.succinogenes (strain S85) DNA, clone PJI5. ORGANISM Fibrobacter succinogenes Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Sulfate- or sulfur-reducing dissimilatory bacteria. REFERENCE 1 (bases 1 to 1426) AUTHORS Teather,R.M. and Erfle,J.D. TITLE DNA sequence of a Fibrobacter succinogenes mixed linkage beta-glucanase (1,3-1,4-beta-D-glucan 4-glucanohydrolase) gene JOURNAL J. Bacteriol. 172, 3837-3841 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by R.M.Teather, 11-APR-1990. FEATURES from to/span description pept 145 1194 1,3-1,4-beta-D-glucan 4-glucanohydrolase precursor (EC 3.2.1.73) sigp 145 225 1,3-1,4-beta-D-glucan 4-glucanohydrolase signal peptide matp 226 1191 1,3-1,4-beta-D-glucan 4-glucanohydrolase binding 132 137 ribosome binding site signal 62 66 -35 region signal 85 90 -10 region BASE COUNT 371 a 346 c 335 g 374 t ORIGIN 1 ttttcagcac agcacactgc cacaattgat acagttaatc ttttaaatac attctatttt 61 attggttatt taatttcgct aacttatctt tatctttggt taaatgggat tctgttttgt 121 acagaaactt catggagaaa aaatatgaac atcaagaaaa ctgcagtcaa gagcgctctc 181 gccgtagcag ccgcagcagc agccctcacc accaatgtta gcgcaaagga ttttagcggt 241 gccgaactct acacgttaga agaagttcag tacggtaagt ttgaagcccg tatgaagatg 301 gcagccgcat cgggaacagt cagttccatg ttcctctacc agaatggttc cgaaatcgcc 361 gatggaaggc cctgggtaga agtggatatt gaagttctcg gcaagaatcc gggcagtttc 421 cagtccaaca tcattaccgg taaggccggc gcacaaaaga ctagcgaaaa gcaccatgct 481 gttagccccg ccgccgatca ggctttccac acctacggtc tcgaatggac tccgaattac 541 gtccgctgga ctgttgacgg tcaggaagtc cgcaagacgg aaggtggcca ggtttccaac 601 ttgacaggta cacagggact ccgttttaac ctttggtcgt ctgagagtgc ggcttgggtt 661 ggccagttcg atgaatcaaa gcttccgctt ttccagttca tcaactgggt caaggtttat 721 aagtatacgc cgggccaggg cgaaggcggc agcgacttta cgcttgactg gaccgacaat 781 tttgacacgt ttgatggctc ccgctggggc aagggtgact ggacatttga cggtaaccgt 841 gtcgacctca ccgacaagaa catctactcc agagatggca tgttgatcct cgccctcacc 901 cgcaaaggtc aggaaagctt caacggccag gttccgagag atgacgaacc tgctccgcaa 961 tcttctagca gcgctccggc atcttctagc agtgttccgg caagctcctc tagcgtccct 1021 gcctcctcga gcagcgcatt tgttccgccg agctcctcga gcgccacaaa cgcaatccac 1081 ggaatgcgca caactccggc agttgcaaag gaacaccgca atctcgtgaa cgccaagggt 1141 gccaaggtga acccgaatgg ccacaagcgt tatcgcgtga actttgaaca ctaatcgtgg 1201 ctgattctct ttataattct ctttatcgca aagaccatgt ggtttactcc acatggtttt 1261 tcgttaagtc cactaaaatt aggggatttt cgctattttt tttgaatttt gacactaaaa 1321 tgtcaaatga gtttttgtat ttttgatttc gaaattttta aaaattaaaa taggatagtt 1381 atatggctta tttgaataag gttatgctca tcggtaatat cggtaa // LOCUS BFRRCRRA 89 bp ss-RNA PHG 25-JUL-1990 DEFINITION Bacteriophage fr coat protein replicase cistron (R region) RNA. ACCESSION M35063 KEYWORDS coat protein. SOURCE Bacteriophage fr RNA. ORGANISM Bacteriophage fr Viridae; ss-RNA nonenveloped viruses; Isometric ss-RNA viruses; Leviviridae. REFERENCE 1 (bases 1 to 89) AUTHORS Cielens,I.E., Jansone,I.V., Gribanov,V.A., Vishnevskii,Y.I., Berzin,V.M. and Gren,E.J. TITLE Regulator region of phage fr replicase cistron: II. Isolation and structure of specific fr RNA fragments JOURNAL Mol. Biol. 16, 886-892 (1982) STANDARD simple staff_entry FEATURES from to/span description pept 55 > 89 coat protein (R region) pept < 1 20 undefined ORF (AA at 3) BASE COUNT 34 a 23 c 13 g 19 t ORIGIN 1 ccaactcggg aatctactaa gaaacccgtg ccattccaac aatgaggaat acccatgtca 61 aaatcaacaa agaagttcaa ctctttatg // LOCUS CHKAGLBB 71 bp ss-mRNA VRT 25-JUL-1990 DEFINITION Chicken alpha-globin gene, partial cds. ACCESSION M35068 KEYWORDS alpha-globin. SOURCE Chicken (strain white Leghorn) 2-3 week old, cDNA to mRNA, clone pHb1003. ORGANISM Gallus gallus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Aves; Neornithes; Neognathae; Galliformes; Phasianidae. REFERENCE 1 (bases 1 to 71) AUTHORS Cummings,I.W., Liu,A.Y. and Salser,W.A. TITLE Identification of a new chicken alpha-globin structural gene by complementary DNA cloning JOURNAL Nature 276, 418-419 (1978) STANDARD simple staff_entry FEATURES from to/span description pept < 1 > 71 alpha-globin (AA at 1) BASE COUNT 17 a 22 c 18 g 14 t ORIGIN 1 aagaaggtag tggctgcctt gatcgaggct gccaaccaca ttgatgacat cgccggcacc 61 ctctccaagc t // LOCUS ECOTGLPA 141 bp ds-DNA BCT 25-JUL-1990 DEFINITION E.coli suppressor tRNA-Leu (leuX) precursor gene. ACCESSION M35064 KEYWORDS leuX gene; suppressor transfer RNA-Leu. SOURCE E.coli DNA. ORGANISM Escherichia coli Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Facultatively anaerobic rods; Enterobacteriaceae. REFERENCE 1 (bases 1 to 141) AUTHORS Nomura,T. and Ishihama,A. TITLE A novel function of RNase P from Escherichia coli: Processing of a suppressor tRNA precursor JOURNAL EMBO J. 7, 3539-3545 (1988) STANDARD simple staff_entry FEATURES from to/span description tRNA 23 114 Leu-tRNA anticdn 57 59 Leu-tRNA anticodon caa site 35 36 self-cleavage site BASE COUNT 38 a 34 c 33 g 36 t ORIGIN 1 gttttccgca tacctcttca gtgccgaagt ggcgaaatcg gtagacgcag ttgattcaaa 61 atcaaccgta gaaatacgtg ccggttcgag tccggccttc ggcaccaaaa gtatgtaaat 121 agacctcaac tgaggtcttt t // LOCUS HUMFBPC 66 bp ss-mRNA PRI 25-JUL-1990 DEFINITION Human folate binding protein mRNA, partial cds. ACCESSION M35069 KEYWORDS folate binding protein. SOURCE Human epidermoid carcinoma cell line KB, cDNA to mRNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 66) AUTHORS Sadasivan,E. and Rothenberg,S.P. TITLE Molecular cloning of the complementary DNA for a human folate binding protein JOURNAL Proc. Soc. Exp. Biol. Med. 189, 240-244 (1988) STANDARD simple staff_entry FEATURES from to/span description pept < 1 > 66 folate binding protein (AA at 1) BASE COUNT 21 a 17 c 17 g 11 t ORIGIN 1 acaaggattg catgggccag gactgagctt ctcaatgtct gcatgaacgc caagcaccac 61 aaggaa // LOCUS HUMMETONA 90 bp ds-DNA PRI 25-JUL-1990 DEFINITION Human met oncogene, middle exon. ACCESSION M35073 KEYWORDS met oncogene; tyrosine kinase. SOURCE Human cell line MNNG-HOS DNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 90) AUTHORS Dean,M., Park,M., Le Beau,M.M., Robins,T.S., Diaz,M.O., Rowley,J.D., Blair,D.G. and Vande Woude,G.F. TITLE The human met oncogene is related to the tyrosine kinase oncogenes JOURNAL Nature 318, 385-388 (1985) STANDARD simple staff_entry FEATURES from to/span description pept / 22 / 90 met oncogene (AA at 24) /hgml_locus_uid="LN0032R" /nomgen="MET" /map="7q31" IVS < 1 21 met oncogene intron BASE COUNT 28 a 16 c 21 g 25 t ORIGIN Chromosome 7q31. 1 ttggctttgg tcttcaagta gccaaagcga tgaaatatct tgcaagcaaa aagtttgtcc 61 acagagactt ggctgcaaga aactgtatgt // LOCUS HUMMETONB 375 bp ss-mRNA PRI 25-JUL-1990 DEFINITION Human met oncogene mRNA, 3' end. ACCESSION M35074 KEYWORDS met oncogene; tyrosine kinase. SOURCE Human cell line MNNG-HOS, cDNA to mRNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 375) AUTHORS Dean,M., Park,M., Le Beau,M.M., Robins,T.S., Diaz,M.O., Rowley,J.D., Blair,D.G. and Vande Woude,G.F. TITLE The human met oncogene is related to the tyrosine kinase oncogenes JOURNAL Nature 318, 385-388 (1985) STANDARD simple staff_entry FEATURES from to/span description pept < 1 375 met oncogene /hgml_locus_uid="LN0032R" /nomgen="MET" /map="7q31" BASE COUNT 94 a 97 c 83 g 101 t ORIGIN Chromosome 7q31. 1 tggtcctttg gcgtcgtcct ctgggagctg atgacaagag gagccccacc ttatcctgac 61 gtaaacacct ttgatataac tgtttacttg ttgcaaggga gaagactcct acaacccgaa 121 tactgcccag accccttata tgaagtaatg ctaaaatgct ggcaccctaa agccgaaatg 181 cgcccatcct tttctgaact ggtgtcccgg atatcagcga tcttctctac tttcattggg 241 gagcactatg tccatgtgaa cgctacttat gtgaacgtaa aatgtgtcgc tccgtatcct 301 tctctgttgt catcagaaga taacgctgat gatgaggtgg acacacgacc agcctccttc 361 tgggagacat catag // LOCUS MS23ENDA 105 bp ss-RNA PHG 25-JUL-1990 DEFINITION Bacteriophage MS2 3' terminal fragment. ACCESSION M35059 KEYWORDS . SOURCE Bacteriophage MS2 RNA. ORGANISM Bacteriophage MS2 Viridae; ss-RNA nonenveloped viruses; Isometric ss-RNA viruses; Leviviridae. REFERENCE 1 (bases 1 to 105) AUTHORS Contreras,R., Vandenberghe,A., Jou,W.M., De Wachter,R. and Fiers,W. TITLE Studies on the Bacteriophage MS2 nucleotide sequence of a 3' terminal fragment (n=104) JOURNAL FEBS Lett. 18, 141-144 (1971) STANDARD simple staff_entry BASE COUNT 21 a 34 c 30 g 20 t ORIGIN 1 gctccaccga aaggtgggcg ggcttcggcc cagggacccc tccctaaaga gaggacccgg 61 gattctcccg atttggtaac tagctgcttg gctagttacc accca // LOCUS PEAPCATE 1004 bp ds-DNA SYN 25-JUL-1990 DEFINITION Chimaeric gene with P.sativum ribulose 1,5-bisphosphate carboxylase 5' flank/A.tumefaciens chloramphenicol acetyltransferase gene, 5' end. ACCESSION M35072 KEYWORDS . SOURCE Recombined Pisum sativum and Agribacterium tumefaciens DNA inserted in decapitated tobacco seedlings. ORGANISM Artificial gene Artificial sequences; Genes. REFERENCE 1 (bases 1 to 1004) AUTHORS Herrera-Estrella,L., Van den Broeck,G., Maenhaut,R., Van Montagu,M., Schell,J., Timko,M. and Cashmore,A. TITLE Light-inducible and chloroplast-associated expression of a chimaeric gene introduced into Nicotiana tabacum using a Ti plasmid vector JOURNAL Nature 310, 115-120 (1984) STANDARD simple staff_entry FEATURES from to/span description pept 1002 > 1004 chloramphenicol acetyltransferase (CAT) recomb 965 966 P.sativum DNA end/CAT DNA start signal 943 946 TATA box BASE COUNT 309 a 176 c 146 g 373 t ORIGIN 1 gaattcaaca ttggctatta ctggttttac aaagtcagac taaggagcat gtccaaccac 61 tataaggtct ataataggat ttaccttttt ccttagaagc actttaatca actagaaatc 121 aaagaagcaa aatgtagtgt ctagatcttc atcagaagta aagtatagag ctttagcaaa 181 cacatcatgt gagacacgat ggtttctata cttgcttcag gatctctgca tttcccatac 241 ctcgttcatg acaattgcaa accaacctcg tacatttgat gcccataatt tctgaaaacc 301 aagttgcata cctcttcacc aaaactcttc atcttggtct cttctcctct tttgttcaca 361 aactaggaat tattaacttt cattctaatt tataggggct gctacaactt aatatatttt 421 taattatttt tattctctta atttcctttt tttctatttg tttgtcaggt agttgagata 481 tttgggctaa tctattagag atagtttctc taacaaactt gtaactttgg gtctatatta 541 gctaatgatt catcttatat tttttcaaat gaatcattaa taaaactttc ctcttttatt 601 taattttttc aattcagttt catcatcaaa gcaaatgttt ccctgccatc tgtttgtcaa 661 cactaacatc taatgtactt atctcattag tttaattatt gtttgatcat gtttaatcct 721 tctagtgttg ttagtttttt cagttagctt aatgggcatc ttacacgtgg cattatccta 781 ttggtggcaa atgataaggt taggacacac aacttttcaa tcttgtgtgg ttaatatggc 841 tgcaaagttt atcatttcac aatctaacaa gattggtact aggcagtagc taattaccac 901 aatattaaga ccataatatt ggaaatagat aaataaaaac attatatata gcaagtttta 961 gcagaagctt ggcgagattt tcaggagcta aggaagctaa aatg // LOCUS TRFMTTGVA 149 bp ds-DNA ORG 25-JUL-1990 DEFINITION C.oncopelti mitochondrion Val-tRNA gene. ACCESSION M35071 KEYWORDS transfer RNA-Val. SOURCE C.oncopelti mitochondrial DNA, clone pCo150. ORGANISM Mitochondrion Crithidia oncopelti Eukaryota; Animalia; Protozoa; Sarcomastigophora; Mastigophora; Kinetoplastida; Trypanosomatina; Trypanosomatidae; Crithidia oncopelti. REFERENCE 1 (bases 1 to 149) AUTHORS Entelis,N.S., Maslov,D.A., Bol'shakova,E.V. and Zaitseva,G.N. TITLE Primary structure of an unusual valine tRNA gene from mitochondria of Crithidia oncopelti JOURNAL Dokl. Biochem. 297, 435-438 (1987) STANDARD simple staff_entry FEATURES from to/span description tRNA 18 89 Val-tRNA anticdn 45 47 Val-tRNA anticodon tac BASE COUNT 44 a 39 c 16 g 50 t ORIGIN 1 gatctaaaat ccctgttaga cacttgtttt tgcaaacgta taattacgtt ttctacacca 61 aaacccttta aatccctgtt aggaccccat ttcttcaaat gtataatcac gttttctgcg 121 tccaaacccc ttaaaaccca gatttcgat // LOCUS YSCTRV2A 75 bp ss-tRNA RNA 25-JUL-1990 DEFINITION Yeast (S.cerevisiae, Baker's) Val-tRNA-2a. ACCESSION M35070 K01066 KEYWORDS transfer RNA-Val. SOURCE Yeast (S.cerevisiae, Baker's) tRNA. ORGANISM Saccharomyces cerevisiae Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes; Endomycetales; Saccharomycetaceae. REFERENCE 1 (bases 1 to 75) AUTHORS Aksel'rod,V.D., Kryukov,V.M., Isaenko,S.N. and Baev,A.A. TITLE Nucleotide sequence in Val-tRNA-2a from Baker's yeast JOURNAL FEBS Lett. 45, 333-336 (1974) STANDARD full staff_review REFERENCE 2 (bases 1 to 75) AUTHORS Aksel'rod,V.D., Kryukov,V.M., Isaenko,S.N.. and Baev,A.A. TITLE Primary structure of Val-tRNA-2a from Baker's yeast JOURNAL Mol. Biol. 9, 42-48 (1975) STANDARD simple staff_entry COMMENT Contributed on tape April 1983 by M.Sprinzl & D.H.Gauss; from their entry 2050 in Nucleic Acids Res. 11, r1-r54 (1983). [1] compared given sequence with that of baker's yeast Val-tRNA-1. FEATURES from to/span description tRNA 1 75 Val-tRNA-2a (NAR: 2050) anticdn 35 37 Val-tRNA-2a anticodon tac modified 10 10 m2g modified 16 16 d modified 19 19 d modified 20 20 d modified 27 27 m22g modified 28 28 p modified 33 33 p modified 35 35 unidentified uridine derivative modified 46 46 d modified 48 48 m5c modified 53 53 t modified 54 54 p modified 57 57 m1a BASE COUNT 16 a 22 c 20 g 17 t ORIGIN 5' end of mature tRNA. 1 ggtccaatgg tccagtggtt caagacgtcg cctttacacg gcgaatcccg agttcgaacc 61 tcggttggat cacca // LOCUS YSCTRW 75 bp ss-tRNA RNA 25-JUL-1990 DEFINITION Yeast (S.cerevisiae) Trp-tRNA-cca. ACCESSION M35060 X02698 KEYWORDS transfer RNA-Trp. SOURCE Yeast tRNA. ORGANISM Saccharomyces cerevisiae Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes; Endomycetales; Saccharomycetaceae. REFERENCE 1 (bases 1 to 75) AUTHORS Keith,G., Roy,A., Ebel,J.P. and Dirheimer,G. TITLE The nucleotide sequences of two tryptophane-tRNAs from Brewer's yeast JOURNAL FEBS Lett. 17, 306-308 (1971) STANDARD simple staff_entry REFERENCE 2 (bases 1 to 75) AUTHORS Keith,G., Roy,A., Ebel,J.-P. and Dirheimer,G. TITLE The primary structure of tryptophan transfer ribonucleic acid from Brewer's yeast: II. Partial digestion with pancreatic ribonuclease and derivation of complete sequence JOURNAL Biochimie 54, 1417-1426 (1972) STANDARD full staff_review FEATURES from to/span description tRNA 1 75 transfer RNA-Trp anticdn 33 35 Trp-tRNA anticodon cca modified 9 9 1-methylguanosine modified 10 10 2-methylguanosine modified 16 16 dihydrouridine modified 17 17 2'-O-methylguanosine modified 19 19 dihydrouridine modified 25 25 pseudouridine modified 26 26 pseudouridine modified 27 27 pseudouridine modified 31 31 2'O-methylcytidine modified 33 33 2'O-methylcytidine modified 38 38 pseudouridine modified 45 45 7-methylguanosine modified 47 47 dihydrouridine modified 53 53 5-methyluridine (ribosylthymine) modified 54 54 pseudouridine modified 57 57 1-methyladenosine modified 64 64 pot. pseudouridine BASE COUNT 17 a 18 c 20 g 20 t ORIGIN 1 gaagcggtgg ctcaatggta gagctttcga ctccaaatcg aagggttgca ggttcaattc 61 ctgtccgttt cacca // LOCUS YSUTRAI 76 bp ss-tRNA RNA 25-JUL-1990 DEFINITION Yeast (T.utilis) Ala-tRNA-I. ACCESSION M35061 K00143 KEYWORDS transfer RNA-Ala. SOURCE Yeast (T.utilis) tRNA. ORGANISM Candida utilis Eukaryota; Plantae; Thallobionta; Basidiomycotina; Deuteromycotina. REFERENCE 1 (bases 1 to 76) AUTHORS Takemura,S., Ogawa,K. and Nakazawa,K. TITLE Nucleotide sequence of alanine tRNA I from Torulopsis utilis JOURNAL FEBS Lett. 25, 29-32 (1972) STANDARD simple staff_entry REFERENCE 2 (bases 1 to 76) AUTHORS Takemura,S. and Ogawa,K. TITLE The primary structure of alanine transfer ribonucleic acid 1 from Torulopsis utilis: II. Partial digestion with ribonuclease T-1 and derivation of the complete sequence JOURNAL J. Biochem. 74, 323-333 (1973) STANDARD full staff_review COMMENT Contributed on tape April 1983 by M.Sprinzl and D.H.Gauss; from their entry 0020 in Nucleic Acids Res. 11, r1-r54 (1983). [1]: The cloverleaf model for the secondary structure was compared with that of Saccharomyces Ala-tRNA, especially with respect to the aminoacyl-tRNA synthetase recognition sites. FEATURES from to/span description tRNA 1 76 Ala-tRNA-I (NAR: 0020) anticdn 34 36 Ala-tRNA-I anticodon ggc modified 9 9 m1g = 1-methylguanosine modified 16 16 d = dihydrouridine modified 17 17 d = dihydrouridine modified 20 20 d = dihydrouridine modified 26 26 m22g = 2,2-dimethylguanosine modified 27 27 f = pseudouridine modified 34 34 i = inosine modified 37 37 m1i = 1-methylinosine modified 38 38 f = pseudouridine modified 47 47 d = dihydrouridine modified 54 54 t = 5-methyluridine modified 55 55 f = pseudouridine modified 58 58 m1a = 1-methyladenosine BASE COUNT 9 a 21 c 28 g 18 t ORIGIN 5' end of mature tRNA 1 gggcgtgtgg cgtagttggt agcgcgttcg cttggcgtgc gaaaggtctc cggttcgact 61 ccggactcgt ccacca // LOCUS MUSPTKA 211 bp ss-mRNA ROD 25-JUL-1990 DEFINITION Mouse protein-tyrosine kinase (PTK) mRNA, partial cds, clone FD15. ACCESSION M33421 KEYWORDS protein-tyrosine kinase. SOURCE Mouse haemopoietic cell line FDC-P1, cDNA to mRNA, clone FD15. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 211) AUTHORS Wilks,A.F., Kurban,R.R., Hovens,C.M. and Ralph,S.J. TITLE The application of the polymerase chain reaction to cloning members of the protein tyrosine kinase family JOURNAL Gene 85, 67-74 (1989) STANDARD simple staff_review FEATURES from to/span description pept < 1 > 211 protein-tyrosine kinase (AA at 3) (EC 2.7.1.112) BASE COUNT 47 a 55 c 64 g 45 t ORIGIN 1 ggatccacag ggacctggct gctcggaact gcctggtgac agagaagaat gtcctgaaga 61 tcagcgactt tgggatgtcc cgcgaagaag ctgatgggat ctatgccgcc tgcagcggcc 121 tcagacaagt ccctgttaag tggactgccc ctgaggccct taactacgga cgctactcct 181 cagagagtga tgtgtggagc tttggaattc c // LOCUS MUSPTKB 211 bp ss-mRNA ROD 25-JUL-1990 DEFINITION Mouse protein-tyrosine kinase (PTK) mRNA, partial cds, clone FD16. ACCESSION M33422 KEYWORDS protein-tyrosine kinase. SOURCE Mouse haemopoietic cell line FDC-P1, cDNA to mRNA, clone FD16. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 211) AUTHORS Wilks,A.F., Kurban,R.R., Hovens,C.M. and Ralph,S.J. TITLE The application of the polymerase chain reaction to cloning members of the protein tyrosine kinase family JOURNAL Gene 85, 67-74 (1989) STANDARD simple staff_review FEATURES from to/span description pept < 1 > 211 protein-tyrosine kinase (AA at 3) (EC 2.7.1.112) BASE COUNT 50 a 50 c 59 g 52 t ORIGIN 1 ggatccacag agaccttgct gctaggaact gcatggatgc cgaagatttc acagtaaaaa 61 ttggagattt cggtatgaca cgagacatct acgagacgga ctactaccgg aaaggcggga 121 aggggttgct gcctgtgcgc tggatgtctc tcgagtccct caaggatggt gtcttcacta 181 ctcattctga cgtctggtcc ttcggaattc c // LOCUS MUSPTKC 214 bp ss-mRNA ROD 25-JUL-1990 DEFINITION Mouse protein-tyrosine kinase (PTK) mRNA, partial cds, clone FD17. ACCESSION M33423 M22448 J04523 KEYWORDS protein-tyrosine kinase. SOURCE Mouse haemopoietic cell line FDC-P1, cDNA to mRNA, clone FD17. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 214) AUTHORS Wilks,A.F., Kurban,R.R., Hovens,C.M. and Ralph,S.J. TITLE The application of the polymerase chain reaction to cloning members of the protein tyrosine kinase family JOURNAL Gene 85, 67-74 (1989) STANDARD simple staff_review REFERENCE 2 (sites) AUTHORS Wilks,A.F. TITLE Two putative protein-tyrosine kinases identified by application of the polymerase chain reaction JOURNAL Proc. Natl. Acad. Sci. U.S.A. 86, 1603-1607 (1989) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence [2] kindly submitted by A.Wilks, 08-FEB-1989, for release after publication. FEATURES from to/span description pept < 1 > 214 protein-tyrosine kinase (AA at 3) (EC 2.7.1.112) BASE COUNT 69 a 42 c 58 g 45 t ORIGIN 1 ggatccacag ggacctggca acaaggaaca tattggtgga aaatgagaac agggttaaaa 61 taggagactt cggattaacc aaagtcttgc cgcaggacaa agaatactac aaagtaaagg 121 agccagggga aagaccgata ttctggtacg cacctgaatc cttgacggag agcaagtttt 181 ctgtggcctc agatgtctgg tcctttggaa ttcc // LOCUS MUSPTKD 217 bp ss-mRNA ROD 25-JUL-1990 DEFINITION Mouse protein-tyrosine kinase (PTK) mRNA, partial cds, clone FD19. ACCESSION M33424 KEYWORDS protein-tyrosine kinase. SOURCE Mouse haemopoietic cell line FDC-P1, cDNA to mRNA, clone FD19. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 217) AUTHORS Wilks,A.F., Kurban,R.R., Hovens,C.M. and Ralph,S.J. TITLE The application of the polymerase chain reaction to cloning members of the protein tyrosine kinase family JOURNAL Gene 85, 67-74 (1989) STANDARD simple staff_review FEATURES from to/span description pept < 1 > 217 protein-tyrosine kinase (AA at 3) (EC 2.7.1.112) BASE COUNT 64 a 44 c 57 g 52 t ORIGIN 1 ggatccacag agacttagct gcaagaaact gcatgttgga tgaaaaattc actgtcaagg 61 ttgctgattt cggtcttgcc agagacatgt acgataaaga gtactatagt gtccacaaca 121 agacgggtgc caagctacca gtgaagtgga tggctttaga gagtctgcaa aggcagaagt 181 tcaccaccac gtcagatgtg tggtcctttg gaattcc // LOCUS MUSPTKE 214 bp ss-mRNA ROD 25-JUL-1990 DEFINITION Mouse protein-tyrosine kinase (PTK) mRNA, partial cds, clone FD22. ACCESSION M33425 M22447 J04523 KEYWORDS protein-tyrosine kinase. SOURCE Mouse haemopoietic cell line FDC-P1, cDNA to mRNA, clone FD22. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 214) AUTHORS Wilks,A.F., Kurban,R.R., Hovens,C.M. and Ralph,S.J. TITLE The application of the polymerase chain reaction to cloning members of the protein tyrosine kinase family JOURNAL Gene 85, 67-74 (1989) STANDARD simple staff_review REFERENCE 2 (sites) AUTHORS Wilks,A.F. TITLE Two putative protein-tyrosine kinases identified by application of the polymerase chain reaction JOURNAL Proc. Natl. Acad. Sci. U.S.A. 86, 1603-1607 (1989) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence [2] kindly submitted by A.Wilks, 08-FEB-1989, for release after publication. FEATURES from to/span description pept < 1 > 214 protein-tyrosine kinase (AA at 3) (EC 2.7.1.112) BASE COUNT 59 a 46 c 55 g 54 t ORIGIN 1 ggatccaccg ggacttagca gcaagaaatg tccttgttga gagtgagcat caagtgaaga 61 tcggagactt tggtttaacc aaagcaattg aaaccgataa ggagtactac acagtcaagg 121 acgaccggga cagcccagtg ttctggtacg ctccggagtg tttaatccag tgtaaatttt 181 atatcgcctc tgacgtctgg tcctttggaa ttcc // LOCUS MUSPTKF 208 bp ss-mRNA ROD 25-JUL-1990 DEFINITION Mouse protein-tyrosine kinase (PTK) mRNA, partial cds, clone FD175. ACCESSION M33426 KEYWORDS protein-tyrosine kinase. SOURCE Mouse haemopoietic cell line FDC-P1, cDNA to mRNA, clone FD175. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 208) AUTHORS Wilks,A.F., Kurban,R.R., Hovens,C.M. and Ralph,S.J. TITLE The application of the polymerase chain reaction to cloning members of the protein tyrosine kinase family JOURNAL Gene 85, 67-74 (1989) STANDARD simple staff_review FEATURES from to/span description pept < 1 > 208 protein-tyrosine kinase (AA at 3) (EC 2.7.1.112) BASE COUNT 49 a 53 c 55 g 51 t ORIGIN 1 ggatccaccg tgatctgcga gctgctaacg tcctggtctc tgagtcactc atgtgcaaga 61 ttgcagactt tggcctcgcg agagtcatcg aagataacga gtacacagca agggaaggtg 121 cgaagttccc tatcaagtgg acagctccag aggcgttcaa cttcggctgc ttcactatca 181 aatctgacgt gtggtccttt ggaattcc // LOCUS MUSPTKG 208 bp ss-mRNA ROD 25-JUL-1990 DEFINITION Mouse protein-tyrosine kinase (PTK) mRNA, partial cds, clone W3.13. ACCESSION M33427 KEYWORDS protein-tyrosine kinase. SOURCE Mouse haemopoietic cell line WEH1-3B D+, cDNA to mRNA, clone W3.13. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 208) AUTHORS Wilks,A.F., Kurban,R.R., Hovens,C.M. and Ralph,S.J. TITLE The application of the polymerase chain reaction to cloning members of the protein tyrosine kinase family JOURNAL Gene 85, 67-74 (1989) STANDARD simple staff_review FEATURES from to/span description pept < 1 > 208 protein-tyrosine kinase (AA at 3) (EC 2.7.1.112) BASE COUNT 53 a 44 c 56 g 55 t ORIGIN 1 ggatccacag agacctggct gccagaaatt gtctagtgaa tgaagcagga gttgtcaaag 61 tatctgattt tggaatggcc aggtacgttc tggatgatca gtacacaagt tcttctggcg 121 ccaagttccc tgtgaagtgg tgtcccccag aagagtttaa ttacagccgc tttagcagca 181 agtcagacgt gtggtcctat ggaattcc // LOCUS RATCROS1A 7839 bp ss-mRNA ROD 25-JUL-1990 DEFINITION Rat lung-derived c-ros-1 proto-oncogene mRNA, complete cds. ACCESSION M35104 KEYWORDS c-ros-1 proto-oncogene; tyrosine kinase. SOURCE Rat (strain Fischer) lung, cDNA to mRNA. ORGANISM Rattus norvegicus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 7839) AUTHORS Matsushime,H. and Shibuya,M. TITLE Tissue-specific expression of rat c-ros-1 gene and partial structural similarity of its predicted products with sev protein of Drosophila melanogaster JOURNAL J. Virol. 64, 2117-2125 (1990) STANDARD simple staff_review FEATURES from to/span description pept 402 7355 c-ros-1 tyrosine kinase (put.) mRNA < 1 7839 c-ros-1 mRNA BASE COUNT 2159 a 1760 c 1887 g 2033 t ORIGIN 1 catagctcag ccaacctcaa agaagtgcgg tggctggccg acctgagtgt tctgcgtcag 61 gactgtgtgg actggctcgc tggaaagcaa tctaagttcc tactgcttat tttgcatgtg 121 gagagctctt ccacgatcta gcctttagcc agggaacgtc tttcattatg ggagtaaaag 181 gaagctaaac tataaaatag tcttgctgcg atgttctggg ctatctgaga tccaaaggtc 241 taaaccggtt tcaataagag agtacgatat tctaacatcg caaaagaaaa cagataaccc 301 accaagctca cttgcaaccg aagtatgaag cctaaagaat tgttaaagca acatggagac 361 atgaggacgc cagccgtgta ggaagctggc cttcctgagg gatgaagagg atccgctggc 421 tcaccccaaa acctgcgacc tttgtggtcc ttgggtgcgt atggatttcc gtggcgcagg 481 gtaccattct gagcagctgc ctaacgtcct gtgtaactaa cttgggcagg cagcttgaca 541 gcggcacccg gtacaatctg agtgaggcat gcatccaagg atgtcagttt tggaactcta 601 tagatcagga gaagtgtgct ttgaagtgta atgatacata tgtcaccatt tgtgagaggg 661 agtcctgtga ggtcggctgc agcaacgcgg agggtagcta cgaagaggaa gtgctggaca 721 acacagagct tcctacagca cccttcgcat cttccattgg aagtaacggg gtgacattac 781 gatggaaccc tgccaacatc tctggagtaa aatacatcat tcagtggaaa tatgcccaac 841 ttccgggaag ctgggcttac acagaaactg tgtctaagct ctcatacatg gtggaacccc 901 tgcatccatt tactgaatat atttttcgag tggtttggat tttcacagcc cagctgcacc 961 tttattcccc gccaagtccc agttacagga ctcatcctta tggagttcca gaaactgcgc 1021 ctttcatcac gaacatcgaa agctcgagcc ctgacactgt ggaggtcagc tgggctccac 1081 cctatttccc aggtggacct attttgggtt ataatttaag gctgatcagt aaaactcaaa 1141 aattagattc agggacacag agaaccagtt tccagtttta ttctactctt ccaaacacca 1201 cttacaggtt ttctatcgca gcagtcaatg aagtcggtga ggggccagaa gcagaatcta 1261 tgattaccac tccatcccca gcagttcaag aagaagaaca atggctcttt ttatccagaa 1321 aaacttctct aagaaagagg tctttgaagt acttagtaga cgaagcacat tgcctttggt 1381 cagatgctat acgtcataat attacaggaa tatcagtcaa cactcagcag gaagtggttt 1441 atttctcaga aggaaccatc atatggatga agggggctgc taacatgtct gatgtgtctg 1501 acctgaggat cttttatcga ggctcagctc tagtctcttc tatctctgta gactggcttt 1561 accaaaggat gtatttcatc atggataatc gggtgcatgt ctgtgactta aagcattgct 1621 caaatcttga ggaaatcact ccattctcta ttgttgcacc tcaaaaagtt gtggttgatt 1681 cctacaatgg ggacaccaaa gctgtgcgta ttgtggagag tggcacatta aaggacttcg 1741 cagtaaagcc gcagtccaag cgaatcattt acttcaatgg caccatgcaa gtcttcatgt 1801 cgacatttct ggatggctcg gcattccaca gggttctgcc gtgggtcccc cttgcggatg 1861 tgaagagctt tgcttgtgaa aacaatgact tcctcatcac agatggcaag gccattttcc 1921 aacaggactc tctgtctttc aatgagttca tcgtgggatg tgacctgagt cacatagaag 1981 aatttgggtt tggtaacttg gtcatctttg gctcctccgt ccagtcgtac cctctgccag 2041 gccatccaca ggaggtctcg gtgctgtttg gttctcgaga ggcccttatt cagtggaagc 2101 ctccgattct cgccatagga gccagtcctt ccgcctggca gaactggact tatgaggtca 2161 aagtttcctc ccaggacatt ctggaaacca ctcaagtttt cttgaacata agcaggactg 2221 tgctgaatgt acccaagctg caaagttcta caaagtacat ggtgtctgtg cgagcaagtt 2281 ctcctaaagg cccaggccca tggtcagaac cctcagtggg tactaccttg gtaccagcca 2341 ctgagccacc gttcatcatg gctgtgaaag aagatgggct ttggagcaaa ccactcagta 2401 gttttggccc aggagagttc ctatcctctg acgtaggaaa cgtgtcagat atggattggt 2461 ataacaacag cctctactac agtgacacaa aaggcaatgt gtatgtgcgg cctctgaatg 2521 ggatggatat ctcggagaat taccacatat ccagcattgc aggagcttgt gccttggcct 2581 ttgaatggct gggtcacttt ctctactggg ctgggaagac atatgtgatt caaaggcagt 2641 ctgtgttaac gggacacaca gacattgtga ctcacgtgaa gctgttggtg aatgacatgg 2701 ccgtggatcc agttggtggc tatctgtact ggacgacgct ctactcggtt gaaagcacca 2761 gactcaatgg agaaagttct cttgtactac aggctcagcc ctggctctct ggaaaaaagg 2821 ttattgctct aacattagac ctcagcgatg ggctcctgta ctggctggtg caggacaatc 2881 agtgtattca cctgtacacg gctgttctcc ggggatggag tggtgcggat gctaccatca 2941 ccgagtttgc agcctggagt acttctgaaa tttcccagaa tgcactgatg tactacagcg 3001 gtagactctt ctggatcaat ggctttagga tcatcacagc acaggaaata ggtcagagaa 3061 ccagcgtgtc tgtttctgag ccagggaaat tcaatcagtt tacgatcata cagacatccc 3121 tcaagcctct gccagggaac ttttcctcta ctcccacggt tatcccagat tctgttcagg 3181 agtcctcatt tcgaattgaa ggacacactt caagtttccg aatcctgtgg aatgagcccc 3241 ctgcggtgga ctggggcata gttttctaca gtgtggaatt tagtgctcat tctaagttcc 3301 tggctattga acaacagtct ttacctgttt ttactgtgga aggactggag ccctatgcct 3361 tatttaatct ttctgtcact ccttatacct attggggaaa aggtcaaaaa acatctctat 3421 catttcgagc gcctgaatca gttccgtcag caccagagaa ccccagaata tttatattgt 3481 cacttggaag atacaccagg aagaatgaag tcgtggtaga gtttaggtgg aataaaccta 3541 agcatgaaaa tggagtgcta accaaatctg aaatcttcta ccacatatct aaacaaagtg 3601 gcacaaataa atcaacggaa gactgggtat ctgtcagcgt tacaccgccg gtgatgtctt 3661 ttcaacttga agccatgagt cctgggtata ttgtttcctt ccaggttcga gtcttcacct 3721 ccaaagggcc aggaccattt tctgatatag tgatgtctaa aacatcagaa atcaagccat 3781 gtccatatct catatctctt cttggcaata agattgagtt cttagacatg gaccaaaatc 3841 aagttgtgtg gacattttcc ctggagggag ccgtcagcac agtggggtac acagcggatg 3901 atgaaatggg gtatttcgct caaggagatg cactcttcct tctgaatttg cacaatcatt 3961 ccagctccaa gcttttccag gacgtgctgg cttctgacat tgcggttatt gctgttgact 4021 ggatcgcaag gcacctctac tttgctctga aagcatcgca agatggaaca cagatattcg 4081 atgttgacct tgaacacaag gtgaaatccc ccagggaggt gaagatttgc aaaagccata 4141 cagcaataat ttctttctct atgtatcccc tcttaagtcg cctgtattgg acagaagttt 4201 cagatctggg ctaccagatg ttctactgca atattagcag tcacaccttg catcacgttc 4261 tacaacccaa ggcctcaaac cagcatggaa ggagacagtg ttcttgtaat gtgacagaat 4321 ccgagttaag tggggcaatg actgtggaca cgtctgatcc agacagacct tggatatact 4381 ttaccaaaca gcaagagatc tgggccatgg atctggaagg atgtcagtgt tggaaagtca 4441 tcatggtacc tgctacccct ggaaaaagaa tcattagttt aacagtggat ggggagttta 4501 tatattggat cacaacaatg aaggacgaca cagaaattta tcaagcaaag aagggaagtg 4561 gggccatcct ctcccaggtg aaggccccca ggagtaagca tatcttggct tacagttcag 4621 ctctgcaacc ttttccagat aaagcatatc tgtctgtagc ttccaatatg gtagaagcaa 4681 gtatattgaa tgccaccaac accagcctca ttctcaagtt acctccagtc aagacaaacc 4741 tcacgtggca tggaattacc actcccacgt caacatacct ggtttactat atggaggcta 4801 atagggcaaa cagctctgac aggaaacaca acatgttgga atcacaggag aatgtagccc 4861 ggattgaagg tctgcagcca ttttcaacat acgtgattca gatagctgtg aagaactatt 4921 attctgatcc tttagaacat ctctctctgg gaaaagagat tcaaggaaaa actaaaagtg 4981 gagtgcccgg ggcagtttgt catatcaatg caactgtgct gtcggacacc agtcttcttg 5041 tattctggac agaatcgcat aaaccaaacg gacccaaaga gttagtccgc tatcagttgg 5101 ttatgtcata cctggctccg attcctgaga ctcctctaag acaggacgaa tttccaagcg 5161 ccaggctttc tctacttgtc actaaactct ctggtggaca acaatatgtg ctgaagatcc 5221 ttgcctgcca ctcagaggaa atgtggtgta ctgagagtca tcctgtcagt gtcaacatgt 5281 ttgacacacc ggagaaacct tctgccttgg ttccagagaa cactagtctg ctgttggatt 5341 ggaaggctcc gtctaacgct aacctcacca gattttggtt tgaactccag aagtggaagt 5401 atagtgagtt ttaccatgtc aaggcttcat gcagccaagg tccagtttat gtctgtaaca 5461 tcgcaaatct gcagccttac actccttata acatccgagt ggtggtggtc tatacgacag 5521 gagaaaatag ctcctcgatt cccgagagct tcaagacaaa agctggagtc ccaagcaaac 5581 cagggattcc taagttacta gaagggagta aaaattcaat ccagtgggaa aaagccgaag 5641 ataacgggaa cagattgatg tactacaccc tggaggtcag aaaaagcatt tcaaatgact 5701 cacgggacca gagtttaagg tggacggcgg tgtttaatgg gtcctgcagt agcatttgca 5761 catggaggtc aaaaaaccta aaaggaactt tccagttcag agcagtagcg tcaaatgcta 5821 ttggatttgg agaatacagt gaaatcagtg aagatattac attagtggaa gatggttttt 5881 ggataacaga aacaagtttt atacttacta tcatagttgg gatatttctg gttgccacag 5941 tcccactgac ctttgtctgg catagaagct tgaaaaacca caaagctacc aaggaaggcc 6001 tctcagttct caacgacaat gaccaagagt tggctgagct tcgaggactg gcggctggag 6061 tgggcctggc caatgcctgc tatgcagtac atactcttcc aacccaagag gagattgaaa 6121 gtcttcccgc cttccctcgg gagaagctga gcctgcgcct tctgttggga agtggagctt 6181 ttggagaagt gtacgagggc acagctgtag acatcctagg acggggaagt ggagaaatca 6241 aggtggccgt gaagaccctg aagaaaggtt cgacagacca ggagaagatc gagttcctga 6301 aggaggcaca cctgatgagc aagtttaatc accccaacat tctgaagcag ctgggagtct 6361 gtctgctgag tgaaccccag tacattatcc tggaactgat ggaaggggga gaccttctaa 6421 gctatctgcg caaagcccga gggacaacgt tgtctggccc tttactcaca ttggctgacc 6481 tggtagagct gtgtgtagat atttcaaaag gctgcgtcta cttggagcag atgcacttca 6541 ttcacaggga tctggcagct cggaattgcc ttgtgtctgt gaaagactat accagtcctc 6601 gggtagtcaa gatcggtgac tttggtttgg caagggaaat ctataagcat gattattata 6661 gaaagagagg ggaaggcctg cttcctgtcc ggtggatggc tcctgaaaac ttgatggatg 6721 gaatcttcac ttcccagtct gatgtatggt cttttggaat tttggtttgg gagattttaa 6781 ctcttggtca tcaaccttat ccagcgcatt ccaaccttga tgttttaaac tatgtgcaag 6841 caggagggag actggagcca ccgagaaact gtcctgatga tctgtggaat ttaatgttcc 6901 gatgttgggc ccaagaacct gaccaaagac ccactttcta taacattcaa gaccagcttc 6961 agttattcag aaatgtttcc ttaaacaatg tttctcactg tggacaagca gctcctgctg 7021 gtggagtcat caacaaaggc tttgaaggtg aagacaatga aatggccact ttgaattcag 7081 atgacacgat gccagttgcc ttgatggaaa ccaggaacca agaaggatta aattatatgg 7141 tacttgccac aaagtgtagc caaagtgagg atcgttatga gggtcctcta ggctctaagg 7201 aatctgggtt gcatgatctg aagaaagacg agaggcaacc agcagacaaa gatttctgcc 7261 agcaaccaca ggtggcttat ggctctcctg gccactctga aggcctgaac tatgcctgtc 7321 ttgctcacag tggacatgga gatgtgtctg aataatagta tctcatagga aacatagcac 7381 tgagatgaac actgtattaa gttaaaaaga agaaaggtgg ggtggcagtc tagactctga 7441 actgacacag ccaagttcca aagttctgat cttggttcca agagccatta tgtttcattc 7501 agcattctct ttaccagtga cgtaaccttc agtggattat cagaggaacc tgtgtgtgtg 7561 cggaaatccc aggacaaatc ctaagtctgg gaagaaaaca tcactgtctc tctcctctga 7621 agccctttac ttcagagcat tgcctgccct ggcaatctta ctaggttcat gcaaggatgt 7681 gagtggggga ggggccggag tctgctgagg accacctgaa ctacagatta ccttaagagg 7741 atgcaggaaa caattactca cacaggagga agcagcctgt ggaccatgag gaatcatctg 7801 gcacgctatt attccaataa aatattccct ttaatcatc // LOCUS RATCROS1B 8010 bp ss-mRNA ROD 25-JUL-1990 DEFINITION Rat lung-derived L01 c-ros-1 proto-oncogene mRNA, complete cds. ACCESSION M35105 KEYWORDS c-ros-1 proto-oncogene; tyrosine kinase. SOURCE Rat (strain Fischer) lung, cDNA to mRNA. ORGANISM Rattus norvegicus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 8010) AUTHORS Matsushime,H. and Shibuya,M. TITLE Tissue-specific expression of rat c-ros-1 gene and partial structural similarity of its predicted products with sev protein of Drosophila melanogaster JOURNAL J. Virol. 64, 2117-2125 (1990) STANDARD simple staff_review FEATURES from to/span description pept 402 5966 c-ros-1 unknown protein mRNA < 1 8010 c-ros-1 mRNA BASE COUNT 2197 a 1812 c 1930 g 2071 t ORIGIN 1 catagctcag ccaacctcaa agaagtgcgg tggctggccg acctgagtgt tctgcgtcag 61 gactgtgtgg actggctcgc tggaaagcaa tctaagttcc tactgcttat tttgcatgtg 121 gagagctctt ccacgatcta gcctttagcc agggaacgtc tttcattatg ggagtaaaag 181 gaagctaaac tataaaatag tcttgctgcg atgttctggg ctatctgaga tccaaaggtc 241 taaaccggtt tcaataagag agtacgatat tctaacatcg caaaagaaaa cagataaccc 301 accaagctca cttgcaaccg aagtatgaag cctaaagaat tgttaaagca acatggagac 361 atgaggacgc cagccgtgta ggaagctggc cttcctgagg gatgaagagg atccgctggc 421 tcaccccaaa acctgcgacc tttgtggtcc ttgggtgcgt atggatttcc gtggcgcagg 481 gtaccattct gagcagctgc ctaacgtcct gtgtaactaa cttgggcagg cagcttgaca 541 gcggcacccg gtacaatctg agtgaggcat gcatccaagg atgtcagttt tggaactcta 601 tagatcagga gaagtgtgct ttgaagtgta atgatacata tgtcaccatt tgtgagaggg 661 agtcctgtga ggtcggctgc agcaacgcgg agggtagcta cgaagaggaa gtgctggaca 721 acacagagct tcctacagca cccttcgcat cttccattgg aagtaacggg gtgacattac 781 gatggaaccc tgccaacatc tctggagtaa aatacatcat tcagtggaaa tatgcccaac 841 ttccgggaag ctgggcttac acagaaactg tgtctaagct ctcatacatg gtggaacccc 901 tgcatccatt tactgaatat atttttcgag tggtttggat tttcacagcc cagctgcacc 961 tttattcccc gccaagtccc agttacagga ctcatcctta tggagttcca gaaactgcgc 1021 ctttcatcac gaacatcgaa agctcgagcc ctgacactgt ggaggtcagc tgggctccac 1081 cctatttccc aggtggacct attttgggtt ataatttaag gctgatcagt aaaactcaaa 1141 aattagattc agggacacag agaaccagtt tccagtttta ttctactctt ccaaacacca 1201 cttacaggtt ttctatcgca gcagtcaatg aagtcggtga ggggccagaa gcagaatcta 1261 tgattaccac tccatcccca gcagttcaag aagaagaaca atggctcttt ttatccagaa 1321 aaacttctct aagaaagagg tctttgaagt acttagtaga cgaagcacat tgcctttggt 1381 cagatgctat acgtcataat attacaggaa tatcagtcaa cactcagcag gaagtggttt 1441 atttctcaga aggaaccatc atatggatga agggggctgc taacatgtct gatgtgtctg 1501 acctgaggat cttttatcga ggctcagctc tagtctcttc tatctctgta gactggcttt 1561 accaaaggat gtatttcatc atggataatc gggtgcatgt ctgtgactta aagcattgct 1621 caaatcttga ggaaatcact ccattctcta ttgttgcacc tcaaaaagtt gtggttgatt 1681 cctacaatgg ggacaccaaa gctgtgcgta ttgtggagag tggcacatta aaggacttcg 1741 cagtaaagcc gcagtccaag cgaatcattt acttcaatgg caccatgcaa gtcttcatgt 1801 cgacatttct ggatggctcg gcattccaca gggttctgcc gtgggtcccc cttgcggatg 1861 tgaagagctt tgcttgtgaa aacaatgact tcctcatcac agatggcaag gccattttcc 1921 aacaggactc tctgtctttc aatgagttca tcgtgggatg tgacctgagt cacatagaag 1981 aatttgggtt tggtaacttg gtcatctttg gctcctccgt ccagtcgtac cctctgccag 2041 gccatccaca ggaggtctcg gtgctgtttg gttctcgaga ggcccttatt cagtggaagc 2101 ctccgattct cgccatagga gccagtcctt ccgcctggca gaactggact tatgaggtca 2161 aagtttcctc ccaggacatt ctggaaacca ctcaagtttt cttgaacata agcaggactg 2221 tgctgaatgt acccaagctg caaagttcta caaagtacat ggtgtctgtg cgagcaagtt 2281 ctcctaaagg cccaggccca tggtcagaac cctcagtggg tactaccttg gtaccagcca 2341 ctgagccacc gttcatcatg gctgtgaaag aagatgggct ttggagcaaa ccactcagta 2401 gttttggccc aggagagttc ctatcctctg acgtaggaaa cgtgtcagat atggattggt 2461 ataacaacag cctctactac agtgacacaa aaggcaatgt gtatgtgcgg cctctgaatg 2521 ggatggatat ctcggagaat taccacatat ccagcattgc aggagcttgt gccttggcct 2581 ttgaatggct gggtcacttt ctctactggg ctgggaagac atatgtgatt caaaggcagt 2641 ctgtgttaac gggacacaca gacattgtga ctcacgtgaa gctgttggtg aatgacatgg 2701 ccgtggatcc agttggtggc tatctgtact ggacgacgct ctactcggtt gaaagcacca 2761 gactcaatgg agaaagttct cttgtactac aggctcagcc ctggctctct ggaaaaaagg 2821 ttattgctct aacattagac ctcagcgatg ggctcctgta ctggctggtg caggacaatc 2881 agtgtattca cctgtacacg gctgttctcc ggggatggag tggtgcggat gctaccatca 2941 ccgagtttgc agcctggagt acttctgaaa tttcccagaa tgcactgatg tactacagcg 3001 gtagactctt ctggatcaat ggctttagga tcatcacagc acaggaaata ggtcagagaa 3061 ccagcgtgtc tgtttctgag ccagggaaat tcaatcagtt tacgatcata cagacatccc 3121 tcaagcctct gccagggaac ttttcctcta ctcccacggt tatcccagat tctgttcagg 3181 agtcctcatt tcgaattgaa ggacacactt caagtttccg aatcctgtgg aatgagcccc 3241 ctgcggtgga ctggggcata gttttctaca gtgtggaatt tagtgctcat tctaagttcc 3301 tggctattga acaacagtct ttacctgttt ttactgtgga aggactggag ccctatgcct 3361 tatttaatct ttctgtcact ccttatacct attggggaaa aggtcaaaaa acatctctat 3421 catttcgagc gcctgaatca gttccgtcag caccagagaa ccccagaata tttatattgt 3481 cacttggaag atacaccagg aagaatgaag tcgtggtaga gtttaggtgg aataaaccta 3541 agcatgaaaa tggagtgcta accaaatctg aaatcttcta ccacatatct aaacaaagtg 3601 gcacaaataa atcaacggaa gactgggtat ctgtcagcgt tacaccgccg gtgatgtctt 3661 ttcaacttga agccatgagt cctgggtata ttgtttcctt ccaggttcga gtcttcacct 3721 ccaaagggcc aggaccattt tctgatatag tgatgtctaa aacatcagaa atcaagccat 3781 gtccatatct catatctctt cttggcaata agattgagtt cttagacatg gaccaaaatc 3841 aagttgtgtg gacattttcc ctggagggag ccgtcagcac agtggggtac acagcggatg 3901 atgaaatggg gtatttcgct caaggagatg cactcttcct tctgaatttg cacaatcatt 3961 ccagctccaa gcttttccag gacgtgctgg cttctgacat tgcggttatt gctgttgact 4021 ggatcgcaag gcacctctac tttgctctga aagcatcgca agatggaaca cagatattcg 4081 atgttgacct tgaacacaag gtgaaatccc ccagggaggt gaagatttgc aaaagccata 4141 cagcaataat ttctttctct atgtatcccc tcttaagtcg cctgtattgg acagaagttt 4201 cagatctggg ctaccagatg ttctactgca atattagcag tcacaccttg catcacgttc 4261 tacaacccaa ggcctcaaac cagcatggaa ggagacagtg ttcttgtaat gtgacagaat 4321 ccgagttaag tggggcaatg actgtggaca cgtctgatcc agacagacct tggatatact 4381 ttaccaaaca gcaagagatc tgggccatgg atctggaagg atgtcagtgt tggaaagtca 4441 tcatggtacc tgctacccct ggaaaaagaa tcattagttt aacagtggat ggggagttta 4501 tatattggat cacaacaatg aaggacgaca cagaaattta tcaagcaaag aagggaagtg 4561 gggccatcct ctcccaggtg aaggccccca ggagtaagca tatcttggct tacagttcag 4621 ctctgcaacc ttttccagat aaagcatatc tgtctgtagc ttccaatatg gtagaagcaa 4681 gtatattgaa tgccaccaac accagcctca ttctcaagtt acctccagtc aagacaaacc 4741 tcacgtggca tggaattacc actcccacgt caacatacct ggtttactat atggaggcta 4801 atagggcaaa cagctctgac aggaaacaca acatgttgga atcacaggag aatgtagccc 4861 ggattgaagg tctgcagcca ttttcaacat acgtgattca gatagctgtg aagaactatt 4921 attctgatcc tttagaacat ctctctctgg gaaaagagat tcaaggaaaa actaaaagtg 4981 gagtgcccgg ggcagtttgt catatcaatg caactgtgct gtcggacacc agtcttcttg 5041 tattctggac agaatcgcat aaaccaaacg gacccaaaga gttagtccgc tatcagttgg 5101 ttatgtcata cctggctccg attcctgaga ctcctctaag acaggacgaa tttccaagcg 5161 ccaggctttc tctacttgtc actaaactct ctggtggaca acaatatgtg ctgaagatcc 5221 ttgcctgcca ctcagaggaa atgtggtgta ctgagagtca tcctgtcagt gtcaacatgt 5281 ttgacacacc ggagaaacct tctgccttgg ttccagagaa cactagtctg ctgttggatt 5341 ggaaggctcc gtctaacgct aacctcacca gattttggtt tgaactccag aagtggaagt 5401 atagtgagtt ttaccatgtc aaggcttcat gcagccaagg tccagtttat gtctgtaaca 5461 tcgcaaatct gcagccttac actccttata acatccgagt ggtggtggtc tatacgacag 5521 gagaaaatag ctcctcgatt cccgagagct tcaagacaaa agctggagtc ccaagcaaac 5581 cagggattcc taagttacta gaagggagta aaaattcaat ccagtgggaa aaagccgaag 5641 ataacgggaa cagattgatg tactacaccc tggaggtcag aaaaagcatt tcaaatgact 5701 cacgggacca gagtttaagg tggacggcgg tgtttaatgg gtcctgcagt agcatttgca 5761 catggaggtc aaaaaaccta aaaggaactt tccagttcag agcagtagcg tcaaatgcta 5821 ttggatttgg agaatacagt gaaatcagtg aagatattac attagtggaa gatggttttt 5881 ggataacaga aacaagtttt atacttacta tcatagttgg gatatttctg gttgccacag 5941 tcccactgac ctttgcctgt cactgaagct ggggctcaca gatcagctag gccggctggc 6001 caacagatcc ccgagatctg cctgcctctg acctctacct ccaacactgg ggctacagat 6061 gtgtgctaca ttctcagtat ttaactgggt gctgaggaac caagcacagg tcctcatgct 6121 cgtaagtctg gcatagaagc ttgaaaaacc acaaagctac caaggaaggc ctctcagttc 6181 tcaacgacaa tgaccaagag ttggctgagc ttcgaggact ggcggctgga gtgggcctgg 6241 ccaatgcctg ctatgcagta catactcttc caacccaaga ggagattgaa agtcttcccg 6301 ccttccctcg ggagaagctg agcctgcgcc ttctgttggg aagtggagct tttggagaag 6361 tgtacgaggg cacagctgta gacatcctag gacggggaag tggagaaatc aaggtggccg 6421 tgaagaccct gaagaaaggt tcgacagacc aggagaagat cgagttcctg aaggaggcac 6481 acctgatgag caagtttaat caccccaaca ttctgaagca gctgggagtc tgtctgctga 6541 gtgaacccca gtacattatc ctggaactga tggaaggggg agaccttcta agctatctgc 6601 gcaaagcccg agggacaacg ttgtctggcc ctttactcac attggctgac ctggtagagc 6661 tgtgtgtaga tatttcaaaa ggctgcgtct acttggagca gatgcacttc attcacaggg 6721 atctggcagc tcggaattgc cttgtgtctg tgaaagacta taccagtcct cgggtagtca 6781 agatcggtga ctttggtttg gcaagggaaa tctataagca tgattattat agaaagagag 6841 gggaaggcct gcttcctgtc cggtggatgg ctcctgaaaa cttgatggat ggaatcttca 6901 cttcccagtc tgatgtatgg tcttttggaa ttttggtttg ggagatttta actcttggtc 6961 atcaacctta tccagcgcat tccaaccttg atgttttaaa ctatgtgcaa gcaggaggga 7021 gactggagcc accgagaaac tgtcctgatg atctgtggaa tttaatgttc cgatgttggg 7081 cccaagaacc tgaccaaaga cccactttct ataacattca agaccagctt cagttattca 7141 gaaatgtttc cttaaacaat gtttctcact gtggacaagc agctcctgct ggtggagtca 7201 tcaacaaagg ctttgaaggt gaagacaatg aaatggccac tttgaattca gatgacacga 7261 tgccagttgc cttgatggaa accaggaacc aagaaggatt aaattatatg gtacttgcca 7321 caaagtgtag ccaaagtgag gatcgttatg agggtcctct aggctctaag gaatctgggt 7381 tgcatgatct gaagaaagac gagaggcaac cagcagacaa agatttctgc cagcaaccac 7441 aggtggctta tggctctcct ggccactctg aaggcctgaa ctatgcctgt cttgctcaca 7501 gtggacatgg agatgtgtct gaataatagt atctcatagg aaacatagca ctgagatgaa 7561 cactgtatta agttaaaaag aagaaaggtg gggtggcagt ctagactctg aactgacaca 7621 gccaagttcc aaagttctga tcttggttcc aagagccatt atgtttcatt cagcattctc 7681 tttaccagtg acgtaacctt cagtggatta tcagaggaac ctgtgtgtgt gcggaaatcc 7741 caggacaaat cctaagtctg ggaagaaaac atcactgtct ctctcctctg aagcccttta 7801 cttcagagca ttgcctgccc tggcaatctt actaggttca tgcaaggatg tgagtggggg 7861 aggggccgga gtctgctgag gaccacctga actacagatt accttaagag gatgcaggaa 7921 acaattactc acacaggagg aagcagcctg tggaccatga ggaatcatct ggcacgctat 7981 tattccaata aaatattccc tttaatcatc // LOCUS RATCROS1C 7902 bp ss-mRNA ROD 25-JUL-1990 DEFINITION Rat heart-derived c-ros-1 proto-oncogene mRNA, complete cds. ACCESSION M35106 KEYWORDS c-ros-1 proto-oncogene; tyrosine kinase. SOURCE Rat (strain Fischer) heart, cDNA to mRNA. ORGANISM Rattus norvegicus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 7902) AUTHORS Matsushime,H. and Shibuya,M. TITLE Tissue-specific expression of rat c-ros-1 gene and partial structural similarity of its predicted products with sev protein of Drosophila melanogaster JOURNAL J. Virol. 64, 2117-2125 (1990) STANDARD simple staff_review FEATURES from to/span description pept 402 7418 c-ros-1 tyrosine kinase (put.) mRNA < 1 7902 c-ros-1 mRNA BASE COUNT 2171 a 1775 c 1899 g 2057 t ORIGIN 1 catagctcag ccaacctcaa agaagtgcgg tggctggccg acctgagtgt tctgcgtcag 61 gactgtgtgg actggctcgc tggaaagcaa tctaagttcc tactgcttat tttgcatgtg 121 gagagctctt ccacgatcta gcctttagcc agggaacgtc tttcattatg ggagtaaaag 181 gaagctaaac tataaaatag tcttgctgcg atgttctggg ctatctgaga tccaaaggtc 241 taaaccggtt tcaataagag agtacgatat tctaacatcg caaaagaaaa cagataaccc 301 accaagctca cttgcaaccg aagtatgaag cctaaagaat tgttaaagca acatggagac 361 atgaggacgc cagccgtgta ggaagctggc cttcctgagg gatgaagagg atccgctggc 421 tcaccccaaa acctgcgacc tttgtggtcc ttgggtgcgt atggatttcc gtggcgcagg 481 gtaccattct gagcagctgc ctaacgtcct gtgtaactaa cttgggcagg cagcttgaca 541 gcggcacccg gtacaatctg agtgaggcat gcatccaagg atgtcagttt tggaactcta 601 tagatcagga gaagtgtgct ttgaagtgta atgatacata tgtcaccatt tgtgagaggg 661 agtcctgtga ggtcggctgc agcaacgcgg agggtagcta cgaagaggaa gtgctggaca 721 acacagagct tcctacagca cccttcgcat cttccattgg aagtaacggg gtgacattac 781 gatggaaccc tgccaacatc tctggagtaa aatacatcat tcagtggaaa tatgcccaac 841 ttccgggaag ctgggcttac acagaaactg tgtctaagct ctcatacatg gtggaacccc 901 tgcatccatt tactgaatat atttttcgag tggtttggat tttcacagcc cagctgcacc 961 tttattcccc gccaagtccc agttacagga ctcatcctta tggagttcca gaaactgcgc 1021 ctttcatcac gaacatcgaa agctcgagcc ctgacactgt ggaggtcagc tgggctccac 1081 cctatttccc aggtggacct attttgggtt ataatttaag gctgatcagt aaaactcaaa 1141 aattagattc agggacacag agaaccagtt tccagtttta ttctactctt ccaaacacca 1201 cttacaggtt ttctatcgca gcagtcaatg aagtcggtga ggggccagaa gcagaatcta 1261 tgattaccac tccatcccca gcagttcaag aagaagaaca atggctcttt ttatccagaa 1321 aaacttctct aagaaagagg tctttgaagt acttagtaga cgaagcacat tgcctttggt 1381 cagatgctat acgtcataat attacaggaa tatcagtcaa cactcagcag gaagtggttt 1441 atttctcaga aggaaccatc atatggatga agggggctgc taacatgtct gatgtgtctg 1501 acctgaggat cttttatcga ggctcagctc tagtctcttc tatctctgta gactggcttt 1561 accaaaggat gtatttcatc atggataatc gggtgcatgt ctgtgactta aagcattgct 1621 caaatcttga ggaaatcact ccattctcta ttgttgcacc tcaaaaagtt gtggttgatt 1681 cctacaatgg gtatgtcttt tatctcctaa gagacggcat ttatagagtc catcttcctt 1741 tgccgtctgt cagggacacc aaagctgtgc gtattgtgga gagtggcaca ttaaaggact 1801 tcgcagtaaa gccgcagtcc aagcgaatca tttacttcaa tggcaccatg caagtcttca 1861 tgtcgacatt tctggatggc tcggcattcc acagggttct gccgtgggtc ccccttgcgg 1921 atgtgaagag ctttgcttgt gaaaacaatg acttcctcat cacagatggc aaggccattt 1981 tccaacagga ctctctgtct ttcaatgagt tcatcgtggg atgtgacctg agtcacatag 2041 aagaatttgg gtttggtaac ttggtcatct ttggctcctc cgtccagtcg taccctctgc 2101 caggccatcc acaggaggtc tcggtgctgt ttggttctcg agaggccctt attcagtgga 2161 agcctccgat tctcgccata ggagccagtc cttccgcctg gcagaactgg acttatgagg 2221 tcaaagtttc ctcccaggac attctggaaa ccactcaagt tttcttgaac ataagcagga 2281 ctgtgctgaa tgtacccaag ctgcaaagtt ctacaaagta catggtgtct gtgcgagcaa 2341 gttctcctaa aggcccaggc ccatggtcag aaccctcagt gggtactacc ttggtaccag 2401 ccactgagcc accgttcatc atggctgtga aagaagatgg gctttggagc aaaccactca 2461 gtagttttgg cccaggagag ttcctatcct ctgacgtagg aaacgtgtca gatatggatt 2521 ggtataacaa cagcctctac tacagtgaca caaaaggcaa tgtgtatgtg cggcctctga 2581 atgggatgga tatctcggag aattaccaca tatccagcat tgcaggagct tgtgccttgg 2641 cctttgaatg gctgggtcac tttctctact gggctgggaa gacatatgtg attcaaaggc 2701 agtctgtgtt aacgggacac acagacattg tgactcacgt gaagctgttg gtgaatgaca 2761 tggccgtgga tccagttggt ggctatctgt actggacgac gctctactcg gttgaaagca 2821 ccagactcaa tggagaaagt tctcttgtac tacaggctca gccctggctc tctggaaaaa 2881 aggttattgc tctaacatta gacctcagcg atgggctcct gtactggctg gtgcaggaca 2941 atcagtgtat tcacctgtac acggctgttc tccggggatg gagtggtgcg gatgctacca 3001 tcaccgagtt tgcagcctgg agtacttctg aaatttccca gaatgcactg atgtactaca 3061 gcggtagact cttctggatc aatggcttta ggatcatcac agcacaggaa ataggtcaga 3121 gaaccagcgt gtctgtttct gagccaggga aattcaatca gtttacgatc atacagacat 3181 ccctcaagcc tctgccaggg aacttttcct ctactcccac ggttatccca gattctgttc 3241 aggagtcctc atttcgaatt gaaggacaca cttcaagttt ccgaatcctg tggaatgagc 3301 cccctgcggt ggactggggc atagttttct acagtgtgga atttagtgct cattctaagt 3361 tcctggctat tgaacaacag tctttacctg tttttactgt ggaaggactg gagccctatg 3421 ccttatttaa tctttctgtc actccttata cctattgggg aaaaggtcaa aaaacatctc 3481 tatcatttcg agcgcctgaa tcagttccgt cagcaccaga gaaccccaga atatttatat 3541 tgtcacttgg aagatacacc aggaagaatg aagtcgtggt agagtttagg tggaataaac 3601 ctaagcatga aaatggagtg ctaaccaaat ctgaaatctt ctaccacata tctaaacaaa 3661 gtggcacaaa taaatcaacg gaagactggg tatctgtcag cgttacaccg ccggtgatgt 3721 cttttcaact tgaagccatg agtcctgggt atattgtttc cttccaggtt cgagtcttca 3781 cctccaaagg gccaggacca ttttctgata tagtgatgtc taaaacatca gaaatcaagc 3841 catgtccata tctcatatct cttcttggca ataagattga gttcttagac atggaccaaa 3901 atcaagttgt gtggacattt tccctggagg gagccgtcag cacagtgggg tacacagcgg 3961 atgatgaaat ggggtatttc gctcaaggag atgcactctt ccttctgaat ttgcacaatc 4021 attccagctc caagcttttc caggacgtgc tggcttctga cattgcggtt attgctgttg 4081 actggatcgc aaggcacctc tactttgctc tgaaagcatc gcaagatgga acacagatat 4141 tcgatgttga ccttgaacac aaggtgaaat cccccaggga ggtgaagatt tgcaaaagcc 4201 atacagcaat aatttctttc tctatgtatc ccctcttaag tcgcctgtat tggacagaag 4261 tttcagatct gggctaccag atgttctact gcaatattag cagtcacacc ttgcatcacg 4321 ttctacaacc caaggcctca aaccagcatg gaaggagaca gtgttcttgt aatgtgacag 4381 aatccgagtt aagtggggca atgactgtgg acacgtctga tccagacaga ccttggatat 4441 actttaccaa acagcaagag atctgggcca tggatctgga aggatgtcag tgttggaaag 4501 tcatcatggt acctgctacc cctggaaaaa gaatcattag tttaacagtg gatggggagt 4561 ttatatattg gatcacaaca atgaaggacg acacagaaat ttatcaagca aagaagggaa 4621 gtggggccat cctctcccag gtgaaggccc ccaggagtaa gcatatcttg gcttacagtt 4681 cagctctgca accttttcca gataaagcat atctgtctgt agcttccaat atggtagaag 4741 caagtatatt gaatgccacc aacaccagcc tcattctcaa gttacctcca gtcaagacaa 4801 acctcacgtg gcatggaatt accactccca cgtcaacata cctggtttac tatatggagg 4861 ctaatagggc aaacagctct gacaggaaac acaacatgtt ggaatcacag gagaatgtag 4921 cccggattga aggtctgcag ccattttcaa catacgtgat tcagatagct gtgaagaact 4981 attattctga tcctttagaa catctctctc tgggaaaaga gattcaagga aaaactaaaa 5041 gtggagtgcc cggggcagtt tgtcatatca atgcaactgt gctgtcggac accagtcttc 5101 ttgtattctg gacagaatcg cataaaccaa acggacccaa agagttagtc cgctatcagt 5161 tggttatgtc atacctggct ccgattcctg agactcctct aagacaggac gaatttccaa 5221 gcgccaggct ttctctactt gtcactaaac tctctggtgg acaacaatat gtgctgaaga 5281 tccttgcctg ccactcagag gaaatgtggt gtactgagag tcatcctgtc agtgtcaaca 5341 tgtttgacac accggagaaa ccttctgcct tggttccaga gaacactagt ctgctgttgg 5401 attggaaggc tccgtctaac gctaacctca ccagattttg gtttgaactc cagaagtgga 5461 agtatagtga gttttaccat gtcaaggctt catgcagcca aggtccagtt tatgtctgta 5521 acatcgcaaa tctgcagcct tacactcctt ataacatccg agtggtggtg gtctatacga 5581 caggagaaaa tagctcctcg attcccgaga gcttcaagac aaaagctgga gtcccaagca 5641 aaccagggat tcctaagtta ctagaaggga gtaaaaattc aatccagtgg gaaaaagccg 5701 aagataacgg gaacagattg atgtactaca ccctggaggt cagaaaaagc atttcaaatg 5761 actcacggga ccagagttta aggtggacgg cggtgtttaa tgggtcctgc agtagcattt 5821 gcacatggag gtcaaaaaac ctaaaaggaa ctttccagtt cagagcagta gcgtcaaatg 5881 ctattggatt tggagaatac agtgaaatca gtgaagatat tacattagtg gaagatggtt 5941 tttggataac agaaacaagt tttatactta ctatcatagt tgggatattt ctggttgcca 6001 cagtcccact gacctttgtc tggcatagaa gcttgaaaaa ccacaaagct accaaggaag 6061 gcctctcagt tctcaacgac aatgaccaag agttggctga gcttcgagga ctggcggctg 6121 gagtgggcct ggccaatgcc tgctatgcag tacatactct tccaacccaa gaggagattg 6181 aaagtcttcc cgccttccct cgggagaagc tgagcctgcg ccttctgttg ggaagtggag 6241 cttttggaga agtgtacgag ggcacagctg tagacatcct aggacgggga agtggagaaa 6301 tcaaggtggc cgtgaagacc ctgaagaaag gttcgacaga ccaggagaag atcgagttcc 6361 tgaaggaggc acacctgatg agcaagttta atcaccccaa cattctgaag cagctgggag 6421 tctgtctgct gagtgaaccc cagtacatta tcctggaact gatggaaggg ggagaccttc 6481 taagctatct gcgcaaagcc cgagggacaa cgttgtctgg ccctttactc acattggctg 6541 acctggtaga gctgtgtgta gatatttcaa aaggctgcgt ctacttggag cagatgcact 6601 tcattcacag ggatctggca gctcggaatt gccttgtgtc tgtgaaagac tataccagtc 6661 ctcgggtagt caagatcggt gactttggtt tggcaaggga aatctataag catgattatt 6721 atagaaagag aggggaaggc ctgcttcctg tccggtggat ggctcctgaa aacttgatgg 6781 atggaatctt cacttcccag tctgatgtat ggtcttttgg aattttggtt tgggagattt 6841 taactcttgg tcatcaacct tatccagcgc attccaacct tgatgtttta aactatgtgc 6901 aagcaggagg gagactggag ccaccgagaa actgtcctga tgatctgtgg aatttaatgt 6961 tccgatgttg ggcccaagaa cctgaccaaa gacccacttt ctataacatt caagaccagc 7021 ttcagttatt cagaaatgtt tccttaaaca atgtttctca ctgtggacaa gcagctcctg 7081 ctggtggagt catcaacaaa ggctttgaag gtgaagacaa tgaaatggcc actttgaatt 7141 cagatgacac gatgccagtt gccttgatgg aaaccaggaa ccaagaagga ttaaattata 7201 tggtacttgc cacaaagtgt agccaaagtg aggatcgtta tgagggtcct ctaggctcta 7261 aggaatctgg gttgcatgat ctgaagaaag acgagaggca accagcagac aaagatttct 7321 gccagcaacc acaggtggct tatggctctc ctggccactc tgaaggcctg aactatgcct 7381 gtcttgctca cagtggacat ggagatgtgt ctgaataata gtatctcata ggaaacatag 7441 cactgagatg aacactgtat taagttaaaa agaagaaagg tggggtggca gtctagactc 7501 tgaactgaca cagccaagtt ccaaagttct gatcttggtt ccaagagcca ttatgtttca 7561 ttcagcattc tctttaccag tgacgtaacc ttcagtggat tatcagagga acctgtgtgt 7621 gtgcggaaat cccaggacaa atcctaagtc tgggaagaaa acatcactgt ctctctcctc 7681 tgaagccctt tacttcagag cattgcctgc cctggcaatc ttactaggtt catgcaagga 7741 tgtgagtggg ggaggggccg gagtctgctg aggaccacct gaactacaga ttaccttaag 7801 aggatgcagg aaacaattac tcacacagga ggaagcagcc tgtggaccat gaggaatcat 7861 ctggcacgct attattccaa taaaatattc cctttaatca tc // LOCUS HUMFVIIIM 65 bp ds-DNA PRI 25-JUL-1990 DEFINITION Human mutant coagulation factor VIII exon 13 duplication region. ACCESSION M34731 KEYWORDS coagulation factor VIII. SOURCE Human DNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 65) AUTHORS Murru,S., Casula,L., Pecorara,M., Mori,P., Cao,A. and Pirastu,M. TITLE Illegitimate recombination produced a duplication within the FVIII gene in a patient with mild hemophilia A JOURNAL Genomics 7, 115-118 (1990) STANDARD simple staff_review COMMENT As a result of illegitimate recombination of two misaligned chromosomes, exon 13 of the factor VIII is duplicated in its entirety. The exon undergoes normal splicing and its incorporation into the mRNA generates an unstable protein. FEATURES from to/span description recomb 25 26 chromosome DNA end; misaligned chromosome DNA start BASE COUNT 26 a 3 c 11 g 25 t ORIGIN 1 aagttttagg ggtacatgtg cacaattagt ttgaaataat ttaattagtt tgaaataatt 61 taaaa // LOCUS EUBBAIA3 2596 bp ds-DNA BCT 25-JUL-1990 DEFINITION Eubacterium sp. baiA3 protein gene, complete cds. ACCESSION M34658 KEYWORDS . SOURCE Eubacterium sp. (strain VPI 12708) DNA. ORGANISM Eubacterium sp. Prokaryota; Bacteria; Firmicutes; Irregular asporogenous rods. REFERENCE 1 (bases 135 to 2242) AUTHORS Gopal-Srivastava,R., Mallonee,D.H., White,W.B. and Hylemon,P.B. TITLE Multiple copies of a bile acid-inducible gene in Eubacterium sp. strain VPI 12708 JOURNAL J. Bacteriol. (1990) In press STANDARD full staff_review REFERENCE 2 (bases 1 to 134; 2243 to 2596) AUTHORS Gopal-Srivastava,R., Mallonee,D.H., White,W.B. and Hylemon,P.B. JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1],[2] kindly submitted by D.H.Mallonee, 24-MAY-1990. FEATURES from to/span description pept 1165 1914 baiA3 protein gene BASE COUNT 778 a 521 c 633 g 664 t ORIGIN 1 tccctgtgct ttcttctgca gttcataaaa tccgccgcca caaatccaag aatccacaat 61 agactcagaa gcaaggcgta ttccagcgca tccattggga tattatacaa atagaatagc 121 aaggcaaata tggccatgat cccggcgtac atgcctattc ccctgatatg atccctgata 181 tatcttcctg tcaatctcat gcctgcacca tatatcctat tccttttttc gttacgatcc 241 atttgcattt atcctgtctg atgcggatta tgcatcgtat accgctggcc ttacagagga 301 taacaaagag gatatggttt tctttaacgt gaaggatgtg atggatactt atccattcgc 361 caaagaactg gaagaagaat atatcgcgca tgccacagat atctcggacc attattttct 421 ttatgatgcc cgcgaagaag aacttgcaaa aaaagcaggg gaaccctaca catattcagg 481 cagggtaggg atgacggcgg acaatccgga acttcttcag gactggaaat atgcgcctgc 541 cttcaaagtt cttacaaaag gggaggttat gcagatgatt gcggtattcg tgatgcttag 601 cgcctacatt gcgataattg ccctggcggc aatcggggtt atgacttatg taagaagcgt 661 taccattgct gtcgataaca ggcagctgtt cgaggatatg aagaagctgg gggccagccg 721 ggattatgag acgcgggtgg taaaagtaca gcttcgcaag atcttcttat atcccggtat 781 cgcaggatgc gggatatccc tggtctttac ggtcctgatg ctctttttta acaatatgcg 841 cctggaaatt gaagaaatca ggctgatcgg aatcgagagc attatgattg gggcatccgc 901 catcttcctg tacgtactgt accggatctc ttttcggaag atgagaagca tgctggatct 961 atagggaaac aaaatagtga tagtgtttgc aaactttttg tccatggact gcttatattt 1021 tgcaattaaa aaagaacttt acaagttgta agatgccgtg tgattttcca atgtcgcgtc 1081 ctgtaaaatg ttaaagttgt atcaatcgat acgatacttt ggcagatatg ataagccaaa 1141 ggaaaagaaa ggaaggaaaa gttcatgaaa cttgtacagg acaaaattac aattatcaca 1201 ggcggaaccc gtggaatcgg attcgcagca gcaaaactct ttattgagaa tggagcaaaa 1261 gtctccatat ttggcgagac ccaggaagag gtagacacag cgctggctca gttaaaggaa 1321 ctctatccgg aggaagaggt attaggattc gctccagacc ttacatcaag agatgctgtt 1381 atggcagcag ttggaacggt tgcacagaag tacggaagac tggatgtcat gatcaacaac 1441 gcaggcatta caatgaattc tgtattctcc agggtatcag aagaggattt caaaaatata 1501 atggacatca atgttaacgg cgtattcaat ggcgcatggt ctgcttatca gtgcatgaaa 1561 gatgcaaagc agggcgttat catcaatacg gcatctgtaa ccggaatcta tggttcctta 1621 tcaggaatcg gatatcctac cagcaaggcg ggcgtaatcg gcctgactca tggtcttgga 1681 agagagatta tccgtaagaa catccgtgta gttggcgttg cacctggcgt tgtagataca 1741 gatatgacga aggggcttcc accggagatc ctggaggact acttgaagac actgccaatg 1801 aagagaatgc ttaagccgga agagatcgcg aatgtatatc tgttccttgc atccgacctg 1861 gctagcggca tcacggctac gacgatcagc gtagatgggg cttacaggcc atagaaaaga 1921 catactgcta ttaattccat agttcatact ccaagaacag gcaggcaaga ggcatttgcg 1981 ttttagcgcg gatgcccggg cctgcctgat ttaattcagc tggtatatca tgaaattcag 2041 atatgcggcg aacaggcacc atataaggta ggggatctgt agataggcgg caacaggact 2101 tatcttgtga aactgatata tcatcagggc tatgaggatg ataagcacga gaagccataa 2161 aaatgcaaag aggtacatgg aaaagccgaa aaagaatatg ctccagagga agttgaagaa 2221 cagctggata aaatatagtc gaagcgcctt attcttttca ggagtttcgg attcatagat 2281 tatataagaa gatatcccca ttaatatata taatatggtc cagacgatgg gaaataggaa 2341 ggacggagga ctaagaggcg gcttattcaa tgccaaatag gccgccgaat tgccgcttaa 2401 gagagcagac aaggatcctg ccgcaagagg aataaggata aaaataatga gagcgctttt 2461 gtttttgatg ttcatatata ccggctccag gcatgacttt caatattata tgaaaaatct 2521 ccgggaaata tgaacggtat ctccggcttt acttgccgct ctttgacttg cccgccgtct 2581 ctttgagcag ttccag // LOCUS ECOTRAU 1080 bp ds-DNA BCT 25-JUL-1990 DEFINITION E.coli F plasmid transfer operon: traU gene, complete cds; traW gene, 3' end; and trbC gene, 5' end. ACCESSION M34695 KEYWORDS periplasmic protein; transfer operon. SOURCE E.coli F Plasmid (strain K12; isolate Flac plasmid FLO) DNA, clones pKI[182;282;175]. ORGANISM Plasmid F Prokaryota; Bacteria. REFERENCE 1 (bases 1 to 1080) AUTHORS Moore,D., Maneewannakul,K., Maneewannakul,S., Wu,J.H., Ippen-Ihler,K.A. and Bradley,D.E. TITLE Characterization of the F plasmid conjugative transfer gene traU JOURNAL J. Bacteriol. (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by K.A.Ippen-Ihler, 25-MAY-1990. FEATURES from to/span description pept 49 1041 traU protein precursor sigp 1 66 traU protein signal peptide matp 67 1038 traU protein pept < 1 52 traW protein pept 1050 > 1080 trbC protein BASE COUNT 242 a 281 c 311 g 246 t ORIGIN Map position 77.9-78.9 units on the genome. 1 cgatcgcttc ctgaaggtgg aatttattcc ggcagaggag ggcagaaaat gaagcgaagg 61 ctgtggctgc tgatgttatt ccttttcgcc ggtcatgtcc ctgcggcgtc tgcggattct 121 gcctgtgagg ggcgttttgt aaacccgatc acagatatct gctggagctg tattttcccg 181 ctctcgctgg gcagtatcaa agtcagtcag ggcaaggtcc ccgacacggc gaacccgtcg 241 atgcccattc agatttgtcc ggcaccgccg ccgctgttca ggcgtatcgg gctggccatt 301 ggttactggg agccgatggc gttgacggac gtcacccggt caccgggatg catggtgaac 361 ctgggcttca gcctgccggc ttttggtaaa acggcacagg gaacggcgaa aaaggatgag 421 aagcaggtaa atggggcgtt ctatcacgtt cactggtaca aatacccgct gacgtactgg 481 ctgaacatca tcacatcgct gggctgtctg gaaggtggtg acatggatat cgcttatctt 541 tctgaaatcg accccacctg gacggacagc agcctgacca ccattctcaa tccggaagct 601 gtcatctttg ccaatccgat agcacaggga gcctgcgcag cagatgcgat tgccagcgcc 661 tttaatatgc ctctcgatgt tctgttctgg tgtgccggtt cgcagggaag tatgtacccg 721 ttcaatggct gggtgagtaa tgagtccagt ccgttgcagt cctccctgct ggtcagtgaa 781 cgcatggcgt tcaagctgca ccgtcagggc atgattatgg aaaccatcgg gaaaaataac 841 gccgtctgta atgaatatcc gtccccaatc ctgcccaaag aacgctggcg ttaccagatg 901 gtgaatatgt atccggacag cgggcagtgc cacccgttcg ggcgcagcgt gacccgctgg 961 gaaaccggga aaaatccgcc caacacaaag aaaaacttcg gctacctgat gtggcgtaaa 1021 cgtaactgtg tcttcctgtg aggtgaatga tgaagctgag tatgaaatct ctggcagcac // LOCUS MUSSMRNAA 74 bp ss-RNA RNA 25-JUL-1990 DEFINITION Mouse brain-specific small RNA, clone pABr-4. ACCESSION M35067 KEYWORDS small RNA. SOURCE Mouse 17-day fetus, cDNA to RNA, clone pABr-4. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 74) AUTHORS Anzai,K., Kobayashi,S., Suehiro,Y. and Goto,S. TITLE Conservation of the ID sequence and its expression as small RNA in rodent brains: Analysis with cDNA for mouse brain-specific small RNA JOURNAL Mol. Brain Res. 2, 43-49 (1987) STANDARD simple staff_review FEATURES from to/span description RNA < 1 > 74 brain-specific small RNA BASE COUNT 12 a 16 c 28 g 18 t ORIGIN 1 ggggttgggg atttagctca gtggtagagc gcttgcctag caagcaaggc cctgggttcg 61 gtcctaagct ctgg // LOCUS MUSSMRNAB 74 bp ss-RNA RNA 25-JUL-1990 DEFINITION Mouse brain-specific small RNA, clone pABr-9. ACCESSION M36619 KEYWORDS small RNA. SOURCE Mouse 17-day fetus, cDNA to RNA, clone pABr-9. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 74) AUTHORS Anzai,K., Kobayashi,S., Suehiro,Y. and Goto,S. TITLE Conservation of the ID sequence and its expression as small RNA in rodent brains: Analysis with cDNA for mouse brain-specific small RNA JOURNAL Mol. Brain Res. 2, 43-49 (1987) STANDARD simple staff_review FEATURES from to/span description RNA < 1 > 74 brain-specific small RNA BASE COUNT 11 a 15 c 28 g 20 t ORIGIN 1 ggggttgggg atttagctta gtggtagagc ttgcctagca agcgcaaggc cctgggttcg 61 gtccttagct ctgg // LOCUS BOVPRLB 1214 bp ds-DNA MAM 25-JUL-1990 DEFINITION Bovine prolactin gene, exon 5. ACCESSION M34535 KEYWORDS prolactin. SOURCE Bovine pituitary DNA, and cDNA to mRNA. ORGANISM Bos taurus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Artiodactyla; Ruminantia; Pecora; Bovidae. REFERENCE 1 (bases 1 to 628) AUTHORS Carroll,S.M., Narayan,P. and Rottman,F.M. TITLE N-6-methyladenosine resides in an intron-specific region of bovine prolactin pre-mRNA JOURNAL Unpublished (1990) STANDARD full staff_review REFERENCE 2 (bases 629 to 1214) AUTHORS Carroll,S.M., Narayan,P. and Rottman,F.M. JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by P.Narayan, 16-MAY-1990. The cDNA sequence which corresponds to this gene is found in J.B.C. 257: 678-681 (1982), accession number M25007. Author address: P.Narayan Dept. of Molecular Biology and Microbiology School of Medicine Case Western University Cleveland, OH 44106 FEATURES from to/span description pept / 629 820 prolactin, exon 5 (AA at 629) pre-msg < 1 971 prolactin mRNA and intron IVS < 1 628 prolactin intron D BASE COUNT 400 a 229 c 197 g 388 t ORIGIN 1 gtgagcttca tgaaagcttc cttgctattt tcatgaatga gagaggtgat ttctgtaatg 61 aggaatgagt tttgaactat ctcactgtac aagaacacaa ttcaggcctt ctttttctag 121 accggtgtta cataaagcaa gaacctgttc attcatagtg atagattcta ttgtaagtga 181 attagaattc caccagcaat ttttcacaga ggtatagtct ttcttgaatt gtacagttac 241 accaaaatct tgcctcttcc tgggtacaga tggctgaaat attttcaagg ataagagaat 301 tagagaatac aatttgcaag ataaatgttt tcttcaaaat atcccaagat atcctctact 361 gaaattcagc ttgtattctt tctctattct cctcaaacca caggatgaga atgagaagaa 421 agaaaagaga agatcaaaac caaatacttg agttctgctt tagtttttat taataaatta 481 ctaacatata tctgatacac tggctccaaa atccaagtgt agagactttc atgtatcttc 541 cctaattttt aatttgataa atagaaagaa caaagatgag ctaatactac taaaactcat 601 aataactcat tatcttttgg atgtttaggt tattcctgga gccaaagaga ctgagcccta 661 ccctgtgtgg tcaggactcc cgtccctgca aactaaggat gaagatgcac gttattctgc 721 tttttataac ctgctccact gcctgcgcag ggattcaagc aagattgaca cttaccttaa 781 gctcctgaat tgcagaatca tctacaacaa caactgctaa gcccacattc catcctatcc 841 atttctgaga tggttcttaa tgatccattc cctggcaaac ttctctgagc tttatagctt 901 tgtaatgcat gcttggctct aatgggtttc atcttaaata aaaacagact ctgtagcgat 961 gtcaaaatct aagactgcaa ttttgtcaat gtttcttatc ttcatttaat agacaatcaa 1021 atgaaaatcc ttccttatga ttgagagaaa gaacttctga ttaaaatttg tcacaaatag 1081 cagaaactga cattacaaag accgttaata acttacttta gaatcacagc aaattattct 1141 ggggtcaagt tattagaatt aaaaattaga taaacattca ttgtgttggt catgctacca 1201 agaagactga attc // LOCUS RHMCYA 1560 bp ds-DNA BCT 25-JUL-1990 DEFINITION R.meliloti adenylate cyclase (cyaA) gene, complete cds. ACCESSION M35096 KEYWORDS adenylate cyclase. SOURCE R.meliloti DNA. ORGANISM Rhizobium meliloti Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Rhizobiaceae. REFERENCE 1 (bases 1 to 1560) AUTHORS Beuve,A., Boesten,B., Crasnier,M., Danchin,A. and O'Gara,F. TITLE Rhizobium meliloti adenylate cyclase is related to eucaryotic adenylate and guanylate cyclases JOURNAL J. Bacteriol. 172, 2614-2621 (1990) STANDARD simple staff_review FEATURES from to/span description pept 885 1466 adenylate cyclase (cyaA) binding 871 879 ribosome binding site BASE COUNT 292 a 512 c 487 g 269 t ORIGIN 1 ggatcctgtt cctggacgcg agcggcctgc agtttgccga acgtcacgct gcctccaacg 61 gcttcgatcc gaggacgcgg ccctggtacc gcgcggccgt caacggcaag gcgccggtgg 121 ccatcggtcc ctatgagatg gccaccacag gcaatctcgg gatgaccata tcgcaagcgc 181 accgcggcaa cccccaaatc gtcatcggcg ccgatgtcgt tctcgatacg atcacggatt 241 ttctgtcccg cgagcggctg accgacgact cggtttcctt cgtgctcgat gcggtgggac 301 gaccgatcat ccactccgac tccaccatga tgcggcgcat catggcatcg aagggccggg 361 accggccggt ggccacgccg caggaggatg gactgatcga gagcatccgg cgcaacccgc 421 caccggccgg aaaggcaact ctcgtcgaag tcggaaaccg cacctatctc gtcacggtgg 481 cgccgctcga atcggcattg cttctgtccg ggcaccgggt ggtcgtcgcc gcccctctcg 541 acgagctgct ggcggccgca aacgagacgc tcgttcaggg acttgccgtc tcgggcgccg 601 tggtggtggt cgccgttctc ctggccctcg tgcttgcgca tctgatcacg aagtcgctca 661 accagctcac cgacagcgcc aaccgcctgc aggacctgga tttcgccact cctatcgacg 721 tttcgtcgca tgtggcggaa atctcgacgc tcaacggcgc aatgaacagg gctcgcgacg 781 cgatcttcac cttcgcgctc tatgttccga aggagctggt gcgcaagggc atcgaatccg 841 gccatttcgg cggccgcgcc gcatggcggc aggaggtgac ggcgatgttc accgacatct 901 acgacttcac caccatcagc gagggccggt cgccggaaga agtggtcgcg atgctctcgg 961 agtatttcga cctgttcagc gaggtcgtcg ccgcccacga cggaaccatc atccaattcc 1021 atggagactc ggtctttgcc atgtggaacg cgccggtcgc cgataccagg catgccgagc 1081 atgcctgtcg atgcgcactc gcggtcgagg agaggctcga ggccttcaat tctgcgcaac 1141 gcgccagcgg attgccggag ttccgcaccc gcttcggcat ccacaccgga acggccgtcg 1201 tcggcagcgt cggcgccaag gaacggctgc aatatacggc gatgggcgac acggtgaacg 1261 tcgcctcgcg gctcgagggc atgaacaagg attacggcac gagcgttctt gcaagcggcg 1321 cggtggtcgc ccaatgcaaa gacatggtga agttccgccc gctcggcacc gccaaggcaa 1381 agggccgttc gacggcgctc gacatttacg aagtcgtggg cgtcgtccgc gcggtgaaca 1441 ctaccgaagc cggaacggcc gcctgaggaa aggcagatgc cgcggcgaac ggcggccccg 1501 ctgaattcgc ttcgaaactc tgaaagcaaa aaagcccgga aacccgggct ttttttgact // LOCUS ECOCYSD 492 bp ds-DNA BCT 25-JUL-1990 DEFINITION E.coli sulfate adenylate transferase (cysD) gene, 5' end. ACCESSION M35098 KEYWORDS sulfate adenylate transferase. SOURCE E.coli (strain K-12) DNA. ORGANISM Escherichia coli Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Facultatively anaerobic rods; Enterobacteriaceae. REFERENCE 1 (bases 1 to 492) AUTHORS Malo,M.S. and Loughlin,R.E. TITLE Promoter elements and regulation of expression of the cysD gene of Escherichia coli K-12 JOURNAL Gene 87, 127-131 (1990) STANDARD simple staff_review FEATURES from to/span description pept 412 > 492 sulfate adenylate transferase (cysD) mRNA 373 > 492 cysD mRNA BASE COUNT 141 a 110 c 118 g 123 t ORIGIN 1 ctgcaggagt tccggtcatg cgtcccggaa agaaagtagc aatatgtcgt gcctgagtat 61 tagcaaaatc gccaggttta ggtgacgagg cgtgtacggg gagaataaag catacgccga 121 gcgccagggc agcggtacgg tggcgcaatg cggaaaacat agtgagtcct taaataccat 181 gcaaattttt ttaccgccat agtatgaaac tgccgctgcg ctaaaacaat ttcaaatctt 241 cctaaacgcc cgaaatccgg tgccttaagc actttttgat attagctttg ccaaatcgtt 301 attccgttaa ggaactactc attctaattg gtaatttcat tcgttctctt acgctcccta 361 tagtcgaaac atctgatggc aagaaaatag cggtattgca aaggaacggt tatggatcaa 421 atacgactta ctcacctgcg gcaactggag gcggaaagca tccacattat tcgcgaggtg 481 gcggcagaat tc // LOCUS CHPCOX41A 956 bp ds-DNA PRI 25-JUL-1990 DEFINITION Chimpanzee cytochrome c oxidase subunit IV (COX4P1) processed pseudogene, complete cds. ACCESSION M34599 KEYWORDS cytochrome c oxidase subunit IV; pseudogene. SOURCE Chimpanzee DNA, clone lambda-Ch1. ORGANISM Pan troglodytes Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Pongidae. REFERENCE 1 (bases 1 to 956) AUTHORS Lomax,M.I., Welch,M.D., Darras,B.T., Francke,U. and Grossman,L.I. TITLE Novel use of a chimpanzee pseudogene for chromosomal mapping of human cytochrome c oxidase subunit IV JOURNAL Gene 86, 209-216 (1990) STANDARD simple staff_review FEATURES from to/span description pept 216 665 cytochrome c oxidase subunit IV (COXIV) pseudogene (E.C. 1.9.3.1) signal 881 888 poly-A signal BASE COUNT 268 a 222 c 277 g 189 t ORIGIN Chromosome 14q21-qter. 1 ggtacctcca atcccagcta ctcgggaggc tgaggcagga gaatcacttg aactcgggag 61 gcggaggttg cagtgagctg agatcacgcc tctgcgctac agcctgggca acaagagcaa 121 aactccgtct cggaaaagaa aaaaacaaaa aagaactact ggggtcgcgg gacaccgggc 181 atagagggcg gcggtggtgg ggcagctgcg gcagaatgtt ggctaccagg gtagttagcc 241 tagttggcaa gcgagcaatt tccaccttgg tgtctgtacg agcacacgga aatgttgtga 301 agagcgatga ctatgcgctc ccagcttatg tggatcgacg tgactatccc gtacccgatg 361 tggcccatgt caagcacctg tctgccagac agaaagcctt gaagaagaag gagaaggcct 421 cctggagcaa ccgctccacg gatgggaaag tcgagttgta tcacattcag ttcaaggaga 481 gctttgctga gatgaacagg ggcgtgaacg agtggaagat ggttgtgggc gctgccatgt 541 tcttccttgg cttcacggcg ttcattatca tctgggagaa gcgctgtgtg tacggcccca 601 tcccgcacac ctttgacaaa gagtgggtgc ccatgcagac caagaggatg ctggacatga 661 ggtgaacccc tgcagggctt cgccagccaa gtgggactat gacaagaacg agtggaagaa 721 gtgaacccct gcagggcttc gccagccaag tgggactatg acaagaacga gtggaagaag 781 tgagagatgc tgtcctgctt ttgagccttg ctctgtcacc tccatactat aactccatgc 841 ctatttactg gaaacctgtt atgccaaaca gtaccactgc taataaatga ccagtttacc 901 tgaaagaaaa aaaaaaaaag aactactgaa gtgaaagaaa aatctggaga aagtac // LOCUS CHTMOMPA 682 bp ss-mRNA BCT 25-JUL-1990 DEFINITION C.trachomatis outer membrane protein (ompl) gene, 5' end. ACCESSION M35099 KEYWORDS outer membrane protein. SOURCE C.trachomatis (serovar L2/434/Bu), cDNA to mRNA. ORGANISM Chlamydia trachomatis Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Rickettsias and Chlamydias; Chlamydiales; Chlamydiaceae. REFERENCE 1 (bases 1 to 682) AUTHORS Kaul,R., Duncan,M.J.J., Guest,J. and Wenman,W.M. TITLE Expression of the Chlamydia trachomatis major outer membrane protein-encoding gene in Escherichia coli: Role of the 3' end in mRNA stability JOURNAL Gene 87, 97-103 (1990) STANDARD simple staff_review FEATURES from to/span description pept 568 > 682 outer membrane protein (ompl) precursor sigp 568 633 outer membrane protein (ompl) signal peptide matp 634 > 682 outer membrane protein (ompl) mRNA 1 > 682 ompl mRNA BASE COUNT 218 a 144 c 111 g 209 t ORIGIN 1 aaaaacactt tctttgtagt aataaaaacg atttctatca aaacaaattc ttagattttc 61 ttacaaaaat ctcctctttt cttttagcca aacccccatc ttcgagctat tccaaacaca 121 aaaatcttag gttttggaaa ttaacaactc ataaaaattg aactgttttg taattaactc 181 aaaaccctct cattctcaac aatcaacata ttgccaacat ggcttttgct ctcggtttca 241 cagcgatttt tttcgcaaaa accaagaaca taaaacataa aaagatatac aaaaatggct 301 ctctgcttta tcgctaaatc aggaggcgct taagggcttc ttcctgggac gaacgttttt 361 cttatcaact ttacgagaat aagaaaattt tgttatggtc tcgagcattg aacgacatgt 421 tctcgattaa ggctgctttt acttgcaaga cattcctcag gccattaatt gctacaggac 481 atcttgtctg gctttaacta ggacgcagtg ccgccagaaa aagatagcga gcacaaagag 541 agctaattat acaatttaga ggtaagaatg aaaaaactct tgaaatcggt attagtgttt 601 gccgctttga gttctgcttc ctccttgcaa gctctgcctg tggggaatcc tgctgaacca 661 agccttatga tcgacggaat tc // LOCUS ECOK99FIM 740 bp ds-DNA BCT 25-JUL-1990 DEFINITION E.coli K99 fimbrial subunit gene, complete cds. ACCESSION M35282 KEYWORDS K99 fimbrial subunit. SOURCE E.coli (strain K-12 C600) DNA, clones 1, 2, 3, 4 and 5. ORGANISM Escherichia coli Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Facultatively anaerobic rods; Enterobacteriaceae. REFERENCE 1 (bases 1 to 740) AUTHORS Roosendaal,B., Gaastra,W. and de Graaf,F.K. TITLE The nucleotide sequence of the gene encoding the K99 subunit of enterotoxigenic Escherichia coli JOURNAL FEMS Microbiol. Lett. 22, 253-258 (1984) STANDARD simple staff_review FEATURES from to/span description pept 70 615 K99 fimbrial subunit precursor sigp 70 135 K99 fimbrial subunit signal peptide matp 136 612 K99 fimbrial subunit pept 648 > 740 ORF1 BASE COUNT 235 a 133 c 146 g 226 t ORIGIN 1 tagggaatgg ctatgttttc tggtgattcc acggaactaa aaaataatat cgaacaatgg 61 agaatctaga tgaaaaaaac actgctagct attatcttag gtggtatggc ttttgcgact 121 accaatgctt ctgcgaatac aggtactatt aacttcaatg gcaaaataac gagtgctact 181 tgtacaattg accctgaggt caatggtaat cgtacatcaa ctatagatct tgggcaggct 241 gctattagtg gtcatggcac tgtagtggat tttaaactaa aaccagcgcc cggcagtaat 301 gactgcctag cgaaaacaaa tgctcgtatt gactggtctg gttctatgaa cagtttaggt 361 tttaataata cagcttcagg aaatactgct gctaaaggat accatatgac tttgcgcgca 421 acaaacgttg gaaatgggtc tggtggtgct aatattaata cttcattcac tacggctgaa 481 tacactcaca cttctgcaat tcagtcattt aactattcag cccagctgaa aaaagatgac 541 cgcgctccgt ctaatggtgg atataaagct ggcgtattta ctacttcagc atccttctta 601 gtcacttata tgtaatattt aaagtatttt acattgcggg catatctatg attgcccgca 661 atattactga tggatattat atgaatagaa aaaaacatca gattttaaaa attttattgt 721 tgtgtctaat aagcagtaaa // LOCUS ECORRDAA 72 bp ss-rRNA RNA 25-JUL-1990 DEFINITION E.coli 16S rRNA fragment. ACCESSION M35308 KEYWORDS 16S ribosomal RNA; ribosomal RNA. SOURCE E.coli (MRE 600) ribosomal RNA. ORGANISM Escherichia coli Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Facultatively anaerobic rods; Enterobacteriaceae. REFERENCE 1 (bases 1 to 72) AUTHORS Ehresmann,C., Fellner,P. and Ebel,J.P. TITLE Nucleotide sequences of sections of 16S ribosomal RNA JOURNAL Nature 227, 1321-1323 (1970) STANDARD simple staff_review FEATURES from to/span description rRNA < 1 > 72 16S rRNA BASE COUNT 17 a 16 c 20 g 19 t ORIGIN 1 ggcttggttt gcaagtgtca gatactgtta agcatctgaa atccccgggc taaccctggg 61 aactgatgac tg // LOCUS ECORRDAB 174 bp ss-rRNA RNA 25-JUL-1990 DEFINITION E.coli 16S rRNA fragment. ACCESSION M35309 KEYWORDS 16S ribosomal RNA; ribosomal RNA. SOURCE E.coli (MRE 600) ribosomal RNA. ORGANISM Escherichia coli Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Facultatively anaerobic rods; Enterobacteriaceae. REFERENCE 1 (bases 1 to 174) AUTHORS Ehresmann,C., Fellner,P. and Ebel,J.P. TITLE Nucleotide sequences of sections of 16S ribosomal RNA JOURNAL Nature 227, 1321-1323 (1970) STANDARD simple staff_review FEATURES from to/span description rRNA < 1 > 158 16S rRNA BASE COUNT 42 a 40 c 55 g 37 t ORIGIN 1 ggcatgaaga cacactgcta actccgaata cgcacaagcc cgtaatggag cgacggtggg 61 ccttgttccc gtgccccgat gtggggtgga ggtgactgtg ggttgtgata ttcggggagg 121 caaaagaagt agcgagtcta accttgctta ccactttgcc taatacggga aacg // LOCUS HPTRRA 117 bp ss-rRNA RNA 25-JUL-1990 DEFINITION H.aurantiacus 5S rRNA gene. ACCESSION M35310 KEYWORDS 5S ribosomal RNA; ribosomal RNA. SOURCE H.aurantiacus (strain Sengas Wie 2) ribosomal RNA. ORGANISM Herpetosiphon aurantiacus Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Nonphotosynthetic, nonfruiting gliding bacteria; Cytophagales; Cytophagaceae. REFERENCE 1 (bases 1 to 117) AUTHORS Van den Eynde,H., Stackebrandt,E. and De Wachter,R. TITLE The structure of the 5S ribosomal RNA of a member of the phylum of green non-sulfur bacteria and relatives JOURNAL FEBS Lett. 213, 301-303 (1987) STANDARD simple staff_review FEATURES from to/span description rRNA 1 117 5S rRNA BASE COUNT 22 a 37 c 40 g 18 t ORIGIN 1 tccggtggca atgtcggagg ggtcccaccc gttcccatcc cgaacacgga agttaagccc 61 tccagagccg atggtactcc gcggggaacc gcgcgggaga gtaggtcgct gccggat // LOCUS HUMCOX4AA 634 bp ss-mRNA PRI 25-JUL-1990 DEFINITION Human cytochrome c oxidase subunit IV (COX4) mRNA, complete cds. ACCESSION M34600 KEYWORDS cytochrome c oxidase; cytochrome c oxidase subunit IV. SOURCE Human liver, cDNA to mRNA, clones pCOX4.-[111 and 4.2]. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 634) AUTHORS Lomax,M.I., Welch,M.D., Darras,B.T., Francke,U. and Grossman,L.I. TITLE Novel use of a chimpanzee pseudogene for chromosomal mapping of human cytochrome c oxidase subunit IV JOURNAL Gene 86, 209-216 (1990) STANDARD simple staff_review FEATURES from to/span description pept 1 510 cytochrome c oxidase subunit IV (COX4) /hgml_locus_uid="LS0022W" /nomgen="COX4L2" /map="16q22-q24" mRNA < 1 634 COX4 mRNA BASE COUNT 156 a 157 c 180 g 141 t ORIGIN 1 atgttggcta ccagggtatt tagcctagtt ggcaagcgag caatttccac ctctgtgtgt 61 gtacgagctc atgaaagtgt tgtgaagagc gaagactttt cgctcccagc ttatatggat 121 cggcgtgacc accccttgcc ggaggtggcc catgtcaagc acctgtctgc cagccagaag 181 gcactgaagg agaaggagaa ggcctcctgg agcagcctct ccatggatga gaaagtcgag 241 ttgtatcgca ttaagttcaa ggagagcttt gctgagatga acaggggctc gaacgagtgg 301 aagacggttg tgggcggtgc catgttcttc atcggtttca ccgcgctcgt tatcatgtgg 361 cagaagcact atgtgtacgg ccccctcccg caaagctttg acaaagagtg ggtggccaag 421 cagaccaaga ggatgctgga catgaaggtg aaccccatcc agggcttagc ctccaagtgg 481 gactacgaaa agaacgagtg gaagaagtga gagatgctgc ctgcgcctgc acctgcgcct 541 ggctctgtca ccgccatgca actccatgcc tatttactgg aaacctgtta tgccaaacag 601 ttgtaccact gctaataaat gaccagttta cctg // LOCUS MRGRBMII 2574 bp ds-DNA VRT 25-JUL-1990 DEFINITION M.serrator retropseudogene-like repetitive element I (RBMI). ACCESSION M35143 KEYWORDS repetitive sequence. SOURCE M.serrator blood DNA. ORGANISM Mergus serrator Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Aves; Neornithes; Neognathae; Anseriformes; Anatidae. REFERENCE 1 (bases 1 to 2574) AUTHORS McHugh,K.P., Madsen,C.S. and de Kloet,S.R. TITLE A highly repeated retropseudogene-like sequence in DNA of the redbreasted merganser (Mergus serrator) JOURNAL Gene 87, 193-197 (1990) STANDARD simple staff_review FEATURES from to/span description rpt 1 2574 retropseudogene-like repetitive element pept 1403 909 (c) ORF1 pept 2118 1426 (c) ORF2 BASE COUNT 691 a 600 c 484 g 790 t 9 others ORIGIN 1 gaattcctca aacacgctgc ggctgcttac ctttaataca cccgttgcat gcgatggagc 61 tgtatttctt gcttttncct gcactggaag gcttcccttc cttgtcaggt tgtttactgc 121 cctcactctt ctgcattgct cacatgaaga gccatctgga ggatgggttt cttccttctt 181 ctcccgggtt atcttctgga aacgaggacc taagtattcc aaggagcctt tcactttcct 241 ggtgtttctc cttttttttc tttttcttct cctttttctt ctttttctta tgcttgtgat 301 tggcattgtc aaagtggagc gcacagaaac acaaatcgtg aagtctgaaa gaaacatgca 361 agttaaaaag agaaaaaaag atgtggcact tgttgcctat atgaaacttt atttttttta 421 ccacaggtga tgatttgcag catgtcagct attttgtggt gctttgtgca cacgcaactt 481 acttacttta gatgcagcaa acttaagccc tcagattgaa ggaccatagg ctggtttgta 541 cacagatcat taaccatggt tagctctgga atacgtgcaa gcagaaaaaa acttttaacc 601 taatccggaa tggtgtacag atgtgattcg aactatgtgg tctaacgcta gtgctctgac 661 acaattcagc aatagctttc ctatcttcac tgaacaccta cacacagacc cagccagctg 721 atgctatcta aataacttag aaactaccag aaaaaaaaaa aaaaaaaaaa gaagaaaaaa 781 cgagaataaa aaaaaaaagt agaaaaaaaa aaaaaaagga agacatgaga agcacccaga 841 aatgaattag gataaaaaat tcggagtatg ctggaatcct tgcttacttg gaatccttct 901 ctgcatgttt aatccttaga cttctttttt cttctagaac ttgttgatat ttttgcattt 961 ttttcaccac ctaaaagctc cttttctatc tttctgtctt tcctttctat ttcactttca 1021 ctaccttctg cacgggtata ttttcttttt ctgtttcttt ctgtttcatt tttctggcga 1081 cagttctcca aatgagctga cacgggtgga agcgcatgtc tttcacgaga atgtcttctg 1141 gaatgttgct gatgtaccga gcaacgatgc aagtctgctc ggggtgtgct aaagcgacgt 1201 acatcttcct ctctcaagag ggaactgtga ggccatccgc ttttgtaatg ataactctta 1261 tgtgacctgc tgtagtaagt tgcagtcgat ttgtcaaagg ctgcatcgcc gtgagacaac 1321 tttctctctc tactgtctcc tgtcgcatga ggtgaatagt aatcattgta atagctacat 1381 ctttcccatc tccgagcntt catcctcgat agtatctntc tctgctcaac ttctttgccc 1441 tttggatcgg taatatctat tgctacctcg ttctgatctt cctccgcttg ccagatctgt 1501 actttgaata tttgacngct cttctgccat tctcagggct gtttctttca nnnnggaaag 1561 atctgcacct gcttcccccc cagtgctcct gcttgtgacg cttttgctca acaacttcca 1621 cgctctgaga acacctcctc ttgctggaag gacctgcttt ttgactctcc ttctcttcag 1681 taggagcatg ttcctcttgc tttggtaatg ctctttgtca gtgtttttag tctcgncttg 1741 tatcttggca tctctctgta atagctgagg aggaaaggtt tttagagcta cattcagtgt 1801 cagacttgag agaggaagct tgccgcaatt tctcaccagg ctcagaagac tctttgccgg 1861 acaaaacgtt ttcttttgaa atgaggtcac gttcttttca tcttcttgct ttctccttat 1921 ctccaccgtc atcacattgg tactgtgcga ggtattcatc attccagtag attttcgagg 1981 gtccgcaaca ctgcacaaaa taaaagcaca tttctcagtt ctgctgaagg acgtgaatat 2041 taagaggaaa accttccaaa agtcgaacaa acaaacaaaa acctccggac tacaggaaca 2101 ctctccaaga tatgccattt agaaacctct cctgtcatta ggacaccttc ttcagctcca 2161 cagaaagggg ttttgccctc ttgcttctga agccattgca ctaaaaagca aacgcagtgc 2221 tgtctccctc cacatgctgc tctgaataag agccagaata ttcaaaacca ctctctttgt 2281 tctcccacat agccgaaaaa acaccggttg aaacagagtt ttctacctct cgcccaacaa 2341 tttacattca catagcctat gactgaaaaa ataaaaggcg gggctgagga ggaacagcca 2401 gtgttggaaa tgaaaagaag cagcccgttc cttcatagtc ttaagcctat gctactagga 2461 aaacaaaaca aaacaaaaca aaacaagagg agaggagaac aacagcggga aattttcctg 2521 ttctccaggt gttaaattgc aaagcctcct ctggaggatc acagctgtga attc // LOCUS MUSLAMB 2789 bp ss-mRNA ROD 25-JUL-1990 DEFINITION Mouse lamin B mRNA, complete cds. ACCESSION M35153 KEYWORDS lamin B. SOURCE Mouse liver, cDNA to mRNA, clone FML11-1. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 2789) AUTHORS Hoeger,T.H., Krohne,G. and Franke,W.W. TITLE Amino acid sequence and molecular characterization of murine lamin B as deduced from cDNA clones JOURNAL Eur J Cell Biol 47, 283-290 (1988) STANDARD simple staff_review FEATURES from to/span description pept 256 2019 lamin B mRNA < 1 2789 lamin B mRNA site 2768 2773 poly-A signal BASE COUNT 696 a 657 c 812 g 624 t ORIGIN 1 aataatctta agctcttaca aagagctgcg ggcgggagac tcgcgtccgg cgcacagccg 61 tctgcgtctc ccggctgccc tggcctcttc ccgcgcgcgc gtgcagtgtg cgtgtacact 121 cacaaagggc gtctggcggg cgatccgcgg ccctcccgct tcgctctttg tgcggtagcc 181 ccgccgccac cgccagccca ggtccgctcg atcctcaccg gcctgtggtt tgtaccttcg 241 gtcccgccgc ccgccatggc gaccgcgacc cccgtgcagc agcagcgggc gggcagccgc 301 gccagcgccc ccgccacgcc gctcagcccc acgcgcctgt cgcgcctgca ggagaaagag 361 gagctgcggg agctcaacga ccgcctggct gtgtacatcg ataaggtccg cagcctggag 421 acggagaaca gcgcgctgca gctgcaggtg accgagcggg aggaggtgcg cggccgcgag 481 ctcaccggcc tcaaggctct ctacgagacc gagctggccg acgcacgccg cgctctggac 541 gacacggccc gcgagcgcgc caagcttcag atcgagctgg gcaagttcaa ggccgagcac 601 gaccagctgc tgctcaatta tgccaagaag gaatctgatc tcagtggagc ccagatcaag 661 cttcgagagt atgaggcggc actaaactct aaggatgcgg cgctggcaac tgccctaggg 721 gacaaaaaga gtttagaggg agacttggag gatctgaaag atcagattgc ccagctagaa 781 gcatccttat ctgccgccaa aaagcagtta gcagatgaaa ctttacttaa agtggatttg 841 gagaatcgct gtcagagcct tactgaggac ttggagtttc gtaaaaatat gtatgaagag 901 gagatcaatg agacaaggag gaagcatgag acccgcttgg tggaagtgga ctctgggcgt 961 cagattgagt atgagtacaa gctggctcaa gccctgcatg agatgcggga gcagcacgac 1021 gcgcaggtga ggctgtacaa ggaagagctg gagcagacct accacgccaa gcttgagaat 1081 gccagactct cctcagagat gaacacttcc actgtcaaca gtgcccggga agagctgatg 1141 gagagccgga tgaggatcga gagcctctcc tcacagctct ctaacctgca gaaagagtct 1201 agagcgtgtt tggaaaggat ccaggaattg gaggacatgc ttgctaagga gagagacaac 1261 tcgcgccgca tgctgtctga cagagagaga gagatggcgg agatcaggga ccagatgcag 1321 cagcagctga gtgattatga gcagctgctg gacgtgaagc tggccctgga catggagatc 1381 agcgcctaca ggaagctcct ggaaggcgaa gaagagcggt taaagctctc tccaagccct 1441 tcttcccggg tgaccgtgtc cagagcgtcc tccagtcgca gtgtgcgcac caccagagga 1501 aagcggaaga gagttgatgt ggaggagtcg gaggcgagca gcagtgttag catttcccac 1561 tctgcctcag ccacggggaa cgtgtgcatt gaagagatag atgttgatgg gaagtttatt 1621 cgcttgaaga acacttctga gcaggatcaa ccaatgggag gctgggagat gatcagaaaa 1681 attggagaca catcagtcag ttacaaatat acctcaagat atgtgctgaa ggctggccag 1741 actgtcacag tgtgggctgc aaatgctggc gtcacagcca gccctccaac tgacctcatc 1801 tggaagaacc agaactcttg gggtactggt gaagatgtga aggttatgct taagaattct 1861 cagggagagg aggttgctca gagaagctct gtcttcaaga ccaccatacc cgaggaggag 1921 gaggaggagg aggagcccat cggagtggct gtggaggagg agcgtttcca ccagcaggga 1981 gccccaagag catggaataa aagctgtgcc attatgtgaa cttatcaaga catggtcgat 2041 cttcctcaag ctagaagcat ggagtcctgt atacagtgca gagccttctc agaagcacat 2101 gatatttttg tatttccttt atgtgaattt ttaagctgcg aatctgatgg ccttaatttc 2161 ctttttgaca ctgaaagttt tgtcaaaaga aatcctatcc atacacgttg taagatgtga 2221 attattgaca ctgagctaac tgtactgttt ggaaaggggc cctcaagttt ttggcatttt 2281 ttctttcctt tttgtatgtg tgtatgtaat tttttttttt taagttcttt taagagggga 2341 caaggagggt aagaaaacca ctgcgtgtcc gggcattaat tgaagcttgc tctccctaga 2401 tgggcggtct gctctcggtc cttctctgct ctctataaaa tggtgctgtc ggggagggag 2461 gggggaagtt tttcaatata tgaacttttg tatggaattt tttgtaataa gtgatcaggt 2521 tacaattttt ttaaatagaa aagagaagaa aaacgttgta agaacggaat attaatctag 2581 tcacccatgt acgcactctg gatggaggtt ctacagagct gttgattggt caactacttc 2641 tcttacattg ttgactcatg aggggagcgg gcaggcgggt gagggtgggg gaaggctttc 2701 tcttcaaatt cgctagttga gtttttaaga tagtgtacat gcttacattt cttatccgac 2761 attaacaaat aaaacgctgt tttcctatt // LOCUS MUSMSTA1 1651 bp ds-DNA ROD 25-JUL-1990 DEFINITION Mouse metastatic cell protein (mts1) gene, exons 1 and 2. ACCESSION M36578 M35147 KEYWORDS mts1 protein. SEGMENT 1 of 2 SOURCE Mouse metastatic cell line DNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 1651) AUTHORS Tulchinsky,E.M., Grigorian,M.S., Ebralidze,A.K., Milshina,N.I. and Lukanidin,E.M. TITLE Structure of gene mts1, transcribed in metastatic mouse tumor cells JOURNAL Gene 87, 219-223 (1990) STANDARD simple staff_review FEATURES from to/span description pept 1498 / 1638 mts1 protein, exon 2 (first expressed exon) pre-msg 255 > 1076 mts1 mRNA and introns IVS 293 1482 mts1 intron A IVS 1639 > 1651 mts1 intron B signal 225 231 TATA box BASE COUNT 380 a 358 c 460 g 437 t 16 others ORIGIN 1 ttctggctga gctgtggctg cttggtggtg tccaccccat ccaagcctct gccgtgccca 61 ctggagctca ctcactactt gattgtgcct gctggggagg gagcaggaag cctagatccc 121 agactgggct ggtcgagggt gctatgacat ttactacatc aaccaacagc aagagcacag 181 tatccatgtt cccccatcct ctgcatgggc agggcctagc agggtataaa taggtcagat 241 tgttgggctc tccccaaacc tctctattca gcacttcctc tctcttggtc tggtgagttg 301 tgttggtctg atagcactgc tagcggcatt agaggctgag gctagggtag aagaaagggg 361 ggctgctgtg ggggaacaga tgtctttaat aaatccagat gagagattct gatgtggagg 421 ttcatgtatg tgtgtgtgtg tgtgtgtttt cacgagaatg aaaaccaaaa aaaaaaaaaa 481 aaaaaaaaaa agtgtataaa tggctacatc tgagctcccg aaggttttga gatactgagg 541 ctggcttgca tgttgctata gtgtatattg gtggtgcttg ggagtcactg tcatgcatag 601 gatgctgact cgtgttgctg ggtaatacaa gacagtgtgt ggacactcgg gtacaggaag 661 caaagcgaag gcatcagtag gcctttttgt tttacagtat ttaaattaca gtttttattt 721 gtgtgtatga gcgtatgggt tgggctggag caaatgccaa ggcgacattg tgggagccaa 781 aggacaattt gtgtgggagt caactcgttc cttctagcat gtgggctgtg gggatcaaac 841 tcaggccttg gagcttggtg gcaagcacct ctacccattg agctatctct ccagcaccct 901 cctgcagnnn nnnnnnnnnn nntttgtagt gtcttgtttt taattgccct atgaacatat 961 agcacctagg ccaagaaagc ctagcttccc caccctctcc tcttgcatcc ctacctctgc 1021 cacttcatct tactcctatt aggcagctgg ggtttttcca cttttttttt gtctgcctct 1081 gggcaggcag ccagcagccg cgcccaacgc tgggagggag aagaatgggc caggcctgtg 1141 cttgtggttg agctgtggga gtgagtaagc tgatggaaaa ctgctgttgt tgaggccata 1201 gctgagaggc acagaaaggt gctggcatag gtctccagag tttgaggggt agctttgcag 1261 gtttcagagc ccagagcaca tgtgaccttc ttgccaccaa tgggtcccat tcctctgatc 1321 cccnaggggg tgaggtccat ctcttagaga gttgtgggat agagcactta aaatgggaac 1381 agaatgagtg tgatttgggt catgctcagc aacacatatc cagttctcaa cacactgttg 1441 gcgtgggttg gagaatgtta cttttgtgtc tcctgccctt aggtctcaac ggttaccatg 1501 gcaagaccct tggaggaggc cttggatgta attgtgtcca ccttccacaa atactcaggc 1561 aaagagggtg acaagttcaa gctgaacaag acagagctca aggagctact gaccagggag 1621 ctgcctagct tcctgggggt aagtgggtcc t // LOCUS MUSMSTA2 545 bp ds-DNA ROD 25-JUL-1990 DEFINITION Mouse mts1 protein gene, exon 2. ACCESSION M36579 M35147 KEYWORDS mts1 protein. SEGMENT 2 of 2 SOURCE Mouse metastatic cell line NIH3T3 DNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 545) AUTHORS Tulchinsky,E.M., Grigorian,M.S., Ebralidze,A.K., Milshina,N.I. and Lukanidin,E.M. TITLE Structure of gene mts1, transcribed in metastatic mouse tumor cells JOURNAL Gene 87, 219-223 (1990) STANDARD simple staff_review FEATURES from to/span description pept / 14 178 mts1 protein, exon 2 pre-msg < 1 315 mts1 mRNA and introns IVS < 1 13 mts1 intron B signal 297 302 poly-A signal BASE COUNT 126 a 97 c 170 g 152 t ORIGIN 1 cttcaacggc cagaaaagga cagatgaagc tgcattccag aaggtgatga gcaacttgga 61 cagcaacagg gacaatgaag ttgacttcca ggagtactgt gtcttcctgt cctgcattgc 121 catgatgtgc aatgaattct ttgagggctg cccagataag gagccccgga agaagtgaag 181 actcctcaga tgaagtgttg gggtgtagtt tgccagtggg ggatcttccc tgttggctgt 241 gagcatagtg ccttactctg gcttcttcgc acatgtgcac agtgctgagc aaattcaata 301 aaaggttttg aaactattag ctgttgtctg agagactgga gctatgggct gagggctgtg 361 gtagagactg ctggaagttg acctgagctt tgtggggcca aactaaaaaa aggtcgggga 421 gggggtgggt ggcttatttt gagtacattg caagtatgta tttgtgtgtg tcggcttagt 481 catgcgtgca tgtgtgcgtg cgtgtgtgtt tgtgtgtgtt tacgtgctcc tatatagcaa 541 ccgag // LOCUS MUSNFH 3959 bp ss-mRNA ROD 25-JUL-1990 DEFINITION Mouse neurofilament component (NF-H) mRNA, complete cds. ACCESSION M35131 KEYWORDS neurofilament protein. SOURCE Mouse (strain Swiss-Webster) brain, cDNA to mRNA, clones pMuH1, pMuH5, and DNA, clone lambda-5A. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 3959) AUTHORS Shneidman,P.S., Carden,M.J., Lees,J.F. and Lazzarini,R.A. TITLE The structure of the largest murine neurofilament protein (NF-H) as revealed by cDNA and genomic sequences JOURNAL Mol. Brain Res. 4, 217-231 (1988) STANDARD simple staff_review COMMENT Nucleotides 1-955 are derived from genomic DNA. FEATURES from to/span description pept 154 3372 neurofilament component (NF-H) signal 59 65 TATA box signal 3936 3942 NF-H mRNA BASE COUNT 1104 a 1122 c 1197 g 536 t ORIGIN 1 ggggccgcgg gggaggaggt ggagcccact gccgaggggc cggaccgggc caccgcgata 61 taaaagagcc ggagtcccag agctgccgca gtgctgcctg ccccgtccca gccccgcact 121 cccgctccgc tggcggccgc acctgctccg gccatgatga gcttcggcag cgccgatgcg 181 ctgctgggcg ccccgttcgc gccgctgcac ggaggcggca gcctgcacta ctcgctgagc 241 cgcaaggcag gcccgggcgg cacgcgctcc gcggccggct cctccagcgg cttccactcg 301 tgggcgcgga cgtccgtgag ctccgtgtcc gcctcaccca gccgcttccg cggcgccgcc 361 tcgagcaccg actcgctaga caccctaagc aacggcccag agggctgcgt ggtggcggcg 421 gtggcggcgc gcagcgagaa ggagcagctg caggctctga acgaccgctt cgcgggctac 481 atcgacaagg tgaggcagct cgaggcgcac aaccgcagcc tggagggcga ggcggcggcg 541 ctgcggcagc aacaagccgg ccgcgccgcc atgggcgagc tgtacgagcg cgaggtgcgc 601 gagatgcgcg gcgccgtgct gcgcctcggg gcggcgcgcg ggcagctgcg cctggagcag 661 gagcacctgc tggaggacat cgctcacgtc cgccagcggc tggacgagga ggcccggcag 721 cgtgaggagg cggaggcggc ggcgcgcgcc ctggcgcgct tcgcgcagga ggcggaagcg 781 gcgcgcgtgg agctgcagaa gaaggcgcag gcgctgcagg aggagtgcgg ctacctgcgg 841 cgccaccacc aggaggaggt gggcgagctg ctcggtcaga tccagggctg cggggccgcg 901 caggcgcagg ctcaggccga ggctcgcgac gccctcaagt gcgacgtgac gtcggcgctg 961 cgggagatcc gcgcgcagct cgaaggccac gcggtgcaga gcacgctgca gtccgaggag 1021 tggttccgag tgaggttgga ccgactctca gaggcagcca aagtgaacac agatgctatg 1081 cgctcggccc aagaggagat aactgagtac cggcggcagc tgcaagccag gaccacagag 1141 ttggaggccc tgaaaagcac caaggagtca ctggagaggc agcgctctga gctagaggac 1201 cgtcatcagg cagacattgc ctcctaccag gacgctattc agcagctgga cagtgagctg 1261 agaaacacca agtgggagat ggctgcacag ctccgagagt accaggacct gctcaacgtc 1321 aagatggccc tggacattga gattgccgct tacagaaagc tcctggaagg cgaagagtgt 1381 cggattggct ttggtccgag tcccttctct cttactgaag gactcccaaa aattccctcc 1441 atatccacgc acataaaagt caaaagcgaa gagatgataa aggtagtaga gaaatccgag 1501 aaggaaactg tgattgtaga aggacagaca gaagagatcc gggtgacgga aggagtgaca 1561 gaagaggagg acaaagaggc ccaaggtcag gaaggagaag aagcagaaga gggagaagaa 1621 aaagaagaag aggaaggagc agcagctaca tctccccctg cagaagaggc tgcatctcca 1681 gaaaaagaaa ccaagtctcg tgtgaaagaa gaggccaagt ccccaggtga ggccaagtcc 1741 ccaggtgagg ccaagtcccc aggtgaggcc aagtccccag ctgaggccaa gtccccaggt 1801 gaggccaagt ccccacgtga ggccaagtcc ccaggtgagg ccaagtctcc agctgagccc 1861 aagtctccag ctgagcccaa gtctccagct gaggccaagt caccagctga gcccaagtct 1921 ccagctacag tgaagtctcc aggtgaggcc aagtcaccat ctgaggccaa atctccagct 1981 gaagccaaat ctccagctga ggccaaatct ccagctgagg ccaaatctcc agctgaggcc 2041 aagtcaccag ctgaagccaa gtcaccagct gaagccaaat ctccagctac agtgaagtct 2101 ccaggtgagg ccaagtcacc atctgaggcc aaatctccag ctgaagccaa atctccagct 2161 gaggccaaat ctccagctga ggccaaatct ccagctgagg tcaagtcacc aggtgaggcc 2221 aagtctccag ctgagcccaa gtcaccagct gaggccaaat ctccagctgc agtgaagtca 2281 ccagctgagg ccaagtctcc agctgcagtc aagtccccag gtgaggccaa gtccccaggt 2341 gaggccaagt caccagctga ggccaaatct ccagctgagg ccaagtcacc aattgaggta 2401 aaatctccag agaaggccaa gacccccgtc aaggaaggag caaaatctcc agctgaggcc 2461 aagtctcctg agaaggccaa gtcccccgtg aaggaagata tcaagccccc agctgaggcg 2521 aaatcccctg agaaggccaa gagccccatg aaggaaggag caaagcctcc tgagaaggcc 2581 aagcctctag atgtgaagtc tccggaagcc cagactccag tacaggagga agcgaacgac 2641 cccacagaca tcagaccccc tgagcaggtg aaaagtcctg ccaaggagaa ggccaagtcc 2701 cctgagaagg aagaagccaa gacttctgaa aaggtggctc ccaagaagga agaggtgaag 2761 tcccctgtga aggaggaggt aaaagccaaa gaacccccaa agaaggtaga agaagagaag 2821 acactgccta caccaaagac agaggcgaag gagagtaaga aagacgaagc tcccaaggag 2881 gccccgaagc ccaaggtgga ggagaagaag gaaactccca cggaaaagcc caaggactct 2941 acagcagaag ccaagaagga agaggctgga gagaagaaga aagccgtggc ctcagaggag 3001 gagactcctg ccaagttggg tgtgaaggaa gaagctaaac ccaaagagaa gacagagaca 3061 accaagacag aagcagaaga caccaaggcc aaagaaccta gcaaacccac agagacggaa 3121 aagccaaaga aagaggagat gccagcggca ccagagaaga aagacaccaa ggaggagaag 3181 accacagagt ccaggaagcc tgaggagaag cccaaaatgg aggccaaggt caaggaggat 3241 gacaagagcc tttccaaaga gcctagcaaa cccaagacag aaaaggctga aaaatcctct 3301 agcacagacc agaaagaaag ccagccccca gagaagacca cagaggacaa ggccaccaag 3361 ggagagaagt aagagaacaa gagaaacacc cagaatagcc aaagaaactc aggacggtcc 3421 cagtactcag gggtcggcgt aataaatttt atttcttcct ttccctccgt aagaagaaac 3481 actgcttaga tggtgggcct gccctcacca aacaggaatt tctattaaga ttaagttagc 3541 aagagaagat aaccctgagc cttgtccccc acgccgaaaa ccctccccag gtgatggaca 3601 attatgatag cttcttgtag ccgaacgtga tgtatgctga acgctacgcg taaaacacgc 3661 gtctaaaaac tgccccctcc tttccaagta agtgcattta tttcctgtat gtccaactga 3721 cagatgaccg caataatgaa tgagcagtta gaaacgcatt atgcttgaaa tgttgtaacc 3781 tattcctgaa tgccttcttg ttttccaaag gagtggtcag gcccttgccc agtacacgct 3841 cctggaagag ctgcagcagg tgaggcaggg cgctggccac tgaaccacgc cagggtgtac 3901 tctccactga agtccacttt caattgcttc catgcaataa aaccaagtgc ttctgaaat // LOCUS MUSRGCA 350 bp ds-DNA ROD 25-JUL-1990 DEFINITION Mouse 18S rRNA gene. ACCESSION M35283 KEYWORDS 18S ribosomal RNA; processing factor; ribosomal RNA. SOURCE Mouse (strain S100) ribosomal DNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 350) AUTHORS Mishima,Y., Katayama,M. and Ogata,K. TITLE Identification of a protein factor and the nucleotide sequence required for processing of mouse precursor rRNA JOURNAL J. Biochem. 104, 515-520 (1988) STANDARD simple staff_review FEATURES from to/span description rRNA 325 > 350 18S rRNA site 220 220 processing site BASE COUNT 29 a 128 c 117 g 76 t ORIGIN 1 tcgacgttcc ggctctcccg atgccgaggg gttcgggatt tgtgccgggg acggagggga 61 gagcgggtaa gagaggtgtc ggagagctgt cccggggcga cgctcgggtt ggctttgccg 121 cgtgcgtgtg ctcgcggcgg gttttgtcgg accccgacgg ggtcggtccg gccgcatgca 181 ctctcccgtt ccgcgcgagc gccgcccggc tcacccccgg tttgtcctcc cgcgaggctc 241 tccgccgccg cctcctcctc ctctctcgcg ctctctgttc cgcctggtcc tgtcccaccc 301 ccgacggctt cgctcgcgct tccttacctg gttgatcctg ccagtagcat // LOCUS MYCRDNAA 190 bp ds-DNA BCT 25-JUL-1990 DEFINITION M.hyorhinis A-repeat sequence DNA. ACCESSION M35303 KEYWORDS A-repeat. SOURCE M.hyorhinis DNA, clone pG102.1. ORGANISM Mycoplasma hyorhinis Prokaryota; Bacteria; Tenericutes; Mollicutes; Mycoplasmas; Mycoplasmatales; Mycoplasmataceae. REFERENCE 1 (bases 1 to 190) AUTHORS Taylor,M.A., Ferrell,R.V., Wise,K.S. and McIntosh,M.A. TITLE Reiterated DNA sequences defining genomic diversity within the species Mycoplasma hyorhinis JOURNAL Mol. Microbiol. 2, 665-672 (1988) STANDARD simple staff_review BASE COUNT 94 a 15 c 31 g 50 t ORIGIN 1 gaattcaaaa aagaagattt tgacaagaaa aatgaagaaa ttataagtca aatgaagctt 61 atttttgaag aaaataaagc aagatatgaa aaaaggagaa tcaaagctga acttaataat 121 agaggctata aaattggact taaaaaagtt cacagattat tggaaaaatt caatcttaaa 181 gcaatttgtt // LOCUS MYCRDNAB 190 bp ds-DNA BCT 25-JUL-1990 DEFINITION M.hyorhinis A-repeat sequence DNA. ACCESSION M35304 KEYWORDS A-repeat. SOURCE M.hyorhinis DNA, clone pG102.3. ORGANISM Mycoplasma hyorhinis Prokaryota; Bacteria; Tenericutes; Mollicutes; Mycoplasmas; Mycoplasmatales; Mycoplasmataceae. REFERENCE 1 (bases 1 to 190) AUTHORS Taylor,M.A., Ferrell,R.V., Wise,K.S. and McIntosh,M.A. TITLE Reiterated DNA sequences defining genomic diversity within the species Mycoplasma hyorhinis JOURNAL Mol. Microbiol. 2, 665-672 (1988) STANDARD simple staff_review BASE COUNT 87 a 16 c 38 g 49 t ORIGIN 1 gaattcaaaa aagaagattt tgacaagaaa aatgaagaaa ttataagtca aatgaagctt 61 atttttgaag aaaataaagc aagatatgaa aaaaggagaa tcaaagctga acttaataat 121 agaggctata aaattggact tagatagggt tgagtgttgt tccagtttgg acaagaagtc 181 cactattaaa // LOCUS MYCRDNAC 191 bp ds-DNA BCT 25-JUL-1990 DEFINITION M.hyorhinis A-repeat sequence DNA. ACCESSION M35305 KEYWORDS A-repeat. SOURCE M.hyorhinis DNA, clone pG101. ORGANISM Mycoplasma hyorhinis Prokaryota; Bacteria; Tenericutes; Mollicutes; Mycoplasmas; Mycoplasmatales; Mycoplasmataceae. REFERENCE 1 (bases 1 to 191) AUTHORS Taylor,M.A., Ferrell,R.V., Wise,K.S. and McIntosh,M.A. TITLE Reiterated DNA sequences defining genomic diversity within the species Mycoplasma hyorhinis JOURNAL Mol. Microbiol. 2, 665-672 (1988) STANDARD simple staff_review BASE COUNT 97 a 18 c 29 g 47 t ORIGIN 1 gaactcaaaa aagaagattt tgacaagaaa aatgaagaaa ttataagtca aatgaagctt 61 atttttgaaa gaaaataaag caagatatga aaaaaagaga atcaaagctg aactcaataa 121 tagaggctat aaaattggac ttaaaaaagt tcacagatta ttgaaaaaat tcaatctcaa 181 agcaatttgt t // LOCUS MYCRDNAD 191 bp ds-DNA BCT 25-JUL-1990 DEFINITION M.hyopneumoniae A-repeat sequence DNA. ACCESSION M35306 KEYWORDS A-repeat. SOURCE M.hyopneumoniae DNA, clone pJ125. ORGANISM Mycoplasma hyopneumoniae Prokaryota; Bacteria; Tenericutes; Mollicutes; Mycoplasmas; Mycoplasmatales; Mycoplasmataceae. REFERENCE 1 (bases 1 to 191) AUTHORS Taylor,M.A., Ferrell,R.V., Wise,K.S. and McIntosh,M.A. TITLE Reiterated DNA sequences defining genomic diversity within the species Mycoplasma hyorhinis JOURNAL Mol. Microbiol. 2, 665-672 (1988) STANDARD simple staff_review BASE COUNT 94 a 17 c 31 g 49 t ORIGIN 1 gaactcaaaa aagaagattt tgacaagaaa aatgaagaaa ttataagtca aatgaagctt 61 atttttgaaa gaaaataaag caagatatgc aaaaaagaga ataaaagctg atcttaataa 121 tagaggctat aaaattggac ttaaaaaagt tcgcagatta ttggaaaaat tcaatctcaa 181 agcaatttgt t // LOCUS MYCRDNAE 210 bp ds-DNA BCT 25-JUL-1990 DEFINITION M.hyorhinis B-repeat sequence DNA. ACCESSION M35307 KEYWORDS B-repeat. SOURCE M.hyorhinis DNA, clones pG102.[1,3]. ORGANISM Mycoplasma hyorhinis Prokaryota; Bacteria; Tenericutes; Mollicutes; Mycoplasmas; Mycoplasmatales; Mycoplasmataceae. REFERENCE 1 (bases 1 to 210) AUTHORS Taylor,M.A., Ferrell,R.V., Wise,K.S. and McIntosh,M.A. TITLE Reiterated DNA sequences defining genomic diversity within the species Mycoplasma hyorhinis JOURNAL Mol. Microbiol. 2, 665-672 (1988) STANDARD simple staff_review BASE COUNT 84 a 29 c 28 g 69 t ORIGIN 1 gaattcttta aatttagtag aaatcaaaaa aactcaacaa ggcaactgag ttcgttataa 61 aaaagtttat caatatgcta aattcgatgc aagaactaaa caatttatct tagttgaaaa 121 aggcgttcct tttactaata tgattattgc taatcaaaac aatctacatt tgaatatttt 181 gactgacagg ttctaaagaa tgcagcattt // LOCUS STRLACZ 209 bp ds-DNA BCT 25-JUL-1990 DEFINITION S.bovis lactose catabolic protein (lacZ) gene, 5' end. ACCESSION M35285 KEYWORDS catabolic protein. SOURCE S.bovis (strain H/3) DNA. ORGANISM Streptococcus bovis Prokaryota; Bacteria; Firmicutes; Gram-positive cocci; Streptococcaceae. REFERENCE 1 (bases 1 to 209) AUTHORS Gilbert,H.J. and Hall,J. TITLE Molecular cloning of Streptococcus bovis lactose catabolic genes JOURNAL J. Gen. Microbiol. 133, 2285-2293 (1987) STANDARD simple staff_review FEATURES from to/span description pept 184 > 209 lactose catabolic protein (lacZ) BASE COUNT 59 a 55 c 43 g 52 t ORIGIN 1 tcgattagcc cttggaccct gctagtcttg acctgcctag gtttcccagg tcaagttccc 61 agttaccgac tacccgtaaa tcgatactac gccattgtta gatcggatct gaacccgtaa 121 ctttatagtt gggtatcgtg agcagatcac aatatcccac aataaaagga ggataacatc 181 caaatgatca cggacacagt ggccatcga // LOCUS STYSSCA 1551 bp ds-DNA BCT 25-JUL-1990 DEFINITION S.typhimurium Ssc protein (ssc) gene, complete cds. ACCESSION M35193 KEYWORDS Ssc protein. SOURCE S.typhimurium (strain SH5014, isolate LT2) DNA. ORGANISM Salmonella typhimurium Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Facultatively anaerobic rods; Enterobacteriaceae. REFERENCE 1 (bases 1 to 1551) AUTHORS Hirvas,L., Koski,P. and Vaara,M. TITLE Characterization of a new protein encoding region between ompH and lipid A biosynthesis genes of Salmonella typhimurium JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by L.H.Hirvas, 15-JUN-1990. Author address: L.H.Hirvas University of Helsinki Dept of Bacteriology and Immunology Haartmaninkatu 3 00290 Helsinki FINLAND FEATURES from to/span description pept 19 1044 Ssc protein BASE COUNT 362 a 354 c 438 g 397 t ORIGIN 1 aaacaggtta aataagtaat gccttcaatt cgactggctg acttagcaga acagttggat 61 gcagaattac acggtgatgg cgatatcgtc atcaccggcg ttgcgtccat gcaatctgca 121 acaacaggcc acattacgtt tatggtgaat cctaagtacc gtgaacactt aggtttatgc 181 caggcttctg cggttgtcat gacgcaggac gatcttcctt ttgctaagag tgcggcgctg 241 gtagttaaaa atccctacct gacctacgcg cgcatggcgc aaattttaga tactacgccg 301 cagcccgcgc agaatatcgc gccaagcgcc gtgattgatg cgacggcaac gctgggtagc 361 aatgtttcag tcggcgcgaa tgcggtgatt gaatctggcg tacaactggg cgataacgtg 421 gttatcggcg caggctgttt cgtcggaaaa aatagcaaaa tcggggcggg ttcacgcttg 481 tgggcgaacg taacgattta ccacgacatt cagatcggtg agaattgcct gatccagtcc 541 agtacggtga tcggcgcgga cggttttggc tacgctaacg atcgtggcaa ctgggtgaag 601 atcccacaac tgggccgggt cattattggc gatcgtgtcg agatcggcgc ttgtaccacc 661 attgaccgtg gcgcgttgga tgatactgtt attggcaatg gcgtgattat tgataatcag 721 tgccagattg cacataacgt cgtgattggc gacaatacgg cagttgccgg tggcgtcatt 781 atggcgggta gcctgaagat tggccgttac tgcatgattg gcggcgccag cgtgatcaat 841 gggcatatgg aaatatgcga caaagtcacg gtaactggca tgggtatggt gatgcgtccc 901 atcacggaac cgggcgtcta ctcctcaggc attccgctgc aacccaacaa agtatggcgt 961 aaaactgctg cactggtgat gaacattgat gatatgagca agcgtctcaa agcgattgag 1021 cgcaaggtta atcaacaaga ctaacgttcc gccttgtagt tgccattctt ttccggcctg 1081 tcacattcat acgattgcgg caggccgtgt tattattgcc tttttgtata tttggacagg 1141 aagagtattt tgactactaa cactcatact ctgcagattg aagagatttt agagcttctg 1201 ccgcaccgtt ttccgttttt actggtcgat cgcgtgctgg actttgaaga aggtcgtttt 1261 ctgcgtgcgg tgaaaaatgt ctccgtcaac gagccgtttt tccaggggca tttcccgggc 1321 aaaccgattt tgccaggcgt gctgattctg gaagcgatgg cgcaggcaac cggtattctg 1381 gcgtttaaaa gcgttggtaa actggaacct ggcgaactgt attatttcgc gggtattgat 1441 gaagcgcgct ttaagcgtcc ggtggtgcca ggcgatcaga tgatcatgga agtcactttc 1501 gagaaaacgc gccgtggcct gacccgcttt aaaggggttg cgctggtcga c // LOCUS TOBRUBPA 979 bp ds-DNA PLN 25-JUL-1990 DEFINITION Tobacco ribulose-1,5-bisphosphate carboxylase small subunit gene, exons 1 and 2. ACCESSION M32419 KEYWORDS ribulose-1,5-bisphosphate carboxylase. SOURCE Tobacco DNA, clone TSSU3-8. ORGANISM Nicotiana tabacum Eukaryota; Plantae; Embryobionta; Magnoliophyta; Magnoliopsida; Asteridae; Solanales; Solanaceae. REFERENCE 1 (bases 1 to 979) AUTHORS O'Neal,J.K., Pokalsky,A.R., Kiehne,K.L. and Shewmaker,C.K. TITLE Isolation of tobacco SSU genes: Characterization of a transcription- ally active pseudogene JOURNAL Nucleic Acids Res. 15, 8661-8676 (1987) STANDARD simple staff_review FEATURES from to/span description pept 584 760 ribulose-1,5-bisphosphate carboxylase small subunit precursor, exon 1 854 > 979 ribulose-1,5-bisphosphate carboxylase small subunit precursor, exon 2 sigp 584 757 ribulose-1,5-bisphosphate carboxylase small subunit signal peptide matp 758 760 ribulose-1,5-bisphosphate carboxylase small subunit 854 > 979 ribulose-1,5-bisphosphate carboxylase small subunit IVS 761 853 ribulose-1,5-bisphosphate carboxylase small subunit intron A BASE COUNT 320 a 175 c 185 g 299 t ORIGIN 1 ttaattatgt ctttgtttgc ttctcatgtg ataaagaatc gaagccttga tgaacataat 61 ttgcatttga gtagtgaata gctgctttca caaagagtac tctagctatt aagtttagtt 121 tgaatatttt gaaacacaaa aatatatgta tacatacaaa aacaaatacc gcaatagtcc 181 aagcaaaagg gactttaaaa aaaaaaacca acctcaatta cacattcata tcctcttcct 241 accccatcta ggatgagata agattactga ggttgtttac acgtggcacc tccattgtgg 301 tgaattaaat gatcaatggc ttagctcaaa atataatttt ccaacctttc atgtgtggat 361 attaagtttt gtgtagtgaa tcaagaacca cataatccaa tggttagctt tactccaaga 421 tgagggggtt gttgattttt gtccgttaga tatgggaaat atgtaaaacc ttatcattat 481 atatagagtg gtgggcaact atgcaatgac catcttggaa gtttaaagga aaaaaaagga 541 aagggagaaa gagaaatctt tctgtcttaa agtgtaatta acaatggctt cctcagttct 601 ttcctctgca gcagttgcca cccgcagcaa tgttgctcaa gctaacatgg ttgcaccttt 661 cactggcctt aagtcagctg cctcattccc tgtttcaagg aagcaaaacc ttgacatcac 721 ttccattgcc agcaacggcg gaagagtgca atgcatgcag gtaatttata tacaatgaca 781 gtgcaaaaaa ttttgataca attaatgcat cttaacatgt catagctaaa aattctattt 841 tggtggaata taggtgtggc caccaattaa caagaagaag tacgagactc tctcatacct 901 tcctgatttg agccaggagc aattgcttag tgaagttgag taccttttga aaaatggatg 961 ggttccttgc ttggaattc // LOCUS TOBRUBPB 1337 bp ds-DNA PLN 25-JUL-1990 DEFINITION Tobacco ribulose-1,5-bisphosphate carboxylase small subunit pseudogene, complete cds. ACCESSION M32420 KEYWORDS pseudogene; ribulose-1,5-bisphosphate carboxylase. SOURCE Tobacco DNA, clone TSSU3-2. ORGANISM Nicotiana tabacum Eukaryota; Plantae; Embryobionta; Magnoliophyta; Magnoliopsida; Asteridae; Solanales; Solanaceae. REFERENCE 1 (bases 1 to 1337) AUTHORS O'Neal,J.K., Pokalsky,A.R., Kiehne,K.L. and Shewmaker,C.K. TITLE Isolation of tobacco SSU genes: Characterization of a transcription- ally active pseudogene JOURNAL Nucleic Acids Res. 15, 8661-8676 (1987) STANDARD simple staff_review FEATURES from to/span description pept.ps 272 451 ribulose-1,5-bisphosphate carboxylase small subunit, exon 1 620 754 ribulose-1,5-bisphosphate carboxylase small subunit, exon 2 912 1100 ribulose-1,5-bisphosphate carboxylase small subunit, exon 3 IVS 452 619 ribulose-1,5-bisphosphate carboxylase small subunit intron A IVS 755 911 ribulose-1,5-bisphosphate carboxylase small subunit intron B BASE COUNT 379 a 249 c 256 g 453 t ORIGIN 1 gttttagaaa atatttccca ttcacaaatt aagtttggga actttgagat aaggacgact 61 gagtgtaatc aatgtcaggg gttcaaattt atgtgcccgt caatttttca atccacggct 121 acgattcctc taagatgagg tcattgcttg cttgtgtccg ttagatgaga aaaagacgtg 181 aaaccttatc actatatata gcactcatca cacccttgaa agcaaaggtc aagggaagca 241 atagctttaa gctaaacaat tactttcaac aatggcttcg tctgtgattt cctcagccgc 301 tgccgttgcc accggcgcta atgcggctca agccagtatg gttgcacctt tcactggcct 361 caaatccgcc tactccttcc ctgtttccag aaaacaaaac cttgacatta cttccattgc 421 tagcaatggt ggaagagttt aatgcatgca ggtttgtagc atatattatt gtagttagct 481 tatataaact gatagagtaa agaaatttta cgttatatat tgatatattt taacctggta 541 atttgattta tttttcatat tattaatccc acttttttat tgtacttatg aagtttattt 601 taattcttta tatatatagg tgtggccacc aattaacaag aagaagtacg agacactctc 661 ataccttcct gatttgagcg aggagcaatt gcttagggaa gttgaatacc ttttgaaaaa 721 tggatgggtt ccttgcttgg aattcgagac tgaggtcaaa catctattct aaatcatgct 781 actattatca agcataacta acatgaataa ctcaatccta actagtttgg gattagacat 841 atatagttga ttaagtgaaa gaggagtatt atctcatgtt aatgttttgt ttatcttgtg 901 gatatgcgca gcacggattc gtctaccgtg agaataacaa gtcaccaggt tactacgatg 961 gaagggccac tcaggtcttg gctgaggtcg aggaggcaaa gaaggcttac ccacaagcct 1021 ggatcagaat cattggattc gacaacgtcc gtcaagtgca atgcatcagt ttcatcgcct 1081 acaagcccgc aggctactaa aatctccatt tttaagacaa cttaccgtat gtattcaggg 1141 gaagtttgtt tgaattctcc ttgtgttttt ccccggagaa actgttttgg ttttcctttg 1201 ttttaattcc ttctttctat tcggtgtata tttttgaatt ccaatcaagt ttatgagaac 1261 taataatgtc atttgtttct ttcgtaattt gctttgtggt gtacatcggt tttaattatc 1321 cgagtaatat ctgcttt // LOCUS ZYMCPA 1374 bp ss-RNA VRL 25-JUL-1990 DEFINITION Zucchini yellow mosaic virus coat protein (cp) mRNA, 3' end. ACCESSION M35095 KEYWORDS capsid protein; coat protein. SOURCE Zucchini yellow mosaic virus, cDNA to viral RNA, clone ZYKS-22cp. ORGANISM Zucchini yellow mosaic virus Viridae; ss-RNA nonenveloped viruses; Rod-shaped ss-RNA viruses; Potyvirus. REFERENCE 1 (bases 1 to 1374) AUTHORS Gal-On,A., Antignus,Y., Rosner,A. and Raccah,B. TITLE Nucleotide sequence of the zucchini yellow mosaic virus capsid- encoding gene and its expression in Escherichia coli JOURNAL Gene 87, 273-277 (1990) STANDARD simple staff_review FEATURES from to/span description pept < 1 1164 coat protein (cp) (AA at 1) mRNA < 1 1374 cp mRNA BASE COUNT 423 a 279 c 342 g 330 t ORIGIN 1 tcgacgaagg agagattgtt tccaatttta gagtgggata gaagcaaaga aattatgcac 61 cgaacagagg ctatttgcgc tgcgatgatt gaggcatggg gacacaccga gcttttacaa 121 gagatcagaa agttttatct atggttcgtt gaaaaggaag aagtgcgaga attagccgcc 181 ctcggaaaag ctccatacat agctgagaca gcacttcgta agctatacac tgacaaggga 241 gcggatacaa gtgaactggc acgttatcta caagccctcc accaagacat cttctttgaa 301 caaggagaca ctgtaatgct ccaatcaggc actcagccaa ctgtggcaga cactggagcc 361 acaaagaaag acaaagaaga tgacaaaggg aaaaacaagg atgttacagg ctccggctca 421 agtgagaaaa cagtggcagc tgtcacgaag gacaaggatg taaatgctgg ttctcatggg 481 aaaattgtgc cgcgtctttc gaagataaca aagaagatgt cactgccacg cgtgaaagga 541 aatgtgatac tcgacattga tcacttgctg gagtataagc cggatcaaat tgagttatac 601 aacacacgag cgtctcatca gcaattcgcc tcttggttca accaagttaa aacagaatat 661 gatctgaatg agcaacagat gggagttgta atgaatggtt tcatggtttg gtgcatcgaa 721 aatggcacgt cacccgacat taacggagta tgggttatga tggacggtaa tgagcaggtt 781 gaatatcctt tgaaaccaat agttgaaaat gcaaagccaa cgctgcgaca aataatgcat 841 cacttttcag atgcagcgga ggcatatata gagatgagaa atgcagaggc accatacatg 901 ccgaggtatg gtttgcttcg aaacttacgg gataggagtt tggcacgata tgctttcgac 961 ttctacgaag tcaattccaa aactccggaa agagcccgcg aagctgttgc gcagatgaaa 1021 gcagcagccc ttagcaatgt ttcttcaagg ttgtttggcc ttgatggaaa tgttgccacc 1081 actagcgaag acactgaacg gcacactgca cgtgatgtta ataggaacat gcacaccttg 1141 ctaggtgtga atacaatgca gtaaagggta ggtcgcctac ctaggttatc gtttcgctcc 1201 gacgtaattc taatatttac cgctttatgt gatgtcttta catttctaga gtgggcctcc 1261 cacctttaaa gcgtaaagtt tatgttagtt gtccaggagt gccgtagtcc tgtcggaagc 1321 tttagtgtga gcctctcacg aataagctcg agattagact ccgtttgcaa gcct //