Path: utzoo!attcan!uunet!bionet!root From: GenBank-Updates@genbank.bio.net Newsgroups: bionet.molbio.genbank.updates Subject: Database Update Message-ID: Date: 10 Jul 90 12:00:35 GMT Sender: root@genbank.BIO.NET Distribution: bionet Lines: 5381 Approved: lear@genbank.bio.net Checksum: 60632 317 LOCUS HUMCDR34 2412 bp ds-DNA PRI 10-JUL-1990 DEFINITION Human cerebellar-degeneration-related antigen (CDR34) gene, complete cds. ACCESSION M31423 KEYWORDS cerebellar-degeneration-related antigen. SOURCE Human neuroblastoma BE(2)-88n cell line DNA, clone lambda CDR34. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 2412) AUTHORS Chen,Y.-T., Rettig,W.J., Yenamandra,A.K., Kozak,C.A., Chaganti,R.S.K., Posner,J.B. and Old,L.J. TITLE Cerebellar degeneration-related antigen: A highly conserved neuroectodermal marker mapped to chromosomes X in human and mouse JOURNAL Proc. Natl. Acad. Sci. U.S.A. 
87, 3077-3081 (1990) STANDARD full staff_review COMMENT Draft entry and printed sequence for [1] kindly submitted by Y.-T.Chen, 17-JAN-1990 FEATURES from to/span description pept 503 1174 cerebellar-degeneration-related antigen (CDR34) BASE COUNT 743 a 334 c 669 g 666 t ORIGIN 1 atgttggttc ataagatctg gtctataagg aggaatgtcc cattaaatgt ttttgaagct 61 aattcaacta gaagcagaaa tagttgagtt ggaagatttt ctgtagagtg attttaacat 121 gggaaggctc agacagggga agcctagatt tgaaaaggcc tggacctggg gaaaggctgg 181 caagatctgg actatagaac atgttagaat actgatattc gcagacacct ggaagactga 241 atgtcagaag atcagcacac tggagacgtt ggaagacatg gatattgagc cagttgatgg 301 aagactgggt agttgttgga agacatcaag gtgctggaag acacagcagc atgctggaag 361 acctggagat gttggaagac gagcagactc ctggaagccc tggagatgct gcaagacctg 421 gagatatagg aagacactgg actttgttgc gagcttagtt ggaagacata tatttttgga 481 agacgtggat tttctggaag acatggcttg gttggaagac gtggattttc tggaagacgt 541 acctttgttg gaagacatac ctttgttgga agacgtacct ttgttggaag acgtaccttt 601 gttggaagac acaagtaggc tggaagacat taatttgatg gaagacatgg ctttgttgga 661 agacgtggat ttgctggaag acacggattt cctggaagac ctggattttt cggaagctat 721 ggatttgagg gaagacaagg attttctgga agacatggat agtctggaag acatggcttt 781 gttggaagac gtggacttgc tggaagacac ggatttcctg gaagacccgg attttttgga 841 agctatagat ttaagggaag acaaggattt tctggaagac atggatagtc tggaagacct 901 gaggccattg gaagatgtgg attttctgga agacatggct tttttggaag acgtagattt 961 tcaggaagac ccaaattatc cggaagactt ggattgttgg gaagacgtgg attttctgga 1021 agactggagg ttactggaag acatggattt tctggaagac atggattttc tggaagacgt 1081 ggatcttcag gaagacatat attggctgga agacctggat tttttccgga agatgtggat 1141 tgactggaag acctggattt ggtggaagac gtagattttc tggaagacac tgactgactg 1201 gaagacactg attgactgga agacctggat ttctttctgg aagacactga ttgactggaa 1261 gatctagatt tttctggaag aactagattt actggaagac ttggatttgg tggaagacac 1321 agatttttct ggaagacatg gattagctgg aagatctgta tttgatggaa gaccttgaaa 1381 ttattggaag acatggattt cctggaagac gtggattttc ctggaagatc tggatttggt 1441 ggaagaccag taattgctgg aagactggat 
ttgctggaag acttgattta ctggaagact 1501 tggagcttct tggaagacat ggattgtccg gaagacatgg attgtctgga agatgtggat 1561 tttctggaag ctcaggatta tctggaagac cttgagatta ttggaacact tgaagtcgct 1621 ggaagacccg agttgttgga agaccttgta cacaggtgcc atcggaactc ctgacattga 1681 aacattgtaa gcacaggata ttgagacatt gcaagccttg attttaagac atggtactct 1741 ggacattgat atttctgagg ccctgaacat tgggatatta atattggaag tcatagacac 1801 tgaaatctct ggaaattaga gatattgtaa gtcctgtacc ttggaactcc taaatactgg 1861 cagatataaa caacagcaga tgtagacatt tataaatcct aaaatgagaa gccctggata 1921 ttgggagaca ttggtaagca tggatacttg acatatttat gtcaaaaaga cagtttggaa 1981 gaattaaatt ttaaagatgc tccatgtcaa gaatactggc agcctggaca atatgagacc 2041 aggatattaa gaggtctatt cattcagaca ttgaggatat tgatgtacct gaaagttctt 2101 gcaggtattt aaagacttga gcattggagg aattggcgat aaaaatacac tgtaaaacta 2161 gaaagtagga gacatttaaa aatgtaaaaa ctgaatgatg taagtgctgg aagacattga 2221 agaatctaga agacctgtat ataggagaca ttggaggatt aggaccatgg ccgacttgta 2281 atttagaact ctggattctg aaagacaaga cctggacttt gaagaagggt tgttggagat 2341 attagaagac ctaaattttt aatgacttga atactgggag tttagaaaac aagggcattt 2401 gagatgctgc ag // LOCUS RATHGF 2485 bp ss-mRNA ROD 10-JUL-1990 DEFINITION Rat hepatocyte growth factor mRNA, complete cds. ACCESSION M32987 KEYWORDS hepatocyte growth factor. SOURCE Rat (strain Wistar) adult liver, clones RBC[1,3] and RAC[1,2]. ORGANISM Rattus norvegicus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 2485) AUTHORS Tashiro,K., Hagiya,M., Nishizawa,T., Seki,T., Shimonishi,M., Shimizu,S. and Nakamura,T. TITLE Deduced primary structure of rat hepatocyte growth factor and expression of the mRNA in rat tissues JOURNAL Proc. Natl. Acad. Sci. U.S.A. 87, 3200-3204 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable [or printed] sequence for [1] kindly submitted by T.Nakamura, 26-MAR-1990. 
FEATURES from to/span description pept 143 2329 hepatocyte growth factor BASE COUNT 750 a 533 c 577 g 625 t ORIGIN 1 gtttagtcct agatctttcc agttaatcac acaacaaact tagctcatcg caataaaagc 61 agctcagaac cgaccggctt gcaacaggat tctttcagcc cggcatctcc tgcagaggga 121 tcagcctgct cgaactgcaa gcatgatgtg ggggaccaaa cttctgccgg tcctgttgct 181 gcagcatgtc ctgctgcacc tcctcctgct tcctgtcacc atcccctatg cagaaggaca 241 gaagaagaga agaaatactc ttcatgaatt caaaaagtca gcaaaaacta ctcttaccaa 301 ggaagaccca ttagtgaaga ttaaaaccaa aaaagtgaac tctgcagatg agtgtgccaa 361 caggtgcatc agaaacaagg gctttccatt cacttgcaag gcctttgttt ttgataagtc 421 gagaaaacga tgctactggt atcctttcaa tagtatgtca agtggagtga aaaaagggtt 481 tggccatgaa tttgacctct atgaaaacaa agactatatt agaaattgca tcattggtaa 541 aggaggcagc tataagggga cagtatccat cactaagagt ggcatcaagt gccagccttg 601 gaattccatg atcccccatg aacacagctt tttgccttcg agctatcgcg gtaaagacct 661 acaggaaaac tactgtcgaa atcctcgagg ggaagaaggg ggaccctggt gtttcacaag 721 caatccagag gtacgctacg aagtctgtga cattcctcag tgttcagaag ttgaatgcat 781 gacctgcaac ggtgaaagct acagaggtcc catggatcac acagaatcag gcaagacatg 841 tcagcgctgg gatcagcaga caccacaccg gcacaaattc ttgccggaaa gatatcccga 901 caagggcttt gatgataatt attgccgcaa tcccgatggc aagccgaggc catggtgcta 961 cactcttgac cctgacaccc cttgggagta ttgtgcaatt aaaatgtgcg ctcacagtgc 1021 tgtgaatgag actgatgttc ccatggaaac aactgaatgt ataaaaggcc aaggagaagg 1081 ttacagggga accaccaata ccatttggaa tggaattccg tgtcagcgtt gggattcgca 1141 gtacccccac aagcatgaca tcactcccga gaacttcaaa tgcaaggacc ttagagaaaa 1201 ttattgccgc aatccggatg gggctgaatc accatggtgt tttaccactg atccaaacat 1261 ccgagttggt tactgctctc aaattcccaa atgtgacgtg tcaagtggac aagattgtta 1321 tcgtggcaat gggaaaaact acatgggcaa cttatccaaa acaaggtctg gactcacatg 1381 ttccatgtgg gacaagaata tggaggattt acaccgtcat atcttctggg agccagacgc 1441 tagcaagttg actaagaatt actgccggaa ccccgatgac gacgcccatg gaccttggtg 1501 ctacacaggg aatcctctcg ttccttggga ttattgccct atttcccgtt gtgaaggaga 1561 tactacacct acaattgtca atttggacca tcctgtaata 
tcctgtgcca aaacaaaaca 1621 actgcgagtt gtaaatggca ttccaacaca aacaacagta gggtggatgg ttagtttgaa 1681 atacaggaat aaacacatct gtgggggatc attgataaag gaaagttggg ttcttactgc 1741 aaggcaatgt tttccagcta gaaacaaaga cttgaaagac tatgaagctt ggcttggaat 1801 ccatgatgtc catgagagag gcgaggagaa acgcaaacag atcttaaaca tttcccagct 1861 agtctatgga cctgaaggct cagatttggt tttactgaag cttgctcgcc ctgcaatcct 1921 ggataacttt gtcagtacaa ttgatttacc tagttatggc tgtacaatcc ctgaaaagac 1981 tacttgcagt atttacggct ggggctacac tggattgatc aacgcagatg gtttattacg 2041 agtagctcat ctgtatatta tggggaatga gaaatgcagt cagcaccatc aaggcaaggt 2101 gactttgaat gagtctgaat tatgtgctgg ggctgaaaag attggatcag gaccttgtga 2161 gggagattat ggtggcccac tcatttgtga acaacacaaa atgagaatgg ttcttggtgt 2221 cattgttcct ggtcgtggat gtgccatccc aaatcgtcct ggtatttttg ttcgagtagc 2281 atattatgca aaatggatac acaaagtaat tttgacatac aagttgtaat agccatagaa 2341 gaggccagtg tatttgaagc atccatggat acaggaagat ttccaagact tcaggattaa 2401 aatgtcacct aaaacaatcc taaaacaact acttgagtgt tgtgagtgtt cagatactca 2461 ttaatatatg tggcgttttc tgttg // LOCUS HUMINSGS 351 bp ds-DNA SYN 10-JUL-1990 DEFINITION Human (synthetic) insulin gene, complete cds. ACCESSION J02547 M25881 KEYWORDS artificial gene; insulin. SOURCE Synthetic human DNA. ORGANISM Artificial gene Artificial sequences; Genes. REFERENCE 1 (bases 79 to 351) AUTHORS Brousseau,R., Scarpulla,R., Sung,W., Hsiung,H.M., Narang,S.A. and Wu,R. TITLE Synthesis of a human insulin gene: V. Enzymatic assembly, cloning and characterization of the human proinsulin DNA JOURNAL Gene 17, 279-289 (1982) STANDARD full staff_review REFERENCE 2 (bases 1 to 351) AUTHORS Georges,F., Brousseau,R., Michniewicz,J., Prefontaine,G., Stawinski,J., Sung,W., Wu,R. and Narang,S.A. TITLE Synthesis of a human insulin gene: VII. 
Synthesis of preproinsulin-like human DNA, its cloning and expression in M13 bacteriophage JOURNAL Gene 27, 201-211 (1984) STANDARD full staff_review REFERENCE 3 (bases 1 to 351) AUTHORS Narang,S.A., Brousseau,R., Georges,F., Michniewicz,J., Prefontaine,G., Stawinski,J. and Sung,W. TITLE The human preproinsulin gene: synthesis, cloning, gene modification, and expression studies JOURNAL Can. J. Biochem. 62, 209-216 (1984) STANDARD full staff_review COMMENT In places where the human insulin amino acid sequence is identical to the rat insulin amino acid sequence, the synthetic sequence follows the published nucleotide sequence for rat (see separate entry). FEATURES from to/span description pept 6 350 synthetic preproinsulin sigp 6 77 synthetic insulin signal peptide matp 90 179 synthetic insulin B-chain matp 186 278 synthetic insulin C-chain matp 285 347 synthetic insulin A-chain BASE COUNT 65 a 93 c 100 g 93 t ORIGIN 78 bp upstream of EcoRI site. 1 aattcatggg cctatggatc cgtctactgc ctctgatcgc gctgctgatc ctctggggac 61 cggatccagc tgcggccgaa ttccggatgt ttgtcaatca gcacctttgt ggttctcacc 121 tggtggaggc tctgtacctg gtgtgtgggg aacgtggttt cttctacaca cccaagaccc 181 gtcgtgaagc tgaagacctt caagtgggtc aagttgaact tggtgggggt cctggtgcgg 241 gttctcttca acctttggct ctcgagggat cacttcaaaa gcgtggcatt gtggagcagt 301 gctgcaccag catctgctcc ctctaccaac tggagaacta ctgcaactga g // LOCUS TRFRRECF 212 bp ss-rRNA RNA 10-JUL-1990 DEFINITION Trypanosomatid (C.fasciculata) small rRNA e from the large ribosomal subunit. ACCESSION K02691 M25882 KEYWORDS ribosomal RNA. SOURCE Trypanosomatid (C.fasciculata) ribosomal RNA. ORGANISM Crithidia fasciculata Eukaryota; Animalia; Protozoa; Sarcomastigophora; Mastigophora; Kinetoplastida; Trypanosomatina; Trypanosomatidae. REFERENCE 1 (bases 1 to 212) AUTHORS Schnare,M.N., Spencer,D.F. and Gray,M.W. TITLE Primary structures of four novel small ribosomal RNAs from Crithidia fasciculata JOURNAL Can. J. Biochem. 
61, 38-45 (1983) STANDARD full staff_review COMMENT The large subunit of the ribosome of C.fasciculata contains six small rRNAs (designated e,f,g,h,i,j), when normally only two (h,i) are found in ribosomes of other organisms. rRNAs e,f,g, and j were analyzed by [1]. In rRNA e the number of "g" residues at positions 91-93 and 123-124 was ambiguous. At positions 77-81 three "c"s and two "t"s were found, but the order was unclear. Positions 116 and 119 gave strong "t" bands but also consistently gave weak bands in the "u-2" track. [1] is not sure of the reason: sequencing artifact or an indication of cistron heterogeneity. No evidence of this heterogeneity was found in chemical gels. FEATURES from to/span description rRNA 1 212 ribosomal RNA e modified 125 125 p (putative) BASE COUNT 46 a 53 c 51 g 62 t ORIGIN 5' end of mature rRNA e. 1 tagtggaaat gcgaaacact tgccaggtga caaatcaatc ctcccacggt gagctttctt 61 ttcaccataa tccacatctc cggctttgct gggcttgggc ctttttactt ctcgcgttgt 121 tcggtgcggg ggcccaagat tgaaaaatgc agctctccct acgtactgtc attgttgtga 181 gttctgcgca ttaaagcaaa aacctggggt gt // LOCUS TRFRRFCF 183 bp ss-rRNA RNA 10-JUL-1990 DEFINITION Trypanosomatid (C.fasciculata) small rRNA f from the large ribosomal subunit. ACCESSION K02692 M25883 KEYWORDS ribosomal RNA. SOURCE Trypanosomatid (C.fasciculata) ribosomal RNA. ORGANISM Crithidia fasciculata Eukaryota; Animalia; Protozoa; Sarcomastigophora; Mastigophora; Kinetoplastida; Trypanosomatina; Trypanosomatidae. REFERENCE 1 (bases 1 to 183) AUTHORS Schnare,M.N., Spencer,D.F. and Gray,M.W. TITLE Primary structures of four novel small ribosomal RNAs from Crithidia fasciculata JOURNAL Can. J. Biochem. 61, 38-45 (1983) STANDARD full staff_review COMMENT The large subunit of the ribosome of C.fasciculata contains six small rRNAs (designated e,f,g,h,i,j), when normally only two (h,i) are found in ribosomes of other organisms. rRNAs e,f,g, and j are reported by [1]. 
FEATURES from to/span description rRNA 1 183 ribosomal RNA f BASE COUNT 41 a 49 c 57 g 36 t ORIGIN 5' end of mature rRNA f. 1 gtgagattgt gaagggatct cgcaggcatc gtgagggaag tatggggtag tacgagagga 61 actcccatgc cgtgcctcta gtttctgggg tttgtcgaac ggcaagtgcc ccgaagccat 121 cgcacggtgg ttctcggctg aacgcctcta agccagaagc caatcccaag accagatgcc 181 ccc // LOCUS TRFRRGCF 136 bp ss-rRNA RNA 10-JUL-1990 DEFINITION Trypanosomatid (C.fasciculata) small rRNA g from the large ribosomal subunit. ACCESSION K02693 M25884 KEYWORDS ribosomal RNA. SOURCE Trypanosomatid (C.fasciculata) ribosomal RNA. ORGANISM Crithidia fasciculata Eukaryota; Animalia; Protozoa; Sarcomastigophora; Mastigophora; Kinetoplastida; Trypanosomatina; Trypanosomatidae. REFERENCE 1 (bases 1 to 136) AUTHORS Schnare,M.N., Spencer,D.F. and Gray,M.W. TITLE Primary structures of four novel small ribosomal RNAs from Crithidia fasciculata JOURNAL Can. J. Biochem. 61, 38-45 (1983) STANDARD full staff_review COMMENT The large subunit of the ribosome of C.fasciculata contains six small rRNAs (designated e,f,g,h,i,j), when normally only two (h,i) are found in the ribosomes of other organisms. rRNAs e,f,g, and j are reported by [1]. There was some question whether rRNA g contained 135 or 136 bp, starting with base 1 or 2 in the sequence presented below. FEATURES from to/span description rRNA 1 136 ribosomal RNA g BASE COUNT 31 a 37 c 40 g 28 t ORIGIN 5' end of mature rRNA g. 1 acaacgtccc tctccaaacg agagaatatg catgggctgg catgagcggc atgcttcact 61 ccggtggggc tcgaggggca cttacgtccc gaggcgctga accttgaggc ctgaaatttc 121 atgctctggg actaaa // LOCUS TRFRRJCF 73 bp ss-rRNA RNA 10-JUL-1990 DEFINITION Trypanosomatid (C.fasciculata) small rRNA j from the large ribosomal subunit. ACCESSION K02694 M25885 KEYWORDS ribosomal RNA. SOURCE Trypanosomatid (C.fasciculata) ribosomal RNA. ORGANISM Crithidia fasciculata Eukaryota; Animalia; Protozoa; Sarcomastigophora; Mastigophora; Kinetoplastida; Trypanosomatina; Trypanosomatidae. 
REFERENCE 1 (bases 1 to 73) AUTHORS Schnare,M.N., Spencer,D.F. and Gray,M.W. TITLE Primary structures of four novel small ribosomal RNAs from Crithidia fasciculata JOURNAL Can. J. Biochem. 61, 38-45 (1983) STANDARD full staff_review COMMENT The large subunit of the ribosome of C.fasciculata contains six small rRNAs (designated e,f,g,h,i,j) when normally only two (h,i) are found in the ribosomes of other organisms. rRNAs e,f,g, and j are reported by [1]. There was some question whether rRNA j contained 72 or 73 bp, starting with base 1 or 2 in the sequence presented below. FEATURES from to/span description rRNA 1 73 ribosomal RNA j BASE COUNT 17 a 23 c 14 g 19 t ORIGIN 5' end of mature rRNA j. 1 tcatcgaatc gccacctaca cgactggagc ttgctccctc gtcggcctct agtatattca 61 tgatcacaag gta // LOCUS YSCRGEA 1798 bp ds-DNA PLN 10-JUL-1990 DEFINITION Yeast (S.cerevisiae) 18S ribosomal RNA gene. ACCESSION J01353 M27607 KEYWORDS 18S ribosomal RNA; ribosomal RNA. SOURCE Yeast (S.cerevisiae + D4) DNA, clones pY1rA3 and prYC. ORGANISM Saccharomyces cerevisiae Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes; Endomycetales; Saccharomycetaceae. REFERENCE 1 (bases 1 to 1798) AUTHORS Rubtsov,P.M., Musakhanov,M.M., Zakharyev,V.M., Krayev,A.S., Skryabin,K.G. and Bayev,A.A. TITLE The structure of the yeast ribosomal RNA genes. I. The complete nucleotide sequence of the 18S ribosomal RNA gene from Saccharomyces cerevisiae JOURNAL Nucleic Acids Res. 8, 5779-5794 (1980) STANDARD full staff_review REFERENCE 2 (bases 1 to 1798; revises [1]) AUTHORS Mankin,A.S., Skryabin,K.G. and Rubtsov,P.M. 
TITLE Identification of ten additional nucleotides in the primary structure of yeast 18S rRNA JOURNAL Gene 44, 143-143 (1986) STANDARD full staff_review FEATURES from to/span description rRNA 1 1798 18S ribosomal RNA revision 943 943 a in [2]; g in [1] revision 962 962 a in [2]; g in [1] revision 982 983 ag in [2]; ga in [1] revision 988 999 tcgaagatgatc in [2]; tc in [1] revision 1002 1002 g in [2]; a in [1] revision 1122 1123 ag in [2]; aag in [1] revision 1742 1742 a in [2]; g in [1] BASE COUNT 480 a 348 c 459 g 511 t ORIGIN 9 bp upstream of Sau3A site. 1 tatctggttg atcctgccag tagtcatatg cttgtctcaa agattaagcc atgcatgtct 61 aagtataagc aatttataca gtgaaactgc gaatggctca ttaaatcagt tatcgtttat 121 ttgatagttc ctttactaca tggtataacc gtggtaattc tagagctaat acatgcttaa 181 aatctcgacc ctttggaaga gatgtattta ttagataaaa aatcaatgtc ttcggactct 241 ttgatgattc ataataactt ttcgaatcgc atggccttgt gctggcgatg gttcattcaa 301 atttctgccc tatcaacttt cgatggtagg atagtggcct accatggttt caacgggtaa 361 cggggaataa gggttcgatt ccggagaggg agcctgagaa acggctacca catccaagga 421 aggcagcagg cgcgcaaatt acccaatcct aattcaggga ggtagtgaca ataaataacg 481 atacagggcc cattcgggtc ttgtaattgg aatgagtaca atgtaaatac cttaacgagg 541 aacaattgga gggcaagtct ggtgccagca gccgcggtaa ttccagctcc aatagcgtat 601 attaaagttg ttgcagttaa aaagctcgta gttgaacttt gggcccggtt ggccggtccg 661 attttttcgt gtactggatt tccaacgggg cctttccttc tggctaacct tgagtccttg 721 tggctcttgg cgaaccagga cttttacttt gaaaaaatta gagtgttcaa agcaggcgta 781 ttgctcgaat atattagcat ggaataatag aataggacgt ttggttctat tttgttggtt 841 tctaggacca tcgtaatgat taatagggac ggtcgggggc atcggtattc aattgtcgag 901 gtgaaattct tggatttatt gaagactaac tactgcgaaa gcatttgcca aggacgtttt 961 cattaatcaa gaacgaaagt taggggatcg aagatgatct ggtaccgtcg tagtcttaac 1021 cataaactat gccgactaga tcgggtggtg tttttttaat gacccactcg gtaccttacg 1081 agaaatcaaa gtctttgggt tctgggggga gtatggtcgc aaggctgaaa cttaaaggaa 1141 ttgacggaag ggcaccacta ggagtggagc ctgcggctaa tttgactcaa cacggggaaa 1201 ctcaccaggt ccagacacaa 
taaggattga cagattgaga gctctttctt gattttgtgg 1261 gtggtggtgc atggccgttt ctcagttggt ggagtgattt gtctgcttaa ttgcgataac 1321 gaacgagacc ttaacctact aaatagtggt gctagcattt gctggttatc cacttcttag 1381 agggactatc ggtttcaagc cgatggaagt ttgaggcaat aacaggtctg tgatgccctt 1441 agaacgttct gggccgcacg cgcgctacac tgacggagcc agcgagtcta accttggccg 1501 agaggtcttg gtaatcttgt gaaactccgt cgtgctgggg atagagcatt gtaattattg 1561 ctcttcaacg aggaattcct agtaagcgca agtcatcagc ttgcgttgat tacgtccctg 1621 ccctttgtac acaccgcccg tcgctagtac cgattgaatg gcttagtgag gcctcaggat 1681 ctgcttagag aagggggcaa ctccatctca gagcggagaa tttggacaaa cttggtcatt 1741 tagaggaact aaaagtcgta acaaggtttc cgtaggtgaa cctgcggaag gatcatta // LOCUS DROSHA1A 1473 bp ss-mRNA INV 10-JUL-1990 DEFINITION D.melanogaster Sha12 protein mRNA, complete cds. ACCESSION M32660 KEYWORDS . SOURCE D.melanogaster, cDNA to mRNA. ORGANISM Drosophila melanogaster Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta; Pterygota; Neoptera; Holometabola; Diptera; Brachycera; Cyclorrhapha; Schizophora; Drosophiloidea; Drosophilidae. REFERENCE 1 (bases 1 to 1473) AUTHORS Butler,A., Wei,A. and Salkoff,L. TITLE Shal, Shab, and Shaw: Three genes encoding potassium channels in Drosophila JOURNAL Nucleic Acids Res. 18, 2173-2174 (1990) STANDARD full staff_review REFERENCE 2 (bases 1 to 1473) AUTHORS Wei,A., Covarrubias,M., Butler,A., Baker,K., Pak,M. and Salkoff,L. TITLE Diverse K+ currents expressed by a Drosophila extended gene family which is conserved in mouse JOURNAL Science 248, 599-603 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by L.Salkoff, 07-MAR-1990. FEATURES from to/span description pept 1 1473 Sha12 protein BASE COUNT 302 a 434 c 432 g 305 t ORIGIN Chromosome 3 left arm at locus 76B. 
1 atggcctcgg tcgccgcttg gctgcccttc gcccgggcgg cggccatcgg gtgggtgccg 61 atagccaccc acccactgcc accgcccccg atgcccaagg atcgccgcaa aacggacgac 121 gagaagctcc tgatcaacgt ctccgggcgg cgcttcgaga cgtggcggaa tactttggag 181 aagtatccgg acaccctttt aggttccaat gaaagggagt tcttctacga cgaggactgc 241 aaagaatact tcttcgatcg ggacccggac atcttccggc acatactgaa ctactaccgg 301 acgggcaagc tgcactaccc gaagcacgaa tgcctcacca gctacgacga ggagctggcc 361 ttctttggaa taatgccgga tgtcattggc gattgctgct acgaggacta ccgggaccgg 421 aagcgggaga acgcggagcg gctgatggac gacaagctgt cggagaacgg ggatcagaat 481 ctgcagcagc tgaccaacat gcgccagaag atgtggcggg ccttcgagaa tccgcacacg 541 tcgacgagcg ccctggtgtt ctactatgtt acgggtttct tcatcgccgt ctccgtgatg 601 gccaacgtgg tggagacggt gccgtgtggc caccggccgg gcagagcggg aactctgccc 661 tgcggcgagc gctacaagat cgtcttcttc tgcctggata ccgcctgcgt gatgatcttt 721 acggcggagt acctacttcg actcttcgcc gcccccgatc gctgcaagtt cgtgcgctcg 781 gtgatgagca ttattgatgt ggtggccatt atgccgtact acattggcct cgggatcacc 841 gacaacgacg acgtgagcgg tgctttcgtc acgctgcgcg tgttccgtgt cttccgcata 901 ttcaagttct cgcgccactc gcaaggactt cggatcctcg gctacacgct caagtcctgc 961 gccagcgaac tgggcttcct tgtcttctcg ctggccatgg ccattatcat ctttgccacc 1021 gtcatgttct acgccgagaa gaacgtcaat ggcaccaact tcacatcgat tccggcggcc 1081 ttctggtata ccatcgtcac aatgacgacg ctgggatatg gcgacatggt gccagagaca 1141 atagctggca aaattgtggg cggcgtctgc tcgcttagcg gtgtgctggt catcgcctta 1201 cctgtacctg ttatcgtatc gaactttagt agaatctatc accagaacca gcgagcggac 1261 aagcgcaagg cgcagcggaa agctcgcctg gcgcgcatcc gcattgccaa ggcctcgtcc 1321 ggagccgcct ttgttagcaa gaagaaggcc gccgaggccc ggtgggctgc ccaggagtcg 1381 ggcatcgagc tggatgacaa ctatcgggac gaggacatct tcgagctgca gcaccatcat 1441 ttgctgcgat gtctggagaa gacaacgatg tag // LOCUS DROSHABA 2778 bp ss-mRNA INV 10-JUL-1990 DEFINITION D.melanogaster Shab11 protein mRNA, complete cds. ACCESSION M32659 KEYWORDS . SOURCE D.melanogaster, cDNA to mRNA. 
ORGANISM Drosophila melanogaster Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta; Pterygota; Neoptera; Holometabola; Diptera; Brachycera; Cyclorrhapha; Schizophora; Drosophiloidea; Drosophilidae. REFERENCE 1 (bases 1 to 2778) AUTHORS Butler,A., Wei,A. and Salkoff,L. TITLE Shal, Shab, and Shaw: Three genes encoding potassium channels in Drosophila JOURNAL Nucleic Acids Res. 18, 2173-2174 (1990) STANDARD full staff_review REFERENCE 2 (bases 1 to 2778) AUTHORS Wei,A., Covarrubias,M., Butler,A., Baker,K., Pak,M. and Salkoff,L. TITLE Diverse K+ currents expressed by a Drosophila extended gene family which is conserved in mouse JOURNAL Science 248, 599-603 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by L.Salkoff, 07-MAR-1990. FEATURES from to/span description pept 1 2775 Shab11 protein BASE COUNT 679 a 784 c 788 g 527 t ORIGIN 1 atggtcgggc aattgcaagg tggacaggct gctggccagc aacagcaaca gcaacaagcg 61 actcagcaac agcaacactc gaagcagcag ctgcaacagc agcagcagca acagcagcaa 121 ctgcaactca agcagcatca gcagcagcaa caggacatcc tgtatcagca acataacgag 181 gcaattgcaa ttgcacgcgg actgcaggct gcaacacctg ccgacatcgg cgataatcag 241 ccgtactacg atacaagcgg taatgtcgat tgggagcggg cgatgggagc cggtggagct 301 ggtgcatatg gtggcatcgg catcggatct ctaccagcag ctggcggtgc tgcttatcac 361 cttgggccag ctaatcccgc aggcctcgtt tctcgtcact tggattacgg tgatggcggc 421 caccttgctg gcccatccgc cggtcttcct gctggagctg tgggatcagg agcaggagcg 481 ggagccggtg cgggagcatc agtcacggga tcaggatcag gagcagggac aggaacagga 541 accggagccg gatctggatc gggcagtgga gcagcaggca aggaagttcg ctacgcccct 601 ttcccagtcg catcaccaac gcactcgatt cccacaacct cccagcagat cgttggcggc 661 gtcggtggcg tgggcgtcgg tggtgccagc agccagtcga tttcgggcgg tgtacccacc 721 cacagccaga gcaacaccac cggcgctctg cagcggacac attccagatc catgtcctcc 781 ataccgccgc ccgagccgtt catgatagcc cagtcgaagg cggtcaacag ccgcgtgtcc 841 atcaacgtgg gcggggtgag gcacgaggtc ctgtggagga cgctggagcg gctgccccac 901 acgcggctcg ggcggctggg ggagtgcacc 
acccacgagg ccatcgtgga gctgtgcgac 961 gactactcgc tggcggacaa cgagtacttc ttcgaccgac atccgaagag cttcagctcc 1021 atcctgaact tctatcgcac cggcaagctg cacatcgtcg acgagatgtg cgtgctcgcg 1081 tttggtgatg acctggagta ctggggcgtc gacgaactgt acctggagtc ctgctgccag 1141 cacaagtacc accagcgcaa ggagaacgtt cacgaggaga tgcgtaagga ggccgagtcc 1201 ctgcggcagc gcgacgagga ggaattcggc gaaggtaaat tctccgagta ccagaagtat 1261 ctgtgggagc tcctcgagaa gcctaacact agtttcgccg cccgggttat cgcagtgata 1321 tccatactat tcatagtcct gtctaccata gccctgacgt tgaacaccct accacaacta 1381 caacacattg acaacggtac accacaggat aatccgcaat tggcaatggt tgaggccgtg 1441 tgtatcacgt ggttcactct agagtacata cttaggttta gctcctcgcc ggacaagtgg 1501 aagttcttta agggcggcct taacataatc gatctattgg caatactccc atactttgtt 1561 tcgttatttc tattggaaac gaataagaat gcaacggacc agttccagga tgtgcgtcgg 1621 gtggtgcagg tctttcgcat catgcgcatc ctgcgggtcc ttaagctggc ccgtcactca 1681 acgggcctgc agtcgttagg ctttacgctg cgtaactcat ataaggaact cggtctacta 1741 atgctgttcc tggccatggg cgttctcata ttttcttcgc tggcatattt tgccgaaaag 1801 gatgaaaagg atacaaaatt cgtttcaata ccggaagcat tttggtgggc gggtattaca 1861 atgacaactg ttggctacgg ggacatctgt cccacaactg cactgggaaa ggttattggt 1921 actgtgtgtt gcatatgcgg tgttctggtg gtcgctttgc ctattcccat catcgttaac 1981 aattttgctg aattttataa gaatcagatg cgccgcgaaa aggccctcaa gcgtcgcgag 2041 gcactcgatc gtgccaagcg cgagggcagc attgtctcct tccatcatat caatctgaaa 2101 gatgccttcg ccaagtccat ggatctcatc gatgtgattg tcgacacagg aaagcaaaca 2161 aatgtcgtgc atccgaaggg taaaagacaa agcaccccca atataggcag gcagaccctc 2221 gatgtgcaaa gcgccccagg ccacaatctc tcgcaaacgg acggcaacag caccgaaggc 2281 gagtctacca gcggacgcaa tccggccacc accggaaccg gatgctataa gaattacgac 2341 cacgtagcca acctgcgcaa ctccaacctg cacaaccgac gcggatccag ctctgagcag 2401 gatgcagtgc cgccctacag cttcgacaat cccaatgccc gccagacctc aatgatggcc 2461 atggagagct atcggcgcga cgaacaggca ctgctgcagc aacagcaaca gcagcagcaa 2521 cagatgttgc agatgcaaca gattcagcag aaggccccga acggaaatgg aggtgcaacc 2581 ggaggaggag tggccaacaa cctggccatg gtggccgcat 
caagtgccgg aacagccgtg 2641 gccaccgcca ccaatgccag taatgccagc aataccgccc ccgggtcaga gggcgccgag 2701 ggaggcgtga tggagatggg ggcggtgtcg atgacgacaa cctttcccag gccaagggac 2761 tgcccatcca gatgatga // LOCUS DROSHAWA 1497 bp ss-mRNA INV 10-JUL-1990 DEFINITION D.melanogaster Shaw2 protein mRNA, complete cds. ACCESSION M32661 KEYWORDS . SOURCE D.melanogaster, cDNA to mRNA. ORGANISM Drosophila melanogaster Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta; Pterygota; Neoptera; Holometabola; Diptera; Brachycera; Cyclorrhapha; Schizophora; Drosophiloidea; Drosophilidae. REFERENCE 1 (bases 1 to 1497) AUTHORS Butler,A., Wei,A. and Salkoff,L. TITLE Shal, Shab, and Shaw: Three genes encoding potassium channels in Drosophila JOURNAL Nucleic Acids Res. 18, 2173-2174 (1990) STANDARD full staff_review REFERENCE 2 (bases 1 to 1497) AUTHORS Wei,A., Covarrubias,M., Butler,A., Baker,K., Pak,M. and Salkoff,L. TITLE Diverse K+ currents expressed by a Drosophila extended gene family which is conserved in mouse JOURNAL Science 248, 599-603 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by L.Salkoff, 07-MAR-1990. FEATURES from to/span description pept 1 1497 Shaw2 protein BASE COUNT 344 a 426 c 390 g 337 t ORIGIN Chromosome 2 left arm at locus 24B-C. 
1 atgaatctga tcaacatgga ctcggaaaac agggtggtgc tcaatgtggg tggcattagg 61 cacgaaacct acaaggccac gctgaagaag attccggcta cgcgattatc gcgattaaca 121 gaggcgctgg ccaactatga tccgatactg aatgagtact tctttgatcg gcatccgggc 181 gtcttcgcac aagtgctcaa ctattacaga actggaaagc tgcattatcc cacggatgtg 241 tgcggtccgc tgtttgagga ggaattggag ttctggggcc tagactcgaa ccaagtggag 301 ccctgctgtt ggatgaccta cacacagcat cgcgacaccc aggaaaccct agccgtactc 361 gatcgtctcg atctggatac ggaaaaaccg tccgaagagg aattggcacg caaattcggc 421 ttcgaggagg actactacaa aggcacaata tcctggtggc aggaaatgaa gccgcgcatt 481 tggtccttgt tcgatgagcc ctacagttcc aatgcagcca agactattgg cgtggtttcg 541 gtgttcttca tctgcatttc gatcctgtcg ttctgcctga agacccatcc cgatatgcgg 601 gtgcccatcg tccggaacat tacagtgaaa actgcgaatg gaagtaatgg ctggtttttg 661 gacaaaacgc agaccaatgc gcacatagcc ttcttctata tcgaatgcgt gtgcaatgcc 721 tggtttacct ttgaaatatt ggtgcgcttt atctcatcgc cgaacaagtg ggagttcatc 781 aagtcatctg ttaacatcat agactacata gcgacgctta gtttttatat cgatctagtg 841 cttcagcggt tcgcatcgca cctggagaac gctgacatcc tcgagttctt ctcgatcatc 901 cgcatcatgc gtctgttcaa gctgacgcgc cactcgtccg gactgaagat cctgatccag 961 acgttccggg cctcggccaa ggagctgacc ctgctggtgt tcttcctcgt cctgggcatc 1021 gtgatcttcg ccagccttgt ctactacgcg gagcgcatcc agcccaatcc gcacaacgac 1081 ttcaacagca taccgctggg cctgtggtgg gccctggtca caatgaccac cgtcggctac 1141 ggcgacatgg cccccaaaac ctacattggc atgttcgtgg gtgccctctg cgccctggcc 1201 ggcgtactaa ccatcgcact gccagtgccc gtcatcgtca gcaacttcgc catgtactac 1261 tcgcacacgc aggccagggc caaactgcca aagaagcgga gacgagtgct tcccgtcgag 1321 cagccgcgcc agcccagact gccaggtgcc cctggtggtg tcagtggttg cggcaccccg 1381 ggctcgggtc cccactccgg tccgatggga tccggcggaa ctggaccacg tcgcatgaac 1441 aataaaacaa aggacctggt cagccccaag tcagatatgg ccttcagttt cgactaa // LOCUS SUVSATA 332 bp ss-RNA VRL 10-JUL-1990 DEFINITION Subterranean clover mottle virus satellite RNA (virusoid) sequence. ACCESSION M33000 KEYWORDS . SOURCE Subterranean clover mottle virus (isolated from Trifolium subterraneum) satellite RNA. 
ORGANISM Subterranean clover mottle virus Viridae; ss-RNA nonenveloped viruses; Velvet tobacco mottle virus group. REFERENCE 1 (bases 1 to 332) AUTHORS Davies,C., Haseloff,J. and Symons,R.H. TITLE Structure, self-cleavage, and replication of two viroid-like satellite RNAs (virusoids) of subterranean clover mottle virus JOURNAL Virology 177, 216-224 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by R.H.Symons, 20-MAR-1990. FEATURES from to/span description site 62 63 self-cleavage site site 1 120 high sequence homology with virusoid of subterranean clover mottle virus site 232 332 high sequence homology with virusoid of subterranean clover mottle virus BASE COUNT 77 a 93 c 80 g 82 t ORIGIN 1 agaggcatac cctcctcgcg gattttgaag gtgttctagc tacccaagta ttccacgctg 61 tctgtacttg tatcagtaca ctgacgagtc cctaaaggac gaaacagcgc accgcaatct 121 acgtataccc cgattcgact tgcttggagc aagcgttcga cagagtgccg cgcctggaat 181 gacgcggttc tggccacact cacccgggag gccatcgggc ggattatact agttgtcaag 241 gacctgtcgt tagttctact atacattact acactacgtg ttacttgtta ggtggcccca 301 cctcactttc gtgaaggcta gagaacgtcc ac // LOCUS SUVSATB 388 bp ss-RNA VRL 10-JUL-1990 DEFINITION Subterranean clover mottle virus satellite RNA (virusoid) sequence. ACCESSION M33001 KEYWORDS . SOURCE Subterranean clover mottle virus (isolated from Trifolium subterraneum) satellite RNA. ORGANISM Subterranean clover mottle virus Viridae; ss-RNA nonenveloped viruses; Velvet tobacco mottle virus group. REFERENCE 1 (bases 1 to 388) AUTHORS Davies,C., Haseloff,J. and Symons,R.H. TITLE Structure, self-cleavage, and replication of two viroid-like satellite RNAs (virusoids) of subterranean clover mottle virus JOURNAL Virology 177, 216-224 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by R.H.Symons, 20-MAR-1990. 
FEATURES from to/span description site 63 64 self-cleavage site site 1 120 high sequence homology with virusoid of subterranean clover mottle virus site 286 388 high sequence homology with virusoid of subterranean clover mottle virus BASE COUNT 97 a 106 c 91 g 94 t ORIGIN 1 agaggcatac cctcctcgcg gattttgaag gtgtttcagc tacccaaagt attccacgct 61 gtctgtactt atatcagtac actgacgagt ccctaaagga cgaaacagcg caccgcaact 121 tggccagacc tcgccaatca cccccacacc aagccaaaaa ccggtcccca acgcagttta 181 gtatcaagtc gtcgcatcca cgctcccgag ggaggaagtt tgcgccttga ggttctgcac 241 ggtcgtggta acaggaaaag tgttggaatg tttgaaggtc ttgcggttgt caaggaccaa 301 gtcgttagtg ttactatata ttactaccct acgtgttact ttgttaggtg gccccacctc 361 actttcgtga aggctaggaa acgtccac // LOCUS BOVCYP4SC 1073 bp ss-mRNA MAM 10-JUL-1990 DEFINITION Bovine cytochrome P450-scc mRNA fragment. ACCESSION M25920 KEYWORDS cytochrome P450-scc. SOURCE Bovine adrenal gland, cDNA to mRNA, clone pBA644. ORGANISM Bos taurus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Artiodactyla; Ruminantia; Pecora; Bovidae. REFERENCE 1 (bases 1 to 1073) AUTHORS Chung,B.-C., Matteson,K.J., Morin,J.E., Mellon,S.H. and Miller,W.L. TITLE An approach to the molecular biology of congenital adrenal hyperplasia JOURNAL Ann. N.Y. Acad. Sci. 458, 238-251 (1985) STANDARD simple staff_entry COMMENT The coding region for cytochrome P450-scc was not indicated in [1]. 
BASE COUNT 259 a 243 c 193 g 378 t ORIGIN 1 taagtctgaa ttttgcaata aggaactcat gatttgaatt acagtcagct cccattcctg 61 tttttgctga ctatatagag ccttctccat ttttggctgc aaaacatata atcagtctga 121 tttggtattt atcattttgt gacataatgt gtaagagtgc ctcgtctgtt tggaaaaggt 181 agtttctatg accagtgtgt ctcttggcaa actctgttaa cctttgtctc accacttcat 241 tttgtattcc aaggcctttg tttctctgtt tctccaggta tctcttgact tcctactttt 301 accttccaat cctctaggat gaaaaggaca tctttttttt tttttttggt gtagttctag 361 aaggtcttca tagaaagggt caacttcaac ttcttaggca tcagtggtta gggcatatac 421 ttggattact gtaatgttaa atggtttgct ttggaaacta accaagatca ttctgttgct 481 tttgagattg cacccaaata ctgcattttg gactcttctg tttactatga ggactactcc 541 atttaatcta aaggattctt aggccacaat agtagatata atggtcatct gaattattat 601 aaatttatca attttcttcc attttagttc actgaattct aacttattga tgcttcattc 661 ttgccatctc ctgcttgacc atgtttttta ccttgattca tggacctgac attccaggtt 721 cctatgcaat attattctgt atagtgtcag acttactttc accaccagac atatccacaa 781 ctgtatatca tttccgtttt ggcccagctg cttcactttt tctggaacta ttcatatctg 841 ccctccactc tttcccaata gcatattgga cacattctcg aacacaggga gccgggggac 901 aggtgctggt ttcttctggc acacctgggg cagctgaaca cagtgttgac tggcagacac 961 agccccacac caaacgctcg ctaacactga cactgttccc gtgatggcca gggagccccc 1021 tccccaaaaa cctgctcctg gaagctggca ggatttgtgc cattcataag ggt // LOCUS BOVCYPC21 920 bp ss-mRNA MAM 10-JUL-1990 DEFINITION Bovine cytochrome P450-c21 mRNA fragment. ACCESSION M25921 KEYWORDS cytochrome P450-c21. SOURCE Bovine adrenal gland, cDNA to mRNA, clone pBA4.8. ORGANISM Bos taurus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Artiodactyla; Ruminantia; Pecora; Bovidae. REFERENCE 1 (bases 1 to 920) AUTHORS Chung,B.-C., Matteson,K.J., Morin,J.E., Mellon,S.H. and Miller,W.L. TITLE An approach to the molecular biology of congenital adrenal hyperplasia JOURNAL Ann. N.Y. Acad. Sci. 458, 238-251 (1985) STANDARD simple staff_entry COMMENT The coding region for cytochrome P450-c21 was not indicated in [1]. 
BASE COUNT 185 a 278 c 203 g 254 t ORIGIN 1 gttcagatgc tgtgtcccat tgggaaagtt cagcaggtta ccagggccac ggcctcagtc 61 atcctcagaa tcgctgtccc tcttggcagg gacagagcac cgcaccgcag acagcagcac 121 gtcttccacg ggcttcttgg gattctcctc caggctcgtc ttgatggctc cagactcaga 181 gcaacttcca ctccaactcg tccaaagtca ggttcatgcc accaaacacc agaggtccgg 241 ataactgagc cttgatgtca ccttcaaggt acacaaatac cgtggcagat tcctatcagg 301 gtaactgggt atgcaggtgg ttgaaatggc tttgataaac ttgacatcag gaaacttcct 361 ggcgaggtgc actcaagtgc tgatttatca gggcacagag gggaatccct tgtttgtaaa 421 ggtgcaggat gacccataag ccctcaccag ctttggtaac ttcttgaaca taatcctttc 481 cagagatttc caaaacctct ccaaatttgt tcttcagttg ggtcgctttc cattcggcca 541 gcctttgctg cctgtacatt tcaattgcac gttcgtcttc ctcattaaat tcgtcttcat 601 tatcctccag ttcttccaaa gtcatgtctt catatgtttt cacaatggac tgctggagga 661 tccgctgctc ctcttcttct gcctccttct ccagatcttt caaatcttcc tttgaaggca 721 agatgccttt tttgcgtaag atgtcattcc actcggtgtc tgcgttgggg tcctgcattt 781 tctgtcaaat cgctagggcc ctgccggcca cagccacccg gcccgtgagc tctctaccgc 841 gcacgcaggc gccactcgcc tcctctccca gcctgccctg agatctcgtc cgcccgttgg 901 ccctccttct cttggcgccg // LOCUS MUSINT4 3000 bp ss-mRNA ROD 10-JUL-1990 DEFINITION Mouse proto-oncogene Wnt-4 protein mRNA, complete cds. ACCESSION M32502 KEYWORDS Wnt protein; proto-oncogene. SOURCE Mouse (strain BALB/c) 8.5 day old embryo, cDNA to mRNA, (library of B.Hogan). REFERENCE 1 (bases 1 to 3000) AUTHORS Roelink,H., Wagenaar,E., Lopes da Silva,S. and Nusse,R. TITLE Wnt-3, a gene activated by proviral insertion in mouse mammary tumors, is homologous to int-1/Wnt-1 and is normally expressed in mouse embryos and adult brain JOURNAL Proc. Natl. Acad. Sci. U.S.A. 87, 4519-4523 (1990) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by H.Roelink, 03-MAR-1990. FEATURES from to/span description pept 46 1113 Wnt-4 protein BASE COUNT 703 a 789 c 787 g 721 t ORIGIN Chromosome 11. 
1 cctcttcatg atcgccggca aacttcctcc tcggcgctgc ttctaatgga gccccacctg 61 ctcgggctgc tactcggcct cctgctcagt ggcaccaggg tcctcgctgg ctacccaatt 121 tggtggtccc tggccctggg ccagcagtac acatctctgg cctcccagcc tctgctctgc 181 ggctccatcc caggcctggt ccccaagcaa ctgcgcttct gccgcaatta catcgagatc 241 atgcccagcg tagcagaagg tgtgaagctg ggcatccagg agtgccagca tcagttccgg 301 ggccgccggt ggaactgtac caccatagat gacagcctgg ccatctttgg gcctgtcttg 361 gacaaagcca cccgtgaatc ggccttcgtg catgccatcg cctcggctgg tgtcgccttc 421 gcagtcacac gctcctgcgc tgagggaacc tccaccatct gcggctgtga ctcacatcat 481 aaggggccac ctggagaagg ctggaagtgg ggcggctgca gcgaggacgc cgacttcggg 541 gtgctggtgt cccgggaatt tgcggatgcg cgggagaaca ggccagatgc ccgctcagct 601 atgaacaagc acaacaatga agcaggccga acgaccatcc tggaccacat gcacctaaag 661 tgtaaatgcc acgggttgtc cggcagctgc gaggtgaaga cctgctggtg ggcccagccc 721 gacttccgtg ccattggcga cttcctcaag gacaagtacg acagtgcctc cgagatggtg 781 gtggagaaac accgtgagtc ccgaggctgg gtggagaccc tgcgggctaa gtacgcgctc 841 ttcaagccac ccaccgagag ggacctggtc tactacgaga actcccccaa cttttgtgag 901 cccaacccag agacgggctc ctttggtacc agggaccgga cttgcaatgt cacctcccac 961 ggcatcgatg gctgcgatct gctgtgctgt ggccggggcc acaacacgag gacggagaaa 1021 cggaaggaga aatgccattg cgtcttccac tggtgctgct atgtcagctg ccaagagtgt 1081 attcgcatct acgatgtgca cacctgcaag tagtgagcca gggcactggg aaggggtaga 1141 ttgtgcggct ggatccattc atcgaagtcc catgagaagc aggatctaga tccaggccag 1201 ccttcggcac tggccagcaa ggagcatgga ctgttgccag ctgcatgtga taaacgacct 1261 ggacccagcc ggcctcggac ggacgggcgg cttctttctc aactaacgtc tctccccctg 1321 ctctggatgg tgtacggctt tacagagggg ctttctttat ggttttacca gggtctgctg 1381 gggacagact cgaggcttac ctttgcacat gttaaagaaa ataaaaatga aaaaaaaaaa 1441 tctaccgcaa cagaacaggc tgggctagtg tgagctcttg gcctggtggg aaggacaaga 1501 ccatggcgag attctgtgtc caagctgcct ctactcgtga cattccaaga tgcctctgag 1561 gtgggaactg tgaagtagga cagagccccg cagtcccctc ttgtccgtcg actcccattt 1621 aaattggaca taccttgtcg ttctgagaaa agccatagat aggtgtagct gggatgtagt 1681 gatggggagg cccctggcca 
acagtgggag caagatcttg agttttgaag acctcagagt 1741 tctgggcggc ctgggaagcc atctgcagaa cagagttcct tgtgggctcc tgttttcgct 1801 agccctgttc tgccctggag cgacagtcag atctccacgc ccctttctgt tgttctacag 1861 tgtccacctt tactacgcgt tttttttttt tttttcatga tgaccttgta aataggtcag 1921 atgtggaggc aggtctcttc tggctccatc caccacaccc agaaagaatg ggctgctctg 1981 cccttctcag ccttgctaac cagcagacac cgaggagagc agcggggcac cttagagagc 2041 aatctaaaca tggttggcag gtggggaggg taaagagtcc cacttccttt gtgttagaag 2101 gcagactacc ctgcgtcctt ttctcccatt ggctgaagta accagaaaga caagagatcc 2161 ttaacaagcc cttcttccca cttgtaaaag ggatagccta tctcagttcc caaggatctg 2221 gattagatag atattcaaaa gaggcaagca gcgaatggag gcagctccca gctctgttcc 2281 cgacgcatga tggtactggc tgggtttagt aaggtgggtg gggctgcacg gatcaatcca 2341 tcaactccgt cttaaggaga atcagaaaga ggagataaaa tgggggaatg gggcagaaca 2401 aagaatttgt cctttcccgc ttctgtctag ggtctgctaa tgctggcttg acgaggggtc 2461 agccacttct ttcctgttgt gcagttggct tgccaagcag gctccagtag gcccttgcct 2521 gcactctcta ccatgtgacc atgagcactg ctctagggac acctcccatc ccttcctagc 2581 accccaaatg ccccttccca tctctccttc cagaagttgg aaatcaagtc aactggataa 2641 cgcttgtgtg agacacttga gcagaacgga tacaacaatt tacaagtctc ttcatatcta 2701 tgtattctat attaaaagtg ataaagtcat gtttccgggg cgtattcaag tagctgacaa 2761 gtaattattt aataatagta catgagcgca ttgtaattat cctcgccata gtcaggtaat 2821 agcatccaat gggaggtccc taccaacctg ctgtatccaa agttttgtaa aaagttgtag 2881 aagttgttga tctttttgat tttatattca aaaagtctct ttttataaat attatttatt 2941 atacaatgta tatacctttg agttaactaa gattatatat tatataaata tatatatatt // LOCUS DRONCDA 2294 bp ss-mRNA INV 10-JUL-1990 DEFINITION D.melanogaster non-claret disjunctional protein (ncd) mRNA, complete cds. ACCESSION M33932 KEYWORDS non-claret disjunctional protein. SOURCE D.melanogaster (strain dp cl cn bw) 0-4 hr embryo, cDNA to mRNA, clone pNB40. 
ORGANISM Drosophila melanogaster Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta; Pterygota; Neoptera; Holometabola; Diptera; Brachycera; Cyclorrhapha; Schizophora; Drosophiloidea; Drosophilidae. REFERENCE 1 (bases 1 to 2294) AUTHORS Mcdonald,H.B. and Goldstein,L.S.B. TITLE Identification and characterization of a gene encoding a kinesin-like protein in Drosophila JOURNAL Cell 61, 991-1000 (1990) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by H.B.Mcdonald, 30-APR-1990. FEATURES from to/span description pept 111 2168 non-claret disjunctional protein site 180 319 alpha helical domain BASE COUNT 622 a 661 c 603 g 408 t ORIGIN 1 bp upstream of EcoRI site; chromosome 3 map position 99BC. 1 gaattgataa aatcggttgc aaggaggcag acgtatcttc taagttaggc acaacacagt 61 tggcgatgga atcccggcta ccgaaaccgt cgggcctgaa gaaaccccaa atgccgatta 121 aaaccgtgct gcccacagat cgaattcgcg caggattggg aggtggagcc gctggagcag 181 gcgccttcaa tgtcaatgcc aaccagacat actgcggcaa cttattgccg cccctctcaa 241 gggacctcaa caatctgccc caggtgctgg agcgtcgcgg aggaggagca cgtgccgcct 301 ccccagagcc catgaagttg ggccaccggg ccaagctgag acgtagccgt agcgcttgcg 361 acatcaacga actgcgtggt aacaagcgca ctgcggctgc tccttcattg cccagcattc 421 ccagcaaagt atcccgcctg ggcggtgcac tcactgtttc cagccagcga ctagtgcgtc 481 ctgcggcgcc ttcgtcaata acagcaacag ctgtcaaaag accaccagta acgcgtcctg 541 ctccacgggc tgcaggagga gcagccgcca agaaaccagc aggaacagga gcagcagctt 601 cgtcaggagc cgcggctgct gctcccaagc gcatcgctcc ctacgacttc aaggcccgct 661 tccacgatct gctagagaag cacaaggtgc ttaagacaaa gtacgaaaag caaacagagg 721 acatgggcga gctggagtcc atgcctcagc aactggagga gacgcagaac aagcttatcg 781 agacggagtc ctcgctgaag aacacccaga gcgacaacga gtgtcttcag aggcaggtga 841 agcagcatac cgccaaaatt gaaacaatca catcgacgct gggcaggacc aaagaggagc 901 tatccgagct gcaagcaata catgagaaag taaaaacgga gcatgctgct ctaagcacag 961 aagtggtgca tctgcgccag cgcaccgagg aactcctgcg ctgcaatgag cagcaggccg 1021 ccgagctgga gacctgcaaa gagcagctct tccagtcgaa 
catggagcgc aaagagctgc 1081 acaacacggt catggacctg cgcggcaaca tccgggtctt ctgtcgaata cgaccgccgc 1141 tggagtccga ggagaaccgt atgtgttgca cctggaccta tcacgacgag tccaccgtgg 1201 agctgcagag cattgacgca caggccaaaa gcaagatggg ccagcagatc ttctcattcg 1261 accaggtctt ccacccgctc tcctcgcagt cggacatctt cgagatggtc tcgccgctca 1321 tccagtcggc cctggatggc tacaatatct gcatctttgc ctacggacag acgggcagtg 1381 gcaagaccta cacaatggac ggagtgccgg agagtgtggg cgtcataccg cgcacggtgg 1441 atctgctctt cgactccatc cggggatatc gcaacttggg ctgggagtac gagatcaagg 1501 ccacctttct ggagatctac aacgaggtgc tctacgatct gctgagcaac gagcagaagg 1561 acatggagat tcgaatggcc aagaacaaca agaacgacat ctacgtgtcc aacataacgg 1621 aggagacggt tctggatcca aatcacctgc gccacctcat gcacacggcc aagatgaacc 1681 gtgccaccgc ctcgacagct ggcaacgagc gctcctctcg ttcccacgcg gttaccaagc 1741 ttgagctcat cggacgccat gccgaaaagc aagagatctc cgtgggttcc ataaacctgg 1801 tggatttggc cggctctgag tctcccaaga cgagcacccg gatgaccgag acaaagaaca 1861 tcaatcgctc gctatcggag ctcaccaacg taatcctggc gctgctgcag aagcaggacc 1921 acatcccgta caggaactcc aagctgacgc acctgctgat gccctcgctg ggcggcaact 1981 cgaaaacgct tatgttcatc aacgtctcgc cgttccaaga ctgtttccaa gagtccgtca 2041 agtcgctgcg cttcgcggcc tccgtaaact cctgcaaaat gaccaaggcc aagcggaatc 2101 gctacctgaa caactcggtg gccaacagca gcacacagag caacaacagc ggcagtttcg 2161 ataaataaag aatgcattct gagcccagtt ttaacaattt tcaaatttct aacctgttat 2221 tgcttaattt atgtgtgttt acttttagtg caaataaact aataaagtgc tggaaaaaaa 2281 aaaaaaaaaa aaaa // LOCUS YSCVPS1A 2457 bp ds-DNA PLN 10-JUL-1990 DEFINITION S.cerevisiae GTP-binding protein (VPS1) gene, complete cds. ACCESSION M33315 KEYWORDS GTP-binding protein. SOURCE S.cerevisiae DNA. ORGANISM Saccharomyces cerevisiae Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes; Endomycetales; Saccharomycetaceae. REFERENCE 1 (bases 1 to 2457) AUTHORS Rothman,J.H., Raymond,C.K., Gilbert,T., O'Hara,P.J. and Stevens,T.H. 
TITLE A putative GTP binding protein homologous to interferon-inducible Mx proteins performs an essential function in yeast protein sorting JOURNAL Cell 61, 1063-1074 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by P.J.O'Hara, 02-APR-1990. FEATURES from to/span description pept 318 2432 GTP-binding protein (VPS1) BASE COUNT 812 a 454 c 496 g 695 t ORIGIN 1 bp upstream of EcoRI site. 1 gaattcgata gatacttgaa tcctctaata gtcgaaaaat gctcgagggt aaaccacttg 61 tgcgcttgga ctggcctagt ttccaaaacc aatgttctaa tggattgatt tcttccccaa 121 acattattaa gtggccgggt cacccaaaga cttgggcgcc gttgattcgc gtcgctttgc 181 catcaagaga acaacatatc ttccaagaca gaccgagata attcatctat ttactcctaa 241 aaaagaatta gagaggcctt ttatagcacc aaaataagga ccgtacgaaa actgcacatt 301 ttatattatc agatatcatg gatgagcatt taatttctac tattaacaag cttcaggacg 361 ctttggcgcc cttaggagga ggatctcaat ctcctattga tttaccacag atcaatgttg 421 tcggttccca gtcgtcagga aagtcgtccg ttttggagaa cattgttggt agggatttct 481 tgccaagagg tactggtatt gtcaccagga gacctttagt gttacaattg attaatagga 541 gaccaaaaaa gtcagaacat gctaaagtaa accaaactgc taatgaattg attgacttga 601 acatcaacga tgatgacaag aaaaaggatg aatcaggaaa gcaccaggaa gagggacaat 661 ctgaagacaa taaagaggaa tggggtgaat ttttgcattt acctggtaag aagttttata 721 attttgacga aattagaaag gaaatcgtca aagaaactga caaagtgaca ggtgccaatt 781 caggtatttc ttctgtgccc attaacttga gaatttattc tccgcatgtt cttactttga 841 cgttagtgga tttgcctggg ttgacgaagg ttcccgtagg tgaccaacct cctgatattg 901 aaagacaaat taaggacatg ttgttaaagt atatttcgaa accaaacgct atcatattat 961 ctgttaatgc cgctaacacc gatttagcca acagcgatgg tttgaagctg gctagagagg 1021 tcgatccaga aggaacgaga actattggtg tcttgacaaa agtcgatttg atggatcaag 1081 gtacagatgt catagatatt ttggctggaa gagtcattcc tttgagatat ggttatatcc 1141 cagttatcaa tagaggtcaa aaggatattg aacacaaaaa aacaatcaga gaagcccttg 1201 aaaacgaaag aaaatttttt gagaaccatc cctcttacag ttctaaagct cattactgtg 1261 gtacaccata tttggctaaa aagttaaact caatcttatt acaccacatt aggcaaactc 1321 
tgccagaaat caaagcgaaa atcgaagcca cattgaaaaa atatcaaaac gaacttataa 1381 acttgggccc agaaactatg gattcagcta gttcggttgt tttgagcatg attactgatt 1441 tttccaatga atatgccggt atcttggacg gtgaggcgaa ggagctttcc agtcaggaac 1501 tttctggtgg tgctagaatt tcttacgtat tccatgaaac tttcaaaaat ggtgtagact 1561 ctttggatcc attcgaccag atcaaagatt ctgatatcag aaccattatg tacaatagtt 1621 caggttctgc cccatctttg tttgtcggta ccgaagcttt tgaagtttta gttaaacagc 1681 aaattagaag atttgaagaa ccatctctac gtttagttac tctggtgttt gatgaacttg 1741 ttcgtatgct aaaacagatt atttcacaac caaagtactc aaggtatcct gctctaagag 1801 aagcgatttc taatcagttc attcagttct taaaggatgc tactattcct acgaatgagt 1861 ttgttgtcga tatcatcaaa gctgaacaaa cttacatcaa tacagcccat cccgaccttt 1921 tgaagggttc tcaagcaatg gttatggtgg aagaaaaatt acatcctcgc caagtcgctg 1981 ttgacccaaa gacgggtaaa ccattaccaa cccaaccatc gtctagtaag gcgccagtta 2041 tggaagagaa atcaggattt tttggtgggt tcttctccac taaaaacaag aagaaattgg 2101 cagctttgga atccccacct cctgttttaa aagctactgg ccaaatgaca gagagggaaa 2161 caatggaaac agaagtaatc aagttgttga ttagtagtta tttctctatt gtcaaaagaa 2221 ccattgccga tattatacca aaggctttga tgcttaaatt gattgtgaaa agtaaaactg 2281 atattcagaa agttttactc gaaaaacttt acggaaagca agatattgaa gaattaacga 2341 aagaaaacga cataaccatt caaagaagaa aagaatgtaa gaagatggtc gagatattga 2401 gaaacgctag tcaaatcgtc tcctctgttt aggttttcct catctatacc ggtcgac // LOCUS R75RELAX 99 bp ds-DNA BCT 10-JUL-1990 DEFINITION Plasmid R751 relaxation region. ACCESSION M33118 KEYWORDS . SOURCE Plasmid R751 DNA. ORGANISM Plasmid R751 Prokaryota; Bacteria. REFERENCE 1 (bases 1 to 99) AUTHORS Pansegrau,W., Ziegelin,G. and Lanka,E. TITLE The origin of conjugative IncP plasmid transfer: Interaction with plasmid-encoded products and the nucleotide sequence at the relaxation site JOURNAL Biochim. Biophys. 
Acta 951, 365-374 (1988) STANDARD simple staff_entry BASE COUNT 32 a 26 c 22 g 19 t ORIGIN 1 gaataaggga cagtgaagat agataaccgg ctcgccggtt agctaacttc acacatcctg 61 cccgccttac ggcgttaata acaccaagga aagtctaca // LOCUS RP4RELAX 99 bp ds-DNA BCT 10-JUL-1990 DEFINITION Plasmid RP4 relaxation region. ACCESSION M33117 KEYWORDS . SOURCE Plasmid RP4 DNA. ORGANISM Plasmid RP4 Prokaryota; Bacteria. REFERENCE 1 (bases 1 to 99) AUTHORS Pansegrau,W., Ziegelin,G. and Lanka,E. TITLE The origin of conjugative IncP plasmid transfer: Interaction with plasmid-encoded products and the nucleotide sequence at the relaxation site JOURNAL Biochim. Biophys. Acta 951, 365-374 (1988) STANDARD simple staff_entry BASE COUNT 27 a 28 c 28 g 16 t ORIGIN 1 gaataaggga cagtgaagaa ggaacacccg ctcgcgggtg ggcctacttc acctatcctg 61 cccggctgac gccgttggat acaccaagga aagtctaca // LOCUS ACCTRPF 1466 bp ds-DNA BCT 10-JUL-1990 DEFINITION A.calcoaceticus 5'-phosphoribosyl anthranilate isomerase (trpF) and tryptophan synthase (trpB) genes, complete cds and 5' end. ACCESSION M34485 KEYWORDS 5'-phosphoribosyl anthranilate isomerase; tryptophan synthase. SOURCE A.calcoaceticus DNA. ORGANISM Acinetobacter calcoaceticus Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Neisseriaceae. REFERENCE 1 (bases 1 to 1466) AUTHORS Ross,C.M., Kaplan,J.B., Winkler,M.E. and Nichols,B.P. TITLE An evolutionary comparison of Acinetobacter calcoaceticus trpF with trpF genes of several organisms JOURNAL Mol. Biol. Evol. 
7, 74-81 (1990) STANDARD simple staff_review FEATURES from to/span description pept 506 1147 5'-phosphoribosyl anthranilate isomerase pept 1149 > 1466 tryptophan synthase (trpB) (gtg start codon) BASE COUNT 430 a 297 c 334 g 405 t ORIGIN 1 gatcaagttt agttgcatct gttgaatcat cagcaaaaac agttgttgaa gaaaacccca 61 ttgcaattgc aatcgccccc actaaacggg taggctgaaa agaaatagac atgtattgtg 121 ctccatacat tcaccccacg tgaatgattg agtggataga tgtaacaagc aggtctccgg 181 actcaaatgg catctcaaaa agagacaagc atattcacct tcccacatct atgcatgcag 241 tggcgtaagt ctaaatgact tttttaatat ggtttacatt tttaccgttg cgggggcagc 301 actggatttg caccagtttc cctaaagcga atgcttttaa cttgttacga attgtgtaaa 361 gtataaagtc tgagcgaaga ttaaacaatc tgaatacgat caaattcgtt caactttgac 421 gcaaagcaca aaaattgcat tacaatactt agcccaatga tggatagatc ggctgtctgt 481 caggcaatac aatgagcttc tttctatgcg aacgcgcgca aaaatttgcg gtattacccg 541 ttcccaagat gtccaagcag cagtaagtgc aggtgcagat gccattggac tggttttttt 601 cccaccaagt cctcgacatg tttctatagc gcaagcgcaa gcattgctcc agcatattcc 661 cgcttatgtt caggtggttg gtttatttgt gaatgcaact gcggatcaaa tcaaatcagt 721 gcttgattgt gtggctttgg atgtattaca actacatggc gatgaaacgc ctgagcaatg 781 tcaagagatt gctctgcagt gcaagcgtcg ctggtataaa gccattcaag ttaaaccaga 841 gcttgatgta gttgatgaag ttcagcgtta tcaggccgct ggtgcaagtg cggtattgct 901 ggatgcgtgg catccagagc tcaaaggtgg aactggtcat caatttgatt ggtcgaagtt 961 tcccaagctg gatattccac ttattcttgc aggcggttta acgcctgaaa atgttgtaga 1021 tgccattcaa accacacacg cttttgcagt ggatgtgagc ggaggggtag aggccgcaaa 1081 aggtattaaa gataaacaac tcatcgaacg atttatgcaa ggagtccaat gtggatcagc 1141 aaaataacgt gattgactat acgcaatatc cagatgctcg tgggcatttt ggtattcatg 1201 gcggacgttt tgtatcagaa acacttatgg cggcacttga agatttagaa aatctttaca 1261 accgcatgaa aaatgacgaa cagtttctgg cagaatttga ccgcgatctt gcctattatg 1321 taggtcgtcc tagtccactt tattatgctg aacgatggtc aaagaagctc ggtggtgcgc 1381 aaatttactt aaaacgtgaa gacctgaatc atacaggttc acacaaagtt aataacacca 1441 ttggtcaggc attattggcc aagctt // LOCUS BCIGLCA 2316 bp ds-DNA BCT 
10-JUL-1990 DEFINITION B.circulans beta-1,3-glucanase A1 (glcA) gene, complete cds. ACCESSION M34503 KEYWORDS beta-1,3-glucanase. SOURCE B.circulans (strain WL-12) DNA, clone pNT003. ORGANISM Bacillus circulans Prokaryota; Bacteria; Firmicutes; Endospore-forming rods and cocci; Bacillaceae. REFERENCE 1 (bases 1 to 2316) AUTHORS Yahata,N., Watanabe,T., Nakamura,Y., Yamamoto,Y., Kamimiya,S. and Tanaka,H. TITLE Structure of the gene encoding beta-1,3-glucanase A-1 of Bacillus circulans WL-12 JOURNAL Gene 86, 113-117 (1990) STANDARD simple staff_review FEATURES from to/span description pept 241 2289 beta-1,3-glucanase A1 (glcA) BASE COUNT 705 a 489 c 538 g 584 t ORIGIN 1 ggaaattcaa cccacagagt atcgacaaat gatgcgccaa aacgtagaac gtgaagtaca 61 ataccacagt acaaatatat aaattgaatc aaaacccaaa aaattgggat ataacaaaaa 121 taattgtacc ttttcagcag attatcctat tcgatagaat aaagatattc ccccatgtaa 181 gcgatttcct ttatacgcat agattgggag aaactattat cctatcaaag gagggcaatt 241 atgaaaccat ctcactttac ggagaaacgg tttatgaaaa aggtacttgg tttgttctta 301 gtggttgtga tgctggctag tgttggcgtg ttgccaactt caaaagttca agcagctggg 361 accacagtta cctcaatgga gtacttctca ccagcagatg gacctgttat ttcaaaatct 421 ggcgttggca aagccagcta cggatttgtt atgcctaagt tcaatggagg ctccgctacg 481 tggaacgatg tttacagtga cgtgggtgtc aatgtgaaag tgggtaacaa ctgggttgat 541 attgatcaag ccggaggtta tatctataac caaaactggg ggcactggag cgatggcggt 601 ttcaatggct attggttcac cctttccgca acaaccgaaa ttcaactgta ctccaaagcg 661 aatggtgtta agcttgaata tcaacttgta ttccaaaaca ttaacaaaac aaccatcaca 721 gcgatgaatc cgacacaagg gccgcaaatt acagcaagtt tcacaggcgg tgcaggcttt 781 acatatccaa cgttcaacaa tgattctgcg gtaacctatg aagccgtagc ggatgatttg 841 aaggtgtatg taaaacctgt aaacagcagc tcatggattg atattgacaa taatgcagcc 901 agcggctgga tttatgatca caacttcggc caattcaccg acggtggagg aggttactgg 961 tttaacgtaa cggaatcgat caacgtcaaa ttggaatcaa agacttcttc ggctaacctt 1021 gtttatacaa ttacgtttaa tgaacctaca agaaattcat atgtcattac gccatacgaa 1081 ggaacaacct tcacagcaga tgcgaatggt tccattggaa tcccgcttcc 
caaaattgat 1141 gggggtgcgc caatcgccaa agaactgggc aatttcgtat atcagattaa catcaatggg 1201 caatgggtgg atttgagtaa ctccagtcag agcaagtttg catactcggc taatggctac 1261 aacaatatgt ctgatgccaa ccagtggggg tactgggccg attatatcta tggcctttgg 1321 ttccagccaa tccaggaaaa tatgcaaatc cgtatcggat atccgctgaa cggacaggcg 1381 ggtggaaata ttggcaacaa cttcgtcaac tataccttca tcggtaatcc aaatgctccg 1441 cgtccggatg tatccgatca agaggatatc tcgatcggaa caccaactga cccggctatt 1501 gcgggcatga atcttatctg gcaggatgaa tttaacggaa ctacactgga tacaagtaaa 1561 tggaactatg aaacaggtta ttatctcaat aacgatcccg ctacttgggg atggggaaat 1621 gcagagttgc agcactacac aaacagcaca caaaatgtat atgtacagga cgggaagctg 1681 aatatcaaag ccatgaacga tagcaaatct ttcccgcagg atccgaatcg gtatgcacag 1741 tattcttcag gtaagattaa caccaaggat aaactctcct tgaagtacgg cagagtagat 1801 tttcgtgcca agcttcctac aggggatggc gtttggccag cgctgtggat gcttccaaaa 1861 gattctgtat atggcacatg ggctgcatcg ggtgaaatcg atgttatgga agcaagagga 1921 cgtcttccag ggtctgtaag cggtaccata cactttggcg gacaatggcc cgtgaaccag 1981 tcttcgggtg gcgattatca cttcccagaa gggcaaactt ttgccaatga ttatcatgta 2041 tactcggtag tctgggaaga ggacaatatt aaatggtatg tcgacggcaa gtttttctat 2101 aaagtcacta accagcagtg gtattccaca gctgcaccga ataatccgaa tgctcctttc 2161 gatgagccgt tctacctcat tatgaacttg gcagtcggcg gaaacttcga cggaggccgt 2221 actccgaacg cgtccgatat cccggcaact atgcaagtgg attatgtacg tgtgtataaa 2281 gaacagtaat aaaacagccg tttccgcgat tggggt // LOCUS CHKAGLOB 1737 bp ds-DNA VRT 10-JUL-1990 DEFINITION Chicken alpha-globin gene, alpha-5HR DNA fragment. ACCESSION M34465 KEYWORDS alpha-globin. SOURCE Chicken fibroblast DNA. ORGANISM Gallus gallus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Aves; Neornithes; Neognathae; Galliformes; Phasianidae. REFERENCE 1 (bases 1 to 1737) AUTHORS Kalandadze,A.G., Bushara,S.A., Vassetzky,Y.S.Jr. and Razin,S.V. TITLE Characterization of DNA pattern in the site of permanent attachment to the nuclear matrix located in the vicinity of replication origin JOURNAL Biochem. Biophys. Res. 
Commun. 168, 9-15 (1990) STANDARD simple staff_review BASE COUNT 356 a 589 c 447 g 345 t ORIGIN 1 gcggcacggg gcggccccgg gcccggcgcg cacttactgg ccttggcggc ggggtgctcg 61 gcgccgcgct ggaaggggaa gcggaagagc agcttgttgc cgcggctgcc cgagctcaca 121 aggataacgc tgatggggct ggtgctctcg cccatgccgc cgcgccacag cgagcaccgg 181 gcgggcaacg acggacgcgg ctccgcggaa ggcggcccgg cccgcgcgac ttccgcttcc 241 gcgcctccgc cgccgccgcc ggttcccccg ggccgcggcc gagcggcggg gcggagctgc 301 gggcacagcg ctccccgggc aggtcgcgct cagaggccgg gccgccgctt cagcgccgtg 361 ccctcagtgc ggcccagcgc cgtgcccgca gcgctgccca cacgccctcg gggtgcccca 421 cggctgctgc ttgctcccgg tgcccgccgt tcctcccagc acctcgcagt gcagccgtgc 481 ctgaagtgca gcccagcacc tcacacctca gccccgggct cccagtacga ccagcaggtc 541 acgttggagt ctcttgtcct caagactgcg cagtgtctca cctttgagcc ttgtgccccc 601 cattcagccc agcacatcac actgtagccc ttacaccctc accacagcac agcacctcac 661 gttcaggccc cagcacgtca agatggagcc ctgtgccccc agacagccag catggaacca 721 tcaaatcctt agagttggaa gatgtctgaa tccttgtgcc cccagttcag cccggcacct 781 ctcacacccc actcaacact cttcagccaa gagcctacag ctcaacccag cacctcacgc 841 cacccagcag cactcccgcc atcagcccag tgcccccagt ccggatcggt acctctcatg 901 cccatgcaca gtgcaccaga tcagcctagc accactagtt cattccagca cctcacgtgc 961 ccacagccaa ccactccagc acccccggtg ccctagtcac acctctccgc tgcctcaagg 1021 ttcattccca cctcttccca catcccctca caccccctca ttattttcat gtctcgcaat 1081 ctcctttggt cacttggagt cattcagtta tgacaactcc agaactagaa gctgctggcc 1141 agcagcaagt gccacaaact gtgttccccc ggcagctctt ctggctcatt tgtcttattg 1201 tgtgtccagc tgagatcaga aagctatcgg caattatgtc agaggatggc ccagtttttc 1261 acatagattt gtctgtattt gatagcaata tttagtattt ggtgctccga gtatccccac 1321 tctggatttt tctctgcaag attcttccct tggacttcag gcagagaagg ggactgaaag 1381 ggagatgagc acccgcagtg agggcttaat ctgcacggcc attctctgca aggcaggtga 1441 taacaactga agcaagagaa gctgtcattg aggggagaga gttgttggtg agcgattaaa 1501 gagcagtcac attatcacag cagagcattc atcgtggccc agtgctgggg agctacgtta 1561 gaattgccca gtgtgtctgc ttcccagcat aactatgcat tcttcaatta aaaaactgca 1621 
ggcatgtttg ccatttccag ctctcggaga tgagttaaag caaagctctg gaaacctgca 1681 agctctctga gtgctagtag aatgaaatga aagaataaag ccagatatag attctgc // LOCUS HUMPDHBA 1484 bp ss-mRNA PRI 10-JUL-1990 DEFINITION Human pyruvate dehydrogenase beta-subunit mRNA, complete cds. ACCESSION M34479 KEYWORDS pyruvate dehydrogenase. SOURCE Human liver, cDNA to mRNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 1484) AUTHORS Ho,L. and Patel,M.S. TITLE Cloning and cDNA sequence of the beta-subunit component of human pyruvate dehydrogenase complex JOURNAL Gene 86, 297-302 (1990) STANDARD simple staff_review FEATURES from to/span description pept 1 1080 pyruvate dehydrogenase beta-subunit precursor /hgml_locus_uid="LU0223C" /nomgen="PDHB" /map="unassigned" sigp 1 90 pyruvate dehydrogenase beta-subunit signal peptide matp 91 1077 pyruvate dehydrogenase beta-subunit mRNA < 1 1484 pyruvate dehydrogenase beta-subunit mRNA BASE COUNT 414 a 287 c 369 g 414 t ORIGIN 1 atggcggcgg tgtctggctt ggtgcggaga ccccttcggg aggtctccgg gctgctgaag 61 aggcgctttc actggaccgc gccggctgcg ctgcaggtga cagttcgtga tgctataaat 121 cagggtatgg atgaggagct ggaaagagat gagaaggtat ttctgcttgg agaagaagtt 181 gcccagtatg atggggcata caaggttagt cgagggctgt ggaagaaata tggagacaag 241 aggattattg acactcccat atcagagatg ggctttgctg gaattgctgt aggtgcagct 301 atggctgggt tgcggcccat ttgtgaattt atgaccttca atttctccat gcaagccatt 361 gaccaggtta taaactcagc tgccaagacc tactacatgt ctggtggcct tcagcctgtg 421 cctatagtct tcaggggacc caatggtgcc tcagcaggtg tagctgccca gcactcacag 481 tgctttgctg cctggtatgg gcactgccca ggcttaaagg tggtcagtcc ctggaattca 541 gaggatgcta aaggacttat taaatcagcc attcgggata acaatccagt ggtggtgcta 601 gagaatgaat tgatgtatgg ggttcctttt gaatttcctc cggaagctca gtcaaaagat 661 tttctgattc ctattggaaa agccaaaata gaaaggcaag gaacacatat aactgtggtt 721 tcccattcaa gacctgtggg ccactgctta gaagctgcag cagtgctatc taaagaagga 781 gttgaatgtg 
aggtgataaa tatgcgtacc attagaccaa tggacatgga aaccatagaa 841 gccagtgtca tgaagacaaa tcatcttgta actgtggaag gaggctggcc acagtttgga 901 gtaggagctg aaatctgtgc caggatcatg gaaggtcctg cgttcaattt cctggatgct 961 cctgctgttc gtgtcactgg tgctgatgtc cctatgcctt atgcaaagat tctagaggac 1021 aactctatac ctcaggtcaa agacatcata tttgcaataa agaaaacatt aaatatttag 1081 tttggacttg aatatcaagt cgttgaaatt tatttgaaat acttgctggc actgcacctg 1141 gatttgtact gcaagacctg actattcata aaggaaaacg atttctaaag caacagcagg 1201 tatttttgta cagggaagtt taaatgtgtt tgtgtatgga aaactctcca ctctcctccc 1261 ctagatgcca tgcttccttt tgtctgttac ggttgccatg ttctttgaat aacaaattat 1321 atcacatttt atcctctctc accacaagga caaagtatgg atgtggcaga gtcctgatga 1381 aagatgtatc caaacaagat aacttatatg tataaaatta aagcatataa tacacattta 1441 ctgttagttt gttttgataa ggaataaagg aatttctaac atga // LOCUS LEIGP63A 3047 bp ds-DNA INV 10-JUL-1990 DEFINITION L.chagasi major surface glycoprotein (gp63) gene, complete cds. ACCESSION M28527 KEYWORDS glycoprotein; protease; surface antigen. SOURCE L.chagasi (isolate MHOM/BR/82/BA-2C1a) DNA, clones pLc63-[1 and 2]. ORGANISM Leishmania chagasi Eukaryota; Animalia; Protozoa; Sarcomastigophora; Mastigophora; Kinetoplastida; Trypanosomatina; Trypanosomatidae. REFERENCE 1 (bases 1 to 3047) AUTHORS Miller,R.A., Reed,S.G. and Parsons,M. TITLE Leishmania gp63 molecule implicated in cellular adhesion lacks an Arg-Gly-Asp sequence JOURNAL Mol. Biochem. Parasitol. 39, 267-274 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly provided by M.Parsons, 03-OCT-1989. 
FEATURES from to/span description pept 496 2295 gp63 protein BASE COUNT 423 a 1195 c 925 g 504 t ORIGIN 1 ggtacctccc ccaccccggc cctccggccc cgcgcccccg cctctgtgct gtgccgtgcc 61 ctggactccc tctcctccac ctctcctcgc ttctgtcgct ccgcctcccc gagcgacccg 121 cggcgccgcg cggtgcgtgt ctggtgcggc gagtggcggg gtgccgtccc ccctcgctgc 181 ggcacccctc cccgcgccac cacggaggca cccgtgagca cgccaacaga ccaacgcact 241 cacgtcccca tcgtcctccc ccctccccgc accagcaccg acgtgctctc cgctctccct 301 ccctcaccac ctcccctcgc accctccctt gccttctccc tgtcccctcc ctccccagat 361 ccgccaacgc atccgatccc gctacacccc cctctccccc gcccacacgc acgcgcacac 421 cgccgtgcac aagccctcgc cctcgccctc gccaccacac cccactgccc acagcgcccc 481 cgcgcctgca gagccatgtc cgtcgacagc agcagcacgc accggcaccg cagcgtcgcc 541 gcgcgcctgg tgcgcctcgc ggctgccggc gccgcagtca tcgctgctgt cggcaccgcg 601 gccgcgtggg cacacgccgg tgcggtgcag caccgctgca tccacgacgc gatgcaggca 661 cgcgtgcggc agtcggtggc gcgccaccac acggcccccg gcgccgtgtc cgcggtgggc 721 ctgccgtacg ttactctcga caccgcggcc gccgccgatc gccggccggg cagcgcgccc 781 acagtcgtgc gcgccgcgaa ctggggcgcg ctgcgcatcg ccgtctccac cgaggacctc 841 accgaccccg cctaccactg cgctcgcgtc gggcagcaca tcaagaggcg acttggcggc 901 gtcgacatat gcacggccga ggacatcctc accgacgaga agcgcgacat cctggtcaag 961 cacctcatcc cgcaggcgct gcagctgcac acggagcggc tgaaggtgcg gcaggtgcag 1021 gacaagtgga aggtgacggg catgggcgac gatgtgtgca gcgacttcaa ggtgccgccg 1081 gcgcacatca ccgatggcct gagcaacacc gacttcgtga tgtacgtcgc ctccgtgccg 1141 agcgaggagg gtgtgctggc gtgggccacg acctgccagg tgttctctga cggccatcca 1201 gccgtgggcg tcatcaacat ccccgcggcg aacattgcgt cgcggtacga ccagctggtg 1261 acgcgtgtcg tcacgcacga gatggcgcac gcgctcggct tcagcgtcgg cttcttcgaa 1321 ggcgcccgca tcctggagag catttcgaac gttcggcaca aggacttcga tgttcccgtg 1381 atcaacagca gcacggcggt ggcgaaggcg cgcgagcagt acggctgcga caccttggag 1441 tatctggaga tcgaggacca gggcggtgcg ggctccgccg ggtcgcacat caagatgcgc 1501 aacgcgcagg acgagctcat ggcgcctgcc gcagctgccg ggtactacag cgccctgacc 1561 atggccatct tccaggacct cggcttctac caggcggact tcagcaaggc 
cgaggtgatg 1621 ccgtggggcc ggaacgccgg ctgcgccttc ctcagcgaga agtgcatgga gcggaacatc 1681 acgaagtggc cggcgatgtt ctgcaatgag aacgaggtga ctatgcgctg ccccaccagt 1741 cgtctcagcc ttggaaagtg cggtgttacc cgtcacccgg accttccgcc gtactggcag 1801 tacttcacgg acccgtccct cgccggcatc tccgccttca tggactgctg ccctgtcgtg 1861 gagccctacg gtgatggcag ctgcgcacag cgtgcgtctg aagcgggcgc accattcaaa 1921 ggcttcaacg tcttctccga cgcggcgcgc tgcatcgatg gcgccttcag gccgaagacg 1981 agtcacggca taatcaagtc gtacgccgga ctgtgcgcca acgtgcggtg cgacacggcc 2041 acgcgcacgt acagcgtgca ggtgcacggc ggcagcggct acgccaactg cacgccgggc 2101 ctcagagttg agctgagcac cgtgagcagc gccttcgagg agggcggcta catcacgtgc 2161 ccgccgtacg tggaggtgtg ccagggcaac gtgcaggctg ccaaggacgg cggcaacgcc 2221 gcggctggtc gccgtggtcc gcgcgccgcg gcgacggcgc tgctggtggc cgcgctgctg 2281 gccgtggcgc tctagacggt ggataggacg ggtgctgatg gcgtgtcccc tgctcccccc 2341 tccctccctc cctctcgttg tctctcggaa gagctccacg ctgtcctttc atctcctcgc 2401 ctgttctacg cttgcttcgc tgcgccgctg caccgggccg gtcctcgccg accctcgcct 2461 gccctctccc cctcctctct cccgccaccc caccccgctc cccgctgcgc acggtgcctg 2521 tgcgcttgga gagaggtgca gcagcgcgcg ggagctgagg gagggagggg gtgtcgtgcg 2581 cgggtgcgca tgccttcttt cacttcctta tttgtcttct atttgttccc tgcggcaccc 2641 gcacaccccc acccgctggc ggccatccgc ggcatccgcg ggtgcgtgcg cggtgtgtct 2701 gccttctctc tcctcctttc gctctgtttc cctgtcctcg gactccccgg cgccagcgtg 2761 agctccgcag tcaccgccca cccggcgctc cggcgcggtc agcgccaccc caccccaccc 2821 cctctccccc attcgtgcgt gtctcttctc gctttttctg tttcctcttg tagcagggcg 2881 cgccgcgttg tgggagcggt ggcggcctct gcgcgcggac ggcatgcagg tcggccggga 2941 gagtctcccg ccagcgcccg cgcagcgcag agccgtcgcc cacccaccgt ctcctcccac 3001 cttcgcatgc cgccgcacta ggtgcacgtc gtcggcacga ccaccga // LOCUS PFATUBB 2833 bp ds-DNA INV 10-JUL-1990 DEFINITION P.falciparum beta-tubulin gene, complete cds. ACCESSION M28398 KEYWORDS tubulin. SOURCE P.falciparum (Brazilian strain 7G8, isolate 78G) nonsynchronous blood stage DNA, clone 768. 
ORGANISM Plasmodium falciparum Eukaryota; Animalia; Protozoa; Microspora; Microsporea; Microsporida; Haemosporina; Plasmodiidae. REFERENCE 1 (bases 1 to 2833) AUTHORS Sen,K. and Godson,G.N. TITLE Isolation of alpha and beta-tubulin genes of Plasmodium falciparum using a single oligonucleotide probe JOURNAL Mol. Biochem. Parasitol. 39, 173-182 (1990) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by G.N.Godson, 27-SEP-1989. FEATURES from to/span description pept 654 749 beta-tubulin, exon 1 1112 2064 beta-tubulin, exon 2 2228 2516 beta-tubulin, exon 3 IVS 750 1111 TUBB intron A IVS 2065 2227 TUBB intron B BASE COUNT 1018 a 311 c 455 g 1049 t ORIGIN 1 aattcctagt ttatttaatt taaaaattaa aagatcgaat gctcaacatt ttaaaaagaa 61 atctgtgaaa catatcttaa caagaaatgg tgtaacaaaa gaaacaatat taaatgataa 121 attaccaaag ataaatgatg aaattgacag aacatataat ggacacaaaa tggatgaaaa 181 tttacaggat aaacaaaaaa ggaatcatgg agtaaatata aaattaataa atgaatatga 241 aaatatcatg tgaagaataa attctcaaaa tcattgattg tatgacaaga ttcaagaatt 301 ggttatataa aaatatattt aggaaaagta attttgggtc atatgtatca acatttacag 361 gtgtatttgg aggtgctgca gctgttagct gtttctgcca taagtggagc ttgtataact 421 aaatttagtg ttacattggt tccggtattt gcatgttttg ggggtgtctt tgcgattatt 481 ataatattat taatattagg aacatggatg cttgttacat ggttatggca acacaaagaa 541 gtagtatttt tttttttttt taatttttac ttaatatatc ctcttacaat ataaaatatt 601 tatatattta aaaaaaaaag aaaaaatttt ctttgagatt attttattaa agaatgagag 661 aaattgttca tattcaagct ggccaatgtg gaaatcaaat aggtgcaaag ttttgggaag 721 tcatttctga tgagcatgga atagatccag taagtttaaa aaaaaaatat atttatttat 781 atgaatctgt aaacatatgt atatttatat atatatatat atatatatgg aagaataatt 841 ttgtgtgtat aatttggggt ccttcccctt tattgtattc tataaatgcc tcctttatat 901 tgataataat ttatatatgt aaacctttaa tgacgaggct tatatataaa aaccttagat 961 attataaata aatgtatatt atgtacatat gacgatatcg ctctctctat atatatatat 1021 atatatatat atatatattt atttatttat atatttattt atttatttat ttatttattt 1081 tttttttttt tttttatttt 
atttttttta gagtggtacc tatagtgggg acagtgactt 1141 acagttagaa agagttgacg ttttttacaa cgaagcaaca ggaggtagat atgttccaag 1201 agctatattg atggacttgg aacctggtac tatggatagt gttcgtgctg gcccctttgg 1261 tcaattattt cgtccagata attttgtgtt tggtcaaaca ggtgcaggaa ataattgggc 1321 taaaggacat tatactgaag gtgctgaatt gatagatgca gttttagatg tgcttagaaa 1381 agaagcagaa ggttgtgatt gtttacaagg atttcagatt actcattcat taggtggtgg 1441 tacaggtagt ggtatgggta ctttgttgat tagtaaaata agagaggagt atcctgatcg 1501 tattatggaa acattttctg tatttccatc accaaaagtt tctgatactg ttgttgaacc 1561 atataatgct acattatcag tccatcagtt ggttgaaaat gctgatgaag ttcaagttat 1621 cgataatgaa gctttatatg acatatgttt taggactctt aaattaacaa caccaacata 1681 tggagattta aatcaccttg tatcagctgc aatgtcaggt gtaacctgtt cgttaagatt 1741 tcctggtcaa cttaacagtg acttaagaaa attagctgtt aatttgatcc cattcccacg 1801 tttacatttc tttatgtacg ggtttgctcc tttaactagt agaggcagtc aacaatacag 1861 agccttaact gtgccggagt taacacaaca aatgttcgac gcaaaaaata tgatgtgcac 1921 aagtgatcca agacatggaa gatatttaac ggcatgtgct atgtttagag gaagaatgtc 1981 cacaaaggaa gttgacgaac aaatgttaaa cgttcaaaat aaaaactcat cttattttgt 2041 cgaatggatt cctcacaaca caaagtaaga aggaacaatt gatactagta tgcatgtttt 2101 tttgtttata tgtatttata tatatatata tatatatgta ttcatttata tattttgaaa 2161 tatacatttt acatataaat tttttttttt tctttttctt tttttttttt tttgtttttt 2221 tctttagatc aagtgtttgt gatattccac cattgggatt aaaaatggct gttacttttg 2281 taggaaactc aaccgccatt caagaaatgt ttaaaagagt ttctgatcaa tttactgcta 2341 tgtttagaag aaaagccttt ttgcactggt acaccggaga aggtatggac gagatggaat 2401 ttacagaagc tgaatcaaat atgaatgatt tagtttcaga atatcaacaa tatcaagatg 2461 ctacagcaga agaggaagga gaatttgaag aagaagaagg agacgtagaa gcctaaatct 2521 atttatattt atgaaaatat atacatatta tatatatatg tatatgtaat taacaagaat 2581 aaaaaataaa aaataaaaaa aaaataaaat aaaaaaataa aaatacataa taaaaaagta 2641 taaaataaat atctaatcat taattatata taacaatata atttaactct tttttttttt 2701 attattattg aagttatgtt cgggtatata taacatatat ataaattata tatatgttgc 2761 agtttctttt tttttttttt tttttttttt 
tcttatcatt tgattttaca ctcacatata 2821 tatgacatat ata // LOCUS RATADOME1 2513 bp ds-DNA ROD 10-JUL-1990 DEFINITION Rat S-adenosylmethionine decarboxylase pseudogene, complete cds. ACCESSION M34463 KEYWORDS S-adenosylmethionine decarboxylase; pseudogene. SOURCE Rat DNA. ORGANISM Rattus norvegicus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 2513) AUTHORS Pulkka,A., Keraenen,M.-R., Salmela,A., Salmikangas,P., Ihalainen,R. and Pajunen,A. TITLE Nucleotide sequence of rat S-adenosylmethionine decarboxylase cDNA. Comparison with an intronless rat pseudogene JOURNAL Gene 86, 193-199 (1990) STANDARD simple staff_review FEATURES from to/span description pept.ps 436 1379 pseudo-S-adenosylmethionine decarboxylase BASE COUNT 771 a 478 c 518 g 746 t ORIGIN 1 tctactaaac atgataaaga atttaagaaa tccatctctt cacttccagt ctatatatct 61 ttgagatgct attcaggata ctgagttaaa aaataagatt aggcttacac agcatggcgc 121 ggaacattag ctaactctca ctcaactctg acaagaaagc agcagactac atgagactga 181 actgtatctg cctttagttc caacagactc acgttcaact tttcttcacg aaaacagcca 241 gggaaatttt attagtcctt ttttaaaaat agttaatata aaattataac aacaacagca 301 gcagcaacaa caacaaggac cctgaactta gtaacacacg tggaacaaac cgtagcagcg 361 actggagcag tgggagaaga gatttaattt aggtgatttt tttggatttg ttggttgttg 421 gtcagcctca cagtgatgga agttgcacat atttttttga agggactgag aagctgctag 481 aggtctggtt ttccagacag cagtccgacg ccagccaggg acatggggat cttcatacca 541 tcccaagatc tcagtgggat gtgcttttga ggatgtgcag tcctcaacca taagtacgac 601 aaagatgcac aagcaggaag cttacacact cagtgagagt agcatgttta tacatttcat 661 gtgatactac cctcttactg aaagctctgg tttccaggtt gaagctcgct agggattacc 721 gtgggttaga ctcaattctt ttattctcat aagaatttca tgaagccctc tctccaaggg 781 tacccacacc gaaatttcca cgaagaaatc gaatttctta atgcagtttt cccaaatgga 841 gcagcatatt gtatgggaca aacgaattct gactgttggt acttatatac ttggatctcc 901 agagagccga gtcatcaaac agtcagatca accctgggaa ttctgatgag tgagcttgac 961 ctagcagtta cggaccagtt 
ctattgctgc aaaggatgtc actcgtgaga gtgaattcat 1021 gacctgatat caggtcattg atgacacact gtttaatcct tgcagcttct tgatgaatgg 1081 aatgaaatcg attggactag tcacatcgct ccagaagcag agttctctta tgttagcttt 1141 gaaacaaacc taagtgagac atcctatgac agcccgatca ggaaagttgg gaaattcgtc 1201 aagccaggaa aatttgtgac caccttgttt gttaatcaga gttctaaatg tcgcacaggc 1261 cattcttcat cccagaagat tgacggtttt aaacatcatg attgccaaag tgctatgctc 1321 aacgaagata aatgcaatat tgaatgtatc aaatgaaaag aattcagtct ctggtggagg 1381 gggattggag caaggatgaa tcagcccact aaagaaaact ccatggaaaa gacaggctat 1441 gcagtgcact ttaatcagct tcacacggtg cctaccatgc cttcactaac taaccaagta 1501 gtgatagaaa tgtccactaa gtcaaagcag aaatgtaata ctaagcattc tgacctcagt 1561 aagcaccacc attgccacca ttgccaccaa tttttactaa aggaaatttt gaatcaaatg 1621 aggatctgta gtttccgtct gttctgaggt cggctgttct ctttggtctt cgtttcacca 1681 tggcgctcag atgatcaaat gagtagctgc cagagggagg aatctccagg ttacttagcc 1741 tggagaatgg atgaatggat gaaacagcac aatattatga ctgtttagaa atacaggctt 1801 tcaagagtcg gcatgttagt ggcatttgta gatactgtgg aatttaagca gcaaagaaca 1861 aattggacta aatttcctat taattgccct cccactgttt cttggtagtt tctggactgg 1921 cacatcgatg tttttttttt ttttttcctt ccatatttaa aatgaagcac ttttttagca 1981 tttctaagca aagaatgcac ttggtttgta atcaagtagt tggaacgctg tctgaatgtt 2041 tactttatac accatgctga ttgaacgctt cattgaggaa gctttcagtc agttattggt 2101 ctgattctgt aatgagcaca gcacgtggtt tgaattgcca tttggaggac cagtgcttat 2161 ttaggctgga tcgcgtaaac cggtagattt tagcttgagg tttgattccc tcaccttata 2221 aaattaagaa ttctaatgtt gaaaattgca taggtttgtg tgaaacaaag cccagaagag 2281 aaactgtagg tagactagta atcttgtgta attataggtg agaagtttta gtgccgtaat 2341 ttctttgttg gcgttggact tttatcagct gaaatgtatt tctgtaccac aatgtaagct 2401 tcaataaagt ttgcttaatt gtctagtaac attaaaaaat ataagattaa tagaattgat 2461 ctcaacagta aggaaacaaa actaccttta ttattacata acataatctt tca // LOCUS RATADOMET 3102 bp ss-mRNA ROD 10-JUL-1990 DEFINITION Rat S-adenosylmethionine decarboxylase mRNA, complete cds. 
ACCESSION M34464 M21155 J04048 M21783 KEYWORDS AdoMet decarboxylase; S-adenosylmethionine decarboxylase. SOURCE Rat prostate, cDNA to mRNA, clone pSAMr1. ORGANISM Rattus norvegicus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 232 to 1821) AUTHORS Pajunen,A., Crozat,A., Janne,O.A., Ihalainen,R., Laitinen,P.H., Stanley,B., Madhubala,R. and Pegg,A.E. TITLE Structure and regulation of mammalian S-adenosylmethionine decarboxylase JOURNAL J. Biol. Chem. 263, 17040-17049 (1988) STANDARD full staff_review REFERENCE 2 (bases 1 to 3102) AUTHORS Pulkka,A., Keraenen,M.-R., Salmela,A., Salmikangas,P., Ihalainen,R. and Pajunen,A. TITLE Nucleotide sequence of rat S-adenosylmethionine decarboxylase cDNA. Comparison with an intronless rat pseudogene JOURNAL Gene 86, 193-199 (1990) STANDARD simple staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly provided by A.Crozat, 27-OCT-1988. FEATURES from to/span description pept 273 1274 S-adenosylmethionine decarboxylase (EC 4.1.1.50) mRNA 1 3102 S-adenosylmethionine decarboxylase mRNA BASE COUNT 835 a 650 c 724 g 893 t ORIGIN 1 cggggaaagc agcggactac aagagactga actgtatctg cctctatttc caacggactc 61 acgttcaact ttcgctcacg aaaatagccg ggaaaatttt attagtcctt tttttaaaaa 121 aagttaatat aaaattatag caaaaaaaaa aaaaggaacc tgaactttag taacacagct 181 ggaacaatcc gcagcggcgg caggagcggc gggagaagag tttaatttag ttgattttct 241 gtggttgttg gttgttcgct agtctcacgg tgatggaagc tgcacatttt ttcgaaggga 301 ccgagaaact gctggaggtc tggttctcca gacagcagtc cgacgcaagc cagggatctg 361 gggaccttcg taccatccca agatccgagt gggatgtcct tctgaaggat gtgcagtgct 421 caatcataag tgtgacaaag actgacaagc aggaagctta tgtactcagt gagagtagca 481 tgtttgtctc caagagacgt ttcattttga agacatgtgg taccaccctc ttactgaaag 541 cactggttcc cctgttgaag cttgctaggg actacagtgg gtttgactcg attcaaagct 601 tcttttattc tcgtaagaat ttcatgaagc cttctcacca agggtaccca caccggaatt 661 tccaggaaga aatcgagttt cttaatgcaa 
ttttcccaaa cggagcagga tattgtatgg 721 gacgtatgaa ttctgactgt tggtacctgt acactttgga tctcccagag agccgagtaa 781 tcaatcagcc agatcaaacc ctggaaattc tgatgagtga gcttgaccca gcagttatgg 841 accagttcta catgaaagat ggtgttactg caaaggatgt cactcgtgag agtggaattc 901 gtgacctgat accaggttct gtcattgatg ccacactgtt caatccttgt ggctactcaa 961 tgaatggaat gaaatcggat ggaacatatt ggactattca catcactcca gaaccagaat 1021 tttcttatgt tagctttgaa acaaacctaa gtcagacctc ctatgatgac ctgatcagga 1081 aagttgtgga agtcttcaag ccaggaaaat ttgtgaccac cttgtttgtt aatcagagtt 1141 ctaagtgtcg cacagtgctt tcttcgcccc agaagattga cggtttcaaa cgtcttgatt 1201 gccagagcgc tatgttcaac gattacaatt ttgtttttac cagttttgct aagaaacagc 1261 aacaacagag ttgattagga aaaatgaaaa agaaaaaacg caaaaagaga agacacacag 1321 gaggtggtgg ctgctttcta gatgttgatc ctgggggcca tgctgaccgt gaccaccacc 1381 ttgtagctgc agaaagccct aggtgtaatg atagtgtaat cattttgaag tgtatgcatt 1441 attatatcaa ggagttagat atcttgcatg aatgctctct tctgtgttta ggtgttctat 1501 gccactcttg ctgtggaact gaagtgcatg tagaaaagaa ctctgactgt atgaatcttt 1561 acgacacttg tgaaaacgat tcgacttggt ttatgcacag cgtaatattt ctgcaggcat 1621 cgtccaaaat cccccacaga caaggctttc gtccccatta gatgcggcct cagctgacca 1681 ttggcgactg ttctatttgc tgccagagtt tttacatcca gttacctcca ctttctagag 1741 catattctct actaatgttc aaaaccgatt tctacttcat acgggtgtct tatgcaatgg 1801 caattaaagt tttcttccac aagttgagtc tttgtaagga aatgattcca gttgcttgtt 1861 ttgtgttcta ctgttttagt aattgctcct gcatttatag tcctatggtt tttcactacc 1921 cctgatgaag caatacacgg tcacactgtg ggcttacatt gtaatcttca ccccagatgg 1981 gagctcagag acggtccctt gctcattttt ccctaagatg tagaatgtgg ccttgctatt 2041 ggcatgccct tctgtggaag ataaatgatg gaagtgaaag tatcccgggg gtgagcaagg 2101 agaaccaccc catggcagtg atgggcttgg cagtgcactc cgagctctca cagtggagtg 2161 cccaccatgc cttcactaac tcactgagca gtgataggat gcccaccaag tcagagcaga 2221 aatctaaccc taaggattct cacctcggta agtgccgcca ttgccaccac tttactaaag 2281 gaagtttccg ctcagaggag agtctgtact tcccgcctgt cctaatgtca gctgttctct 2341 ctggtctttc accatggcgt tcagatgctc aaatgaatgg 
ctgatcggcc gcagggagga 2401 ctctccgggt tactgggcct ggagaatgga gaaacaggca cggtattctg acagttaatg 2461 gcaccagaga tgcgggcttt caagagctgg cctgttagtg gcatttttaa gcagaaaaga 2521 gcaaactaga cgaagttccc tatttattgc cctcccactg tttccttggc agtttctgga 2581 ctggcgcaat gatgccttgt tccttccgta tttataacga agctaaaaag cgtttctaag 2641 catggagtct acttggtttg aaatcaagtg gttggaacac tgtctggatt tttactttac 2701 gcagtgttga ttgaacgctt cgttggggaa gccttcagtc cgcttcatcg gtctgttctg 2761 taatgagcac agcacaccta gtttgaattg ctgtttggag ggccagtgct tatttgagct 2821 gggtcttgta acccagtaga ttttggcttg aggtctgact cccccatctt acgaaattaa 2881 gaattctaat gttggaaatt gcatagggtt tgcgtggaaa aaagcccagg gaaaaaaaaa 2941 aaaaaacaga aggcggacta gtgatctagt gtgattacag gcggggaagt tttggtgcca 3001 taatttcttt gttggtgttg gacttttaat cagctgaaat gtatttctgt accacaatgt 3061 aagcttcaat aaaagtttgc ttaattgtct agtaacatcc ag // LOCUS HUMCD38 1407 bp ss-mRNA PRI 10-JUL-1990 DEFINITION Human lymphocyte differentiation antigen CD38 mRNA, complete cds. ACCESSION M34461 KEYWORDS cell surface glycoprotein; lymphocyte differentiation antigen CD38; membrane glycoprotein. SOURCE Human PHA-treated peripheral blood cell, cDNA to mRNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 1407) AUTHORS Jackson,D.G. and Bell,J.I. TITLE Isolation of a cDNA encoding the human CD38 (T10) molecule, a cell surface glycoprotein with an unusual discontinuous pattern of expression during lymphocyte differentiation JOURNAL J. Immunol. 
144, 2811-2815 (1990) STANDARD simple staff_review FEATURES from to/span description pept 70 972 lymphocyte differentiation antigen CD38 /hgml_locus_uid="LZ0047A" /nomgen="CD38" /map="4" BASE COUNT 381 a 332 c 326 g 368 t ORIGIN 1 ctaaagctct cttgctgcct agcctcctgc cggcctcatc ttcgcccagc caaccccgcc 61 tggagcccta tggccaactg cgagttcagc ccggtgtccg gggacaaacc ctgctgccgg 121 ctctctagga gagcccaact ctgtcttggc gtcagtatcc tggtcctgat cctcgtcgtg 181 gtgctcgcgg tggtcgtccc gaggtggcgc cagacgtgga gcggtccggg caccaccaag 241 cgctttcccg agaccgtcct ggcgcgatgc gtcaagtaca ctgaaattca tcctgagatg 301 agacatgtag actgccaaag tgtatgggat gctttcaagg gtgcatttat ttcaaaacat 361 ccttgcaaca ttactgaaga agactatcag ccactaatga agttgggaac tcagaccgta 421 ccttgcaaca agattcttct ttggagcaga ataaaagatc tggcccatca gttcacacag 481 gtccagcggg acatgttcac cctggaggac acgctgctag gctaccttgc tgatgacctc 541 acatggtgtg gtgaattcaa cacttccaaa ataaactatc aatcttgccc agactggaga 601 aaggactgca gcaacaaccc tgtttcagta ttctggaaaa cggtttcccg caggtttgca 661 gaagctgcct gtgatgtggt ccatgtgatg ctcaatggat cccgcagtaa aatctttgac 721 aaaaacagca cttttgggag tgtggaagtc cataatttgc aaccagagaa ggttcagaca 781 ctagaggcct gggtgataca tggtggaaga gaagattcca gagacttatg ccaggatccc 841 accataaaag agctggaatc gattataagc aaaaggaata ttcaattttc ctgcaagaat 901 atctacagac ctgacaagtt tcttcagtgt gtgaaaaatc ctgaggattc atcttgcaca 961 tctgagatct gagccagtcg ctgtggttgt tttagctcct tgactccttg tggtttatgt 1021 catcatacat gactcagcat acctgctggt gcagagctga agattttgga gggtcctcca 1081 caataaggtc aatgccagag acggaagcct ttttccccaa agtcttaaaa taacttatat 1141 catcagcata cctttattgt gatctatcaa tagtcaagaa aaattattgt ataagattag 1201 aatgaaaatt gtatgttaag ttacttcctt tagagcacaa tggatctcga gggatcttcc 1261 atacctacca gttctgcgcc tgcgagtcgc ggccgcatct agaggatctt tgtgaaggaa 1321 ccttacttct gtggtgtgac ataattggac aaactaccta tagagattta aagctctaag 1381 gtaaatataa aatttttaag tgtataa // LOCUS MUSCD28 1492 bp ss-mRNA ROD 10-JUL-1990 DEFINITION Mouse glycoprotein CD28 mRNA, complete cds. 
ACCESSION M34563 KEYWORDS glycoprotein CD28. SOURCE Mouse lymphoma T cell line EL4, cDNA to mRNA, clone lambda-SSD1.5. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 1492) AUTHORS Gross,J.A., St John,T. and Allison,J.P. TITLE The murine homologue of the T lymphocyte antigen CD28: Molecular cloning and cell surface expression JOURNAL J. Immunol. 144, 3201-3210 (1990) STANDARD simple staff_review FEATURES from to/span description pept 57 713 glycoprotein CD28 precursor sigp 57 113 glycoprotein CD28 signal peptide matp 114 710 glycoprotein CD28 BASE COUNT 401 a 355 c 332 g 404 t ORIGIN 1 acacactctg ccttgctcac agaggagggg ctgcagccct ggccctcatc agaacaatga 61 cactcaggct gctgttcttg gctctcaact tcttctcagt tcaagtaaca gaaaacaaga 121 ttttggtaaa gcagtcgccc ctgcttgtgg tagatagcaa cgaggtcagc ctcagctgca 181 ggtattccta caaccttctc gcaaaggaat tccgggcatc cctgtacaag ggcgtgaaca 241 gcgacgtgga agtctgtgtc gggaatggga attttaccta tcagccccag tttcgctcga 301 atgccgagtt caactgcgac ggggatttcg acaacgaaac agtgacgttc cgtctctgga 361 atctgcacgt caatcacaca gatatttact tctgcaaaat tgagttcatg taccctccgc 421 cttacctaga caacgagagg agcaatggaa ctattattca cataaaagag aaacatcttt 481 gtcatactca gtcatctcct aagctgtttt gggcactggt cgtggttgct ggagtcctgt 541 tttgttatgg cttgctagtg acagtggctc tttgtgttat ctggacaaat agtagaagga 601 acagactcct tcaagtgact accatgaaca tgactccccg gaggcctggg ctcactcgaa 661 agccttacca gccctacgcc cctgccagag actttgcagc gtaccgcccc tgacagggac 721 ccctatccag aagcccgccg gctggtaccc gtctacctgc tcatcatcac tgctctggat 781 aggaaaggac agcctcatct tcagccggcc actttggacc tctactgggc caccaatgcc 841 aactatttta gagtgtctag atctaacatc atgatcatct tgagactctg gaatgaatga 901 cagaagcttc tatggcagga taaagtctgt gtggcttgac ccaaactcaa gcttaataca 961 tttattgact tgattgggga agttagagta gagcaatcaa aaagatcatt cattcagcct 1021 tgggaagtca atttgcaggc tcctggatga gccctgcccc gttttcactt gccagcacat 1081 ttcagtcatg tggtgtgata 
gccaaagatg ttttggacag agaagaaagg atagaaaaac 1141 cttctctttg gctaagttgg tgtttggggt ggggataggt tagagtatag tacttaacta 1201 tttgaaaaat aatgaaaaca cttttttcac tcatgaaatg agccacttag ctcctaaata 1261 gtgttttcct gttagtttag aaagttgtgg acatattttt ttaatgattt ctgaccattt 1321 ttaatcacat tgactcatgg aatggcctca aagcaccccc cagtgcttct ttcctcattc 1381 ccggtcatgg gaactcagta ttattaatag tcacaacatg atttcagaac tagatagccc 1441 tcccacacca agaagaatgt gagaggaagt aaggtcactt tatgtaaaaa cg // LOCUS MUSIGHAAU 294 bp ss-mRNA ROD 10-JUL-1990 DEFINITION Mouse Ig gamma-chain (anti-insulin Ab 123) mRNA V region, partial cds. ACCESSION M34523 KEYWORDS gamma-immunoglobulin; immunoglobulin heavy-chain; processed gene; variable region. SOURCE Mouse (BALB/c), cDNA to mRNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 294) AUTHORS Ewulonu,U.K., Nell,L.J. and Thomas,J.W. TITLE V-H and V-L gene usage by murine IgG antibodies that bind autologous insulin JOURNAL J. Immunol. 144, 3091-3098 (1990) STANDARD simple staff_review FEATURES from to/span description pept < 1 > 294 Ig gamma-chain V-region (AA at 1) BASE COUNT 83 a 61 c 77 g 73 t ORIGIN 1 caggtccagc tgcagcagtc tgggccagag gtggtgaggc ctggggtctc agtgaagatt 61 tcctgcaagg gttccgacta cacattcact gattatgcta tgcactgggt gaagcagagt 121 catgcaaaga gtctagagtg gattggagtt attagtactt acaatggtaa tacaaactac 181 aaccagaagt ttaagggcaa ggccacaatg actgtagaca aatcctccag cacagcctat 241 atggaacttg ccagattgac atctgaggat tctgccatgt attactgtgt acgt // LOCUS MUSIGHAAV 294 bp ss-mRNA ROD 10-JUL-1990 DEFINITION Mouse Ig gamma-chain (anti-insulin Ab 126) mRNA V region, partial cds. ACCESSION M34524 KEYWORDS gamma-immunoglobulin; immunoglobulin heavy-chain; processed gene; variable region. SOURCE Mouse (BALB/c), cDNA to mRNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. 
REFERENCE 1 (bases 1 to 294) AUTHORS Ewulonu,U.K., Nell,L.J. and Thomas,J.W. TITLE V-H and V-L gene usage by murine IgG antibodies that bind autologous insulin JOURNAL J. Immunol. 144, 3091-3098 (1990) STANDARD simple staff_review FEATURES from to/span description pept < 1 > 294 Ig gamma-chain V-region (AA at 1) BASE COUNT 83 a 69 c 77 g 65 t ORIGIN 1 gaggtccagc tgcaacagtc tggacctgag ctggtgaagc ctggggcttc agtgaagata 61 tcctgcaaga cttctggata cacattcact gaatacacca tgcactgggt gaagcagagc 121 catggaaaga gccttgagtg gattggaggt attaatccta acaatggtgg ttctaactac 181 aaccagaagt tcaagggcaa ggccacattg actgtagaca agtcctccag cacagcctac 241 atggagctcc gcagcctgac atctgaggat tctgcagtct attactgtgc aaga // LOCUS MUSIGHAAW 294 bp ss-mRNA ROD 10-JUL-1990 DEFINITION Mouse Ig gamma-chain (anti-insulin Ab 125) mRNA V region, partial cds. ACCESSION M34525 KEYWORDS gamma-immunoglobulin; immunoglobulin heavy-chain; processed gene; variable region. SOURCE Mouse (strain Balb/c), cDNA to mRNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 294) AUTHORS Ewulonu,U.K., Nell,L.J. and Thomas,J.W. TITLE V-H and V-L gene usage by murine IgG antibodies that bind autologous insulin JOURNAL J. Immunol. 144, 3091-3098 (1990) STANDARD simple staff_review FEATURES from to/span description pept < 1 > 294 Ig gamma-chain V-region (AA at 1) BASE COUNT 83 a 66 c 74 g 71 t ORIGIN 1 cagatccagt tggtgcagtc tggacctgaa ctgaagaagc ctggagagac agtcaagatc 61 tcctgcaagg cttctggtta taccttcaca gactattcaa tgcactgggt gaagcaggct 121 ccaggaaagg gtttaaagtg gatggactgg ataaacactg agactggtgt gccaacatat 181 gcagatgact tcaagggacg gtttgccttc tctttggaaa cctctgccag cactgcctat 241 ttgcagatca acgacctcaa aaatgaggac acggctacat atttctgtac taga // LOCUS MUSIGHAAX 294 bp ss-mRNA ROD 10-JUL-1990 DEFINITION Mouse Ig gamma-chain (anti-insulin Ab 127) mRNA V region, partial cds. 
ACCESSION M34526 KEYWORDS gamma-immunoglobulin; immunoglobulin heavy-chain; processed gene; variable region. SOURCE Mouse (strain Balb/c), cDNA to mRNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 294) AUTHORS Ewulonu,U.K., Nell,L.J. and Thomas,J.W. TITLE V-H and V-L gene usage by murine IgG antibodies that bind autologous insulin JOURNAL J. Immunol. 144, 3091-3098 (1990) STANDARD simple staff_review FEATURES from to/span description pept < 1 > 294 Ig gamma-chain V-region (AA at 1) BASE COUNT 81 a 80 c 61 g 72 t ORIGIN 1 gatgtgcagc ttcaggaggt aggacctgac ctggtgaaac cttctcagtc actttcactc 61 acctgcactg tcactggcta ctccatcacc agtggttata gctggcactg gatccggcag 121 tttccaggaa acaaactgga atggatgggc tacatacact acagtgatag ctctaactac 181 aacccatctc tcaaaagtcg aatctctatc actcgagaca catccaagaa ccagttcttc 241 ctgcagttga attctgtgac tactgaggac acagccacat attactgtgc aagg // LOCUS MUSIGKABI 300 bp ss-mRNA ROD 10-JUL-1990 DEFINITION Mouse Ig kappa-chain (anti-insulin Ab 123) mRNA V region, partial cds. ACCESSION M34527 KEYWORDS immunoglobulin light-chain; kappa-immunoglobulin; processed gene; variable region. SOURCE Mouse (strain Balb/c), cDNA to mRNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 300) AUTHORS Ewulonu,U.K., Nell,L.J. and Thomas,J.W. TITLE V-H and V-L gene usage by murine IgG antibodies that bind autologous insulin JOURNAL J. Immunol. 
144, 3091-3098 (1990) STANDARD simple staff_review FEATURES from to/span description pept < 1 > 300 Ig kappa-chain V-region (AA at 1) BASE COUNT 74 a 86 c 70 g 70 t ORIGIN 1 caaattgttc tcacccagtc tccagcaatc atgtctgcat ctccagggga gaaggtcacc 61 atgacctgca gtgccagctc aagtgtaagt tacatgcact ggtaccagca gaagtcaggc 121 acctccccca aaagatggat ttatgacaca tccaaactgg cttctggagt ccctgctcgc 181 ttcagtggca gtgggtctgg gacctcttac tctctcacaa tcagcagcat ggaggctgaa 241 gatgctgcca cttattactg ccagcagtgg agtagtaaac cacccatcac gttcggtgct // LOCUS MUSIGKABJ 300 bp ss-mRNA ROD 10-JUL-1990 DEFINITION Mouse Ig kappa-chain (anti-insulin Ab 126) mRNA V region, partial cds. ACCESSION M34528 KEYWORDS immunoglobulin light-chain; kappa-immunoglobulin; processed gene; variable region. SOURCE Mouse (strain Balb/c), cDNA to mRNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 300) AUTHORS Ewulonu,U.K., Nell,L.J. and Thomas,J.W. TITLE V-H and V-L gene usage by murine IgG antibodies that bind autologous insulin JOURNAL J. Immunol. 144, 3091-3098 (1990) STANDARD simple staff_review FEATURES from to/span description pept < 1 > 300 Ig kappa-chain V-region (AA at 1) BASE COUNT 77 a 80 c 66 g 77 t ORIGIN 1 gatattgtgc taactcagtc tccagccacc ctgtctgtga ctccaggaga tagcgtcagt 61 ctttcctgca gggccagcca aagtattagc aacaacctac actggtatca acaaaaatca 121 catgagtctc caaggcttct catcaagtat gcttcccagt ccatctctgg gatcccctcc 181 aggttcagtg gcagtggatc agggacagat ttcactctca gtatcaacag tgtggagact 241 gaagattttg gaatgtattt ctgtcaacag agtaacagct ggcctcacac gttcggctcg // LOCUS MUSIGKABK 312 bp ss-mRNA ROD 10-JUL-1990 DEFINITION Mouse Ig kappa-chain (anti-insulin Ab 127) mRNA V region, partial cds. ACCESSION M34529 KEYWORDS immunoglobulin light-chain; kappa-immunoglobulin; processed gene; variable region. SOURCE Mouse (strain Balb/c), cDNA to mRNA.
ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 312) AUTHORS Ewulonu,U.K., Nell,L.J. and Thomas,J.W. TITLE V-H and V-L gene usage by murine IgG antibodies that bind autologous insulin JOURNAL J. Immunol. 144, 3091-3098 (1990) STANDARD simple staff_review FEATURES from to/span description pept < 1 > 312 Ig kappa-chain V-region (AA at 1) BASE COUNT 78 a 81 c 77 g 76 t ORIGIN 1 gacattgtgc tgacccaatc tccagcttct ttggctgtgt ctctagggca gagggccacc 61 atatcctgca gagccagtga aagtgttgat agttatggca atagttttat gcactggtac 121 cagcagaaac caggacagcc acccaaactc ctcatctatc gtgcatccaa cctagaatct 181 gggatccctg ccaggttcag tggcagtggg tctaggacag acttcaccct caccattaat 241 cctgtggagg ctgatgatgt tgcaagctat tactgtcagc aaagtaatga ggaacctccc 301 acgttcggag gg // LOCUS MUSIGKABL 312 bp ss-mRNA ROD 10-JUL-1990 DEFINITION Mouse Ig kappa-chain mRNA V region, partial cds. ACCESSION M34530 KEYWORDS immunoglobulin light-chain; kappa-immunoglobulin; processed gene; variable region. SOURCE Mouse (strain Balb/c), cDNA to mRNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 312) AUTHORS Ewulonu,U.K., Nell,L.J. and Thomas,J.W. TITLE V-H and V-L gene usage by murine IgG antibodies that bind autologous insulin JOURNAL J. Immunol. 
144, 3091-3098 (1990) STANDARD simple staff_review FEATURES from to/span description pept < 1 > 312 Ig kappa-chain V-region (AA at 1) BASE COUNT 71 a 96 c 69 g 76 t ORIGIN 1 caaattgttc tcacccagtc tccaacaatc atgtctgcat ctctagggga acgggtcacc 61 atgacctgca ctgccagctc aagtgtaagt tccagttact tgcactggta ccagcagaag 121 ccaggatcct cccccaaact ctggatttat agtacatcca acctggcttc tggagtccca 181 gctcgcttca gtggcagtgg gtctgggacc tcttactctc tcacaatcag cagcatggag 241 gctgaagatg ctgccactta ttactgccag cagtatcatc gttccccacc cacgttcggt 301 gctgggacca ag // LOCUS HUMINSR01 2085 bp ds-DNA PRI 10-JUL-1990 DEFINITION Human insulin receptor (hINSR) gene, exon 1. ACCESSION M23100 M32822 KEYWORDS Alu repetitive sequence; insulin receptor. SEGMENT 1 of 22 SOURCE Human fetal liver DNA, clone lambda-hINSR-[1-13]. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 1933) AUTHORS Seino,S., Seino,M., Nishi,S. and Bell,G.I. TITLE Structure of the human insulin receptor gene and characterization of its promoter JOURNAL Proc. Natl. Acad. Sci. U.S.A. 86, 114-118 (1989) STANDARD simple staff_entry REFERENCE 2 (bases 1 to 2085) AUTHORS Seino,S., Seino,M. and Bell,G.I. TITLE Human insulin-receptor gene JOURNAL Diabetes 39, 123-128 (1990) STANDARD simple staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by G.Bell, 14-MAR-1990. FEATURES from to/span description pept 1824 + 1923 human insulin receptor precursor, exon 1 /hgml_locus_uid="LG0007M" /nomgen="INSR" /map="19p13.3-p13.2" sigp 1824 1904 human insulin receptor signal peptide matp 1905 + 1923 human insulin receptor pre-msg 1541 > 2085 hINSR mRNA and introns (alt.) pre-msg 1542 > 2085 hINSR mRNA and introns (alt.) pre-msg 1548 > 2085 hINSR mRNA and introns (alt.) IVS 1924 > 2085 hINSR intron A rpt < 1 76 Alu repeat BASE COUNT 417 a 631 c 702 g 335 t ORIGIN Chromosome 19p13.3-13.2. 
1 agatctggcc attgcactcc agcctgggca acagagaaaa actccatcta aaaaaaaaaa 61 aaaaaaaaaa aaaaaacaga gagagagaga gagagagaga gaaggaaacg gaactggggg 121 gaggatttgc aaaaatatgg ttagggatgg cacttcagag atgaagccat cctggagtgt 181 tacgggcaag ggaaatgctg gggcaaagcc ccagaggcag gaataggttt ggcctgttgc 241 atgaacagtg ggtccagctc ctagcaaact gtttattgaa tgaaagaaga atgaatgcct 301 tgggtctagg gttgtgctgg gcgctttctt aagttttctt tcccgggtac ctccccagaa 361 ctggcatgca ggtattatta aacccattac acaagtgaaa ctggcccaga gacagaaaag 421 tccctggtcc aagaccacac aggagtgagg ggtggaggaa ccctcctccc attgagttct 481 ggctttccta tactgaaagc cccttcctct cctgcagtaa ggtaggtgga accgctgtcc 541 cgccttgttg gtgaatgtcg ttgctagact tcagacacat acaggctggt ctgctgaaaa 601 tcagagatgt ccacctgcgc cctattcgag gtctccggcg tcttctttgg cgtcgtcttt 661 gccctttcag aagcgtctgc acatttttcc aggtgtcatt tctccaactt gaacacaggg 721 agcgcactgg gcacgcgggc acgtggctgt ccccaggggc ctggcttggg tctcgcccct 781 gggccggggc gcacgcgcgg gcgggacatc tgggggcgcc cacgcgctct gggacgagtg 841 tcgctggcca ggcccggact gaggaaaggc gagtgagaca ctactcgcct ggggtgcaaa 901 atttaaggga gtgaaaaaaa aaaaaaaaga aagaaaccaa aaccacctcg agtcaccaaa 961 ataaacattt taatgcagta ttttttaaaa aatcaacagg aatcctccaa agcccactat 1021 gaacaaaata gcaaaatggt agagaaagga tctgtgccgc tgcgtcgggc ctgtggggcg 1081 cctccggggg tctgaaactg gaggagactc ggggctgtag ggcgcgcgga tctggggcgc 1141 gccctcggtc ccggcgcgcc cagggcctcc cgcgcggggc ccggcacagg gaggcgggga 1201 ggcgggcggg gcggggcggg accgggcggc acctccctcc cctgcaagct ttccctccct 1261 ctcctgggcc tctcccgggc gcagagtccc ttcctaggcc agatccgcgc cgccttttcc 1321 cgcggcccgc acggggccca gctgacgggc cgcgttgttt acgggccgga gcagccctct 1381 ctcccgccgc ccgcccgcca cccgccagcc caggtgcccg cccgccagtc agctagtccg 1441 tcggtccgcg cgtccctctg tcccggagcc cgcagatcgc gacccagagc gcgcggggcc 1501 gagagccgag agacagtccc gggcgcagcg cggagctccg ggccccgaga tcctgggacg 1561 gggcccgggc cgcagcggcc ggggggtcgg ggccaccacc gcaagggcct ccgctcagta 1621 tttgtagctg gcgaagccgc gcgcgccctt cccggggctg cctctgggcc ctccccggca 1681 ggggggctgc ggcccgcggg 
tcgcgggcgt ggaagagaag gacgcgcggc ccccagcgcc 1741 tcttgggtgg ccgcctcgga gcatgacccc cgcgggccag cgccgcgcgc tctgatccga 1801 ggagaccccg cgctcccgca gccatgggca ccgggggccg gcggggagcg gcggccgcgc 1861 cgctgctggt ggcggtggcc gcgctgctac tgggcgccgc gggccacctg taccccggag 1921 agggtgagtc tgggggcgcg ggcgtgggcg gggagcgccg cgatggggag aggaccccac 1981 ccaagccaaa atcgatcccc cgcttgtgga ctgagaaccc tccccagggg cggggggcgg 2041 tggccaggac ggtagctcct gcatcgcgta gggggagcgg gaagc // LOCUS HUMINSR02 928 bp ds-DNA PRI 10-JUL-1990 DEFINITION Human insulin receptor (hINSR) gene, exon 2. ACCESSION M32823 KEYWORDS insulin receptor. SEGMENT 2 of 22 SOURCE Human fetal liver DNA, clone lambda-hINSR-[1-13]. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 928) AUTHORS Seino,S., Seino,M. and Bell,G.I. TITLE Human insulin-receptor gene: Partial sequence and amplification of exons by polymerase chain reaction JOURNAL Diabetes 39, 123-128 (1990) STANDARD simple staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by G.Bell, 14-MAR-1990. FEATURES from to/span description pept + 174 + 725 human insulin receptor precursor, exon 2 /hgml_locus_uid="LG0007M" /nomgen="INSR" /map="19p13.3-p13.2" matp + 174 + 725 human insulin receptor pre-msg < 1 > 927 hINSR mRNA and introns IVS < 1 173 hINSR intron A IVS 726 > 927 hINSR intron B BASE COUNT 218 a 234 c 237 g 239 t ORIGIN About 25.0 kbp downstream of segment 1. 
1 tactttacag agaaagctac tcatcccggc tggctgcaga gtttacaggg cccgggatga 61 aaacacaggg cccaggtttc ctgtccatga agccggctct gcccctgatc cttctgatgc 121 atccaccgtg cgtctgctca cctgtcttgc tttctgttca ttttctcttg tagtgtgtcc 181 cggcatggat atccggaaca acctcactag gttgcatgag ctggagaatt gctctgtcat 241 cgaaggacac ttgcagatac tcttgatgtt caaaacgagg cccgaagatt tccgagacct 301 cagtttcccc aaactcatca tgatcactga ttacttgctg ctcttccggg tctatgggct 361 cgagagcctg aaggacctgt tccccaacct cacggtcatc cggggatcac gactgttctt 421 taactacgcg ctggtcatct tcgagatggt tcacctcaag gaactcggcc tctacaacct 481 gatgaacatc acccggggtt ctgtccgcat cgagaagaac aatgagctct gttacttggc 541 cactatcgac tggtcccgta tcctggattc cgtggaggat aattacatcg tgttgaacaa 601 agatgacaac gaggagtgtg gagacatctg tccgggtacc gcgaagggca agaccaactg 661 ccccgccacc gtcatcaacg ggcagtttgt cgaacgatgt tggactcata gtcactgcca 721 gaaaggtacg ccggggatac agggttctaa gcagtgtctc gtgccttgtt ctagaaagct 781 taaaatgttt tatggcttaa aaatgttaaa tggtcattag gtaggggccg gggaatagtg 841 ggtggtggca ttcactagcc cagggagtgg cagacatttt ctgtaaagac tcagatagta 901 gatacttcag attttgcagg ccatatgg // LOCUS HUMINSR03 639 bp ds-DNA PRI 10-JUL-1990 DEFINITION Human insulin receptor (hINSR) gene, exon 3. ACCESSION M32824 KEYWORDS insulin receptor. SEGMENT 3 of 22 SOURCE Human fetal liver DNA, clone lambda-hINSR-[1-13]. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 639) AUTHORS Seino,S., Seino,M. and Bell,G.I. TITLE Human insulin-receptor gene: Partial sequence and amplification of exons by polymerase chain reaction JOURNAL Diabetes 39, 123-128 (1990) STANDARD simple staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by G.I.Bell, 14-MAR-1990. 
FEATURES from to/span description pept + 114 + 435 human insulin receptor precursor, exon 3 /hgml_locus_uid="LG0007M" /nomgen="INSR" /map="19p13.3-p13.2" matp + 114 + 435 human insulin receptor pre-msg < 1 > 639 hINSR mRNA and introns IVS < 1 113 hINSR intron B IVS 436 > 639 hINSR intron C BASE COUNT 134 a 171 c 163 g 171 t ORIGIN About 25.0 kbp downstream of segment 2. 1 gatccagaat tgctgcatat gcagacagga attggacaaa gccatttatt tatttattta 61 tttatttatt tatttattta tttatttccc tctctctctc tctctctctc cagtttgccc 121 gaccatctgt aagtcacacg gctgcaccgc cgaaggcctc tgttgccaca gcgagtgcct 181 gggcaactgt tctcagcccg acgaccccac caagtgcgtg gcctgccgca acttctacct 241 ggacggcagg tgtgtggaga cctgcccgcc cccgtactac cacttccagg actggcgctg 301 tgtgaacttc agcttctgcc aggacctgca ccacaaatgc aagaactcgc ggaggcaggg 361 ctgccaccaa tacgtcattc acaacaacaa gtgcatccct gagtgtccct ccgggtacac 421 gatgaattcc agcaagtgag ttctggatgt gggtctgggg ggcagccgag aggagaagga 481 acgtggggtt ggttgtgacg atgccgcttg ttaaaactgt gtgcaaaccc agggttaatt 541 ggctatgagt gaggtctctg ctctcagatg ctacttttgc accctgtttt ggtcctgggc 601 ttgggagtgg gagttgacta cctttttctc taaaggacc // LOCUS HUMINSR04 663 bp ds-DNA PRI 10-JUL-1990 DEFINITION Human insulin receptor (hINSR) gene, exon 4. ACCESSION M32825 KEYWORDS insulin receptor. SEGMENT 4 of 22 SOURCE Human fetal liver DNA, clone lambda-hINSR-[1-13]. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 663) AUTHORS Seino,S., Seino,M. and Bell,G.I. TITLE Human insulin-receptor gene JOURNAL Diabetes 39, 123-128 (1990) STANDARD simple staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by G.Bell, 14-MAR-1990. 
FEATURES from to/span description pept + 318 + 466 human insulin receptor precursor, exon 4 /hgml_locus_uid="LG0007M" /nomgen="INSR" /map="19p13.3-p13.2" matp + 318 + 466 human insulin receptor pre-msg < 1 > 663 hINSR mRNA and introns IVS < 1 317 hINSR intron C IVS 467 > 663 hINSR intron D BASE COUNT 159 a 195 c 171 g 138 t ORIGIN About 15.0 kbp downstream of segment 3. 1 ccaacatggt aaccccgtct ctactcaaaa atacaaaaat tagccaggca cggtggcggg 61 cacctataat cccagctact gtggaggctg aggcaggaga atctcttgaa cccagaaggc 121 agaggttgca gtgagctgag atcgcaccac tgcactccag cctgggcaac agagcgagac 181 tctgtcacac aaacacacac acacacacaa agaaatacca tatcaggcag aaagatgcct 241 gagatgtctg aaggaccttg gataccgtga cacccccctc ccctttctct ttctctctct 301 ctctgctccg tccttagctt gctgtgcacc ccatgcctgg gtccctgtcc caaggtgtgc 361 cacctcctag aaggcgagaa gaccatcgac tcggtgacgt ctgcccagga gctccgagga 421 tgcaccgtca tcaacgggag tctgatcatc aacattcgag gaggcagtga gtgtctctgt 481 gtgggcgtcg ggggtgcctg ttgggctcca tgtccctctg agctgtgagc ggggaagaaa 541 agcagtgcag accctgctgc gtgctcctac agcactttta ggatggtcgt tcagtggctc 601 ccccatggat agaaccatgc tgggagtctg cctcaaaacc tgaaatgaac agctcagtct 661 tcc // LOCUS HUMINSR05 410 bp ds-DNA PRI 10-JUL-1990 DEFINITION Human insulin receptor (hINSR) gene, exon 5. ACCESSION M32826 KEYWORDS insulin receptor. SEGMENT 5 of 22 SOURCE Human fetal liver DNA, clone lambda-hINSR-[1-13]. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 410) AUTHORS Seino,S., Seino,M. and Bell,G.I. TITLE Human insulin-receptor gene: Partial sequence and amplification of exons by polymerase chain reaction JOURNAL Diabetes 39, 123-128 (1990) STANDARD simple staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by G.I.Bell, 14-MAR-1990. 
FEATURES from to/span description pept + 188 + 332 human insulin receptor precursor, exon 5 /hgml_locus_uid="LG0007M" /nomgen="INSR" /map="19p13.3-p13.2" matp + 188 + 332 human insulin receptor pre-msg < 1 > 410 hINSR mRNA and introns IVS < 1 187 hINSR intron D IVS 333 > 410 hINSR intron E BASE COUNT 105 a 80 c 100 g 125 t ORIGIN About 3.0 kbp downstream of segment 4. 1 gggcagaagt atgcttgacc catttaagga atgctaagga cttcagattg tgttctaagc 61 atgatgagtt ttgagctggg tatgtccagt catttgcagc ctgagggtta tcttctcacc 121 atggagaatc atgagaagat tgaaatatgt ctatagaaac ccactggata ttctctcctt 181 tccttagaca atctggcagc tgagctagaa gccaacctcg gcctcattga agaaatttca 241 gggtatctaa aaatccgccg atcctacgct ctggtgtcac tttccttctt ccggaagtta 301 cgtctgattc gaggagagac cttggaaatt gggtacgtgg gcctgattgt gtgtatggcc 361 tgagtgctaa ctaggaagtt cgtgtattag aacaacttaa ggattttttt // LOCUS HUMINSR06 554 bp ds-DNA PRI 10-JUL-1990 DEFINITION Human insulin receptor (hINSR) gene, exon 6. ACCESSION M32827 KEYWORDS insulin receptor. SEGMENT 6 of 22 SOURCE Human fetal liver DNA, clone lambda-hINSR-[1-13]. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 554) AUTHORS Seino,S., Seino,M. and Bell,G.I. TITLE Human insulin-receptor gene: Partial sequence and amplification of exons by polymerase chain reaction JOURNAL Diabetes 39, 123-128 (1990) STANDARD simple staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by G.I.Bell, 14-MAR-1990. FEATURES from to/span description pept + 189 + 403 human insulin receptor precursor, exon 6 /hgml_locus_uid="LG0007M" /nomgen="INSR" /map="19p13.3-p13.2" matp + 189 + 403 human insulin receptor pre-msg < 1 > 554 hINSR mRNA and introns IVS < 1 188 hINSR intron E IVS 404 > 554 hINSR intron F BASE COUNT 154 a 129 c 130 g 141 t ORIGIN About 1.0 kbp downstream of segment 5. 
1 ggccatgaaa acttcctcaa cttcctctgt tatccacatt caacaaatat gtgttgagta 61 tgtgccaagc aagtggagag gattaggcac gtagcactga acaagatcaa ctccgagcat 121 ggccacacca tcttggagtt gtagaagacc agccgttgaa tgactagatg tgtgtgtttt 181 ttccatagga actactcctt ctatgccttg gacaaccaga acctaaggca gctctgggac 241 tggagcaaac acaacctcac catcactcag gggaaactct tcttccacta taaccccaaa 301 ctctgcttgt cagaaatcca caagatggaa gaagtttcag gaaccaaggg gcgccaggag 361 agaaacgaca ttgccctgaa gaccaatggg gaccaggcat cctgtaagtc actggtcccc 421 aacctttttg gcacgaggga ccggtttagt ggaagatggt ttttccatgg actggtggtg 481 ggtggggatg gtttcagcat gattcaagtg cattacattt actatgcact ttattcctat 541 tatgattaca ttgt // LOCUS HUMINSR07 592 bp ds-DNA PRI 10-JUL-1990 DEFINITION Human insulin receptor (hINSR) gene, exon 7. ACCESSION M32828 KEYWORDS insulin receptor. SEGMENT 7 of 22 SOURCE Human fetal liver DNA, clone lambda-hINSR-[1-13]. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 592) AUTHORS Seino,S., Seino,M. and Bell,G.I. TITLE Human insulin-receptor gene: Partial sequence and amplification of exons by polymerase chain reaction JOURNAL Diabetes 39, 123-128 (1990) STANDARD simple staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by G.I.Bell, 14-MAR-1990. FEATURES from to/span description pept + 277 + 403 human insulin receptor precursor, exon 7 /hgml_locus_uid="LG0007M" /nomgen="INSR" /map="19p13.3-p13.2" matp + 277 + 403 human insulin receptor pre-msg < 1 > 592 hINSR mRNA and introns IVS < 1 276 hINSR intron F IVS 404 > 592 hINSR intron G BASE COUNT 125 a 144 c 144 g 179 t ORIGIN About 1.0 kbp downstream of segment 6. 
1 ttgcgcgggt acagactgcg cttattcagt tgactgtctg gctgagtcaa gtcattggct 61 tacgtgagtg tgagtggcca agttgcaaaa ctggctctta cctttgaatc ttcccccatt 121 catactcagc caggcacatg gggaggagac ccttaaggga atagcagcat cacctctgcc 181 ttctcacggt ccctccagga agtgtggggg tcccaggctt tggtctgaaa ctacactgaa 241 atagctcatt tttgcctttt gttttaactt ttccaggtga aaatgagtta cttaaatttt 301 cttacattcg gacatctttt gacaagatct tgctgagatg ggagccgtac tggccccccg 361 acttccgaga cctcttgggg ttcatgctgt tctacaaaga ggcgtaagta gaagagttag 421 agagacgctg aggaggcgag ggctggctgg ctctgtgctt gctacgtttg tgctccaatc 481 tgcccctctt gggttcctgt ctatctccct cctcctcctg gaataaatat cttaggttcc 541 tttttacaat ctcaccagtc gatggcatgc aaagtcaata gtgtctgctt tt // LOCUS HUMINSR08 401 bp ds-DNA PRI 10-JUL-1990 DEFINITION Human insulin receptor (hINSR) gene, exon 8. ACCESSION M32829 KEYWORDS insulin receptor. SEGMENT 8 of 22 SOURCE Human fetal liver DNA, clone lambda-hINSR-[1-13]. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 401) AUTHORS Seino,S., Seino,M. and Bell,G.I. TITLE Human insulin-receptor gene: Partial sequence and amplification of exons by polymerase chain reaction JOURNAL Diabetes 39, 123-128 (1990) STANDARD simple staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by G.I.Bell, 14-MAR-1990. FEATURES from to/span description pept + 124 + 374 human insulin receptor precursor, exon 8 /hgml_locus_uid="LG0007M" /nomgen="INSR" /map="19p13.3-p13.2" matp + 124 + 374 human insulin receptor pre-msg < 1 > 401 hINSR mRNA and introns IVS < 1 123 hINSR intron G IVS 375 > 401 hINSR intron H BASE COUNT 90 a 98 c 112 g 101 t ORIGIN About 3.0 kbp downstream of segment 7. 
1 cattagattg ttgggtgagt aacatgtgac cctatgggat gtaacttccc aggcctcatc 61 tgcacggcac tcagtgtgac ggtcttgtaa gggtaactgc cttctgctgt tttgtcttga 121 aagcccttat cagaatgtga cggagttcga tgggcaggat gcgtgtggtt ccaacagttg 181 gacggtggta gacattgacc cacccctgag gtccaacgac cccaaatcac agaaccaccc 241 agggtggctg atgcggggtc tcaagccctg gacccagtat gccatctttg tgaagaccct 301 ggtcaccttt tcggatgaac gccggaccta tggggccaag agtgacatca tttatgtcca 361 gacagatgcc accagtgagt gtgtcttggg aatgtgaatt c // LOCUS HUMINSR09 420 bp ds-DNA PRI 10-JUL-1990 DEFINITION Human insulin receptor (hINSR) gene, exon 9. ACCESSION M32830 KEYWORDS insulin receptor. SEGMENT 9 of 22 SOURCE Human fetal liver DNA, clone lambda-hINSR-[1-13]. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 420) AUTHORS Seino,S., Seino,M. and Bell,G.I. TITLE Human insulin-receptor gene: Partial sequence and amplification of exons by polymerase chain reaction JOURNAL Diabetes 39, 123-128 (1990) STANDARD simple staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by G.I.Bell, 14-MAR-1990. FEATURES from to/span description pept + 106 + 273 human insulin receptor precursor, exon 9 /hgml_locus_uid="LG0007M" /nomgen="INSR" /map="19p13.3-p13.2" matp + 106 + 273 human insulin receptor pre-msg < 1 > 420 hINSR mRNA and introns IVS < 1 105 hINSR intron H IVS 274 > 420 hINSR intron I BASE COUNT 85 a 125 c 94 g 116 t ORIGIN About 3.0 kbp downstream of segment 8. 
1 ggtgccctca tgatgtcttt aacttgtgtg tcccccgcca tcctcccacc agctttcttt 61 gcacactgtt tctcatgatg gacccgtttc ctttctccct ggcagacccc tctgtgcccc 121 tggatccaat ctcagtgtct aactcatcat cccagattat tctgaagtgg aaaccaccct 181 ccgaccccaa tggcaacatc acccactacc tggttttctg ggagaggcag gcggaagaca 241 gtgagctgtt cgagctggat tattgcctca aaggtgagtg caggcagctg tgctaggatc 301 ggtggggttt gcacacgtgt gtctgatgca ctttgcttca cctctaggga agcagctatc 361 tcttcctgtg tctcagtgtc ggaaggcaca cacacacact ccattctatc tcatatgaaa // LOCUS HUMINSR10 517 bp ds-DNA PRI 10-JUL-1990 DEFINITION Human insulin receptor (hINSR) gene, exon 10. ACCESSION M32831 KEYWORDS insulin receptor. SEGMENT 10 of 22 SOURCE Human fetal liver DNA, clone lambda-hINSR-[1-13]. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 517) AUTHORS Seino,S., Seino,M. and Bell,G.I. TITLE Human insulin-receptor gene JOURNAL Diabetes 39, 123-128 (1990) STANDARD simple staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by G.Bell, 14-MAR-1990. FEATURES from to/span description pept + 187 + 388 human insulin receptor precursor, exon 10 /hgml_locus_uid="LG0007M" /nomgen="INSR" /map="19p13.3-p13.2" matp + 187 + 388 human insulin receptor pre-msg < 1 > 517 hINSR mRNA and introns IVS < 1 186 hINSR intron I IVS 389 > 517 hINSR intron J BASE COUNT 83 a 88 c 194 g 152 t ORIGIN About 11.0 kbp downstream of segment 9. 
1 tttgtggtgt gtgtatgtgt ggtgtgttgt gtgatgtgtg tggtgtgtgt gtgggggggt 61 gtgtggtgtg tgtatgtgtg gtgtgtgtgg tgtgtgtgtg tggtgtgtgt gtgtgggggg 121 ggtgtgtgtg tgtatgtgtg ttcagccgca gagacttgag cccccctttt ctgtttcttt 181 ctccagggct gaagctgccc tcgaggacct ggtctccacc attcgagtct gaagattctc 241 agaagcacaa ccagagtgag tatgaggatt cggccggcga atgctgctcc tgtccaaaga 301 cagactctca gatcctgaag gagctggagg agtcctcgtt taggaagacg tttgaggatt 361 acctgcacaa cgtggttttc gtccccaggt caggacttgg cgctgggctc tcttagtggg 421 tgccaattgg cttggtgttg gtggaaggtc attacttagg gaccgagagg tagtgggagg 481 gagagacggc agaaccctgg gtggagtctg aatggag // LOCUS HUMINSR11 343 bp ds-DNA PRI 10-JUL-1990 DEFINITION Human insulin receptor (hINSR) gene, exon 11. ACCESSION M32832 KEYWORDS insulin receptor. SEGMENT 11 of 22 SOURCE Human fetal liver DNA, clone lambda-hINSR-[1-13]. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 343) AUTHORS Seino,S., Seino,M. and Bell,G.I. TITLE Human insulin-receptor gene: Partial sequence and amplification of exons by polymerase chain reaction JOURNAL Diabetes 39, 123-128 (1990) STANDARD simple staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by G.I.Bell, 14-MAR-1990. FEATURES from to/span description pept + 123 + 158 human insulin receptor precursor, exon 11 /hgml_locus_uid="LG0007M" /nomgen="INSR" /map="19p13.3-p13.2" matp + 123 + 158 human insulin receptor pre-msg < 1 > 343 hINSR mRNA and introns IVS < 1 122 hINSR intron J IVS 159 > 343 hINSR intron K BASE COUNT 68 a 97 c 98 g 80 t ORIGIN About 2.0 kbp downstream of segment 10. 
1 tggtccaggg tcaaagccag ggtgccctta ctcggacaca tgtggcctcc aagtgtcaga 61 gcccagtggt ctgtctaatg aagttccctc tgtcctcaaa ggcgttggtt ttgtttccac 121 agaaaaacct cttcaggcac tggtgccgag gaccctaggt atgactcacc tgtgcgaccc 181 ctggtgcctg ctccgcgcag ggccggcggc gtgccaggca gatgcctcgg agaacccagg 241 ggtttctctg gctttttgca tgcggcgggc agctgtgctg gagagcagat gcttcaccaa 301 ttcagaaatc caatgccttc actctgaaat gaaatctggg cat // LOCUS HUMINSR12 719 bp ds-DNA PRI 10-JUL-1990 DEFINITION Human insulin receptor (hINSR) gene, exon 12. ACCESSION M32833 KEYWORDS insulin receptor. SEGMENT 12 of 22 SOURCE Human fetal liver DNA, clone lambda-hINSR-[1-13]. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 719) AUTHORS Seino,S., Seino,M. and Bell,G.I. TITLE Human insulin-receptor gene: Partial sequence and amplification of exons by polymerase chain reaction JOURNAL Diabetes 39, 123-128 (1990) STANDARD simple staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by G.I.Bell, 14-MAR-1990. FEATURES from to/span description pept + 161 + 435 human insulin receptor precursor, exon 12 /hgml_locus_uid="LG0007M" /nomgen="INSR" /map="19p13.3-p13.2" matp + 161 + 435 human insulin receptor pre-msg < 1 > 719 hINSR mRNA and introns IVS < 1 160 hINSR intron K IVS 436 > 719 hINSR intron L BASE COUNT 137 a 198 c 195 g 189 t ORIGIN About 8.0 kbp downstream of segment 11. 
1 ggtcattcct ggcagtctgt attgtaatcc atgttcccca ttgctgcacc ctcctgcgct 61 ctgatctttc ttcttaatca agccttttat tctccagtgt cactttttta aaaaaaatga 121 tggtgatggt gtcatcatac atgtcctact gtcgttccag gccatctcgg aaacgcaggt 181 cccttggcga tgttgggaat gtgacggtgg ccgtgcccac ggtggcagct ttccccaaca 241 cttcctcgac cagcgtgccc acgagtccgg aggagcacag gccttttgag aaggtggtga 301 acaaggagtc gctggtcatc tccggcttgc gacacttcac gggctatcgc atcgagctgc 361 aggcttgcaa ccaggacacc cctgaggaac ggtgcagtgt ggcagcctac gtcagtgcga 421 ggaccatgcc tgaaggtagg gctgctggtc cggggtccga gtgtcatggg tgggacatca 481 aggctgactt tttgtttgag acggagcctt gctctgtcgc ccaggctgga gtacagtggt 541 gcgacctcag ctcactccag cctctgccac ctatgtcaag tgattccctg cttcagcctc 601 ccaagtagct gggactacag gtgtctgcca ccacgcccag ctaatttttg tatttttagt 661 agagatgggg tttcaccata ttgcccaggc tggtcttgaa ctcctgggct caagtgatc // LOCUS HUMINSR13 439 bp ds-DNA PRI 10-JUL-1990 DEFINITION Human insulin receptor (hINSR) gene, exon 13. ACCESSION M32834 KEYWORDS insulin receptor. SEGMENT 13 of 22 SOURCE Human fetal liver DNA, clone lambda-hINSR-[1-13]. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 439) AUTHORS Seino,S., Seino,M. and Bell,G.I. TITLE Human insulin-receptor gene: Partial sequence and amplification of exons by polymerase chain reaction JOURNAL Diabetes 39, 123-128 (1990) STANDARD simple staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by G.I.Bell, 14-MAR-1990. FEATURES from to/span description pept + 93 + 232 human insulin receptor precursor, exon 13 /hgml_locus_uid="LG0007M" /nomgen="INSR" /map="19p13.3-p13.2" matp + 93 + 232 human insulin receptor pre-msg < 1 > 439 hINSR mRNA and introns IVS < 1 92 hINSR intron L IVS 233 > 439 hINSR intron M BASE COUNT 98 a 114 c 105 g 122 t ORIGIN About 1.0 kbp downstream of segment 12. 
1 gtcaccagcc caaggttgca ccatggacag gtggcagaag tgggatctca tccaagagtt 61 acatccctgc ctctcacttc ctctccttac agccaaggct gatgacattg ttggccctgt 121 gacgcatgaa atctttgaga acaacgtcgt ccacttgatg tggcaggagc cgaaggagcc 181 caatggtctg atcgtgctgt atgaagtgag ttatcggcga tatggtgatg aggtaaggcc 241 cttgactctt gggcatgccc ctgcaccact tcagcatgcc ccttcagagt tgcacttggt 301 acctccttcc tctgctgaaa ttttgattcc agtgcttctc tcatcaggta ctgtgctatt 361 agtacttaaa gccttgatac ctgacttcgc aggaagatgg gtcagaaatg ccaatctacc 421 agcttgttac ttttcttag // LOCUS HUMINSR14 386 bp ds-DNA PRI 10-JUL-1990 DEFINITION Human insulin receptor (hINSR) gene, exon 14. ACCESSION M32835 KEYWORDS insulin receptor. SEGMENT 14 of 22 SOURCE Human fetal liver DNA, clone lambda-hINSR-[1-13]. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 386) AUTHORS Seino,S., Seino,M. and Bell,G.I. TITLE Human insulin-receptor gene: Partial sequence and amplification of exons by polymerase chain reaction JOURNAL Diabetes 39, 123-128 (1990) STANDARD simple staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by G.I.Bell, 14-MAR-1990. FEATURES from to/span description pept + 85 + 244 human insulin receptor precursor, exon 14 /hgml_locus_uid="LG0007M" /nomgen="INSR" /map="19p13.3-p13.2" matp + 85 + 244 human insulin receptor pre-msg < 1 > 386 hINSR mRNA and introns IVS < 1 84 hINSR intron M IVS 245 > 386 hINSR intron N BASE COUNT 62 a 123 c 115 g 86 t ORIGIN About 6.0 kbp downstream of segment 13. 
1 tggctgtgag ctccctgcga ggggtggaca ctcccagatg tgcaaagctc agccaccctc 61 cttctcctcc tctcttcctc ccaggagctg catctctgcg tctcccgcaa gcacttcgct 121 ctggaacggg gctgcaggct gcgtgggctg tcaccgggga actacagcgt gcgaatccgg 181 gccacctccc ttgcgggcaa cggctcttgg acggaaccca cctatttcta cgtgacagac 241 tattgtaagt ctccatggca gcctcagctg actggggctg tgcttagcac tgagcatggt 301 gggacattgc aggggatgac ttggagaggc cgcagtgctg gccctggcct tgactctcag 361 gcctatcagc tgctgcggtg cttgcc // LOCUS HUMINSR15 429 bp ds-DNA PRI 10-JUL-1990 DEFINITION Human insulin receptor (hINSR) gene, exon 15. ACCESSION M32836 KEYWORDS insulin receptor. SEGMENT 15 of 22 SOURCE Human fetal liver DNA, clone lambda-hINSR-[1-13]. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 429) AUTHORS Seino,S., Seino,M. and Bell,G.I. TITLE Human insulin-receptor gene JOURNAL Diabetes 39, 123-128 (1990) STANDARD simple staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by G.Bell, 14-MAR-1990. FEATURES from to/span description pept + 92 + 194 human insulin receptor precursor, exon 15 /hgml_locus_uid="LG0007M" /nomgen="INSR" /map="19p13.3-p13.2" matp + 92 + 194 human insulin receptor pre-msg < 1 > 428 hINSR mRNA and introns IVS < 1 91 hINSR intron N IVS 195 > 428 hINSR intron O BASE COUNT 117 a 67 c 82 g 163 t ORIGIN About 3.0 kbp downstream of segment 14. 
1 cccacccatt ccaggagtgg atgtgatttt tgatgtgaac tttgttggaa acacattgat 61 atgaaacata tattttctta ttctatttca gtagacgtcc cgtcaaatat tgcaaaaatt 121 atcatcggcc ccctcatctt tgtctttctc ttcagtgttg tgattggaag tatttatcta 181 ttcctgagaa agaggtgagt tcagtgagtt cagtggtgtg ctgggaacag ttggttctct 241 gggggaaaac atgccttgat ataggtatag gcatatttaa gtttattatg aattttgctg 301 atataggatg tgtaacatgc aatttacaga taattgtcat aatatgatat acacaactct 361 ttattgtaaa ttccctctag acagttgatt ctcacagaat gtttttattg attttttttt 421 ttgcccaaa // LOCUS HUMINSR16 480 bp ds-DNA PRI 10-JUL-1990 DEFINITION Human insulin receptor (hINSR) gene, exon 16. ACCESSION M32837 KEYWORDS insulin receptor. SEGMENT 16 of 22 SOURCE Human fetal liver DNA, clone lambda-hINSR-[1-13]. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 480) AUTHORS Seino,S., Seino,M. and Bell,G.I. TITLE Human insulin-receptor gene JOURNAL Diabetes 39, 123-128 (1990) STANDARD simple staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by G.Bell, 14-MAR-1990. FEATURES from to/span description pept + 261 + 328 human insulin receptor precursor, exon 16 /hgml_locus_uid="LG0007M" /nomgen="INSR" /map="19p13.3-p13.2" matp + 261 + 328 human insulin receptor pre-msg < 1 > 480 hINSR mRNA and introns IVS < 1 260 hINSR intron O IVS 329 > 480 hINSR intron P BASE COUNT 123 a 131 c 109 g 117 t ORIGIN About 2.0 kbp downstream of segment 15. 
1 aaaaacaaaa acaaaaacaa aacaaaaaaa aaaccaccca gggagggatg agtgctccca 61 tgttgatgca cttacatacc tgtctgatgg gcttccattc aaaacataaa ggtcccccat 121 ccctgcccta gactgcatct aggattatgg ggattctgct ggtaagggct gccatttgcc 181 ttggggagtc ttgtatgaaa cacctttctg cagagtccca tgagaatctc aagctaacgt 241 gcctcgtttt cctcctccag gcagccagat gggccgctgg gaccgcttta cgcttcttca 301 aaccctgagt atctcagtgc cagtgatggt gagtaccatc ccttccctgt gggtggccag 361 aaccctactc atcagcttcc tttgccttca ccattgagtg agagtgaagg atgggttccc 421 cagggaggcc aagaaaagcc ctcttattca tttgagcttg ccaaactgcc cttgctgcag // LOCUS HUMINSR17 485 bp ds-DNA PRI 10-JUL-1990 DEFINITION Human insulin receptor (hINSR) gene, exon 17. ACCESSION M32838 KEYWORDS insulin receptor. SEGMENT 17 of 22 SOURCE Human fetal liver DNA, clone lambda-hINSR-[1-13]. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 485) AUTHORS Seino,S., Seino,M. and Bell,G.I. TITLE Human insulin-receptor gene: Partial sequence and amplification of exons by polymerase chain reaction JOURNAL Diabetes 39, 123-128 (1990) STANDARD simple staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by G.I.Bell, 14-MAR-1990. FEATURES from to/span description pept + 136 + 380 human insulin receptor precursor, exon 17 /hgml_locus_uid="LG0007M" /nomgen="INSR" /map="19p13.3-p13.2" matp + 136 + 380 human insulin receptor pre-msg < 1 > 485 hINSR mRNA and introns IVS < 1 135 hINSR intron P IVS 381 > 485 hINSR intron Q BASE COUNT 96 a 119 c 162 g 108 t ORIGIN About 1.0 kbp downstream of segment 16. 
1 cccggcatgg gtcctggatc acagaactca tttcatgagt gttttcgagg gggtttgggt 61 gagggcttgg gtggaaggtg gctgcagacc cccaagggat cctccaagga tgctgtgtag 121 ataagtaaga agtagtgttt ccatgctctg tgtacgtgcc ggacgagtgg gaggtgtctc 181 gagagaagat caccctcctt cgagagctgg ggcagggctc cttcggcatg gtgtatgagg 241 gcaatgccag ggacatcatc aagggtgagg cagagacccg cgtggcggtg aagacggtca 301 acgagtcagc cagtctccga gagcggattg agttcctcaa tgaggcctcg gtcatgaagg 361 gcttcacctg ccatcacgtg gtgagtccag tgggggtggg acatgggctg gctttcctga 421 cccttccctt tctctgcctc ctcctcctgc acagagcgac agaggacaca gggtgtatcc 481 tccta // LOCUS HUMINSR18 287 bp ds-DNA PRI 10-JUL-1990 DEFINITION Human insulin receptor (hINSR) gene, exon 18. ACCESSION M32839 KEYWORDS insulin receptor. SEGMENT 18 of 22 SOURCE Human fetal liver DNA, clone lambda-hINSR-[1-13]. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 287) AUTHORS Seino,S., Seino,M. and Bell,G.I. TITLE Human insulin-receptor gene: Partial sequence and amplification of exons by polymerase chain reaction JOURNAL Diabetes 39, 123-128 (1990) STANDARD simple staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by G.I.Bell, 14-MAR-1990. FEATURES from to/span description pept + 117 + 227 human insulin receptor precursor, exon 18 /hgml_locus_uid="LG0007M" /nomgen="INSR" /map="19p13.3-p13.2" matp + 117 + 227 human insulin receptor pre-msg < 1 > 287 hINSR mRNA and introns IVS < 1 116 hINSR intron Q IVS 228 > 287 hINSR intron R BASE COUNT 51 a 85 c 98 g 53 t ORIGIN About 2.0 kbp downstream of segment 17. 
1 acgctgcatc caggccacag ggtgctgtgt gtgacataga caccagggag ggaggagaac 61 cctggtgagt cgaatcacgg accctcctcc aagaaccctg gttgcttgct ctgcaggtgc 121 gcctcctggg agtggtgtcc aagggccagc ccacgctggt ggtgatggag ctgatggctc 181 acggagacct gaagagctac ctccgttctc tgcggccaga ggctgaggta agctgcttcg 241 ggggacccag cggggtactc ggtggagcac ccgctcctgg cctcctc // LOCUS HUMINSR19 322 bp ds-DNA PRI 10-JUL-1990 DEFINITION Human insulin receptor (hINSR) gene, exon 19. ACCESSION M32840 KEYWORDS insulin receptor. SEGMENT 19 of 22 SOURCE Human fetal liver DNA, clone lambda-hINSR-[1-13]. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 322) AUTHORS Seino,S., Seino,M. and Bell,G.I. TITLE Human insulin-receptor gene JOURNAL Diabetes 39, 123-128 (1990) STANDARD simple staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by G.Bell, 14-MAR-1990. FEATURES from to/span description pept + 45 + 204 human insulin receptor precursor, exon 19 /hgml_locus_uid="LG0007M" /nomgen="INSR" /map="19p13.3-p13.2" matp + 45 + 204 human insulin receptor pre-msg < 1 > 322 hINSR mRNA and introns IVS < 1 44 hINSR intron R IVS 205 > 322 hINSR intron S BASE COUNT 81 a 76 c 79 g 86 t ORIGIN About 0.5 kbp downstream of segment 18. 1 gatcccagtg ctgctgaaac accaaccccg tgtttctgtt ttagaataat cctggccgcc 61 ctccccctac ccttcaagag atgattcaga tggcggcaga gattgctgac gggatggcct 121 acctgaacgc caagaagttt gtgcatcggg acctggcagc gagaaactgc atggtcgccc 181 atgattttac tgtcaaaatt ggaggttcgt ctggctttct gctttgaaaa cataacgacc 241 caggccaggt ttgatttcag aaggaagttg tctataatga gccgttaagt cttttctgat 301 aatataaagg ggcaagtact tc // LOCUS HUMINSR20 288 bp ds-DNA PRI 10-JUL-1990 DEFINITION Human insulin receptor (hINSR) gene, exon 20. ACCESSION M32841 KEYWORDS insulin receptor. SEGMENT 20 of 22 SOURCE Human fetal liver DNA, clone lambda-hINSR-[1-13]. 
ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 288) AUTHORS Seino,S., Seino,M. and Bell,G.I. TITLE Human insulin-receptor gene: Partial sequence and amplification of exons by polymerase chain reaction JOURNAL Diabetes 39, 123-128 (1990) STANDARD simple staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by G.I.Bell, 14-MAR-1990. FEATURES from to/span description pept + 115 + 244 human insulin receptor precursor, exon 20 /hgml_locus_uid="LG0007M" /nomgen="INSR" /map="19p13.3-p13.2" matp + 115 + 244 human insulin receptor pre-msg < 1 > 288 hINSR mRNA and introns IVS < 1 114 hINSR intron S IVS 245 > 288 hINSR intron T BASE COUNT 61 a 55 c 102 g 70 t ORIGIN About 0.5 kbp downstream of segment 19. 1 gacgtgggcc aggtgaaccc ctcttagggc tctgtgagag gtggggcagt caaggtggca 61 gatgctagga ccaaggctga aggttaagag cgtgtgaacc ttttgtgttg tcagactttg 121 gaatgaccag agacatctat gaaacggatt actaccggaa agggggcaag ggtctgctcc 181 ctgtacggtg gatggcaccg gagtccctga aggatggggt cttcaccact tcttctgaca 241 tgtggtgagt tgtgtgtgga tgggtggatg gacgctgggc ttgaattc // LOCUS HUMINSR21 407 bp ds-DNA PRI 10-JUL-1990 DEFINITION Human insulin receptor (hINSR) gene, exon 21. ACCESSION M32842 KEYWORDS insulin receptor. SEGMENT 21 of 22 SOURCE Human fetal liver DNA, clone lambda-hINSR-[1-13]. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 407) AUTHORS Seino,S., Seino,M. and Bell,G.I. TITLE Human insulin-receptor gene: Partial sequence and amplification of exons by polymerase chain reaction JOURNAL Diabetes 39, 123-128 (1990) STANDARD simple staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by G.I.Bell, 14-MAR-1990. 
FEATURES from to/span description pept + 101 + 235 human insulin receptor precursor, exon 21 /hgml_locus_uid="LG0007M" /nomgen="INSR" /map="19p13.3-p13.2" matp + 101 + 235 human insulin receptor pre-msg < 1 > 407 hINSR mRNA and introns IVS < 1 100 hINSR intron T IVS 236 > 407 hINSR intron U BASE COUNT 73 a 75 c 118 g 141 t ORIGIN About 1.0 kbp downstream of segment 20. 1 ttgcgtgtgt gtgtgcgttt gcgtgtgtgt gtttgcgcgc gcgcgtgtgt gtgtgtgtct 61 aaatggcttc tttgttacta ctatcaactg tcatcggcag gtcctttggc gtggtccttt 121 gggaaatcac cagcttggca gaacagcctt accaaggcct gtctaatgaa caggtgttga 181 aatttgtcat ggatggaggg tatctggatc aacccgacaa ctgtccagag agagtgtaag 241 tgtagaaagg gtttaaggtg tgtgaggtgt tcgttgaaag ggtattgccc tttacacgtg 301 tgcttggttt tgcctttcct atgtctacac gctcaccgtg tttgcatgct gtatgttaca 361 ggtgtgtttg tgtttgcata gcttgtcttt acatgcatgc ttgcatt // LOCUS HUMINSR22 873 bp ds-DNA PRI 10-JUL-1990 DEFINITION Human insulin receptor (hINSR) gene, exon 22. ACCESSION M32972 KEYWORDS insulin receptor. SEGMENT 22 of 22 SOURCE Human fetal liver DNA, clone lambda-hINSR-[1-13]. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 873) AUTHORS Seino,S., Seino,M. and Bell,G.I. TITLE Human insulin-receptor gene: Partial sequence and amplification of exons by polymerase chain reaction JOURNAL Diabetes 39, 123-128 (1990) STANDARD simple staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by G.I.Bell, 14-MAR-1990. FEATURES from to/span description pept + 83 437 human insulin receptor precursor, exon 22 /hgml_locus_uid="LG0007M" /nomgen="INSR" /map="19p13.3-p13.2" matp + 83 434 human insulin receptor pre-msg < 1 873 hINSR mRNA and introns IVS < 1 82 hINSR intron U BASE COUNT 199 a 217 c 234 g 223 t ORIGIN About 2.0 kbp downstream of segment 21. 
1 ctgcagggac aagagtgggg gtttgggagg atgcgtggca gggcccccag actcacccag 61 gacgtgtcct tctgccccgc agcactgacc tcatgcgcat gtgctggcaa ttcaacccca 121 agatgaggcc aaccttcctg gagattgtca acctgctcaa ggacgacctg caccccagct 181 ttccagaggt gtcgttcttc cacagcgagg agaacaaggc tcccgagagt gaggagctgg 241 agatggagtt tgaggacatg gagaatgtgc ccctggaccg ttcctcgcac tgtcagaggg 301 aggaggcggg gggccgggat ggagggtcct cgctgggttt caagcggagc tacgaggaac 361 acatccctta cacacacatg aacggaggca agaaaaacgg gcggattctg accttgcctc 421 ggtccaatcc ttcctaacag tgcctaccgt ggcgggggcg ggcaggggtt cccattttcg 481 ctttcctctg gtttgaaagc ctctggaaaa ctcaggattc tcacgactct accatgtcca 541 gtggagttca gagatcgttc ctatacattt ctgttcatct taaggtggac tcgtttggtt 601 accaatttaa ctagtcctgc agaggattta actgtgaacc tggagggcaa ggggtttcca 661 cagttgctgc tcctttgggg caacgacggt ttcaaaccag gattttgtgt tttttcgttc 721 cccccacccg cccccagcag atggaaagaa agcacctgtt tttacaaatt cttttttttt 781 tttttttttt tttttttttg ctggtgtctg agcttcagta taaaagacaa aacttcctgt 841 ttgtggaaca aaatttcgaa agaaaaaacc aaa // LOCUS BT1NAMTA 1091 bp ds-DNA PHG 10-JUL-1990 DEFINITION Bacteriophage T1 DNA N-6-adenine-methyltransferase (M.T1) gene, complete cds. ACCESSION J05393 KEYWORDS DNA N-6-adenine-methyltransferase. SOURCE Bacteriophage T1 DNA. ORGANISM Bacteriophage T1 Viridae; Nonclassified viruses. REFERENCE 1 (bases 1 to 1091) AUTHORS Schneider-Scherzer,E., Auer,B., de Groot,E.J. and Schweiger,M. TITLE Primary structure of a DNA (N-6-adenine)-methyltransferase from Escherichia coli virus T1 JOURNAL J. Biol. Chem. 265, 6086-6091 (1990) STANDARD simple staff_entry FEATURES from to/span description pept 171 824 DNA N-6-adenine-methyltransferase (M.T1) pept 824 1072 pot. protein HP 83 binding 161 164 ribosomal binding site (put.) binding 813 816 ribosomal binding site (put.) 
signal 141 146 TATA box BASE COUNT 345 a 205 c 266 g 275 t ORIGIN 1 aaaagggaag tttctcaaaa aggtccggga gcgtggcggc ttctctgccg tcgcatacgg 61 attcgggcaa ttcaagatcg caatttacga aatgatgaaa tagcactttt tgttaaaact 121 gccgggatgg aatctggcat tattatctca ccaaaacgag aggaataaaa atgaaagact 181 ttaatgatat cgaaactatc gactttgcag aaactggttg ctcattcact cgcgaagcaa 241 tagcatcagg cggttattat caggcattga aaacgccaac ctgtaaagag atttcagggc 301 gtcgatacaa ggggacaaat acccctgacg ctgttcgtga tttatggtca actccgcgag 361 aggttattgc ataccttgag ggtcgttatg ggaaatatga tctcgacgct gcggcaagcg 421 aagaaaataa agtttgcgag aagttttact ctcaggaaac aaactgctta aaacgttggt 481 ggggaaagaa taagcacgtt tggttaaatc ctccttatag ccgacctgat atatttgtca 541 actctactgc gtggtttact gaagcgcggc agaacgcagc tgaaataatc tggattgaag 601 cggacttgac tgaggatatt gacggcaatg aatacgcacg atccggtcgc ctggctttca 661 tatccggtga aactggaaag gccgtagacg gtaataacaa aggttcggta atttttatta 721 tgcgcgaact taaagaaggt gaggtgcaac agactcacta catcccaatc acaagcattt 781 gcccttcggt gaaaaacaaa cgagcaaagg tgaggaaagt atgatgagcg aaaaaatggt 841 tcctgttaaa ttaactgagc aaggtttatg gctactttat cgagctacgt gctgcgaaat 901 tatggagcga aacggattga ctcaggatgt tattggttgc gatctgtggg agttcactag 961 ttctcttgat atgcttttcg atgagataaa aaatgaatac atagagaact ggccttcaat 1021 catacagaaa gacgtggaag aacttaaagc tgatacaatc gtacagcact aattgctaaa 1081 actacccggc g // LOCUS STVBLSG 1130 bp ds-DNA BCT 10-JUL-1990 DEFINITION Streptoverticillum sp. blasticidin S-acetyltransferase (bls) gene, complete cds. ACCESSION M34537 KEYWORDS blasticidin S-acetyltransferase. SOURCE Streptoverticillum sp. (strain JCM4673) DNA. ORGANISM Streptoverticillum sp. Prokaryota; Bacteria; Firmicutes; Streptomycetaceae. REFERENCE 1 (bases 1 to 1130) AUTHORS Perez-Gonzalez,J.A., Ruiz,D., Esteban,J.A. and Jimenez,A. 
TITLE Cloning and characterization of the gene encoding a blasticidin S acetyltransferase from Streptoverticillum sp JOURNAL Gene 86, 129-134 (1990) STANDARD simple staff_entry FEATURES from to/span description pept 147 557 blasticidin S-acetyltransferase (bls) (147 could be 225) binding 136 139 ribosomal binding site (put.) BASE COUNT 151 a 420 c 394 g 165 t ORIGIN 1 gatcagcgcc ggcccccacc ggcactgtgc atcagcgtac ggccggggta cgacaacgga 61 agcggattgg caaaactgcc tggccccggt gtttatggtg agctttatgt tcagtattga 121 ggcggtgaac gacccggaac gacgcgatgt tgtccttgcc acggttgcag accgtcaacg 181 acgaacgttc gcccgccctg cgggcgttgc ggcgcacgcc ggtgatggag gcgcggccgc 241 tggaggtgta cgccacgtac gcctgcggcg agcgcgggga gctggcgggc gggctcgtcg 301 gtcatgtgca gtggcaatgg ctgcacgtgg acctgctgtg ggtggacgcg ggggcccgcg 361 gggcggggct gggctcgcgg ttgatcgcgc gggcggaggc ccgcgcccgg gaggagttcg 421 gctgcatcgg cagccaggtg gagacctggg acttccaggc gccggggttc taccagcggg 481 tggggtatcg cctcgcggcg agcatcccgg actatccgcc cgggatcacg agccacctgc 541 tggtgaagga gctttgaggc gccccgtcag gggcgcgggg ccgttactcc ggggctgcgc 601 cccggacccc cgggtggcgc gtcgactgcg ggccggtggg ggcttgtcgc gcagttcccc 661 gcgcccctta cggggcgcct ggtcgcgccc acgcggcgga gccgcatatc gagcacagcc 721 ccgcgcccct tacggggcgc tgctctaggc cacccgccgt gccccctccc ccgccgccgt 781 gccgaacagt cgtgccgtcc ccagtgcctc ggtgaccacc ttggtcaccc tttcctcatc 841 tgccccatcc accaaggcga ttgccgagcc gccgaagccg ccgcccgtca tccgggcccc 901 cagggccccc gccttcaccg ccgtctccac caccacgtcc aattccgcac aggacacccg 961 gaagtcgtcg cgcagcgagg cgtgcccctc cgtcagcagt gggcccacag ccctcgcatc 1021 ccccgcggcc agcagggccg cgacccgctc cacccggtcg ttctccgtca ccacgtgacg 1081 gaccaaggcg cgctccgcgg caggcaactc acccagtgcc gcctgcagac // LOCUS HUMGAPDH 1268 bp ss-mRNA PRI 10-JUL-1990 DEFINITION Human glyceraldehyde-3-phosphate dehydrogenase (GAPDH) mRNA, complete cds. ACCESSION M33197 KEYWORDS glyceraldehyde-3-phosphate dehydrogenase. SOURCE Human lung cancer cell, cDNA to mRNA. 
ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 1268) AUTHORS Tokunaga,K., Nakamura,Y., Sakata,K., Fujimori,K., Ohkubo,M., Sawada,K. and Sakiyama,S. TITLE Enhanced expression of a glyceraldehyde-3-phosphate dehydrogenase gene in human lung cancers JOURNAL Cancer Res. 47, 5616-5619 (1987) STANDARD simple staff_entry FEATURES from to/span description pept 61 1068 glyceraldehyde-3-phosphate dehydrogenase (EC 1.2.1.12) /hgml_locus_uid="LM0055R" /nomgen="GAPD" /map="12p13" mRNA < 1 1268 GAPDH mRNA BASE COUNT 295 a 385 c 326 g 262 t ORIGIN 1 gttcgacagt cagccgcatc ttcttttgcg tcgccagccg agccacatcg ctcagacacc 61 atggggaagg tgaaggtcgg agtcaacgga tttggtcgta ttgggcgcct ggtcaccagg 121 gctgctttta actctggtaa agtggatatt gttgccatca atgacccctt cattgacctc 181 aactacatgg tttacatgtt ccaatatgat tccacccatg gcaaattcca tggcaccgtc 241 aaggctgaga acgggaagct tgtcatcaat ggaaatccca tcaccatctt ccaggagcga 301 gatccctcca aaatcaagtg gggcgatgct ggcgctgagt acgtcgtgga gtccactggc 361 gtcttcacca ccatggagaa ggctggggct catttgcagg ggggagccaa aagggtcatc 421 atctctgccc cctctgctga tgcccccatg ttcgtcatgg gtgtgaacca tgagaagtat 481 gacaacagcc tcaagatcat cagcaatgcc tcctgcacca ccaactgctt agcacccctg 541 gccaaggtca tccatgacaa ctttggtatc gtggaaggac tcatgaccac agtccatgcc 601 atcactgcca cccagaagac tgtggatggc ccctccggga aactgtggcg tgatggccgc 661 ggggctctcc agaacatcat ccctgcctct actggcgctg ccaaggctgt gggcaaggtc 721 atccctgagc tgaacgggaa gctcactggc atggccttcc gtgtccccac tgccaacgtg 781 tcagtggtgg acctgacctg ccgtctagaa aaacctgcca aatatgatga catcaagaag 841 gtggtgaagc aggcgtcgga gggccccctc aagggcatcc tgggctacac tgagcaccag 901 gtggtctcct ctgacttcaa cagcgacacc cactcctcca cctttgacgc tggggctggc 961 attgccctca acgaccactt tgtcaagctc atttcctggt atgacaacga atttggctac 1021 agcaacaggg tggtggacct catggcccac atggcctcca aggagtaaga cccctggacc 1081 accagcccca gcaagagcac aagaggaaga gagagaccct cactgctggg gagtccctgc 1141 
cacactcagt cccccaccac actgaatctc ccctcctcac agttgccatg tagacccctt 1201 gaagagggga ggggcctagg gagccgcacc ttgtcatgta ccatcaataa agtaccctgt 1261 gctcaacc // LOCUS MUSMK2P 728 bp ss-mRNA ROD 10-JUL-1990 DEFINITION Mouse retinoic acid-responsive protein (MK) mRNA, complete cds. ACCESSION M35833 J05447 KEYWORDS MK protein; retinoic acid-responsive protein. SOURCE Mouse (strain BALB/c) adult liver, cDNA to mRNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (sites) AUTHORS Matsubara,S., Tomomura,M., Kadomatsu,K. and Muramatsu,T. TITLE Structure of a retinoic acid-responsive gene, MK, which is transiently activated during the differentiation of embryonal carcinoma cells and the mid-gestation period of mouse embryogenesis JOURNAL J. Biol. Chem. 265, 9441-9443 (1990) STANDARD full staff_review REFERENCE 2 (bases 1 to 728; for [1]) AUTHORS Matsubara,S., Tomomura,M., Kadomatsu,K. and Muramatsu,T. JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence [1] kindly submitted by M.Shyuichiro, 20-APR-1990, for release after publication. 
FEATURES from to/span description pept 44 466 retinoic acid-responsive protein MK precursor sigp 44 109 retinoic acid-responsive protein MK signal peptide matp 110 463 retinoic acid-responsive protein MK mRNA 1 728 MK2 mRNA BASE COUNT 184 a 211 c 206 g 127 t ORIGIN 1 caggccggag cgggagggag cgaagcatcg agcagtgagc gagatgcagc accgaggctt 61 cttccttctc gcccttcttg ccctcttggt ggtcacgtcc gcggtggcca aaaaaaaaga 121 gaaggtgaag aagggcagcg agtgttcgga gtggacctgg gggccctgca cccccagcag 181 caaggactgc ggcatgggct tccgcgaggg tacctgtggg gcccagaccc agcgcgtcca 241 ttgcaaggtg ccctgcaact ggaagaagga atttggagcc gactgcaaat acaagtttga 301 gagctggggg gcgtgtgatg ggagcactgg caccaaagcc cgccaaggga ccctgaagaa 361 ggcgcggtac aatgcccagt gccaggagac catccgcgtg actaagccct gcacctccaa 421 gaccaagtca aagaccaaag ccaagaaagg aaaaggaaag gactaagtca ggaggccaga 481 gagcctccgg cctcgcctgg agcctgaacg gagccctcct ctcccacagg cccaagatat 541 aacccaccag tgccttttgt cttcctgtca gctctgtcaa tcacgcctgt cctctcacgc 601 ccacaccaag tgcccaaagt ggggagggac aagagattct ggaaagtgag cctccccata 661 ccctcttttg ttctccccac cctgatactt gttattaaga aatgaataaa ataaactcac 721 ttttttcc // LOCUS MUSMKPG 2929 bp ds-DNA ROD 10-JUL-1990 DEFINITION Mouse retinoic acid-responsive protein (MK) gene, complete cds. ACCESSION M34094 J05447 KEYWORDS MK protein; alternative splicing; retinoic acid-responsive protein. SOURCE Mouse (strain BALB/c) adult liver DNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 2929) AUTHORS Matsubara,S., Tomomura,M., Kadomatsu,K. and Muramatsu,T. TITLE Structure of a retinoic acid-responsive gene, MK, which is transiently activated during the differentiation of embryonal carcinoma cells and the mid-gestation period of mouse embryogenesis JOURNAL J. Biol. Chem. 
265, 9441-9443 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence [1] kindly submitted by M.Shyuichiro, 20-APR-1990, for release after publication. FEATURES from to/span description pept 1298 1373 retinoic acid-responsive protein (MK) precursor, exon 1 1491 1649 retinoic acid-responsive protein (MK) precursor, exon 2 1766 1927 retinoic acid-responsive protein (MK) precursor, exon 3 2631 2656 retinoic acid-responsive protein (MK) precursor, exon 4 sigp 1298 1363 retinoic acid-responsive protein (MK) signal peptide matp 1364 1373 retinoic acid-responsive protein (MK) 1491 1649 retinoic acid-responsive protein (MK) 1766 1927 retinoic acid-responsive protein (MK) 2631 2653 retinoic acid-responsive protein (MK) pre-msg 463 2918 MK3 mRNA and introns (minor alt.) pre-msg 1007 2918 MK2 mRNA and introns (major alt.) pre-msg 1048 2918 MK1 mRNA and introns (minor alt.) IVS 816 1296 MK3 intron A IVS 1052 1296 MK2 intron A IVS 1374 1490 MK1 intron A, and MK2 and MK3 intron B IVS 1650 1765 MK1 intron B, and MK2 and MK3 intron C IVS 1928 2630 MK1 intron C, and MK2 and MK3 intron D signal 85 91 GC box signal 143 149 GC box signal 274 280 GC box signal 598 604 GC box signal 852 858 GC box signal 910 916 GC box signal 939 945 GC box BASE COUNT 660 a 771 c 930 g 568 t ORIGIN 1 tggccaccaa catctcagat cacttcggga gatgggtctg ccccgatcct gacctctgcc 61 tagggcctta ggctcacagc gcctggggcg gagctgattt tcccgctcct gcagggatga 121 taacaatgaa agtaaaagag gtggggcggg ggccaggctt gggttctttg gtcttttggc 181 cctgtgccct ggagcagtcc cctccccctg gcttgtactg gggggggggg gggggatctg 241 cttgaggtga gcctgaggcc ccagggtcag gggtgggcgg ttatcacctc cgggggaagc 301 ccggtctgga acttctcaga cagctcttgt cagcgacaag atttaccaaa ctcatttcta 361 tgtgcttccc catccccccc aacgcccttc cctcctcctc ctcccccaaa cctgcactag 421 aaaaaggctc tcgagccttg ctcacccgga gccatctgag gtcccaggta cccagctccc 481 tgccacatca gagacccttc ttgcactctg agtgaactga ttaaaaaaaa aaaaaaaaaa 541 aaaaaaccaa gccggaggtg agccgggcct cgaagggaag gttcgcgggt 
gcggtggccg 601 ccccgagcct gtgacaccag gacatactcc cggggcccgc ggtgggcaag cgaagtggtg 661 acctgagagc tgacaggctg cgagagggaa aagtatagac aggcctagac caggggaagg 721 ggaggggata gagagctggg cctgctacga ggggacctga gccagaagcg cactggtaaa 781 accgaactcc aggaccagag acccagagat cagaggtgag aggcacagac gcgggagtcc 841 cggctcggcg aggggcggga gtggaggcgg ggactagggg ggtctgggga ggtgcgggtt 901 tggggggagg gggcgggtcc ttccacggga tggggggagg ggcgggggcc catgtgaccg 961 gctcagaccg gttctggaga caaaaggggc cttagcggcc ttagcgggac aggccggagc 1021 gggagggagc gaagcatcga gcagtgagcg agtgagcgca cgcagtggct gtggccccag 1081 tcccttcagg cggctgctct gccaccaagg gggctgaggt gggggtgggg gtacgctgag 1141 acatcggttc caagtcctcc ctccgtctcc cccttgtcgg tccgacgttt tgggcctgga 1201 aagtgggaca agtcagtcaa gggtgggagg tccttcccgc ggttcctagc ggagaagaga 1261 ctaggcgaga aactctaacc caggttttac ccctaggatg cagcaccgag gcttcttcct 1321 tctcgccctt cttgccctct tggtggtcac gtccgcggtg gccaaaaaaa aaggtgatgg 1381 gataggatgg gctcaggagt aaaagctggg gtgggcaggt gaggcaggcc gtgtgaccaa 1441 gtgctggtcc ggcacgccat gtccttaact ttgttccttg cgccctgtag agaaggtgaa 1501 gaagggcagc gagtgttcgg agtggacctg ggggccctgc acccccagca gcaaggactg 1561 cggcatgggc ttccgcgagg gtacctgtgg ggcccagacc cagcgcgtcc attgcaaggt 1621 gccctgcaac tggaagaagg aatttggagg tgaggtggcg cgcgggagga gggcgggaag 1681 ccagagggta tgtccttata aaccggaggc agggaggaca tccacaaccc tcctgtctct 1741 caccgtgggg ccactctccc atcagccgac tgcaaataca agtttgagag ctggggggcg 1801 tgtgatggga gcactggcac caaagcccgc caagggaccc tgaagaaggc gcggtacaat 1861 gcccagtgcc aggagaccat ccgcgtgact aagccctgca cctccaagac caagtcaaag 1921 accaaaggtc agcgaatatg gtggggttgt gggccaggct actccatgct ctgtctctgc 1981 agagcagtct taaagttagg aatgggcagg cacttgaggg ccactctcag gagatgctaa 2041 accctctgcc caagtaggaa ctactctttc tgttggatca tccgacctgg gttcctggga 2101 aaggcttgtc tttgtcaact gaggaaggtg gggtgggatc agggaggagt taactctgcg 2161 cttaaaacta tggaaaggcc tgtcccaaag gtacatgctg ctacctgact cccaacagct 2221 attgaggcca gcagggcaga ggtgactctg cccatttccc cggtgaggaa cttggagtac 2281 
tctgatccta gatgaaaata gaaagttgaa agtcaggctt ggtagctcgt gcctgtaaaa 2341 agcggcactt caggactgag gcagtaacac tgccttgagt tcaaggttac agactgagag 2401 acttgagagt ctgtctttaa aggggggggg ggggcgcgag ggttaaaaag ttgaacgaat 2461 aaagaaagat ttcatatcac atggctgccc tttcccacca cttccaggtg aactggtcag 2521 tcaccactag ggggcaggat tttctctcct tgatggacat gtctgcgttg tctggtgagt 2581 ccgagctagg tcacccaccg cactaatgca tctccgttat tgttttccag ccaagaaagg 2641 aaaaggaaag gactaagtca ggaggccaga gagcctccgg cctcgcctgg agcctgaacg 2701 gagccctcct ctcccacagg cccaagatat aacccaccag tgccttttgt cttcctgtca 2761 gctctgtcaa tcacgcctgt cctctcacgc ccacaccaag tgcccaaagt ggggagggac 2821 aagagattct ggaaagtgag cctccccata ccctcttttg ttctccccac cctgatactt 2881 gttattaaga aatgaataaa ataaactcac ttttttccaa taaaagctt // LOCUS MUSCRRY01 676 bp ds-DNA ROD 10-JUL-1990 DEFINITION Mouse complement receptor (Crry) gene, exon 1. ACCESSION M34164 KEYWORDS complement receptor. SEGMENT 1 of 10 SOURCE Mouse (strain Balb/c) DNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 676) AUTHORS Paul,M.S., Aegerter,M., Cepek,K., Miller,M.D. and Weis,J.H. TITLE The murine complement receptor gene family: III. The genomic and transcriptional complexity of the Crry and Crry-ps genes JOURNAL J. Immunol. 144, 1988-1996 (1990) STANDARD simple staff_review FEATURES from to/span description pept 414 + 531 complement receptor (Crry; liver) precursor, exon 1 sigp 414 530 complement receptor (liver) signal peptide (put.) matp 531 + 531 complement receptor (liver) pep$ 414 + 660 complement receptor (spleen) precursor, exon 1 sigp 414 530 complement receptor (spleen) signal peptide (put.) 
matp 531 + 660 complement receptor (spleen) IVS 532 > 676 Crry intron A IVS 661 > 676 Crry intron A' BASE COUNT 148 a 162 c 193 g 173 t ORIGIN 1 atccgaattc atcataagga aataggttct tactgtatac tagacagggt atgcaactgt 61 cagctcactg ttgcagatta gggttaggct ccacccttgc agatttttaa aaggagtaag 121 gccgggctat atgccaaacc gagttcccat aatgccttgt tttctttgga gtcgaaggtt 181 cctgcaagtg gaaaacttcc tggagctgac ctactaggta ttgaaccagt ttctgcattg 241 ctgaatcaat ctcccaaggg taattccaca gaaatcccag gggcttggag taaacaagac 301 cgcgcctagc ccagctagag gaagttttat tccggaaccc agcgccattt ctgggtggga 361 ctgctttcta caccatttgc cgtaaaacgt tgtttgagaa cggtgtgagg ggaatggagg 421 tctcttctcg gagttcagag cctctggatc cggtgtggct ccttgtagcc ttcggccggg 481 gaggagtcaa gctagaagtt ttgctgctgt tcttgctgcc atttactttg ggtgagctgc 541 ggggaggcct ggggaagcac ggacacacgg ttcaccggga acccgcggta aataggctct 601 gcgcagactc caaacgctgg tctgggctgc ctgtgagtgc tcagcgcccc tttcccatgg 661 gtgagcgtgg ggcgcc // LOCUS MUSCRRY02 200 bp ds-DNA ROD 10-JUL-1990 DEFINITION Mouse complement receptor (Crry) gene, exon 2. ACCESSION M34165 KEYWORDS complement receptor. SEGMENT 2 of 10 SOURCE Mouse (strain Balb/c) DNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 200) AUTHORS Paul,M.S., Aegerter,M., Cepek,K., Miller,M.D. and Weis,J.H. TITLE The murine complement receptor gene family: III. The genomic and transcriptional complexity of the Crry and Crry-ps genes JOURNAL J. Immunol. 144, 1988-1996 (1990) STANDARD simple staff_review FEATURES from to/span description pept + 11 + 190 complement receptor (Crry; liver) precursor, exon 2 matp 11 + 190 complement receptor (liver) pep$ + 11 + 190 complement receptor (spleen) precursor, exon 2 matp + 11 + 190 complement receptor (spleen) IVS < 1 10 Crry intron A IVS < 1 10 Crry intron A' IVS 191 > 200 Crry intron B BASE COUNT 60 a 51 c 35 g 54 t ORIGIN Undetermined number of base pairs after segment 1. 
1 cattcaacag gtcactgccc agccccatca cagcttcctt ctgccaaacc tataaatcta 61 actgatgaat ccatgtttcc cattggaaca tatttgttgt atgaatgtct cccaggatat 121 atcaagaggc agttctctat cacctgcaaa caagactcaa cctggacgag tgctgaagat 181 aagtgtatac gtgagtaact // LOCUS MUSCRRY03 120 bp ds-DNA ROD 10-JUL-1990 DEFINITION Mouse complement receptor (Crry) gene, exon 3. ACCESSION M34166 KEYWORDS complement receptor. SEGMENT 3 of 10 SOURCE Mouse (strain Balb/c) DNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 120) AUTHORS Paul,M.S., Aegerter,M., Cepek,K., Miller,M.D. and Weis,J.H. TITLE The murine complement receptor gene family: III. The genomic and transcriptional complexity of the Crry and Crry-ps genes JOURNAL J. Immunol. 144, 1988-1996 (1990) STANDARD simple staff_review FEATURES from to/span description pept + 11 + 110 complement receptor (Crry; liver) precursor, exon 3 matp + 11 + 110 complement receptor (liver) pep$ + 11 + 110 complement receptor (spleen) precursor, exon 3 matp + 11 + 110 complement receptor (spleen) IVS < 1 10 Crry intron B IVS 111 > 120 Crry intron C BASE COUNT 36 a 21 c 25 g 38 t ORIGIN Undetermined number of base pairs after segment 2. 1 tttttcatag gaaaacaatg taaaactcct tcagatcctg agaatggctt ggtacatgta 61 cacacaggca ttcagtttgg atcccgtatt aattatactt gtaatcaagg gtgagttggc // LOCUS MUSCRRY04 104 bp ds-DNA ROD 10-JUL-1990 DEFINITION Mouse complement receptor (Crry) gene, exon 4. ACCESSION M34167 KEYWORDS complement receptor. SEGMENT 4 of 10 SOURCE Mouse (strain Balb/c) DNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 104) AUTHORS Paul,M.S., Aegerter,M., Cepek,K., Miller,M.D. and Weis,J.H. TITLE The murine complement receptor gene family: III. The genomic and transcriptional complexity of the Crry and Crry-ps genes JOURNAL J. Immunol. 
144, 1988-1996 (1990) STANDARD simple staff_review FEATURES from to/span description pept + 11 + 96 complement receptor (Crry; liver) precursor, exon 4 matp + 11 + 96 complement receptor (liver) pep$ + 11 + 96 complement receptor (spleen) precursor, exon 4 matp + 11 + 96 complement receptor (spleen) IVS < 1 10 Crry intron C IVS 97 > 104 Crry intron D BASE COUNT 20 a 20 c 27 g 37 t ORIGIN Undetermined number of base pairs after segment 3. 1 ctgtgtgtag ataccgcctc attggttcct cctctgctgt atgtgtcatc actgatcaaa 61 gtgttgattg ggatactgag gcacctattt gtgagtgtaa gttg // LOCUS MUSCRRY05 422 bp ds-DNA ROD 10-JUL-1990 DEFINITION Mouse complement receptor (Crry) gene, exon 5. ACCESSION M34168 KEYWORDS complement receptor. SEGMENT 5 of 10 SOURCE Mouse (strain Balb/c) DNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 422) AUTHORS Paul,M.S., Aegerter,M., Cepek,K., Miller,M.D. and Weis,J.H. TITLE The murine complement receptor gene family: III. The genomic and transcriptional complexity of the Crry and Crry-ps genes JOURNAL J. Immunol. 144, 1988-1996 (1990) STANDARD simple staff_review FEATURES from to/span description pept + 11 + 412 complement receptor (Crry; liver) precursor, exon 5 matp + 11 + 412 complement receptor (liver) pep$ + 11 + 412 complement receptor (spleen) precursor, exon 5 matp + 11 + 412 complement receptor (spleen) IVS < 1 10 Crry intron D IVS 413 > 422 Crry intron E BASE COUNT 106 a 100 c 104 g 112 t ORIGIN Undetermined number of base pairs after segment 4. 
1 ctttgcccag ggattccttg tgagataccc ccaggcattc ccaatggaga tttcttcagt 61 tcaaccagag aagactttca ttatggaatg gtggttacct accgctgcaa cactgatgcg 121 agagggaagg cgctctttaa cctggtgggt gagccctcct tatactgtac cagcaacgat 181 ggtgaaattg gagtctggag cggccctcct cctcagtgca ttgaactcaa caaatgtact 241 cctcctccct atgttgaaaa tgcagtcatg ctgtctgaga acagaagctt gttttcctta 301 agggatattg tggagtttag atgtcaccct ggctttatca tgaaaggagc cagcagtgtg 361 cattgtcagt ccctaaacaa atgggagcca gagttaccaa gctgcttcaa gggtaagctc 421 ga // LOCUS MUSCRRY06 206 bp ds-DNA ROD 10-JUL-1990 DEFINITION Mouse complement receptor (Crry) gene, exon 6. ACCESSION M34169 KEYWORDS complement receptor. SEGMENT 6 of 10 SOURCE Mouse (strain Balb/c) DNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 206) AUTHORS Paul,M.S., Aegerter,M., Cepek,K., Miller,M.D. and Weis,J.H. TITLE The murine complement receptor gene family: III. The genomic and transcriptional complexity of the Crry and Crry-ps genes JOURNAL J. Immunol. 144, 1988-1996 (1990) STANDARD simple staff_review FEATURES from to/span description pept + 11 + 196 complement receptor (Crry; liver) precursor, exon 6 matp + 11 + 196 complement receptor (liver) pep$ + 11 + 196 complement receptor (spleen) precursor, exon 6 matp + 11 + 196 complement receptor (spleen) IVS < 1 10 Crry intron E IVS 197 > 206 Crry intron F BASE COUNT 61 a 33 c 58 g 54 t ORIGIN Undetermined number of base pairs after segment 5. 1 ctaattgcag gagtgatatg tcgtctccct caggagatga gtggattcca gaaggggttg 61 ggaatgaaaa aagaatatta ttatggagag aatgtaacct tggaatgtga ggatgggtat 121 actctagaag gcagttctca aagccagtgc cagtctgatg gcagctggaa tcctcttctg 181 gccaaatgtg tatctcgtaa gtacaa // LOCUS MUSCRRY07 44 bp ds-DNA ROD 10-JUL-1990 DEFINITION Mouse complement receptor (Crry) gene, exon 7. ACCESSION M34170 KEYWORDS complement receptor. SEGMENT 7 of 10 SOURCE Mouse (strain Balb/c) DNA. 
ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 44) AUTHORS Paul,M.S., Aegerter,M., Cepek,K., Miller,M.D. and Weis,J.H. TITLE The murine complement receptor gene family: III. The genomic and transcriptional complexity of the Crry and Crry-ps genes JOURNAL J. Immunol. 144, 1988-1996 (1990) STANDARD simple staff_review FEATURES from to/span description pept + 11 + 34 complement receptor (Crry; liver) precursor, exon 7 matp + 11 + 34 complement receptor (liver) pep$ + 11 + 34 complement receptor (spleen) precursor, exon 7 matp + 11 + 34 complement receptor (spleen) IVS < 1 10 Crry intron F IVS 35 > 44 Crry intron G BASE COUNT 8 a 8 c 10 g 18 t ORIGIN Undetermined number of base pairs after segment 6. 1 tctctttcag gctcaatcag tggtctaatt gttggtaagt tctg // LOCUS MUSCRRY08 96 bp ds-DNA ROD 10-JUL-1990 DEFINITION Mouse complement receptor (Crry) gene, exon 8. ACCESSION M34171 KEYWORDS complement receptor. SEGMENT 8 of 10 SOURCE Mouse (strain Balb/c) DNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 96) AUTHORS Paul,M.S., Aegerter,M., Cepek,K., Miller,M.D. and Weis,J.H. TITLE The murine complement receptor gene family: III. The genomic and transcriptional complexity of the Crry and Crry-ps genes JOURNAL J. Immunol. 144, 1988-1996 (1990) STANDARD simple staff_review FEATURES from to/span description pept + 11 + 86 complement receptor (Crry; liver) precursor, exon 8 matp + 11 + 86 complement receptor (liver) pep$ + 11 + 86 complement receptor (spleen) precursor, exon 8 matp + 11 + 86 complement receptor (spleen) IVS < 1 10 Crry intron G IVS 87 > 96 Crry intron H BASE COUNT 27 a 10 c 19 g 40 t ORIGIN Undetermined number of base pairs after segment 7. 
1 tcctgtttag gaattttcat tgggataatc gtctttattt tagtcatcat tgttttcatt 61 tggatgattc tgaagtataa aaaacggtga gtaaag // LOCUS MUSCRRY09 125 bp ds-DNA ROD 10-JUL-1990 DEFINITION Mouse complement receptor (Crry) gene, exon 9. ACCESSION M34172 KEYWORDS complement receptor. SEGMENT 9 of 10 SOURCE Mouse (strain Balb/c) DNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 125) AUTHORS Paul,M.S., Aegerter,M., Cepek,K., Miller,M.D. and Weis,J.H. TITLE The murine complement receptor gene family: III. The genomic and transcriptional complexity of the Crry and Crry-ps genes JOURNAL J. Immunol. 144, 1988-1996 (1990) STANDARD simple staff_review FEATURES from to/span description pept + 11 + 115 complement receptor (Crry; liver) precursor, exon 9 matp + 11 + 115 complement receptor (liver) pep$ + 11 + 115 complement receptor (spleen) precursor, exon 9 matp + 11 + 115 complement receptor (spleen) IVS < 1 10 Crry intron H IVS 116 > 125 Crry intron I BASE COUNT 46 a 24 c 25 g 30 t ORIGIN Undetermined number of base pairs after segment 8. 1 taccaattag caataccaca gatgaaaagt ataaagaagt gggtattcat ttaaattata 61 aagaagacag ctgtgtccgc cttcagtctc tgctcacaag tcaggagaac agcaggtaca 121 tatgc // LOCUS MUSCRRY10 128 bp ds-DNA ROD 10-JUL-1990 DEFINITION Mouse complement receptor (Crry) gene, exon 10. ACCESSION M34173 KEYWORDS complement receptor. SEGMENT 10 of 10 SOURCE Mouse (strain Balb/c) DNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 128) AUTHORS Paul,M.S., Aegerter,M., Cepek,K., Miller,M.D. and Weis,J.H. TITLE The murine complement receptor gene family: III. The genomic and transcriptional complexity of the Crry and Crry-ps genes JOURNAL J. Immunol. 
144, 1988-1996 (1990) STANDARD simple staff_review FEATURES from to/span description pept + 11 56 complement receptor (Crry; liver) precursor, exon 10 matp + 11 53 complement receptor (liver) pep$ + 11 56 complement receptor (spleen) precursor, exon 10 matp + 11 53 complement receptor (spleen) IVS < 1 10 Crry intron I BASE COUNT 41 a 30 c 20 g 37 t ORIGIN Undetermined number of base pairs after segment 9. 1 tttgctgaag taccactagc ccagcacgga attcactcac tcaagaagtc tcctaaatag 61 cagcaacgtg aaatgagaac atgctctgtc tgtatcactt ttaaaataaa ctgtttcctt 121 ttaagatc // LOCUS MUSCRRYPS 1272 bp ds-DNA ROD 10-JUL-1990 DEFINITION Mouse complement receptor (Crry-ps) pseudogene DNA fragment. ACCESSION M34174 KEYWORDS complement receptor; pseudogene. SOURCE Mouse (strain Balb/c) DNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 1272) AUTHORS Paul,M.S., Aegerter,M., Cepek,K., Miller,M.D. and Weis,J.H. TITLE The murine complement receptor gene family: III. The genomic and transcriptional complexity of the Crry and Crry-ps genes JOURNAL J. Immunol. 
144, 1988-1996 (1990) STANDARD simple staff_review BASE COUNT 377 a 256 c 276 g 363 t ORIGIN 1 tgcccagccc catcacagct tccttctgcc aaacctataa atctaactga tgaatccatg 61 tttcccattg gaacatctgt gaaatatgaa tgtctcccag gatatatcaa gaggcagttc 121 tctatcacct gcaaacaaga ctcaacctgg acgagtgctg aagataagtg tatacgaaaa 181 caatgtaaaa ctcctttaga tcctcagaat ggcttggtac atgtacacac aggcattcag 241 tttggatccc gtattaatta tacttgtaat aaaggatacc gcctcattgg ttcctcctct 301 gctgtatgtg tcatcactga tcaaagtgtt gattgggata ctgaggcacc tatttgtgag 361 tggattcctt gtgatatacc cccaggcatt cccaatggag atttcttcag ttcaactaga 421 gaagactttc attatggaat ggtggttacc taccgctgca acactgatgc gagagggaag 481 gcgctcttta acctggtggt tatactgtac cagcaacgat ggtgaaattg gagtctggag 541 tggccctcct cctcagtgca ttggattcaa caaatgtact cctcctccct atgttgaaaa 601 tgcagtcatg ctgtctgaga acagaagctt gttttcctta agggatattg tggagtttag 661 atgtcaccct ggctttatca tgaaaggagc cagcagtgtg cattgtcagt ccctaaacaa 721 atgggagcca gagttaccaa gctgcttcaa gggagtgata tgtcgtctcc ctcaggagat 781 gagtggattc cagaaggggt tgggaatgaa aaaagaatat tattatggag agaatgtaac 841 cttggaatgc gaggatgggt atactctaga aggcagttct caaagccagt gtcagtctga 901 tggcagctgg aatcctcttc tggccaaaag tgtatcgcgc tcaatcagtg gtctaattgt 961 tggaattttc attgggatga tcatctttat tttattcatc attgttttca tttggatgat 1021 tctgaagtat aaaaaacgca ataccacaga tgaaaagtat aaagaagtgg gtattcattt 1081 aaattataaa ggagacagct gtgtctgcct tcagtctctg ctcacaagtc aggagaacag 1141 cactaccact agcccagcac agaattcact cgctcaagaa gtctcctaaa tagcagcaac 1201 gtgaaatgag aacatgtctt tctgtatcat ttttaaaata aactatttct tttaagaaaa 1261 aaaagaaaga aa // LOCUS BSURGRRNB 7430 bp ds-DNA BCT 10-JUL-1990 DEFINITION B.subtilis rrnB operon with 23S rRNA, 16SrRNA, 5S rRNA and tRNA gene cluster: Val-, Thr-, Lys-, Leu-cug-, Gly-ggc-, Leu-uua-, Arg-, Pro-, Ala-, Met-, Ile-, Ser-uca-, Met-f-, Asp-, Phe-, His-, Gly-gga-, Ile-, Asn-, Ser-agc- and Glu-tRNA. 
ACCESSION K00637 M10606 X00007 KEYWORDS 23S ribosomal RNA; 5S ribosomal RNA; ribosomal RNA; transfer RNA; transfer RNA-Ala; transfer RNA-Arg; transfer RNA-Asn; transfer RNA-Asp; transfer RNA-Glu; transfer RNA-Gly; transfer RNA-His; transfer RNA-Ile; transfer RNA-Leu; transfer RNA-Lys; transfer RNA-Met; transfer RNA-Phe; transfer RNA-Pro; transfer RNA-Ser; transfer RNA-Thr; transfer RNA-Val. SOURCE B.subtilis 168 DNA, library of Ferrari et al, clone pBC204 [1]; clone pGS227 [2]; clone pGS332 [3]. ORGANISM Bacillus subtilis Prokaryota; Bacteria; Firmicutes; Endospore-forming rods and cocci; Bacillaceae. REFERENCE 1 (bases 4897 to 7430) AUTHORS Green,C.J. and Vold,B.S. TITLE Sequence analysis of a cluster of twenty-one tRNA genes in Bacillus subtilis JOURNAL Nucleic Acids Res. 11, 5763-5774 (1983) STANDARD simple staff_review REFERENCE 2 (bases 1 to 1168) AUTHORS Stewart,G.C. and Bott,K. TITLE DNA sequence of the tandem ribosomal RNA promoter for B.subtilis operon rrnB JOURNAL Nucleic Acids Res. 11, 6289-6300 (1983) STANDARD simple staff_review REFERENCE 3 (bases 1 to 7430; revises [1],[2]) AUTHORS Green,C.J., Stewart,G.C., Hollis,M.A., Vold,B.S. and Bott,K.F. TITLE Nucleotide sequence of the Bacillus subtilis ribosomal RNA operon, rrnB JOURNAL Gene 37, 261-266 (1985) STANDARD simple staff_review REFERENCE 4 (sites for [1],[2] and [3]) AUTHORS Su,S.L. and Dubnau,D. TITLE Binding of Bacillus subtilis ermC' methyltransferase to 23S rRNA JOURNAL Biochemistry 29, 6033-6042 (1990) STANDARD simple staff_entry COMMENT Draft entry and sequence in computer readable form for [1],[2],[3] kindly provided by K.F.Bott, 26-DEC-1985. The RNAs, encoded by the sequence presented below, are probably transcribed as one polycistronic unit, including the tRNA region, because there are no obvious terminator stem loop structures until after the end of the tRNA region at positions 7245-7272 and 7392-7413 [3]. 
[1] notes that though the Ile-tRNA-nau sequence has the methionine anticodon "cau", it is highly homologous to Ile-tRNA-gau; the "c" in the wobble position may be post-transcriptionally modified to recognize "aua" codons. Promoter P1 is located at positions 184-189 (-35 region) and 207-213 (-10 region), and P2 at 276-281 (-35 region) and 299-304 (-10 region). A third promoter region could be at positions 5517-5522. A potential stem-loop structure, necessary for processing of the mature 16S rRNA, is found at positions 327-360 [2]. FEATURES from to/span description rRNA 485 2034 16S rRNA rRNA 2203 5129 23S rRNA rRNA 5185 5300 5S rRNA tRNA 5322 5397 Val-tRNA tRNA 5430 5504 Thr-tRNA tRNA 5543 5618 Lys-tRNA tRNA 5629 5715 Leu-tRNA-cug tRNA 5721 5795 Gly-tRNA-ggc tRNA 5810 5895 Leu-tRNA-uua tRNA 5905 5981 Arg-tRNA tRNA 5997 6073 Pro-tRNA tRNA 6079 6151 Ala-tRNA tRNA 6172 6248 Met-tRNA tRNA 6251 6327 Ile-tRNA-nau tRNA 6334 6425 Ser-tRNA-uca tRNA 6443 6519 Met-tRNA-f tRNA 6531 6607 Asp-tRNA tRNA 6620 6695 Phe-tRNA tRNA 6712 6788 His-tRNA tRNA 6799 6872 Gly-tRNA-gga tRNA 6888 6964 Ile-tRNA-gau tRNA 6975 7049 Asn-tRNA tRNA 7053 7143 Ser-tRNA-agc tRNA 7169 7240 Glu-tRNA revision 504 504 c in [3]; t in [2] revision 571 573 tcc in [3]; tc in [2] revision 5029 5031 gga in [3]; ga in [1] anticdn 5355 5357 Val-tRNA anticodon tac anticdn 5463 5465 Thr-tRNA anticodon tgt anticdn 5576 5578 Lys-tRNA anticodon ttt anticdn 5663 5665 Leu-tRNA-cug anticodon cag anticdn 5753 5755 Gly-tRNA-ggc anticodon gcc anticdn 5844 5846 Leu-tRNA-uua anticodon taa anticdn 5939 5941 Arg-tRNA anticodon acg anticdn 6031 6033 Pro-tRNA anticodon tgg anticdn 6112 6114 Ala-tRNA anticodon tgc revision 6165 6167 act in [3]; at in [1] anticdn 6206 6208 Met-tRNA anticodon cat anticdn 6285 6287 Ile-tRNA-nau anticodon cat anticdn 6370 6372 Ser-tRNA-uca anticodon tga anticdn 6477 6479 Met-tRNA-f anticodon cat anticdn 6565 6567 Asp-tRNA anticodon gtc anticdn 6653 6655 Phe-tRNA anticodon gaa anticdn 6746 6748 His-tRNA 
anticodon gtg anticdn 6831 6833 Gly-tRNA-gga anticodon tcc anticdn 6922 6924 Ile-tRNA-gau anticodon gat anticdn 7007 7009 Asn-tRNA anticodon gtt anticdn 7087 7089 Ser-tRNA-agc anticodon gct anticdn 7202 7204 Glu-tRNA anticodon ttc BASE COUNT 1906 a 1694 c 2125 g 1705 t ORIGIN 65 bp upstream of MboI site; 280 degrees on the B.subtilis map. 1 ctttaatgct ccccttgtgg tcatcagtat ttagttcgtt tcacatacaa gaaaacgaaa 61 aaaacaacaa gatcacatga ctgatgtata tgttctttta agaaacttat atgatacaca 121 cgctttagaa atcatggcga ggattatagt ttatttgttt tatagatttt ttttaaaaaa 181 ctattgcaat aaataaatac aggtgttata ttattaaacg tcgctgatgc acagcggaca 241 caactagatg cttcaaaaca acttgaaaaa agttgttgac aaaaaagaag ctgaatgtta 301 tattagtaaa gctgcttcat tgagaagtaa cgaaatgatc tttgaaaact aaacaagaca 361 aaacgtacct gttaattcag tttttaaaaa tcgcactgcg atgtgcgtat catcaaacag 421 ggcctgcacg acgcaggtca cacaggtgtc gccgcaggat gcggtgaact taacctgtga 481 tccatttatc ggagagtttg atcctggctc aggacgaacg ctggcggcgt gcctaataca 541 tgcaagtcga gcggacaggt gggagcttgc tccgatgtta gcggcggacg ggtgagtaac 601 acgtgggtaa cctgcctgta agactgggat aactccggga aaccggggct aataccggat 661 ggttgtttga accgcatggt tcaaacataa aaggtggctt cggctaccac ttacagatgg 721 acccgcggcg cattagctag ttggtgaggt aacggctcac caaggcaacg atgcgtagcc 781 gacctgagag ggtgatcggc cacactggga ctgagacacg gcccagactc ctacgggagg 841 cagcagtagg gaatcttccg caatggacga aagtctgacg gagcaacgcc gcgtgagtga 901 tgaaggtttt cggatcgtaa agctctgttg ttagggaaga acaagtaccg ttcgaacagg 961 gcggtacctt gacggtacct aaccagaaag ccacggctaa ctacgtgcca gcagccgcgg 1021 taatacgtag gtggcaagcg ttttccggaa ttattgggcg taaagggctc gcaggcggtt 1081 tcttaagtct gatgtgaaag cccccggctc aaccggggag ggtcattgga aactggggaa 1141 cttgagtgca gaagaggaga gtggaattcc acgttgtagc ggtgaaatgc gtagagatgt 1201 ggaggaacac cagtggcgaa ggcgactctc tggtctgtaa ctgacgctga ggagcgaaag 1261 cgtggggagc gaacaggatt agataccctg gtagtccacg ccgtaaacga tgagtgctaa 1321 gtgttagggg gtttccgccc cttagtgctg cagctaacgc attgagcact ccgcctgggg 1381 agtacggtcg caagactgaa actcaaagga 
attgacgggg gcccgcacaa gcggtggagc 1441 atgtggttta attcgaagca acgcgaagaa ccttactagg tcttgacatc ctctgacaat 1501 cctagagata ggacgtcccc ttcggggcag agtgacaggt ggtgcatggt tgtcgtcagc 1561 tcgtgtcgtg agatgttggg ttaagtcccg caacgagcgc aacccttgat cttagttgcc 1621 agcattcagt tgggcactct aaggtgactg ccggtgacaa accggaggaa ggtggggatg 1681 acgtcaaatc atcatgcccc ttatgacttg ggctacacac gtgctacaat ggacagaaca 1741 aagggcagcg aaccgcgagg ttaagccaat cccacaaatc tgttctcagt tcggatcgca 1801 gtctgcaact cgactgcgtg aagctggaat cgctagtaat cgcggatcag catgccgcgg 1861 tgaatacgtt cccgggcctt gtacacaccg cccgtcacac cacgagagtt tgtaacaccc 1921 gaagtcggtg aggtaacctt ttaggagcca gccgccgaag gtgggacaga tgattggggt 1981 gaagtcgtaa caaggtagcc gtatcggaag gtgcggctgg atcacctcct ttctaaggat 2041 attatacgga atataagacc caaggtctta taaacagaac gttccctgtc ttgtttagtt 2101 ttgaaggatc attccttcga aacgtgttct ttgaaaacta gataacagta gacatcacat 2161 tcaattagta acacaagata tcacatagtg attcttttta acggttaagt tagaaagggc 2221 gcacggtgga tgccttggca ctaggagccg atgaaggacg ggacgaacac cgatatgctt 2281 cggggagctg taagcaagct ttgatccgga gatttccgaa tggggaaacc caccactcgt 2341 aatggagtgg tatccatatc tgaattcata ggatatgaga aggcagaccc ggggaactga 2401 aacatctaag tacccggaga agagaaagca aatgcgattc cctgagtagc ggcgacgaac 2461 acgggatcag cccaaaccaa gaggcttgcc tctgtggttg taggacactc tgtacggagt 2521 tacaaaagaa cgaggtagat gaagaggtct ggaaagggcc cgccatagga ggtaacagcc 2581 ctgtagtcaa aacttcgttc tctcctgagt ggatcctgag tacggcggaa cacgtgaaat 2641 tccgtcggaa tccgggagga ccatctccca aggctaaata ctccctagtg accgatagtg 2701 aaccagtacc gtgagggaaa ggtgaaaagc accccggaag gggagtgaaa gagatcctga 2761 aaccgtgtgc ctacaagtag tcagagcccg ttaacggtga tggcgtgcct tttgtagaat 2821 gaaccggcga gttacgatcc cgtgcaaggt taagcagaag atgcggagcc gcagcgaaag 2881 cgagtctgaa tagggcgcat gagtacgtgg tcgtagaccc gaaaccaggt gatctaccca 2941 tgtccagggt gaagttcagg taacactgaa tggaggcccg aacccacgca cgttgaaaag 3001 tgcggggatg aggtgtgggt aggggtgaaa tgccaatcga acctggagat agctggttct 3061 ctccgaaata gctttagggc tagcctcaag gtaagagtct 
tggaggtaga gcactgattg 3121 gactaggggc cctcaccggg ttaccgaatt cagtcaaact ccgaatgcca atgacttatc 3181 cttgggagtc agactgcgag tgataagatc cgtagtcgaa agggaaacag cccagaccgc 3241 cagctaaggt cccaaagtat acgttaagtg gaaaaggatg tggagttgct tagacaacca 3301 ggatgttggc ttagaagcag ccaccattta aagagtgcgt aatagctcac tggtcgagtg 3361 actctgcgcc gaaaatgtac cggggctaaa cgtatcaccg aagctgcgga ctgttcttcg 3421 aacagtggta ggagagcgtt ctaagggctg tgaagccaga ccggaaggac tggtggacgg 3481 cttagaagtg agaatgccgg tatgagtagc gaaaagaggg gtgagaatcc ctccaccgaa 3541 tgcctaaggg ttcctgagga aggctcgtcc gctcagggtt agtcgggacc taagccgagg 3601 ccgaaaggcg taggcgatgg acaacaggtt gatattcctg taccacctcc tcaccatttg 3661 agcaatgggg ggtcgcagga ggatagggta agcgcggtat tggatatccg cgtccaagca 3721 gttaggctgg gaaataggca aatccgtttc ccataaggct gagctgtgat ggcgagcgaa 3781 atatagtagc gaagttcctg attccacact gccaagaaaa gcctctagcg aggtgagagg 3841 tgcccgtacc gcaaaccgtc acaggtaggc gaggagagaa tcctaaggtg atcgagagaa 3901 ctctcgttaa ggaactcggc aaaatgaccc cgtaacttcg ggagaagggg tgctctgtta 3961 gggtgcaagc ccgagagagc cgcagtgaat aggcccaggc gactgtttag caaaaacaca 4021 ggtctctgcg aagccgtaag gcgaagtata ggggctgacg cctgcccggt gctggaaggt 4081 taagaggagc gcttagcgta agcgaaggtg cgaattgaag ccccagtaaa cggcggccgt 4141 aactataacg gtcctaaggt agcgaaattc cttgtcgggt aagttccgac ccgcacgaaa 4201 ggcgcaacga tctgggcgct gtctcaacga gagactcggt gaaattatag tacctgtgaa 4261 gatgcaggtt acccgcgaca ggacggaaag accccgtgga gctttactgc agcctgatat 4321 tgaatgttgg tacagcttgt acaggatagg taggagcctt ggaaaccgga gcgccagctt 4381 cggtggaggc atcggtggga tactaccctg gctgtattga ccttctaacc ccccgccctt 4441 atcgggcggg gagacagtgt caggtgggca gtttgactgg ggcggtcgcc tcctaaaagg 4501 taacggaggc gcccaaaggt tccctcagaa tggttggaaa tcattcgcag agtgtaaagg 4561 cacaagggag cttgactgcg agacctacaa gtcgagcagg gacgaaagtc gggcttagtg 4621 atccggtggt tccgcatgga agggccatcg ctcaacggat aaaagctacc ccggggataa 4681 caggcttatc tcccccaaga gctccacatc gacggggagg tttggcacct cgatgtcggc 4741 tcatcgcatc ctggggctgt agtcggtccc aagggttggg ctgttcgccc 
attaaagcgg 4801 tacgcgagct gggttcagaa cgtcgtgaga cagttcggtc cctatccgtc gcgggcgctg 4861 gaaatttgag aggagctgtc cttagtacga gaggaccggg atggacgcac cgctggtgta 4921 ccagttgttc tgccaagggc atcgctgggt agctatgtgc ggacgggata agtgctgaaa 4981 gcatctaagc atgaagcccc cctcaagatg agatttccca ttccgcaagg aagtaagatc 5041 cctgaaagat gatcaggttg ataggtctga ggtggaagtg tggcaacaca tggagctgac 5101 agatactaat cgatcgagga cttaaccata tttttgaatg atgtcacacc tgttatctag 5161 ttttgagaga acactctcaa tttgtttggt ggcgatagcg aagaggtcac acccgttccc 5221 ataccgaaca cggaagttaa gctcttcagc gccgatggta gtcgggggtt tccccctgtg 5281 agagtaggac gccgccaagc aattgcacgt tagtgcaata tggaggatta gctcagctgg 5341 gagagcatct gccttacaag cagagggtcg gcggttcgag cccgtcatcc tccaccattt 5401 ttcattatac atatcggttt tacatatatg ccggtgtagc tcaattggta gagcaactga 5461 cttgtaatca gtaggttggg ggttcaagtc ctcttgccgg caccactttt atatgatata 5521 atattcaagt ctattgtaag aagagccatt agctcagttg gtagagcatc tgacttttaa 5581 tcagagggtc gaaggttcga gtccttcatg gctcaccatt tacatgttgc ggatgtggcg 5641 gaattggcag acgcgctaga atcaggctct agtgtcttta cagacgtggg ggttcaagtc 5701 ccttcatccg caccatttct gcggaagtag ttcagtggta gaacaccacc ttgccaaggt 5761 gggggtcgcg ggttcgaatc ccgtcttccg ctccaactat accatccacg ccggggtggt 5821 ggaattggca gacacacagg acttaaaatc ctgcggtagg tgactaccgt gccggttcaa 5881 gtccggccct cggcattaag ttttgcgccc gtagctcaat tggatagagc gtttgactac 5941 ggatcaaaag gttaggggtt cgactcctct cgggcgcgcc atgatctata tgaaatcggg 6001 aagtagctca gcttggtaga gcacatggtt tgggaccatg gggtcgcagg ttcgaatcct 6061 gtcttcccga ccattcttgg ggccttagct cagctgggag agcgcctgct ttgcacgcag 6121 gaggtcagcg gttcgatccc gctaggctcc acttgatttc aaaaactatt tggcggtgta 6181 gctcagctgg ctagagcgta cggttcatac ccgtgaggtc gggggttcga tcccctccgc 6241 cgctaccaat ggacctttag ctcagttggt tagagcagac ggctcataac cgtccggtcg 6301 taggttcgag tcctacaagg tccaccacta tacggaggaa tacccaagtc tggctgaagg 6361 gatcggtctt gaaaaccgac agggtgtcaa agcccgcggg ggttcgaatc cctcttcctc 6421 cgccatacat attcctaatc atcgcggggt ggagcagttc ggtagctcgt cgggctcata 
6481 acccgaaggt cgcaggttca aatcctgccc ccgcaaccaa attttaaaat ggtccggtag 6541 ttcagttggt tagaatgcct gcctgtcacg caggaggtcg cgggttcgag tcccgtccgg 6601 accgccattt aaatacttag gctcggtagc tcagttggta gagcaacgga ctgaaaatcc 6661 gtgtgtcggc ggttcgattc cgtcccgagc caccatttat caatatgctt tggcggttgt 6721 ggcgaagtgg ttaacgcacc agattgtggc tctggcattc gtgggttcga ttcccatcaa 6781 tcgccccaaa taaaaattgc gggtgtagtt tagtggtaaa acctcagcct tccaagctga 6841 tgtcgtgggt tcgattccca tcacccgctc catttctata tcgtcatggg cctgtagctc 6901 agctggttag agcgcacgcc tgataagcgt gaggtcgatg gttcgagtcc attcaggccc 6961 accatgactt ttgttccaca gtagctcagt ggtagagcta tcggctgtta accgatcggt 7021 cgcaggttcg aatcctgcct gtggagccaa atggagaagt actcaagtgg ctgaagaggc 7081 gcccctgcta agggtgtagg tcgtgtaagc ggcgcgaggg ttcaaatccc tccttctccg 7141 ccatatgatt acagatatca taattatcgg cccgttggtc aagcggttaa gacaccgccc 7201 tttcacggcg gtaacacggg ttcgaatccc gtacgggtca tcccagaagc cttgcatatc 7261 ctgcaaggtt tttttgtttt tataaatcat gtatatgtct tagattttgt tctttatttt 7321 aaaaacagac tacaaaaatc tccatatatt tcgtttttct tcagaaaatg aagttaattg 7381 tctataagta taagccgttt cagggaaagg gctttttttt atttcttcga // LOCUS ECOAROCX 1690 bp ds-DNA BCT 10-JUL-1990 DEFINITION E.coli chorismate synthase (aroC) gene, complete cds. ACCESSION M33021 KEYWORDS aroC gene; chorismate synthase. SOURCE E.coli (strain K12) DNA. ORGANISM Escherichia coli Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Facultatively anaerobic rods; Enterobacteriaceae. REFERENCE 1 (bases 1 to 1690) AUTHORS White,P.J., Millar,G. and Coggins,J.R. TITLE The overexpression, purification and complete amino acid sequence of chorismate synthase from Escherichia coli K12 and its comparison with the enzyme from Neurospora crassa JOURNAL Biochem. J. 
251, 313-322 (1988) STANDARD simple staff_review FEATURES from to/span description pept 492 1562 chorismate synthase (EC 4.6.1.4) BASE COUNT 403 a 467 c 466 g 354 t ORIGIN 1 gtcgacgcgg tggatatctc tccagacgcg ctggcggttg ctgaacagaa catcgaagaa 61 cacggtctga tccacaacgt cattccgatt cgttccgatc tgttccgcga cttgccgaaa 121 gtgcagtacg acctgattgt cactaacccg ccgtatgtcg atgcgaagat atgtccgacc 181 tgccaaacaa taccgccacg agccggaact gggcctggca tctggcactg acggcctgaa 241 actgacgcgt cgcattctcg gtaacgcggc agattacctt gctgatgatg gcgtgttgat 301 ttgtgaagtc ggcaacagca tggtacatct tatggaacaa tatccggatg ttccgttcac 361 ctggctggag tttgataacg gcggcgatgg tgtgtttatg ctcaccaaag agcagcttat 421 tgccgcacga gaacatttcg cgatttataa agattaagta aacacgcaaa cacaacaata 481 acggagccgt gatggctgga aacacaattg gacaactctt tcgcgtaacc accttcggcg 541 aatcgcacgg gctggcgctc ggctccatcg tcgatggtgt tccgccagcc attccgctga 601 cggaagcgga cctgcaacat gacctcgacc gtcgtcgccc tgggacatcg cgctatacca 661 cccagcgccg cgagccggat caggtcaaaa ttctctccgg tgtttttgaa ggcgttacta 721 ccggcaccag cattggcttg ttgatcgaaa acactgacca gcgctctcag gattacagtg 781 cgattaagga cgttttccgt ccaggccatg ccgattacac ctacgaacaa aaatacggtc 841 tgcgcgatta tcgcggcggt ggacgttctt ccgcccgcga aaccgccatg cgcgtggcgg 901 caggagctat tgccaaaaaa tatctcgccg agaaatttgg tattgaaatc cgtggctgcc 961 tgacccagat gggcgacatt ccgctggata tcaaagactg gtcgcaggtc gagcaaaatc 1021 cgtttttttg cccggacccc gacaaaatcg acgcgttaga cgagttgatg cgtgcgctga 1081 aaaaagaggg cgactccatc ggcgctaaag tcaccgttgt tgccagtggc gttcctgccg 1141 gacttggcga gccggtcttt gaccgcctgg atgctgacat cgcccatgcg ctgatgagca 1201 tcaacgcggt gaaaggcgtg gaaattggcg acggctttga cgtggtggcg ctgcgcggca 1261 gccagaaccc cgatgaaatc accaaagacg gtttccagag caaccatgcg ggcggcattc 1321 tcggcggtat cagcagcggg cagcaaatca ttgcccatat ggcgctgaaa ccgacctcca 1381 gcattaccgt gccgggtcgt accattaacc gctttggcca agaagttgag atgatcacca 1441 aaggccgtca cgatccctgt gtcgggatcc gcgcagtgcc gatcgcagaa gcgaatgctg 1501 gcgatcgttt taatggatca cctgttacgg caacgggcgc aaaatgccga tgtgaagact 
1561 gatattccac gctggtaaaa aatgaataaa accgcgattg cgctgctggc tctgcttgcc 1621 agtagcgcca gcctggcagc gacggcgtgg caaaaaataa cccaacctgt gccgggtagc 1681 gccaaatcga // LOCUS PFAMSA2 819 bp ds-DNA INV 10-JUL-1990 DEFINITION P.falciparum 45 kD merozoite surface antigen (MSA 2) gene, complete cds. ACCESSION M28891 KEYWORDS integral membrane protein; surface antigen. SOURCE P.falciparum DNA, clone 3D7. ORGANISM Plasmodium falciparum Eukaryota; Animalia; Protozoa; Microspora; Microsporea; Microsporida; Haemosporina; Plasmodiidae. REFERENCE 1 (bases 1 to 819) AUTHORS Smythe,J.A., Peterson,M.G., Coppel,R.L., Saul,A.J., Kemp,D.J. and Anders,R.F. TITLE Structural diversity in the 45-kilodalton merozoite surface antigen of Plasmodium falciparum JOURNAL Mol. Biochem. Parasitol. 39, 227-234 (1990) STANDARD full staff_review COMMENT Draft entry and computer readable copy of sequence [1] kindly provided by J.A. Smythe, 06-OCT-1989. FEATURES from to/span description pept 1 819 45 kD merozoite surface antigen precursor sigp 1 60 45 kD merozoite surface antigen signal peptide matp 61 819 45,000 merozoite surface antigen rpt 157 228 12 base repeat rpt 301 321 9 base repeat BASE COUNT 304 a 157 c 143 g 215 t ORIGIN 1 atgaaggtaa ttaaaacatt gtctattata aatttcttta tttttgttac ctttaatatt 61 aaaaatgaaa gtaaatatag caacacattc ataaacaatg cttataatat gagtataagg 121 agaagtatgg cagaaagtaa gccttctact ggtgctggtg gtactgctgg tggtagtgct 181 ggtggtagtg ctggtggtag tgctggtggt agtgctggtg gtagtgctgg ttctggtgat 241 ggtaatggtg cagatgctga gggaagttca agtactcccg ctactaccac aactaccaaa 301 actaccacaa ctaccacaac tactaatgat gcagaagcat ctaccagtac ctcttcagaa 361 aatccaaatc ataaaaatgc cgaaacaaat ccaaaaggta aaggagaagt tcaagaacca 421 aatcaagcaa ataaagaaac tcaaaataac tcaaatgttc aacaagactc tcaaactaaa 481 tcaaatgttc cacccactca agatgcagac actaaaagtc ctactgcaca acctgaacaa 541 gctgaaaatt ctgctccaac agccgaacaa actgaatccc ccgaattaca atctgcacca 601 gagaataaag gtacaggaca acatggacat atgcatggtt ctagaaataa tcatccacaa 661 aatacttctg 
atagtcaaaa agaatgtacc gatggtaaca aagaaaactg tggagcagca 721 acatccctct taaataactc tagtaatatt gcttcaataa ataaatttgt tgttttaatt 781 tcagcaacac ttgttttatc ttttgccata ttcatataa // LOCUS PFAMSA2X 864 bp ds-DNA INV 10-JUL-1990 DEFINITION P.falciparum 45,000 merozoite surface antigen (MSA2) gene, complete cds. ACCESSION M28892 KEYWORDS integral membrane protein; surface antigen. SOURCE P.falciparum (isolate Indochina 1) DNA. ORGANISM Plasmodium falciparum Eukaryota; Animalia; Protozoa; Microspora; Microsporea; Microsporida; Haemosporina; Plasmodiidae. REFERENCE 1 (bases 1 to 864) AUTHORS Smythe,J.A., Peterson,M.G., Coppel,R.L., Saul,A.J., Kemp,D.J. and Anders,R.F. TITLE Structural diversity in the 45-kilodalton merozoite surface antigen of Plasmodium falciparum JOURNAL Mol. Biochem. Parasitol. 39, 227-234 (1990) STANDARD full staff_review COMMENT Draft entry and computer readable copy of sequence [1] kindly provided by J.A. Smythe, 06-OCT-1989. FEATURES from to/span description pept 1 864 45 kD merozoite surface antigen precursor sigp 1 60 45 kD merozoite surface antigen signal peptide matp 61 864 45 kD merozoite surface antigen rpt 169 312 12 base repeat sequence rpt 379 397 9 base repeat sequence BASE COUNT 288 a 157 c 179 g 240 t ORIGIN 1 atgaaggtaa ttaaaacatt gtctattata aatttcttta tttttgttac ctttaatatt 61 aaaaatgaaa gtaaatatag caacacattc ataaacaatg cttataatat gagtataagg 121 agaagtatga cagaaagtaa tcctcctact ggtgctagtg gtagtgctgg tggtagtgct 181 ggtggtagtg ctggtggtag tgctggtggt agtgctggtg gtagtgctgg tggtagtgct 241 ggtggtagtg ctggtggtag tgctggtggt agtgctggtg gtagtgctgg tggtagtgct 301 ggtggtagtg ctggttctgg tgatggtaat ggtgctaatc ctggtgcaga tgctgagaga 361 agtccaagta ctcccgctac taccacaact accacaacta ctaatgatgc agaagcatct 421 accagtacct cttcagaaaa tccaaatcat aataatgccg aaacaaatca agcaaataaa 481 gaaactcaaa ataactcaaa cgttcaacaa gactctcaaa ctaaatcaaa tgttccaccc 541 actcaagatg cagacactag aagtcctact gcacaacctg aacaagctga aaattctgct 601 ccaacagccg aacaaactga atcccccgaa ttacaatctg 
caccagagaa taaaggtaca 661 ggacaacatg gacatatgca tggttctaga aataatcatc cacaaaatac ttctgatagt 721 caaaaagaat gtaccgatgg taacaaagaa aactgtggag cagcaacatc cctcttaaat 781 aactctagta atattgcttc aataaataaa tttgttgttt taatttcagc aacacttgtt 841 ttatcttttg ccatattcat ataa // LOCUS XELRASX 1143 bp ss-mRNA VRT 10-JUL-1990 DEFINITION X.laevis ras protein mRNA, complete cds. ACCESSION M34657 KEYWORDS ras protein. SOURCE X.laevis defolliculated oocyte, cDNA to mRNA. ORGANISM Xenopus laevis Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Amphibia; Lissamphibia; Anura; Archeobatrachia; Pipoidea; Pipidae. REFERENCE 1 (bases 1 to 1143) AUTHORS Andeol,Y., Gusse,M. and Mechali,M. TITLE Characterization and expression of a Xenopus ras during oogenesis and development JOURNAL Dev. Biol. 139, 24-34 (1990) STANDARD simple staff_review FEATURES from to/span description pept 196 756 ras protein mRNA < 1 1143 ras protein mRNA BASE COUNT 346 a 275 c 279 g 243 t ORIGIN 1 gaattcgcca gtgttacaga atgggagttc tgaggcgctg tgactaatcc cccccacccc 61 cgcatattgg ggaaatccac cggcgggcag aaagccagag ggagaactaa ggggggccaa 121 accaaaggaa aacgcaggag ccaaagcctc cagaaacaca gggatccgtg acgagcccga 181 gtcggtgctg gtgaaatgac ggagtacaaa ctggtggtgg ttggtgctgg aggcgtgggg 241 aagagcgcac tcacaatcca gctcattcag aaccattttg tggacgagta tgatcctact 301 attgaggact cgtataggaa gcaggtggtg atagacgggg agacctgcct cctagatatc 361 ctggacactg cggggcaaga ggaatacagc gctatgaggg atcagtacat gcgcacggga 421 gaaggctttc tctgtgtctt tgctattaac aacacaaagt ccttcgagga cgtccatcat 481 tacagggaac agattaacag agttaaagat tccgatgacg ttcccatggt gttagttggt 541 aacaaatgcg acctcccatc ccggactgtg gacacaaagc aagcgcagga actggcaaag 601 agctatggta ttccttttat agagacctct gccaaaacta gacagggagt cgaagacgcc 661 ttctataccc tagtccgtga aatccgcaag cacaaggaga agatcagcaa cgggaaaaag 721 aaaaagtcct ccaaaaggaa gtgtgtcgtt ctttaacgtg ccaacctgcc cccccctgcc 781 atcctcgtgg atcagagaaa accgtgccat cacacacctg aagtcaaaga aaaaaaaagt 841 gtggactttt gtcgttgctg tggaaaccat tgaattgcca tgaaatttaa 
aaaaaaaacc 901 aaaacattga ccacttattt taacacaacc gataaatggc acaggctgtg ccccaatcgt 961 gtatatattc ttcatgaaca aactgtttta tcagaaagac agatgcaata gccccttctt 1021 tttaccccaa ttaaccctcc tcctggtttc tatttctccc tggaaaagac gttggtcgac 1081 cagaggggaa gaacctgccc aggcctttct tacagcccca tttgaataaa gattgaaaca 1141 ctc // LOCUS HUMSPTB 6765 bp ss-mRNA PRI 10-JUL-1990 DEFINITION Human beta-spectrin (SPTB) mRNA, complete cds. ACCESSION J05500 KEYWORDS beta-spectrin; spectrin. SOURCE Human fetal liver, cDNA to mRNA, clones beta-[28,21A,29,286] and V252. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 6765) AUTHORS Winkelmann,J.C., Chang,J.G., Tse,W.T., Marchesi,V.T. and Forget,B.G. TITLE Full length sequence of the cDNA for human erythroid beta-spectrin JOURNAL J. Biol. Chem. (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by J.C.Winkelmann, 08-MAY-1990. FEATURES from to/span description pept 96 6509 beta-spectrin /nomgen="SPTB" /map="14" /hgml_locus_uid="LS0033T" mRNA < 1 6765 SPTB mRNA signal 6716 6722 poly-A signal BASE COUNT 1627 a 1822 c 2144 g 1172 t ORIGIN Chromosome 14q23-q24. 
1 cgccaccccc gggctcgggt ggccccgctt cagtcccagg gcagggatcc ttccatgaag 61 actgaggcag gcggagctgc taagagcctg ctgacatgac atcggccaca gagtttgaaa 121 atgtgggcaa ccagccacct tacagcagga tcaatgcccg ctgggacgcc ccagacgacg 181 agctggataa tgacaatagc tcagccaggc tctttgagag gtcccggata aaggccttgg 241 cagatgagcg ggaagttgtt cagaaaaaga ccttcacgaa atgggtgaac tcgcacctgg 301 ctcgagtgtc ctgccgcatc accgatctct acaaggacct gcgggatggg cgcatgctca 361 tcaagctgct ggaggtgctc tctggagaga tgctgccaaa gcccaccaag gggaagatgc 421 gcatccactg cctggagaat gtggacaagg ctctccagtt cctcaaggag cagcgtgtac 481 acctggagaa catgggctcc catgacattg tagatggcaa ccaccgcctg gtcctgggcc 541 tcatctggac catcatcctc cgcttccaga ttcaggacat tgtggtccaa actcaggaag 601 gtcgtgaaac acgctcagcc aaggatgcgt tgctgttgtg gtgtcagatg aagacggcag 661 gctaccctca tgttaatgtc accaacttta cctccagctg gaaggatggc ttggccttta 721 atgccctgat acacaagcac cggcccgacc tgatcgactt tgataagctg aaggactcca 781 atgcccggca caacctggag cacgcattca atgtggctga gcgccagctg ggcatcatcc 841 cgctcctcga ccccgaagat gtctttacgg aaaaccctga tgagaaatcc atcatcacct 901 atgtggtggc cttttaccac tacttctcca agatgaaggt gctggcagtg gagggcaagc 961 gtgtcggcaa ggttattgac catgccattg agactgagaa gatgattgaa aagtacagcg 1021 ggctagcctc ggacctgctc acctggatcg agcagaccat cactgtcctg aacagccgca 1081 agtttgccaa ctcgctgacg ggcgtccagc agcagctgca ggccttcagc acctaccgca 1141 ccgtggagaa gccgcccaag tttcaagaga aggggaatct ggaagttcta ctttttacca 1201 tccagtcccg gatgagagcc aacaatcaga aagtgtacac accccacgat gggaaactag 1261 tgtctgacat caacagggcc tgggaaagcc tggaggaagc tgggtatcgg cgggagctgg 1321 ccctgagaaa tgagctcatt cggcaggaga agctagagca actagcccgg cgctttgacc 1381 ggaaggccgc aatgagagag acctggctca atgaaaacca gcgcctcgtg gcccaggata 1441 actttgggta tgacctggca gctgtggagg ccgccaagaa gaagcatgag gccatcgaga 1501 ccgacacggc tgcctacgag gagcgggtga gagccctgga ggacctggct caggagctgg 1561 agaaagagaa ctaccatgac cagaagcgca tcacggcccg caaggacaat atactgcgcc 1621 tatggagcta cctgcaggag ctgctgcagt cccggcgcca gaggctcgag accaccctgg 1681 cactgcagaa gctcttccag 
gacatgctgc acagcatcga ctggatggat gagatcaagg 1741 ctcacctctt gtctgccgag tttgggaagc acttgttgga ggttgaagac ctgctacaga 1801 agcacaagtt gatggaagct gacatcgcca tccaagggga caaagtgaag gccatcaccg 1861 cagccaccct gaagttcacc gaggggaaag ggtaccagcc ttgtgacccc caggtcatcc 1921 aggaccgcat gagccacttg gagcagtgct ttgaggagct gagcaacatg gcagctggcg 1981 caaggaccca actggagcag tccaaacgac tctggaagtt cttctgggag atggatgagg 2041 ctgagagctg gatcaaggag aaggagcaga tctattcttc cctggactat ggcaaagacc 2101 tgaccagtgt gctcatctta cagcgcaagc acaaggcctt tgaggatgag ctccgtgggc 2161 tggatgctca cctggagcag atcttccagg aggctcatgg catggttgcg cgcaagcagt 2221 ttgggcaccc gcagatcgag gcccgcatca aggaggtgtc ggcacagtgg gaccagctga 2281 aggacctggc tgccttctgc aagaagaacc tccaggatgc tgagaacttt ttccagttcc 2341 agggcgatgc ggatgacctg aaggcttggc tgcaagacgc ccaccggctg ctctctggtg 2401 aagatgtggg gcaggacgaa ggggccacgc gggccctggg gaaaaagcac aaggacttcc 2461 tggaggagct ggaggagagc cgtggggtga tggagcacct ggagcagcag gcccagggat 2521 tccccgaaga gtttcgggat tccccagatg tgacccatcg gctgtcaggc ctgcgggagc 2581 tctaccaaca ggtggtggcc caggcggacc tgcgtcagca gaggctgcag gaagccctgg 2641 acctgtacac ggtgttcggg gagacagacg cctgtgagct gtggatggga gagaaggaga 2701 agtggctggc cgagatggaa atgccagaca ccctggagga cctggaggtc gtgcagcaca 2761 ggttcgacat cctggaccag gagatgaaga ccttcatgac tcagattgat ggtgtgaacc 2821 tcgctgccaa cagcttggta gagagtggcc acccacgcag cagggaggtg aagcagtacc 2881 aggaccatct gaacaccagg tggcaggcat ttcagaccct ggtgtcggag cggcgggagg 2941 ctgtggactc agccctccga gtgcacacac tatgcgtaga ttgcgaggag accagcaagt 3001 ggatcacgga caagacaaag gtagtggagt ccacaaaaga cctggggcgg gacctggcag 3061 gtatcatcgc catccagagg aagttgtcag ggctggagcg tgacgtggcc gccatccagg 3121 cccgtgtgga tgccctggag cgtgagtccc agcagctgat ggactcgcac cctgagcaga 3181 aggagaatat tggtcagcgg caaaaacact tggaggagct gtggcagggc ctgcagcaat 3241 ccctgcaggg ccaggaggac ttgctggggg aagtcagcca gctgcaggcc ttcctgcagg 3301 atctggatga cttccaggcc tggctctcca tcacccagaa agctgtggcc tctgaggaca 3361 tgcccgaatc cctcccagag gctgagcagc 
tcctgcagca gcatgcaggt atcaaggatg 3421 agattgacgg gcaccaagac agctaccagc gtgttaagga gtctggggag aaagtgatcc 3481 aaggccagac ggacccagag tatctgcttc tgggccagcg gctggagggc ctggatactg 3541 gctgggatgc cctgggcagg atgtgggaga gccgcagcca caccctcgct cagtgccttg 3601 gcttccagga gttccagaaa gatgccaagc aggctgaagc catcctcagc aaccaggaat 3661 acactctggc tcacttggag cccccagact ccctggaagc tgcagaggct gggatccgga 3721 agtttgagga tttcttgggg tctatggaga acaaccggga taaggtcttg agtcctgtgg 3781 actctggaaa caagctggta gctgagggaa acctatactc agacaagatc aaggagaagg 3841 tgcagctgat tgaggacagg cacaggaaga acaacgagaa ggcccaggag gcctctgtcc 3901 tactgagaga caacctggag ctacagaact tcctccagaa ctgccaggag ctcactctct 3961 ggatcaacga caagctgctg acatctcagg atgtctccta tgatgaagca cgaaaccttc 4021 acaataaatg gctaaagcac caggcgtttg tggcagagct ggcttcccat gaagggtggc 4081 tagagaacat cgatgcggaa ggaaagcagc tgatggatga gaagccccag tttacagccc 4141 tggtgtccca aaagctggaa gccctgcacc ggctctggga cgagctgcag gccaccacaa 4201 aggagaagac ccagcacctc tcggctgcca ggagctccga cctgcgcttg cagacccatg 4261 ctgacctcaa caagtggatc agcgccatgg aggaccagct gcgatcagac gacccgggca 4321 aggacctgac cagtgtcaat cggatgttgg ctaagctgaa gcgagtggag gaccaagtga 4381 atgtgcggaa agaggagctg ggggagctgt ttgcccaggt gccttcaatg ggagaggagg 4441 gaggagatgc agacttgagc atcgagaagc ggttcctgga cctcctggaa cccctaggaa 4501 ggaggaagaa gcagctggaa tcatccagag ccaagctgca gatcagccgg gacttagagg 4561 atgagacgct ttgggtggag gagaggctgc ctctggccca gtcagccgac tatggcacta 4621 atctgcaaac tgtgcaactg ttcatgaaga agaaccagac actgcagaat gagattctgg 4681 gccatacgcc gcgggttgag gatgtgctgc agagagggca gcagctggtg gaggcggcgg 4741 agatcgactg ccaggacctt gaggagcgcc tggggcacct gcagagctcc tgggacaggc 4801 tgcgggaggc agcggccggg aggctgcagc gactgaggga cgccaatgag gcacagcagt 4861 actacctgga tgcggacgag gctgaggcct ggattggcga gcaggagctc tatgtcatct 4921 ccgatgagat ccccaaggat gaagagggcg ccatcgtgat gctgaagcga catttgcggc 4981 agcagcgtgc ggtggaggac tacggccgga acatcaagca gctggccagc cgggcccagg 5041 gcctgctgtc tgcaggccac cctgaggggg aacagatcat 
cagacttcag gggcaagtgg 5101 acaagcacta cgcagggctg aaggacgtgg cggaagagcg caagcgcaag ctggagaaca 5161 tgtaccacct gttccagctc aagcgggaga ccgacgacct ggagcagtgg atttcagaaa 5221 aggagctagt ggcctcttcc ccggaaatgg ggcaagactt tgaccacgtg actcttctgc 5281 gggacaagtt ccgggacttt gcccgggaga ccggggcgat tgggcaggag cgggtggaca 5341 atgtgaatgc cttcatcgag cgactcatcg acgcgggcca cagcgaggcg gccaccatcg 5401 ccgagtggaa ggacgggctg aacgagatgt gggcagacct cctggagctc attgacacgc 5461 gcatgcagct gctggccgcc tcctatgacc tgcaccgcta cttctacacg ggtgccgaga 5521 tcctgggcct catcgacgag aagcaccgcg agctgcccga ggacgtgggg ctggacgcca 5581 gcacggccga gtccttccac cgggtgcaca cagccttcga gcgggacgtt cacctgctgg 5641 gtgtccaggt gcagcagttc caggacgtgg ccacccgtct gcagacagca tatgctgggg 5701 agaaggcaga ggccatccag aacaaggagc aggaggtgtc tgccgcgtgg caggcgctgc 5761 tcgatgcctg tgccgggcgc cggacccagc tagtggacac ggcggataaa ttccgcttct 5821 tcagcatggc ccgtgacctc ctctcctgga tggagagcat catccggcag atcgagaccc 5881 aggagaggcc cagggatgtc tcctctgtgg aactgctcat gaagtatcac cagggcatca 5941 atgcagagat tgaaacccgg agcaagaact tcagtgcctg cctggagctt ggcgagtccc 6001 tgctgcagcg gcagcaccag gcctcagagg agatccgcga gaaactgcag caggtgatgt 6061 ccaggaggaa agagatgaat gagaagtggg aagcccgctg ggagcggctc cgcatgttgc 6121 tggaggtgtg ccagttctcg agggatgcct ctgtggctga ggcgtggctg attgcccagg 6181 agccctacct ggccagcggg gactttggac acacagtgga cagtgtggag aagctcatca 6241 agaggcatga ggcttttgag aagtccacgg ccagctgggc agagcgcttt gctgccctgg 6301 agaagcccac cacgcttgag ctgaaagaac gccagattgc agagagaccc gcagaggaga 6361 ctgggcctca agaggaggaa ggcgagacag caggggaggc tccagtttcc caccatgcgg 6421 ccaccgagag aacgtccccg gtcagtctct ggtctcgttt gtctagttcc tgggagtcac 6481 tgcagccaga gccctctcac ccctactagc tcagcccagg tggaggcgag atgagctgcg 6541 cagccccgcc ctccatcctc cccacatccc tgcagccacc tcccagcaga gcaggctacg 6601 tcctcactga ggtgttcttc atgagagtac tagcctcctc cactcctccc cacagcgcag 6661 aggaaacagg ccagcccagt gacatgacgt tattagtttt gttttacctg aatgtaataa 6721 attttattgt ataaatatat caccatttac atgaggggaa acact // 
LOCUS STYEUTBC 2526 bp ds-DNA BCT 10-JUL-1990 DEFINITION S.typhimurium ethanolamine ammonia-lyase (eutB, eutC) genes, complete cds. ACCESSION J05518 KEYWORDS ethanolamine ammonia-lyase. SOURCE S.typhimurium (strain LT2) DNA, clones pBSE4.5 and pUCE6.5. ORGANISM Salmonella typhimurium Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Facultatively anaerobic rods; Enterobacteriaceae. REFERENCE 1 (bases 1 to 2526) AUTHORS Faust,L.P., Connor,J.A., Roof,D.M., Hoch,J.A. and Babior,B.M. TITLE Cloning, sequencing and expression of the genes encoding the adenosylcobalamin-dependent ethanolamine ammonia-lyase of Salmonella typhimurium JOURNAL J. Biol. Chem. (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by B.M.Babior, 08-MAY-1990. FEATURES from to/span description pept 141 1499 ethanolamine ammonia-lyase (eutB) pept 1518 2378 ethanolamine ammonia-lyase (eutC) binding 130 133 ribosome binding site binding 1507 1510 ribosome binding site BASE COUNT 563 a 687 c 779 g 497 t ORIGIN 1 accgcaactt ccgctggcgg tcatcgatga ggtggtcgtg cgggcgggag actatatcga 61 cattggtacg cctctttttg gcggatcggt tgtgccggtg acgtgaaatc actcgcattt 121 ccttcctgag ggaacgactt atgaaactaa agaccacatt gttcggcaat gtttatcagt 181 ttaaggatgt aaaagaggta ctggctaaag ccaacgaact gcgttcgggg gatgtgctgg 241 ccggggttgc cgcggcaagt tcgcaggagc gcgtagcggc aaaacaggta ctgtcggaaa 301 tgacggtggc ggatatccgc aacaatccgg tgattgccta tgaagaggac tgcgtgacgc 361 gcctgattca ggacgacgtc aacgaaacgg cctataaccg gattaaaaac tggagcatca 421 gcgaactgcg tgaatacgtg ctgagcgatg aaacctccgt ggacgacatc gcgtttaccc 481 gcaaaggcct gacctccgaa gtggtggcgg cagtagcgaa aatctgctcc aacgctgacc 541 tgatctacgg cggcaagaaa atgccggtga tcaaaaaagc caataccacc atcggtattc 601 cgggcacctt tagctgccgt ttgcagccga acgatacccg tgacgatgta cagagtatcg 661 ccgcgcaaat ctacgaaggg ctttctttcg gcgcaggcga tgcggtgatc ggcgttaacc 721 cggtgaccga tgacgtggag aacctgaccc gcgtgctcga caccgtttac gcgttatcga 781 taaattcaat attccgaccc agggctgcgt gctggcgcac 
gtcaccaccc agatcgaagc 841 gattcgtcgc ggcgcccggg cggactgatt ttccagagca tttgcggcac gagaagggct 901 taaaagagtt cggcgtcgag ctggccatgc tcgacgaagc gcgggctgtg ggggcggagt 961 tcaaccgcat cgccggggaa aactgcctgt actttgaaac cgggcaaggg tctgcgctct 1021 ccgcaggcgc gaactttggt gccgaccagg tgacgatgga agcgcgtaac tacgggctgg 1081 cgcgccacta cgatccgttc ctggtgaaca ccgtggtggg ctttatcggg ccggagtatc 1141 tctacaacga caggcagatt atccgcgccg gtctcgaaga tcactttatg ggcaagctga 1201 gcggcatctc gatgggctgc gactgctgct ataccaacca tgccgacgcc gaccagaacc 1261 ttaacgaaaa cctgatgatt ctgctcgcca ctgccggctg taactacatc atggggatgc 1321 cgctcggcga cgacatcatg ctcaactacc agaccaccgc tttccacgat accgccaccg 1381 tccgtcagtt gctgaattta cggccgtcgc cggagtttga acgctggctg gaaacgatgg 1441 gcattatggc aaacggtcgt ctgaccaaac gggcgggcga tccgtcactg ttcttctgat 1501 gacgcgggga taacaccatg gatcaaaaac agattgaaga aattgtacgt agcgtgatgg 1561 cgtcaatggg acaggacgta ccgcagcccg ccgcgccgtc aacgcaggaa ggcgcaaagc 1621 cgcagtgcgc cgcgccgacg gtgaccgaaa cgtgcgcgct ggatttaggt tccgcggagg 1681 caaaagcctg gattggcgtc gagaacccac atcgtgcgga cgtgctgacc gaactgcgtc 1741 gcagtactgc ggcacgcgtc ttgtacgggg cgtgccgggc cgcgtccgcg cacccaggcg 1801 ctgttgcgtt cctggcggat cactcccgtt cgaaagatac cgtgctcaaa gaagtgccgg 1861 aagagtgggt aaaagcgcaa gggctgctgg aagtgcgttc ggaagagtgg gtaaaagcgc 1921 aagggctgct ggaagtgcgt tcggagatca gcgacaaaaa cctgtacctg acgcgcccgg 1981 atatggggcg tcgcctgagc ccggaagcca ttgacgcgct gaagtcacag tgcgtgatga 2041 acccggatgt gcaggtagtg gtctccgatg gcctctctac ggatgcgatc accgccaact 2101 atgaagagat cctgccgccg ttgcttgccg gtctgaagca ggccgggctg aacgtcggca 2161 cgccgttctt tgtgcgctat ggccgtgtga agattgaaga tcagattggc gaaattctcg 2221 gcgcgaaggt cgtcatcctg ctggtaggcg aacgtccggg gctggggcag tcggaaagcc 2281 tttcctgcta cgcggtctat tccccgcgcg tggcaccacc gtcgaggccg acagaacctg 2341 tatttcaaac attcatcagg gggggacgcc gccagtagaa gccgccgccg tgattgtgga 2401 tttggccaaa cggatgctgg agcatgaaag cgtccggcat caacatgtac ccggttaagg 2461 agacatcatg cctgcattag atttaattcg accttcacgt gactgccata 
gcgcgtgatt 2521 gcctcc // LOCUS XELPCNA 1018 bp ss-mRNA VRT 10-JUL-1990 DEFINITION X.laevis proliferating cell nuclear antigen (PCNA) mRNA, complete cds. ACCESSION M34080 KEYWORDS nuclear protein; proliferating cell nuclear antigen. SOURCE X.laevis oocyte, cDNA to mRNA. ORGANISM Xenopus laevis Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Amphibia; Lissamphibia; Anura; Archeobatrachia; Pipoidea; Pipidae. REFERENCE 1 (bases 1 to 1018) AUTHORS Leibovici,M., Gusse,M., Bravo,R. and Mechali,M. TITLE Characterization and developmental expression of Xenopus proliferating cell nuclear antigen (PCNA) JOURNAL Dev. Biol. (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by M.Leibovici, 08-MAY-1990. FEATURES from to/span description pept 28 813 proliferating cell nuclear antigen (PCNA) mRNA < 1 1018 PCNA mRNA BASE COUNT 284 a 223 c 237 g 274 t ORIGIN 1 ccgcagtaat cccttacagc cgccgccatg tttgaggctc gcttggtgca gggttccatc 61 ctgaagaagg tgttggaggc gctgaaggac ctaatcgatg aggcgtgctg ggacattaca 121 tccagcggca tcagcttgca gagcatggac tcctcgcacg tctccctggt tcaactcact 181 ctgcgatctg acggctttga cacctaccgg tgtgatcgca atcaatctat cggcgtcaag 241 atgagcagta tgtccaaaat cttgaagtgt gccgcaagtg acgatatcat tactctgagg 301 gcagaagaca atgctgatac agtcacaatg gtgtttgagt cgccaaatca agagaaagtt 361 tcagactatg aaatgaagct aatggacctt gatgtggagc agctgggcat tcctgaacaa 421 gagtacagct gtgtaataaa gatgccatct ggtgaatttg cacgtatctg ccgagatctc 481 agccagattg gtgacgcagt agtaatttct tgtgctaagg atggggtaaa gttctctgca 541 agcggagagc tgggaactgg aaatgtaaag ctgtcacaga cttcaaatgt ggataaagaa 601 gaggaagctg ttacaataga aatgaatgag ccagtacagc ttacatttgc tttgcggtat 661 ctgaacttct tcaccaaagc tacacccctg tccccaacag ttattctcag tatgtctgca 721 gatatcccac ttgttgtgga atacaaaatt gcagatatgg aacatgtgaa atactacctg 781 gctcccaaga ttgaagatga agaagcttct taatgtctga actagcttat tttataaacc 841 tcaactgaac gtccaatggc gctttcacac acctgccttg ttttaacagc tttggctgaa 901 cctacccaac ttgtaccaac 
tggctgtact tctaggcatg tctgtagata tttttgtaaa 961 tacgtcacga tttttgtaaa atctctgccc taggaggtca ataaatcttt gtaataac // LOCUS YSCAAC2A 1333 bp ds-DNA PLN 10-JUL-1990 DEFINITION S.cerevisiae ADP/ATP-translocator protein (AAC2) gene, complete cds. ACCESSION M34076 J05542 KEYWORDS ADP/ATP translocase; ADP/ATP-translocator protein. SOURCE S.cerevisiae (strain W303-1B) DNA. ORGANISM Saccharomyces cerevisiae Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes; Endomycetales; Saccharomycetaceae. REFERENCE 1 (bases 1 to 1333) AUTHORS Kolarov,J., Kolarova,N. and Nelson,N. TITLE A third ADP/ATP-translocator in yeast JOURNAL J. Biol. Chem. (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by N.Nelson, 08-MAY-1990. FEATURES from to/span description pept 235 1158 ADP/ATP-translocator protein (AAC2) BASE COUNT 388 a 209 c 301 g 435 t ORIGIN 1 ataacctgag gtgacgattt gaataagttt cctttttttt tttctttcat gttggttgcc 61 ttcaattaca tatagattct cgagaaggtt tccattgtcc tttcattagg cgttgaagtg 121 aatctaaagt gcgcttgaat gatttcagat agaaagacta aagaagtggt gtgagtataa 181 ttaactcaat tgaagacggt ttacctgaag tgatatactg tgccttgaga aacaatgagt 241 agcgacgcta agcaacaaga aacaaacttt gccattaatt tcttaatggg tggtgtgagt 301 gcggccatcg ctaaaactgc tgcctcacca atcgaaagag tcaagatctt gatccaaaat 361 caagatgaaa tgatcaagca aggaacttta gataaaaagt attccggtat cgtggattgt 421 ttcaagagaa ctgcaaagca agagggacta atatcctttt ggcgaggaaa tactgccaat 481 gttattcgtt attttcccac tcaagctttg aacttcgcct tcaaagataa gattaagttg 541 atgtttggtt tcaagaaaga ggaaggctat ggtaaatggt ttgcaggtaa tctggcttct 601 ggtggtgcag ctggtgctct ttcgttatta tttgtttatt ctttagattt tgccagaacc 661 agacttgctg ctgatgcaaa atcgtcgaaa aagggtggcg ctcgccaatt caatgggttg 721 actgatgttt ataaaaagac cttgaaatcg gatggtatcg caggattata cagaggattc 781 atgccatcag tagtgggtat cgtggtttat agaggactat atttcggtat gtttgattct 841 ctcaagccac tggtgctaac tggttcatta gatggttcat tcttggcttc atttttattg 901 ggatgggtgg tcactacagg tgcctcaaca 
tgttcttatc cattagacac agtgagaaga 961 agaatgatga tgacttcagg tcaagcagta aagtacaacg gtgctataga ttgtctcaaa 1021 aaaatcgtag cttctgaagg tgtagggtca ttgttcaaag gctgcggggc aaatatcttg 1081 agaagtgttg ctggagctgg tgttatttcc atgtatgacc agttgcaaat gatattgttc 1141 ggtaaaaaat tcaaatgatc agttggatga agaaaaaagt cattttctcg acttctcttc 1201 acctttcgat cgatttgatt ttggccgcca acttgtttat agaaaaaaaa tagtaggaag 1261 gttatgtatc gctttctttt attttttatt atagagtata actgaataaa tttgtaaatc 1321 agccactgtt gtt // LOCUS YSCAAC3 1308 bp ds-DNA PLN 10-JUL-1990 DEFINITION S.cerevisiae ADP/ATP-translocator protein (AAC3) gene, complete cds. ACCESSION M34075 J05542 KEYWORDS ADP/ATP translocase; ADP/ATP-translocator protein. SOURCE S.cerevisiae (strain W303-1B) DNA. ORGANISM Saccharomyces cerevisiae Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes; Endomycetales; Saccharomycetaceae. REFERENCE 1 (bases 1 to 1308) AUTHORS Kolarov,J., Kolarova,N. and Nelson,N. TITLE A third ADP/ATP-translocator in yeast JOURNAL J. Biol. Chem. (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by N.Nelson, 08-MAY-1990. 
FEATURES from to/span description pept 78 1034 ADP/ATP-translocator protein (AAC3) BASE COUNT 353 a 228 c 263 g 464 t ORIGIN 1 atatttgtcg ttgttctttt ttgtgtgctc ttttatactt cagaatcata cattaacata 61 catataagca aatagccatg tcttccaacg cccaagtcaa aaccccatta cctccagccc 121 cagctccaaa gaaggaatct aactttttga ttgatttctt aatgggtggt gtcagtgccg 181 ctgtcgccaa aactgctgca tctcccatcg aaagagttaa acttttgatc caaaaccaag 241 atgaaatgat caagcaagga actttagata aaaagtattc cggtatcgtg gattgtttca 301 agagaactgc aaagcaagag ggactaatat ccttttggcg aggaaatact gccaatgtta 361 ttcgttattt ccccactcaa gctttgaact tcgccttcaa agataagatt aagttgatgt 421 ttggtttcaa gaaagaggaa ggctatggta aatggtttgc cggtaacttg gcatctggtg 481 gtgctgctgg tgccttgtca ttactatttg tttactcttt ggattatgca agaactagat 541 tggctgctga ctccaagtcc tctaaaaagg gtggtgctcg tcaattcaac ggtttgatcg 601 atgtctacaa gaagacctta aaatctgatg gtgttgctgg tctttacaga ggtttcttac 661 cttctgtcgt tggtattgtt gtctacagag gtctatactt cggtatgtac gattctttga 721 agcctctatt gttgactggt tctttggaag gttcattctt ggcttcattc ttgttgggtt 781 gggttgttac tactggtgct tctacatgtt cttacccatt ggataccgtt agaagaagaa 841 tgatgatgac ctccggtcaa gctgttaagt acgacggtgc ctttgactgt ttgaggaaga 901 ttgttgctgc tgaaggtgtt ggttctctat tcaagggttg tggtgctaac atcttaagag 961 gtgtcgcagg tgctggtgtt atctcaatgt acgaccaact gcaaatgatc ttgtttggta 1021 agaagttcaa ataagtctaa tctggcttga ttcttaatct aaattctttc tcacattttc 1081 ctttttttct tctttggatt tttgggtgtt taatgagtga cacgatttgt tttgataata 1141 ttattatcct cctatttttt tagaaattct tttcaacaag aatcaaagat tcataaaaaa 1201 agtaaaacga tgaaattttt tgaacaaatt ttacgtataa agaagaaaaa aattaaattc 1261 taaatatcca gtaaatcgtt ttatattagt agtattcttt cccacttt // LOCUS ECODKSA 1273 bp ds-DNA BCT 10-JUL-1990 DEFINITION E.coli dnaK suppressor (dksA) gene, complete cds. ACCESSION M34945 KEYWORDS dnaK suppressor. SOURCE E.coli DNA. ORGANISM Escherichia coli Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Facultatively anaerobic rods; Enterobacteriaceae. REFERENCE 1 (bases 1 to 1273) AUTHORS Kang,P.J. 
and Craig,E.A. TITLE Identification and characterization of a new Escherichia coli gene that is a dosage-dependent suppressor of a dnaK deletion mutation JOURNAL J. Bacteriol. 172, 2055-2064 (1990) STANDARD simple staff_review FEATURES from to/span description pept 229 441 ORF 1 pept 619 1074 dnaK suppressor (dksA) BASE COUNT 343 a 301 c 333 g 296 t ORIGIN 1 gacgaaagag gctatcctta atgaatcaat ttcagaactg tcaggctata gctcgctgaa 61 aagcgaagta aaatacggcg cagaacgcag ccgtattgac tttatgttgc aggcggattc 121 gcgtccagac tgctatattg aagtgaaatc ggttacgtta gcggagaacg aacagggata 181 ttttcccgat gcggtcactg aacgaggtca gaaacacttc gggagttgat gagcgtagcg 241 gctgaaggcc agcgtgcggt tatctttttc gccgtgctgc attcagccat tacacggttt 301 tcacccgcgc gccacatcga tgagaaatac gcgcaactat tgtcagaagc tcaacagagg 361 ggggtagaaa ttctggctta caaagcggaa atttctgctg aaggcatggc tcttaaaaaa 421 tcactgccgg ttacattgta gtaaagtaag taactggtta atttacattc tggtcgcgtg 481 cgcaaatacg cttttcctca cacagttgtc aagtgttacg tttagataat tgctatccgg 541 aaaagcatct gctatttata gcggcctcat ttttcccccg aacatgggga tcgatagtgc 601 gtgttaagga gaagcaacat gcaagaaggg caaaaccgta aaacatcgtc cctgagtatt 661 ctcgccatcg ctggggtgga accatatcag gagaagccgg gcgaagagta tatgaatgaa 721 gcccagctgg cgcacttccg tcgtattctg gaagcatggc gtaatcaact cagggatgaa 781 gtcgatcgca ccgttacaca tatgcaggat gaagcagcca acttcccgga cccggtagac 841 cgtgcagccc aggaagaaga gttcagcctc gaactgcgta accgcgatcg cgagcgtaac 901 gtgatcaaaa agatcgagaa gacgctgaaa aaagtggaag acgaagattt cggctactgc 961 gaatcctgcg gtgttgaaat tggtattcgc cgtctggaag cgcgcccgac agccgatctg 1021 tgcatcgact gcaaaacgct ggctgaaatt cgcgaaaaac agatggctgg ctaattacag 1081 ccgttccatc acgtttacca cacggggaaa tcgtcccgcc ttattttttg ttcaaagaga 1141 tgacagacac acagtatatt ggcctgtcgc ccctctcttc cggcgagctt cattttggct 1201 ctctgatcgc tacgctcggc agctatttgc acgtcgcgcc cggcaaggtc gctggctggt 1261 acgcatagaa gat // LOCUS STFCYCLI 2180 bp ss-mRNA INV 10-JUL-1990 DEFINITION Starfish (A.pectinifera) cyclin B (CYC) mRNA, complete cds. 
ACCESSION M33880 KEYWORDS cyclin B. SOURCE Starfish (A.pectinifera) egg, cDNA to mRNA, clone lambda-gt10-cyc10. ORGANISM Asterina pectinifera Eukaryota; Animalia; Eumetazoa; Echinodermata; Asterozoa; Stelleroidea; Asteroidea; Spinulosida; Asterinidae. REFERENCE 1 (bases 1 to 2180) AUTHORS Tachibana,K., Ishiura,M., Uchida,T. and Kishimoto,T. TITLE The starfish egg mRNA responsible for meiosis reinitiation encodes cyclin JOURNAL Dev. Biol. (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable [or printed] sequence for [1] kindly submitted by T.Kishimoto, 11-MAY-1990. FEATURES from to/span description pept 126 1313 cyclin B (CYC) mRNA < 126 2180 cyclin B mRNA BASE COUNT 650 a 440 c 482 g 608 t ORIGIN 1 ttattatgtt gctcagttct gacctcttta gcaacgtaca tgacgtacat gaagtacacg 61 tatgacgtac atcgtagcga ctgtctgaat ttttcttcga tgactaaaat tcatctggga 121 aaacaatgca gacagcttgt tctggcaatt tgtgtgggta tcaactgatg ttcagtttgt 181 ctactgttgt aactgtatgc agatcactcc gatcccgcaa ccgccactgg tttttgaagc 241 ttttgaggtg tacgtttaac gatcgcatga gatgcgctct ggagaacatc agcaatgtag 301 caaagaacaa tgtacaagct gcagctaaga aggagatcaa acaaaagaga ggaatgacca 361 aatccaaagc tacaagttct ctacagtcgg tcattggtct ccatgtagaa cctgtggaaa 421 aggtccagtc gccagagccc atggacatga gtgaagtcag caatgctctg gaggctttct 481 cacagaacat tcttgagatg ggcgtcgatg acattgacaa agatgaccat gaaaatccac 541 agctgtgcag cgagtacgtc aacgacatct atctatacat gagacatctg gagcgtgagt 601 tcaaagtgag gacagattac atggcaatgc aagagatcac tgagcgtatg agaacgatcc 661 tgattgactg gctggtccaa gtacatctta gattccatct tctacaggaa acactgttcc 721 ttaccatcca gatcctcgac agatacctag agggtgcaag cgtatccaag accaaactcc 781 agctggtcgg tgtgacctcc atgctgattg ctgcctatga agagatgtac gcagagattg 841 gagactttgt ctacatcacg gacaacgctt acagcaaggc acagatccgc gccatggagt 901 gtaacattct ccggaaacta gacttcaatc tgggcaagcc actctgcatt cacttcctca 961 gacgttgctc aaaggctggt ggggttgatg gtcacaagca cacactgtcc aagtacatca 1021 tggagttgac gttacagagt acagctttgt caagtatgac catcgagatt gctgctgcag 
1081 ccttgctatc acaagattct gggatgagga tatgtggaat gggaacaaaa tccctggttc 1141 actacagtgc ctacagtgaa ggccacctgg gaccaattgt gcagaagatg gccgtgctat 1201 ctcaacaatc gcacccaagt ccaaattcca ggcttgatca ggaagaagat atggccagca 1261 gcaagttcat gagcgatcag caagctaccc aagaactgaa atcaatcagg tagtcaactg 1321 aatcttgccg acgagaactg ctgagcttcc atccgcccag atgaatggtc atgtaatagt 1381 agtaaatagt agtgtattat agtctttaat taaataacac cccttcagaa gttgacaggt 1441 ttcaacttag tgcatgattt aagcaactcg aggaggtact ccgatttttt ccccccttgg 1501 ttgtcatttt ttaagttggc aagtgcagtt gaatctattt taatcttgta tagatagcaa 1561 tgcttgtact gccatggagg ccaaaggcgt agatagaatt gtgcatgaaa gtacaatgtt 1621 gttgaaatcg ggtggagtgg gattatttga atgatacgct acattttgtg caatgacaga 1681 cgcactacag catgatcgag gtttcaaagt aaaattatgg ctatctaaca ttttgtaagc 1741 attgcatgta taatagcttt ctgcaagtgc aatcagattt ctgatcagag gttcaatgca 1801 taacgtgtca cgaaagccca tctgatcaag cgtaatgtaa aatgaaaagg ggaaattgac 1861 ttctgcaatt tattatgctt ctagaatttt tactcgtcca actttttgtc tgtcgttcat 1921 gacttttgcg ctagatatcc gagaccaatt catttctcca aagaaaaaaa taaacatgag 1981 gttgtttgtc atgaagtttc ccacacaact tcagatgaac agctcatcaa gttgtcagat 2041 ttgcttgttc aaaagttaaa acgaaaaaaa tcatgtctta atgttttatt atttaatatg 2101 taaaattgaa tgattcgtgt tgcagtattt gtacctaaat gcttttgtct gtcagtgttt 2161 gtaataaagt taatggaaat // LOCUS CHKMTTGHA 90 bp ds-DNA ORG 10-JUL-1990 DEFINITION Chicken mitochondrial His-tRNA gene. ACCESSION M34496 M34497 KEYWORDS transfer RNA-His. SOURCE Chicken (strain white leghorn) liver mitochondrial DNA. ORGANISM Mitochondrion Gallus gallus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Aves; Neornithes; Neognathae; Galliformes; Phasianidae; Gallus gallus. REFERENCE 1 (bases 1 to 90) AUTHORS L'Abbe,D., Lang,B.F., Desjardins,P. and Morais,R. TITLE Histidine tRNA from chicken mitochondria has an uncoded 5'-terminal guanylate residue JOURNAL J. Biol. Chem. 265, 2988-2992 (1990) STANDARD simple staff_entry COMMENT the "n"s in the tRNA sequence are probably modified bases. 
FEATURES from to/span description tRNA 11 79 His-tRNA anticdn 41 43 His-tRNA anticodon gtg variant 10 10 t in DNA; n in tRNA variant 18 18 t in DNA; n in tRNA variant 19 19 a in DNA; n in tRNA variant 26 26 c in DNA; n in tRNA variant 27 27 c in DNA; n in tRNA variant 35 35 t in DNA; n in tRNA variant 45 45 t in DNA; n in tRNA BASE COUNT 27 a 22 c 17 g 24 t ORIGIN 1 acccctctat gcaaacatag tttaacccaa acattagatt gtgattctaa aaataggagt 61 ttaaccctcc ttgttcgccg aggggaggcc // LOCUS DDISAS1A 2145 bp ss-mRNA INV 10-JUL-1990 DEFINITION D.discoideum GTP-binding protein (SAS1) gene, complete cds. ACCESSION M34456 KEYWORDS GTP-binding protein. SOURCE D.discoideum, cDNA to mRNA. ORGANISM Dictyostelium discoideum Eukaryota; Animalia; Protozoa; Sarcomastigophora; Sarcodina; Rhizopoda; Eumycetozoa; Dictyostelia; Dictyosteliida; Dictyosteliidae. REFERENCE 1 (bases 1 to 2145) AUTHORS Saxe,S.A. and Kimmel,A.R. TITLE SAS1 and SAS2, GTP-binding protein genes in Dictyostelium discoideum with sequence similarities to essential genes in Saccharomyces cerevisiae JOURNAL Mol. Cell. Biol. 
10, 2367-2378 (1990) STANDARD simple staff_entry FEATURES from to/span description pept 1095 1706 GTP-binding protein (SAS1) BASE COUNT 929 a 249 c 238 g 727 t 2 others ORIGIN 1 gggaattatt aggacatcag gtttaaaacc tattcagaca ccagaataca atttgaattg 61 agcggcaacg ttcctttcac tctgcactac atcagcatta ttagagagaa aggttgaaaa 121 acctctatcg aaggtggtgg aattgctgag aagtaacagc aataaataaa acattcaaac 181 cgatagatga gaggttcaaa atccatctag ttagtagggc taaaaaacta caaatcataa 241 acccgatccg atacctaaga ctcctttttt tttttttttt tttttaataa atcaaataat 301 cacatgacct tggagtcttg gtctgcccac gaatttaaag tgcaaagttt attttattta 361 aactgggtgc atgcaaacat tactctatcg accgatttat ccaattttaa tactaaaatc 421 ttaaaaacca gaaagaanna ataataataa taataataat aataataata ataataataa 481 taataataat aataataata ataataataa taataataat aataataata ataataataa 541 taataataat aataataata ataataacaa ccttatttga aaattcaaat taaaaaaaaa 601 agaaatagct ttacatttta aaattaaaat tcataaataa aaccattata aaaatattga 661 agtatatcaa taggtttaat ttaattattg tttatttaat aaaaaaaaaa aaaaaaaaaa 721 aaaattattt aatcggttca atttaacttt ttcgaagaat tatttttttt aagaaaacat 781 ttcaacccaa aaaaataaaa aaaataaaaa aataaaaatt taaatcgaat ggttgaaatg 841 ttttcttaaa aaaacaaaaa ttaaaataaa ttttattttt tttgaattaa atttcaattc 901 agcaattcaa taattttaac gttttcactt catcaaaaat tataaataga atattaaaca 961 caacacaaca caactatcca aactaaaaca attaaaatca aaactctaat tttttataaa 1021 aatttattta ttttctcatc tcaataaaaa catttaaaaa cataattggt aatatagata 1081 tttttttcaa aataatgact tctccagcaa caaataaacc agcagcctac gattttttag 1141 ttaaattact tttaattggg gatagtggtg taggaaagtc atgtctttta ttacgttttt 1201 ctgatggttc tttcacacca agtttcatcg ctactattgg tatcgatttc aaaattcgta 1261 caattgaatt agagggtaaa agaattaaat tacaaatttg ggacactgca ggtcaagaaa 1321 gattcagaac tatcactaca gcatactatc gcggtgctat gggtatccta ttggtttatg 1381 atgtcactga tgaaaaatct tttggtagca ttagaaattg gattagaaat atcgagcaac 1441 atgcttcaga ctcagttaat aaaatgttaa tcggtaataa atgtgatatg accgaaaaga 1501 aagttgttga tagctcaaga ggtaaatcac ttgcagacga atatggtatt 
aaatttttag 1561 aaacttctgc caaaaacagt gtaaatgtag aggaagcctt tattggttta gcaaaagata 1621 ttaaaaaacg tatgattgat acaccaaatg atcctgatca taccatatgc attactccaa 1681 acaataagaa aaatacttgt tgttaaattg gggccatttt aattttcaca ttattagatg 1741 aaaaaaaaaa aaaaaaaaaa ctaaaattaa aagtaaaaaa cacttttttt tatttaaaaa 1801 tattattttt cattagtcat gaatggttac gtctaaacga tctaatattt ctctatagta 1861 gtgaattatt gcttcatgaa ttttagtgaa aagtttagct taataataat aataataata 1921 ataataataa taataataat aataataata ataataataa ataataataa caattttaaa 1981 attaaatatc caatgttgaa tattttaagt caaaaataat aataataatt ggaatgtatt 2041 ttaaaattaa aattcataaa taaactatta attattgttt attgccttta atggctaacc 2101 tattttttat agtttaaaaa taatttataa ttaatttttt taaat // LOCUS DDISAS2A 989 bp ds-DNA INV 10-JUL-1990 DEFINITION D.discoideum GTP-binding protein (SAS2) gene, complete cds. ACCESSION M34457 KEYWORDS GTP-binding protein. SOURCE D.discoideum DNA. ORGANISM Dictyostelium discoideum Eukaryota; Animalia; Protozoa; Sarcomastigophora; Sarcodina; Rhizopoda; Eumycetozoa; Dictyostelia; Dictyosteliida; Dictyosteliidae. REFERENCE 1 (bases 1 to 989) AUTHORS Saxe,S.A. and Kimmel,A.R. TITLE SAS1 and SAS2, GTP-binding protein genes in Dictyostelium discoideum with sequence similarities to essential genes in Saccharomyces cerevisiae JOURNAL Mol. Cell. Biol. 
10, 2367-2378 (1990) STANDARD simple staff_entry FEATURES from to/span description pept 49 675 GTP-binding protein (SAS2) BASE COUNT 421 a 141 c 142 g 285 t ORIGIN 1 atcaatcaat aaactacaaa tttataatat agatattttt tcgaaataat gacttctcca 61 gcaacaaata aatcagcagc ctacgattat ttaattaaat tacttttaat cggtgatagt 121 ggtgtaggta aatcatgtct tttattacgt ttttctgaag attctttcac accaagtttc 181 atcactacta ttggtatcga tttcaaaatt cgtacaattg aattggaagg taaaagaatt 241 aaattacaaa tttgggatac tgcaggtcaa gaaagattca gaactatcac tacagcatac 301 tatcgtggtg ctatgggtat cctattggtt tatgatgtca ctgatgaaaa atcttttggt 361 aacattagaa attggattag aaatatcgag caacatgcta cagactctgt taataaaatg 421 ttaatcggta ataaatgtga tatggctgaa aagaaagttg ttgatagctc aagaggtaaa 481 tcacttgcag acgaatatgg tattaaattt ttagaaacct cagccaaaaa cagtataaat 541 gtagaggaag cctttattag tttagcaaaa gatattaaaa aacgtatgat tgatacacca 601 aatgaacaac cacaagttgt tcaaccaggt acaaatcttg gtgcaaataa caataagaaa 661 aaagcttgtt gttaaattgg gtgctatttt aattttcaca ttatattatt agataaaaat 721 aaaaaaaaaa aaaaaaatct taaaaaaaaa aaaaaaagtc atcaaaatta ttcacctaaa 781 aaaataacat ataaaccctg ggtttcaagg cagaggatga ttcacttaca acaacaacaa 841 caacaaccaa caacaacaac aacaaccaac aacaacaact aacaacaaca acaaataata 901 ataataataa aaataataat aataaatccc caagttgtga agttgtgttg aaattaataa 961 gagtgggagg tttatatcgc ataaataac // LOCUS HUMLAMBA 2850 bp ss-mRNA PRI 10-JUL-1990 DEFINITION Human lamin B mRNA, complete cds. ACCESSION M34458 KEYWORDS intermediate filament; lamin B. SOURCE Human T-cell line MOLT-4, cDNA to mRNA, clone LAM-2. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 2850) AUTHORS Pollard,K.M., Chan,E.K.L., Grant,B.J., Sullivan,K.F., Tan,E.M. and Glass,C.A. TITLE In vitro posttranslational modification of lamin B cloned from a human T-cell line JOURNAL Mol. Cell. Biol. 
10, 2164-2175 (1990) STANDARD simple staff_entry FEATURES from to/span description pept 342 2102 lamin B mRNA < 342 2850 lamin B mRNA signal 2834 2839 polyA signal BASE COUNT 776 a 614 c 748 g 712 t ORIGIN 1 cgcgagcagg agacggcggc gggcgaaccc tgctgggcct ccagtcaccc tcgtcttgca 61 ttttcccgcg tgcgtgtgtg agtgggtgtg tgtgttttct tacaaagggt atttcgcgat 121 cgatcgattg attcgtagtt cccccccgcg cgcctttgcc ctttgtgctg taatcgagct 181 cccgccatcc caggtgcttc tccgttcctc taaacgccag cgtctggacg tgagcgcagg 241 tcgccggttt gtgccttcgg tccccgcttc gccccctgcc gtcccctcct tatcacggtc 301 ccgctcgcgg cctcgccgcc ccgctgtctc cgccgcccgc catggcgact gcgacccccg 361 tgccgccgcg gatgggcagc cgcgctggcg gccccaccac gccgctgagc cccacgcgcc 421 tgtcgcggct ccaggagaag gaggagctgc gcgagctcaa tgaccggctg gcggtgtaca 481 tcgacaaggt gcgcagcctg gagacggaga acagcgcgct gcagctgcag gtgacggagc 541 gcgaggaggt gcgcggccgt gagctcaccg gcctcaaggc gctctacgag accgagctgg 601 ccgacgcgcg acgcgcgctc gacgacacgg cccgcgagcg cgccaagctg cagatcgagc 661 tgggcaagtg caaggcggaa cacgaccagc tgctcctcaa ctatgctaag aaggaatctg 721 atcttaatgg cgcccagatc aagcttcgag aatatgaagc agcactgaat tcgaaagatg 781 cagctcttgc tactgcactt ggtgacaaaa aaagtttaga gggagatttg gaggatctga 841 aggatcagat tgcccagttg gaagcctcct tagctgcagc caaaaaacag ttagcagatg 901 aaactttact taaagtagat ttggagaatc gttgtcagag ccttactgag gacttggagt 961 ttcgcaaaag catgtatgaa gaggagatta acgagaccag aaggaagcat gaaacgcgct 1021 tggtagaggt ggattctggg cgtcaaattg agtatgagta caagctggcg caagcccttc 1081 atgagatgag agagcaacat gatgcccaag tgaggctgta taaggaggag ctggagcaga 1141 cttaccatgc caaacttgag aatgccagac tgtcatcaga gatgaatact tctactgtca 1201 acagtgccag ggaagaactg atggaaagcc gcatgagaat tgagagcctt tcatcccagc 1261 tttctaatct acagaaagag tctagagcat gtttggaaag gattcaagaa ttagaggact 1321 tgcttgctaa agaaaaagac aactctcgtc gcatgctgac agacaaagag agagagatgg 1381 cggaaataag ggatcaaatg cagcaacagc tgaatgacta tgaacagctt cttgatgtaa 1441 agttagccct ggacatggaa atcagtgctt acaggaaact cttagaaggc gaagaagaga 1501 ggttgaagct gtctccaagc ccttcttccc 
gtgtgacagt atcccgagca tcctcaagtc 1561 gtagtgtacg tacaactaga ggaaagcgga agagggttga tgtggaagaa tcagaggcga 1621 gtagtagtgt tagcatctct cattccgcct cagccactgg aaatgtttgc atcgaagaaa 1681 ttgatgttga tgggaaattt atccgcttga agaacacttc tgaacaggat caaccaatgg 1741 gaggctggga gatgatcaga aaaattggag acacatcagt cagttataaa tatacctcaa 1801 gatatgtgct gaaggcaggc cagactgtta caatttgggc tgcaaacgct ggtgtcacag 1861 ccagcccccc aactgacctc atctggaaga accagaactc gtggggcact ggcgaagatg 1921 tgaaggttat attgaaaaat tctcagggag aggaggttgc tcaaagaagt acagtcttta 1981 aaacaaccat acctgaagaa gaggaggagg aggaagaagc agctggagtg gttgttgagg 2041 aagaactttt ccaccagcag ggaaccccaa gagcatccaa tagaagctgt gcaattatgt 2101 aaaattttca actgtcttcc tcaaaataaa gaagtatggt aatctttacc tgtatacagt 2161 gcagagcctt ctcagaagca cagaatattt ttatatttcc tttatgtgaa tttttaagct 2221 gcaaatctga tggccttaat ttcctttttg acactgaaag ttttgtaaaa gaaatcatgt 2281 ccatacactt tgttgcaaga tgtgaattat tgacactgaa cttaataact gtgtactgtt 2341 cggaaggggt tcctcaaatt ttttgacttt ttttgtatgt gtgttttttc ttttttttta 2401 agttcttatg aggaggggag ggtaaataaa ccactgtgcg tcttggtgta atttgaagat 2461 tgccccatct agactagcaa tctcttcatt attctctgct atatataaaa cggtgctgtg 2521 agggagggga aaagcatttt tcaatatatt gaacttttgt actgaatttt tttgtaataa 2581 gcaatcaagg ttataatttt ttttaaaata gaaattttgt aagaaggcaa tattaaccta 2641 atcaccatgt aagcactctg gatgatggat tccacaaaac ttggttttat ggttacttct 2701 tctcttagat tcttaattca tgaggagggt gggggaggga ggtggaggga gggaagggtt 2761 tctctattaa aatgcattcg ttgtgttttt taagatagtg taacttgctt aaatttctta 2821 tgtgacatta acaaataaaa aagctctttt // LOCUS VIBANGRA 4379 bp ds-DNA BCT 10-JUL-1990 DEFINITION V.anguillarum trans-acting transcriptional activator (angR), S-acyl fatty acid synthesis thioesterase-like protein genes, complete cds, and outer membrane protein (omp), 3' end. ACCESSION M34504 KEYWORDS S-acyl fatty acid synthesis thioesterase-like protein; outer membrane protein; trans-acting transcriptional activator. SOURCE V.anguillarum DNA, clone pJHC-A103. 
ORGANISM Vibrio anguillarum Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Facultatively anaerobic rods; Vibrionaceae. REFERENCE 1 (bases 1 to 4379) AUTHORS Farrell,D.H., Mikesell,P., Actis,L.A. and Crosa,J.H. TITLE A regulatory gene, angR, of the iron uptake system of Vibrio anguillarum: Similarity with phage P22 cro and regulation by iron JOURNAL Gene 86, 45-51 (1990) STANDARD simple staff_entry COMMENT Fur protein is a product of the ferric uptake regulatory gene (fur). FEATURES from to/span description pept < 1 275 outer membrane receptor protein pept 361 3507 trans-acting transcriptional activator (angR) pept 3504 4262 S-acyl fatty acid synthase thioesterase-like protein (ORF6) binding 154 158 Fur binding site binding 348 353 ribosome binding site (put.) binding 3488 3493 ribosome binding site (put.) signal 34 39 -35a region (put.) signal 63 68 -10a region (put.) signal 109 114 -35b region (put.) signal 126 131 -10b region (put.) signal 3387 3392 -35c region (put.) signal 3405 3410 -10c region (put.) signal 3443 3448 -35d region (put.) signal 3456 3461 -10d region (put.) 
BASE COUNT 1320 a 914 c 902 g 1243 t ORIGIN 1 ggaacctacc agtgatgcgt caacttactc ttattggtca agcaaattac atgtcagagc 61 aatatattga tgcacaaaac actcaatcac tgtctgcaca gactattttt gatttaggtg 121 ctcgctataa ctctaccatc gccaatcaaa gtgtcatttg gcgtcttgcg gtcaacaacg 181 taaccgatga agcatattgg actaccaccc attacgctag ccttgcgttg ggtgcccctc 241 gtacggtgat gctatctgct acagcggatt tttaatctcg gtcaattttg cccttgacct 301 ttctggttaa gggcattcgt cttccccttc cccccatttg gctttttatg agaatttaga 361 atgaatcaaa atgaacatcc cttcgctttc cctgagacaa aattaccttt aacctccaat 421 caaaattggc agttatcaac ccaaagacag cgtactgaaa aaaaatcgat taccaatttt 481 acgtatcagg aatttgatta cgaaaacatt tcgagggaca cattagaacg ctgcctcaca 541 acaataatta agcatcaccc aatattcgga gctaagctca gtgacgactt ctacctccat 601 tttccgagca aaactcacat tgaaaccttt gcagttaatg acttaagtaa tgccttaaaa 661 caagatattg ataaacagtt ggccgatacg cgttctgcag taacgaaaag ccgctcacaa 721 gcgataatct ctatcatgtt tagtatattg ccaaaaaaca taatcaggct tcatgtacgc 781 ttcaactcag ttgttgtaga taatccaagt gttacgcttt tttttgagca gcttactcag 841 ttattatcgg gaagtcccct ttctttttta aatcaagaac agactatctc cgcatacaat 901 cacaaagtta ataatgagtt gcttagtgtt gatcttgagt ccgcaagatg gaatgaatat 961 attctaacac tacctagttc agcaaacctt cccacaattt gtgaacccga aaaactggat 1021 gaaaccgata tcactcgcag gtgcattaca ctgtcacaaa ggaagtggca gcagttggtt 1081 actgttagca aaaaacataa tgtcacaccg gagataactc ttgccagtat attttcgacc 1141 gttttatcac tctgggggca tcaaaaatac ctcatgatga gatttgatat caccaaaatc 1201 aatgactaca cgggcatcat aggccagttt accgaacctt tattagtggg tatgtccggc 1261 tttgagcaga gctttctttc tcttgttaaa aacaaccaaa aaaagttcga agaagcttat 1321 cattatgacg ttaaagtacc tgtttttcag tgtgttaata aattatctaa tatttcggat 1381 tctcaccgtt atcctgctaa tatcactttt tctagcgagc ttttaaacac aaaccatagc 1441 aaaaaagctg tatggggatg tcgtcaatca gccaatactt ggctttcttt acatgctgta 1501 atcgagcaag aacaacttgt cttacaatgg gacagccaag acgcaatctt cccaaaagac 1561 atgatcaaag atatgttaca tagttacacc gatttattag acttactcag ccaaaaagat 1621 gtcaactggg cacagccttt accaactttg ctgccaaaac 
atcaggagtc catacgcaat 1681 aaaataaatc aacagggaga cctagaacta actaaagaac tcctccatca gcgttttttt 1741 aaaaacgtag agtccacccc taatgctctt gcgattatcc atggtcaaga gtcattagat 1801 tatataactt tagcaagcta cgccaagagt tgtgcgggtg cactaaccga agctggagta 1861 aaatcaggag accgcgttgc tgtcactatg aataaaggca ttggtcaaat agtggcagta 1921 ttgggaatat tatatgctgg ggctatttat gttcctgtct ctctagatca accacaagaa 1981 aggcgggaaa gtatttatca aggtgctgga attaacgtta ttcttattaa cgaatcagat 2041 agtaaaaatt ccccttcaaa tgatcttttc tttttcctgg actggcaaac agcgataaag 2101 agtgagccaa tgcgtagccc tcaagatgtc gcgccaagtc aaccagccta tattatctac 2161 acatcaggct caacaggaac ccctaaggga gtggtgattt ctcaccaagg cgctcttaat 2221 acatgtatcg cgatcaatcg acgttatcaa attgggaaaa atgatcgagt attggctctt 2281 tcagcactac attttgacct ttcggtatac gacatctttg gcctactttc tgccggcggc 2341 actatcgtat tagtcagtga gcttgaaaga cgtgacccga ttgcttggtg tcaagcaatt 2401 gaggagcata atgtcaccat gtggaatagc gtcccagcat tatttgatat gttattaact 2461 tacgctactt gctttaactc tatcgctccc tcaaaactcc gtttaaccat gctttcggga 2521 gactggattg gattagattt accgcagcgt tatcgcaatt atcgtgtaga tggccaattt 2581 attgcgatgg gaggagccac cgaagcatcg atatggtcaa acgtctttga cgtagagaaa 2641 gttccgatgg agtggcgctc tatcccttat ggctatcctc tacctagaca acaatatcga 2701 gttgtcgatg acttggggcg agattgccca gattgggtag ctggcgaact ttggattggt 2761 ggtgacggta tcgcactggg gtattttgac gatgaattga aaacgcaagc tcagttttta 2821 catattgatg gccatgcttg gtatcgtact ggtgacatgg gctgttattg gccagatggt 2881 actcttgagt tcttggggcg aagagacaag caggtcaaag taggaggtta cagaattgag 2941 ttgggagaaa tcgaagttgc actcaataat ataccggggg tgcagcgtgc ggttgctatc 3001 gcagtgggca ataaagacaa aactctagca gcattcatcg ttatggattc ggagcaagca 3061 ccaatagtta cagcgccgtt ggatgcagaa gaagttcaac ttttgttgaa caaacaactg 3121 cctaactaca tggttcccaa acgcataatt ttccttgaaa ccttccccct aaccgctaat 3181 ggtaaagtcg atcataaagc tctaactcga atgactaacc gagaaaagaa aacatctcaa 3241 agcataaata aacctattat tactgcgagt gaagatagag tagccaaaat ttggaatgac 3301 gttcttggtc ctacagaact ctataaatcg agtgatttct ttttgtcggg 
aggagatgca 3361 tacaacgcaa tagaggtagt caaacgttgt cataaagctg gctatctaat caagctatca 3421 atgttgtacc gttattctac gattgaagct ttcgctatta tcatggaccg ttgtcgatta 3481 gcacctcagg aagaggctga gttatgagcc ctttaatcaa acttgcagcc tcttcgaggc 3541 tgcatgatgc aactcattat gttttatgcc cttttgcagg aggtggtagt ggtgcattta 3601 gacactggcg tacattatcc cttgaaaatg aagtgatttc ggtaatgctt tatcctggta 3661 gagaatttcg tatagacgac ccaacagtca taaacatcgg cacattagca gaagaaatga 3721 tccaagcttt aaaaacctgt aatcaacgaa tagaagatac gatcattgtc ggtcatagta 3781 tgggcgcgca agtggcgtat gaagcaagta aaaaactagt aaatcagggg ctatttctga 3841 aagggctgat catctctggt tgtcaagctc ctcatatcaa agggcgaagg ttactaggtg 3901 aatgcgatga taaaaccttt attcataatc tagtcgagat tggagggtgt gatccaagtt 3961 tagctaaaag tccagagtgg tggccgatat ttctgccagc tttgagggcg gactttacgg 4021 ctacagaaca gtatattttc acatcacttc caaatgataa ggaaggcctt cctatcccaa 4081 ctctattgat ttcaggtgat caagatagag aagctaactt ttcagaaata gaagagtgga 4141 aactttggtg taataaagtc gttgatcatt tagtggtcga gggcgggcat ttctatataa 4201 cagagcaacc tcaaatgatg cttgaatgca tccgggcttt atcaaccgaa acgactgcct 4261 aatactaagg ttcggttgat agatttttag acaaacaact tcaaacgaca agggtatgca 4321 tttaagcaat gcataccctg ggcttttcga tcaacactat tacttggttt ccggaattc // LOCUS VIBLUXABG 3200 bp ds-DNA BCT 10-JUL-1990 DEFINITION P.leiognathi luciferase alpha (luxA), beta (luxB) subunit, and gamma protein (luxG) genes, complete cds. ACCESSION M34564 KEYWORDS gamma protein; luciferase. SOURCE P.leiognathi (strain 554) DNA, clone pPHL[6,11,12]. ORGANISM Photobacterium leiognathi Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Facultatively anaerobic rods; Vibrionaceae. REFERENCE 1 (bases 1 to 3200) AUTHORS Illarionov,B.A., Blinov,V.M., Donchenko,A.P., Protopopova,M.V., Karginov,V.A., Mertvetsov,N.P. and Gitelson,J.I. 
TITLE Isolation of bioluminescent functions from Photobacterium leiognathi: Analysis of luxA, luxB, luxG and neighboring genes JOURNAL Gene 86, 89-94 (1990) STANDARD simple staff_entry FEATURES from to/span description pept < 1 145 ORF1 pept 182 1246 luciferase alpha-subunit (luxA) pept 1295 2272 luciferase beta-subunit (luxB) pept 2293 2979 gamma protein (luxG) pept 3081 > 3200 ORF2 binding 171 174 ribosomal binding site (put.) binding 1284 1287 ribosomal binding site (put.) binding 2281 2284 ribosomal binding site (put.) binding 3072 3075 ribosomal binding site (put.) BASE COUNT 1092 a 616 c 586 g 906 t ORIGIN 1 tcgagcagcc attggcttag acagtgaagt gattgattta gttgatgata ttagtgagcc 61 aaactttgaa gatctcacca ttattacagt taatgaacgt cgtttgaaaa ataaaattga 121 aaacgaaatg ttcgctagcg cttaaaccaa tacctattca agtcatcaaa aggaaaagat 181 aatgaaattt ggcaatattt gtttctcata ccagccacca ggtgaatctc ataaagaagt 241 catggatcgc tttgttcgtc ttggcgttgc ttcagaagaa ttaaacttcg acaccttctg 301 gacacttgag caccacttca ctgaattcgg cctaacaggt aacttatatg ttgcttgtgc 361 caatattctt ggtcgtacca aaaaacttaa cgtcggcaca atgggtatcg tactaccaac 421 agctcaccct gctcgccaaa tggaagatct actgctactg gatcaaatgt caaaaggacg 481 ttttaacttt ggtgtagtac gtggtctata ccataaagat ttccgggtat ttggtgttac 541 gatggaagat tctcgttcga tcactgaaga tttccataaa atgatcatgg acggctctaa 601 atcaggcgtt ttacacactg atggtaaaaa cattgaattc ccagatgtaa atgtctatcc 661 agaggcctac ctagacaaga tccctacttg tatgacagcg gaatctgcgg cgacaacgac 721 ctggctagca gaacgtggtt tgccaatggt actgagctgg atcatcacca ccagcgagaa 781 aaaagcacag atggaactat acaatgaaat tgcagctgag catgggcacg atattcacaa 841 tatcgaccac agcatgacct tcatctgttc cgttaatgaa gatccagaaa aagcagaaag 901 tgtctgccgt gacttcctat caaactggta cgagtcctac accaatgcga ccaatatctt 961 taaagacagt aaccaaactc gtggttatga ctatcacaaa ggtcaatggc gtgactttgt 1021 actacaaggc cataccgata cccgtcgtcg tcttgattac agtaataacc taaaccctgt 1081 tggtacacct gaaaaatgta ttgaaattat ccagcgagat atcgatgcaa cagggatcaa 1141 caacatcacc cttggttttg aagcaaacgg ttctgagcaa 
gaaatcatcg catcgatgga 1201 acgcttcatg acacaagtgg cgccatacct aaaagatccg aaataaactg ccacattaaa 1261 gccattgaat taaattataa ataaggaaaa aaacatgaat tttggattat tctttctgaa 1321 ctttcagctc aaaggtatga catctgaagc agtactagac aacatgatcg atactattgc 1381 tttggttgat aaagacgagt accacttcaa aaccgcattt gtgaacgaac accatttttc 1441 taaaaacggt atcgttgggg cacctatgac agctgcaagt tttctactag gtttaactga 1501 acgccttcat attggttcat tgaatcaagt gatcaccact caccacccag tccgtattgc 1561 agaagaagct agcttacttg atcaaatgtc agatgggcgt tttattcttg ggttaagtga 1621 ttgtgttagt gatttcgaga tggacttctt taaacgccaa cgagatagcc aacaacaaca 1681 attcgaagcc tgttacgaaa ttctaaatga cggtatcact accaactact gttatgcgaa 1741 taatgacttt tataacttcc caaaaatctc tatcaaccca cactgtatta gtaaagaaaa 1801 cctaaaacag tatattttag cgaccagcat gggcgtggtg gaatgggctg cgaaaaaagg 1861 gttaccactg acttaccgct ggagtgatac gctggcagaa aaagaaaatt actatcaacg 1921 ttatttaact gtcgccgctg aaaataatgt cgacattact catgttgatc accaattccc 1981 attacttgtt aacattaatc cggatcgtga tattgctaaa caagaaatgc gtgactatat 2041 ccgtggttat attgctgaag cttacccaaa tacagatcaa gaagaaaaaa ttgaagagct 2101 aattaagcaa catgcggttg gtacagaaga tgaatattat gaatcatcta aatatgcttt 2161 agaaaaaaca ggttcaaaga atgtattgct atcttttgaa tcaatgaaaa ataaagccgc 2221 tgtcatcgac cttattaata tggttaatga aaaaatcaag aaaaatctat aataaataac 2281 aggataataa aaatgacaaa atggaattat ggcgtcttct tccttaattt ttaccatgta 2341 ggacagcaag agccatcatt aaccatgagc aatgcgttag aaacattacg tattatagat 2401 gaagatacat ctatctatga tgttgttgca tttagcgaac accacataga taaaagctac 2461 aatgatgaaa cgaaattagc gccatttgtt agccttggca aacaaattca tattttagcc 2521 accagccctg aaacggttgt aaaagcggct aaatatggga tgccactact gtttaaatgg 2581 gatgatagtc aacaaaagcg tatcgaatta ttaaaccatt accaagcagc tgcggctaaa 2641 tttaatgtcg atattgcagg tgttcgtcat cgattaatgt tatttgtcaa tgttaatgac 2701 aacccaacgc aagccaaagc tgagcttagc atttacttag aagattacct ctcttacacc 2761 caagcagaaa catccattga tgaaatcatc aatagcaatg ctgcaggcaa cttcgatacg 2821 tgtttacatc acgttgctga aatggctcaa ggtttaaata ataaagtcga 
tttcttattt 2881 tgctttgaat cgatgaaaga tcaagagaat aaaaaatcac taatgattaa ctttgataaa 2941 cgcgttatta attatagaaa agaacacaac cttaactaat tcagttaagt caatttaaat 3001 taaaacttcg tcaatcattg tcattattaa tggcagtgtg gcttcttacg ctgccattaa 3061 attttttatt aaggtgtaat atgactactt tattagatat tgatactaac gatattattg 3121 ttagttcaga actcgatgat attattttct catcatcacc gtttacatta acctttgatg 3181 agcaagaaaa attaaagcaa // LOCUS YSCSLP1A 3456 bp ss-mRNA PLN 10-JUL-1990 DEFINITION S.cerevisiae vacuolar function expression protein (SLP1) gene, complete cds. ACCESSION M34474 KEYWORDS . SOURCE S.cerevisiae, cDNA to mRNA, clone pYKK101. ORGANISM Saccharomyces cerevisiae Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes; Endomycetales; Saccharomycetaceae. REFERENCE 1 (bases 1 to 3456) AUTHORS Wada,Y., Kitamoto,K., Kanbe,T., Tanaka,K. and Anraku,Y. TITLE The SLP1 gene of Saccharomyces cerevisiae is essential for vacuolar morphogenesis and function JOURNAL Mol. Cell. Biol. 10, 2214-2223 (1990) STANDARD simple staff_entry FEATURES from to/span description pept 692 2767 SLP1 protein signal 536 544 TATA box BASE COUNT 1136 a 588 c 691 g 1041 t ORIGIN 1 ctgcagctaa tcacgtgctc acatctttac tcaatgagat tgatggtgtt gaagagttaa 61 agggtgtagt tattgtagcg gcgacgaata gacctgatga aatagatgct gctcttctaa 121 ggcctggtag gttagataga cacatttacg ttggccctcc agacgtaaac gcccgcttgg 181 aaatcttaaa gaagtgcaca aagaaattta atacagaaga gtctggagtc gatcttcatg 241 aattggcaga ccgtacagaa ggttattccg gagctgaagt tgtgctgctt tgtcaagaag 301 cgggcttggc tgccataatg gaagatttag atgtcgcaaa agtggaatta cgtcattttg 361 agaaagcttt taaaggaatt gctaggggca ttactccaga aatgctctct tattatgaag 421 agtttgctct aagaagcggt tcatcttcgt aagcttgttc atagtcaatt cttttccttt 481 gtgtgctcaa taatagtaga tagaaattat actgaactcc ggtcattttg tataatatat 541 taatcacttc acacgaacat acataaataa aatatcataa aggttagcaa attggaacta 601 gttatatgtt aattagttaa aagatagaaa attcgagaaa ggaagaaaaa gctgatattg 661 cccatctcca actttatcaa atcatttcac gatgaataga ttttggaata ctaagaaatt 721 ttcattaaca aatgccgatg 
gactatgtgc taccttaaat gagatatctc aaaatgatga 781 agttcttgtg gttcaaccaa gtgtattgcc agtactcaat agtttgctaa ctttccaaga 841 tttgactcaa tcaactcctg taaggaaaat tacgttactc gatgatcagc taagtgacga 901 tttaccgagt gccttaggca gcgttccgca aatggatctt atttttctta ttgatgtcag 961 aacatctctc cgactccctc cacaactgct tgatgctgct caaaagcaca atttatcatc 1021 tttgcatata atatactgtc gatggaaacc gtctttccaa aatactttgg aggatacaga 1081 gcaatggcaa aaggatggtt tcgatttgaa ttcaaaaaaa acacatttcc ctaacgtcat 1141 tgaatctcag ttaaaggagc tatcgaacga atataccctt tacccttggg atctcttgcc 1201 cttcccacag attgatgaaa atgttctatt gactcattcc ctttataaca tggaaaatgt 1261 aaacatgtat tatcccaact tacgttcttt gcagagtgcc acagagtcaa tactggttga 1321 tgatatggtc aattcgttgc agagcttgat ttttgaaact aatagtatca taacaaatgt 1381 tgtgtcgata ggtaatctgt ctaagagatg tagccatctt ttgaagaaac gaatcgatga 1441 gcatcaaaca gagaatgatt tattcatcaa gggtacgctt tatggtgaac gaaccaactg 1501 tggactagaa atggacttga ttatcttgga aaggaatacc gatcctataa cgccattgtt 1561 gacacaactt acgtatgcag gaatactaga tgatctatat gaattcaatt ctggcataaa 1621 gataaaggag aaagacatga acttcaatta taaggaagat aaaatatgga atgatttgaa 1681 atttttaaat tttgggtcga ttgggccgca gttaaataaa ttggcaaagg aactacaaac 1741 gcaatatgat acaaggcata aagccgagag cgtacatgaa atcaaagaat tcgttgattc 1801 cttaggttct ttgcaacaaa ggcaagcttt tttgaaaaat cacacaacct tatcatccga 1861 cgttttgaaa gtggtagaga ctgaagagta cggatctttc aataaaatct tagagttaga 1921 gctggaaatt ttgatgggaa atacacttaa taacgacatt gaagatatta tactcgagtt 1981 gcagtaccag tacgaggttg atcaaaagaa gattctcaga ttaatctgtt tattgtctct 2041 ttgtaaaaat tcacttcgag aaaaggatta tgaatatcta agaaccttta tgatcgactc 2101 ttggggcatt gaaaaatgct ttcaacttga atcattggct gagttaggat ttttcactag 2161 caaaacggga aaaactgatt tgcatattac aacaagtaag tcaacaagat tacagaaaga 2221 ataccgttat atttcacaat ggttcaatac agtacccata gaagacgagc atgctgccga 2281 taaaatcaca aatgagaacg atgacttctc ggaagccact tttgcttaca gtggtgtagt 2341 gcccttgaca atgagactgg ttcagatgtt atatgatagg tctatcttgt tccataatta 2401 ttcctcgcag cagcctttta tactgtcaag 
agaacctaga gtttctcaaa cggaggattt 2461 aattgaacag ttatatggag actcacatgc gatcgaagag agtatatggg tcccgggaac 2521 cattacaaaa aagatcaatg caagcatcaa gagcaataat agacggtcca tagacggatc 2581 taatgggaca tttcatgctg cagaggatat tgcactcgta gtattcctcg gaggtgtaac 2641 aatgggtgaa atagctataa tgaagcattt gcaaaaaata ctaggtaaaa aaggtatcaa 2701 taaaaggttt atcatcatcg ccgatggctt gatcaatggc acaaggatca tgaactctat 2761 atcttaatta ttatatgata gatttgttaa ttttttgtat atgcaaatgt gcttttttca 2821 ccaaacggtt tgcaccaatc atacgagaga agtgttcggt gtttacggaa aagctagggg 2881 actaagaaaa attgaaaata aaggctgaca gcagtagaaa ccattgtgct ggcttagtga 2941 tttataagaa tggttaatta gttttgtatc ctttattttc tagatagagc cacagagcaa 3001 actaaacaga aaagttatcc atttccatta cgcaatgttg tgccaacaga tgattagaac 3061 gacagctaag agaagtagca atatcatgac cagacctatt atcatgaaga ggtcagtaca 3121 cttcaaagac ggtgtgtatg aaaatatccc attcaaagtc aaaggaagaa agacacctta 3181 cgccttatct catttcgggt tcttcgctat tggatttgct gttccatttg ttgcctgcta 3241 tgttcaattg aaaaagtcag gtgcttttta aaacaccccc ctaagttgaa ggatagatgt 3301 gtgtacatag cgtgcttggt tgagacgttt tagagtgtgt tctttgctat tcctaggtgc 3361 gcatatcatc gttttattta tttgtacaat tttcttttca tatattcata atcctctcct 3421 tgtgccttcg tattgagacg gcgggaaaga aggatc // LOCUS CHKMHBLBA 2405 bp ds-DNA VRT 10-JUL-1990 DEFINITION Chicken MHC class II B-LBII-beta gene, complete cds. ACCESSION M29763 KEYWORDS cell surface glycoprotein; class II gene; integral membrane protein; major histocompatibility complex. SOURCE Chicken (haplotype B12) DNA. ORGANISM Gallus gallus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Aves; Neornithes; Neognathae; Galliformes; Phasianidae. REFERENCE 1 (bases 1 to 2405) AUTHORS Zoorob,R., Behar,G., Kroemer,G. and Auffray,C. TITLE Organization of a functional chicken class II B gene JOURNAL Immunogenetics 31, 179-187 (1990) STANDARD full staff_review REFERENCE 2 (bases 1 to 16; 2332 to 2405) AUTHORS Zoorob,R., Behar,G., Kroemer,G. and Auffray,C. 
JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by R.Zoorob, 06-NOV-1989, for release after publication. Author address: R.Zoorob Institut d'Embryologie Cellulaire et Moleculaire du CNRS et du College de France 49bis av. de la Belle Gabrielle F-94736 Nogent sur Marne France FEATURES from to/span description pept 828 918 MHC B-LII-beta chain, exon 1 1127 1396 MHC B-LII-beta chain, exon 2 1483 1764 MHC B-LII-beta chain, exon 3 1847 1957 MHC B-LII-beta chain, exon 4 2049 2072 MHC B-LII-beta chain, exon 5 2175 2188 MHC B-LII-beta chain, exon 6 pre-msg 808 2331 MHC B-LII-beta chain mRNA and intron IVS 919 1126 MHC B-LII-beta chain intron A IVS 1397 1482 MHC B-LII-beta chain intron B IVS 1765 1846 MHC B-LII-beta chain intron C IVS 1958 2048 MHC B-LII-beta chain intron D IVS 2073 2174 MHC B-LII-beta chain intron E BASE COUNT 380 a 728 c 902 g 395 t ORIGIN 1 ggatccatgg gtgacgtaag gatgaggttc cagcacatat tggacccttc tgcgtttgca 61 tggagggatc ttcgggggat ctttgtgatc ttcagtgatt ttcagtggtc tttggtggtc 121 ttcagtgctc ttcgttggtc tttgacaaag atgcagagga gcaccgctcc cagacggacc 181 ccccggggac cccatttgtc gccatcccca ctgggacatg cagccattga ccacagccct 241 ccggctgcga ccacccaact gattccttat ccaaagtcca ctctttgcac acttacctcc 301 aatttagtga taaggatgtg gcgtgggacc gtcccaatgg ccgcacacaa gtccaggtag 361 atgatatggg atgaccatga agggatcaca gagaggaaca cggggtgacc acgaggagca 421 acgaaggaaa cgctgagtga ccacgggcag aaaatggtgt gaccattagg ggacaacgag 481 agggaacaga agtagtaagg agtgagaatg gggtgacaaa gaggtgacca tggcataact 541 ttgataagac cattgggtga ccgcagggtg atggccatac catggggtga gcactggatg 601 accatggagg tcattggagg accatcgggt gggacgaggg ccgtggggac acccgtgggg 661 cggtgggacg ggggcagagt gtcagaagga gccccgcggc gcagaactct gcctggagac 721 gggtgacgcc gcccggcgcc gccgccgctc attggccctc cccgcccggc cccgggctcg 781 cggctggcgc ggggtgccgg gtcccccatc gtccggcggc agcagccatg gggagcgggc 841 gcgtcccggc ggcgggggcc gtgctggtgg cactgctggc gctgggagcc cggccggccg 901 
ccggcacgcg gccctcgggt gagctcggag ccgcggcgcg gggacggcgc tgcgtccccc 961 ccggagaaac ccccggagcc cttctggccg tgcgcagcgc tcggggctgc ggggggacgg 1021 agggcggggg ggggcggcgg agccgtgggg ggcagcgggg ccggggaggg ggcggggggt 1081 gtggcggggg gcggctgtgt gccctgaccg tgccctctgc ccgcagcgtt cttcttctgc 1141 ggtgcgatat ccgagtgcca ctacctgaac ggcaccgagc gggtgaggta tctgcaaagg 1201 tacatctaca accggcagca gttcacgcac ttcgacagcg acgtggggaa atttgtggcc 1261 gattcaccgc tgggtgagcc gcaagctgaa tactggaaca gcaacgccga gcttctggag 1321 aaccgaatga atgaagtgga caggttctgc cggcacaact acgggggtgt ggagtccttc 1381 acggtgcaga ggagcggtga gtgccgcggg gcgcagcgcg gacggacggg caggcgccgc 1441 gctctggcgg tcggtccgca gcgctccccc cgtgccccgc agtggagccc aaggtgaggg 1501 tctcggcgct gcagtcgggc tccctgcccg aaaccgaccg tctggcgtgc tacgtgacgg 1561 gcttctaccc gccggagatc gaggtgaagt ggttcctgaa cgggcgggag gagacggagc 1621 gcgtggtgtc cacggacgtg atgcagaacg gggactggac gtaccaggtg ctggtggtgc 1681 tggagaccgt cccgcggcgc ggggacagct acgtgtgccg ggtggagcac gccagcctgc 1741 ggcagcccat cagccaggcg tggggtaagg cccccgggcc ctgccccgcc gcggggggag 1801 cgggagcgcg gcccgccgcg ctgagccgcc gccttcgtcc ccgcagagcc gccggcggac 1861 gcgggcagga gcaagctgct gacgggcgtg gggggcttcg tgctggggct cgtcttcctg 1921 gcgctggggc tcttcgtgtt cctgcgcggt cagaaaggtg agcgctgggg aggggggctg 1981 cgcggggggg gtcgggagcg gggggtgggg ggcagcgtcc gcgctgacct cgtctcgctg 2041 tgtttcaggg cgccccgtcg ccgccgctcc aggtaacgtc ccgttcccat tcccgttccc 2101 gttcccgttc ccgttccgcg ctgcgagcgg ccccgatccc ggcgcggggc tcagctctgc 2161 ccgtctcccc gcagggatgc tgaattagct gctgccccgc cgagccgctg cacccgcacc 2221 ccccgctctc ccggccgtcg cctcggctct ccctcgggct gccaccgcgt ccgttggaga 2281 tgtcgccacg atgcacgctt cgtccccatc ctaataaacg cgctgacttt gaccccgctg 2341 ttcgctgccc gtgaatcatt ggggactttc cgtcgcgtgg gaggagggga gggaagtgaa 2401 agctt // LOCUS CHKMHBLIIB 444 bp ds-DNA VRT 10-JUL-1990 DEFINITION Chicken MHC class II B-LBIII-beta gene, exon 1. 
ACCESSION M29764 KEYWORDS cell surface glycoprotein; class II gene; integral membrane protein; major histocompatibility complex. SOURCE Chicken (haplotype B12) DNA. ORGANISM Gallus gallus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Aves; Neornithes; Neognathae; Galliformes; Phasianidae. REFERENCE 1 (bases 1 to 444) AUTHORS Zoorob,R., Behar,G., Kroemer,G. and Auffray,C. TITLE Organization of a functional chicken class II B gene JOURNAL Immunogenetics 31, 179-187 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by R.Zoorob, 06-NOV-1989, for release after publication. FEATURES from to/span description pept 125 / 215 MHC B-LIII-beta chain, exon 1 pre-msg 105 > 444 MHC B-LIII-beta chain mRNA and intron IVS 216 > 444 MHC B-LIII-beta chain intron A BASE COUNT 50 a 156 c 176 g 62 t ORIGIN 1 ctgatcgggg tacccgcaac ggagatctgc ctggagacgg gtgatgccgc ccagcccagg 61 cactcactgc tccagagcag cggcgcgggc tgccggcacc cttcctcctc ctccggcagc 121 agccatgggg agcggccgtg tcctggtggc cggggccgtg ctggtagcac tggtggcgct 181 gggagcacgg caggccgccg gcacgcggcc ctcaggtgag ctcggagtcc cggtgtgggg 241 atggtgcagg gtggtccctc ccggtgtctc ccggcgccca ccccagcccc gtgcgcagcg 301 ctcggagctc cgcggctcag gatgccggcg acagcgcgtc cgcagccgtc gtgggcgtgg 361 ggggcacggg acggagcgcg gacgggagtg gctttcgggt ctgccgaggg gcagctggct 421 cctgacggtg ccccctcccc gcag // LOCUS RATLY6A 1221 bp ds-DNA ROD 10-JUL-1990 DEFINITION Rat Ly6-A antigen gene, exon 2. ACCESSION M30692 KEYWORDS antigen. SOURCE Rat (strain Sprague-Dawley) adult kidney, cDNA to mRNA and DNA, (library of Clontech), clone RK6. ORGANISM Rattus norvegicus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 1221) AUTHORS Friedman,S., Palfree,R.G.E., Sirlin,S. and Haemmerling,U. 
TITLE Analysis of three distinct Ly6-A-related cDNA sequences isolated from rat kidney JOURNAL Immunogenetics 31, 104-111 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by S.Friedman, 14-DEC-1989, for release after publication. FEATURES from to/span description pept / 497 834 Ly6-A antigen, exon 2 (put.) (AA at 499) IVS < 1 496 Ly6-A intron A (no splice consensus) signal 1204 1209 poly-A signal BASE COUNT 286 a 315 c 314 g 306 t ORIGIN 1 gtagtccggc tgctggctga gttgtaaggc aggagggagg ctgggtgtgt tttgtcttgc 61 atgtagccct ctctgcagag ggcctggctt cactcacaca agcctggtaa catctggtac 121 atcgaactct aagaatcggc aagcccactg ctgccgtctc cttaagagtt catttaggga 181 gtctgtcagg aacttgggca ggagtccaca ctaagggaag cttacttccc aaacagtggt 241 gctgggtgga aagtggagga ctcatgagaa cccctagttt aagactttta gagaagcagt 301 ctgaagcact gtggagatgt ggtcccatcg ccatcctgga gtagggataa ttttgcccag 361 gagccccagc aatgggtcag aggagcaaaa cgacgacagc tgtaagtggt ctcagaagat 421 gctagaggaa acagaagatg aactggcagc tgagacttgg cggtaactta ctggcttcga 481 cactatgcgt gttactctca gggcctaaac tgctacaatt gcacgatgat cccatttggt 541 aatacctgct catcaactgc tacctgcccc taccctgatg gagtctgtgc tattcaggtg 601 gcagaagttg ttatgagctc tgtaagacag aaagtaaagg accatatttg ccttcccgtc 661 tgcccaacga gtcctcaaac aaccgagatc ctgggtactg ttgtcgacat gaagatttcc 721 tgttgcaata cagatctttg caacgcagca gggcccactg gaggcagcac ctggaccatg 781 gcaagggtgc ttctgttcag cctgggctca ttcctcctgc agaccttgct gtaatggctc 841 ctccaaggcc ccgccaccct tgtcctttta tcctcatgtg taatcactcc tccctggagc 901 cctctagtga taaattctga gtaatagaaa ctctgaggtg ggggtagggt gtggaacacc 961 ttgtttcaac tctatagccc ctgctgggta ggtgccccac tcccctctct agggctttca 1021 gatatgtact tcctggaatg ccattatgtt gtggtttgct gctcttggcc ctggaggcat 1081 gtggacagca cggggaagag acagaaaccc aaggcactgt gtgaccacct ccatccatac 1141 ataaaaatct ggggtcctgc agggttccca cacatgcctc tcaacatccc cctatttgag 1201 tccaataaac tctctgttct c // LOCUS RATLY6B 905 bp ss-mRNA ROD 10-JUL-1990 DEFINITION Rat 
Ly6-B antigen mRNA, complete cds. ACCESSION M30689 KEYWORDS antigen. SOURCE Rat (strain Sprague-Dawley) adult kidney, cDNA to mRNA, (library of Clontech), clone RK10. ORGANISM Rattus norvegicus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 905) AUTHORS Friedman,S., Palfree,R.G.E., Sirlin,S. and Haemmerling,U. TITLE Analysis of three distinct Ly6-A-related cDNA sequences isolated from rat kidney JOURNAL Immunogenetics 31, 104-111 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by S.Friedman, 14-DEC-1989, for release after publication. FEATURES from to/span description pept 103 510 Ly6-B antigen (put.) signal 878 882 poly-A signal signal 599 604 poly-A signal BASE COUNT 206 a 248 c 213 g 238 t ORIGIN 1 ctcttgctct cctccagcca caagtggtct cagaagatgc tagaatgtag aggaaacaga 61 agatgaactg gcaggttttg cctgtgcgcc ccttctcaga ggatgaacag atcttgtgct 121 atgaagtcct gtgtgctcat ccttctcctg gccctactgt gtgcagaaag agctcagggg 181 ctaaactgct acaattgcac gatgatccca tttggtaata cctgctcatc aactgctacc 241 tgcccctacc ctgatggagt ctgcactatt caggtggcag aagttgttgt gagctctgta 301 agactgaaag taaagagcaa tctctgcctt cccggctgcc ccaagagtcc tcaaacacct 361 gaggtcctcg gtaccgttgt ccatgtgaat actgactgtt gcaatacaga tctttgcaac 421 gcagcaggtc ccactggagg cagcacgtgg accatggcag gggtgcttct gttcatcctg 481 ggctcagtcc tcctgcagac cttgctgtga tggaccctcc aaggccctgc cacccttgtc 541 cttttatcct tatgtgtaat cactccttcc tggagccctc tagtgataaa ttctgagtaa 601 taaaaattca gaggggggat tgagtgtgga acaccttgtt gcaactctat agccactgct 661 ggataggttc cccactcccc tctctagggc tttcagatat gtacttccta gaatgccatt 721 gtgttttggt ttgctgctct tggccctgga ggcaggggac agcacgggga agaggcagaa 781 acccaaggca ctgtgacacc acctccatcc atacataaaa atctggggtt ctgcagggtt 841 cccacacatg cctctgaaca tccccctatt tgagtccaat aaactctctg ttctcccacg 901 gaatt // LOCUS RATLY6C 931 bp ds-DNA ROD 10-JUL-1990 DEFINITION Rat Ly6-C antigen 
gene, complete cds. ACCESSION M30690 KEYWORDS antigen. SOURCE Rat (strain Sprague-Dawley) adult kidney, cDNA to mRNA and DNA, (library of Clontech), clone RK3. ORGANISM Rattus norvegicus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 931) AUTHORS Friedman,S., Palfree,R.G.E., Sirlin,S. and Haemmerling,U. TITLE Analysis of three distinct Ly6-A-related cDNA sequences isolated from rat kidney JOURNAL Immunogenetics 31, 104-111 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by S.Friedman, 14-DEC-1989, for release after publication. FEATURES from to/span description pept 76 262 Ly6C antigen, exon 1 (put.) 342 559 Ly6C antigen, exon 2 (put.) IVS 263 341 Ly6C antigen intron A (no splice consensus) signal 927 931 poly-A signal BASE COUNT 202 a 257 c 224 g 248 t ORIGIN 1 gccctgggac gtaattggaa gtctattaac tggctccaat ttccaaggtt ttctctgtgc 61 accccttctc tgaggatgaa cagttcttgc gctatgaagt cctgtatgct catctttttc 121 ctggccctac tgtgtgcaga aagagctcag ggcctaaagt gctacagttg catagaagtc 181 ccacttaatg ctaactgctc aacagctacc tgcccctact ctgatggagt gtgtgtttct 241 caggtgttag aagctgtaga gggtctccta gatgcaactt cccagggaac tgcaagagtc 301 tgagaggctg gttgcccttt ttgctctgcc actgagtgat cgctctgtaa gacggacagc 361 aaagagcaat ctctgccttc caatctgccc caagtttcct caaagaaccg agatcctggg 421 taccgttgtc tacacgaagg tttcctgttg caatacagat ctttgcaatg cagcaggtcc 481 cactggaggc agcacctgga ccgtggcagg ggtgcttctg ttcagcctgg gctcagtcct 541 cctggagacc ttgctgtgat ggcccctcca aggccccgcc acccttgtcc ttttagcctc 601 atgtgtaatc actcctctga agccctctag tgataaattc tgagtaatag aaactcccag 661 gtgggggtag ggtgtggaac accttgattc aactctatag cccctgctgg gtaggtgccc 721 cactcccctc tctaggactt tcagatctgt acttcctgga atgccattgt gttgtggttt 781 gctgctcttg gccctggagg cacatggaca gcacagggaa gaggcagaaa cccaaggcac 841 tgtgacacca cccccatcca tacataaaaa tctggggttc tgcagggttc ccacacatgc 901 ctctcaaggt 
tcccctattt tagtccaata a // LOCUS RATLY6CA 783 bp ss-mRNA ROD 10-JUL-1990 DEFINITION Rat Ly6-C antigen mRNA, exon 2. ACCESSION M30691 KEYWORDS antigen. SOURCE Rat (strain Sprague-Dawley) adult kidney, cDNA to mRNA, (library of Clontech), clone RK11. ORGANISM Rattus norvegicus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 783) AUTHORS Friedman,S., Palfree,R.G.E., Sirlin,S. and Haemmerling,U. TITLE Analysis of three distinct Ly6-A-related cDNA sequences isolated from rat kidney JOURNAL Immunogenetics 31, 104-111 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by S.Friedman, 14-DEC-1989, for release after publication. FEATURES from to/span description pept / 1 403 Ly6-C antigen, exon 2 (put.) (AA at 2) signal 771 783 poly-A signal BASE COUNT 173 a 219 c 187 g 204 t ORIGIN 1 gaacagttct tgcgctatga agtcctgtat gctcatcttt ttcctggccc tactgtgtgc 61 agaaagagct cagggcctaa agtgctacag ttgcatagaa gtcccactta atgctaactg 121 ctcaacagct acctgcccct actctgatgg agtgtgtgtt tctcaggtgt tagaagctgt 181 agagggctct gtaagacgga cagcaaagag caatctctgc cttccaatct gccccaagtt 241 tcctcaaaga accgagatcc tgggtaccgt tgtctacacg aaggtttcct gttgcaatac 301 agatctttgc aatgcagcag gtcccactgg aggcagcacc tggaccgtgg caggggtgct 361 tctgttcagc ctgggctcag tcctcctgga gaccttgctg tgatggcccc tccaaggccc 421 cgccaccctt gtccttttag cctcatgtgt aatcactcct ctgaagccct ctagtgataa 481 attctgagta atagaaactc ccaggtgggg gtagggtgtg gaacaccttg attcaactct 541 atagcccctg ctgggtaggt gccccactcc cctctctagg actttcagat ctgtacttcc 601 tggaatgcca ttgtgttgtg gtttgctgct cttggccctg gaggcacatg gacagcacag 661 ggaagaggca gaaacccaag gcactgtgac accaccccca tccatacata aaaatctggg 721 gttctgcagg gttcccacac atgcctctca aggttcccct attttagtcc aataaactct 781 ctg // LOCUS RATTAG1 5040 bp ss-mRNA ROD 10-JUL-1990 DEFINITION Rat axonal glycoprotein (TAG-1), mRNA, complete cds. 
ACCESSION M31725 KEYWORDS glycoprotein. SOURCE Rat 13 day old embryo spinal cord axon, cDNA to mRNA. ORGANISM Rattus norvegicus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 5040) AUTHORS Furley,A.J., Morton,S.B., Manalo,D., Karagogeos,D., Dodd,J. and Jessell,T.M. TITLE The axonal glycoprotein TAG-1 is an immunoglobulin superfamily member with neurite outgrowth-promoting activity JOURNAL Cell 61, 157-170 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by Furley,A.J.W., 30-JAN-1990, for release after publication. FEATURES from to/span description pept 224 3346 axonal glycoprotein (TAG-1) precursor sigp 224 313 axonal glycoprotein signal peptide matp 314 3343 axonal glycoprotein BASE COUNT 1144 a 1486 c 1397 g 1013 t ORIGIN 1 gaattcccgc ccgctgccgc cacgccagga cagccagtgg ctaaggccgg cggggcaagc 61 agccctgagg ctggcagcag ggtctgctca ccaggcggcc gcagcagtgc cccagccaac 121 acccttcccg cactctaggt gtgcctgagt ctccagttga ttctcccgga gcggagctgc 181 ggctcctctc ttttggactc tgcctctgcc tgaaagaccc accatgggga cacacgccag 241 gaaaaaggca agcttgctgc tgctggtgct ggccacagtg gccctggtct cctctccagg 301 atggagtttt gcccagggaa ccccagctac ctttggaccc atcttcgaag agcaacccat 361 tggcctgcta ttcccagagg agtctgcaga ggatcaggtg acactggcgt gccgtgcccg 421 tgctagccct ccagccacct acaggtggaa gatgaatggc acagatatga acctggaacc 481 tggctcccgt caccagctga tggggggcaa cctggtcatc atgagcccca ccaagacaca 541 ggatgctggt gtctaccagt gcctagcctc caacccagta ggcactgtgg tcagcaagga 601 ggctgtcctc cgctttggct ttctacagga attctccaag gaggagagag accctgtgaa 661 aacccatgag ggctggggag tgatgctgcc ctgtaacccg cctgcccatt acccaggttt 721 gtcctaccgc tggctcctca acgagttccc caacttcatc ccaacggatg ggcgacactt 781 cgtgtcccag actacaggaa acctgtacat cgcccggacc aatgcctcag acctgggcaa 841 ctactcttgt ttggctacca gccacatgga cttttccacc aagagtgtct tcagcaaatt 901 tgcgcagctc aacctggctg cggaagatcc ccgactcttc gctcccagta 
tcaaagctcg 961 gttccccccg gagacctacg cactagttgg gcagcaagtc accctggagt gctttgcctt 1021 tgggaacccg gttccccgga tcaagtggcg caaagtggat ggttccttgt cccctcagtg 1081 ggccacagct gagcccaccc tgcagatccc cagcgtgagc tttgaagacg agggtaccta 1141 tgaatgtgag gcagagaact ccaagggtcg tgacaccgtc cagggacgca tcatcgtgca 1201 agctcagcct gagtggctaa aggtgatctc agacacagag gccgacattg gctccaactt 1261 acgttggggc tgtgcagcag caggcaaacc ccggcccatg gtgcgctggc tgagaaacgg 1321 ggaacctctg gcctcccaga accgggtgga ggtcttggct ggggacctgc gattctctaa 1381 gctgagcctg gaggactctg gcatgtacca gtgtgtggct gaaaacaagc atggcaccat 1441 ctatgccagt gctgagctgg ctgtacaagc tctggcccca gacttcaggc agaaccctgt 1501 gagacggctg atccctgcag ctcgaggcgg agagatcagc atcctgtgcc agcctcgcgc 1561 agccccaaaa gctacaatac tttggagcaa gggtactgag attttgggga acagtaccag 1621 agtgactgtc acttccgatg gcaccttgat catcagaaac atcagccgat ccgatgaagg 1681 caaatatacc tgctttgctg agaacttcat gggcaaagcc aacagtaccg ggatcctgtc 1741 cgtgcgcgat gcaaccaaga tcaccctggc tccctccagt gctgacatca acgtgggtga 1801 caacctgacc ctacaatgtc atgcctcgca cgaccccact atggacctca cgttcacctg 1861 gaccctggat gatttcccta ttgactttga taagcctgga ggtcactacc ggagagccag 1921 tgcgaaggag accattgggg acctgactat cctcaacgcc cacgtacgcc atggagggaa 1981 gtacacatgc atggcccaga ctgtggtaga tggtacatcc aaggaggcca cagtcctggt 2041 ccgaggtccc ccaggtcccc cagggggtgt ggtggtgaga gacatcggag acaccaccgt 2101 tcagcttagc tggagtcgtg gctttgacaa ccacagcccc attgccaagt acacgctgca 2161 agctcgtact ccaccctcgg ggaaatggaa gcaggttcgg accaatcctg tgaatatcga 2221 gggtaatgcc gagactgccc aggtgctggg tctcatgcct tggatggact atgagtttcg 2281 ggtttcagct agcaacatct tgggcactgg ggagcccagc gggccctcca gcaaaatccg 2341 cactaaggaa gcagtcccct cagtggcacc atcgggactc agtggagggg gaggagcccc 2401 tggagagctc atcatcaact ggactcccgt gtcacgggag taccagaacg gagacggctt 2461 cggctacctg ctgtccttcc gcaggcaagg cagctccagc tggcagactg cccgggtgcc 2521 tggcgctgat gcgcagtact tcgtctacgg caatgacagc atccagccct acacaccctt 2581 tgaggtcaag atccgaagct acaatcgccg gggggatggg cccgagagcc tcactgcgtt 
2641 agtgtactca gcagaggaag agcccagggt ggcccctgcc aaggtctggg ccaaggggtc 2701 ctcatcttca gagatgaacg tgagctggga gcctgtgcta caagacatga acggcattct 2761 cctgggatat gagattcgct actggaaagc cggggacaac gaagcagccg ctgaccgagt 2821 gaggacagca gggctagaca ccagtgcccg agtcactggc ctgaacccca acaccaaata 2881 ccacgtaact gtgagggcct acaaccgggc cggcactgga cccgctagcc cttcagctga 2941 tgccatgacc gtgaagcccc cgccacggag acctcctggc aacatctcct ggactttctc 3001 aagctccagt ctcagcctta agtgggaccc tgtggttcct ctccgaaatg aatctacggt 3061 cactggctac aagatgctgt atcagaatga tttgcaccca actcctacgc tccacctcac 3121 cagcaagaac tggatagaaa taccagtacc cgaagacatt ggccacgctc tggtacagat 3181 tcgaaccaca gggcctggag gggatgggat ccccgcagaa gtccacattg tgagaaatgg 3241 aggcacaagc atgatggtgg agagcgccgc cgcccgccct gcccatcccg gacctgcgtt 3301 ctcctgcatg gtgatattga tgctcgctgg ctaccagaag ctctgatctc aacactgccc 3361 gccacgccca agctggacac ccaccctaac agacacagcg gctgaccaca gctccctttc 3421 gtccaaggtg gtccaacact gtgcctgagc gtggttggct tagacaccta ctcccaacag 3481 taccctttat gtaggaggta ggatattcct attctgccac aggatagaac catgcgagga 3541 aattttcttt aagtcaagag gcactgggca gtgacttcca tgataatagt actaggccta 3601 atgcctggac cccttggggt cttggtcgaa aggaacgggc ctttgattaa gcagatggtc 3661 ctttggggcc acaagtggca ctgccatctg agatcagagt accaggccca gcaggaacat 3721 gggcagcagt ggggtattgt tttccctcta tgaagcagag ggacctcttc tagtcctcac 3781 tggagaagca ccatggttgg tcccgacacg gtcttccatg actccctggc ttcctcggta 3841 gccaaggaca aggccctggg ttactgggga tagaagctca aaagggttga gaggctaccc 3901 cacccgatgg aaaggggcac cagcctaagc ccattggcca tcctggtggc actgccctct 3961 cagccagcac tgccaagcca atcctgtcgt cctccagatg gaatggtgga gtgacagagc 4021 cacttcaggt ggctatgtga ctaaagggct tgcctcgagg agttgccttg cctcatcaag 4081 atgcttcctt catggaccct ccagggtacg ggcaggagat gtccatctga acgctactct 4141 cttcccttca gctctgctgc aaacttgtgc ctgcctccac ctcccacaac tgcaggcccc 4201 agaaatcagc tctcaacaca gcatccattc tttgtcctgg gatagagagg catccgagaa 4261 gggccagcat caaagtggcc ctgcctgctt ccaggaatat cctccatcac ctggccacac 4321 
ctgctcccca gaactgcctg gactactctc ttcagtcccc acaagaaaaa gggttaataa 4381 gggggggggg ggtggcctgc cttgagttct gggtagttac cagggataga ccagactacg 4441 ggagctgaag aagccttata acttgactta tccgtaccct acacttaaca gacgaggaaa 4501 tggaggtgca gaagggttag ggacttcttg ggggtcacat ggtctgtaag gacaaggcat 4561 ggtcagcaca gggtctcctc cccacctgtg ggaggctcta tagagagagg gaggatgttg 4621 agcagtcaca gcctgtcctc taggactctg gaggactctg gaggaggagc cctctgcttc 4681 aagaggttct ggctggtgag atggacaaat gagctccaac caaggcatag gcagattcca 4741 ggagtcaatg gcctggggca gccttctgct gggaactcgg cagggagcac tgtctggaag 4801 cctctcgggc ttgctcattt caagaagagg ccaaagcaag gacagagttc cttagacgag 4861 gaccctgcag cagcacgacc agaaaacccc agtgtccacg ccctcagccc acgggggcag 4921 cagagcaggc atttcaagat gcacttgccc tgctgctcct taggccattt ctgtagttta 4981 cagttagagc tctattttgt tatgggtttt taaacttcaa gccttgctct gtttttctgg // LOCUS MUSADAM01 2308 bp ds-DNA ROD 10-JUL-1990 DEFINITION Mouse adenosine deaminase (ADA) gene, exon 1 (non-coding). ACCESSION M34242 J04767 KEYWORDS adenosine deaminase. SEGMENT 1 of 11 SOURCE Mouse lung fibroblast cell line B-1/200 DNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 2308) AUTHORS Al-Ubaidi,M.R., Ramamurthy,V., Maa,M.-C., Ingolia,D.E., Chinsky,J.M., Martin,B.D. and Kellems,R.E. TITLE Structural and functional analysis of the murine adenosine deaminase gene JOURNAL Genomics (1990) In press STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by M.R.Al-Ubaidi, 11-MAY-1990. FEATURES from to/span description pre-msg 885 > 2308 adenosine deaminase (ADA) mRNA and introns IVS 1008 > 2308 ADA intron A BASE COUNT 479 a 615 c 698 g 516 t ORIGIN Chromosome 2. 
1 cccacctcaa ggtgcgcaca agttacttaa ggaacttgct acaatatagc cctgctcccg 61 cccccaaaat cccaccaaac ctagagtatg gttctaaaca gctcacctgt taagtctcct 121 tggccaatcc tctagaagtt gaccatagta tgaagttttc tgcagcgtag tttttttctg 181 cccccctttc actactgtgt ctgagcacat gtgctgtgct ttgtagctga aactggcttt 241 attgctgcag aaaccagtcc actgtattta cccacagcac tgatgtgagc attctaaata 301 catctcgatg cgtgggcata tttatccagc gtaactgccc caggagagat gaactgtgtg 361 ttcctgtcca ccccctgtat cagcacctga gactagtctc agagtctctc tcacacacaa 421 cagtgttctc tgcatcccac ccgccctcac ctggtgaact ccggcagtcg ccgctaaatc 481 tccctaatta cacacttctt ctgccttgtg attctgcaac aagtgggtct atccctcaaa 541 atccagcccc ataaggcttc aggactgtgt ggctccagct tcagcctgca caaagtaggc 601 gcccaagcaa cactggaagc ctcggtactg aaggggcccg gaaggggcag gtgagacatt 661 ggagtcacgt ctgcaggggg ctcacctggg agcttcctag ggtgtagcca gcagggaagg 721 tctggggttc agaattccgg gaaatgcgcg ccagagttgc aggcgggggg gggggggggg 781 ggggggcggg gccgtggctc cggaaggcgg ggtctctctg tgggcgtagc gtgggcgggg 841 ctgtgcgggg cagcccggta aaaaagagcg tggcgggccg cggtctctga gagccatcgg 901 gaagcgaccc tgccagcgag ccaacgcaga cccagagagc ttcggcggag agaaccggga 961 acacgctcgg aaccatggcc cagacacccg cattcaacaa acccaaagta agcaccgagg 1021 ggctccgttg ccagggttct gtcgggctgt cccggggctt agcggggccc acctttggcg 1081 cctttaacct agaagcatgg agtggcaggg ggactcccgc aggcatctcc cctcgaccca 1141 ggccttagct tgcttccggg atgtcgagcg agagacgatg tggcagggag tgtccagaag 1201 ggctccgttg ccagggttct gtcgggctgt cccggggctt agcggggccc acctttggcg 1261 cctttaacct agaagcatgg agtggcaggg ggactcccgc aggcatctcc cctcgaccca 1321 ggccttagct tgcttccggg atgtcgagcg agagacgatg tggcagggag tgtccagaac 1381 ctgggggtgt ctctggtcgg ccttcgggtt cggctgctgt ctatgcgaac ctgggagtgc 1441 ctccagtcgg ccttcgggtt cggctgctgc ctatgccctg tgccctggag gtctcagcct 1501 cgctgtctgc caatgggcat ccagtgcggc ggggctgcac agctgtgtgg gactgggcta 1561 ggacctgggt gtctgagccc cagtagaatg gggcccaggg tctctagctg ttaaatgttc 1621 agtgtatggc tttatactta agtgttatga ttactttctg ggcaacaggt aacctaggtt 1681 tgtgggtgcg cccgtgggaa 
aatctatgat ccaaaccaga aaaggaaggg atagaggctt 1741 cagggtgcca ggaggaaccc ctacacatac tgaccgtttg gccatatggg tttatttggg 1801 atgaagtttt agcccattga ccccagagga gaacccttta tctgtctttc tgcaagctgt 1861 ggcttcttgg aaacagggag actccaggtc cccaaggcca gatttgcagc ccttacagat 1921 tctgtctagt cagccaggca aattgaactg gtcagcagaa gtgtgggact gagaactcag 1981 ggggagggat cagagacagt cacccttaga cttacccctc caagaaacag atgctgagtg 2041 gggggcgggg tggcagacgt atgaatcccg tgtgcatgtt gtgtcatata tgcgtgcatg 2101 gagggagcgg gagggaagat gggcagtggg cctgtattcc atgcacttac catagggaac 2161 acactctgcc cctctagcta gaggctagaa gggcagggca agtcttccta cccaaccaat 2221 gcctgctgca catcttgtct ggtggctcct gaccacagtt ggtgctctta gacatcaaag 2281 ggtgagtttt cttttgatgg tctgaatt // LOCUS MUSADAM02 207 bp ds-DNA ROD 10-JUL-1990 DEFINITION Mouse adenosine deaminase (ADA) gene, exon 2 (non-coding). ACCESSION M34243 J04767 KEYWORDS adenosine deaminase. SEGMENT 2 of 11 SOURCE Mouse lung fibroblast cell line B-1/200 DNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 207) AUTHORS Al-Ubaidi,M.R., Ramamurthy,V., Maa,M.-C., Ingolia,D.E., Chinsky,J.M., Martin,B.D. and Kellems,R.E. TITLE Structural and functional analysis of the murine adenosine deaminase gene JOURNAL Genomics (1990) In press STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by M.R.Al-Ubaidi, 11-MAY-1990. FEATURES from to/span description pre-msg < 1 > 207 adenosine deaminase (ADA) mRNA and introns IVS < 1 78 ADA intron A IVS 141 > 207 ADA intron B BASE COUNT 39 a 58 c 54 g 56 t ORIGIN Chromosome 2; undetermined number of base pairs after segment 1. 
1 gctcctcggg ctctgtggtg gcttctgagg tgtcctctgg ctctgtggta tctcacgctc 61 tttttctgtc ccttgcaggt agagttacac gtccacctgg atggagccat caagccagaa 121 accatcttat actttggcaa gtaagtccaa ggacaaccac agaccttccc aggattgcag 181 agcgtgtaca gctcttcttg gggggcc // LOCUS MUSADAM03 382 bp ds-DNA ROD 10-JUL-1990 DEFINITION Mouse adenosine deaminase (ADA) gene, exon 3 (first expressed exon). ACCESSION M34244 J04767 KEYWORDS adenosine deaminase. SEGMENT 3 of 11 SOURCE Mouse lung fibroblast cell line B-1/200 DNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 382) AUTHORS Al-Ubaidi,M.R., Ramamurthy,V., Maa,M.-C., Ingolia,D.E., Chinsky,J.M., Martin,B.D. and Kellems,R.E. TITLE Structural and functional analysis of the murine adenosine deaminase gene JOURNAL Genomics (1990) In press STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by M.R.Al-Ubaidi, 11-MAY-1990. FEATURES from to/span description pept 235 + 299 adenosine deaminase (ADA, EC 3.5.4.4), exon 3 (first expressed exon) IVS < 1 176 ADA intron B IVS 300 > 382 ADA intron C BASE COUNT 86 a 117 c 94 g 85 t ORIGIN Chromosome 2; undetermined number of base pairs after segment 2. 1 aacacacaca tgcctgatgc cagcaaagga ggcctgaagg cattggtacc cctggaatta 61 gagttacagc tggtcatggg cctccatgtg ggtctcgtct tctgcaagaa cagccagtgt 121 gctcttaccc accaagccct ggtgcagccc ctcacccttg actttatttt taggaggaag 181 agaggcatcg ccctcccggc agatacagtg gaggagctgc gcaacattat cggcatggac 241 aagcccctct cgctcccagg cttcctggcc aagtttgact actacatgcc tgtgattgcg 301 taagttgctc cccaaccctt gtgccccaca gtagcatcca tccctataac caaggtcagg 361 cctgagctgc tgctgtacaa gg // LOCUS MUSADAM04 346 bp ds-DNA ROD 10-JUL-1990 DEFINITION Mouse adenosine deaminase (ADA) gene, exon 4. ACCESSION M34245 J04767 KEYWORDS adenosine deaminase. SEGMENT 4 of 11 SOURCE Mouse lung fibroblast cell line B-1/200 DNA. 
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 346)
  AUTHORS   Al-Ubaidi,M.R., Ramamurthy,V., Maa,M.-C., Ingolia,D.E.,
            Chinsky,J.M., Martin,B.D. and Kellems,R.E.
  TITLE     Structural and functional analysis of the murine adenosine
            deaminase gene
  JOURNAL   Genomics (1990) In press
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly
            submitted by M.R.Al-Ubaidi, 11-MAY-1990.
FEATURES        from  to/span     description
    pept     +    93  +   227     adenosine deaminase (ADA), exon 4
    IVS      <     1       92     ADA intron C
    IVS          228  >   346     ADA intron D
BASE COUNT       77 a     85 c    111 g     73 t
ORIGIN      Chromosome 2; undetermined number of base pairs after segment 3.
        1 acagttgtag ttacctcgtt ggctactaga cgtcccaagg agctgagaaa ggttgccaac
       61 ctgtgttctt cttcccttcc caggggctgc agagaggcca tcaagaggat cgcctacgag
      121 tttgtggaga tgaaggcaaa ggagggcgtg gtctatgtgg aagtgcgcta tagcccacac
      181 ctgctggcca attccaaggt ggacccaatg ccctggaacc agactgagtg agtgacatca
      241 ctggaggggg ctgtgctgag cggggctctg agctgaggat ggagtgctta gagccctggc
      301 ctggtccatg gactcagagc gactcagctc agtcctaagt gcacga
//
LOCUS       MUSADAM05     385 bp ds-DNA             ROD       10-JUL-1990
DEFINITION  Mouse adenosine deaminase (ADA) gene, exon 5.
ACCESSION   M34246 J04767
KEYWORDS    adenosine deaminase.
SEGMENT     5 of 11
SOURCE      Mouse lung fibroblast cell line B-1/200 DNA.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 385)
  AUTHORS   Al-Ubaidi,M.R., Ramamurthy,V., Maa,M.-C., Ingolia,D.E.,
            Chinsky,J.M., Martin,B.D. and Kellems,R.E.
  TITLE     Structural and functional analysis of the murine adenosine
            deaminase gene
  JOURNAL   Genomics (1990) In press
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly
            submitted by M.R.Al-Ubaidi, 11-MAY-1990.
FEATURES        from  to/span     description
    pept     +   115  +   230     adenosine deaminase (ADA), exon 5
    IVS      <     1      114     ADA intron D
    IVS          231  >   385     ADA intron E
BASE COUNT       84 a    115 c     94 g     92 t
ORIGIN      Chromosome 2; undetermined number of base pairs after segment 4.
        1 tctccatcta gaaatagaag ggcagagaga catcactaca tccctgctcc agttccatgg
       61 ctgcccatgg tcttcccttg gcctaaagtc ctccctcttc ctctctccac acagagggga
      121 cgtcacccct gatgacgttg tggatcttgt gaaccagggc ctgcaggagg ggaggcaagc
      181 atttggcatc aaggtccggt ccattctgtg ctgcatgcgc caccagccca gtgagtaccg
      241 ccgcaccctg ctggctgcct ggcctataac aaggtggacc gactatccag cgtccccacc
      301 tcgtatttct agagttttct aaaaaacacc tgtgaacttt tggtgactct ggtgagtcct
      361 taacaggaaa ttgggacttg cacag
//
LOCUS       MUSADAM06     189 bp ds-DNA             ROD       10-JUL-1990
DEFINITION  Mouse adenosine deaminase (ADA) gene, exon 6.
ACCESSION   M34247 J04767
KEYWORDS    adenosine deaminase.
SEGMENT     6 of 11
SOURCE      Mouse lung fibroblast cell line B-1/200 DNA.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 189)
  AUTHORS   Al-Ubaidi,M.R., Ramamurthy,V., Maa,M.-C., Ingolia,D.E.,
            Chinsky,J.M., Martin,B.D. and Kellems,R.E.
  TITLE     Structural and functional analysis of the murine adenosine
            deaminase gene
  JOURNAL   Genomics (1990) In press
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly
            submitted by M.R.Al-Ubaidi, 11-MAY-1990.
FEATURES        from  to/span     description
    pept     +    18  +   145     adenosine deaminase (ADA), exon 6
    IVS      <     1       17     ADA intron E
    IVS          146  >   189     ADA intron F
BASE COUNT       37 a     39 c     75 g     38 t
ORIGIN      Chromosome 2; undetermined number of base pairs after segment 5.
        1 ggcccgtgcc cctgcaggct ggtcccttga ggtgttggag ctgtgtaaga agtacaatca
       61 gaagaccgtg gtggctatgg acttggctgg ggatgagacc attgaaggaa gtagcctctt
      121 cccaggccac gtggaagcct atgaggtggg cctgagaagg ggagggtggc cctgggggag
      181 cttgggtag
//
LOCUS       MUSADAM07     307 bp ds-DNA             ROD       10-JUL-1990
DEFINITION  Mouse adenosine deaminase (ADA) gene, exons 7 and 8.
ACCESSION   M34248 J04767
KEYWORDS    adenosine deaminase.
SEGMENT     7 of 11
SOURCE      Mouse lung fibroblast cell line B-1/200 DNA.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 307)
  AUTHORS   Al-Ubaidi,M.R., Ramamurthy,V., Maa,M.-C., Ingolia,D.E.,
            Chinsky,J.M., Martin,B.D. and Kellems,R.E.
  TITLE     Structural and functional analysis of the murine adenosine
            deaminase gene
  JOURNAL   Genomics (1990) In press
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly
            submitted by M.R.Al-Ubaidi, 11-MAY-1990.
FEATURES        from  to/span     description
    pept     +    11       82     adenosine deaminase (ADA), exon 7
                 164  +   265     adenosine deaminase, exon 8
    IVS      <     1       10     ADA intron F
    IVS           83      163     ADA intron G
    IVS          266  >   307     ADA intron H
BASE COUNT       68 a     86 c     85 g     68 t
ORIGIN      Chromosome 2; undetermined number of base pairs after segment 6.
        1 tcccttccag ggcgcagtaa agaatggcat tcatcggacc gtccacgctg gcgaggtggg
       61 ctctcctgag gttgtgcgtg aggtaaggag ccagtgaccc cgggcctctt cttcctgatt
      121 ctgttcctgt ccctggactc acctcctctc tgcttctcca caggctgtgg acatcctcaa
      181 gacagagagg gtgggacatg gttatcacac catcgaggat gaagctctct acaacagact
      241 actgaaagaa aacatgcact ttgaggtgag acgccaaggc agagagagtg agctctggct
      301 accccgt
//
LOCUS       MUSADAM08     249 bp ds-DNA             ROD       10-JUL-1990
DEFINITION  Mouse adenosine deaminase (ADA) gene, exon 9.
ACCESSION   M34249 J04767
KEYWORDS    adenosine deaminase.
SEGMENT     8 of 11
SOURCE      Mouse lung fibroblast cell line B-1/200 DNA.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 249)
  AUTHORS   Al-Ubaidi,M.R., Ramamurthy,V., Maa,M.-C., Ingolia,D.E.,
            Chinsky,J.M., Martin,B.D. and Kellems,R.E.
  TITLE     Structural and functional analysis of the murine adenosine
            deaminase gene
  JOURNAL   Genomics (1990) In press
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly
            submitted by M.R.Al-Ubaidi, 11-MAY-1990.
FEATURES        from  to/span     description
    pept     +   112  +   176     adenosine deaminase (ADA), exon 9
    IVS      <     1      111     ADA intron H
    IVS          177  >   249     ADA intron I
BASE COUNT       56 a     77 c     60 g     56 t
ORIGIN      Chromosome 2; undetermined number of base pairs after segment 7.
        1 ctgaggcaat gaagcacaaa gctatccaga atagaacctc agctgggctc agccctgacc
       61 agtctggccc cggccactat gccagccagc cacacatcct gccccttgca ggtctgcccc
      121 tggtccagct acctcacagg cgcctgggat cccaaaacga cgcatgcggt tgttcggtga
      181 gatctggttc cgggacccat tttgttttga ttccggaatt cacctatagt gagtcgtata
      241 aattcgtaa
//
LOCUS       MUSADAM09     340 bp ds-DNA             ROD       10-JUL-1990
DEFINITION  Mouse adenosine deaminase (ADA) gene, exon 10.
ACCESSION   M34250 J04767
KEYWORDS    adenosine deaminase.
SEGMENT     9 of 11
SOURCE      Mouse lung fibroblast cell line B-1/200 DNA.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 340)
  AUTHORS   Al-Ubaidi,M.R., Ramamurthy,V., Maa,M.-C., Ingolia,D.E.,
            Chinsky,J.M., Martin,B.D. and Kellems,R.E.
  TITLE     Structural and functional analysis of the murine adenosine
            deaminase gene
  JOURNAL   Genomics (1990) In press
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly
            submitted by M.R.Al-Ubaidi, 11-MAY-1990.
FEATURES        from  to/span     description
    pept     +   141  +   270     adenosine deaminase (ADA), exon 10
    IVS      <     1      140     ADA intron I
    IVS          271  >   340     ADA intron J
BASE COUNT       81 a     77 c     92 g     90 t
ORIGIN      Chromosome 2; undetermined number of base pairs after segment 8.
        1 ttaatacgag aatgcaaccc tttgtgttgt ctaaggttgt ataaagatgg aagagggagg
       61 tggtggaagg gcagtgatgg ttcttggagt gaagaggctc tctctctctc tcttttcttc
      121 ctgcctggcc cctcccccag cttcaagaat gataaggcca actactcact caacacagac
      181 gaccccctca tcttcaagtc caccctagac actgactacc agatgaccaa gaaagacatg
      241 ggcttcactg aggaggagtt caagcgactg gtgagtatgt gtgagctatg agcctgacac
      301 tggcccaggt gtgtgtgtgt gtgtatatgt gtgtgtgtgt
//
LOCUS       MUSADAM10     279 bp ds-DNA             ROD       10-JUL-1990
DEFINITION  Mouse adenosine deaminase (ADA) gene, exon 11.
ACCESSION   M34251 J04767
KEYWORDS    adenosine deaminase.
SEGMENT     10 of 11
SOURCE      Mouse lung fibroblast cell line B-1/200 DNA.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 279)
  AUTHORS   Al-Ubaidi,M.R., Ramamurthy,V., Maa,M.-C., Ingolia,D.E.,
            Chinsky,J.M., Martin,B.D. and Kellems,R.E.
  TITLE     Structural and functional analysis of the murine adenosine
            deaminase gene
  JOURNAL   Genomics (1990) In press
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly
            submitted by M.R.Al-Ubaidi, 11-MAY-1990.
FEATURES        from  to/span     description
    pept     +    88      171     adenosine deaminase (ADA), exon 11
    IVS      <     1       87     ADA intron J
    IVS          188  >   279     ADA intron K
BASE COUNT       67 a     88 c     61 g     63 t
ORIGIN      Chromosome 2; undetermined number of base pairs after segment 9.
        1 ggatctgttt cccccactat gatgcccttg cccttgctaa cagggctgct tccttccttg
       61 tcctgactcc atgtttcccc cttctagaac atcaacgcag cgaagtcaag cttcctccca
      121 gaggaagaga agaaggaact tctggaacgg ctctacagag aataccaata gccaccacag
      181 actgacggta cgcttgtgca gggcgcaata accaccccac cacactgtcc tccttaactc
      241 tgtgcgattg tggcagaagt cttgggcagg agcacacct
//
LOCUS       MUSADAM11     442 bp ds-DNA             ROD       10-JUL-1990
DEFINITION  Mouse adenosine deaminase (ADA) gene, exon 12 (non-coding).
ACCESSION   M34252 J04767
KEYWORDS    adenosine deaminase.
SEGMENT     11 of 11
SOURCE      Mouse lung fibroblast cell line B-1/200 DNA.
  ORGANISM  Mus musculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 442)
  AUTHORS   Al-Ubaidi,M.R., Ramamurthy,V., Maa,M.-C., Ingolia,D.E.,
            Chinsky,J.M., Martin,B.D. and Kellems,R.E.
  TITLE     Structural and functional analysis of the murine adenosine
            deaminase gene
  JOURNAL   Genomics (1990) In press
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly
            submitted by M.R.Al-Ubaidi, 11-MAY-1990.
FEATURES        from  to/span     description
    pre-msg  <     1  >   287     adenosine deaminase (ADA) mRNA and introns
    IVS      <     1       69     ADA intron K
    signal       282      287     poly-A signal
BASE COUNT       97 a    111 c    114 g    120 t
ORIGIN      Chromosome 2; undetermined number of base pairs after segment 10.
        1 ttctgtgctt ctaccatgcc ttacatgtca tgagacctga cctttctatt tctctgactt
       61 gaccagcagg gcgggtcccc tgaagatggc aaggccactt ctctgagcct catcctgtgg
      121 ataaagtctt tacaactctg acatattgac cttcattcct tccagacctt ggagaggcca
      181 ggtctgtcct ctgattggat atcctggcta ggtcccaggg gacttgacaa tcatgcacat
      241 gaattgaaaa ccttccttct aaagctaaaa ttatggtgtt caataaagca gctggtgact
      301 ggtatcttgc agcacatggt gaatacggtc tcggggctgc tggctaggat gctaagaaag
      361 gaggagcctg ggccctacgc tgagtgtcag gtctggggag ctagggtctc ttccgcaggt
      421 cgactctaga gatccccggg ct
//
LOCUS       TRBMVAT5A    1664 bp ss-mRNA            INV       10-JUL-1990
DEFINITION  T.brucei MVAT5-like variant surface glycoprotein mRNA,
            complete cds.
ACCESSION   M33825
KEYWORDS    variant surface glycoprotein.
SOURCE      Trypanosoma brucei rhodesiense, cell line WRATat1, cDNA to mRNA.
  ORGANISM  Trypanosoma brucei
            Eukaryota; Animalia; Protozoa; Sarcomastigophora; Mastigophora;
            Kinetoplastida; Trypanosomatina; Trypanosomatidae.
REFERENCE   1  (bases 1 to 1664)
  AUTHORS   Reddy,L.V., Hall,T. and Donelson,J.E.
  TITLE     Sequences of three VSG mRNAs expressed in a mixed population of
            Trypanosoma brucei rhodesiense
  JOURNAL   Biochem. Biophys. Res. Commun. 169, 730-736 (1990)
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly
            submitted by J.E.Donelson, 23-APR-1990.
FEATURES        from  to/span     description
    pept          43     1596     MVAT5-like variant surface glycoprotein
BASE COUNT      533 a    435 c    395 g    301 t
ORIGIN
        1 tttctgtact atattgcaga agcaacactg agaactccac agatgatagg aaaagccttt
       61 attattttat ctttacttaa cgagctgcca acgccgacgg cagcacaagc ggcacagggt
      121 ggtgccctcg gaaaagacgt atggctacct ctcgctaaat tcacggcgac ggccgcgaaa
      181 atcccaggca gggcggcaaa gctgcttcaa gacaggtcgg cccaaatagt taaccttatg
      241 aaactccaag ttcaggcaga catatgcctc aacaaagcag cgtcagaggt gagcgcactt
      301 gggtggcagg cgctcgctgt tgcaatagca gcagacatcg gcagcctgca aagcttgcaa
      361 cagcagagga gtgaagaggc aatagcggcc gcggcagctg ccgaattcgc tcggggccac
      421 gcagcggaat tcttcaaagt agctgcggca gtccaaagcg ccgccaatag cggctgcctg
      481 acaacaaaca ataaaggtgg cgcagccggc agcgtgataa acggattctc gacactcggc
      541 accgcggagc agccagcaat cggcgctaca tcgacggctc acgtcggcga cgacataacg
      601 gcgataacaa caacagggtt cagcgaccta gcagcaacag acggcatacg caccgactca
      661 ctaacagcgg acacaaactg cgttcttttc aagggaggca gcgatggacc actaacgaca
      721 gcaaacttcg gccagtcgat ccctttcgca ggcggctatc taacaaggaa cccgacagcc
      781 aacacagcca gcagcgccga cggtacggac tttgtaagca accccgaaga cagcaagata
      841 gcaggcataa aagtctacag ggacgcccac gccgccgcag cgaaaatacg cacagcggca
      901 accttcggct cgaccttcac cgacttcaag aagctggacc aggctaagaa gtcagtccat
      961 ttgcgcgcag cagtaaagaa cataattctc ggcaaacctg acggatccgt agacgacctt
     1021 tccggcgaaa tagacacaaa gataaaccag
gtattcggcg aggaccaaga aacattccac
     1081 agcaggtttt gggatcaact aacaaaagta aaagtggaaa aggcggcgag tggacaagaa
     1141 gaaacgacac tcgatgcaat cacttctttt gcagccttaa gccgagctcg gacttattac
     1201 tccacgaaag tgatcaaagg tttgagagat aagatatcct cactagaaat taaaaattcc
     1261 aaaacggaag ttaaagtcac tgacgccgac tgcaacaaac accaatcaaa agacaaatgc
     1321 gcagccccat gcaaatggaa cgagaatacc actgacataa acaaaaaatg ctcattagat
     1381 cccgtaaaag cgacagaaca gcaagcagcc cagacagcag gagcaggaga aggagctgca
     1441 ggaacaacaa cagataaatg caaagataag aaaaaggatg actgcaaatc tccggactgc
     1501 aaatgggagg gtgaaacttg caaagattcc tctattctcc taaacaaaca attcgcccta
     1561 atggtttctg cagcctttgt ggccttgctt ttttaatttt ttccccctct ttttcttaaa
     1621 gaatttttgc tactttaaaa acttctgata tattttaaca ccta
//
LOCUS       TRBWRATATA   1544 bp ss-mRNA            INV       10-JUL-1990
DEFINITION  T.brucei WRATat A variant surface glycoprotein mRNA, complete
            cds.
ACCESSION   M33823
KEYWORDS    variant surface glycoprotein.
SOURCE      Trypanosoma brucei rhodesiense, cell line WRATat1, cDNA to mRNA.
  ORGANISM  Trypanosoma brucei
            Eukaryota; Animalia; Protozoa; Sarcomastigophora; Mastigophora;
            Kinetoplastida; Trypanosomatina; Trypanosomatidae.
REFERENCE   1  (bases 1 to 1544)
  AUTHORS   Reddy,L.V., Hall,T. and Donelson,J.E.
  TITLE     Sequences of three VSG mRNAs expressed in a mixed population of
            Trypanosoma brucei rhodesiense
  JOURNAL   Biochem. Biophys. Res. Commun. 169, 730-736 (1990)
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly
            submitted by J.E.Donelson, 23-APR-1990.
FEATURES        from  to/span     description
    pept          42     1457     WRATat A variant surface glycoprotein
    mRNA     <     1     1544     WRATat A mRNA
BASE COUNT      545 a    387 c    345 g    267 t
ORIGIN
        1 gaacagtttc tgtactatat tgcggacaaa tctagaaggc catgtccgtt ctgtttctgc
       61 tcctagcaat aacacgaaca gcctcggtga aagcagcgga aggagaccag gcggctgatt
      121 ttttgccttt atgcgaagcc tggcaggcaa ctaaagcgct agcaaatgcg gcgtataaac
      181 tcccgccgtt tccaccagat ctgacagaca tactaaactt taacataact gtggctcccg
      241 aggaatggaa agcaatcttt acagatggcg gatctgacaa cacatgggaa agattcgccg
      301 aaggacacaa gaatactcta aatggcggca actggaaaac aagatgggaa catatcaagc
      361 aagcaaggca agatacaaaa gaagcttcgt caccgtggaa cgcgttaaac agcaaattaa
      421 taaacacagc cacagtcaat accaccagag cctacatagc aagcatagca gacgaagcct
      481 tcgacctata ccaggggaca cagacacccc tacaaacacc caaagccttg gaagccgcca
      541 gcctagcaga agcagcgaaa gcaatacttt gctcagaccc cctaaagcca acagccgacg
      601 ggcaggcatg cacagatata acagcgacgc caagcaaagc ggcaacatgc ccaactggac
      661 gaagcagcaa gggaggggcg ccaataggac tagatacggt ctgtctctgc tcaacaaaca
      721 aaccaagtat gcatagcaga cgacgaaaag cggcagcagt gatgaccgac ggacaactaa
      781 aagacggcat cctcaagaaa ttattagcgg cgtgcccaaa aaagccaacc ctaaatgaac
      841 cagcagcagc cgcccgccac gcagtaacgg tactcgcaac acggctagct caaaaagttg
      901 cgcgcgccga agaaggccaa ataattctcg gaaccagagc cgaaacggac tgcgctagtt
      961 cgggatcagc ctgtgtagaa tatactaact ttttcaaaga tggcgatggc ttagcagctg
     1021 ttccctgggt gaagaagctg ctggcggcgg cagattttta cgacacaatc gaaaagcgca
     1081 aagaaagcga caaaaacgcc gcgacagcaa tagcagccct caaatctgct ttaatcaggg
     1141 aatttagaag accaggacaa gaacaaacac tggcaacaac aggaactaaa agcagcagcc
     1201 cccaaagcac ccaacaaaaa gcatccgaag ccgaagcaaa ttgcaatgac aaagccaaag
     1261 aaactgaatg caactcccca tgcaaatggg ataaggaaga aaaggatgag aaaaaaaggt
     1321 gcaagctgag tgaggaaggc aaacaagcag aaaaagaaaa ccaagaaggg aaagatggga
     1381 aagcaaacac cacaggaagc agcaattctt ttgtcattaa aacttcccct cttttgcttg
     1441 cagttttgct tctttaatcc ctccccctcc ctttaaaatt tttgataaaa atttttgcta
     1501 cttgaaaaac tttctcatat attttaacac ctaaaagttt cccg
//
LOCUS       TRBWRATATB   1585 bp ss-mRNA
INV       10-JUL-1990
DEFINITION  T.brucei WRATat B variant surface glycoprotein mRNA, complete
            cds.
ACCESSION   M33824
KEYWORDS    variant surface glycoprotein.
SOURCE      Trypanosoma brucei rhodesiense, cell line WRATat1, cDNA to mRNA.
  ORGANISM  Trypanosoma brucei
            Eukaryota; Animalia; Protozoa; Sarcomastigophora; Mastigophora;
            Kinetoplastida; Trypanosomatina; Trypanosomatidae.
REFERENCE   1  (bases 1 to 1585)
  AUTHORS   Reddy,L.V., Hall,T. and Donelson,J.E.
  TITLE     Sequences of three VSG mRNAs expressed in a mixed population of
            Trypanosoma brucei rhodesiense
  JOURNAL   Biochem. Biophys. Res. Commun. 169, 730-736 (1990)
  STANDARD  full staff_entry
COMMENT     Draft entry and computer-readable sequence for [1] kindly
            submitted by J.E.Donelson, 23-APR-1990.
FEATURES        from  to/span     description
    pept          41     1504     WRATat B variant surface glycoprotein
BASE COUNT      535 a    384 c    376 g    290 t
ORIGIN
        1 aacagtttct gtactatatt gcagtttcgc gttcagctta atgtggataa tcttggcact
       61 gctaacttta gctgggtccc gcgtcgccca tggggcaggt aagaatgtca acggcgttga
      121 gttcaacctt ttttgtcaca tagctaacat gctaaacgcg gaaaagatcg aagacgacaa
      181 aactgatggc ctagaccgcc aagctgccga ggcatggacg gcaatcgaca gcatatttac
      241 agtaacagcc aacgaaagct actacagtga aggaccagcc agcgcagcaa atacgaccga
      301 cgaaaaccag gatgccaagc cggaacgggt agcaaaatgg gtgcagaaac gcaaccaaat
      361 agacaaaatc gcagctcctg gtaatgagaa aaacggaaaa tacgcgcgac gaccaaggga
      421 cagaatgtca gcagcaacag gagcgaaact cgatacggtt ttcacactcg cttcggaggc
      481 acgagtccga ctaatgcaga tagacacaga gatagcaaca aataaacaag aaatcaggca
      541 gcagctagga ctgcattgct cggaggggca aggcaagggt cagagcagaa accagcatcc
      601 ggataatgcc gcattcgcaa gcgactactc aactgcgtgc aaaggatcga caggaccagg
      661 aaaaagtctt gcgaacgacc tagtatgtat ctgcagcact gacaccagcc aagcccaaag
      721 cacactacag atgtgcacga gcatcgacga tgcgaacagc ttattcagta ccctacacaa
      781 acgaagccaa tgccaaggcg attttccttg ccctcatcgg gtttgtgcta agacagccga
      841 aacaagcgag ctgacggaaa ccaacataaa caactgtgta acggctttta cagcgacact
      901 gggcagacat acaaagagtt cggccacaaa tgaaggggcc tatgtctttg ggagcggaca
      961 gaacagcggc
gacgagtgca acgggggagc agcaacaggg caatcctgtg tcagctatca
     1021 cgacctcata acagctaaat ccggtacgac actaagcggc gcaatcactc ggctaaagca
     1081 actacaaatc gccaaagcaa agctaaaagc aagacggcta ctgctgcaaa acagggaacg
     1141 gcagcaaacg cgacttatgg cgctagcaga caagatgcaa gaattgtacc aagaggcctt
     1201 acatgacgag gttcaactca ggaaggaagc gcagaacaaa cctcaagaaa caccagattc
     1261 tgacaagcaa aaagcatgcg agaaatatca caacaagtca aaggaatgca aagaaaatgg
     1321 ttgccaatgg agtggaactg aagaaaccat aggaaagtgc gaagctaaac ccaaagcagg
     1381 aacagaagcc gcaacaacgg gaccaggaga gagagatgca ggagccactg caaacaccac
     1441 aggaagcagc aattcttttg tcattaaaac ttcccctctt ttgtttgcat ttttgctttt
     1501 ttaatttttc ccctcaaatt tccccctctt ttttaaaatt tttctttcta cttggaaact
     1561 tctggtatat tttaacacct ttaaa
//