Path: utzoo!attcan!uunet!bionet!root From: GenBank-Updates@genbank.bio.net Newsgroups: bionet.molbio.genbank.updates Subject: Database Update Message-ID: Date: 1 Aug 90 12:00:38 GMT Sender: root@genbank.BIO.NET Distribution: bionet Lines: 5349 Approved: lear@genbank.bio.net Checksum: 17446 305 LOCUS RATGGLUT 1060 bp ds-DNA ROD 01-AUG-1990 DEFINITION Rat gamma-glutamyltransferase gene, 5' promoter region. ACCESSION J05515 KEYWORDS gamma-glutamyltransferase. SOURCE Rat DNA. ORGANISM Rattus norvegicus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 1060) AUTHORS Rajagopalan,S., Park,J.-H., Patel,P.D., Lebovitz,R.M. and Lieberman,M.W. TITLE Cloning and analysis of the rat gamma-glutamyltransferase gene JOURNAL J. Biol. Chem. 265, 11721-11725 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by S.Rajagopalan, 25-MAY-1990. FEATURES from to/span description mRNA 834 > 1060 gamm-glutamyltransferase mRNA BASE COUNT 235 a 277 c 275 g 273 t ORIGIN 1 cagctgcctt ctggaggacc aaactgttca ggggaaggac aggaagaaat gagcctgtgc 61 cttcaggtca gagtcatgcc tagatctggg cgggagagct acaagggata ctgaccagga 121 gatagggtgt tgtcccctcc cccctggggt ttggtatcct cctctgcctt aagagttgca 181 aatcgacttt cccacataac aggcaccaaa tccagttagg accaacccca ccttccaatc 241 caggggagag gaatgtcagc aatgcgtggg cgtgtccttc taatgtgttt tccttgagtg 301 ttgtatgtgg accatctgca tgctcggtac ccagaggcca tcaggtctct tggaacagga 361 attgttgatg tgaaatgcca tgtggttgct gggataggaa ctcaggactc cggaagaacc 421 ttctcttctc cagtccccct ctgttgtttt tttttttttt ttttttgaga tacgatctca 481 cactgtagca caggctaatc cagaactcac taggtaggtc agactgggct caaatcacag 541 cgattctgct tctgcttcct gagtgccagg gtttgcaggt gttagctatc atgcccagtc 601 ttaacatttc acacacgcca gtccaagtta ttaaaaaaca acccggcagt tgagggcagg 661 gccctcaagt cccacaactg gtgcgtgcgt accaagtcca atgcgggaaa ggcctggacc 721 cttgaaccct ttgggcggtt cacttgttag ctcttactac caaatcctgg gcttacacat 781 gaatgccagc ccctccctgc ccagttctgt gacccccttc cccgggcagc tcttgggaga 841 agtcatgcat acatggaggc ggtgccagcc tctttgactc cagagttcag cgggagacag 901 agggagctca tcacatcagg caccccagaa gagttctggg cctgcttcac gtttaacttt 961 gtgattttca ggagtaccag cctgctctaa cggtttcagg gaagattggc tgtgggtttc 1021 cgcagagtgt gggggagttc ctgcttatcc atacagctga // LOCUS ACMGAG 167 bp ss-RNA VRL 01-AUG-1990 DEFINITION Avian myelocytomatosis retrovirus gag gene, partial cds. ACCESSION M35626 KEYWORDS gag protein; oncogene. SOURCE Avian myelocytomatosis retrovirus (mutant MC29-10H) RNA. ORGANISM Avian myelocytomatosis retrovirus Viridae; ss-RNA enveloped viruses; Positive strand RNA virus; Retroviridae; Oncovirinae; Type C oncovirus group; Avian myelocytomatosis viruses. REFERENCE 1 (bases 1 to 167) AUTHORS Bister,K., Trachmann,C., Jansen,H.W., Schroeer,B. and Patschinsky,T. TITLE Structure of mutant and wild-type MC29 v-myc alleles and biochemical properties of their protein products JOURNAL Oncogene 1, 97-109 (1987) STANDARD simple staff_review FEATURES from to/span description pept < 1 > 167 gag protein (AA at 1) BASE COUNT 38 a 47 c 52 g 30 t ORIGIN 1 ggggaggagc ttgcgagtac aggtccgccc gtggtggcca tgcctgtagt gattaacaca 61 gagggacccg cctggacccc tctggagcca aaattgatca caagactggc tgatacggtc 121 aggaccaagg gcttacgatc cccgattact atagcggcgg ccactcg // LOCUS ACMVMYC 333 bp ss-RNA VRL 01-AUG-1990 DEFINITION Avian myelocytomatosis retrovirus v-myc gene, partial cds. ACCESSION M35624 KEYWORDS oncogene; v-myc protein. SOURCE Avian myelocytomatosis retrovirus (mutant MC29-10A) RNA. ORGANISM Avian myelocytomatosis retrovirus Viridae; ss-RNA enveloped viruses; Positive strand RNA virus; Retroviridae; Oncovirinae; Type C oncovirus group; Avian myelocytomatosis viruses. REFERENCE 1 (bases 1 to 333) AUTHORS Bister,K., Trachmann,C., Jansen,H.W., Schroeer,B. and Patschinsky,T. TITLE Structure of mutant and wild-type MC29 v-myc alleles and biochemical properties of their protein products JOURNAL Oncogene 1, 97-109 (1987) STANDARD simple staff_review FEATURES from to/span description pept < 1 > 333 v-myc protein (AA at 1) BASE COUNT 72 a 130 c 90 g 41 t ORIGIN 1 ggcctctacc tgcacgacct gggagccgcg gccgccgact gcatcgaccc ctcggtggtc 61 ttcccctacc cgctcagcga gcgcgccccg cgggccgccc cgcccggcgc caaccccgcg 121 gctctgctgg gggtcgacac gccgcccacg atccaccaac acaactacgc tgctcctccc 181 tccaccaagg tggaataccc agccgccaag aggctaaagt tggacagtgg cagggtcctc 241 aaacagatca gcaacaaccg aaaatgctcc agtccccgca cgttagactc agaggagaac 301 gacaagaggc gaacgcacaa cgtcttggag cgc // LOCUS ACMVMYCA 202 bp ss-RNA VRL 01-AUG-1990 DEFINITION Avian myelocytomatosis retrovirus v-myc gene, partial cds. ACCESSION M35625 KEYWORDS oncogene; v-myc protein. SOURCE Avian myelocytomatosis retrovirus (mutants MC29-10C and 10H) RNA. ORGANISM Avian myelocytomatosis retrovirus Viridae; ss-RNA enveloped viruses; Positive strand RNA virus; Retroviridae; Oncovirinae; Type C oncovirus group; Avian myelocytomatosis viruses. REFERENCE 1 (bases 1 to 202) AUTHORS Bister,K., Trachmann,C., Jansen,H.W., Schroeer,B. and Patschinsky,T. TITLE Structure of mutant and wild-type MC29 v-myc alleles and biochemical properties of their protein products JOURNAL Oncogene 1, 97-109 (1987) STANDARD simple staff_review FEATURES from to/span description pept < 1 > 202 v-myc protein (AA at 1) BASE COUNT 34 a 77 c 62 g 29 t ORIGIN 1 ggcctctacc tgcacgacct gggagccgcg gccgccgact gcatcgaccc ctcggtcgtc 61 ttcccctacc cgctcagcga gcgcgccccg cgggccgccc cgcccgacga caagaggcga 121 acgcacaacg tcttggagcg ccagcgaagg aatgagctga agctgcgttt ctttgccctg 181 cgtgaccaga tacccgaggt gg // LOCUS HUM3BHSD 1565 bp ss-mRNA PRI 01-AUG-1990 DEFINITION Human placental 3-beta-hydroxysteroid dehydrogenase/5-4-isomerase mRNA, complete cds. ACCESSION M35493 KEYWORDS 3-beta-hydroxysteroid dehydrogenase/5-4-isomerase. SOURCE Human placenta, cDNA to mRNA, clone H3-beta-hp6. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 1565) AUTHORS Lorence,M.C., Murry,B.A., Trant,J.M. and Mason,J.I. TITLE Human 3-beta-hydroxysteroid dehydrogenase/delta-5->4isomerase from placenta: Expression in nonsteroidogenic cells of a protein that catalyzes the dehydrogenation/isomerization of C21 and C19 steroids JOURNAL Endocrinology 126, 2493-2498 (1990) STANDARD simple staff_review FEATURES from to/span description pept 31 1152 3-beta-hydroxysteroid dehydrogenase/5-4-isomerase mRNA < 1 1565 3-beta-hydroxysteroid dehydrogenase/5-4-isomerase BASE COUNT 417 a 381 c 376 g 391 t ORIGIN 1 gcggagtgat tcctgctact ttggatggcc atgacgggct ggagctgcct tgtgacagga 61 gcaggagggt ttctgggaca gaggatcatc cgcctcttgg tgaaggagaa ggagctgaag 121 gagatcaggg tcttggacaa ggccttcgga ccagaattga gagaggaatt ttctaaactc 181 cagaacaaga ccaagctgac agtgctggaa ggagacattc tggatgagcc attcctgaag 241 agagcctgcc aggacgtctc ggtcatcatc cacaccgcct gtatcattga tgtcttcggt 301 gtcactcaca gagagtctat catgaatgtc aatgtgaaag gtacccagct cctgttagag 361 gcctgtgtcc aagctagtgt gccagtcttc atctacacca gtagcataga ggtagccggg 421 cccaactcct acaaggaaat catccagaat ggccatgaag aagagcctct ggaaaacaca 481 tggcccgctc catacccaca cagcaaaaag cttgctgaga aggctgtact ggcggctaac 541 gggtggaatc tgaaaaacgg cggcaccctg tacacttgtg ccttacgacc catgtatatc 601 tatggggaag gaagccgatt cctttctgct agtataaacg aggccctgaa caacaatggg 661 atcctgtcaa gtgttggaaa gttctccact gttaacccag tctatgttgg caatgtggcc 721 tgggcccaca ttctggcctt gagggccctg caggacccca agaaggcccc aagcatccga 781 ggacagttct actatatctc agatgacacg cctcaccaaa gctatgataa ccttaattac 841 accctgagca aagagttcgg cctccgcctt gattccagat ggagctttcc tttatccctg 901 atgtattgga ttggcttcct gctggaaata gtgagcttcc tactcaggcc aatttacacc 961 tatcgaccgc ccttcaaccg ccacatagtc acattgtcaa atagcgtatt caccttctct 1021 tataagaagg ctcagcgaga tctggcgtat aagccactct acagctggga ggaagccaag 1081 cagaaaacgg tggagtgggt tggttccctt gtggaccggc acaaggagac cctgaagtcc 1141 aagactcagt gatttaagga tgacagagat gtgcatgtgg gtattgttag gagatgtcat 1201 caagctccac cctcctggcc tcatacagaa agtgacaagg gcacaagctc aggtcctgct 1261 gcctcccttt catacaatgg ccaacttatt gtattcctca tgtcatcaaa acctgcgcag 1321 tcattggccc aacaagaagg tttctgtcct aatcatatac cagaggaaag accatgtggt 1381 ttgctgttac caaatctcag tagctgattc tgaacaattt agggactctt ttaacttgag 1441 ggtcgttttg actactagag ctccatttct actcttaaat gagaaaggat ttcctttctt 1501 tttaatcttc cattccttca catagtttga taaaaagatc aataaatgtt tgaatgttta 1561 atgtg // LOCUS HUMMHB7B 1089 bp ss-mRNA PRI 01-AUG-1990 DEFINITION Human class I HLA-B7 mRNA, complete cds. ACCESSION M35444 KEYWORDS cell surface antigen; cell surface glycoprotein; class I gene; integral membrane protein; major histocompatibility complex. SOURCE Human, cDNA to mRNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 1089) AUTHORS Parham,P., Benjamin,R.J., Chen,B.P., Clayberger,C., Ennis,P.D., Krensky,A.M., Lawlor,D.A., Littman,D.R., Norment,A.M., Orr,H.T., Salter,R.D. and Zemmour,J. TITLE Diversity of class I HLA molecules: Functional and evolutionary interactions with T cells JOURNAL Cold Spring Harb. Symp. Quant. Biol. 54, 529-543 (1989) STANDARD simple staff_review FEATURES from to/span description pept 1 1089 MHC HLA-B7 /hgml_locus_uid="LX0031C" /nomgen="HLA-A" /map="6p21.3" BASE COUNT 218 a 335 c 363 g 173 t ORIGIN 1 atgctggtca tggcgccccg aaccgtcctc ctgctgctct cggcggccct ggccctgacc 61 gagacctggg ccggctccca ctccatgagg tatttctaca cctccgtgtc ccggcccggc 121 cgcggggagc cccgcttcat ctcagtgggc tacgtggacg acacccagtt cgtgaggttc 181 gacagcgacg ccgcgagtcc gagagaggag ccgcgggcgc cgtggataga gcaggagggg 241 ccggagtatt gggaccggaa cacacagatc tacaaggccc aggcacagac tgaccgagag 301 agcctgcgga acctgcgcgg ctactacaac cagagcgagg ccgggtctca caccctccag 361 agcatgtacg gctgcgacgt ggggccggac gggcgcctcc tccgcgggca tgaccagtac 421 gcctacgacg gcaaggatta catcgccctg aacgaggacc tgcgctcctg gaccgccgcg 481 gacaccgcgg ctcagatcac ccagcgcaag tgggaggcgg cccgtgaggc ggagcagcgg 541 agagcctacc tggagggcga gtgcgtggag tggctccgca gatacctgga gaacgggaag 601 gacaagctgg agcgcgctga ccccccaaag acacacgtga cccaccaccc catctctgac 661 catgaggcca ccctgaggtg ctgggccctg ggtttctacc ctgcggagat cacactgacc 721 tggcagcggg atggcgagga ccaaactcag gacactgagc ttgtggagac cagaccagca 781 ggagatagaa ccttccagaa gtgggcagct gtggtggtgc cttctggaga agagcagaga 841 tacacatgcc atgtacagca tgaggggctg ccgaagcccc tcaccctgag atgggagccg 901 tcttcccagt ccaccgtccc catcgtgggc attgttgctg gcctggctgt cctagcagtt 961 gtggtcatcg gagctgtggt cgctgctgtg atgtgtagga ggaagagttc aggtggaaaa 1021 ggagggagct actctcaggc tgcgtgcagc gacagtgccc agggctctga tgtgtctctc 1081 acagcttga // LOCUS MUSMUPE 872 bp ss-mRNA ROD 01-AUG-1990 DEFINITION Mouse major urinary protein mRNA, complete cds. ACCESSION M28649 KEYWORDS major urinary protein. SOURCE Mouse liver, cDNA to mRNA, clones 8-1 and 13-1. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 872) AUTHORS Bennett,A.L., Paulson,K.E., Miller,R.E. and Darnell,J.E.Jr. TITLE Aquisition of antigens characteristic of adult pericentral hepatocytes by differentiating fetal hepatoblasts in vitro JOURNAL J. Cell Biol. 105, 1073-1085 (1987) STANDARD simple staff_review FEATURES from to/span description pept 65 601 major urinary protein mRNA 43 872 major urinary protein BASE COUNT 266 a 188 c 170 g 248 t ORIGIN 1 gccacgatca caagaaagat gtggtcctga cagacagaca atcctattcc ctaccaaaat 61 gaagatgctg ctgctgctgt gtttgggact gaccctagtc tgtgtccatg cagaagaagc 121 tagttctacg ggaaggaact ttaatgtaga aaagattaat ggggaatggc atactattat 181 cctggccttt gacaaaagag aaaagataga agataatggc aactttagac tttttctgga 241 gcaaatccat gtcttggaga attccttagt tcttaaattc catactgtaa gagatgaaga 301 gtgctcggaa ttatctatgg ttgctgacaa aacagaaaag gctggtgaat attctgtgac 361 gtatgatgga ttcaatacat ttactatacc taagacagac tatgataact ttcttatggc 421 tcatctcatt aacgaaaatg atggggaaac cttccagctg atggggctct atggccgaga 481 accagatttg agttcagaca tcaaggaaag gtttgcacaa ctatgtgaga agcatggaat 541 ccttagagaa aatatcattg acctatccaa tgccaatcgc tgcctccagg cccgagaatg 601 aagaatggcc tgagcctcca gtgttgagtg gagacttctc accaggactc caccatcatc 661 ccttcctatc catacagcat ccccagtata aattctgtga tctgcattcc atcctgtctc 721 actgagaagt ccaattccag tctatccaca tgttacctag gatacctcat caagaatcaa 781 agacttcttt aaatttttct ttgatatacc catgacaatt tttcatgaat ttcttcctct 841 tcctgttcaa taaatgatta cccttgcact ta // LOCUS RATMHREC 1552 bp ss-mRNA ROD 01-AUG-1990 DEFINITION Rat MHC class I IgG Fc region receptor large subunit p51 (FcRn) mRNA, complete cds. ACCESSION M35495 KEYWORDS IgG Fc region receptor large subunit p51; cell surface antigen; cell surface glycoprotein; class I gene; integral membrane protein; major histocompatibility complex. SOURCE Rat 11 day old epithelium, cDNA to mRNA. ORGANISM Rattus norvegicus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 1552) AUTHORS Simister,N.E. and Mostov,K.E. TITLE Cloning and expression of the neonatal rat intestinal FC receptor, a major histocompatibilty complex class I antigen homolog JOURNAL Cold Spring Harb. Symp. Quant. Biol. 54, 571-580 (1989) STANDARD simple staff_review FEATURES from to/span description pept 205 1305 IgG Fc region receptor large subunit p51 (FcRn) precursor sigp 205 270 IgG Fc region receptor large subunit p51 signal peptide matp 271 1302 IgG Fc region receptor large subunit p51 mRNA < 1 1552 FcRn mRNA BASE COUNT 312 a 420 c 443 g 377 t ORIGIN 1 tcagttctgt aattaattaa ctaacgtgga tcaaatgaga aggtgaaagt tcacacagga 61 gcactcctgt cgtcttggac tgggtctcca tcccaccatc cagtgccctg gtctacgaag 121 agtccacagg gaccttgtga agaatcaaca aggcggggtc cagaggagtc acgtgtgcct 181 tccactccgg gtcgccctgt caggatgggg atgtcccagc ccggggtcct cctcagcctc 241 ttattggtcc tcctgcctca gacctgggga gcggagcccc gtctcccact gatgtatcat 301 cttgcagctg tgtctgactt atcaacgggg cttccctctt tctgggccac gggctggctg 361 ggtgctcagc aatatctgac ctacaacaac ctgcggcagg aggctgaccc ctgtggggcc 421 tggatatggg aaaaccaggt gtcttggtat tgggagaagg agaccacgga tctgaaaagc 481 aaagaacagc tcttcttgga ggccatcagg accctggaga accaaataaa tgggaccttc 541 acactgcagg gcctgctggg ctgtgaactg gcccctgata attcttcatt gcccacggct 601 gtgtttgccc tcaatggtga ggagttcatg cggttcaacc caagaacggg caactggagt 661 ggggagtggc cggagacaga tatcgttggt aatctgtgga tgaagcaacc tgaggcggcc 721 aggaaggaga gcgagttcct gctaacttct tgtcctgagc ggctgctagg ccacctggag 781 aggggccgtc agaacctgga gtggaaggag ccgccatcta tgcgcctgaa ggcccgtcct 841 ggcaactctg gctcctcagt actgacctgt gctgctttct ccttctaccc gccggagctc 901 aagtttcgat tcctgcgcaa tgggctagcc tcaggctctg ggaattgcag cactggtccc 961 aatggtgatg gatctttcca tgcatggtca ttgctagagg tcaaacgtgg agatgaacac 1021 cattaccaat gtcaagtgga gcatgagggg ctggcccagc ctctcactgt ggacctagat 1081 tcgcccgcca gatcttctgt gcctgtggtc ggaatcattc ttggtttatt gctggtggta 1141 gtggccatcg cagggggtgt gctgctatgg aacaggatgc gaagtgggct gccagcccca 1201 tggctttctc tcagtggtga tgactctggc gacctattgc ctggtgggaa cttgcccccg 1261 gaggctgaac ctcaaggtgt aaatgccttt ccggccactt cctgatgcca acccaggccc 1321 catacccatt gcagcctgtg gggctgtgtg acctcctgaa ctgtctctga gcctcccgag 1381 ggagccctgg gctggatgtc ctcctcgtgg atcccttctt ttgtggcctg cttcagtttc 1441 ccctcttaat gtcaatggct atttccatct ccacataaat ttgggcccaa atctgtgtgt 1501 gcatcgttat tctcaggttt caggcagccg gaataaattg aacaagtttg ag // LOCUS YSCATP10 2343 bp ds-DNA PLN 01-AUG-1990 DEFINITION S.cerevisiae ATP10 (essential for mitochondrial ATPase complex assembly) gene, complete cds. ACCESSION J05463 KEYWORDS . SOURCE S.cerevisiae DNA. ORGANISM Saccharomyces cerevisiae Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes; Endomycetales; Saccharomycetaceae. REFERENCE 1 (bases 1 to 2343) AUTHORS Ackerman,S.H. and Tzagoloff,A. TITLE ATP10, a yeast nuclear gene required for the assembly of the mitochondrial F1-Fo complex JOURNAL J. Biol. Chem. 265, 9952-9959 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by A.Tzagoloff, 19-JUN-1990. The gene sequence submitted codes for a protein that is essential for the biosynthesis of the F1-F0 ATPase complex of the mitochondrial inner membrane. Bases 1 to 977 are shown on the complementary strand as shown in Fig. 6. FEATURES from to/span description pept 629 < 1 (c) ORF pept 976 680 (c) ORF pept 1444 2238 ATP10 protein BASE COUNT 692 a 447 c 490 g 714 t ORIGIN 1 agatcttttg gctcaggtat aaattcgaac gtctcgattt cccttatcag tttatggaat 61 ggcttaaacc aagatgaaga tttccctagc tctaattgaa cttgcaccat atatataact 121 tttccaaaag agtaaaaata caaatccaga ttatcgattt tattaaattc ttgccaatga 181 ctattgaacg taggtgggag tcgggcatta cttcttgtca caaacgctac tgtctttgcc 241 gtatgatttt tcagacattc aggctttctg ggaaacttgt caaattgaaa gctataatta 301 tatgaacctg gtttaacttt gaacggcttg gaggagccat caagagcatt ccatacatta 361 tctggaggga aaactctctg ttcgaatttc attaatgtat gaaaggattt gttgtcttgg 421 cccggcatca tcatgccatt ctgttgaaac atgtactctt gatcaatttt tgttaaggtc 481 tctgagaatc cttttagaat gacggaaatt ttccttatag atagcgcttt tgttaactga 541 agactaacta tccctgacat ttgatcatta gagctataaa actccccgtt gtacggtggg 601 tttaaggata ttgaaatttt tggagccatg gtttgacaaa ctgtatggtt ctcaaccttc 661 tctaatcaaa agcagaatct taaatataaa cactcacaga atatccgttg gtcaatgaag 721 taattctcct ttgtactggc tgctttttct cctctagttt atgtaattct acttttggat 781 gggtgcgact gcttttaatt gattgagtgg cggtgttaga agggctgtag agtcgaaggc 841 ttgtttctct cttacgcacc tcttgtgaaa agggcgtgca ccttccccag gaccctctct 901 caccctcaac ccgcattttg ctgagaattt tcaccaaggc cctaggtgat attagattcc 961 acctgactaa ttgcattaca gccgacccaa ggcaatatca gtttaataaa atatcatgta 1021 tctcaccctc ttcttggtat tagtaaagag acgcctgatc ttgtaacagt ggtgaagatt 1081 gtactagagc agaatcaaga atttaaaagt gtaaggcagg cagaggcgat gtacataaac 1141 ttcgaagtaa gaaatattta atagttctcg ccacatcact atgcagctat ataaaaacta 1201 ctataaacgt ttgttttgtt ccttacgcac aatatccttg cctagaaatc gtttttgaaa 1261 tttaaatttt tattaccatt tatttgattc gccttcagaa aaatatggaa gagtgcatat 1321 ttaaaaagga ctatttcagc atatagtaaa agtcaggtta tttgtttatt tgcgatatca 1381 gagtaactta aactaactat gcagggcact tttaaaaggt tttaccatcc cacgcttacg 1441 cggatgtcct tcttggataa attcctcaag cctatgatgg caacggcttc cccaaaggaa 1501 taccagatca aacaactggt caagccaata ggcttaacac aagcaccaag gaaaagcacc 1561 aaatactccc aggggaactc tttgagggat atgtttgatt cggaaaagac aaaccacaga 1621 gttaaagagt tggccgttga attcagcaaa tctggacttt atgacgtgca agtcttccaa 1681 aagacaaagg ggaaattgtt tatagctcca gtttcatatt ggaaagaaga taaagctttg 1741 ttttttcctc atttgatagg aacggcaatg gatggtacga aacaacagaa tatcgaggat 1801 atgttaaggg gtaaaaccag tatagtgagg ttatttagta cagcatctgg cgataagttg 1861 agtagttcat acttccaagg aatcgtagac gataacaaaa aaactgacta cttgactgaa 1921 gctgatgcgc gtttaagttt aaatgacagt aacgtccaaa tcatcgaggt caatcttgta 1981 gaaaacgctg tgaaaagtgc tctagtgaaa acgcttgctc gttgggccaa tcgcgttcca 2041 tcctggcgcc agccatttta tttcgaatat tctagaggcc aatggccatt ttccgtcagg 2101 gaagagctct tttgcaataa tgtcttttct ggatacgtct ttcttgtgga ccagcagtta 2161 aaaattaggt gggcagcttg cggggaggct actccatctg aaaaggaagc attgtggaag 2221 tttgccaaac gtctgtgaag ttgacgcttt gtgcggcggc caacaaggga tgggcggcta 2281 tttggcgatc cacaggacgg gtgtggtcgc catgatcgcg tagtcgatag tggctccaag 2341 tag // LOCUS DDIDPYK1A 1090 bp ss-mRNA INV 01-AUG-1990 DEFINITION D.discoideum protein-tyrosine kinase-1 (DPYK1) mRNA, complete cds. ACCESSION M33785 KEYWORDS protein-tyrosine kinase-1. SOURCE D.discoideum (strain AX-3) 4-hour, cDNA to mRNA. ORGANISM Dictyostelium discoideum Eukaryota; Animalia; Protozoa; Sarcomastigophora; Sarcodina; Rhizopoda; Eumycetozoa; Dictyostelia; Dictyosteliida; Dictyosteliidae. REFERENCE 1 (bases 1 to 1090) AUTHORS Tan,J.L. and Spudich,J.A. TITLE Developmentally regulated protein-tyrosine kinase genes in Dictyostelium discoideum JOURNAL Mol. Cell. Biol. 10, 3578-3583 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by J.L.Tan, 20-APR-1990. FEATURES from to/span description pept < 1 1014 protein-tyrosine kinase-1 (DPYK1; AA at 1) BASE COUNT 371 a 198 c 186 g 335 t ORIGIN 1 cgcccatttg gtggttggga aactcaatca tcattatcac atccaccatc acgtccacca 61 ccacctccac caccaccacc acaactacca gttagatcag aatacgagat tgatttcaat 121 gaattagaat ttggtcaaac cattggtaaa ggtttctttg gtgaagtaaa gagaggttat 181 tggagagaga ctgatgttgc cataaaaatc atctatcgtg atcaattcaa aaccaaatca 241 tcattggtta tgtttcaaaa tgaagttgga atactaagta aattaagaca tccaaatgta 301 gttcaatttt tgggtgcatg tactgcagga ggtgaagatc atcattgtat agtaacagaa 361 tggatgggtg gaggtagttt aagacagttc ttgactgatc atttcaattt actcgaacaa 421 aatccacata ttcgtttgaa gttggctttg gatattgcaa aaggaatgaa ttatctacat 481 ggttggactc cacccattct tcatcgtgac ttatcctcaa gaaacatttt attggatcac 541 aacatcgatc caaagaatcc gttagtttcc tcaagacaag atattaaatg taagatctct 601 gattttggtc taagtagatt aaagaaggaa caagcctctc aaatgactca atcggttggt 661 tgtattccct acatggcacc agaggttttc aaaggcgata gtaatagtga aaagagtgat 721 gtttactcct atggcatggt tttgtttgaa ctattaacct ctgatgaacc tcaacaagat 781 atgaaaccaa tgaaaatggc tcacttggct gcttatgaat cttatcgtcc tccaattcca 841 ttaactacct cttccaagtg gaaagaaatt ctaactcaat gttgggattc taatcctgat 901 agtcgtccaa cctttaaaca aatcattgtt catctcaaag aaatggaaga tcaaggtgta 961 tcttcttttg catctgtacc tgttcaaact attgatactg gtgtttatgc ttaatttttt 1021 ttttataatt aaaaaaaaaa aaaacaaaac aaaaaaaaaa aataataata aatataatca 1081 cttcaactcg // LOCUS DDIDPYK2A 1291 bp ss-mRNA INV 01-AUG-1990 DEFINITION D.discoideum protein-tyrosine kinase-2 (DPYK2) mRNA, complete cds. ACCESSION M33784 KEYWORDS protein-tyrosine kinase-2. SOURCE D.discoideum (strain AX-3) 4-hour, cDNA to mRNA. ORGANISM Dictyostelium discoideum Eukaryota; Animalia; Protozoa; Sarcomastigophora; Sarcodina; Rhizopoda; Eumycetozoa; Dictyostelia; Dictyosteliida; Dictyosteliidae. REFERENCE 1 (bases 1 to 1291) AUTHORS Tan,J.L. and Spudich,J.A. TITLE Developmentally regulated protein-tyrosine kinase genes in Dictyostelium discoideum JOURNAL Mol. Cell. Biol. 10, 3578-3583 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by J.L.Tan, 20-APR-1990. FEATURES from to/span description pept < 1 1233 protein-tyrosine kinase-2 (DPYK2; AA at 1) BASE COUNT 491 a 207 c 203 g 390 t ORIGIN 1 cgattctaca atacaacaaa ctctactaaa gatatcacat ttttagtttg tgataatcct 61 gattcaacta aagaaaagag taacgtttca aatacttcat caataatttc cgcttcaaat 121 ttaaatagac atataacacc aaattctcat atgagaccta gaggtagatc aatttctgaa 181 tctttaatta tgtcaccaat taataaagaa tctttaaatg atattcaaag agcaattgaa 241 agtgaaaaaa taaagaaaac taaatttgaa gaattaaaat caatattggg cgaaagagaa 301 tatataattg atataaatga tattcaattt atacaaaaag ttggagaagg tgcattcagt 361 gaagtttggg aaggttggtg gaaaggtatt catgttgcca taaaaaagtt aaagattata 421 ggagatgaag aacaattcaa agagagattc attagagagg ttcaaaattt gaaaaaagga 481 aatcatcaaa acattgtcat gtttattggt gcatgttata aaccagcatg tatcataaca 541 gagtatatgg caggtggtag tctttacaat atacttcata atccaaatag ttccactcca 601 aaagttaaat attctttccc attggttttg aaaatggcaa ccgacatggc attgggctta 661 ttacatcttc attccatcac cattgtgcat cgtgatttaa ccagtcaaaa cattctattg 721 gatgaattgg gtaatataaa gatctctgat tttggtttat ctgctgaaaa gagtagagaa 781 ggttcaatga caatgacaaa tggtggcatt tgcaatccaa gatggagacc acccgaattg 841 acaaagaatt taggtcacta ctcggaaaag gttgatgtct attgtttctc tctagtagtt 901 tgggaaattt taactggcga aattcctttc tctgatttag atggatctca acgatccgct 961 caagtagctt atgctggttt aagaccacca ataccagagt attgcgatcc tgaattaaaa 1021 ttactcttaa ctcaatgttg ggaggctgat ccaaatgata gacctccctt tacctatata 1081 gtaaacaaat taaaagaaat ctcttggaat aatccaattg gtttcgtctc tgatcaattc 1141 tatcaatata gcgaaccttc aactccaaga ttagcattat caaatcaatc ttcaaattca 1201 agtagtattt ctttatcacc aactaaatta taaaaaaaaa aaaaaaaaaa aacaaatttc 1261 aaacaccaaa caccaccact catcaaaatc g // LOCUS HUMSPTB 6765 bp ss-mRNA PRI 01-AUG-1990 DEFINITION Human beta-spectrin (SPTB) mRNA, complete cds. ACCESSION J05500 KEYWORDS beta-spectrin; spectrin. SOURCE Human fetal liver, cDNA to mRNA, clones beta-[28,21A,29,286] and V252. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 6765) AUTHORS Winkelmann,J.C., Chang,J.-G., Tse,W.T., Scarpa,A.L., Marchesi,V.T. and Forget,B.G. TITLE Full length sequence of the cDNA for human erythroid beta-spectrin JOURNAL J. Biol. Chem. 265, 11827-11832 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by J.C.Winkelmann, 08-MAY-1990. FEATURES from to/span description pept 96 6509 beta-spectrin /nomgen="SPTB" /map="14" /hgml_locus_uid="LS0033T" mRNA < 1 6765 SPTB mRNA signal 6716 6722 poly-A signal BASE COUNT 1626 a 1822 c 2146 g 1171 t ORIGIN Chromosome 14q23-q24. 1 cgccaccccc gggctcgggt ggccccgctt cagtcccagg gcagggatcc ttccatgaag 61 actgaggcag gcggagctgc taagagcctg ctgacatgac atcggccaca gagtttgaaa 121 atgtgggcaa ccagccacct tacagcagga tcaatgcccg ctgggacgcc ccagacgacg 181 agctggataa tgacaatagc tcagccaggc tctttgagag gtcccggata aaggccttgg 241 cagatgagcg ggaagttgtt cagaaaaaga ccttcacgaa atgggtgaac tcgcacctgg 301 ctcgagtgtc ctgccgcatc accgatctct acaaggacct gcgggatggg cgcatgctca 361 tcaagctgct ggaggtgctc tctggagaga tgctgccaaa gcccaccaag gggaagatgc 421 gcatccactg cctggagaat gtggacaagg ctctccagtt cctcaaggag cagcgtgtac 481 acctggagaa catgggctcc catgacattg tagatggcaa ccaccgcctg gtcctgggcc 541 tcatctggac catcatcctc cgcttccaga ttcaggacat tgtggtccaa actcaggaag 601 gtcgtgaaac acgctcagcc aaggatgcgt tgctgttgtg gtgtcagatg aagacggcag 661 gctaccctca tgttaatgtc accaacttta cctccagctg gaaggatggc ttggccttta 721 atgccctgat acacaagcac cggcccgacc tgatcgactt tgataagctg aaggactcca 781 atgcccggca caacctggag cacgcattca atgtggctga gcgccagctg ggcatcatcc 841 cgctcctcga ccccgaagat gtctttacgg aaaaccctga tgagaaatcc atcatcacct 901 atgtggtggc cttttaccac tacttctcca agatgaaggt gctggcagtg gagggcaagc 961 gtgtcggcaa ggttattgac catgccattg agactgagaa gatgattgaa aagtacagcg 1021 ggctagcctc ggacctgctc acctggatcg agcagaccat cactgtcctg aacagccgca 1081 agtttgccaa ctcgctgacg ggcgtccagc agcagctgca ggccttcagc acctaccgca 1141 ccgtggagaa gccgcccaag tttcaagaga aggggaatct ggaagttcta ctttttacca 1201 tccagtcccg gatgagagcc aacaatcaga aagtgtacac accccacgat gggaaactag 1261 tgtctgacat caacagggcc tgggaaagcc tggaggaagc tgggtatcgg cgggagctgg 1321 ccctgagaaa tgagctcatt cggcaggaga agctagagca actagcccgg cgctttgacc 1381 ggaaggccgc aatgagagag acctggctca atgaaaacca gcgcctcgtg gcccaggata 1441 actttgggta tgacctggca gctgtggagg ccgccaagaa gaagcatgag gccatcgaga 1501 ccgacacggc tgcctacgag gagcgggtga gagccctgga ggacctggct caggagctgg 1561 agaaagagaa ctaccatgac cagaagcgca tcacggcccg caaggacaat atactgcgcc 1621 tatggagcta cctgcaggag ctgctgcagt cccggcgcca gaggctcgag accaccctgg 1681 cactgcagaa gctcttccag gacatgctgc acagcatcga ctggatggat gagatcaagg 1741 ctcacctctt gtctgccgag tttgggaagc acttgttgga ggttgaagac ctgctacaga 1801 agcacaagtt gatggaagct gacatcgcca tccaagggga caaagtgaag gccatcaccg 1861 cagccaccct gaagttcacc gaggggaaag ggtaccagcc ttgtgacccc caggtcatcc 1921 aggaccgcat gagccacttg gagcagtgct ttgaggagct gagcaacatg gcagctggcg 1981 caaggaccca actggagcag tccaaacgac tctggaagtt cttctgggag atggatgagg 2041 ctgagagctg gatcaaggag aaggagcaga tctattcttc cctggactat ggcaaagacc 2101 tgaccagtgt gctcatctta cagcgcaagc acaaggcctt tgaggatgag ctccgtgggc 2161 tggatgctca cctggagcag atcttccagg aggctcatgg catggttgcg cgcaagcagt 2221 ttgggcaccc gcagatcgag gcccgcatca aggaggtgtc ggcacagtgg gaccagctga 2281 aggacctggc tgccttctgc aagaagaacc tccaggatgc tgagaacttt ttccagttcc 2341 agggcgatgc ggatgacctg aaggcttggc tgcaagacgc ccaccggctg ctctctggtg 2401 aagatgtggg gcaggacgaa ggggccacgc gggccctggg gaaaaagcac aaggacttcc 2461 tggaggagct ggaggagagc cgtggggtga tggagcacct ggagcagcag gcccagggat 2521 tccccgaaga gtttcgggat tccccagatg tgacccatcg gctgcaggcc ctgcgggagc 2581 tctaccaaca ggtggtggcc caggcggacc tgcgtcagca gaggctgcag gaagccctgg 2641 acctgtacac ggtgttcggg gagacagacg cctgtgagct gtggatggga gagaaggaga 2701 agtggctggc cgagatggaa atgccagaca ccctggagga cctggaggtc gtgcagcaca 2761 ggttcgacat cctggaccag gagatgaaga ccttgatgac tcagattgat ggtgtgaacc 2821 tcgctgccaa cagcttggta gagagtggcc acccacgcag cagggaggtg aagcagtacc 2881 aggaccatct gaacaccagg tggcaggcat ttcagaccct ggtgtcggag cggcgggagg 2941 ctgtggactc agccctccga gtgcacacac tatgcgtaga ttgcgaggag accagcaagt 3001 ggatcacgga caagacaaag gtagtggagt ccacaaaaga cctggggcgg gacctggcag 3061 gtatcatcgc catccagagg aagttgtcag ggctggagcg tgacgtggcc gccatccagg 3121 cccgtgtgga tgccctggag cgtgagtccc agcagctgat ggactcgcac cctgagcaga 3181 aggagaatat tggtcagcgg caaaaacact tggaggagct gtggcagggc ctgcagcaat 3241 ccctgcaggg ccaggaggac ttgctggggg aagtcagcca gctgcaggcc ttcctgcagg 3301 atctggatga cttccaggcc tggctctcca tcacccagaa agctgtggcc tctgaggaca 3361 tgcccgaatc cctcccagag gctgagcagc tcctgcagca gcatgcaggt atcaaggatg 3421 agattgacgg gcaccaagac agctaccagc gtgttaagga gtctggggag aaagtgatcc 3481 aaggccagac ggacccagag tatctgcttc tgggccagcg gctggagggc ctggatactg 3541 gctgggatgc cctgggcagg atgtgggaga gccgcagcca caccctcgct cagtgccttg 3601 gcttccagga gttccagaaa gatgccaagc aggctgaagc catcctcagc aaccaggaat 3661 acactctggc tcacttggag cccccagact ccctggaagc tgcagaggct gggatccgga 3721 agtttgagga tttcttgggg tctatggaga acaaccggga taaggtcttg agtcctgtgg 3781 actctggaaa caagctggta gctgagggaa acctatactc agacaagatc aaggagaagg 3841 tgcagctgat tgaggacagg cacaggaaga acaacgagaa ggcccaggag gcctctgtcc 3901 tactgagaga caacctggag ctacagaact tcctccagaa ctgccaggag ctcactctct 3961 ggatcaacga caagctgctg acatctcagg atgtctccta tgatgaagca cgaaaccttc 4021 acaataaatg gctaaagcac caggcgtttg tggcagagct ggcttcccat gaagggtggc 4081 tagagaacat cgatgcggaa ggaaagcagc tgatggatga gaagccccag tttacagccc 4141 tggtgtccca aaagctggaa gccctgcacc ggctctggga cgagctgcag gccaccacaa 4201 aggagaagac ccagcacctc tcggctgcca ggagctccga cctgcgcttg cagacccatg 4261 ctgacctcaa caagtggatc agcgccatgg aggaccagct gcggtcagac gacccgggca 4321 aggacctgac cagtgtcaat cggatgttgg ctaagctgaa gcgagtggag gaccaagtga 4381 atgtgcggaa agaggagctg ggggagctgt ttgcccaggt gccttcaatg ggagaggagg 4441 gaggagatgc agacttgagc atcgagaagc ggttcctgga cctcctggaa cccctaggaa 4501 ggaggaagaa gcagctggaa tcatccagag ccaagctgca gatcagccgg gacttagagg 4561 atgagacgct ttgggtggag gagaggctgc ctctggccca gtcagccgac tatggcacta 4621 atctgcaaac tgtgcaactg ttcatgaaga agaaccagac actgcagaat gagattctgg 4681 gccatacgcc gcgggttgag gatgtgctgc agagagggca gcagctggtg gaggcggcgg 4741 agatcgactg ccaggacctt gaggagcgcc tggggcacct gcagagctcc tgggacaggc 4801 tgcgggaggc agcggccggg aggctgcagc gactgaggga cgccaatgag gcacagcagt 4861 actacctgga tgcggacgag gctgaggcct ggattggcga gcaggagctc tatgtcatct 4921 ccgatgagat ccccaaggat gaagagggcg ccatcgtgat gctgaagcga catttgcggc 4981 agcagcgtgc ggtggaggac tacggccgga acatcaagca gctggccagc cgggcccagg 5041 gcctgctgtc tgcaggccac cctgaggggg aacagatcat cagacttcag gggcaagtgg 5101 acaagcacta cgcagggctg aaggacgtgg cggaagagcg caagcgcaag ctggagaaca 5161 tgtaccacct gttccagctc aagcgggaga ccgacgacct ggagcagtgg atttcagaaa 5221 aggagctagt ggcctcttcc ccggaaatgg ggcaagactt tgaccacgtg actcttctgc 5281 gggacaagtt ccgggacttt gcccgggaga ccggggcgat tgggcaggag cgggtggaca 5341 atgtgaatgc cttcatcgag cgactcatcg acgcgggcca cagcgaggcg gccaccatcg 5401 ccgagtggaa ggacgggctg aacgagatgt gggcagacct cctggagctc attgacacgc 5461 gcatgcagct gctggccgcc tcctatgacc tgcaccgcta cttctacacg ggtgccgaga 5521 tcctgggcct catcgacgag aagcaccgcg agctgcccga ggacgtgggg ctggacgcca 5581 gcacggccga gtccttccac cgggtgcaca cagccttcga gcgggacgtt cacctgctgg 5641 gtgtccaggt gcagcagttc caggacgtgg ccacccgtct gcagacagca tatgctgggg 5701 agaaggcaga ggccatccag aacaaggagc aggaggtgtc tgccgcgtgg caggcgctgc 5761 tcgatgcctg tgccgggcgc cggacccagc tagtggacac ggcggataaa ttccgcttct 5821 tcagcatggc ccgtgacctc ctctcctgga tggagagcat catccggcag atcgagaccc 5881 aggagaggcc cagggatgtc tcctctgtgg aactgctcat gaagtatcac cagggcatca 5941 atgcagagat tgaaacccgg agcaagaact tcagtgcctg cctggagctt ggcgagtccc 6001 tgctgcagcg gcagcaccag gcctcagagg agatccgcga gaaactgcag caggtgatgt 6061 ccaggaggaa agagatgaat gagaagtggg aagcccgctg ggagcggctc cgcatgttgc 6121 tggaggtgtg ccagttctcg agggatgcct ctgtggctga ggcgtggctg attgcccagg 6181 agccctacct ggccagcggg gactttggac acacagtgga cagtgtggag aagctcatca 6241 agaggcatga ggcttttgag aagtccacgg ccagctgggc agagcgcttt gctgccctgg 6301 agaagcccac cacgcttgag ctgaaagaac gccagattgc agagagaccc gcagaggaga 6361 ctgggcctca agaggaggaa ggcgagacag caggggaggc tccagtttcc caccatgcgg 6421 ccaccgagag aacgtccccg gtcagtctct ggtctcgttt gtctagttcc tgggagtcac 6481 tgcagccaga gccctctcac ccctactagc tcagcccagg tggaggcgag atgagctgcg 6541 cagccccgcc ctccatcctc cccacatccc tgcagccacc tcccagcaga gcaggctacg 6601 tcctcactga ggtgttcttc atgagagtac tagcctcctc cactcctccc cacagcgcag 6661 aggaaacagg ccagcccagt gacatgacgt tattagtttt gttttacctg aatgtaataa 6721 attttattgt ataaatatat caccatttac atgaggggaa acact // LOCUS STYEUTBC 2526 bp ds-DNA BCT 01-AUG-1990 DEFINITION S.typhimurium ethanolamine ammonia-lyase (eutB, eutC) genes, complete cds. ACCESSION J05518 KEYWORDS ethanolamine ammonia-lyase. SOURCE S.typhimurium (strain LT2) DNA, clones pBSE4.5 and pUCE6.5. ORGANISM Salmonella typhimurium Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Facultatively anaerobic rods; Enterobacteriaceae. REFERENCE 1 (bases 1 to 2526) AUTHORS Faust,L.P., Connor,J.A., Roof,D.M., Hoch,J.A. and Babior,B.M. TITLE Cloning, sequencing and expression of the genes encoding the alcohol-dependent ethanolamine ammonia-lyase of Salmonella typhimurium JOURNAL J. Biol. Chem. (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by B.M.Babior, 08-MAY-1990. FEATURES from to/span description pept 141 1499 ethanolamine ammonia-lyase (eutB) pept 1518 2378 ethanolamine ammonia-lyase (eutC) binding 130 133 ribosome binding site binding 1507 1510 ribosome binding site BASE COUNT 563 a 687 c 779 g 497 t ORIGIN 1 accgcaactt ccgctggcgg tcatcgatga ggtggtcgtg cgggcgggag actatatcga 61 cattggtacg cctctttttg gcggatcggt tgtgccggtg acgtgaaatc actcgcattt 121 ccttcctgag ggaacgactt atgaaactaa agaccacatt gttcggcaat gtttatcagt 181 ttaaggatgt aaaagaggta ctggctaaag ccaacgaact gcgttcgggg gatgtgctgg 241 ccggggttgc cgcggcaagt tcgcaggagc gcgtagcggc aaaacaggta ctgtcggaaa 301 tgacggtggc ggatatccgc aacaatccgg tgattgccta tgaagaggac tgcgtgacgc 361 gcctgattca ggacgacgtc aacgaaacgg cctataaccg gattaaaaac tggagcatca 421 gcgaactgcg tgaatacgtg ctgagcgatg aaacctccgt ggacgacatc gcgtttaccc 481 gcaaaggcct gacctccgaa gtggtggcgg cagtagcgaa aatctgctcc aacgctgacc 541 tgatctacgg cggcaagaaa atgccggtga tcaaaaaagc caataccacc atcggtattc 601 cgggcacctt tagctgccgt ttgcagccga acgatacccg tgacgatgta cagagtatcg 661 ccgcgcaaat ctacgaaggg ctttctttcg gcgcaggcga tgcggtgatc ggcgttaacc 721 cggtgaccga tgacgtggag aacctgaccc gcgtgctcga caccgtttac gcgttatcga 781 taaattcaat attccgaccc agggctgcgt gctggcgcac gtcaccaccc agatcgaagc 841 gattcgtcgc ggcgcccggg cggactgatt ttccagagca tttgcggcac gagaagggct 901 taaaagagtt cggcgtcgag ctggccatgc tcgacgaagc gcgggctgtg ggggcggagt 961 tcaaccgcat cgccggggaa aactgcctgt actttgaaac cgggcaaggg tctgcgctct 1021 ccgcaggcgc gaactttggt gccgaccagg tgacgatgga agcgcgtaac tacgggctgg 1081 cgcgccacta cgatccgttc ctggtgaaca ccgtggtggg ctttatcggg ccggagtatc 1141 tctacaacga caggcagatt atccgcgccg gtctcgaaga tcactttatg ggcaagctga 1201 gcggcatctc gatgggctgc gactgctgct ataccaacca tgccgacgcc gaccagaacc 1261 ttaacgaaaa cctgatgatt ctgctcgcca ctgccggctg taactacatc atggggatgc 1321 cgctcggcga cgacatcatg ctcaactacc agaccaccgc tttccacgat accgccaccg 1381 tccgtcagtt gctgaattta cggccgtcgc cggagtttga acgctggctg gaaacgatgg 1441 gcattatggc aaacggtcgt ctgaccaaac gggcgggcga tccgtcactg ttcttctgat 1501 gacgcgggga taacaccatg gatcaaaaac agattgaaga aattgtacgt agcgtgatgg 1561 cgtcaatggg acaggacgta ccgcagcccg ccgcgccgtc aacgcaggaa ggcgcaaagc 1621 cgcagtgcgc cgcgccgacg gtgaccgaaa cgtgcgcgct ggatttaggt tccgcggagg 1681 caaaagcctg gattggcgtc gagaacccac atcgtgcgga cgtgctgacc gaactgcgtc 1741 gcagtactgc ggcacgcgtc ttgtacgggg cgtgccgggc cgcgtccgcg cacccaggcg 1801 ctgttgcgtt cctggcggat cactcccgtt cgaaagatac cgtgctcaaa gaagtgccgg 1861 aagagtgggt aaaagcgcaa gggctgctgg aagtgcgttc ggaagagtgg gtaaaagcgc 1921 aagggctgct ggaagtgcgt tcggagatca gcgacaaaaa cctgtacctg acgcgcccgg 1981 atatggggcg tcgcctgagc ccggaagcca ttgacgcgct gaagtcacag tgcgtgatga 2041 acccggatgt gcaggtagtg gtctccgatg gcctctctac ggatgcgatc accgccaact 2101 atgaagagat cctgccgccg ttgcttgccg gtctgaagca ggccgggctg aacgtcggca 2161 cgccgttctt tgtgcgctat ggccgtgtga agattgaaga tcagattggc gaaattctcg 2221 gcgcgaaggt cgtcatcctg ctggtaggcg aacgtccggg gctggggcag tcggaaagcc 2281 tttcctgcta cgcggtctat tccccgcgcg tggcaccacc gtcgaggccg acagaacctg 2341 tatttcaaac attcatcagg gggggacgcc gccagtagaa gccgccgccg tgattgtgga 2401 tttggccaaa cggatgctgg agcatgaaag cgtccggcat caacatgtac ccggttaagg 2461 agacatcatg cctgcattag atttaattcg accttcacgt gactgccata gcgcgtgatt 2521 gcctcc // LOCUS XELPCNA 1018 bp ss-mRNA VRT 01-AUG-1990 DEFINITION X.laevis proliferating cell nuclear antigen (PCNA) mRNA, complete cds. ACCESSION M34080 KEYWORDS nuclear protein; proliferating cell nuclear antigen. SOURCE X.laevis oocyte, cDNA to mRNA. ORGANISM Xenopus laevis Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Amphibia; Lissamphibia; Anura; Archeobatrachia; Pipoidea; Pipidae. REFERENCE 1 (bases 1 to 1018) AUTHORS Leibovici,M., Gusse,M., Bravo,R. and Mechali,M. TITLE Characterization and developmental expression of Xenopus proliferating cell nuclear antigen (PCNA) JOURNAL Dev. Biol. (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by M.Leibovici, 08-MAY-1990. FEATURES from to/span description pept 28 813 proliferating cell nuclear antigen (PCNA) mRNA < 1 1018 PCNA mRNA BASE COUNT 284 a 223 c 237 g 274 t ORIGIN 1 ccgcagtaat cccttacagc cgccgccatg tttgaggctc gcttggtgca gggttccatc 61 ctgaagaagg tgttggaggc gctgaaggac ctaatcgatg aggcgtgctg ggacattaca 121 tccagcggca tcagcttgca gagcatggac tcctcgcacg tctccctggt tcaactcact 181 ctgcgatctg acggctttga cacctaccgg tgtgatcgca atcaatctat cggcgtcaag 241 atgagcagta tgtccaaaat cttgaagtgt gccgcaagtg acgatatcat tactctgagg 301 gcagaagaca atgctgatac agtcacaatg gtgtttgagt cgccaaatca agagaaagtt 361 tcagactatg aaatgaagct aatggacctt gatgtggagc agctgggcat tcctgaacaa 421 gagtacagct gtgtaataaa gatgccatct ggtgaatttg cacgtatctg ccgagatctc 481 agccagattg gtgacgcagt agtaatttct tgtgctaagg atggggtaaa gttctctgca 541 agcggagagc tgggaactgg aaatgtaaag ctgtcacaga cttcaaatgt ggataaagaa 601 gaggaagctg ttacaataga aatgaatgag ccagtacagc ttacatttgc tttgcggtat 661 ctgaacttct tcaccaaagc tacacccctg tccccaacag ttattctcag tatgtctgca 721 gatatcccac ttgttgtgga atacaaaatt gcagatatgg aacatgtgaa atactacctg 781 gctcccaaga ttgaagatga agaagcttct taatgtctga actagcttat tttataaacc 841 tcaactgaac gtccaatggc gctttcacac acctgccttg ttttaacagc tttggctgaa 901 cctacccaac ttgtaccaac tggctgtact tctaggcatg tctgtagata tttttgtaaa 961 tacgtcacga tttttgtaaa atctctgccc taggaggtca ataaatcttt gtaataac // LOCUS YSCAAC2A 1333 bp ds-DNA PLN 01-AUG-1990 DEFINITION S.cerevisiae ADP/ATP-translocator protein (AAC2) gene, complete cds. ACCESSION M34076 J05542 KEYWORDS ADP/ATP translocase; ADP/ATP-translocator protein. SOURCE S.cerevisiae (strain W303-1B) DNA. ORGANISM Saccharomyces cerevisiae Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes; Endomycetales; Saccharomycetaceae. REFERENCE 1 (bases 1 to 1333) AUTHORS Kolarov,J., Kolarova,N. and Nelson,N. TITLE A third ADP/ATP-translocator in yeast JOURNAL J. Biol. Chem. (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by N.Nelson, 08-MAY-1990. FEATURES from to/span description pept 235 1158 ADP/ATP-translocator protein (AAC2) BASE COUNT 388 a 209 c 301 g 435 t ORIGIN 1 ataacctgag gtgacgattt gaataagttt cctttttttt tttctttcat gttggttgcc 61 ttcaattaca tatagattct cgagaaggtt tccattgtcc tttcattagg cgttgaagtg 121 aatctaaagt gcgcttgaat gatttcagat agaaagacta aagaagtggt gtgagtataa 181 ttaactcaat tgaagacggt ttacctgaag tgatatactg tgccttgaga aacaatgagt 241 agcgacgcta agcaacaaga aacaaacttt gccattaatt tcttaatggg tggtgtgagt 301 gcggccatcg ctaaaactgc tgcctcacca atcgaaagag tcaagatctt gatccaaaat 361 caagatgaaa tgatcaagca aggaacttta gataaaaagt attccggtat cgtggattgt 421 ttcaagagaa ctgcaaagca agagggacta atatcctttt ggcgaggaaa tactgccaat 481 gttattcgtt attttcccac tcaagctttg aacttcgcct tcaaagataa gattaagttg 541 atgtttggtt tcaagaaaga ggaaggctat ggtaaatggt ttgcaggtaa tctggcttct 601 ggtggtgcag ctggtgctct ttcgttatta tttgtttatt ctttagattt tgccagaacc 661 agacttgctg ctgatgcaaa atcgtcgaaa aagggtggcg ctcgccaatt caatgggttg 721 actgatgttt ataaaaagac cttgaaatcg gatggtatcg caggattata cagaggattc 781 atgccatcag tagtgggtat cgtggtttat agaggactat atttcggtat gtttgattct 841 ctcaagccac tggtgctaac tggttcatta gatggttcat tcttggcttc atttttattg 901 ggatgggtgg tcactacagg tgcctcaaca tgttcttatc cattagacac agtgagaaga 961 agaatgatga tgacttcagg tcaagcagta aagtacaacg gtgctataga ttgtctcaaa 1021 aaaatcgtag cttctgaagg tgtagggtca ttgttcaaag gctgcggggc aaatatcttg 1081 agaagtgttg ctggagctgg tgttatttcc atgtatgacc agttgcaaat gatattgttc 1141 ggtaaaaaat tcaaatgatc agttggatga agaaaaaagt cattttctcg acttctcttc 1201 acctttcgat cgatttgatt ttggccgcca acttgtttat agaaaaaaaa tagtaggaag 1261 gttatgtatc gctttctttt attttttatt atagagtata actgaataaa tttgtaaatc 1321 agccactgtt gtt // LOCUS YSCAAC3 1308 bp ds-DNA PLN 01-AUG-1990 DEFINITION S.cerevisiae ADP/ATP-translocator protein (AAC3) gene, complete cds. ACCESSION M34075 J05542 KEYWORDS ADP/ATP translocase; ADP/ATP-translocator protein. SOURCE S.cerevisiae (strain W303-1B) DNA. ORGANISM Saccharomyces cerevisiae Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes; Endomycetales; Saccharomycetaceae. REFERENCE 1 (bases 1 to 1308) AUTHORS Kolarov,J., Kolarova,N. and Nelson,N. TITLE A third ADP/ATP-translocator in yeast JOURNAL J. Biol. Chem. (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by N.Nelson, 08-MAY-1990. FEATURES from to/span description pept 78 1034 ADP/ATP-translocator protein (AAC3) BASE COUNT 353 a 228 c 263 g 464 t ORIGIN 1 atatttgtcg ttgttctttt ttgtgtgctc ttttatactt cagaatcata cattaacata 61 catataagca aatagccatg tcttccaacg cccaagtcaa aaccccatta cctccagccc 121 cagctccaaa gaaggaatct aactttttga ttgatttctt aatgggtggt gtcagtgccg 181 ctgtcgccaa aactgctgca tctcccatcg aaagagttaa acttttgatc caaaaccaag 241 atgaaatgat caagcaagga actttagata aaaagtattc cggtatcgtg gattgtttca 301 agagaactgc aaagcaagag ggactaatat ccttttggcg aggaaatact gccaatgtta 361 ttcgttattt ccccactcaa gctttgaact tcgccttcaa agataagatt aagttgatgt 421 ttggtttcaa gaaagaggaa ggctatggta aatggtttgc cggtaacttg gcatctggtg 481 gtgctgctgg tgccttgtca ttactatttg tttactcttt ggattatgca agaactagat 541 tggctgctga ctccaagtcc tctaaaaagg gtggtgctcg tcaattcaac ggtttgatcg 601 atgtctacaa gaagacctta aaatctgatg gtgttgctgg tctttacaga ggtttcttac 661 cttctgtcgt tggtattgtt gtctacagag gtctatactt cggtatgtac gattctttga 721 agcctctatt gttgactggt tctttggaag gttcattctt ggcttcattc ttgttgggtt 781 gggttgttac tactggtgct tctacatgtt cttacccatt ggataccgtt agaagaagaa 841 tgatgatgac ctccggtcaa gctgttaagt acgacggtgc ctttgactgt ttgaggaaga 901 ttgttgctgc tgaaggtgtt ggttctctat tcaagggttg tggtgctaac atcttaagag 961 gtgtcgcagg tgctggtgtt atctcaatgt acgaccaact gcaaatgatc ttgtttggta 1021 agaagttcaa ataagtctaa tctggcttga ttcttaatct aaattctttc tcacattttc 1081 ctttttttct tctttggatt tttgggtgtt taatgagtga cacgatttgt tttgataata 1141 ttattatcct cctatttttt tagaaattct tttcaacaag aatcaaagat tcataaaaaa 1201 agtaaaacga tgaaattttt tgaacaaatt ttacgtataa agaagaaaaa aattaaattc 1261 taaatatcca gtaaatcgtt ttatattagt agtattcttt cccacttt // LOCUS HUMMTVA1 367 bp ds-DNA ORG 01-AUG-1990 DEFINITION Human (!Kung 13) mitochondrial DNA sequences, 5' end. ACCESSION M28909 KEYWORDS mitochondrial DNA. SEGMENT 1 of 2 SOURCE Human mitochondrial hair root DNA. ORGANISM Mitochondrion Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae; Homo sapiens. REFERENCE 1 (bases 1 to 367) AUTHORS Vigilant,L., Pennington,R., Harpending,H., Kocher,T.D. and Wilson,A.C. TITLE Mitochondrial DNA sequences in single hairs from a southern African population JOURNAL Proc. Natl. Acad. Sci. U.S.A. 86, 9350-9354 (1989) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by L.Vigilant 06-OCT-1989. BASE COUNT 121 a 123 c 42 g 80 t 1 others ORIGIN 1 ttctttcatg gggaagcaga tttgggtacc acccaagtat tgactcaccc atcaacaacc 61 gctatgtatt tcgtacatta ctgccagcca ccatgaatat tgtacagtac cataaatact 121 tgaccaccta tagtacataa aaacccanat ccacatcaaa accctccccc catgcttaca 181 agcaagtaca gcaatcaacc ttcaactgtc acacatcaac cgcaactcca aagccacccc 241 tcacccacta ggataccaac aaacctaccc atccttaaca gtacatagca cataaagcca 301 tttaccgtac atagcacatt acagtcaaat cccttctcgt ccccatggat cacccccctc 361 agatagg // LOCUS HUMMTVA2 361 bp ds-DNA ORG 01-AUG-1990 DEFINITION Human (!Kung 13) mitochondrial DNA sequences, 3' end. ACCESSION M28910 KEYWORDS mitochondrial DNA. SEGMENT 2 of 2 SOURCE Human mitochondrial hair root DNA. ORGANISM Mitochondrion Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae; Homo sapiens. REFERENCE 1 (bases 1 to 361) AUTHORS Vigilant,L., Pennington,R., Harpending,H., Kocher,T.D. and Wilson,A.C. TITLE Mitochondrial DNA sequences in single hairs from a southern African population JOURNAL Proc. Natl. Acad. Sci. U.S.A. 86, 9350-9354 (1989) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by L.Vigilant 06-OCT-1989. BASE COUNT 107 a 102 c 54 g 93 t 5 others ORIGIN 1 tttggtattt tcgtctgggg ggtgtgcacg cgatagcatt gcgagacgct ggagccggag 61 caccctatgt cgcagtatct gtctttgatt cctgccccat cctattattt atcgcaccta 121 cgttcaatat tacaggcgaa catacnctac taaagtgtgt taattaatta atgcttgtag 181 gacataataa taacaattaa atgtctgcac agccactttc cacacagaca tcataacaaa 241 aaatttncca ccaaaccccc ccnnntcccc ccgcttctgg ccacagcact taaacacatc 301 tctgccaaac cccaaaaaca aagaacccta acaccagcct aaccagattt caaattttat 361 c // LOCUS HUMMTVB1 367 bp ds-DNA ORG 01-AUG-1990 DEFINITION Human (!Kung 14) mitochondrial DNA sequences, 5' end. ACCESSION M28911 KEYWORDS mitochondrial DNA. SEGMENT 1 of 2 SOURCE Human mitochondrial hair root DNA. ORGANISM Mitochondrion Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae; Homo sapiens. REFERENCE 1 (bases 1 to 367) AUTHORS Vigilant,L., Pennington,R., Harpending,H., Kocher,T.D. and Wilson,A.C. TITLE Mitochondrial DNA sequences in single hairs from a southern African population JOURNAL Proc. Natl. Acad. Sci. U.S.A. 86, 9350-9354 (1989) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by L.Vigilant 06-OCT-1989. BASE COUNT 121 a 120 c 43 g 82 t 1 others ORIGIN 1 ttctttcatg gggaagcaga tttgggtacc acccaagtat tgactcaccc atcaacaacc 61 gctatgtatt tcgtacatta ctgccagcca ccatgaatat tgtacagtac cataaatact 121 tgaccaccta tagtacataa aaacccanat ccacatcaaa accctccccc catgcttaca 181 agcaagtaca gcaatcaacc ttcaactgtc acatatcaac cgtaactcca aagccacccc 241 tcacccacta ggataccaac aaacctaccc atccttaaca gtacatagca cataaagcca 301 tttaccgtac atagcacatt acagtcaaat cccttctcgt ccccatggat gacccccctc 361 agatagg // LOCUS HUMMTVB2 356 bp ds-DNA ORG 01-AUG-1990 DEFINITION Human (!Kung 14) mitochondrial DNA sequences, 3' end. ACCESSION M28912 KEYWORDS mitochondrial DNA. SEGMENT 2 of 2 SOURCE Human mitochondrial hair root DNA. ORGANISM Mitochondrion Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae; Homo sapiens. REFERENCE 1 (bases 1 to 356) AUTHORS Vigilant,L., Pennington,R., Harpending,H., Kocher,T.D. and Wilson,A.C. TITLE Mitochondrial DNA sequences in single hairs from a southern African population JOURNAL Proc. Natl. Acad. Sci. U.S.A. 86, 9350-9354 (1989) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by L.Vigilant 06-OCT-1989. BASE COUNT 105 a 104 c 52 g 90 t 5 others ORIGIN 1 ttcgtctggg gggtgtgcac gcgatagcat tgcgagacgc tggagccgga gcaccctatg 61 tcgcagtatc tgtctttgat tcctgcccca tcccattatt tatcgcacct acgttcaata 121 ttacaggcga acatacncta ctaaagtgtg ttaattaatt aatgcttgta ggacataata 181 ataacaattn aatgtctgca cagccacttt ccacacagac atcataacaa aaaatttncc 241 accaaacccc ccccnntccc cccgcttctg gccacagcac ttaaacacat ctctgccaaa 301 ccccaaaaac aaagaaccct aacaccagcc taaccagatt tcaaatttta tctttt // LOCUS HUMMTVC1 367 bp ds-DNA ORG 01-AUG-1990 DEFINITION Human (!Kung 11) mitochondrial DNA sequences, 5' end. ACCESSION M28905 KEYWORDS mitochondrial DNA. SEGMENT 1 of 2 SOURCE Human mitochondrial hair root DNA. ORGANISM Mitochondrion Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae; Homo sapiens. REFERENCE 1 (bases 1 to 367) AUTHORS Vigilant,L., Pennington,R., Harpending,H., Kocher,T.D. and Wilson,A.C. TITLE Mitochondrial DNA sequences in single hairs from a southern African population JOURNAL Proc. Natl. Acad. Sci. U.S.A. 86, 9350-9354 (1989) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by L.Vigilant 06-OCT-1989. BASE COUNT 119 a 123 c 45 g 79 t 1 others ORIGIN 1 ttctttcatg gggaagcaga tttgggtacc acccaagtat tgactcaccc atcaacaacc 61 gctatgtatt tcgtacatta ctgccagcca ccatgaatat tgtacagtac cataaatact 121 tgaccacctg tagtacataa aaacccanat ccacatcaaa accctccccc catgcttaca 181 agcaagtacg gcaatcaacc ttcaactgtc acacatcaac cgcaactcca aagccacccc 241 tcacccacta ggataccaac aaacctaccc acccttaaca gtacatagca cataaagcca 301 tttaccgtac atagcacatt acagtcaaat cccttctcgt ccccatggat gacccccctc 361 agatagg // LOCUS HUMMTVC2 371 bp ds-DNA ORG 01-AUG-1990 DEFINITION Human (!Kung 11) mitochondrial DNA sequences, 3' end. ACCESSION M28906 KEYWORDS mitochondrial DNA. SEGMENT 2 of 2 SOURCE Human mitochondrial hair root DNA. ORGANISM Mitochondrion Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae; Homo sapiens. REFERENCE 1 (bases 1 to 371) AUTHORS Vigilant,L., Pennington,R., Harpending,H., Kocher,T.D. and Wilson,A.C. TITLE Mitochondrial DNA sequences in single hairs from a southern African population JOURNAL Proc. Natl. Acad. Sci. U.S.A. 86, 9350-9354 (1989) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by L.Vigilant 06-OCT-1989. BASE COUNT 109 a 107 c 55 g 94 t 6 others ORIGIN 1 tctnccatgc atttggtatt ttcgtctggg gggtgtgcac gcgatagcat tgcgagacgc 61 tggagccgga gcaccctatg tcgcagcacc tgtctttgat tcctgcccca ttccattatt 121 tatcgcacct acgttcaata ttacaggcga acatacncta ctaaagtgtg ttaattaatt 181 aatgcttgta ggacataata ataacaatta aatgtctgca cagccacttt ccacacagac 241 atcataacaa aaaatttncc accaaacccc cccnnntccc cccgcttctg gccacagcac 301 ttaaacacat ctctgccaaa ccccaaaaac aaagaaccct aacaccagcc taaccagatt 361 tcaaatttta t // LOCUS HUMMTVD1 368 bp ds-DNA ORG 01-AUG-1990 DEFINITION Human (!Kung 12) mitochondrial DNA sequences, 5' end. ACCESSION M28907 KEYWORDS mitochondrial DNA. SEGMENT 1 of 2 SOURCE Human mitochondrial hair root DNA. ORGANISM Mitochondrion Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae; Homo sapiens. REFERENCE 1 (bases 1 to 368) AUTHORS Vigilant,L., Pennington,R., Harpending,H., Kocher,T.D. and Wilson,A.C. TITLE Mitochondrial DNA sequences in single hairs from a southern African population JOURNAL Proc. Natl. Acad. Sci. U.S.A. 86, 9350-9354 (1989) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by L.Vigilant 06-OCT-1989. BASE COUNT 119 a 122 c 46 g 80 t 1 others ORIGIN 1 ttctttcatg gggaagcaga tttgggtacc acccaagtat tgactcaccc atcaacaacc 61 gctatgtatt tcgtacatta ctgccagcca ccatgaatat tgtacggtac cataaatact 121 tgaccacctg tagtacataa aaacccanac ccacatcaaa accctccccc catgcttaca 181 agcaagcaca gcaatcaacc ttcaactgtc acacatcaac tgcaactcca aagccacccc 241 tcacccacta ggatatcaac aaacctactc acccttaaca gtacatagca cataaagcca 301 tttaccgtac atagcacatt acagtcaaat cccttctcgt ccccatggat gacccccctc 361 agataggg // LOCUS HUMMTVD2 375 bp ds-DNA ORG 01-AUG-1990 DEFINITION Human (!Kung 12) mitochondrial DNA sequences, 3' end. ACCESSION M28908 KEYWORDS mitochondrial DNA. SEGMENT 2 of 2 SOURCE Human mitochondrial hair root DNA. ORGANISM Mitochondrion Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae; Homo sapiens. REFERENCE 1 (bases 1 to 375) AUTHORS Vigilant,L., Pennington,R., Harpending,H., Kocher,T.D. and Wilson,A.C. TITLE Mitochondrial DNA sequences in single hairs from a southern African population JOURNAL Proc. Natl. Acad. Sci. U.S.A. 86, 9350-9354 (1989) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by L.Vigilant 06-OCT-1989. BASE COUNT 109 a 104 c 59 g 97 t 6 others ORIGIN 1 ggaggctctn ccatgcattt ggtattttcg tctggggggt gtgcacgcga tagcattgcg 61 agacgctgga gccggagcac cctatgtgca gtatctgtct ttgattcctg ccccattcca 121 ttatttatcg cacctacgtt caatattaca ggcgagcata cnctattaaa gtgtattaat 181 taattaatgc ttgtaggaca taataataac aattaaatgt ctgcacagcc actttccaca 241 cagatcataa caaaaaattt nccaccaaac ccccccnnnt ccccccgctt ctggccacag 301 cacttaaaca catctctgcc aaaccccaaa aacaaagaac cctaacacca gcctaaccag 361 atttcaaatt ttatc // LOCUS HUMMTVE1 367 bp ds-DNA ORG 01-AUG-1990 DEFINITION Human (!Kung 1-4) mitochondrial DNA sequences, 5' end. ACCESSION M28893 KEYWORDS mitochondrial DNA. SEGMENT 1 of 2 SOURCE Human mitochondrial hair root DNA. ORGANISM Mitochondrion Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae; Homo sapiens. REFERENCE 1 (bases 1 to 367) AUTHORS Vigilant,L., Pennington,R., Harpending,H., Kocher,T.D. and Wilson,A.C. TITLE Mitochondrial DNA sequences in single hairs from a southern African population JOURNAL Proc. Natl. Acad. Sci. U.S.A. 86, 9350-9354 (1989) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by L.Vigilant 06-OCT-1989. BASE COUNT 120 a 121 c 44 g 81 t 1 others ORIGIN 1 ttctttcatg gggaagcaga tttgggtacc acccaagtat tgactcaccc atcaacaacc 61 gctatgtatt tcgtacatta ctgccagcca ccatgaatat tgtacagtac cataaatact 121 tgaccacctg tagtacataa aaacccanat ccacatcaaa accctccccc catgcttaca 181 agcaagtaca gcaatcaacc ttcaactgtc acacattaac cgcaactcca aagccacccc 241 tcacccacta ggataccaac aaacctaccc atccttaaca gtacatagca cataaagcca 301 tttaccgtac atagcacatt acagtcaaat cccttctcgt ccccatggat gacccccctc 361 agatagg // LOCUS HUMMTVE2 362 bp ds-DNA ORG 01-AUG-1990 DEFINITION Human (!Kung 1-4) mitochondrial DNA sequences, 3' end. ACCESSION M28894 KEYWORDS mitochondrial DNA. SEGMENT 2 of 2 SOURCE Human mitochondrial hair root DNA. ORGANISM Mitochondrion Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae; Homo sapiens. REFERENCE 1 (bases 1 to 362) AUTHORS Vigilant,L., Pennington,R., Harpending,H., Kocher,T.D. and Wilson,A.C. TITLE Mitochondrial DNA sequences in single hairs from a southern African population JOURNAL Proc. Natl. Acad. Sci. U.S.A. 86, 9350-9354 (1989) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by L.Vigilant 06-OCT-1989. BASE COUNT 108 a 102 c 54 g 93 t 5 others ORIGIN 1 catttggtat tttcgtctgg ggggtgtgca cgcgatagca ttgcgagacg ctggagccgg 61 agcaccctat gtcgcagtat ctgtctttga ttcctgcccc atcctattat ttatcgcacc 121 tacgttcaat attacaggcg aacatacnct actaaagtgt gttaattaat taatgcttgt 181 aggacataat aataacaatt aaatgtctgc acagccactt tccacacaga catcataaca 241 aaaaatttnc caccaaaccc ccccnnntcc ccccgcttct ggccacagca cttaaacaca 301 tctctgccaa accccaaaaa caaagaaccc taacaccagc ctaaccagat ttcaaatttt 361 at // LOCUS HUMMTVF1 369 bp ds-DNA ORG 01-AUG-1990 DEFINITION Human (!Kung 7) mitochondrial DNA sequences, 5' end. ACCESSION M28899 KEYWORDS mitochondrial DNA. SEGMENT 1 of 2 SOURCE Human mitochondrial hair root DNA. ORGANISM Mitochondrion Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae; Homo sapiens. REFERENCE 1 (bases 1 to 369) AUTHORS Vigilant,L., Pennington,R., Harpending,H., Kocher,T.D. and Wilson,A.C. TITLE Mitochondrial DNA sequences in single hairs from a southern African population JOURNAL Proc. Natl. Acad. Sci. U.S.A. 86, 9350-9354 (1989) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by L.Vigilant 06-OCT-1989. BASE COUNT 120 a 122 c 47 g 79 t 1 others ORIGIN 1 ttctttcatg gggaagcaga tttgggtacc acccaagtat tgactcaccc atcaacaacc 61 gctatgtatc tcgtacatta ctgccagcca ccatgaatat tgtacagtac cataaatact 121 tgaccacctg tagtacataa aaacccanat ccacatcaaa accctccccc catgcttaca 181 agcaagtaca gcaatcaacc ctcaactgtc atacatcaac cgcaactcca aagccactcc 241 tcagccacta ggataccaac aaacctaccc acccttaaca gtacatagca cataaagcca 301 tttaccgtac atagcacatt acagtcaaat cccttctcgt ccccatggat gacccccctc 361 agatagggg // LOCUS HUMMTVF2 371 bp ds-DNA ORG 01-AUG-1990 DEFINITION Human (!Kung 7) mitochondrial DNA sequences, 3' end. ACCESSION M28900 KEYWORDS mitochondrial DNA. SEGMENT 2 of 2 SOURCE Human mitochondrial hair root DNA. ORGANISM Mitochondrion Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae; Homo sapiens. REFERENCE 1 (bases 1 to 371) AUTHORS Vigilant,L., Pennington,R., Harpending,H., Kocher,T.D. and Wilson,A.C. TITLE Mitochondrial DNA sequences in single hairs from a southern African population JOURNAL Proc. Natl. Acad. Sci. U.S.A. 86, 9350-9354 (1989) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by L.Vigilant 06-OCT-1989. BASE COUNT 109 a 107 c 55 g 94 t 6 others ORIGIN 1 tctnccatgc atttggtatt ttcgtctggg gggtgtgcac gcgatagcat tgcgagacgc 61 tggagccgga gcaccctatg tcgcagtatc tgtctttgat tcctgcccca tcccattatt 121 tatcgcacct acgttcaata ttacaggcga acatacncta ccaaagtgtg ttaattaatt 181 aatgcttgta ggacataata ataacaatta aatgtctgca cagccacttt ccacacagac 241 atcataacaa aaaatttncc accaaacccc cccnnntccc cccgcttctg gccacagcac 301 ttaaacacat ctctgccaaa ccccaaaaac aaagaaccct aacaccagcc taaccagatt 361 tcaaatttta t // LOCUS HUMMTVG1 340 bp ds-DNA ORG 01-AUG-1990 DEFINITION Human (!Kung 5) mitochondrial DNA sequences, 5' end. ACCESSION M28895 KEYWORDS mitochondrial DNA. SEGMENT 1 of 2 SOURCE Human mitochondrial hair root DNA. ORGANISM Mitochondrion Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae; Homo sapiens. REFERENCE 1 (bases 1 to 340) AUTHORS Vigilant,L., Pennington,R., Harpending,H., Kocher,T.D. and Wilson,A.C. TITLE Mitochondrial DNA sequences in single hairs from a southern African population JOURNAL Proc. Natl. Acad. Sci. U.S.A. 86, 9350-9354 (1989) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by L.Vigilant 06-OCT-1989. BASE COUNT 114 a 111 c 39 g 75 t 1 others ORIGIN 1 ttctttcatg gggaagcaga tttgggtacc acccaagtat tgactcaccc atcaacaacc 61 gctatgtatc tcgtacatta ctgccagcca ccatgaatat tgtacagtac cataaatact 121 tgaccacctg tagtacataa aaacccanat ccacatcaaa accctccccc catgcttaca 181 agcaagtaca gtaatcaacc ctcaactgtc atacatcaac cgcaactcca aagccacccc 241 tcagccacta ggataccaac aaacctaccc acccttaaca gtacatagca cataaagcca 301 tttaccgtac atagcacatt acagtcaaat cccttctcgt // LOCUS HUMMTVG2 349 bp ds-DNA ORG 01-AUG-1990 DEFINITION Human (!Kung 5) mitochondrial DNA sequences, 3' end. ACCESSION M28896 KEYWORDS mitochondrial DNA. SEGMENT 2 of 2 SOURCE Human mitochondrial hair root DNA. ORGANISM Mitochondrion Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae; Homo sapiens. REFERENCE 1 (bases 1 to 349) AUTHORS Vigilant,L., Pennington,R., Harpending,H., Kocher,T.D. and Wilson,A.C. TITLE Mitochondrial DNA sequences in single hairs from a southern African population JOURNAL Proc. Natl. Acad. Sci. U.S.A. 86, 9350-9354 (1989) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by L.Vigilant 06-OCT-1989. BASE COUNT 106 a 103 c 52 g 83 t 5 others ORIGIN 1 tattttcgtc tggggggtgt gcacgcgata gcattgcgag acgctggagc cggagcaccc 61 tatgtcgcag tatctgtctt tgattcctgc cccatcccat tatttatcgc acctacgttc 121 aatattacag gcgaacatac nctaccaaag tgtgttaatt aattaatgct tgtaggacat 181 aataataaca attaaatgtc tgcacagcca ctttccacac agacatcata acaaaaaatt 241 tnccaccaaa cccccccnnn tccccccgct tctggccaca gcacttaaac acatctctgc 301 caaaccccaa aaacaaagaa ccctaacacc agcctaacca gatttcaaa // LOCUS HUMMTVH1 348 bp ds-DNA ORG 01-AUG-1990 DEFINITION Human (!Kung 8) mitochondrial DNA sequences, 5' end. ACCESSION M28901 KEYWORDS mitochondrial DNA. SEGMENT 1 of 2 SOURCE Human mitochondrial hair root DNA. ORGANISM Mitochondrion Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae; Homo sapiens. REFERENCE 1 (bases 1 to 348) AUTHORS Vigilant,L., Pennington,R., Harpending,H., Kocher,T.D. and Wilson,A.C. TITLE Mitochondrial DNA sequences in single hairs from a southern African population JOURNAL Proc. Natl. Acad. Sci. U.S.A. 86, 9350-9354 (1989) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by L.Vigilant 06-OCT-1989. BASE COUNT 115 a 115 c 38 g 79 t 1 others ORIGIN 1 ttctttcatg gggaagcaga tttgggtacc acccaagtat tgactcaccc atcaacaacc 61 gctatgtatt tcgtacatta ctgccagcca ccatgaatat tgtacagtac cataaatact 121 tgaccacctg tagtacataa aaacccanat ccacatcaaa accctccccc catgcttaca 181 agcaagtaca gcaatcaacc ttcaactgtc acacattaac tgcaactcca aagccacccc 241 tcacccacta ggataccaac aaacctaccc atccttaaca gtacatagca cataaagcca 301 tttaccgtac atagcacatt acagtcaaat cccttctcgt ccccatcc // LOCUS HUMMTVH2 355 bp ds-DNA ORG 01-AUG-1990 DEFINITION Human (!Kung 8) mitochondrial DNA sequences, 3' end. ACCESSION M28902 KEYWORDS mitochondrial DNA. SEGMENT 2 of 2 SOURCE Human mitochondrial hair root DNA. ORGANISM Mitochondrion Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae; Homo sapiens. REFERENCE 1 (bases 1 to 355) AUTHORS Vigilant,L., Pennington,R., Harpending,H., Kocher,T.D. and Wilson,A.C. TITLE Mitochondrial DNA sequences in single hairs from a southern African population JOURNAL Proc. Natl. Acad. Sci. U.S.A. 86, 9350-9354 (1989) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by L.Vigilant 06-OCT-1989. BASE COUNT 107 a 101 c 52 g 90 t 5 others ORIGIN 1 tattttcgtc tggggggtgt gcacgcgata gcattgcgag acgctggagc cggagcaccc 61 tatgtcgcag tatctgtctt tgattcctgc cccatcctat tatttatcgc acctacgttc 121 aatattacag gcgaacatac nctactaaag tgtgttaatt aattaatgct tgtaggacat 181 aataataaca attaaatgtc tgcacagcca ctttccacac agacatcata acaaaaaatt 241 tnccaccaaa cccccccnnn tccccccgct tctggccaca gcacttaaac acatctctgc 301 caaaccccaa aaacaaagaa ccctaacacc agcctaacca gatttcaaat tttat // LOCUS HUMMTVI1 367 bp ds-DNA ORG 01-AUG-1990 DEFINITION Human (!Kung 6) mitochondrial DNA sequences, 5' end. ACCESSION M28897 KEYWORDS mitochondrial DNA. SEGMENT 1 of 2 SOURCE Human mitochondrial hair root DNA. ORGANISM Mitochondrion Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae; Homo sapiens. REFERENCE 1 (bases 1 to 367) AUTHORS Vigilant,L., Pennington,R., Harpending,H., Kocher,T.D. and Wilson,A.C. TITLE Mitochondrial DNA sequences in single hairs from a southern African population JOURNAL Proc. Natl. Acad. Sci. U.S.A. 86, 9350-9354 (1989) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by L.Vigilant 06-OCT-1989. BASE COUNT 121 a 121 c 43 g 81 t 1 others ORIGIN 1 ttctttcatg gggaagcaga tttgggtacc acccaagtat tgactcaccc atcaacaacc 61 gctatgtatt tcgtacatta ctgccagcca ccatgaatat tgtacagtac cataaatact 121 tgaccaccta tagtacataa aaacccanat ccacatcaaa accctccccc catgcttaca 181 agcaagtaca gtaatcaacc ttcaactgtc acacatcaac cgcaactcca aagccacccc 241 tcacccacta ggataccaac aaacctaccc atccttaaca gtacatagca cataaagcca 301 tttaccgtac atagcacatt acagtcaaat cccttctcgt ccccatggat gacccccctc 361 agatagg // LOCUS HUMMTVI2 358 bp ds-DNA ORG 01-AUG-1990 DEFINITION Human (!Kung 6) mitochondrial DNA sequences, 3' end. ACCESSION M28898 KEYWORDS mitochondrial DNA. SEGMENT 2 of 2 SOURCE Human mitochondrial hair root DNA. ORGANISM Mitochondrion Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae; Homo sapiens. REFERENCE 1 (bases 1 to 358) AUTHORS Vigilant,L., Pennington,R., Harpending,H., Kocher,T.D. and Wilson,A.C. TITLE Mitochondrial DNA sequences in single hairs from a southern African population JOURNAL Proc. Natl. Acad. Sci. U.S.A. 86, 9350-9354 (1989) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by L.Vigilant 06-OCT-1989. BASE COUNT 106 a 102 c 54 g 91 t 5 others ORIGIN 1 ttggtatttt cgtctggggg gtgtgcacgc gatagcattg cgagacgctg gagccggagc 61 accctatgtc gcagtatctg tctttgattc ctgccccatc ccattattta tcgcacctac 121 gttcaatatt acaggcgaac atacnctact aaagtgtgtt aattaattaa tgcttgtagg 181 acataataat aacaattaaa tgtctgcaca gccactttcc acacagacat cataacaaaa 241 aatttnccac caaacccccc cnnntccccc cgcttctggc cacagcactt aaacacatct 301 ctgccaaacc ccaaaaacaa agaaccctaa caccagccta accagatttc aaattttt // LOCUS HUMMTVJ1 365 bp ds-DNA ORG 01-AUG-1990 DEFINITION Human (!Kung 9,10) mitochondrial DNA sequences, 5' end. ACCESSION M28903 KEYWORDS mitochondrial DNA. SEGMENT 1 of 2 SOURCE Human mitochondrial hair root DNA. ORGANISM Mitochondrion Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae; Homo sapiens. REFERENCE 1 (bases 1 to 365) AUTHORS Vigilant,L., Pennington,R., Harpending,H., Kocher,T.D. and Wilson,A.C. TITLE Mitochondrial DNA sequences in single hairs from a southern African population JOURNAL Proc. Natl. Acad. Sci. U.S.A. 86, 9350-9354 (1989) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by L.Vigilant 06-OCT-1989. BASE COUNT 120 a 122 c 43 g 79 t 1 others ORIGIN 1 ttctttcatg gggaagcaga tttgggtacc acccaagtat tgactcaccc atcaacaacc 61 gctatgtatt tcgtacatta ctgccagcca ccatgaatat tgtacagtac cataaatact 121 tgaccacctg tagtacataa aaacccanat ccacatcaaa accctccccc catgcttaca 181 agcaagtaca gcaatcaacc ttcaactgtc acaatcaacc gcaactccaa agccacccct 241 cacccactag gataccaaca aacctaccca cccttaacag tacatagcac ataaagccat 301 ttaccgtaca tagcacatta cagtcaaatc ccttctcgtc cccatggatg acccccctca 361 gatag // LOCUS HUMMTVJ2 355 bp ds-DNA ORG 01-AUG-1990 DEFINITION Human (!Kung 9,10) mitochondrial DNA sequences, 3' end. ACCESSION M28904 KEYWORDS mitochondrial DNA. SEGMENT 2 of 2 SOURCE Human mitochondrial hair root DNA. ORGANISM Mitochondrion Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae; Homo sapiens. REFERENCE 1 (bases 1 to 355) AUTHORS Vigilant,L., Pennington,R., Harpending,H., Kocher,T.D. and Wilson,A.C. TITLE Mitochondrial DNA sequences in single hairs from a southern African population JOURNAL Proc. Natl. Acad. Sci. U.S.A. 86, 9350-9354 (1989) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by L.Vigilant 06-OCT-1989. BASE COUNT 107 a 102 c 52 g 90 t 4 others ORIGIN 1 tattttcgtc tggggggtgt gcacgcgata gcattgcgag acgctggagc cggagcaccc 61 tatgtcgcag tatctgtctt tgattcctgc cccatcccat tatttatcgc acctacgttc 121 aatattacag gcgaacatac nctattaaag tgtgttaatt aattaatgct tgtaggacat 181 aataataaca attaaatgtc tgcacagcca ctttccacac agacatcata acaaaaaatt 241 tnccaccaaa ccccccccnn tccccccgct tctggccaca gcacttaaac acatctctgc 301 caaaccccaa aaacaaagaa ccctaacacc agcctaacca gatttcaaat tttat // LOCUS HUMLD78A 3176 bp ds-DNA PRI 01-AUG-1990 DEFINITION Human cytokine LD78 alpha gene, complete cds. ACCESSION D90144 KEYWORDS LD78; LD78 alpha; cytokine; inducible gene family; secreted peptide. SOURCE Human blood lymphocyte DNA , clone Lm LD-3. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 3176) AUTHORS Nakao,M., Nomiyama,H. and Shimada,K. TITLE Structures of human genes coding for cytokine LD78 and their expression JOURNAL Mol. Cell. Biol. 10, 3646-3658 (1990) STANDARD full staff_entry COMMENT These data kindly submitted in computer readable form by: Hisayuki Nomiyama Department of Biochemistry Kumamoto University Medical School 2-2-1 Honjo, Kumamoto 860 Japan Phone: 096-344-2111 Fax: 096-372-6140 FEATURES from to/span description pept 1155 1227 cytokine LD78 alpha precursor, exon 1 1916 2030 cytokine LD78 alpha precursor, exon 2 2451 2541 cytokine LD78 alpha precursor, exon 3 sigp 1155 1219 cytokine LD78 alpha signal peptide matp 1220 1227 cytokine LD78 alpha mature peptide 1916 2030 cytokine LD78 alpha mature peptide 2451 2538 cytokine LD78 alpha mature peptide pre-msg 1069 2957 cytokine LD78 alpha mRNA and introns IVS 1228 1915 cytokine LD78 alpha intron A IVS 2031 2450 cytokine LD78 alpha intron B signal 1041 1045 TATA box BASE COUNT 833 a 741 c 752 g 850 t ORIGIN 1 acccagggac ctatcacaca aatataagaa ctattcattc tttaaggcat gtatttccaa 61 gcctttgtat ttttttccat gcttagggtt ggcaaggaat atatatatat ttgtacaaat 121 atatatgtgt atatgtacaa atacatgtat atatagtaca aatatatata tatatttgta 181 caattcttca gactttgtag aatttgtata atgtcgtatc ttgctttttt taaccactga 241 tgttataagc atatttatgc cacttcattc attttagaga cttaataata aatgatctag 301 tggataattt atcattccct gatggagaaa aatttagctt tgtttatttt agagttataa 361 acgatgctgg gtcaggtatc tttatgtttg aagatggctc catatttggg ttgtttccac 421 agaactcttt cctagaaatg ctttttctag gttaatggct acagatattt ctaggcacct 481 gacatattga cacccacctc taaagtattt ttatgatcca caactagcgt ttaacacagc 541 gccctagtca ctacatgact aataaataga caaatgactg aaacatgacc tcatgctttc 601 tattcctcca gctttcattc agttctttgc ctctgggagg aggaagggtt gtgcagccct 661 ccacagcatc agcccatcaa ccctatccct gtggttatag cagctgagga agcagaattg 721 cagctctgtg ggaaggaatg gggctggaga gttcatgcac agaccagttc ttatgagaag 781 ggactgacta agaatagcct tgggttgaca tatacccctc ttcacactca caggagaaac 841 catttcccta tgaaactata acaagtcatg agttgagagc tgagagttag agaatagctc 901 aaagatgcta ttcttggata tcctgagccc ctgtggtcac cagggaccct gagttgtgca 961 acttagcatg acagcatcac tacgcttaaa aatttccctc ctcaccccca gattccattt 1021 ccccatccgc cagggctgcc tataaagagg agagctggtt tcagacttca gaaggacacg 1081 ggcagcagac agtggtcagt cctttcttgg ctctgctgac actcgagccc acattccgtc 1141 acctgctcag aatcatgcag gtctccactg ctgcccttgc tgtcctcctc tgcaccatgg 1201 ctctctgcaa ccagttctct gcatcacgtg agtctgagtt tcgttgtggg tatcaccact 1261 ctctggccat ggttagacca catcaatctt ttcttgtggc ctaaaagccc ccaagagaaa 1321 agagaacttc ttaaagggct gccaaacatc ttggtctttc tctttaagac ttttattttt 1381 atctctagaa ggggtcttag ccccctagtc tccaggtatg agaatctagg caggggcagg 1441 ggagttacag tcccttttac agatagaaaa acagggttcg aaacgaatca gttagcaaga 1501 ggcagaatcc agggctgctt acttcccagt ggggtatgtt gttcactctc cagctcactc 1561 taggtctccc aggagctctg tcccttggat gtcttatgag agatgtccaa ggcttctctt 1621 gggttggggt atgacttctt gaaccagaca aaattccctg aagagaactg agataagaga 1681 acagtccgtt caggtatctg gatcacacag agaaacagag aacccactat gaagagtcaa 1741 ggagaaagaa ggatacagac agaaacaaag agacatttct cagcaaaaat gcccaaatgc 1801 cttccagtca cttggtctga gcaagcctgc cttcctcaac tgctcgggga tcagaagctg 1861 cctggccttt tcttctgagc tgtgactcgg gctcattctc ttcctttctc cacagttgct 1921 gctgacacgc cgaccgcctg ctgcttcagc tacacctccc ggcagattcc acagaatttc 1981 atagctgact actttgagac gagcagccag tgctccaagc ccggtgtcat gtaagtgcca 2041 gtcttcctgc tcacctctat ggaggtaggg agggtcaggg ttggggcaga gacaggccag 2101 aaggctatcc tggaaaggcc cagccttcag gagcctatcg gggatacagg acgcagggct 2161 ccgaggtgtg acctgacttg gagctggagt gaggcatgtg ttacagagtc aggaagggct 2221 gccccagccc agaggaaagg gacaggaaga aggaggcagc gggacactct gagggccacc 2281 cctactgagt cactgagaga agctctctag acagagatag gcagggggcc cctgaaagag 2341 gagcaagccc tgagctgccc aggacagaga gcagaatggt ggggccatgg tgggcccagg 2401 attcccctgc tggattcccc agtgcttaac tcttcctccc ttctccacag cttcctaacc 2461 aagcgaagcc ggcaggtctg tgctgacccc agtgaggagt gggtccagaa atatgtcagc 2521 gacctggagc tgagtgcctg aggggtccag aagcttcgag gcccagcgac ctcggtgggc 2581 ccagtgggga ggagcaggag cctgagcctt gggaacatgc gtgtgacctc cacagctacc 2641 tcttctatgg actggttgtt gccaaacagc cacactgtgg gactcttctt aacttaaatt 2701 ttaatttatt tatactattt agtttttgta atttattttc gatttcacag tgtgtttgtg 2761 attgtttgct ctgagagttc ccctgtcccc tcccccttcc ctcacaccgc gtctggtgac 2821 aaccgagtgg ctgtcatcag cctgtgtagg cagtcatggc accaaagcca ccagactgac 2881 aaatgtgtat cggatgcttt tgttcagggc tgtgatcggc ctggggaaat aataaagatg 2941 ctcttttaaa aggtaaacca gtattgagtt tggttttgtt tttctggcaa atcaaaatca 3001 ctggttaaga ggaatcatag gcaaagatta ggaagaggtg aaatggaggg aaattgggag 3061 agatggggag ggctaccaca gagttatcca ctttacaacg gagacacagt tctggaacat 3121 tgaaactacg aatatgttat aactcaaatc ataacatgca tgctctagga gaattc // LOCUS HUMLD78B 3112 bp ds-DNA PRI 01-AUG-1990 DEFINITION Human cytokine LD78 beta gene. ACCESSION D90145 KEYWORDS LD78; LD78 beta; cytokine; inducible gene family; secreted peptide. SOURCE Human placenta DNA, clone Lm LD-1. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 3112) AUTHORS Nakao,M., Nomiyama,H. and Shimada,K. TITLE Structures of human genes coding for cytokine LD78 and their expression JOURNAL Mol. Cell. Biol. 10, 3646-3658 (1990) STANDARD full staff_entry COMMENT These data kindly submitted in computer readable form by: Hisayuki Nomiyama Department of Biochemistry Kumamoto University Medical School 2-2-1 Honjo, Kumamoto 860 Japan Phone: 096-344-2111 Fax: 096-372-6140 FEATURES from to/span description pept 1192 1267 cytokine LD78 beta precursor, exon 1 1953 2067 cytokine LD78 beta precursor, exon 2 2488 2578 cytokine LD78 beta precursor, exon 3 sigp 1192 1259 cytokine LD78 beta signal peptide matp 1260 1267 cytokine LD78 beta mature peptide 1953 2067 cytokine LD78 beta mature peptide 2488 2575 cytokine LD78 beta mature peptide pre-msg 1106 2995 cytokine LD78 beta mRNA and introns IVS 1268 1952 cytokine LD78 intron A IVS 2068 2487 cytokine LD78 intron B rpt 498 797 Alu repeat signal 1078 1082 TATA box BASE COUNT 756 a 775 c 780 g 801 t ORIGIN 1 ttagagactt aataataaag gatcttgtgg ataatttatc attccctgat agagaaaaat 61 ttagctttgc ttattttaga gttataaatg atgctgggtc aggtatcttt atgtttgaag 121 atggctccat atttgggttg tttccacaga actctttccc agaaatgctt tttctaggtt 181 aatggctaca catatttcta ggcacctgac atactgacac ccacctctaa agtattttta 241 tgatccacaa ctagcgttta acacagcgcc ccagtcactc cgagactaat aaatagacaa 301 atgactgaaa cgtgacctca tgctttctat tcctccagct ttcattgagt tcctttcctc 361 tgggaggact gggggttgtc tagccctcca cagcatcagc ccattgaccc tatccttgtg 421 gttatagcag ctgaggaagc agaattacag ctctgtggga aggaatgggg ctggagagtt 481 catgcataga ccaattcttt tttttttttt tttttgagat ggagtttcac ttttgttgcc 541 caggctggag tgcaatggca tgatctcagc tcaccacagc ccccacctcc tgggttcaag 601 cgattctcct gccctcagcc tcccgagtag ctgggattac aggcatgtgc caccacgcct 661 gactactttt gtatttttag tagagatgga gtttctcttt cttggtcagg ttggtctcaa 721 actcctgacc tcaggtgatc cgcagcctcg gcctcccaaa gtgttgggat tacaggtgtg 781 agcgaccatg cctggctgca tagaccagtt cttatgagaa gggatcaact aagaatagcc 841 ttgggttgac acacacccct cttcacactc acaggagaaa ccccatgaag ctagaaccag 901 tcatgagttg agagctgaga gttagagagt agctcagaga tgctattctt ggatatcctg 961 agcccctgtg gtcaccaggg accctgagtt gtgcaacact cagcatgaca gcatcactac 1021 acttaaaaat ttccctcctc acccccagat tccatttccc catccgccag ggctgcctat 1081 aaagaggaga gatggcttca gacatcagaa ggacgcaggc agcaaagagt agtcagtccc 1141 ttcttggctc tgctgacact cgagcccaca ttccatcacc tgctcccaat catgcaggtc 1201 tccactgctg cccttgccgt cctcctctgc accatggctc tctgcaacca ggtcctctct 1261 gcaccacgtg agtccatgtt gttgttgtgg gtatcaccac tctctggcca tggttagacc 1321 acatcagtct ttttttgcgg cctgagagcc ccgaagagaa aagaaggaag ttcttaaagc 1381 gctgccaaac accttggtct ttttcttcac aacttttatt tttatctcta gaaggggtct 1441 tagccctcct agtctccagg tatgagaatc taggcagggg caggggagtt acagtccctt 1501 gtacagatag aaaaacaggg ttcaaaacga atcagtttgc aagaggcaga atccagggct 1561 gcttacttcc cagtggggtc tgttgttcac tctccagctc accctaggtc tcccaggagc 1621 cctgtccctt ggatgtctta tgagagatgt ccagggcttc tcttgggctg gggtatgact 1681 tcttgaaccg acaaaattcc atgaagagag ctaagagaac agtccattca ggtatctgga 1741 tcacatagag aaacagagaa cccactatga agagtcaagg ggaaagagga atatagacag 1801 aaacaaagag acatttctct gcaaaacccc ccaaatgcct tgcagtcact tggtctgagc 1861 aagcctgccc tcctcaacca ctcagggatc agaagctgcc tggccttttc ttctgagctg 1921 tgactcgggc ttattctctc ctttctccgc agttgctgct gacacgccga ccgcctgctg 1981 cttcagctac acctcccgac agattccaca gaatttcata gctgactact ttgagacgag 2041 cagccagtgc tccaagccca gtgtcatgta agtgccagtc ttcctgctca cctctaggga 2101 ggtagggagt gtcagggtgg gggcagaaac aggccagaag gccatcctgg aaaggcccag 2161 ccttcaggag cctatcgggg atacaggacg cagggcactg aggtgtgacc tgacttgggg 2221 ctggagtgag gtgggtgtta cagagtcagg aagggctgcc ccaggccaga ggaaaggaac 2281 aggaagaagg aggcagcagg acactctgag ggcccccttg cctggagtca ctgagagaag 2341 ctctctagac ggagataggc agggggcccc tgagagagga gcaggccttg agctgcccag 2401 gacagagagc aggatgtcag gccatggtgg gcccaggatt ccccggctgg attccccagt 2461 gcttaactct tcctcccttc tccacagctt cctaaccaag agaggccggc aggtctgtgc 2521 tgaccccagt gaggagtggg tccagaaata cgtcagtgac ctggagctga gtgcctgagg 2581 ggtccagaag cttcgaggcc cagcgacctc agtgggccca gtggggagga gcaggagcct 2641 gagccttggg aacatgcgtg tgacctctac agctacctct tctatggact ggttattgcc 2701 aaacagccac actgtgggac tcttcttaac ttaaatttta atttatttat actatttagt 2761 ttttataatt tatttttgat ttcacagtgt gtttgtgatt gtttgctctg agagttcccc 2821 ctgtcccctc caccttccct cacagtgtgt ctggtgacga ccgagtggct gtcatcggcc 2881 tgtgtaggca gtcatggcac caaagccacc agactgacaa atgtgtatca gatgcttttg 2941 ttcagggctg tgatcggcct ggggaaataa taaagatgtt cttttaaacg gtaaaccagt 3001 attgagtttg gttttgtttt tctggcaaat caaaatcact agttaagagg aatcataggc 3061 aaagattagg aagaggtgaa atggagggaa actgggagag atggggagcg ct // LOCUS XELTRH 1442 bp ss-mRNA VRT 01-AUG-1990 DEFINITION X.laevis thyrotropin releasing hormone (TRH) mRNA, complete cds. ACCESSION M34699 K00931 J05514 KEYWORDS thyrotropin releasing hormone. SOURCE X.laevis skin, cDNA to mRNA, clone L4 and 8/136. ORGANISM Xenopus laevis Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Amphibia; Lissamphibia; Anura; Archeobatrachia; Pipoidea; Pipidae. REFERENCE 1 (bases 1 to 478) AUTHORS Richter,K., Kawashima,E., Egger,R. and Kreil,G. TITLE Biosynthesis of thyrotropin releasing hormone in the skin of Xenopus laevis: Partial sequence of the precursor deduced from cloned cDNA JOURNAL EMBO J. 3, 617-621 (1984) STANDARD full staff_review REFERENCE 2 (bases 15 to 1442) AUTHORS Kuchler,K., Richter,K., Trnovsky,J., Egger,R. and Kreil,G. TITLE Two precursors of thyrotropin releasing hormone from skin of Xenopus laevis: Each contains seven copies of the end product JOURNAL J. Biol. Chem. 265, 11731-11733 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [2] kindly submitted by G.Kreil, 18-MAY-1990. FEATURES from to/span description pept 110 793 thyrotropin releasing hormone precursor matp 332 340 thyrotropin releasing hormone copy 1 matp 374 382 thyrotropin releasing hormone copy 2 matp 428 436 thyrotropin releasing hormone copy 3 matp 470 478 thyrotropin releasing hormone copy 4 matp 566 574 thyrotropin releasing hormone copy 5 matp 611 619 thyrotropin releasing hormone copy 6 matp 686 694 thyrotropin releasing hormone copy 7 mRNA < 1 1442 TRH mRNA conflict 139 139 t in [2]; c in [1] conflict 214 216 tct in [2]; ctc in [1] conflict 319 319 g in [2]; t in [1] BASE COUNT 460 a 286 c 334 g 362 t ORIGIN 1 agcacagagc agcacaagga cacactctgc atattgtgct gccggacaag gaggtgacag 61 ccagtcaggc tgagacaaag gaacttccag acctctgaca gcaggaaaga tggtgtctgt 121 ctggtggttg ctgcttcttg gtacaaccgt atctcacatg gtgcacacac aagagcagcc 181 tttactggag gaggacacag caccattaga tgatctggat gttcttgaga aagccaaagg 241 tatcctgatc cgcagtatcc tggagggatt tcaagaaggg caacaaaaca atagagatct 301 accagatgca atggaaatga tatctaagcg ccagcaccca gggaaacgat tccaggagga 361 gatagaaaag agacaacacc ctggaaagag ggatctggaa gatctgaatc tagagctttc 421 caaaaggcaa caccccggaa gaagatttgt ggatgatgta gagaagaggc aacatccagg 481 aaagagagaa gagggtgact ggagtaggag gtatctgaca gatgactcac gttatttgga 541 cctcctttct gatgtttcca ggagacagca cccaggcaaa agagttccag ccccattgtt 601 tacaaaacgt caacacccag gtaagagagt gacagaagaa gagggtgata ctgaatttga 661 aaactcgaag gaagtgggga agcgccagca tccaggaaag agatatgacc cttgtgaagg 721 ccctaatgcc tacaactgta actcaggaaa cattctaccg gattctgtag aagaattgag 781 ttttgggctt taagctgccc agccccttta ttagttccat ctgaccctaa atgattccca 841 atgaacacaa ctttctataa ttgttaaata acattgtatt aagtatcata catttctgga 901 aagcaagcag ctcttagaac acttcttcgc tttaaaaggc acctggggca taagagtatt 961 aagcttcaga cagtaacctg cccaccacag ggagggattc aacaatcaca attggctgag 1021 tgttcctttc ccttgtttgg cagtgagatc agataataaa tataagatgg ccaggaaagt 1081 ggactctttc ttttctgaaa atttgcaagt aacaccaaaa tataataatt tgcacactca 1141 gtagtattaa cgtgaagatc tcaagaaggt tataaattct tggtgatctg ctcaaagcat 1201 ttaattcata gttgcttcca tggtttgatg gggaatgcac attctaaatt gcttattgct 1261 aattagcgct tgccacacag ttctggtggt agatcttgat gaggcatatt caataaaagt 1321 agagcccata gtaaaatttg tgccccgtca gctttaagga tcctctgtaa gcaatatgtg 1381 ttgtgagggc cacttgtttc taaagtaata ttttcatttt aataaatatg tctactcaaa 1441 tg // LOCUS XELTRHA 2955 bp ss-mRNA VRT 01-AUG-1990 DEFINITION X.laevis thyrotropin releasing hormone mRNA, complete cds. ACCESSION M34698 J05514 KEYWORDS thyrotropin releasing hormone. SOURCE X.laevis, cDNA to mRNA, clone C6. ORGANISM Xenopus laevis Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Amphibia; Lissamphibia; Anura; Archeobatrachia; Pipoidea; Pipidae. REFERENCE 1 (sites) AUTHORS Kuchler,K., Richter,K., Trnovsky,J., Egger,R. and Kreil,G. TITLE Two precursors of thyrotropin releasing hormone from skin of Xenopus laevis: Each contains seven copies of the end product JOURNAL J. Biol. Chem. 265, 11731-11733 (1990) STANDARD full staff_review REFERENCE 2 (bases 1 to 2955; for [1]) AUTHORS Kuchler,K., Richter,K., Trnovsky,J., Egger,R. and Kreil,G. JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1],[2] kindly submitted by G.Kreil, 18-MAY-1990. FEATURES from to/span description pept 157 831 thyrotropin releasing hormone matp 379 387 thyrotropin releasing hormone copy 1 matp 421 429 thyrotropin releasing hormone copy 2 matp 475 483 thyrotropin releasing hormone copy 3 matp 517 525 thyrotropin releasing hormone copy 4 matp 613 621 thyrotropin releasing hormone copy 5 matp 658 666 thyrotropin releasing hormone copy 6 matp 733 741 thyrotropin releasing hormone copy 7 BASE COUNT 927 a 597 c 604 g 827 t ORIGIN 1 catgcagttt attagatata cagtacaatg aagtcagtta tgagaaatag caattgcagc 61 acaaggacac actctgcata ttgtgctgcc ggacaaggag gtgacagcca gtcaggctga 121 gacaaaggaa cttccagacc tctgacagca ggaaagatgg tgtctgtctg gtggttgctg 181 cttcttggta caaccgtatc tcacatggtg cacacacaag agcagccttt actggaggag 241 gacacagcac cattagatga tctggatgtt cttgagaaag ccaaaggtat cctgatccgc 301 agtatcctgg agggatttca agaagggcaa caaaacaata gagatctacc agatgcaatg 361 gaaatgatat ctaagcgcca gcacccaggg aaacgattcc aggaggagat agaaaagaga 421 caacaccctg gaaagaggga tctggaagat ctgaatctag agctttccaa aaggcaacac 481 cccggaagaa gatttgtgga tgatgtagag aagaggcaac atccaggaaa gagagaagag 541 ggtgactgga gtaggaggta tctgacagat gactcacgtt atttggacct cctttctgat 601 gtttccagga gacagcaccc aggcaaaaga gttccagccc cattgtttac aaaacgtcaa 661 cacccaggta agagagtgac agaagaagag ggtgatactg aatttgaaaa ctcgaaggaa 721 gtggggaagc gccagcatcc aggaaagaga tatgaccctt gtgaaggccc taatgcctac 781 aactgtaact caggaaacat tctaccggaa gaattgagtt ttgggcttta agctgcccag 841 cccctttatt agttccatct gaccctaaat gattcccaat gaacacaact ttctataatt 901 gttaaataac attgtattaa gtatcataca tttctggaaa gcaagcagct cttagaacac 961 ttcttcgctt taaaaggcac ctggggcata agagtattaa gcttcagaca gtaacctgcc 1021 caccacaggg agggattcaa caatcacaat tggctgagtg ttcctttccc ttgtttggca 1081 gtgagatcag ataaataaat ataagatggc caggaaagtg gactctttct tttctgaaaa 1141 tttgcaagta acaccaaaat ataataattt tgcactctgc agtgtattaa cgtgaagatc 1201 tcaagaaggt tataaattag gttataaatt cttggtgatc tgctcaaagc atttaattca 1261 tagttgcttc catggtttga tggggaatgc acattctaaa ttgcttattg ctaattagcg 1321 cttgccacac agttctggtg gtagatcttg atgaggcata ttcaataaaa gtagagccca 1381 tagtaaaatt tgtgccccgt cagctttaag gatcctctgt aagcaatatg tgttgtgagg 1441 gccacttgtt tctaaagtaa tattttcatt ttaataaata tgtctactca aatgacaaaa 1501 acattcatta tttcactaca ttatactcct tcccacagca attatgtacc tatgaatcct 1561 gatagaagac tgcagttttc ctcttatatc ctccatgttg gattcaccat aagtcaccaa 1621 aatatatcta tagggaagca cactatacac aatagcagtg acccccatcc agtggcttgt 1681 gggcaacaag ctactcacca acccccttgg ctgttgctcc cagtggccct aaagtaaggt 1741 gcataaaaaa accagatgaa cttgtcaaaa agagcctccc ttagactgcc ttgttccaca 1801 tagaggctac catatagcca atcacagccc ttatttggca cccccgggaa cttttttcat 1861 gcttgagttg ctccccaaat ctttttacag ttgaatatgt ctcatggcta aaaaaacgtg 1921 aggaccccgg cgtaatatag tataatatac acacactcac tttggaaaac tctatggaga 1981 tcaataagca cttttgggtt aaactatttt tttgatacaa tttgagcact ttatatatgg 2041 attttaaaga tattccgctt tagtagtctg tggtgcgctg ccccataaat atattggtga 2101 attattcacc acctactctt aacaattctg ctcaattcat ctagatgtta acataataca 2161 tcaccagtat cacaatggca gcgggaagca aagacattct gtagtgtcct gagaccagct 2221 aaagcctaga ggtggaccat aaataatgtc tattgcaggg tcagtacaaa caaaaacacc 2281 aaggctgctt tatacaaggc atatctaatt tgcaggtatt ttgctgaact attactccac 2341 acacaaagct tgagggacac agactaataa tctgctgaag gtttgcagga tggacagttg 2401 gacactgctt tgcttcaact ttattctagg cttgtgctct gatgtatgca gcgtcaaata 2461 ccagctgttg tttgactaca actcccagaa gcctcagcat actgagggtg gtatgcttga 2521 atgcttgaat gcttgaatac cgaaggctgt ctgtcctcca acacctcccg ttgatctccc 2581 gctccagctc ttattgtcat tccattgtat attttgtttt taaatgtata aagaaataaa 2641 aaaaaagtat gatatattca cccttcttct tctgagtata aaaagattta aatgaatgtg 2701 aaaataatat ttttatagac aacaatcttt gtgcagtgtt ggtaaataca tgtttattct 2761 gtatatagct attttaatat gcatactgaa agaatatata tatataataa gaagcatgaa 2821 catctcattg cctgggtatg aaacaataaa gattgcatct gataatgaag caaattcgct 2881 ctgtggcgca gtattatgtt gacctgatga tgaagttagg tctggtgcgc ttctcaatgt 2941 tcgtggcgct ggccc // LOCUS PVICSD 1107 bp ds-DNA INV 01-AUG-1990 DEFINITION P.vivax circumsporozoite protein gene, complete cds. ACCESSION M34697 KEYWORDS circumsporozoite protein. SOURCE P.vivax (strain Thai; isolate NYU Thai) sporozoite DNA. ORGANISM Plasmodium vivax Eukaryota; Animalia; Protozoa; Microspora; Microsporea; Microsporida; Haemosporina; Plasmodiidae. REFERENCE 1 (bases 286 to 798) AUTHORS Arnot,D.E., Stewart,M.J. and Barnwell,J.W. TITLE Antigenic diversity in Thai Plasmodium vivax circumsporozoite proteins JOURNAL Unpublished (1990) STANDARD full staff_review REFERENCE 2 (bases 1 to 285; 799 to 1107) AUTHORS Arnot,D.E., Stewart,M.J. and Barnwell,J.W. JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by D.E.Arnot, 18-MAY-1990. The bases in reference [2] are identical to bases 140 to 442 and 995 to 1294 of the sequence of the North Korean strain published in Proc. Natl. Acad. Sci. U.S.A. 85, 8102-8106; accession number M20671. Author address: D.E.Arnot Dept. of Genetics University of Edinburgh West Mains Rd., Edingburgh, EM93JM Scotland FEATURES from to/span description pept 1 1107 circumsporozoite protein BASE COUNT 382 a 212 c 338 g 175 t ORIGIN 1 atgaagaact tcattctctt ggctgtttct tccatcctgt tggtggactt gttccccacg 61 cactgcgggc acaatgtaga tctgtccaag gccataaatt taaatggagt aaacttcaat 121 aatgtagacg ccagttcact tggcgcggca cacgtaggac aaagtgctag ccgaggcaga 181 ggacttggtg agaacccaga tgacgaggaa ggagatgcta aaaaaaaaaa ggatggaaag 241 aaagcagaac caaaaaatcc acgtgaaaat aagctgaaac aaccaggaga cagagcagat 301 ggacagccag caggagacag agcagatgga cagccagcag gtgatagagc agatggacaa 361 ccagcaggtg atagagctgg acagccagca ggagatagag cagatggaca gccagcagga 421 gacagagcag atggacagcc agcaggagac agagcagatg gacagccagc aggagacaga 481 gcagatggac agccagcagg tgacagagct ggacaaccag caggtgatag agctggacag 541 ccagcaggcg atagagcaga tggacagcca gcaggagata gagctggaca gccagcaggc 601 gatagagcag atggacagcc agcaggagat agagctggac aaccagcagg agatagagca 661 gatggacaac cagcaggaga tagagctgga cagccagcag gagatagagc tggacagcca 721 gcaggagata gagctggaca gccagcagga gatagagctg gacagccagc aggaaatggt 781 gcaggtggac aggcagcagg aggaaacgca ggaggacagg gacaaaataa tgaaggtgcg 841 aatgccccaa atgaaaagtc tgtgaaagaa tacctagata aagttagagc taccgttggc 901 accgaatgga ctccatgcag tgtaacctgt ggagtgggtg taagagtcag aagaagagtt 961 aatgcagcta acaaaaaacc agaggatctt actttgaatg accttgagac tgatgtttgt 1021 acaatggata agtgtgctgg catatttaac gttgtgagta attcattagg gctagtcata 1081 ttgttagtcc tagcattatt caattaa // LOCUS ATTRRA 119 bp ss-RNA RNA 01-AUG-1990 DEFINITION A.solani 5S rRNA. ACCESSION M35573 KEYWORDS 5S ribosomal RNA. SOURCE A.solani (strain CBS 277-32) 5S rRNA. ORGANISM Atractiella solani Eukaryota; Plantae; Thallobionta; Basidiomycotina; Phragmobasidiomycetes; Heterobasidiomycetidae; Auriculariales; Auriculariaceae. REFERENCE 1 (bases 1 to 119) AUTHORS Blanz,P.A. and Gottschalk,M. TITLE Systematic position of Septobasidium, Graphiola and other basidiomycetes as deduced on the basis of their 5S ribosomal RNA nucleotide sequences JOURNAL Syst. Appl. Microbiol. 8, 121-127 (1986) STANDARD simple staff_entry FEATURES from to/span description rRNA < 1 > 119 5S rRNA BASE COUNT 29 a 30 c 34 g 26 t ORIGIN 1 aggtgcgacc ataccgtgtt gaaaattctg catcccgtcc gatctgcaaa gacaagcaac 61 acagggccca gtcagtagtg cggtgggtga ccacgtgcga atactgtggt gttgcactt // LOCUS CETRRA 118 bp ss-RNA RNA 01-AUG-1990 DEFINITION C.cornigerum 5S rRNA. ACCESSION M35577 KEYWORDS 5S ribosomal RNA. SOURCE C.cornigerum (strain FO 29225) 5S rRNA. ORGANISM Ceratobasidium cornigerum Eukaryota; Plantae; Thallobionta; Basidiomycotina; Phragmobasidiomycetes; Metabasidiomycetidae; Metatremellales; Ceratobasidiaceae. REFERENCE 1 (bases 1 to 118) AUTHORS Blanz,P.A. and Gottschalk,M. TITLE Systematic position of Septobasidium, Graphiola and other basidiomycetes as deduced on the basis of their 5S ribosomal RNA nucleotide sequences JOURNAL Syst. Appl. Microbiol. 8, 121-127 (1986) STANDARD simple staff_entry FEATURES from to/span description rRNA < 1 > 118 5S rRNA BASE COUNT 23 a 35 c 37 g 23 t ORIGIN 1 atccacggcc ataggacttc gaaagcaccg catcccgtcc gatctgcgca gttaaccgga 61 gtgccgccta gttagtacca cggtggggga ccacgcggga atcctgggtg ctgtggtt // LOCUS GRARRA 118 bp ss-RNA RNA 01-AUG-1990 DEFINITION G.phoenicis 5S rRNA. ACCESSION M35575 KEYWORDS 5S ribosomal RNA. SOURCE G.phoenicis (strain PB 4349) 5S rRNA. ORGANISM Graphiola phoenicis Eukaryota; Plantae; Thallobionta; Basidiomycotina; Teliomycetes; Ustilaginales; Graphiolaceae. REFERENCE 1 (bases 1 to 118) AUTHORS Blanz,P.A. and Gottschalk,M. TITLE Systematic position of Septobasidium, Graphiola and other basidiomycetes as deduced on the basis of their 5S ribosomal RNA nucleotide sequences JOURNAL Syst. Appl. Microbiol. 8, 121-127 (1986) STANDARD simple staff_entry FEATURES from to/span description rRNA < 1 > 118 5S rRNA BASE COUNT 26 a 33 c 36 g 23 t ORIGIN 1 atctgcggcc atagaaccgt gaaaataccg catcccgtcc gatctgcgaa gtcaagcacg 61 gtatcgccta gtcagtactg cggtggggga ccacgcggga atcctgggtg ctgcagtt // LOCUS PLARRA 119 bp ss-RNA RNA 01-AUG-1990 DEFINITION P.peniophorae 5S rRNA. ACCESSION M35571 KEYWORDS 5S ribosomal RNA. SOURCE P.peniophorae (strain FO 22315) 5S rRNA. ORGANISM Platygloea peniophorae Eukaryota; Plantae; Thallobionta; Basidiomycotina; Phragmobasidiomycetes; Heterobasidiomycetidae; Auriculariales; Auriculariaceae. REFERENCE 1 (bases 1 to 119) AUTHORS Blanz,P.A. and Gottschalk,M. TITLE Systematic position of Septobasidium, Graphiola and other basidiomycetes as deduced on the basis of their 5S ribosomal RNA nucleotide sequences JOURNAL Syst. Appl. Microbiol. 8, 121-127 (1986) STANDARD simple staff_entry FEATURES from to/span description rRNA < 1 > 119 5S rRNA BASE COUNT 27 a 35 c 36 g 21 t ORIGIN 1 atctgcggcc ataccgtgat gaacattccg cgtcccgtcc gatccgcgca gacaagcatc 61 acaggggcca gagagtattg acgtgggtga ccagtcgaga acactgtgct gccgcaggt // LOCUS PLERRA 119 bp ss-RNA RNA 01-AUG-1990 DEFINITION P.faginea 5S rRNA. ACCESSION M35574 KEYWORDS 5S ribosomal RNA. SOURCE P.faginea (strain FO 22315) 5S rRNA. ORGANISM Phleogena faginea Eukaryota; Plantae; Thallobionta; Basidiomycotina; Phragmobasidiomycetes; Heterobasidiomycetidae; Eutremellales; Phleogenaceae. REFERENCE 1 (bases 1 to 119) AUTHORS Blanz,P.A. and Gottschalk,M. TITLE Systematic position of Septobasidium, Graphiola and other basidiomycetes as deduced on the basis of their 5S ribosomal RNA nucleotide sequences JOURNAL Syst. Appl. Microbiol. 8, 121-127 (1986) STANDARD simple staff_entry FEATURES from to/span description rRNA < 1 > 119 5S rRNA BASE COUNT 28 a 30 c 34 g 27 t ORIGIN 1 atgtgcgacc ataccaagct gaaaatactg catcccgtct gatctgcaca gtcaagcagc 61 ttagggccca gtcagtagtg cggtggggga ccatgcgcga acattgtggt gttgcactt // LOCUS SEPRRA 119 bp ss-RNA RNA 01-AUG-1990 DEFINITION S.carestianum 5S rRNA. ACCESSION M35572 KEYWORDS 5S ribosomal RNA. SOURCE S.carestianum (strain FO 25109) 5S rRNA. ORGANISM Septobasidium carestianum Eukaryota; Plantae; Thallobionta; Basidiomycotina; Phragmobasidiomycetes; Heterobasidiomycetidae; Septobasidiales; Septobasidiaceae. REFERENCE 1 (bases 1 to 119) AUTHORS Blanz,P.A. and Gottschalk,M. TITLE Systematic position of Septobasidium, Graphiola and other basidiomycetes as deduced on the basis of their 5S ribosomal RNA nucleotide sequences JOURNAL Syst. Appl. Microbiol. 8, 121-127 (1986) STANDARD simple staff_entry FEATURES from to/span description rRNA < 1 > 119 5S rRNA BASE COUNT 25 a 37 c 36 g 21 t ORIGIN 1 atctggggcc ataccacagt gaacacaccg catcccgtcc gatctgcgca gttaaccact 61 gtagggccga gtcagtagtg cggtggggga ccacgcgcga atactctggt gccccaggt // LOCUS TULRRA 118 bp ss-RNA RNA 01-AUG-1990 DEFINITION T.violea 5S rRNA. ACCESSION M35576 KEYWORDS 5S ribosomal RNA. SOURCE T.violea (strain FO 29326) 5S rRNA. ORGANISM Tulasnella violea Eukaryota; Plantae; Thallobionta; Basidiomycotina; Phragmobasidiomycetes; Heterobasidiomycetidae; Tremellales; Tulasnellaceae. REFERENCE 1 (bases 1 to 118) AUTHORS Blanz,P.A. and Gottschalk,M. TITLE Systematic position of Septobasidium, Graphiola and other basidiomycetes as deduced on the basis of their 5S ribosomal RNA nucleotide sequences JOURNAL Syst. Appl. Microbiol. 8, 121-127 (1986) STANDARD simple staff_entry FEATURES from to/span description rRNA < 1 > 118 5S rRNA BASE COUNT 30 a 29 c 31 g 28 t ORIGIN 1 atcttcggcc ataggacaga gaaaataccg catcccgtcc gatctgcgca gtcaagctct 61 gtaccgctta gttagtacca tagtggggga ccatatggga atcctgagtg ctgaagtt // LOCUS UTHRRA 118 bp ss-RNA RNA 01-AUG-1990 DEFINITION U.fusisporum 5S rRNA. ACCESSION M35578 KEYWORDS 5S ribosomal RNA. SOURCE U.fusisporum (strain FO 25106) 5S rRNA. ORGANISM Uthatobasidium fusisporum Eukaryota; Plantae; Thallobionta; Basidiomycotina; Phragmobasidiomycetes; Heterobasidiomycetidae; Tremellales; Tulasnellaceae. REFERENCE 1 (bases 1 to 118) AUTHORS Blanz,P.A. and Gottschalk,M. TITLE Systematic position of Septobasidium, Graphiola and other basidiomycetes as deduced on the basis of their 5S ribosomal RNA nucleotide sequences JOURNAL Syst. Appl. Microbiol. 8, 121-127 (1986) STANDARD simple staff_entry FEATURES from to/span description rRNA < 1 > 118 5S rRNA BASE COUNT 23 a 35 c 37 g 23 t ORIGIN 1 atccacggcc ataggacttc gaaagcaccg catcccgtcc gatctgcgca gttaaccgga 61 gtgccgccta gttagtacca cggtggggga ccacgcggga atcctgggtg ctgtggtt // LOCUS C11CMIA 2149 bp ds-DNA BCT 01-AUG-1990 DEFINITION Plasmid pColBM-C1139 colicin lysis protein (cmi) gene, 5' end. ACCESSION M35683 KEYWORDS colicin lysis protein. SOURCE Plasmid pColBM-C1139 DNA. ORGANISM Plasmid pColBM-C1139 Prokaryota; Bacteria. REFERENCE 1 (bases 1 to 2149) AUTHORS Thumm,G., Oelschlaeger,T. and Braun,V. TITLE Plasmid pColBM-C1139 does not encode a colicin lysis protein but contains sequences highly homologous to the D protein (resolvase) and the oriV region of the miniF plasmid JOURNAL Plasmid 20, 75-82 (1988) STANDARD simple staff_review FEATURES from to/span description pept 1108 1890 ORF pept 1991 > 2149 colicin lysis protein (cmi) BASE COUNT 512 a 510 c 578 g 549 t ORIGIN 1 gaattcatct tttggccgtt tacgtctgtt ccgttatcct gatgatacga tgttctgcac 61 gttctgccgg gaagatgcag atgattcgct taaaagtatt atgacccatc tctgggagct 121 ggatgcagag atgacagatc ctgtcatagc tatgtttaat cacgtctgag tgccgtgagt 181 gatttctgtc ttttatgcaa cagtgccaag atattgtaat caaaaaaaag cattaatgca 241 ttttggacag taatctattt taattgatga catagaggca ttaatctttc tttttcttca 301 ggaagatccg aaaactcctg gtcacggatc ttcctctccc ccacacaacg ccacctcctg 361 taagcacaac atgtggtgcc ggattcagct gctgatgaca ctatatgttg tgtcatctcc 421 ctgacctgtg atgcgtcgcg caggggcgga aaacagcgat atgatgattt cctcggcgtg 481 gtacacttcc ggaaagttgt gatattccgg aaagtcggat ctgacggaaa cggctctccg 541 gtaatttaac ggcgtggtta tatggatgct tgttatcatg gtgatgatga taacggcatg 601 atgttatcag acggcgtgac ggtaagggca gtgatgatgg atgacgttat cgcatgaccg 661 tccctgcccg gaaaagaaaa aaggagtcac ccatgttttt tattgagaat gaaggtcagg 721 ctgtcgccgg aacggattac tggcagtctg tacaggcgca ggccggatat gtctacctca 781 gctggaatgc cggcgcagcc aggctgcttg tcccggatgc ggcaaaacat ttactcaggg 841 agatgcgggg ggctgagtac gtcatcatca gtaagggagc actgcatggc cgcgatgcgc 901 tggaactggt atttgaagac ggcagcgatg cgccgtttgt gatccacatg ctgagtgagc 961 agtgcgatcg cctgctcccc gaaaacaacc agggaggggg ttttgttgtc accgtctgga 1021 cgcgtggcgg taaccagctc cgttatccgg gaaagtaccg ggttgtggaa aacctgcccg 1081 acgtttcccc gtggagtgaa cactgatatg cagcacctgc cggcaccgat ccaccatgcc 1141 cgggatgctg ttcagcttcc tgttgccatc gattatccgg cagcgctggc actccgccag 1201 atgtcgatgg ttcatgatga actgcccaaa tacctgctgg cccctgaagt gagcgccctg 1261 ctccattacg tcccggatct gcgccgcaag atgctgctgg ccacactgtg gaacaccggt 1321 gcgcgcatta atgaagcact ggcgctgacg cggggggatt tttcgctcac gcctccgtat 1381 ccgtttgtgc agctggccac tctgaagcag cggacagaaa aagccgccag gacggcagga 1441 agaatgcccg ccggtcagca gactcaccgg ctggttccgc tctccgactc ctggtacgtc 1501 agccagctgc agacgatggt agccacactg aaaatcccca tggaacggcg taataaacga 1561 acaggcagga cagagaaagc gcggatctgg gaagtgacgg acagaacggt caggacctgg 1621 attggggagg cggttgccgc cgctgccgct gatggtgtga cgttctctgt cccggtcacg 1681 ccacatacgt tccgccattc ctatgcgatg cacatgctgt atgccggtat accgcttaag 1741 gttctgcaga gtctgatggg gcataagtcc atcagctcaa cagaggtcta cacgaaggtg 1801 tttgcactgg atgtggctgc acggcaccgg gtgcagtttt cgatgcctga gtccgatgcg 1861 gtcacaatgc tgaaaaacag acatgcataa taagtcacaa ttatgaattg tgatttcttc 1921 tataaaaaag agaccactgc aatatgtgat ctcttgtatt atttcataat tgttaaagcc 1981 acttcacagt atgctcacat tgtacggata tattcgtaat gtttttttat atcgaatgaa 2041 cgacagaagt tgtggagatt ttatgaaagt aattagcatg aaatttattt ttattttaac 2101 gattattgct cttgctgctg tttttttctg gtctgaagat aaaggtccg // LOCUS DOGPPPP 427 bp ss-mRNA MAM 01-AUG-1990 DEFINITION Canine pancreatic polypeptide mRNA, complete cds. ACCESSION M35596 KEYWORDS pancreatic polypeptide. SOURCE Canine pancreas, cDNA to mRNA. ORGANISM Canis lupus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Carnivora; Caniformia; Canoidea; Canidae. REFERENCE 1 (bases 1 to 427) AUTHORS Toothman,P. and Paquette,T.L. TITLE Canine pancreatic polypeptide complementary deoxyribonucleic acid sequence: Pancreatic polypeptide and insulin messenger ribonucleic acid distribution in the lobes of the pancreas JOURNAL Mol. Endocrinol. 1, 413-419 (1987) STANDARD simple staff_review FEATURES from to/span description pept 21 302 pancreatic polypeptide precursor sigp 21 107 pancreatic polypeptide signal peptide matp 108 215 pancreatic polypeptide matp 225 284 icosapeptide mRNA 1 427 pancreatic polypeptide mRNA BASE COUNT 88 a 149 c 115 g 75 t ORIGIN 1 tccgcccctt aggactcggg atgcctgccg cctgccgctg cctcttcctg ctgctcctgt 61 cagcctgtgt ggctctgttg ctgcagccgc cactgggtac ccggggggcc ccgctggagc 121 cagtgtatcc gggggacgat gccacaccag agcagatggc ccagtacgcg gctgagctcc 181 gcagatacat caacatgctg accaggccca ggtatgggaa aagagacaga ggagaaatgc 241 gggacatcct ggaatggggc tccccccatg cagccgcccc cagggagctg atggacgagt 301 aatgccacct ccaagtaatg ccacctctgc ctctcaggcc aatgccagcc tacctctccc 361 ctctgcaccc ctggccaaag cttgctccct gctctcacac acagactaaa taaagcaagt 421 caaagtc // LOCUS GVICG 296 bp ss-RNA circular VRL 01-AUG-1990 DEFINITION Grapevine viroid grapevine isolate (SHV-g(GV)) complete genome. ACCESSION M35717 KEYWORDS complete genome. SOURCE Grapevine viroid RNA. ORGANISM Grapevine viroid Viridae; Nonclassified viruses. REFERENCE 1 (bases 1 to 296) AUTHORS Sano,T., Ohshima,K., Hataya,T., Uyeda,I., Shikata,E., Chou,T.-G., Meshi,T. and Okada,Y. TITLE A viroid resembling hop stunt viroid in grapevines from Europe, the United States and Japan JOURNAL J. Gen. Virol. 67, 1673-1678 (1986) STANDARD simple staff_review BASE COUNT 60 a 87 c 80 g 69 t ORIGIN 1 ctggggaatt ctcgagttgc cgcatcaggc aagcaaagaa aaaacaaggc agggaggtac 61 ttacctgaga aaggagcccc ggggcaactc ttctcagaat ccagcgagag gcgtggagag 121 agggccgcgg tgctctggag tagaggctct gcttcagaac accatcgatc gtcccttctt 181 ctttaccttc ttctggctct tccgatgaga cgcgaccggt ggcatcacct ctcggttcgt 241 cccaacctgc tttttgtcta tctgagcctc tgccgcggat cctctcttga gcccct // LOCUS HUMTCAJK 94 bp ds-DNA PRI 01-AUG-1990 DEFINITION Human T-cell receptor germline J-alpha RP DNA, partial cds. ACCESSION M35619 KEYWORDS T-cell receptor alpha-chain; antigen receptor; germline; joining exon. SOURCE Human T-cell line RPMI 8402 DNA, clone lambda-R15. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 94) AUTHORS Baer,R., Boehm,T., Yssel,H., Spits,H. and Rabbitts,T.H. TITLE Complex rearrangements within the human J-delta-C-delta/J-alpha-C- alpha locus and aberrant recombination between J-alpha segments JOURNAL EMBO J. 7, 1661-1668 (1988) STANDARD simple staff_review FEATURES from to/span description pept / 32 / 92 T-cell receptor germline J-alpha RP region (AA at 32) /hgml_locus_uid="LX0123X" /nomgen="TCRA" /map="14q11.2" IVS 93 > 94 TCR intron signal 2 10 nonamer recombination signal signal 23 29 heptamer recombination signal BASE COUNT 25 a 23 c 22 g 24 t ORIGIN 1 aggtttctgt tatgaagcat ctcacagtgt aaataccggc actgccagta aactcacctt 61 tgggactgga acaagacttc aggtcacgct cggt // LOCUS HUMTCAJM 80 bp ds-DNA PRI 01-AUG-1990 DEFINITION Human T-cell receptor unproductively rearranged J-alpha AA/J-alpha AB DNA pseudogene, partial cds. ACCESSION M35621 KEYWORDS T-cell receptor alpha-chain; antigen receptor; joining exon; processed gene; pseudogene. SOURCE Human cell line AT5-B1 tumor DNA, clone lambda-A30. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 80) AUTHORS Baer,R., Boehm,T., Yssel,H., Spits,H. and Rabbitts,T.H. TITLE Complex rearrangements within the human J-delta-C-delta/J-alpha-C- alpha locus and aberrant recombination between J-alpha segments JOURNAL EMBO J. 7, 1661-1668 (1988) STANDARD simple staff_review FEATURES from to/span description pept.ps / 30 > 78 T-cell receptor unproductively rearranged J-alpha AA/J-alpha AB region (AA at 30) /hgml_locus_uid="LX0123X" /nomgen="TCRA" /map="14q11.2" recomb 26 27 J-alpha AA end/J-alpha AB start signal 7 15 nonamer recombination signal BASE COUNT 23 a 15 c 18 g 24 t ORIGIN 1 tatgttggtt tatgtagaga cacatataga ccgacaagct catctttggg actgggacca 61 gattacaagt ctttccaagt // LOCUS HUMTCAZI 520 bp ds-DNA PRI 01-AUG-1990 DEFINITION Human T-cell receptor productively rearranged V-alpha-J-alpha DNA, exons 1 and 2. ACCESSION M35617 KEYWORDS T-cell receptor alpha-chain; antigen receptor; joining exon; processed gene; variable region. SOURCE Human T-cell line RPMI 8402 DNA, clone lambda-R10. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 520) AUTHORS Baer,R., Boehm,T., Yssel,H., Spits,H. and Rabbitts,T.H. TITLE Complex rearrangements within the human J-delta-C-delta/J-alpha-C- alpha locus and aberrant recombination between J-alpha segments JOURNAL EMBO J. 7, 1661-1668 (1988) STANDARD simple staff_review FEATURES from to/span description pept 11 56 T-cell receptor V-alpha-J-alpha region, exon 1 /hgml_locus_uid="LX0123X" /nomgen="TCRA" /map="14q11.2" 166 / 513 T-cell receptor V-alpha-J-alpha region, exon 2 IVS 57 165 T-cell receptor intron A IVS 514 > 520 T-cell receptor intron B BASE COUNT 115 a 129 c 122 g 154 t ORIGIN 1 ttgctcagcc atgctcctgg agcttatccc actgctgggg atacattttg tcctgagtga 61 gtaaaaattt ctttatggtc tctagttcca caggttctga ctagaaatgc ttgcttttta 121 tactgagtct gcactgcttt cactgatagt acgttgtttt tccaggaact gccagagccc 181 agtcagtgac ccagcctgac atccacatca ctgtctctga aggagcctca ctggagttga 241 gatgtaacta ttcctatggg gcaacacctt atctcttctg gtatgtccag tcccccggcc 301 aaggcctcca gctgctcctg aagtactttt caggagacac tctggttcaa ggcattaaag 361 gctttgaggc tgaatttaag aggagtcaat cttccttcaa cctgaggaaa ccctctgtgc 421 attggagtga tgctgctgag tacttctgtg ctgtggttgg cactgccagt aaactcacct 481 ttgggactgg aacaagactt caggtcacgc tcggtaggta // LOCUS HUMTCAZJ 130 bp ds-DNA PRI 01-AUG-1990 DEFINITION Human T-cell receptor unproductively rearranged J-alpha RX/J-alpha RP DNA, partial cds. ACCESSION M35618 KEYWORDS T-cell receptor alpha-chain; antigen receptor; joining exon; processed gene. SOURCE Human T-cell line RPMI 8402 DNA, clone lambda-R15. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 130) AUTHORS Baer,R., Boehm,T., Yssel,H., Spits,H. and Rabbitts,T.H. TITLE Complex rearrangements within the human J-delta-C-delta/J-alpha-C- alpha locus and aberrant recombination between J-alpha segments JOURNAL EMBO J. 7, 1661-1668 (1988) STANDARD simple staff_review FEATURES from to/span description pept.ps / 71 / 127 T-cell receptor J-alpha RP region (AA at 71) /hgml_locus_uid="LX0123X" /nomgen="TCRA" /map="14q11.2" pept.ps / 68 / 9 (c) T-cell receptor J-alpha RX (AA at 68) recomb 69 70 J-alpha RX end/J-alpha RP start BASE COUNT 29 a 42 c 22 g 37 t ORIGIN 1 tttaaagata gcttcactct cacttgcgtc cccattccaa atgtaaattt cctgtttccc 61 cccctccgtt accggcactg ccagtaaact cacctttggg actggaacaa gacttcaggt 121 cacgctcggt // LOCUS HUMTCAZL 97 bp ds-DNA PRI 01-AUG-1990 DEFINITION Human T-cell receptor germline J-alpha AA DNA, partial cds. ACCESSION M35620 KEYWORDS T-cell receptor alpha-chain; antigen receptor; germline; joining exon. SOURCE Human cell line AT5-B1 tumor DNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 97) AUTHORS Baer,R., Boehm,T., Yssel,H., Spits,H. and Rabbitts,T.H. TITLE Complex rearrangements within the human J-delta-C-delta/J-alpha-C- alpha locus and aberrant recombination between J-alpha segments JOURNAL EMBO J. 7, 1661-1668 (1988) STANDARD simple staff_review FEATURES from to/span description pept / 35 / 95 T-cell receptor germline J-alpha RP region (AA at 35) /hgml_locus_uid="LX0123X" /nomgen="TCRA" /map="14q11.2" IVS 96 > 97 TCR intron signal 7 15 nonamer recombination signal signal 28 34 heptamer recombination signal BASE COUNT 31 a 19 c 21 g 26 t ORIGIN 1 tatgttggtt tatgtagaga cacataacac tgtgactacc tcaggaacct acaaatacat 61 ctttggaaca ggcaccaggc tgaaggtttt agcaagt // LOCUS HUMTCAZN 89 bp ds-DNA PRI 01-AUG-1990 DEFINITION Human T-cell receptor germline J-alpha AB DNA, partial cds. ACCESSION M35622 KEYWORDS T-cell receptor alpha-chain; antigen receptor; germline; joining exon. SOURCE Human cell line AT5-B1 tumor DNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 89) AUTHORS Baer,R., Boehm,T., Yssel,H., Spits,H. and Rabbitts,T.H. TITLE Complex rearrangements within the human J-delta-C-delta/J-alpha-C- alpha locus and aberrant recombination between J-alpha segments JOURNAL EMBO J. 7, 1661-1668 (1988) STANDARD simple staff_review FEATURES from to/span description pept / 30 > 87 T-cell receptor germline J-alpha RP region (AA at 30) /hgml_locus_uid="LX0123X" /nomgen="TCRA" /map="14q11.2" IVS 88 > 89 TCR intron signal 2 10 nonamer recombination signal signal 23 29 heptamer recombination signal BASE COUNT 23 a 19 c 18 g 29 t ORIGIN 1 aggtttttgt agatctcagt atcactgtgt cttataacac cgacaagctc atctttggga 61 ctgggaccag attacaagtc tttccaagt // LOCUS MUSBMTA 141 bp ss-mRNA ROD 01-AUG-1990 DEFINITION Mouse thyrotropin beta-subunit mRNA, 5' end. ACCESSION M35719 KEYWORDS thyroid stimulating hormone; thyrotropin beta-subunit. SOURCE Mouse (strain LAF-1) male tumor TtT97, cDNA to mRNA, clone 25-4. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 141) AUTHORS Wood,W.M., Gordon,D.F. and Ridgway,E.C. TITLE Expression of the beta-subunit gene of Murine thyrotropin results in multiple messenger ribonucleic acid species which are generated by alternative exon splicing JOURNAL Mol. Endocrinol. 1, 875-883 (1987) STANDARD simple staff_review FEATURES from to/span description pept 118 > 141 thyrotropin beta-subunit BASE COUNT 40 a 30 c 39 g 32 t ORIGIN 1 agcagtaact cactcatgca aagtaagatc ctgcagtagt gggtggagaa gactgagcgc 61 atacgagtgg agagaaaaat attctgcttc agtcaagagc tggggttgtt caaaagcatg 121 agtgctgccg tcctcctctc c // LOCUS MUSBMTB 99 bp ss-mRNA ROD 01-AUG-1990 DEFINITION Mouse thyrotropin beta-subunit mRNA, 5' end. ACCESSION M35720 KEYWORDS thyroid stimulating hormone; thyrotropin beta-subunit. SOURCE Mouse (strain LAF-1) male tumor TtT97, cDNA to mRNA, clone 25-3. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 99) AUTHORS Wood,W.M., Gordon,D.F. and Ridgway,E.C. TITLE Expression of the beta-subunit gene of Murine thyrotropin results in multiple messenger ribonucleic acid species which are generated by alternative exon splicing JOURNAL Mol. Endocrinol. 1, 875-883 (1987) STANDARD simple staff_review FEATURES from to/span description pept 76 > 99 thyrotropin beta-subunit BASE COUNT 26 a 24 c 29 g 20 t ORIGIN 1 agcagtaact cactcatgca aagtaagatc ctgcagtagt gggtggagaa gagtgaccgc 61 atacgagtgg agagcatgag tgctgccgtc ctcctctcc // LOCUS MUSBMTC 93 bp ss-mRNA ROD 01-AUG-1990 DEFINITION Mouse thyrotropin beta-subunit mRNA, 5' end. ACCESSION M35721 KEYWORDS thyroid stimulating hormone; thyrotropin beta-subunit. SOURCE Mouse (strain LAF-1) male tumor TtT97, cDNA to mRNA, clone 25-2. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 93) AUTHORS Wood,W.M., Gordon,D.F. and Ridgway,E.C. TITLE Expression of the beta-subunit gene of Murine thyrotropin results in multiple messenger ribonucleic acid species which are generated by alternative exon splicing JOURNAL Mol. Endocrinol. 1, 875-883 (1987) STANDARD simple staff_review FEATURES from to/span description pept 70 > 93 thyrotropin beta-subunit BASE COUNT 26 a 22 c 21 g 24 t ORIGIN 1 agcagtaact cactcatgca aagtaagaaa aatattctgc ttcagtgaag agctggggtt 61 gttcaaagca tgagtgctgc cgtcctcctc tcc // LOCUS MUSBMTD 52 bp ss-mRNA ROD 01-AUG-1990 DEFINITION Mouse thyrotropin beta-subunit mRNA, 5' end. ACCESSION M35723 KEYWORDS thyroid stimulating hormone; thyrotropin beta-subunit. SOURCE Mouse (strain LAF-1) male tumor TtT97, cDNA to mRNA, clone 25-1. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 52) AUTHORS Wood,W.M., Gordon,D.F. and Ridgway,E.C. TITLE Expression of the beta-subunit gene of Murine thyrotropin results in multiple messenger ribonucleic acid species which are generated by alternative exon splicing JOURNAL Mol. Endocrinol. 1, 875-883 (1987) STANDARD simple staff_review FEATURES from to/span description pept 29 > 52 thyrotropin beta-subunit BASE COUNT 13 a 17 c 10 g 12 t ORIGIN 1 agcagtaact cactcatgca aagtaagcat gagtgctgcc gtcctcctct cc // LOCUS MUSIGKCSU 444 bp ss-mRNA ROD 01-AUG-1990 DEFINITION Mouse Ig aberrantly rearranged kappa-chain mRNA V-J2-C-region, complete cds. ACCESSION M35669 KEYWORDS constant region; immunoglobulin light chain; joining exon; kappa-immunoglobulin; variable region. SOURCE Mouse myeloma MOPC-21, cDNA to mRNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 444) AUTHORS Carroll,W.L., Mendel,E. and Levy,S. TITLE Hybridoma fusion cell lines contain an aberrant kappa transcript JOURNAL Mol. Immunol. 25, 991-995 (1988) STANDARD simple staff_review FEATURES from to/span description pept 28 414 Ig kappa-chain V-J2-C-region precursor sigp 28 87 Ig kappa-chain V-J2-C-region signal peptide matp 88 411 Ig kappa-chain V-J2-C-region recomb 380 381 V-region end/J2-region start recomb 411 412 J2-region end/C-region start BASE COUNT 108 a 122 c 111 g 103 t ORIGIN Chromosome 6. 1 cagcatcctc tcttccagct ctcagagatg gagacagaca cactcctgtt atgggtactg 61 ctgctctggg ttccaggttc cactggtgac attgtgctga cacagtctcc tgcttcctta 121 gctgtatctc tggggcagag ggccaccatc tcatacaggg ccagcaaaag tgtcagtaca 181 tctggctata gttatatgca ctggaaccaa cagaaaccag gacagccacc cagactcctc 241 atctatcttg tatccaacct agaatctggg gtccctgcca ggttcagtgg cagtgggtct 301 gggacagact tcaccctcaa catccatcct gtggaggagg aggatgctgc aacctattac 361 tgtcagcaca ttagggagct tacacgttcg gaggggggac caagctggaa ataaaacggg 421 ctgatgctgc accaactgta tcca // LOCUS MUSLACPI 844 bp ss-mRNA ROD 01-AUG-1990 DEFINITION Mouse placental lactogen I (mPL-I) mRNA, complete cds. ACCESSION M35662 KEYWORDS placental lactogen I. SOURCE Mouse (strain Swiss-Webster) day 10 placenta, cDNA to mRNA, clone 1.5. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 844) AUTHORS Colosi,P., Talamantes,F. and Linzer,D.I.H. TITLE Molecular cloning and expression of mouse placental lactogen I complementary deoxyribonucleic acid JOURNAL Mol. Endocrinol. 1, 767-776 (1987) STANDARD simple staff_review FEATURES from to/span description pept 42 716 placental lactogen I (mPL-I) precursor sigp 42 131 placental lactogen I (mPL-I) signal peptide matp 132 713 placental lactogen I (mPL-I) mRNA < 1 844 mPL-I mRNA signal 821 831 mPL-I poly-A signal BASE COUNT 243 a 188 c 176 g 237 t ORIGIN 1 ttcctcactt ggagcctaca ttgtggtgga tcttctcaga aatgcagctg actttgaatc 61 tttcaggctc cgcaggaatg caattgttgc tgctggtgtc aagcctactc ctttgggaga 121 atgtgtcctc caaaccaact gccatggtgc ccactgaaga cctgtatact cgtttggctg 181 aactgctcca taatacattt atcttggccg cagatgtgta tagggaattt gatttggatt 241 ttttcgataa aacttggata acagacagaa cacttcccct gtgtcatact gcttccatcc 301 atactccaga gaatcgagag gaagtccacg aaactaaaac tgaagacctt ctgaaagcaa 361 tgatcaatgt ttcaatttcc tggaaagaac ctctgaaaca cctggtgtct gcactgacgg 421 ctctcccagg agcttctgag agtatgggga aaaaagctgc tgacattaag ggcagaaacc 481 ttgtaattct ggagggactt cagacaatat acaacaggtc tcaggctaac attgaagaaa 541 atgaaaattt tgactaccct gcttggtctg gactcgaaga actgcagtca cctaacgaag 601 acactcatct ttttgccgtt tataatctat gccgctgcat taaaagggac atccataaga 661 tagacagcta tatcaaagtc ttgaggtgcc gagttgtctt tcagaacgaa tgttgagtgc 721 ccacccagcg aagccctgcc cacatggtct ttgttgaacc agacttgtaa tgctttcccc 781 tcctcagtta tgatgagcta taatggaatt attgtcataa aataaaataa aattatttag 841 attc // LOCUS BLYGSA 1621 bp ss-mRNA PLN 01-AUG-1990 DEFINITION Barley glutamate 1-semialdehyde aminotransferase (GSA) mRNA, complete cds. ACCESSION M31545 KEYWORDS glutamate 1-semialdehyde aminotransferase. SOURCE Barley (cv. Bonus) 5 day old dark grown seedling, cDNA to mRNA. ORGANISM Hordeum vulgare Eukaryota; Plantae; Embryobionta; Magnoliophyta; Liliopsida; Commelinidae; Cyperales; Poaceae. REFERENCE 1 (bases 1 to 1621) AUTHORS Grimm,B. TITLE Primary structure of a key enzyme in plant tetrapyrrole synthesis: Glutamate 1-semialdehyde aminotransferase JOURNAL Unpublished (1990) Carlsberg Laboratory, Dept. of Physiology, Gamle STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by B.Grimm, 22-JAN-1990. FEATURES from to/span description pept 20 1429 glutamate 1-semialdehyde aminotransferase (GSA) precursor (EC 5.4.3.8) sigp 20 121 glutamate 1-semialdehyde aminotransferase signal peptide matp 122 1426 glutamate 1-semialdehyde aminotransferase signal 1598 1603 polyA signal mRNA < 1 1621 GSA mRNA BASE COUNT 362 a 363 c 459 g 437 t ORIGIN 1 ggagaaggaa ggcagcatca tggccggagc agcagccgcc gtggcctccg gcatatcgat 61 caggcctgta gccgcgccta agatctcgcg cgcgccccgc tctcggtcgg tggtgagggc 121 ggccgtctcc atagacgaga aggcttacac ggttcagaaa tccgaggaga tcttcaacgc 181 cgccaaggaa ttgatgcctg gtggtgttaa ttcaccagtc cgtgccttca aatcagtcgg 241 cgggcagccc atagtttttg attctgtgaa gggctctcat atgtgggatg tcgatggaaa 301 tgaatatatt gattatgttg gttcctgggg tcctgcaatc attggtcatg cagatgacaa 361 ggtgaatgct gcacttattg aaactctgaa gaagggtact agctttggtg ctccatgtgc 421 gttggagaat gtgttggctc aaatggtcat ctccgctgtg ccgagtatcg aaatggttcg 481 ttttgtaaat tcaggaacag aagcttgcat gggagcactc cgccttgtgc gtgcattcac 541 tgggagggaa aagattctca agtttgaagg ctgttaccat ggccatgcag attccttcct 601 tgttaaagca ggcagtggtg ttgccaccct cggcctccca gactcccctg gagtgcctaa 661 gggagccacc gttgggactc taacagcacc ttataatgat gctgatgcgg ttaaaaagct 721 gtttgaggat aacaaagggg agattgctgc agtcttcctt gagccggttg ttggcaatgc 781 tggcttcatt cctccgcagc ctgctttcct aaatgctctc cgtgaggtga ccaaacaaga 841 cggcgcactt ctggtgtttg atgaagtgat gactcctttc cgtttagctt atggtggggc 901 acaagagtac tttggaatca cccctgatgt gacaaccttg ggccaaatta ttggcggtgg 961 tcttccggtt ggtgcttacg gtggacggaa ggatatcatg gagatggttg ctccagcagg 1021 gccaatgtac caggcaggaa ccctcagtgg aaaccctcta gctatgactg ctggaatcca 1081 cactctcaag cgtctgatgg agcctggcac ctatgaatac ttagacaagg tcactggtga 1141 acttgtccgg ggcatattgg atgtgggcgc taaaacaggg cacgagatgt gtggaggaca 1201 catcagaggc atgttcggat tcttcttcgc aggtggccca gtgcacaact ttgatgatgc 1261 caagaagagt gacacagcga agtttgggag gttccaccgt ggaatgctgg gcgaaggcgt 1321 gtatctggca ccatcccagt tcgaggcagg ttttacaagc ttggcacaca ccacccaaga 1381 cattgagaaa accgtggagg ctgccgagaa ggttcttcga tggatataga tgatttggat 1441 tgcaaacctt ttgaagcttt tccttctgtt gtattctgtt agtttgtacg tggctgaagt 1501 ttagttttgt attgtatttt gttgtgcagc agcagtatct tgtctctagc ccatttttct 1561 tcttctgagt tagcatttgg ggtgattttg tcttggcaat aaaactttgg ctacgacctc 1621 c // LOCUS MUSSVSIV 541 bp ss-mRNA ROD 01-AUG-1990 DEFINITION Mouse seminal vesicle secretory protein IV (SVS IV) mRNA, 3' end. ACCESSION M35732 KEYWORDS seminal vesicle secretory protein IV. SOURCE Mouse adult seminal vesicle, cDNA to mRNA, clone p2A2. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 541) AUTHORS Chen,Y.H., Pentecostt,B.T., McLachlan,J.A. and Teng,C.T. TITLE The androgen-dependent mouse seminal vesicle secretory protein IV: Characterization and complementary deoxyribonucleic acid cloning JOURNAL Mol. Endocrinol. 1, 707-716 (1987) STANDARD simple staff_review FEATURES from to/span description pept < 1 329 seminal vesicle secretory protein IV (SVS IV) precursor (AA at 3) sigp < 1 50 seminal vesicle secretory protein IV (SVS IV) signal peptide matp 51 326 seminal vesicle secretory protein IV (SVS IV) mRNA < 1 541 SVS IV mRNA BASE COUNT 154 a 107 c 131 g 149 t ORIGIN 1 gtttgttcct cttttctctg cttctccttc tggtgacagg agccattggg aagaaaacta 61 aggaaaaatt cttgcagtcg gaagaaactg tcagagagag cttctcgacg ggaagcagag 121 gccatatgtc aagaagttct gagccagagg tatttgttag gccacaggac tccatcggtg 181 acgaagcttc tgaggaaatg agtagtagta gtagtagtag aagaagaagt aagattatct 241 ctagcagttc tgatggttct aatatggaag gtgagagttc atattcaaag agaaagaaga 301 gccggttttc tcaagatgca ctcgagtgat actgcattga ccagctgaac atctggacca 361 atatgctgga gccatatcgc cagaacagag cccatgatgt cttcagcata cagctcccat 421 gtggtctcag aggcagtccc tggatggcat ttacttccca tgcttgtttg tcttgaggtt 481 cttaaaccta acatttactc tggagctttc tttccaataa agagataaca attgcatcat 541 t // LOCUS NEMRPT 677 bp ds-DNA INV 01-AUG-1990 DEFINITION A.lumbricoides BamHI repetitive DNA. ACCESSION M35399 KEYWORDS BamHI repetitive sequence. SOURCE A.lumbricoides DNA, clone AL700-1. ORGANISM Ascaris lumbricoides Eukaryota; Animalia; Metazoa; Nemata; Secernentea; Rhabditia; Ascaridida; Ascaridina; Ascaridoidea; Ascarididae. REFERENCE 1 (bases 1 to 677) AUTHORS Warren,T. and Pasternak,J.J. TITLE A related moderately repetitive DNA family in the nematodes Ascaris lumbricoides and Panagrellus silusiae JOURNAL Nucleic Acids Res. 16, 10833-10847 (1988) STANDARD simple staff_review FEATURES from to/span description rpt 1 677 BamHI repeat BASE COUNT 186 a 158 c 161 g 172 t ORIGIN 1 ggatccgagt aagtgtgcaa aaacagcatt atttatgtaa acgaagctca attacatttc 61 taagtgcaat tacggctgta tcacgggttg gcaactccat attccacgga aatccaccca 121 ttcaacgggt gcaattcccg tgagtatcgt aaaataggag agtgaaagct cagaatgcgg 181 ctagaatgtg tcatcttgtt gccaaatcgg agatatgtat cgtgtgaatt gacatgtatc 241 atgccaaggt aggtcggaaa ggccaaagaa aagcggaaac cagacggtcg gaaagtacag 301 aactcgattc ttgcgattgt gcatcttcga gttctggtaa gtgtaaatgc gagtccggtg 361 tctgatcgga tctgatcggc cagtgccgag gcttacacgt gactatcaca tagtctcact 421 ctttcactct tcccttttcg cgatttccga ttcagtgcta acaactcgac gtagacaccc 481 cactctttct cctgcgcatt cctatgccgg tcaccgattg ggtcgcaaaa tgccaaagga 541 cagggcatgt aagcccgcat cttaattgtt aagattcacc gatgaatcgt caaaaatttt 601 gcaaaagcta gtggaaaacg gggttttgag gcccgttcca ccggcaaacc gtcatcgtgc 661 gccgatcaga tggatcc // LOCUS PNGRPT 682 bp ds-DNA INV 01-AUG-1990 DEFINITION P.silusiae BamHI repetitive DNA. ACCESSION M35398 KEYWORDS BamHI repetitive sequence. SOURCE P.silusiae DNA, clone PS700-1. ORGANISM Panagrellus silusiae Eukaryota; Animalia; Metazoa; Nemata; Secernentea; Rhabditia; Rhabditida; Rhabditina; Rhabditoidea; Cephalobidae. REFERENCE 1 (bases 1 to 682) AUTHORS Warren,T. and Pasternak,J.J. TITLE A related moderately repetitive DNA family in the nematodes Ascaris lumbricoides and Panagrellus silusiae JOURNAL Nucleic Acids Res. 16, 10833-10847 (1988) STANDARD simple staff_review FEATURES from to/span description rpt 1 682 BamHI repeat BASE COUNT 201 a 154 c 155 g 172 t ORIGIN 1 ggatccgcag cgaattgtgt aaaacagcat taattatgta aaagaagctc aattaacctt 61 tctaagtgca attgaggctg tatcacgggt tggcaacctc gtattccacg gaaatccacc 121 cattcaacgg gtgcgatttc gtgtttttcg taaaaatcgg attctgaagg ctagaatccg 181 gccagaatgt gtcatcttgt tccaaatgag agttatttga catctgaatc acatttgaaa 241 tgcaaagaca ggtcggaaag gccaaacaag agcgaaaacc cgcgggtcgc caaaagtacc 301 agaactcgat tcttgcgatt tttcgcattt tcgagttctg gtaagtgcaa aaagtttcga 361 tttcggatct gcatcggaat ctgattgccc acgtgccaga aggcttaaaa acgtgcacaa 421 accacatggt taccctttac cttgttttcg aaatttaaca aaaagtgcaa aaaccgggta 481 aaaacccatc tttggcctgc gcattgccaa tggcggtcat cgatgggtcg cgaagtgcca 541 aagggaccaa ggtgtaagcc cgcatcatat ctgttaagat tcatcgatga atcggccaat 601 attttgaaaa gctagtggaa aaacgcgttt tgacgcccgt ttccaccggc aaaccgtcat 661 cgtgcgccga tcagacggat cc // LOCUS TETTRGA 75 bp ss-tRNA RNA 01-AUG-1990 DEFINITION T.thermophila Gln-tRNA-UUG. ACCESSION M35400 KEYWORDS glutamine tRNA. SOURCE T.thermophila tRNA. ORGANISM Tetrahymena thermophila Eukaryota; Animalia; Metazoa; Ciliophora; Oligohymenophora; Hymenostomata; Hymenostomatida; Tetrahymenina; Tetrahymenidae. REFERENCE 1 (bases 1 to 75) AUTHORS Hanyu,N., Kuchino,Y., Nishimura,S. and Beier,H. TITLE Dramatic events in ciliate evolution: Alteration of UAA and UAG termination codons to glutamine codons due to anticodon mutations in two Tetrahymena tRNAs-Gln JOURNAL EMBO J. 5, 1307-1311 (1986) STANDARD simple staff_review FEATURES from to/span description tRNA 1 75 Gln-tRNA modified 9 9 m1g modified 10 10 m2g modified 13 13 p modified 19 19 d modified 20 20 d modified 34 34 um anticdn 34 36 Gln-tRNA anticodon ttg modified 39 39 p modified 48 48 m5c modified 54 54 p modified 57 57 m1a BASE COUNT 15 a 19 c 21 g 18 t 2 others ORIGIN 1 ggttgtatgg tgtagcggaa agcaccgagg actttgaatc ctctgacctg ggttcgaatc 61 ccagtacgac ctcca // LOCUS TETTRGB 75 bp ss-tRNA RNA 01-AUG-1990 DEFINITION T.thermophila Gln-tRNA-CUA. ACCESSION M35401 KEYWORDS transfer RNA-Gln. SOURCE T.thermophila tRNA. ORGANISM Tetrahymena thermophila Eukaryota; Animalia; Metazoa; Ciliophora; Oligohymenophora; Hymenostomata; Hymenostomatida; Tetrahymenina; Tetrahymenidae. REFERENCE 1 (bases 1 to 75) AUTHORS Hanyu,N., Kuchino,Y., Nishimura,S. and Beier,H. TITLE Dramatic events in ciliate evolution: Alteration of UAA and UAG termination codons to glutamine codons due to anticodon mutations in two Tetrahymena tRNAs-Gln JOURNAL EMBO J. 5, 1307-1311 (1986) STANDARD simple staff_review FEATURES from to/span description tRNA 1 75 Gln-tRNA modified 10 10 m2g modified 13 13 p modified 19 19 d modified 20 20 d anticdn 34 36 Gln-tRNA anticodon cta modified 37 37 t6a modified 39 39 p modified 48 48 m5c modified 54 54 p modified 57 57 m1a BASE COUNT 19 a 18 c 18 g 19 t 1 others ORIGIN 1 ggttctatag tatagcgcaa agtactgggg antctaaatc ccttgacctg ggttcgaatc 61 ccagtaggac ctcca // LOCUS TETTRGC 75 bp ss-tRNA RNA 01-AUG-1990 DEFINITION T.thermophila Gln-tRNA-UUA. ACCESSION M35402 KEYWORDS transfer RNA-Gln. SOURCE T.thermophila tRNA. ORGANISM Tetrahymena thermophila Eukaryota; Animalia; Metazoa; Ciliophora; Oligohymenophora; Hymenostomata; Hymenostomatida; Tetrahymenina; Tetrahymenidae. REFERENCE 1 (bases 1 to 75) AUTHORS Hanyu,N., Kuchino,Y., Nishimura,S. and Beier,H. TITLE Dramatic events in ciliate evolution: Alteration of UAA and UAG termination codons to glutamine codons due to anticodon mutations in two Tetrahymena tRNAs-Gln JOURNAL EMBO J. 5, 1307-1311 (1986) STANDARD simple staff_review FEATURES from to/span description tRNA 1 75 Gln-tRNA modified 10 10 m2g modified 13 13 p modified 16 16 d modified 19 19 d modified 20 20 d modified 32 32 cm modified 34 34 um anticdn 34 36 Gln-tRNA anticodon tta modified 37 37 t6a modified 39 39 p modified 48 48 m5c modified 54 54 p modified 57 57 m1a BASE COUNT 16 a 17 c 20 g 19 t 3 others ORIGIN 1 ggttccatag tatagdggdd agtactgggg actttaaatc ccttgacctg ggttcgaatc 61 ccagtgggac ctcca // LOCUS BEGRR5S 120 bp ss-rRNA RNA 01-AUG-1990 DEFINITION B.alba 5S ribosomal RNA. ACCESSION M35565 KEYWORDS 5S ribosomal RNA. SOURCE B.alba (strain B18LD) rRNA. ORGANISM Beggiatoa alba Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Nonphotosynthetic, nonfruiting gliding bacteria; Cytophagales; Beggiatoaceae. REFERENCE 1 (bases 1 to 120) AUTHORS Stahl,D.A., Lane,D.J., Olsen,G.J., Heller,D.J., Schmidt,T.M. and Pace,N.R. TITLE Phylogenetic analysis of certain sulfide-oxidizing and related morphologically conspicuous bacteria by 5S ribosomal ribonucleic acid sequences JOURNAL Int. J. Syst. Bacteriol. 37, 116-122 (1987) STANDARD simple staff_review FEATURES from to/span description rRNA 1 120 5S ribosomal RNA BASE COUNT 32 a 32 c 29 g 27 t ORIGIN 1 ttcttggcga ccatagcaaa taggaaccac ccgaccccat cccgaactcg gtagtgaaac 61 tgttctgcgc cgatgatagt gtggatactc tccatgtgaa agtaggttat cgccaagagc // LOCUS ECOHEMC 2092 bp ds-DNA BCT 01-AUG-1990 DEFINITION E.coli porphobilinogen deaminase (hemC) and uroporphyrinogen III synthase (hemD) genes, complete cds. ACCESSION X04242 M35827 KEYWORDS deaminase; hemC gene; hemD gene; porphobilinogen deaminase; uroporphyrinogen III synthase. SOURCE E.coli DNA. ORGANISM Escherichia coli Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Facultatively anaerobic rods; Enterobacteriaceae. REFERENCE 1 (bases 1 to 1957) AUTHORS Thomas,S.D. and Jordan,P.M. TITLE Nucleotide sequence of the hemC locus encoding porphobilinogen deaminase of Escherichia coli K12 JOURNAL Nucleic Acids Res. 14, 6215-6226 (1986) STANDARD simple automatic REFERENCE 2 (bases 1290 to 2092) AUTHORS Jordan,P.M., Mgbeje,B.I.A., Thomas,S.D. and Alwan,A.F. TITLE Nucleotide sequence for the hemD gene of Escherichia coli encoding uroporphyrinogen III synthase and initial evidence for a hem operon JOURNAL Biochem. J. 249, 613-616 (1988) STANDARD simple staff_review COMMENT Data kindly reviewed (11-SEP-1986) by P. Jordan FEATURES from to/span description pept 390 1331 porphobilinogen deaminase (hemC) pept 1328 2068 uroporphyrinogen III synthase (hemD) signal 330 335 put. -35 region rpt 63 67 inverted repeat A rpt 78 82 direct repeat 1 rpt 349 853 inverted repat A' signal 354 359 put. -10 region rpt 356 360 direct repeat 1 rpt 367 371 direct repeat 1 binding 377 381 put. ribosome binding site signal 1508 1522 pot. transcription termination signal BASE COUNT 495 a 540 c 566 g 491 t ORIGIN 1 caagacgtat cgcctgattt gctacccgtc atgactgtga ttccgccaac atcaacggta 61 acacgcggca ttcgggatat ttcgtatgtc aaaggtaacc gttaccactt ttcgcgcctg 121 gtttttttag tttcacgacg aaaaaatggt ctaaaacgtg atcaatttaa caccttgctg 181 attgaccgta aagaaagatg cgctacatac aagtgtagca ccgtttattc tctgtaaatt 241 ccttattaca acggcgtgaa acgcctgtca ggatccactg ccagacctca ttttacggtt 301 tgcgcaggcg tctacgtttc accacaacac tgacatcact ctggcaagga tgttaggatg 361 gaccacggat gataatgacg gtaacaagca tgttagacaa tgttttaaga attgccacac 421 gccaaagccc acttgcactc tggcaggcac actatgtcaa agacaagttg atggcgagcc 481 atccgggcct ggtcgttgaa ctggtaccga tggtgacgcg cggcgatgtg attcttgata 541 cgccgctggc gaaagtaggc ggaaaaggct tatttgtaaa agagctggaa gtcgcgctcc 601 tcgaaaatcg cgccgatatc gccgtacact caatgaaaga tgtgccggtt gaattcccgc 661 aaggtctggg actggtcact atttgtgagc gtgaagatcc tcgcgatgcc tttgtgtcca 721 ataactatga cagtctggat gcgttaccgg caggcagtat cgtcgggacg tccagtttac 781 gtcgccagtg ccaactggct gaacgccgtc cggatctgat tatccgctcc ctgcgcggca 841 acgtcggcac tcgcctgagc aaactggata acggcgaata cgatgccatc attcttgccg 901 tagccggact aaaacgttta ggtctggagt cacgtattcg cgccgcgttg ccacccgaga 961 tttctcttcc ggcggtagga caaggtgcgg tgggtattga atgccgcctt gatgattcac 1021 gcactcgcga gctgcttgcc gcgctgaatc accacgaaac tgcactgcgc gttaccgcag 1081 aacgcgccat gaatacccgt ctcgaaggcg catgtcaggt gccaattggt agctacgccg 1141 agcttattga tggcgaaatc tggctgcgtg ggctggtcgg cgcgccggac ggttcgcaga 1201 ttattcgcgg tgaacgccgc ggtgcgccgc aagatgccga acaaatgggg atttcgctgg 1261 cagaagagct actgaataac ggcgcgcgcg agatcctcgc tgaagtctat aacggagacg 1321 ccccggcatg agtatccttg tcacccgccc gtctcccgct ggagaagagt tagtgagccg 1381 tctgcgcaca ctggggcagg tggcctggca ttttccgctg attgagtttt ctccgggtca 1441 acaattaccg caacttgctg atcaactggc agcgctgggg gagagcgatc tgttgtttgc 1501 cctctcgcaa cacgcggttg cttttgccca atcacagctg catcagcaag atcgtaaatg 1561 gccccgacta cctgattatt tcgccattgg acgcaccacc gcactggcac tacataccgt 1621 aagtggacag aagattctct acccgcagga tcgggaaatc agcgaagtct tgctacaatt 1681 acctgaatta caaaatattg cgggcaaacg tgcgctgata ttacgtggca atggtggtcg 1741 tgagctaatt ggggataccc tgacggcgcg cggtgctgag gtcacttttt gtgaatgtta 1801 tcaacgatgc gcaatccatt acgatggtgc agaagaagcg atgcgctggc aagcccgcga 1861 ggtgacgatg gtcgttgtta ccagcggtga aatgttgcag caactctggt cactgatccc 1921 acaatggtat cgtgagcact ggttactaca ctgtcgacta ttggtcgtca gtgagcgttt 1981 ggcgaaactc gcccgggaac tgggctggca agacattaag gtcgccgata acgctgacaa 2041 cgatgcgctt ttacgggcat tacaataact ctcataacag gaagccataa tg // LOCUS LTTRR5S 117 bp ss-rRNA RNA 01-AUG-1990 DEFINITION L.discophora 5S ribosomal RNA. ACCESSION M35569 KEYWORDS 5S ribosomal RNA. SOURCE L.discophora (strain Stokes) rRNA. ORGANISM Leptothrix discophora Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Budding and/or appendaged bacteria; Prosthecate bacteria. REFERENCE 1 (bases 1 to 117) AUTHORS Stahl,D.A., Lane,D.J., Olsen,G.J., Heller,D.J., Schmidt,T.M. and Pace,N.R. TITLE Phylogenetic analysis of certain sulfide-oxidizing and related morphologically conspicuous bacteria by 5S ribosomal ribonucleic acid sequences JOURNAL Int. J. Syst. Bacteriol. 37, 116-122 (1987) STANDARD simple staff_review FEATURES from to/span description rRNA 1 117 5S ribosomal RNA BASE COUNT 27 a 35 c 32 g 23 t ORIGIN 1 atgcctgacg accatagcga ggtggtccca ctccttccca tcccgaacag gacagtgaaa 61 cgcctcagcg ccgatgatag tgcgcattcg cgtgtgaaag taggtcatcg tcaggct // LOCUS TBSACG 4776 bp ss-RNA VRL 01-AUG-1990 DEFINITION Tomato bushy stunt virus complete genome. ACCESSION M21958 M31019 KEYWORDS capsid protein; coat protein; complete genome; p19 protein; p22 protein; p33 protein; p41 protein; p92 protein. SOURCE Tomato bushy stunt virus (strain cherry), cDNA to viral RNA. ORGANISM Tomato bushy stunt virus Viridae; ss-RNA nonenveloped viruses; Isometric ss-RNA viruses; Tombusvirus. REFERENCE 1 (bases 2621 to 4776) AUTHORS Hillman,B.I., Hearne,P., Rochon,D. and Morris,T.J. TITLE Organization of tomato bushy stunt virus genome: Characterization of the coat protein gene and the 3' terminus JOURNAL Virology 169, 42-50 (1989) STANDARD full staff_review REFERENCE 2 (bases 1 to 2620) AUTHORS Hearne,P.Q., Knorr,D.A., Hillman,B.I. and Morris,T.J. TITLE The complete genome structure and synthesis of infectious RNA from clones of tomato bushy stunt virus JOURNAL Virology 177, 141-151 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence [1] kindly submitted by P.Q.Hearne, 16-DEC-1988. Draft entry and computer-readable sequence for [2] kindly submitted by D.Knorr, 21-DEC-1989. The 5' terminal nucleotide was not determined. However, in the infectious constructs, two 5' terminal "g" residues are added, one of which is removed during subsequent replication in host plants. FEATURES from to/span description pept 166 1056 p33 protein pept 166 2622 p92 protein (read-through of p33) pept 2652 3818 p41 capsid protein pept 3888 4406 p19 protein pept 3856 4425 p22 protein mRNA 2621 4776 2.2kb subgenomic mRNA mRNA 3841 4776 0.9kb subgenomic mRNA BASE COUNT 1257 a 983 c 1315 g 1220 t 1 others ORIGIN 1 naaattctcc aggatttctc gacctagttc gtttatctgg tgacttgcgc taccgttgct 61 ttgcgtagag aatttctctc cataattatt atctttagtt gtggggtttg aaggttgggt 121 ctacctttcg gggggataaa ttgtaacttc caacaaacaa gcgacatgga gaccatcaag 181 agaatgattt ggcctaagaa agagattttt gtgggtgatt tcgcaaccgg agtgaatagg 241 acagttccgg tgaacatctt tcaattggtg tgtcgtgtgg ttctgagata catgaggaca 301 gggaaaatag agtgtgattc tgacagcatg actaagttta tagttgaatt actcaaaact 361 gattgtgctg ccaaatggga atggttcatg aagagacggc agaggggtga ttacattgtc 421 cctctatcta tagcctccat accaatcata ccgctgttga gttatgccac tagggtacgc 481 gcagtctcag tcaaggcttt tggcaatgaa ctatcgttca atgtcagggt gcctagacca 541 tctgtaccta agaaaggatt gctcctcaga ctggcggcag gtctagcgtt agctcctata 601 tgcgcgctgg ccgtgtacgc taccctacct agggaaaaac tgtcggtatt taagctgaga 661 actgaggcac gagcacacat ggaggatgag agagaagcga cagattgtct ggtggttgag 721 ccggcaaggg aacttaaggg taaagatggt gaggatctcc tcactggtag tagattgact 781 aaggtgatcg cgtccactgg gcgccctcgt cgaagacctt atgcggcaaa gatcgcacag 841 gtggcgagag caaaggtggg ttaccttaag aacagtccag agaatagact aatctaccag 901 agggtgatga tcgagatcat ggacaaagac tgcgtcaggt atgttgacag ggatgtcata 961 ttgcctttgg ctattggatg ctgttttgtc tatccggatg gagtggagga gtcggcggca 1021 ctatggggct cacaggagtc cctgggtgtc aaatagggag gcctagtacg tctacctggg 1081 gttgtaacac agatcaatcg agatatccca tctgatgtgt tacttcctca ggaggtgcta 1141 gaggttcgta caggacctcc caatgctaag gaccgtaata tatttatggt tgcaggttgc 1201 ccatcacagg cacggttctt agtacataat cactgcctga aaaaccttaa aaggggtctt 1261 gtggagagag tcttctgcgt agagagaaac gggaagctcg ctcgcactcc acaacctacc 1321 aaaggagcct ttggacgtct ttccccgttc aggaaagcgg tttgtgagaa ggttggggta 1381 gcccaccgac ttgggtatga tgggtttctg tcatactaca gcggtgcgaa actccgtact 1441 tacacacgag ccgtggagag tctgcatatc acacctgtct ccgagaggga tagtcacttg 1501 actaccttcg taaaagcaga gaagatatcg acgtctaagg gtgacccagc acctcgggtg 1561 attcagcctc gaaacccgag gtacaatgtg gaacttggaa gatatctacg gcatatggaa 1621 tccaagctga tgaaagctgt tgatggcgtg ttcggagaga cgacatgcat caaaggatac 1681 acagctgatg aggtaggtgc aattttccgg gctaaatggg acaggtttga taagcctgtc 1741 gccatagggc tcgatgcatc taggtttgat caacactgtt ccgttgaagc attgcaatat 1801 gagcatagct tctacagggc catgtaccct ggcaacaagc tcttgggcaa gttgttggaa 1861 tggcagctcc ataataaagg taaaggttat gttccagatg gaactataac ctatcgcaag 1921 gagggctgtc gcatgagtgg ggatataaac acctcgttgg gcaactatct actgatgtgt 1981 gcaatggtac atgggtacat gcgtcatctg gggattaatg agtttagtct ggcaaactgt 2041 ggggatgatt gcgtcctaat tgtcgaacgc aggaatctta agcagataca gagaacttta 2101 ccggagtatt tcctcaatct gggatatact atgaaggtgg agcaacctgt atttcaactg 2161 gaagaggttg aattttgcca ggcacaccca gtacagtttc aaggcggttg gaagatggtt 2221 cgaaacgtcc gtactgctat gagcaaggat gtgcactgtg tcaacaatat acgcgatttg 2281 gcgacgagga gagcttggag taatgctcaa catcatgggg gtctagcgct tagtgctggt 2341 attccagttg tggagacgtt ttactctagg tttaagcttt atgatgtacc tcgtaaacat 2401 caacgtattg acacggtcac aaatgtgcac aagtggcgtg gatccggtgg gagttatgtt 2461 gtgacccctg aatctagggc tagcttttgg gctgcctttg gactcacggg ggatgagcaa 2521 ctggctctgg aggaccgtct ggaaagatgg gagatggatc tgtttggaga ggagggtgtt 2581 gacgctcatg agcccagcat cctcgactcc gccgtagctt gaccaagaat acacacacgc 2641 aggatagaca catggcaatg gtaaagagaa acaacaacac gggaatgatc ccggtgagta 2701 caaagcaatt actggcattg ggtgcggccg ctggggccac agccttgcag ggatttgtca 2761 agaataatgg gatggccatc gttgaggggg ctgtcgatct gactaaaaga gcgtacaaag 2821 cagtgcggag aagaggaggt aagaaacagc agatgattaa tcatgtaggt ggtacaggtg 2881 gtgctataat ggcgccggta gcagtgacta gacaacttgt cggtagtaag cctaagttta 2941 ctggcaggac gtctggctct gtcacagtta cccaccgtga gtatctgtca caagtgaata 3001 attccacggg tttccaagtt aatgggggaa ttgtcggcaa tttgttacag cttaacccgt 3061 tgaatggtac attgttctct tggttgccag cgatagcatc caattttgat cagtacacat 3121 tcaacagcgt tgtgctacat tatgtgcccc tatgttcaac tactgaggta gggagagtgg 3181 ctatttactt tgataaggac tcagaagatc cagaacctgc tgatagagtt gagttggcga 3241 attacagcgt gcttaaagag acagcccctt gggctgaagc gatgcttagg gtacccaccg 3301 ataagattaa gagattttgt gatgacagtt ccacatctga tcacaaactt atcgacttgg 3361 gtcaattggg cattgctaca tatggtggcg ctgggactaa tgctgtgggg gatatcttta 3421 tctcgtacag tgttacgtta tatttccctc aacctacgaa cacactcctt agtaccagaa 3481 ggctcgacct tgctggcgct cttgtcacag catctggccc tggatacctc ctggtgtcta 3541 ggactgccac tgtattgaca atgacattcc gtgctacagg cacgtttgtc atatccggga 3601 cgtatcggtg cctcacggca acaacgttag gcttggctgg cggagtgaat gtcaatagta 3661 tcacagttgt agataacata ggtacagaca gtgcgttttt cataaattgt actgtctcta 3721 acctaccatc tgtggtgaca ttcacatcta ccggtatcac atctgccaca gtacattgcg 3781 tgcgcgcgac acgacagaat gatgtttctc taatttagtg tgtcctgcga ggggcctctt 3841 gaacaagacc agttcatgga tactgaatac gaacaagtca ataaaccatg gaacgagcta 3901 tacaaggaaa cgacgctagg gaacaagcta acagtgaacg ttgggatgga ggatcaggag 3961 gtaccacttc tcccttcaaa cttcctgacg aaagtccgag ttggactgag tggcggctac 4021 ataacgatga gacgaattcg aatcaagata atccccttgg tttcaaggaa agctggggtt 4081 tcgggaaagt tgtatttaag agatatctca gatacgacag gacggaagct tcactgcaca 4141 gagtccttgg atcttggacg ggagattcgg ttaactatgc agcatctcga tttttcggtt 4201 tcgaccagat cggatgtacc tatagtattc ggtttcgagg agttagtatc accgtttctg 4261 gagggtcgcg aactcttcag catctctgtg agatggcaat tcggtctaag caagaactgc 4321 tacagcttgc cccaatcgaa gtggaaagta atgtatcaag aggatgccct gaaggtactg 4381 agaccttcga aaaagaaagc gagtaagaca gactcttcag tctgagtttg tggagatgag 4441 tgtaaatctg gcatagcata caggttactc ttgttgggtt ctggatgtta ggatgacgag 4501 tcgactcggg ctccgcacta ggtttggtcg cctaggggat ggagatatgg aaagggtctc 4561 gtgtggtatc agtcggtcga aagacgcgct tccaacatgg gcctatggtc ggataagtct 4621 tagcaatacc agccagcatg aattggattc ctgtttacga aagttaggtg tcacttgtgg 4681 aagcggaccc agacacggtt gatctcaccc ttcggggggc tatagagatc gctggaagca 4741 ctaccggaca accggaacat tgcagaaatg cagccc // LOCUS THTRR5S 122 bp ss-rRNA RNA 01-AUG-1990 DEFINITION T.nivea 5S ribosomal RNA. ACCESSION M35563 KEYWORDS 5S ribosomal RNA. SOURCE T.nivea (strain JP2) rRNA. ORGANISM Thiothrix nivea Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Nonphotosynthetic, nonfruiting gliding bacteria; Cytophagales; Leucotrichaceae. REFERENCE 1 (bases 1 to 122) AUTHORS Stahl,D.A., Lane,D.J., Olsen,G.J., Heller,D.J., Schmidt,T.M. and Pace,N.R. TITLE Phylogenetic analysis of certain sulfide-oxidizing and related morphologically conspicuous bacteria by 5S ribosomal ribonucleic acid sequences JOURNAL Int. J. Syst. Bacteriol. 37, 116-122 (1987) STANDARD simple staff_review FEATURES from to/span description rRNA 1 122 5S ribosomal RNA BASE COUNT 27 a 35 c 36 g 24 t ORIGIN 1 tttgcctggt gtccatagag cactggaacc acctgatccc atcccgaact cagaagtgaa 61 acggtgcatc gccgatggta gtgtggggcc tccccatgtg agagtaggtc aacgccaggc 121 gc // LOCUS THVRR5S 123 bp ss-rRNA RNA 01-AUG-1990 DEFINITION Thiovulum sp. 5S ribosomal RNA. ACCESSION M35570 KEYWORDS 5S ribosomal RNA. SOURCE Thiovulum sp. rRNA. ORGANISM Thiovulum sp. Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Colorless sulfur bacteria. REFERENCE 1 (bases 1 to 123) AUTHORS Stahl,D.A., Lane,D.J., Olsen,G.J., Heller,D.J., Schmidt,T.M. and Pace,N.R. TITLE Phylogenetic analysis of certain sulfide-oxidizing and related morphologically conspicuous bacteria by 5S ribosomal ribonucleic acid sequences JOURNAL Int. J. Syst. Bacteriol. 37, 116-122 (1987) STANDARD simple staff_review FEATURES from to/span description rRNA 1 123 5S ribosomal RNA BASE COUNT 30 a 28 c 30 g 35 t ORIGIN 1 tttggttggt gattacagag aaaaggtcac actcagctcc atttcgaacc tgaaagttaa 61 gcttttcttc gtcgataata ctgcccccta cgggggtggg acggtagatc gttgccaacc 121 att // LOCUS VITRR5S 118 bp ss-rRNA RNA 01-AUG-1990 DEFINITION V.beggiatoides 5S ribosomal RNA. ACCESSION M35566 KEYWORDS 5S ribosomal RNA. SOURCE V.beggiatoides (strain B23SS) rRNA. ORGANISM Vitreoscilla beggiatoides Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Nonphotosynthetic, nonfruiting gliding bacteria; Cytophagales; Beggiatoaceae. REFERENCE 1 (bases 1 to 118) AUTHORS Stahl,D.A., Lane,D.J., Olsen,G.J., Heller,D.J., Schmidt,T.M. and Pace,N.R. TITLE Phylogenetic analysis of certain sulfide-oxidizing and related morphologically conspicuous bacteria by 5S ribosomal ribonucleic acid sequences JOURNAL Int. J. Syst. Bacteriol. 37, 116-122 (1987) STANDARD simple staff_review FEATURES from to/span description rRNA 1 118 5S ribosomal RNA BASE COUNT 30 a 35 c 31 g 22 t ORIGIN 1 cgcctgacga ccacagcgac tgtgaaccac ccgaccccat ctcgaactcg gtagtgaaac 61 cagtcagcgc cgatgatagt gtggcatatg ccatgtgaaa gtaggtcatc gtcaggct // LOCUS VITRR5SX 118 bp ss-rRNA RNA 01-AUG-1990 DEFINITION V.stercoraria 5S ribosomal RNA. ACCESSION M35567 KEYWORDS 5S ribosomal RNA. SOURCE V.stercoraria (strain VT1) rRNA. ORGANISM Vitreoscilla stercoraria Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Nonphotosynthetic, nonfruiting gliding bacteria; Cytophagales; Beggiatoaceae. REFERENCE 1 (bases 1 to 118) AUTHORS Stahl,D.A., Lane,D.J., Olsen,G.J., Heller,D.J., Schmidt,T.M. and Pace,N.R. TITLE Phylogenetic analysis of certain sulfide-oxidizing and related morphologically conspicuous bacteria by 5S ribosomal ribonucleic acid sequences JOURNAL Int. J. Syst. Bacteriol. 37, 116-122 (1987) STANDARD simple staff_review FEATURES from to/span description rRNA 1 118 5S ribosomal RNA BASE COUNT 30 a 32 c 30 g 26 t ORIGIN 1 tgtttgacga ccatagcgag ttggtcccac gccttcccat cccgaacagg accgtgaaac 61 gacttagcgc cgatgatagt gtggattacc catgtgaaag taggtcatcg tcaaacgc // LOCUS VITRR5SXX 116 bp ss-rRNA RNA 01-AUG-1990 DEFINITION V.filiformis 5S ribosomal RNA. ACCESSION M35568 KEYWORDS 5S ribosomal RNA. SOURCE V.filiformis (strain ATCC 15551) rRNA. ORGANISM Vitreoscilla filiformis Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Nonphotosynthetic, nonfruiting gliding bacteria; Cytophagales; Beggiatoaceae. REFERENCE 1 (bases 1 to 116) AUTHORS Stahl,D.A., Lane,D.J., Olsen,G.J., Heller,D.J., Schmidt,T.M. and Pace,N.R. TITLE Phylogenetic analysis of certain sulfide-oxidizing and related morphologically conspicuous bacteria by 5S ribosomal ribonucleic acid sequences JOURNAL Int. J. Syst. Bacteriol. 37, 116-122 (1987) STANDARD simple staff_review FEATURES from to/span description rRNA 1 116 5S ribosomal RNA BASE COUNT 27 a 34 c 31 g 24 t ORIGIN 1 gcctgatgac catagcaagg tggtcccact ccttcccatc ccgaacagga cagtgaaacg 61 ccttagcgcc gatgatagtg cggttctccc gtgtgaaagt aggacatcgt caggct // LOCUS PVICSA 1895 bp ds-DNA INV 01-AUG-1990 DEFINITION Plasmodium vivax circumsporozoite protein gene, complete cds. ACCESSION M11926 M20671 J04090 KEYWORDS circumsporozoite protein. SOURCE P.vivax (strain Belem) DNA. ORGANISM Plasmodium vivax Eukaryota; Animalia; Protozoa; Microspora; Microsporea; Microsporida; Haemosporina; Plasmodiidae. REFERENCE 1 (bases 1 to 1529) AUTHORS Arnot,D.E., Barnwell,J.W., Tam,J.P., Nussenzweig,V., Nussenzweig,R.S. and Enea,V. TITLE Circumsporozoite protein of Plasmodium vivax: Gene cloning and characterization of the immunodominant epitope JOURNAL Science 230, 815-818 (1985) STANDARD simple staff_review REFERENCE 2 (bases 158 to 1294; revises [1]) AUTHORS Arnot,D.E., Barnwell,J.W. and Stewart,M.J. TITLE Does biased gene conversion influence polymorphism in the circumsporozoite protein-encoding gene of Plasmodium vivax? JOURNAL Proc. Natl. Acad. Sci. U.S.A. 85, 8102-8106 (1988) STANDARD full staff_entry REFERENCE 3 (bases 1 to 157; 1295 to 1895; revises [1]) AUTHORS Arnot,D.E. JOURNAL Unpublished (1988) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [2],[3] kindly submitted by D.E.Arnot, 14-SEP-1988. FEATURES from to/span description pept 158 1294 circumsporozoite protein BASE COUNT 674 a 347 c 471 g 403 t ORIGIN 1 ctgcataagg caaactcaca aacatccaaa aaaatataca tatatatatt tatatacacg 61 tgtatatatt attaagcggc ttaagttaag caagcaaaac agccaaaggc ctacaagtgt 121 aaacagcttc ctgcacacac gtatatacca gaacaagatg aagaacttca ttctcttggc 181 tgtttcttcc atcctgttgg tggacttgtt ccccacgcac tgcgggcaca atgtagatct 241 gtccaaggcc ataaatttaa atggagtaaa cttcaataat gtagacgcca gttcacttgg 301 cgcggcacac gtaggacaaa gtgctagccg aggcagagga cttggtgaga acccagatga 361 cgaggaagga gatgctaaaa aaaaaaagga tggaaagaaa gcagaaccaa aaaatccacg 421 tgaaaataag ctgaaacaac caggagacag agcagatgga cagccagcag gagacagagc 481 agatggacag ccagcaggtg atagagcaga tggacaacca gcaggagata gagcagctgg 541 acaaccagca ggagatagag cagatggaca gccagcagga gacagagcag atggacagcc 601 agcaggagac agagcagatg gacaaccagc aggagacaga gcagatggac aaccagcagg 661 tgatagagca gctggacaac cagcaggtga tagagcagct ggacaaccag caggagatag 721 agcagatgga cagccagcag gagatagagc agctggacag ccagcaggag atagagcaga 781 tggacagcca gcaggagata gagcagctgg acagccagca ggagatagag cagatggaca 841 gccagcagga gatagagcag ctggacagcc agcaggagat agagcagctg gacagccagc 901 aggagataga gcagctggac agccagcagg agatagagca gctggacagc cagcaggaaa 961 tggtgcaggt ggacaggcag caggaggaaa cgcaggagga ggacagggac aaaataatga 1021 aggtgcgaat gccccaaatg aaaagtctgt gaaagaatac ctagataaag ttagagctac 1081 cgttggcacc gaatggactc catgcagtgt aacctgtgga gtgggtgtaa gagtcagaag 1141 aagagttaat gcagctaaca aaaaaccaga ggatcttact ttgaatgacc ttgagactga 1201 tgtttgtaca atggataagt gtgctggcat atttaacgtt gtgagtaatt cattagggct 1261 agtcatattg ttagtcctag cattattcaa ttaagtagct gacatccatt attttcggcg 1321 tcctccacgg tgcatattaa gtgttttgtg ttttgtacat gcacataaat acttgcccgt 1381 agggacatga tttttttccc tttcttatga atgttccctg ctgtttgcac gtaactgtat 1441 gtacgtgcgc gtaaggcata gtaagtaaca cctcttacac attatgcgct tacgcacaat 1501 cagttgtgca attctagaaa acacgatatg agtattttta aacacttatc gtccaaaaaa 1561 acaaaaaaaa cagaaaaaac agaaaaaaca gaaaaaacaa aaaaaaacaa aaaaaaacaa 1621 aaaaaaacaa aaaaaacaca tttatattaa cttttccttt ttgattgacc cttttttgac 1681 gtatattttt tttttttttt cgtatgtatt atatatactg cttaacgtag agaacttaaa 1741 ttttgagaat gtattttttt ttaacaagtt aaaaaaagaa ctggtatttt tgggaattca 1801 aaaaatttgc aaattcaaaa gaggcgagtt aaaatttgcg ccgtggcaaa cggggtgcgt 1861 gcgggagtcg tgcaaatgtg gcttatatcc ggggg // LOCUS PVICSC 1375 bp ds-DNA INV 01-AUG-1990 DEFINITION Plasmodium vivax circumsporozoite protein gene, 3' end. ACCESSION M20670 J04090 KEYWORDS circumsporozoite protein. SOURCE P.vivax (strain North Korean) DNA. ORGANISM Plasmodium vivax Eukaryota; Animalia; Protozoa; Microspora; Microsporea; Microsporida; Haemosporina; Plasmodiidae. REFERENCE 1 (bases 1 to 1105) AUTHORS Arnot,D.E., Barnwell,J.W. and Stewart,M.J. TITLE Does biased gene conversion influence polymorphism in the circumsporozoite protein-encoding gene of Plasmodium vivax? JOURNAL Proc. Natl. Acad. Sci. U.S.A. 85, 8102-8106 (1988) STANDARD full staff_entry REFERENCE 2 (bases 1106 to 1375) AUTHORS Arnot,D.E. JOURNAL Unpublished (1988) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by D.E.Arnot, 14-SEP-1988. FEATURES from to/span description pept < 1 1105 circumsporozoite protein (AA at 2) BASE COUNT 464 a 260 c 407 g 244 t ORIGIN Sau3AI site. 1 agatctgtcc aaggccataa atttaaatgg agtaaacttc aataatgtag acgccagttc 61 acttggcgcg gcacacgtag gacaaagtgc tagccgaggc agaggacttg gtgagaaccc 121 agatgacgag gaaggagatg ctaaaaaaaa aaaggatgga aagaaagcag aaccaaaaaa 181 tccacgtgaa aataagctga aacaaccagg agacagagca gatggacagc cagcaggaga 241 cagagcagat ggacagccag caggagacag agcagatgga caggcagcag gaaatggtgc 301 aggtggacag ccagcaggtg atagagcagc tggacaacca gcaggcgatg gagcagctgg 361 acagccagca ggcgatagag cagatggaca gccagcagga gatagagcag ctggacagcc 421 agcaggcgat agagcagatg gacagccagc aggagataga gcagctggac agccagcagg 481 cgatagagca gatggacagc cagcaggaga tagagcagct ggacaggcag caggaaatgg 541 tgcaggtgga caggcagcag gaaatggtgc aggtggacaa ccagcaggag atagagcagc 601 tggacagcca gcaggagata gagcagctgg acagccagca ggagatagag cagctggaca 661 gccagcagga gatagagcag ctggacagcc agcaggagat agagcagctg gacaggcagc 721 aggaaatggt gcaggtggac aggcagcagg aggaaatgcg gcaaacaaga aggcagaaga 781 cgcaggagga aacgcaggag gaaacgcagg aggacaggga caaaataatg aaggtgcgaa 841 tgccccaaat gaaaagtctg tgaaagaata cctagataaa gttagagcta ccgttggcac 901 cgaatggact ccatgcagtg taacctgtgg agtgggtgta agagtcagaa gaagagttaa 961 tgcagctaac aaaaaaccag aggatcttac tttgaatgac cttgagactg atgtttgtac 1021 aatggataag tgtgctggca tatttaacgt tgtgagtaat tcattagggc tagtcatatt 1081 gttagtccta gcattattca attaagtagc tgacatccat tattttcggc gtcctccacg 1141 gtgcatatta agtgttttgt gttttgtaca tgcacataaa tacttgcccg tagggacatg 1201 atttttttcc ctttcttatg aatgttccct gctgtttgca cgtaactgta tgtacgtgcg 1261 cgtaaggcat agtaagtaac acctcttaca cattatgcgt tacgcacaat cagttgtgca 1321 attctagaaa acacgatatg agtattttta aacacttatc gtgaccaaaa aaaca // LOCUS ECOHSEST 360 bp ds-DNA BCT 01-AUG-1990 DEFINITION E.coli heat-stable enterotoxin gene, complete cds. ACCESSION M34916 KEYWORDS heat-stable enterotoxin. SOURCE E.coli (strain 153837-2) DNA. ORGANISM Escherichia coli Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Facultatively anaerobic rods; Enterobacteriaceae. REFERENCE 1 (bases 1 to 360) AUTHORS Moseley,S.L., Hardy,J.W., Huq,M.I., Echeverria,P. and Falkow,S. TITLE Isolation and nucleotide sequence determination of a gene encoding a heat-stable enterotoxin of Escherichia coli JOURNAL Infect. Immun. 39, 1167-1174 (1983) STANDARD simple staff_review FEATURES from to/span description pept 48 266 heat-stable enterotoxin signal 268 301 pot. transcription termination signal BASE COUNT 115 a 54 c 65 g 126 t ORIGIN 1 ttctggtttt gattcaaatg ttcgtggatg ccatgtccgg aggtaatatg aagaaatcaa 61 tattatttat ttttctttct gtattgtctt tttcaccttt ccctcaggat gctaaaccag 121 tagagtcttc aaaagaaaaa atcacactag aatcaaaaaa atgtaacatt gcaaaaaaaa 181 gtaataaaag tggtcctgaa agcatgaata gtagcaatta ctgctgtgaa ttgtgttgta 241 atcctgcttg taccgggtgc tattaataat ataaagggaa ctaaacagtt ccctttatat 301 ttgttctgat tctgatgatg tctgtaacgt atgtacctgt tgctttgttg aataaatcga // LOCUS HUMRENA1 826 bp ds-DNA PRI 01-AUG-1990 DEFINITION Human renin gene, exon 1. ACCESSION M10030 M34914 KEYWORDS aspartyl protease; renin. SEGMENT 1 of 5 SOURCE Human fetal liver DNA (library of Lawn et al.), clone lambda-III. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 826) AUTHORS Hardman,J.A., Hort,Y.J., Catanzaro,D.F., Tellam,J.T., Baxter,J.D., Morris,B.J. and Shine,J. TITLE Primary structure of the human renin gene JOURNAL DNA 3, 457-468 (1984) STANDARD full staff_review REFERENCE 2 (bases 276 to 583) AUTHORS Shine,J., Hardman,J.A., Hort,Y.J., Tellam,J.T., Catanzaro,D.F., Morris,B.J. and Baxter,J.D. TITLE Structure of the human renin gene JOURNAL Trans Assoc Am Physicians 97, 63-69 (1984) STANDARD simple staff_review COMMENT There is only a single renin gene in the human haploid genome [1]. It is comprised of 10 exons encoding 406 amino acids. The first intron separates the 5' untranslated region and the signal peptide coding region from the remainder of the gene. Exon 2 comprises most of the sequence coding for the pro portion of the enzyme. Precise boundaries were not indicated by in figure 2 of [1], but were taken from the text and from other human renin entries. FEATURES from to/span description pept 626 + 723 preprorenin /hgml_locus_uid="LW0050B" /nomgen="REN" /map="1q32" sigp 626 685 renin signal peptide pre-msg 584 > 826 renin mRNA [1] IVS 724 > 826 renin intron A site 520 521 ga in [1]; gagca in [2] BASE COUNT 190 a 226 c 222 g 188 t ORIGIN Chromosome 1q32; 437 bp upstream of KpnI site. 1 gatctaccca ccttggcctc ccaaagtgct gggacaggtg tgagccacca tgcctggccc 61 ctctactctt ataattaaac cagctgttgc ttttcctgcc aagaaaccag tcatgaagat 121 tcacccatgt tctagatggg aaaactgggc tgtagctggg agaggccagt cagggacaaa 181 gccaaagtta atatagagaa tggagcttcc agggtatagg ggttgggtct gggctaggga 241 gctggaaacc taggttttac gcttgtccca gttttgatgt tagccctgac agtgctgttt 301 ctcatcagcc tctgcctgct ccaggggtca cagggccaag ccagatagag ggctgctagc 361 gtcactggac acaagattgc tttcccacag ctgtccttcc tccagcccct ctgctcccca 421 tccggaaacc tgggtaccct tcacccacct agctctgtcc cgcagtgaga tttattgctg 481 actgccctgc catctacccc agggtaataa atcagggcag agcagaattg caatcacccc 541 atgcatggag tgtataaaag gggaagggct aagggagcca cagaacctca gtggatctca 601 gagagagccc cagactgagg gaagcatgga tggatggaga aggatgcctc gctggggact 661 gctgctgctg ctctggggct cctgtacctt tggtctcccg acagacacca ccacctttaa 721 acggtaattg gtaactcagg cagagaaggg gtgggcaggg gtgtaggttc ccaccttccc 781 aacaccctgg cttttccaca tgcggtgtca ttcagtcctt acgatc // LOCUS HUMRENA2 373 bp ds-DNA PRI 01-AUG-1990 DEFINITION Human renin gene, exon 2. ACCESSION M10128 KEYWORDS renin. SEGMENT 2 of 5 SOURCE Human fetal liver DNA, clones lambda-[III,V]. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 373) AUTHORS Hardman,J.A., Hort,Y.J., Catanzaro,D.F., Tellam,J.T., Baxter,J.D., Morris,B.J. and Shine,J. TITLE Primary structure of the human renin gene JOURNAL DNA 3, 457-468 (1984) STANDARD full staff_review FEATURES from to/span description pept + 105 + 255 preprorenin, exon 2 /nomgen="REN" /map="1q32" /hgml_locus_uid="LW0050B" matp 205 + 255 renin pre-msg < 1 > 373 renin mRNA IVS < 1 104 renin intron A IVS 256 > 373 renin intron B BASE COUNT 79 a 107 c 96 g 91 t ORIGIN Chromosome 1q32; about 4.8 kb after segment 1. 1 aacgttaaag gtggttgtac taaagagagg ggtttggcct cagggactca catgtggtgg 61 aggtacagca cttttctatt tttgcttcct ccaccctggg ccaggatctt cctcaagaga 121 atgccctcaa tccgagaaag cctgaaggaa cgaggtgtgg acatggccag gcttggtccc 181 gagtggagcc aacccatgaa gaggctgaca cttggcaaca ccacctcctc cgtgatcctc 241 accaactaca tggacgtgag tgcttggctc agcccctcgc tccctccctg tctcctttcc 301 ctcatggacc tagggctttc tttgctgcaa gactcaccct ttccaagctg tgtttgacga 361 aggcgctgag tag // LOCUS HUMRENA3 2480 bp ds-DNA PRI 01-AUG-1990 DEFINITION Human renin gene, exons 3, 4 and 5. ACCESSION M10150 KEYWORDS renin. SEGMENT 3 of 5 SOURCE Human fetal liver DNA, clones lambda-[III,V]. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 2480) AUTHORS Hardman,J.A., Hort,Y.J., Catanzaro,D.F., Tellam,J.T., Baxter,J.D., Morris,B.J. and Shine,J. TITLE Primary structure of the human renin gene JOURNAL DNA 3, 457-468 (1984) STANDARD full staff_review FEATURES from to/span description pept + 140 263 preprorenin, exon 3 /nomgen="REN" /map="1q32" /hgml_locus_uid="LW0050B" 879 997 preprorenin, exon 4 1949 + 2145 preprorenin, exon 5 matp + 140 263 renin 879 997 renin 1949 + 2145 renin pre-msg < 1 > 2480 renin mRNA IVS < 1 139 renin intron B IVS 264 878 renin intron C IVS 998 1948 renin intron D IVS 2146 > 2480 renin intron E BASE COUNT 568 a 710 c 622 g 579 t 1 others ORIGIN Chromosome 1q32; about 0.4 kb after segment 2. 1 ctgcaggaaa atggaaaccc cgacaggtat aggacctcgc ctggggcaag tctacacccg 61 agagccaaga gtgaagccag gcaagacccc aagcccaagg tcccctgagc ccctccagcc 121 ctctcttttt accccacaga cccagtacta tggcgagatt gggatcggga ccccacccca 181 aaccttcaaa gtcgtctttg acactggttc gtccaatgtt tgggtgccct cctccaagtg 241 cagccgtctc tacactgcct gtggtgagac ctaagaccca cagtgcctct cctccatccc 301 cctgccctac tgtgcatgag caatcctgcc caacacccag ctcccatccc tcttgccacc 361 aagggagtgg cttcctctct gcctctgtgc ccactgacat gtaggggaga ggggaagatg 421 tctcccgttt ttctgataca gccaccaagg ttaaaaacaa aaaaaggtcc aagaacccct 481 gagnacccag gaggaccagt tcccagtcgt cctgagattg agacaggact gaattctcaa 541 acccatccca ggcactcgga actcttccat ccctagtctt aatcaacaac ctcttactag 601 cacttactct gtgcctggca tacttctctg gtgttatcag tggttagtga ttactttaaa 661 ttccttcatt taggacaaaa ttctcgatgt atgggacact taggagagcc caagaaaccc 721 agtccttgat tgatgaagca catattccaa gccccctgac cctagggcca ctcatccctg 781 cacctaagct aaccagccat acccacaatg caccctgcct ctgagtcccc ctgtctgggc 841 cactcttgga caaacctgag cctctgtccc cctgccagtg tatcacaagc tcttcgatgc 901 ttcggattcc tccagctaca agcacaatgg aacagaactc accctccgct attcaacagg 961 gacagtcagt ggctttctca gccaggacat catcaccgta agttgggccg ccctaggtca 1021 tctgccccgg accccttctg tccccaggcc tctcctgacc caccagggcc cacacctgcg 1081 gggaggtaca ctgcagccca cttggagcct ggggagctga ggaacaccct actctgccac 1141 atctggtgtt gaaagcagca gtacctatgg gggagcaagc ctgggctacg ggctcaccgt 1201 tgggtggttt gtggatgttt ttgcatctaa cttgcatgta gggctgtcct gagccccgtg 1261 gctgcagtca agtaactcgt cccaagttca ccagctctga ctggggctac taccctagac 1321 tgaaatcctg ggtcagagtc aggctatttt agggtcaggc atagttttaa ggtcacatta 1381 gttgactctg ggactcaggt caaggctctc ttttcttttc catgtggccc atgtctgacc 1441 gtttcctcat cctggagttt ctcaggccct gctccatcag agttagggga ggggcacacg 1501 tggcacctga gaggaaatca gggtgattcc tgcctccctt cctttttctg ttgaactctg 1561 atataaagga ggaagaaggg caagcttgtc tgtgctaaag aaacccttcg cccatgataa 1621 gggtggggcc aagacccagt cctgccaggc acgaaagtct ggccactggg gaggggagga 1681 gctcttggac ttttcttttg cgcttggcag gaccaccctc tcagcctctg ctctccgatc 1741 cctggtcaac tctagctctc tctgggctcc gcagcagaga tgtgtattgg cacagagtgt 1801 gtgcgtgcag ggttgaggca atactcttac cccgatttct gtaccctgga gcatgtgtgc 1861 ccctgggatc cctagtgtgg atgcccagac cagactccaa ccaaggaggg gcagtgggct 1921 tggtctccta tggtccttcc tcccacaggt gggtggaatc acggtgacac agatgtttgg 1981 agaggtcacg gagatgcccg ccttaccctt catgctggcc gagtttgatg gggttgtggg 2041 catgggcttc attgaacagg ccattggcag ggtcacccct atcttcgaca acatcatctc 2101 ccaaggggtg ctaaaagagg acgtcttctc tttctactac aacaggtggg gactgggact 2161 ccaagggctg aggtgggggg caggagggga gaagagatgg ggagtggaag gagagtctgg 2221 gccagaattg taaagtgttt gtaacttagg tgacagccaa tcaatatcta gagctgtact 2281 agccaatatg gaaggcacta ttgcaaattt aaacttaact taaatacagc ttaagcatca 2341 attaagcatt caactggctg gcctcttagt tgtactagcc acagctcaat gcctggcagc 2401 cacggtggct agtaactaca gtctagtaca gtgcagatag agatatccag catgacagga 2461 catctataga cagcgccact // LOCUS HUMRENA4 3057 bp ds-DNA PRI 01-AUG-1990 DEFINITION Human renin gene, exons 6, 7, 8, and 9. ACCESSION M10151 KEYWORDS renin. SEGMENT 4 of 5 SOURCE Human fetal liver DNA, clones lambda-[III,V]. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 3057) AUTHORS Hardman,J.A., Hort,Y.J., Catanzaro,D.F., Tellam,J.T., Baxter,J.D., Morris,B.J. and Shine,J. TITLE Primary structure of the human renin gene JOURNAL DNA 3, 457-468 (1984) STANDARD full staff_review FEATURES from to/span description pept + 998 1006 preprorenin, exon 6 /nomgen="REN" /map="1q32" /hgml_locus_uid="LW0050B" 1572 1691 preprorenin, exon 7 2061 2202 preprorenin, exon 8 2466 + 2564 preprorenin, exon 9 matp + 998 1006 renin 1572 1691 renin 2061 2202 renin 2466 + 2564 renin pre-msg < 1 > 3057 renin mRNA IVS < 1 997 renin intron E IVS 1007 1571 renin intron F IVS 1692 2060 renin intron G IVS 2203 2465 renin intron H IVS 2565 > 3057 renin intron I BASE COUNT 750 a 799 c 825 g 683 t ORIGIN Chromosome 1q32; about 0.6 kb after segment 3. 1 aaaagaatag aggaggatca gagttcagag aaatctcaca gtaaaatgga gaggagtctc 61 cggtttggtg atagaaagtg aggccttgag aaaaggccaa ttggcggctc tgcattcagg 121 ggtggtcttt agaagaactg ttttagagga ggtgggggca aggccagatg gcaagaagtt 181 aagaggtgga cgacgtgggt gtcaggaagt ggaggtcatg agatgtacgc tgccctggga 241 cattcaacag ggaagggaat ggggggtggc gtgggggggt gagatccaga agcagaagag 301 gaagggtggg tgtttttaaa tgctagagga tgctcgagtg atcgcctgta ggtggaggaa 361 gaacccaata gaaagaaaga gattaaaaat gtggaaagaa gaggagctaa atgggggcac 421 tggagtttag aggccttgaa agagatgagg aaccagcaga taggaagaag ccaggtttta 481 cagaggagag ggctggcctc ttcttttatc ttgggatggg aaggagggaa catccagaga 541 gatactgaag tgttgagaga caggcaggag ggaatttgtg ctagcatata cacatacgag 601 ttccgaattt ataaaaacac aagtagtttg cagttgcaca aaataacata tgcacaccta 661 cacacccatg cacacatgtg catgtgaatt ctggaaaaac acatcacaca cacaggcatg 721 ccctggagac taggcctaca gtagtccctg agccaagtgc agtgaggagg aaaggaaggt 781 gaggggaatc atctccagac ggggcaccag gagcctggct ccagtccccc acttgttcac 841 tcatggactg ggtaacttca ggcaagtgac ttcgcctctt ggtgactcca ttgcctgaag 901 ggcaaagaga gtacataaca cccaccctgc caaacagcag ggtgatgagg ctggcatgaa 961 atgaagcttc ctttctgctg tctctctttc tctgcagaga ttccgagtaa ggagacaaaa 1021 cccccacatg gctgtgacct tccagtattc cccgagcacc tgacctagaa ttacacacgc 1081 caccggccca aaactcacat cagcaagtcc cagcctccgc tagatgccga agttctctgt 1141 ctctccttcc tgctctctcc atgccacctg cccaccccat acccaatagc ctccccaggg 1201 tcccctccca tgcacctgct caatcagcag caacccaaga gtgaggggtg tccatttgtg 1261 tcttgttcac atccactcac tgtccttgta cctgctcctt ttctgtgacc tctctgggga 1321 tgctttttgg gggaacagct ggactaccct ggaacaacct ctggttggtc ttggggaggg 1381 gaagaaaggc agagaagcag tatgttctgc atgcttccca acgacagctc cgagcctggc 1441 tgtctgtccc acattcctct gctctagagc cctctgtcct cccctcgacc cttgtgcaac 1501 cttccccaat tgcctgagtt gctgggtcct ggaggttatg ggtttccaag agcttctgat 1561 ctttccttta ggaattccca atcgctggga ggacagattg tgctgggagg cagcgacccc 1621 cagcattacg aagggaattt ccactatatc aacctcatca agactggtgt ctggcagatt 1681 caaatgaagg ggtcagaaat cctcagaccc tccccgggct ccaaaaaatg ctgccgtcac 1741 tggggttggg gagggcgggc gcggactgca ttaccatcct gccctctttc caaatgcagc 1801 cacttcttaa gcacagccac catttgctct ctgcctggct ctggtccagg ctggggcaga 1861 gagaagggag gggcctgggc cggagtggtg gaggccgaga gtaccttccc tcctctactc 1921 actgcctcaa cagccagcca gcgtggcgct ccacccaccc acccaccact caggaaggac 1981 atgcagcctg gcgtgcccat cagccttctg tctgtctgtc tgtctgtctg tctctctgtc 2041 tgactgtggc gctcccccag ggtgtctgtg gggtcatcca ccttgctctg tgaagacggc 2101 tgcctggcat tggtagacac cggtgcatcc tacatctcag gttctaccag ctccatagag 2161 aagctcatgg aggccttggg agccaagaag aggctgtttg atgtaagaag ccaaagaggg 2221 aaggtgctgt gggtgtgggg agcggccacc tggtatcggc tcacaaatcc cccaggcaaa 2281 tgaggccatc tcaggccttc gcttgttcac ctcacactct ccacacatgt ggctggtcac 2341 ccatggggcg gggcactgtc cccagccctc tccagcagag agacccaggg ccaccagcgc 2401 aggactcctt gtctgctgag acgtcgttcc atactcaaga aggctctctt tgccccccac 2461 cccagtatgt cgtgaagtgt aacgagggcc ctacactccc cgacatctct ttccacctgg 2521 gaggcaaaga atacacgctc accagcgcgg actatgtatt tcaggtgagg ttcgagtcgg 2581 ccccctcggt ggcagggaga aaggctggac agagaccctc aagagtgaca gattacaatg 2641 cacagatcat gttagaactg tagttctcaa acttggctgt gcatgtcacc tggagagctt 2701 tggaaaaatc caggtacctg ggccacatcc catacctatt aaatcagaac ctctagaagt 2761 gggacctggg gttcagtttc cccagatgat tccaatgtgt ggccatgttt gggcatcact 2821 atgcctgttc cctcatctcc attttctcat caaatactcc caataatcct atgctcctat 2881 attcttaccc tcttttcata atcaataggc ttagagaatt tgaataactt gtctaggatc 2941 agaagctaag gcaaactgta agctcctgaa ggaagcacgt tgcctgatgc cctgtttgcc 3001 tgggatctag cacaggggct aaacatagga atggtgcagt ccacgatggg gcaaaat // LOCUS HUMRENA5 763 bp ds-DNA PRI 01-AUG-1990 DEFINITION Human renin gene, exon 10. ACCESSION M10152 KEYWORDS renin. SEGMENT 5 of 5 SOURCE Human fetal liver DNA, clone lambda-V. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 763) AUTHORS Hardman,J.A., Hort,Y.J., Catanzaro,D.F., Tellam,J.T., Baxter,J.D., Morris,B.J. and Shine,J. TITLE Primary structure of the human renin gene JOURNAL DNA 3, 457-468 (1984) STANDARD full staff_review COMMENT A poly-adenylation signal is located at positions 357-362. FEATURES from to/span description pept + 22 183 preprorenin, exon 10 /nomgen="REN" /map="1q32" /hgml_locus_uid="LW0050B" matp + 22 180 renin pre-msg < 1 > 183 renin mRNA IVS < 1 21 renin intron I BASE COUNT 170 a 236 c 194 g 163 t ORIGIN Chromosome 1q32; about 0.6 kb after segment 4. 1 aaaactctcc ccctctgcca ggaatcctac agtagtaaaa agctgtgcac actggccatc 61 cacgccatgg atatcccgcc acccactgga cccacctggg ccctgggggc caccttcatc 121 cgaaagttct acacagagtt tgatcggcgt aacaaccgca ttggcttcgc cttggcccgc 181 tgaggccctc tgccacccag gcaggccctg ccttcagccc tggcccagag ctggaacact 241 ctctgagatg cccctctgcc tgccttatgc cctcagatgg agacattgga tgtggagctc 301 ctgctggatg cgtgccctga cccctcacag cccttccctg ctttgaggac aaagagaata 361 aagacttcat gttcacagcc tgttgcatct gggttcacta gggtttagaa cagagggagg 421 ggctgcgtga tcatgtgtgg acaggaatgt gacacagaca agctacacat tagcctaggc 481 cacaggttct tgcgtgcagg gatgatgcca tccatctgcc atcaacggga ctcaggtgga 541 gctgttacac aacctcaggt gggaagtctg aaaagagccg gaaccaagct ccctgctatc 601 gactcaggga ccaaggcgta atgctgtggc gagtagactg gggtcagaaa gttgtcccag 661 ctcacagaag ccagctctga gttcagactc tgctctgctg agctagtcag ccctgtctct 721 tgtccctgca aaactcccct cacctgtcct tatccacctg cag // LOCUS SYNT1RNAA 324 bp ds-DNA SYN 01-AUG-1990 DEFINITION Synthetic ribonuclease T1 gene, 3' end. ACCESSION M37098 M35733 M35736 KEYWORDS ribonuclease T1. SOURCE Synthetic DNA. ORGANISM Artificial gene Artificial sequences; Genes. REFERENCE 1 (bases 1 to 324) AUTHORS Ikehara,M., Ohtsuka,E., Uesugi,S., Kikyodani,T., Aoyama,Y., Tokunaga,T. and Fujimoto,K. TITLE Synthesis and expression of RNase T1 gene JOURNAL Nucleic Acids Symp Ser 15, 197-200 (1984) STANDARD simple staff_review REFERENCE 2 (bases 1 to 324) AUTHORS Nishikawa,S., Morioka,H., Tokunaga,T., Aoyama,Y., Kikyotani,S., Fujimoto,K., Yanase,K., Tanaka,T., Uesugi,S., Ohtsuka,E. and Ikehara,M. TITLE Synthesis and expression of the native RNase T1 gene and several mutant genes JOURNAL Nucleic Acids Symp Ser 16, 287-290 (1985) STANDARD simple staff_review FEATURES from to/span description pept < 1 321 ribonuclease T1 precursor (AA at 1) sigp < 1 6 ribonuclease T1 signal peptide matp 7 318 ribonuclease T1 BASE COUNT 73 a 98 c 75 g 78 t ORIGIN 1 ttcatggctt gcgactacac ctgcggcagc aactgctact ctagctctga cgtttctacc 61 gctcaggctg ctggctacca gctgcacgag gacggcgaaa ccgttggctc taactcttac 121 ccgcacaaat acaacaacta tgagggcttc gactttagcg tttcttctcc gtactacgaa 181 tggccgatcc tgtctagcgg cgacgtttac tccggtccag gtagcggtgc tgaccgtgta 241 gtattcaacg aaaacaacca gctcgctggc gttatcaccc acaccggcgc ttctggcaac 301 aactttgtag aatgcaccta atag // LOCUS TIPCDREG 209 bp ds-DNA BCT 01-AUG-1990 DEFINITION Plasmid pTiC58 promoter-active fragment CD25 DNA. ACCESSION M35735 KEYWORDS . SOURCE Plasmid pTiC58 DNA. ORGANISM Plasmid pTiC58 Prokaryota; Bacteria. REFERENCE 1 (bases 1 to 209) AUTHORS Tait,R.C. and Kado,C.I. TITLE Regulation of the virC and virD promoters of pTiC58 by the ros chromosomal mutation of Agrobacterium tumefaciens JOURNAL Mol. Microbiol. 2, 385-392 (1988) STANDARD simple staff_review BASE COUNT 60 a 37 c 41 g 71 t ORIGIN 1 gtcgacccgg gatccgcggc gataattcat aagtaatgta gtaattacct gattttatat 61 ttcaatttta ttgtaatata atttcaattg taataatata aaaataaata tcccttatgt 121 gttcttgatt tcgttttgta tatggctaga ttcccatctg ccacgacgag gaaatgctac 181 ggcggggcaa gttcagatcc cgggtcgac // LOCUS FIBGLUC 1426 bp ds-DNA BCT 01-AUG-1990 DEFINITION F.succinogenes 1,3-1,4-beta-D-glucan 4-glucanohydrolase gene, complete cds. ACCESSION M33676 M33311 KEYWORDS 1,3-1,4-beta-D-glucan 4-glucanohydrolase; mised-linkage beta-glucanase. SOURCE F.succinogenes (strain S85) DNA, clone PJI5. ORGANISM Fibrobacter succinogenes Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Sulfate- or sulfur-reducing dissimilatory bacteria. REFERENCE 1 (bases 1 to 1426) AUTHORS Teather,R.M. and Erfle,J.D. TITLE DNA sequence of a Fibrobacter succinogenes mixed-linkage beta-glucanase (1,3-1,4-beta-D-glucan 4-glucanohydrolase) gene JOURNAL J. Bacteriol. 172, 3837-3841 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by R.M.Teather, 11-APR-1990. FEATURES from to/span description pept 145 1194 1,3-1,4-beta-D-glucan 4-glucanohydrolase precursor (EC 3.2.1.73) sigp 145 225 1,3-1,4-beta-D-glucan 4-glucanohydrolase signal peptide matp 226 1191 1,3-1,4-beta-D-glucan 4-glucanohydrolase binding 132 137 ribosome binding site signal 62 66 -35 region signal 85 90 -10 region BASE COUNT 371 a 346 c 335 g 374 t ORIGIN 1 ttttcagcac agcacactgc cacaattgat acagttaatc ttttaaatac attctatttt 61 attggttatt taatttcgct aacttatctt tatctttggt taaatgggat tctgttttgt 121 acagaaactt catggagaaa aaatatgaac atcaagaaaa ctgcagtcaa gagcgctctc 181 gccgtagcag ccgcagcagc agccctcacc accaatgtta gcgcaaagga ttttagcggt 241 gccgaactct acacgttaga agaagttcag tacggtaagt ttgaagcccg tatgaagatg 301 gcagccgcat cgggaacagt cagttccatg ttcctctacc agaatggttc cgaaatcgcc 361 gatggaaggc cctgggtaga agtggatatt gaagttctcg gcaagaatcc gggcagtttc 421 cagtccaaca tcattaccgg taaggccggc gcacaaaaga ctagcgaaaa gcaccatgct 481 gttagccccg ccgccgatca ggctttccac acctacggtc tcgaatggac tccgaattac 541 gtccgctgga ctgttgacgg tcaggaagtc cgcaagacgg aaggtggcca ggtttccaac 601 ttgacaggta cacagggact ccgttttaac ctttggtcgt ctgagagtgc ggcttgggtt 661 ggccagttcg atgaatcaaa gcttccgctt ttccagttca tcaactgggt caaggtttat 721 aagtatacgc cgggccaggg cgaaggcggc agcgacttta cgcttgactg gaccgacaat 781 tttgacacgt ttgatggctc ccgctggggc aagggtgact ggacatttga cggtaaccgt 841 gtcgacctca ccgacaagaa catctactcc agagatggca tgttgatcct cgccctcacc 901 cgcaaaggtc aggaaagctt caacggccag gttccgagag atgacgaacc tgctccgcaa 961 tcttctagca gcgctccggc atcttctagc agtgttccgg caagctcctc tagcgtccct 1021 gcctcctcga gcagcgcatt tgttccgccg agctcctcga gcgccacaaa cgcaatccac 1081 ggaatgcgca caactccggc agttgcaaag gaacaccgca atctcgtgaa cgccaagggt 1141 gccaaggtga acccgaatgg ccacaagcgt tatcgcgtga actttgaaca ctaatcgtgg 1201 ctgattctct ttataattct ctttatcgca aagaccatgt ggtttactcc acatggtttt 1261 tcgttaagtc cactaaaatt aggggatttt cgctattttt tttgaatttt gacactaaaa 1321 tgtcaaatga gtttttgtat ttttgatttc gaaattttta aaaattaaaa taggatagtt 1381 atatggctta tttgaataag gttatgctca tcggtaatat cggtaa // LOCUS BOVRS157A 824 bp ss-mRNA MAM 01-AUG-1990 DEFINITION Bovine retina-specific 15.7 kDa protein mRNA, complete cds. ACCESSION M34915 KEYWORDS . SOURCE Bovine retina, cDNA to mRNA, clone pCR18. ORGANISM Bos taurus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Artiodactyla; Ruminantia; Pecora; Bovidae. REFERENCE 1 (bases 1 to 824) AUTHORS Nakagawa,Y., Kuo,C.-H., Ishii,K., Shiosaka,S., Tohyama,M. and Miki,N. TITLE Cloning and characterization of a cDNA specific for bovine retina JOURNAL Neurosci. Res. 3, 300-310 (1986) STANDARD simple staff_entry FEATURES from to/span description pept 138 581 retina-specific 15.7 kDa protein mRNA < 1 824 retina-specific mRNA signal 800 805 polyA signal BASE COUNT 178 a 219 c 224 g 203 t ORIGIN 1 tttagcctca gccgtgaccg gccccgtccc gcggcgccgg gagttcgtgt gaacgggtag 61 gtgtaccgac ttcgcccgtc cgtgaatccc gtggtcgcaa aggcccgcgc ggcgggccgg 121 gttctgccga taccttaatg ggctgtgcgc gaggagagcc tcaattgcaa gttggtcgag 181 gagatcgcca cgctggtgca gagctggcct cactagttgc ggctagtgta ggacgttgta 241 ctccgacatt ccgcaagccc ttccacacgg acagtcctag catccagggt cagtggcacc 301 ccttcaccaa caaaccgaca gcactggggt gctcctcgag aggtccagaa tcctgccccg 361 acccagcggc cagcacaatg aagaccaact ccatacccac agtttggact tttactccag 421 cagagggtgg ttcctgctcc tggtttgctt cacgggagac agatgaagcc accaatgggg 481 tacttcttgc ttgggataaa gaagagctgc ctgtctcttt tgatgtccac cgtgaggcag 541 ggactgtgag tctcctcatt cttagccagt tgacatcctg aaaccctgag aatcttcaga 601 gatttgactt ggtcttcatt tcttaaatcc aaatcaataa tagtgatctc aaatcaagtg 661 agggctttca aggctggctt ctgaagaatt ccttttggcc tgtttctgta gccagtgacc 721 aagagagtct gctgtgagct ggcattgggc taggccttgt atctatgtga tgtttgtgtg 781 cagttagaaa actgaagtta ataaatttgc caaggtcaca cttg // LOCUS CHKFRA2A1 360 bp ds-DNA VRT 01-AUG-1990 DEFINITION Chicken fra-2 oncogene gene, exon 1. ACCESSION D90104 KEYWORDS fos-related gene; fra-2 gene; oncogene. SEGMENT 1 of 4 SOURCE Chicken embryo fibroblasts DNA, clones lambda-OO[1,2,3,4]. ORGANISM Gallus gallus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Aves; Neornithes; Neognathae; Galliformes; Phasianidae. REFERENCE 1 (bases 1 to 360) AUTHORS Nishina,H., Sato,H., Suzuki,T. and Iba,H. TITLE Isolation and characterization of fra-2, and additional member of the fos gene family JOURNAL Proc. Natl. Acad. Sci. U.S.A. 87, 3619-3623 (1990) STANDARD simple staff_entry FEATURES from to/span description pept 241 + 339 fra-2 protein, exon 1 pre-msg < 1 > 360 fra-2 mRNA and introns IVS 340 > 360 fra-2 intron A BASE COUNT 39 a 123 c 113 g 85 t ORIGIN 1 tgtttttttg gttgtttttt ttttttgtcg gctttccgct ttttcttttt ttcttttttt 61 tccctttttc tatttttccc ccccttcttc ttctcccgct gcggactctc ccccggctgc 121 gggaggcgcg aggcagagcc cgagaggtcg gcacggagca gggggcgggg agacggcgag 181 ggagcggcgg ccgcggcgcg ggaaggcggg gacgcggctc ccccgggccg gcctcggacc 241 atgtaccagg actatcccgg gagcttcgac acctcctcca gaggcagcag cggctccccg 301 ggacaccccg agccctactc cgccggcgca gcccagcagg tagggccgcc tccgccccgt // LOCUS CHKFRA2A2 297 bp ds-DNA VRT 01-AUG-1990 DEFINITION Chicken fra-2 oncogene gene, exon 2. ACCESSION D90105 KEYWORDS fos-related gene; fra-2 gene; oncogene. SEGMENT 2 of 4 SOURCE Chicken embryo fibroblasts DNA, clones lambda-OO[1,2,3,4]. ORGANISM Gallus gallus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Aves; Neornithes; Neognathae; Galliformes; Phasianidae. REFERENCE 1 (bases 1 to 297) AUTHORS Nishina,H., Sato,H., Suzuki,T. and Iba,H. TITLE Isolation and characterization of fra-2, and additional member of the fos gene family JOURNAL Proc. Natl. Acad. Sci. U.S.A. 87, 3619-3623 (1990) STANDARD simple staff_entry FEATURES from to/span description pept + 22 + 276 fra-2 protein, exon 2 pre-msg < 1 > 297 fra-2 mRNA and introns IVS < 1 21 fra-2 intron A IVS 277 > 297 fra-2 intron B BASE COUNT 68 a 103 c 74 g 52 t ORIGIN About 5 kbp after segment 1. 1 ctcccccacc tttcctccta gaaattccga gtagatatgc caggatcagg cagtgctttt 61 attcccacga tcaacgccat cacaaccagc caagacctgc agtggatggt gcagcccacc 121 gtcatcacct ccatgtccag cccgtactct cgctcgcacc cctacagcca cccactgccg 181 ccgctgtcct cggtggctgg acacacggcc cttcagcgac cgggcgtgat caaaaccatc 241 ggcaccacag tgggacggag acgaagggat gagcaggtaa ctgtgtgagc aggagga // LOCUS CHKFRA2A3 149 bp ds-DNA VRT 01-AUG-1990 DEFINITION Chicken fra-2 oncogene gene, exon 3. ACCESSION D90106 KEYWORDS fos-related gene; fra-2 gene; oncogene. SEGMENT 3 of 4 SOURCE Chicken embryo fibroblasts DNA, clones lambda-OO[1,2,3,4]. ORGANISM Gallus gallus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Aves; Neornithes; Neognathae; Galliformes; Phasianidae. REFERENCE 1 (bases 1 to 149) AUTHORS Nishina,H., Sato,H., Suzuki,T. and Iba,H. TITLE Isolation and characterization of fra-2, and additional member of the fos gene family JOURNAL Proc. Natl. Acad. Sci. U.S.A. 87, 3619-3623 (1990) STANDARD simple staff_entry FEATURES from to/span description pept + 22 + 129 fra-2 protein, exon 3 pre-msg < 1 > 149 fra-2 mRNA and introns IVS < 1 21 fra-2 intron B IVS 130 > 149 fra-2 intron C BASE COUNT 42 a 34 c 46 g 27 t ORIGIN About 4 kbp after segment 2. 1 tttcttggca cttgcccata gctgtcgcct gaggaagaag agaagcgaag gatccggaga 61 gagaggaaca agctggcagc tgctaaatgt cgtaacaggc gccgagagct aacagagaaa 121 ctccaggcgg tacgtgctct gcatgcatt // LOCUS CHKFRA2A4 744 bp ds-DNA VRT 01-AUG-1990 DEFINITION Chicken fra-2 oncogene gene, exon 4. ACCESSION D90107 KEYWORDS fos-related gene; fra-2 gene; oncogene. SEGMENT 4 of 4 SOURCE Chicken embryo fibroblasts DNA, clones lambda-OO[1,2,3,4]. ORGANISM Gallus gallus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Aves; Neornithes; Neognathae; Galliformes; Phasianidae. REFERENCE 1 (bases 1 to 744) AUTHORS Nishina,H., Sato,H., Suzuki,T. and Iba,H. TITLE Isolation and characterization of fra-2, and additional member of the fos gene family JOURNAL Proc. Natl. Acad. Sci. U.S.A. 87, 3619-3623 (1990) STANDARD simple staff_entry FEATURES from to/span description pept + 22 531 fra-2 protein, exon 4 pre-msg < 1 > 744 fra-2 mRNA and introns IVS < 1 21 fra-2 intron C BASE COUNT 180 a 206 c 209 g 149 t ORIGIN About 2 kbp after segment 3. 1 ttattccctt tttgtctgca ggaaactgag gtgctggagg aggaaaagtc agtgcttcaa 61 aaagagattg ctgagctcca gaaggagaag gagaaactag agttcatgct ggttgctcac 121 agccctgtgt gtaaaatcag ccctgaggaa cgtcggagcc caccaaccag cagcctccag 181 agcgttcgga ctggagcgag cggagcagtg gtggtgaagc aggagcctgt ggaggaagag 241 atcccatctt cctctttggt ccttgacaaa gctcagaggt ctgtcattaa gcccatcagc 301 attgctggag gttattatgg ggaggaggca ctcaacactc ccatcgtggt gacctcgaca 361 ccagccatca ctcctggttc ctccaacttg gtgttcacct accccaatgt cttggatcag 421 gagtctcctc tctccccgtc cgagtcctgc tccaaagctc accggaggag cagcagcagc 481 ggcgaccagt cctcggattc cttgaactct cccaccttgc tggcattgta atcccctgag 541 gcccccccat tgccagtgtg ttacatcccc cgcccggctc catggggaga cccctccatg 601 ggattagaga caggcacagg atcgttcaag cacaagggca gcaagaacaa gaatggggaa 661 atgctgcagc tccaggaaag agagtgagga ccaatgccag ctccctggag gcaggaaatg 721 gcaagggtgg gactgatgca ccag // LOCUS ECOTGP 7335 bp ds-DNA BCT 01-AUG-1990 DEFINITION E.coli tryptophan operon: entire DNA sequence. ACCESSION J01714 M12471 M12472 M25593 KEYWORDS anthranilate isomerase; anthranilate synthetase; attenuator; glutamine amidotransferase; isomerase; leader peptide; phosphoribosyl anthranilate synthetase; synthetase; transferase; trp operon; trpA gene; trpB gene; trpC gene; trpD gene; trpE gene; tryptophan synthetase. SOURCE Escherichia coli RNA and DNA. ORGANISM Escherichia coli Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Facultatively anaerobic rods; Enterobacteriaceae. REFERENCE 1 (bases 5917 to 6133) AUTHORS Platt,T. and Yanofsky,C. TITLE An intercistronic region and ribosome-binding site in bacterial messenger RNA JOURNAL Proc. Natl. Acad. Sci. U.S.A. 72, 2399-2403 (1975) STANDARD full staff_review REFERENCE 2 (bases 84 to 141) AUTHORS Bennett,G.N., Schweingruber,M.E., Brown,K.D., Squires,C. and Yanofsky,C. TITLE Nucleotide sequence of region preceding trp mRNA initiation site and its role in promoter and operator function JOURNAL Proc. Natl. Acad. Sci. U.S.A. 73, 2351-2355 (1976) STANDARD full staff_review REFERENCE 3 (bases 117 to 310) AUTHORS Squires,C., Lee,F., Bertrand,K., Squires,C.L., Bronson,M.J. and Yanofsky,C. TITLE Nucleotide sequence of the 5' end of tryptophan messenger RNA of Escherichia coli JOURNAL J. Mol. Biol. 103, 351-381 (1976) STANDARD full staff_review REFERENCE 4 (bases 230 to 272) AUTHORS Bertrand,K., Korn,L.J., Lee,F. and Yanofsky,C. TITLE The attenuator of the tryptophan operon of Escherichia coli: heterogeneous 3'-OH termini in vivo and deletion mapping of functions JOURNAL J. Mol. Biol. 117, 227-247 (1977) STANDARD full staff_review REFERENCE 5 (bases 230 to 272) AUTHORS Stauffer,G.V., Zurawski,G. and Yanofsky,C. TITLE Single base-pair alterations in the Escherichia coli trp operon leader region that relieve transcription termination at the trp attenuator JOURNAL Proc. Natl. Acad. Sci. U.S.A. 75, 4833-4837 (1978) STANDARD full staff_review REFERENCE 6 (bases 6707 to 6863) AUTHORS Wu,A.M. and Platt,T. TITLE Transcription termination: nucleotide sequence at 3' end of tryptophan operon in Escherichia coli JOURNAL Proc. Natl. Acad. Sci. U.S.A. 75, 5442-5446 (1978) STANDARD full staff_review REFERENCE 7 (bases 1 to 140) AUTHORS Bennett,G.N., Schweingruber,M.E., Brown,K.D., Squires,C. and Yanofsky,C. TITLE Nucleotide sequence of the promoter-operator region of the tryptophan operon of Escherichia coli JOURNAL J. Mol. Biol. 121, 113-137 (1978) STANDARD full staff_review REFERENCE 8 (bases 2351 to 2503) AUTHORS Miozzari,G.F. and Yanofsky,C. TITLE Gene fusion during the evolution of the tryptophan operon in enterobacteriaceae JOURNAL Nature 277, 486-489 (1979) STANDARD full staff_review REFERENCE 9 (bases 5932 to 6809) AUTHORS Nichols,B.P. and Yanofsky,C. TITLE Nucleotide sequences of trpA of Salmonella typhimurium JOURNAL Proc. Natl. Acad. Sci. U.S.A. 76, 5244-5248 (1979) STANDARD full staff_review REFERENCE 10 (bases 117 to 256) AUTHORS Oxender,D.L., Zurawski,G. and Yanofsky,C. TITLE Attenuation in the Escherichia coli tryptophan operon: role of RNA secondary structure involving the tryptophan codon region JOURNAL Proc. Natl. Acad. Sci. U.S.A. 76, 5524-5528 (1979) STANDARD full staff_review REFERENCE 11 (bases 3422 to 4824) AUTHORS Christie,G.E. and Platt,T. TITLE Gene structure in the tryptophan operon of Escherichia coli: nucleotide sequence of trpC and the flanking intercistronic regions JOURNAL J. Mol. Biol. 142, 519-530 (1980) STANDARD full staff_review REFERENCE 12 (bases 230 to 296) AUTHORS Farnham,P.J. and Platt,T. TITLE A model for transcription termination suggested by studies on the trp attenuator in vitro using base analogs JOURNAL Cell 20, 739-748 (1980) STANDARD full staff_review REFERENCE 13 (bases 4810 to 6003) AUTHORS Crawford,I.P., Nichols,B.P. and Yanofsky,C. TITLE Nucleotide sequence of the trpB gene in Escherichia coli and Salmonella typhimurium JOURNAL J. Mol. Biol. 142, 489-502 (1980) STANDARD full staff_review REFERENCE 14 (bases 1761 to 2443) AUTHORS Nichols,B.P., Miozzari,G.F., van Cleemput,M., Bennett,G.N. and Yanofsky,C. TITLE Nucleotide sequences of the trpG regions of Escherichia coli, Shigella dysenteriae, Salmonella typhimurium and Serratia marcescens JOURNAL J. Mol. Biol. 142, 503-517 (1980) STANDARD full staff_review REFERENCE 15 (bases 6707 to 7335) AUTHORS Wu,A.M., Chapman,A.B., Platt,T., Guarente,L.P. and Beckwith,J. TITLE Deletions of distal sequence affect termination of transcription at the end of the tryptophan operon in E. coli JOURNAL Cell 19, 829-836 (1980) STANDARD full staff_review REFERENCE 16 (bases 279 to 1843) AUTHORS Nichols,B.P., van Cleemput,M. and Yanofsky,C. TITLE Nucleotide sequence of Escherichia coli trpE: anthranilate synthetase component I contains no tryptophan residues JOURNAL J. Mol. Biol. 146, 45-54 (1981) STANDARD full staff_review REFERENCE 17 (bases 5932 to 6809) AUTHORS Schneider,W.P., Nichols,B.P. and Yanofsky,C. TITLE Procedure for production of hybrid genes and proteins and its use in assessing significance of amino acid differences in homologous tryptophan synthetase alpha polypeptides JOURNAL Proc. Natl. Acad. Sci. U.S.A. 78, 2169-2173 (1981) STANDARD full staff_review REFERENCE 18 (bases 6807 to 6856; 7057 to 7119) AUTHORS Wu,A.M., Christie,G.E. and Platt,T. TITLE Tandem termination sites in the tryptophan operon of Escherichia coli JOURNAL Proc. Natl. Acad. Sci. U.S.A. 78, 2913-2917 (1981) STANDARD full staff_review REFERENCE 19 (review; bases 77 to 6809; compiled) AUTHORS Yanofsky,C., Platt,T., Crawford,I.P., Nichols,B.P., Christie,G.E., Horowitz,H., van Cleemput,M. and Wu,A.M. TITLE The complete nucleotide sequence of the tryptophan operon of Escherichia coli JOURNAL Nucleic Acids Res. 9, 6647-6668 (1981) STANDARD full staff_review REFERENCE 20 (bases 2504 to 3436) AUTHORS Horowitz,H., Christie,G.E. and Platt,T. TITLE Nucleotide sequence of the trpD gene, encoding anthranilate synthetase component II of Escherichia coli JOURNAL J. Mol. Biol. 156, 245-256 (1982) STANDARD full staff_review REFERENCE 21 (bases 57 to 137) AUTHORS Windass,J.D., Newton,C.R., De Maeyer-Guignard,J., Moore,V.E., Markham,A.F. and Edge,M.D. TITLE The construction of a synthetic Escherichia coli trp promoter and its use in the expression of a synthetic interferon gene JOURNAL Nucleic Acids Res. 10, 6639-6657 (1982) STANDARD full staff_review REFERENCE 22 (sites; mutational analysis of the regulatory region) AUTHORS Kolter,R. and Yanofsky,C. TITLE Genetic analysis of the tryptophan operon regulatory region using site-directed mutagenesis JOURNAL J. Mol. Biol. 175, 299-312 (1984) STANDARD full staff_entry REFERENCE 23 (bases 36 to 136) AUTHORS Brown,K.D., Bennet,G.N., Lee,F., Schweingruber,M.E. and Yanofsky,C. TITLE RNA polymerase interaction at the promoter-operator region of the tryptophan operon of Escherichia coli and Salmonella typhimurium JOURNAL J. Mol. Biol. 121, 153-177 (1978) STANDARD simple staff_entry COMMENT The tryptophan operon of E.coli consists of a repressor(trpR), a promoter(trpP), an operator(trpO), an attenuator which is part of a leader peptide region(trpL) and five structural genes: trpE(anthranilate synthetase), trpD(glutamine amido transferase and anthranilate 5-phosphoribosylpyrophosphate phosphoribosyl- transferase), trpC(phosphoribosyl anthranilate isomerase-indole glycerol phosphate synthetase), trpB(tryptophan synthetase beta) and trpA(tryptophan synthetase alpha). The promoter region covers approximately 40 bases upstream from the mRNA initiation site(75-116); the operator approximately 20 bases upstream with two-fold axes of symmetry around 104-105 and 109-110([2],[7],[20]). The attenuator region is the first 140 nucleotides(117-256) of the mRNA leader, a G-C rich region with a two-fold axis of symmetry around base 240 and an A-T rich region with its axis about bases 259-260; it provides a second site for control of transcription ([4],[5],[10],[12]). Two mRNA termination regions are reported: trpT (bases 6807-6856) and trpT' (bases 7057-7119), the first of which bears some similarity to the attenuator region ([18]). A chi site for recombination is localized between bases 2492 and 2501 and the trp-P2 promoter is located between bases 3240 and 3280 ([20]). The trpE gene is unusual in that it codes for no tryptophan residues([16]). The two enzymatic functions coded by trpG and trpD genes in S.marcescens are coded by the single trpD gene in E.coli and other enterobacteriaceae. This appears to have occurred via base changes at sites 2420 and 2438. The intercistronic regions for the structural genes show little superfluity: the trpE-trpD and trpB-trpA boundaries consist of 'tgatg'; the trpD-trpC boundary is 'taaatgatg' and the trpC-trpB boundary is 'taaggaaaggaacaatg'. All the cistrons show a high degree of homology with their correlates among the enterobacteriaceae. Sequence discrepancies in early work([3]) are corrected in later work from the same laboratory([10],[19]). [17] also sequenced S.typhimurium trpA region. [19] compiles sequences from [7],[8],[9],[11],[13],[14],[16],[20]. FEATURES from to/span description pept 143 187 trp operon leader peptide (putative) pept 279 1841 anthranilate synthetase component I /nomgen="trpE" pept 1841 3436 anthranilate synthetase component II: glutamine amidotransferase and phosphoribosyl anthranilate synthetase /nomgen="trpD" pept 3440 4798 anthranilate isomerase /nomgen="trpC" pept 4810 6003 tryptophan synthetase beta subunit /nomgen="trpB" pept 6003 6809 tryptophan synthetase alpha subunit /nomgen="trpA" mRNA 117 257 trp mRNA (alt.) [2],[3],[7],[10],[21] mRNA 117 6842 trp mRNA (alt.) [2],[3],[6],[7],[10],[18],[21] used revision 1787 1787 c in [16]; t in [14] revision 1793 1793 t in [16]; c in [14] conflict 3526 3530 gg in [19]; gaatg in [11] conflict 4289 4293 gc in [19]; gttgc in [11] conflict 5949 5949 c in [1]; a in [17] BASE COUNT 1740 a 1926 c 1960 g 1705 t 4 others ORIGIN 9 bp upstream from HhaI site [7]. 1 ctcaaggcgc actcccgttc tggataatgt tttttgcgcc gacatcataa cggttctggc 61 aaatattctg aaatgagctg ttgacaatta atcatcgaac tagttaacta gtacgcaagt 121 tcacgtaaaa agggtatcga caatgaaagc aattttcgta ctgaaaggtt ggtggcgcac 181 ttcctgaaac gggcagtgta ttcaccatgc gtaaagcaat cagataccca gcccgcctaa 241 tgagcgggct tttttttgaa caaaattaga gaataacaat gcaaacacaa aaaccgactc 301 tcgaactgct aacctgcgaa ggcgcttatc gcgacaatcc caccgcgctt tttcaccagt 361 tgtgtgggga tcgtccggca acgctgctgc tggaatccgc agatatcgac agcaaagatg 421 atttaaaaag cctgctgctg gtagacagtg cgctgcgcat tacagcttta ggtgacactg 481 tcacaatcca ggcactttcc ggcaacggcg aagccctcct ggcactactg gataacgccc 541 tgcctgcggg tgtggaaagt gaacaatcac caaactgccg tgtgctgcgc ttcccccctg 601 tcagtccact gctggatgaa gacgcccgct tatgctccct ttcggttttt gacgctttcc 661 gtttattgca gaatctgttg aatgtaccga aggaagaacg agaagccatg ttcttcagcg 721 gcctgttctc ttatgacctt gtggcgggat ttgaagattt accgcaactg tcagcggaaa 781 ataactgccc tgatttctgt ttttatctcg ctgaaacgct gatggtgatt gaccatcaga 841 aaaaaagcac ccgtattcag gccagcctgt ttgctccgaa tgaagaagaa aaacaacgtc 901 tcactgctcg cctgaacgaa ctacgtcagc aactgaccga agccgcgccg ccgctgccag 961 tggtttccgt gccgcatatg cgttgtgaat gtaatcagag cgatgaagag ttcggtggcg 1021 tagtgcgttt gttgcaaaaa gcgattcgcg ctggagaaat tttccaggtg gtgccatctc 1081 gccgtttctc tctgccctgc ccgtcaccgc tggcggccta ttacgtgctg aaaaagagta 1141 atcccagccc gtacatgttt tttatgcagg ataatgattt caccctattt ggcgcgtcgc 1201 cggaaagctc gctcaagtat gatgccacca gccgccagat tgagatctac ccgattgccg 1261 gaacacgccc acgcggtcgt cgcgccgatg gttcactgga cagagatctc gacagccgta 1321 ttgaactgga aatgcgtacc gatcataaag agctgtctga acatctgatg ctggttgatc 1381 tcgcccgtaa tgatctggca cgcatttgca cccccggcag ccgctacgtc gccgatctca 1441 ccaaagttga ccgttattcc tatgtgatgc acctcgtctc tcgcgtagtc ggcgaactgc 1501 gtcacgatct tgacgccctg cacgcttatc gcgcctgtat gaatatgggg acgttaagcg 1561 gtgcgccgaa agtacgcgct atgcagttaa ttgccgaggc ggaaggtcgt cgccgcggca 1621 gctacggcgg cgcggtaggt tatttcaccg cgcatggcga tctcgacacc tgcattgtga 1681 tccgctcggc gctggtggaa aacggtatcg ccaccgtgca agcgggtgct ggtgtagtcc 1741 ttgattctgt tccgcagtcg gaagccgacg aaacccgtaa caaagcccgc gctgtactgc 1801 gcgctattgc caccgcgcat catgcacagg agactttctg atggctgaca ttctgctgct 1861 cgataatatc gactctttta cgtacaacct ggcagatcag ttgcgcagca atgggcataa 1921 cgtggtgatt taccgcaacc atataccggc gcaaacctta attgaacgct tggcgaccat 1981 gagtaatccg gtgctgatgc tttctcctgg ccccggtgtg ccgagcgaag ccggttgtat 2041 gccggaactc ctcacccgct tgcgtggcaa gctgcccatt attggcattt gcctcggaca 2101 tcaggcgatt gtcgaagctt acgggggcta tgtcggtcag gcgggcgaaa ttctccacgg 2161 taaagcctcc agcattgaac atgacggtca ggcgatgttt gccggattaa caaacccgct 2221 gccggtggcg cgttatcact cgctggttgg cagtaacatt ccggccggtt taaccatcaa 2281 cgcccatttt aatggcatgg tgatggcagt acgtcacgat gcggatcgcg tttgtggatt 2341 ccagttccat ccggaatcca ttctcaccac ccagggcgct cgcctgctgg aacaaacgct 2401 ggcctgggcg cagcataaac tagagccagc caacacgctg caaccgattc tggaaaaact 2461 gtatcaggcg cagacgctta gccaacaaga aagccaccag ctgttttcag cggtggtgcg 2521 tggcgagctg aagccggaac aactggcggc ggcgctggtg agcatgaaaa ttcgcggtga 2581 gcacccgaac gagatcgccg gggcagcaac cgcgctactg gaaaacgcag cgccgttccc 2641 gcgcccggat tatctgtttg ctgatatcgt cggtactggc ggtgacggca gcaacagtat 2701 caatatttct accgccagtg cgtttgtcgc cgcggcctgt gggctgaaag tggcgaaaca 2761 cggcaaccgt agcgtctcca gtaaatctgg ttcgtccgat ctgctggcgg cgttcggtat 2821 taatcttgat atgaacgccg ataaatcgcg ccaggcgctg gatgagttag gtgtatgttt 2881 cctctttgcg ccgaagtatc acaccggatt ccgccacgcg atgccggttc gccagcaact 2941 gaaaacccgc accctgttca atgtgctggg gccattgatt aacccggcgc atccgccgct 3001 ggcgttaatt ggtgtttata gtccggaact ggtgctgccg attgccgaaa ccttgcgcgt 3061 gctggggtat caacgcgcgg cggtggtgca cagcggcggg atggatgaag tttcattaca 3121 cgcgccgaca atcgttgccg aactgcatga cggcgaaatt aaaagctatc agctcaccgc 3181 agaagacttt ggcctgacac cctaccacca ggagcaactg gcaggcggaa caccggaaga 3241 aaaccgtgac attttaacac gtttgttaca aggtaaaggc gacgccgccc atgaagcagc 3301 cgtcgctgcg aacgtcgcca tgttaatgcg cctgcatggc catgaagatc tgcaagccaa 3361 tgcgcaaacc gttcttgagg tactgcgcag tggttccgct tacgacagag tcaccgcact 3421 ggcggcacga gggtaaatga tgcaaaccgt tttagcgaaa atcgtcgcag acaaggcgat 3481 ttgggtagaa gcccgcaaac agcagcaacc gctggccagt tttcagaatg aggttcagcc 3541 gagcacgcga catttttatg atgcgctaca gggtgcgcgc acggcgttta ttctggagtg 3601 caagaaagcg tcgccgtcaa aaggcgtgat ccgtgatgat ttcgatccag cacgcattgc 3661 cgccatttat aaacattacg cttcggcaat ttcggtgctg actgatgaga aatatttcag 3721 gggtagcttt aatttcctcc ccatcgtcag ccaaatcgcc ccgcagccga ttttatgtaa 3781 agacttcatt atcgaccctt accagatcta tctggcgcgc tattaccagg ccgatgcctg 3841 cttattaatg ctttcagtac tggatgacga ccaatatcgc cagcttgccg ccgtcgctca 3901 cagtctggag atgggggtgc tgaccgaagt cagtaatgaa gaggaacagg agcgcgccat 3961 tgcattggga gcaaaggtcg ttggcatcaa caaccgcgat ctgcgtgatt tgtcgattga 4021 tctcaaccgt acccgcgagc ttgcgccgaa actggggcac aacgtgacgg taatcagcga 4081 atccggcatc aatacttacg ctcaggtgcg cgagttaagc cacttcgcta acggttttct 4141 gattggttcg gcgttgatgg cccatgacga tttgcacgcc gccgtgcgcc gggtgttgct 4201 gggtgagaat aaagtatgtg gcctgacgcg tgggcaagat gctaaagcag cttatgacgc 4261 gggcgcgatt tacggtgggt tgatttttgt tgcgacatca ccgcgttgcg tcaacgttga 4321 acaggcgcag gaagtgatgg ctgcggcacc gttgcagtat gttggcgtgt tccgcaatca 4381 cgatattgcc gatgtggtgg acaaagctaa ggtgttatcg ctggtggcag tgcaactgca 4441 tggtaatgaa gaacagctgt atatcgatac gctgcgtgaa gctctgccag cacatgttgc 4501 catctggaaa gcattaagcg tcggtgaaac cctgcccgcc cgcgagtttc agcacgttga 4561 taaatatgtt ttagacaacg gccagggtgg aagcgggcaa cgttttgact ggtcactatt 4621 aaatggtcaa acgcttggca acgttctgct ggcggggggc ttaggcgcag ataactgcgt 4681 ggaagcggca caaaccggct gcgccggact tgattttaat tctgctgtag agtcgcaacc 4741 gggcatcaaa gacgcacgtc ttttggcctc ggttttccag acgctgcgcg catattaagg 4801 aaaggaacaa tgacaacatt acttaacccc tattttggtg agtttggcgg catgtacgtg 4861 ccacaaatcc tgatgcctgc tctgcgccag ctggaagaag cttttgtcag tgcgcaaaaa 4921 gatcctgaat ttcaggctca gttcaacgac ctgctgaaaa actatgccgg gcgtccaacc 4981 gcgctgacca aatgccagaa cattacagcc gggacgaaca ccacgctgta tctcaagcgt 5041 gaagatttgc tgcacggcgg cgcgcataaa actaaccagg tgctggggca ggcgttgctg 5101 gcgaagcgga tgggtaaaac cgaaatcatc gccgaaaccg gtgccggtca gcatggcgtg 5161 gcgtcggccc tggccagcgc cctgctcggc ctgaaatgcc gtatttatat gggtgccaaa 5221 gacgttgaac gccagtcgcc taacgttttt cgtatgcgct taatgggtgc ggaagtgatc 5281 ccggtgcata gcggttccgc gacgctgaaa gatgcctgta acgaggcgct gcgcgactgg 5341 tccggtagtt acgaaaccgc gcactatatg ctgggcaccg cagctggccc gcatccttat 5401 ccgaccattg tgcgtgagtt tcagcggatg attggcgaag aaaccaaagc gcagattctg 5461 gaaagagaag gtcgcctgcc ggatgccgtt atcgcctgtg ttggcggcgg ttcgaatgcc 5521 atcggcatgt ttgctgattt catcaatgaa accaacgtcg gcctgattgg tgtggagcca 5581 ggtggtcacg gtatcgaaac tggcgagcac ggcgcaccgc taaaacatgg tcgcgtgggt 5641 atctatttcg gtatgaaagc gccgatgatg caaaccgaag acgggcagat tgaagaatct 5701 tactccatct ccgccggact ggatttcccg tctgtcggcc cacaacacgc gtatcttaac 5761 agcactggac gcgctgatta cgtgtctatt accgatgatg aagcccttga agccttcaaa 5821 acgctgtgcc tgcacgaagg gatcatcccg gcgctggaat cctcccacgc cttggcccat 5881 gcgttgaaaa tgatgcgcga aaacccggat aaagagcagc tactggtggt taacctttcc 5941 ggtcgcggcg ataaagacat cttcaccgtt cacgatattt tgaaagcacg aggggaaatc 6001 tgatggaacg ctacgaatct ctgtttgccc agttgaagga gcgcaaagaa ggcgcattcg 6061 ttcctttcgt cacgctcggt gatccgggca ttgagcagtc attgaaaatt atcgatacgc 6121 taattgaagc cggtgctgac gcgctggagt taggtatccc cttctccgac ccactggcgg 6181 atggcccgac gattcaaaac gccactctgc gcgcctttgc ggcaggtgtg actccggcac 6241 aatgttttga aatgctggca ctgattcgcc agaaacaccc gaccattccc attggcctgt 6301 tgatgtatgc caatctggtg tttaacaaag gcattgatga gttttatgcc cagtgcgaaa 6361 aagtcggcgt cgattcggtg ctggttgccg atgtgccagt tgaagagtcc gcgcccttcc 6421 gccaggccgc gttgcgtcac aacgtcgcac ctatcttcat ctgcccgcca aatgccgatg 6481 acgacctgct gcgccagata gcctcttacg gtcgtggtta cacctatttg ctgtcacgag 6541 caggcgtgac cggcgcagaa aaccgcgccg cgttacccct caatcatctg gttgcgaagc 6601 tgaaagagta caacgctgca cctccattgc agggatttgg tatttccgcc ccggatcagg 6661 taaaagcagc gattgatgca ggagctgcgg gcgcgatttc tggttcggcc attgttaaaa 6721 tcatcgagca acatattaat gagccagaga aaatgctggc ggcactgaaa gtttttgtac 6781 aaccgatgaa agcggcgacg cgcagttaat cccacagccg ccagttccgc tggcggcatt 6841 ttaactttct ttaatgaagc cggaaaaatc ctaaattcat ttaatattta tctttttacc 6901 gtttcgctta ccccggtcga tcgtyractt acgtcatttt tccgcccaac agtaatataa 6961 acaaacaaat taaacccgca acataacacc agtaaaatca ataattttct ctaagtcact 7021 tattcctcag gtaattctta atatatccag aatgttcctc aaaatatatt ttccctctat 7081 cttctcgttg cgcttaattt gactaattct cattagcgac taattttaat gagtgtcgac 7141 acacaacact catattaatg aaacaatgca acgcaacggg agaaataaca tggccgaaca 7201 tcgtggtggt tcaggaaatt tcgccgaaga ccgtgagaag gcatccgacg cagccgtaaa 7261 ggcggtcagc atagcggcgg taattttaaa aatgatcgca acgcgcatct gaagcgggta 7321 aaaaaggcgg tyrac // LOCUS HUMGSTH 808 bp ss-mRNA PRI 01-AUG-1990 DEFINITION Human glutathione S-transferase (GST) a-subunit mRNA, complete cds. ACCESSION M14777 KEYWORDS GSH S-transferase; glutathione S-transferase. SOURCE Human liver, cDNA to mRNA, clone pGTH1. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 808) AUTHORS Tu,C.-P.D. and Qian,B. TITLE Human liver glutathione S-transferases: Complete primary sequence of an H-a subunit cDNA JOURNAL Biochem. Biophys. Res. Commun. 141, 229-237 (1986) STANDARD simple staff_entry FEATURES from to/span description pept 67 735 glutathione S-transferase (GST, EC 2.5.1.18) /hgml_locus_uid="LL0130R" /nomgen="GST2" /map="6p12.2" mRNA < 1 808 GST mRNA signal 792 797 polyA signal BASE COUNT 252 a 175 c 192 g 189 t ORIGIN Chromosome 6p12.2. 1 agttgtcgag ccaggacggt gacagcgttt aacaaagctt agagaaacct ccaggagact 61 gctatcatgg cagagaagcc caagctccac tacttcaatg cacggggcag aatggagtcc 121 acccggtggc tcctggctgc agctggagta gagtttgaag agaaatttat aaaatctgca 181 gaagatttgg acaagttaag aaatgatgga tatttgatgt tccagcaagt gccaatggtt 241 gagattgatg ggatgaagct ggtgcagacc agagccattc tcaactacat tgccagcaaa 301 tacaacctct atgggaaaga cataaaggag agagccctga ttgatatgta tatagaaggt 361 atagcagatt tgggtgaaat gatcctcctt ctgcccgtat gtccacctga ggaaaaagat 421 gccaagcttg ccttgatcaa ggagaaaata aaaaatcgct acttccctgc ctttgaaaaa 481 gtcttaaaga gccatggaca agactacctt gttggcaaca agctgagccg ggctgacatt 541 catctggtgg aacttctcta ctacgtcgag gagcttgact ccagtcttat ctccagcttc 601 cctctgctga aggccctgaa aaccagaatc agcaacctgc ccacagtgaa gaagtttcta 661 cagcctggca gcccaaggaa gcctcccatg gatgagaaat ctttagaaga agcaaggaag 721 attttcaggt tttaataacg cagtcatgga ggccaagaac ttgcaatacc aatgttctaa 781 agttttgcaa caataaagta ctttacct // LOCUS MUSIGKACY 321 bp ss-mRNA ROD 01-AUG-1990 DEFINITION Mouse lysozyme-binding Ig kappa chain (HyHEL-10) V23-J2 region mRNA, partial cds. ACCESSION M35667 KEYWORDS immunoglobulin light-chain; kappa-immunoglobulin; processed gene; variable region VK23. SOURCE Mouse hybridoma, cDNA to mRNA, clone 10K-106. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 321) AUTHORS Mainhart,Smith-Gill-S.J., Lavoie,C., Feldman,T.B., Drohan,R.J. and Brooks,W.B.R. TITLE A three-dimensional model of an anti-lysozyme antibody JOURNAL J. Mol. Biol. 194, 713-724 (1987) STANDARD simple staff_entry FEATURES from to/span description pept < 1 > 321 lysozyme binding Ig kappa chain V23-J2 region (AA at 1) recomb 285 286 V23 region end/J2 region start BASE COUNT 88 a 80 c 75 g 78 t ORIGIN 1 gatattgtgc taactcagtc tccagccacc ctgtctgtga ctccaggaaa tagcgtcagt 61 ctttcctgca gggccagcca aagtattggc aacaacctac actggtatca acaaaaatca 121 catgagtctc caaggcttct catcaagtat gcttcccagt ccatctctgg gatcccctcc 181 aggttcagtg gcagtggatc agggacagat ttcactctca gtatcaacag tgtggagact 241 gaagattttg gaatgtattt ctgtcaacag agtaacagct ggccgtacac gttcggaggg 301 gggaccaagc tggaaataaa a // LOCUS MUSLTAGBSA 237 bp ds-DNA ROD 01-AUG-1990 DEFINITION Mouse SV40 transformed large T-antigen binding site DNA. ACCESSION M35500 KEYWORDS large T antigen. SOURCE Mouse (strain BALB/c) SV40 transformed cell line SVA31E7 DNA, clone p27. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 237) AUTHORS Lane,D.P., Simanis,V., Bartsch,R., Yewdell,J., Gannon,J. and Mole,S. TITLE Cellular targets for SV40 large T-antigen JOURNAL Proc. R. Soc. Lond., B, Biol. Sci. 226, 25-42 (1985) STANDARD simple staff_entry FEATURES from to/span description binding 84 118 large T-antigen binding site BASE COUNT 67 a 65 c 60 g 45 t ORIGIN 1 ggatccatcc cataatcagc ctctaaacgc tgacaccatt gcatacacta gcaagatttt 61 gctgaaagaa ccctgatata gctgtctctt gtgaggctat gccggggcct agcaaacaca 121 gaagtggatg ctcacagtca gctagtggat cacagggccc ccaatggagg agctagagaa 181 agtacccaag gagctaaagg gatcctctac gccggacgca tcgtggccag tcaccgc // LOCUS PEAIVSS 350 bp ds-DNA PLN 01-AUG-1990 DEFINITION Pea legumin J gene, exons 1 and 2 (partial). ACCESSION M26771 KEYWORDS legumin. SOURCE Pea DNA, clone pSP65LegJi. ORGANISM Pisum sativum Eukaryota; Plantae; Embryobionta; Magnoliophyta; Magnoliopsida; Rosidae; Rosales; Fabaceaea. REFERENCE 1 (bases 1 to 350) AUTHORS Brown,J.W.S., Feix,G. and Frendewey,D. TITLE Accurate in vitro splicing of two pre-mRNA plant introns in a HeLa cell nuclear extract JOURNAL EMBO J. 5, 2749-2758 (1986) STANDARD simple staff_entry FEATURES from to/span description pept < 1 48 legumin J, exon 1 (AA at 3) 50 144 legumin J, exon 2 pre-msg < 1 > 350 legumin J mRNA and introns IVS 49 186 legumin intron BASE COUNT 117 a 74 c 71 g 88 t ORIGIN 1 gaatacacgg aattcgagct cgcccgggga tcccattcaa ccccaagagt aagtaatagt 61 gtatccatac attacattat ctcttataaa ttgttcatac agcatgctca ttcgattata 121 actttaaaag tttctaatgt ataatttgtt atactaaatc aatcacacgt aaatatgtgt 181 atgcaggtat tttaccttgg tgggaaccca gaaacagagt tccccgaaac acaggaggaa 241 caacaaggaa ggcatcggca aaagcatagt taccctgttg gacgtaggag tggacatcac 301 caacaagaag aggaatggga tcctctagag tcgacctgca gcccaagctt // LOCUS RATCGM1AA 3190 bp ss-mRNA ROD 01-AUG-1990 DEFINITION Rat carcinoembryonic antigen-related protein (CGM1) mRNA, complete cds. ACCESSION M32474 J05417 KEYWORDS carcinoembryonic antigen-related protein. SOURCE R.norvegicus (strain Sprague-Dawley) placenta day 18 of gestation, cDNA to mRNA. ORGANISM Rattus norvegicus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 3190) AUTHORS Rebstock,S., Lucas,K., Thompson,J.A. and Zimmermann,W. TITLE cDNA and gene analysis imply a novel structure for a rat carcinoembryonic antigen-related protein JOURNAL J. Biol. Chem. 265, 7872-7879 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by W.Zimmermann, 02-MAR-1990. FEATURES from to/span description pept 122 2251 carcinoembryonic antigen-related protein precursor (CGM1) sigp 122 220 carcinoembryonic antigen-related protein signal peptide matp 221 2248 CGM1 protein BASE COUNT 871 a 798 c 693 g 828 t ORIGIN 1 gggaagtgct cctccttgag aggacaccta gctcaagagg aggaaagaca ataacagtta 61 ggtgccttgc tggaacgaaa gctcctctcc taagagtgag gccattctag tgagaagaca 121 gatggagctg tcctctgtgc ttccctgcaa gaggtgtact ccctggcggg ggctcctgct 181 cacagcctcc ctcttaacct gctggctcct gcccaccact gcccaagtct ccattgaatc 241 cttaccaccc caggtggttg aaggagaaaa tgttcttcta catgttgaca atttgccaga 301 gaatctcata gcctttgtct ggtacaaagg gctgacaaac atgagcctcg gagttgcact 361 gtattcacta acctataacg taactgtgac gggacctgtg cacagtggta gagagacatt 421 gtacagcaat gggtccctgt ggatccaaaa tgtcacccag aaggacacag gattctacac 481 cctacgaacc ataagtaatc atggagaaat tgtatcaaat acatccctgc accttcatgt 541 gtacttctcc actttgacct gtggacgcgc tgccacctct gctcagctca gtattgaatc 601 agtgccgacc agcatctcta aaggagaaag cgctcttctc cttgctcaca atctcccaga 661 gaatctccga gccattttct ggtacaaggg ggcgattgtg ttcaaggacc ttgaggttgc 721 tcgatatgta ataggcacaa attcaagtgt gccggggcct gcccacagcg gcagagagac 781 aatgtacagc aatggatccc tcctgcttca gaatgtcact cggaacgatg ctggattcta 841 caccttaaaa actctgagta cagatctgaa aactgaaata gcctatgtgc aactccaggt 901 ggacacctgt tttatgagct atgctggccc tcccacttct gcccagctca ctgtcgaatc 961 agcgcctacc agcgttgctg aaggagcaag cgttcttctc cttgttcaca atctccctga 1021 gaatctccga gccattttct ggtataaagg ggtgattttg ttcaaggacc ttgaggttgc 1081 tcgatatgta ataggcacaa attcaagtgt gctggggcct gcccacagcg gcagagagac 1141 aatgtacagc aatggatccc tcctgcttca gaatgtcact cggaacgatg ctggattcta 1201 caccttaaga actctgagta cagatctgaa agctaaagta gtacatgtgc aactccaggt 1261 gaacacctcc tcgtgctgtg accctctcac tcctgcccta ctcacgatag acccagtgcc 1321 acggcatgcg gctaaagggg aaagtgttct tcttcaagtt cgcaatctgc cagaggatct 1381 gcgaatgttt atctggttca aatctgtgta cacctcccag atctttaaaa tagcagagta 1441 cagcagagcc attaattatg tcttcagggg ccctgcacac agcggaagag agacagtgta 1501 caccaacgga tccctgctgc tccaggatgc cactgagaaa gacacgggct tgtacacact 1561 acaaataata tacagaaatt tcaaaataga aacagcacac gttcaagtca gcgtgcacac 1621 ctgtgttcac ccttctacca ctggccagct tgtaatcgaa tcggtgccac ccaatgttgt 1681 tgaaggggga gacgttctcc tacttgttca taatatgcca gagaaccttc aatccttttc 1741 ctggtacaaa ggcgtagcca ttgtcaacag acatgaaatc tctcggaaca taatagccag 1801 taatagaagc acgttggggc ctgctcacag tggcagagag acaatatatt ctaatggctc 1861 tcttctgctc cacaatgcca ccgaggagga caatggatta tacaccttat ggactgtaaa 1921 cagacattct gaaactcaag ggatacacgt gcacatccac atatacaagc ctgtggcaca 1981 gccctttatc cgagtcactg aatcctcagt cagagtgaag agctctgtgg tcctcacctg 2041 cctctcagct gacactggaa cctccatcca gtggctcttc aacaaccaga atctgcggct 2101 cacacagagg atgtcactgt cccagactaa gtgccaactc agcatagatc ccgtcaggag 2161 ggaggatgct ggagagtata ggtgtgaggt ctccaacccg gtcagttcga agacgagcct 2221 cccagtcagc ctggatgtga tcattgagtg accccccacc ttctctcatc ctacagcaga 2281 gtgggggaca tttctttatc aatgggtaca aaatggagca aaattatgtg gtgaaaattg 2341 tcagttgcta ctcaggtaca gtcagcatgt tgagtcatgt ctgtatccct aggataaaca 2401 tgtacaagga caagccagaa catagagact cagtttccaa aaaaaagaaa acatcaatac 2461 agtaaacagt attgtagtgg tgttaagagt taggttgtgg atcaaataca tagccaatcc 2521 tcagaatcca tgggaactaa tttcaggagc caccaatatt ctgtatgctc caagtcccct 2581 gttagcatgg tgcagtgact tcatagagat aaatgcatct tttgcatgct taagtatatt 2641 ctgtgtataa ctaattcaca tagtaccatt actgtctggg caccagttat ccatgtgaag 2701 aaaggacaag caacaggaga agggactgcc ctttcccagt ggacataact tgtgtctaaa 2761 tagtttgatc cacagttggg tgtaacattc atagcagaga cccaactctg gactctgtat 2821 atcctgacag tggcattcat aagattctta ttcctgtttt ttcttccttc cttccttcct 2881 tccttccttc cttacttctg aagggcatat atgggatttc ccattttgag tattttgaag 2941 tgggcaatta acatgaaaca cactcatatt gtcatgtgac caataaatgt tgtccattct 3001 caaagcattt tcaactcctc ccattctctc tagccccgtg taatcccatc tactggtgtt 3061 tctatgcatg tgacaaaaac aggatatcta attgcttttg gtcaatatta gtttacagag 3121 tacagctcag ctggatgtgt ttgctcacca gttccagaaa cttctgtaga ctctaggttt 3181 ttctccaaat // LOCUS RATCGM1AC1 2238 bp ds-DNA ROD 01-AUG-1990 DEFINITION Rat carcinoembryonic antigen-related protein (CGM1) gene, exons 1 and 2. ACCESSION M32476 J05417 KEYWORDS carcinoembryonic antigen-related protein. SEGMENT 1 of 8 SOURCE R.norvegicus (strain Sprague-Dawley) liver DNA, clone lambda-rnCGM1-1. ORGANISM Rattus norvegicus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 2238) AUTHORS Rebstock,S., Lucas,K., Thompson,J.A. and Zimmermann,W. TITLE cDNA and gene and analyses imply a novel structure for a rat carcinoembryonic antigen-related protein JOURNAL J. Biol. Chem. 265, 7872-7879 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by W.Zimmermann, 02-MAR-1990. FEATURES from to/span description pept 704 767 carcinoembryonic antigen-related protein (CGM1) precursor, exon 1 1770 + 2129 carcinoembryonic antigen-related protein, exon 2 sigp 704 767 carcinoembryonic antigen-related protein (CGM1) signal peptide 1770 1804 carcinoembryonic antigen-related protein signal peptide matp 1805 + 2129 carcinoembryonic antigen-related protein pre-msg 512 > 2238 CGM1 mRNA and introns IVS 768 1769 CGM1 intron A IVS 2130 > 2238 CGM1 intron B BASE COUNT 615 a 532 c 539 g 552 t ORIGIN 1 ctcacccaac aacagctcag ccaacacata atattgaaag gtgctttgaa cccctccata 61 ggaagaagaa cagtctcttc caagacacac aggtcacctc ttcccaacat ccagcacatg 121 aaatttgtca cacaactgct ccaggacctc tctcctgggt cagaaacttg actggtgaca 181 ttagtgataa aggattaatc ttcatcccca ctcagtccct ttccaaccct cacagatatc 241 tgtcgccttc ctgctgggaa ataccacctt cccagaacac ggaagacaca gggcagactg 301 ggtgctcaac tgggtctctg tgtcacaggg acgcatgggt aggatggagg cttcctcttt 361 ggtgctgaca gattcaagac caggactcag cagatgtcct ggcatgagcc attgttctct 421 gagggcatgg ggatgtttgt cagcacagct cctcaaggtg ttgcctggag gagaagcaca 481 aagatagaaa agttgagacg gatgcagggt agcattgaga gtggaaggga cagagcagtg 541 ccttggacac agaccccgac caccccacaa tccacagatt ctgggaagtg ctcctccttg 601 agaggacacc tagctcaaga ggaggaaaga caataacagt taggtgcctt gctggaacga 661 aagctcctct cctaagagtg aggccattct agtgagaaga cagatggagc tgtcctctgt 721 gcttccctgc aagaggtgta ctccctggcg ggggctcctg ctcacaggta agggtgctta 781 ctccatggtt gtgtgtgggg tgggggaggc ccagagtctc ctgaaatgga cagaatcctt 841 agggaagatg tgtagtttct gtttgtaatc atgttataga aggtgcagtg agggaacagg 901 aagctctgag gcagacagga gctgaggagc agaatagaaa aggcctcagc tgcaattatt 961 caaattcagt cacagggtga atctccaaat agaaatcaaa catgggaggg cagtgagatg 1021 gctcagtgtg tggatacagg acagtctgaa ttcactcctc agctctcaca gcatagatgg 1081 acatacagac tcctgaaggc tcttctcttc cctccacact ggtgtgtgtc acgtacctgt 1141 agtgtgcaca ctgggacatg taccttccca aaccctcacg aacaatacag aaatattaaa 1201 ttacacttga atataattat ttttatgtgc tataaacatg gaaattatgt agacaaaccc 1261 agagatatct tttcttcctt ccttccttcc ttcttccttc cttccttcct tcctcttttt 1321 ccatactagt ttctgagatt ttttgaggaa ctgaaccttc caaaaagacc ataccaatcc 1381 ctgtcctcaa aaagcctttt ttattctaat ggactggaaa tcattgtatc cagaggagaa 1441 agtcaatgat ttagtggaac cataaataga acagaaaaca ttcaggaagt gaggattgta 1501 tggaggagga aaaagaggag gaggaggagg aagaggagga ggaggaggag gaggaccgag 1561 agccggttct ccactcacca gacactttat ggaaagagtg atatggggac acctgagtag 1621 aggattccac agagaggaaa tgacaccctt tgaggttctg agggcatgga ggtcatgctg 1681 ctcacctcca ttaagggtgc atcctaccta caggctgagg gatgctcaca cctgctcagg 1741 attgtcaact tttctctctt cccttctagc ctccctctta acctgctggc tcctgcccac 1801 cactgcccaa gtctccattg aatccttacc accccaggtg gttgaaggag aaaatgttct 1861 tctacgtgtt gacaatttgc cagagaatct catagccttt gtctggtaca aagggctgac 1921 aaacatgagc ctcggagttg cactgtattc actaacctat aacgtaactg tgacgggacc 1981 tgtgcacagt ggtagagaga cattgtacag caatgggtcc ctgtggatcc aaaatgtcac 2041 ccagaaggac acaggattct acaccctacg aaccataagt aatcatggag aaattgtatc 2101 aaatacatcc ctgcaccttc atgtgtactg taagtaattc tttgtgaatt ctgggttatg 2161 ggtggggtcc ttccactaga cacacagaag tgtcaggcct ggcttgtgct cccttccttc 2221 tgcattgatc tacatgtt // LOCUS RATCGM1AC2 539 bp ds-DNA ROD 01-AUG-1990 DEFINITION Rat carcinoembryonic antigen-related protein (CGM1) gene, intron B. ACCESSION M32477 J05417 KEYWORDS carcinoembryonic antigen-related protein. SEGMENT 2 of 8 SOURCE R.norvegicus (strain Sprague-Dawley) liver DNA, clone lambda-rnCGM1-1. ORGANISM Rattus norvegicus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 539) AUTHORS Rebstock,S., Lucas,K., Thompson,J.A. and Zimmermann,W. TITLE cDNA and gene analyses imply a novel structure for a rat carcinoembryonic antigen-related protein JOURNAL J. Biol. Chem. 265, 7872-7879 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by W.Zimmermann, 02-MAR-1990. FEATURES from to/span description IVS < 1 > 539 carcinoembryonic antigen-related protein intron B BASE COUNT 126 a 127 c 138 g 148 t ORIGIN 1 ccctgattcc agacctctgt tacagactta tctcctcatg gccccgagaa tcatcttact 61 agggctggct ttgcctctct ctcagcagag accagtgctt ttgagtagtg aaagtatttt 121 gctatgtgta agcagacagt gcattgcaat gagagccatg ttggttaggt ctcctggatg 181 tccctagtga ctcagcaggg tgaggatagg cagcaggtgc ccagtccatc atctaactct 241 tctaatggtc ttaggaaact ttcaggaagg tcaggatccc taaagagagg gacagaggac 301 acaggtcctc ctgacaactt cttgtcttct ggggacagtt cagtgatttc tcctctgcgt 361 gcacaggctc tgctgatgtg gacaggtcct tgtgaggcaa gtggatctgt gtccccaggc 421 aaaaactgag aaggttgagt agattcagaa accctggtaa attttcatat ctgagaatgg 481 tagacctttg atctactctg gacctggttc ctgtcctgga gcatgtgacc atgacaccc // LOCUS RATCGM1AC3 828 bp ds-DNA ROD 01-AUG-1990 DEFINITION Rat carcinoembryonic antigen-related protein (CGM1) gene, intron B. ACCESSION M32478 J05417 KEYWORDS carcinoembryonic antigen-related protein. SEGMENT 3 of 8 SOURCE R.norvegicus (strain Sprague-Dawley) liver DNA, clone lambda-rnCGM1-1. ORGANISM Rattus norvegicus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 828) AUTHORS Rebstock,S., Lucas,K., Thompson,J.A. and Zimmermann,W. TITLE cDNA and gene analyses imply a novel structure for a rat carcinoembryonic antigen-related protein JOURNAL J. Biol. Chem. 265, 7872-7879 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by W.Zimmermann, 02-MAR-1990. FEATURES from to/span description IVS < 1 > 828 carcinoembryonic antigen-related protein intron B BASE COUNT 193 a 255 c 190 g 190 t ORIGIN 1 aaaaagctgg attggctctc cctccaaccc ctgtgcctgt ctgccctgat gcactgggct 61 cactgaaggc cctcagacca gtccccactc accgagagtc ccaaaggtgt ctgaatgacc 121 aggaatttga gaaccccagc ttcagcccca gcccatgttg tttctcacct ggggccctca 181 ttttgcccca taatatagcc taatgcctcc catttcatct gcctgagctg tgttcacaaa 241 cccagttgta aggtggaaag gggatccaca attcctcaga aatgagctga agttcctata 301 agtgaccagg aggaggcagc atcaggaagt acaatgacta cttagggaag tattttctgt 361 accaggaacc caccttgtat cctggctttt atctctgttc ccatagacct ggaggtcatt 421 ggcacagctt ctcagacctc tcagctgctt cctgtatctg ctgccccacc aaggatcatg 481 ttcgcattcc tgacattcat tttctctggg aaagcaaggg tgtctatggg aagcacctag 541 acagaggttc aaggcatctc agaaaggcac gcagcacatg ggcagagcac ctcacagctc 601 aggacacaga ggaagtgtgc ccaccatctt gaatccctgc atgggacgat ggagcccaga 661 gcagtccttc caggactcag gtcacctcct cccacacact caggaagtga ggctcctgac 721 acagctgctc ctgggcccct tttctccctg agaatcctga ctggtgactg cagtgagaac 781 gcatctgtcc cctcccccac tcgtcacaca gctggcccct tgggatcc // LOCUS RATCGM1AC4 642 bp ds-DNA ROD 01-AUG-1990 DEFINITION Rat carcinoembryonic antigen-related protein (CGM1) gene, exon 3. ACCESSION M32479 J05417 KEYWORDS carcinoembryonic antigen-related protein. SEGMENT 4 of 8 SOURCE R.norvegicus (strain Sprague-Dawley) liver DNA, clone lambda-rnCGM1-1. ORGANISM Rattus norvegicus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 642) AUTHORS Rebstock,S., Lucas,K., Thompson,J.A. and Zimmermann,W. TITLE cDNA and gene analyses imply a novel structure for a rat carcinoembryonic antigen-related protein JOURNAL J. Biol. Chem. 265, 7872-7879 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by W.Zimmermann, 02-MAR-1990. FEATURES from to/span description pept + 61 + 420 carcinoembryonic antigen-related protein (CGM1), exon 3 matp + 61 + 420 carcinoembryonic antigen-related protein pre-msg < 1 > 642 CGM1 mRNA and introns IVS < 1 60 CGM1 intron B IVS 421 > 642 CGM1 intron C BASE COUNT 151 a 160 c 154 g 177 t ORIGIN 1 ggtgccatct tagccaaata caaaagccct aatgttgatg gatctctgtc ttccttctag 61 tctccacttt gacctgtgga cgcgctgcca cctctgctca gctcagtatt gaatcagtgc 121 cgaccagcat ctctaaagga gaaagcgctc ttctccttgc tcacaatctc ccagagaatc 181 tccgagccat tttctggtat aaaggggcga ttgtgttcaa ggaccttgag gttgctcgat 241 atgtaatagg cacaaattca agtgtgccgg ggcctgccca caacggcaga gagacaatgt 301 acagcaatgg atccctcctg cttcagaatg tcactcggaa cgatgctgga ttctacacct 361 taaaaactct gagtacagat ctgaaaactg aaatagccta tgtgcaactc caggtggaca 421 gtaagtagtt ctctgtgatc attcagtgtt ggtccaggtt tagacacaca gcagtgtttt 481 cttgctctgt acctgccttc cctctgcact ttgtccccat gtaagtattt gagaactttg 541 tgcaagacac acatggtggt ttctgactcc accctcagag agtatcgtgt acgcatgcgt 601 gcgtgcgtgc gtgcgtgcgt gcgtgtgtgt gtgataggaa gg // LOCUS RATCGM1AC5 616 bp ds-DNA ROD 01-AUG-1990 DEFINITION Rat carcinoembryonic antigen-related protein (CGM1) gene, exon 4. ACCESSION M32480 J05417 KEYWORDS carcinoembryonic antigen-related protein. SEGMENT 5 of 8 SOURCE R.norvegicus (strain Sprague-Dawley) liver DNA, clone lambda-rnCGM1-1. ORGANISM Rattus norvegicus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 616) AUTHORS Rebstock,S., Lucas,K., Thompson,J.A. and Zimmermann,W. TITLE cDNA and gene analyses imply a novel structure for a rat carcinoembryonic antigen-related protein JOURNAL J. Biol. Chem. 265, 7872-7879 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by W.Zimmermann, 02-MAR-1990. FEATURES from to/span description pept + 90 + 449 carcinoembryonic antigen-related protein (CGM1), exon 4 matp + 90 + 449 carcinoembryonic antigen-related protein pre-msg < 1 > 616 CGM1 mRNA and introns IVS < 1 89 CGM1 intron C IVS 450 616 CGM1 intron D BASE COUNT 152 a 153 c 147 g 164 t ORIGIN 1 ggaatggaga cctcagctca gggtacaggg cgccatctta gtcaaataca aacaccccaa 61 tattaatgga tctctctctt cttttctagc ctgttttatg agctatgctg gccctcccac 121 ttctgcccag ctcactgtcg aatcaggccc taccagcgtt gctgaaggag caagcgttct 181 tctccttgct cataatctcc ctgagaatct ccgagccatt ttctggtata aaggggcgat 241 tttgttcaag gaccttgagg ttgctcgata tgtaataggc acaaattcaa gtgtgccggg 301 gcctgcccac agcggcagag agacaatgca cagcaatgga tccctcctgc ttcagaatgt 361 cactcggaac gatgctggat tctacacctt aagaactctg agtacagatc tgaaagctaa 421 agtagtacat gtgcaactcc aggtgaacag taagtgaatc tctgtgatta gtctgtgctg 481 ggtggggcta gacacacagg aatgtccttt ctggcctgtg catagtgtcc ccatgttgag 541 gtttgggcgc ttagtgcaag acaaacatgg cggagacaaa ttgccataga tcagacttca 601 ttgtctgatt cccttc // LOCUS RATCGM1AC6 654 bp ds-DNA ROD 01-AUG-1990 DEFINITION Rat carcinoembryonic antigen-related protein (CGM1) gene, intron 4. ACCESSION M32481 J05417 KEYWORDS carcinoembryonic antigen-related protein. SEGMENT 6 of 8 SOURCE R.norvegicus (strain Sprague-Dawley) liver DNA, clone lambda-rnCGM1-1. ORGANISM Rattus norvegicus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 654) AUTHORS Rebstock,S., Lucas,K., Thompson,J.A. and Zimmermann,W. TITLE cDNA and gene analyses imply a novel structure for a rat carcinoembryonic antigen-related protein JOURNAL J. Biol. Chem. 265, 7872-7879 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by W.Zimmermann, 02-MAR-1990. FEATURES from to/span description IVS < 1 > 654 carcinoembryonic antigen-related protein intron D BASE COUNT 185 a 146 c 155 g 168 t ORIGIN 1 tctcgatgta tgttccccta agaaagacct caatcaggca ggacgctggt tgaggaaagg 61 atggcatcct aagagaggtg agcaccagga agaaccttga ctgcacacat ctgtatgaat 121 ctcaacaact tgtgacccaa gagaacattt tgtcagggct agactattaa ctctcagagc 181 tgacagagaa caatggtgtt ggctgtctat gtcaaaccgg ggtagatatt ttctccaaac 241 atgagtttca tatataaaat ctagaaactt tacagagccc atggaggggt gctgcttatg 301 ggcttgctcc ttgttgcttg ctcagcctgg tttcttatag cacccaggat ccccagtgga 361 ctggactctt ccctatcaat aaccaattag gaaatgtact ctgggcttgc acaggccaat 421 atggtggtga ttttacaact gaggctccct ctttcaaatc taatcgagca tgttgaagtt 481 ggcacagagc cagccagcat agttcctgat ccttttctga gacttgagcc tgccaagagt 541 atcagattgc ttccagccct cacccatctc tagacctgtg ggttggagag cacggtagca 601 agaacattta gaagtaaaaa tggagttgaa tggagccaca aaggaaactg agaa // LOCUS RATCGM1AC7 492 bp ds-DNA ROD 01-AUG-1990 DEFINITION Rat carcinoembryonic antigen-related protein (CGM1) gene, exon 5. ACCESSION M32482 J05417 KEYWORDS carcinoembryonic antigen-related protein. SEGMENT 7 of 8 SOURCE R.norvegicus (strain Sprague-Dawley) liver DNA, clone lambda-rnCGM1-1. ORGANISM Rattus norvegicus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 492) AUTHORS Rebstock,S., Lucas,K., thompson,F.A. and Zimmermann,W. TITLE cDNA and gene analyses imply a novel structure for a rat carcinoembryonic antigen-related protein JOURNAL J. Biol. Chem. 265, 7872-7879 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by W.Zimmermann, 02-MAR-1990. FEATURES from to/span description pept + 118 + 471 carcinoembryonic antigen-related protein (CGM1), exon 5 matp + 118 + 471 carcinoembryonic antigen-related protein pre-msg < 1 > 492 CGM1 mRNA and introns IVS < 1 117 CGM1 intron D IVS 472 492 CGM1 intron E BASE COUNT 134 a 134 c 105 g 119 t ORIGIN 1 aaatgtctac acctgcatct aggctgagtg aagagtccat ctgctcagga tggaggtcgc 61 catctttcca ccaagcacag tgatcccatg tgatgacttt tctcctttcc cttccagcct 121 cctcgtgctg tgaccctctc actcctgccc cactcacgat agacccagtg ccacggcatg 181 cggctaaagg ggaaagtgtt cttcttcaag ttcgcaatct gccagaggat ctgcgaatgt 241 ttatctggtt caaatctgtg tatacctccc agatctttaa aatagcagag tacagcagag 301 ccattaatta cgtcttcagg ggccctgcac acagcggaag agagacagtg tacacgaatg 361 gatccctgct gctccaggat gccactgaga aagacacagg cttgtacaca ctacaaataa 421 tatacagaaa tttcaaaatt gaaacagcac acgttcaagt cagcgtgcac agtaagtgac 481 tctcaaggtc tc // LOCUS RATCGM1AC8 1341 bp ds-DNA ROD 01-AUG-1990 DEFINITION Rat carcinoembryonic antigen-related protein (CGM1) gene, exon 6. ACCESSION M32483 J05417 KEYWORDS carcinoembryonic antigen-related protein. SEGMENT 8 of 8 SOURCE R.norvegicus (strain Sprague-Dawley) liver DNA, clone lambda-rnCGM1-1. ORGANISM Rattus norvegicus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 1341) AUTHORS Rebstock,S., Lucas,K., Thompson,J.A. and Zimmermann,W. TITLE cDNA and gene analyses imply a novel structure for a rat carcinoembryonic antigen-related protein JOURNAL J. Biol. Chem. 265, 7872-7879 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by W.Zimmermann, 02-MAR-1990. FEATURES from to/span description pept + 640 / 987 carcinoembryonic antigen-related protein (CGM1), exon 6 matp + 640 / 987 carcinoembryonic antigen-related protein pre-msg < 1 > 1341 CGM1 mRNA and introns IVS < 1 639 CGM1 intron E IVS 988 1341 CGM1 intron F BASE COUNT 357 a 329 c 321 g 334 t ORIGIN 1 ctacatacca tcccacccca tggcccacat atgcataaac taactgaagt attaaccagt 61 gtcagtagct ctgaatatga gaatttcatc aacacctgga catgcaagga cttgagacat 121 cagtctttta tccacccaca tgtatctgag tctgttcagg cactgaacct tcctaaaaga 181 tcaaactagt ctttcctatc aggactctag ctctagtcga cgtcgactgg acgacagaca 241 aggaagctca ctttgaagtg aagtcaggga ttgaatggaa ccagaaaagg actatgtcaa 301 agagagcaga aggtaaaggt cttcctctgt agaggaagag gtgatggaag gtaccctcat 361 cctccacatc tcctgagtgt gagcaggcac gtgaggacag ggagggtgga gacacgtgag 421 gacagagttt cacgggtagc agaggaagct acacacagtc aggtgcacca agggcatgga 481 ggtcgtttgc tcactccctc tgggttgtgc agacattgcc tcccacccga tgagtgatgg 541 atctaagcta ctctggtcac aggaccacat cttttcacca acggcagagg cgtcaatatt 601 gatggatttg tctctcttct tttctatctg cccttttagc ctgtgttcac ccttctacca 661 ctggccagct tgtaatcgaa tcggtgccac ccaatgttgt tgaaggggga gacgttctcc 721 tacttgttca taatatgcca gagaaccttc aatccttttc ctggtacaaa ggcgtagcca 781 ttgtcaacag acatgaaatc tctcggaaca taatagccag taatagaagc acattggggc 841 ctgctcacag tggcagagag acaatatatt ctaatggctc tcttctgctc cacaatgcca 901 ccgaggagga caatggatta tacaccttat ggactgtaaa cagacattct gaaactcaag 961 ggatacacgt gcacatccac atatacagta agtaattctc tgagatgtct tggtgctggt 1021 ggggttgaac ccatgttaca cacacaggag tgtcaggtgt gaactatgcc tttcttgctc 1081 tccatgtgtc tccatgttgg agtttgaggt gcaggcatat gcctagtaga cgtacggaaa 1141 tgggtcagaa tccctcaccg tctccacctg cagaacaggt gtggagatct cgtgtgacct 1201 gccgtgacag ctgcagtcat ctaggtcacc tgtgcacctc cttctcctga gcctcagtgg 1261 acaagtgcca gaacagaata caactttctt atgggcttag gagactcaca ggaaggtcag 1321 atccgttgcc tgacggtcga c // LOCUS RATCGM4AA 4627 bp ds-DNA ROD 01-AUG-1990 DEFINITION Rat carcinoembryonic antigen-related protein (CGM4) gene, exons 2 and 3. ACCESSION M32475 J05417 KEYWORDS carcinoembryonic antigen-related protein. SOURCE Rat (strain Sprague-Dawley) liver DNA, clone lambda-rnCGM415-1. ORGANISM Rattus norvegicus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (sites for [2]) AUTHORS Rebstock,S., Lucas,K., Thompson,J.A. and Zimmermann,W. TITLE cDNA and gene analyses imply a novel structure for a rat carcinoembryonic antigen-related protein JOURNAL J. Biol. Chem. 265, 7872-7879 (1990) STANDARD full staff_review REFERENCE 2 (bases 1 to 4627) AUTHORS Rebstock,S., Lucas,K., Thompson,J.A. and Zimmermann,W. JOURNAL Unpublished (1990) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence [1] kindly submitted by W.Zimmermann, 02-MAR-1990. FEATURES from to/span description pept / 145 489 carcinoembryonic antigen-related protein (CGM4), exon 2 (AA at 147) 3379 / 3738 carcinoembryonic antigen-related protein exon 3 pre-msg < 1 > 4627 CGM4 mRNA and introns IVS < 1 144 CGM4 intron B IVS 490 3378 CGM4 intron C IVS 3739 > 4627 CGM4 intron D BASE COUNT 1213 a 1165 c 1162 g 1087 t ORIGIN 1 agatctgggt cgacctgcag gtcaacggat ctgggcctta gcaggagtgt gggcagagct 61 ctgggaaggc agaagtgtga ttttttaaaa aaccaacaga tttcacctgc tcaatatcga 121 tggttgctct gtcttccctt ttagcctccc ttctaacctg ttggctcctg actactgccc 181 aggtcaacat tgaatcggtg ccattcaatg tggttgaagg ggaaaacgtc cttcttcttg 241 tccacaatct gccagagaat ctcatagcct ttgcctggta tagagggctg aggaaaattg 301 gagtatacat actgaacact gaagtaagtg tgacggggcc aatgtacagc ggtagagaga 361 cagtgtacag caatggttcc ctgtgtatcc gcaatgtcac ccagaaggac acaggattct 421 acactctacg aacagtcaac acacgtggag aaactgtatc aacaacatcc ttgtacctct 481 atgtgtacag taagtgatac tttgtgaact ctgggtgttg tgtggggttc attccgtaga 541 cacacacaga agaggcaggc ctacctaccc tttgcattgt gtctccttat tgaggtgtga 601 acatttaact caggctaagg agagtaatgc caattgaata gaatccttct tttgacttta 661 ccttgtagtc agctggatgt gtggttaact cagtgaagga catcagccct tgtctagact 721 tctggggttc ttagcagtaa tgtgtccttg ggaaagacct tgagggaagg agattgggtt 781 tgaatgagat agccatagga tcctcatgga agtgagaacc agaaagccct ggctccagac 841 ctctgtcctg actcatctcc tgatggcccc gagaagcatt ttacaaaggc tggattctga 901 catctgttgg cagggaacag tgcttttgag gagcaaatcc ttgtgccaca tacaatcacc 961 tggtgcacgg ccatgagagc cacagttagg cgaggtctcc tggatctctc cagtgactca 1021 tcagggagag aatagaaaga cagatgtccc ggccactaag ttaactgtta tgatggcctt 1081 atgagacttc caggaaggtc atggttgcca ggaagaggga caaaggacac agatccccct 1141 gacagttgct tgtcctttgg ggtccagctc atagaagtct gtccgcaggc aaatgacacc 1201 aggctctgct gatgtggata gctccccaga tctgagctgc agttctccca gcgatcacga 1261 gggccgcctc agggaaacac aattaacacc cagaagagta tttgtctaaa ccaggaactt 1321 acctcctcct ctggctagct cccctgttcc tacagacatg ggggtcacac agccttctca 1381 gacctaccag ctgcctcctt ttctgctgcc ttgctaggga attatgtgta gtggctgctt 1441 tgtgtatttt ctttggaaaa gatagagtat cctaagggaa tcacccagac agaggttcaa 1501 ggcatctctg aaaggccagg cagcacatgg cagagccacc tcacagctca ggacccagag 1561 gaagtgtgcc caccatcttg aatccatgca tgggacgatg gagcccagag ctacgttcca 1621 ggactcaggt cacctcccac acactcaaga agtgaggctc ctgacacagc tgctcctggg 1681 ccccttttct ccctgagaat cctgactggt ggctgcagtg agaacacatc tgtcccctcc 1741 cccactcgtc acacagctgg cccttgggat cctcacacac atctctgtct ccttcctcct 1801 gagagcaaac tacctctttg acgggcactg agaacacagg gcagactggg tgcccagctg 1861 gttctgggtc acccagggag tgcagaggct cactcactgg tgctgactga gccaggaaga 1921 ggccagaaca gagggatgcc ccccgggtga gctgctgtct tcttagggca cagagatgct 1981 cagaggtttg tttgtcactg tgagctctgt ggcatgagac agaaagagcc cagaggagag 2041 gttaggtgtg taggactgag tgtgcacagg gcagagaaca gagttaccca cagcccacgg 2101 gactctggga tatgatcctg tctggcggag gctgagctca gaggatcaga gaacttggga 2161 gctgtattgg agcagatgtg ctacagactg aggacagatc tggccacaga gaccagggcg 2221 gtgctctgta ccatctgcaa acaatgcccc acctgttggt gctcctgctc acagatgagg 2281 agaccacatt ttacagtgtg tgagaggaga ggactcacct actgtctaaa gtctcttcaa 2341 ggggacaggg actggagaag agtttcaggt ttgtagggct gaaaacacta aagtataggg 2401 gctcatcatc atcatcatca ccaccgccat caccaccacc accaccacca ccaccaccac 2461 caccaccacc accaccacca ccatcatcat catcatcatg aggctcttgg taaataagaa 2521 gaagcagggg gaggaggaga ttattgtcaa cccacagttc accatcaatg agcccagtgt 2581 tctgaagact gaggttctca gctgtgatgc cccaaataag aaaccaagct ggtgttgatc 2641 agtgacatgg ctcagtggat ctgggtgttt gcttcatgtc tgacaacctg agaaccagtg 2701 aacacaagtt gtccctgacc tccacctagg gacggcgttt tgcacccaac acagacacac 2761 tgaggcatgc ccttgcacat gaactcatac accaatataa taagcaaatg cataaaaatt 2821 atagcaaatg gaagcagtca acactgtatt cccaaacata ctaatttgtt aaataaatcc 2881 atggccatgt attcattcat tcattcattc actcattcat ttactctcca agatatttga 2941 gttttctttt gcagtctttt ttttttaaaa gataatataa gacaaatccc agttctcatt 3001 attccctagc cctagactgg aagacgacca gtgaagaaag ctagaaggcg aatcagtcac 3061 taaaggacaa gaaacaaaag agtcagagtg tgacggtcgg gaggcttcac cccaacaccc 3121 atcgactgac actgagggtg agcagggatc tgaggacggt gaggcagggc catgttgaca 3181 cctgaggaga gagcagcata gagaggaaat gacaagtgag gggcgcggag tgcatggagg 3241 taatgcactg acctccacta gctagggcag ggagactccc acacctcagc tgaccactgg 3301 acacagctgc tcggactcag gcaccatctt agccaaatac taaagtcctg atgttgacgg 3361 atctctcttc ccttctagcc tctcttttca tctgtgggcg tccttttaac cctgccaagc 3421 tcactattga atcagtgccg cccagtgttg ctgaaggggg aagcgttctt ctcctcgttc 3481 acaatctcca ggacgagctt cgagggtttt tctggtacaa aggggcgtct atgtctagca 3541 accatgagat agcccgatac agaacagcaa agaattcaag tgtgccaggc cctgcccaca 3601 gtggtagaga gacggtgtac agcaatggat ccctcctgct ccagaatgtc acccggaatg 3661 acactgggtt ctacacccta cgcactctga aaagacatca gaaaatggaa ttggcacacg 3721 tgcaacttca ggtggacagt aagtgatttt ccgtgatcgt tcagtgctgg gtgggtcttt 3781 gacacacagg actgtcaccc ctggcatgtg gctacctcct ctctgccttt ttatccccat 3841 gttgtggtta accactatgt gcaggacaca tgtgatggaa agaaatgccc atgggtcaga 3901 cttatcatct gactctcccc tgtatcaagg acagtaactc aaccctaggt gctagactct 3961 gcccagtcat ctggggcatc ttgccatgca acgtgaggaa accatggatc ctcacagcgt 4021 ggtgagcacc aggaagctct gatctcagtc gtttgtccca gacttgactg caaatgtctc 4081 taggagcatt ttgtcaggag tgctgcttac tgcctctctc ctcacagcct gccatcctga 4141 tcttatagta acccaggaca ctgagcccag gggtgaaaat gctcccagtt gggctgggct 4201 ctcccacatc aatcaccaat taaaaatgta ctacaggtta gcccacaggt tattttggtg 4261 gtggcatttt aaattgaggc ccttgtttca aaaaattcta gcttgtgtta agttgacata 4321 aagccagcag cacgattcct gagccctccc caatacctat atctgccaag aagaccagac 4381 tgttcccacc catcatccgg ccttagtcct gggtgctata ggctgggacg tgagaacatg 4441 tggaatgtga agtctgagga tgaccgcagg tacaaaggag atgagaaagt cagagagtgt 4501 gtatccaggg tgtgtagaga ccaaaggtca ggggaggcat catcccaaag cacagtgtgc 4561 atgagtatgt gcaatgtctg aatgagggca gtgagggaca gccacggaga caccaaggac 4621 agagctc // LOCUS STMRGDA 2540 bp ds-DNA BCT 01-AUG-1990 DEFINITION S.coelicolor 16S rRNA gene and 23S rRNA, 5' end (rrnD) gene cluster. ACCESSION Y00411 M35377 KEYWORDS 16S ribosomal RNA; 23S ribosomal RNA. SOURCE S.coelicolor (strain 1147 A3(2)) DNA, clone RSC33. ORGANISM Streptomyces coelicolor Prokaryota; Bacteria; Firmicutes; Streptomycetaceae. REFERENCE 1 (bases 705 to 2230) AUTHORS Baylis,H.A. and Bibb,M.J. TITLE The nucleotide sequence of a 16S rRNA gene from Streptomyces coelicolor A3(2) JOURNAL Nucleic Acids Res. 15, 7176-7176 (1987) STANDARD simple staff_entry REFERENCE 2 (bases 1 to 771 and 2196 to 2540) AUTHORS Baylis,H.A. and Bibb,M.J. TITLE Transcriptional analysis of the 16S rRNA gene of the rrnD gene set of Streptomyces coelicolor A3(2) JOURNAL Mol. Microbiol. 2, 569-579 (1988) STANDARD simple staff_entry FEATURES from to/span description pept < 1 144 ORF (AA at 1) rRNA 536 > 2231 16S rRNA gene rRNA 537 > 2231 16S rRNA gene rRNA 704 2231 16S rRNA rRNA 2507 > 2540 pot. 23S rRNA BASE COUNT 573 a 654 c 834 g 478 t 1 others ORIGIN 1 tgggcccgca tcaccatcgg cgtcctcgcc gagctggcct tcctggccta cgtctacgtt 61 ctgggcggcc gagccgtgcg cgacggcgag acgggtgacg tcgaggcagc cgaacgcagc 121 gccacggtgc caacagccgc ctgatgtgca tccacccctg cgagctgcta gtgtcctctt 181 cgttcccgca agagccgttg acacggagcg agcggggagg tagattcgaa cagttgcctg 241 gagacgggtt caccccagag ggcaacagtg aacatctacc agcttctccg aatcaacgaa 301 ttcgacgaag cactctcccg atgaatcgga aacgaaggcc ggtaagaccg gctcgaaagt 361 tctgataaag tcggagccgc cggaaaggga aacgcgaaag cgggaacctg gaaagcgccg 421 aggaaatcgg atcggaaaga tctgatagag tcggaaacgc aagaccgaag ggaagcgccc 481 ggaggaaagc ccgagagggt gagtacaaag gaagcgtgcc gttccttgag aactcaacag 541 cgtgccaaaa gtcaacgcca gatatgttga taccccgacc tgatcggatc tccgttcggg 601 ttgaggttcc tttgaagtaa cacaacagcg aggacgctgt gaacggtcgg attattcctc 661 cgactgttcc gctctcgtgg tgtcacccga ttacgggtat acattcacgg agagtttgat 721 cctggctcag gacgaacgct ggcggcgtgc ttaacacatg caagtcgaac gatgaaccac 781 ttcggtgggg attagtggcg aacgggtgag taacacgtgg gcaatctgcc cttcactctg 841 ggacaagccc tggaaacggg gtctaatacc ggatactgac cctcgcaggc atctgcgagg 901 ttcgaaagct ccggcggtga aggatgagcc cgcggcctat cagcttgttg gtgaggtaat 961 ggctcaccaa ggcgacgacg ggtagccggc ctgagagggc gaccggccac actgggactg 1021 agacacggcc cagactccta cgggaggcag cagtggggaa tgttgcacaa tgggcgaaag 1081 cctgatgcag cgacgccgcg tgagggatga cggccttcgg gttgtaaacc tctttcagca 1141 gggaagaagc gaaagtgacg gtacctgcag aagaagcgcc ggctaactac gtgccagcag 1201 ccgcggtaat acgtagggcg caagcgttgt ccggaattat tgggcgtaaa gagctcgtag 1261 gcggcttgtc acgtcggttg tgaaagcccg gggcttaacc ccgccactgc agtcgatacg 1321 ggcaggctag agttcggtag gggagatcgg aattcctggt gtagcggtga aatgcgcaga 1381 tatcaggagg aacaccggtg gcgaaggcgg atctctgggc cgatactgac gctgaggagc 1441 gaaagngtgg ggagcgaaca ggattagata ccctggtagt ccacgccgta aacggtgggc 1501 actaggtgtg ggcaacattc cacgttgtcc gtgccgcagc taacgcatta agtgccccgc 1561 ctggggagta cggccgcaag gctaaaactc aaaggaattg acgggggccc gcacaagcgg 1621 cggagcatgt ggcttaattc gacgcaacgc gaagaacctt accaaggctt gacatacacc 1681 ggaaagcatc agagatggtg ccccccttgt ggtcggtgta caggtggtgc atggctgtcg 1741 tcagctcgtg tcgtgagatg ttgggttaag tcccgcaacg agcgcaaccc ttgtcccgtg 1801 ttgccagcaa gccttcgggg tgttggggac tcacgggaga ccgccgggtc aactcggagg 1861 aaggtgggga cgacgtcaag tcatcatgcc ccttatgtct tgggctgcac acgtgctaca 1921 atggccggta caatgagctg cgataccgca aggtggagcg aatctcaaaa agccggtctc 1981 agttcggatt ggggtctgca actcgacccc atgaagtcgg agtcgctagt aatcgcagat 2041 cagcattgct gcggtgaata cgttcccggg ccttgtacac accgcccgtc acgtcacgaa 2101 agtcggtaac acccgaagcc ggtggcccaa ccccttgtgg gagggagctg tcgaaggtgg 2161 gactggcgat tgggacgaag tcgtaacaag gtagccgtac cggaaggtgc ggctggatca 2221 cctcctttct aaggagcaca tagccgactg cagcgaaatg tcctgcacgg ttgctcatgg 2281 gtggaacgtt gactactcgg cacggtcttc ttgatggatc actagtactg cttcggcgtg 2341 gaacgtgact tcaaagaggg gttcgtgtcg ggcacgctgt tgggtatctg agggtacggc 2401 cgtgaggtcg ccttcagttg ccggccccgg taaaaatccg cgtgagtggg ttgtgacggg 2461 tggttggtcg ttgtttgaga actgcacagt ggacgcgagc atctgtggcc aagtttttaa 2521 gggcgcacgg tggatgcctt // LOCUS SUSCYIIAA 230 bp ds-DNA INV 01-AUG-1990 DEFINITION S.purpuratus cytoskeletal actin CyIIa gene, complete cds. ACCESSION M35321 M35322 KEYWORDS cytoskeletal actin SpG11A. SOURCE S.purpuratus DNA, clone pSpG11A. ORGANISM Strongylocentrotus purpuratus Eukaryota; Animalia; Eumetazoa; Echinodermata; Echinozoa; Echinoidea; Echinacea; Echinoida; Strongylocentrotidae. REFERENCE 1 (bases 1 to 230) AUTHORS Durica,D.S., Garza,D., Restrepo,M.A. and Hryniewicz,M.M. TITLE DNA sequence analysis and structural relationships among the cytoskeletal actin genes of the sea urchin Strongylocentrotus purpuratus JOURNAL J. Mol. Evol. 28, 72-86 (1988) STANDARD simple staff_entry FEATURES from to/span description pept 219 > 230 actin CyIIa BASE COUNT 74 a 43 c 33 g 80 t ORIGIN 1 ttcgaattgt cactcattct tcaaataaag attgtgagat cacgcgtttt ctgtacccta 61 ccctacaaat acgtaggaca cctgggtatg tagtgaacct taaagtttat aaatgatgtt 121 cttgtttgtc catcaattta accgggaaaa aaatttatct gtctaatatc attatctatt 181 ttcacacttt tagatcaaac tagattaaac aaatcatcat gtgtgacgac // LOCUS SUSCYIIBA 1972 bp ds-DNA INV 01-AUG-1990 DEFINITION S.purpuratus cytoskeletal actin CyIIb gene, complete cds. ACCESSION M35323 KEYWORDS cytoskeletal actin CyIIb. SOURCE S.purpuratus DNA, clone pSpG11A. ORGANISM Strongylocentrotus purpuratus Eukaryota; Animalia; Eumetazoa; Echinodermata; Echinozoa; Echinoidea; Echinacea; Echinoida; Strongylocentrotidae. REFERENCE 1 (bases 1 to 1972) AUTHORS Durica,D.S., Garza,D., Restrepo,M.A. and Hryniewicz,M.M. TITLE DNA sequence analysis and structural relationships among the cytoskeletal actin genes of the sea urchin Strongylocentrotus purpuratus JOURNAL J. Mol. Evol. 28, 72-86 (1988) STANDARD simple staff_entry FEATURES from to/span description pept 251 616 cytoskeletal actin CyIIb, exon 1 841 1087 cytoskeletal actin CyIIb, exon 2 1312 1829 cytoskeletal actin CyIIb, exon 2 pre-msg 1 1972 CyIIb mRNA and introns IVS 617 840 CyIIb intron A IVS 1088 1311 CyIIb intron B BASE COUNT 529 a 492 c 408 g 543 t ORIGIN 1 tcggcagttc aagaccacgt gtgtttcccg gattggtaaa ctccttatca cgaactcctt 61 atcagtaaaa cttacgagct ttgtacactt ttaatgactt ttcgattatt ctttcaagag 121 attttccctg ccacaaaatt acttagttct tttatttctc attcctgtgc aattccaatt 181 actagcattt tatttatgat ccatttttgt gtttttattt tagagtaaat aaaacgagaa 241 atcaatcatc atgtgtgacg acgatgttgc cgctcttgtc atcgacaacg gatccggtat 301 ggtgaaggcc ggattcgccg gagacgatgc cccaagggct gtcttcccat ccatcgttgg 361 cagaccccgt caccagggtg tcatggtcgg catgggacag aaggacagct acgtcggaga 421 cgaggcccag agcaagagag gtatcctcac cctgaagtac cccatcgagc acggtatcgt 481 caccaactgg gacgatatgg agaagatctg gcatcacacc ttctacaacg aactccgtgt 541 tgccccggag gagcaccccg tcctccttac cgaggctccc ctcaacccca aggccaacag 601 ggaaaagatg acacaggtta gaaaaagcaa tatgcctatt attgaagtaa tcaaattctc 661 aaaacaaata cattctcaca tttaaacatc ttaatttaag ctgtttatta atattaatat 721 caagtgagtt tcgttgttga aataacagcg attgactaaa atgaacttgt atcaaacttg 781 ttgtgattag tgaaatgaaa tcggtgatta acaattgttt tgttttcatg tcttctgcag 841 atcatgttcg agaccttcaa ctcacccgcc atgtacgtcg ctatccaggc cgtgctttcc 901 ctctacgcct ctggtcgtac cactggtatc gttttcgact ctggtgatgg tgtttcacac 961 acagtgccca tctacgaggg ttatgccctt ccccacgcca tcctccgtct ggacttggct 1021 ggacgtgatc tcacagacta cctgatgaag atccttaccg agcgtggcta ctctttcacc 1081 accaccggta agatatcttt tttttacaat caaagagtga gtgaagctat cacctgcatc 1141 ctgtgcttaa agaatattaa aaaaagagga gggaagatat tatatatgat taatgttcat 1201 tttctttgga ctttgacaat aacattttgg ggggatagaa agtgaatgtt gcttttcgtt 1261 atacattcgt aactaactaa tttcatcttg tttttttttt ctatcttgca gctgagcgtg 1321 aaatcgttcg tgacatcaag gagaagctct gctacgttgc tcttgacttt gagcaagaga 1381 tgcagactgc tgcctcatcc tcctccctcg agaagagcta cgagcttccc gacggacagg 1441 tcatcaccat tggcaacgag cgattccgtg ccccagaggc cctcttccag ccagccttcc 1501 ttggaatgga atccgctgga atccacgaga cctgctacaa cagcatcatg aagtgcgatg 1561 ttgacatccg taaggatctg tacgccaaca ctgttctgtc tggaggctcc accatgttcc 1621 caggaatcgc cgacaggatg cagaaggaga tcaccgccct tgccccacca accatgaaga 1681 tcaagatcat tgctcctcca gaaaggaaat actccgtatg gatcggaggc tccatccttg 1741 cctctctctc caccttccaa cagatgtgga tcagcaagca ggaatacgat gagtccggcc 1801 catccatcgt ccacaggaag tgcttctaaa caactcgctt ttggtgaaca aactcttgaa 1861 catcaatatc aaggaaacga ccatgatctc aaattgcaaa gtttaagtat gacaccattg 1921 cgggcaatgc agccgaaaaa ctcgcgcttt ctcaaaactt ggaggactgc ag // LOCUS SUSCYIIIBA 2918 bp ds-DNA INV 01-AUG-1990 DEFINITION S.purpuratus cytoskeletal actin CyIIIb gene, complete cds. ACCESSION M35324 KEYWORDS cytoskeletal actin CyIIIb. SOURCE S.purpuratus DNA, clone pSpG11A. ORGANISM Strongylocentrotus purpuratus Eukaryota; Animalia; Eumetazoa; Echinodermata; Echinozoa; Echinoidea; Echinacea; Echinoida; Strongylocentrotidae. REFERENCE 1 (bases 1 to 2918) AUTHORS Durica,D.S., Garza,D., Restrepo,M.A. and Hryniewicz,M.M. TITLE DNA sequence analysis and structural relationships among the cytoskeletal actin genes of the sea urchin Strongylocentrotus purpuratus JOURNAL J. Mol. Evol. 28, 72-86 (1988) STANDARD simple staff_entry FEATURES from to/span description pept 371 736 cytoskeletal actin CyIIIb, exon 1 1634 1880 cytoskeletal actin CyIIIb, exon 2 2247 2764 cytoskeletal actin CyIIIb, exon 2 pre-msg 1 1972 CyIIIb mRNA and introns IVS 737 1633 CyIIIb intron A IVS 1881 2246 CyIIIb intron B BASE COUNT 850 a 668 c 559 g 841 t ORIGIN 1 acggttcggg catttaggga tagctttgat tttaagaatg ttaaaatgag aatgtcaaat 61 agcctaacgc tggtctgtgc cagtaaacat gaatcaattc caaatgttga tatattaata 121 gtcggggagt tcaaatagga caacatgttt cacggggata gaattatcag acataattat 181 aatcccactt tgtcgtgaat tttgttggtt gtatgaaagt tttttagacc gtttgaaagg 241 aaaacagacc tatgccaaat ccaccaccac gaattaacta gtctgcaaac aaagaaacta 301 aaattaatat ttctctgggt atgtttttct catattcagg acaggaaaac gaaattcaat 361 catcatgtgt atgtgtgacg atgatgttgc cgctcttgtc gtcgacaacg ggtccggaat 421 ggtgaaggcc ggattcgccg gagacgatgc cccaagggct gtctttccat ccatcgttgg 481 caggccccgt caccagggtg tcatggttgg tatgggacaa aaggacagct acgttggaga 541 cgaagcacag agcaagagag gtatcctcac cctgaagtac cctattgagc acggtatcgt 601 caccaactgg gacgatatgg agaagatctg gcatcacacc ttctacaacg agctccgtgt 661 tgccccagag gagcaccccg tccttctgac agaggccccc ctcaacccta aggccaacag 721 ggaaaagatg acacaggtaa ggatatagtg cggaattgca aaacattcct taaagatact 781 atgtctcttt tgcacccaac atcagattct gtagaacttt gcaggaacta taattatgac 841 ttgtcatgta tgtcctatct atgaaatcta aacattagca atgtcgtatt attcgaatta 901 tgcaaggaaa cccgtttatc ttctagactt cactgtcaga cttactgaca tctatttttc 961 tttattgtaa taacatacat acatttagct ttaacaggta catgagcatt tgtctacatc 1021 aataacccac tatttgtgac ggccaaaatt aaactgattg aatatttgta cagcacaaaa 1081 cgtacgacca atcggtgaaa gggtgtgaaa atgaaactat tacttaggtg atcgcaatta 1141 cttaactcga ttcgataact aatggtaaca tgtagttatt ttcccactaa aagccctttt 1201 taatcctttc gtttcgaagg aacttctaac ttagtttttt tccttcaaat gcagttggaa 1261 tttaatcttt tcattgttgg cctgcaaatg ggacatacag tagtaccttt aactgcattt 1321 tggcaggaat gaaatgaaca acggctacag atagcccacg tcaccaatag cctacataag 1381 cgaagaaaac tagtcggata cccccacacg accgacatat cgctctccct gaccaatcta 1441 aaatatcgtt tttctttttt aaagtccata aaatgctatg aaaacctttt cgtttcttta 1501 ctgcagtgaa aataaaagct gatacggact acgagtacaa aatcgcgaac attcagataa 1561 aaaagttgaa tttgcccagt ttataatccc tagagtttat tcttaattca aaaaaatatt 1621 cttcttttgt tagatcatgt ttgagacctt caactcgccc gccatgtacg tcgccatcca 1681 ggccgtgctt tccctctacg cctctggtcg taccactggt atcgttttcg actctggcga 1741 cggtgtttca cacactgtac caatctatga gggttacgcc ctcccccacg ccatcatccg 1801 tctggacttg gctggacgtg atcttaccga ttacctgatg aagatcctta ccgagcgtgg 1861 ctactctttc accaccactg gtaagacatg atatggataa tagcaatagc taatgatgat 1921 aattaaaata gggataattg ataatattag aatactaatg taaacagatg aatgtcttac 1981 caaagggcag tctgtctcgg gttttgaatt caaaaacctc acatctcgtt atctttaagc 2041 cgcagaccac aacacctgca tgttcatttt tttttttact gcttgttcaa atccttttga 2101 caaagcgaat atctgattag atcgataata attaataaca aataccctct aagtcccgga 2161 gtttcaacac atttccattg ttatcttcac attttacaat ttgtctgcaa ttgatatgtg 2221 actgcatcca ttattatctc ttacagctga gcgtgaaatc gtccgtgaca taaaggagaa 2281 gctctgctac gtagctcttg attttgagga ggagatgcaa actgctgcct catcctcctc 2341 cctcgagaag agctacgagc ttcccgacgg acaggtcatc accatcggca acgagcgatt 2401 tcgttgctca gagaccctct tacagccctc tttcattgga atggaatctg ctggaatcca 2461 tgagacctgt tataacagca tcatgaagtg cgatgttgac atccgtaagg atctatacgc 2521 caacaccgtt ctctccggag cttccaccat gttcccagga atcgctgaca ggatgcagaa 2581 agagattgtc gcccttgccc caccaaccat gaagatcaag atcatcgctc ctcctgagag 2641 gaaatactct gtatggatcg gaggctccat tcttgcctct ctctccacct tccaacagat 2701 gtggatcagc aagcaggaat acgatgagtc tggtccatcc atcgtccaca ggaagtgctt 2761 ctaaacaacc ttccaacaga tttggatcag caagcaggaa tacaatgagt ccggtccatc 2821 catcgtccaa gggaagtgct tctaaacaac ttgattttct tctacttcta atgagcaacc 2881 tgattttttt aattctgttt cactccatgt tgccacct // LOCUS WHTIVSS 310 bp ds-DNA PLN 01-AUG-1990 DEFINITION Wheat amylase gene, exons 2 and 3 (partial). ACCESSION M26770 KEYWORDS . SOURCE Wheat DNA, clone pSP64Amyi. ORGANISM Triticum aestivum Eukaryota; Plantae; Embryobionta; Magnoliophyta; Liliopsida; Commelinidae; Cyperales; Poaceae. REFERENCE 1 (bases 1 to 310) AUTHORS Brown,J.W.S., Feix,G. and Frendewey,D. TITLE Accurate in vitro splicing of two pre-mRNA plant introns in a HeLa cell nuclear extract JOURNAL EMBO J. 5, 2749-2758 (1986) STANDARD simple staff_entry FEATURES from to/span description pept < 1 109 amylase, exon 2 (AA at 1) 214 > 310 amylase, exon 3 pre-msg < 1 > 310 amylase mRNA and intron IVS 110 213 amylase intron 2 BASE COUNT 79 a 91 c 76 g 64 t ORIGIN 1 gaatacaagc ttgggctgca ggtcgacgca gaggctgtgg ccattcccct cggacaaggt 61 catgcagggc tacgcctaca tcctcacaca cccgggcata ccatgcatcg taagtagtag 121 cacactacac aacctcacca taacatttcg catcaaacgt accccacgat gtttgtgatc 181 tgaacttaca actacttggt tttgcgcgcg cagttctacg accatgtgtt cgactggaaa 241 ctgaagcagg agatcaccgc actggctacg gtcaggtcaa ggaacgggat ccccgggcga 301 gctcgaattc // LOCUS YSCMTARSA 384 bp ds-DNA ORG 01-AUG-1990 DEFINITION Yeast (S.cerevisiae) mitochondrial autonomously replicating sequence DNA. ACCESSION M35612 KEYWORDS . SOURCE S.cerevisiae (strain 992) mitochondrial DNA, clone pYmit1021. ORGANISM Mitochondrion Saccharomyces cerevisiae Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes; Endomycetales; Saccharomycetaceae; Saccharomyces cerevisiae. REFERENCE 1 (bases 1 to 384) AUTHORS Mabuchi,T., Nishikawa,S. and Wakabayashi,K. TITLE The nucleotide sequence of mitochondrial ARS in Saccharomyces cerevisiae JOURNAL J. Gen. Appl. Microbiol. 30, 469-478 (1984) STANDARD simple staff_entry FEATURES from to/span description site 46 56 consensus autonomously replicating sequence site 126 136 consensus autonomously replicating sequence site 245 255 consensus autonomously replicating sequence site 290 300 consensus autonomously replicating sequence site 148 156 ori/rep GC cluster A site 187 194 ori/rep GC cluster A BASE COUNT 126 a 29 c 34 g 195 t ORIGIN 1 ccgccgcggg cggacgccgg aggagaatta tatttttata taataattta tatttctata 61 tatatatata tatattatat ataaatatta ttatatatat ttttatatat attataatta 121 tattcattaa tattttatta tagtggtggg ggtcccaatt attattttca ataataattt 181 atcatgggac ccggatatct tcttgttttt atttattatt ttttttaatt tattttaatt 241 atttatttat aatttatatt atacaattta ttatttcgtt aataccttta tttatattat 301 ataatatatt atattattat aatatattta ttgattatat taatacattt aactaatgtg 361 tgctctatat ttattgaata gttt // LOCUS YSCMTARSB 218 bp ds-DNA ORG 01-AUG-1990 DEFINITION Yeast (S.cerevisiae) mitochondrial Ser-tRNA, 3' end in and autonomously replicating sequence. ACCESSION M35613 KEYWORDS transfer RNA-Ser. SOURCE S.cerevisiae (strain 992) mitochondrial DNA, clone pYmit1S2SC-delta-11. ORGANISM Mitochondrion Saccharomyces cerevisiae Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes; Endomycetales; Saccharomycetaceae; Saccharomyces cerevisiae. REFERENCE 1 (bases 1 to 218) AUTHORS Mabuchi,T., Nishikawa,S. and Wakabayashi,K. TITLE The nucleotide sequence of mitochondrial ARS in Saccharomyces cerevisiae JOURNAL J. Gen. Appl. Microbiol. 30, 469-478 (1984) STANDARD simple staff_entry FEATURES from to/span description tRNA < 1 49 Ser-tRNA site 138 148 consensus autonomously replicating sequence BASE COUNT 99 a 25 c 11 g 83 t ORIGIN 1 ctatcattag tctttattgg ctacgtaggt tcaaatccta catcatccgt aataatacat 61 atatataata ataattttaa tattattcct ataaaaataa aataaataaa taaataataa 121 taattaatta attttaataa atataaaata tataaaataa taataataat aattattatt 181 ttaataatat tatttatata atagtccggc ccgccccc // LOCUS MUSMDRXX 2873 bp ds-DNA ROD 01-AUG-1990 DEFINITION Mouse P-glycoprotein (mdr1a) gene, exons 1 and 2. ACCESSION M33580 KEYWORDS P-glycoprotein. SOURCE Mouse (strain BALB/c/NIH) macrophage-like cell line J774.2-vinblastine resistant subline J7.V1-1 DNA, clone pV1.1a. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 2873) AUTHORS Hsu,S.I.-H., Cohen,D., Kirschner,L.S., Lothstein,L., Hartstein,M. and Horwitz,S.B. TITLE Structural analysis of the mouse mdr1a (P-glycoprotein) promoter reveals the basis for differential transcript heterogeneity in multidrug-resistant J774.2 cells JOURNAL Mol. Cell. Biol. 10, 3596-3606 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by L.S.Kirschner, 05-APR-1990. FEATURES from to/span description pept 2613 / 2677 P-glycoprotein, exon 2 (first expressed exon) pre-msg 1992 > 2873 P-glycoprotein mRNA and introns (alt.) pre-msg 1801 > 2873 P-glycoprotein mRNA and introns (alt.) IVS 2120 2606 P-glycoprotein intron A IVS 2678 > 2873 P -glycoprotein intron B signal 1904 1912 CAAT box signal 1956 1963 TATA box site 1880 1887 SP-1 site site 1921 1927 SP-1 site site 1937 1944 SP-1 site site 1869 1875 AP-1 site rpt 1 1300 L1Md repetitive element BASE COUNT 860 a 621 c 714 g 678 t ORIGIN Chromosome 5. 1 gaattctcac ctgaggaata ccgaatccag agaaacacct gaaaaaatgt tcaacatcct 61 taatcatcag ggaaatgcaa atcaaaacaa ccctgagatt ccacctcaca ccagtcagaa 121 tggctaagat caaaaattca ggtgacagca gatgctggcg aggatgtgga gaaagaggaa 181 cactcctcca ttgttggtgg gagtgcaggc ttgtacaacc actctggaaa tcagtctggc 241 ggttcctcag aaaactggac atagtactct cggaggatcc agcaatacct ctcctgggca 301 tatatccaga agatgcccca acaggtaaga aggacacatg ctccactatg ttcatagcag 361 ccttatttat aatagccaga agctggaaag aacctagatg cccctcaaca gaggaatgga 421 tacagaaaat gtggtacatc tacacaatgg agtactactc agctattaaa aagaatgaat 481 ttatgaaatt cctagccaaa tggatggacc tggggggcat catcctgagt gaggtaacac 541 attcacaaag aaactcacac aatatgtatt cactgataag tggatattag ccccaaacct 601 aggataccca agatataaga tataatttgc taaacacatg aaactcaagg agaatgaaga 661 ctgaagtgtg gacactatgc ccctccttag atttgggaac aaaacaccca tggaaggagt 721 tacagagacg gagtttggag ctgagatgaa aggatggacc atgtagagac tgccatagcc 781 agggatccac cccataatca gcatccaaac gctgacacca ttgcatacac tagcaagatt 841 ttattgaaag gacgcagatg tagctgtctc ttgtgagact atgccggggc cagcaaacac 901 agaagtggat gctcacagtc agctaatgga tggatcatag ggctcccaat ggaggagcta 961 gagaaagtag ccaaggagct aaagggatct gcaaccctat aggtggaaca acattatgag 1021 ctaaccagta ccccggagct cttgactcta gctgcatata tatcaaaaga tggcctagtc 1081 ggccatcact ggaaagagag gcccattgga cttgcaaact ttatatgccc cagtacaggg 1141 gaataccagg gccaaaaagg gggagtgggt gggcagggga gtgggggtgg gtggatatgg 1201 gggacttttg gtatagcatt ggaaatgtaa atgagttaaa tacctaataa aaaatggaaa 1261 aaaaaataaa ataaaaataa gatgaaactg gaaaaaaaaa gttatgttta ataattccaa 1321 ttgaactgta agaatttcag atgccctgga aaaacatgga cattggttta gtacctaaaa 1381 gttcaaaata ttatatattt ttaaatacca ttttacactg aaatactcca tttatatact 1441 ggggactgtc ctctttctgg tttgctttgt tttgtttaat aaaagaaata aaccaatcta 1501 cctgaggaac tgtgaactat attgaagaaa agcctgcacg ggggttctct taccttttca 1561 agagtgcttc aaagaaggga aatttactga caggcaaggt ctgtacccat tgtttaattg 1621 tctgttagat gttatgcata gaatacgtct tttaacttag ccaaatgcag aaggccaagt 1681 gcactatcta caaacacata actctatata tagacatgtg catggccgtg tagagatgag 1741 actctgcaag tgtgtctcta atgattcggg ggatatgagt ttgtctaatt gacctttgag 1801 agggaaacca gactgcacat ttcatctaca aatccaacct gtttcgcaat ttctccagca 1861 ataatacttg agtcaagctg ggccgggagc tggttaacct ccaggtcaaa ctcactggct 1921 gggcgggact gcgcctgggc gtagattgag catgctaaat ttactctcct gtccacagaa 1981 agcccaggca cagtggaaca gcggtttcca ggagctgctg gtcccatctt ccaaggctct 2041 gctcaactca gagccgcttc ttccaaagtc tacatcttgg tggactttgc agaggaaacc 2101 gggagtagag acacgtgagg taagcatttc ctaggaaggg tcgggtgttc cggataccag 2161 agcctggtcc gggtgtcagc gtaatcgtga gtctgtgggg accaagtggc gacacaagag 2221 tcgctccagg agcacccgca gcatcagctt tcaggacggt gttttccgcg ccaccctgtg 2281 ctgtggatct cgctgcccag ctcgcagcca ggggtggtgg aggagcgcgc cagggcgagg 2341 ggacccagca ggcgggtggc ggacctagag ccgagcaccc ggtccacgca ggtgacacag 2401 cttcccggga ttccccagtg agttacctcc aggccctctc cggcagcatc agggcggggc 2461 tcctcctcac cactgggctc tgcggggcag tgagctttgc ataaactctg gtcccgtgtt 2521 tggctaatga actgtggttt ctccccaggt cgtgatggaa cttgaagagg accttaaggg 2581 aagagcagac aagaacttct ccccaggtcg tgatggaact tgaagaggac cttaagggaa 2641 gagcagacaa gaacttctca aagatgggca aaaagaggta gccagattgt ttcactttcg 2701 tactttactt gtcttgtaca ttcgggcaat tagtttgtag cctccagcac tgtacttgat 2761 tagtgggtgt tatttcagac ttcagaaatg taaaccagcc cttggaagga actcctcgct 2821 tggagcagtc cttcaaatgt gtgtgacaga tcaatcaatg attctgtgaa ttc // LOCUS MUSMDR1A 4924 bp ss-mRNA ROD 01-AUG-1990 DEFINITION Mouse P-glycoprotein (mdr1a) mRNA, complete cds. ACCESSION M33581 KEYWORDS P-glycoprotein. SOURCE Mouse (strain BALB/c/NIH) macrophage-like cell line J774.2-vinblastine resistant subline J7.V1-1, cDNA to mRNA, library pUC18-cDNA and pGEM-zf, clones pV1.PRC2, pV1.3, pV1.20, and pV1.10. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 4924) AUTHORS Hsu,S.I.-H., Cohen,D., Kirschner,L.S., Lothstein,L., Hartstein,M. and Horwitz,S.B. TITLE Structural analysis of the mouse mdr1a (P-glycoprotein) promoter reveals the basis for differential transcript heterogeneity in multidrug-resistant J774.2 cells JOURNAL Mol. Cell. Biol. 10, 3596-3606 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by L.S.Kirschner, 05-APR-1990. Albert Einstein College of Medicine, 1300 Morris Park Ave, Bronx, NY 10461 FEATURES from to/span description pept 137 3967 P-glycoprotein (mdr1a) mRNA < 1 4924 P-glycoprotein mRNA signal 4315 4320 poly-A signal signal 4898 4903 poly-A signal BASE COUNT 1450 a 1021 c 1210 g 1243 t ORIGIN Chromosome 5. 1 acagtggaac agcggtttcc aggagctgct ggtcccatct tccaaggctc tgctcaactc 61 agagccgctt cttccaaagt ctacatcttg gtggactttg cagaggaaac cgggagtaga 121 gacacgtgag gccgtgatgg aacttgaaga ggaccttaag ggaagagcag acaagaactt 181 ctcaaagatg ggcaaaaaga gtaaaaagga gaagaaagaa aagaaaccag cagtcagtgt 241 gcttacaatg tttcgttatg caggttggct agacaggttg tacatgctgg tgggaactct 301 ggctgctatt atccatggag tggcgctccc acttatgatg ctgatctttg gtgacatgac 361 agatagcttt gcaagtgtag gaaacgtctc taaaaacagt actaatatga gtgaggccga 421 taaaagagcc atgtttgcca aactggagga agaaatgacc acgtacgcct actattacac 481 cgggattggt gctggtgtgc tcatagttgc ctacatccag gtttcatttt ggtgcctggc 541 agctggaaga cagatacaca agatcaggca gaagtttttt catgctataa tgaatcagga 601 gataggctgg tttgatgtgc atgacgttgg ggagctcaac acccggctca cagatgatgt 661 ttccaaaatt aatgaaggaa ttggtgacaa aatcggaatg ttcttccagg caatggcaac 721 attttttggt ggttttataa taggatttac ccgtggctgg aagctaaccc ttgtgatttt 781 ggccatcagc cctgttcttg gactgtcagc tggtatttgg gcaaagatat tgtcttcatt 841 tactgataag gaactccatg cttatgcaaa agctggagca gttgctgaag aagtcttagc 901 agccatcaga actgtgattg cgtttggagg acaaaagaag gaacttgaaa ggtacaataa 961 caacttggaa gaagctaaaa ggctggggat aaagaaagct atcacggcca acatctccat 1021 gggtgcagct tttctcctta tctatgcatc atatgctctg gcattctggt atgggacttc 1081 cttggtcatc tccaaagaat actctattgg acaagtgctc actgtcttct tttccgtgtt 1141 aattggagca ttcagtgttg gacaggcatc tccaaatatt gaagccttcg ccaatgcacg 1201 aggagcagct tatgaagtct tcaaaataat tgataataag cccagtatag acagcttctc 1261 aaagagtggg cacaaaccag acaacataca aggaaatctg gaatttaaga atattcactt 1321 cagttaccca tctcgaaaag aagttcagat cttgaagggc ctcaatctga aggtgaagag 1381 cggacagacg gtggccctgg ttggcaacag tggctgtgga aaaagcacaa ctgtccagct 1441 gatgcaaagg ctctacgacc ccctagatgg catggtcagt atcgacggac aggacatcag 1501 aaccatcaat gtgaggtatc tgagggagat cattggtgtg gtgagtcagg aacctgtgct 1561 gtttgccacc acgatcgccg agaacattcg ctatggccga gaagatgtca ccatggatga 1621 gattgagaaa gctgtcaagg aagccaatgc ctatgacttc atcatgaaac tgccccacca 1681 atttgacacc ctggttggtg agagaggggc gcacgtgagt gggggacaga aacagagaat 1741 cgccattgcc cgggccctgg tccgcaatcc caagatcctt ttgttggacg aggccacctc 1801 agccctggat acagaaagtg aagctgtggt tcaggccgca ctggataagg ctagagaagg 1861 ccggaccacc attgtgatag ctcatcgctt gtctaccgtt cgtaatgctg acgtcattgc 1921 tggttttgat ggtggtgtca ttgtggagca aggaaatcat gatgagctca tgagagaaaa 1981 gggcatttac ttcaaacttg tcatgacaca gacagcagga aatgaaattg aattaggaaa 2041 tgaagcttgt aaatctaagg atgaaattga taatttagac atgtcttcaa aagattcagg 2101 atccagtcta ataagaagaa gatcaactcg caaaagcatc tgtggaccac atgaccaaga 2161 caggaagctt agtaccaaag aggccctgga tgaagatgta cctccagctt ccttttggcg 2221 gatcctgaag ttgaattcaa ctgaatggcc ttattttgtg gttggtatat tctgtgccat 2281 aataaatgga ggcttacagc cagcattctc cgtaatattt tcaaaagttg taggggtttt 2341 tacaaatggt ggcccccctg aaacccagcg gcagaacagc aacttgtttt ccttgttgtt 2401 tctgatcctt gggatcattt ctttcattac attttttctt cagggcttca catttggcaa 2461 agctggagag atcctcacca agcgactccg atacatggtt ttcaaatcca tgctgagaca 2521 ggatgtgagc tggtttgatg accctaaaaa caccaccgga gcactgacca ccaggctcgc 2581 caacgatgct gctcaagtga aaggggctac agggtctagg cttgctgtga ttttccagaa 2641 catagcaaat cttgggacag gaatcatcat atccctaatc tatggctggc aactaacact 2701 tttactctta gcaattgtac ccatcattgc gatagctgga gtggttgaaa tgaaaatgtt 2761 gtctggacaa gcactgaaag ataagaagga actagaaggt tctggaaaga ttgctacgga 2821 agcaattgaa aacttccgca ctgttgtctc tttgactcgg gagcagaagt ttgaaaccat 2881 gtatgcccag agcttgcaga taccatacag aaatgcgatg aagaaagcac acgtgtttgg 2941 gatcacgttc tccttcaccc aggccatgat gtatttttct tatgctgctt gtttccggtt 3001 cggtgcctac ttggtgacac aacaactcat gacttttgaa aatgttctgt tagtattctc 3061 agctattgtc tttggtgcca tggcagtggg gcaggtcagt tcattcgctc ctgactatgc 3121 gaaagcaaca gtgtcagcat cccacatcat caggatcatt gagaaaaccc ccgagattga 3181 cagctacagc acgcaaggcc taaagccgaa tatgttggaa ggaaatgtgc aatttagtgg 3241 agtcgtgttc aactatccca cccgacccag catcccagtg cttcaggggc tgagccttga 3301 ggtgaagaag ggccagacgc tggccctggt gggcagcagt ggctgcggga agagcacagt 3361 ggtccagctg ctcgagcgct tctacgaccc catggctgga tcagtgtttc tagatggcaa 3421 agaaataaag caactgaatg tccagtggct ccgagcacag ctgggcattg tgtcccaaga 3481 gcccattctc tttgactgca gcatcgcaga gaacattgcc tacggagaca acagccgggt 3541 cgtgtcttat gaggagattg tgagggcagc caaggaggcc aacatccacc agttcatcga 3601 ctcgctacct gataaataca acaccagagt aggagacaaa ggcactcagc tgtcgggtgg 3661 gcagaagcag cgcatcgcca tcgcacgcgc cctcgtcaga cagcctcaca ttttacttct 3721 ggacgaagca acatcagctc tggatacaga aagtgaaaag gttgtccagg aagcgctgga 3781 caaagccagg gaaggccgca cctgcattgt gatcgctcac cgcctgtcca ccatccagaa 3841 cgcggacttg atcgtggtga ttcagaacgg caaggtcaag gagcacggca cccaccagca 3901 gctgctggcg cagaagggca tctacttctc aatggtcagt gtgcaggctg gagcaaagcg 3961 ctcatgaact gtgaccatgt aagatgttaa gtatttttat tgtttgtatt catatatggt 4021 gtttaatcca agtcaaaagg aaaacactta ctaaaatagc cagttatcta ttttctgcca 4081 cagtggaaag catttagttt ggtttagagt cttcagaggc tttgtaatta aaaaaacaaa 4141 aatagataca gcatcaaatg gagattaatg ctttaaaatg cactataaaa tttataaaag 4201 ggttaaaagt gaatgtttga taatatatac ttttatttat actttctcat ttgtaactat 4261 aactgatttc tgcttaacaa attatgtatg tatcaaaaat tactgaaatg tttgtataaa 4321 gtatatatag tgaaactgag cattcatatt tttgagttat tttgctcaaa tgcatgcgaa 4381 attatatatt gtcccaactg ggatattgta cataatttta gcctttaaaa aacagtccat 4441 tactgggggg agggggcatc actctatggg caaagtgtta ctcagacatg ggcacctgag 4501 ttcagatccc taccacctaa gtaagcagac aaggtgtggt gtttttgtaa tgccagtgct 4561 agaggcagaa aagacagatc ctgcaggctc agtggctggc caaacagcct agccaacata 4621 gcgcgttcca ggttcagtga gaaaacttgt ctcaaaaatc agagggaaaa gcaaatgagg 4681 tgtcagccat gtgcactcat gcaaatgcca tacatgcaga agtatgtgca cacacacgca 4741 cacattaacc aacgactagc aaggaaaatg aaggtggata agaggggtgg gactgggaca 4801 aaggagggta cctggatgaa tatgactgaa ggacgttatg tacacatatg aaaacgtcgt 4861 actgaaactc actacaatgt atacttaata tattgctaat aaaatatttt taaaagaaaa 4921 aaat // LOCUS RICCPCTA 2526 bp ds-DNA ORG 01-AUG-1990 DEFINITION Rice chloroplast beta and epsilon subunit (atpB and atpE) genes, complete cds. ACCESSION M31464 Y00323 KEYWORDS atpB protein; atpE protein. SOURCE Rice chloroplast DNA, clone Ct-3. ORGANISM Chloroplast Oryza sativa Eukaryota; Plantae; Embryobionta; Magnoliophyta; Liliopsida; Commelinidae; Cyperales; Poaceae; Oryza sativa. REFERENCE 1 (bases 1 to 2526) AUTHORS Moon,E., Kao,T.-h. and Wu,R. TITLE Sequence of the chloroplast-encoded atpB-atpE-trnM gene clusters from rice JOURNAL Nucleic Acids Res. 15, 4358-4359 (1987) STANDARD simple staff_review FEATURES from to/span description pept 398 1894 atpB protein pept 1891 2304 atpE protein BASE COUNT 770 a 459 c 563 g 734 t ORIGIN 1 cccccttttc ttattttgag tccaaatacc taaatactac gaaaattctc tgttgacagc 61 aatctatgct tcacagtagt atatattttg tatatcgaag tcctagataa gaaagtagag 121 taggcacaaa tcgtttacaa aaggcaaaat gtatatgaaa aaaagattga ttgaactttc 181 cgacgggctc attccatgag taaacgattg aatgggattc gtttgggcaa cgaaatcaag 241 tgctggtccc cttttctctc ttattgaatt aactaattca tttccttttg acttttggga 301 tttttggata tttttttggt gttgatttgg cattattcaa caagaaaaaa atcaaaattt 361 cgataaattc cttttttttg aaaattatgt gataattatg agaaccaatc ctactacttc 421 tcgtcccggg gtttctacaa ttgaagaaaa aagtacaggg cgtatcgatc aaattattgg 481 acccgtgctg gatgtcactt ttcccccggg caagttacct tatatttata atgctttggt 541 agtcaagagt cgagacactg agggtaagca aattaatgta acttgtgagg tacaacaatt 601 attaggaaat aatcgagtta gagctgtagc tatgagtgct acagatgggt tgatgagagg 661 aatggaagtg attgacacgg gagctcctct cagtgttcct gtcggtggag ctactcttgg 721 acgaattttc aacgttcttg gggagcctgt tgacaatttg ggtcctgtag atactagtgc 781 aacattccct attcatagat ccgcgcccgc ctttatcgag ttagatacga aattatccat 841 ctttgaaact ggtattaagg tggtcgatct tttagctcct tatcggcgtg gaggaaaaat 901 cggactattt gggggagctg gagtaggtaa aacagtactc atcatggaat taatcaacaa 961 tattgctaaa gctcacgggg gcgtatccgt atttggcgga gtaggggaac ggactcgtga 1021 aggaaatgat ctttatatgg aaatgaagga atctggagta attaatgaaa aaaatcttga 1081 ggaatcaaag gtagctctag tctatggcca aatgaatgaa ccgccaggag ctcgtatgag 1141 agttggtttg actgccctaa ctatggcaga atatttccga gatgttatta agcaagacgt 1201 gcttctattc atcgataata tctttcgttt tgttcaagca ggatcggagg tatctgcctt 1261 attagggaga atgccctctg cagtgggtta tcaacctact cttagtacag aaatgggttc 1321 tttgcaagaa agaattactt ctactaaaaa gggatctata acttcgatcc aagcggttta 1381 tgtacctgcg gacgatttga ccgaccctgc tcctgctaca acatttgcac atttggatgc 1441 tactaccgta ctttccagag gattagcttc caaagggatt tatcctgcag tagatccttt 1501 agattcaacc tcaactatgt tacaacctcg gatcgttggc aacgaacatt atgaaactgc 1561 gcaaagagtt aagcaaactt tacaacgtta caaagaactt caggacatta tcgcaattct 1621 tgggttggat gaattatcgg aggaggatcg tttaactgta gcaagagcac gaaaaattga 1681 gcgcttctta tcacaaccgt tttttgtggc agaagttttt accggttctc caggaaagta 1741 tgttggtctt gcagaaacta ttaggggatt tcaactaatc ctttccggag aattagacgg 1801 cctacccgaa caggcttttt atttggtggg taacatcgat gaagctagca cgaaagctat 1861 aaacttagaa gaggagaaca acttgaagaa atgaaattaa atctttatgt actgactcct 1921 aagcgaatta tttgggattg tgaagtgaaa gaaatcattt tatctactaa tagtggccaa 1981 attggcgtat taccaaacca cgcccccatt aacacagctg tagatatggg tcccttgaga 2041 atacgcctcc tcaacgatca atggttaacg gcggttctgt ggagcggttt tgccagaata 2101 gttaataatg agatcatcat tttaggaaat gatgcggaac tgggtagtga cattgatccg 2161 gaagaagctc aacaggcact tgaaatagcc gaagctaacg tgagtagagc tgagggtacg 2221 aaagaattgg ttgaagcgaa ggtagctctc agacgagcta ggatacgagt cgaggctgtt 2281 aattggattc ccccatctaa ttgaagacaa cccaacggtt tagttgatac aaagaaaaag 2341 ggaagagggg tagaaaaaat tattagatag cgaagcgaag tagggccaat gctatctagt 2401 aatttttcta cctacctacc tactattgga tttgaaccaa tgactcccgc cgtatgaaag 2461 caatactcta accactgagt taagtaggca atttatcacc acaaaggaag accctttact 2521 tcgatc // LOCUS RICCPCTB 2524 bp ds-DNA ORG 01-AUG-1990 DEFINITION Rice mitochondrial beta and epsilon subunit (atpB and atpE) pseudogenes, complete cds. ACCESSION M31465 Y00323 KEYWORDS pseudogene. SOURCE Rice chloroplast DNA, clone Ct-1. ORGANISM Chloroplast Oryza sativa Eukaryota; Plantae; Embryobionta; Magnoliophyta; Liliopsida; Commelinidae; Cyperales; Poaceae; Oryza sativa. REFERENCE 1 (bases 1 to 2524) AUTHORS Moon,E., Kao,T.-h. and Wu,R. TITLE Sequence of the chloroplast-encoded atpB-atpE-trnM gene clusters from rice JOURNAL Nucleic Acids Res. 15, 4358-4359 (1987) STANDARD simple staff_review FEATURES from to/span description pept.ps 398 1392 atpB pseudogene pept.ps 1389 2302 atpE pseudogene BASE COUNT 769 a 458 c 564 g 733 t ORIGIN 1 cccccttttc ttattttgag tccaaatacc taaatactac gaaaattctc tgttggcagc 61 aatctatgct tcacagtagt atatattttg tatatcgaag tcctagataa gaaagtagag 121 taggcacaaa tcgtttacaa aaggcaaaat gtatatgaaa aaaagattga ttgaactttc 181 cgacgggctc attccatgag taaacgattg aatgggattc gtttgggcaa cgaaatcaag 241 tgctggtccc cttttctctc ttattgaatt aactaattca tttccttttg acttttggga 301 tttttggata tttttttggt gttgatttgg cattattcaa caagaaaaaa atcaaaattt 361 cgataaattc cttttttttg aaaattatgt gataattatg agaaccaatc ctactacttc 421 tcgtcccggg gtttctacaa ttgaagaaaa aagtacaggg cgtatcgatc aaattattgg 481 acccgtgctg gatgtcactt ttcccccggg caagttacct tatatttata atgctttggt 541 agtcaagagt cgagacactg agggtaagca aattaatgta acttgtgagg tacaacaatt 601 attaggaaat aatcgagtta gagctgtagc tatgagtgct acagatgggt tgatgagagg 661 aatggaagtg attgacacgg gagctcctct cagtgttcct gtcggtggag ctactcttgg 721 acgaattttc aacgttcttg gggagcctgt tgacaatttg ggtcctgtag atactagtgc 781 aacattccct attcatagat ccgcgcccgc ctttatcgag ttagatacga aattatccat 841 ctttgaaact ggtattaagg tggtcgatct tttagctcct tatcggcgtg gaggaaaaat 901 cggactattt gggggagctg gagtaggtaa aacagtactc atcatggaat taatcaacaa 961 tattgctaaa gctcacgggg gcgtatccgt atttggcgga gtaggggaac ggactcgtga 1021 aggaaatgat ctttatatgg aaatgaagga atctggagta attaatgaaa aaaatcttga 1081 ggaatcaaag gtagctctag tctatggcca aatgaatgaa ccgccaggag ctcgtatgag 1141 agttggtttg actgccctaa ctatggcaga atatttccga gatgttatta agcaagacgt 1201 gctctattca tcgataatat ctttcgtttt gttcaagcag gatcggaggt atctgcctta 1261 ttagggagaa tgccctctgc agtgggttat caacctactc ttagtacaga aatgggttct 1321 ttgcaagaaa gaattacttc tactaaaaag ggatctataa cttcgatcca agcggtttat 1381 gtacctgcgg acgatttgac cgaccctgct cctgctacaa catttgcaca tttggatgct 1441 actaccgtac tttccagagg attagcttcc aaagggattt atctgcagta gatcctttag 1501 attcaacctc aactatgtta caacctcgga tcgttggcaa cgaacattat gaaactgcgc 1561 aaagagttaa gcaaacttta caacgttaca aagaacttca ggacattatc gcaattcttg 1621 ggttggatga attatcggag gaggatcgtt taactgtagc aagagcacga aaaattgagc 1681 gcttcttatc acaaccgttt tttgtggcag aagtttttac cggttctcca ggaaagtatg 1741 ttggtcttgc agaaactatt aggggatttc aactaatcct ttccggagaa ttagacggcc 1801 tacccgaaca ggctttttat ttggtgggta acatcgatga agctagcacg aaagctataa 1861 acttagaaga ggagaacaac ttgaagaaat gaaattaaat ctttatgtac tgactcctaa 1921 gcgaattatt tgggattgtg aagtgaaaga aatcatttta tctactaata gtggccaaat 1981 tggcgtatta ccaaaccacg cccccattaa cacagctgta gatatgggtc ccttgagaat 2041 acgcctcctc aacgatcaat ggttaacggc ggttctgtgg agcggttttg ccagaatagt 2101 taataatgag atcatcattt taggaaatga tgcggaactg ggtagtgaca ttgatccgga 2161 agaagctcaa caggcacttg aaatagccga agctaacgtg agtagagctg agggtacgaa 2221 agaattggtt gaagcgaagg tagctctcag acgagctagg atacgagtcg aggctgttaa 2281 ttggattccc ccatctaatt gaagacaacc caacggttta gttgatacaa agaaaaaggg 2341 aagaggggta gaaaaaatta ttagatagcg aagcgaagta gggccaatgc tatctagtaa 2401 tttttctacc tacctaccta ctattggatt tgaaccaatg actcccgccg tatgaaagca 2461 atactctaac cactgagtta agtaggcaat ttatcaccac aaaggaagac cctttacttc 2521 gatc // LOCUS RICMTBEA 2281 bp ds-DNA ORG 01-AUG-1990 DEFINITION Rice mitochondrial beta and epsilon subunit (atpB and atpE) pseudogene, complete cds. ACCESSION M31466 Y00323 KEYWORDS pseudogene. SOURCE Rice mitochondrion DNA, clone Mt-0. ORGANISM Mitochondrion Oryza sativa Eukaryota; Plantae; Embryobionta; Magnoliophyta; Liliopsida; Commelinidae; Cyperales; Poaceae; Oryza sativa. REFERENCE 1 (bases 1 to 2281) AUTHORS Moon,E., Kao,T.-h. and Wu,R. TITLE Sequence of the chloroplast-encoded atpB-atpE-trnM gene clusters from rice JOURNAL Nucleic Acids Res. 15, 4358-4359 (1987) STANDARD simple staff_review FEATURES from to/span description pept.ps 384 1657 atpB pseudogene pept.ps 1654 2067 atpE pseudogene BASE COUNT 710 a 416 c 488 g 667 t ORIGIN 1 cccctttctt attttgagtc caaataccta aatactatga aaattctctg ttgacagcaa 61 tctatgcttc acagtagtat atattttgta tatcgaagtc ctagataaga aatggagtag 121 gcacagatcc ttcacaaaag gcgaaatgta tatgaaaaaa agattgattg aactttccga 181 cggactcatg gaatgagtaa acgattgaat gggattcgtt tgggcaacga aatcaagtgc 241 tggtcccctt ttctctctta ttgaattaac taattcattt ccttttgact tttgttggat 301 ttttggatat ttttttggtg ttgatttggc attattcaac aagataaaaa gaaaaatttc 361 tataaattcc ttttttttta attatgagaa ccaatcctac tacttctcat cccggggttt 421 ctacaattga agaaaaaagt acagggcgta tcgatcaaat tattggaccc gtgctggatg 481 ccacttttcc cccgggcaag ttaccttata tttataacgc tttggtagtc gagacactga 541 gggtaagcaa attaatgtga cttgtgaggt acaacaatta ttaggaaata atcgagttag 601 aacgaaatta tccatctttg aaactggtat taaggtggtc gatcttttag ctccttatcg 661 gcgtggagga aaaatcggac tatttggggg aactggagta ggtaaaacag tactcatcat 721 ggaattaatc aacaatattg ctaaagctca tagaggcgta tccgtatttg gcggagtagg 781 ggaacggact cgtgaaggaa atgatcttta tatggaaata aaggagtaat taatgaaaaa 841 aatccttgag gaatcaaagg tagctctagt ctatggccaa atgaatgaac gccaggagct 901 cgtatgagag ttggtttgac tgccctaact atggcagaat atttccgaga tgttattaag 961 caagacgtgc ttctattcat cgataatatc tttcgttttg ttcaagcagg atcgggggta 1021 tttgccttat tagggagaat gccctctgca gtgggttatc aacctactct tagtacagaa 1081 atgggttctt tgcaagaaag aattacttct actaaaaagg gatctataac ttcgatccaa 1141 gcggtttatg tacctgcgga cgatttgacc gaccctgctc ctgccacaac atttgcacat 1201 ttggatgcta ctaccgtact ttccagagga ttagcttcca agggtattta tcctagatcc 1261 tttagattca acctcaacta tgttacaacc tcggatcgtt ggcaacgaac attatgaaac 1321 tgcgcaaaga gttaagcaaa ctttacaacg ttacaaagaa cttcaggaca ttatcgcaat 1381 tcttgggttg gatgaattat cggaggagga tcgtttaact gtagcaagag cacgaaaaat 1441 tgagcgcttc ctatcacaac cgttctttgt ggcagaagtt tttaccggtt ctccaggaaa 1501 gtatgttggt cttgcagaaa caattcgggg atttcaacta atcctttccg gagaattaga 1561 cggcctaccc gaacaggctt tttatttggt gggtaacatc gatgaagcta gcacgaaagc 1621 tataaactta gaagaggaaa acaacttgaa gaaatgaaat taaatcttta tgtactgact 1681 cctaagcgaa ttatttggga ttgtgaagtg aaagaaatca ttttttctac taatagtggc 1741 caaattggcg tattaccaaa ccacgccccc attaacacag ctgtagatat gggtcccttg 1801 agaatacgcc tcctcaacga tcaatggtta acggcggttc tgtggagcgg ttttgccaga 1861 atagttaata atgagatcat cattttagga aatgatgcgg aactgggtag tgacattgat 1921 ccggaagaag ctcaacaggc acttgaaata gccgaagcta acgtgagtag agctgagggt 1981 acgaaagaat tggttgaagc gaacgtagct ctcagacgag ctgggatacg agtcgaggct 2041 gttaattgga ttcccccatc taattgaaga caatccaacg gtttagttga tacaaagaaa 2101 aagggtctaa aaagttatta gatagcgaag cgaagtaagt ccaatgctat ctagtaattt 2161 ttctacctac ctacctacta ttggatttga accaatgact cccgccgtat gaaagcaata 2221 ctctaaccac tgagttaagt aggcaattta tcaccacaaa ggaagaccct ttacttcgat 2281 c //