Path: utzoo!attcan!uunet!maverick.ksu.ksu.edu!zaphod.mps.ohio-state.edu!usc!snorkelwacker!bionet!root From: GenBank-Updates@genbank.bio.net Newsgroups: bionet.molbio.genbank.updates Subject: Database Update Message-ID: Date: 25 May 90 12:00:13 GMT Sender: root@genbank.BIO.NET Distribution: bionet Lines: 3651 Approved: lear@genbank.bio.net Checksum: 00646 225 LOCUS FCLCAPS 2493 bp ss-RNA VRL 25-MAY-1990 DEFINITION Feline calicivirus capsid protein gene (put.), complete cds. ACCESSION M32819 KEYWORDS capsid protein. SOURCE Feline calicivirus (CFI/68 FIV), cDNA to viral RNA, clone pCV[2,7]. ORGANISM Feline calicivirus Viridae; ss-RNA nonenveloped viruses; Caliciviridae. REFERENCE 1 (bases 1 to 2493) AUTHORS Neill,J.D. TITLE Nucleotide sequence and expression of the capsid protein gene of Feline calicivirus JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by J.D.Neill, 14-MAR-1990. Pioneer Hi-Bred International, 7300 NW 62nd Ave., Johnston, IA 50131 FEATURES from to/span description pept 127 2133 capsid protein ORF (put.) pept 2130 2450 capsid protein ORF (put.) BASE COUNT 666 a 593 c 531 g 703 t ORIGIN 1 gcccccttct tacagcacag ccttggagca ttacaacagc caatttaatg gtgtggaggc 61 gcggactgac cagatcgata cgagtggcat ggccgccctt cactgtgatg tgttcgaagt 121 ttgagcatgt gctcaacctg cgctaacgtg cttaaatact atgattggga tcctcacatc 181 aaattggtaa tcaaccccaa caaatttcta catgttggct tctgcgataa ccctttaatg 241 tgttgttatc ctgaattact acctgaattt ggcaccatgt gggattgtga tcaatcgcca 301 ctccaagtct accttgagtc aatcctgggt gatgatgaat ggtcctccac tcatgaagca 361 attgacccag ttgtgccacc aatgcattgg gatgaagccg gaaaaatctt ccaaccacac 421 cctggcgtcc ttatgcatca cctcatctgt aaggttgcag aaggatggga cccaaacctg 481 ccacttttcc gcttggaagc ggacgatggt tccatcacga cacctgaaca gggaacaatg 541 gttggtggag tcattgctga gcccaacgcc caaatgtcaa ccgcagctga catggccact 601 gggaaaagtg tggactctga gtgggaagcc ttcttctcct ttcacactag tgtgaactgg 661 agcacatctg aaactcaggg gaagatactc tttaaacaat ccttaggacc attgctcaac 721 ccctacctta cccatcttgc aaagctgtat gttgcttggt ctggttctgt tgatgttagg 781 ttttctattt ctggatctgg tgtctttgga gggaaattag ctgctattgt tgtgccgcca 841 ggaattgatc ctgttcaaag tacttcaatg ctgcaatatc ctcatgtcct ctttgatgct 901 cgtcaagttg aacctgttat cttttccatt cccgatctaa gaagcacctt atatcacctt 961 atgtctgaca ctgataccac atcgttggta atcatggtgt acaatgatct tattaacccc 1021 tatgctaatg actcaaactc ttcgggctgc attgtcactg tggaaactaa accggggcca 1081 gatttcaagt ttcacctctt aaaacctcct gggtctatgt taactcacgg atctatccca 1141 tctgatctaa tcccaaaatc atcttcgctt tggattggaa atcggttttg gtctgacata 1201 accgattttg taattcggcc ttttgtgttc caggcaaatc gacactttga tttcaaccaa 1261 gagacagcag gttggagcac cccaaggttt cgcccaatta ctatcactat cagtgttaag 1321 gagtcagcaa agcttggtat tggagtggcc accgactaca ttgttcccgg cataccagat 1381 ggatggcccg acacaacaat cccaggtgag ttggtacctg ttggtgacta tgccatcact 1441 aatggcacca acaatgatat caccacagct gcgcagtacg atgcagccac tgagattaga 1501 aacaacacca atttcagagg catgtacatt tgtggttctc ttcaaagagc ttggggggat 1561 aagaagattt caaatactgc ttttatcaca accggcacgg ttgatggagc caaattgata 1621 cccagtaata ccattgacca aacaaaaatt gccgtattcc aagacacaca tgcgaataag 1681 catgtccaga cctcggacga cacattggcc ctgcttggtt atactggtat tggtgaggaa 1741 gcaattggtg ctgaccgcga tagagttgtg cgaattagcg tcctcccgga acgtggcgca 1801 cgtggtggca atcacccaat cttccacaaa aactctatca agcttggtta tgtaattagg 1861 tccattgatg tgttcaattc tcaaattctg catacctcta ggcaactttc cctcaatcat 1921 tacttattgt cgcctgactc ctttgctgtc tataggatta ttgactctaa tggatcctgg 1981 tttgacatag gcattgataa tgatggattt tcttttgttg gtgtatcaag tattggtaaa 2041 ttagagtttc ctttaactgc ctcctacatg ggaattcaat tggcaaaaat tcgacttgcc 2101 tctaacatta ggagtgtgat gacaaaatta tgaattcaat tttgggctta attgacactg 2161 tcacgaacac aattggcaaa gctcaacaaa tcgaattgga taaggctgca cttggtcagc 2221 aacgcgagct ggcactccaa cgtatgaact tggatcgcca ggctctaaat aatcaagtgg 2281 agcaatttaa caaactgctt gagcagaggg tacaaggccc aatccaatct gtgcgcctgg 2341 cacgcgcagc tggtttcagg gtcgaccctt actcatacac aaatcaaaac ttttatgacg 2401 atcaattaaa tgcaatcaga ctatcatata gaaatttgtt caagaattga tcacttaacc 2461 ctttgggtgc cgcacttgcg cctaacccca ggg // LOCUS RIRSTA56 2280 bp ds-DNA BCT 25-MAY-1990 DEFINITION R.tsutsugamushi scrub typhus antigen 56 (sta 56) gene, complete cds. ACCESSION M33004 KEYWORDS antigen; outer membrane protein; scrub typhus antigen 56. SOURCE R.tsutsugamushi (strain Karp) DNA, clone pRTS56H2.3. ORGANISM Rickettsia tsutsugamushi Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Rickettsias and Chlamydias; Rickettsiales; Rickettsiaceae. REFERENCE 1 (bases 1 to 2280) AUTHORS Stover,C.K., Marana,D.P., Carter,J.M., Roe,B.A., Mardis,E. and Oaks,E.V. TITLE The 56 kilodalton major protein antigen of Rickettsia tsutsugamushi: Molecular cloning and sequence analysis of sta 56 gene, and precise identification of a strain specific epitope JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by E.V.Oaks, 16-MAR-1990. Walter Reed Institute, Dept. of Enteric Infections, Walter Reed Army Institute of Research, Washington, D.C. 20307 FEATURES from to/span description pept 556 2154 scrub typhus antigen 56 precursor (sta 56) sigp 556 621 scrub typhus antigen 56 signal peptide matp 622 2151 scrub typhus antigen 56 signal 77 82 -35 region signal 99 103 -10 region binding 545 550 ribosome binding site BASE COUNT 764 a 319 c 489 g 708 t ORIGIN 1 aagcttgttc attttttatg tgggctaatt ttagataatg caatgttagt ataattatgt 61 ggttaattaa tgtatcttga tttaagattt tatataaata taataagatt tatgtagggc 121 ttaattatta gcttaaaaaa ctgttgctat tttagctaaa aataaaagtt tgggcaagaa 181 aaattattaa taattgaagg tagttgttgc gtaaaaagct gtgttatgct atctaaggtt 241 aaatgtagca aggtgctaat agataattaa tgtattttcg aacgtgtctt taagctatat 301 ataagagcag tattctattg aatattgttt ctaagtatat aaaaaataaa aatacatttt 361 acaattgata aaacgctttg agcacatttt taacacagtg ttttatagat tgtttaaatt 421 attttacaag tactattaaa tattagtata ctaaataata gttttttgat ataaaactaa 481 agttagtgtg gctaaataat tagtttagaa tggttaccac taaaaaataa atttaattct 541 tttaaggaga ttagaatgaa aaaaattatg ttaattgcta gtgcaatgtc tgcgttgtcg 601 ttgccatttt cagctagtgc aatagaattg ggggaagaag gattagagtg tggtccttat 661 gctaaagttg gagttgttgg aggaatgatt actggcgtag aatctgctcg cttggatcca 721 gctgatgctg aaggcaaaaa acacttgtca ttaacaaatg ggctgccatt tggtggaacg 781 ttggctgcag gtatgacaat cgctccagga tttagagcag agataggtgt tatgtacctt 841 acaaatataa ctgctcaggt tgaagaaggt aaagttaagg cagattctgt aggtgagaca 901 aaggcagatt ctgtaggtgg gaaagatgct cctatacgta agcggtttaa acttacacct 961 cctcagccta ctataatgcc tataagtata gctgtacgtg actttgggat tgatattcct 1021 aaccagacct cagcagcaag cacaagccgc agcctcaggc ttaatgatga gcaacgtgct 1081 gcagctagga tcgcttggtt aaagaattgt gctggtattg actatagggt aaaaaaccct 1141 aatgatccta atgggcctat ggttataaat ccgatattgt taaatattcc acagggtaac 1201 cctaatcctg ttggaaatcc accgcagcga gcaaatccgc ctgcaggttt tgcgatacat 1261 aaccatgagc aatggaggca tttggtagtt gggcttgctg cattatcaaa tgctaataaa 1321 cctagcgctt ctcctgtcaa agtattaagt gataaaatta ctcagatata tagtgatata 1381 aagcatttgg ctgatatagc tggtattgat gttcctgata ctagtttgcc taatagtgca 1441 tctgtcgaac agatacagaa taaaatgcaa gaattaaacg atctattgga agagctcaga 1501 gaatcttttg atgggtatct tggtggtaat gcttttgcta atcagataca gttgaatttt 1561 gtcatgccgc agcaagcaca gcagcagggg caagggcagc aacagcaagc tcaagctaca 1621 gcgcaagaag cagtagcagc agcagctgtt aggcttttaa atggcaatga tcagattgcg 1681 cagttatata aagatcttgt taaattgcag cgtcatgcag gaattaagaa agcgatggaa 1741 aaattagctg cccaacaaga agaagatgca aagaatcaag gtgaaggtga ctgcaagcag 1801 caacaaggaa catctgaaaa atctaaaaaa ggaaaagaca aagaggcaga gtttgatctg 1861 agtatgattg tcggccaagt taaactctat gctgacgtaa tgataactga atcagtctca 1921 atatatgctg gtgttggtgc agggttagct tatacttctg gaaaaataga taataaggat 1981 attaaagggc atacaggcat ggttgcatca ggagcacttg gtgtagcaat taatgctgct 2041 gaaggtgtgt atgtggacat agaaggtagt tatatgtact cattcagtaa aatagaagag 2101 aagtattcaa taaatcctct tatggcaagt gtaagtgtac gctataactt ctagtttttt 2161 ctttaaaatt ataaaaaaag cagctaaaag ttctttacag ggtttttagc tgctttttca 2221 gagttttttt ataataataa aaataacttt attctttgct atttaattag cttgaagctt // LOCUS HUMTROPI 816 bp ss-mRNA PRI 25-MAY-1990 DEFINITION Human slow-twitch skeletal troponin I (TNN1) mRNA, complete cds. ACCESSION J04760 KEYWORDS slow-twitch skeletal troponin I; troponin I. SOURCE Human adult slow-twitch skeletal muscle, cDNA to mRNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 816) AUTHORS Wade,R., Eddy,R., Shows,T.B. and Kedes,L. TITLE cDNA sequence, tissue-specific expression and chromosomal mapping of the human slow-twitch skeletal muscle isoform of troponin I JOURNAL Genomics (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by R.Wade, 15-MAR-1990. FEATURES from to/span description pept 77 640 slow-twitch skeletal troponin I (TNN1) BASE COUNT 180 a 247 c 244 g 145 t ORIGIN Chromosome 1q12-qter.. 1 tagtctgcag tctacggcga ggcacaggcc agcccagctc cacgaggact gaacaaggtg 61 ctgtctcact gccaccatgc cggaagtcga gagaaaaccc aagatcactg cctcccgcaa 121 actcttgctg aagagcctga tgctggccaa ggccaaggaa tgctgggagc aggagcacga 181 ggagcgcgag gctgagaagg tgcgctacct ggcagagcgc atccccacgc tgcagacccg 241 tggcctgtcc ctcagtgccc tgcaggacct gtgccgggag ctgcacgcca aggtggaggt 301 ggtggatgag gagcgatacg acattgaggc caaatgcctc cacaacacca gggagattaa 361 ggacctgaag ctgaaggtga tggacctccg tgggaagttc aagcgcccgc ccctgcgtcg 421 agtccgtgtc tcggctgacg ccatgctccg ggccctgctg ggctccaagc acaaggtgtc 481 catggatctg cgggccaacc tcaagtctgt gaagaaggaa gacacagaga aggagcggcc 541 tgtggaggtg ggtgactgga ggaagaacgt ggaggccatg tctggcatgg aaggccggaa 601 gaagatgttt gatgccgcca atgctccgac ctcacaatag aggccagctt gctgtgctgc 661 gctctgagct cctgcttcat gcttcttctc caacccagct cactcacctc tctgcctgtg 721 tctggagcat cccttcccac ctctccccca cttcttccct ccagcctgca atgccctcct 781 ctggaactgg gattaaacag atacccaaga ggcagg // LOCUS ASOTAAG1 2935 bp ds-DNA PLN 25-MAY-1990 DEFINITION A.oryzae Taka-amylase A (Taa-G1) gene, complete cds. ACCESSION M33218 KEYWORDS Taka-amylase A. SOURCE A.oryzae (strain JCM02239) DNA. ORGANISM Aspergillus oryzae Eukaryota; Plantae; Thallobionta; Eumycota; Ascomycotina; Plectomycetes; Eurotiales; Trichocomaceae. REFERENCE 1 (bases 1 to 2935) AUTHORS Tsukagoshi,N., Furukawa,M., Nagaba,H., Kirita,N., Tsuboi,A. and Udaka,S. TITLE Isolation of a cDNA encoding Aspergillus oryzae Taka-amylase A: Evidence for multiple related genes JOURNAL Gene 84, 319-327 (1989) STANDARD simple staff_entry FEATURES from to/span description pept 607 772 Taka-amylase A (Taa-G1) precursor, exon 1 828 868 Taka-amylase A precursor, exon 2 955 1070 Taka-amylase A precursor, exon 3 1140 1248 Taka-amylase A precursor, exon 4 1317 1545 Taka-amylase A precursor, exon 5 1603 1765 Taka-amylase A precursor, exon 6 1830 1976 Taka-amylase A precursor, exon 7 2041 2281 Taka-amylase A precursor, exon 8 2360 2647 Taka-amylase A precursor, exon 9 sigp 607 669 Taka-amylase A signal peptide matp 670 772 Taka-amylase A 828 868 Taka-amylase A 955 1070 Taka-amylase A 1140 1248 Taka-amylase A 1317 1545 Taka-amylase A 1603 1765 Taka-amylase A 1830 1976 Taka-amylase A 2041 2281 Taka-amylase A 2360 2644 Taka-amylase A pre-msg 543 > 2789 Taa-G1 mRNA and introns IVS 773 827 Taa-G1 intron A (no splice consensus) IVS 869 954 Taa-G1 intron B IVS 1071 1139 Taa-G1 intron C IVS 1249 1316 Taa-G1 intron D IVS 1546 1602 Taa-G1 intron E IVS 1766 1829 Taa-G1 intron F IVS 1977 2040 Taa-G1 intron G IVS 2282 2359 Taa-G1 intron H signal 2784 2789 poly-A signal BASE COUNT 818 a 752 c 657 g 708 t ORIGIN 1 ccagtgaatt catggtgttt tgatcatttt aaatttttat atggcgggtg gtgggcaact 61 cgcttaccga ttacgttagg gctgatattt acgtaaaaat cgtcaaggga tcgaagacca 121 aagtagtaaa accccggagt caacagcatc caagcccaag tccttcacgg agaaacccca 181 gcgtccacat cacgagcgaa ggaccacctc tacgcatcgg acgcaccatc caaatagaag 241 cagcaaagcg aaacagccca agaaaaaggt cggcccgtcg gccttttctg caacgctgat 301 cacgggcagc gatccaacca acaccctcca gagtgactag gggcggaaat ttaaagggat 361 taatttccac tcaaccacaa atcacagtcg tccccggcta ttgtcctgca gaatgcaatt 421 gaaactcttc tgcgaatcgc ttgattcccc gcccctggcc gtagagctta aagtatgtcc 481 cttgtcgatg cgatgtatca caaccatata aatactagca agggatgcca tgcttggagg 541 atagcaaccg acaacatcac atcaagctct cccttctctg aacaataaac cccacagaag 601 gcatttatga tggtcgcgtg gtggtctcta tttctgtacg gccttcaggt cgcggcacct 661 gctttggctg caacgcctgc ggactggcga tcgcaatcca tttatttcct tctcacggat 721 cgatttgcaa ggacggatgg gtcgacgact gcgacttgta atactgcgga tcgggtgtgt 781 tgttacctac tagctttcag aaagaggaat gtaaactgac ttgatataga aatactgtgg 841 tggaacatgg cagggcatca tcgacaaggt aaattgcccc tttatcaaaa aaaaagaagg 901 aaaagcagaa gaaaaaataa aataaaaaga actctagtcc taaccatcac atagttggac 961 tatatccagg gaatgggctt cacagccatc tggatcaccc ccgttacagc ccagctgccc 1021 cagaccaccg catatggaga tgcctaccat ggctactggc agcaggatat gtaagtcgat 1081 ttctttaaat atctacctgt catcttttac atcaatatga actaacttga tggttttaga 1141 tactctctga acgaaaacta cggcactgca gatgacttga aggcgctctc ttcggccctt 1201 catgagaggg ggatgtatct tatggtcgat gtggttgcta accatatggt tcgtggtcct 1261 ttgcaactga cttcgcggat atggttcatt tcagtactga caatgagtaa tatcagggct 1321 atgatggagc gggtagctca gtcgattaca gtgtgtttaa accgttcagt tcccaagact 1381 acttccaccc gttctgtctc attcaaaact atgaagatca gactcaggtt gaggattgct 1441 ggctaggaga taacactgtc tccttgcctg atctcgatac caccaaggat gtggtcaaga 1501 atgaatggta cgactgggtg ggatcattgg tatcgaacta ctccagtaag atatttctcc 1561 ctcattctac aacttggctg atcgatgatc ttacgaaatc agttgacggc ctccgtatcg 1621 acacagtaaa acacgtccag aaggacttct ggcccgggta caacaaagcc gcaggcgtgt 1681 actgtatcgg cgaggtgctc gacggtgatc cggcctacac ttgtccctac cagaacgtca 1741 tggacggcgt actgaactat cccatgtatg gttcctccaa ccatgagcct tcttgcaagt 1801 ctcatctcct aacgaaacgc taaaaccagt tactatccac tcctcaacgc cttcaagtca 1861 acctccggca gcatgcacga cctctacaac atgatcaaca ccgtcaaatc cgactgtcca 1921 gactcaacac tcctgggcac attcgtcgag aaccacgaca acccacggtt cgcttcgtaa 1981 gtcttccctt ttattttcgt tcccaatttc cacacagaac cccacctaac aagagcaaag 2041 ttacaccaac gacatagccc tcgccaagaa cgtcgcagca ttcatcatcc tcaacgacgg 2101 aatccccatc atctacgccg gccaagaaca gcactacgcc ggcggaaacg accccgcgaa 2161 ccgcgaagca acctgggctt cgggctaccc gaccgacagc gagctgtaca agttaattgc 2221 ctccgcgaac gcaatccgga actatgccat tagcaaagat acaggattcg tgacctacaa 2281 ggtaagcaca acctctaagc ataccctaat ggcctatcct tcagagtatc tgacacaaga 2341 ctaatcactg gcaatacaga actggcccat ctacaaagac gacacaacga tcgccatgcg 2401 caagggcaca gatgggtcgc agatcgtgac tatcttgtcc aacaagggtg cttcgggtga 2461 ttcgtatacc ctctccttga gtggtgcggg ttacacagcc ggccagcaat tgacggaggt 2521 cattggctgc acgaccgtga cggttggttc ggatggaaat gtgcctgttc ctatggcagg 2581 tgggctacct agggtattgt atccgactga gaagttggca ggtagcaaga tctgtagtag 2641 ctcgtgaagg gtggagagta tatgatggta ctgctattca atctggcatt ggacagtgag 2701 tttgagtttg atgtaacttg tctattctat gatgtatggt ctttttgttc tatagttgga 2761 aatcggaatg atctcaaatc ttgaataaat ataaaaagga taatactcac atccatcaca 2821 accttacaag gttaattccg agctatattc caccgacaca caaataggca gattcttctc 2881 tcgccaggaa tcgcgatatt attggcatgc aaataacgat aactgtctca gaagg // LOCUS ASOTAAG2A1 197 bp ds-DNA PLN 25-MAY-1990 DEFINITION A.oryzae Taka-amylase A (Taa-G2) gene, 5' end. ACCESSION M33220 KEYWORDS Taka-amylase A. SOURCE A.oryzae (strain JCM02239) DNA. ORGANISM Aspergillus oryzae Eukaryota; Plantae; Thallobionta; Eumycota; Ascomycotina; Plectomycetes; Eurotiales; Trichocomaceae. REFERENCE 1 (bases 1 to 197) AUTHORS Tsukagoshi,N., Furukawa,M., Nagaba,H., Kirita,N., Tsuboi,A. and Udaka,S. TITLE Isolation of a cDNA encoding Aspergillus oryzae Taka-amylase A: Evidence for multiple related genes JOURNAL Gene 84, 319-327 (1989) STANDARD simple staff_entry FEATURES from to/span description pept 195 > 197 Taka-amylase A (Taa-G2) precursor pre-msg 131 > 197 Taa-G2 mRNA and introns signal 2 11 CAAT box signal 95 100 TATA box BASE COUNT 59 a 52 c 36 g 50 t ORIGIN 1 aatgcaattt aaactcttct gcgaatcgct tgattccccg cccttggccg tagagcttaa 61 agtatgtccc ttgtcgatgc gatgtatcac aacatataaa tactagcaag ggatgccatg 121 cttggaggat agcaaccgac aacatcacat caagctctcc cttctctgaa caataaaccc 181 cacagaaggc atttatg // LOCUS ASOTAAG2A2 198 bp ds-DNA PLN 25-MAY-1990 DEFINITION A.oryzae Taka-amylase A (Taa-G2) gene, 3' end. ACCESSION M33222 KEYWORDS Taka-amylase A. SOURCE A.oryzae (strain JCM02239) DNA. ORGANISM Aspergillus oryzae Eukaryota; Plantae; Thallobionta; Eumycota; Ascomycotina; Plectomycetes; Eurotiales; Trichocomaceae. REFERENCE 1 (bases 1 to 198) AUTHORS Tsukagoshi,N., Furukawa,M., Nagaba,H., Kirita,N., Tsuboi,A. and Udaka,S. TITLE Isolation of a cDNA encoding Aspergillus oryzae Taka-amylase A: Evidence for multiple related genes JOURNAL Gene 84, 319-327 (1989) STANDARD simple staff_entry FEATURES from to/span description pept < 1 9 Taka-amylase A (Taa-G2) (AA at 1) pre-msg < 1 151 Taa-G2 mRNA and introns (alt.) pre-msg < 1 156 Taa-G2 mRNA and introns (alt.) BASE COUNT 45 a 48 c 53 g 52 t ORIGIN About 2.1 kb after segment 1. 1 agctcgtgaa gggtggagag tatatgatgg tactgctatt caatctggca ttggacagtg 61 agtttgagtt tgatgtacag tataaatcta gtgtactttg cacccaccac gcaatgaaac 121 ggcaccgggc cccgtctgag agcccgtctc gaatccctgt tggtcatctt ccatcgcttc 181 gtcctccaga ggcgagga // LOCUS ASOTTAM1 191 bp ss-mRNA PLN 25-MAY-1990 DEFINITION A.oryzae Taka-amylase A (Taa) mRNA, 3' end. ACCESSION M33219 KEYWORDS Taka-amylase A. SEGMENT 1 of 2 SOURCE A.oryzae (strain JCM02239), cDNA to mRNA, clones lambda-T[1-4]. ORGANISM Aspergillus oryzae Eukaryota; Plantae; Thallobionta; Eumycota; Ascomycotina; Plectomycetes; Eurotiales; Trichocomaceae. REFERENCE 1 (bases 1 to 191) AUTHORS Tsukagoshi,N., Furukawa,M., Nagaba,H., Kirita,N., Tsuboi,A. and Udaka,S. TITLE Isolation of a cDNA encoding Aspergillus oryzae Taka-amylase A: Evidence for multiple related genes JOURNAL Gene 84, 319-327 (1989) STANDARD simple staff_entry FEATURES from to/span description pept 189 > 191 Taka-amylase A (Taa) mRNA 125 > 191 Taa mRNA BASE COUNT 56 a 52 c 32 g 51 t ORIGIN 1 ttccggccat ataaatggtt cattgttcat tactctataa tgctaatgtt tagattagca 61 caactatgac tgggcaaatg ccgccggcca tagatagatc atctcctctc ggacgcttgt 121 ccgaagcaac cgacaacatc acatcaagct ctcccttctc tgaacaataa accccacaga 181 aggcatttat g // LOCUS ASOTTAM2 156 bp ss-mRNA PLN 25-MAY-1990 DEFINITION A.oryzae Taka-amylase A (Taa) mRNA, 5' end. ACCESSION M33221 KEYWORDS Taka-amylase A. SEGMENT 2 of 2 SOURCE A.oryzae (strain JCM02239), cDNA to mRNA, clones lambda-T[1-4]. ORGANISM Aspergillus oryzae Eukaryota; Plantae; Thallobionta; Eumycota; Ascomycotina; Plectomycetes; Eurotiales; Trichocomaceae. REFERENCE 1 (bases 1 to 156) AUTHORS Tsukagoshi,N., Furukawa,M., Nagaba,H., Kirita,N., Tsuboi,A. and Udaka,S. TITLE Isolation of a cDNA encoding Aspergillus oryzae Taka-amylase A: Evidence for multiple related genes JOURNAL Gene 84, 319-327 (1989) STANDARD simple staff_entry FEATURES from to/span description pept < 1 9 Taka-amylase A (Taa) (AA at 1) mRNA < 1 151 Taa mRNA (alt.) mRNA < 1 156 Taa mRNA (alt.) BASE COUNT 40 a 35 c 42 g 39 t ORIGIN About 1.5 kb after segment 1. 1 agctcgtgaa gggtggagag tatatgatgg tactgctatt caatctggca ttggacagtg 61 agtttgagtt tgatgtacag tataaatcta gtgtactttg cacccaccac gcaatgaaac 121 ggcaccgggc cccgactgag agcccgtctc gaatcc // LOCUS CIPNADGAPD 1354 bp ss-mRNA PLN 25-MAY-1990 DEFINITION M.crystallinum glyceraldehyde-3-phosphate dehydrogenase (NAD-GAPDH) mRNA, complete cds. ACCESSION J05223 KEYWORDS D-glyceraldehyde-3-phosphate:NAD+ oxidoreductase; glyceraldehyde-3-phosphate dehydrogenase. SOURCE M.crystallinum, cDNA to mRNA. ORGANISM Mesembryanthemum crystallinum Eukaryota; Plantae; Embryobionta; Magnoliophyta; Magnoliopsida; Caryophyllidae; Caryophyllales; Aizoaceae. REFERENCE 1 (bases 1 to 1354) AUTHORS Ostrem,J.A., Vernon,D.M. and Bohnert,H.J. TITLE Increased expression of a gene coding for NAD:glyceraldehyde-3- phosphate dehydrogenase during the transition from C-3 photosynthesis to crassulacean acid metabolism in Mesembryanthemum crystallinum JOURNAL J. Biol. Chem. 265, 3497-3502 (1990) STANDARD simple staff_entry FEATURES from to/span description pept 55 1068 glyceraldehyde-3-phosphate dehydrogenase (EC 1.2.1.12) BASE COUNT 315 a 308 c 345 g 386 t ORIGIN 1 tctcacttct ctcttcttcc cctcgatctc tcaatctctc tctctcttcc tacaatggct 61 aaggttaagg tcggaatcaa cggttttgga aggatcgggc gtttggtcgc cagagtgatc 121 ctccagaggg atgactgtga gctcgtcgct gtcaacgacc ccttcatctc caccgattac 181 atgacataca tgttcaagta cgacagtgtc cacggtcagt gcaagagcca tgagatcaag 241 ttgaaggacg agaagaccct tctcttcggt gagaccccgg tcgccgtctt cggatgcagg 301 aacccagagg aaatcccatg gggtcaggct ggagccgact tcgttgtcga atccaccgga 361 gtcttcaccg acaaggacaa ggctgctgct catttgaagg gtggtgctaa gaaggtcgtt 421 atctcagctc ctagcaagga tgctcctatg tttgttgttg gtgttaacga gcacgagtac 481 aagtcagacc tcaacatcgt ttctaatgcc agttgtacca caaactgtct tgctcccttg 541 gccaaggtta tcaacgacag gtttggcatc gttgagggtc ttatgacaac tgtccacgcc 601 atgactgcta cccaaaagac cgttgatggt ccatcaatga aggactggag aggtggaagg 661 gctgcttcat tcaacatcat ccctagcagc actggagcag ctaaggctgt cggcaaggtt 721 ttgcctgctt tgaacgggaa attgacagga atggctttcc gtgttccaac ttgtgatgtg 781 tccgtggttg acctcacagt cagaattgag aaggctgcta gctacgagca gatcaaggct 841 gccatcaagg aggaatctga gggcaagctg aagggtattt tgggatacac cgaggatgat 901 cttgtttcca ccgactttat tggtgacaac aggtcaagca tctttgatgc caaggccgga 961 atctcattga acgacaactt cgtcaagctt gtctcgtggt acgacaacga atggggttac 1021 agtacccgtg ttgttgactt gatcatgcac atctcaaagt gccagtaagc tatttgctga 1081 aggttggctg agtgtgcgtt gatgcagtgt ttttcccttg tctatcatga gatggctatc 1141 gtcatcatca tttgaataaa gcgggatttt gagaaaaacc ggagctttgt ctttccgttt 1201 agtttcctag gtttggtata taggggtgat tgtttctccc ccctttgtgt tttgttatta 1261 tttagtgaaa gaacttgcag tctatatcgg agttatttga ctttccggtg gcacttatcc 1321 agcatttatg aaacattgct gtgagctttt gagt // LOCUS ECOPRIA 2658 bp ds-DNA BCT 25-MAY-1990 DEFINITION E.coli primosomal protein n' (priA) gene, complete cds, and cytR gene, 5' end. ACCESSION M33293 KEYWORDS cytR gene; priA gene; primosomal protein. SOURCE E.coli (strain W3110) DNA, clone pEL042. ORGANISM Escherichia coli Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Facultatively anaerobic rods; Enterobacteriaceae. REFERENCE 1 (bases 1 to 2658) AUTHORS Lee,E.H., Masai,H., Allen,G.C.Jr. and Kornberg,A. TITLE The priA gene encoding the primosomal, replicative n' protein of E.coli JOURNAL Unpublished (1990) Stanford U., Beckman Cntr., Stanford, CA 94305 STANDARD full automatic COMMENT Authorin sequence for [1] kindly submitted by G.C.Allen Jr., 26-MAR-1990. FEATURES from to/span description pept 64 2262 primosomal protein n' pept 2418 > 2658 cytR gene product signal 27 32 -10 region signal 5 10 -35 region binding 53 59 ribosome binding site BASE COUNT 578 a 738 c 756 g 586 t ORIGIN 1 gatccgcact cttctgcgac aatgtgtata ctaacccacc gaatttcaag tcaggatgat 61 gctatgcccg ttgcccacgt tgccttgccc gttccgcttc ctcgtacctt tgactatctg 121 ctgccagaag gcatgacggt taaagctggg tgtcgcgtgc gcgtgccgtt tggcaaacag 181 caggagcgca tcgggattgt ggtatcagtt agcgatgcca gcgaactgcc gctcaatgag 241 ctaaaagcgg tagtcgaagt gctggatagt gagccggtgt ttactcactc cgtctggcga 301 ttgctgctat gggcggcaga ttactatcat catccgattg gcgatgtgct gtttcatgcc 361 ttgccgattt tactacgcca ggggcggcct gcggcgaacg cgccgatgtg gtactggttt 421 gccactgaac aaggccaggc ggtggatctg aacagcctga aacgctcccc caagcaacaa 481 caggcgctgg cggcgttacg gcaaggcaaa atctggcgcg accaggtccg cacgctcgaa 541 tttaatgatg ccgcgttgca ggcgctacgc aaaaaaggtc tgtgtgattt agcaagtgaa 601 acaccagagt ttagcgactg gcgaacgaac tatgccgttt ctggtgagcg gttgcgattg 661 aataccgaac aggccaccgc cgttggcgca attcatagcg cggcagatac tttttctgcc 721 tggctgctgg cgggcgttac cggttccggt aaaacggagg tttatctcag cgtactggaa 781 aacgtgctcg ctcagggcaa acaggcgctg gtgatggtgc cggaaatcgg cctgacaccg 841 caaactatcg cccgttttcg tgaacgtttt aatgcccccg tggaagttct gcattccggc 901 ctgaacgaca gcgagcgtct ttcggcgtgg ctgaaagcga aaaatggtga ggcggcgatt 961 gtgatcggca cccgctccgc gctgtttacg ccgtttaaaa atctcggcgt gattgtcatt 1021 gatgaagagc acgacagctc ctacaagcag caggaaggct ggcgctatca tgcccgcgac 1081 ctggcggtgt atcgtgcgca cagcgagcaa atcccgatta ttcttggctc cgcaacgccc 1141 gcgctggaaa cgttatgcaa cgtccagcag aaaaaatacc gcctgctgcg cctgacccgt 1201 cgggcaggga atgcgcgtcc ggcaattcaa catgtgctgg atttaaaagg tcagaaggtg 1261 caggcaggtc tggctccggc gttaatcact cgtatgcgcc agcatttaca ggctgataac 1321 caggtcattc tctttcttaa ccgccgtggc tttgcgcctg cactgctgtg ccacgactgt 1381 ggctggattg ccgaatgccc acgttgcgat cactactaca cgctgcatca ggcgcagcac 1441 catctgcgct gccaccactg tgacagtcag cgtccggtgc cgcgccagtg cccttcctgc 1501 ggttccacgc acctggtccc cgtggggctg ggcaccgaac agcttgaaca gacgctcgcg 1561 ccgttgttcc ccggcgtgcc catttctcgt atcgaccgcg ataccaccag ccgcaaaggg 1621 gcgctggaac agcaactggc agaagtacat cgcggcggcg cgcggatttt gattggtaca 1681 caaatgctgg cgaaaggtca ccatttcccg gatgtgacgc tggttgcatt actggacgtg 1741 gacggcgcgc tgttttctgc cgattttcgc tcggcagagc gtttcgctca gctttacacc 1801 caggtcgccg gtcgtgccgg gcgtgcgggt aaacagggcg aagtggtgct gcaaacgcac 1861 catccggaac atcctctgtt gcaaacgttg ctctataaag gctacgacgc ctttgccgaa 1921 cagcggctgg ctgagcggcg aatgatgcag ctaccgccgt ggaccagcca tgtgattgtg 1981 cgtgcggaag atcataacaa tcagcacgcg ccattgttcc tgcaacaact gcgtaatctg 2041 atcctctcca gcccactggc agacgagaaa ctgtgggttc tcggtccggt tccggctctg 2101 gcacctaaac gtggcggtcg ctggcgctgg cagatattgt tgcagcaccc ttcccgcgtg 2161 cgcttgcaac acatcattaa cggtacgctg gcgctcatca atacaatacc ggattcccgt 2221 aaggtgaaat gggtgctgga tgttgatccg attgagggtt aaaccgctca cgatgcgagg 2281 cggatcgaaa aattcaatat tcatcacact tttcatgaaa attctgtaac cgttttcacg 2341 cgctatctgc taaaaatgtt gccgatgtga agtaaacatg gatgtagtac gcctgacgtg 2401 ccaggcgagg agtgagtgtg aaagcgaaga agcaggaaac tgccgcgacc atgaaagacg 2461 ttgccctcaa ggcaaaagtc tctacagcga ccgtctcccg agcattaatg aatcccgata 2521 aagtctccca ggccacccgt aatcgggttg aaaaagcggc ccgggaagtg ggttatttac 2581 cgcagcctat ggggcgcaac gtcaagcgta atgaatcccg caccattctg gtgattgtcc 2641 cggatatctg cgatcccc // LOCUS EWCTELRNA 657 bp ds-DNA INV 25-MAY-1990 DEFINITION E.crassus telomerase RNA component gene, complete cds. ACCESSION M33461 KEYWORDS telomerase RNA. SOURCE E.crassus DNA. ORGANISM Euplotes crassus Eukaryota; Animalia; Metazoa; Ciliophora; Polyhymenophora; Spirotricha; Heterotrichida; Clevelandellina. REFERENCE 1 (bases 1 to 657) AUTHORS Shippen-Lentz,D. and Blackburn,E.H. TITLE Functional evidence for an RNA template in telomerase JOURNAL Science 247, 546-552 (1990) STANDARD simple staff_entry FEATURES from to/span description RNA 152 342 telomerase RNA component site 186 197 functional telomeric template BASE COUNT 202 a 122 c 111 g 222 t ORIGIN 1 aaaaccccaa aaccccaaaa ccccaaatct gataaaatta ttacgaatag aattttaaga 61 cctgcttatt gttttcgcgt aatttttgac ccataataat taacagaagt aatgactagt 121 tgtttataac ctaataggag gatatagggt agttctccat tgactaatcc gtcaaatctg 181 tcaaacaaaa ccccaaaacc gatcaatagg tgcgtttagc ttgattacac ctcttaaatg 241 aaatcttgca attctggaga gcttgagagg tgaaaccccc acagttaggt caaacatagt 301 ttgagatttg tatctcatat gctctagctg tcctctcatc tttttgacat tagctagacg 361 agacagctcc tcttgctatt tacttgcctt agtccgatca ctccgctaat atttttgatt 421 tttaaatttg gcggaatttc ttgttcacta atcttgaaat ttttacagaa attgttagat 481 ttaataagct aataatctat gtcagagcct ttagccaatt agaggctttc ctaagtacga 541 aagaggtata tatcattaca ttttgaatcc ctgacctcca tttttaagga atagagatac 601 cctccattat attcaatttg ggaaggattg aaaggggttt tggggttttg gggtttt // LOCUS HS4DWXJ 160 bp ds-DNA VRL 25-MAY-1990 DEFINITION Epstein-Barr virus defective WZhet junction. ACCESSION M33474 KEYWORDS . SOURCE Epstein-Barr virus (strain HR-1, clinical sample 9) DNA. ORGANISM Epstein-Barr virus Viridae; ds-DNA enveloped viruses; Herpesviridae; Gammaherpesviridae. REFERENCE 1 (bases 1 to 160) AUTHORS Patton,D.F., Shirley,P., Raab-Traub,N., Resnick,L. and Sixbey,J.W. TITLE Defective viral DNA in Epstein-Barr virus-associated oral hairy leukoplakia JOURNAL J. Virol. 64, 397-400 (1990) STANDARD simple staff_entry FEATURES from to/span description recomb 87 90 WZhet junction BASE COUNT 41 a 45 c 41 g 33 t ORIGIN 1 aatagacagc ccagttgaaa tatgcatggc atgcagcaga cactcctggc gctctgatgc 61 gaccagaaat agctgcagga ccactttata ccaggggcag tggtccccct ccctagaact 121 gacaattggc tgctgtctgg cttacgtaaa cgcgctggac // LOCUS HS4WXJ 181 bp ds-DNA VRL 25-MAY-1990 DEFINITION Epstein-Barr virus WZhet junction, HR-1 clone 5. ACCESSION M33473 KEYWORDS . SOURCE Epstein-Barr virus (strain HR-1, het+ allotype) DNA, clone 5. ORGANISM Epstein-Barr virus Viridae; ds-DNA enveloped viruses; Herpesviridae; Gammaherpesviridae. REFERENCE 1 (bases 1 to 181) AUTHORS Patton,D.F., Shirley,P., Raab-Traub,N., Resnick,L. and Sixbey,J.W. TITLE Defective viral DNA in Epstein-Barr virus-associated oral hairy leukoplakia JOURNAL J. Virol. 64, 397-400 (1990) STANDARD simple staff_entry FEATURES from to/span description recomb 108 111 WZhet junction BASE COUNT 50 a 42 c 46 g 43 t ORIGIN 1 aatagacagc ccagttgaaa tatgcatggc atgcagcaga cattcatcat ttagaaatgt 61 atccaagatt tcattaagtt cgggggtcag gggggagtcc agattcaaat accaggggca 121 gtggtccccc tccctagaac tgacaattgc ctgctgtctg gcttacgtaa acgcgctgga 181 c // LOCUS HUMREGA 4251 bp ds-DNA PRI 25-MAY-1990 DEFINITION Human regenerating protein (reg) gene, complete cds. ACCESSION J05412 KEYWORDS pancreatic stone protein; pancreatic thread protein; regenerating protein. SOURCE Human leukocyte DNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 4251) AUTHORS Watanabe,T., Yonekura,H., Terazono,K., Yamamoto,H. and Okamoto,H. TITLE Complete nucleotide sequence of the human reg gene and its expression in normal and tumoral tissues JOURNAL J. Biol. Chem. (1990) In press STANDARD full staff_entry COMMENT Draft entry and printed sequence for [1] kindly submitted by H.Okamoto, 23-FEB-1990. FEATURES from to/span description pept 1571 1634 regenerating protein (reg), exon 2 (first expressed exon) 2270 2388 regenerating protein, exon 3 2696 2833 regenerating protein, exon 4 3549 3660 regenerating protein, exon 5 3856 3923 regenerating protein, exon 6 pre-msg 1196 4116 reg mRNA and introns IVS 1224 1524 reg intron A IVS 1635 2269 reg intron B IVS 2389 2695 reg intron C IVS 2834 3548 reg intron D IVS 3661 3855 reg intron E signal 1169 1174 TATA box BASE COUNT 1161 a 927 c 869 g 1294 t ORIGIN 1 gaattcctgg gctcaagtga tcctctcatg tcagtctccc aaagtgctgg gatgacaggc 61 ttgagccacc acaccaggcc catcatcagt ttttatataa agaaaaaaaa accttaaaat 121 tgttaggcaa atactatgac aaattgtaat atatattctt acatttcaga tttttatttt 181 ttaaactgta taagaattga ttaataaata aaatttagta ttaatctgtc ttttaaaacc 241 atatataaag tttatcaaat agcttataac ttcttgcaac tgaatttttg tattcaatgt 301 tatggctttg atactagtcc aagttgaaat atagatatct actttattcg atttaaattc 361 tgtttagtat tttattatat tttgttaatc catttgtccc aattcatata cttatctctc 421 tttctgtgaa tattcaggtt agttttttct tcctaatttt gcattctgat tggcttttat 481 tccctgaatt ataaatgact attctatgat gattctggta aatactcaat ttcaccacac 541 aatctttgac ttcatactaa caaacagttg acttcaaatg gacaatttca atgaaggctg 601 acttcatatt tagctccttt aagcttcctt aggcatcagc tctctacaat tctcacattg 661 agaatatgtg tattttgtta gctcaaacct tgttagacat gttaaatgtt tagaaatata 721 aatttaacct accccttgag gtaggtcttg agaggtttgt gagcctaaaa agacatggag 781 gaaccactta ttgccacaag cacattgttc taaattattt ggaatcagtt aattcttccc 841 catctcctac ccatgcctga caccaaagag gagcctctaa atttacaggg aatacaagga 901 agtctactgt tctctgctcc tctctgggtt attagggcac atgggagccc tcagttgttt 961 tctgctgagc aagagcaaag tccaccttgg acttagacag cttgccaaat tttttgccag 1021 aaggggacct gagttgtgac cactcccagt gtgtgccggg aaaaggctca tactggtgcc 1081 agaatctctt actgtcaatg ctcccaaaac tcaccgcttg cccccacccc ttttgcttaa 1141 atgacgtggt tcttatctca gatcctgata taaagctcct acagctacct ggcctgagaa 1201 gccaactcag actcagccaa caggtaagtg ggcattacag gagaagggcg tctctaacat 1261 gcactgtaga tctaaaatct tcgggaagat acagcatgag tttctgtcca agaggtttta 1321 gctgtaagga agcctcagtg ggatccaaag ttgtttttca gttactgagt ctgtataatc 1381 cccactctca agagaaacat ttgaaggtgt gggtgtctca gaggaccttc ctggtctcag 1441 aaattctgag aggaggtttt aaggaaggta ataggtgctt tgctctccat ctctcagaac 1501 ccccttctct gtgttctcct atagagattg ttgatttgcc tcttaagcaa gagattcatt 1561 gcagctcagc atggctcaga ccagctcata cttcatgctg atctcctgcc tgatgtttct 1621 gtctcagagc caaggtaaga tctcttttcc accaaccaac tctttctagc cctgaagact 1681 tcactctatc cccaagcata cgggtctact tgaaaaaaaa aaaaaagcag agtcactgtt 1741 aagggttgtt ttgtggtgtt tagtgatctt tattgcttat ctcttcacat ttatatacat 1801 ccacacctca ttaaggagtt ggagctagaa tttaaaatga ccccttataa gcaactgctg 1861 cagctggcat gagtttatct gattaaattt atacgtgatg gtggatttgg ggatgtctgt 1921 gtgtagacag tcactaatgg ggtggagaac tgaagagagc cttgtgttca gggaaaccaa 1981 gtcaggcttg agaaagtaga aggctgagtc cttcaaggta gaagagcctg agctccagac 2041 ataaaaggga aactggagac ttgtttcttt ggcctattca ttctgttttt tttcccctga 2101 tcaaagaaac caaagacaga agatgtagga tgcaggagca atagtgagca gtcatcccat 2161 aatagactgg attcttctgt ttctataaag gaacctcaga agctcttacc tcaccttcaa 2221 gccttttcct taccctgaga gcctccttta attgtctctt ctttttcagg ccaagaggcc 2281 cagacagagt tgccccaggc ccggatcagc tgcccagaag gcaccaatgc ctatcgctcc 2341 tactgctact actttaatga agaccgcgag acctgggttg atgcagatgt gagtgaggag 2401 agcagtgtgg gaagggagac tcatgaaggg aggggaagct gccactctcc agtgtgttca 2461 gtggctgcaa tgagatgaga ctgaacccct tgctatacta tcatcagccc caaactttcc 2521 aatctacttt atcccattat tcagcacatt cccagcacaa agaacctggt ggtcagtgac 2581 agcatcatca cggacattac tctgctgtcc tttttctgac ccgtcctctt ggaggactca 2641 gtatatccgt cacaacttcc tcctccactg agtgctccat tttcttctgc aacagctcta 2701 ttgccagaac atgaattcgg gcaacctggt gtctgtgctc acccaggccg agggtgcctt 2761 tgtggcctca ctgattaagg agagtggcac tgatgacttc aatgtctgga ttgccctcca 2821 tgaccccaaa aaggtaggct gcagccttct ttatctccta atgatcaggt ttgagaagta 2881 agaaggaggt tcaagttctg gtctcttaag taccagcttt tatcgctttc cagaaatcag 2941 gctgtttaca gatcctctaa tgtcctgtgt agcaaggtgc actgtagatg attggagata 3001 taagtggaag gctgaatttc ctaggtgttc ttgtcattca tgaataaact tattctgttt 3061 tcagtcaaca aagcatcttt atgcaccaac ttcttaccta ttttgttact gtcagagtca 3121 caagagagac tagattgccg actatataag aaaggagact tgtggtaaaa atctgctgct 3181 gtactgctgg catttgggaa cctggtagta tactaaataa tataatatat caacaactaa 3241 tggtcagcca atgctatgct ggatatgagg gtcctgggcc acaaagacaa aaaatcagga 3301 accacttttt aagtgagata ctttgggtct ctgtcaaatt cataacactt atttcttggt 3361 ggaatacagt taatgagttg gacagttcag gaaagaagtt tagagcaata gcaaaggaaa 3421 ggaaacaata tttagcaagg tttattcttc ctttgtgtct tagcatgttt ctgagtgtgc 3481 acacaggccc agtgattcca tgtatttttg agtgaccact gcctctgttc tggcccttcc 3541 ccatctagaa ccgccgctgg cactggagca gtgggtccct ggtctcctac aagtcctggg 3601 gcattggagc cccaagcagt gttaatcctg gctactgtgt gagcctgacc tcaagcacag 3661 gtgagaggca gagaatccat ccacctgttt ctgttctctc ctgcttagct ccagggatgg 3721 aactgggact gggatagagg aaaggtgaac tcctcattaa ggaaatggat gtttggtttt 3781 tgtcctgagt cctaaagcca ggagggtcat actctttcgg gtctcccagt tgtaactctt 3841 ctcattgact tataggattc cagaaatgga aggatgtgcc ttgtgaagac aagttctcct 3901 ttgtctgcaa gttcaaaaac tagaggcagc tggaaaatac atgtctagaa ctgatccagc 3961 aattacaacg gagtcaaaaa ttaaaccgga ccatctctcc aactcaactc aacctggaca 4021 ctctcttctc tgctgagttt gccttgttaa tcttcaatag ttttacctac cccagtcttt 4081 ggaaccctaa ataataaaaa taaacatgtt tccactattg tgctgtctta ctgtgtctgc 4141 tatttccaca gctgatgcct gggtggttga gatgagagtg attacaacaa agcttgctct 4201 ggcctatcca cttcttaaaa gtccatccgc ataccatgca tattggaatt c // LOCUS HUMREGRELA 1524 bp ds-DNA PRI 25-MAY-1990 DEFINITION Human reg-related sequence, complete cds. ACCESSION J05413 KEYWORDS pancreatic stone protein; pancreatic thread protein; regenerating protein. SOURCE Human esophageal mucosa DNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 1524) AUTHORS Watanabe,T., Yonekura,H., Terazono,K., Yamamoto,H. and Okamoto,H. TITLE Complete nucleotide sequence of the human reg gene and its expression in normal and tumoral tissues JOURNAL J. Biol. Chem. (1990) In press STANDARD full staff_entry COMMENT Draft entry and printed sequence for [1] kindly submitted by H.Okamoto, 23-FEB-1990. BASE COUNT 382 a 368 c 345 g 429 t ORIGIN 1 atctcagagg accttcctgc tgtcaggaat tcagaggagg aaataaggaa ggtaataggt 61 gctctgctct cattctctca aaccctcttc cctgtgtttt cctatagaga ttgctgattt 121 gctccttaag caagagattc actgctgctc agcatggctc agaccaactc atgcttcatg 181 ctgatctcct gcctgatgtt cctgtctctg agccaaggtg agattgtttt ccccacacat 241 acctcccaca accccagccc tgaagccctc actctatcct catgcatatg agttcacttg 301 agaaaaagca gagtcaagtt caggggttgt tttgtgttgt tcagtgatat ttattgctga 361 tctcatccca ttcaaaaaca tcctgacctc cctaaggagt tagagatgga acttagcata 421 accctttatc agtgaccact gcagttggca ttggtttgtc atattaacac tactcatgat 481 gggggtgttg aggatgtctg tttgtagaca gtcattagtg gaatggggaa ctgaggggag 541 ctttgtgtgt agagaaactg gacaggcttg agaaagaagc ctcagtcctt caaggaagaa 601 aaagccataa gtaaaaggga caatggggac acttttcatg agcctattca ttgtgtgctc 661 ttgtcttgag caaagacatc ttgagagcct ataggtaaga tgcagaaggg cagaagtgac 721 caatcgcttc gtgacctata ggatccttct attcctataa agaatcctca gaagctccta 781 cctcatattt tagcctttac cttgccctga gggtctttct taattgtctc tcttttccca 841 ggacaggagg cccatgctga gttgcccaag gcccagatca gctgcccaga aggcaccagt 901 gcctaaggct cccactgcta ctactttaat gaagagcatg agacctgggt ttatgcagat 961 gtgagtgagg agagcagtgt gggaagggag gctcacgaag ggaggggaag ctgccactct 1021 ccagtgtgtt cagtggctga tatgagatga gactaatccc ctccctatcc aatcatcagc 1081 ccaaaacttt ccaatctact ttatcccatc attcagcaca gagatgctgg tggtcagtga 1141 cagcatcatc agggacattt ctgtgctgtc ctttttctgt tacatcctct gggagggctc 1201 aatatgtctc ccacactttc ctccttcact gagtgctcca ttttcttctc caacagctct 1261 actgccagaa catgaattca ggtaacctgg tgtctgtgct cacccaggct gagggtgcct 1321 ttgtggcttc gctgattaaa gagagtggca ccaaggatag caatgtctgg attggcctcc 1381 atgaccccca ccggatcagt ctgctgcatc ttctacctcc tgattatcag gttccagagg 1441 gtctgatgtc tggcacctca agcatcagtt tttactatat tatgataaaa gcaacctctc 1501 tataaatcat ataatgtaaa ggat // LOCUS MDPCGA 4801 bp ss-RNA VRL 25-MAY-1990 DEFINITION Aleutian mink disease parvovirus complete genome. ACCESSION M20036 KEYWORDS complete genome. SOURCE Aleutian mink disease parvovirus (strain ADV-G), clone pXVB-4. ORGANISM Aleutian mink disease parvovirus Viridae; ss-DNA nonenveloped viruses; Parvoviridae; Parvovirus. REFERENCE 1 (bases 1 to 4592) AUTHORS Bloom,M.E., Alexandersen,S., Perryman,S., Lechner,D. and Wolfinbarger,J.B. TITLE Nucleotide sequence and genomic organization of Aleutian mink disease parvovirus (ADV): Sequence comparisons between a nonpathogenic and pathogenic strain of ADV JOURNAL J. Virol. 62, 2903-2915 (1988) STANDARD full staff_entry REFERENCE 2 (bases 4593 to 4801) AUTHORS Bloom,M.E., Alexandersen,S., Garon,C.F., Mori,S., Wei,W., Perryman,S. and Wolfinbarger,J.B. TITLE Nucleotide sequence of the 5' terminal palindrome of Aleutian mink disease parvovirus (ADV) and construction of an infectious molecular clone JOURNAL J. Virol. (1990) In press STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1],[2] kindly submitted by M.E.Bloom, 18-MAR-1990. Sequence reported below is (+) strand. FEATURES from to/span description ORF 116 1978 Left ORF ORF 1983 2207 Middle ORF2 ORF 1993 2211 Middle ORF1 ORF 2241 4399 Right ORF BASE COUNT 1740 a 912 c 943 g 1206 t ORIGIN 1 attaattctc aaccaatatt cgttagcaac caacaccagc tcgcttcgct cgcgcacctt 61 cggcgctggt gttgggcgct tcgcgcttgc taacttcata ttggttgaga attaatccgt 121 gtctttcctg tggaatgagg aagtagtgtg gtatataagc agaggttgct tggagcaaag 181 cacagaccgg ttacagcaaa gtaacatggc tcaggctcaa attgatgagc agaggagact 241 gcaggacctg tatgtgcagt tgaagaagga gattaacgac ggtgaaggag ttgcctggtt 301 gttccaacaa aagacctaca ccgacaagga caacaaacca accaaagcaa caccgccact 361 gaggacaacc tcttctgacc taaggttagc ttttgactct attgaagaga atttaacagc 421 ttctaatgaa cacttaacta acaatgagat aaacttttgt aaactaacct tggggaagac 481 gttgctgtta attgataagc atgtaaaaag ccacagatgg gatagtaaca aagttaactt 541 aatttggcaa atagaaaaag gaaaaactca gcaatttcat attcactgtt gcttaggtta 601 ctttgataag aatgaagatc ctaaggatgt tcaaaaatcc ttaggttggt ttatgaaaag 661 actaaataaa gacctagcag ttatctatag taaccatcat tgtgacatac aagatattaa 721 ggatcctgaa gatagagcta agaacctaaa agtgtggatt gaagatggac ctactaagcc 781 ttacaaatat tttaacaaac aaaccaaaca agactacaat aaaccagttc acttgagaga 841 ctatacattc atatacctgt ttaacaaaga taagataaat acagatagta tggatggtta 901 ctttgctgct ggtaacggtg gcattgttga caacctaact aacaaagaac gaaaaacttt 961 aagaaaaatg tacttagatg agcagagttc agatataatg gatgctaata tagactggga 1021 agatggccaa gacgcgccaa aagtaactga ccaaactgac tcagcaacca caaaaacagg 1081 aactagtttg atttggaaat catgtgctac taaagtaacc tcaaaaaaag aagttgctaa 1141 tccagttcag caaccttcta aaaaactgta ctcagctcaa agtactttag atgcattgtt 1201 taacgttggt tgctttactc cagaagatat gattataaag caaagtgaca aataccttga 1261 actatcttta gaaccaaacg ggcctcaaaa aattaacact ttacttcaca tgaaccaagt 1321 aaagacatca accatgatta ctgcttttga ttgtattata aaatttaatg aagaggaaga 1381 tgacaaacct ttgctagcaa ctataaaaga catgggactt aatgaacaat accttaagaa 1441 ggtactatgt accatcctaa ccaagcaagg tggaaagaga ggttgtattt ggttctatgg 1501 accggggggc actggaaaaa ccttgctagc atctttaata tgtaaagcaa cagtaaacta 1561 tggtatggtt actacaagca atccaaactt tccatggact gactgtggca atagaaacat 1621 catttgggct gaagagtgtg gtaactttgg taactgggtt gaagacttta aagccattac 1681 tggaggtggt gatgtaaaag tagacaccaa gaacaagcaa cctcaatcta ttaaaggctg 1741 tgtgattgta acaagcaaca ccaacataac caaagtaact gttggatgtg tggaaacaaa 1801 cgctcacgca gagccactta aacagaggat gattaagata cgttgcatga aaaccatcaa 1861 ccctaaaact aaaataacac caggcatgtt aaaaagatgg ctaaatacct gggatagaca 1921 accaattcaa ctaagccatg agatgcctga actgtactta ggtaagtgcc gttggtaagt 1981 aacacatttt aaatgccaac tttaaaccaa catcaattta tgaggttact ttactttaca 2041 gagactactg gaccaaactc gagtgccaca actgccacga agaatactgg caactcacaa 2101 cctactactg caaagagtgc agaaagtgtg aacacggaaa actgcgacac accaaaaagg 2161 agtgcgagca gtgtgcctgc aaagcagcac aagagacctc ggcatgagta aaagtaaata 2221 acctacttaa agtaacctaa caccataaca ctttactttc cttgtactta tgttacttta 2281 ctttagttcc tcagcactat cctgggaaaa agagaagtgc tccaagacac gtgtttattc 2341 agcaagcaaa aaagaagaag caaactaacc ctgcggtcta ccacggagag gacaccatag 2401 aggaaatgga ttctactgaa gctgaacaaa tggacactga gcaagcaact aaccaaactg 2461 ctgaagctgg tggtgggggg ggtgggggtg gtgggggtgg tggtggtggt ggtggggttg 2521 gtaacagcac tggcggcttt aataacacaa cagaattcaa agtaataaac aatgaagtgt 2581 atattacttg tcacgctact agaatggtac acattaacca agctgacaca gacgaatact 2641 tgatatttaa tgctggtaga actactgata ccaaaacaca tcagcaaaaa ctaaacttag 2701 aattttttgt atatgatgat tttcaccaac aagtaatgac accttggtat atagtagata 2761 gcaacgcttg gggtgtatgg atgagtccta aagactttca acaaatgaaa acactgtgta 2821 gtgaaattag tttggttact ttggaacaag aaatagacaa tgtaaccata aaaactgtaa 2881 cagaaaccaa ccaaggtaac gcatctacca agcaattcaa caatgactta actgcgtcgt 2941 tacaggttgc tttagatact aacaacatac tgccatatac tccagctgcg ccgttggggg 3001 aaacactggg ctttgttcct tggagagcaa ccaaaccaac ccaatatagg tattatcatc 3061 catgttacat ttacaacaga tatcctaaca ttcaaaaagt tgcaacagaa acactaacct 3121 gggatgcagt acaagatgat taccttagtg tggatgaaca gtactttaac tttattacta 3181 tagagaacaa catacctatt aacattctca gaacgggaga taactttcat acaggcttgt 3241 atgagtttaa cagtaaacca tgtaaactaa ccttaagcta tcaaagtaca cgttgcttgg 3301 ggctacctcc tctctgcaaa ccaaagacag atacaacaca caaagtaacc tcaaaagaaa 3361 acggagctga cctaatttac atacaaggac aagataatac cagactaggt cacttttggg 3421 gtgaggaaag aggtaagaaa aacgcagaga tgaacagaat tagaccttac aacataggtt 3481 accaatatcc tgaatggata ataccagcag ggttacaggg tagttacttt gctggaggac 3541 caagacagtg gagtgacaca accaaaggtg caggtacaca cagtcaacac ttacaacaga 3601 actttagtac taggtacatc tatgacagaa accacggtgg agacaacgag gtagacctat 3661 tagatggaat acccattcat gaaagaagta actactactc agacaatgag atagagcaac 3721 atacagcaaa gcaaccaaag ttacgtacac cacccattca ccactcaaaa atagactcgt 3781 gggaagaaga aggttggcct gctgcttcag gcacacactt tgaagatgag gttatatacc 3841 tagactactt taactttagt ggtgaacagg agctaaactt tccacatgaa gtattagatg 3901 atgctgctca gatgaaaaag ctacttaact cataccaacc aacagttgct caagacaacg 3961 ttggtcctgt atacccgtgg ggacagatat gggacaagaa acctcatatg gatcacaaac 4021 ctagcatgaa caacaacgct ccatttgtat gtaaaaacaa ccctccaggt caactctttg 4081 ttaaactaac agaaaacctc actgatacat ttaactatga tgaaaatcca gacagaataa 4141 aaacctatgg ttactttact tggagaggca agcttgtact aaaaggcaaa ctaagccaag 4201 taacatgctg gaatcctgtt aagagagaac tcataggaga acctggtgta tttactaaag 4261 acaagtatca caaacagata ccaaacaaca aaggtaactt tgaaataggg ttacaatatg 4321 gaagaagtac tatcaaatat atctactaaa gtaacctgtg tactatgtta ctatgttact 4381 atgataatat ctcaataaaa gttacatgaa tagtgaacaa cctaaatact gtgtacttcc 4441 ttattttacc agaaagtggc ggattaaaat aaacctacat tctatactat ctatatacta 4501 ctaactaacc tataggttac tttgctttga tatactgatg taggaataca ggatactaac 4561 atttatatat atactaacat ctatactact aacctaacta tggcctaatg tatgcagtgt 4621 cggcgtcgcc gacaactaca ttatattatt aggcatagtt aggttagtag tatagatgtt 4681 agtatatata taaatgttag tatcctgtgt tcctacttca gtatataaag aaagtttcct 4741 ataggtgggt ttgcggtcta tctagagttg tggtccgtat tggtttctgt aaaggacctg 4801 a // LOCUS MDPUPS 3454 bp ss-RNA VRL 25-MAY-1990 DEFINITION Aleutian mink disease parvovirus (ADV-Utah 1 strain) RNA, partial sequence. ACCESSION M32981 KEYWORDS . SOURCE Aleutian mink disease parvovirus (strain ADV-Utah 1) RNA. ORGANISM Aleutian mink disease parvovirus Viridae; ss-DNA nonenveloped viruses; Parvoviridae; Parvovirus. REFERENCE 1 (sites) AUTHORS Bloom,M.E., Alexandersen,S., Perryman,S., Lechner,D. and Wolfinbarger,J.B. TITLE Nucleotide sequence and genomic organization of Aleutian mink disease parvovirus (ADV): Sequence comparisons between a nonpathogenic and pathogenic strain of ADV JOURNAL J. Virol. 62, 2903-2915 (1988) STANDARD full staff_entry REFERENCE 2 (bases 1 to 3454, for [1]) AUTHORS Bloom,M.E., Alexandersen,S., Perryman,S., Lechner,D. and Wolfinbarger,J.B. JOURNAL Unpublished (1990) Rocky Mountain Labs, Hamilton, MT 59840 STANDARD full staff_entry COMMENT Draft entry and computer readable sequence for [1] kindly submitted by M.E.Bloom 18-MAR-1990. Sequence reported below is (+) strand. BASE COUNT 1268 a 673 c 698 g 815 t ORIGIN 1 ggatcctgaa gatagagcta agaacctaaa agtgtgggtt gaagatggac ctactaagcc 61 ttacaaatat tttaacaaac aaaccaacaa gactacaaca aaccagttca cttgagagac 121 tatacattca tatacctgtt taacaaagat aagataaata cagatagtat ggatggttac 181 tttgctgctg gtaacggtgg cattgttgac aacctaacta acaaagaacg aaaaacttta 241 agaaaaatgt acttagatga gcagagttca gatataatgg atgctaatat agactgggaa 301 gatggccaag acgcgccaaa agtaactgac caaactgact cagcaaccac aaaaacagga 361 actagtttga tttggaaatc atgtgctact aaagtaacct caaaaaaaga agttgctaat 421 ccagttcagc aaccttctaa aaaactgtac tcagctcaaa atactttaga tgcattgttt 481 aacgttggtt gctttactcc agaagatatg attataaagc aaagtgacaa ataccttgaa 541 ctatctttag aaccaaacgg gcctcaaaaa attaacactt tacttcacat gaaccaagta 601 aagacatcaa ccatgatgac tgcttttgat tgtattataa aatttaatga agaggaagat 661 gacaaacctt tgctagcaac tataaaagac atgggactta atgaacaata ccttaagaag 721 gtactatgta ccatcctaac caagcaaggt ggaaagagag gttgtatttg gttctatgga 781 ccggggggca ctggaaaaac cttgctagca tctttaatat gtaaagcaac agtaaactat 841 ggtatggtta ctacaagcaa tccaaacttt ccatggactg actgtggcaa tagaaacatc 901 atttgggctg aagagtgtgg taaccttggt aactgggttg aagactttaa agccattact 961 ggaggtggtg atgtaaaagt agataccaag aacaagcaac ctcaatctat taaaggctgt 1021 gtgattgtaa caagcaacac caacataacc aaagtaactg ttggatgtgt ggaaacaaac 1081 gctcacgcag agccacttaa acagaggatg attaagatac gttgcatgaa aaccatcaac 1141 cctaaaacta aaataacacc aggcatgtta aaaagatggc taaatacctg ggatagacaa 1201 ccaattcaac taagccatga gatgcctgaa ctgtacttag gtaagtgccg ttggtaagta 1261 acacatttta aatgccaact ttaaaccaac atcaatttat gaggttactt tactttacag 1321 agactactgg accaaactcg agtgccacaa ctgccacgaa gaatactggc aactcacaac 1381 ctactactgc aaagagtgca gaaagtgtga acacggaaaa ctgcgacaca ccaaaaaggg 1441 gtgcgagcag tgtgcctccg aagcagcaca agagacctcg gcatgagtag aagtaagtaa 1501 cctacttaaa gtaacctaac accatgacac tttactttac ttgtacttat gttactttac 1561 tttagttcct cagcactatc ctgggaaaaa gagaagtgct ccaagacacg tatttattca 1621 gcaagcaaaa aagaagaagc aaactaaccc tgcggtgtac cacggagaag acacaataga 1681 ggaaatggat tctgctgaac ctgaacagat ggacactgag caagcaacta accaaactgc 1741 tgaagctggg ggtggagggg gtgggagtgg gggtggtggt ggtgggggtg gtggggttgg 1801 taacagcact ggcggcttta ataacacaac agaattcaaa gtaataaaca atgaagtgta 1861 tattacttgt cacgctacta gaatggtgca catcaaccaa gctgacacag atgaatactt 1921 gatatttaat gctgatagaa ctactgatac caaaacagct caaaaaaaac taaacttaga 1981 attttttgta tatgatgatt ttcaccaaca agtaatgaca ccttggttta tagtagatag 2041 caacgcttgg ggtgtgtgga tgagtcctaa agactttcaa caaatgaaaa cactgtgtag 2101 tgagattagt ttggttactt tggaacaaga gatagacaat gtaaccataa agactgtaac 2161 agaaaccaac caaggtaacg catccaccaa gcaattcaac aatgacttaa ctgcgtcgtt 2221 acaggttgct ttagatacta acaacatact gccatatact ccagctgcgc cgttggggga 2281 aacactgggc tttgttcctt ggagagcaac caaaccaacc caatataggt attatcatcc 2341 atgttacatt tacaacagat atcctaacat tcaaaagctg gggcaggagc aattagaatg 2401 gactggtaca caagatgatt acctgagtgt ggatgagcag tactttaact ttatcactat 2461 agagaacaac atacctatta acattctcag aacgggagat aactttcata caggcttgta 2521 tgagtttaac agtaaaccat gtaaactaac cttaagctat caaagtacac gttgcttggg 2581 gctacctcct ctctgcaaac caaagacaga tacaacacac aaagtaacct caaaagaaaa 2641 cggagctgac ctaatttaca tacaaggaca agataatacc agactaggtc acttttgggg 2701 tgaggaaaga ggtaagaaaa acgcagagat gaacagagtt agaccttaca acataggtta 2761 ccaatatcct gaatggataa taccagcagg gttacagggt agttactttg ctggaggacc 2821 aagacagtgg agtgacacaa ccaaaggtgc aggtacacac agtcaacagt tacaacagaa 2881 ctttagtact aggtacatct atgacagaaa ccacggtgga gacaacgagg tagacctatt 2941 agatggaata cccattcatg aaagaagtaa ctactactca gaccatgaga tagagcaaca 3001 tacagcaaag caaccaaagt tacgtacacc acccattcac cactcaaaaa tagactcgtg 3061 ggaagaagaa ggttggcctg ctgcttcagg cacacacttt gaagatgagg ttatatacct 3121 agactacttt aactttagtg gtgaacaaga attagagttt ccacatgaag tattagatga 3181 tgctgctcaa atgaaaaagc tacttaactc ataccaacca acagttgctc aagacaacgt 3241 tggtcctgta tacccatggg gacagatatg ggacaagaaa cctgatatgg atcacaaacc 3301 tagcatgaac aacaacgctc catttgtatg taaaaacaac cctccaggtc aactctttgt 3361 taaactaaca gaaaacctca ctgatacatt taactatgat gaaaatccag acagaataaa 3421 aacctatggt tactttactt ggagaggcaa gctt // LOCUS MUSAA2DEL 300 bp ds-DNA ROD 25-MAY-1990 DEFINITION Mouse dilute prenatal lethal Aa2 deletion breakpoint fusion fragment. ACCESSION M33468 KEYWORDS deletion mutant. SOURCE Mouse DNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 300) AUTHORS Strobel,M.C., Seperack,P.K., Copeland,N.G. and Jenkins,N.A. TITLE Molecular analysis of two mouse dilute locus deletion mutations: Spontaneous dilute lethal-20J and radiation-induced dilute prenatal lethal Aa2 alleles JOURNAL Mol. Cell. Biol. 10, 501-509 (1990) STANDARD simple staff_entry FEATURES from to/span description recomb 130 133 deletion breakpoint BASE COUNT 82 a 45 c 76 g 97 t ORIGIN 1 agaggctgca cagcgcagac atgttggtag gtaacgtgat agtttagaat tggagtcact 61 gggaatgtga ttatgaaggc ccaagggtac ctgttatctg tagagtaccc agtgtggtgt 121 ggtaagactt ctgcaccttg atagggacgg cttctgagtc agaaaatgtt cttcaaaagt 181 tatgttttac tctctttgct gatatgacta acaatgctgt tgatgattaa ttgataaata 241 tgtggaataa tactgactga tcagtgtaca gattctttgc ttctgagtga ttgccttaaa // LOCUS MUSSL20JA 300 bp ds-DNA ROD 25-MAY-1990 DEFINITION Mouse dilute lethal-20J (d-l20J) deletion breakpoint fusion fragment. ACCESSION M33467 KEYWORDS deletion mutant. SOURCE Mouse (C57BL/6J-d-l120J/d-v-se allotype) DNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 300) AUTHORS Strobel,M.C., Seperack,P.K., Copeland,N.G. and Jenkins,N.A. TITLE Molecular analysis of two mouse dilute locus deletion mutations: Spontaneous dilute lethal-20J and radiation-induced dilute prenatal lethal Aa2 alleles JOURNAL Mol. Cell. Biol. 10, 501-509 (1990) STANDARD simple staff_entry FEATURES from to/span description pept.ps / 42 122 dilute gene, exon 3 179 > 300 dilute gene, exon 5 recomb 129 130 deletion breakpoint BASE COUNT 86 a 51 c 61 g 102 t ORIGIN 1 gtgtcctttt gtgttttgca ttgtgtttct ttacacggaa gatcatctac tatggattac 61 caggagttga atgaggatgg agagctctgg atggtttatg aagggttaaa acaagccaac 121 aggttatatc ttgctcaaag acacaagaaa caaatatcca ttgtacctgt tttttcagta 181 ttttgaggaa ttatatgcag atgaccctaa gaagtatcaa tcctatcgga tttcacttta 241 caaaaggatg attgtatgta aaacacagtg cttttctgtt gtcctctgct acttctagcc // LOCUS PRVVP4 2359 bp ds-RNA VRL 25-MAY-1990 DEFINITION Porcine rotavirus capsid protein VP4 gene, complete cds. ACCESSION M33516 KEYWORDS capsid protein VP4. SOURCE Porcine rotavirus (Gottfried strain; serotype 4) DNA. ORGANISM Porcine rotavirus Viridae; ds-RNA nonenveloped viruses; Reoviridae. REFERENCE 1 (bases 1 to 2359) AUTHORS Gorziglia,M., Nishikawa,K., Hoshino,Y. and Taniguchi,K. TITLE Similarity of the outer capsid protein VP4 of the Gottfried strain of porcine rotavirus to that of asymptomatic human rotavirus strains JOURNAL J. Virol. 64, 414-418 (1990) STANDARD simple staff_entry FEATURES from to/span description pept 10 2337 capsid protein VP4 BASE COUNT 812 a 406 c 442 g 699 t ORIGIN 1 ggctataaaa tggcttcgct catttataga cagctgctca ctaattcata cacagttgaa 61 ttatctgatg aaattaaaac aattggatca gaaaagagtc agaatgtaac aattaatccg 121 ggtccgtttg ctcaaacgac ctatgcacca gtcacttgga gacatggaga agtaaacgat 181 tctacaacgg tagaaccagt acttgacggt ccatatcagc caacgagttt caaaccgcca 241 aatgactatt ggatattgtt aaacccgatt aataagggag ttgtattcaa gggtactaac 301 aggactgatg tttgggttgc aatactactc attgaacaac gcgtacctag tcaagatcga 361 caatatacat tatttggaga agtgaagcaa atcactgtag agaatagttc cgacaaatgg 421 aaattctttg aaatgtttag aaacaacgct aacattgatt ttcagcttca acgtccttta 481 acatcagata caaaattagc tggctttcta acacatggtg gacgtgtttg gacatttaat 541 ggtgaaacgc cgcatgctac aactgattac tcaacaactt caaacttacc tgatgtagaa 601 gtagtaatac atactgaatt ctacataata ccaagatctc aagaatctaa atgcaatgag 661 tatattaata ctgggttacc accaatgcaa aacacaagga atgtggttcc agtagcatta 721 tcatctagat ctataactta tcaacgtgca caagttaacg aagatatcat tatatcaaag 781 acttcattgt ggaaagaaat gcaatacaat agagacatta caataagatt taaattcggt 841 aatagcatag taaagcttgg tggattaggt tataaatggt cagaagtctc attcaaagca 901 gcaaattatc agtataatta tttaagggat ggagaacagg tgacagccca cactacttgt 961 tcagttaacg gagtaaataa ttttagttat aatggaggat cactgccaac tgattttagc 1021 gtatctagat atgaattaat aaaagagaat tcatatgttt atatcgatta ctgggatgac 1081 tcacaagcat tcaaaaacat ggtatatgtt agatcacttg cagcaaattt aaattcagtg 1141 aaatgtagtg gaggtaacta taactttaaa attccagttg gtgcatggcc agtaatgagt 1201 ggtggtgcag tatctctaca tttcgcggga gttacattat ctactcaatt tactaatttc 1261 gtatcactca attcactaag attcagattc agtttaactg ttgaggaacc atccttttca 1321 attttgcgta cacgtgtatc aggattgtac ggattaccag cagctaatcc gaataatgga 1381 aatgaatact atgaaatagc gggaagattt tctctcattt tattggtacc atctaatgac 1441 gactatcaaa ctccaattat gaattcagtc accgtacgac aagatttaga acgccaattg 1501 ggcgatttga gagaagaatt taattcactg tcacaagaaa tagctatgac tcaattaata 1561 gacttggctt tattgccgtt agatatgttt tccatgttct caggtattaa aagtacaatt 1621 gatgtggcta aatcaatggc cacaaatgtt atgaaaaagt ttaaaaagtc aggactagct 1681 acatctatat cagaactgac tggatcattg ccgagtgctg catcgtcagt ttcaaggagc 1741 tcttctatta gatctaacat ttcatctatt tcagtgtgga cggatgtttc tgaacaaata 1801 gcagatgcat caaattctgt tagaagtatt tcaacgcaga cgtcagctat tagtaaaaga 1861 cttagattac gtgagatcac tactcagact gaagggatga attttgacga tatttccgct 1921 gctgttctca aaacgcccct agataagtca acacatataa gccctgatac gctgccagat 1981 ataataactg aatcgtctga aaaatttata ccaaaacgcg cttatagagt tttaaagaat 2041 gatgaagtta tggaggctga tgtagatggg aaatttttcg catacagagt tgatactttc 2101 gaagaagtgc catttgatgt ggataaattt gttaatctgg ccactgcttc ccctgtgata 2161 tcagctataa ttgattttaa aacactgaaa aacctgaatg acaactatgg tataacacgc 2221 tctcaagcgc tagatttgat tagatctgat cccagggttc tacgtgattt tatcaatcaa 2281 aacaatccaa ttattaaaaa tagaatagaa caattaatac tgcaatgtag attgtgagag 2341 ctctatagag gatgtgacc // LOCUS RATSTAA 1000 bp ss-mRNA ROD 25-MAY-1990 DEFINITION Rat hydroxysteroid sulfotransferase a (STa) mRNA, complete cds. ACCESSION M33329 KEYWORDS hydroxysteroid sulfotransferase a. SOURCE Rat (strain Sprague-Dawley) female liver, cDNA to mRNA. ORGANISM Rattus norvegicus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 1000) AUTHORS Ogura,K., Kajita,J., Narihata,H., Watabe,T., Ozawa,S., Nagata,K., Yamazoe,Y. and Kato,R. TITLE cDNA cloning of the hydroxysteroid sulfotransferase STa sharing a strong homology in amino acid sequence with the senescence marker protein SMP-2 in rat livers JOURNAL Biochem. Biophys. Res. Commun. 166, 1494-1500 (1990) STANDARD simple staff_entry FEATURES from to/span description pept 38 892 hydroxysteroid sulfotransferase a (STa) mRNA < 1 1000 STa mRNA signal 977 982 poly-A signal BASE COUNT 299 a 187 c 211 g 303 t ORIGIN 1 ctggaatcct aacaggacct acacagagct atttataatg ccagactata cttggtttga 61 aggaatacct tttcctgcct ttgggattcc aaaagaaact ttgcaaaatg tttgtaataa 121 gtttgtggtg aaagaagaag atttgatctt attgacttat cccaagtcag gaacaaactg 181 gctgattgaa attgtctgct tgattcagac caagggagat cccaagtgga tccaatctgt 241 gaccatctgg gatcgctcac cctggataga gactgattta ggatatgata tgttaatcaa 301 aaagaaagga ccacgactca taacctccca tcttcccatg catcttttct ccaagtctct 361 cttcagttcc aaggccaagg tgatctatct catcagaaat cccagagatg ttcttgtttc 421 tggttattat ttctggggta agacaactct tgcgaagaag ccagactcac tgggaacgta 481 tgttgaatgg ttcctcaaag gatatgttcc gtatggatca tggtttgagc acatccgtgc 541 ctggctgtct atgcgagaat tagacaactt cttgttactg tactatgaag acatgaaaaa 601 ggatacaatg ggaaccataa agaagatatg tgacttccta gggaaaaaat tagagccaga 661 tgagctggat ttggtcctca agtacagttc cttccaagtc atgaaagaaa acaacatgtc 721 caattataat ctcatggaga aggaactgat tcttcctggt tttactttca tgagaaacgg 781 cactactggg gactggaaga atcacttcac tgtagcccaa gctgaagcct ttgataaagt 841 gtttcaggag aaaatggccg gtttccctcc agggatgttc ccatgggatt aaaatttcaa 901 aagttttaaa tattttatga acattgattt ttatgtttct gttgttctat gtctgaataa 961 gtgaatgtgg tcattgaata aattctattc tggcattgtg // LOCUS SMFPOLENV 3534 bp ss-RNA VRL 25-MAY-1990 DEFINITION Simian foamy virus type 1 polymerase (pol) gene, 3' end; and envelope (env) gene, complete cds. ACCESSION M33561 KEYWORDS envelope protein; polymerase. SOURCE Simian foamy virus type 1, cDNA to viral RNA. ORGANISM Simian foamy virus Viridae; ss-RNA enveloped viruses; Positive strand RNA virus; Spumavirinae. REFERENCE 1 (bases 1 to 3534) AUTHORS Mergia,A., Shaw,K.E.S., Lackner,J.E. and Luciw,P.A. TITLE Relationship of the env genes and the endonuclease domain of the pol genes of simian foamy virus type 1 and human foamy virus JOURNAL J. Virol. 64, 406-410 (1990) STANDARD simple staff_entry FEATURES from to/span description pept < 1 582 polymerase (AA at 1) pept 524 3481 envelope protein BASE COUNT 1162 a 645 c 670 g 1046 t 11 others ORIGIN 1 gaattcagta ctccttacca cccccaaagt agtggtaaag tggaaaggaa aaatagtgac 61 attaaacgac ttttaactaa actgctaatt gggagacctg ctaagtggta tgatctacta 121 cctgttgtac aattggcctt aaataattct tatagtccct cttctaaata tactcctcat 181 caactcttgt ttggtgtaga ttccaacaca ccgtttgcaa attctgatac acttgactta 241 tccagagaag aggaactgtc tcttttacag gaaattagat cttctctaca ccagccaacc 301 tcccctcctg cctcctctcg ttcctggtct ccttctgttg gccaactagt ccaggagagg 361 gtagctcgcc ctgcttcact tcgaccacgc tggcataagc ctacagctat tttggaggtc 421 gtgaatcctc ggacagtgat aattttggac catcttggca acagacgtac tgtaagtgtt 481 gacaacctta agttaacagc ttatcaggat aatggcacct ccaatgactc tggaacaatg 541 gctcttatgg aagaagatga gtcaagcaca tcaagcactt gaaaatgtaa ccaccttgac 601 tgaggaacag aagcaacaag ttataataga cattcagcat gaagatgttg ttcctactag 661 gatggacaaa ttgaaatatc tggcctattc atgctgcgct actagcacac gtgtattgtg 721 ctggatagtg ttagtttgcg tcttgctatt agttgtattt atatcctgct ttgtgacaat 781 gtccaggata caatggaata aggatattgc tgtttttggt ccagtcattg actggaatgt 841 tagccaacaa gctgtgattc aacaaataag agctaaaaga ttagcaagat caattagggt 901 ggaacatgct actgagacat atgtagaggt caatatgacc agtatacctc aaggggtgtt 961 atatgtgcct catccagaac caataattct caaggagagg gttcttggtt tatctcaggt 1021 cataatgata aactctgaaa atattgctaa tactgctaac cttactcaag aaactaaggt 1081 actgttagca gacatgatta atgaagagat gaatgattta gctaatcaaa tgatagattt 1141 tgaaatccca ttaggagatc ccagagatca aaaacaatac cagcatcaaa aatgttttca 1201 agaatttgca cattgttatt tagtaaaata taaaactact aaaggatggc ctagttctac 1261 tgttatagca gatcaatgcc ctttgcctgg taaccatcct acagtacaat atgcacatca 1321 aaatatatgg gattattatg tcccctttga acaaattcgg ccagaaggat ggaactcaaa 1381 aagttattat gaagatgcta gaataggagg gttttatata ccaaaatggt tacgaaataa 1441 ttcctatacc catgtcttat tttgttctga tcaaatttat ggaaaatggt ataatattga 1501 tctcacagcc caggagaggg aaaatttatt agtccaaaaa ttaattaatt tagctaaagg 1561 aaattcatca caattaaagg atagagctat gccagctgaa tgggataaac aaggaaaagc 1621 tgatctattt agacaaatta atactttaga tgtttgtaat agaccagaaa tggtattttt 1681 gttaaattcc tcatattatg aattttccct atgggaagga gattgtggtt ttaccagaca 1741 gaatgttaca caggctaatt ccttatgtaa agatttctat aataactcaa aatggcaaaa 1801 attacatcca tattcgtgta gattttggag atataaacaa gagaaagaag aaactaaatg 1861 tagtaatggt gaaaagaaaa aatgtcttta ttacccacaa tgggatactc ctgaagcttt 1921 atatgacttt gggttcctag catatttaaa ttcttttcct tctccaatct gtataaaaaa 1981 tcagactata agggaacctg agtatgaaat ctcttcttta tacctagaat gcatgaatgc 2041 ttcagacaga catggtatag atagtgcttt attagctttg aagacatttt taaactttac 2101 tggtcagtct gtaaacgaaa tgccattagc tagagccttt gtaggcctta ctgaccctaa 2161 atttccacca acatatccca acattacaag ggaatcttct ggttgtaata ataacaaaag 2221 aaaaaggaga agtgttaata attatgaaag acttagatct atgggatatg ctttaactgg 2281 agctgttcaa actttatctc aaatatctga tattaatgat gagaggctgc aacacggagt 2341 atatttactn nnnnnnnnnn tggtaaccct gatggaagct gcccttcatg atgtttcgat 2401 tatggaagga atgttagcaa ttcaacatgt gcatactcat ctcaatcatc tcaagaccat 2461 acttttgatg agaaagattg attggacatt catcagaagt gactggattc aacagcaatt 2521 acagaagaca gatgatgaaa tgaaattgat acgaagaact gcacgaagtc tagtctacta 2581 tgtcacacaa acctccagtt ctcctacagc tacttcctgg gagattggaa tatattatga 2641 aatagtaatt cctaaacata tatatttaaa taattggcaa gtaatcaatg taggtcattt 2701 attggagtca gctggtcatc tgactcatgt aaaggttaag catccttatg aaataattaa 2761 taaggaatgt agtgacactc aatatttaca tcttgaggaa tgcattagag aggattatgt 2821 gatttgtgac atagtacaaa tagttcaacc atgtggaaat gcaacagaat tgagtgattg 2881 tccagtagca gcattaaagg tgaagactcc atatattcaa gtgtctcccc tgaagaatgg 2941 aagttattta gttttatcta gtactaagga ttgttctata cctgcatatg tacctagtgt 3001 ggtcacagtc aatgaaacag ttaagtgctt tggagtagag tttcacaaac cactttatgc 3061 tgaaacaaaa accagctatg aaccacaagt tccgcatttg aagcttcgtt taccccactt 3121 gactgggatt attgccagct tgcaatcact ggaaatagaa gttacttcta cacaagagaa 3181 tataaaagac cagatcgaaa gggccaaagc acagcttctc cggctggaca ttcacgaagg 3241 agactttcct gactggctga aacaagtcgc ctctgcaacc agggacgttt ggcctgctgc 3301 agcttccttt atacaaggag taggtaactt cttatctaat actgcccagg ggatattcgg 3361 ctcagcggta agcctcctat cctatgcaaa acctattttg attggaatag gagttatact 3421 gcttattgcc cttcttttta agataatatc atggcttcct gggaagctca agaagaattg 3481 agagaacttc tacatcatct accagaggac gatccaccag cagatctaac tcat // LOCUS ENHCYPA 495 bp ds-DNA INV 25-MAY-1990 DEFINITION Entamoeba histolytica cysteine protease gene, partial cds. ACCESSION M27307 KEYWORDS cysteine protease. SOURCE Entamoeba histolytica (strain HM-1) trophozoites DNA, clone E452. ORGANISM Entamoeba histolytica Eukaryota; Animalia; Protozoa; Sarcomastigophora; Sarcodina; Rhizopoda; Lobosa; Gymnamoeba; Schizopyrenida; Tubulina; Entamoebidae. REFERENCE 1 (sites; for [2]) AUTHORS Eakin,A.E., Bouvier,J., Sakanari,J.A., Craik,C.S. and McKerrow,J.H. TITLE Amplification and sequencing of genomic DNA fragments encoding cysteine proteases from protozoan parasites JOURNAL Mol. Biochem. Parasitol. 39, 1-8 (1990) STANDARD full staff_review REFERENCE 2 (bases 1 to 495) AUTHORS Eakin,A.E. JOURNAL Unpublished (1989) Dept. Pharm. Chem. UCSF, San Francisco, CA 94143 STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [2] kindly submitted by A.E.Eakin, 24-AUG-1989. FEATURES from to/span description pept < 1 > 495 cysteine protease (AA at 1) BASE COUNT 161 a 75 c 107 g 152 t ORIGIN 1 caagggcaat gcgggtcgtg ctggactttc tgtacaactg cagttcttga aggaagagtt 61 aacaaagatc ttggaaaact ttactcattc tctgaacaac aattagttga ttgtgatgct 121 tctgataatg gatgtgaacg aggacatcca tctaactcac ttaaattcat ccaagaaaat 181 aatggattag gattagaaag cgattatcca tataaagctg ttgctggtac ttgcaagaaa 241 gttaaaaacg ttgctactgt tactggttct agaagagtta ctgatggaag tgaaactgga 301 cttcaaacta ttattgctga aaacggacct gttgctgttg gtatggatgc tagcagacca 361 tcattccaat tatataagaa aggaactatc tattctgata ctaaatgtag atcaagaatg 421 atgaatcact gtgttactgc tgttggttat ggttcaaata gtaatggtaa atattggatt 481 gtcaagaatt cctgg // LOCUS TRBCYPA 495 bp ds-DNA INV 25-MAY-1990 DEFINITION Trypanosoma brucei cysteine protease gene, partial cds. ACCESSION M27306 KEYWORDS cysteine protease. SOURCE Trypanosoma brucei gambiense trypomastigote DNA, clone B502. ORGANISM Trypanosoma brucei Eukaryota; Animalia; Protozoa; Sarcomastigophora; Mastigophora; Kinetoplastida; Trypanosomatina; Trypanosomatidae. REFERENCE 1 (sites; for [2]) AUTHORS Eakin,A.E., Bouvier,J., Sakanari,J.A., Craik,C.S. and McKerrow,J.H. TITLE Amplification and sequencing of genomic DNA fragments encoding cysteine proteases from protozoan parasites JOURNAL Mol. Biochem. Parasitol. 39, 1-8 (1990) STANDARD full staff_review REFERENCE 2 (bases 1 to 495) AUTHORS Eakin,A.E. JOURNAL Unpublished (1989) Dept. Pharm. Chem. UCSF, San Francisco, CA 94143 STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [2] kindly submitted by A.E.Eakin, 24-AUG-1989. FEATURES from to/span description pept < 1 > 495 cysteine protease (AA at 1) BASE COUNT 116 a 115 c 140 g 124 t ORIGIN 1 caagggcagt gtgggtcgtg ctgggccttt tcaactatcg gcaacatcga agggcagtgg 61 caggtggcag gaaatcctct cgtatccctc tcggagcaga ttctagtgta ctgtgatcca 121 ttgattggct gtggtggtgg gctgatggac aatgccttca actggatagt aaattcaaac 181 ggtggaaacg tattcacgga ggcgagctat ccctatgttt ctgggaatgg tgagcagcca 241 cagtgccaga tgaatggtca cgagatcggt gctgcgataa cagaccatgt tgacttaccg 301 caggatgagg acgccatcgc cgcgtatttg gcagaaaacc gtccccttgc tattgccgtt 361 gaagcaccgc agttttatgg acataacggt gggattctga cttcatgcac ctccgagcaa 421 ctggatcatg gtgtgctcct cgttggttac aatgataata gcaatccacc ctattggatc 481 gtcaagaact cctgg // LOCUS TRCCYPA 495 bp ds-DNA INV 25-MAY-1990 DEFINITION Trypanosoma cruzi cysteine protease gene, partial cds. ACCESSION M27305 KEYWORDS cysteine protease. SOURCE Trypanosoma cruzi (strain RA; Argentine) epimastigote DNA, clone R501. ORGANISM Trypanosoma cruzi Eukaryota; Animalia; Protozoa; Sarcomastigophora; Mastigophora; Kinetoplastida; Trypanosomatina; Trypanosomatidae. REFERENCE 1 (sites; for [2]) AUTHORS Eakin,A.E., Bouvier,J., Sakanari,J.A., Craik,C.S. and McKerrow,J.H. TITLE Amplification and sequencing of genomic DNA fragments encoding cysteine proteases from protozoan parasites JOURNAL Mol. Biochem. Parasitol. 39, 1-8 (1990) STANDARD full staff_review REFERENCE 2 (bases 1 to 495) AUTHORS Eakin,A.E. JOURNAL Unpublished (1989) Dept. Pharm. Chem. UCSF, San Francisco, CA 94143 STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [2] kindly submitted by A.E.Eakin, 24-AUG-1989. FEATURES from to/span description pept < 1 > 495 cysteine protease (AA at 1) BASE COUNT 94 a 143 c 158 g 100 t ORIGIN 1 cagggccaat gcggctcgtg ctgggccttc tccgccattg gcaacgttag tggccagtgg 61 tttcttgccg gtcacccgct gactaacctg tcggagcaga tgctcgtgtc gtgcgacaaa 121 acggactctg gctgcagtgg tggcctgatg aacaacgcct ttgagtggat tgtgcaggag 181 aataacggcg gcgtgtacac ggaggacagc tacccttatg cgtcgggcga ggggatatcg 241 ccgccgtgca cgacgtcagg ccacacggtg ggtgccacga ttaccggtca cgttgaatta 301 cctcaggacg aggcccaaat agccgcatgg cttgccgtca atggcccggt tgccgttgcg 361 cacgccagca gctggatgac ctacacgggc ggcgttatga cgagctgcgt ctccgagcag 421 ctggatcacg gccttcttct cgtcggctac aatgacagcg ccgcagtgcc gtactggatc 481 atcaagaact cgtgg // LOCUS STRDEXB 1800 bp ds-DNA BCT 25-MAY-1990 DEFINITION S.mutans dextran glucosidase (dexB) gene, complete cds. ACCESSION M30944 KEYWORDS dextran glucosidase. SOURCE S.mutans (strain INGBRITT) DNA. ORGANISM Streptococcus mutans Prokaryota; Bacteria; Firmicutes; Gram-positive cocci; Streptococcaceae. REFERENCE 1 (bases 1 to 1800) AUTHORS Russell,R.R.B. and Ferretti,J.J. TITLE Nucleotide sequence of the dextran glucosidase (dexB) gene from Streptococcus mutans JOURNAL Unpublished (1990) See COMMENT for author address STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by J.J.Ferretti, 20-DEC-1989. U of Oklahoma, Box 26901, Oklahoma City, OK 73190 FEATURES from to/span description pept 91 1701 dextran glucosidase (dexB) (EC 3.2.1.11) binding 78 83 ribosome binding site BASE COUNT 585 a 290 c 376 g 549 t ORIGIN 1 aaaacggctt aaattgtttt agattttctc agtctcttta ttagaactaa agaaactaac 61 atgtcataat aaattaaagg tggtgataag atgcaaaaac attggtggca caaggcaact 121 gtttatcaaa tttatccaaa atcttttatg gatacaaatg gtgatggaat tggtgatctc 181 aaaggtatta cgagtaaatt ggattatttg caaaagttag gggttatggc tatttggcta 241 tctccagttt atgatagccc catggatgac aatggctatg acattgcgaa ctatgaagca 301 attgcggata tttttggcaa tatggctgat atggataatt tgctgacgca ggcaaaaatg 361 cgcgacataa aaatcattat ggatctagtg gttaatcata cctcagatga acatacttgg 421 tttattgaag cacgtgagca tccagacagt tctgaacgcg attattatat ttggtgtgac 481 cagccaaatg atttggaatc tattttcggt ggttctgctt ggcagtatga tgataagtcc 541 gatcaatatt atttgcattt ttttagtaag aagcagccag atctaaactg ggaaaacgca 601 aacttacgtc agaagattta tgatatgatg aatttctgga ttgataaagg tattggcggc 661 tttcggatgg acgtcattga tatgattggg aaaattcctg ctcagcatat tgtcagtaac 721 ggaccaaaat tgcatgctta tcttaaggag atgaatgccg ctagttttgg tcaacatgat 781 ctgctgactg tgggggaaac ttggggagca acgcctgaga ttgcgaagca atattcaaat 841 ccagtcaatc acgaactctc tatgattttt caatttgaac atattggtct tcagcataaa 901 ccagaagctc ctaaatggga ttatgtgaag gaacttaatg ttcctgcttt aaaaacaatc 961 tttaataaat ggcagactga gttggaatta ggacaggggt ggaattcgtt attctggaat 1021 aaccatgacc tgcctcgtgt tttatcaatc tggggaaata cgggcaaata tcgtgagaag 1081 tctgctaaag cactggctat tcttcttcac cttatgcgtg ggacacctta tatttatcaa 1141 ggtgaagaga ttgggatgac caattatcct tttaaagatt taaatgaact tgatgatatt 1201 gaatcactta attatgctaa ggaagctttt acaaatggta agtctatgga aactatcatg 1261 gacagtattc gtatgattgg ccgtgataat gccagaacac ctatgcaatg ggatgcttct 1321 caaaatgccg gattttcaac agcggataaa acatggctgc cagttaatcc aaactataaa 1381 gacatcaatg ttcaagcagc tctgaaaaat tccaattcta tcttttacac ctatcaacaa 1441 ctcattcagc ttcgaaaaga aaatgattgg ctagtagatg ccgattttga attgctccct 1501 acagcggaca aagtatttgc ctatttacga aaggtaagag aagaaaggta tcttatagtg 1561 gtcaatgttt cagatcagga agaagttcta gagattgatg ttgacaaaca agaaactctc 1621 attagcaata caaatgaaag cgctgctctt gccaatcaca aactccagcc ttgggatgct 1681 ttttgtatta agataaacta ggatagaata agaaaacaac agacgatacc acttaaaaag 1741 tgacggcctg ttgttttttt aatgtttgta ttaagtaata aagcgattct gattgactat // LOCUS STRGBP 1968 bp ds-DNA BCT 25-MAY-1990 DEFINITION S.mutans glucan-binding protein (gbp) gene, complete cds. ACCESSION M30945 KEYWORDS glucan binding protein. SOURCE S.mutans (strain INGBRITT) DNA. ORGANISM Streptococcus mutans Prokaryota; Bacteria; Firmicutes; Gram-positive cocci; Streptococcaceae. REFERENCE 1 (bases 1 to 1968) AUTHORS Banas,J.A., Russell,R.R.B. and Ferretti,J.J. TITLE Sequence analysis of the gene for the glucan-binding protein of Streptococcus mutans INGBRITT JOURNAL Unpublished (1990) See COMMENT for author address STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by J.J.Ferretti, 20-DEC-1989. U of Oklahoma, Box 26901, Oklahoma City, OK 73190. FEATURES from to/span description pept 124 1815 glucan-binding protein (gbp) signal 12 17 -35 region signal 37 42 -10 region binding 112 116 ribosome binding site BASE COUNT 648 a 341 c 445 g 534 t ORIGIN 1 ccggctataa gttgaaatat tgtaggtatt aaaaactatc tttagtttag tatttacatt 61 aattttaaaa atgttatagt ggaagtgtca tgttgattac tattttttta aggaggtaaa 121 atgatgaaag aaaagacacg ttttaaactg cacaaggtta aaaagcagtg ggtggcgatt 181 gccgtgacta gtctagctct agctgcgata ttgtcaggag ctcacttgac tcaggctgag 241 gaacaatccg gcggtactga cagtaagcca agactgacag cgactgtaca ggaaagctca 301 gaacaaccaa ttacaaaagc tccagcagct gattcatctg tagaaaataa cagtgctaac 361 gctgttaaaa gttctgaaac agcagaggca gctgaagtat ccgatggagg cagagccagc 421 caaactgaag cagtaacaaa ccaaacaaac tctgaagagc accatccagc agaaaaagcg 481 acagccgttt ctggagaagc tcagtcagtg caaaatgctc catcagaaaa tgctgcccag 541 caggaaacgg ctaaaaccga gccagcgact gctgcagaaa ataatgacgc tgctccaacc 601 aatagcttct ttaaaaaaga tggtaaatgg tactacaaaa aggccgatgg acagctggca 661 accggttggc agataattga tggaaagcag ctctatttca accaagatgg tagtcaggtc 721 aaaggagaaa ttcatgtgga gacaggggat caaatcattt atcatcctgt tttcataagt 781 gattcacctt cagttttgga agtcaataag atttattact ttgatcctga tagtggtgaa 841 ctctggaagg atcgttttgt ctattctagt tatgcagatc ccctccatta tgaaaatatt 901 aaacatgaag gctggttcta tcttggagaa gatggaaagg ctgctatcgg ctggagaact 961 attggcggta aaaaatacta ttttgacact aatggtgttc aagtcaaagg aaagctaatt 1021 agtacagatg gcaattataa tctaattagc cagaagtatg gcaagaaatc tttcctagat 1081 cctgacaccg gtgaagcttg gactaatcgt tttgtcaatg caaagtatta tttctacaac 1141 tttgcaggat acgtctctac gacagactgg ttctatatgg gagccgatgg tatcggcgtg 1201 accgattggc aaaagatcga tggtatggat tactatttcg aaccttccag tggtattcag 1261 gttaaaggcg acattgctga gcgtgatggc aaggtctatt atttagatga agacagtgga 1321 caagttgtta agaatcgttt tggcacaaca cctgccgagc gtatcagtac agttgaggct 1381 cgtttcccta aaacttatta ttttggagcg gacggtagcc gcaaagatct aactggttgg 1441 cagattattg atggtaaaac ttattacttt aaggatgatc acagcataaa agcaaagtca 1501 gagtatagtc aaattggtgg ttctgtgcct gatgacggtt ttgcagagat tgatggtgat 1561 ggttactttt ttgatactca aggtcaattc gtaacgaata gatttgtcag aaaatacgac 1621 tacagtaata tttggtatta ttatggaagc gatggcaaac gtgtatcagg ctggcaaact 1681 atcgacggta agcgctacta ctttagccaa gatgaaaaga caaagggccg tcaaattaaa 1741 ggacaaacca tcactatcga tggtaaagaa tatacttttg acaaagacag cggtgaagtt 1801 atcaatagta actagttggt aaatcccatg gcacacaaaa acgagcagat ttcatactct 1861 gttcgttttt tcgccttaaa acttatatat ttataaatcg tcaataaagt gtttacttga 1921 aaacggtaaa tatgccaaga gtttgactgt tatcaattaa tgggaaag // LOCUS STRGTFS 4376 bp ds-DNA BCT 25-MAY-1990 DEFINITION S.downei glucosyltransferase S (gtfs) gene, complete cds. ACCESSION M30943 KEYWORDS glucosyltransferase S. SOURCE S.downei (strain MFe28) DNA. ORGANISM Streptococcus downei Prokaryota; Bacteria; Firmicutes; Gram-positive cocci; Streptococcaceae. REFERENCE 1 (bases 1 to 4376) AUTHORS Gilmore,K.S., Russell,R.R.B. and Ferretti,J.J. TITLE Sequence analysis of a Streptococcus downei gtfs gene which specifies a glucosyltransferase synthesizing soluble glucans JOURNAL Unpublished (1990) See COMMENT for author address STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by J.J.Ferretti, 20-DEC-1989. U of Oklahoma, Box 26901, Oklahoma City, OK 73190 FEATURES from to/span description pept 247 4344 glucosyltransferase S (gtfs) signal 127 132 -35 region signal 150 154 -10 region binding 233 237 ribosome binding site BASE COUNT 1282 a 944 c 1000 g 1150 t ORIGIN 1 tggcctatgg agtagcggtc attctcacca tcttgaacat caagttaatt attgatttgt 61 tccaataaat atgaatctct ttaaagaata aatttaaaga gatttttatg ttgttttaca 121 atctaatgac aaatttagga tttctgagat taaacttaaa taacatgtta aaataaattg 181 ataattaaaa ttatttggtt attaaagttt taattgtcga atttttttat aaggagaatg 241 atattcatgg aaaaaaattt gcgctacaaa ttgcataagg ttaaaaagca atgggtagcc 301 attggtgtga cgaccgtaac gcttagcttt ctagctggag gccaagtcgt tgcagccgac 361 acaaacaata atgacgggac aagtgttcag gtcaacaaaa tggtgccaag tgatcccaag 421 tttgacgctc aagcccaaaa cggtcaactt gctcaagcta tgtttaaggc tgctaatcaa 481 gcagatcaaa ctgctacaag tcaagttagc ccagcaacag atggcagggt tgataaccag 541 gtgacaccag cagctaatca acctgcggct aatgtggcca atcaagatgt ggccaatcca 601 gcgactgatg ccggagcact taatcgccaa tcggccgctg atacgtcaac agatggtaag 661 gctgtacctc agactagtga ccaaccaggt catctggaaa cagttgatgg caaaacttac 721 tatgtggatg ccaatggtca acggttgaaa aactattcaa tggtcattga tggcaagacc 781 tattactttg acggtcagac aggcgaagcg caaaccgact tgcctaagac cggtcaagct 841 aatcaagaca atgtgcccga tagctatcag gccaataatc aagcctatag taatgaggct 901 tctagctttg agacagttga taattacctg acagcggatt cctggtatcg ccctcgcaag 961 attttgaaaa atggccagag ctggcaggct agttcagaag gtgacctgcg tcctattctc 1021 atgacctggt ggccagatgc ggccaccaag gcagcctatg ccaacttctg ggctaaagaa 1081 ggcctgattt caggttctta caggcaaaat tctgccaatc ttgatgcggc tacccagaat 1141 attcaatctg ccattgaaaa gaagattgcc agtgaaggca ataccaactg gttgcgcgac 1201 aagatgtccc aattcgtcaa gtcacaaaac cagtggagta ttgcttctga aaatgagact 1261 gtctatccta atcaggatca tatgcaaggg ggggccctgc tcttttcaaa cagtaaggat 1321 accgagcatg ctaactcaga ctggcgcctg ctcaaccgca atccaacctt ccaaactggt 1381 aaacaaaagt attttacaac caactatgcg ggttatgaat tgctcttagc caatgatgtt 1441 gataattcta acccggtcgt ccaagccgaa cagctcaatc acctccacta ccttatgaac 1501 tggggtgaca ttgttatggg ggataaggat gctaactttg atggtgtccg agttgatgcg 1561 gttgataatg tcaatgccga cctgctacaa attcagaggg attattataa ggccaaatat 1621 ggcaccgatc aaaatgaaaa aaatgccatt gaccacctct ctatcctaga agcctggtct 1681 ggcaacgaca atgactacgt caaggaccaa aacaactttt ctctctctat tgacaatgat 1741 caacgcagtg gtatgttgaa agctttcggt tatgcttctg cctatcgtgg taatttgagt 1801 aatctcgcaa ctgctggctt gaaaaaccgt agtgcaaatc ctgatagtga cccagttcca 1861 aactatgtct ttattcgtgc ccatgactct gaggtacaga cacggattgc taagattatt 1921 cgtgagaaat tgggtaagac caatgctgat ggtttgacca atctgacctt agatgattta 1981 aacaaggcct ttgacatcta caaccaggat atgaatgcca ccgataaggt ttattatcct 2041 aataatttac cgatggccta tgcttggatg ctgcaaaata aggatacggt cactcgggtc 2101 tactatggtg atatgtacac cgataatggc cagtatatgg cgaccaagac cccattttat 2161 aatgctattg agaccctgct taagggccgg attaagtatg ttgctggtgg tcaagcggtc 2221 tcctacaagc aagattggtc tagtggaatt ttgacctcgg ttcgctatgg taagggagct 2281 aattcggcta gtgatgctgg gaacacggaa acccgcaatt ctggtatggc cctcctgatt 2341 aacaatcgac ctaacttcag ggcttatcgg aacttgacct tgaacatggg ggcagcccat 2401 aagagccaag cctaccgtcc tctcttgctg tctaccaagg atggcattgc cacctacctt 2461 aatgatagcg atgtggattc tcgccaatac aagtacaccg atagtcaggg gaacttaagc 2521 tttagtgcct ctgaactgca aagtgttgct aacgcccaag tctcaggtat gattcaggtt 2581 tgggtacccg tcggtgcggc agataaccaa gatgtccgta cttctccaag tacacaagcg 2641 accaaggatg gcaatatcta tcatcaaagt gacgccctag attcccaagt catctatgaa 2701 ggtttctcta atttccaagc cttcgcccaa agtcctgacc aatataccaa tgctgtcatt 2761 gctaaaaatg gcgacctctt taagtcttgg ggcattaccc aatttgaaat ggcaccgcag 2821 tacgtgtcat cagaagatgg aactttcctt gattccgtta ttttaaatgg ttatgccttc 2881 tcagaccgct atgacctggc tatgagtaag aataataaat atggctccaa gcaggatttg 2941 gccaatgcca ttaaaggact tcagtctgca ggtatcaagg tcttgtcaga cttggtaccg 3001 aaccaacttt ataatctgcc tggaaaagaa gtggtaacgg caacccgggt caatcaatat 3061 ggtcaagcca aatcaggggc gaccatcaat aagactcctt atgtggccaa tacccgttcc 3121 tatggtgact accaagaaca gtacggtgga aaattcttgg atgacttgca aaagctctac 3181 ccaagactct ttagcaccaa gcaaatttca acgggcaaac caattgatcc atctgttaag 3241 attaccaatt ggtcggctaa gtatttcaat ggctccaata ttttggggcg tggtgccaag 3301 tatgttctga gtgagggcaa taagtatctc aacttggctg atggcaagct cttcttgcca 3361 acggtcctca ataatactta tggtcagccg caagtatcgg ctaatggttt tatttctaag 3421 aatggcggta ttcattatct tgataaaaat ggtcaggaag tcaagaatcg gttcaaggaa 3481 atttctggca gctggtatta cttcgattct gacggtaaga tggctactgg aaaaacgaag 3541 attggcaatg atacctacct ctttatgcct aatggtaaac aacttaagga aggtgtctgg 3601 tatgatggta agaaggccta ctattatgat gataatggta ggacgtggac caataagggc 3661 ttcgttgaat ttagagttga cggtcaagac aagtggcgtt acttcaatgg tgatggcact 3721 atcgccattg gactagttag tctagataat cgcaccctct actttgatgc ctatggctat 3781 caagtcaagg gacaaaccgt gacgattaat ggtaagtcct atacctttga tgccgatcag 3841 ggtgacttgg tccaaacaga caatgccaat ccagctcctc agggtcaagc aggctggaaa 3901 ctcctaggag ataaccagtg gggctaccgc aaggacggtc aactcttgac gggtgagcaa 3961 actattgatg gtcaaaaggt cttcttccaa gataatggcg tccaagtcaa aggtggaact 4021 gcgacagatg cttcaggtgt cttgcgtttc tacgaccgtg accagggcca ccaagttggc 4081 aagggctggt actcaacctc cgacgataat tgggtctatg tcaatgaatc cggtcaagtt 4141 ctgacaggct tacaaaccat tgatggtcag acggtctatt tcgatgacaa gggtatccag 4201 gccaagggca aggctgtctg ggatgaaaat ggcaacctgc gttactttga tgccgattca 4261 ggtaatatgc ttcgggaccg ttggaagaac gttgacggaa actggtatta cttcaaccgc 4321 aatggcctag ccacgaggtg gtaagcctaa tactcgctaa atattaaaag agctcg // LOCUS VCHTCPA 1048 bp ds-DNA BCT 25-MAY-1990 DEFINITION V.cholerae major pilin subunit (tcpA) gene, complete cds. ACCESSION M33514 KEYWORDS antigen; pilin. SOURCE V.cholerae (strain Z17561 (Classical, Inaba)) DNA, clone pPM2114. ORGANISM Vibrio cholerae Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Facultatively anaerobic rods; Vibrionaceae. REFERENCE 1 (bases 1 to 1048) AUTHORS Faast,R., Ogierman,M.A., Stroeher,U.H. and Manning,P.A. TITLE Nucleotide sequence of the structural gene, tcpA, for a major pilin subunit of Vibrio cholerae JOURNAL Gene 85, 227-231 (1989) STANDARD simple staff_review FEATURES from to/span description pept 144 818 major pilin subunit (tcpA) precursor sigp 144 218 major pilin subunit (tcpA) signal peptide matp 219 815 major pilin subunit (tcpA) ORF 865 > 1048 ORF BASE COUNT 327 a 182 c 240 g 299 t ORIGIN 1 aaaaaataaa aaacacagca aaaatgacat ctgtcaattg taggtgactt tgtgtggtta 61 aatgtgcgtg ttgcttacgt tatctaaaaa agaccaagcg acgcatttct ttaaagacag 121 taaaatggtg gagttacata aatatgcaat tattaaaaca gctttttaag aagaaatttg 181 taaaagaaga acacgataag aaaaccggtc aagagggtat gacattactc gaagtgatca 241 tcgttctagg cattatgggg gtggtttcgg cgggggttgt tactctggcg cagcgtgcga 301 ttgattcgca gattatgacc aaggccgcgc aaagtctcaa tagtatccaa gttgcactga 361 cacagacata ccgtggtcta ggtaattatc cagcaacagc tgatgcgaca gctgctagta 421 agctaacttc aggcttggtt agtttaggta aaatatcatc cgatgaggca aaaaacccat 481 tcaatggtac aaatatgaat attttttcat ttccgcgtaa tgcagcagct aataaagcat 541 ttgcaatttc agtggatggt ctgacacagg ctcaatgcaa gacacttatt accagtgtcg 601 gtgatatgtt cccatatatt gcaatcaaag ctggtggcgc agtagcactt gcagatctag 661 gtgattttga gaattctgca gcagcggctg agacaggcgt tggtgtgatc aaatctatcg 721 ctcccgctag taagaattta gatctaacga acatcactca cgttgagaaa ttatgtaaag 781 gtactgctcc attcggcgtt gcatttggta acagctaatt caaataagtt tgtttaactt 841 aatcttaacg ttgcccatta aataatgggc aacttattaa attcaatgtg gtatcaatat 901 gagaaaatac caacaaggtg tcggattatt ggaggcgatt ctggcttctg cgtattaggg 961 atggcattgg tcgctgctgg gagctattac aagcgggaag ctgaactcat gattaaatcc 1021 agtaacgcat ttgatgttat tgagttgt // LOCUS HPCPOLYP 7310 bp ss-RNA VRL 25-MAY-1990 DEFINITION Hepatitis C virus polyprotein gene, partial cds. ACCESSION M32084 KEYWORDS polyprotein. SOURCE Hepatitis C virus, cDNA to viral RNA, clones K9-1 through 15e, isolated from chimpanzee (individual 910) blood plasma. ORGANISM Hepatitis C virus Viridae; ss-RNA enveloped viruses; Positive strand RNA virus; Togaviridae incertae sedis. REFERENCE 1 (bases 1 to 7310) AUTHORS Choo,Q.-L., Richman,K. and Han,J. TITLE The nucleotide sequence of the Hepatitis C viral genome JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and printed sequence for [1] kindly submitted by M.Houghton, 22-FEB-1990. Chiron Corporation, 4560 Horton Street, Emeryville CA 94608. FEATURES from to/span description pept < 1 > 7310 polyprotein (AA at 3) BASE COUNT 1495 a 2218 c 2058 g 1539 t ORIGIN 1 caggctgtcc tgagaggcta gccagctgcc gaccccttac cgattttgac cagggctggg 61 gccctatcag ttatgccaac ggaagcggcc ccgaccagcg cccctactgc tggcactacc 121 ccccaaaacc ttgcggtatt gtgcccgcga agagtgtgtg tggtccggta tattgcttca 181 ctcccagccc cgtggtggtg ggaacgaccg acaggtcggg cgcgcccacc tacagctggg 241 gtgaaaatga tacggacgtc ttcgtcctta acaataccag gccaccgctg ggcaattggt 301 tcggttgtac ctggatgaac tcaactggat tcaccaaagt gtgcggagcg cctccttgtg 361 tcatcggagg ggcgggcaac aacaccctgc actgccccac tgattgcttc cgcaagcatc 421 cggacgccac atactctcgg tgcggctccg gtccctggat cacacccagg tgcctggtcg 481 actacccgta taggctttgg cattatcctt gtaccatcaa ctacaccata tttaaaatca 541 ggatgtacgt gggaggggtc gaacacaggc tggaagctgc ctgcaactgg acgcggggcg 601 aacgttgcga tctggaagac agggacaggt ccgagctcag cccgttactg ctgaccacta 661 cacagtggca ggtcctcccg tgttccttca caaccctacc agccttgtcc accggcctca 721 tccacctcca ccagaacatt gtggacgtgc agtacttgta cggggtgggg tcaagcatcg 781 cgtcctgggc cattaagtgg gagtacgtcg ttctcctgtt ccttctgctt gcagacgcgc 841 gcgtctgctc ctgcttgtgg atgatgctac tcatatccca agcggaggcg gctttggaga 901 acctcgtaat acttaatgca gcatccctgg ccgggacgca cggtcttgta tccttcctcg 961 tgttcttctg ctttgcatgg tatttgaagg gtaagtgggt gcccggagcg gtctacacct 1021 tctacgggat gtggcctctc ctcctgctcc tgttggcgtt gccccagcgg gcgtacgcgc 1081 tggacacgga ggtggccgcg tcgtgtggcg gtgttgttct cgtcgggttg atggcgctga 1141 ctctgtcacc atattacaag cgctatatca gctggtgctt gtggtggctt cagtattttc 1201 tgaccagagt ggaagcgcaa ctgcacgtgt ggattccccc cctcaacgtc cgaggggggc 1261 gcgacgccgt catcttactc atgtgtgctg tacacccgac tctggtattt gacatcacca 1321 aattgctgct ggccgtcttc ggaccccttt ggattcttca agccagtttg cttaaagtac 1381 cctactttgt gcgcgtccaa ggccttctcc ggttctgcgc gttagcgcgg aagatgatcg 1441 gaggccatta cgtgcaaatg gtcatcatta agttaggggc gcttactggc acctatgttt 1501 ataaccatct cactcctctt cgggactggg cgcacaacgg cttgcgagat ctggccgtgg 1561 ctgtagagcc agtcgtcttc tcccaaatgg agaccaagct catcacgtgg ggggcagata 1621 ccgccgcgtg cggtgacatc atcaacggct tgcctgtttc cgcccgcagg ggccgggaga 1681 tactgctcgg gccagccgat ggaatggtct ccaaggggtg gaggttgctg gcgcccatca 1741 cggcgtacgc ccagcagaca aggggcctcc tagggtgcat aatcaccagc ctaactggcc 1801 gggacaaaaa ccaagtggag ggtgaggtcc agattgtgtc aactgctgcc caaaccttcc 1861 tggcaacgtg catcaatggg gtgtgctgga ctgtctacca cggggccgga acgaggacca 1921 tcgcgtcacc caagggtcct gtcatccaga tgtataccaa tgtagaccaa gaccttgtgg 1981 gctggcccgc tccgcaaggt agccgctcat tgacaccctg cacttgcggc tcctcggacc 2041 tttacctggt cacgaggcac gccgatgtca ttcccgtgcg ccggcggggt gatagcaggg 2101 gcagcctgct gtcgccccgg cccatttcct acttgaaagg ctcctcgggg ggtccgctgt 2161 tgtgccccgc ggggcacgcc gtgggcatat ttagggccgc ggtgtgcacc cgtggagtgg 2221 ctaaggcggt ggactttatc cctgtggaga acctagagac aaccatgagg tccccggtgt 2281 tcacggataa ctcctctcca ccagtagtgc cccagagctt ccaggtggct cacctccatg 2341 ctcccacagg cagcggcaaa agcaccaagg tcccggctgc atatgcagct cagggctata 2401 aggtgctagt actcaacccc tctgttgctg caacactggg ctttggtgct tacatgtcca 2461 aggctcatgg gatcgatcct aacatcagga ccggggtgag aacaattacc actggcagcc 2521 ccatcacgta ctccacctac ggcaagttcc ttgccgacgg cgggtgctcg gggggcgctt 2581 atgacataat aatttgtgac gagtgccact ccacggatgc cacatccatc ttgggcatcg 2641 gcactgtcct tgaccaagca gagactgcgg gggcgagact ggttgtgctc gccaccgcca 2701 cccctccggg ctccgtcact gtgccccatc ccaacatcga ggaggttgct ctgtccacca 2761 ccggagagat ccctttttac ggcaaggcta tccccctcga agtaatcaag ggggggagac 2821 atctcatctt ctgtcattca aagaagaagt gcgacgaact cgccgcaaag ctggtcgcat 2881 tgggcatcaa tgccgtggcc tactaccgcg gtcttgacgt gtccgtcatc ccgaccagcg 2941 gcgatgttgt cgtcgtggca accgatgccc tcatgaccgg ctataccggc gacttcgact 3001 cggtgataga ctgcaatacg tgtgtcaccc agacagtcga tttcagcctt gaccctacct 3061 tcaccattga gacaatcacg ctcccccagg atgctgtctc ccgcactcaa cgtcggggca 3121 ggactggcag ggggaagcca ggcatctaca gatttgtggc accgggggag cgcccctccg 3181 gcatgttcga ctcgtccgtc ctctgtgagt gctatgacgc aggctgtgct tggtatgagc 3241 tcacgcccgc cgagactaca gttaggctac gagcgtacat gaacaccccg gggcttcccg 3301 tgtgccagga ccatcttgaa ttttgggagg gcgtctttac aggcctcact catatagatg 3361 cccactttct atcccagaca aagcagagtg gggagaacct tccttacctg gtagcgtacc 3421 aagccaccgt gtgcgctagg gctcaagccc ctcccccatc gtgggaccag atgtggaagt 3481 gtttgattcg cctcaagccc accctccatg ggccaacacc cctgctatac agactgggcg 3541 ctgttcagaa tgaaatcacc ctgacgcacc cagtcaccaa atacatcatg acatgcatgt 3601 cggccgacct ggaggtcgtc acgagcacct gggtgctcgt tggcggcgtc ctggctgctt 3661 tggccgcgta ttgcctgtca acaggctgcg tggtcatagt gggcagggtc gtcttgtccg 3721 ggaagccggc aatcatacct gacagggaag tcctctaccg agagttcgat gagatggaag 3781 agtgctctca gcacttaccg tacatcgagc aagggatgat gctcgccgag cagttcaagc 3841 agaaggccct cggcctcctg cagaccgcgt cccgtcaggc agaggttatc gcccctgctg 3901 tccagaccaa ctggcaaaaa ctcgagacct tctgggcgaa gcatatgtgg aacttcatca 3961 gtgggataca atacttggcg ggcttgtcaa cgctgcctgg taaccccgcc attgcttcat 4021 tgatggcttt tacagctgct gtcaccagcc cactaaccac tagccaaacc ctcctcttca 4081 acatattggg ggggtgggtg gctgcccagc tcgccgcccc cggtgccgct actgcctttg 4141 tgggcgctgg cttagctggc gccgccatcg gcagtgttgg actggggaag gtcctcatag 4201 acatccttgc agggtatggc gcgggcgtgg cgggagctct tgtggcattc aagatcatga 4261 gcggtgaggt cccctccacg gaggacctgg tcaatctact gcccgccatc ctctcgcccg 4321 gagccctcgt agtcggcgtg gtctgtgcag caatactgcg ccggcacgtt ggcccgggcg 4381 agggggcagt gcagtggatg aaccggctga tagccttcgc ctcccggggg aaccatgttt 4441 cccccacgca ctacgtgccg gagagcgatg cagctgcccg cgtcactgcc atactcagca 4501 gcctcactgt aacccagctc ctgaggcgac tgcaccagtg gataagctcg gagtgtacca 4561 ctccatgctc cggttcctgg ctaagggaca tctgggactg gatatgcgag gtgttgagcg 4621 actttaagac ctggctaaaa gctaagctca tgccacagct gcctgggatc ccctttgtgt 4681 cctgccagcg cgggtataag ggggtctggc gagtggacgg catcatgcac actcgctgcc 4741 actgtggagc tgagatcact ggacatgtca aaaacgggac gatgaggatc gtcggtccta 4801 ggacctgcag gaacatgtgg agtgggacct tccccattaa tgcctacacc acgggcccct 4861 gtacccccct tcctgcgccg aactacacgt tcgcgctatg gagggtgtct gcagaggaat 4921 atgtggagat aaggcaggtg ggggacttcc actacgtgac gggtatgact actgacaatc 4981 tcaaatgccc gtgccaggtc ccatcgcccg aatttttcac agaattggac ggggtgcgcc 5041 tacataggtt tgcgcccccc tgcaagccct tgctgcggga ggaggtatca ttcagagtag 5101 gactccacga atacccggta gggtcgcaat taccttgcga gcccgaaccg gacgtggccg 5161 tgttgacgtc catgctcact gatccctccc atataacagc agaggcggcc gggcgaaggt 5221 tggcgagggg atcacccccc tctgtggcca gctcctcggc tagccagcta tccgctccat 5281 ctctcaaggc aacttgcacc gctaaccatg actcccctga tgctgagctc atagaggcca 5341 acctcctatg gaggcaggag atgggcggca acatcaccag ggttgagtca gaaaacaaag 5401 tggtgattct ggactccttc gatccgcttg tggcggagga ggacgagcgg gagatctccg 5461 tacccgcaga aatcctgcgg aagtctcgga gattcgccca ggccctgccc gtttgggcgc 5521 ggccggacta taaccccccg ctagtggaga cgtggaaaaa gcccgactac gaaccacctg 5581 tggtccatgg ctgtccgctt ccacctccaa agtcccctcc tgtgcctccg cctcggaaga 5641 agcggacggt ggtcctcact gaatcaaccc tatctactgc cttggccgag ctcgccacca 5701 gaagctttgg cagctcctca acttccggca ttacgggcga caatacgaca acatcctctg 5761 agcccgcccc ttctggctgc ccccccgact ccgacgctga gtcctattcc tccatgcccc 5821 ccctggaggg ggagcctggg gatccggatc ttagcgacgg gtcatggtca acggtcagta 5881 gtgaggccaa cgcggaggat gtcgtgtgct gctcaatgtc ttactcttgg acaggcgcac 5941 tcgtcacccc gtgcgccgcg gaagaacaga aactgcccat caatgcacta agcaactcgt 6001 tgctacgtca ccacaatttg gtgtattcca ccacctcacg cagtgcttgc caaaggcaga 6061 agaaagtcac atttgacaga ctgcaagttc tggacagcca ttaccaggac gtactcaagg 6121 aggttaaagc agcggcgtca aaagtgaagg ctaacttgct atccgtagag gaagcttgca 6181 gcctgacgcc cccacactca gccaaatcca agtttggtta tggggcaaaa gacgtccgtt 6241 gccatgccag aaaggccgta acccacatca actccgtgtg gaaagacctt ctggaagaca 6301 atgtaacacc aatagacact accatcatgg ctaagaacga ggttttctgc gttcagcctg 6361 agaagggggg tcgtaagcca gctcgtctca tcgtgttccc cgatctgggc gtgcgcgtgt 6421 gcgaaaagat ggctttgtac gacgtggtta caaagctccc cttggccgtg atgggaagct 6481 cctacggatt ccaatactca ccaggacagc gggttgaatt cctcgtgcaa gcgtggaagt 6541 ccaagaaaac cccaatgggg ttctcgtatg atacccgctg ctttgactcc acagtcactg 6601 agagcgacat ccgtacggag gaggcaatct accaatgttg tgacctcgac ccccaagccc 6661 gcgtggccat caagtccctc accgagaggc tttatgttgg gggccctctt accaattcaa 6721 ggggggagaa ctgcggctat cgcaggtgcc gcgcgagcgg cgtactgaca actagctgtg 6781 gtaacaccct cacttgctac atcaaggccc gggcagcctg tcgagccgca gggctccagg 6841 actgcaccat gctcgtgtgt ggcgacgact tagtcgttat ctgtgaaagc gcgggggtcc 6901 aggaggacgc ggcgagcctg agagccttca cggaggctat gaccaggtac tccgcccccc 6961 ctggggaccc cccacaacca gaatacgact tggagctcat aacatcatgc tcctccaacg 7021 tgtcagtcgc ccacgacggc gctggaaaga gggtctacta cctcacccgt gaccctacaa 7081 cccccctcgc gagagctgcg tgggagacag caagacacac tccagtcaat tcctggctag 7141 gcaacataat catgtttgcc cccacactgt gggcgaggat gatactgatg acccatttct 7201 ttagcgtcct tatagccagg gaccagcttg aacaggccct cgattgcgag atctacgggg 7261 cctgctactc catagaacca cttgatctac ctccaatcat tcaaagactc // LOCUS YSCREP1A 1498 bp ds-DNA PLN 25-MAY-1990 DEFINITION S.cerevisiae rep1 protein gene, complete cds. ACCESSION M31942 KEYWORDS rep1 protein. SOURCE S.cerevisiae (strain SKQ-2n) plasmid Scp3 DNA. ORGANISM Saccharomyces cerevisiae Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes; Endomycetales; Saccharomycetaceae. REFERENCE 1 (bases 1 to 1498) AUTHORS Neuville,P., Bonneu,M. and Aigle,M. TITLE 2 micron plasmid heterogeneity in the yeast Saccharomyces cerevisiae: A new sequence for the rep1 gene JOURNAL Gene (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by P.Neuville, 13-FEB-1990. FEATURES from to/span description pept 295 1437 rep1 protein BASE COUNT 513 a 245 c 315 g 425 t ORIGIN 1 aagctttgac aaaaagcatt gagtagctaa ctcttctatg caatctatag ctgttttata 61 aggcatttaa tgacaaattg attgtttaga aacatactag tgaaagtagc cttaatcgcc 121 tctcgaaatt aataatgcag tatggtgtaa aaaatgcaac tcgcgttgct ctactttttt 181 ccgaatttcc gaatacgcag ctggggtgat tgatcgattt cgtatcaaaa gttttgttta 241 taaaaaccgc gaaatccttc ttcaacagat ggatttttac agtgctgaca tacaatgaca 301 acaactataa tagaaaataa ccgaaatatg aatggagaga gactacttgt ttgtattaag 361 caatgtatta tgcagcactt ccaacctatg atatacgatg aaagtagatg tgtaattgaa 421 accacaaggg gaactttttc agttcctgac aattataaga aatataaaac cttagcattt 481 gcttttgttg gacacgtact gaatacagat gacacaccgg taattgaaaa agaactggat 541 tggcctgatc ctgcactagt gtacaataca attgtcgatc gaatcagaaa tcacccagaa 601 ttatcacagt ttatatcggt tgcattcatt agtcaattaa aggcggccat aggggaggtt 661 agatatcaat gtaagaggca cgttaaaccg cagggaaagg gtataagaag gccgaacgta 721 ttttttagat acatggaatc cccgtttgtc aataccaagg ttactgactt cttttcttat 781 cttcgagact ataataaaat tgcctcagag tatcacaata atactaaatt cattctcacg 841 ttttcatgta aagcatattg ggcatctggc caaaactttt ccgccttgaa gaatgttatt 901 aggtgctcca tagttcatga atacatttct aggtttgtgg aaagagaaca gggtcttata 961 ggagatcagg agcaaccacg tgaagaaaac tcttcccctg aactaagtaa tgtgcaacat 1021 gaagtcaata atttaatcga acaagatgcc gcggcggatg aagggttgtg gagtgagata 1081 gattcattat gtgaaaaatg gcagtctgaa gcggaagacc aagctgaagc ggagataata 1141 gccgacagaa taattggaaa tagtcaaagg atggcgaacc ttaaaattcg tcgtacaaag 1201 ttcaaaagtg tcttgtatca tatactaaag gaactaattc aatctcaggg aaccgtaaag 1261 gtttatcgcg gtagtagctt ttcacacgat tctataaaga taagcttaca ttatgaagag 1321 cagcatatta cagccgtatg ggtctaccta acagtgaaat ttgaagagca ttggaaacca 1381 gttgatgtag aagtcgagtt taggtgcaag ttcaaggagc gaaaggtcaa tgagtaggtt 1441 atatagggat atagaagata tagaacatag caaagagctg cttttgagca atgtttgt // LOCUS MUSREN1C 758 bp ds-DNA ROD 25-MAY-1990 DEFINITION Mouse renin (Ren-1-c) gene, 5' end. ACCESSION M34190 KEYWORDS renin. SOURCE M.musculus (strain C57BL/10) liver DNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 758) AUTHORS Burt,D.W., Mullins,L.J., George,H., Smith,G., Brooks,J., Pioli,D. and Brammar,W.J. TITLE The nucleotide sequence of a mouse renin-encoding gene, Ren-1-d, and its upstream region JOURNAL Gene 84, 91-104 (1989) STANDARD simple staff_review FEATURES from to/span description pept 729 > 758 renin (Ren-1-c), exon 1 pre-msg 701 > 758 Ren-1-c mRNA and introns BASE COUNT 208 a 156 c 245 g 149 t ORIGIN 1 tctagagtca ttgggctcag ccacccttcc cacaccccca tgcctgccac cactctgctc 61 tgtgaccagg tcctgcttat cctataccta cctagcttgg tctcacaggc taagatttat 121 cagagctgcc ctgccatggg ccttgttggg cacctggcgt ggggaaataa aggaggtggg 181 cgagactcac cagagctcag agagatgggc gggcagacag caggggaagg cactgggtgg 241 tctggcagct ggaaatgctg ggaggccttc ttgggggaga ttagataaag ctcttcaggg 301 gaaggcctat tccatgactc cagcatggtg agtctagatg aaaggaggta gtctatggtt 361 ttagagcttt attgtagaag agagagagag aaggtagaga agtagaagcc agtcattgcc 421 atgaagaagg aagggggaga aggagagcga aaggtaagag taagaaaaca agagctttag 481 gagagagaca agagagagag gaggaggcaa gcagccaggt aactctgggg gtggagtctg 541 gacagcctac atgactgatg gccacagaat tatggagctg ggtccttggc cagaaaacag 601 gctgcctttc atggtcccac aggccctggg gtaataaatc aaagcagagc ctgtgataca 661 tggtgtgtat aaaagaaggc tcagggggtc tgggctacac agctcttaga aagccttggc 721 tgaaccagat ggacagaagg aggatgcctc tctgggca // LOCUS MUSREN1D 11231 bp ds-DNA ROD 25-MAY-1990 DEFINITION Mouse renin (Ren-1-d) gene, complete cds. ACCESSION M32352 KEYWORDS renin. SOURCE M.musculus (strain DBA/2J) liver DNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 11231) AUTHORS Burt,D.W., Mullins,L.J., George,H., Smith,G., Brooks,J., Pioli,D. and Brammar,W.J. TITLE The nucleotide sequence of a mouse renin-encoding gene, Ren-1-d, and its upstream region JOURNAL Gene 84, 91-104 (1989) STANDARD simple staff_review FEATURES from to/span description pept 1286 1377 renin (Ren-1-d), exon 1 4697 4847 renin, exon 2 5364 5487 renin, exon 3 6056 6174 renin, exon 4 6955 7151 renin, exon 5 8940 9059 renin, exon 6 9487 9631 renin, exon 7 9884 9982 renin, exon 8 10547 10708 renin, exon 9 pre-msg 1258 10887 Ren-1-d mRNA and introns IVS 1378 4696 Ren-1-d intron A IVS 4848 5363 Ren-1-d intron B IVS 5488 6055 Ren-1-d intron C IVS 6175 6954 Ren-1-d intron D IVS 7152 8939 Ren-1-d intron E IVS 9060 9486 Ren-1-d intron F IVS 9632 9883 Ren-1-d intron G IVS 9983 10546 Ren-1-d intron H BASE COUNT 2809 a 2725 c 3051 g 2646 t ORIGIN 1 ggtacttagg gagagtaata ctgccagcca gggtcaggtc acttcgacag ctctgccagg 61 cttctgagag gtacagggaa gaaggacagg gggcaaaagg ataggtctaa gggagtgaag 121 ggccagtagg agggggtcag ctggaagcca gctgcactta agacctcctt tcttttagag 181 ctacccttct gtccaccatc actaacccag gagggacacc caccctctcc ctgatgccag 241 aggttagatg acaggttact tttccaacta ggcagcagcc aaaaggaatc actcatgatc 301 aagatggaag actagggcag agcctaggag gagccagggg tgatttgaag gttaatatag 361 attaatagat aggttaatat ataatattag gttaatatgc aggtctcggg ggcatgagtt 421 ggtagggtct gagttagagg tttgaagcca gggtctatgc tgtctgtacc ctgacctcat 481 gctgaacagc acagctcctc attagcctct gtctaagcta gaggttgcgg ggccaggcca 541 aacagggact ctagagtcat tgggctcagc cacccttccc acacccccat gcctgccacc 601 actctgctct gcgaccaggt tctgcttatc ctatacctac ctaacttggt ctcacaggct 661 agaatttatc aggactgccc tgccatgggc cttgttgggc acctggcatg gggaaataaa 721 ggaggtgggc gagacctgtg tctcaccaga gctcagagtg actggatggg cagacagcag 781 gggaaggcac tgggtggtct ggcagctgga aacgctggga ggccttcttg ggggaaatta 841 gatgaagctc ttcaggggaa ggcctattcc atgactccag catggtgagt ctagatgaaa 901 ggaggtagtc tatggtttta gagctttatt gtagaagaga gagagagaag gtagagaagt 961 agatgccagc cattgccacg aagaaggaag ggggagaagg agagcaaaag gtaagagtaa 1021 gaaaacaaga gctttaggag agagacaaga gagagaggag gaggcaagca gccaggtaac 1081 tctgggggtg gagtctggac agcctacatg actgatggcc acagaattat ggagctgggt 1141 ccttggccag aaaacaggct gcctttcatg gtcccacagg ccctggggta ataaatcaaa 1201 gcagatcctg tgatacatgg tgtgtataaa agaaggctca gggggtctgg gctacacagc 1261 tcttagaaag ccttggctga accagatgga caggaggagg atgcctctct gggcactctt 1321 gttgctctgg agtccttgca ccttcagtct cccaacacgc accgctacct ttgaacggta 1381 acttgggcag agaaggggtg gcagggtaca ggaactggca tcttactaac gccctcagct 1441 gtctatgcgt tgggtcatcc agtccttttt ggccagccag tcagggatgt atgatcctgc 1501 tctgacaatc ctagggaata gcccaaagcc ctagtggacg gccataatta aggggtacca 1561 caacttcccc cttctactac ttttttaaaa ggcgaggggt cactggagtg actgtgtcat 1621 ttgtggtgaa aattttgggt ggagtggaag tatcagaaga aacatgtagg ccaaaggagt 1681 aacatgacta tcagttaatt aatctagata gacattaagt tttccttgtg atcatgtaac 1741 tccccccaag ttcagcagcc gtttggtttg gcgagagagt tcatctgtag ttggaagacc 1801 tcaagtctgt cctctgtggg gttcagctcg tgtctctgga tctcatgtgt gtcttgggag 1861 tcttctgtca gatgaggtgt ctggaggaag gcagctttgt ggtctgtagg tctcaggaga 1921 cctgtctatg ctggatctgc agctgcaggc actgcccttt acttttgtgg ggataaacct 1981 gtggggtaat cacagcagag gagcagaacc ctagcagcag gatgtctgca tagggtggat 2041 ggggaacgtt aagcctgcaa tggtttgctt ggaagcacat cctggcctgt tgttaggatc 2101 ctagaaaagc aagggtgttt gaggaatgta gtcctggaag gcatccttta agtctgcttt 2161 gggtggctaa ggaggaaaat atcatgttaa ttgtgtatct gggataggca gtaacagatt 2221 gtctgcaaaa ggaagctggt tgtcagttaa ggattggatg acggtaatca gcatggttaa 2281 ttcaacccag ggaagcacat tagatggctt ttggccctta aaggaaatca tgaatgctat 2341 ggagaaaaat aaacattaaa ataaagaagc aagaaaattt tggattgaaa agcttactag 2401 cagcagatga gggtaagcag caagctaatc tcaccaatcc tcagtccgcc attcaaaaac 2461 aggatgaaag cagagcctgc ttcaccggct gctttgaaga tttgatgaga ggataagcat 2521 agcacttccg atgccaggcg gggaccggtg taagaagcca tggttacagc atggattacc 2581 acagagtgta ggaaagggaa ggctcctggc agccccacga gggaggatga aacagccatt 2641 tagtgcagtt ccagggctgt aggaacaagg cttcttgagg caggagcagg gcagaggcaa 2701 gagaaggctc acccagcaat ggtctgggga ctatgatgag gaagtcctga aagagtcaag 2761 gcaatgagaa ctgtgaatct gcttggtgct taggagctgg tgggttgctt tcatcctaaa 2821 ttgagtctcc agacagacat cagcacccat gttgcagatg aggatgtcct ccagaaaggt 2881 ctgaggactc actggtcttt gtttccaaag ccctgccatc ttgctcccag catgcagcag 2941 agagcgtggg attaactaga gggcgtgggg ttctgtgctc cctgctctgt gtaaagtatt 3001 gagcaagtca ccttgagggg ctgagcaatg ggctttcttc tcaaggccat tacatgcctt 3061 gcatacacac aatgtccagc ctccacgctt agcctgagct gtgcctgggc gaaaaaacca 3121 tcaaggtgcc agaggacgaa gaggttagtg aatgctcaga accgtgcagc atgccctcag 3181 catcaggtgg ctctatcccc gttgtccctg taggctaggc ttgtggggag cactcagggg 3241 aaatggaccc attggctggc atccttggtg agagcaccta ggtatgggct caggtcttga 3301 aacattaagg atctaggaga ggaatagaag gggttttatg tttattttct tttgggacag 3361 agcctcatta tggagcctgg aactcactat atagaccaga ctggcctcaa acccacagag 3421 atcagccttc ctctgcctcc tgaggactga aattaatgtc atgccctacc acacaacaga 3481 gaggttttct aaaagaagga agggtcataa aagaaacagc aagcattggg tctttttcag 3541 gaaattggtg gagaactaga cctgtcttcc tcccttgaag agtggggaaa taaaacacac 3601 ccatctggtg agatggctgg ggtcaactaa cacaagagtg gaaatacttg caagccgtgc 3661 taggactaag tcatttctgc tggagattac tcggataggt gtgaaatatg ggcaaagctg 3721 ggaggtgagg cacggagtgc tgggaggaag gttagggctc acactggaga agaaagaggg 3781 ctctctggat gtctgggaaa gccctggctc aggactggtg gcatcttcct ccaactgcaa 3841 aacagtctgt gtggtcccgg cagtcctggg taggaaagac agacagacag agcttggact 3901 gtctctgttt ggcatttctg agtcatcttc atccttgtga accttcaatg tctccgccac 3961 caccgccgga aaccagggag ctagatgtca tgtggtttgt atgagggtgc gtgccaggca 4021 gctggcctgc ggccaggagt agctcaggga gctgctgttc taaatgaaat ctcttgaatt 4081 ttgccagtca ataatgattg gaggtgagga ggggcccagg attcaggccc aagacacatc 4141 agtccccagg tgaggcacac cggcaagcta tagacactgg cctatccaat ccaagtcaag 4201 ctaggttcat tgggcacata tcacctggct ctgtgatacc ccagggagca acaagtcctt 4261 gcagcctgct gtgggcatct gctgccactt acaggccatc cgtggtactg ccttgcccgt 4321 caatccaagg agatggcaag aggctgtata gaccttttat gttcctctcc agaacagtga 4381 ggaagagtag gtgttcaaca cacgatgatg ataatggtag atagatgaac agacggactg 4441 actgatgaac agacgggtgg atggacagac ggatggacgg atgaacaaac tgctttatct 4501 aactaaaata agatctggga attacttagt accagaataa gtgactccaa gaaaccccag 4561 ggaactaaaa gtgattcgta attgcaatgg gagagggtta gtgttagacg taccaaaaag 4621 caaggcaaca tctgggctct gtcaacgttg gagcactgca cgactcttct gtttttcttt 4681 ctctaccctg gagcagaatc ccactcaaga aaatgccctc tgtccgggaa atcctggagg 4741 agcggggagt ggacatgacc aggctcagtg ctgaaagggg cgtattcaca aagaggcctt 4801 ccttgatcaa tcttacctcc cccgtggtcc tcaccaacta cctgaatgtg agtcctggct 4861 catctgctat ctccttttct ctctgggact agcaggctcc cactccttcc acaccatgtt 4921 tgactgagag ccaaagagca ccccatagaa ctcctattta ccagccgaga agtctgatgg 4981 ggaggggctg ttgtgattgt tttgttgttt ggtttggttt gtttttgaga cagggtctca 5041 ctgtgtagcc tggcaagcct ggaactcact ttgtagacca gactgacctc gaactcacag 5101 agttctacct acctctgcct cccaggtgct gggattaaag catgtgacat catgcccagc 5161 aattttaatt gctttgatta agcttgcccc agcaagcctc ctgtatccca ggccagcaaa 5221 agacaatgct gagtcctcac atctcagcaa aaagaaagcc aagagccgag aaggggtcca 5281 gacaaccccc caaaccagag gccccaagcc ctgaagtcac ccccatctct ctgggcccca 5341 ttttcctttt gcccccgcca cagacccagt actacggcga gattggcatc ggtaccccac 5401 cccagacctt caaagtcatc tttgacacgg gttcggccaa cctctgggtg ccctccacca 5461 agtgcagccg cctctacctt gcttgtggta agagtcaaga gtcagcctcg tgcctctccc 5521 cagctctgcc tgctgcacct atgacctgct tctcctcagc tccacaccac tgactgagga 5581 agagagagga agtgtctctc ttctttttga aatggcagcc aagttaataa gccagctttg 5641 ggaatgcctg ggcacctgga aggccttttg ccatgtccca aggattcaga caggaacaaa 5701 gtgtaaaccc actccaggtc ttgaggcttt ttcgatgctc ccatgctgag tacttactct 5761 gtgcccagaa tcctctttag cactatatca gtgttagcaa tggctttaga actctgtgaa 5821 aaaaatgtct cagaaagcac gagcacataa gacgaaaaca aaaaaaacca ctatccttgc 5881 ctggtttaca tttttccaag ctggagactc aggaaactaa acacacacac acacacacac 5941 acacacacac acacacacct aagtcttcaa ccatgtttat catttgtcct gcctctgagc 6001 cttctatgta aatcaatctc tcttgtgctg actggggcct gtaaccctct gctagggatt 6061 cacagcctct atgagtcctc tgactcctcc agctacatgg agaacgggtc cgacttcacc 6121 atccactacg gatcaggaag agtcaaaggc ttcctcagcc aggacgtggt gactgtgagt 6181 aggatcgcct ctatttacca cagggcccat gtggttccag acaggtatac tccagccctc 6241 ctccctaccc ctcccactcc tctgcaggag cctgggaagt tgaagtgcag cctccttggc 6301 agcttcttct agctgaagct ccgaagccaa ttgcaggaga gccaaacctg ctgacactga 6361 tgcacaaggc tttgcatcta acttgcatgt ggatgtgtgc ctaaacccca cagtttcctt 6421 cctagtaacc agtctgactt tgctaattga gaccaggttt ggagtactga gattaggaat 6481 caggggtttg tgtccgggca gttttgattt ctttccttca gtgtctaagc ctgttctggc 6541 atatctcagc cccgctgtgt cagaggaaga gaaagacaca tacagaggtc agaggaaatc 6601 agtgcgagcc cttccctcgc actgcccccg cggaatctgg ggagggcaga aggagccctg 6661 gtaatgacga ggcatcctca gcctcggctg gtgggctagc acgaaagcgg gcctctagga 6721 gagttccggt cacctgtctg gcaggacctc tttctcagcc tttatagcat gagtcacccc 6781 catctctctg ggccccaagc atagaggtgt tgtcagtgtg tgtgcagact tgacggaata 6841 gtcttaactg ctgtacagag cgtgtgtatc tctgctgccc tggtatgaaa gccagagcag 6901 atcccagcca aggttaggca gtaacctatg tcccttctgg ttcttctctc acaggtgggt 6961 ggaatcactg tgacacagac ctttggagag gtcaccgagc tgcccctgat ccctttcatg 7021 ctggccaagt ttgacggtgt cctaggcatg ggctttcctg ctcaggccgt tggcggggtt 7081 acccctgtct ttgaccacat tctctcccag ggggtgctga aggaggaagt gttctctgtc 7141 tactacaaca ggtgggcctt tgacaccaag gccagggcag ggaggggtgg agagcaggga 7201 gagcagccac cgtccagttc aggatttata tgcttctaat taggtggttc tcagtcccca 7261 tctagaacca cagtagccaa ttagaacaaa tggaagtcaa gcattcggtc cccagggact 7321 ggccacacag acacagccag cttcatgcct ttgtttatct gttctcattg atccggttag 7381 tgctctgtta ccatgtgcta tggtaggctc caggggtgga cagtgaacag acgacccctg 7441 ccctccatcc tcacagagct cactgcccag caggggagat gagaggtaca acacaatgag 7501 tgccattcta aagctgcaga tggttttcag ttctaggcca ggattctggg cagttcagcc 7561 aggcctgggg aaactgtggt cagtgcccca gcaggggtgg gatctgctgc cagtggctgg 7621 gtttagaaat atggtctaag ctggaggtaa gatctttacg ggtaagaata gaggccaggg 7681 gtggtgagag tggaagggag tggtgtcctg gaacaatctt agaggaagca ggagtctctg 7741 gatcatgact ctggcagact tctgtgcaaa gaggggcatt gagccaatga caccactcgg 7801 aggctcctgc agtcaatggt cttgggactg tgtttcaaag agaccagaaa aacaaagtgt 7861 ctaagagtgg aagtagatgt cagaaagggg gaggggggac tccgcaaaat gtgggcagtt 7921 ctggaaattc aacagggaag gagatgtggg gcagagaagc gggagagggg aggtgggtgc 7981 tctaaacgct agaggatgct tgagcaatgc tgtgggtgga agaaggagag ctgtggaaaa 8041 cacatgcatg aggctgagaa gcaggagccc tggaagggat gaggacccag caacaggtag 8101 caagcagtcc tgggaagcag agacctgctt cctttccctt gggctgggaa gaaaggaaga 8161 tctagataag tcactgactc ttgcacagaa cctgtgtaca cctaacccat gtacatatgt 8221 gcacatacaa atgtgtgttg tgaattctaa gaaactccat catatacaca tgagtatctt 8281 aaaaccaccc aaggcaatcc tgagcttagg tagcatcatg cccgtggaga gagaagtgag 8341 ggtagtcagc tctggctggg aaggagagcc tggctccagc catttggtga ctcagttgcc 8401 tggggggggg gggtgtgaaa attaacctct accaaacaag ggttgctgat tctttcctac 8461 gaagtctccc ctctgtgtcc ctctttgcct cagcagaaat ctcaaggagg gaggagcccc 8521 atatggcccc aaagctccaa tacttcccag acccttaccc tggaatgata cccaccaccc 8581 acacactcca ttcatccagg cccaactcca ctaaatgtct gccttctccc tagtgtccac 8641 atgcccagga aagacgtgat ttgccacttg tctgcatcta cccaatgtcc ttgagctaga 8701 cagttttctg ggtgcaggtg aacagtccta gaaatacttc tgttacattt agggaggatc 8761 cagcaaggga agatgcaatg gggtctgtgt gcctcctgat gacagcttac agccttgcca 8821 tctacccact aaacccttcc tctatcccaa agtcctctgc cctaccccac accagactct 8881 actctccttc atccctgtat ctcgggtatg tctctaactt ccaatgttct ttcctttagg 8941 ggttcccacc tgctgggggg cgaggtggtg ctaggaggca gtgacccgca gcattaccaa 9001 ggcaattttc actatgtgag catcagcaag actgactcct ggcagatcac gatgaagggg 9061 tgggtcagca agcgtaactc tcccaaccat gcaaactgat gtcactgtgt gattggggtg 9121 ggctgggaat gggatgaggg ggacacttct tagggcacag ccaccatctg cctctctgct 9181 tgctctgtcc taggatgggg gcatgaaaat ttgaggatgg gaaggaaggg actcagggaa 9241 aggaggggga gtccggaggc tacatgctga tgcaaggaac agttagtacc tttccatagt 9301 gtctcggccc tcttccagtt catcatcaga ccatgtatgg cctcctatca cactacccta 9361 aattgggaaa ggcatgcagg cggcgggaaa agggtgcccg acagcttcta cctgcctgcc 9421 ttgtctgtct gtctgactgt ctgcctgcct gcctgcctgc ctgccagtct aactatagct 9481 ctgcagggtg tctgtggggt cttccaccct gctatgtgaa gaaggctgtg cggtagtggt 9541 ggacactggt tcatccttta tctcggctcc tacgagctcc ctgaagttga tcatgcaagc 9601 cctgggagcc aaggagaaga gaatagatga agtaagagat ccgtggggag ggctgggtgg 9661 tggtgagggg tggcagcagc actgtgcggc tccagcctgc aaatcgtcca gacagcactg 9721 gatcggaaac aaagctttcc cctcctcccc tctcccaccc cctgaatgta tggcatgtct 9781 tccatagaag gtgaccagag actggcacag gctctctggc acagggtgct ctagtctgct 9841 tcgaagcttt ccaccctgac aacccctctc tttattcccc cagtatgttg tgaactgtag 9901 ccaggtgccc accctccccg acatttcctt tgacctggga ggcagggcct acacactcag 9961 cagtacggac tacgtgctac aggtgaggct gggacaaggg tgggcagggg cactgaaggg 10021 ggacaagtgt aatgtacaga tcccccagta gactcgtggt tgatgaatac ggctcatgtc 10081 atcatctgga gaatttttag cagtcccagt acccaggcca catacaagtc tcgcctagca 10141 gatcggagct tccacaacag gcatctggga atcagagttt taaaacgact tccccagtag 10201 gttccattat gaagctctgc tgtgactcag tgagccccct ctctccatta ttgagtagcc 10261 ctctttctag tcctccatcc ttttcatgct cccaggctta gagaagttca atgacttgtt 10321 ttgggtcaga agctaagtga gcccgtgagc aaggggtaga cacaccattt gcctgatacc 10381 tactacagtg cctgcctacc ccaggaaggc tgggatttag gataggtaaa acagttctgg 10441 gaggatgtag gagagaggag gcatatactg aagagctgtt gcttcactct ggaggtggga 10501 atgcgttgtg tggcgggcag tagagcaagc ctctgcttct tgccagtatc cctacaggag 10561 agacaagctg tgcacactgg ctctccatgc catggacatc ccaccaccca ctgggcctgt 10621 ctgggtcctg ggtgccacct tcatccgcaa gttctataca gagtttgatc ggcataacaa 10681 tcgcattgga ttcgccttgg cccgctaagg ccctctgcca cccagtaacc ctaggccaag 10741 ccaagctggc actcctgggg gccattttgt ctggctttgt ccccaacata gggacactgg 10801 acacagagac cctaacgagt gtttgcccct tcacctgcac tcacccttcc ctgctttaag 10861 gaaaaaccga ataaagattt catgtttaaa gcctgtttcg gatgggttct ttggagtttg 10921 gaggaggtac acctgcatgg tcacacgtgg agaagactat gacacgtgag gccagagatg 10981 gaggtttgtt agtaaagtgc ttgcttttca agtgtgagaa cctgagtttg attccaggaa 11041 cccacactaa aaccaacaaa caaacaaaca gggctggaga gatggctcag tagttaagtg 11101 cactgagtgt tcttcctgat gtccttgcgt tcaattccca gcaacacacg tggtggctca 11161 caaccatgtg tatggggtct gatgccctct tctggcatgt ctaagagagt gataatgtac 11221 tcctaattaa t // LOCUS MUSREN2D 1116 bp ds-DNA ROD 25-MAY-1990 DEFINITION Mouse renin (Ren-2-d) gene, 5' end. ACCESSION M34191 KEYWORDS renin. SOURCE M.musculus (strain DBA/2J) liver DNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 1116) AUTHORS Burt,D.W., Mullins,L.J., George,H., Smith,G., Brooks,J., Pioli,D. and Brammar,W.J. TITLE The nucleotide sequence of a mouse renin-encoding gene, Ren-1-d, and its upstream region JOURNAL Gene 84, 91-104 (1989) STANDARD simple staff_review FEATURES from to/span description pept 1087 > 1116 renin (Ren-2-d), exon 1 pre-msg 1059 > 1116 Ren-2-d mRNA and introns BASE COUNT 309 a 248 c 303 g 256 t ORIGIN 1 tctagagtca ttgggctcag cacccttccc acacccccat gcctgccacc actctgctct 61 gcgaccagtg tcttgcttat cctatacctc acctaagcta catgccataa atcaatgctt 121 gttattcaag gcatcatctc ctttgggggg cacttctcaa aaagtcctct cctgtcccta 181 gatatctgag actcctcagt ggccttccat ttagaagggt gttgtctctg aagtaaaaac 241 ttcagttcct ctcacaggct aagatttatc agagctgcct gccatgggcc ttgttggcac 301 tggcgtggga ataagggggt ggcgagactg tgtcccgcag agctcagaga ctgggcaggc 361 agacagcagg agaagacact gggtggtctg gcagctggaa acgctgggag gccttcttgc 421 gggagattag atgaggctat tcaggggaag gcctattcca tttctccagc atggtgagtc 481 tagatgaaaa gaggtagtga aatagaggcc agccattgcc acaaagaagg taagagtaag 541 agaacaagag cttaagaaag agccaagatg ggctggagag atggctcagt gggtaagagc 601 acccgactgc tcttccaaag gtccgaagtt caaatcccag caaccacatg gtggctcaca 661 accatctcat aatgagatct gactccctct tcttggagtg tctgaagaca gctacagtat 721 atttacatat aataaaaata aatttaaaaa ataaaaaaat aaaaataaat ttaagagaga 781 gagagaggag gagccaagca gccccattta tagtgggctg ggctaccttg ctgttgtagg 841 gtaactgtgg ggaggagcaa acctggctat tgccaggtaa ctctgggggg tggagttacc 901 ttgactactt gactgatggc cacagaatga tggaggtgga tccttggcca gaaaacaggc 961 tgcgtttcat ggtcccacag gccctggggt aataaatcag agcagagcct gtgatacatg 1021 gtgtgtataa aagaaggctc agggggtctg ggctacacag ctcttagaaa gccttggctg 1081 aaccagatgg acaggaggag gatgcctctc tgggca // LOCUS HUMASPX 7787 bp ss-mRNA PRI 25-MAY-1990 DEFINITION Human nonerythroid alpha-spectrin (SPTAN1) mRNA, complete cds. ACCESSION J05243 KEYWORDS alpha-fodrin; nonerythroid alpha-spectrin. SOURCE Human lung fibroblast cell line WI38, cDNA to mRNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 7787) AUTHORS Moon,R.T. and McMahon,A.P. TITLE Generation of diversity of nonerythroid spectrins: Multiple polypeptides are predicted by sequence analysis of cDNAs encompassing the coding region of human nonerythroid alpha- spectrin JOURNAL J. Biol. Chem. 265, 4427-4433 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by R.T.Moon, 12-DEC-1989. FEATURES from to/span description pept 103 7521 nonerythroid alpha-spectrin /hgml_locus_uid="LK0022P" /nomgen="SPTAN1" /map="9q33-q34" BASE COUNT 2147 a 1926 c 2239 g 1475 t ORIGIN 1 gaattcgggg aacggtgtgg agcggaggcc gcggaggctc ctcggtcctt cagcacccct 61 cggcccgacg cacccacgcc cctcaccccc cgagagccga aaatggaccc aagtggggtc 121 aaagtgctgg aaacagcaga ggacatccag gagaggcggc agcaggtcct agaccgatac 181 caccgcttca aggaactctc aacccttagg cgtcagaagc tggaagattc ctatcgattc 241 cagttctttc aaagagatgc tgaagagctg gagaaatgga tacaggaaaa acttcagatt 301 gcatctgatg agaattataa agacccaacc aacttgcagg gaaagcttca gaagcatcaa 361 gcatttgaag ctgaagtgca ggccaactca ggagccattg ttaagctgga tgaaactgga 421 aacctgatga tctcagaagg gcattttgca tctgaaacca tacggacccg tttgatggag 481 ctgcaccgcc agtgggaatt acttttggag aagatgcgag aaaaaggaat caaattgctg 541 caggcccaga acttggtgca gtacttacga gaatgtgagg acgtgatgga ctggatcaat 601 gacaaggaag caattgttac ttctgaagag ctgggccagg atctggagca tgtagaggtt 661 ttacagaaga aatttgaaga gtttcaaaca gatatggctg ctcatgaaga aagagttaat 721 gaagtgaacc agtttgctgc caaactcata caggagcagc accctgagga ggaactgatc 781 aagactaagc aggatgaagt caatgcagcc tggcagcggc tgaagggcct ggctctgcag 841 aggcagggga agctctttgg ggcagcagaa gttcagcgct ttaacaggga tgtggatgag 901 actatcagtt ggattaagga aaaggagcag ttaatggcct ctgatgattt tggccgagac 961 ctggcaagtg ttcaggctct gcttcggaag cacgagggtc tggagagaga tcttgctgct 1021 ctagaagaca aggtcaaagc cctgtgtgct gaggctgacc gcctgcaaca gtcccaccct 1081 ctgagtgcaa cacagattca agtgaagcga gaggaactga ttacaaactg ggagcagatc 1141 cgcaccttgg cggcagagag acatgcacgg ctcaatgatt catacaggct tcaacgcttc 1201 cttgctgact tccgtgacct caccagctgg gtgactgaga tgaaagccct catcaatgca 1261 gatgagcttg ccagtgatgt ggctggggct gaagccctgc tagatagaca ccaagagcac 1321 aagggtgaaa ttgatgccca tgaagacagc ttcaaatctg cagatgaatc tggacaggca 1381 ctgcttgctg ctggtcacta tgcctcagat gaagtgaggg agaagctgac cgtcctttcc 1441 gaggagagag cggcgctgct ggagctgtgg gagctgcgca ggcagcagta cgagcagtgc 1501 atggacctgc agctcttcta ccgggacact gagcaggtgg acaactggat gagcaagcag 1561 gaggcgttcc tgttgaatga agacttggga gatttcttgg atagtgtgga agcgcttctt 1621 aagaagcacg aagactttga gaaatccctt agtgcccagg aggaaaagat tacagcatta 1681 gatgaatttg caaccaagct aattcagaac aaccactatg caatggaaga tgtggccact 1741 cgccgagacg ctctgttgag ccgccgcaat gcccttcacg agagagccat gcgtcgccgg 1801 gcccagctag ccgattcttt ccatctgcag cagtttttcc gtgattctga tgagctcaag 1861 agttgggtga atgagaagat gaaaactgcc acagatgaag cttataaaga tccatccaac 1921 ctacaaggaa aagtacagaa gcatcaggct tttgaggctg agctctcagc aaaccagagc 1981 cgaattgatg ccttggagaa agctggccaa aagctgattg atgtcaacca ctatgccaag 2041 gatgaagtgg cagctcgtat gaatgaggtg atcagtttgt ggaagaaact gctagaggcc 2101 actgaactga aaggaataaa gcttcgtgaa gccaaccagc aacagcaatt taatcgcaat 2161 gttgaggata ttgaattgtg gctatatgaa gtagaaggtc acttggcttc ggatgattac 2221 ggcaaagatc ttaccaatgt gcagaacctc cagaagaaac atgccctgct agaggcagat 2281 gtggctgctc accaggaccg aattgatggc gtcaccattc aggcccgcca gttccaagat 2341 gctggccatt ttgatgcaga aaacatcaag aagaaacagg aagccctcgt ggctcgctat 2401 gaggcactca aggagcccat ggttgcccgg aagcagaagc tggccgattc tctgcggttg 2461 cagcagctct tccgggatgt tgaggatgag gagacgtgga ttcgagagaa agagcccatt 2521 gccgcatcta ccaacagagg taaggattta attggggtcc agaatctgct aaagaaacat 2581 caagccttac aagcagaaat tgctggacat gaaccacgca tcaaagcagt tacacagaag 2641 gggaatgcca tggtggagga aggccatttt gctgcagagg atgtgaaggc caagcttcac 2701 gagctgaacc aaaagtggga ggcactgaaa gcaaaagctt cccagcgtcg gcaggacctg 2761 gaggactctc tgcaggccca gcagtacttt gctgatgcta acgaggctga atcctggatg 2821 cgggagaagg aacccattgt gggcagcact gactatggca aggacgaaga ctctgctgag 2881 gctctactga agaaacacga agctttgatg tcagatctca gtgcctacgg cagcagcatc 2941 caggctttgc gagaacaagc acagtcctgc cggcaacaag tggcccccac ggatgatgag 3001 actgggaagg agctggtctt ggctctctac gactatcagg agaagagtcc ccgagaggtc 3061 accatgaaga agggagatat ccttacctta ctcaacagca ccaacaagga ttggtggaaa 3121 gtggaagtga acgatcgtca gggttttgtg ccggctgcgt acgtgaagaa attggacccc 3181 gcccagtcag cctcccggga gaatctcctg gaggagcaag gcagcatagc actgcggcag 3241 gagcagattg acaatcagac acgcataact aaggaggccg gcagtgtatc tctgcgtatg 3301 aagcaggtgg aagaactata tcattctctg ctggaactgg gtgagaagcg taaaggcatg 3361 ttggagaaga gttgcaagaa gtttatgttg ttccgtgaag cgaatgaact acagcaatgg 3421 atcaatgaga aggaagccgc tctgacaagt gaggaggtcg gagcagactt ggagcaggtt 3481 gaggtgctcc agaagaagtt tgatgacttc cagaaggacc tgaaggccaa tgagtcacgg 3541 ttgaaggaca ttaacaaggt agctgaagac ctggagtctg aaggtcttat ggcagaggag 3601 gtgcaggctg tgcaacaaca ggaagtgtat ggcatgatgc ccagggatga aactgattcc 3661 aagacagcct ccccgtggaa gtctgctcgt ctgatggttc acaccgtggc cacctttaat 3721 tccatcaagg agctgaatga gcgctggcgg tccctacagc agctggccga ggaacggagc 3781 cagctcttgg gcagcgccca tgaagtacag aggttccaca gagatgctga tgaaaccaaa 3841 gaatggattg aagagaagaa tcaagctcta aacacagaca attatggaca tgatctcgcc 3901 agtgtccagg ccctgcaacg caagcatgag ggcttcgaga gggaccttgc ggctctcggt 3961 gacaaggtaa actcccttgg tgaaacagca gagcgcctga cccagtccca tcccgagtca 4021 gcagaagacc tgcaggaaaa gtgcacagag ttaaaccagg cctggagcag cctggggaaa 4081 cgtgcagatc agcgcaaggc aaagttgggt gactcccacg acctgcagcg cttccttagc 4141 gatttccggg acctcatgtc ttggatcaat ggaatacggg ggttggtgtc ctcagatgag 4201 ctagccaagg atgtcaccgg agctgaggca ttgctggagc gacaccagga acaccggaca 4261 gaaatcgatg ccagggctgg cactttccag gcatttgagc agtttggaca gcagctgttg 4321 gctcacggac actatgccag ccctgagatc aagcagaaac ttgatattct tgaccaggag 4381 cgtgcagacc tggagaaggc ctgggttcag cgcaggatga tgctggatca gtgccttgaa 4441 ctgcagctgt tccatcggga ctgtgagcaa gctgagaact ggatggctgc ccgggaggcc 4501 ttcttgaata ccgaagacaa aggagactca ctggacagcg tagaggctct gatcaaaaaa 4561 catgaagact ttgacaaagc gattaacgtc caggaagaga agattgctgc tctgcaggcc 4621 tttgccgacc agctcatcgc tgccggccat tatgccaagg gagacatttc tagccggcgc 4681 aatgaggtct tggacaggtg gcgacgtctg aaagcccaga tgattgagaa aaggtcaaag 4741 ctaggagaat ctcaaaccct ccaacagttc agccgggatg tggatgagat tgaggcttgg 4801 atcagtgaaa aattgcaaac agcgagtgat gagtcgtaca aggatcccac caacatccag 4861 agcaagcacc agaagcacca ggcttttgaa gcagagctgc atgccaacgc tgaccggatc 4921 cgtggggtta tcgacatggg caactccctc attgaacgtg gagcctgtgc cggcaatgag 4981 gatgctgtca aggcccgcct ggctgcctta gctgaccagt ggcaattctt ggtgcaaaag 5041 tcagcggaaa agagccagaa actgaaagaa gccaacaagc agcagaactt caacacaggg 5101 atcaaggaca ttgcattctg gctgtctgag gtggaggccc tgctggcatc cgaagattat 5161 ggcaaagacc tggcttctgt gaacaacctg ctgaaaaagc atcaactgct ggaagcagat 5221 atatctgccc atgaggatcg cctgaaggac ctgaacagcc aggcagacag cctgatgacc 5281 agcagtgcct tcgacacctc ccaagtaaag gacaagaggg acaccatcaa cgggcgcttc 5341 cagaagatca agagcatggc ggcctcccgg cgagccaagc tgaatgaatc ccatcgcctg 5401 caccagttct tccgggacat ggatgacgag gagtcctgga tcaaggagaa gaagctgctg 5461 gtgggctcag aggactacgg ccgggaccta actggcgtgc agaacctgag gaagaagcac 5521 aagcggctgg aagcagaact ggctgcgcat gagccggcta ttcagggtgt cctggacact 5581 ggcaagaagc tgtccgatga caacaccatc gggaaagagg agatccagca gcggctggcg 5641 cagtttgtgg agcactggaa agagctgaag cagctggcag ctgcccgggg tcagcggctg 5701 gaagagtcct tggaatatca gcagtttgta gccaatgtgg aagaggaaga agcctggatc 5761 aatgagaaaa tgaccctggt ggccagcgaa gattatggcg acactcttgc cgccatccag 5821 ggcttactga agaaacatga agcttttgag acagccttca ccgtccacaa ggatcgcgtg 5881 aatgatgtct gcaccaatgg acaagacctc attaagaaga acaatcacca tgaggagaac 5941 atctcttcaa agatgaaggg cctgaacggg aaagtgtcag acctggagaa agctgcagcc 6001 cagagaaagg cgaacgtgga tgagaactcg gccttccttc agttcaactg gaaggcggac 6061 gtggtggagt cctggatcgg tgaaaaggag aacagcttga agacagatga ttatggccga 6121 gacctgtctt ctgtgcagac gctcctcacc aaacaggaaa cttttgacgc tgggctgcag 6181 gccttccagc aggaaggcat tgccaacatc actgccctca aagatcagct tctcgccgcc 6241 aaacacgttc agtccaaggc catcgaggcc cggcacgcct ccctcatgaa gaggtggagc 6301 cagcttctgg ccaactcagc cgcccgcaag aagaagcttc tggaggctca gagtcacttc 6361 cgcaaggtgg aggacctctt cctgaccttc gccaaaaagg cttctgcctt caacagctgg 6421 tttgaaaatg cagaggagga cttaacagac cccgtgcgct gcaactcctt ggaagaaatc 6481 aaagctttgc gcgaggccca cgacgccttc cgctcctccc tcagctctgc ccaggctgac 6541 ttcaaccagc tggccgagct ggaccgccag atcaagagct tccgcgtagc ctccaacccc 6601 tacacctggt ttaccatgga ggccctggag gagacctgga ggaacctaca gaaaatcatc 6661 aaggagaggg agctggagct gcagaaggaa cagcggcggc aggaggagaa cgacaagctg 6721 cgccaggagt ttgcccagca cgccaacgcc ttccaccagt ggatccaaga gaccaggaca 6781 tacctcctcg atgggtcctg tatggtggaa gagtcgggga ccctcgaatc ccagcttgaa 6841 gctaccaaac gcaagcacca ggaaatccga gccatgagaa gtcagctcaa aaagatcgag 6901 gacctggggg ccgccatgga ggaggccctc atcctggaca acaagtacac ggagcacagc 6961 accgtgggcc tcgcccagca gtgggaccag ctggaccagc tgggcatgcg catgcagcac 7021 aacctggagc agcagatcca ggccaggaac acaacaggtg tgactgagga ggccctcaaa 7081 gaattcagca tgatgtttaa acactttgac aaggacaagt ctggcaggct gaaccatcag 7141 gatggcaaat cttgcctgcg ctccctgggc tatgacctgc ccatggtgga ggaaggggaa 7201 cctgaccctg agttcgaggc aatcctggac acggtggatc cgaacagaga tggccatgtc 7261 tccttgcaag aatacatggc tttcatgatc agccgcgaaa ctgagaacgt caagtccagc 7321 gaggagattg agagcgcctt ccgggccctc agctcagagg gaaagcctta cgtgaccaag 7381 gaggagctct accagaacct gacccgggaa caagccgact actgcgtctc ccacatgaag 7441 cccatcgtgg acggcaaggg ccgcgagctc cccaccgcgt tcgactacgt ggagttcacc 7501 cgctcgcttt tcgtgaactg agccactccc tgggtcaccc acccctcgct gcttgccctg 7561 cgtcgccttg ctgcatgtcc gctcctctgt gtgctctcac tttccactgt aaccttaagc 7621 ctgcttagct tggaataaga cttaggagaa aatggtgctt cactaacccg cttccggtcc 7681 agtcacaatc atcatgtcac tgtgggaccc agatctgtgt cttgaagcag ctgccctcat 7741 tccgacttca gaaaatcgaa gcagctggcg cctccccttc ggaattc // LOCUS CIPFNRA 1419 bp ss-mRNA PLN 25-MAY-1990 DEFINITION M.crystallinum ferredoxin-NADP+ reductase (fnrA) mRNA, complete cds. ACCESSION M25528 KEYWORDS ferredoxin-NADP+ reductase. SOURCE M.crystallinum, cDNA to mRNA, clone lambdaMC-fnr1. ORGANISM Mesembryanthemum crystallinum Eukaryota; Plantae; Embryobionta; Magnoliophyta; Magnoliopsida; Caryophyllidae; Caryophyllales; Aizoaceae. REFERENCE 1 (bases 1 to 1419) AUTHORS Michalowski,C.B., Schmitt,J.M. and Bohnert,H.J. TITLE Expression during salt stress and nucleotide sequence of cDNA for ferredoxin-NADP+ reductase from Mesembryanthemum crystallinum JOURNAL Plant Physiol. 89, 817-822 (1989) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by H.J.Bohnert, 13-JUN-1989. FEATURES from to/span description pept 75 1172 ferredoxin-NADP+ reductase precursor (fnrA; EC 1.6.7.1) sigp 75 230 ferredoxin-NADP+ reductase signal peptide matp 231 1169 ferredoxin-NADP+ reductase mRNA 45 1395 fnrA mRNA (alt.) mRNA < 1 1419 fnrA mRNA (alt.) BASE COUNT 416 a 283 c 322 g 398 t ORIGIN 1 tttttttttt ttctggttcc tcttcttctt cttcttcttc ttctgcaaag atatcttctt 61 caatcagctc agcaatggct gccgcagtaa ctgccgcagt ctctttccct tctactaaat 121 ccactcccct ttctaccaga acatcttccg tcatcaccca tgagaaaatt aacttcaaca 181 aggttcctct atactacagg aatgtatcag ttggtgggaa agtgggaacc attagggcag 241 tagcctcaga tgttgaggca cctgttgcaa aggtggagaa gcattcaaag aaaatggagg 301 aaggagtgat agtcaacaag tacaagccca agaaccctta cactggcaga tgcctcctca 361 acactaagat tactggagat gatgctcctg gagagacctg gcacatggtt ttctcccatg 421 aaggagagat tccctacaga gaggggcagt ctgttggagt gattcccgaa ggtattgaca 481 agaatggaaa gccccacaag ttgagattgt actcaattgc cagcaggccc ttgggcgact 541 ttggtgactc taaaactgtt tccttgtgtg taaaacgact tatctacacc aatgacaacg 601 gagaaatagt caagggtgtt tgctcaaact ttttgtgtga tttgaaacct ggtagtgaag 661 tagtgctaac tggaccagtt gggaaggaaa tgctgatgcc aaaagaccct aatgccacaa 721 ttatcatgct tgcaactgga actggtattg ctcctttccg ttcgttcttg tggaaaatgt 781 tctttgagaa gcacgatgac tacaagttca atggtttggc atggctcttc ttgggtgtac 841 ccactagcag ttctttgtta tacaaggagg aatttgagaa gatgaaggag aaagctccag 901 aaaacttcag gttggatttc gctgtgagca gagaacaaac taatgagaaa ggggagaaaa 961 tgtacatcca aacccgtatg gcacaatatg atagagagct atgggaattg ctcaagaaag 1021 acaacactta cgtctacatg tgtgggctga aaggcatgga gaaaggaatc gatgacatta 1081 tggtttcatt ggccgcagaa gatggtattg actggttcga ctacaagaag cagttgaaga 1141 aggcagaaca gtggaacgtg gaagtctact gataaacttg tacaaaactc tcttctttct 1201 ttttgtgaag cagaatatat ctatttgtca tctcccactg caccttgtag ataggtcagt 1261 ttcttttcga tattgttatg aatatcagct aattgcattg tagaagctat agccaaaaaa 1321 gctctgtttt ggagtactca tgtatcgcgt aattatactg gaaaccagtg tcaacacttc 1381 tacactgaga aattcatgag gatgaacaca ttatgctca // LOCUS HUMCAMPPK 3036 bp ss-mRNA PRI 25-MAY-1990 DEFINITION Human cAMP-dependent protein kinase type I-alpha subunit, complete cds. ACCESSION M33336 KEYWORDS cAMP-dependent protein kinase; cAMP-dependent protein kinase type I; protein kinase. SOURCE Human testis, cDNA to mRNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 3036) AUTHORS Sandberg,M., Skalhegg,B. and Jahnsen,T. TITLE The two mRNA forms for the type I-alpha regulatory subunit of cAMP-dependent protein kinase from human testis are due to the use of different polyadenylation site signals JOURNAL Biochem. Biophys. Res. Commun. 167, 323-330 (1990) STANDARD simple staff_entry FEATURES from to/span description pept 88 1233 protein kinase type I-alpha subunit /nomgen="PRKAR1" /map="7p13-qter" /hgml_locus_uid="LW0133F" BASE COUNT 821 a 552 c 682 g 981 t ORIGIN 1 gctgggagca aagcgctgag ggagctcggt acgccgccgc ctcgcacccg cagcctcgcg 61 cccgccgccg cccgtcccca gagaaccatg gagtctggca gtaccgccgc cagtgaggag 121 gcacgcagcc ttcgagaatg tgagctctac gtccagaagc ataacattca agcactgctc 181 aaagattcta ttgtgcagtt gtgcactgct cgacctgaga gacccatggc attcctcagg 241 gaatactttg agaggttgga gaaggaggag gcaaaacaga ttcagaatct gcagaaagca 301 ggcactcgta cagactcaag ggaggatgag atttctcctc ctccacccaa cccagtggtt 361 aaaggtagga ggcgacgagg tgctatcagc gctgaggtct acacggagga agatgcggca 421 tcctatgtta gaaaggttat accaaaagat tacaagacaa tggccgcttt agccaaagcc 481 attgaaaaga atgtgctgtt ttcacatctt gatgataatg agagaagtga tatttttgat 541 gccatgtttt cggtctcctt tatcgcagga gagactgtga ttcagcaagg tgatgaaggg 601 gataacttct atgtgattga tcaaggagag acggatgtct atgttaacaa tgaatgggca 661 accagtgttg gggaaggagg gagctttgga gaacttgctt tgatttatgg aacaccgaga 721 gcagccactg tcaaagcaaa gacaaatgtg aaattgtggg gcatcgaccg agacagctat 781 agaagaatcc tcatgggaag cacactgaga aagcggaaga tgtatgagga attccttagt 841 aaagtctcta ttttagagtc tctggacaag tgggaacgtc ttacggtagc tgatgcattg 901 gaaccagtgc agtttgaaga tgggcagaag attgtggtgc agggagaacc aggggatgag 961 ttcttcatta ttttagaggg gtcagctgct gtgctacaac gtcggtcaga aaatgaagag 1021 tttgttgaag tgggaagatt ggggccttct gattattttg gtgaaattgc actactgatg 1081 aatcgtcctc gtgctgccac agttgttgct cgtggcccct tgaagtgcgt taagctggac 1141 cgacctagat ttgaacgtgt tcttggccca tgctcagaca tcctcaaacg aaacatccag 1201 cagtacaaca gttttgtgtc actgtctgtc tgaaatctgc ctcctgtgcc tcccttttct 1261 cctctcccca atccatgctt cactcatgca aactgcttta ttttccctac ttgcagcgcc 1321 aagtggccac tggcatcgca gcttcctgtc tgtttatata ttgaaagttg cttttattgc 1381 accattttca atttggagca ttaactaaat gctcatacac agttaaataa atagaaagag 1441 ttctatggag actttgctgt tactgcttct ctttgtgcag tgttagtatt caccctgggc 1501 agtgagtgcc atgctttttg gtgagggcag atccagcacc tattgaatta ccatagagta 1561 atgatgtaac agtgcaagat tttttttttt aagtgacata attgtccagt tataagcgta 1621 tttagactgt ggccatatat gctgtatttc tttgtagaat aaatggtttc tcattaaact 1681 ctaaagatta gggaaatgga tatagaaaat cttagtatag tagaaagaca tctgcctgta 1741 attaaactag tttaagggtg gaaaaatgaa aatttttgct aattatcaat gggatatgat 1801 tggttcagtt ttttttttcc agagttgttg tttgccaagc taatctgcct ggtttattta 1861 tatcttgtta ttaatgtttc ttctccaatt ctgaaatact tttgagtatg gctatctata 1921 cctgcctttt aagtttgaaa ctaactcata gatgcaaata ttggttagta tttaactaca 1981 tctgcctcgg ctcacaaatt ccgattagac ctttatccag ctagtgccaa ataattgatc 2041 agatgctgaa ttgagaataa gaatttgagg tctacattct tggttgttaa tttagagcgt 2101 ttggttaaag tatgtccttc agctgactcc agtataatct cctctgctca ttaaactgat 2161 tccaggagat tggatttgct gtgactagat acagatggag caaatgtcct aacagagaaa 2221 tagaggtgat gctgctaaag ggagaaatgc caggcggaca aagttcagtg tcgggaattt 2281 tccccgtgac attcactggg gcatgagatt ttggaagaag ttttttactt tggtttagtc 2341 tttttttcct cctttttatt cagctagaat ttctggtggg ttgatggtag ggtataatgt 2401 gtctgtgttg cttcaaattg gtctgaaagg ctatcctgct gaaagtcctg ctttcctatc 2461 tagcatttat tcctctggca aacttttctt tcttttcttt tttaaagtaa acttgtgtat 2521 tgagtcttaa ctgtatttca gtattttcca gccttatgtg ttacattatt ccaatgatac 2581 ccaacagttt atttttatta tttttttaaa caaaatttca cagttctgta atgtaggcac 2641 ttttattttc attgtgattt atatataagg taatgtaggg ttatatttgg gagtgactgc 2701 aagcattttt ccatctgtgt gcaactaact gactctgtta ttgatccctt ctcctgccct 2761 ttcccaggta atttaaattg gtcatggtag atttttttca tagatttgaa aaacttttag 2821 gttgttacca agtatgaagt ataaatctgg ggaagaggtt ttatttacat tttagggtgg 2881 gtaagaaagc caccttgtta caaatttttt aatttccaaa ataatctata ttaaatgagg 2941 gtttctgatc tgtactttgt gtttagctac ctttttatat ttaaaaaatt aaaaatgaaa 3001 attatgttct tacaagctta aagcttgatt tgatct // LOCUS HUMIGHCU 426 bp ss-mRNA PRI 25-MAY-1990 DEFINITION Human Ig rearranged H-chain mRNA V-J4-region, hybridoma AE6-5, 5' end. ACCESSION M29468 KEYWORDS immunoglobulin heavy chain; joining exon; processed gene; variable region; variable region subgroup VH-I. SOURCE Human (patient POP) hybridoma AE6-5 DNA, clone pHuUCVH, derived from B-CLL cell line UC729-6. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 426) AUTHORS Spatz,L.A., Wong,K.K., Williams,M., Desai,R., Golier,J., Berman,J.E., Alt,F.W. and Latov,N. TITLE Cloning and sequence analysis of the variable heavy (VH) and light (VL) chain regions of an anti-myelin/DNA antibody from a patient with peripheral neuropathy and chronic lymphocytic leukemia JOURNAL J. Immunol. 144, 2821-2828 (1990) STANDARD full staff_entry COMMENT Draft entry and printed sequence for [1] kindly submitted by L.A.Spatz, 26-OCT-1989, for release after publication. Columbia University, Department of Neurology BB-322, 630 W. 168th street, New York, NY 10032. FEATURES from to/span description pept 1 > 426 Ig H-chain V-J4-region precursor /nomgen="IGHV" /map="14q32.33" /hgml_locus_uid="LK0080M" sigp 1 57 Ig H-chain signal peptide matp 58 > 426 Ig H-chain recomb 384 385 V-region DNA end/J4-region DNA start BASE COUNT 93 a 114 c 129 g 90 t ORIGIN 1 atggactgga cctggaggat cctcttcttg gtggcagcag ccacaggagc ccactcccag 61 gtgcagctgg tgcaatctgg ggctgaggtg aagaagcctg gggcctcagt gaaggtctcc 121 tgcgaggctt ctggatacac cttcaccggc cactatatgc actgggtcgg acaggcccct 181 ggacaagggc ttgagtggat ggggtggatc aaccctaaca gtggtggcac aaactatgca 241 cagaagtttc agggcagggt caccatcacc agggacacgt ccatcaacac agcctacatg 301 gagctgagca ggctgagatc tgacgacacg gccgtgtatt actgtgcgag agcctcatat 361 tgtggttatg actgctatta cttctttgac tactggggcc agggaaccct ggtcaccgtc 421 tcctcg // LOCUS HUMIGHCV 489 bp ss-mRNA PRI 25-MAY-1990 DEFINITION Human Ig rearranged anti-myelin H-chain mRNA V-J4-region, hybridoma AE6-5, 5' end. ACCESSION M29470 KEYWORDS autoantibody; immunoglobulin heavy chain; joining exon; processed gene; variable region; variable region subgroup VH-III. SOURCE Human (patient POP) hybridoma AE6-5 DNA, clone pHuPopVH, derived from B-CLL cell line UC729-6. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 489) AUTHORS Spatz,L.A., Wong,K.K., Williams,M., Desai,R., Golier,J., Berman,J.E., Alt,F.W. and Latov,N. TITLE Cloning and sequence analysis of the variable heavy (VH) and light (VL) chain regions of an anti-myelin/DNA antibody from a patient with peripheral neuropathy and chronic lymphocytic leukemia JOURNAL J. Immunol. 144, 2821-2828 (1990) STANDARD full staff_entry COMMENT Draft entry and printed sequence for [1] kindly submitted by L.A.Spatz, 26-OCT-1989, for release after publication. Columbia University, Department of Neurology BB-322, 630 W. 168th street, New York, NY 10032. FEATURES from to/span description pept 64 > 489 Ig H-chain V-J4-region precursor /nomgen="IGHV" /map="14q32.33" /hgml_locus_uid="LK0080M" sigp 64 120 Ig H-chain signal peptide matp 121 > 489 Ig H-chain recomb 450 451 V-region DNA end/J4-region DNA start BASE COUNT 101 a 121 c 144 g 123 t ORIGIN 1 cccagccctg ggattttcag gtgttttcat ttggtgatca ggactgaaca gagagaactc 61 accatggagt ttgggctgag ctggcttttt cttgtggcta ttttaaaagg tgtccagtgt 121 gaggtgcagc tgttggagtc tgggggaggc ttggtacagc ctgaggggtc cctgagactc 181 tcctgtgcag tctccggatt cacttttagc agctttgcca tgacctgggt ccgccaggct 241 ccagggaagg ggctggagtg ggtctcagct attagtacta gtggtggtag cacatactac 301 gcagagtccg tgaagggccg cttcaccatc tccagagaca attccatgca cacgctgtat 361 ctgcaaatga acagcctgag agccgaggac acggccgtct attactgtgc gaaaggtcct 421 acatattgta gtagaatcag ctgccctccg gactactggg gccagggaac cctggtcacc 481 gtctcctca // LOCUS HUMIGHVW 778 bp ds-DNA PRI 25-MAY-1990 DEFINITION Human Ig germline H-chain pseudogene V-J6-region, hybridoma AE6-5, 5' end. ACCESSION M29466 KEYWORDS germline; immunoglobulin heavy chain; joining exon; variable region; variable region subgroup VH-I. SOURCE Human (patient POP) hybridoma AE6-5 DNA, clone II4a, derived from B-CLL cell line UC729-6. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 257 to 778) AUTHORS Spatz,L.A., Wong,K.K., Williams,M., Desai,R., Golier,J., Berman,J.E., Alt,F.W. and Latov,N. TITLE Cloning and sequence analysis of the variable heavy (VH) and light (VL) chain regions of an anti-myelin/DNA antibody from a patient with peripheral neuropathy and chronic lymphocytic leukemia JOURNAL J. Immunol. 144, 2821-2828 (1990) STANDARD full staff_entry REFERENCE 2 (bases 1 to 256) AUTHORS Spatz,L.A., Wong,K.K., Williams,M., Desai,R., Golier,J., Berman,J.E., Alt,F.W. and Latov,N. JOURNAL Unpublished (1990) STANDARD full staff_entry COMMENT Draft entry and printed sequence for [1] kindly submitted by L.A.Spatz, 26-OCT-1989, for release after publication. Columbia University, Department of Neurology BB-322, 630 W. 168th street, New York, NY 10032. FEATURES from to/span description pept.ps 257 302 pseudo-Ig H-chain V-J6-region precursor, exon 1 /nomgen="IGHV" /map="14q32.33" /hgml_locus_uid="LK0080M" 384 > 778 pseudo-Ig H-chain V-J6-region precursor, exon 2 sigp.ps 257 302 pseudo-Ig H-chain signal peptide 384 394 pseudo-Ig H-chain signal peptide matp.ps 395 > 778 pseudo-Ig H-chain recomb 733 734 V-region DNA end/J6-region DNA start signal 157 164 octamer recognition sequence BASE COUNT 193 a 197 c 216 g 172 t ORIGIN 1 aacacagagt tcagggatgg gtaggggatg cgtggcctct aacaggatta cggcttgaac 61 cctcagcttc tacaattgtg tcgtccatgt gtcatgtatt tgctctttct catcctgggt 121 caggaattgg gctattaaat agcatccttc atgaatatgc aaataactga ggtgaatata 181 gatatctgtg tgccctgaga gcatcaccca aaaaccacac ccctccttgg gagaatccct 241 agatcacagc tctaccatgg actggacctg gagcatcctt ttcttggtgg cggcagcaac 301 actaacggac tccccagtcc cggggctgag agagaaacca ggccagtcat gtgagacttc 361 acccatcctg tgtctctcca caggtgccca ctcccaggtt cgactggtgc agtctggagc 421 tgacgtgaag aagcctgggg cctcagtgaa ggtctcctgc aaggcttctg gttacacctt 481 taccagctat gatatcagct gggtgcgaca ggcccctgga caagggcttg agtggatggg 541 atggatcagc gcttttaatg gtaacacaga ctatgcacag aagttccagg gcagggtcac 601 catgaccaca gacacatcca cgagcacagc ctacacggag ctgaggagcc tgagatctga 661 cgacacggcc gtgtattact gtgcgagaga cgggggcttg ggagtggtgg tgatagttga 721 aacctaaagc aagtactaca acgatatgga cgtctggggc caagggacca cggtcacg // LOCUS HUMIGKAW 405 bp ss-mRNA PRI 25-MAY-1990 DEFINITION Human Ig rearranged kappa-chain mRNA V-J1-region, hybridoma AE6-5, 5' end. ACCESSION M29467 KEYWORDS joining exon; kappa-immunoglobulin; processed gene. variable region subgroup VK-IV; variable region. SOURCE Human (patient POP) hybridoma AE6-5 DNA, clone pHuUCVK, derived from B-CLL cell line UC729-6. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 405) AUTHORS Spatz,L.A., Wong,K.K., Williams,M., Desai,R., Golier,J., Berman,J.E., Alt,F.W. and Latov,N. TITLE Cloning and sequence analysis of the variable heavy (VH) and light (VL) chain regions of an anti-myelin/DNA antibody from a patient with peripheral neuropathy and chronic lymphocytic leukemia JOURNAL J. Immunol. 144, 2821-2828 (1990) STANDARD full staff_entry COMMENT Draft entry and printed sequence for [1] kindly submitted by L.A.Spatz, 26-OCT-1989, for release after publication. Columbia University, Department of Neurology BB-322, 630 W. 168th street, New York, NY 10032. FEATURES from to/span description pept 1 > 405 Ig kappa-chain V-J1-region precursor /nomgen="IGKV" /map="2p12" /hgml_locus_uid="LF0081F" sigp 1 60 Ig kappa-chain signal peptide matp 61 > 405 Ig kappa-chain recomb 369 370 V-region DNA end/J1-region DNA start BASE COUNT 95 a 110 c 101 g 99 t ORIGIN 1 atggtgttgc agacccaggt cttcatttct ctgttgctct ggatctctgg tgcctacggg 61 gacatcgtga tgacccagtc tccagactcc ctggctgtgt ctctgggcga gagggccacc 121 atcaactgca agtccagcca gagtctttta tacagctcca acaataagaa ctacttagct 181 tggtaccagc agaaaccagg acagcctcct aagttgctca tttactgggc atctacccgg 241 gaatccgggg tccctgaccg attcagtggc agcgggtctg ggacagattt cactctcacc 301 atcagcagcc tgcaggctga agatgtggca gtttattact gtcagcaata ttatagtact 361 cctccgatgt tcggccaagg gaccaaggtg gaaatcaaac gaact // LOCUS HUMIGKAX 407 bp ss-mRNA PRI 25-MAY-1990 DEFINITION Human Ig rearranged anti-myelin kappa-chain mRNA V-J4-region, hybridoma AE6-5, 5' end. ACCESSION M29469 KEYWORDS autoantibody; joining exon; kappa-immunoglobulin; processed gene; variable region; variable region subgroup VK-IIIa. SOURCE Human (patient POP) hybridoma AE6-5 DNA, clone pHuPopVK, derived from B-CLL cell line UC729-6. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 407) AUTHORS Spatz,L.A., Wong,K.K., Williams,M., Desai,R., Golier,J., Berman,J.E., Alt,F.W. and Latov,N. TITLE Cloning and sequence analysis of the variable heavy (VH) and light (VL) chain regions of an anti-myelin/DNA antibody from a patient with peripheral neuropathy and chronic lymphocytic leukemia JOURNAL J. Immunol. 144, 2821-2828 (1990) STANDARD full staff_entry COMMENT Draft entry and printed sequence for [1] kindly submitted by L.A.Spatz, 26-OCT-1989, for release after publication. Columbia University, Department of Neurology BB-322, 630 W. 168th street, New York, NY 10032. FEATURES from to/span description pept 24 > 407 Ig kappa-chain V-J4-region precursor /nomgen="IGKV" /map="2p12" /hgml_locus_uid="LF0081F" sigp 24 83 Ig kappa-chain signal peptide matp 84 > 407 Ig kappa-chain recomb 368 369 V-region DNA end/J-region DNA start BASE COUNT 95 a 123 c 102 g 87 t ORIGIN 1 ctcagttagg acccagagga accatggaag ccccagctca gcttctcttc ctcctgctac 61 tctggctccc agataccacc ggagaaattg tgttgacaca gtctccagcc accctgtctt 121 tgtctccagg ggaaagagcc accctctcct gcagggccag tcagagtgtt agcagctact 181 tagcctggta ccaacagaaa cctggccagg ctcccaggct cctcatctat gatgcatcca 241 acaaggccac tggcgtccca gccaggttca gtggcagtgg gtctgggaca gacttcactc 301 tcaccatcag cagcctagag cctgaagatt ttgcagttta ttactgtcag cagagtagca 361 agtggcctct cactttcggc ggagggacca aggtggagat caaagga // LOCUS HUMLSP1A 1631 bp ss-mRNA PRI 25-MAY-1990 DEFINITION Human lymphocyte-specific protein 1 (LSP1) mRNA, complete cds. ACCESSION M33552 KEYWORDS lymphocyte-specific protein; phosphoprotein. SOURCE Human CTL AH2 cell line, cDNA to mRNA, clones hlsp1.[3,5-1]. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 1631) AUTHORS Jongstra-Bilen,J., Young,A.J., Chong,R. and Jongstra,J. TITLE Human and mouse LSP1 genes code for highly conserved phosphoproteins JOURNAL J. Immunol. 144, 1104-1110 (1990) STANDARD simple staff_entry FEATURES from to/span description pept 109 1128 lymphocyte-specific protein 1 (LSP1) mRNA < 1 1631 LSP1 mRNA BASE COUNT 373 a 496 c 500 g 262 t ORIGIN 1 tttaacaccc atagtaggcc taaaagcagc caccaattaa gaaagccaca gcacgtacac 61 ccactccagg gatctgccag caccctgtgg ggcccagact acaggctgat ggcggaggct 121 tcgagtgacc cgggtgctga ggagcgggaa gagttgctgg ggcccactgc tcagtggagc 181 gtggaggacg aggaggaggc cgtccacgag caatgccagc atgagagaga caggcagctt 241 caggcccagg acgaggaggg aggcggccat gtccccgagc ggccgaagca ggagatgctc 301 ctcagcctga agccctcgga ggcccctgaa ctggatgagg acgagggctt tggcgactgg 361 tcccagaggc cagagcagcg gcagcagcac gagggggcgc agggcgcctt ggacagcgga 421 gagccccccc agtgcaggag tcctgagggg gagcaagagg acaggcccgg cctgcatgcc 481 tacgaaaagg aggacagtga tgaagtccac ctggaggagt tgagtctgag caaggagggg 541 ccaggcccag aggacactgt ccaggacaac ctgggggccg caggggctga ggaggaacag 601 gaggagcacc agaaatgtca gcagcccagg acacccagcc ccttggtctt ggaggggacc 661 atcgaacaga gctcgcctcc cctgagccct accaccaaac tcatcgacag gaccgagtcc 721 ctaaaccgct ccatagagaa gagtaacagt gtgaagaaat cccagccaga cttgcccatc 781 tccaagattg atcagtggct ggaacaatac acccaggcca tcgagaccgc tggccggacc 841 cccaagctag cccgccaggc ctccatagag ctgcccagca tggctgtggc cagtaccaag 901 agtcggtggg agacgggtga ggtacaggct cagtctgcgg ccaagactcc gtcctgcaag 961 gatattgtgg ctggagacat gagcaagaaa agcctctggg agcagaaggg aggctccaag 1021 acctcatcaa caattaagag caccccatct gggaagaggt ataagtttgt ggccaccggg 1081 catgggaagt atgagaaggt gcttgtggaa gggggcccgg ctccctaggc gtcccatctc 1141 gcttcctggg tctgcaggtc cagccggctg gcaccctcca tgtacccagg ggagattcca 1201 gccagacacc cgccccccgg ccctggctaa gaagttgctt cctgttgcca gcatgaccta 1261 ccctcgcctc tttgatgcca tccgctgcca cctccttttg ctcctggacc ctttagcctc 1321 tctgcccttc cactctctga ccaccgcccc cgccctcccc acccagctcc gcttcttgtt 1381 acttggggga ggaaagaaac tcctgatcat tggccaaagg gacttacccc tggagaggcc 1441 aagtgccttc taggaagtta ggaggttgag gcacagcctg tgcagagagg gtgggtcacc 1501 cccccagatc caaggggaaa ctgcaggtca agggctgata acggccatgc aggatgcttg 1561 atgctgcgtc ccccgctgct tgccgccccc caccccgcca ttttgtataa taaagctccc 1621 tgtgtattct c // LOCUS MUSVG1A 1985 bp ss-mRNA ROD 25-MAY-1990 DEFINITION Mouse Vgr-1 protein mRNA, complete cds. ACCESSION J04566 KEYWORDS Vgr-1 protein. SOURCE Mouse (strain ICR) 8.5 day embryo (George), cDNA to mRNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 1985) AUTHORS Lyons,K., Graycar,J.L., Lee,A., Hashmi,S., Lindquist,P.B., Chen,E.Y., Hogan,B.L.M. and Derynck,R. TITLE Vgr-1, a mammalian gene related to Xenopus Vg-1, is a member of the transforming growth factor beta gene superfamily JOURNAL Proc. Natl. Acad. Sci. U.S.A. 86, 4554-4558 (1989) STANDARD full staff_entry COMMENT Draft entry and printed sequence for [1] kindly submitted by A.Lee. 22-MAY-1989, for release after publication. FEATURES from to/span description pept 16 1332 Vgr-1 protein BASE COUNT 448 a 566 c 551 g 420 t ORIGIN 1 ccttgaccgt aagacatgag gaagatgcaa aaggagatcc tgtcggtgct ggggcccccg 61 cacaggccgc ggcccctgca cggtctccag cagcctcagc ccccggtgct cccgccacag 121 cagcagcagc agcagcagca gcagcagacg gcccgcgagg agccccctcc agggcggctg 181 aagtccgctc cactcttcat gctggatctc tacaacgccc tgtccaatga cgacgaagag 241 gatggggcat cggagggtgt ggggcaagag cctgggtccc acggaggggc cagctcgtcc 301 cagctcaggc agccgtctcc cggcgctgca cactccttga accgcaagag tctcctggcc 361 ccgggacccg gtggcggtgc gtccccactg actagcgcgc aggacagcgc tttcctcaac 421 gacgcggaca tggtcatgag ctttgtgaac ctggtggagt acgacaagga gttctcccca 481 catcaacgac accacaaaga gttcaagttc aacctatccc agattcctga gggtgaggcg 541 gtgacggctg ctgagttccg cgtctacaag gactgtgtgg tggggagttt taaaaaccaa 601 acctttctta tcagcattta ccaagtcttg caggagcatc agcacagaga ctctgaccta 661 tttttgttgg acacccgggt ggtgtgggcc tcagaagaag gttggctgga atttgacatc 721 acagcaacta gcaatctgtg ggtggtgaca ccgcagcaca acatggggct ccagctgagt 781 gtggtgactc gggatggact ccacgtcaac ccccgtgcgg cgggcctggt gggcagagac 841 ggcccttacg acaagcagcc cttcatggtg gccttcttca aggtgagcga ggtccacgtg 901 cgcaccacca ggtcagcctc cagtcggcgg cggcagcaga gtcgcaaccg gtccacccag 961 tcgcaggacg tgtcccgggg ctccggttct tcagactaca acggcagtga gttaaaaaca 1021 gcttgcaaga agcatgagct ctatgtgagc ttccaggacc tgggatggca ggactggatc 1081 attgcaccca aaggctacgc tgccaactac tgtgatggag agtgttcctt cccactcaac 1141 gcacacatga atgccaccaa ccacgccatt gtacagacct tggtccacct tatgaatccc 1201 gagtacgtcc ccaaaccatg ctgcgcacca accaaactga atgccatctc ggttctttac 1261 ttcgatgata actccaatgt catcttgaaa aagtacagga atatggtcgt gagagcttgt 1321 ggttgccatt aagttgaagc tggtgtgtgt gtgtgggtgg gggcatggtt ctgccttgga 1381 ttcctaacaa caacatctgc cttaaaccac gaacaacagc acagcgaagc gggatggtga 1441 cacacagagg gatcgtgaca cgcagacaca tctcccgctg gtgccttacc cacggaggct 1501 tttatgagga ccttgtcaag ggctttccca gttcctaact gagcagttgc tggtctgcag 1561 gaagctggaa ggcttgtagt acaggcctgg aaactgcagt tacctaatgt tcgcctcccc 1621 caaccccgcc cggagtagtt ttagctttta gatctagctg cttgtggtgt aagtaaactt 1681 gaaggaatat taaatatccc tgggttgaaa gacccggtgg tggctctaca gcacccatcc 1741 cagggagatt tttgcagaca tccgaatgga ggggagaagg gcactctttc aggttccatt 1801 cccagcaagg gcagctcaca caggacctgc agcctggcca tcagcaggct ctgtggaggt 1861 gccttctgtc tactgttgta gttacgtgtt ttgtgttgac tctcggtggt gtgagaatgt 1921 actaatctct gtcaagacaa actgtagcat ttccacccca tcctcctccc tccctcacag 1981 aattc // LOCUS NPAIFP1A 1730 bp ds-DNA VRL 25-MAY-1990 DEFINITION AcNPV mutant with an S.frugiperda insertion element IFP1.6. ACCESSION M32775 KEYWORDS insertion element; insertion element IFP1.6. SOURCE AcNPV (mutant strain AcFP875.2) DNA. ORGANISM Autographa californica nuclear polyhedrosis virus Viridae; ds-DNA enveloped viruses; Baculoviridae; Baculovirus; Nuclear polyhedrosis viruses. REFERENCE 1 (bases 1 to 1730) AUTHORS Beames,B. and Summers,M.D. TITLE Sequence comparison of cellular and viral copies of host cell DNA insertions found in Autographa californica nuclear polyhedrosis virus JOURNAL Virology 174, 354-363 (1990) STANDARD simple staff_entry FEATURES from to/span description recomb 118 119 AcNPV DNA end/S.frugiperda DNA start recomb 1683 1684 S.frugiperda DNA end/AcNPV DNA start site 115 118 viral target sequence site 1684 1687 viral target sequence BASE COUNT 610 a 279 c 252 g 589 t ORIGIN 1 tcgtcgattg cgttttgatc aacgacttga gcagagacac gttaatcaac tgttcaaatt 61 gatccatatt aactatatca acccgatgcg tatatggtgc gtaaaatata ttttttaacc 121 ttttgaccgc cagagcataa acgcttcgct gtgccctcca cgccaagcca caaattcaac 181 gaaataacct tgaaaatatt agggattcca aaatgttatc gataaaccaa agtgagtaat 241 aattttcgtc tgattttttt ttgaataaac tgcttataac ataaaaagaa tatttattta 301 acctaatctc ccctgtaata aacttaagac cacaataata gcaactatca ataaaatatc 361 agccttttga gacacaaaaa tacatataaa attgcagcaa gttacagcac tattatctat 421 cagttacgcg ccattggcat gtatacatgc caatggcgcg gcaactatca atttttttta 481 aatctctcgt ttatgattag ggatgtagct gaataatgta acttatcgag ttcatttatg 541 tattaggtaa tcaaatttga aaactgctaa ataggtacag attttagtga tattatttat 601 acactttcat attcttaaaa tagttgcctt gtatttgtaa aatgtcacat gaaacttaaa 661 aatataaaaa atcaacaatc gtatttattt attatgctca ttaacagcct tgtaattgaa 721 ccgtcacgtg aaacttaaaa atatataaaa aatcaacaat tgtgtataat tattatgctc 781 aatacagcct tgtatttgta aaatgtcaca tgaaacttaa aaatataaaa aatcacaatc 841 gtatttattt attatgctca ttaacagcct tgtaattgaa aaaggttacg tgaaacttaa 901 aaatatataa aaatcaacaa ttgtacctat aaaacattat gctcactaac agtcttggaa 961 ttgtaaaatt taaaactaaa ttgtatatta tctataatta ttatcaataa caaaaattta 1021 ataaataaat agcaattata atgtaggtac tcgataaaat attttttgca acttccccgc 1081 atttaaaaac acggcacgcg cttacgacgt cacgagtccg acatctatac atgtcaatgg 1141 cgcagaaggg attgcatatc ggtagtatgc taggatgcgc atgattggcg aaaagggaat 1201 gtcgataata ttgaaaatat agtaaatatg ctattatacc caagaatttt tcagtttaaa 1261 atatagtata tttacctttt ttctcagaga ctaaggttat atatatactt tttaatagtt 1321 attacaaaaa ataagcgagt catttagctg aagttttaat attacttgaa ttacactaca 1381 tcactagtag gcatgtatac atgccattgg cgcgagagat acagttcttt tttagccgat 1441 atttaagttg tttttatcaa tcaaatatta gttattctat attgcaattt tgttcactgt 1501 gtacaattat gtaataactg ggtattcaga tcttaagata catcaataaa acacgtgaaa 1561 gtaatgattt aatgtgttta ttttgttttt ctctgtgatc tgtttcttgg catctataga 1621 tgtcggtggc gcacagggca cgcttgcgca tgacatctat acatgccatt ggcgttcaaa 1681 aggttaaccc tcttatactt tgcactctgc gttaatacgc gttcgtgtac // LOCUS NPAIFP2A 2225 bp ds-DNA VRL 25-MAY-1990 DEFINITION AcNPV with an S.frugiperda insertion element IFP2.2. ACCESSION M32779 KEYWORDS insertion element; insertion element IFP2.2. SOURCE AcNPV DNA. ORGANISM Autographa californica nuclear polyhedrosis virus Viridae; ds-DNA enveloped viruses; Baculoviridae; Baculovirus; Nuclear polyhedrosis viruses. REFERENCE 1 (bases 1 to 2225) AUTHORS Beames,B. and Summers,M.D. TITLE Sequence comparison of cellular and viral copies of host cell DNA insertions found in Autographa californica nuclear polyhedrosis virus JOURNAL Virology 174, 354-363 (1990) STANDARD simple staff_entry FEATURES from to/span description recomb 26 27 AcNPV DNA end/S.frugiperda DNA start recomb 2191 2192 S.frugiperda DNA end/AcNPV DNA start BASE COUNT 760 a 407 c 349 g 709 t ORIGIN 1 ccgtctcacc ataacaccgt ttttaccaca gaacatataa agagagtaga aatgaaatag 61 ttcctgattt gtatcagaac gcgtgtcgcc gtttctccca ccctttaatt tcactacaag 121 caagagacca atgtctgagc ggattacccc cacaaataaa taacacacat ttaaatctcg 181 aatttgctat tggttgtcac acaaacgcaa tgactgcatt tgacgtacac tctaacacgc 241 ataaaaactc atgtatgcta gggcgagaca aaggtcgatt gttgatatcg ataaaattta 301 agtaatcttg tttttgtttt acaagtttgt tgaatattgt gaaattgtat tcatgcgaaa 361 tagaaaccaa gtaataaaag taaaataaat aaaaaataca taactgaatt ttaaaaccaa 421 ttttattact caataagtta aaaatcagta aaaaataaat aaatcatagg cacttttgaa 481 cgtcaaagca aatataataa tattaataac gtctgtctgt cggtcagccc tctagtgaag 541 ctatttgctc gttccaaagt gtagattcct atggagaaga tcgaagatct gttcgttcga 601 tctttttcaa ttagattcac aatacaattc ttgataacat tgcttttctt tgcatcgatt 661 gcttgaacta tgtgagaaca atgtaaaact aatattcagt caaagcgata gaaaaaaaaa 721 ccttaagaac aacaaaatta atacagtctg gagctgacca gtcccagttg acagcgctcg 781 ttcaccaaca tgacacgtac accagcaccg tccaactcaa ccatgttgca gggaatcgaa 841 cgatccaatg cccgtgccac cgccagtgag acctttgagt gagcatgaca actctaagct 901 gacacaaatg cattctacta gtctaattta gctaattaca gggctctcac atttacaaaa 961 atctgtataa agtacagaac acattgaaat aacatggtta tattatttca ttttttaaat 1021 ttacgaaaga tgatatttta aatatcgcca tttttagaca aaaaaactga tttaatgaga 1081 ctaatttaaa attttctaaa ataaaatcag tttacttact gaattacaca ttgcgatatt 1141 gtgcggataa acgcctaatt ttaagtttct tttccctgac tttgttatta tgtaattatc 1201 aacatcgaaa tgttgcgaac gaaatatgct acatttagta ggaatccagg agcccatagt 1261 acaggttttc ctagtttggg ttcctgaatc ccaccacttg tcatttcaat tgtcttaaac 1321 ctggtcgtgt aacttgctac ataatttatt atactattaa ttattatata agcttagttt 1381 ttaagaccaa tgtctgtata aatatggtgg agataaaaat atacgcagtt tctaatctgt 1441 gtggtggaac ttgaaaaaaa aaaaataatt ctaccagttg tgtcaatcca caattttgtt 1501 acacctggat ctgttggaaa cctaaaacat aatattaaaa taaatcatat gtcacattct 1561 ataaaaaaat aaaaacataa aactcactca aaatctatat tactaattag tcgtaataaa 1621 taatacttat tatttaaatt ataaagtgtg ctattgatta taatagactg tatcatagta 1681 acaatcgaca ttgattacaa atttgcccaa cactatagat gtcatagatt ttgtggaaat 1741 gagcttttgt ttcggtactc gggactctcg caatggacgt catacattgc gcgattgctc 1801 acgtagtgca agattgtact catctatcgc aagcaatgta ttactattag tctgcttgcg 1861 attatatcgt ttttgtctat aaatttcata atactaatgt tataattaca aatatcacac 1921 gtatttttat gtttttggac ttaccggtga aaagaaattc cttctcgtgc tatgctacac 1981 acctggtttc ttttccgaca cgttataata gcgcaagacg gcataatata atatttttgt 2041 ggagatacgt acgctgtcaa aacatgacag cgattgctag ccggtaagta tgaacatttg 2101 agcgttatga cagcggtatg cacgttcgtt cgcaccctcg gcacgcgtgg cccgcccaca 2161 aggccaatcc agtctctttc ttttatccat ggtttttaca aatggaaatg tatttgtaaa 2221 acggc // LOCUS RATPFKFBP1 2917 bp ds-DNA ROD 25-MAY-1990 DEFINITION Rat 6-phosphofructo-2-kinase/fructose-2,6-bisphosphatase gene, 5' flank and exon 1. ACCESSION M26215 KEYWORDS 6-phosphofructo-2-kinase; alternative splicing; fructose-2,6-bisphosphatase. SEGMENT 1 of 2 SOURCE Rat liver DNA, clone lambda-20. ORGANISM Rattus norvegicus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 2917) AUTHORS Darville,M.I., Crepin,K.M., Hue,L. and Rousseau,G.G. TITLE 5' flanking sequence and structure of a gene encoding rat 6-phosphofructo-2-kinase/fructose-2,6-bisphosphatase JOURNAL Proc. Natl. Acad. Sci. U.S.A. 86, 6543-6547 (1989) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by M.I.Darville, 08-AUG-1989. FEATURES from to/span description pept 1374 / 1401 M-type 6-phosphofructo-2-kinase (EC 2.7.1.105)/fructose-2,6-bisphosphatase (EC 3.1.3.46; PFK-2/FBPase-2), exon 1 pre-msg 1255 > 2917 PFK-2/FBPase-2 mRNA and introns (alt.) IVS 1402 > 2917 PFK-2/FBPase-2 intron A BASE COUNT 779 a 627 c 605 g 906 t ORIGIN 1 ggatccagac agaactccgg cttaagtgag cggcccctgg cggtggctgt gaggggcttc 61 tgcctggagt cctttgggga actttctcag ttgaaagatg gagcagttaa gaatttactt 121 agctagcctc cgttttaatg gagtctggag gaaggagact acctcaaatc ctccctgtag 181 agccatattt gtcttgactt tcaaagcgtg agtcagactt gcatttggct ccatttctca 241 gatgagcaaa atcgaggcct agaacattta agctccggag cagtatcttt tccctatcct 301 gtctacgctt tgtcctttca ttcttaggat aaaccaggaa gaggcactga agctaagtag 361 aagttgaata aattcgtctc agcctcacag aagttataga gccaagattt aagcccttct 421 taatcaggta aacgattgat tttctgtcca cacaactctg tgtcttttcc cgtgggcacc 481 actgcacttt acgatccatt ctagtgcaac atttaccaca gtgaacagct cccattgatt 541 tccattgtca ctagaccgct aacagcctga atgcagtggc catcttggac ccagcttata 601 tgataacgga ataaatattt agcagatgga tgaagcagca ggccagtagc gatgagttct 661 tgacttattt tttttaacag gaaatgccca atttccagga aaactataac tttgcatcgt 721 aaggtttttt cactgaaaat tgctacagag ttattcccac tcgaatacac acaccctcag 781 aatctcacag cctttaagcc atgatcttca gtgggttgct cctctaccct cagaggactt 841 aaaggtcact agctattgac attagaaaaa catctatgta attaaaagtc tgtgccttga 901 gcttttccat gcattgggag ttactgacta aagatcgagt cccaggagtg ggttaattag 961 gacaagggaa atgaacgttt ttcttcatgt ttccagattg ctttcaagca agggattttg 1021 ccaatgctgt actaaaggca gacagaaacc tttgtcccaa ggaaatccag cttttgctca 1081 cctcaatagc tggttttaat ttctccattc tccagtggct tgttaatatt agtactgacc 1141 tttggggcaa ggtgaacaca tggttggact gaagagaaaa cgctctgggt ggctcaggaa 1201 cttctttggc aagtacagca gctaatattt caacactgca cacacccctt aacaagggcg 1261 cttcctcagc cttcccagga accaacggat gtctccgagt tttctatttg atgccaccca 1321 ccggacctgc tttgggggtc tgtaaatgca agagagccat tggaaaatta gcgatggaag 1381 aaaaagcctc taagagaaca ggtaggtgag cctaaaggaa aaacagacag gtctttttac 1441 aggtcaaaaa tccttctttc tggcttcttg gaagattttt tctgcatagg attctgattt 1501 agctttatta aaggggatgg gagggttggg gaatgataat atgaatacct aattcaactg 1561 ccaatatttt atttttttaa ttattttatt aatattccaa atgttgcccc ctccctattg 1621 tcagtatttt aaaattcatt tgtacttgac tttttaggac cctctctaaa ggtcagaccc 1681 ctactatctt tgaaattcca atttttgttt tatttagttt tggaaaatga atttaactgt 1741 gcttcccatg ggagcctgaa actttctatg tagcctggag tggccatgat tgtgtaaccc 1801 tcccagccta gcttccacct atgtagtgta tgtaattttt taaattaaat gcacaataga 1861 gagtgcatct ctgacctctc ctctagtgta taaagttgag ttgatttgct gtgaaacttt 1921 gacacattct ttagtttctc tgggtgtcaa tggatacctt agtaaaacag aaattaaagc 1981 ccctagcctc cttcccacag aggactggcc tgatgatcca gaatctctaa taggcagaat 2041 ccagagttct tttgcttcag ggctttcatg ctgtgtgtgt gtgtgtgtgt gtgtgtgtgt 2101 gtgtgtgtgt gtgtgtgtgt gtgtgtgaca ttcatgttct tggcttttat gtggcgtcca 2161 caggtagtac cacccacagc cctctctcta tgtgctctcc ttcctaagac tcgtcctaga 2221 taagaatgca gtgagctttg tggagcagac aggagtctga aaatagtggc ataatggggc 2281 cagagcagtc actgagaaca aagggattct aggctgctgt cccagaagtc ctgatatata 2341 ttcatacatg aatctagcct gtaatcatgt gattatccgt cagccttcta gcctataaca 2401 aatatattca caaccacaca tttcaagact taccaataag gagaccccat ttatgacttt 2461 attgcccatc ctatcttgat aaaatgagct tcctcaccac attgtacatg gaggtgggag 2521 caggctctat ggccgaacat catactgcat agtttcttcc ctagagatag ttctacttta 2581 tggttcagcc tttagccctg agtatgaagc tgagtaaacc ctgtgtctat aaccaataca 2641 ttaaaacgtg ttagaatttt atcattttat tctgtgccat cctaaagttt taaggttttt 2701 tttttttttt ggatgttgct ctttacctca gaaaaaccag ctctcccctg tccatttccc 2761 tgaaagctgt cataatatct accatggtga actgggattt cttgtcacca tgtttatctc 2821 ttctgcagca gagagatgta gattgtgaac atgatcactg aacaagtgtc ctccatatca 2881 gagcccaaga tgaacagtca gcaggtcaca gggatcc // LOCUS RATPFKFBP2 3720 bp ds-DNA ROD 25-MAY-1990 DEFINITION Rat 6-phosphofructo-2-kinase/fructose-2,6-bisphosphatase gene, exon 1'. ACCESSION M26216 KEYWORDS 6-phosphofructo-2-kinase; alternative splicing; fructose-2,6-bisphosphatase. SEGMENT 2 of 2 SOURCE Rat liver DNA, clone lambda-20. ORGANISM Rattus norvegicus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 3720) AUTHORS Darville,M.I., Crepin,K.M., Hue,L. and Rousseau,G.G. TITLE 5' flanking sequence and structure of a gene encoding rat 6- phosphofructo-2-kinase/fructose-2,6-bisphosphatase JOURNAL Proc. Natl. Acad. Sci. U.S.A. 86, 6543-6547 (1989) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by M.I.Darville, 08-AUG-1989. FEATURES from to/span description pept 2913 / 3009 L-type 6-phosphofructo-2-kinase (EC 2.7.1.105)/fructose-2,6-bisphosphatase (EC 3.1.3.46; PFK-2/FBPase-2), exon 1' pre-msg 2664 > 3720 PFK-2/FBPase-2 mRNA and introns (alt.) IVS 3010 > 3720 PFK-2/FBPase-2 intron A' BASE COUNT 1020 a 752 c 859 g 1089 t ORIGIN About o.1 kb after segment 1. 1 aagcttctct caaggctata atactgactt tcagagaaga tgaacactaa aactggtctt 61 aactaagaca gagttcagtg agaataaaag atacttcagg ctgaggaaga gcctaaaagc 121 aggaaaacgc tgtgttcgag ggtcctgtaa tcagccagtg tggttaaaca tgggggagtc 181 ctggggggta gaagactgat aggtggctta ggacttgatt tagaagaagt ttgaatgcta 241 gtctgagtca tttctctaag gcttgttttt tggtgtttct gagttcagaa ttgctaacgg 301 tagtgaacaa tttcatagtt agttctaggc agggttaaat tagggtgtca aacctggacc 361 ctgagtttac ccattcagac cttgtcagag gccttctagc tacctagctc agatttcttg 421 ctactaaaat tatcttcttg attcttggca gtgttgtaag gacaaaattg taaacataaa 481 agacatgttc atgttgttgg ctagattacc ttagacaagc catcccctcc tcaggcccat 541 ttccctgtgc tattactggg gaagtttcaa agaaacaaat aagataaaga aaatgttact 601 ttgtgcgcac tggaaaacat acaaagttac ggtaagatca ttaaaacaga cttccactaa 661 ttatgtgact tactaggatt aagagactca tgtcttgcaa agtctctttc tgagactcta 721 ggagttatct gctaatctgg aaaaaactca ttatattgta ttttttactt taaattgata 781 gataattgtt tatacttgtg tggtacagtg catgtatata atgtacaatg gttaatagct 841 ttatgtatca cctcaaatat ctatcatttc tcctttccac cagctatatt gaagtatcca 901 attatctgtt gtgaaccata gttatccaga tgtgctcttg aacatcagaa gttattcctc 961 ctgtttccca gaatccatcc tgtttgttct ctcttttctc cctctccttt gcccactagt 1021 ctatactcag tttctcacca atctctttta cctttgtcca gtaggtgaga gcatgtagta 1081 tttgatgttt tttgccagac ttaagttaac agaaacctag gaaggagttt acctttctaa 1141 ttgacactca agaagtcttt acagtataaa tgttaatccc gtgtcacaat aacgtgctag 1201 accttgcaga ttacttggag ggaaaaagat gaatcaatta ctgcccaaag gaaacaacag 1261 tttgaatcat atgtattcag cttccccagg tttcttcatg tgtatcaata tattgactct 1321 tcgctaacat tgagggaatg gaccctacta tccttatatc tagcaatggg cagaagtgag 1381 agtccaccca gacagtatgg ttcctaatcc cattgttatt gttcttaact gctctcatcc 1441 attgcctcag taggctcctt gagataggta ggccttcacg ttgaaagtct tatatgctca 1501 gcaactgcag tgaatctaac ccttctatag actaaggttc tgagaagata ttttaggaag 1561 tgtcttaaaa ggcaaagtaa agctgggagc atttgagggc agggactgga gatttgtttt 1621 cccgagagat tgcccttgca tactccagct cagtggaaga aagttgagcc tcctaatacc 1681 caatcactgg agctttgctg ggggtgggaa ggttacaaag aatgaagtaa aagggcactt 1741 ggacatcttc caggcaagcc atgcagttgc atgaattcta cagataaaag tctgaaaata 1801 actgggggaa tggcaataag agtggggaca agtgaaaggg agacatctgg atgagatatc 1861 tctgcctgtg gatttgtgct atttaaatag atgccattac ctcttattcc ccatagtctc 1921 ctcagtgtgg tttccacctc tcctaagatc ccatcccaac actttggctt gaagagtttt 1981 agcacgggtt tgaattagat tctgatgcct ttagtgctgg gtcatttcag cattggcagc 2041 catgtgagcg acccttactt atcggctctc tgcccagcaa tggctgccct cactctcatt 2101 cctaaggtgt cagatttact ctgatttatt taaccttggc tggagagagt ataaaggtga 2161 tcagatagtg ccaatctctt tgaagtgccc aggacagctc attttaagag ggagaggtcc 2221 ggtagagagg cagggaacaa actctaggga aagacagttt tacagtgcct tgttgaatag 2281 ggtgagtggt gaggtagtga aagtctactt ccagtttggg cctttaattg ctagctcaaa 2341 acattgcatt ttgattagga acattatgtt tgtggttttg ttgtgggggg gggggcttgt 2401 aatgaacctc tgtttctgag ttggagctta atgttctaat taggcctgct ttgaaatcaa 2461 tttcaaagct tagacctatc tccatcacct ttgtcatgca cacaacacag gatgtaagca 2521 catgatcttt ggtcttttat ttgcatactc tactagtcct gccttcattg gcgagaagta 2581 actggccaga cctgggctgt cttgctggct gccttccttg ctgggctcca acttatcttc 2641 tatgtacata gcccttggag ttcagaggcc tctgctgact tctgctcagc ctctggctct 2701 cagcacccac atccccactt gtacatctgg gagcagaaga aaaagaggga gctggagata 2761 atctgtgaga ggctccgaag cccagtacat ccccctgact cagccctgcc cacctgctgc 2821 tgtgactaca gcaacagaca agacagctag gaagatagga agtgaggtcc aataccttgt 2881 gggcagtggt gtcattagct gcaacgccta agatgtctcg agagatggga gaactcactc 2941 aaaccaggtt acagaagatc tggattccac acagcagcag tagcagcgtg ctgcaacggc 3001 gaaggggctg taagtgaggg ttgtcatttg tgggtggtgg tgatgggtgg gtctagaaga 3061 acaagtggtg ttccccatcc actcctaccc tcacaaagaa gttggaatag aagatcattt 3121 ggatgaaatt gctagagtgt aacaagaggc taatatgaga agggtaagga agtaagagaa 3181 aggcaggtag aaaagacaac gatgaggctt agataaaaag aacccaaact acagggaccc 3241 ctgacctaag ggctgtctcc ctttatttga gaaacggtcg gaatgcttct tgtctggaat 3301 gcgagcgttc tagtgtagtg ttacccgaga gtattttatt ttagcttctc ttatgaaagc 3361 tctggaaaca ggtcttgttt aatttatgcc agggtgaaag aaggtgatag ttcctggcgt 3421 ggtttctcgg aggctaggtg cttagtccac accagtaccg tgtcactgct cagcattgcc 3481 cttcctcttc caccatcctc ttaaaaataa ctgtacctgc ccagctgggg acaactgcgt 3541 acttggcctt gacattcaga gacagtcttg ctgttctgag gagtcgaggg gacagacagt 3601 atcacaaaga ctgatctggt gttcagttgg agcttaacat cctaattaga cctgatgtct 3661 ttattttaaa actcaggcct ttgaaatttt aaggaggagc agaggcaaag taagcaaagc // LOCUS SPFIFP1A 1801 bp ds-DNA INV 25-MAY-1990 DEFINITION S.frugiperda insertion element IFP1.6 DNA, clone lambda 879. ACCESSION M32776 KEYWORDS insertion element; insertion element IFP1.6. SOURCE S.frugiperda DNA, clone lambda 879. ORGANISM Spodoptera frugiperda Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta; Pterygota; Neoptera; Holometabola; Lepidoptera; Ditrysia; Noctuoidea; Noctuidae. REFERENCE 1 (bases 1 to 1801) AUTHORS Beames,B. and Summers,M.D. TITLE Sequence comparison of cellular and viral copies of host cell DNA insertions found in Autographa californica nuclear polyhedrosis virus JOURNAL Virology 174, 354-363 (1990) STANDARD simple staff_entry FEATURES from to/span description site 115 118 5' insertion target site site 1755 1758 3' insertion target site BASE COUNT 629 a 274 c 273 g 625 t ORIGIN 1 gagaatcaat gttcactgga aatatatccg ttttagtttg taacgaagaa aaagatgcga 61 tgaagtgttt tgtgacgtgg attcttagaa ggaagaattc aatgttctat tttattaacc 121 ttttgaccgc cagagcataa acgcttcgcc gtgccctcca cgccgagaca caaattcaac 181 gaaataacct tgaaaatatt agggattcca aaatgttatc gataaaccaa agtgagtaat 241 aattttcgtc tgattttttt ttgaataaac tgcttataac ataaaaagaa tatttattta 301 acctaatctc ccctgtaata aacttaagac cacaataata gcaactatca ataaaatatc 361 agccttttga gacacaaaaa tacatataaa attgcagcaa gttacagcac tattatctat 421 cagttacgcg ccattggcat gtatacatgc caatggcgcg gcaactatca atttttttta 481 aatctctcgt ttatgattag ggatgtagct gaataatgta acttatcgag ttcatttatg 541 tattaggtaa tcaaatttga aaactgctaa ataggtacag attttagtga tattatttat 601 acactttcat attcttaaaa tagttgcctt gtatttgtaa aatgtcacat gaaacttaaa 661 aatataaaaa atcaacaatc gtatttattt attatgctca ttaacagcct tgtaattgaa 721 ccgtcacgtg aaacttaaaa atatataaaa aatcaacaat tgtgtataat tattatgctc 781 aatacagcct tgtatttgta aaatgtcaca tgaaacttaa aaatataaaa aatcacaatc 841 gtatttattt attatgctca ttaacagcct tgtaattgaa aaaggttacg tgaaacttaa 901 aaatatataa aaatcaacaa ttgtacctat aaaacattat gctcactaac agtcttggaa 961 ttgtaaaatt taaaactaaa ttgtatatta tctataatta ttatcaataa caaaaattta 1021 ataaataaat agcaattata atgtaggtac tcgataaaat attttttgca acttccccgc 1081 atttaaaaac acggcacgcg cttacgacgt cacgagtccg acatctatac atgtcaatgg 1141 cgcagaaggg attgcatatc ggtagtatgc taggatgcgc atgattggcg aaaagggaat 1201 gtcgataata ttgaaaatat agtaaatatg ctattatacc caagaatttt tcagtttaaa 1261 atatagtata tttacctttt ttctcagaga ctaaggttat atatatactt tttaatagtt 1321 attacaaaaa ataagcgagt catttagctg aagttttaat attacttgaa ttacactaca 1381 tcactagtag gcatgtatac atgccattgg cgcgagagat acagttcttt tttagccgat 1441 atttaagttg tttttatcaa tcaaatatta gttattctat attgcaattt tgttcactgt 1501 gtacaattat gtaataactg ggtattcaga tcttaagata catcaataaa acacgtgaaa 1561 gtaatgattt aatgtgttta ttttgttttt ctctgtgatc tgtttcttgg catctataga 1621 tgtcggtggc gcacagggca cgcttgcgca tgacatctat acatgccatt ggcgttcaaa 1681 aggtaatgtg tttattttgt ttttctctgt gatctgtttc ttggcatcta tagatgtcgg 1741 tggcgcacag ggcattaata tctatcacct taccttttat tattacttta atgtataatg 1801 g // LOCUS SPFIFP1B 1820 bp ds-DNA INV 25-MAY-1990 DEFINITION S.frugiperda insertion element IFP1.6 DNA, clone lambda 883. ACCESSION M32777 KEYWORDS insertion element; insertion element IFP1.6. SOURCE S.frugiperda DNA, clone lambda 883. ORGANISM Spodoptera frugiperda Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta; Pterygota; Neoptera; Holometabola; Lepidoptera; Ditrysia; Noctuoidea; Noctuidae. REFERENCE 1 (bases 1 to 1820) AUTHORS Beames,B. and Summers,M.D. TITLE Sequence comparison of cellular and viral copies of host cell DNA insertions found in Autographa californica nuclear polyhedrosis virus JOURNAL Virology 174, 354-363 (1990) STANDARD simple staff_entry FEATURES from to/span description site 115 118 5' insertion target site site 1774 1777 3' insertion target site BASE COUNT 636 a 281 c 270 g 633 t ORIGIN 1 agagttaagt ggctttgatg atttgtgcat atagtgccat tgtgcataca aaaccaaaat 61 gttttatatt caaacttaat tttatcgttg tcatctaaaa cgaaagccat ttctttaacc 121 ttttgaccgc cagagcataa acgcttcgct gtgccctcca cgccaagcca caaattcaac 181 gaaataacct tgaaaatatt agggattcca aaatgttatc gataaaccaa agtgagtaat 241 aattttcgtc tgattttttt ttgaataaac tgcttataac ataaaaagaa tatttattta 301 acctaatctc ccctgtaata aacttaagac cacaataata gcaactatca ataaaatatc 361 agccttttga gacacaaaaa tacatataaa attgcagcaa gttacagcac tattatctat 421 cagttacgcg ccattggcat gtatacatgc caatggcgcg gcaactatca atttttttta 481 aatctctcgt ttatgattag ggatgtagct gaataatgta acttatcgag ttcatttatg 541 tattaggtaa tcaaatttga aaactgctaa ataggtacag attttagtga tattatttat 601 acactttcat attcttaaaa tagttgcctt gtatttgtaa aatgtcacat gaaacttaaa 661 aatataaaaa atcaacaatc gtatttattt attatgctca ttaacagcct tgtaattgaa 721 ccgtcacgtg aaacttaaaa atatataaaa aatcaacaat tgtgtataat tattatgctc 781 aatacagcct tgtatttgta aaatgtcaca tgaaacttaa aaatataaaa aatcacaatc 841 gtatttattt attatgctca ttaacagcct tgtaattgaa aaaggttacg tgaaacttaa 901 aaatatataa aaatcaacaa ttgtacctat aaaacattat gctcactaac agtcttggaa 961 ttgtaaaatt taaaactaaa ttgtatatta tctataatta ttatcaataa caaaaattta 1021 ataaataaat agcaattata atgtaggtac tcgataaaat attttttgca acttccccgc 1081 atttaaaaac acggcacgcg cttacgacgt cacgagtccg acatctatac atgtcaatgg 1141 cgcagaaggg attgcatatc ggtagtatgc taggatgcgc atgattggcg aaaagggaat 1201 gtcgataata ttgaaaatat agtaaatatg ctattatacc caagaatttt tcagtttaaa 1261 atatagtata tttacctttt ttctcagaga ctaaggttat atatatactt tttaatagtt 1321 attacaaaaa ataagcgagt catttagctg aagttttaat attacttgaa ttacactaca 1381 tcactagtag gcatgtatac atgccattgg cgcgagagat acagttcttt tttagccgat 1441 atttaagttg tttttatcaa tcaaatatta gttattctat attgcaattt tgttcactgt 1501 gtacaattat gtaataactg ggtattcaga tcttaagata catcaataaa acacgtgaaa 1561 gtaatgattt aatgtgttta ttttgttttt ctctgtgatc tgtttcttgg catctataga 1621 tgtcggtggc gcacagggca cgcttgcgca tgacatctat acatgccatt ggcgttcaaa 1681 aggacacgtg aaagtaatga tttaatgtgt ttattttgtt tttctctgtg atctgtttct 1741 tggcatctat agatgtcggt ggcgcacagg gcattaatcc attttctatc ataagttatt 1801 tgatacttag atctgtaagt // LOCUS SPFIFP1C 1741 bp ds-DNA INV 25-MAY-1990 DEFINITION S.frugiperda insertion element IFP1.6 DNA, clone lambda 889. ACCESSION M32778 KEYWORDS insertion element; insertion element IFP1.6. SOURCE S.frugiperda DNA, clone lambda 889. ORGANISM Spodoptera frugiperda Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta; Pterygota; Neoptera; Holometabola; Lepidoptera; Ditrysia; Noctuoidea; Noctuidae. REFERENCE 1 (bases 1 to 1741) AUTHORS Beames,B. and Summers,M.D. TITLE Sequence comparison of cellular and viral copies of host cell DNA insertions found in Autographa californica nuclear polyhedrosis virus JOURNAL Virology 174, 354-363 (1990) STANDARD simple staff_entry FEATURES from to/span description site 60 63 5' insertion target site site 1696 1699 3' insertion target site BASE COUNT 609 a 275 c 263 g 594 t ORIGIN 1 ggtaggcagt gacaatggaa cgccaattgc tacgaatctt acatacttca ttcgcttcat 61 taaccttttg accgccagag cataaacgct tcgctgtgcc ctccacgcca agccacaaat 121 tcaacgaaat aaccttgaaa atattaggga ttccaaaatg ttatcgataa accaaagtga 181 gtaataattt tcgtctgatt tttttttgaa taaactgctt ataacataaa aagaatattt 241 atttaaccta atctcccctg taataaactt aagaccacaa taatagcaac tatcaataaa 301 atatcagcct tttgagacac aaaaatacat ataaaattgc agcaagttac agcactatta 361 tctatcagtt acgcgccatt ggcatgtata catgccaatg gcgcggcaac tatcaatttt 421 ttttaaatct ctcgtttatg attagggatg tagctgaata atgtaactta tcgagttcat 481 ttatgtatta ggtaatcaaa tttgaaaact gctaaatagg tacagatttt agtgatatta 541 tttatacact ttcatattct taaaatagtt gccttgtatt tgtaaaatgt cacatgaaac 601 ttaaaaatat aaaaaatcaa caatcgtatt tatttattat gctcattaac agccttgtaa 661 ttgaaccgtc acgtgaaact taaaaatata taaaaaatca acaattgtgt ataattatta 721 tgctcaatac agccttgtat ttgtaaaatg tcacatgaaa cttaaaaata taaaaaatca 781 caatcgtatt tatttattat gctcattaac agccttgtaa ttgaaaaagg ttacgtgaaa 841 cttaaaaata tataaaaatc aacaattgta cctataaaac attatgctca ctaacagtct 901 tggaattgta aaatttaaaa ctaaattgta tattatctat aattattatc aataacaaaa 961 atttaataaa taaatagcaa ttataatgta ggtactcgat aaaatatttt ttgcaacttc 1021 cccgcattta aaaacacggc acgcgcttac gacgtcacga gtccgacatc tatacatgtc 1081 aatggcgcag aagggattgc atatcggtag tatgctagga tgcgcatgat tggcgaaaag 1141 ggaatgtcga taatattgaa aatatagtaa atatgctatt atacccaaga atttttcagt 1201 ttaaaatata gtatatttac cttttttctc agagactaag gttatatata tactttttaa 1261 tagttattac aaaaaataag cgagtcattt agctgaagtt ttaatattac ttgaattaca 1321 ctacatcact agtaggcatg tatacatgcc attggcgcga gagatacagt tcttttttag 1381 ccgatattta agttgttttt atcaatcaaa tattagttat tctatattgc aattttgttc 1441 actgtgtaca attatgtaat aactgggtat tcagatctta agatacatca ataaaacacg 1501 tgaaagtaat gatttaatgt gtttattttg tttttctctg tgatctgttt cttggcatct 1561 atagatgtcg gtggcgcaca gggcacgctt gcgcatgaca tctatacatg ccattggcgt 1621 tcaaaaggtg tttattttgt ttttctctgt gatctgtttc ttggcatcta tagatgtcgg 1681 tggcgcacag ggcattaaca taacttggta taaattgttg cataattgca ttgagagcag 1741 a // LOCUS SPFIFP6A 1792 bp ds-DNA INV 25-MAY-1990 DEFINITION S.frugiperda insertion element IFP1.6 DNA. ACCESSION M32780 KEYWORDS insertion element; insertion element IFP2.2. SOURCE S.frugiperda DNA, clone lambda 858. ORGANISM Spodoptera frugiperda Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta; Pterygota; Neoptera; Holometabola; Lepidoptera; Ditrysia; Noctuoidea; Noctuidae. REFERENCE 1 (bases 1 to 1792) AUTHORS Beames,B. and Summers,M.D. TITLE Sequence comparison of cellular and viral copies of host cell DNA insertions found in Autographa californica nuclear polyhedrosis virus JOURNAL Virology 174, 354-363 (1990) STANDARD simple staff_entry BASE COUNT 586 a 326 c 287 g 593 t ORIGIN 1 atgatgtttg tacataatgt tggtacgttt ctcccaccct ttaatttcac tacaagcaag 61 agaccaatgt ctgagcggat tacccccaca aataaataac acacatttaa atctcgaatt 121 tgctattggt tgtcacacaa acgcaatgac tgcatttgac gtacactcta acacgcataa 181 aaactcatgt atgctagggc gagacaaagg tcgattgttg atatcgataa aatttaagta 241 atcttgtttt tgttttacaa gtttgttgaa tattgtgaaa ttgtattcat gcgattcaaa 301 ttgtatcacc aacatgacac gtacaccagc accgtccaac tcaaccatgt tgcagggaat 361 cgaacgatcc aatgcccgtg ccaccgccag tgagaccttt gagtgagcat gacaactcta 421 agctgacaca aatgctgaaa tgcattctac tagtctaatt tagctaatta cagggctctc 481 acatttacaa taatctgtat aaagtacaga acacattgaa ataacatggt tatattattt 541 cattttttaa atttacgaaa gatgatattt taaatatcgc catttttaga caaaaaaact 601 gatttaatga gactaattta aaattttcta aaataaaatc agtttactta ctgaattaca 661 cattgcgata ttgtgcggat aaacgcctaa ttttaagttt cttttccctg actttgttat 721 tatgtaatta tcaacatcga aatgttgcga acgaaatatg ctacatttag taggaatcca 781 ggagcccata gtacaggttt tcctagtttg ggttcctgaa tcccaccact tgtcatttca 841 attgtcttaa acctggtcgt gtaacttgct acataattta ttatactatt aattattata 901 taagcttagt ttttaagacc aatgtctgta taaatatggt ggagataaaa atatacgcag 961 tttctaatct gtgtggtgga acttgaaaaa aaaaaaatta attctaccag ttgtgtcaat 1021 ccacaatttt gttacacctg gatctgttgg aaacctaaaa cataatatta aaataaatca 1081 tatgtcacat tctataaaaa aataaaaaca taaaactcac tcaaaatcta tattactaat 1141 tagtcgtaat aaataatact tattatttaa attataaagt gtgctattga ttataataga 1201 ctgtatcata gtaacaatcg acattgatta caaatttgcc caacactata gatgtcatag 1261 attttgtgga aatgagcttt tgtttcggta ctcgggactc tcgcaatgga cgtcatacat 1321 tgcgcgattg ctcacgtagt gcaagattgt actcatctat cgcaagcaat gtattactat 1381 tagtctgctt gcgattatat cgtttttgtc tataaatttc ataatactaa tgttataatt 1441 acaaatatca cacgtatttt tatgtttttg gacttaccgg tgaaaagaaa ttccttctcg 1501 tgctatgcta cacacctggt ttcttttccg acacgttata atagcgcaag acggcataat 1561 ataatatttt tgcggagata cgtacgctgt caaaacatga cagcgattgc tagccggtaa 1621 gtatgaacat ttgagcgtta tgacagcggt atgcacgttc gttcgcaccc tcggcacgcg 1681 tggcccgccc acaaggccaa tccagtctct ttcttttatc catggttttt acaaatggaa 1741 atgtatttgt aaaacggcgt tggtacatct gtacctaaaa atgatatagt at // LOCUS TACLSEG 7102 bp ss-RNA VRL 25-MAY-1990 DEFINITION Tacaribe virus L protein and P-11 protein genes, complete cds. ACCESSION J04340 M33513 KEYWORDS L protein; P-11 protein. SOURCE Tacaribe virus (strain T.RVL.II 573), cDNA to viral RNA. ORGANISM Tacaribe virus Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses; Arenaviridae. REFERENCE 1 (bases 1 to 449) AUTHORS Iapalucci,S., Lopez,N., Rey,O., Zakin,M.M., Cohen,G.N. and Franze-Fernandez,M.T. TITLE The 5' region of Tacaribe virus L RNA encodes a protein with a potential metal binding domain JOURNAL Virology 173, 357-361 (1989) STANDARD simple staff_entry REFERENCE 2 (bases 420 to 7102) AUTHORS Iapalucci,S., Lopez,R., Rey,O., Lopez,N., Franze-Fernandez,M.T., Cohen,G.N., Lucero,M., Ochoa,A. and Zakin,M.M. TITLE Tacaribe virus L gene encodes a protein of 2210 amino acid residues JOURNAL Virology 170, 40-47 (1989) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [2] kindly submitted by S.Iapalucci, 08-AUG-1989. FEATURES from to/span description pept 70 357 P-11 protein pept 7072 440 (c) L protein BASE COUNT 2071 a 1653 c 1240 g 2138 t ORIGIN L segment. 1 cgcaccgggg atcctaggcg ttacgtgcac tcttctttat tgggctggat tacacaaaac 61 ttttcaagca tgggcaactg caatcgaact cagaaaccca gttcgtcttc taacaacctc 121 gagaagccac cacaagctgc agaatttaga agaacggcag agcccagtct atatgggagg 181 tacaattgca aatgttgttg gtttgctgac aaaaacttaa tcacctgtag tgaccattat 241 ctttgtctaa gatgtcacca aatcatgctg agaaactcag agctgtgcaa catctgctgg 301 aagcccctgc caacatccat aagagtcccg ctagaggcaa gtgctccgga tctgtaacca 361 acgacaaaca aggtcgctga ccccccgggg ggcccccaag ggggcccccg tgggggtggc 421 ccccggggga gtttgggttc tactcaatgt cctccactgt gtttctccta gctagaggtc 481 tgcacataat gcctctcaac ctcaaggacc cctggttgtc agatacaaca aggtctttca 541 gggacttact gtatgcaacc ttgaagttga caaattcaac ccagtccccg atcaactcca 601 gggacctgat caagccagat ctatgtaaat catagacaag gacaaagtcc tcttgttgga 661 atagcatcac ctgtgcagtt gattttaaat agggccagag ttggttcaac aaagcattca 721 catcttcagg gtcgtccagt gcgccaattg ccttcactat tatatcacca gtcaactcca 781 ctctgtaagt agaaactttc tcagttccca taaaaaagtg gccgtctctt aacaccaatg 841 gctccccttg atcgactggc ccattgagtg tataggcata cttgtcactc aattcagata 901 aaaatccacc aaagctgaat cttacatccg tcagaattag gttgtcattt gtctccacaa 961 gatctgcatc catcatcagg gaagccatat taaaagtgcc ttgacctgtg aatatcggaa 1021 gtcctgcaat catggaaact ccatccaaca agaggtgact tccccccgac cctgacactg 1081 actgcgtcac atccaccacc tgtaactcta cgcttgcttc tttaagatca atgactttag 1141 aaacacatgt aacactaaat gagtaagtca ttttcctctt aaacctgtac ttagttgatg 1201 ttctagaatg ctccacctgg atcaaaaacc tcaagtcaca atcctcatgg tcatcaaagt 1261 tcccaagaag atccagcaag tcattgatgt gtctccctgg agccagtggt tgctctccaa 1321 ttgtgatttt gtcaatgtac aatctaacat cttcaagatt tgaaatcaca taacccattg 1381 ctatgtcttc aataaacatc gctctactga tcttctgatc cctaactaag aaggtgaaat 1441 ctgttaactc acccaattca atctgaccat catcttccag ctccaatact tcattaaacc 1501 aaaaaaacga ttttaggcaa gaagttgaca tcactaaagg gttcaccatc gatgatagga 1561 tgatttgaac catgaagttg taacaaacct gttgaggact cagtccaact tccttatcta 1621 cgtaatcaga ctgcctaacg tgtgccatgg cacagtcgct ctgtttgact atgtagacac 1681 tattgtctct ggctttcagt aggtggctga cgattcctag attctcattc atcaaatcca 1741 atgctatgca aacatctaag aacttcagct tcctcccatc cagtgttctg ttcacttgca 1801 aatcgctcat gaagggagac aagtgctcat caaataattt cgggtaattt ttccgaatgg 1861 aatacaatac atgattcata cccagttgat cttctaattg acgaacaatt ctaggcttaa 1921 cagcccagaa caaattcctg ttacttaaca ccactggaat ctgtggctct ttcacagatg 1981 agaataccca ctcccctatt tcgcaggcat tggtcaaaac taaagaaaag taatcccaaa 2041 gaataggttt ggaaaatttt tcaatttttt ccagtgaggg tctgcaataa atcacacctc 2101 ttttcatgtt gcacacctca cacagacagg gcttcacctt tgtataaata tctttaatgt 2161 aaaggaaacc gccctcacca tctctgacac atttgctgcc tatactttta caaaaaccta 2221 taaaaccaga tgccactgaa gactggaatg cagatttgtt tatggactcc gctaagatct 2281 ttttaacacc ctttgtgaag ttccttgaca acctggactg caatgtctta atcagtgttg 2341 gcacctcttc tctctcaaga acccttctgc tcgtcattaa cttggtcctc aatactagtc 2401 ttaggtcacc aaaacttgac aaattcaccc atctaaagcc taaggcacgc ataatatctt 2461 gctctgagca acttgaataa cgcaagaacc ccacaagggc gtcatcacca tctctaccaa 2521 taagactgat caggttttcc tcaaaaactt tgcctctttt tatatcatta aaaacccttt 2581 tagcacagct tctaatgaag cctgtcatct catcatcaga aaatattgat tcaatcttcc 2641 tctgcaatcg ataacccctc gtcccatcca cccaatcttt tacatcttgt ttctcaacag 2701 caccaaaagg ggtttccttg aagccggagt atttgatcag ctggttcacc ctttgagaga 2761 tctttgagac aatttggaca ctaacaccat ttgcgacaca ctggtcacag attgtgtcta 2821 ttgtttctga caactgtgtt ggtgttttgc atttcacatt gtgcaacgct gcagcaacaa 2881 attgagttaa tagcggggtt tcttccccca tcacaaaaaa tcttgatttg aattcagcta 2941 caaaggtgcc cgctacactc ttggggctaa caaatttgtt caattttgag gagaggtact 3001 catgaaaaca caacatttcc aaccactcat cattaacttg attatcatct gaagcacaag 3061 gcaatttgat aagtgtgacc tggtcatcgc tagaagtgta ggaagttgga atgacatcat 3121 atagtaaatc tagcacataa tttaagaact gttctgttat caatccatac aaatcagaca 3181 tgttatgcag gatgccttgg cccatatcta gaacagacat tatgtgactt ggtatctctc 3241 cttggactgg catgaattgg tggaagaatt cctctcccac gcttgagcta ccacaaccca 3301 tcaaccctaa gcttctcttt agattgccag tacagtaagc ctcagcaaca tttactggaa 3361 cttctataac tttatggagg tgccatctca gcacatttaa gactggttca agattaataa 3421 tgttcctttc ttttggggtc ctaagttcca aaaattggag gaatgtcaag aaaagcgcag 3481 gactcatggt tggtccccat ttagagtgat caagagaaca tgataggtca ccattgttca 3541 ctgctaactt catgtcacag atggctctat caaattcttt ttctgaactg agacatgtat 3601 atctcatgga actgccaact gcctctgcaa agtcttcaac taaccttgtt atcagcttag 3661 tgttaagatc cccaacgtaa agttccctat tggaacccac ttgctccttg taacttaagc 3721 caaactttaa gttaccatta tttgaactga tactggtaaa ctcagtagga gaatcatcag 3781 agtaaaagca caggttcctt agtgcagcat tggtaaaata actcctatcc aacttatctg 3841 ctatagcttc agaattgctc tctctcatgc taagtcttac ttcatctttc actctcagtg 3901 cttcattagt gaaaccaagc cttgatctac tcctatgttc ataacttcct agtttttgat 3961 cataatgacc ttgtatcaat aagtatttga agcactcaaa atactcttca tcaaaatatg 4021 cagaagtagt gagatttttc aaaagcaatc ctagaggaca aacttctaaa ggtctctcca 4081 agaaatactc tggtttgctt gaggagtaat agaccacatc acatagttct ttatacaagt 4141 ctgaagcgag tacatcagga tcaaagtctt tcacctcatg aagtgagacc tctgtcctaa 4201 tctctctcat aatcccaaaa ggaccccaaa gttcctcaag tgtacctgtg ctgcttttaa 4261 ccctgccttc cttttgattc ggacttagat tgacatctcc agatttggag tccttcattt 4321 tgaccattgc taattgcacg gaatttttta tatcattgaa agcttctgct tgctcttctg 4381 tcaacccttc aaacatcatt gacagctctt cacaatctct ctgggtttta gaaccaatac 4441 taacaagccc ggttaagttt tttagtatca agtaatctaa tgtacttgga tcaatgttgt 4501 atcttccctt tgttttgaac ctttccgcca tggaagaaac acatgaagaa atcatcttct 4561 ctctatcata taccgaaaga acattcccaa gttcatctag ttttggtact actacgctct 4621 tattgctaga gaggtctaaa gcagtgctag taatgtttgg cttgaatggg tcgttcttaa 4681 gctcacctgt gacttttagc tttcctttat taaataatga gatgcaataa ctcaacagtt 4741 cattgctgac ccctggtaat ttggtgtcag aacagtcttt gagggtcttt gaataaaatt 4801 tatccagccc gtccaacatc attccttctt gagcttctgt cagttgacca tttaaagatg 4861 ggttgactat aacacagtta aaatcaattt tgggttcaat gaatttctca aaacatttga 4921 tttggtctgt taatctgtca ggtgtttcct ttgttattaa atgacaaagg taacttacat 4981 ttaagagata cttgaacctc ctagcaaaat aggggtccga atcaacacct tcaccaagaa 5041 ggcattggaa aaggtccact gccagcctct gaactatgac ttcagagcca gacatacact 5101 ccacttttaa tttggacagt agatccacgt ggtggaattg atttgagtat gccatcaaaa 5161 aataccttag cccttgcaga aatacctgat ttcttttgga cggggcgcat aaaatgctta 5221 acaataaaag tcttagtttc tctttggata tcaccataag atcactgttg aagtccagcc 5281 aagttgtcat ctcatcacac atctcaacca acacctcttg tgaaaagatt ggcagaaaaa 5341 atctcttcgg gtcacagtaa aaagagccct tctcgacaag aacaccatct tcacatgtgt 5401 aaattgagta gcatcctgaa cgctcacctg tcttttgata aaaaagaata acaatcagtc 5461 catcagttaa aaagaatctc tgttggtagc actcccttag tctcacattg ccaaaatacc 5521 ttgagtattc tttttcatta ataagcaacc ttgaagaaaa cgaggttttc atagagttta 5581 ttaaggaaag agacactgag gagagtattt tgaaagcctc aacctgttca gcatctgaca 5641 tcatgtgaaa tgtcttatgt ttacattcag tgttaaagtt ccccccagca gagtatctca 5701 taatgggtct atcagaagat attgtctgag ctatagatat tagcttggat aaatcaggat 5761 atcttgccca caccgatatt tgtgaccaga tatcttgttg agacctcctc aagattttag 5821 agcaattctc cacccactta acattcgtct ctattttctt tttaattaat ttcaccgcct 5881 ccaaaaaatc caaggccctg tcattgacta gtgtgaccat ttgacccagg atgagaccag 5941 accttctcaa agagtccagt gcttcgggtt tggctgtaaa taggtcatct aaaaagttta 6001 ccattatgag atctagaaag agcaacgtat ttcttcgggt gttaaaaact tttaatccct 6061 taattttatt gcaaatcgac aacacgccta aataggcctt ctcatggggg cagacaacat 6121 actgatctaa ttcagagttt gatttctccc tgtcctgcct ttttagtgct gtgaaagtca 6181 aactatagat tatgtccatg aatttgttga cacaaaaggg acaatctgag gtggtgttat 6241 cctcatgttt ccctaaccta attaaattgt tcagaagctc tgatctatta gttatcctaa 6301 acttccccaa gtttttgtca aaaacttcct ttttgtacca gagtcttaat ttcattaatt 6361 cttccctaat cacagattct cgaacatcgc ctttgactcc aacaaataat gatttggcca 6421 gtctatcatc atactccaaa agatttcttt tgagacactc tatattctca aatcctaaac 6481 tttgattacg aacgtttgaa agactgtgta tcagacgttg atactcttga tcctcaatat 6541 caatctctgc ggacgatcgg gatttccgca aaatttcaat taacaaccat ctgatcctct 6601 ccaccaccca ttcaggagtg aacctattag aataattagt tctaccgtca acaacaggaa 6661 caagtgtgat gcctgacctc agaaggtctt gtttgagatt tagtagtttt gtcatgtcac 6721 tcttccattt tctctcaaat tcttctggat tgaccctcac aaatgtttcc aatagtatta 6781 atgtttttcc agtgagtttg aatccatcag gtatcacttt agggagagtg ggacatggga 6841 ttccatgatc actgaggatc acatcaacat atttttggca ggtgttcaag tggcatgcat 6901 gagattcaca ggactctaac tccacaaggt tggagagaag tttaaaacct tcttgaagca 6961 gtgaccctga atgacagtgt gacaggaagg catctttctg gtgggcaaac tcatgcctat 7021 ttggaatgtg ttttctaacc aagtctttga gttcagacac agtttcatcc atggtcaagt 7081 gccgcctagg atccactgtg cg // LOCUS DRHBCDA 922 bp ss-mRNA INV 25-MAY-1990 DEFINITION D.heteroneura bicoid (bcd) mRNA, 3' end. ACCESSION M32125 KEYWORDS bicoid protein. SOURCE D.heteroneura, cDNA to mRNA. ORGANISM Drosophila heteroneura Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta; Pterygota; Neoptera; Holometabola; Diptera; Brachycera; Cyclorrhapha; Schizophora; Drosophiloidea; Drosophilidae. REFERENCE 1 (bases 1 to 922) AUTHORS Macdonald,P.M. TITLE Bicoid mRNA localization signal: Phylogenetic conservation of functional and RNA secondary structure JOURNAL Unpublished (1990) Dept. Biological Sci., Stanford U., Stanford,CA STANDARD full staff_entry COMMENT Draft entry and printed sequence for [1] kindly submitted by P.M.Macdonald, 14-FEB-1990. FEATURES from to/span description pept < 1 36 bicoid protein (bcd) (AA at 1) BASE COUNT 309 a 157 c 140 g 316 t ORIGIN 1 ggtggcaaca cgcagttcgc atactgtttc aattaggctg ctacattaat aataattgta 61 aagaaaaaaa aaaaaaaaaa acaatttcaa tgtttgtgga gtttatagct ttaaccgaca 121 cgttgtttcc tgattgtaca aagatctaat aattgtagat atccatatgc atagtattta 181 agccaattta acattgttta aaccaagtat gtaacgcata gttgtagctg ccgagttttt 241 taatgcattt tctactgaca taaactgaag tgccattatt tggaaacttt atttgtaaag 301 cgttctttga tctcaacgct gtctggctgg acatttgcca aagcccatga atgcccaacc 361 agacactgtt gagacgaata tgggctttaa ttgaacgctt tacagaagaa gtttataaat 421 acacataaaa tgcatttctt tcttataact caggccaact ggcatagcaa gtaaaactgt 481 actgatctaa atgaaaaatg tttctcttcc tgataatcgc acgtccttct atggacatta 541 tatgccatta tcagtcgaag atcgaacgtt taaccaataa tgttcttcag aatattttta 601 agctttgtta tgccacagtt tgcatgttgc aattaaacat tcttcacgtg agaattgttt 661 acaaattgtc tctaaacaca tgtgaaccac aggcctcact tatctaaacg aaatctgtac 721 ttactttcag aatgtaagat agttttatca attgaaaatg cattccaaat atgtatctca 781 gttacccttg aatggtgcac catttaaata ttattctaaa cgtaacttat ctatatatct 841 gtatgaaata gtatatttgg ggatatattt aaaatattca gagtcctgtt ttgtatacta 901 ttttaataaa tcttacaaac ta // LOCUS DRQBCDA 987 bp ss-mRNA INV 25-MAY-1990 DEFINITION D.sechellia bicoid (bcd) mRNA, 3' end. ACCESSION M32124 KEYWORDS bicoid protein. SOURCE D.sechellia, cDNA to mRNA. ORGANISM Drosophila sechellia Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta; Pterygota; Neoptera; Holometabola; Diptera; Brachycera; Cyclorrhapha; Schizophora; Drosophiloidea; Drosophilidae. REFERENCE 1 (bases 1 to 987) AUTHORS Macdonald,P.M. TITLE Bicoid mRNA localization signal: Phylogenetic conservation of functional and RNA secondary structure JOURNAL Unpublished (1990) Dept. Biological Sci., Stanford U., Stanford,CA STANDARD full staff_entry COMMENT Draft entry and printed sequence for [1] kindly submitted by P.M.Macdonald, 14-FEB-1990. FEATURES from to/span description pept < 1 90 bicoid protein (bcd) (AA at 1) BASE COUNT 301 a 204 c 169 g 313 t ORIGIN 1 tatcaatgca cgatggatac gataatgcaa gcgtataatc cccatcggaa cgccgcgggc 61 aactcgcagt ttgcctactg cttcaattag cctggacgag aggcgtgtta gagagtttca 121 ttagctttag gttaaccact gttgttcctg attgtacaaa taccaagtga ttgtagatat 181 ctacgcgtag aaagttaggt ctagtcctaa gatccgtgta aatggttccc agggaagttt 241 tatgtactag cctagtcagc aggcgacacg gattccagtg catatcttag tgatactcca 301 gttaactcca tactttccct gcaatacgct attcgcctca gatgtatctg ggtggttgct 361 ccattaaagc ccaggaatat ccagccagtt acatttgagg ccatttgggc ttaagcgtat 421 tccatggaaa gttttcgccc cacatttcgg aaattaaatt ccgagccatc aagaaaatct 481 tctctgttac aatttgacat agctaaaaac tgtactaatc aaaatgaaaa atgtttctct 541 tgggcgtaat ctcatacaat gattaccctt aaagatcgaa catttaaaca ataatatttg 601 atatgatatt ttcaatttct atgctatgcc aaagtgtctg acataatcaa acatttgcac 661 attctttgac caagaatagt cagcaaattg tattttcaat caatgcagac catatgttcc 721 agtttcggag attttttgct gccaaacgga atacttataa aaacccaaat tctatttaca 781 tcactaagaa gagcattgca atctgtttag cctcaagttt aattttaaaa tggtgcacct 841 ttgacgctgt ctctttaagc tttgtatttt taattacgaa aatatataag aactactcca 901 ctcgggtaaa ttgtgactaa ctacacaact taacaaattt cggaattata cttagatact 961 tatcccatat ttccgtccct ttctaga // LOCUS DRSBCDA 978 bp ss-mRNA INV 25-MAY-1990 DEFINITION D.simulans bicoid (bcd) mRNA, 3' end. ACCESSION M32123 KEYWORDS bicoid protein. SOURCE D.simulans, cDNA to mRNA. ORGANISM Drosophila simulans Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta; Pterygota; Neoptera; Holometabola; Diptera; Brachycera; Cyclorrhapha; Schizophora; Drosophiloidea; Drosophilidae. REFERENCE 1 (bases 1 to 978) AUTHORS Macdonald,P.M. TITLE Bicoid mRNA localization signal: Phylogenetic conservation of functional and RNA secondary structure JOURNAL Unpublished (1990) Dept. Biological Sci., Stanford U., Stanford,CA STANDARD full staff_entry COMMENT Draft entry and printed sequence for [1] kindly submitted by P.M.Macdonald, 14-FEB-1990. FEATURES from to/span description pept < 1 90 bicoid protein (bcd) (AA at 1) BASE COUNT 296 a 203 c 171 g 308 t ORIGIN 1 tatcaatgca cgatggatac gataatgcaa gcgtataatc cccatcggaa ccgcgcgggc 61 aactcgcagt ttgcctactg cttcaattag cctggacgag aggcgtgtta gagagtttca 121 ttagctttag gttaaccact gttgttcctg attgtacaaa taccaagtga ttgtagatat 181 ctacgcgtag aaagttaggt ctagtcctaa gatctgtgta aatggttccc agggaagttt 241 tatgtactag cctagtcagc aggcgacacg gattccagtg catatcttag tgatactcca 301 gttaactcca tactttccct gcaatacgct attcgcctca gatgtatttg ggtggttgct 361 ccactaaagc ccaggaatat ccagccagtt acatttgagg ccatttgggc ttaagcgtat 421 tccatggaaa gttttctccc cacatttcgg aaattaaatt ccgagccagc aagaaaatct 481 tctctgttac aatttgacat ggctaaaaac tgtactaatc aaaatgaaaa atgtttctct 541 tgggcgtaat ctcatacaat gattaccctt aaagatcgaa catttaaaca ataatatttg 601 atatgatatt ttcaatttct atgctatgcc aaagtgtctg acataatcaa acatttgcac 661 attctttgac caagaatagt cagcaaattg tattttcaat caatgcagac catatgttcc 721 agtttcggag attttttgct gccaaacgga atacttataa aaacccacat tctatttaca 781 tcactaagaa gagcattgca atctgtttag cctcaagttt aattttaaaa tggtgcacct 841 ttgacgttgt ctctttaagc tttgtatttt taattacgaa aatatataag aactactcca 901 ctcgggtaaa ttgtgactaa ctacacaact taacaaattc cgaaggatac ttagcccata 961 tttccgtccc tttctaga // LOCUS DRTBCDA 963 bp ss-mRNA INV 25-MAY-1990 DEFINITION D.teissieri bicoid (bcd) mRNA, 3' end. ACCESSION M32121 KEYWORDS bicoid protein. SOURCE D.teissieri, cDNA to mRNA. ORGANISM Drosophila teissieri Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta; Pterygota; Neoptera; Holometabola; Diptera; Brachycera; Cyclorrhapha; Schizophora; Drosophiloidea; Drosophilidae. REFERENCE 1 (bases 1 to 963) AUTHORS Macdonald,P.M. TITLE Bicoid mRNA localization signal: Phylogenetic conservation of functional and RNA secondary structure JOURNAL Unpublished (1990) Dept. Biological Sci., Stanford U., Stanford,CA STANDARD full staff_entry COMMENT Draft entry and printed sequence for [1] kindly submitted by P.M.Macdonald, 14-FEB-1990. FEATURES from to/span description pept < 1 90 bicoid protein (bcd) (AA at 1) BASE COUNT 281 a 195 c 168 g 319 t ORIGIN 1 tatcaatgta cgatggatac gataatgcaa gcgtataatc cccatcgtaa cgccgcgggc 61 aactcgcagt ttgcctactg cttcaattag cctggacgaa cggcgtatta gagagtttta 121 ttagccttat gttaaccact gttgttcctg attgtacaaa taccaagtga ttgtagatat 181 ctacgcgtag aaagttaggt ctagtcctaa tggttcccag ggaagtttta tgtagtagac 241 tagtcagcag gctacacgga ttccagtgca cgtcataggg ataccctagt ttactcgata 301 ctttccctgt aatacgcttt tcgcctcaat tgtatctggg tggctgctcc ataaagcccg 361 ggaatatcca accagttaca tttgaagcca tttgggcttt agcgtattcc acagaaagtt 421 atcgtcccac aacttatatt ccgagccagc aagaaaatct tctctgtaac tatttgacat 481 agctaaaaac tgtactaatc aaaatgaaaa atgtttctct tgggcgtaat cttatacaat 541 gattaccctc aaagatcgaa catttaagca ataatatttg atatgatatt ttcaatttct 601 atgctatgcc aaagcgtctg acataatcaa acatttgcac attcttcgac caagaatagt 661 cagcaaattg tattttcaat caatgcagat catgtgttcc agtttcggag attttttgct 721 gtcaacggaa taactatcct acccacactg tctatttcta tcacttagaa gagcattgaa 781 atctgttttg tttagcctca actttaattt taaaatggtg cacctttgat gttgtctttt 841 taagatttgt atttttaatt atgaaaatat ataagaacta ctccactcgg gtaatttgtg 901 actaacttag tattttaaca aatttcaagg gacgcgtagg cattctttcc ctctctttct 961 aga // LOCUS DRVBCDA 1023 bp ss-mRNA INV 25-MAY-1990 DEFINITION D.virilis bicoid (bcd) mRNA, 3' end. ACCESSION M32122 KEYWORDS bicoid protein. SOURCE D.virilis, cDNA to mRNA. ORGANISM Drosophila virilis Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta; Pterygota; Neoptera; Holometabola; Diptera; Brachycera; Cyclorrhapha; Schizophora; Drosophiloidea; Drosophilidae. REFERENCE 1 (bases 1 to 1023) AUTHORS Macdonald,P.M. TITLE Bicoid mRNA localization signal: Phylogenetic conservation of functional and RNA secondary structure JOURNAL Unpublished (1990) Dept. Biological Sci., Stanford U., Stanford,CA STANDARD full staff_entry COMMENT Draft entry and printed sequence for [1] kindly submitted by P.M.Macdonald, 14-FEB-1990. FEATURES from to/span description pept < 1 90 bicoid protein (bcd) (AA at 1) BASE COUNT 326 a 176 c 160 g 361 t ORIGIN 1 tatcaatgta cgatggatac gataatgcaa gcgtataatc cgcatcgtaa tgccggtggc 61 aacacgcagt tcgcctactg ttttaattag atcgggacac caggcgcccc atataatata 121 ataattattt taagtacaca aattttaatg tttgtggtgt ttctttagct ttaagtgaca 181 cgttagttcc tgattgtaca aagatttagt gattgtagat atctatgcgt agtattttaa 241 gcaaattatc aatattttaa atgaattttg taaagcttaa ttaagtcata gctgtaattt 301 tccaagcatt ttcaatgtat ttccataaga aacttccttg gtaaaacctt cattggtctc 361 aactctgtct ggctgggcat ttgccaaagc ccatgagtgc ccatccagac atacttgaga 421 caataatggg cttaagctga aggttttacc gaagaagttt tcgtatgaaa atctcatgta 481 agattcattt ctataatttt ggcaccttgg catagcttac aaaaaaactg cactgatcta 541 aatggaaaat gtttctcttc ctgataatcg cacgcctttc tttttggcct gccattatca 601 gccgaagatc gaacatttaa acaataatac tcgtcagaat atttttaatt cttgctatgc 661 cccggttttc atgttacaat tgaacattct tcacgtgaga attgttcaaa tattgtacca 721 caatacatga ataccacatg cctaagttat cgaaacgaat actatatatt actggaggat 781 tttcgttttt ttcttataat tgaaaatggg ttgaaaattt accagttacc ttacgaatcg 841 tgcacctttc aactgatgtt ttaaagctaa gttatcttta atatagtgca tttgggaata 901 tatattaaat gttcaataca atatttccat aaatttatat attaataaat cttacaaact 961 aaccaatttt aagaagtgtt ttgcttttaa cgttttacga taaacaaagt aaacgctttc 1021 gaa // LOCUS M33420 71 bp ds-DNA BAD 25-MAY-1990 DEFINITION Figure 3. Nucleotide sequence of junctions in the multicloning site and promoter probe plasmids. ACCESSION M33420 KEYWORDS . SOURCE Synthetic DNA. ORGANISM Artificial gene Artificial sequences; Genes. REFERENCE 1 (bases 1 to 71) AUTHORS Joseph-Liauzun,E., Fellay,R. and Chandler,M. TITLE Transposable elements for efficient manipulation of a wide range of gram-negative bacteria: Promoter probes and vectors for foreign genes JOURNAL Gene 85, 83-89 (1989) STANDARD simple staff_entry BASE COUNT 20 a 15 c 13 g 23 t ORIGIN 1 aattcggtaa tgactccaac ttattgatag agtactcata tgtctagact cgaggatccc 61 gtcgttttac a // LOCUS MUSTCM1AA 805 bp ds-DNA ROD 25-MAY-1990 DEFINITION Mouse T-cell receptor germline alpha-chain V-region, 5' end. ACCESSION M33586 KEYWORDS T-cell receptor; T-cell receptor alpha-chain; germline; variable region. SOURCE Mouse (strain C.B20) lymphocyte DNA, clone p2111. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 805) AUTHORS Fondell,J.D., Marolleau,J.-P., Primi,D. and Marcu,K.B. TITLE On the mechanism of non-allelically excluded V-alpha-J-alpha T cell receptor secondary rearrangements in a murine T cell lymphoma JOURNAL J. Immunol. 144, 1094-1103 (1990) STANDARD simple staff_entry FEATURES from to/span description pept 188 > 526 T-cell receptor alpha-chain V-region sigp 188 244 T-cell receptor alpha-chain V-region signal peptide matp 245 526 T-cell receptor alpha-chain V-region BASE COUNT 169 a 218 c 203 g 215 t ORIGIN 1 caagaatcag atattcaaaa ccttccaggc tcgtgtttgc agcgtcagtt tttatgtgaa 61 cgggagactt ctccagcaga ctcttcccta cacctgcttc aagctcttgc tttcctcagc 121 ctctttcctg cctgtactgt tccagagttc ctccacgaca gagctgcagc cttcccaagg 181 ctcagccatg ctcctggcgc tcctctcagt cctggggata cactttctcc tgagagatgc 241 ccaagctcag tcagtgacac agcccgatgc tcgcgtcact gtctctcaag gagcctctct 301 gcagctgaga tgcaagtatt cctactctgg gacaccttat ctgttctggt atgtccagta 361 cccgcggcag gggctgctgc tgctcctcaa gtactattcc ggagacccag tggttcaagg 421 agtgaacggc tttgaggctg agttcagcaa gagtaactct tccttccacc cgcggaaagc 481 ctccgtgcac tggagcgact cggctgtgta cttctgtgct gtgagcgcac agtgtgtggg 541 gctgcagggg gagctgaaca caaacaccga tggctgtggt gctcagactc aagatctctc 601 ctgtggtctc tgtggcatct ctttggactt ggagagcaaa gaaggaaagt ttcgcagtct 661 cagctgtctt tctgcaactc atttccaact aaaacatcca ctatatgcaa tgctcaattg 721 tagttctgtg gaatgaggag actgtgatgt tatcctgagt gtatggctga gtgaacattg 781 cttgggcagt gactagacta gggta // LOCUS MUSTCM1AB 265 bp ss-mRNA ROD 25-MAY-1990 DEFINITION Mouse T-cell receptor rearranged mRNA alpha-chain V-region, partial cds. ACCESSION M33587 KEYWORDS T-cell receptor; T-cell receptor alpha-chain; constant region; joining exon; variable region. SOURCE Mouse (strain BALB/c) lymphocyte, cDNA to mRNA, clone M14T-1. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 265) AUTHORS Fondell,J.D., Marolleau,J.-P., Primi,D. and Marcu,K.B. TITLE On the mechanism of non-allelically excluded V-alpha-J-alpha T cell receptor secondary rearrangements in a murine T cell lymphoma JOURNAL J. Immunol. 144, 1094-1103 (1990) STANDARD simple staff_entry FEATURES from to/span description pept < 1 > 265 T-cell receptor alpha-chain V-region (AA at 2) BASE COUNT 54 a 78 c 67 g 64 t 2 others ORIGIN 1 cctctcctgg tacatccagc accatggccg tggcctccag tttctcctca agtactattc 61 gggaaaccca gtggttcaag gagtgaacgg cttcgaggct gagttcagca agagcgactc 121 ttccttccac cttcggaaag cctccgtgca ctggagcgac tcggctgtgt acttctgtgc 181 tgcgaccngn ttatctggta gcttcaataa gttgaccttt ggagcaggga ccagactggc 241 tgtgtgccca tacatccaga accca // LOCUS MUSTCM1AC 163 bp ss-mRNA ROD 25-MAY-1990 DEFINITION Mouse T-cell receptor rearranged mRNA alpha-chain V-region, partial cds. ACCESSION M33588 KEYWORDS T-cell receptor; T-cell receptor alpha-chain; constant region; joining exon; variable region. SOURCE Mouse (strain BALB/c) lymphocyte, cDNA to mRNA, clone M14T-6. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 163) AUTHORS Fondell,J.D., Marolleau,J.-P., Primi,D. and Marcu,K.B. TITLE On the mechanism of non-allelically excluded V-alpha-J-alpha T cell receptor secondary rearrangements in a murine T cell lymphoma JOURNAL J. Immunol. 144, 1094-1103 (1990) STANDARD simple staff_entry FEATURES from to/span description pept < 1 > 163 T-cell receptor alpha-chain V-region (AA at 3) BASE COUNT 42 a 42 c 44 g 35 t ORIGIN 1 tccacttgca gaaagcctca gtgcaagagt cagactcggc tgtgtactac tgtgctctga 61 ggccggggac acaggttgtg gggcagctca ctttcgggag cggtacaaga ctccaagttt 121 atgcaaacat ccagaaccca gaacctgctg tgtaccagtt aaa // LOCUS MUSTCM1AD 287 bp ss-mRNA ROD 25-MAY-1990 DEFINITION Mouse T-cell receptor rearranged mRNA alpha-chain V-region, partial cds. ACCESSION M33589 KEYWORDS T-cell receptor; T-cell receptor alpha-chain; constant region; joining exon; variable region. SOURCE Mouse (strain BALB/c) lymphocyte, cDNA to mRNA, clone M14T-1. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 287) AUTHORS Fondell,J.D., Marolleau,J.-P., Primi,D. and Marcu,K.B. TITLE On the mechanism of non-allelically excluded V-alpha-J-alpha T cell receptor secondary rearrangements in a murine T cell lymphoma JOURNAL J. Immunol. 144, 1094-1103 (1990) STANDARD simple staff_entry FEATURES from to/span description pept < 1 > 287 T-cell receptor alpha-chain V-region (AA at 3) BASE COUNT 69 a 88 c 67 g 63 t ORIGIN 1 ctattgcctt tttctggtat gtgcaatatc tcaacgaagc ccctcaggta ctcctgcgga 61 gctccacaga caacaagagg accgagcacc aagggttcca cgccactctc cataagagca 121 gcagctcctt ccatctgcag aagtcctcag cgcagctgtc agactctgcc ctgtactact 181 gtgctctgat ccggggacac aggttgtggg gcagctcact ttcgggagcg gtacaagact 241 ccaagtttat gcaaacatcc agaacccaga acctgctgtg taccagt // LOCUS MUSTCM1AE 283 bp ss-mRNA ROD 25-MAY-1990 DEFINITION Mouse T-cell receptor rearranged mRNA alpha-chain V-region, partial cds. ACCESSION M33590 KEYWORDS T-cell receptor; T-cell receptor alpha-chain; constant region; joining exon; variable region. SOURCE Mouse (strain BALB/c) lymphocyte, cDNA to mRNA, clone M14T-6. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 283) AUTHORS Fondell,J.D., Marolleau,J.-P., Primi,D. and Marcu,K.B. TITLE On the mechanism of non-allelically excluded V-alpha-J-alpha T cell receptor secondary rearrangements in a murine T cell lymphoma JOURNAL J. Immunol. 144, 1094-1103 (1990) STANDARD simple staff_entry FEATURES from to/span description pept < 1 > 283 T-cell receptor alpha-chain V-region (AA at 1) BASE COUNT 73 a 88 c 62 g 60 t ORIGIN 1 attgcctttt tctggtatgt gcaatatctc aacgaagccc ctcaggtact cctgcggagc 61 tccacagaca acaagaggac cgagcaccaa gggttccacg ccactctcca taagagcagc 121 agctccttcc atctgcagaa gtcctcagcg cagctgtcag actctgccct gtactactgt 181 gctctgagaa cctcaggagg aaactacaaa cctacgtttg ggaaagggac cagcctcgtg 241 gttcatccat acatccagaa cccagaacct gctgtgtacc agt // LOCUS MUSTCM1AF 246 bp ss-mRNA ROD 25-MAY-1990 DEFINITION Mouse T-cell receptor rearranged mRNA alpha-chain V-region, partial cds. ACCESSION M33591 KEYWORDS T-cell receptor; T-cell receptor alpha-chain; constant region; joining exon; variable region. SOURCE Mouse (strain BALB/c) lymphocyte, cDNA to mRNA, clone M14T-7. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 246) AUTHORS Fondell,J.D., Marolleau,J.-P., Primi,D. and Marcu,K.B. TITLE On the mechanism of non-allelically excluded V-alpha-J-alpha T cell receptor secondary rearrangements in a murine T cell lymphoma JOURNAL J. Immunol. 144, 1094-1103 (1990) STANDARD simple staff_entry FEATURES from to/span description pept < 1 > 246 T-cell receptor alpha-chain V-region (AA at 3) BASE COUNT 69 a 80 c 53 g 44 t ORIGIN 1 aatatctcaa cgaagcccct caggtactcc tgcggagctc cacagacaac aagaggaccg 61 agcaccaagg gttccacgcc actctccata agagcagcag ctccttccat ctgcagaagt 121 cctcagcgca gctgtcagac tctgccctgt actactgtgc tctgagaacc tcaggaggaa 181 actacaaacc tacgtttggg aaagggacca gcctcgtggt tcatccatac atccagaacc 241 cagaac // LOCUS SYNCATAA 66 bp ds-DNA SYN 25-MAY-1990 DEFINITION Synthetic chloramphenicol acetyl-transferase (cat) gene, partial cds. ACCESSION M33419 KEYWORDS chloramphenicol acetyl-transferase. SOURCE Synthetic DNA, clones pEJL4[30,49]. ORGANISM Artificial gene Artificial sequences; Genes. REFERENCE 1 (bases 1 to 66) AUTHORS Joseph-Liauzun,E., Fellay,R. and Chandler,M. TITLE Transposable elements for efficient manipulation of a wide range of gram-negative bacteria: Promoter probes and vectors for foreign genes JOURNAL Gene 85, 83-89 (1989) STANDARD simple staff_entry BASE COUNT 16 a 12 c 18 g 20 t ORIGIN 1 aattcggtga tgctgccaac ttactgattt agtggatccg gtgattgatt gagcaagctt 61 cgacga // LOCUS SYNIS1AA 65 bp ss-mRNA SYN 25-MAY-1990 DEFINITION Synthetic insertion element IS1. ACCESSION M33418 KEYWORDS . SOURCE Synthetic DNA. ORGANISM Cloning vector Artificial sequences; Cloning vehicles. REFERENCE 1 (bases 1 to 65) AUTHORS Joseph-Liauzun,E., Fellay,R. and Chandler,M. TITLE Transposable elements for efficient manipulation of a wide range of gram-negative bacteria: Promoter probes and vectors for foreign genes JOURNAL Gene 85, 83-89 (1989) STANDARD simple staff_entry BASE COUNT 21 a 12 c 11 g 21 t ORIGIN 1 aattcggtaa tgactccaac ttattgatag agtactcata tgtctagact cgaggatcca 61 aattt //