Path: utzoo!attcan!uunet!snorkelwacker!bionet!root From: GenBank-Updates@genbank.bio.net Newsgroups: bionet.molbio.genbank.updates Subject: Database Update Message-ID: Date: 4 Aug 90 12:00:30 GMT Sender: root@genbank.BIO.NET Distribution: bionet Lines: 6819 Approved: lear@genbank.bio.net Checksum: 51113 422 LOCUS M25435 120 bp ds-DNA BAD 04-AUG-1990 DEFINITION Figure 6. Possible secondary structure model of 5S RNA. ACCESSION M25435 REFERENCE 1 (bases 1 to 120) AUTHORS Rabin,D. and Crothers,D.M. TITLE analysis of rna secondary structure by photochemical reversal of psoralen crosslinks JOURNAL Nucleic Acids Res. 7, 689-703 (1979) STANDARD unannotated staff_entry BASE COUNT 23 a 36 c 41 g 20 t ORIGIN 1 tgcctggcgg ccgtagcgcg gtggtcccac ctgaccccat gccgaactca gaagtgaaac 61 gccgtagcgc cgatggtagt gtggggtctc cccatgcgag agtagggaac tgccaggcat // LOCUS M25592 55 bp ds-DNA BAD 04-AUG-1990 DEFINITION Figure 4. The nucleotide sequence of the fragment is given at the top of the figure. ACCESSION M25592 REFERENCE 1 (bases 1 to 55) AUTHORS Ogata,R.T. and Gilbert,W. TITLE dna-binding site of lac repressor probed by dimethylsulfate methylation of lac operator JOURNAL J. Mol. Biol. 132, 709-728 (1979) STANDARD unannotated staff_entry BASE COUNT 16 a 9 c 16 g 14 t ORIGIN 1 cggctcgtat gttgtgtgga attgtgagcg gataacaatt tcacacagga aacag // LOCUS M25594 75 bp ds-DNA BAD 04-AUG-1990 DEFINITION Figure 4. (a) the cloverleaf model for the base pairing of native tRNA-Trp. ACCESSION M25594 REFERENCE 1 (bases 1 to 75) AUTHORS Jones,C.R., Kearns,D.R. and Muench,K.H. TITLE nuclear magnetic resonance of the base-pairing structure of the native and denatured conformers of escherichia coli transfer rna-trp JOURNAL J. Mol. Biol. 103, 747-764 (1976) STANDARD unannotated staff_entry BASE COUNT 13 a 21 c 24 g 14 t 3 others ORIGIN 1 aggggcgtag ttcaaddggd agagcaccgg tctccaaacc gggtgttggg agttcgagtc 61 tctccgcccc tgcca // LOCUS M25597 78 bp ds-DNA BAD 04-AUG-1990 DEFINITION Figure 1: (A) The cloverleaf structure of E. coli tRNA-Met-f1. ACCESSION M25597 REFERENCE 1 (bases 1 to 78) AUTHORS Daniel,W.E.Jr. and Cohn,M. TITLE changes in tertiary structure accompanying a single base change in transfer rna. protein magnetic resonance and aminoacylation studies of escherichia coli trna-met-f1 and trna-met-f3 and their spin-labeled (s4u8) derivatives JOURNAL Biochemistry 15, 3917-3924 (1976) STANDARD unannotated staff_entry BASE COUNT 15 a 26 c 25 g 12 t ORIGIN 1 cgcggggtgg agcagcctgg tagctcgtcg ggctcataac ccgaaggatc gtcggttcaa 61 atccggcccc cgcaacca // LOCUS M26841 459 bp ds-DNA BAD 04-AUG-1990 DEFINITION Figure 1. Nucleotide sequence coding for mature Il-1-beta and the corresponding amino acid sequence. ACCESSION M26841 REFERENCE 1 (bases 1 to 459) AUTHORS Gronenborn,A.M., Clore,G.M., Schmeissner,U. and Wingfield,P. TITLE a h-nmr study of human interleukin-1-beta JOURNAL Eur. J. Biochem. 161, 37-43 (1986) STANDARD unannotated staff_entry BASE COUNT 143 a 109 c 106 g 101 t ORIGIN 1 gcacctgtac gatcactgaa ctgcacgctc cgggactcgc agcaaaaaag cttggtgatg 61 tctggtccat atgaactgaa agctctccac ctccagggac aggatatgga gcaacaagtg 121 gtgttctcca tgtcctttgt acaaggagaa gaaagtaatg acaaaatacc tgtggccttg 181 ggcctcaagg aaaagaatct gtacctgtcc tgcgtgttga aagatgataa gcccactcta 241 cagctggaga gtgtagatcc caaaaattac caaaagaaga agatggaaaa gcgatttgtc 301 ttcaacaaga tagaaatcaa taacaagctg gaatttgagt ctgcccagtt ccccaactgg 361 tacatcagca cctctcaagc agaaaacatg cccgtcttcc tgggagggac caaaggcggc 421 caggatataa ctgacttcac catgcaattt gtgtcttcc // LOCUS M28421 142 bp ds-DNA BAD 04-AUG-1990 DEFINITION Figure 5. The unique Xho I site of phi X174 DNA. ACCESSION M28421 REFERENCE 1 (bases 1 to 142) AUTHORS Lam,L.H. and Reynolds,R.J. TITLE dna sequence dependence of closely opposed cyclobuty1 pyrimidine dimers induced by uv radiation JOURNAL Mutat. Res. 178, 167-176 (1987) STANDARD unannotated staff_entry COMMENT BASE COUNT 43 a 27 c 28 g 44 t ORIGIN 1 gctttatcaa gataattttt cgactcatca gaaatatccg aaagtgttaa cttctgcgtc 61 atggaagcga taaaactctg caggttggat acgccaatca tttttatcga agcgcgcata 121 aatttgagca gatttgtcgt ca // LOCUS M28574 160 bp ds-DNA BAD 04-AUG-1990 DEFINITION Figure 4. Sequence of tyrT DNA. ACCESSION M28574 REFERENCE 1 (bases 1 to 160) AUTHORS Drew,H.R. and Travers,A.A. TITLE dna bending and its relation to nucleosome positioning JOURNAL J. Mol. Biol. 186, 773-790 (1985) STANDARD unannotated staff_entry COMMENT BASE COUNT 40 a 42 c 38 g 40 t ORIGIN 1 ccggttacct ttaatccgtt acggatgaaa attacgcaac cagttcattt ttctcaacgt 61 aacactttac agcggcgcgt catttgatat gatgcgcccc gcttcccgat aagggagcag 121 gccagtaaaa agcattaccc cgtggtgggg gttcccgaga // LOCUS M28575 169 bp ds-DNA BAD 04-AUG-1990 DEFINITION Figure 4. Sequence of tyrT DNA. ACCESSION M28575 REFERENCE 1 (bases 1 to 169) AUTHORS Drew,H.R. and Travers,A.A. TITLE dna bending and its relation to nucleosome positioning JOURNAL J. Mol. Biol. 186, 773-790 (1985) STANDARD unannotated staff_entry COMMENT BASE COUNT 42 a 46 c 40 g 41 t ORIGIN 1 aattccggtt acctttaatc cgttacggat gaaaattacg caaccagttc atttttctca 61 acgtaacact ttacagcggc gcgtcatttg atatgaagcg ccccgcttcc cgataaggga 121 gcaggccagt aaaaagcatt accccgtggt gggggttccc gagccccgg // LOCUS RCANIFR4 2583 bp ds-DNA BCT 04-AUG-1990 DEFINITION Rhodobacter capsulatus nitrogen fixation (nifR4 and nifA) genes, 5' end. ACCESSION X12358 KEYWORDS nifA gene; nifR4 gene; nitrogen fixation. SOURCE Rhodobacter capsulatus (strain SB1003) DNA, clone pRCN200. ORGANISM Rhodobacter capsulatus Prokaryota; Bacteria; Gracilicutes; Anoxyphotobacteria; Purple nonsulfur bacteria. REFERENCE 1 (bases 1 to 2583) AUTHORS Jones,R. and Haselkorn,R. TITLE The DNA Sequence of the Rhodobacter capsulata ntrA, ntrB and ntrC gene analogues required for nitrogen fixation JOURNAL Mol. Gen. Genet. (1988) In press STANDARD simple automatic COMMENT See X07567 for overlapping sequence [1] Author address Jones R., University of Southern California, Mathematics Department DRB289 University Park Los Angeles, CA 90089-1113 USA. Submitted (15-JUL-1988) on tape to the EMBL data library by: FEATURES from to/span description pept 644 1924 nifR4 protein pept 2302 > 2583 nifA protein BASE COUNT 406 a 906 c 835 g 436 t ORIGIN 1 bp upstream of HindIII site. 1 aagcttgctt gcggggcccg ttacggttgg cacgcttctt gctggatcat ttctgaagcc 61 tcaaccgatc aaaaaggggg atgcgatgaa ggatctcttc gacgaaagcc tgacgctgga 121 caccgcgctg ccgcgcccca ggcaccgccc gccgccgcgc cctggcgccg gcaggccgcg 181 gcgaaggccc ccgacaccgc cgccttcctt gcgaatttcg tcaggatcgg agagatcgcc 241 gccccgaaac ccccggccgc ggcctgtttc cgcccgagga agaagccgcg gtcgtcgccg 301 agctgatcgc cgagatgcgc ccgatgttcc agcgcgacgg cggcgacatc gagctgatcg 361 gcctcaccgg cgccaccgtt caggtgcggc tttcgggcag ctgcgcgggc tgcatgatgt 421 cggcgcgcac gctttccacg gtgcaacacc agctgatcga aaccctgggc cgccggtccg 481 ggtcgtcccc gagatccggc actgagatga tcgcccgccg ccccgcccaa aaacacggca 541 caacccaggg tggcgcggcg cggatttcgg cacctgaccc tttcttcacc tgcggcgccc 601 tgccgcccca cgcaatgacc cgagtttgag ataggttgtc cccatggaac tcgcgcagac 661 gctttcccaa agacagacga tgcagatggc ggggcagatg ctccattccc tcgccatcct 721 tggcatgtcg tcgcaggatt tgtccgagca tctgaccgaa caggccacat cgaacccctt 781 tctgacctac cgcgccccgc ccgccttcat cgcccgcggc ggcgaggatt tcgacgccgt 841 gggcgcggtg gccgcgcaca agcccagcct gatggcccat gtggtcgatc agatcgagat 901 ggccttcacc gagacccccg accggctgct ggcgctgcgc tttgccgagg cgctggagcc 961 gagcggctgg ctgggccaga gcctcgacag catcgcgctg gcggcgggcg tgtcgctttc 1021 gcgcgccgaa tcgatgctgg cggtcctgca gggcttcgag cctaccggcc ttttcgcccg 1081 cgacctgtcg gattgcctga tcctgcaggc gcgcgaggcc gacatcctga cctgggaagt 1141 cgaaacgctg atccgcaaca ttcggctgat cgcggaaaac cggctgtccg atctggccga 1201 tctttgcgat tgcgacatcg gcgacatccc cgaaatcatc aagcagatcc ggcacttgaa 1261 cccgaagccg gggcttgctt tcgaccatca gccgacgccc gttttcccgc ccgatctgat 1321 cgccgtgcgc ggcgccgagg gctggacggt ggaactcaac cgcgccacct cgcccacgat 1381 caccgtgcgc gaagatcgtt tcgccgatgg caccgccgat gcgaaggcgc gcgcggaacg 1441 gcgaaggcgt ggccgaggcc cgggcgctgg cgaggcgctg gagcgacgcc gcgatacgct 1501 gttgcgcacc gccgccgtgc tggtggcgcg gcaatcggcc ttcctcgaca aggggcccgc 1561 ccatctggtg ccgctgacgc tggaggatgt ggcctcggaa ctgggcctgc atgcctcgac 1621 gatttcgcgc gcggtctcgg ggcggatgat ccagacccag acccgggcgc tgccgctgcg 1681 ggcgttcttt tcccgcgcgg tctcgaccca gggcggtggc gaggcggtgt cgcgcgacag 1741 ccttgatttc gtgcagcgca cgtgggcggc gaagatccgc caaaacccgc tgtcggatga 1801 tgcgatcgtg acgctcgccg aacgcgccgg gctgcggatc gcgcggcgga cggtggcgaa 1861 ataccgctcg acgctgggcc ttgcctcctc ctacgagcgt cgccgcgccg ccgccgcgcg 1921 ctgagtctgc gaaacgcgac ctctgccaga ccggaagcgc catttttttc ggcgctccgg 1981 ccattcccgc ctcaaagttg atcttaacga agcctcaatc acccccaagt tgcgacatat 2041 tggcacctga gactgtttgc gccgtgcatc cgggcgcaag cttgaccaag tctcgacatg 2101 tccgccacgg cacccaaccg cgatcgaacg atccacgcag cgcagactgc ggcaccccat 2161 gggagggggt gatcgtcgtt tgatcccccg gcgccgggca agcgagggag aacagtgatg 2221 atcgacatcc gggaccggct cgtcccccaa ccgcagcaag acatcgctcg gcgcgggcca 2281 cggccgaccg ccttgccttg gatgcgcttt acgagatcgc gaagaccttt gcggcggccg 2341 gatccggtgg cgaggtgccg agatcttcaa cgtgctgtcg tctttcctcg atctgcgcca 2401 tggggtgctg gcgttctggc cgagccgggc gagggcgcgg gggtgaaccc ctatgtcatc 2461 gccgcgacgg cctttcagcg cagccccgag gcccccgccg cggatgtgct gcccgatgcc 2521 gtggcgcgga tcgtctttcg ctccggcgtg cctttcgtca gctttgatct ggccgccgaa 2581 ttc // LOCUS ACTCPA 634 bp ds-DNA PLN 04-AUG-1990 DEFINITION A.deliciosa actinidin gene, 5' end. ACCESSION M35795 KEYWORDS actinidin. SOURCE A.deliciosa (var. deliciosa, strain cv.Hayward) leaf DNA, clone pKIWI 37. ORGANISM Actinidia deliciosa Eukaryota; Plantae; Embryobionta; Magnoliophyta; Magnoliopsida; Dilleniidae; Theales; Actinidiaceae. REFERENCE 1 (bases 1 to 634) AUTHORS Keeling,J., Maxwell,P. and Gardner,R.C. TITLE Nucleotide sequence of the promoter region from kiwifruit actinidin genes JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by J.Keeling, 05-JUL-1990. Author address: J.Keeling Centre for Gene Technology Dept of Cellular and Molecular Biology University of Auckland Private Bag Auckland NEW ZEALAND email: SYSTEM%CELVAX.AUKUNI.AC.NZ FEATURES from to/span description pept 595 > 634 actinidin mRNA 558 > 634 actinidin mRNA signal 507 514 TATA box BASE COUNT 245 a 120 c 71 g 198 t ORIGIN 1 caaaccgtag ttacaaggtg ggcctgaacc agtttgctga cctgactgat gaggagttcc 61 gataaattag ctttcttttt ttgtcttcac ctattttctt tttcaaattt gatttacatt 121 tgtacattta aaattatttt cgccaagaaa aaccgaataa tataaaaatt tgatcaaaca 181 attaaaacaa ttcattaaag gcgaaaaata taaaaaccaa atcaactaaa aaatatcaac 241 aaaaaaataa ctaatcaaaa ctgggcataa atagaaagga caatttaata attttgtata 301 ataaaaacaa aaataaagta tttaaagttg tggttggata aggatttaaa gaagaaaaaa 361 aattaaatct aaatcattga aatttaattt tatatttttt ttctcttttt tctactgaat 421 ctgcagttcc aacagaacct ttaaaaaaaa ttgtgaaaat cattttttca aatgtcgtaa 481 gacccccccc accccccacg caccctatat aaaggccact ctctccctcc acattcacac 541 acctccaatc ccaatctttt tcttctaaaa ttcaaaaaac gagagagaac aaaaatgggt 601 ttgcccaaat ccttcgtgtc aatgtctctc ctct // LOCUS BOVAPOAI 933 bp ss-mRNA MAM 04-AUG-1990 DEFINITION B.taurus apolipoprotein A-I mRNA, complete cds. ACCESSION M35870 KEYWORDS apolipoprotein A-I. SOURCE B.taurus liver, cDNA to mRNA. ORGANISM Bos taurus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Artiodactyla; Ruminantia; Pecora; Bovidae. REFERENCE 1 (bases 1 to 933) AUTHORS O'hUigin,C., Chan,L. and Li,W.-H. TITLE Cloning and sequencing of bovine apolipoprotein A-I cDNA and molecular evolution of apolipoproteins A-I and B-100 JOURNAL Mol. Biol. Evol. (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by C. O'hUigin, 27-JUN-1990. FEATURES from to/span description pept 73 870 apolipoprotein A-I precursor sigp 73 127 apolipoprotein A-I signal peptide matp 145 867 apolipoprotein A-I signal 914 919 poly-A signal BASE COUNT 201 a 269 c 313 g 150 t ORIGIN 1 cgccaattac aggcctcgaa agagtcctgt attgttattt ttcgtcacta cctccccggg 61 tcgggattca agatgaaagc cgtggtgctg accttggctg tgctcttcct gacggggagc 121 caggctcggc atttctggca gcaagatgac ccccagtcat cctgggatcg ggtgaaggat 181 tttgccaccg tgtatgtgga agcaatcaag gatagtggca gagactatgt ggcccaattc 241 gaagcctccg ctttgggaaa acagctcaac ctgaaactcc tggacaactg ggacaccctg 301 gccagcacgt tgtccaaagt gcgtgaacag ctgggcccag tgacccagga gttctgggac 361 aacctggaaa aggagaccgc gtcgctgagg caggagatgc acaaggacct ggaggaggtg 421 aagcagaagg tgcagcccta cctggacgag ttccagaaga agtggcacga ggaggtggag 481 atctaccgcc agaaggtggc gccgctgggc gaggagtttc gcgagggcgc gcgccagaag 541 gtgcaggagc tgcaggacaa gctgagccct ctggcccagg agctccgcga ccgcgcgcgc 601 gcccacgtgg agacgctgcg gcagcacgtg gcgccctaca gcgacgacct gcgccagcgg 661 ctgaccgcgc gcctggaggc gctcaaggag ggcggcggca gcctggccga gtaccacgcc 721 aaggccagcg agcagctgaa ggcgctgggc gagaaggcca agccggtgct ggaggacctc 781 cgccagggcc tgctgcccgt gctggagagc ctcaaggtca gcatcctggc cgccatagac 841 gaggcctcca agaagctgaa cgcccagtga gatgcctcgg gctgccccgc tggccatcac 901 ttcggtttct tagaataaac atttccggag tgg // LOCUS DOGINVOLA 864 bp ds-DNA MAM 04-AUG-1990 DEFINITION Dog involucrin gene, complete cds. ACCESSION M34442 KEYWORDS involucrin. SOURCE Dog adult skin keratinocyte DNA. ORGANISM Canis familiaris Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Carnivora; Caniformia; Canoidea; Canidae. REFERENCE 1 (bases 1 to 864) AUTHORS Tseng,H. and Green,H. TITLE The involucrin genes of pig and dog: Comparison of their segments of repeats with those of prosimians and higher primates JOURNAL Mol. Biol. Evol. (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by H.Tseng and H.Green, 23-MAY-1990. FEATURES from to/span description pept 1 858 involucrin rpt 232 513 repeat segment BASE COUNT 286 a 226 c 249 g 103 t ORIGIN 1 atgtcccagc aacacactct gccagtgacc ctgccccctg ccctcagtca ggggcccctc 61 aagcctgagt ctcctcccat cgatactcaa caagagcagg tgaaacagcc gacttcacta 121 cctgtcctgt gccaaaaaat gccctcttct ctcccaggga aggttccctt ggggcatggg 181 gagaaacata caagtcttgt gaagggggaa cctgagcaac agtgtgagcc acaggagcag 241 gaacagcagc agaaacaaca agagtcacag gagcagaaac tgcacctgga acagtgtctg 301 gaacagcatc aggaacaaca agagtcacag gaccagaaac tgtacccgga gcagtgtctg 361 gaacagcagc aggagcaaca agagtcacag gaccagaaac tgtacccgga gcagtgtctg 421 gaacagcagc aggagcaaca agagtcacag gagaaggaac ttcatctgga acaggaacag 481 cagaaggagg agctacagca gcaggaacag cagcagggaa aggaacagtg tgagaaacat 541 caggaagcaa aaaatctgga gcagcagcta gagcagatag gagcacaaag gaagcagcag 601 caaaaggaac agctggaaca ggaaaagaag ctcgtggacc agcacctgga tcaagagcca 661 gcacagagaa ctgagcaacc ggaaaagaaa gaagagcagg tgctggagca gcaggggcag 721 caggaggggc agctggagca gcctgccttt gtcccagctc aggtccaggt ccgagaaccc 781 ctgaagggag aagtcctgcc cctcatagag cagcagcacc agaagcagga ggttcatgac 841 cctccagaac accagtgagc accc // LOCUS DRODMRA 360 bp ds-DNA INV 04-AUG-1990 DEFINITION D.melanogaster dispersed middle repetitive DNA sequence. ACCESSION M16263 KEYWORDS dispersed middle repetitive DNA sequence. SOURCE D.melanogaster DNA, clone p112-0.4RA. ORGANISM Drosophila melanogaster Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta; Pterygota; Neoptera; Holometabola; Diptera; Brachycera; Cyclorrhapha; Schizophora; Drosophiloidea; Drosophilidae. REFERENCE 1 (bases 1 to 360) AUTHORS Waring,G.L. and Pollack,J.C. TITLE Cloning and characterization of a dispersed, multicopy, X chromosome sequence in Drosophila melanogaster JOURNAL Proc. Natl. Acad. Sci. U.S.A. 84, 2843-2847 (1987) STANDARD simple staff_review BASE COUNT 138 a 56 c 42 g 124 t ORIGIN X chromosome. 1 ccaaatttaa atgcaaatcg atttggattt taataacaaa ctcaacgagg tatgacattc 61 catatttggt ctattatttt caatgttatg atcaaaatac caaatatttt tttcacaaaa 121 aaccaggaaa acgaaatttg gcaaaaaatt taatatttaa gtagagtttt ttgggtataa 181 cttgactaaa aatggtcaca caactaaaat aattaccatt ttgtgctcct aattaccaat 241 actaatcatc cccatcactt tttggcggac tttgaaaaat taatttttgc ccaaattttc 301 gcattttttg taaggggtaa ctcatcaaaa tttttaaaaa atgccaaaaa aataaagttc // LOCUS DRODMRB 354 bp ds-DNA INV 04-AUG-1990 DEFINITION D.melanogaster dispersed middle repetitive DNA sequence. ACCESSION M16264 KEYWORDS dispersed middle repetitive DNA sequence. SOURCE D.melanogaster DNA, clone pX-12R. ORGANISM Drosophila melanogaster Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta; Pterygota; Neoptera; Holometabola; Diptera; Brachycera; Cyclorrhapha; Schizophora; Drosophiloidea; Drosophilidae. REFERENCE 1 (bases 1 to 354) AUTHORS Waring,G.L. and Pollack,J.C. TITLE Cloning and characterization of a dispersed, multicopy, X chromosome sequence in Drosophila melanogaster JOURNAL Proc. Natl. Acad. Sci. U.S.A. 84, 2843-2847 (1987) STANDARD simple staff_review BASE COUNT 130 a 57 c 47 g 120 t ORIGIN X chromosome. 1 ccaaatttga atgcaaatcg atttggattt aaaaaacaaa ctcaacgaag atgacattcc 61 atatttgggc cattattttc aatgttttga tcaaaatacc gattattttc ttctcaaaaa 121 attagggaaa cgagttccgg ccaaaaatgc atatttaaga ttggtttttt gggtatactt 181 ggctaaaatg gtcagacagc aaaaaaatta ccatttattc tcctataaca aatactaacc 241 acccctttca cttttaaacg gattttgtaa aattaatttt tggccaaatt ttcgcatttt 301 ttttaagggg taccatcata aaaatttgca aaaaattgaa aaaatctaga attc // LOCUS DRODMRC 352 bp ds-DNA INV 04-AUG-1990 DEFINITION D.melanogaster dispersed middle repetitive DNA sequence. ACCESSION M16265 KEYWORDS dispersed middle repetitive DNA sequence. SOURCE D.melanogaster DNA, clone pX-30 P-0.4R. ORGANISM Drosophila melanogaster Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta; Pterygota; Neoptera; Holometabola; Diptera; Brachycera; Cyclorrhapha; Schizophora; Drosophiloidea; Drosophilidae. REFERENCE 1 (bases 1 to 352) AUTHORS Waring,G.L. and Pollack,J.C. TITLE Cloning and characterization of a dispersed, multicopy, X chromosome sequence in Drosophila melanogaster JOURNAL Proc. Natl. Acad. Sci. U.S.A. 84, 2843-2847 (1987) STANDARD simple staff_review BASE COUNT 128 a 64 c 42 g 118 t ORIGIN X chromosome. 1 ccaaacttaa atgcaaatcg attgggattc aaaaaacaaa ctcaacgagg tatgacattc 61 catatttggg ccattatttc catgttttca tcaaaatacc gaatttttat cacaaaaaat 121 ctgggaaact atttttggca aaaactcaat tttcaagttg gctcttttgg ctataactga 181 ctaaaaatgg tcacagagca aaaattagta ccattttata ctccttataa ccaatactaa 241 ccaccccttt cactttttaa ctgatttagt aaaatttaat tttggccaaa ttttcgcatt 301 ttttgtaagg gtaacatcat aaatttgcaa aaaattgaaa aatcctagat tc // LOCUS DROTMLPA 5546 bp ss-mRNA INV 04-AUG-1990 DEFINITION D.melanogaster serrate (Ser) mRNA, complete cds. ACCESSION M35759 KEYWORDS Ser gene; serrate. SOURCE D.melanogaster pupa, cDNA to mRNA. ORGANISM Drosophila melanogaster Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta; Pterygota; Neoptera; Holometabola; Diptera; Brachycera; Cyclorrhapha; Schizophora; Drosophiloidea; Drosophilidae. REFERENCE 1 (bases 1 to 5546) AUTHORS Fleming,R.J., Scottgale,T.N., Diederich,R.J. and Artavanis-Tsakonas,S. TITLE The gene serrate encodes a putative EGF-like transmembrane protein essential for proper ectodermal development in Drosophila melanogaster JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by R.J.Fleming, 28-JUN-1990. Author address: R.J.Fleming Yale University Dept of Biology KBT 1104 New Haven, CT 06511 email: WHAKRIA@YALEVM FEATURES from to/span description pept 443 4657 serrate protein precursor sigp 443 679 serrate protein signal peptide matp 680 4657 serrate protein mRNA < 1 5546 BASE COUNT 1384 a 1446 c 1504 g 1212 t ORIGIN Chromosome 3 97F. 1 ccgagtcgag cgccgtgctt cgagcggtga tgagcccctt ttctgtcaac gctaaagatc 61 tacaaaacat cagcgcctat caagtggaag tgtcaagtgt gaacaaaaca aaaacgagag 121 aagcacatac taaggtccat ataaataata aataataatt gtgtgtgata acaacattat 181 ccaaacaaaa ccaaacaaaa cgaaggcaaa gtggagaaaa tgatacagca tccagagtac 241 ggccagttat tcagctatcc agagcaagtg tagtgtggca aaatagaaac aaacaaaggc 301 accaaaatct gcatacatgg gctaattaag gctgcccagc gaatttacat ttgtgtggtg 361 ccaatccaga gtgaatccga aacaaactcc atctagatcg ccaaccagca tcacgctcgc 421 aaacgccccc agaatgtaca aaatgtttag gaaacatttt cggcgaaaac cagctacgtc 481 gtcgtcgttg gagtcaacaa tagaatcagc agacagcctg ggaatgtcca agaagacggc 541 gacaaaaagg cagcgtccga ggcatcgggt acccaaaatc gcgaccctgc catcgacgat 601 ccgcgattgt cgatcattaa agtctgcctg caacttaatt gctttaattt taatactgtt 661 agtccataag atatccgcag ctggtaactt cgagctggaa atattagaaa tctcaaatac 721 caacagccat ctactcaacg gctattgctg cggcatgcca gcggaactta gggccaccaa 781 gacgataggc tgctcgccat gcacgacggc attccggctg tgcctgaagg agtaccagac 841 cacggagcag ggtgccagca tatccacggg ctgttcgttt ggcaacgcca ccaccaagat 901 actgggtggc tccagctttg tgctcagcga tccgggtgtg ggagccattg tgctgccctt 961 tacgtttcgt tggacgaagt cgtttacgct gatactgcag gcgttggata tgtacaacac 1021 atcctatcca gatgcggaga ggttaattga ggaaacatca tactcgggcg tgatactgcc 1081 gtcgccggag tggaagacgc tggaccacat cgggcggaac gcgcggatca cctaccgtgt 1141 ccgggtgcaa tgcgccgtta cctactacaa cacgacctgc acgaccttct gccgtccgcg 1201 ggacgatcag ttcggtcact acgcctgcgg ctccgagggt cagaagctct gcctgaatgg 1261 ctggcagggc gtcaactgcg aggaggccat atgcaaggcg ggctgcgacc ccgtccacgg 1321 caagtgcgat cgtccggggg aatgcgaatg cagacccggc tggcgtggtc cattgtgcaa 1381 cgagtgcatg gtctatcccg gctgcaagca tggttcctgc aacggcagcg cctggaaatg 1441 cgtgtgcgac accaactggg gtggcatatt gtgcgatcaa gatttaaatt tctgcggcac 1501 ccatgaaccc tgcaagcacg gcggcacctg cgaaaatacc gctccggaca aatatcggtg 1561 cacatgcgcc gagggcctct cgggcgagca gtgcgagatc gtggagcacc catgtgccac 1621 caggccatgc cgcaacggcg gcacatgcac actcaagacg agtaaccgaa ctcaagccca 1681 agtgtatcgc acatcacatg gcaggagcaa catgggccgg ccggtaagac gcagcagttc 1741 gatgcgcagc ctggatcacc tgcggccgga ggggcaggcg ctgaatggca gcagctcctc 1801 gggattggtg tccctaggtt cgctgcagct gcagcagcaa ctggcccccg acttcacttg 1861 cgactgcgca gccggatgga cgggaccgac atgcgaaata aatatcgacg agtgcgccgg 1921 gggtccctgc gagcatggtg gcacttgcat cgatctaatc ggtggctttc gatgtgaatg 1981 tccgccggag tggcatggcg atgtctgtca ggtggatgtg aacgagtgcg aggcgccgca 2041 ttccgccgga atcgctgcga acgcattgct gaccaccaca gccaccgcga ttattggtag 2101 taatctgagc agtactgctc ttctggccgc tctgaccagt gcagtggcat ccacatcctt 2161 ggccatcgga ccctgcatca atgccaagga gtgtcgcaat cagccgggtt cctttgcctg 2221 catctgcaag gagggctggg gcggagtgac ctgtgccgag aatctagatg actgtgtggg 2281 tcagtgccgg aatggagcca cctgcattga tctggtcaac gactataggt gcgcctgtgc 2341 ctctggattc acgggtcgcg attgcgagac ggacatagac gagtgcgcca cttccccgtg 2401 ccgaaacgga ggcgaatgtg tggacatggt gggcaaattc aattgcattt gcccacttgg 2461 ctactcgggt tctctgtgcg aggaggccaa ggagaactgc acaccgtcgc catgtttgga 2521 gggtcactgc ctcaacacgc ccgaaggata ctactgccat tgtccaccgg atcgcgccgg 2581 aaagcactgc gagcaactgc gtccgctctg ctcccagccg ccctgcaacg agggctgctt 2641 cgccaatgtc agcctagcga cgtcagcgac aacgacgacg acaaccacca caacggcgac 2701 aacgacaagg aagatggcca agccaagcgg attgccctgc agcggacacg gcagctgcga 2761 gatgagcgac gtgggcacct tctgcaaatg ccatgtgggc cacaccggca ccttctgcga 2821 gcacaatctc aacgaatgct cgccgaatcc ttgtcgaaat gggggaattt gccttgacgg 2881 cgacggcgat tttacatgcg agtgcatgtc gggctggaca ggtaaacgct gctcggagcg 2941 cgctacaggt tgttatgccg gtcagtgcca gaatggtggt acctgcatgc ctggagcccc 3001 ggacaaggct ctgcagccgc attgccgctg tgcgccaggt tggactggtc tgttttgcgc 3061 cgaggctatt gaccagtgtc gcgggcagcc gtgccacaat ggcggaacgt gcgagtcggg 3121 agcgggctgg ttccgctgcg tctgcgctca gggattctct ggtccagact gccgcatcaa 3181 tgtgaacgag tgctcgccac agccttgcca gggcggtgcc acctgcatcg acggaatcgg 3241 tggatacagc tgcatctgcc caccaggaag gcatggattg cggtgtgaaa ttttgctctc 3301 cgatcccaag tccgcctgcc agaacgcaag caacactatc tctccgtata cagctctaaa 3361 ccgaagccaa aactggctgg atattgctct aaccggaaga acagaagacg atgagaactg 3421 caatgcgtgt gtctgcgaaa acggcacctc tcggtgcacg aatctctggt gtggattgcc 3481 caattgctat aaggtggatc cgctctccaa gtcctcgaat ctgtccggtg tttgcaaaca 3541 gcacgaggtg tgcgttccgg cactgagtga gacatgcctg tcatcgcctt gtaatgttcg 3601 tggagattgc cgggcactgg aaccatcgcg tcgggttgct ccaccccgac tgccagccaa 3661 atctagctgc tggcccaatc aggccgtggt caacgagaac tgcgcccgac tcaccatcct 3721 tttggccctg gagcgagtgg gcaagggagc ttcggtggag ggtctctgct ccctggtaag 3781 ggtgctgctg gctgcccagt tgatcaagaa gccggcgagt acttttggcc aggatccggg 3841 aatgcttatg gtgctctgcg atctcaaaac gggcaccaat gataccgttg aactaactgt 3901 gtcgtccagt aaattaaatg atccccagct gccagtggcg gtgggtctgc tgggtgaact 3961 cctgagctcc aggcagttga atggcatcca gcggcgcaag gaactggagc tgcagcatgc 4021 aaaattggct gccctcacct ccattgtgga ggtcaagttg gaaacggccc gcgtggccga 4081 tggatcgggt catagtctgc tgataggagt gctatgcggt gtctttatag tcctggtggg 4141 attctcggtg ttcatcagtc tttactggaa acagcgtctg gcttatcgca ccagttcggg 4201 aatgaactta actccctccc tggatgcact gcgtcacgag gaggagaagt cgaataatct 4261 gcagaacgag gagaatctgc gaaggtatac aaatccgctg aagggcagca ccagttccct 4321 aagagcggcc accggcatgg aactaagcct caatcccgct ccggaattag ccgcctcggc 4381 ggcgagtagt tccgccttgc acagatcgca gccactattc ccgccatgcg atttcgagcg 4441 tgagctggac tccagtacgg gcctgaagca ggcgcacaag cggagctcac agattctgct 4501 gcacaaaacc caaaactcgg acatgcggaa gaacactgtg ggctcgctgg acagtccgcg 4561 taaggacttt ggcaagcggt cgatcaactg caagtccatg ccaccctctt cgggcgacga 4621 gggctccgat gtccttgcca ccactgtgat ggtttagccg tgatctcacc aaccaaccaa 4681 tcaagaaacc aaccagccgc ccacagccag ctcaaagttc caattgccac agcacgggcg 4741 ctatttccaa gtgcattagt agcgtaatta aaactaggat attgttaagg ataccaaggt 4801 aggccacaac ggagtggctc tgttgaaaac gtaaagttct aaaaatccag gtctctcaga 4861 caaagatgag gtacacaaat aaattggcta gttaatcaag catgttatgg ccacgggatg 4921 ggcaaattta tttgtatacc tgatcttatc ttaatactaa accagttttc tactattttt 4981 tttttgtgga tcaagcttaa aagttcagct aggcaggcgt tttccgcagt gccatgtcga 5041 tgtggaagcc caaaatattt aggttagata gtgtaatttc gaactcttct cttcgctaag 5101 caacatccta cacagtgtga tatttagtgt aacccaggcg cgcatttaca ttcaattaaa 5161 gacaatgata tataaatata aacgaaatca actccttggc tagcacaagc tgtatgtata 5221 tagttctcat ttaggatcgt cgcgctctat attgtgtata agctgtaaat actgtaaatt 5281 agcagttacc gttattgtat tttgtctata gttagattgg tactattaaa ctaagaacca 5341 gccgcaacgc gttagacttt aaaagttgtt tgcaattgta cgcaataata tagttttatg 5401 ctcgtagtta ggtagctgtg taaccgggta agattcaaac gattttgtac tgtattatat 5461 acctatctgt gtagtaatat ttatttatta tattaaattt gatctagacg caataaagta 5521 atatcaataa agatagtaaa agacat // LOCUS ECOLRRPA 495 bp ds-DNA BCT 04-AUG-1990 DEFINITION E.coli leucine-responsive-regulatory protein (Lrp) gene, complete cds. ACCESSION M35869 KEYWORDS Lrp gene; leucine-responsive-regulatory protein. SOURCE E.coli (strain K12) DNA. ORGANISM Escherichia coli Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Facultatively anaerobic rods; Enterobacteriaceae. REFERENCE 1 (bases 1 to 495) AUTHORS Willins,D.A., Platko,J.V., Ryan,C. and Calvo,J.M. TITLE Lrp, a small, basic global regulatory protein of E.coli JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by J.M.Calvo, 27-JUN-1990. Author address: J.M.Calvo Cornell University Dept of Biochemistry Room 451 Biotechnology Bldg. Ithaca, NY 14853 email:DEKY@CRNLVAX5 FEATURES from to/span description pept 1 495 leucine-responsive-regulatory protein (Lrp) BASE COUNT 127 a 110 c 128 g 130 t ORIGIN 20 min. on E.coli map. 1 atggtagata gcaagaagcg ccctggcaaa gatctcgacc gtatcgatcg taacattctt 61 aatgagttgc aaaaggatgg gcgtatttct aacgtcgagc tttctaaacg tgtgggactt 121 tccccaacgc cgtgccttga gcgtgtgcgt cggctggaaa gacaagggtt tattcagggc 181 tatacggcgc tgcttaaccc ccattatctg gatgcatcac ttctggtatt cgttgagatt 241 actctgaatc gtggcgcacc ggatgtgttt gaacaattca ataccgctgt acaaaaactt 301 gaagaaattc aggagtgtca tttagtatcc ggtgatttcg actacctgtt gaaaacacgc 361 gtgccggata tgtcagccta ccgtaagttg ctgggggaaa ccctgctgcg tctgcctggc 421 gtcaatgaca cacggacata cgtcgttatg gaagaagtca agcagagtaa tcgtctggtt 481 attaagacgc gctaa // LOCUS HUMA1AT 143 bp ss-mRNA PRI 04-AUG-1990 DEFINITION Human alpha-1-antitrypsin (alpha-1-AT) mRNA, 3' end. ACCESSION M26123 KEYWORDS alpha-1-antitrypsin. SOURCE Human adult liver, cDNA to mRNA, clone PJN1. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 143) AUTHORS Coutelle,C., Speer,A., Rogers,J., Kalsheker,N., Humphries,S. and Williamson,R. TITLE Construction and partial characterization of a human liver cDNA library JOURNAL Biomed. Biochim. Acta 44, 421-431 (1985) STANDARD simple staff_review FEATURES from to/span description pept < 1 101 alpha-1-antitrypsin (alpha-1-AT; AA at 3) /hgml_locus_uid="LX0081X" /nomgen="PI" /map="14q32.1" BASE COUNT 37 a 53 c 19 g 34 t ORIGIN 1 ccgaggtcaa gttcaacaaa ccctttgtct tcttaatgat tgaccaaaat accaagtctc 61 ccctcttcat gggaaaagtg gtgaatccca cccaaaaata actgcctctc gctcctcaac 121 ccctcccctc catccctggc ccc // LOCUS HUMGAPDHG 5378 bp ds-DNA PRI 04-AUG-1990 DEFINITION Human glyceraldehyde-3-phosphate dehydrogenase (GAPDH) gene, complete cds. ACCESSION J04038 KEYWORDS glyceraldehyde-3-phosphate dehydrogenase. SOURCE Human DNA. REFERENCE 1 (bases 1 to 5378) AUTHORS Ercolani,L., Florence,B., Denaro,M. and Alexander,M. TITLE Isolation and complete sequence of a functional human glyceraldehyde-3-phosphate dehydrogenase gene JOURNAL J. Biol. Chem. 263, 15335-15341 (1988) STANDARD simple staff_entry REFERENCE 2 (bases 650 to 1157) AUTHORS Nasrin,N., Ercolani,L., Denaro,M., Kong,X.F., Kang,I. and Alexander,M. TITLE An insulin response element in the glyceraldehyde-3-phosphate dehydrogenase gene binds a nuclear protein induced by insulin in cultured cells and by nutritional manipulations in vivo JOURNAL Proc. Natl. Acad. Sci. U.S.A. 87, 5273-5277 (1990) STANDARD simple staff_entry COMMENT FEATURES from to/span description pept 1453 1481 glyceraldehyde-3-phosphate dehydrogenase, exon 2 (first expressed exon) (EC 1.2.2.12) 3116 3215 glyceraldehyde-3-phosphate dehydrogenase, exon 3 3306 3412 glyceraldehyde-3-phosphate dehydrogenase, exon 4 3542 3632 glyceraldehyde-3-phosphate dehydrogenase, exon 5 3723 3838 glyceraldehyde-3-phosphate dehydrogenase, exon 6 3931 4012 glyceraldehyde-3-phosphate dehydrogenase, exon 7 4206 4618 glyceraldehyde-3-phosphate dehydrogenase, exon 8 4723 4792 glyceraldehyde-3-phosphate dehydrogenase, exon 9 pre-msg 1138 4993 GAPDH mRNA and introns IVS 1190 1429 GAPDH intron A IVS 1482 3115 GAPDH intron B IVS 3216 3305 GAPDH intron C IVS 3413 3541 GAPDH intron D IVS 3633 3722 GAPDH intron E IVS 3839 3930 GAPDH intron F IVS 4013 4205 GAPDH intron G IVS 4619 4722 GAPDH intron H site 650 729 insulin response element A [2] site 729 867 insulin response element B [2] site 1049 1071 insulin response element A [2] signal 884 892 CAAT box signal 1108 1113 TATA box site 1447 1456 Kozak consensus sequence revision 877 879 ccc in [2]; cc in [1] BASE COUNT 994 a 1642 c 1627 g 1115 t ORIGIN 1 ggatcccctg ctgggagggg gcaggggacc tgttcccacc gtgtgcccaa gacctctttt 61 cccacttttt ccctcttctt gactcaccct gccctcaata tcccccggcg cagcagtgaa 121 agggagtccc tggctcctgg ctcgcctgca cgtcccaggg cggggaggga cttccgccct 181 cacgtcccgc tcttcgcccc aggctggatg gaatgaaagg cacactgtct ctctccctag 241 gcagcacagc ccacaggttt caggagtgcc tttgtgggag gcctctgggc ccccaccagc 301 catcctgtcc tccgcctggg gccccagccc ggagagagcc gctggtgcac acagggccgg 361 gattgtctgc cctaattatc aggtccaggc tacagggctg caggacatcg tgaccttccg 421 tgcagaaacc tccccctccc cctcaagccg cctcccgagc ctccttcctc tccaggcccc 481 cagtgcccag tgcccagtgc ccagcccagg cctcggtccc agagatgcca ggagccagga 541 gatggggagg gggaagtggg ggctgggaag gaaccacggg cccccgcccg agcccatggg 601 cccctcctag gcctttgcct gagcagaccg gtgtcactac cgcagagcct cgaggagaag 661 ttccccaact ttcccgcctc tcagcctttg aaagaaagaa aggggagggg gcaggccgcg 721 tgcagccgcg agcggtgctg ggctccggct ccaattcccc atctcagtcg ttcccaaagt 781 cctcctgttt catccaagcg tgtaagggtc cccgtccttg actccctagt gtcctgctgc 841 ccacagtcca gtcctgggaa ccagcaccga tcacctccca tcgggccaat ctcagtccct 901 tccccctacg tcggggccca cacgctcggt gcgtgcccag ttgaaccagg cggctgcgga 961 aaaaaaaaag cggggagaaa gtagggcccg gctactagcg gttttacggg cgcacgtagc 1021 tcaggcctca agaccttggg ctgggactgg ctgagcctgg cgggaggcgg ggtccgagtc 1081 accgcctgcc gccgcgcccc cggtttctat aaattgagcc cgcagcctcc cgcttcgctc 1141 tctgctcctc ctgttcgaca gtcagccgca tcttcttttg cgtcgccagg tgaagacggg 1201 cggagagaaa cccgggaggc tagggacggc ctgaaggcgg caggggcggg cgcaggccgg 1261 atgtgttcgc gccgctgcgg ggtgggcccg ggcggcctcc gcattgcagg ggcgggcgga 1321 ggacgtgatg cggcgcgggc tgggcatgga ggcctggtgg gggaggggag gggaggcgtg 1381 tgtgtcggcc ggggccacta ggcgctcact gttctctccc tccgcgcagc cgagccacat 1441 cgctcagaca ccatggggaa ggtgaaggtc ggagtcaacg ggtgagttcg cgggtggctg 1501 gggggccctg ggctgcgacc gcccccgaac cgcgtctacg agccttgcgg gctccgggtc 1561 tttgcagtcg tatgggggca gggtagctgt tccccgcaag gagagctcaa ggtcagcgct 1621 cggacctggc ggagccccgc acccaggctg tggcgccctg tgcagctccg cccttgcggc 1681 gccatctgcc cggagcctcc ttcccctagt ccccagaaac aggaggtccc tactcccgcc 1741 cgagatcccg acccggaccc ctaggtgggg gacgctttct ttcctttcgc gctctgcggg 1801 gtcacgtgtc gcagaggagc ccctccccca cggcctccgg caccgcaggc cccgggatgc 1861 tagtgcgcag cgggtgcatc cctgtccgga tgctgcgcct gcggtagagc ggccgccatg 1921 ttgcaaccgg gaaggaaatg aatgggcagc cgttaggaaa gcctgccggt gactaaccct 1981 gcgctcctgc ctcgatgggt ggagtcgcgt gtggcgggga agtcaggtgg agcgaggcta 2041 gctggcccga tttctcctcc gggtgatgct tttcctagat tattctctgg taaatcaaag 2101 aagtgggttt atggaggtcc tcttgtgtcc cctccccgca gaggtgtggt ggctgtggca 2161 tggtgccaag ccgggagaag ctgagtcatg ggtagttgga aaaggacatt tccaccgcaa 2221 aatggcccct ctggtggtgg ccccttcctg cagcggctca cctcacggcc ccgcccttcc 2281 cctgccagcc tagcgttgac ccgaccccaa aggccaggct gtaaatgtca ccgggaggat 2341 tgggtgtctg ggcgcctcgg ggaacctgcc cttctcccca ttccgtcttc cggaaaccag 2401 atctccaccg caccctggtc tgaggtctga ggttaaatat agctgctgac ctttctgtag 2461 ctgggggcct gggctggggc tctctcccat cccttctccc cacacacatg cacttacctg 2521 tgctcccact cctgatttct ggaaaagagc taggaaggac aggcaacttg gcaaatcaaa 2581 gccctgggac tagggggtta aaatacagct tcccctcttc ccacccgccc cagtctctgt 2641 cccttttgta ggagggactt agagaagggg tgggcttgcc ctgtccagtt aatttctgac 2701 ctttactcct gccctttgag tttgatgatg ctgagtgtac aagcgttttc tccctaaagg 2761 gtgcagctga gctaggcagc agcaagcatt cctggggtgg catagtgggg tggtgaatac 2821 catgtacaaa gcttgtgccc agactgtggg tggcagtgcc cacatggccg cttctcctgg 2881 aagggcttcg tatgactggg ggtgttgggc agccctggag ccttcagttg cagccatgcc 2941 ttaagccagg ccagcctggc agggaagctc aagggagata aaattcaacc tcttgggccc 3001 tcctgggggt aaggagatgc tgcattcgcc ctcttaatgg ggaggtggcc tagggctgct 3061 cacatattct ggaggagcct cccctcctca tgccttcttg cctcttgtct cttagatttg 3121 gtcgtattgg gcgcctggtc accagggctg cttttaactc tggtaaagtg gatattgttg 3181 ccatcaatga ccccttcatt gacctcaact acatggtgag tgctacatgg tgagccccaa 3241 agctggtgtg ggaggagcca cctggctgat gggcagcccc ttcataccct cacgtattcc 3301 cccaggttta catgttccaa tatgattcca cccatggcaa attccatggc accgtcaagg 3361 ctgagaacgg gaagcttgtc atcaatggaa atcccatcac catcttccag gagtgagtgg 3421 aagacagaat ggaagaaatg tgctttgggg aggcaactag gatggtgtgg ctcccttggg 3481 tatatggtaa ccttgtgtcc ctcaatatgg tcctgtcccc atctcccccc caccccggta 3541 ggcgagatcc ctccaaaatc aagtggggcg atgctggcgc tgagtacgtc gtggagtcca 3601 ctggcgtctt caccaccatg gagaaggctg gggtgagtgc aggagggccc gcgggagggg 3661 aagctgactc agccctgcaa aggcaggacc cgggttcata actgtctgct tctctgctgt 3721 aggctcattt gcagggggga gccaaaaggg tcatcatctc tgccccctct gctgatgccc 3781 ccatgttcgt catgggtgtg aaccatgaga agtatgacaa cagcctcaag atcatcaggt 3841 gaggaaggca gggcccgtgg agaagcggcc agcctggcac cctatggaca cgctcccctg 3901 acttgcgccc cgctccctct ttctttgcag caatgcctcc tgcaccacca actgcttagc 3961 acccctggcc aaggtcatcc atgacaactt tggtatcgtg gaaggactca tggtatgaga 4021 gctggggaat gggactgagg ctcccacctt tctcatccaa gactggctcc tccctgctgg 4081 ggctgcgtgc aaccctgggg ttgggggttc tggggactgg ctttcccata atttcctttc 4141 aaggtgggga gggaggtaga ggggtgatgt ggggagtacg ctgcagggcc tcactccttt 4201 tgcagaccac agtccatgcc atcactgcca cccagaagac tgtggatggc ccctccggga 4261 aactgtggcg tgatggccgc ggggctctcc agaacatcat ccctgcctct actggcgctg 4321 ccaaggctgt gggcaaggtc atccctgagc tgaacgggaa gctcactggc atggccttcc 4381 gtgtccccac tgccaacgtg tcagtggtgg acctgacctg ccgtctagaa aaacctgcca 4441 aatatgatga catcaagaag gtggtgaagc aggcgtcgga gggccccctc aagggcatcc 4501 tgggctacac tgagcaccag gtggtctcct ctgacttcaa cagcgacacc cactcctcca 4561 cctttgacgc tggggctggc attgccctca acgaccactt tgtcaagctc atttcctggt 4621 atgtggctgg ggccagagac tggctcttaa aaagtgcagg gtctggcgcc ctctggtggc 4681 tggctcagaa aaagggccct gacaactctt ttcatcttct aggtatgaca acgaatttgg 4741 ctacagcaac agggtggtgg acctcatggc ccacatggcc tccaaggagt aagacccctg 4801 gaccaccagc cccagcaaga gcacaagagg aagagagaga ccctcactgc tggggagtcc 4861 ctgccacact cagtccccca ccacactgaa tctcccctcc tcacagttgc catgtagacc 4921 ccttgaagag gggaggggcc tagggagccg caccttgtca tgtaccatca ataaagtacc 4981 ctgtgctcaa ccagttactt gtcctgtctt attctagggt ctggggcaga ggggagggaa 5041 gctgggcttg tgtcaaggtg agacattctt gctggggagg gacctggtat gttctcctca 5101 gactgagggt agggcctcca aacagccttg cttgcttcga gaaccatttg cttcccgctc 5161 agacgtcttg agtgctacag gaagctggca ccactacttc agagaacaag gccttttcct 5221 ctcctcgctc cagtcctagg ctatctgctg ttggccaaac atggaagaag ctattctgtg 5281 ggcagcccca gggaggctga caggtggagg aagtcagggc tcgcactggg ctctgacgct 5341 gactggttag tggagctcag cctggagctg agctgcag // LOCUS HUMVARNA 156 bp ss-mRNA PRI 04-AUG-1990 DEFINITION Human integrated adenovirus 2 VA RNA. ACCESSION M35961 KEYWORDS . SOURCE Human adenovirus 2 infected KB cell line, cDNA to mRNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 156) AUTHORS Ohe,K. and Weissman,S.M. TITLE Nucleotide sequence of an RNA from cells infected with adenovirus 2 JOURNAL Science 167, 879-881 (1969) STANDARD simple staff_review BASE COUNT 26 a 46 c 53 g 31 t ORIGIN 1 gggcactctt ccgtggtctg gtggataaat tcgcaagggt atcatggcgt ggacgaccgg 61 ggttcgaacc ccggatccgt gatccatgcg gttaccgtcc gccgcccgtg cgtcgaaccc 121 aggtgtgcga cgtcagacaa cgggggagcg ctcctt // LOCUS MUSGFAPA 2511 bp ss-mRNA ROD 04-AUG-1990 DEFINITION Mouse glial fibrillary acidic protein mRNA, 3' end. ACCESSION M25937 KEYWORDS glial fibrillary acidic protein. SOURCE Mouse brain, cDNA to mRNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 2511) AUTHORS Cowan,N.J., Lewis,S.A., Balcarek,J.M., Krek,V. and Shelanski,M. TITLE Structural implications of a cDNA clone encoding mouse glial fibrillary acidic protein JOURNAL Ann. N.Y. Acad. Sci. 455, 575-582 (1985) STANDARD simple staff_review FEATURES from to/span description pept < 1 1212 glial fibrillary acidic protein (AA at 1) BASE COUNT 625 a 690 c 697 g 499 t ORIGIN 1 ctgggtacca tgccacgttt ctccttgtct cgaatgactc ctccactccc tgccagggtg 61 gacttctccc tggccggggc gctcaatgct ggcttcaagg agacacgggc gagcgagcgt 121 gcagagatga tggagctcaa tgaccgcttt gctagctaca tcgagaaggt ccgcttcctg 181 gaacagcaaa acaaggcgct ggcagctgaa ctgaaccagc ttcgagccaa ggagcccacc 241 aaactggctg atgtctacca ggcggagctt cgggagctgc ggctgcggct ggaccagctt 301 acggccaaca gtgcccggct ggaggtggag agggacaact ttgcacagga cctcggcacc 361 ctgaggcaga agctccaaga tgaaaccaac ctgaggctgg aggcagagaa caacctggct 421 gcgtatagac aggaggcaca tgaagccacc ctggctcgtg tggatttgga gagaaaggtt 481 gaatcgctgg aggaggagat ccagttctta aggaagatct atgaggagga agttcgagat 541 ctccgggagc agctggccca acagcaggtc cacgtggaga tggatgtggc caagccagac 601 ctcacagcgg ccctgagaga gattcgcact caatacgagg cagtggccac cagtaacatg 661 caagagacag aggagtggta tcggtctaag tttgcagacc tcacagacgc tgcgtcccgc 721 aacgcagagc tcctccgcca agccaaacac gaagctaacg actatcgccg ccaactgcag 781 gccttgacct gcgatctgga gtccctccgc ggcacgaacg agtccctaga gcggcaaatg 841 cgcgaacagg aagagcgcca tgcgcgggag tcggccagtt accaggaggc acttgctcgg 901 ctggaggagg agggccaaag cctcaaggag gagatggccc gccacctgca ggagtaccag 961 gatctactca acgttaagct agccctggac atcgagatcg ccacctacag gaaattgctg 1021 gagggcgaag aaaaccgcat caccattcct gtacagactt tctccaacct ccagatccga 1081 gaaaccagcc tggacaccaa atccgtgtca gaaggccacc tcaagaggaa catcgtggta 1141 aagactgtgg agatgcggga tggtgaggtc attaaggact cgaagcagga gcacaaggac 1201 gtggtgatgt gaggtgtgcc acctggtggc ccttgccatg cagtgtgagg gcccaaagct 1261 tagcctcaaa taggcctgtt tgccaggctc agttcccacc cacaccagca cttcccttcc 1321 ttccctggtt ttctgcctgt gtgctgccca aggcgtcaat caggtcataa gcttcataga 1381 tggcatagat ggcatatacc cttcaccttc aactaacagg atactcaccc caaaggcgca 1441 gtcaggaggg gagggaaccc cagctggtta gaattggaag ggaagaggaa agatgagcag 1501 agtagagaga tttaacaaat cacttccttc atccttgttg ttatggaaac cgttgccaga 1561 gctggaagtt tccacaggct gctggagcta gacaacaatt cagacagaaa gggaaagtcc 1621 ctgaggcaaa gtctctctag ccagagacct atgcatcccg aatgccacta aggcagtcct 1681 gaagggccct ccaggtgatg actccagtgt gtcagcccca ctgagcagct atgcaggttg 1741 actgcccaca ggcatgtgaa acttggttct cagcacttgg cagggatcta tggcataagt 1801 ggagagggaa ggtgtactgg acggcggaga ggagggctcc ctggccccta agtgtggatg 1861 cagagaggtg gagcccagga agggtctctg cttaggctgc agggtaccga atggcagagg 1921 cactggtaga gatcatttgg acactcggag ttgaaagtta caggcaatct gttacacttg 1981 gctctgaatc ctatcaatca aggaaataac ccgttctctg gaagacactg aaacaggaga 2041 gcaggacttc cgtccactgg gcagggtaca gatgtgtctc agttgtgaag gtctattcct 2101 ggctgcacag tccccatccg ctcaggtcat cttacccctg tgactgctct cagccctgga 2161 agaatccaca accatccttc caaggttgtc catccccaca atgactagct gttgctctcc 2221 aagctaaggg accattccct gctctatgca tatacgtaat gtcacctatt tagtatcatc 2281 ctatttgaga gtttgaggaa ctgaaacact gttctgttca agcacctggt gctatgcctt 2341 catattagag caccttctct gaggctgatt ggtgggcagg tagggaagac attgagcaga 2401 cagtgtccgc tcagttgtcc ttccctccct tccaaggtcc ctccctcttt ccaggacatc 2461 gcccccccac cccacccctc ctttccacct ccgctaacct ccagacgcac t // LOCUS MUSHOX61A 318 bp ss-mRNA ROD 04-AUG-1990 DEFINITION Mouse homeobox-6.1 protein (Hox-6.1) mRNA, 3' end. ACCESSION M35986 KEYWORDS Hox-6.1 gene; homeobox-6.1 protein. SOURCE Mouse male kidney, cDNA to mRNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 318) AUTHORS Sharpe,P.T., Miller,J.R., Evans,E.P., Burtenshaw,M.D. and Gaunt,S.J. TITLE Isolation and expression of a new mouse homeobox gene JOURNAL Development 102, 397-407 (1988) STANDARD simple staff_review FEATURES from to/span description pept < 1 318 homeobox-6.1 protein (AA at 1) site 30 210 homeobox BASE COUNT 94 a 80 c 100 g 44 t ORIGIN Chromosome 14E2. 1 aatcacagtg gggtcggtta cggagcggac cggaggcgcg gccgccagat ctactctcgg 61 taccagaccc tggaactgga gaaggaattt cacttcaacc gctacctaac tcggcgccgg 121 cgcatcgaga tcgccaatgc tctgtgcctg accgagcgac agatcaaaat ctggttccag 181 aacgcgcgca tgaagtggaa aaaagaatct aatctcacgt ccacactctc agggggtggc 241 ggaggggcaa ccgccgacag cctgggagga aaggaggaaa agcgagaaga gacagaagaa 301 gagaagcaga aagagtga // LOCUS PEAVICIL 190 bp ds-DNA PLN 04-AUG-1990 DEFINITION P.sativum vicilin mRNA, partial cds. ACCESSION M35978 KEYWORDS vicilin. SOURCE P.sativum, cDNA to mRNA, clone pRC 2.2.1. ORGANISM Pisum sativum Eukaryota; Plantae; Embryobionta; Magnoliophyta; Magnoliopsida; Rosidae; Rosales; Fabaceaea. REFERENCE 1 (bases 1 to 190) AUTHORS Gatehouse,J.A., Lycett,G.W., Croy,R.R.D. and Boulter,D. TITLE The post-translational proteolysis of the subunits of vicilin from pea (Pisum sativum L.) JOURNAL Biochem. J. 207, 629-632 (1982) STANDARD simple staff_review FEATURES from to/span description pept < 1 > 190 vicilin (AA at 2) BASE COUNT 74 a 30 c 46 g 40 t ORIGIN 1 agagatagag aagattctct tagaagagca tgagaaagag acacatcaca gaagaagcct 61 tagggataag agacaacaga gccaagaaaa gaatgtaata gtcaaagtat caaagcaagt 121 gcaaagttac aaagctaagt tgactccagg tgatgttttt gtaattccgg caggtcatcc 181 tgttgccgta // LOCUS PIGINVOLA 1204 bp ds-DNA MAM 04-AUG-1990 DEFINITION Pig involucrin gene, complete cds. ACCESSION M34441 KEYWORDS involucrin. SOURCE Pig (Yorkshire) adult skin keratinocyte DNA. ORGANISM Sus scrofa Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Artiodactyla; Suiformes; Suidae. REFERENCE 1 (bases 1 to 1204) AUTHORS Tseng,H. and Green,H. TITLE The involucrin genes of pig and dog: Comparison of their segments of repeats with those of prosimians and higher primates JOURNAL Mol. Biol. Evol. (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by H.Tseng and H. Green, 23-MAY-1990. FEATURES from to/span description pept 1 1044 involucrin rpt 247 831 repeat segment BASE COUNT 377 a 311 c 363 g 153 t ORIGIN 1 atgtcccagc aatacactct tccagtgacc ctgccccctg ccctcagtca ggagcccctc 61 aagcctgttt ctcctcccgc cgatacccag caggagcaag tgaaacaggc aactccgctg 121 cctgctccat gccagaagat gctctcggag ctcccagtgg aggtcccctt ggaggatgcg 181 gagaaacaca caactcttgt gaagggagtg cctgagcaag agtgtgagcc gcagccacag 241 gagccacagc agcaggaact gcatgtggaa cagcagcagc agcagcaaga atcacaggtg 301 caggaactgc atgtggatca gcagcagcag caacaagagt cacaggagca ggaactgcat 361 gtggatcagc agcagcagca gcaagagtca caggagcagg aactgcatgt ggatcagcag 421 cagcaacaag agtcacaggt gcaggaactg catgtgggtc atcatcagca gcagcaagag 481 tcacaggagc aggaactgca tgtggatcat catcagcagc agcaagagtc acaggagcag 541 gaactgcatg tggatcagca gcagcagcaa caagagtcac aggagcagga actgcatgtg 601 gatcagcagc agcagcagca agagtcacag gagcaggaac tgcatgtgga tcatcatcag 661 cagcagcaag agtcacaggt gcaggaactg catgtggatc atcagcagca gcagcaagag 721 tcacaggagc aggaactgca tgtggatcag caccagcagc agcaagagtc acaggagcag 781 gaactgcatg tggatcagca gcagcaggag ctccaggtgc aggaagtgca gcagcagcag 841 cagcagcagc aagaacagca ggaggaccat cagaaagcag aacacctgga gcaggaggaa 901 gcacaaaggg agcagcagct aaagggacag ctggaacagg agaagaaggg cgtgtaccag 961 cacctggatc aagagctaac aaagagagat gaacacctgg aaaagaaagg ggagcactgc 1021 tggagcagca ggagaagcct ctagagccag cagagcagca ggaggggcag ttgaaacagc 1081 ctgtgcttat tccagctcct ggccaggtcc aggagaccca ccccagtcca actgctgaag 1141 ggagaagttt tgccccctga gaagcaggac ataagtaacc acctctggac ataagtaacc 1201 accc // LOCUS FLANPB 1565 bp ss-RNA VRL 04-AUG-1990 DEFINITION Influenza A/Wilson-Smith/33 (H1N1) nucleoprotein (seg 5) mRNA, complete cds. ACCESSION M30746 KEYWORDS nucleoprotein. SOURCE Influenza A /Wilson-Smith/33 RNA, and cDNA to viral RNA. ORGANISM Influenza virus type A Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses; Orthomyxoviridae; Influenzavirus; Influenza A viruses. REFERENCE 1 (bases 1 to 1565) AUTHORS Gorman,O.T., Bean,W.J., Kawaoka,Y. and Webster,R.G. TITLE Evolution of the nucleoprotein gene of influenza A virus JOURNAL J. Virol. 64, 1487-1497 (1990) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by O.T.Gorman, 14-DEC-1989. FEATURES from to/span description pept 46 1542 nucleoprotein BASE COUNT 507 a 318 c 409 g 331 t ORIGIN 1 agcaaaagca gggtagataa tcactcacag agtgacatcg aaatcatggc gaccaaaggc 61 accaaacgat cttacgaaca gatggagact gatggagaac gccagaatgc cactgaaatc 121 agagcatctg tcggaaaaat gattggtgga attggacgat tctacatcca aatgtgcacc 181 gaacttaaac tcagtgatta tgagggacgg ctgattcaga acagcttaac aatagagaga 241 atggtgctct ctgcttttga cgagaggagg aataaatatc tagaagaaca tcccagtgcg 301 gggaaagatc ctaagaaaac tggaggacct atatacagga gagtagatgg aaagtggatg 361 agagaactca tcctttatga caaagaagaa ataagacgaa tctggcgcca agctaataat 421 ggtgacgatg caacggctgg tctgactcac atgatgatct ggcactccaa tttgaatgat 481 gcaacttacc agaggacaag agctcttgtt cgcacaggaa tggatcccag gatgtgctca 541 ctgatgcagg gttcaaccct ccctaggagg tctggggccg caggtgctgc agtcaaagga 601 gttggaacaa tggtgatgga attgatcaga atgatcaaac gtgggatcaa tgatcggaac 661 ttctggaggg gtgagaatgg acggagaaca aggattgctt atgaaagaat gtgcaacatt 721 ctcaaaggga aatttcaaac agctgcacaa agagcaatgg tggatcaagt gagagagagc 781 cggaatccag gaaatgctga gttcgaagat ctcatctttc tagcacggtc tgcactcata 841 ttgagagggt cagttgctca caagtcctgc ctgcctgcct gtgtgtatgg acctgccgta 901 gccagtggat acgactttga aagagaggga tactctctag tcggaataga ccctttcaga 961 ctgcttcaaa acagccaagt atacagccta atcagaccaa atgagaatcc agcacacaag 1021 agtcaactgg tgtggatggc atgccattct gctgcatttg aagatctaag agtatcaagc 1081 ttcatcagag ggacgaaagt ggtcccaaga gggaagcttt ccactagagg agttcaaatt 1141 gcttccaatg aaaacatgga gactatggaa tcaagtaccc ttgaactgag aagcagatac 1201 tgggccataa ggaccagaag tggagggaac accaatcaac agagggcttc ctcgggccaa 1261 atcagcatac aacctacgtt ctcagtacag agaaatctcc cttttgacag accaaccatt 1321 atggcagcat tcactgggaa tacagagggg agaacatctg acatgagaac cgaaatcata 1381 aggctgatgg aaagtgcaag accagaagat gtgtctttcc aggggcgggg agtcttcgag 1441 ctctcggacg aaaaggcagc gagcccgatc gtgccctcct ttgacatgag taatgaagga 1501 tcttatttct tcggagacaa tgcagaggag tacgacaatt aaagaaaaat acccttgttt 1561 ctact // LOCUS FLANPC 1565 bp ss-RNA VRL 04-AUG-1990 DEFINITION Influenza A/swine/Iowa/15/30 (H1N1) nucleoprotein (seg 5) mRNA, complete cds. ACCESSION M30747 KEYWORDS nucleoprotein. SOURCE Influenza A/swine/Iowa/15/30 RNA, and cDNA to viral RNA. ORGANISM Influenza virus type A Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses; Orthomyxoviridae; Influenzavirus; Influenza A viruses. REFERENCE 1 (bases 1 to 1565) AUTHORS Gorman,O.T., Bean,W.J., Kawaoka,Y. and Webster,R.G. TITLE Evolution of the nucleoprotein gene of influenza A virus JOURNAL J. Virol. 64, 1487-1497 (1990) STANDARD full staff_entry REFERENCE 2 (bases 1 to 1565; revises [1]) AUTHORS Gorman,O.T. JOURNAL Unpublished (1990) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by O.T.Gorman, 14-DEC-1989, and [2] 18-JUL-1990. FEATURES from to/span description pept 46 1542 nucleoprotein BASE COUNT 509 a 318 c 411 g 327 t ORIGIN 1 agcaaaagca gggtagataa tcactcactg agtgacatcg aaatcatggc gtctcaaggc 61 accaaacgat catacgaaca aatggaaact ggtggagaac gccagaatgc cacggaaatc 121 agagcatctg tcggaagaat gattggtgga attggaagat tctacatcca aatgtgcacc 181 gaactcaaac tcagtgatta tgagggacgg ctgattcaga acagcataac aatagagaga 241 atggtgctct ctgcttttga cgagaggagg aacaaatatc tggaagaaca tccaagcgct 301 gggaaagacc ctaagaaaac tggaggaccc atatacagga gaatagacgg aaagtggata 361 agagaactca tcctttatga caaagaagaa ataaggagaa tctggcgcca agccaacaat 421 ggtgaggata caacagccgg tctgactcac atgatgatct ggcattccaa tttgaatgat 481 gccacttatc agaggacaag agctcttgtt cgcaccggaa tggatcccag gatgtgctct 541 ctgatgcagg gttcaactct ccccaggagg tctggagccg caggtgctgc agtcaaagga 601 gttgggacag tggtgatgga gttgatcaga atgatcaaac gtggaatcaa tgatcggaac 661 ttctggaggg gtgaaaatgg acgaaggaca agaattgcct atgaaagaat gtgcaacatt 721 ctcaaaggga aatttcaaac agctgcacag agagcaatga tggatcaagt gagagagagc 781 cgaaacccag gaaatgctga aatcgaagat ctcatctttc tggcacgatc cgcactcata 841 ttacgaggat cagttgcaca caagtcctgt ctgcctgcct gtgtgtatgg acttgctgta 901 gccagtggac atgactttga aagagagggg tactctctgg tcggaataga ccctttcaga 961 ctgcttcaaa acagccaagt attcagccta atcagaccaa atgaaaaccc agcgcacaag 1021 agtcaattag tgtggatggc atgccattct gctgcatttg aagatttaag ggtatcaagc 1081 ttcataagag ggaaaagagt ggttccacga gggcagcttt ccaccagagg ggttcagatt 1141 gcttccaatg agaacatgga gactatggac tctagtactc ttgaactgag aagcagatac 1201 tgggccataa ggaccagaag tggaggaaac actaatcaac agagggcatc tgcgggccaa 1261 atcagcgtgc aacctacatt ctcggtgcag agaaatctcc cttttgagag agcaaccgtt 1321 atggcagcat tcactgggaa cacagaggga agaacatcag acatgagaac cgaaatcata 1381 aggataatgg aaagtgcaag accagaagat gtgtctttcc aggggcgggg agtcttcgag 1441 ctctcggacg aaaaggcaac gagcccgatc gtgccttcct ttgacatgag taacgaagga 1501 tcttatttct tcggagacaa tgcagaggag tatgacaatt aaagaaaaat acccttgttt 1561 ctact // LOCUS FLANPD 1565 bp ss-RNA VRL 04-AUG-1990 DEFINITION Influenza A/swine/Tennessee/24/77 (H1N1) nucleoprotein (seg 5) mRNA, complete cds. ACCESSION M30748 KEYWORDS nucleoprotein. SOURCE Influenza A/swine/Tennessee/24/77 RNA, and cDNA to viral RNA. ORGANISM Influenza virus type A Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses; Orthomyxoviridae; Influenzavirus; Influenza A viruses. REFERENCE 1 (bases 1 to 1565) AUTHORS Gorman,O.T., Bean,W.J., Kawaoka,Y. and Webster,R.G. TITLE Evolution of the nucleoprotein gene of influenza A virus JOURNAL J. Virol. 64, 1487-1497 (1990) STANDARD full staff_entry REFERENCE 2 (bases 1 to 1565; revises [1]) AUTHORS Gorman,O.T. JOURNAL Unpublished (1990) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by O.T.Gorman, 14-DEC-1989, and [2] 18-JUL-1990. FEATURES from to/span description pept 46 1542 nucleoprotein BASE COUNT 514 a 307 c 409 g 335 t ORIGIN 1 agcaaaagca gggtagataa tcactcaatg agtgacatcg aaatcatggc gtctcaaggc 61 accaaacgat catatgaaca aatggagact ggtggggaac gccaggatgc cacagaaatc 121 agagcatctg tcggaagaat gattggtgga atcggaagat tctacatcca aatgtgcact 181 gaactcaaac tcagtgacta tgagggacga ctgattcaaa atagcataac aatagagaga 241 atggtgctct ctgcttttga cgagagaagg aataaatacc tagaagagca tcccagtgct 301 gggaaagatc ctaagaaaac tggaggaccc atatatagaa gagtagacgg aaaatggatg 361 agggaactca tcctttatga caaagaagaa ataaggagag tttggcgcca agccaacaat 421 ggtgaagatg caacagccgg ccttacccat attatgattt ggcactccaa tctgaatgat 481 gccacctatc agagaacaag agctcttgtt cgcactggga tggatcccag aatgtgctcc 541 ctaatgcaag gttcaacact tcccagaagg tctggagccg caggtgctgc agtaaaagga 601 gttggaacaa tagcgatgga gttaatcaga atgatcaaac gtgggatcaa tgaccgaaac 661 ttctggaggg gtgaaaatgg acgaaggaca aggattgcat atgaaagaat gtgcaacatt 721 ctcaaaggga aatttcagac agctgcccag agggcaatga tggatcaagt gagagaaagt 781 cggaacccag gaaacgctga aattgaagat ctcattttcc tggcacggtc agcactcatt 841 ttaagagggt cagttgcaca taagtcttgc ctgcctgctt gtgtgtatgg gcttgcagta 901 gcgagtgggc atgactttga aagagaagga tattctctgg tcggaataga ccccttcaaa 961 ctacttcaaa acagtcaagt attcagcctg atcagaccaa atgaaaaccc agctcacaag 1021 agtcaactgg tgtggatggc atgccactct gccgcatttg aggatttaag agtatcaggc 1081 ttcataagag ggaagaaagt ggttccaaga ggaaagcttt ccacaagagg ggttcagatt 1141 gcttcaaatg agaatgtgga agctatggac tctagtaccc tggaactaag aagcaggtac 1201 tgggccataa ggaccagaag cgggggaaat accaatcaac agaaggcatc cgcaggccag 1261 atcagtgtgc aacctacatt ctcagtacaa aggaatctcc cttttgagag agcgaccgtt 1321 atggcagctt tcattgggaa caatgaggga cgaacatcag atatgcgaac tgaaatcata 1381 aggatgatgg aaagtgcaaa gccagaagat ttgtccttcc aggggcgggg agtcttcgag 1441 ctctcggacg aaaaggcaac gaacccgatc gtgccttcct ttgacatgaa taatgagggg 1501 tcttatttct tcggagacaa tgcagaggag tatgacaatt gaagaaaaat acccttgttt 1561 ctact // LOCUS FLANPE 1565 bp ss-RNA VRL 04-AUG-1990 DEFINITION Influenza A/swine/Netherlands/12/85 (H1N1) nucleoprotein (seg 5) mRNA, complete cds. ACCESSION M30749 KEYWORDS nucleoprotein. SOURCE Influenza A/swine/Netherlands/12/85 RNA, and cDNA to viral RNA. ORGANISM Influenza virus type A Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses; Orthomyxoviridae; Influenzavirus; Influenza A viruses. REFERENCE 1 (bases 1 to 1565) AUTHORS Gorman,O.T., Bean,W.J., Kawaoka,Y. and Webster,R.G. TITLE Evolution of the nucleoprotein gene of influenza A virus JOURNAL J. Virol. 64, 1487-1497 (1990) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by O.T.Gorman, 14-DEC-1989. FEATURES from to/span description pept 46 1542 nucleoprotein BASE COUNT 503 a 307 c 419 g 336 t ORIGIN 1 agcaaaagca gggtagataa tcactcactg agtgacattc acatcatggc gtctcaaggc 61 accaaacgat cttatgagca gatggaaacc ggaggagaac gccagaatgc tactgaaatc 121 agagcatctg ttgggggaat ggttggtgga attggaagat tctacataca gatgtgcact 181 gaactcaaac tcagtgacta tgaagggagg ctgatccaga atagcataac aatagagaga 241 atggttctct ctgcatttga tgagaggagg aacaaatacc tggaagaaca tcccagtgcg 301 gggaaggacc caaagaaaac tggaggtcca atctacaaaa agagagacgg aaaatggatg 361 agagagctga ttctgtatga caaagaggag atcaggagga tttggcgtca agcaaacaat 421 ggtgaagatg ctactgctgg tctcactcat ctgatgattt ggcattccaa cctgaatgat 481 gccacatatc agagaacaag agctctcgtg cgtactggga tggaccccag aatgtgctct 541 ctgatgcaag gatcaactct cccgaggaga tctggagctg caggtgcagc agtaaaggga 601 gttgggacga tggtaatgga actgattcgg atgataaagc gggggatcaa tgatcggaac 661 ttctggagag gcgaaaatgg gcgaagaaca agaattgcat atgagagaat gtgcaacatc 721 ctcaaaggga aatttcagac agcagcgcaa cgagcaatga tggaccaggt gcgagaaagc 781 agaaatcctg ggaatgctga gattgaagac cttatctttc tggcacgatc tgcactcatt 841 ctgagaggat cagtggctca taaatcctgc ctgcctgctt gtgtatatgg acttgttgtg 901 gcaagtggat atgactttga aagagaaggg tactctctag tcggaataga tcctttccgt 961 ctgctccaga acagccaggt gttcagcctc attagaccaa atgagaatcc agcacataag 1021 agtcagctgg tatggatggc atgccattct gcagcgtttg aagatctgag agtgtcaagt 1081 ttcatcagag ggacaaaagt ggtcccaaga ggacaactgt ccaccagagg agttcaaatt 1141 gcttcaaatg aaaacatgga aacaatggac tccagtactc ttgaactgag aagcaaatac 1201 tgggctataa gaaccaggag cggaggaaac accaaccaac agagagcatc tgcagggcaa 1261 atcagtgtac aacctacttt ctcggtacag agaaatcttc ctttcgagaa agcgaccatc 1321 atggcagctt ttacagggaa cactgaaggc agaacatctg acatgaggac tgaaattata 1381 agaatgatgg aaagtgccag accagaagat gtgtccttcc aggggcgggg agtcttcgag 1441 ctctcggacg aaaaggcaac gaacccgatc gtgccttcct ttgacatgag taatgaggga 1501 tcttatttct tcggagacaa tgcagaggag tatgacaatt aaagaaaaat acccttgttt 1561 ctact // LOCUS FLANPF 1565 bp ss-RNA VRL 04-AUG-1990 DEFINITION Influenza A/equine/London/1416/73 (H7N7) nucleoprotein (seg 5) mRNA, complete cds. ACCESSION M30750 KEYWORDS nucleoprotein. SOURCE Influenza A/equine/London/1416/73 RNA, and cDNA to viral RNA. ORGANISM Influenza virus type A Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses; Orthomyxoviridae; Influenzavirus; Influenza A viruses. REFERENCE 1 (bases 1 to 1565) AUTHORS Gorman,O.T., Bean,W.J., Kawaoka,Y. and Webster,R.G. TITLE Evolution of the nucleoprotein gene of influenza A virus JOURNAL J. Virol. 64, 1487-1497 (1990) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by O.T.Gorman, 14-DEC-1989. FEATURES from to/span description pept 46 1542 nucleoprotein BASE COUNT 497 a 322 c 418 g 328 t ORIGIN 1 agcaaaagca gggtagataa tcactcactg agtgacatca aactcatggc gtctcaaggc 61 accaaacgat cttatgagca gatggaaact ggtggggaac gccagaatgc aactgaaatc 121 agagcatctg ttggaaggat ggtgggagga atcggccggt tttatgttca aatgtgtact 181 gagctcaaac tcaacgacca tgaagggcgg ctgattcaga acagcataac aatagagagg 241 atggtgcttt cggcattcga cgaaagaaga aacaagtacc tggaggagca tcccagtgct 301 ggaaaagacc ccaagaaaac gggaggcccg atatacagaa ggagagatgg gaaatggatg 361 agagaactca tcctccatga taaagaagaa atcaggagga tctggcgtca ggccaacaat 421 ggtgaagatg ctactgctgg tctgactcac atgatgatct ggcactccaa tctcaatgac 481 accacctacc aaagaacaag ggctcttgtt cggactggga tggatcccag aatgtgctct 541 ctgatgcaag gatcaactct cccacggaga tctggagctg ccggtgctgc agtgaagggt 601 gttggaacaa tggtaatgga actcatcagg atgatcaaac gcgggataaa tgatcgaaac 661 ttctggagag gtgaaaatgg tcgaagaacc agaattgctt atgaaagaat gtgcaacatc 721 ctcaagggga aattccaaac agcagcacaa cgagcaatga tggaccaagt gagagagggc 781 cgcaatcctg gaaatgctga gattgaggat ctcattttct tggcgcgatc agcactcatt 841 ctgagaggat cagtagccca taaatcatgc ctacctgcct gtgtttatgg ccttgcagta 901 gccagtgggt atgacttcga gaaagaggga tactctctgg ttggaattga tcctttcaaa 961 ctactccaga acagccaaat tttcagtctg atcagaccga aagaaaatcc agcacacaag 1021 agccagctgg tgtggatggc atgccattct gcagcatttg aggacctgag agtttcgaat 1081 ttcattagag gaaccaaagt aatcccaaga ggacagttgg caaccagagg agtgcaaatt 1141 gcttcaaatg aaaacatgga gacaatagat tctagcacac tcgaactgag gagcagatat 1201 tgggcaataa ggaccaggag tggggggaac accagtcaac agagagcatc tgcaggacag 1261 ataagtgtgc aacccacttt ctcagtgcag agaaatcttc cctttgaaag agcaaccatt 1321 atggctgcat tcactggaaa cactgagggg aggacttccg acatgagaac ggaaatcata 1381 aggatgatgg aaaatgccag atcagaagat gtgtctttcc aggggcgggg agtcttcgag 1441 ctctcggacg aaaaggcaac gaacccgatc gtgccttcct ttgacatgag caatgaaggg 1501 tcttatttct tcggagacaa tgctgaggag tttgacagtt aaagaaaaat acccttgttt 1561 ctact // LOCUS FLANPG 1565 bp ss-RNA VRL 04-AUG-1990 DEFINITION Influenza A/equine/Kentucky/2/86 (H3N8) nucleoprotein (seg 5) mRNA, complete cds. ACCESSION M30751 KEYWORDS nucleoprotein. SOURCE Influenza A/equine/Kentucky/2/86 RNA, and cDNA to viral RNA. ORGANISM Influenza virus type A Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses; Orthomyxoviridae; Influenzavirus; Influenza A viruses. REFERENCE 1 (bases 1 to 1565) AUTHORS Gorman,O.T., Bean,W.J., Kawaoka,Y. and Webster,R.G. TITLE Evolution of the nucleoprotein gene of influenza A virus JOURNAL J. Virol. 64, 1487-1497 (1990) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by O.T.Gorman, 14-DEC-1989. FEATURES from to/span description pept 46 1542 nucleoprotein BASE COUNT 502 a 318 c 411 g 334 t ORIGIN 1 agcaaaagca gggtagataa tcactcactg agtgacatca aagtcatggc gtctcaaggc 61 accaaacgat cttatgagca gatggaaact ggtggggaac gccagaatgc aactgaaatc 121 agagcatctg tcggaaggat ggtgggagga atcggccggt tctatgttca gatgtgtact 181 gagcttaaac tcaacgacca tgaagggcgg ctgattcaga acagcataac aatagaaagg 241 atggtacttt cggcattcga cgaaagaaga aacaagtacc tcgaggagca tcccagtgct 301 gggaaagacc ccaagaaaac gggaggcccg atatacagaa ggaaagatgg gaaatggatg 361 agagaactca tcctccatga taaagaagaa atcatgagga tctggcgtca ggccaacaat 421 ggtgaagacg ctactgctgg tcttactcat atgatgatct ggcactccaa tctcaatgac 481 accacatacc aaagaacaag ggctcttgtt cgggctggga tggatcccag aatgtgctct 541 ctgatgcaag gatcaaccct cccacggaga tctggagctg ccggtgctgc agtaaaaggt 601 gttggaacaa tggtaatgga actcatcagg atgatcaaac gcgggataaa tgatcgaaat 661 ttctggagag gtgaaaatgg tcgaagaacc agaattgctt atgaaagaat gtgcaatatc 721 ctcaaaggga aattccaaac agcagcacaa cgggcaatga tggaccaagt gagggagggc 781 cgcaatcctg gaaatgctga gattgaggat ctcattttct tggcacgatc agcactcatt 841 ttgagaggat cagtagccca taaatcatgc ctacctgcct gtgtttatgg ccttgcagta 901 gccagtgggt atgactttga gaaggaagga tactctctgg ttggaattga tcctttcaaa 961 ctactccaga acagccaaat tttcagtcta atcagaccga aagaaaatcc agcacacaag 1021 agccagttgg tgtggatggc atgccattct gcagcatttg aggacctgag agttttgaat 1081 ttcattagag gaaccaaagt aatcccaaga ggacagttag caaccagagg agtgcaaatt 1141 gcttcaaatg aaaacatgga gacaatagat tctagcacac tcgaactgag gagcagatat 1201 tgggcaataa ggaccaggag tggagggaac accagtcaac agagagcatc tgcaggacag 1261 ataagtgtgc aacccacttt ctcagtgcag agaaatcttc cctttgaaag agcaaccatt 1321 atggctgcat tcactgggaa cactgagcgg aggacttccg acatgagaac ggaaatcata 1381 aggatgatgg aaaatgccag atcagaagat gtgtctttcc aggggcgggg agtcttcgag 1441 ctctcggacg aaaaggcaac gaacccgatc gtgccttcct ttgacatgag caatgaaggg 1501 tcttatttct tcggagacaa tgctgaggag tttgacagtt aaagaaaaat acccttgttt 1561 ctact // LOCUS FLANPH 1565 bp ss-RNA VRL 04-AUG-1990 DEFINITION Influenza A/gull/Massachussetts/26/80 (H13N6) nucleoprotein (seg 5) mRNA, complete cds. ACCESSION M30752 KEYWORDS nucleoprotein. SOURCE Influenza A/gull/Massachussetts/26/80 RNA, and cDNA to viral RNA. ORGANISM Influenza virus type A Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses; Orthomyxoviridae; Influenzavirus; Influenza A viruses. REFERENCE 1 (bases 1 to 1565) AUTHORS Gorman,O.T., Bean,W.J., Kawaoka,Y. and Webster,R.G. TITLE Evolution of the nucleoprotein gene of influenza A virus JOURNAL J. Virol. 64, 1487-1497 (1990) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by O.T.Gorman, 14-DEC-1989. FEATURES from to/span description pept 46 1542 nucleoprotein BASE COUNT 492 a 317 c 429 g 327 t ORIGIN 1 agcaaaagca gggtagataa tcactcactg agtgacatcc acatcatggc gtcccaaggc 61 accaaacgat cttatgagca gatggaaact ggtggcgagc gccagaatgc caatgaaatt 121 agggcatctg tcgggaggat ggttggggga atcggaagat tctacataca gatgtgcact 181 gaactcaagc tcagtgacaa tgagggacga ttgatccaaa acagcatcac catagagaga 241 atggtcctat ctgcatttga tgagagaagg aacaagtacc tggaagagca tcccagcacc 301 gggagagacc ccaagaaaac tggaggacca atttacagaa ggagagatgg aaaatgggtg 361 agagaattag ttctgtatga caaagaagaa ctaagaagaa tctggcggca ggcaaacaat 421 ggggaagatg cgactgctgg tctcacccac ttgatgatct ggcattccaa tttgaatgac 481 gccacgtatc agagaactag agcacatgtg cgcacaggaa tggatcccag gatgtgctcc 541 ctcatgcagg gctcaacact ccctagaagg tctggagctg ctggtgcagc agtgaaggga 601 gttgggacaa tggtaatgga actcatcagg atgataaaga gaggagtcaa tgaccgcaat 661 ttctggagag gtgaaaacgg acgaagaaca agaattgcct atgaaagaat gtgcaacatc 721 ctcaaaggga aatttcaaac agcagcacag cgggctatga tggatcaggt gcgggaaagc 781 cggaatcctg gaaatgctga aatagaggac cttatatttc tggctcgatc tgcacttatc 841 ctgaggggag cagtagctca caaatcatgc ctgccggcct gtgtatatgg actggcggtg 901 gcgagtggat atgactttga aagggaggga tattccctcg ttggaataga tcctttccgt 961 ctactccaaa acagccaggt attcagccta atccgaccaa atgagaatcc agcacataag 1021 agtcaattgg tgtggatggc atgccattct gctgcatttg aggatttgag agtgtcaagt 1081 ttcatccggg gaacaagggt gctaccaagg ggacaattgt cgactagggg tgttcaaatt 1141 gcatccaatg agaacatgga aacaatgaat tccagcactc ttgaattgag aagcaaatac 1201 tgggcaataa ggactagaag tggaggaaac accaaccaac aaagagcatc tgcaggacaa 1261 gtcagtgttc aacccacttt ctctgtgcag agaaacctcc cctttgagag ggcgaccatc 1321 atggctgcat tcacaggaaa tccggagggc aggacatctg atatgagaac ggaaatcata 1381 aggatgatgg aaaattcaag gccagaggat gtgtctttcc aggggcgggg agtcttcgag 1441 ctctcagacg aaaaggcaac gaacccgatc gtgccttcct ttgacatgag taatgaggga 1501 tcttatttct tcggagacaa tgctgaggag tatgacaatt gaagaaaaat acccttgttt 1561 ctact // LOCUS FLANPI 1565 bp ss-RNA VRL 04-AUG-1990 DEFINITION Influenza A/gull/Astrakhan/227/84 (H13N6) nucleoprotein (seg 5) mRNA, complete cds. ACCESSION M30753 KEYWORDS nucleoprotein. SOURCE Influenza A/gull/Astrakhan/227/84 RNA, and cDNA to viral RNA. ORGANISM Influenza virus type A Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses; Orthomyxoviridae; Influenzavirus; Influenza A viruses. REFERENCE 1 (bases 1 to 1565) AUTHORS Gorman,O.T., Bean,W.J., Kawaoka,Y. and Webster,R.G. TITLE Evolution of the nucleoprotein gene of influenza A virus JOURNAL J. Virol. 64, 1487-1497 (1990) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by O.T.Gorman, 14-DEC-1989. FEATURES from to/span description pept 46 1542 nucleoprotein BASE COUNT 489 a 323 c 426 g 327 t ORIGIN 1 agcaaaagca gggtagataa tcactcactg agtgacatcc acatcatggc gtctcaaggc 61 accaaacgat cttatgagca gatggaaact ggtggtgaac gccagaatgc cactgagatt 121 cgggcatctg tcggaaggat ggttggagga atcggaagat tctacataca gatgtgcact 181 gaactcaaac taagtgacaa cgaggggagg ctgatccaga acagtatcac aatagaaaga 241 atggtccttt ctgcgtttga tgagagaagg aacaggtact tggaggagca ccccagcgct 301 gggagggacc ccaagaagac tggaggacca atttacagaa ggagagaggg aaaatgggtg 361 agagaattgg tcctgtatga caaggaagaa ataagaagaa tctggcgaca agcaaacaat 421 ggagaagatg caactgccgg ccttacccac ttgatgatct ggcattctaa cttgaatgac 481 gccacatatc agaggactag agcccttgtg cgcactggga tggaccccag gatgtgctcc 541 cttatgcaag gatcgacact tccaagaagg tctggagcgg ctggtgcagc tgtaaaggga 601 gttgggacaa tggtgatgga actcatcaga atgataaaaa gaggggtcaa tgaccgtaac 661 ttctggagag gtgaaaatgg acgaagaaca agaattgcct atgaaagaat gtgcaacatt 721 ctcaaaggaa aattccaaac agctgcacaa cgagctatga tggatcaggt cagagaaagc 781 cgtagtcctg gaaatgctga aatcgaggac ctcatctttc tggccaggtc tgcacttatt 841 ctgaggggag cagtggctca caaatcatgc ctgcctgcct gtgtgcatgg acttgccgta 901 gcaagtggat acgactttga gagggaggga tactcccttg ttgggataga ccctttccgt 961 ctgctccaga acagccaggt attcagtcta atccgaccca atgaaaatcc agcacataag 1021 agtcagttgg tgtggatggc atgccattcc gctgcatttg aggatttgag agtgtcaagc 1081 ttcatcaggg gagcaagagt gttaccaaga gggcaactat ccacaagagg tgttcaaatt 1141 gcatccaatg agaacatgga gactatgagt tccagcactc ttgaattgag aagcaaatac 1201 tgggcaataa ggactagaag cggaggaaac accaaccaac aaagagcatc tgcaggacaa 1261 atcagtgtgc aacccacttt ctctgtgcaa agaaacctcc cctttgagag agcgaccatc 1321 atggctgcat tcacaggaaa tgcagaaggc agaacatctg atatgagaac tgagatcata 1381 cggatgatgg aaaatgcaag accagaggat gtgtctttcc aggggcgggg agtcttcgag 1441 ctctcagacg aaaaggccac gaacccgatc gtgccttcct ttgacatgag taaagaggga 1501 tcttatttct tcggagacaa tgctgaggag tttgacagtt aaagaaaaat acccttgttt 1561 ctact // LOCUS FLANPK 1565 bp ss-RNA VRL 04-AUG-1990 DEFINITION Influenza A/gull/Maryland/1824/78 (H13N9) nucleoprotein (seg 5) mRNA, complete cds. ACCESSION M30755 KEYWORDS nucleoprotein. SOURCE Influenza A/gull/Maryland/1824/78 RNA, and cDNA to viral RNA. ORGANISM Influenza virus type A Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses; Orthomyxoviridae; Influenzavirus; Influenza A viruses. REFERENCE 1 (bases 1 to 1565) AUTHORS Gorman,O.T., Bean,W.J., Kawaoka,Y. and Webster,R.G. TITLE Evolution of the nucleoprotein gene of influenza A virus JOURNAL J. Virol. 64, 1487-1497 (1990) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by O.T.Gorman, 14-DEC-1989. FEATURES from to/span description pept 46 1542 nucleoprotein BASE COUNT 492 a 319 c 428 g 326 t ORIGIN 1 agcaaaagca gggtagataa tcactcactg agtgacatcc acatcatggc gtcccaaggc 61 accaaacgat cttatgagca gatggaaact ggtggcgagc gccagaatgc caatgaaatt 121 agagcatctg tcgggaggat ggttggggga atcggaagat tctacataca gatgtgcact 181 gaactcaagc tcagtgacaa tgagggaaga ttgatccaaa acagcatcac catagagaga 241 atggttctat ctgcatttga tgagagaagg aacaagtacc tggaagagca tcccagcacc 301 gggagagacc ccaagaaaac tggaggacca atttacagaa ggagagatgg aaaatgggtg 361 agagaattag ttctgtatga caaagaagaa ctaaggagaa tctggcggca ggcaaacaat 421 ggggaagatg cgaccgctgg tctcacccat ttgatgatct ggcattccaa tttgaatgac 481 gccacgtatc agagaactag agcactagtg cgcacaggga tggatcccag gatgtgctcc 541 ctcatgcagg gctcaacact ccctagaagg tctggagctg ctggtgcagc agtaaaggga 601 gttgggacaa tggtaatgga actcatcagg atgataaaga gaggagtcaa tgaccgcaat 661 ttctggagag gtgaaaacgg acgaagaacg agaattgcct atgaaagaat gtgcaacatc 721 ctcaaaggga aatttcaaac agcagcacaa cgggctatga tggatcaagt gcgggaaagc 781 cggaatcctg gaaatgctga aatcgaggac cttatatttc tggctcgatc tgcacttatc 841 ctgaggggag cagtagctca caaatcatgc ctgccggcct gtgtgaatgg actggctgtg 901 gcgagtggat atgactttga aagggaggga tattccctcg ttggaataga tcctttccgt 961 ctactccaaa acagccaggt attcagccta atccgaccaa atgagaatcc agcacataag 1021 agtcaattgg tgtggatggc atgccattcc gctgcatttg aggatctgag agtgtcaagt 1081 ttcatccggg gaacaagggt gctaccaagg ggacaattgt cgactagggg tgttcaaatt 1141 gcatccaatg agaacatgga aaccatgaat tccagcactc ttgaattgag aagcaaatac 1201 tgggcaataa ggactagaag tggaggaaac accaaccaac aaagagcatc tgcaggacaa 1261 gtcagtgttc aacccagttt ctctgtgcag agaaacctcc cctttgagag ggcgaccatc 1321 atggctgcat tcacaggaaa tccggagggc aggacatctg atatgagaac ggaaatcata 1381 aggatgatgg aaaattcaag gccagaggat gtgtctttcc agggacgggg agtcttcgag 1441 ctctcagacg aaaaggccac gaacccgatc gtgccttcct ttgacatgag taatgaggga 1501 tcttatttct tcggagacaa tgctgaggag tatgacaatt gaagaaaaat acccttgttt 1561 ctact // LOCUS FLANPL 1565 bp ss-RNA VRL 04-AUG-1990 DEFINITION Influenza A/gull/Maryland/1815/79 (H13N6) nucleoprotein (seg 5) mRNA, complete cds. ACCESSION M30756 KEYWORDS nucleoprotein. SOURCE Influenza A/gull/Maryland/1815/79 RNA, and cDNA to viral RNA. ORGANISM Influenza virus type A Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses; Orthomyxoviridae; Influenzavirus; Influenza A viruses. REFERENCE 1 (bases 1 to 1565) AUTHORS Gorman,O.T., Bean,W.J., Kawaoka,Y. and Webster,R.G. TITLE Evolution of the nucleoprotein gene of influenza A virus JOURNAL J. Virol. 64, 1487-1497 (1990) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by O.T.Gorman, 14-DEC-1989. FEATURES from to/span description pept 46 1542 nucleoprotein BASE COUNT 490 a 319 c 429 g 327 t ORIGIN 1 agcaaaagca gggtagataa tcactcactg agtgacatcc acatcatggc gtcccaaggc 61 accaaacgat cttatgagca gatggaaact ggtggcgagc gccagaatgc caatgaaatt 121 agagcatctg tcgggaggat ggttggggga atcggaagat tctacataca gatgtgcact 181 gaactcaagc tcagtgacaa tgagggaaga ttgatccaaa acagcatcac catagagaga 241 atggttctat ctgcatttga tgagagaagg aacaagtacc tggaagagca tcccagcacc 301 gggagagacc ccaagaaaac tggaggacca atttacagaa ggagagatgg aaaatgggtg 361 agagaattag ttctgtatga caaagaagaa ctaaggagaa tctggcggca ggcaaacaat 421 ggggaagatg cgaccgctgg tctcacccat ttgatgatct ggcattccaa tttgaatgac 481 gccacgtatc agagaactag agcactagtg cgcacaggga tggatcccag gatgtgctcc 541 ctcatgcagg gctcaacact ccctagaagg tctggagctg ctggtgcagc agtaaaggga 601 gttgggacaa tggtaatgga actcatcagg atgataaaga gaggagtcaa tgaccgcaat 661 ttctggagag gtgaaaacgg acgaagaacg agaattgcct atgaaagaat gtgcaacatc 721 ctcaaaggga aatttcaaac agcagcacaa cgggctatga tggatcaggt gcgggaaagc 781 cggaatcctg gaaatgctga aatcgaggac cttatatttc tggctcgatc tgcacttatc 841 ctgaggggag cagtagctca caaatcatgc ctgccggcct gtgtatatgg actggctgtg 901 gcgagtggat atgactttga aagggaggga tattccctcg ttggaataga tcctttccgt 961 ctactccaaa acagccaggt attcagccta atccgaccaa atgagaatcc agcacataag 1021 agtcaattgg tgtggatggc atgccattcc gctgcatttg aggatctgag agtgtcaagt 1081 ttcatccggg gaacaagggt gctaccaagg ggacaattgt cgactagggg tgttcaaatt 1141 gcatccaatg agaacatgga aaccatgaat tccagcactc ttgaattgag aagcaaatac 1201 tgggcaataa ggactagaag tggaggaaac accaaccaac aaagagcatc tgcaggacaa 1261 gtcagtgttc aacccagttt ctctgtgcag agaaacctcc cctttgagag ggcgaccatc 1321 atggctgcat tcacaggaaa tccggagggc aggacatctg atatgagaac ggaaatcata 1381 aggatgatgg aaaattcaag gccagaggat gtgtctttcc aggggcgggg agtcttcgag 1441 ctctcagacg aaaaggccac gaacccgatc gtgccttcct ttgacatgag taatgaggga 1501 tcttatttct tcggagacaa tgctgaggag tatgacaatt gaagaaaaat acccttgttt 1561 ctact // LOCUS FLANPN 1565 bp ss-RNA VRL 04-AUG-1990 DEFINITION Influenza A/turkey/Minnesota/833/80 (H4N2) nucleoprotein (seg 5) mRNA, complete cds. ACCESSION M30769 KEYWORDS nucleoprotein. SOURCE Influenza A/turkey/Minnesota/833/80 RNA, and cDNA to viral RNA. ORGANISM Influenza virus type A Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses; Orthomyxoviridae; Influenzavirus; Influenza A viruses. REFERENCE 1 (bases 1 to 1565) AUTHORS Gorman,O.T., Bean,W.J., Kawaoka,Y. and Webster,R.G. TITLE Evolution of the nucleoprotein gene of influenza A virus JOURNAL J. Virol. 64, 1487-1497 (1990) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by O.T.Gorman, 14-DEC-1989. FEATURES from to/span description pept 46 1542 nucleoprotein BASE COUNT 494 a 311 c 429 g 331 t ORIGIN 1 agcaaaagca gggtagataa tcactcaccg agtgacatcc acatcatggc gtctcaaggc 61 accaaacgat cttatgagca gatggaaact ggtggggaac gccagaatgc cactgaaatc 121 agggcatctg ttgggagaat ggttggtgga atcgggagat tctacataca gatgtgcaca 181 gaactcaaac tcagcgacta tgaaggaagg ctgatccaaa acagcataac aatagagaga 241 atggttctct cggcatttga tgagaggaga aacaaatatc tggaagagca tcccagtgct 301 ggaaaagacc ctaagaagac tggaggtcca atatacagga ggagagatgg gaaatgggtg 361 agagaactga tcctgtatga caaagaggag atcagaagga tttggcgtca agcgaataat 421 ggagaagacg caaccgctgg tctcacccat ttgatgatct ggcattccaa tctgaatgat 481 gccacatatc agagaacaag ggcactggtg cgtactggga tggatcccag gatgtgctct 541 ctgatgcaag gatcaactct tccgaggagg tctggggctg ctggagcagc agtgaaagga 601 gttggaacaa tggtgatgga attgattcgg atgatcaagc gagggatcaa tgatcggaat 661 ttctggagag gtgaaaatgg acggagaaca agaattgcat acgaaagaat gtgcaacatt 721 ctcaagggga aattccaaac agcagcacaa cgagcaatga tggaccaggt gagggaaagc 781 cggaatccag ggaatgctga aattgaagat ctaatctttc ttgcacggtc tgctctcatt 841 ctgaggggat cagtggctca taagtcctgc ctgcctgctt gtgtgtatgg acttgctgtg 901 gccagtggat acgactttga gagggaggga tactccctag tcggaattga tcctttccgt 961 ctgctccaaa acagtcaagt cttcagtctt atcagaccaa acgaaaatcc agcacataaa 1021 agtcagctgg tatggatggc atgtcattct gcagcatttg aagatctgag agtgtcaagc 1081 ttcatcagag gaacaagagt ggtcccaaga ggacaactgt ccaccagagg agttcagatt 1141 gcttcaaatg agaacatgga gacaatggac tccagtactc ttgaattgag gagcagatac 1201 tgggctataa ggaccagaag cggagggaac actaaccagc agagagcatc tgcaggacaa 1261 atcagcgtac agcccacatt ctctgtgcag agaaacctcc cattcgagag agcaaccatt 1321 atggcggcat ttacgggaaa cactgaaggc agaacttcag acatgagaac tgagatcata 1381 aggatgatgg aaaatgccag acctgaagat gtgtctttcc aggggcgggg agtcttcgag 1441 ctctcggacg aaaaggcaac gaacccgatc gtgccttcct ttgacatgag taatgaagga 1501 tcttatttct tcggagacaa tgcagaggaa tatgacaatt aaagaaaaat acccttgttt 1561 ctact // LOCUS FLANPO 1565 bp ss-RNA VRL 04-AUG-1990 DEFINITION Influenza A/gull/Minnesota/945/80 (H13N6) nucleoprotein (seg 5) mRNA, complete cds. ACCESSION M30757 KEYWORDS nucleoprotein. SOURCE Influenza A/gull/Minnesota/945/80 RNA, and cDNA to viral RNA. ORGANISM Influenza virus type A Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses; Orthomyxoviridae; Influenzavirus; Influenza A viruses. REFERENCE 1 (bases 1 to 1565) AUTHORS Gorman,O.T., Bean,W.J., Kawaoka,Y. and Webster,R.G. TITLE Evolution of the nucleoprotein gene of influenza A virus JOURNAL J. Virol. 64, 1487-1497 (1990) STANDARD full staff_entry REFERENCE 2 (sites) AUTHORS Gorman,O.T. JOURNAL Unpublished (1990) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by O.T.Gorman, 14-DEC-1989, and [2] 18-JUL-1990. The isolate mentioned in the published paper [1] is incorrect and should be "A/Gull/Minnesota/945/80". FEATURES from to/span description pept 46 1542 nucleoprotein BASE COUNT 490 a 321 c 426 g 328 t ORIGIN 1 agcaaaagca gggtagataa tcactcactg agtgacatcc acatcatggc gtcccaaggc 61 accaaacgat cttatgagca aatggaaact ggtggcgagc gccagaatgc caatgacatc 121 agagcatctg tcgggaggat ggttggggga attggaagat tctacataca gatgtgcact 181 gaactcaagc tcagtgacaa tgagggaaga ttgatccaaa acagcatcac catagagaga 241 atggttctat ctgcatttga tgagagaagg aacaagtacc tggaagagca tcccagcacc 301 gggagagacc ccaagaaaac tggaggacca atttacagaa ggagagatgg aaaatgggtg 361 agagaattag ttctgtatga caaagaagag ctaaggagaa tctggcggca ggccaacaat 421 ggggaagatg cgactgctgg tctcacccat ttgatgatct ggcattccaa tttgaatgac 481 gccacgtatc aaagaactag agcacatgtg cgcacaggga tggatcccag gatgtgctcc 541 ctcatgcagg gctcaacact ccctagaagg tctggagctg ctggtgcagc agtaaaggga 601 gttgggacaa tggtaatgga actcatcagg atgataaaga gaggagtcaa tgaccgcaat 661 ttctggagag gtgaagacgg acgaagaacg agaattgcct atgaaagaat gtgcaacatc 721 ctcaaaggga aatttcagac agcagcacaa cgggctatga tggatcaagt gcgggaaagc 781 cggaatcctg gaaatgctga aatcgaggac cttatatttc tggctcgatc tgcacttatc 841 ctgaggggag cagtagctca caaatcatgc ctgccggcct gtgtatatgg actggctgtg 901 gcgagtggat atgactttga aagggaggga tattccctcg ttggaataga tcctttccgt 961 ctactccaaa acagccaggt attcagccta atccgaccaa atgagaatcc agcacataag 1021 agtcaattgg tgtggatggc atgccattcc gctgcatttg aggatctgag agtgtcaagt 1081 ttcatccggg gaacaagggt gctaccaagg ggacaattgt cgactagggg tgttcaaatt 1141 gcatccaatg agaacatgga aaccatgaat tccagcactc ttgaattaag aagcaaatac 1201 tgggcaataa ggactagaag tggaggaaac accaaccagc aaagagcatc tgcgggacaa 1261 gtcagtgttc aacccacttt ctctgtgcag agaaacctcc cctttgaaag ggcgaccatc 1321 atggctgcat tcacaggaaa tccggagggc agaacatctg atatgagaac ggaaatcata 1381 aggatgatgg aaaattcaag gccagaggat gtgtctttcc agggacgggg agtcttcgag 1441 ctctcagacg aaaaggccac gaacccgatc gtgccttcct ttgacatgag taatgaggga 1501 tcttatttct tcggagacaa tgctgaggag tatgacaatt gaagaaaaat acccttgttt 1561 ctact // LOCUS FLANPP 1565 bp ss-RNA VRL 04-AUG-1990 DEFINITION Influenza A/equine/Tennessee/5/86 (H3N8) nucleoprotein (seg 5) mRNA, complete cds. ACCESSION M30758 KEYWORDS nucleoprotein. SOURCE Influenza A/equine/Tennessee/5/86 RNA, and cDNA to viral RNA. ORGANISM Influenza virus type A Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses; Orthomyxoviridae; Influenzavirus; Influenza A viruses. REFERENCE 1 (bases 1 to 1565) AUTHORS Gorman,O.T., Bean,W.J., Kawaoka,Y. and Webster,R.G. TITLE Evolution of the nucleoprotein gene of influenza A virus JOURNAL J. Virol. 64, 1487-1497 (1990) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by O.T.Gorman, 14-DEC-1989. FEATURES from to/span description pept 46 1542 nucleoprotein BASE COUNT 502 a 318 c 411 g 334 t ORIGIN 1 agcaaaagca gggtagataa tcactcactg agtgacatca aagtcatggc gtctcaaggc 61 accaaacgat cttatgagca gatggaaact ggtggggaac gccagaatgc aactgaaatc 121 agagcatctg tcggaaggat ggtgggagga atcggccggt tctatgttca gatgtgtact 181 gagcttaaac tcaacgacca tgaagggcgg ctgattcaga acagcataac aatagaaagg 241 atggtacttt cggcattcga cgaaagaaga aacaagtacc tcgaggagca tcccagtgct 301 gggaaagacc ccaagaaaac gggaggcccg atatacagaa ggaaagatgg gaaatggatg 361 agagaactca tcctccatga taaagaagaa atcatgagga tctggcgtca ggccaacaat 421 ggtgaagacg ctactgctgg tcttactcat atgatgatct ggcactccaa tctcaatgac 481 accacatacc aaagaacaag ggctcttgtt cgggctggga tggatcccag aatgtgctct 541 ctgatgcaag gatcaaccct cccacggaga tctggagctg ccggtgctgc agtaaaaggt 601 gttggaacaa tggtaatgga actcatcagg atgatcaaac gcgggataaa tgatcgaaat 661 ttctggagag gtgaaaatgg tcgaagaacc agaattgctt atgaaagaat gtgcaatatc 721 ctcaaaggga aattccaaac agcagcacaa cgggcaatga tggaccaagt gagggagggc 781 cgcaatcctg gaaatgctga gattgaggat ctcattttct tggcacgatc agcactcatt 841 ttgagaggat cagtagccca taaatcatgc ctacctgcct gtgtttatgg ccttgcagta 901 gccagtgggt atgactttga gaaggaagga tactctctgg ttggaattga tcctttcaaa 961 ctactccaga acagccaaat tttcagtcta atcagaccga aagaaaatcc agcacacaag 1021 agccagttgg tgtggatggc atgccattct gcagcatttg aggacctgag agttttgaat 1081 ttcattagag gaaccaaagt aatcccaaga ggacagttag caaccagagg agtgcaaatt 1141 gcttcaaatg aaaacatgga gacaatagat tctagcacac tcgaactgag gagcagatat 1201 tgggcaataa ggaccaggag tggagggaac accagtcaac agagagcatc tgcaggacag 1261 ataagtgtgc aacccacttt ctcagtgcag agaaatcttc cctttgaaag agcaaccatt 1321 atggctgcat tcactgggaa cactgagcgg aggacttccg acatgagaac ggaaatcata 1381 aggatgatgg aaaatgccag atcagaagat gtgtctttcc aggggcgggg agtcttcgag 1441 ctctcggacg aaaaggcaac gaacccgatc gtgccttcct ttgacatgag caatgaaggg 1501 tcttatttct tcggagacaa tgctgaggag tttgacagtt aaagaaaaat acccttgttt 1561 ctact // LOCUS FLANPR 1565 bp ss-RNA VRL 04-AUG-1990 DEFINITION Influenza A/duck/New Zealand/31/76 (H4N6) nucleoprotein (seg 5) mRNA, complete cds. ACCESSION M30760 KEYWORDS nucleoprotein. SOURCE Influenza A/duck/New Zealand/31/76 RNA, and cDNA to viral RNA. ORGANISM Influenza virus type A Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses; Orthomyxoviridae; Influenzavirus; Influenza A viruses. REFERENCE 1 (bases 1 to 1565) AUTHORS Gorman,O.T., Bean,W.J., Kawaoka,Y. and Webster,R.G. TITLE Evolution of the nucleoprotein gene of influenza A virus JOURNAL J. Virol. 64, 1487-1497 (1990) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by O.T.Gorman, 14-DEC-1989. FEATURES from to/span description pept 46 1542 nucleoprotein BASE COUNT 502 a 325 c 412 g 326 t ORIGIN 1 agcaaaagca gggtagataa tcactcactg agtgacatcc acatcatggc gtctcaaggc 61 accaaacgat cttatgaaca gatggagact ggtggagaac gccagaatgc tactgagatc 121 agagcatctg ttgggagaat ggttggtgga atcggaaggt tctacataca gatgtgcact 181 gaactcaaac tcagtgacta tgaaggaagg ctaatccaaa acagcataac aatagagaga 241 atggttcttt ctgcatttga tgaaaggaga aacaagtact tggaggagca ccccagtgct 301 ggaaaggatc ctaagaaaac tggaggtcca atctatagaa gaagagatgg aaaatgggtg 361 agagaactga tcctgtacga caaagaggag atcaggagga tttggcgcca agcgaataat 421 ggagaggacg caactgctgg tcttacccat ttgatgatct ggcattccaa tcttaatgat 481 gcaacatacc agagaacaag agcccttgtg cgtactggga tggaccccag gatgtgctct 541 ctgatgcaag gctcaactct cccgaggagg tctggagctg caggtgctgc agtaaaagga 601 gttggaacaa tggtgatgga gctgattcga atgataaaac gaggaatcaa tgaccgtaac 661 ttctggagag gcgagaatgg acgaaggaca agaattgctt acgagaggat gtgcaacatt 721 ctcaaaggga aattccaaac agcagcacaa agagcaatga tggatcaggt gcgggagagc 781 cgaaatcctg gaaatgctga aattgaagac ctcatcttct tggcacgatc tgcactcatt 841 ctgaggggat cagtggctca caaatcctgc ctacctgctt gtgtatatgg attagctgta 901 gccagtggat atgactttga aagagaagga tactccctgg tcggaataga ccccttccgt 961 ctgctccaga acagccaggt attcagcctc atcagaccca atgaaaatcc agcacacaag 1021 agtcaactgg tgtggatggc atgccactct gcagcatttg aagacctgag agtttcaagc 1081 ttcatcaggg gaacaagagt gatcccgaga gggcaactgt ccactagagg ggtccaaatt 1141 gcttcaaatg aaaacatgga aacaatggac tctagcactc tcgaattgag aagcagatac 1201 tgggccataa ggaccagaag tggaggaaac accaaccaac agcgggcatc tgcgggacaa 1261 atcagcgtgc aacccacttt ctctgtgcaa agaaatcttc ctttcgaaag agcaaccatc 1321 atggcagcat ttacagggaa cactgaaggc agaacatctg acatgagaac tgagatcata 1381 agaatgatgg agagtgccag accagaagat gtgtctttcc aggggcgggg agtcttcgag 1441 ctctcagacg aaaaggcaac gaacccgatc gtgccttcct ttgacatgag taacgaagga 1501 tcttatttct tcggagacaa tgcagaggag tatgacaatt aaagaaaaat acccttgttt 1561 ctact // LOCUS FLANPS 1565 bp ss-RNA VRL 04-AUG-1990 DEFINITION Influenza A/grey teal/Australia/2/79 (H4N4) nucleoprotein (seg 5) mRNA, complete cds. ACCESSION M30761 KEYWORDS nucleoprotein. SOURCE Influenza A/grey teal/Australia/2/79 RNA, and cDNA to viral RNA. ORGANISM Influenza virus type A Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses; Orthomyxoviridae; Influenzavirus; Influenza A viruses. REFERENCE 1 (bases 1 to 1565) AUTHORS Gorman,O.T., Bean,W.J., Kawaoka,Y. and Webster,R.G. TITLE Evolution of the nucleoprotein gene of influenza A virus JOURNAL J. Virol. 64, 1487-1497 (1990) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by O.T.Gorman, 14-DEC-1989. FEATURES from to/span description pept 46 1542 nucleoprotein BASE COUNT 513 a 327 c 401 g 324 t ORIGIN 1 agcaaaagca gggtagataa tcactcaccg agtgacatcc acatcatggc gtctcaaggc 61 accaaacgat cttatgaaca aatggaaact ggtggagaac gccagaatgc tactgaaatc 121 agatcttctg ttggaagaat ggttggtgga attggaagat tctacataca gatgtgcact 181 gaattcaaac tcagcgacta tgaaggaagg ctgatccaaa acagcataac aatagagaga 241 atggtccttt ctgcatttga tgaaaggagg aacaaatacc tggaagagca tcccagtgct 301 ggaaaagatc ctaagaagac tggaggtcca atctacagaa ggagagatgg aaagtggatt 361 agagagctga tcctatgcga caaagaggag atcaggagaa tctggcgcca agcgaataat 421 ggggaggatg caactgctgg ccttacccat ttgatgatat ggcattccaa cctcaatgat 481 gccacttacc agagaacaag agcccttgtg cgtactggga tggatcctag aatgtgctct 541 ctgatgcaag gctcaactct cccgagaaga tctggagctg ctggcgctgc agtaaaagga 601 gttgggacaa tggtgatgga actgattcgg atgataaaac ggggaatcaa tgaccgaaat 661 ttctggagag gcgaaaatgg gcgaagaaca agaattgcct atgagagaat gtgcaacatt 721 ctcaaaggga aattccaaac agcagcacaa agagcaatga tggaccaagt gcgggaaagc 781 cgaaatcctg ggaatgctga aattgaagac cttatatttc tggcacgatc tgcactcatc 841 ctaagaggat cagtagccca caaatcctgc ctacctgctt gtgtgtatgg attggccgta 901 gccagtggat atgactttga gaaagaagga tactccctag tcggaataga tcctttccgc 961 atgctccaga acagccaggt attcagtctc atcaggccca atgaaaattc agcacacaag 1021 agtcaattgg tatggatggc atgccactct gcagcatttg aagacctgag agtatcaagc 1081 ttcatcagag gaacaaaagt ggtcccaaga gggaaactct ccaccagagg agtccaaatc 1141 gcttcaaatg agaacatgga aacaatggac tctacaactc tcgaattgag aagcaggtac 1201 tgggctataa ggaccagaag cggaggaaac acaaatcagc agcgggcgtc agctggacaa 1261 atcagcgtac aacccacttt ctctgtgcag aggaatctcc cattcgagag agcaaccatt 1321 atggcagcat ttaccgggaa cactgaaggc agaacatccg acatgagaac tgagatcata 1381 agaatgatgg agagtgccag accagaagat gtgtctttcc aagggcgggg agtcttcgag 1441 ctctcggacg aaaaggcaac gaacccgatc gtgccttcct ttgacatgag taacgaagga 1501 tcttatttct tcggagacaa tgcagaggag tatgacaatt aaagaaaaat acccttgttt 1561 ctact // LOCUS FLANPT 1565 bp ss-RNA VRL 04-AUG-1990 DEFINITION Influenza A/duck/Czechoslovakia/56 (H4N6) nucleoprotein (seg 5) mRNA, complete cds. ACCESSION M30762 KEYWORDS nucleoprotein. SOURCE Influenza A/duck/Czechoslovakia/56 RNA, and cDNA to viral RNA. ORGANISM Influenza virus type A Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses; Orthomyxoviridae; Influenzavirus; Influenza A viruses. REFERENCE 1 (bases 1 to 1565) AUTHORS Gorman,O.T., Bean,W.J., Kawaoka,Y. and Webster,R.G. TITLE Evolution of the nucleoprotein gene of influenza A virus JOURNAL J. Virol. 64, 1487-1497 (1990) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by O.T.Gorman, 14-DEC-1989. FEATURES from to/span description pept 46 1542 nucleoprotein BASE COUNT 489 a 315 c 432 g 329 t ORIGIN 1 agcaaaagca gggtagataa tcactcactg agtgacatcc acatcatggc gtctcaaggc 61 accaaacgat cttatgaaca gatggaaact ggtggagagc gccagaatgc cactgagatc 121 agagcatctg ttgggagaat ggtcggtgga attgggaggt tttacataca gatgtgtact 181 gaactcaagc tcagtgacta tgaagggaga ctgatccaga acagcataac aatagagaga 241 atggttctct ctgcatttga tgaaaggagg aacaaatacc tggaagagca tcccagcgct 301 gggaaggacc ctaagaaaac tggaggtcca atctatagaa ggagagacgg aaaatgggtg 361 agagaattga tcctgtacga caaagaggag atcaggagga tttggcgcca agcaaacaat 421 ggagaagacg caactgctgg tctcactcac ttgatgatct ggcattccaa tctgaatgat 481 gccacatatc agagaacaag agctcttgta cgtactggga tggacccaag aatgtgctct 541 ctgatgcaag gatcaactct cccgaggagg tctggagctg ctggtgcagc agtgaaggga 601 gttgggacaa tggtaatgga gctaattcgg atgataaagc ggggaatcaa tgatcggaat 661 ttctggagag gcgaaaatgg acgaagaaca aggattgcat atgagaggat gtgcaatatc 721 ctcaaaggga agttccaaac agcagcacaa agagcaatga tggaccaggt gcgggagagc 781 cggaatcctg ggaatgctga aattgaagac cttatcttcc tggcacggtc tgcactcatt 841 ctgagaggat cagtggctca taagtcctgt ctgcctgctt gtgtgtatgg acttgctgtg 901 gccagtggat acgactttga gagagagggg tactctctgg tcggaataga tcctttccgt 961 ctgctccaaa acagccaagt attcagcctc attagaccaa atgagaatcc agcacataag 1021 agccaactgg tatggatggc atgccattct gcagcatttg aagacctgag ggtatcaagc 1081 ttcatcagag ggacaagagt ggtcccaaga gggacaactg tcaccagagg atttcaaatt 1141 gcttcaaatg agaacatgga aacaatggac tccagcactc ttgaactgag gagcagatat 1201 tgggctataa ggaccaagag cggaggaaac accaaccaac agagagcatc tgcaggacaa 1261 atcagtgtac aacccacttt ctcggtacag agaaatctcc ctttcgagag agcgaccatc 1321 atggcggcat ttacagggaa cactgaaggc agaacgtctg acatgaggac tgagatcata 1381 agaatgatgg agagtgccag accagaagat gtgtctttcc aggggcgggg agtcttcgag 1441 ctctcggacg aaaaggcaac gaacccgatc gtgccttcct ttgacatgag taatgaggga 1501 tcttatttct tcggagacaa tgcagaggag tatgacaatt aaagaaaaat acccttgttt 1561 ctact // LOCUS FLANPU 1565 bp ss-RNA VRL 04-AUG-1990 DEFINITION Influenza A/duck/Ukraine/2/60 (H11N8) nucleoprotein (seg 5) mRNA, complete cds. ACCESSION M30763 KEYWORDS nucleoprotein. SOURCE Influenza A/duck/Ukraine/2/60 RNA, and cDNA to viral RNA. ORGANISM Influenza virus type A Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses; Orthomyxoviridae; Influenzavirus; Influenza A viruses. REFERENCE 1 (bases 1 to 1565) AUTHORS Gorman,O.T., Bean,W.J., Kawaoka,Y. and Webster,R.G. TITLE Evolution of the nucleoprotein gene of influenza A virus JOURNAL J. Virol. 64, 1487-1497 (1990) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by O.T.Gorman, 14-DEC-1989. FEATURES from to/span description pept 46 1542 nucleoprotein BASE COUNT 496 a 325 c 423 g 321 t ORIGIN 1 agcaaaagca gggtagataa tcactcactg agtgacatcc acatcatggc gtctcaaggc 61 accaaacgat cttatgagca gatggaaact ggtggagatc gccagaatgc cactgagatc 121 agagcatctg ttgggaggat ggtcagcgga attggaagat tctacataca gatgtgcact 181 gaactcaaac tcagcgacta tgaagggagg ctgatccaga acagcataac aatagaaaga 241 atggttctct ctgcatttga tgaaaggagg aacaaatacc tggaagagca tcccagcgca 301 gggaaggatc ctaagaaaac cggaggtcca atctatagaa ggagagacgg aaaatggatg 361 agagagctga tcctgtatga caaagaggag atcaggagaa tttggcgcca agcgaacaat 421 ggagaagacg caactgctgg tctcactcat ttgatgatct ggcattccaa tctgaatgat 481 gccacatatc aaagaacaag agctctcgtg cgcactggaa tggaccccag aatgtgctcc 541 ctgatgcaag gatcaactct cccgaggaga tctggagctg ctggcgcagc agtaaaggga 601 gttgggacaa tggtaatgga gctaattcgg atgataaagc ggggaatcaa tgatcggaat 661 ttctggagag gcgaaaatgg acgaagaaca aggattgcat atgagagaat gtgcaacatc 721 cttaaaggga aattccaaac agcagcacaa cgagcaatga tggaccaagt gcgggaaagc 781 cggaatcctg ggaatgctga aattgaagac cttatcttcc tggcacgatc tgcactcatt 841 ctgagaggat cagtggctca taagtcctgc ctgcctgctt gtgtatatgg acttgctgtg 901 gccagtggat acgacttcga gagagaggga tactctctgg tcggaataga tcctttccgt 961 ctgctccaaa acagccaggt gttcagtctc attagaccaa atgagaatcc tgcacataag 1021 agccaactgg tgtggatggc atgccattct gcagcatttg aagacctgag ggtatcaagc 1081 ttcatcagag ggacaagagt ggtcccaaga gggcaactgt ccaccagagg agttcaaatt 1141 gcttcaaatg aaaacatgga aacaatggac tccagcactc ttgaactgag gagcagatac 1201 tgggctataa ggaccaggag cggaggaaac accaaccaac agagagcatc cgcaggacaa 1261 ataagtgtac aacccacttt ctcggtacag agaaatcttc ctttcgagag agcgaccatc 1321 atggcggcat ttacagggaa cactgaaggc agaacgtctg acatgaggac tgagatcata 1381 agaatgatgg agagtgccag atcagaagat gtgtctttcc aggggcgggg agtcttcgag 1441 ctctcggacg aaaaggcaac gaacccgatc gtgccttcct ttgacatgag taatgaggga 1501 tcttatttct tcggagacaa tgcagaggag tatgacaatt aaagaaaaat acccttgttt 1561 ctact // LOCUS FLANPV 1565 bp ss-RNA VRL 04-AUG-1990 DEFINITION Influenza A/mallard/Astrakhan/244/82 (H?N6) nucleoprotein (seg 5) mRNA, complete cds. ACCESSION M30764 KEYWORDS nucleoprotein. SOURCE Influenza A/mallard/Astrakhan/244/82 RNA, and cDNA to viral RNA. ORGANISM Influenza virus type A Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses; Orthomyxoviridae; Influenzavirus; Influenza A viruses. REFERENCE 1 (bases 1 to 1565) AUTHORS Gorman,O.T., Bean,W.J., Kawaoka,Y. and Webster,R.G. TITLE Evolution of the nucleoprotein gene of influenza A virus JOURNAL J. Virol. 64, 1487-1497 (1990) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by O.T.Gorman, 14-DEC-1989. FEATURES from to/span description pept 46 1542 nucleoprotein BASE COUNT 500 a 317 c 421 g 327 t ORIGIN 1 agcaaaagca gggtagataa tcactcactg agtgacatca acatcatggc gtctcaaggc 61 accaaacgat cttatgaaca gatggaaact ggtggagaac gccagaatgc tactgagatc 121 agagcatctg ttggaagaat ggttggtgga attgggaggt tttacataca gatgtgcact 181 gaactcaaac tcagcgacta tgaagggagg ctgatccaga acagcataac aatagagaga 241 atggttctct ctgcatttga tgaaaggagg aacaaatacc tggaagaaca tcccagtgcg 301 gggaaggacc cgaagaaaac tggaggtcca atctaccgaa ggagagacgg gaaatggatg 361 agagagctga ttctgtatga caaagaggag atcaggagga tttggcgtca agcgaacaat 421 ggagaagacg caactgctgg tctcactcac ctgatgatct ggcattccaa cctaaatgat 481 gccacatacc agagaacaag agctctcgtg cgtactggga tggaccccag aatgtgctct 541 ctgatgcaag gatcaactct cccgaggaga tctggagctg ctggtgcagc agtaaaggga 601 gtcgggacga tggtgatgga actaattcgg atgataaagc gagggattaa cgatcggaat 661 ttctggagag gcgaaaacgg acgaagaaca aggattgcat atgagagaat gtgcaacatc 721 ctcaaaggga aattccaaac agcagcacaa agagcaatga tggatcaggt gcgagaaagc 781 agaaatcctg ggaatgctga aattgaagat ctcatctttc tggcacggtc tgcactcatc 841 ctgagaggat cagtggccca taagtcctgc ttgcctgctt gtgtgtacgg acttgctgtg 901 gccagtggat atgactttga gagagaaggg tactctctgg tcgggataga tcctttccgt 961 ctgcttcaaa acagccaggt cttcagtctc atcagaccaa atgagaatcc agcacataag 1021 agtcaactgg tatggatggc atgccattct gcagcatttg aggacctgag agtctcaagt 1081 ttcatcagag ggacaagagt ggtcccaaga ggacaactat ccaccagagg agttcaaatt 1141 gcttcaaatg agaacatgga aacaatggac tccagtactc ttgaactgag aagcagatat 1201 tgggctataa gaaccaggag tggaggaaac accaaccaac agagagcatc tgcaggacaa 1261 atcagtgtac aacccacttt ctcggtacag agaaatcttc ctttcgaaag agcgaccatc 1321 atggcggcat ttacagggaa cactgaaggc agaacatctg acatgaggac tgaaatcata 1381 agaatgatgg aaagtgccag accagaagat gtgtctttcc aggggcgggg agtcttcgag 1441 ctctcggacg aaaaggcaac gaacccgatc gtgccttcct ttgacatgag taatgaagga 1501 tcttatttct tcggagacaa tgcagaggag tatgacaatt aaagaaaaat acccttgttt 1561 ctact // LOCUS FLANPW 1565 bp ss-RNA VRL 04-AUG-1990 DEFINITION Influenza A/budgerigar/Hokkaido/1/77 (H4N6) nucleoprotein (seg 5) mRNA, complete cds. ACCESSION M30765 KEYWORDS nucleoprotein. SOURCE Influenza A/budgerigar/Hokkaido/1/77 RNA, and cDNA to viral RNA. ORGANISM Influenza virus type A Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses; Orthomyxoviridae; Influenzavirus; Influenza A viruses. REFERENCE 1 (bases 1 to 1565) AUTHORS Gorman,O.T., Bean,W.J., Kawaoka,Y. and Webster,R.G. TITLE Evolution of the nucleoprotein gene of influenza A virus JOURNAL J. Virol. 64, 1487-1497 (1990) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by O.T.Gorman, 14-DEC-1989. FEATURES from to/span description pept 46 1542 nucleoprotein BASE COUNT 500 a 322 c 428 g 315 t ORIGIN 1 agcaaaagca gggtagataa tcactcacag agtgacatcc acgacatggc gtctcaaggc 61 accaaacgat cctatgagca gatggaaact ggtggagaac gccagaatgc tactgagatc 121 agagcatctg ttgggagaat ggtcggtgga attggaagat tctacataca gatgtgcact 181 gaaatcaaac tcagtgacta tgaagggagg ctgattcaga acagcataac aatagagaga 241 atggtcctct ctgcatttga ggaaaggagg aacaaatacc tggaagagca ccccagtgcg 301 ggaaaggacc cgaagaaaac tggaggtcca atctacagaa ggagagatgg aaaatgggtg 361 agagaactga tcctgtatga caaagaggag ataaggagga tctggcgcca agcgaacaat 421 ggagaagacg caactgcagg tctcactcac ctgatgatct ggcattccaa cctgaatgat 481 gccacatatc agagaacaag agccctcgtg cgtactggga tggaccccag aatgtgctct 541 ctgatgcaag gatcaactct cccgagaaga tctggggcag ctggtgcagc aataaaggga 601 gttgggacga tggtgatgga gctgattcgg atgataaaac ggggaatcaa tgatcggaat 661 ttctggagag gcgaaaatgg gcgaagaaca aggactgcat acgagagaat gtgcaacatc 721 ctaaaaggaa aattccaaac agcagcacaa cgagcaatga tggatcaggt gcgagaaagc 781 aggaatcctg gaaatgctga gattgaagac cttatatttc tggcacggtc tgcactcatt 841 ctgagaggat cagtagctca taagtcctgc ctgcctgctt gtgtatatgg gcttgcagtg 901 gccagtgggt acgactttga gagagaggga tactctctgg tcggaataga tcctttccgc 961 ctgcttcaaa acagtcaggt gttcagcctc attagaccaa atgagaatcc agcacataaa 1021 agccaactag tctggatggc ttgccattct gcagcatttg aagacctgag agtctcaagc 1081 ttcatcagag ggacaagact agtcccaaga ggacaactgt ccactagagg agttcaaatt 1141 gcttcaaatg agaacatgga agcaatggac tccaatactc ttgagctgag gagcagatac 1201 tgggctataa ggaccaggag cggaggaaac accaaccaac agagagcatc tgcaggacaa 1261 atcagcgtgc aacctacttt ctcggtacag aggaatctcc ctttcgagag agctaccatc 1321 atggcggcat ttacagggaa caatgaagga aggacttctg acatgaggac tgagatcata 1381 agaatgatgg aaagtgccag accagaagat gtgtctttcc aggggcgggg agtcttcgag 1441 ctctcggacg aaaaggcaac gaacccgatc gtgccttcct ttgacatgag taacgaggga 1501 tcttatttct tcggagacaa tgcagaggag tatgacaatt aaagaaaaat acccttgttt 1561 ctact // LOCUS FLANPX 1565 bp ss-RNA VRL 04-AUG-1990 DEFINITION Influenza A/ruddy turnstone/New Jersey/47/85 (H4N6) nucleoprotein (seg 5) mRNA, complete cds. ACCESSION M30766 KEYWORDS nucleoprotein. SOURCE Influenza A/ruddy turnstone/New Jersey/47/85 RNA, and cDNA to viral RNA. ORGANISM Influenza virus type A Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses; Orthomyxoviridae; Influenzavirus; Influenza A viruses. REFERENCE 1 (bases 1 to 1565) AUTHORS Gorman,O.T., Bean,W.J., Kawaoka,Y. and Webster,R.G. TITLE Evolution of the nucleoprotein gene of influenza A virus JOURNAL J. Virol. 64, 1487-1497 (1990) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by O.T.Gorman, 14-DEC-1989. FEATURES from to/span description pept 46 1542 nucleoprotein BASE COUNT 491 a 317 c 423 g 334 t ORIGIN 1 agcaaaagca gggtagataa tcactcaccg agtgacatcc acatcatggc gtctcaaggc 61 accaaacgat cttatgagca gatggaaact ggtggggaac gccagaatgc cactgaaatc 121 agagcatctg ttgggagaat ggttggtgga atcggaagat tctacataca gatgtgcact 181 gaactcaaac tcagtgacta tgaagggaga ctgatccaaa acagcataac aatagagaga 241 atggttctct cggcatttga cgagaggaga aacaaatatc tggaagagca ccccagtgct 301 gggaaagacc ctaaaaagac tggaggtcca atctacagga ggagagatgg gaaatggatg 361 agagaactca tcctatatga caaagaggag atcagaagga tttggcgtca agcgaataat 421 ggagaagacg caactgctgg tctcactcat ttgatgatct ggcattccaa tctgaatgat 481 gccacatacc agagaacaag ggcacttgtg cgtactggga tggatcccag gatgtgctct 541 ctgatgcaag gctcaactct tccgaggagg tctggggctg ctggagcagc agtgaaaggg 601 gttggaacaa tggtgttgga attgatccgg atgatcaagc gagggatcaa tgatcggaat 661 ttctggagag gcgaaaatgg aaggaggaca aggattgctt atgaaagaat gtgcaacatt 721 ctcaaaggga aattccaaac agcagcacaa cgagcaatga tggatcaggt gagggaaagc 781 cggaatcccg ggaatgctga aattgaagat ctcatctttc ttgcacggtc tgctctcatt 841 ctgagaggat cagtggccca taagtcctgc ctgcctgctt gtgtatatgg acttgctgtg 901 gccagtggat acgactttga aagggaggga tactccctag tcggaattga tcctttccgt 961 ttgcttcaaa acagtcaagt cttcagtctt atcagaccaa acgaaaatcc agcacataaa 1021 agtcagctgg tatggatggc atgccactct gcagcatttg aagatctgag agtgtcaagc 1081 ttcattagag gaacaagagt agtcccaaga ggacaactgt ccaccagagg agttcagatt 1141 gcttcaaatg agaacatgga gacaatggac tccagcactc ttgaattgag gagcagatac 1201 tgggctataa ggaccagaag cggagggaac actaaccagc agagagcatc tgcagggcaa 1261 atcagcgtac agcccacatt ctctgtgcag aggaacctcc cattcgagag agcaaccatt 1321 atggcggcat ttacaggaaa cactgaaggc agaacttcag acatgagaac tgagatcata 1381 aggatgatgg aaaatgccag tcctgaagat gtgtctttcc aggggcgggg agtcttcgag 1441 ctctcggacg aaaaggcaac gaacccgatc gtgccttcct ttgacatgag taacgaagga 1501 tcttatttct tcggagacag tgcagaggaa tatgacaatt aaagaaaaat acccttgttt 1561 ctact // LOCUS FLANPY 1565 bp ss-RNA VRL 04-AUG-1990 DEFINITION Influenza A/tern/South Africa/61 (H5N3) nucleoprotein (seg 5) mRNA, complete cds. ACCESSION M30767 KEYWORDS nucleoprotein. SOURCE Influenza A/tern/South Africa/61 RNA, and cDNA to viral RNA. ORGANISM Influenza virus type A Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses; Orthomyxoviridae; Influenzavirus; Influenza A viruses. REFERENCE 1 (bases 1 to 1565) AUTHORS Gorman,O.T., Bean,W.J., Kawaoka,Y. and Webster,R.G. TITLE Evolution of the nucleoprotein gene of influenza A virus JOURNAL J. Virol. 64, 1487-1497 (1990) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by O.T.Gorman, 14-DEC-1989. FEATURES from to/span description pept 46 1542 nucleoprotein BASE COUNT 493 a 322 c 428 g 322 t ORIGIN 1 agcaaaagca gggtagataa tcactcactg agtgacatcc acatcatggc gtctcaaggc 61 accaaacgat cttatgagca gatggaaact ggtggagagc gccagaatgc cactgagatc 121 agagcatctg ttgggagaat ggtcggtgga atcgggagat tctacataca gatgtgcact 181 gaactcaaac tcagtgacta tgaagggagg ctgatccaga acagcataac aatagagaga 241 atggttctct ctgcatttga tgaaaggagg aacaaatacc tggaagagca tcccagcgcg 301 gggaaggacc cgaagaaaac tggaggtcca atctatagaa ggagagacgg aaaatggatg 361 agagaactga tcctgtatga caaagaggag atcaggagga tttggcgcca agcaaacaat 421 ggagaagacg caactgctgg tctcactcat ctgatgatct ggcattccaa cttgaatgat 481 gccacatatc agagaacaag agctcttgtg cgtactggga tggaccccag aatgtgctct 541 ctgatgcaag gatcaactct cccgaggaga tctggagccg ctggtgcagc agtgaaggga 601 gttggaacaa tggtaatgga gctaattcgg atgataaagc ggggaatcaa tgatcggaat 661 ttttggagag gcgaaaatgg acgaagaaca aggattgcat atgagagaat gtgcaacatc 721 ctcaaaggga aattccaaac agcagcacaa cgagcaatga tggaccaggt gcgagaaagc 781 cggaatcctg ggaatgctga aattgaagac cttatctttc tggcacggtc tgcactcatt 841 ctgagagggt cagtggctca taagtcctgc ctgcctgctt gtgtatatgg acttgctgtg 901 gccagtggat acgactttga gagagaggga tactctctgg tcggaataga ccctttccgt 961 ctgctccaaa acagccaggt attcagcctc attagaccaa aagagaatcc agcacataag 1021 agccaattgg tatggatggc atgccattct gcagcatttg aagacctgag ggtatcaagc 1081 ttcatcagag ggacaagagt ggtcccaaga ggacaactgt ccaccagagg atttcaaatt 1141 gcttcgaatg agaacatgga ggcaatggac tccagtactc ttgaactgag gagcagatac 1201 tgggctataa ggaccaggag cggaggaaac accaaccaac agagagcatc tgcaggacaa 1261 atcagcgtac aacccacttt ctcagtacag agaaatcttc ctttcgagag agcgaccatc 1321 atggcggcat ttacagggaa cactgaaggc agaacatctg acatgaggac tgagatcata 1381 agaatgatgg aaagtgccag accagaagat gtgtctttcc aggggcgggg agtcttcgag 1441 ctctcggacg aaaaggcaac gaacccgatc gtgccttcct ttgacatgag taatgaggga 1501 tcttatttct tcggagacaa tgcagaggag tatgacaatt aaagaaaaat acccttgttt 1561 ctact // LOCUS FLANPZ 1565 bp ss-RNA VRL 04-AUG-1990 DEFINITION Influenza A/chicken/Pennsylvania/1/83 (H5N2) nucleoprotein (seg 5) mRNA, complete cds. ACCESSION M30768 KEYWORDS nucleoprotein. SOURCE Influenza A/chicken/Pennsylvania/1/83 RNA, and cDNA to viral RNA. ORGANISM Influenza virus type A Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses; Orthomyxoviridae; Influenzavirus; Influenza A viruses. REFERENCE 1 (bases 1 to 1565) AUTHORS Gorman,O.T., Bean,W.J., Kawaoka,Y. and Webster,R.G. TITLE Evolution of the nucleoprotein gene of influenza A virus JOURNAL J. Virol. 64, 1487-1497 (1990) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by O.T.Gorman, 14-DEC-1989. FEATURES from to/span description pept 46 1542 nucleoprotein BASE COUNT 492 a 313 c 427 g 333 t ORIGIN 1 agcaaaagca gggtagataa tcactcaccg agtgacatcc acatcatggc gtctcaaggc 61 accaaacgat cttatgagca gatggaaact ggtggggaac gccagaatgc cactgagatc 121 agagcatctg ttgggagaat ggttggtgga atcggaaggt tctacataca gatgtgcact 181 gaactcaaac tcagcgacca tgaagggaga ctgatccaga acagcataac aatagagaga 241 atggttctct cggcatttga tgagagaaga aacagatatc tggaagagca tcccagtgct 301 gggaaagatc ctaagaagac tgggggtcca atctacagaa ggagagatgg gaaatgggtg 361 agagaattga tcctctatga caaagaggag atcagaagaa tttggcgtca agcaaataat 421 ggagaagacg caactgctgg tctcactcat ttgatgatct ggcactccaa cctgaatgat 481 gccacatatc agagaacaag ggcacttgtg cgtaccggga tggaccccag gatgtgctct 541 ctgatgcaag gctcaacact tccaaggagg tctggtgctg ctggagcagc cgtgaaaggg 601 attggaacaa tggtaatgga attgatacgg atgatcaaac gagggatcaa tgataggaat 661 ttctggagag gcgaaaatgg gcggagaaca agaattgcat acgagagaat gtgtaacatt 721 ctcaagggga aatttcaaac agctatacaa cgggcaatga tggaccaagt gagggaaagc 781 cggaatcctg ggaatgctga aattgaagat ctcatctttc ttgcgcggtc tgctctcatt 841 ctgaggggat cagtggcaca taagtcctgc ctgcctgctt gtgtgtatgg acttgctgtg 901 gccagtggat acgattttga gagggaggga tactccctag tcgggattga tcctttccgt 961 ctgctccaaa acagtcaagt tttcagtctt ataagaccaa acgaaaatcc agcacataaa 1021 agtcagctgg tatggatggc atgtcactct gcagcatttg aggatctgag agtatcaagc 1081 ttcattagag gaacaagagt ggtcccaaga gggcaacttt ccaccagagg agttcagatt 1141 gcttcaaatg agaacatgga tgcaatagac tccagtactc ttgaattgag gagcagatac 1201 tgggccataa gaaccagaag cggaggaaac acaaatcagc agagagcatc tgcaggacaa 1261 atcagcgtac agcccacatt ctcagtgcag agaaacctcc cattcgagag agcaaccatc 1321 atggcggcat tcacaggaaa cacagaaggc agaacttcag acatgagaac tgagattata 1381 aggatgatgg agaatgccag acctgaagat gtgtctttcc aggggcgggg agtcttcgag 1441 ctctcggacg aaaaggcgac gaacccgatc gtgccttcct ttgacatgag taacgagggg 1501 tcttatttct tcggagacaa tgcagaggag tatgacaatt aaagaaaaat acccttgttt 1561 ctact // LOCUS HUMCEAB 2690 bp ds-DNA PRI 04-AUG-1990 DEFINITION Human carcinoembryonic antigen (CEA) hsCGM1 gene, exons 1 and 2. ACCESSION M22433 D51537 KEYWORDS carcinoembryonic antigen. SOURCE Human fetal liver (lambda-hsCHM1-1 library) DNA, clone hsCHM1. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 2690) AUTHORS Thompson,J.A., Mauch,E.-M., Chen,F.-S., Hinoda,Y., Schrewe,H., Berling,B., Barnert,S., von Kleist,S., Shively,J.E. and Zimmermann,W. TITLE Analysis of the size of the carcinoembryonic antigen (CEA) gene family: Isolation and sequencing of N-terminal domain exons JOURNAL Biochem. Biophys. Res. Commun. 158, 996-1004 (1989) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence [1] kindly submitted by J.A. Thompson 07-FEB-1989. FEATURES from to/span description pept 1250 1313 carcinoembryonic antigen hsCHM1, exon 1 /nomgen="CEA" /map="19q13.1-q13.3" /hgml_locus_uid="LN0164S" 2158 / 2517 carcinoembryonic antigen hsCHM1, exon 2 IVS 1314 2157 carcinoembryonic antigen hsCHM1, intron A IVS 2518 > 2690 carcinoembryonic antigen hsCHM1, intron B BASE COUNT 823 a 637 c 653 g 577 t ORIGIN Chromosome 19q31.1-q31.3. 1 gaattccaca gcaataacca cgatgacaac caccatgtac tcaacacccg cctgggcacg 61 gggctcccac agcagctcac ttattcccaa caactctgca aggaggattt taccatcctc 121 cttttacaaa tcagggaatc aaggatcata gaagccacgt gcacttgtcc aagtcaacat 181 agttaagtga cagaaccatt agctgtcccc aggtacatct ggacataaag ttcatgttta 241 tgccactgtg tcagcatttc caaaaactga ttttaggcga aacgtaagta agctttttaa 301 aaactttaat acttatgcgt ttattttaat atacattgag aaaacattta agcacacatc 361 aaatctgtaa tttcatggac aatattgcat aagacaagga tgttttgtct ccaactcctg 421 gcctcaagcc atcctcccac catagctctc gagtagctgg gatcacagac tgagccaccg 481 atccctgcta ggacaggatg ttttgtaaac taaatttatt tagaaaaaag gatgaaatat 541 ataataataa aggtggtaca agctagagag aaaatcataa agtcagccta gaaatgtctg 601 gtgtctggat gacataaagc tacagcactg tgaagcctca ttctcagtta ctcccaggaa 661 attagagtca cataatgctg cagaaagaac agctcagaat cttagatccg gctttagccc 721 tagatatatc catttgtagg accccagaca tctctgtgac ctccttgctg ggagtaaatc 781 caaccttccc agacatgtga gaacagtaag aagaccctgc acacacaaag gagtttctcc 841 gtcacagaga aaataacacc aggttcaggg accccaggga ctctgcatgg tgctgacaga 901 cccaaggcca aggcagagca gaggtccacg ctggggaggg agggtcatcc tgttatgaaa 961 cagggatcca agtaagcctt gcttctcaga gcctggtctg ggcaactcaa atgtagacag 1021 aaggccccaa ggaagaagag aaaatgaggc aaaactgaga ggggagggga cagagaggtg 1081 acctgggcag agcttcaccc atgaccctgg aaagtgctcc tgccctggga ggaggctcag 1141 catggaaaga ggaaggacag cagagcctaa gtcacagtag ccctgactac agcattcctg 1201 gagcccaggc tcttttccac agaggaggaa agagcaggca gcagagacca tggggccccc 1261 ctcagcctct ccccacagag aatgcatccc ctggcagggg cttctgctca caggtgagtg 1321 gaggattcct gggagtgggc aagaggaggg atcacagaga atggctgggg tctcctgggg 1381 aggatggggc tctgataggg gacagaaggc ttctgctgaa gcctcagggg agagaacatc 1441 agagagggac acgggtcaca acaagacaat cacattgaac tgggattgat aagagggagg 1501 aaaatccatt gatcatgttt tccaagttaa tcattactgg ccactacaat tagaaaatga 1561 taagaataag aattacatca gggtgatact ttaaataaaa atataaccag ggcactaaaa 1621 cctgtctttg ccccaaccac aagttgcaaa ataaccacca ctccttaact catccaccag 1681 tatttgcaat caaattttag gcactggcgt acaacaaata tcagacaagt ctctgtgttc 1741 aaagagctta cactcttgca gagatgaaga tagacaccca aagagatcta gaatgtgagt 1801 tcaggtgttg acaagagccc tggagggaac agagcagaaa aaggtcagaa agggacgccc 1861 cagggtctct agaggaggtg tcaggggagg gatctcccaa ggatgccctg atgtgagcag 1921 gatctgaggg cagtggggag ggagccatgc agacccctgg ggaaggggat tccacacagg 1981 aaaatgccaa ggttagaggt gctgaagaaa gaaaggtcac gttactgacc ttaaccaagt 2041 gggacacacc tacactctca aggctgaagg gagaagagac tctctcagga cccagggccc 2101 catctttcca tcccaataca tgggtaccaa tattgactga tgctttctcc ctcctagcct 2161 cacttctaaa cttctggaac ccgcccacca ctgccaagct cactattgaa tccacgccgt 2221 tcaatgtcgc agaggggaag gaggtgcttc tacttgtcca caatctgccc caacatcttt 2281 ttggctacag ctggtacaaa ggggaaagag tggatggcaa cagtctaatt gtaggatatg 2341 taataggaac tcaacaagct accccagggg ccgcatacag cggtcgagag acaatataca 2401 ccaatgcatc cctgctgatc cacaatgtca cccagaatga cataggattc tacaccctac 2461 aagtcataaa gtcagatctt gtgaatgaag aagcaactgg acagttccat gtataccgtg 2521 agtatttcca catgacctct ggaggttggg ggtcagttct acttcccaca tatgggattg 2581 tacggcctgg gctgtgcctc tggccctctc tgcattacat tctgtatcag ggtttggaca 2641 tttagtgcag gacacacacg ggggagacaa acttccacag actagaattc // LOCUS HUMCEAC 781 bp ds-DNA PRI 04-AUG-1990 DEFINITION Human carcinoembryonic antigen (CEA) hsCGM2 gene, exon 2. ACCESSION M22434 D51537 KEYWORDS carcinoembryonic antigen. SOURCE Human fetal liver (lambda-hsCGM2-1 library) DNA, clone hsCGM2. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 781) AUTHORS Thompson,J.A., Mauch,E.-M., Chen,F.-S., Hinoda,Y., Schrewe,H., Berling,B., Barnert,S., von Kleist,S., Shively,J.E. and Zimmermann,W. TITLE Analysis of the size of the carcinoembryonic antigen (CEA) gene family: Isolation and sequencing of N-terminal domain exons JOURNAL Biochem. Biophys. Res. Commun. 158, 996-1004 (1989) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence [1] kindly submitted by J.A. Thompson 07-FEB-1989. FEATURES from to/span description pept / 221 / 583 carcinoembryonic antigen hsCGM2, exon 2 (AA at 223) /nomgen="CEA" /map="19q13.1-q13.3" /hgml_locus_uid="LN0164S" IVS < 1 220 carcinoembryonic antigen, intron A IVS 584 > 781 carcinoembryonic antigen, intron B BASE COUNT 220 a 195 c 184 g 182 t ORIGIN Chromosome 19q31.1-q31.3. 1 ctgcagatcc ctggggaaga ggattccgaa cagggaaatg taaggtcaga ggtgctgata 61 ggggacatgc tgctgtcatt gatccagtag gacacacaca cacacacaca cttacttcaa 121 gatgggggtg ggtgaagaga cctgctcagg atccagggcc ccatctttcc accccaatac 181 ataggtccca atattgactg atgttctctc cccctcctag cctcgctttt aaccttctgg 241 aacctgccaa acagtgccca gaccaatatt gatgtcgtgc cgttcaatgt cgcagaaggg 301 aaggaggtcc ttctagtagt ccataatgag tcccagaatc tttatggcta caactggtac 361 aaaggggaaa gggtgcatgc caactatcga attataggat atgtaaaaaa tataagtcaa 421 gaaaatgccc cagggcccgc acacaacggt cgagagacaa tataccccaa tggaaccctg 481 ctgatccaga acgtcaccca caatgacgca ggattctata ccctacacgt tataaaagaa 541 aatcttgtga atgaagaagt aaccagacaa ttctacgtat tctgtgagtg atacctccat 601 gacttctggg tgctgggggc cagttctact tcatacacac ggggttgtca ggcctgggtt 661 gtgcctgtgt ccccatctac attttatcca gtgttggagt ttgggcattt agtgaaggac 721 acacatgggg gagacaaact tctacagacc agaatccctt tcctgcatcc agaccctgca 781 g // LOCUS HUMPSBGA 1042 bp ds-DNA PRI 04-AUG-1990 DEFINITION Human pregnancy-specific B-1-glycoprotein gene, exon 2. ACCESSION M22437 D51537 KEYWORDS glycoprotein; pregnancy-specific B-1 glycoprotein. SOURCE Human fetal liver (lambda-PSBG-1 library) DNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 1042) AUTHORS Thompson,J.A., Mauch,E.-M., Chen,F.-S., Hinoda,Y., Schrewe,H., Berling,B., Barnert,S., von Kleist,S., Shively,J.E. and Zimmermann,W. TITLE Analysis of the size of the carcinoembryonic antigen (CEA) gene family: Isolation and sequencing of N-terminal domain exons JOURNAL Biochem. Biophys. Res. Commun. 158, 996-1004 (1989) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence [1] kindly submitted by J.A. Thompson 07-FEB-1989. FEATURES from to/span description pept / 497 / 862 pregnancy-specific B-1 glycoprotein, exon 2 (AA at 499) /nomgen="B1G1" /map="6" /hgml_locus_uid="LG0073P" IVS < 1 496 pregnancy-specific B-1 glycoprotein, intron A IVS 863 > 1042 pregnancy-specific B-1 glycoprotein, intron B BASE COUNT 289 a 241 c 274 g 238 t ORIGIN Chromosome 6. 1 gagctcacgc tgtcatgggg gggaagacag acatgcaaag agatctagaa tgtgaggtca 61 ggtgttgaca agaaccctgg agggagcaga gcagggaaag gtcagaaagg gaagacccag 121 ggtctctgaa gcaggcatca ggaaagaagt ctaacgatgc cctgatgtaa gcaggacctg 181 agggcattgt ggagggggcc atgcggaccc ctggggaaga ggattccaaa cagaaaaatg 241 ccaacgtcag aagtgttgaa ggaatgggcg tcatgctgct gaccttgacc tagtaggaca 301 gtaggacaca cacacataca cacacacaat cacacatgcc gcttttgtgt gtgtgtgtgt 361 gtgtgtgttt gtatgtgtgt gtgtgtctgt gttgtcaagg ctgaggactg aagagacctt 421 ctcaggaccc agggccccat gttttcacac caatacatag gtctcaatat tgactgatgt 481 tctctccacc tcctagcatc acttttaaac ttctggaacc tgcccaccac tgcccaagtc 541 acgattgaag ccgagccaac caaagtttcc gaggggaagg atgttcttct acttgtccac 601 aatttgcccc agaatcttac cggctacatc tggtacaaag ggcaaatgag ggacctctac 661 cattacatta catcatatgt agtagacggt gaaataatta tatatgggcc tgcatatagt 721 ggacgagaaa cagcatattc caatgcatcc ctgctgatcc agaatgtcac ccgggaggac 781 gcaggatcct acaccttaca catcataaag ggagatgatg ggactagagg agtaactgga 841 cgtttcacct tcaccttaca ccgtaagtga ttccacatga tccctgggtg ttgggggaca 901 ggggtcactt ctacttcaca cacacaggat tgtcaggcct ggactctgcc tgtgtcactc 961 actgcgttat gtcccatgct gggatttggg catttagtgc aggacacaca cagaggagac 1021 aaatttcaac agatcagaat tc // LOCUS ASNMTATP9 225 bp ds-DNA ORG 04-AUG-1990 DEFINITION A.nidulans mitochondrion ATPase subunit 9 gene, complete cds. ACCESSION M35967 KEYWORDS ATPase. SOURCE A.nidulans mitochondrion DNA. ORGANISM Mitochondrion Aspergillus nidulans Eukaryota; Plantae; Thallobionta; Eumycota; Ascomycotina; Plectomycetes; Eurotiales; Trichocomaceae; Aspergillus nidulans. REFERENCE 1 (bases 1 to 225) AUTHORS Brown,T.A., Ray,J.A., Waring,R.B., Scazzocchio,C. and Davies,R.W. TITLE A mitochondrial reading frame which may code for a second form of ATPase subunit 9 in Aspergillus nidulans JOURNAL Curr. Genet. 8, 489-492 (1984) STANDARD simple staff_review FEATURES from to/span description pept 1 225 ATPase subunit 9 BASE COUNT 66 a 29 c 51 g 79 t ORIGIN 1 atgttacaat cagctagaat aataggaaca ggattagcta caacaggttt aatcggagca 61 ggtgtaggta tcggtgtagt atttggtgcc ttaatcttag gagttgcaag aaaccctgca 121 ttaagaggtc aactgttttc atatgcaatt ttaggatttg catttgcgga agcaactgga 181 ttatttgctt taatgatggc ttttttatta ttatatgtag cttaa // LOCUS DOGPHA2 562 bp ss-mRNA MAM 04-AUG-1990 DEFINITION Canine phospholipase A-2 mRNA, complete cds. ACCESSION M35301 KEYWORDS phospholipase A-2. SOURCE Canine pancreas, cDNA to mRNA. ORGANISM Canis familiaris Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Carnivora; Caniformia; Canoidea; Canidae. REFERENCE 1 (bases 1 to 562) AUTHORS Kerfelec,B., LaForge,K.S., Puigserver,A. and Scheele,G. TITLE Primary structures of canine pancreatic lipase and phospholipase A-2 messenger RNAs JOURNAL Pancreas 1, 430-437 (1986) STANDARD simple staff_review COMMENT Draft entry and computer-readable [or printed] sequence for [1] kindly submitted by G.Scheele, 06-MAR-1990. FEATURES from to/span description pept 29 469 phospholipase A-2 precursor sigp 29 73 phospholipase A-2 signal peptide matp 95 466 phospholipase A-2 mRNA < 1 562 phospholipase A-2 mRNA BASE COUNT 143 a 167 c 128 g 124 t ORIGIN 1 ttctcttgct cagcaacttg accacaaaat gaaattcctc gtactggctg ctctgctcac 61 agtggccgct gccgaggggg gcatcagccc gcgggcagtt tggcagttcc gcaacatgat 121 caagtgcacg atccccgaga gtgacccctt gaaggactac aacgactacg gctgctactg 181 tggcctgggt ggatccggca cccctgtgga tgagttggac aagtgttgcc agacgcatga 241 ccactgctac tcagaagcca agaaactgga cagctgtaaa ttcctcctgg acaaccccta 301 caccaaaatc tactcatact catgctccgg ctctgagatc acctgcagca gcaaaaacaa 361 agactgccag gcctttatct gcaactgtga ccgcagtgct gccatctgct tttcaaaggc 421 tccgtataac aaggagcaca agaacctgga cactaagaag tactgctaga attgagtctc 481 ctctctggaa agcatcattc ctacatgcct cgtggccttc accctaccct gtgtcctcca 541 ataaagcact ttgttgaaag gc // LOCUS DOGPLIP 1493 bp ss-mRNA MAM 04-AUG-1990 DEFINITION Canine lipase mRNA, complete cds. ACCESSION M35302 KEYWORDS lipase. SOURCE Canine pancreas, cDNA to mRNA. ORGANISM Canis familiaris Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Carnivora; Caniformia; Canoidea; Canidae. REFERENCE 1 (bases 1 to 1493) AUTHORS Kerfelec,B., LaForge,K.S., Puigserver,A. and Scheele,G. TITLE Primary structures of canine pancreatic lipase and phospholipase A-2 messenger RNAs JOURNAL Pancreas 1, 430-437 (1986) STANDARD simple staff_review COMMENT Draft entry and computer-readable [or printed] sequence for [1] kindly submitted by G.Scheele, 06-MAR-1990. FEATURES from to/span description pept 35 1438 lipase precursor sigp 35 85 lipase signal peptide matp 86 1435 lipase mRNA < 1 1493 lipase mRNA BASE COUNT 410 a 377 c 378 g 328 t ORIGIN 1 cgaaagggag ggaaggccca gaacatccca caggatggtg agcatctgga caatcgcgct 61 gtttctgctg ggagcagcca aagcaaagga agtttgctat gaacaaatcg gatgcttttc 121 tgacgcggag ccctgggccg ggacagcaat caggcctctg aaagttctcc cctggagccc 181 cgagagaatc ggcacccgct tcctgctcta caccaacaag aacccaaaca actttcaaac 241 tctccttccc tctgatccat caacgattga ggcatcaaat tttcaaacag acaagaagac 301 ccggttcacc atccatggct tcataaacaa gggagaagag aactggctgc tggatatgtg 361 caagaacatg ttcaaggttg aggaggtgaa ctgcatctgc gtggactgga agaaaggttc 421 ccaaacctca tatacacagg ctgccaacaa cgtgcgggtg gtgggcgccc aggtggccca 481 gatgctcagc atgctctcgg caaactacag ctactcacct tcccaagtcc agctcatcgg 541 ccacagccta ggagcccacg tggctgggga ggcagggagc aggactccag gtctgggcag 601 gattacaggg ttggatcctg tagaagcaag tttccagggc actcctgaag aggttcgact 661 tgatcccact gatgctgact ttgttgatgt gattcacaca gatgcagctc ccctgatccc 721 attcctgggt tttggaacaa gccaacagat gggtcacctt gacttcttcc ccaatggtgg 781 agaggaaatg ccaggatgca agaagaatgc cctgtcacag atcgtgaacc tagatggcat 841 ttgggaagga actcgggact tcgtggcttg caatcacctg agaagttaca agtattactc 901 agagagcatc ctcaaccctg atggatttgc ttcctacccc tgtgcttcct acagggcctt 961 tgaatctaac aagtgcttcc cctgcccaga tcaagggtgc ccacagatgg gtcactatgc 1021 tgataaattt gctgtcaaga caagtgatga gacacagaaa tacttcctga acaccggaga 1081 ttccagcaat tttgctcgct ggagatacgg ggtttctata acattgtctg ggaaaagagc 1141 cactggtcag gctaaagttg ctttgtttgg aagtaaggga aatactcatc aattcaatat 1201 cttcaagggg attctcaaac caggctctac tcattccaat gagtttgatg caaagcttga 1261 tgttggaaca attgagaaag tcaagtttct ttggaataac aacgtggtaa acccaacctt 1321 tcccaaagtg ggtgcagcca agatcaccgt gcaaaaggga gaggagaaaa cagtgcacag 1381 cttctgcagc gaaagcaccg tgagggagga tgttctgctc accctcacgc cctgttagcg 1441 tccagcgcaa cccgggcacc gtgttaacag caataaaaac cactgatgca ttt // LOCUS HUMPGDZ1 1106 bp ss-mRNA PRI 04-AUG-1990 DEFINITION Human MHC class II lymphocyte antigen (HLA-DNA) gene, complete cds. ACCESSION M31525 M35125 KEYWORDS cell surface glycoprotein; class II gene; lymphocyte antigen; major histocompatibility complex. SOURCE Human B-cell cell line PGF, cDNA to mRNA, clone PGDZ1. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 1106) AUTHORS Young,J.A.T. and Trowsdale,J. TITLE The HLA-DNA (DZA) gene is correctly expressed as a 1.1kb mature mRNA transcript JOURNAL Immunogenetics 31, 386-388 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by J.A.T.Young, 22-JAN-1990. FEATURES from to/span description pept 77 829 MHC HLA-DNA precursor /hgml_locus_uid="LV0117X" /nomgen="HLA-DNA" /map="6p21.3" sigp 77 151 MHC HLA-DNA signal peptide matp 152 826 MHC HLA-DNA mRNA 1 1106 MHC HLA-DNA mRNA BASE COUNT 204 a 354 c 289 g 259 t ORIGIN 1 cttcttcttt acctccgcct tgttcctgtc ctcaccacac ggactgagac tgatttgatt 61 aaagcaccag agtgtaatgg ccctcagagc agggctggtc ctggggttcc acaccctgat 121 gaccctcctg agcccgcagg aggcaggggc caccaaggct gaccacatgg gctcctacgg 181 acccgccttc taccagtctt acggcgcctc gggccagttc acccatgaat ttgatgagga 241 acagctgttc tctgtggacc tgaagaaaag cgaggccgtg tggcgtctgc ctgagtttgg 301 tgactttgcc cgctttgacc cgcagggcgg gctggccggc atcgccgcaa tcaaagccca 361 tctggacatc ctggtggagc gctccaaccg cagcagagcc atcaacgtgc ctccacgggt 421 gaccgtgctc cccaagtctc gggtggagct gggccagccc aacatcctca tctgcatcgt 481 ggacaacatc ttcccccctg tgatcaatat cacctggctg cgcaacggcc aaactgtcac 541 tgagggagtg gcccagacca gcttctattc ccagcctgac catttgttcc gcaagttcca 601 ctacctgccc ttcgtgccct cagccgagga cgtctatgac tgccaggtgg agcactgggg 661 cctggatgcg ccactcctca ggcattggga gctccaggtg cctattccac caccagatgc 721 catggagacc ctggtctgtg ccctgggcct ggccatcggc ctggtgggct tcctcgtggg 781 caccgtcctc atcatcatgg gcacatatgt gtccagtgtc cccaggtaat gatccttctg 841 agagaaatga cttgtgggag acaccctgca gatcctcatg ggtttgtgac agcccctgcg 901 tgctcagtgc cctttaagtg catcccgctg tgctgacttt gagtgggatc aacatctgtc 961 ctacgggtcc cctctttttt ggccccagta ttcatggcag ggtttgttgg acacctacta 1021 gcttcccttc ccattcaaca cacacacaca ttcttgctct acccaaagct ctggctggca 1081 gcactaaatg ctttggtggt gtttgc // LOCUS LACPEPXP 3963 bp ds-DNA BCT 04-AUG-1990 DEFINITION L.lactis X-prolyl dipeptidyl aminopeptidase (pepXP) gene, complete cds. ACCESSION M35865 KEYWORDS X-prolyl dipeptidyl aminopeptidase. SOURCE L.lactis lactis (strain NCDO 763, ML3) DNA. ORGANISM Lactococcus lactis Prokaryota; Bacteria; Firmicutes; Regular asporogenous rods; Lactobacillaceae. REFERENCE 1 (bases 1 to 3963) AUTHORS Nardi,M., Chopin,M.-C., Chopin,A., Cals,M.-M. and Gripon,J.-C. TITLE Cloning and DNA sequence analysis of an X-prolyl dipeptidyl aminopeptidase determinant from Lactococcus lactis subsp. lactis NCDO 763 JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by A.Chopin, 02-JUL-1990. Author address: A.Chopin Institut National de la Recherche Agronomique Lab de G/032n/032tique Microbplenne Institut de Biotechnologie CRJ 78352 Jouy-en-Josas Cedex FRANCE FEATURES from to/span description pept 1105 3396 X-prolyl dipeptidyl aminopeptidase (pepXP) (EC 3.4.14.5) pept 933 64 (c) ORF1 (put.) pept 3398 3883 ORF3 (put.) BASE COUNT 1261 a 748 c 733 g 1221 t ORIGIN 1 actagtcatt gcacttatgg tcaaataaaa aaattactga caactctgtc agtaattttt 61 ttattaaaga taaatcattt taaataatgc aacggctgca agtgatgcaa gaattggtgc 121 caaaactgga acccatgcat accaccattt agatgaacct ttagcttcac ccaaaactga 181 ttttggtaag agactgtgaa ccaaacgtgg accaaagtca cgcgccggat tcaaaccagg 241 acctgtaggt ccaccaagtg caactacgag tcccataaca aggaatccta ggaacaagtg 301 agcaatcatt ttagaagctg aagctcctga tgcttgaacc caaatttgat tcataacgtc 361 tgatgaagaa acatctgctc cttgaccttt caagtagttt gtcatccaag taattgattg 421 actaccaaag aagatatttg tcgcagcaac cgcaccgaag aataatacaa atgaacctaa 481 gaactcattc aagaaaccat taatggttgc ccccaaacga gttttttcac tgttatcatc 541 aacgttgtcg atagttgaga aagttccaag aatagcattt ggattttgtg ttttaagata 601 gtatggacga taaaccataa cgattaataa ttgtccaaac atcgcaccta atacttgcgc 661 aataatgtat tgagcaacgt gtgcccaagg aaaaagtcct gaggctgcaa gtccaagtgt 721 gaaagctggg ttgatttgtg aagtaatatt accaaaagca actgctggca acattacgcc 781 aagtccgtaa ccccaaccaa taatcatcca agattgtgcg tgagcttttg tacctttaag 841 ttcaacgtta gcaaccgcac cattacccat aataatgagt agggcagttc caacaaattc 901 agtgatatat ttcactgtcc atgtaacatc cattttttat ggaaccctcc aaaatttgct 961 tttaatataa taacctcagc tattatacca gaaaagcacg tttgcttcaa atattcattg 1021 aaaatagcgc gcctttagtc tgattaatca ctgaaaattt gctataatta aaagtattat 1081 catgtttatt acggaggatt taaaatgcgc tttaaccatt tttcaattgt tgacaaaaat 1141 tttgatgaac aattagcaga acttgatcag cttggttttc gttggtctgt tttttgggat 1201 gaaaagaaaa tcttaaaaga ttttctcatt caaagtccta gtgatatgac tgctctacaa 1261 gcaactgctg aattagatgt cattgaattt ttaaaatcct ctatagaatt agattgggaa 1321 attttctgga atattgcttt gcaacttctt gattttgtac caaactttga ttttgagatt 1381 ggcaaagctt ttgagtatgc aaaaaatagc aacctccctc aaatcgaagc agaaatgacc 1441 acagaaaata tcatttctgc tttctactac ctcctttgta cgcgtcgaaa aactggaatg 1501 attttagttg aacattgggt atcagaggga cttcttcctt tagataatca ttatcatttt 1561 ttcaatgaca aatcgttggc aacctttgat tcttcacttc ttgaacgtga agtcctttgg 1621 gttgaaagtc cagtcgatag tgaacaacgg ggtgaaaatg atttgataaa aattcaaatc 1681 attcgtccaa aaagtaccga aaaacttcca gtggtaatga cggcgagtcc ttatcattta 1741 gggattaatg ataaagcaaa tgatttagcc ttacacgata tgaatgtaga acttgaagaa 1801 aaaaccagtc acgaaattca tgttgaacaa aagctcccgc aaaaattatc agcgaaagca 1861 aaggaacttc cgattgttga caaagctcct tatcgtttta ctcatggttg gacttattct 1921 ttaaatgact atttcttgac tagaggattt gcctctatct atgtggccgg tgttggaacc 1981 cgttcttctg acggctttca aacctcaggc gattaccaac aaatttatag tatgacagct 2041 gtcattgact ggttgaatgg tcgagctcgt gcttatactt ctcgtaaaaa gacacatgaa 2101 atcaaagcct cttgggcaaa tggtaaagtg gctatgactg gaaaatctta tctgggaacc 2161 atggcttatg gagccgctac tactggagtt gaaggtctag agcttatctt agccgaagct 2221 ggaatttctt cttggtataa ttattaccgt gaaaatggcc ttgtccgctc tcctggcggt 2281 tttcctggtg aagatctaga tgttcttgct gcgctcactt attcacgtaa tcttgatggt 2341 gctgatttct taaaaggtaa tgctgaatac gaaaaacgat tggcagaaat gactgctgct 2401 ttagaccgca aatctggtga ttacaatcaa ttttggcatg accgaaatta tctgataaat 2461 actgacaaag ttaaggctga tgtcttaatt gttcatggac tacaagattg gaatgtgaca 2521 ccagagcaag cctataattt ctggaaagct ctgccagaag gtcatgctaa acatgccttt 2581 ttacaccgtg gggctcatat ttatatgaat tcttggcagt ccattgattt ttctgaaacg 2641 attaatgctt actttgtcgc aaaattactg gatagagatt taaatttaaa tctcccacct 2701 gttattttac aggaaaattc taaagaccaa gtatggacga tgatgaatga ttttggtgcc 2761 aatacccaaa tcaaactccc tcttggtaaa acggccgttt cttttgctca attcgacaat 2821 aattatgacg atgagacttt caaaaagtat tctaaagatt tcaatgtctt taaaaaagac 2881 ttatttgaaa acaaagccaa tgaagctgtc attgatttag aacttccgtc aatgctgaca 2941 attaatggtc cagttgagct tgaactcaga ctaaaattaa atgacacgaa aggcttctta 3001 tctgctcaaa ttcttgattt tggtcagaaa aaacgtttgg aagataaagt cagagtgaaa 3061 gattttaaag tgcttgaccg tgggcgaaat ttcatgttag acgacttagt tgaactccca 3121 cttgttgaga gcccttatca gttagtcact aaaggcttta ctaatctcca aaatcaaagt 3181 ttactgacag tcagtgattt aaaggctgac gaatggttta caataaaatt tgaactacag 3241 ccaaccattt atcatttaga aaaagctgac aagcttcggg tcattctcta tagtactgac 3301 tttgaacata cggttcgtga taatcgtaaa gtgacttacg agattgattt atctcaatct 3361 aaactcatta ttcctattga aagtgtgaaa aattaatatg gcaattattg gaattcttgg 3421 aactccttac aacacagttg aacgttctcc tttttggtgg aacaaagtaa gctatacacg 3481 tcaatctttt attgatgttt tccaggaatt aggtcacact gtgattgttc ttcctgttga 3541 caaaaccgaa aatataaaaa attatctgcc attggtcgat aaaattgtac tgactggtgg 3601 cgttgatgtc agtccttatc tctatggcga ggaacctcat gctcaattag gaacgactga 3661 cccaattcgt gaccgttttg agcttgccgc aattaaagct gctctagaag cgaataaacc 3721 tattcttggt gtatgtcgtg gcctacaatt gctcaatgtt tattttggag gaactcttta 3781 tcaagacttg agcctaactt caagtcagat aaagcacctc caaagtccaa ctccccaaga 3841 agttccgaca catcacatta gcgtcgaaaa agaagttctt tagatttttt acctgaaaat 3901 tacatggtca attctttcca ccatcaagtc attaaggatt tgggtcaggg cttgcaggcc 3961 att // LOCUS PIGA1AG 668 bp ss-mRNA MAM 04-AUG-1990 DEFINITION Swine alpha-1 acid glycoprotein mRNA, 3' end. ACCESSION M35990 KEYWORDS alpha-1 acid glycoprotein. SOURCE Swine, cDNA to mRNA. ORGANISM Sus scrofa Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Artiodactyla; Suiformes; Suidae. REFERENCE 1 (bases 1 to 668) AUTHORS Stone,R.T. and Maurer,R.A. TITLE Cloning and developmental regulation of alpha-1 acid glycoprotein in swine JOURNAL Dev. Genet. 8, 295-304 (1987) STANDARD simple staff_review FEATURES from to/span description pept < 1 552 alpha-1 acid glycoprotein (AA at 1) mRNA < 1 668 alpha-1 acid glycoprotein mRNA BASE COUNT 181 a 186 c 170 g 131 t ORIGIN 1 ccgctgtgcg ccaacttgac ggcagtgccc atcaccaatg ccaccttgga cttgatctcc 61 ggcaagtggt attatatcgg ctcggccttc cgaaaccccc agtacaatga gtcggccaga 121 tccatccagg cagccttctt tttctttgat cccaagcccg cggaggacaa gataaacctc 181 agagagtacc agaccatagg gaaccagtgc atctataacg acagctccct gaaggtccat 241 cgggagaatg gaagcctatc gaaacacgag atgggcagag aacacgttgc tgacctgctg 301 ttaaccaagg tccccaagac cttcatgctc atcaattcct tgcacgataa gaacaacgtg 361 gggctctcct tctacgccga caaggcggag gtgactccgg agcagatgaa agagttccat 421 gatgctattg agtgcacggg aatccacaag tcagaaatca catacaccga cgagaaaaag 481 gatctgtgtg ggccgctgga gaagcagcat gaggaggaaa ggaagaagga gaaggagaag 541 gaggggtcct agcattgggt caggtccttg ggcaccttgg aggcccatcc tcagcacctc 601 ccacccaccc tttgcacctc aattctctcc ctcagttgca tcaataaaac ttcgcatttg 661 gtacaggc // LOCUS RATMTNDI 946 bp ds-DNA ORG 04-AUG-1990 DEFINITION Rat NADH-dehydrogenase (NDI) gene, complete cds. ACCESSION M35826 KEYWORDS NADH-dehydrogenase. SOURCE Rat mitochondrion DNA. ORGANISM Mitochondrion Rattus norvegicus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae; Rattus norvegicus. REFERENCE 1 (bases 1 to 946) AUTHORS Quagliariello,C. and Spena,A. TITLE Sequenza nucleotidica del gene NDI del DNA mitocondriale di fegato di ratto JOURNAL Boll. Soc. Ital. Biol. Sper. 64, 211-218 (1988) STANDARD simple staff_review FEATURES from to/span description pept 1 946 NADH-dehydrogenase (NDI) (att start codon) BASE COUNT 294 a 299 c 103 g 250 t ORIGIN 1 attaatatcc taacactcct aatcccaatc ttaattgcca taggccttct caccctagta 61 gaacggaaaa tcctaggcta catacaatta cgcaaaggcc ccaacaacga aggcccatat 121 ggtaaactac aaccatttgc agatgccata aaactattca taaaagaacc catacgccct 181 ctaaccacct caatatcact atttattatc gccccaaccc tctcccttac actagctcta 241 agcctatgaa ttcccttacc aataccccac ccccttatca acctcaacct aggcatacca 301 tttattctag ccacatccag cctttcggtc tactccattc tatgatcggg atgagcctca 361 aattcaaaat actccctatt cggagcccta cgagccgttg cccaaaccat ctcttacgaa 421 gtcacaatag ccttatacct cttatccgtc ctcctaataa gcggctcctt ctccctacaa 481 atacttatca ctacacaaga acatatctga ctattaatcc ccgcctgacc aatagccata 541 atatgataca tttcaaccct cgcagaaaca aatcgagctc ccttcgactt aacagaagga 601 gaatcagaat tagtctcagg ctttaacgtc gaatacgccg caggaccatt cgccctattc 661 ttcatagccg agtacaccaa cattattcta ataaacgccc taacatcaat tgtattccta 721 ggccccttat atcatatcaa ttaccctgaa ttatactcaa ccagcttcat aacagaaaca 781 ctacttctat ccacaacttt cctatgaatt cgagcatcct acccccgttt tcgatatgac 841 caactaatgc acctcctatg aaaaaatttc ctcccactaa cactagcatt ctgcatatga 901 tacatttccc tgccaatttt cctagcagga attccaccct acacat // LOCUS RICPETB 1682 bp ds-DNA PLN 04-AUG-1990 DEFINITION Rice chloroplast apocytochrome b6 (petB) gene, complete cds. ACCESSION M35995 KEYWORDS apocytochrome b6. SOURCE Rice (L. cv. Labelle) chloroplast DNA. ORGANISM Oryza sativa Eukaryota; Plantae; Embryobionta; Magnoliophyta; Liliopsida; Commelinidae; Cyperales; Poaceae. REFERENCE 1 (bases 1 to 1682) AUTHORS Cote,J.-C., Wu,N.-H. and Wu,R. TITLE Nucleotide sequence of the rice chloroplast apocytochrome b6 gene (petB) JOURNAL Plant Mol. Biol. 11, 873-874 (1988) STANDARD simple staff_review COMMENT Authors indicate two potential petB products for the given sequence. The first product contains an intron, and has its initiation codon at position 67. The second product has no intron, and is read straight through from position 829. FEATURES from to/span description pept 67 72 apocytochrome b6, exon 1 (alt.) 886 1527 apocytochrome b6, exon 2 (alt.) pept 829 1527 apocytochrome b6 (alt.) IVS 73 885 apocytochrome b6 intron A (put.) (no splice consensus) BASE COUNT 499 a 290 c 318 g 575 t ORIGIN 1 gctctacatt tctagacatt ctggtagttc gaccgtggaa ttttttggtt tcggtatctc 61 tggaatatga gtgtgtgact tgttagaatt tgctcctatt gataatacat agaaagcacc 121 tgttatctct atcaagatga ttctaattcg tcggatatta tttattctag tatctggaac 181 acgaaataga tagagtggat caagaaaaaa aaatgaaact atgattcata ttaactattc 241 agacctcgca accagactga aaaaaattca agtagttctt aataaaaata aaaaaagaaa 301 atttcttcct tccaattttg tttgcccaaa aaacaacttt ttttctctcg attttgtcga 361 gtcattacac cgattcaata aatgatcatc aagcggttct tattcgaaga acccttgcct 421 tttgtttagc ttgagactca atcatcgtgg ctctagtatg aatctaaggt tttaattgaa 481 ctgattcata ggatcgcaac aagataattt ctaccagaaa actactccaa tttttgcttt 541 atttatttat ctagtaaaac aagagtaaat ctgcattacg cacaaaaaaa agaaatccaa 601 aatagggaag agaaaaatca agaggcctct aatgatcaac atttgggaaa gaaagataga 661 cgagccaact tgagattttt tggcattatc atcacaaaga agaaattctg gatttttctt 721 atttcatatc ttcaaggcaa atcgacccaa cccagtggct gatgaagttt tgaacctttt 781 ttctaatatc cgttgaaaat ttgtgtgttt ctgtttgagc cgtacgagat gaaattctca 841 tatacggttc tcggaggggg gttcgggtta gttacctatc tcaataaagt atatgattgg 901 tttgaggaac gtcttgagat tcaggcaatt gcagatgata taactagtaa atatgttcct 961 cctcatgtca acatatttta ttgtttaggg ggaattacac ttacttgttt tctagtacaa 1021 gttgctaccg gttttgctat gactttttac tatcgcccaa ccgttacaga ggctttttcc 1081 tcggttcaat acataatgac cgaggccaac tttggttggt taatccgatc agttcatcga 1141 tggtcagcaa gtatgatggt tctaatgatg atcctgcacg tatttcgtgt gtatctcaca 1201 ggtgggttta aaaaaccccg cgaattaact tgggtcactg gtgtggtttt agctgtatta 1261 actgcatcgt ttggtgtaac tggttattct ttaccttggg atcaaattgg ttattgggca 1321 gtcaaaattg tgacaggtgt gcctgacgcg attccggtaa taggatcacc tttagtggag 1381 ttattacgtg gaagtgctag tgtgggtcaa tccactttga ctcgttttta tagtttacat 1441 acctttgtac ttcctctgct tactgccgta tttatgttaa tgcactttct aatgatacgt 1501 aagcaaggta tttcgggccc tttataagga aggcatctca tagagagttc taattctcat 1561 atatcatatc gggtaggttg tggtatttca ttgctacaaa catgggttat tctaaaataa 1621 gacatgtcat ttagatactt ctcttcaact ccgaactatt gtgatacaaa taatacaaat 1681 ag // LOCUS VAZTHYK 1524 bp ds-DNA VRL 04-AUG-1990 DEFINITION Varicella-zoster virus thymidine kinase gene, complete cds. ACCESSION M36160 KEYWORDS thymidine kinase. SOURCE Varicella-zoster virus (strain Oka vaccine) DNA. ORGANISM Varicella-zoster virus Viridae; ds-DNA enveloped viruses; Herpesviridae; Alphaherpesvirinae. REFERENCE 1 (bases 1 to 1524) AUTHORS Mori,H., Shiraki,K., Kato,T., Hayakawa,Y., Yamanishi,K. and Takahashi,M. TITLE Molecular analysis of the thymidine kinase gene of thymidine kinase-deficient mutants of Varicella-zoster virus JOURNAL Intervirology 29, 301-310 (1988) STANDARD simple staff_review FEATURES from to/span description pept 420 1445 thymidine kinase BASE COUNT 434 a 354 c 327 g 408 t 1 others ORIGIN 1 ctggcgcata ccctcgcaaa actggtgata cttagtaggg gtatgtatat tagcgctaaa 61 acggcaagat tttaattcca ctataaaaca aacggtcttt ccggcaccac tggattccgt 121 ttgtataata caaacacaat cggggcgtcg gcgtcccaaa tttacttcaa acgacattga 181 tatgcgtaca gccctttgaa catccacgtg ggataacggc gacaggagtt ttgccagcct 241 cgggttgaac gcgtccgcga aacctcgacg tacgttatca atatcctttt tgagtacatc 301 gtaaaaacga gtgtggcaac gttgtcccaa acgaaaacac ttggcccgaa ttcgactagc 361 ggacatattt gaagttccgt cccagaagat aacctaagac gcgtttgtct acaataaaca 421 tgtcaacgga taaaaccgat gtaaaaatgg gcgttttgcg tatttatttg gacggggcgt 481 atggaattgg aaaaacaacc gccgccgaag aatttttaca ccactttgca ataacaccaa 541 accggatctt actcattggg gagcccctgt cgtattggcg taaccttgca ggggaggacg 601 ctatttgcgg aatttacgga acacaaactc gccgtcttaa tggagacgtt tcgcctgaag 661 acgcacaacg cctcacggct cattttcaga gcctgttctg ttctccgcat gcaattatgc 721 atgcgaaaat ctcggcattg atggacacaa gtacatcgga tctcgtacaa gtaaataagg 781 agccgtataa aattatgtta tccgaccgac acccaatcgc ctcaactata tgttttccct 841 tgtccagata cttagtggga gatatgtccc cagcggcgct tcctgggtta ttgtttacgc 901 ttcccgctga accccccggg accaacttgg tagtttgtac cgtttcactc cccagtcatt 961 tatccagagt aagcaaacgg gccagaccgg gagaaacggt taatctgccg tttgttatgg 1021 ttctgagaaa tgtatatata atgcttatta atacaattat atttcttaaa actaacaact 1081 ggcacgcggg ctggaacaca ctgtcatttt gtaatgatgt atttaaacag aaattacaaa 1141 aatccgagtg tataaaacta cgcgaagtac ctgggattga agacacgtta ttcgccgtnc 1201 ttaaacttcc ggagctttgc ggagagtttg gaaatattct gccgttatgg gcatggggaa 1261 tggagaccct ttcaaactgc ttacgaagca tgtctccgtt cgtattatcg ttagaacaga 1321 caccccagca tgcggcacaa gaactaaaaa ctctgctacc ccagatgacc ccggcaaaca 1381 tgtcctccgg tgcatggaat atattgaaag agcttgttaa tgccgttcag gacaacactt 1441 cctaaatata cctagtattt acgtatgtac cagtaaaaag atgatacaca ttgtcatact 1501 cgcgtgtacg tgtttttctt tttt // LOCUS HUMCAMA 2175 bp ss-mRNA PRI 04-AUG-1990 DEFINITION Human calmodulin mRNA, complete cds. ACCESSION J04046 KEYWORDS calmodulin. SOURCE Human teratoma cell line, cDNA to mRNA, clone lambda-ht6. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 2175) AUTHORS Fischer,R., Koller,M., Flura,M., Mathews,S., Strehler-Page,M.-A., Krebs,J., Penniston,J.T., Carafoli,E. and Strehler,E.E. TITLE Multiple divergent mRNAs code for a single human calmodulin JOURNAL J. Biol. Chem. 263, 17055-17062 (1988) STANDARD full staff_entry REFERENCE 2 (bases 1 to 2175; revises [1]) AUTHORS Fischer,R., Koller,M., Flura,M., Mathews,S., Strehler-Page,M.-A., Krebs,J., Penniston,J.T., Carafoli,E. and Strehler,E.E. JOURNAL Unpublished (1990) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly provided by R.Fischer, 28-JUL-1988, and [2] 18-JUL-1990. [2] Author address: M.Koller Biochemistry Dept. Swiss Federal Inst. of Technology Universitaetsstr. 16 Zurich, CH biochemie@czheth5a FEATURES from to/span description pept 104 553 calmodulin mRNA 6 2175 CALM mRNA BASE COUNT 430 a 626 c 592 g 527 t ORIGIN 1 tgagtgtgga ggcgcggacg cgcggcggag ctggaactgc tgcagctgct gccgccgccg 61 gaggaacctt gatccccgtg ctccggacac cccgggcctc gccatggctg accagctgac 121 tgaggagcag attgcagagt tcaaggaggc cttctccctc tttgacaagg atggagatgg 181 cactatcacc accaaggagt tggggacagt gatgagatcc ctgggacaga accccactga 241 agcagagctg caggatatga tcaatgaggt ggatgcagat gggaacggga ccattgactt 301 cccggagttc ctgaccatga tggccagaaa gatgaaggac acagacagtg aggaggagat 361 ccgagaggcg ttccgtgtct ttgacaagga tgggaatggc tacatcagcg ccgcagagct 421 gcgtcacgta atgacgaacc tgggggagaa gctgaccgat gaggaggtgg atgagatgat 481 cagggaggct gacatcgatg gagatggcca ggtcaattat gaagagtttg tacagatgat 541 gactgcaaag tgaaggcccc ccgggcagct ggcgatgccc gttctcttga tctctctctt 601 ctcgcgcgcg cactctctct tcaacactcc cctgcgtacc ccggttctag caaacaccaa 661 ttgattgact gagaatctga taaagcaaca aaagatttgt cccaagctgc atgattgctc 721 tttctccttc ttccctgagt ctctctccat gcccctcatc tcttcctttt gccctcgcct 781 cttccatcca cgtcttccaa ggcctgatgc attcataagt tgaagccctc cccagatccc 841 cttggagcct ctgccctcct ccagcccgga tggctctcct tcattttggt ttgtttcctc 901 ttgtttgtca tcttattttg ggtgctgggg tggctgccag cctgtcccgg gacctgctgg 961 gagggacaag aggccctccc caggcagaag agcatgccct ttgccgttgc atgcaaccag 1021 ccctgtgatt ccacgtgcag atcccagcag cctgttgggg caggggtgcc aagagaggca 1081 ttccagaagg actgaggggg cgttgaggaa ttgtggcgtt gactggatgt ggcccaggac 1141 tgggtcgagg gggccaactc acagaagggg actgacagtg ggcaacactc acatcccact 1201 ggctgctgtt ctgaaaccat ctgattggct ttctgaggtt tggctgggtg gggactgctc 1261 atttggccac tctgcagatt ggacttgccc gcgttcctga agcgctctcg agctgttctg 1321 taaatacctg gtgctaacat cccatgccgc tccctcctca cgatgcaccc accgccctga 1381 gggcccgtcc taggaatgga tgtggggatg gtcgctttgt aatgtgctgg ttctcttttt 1441 ttttctttcc cctctatggc ccttaagact ttcattttgt tcagaaccat gctgggctag 1501 ctaaagggtg gggagaggga agatgggccc caccagctct caagagaaac gcacctgcaa 1561 taaaacagtc ttgtcggcca gctgcccagg gacggcagct acagcagcct ctgcgtcctg 1621 gtccgccagc acctcccgct tctccgtggt gacttggcgc cgcttcctca catctgtgct 1681 ccgtgccctc ttccctgcct cttccctcgc ccacctgcct gcccccatac tccccagcgg 1741 agagcatgat ccgtgccctt gcttctgact ttcgcctctg ggacaagtaa gtcaatgtgg 1801 gcagttcagt cgtctgggtt ttttcccctt ttctgttcat ttcatctggc tccccccacc 1861 acctccccac cccacccccc accccctgct tcccctcact gcccaggtcg atcaagtggc 1921 ttttcctggg acctgcccag ctttgagaat ctcttctcat ccaccctctg gcacccagcc 1981 tctgagggaa ggagggatgg ggcatagtgg gagacccagc caagagctga gggtaagggc 2041 aggtaggcgt gaggctgtgg acattttcgg aatgttttgg ttttgttttt tttaaaccgg 2101 gcaatattgt gttcagttca agctgtgaag aaaaatatat atcaatgttt tccaataaaa 2161 tacagtgact acctg // LOCUS BMEGDH1 2834 bp ds-DNA BCT 04-AUG-1990 DEFINITION B. megaterium glucose dehydrogenase gene and ORFs. ACCESSION D90043 KEYWORDS glucose dehydrogenase. SOURCE Bacillus megaterium (strain IAM1030) DNA. ORGANISM Bacillus megaterium Prokaryota; Bacteria; Firmicutes; Endospore-forming rods and cocci; Bacillaceae. REFERENCE 1 (bases 1 to 2834) AUTHORS Mitamura,T., Ebora,R.V., Nakai,T., Makino,Y., Negoro,S., Urabe,I. and Okada,H. TITLE Active and silent isozyme genes of glucose dehydrogenase from Bacillus megaterium IAM1030 JOURNAL J. Bacteriol. (1990) In press STANDARD full staff_entry COMMENT These data kindly submitted in computer readable form by: Toshihide Mitamura, Osaka University 2-1 Yamada-oka Suita, Osaka 565 Japan. FEATURES from to/span description pept 1964 2749 glucose dehydrogenase (EC 1.1.1.47) ORF 181 867 ORF1 ORF 1086 1946 ORF2 signal 89 94 put. -35 region for ORF1 signal 116 121 put. -10 region for ORF1 signal 167 172 ORF2 ribosome binding site signal 893 928 termination signal signal 972 977 put. -35 region for ORF2 signal 997 1003 put. -10 region for ORF2 signal 1070 1076 ORF2 ribosome binding site signal 1949 1954 glucose dehydrogenase ribosome binding site signal 2751 2783 termination signal for glucose dehydrogenase BASE COUNT 896 a 442 c 644 g 852 t ORIGIN 1 gatcaggtag cgagaatctt tgatgaaggt ttttcaacca aagcaaagga aaatagagga 61 attggtttgc atttagtaaa acaaattgtt gaaaaaggaa acggtcagat cgaagtagag 121 tcagaattag atgttggaac gacttttatc attacattct ttttataggg ggagtgggaa 181 atgaataaaa aagcatggac cgtgcttctc atagaagacg atcctatggt acaagaagtg 241 aaccgccaat ttattgaaca agttgaaggg ttcactgtta tcgctgcagc ttcgaatggt 301 ttagaggggg tacagctcat taaacagcat cagcctgatt taacgattat tgatatgtat 361 atgcctagtc aagatggctt aaccacctta cagcaaattc gagcaaatgg ctataaaaca 421 gacgtgatag cagttacggc tgcaagtgat attgaaaccg tacgcaaagt tcttcaatat 481 ggcgctgtgg attatattat gaaaccgttc aagtttgaac gaatgaagca agcgcttgag 541 cagtatcgtt cgtttcaagt taaaataagt caaaaagaac atattactca gtctgaatta 601 gattctatgc tgtttcagca attcgaagaa aaagccgatt tgcttcccaa ggggctaaat 661 gcggttacgt taaggaggat acaacaatat ctttccgaac aaaatcatcc aatttctgct 721 gaagaagtgg cggacggcgt aggaattgcg cgtgttacag caagaaggta tttagagttt 781 ttagaacagg aaaacgagct gaaattatca gttgaatacg gcagagtggg gagacctatt 841 aatcgctata tgttaaaaat aaattaaatc atacagaaca gcttttattt ggaaaagctg 901 tttttttgcg ttagaaagta tatctttttc tctcctagaa caaattaagg tatacagttt 961 tcgctaccca aagaatattt cgtgcggtca ttaatccata aaatgtccct gaaaaggatt 1021 aatggcggaa aaattgggga atatgcactt tgacatttaa ttttaacaca ggaaggtttt 1081 gaaacatgga catattttta gccgtcttac cagccatatt ttggggaagc attgtgcttt 1141 ttaatgtgaa actaggcgga ggaccttata gtcaaacgct tggaaccaca ttgggagctt 1201 taattttctc catcggtatt tatatttttg tacaccctac gtttacacct ttaatctttg 1261 gggttggagt tgtttcgggg ctattttggg cagttggaca aagtaatcag ctgaaaagta 1321 ttgatttaat tggagtttct aaaacgatgc ctatttcaac ggggcttcag ttagtttcca 1381 cttcattatt tggagtaatt gtgtttcacg agtggtctac aaaaacttca atcattcttg 1441 gtgtgctcgc tcttatcttt attattgtag ggattgtttt agcatcactt caaagcaaag 1501 aagagaaaga ggctgaagaa ggaaaaggaa acttcaaaaa aggaattgtt attttattaa 1561 tttcaaccgt tggttattta gtttatgttg tagtagcccg tctatttaat gtagacggat 1621 ggtcggcttt attacctcaa gcaattggta tggttattgg aggagtattg ctgacgttca 1681 agcataagcc atttaataaa tatgcaattc gcaacattat cccaggtctt atttgggccg 1741 ctggtaatat gtttttattc atctcacaac ctaaagtagg cgtagcgaca agcttttcgc 1801 tttctcaaat gggaatcgtc atttcaacat taggcgggat cattatttta ggtgagaaga 1861 aaacgaagcg tcagttagtt gggattatta ttgggattat actgatcatc atagcaggag 1921 tcatgttagg gctcgccaaa agctaactag gaggttatta acaatgtata aagatttaga 1981 agggaaagta gttgtcataa caggttcatc taccggttta ggaaaagcaa tggcgattcg 2041 ttttgcgaca gaaaaagcta aagtagttgt gaattatcgt tctaaagaag aagaagctaa 2101 cagcgtttta gaagaaatta aaaaagtcgg cggagaggca attgccgtta aaggtgacgt 2161 aacagttgag tctgacgtga tcaatttagt tcaatcttct attaaagaat ttggaaagtt 2221 agacgttatg attaataacg caggaatgga aaatccggtt tcatctcatg aaatgtcttt 2281 aagcgattgg aataaagtaa ttgatacgaa cttaacggga gcatttttag gcagccgtga 2341 agcgattaaa tattttgtgg aaaatgatat taagggaaca gttattaaca tgtcgagtgt 2401 tcacgagaaa attccttggc cattatttgt tcattacgca gcaagtaaag gcggaatgaa 2461 gctcatgacc gaaacacttg cattagaata cgctccaaaa ggtattcgtg taaataacat 2521 tggaccggga gcgattaata caccgattaa cgctgagaaa tttgctgatc ctgagcagcg 2581 tgcggatgta gaaagcatga ttccaatggg atacattgga gagccggaag aaattgcagc 2641 ggttgctgca tggctagctt cttcagaggc aagttatgta acagggatta cgctctttgc 2701 tgacggcggt atgacccagt acccatcatt ccaagcagga cgcggataag aaaaaacgca 2761 ctctataata gagtgcgttt tttagtttcc ctgagctttt ttttggttct taggagctga 2821 ctggtgttga attc // LOCUS BMEGDH2 1202 bp ds-DNA BCT 04-AUG-1990 DEFINITION B. megaterium glucose dehydrogenase (EC 1.1.1.47) gene. ACCESSION D90044 KEYWORDS glucose dehydrogenase; isozyme. SOURCE Bacillus megaterium (strain IAM1030) DNA. ORGANISM Bacillus megaterium Prokaryota; Bacteria; Firmicutes; Endospore-forming rods and cocci; Bacillaceae. REFERENCE 1 (bases 1 to 1202) AUTHORS Mitamura,T., Ebora,R.V., Nakai,T., Makino,Y., Negoro,S., Urabe,I. and Okada,H. TITLE Active and silent isozyme genes of glucose dehydrogenase from Bacillus megaterium IAM1030 JOURNAL J. Bacteriol. (1990) In press STANDARD full staff_entry COMMENT These data kindly submitted in computer readable form by: Toshihide Mitamura Department of Fermentation Technology Osaka University 2-1 Yamada-oka Suita, Osaka 565 Japan Phone: 06-877-5111 x4373 Fax: 06-876-9036 FEATURES from to/span description pept 125 910 glucose dehydrogenase signal 27 32 put. -35 region signal 49 55 put. -10 region signal 111 116 SD sequence signal 928 959 termination signal BASE COUNT 427 a 190 c 268 g 317 t ORIGIN 1 tgaatgacag tttgagaaag aagagataga aaaatgttta ttcccttctt aaaacttaaa 61 ctgtatctgt aattagtaca gtataacaag acatatcagg cagaaaaagt aggaggactt 121 caagatgtat acagatttaa aagataaagt agtagttgta acaggtggat caaaagggtt 181 gggtcgcgcc atggccgttc gttttggtca agagcagtca aaagtagttg taaactaccg 241 cagcaatgaa gaggaagcgc tagaagtgaa aaaagaaatt gaagaagctg gcggtcaagc 301 tattattgtt cgaggcgacg ttacaaaaga agaagacgtt gtgaaccttg tagagacagc 361 tgttaaagaa tttggttcat tagacgttat gattaataat gcaggtgttg aaaacccggt 421 tccttctcat gaattatcat tagaaaactg gaaccaagtg attgatacaa acttaacagg 481 ggcattttta ggaagccgtg aagcaattaa atatttcgtc gaaaatgaca ttaaaggaaa 541 cgttattaac atgtccagcg ttcacgaaat gattccttgg ccattatttg ttcactatgc 601 agcaagtaaa ggcggtatga aattaatgac ggaaacattg gctcttgaat atgcgccaaa 661 aggtatccgc gtaaataaca ttggaccagg tgcaatcgat acgccaatca acgctgaaaa 721 attcgcagat ccggaacagc gtgcagacgt agaaagcatg attccaatgg gctatatcgg 781 caaaccggaa gaaatcgcat cagttgcagc attcttagca tcatcacaag caagctatgt 841 aacaggtatt acattatttg ctgatggcgg tatgacaaaa tatccttctt tccaagcggg 901 aagaggttaa taaataaagc taaaaggaaa aagacctcgg aatattccga ggtctttttt 961 gtattgtcat aaatgtacgg attatttacc gaatattgaa acttttattg aagtgttacg 1021 tatataagct aacgacgaat aaaggacgtg ttgatatgct acccgaaacg attcaacaaa 1081 aagtagatca gtatagaggt ttttatatca gcttaaaaaa tgaactcaaa tggaaagtgg 1141 cagatcccaa gcagtttatg gctatcgctt ctatgtatgc agtgaaaggt aaatcgctcg 1201 ag // LOCUS BMOPTTHP1 1023 bp ss-mRNA INV 04-AUG-1990 DEFINITION B.mori PTTH mRNA. ACCESSION D90082 KEYWORDS PTTH; preproPTTH. SOURCE B.mori (Kinshu X Showa strain) 5th-instar larva brain, cDNA to mRNA, clones P1, P2, C2, C9 and C19. ORGANISM Bombyx mori Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta; Pterygota; Neoptera; Holometabola; Lepidoptera; Ditrysia; Bombycoidea; Bombycidae. REFERENCE 1 (bases 1 to 1023) AUTHORS Kawakami,A., Kataoka,H., Oka,T., Mizoguchi,A., Kimura-Kawakami,M., Adachi,T., Iwami,M., Nagasawa,H., Suzuki,A. and Ishizaki,H. TITLE Molecular cloning of the Bombyx mori prothoracicotropic hormone JOURNAL Science 247, 1333-1335 (1990) STANDARD full staff_entry COMMENT These data kindly submitted in computer readable form by: Hironori Ishizaki Department of Biology, School of Science, Nagoya University Chikusa-ku Nagoya 464-01 Japan Phone: 052-781-5111 x2472 Fax: 052-783-0719 Telex: SCUNAG J: 447-7323 FEATURES from to/span description pept 34 708 preproPTTH matp 379 708 PTTH subunit signal 768 773 polyadenylation signal signal 826 831 polyadenylation signal signal 997 1002 polyadenylation signal signal 1004 1009 polyadenylation signal BASE COUNT 358 a 179 c 186 g 300 t ORIGIN 1 atcgttcagt tgagttatcc agcattccca atcatgatta ctcgaccgat tatattagtc 61 attttgtgtt acgctattct tatgatagtg cagtcattcg tgcctaaagc ggtagcgctg 121 aaaagaaaac cagacgtggg tggttttatg gtagaagacc aacgcacaca taaaagtcac 181 aactacatga tgaaaagagc aagaaatgac gttttgggag ataaagaaaa cgtcaggccg 241 aatccttact acacggagcc ttttgaccca gacacgagcc cagaagaatt gtccgcttta 301 atagttgatt acgccaatat gattaggaac gatgttattc tgttggataa ttccgttgaa 361 acgagaactc gaaaaagggg aaacattcaa gttgaaaacc aagctattcc ggatccacct 421 tgcacttgca aatacaagaa agaaatagaa gacttgggcg aaaactctgt tccacgcttc 481 attgaaacca gaaactgtaa taaaacacaa cagccgactt gtcgaccccc ctacatttgc 541 aaagaaagtt tatacagtat aactatttta aaaagaaggg aaactaaatc gcaggagtct 601 ctcgagatac cgaatgaatt gaaatatcga tgggtggcgg aatctcaccc cgtcagcgtg 661 gcgtgtttgt gtacaagaga ctaccaacta cgatataata ataattaatt gttttgactt 721 acgcctgatg atttgttccg aatcgaattt atttaattac tttatacaat aaagcttata 781 ttaaaaatta atgataatca attttaatta aaccaaattg aaaaaaataa aaatttcctc 841 cgattttttg tttttagtgg tggtacattc agcgaagcac tgttttgcta ggccagatgt 901 tagtagatca atacagtttt gatgcttacc ttgaaagctg tgctcttatt atactattca 961 aataagatta tatagttaaa tatattatgt atatctatta aatattaaaa gacacaattt 1021 aaa // LOCUS BMOPTTHP4 944 bp ss-mRNA INV 04-AUG-1990 DEFINITION B.mori preproPTTH mRNA. ACCESSION D90083 KEYWORDS PTTH; preproPTTH. SOURCE B.mori (Kinshu X Showa strain) 5th instar larva brain, cDNA to mRNA, clone P4 and C21. ORGANISM Bombyx mori Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta; Pterygota; Neoptera; Holometabola; Lepidoptera; Ditrysia; Bombycoidea; Bombycidae. REFERENCE 1 (bases 1 to 944) AUTHORS Kawakami,A., Kataoka,H., Oka,T., Mizoguchi,A., Kimura-Kawakami,M., Adachi,T., Iwami,M., Nagasawa,H., Suzuki,A. and Ishizaki,H. TITLE Molecular cloning of the Bombyx mori prothoracicotropic hormone JOURNAL Science 247, 1333-1335 (1990) STANDARD full staff_entry COMMENT These data kindly submitted in computer readable form by: Hironori Ishizaki Department of Biology, School of Science, Nagoya University Chikusa-ku Nagoya 464-01 Japan Phone: 052-781-5111 x2472 Fax: 052-783-0719 Telex: SCUNAG J: 447-7323 FEATURES from to/span description pept < 1 631 preproPTTH matp 302 631 PTTH subunit signal 691 696 polyadenylation signal signal 749 754 polyadenylation signal signal 918 923 polyadenylation signal signal 925 930 polyadenylation signal BASE COUNT 337 a 163 c 177 g 267 t ORIGIN 1 tcttatgata gtgcagtcat tcgtgcctaa agcggtagcg ctgaaaagaa aaccagacgt 61 gggtggtttt atggtagaag accaacgcac acataaaagt cacaactaca tgatgaaaag 121 agcaagaaat gacgttttgg gagataaaga aaacgtcagg ccgaatcctt actacacgga 181 gccttttgac ccagacacga gcccagaaga attgtccgct ttaatagttg attacgccaa 241 tatgattagg aatgatgtta ttctgttgga taattccgtt gaaacgagaa cgcgaaaaag 301 gggaaacatt caagttgaaa accaagctat tccggaccca ccttgcactt gcaaatacaa 361 gaaagaaata gaagacttgg gcgaaaactc tgttccacgc ttcattgaaa ccagaaactg 421 taataaaaca caacagccga cctgtcgacc cccctacatt tgcaaagaaa gtttatacag 481 tataactatt ttaaaaagaa gggaaactaa atcgcaggag tctctcgaga taccgaatga 541 attgaaatat cgatgggtgg cggaatctca ccccgtcagc gtggcgtgtt tgtgtaccag 601 agactaccaa ctacgatata ataataatta attgttttga ctcacgcctg atgatttgtt 661 ccgaatcgaa tttatttaat tactttatac aataaagctt atattaaaaa ttaatgataa 721 tcaattttaa ttaaaccaaa ttgaaaaaaa taaaaatttc ctcagatttt tggtttttag 781 tgctggtaca ttcagggaag tactgttttg ctaggccaga tgttagtaga tcaatagagt 841 ttttatgctt gccttgaaag ctgtgctctt attatattat gctattcaaa taagattata 901 tagttaaata tatatctatt aaatattaaa agacacaatt taaa // LOCUS HUMMTSDHB 958 bp ss-mRNA ORG 04-AUG-1990 DEFINITION Human mitochondrial succinate-ubiquinone oxidoreductase (EC 1.3.99.1) iron sulfur subunit (sdh B) mRNA. ACCESSION D90047 KEYWORDS Ip; complex II; iron sulfur subunit; sdh B; succinate-ubiquinone oxidoreductase. SOURCE Human liver mitochondrion, cDNA to mRNA. ORGANISM Mitochondrion Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae; Homo sapiens. REFERENCE 1 (bases 1 to 958) AUTHORS Kita,K., Oya,H., Gennis,R.B., Ackrell,B.A.C. and Kasahara,M. TITLE Human complex II(succinate-ubiquinone oxidoreductase): cDNA cloning of iron sulfur(Ip) subunit of liver mitochondria JOURNAL Biochem. Biophys. Res. Commun. (1990) In press STANDARD full staff_entry COMMENT These data kindly submitted in computer readable form by: Kiyoshi Kita Department of Parasitology Juntendo University 2-1-1,Hongo Bunkyo-ku, Tokyo 113 Japan Phone: 03-813-3111 x3542 Fax: 03-814-9300 FEATURES from to/span description pept < 1 789 succinate-ubiquinone oxidoreductase (sdh B) (AA at 1) site 74 95 iron-sulfur binding site I site 167 179 iron-sulfur binding site II site 224 236 iron-sulfur binding site III BASE COUNT 319 a 212 c 204 g 223 t ORIGIN 1 tggcggacgt gcctgcaggc ctcccgagga gcccagacag ctgcagccac agctccccgt 61 atcaagaaat ttgccatcta tcgatgggac ccagacaagg ctggagacaa acctcatatg 121 cagacttata aggttgacct taataaatgt ggccccatgg tattggatgc tttaatcaag 181 attaagaatg aagttgactc tactttgacc ttccgaagat catgcagaga aggcatctgt 241 ggctcttgtg caatgaacat caatggaggc aacactctag cttgcacccg aaggattgac 301 accaacctca ataaggtctc aaaaatctac cctcttccac acatgtatgt gataaaggat 361 cttgttcccg atttgagcaa cttctatgca cagtacaaat ccattgagcc ttatttgaag 421 aagaaggatg aatctcagga aggcaagcag cagtatctgc agtccataga agagcgtgag 481 aaactggacg ggctctacga gtgcattctc tgtgcctgct gtagcaccag ctgccccagc 541 tactggtgga acggagacaa atatctgggg cctgcagttc ttatgcaggc ctatcgctgg 601 atgattgact ccagagatga cttcacagag gagcgcctgg ccaagctgca ggacccattc 661 tctctatacc gctgccacac catcatgaac tgcacaagga cctgtcctaa gggtctgaat 721 ccagggaaag ctattgcaga gatcaagaaa atgatggcaa cctataagga gaagaaagct 781 tcagtttaac tgtttccatg ctaaacatga tttataacca gctcagagct gaacataatt 841 tatatctaat ttgagttcct ttaaagatct tggttttcca tgaatacagc atgtataata 901 aaaattttaa gaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaa // LOCUS HUMNCAW 2287 bp ss-mRNA PRI 04-AUG-1990 DEFINITION Human nonspecific cross-reacting antigen (NCA-W272) mRNA. ACCESSION D90064 KEYWORDS CEA; CEA gene family; PI-anchored membrane protein. SOURCE Human white blood cells, cDNA to mRNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 2287) AUTHORS Arakawa,F., Kuroki,M., Misumi,Y., Oikawa,S., Nakazato,H. and Matsuoka,Y. TITLE Characterization of a cDNA clone encoding a new species of the nonspecific cross-reacting antigen (NCA), a member of the CEA gene family JOURNAL Biochem. Biophys. Res. Commun. 166, 1063-1071 (1990) STANDARD full staff_entry COMMENT These data kindly submitted in computer readable form by: Fumiko Arakawa First Department of Biochemistry School of Medicine Fukuoka University 7-45-1 Nanakuma Jonan-ku Fukuoka 814-01 Japan Phone: 092-801-1011 x2892 Fax: 092-801-3600 FEATURES from to/span description ORF 87 1136 nonspecific cross-reacting antigen ORF BASE COUNT 618 a 593 c 453 g 623 t ORIGIN 1 ggacagcaca gctgacagcc gtgctcagaa agtttctgga tcccaggctc atctccacag 61 aggagaacac gcaggcagca gagaccatgg ggcccatctc agccccttcc tgcagatggc 121 gcatcccctg gcaggggctc ctgctcacag cctcactttt caccttctgg aacccgccca 181 ccactgctca gctcactatt gaagctgtgc catccaatgc tgcagagggg aaggaggttc 241 ttctacttgt ccacaatctg ccccaggacc ctcgtggcta caactggtac aaaggggaaa 301 cagtggatgc caaccgtcga attataggat atgtaatatc aaatcaacag attaccccag 361 ggcctgcata cagcaatcga gagacaatat accccaatgc atccctgctg atgcggaacg 421 tcaccagaaa tgacacagga tcctacaccc tacaagtcat aaagctaaat cttatgagtg 481 aagaagtaac tggccagttc agcgtacatc cggagactcc caagccctcc atctccagca 541 acaactccaa ccccgtggag gacaaggatg ctgtggcctt cacctgtgaa cctgagactc 601 agaacacaac ctacctgtgg tgggtaaatg gtcagagtct cccggtcagt cccaggctgc 661 agctgtccaa tggcaacagg accctcactc tactcagtgt cacaaggaat gacgtaggac 721 cctatgaatg tgaaatacag aacccagcga gtgcaaactt cagtgaccca gtcaccctga 781 atgtcctcta tggcccagat gcccccacca tttccccttc agacacctat taccatgcag 841 gggtaaatct caacctctcc tgccatgcgg cctctaatcc accctcacag tattcttggt 901 ctgtcaatgg cacattccag caatacacac aaaagctctt tatccccaac atcactacaa 961 agaacagcgg atcctatgcc tgccacacca ctaactcagc cactggccgc aacaggacca 1021 cagtcaggat gatcacagtc tctgatgctg tagtacaagg aagttctcct ggcctctcag 1081 ctagagccac tgtcagcatc atgattggag tactggccag ggtggctctg atatagtagc 1141 tctggtgtag tttctgcatt tcaagaagac tggcagacag ttgtttttat tcttcctcaa 1201 agcatttgca atcagctacc attcaaaatt gcttcttctt caagatttat ggaaaatact 1261 ctgacgagta ctcttgaaca caagttcctg ataactttaa gatcacgcca ctggactgtc 1321 tatgaacttg caaacaggct gatacctttg tgaagttgcc caccaaaaca cagaaggaaa 1381 aaaacatgaa tttcattgaa ctaaataata atgaggataa tgtttttaag attttttttt 1441 tttttttttt tgagatggaa tctcgctctg tcgcccaggc tggagtgcag tggcacgatc 1501 tcaactcact gcaacgtccg cctcctgggt tcacaccatt ctcctgcctc agcctcctga 1561 gtagctggga ctacaggcgc ctgccacaac gcccggctaa ttttttgtat ttttagtaga 1621 gacggggttt cactgtggtc tcaatctcct gacttcatgg tccgcctgcc tcagcctccc 1681 aaagttctgg gattacaggt gtgagccacc gcgcccagcc cgtttttaag attttttatt 1741 tgaaaaattg ccaattcttt aagtgttttc tttttcagat ttatgaattt ctttatcttt 1801 taagctatct ataccttact gcaatttggt aaagcagact tttgtgaaca aaaattataa 1861 catttacttt tgctccctac ctgactgcca cagaactggg caactattca tgagtattca 1921 tatgtttatg gtaattcagt tatttgcaca agttcagtga gaatctgctg tctttataat 1981 gggatatagt ttaaaacatt ggttatatta ccaaggcttt gattgggatg ttatatttga 2041 gaaaatacag agaatgatag attaacggag tgtctaatct atcgtgtcaa ccccaaattt 2101 ttacgtatga gatcctttag tccacccaat ggctgacagt aacagcatct ttaacacaac 2161 tctttgttca aatgtactat ggtctctttt agagtcagac tcctagactc acttgttctc 2221 actgtctgtt ttaatttaac ccaggcatgc aatgctagat aataaaattg ctccctattg 2281 gctgatc // LOCUS PIGDESTN 1666 bp ss-mRNA MAM 04-AUG-1990 DEFINITION Porcine destrin mRNA. ACCESSION D90053 J05290 KEYWORDS actin-binding protein; cofilin; destrin. SOURCE Pig adult brain, cDNA to mRNA, clone PD2. ORGANISM Sus scrofa Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Artiodactyla; Suiformes; Suidae. REFERENCE 1 (bases 1 to 1666) AUTHORS Moriyama,K., Nishida,E., Yonezawa,N., Sakai,H., Matsumoto,S., Iida,K. and Yahara,I. TITLE Destrin, a mammalian actin-depolymerizing protein, is closely related to cofilin: Cloning and expression of porcine brain destrin cDNA JOURNAL J. Biol. Chem. 265, 5768-5773 (1990) STANDARD full staff_entry COMMENT These data kindly submitted in computer readable form by: Kenji Moriyama Department of Biophysics and Biochemistry Faculty of Science University of Tokyo 7-3-1 Hongo Bunkyoku Tokyo 113 Japan Phone: 03-821-2111 x4408 FEATURES from to/span description pept 54 551 destrin mRNA < 1 1666 destrin mRNA signal 1645 1660 polyadenylation signal BASE COUNT 492 a 322 c 372 g 480 t ORIGIN 1 actcggctcc ggccggctcg gtctcccgcg cttctgcgac cgccgaggcg aacatggctt 61 caggagtgca agttgctgat gaagtatgtc gcatttttta tgacatgaaa gttcggaagt 121 gctccacacc agaagaaatc aagaaaagaa agaaggctgt cattttttgt ctcagtgcag 181 acaaaaagtg catcattgta gaagaaggca aagagatctt agttggagat gttggtgtaa 241 ccataaccga tcctttcaag catttcgtgg ggatgcttcc tgagaaagat tgtcgctatg 301 ctttgtatga tgcaagcttt gaaaccaagg aatccagaaa agaggagttg atgttttttc 361 tgtgggcacc agaactagca cctctgaaaa gtaaaatgat ctatgccagc tccaaggacg 421 caatcaaaaa gaaatttcaa ggcataaaac atgaatgtca agcaaatggg ccagaagacc 481 tcaatcgggc ttgtattgct gaaaagctag gtggatcctt aattgtagcc tttgaaggat 541 gccctgtgta gatgatcatt cagtgccaca gatcgaaagc ttccgtgttc aatgttatcc 601 tcttgctata taagtaaagc aaacactgag gccagggact cactgagggg agctgtcttg 661 tcatttgtta gagtaaacta actattctat gaacatgtgc acatggccct aaatcaatct 721 aaactctact ttttttgggg gtgtgtgtga aagtcttatt ggccaaaata tctattttga 781 tgagtctgct tgtagagatt tttgttaagc tcatgatttt taatcgtttc aacgtgtggt 841 tcattaaaca atgcaaggcc agatgaagag aattattgca tctttgttaa cttcagcagt 901 tactttgttt cttttgctta gagaattggt cataatcagt tatattggtc atataatttt 961 ggcccaaatt cttgagtctc tgctgagcta acctgaataa tggaaaataa ttctactcac 1021 aacaggtaac agcactaata tgctaactac agtaagatta aatcaggcca gattctacca 1081 gacgtggata ctgcctccaa aactgtgtgc acttagaacc agcgctgagc ttgcaaagca 1141 ctatttcaag cacgtagttg aaacacagca aacagctcct gcacttgaag tgagctgctt 1201 gctcactagt cagaaggctg tacagagagt gaccttgcat cttggaaatc agaacatgta 1261 ctgtcttgta ccaactaatt agagtacaaa ttagggctcc gttgtaatat gctttattag 1321 tggaaatggt aagatggtat atcaacaagc tgggtaccta tgctatcttt aatttatctc 1381 ctttggaact gtgttgcttc tggtacagta aggtgtagaa gaacattctg tttactctgg 1441 ggcctgggag aacctcttta ccttcctaga gcagtttgcc gactgtatgt gatacgggga 1501 ccagctatga cggcagcatc cacaggaagc cactgcctga tgacacttgg aagtgattgt 1561 ctttaacatc acaggcataa cactctgaac agtatagaga tgcaccaaca gttgaattta 1621 gaagtagcag tactggcttt acgtaataaa ggaaccattt taactt // LOCUS RATPMP70X 3324 bp ss-mRNA ROD 04-AUG-1990 DEFINITION Rat liver 70-kDa peroxisomal membrane protein (PMP70) mRNA. ACCESSION D90038 J05256 KEYWORDS PMP70; peroxisomal membrane protein. SOURCE Rat(Wistar) liver, cDNA to mRNA, clones lambda-cPM[36,102,156,181, 189,201]. ORGANISM Rattus norvegicus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 3324) AUTHORS Kamijo,K., Taketani,S., Yokota,S., Osumi,T. and Hashimoto,T. TITLE The 70-kDa Peroxisomal Membrane Protein Is a Member of the Mdr(P-Glycoprotein)-Related ATP-binding Protein Superfamily JOURNAL J. Biol. Chem. (1990) In press STANDARD full staff_entry COMMENT These data kindly submitted in computer readable form by: Keiju Kamijo Department of Biochemistry Shinshu University School of Medicine Matsumoto 390 Japan Phone: 263-35-4600 x5182 Fax: 263-33-6458 FEATURES from to/span description pept 36 2015 peroxisomal membrane protein (PMP70) signal 3271 3276 polyadenylation signal (put.) BASE COUNT 921 a 658 c 773 g 972 t ORIGIN 1 gaattccagt gcggctcgct cgccctgccg gtgccatggc ggccttcagc aagtacttga 61 cggcgcggaa ctcctcgctg gcgggggccg cgttcctgct gttctgcctg ctccacaagc 121 ggcgtcgcgc cctcggcctg cacggtaaga aaagtggaaa accgccatta cagaataatg 181 agaaagaagg aaagaaagag cgagctgtgg tggacaaagt gtttttatca aggctctcac 241 agatcctaaa aattatggtc cctagaacat tttgtaaaga gacagggtac ttgatactta 301 ttgctgttat gctggtatct cgaacatact gtgatgtttg gatgattcaa aatggcacac 361 tgattgaaag tggcatcatt ggtcgtagca gtaaagattt caagagatac ttattcaact 421 tcatcgctgc catgcctctt atctctctgg ttaataactt cttgaagtat gggttaaatg 481 agctcaaact gtgcttccgt gtgcggctca ctagatacct ctatgaggag tatctccaag 541 ccttcaccta ctataaaatg ggcaacctgg ataacagaat agcaaaccca gaccagctgc 601 ttacacaaga tgtagaaaag ttttgtaaca gtgtagttga tctttattcg aatcttagta 661 agccattttt agacatagtt ttgtatattt tcaagttaac aagtgcaatt ggagctcagg 721 gcccggcaag catgatggcc tacttgcttg tttctgggct attcctaact cgactcagaa 781 gacccatcgg taaaatgacg attatggagc agaagtatga aggagaatat agattcgtta 841 attcacggct tatcactaat agtgaagaaa ttgcctttta caatgggaat aaacgagaaa 901 agcagacaat ccactctgtc ttccgaaaac tggtggaaca cctacataat ttcattttct 961 tccggttttc tatgggtttc attgatagca tcattgccaa atatattgcc actgtagttg 1021 ggtacctggt tgtcagtcgc ccgttcctag acctggcgca tccgcgacac cttcacagca 1081 cccactcaga gctgctggag gattactacc aaagtggaag aatgcttttg agaatgtctc 1141 aagctttggg gcggatagtt ttggctgggc gtgaaatgac tagattggct ggttttacgg 1201 ctcggattac ggaattaatg caagtactaa aggatttaaa tcatggcaaa tatgaacgta 1261 caatggtgtc acaacaggat aagggtattg aaggagcaca agctagtccc ttgatacctg 1321 gtgctggaga aatcatcaat gcagacaaca ttataaagtt tgatcatgtt cctttagcaa 1381 caccaaatgg agatatcttg atccaagacc ttagttttga agttcgatct ggggccaacg 1441 ttctcatttg tggtccaaat ggctgtggaa agagctccct cttccgtgtt cttggtgaat 1501 tatggcctct ctttggagga catcttacta aacctgagag aggaaagtta ttttatgttc 1561 ctcagcgacc ctatatgacc ctgggaacac tgagagacca agtaatatat ccagatggaa 1621 aggaggatca gaagaagaag gggatatctg accaagtgct gaaggggtac ttggacaatg 1681 tacagttggg ccatatcctt gagcgggaag gaggctggga cagtgttcag gactggatgg 1741 atgtactcag cggaggagaa aaacaaagaa tggcgatggc aagattgttt tatcataaac 1801 cccagtttgc cattctggat gagtgcacaa gtgcagttag tgtggatgtg gaagactaca 1861 tttacagcca ctgtcggaag gttggcatca ccctcttcac tgtctcacac aggaaatccc 1921 tttggaaaca ccacgagtac tacctgcaca tggatggcag aggcaattat gaattcaaaa 1981 agatcacaga agacacagtt gagttcggat catagagacc atctggagaa cttcacactt 2041 cacaagagaa tgaatgaaca gaatgcattt gtaaacaacg tgcattgtaa aataaagtta 2101 agcttgtttt ttttaaaaaa acaaagctac aaattgacta gatataggat aattgaaaca 2161 tgttaaaaca tttaatattg tataggatat tgctaattgt gtatatgttg gtttaattat 2221 taattatgta ctaagaatgt ccttattctt gtggttaaaa aacctgcctg aattaaattg 2281 ggcttaaatc agtgtaacct gattcatggg atgtaaacca tttgaagtca gctaatttga 2341 cttttatagc tctgtctttt tctttaatga agaaccctat ttaaaactgg gtcattagct 2401 gtttattcta acaaagtagt cttgagttcc tttttgggtt tttttttttt tttttttttt 2461 tttttttttg tgccccatgg tagtgggaac caaaccaatc acaatgtttt attggaacat 2521 attccatcat cacaggatag catttattaa acagtggcgg atttctctag ctgctacatt 2581 tattctcatt cctcatacat accttgaggt gcatttgatt ccaggagagc catttgggtt 2641 ttctttagct aaataataaa tgtacccgtc tcagtctttt ggactgagtc gttctgaagg 2701 ctctcgtgtg gacagcagtg tgtgcagtct cttacagtcc gtgcctgctc cacatggtac 2761 cagtcttacc agtgcttgag agctcagaca caccctgctg catgaagttg gaggtctcgg 2821 gagggtttta gattttgtga cgggaaccgg aaaggctcgt cagagtgtgg ctgtgtcatg 2881 gtgagcacca cgtggctgta gaggcccgac atgaggtaat gcactgagca cacaacgcca 2941 ctgctgctgt ctgtggctgt gggttcttaa aagtgctgga ctttgtcatg ctcgtgggcc 3001 aatgacattt cctaggagcg gcctctgact cctgtgcagc tgcgtctgtg tcagctctgg 3061 ctccctggaa ccacgagtga ctttgcacaa aggagggctg agagcggact tgatcagtaa 3121 gtcgtcgtga atcagtttgc ttgagtgggc tcggaatggg ccttatcacg atggttttgt 3181 ttcttcgtaa ctcataatca ctggctacca ggataaccct gatgtattga ttccgtgaat 3241 acatcacatt caatcttacc atgtctcctt agcaaacgtg tgtacttatt ttctgttcag 3301 attaaaaaaa aaaaaaagga attc // LOCUS VACSANT 1525 bp ds-DNA VRL 04-AUG-1990 DEFINITION Vaccinia virus surface (S) antigen gene. ACCESSION D90076 KEYWORDS S gene; surface antigen. SOURCE Vaccinia virus DNA. ORGANISM Vaccinia virus Viridae; ds-DNA enveloped viruses; Poxvirinae; Orthopoxvirus. REFERENCE 1 (bases 1 to 1525) AUTHORS Ueda,Y., Morikawa,S. and Matsuura,Y. TITLE Identification and nucleotide sequence of the gene encoding a surface antigen induced by vaccinia virus JOURNAL Virology (1990) In press STANDARD full staff_entry COMMENT These data kindly submitted in computer readable form by: Yoshiaki Ueda National Institute of Health Gakuen, Musashimurayama Tokyo 190-12 Japan Phone: 0425-61-0771 Fax: 0425-65-3315 FEATURES from to/span description pept 382 1437 surface antigen S BASE COUNT 568 a 229 c 258 g 470 t ORIGIN 1 tctagacact acactatatg cagttttaag atgccataat tcgaaaaagt taagaagata 61 cctcaacgag ttaaaaaaat ataataacga taagtccttt aaaatatatt ctaatattat 121 gaatgagaga taccttaatg tatattataa agatatgtac gtgtcaaagg tatatgataa 181 actatttcct gttttcacag ataaaaattg tctactaaca ttactacctt cagaaattat 241 atacgaaata ttatacatgc tgacaattaa cgatctttat aatatatcgt atccacctac 301 caaagtatag ttgtattttt ctcatgcgat gtgtgtaaaa aaactgatat tatataaata 361 ttttagtgcc gtataataaa gatgacgatg aaaatgatgg tacatatata tttcgtatca 421 ttattgttat tgctattcca cagttacgcc atagacatcg aaaatgaaat cacagaattc 481 ttcaataaaa tgagagatac tctaccagct aaagactcta aatggttgaa tccagcatgt 541 atgttcggag gcacaatgaa tgatatagcc gctctaggag agccattcag cgcaaagtgt 601 cctcctattg aagacagtct tttatcgcac agatataaag actatgtggt taaatgggaa 661 aggctagaaa aaaatagacg gcgacaggtt tctaataaac gtgttaaaca tggtgattta 721 tggatagcca actatacatc taaattcagt aaccgtaggt atttgtgtac cgtaactaca 781 aagaatggtg actgtgttca gggtatagtt agatctcata ttaaaaaacc tccttcatgc 841 attccaaaaa catatgaact aggtactcat gataagtatg gcatagactt atactgtgga 901 attctttacg caaaacatta taataatata acttggtata aagataataa ggaaattaat 961 atcgacgata ttaagtattc acaaacggga aagaaattaa ttattcataa tccagagtta 1021 gaagatagtg gaagatacaa ctgttacgtt cattacgacg acgttagaat caagaatgat 1081 atcgtagtat caagatgtaa aatacttacg gttataccgt cgcaagacca caggtttaaa 1141 ctaatactag atccaaaaat caacgtaacg ataggagaac ctgccaatat aacatgcact 1201 gctgtgtcaa cgtcattatt gattgacgat gtactgattg aatgggaaaa tccatccgga 1261 tggcttatag gattcgattt tgatgtatac tctgttttaa ctagtagagg cggtatcacc 1321 gaggcgacct tgtactttga aaatgttact gaagaatata taggtaatac atataaatgt 1381 cgtggacaca actattattt tgaaaaaacc cttacaacta cagtagtatt ggagtaaata 1441 cacaatgcat ttttatatac attactgaat aattattatt attatttata tcgtatttgt 1501 gctatagaat gaatgaggat acgcg // LOCUS YSCA1 881 bp ds-DNA PLN 04-AUG-1990 DEFINITION S. cerevisiae acidic ribosomal protein A1 (YSCA1). ACCESSION D90072 X13682 KEYWORDS acidic ribosomal protein; ribosomal protein. SOURCE S. cerevisiae (strain IFO-40028) DNA. ORGANISM Saccharomyces cerevisiae Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes; Endomycetales; Saccharomycetaceae. REFERENCE 1 (bases 1 to 881) AUTHORS Mitsui,K. and Tsurugi,K. TITLE Identification of A1 protein as the fourth member of 13 kDa-type acidic ribosomal protein family in yeast Saccharomyces cerevisiae JOURNAL Unpublished (1990) STANDARD full staff_entry REFERENCE 2 (bases 1 to 315; 631 to 881) AUTHORS Mitsui,K. and Tsurugi,K. TITLE Identification of A1 protein as the fourth member of 13 kDa-type acidic ribosomal protein family in yeast Saccharomyces cerevisiae JOURNAL Biochem. Biophys. Res. Commun. 161, 1001-1006 (1989) STANDARD full staff_entry REFERENCE 3 (bases 277 to 742) AUTHORS Tsurugi,K. and Mitsui,K. TITLE cDNA and deduced amino acid sequence of acidic ribosomal protein A1 from Saccharomyces cerevisiae JOURNAL Nucleic Acids Res. 16, 3574-3574 (1988) STANDARD simple automatic COMMENT These data kindly submitted in computer readable form by: Kazuhiro Mitsui Department of Biochemistry Yamanashi Medical college Tamaho, Nakakoma-gun Yamanashi 409-38 Japan Phone: 0552-73-1111 x2257 FEATURES from to/span description pept 313 633 acidic ribosomal protein A1 signal 125 135 UASrpg box1 signal 182 192 UASrpg box2 signal 716 721 poly(A) signal variant 303 303 a in [1]; g in [3] variant 684 685 tt in [1]; t in [3] variant 719 719 a in [1]; t in [3] BASE COUNT 270 a 165 c 157 g 289 t ORIGIN 1 gatcttatta aactctagta tcttgtctaa tacttcattt aaaagaagcc ttaaccctgt 61 agcctcatct atgtctgcta catatcgtga ggtacgaata tcgtaagatg ataccacgca 121 actttgtaat gatttttttt ttttcatttt ttaaagaatg cctttacatg gtattgaaaa 181 aaatatctat aactttgcga tcctccttct gttctgaata atttttagta aaagaaatca 241 aaagaataag aaatagtccg ctttgtccaa tacaacagct taaaccgatt atctctaaaa 301 taacaagaag aaatgtctac tgaatccgct ttgtcttacg ccgccttgat tttggctgac 361 tctgaaatcg aaatctcttc tgaaaagttg ttgactttga ctaacgctgc caatgtccca 421 gatgaaaata tctgggctga tatttttgct aaggctttgg acggccaaaa cttgaaggac 481 ttattggtca acttcagcgc tggtgctgct gccccagctg gtgtcgctgg tggtgtcgct 541 ggtggtgaag ccggtgaagc cgaagctgaa aaggaagaag aagaagctaa agaagaatcc 601 gatgacgaca tgggtttcgg tttatttgat tagaagtgcc gcactgttta gaagaaattg 661 catattctaa catttaaaat tttttataat ttttctatat agtcgctttt aatacaataa 721 gacagtactt tctttttgtt caataccatc tttcgcatct cttctatgct atatataatg 781 ccacgttgtg ctcgaaggaa aagcctgcaa acctgactac tactaataca ataatgttcc 841 atcatatcaa gaaaactgcg ctaacttgta aaaatactgt c // LOCUS YSCCDC23X 3107 bp ds-DNA PLN 04-AUG-1990 DEFINITION S. cerevisiae CDC23 protein gene. ACCESSION D90081 KEYWORDS CDC23 protein. SOURCE Saccharomyces cerevisiae (strain X2180-1A or X2180-1B; cell line D22) DNA, clone YX34. ORGANISM Saccharomyces cerevisiae Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes; Endomycetales; Saccharomycetaceae. REFERENCE 1 (bases 1 to 3107) AUTHORS Doi,A. and Doi,K. TITLE Cloning and nucleotide sequence of the CDC23 gene of Saccharomyces cerevisiae JOURNAL Gene (1990) In press STANDARD full staff_entry COMMENT These data kindly submitted in computer readable form by: Kenji Doi The Institute of Scientific and Industrial Research Osaka University 8-1 Mihogaoka Ibaraki Osaka 567 Japan Phone: 06-877-5111 Fax: 06-877-4977 FEATURES from to/span description ORF 765 2645 ORF for CDC23 site 153 161 calcium-binding site BASE COUNT 1010 a 595 c 629 g 873 t ORIGIN 1 tcgagaatac cctgaagttt ctcagatgga acccatttat ccatttcata cactgtcact 61 gatggatcag acacttccac ctgctttgct aaatcaacag aaagtcgctt cagtaaattt 121 atgtacctta aagtatccct attcaaatgt tcgaaagtag aatagtactc gctaatactc 181 ttaggattct gtactcccgc tgcaacgtcc cttccagttt ttgtatcctc caaaagctgt 241 gcttctcttg tttgatattt atcgtatcgc aggcggatgg aactatttat cagctccctg 301 tgtaaatcag gcaacttctt gagggattca gtaagcagat catcagatga tctagggtct 361 gccaatactg ataatatatc taaaatattt aataagtggg tttggctttc ctgcaaactt 421 tgttcctcct cgcagagaga ttcaaaatac gtacgacctt cttcctttgt catgctatga 481 acttgataac ttgagcagtg taaacctgat aaactagtcg ctgttgtttc ttactgtaag 541 atactgcact tctgcagctt cttaagtatt ctacttacca agtttctatt atttttcaat 601 gcgcgtacat aaaaagcact tcgggtaaaa caaacacttc ataatagcag accaagtact 661 gcggtactca catcaaatta agaggaagaa gggagtatta gcgagcggaa aactgaaatc 721 tggatatata ctgatcagaa tcagattgtg aagcatttag aaccatgaat gacgacagcc 781 aggataaaat aatacatgat atacgtattc agctacgaaa ggctgccaca gaattatcac 841 gatggaagct atacggctcc tcaaagtggg cagcagaggc gctagcaggt cttgcagaag 901 ctattgatgt tgatcaaaca cactctttag ccgatgaatc gccactaaga aataaacaag 961 gtgtaccgaa acagatgttt gaaataccac aaaacgggtt tggcctatca gagactgagt 1021 atgacctgta cctccttggt tctacgttgt ttgatgctaa agagtttgat cgatgcgttt 1081 tttttctaaa agatgtcact aatccatacc ttaagttctt aaaattatac agtaaatttc 1141 tatcgtggga taagaaaagc caggaaagta tggaaaatat cttaactaca gggaagttta 1201 cggacgaaat gtacagagct aacaaagatg gggatggtag tgggaatgag gatataaatc 1261 aaagtgggca ccaacgcgcc aatttaaaaa tggtcagcaa tgagcatgag tcacaatcga 1321 acatatcatc tattttgaag gaaattaaca catttctgga gtcttatgaa ataaagatag 1381 acgatgatga ggccgattta gggttagcac tgttgtatta tttacgaggg gtcatcttaa 1441 agcaagagaa gaatatttct aaggcaatgt cgtcattctt gaaatctctg agttgctact 1501 cctttaactg gtcctgctgg ctggagttaa tggactgttt acaaaaggtt gacgatgcat 1561 tgcttttaaa taattatcta tatcaaaatt tccaattcaa attttctgaa aatcttggta 1621 gtcaacgaac gatagaattt aatataatga tcaaattttt caagctaaaa gtgtttgagg 1681 agcttaatgg ccagttagag gactactttg aagatttaga gtttttgtta caagttttcc 1741 ccaatttcac ttttttaaag gcttacaatg ctactattag ttacaacaat ttggattatg 1801 ttaccgcaga aagccgattt gatgacatcg ttaaacaaga tccgtaccgt ctcaacgatt 1861 tggaaaccta ctccaatatt ctatacgtca tgcagaagaa ttcaaaatta gcctatttgg 1921 cgcaattcgt ctcccaaata gatagattta gaccggaaac atgttgtatc atagcgaact 1981 attacagtgc ccgacaggaa catgaaaaat ctatcatgta tttccgtcga gcactaactt 2041 tggataaaaa aacaacaaac gcatggactt tgatgggtca cgaatttgtt gaactaagca 2101 attcacatgc cgcaatagaa tgctatcgtc gggccgtaga tatatgccct cgagacttca 2161 aagcatggtt tggtttgggc caggcttatg ctctcctgga catgcattta tattctcttt 2221 actacttcca gaaagcttgc actttgaaac cttgggatcg tcggatttgg caagtattgg 2281 gagaatgtta tagtaagacg ggaaataagg tagaagctat aaaatgctac aaaagatcca 2341 taaaagcttc acaaacggtc gatcaaaata cttcaatata ttaccggtta gcgcaactat 2401 atgaagaact tgaagacttg caagaatgta agaagttcat gatgaaatgt gtagatgtgg 2461 aagaacttct ggaaggtata gtaacagatg aaaccgtgaa ggctaggctt tggctggcaa 2521 tatttgagat taaggcagga aactaccaat tggcttatga ttatgccatg ggggtatcta 2581 gtggaacgtc tcaagagatt gaagaggctc gtatgctggc tcgggagtgc agaaggcata 2641 tgtagtgaag tgaacataca catagctatt cgtactaaat gatatgaaat ttttataaat 2701 gccaggctat atagctattt aaagtgacca tggcagaagg atgaaccgag gtaatacggc 2761 tagtacaaaa gcaacaaagt taggaataca atttgagaaa cgaagaccat agaaaatact 2821 tgtgcgattg aacttccttc caaaaaaaaa atagcgtcaa agaaagatga gtggactacc 2881 gcccccacct cctggttttg aagaggacag cgacttagca cttccaccac caccaccacc 2941 accgcctgga tacgaaatcg aagaactgga taatccgatg gtgccatcat cggtaaatga 3001 ggatacattc cttccgcctc caccacctcc tccaagcaac ttcgaaataa acgctgaaga 3061 aattgtggac ttcacattac caccgccacc accccctcca ggtctag // LOCUS BIM127 8482 bp ss-RNA VRL 04-AUG-1990 DEFINITION Bovine immunodeficiency-like virus, complete proviral genome. ACCESSION M32690 KEYWORDS complete genome. SOURCE Bovine immunodeficiency-like virus (isolate HXB3), proviral DNA, clone 127. ORGANISM Bovine immunodeficiency-like virus Viridae; ss-RNA enveloped viruses; Positive strand RNA virus; Retroviridae; Lentivirinae. REFERENCE 1 (bases 1 to 8482) AUTHORS Garvey,K.J., Oberste,M.S., Elser,J.E., Braun,M.J. and Gonda,M.A. TITLE Nucleotide sequence and genome organization of biologically active proviruses of the bovine immunodeficiency-like virus JOURNAL Virology (1990) In press STANDARD full staff_review COMMENT Kindly submitted in computer-readable form prior to publication by Dr. M.A. Gonda, Frederick Cancer Research Facility, Frederick, MD. Clones 127 and 106 (see , Accession number M32691) were obtained from a single virus isolation; the latter, 106, has 8391 nucleotides as the result of a sizeable deletion in env and two minor deletions in the LTR R regions. The Q and S orfs are named by analogy with similar orfs in visna; other short reading frames can be identified. FEATURES from to/span description pept 316 1746 gag polyprotein pept < 1572 4742 pol polyprotein pept 4601 5197 protein Q (put.) pept 5228 5539 protein S (put.) pept 5415 8129 env polyprotein BASE COUNT 2699 a 1798 c 2017 g 1968 t ORIGIN 1 ttcagctcgt gtagctcatt agctccgagc tccccaacct acagcctgag aggcactggc 61 tcggttgggt agccagcctt tcgggtaata aaggcttgtt ggcattcggc atctacccgt 121 gcctcctgtc ttgtcttact cgagcgaacc cacaactccg tcctgctgag ctcacagctc 181 gcggggcggt gaagaacacc caacagttgg cgcccaacgt ggggctcgag taagagagac 241 tcggctcgag taaaagaaga cccagctcga acgagaagac tccggacagg tgagtagttg 301 cgtgttttcc ccgggatgaa gagaagggag ttagaaaaga agcttcgtaa ggttagggtg 361 acaccccaac aggataaata ttatactata gggaatcttc aatgggccat tagaatgata 421 aatctaatgg ggatcaaatg tgtgtgtgac gaggagtgct cggcagcaga ggtagccctt 481 atcataaccc aattttcagc tttagactta gaaaattctc ctatcagagg taaggaggag 541 gtggccataa aaaatactct gaaggttttc tggtccctgc tggcggggta caaaccagag 601 agtacagaaa cggccctagg atattgggag gcctttacat atagagaaag ggaggccaga 661 gctgataagg aaggcgaaat taagagtatt tacccttccc taacacagaa cacacagaat 721 aagaagcaga catcgaatca gacaaacact caatcattac cagctatcac tactcaagat 781 ggtactccta ggtttgatcc tgacctcatg aagcagctta agatctggtc agacgccact 841 gaaagaaatg gggttgacct tcatgcagtg aatatattag gggtcattac agcaaaccta 901 gtacaggaag aaattaaact cctcttgaat agtacaccca agtggagatt agatgtacaa 961 cttatagaat caaaagtaag agagaaagaa aatgcccaca gaacgtggaa acagcatcat 1021 ccagaagccc caaaaacaga tgaaatcatc ggtaaggggc ttagttctgc tgaacaagcc 1081 accctgatct cagtagaatg cagagaaact ttcagacagt gggtgctgca ggcagctatg 1141 gaggtggcac aggcaaaaca tgctacccca ggtcccatca acattcatca gggacccaag 1201 gagccgtaca cagactttat aaatagatta gtggcagccc ttgaaggtat ggcggctcca 1261 gaaaccacaa aagaatactt actccaacat ctatctattg atcatgccaa tgaagactgc 1321 cagtctattc taagaccttt gggacccaac accccaatgg agaaaaaatt agaagcatgt 1381 agggtagtgg gatctcagaa atcaaagatg caatttttgg tagcagctat gaaagaaatg 1441 gggatccaat caccaattcc agcagtcttg cctcacacac cagaagcata tgcctcccaa 1501 acctcagggc ccgaggatgg taggagatgt tacggatgtg ggaagacagg acatttgaag 1561 aggaattgta aacagcaaaa atgctaccat tgtggcaaac ctggccacca agcaagaaac 1621 tgcaggtcaa aaaacgggaa gtgctcctct gccccttatg ggcagaggag ccaaccacag 1681 aacaattttc accagagcaa catgagttct gtgaccccat ctgcaccccc tcttatatta 1741 gattagacaa acagcctttt ataaaggtgt tcataggggg aagatgggta aaagggttag 1801 tagacactgg agcagatgag gtagtgctta agaacataca ttgggatagg ataaaagggt 1861 atccagggac accaattaaa caaattgggg taaatggagt aaatgtggcc aaaaggaaga 1921 cccacgtaga gtggagattt aaggataaga ctgggataat tgatgtcttg ttctcagata 1981 ctcctgtaaa cctttttggg agatctcttc tacgtagcat agtgacttgc ttcaccctac 2041 ttgttcacac agaaaaaatc gaacccctac ccgtcaaggt aaggggacca gggcctaagg 2101 taccccagtg gcccttgaca aaagaaaagt atcaggctct taaggaaatt gtgaaagatc 2161 ttttagcaga aggaaaaatt tccgaagctg cttgggataa cccatataat accccagttt 2221 ttgttataaa gaaaaaggga acgggaagat ggaggatgct aatggatttt agggaattaa 2281 ataagataac agttaaagga caagaattct ctacaggctt accttaccct ccaggaatta 2341 aggaatgtga acacttaact gcaatagata taaaagatgc ctactttact atccctttac 2401 atgaggactt tagacccttt acagccttct ctgtagtccc tgtaaatcga gaaggaccta 2461 tagagaggtt ccagtggaat gttctaccac aaggatgggt atgtagccct gccatttatc 2521 agactaccac ccagaagatt atagaaaaca ttaaaaagag tcacccagat gtcatgttgt 2581 atcaatatat ggatgatttg ttgattgggt ctaataggga tgatcataag caaatagtgc 2641 aggaaatcag ggataagtta ggatcatatg gtttcaagac tccagatgaa aaggtccagg 2701 aagagagagt gaaatggatc ggttttgagc tcacacccaa gaaatggcgt tttcagccca 2761 ggcaactaaa gataaaaaac ccactcacag taaatgaatt acagcaatta gtaggtaatt 2821 gtgtttgggt acagccagaa gtaaaaatcc ctctataccc cttaaccgat ctactgaggg 2881 ataagaccaa tctccaagaa aagatacaac taacaccaga agccatcaag tgtgtagaag 2941 aattcaatct aaaactaaaa gatccagaat ggaaagatag aataagagaa ggagcagaat 3001 tagtcataaa aatacagatg gttcctcggg gcatagtatt tgatctgttg caagatggaa 3061 atcccatatg gggaggagta aaaggactaa attatgatca ttcaaacaaa ataaaaaaga 3121 tacttagaac tatgaatgag ctgaacagaa cagtggtaat tatgacagga agagaagcta 3181 gtttcctgct tcctgggtct tctgaagatt gggaagcggc actccagaag gaagaaagtc 3241 taacacaaat attcccagta aagttttata ggcactcctg cagatggacc tccatatgtg 3301 ggccagtaag agaaaatcta accacctact atactgacgg agggaagaaa gggaaaacag 3361 ctgcagcagt atattggtgt gaaggaagga ctaagtcaaa ggtatttcca ggaaccaatc 3421 aacaggcgga attgaaggcc atatgcatgg ctctcttgga tggaccacca aaaatgaata 3481 tcataacaga tagtagatac gcctatgagg gaatgagaga agaaccagaa acgtgggcca 3541 gggaaggaat ctggctggag attgccaaga tattgccctt taagcagtac gtgggggtcg 3601 ggtgggtgcc tgcacataaa gggataggag gaaatacaga ggcagatgaa ggagttaaga 3661 aagccttaga acagatggcc ccgtgtagcc ctcctgaggc cattctatta aaaccaggag 3721 aaaaacaaaa tctggagaca gggatctaca tgcaggggct tagaccacaa agcttcctcc 3781 caagagcaga cttaccagta gccatcacag gaaccatggt agattcagag ctacagctac 3841 agctacttaa cataggaact gagcatataa gaatccaaaa agatgaggtc ttcatgacct 3901 gtttcctaga aaatatcccc tcagccactg aagatcatga gagatggcat acctcaccag 3961 acattttggt taggcagttc catctcccta agagaatagc taaagagata gtagccagat 4021 gccaagaatg taaaaggaca accactagcc cagtcagagg aacaaacccc agaggtcgat 4081 tcttatggca gatggacaat actcactgga ataaaacaat tatttgggta gcagtagaga 4141 caaattcagg attagtggaa gctcaggtga tccctgaaga aacagcacta caagtagctc 4201 tctgcatttt acagctaatc cagagatata cagttcttca cttacatagt gacaacgggc 4261 cgtgctttac tgcacacagg atagaaaatc tatgtaagta tctggggatc acaaaaacta 4321 cgggaatacc ctacaaccca caatcccagg gagttgtaga aagagcccac agagatctaa 4381 aagacagatt ggcagcttat cagggagatt gtgaaaccgt agaagcagcc cttagcctcg 4441 cattagtttc tttaaataaa aaaagagggg gaataggggg ccatacacca tatgaaatat 4501 acctagaatc agaacatacc aaataccaag accaactaga acaacaattt tcaaaacaaa 4561 aaattgaaaa gtggtgttac gtaaggaaca gaagaaagga atggaaagga ccctacaaag 4621 tgttgtggga cggagacggg gcagcagtaa tagaggaaga gggaaaaaca gccttatatc 4681 cacaccgtca tatgcgcttc atcccccccc cagattcaga tatccaagat gggagttcgt 4741 gaggcagaca gaatacagca tgaccgcgtg cgtaagaaaa gggaaattag tccttactta 4801 ccagtacgcg atctggaaaa gagtctggac gatagaaaca ggatttacag atccaagtct 4861 gtttatgacc ccagctggaa cacacaccac tgaagaaata ggtcacttag atctcttttg 4921 gcttaggtac tgttcatgtc cgcatgagat gcccccgtgg ctagacttcc ttagaggcac 4981 cctcaatcta cgcatttcct gtcgacgcgc tcttcaagcg tcagtgttga ctagcacccc 5041 tagacactcc ctccaacgct tagctgcact tcagctgtgc actaacgcat gtctctgttg 5101 gtacccgtta ggacgcatca acgacaccac cccgttgtgg ttgaactttt cgtctgggaa 5161 ggaaccaacg atccaacaac tgagtggcca cccctaactc gtcgtaacat tcatagattg 5221 tggcaatatg cccggacctt gggtggcgat gataatgttg ccacagccca aagaaagctt 5281 tggaggaaag ccaattggct ggcttttctg gaacacgtgc aaaggaccta ggcgggactg 5341 tccacattgt tgttgtccca tatgtagttg gcattgtcag ctttgctttt tgcagaaaaa 5401 tctaggaatc aactatggat caggacctag acggcgcgga acgcggggaa aggggaggag 5461 gatccgaaga actgcttcag gaggagatca acgaagggag gctgacagcc agagaagctt 5521 tacaaacatg gatcaataac ggtgagatcc acccttgggt cctggcagga atgctgtcca 5581 tgggagtagg aatgctacta ggagtatatt gtcagttacc agacacactg atttggatac 5641 taatgtttca attatgcctt tattggggtt tgggtgaaac atctagagaa ttagacaagg 5701 atagttggca gtgggtcaga agtgtattta taatagcaat attgggaact ctcactatgg 5761 caggaactgc tttggccgac gacgatcaaa gtactttaat ccccaatatc acaaaaattc 5821 ctacaaagga cacggaaccc ggttgcacct atccgtggat attaatcctc ttgattttgg 5881 ctttcatact gggaattctg ggtataatac ttgtcttgag acgcagcaac tcggaggata 5941 tattggcagc cagagatacc atagattggt ggctctcagc taatcaggaa atacctccaa 6001 agtttgcttt cccaataata ttaatatctt cccctctagc aggcataata ggatattatg 6061 tcatggaaag gcacttagag atcttcaaaa agggatgtca aatttgtggg agcctgagca 6121 gcatgtgggg aatgcttttg gaagaaattg gcaggtggct cgcacgtagg gaatggaatg 6181 ttagtagagt aatggttatc ctcttaatca gcttcagttg gggaatgtat gtcaataggg 6241 taaatgcctc agggtcacat gtagccatgg tcaccagccc tccagggtac cgcatagtga 6301 atgataccag ccaggcacct tggtattgct tctcctcggc accaatccca acgtgtagtt 6361 cctctcagtg gggagacaaa tattttgagg agaaaataaa cgagacactg gtcaaacagg 6421 tgtatgaaca ggccgcgaaa cattcgagag ccacatggat tgaacctgat ctattggagg 6481 aagcagtcta tgagctagct ctgttatcag ctaatgacag tcgtcaggtg gtggtagaaa 6541 atggtacaga cgtatgtagc tcacagaact cgagcacaaa caaaggccac ccaatgacgc 6601 ttctaaagtt gagagggcag gtgtcagaaa cttggatagg gaattcctcc ctccagtttt 6661 gtgtccagtg gccatatgtc ttggtaggtc ttaataatag tgatagtaat attagcttca 6721 attcgggaga ttggatagca accaattgta tgcacccaat tacactaaat aaaagtgcac 6781 aagatctagg aaaaaatttt ccgagactaa catttcttga cggacaactg tcccagttga 6841 agaacacact gtgcggacat aacacaaact gtttgaaatt tggaaacaag tccttcagta 6901 caaattccct aatactatgc caagacaacc ccatcggcaa cgacaccttt tatagcctaa 6961 gtcattcctt ctcaaaacag gcctctgccc ggtggattct tgtaaaggtc cccagctatg 7021 ggtttgtggt agtaaatgac acagatacac caccatccct ccgcatccga aagcctcgag 7081 cagtcggact agcaatattc ctgcttgtgc tggctatcat ggccatcaca tcctccttgg 7141 tggcagctac aacgctcgtg aaccagcaca cgacggctaa ggttgtggag agggttgtgc 7201 aaaatgtgtc atatattgct caaacccagg accaattcac ccacctgttc aggaatataa 7261 acaacagatt aaatgtccta caccatagag tttcatactt ggagtatgta gaggaaatca 7321 gacaaaaaca agtattcttt ggttgcaaac ctcatggaag gtattgccac tttgactttg 7381 gaccagagga agttggatgg aacaatagtt ggaatagcaa aacttggaat gatctacaag 7441 atgagtatga taagatagaa gaaaaaatat taaaaattcg agtggactgg ctcaatagct 7501 ccctgagtga cacacaggac acctttggcc tggagacctc tatttttgac catttagtgc 7561 aattgtttga ttggacttct tggaaagact ggataaaaat cattatagta atcattgtac 7621 tttggcttct gataaagatt ctcctaggta tgttaagaag ctgcgccaag gtcagccaga 7681 attaccaaca tctcccggcg gaggaggagg acggggacac agagccagaa agctccccgg 7741 cgagaggaga cccggcttct ggaagtctct acgagaattg gttgaacaaa ataggagaaa 7801 gcaagaacga cgcctatcgg gtctggacag aagaatacaa cagcttgagg atcttgttcg 7861 ccacatgtcg ctgggatctc ctgacccctc aactccttca gcttccgttc tttctgttaa 7921 ccctcctgct caaactcctt tgggacatct tccgccacgc tcctatttta aacttaaaag 7981 ggtggactgt ggggcagggt gggacctcag gacaacagca gcccccggac ttcccatatg 8041 tgaattggac tggatccagg gaacaaaata acccagaagg gggattagac tctggggctt 8101 ggtatgaagg cctgagaggt tctcagtaga ttgtaagtct tcggcgagac tgcatgtctg 8161 cacgtagaca ggaaatgttt atcttctcag ctgattgtgg ttaggccgat tactggaaac 8221 tagacaacct gattcattag tggttaagat tatgcataag tgctcgcaat gatgtagctg 8281 cttacgcttg cttactccgc cctgaaacgc ctaccttaac acgcaacacg cccacctgta 8341 agaatatata aaccatatct tcactctgta cttcagctcg tgtagctcat tagctccgag 8401 ctccccaacc tacagcctga gaggcactgg ctcggttggg tagccagcct ttcgggtaat 8461 aaaggcttgt tggcattcgg ca // LOCUS CEAVCG 9180 bp ss-RNA VRL 04-AUG-1990 DEFINITION Caprine arthritis encephalitis virus, complete proviral genome. ACCESSION M33677 KEYWORDS complete genome. SOURCE Caprine arthritis encephalitis virus (isolate Clements) proviral DNA, infectious molecular clone. ORGANISM Caprine arthritis encephalitis virus Viridae; ss-RNA enveloped viruses; Positive strand RNA virus; Retroviridae; Lentivirinae. REFERENCE 1 (bases 1 to 9180) AUTHORS Saltarelli,M., Querat,G., Konings,D.A.M., Vigne,R. and Clements,J.E. TITLE Nucleotide sequence and transcriptional analysis of an infectious clone of CAEV JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Kindly provided in computer-readable form prior to publication by Dr. Janice Clements, Johns Hopkins University, Baltimore, MD., 10 April 1990. The annotation shown below is tentative. FEATURES from to/span description pept 512 1858 gag polyprotein pept < 1717 5046 pol polyprotein (NH2-terminus uncertain) pept 5006 5695 protein Q (put.) pept 5688 5951 protein S (put.) pept 6012 8837 env polyprotein ORF 8405 8794 ORF 4 BASE COUNT 3505 a 1466 c 2301 g 1908 t ORIGIN 1 gagttctagg agagtccctc ctagtctctc ctctccgagg aggtaccgag acctcaaaat 61 aaaggagtga ttgccttact gccgagtgga gagtgattac tgagcggccg gtgtatcggg 121 agtcgtccct taatctgtgc aataccagag cggctctcgc agctggcgcc caacgtgggg 181 cccgaggaga agaaaagaaa gcggccctga gaactcggct tctgaaaaag aggaagagga 241 caagttgcta tagcaacaag agagaagaag tagagcaaag gtccagtggc tcggaaaaag 301 aggaactgaa acttcgggga cgcctgaagg agtaaggtaa gtgactctgc tgtacgcggg 361 gcgaggcaga ggtttccttc taaattgaaa gagaagtgtt gctgcgagag gtcttggtgg 421 tcgagaatcc tgtacaaaaa aaaggaggga tctcggtcag gaccaggacc cctgggagta 481 atacaacagc aacaccgtaa gaaaatccgc catggtgagt ctagatagag acatggcgag 541 gcaagtctcc ggggggaaaa gagattatcc tgagctcgaa aaatgtatca agcatgcatg 601 caagataaaa gttcgactca gaggggagca cttgacagaa ggaaattgtt tatggtgcct 661 taaaacatta gattacatgt ttgaggacca taaagaggaa ccttggacaa aagtaaaatt 721 taggacaata tggcagaagg tgaagaatct aactcctgag gagagtaaca aaaaagactt 781 tatgtctttg caggccacat tagcgggtct aatgtgttgc caaatgggga tgagacctga 841 gacattgcaa gatgcaatgg ctacagtaat catgaaagat gggttactgg aacaagagga 901 aaagaaggaa gacaaaagag aaaaggaaga gagtgtcttc ccaatagtag tgcaagcagc 961 aggagggaga agctggaaag cagtagattc tgtaatgttc cagcaactgc aaacagtagc 1021 aatgcagcat ggcctcgtgt ctgaggactt tgaaaggcag ttggcatatt atgctactac 1081 ctggacaagt aaagacatac tagaagtatt ggccatgatg cctggaaata gagctcaaaa 1141 ggagttaatt caagggaaat taaatgaaga agcagaaagg tggagaagga ataatccacc 1201 acctccagca ggaggaggat taacagtgga tcaaattatg ggggtaggac aaacaaatca 1261 agcagcagca caagctaaca tggatcaggc aaggcaaata tgcctgcaat gggtaataaa 1321 tgcattaaga gcagtaagac atatggcgca caggccaggg aatccaatgc tagtaaagca 1381 aaaaacgaat gagccatatg aagattttgc agcaagactg ctagaagcaa tagatgcaga 1441 gccagttaca cagcctataa aagattatct aaagctaaca ctatcttata caaatgcatc 1501 agcagattgt cagaagcaaa tggatagaac actaggacaa agagtacaac aagctagtgt 1561 agaagaaaaa atgcaagcat gtagagatgt gggatcagaa gggttcaaaa tgcaattgtt 1621 agcacaagca ttaaggccag gaaaaggaaa agggaatgga cagccacaaa ggtgttacaa 1681 ctgtggaaaa ccgggacatc aagcaaggca atgtagacaa ggaatcatat gtcacaactg 1741 tggaaagaga ggacatatgc aaaaagaatg cagaggaaag agagacataa ggggaaaaca 1801 gcagggaaac gggaggaggg ggatacgtgt ggtgccgtcc gctcctccta tggaataact 1861 tcagcaccac ctatggttca ggtccgcata ggttcccagc agaggaactt gttatttgat 1921 accggggcgg accgaactat agttagatgg catgagggct cgggaaaccc agccggaagg 1981 ataaaactgc aaggaatagg aggaatagta gaaggagaaa aatggaataa tgtagaatta 2041 gaatataaag gagaaacaag aaagggaaca atagtagtgt taccacaaag tccagtagaa 2101 gtattaggac gagataacat ggcccgattt ggaataaaga taataatggc aaatttagag 2161 gaaaaaagaa tcccaattac aaaagtaaaa ttgaaagagg gatgtacggg tccacatgtc 2221 ccacaatggc cattaacaga agagaaatta aaaggtctaa cagaaatcat agataaatta 2281 gtggaagaag gaaaactagg aaaggcaccc ccacattgga catgtaatac tccaatcttt 2341 tgcataaaaa agaaatcagg gaagtggaga atgttaatag atttcagaga attgaacaaa 2401 cagacagaag atttaacaga agcgcagtta ggactcccgc atccgggagg actacaaaag 2461 aaaaaacatg ttacaatatt ggacatagga gatgcatatt ttactatacc cctatatgaa 2521 ccatatcgag agtacacatg ttttactcta ttaagtccta ataatctagg accatgtaaa 2581 agatactatt ggaaagtgct gccacaaggt tggaaattga gtccatctgt atatcaattt 2641 actatgcagg agatcttaga ggattggata cagcagcatc cagaaattca atttggcata 2701 tatatggatg atatttacat aggaagtgat ttagaaatta aaaagcatag agaaatagtg 2761 aaagatttag ccaattatat tgcccaatat ggattcactc tgccagaaga gaagagacaa 2821 aagggatatc cagcaaaatg gctaggattt gaactacacc cgcagacctg gaaatttcag 2881 aagcatacat tacctgaatt aacaaaggga acaataacat taaataaatt acagaaatta 2941 gtaggagaat tagtatggag acaatccata attgggaaaa gcattcctaa cattctgaaa 3001 ttaatggaag gagatagaga attacaaagt gaaagaaaaa ttgaagaagt acatgtgaaa 3061 gaatgggaag catgtaggaa aaaattagaa gaaatggaag gaaattatta taataaagac 3121 aaagatgtct atggacaatt ggcttgggga gacaaagcta tagaatatat agtgtatcag 3181 gagaaaggga aaccattatg ggtaaatgtg gttcacaata taaagaacct aagcatcccg 3241 caacaggtta ttaaagcagc gcaaaaatta acccaagaag tcatcattag gacaggaaaa 3301 ataccatgga tattgttgcc agggaaagaa gaagattgga gactagaatt gcaattaggg 3361 aacatcacat ggatgccaaa attttggtcc tgttatcgag gacatacaag atggagaaaa 3421 agaaatataa tagaagaagt agtagaaggg cctacatatt atacagatgg aggaaaaaag 3481 aataaagtag gaagtctagg gttcatagta tcaacagggg aaaaatttag aaagcatgaa 3541 gagggcacaa accagcaact agaattaaga gccatagagg aagctctaaa acaagggcct 3601 caaacaatga atttagtaac agatagtaga tatgcatttg aatttttatt aagaaattgg 3661 gatgaagaag taataaagaa tccaattcaa gcaagaatta tggaaattgc ccacaagaaa 3721 gataggatag gagtgcattg ggtgccagga cataaaggga ttccccaaaa tgaagaaata 3781 gacaaatata tttcggaaat atttcttgca aaagaaggag aaggaattct cccaaaaaga 3841 gaagaggatg cagggtatga tttaatatgc ccagaagagg ttaccataga gccaggacaa 3901 gtgaaatgca tccccataga gctaagatta aatttaaaga aatcacaatg ggctatgatt 3961 gctacaaaaa gcagcatggc tgccaaagga gtgttcacac aaggaggaat catagactca 4021 ggatatcagg gacaaataca ggtaataatg tataatagca ataaaatagc agtagtcata 4081 ccccaaggga gaaaatttgc acaattaata ttaatggata aaaagcatgg aaaattggaa 4141 ccctgggggg aaagcagaaa aacagaaagg ggagaaaaag gatttgggtc tacaggaatg 4201 tattggatag aaaatattcc tctggcagag gaagaccaca caaaatggca tcaagatgcc 4261 cgatcattgc atctagaatt tgaaattcca agaacagcag cagaagacat agtaaatcaa 4321 tgtgaaatat gcaaagaagc gaggacacct gcagtaatta gaggcggaaa caaaaggggg 4381 gtaaatcatt ggcaagtgga ttatacccat tatgaaaata tcatactatt agtatgggta 4441 gaaacaaatt caggactaat atatgcagaa aaagtaaaag gagaatcagg gcaagaattc 4501 agaataaaag tgatgcattg gtatgcatta tttggtccag agtcattgca gtcagacaat 4561 ggacctgcat ttgcagcaga gcccacacag ctgttaatgc aatacctagg agtaaaacac 4621 acaacaggca taccttggaa tccacagtct caggctatag tagaaagggc acatcaacta 4681 ttgaaaagca ctttaaagaa gttccagcca caatttgtcg ctgtagaatc agccatagca 4741 gcagccctag tcgccataaa tataaaaaga aagggtgggc tggggacaag ccctatggat 4801 atttttatat ataataaaga acagaaaaga ataaataata aatataataa aaattctcaa 4861 aaaattcaat tctgttatta cagaataagg aaaagaggac atcaggagag tggaaaggac 4921 caacccaggt actgtggaaa ggggaaggag ccaattgtgg taaaggatat agaaagtgaa 4981 aagtatttag taatacctta caaagatgca aaattcatcc cgccaccaac aaaagaaaag 5041 gaataaaaaa cctggaccag aattaccctt agcactatgg atacatatag cagaaagcat 5101 taatggggat agctcatggt acataacaat gagactgcaa cagatgatgt ggggaaaaag 5161 aggaaataag ttacaatata agaatgaaga cagggaatat gaaaattggg aaattacatc 5221 atggggatgg aaaatgcacc taaggagagt gaaacaatgg atacaagaca acaggagagg 5281 aagcccatgg cagtacaaag taggaggaac atggaaaagt ataggagtgt ggttcctgca 5341 agcaggagat tacagaaagg tagacaggca cttctggtgg gcatggagga tactgatatg 5401 ttcctgcagg aaagaaaagt ttgatataag agaatttatg agaggaagac atagatggga 5461 tttgtgcaaa tcctgtgctc aaggagaagt agtaaagcat actagaacaa aaagtctgga 5521 aagactagta ctgctacaga tggtagaaca gcatgtgttt caagtattgc cattgtggag 5581 agccaggaga agtagtacaa cagatttccc atggtgcagg gacacaacgg gatacacgca 5641 tgcgtggtct gtccaggagt gctggttgat ggaatatctc ttagaggatg agtgaagaac 5701 tgcctcaaag aagggagaca catccagaag aacttgtaag gaacgtacgg gaaagagaaa 5761 gggatacatg gcaatggaca agcatcagag tacctgcgga aatactgcaa agatggcttg 5821 ctatgcttag gtcaggcaga aatagaaaga aagtgtatag agaaatgcaa aaatggatgt 5881 ggatacatcc caaggcgcct gtgattaggg cctgtggatg cagactatgt aacccggggt 5941 ggggaacata atcaagggaa taataaatgc aaataaatgt aactaacaag tagcaaaagt 6001 gtctgtgtta gatggatgct ggggccagat acatgcgctt aactgggaag gaaaactggg 6061 ttgaagtaac catggacgga gagaaggaaa ggaaaagaga aggtttcact gcgggacagc 6121 aaggtaagta tcaaccccag gtaagtaagc aaatagggaa cagaaatact aacccatgct 6181 ttgcctataa agggatattc ctatggagga tatcactaac aatgtggata ttgctaggga 6241 taaatatgtg tgtcagtgca gaggattaca taacactaat atcagatccc tatgggttct 6301 cacccataaa aaatgtgtct ggggtaccag tgacttgtgt aacaaaagaa ttcgcaaaat 6361 ggggatgtca accactagga gcgtaccctg atccagaaat agaatacaga aatgtgagtc 6421 aggaagtagt gaaagaagta tatcaagaga attggccatg gaatacatat cattggcctc 6481 tctggcaaat ggagaatgtt aggtactggt taaaagaaaa tatgcaagaa aatcaacaga 6541 gaaaaaataa tacaaaagag ggtatagagg aattattagc aggaactata aggggaagat 6601 tctgtgtacc atacccattt gccttgttaa aatgcacaaa gtggtgctgg tatacagcgg 6661 ccataaacaa cgagtcagga aaagcaggaa aaataaaaat aaattgcaca gaagcaagag 6721 cagtctcctg tacagaggac atgccattag cctcaataca aagagcatat tgggatgaga 6781 aagacagaga gagcatggcc tttatgaata tcaaagcatg tgatagcaac ctaaggtgtc 6841 agaaaagacc tggagggtgt atggaaggat accctatccc agtaggagca gaaataatcc 6901 ctgaaagtat gaaataccta aggggagcaa agagtcagta tgggggaata aaagataaga 6961 atggagaatt aaaattacca ttaacattaa gagtgtgggt aaaattagca aatgtgtcag 7021 aatgggtaaa tgggacaccc ccggattggc aagacagaat taacggatcc aaaggaataa 7081 atgggacgct ctggggagag cttaacagta tgcatcacct aggatttgcc cttagccaga 7141 acggcaaatg gtgtaactac accggggaaa taaaattagg gcaagaaaca ttccaatatc 7201 attacaagcc aaactggaac tgtaccggga attggacgca atatccggtg tggcaagtga 7261 ttagaaacct ggatatggtg gaacatatga caggagaatg tgtgcagaga ccacaaaggc 7321 acaatataac agtaggaaat ggaaccataa cagggaattg cagtacaaca aactgggatg 7381 gatgtaattg ctcacgatca ggaaactacc tatataacag ctctgaggga ggattgttat 7441 taattctgtg cagacaaaac agcaccctaa caaggatcct gggaacaaat acaaattgga 7501 caactatgtg gggaatatac aaaaattgtt caggatgcga gaatgcaaca ttagacaaca 7561 caggagaagg aaccttagga ggtgtagcta ataagaactg tagcttgcct cataaaaatg 7621 agagcaacaa gtggacttgt gccccaagac aaagagatgg aaaaacagat tcgctataca 7681 tagcaggagg aaaaaagttt tggacacgaa ttaaggccca attcagctgt gaaagtaaca 7741 taggacaatt agatggaatg ttgcatcagc aaatactatt gcaaaaatat caagtaatta 7801 aggtaagagc ttatacatat ggggtgatag aaatgccaga aaactatgca aaaacaagaa 7861 tcataaacag gaaaaaaaga gaactcagcc acaagaggaa gaagagaggc gttggcttgg 7921 tcattatgct agttatcatg gcaatagtag ctgccgcagg ggcttctctg ggagtcgcaa 7981 acgcgattca gcagtcttac actaaggcag ctgtccagac ccttgctaat gcaactgctg 8041 cacagcagga tgtgttagag gcaacctatg ccatggtaca gcatgtggct aaaggcgtac 8101 gaatcttgga agctcgagtg gaagctatca cagatagaat aatgctatac caagaattgg 8161 attgttggca ctatcatcaa tactgtataa cctctacaaa aacagaagta gcaaaatata 8221 tcaattggac gaggtttaag gataattgca catggcagca gtgggagaga ggattacagg 8281 ggtatgatac aaacttaaca atactgttaa aggaatcagc agcaatgaca caactagcag 8341 aagagcaagc aaggaggata ccagaagtat gggaaagttt aaaagacgtc tttgattggt 8401 caggatggtt ctcatggcta aagtatattc ctattatagt agtaggatta ttaggatgca 8461 ttctgataag agctgtgata tgtgtatgtc aacctcttgt gcagatatac agaactctaa 8521 gtaccccgac ataccaacgg gtcacagtca tcatggaaac aagagcagac gtcgcaggag 8581 aaaatcagga ttttggcgat ggcttagagg aatcagacaa cagcgaaaca agcgaaagag 8641 tgacagtaca gaaagcttgg agccgtgcct gggagctttg gcagaactca ccctggaagg 8701 agccatggaa aaggggcctg ctgaggctgc tcgtccttcc gctgacgatg ggaatctgga 8761 taaatggatg gcttggagaa caccacaaaa ataaaaaaag aaagggtgac tgtgagacat 8821 gggctaaaga ggactaataa caagctaggc caaattcctg taaatcactt ggggggttat 8881 aagaaaagca agttcactat gacaaagcaa aatgtaaagg ccaaattcct gtaaatcact 8941 tggggggtta taagaaaagc aagttcacta tgacaaagca aaatgtaacc gcaagtgctg 9001 acagatgtaa cagctgacat atcagctgat gcttgctcat gctgacactg tagctctgag 9061 ctgtatataa ggagaagctt gctgcttgca cttcagagtt ctaggagagt ccctcctagt 9121 ctctcctctc cgaggaggta ccgagacctc aaaataaagg agtgattgcc ttactgccga // LOCUS CEAVLTRM 522 bp ss-RNA VRL 04-AUG-1990 DEFINITION Caprine arthritis encephalitis virus proviral long terminal repeat. ACCESSION M14149 KEYWORDS long terminal repeat. SOURCE Caprine arthritis encephalitis virus (CAEV) unintegrated proviral DNA. ORGANISM Caprine arthritis encephalitis virus Viridae; ss-RNA enveloped viruses; Positive strand RNA virus; Retroviridae; Lentivirinae. REFERENCE 1 (bases 1 to 522) AUTHORS Hess,J.L., Pyper,J.M. and Clements,J.E. TITLE Nucleotide sequence and transcriptional activity of the caprine arthritis-encephalitis virus long terminal repeat JOURNAL J. Virol. 60, 385-393 (1986) STANDARD full staff_review COMMENT [1] presents the LTR for CAEV in a fashion that includes sequence data downstream of the 5' LTR (bases 499-522) and sequence data upstream of the 3' LTR (bases 1-48). For this reason, two "pre-msg" entries are found in the FEATURES table, one for the 5'end of the unspliced transcript and the other for the 3' end of the unspliced transcript. Similarly, both the TATA box and the polyadenylation signal are annotated. The CAEV LTR and the Visna LTR are approximately 50% homologous. [1] did not find a significant level of homology of CAEV LTR with any other retroviral LTR. [1] showed that, unlike Visna and HIV, CAEV LTR is not transactivated. Plasmids constructed with CAEV LTR and CAT showed no evidence of transactivated up-regulation in four cell types (SCP, GSM, SAM and primary goat macrophages). This may explain CAEV's lack of cytotoxicity. FEATURES from to/span description pre-msg < 1 420 genomic mRNA, 3' end (see COMMENT) pre-msg 336 > 522 genomic mRNA, 5' end (see COMMENT) site 336 336 mRNA cap site LTR 49 498 long terminal repeat rpt 87 157 71 bp direct repeat, 5' copy rpt 158 228 71 bp direct repeat, 3' copy rpt 336 420 R repeat binding 499 515 Lys-tRNA primer binding site signal 306 310 TATA box signal 393 398 poly-A signal BASE COUNT 163 a 108 c 137 g 114 t ORIGIN 84 bp upstream of AluI site. 1 aaatggatgg cttggagaac accacaaaaa taaaaaaaga aagggtggac tgtgagacat 61 gggctaaaga ggactaataa caagctaggc caaattcctg taaatcactt ggggggttat 121 aagaaaagca agttcactat gacaaagcaa aatgtaaagg ccaaattcct gtaaatcact 181 tggggggtta taagaaaagc aagttcacta tgacaaagca aaatgtaacc gcaagtgctg 241 acagatgtaa cagctgacat atcagctgat gcttgctcat gctgacactg tagctctgag 301 ctgtatataa ggagaagctt gctgcttgca cttcagagtt ctaggagagt ccctcctagt 361 ctctcctctc cgaggaggta ccgagacctc aaaataaagg agtgattgcc ttactgccga 421 gtggagagtg attactgagc ggccggtgta tcgggagtcg tcccttaatc tgtgcaatac 481 cagagcggct ctcgcagctg gcgcccaacg tggggcccga gg // LOCUS CEAVPOL 651 bp ss-RNA VRL 04-AUG-1990 DEFINITION Caprine arthritis encephalitis virus proviral pol gene, partial cds. ACCESSION K03327 KEYWORDS . SOURCE Caprine arthritis encephalitis virus proviral DNA. ORGANISM Caprine arthritis encephalitis virus Viridae; ss-RNA enveloped viruses; Positive strand RNA virus; Retroviridae; Lentivirinae. REFERENCE 1 (sites; for [2]) AUTHORS Chiu,I.-M., Yaniv,A., Dahlberg,J.E., Gazit,A., Skuatz,S.F., Tronick,S.R. and Aaronson,A. TITLE Nucleotide sequence evidence for relationship of AIDS retrovirus to lentiviruses JOURNAL Nature 317, 366-368 (1985) STANDARD full staff_review REFERENCE 2 (bases 1 to 651) AUTHORS Chiu,I.-M. JOURNAL Unpublished (1985) Ohio State U. Medical Sch., Columbus, Ohio STANDARD full staff_review COMMENT Draft entry and printed sequence for [1] kindly submitted by I.-M.Chiu, 09-OCT-1985. FEATURES from to/span description pept < 1 > 651 pol protein (reverse transcriptase; AA at 1) BASE COUNT 269 a 102 c 122 g 158 t ORIGIN 434 bp upstream of BglII site 1 gtcccacaat ggccattgac agaagaaaaa ttaaaaggtc tatcagaaat agtagataaa 61 ttagtaaaaa aaaaaaaact aggaaaggca cccccacatt ggacatgtaa tactccaata 121 ttttgcataa aaaagaaatc aggaaaatgg aaaatgttaa tagatttcag aaaattaaac 181 aaacaaacag aggatttaac agaagcgcaa ttaggacttc cgcatccggg gggactacaa 241 aagaaaaaac atgttacaat attagacata ggagatgcat attttactat acccctatat 301 aaaccatatc gagagtacac ttgctttact ctattaagtc ctaataatct aggaccatgt 361 aaaagatatt attggaaggt tctaccacaa ggttggaaac tgagtccatc tgtatatcaa 421 tttaccatgc aggagatctt aggggaatgg atacaggagc atcctgaaat tcaatttaga 481 atatatatgg atgatatcta cataagaagt gatttagaaa taaaaaagca tagagagata 541 gtggaggaat tagccaatta tattgcccaa tatagattca ctctgccaga agagaaaagg 601 caagagagat atccagcaaa atggttaggg tatgaactac atcctcaggc c // LOCUS EAEVWU5 2580 bp ss-RNA VRL 04-AUG-1990 DEFINITION Equine arthritis encephalitis virus, envelope gene, complete cds. ACCESSION X16988 KEYWORDS envelope protein. SOURCE Equine arthritis encephalitis virus, clone pWU5. ORGANISM Equine arthritis encephalitis virus Viridae; ss-RNA enveloped viruses; Positive strand RNA virus; Retroviridae; Lentivirinae. REFERENCE 1 (bases 1 to 2580) AUTHORS McGuire,T.C., Lacy,P.A. and O'Rourke,K. TITLE cDNA sequence of the env gene of a pathogenic equine infectious anemia lentivirus variant JOURNAL Nucleic Acids Res. 18, 196 (1990) STANDARD full staff_review COMMENT Kindly submitted in computer-readable form by K. O'Rourke, Washington State University, Pullman WA. FEATURES from to/span description pept 1 2580 env polyprotein BASE COUNT 891 a 428 c 565 g 696 t ORIGIN Start of env cds. 1 atggtcagca tcgcattcta tggggggatc ccagggggaa tctcaacccc tattacccaa 61 cagtcagaaa aatccaagta tgaggaaaac acaatgtttc aaccttattg ttataataat 121 gacagtaaga acagcatggc agaatcgaag gaagcaagag accaagaaat gaacctgaaa 181 gaagaatcta aagaagaaaa aagaagaaat gactggtgga aaataggtat gtttctgtta 241 tgcttagcag gaactactgg aggaatactt tggtggtatg aaggactccc acagcaacat 301 tatatagggt tggtagcgat agggggaaga ttaaacggat ctggccaatc aaatgctata 361 gaatgctggg gttccttccc ggggtgtaga ccatttcaaa attacttcag ttatgagacc 421 aatagaagca tgcatatgga taataatact gctacattat tagaagctta tcatagagag 481 ataacattca tttataagtc ttcttgcaca gatagtgatc attgtcaaga gtatcaatgt 541 aaaaaagtta atcttaattc ctctgactcc tctaactctg tacgtgttga ggatgtaacg 601 aacacagcgg aatattgggg atttaaatgg ctagaatgta atcaaacaga aaattttaag 661 actatattag tacctgaaaa tgaaatggta aatatcaatg atactgatac ctggatacct 721 aaggggtgta atgagacgtg ggcaagagtg aaacgttgtc ctatagatat tttatatggg 781 atacatccaa tcaggctgtg tgtacagcca ccattttttc tggtacagga gaaagggatt 841 gctgatactt ctagaattgg caattgtggg ccaacaatat ttcttggggt tttagaagat 901 aataagggag tagtacgggg ggattataca gcctgcaatg tgagtcgcct aaatataaat 961 agaaaggatt atacagggat ctatcaagta cctatatttt atacatgtac tttcactaac 1021 ataacttcct gcaataatga gccaataatc agtgttatca tgtatgaaac aaaccaggta 1081 caatatttat tgtgtaataa taataatagt aataattata attgtgtagt acaaagtttt 1141 ggagttatag gacaggcaca cttagaactg cctagaccta acaaaagaat aaggaatcaa 1201 agctttaacc aatataactg ctctataaat aacaaaacag aattagaaac atggaagtta 1261 gtaaagactt ctggcataac tcctttacct atttcttctg aagctaacac tggactaatt 1321 agacataaga gagattttgg tataagtgca atagtggcag ctattgtagc cgctactgct 1381 attgctgcta gcgctactat gtcttatgtt gctctaactg aggttaacaa aataatggaa 1441 gtacaaaatc atacttttga ggtagaaaat agtactctaa atggtatgga tttaatagaa 1501 cgacaaataa agatattata tgctatgatt cttcaaacac atgcagatgt tcaactgtta 1561 aaggaaagac aacaggtaga ggagacattt aatttaattg gatgtataga aagaacacat 1621 gtattttgtc atactggtca tccctggaat atgtcatggg gacatttaaa tgagtcaaca 1681 caatgggatg actgggtaag caaaatggaa gatttaaatc aagagatact aactacactt 1741 catggagcca ggaacaattt ggcacaatcc atgataacat tcaatacacc agatagtata 1801 gctcaatttg gaaaagacct ttggagtcat attggaaatt ggattcctgg attgggagct 1861 tccattataa aatatatagt gatgtttttg cttatttatt tgttactaac ctcttcgcct 1921 aagatcctca gggccctctg gaaagtgacc agtggtgcag ggtcctccgg cagtcgttac 1981 ctgaagaaaa aattccatca caaacatgca tcacgagaag acacctggga ccaggcccaa 2041 cacaacatac acctagcagg cgtgaccggt ggatcagggg acaaatacta caagcagaag 2101 tactccagga acgactggaa tggagaatca gaggagtaca acaggcggcc aaagagctgg 2161 gtgaagtcaa tcgaggcatt tggagagagc tatatttccg agaagaccaa aggggagatt 2221 tctcagcctg gggcggctat caacgagcac aagaacggct ctggggggaa caatcctcac 2281 caagggtcct tagacctgga gattcgaagc gaaggaggaa acatttatga ctgttgcatt 2341 aaagcccaag aaggaactct cgctatccct tgctgtggat ttcccttatg gctattttgg 2401 ggactagtaa ttatagtagg acgcatagca ggctatggat tacgtggact cgctgttata 2461 ataaggattt gtactagagg cttaaatttg atatttgaaa taatcagaaa aatgcttgat 2521 tatattggaa gagctttaaa tcctggcaca tctcatgtat caatgcctca gtatgtttag // LOCUS EIAV 8344 bp ss-RNA VRL 04-AUG-1990 DEFINITION Equine infectious anemia virus (EIAV), complete proviral genome. ACCESSION M16575 M11337 K03334 M14855 KEYWORDS complete genome. SOURCE Equine infectious anemia virus proviral DNA, clones 1369 and 409-2 [1]. ORGANISM Equine infectious anemia virus Viridae; ss-RNA enveloped viruses; Positive strand RNA virus; Retroviridae; Lentivirinae. REFERENCE 1 (bases 1 to 8229) AUTHORS Kawakami,T., Sherman,L., Dahlberg,J., Gazit,A., Yaniv,A., Tronick,S.R. and Aaronson,S.A. TITLE Nucleotide sequence analysis of equine infectious anemia virus proviral DNA JOURNAL Virology 158, 300-312 (1987) STANDARD full staff_review REFERENCE 2 (bases 324 to 5122) AUTHORS Stephens,R.M., Casey,J.W. and Rice,N.R. TITLE Equine infectious anemia virus gag and pol genes: Relatedness to visna and AIDS virus JOURNAL Science 231, 589-594 (1986) STANDARD full staff_review REFERENCE 3 (bases 2300 to 2950) AUTHORS Chiu,I.-M. JOURNAL Unpublished (1985) Meloy Labs Inc, Springfield, VA 22151 STANDARD full staff_review REFERENCE 4 (sites; pept) AUTHORS Chiu,I.-M., Yaniv,A., Dahlberg,J.E., Gazit,A., Skuatz,S.F., Tronick,S.R. and Aaronson,A. TITLE Nucleotide sequence evidence for relationship of AIDS retrovirus to lentiviruses JOURNAL Nature 317, 366-368 (1985) STANDARD full staff_review REFERENCE 5 (bases 4769 to 8048; 8069 to 8344) AUTHORS Payne,S.L. JOURNAL Unpublished (1987) Dept Biochem, Louisiana St Univ, Baton Rouge, LA STANDARD full staff_review REFERENCE 6 (bases 4769 to 8048; 8069 to 8228) AUTHORS Rushlow,K., Olsen,K., Stiegler,G., Payne,S.L., Montelaro,R.C. and Issel,C.J. TITLE Lentivirus genomic organization: The complete nucleotide sequence of the env gene region of equine infectious anemia virus JOURNAL Virology 155, 309-321 (1986) STANDARD full staff_review REFERENCE 7 (sites; correction of 5348 [1] personal communication) AUTHORS Tronick,S.R. and Fuller,F. JOURNAL Unpublished (1989) STANDARD full staff_review COMMENT Correction of [1] by [7] leads to agreement with [6] regarding the env cds. The sequence starting with coordinate 8230 is horse cellular DNA. FEATURES from to/span description pept 465 1925 gag polyprotein (put.) pept < 1682 5122 pol polyprotein (AA at 1682; NH2-terminus uncertain) pept 5312 7891 envelope polyprotein pept < 7234 7641 3' ORF protein (AA at 7233; NH2-terminus uncertain) pre-msg 209 8193 EIAV genomic RNA cell 8230 8344 horse cellular DNA binding 324 342 primer (Lys-tRNA) binding site variant 342 342 c in [1], t in [2] variant 2341 2341 t in [1], c in [2],[3] variant 2349 2349 t in [1], c in [2] variant 2436 2436 g in [1], a in [3] variant 2677 2677 t in [1], a in [2] variant 2865 2870 tggaag in [1], agtaaa in [2] variant 2870 2870 g in [1], a in [3] variant 3777 3777 g in [1], a in [2] variant 3858 3858 g in [1], a in [2] variant 4529 4530 ac in [1], ca in [2] variant 4573 4573 a in [1], g in [2] revision 5348 5348 gg in [6],[7]; [g] in [1] variant 6348 6348 a in [1], g in [5],[6] variant 6350 6350 g in [1], a in [5],[6] variant 7060 7060 c in [1], t in [5],[6] variant 8048 8069 gagttccgcatttgtgacgcgt in [1], gt in [5],[6] variant 8073 8073 g in [1], c in [5],[6] variant 8098 8098 a in [1], g in [5],[6] variant 8104 8104 g in [1], a in [5],[6] variant 8118 8119 ga in [1], gca in [5],[6] mRNA 210 8193 EIAV genomic mRNA BASE COUNT 2984 a 1358 c 1846 g 2156 t ORIGIN 1 tgtggggttt ttatgagggg ttttataaat gattataaga gtaaaaagaa agttgctgat 61 gctctcataa ccttgtataa cccaaaggac tagctcatgt tgctaggcaa ctaaaccgca 121 ataaccgcat ttgtgacgcg agttccgcat ttgtgacgcg ttaagttcct gtttttacag 181 tatataagta cttgtgttct gacaattggg actcagattc tgcggtctga gtcccttctc 241 tgctgggctg aaaaggcctt tgtaataaat ataattctct actcagtccc tgtctctagt 301 ttgtctgttc gagatcctac agttggcgcc cgaacaggga cctgagaggg gcgcagaccc 361 tacctgttga acctggctga tcgtaggatc cccgggacag cagaggagaa cttacagaag 421 tcttctggag gtgttcctgg ccagaacaca ggaggacagg taagatggga gaccctttga 481 catggagcaa ggcgctcaag aagttagaga aggtgacggt acaagggtct cagaaattaa 541 ctactggtaa ctgtaattgg gcgctaagtc tagtagactt atttcatgat accaactttg 601 taaaagaaaa ggactggcag ctgagggatg tcattccatt gctggaagat gtaactcaga 661 cgctgtcagg acaagaaaga gaggcctttg aaagaacatg gtgggcaatt tctgctgtaa 721 agatgggcct ccagattaat aatgtagtag atggaaaggc atcattccag ctcctaagag 781 cgaaatatga aaagaagact gctaataaaa agcagtctga gccctctgaa gaatatccaa 841 tcatgataga tggggctgga aacagaaatt ttagacctct aacacctaga ggatatacta 901 cttgggtgaa taccatacag acaaatggtc tattaaatga agctagtcaa aacttatttg 961 ggatattatc agtagactgt acttctgaag aaatgaatgc atttttggat gtggtacctg 1021 gccaggcagg acaaaagcag atattacttg atgcaattga taaaatagca gatgattggg 1081 ataatagaca tccattaccg aatgctccac tggtggcacc accacaaggg cctattccca 1141 tgacagcaag gtttattaga ggtttaggag tacctagaga aagacagatg gagcctgctt 1201 ttgatcagtt taggcagaca tatagacaat ggataataga agccatgtca gaaggcatca 1261 aagtgatgat tggaaaacct aaagctcaaa atattaggca aggagctaag gaaccttacc 1321 cagaatttgt agacagacta ttatcccaaa taaaaagtga gggacatcca caagagattt 1381 caaaattctt gactgataca ctgactattc agaacgcaaa tgaggaatgt agaaatgcta 1441 tgagacattt aagaccagag gatacattag aagagaaaat gtatgcttgc agagacattg 1501 gaactacaaa acaaaagatg atgttattgg caaaagcact tcagactggt cttgcgggcc 1561 catttaaagg tggagccttg aaaggagggc cactaaaggc agcacaaaca tgttataact 1621 gtgggaagcc aggacattta tctagtcaat gtagagcacc taaagtctgt tttaaatgta 1681 aacagcctgg acatttctca aagcaatgca gaagtgttcc aaaaaacggg aagcaagggg 1741 ctcaagggag gccccagaaa caaactttcc cgatacaaca gaagagtcag cacaacaaat 1801 ctgttgtaca agagactcct cagactcaaa atctgtaccc agatctgagc gaaataaaaa 1861 aggaatacaa tgtcaaggag aaggatcaag tagaggatct caacctggac agtttgtggg 1921 agtaacatat aatctagaga aaaggcctac tacaatagta ttaattaatg atactccctt 1981 aaatgtactg ttagacacag gagcagatac ttcagtgttg actactgcac attataatag 2041 gttaaaatat agagggagaa aatatcaagg gacgggaata ataggagtgg gaggaaatgt 2101 ggaaacattt tctacgcctg tgactataaa gaaaaagggt agacacatta agacaagaat 2161 gctagtggca gatattccag tgactatttt gggacgagat attcttcagg acttaggtgc 2221 aaaattggtt ttggcacagc tctccaagga aataaaattt agaaaaatag agttaaaaga 2281 gggcacaatg gggccaaaaa ttcctcaatg gccactcact aaggagaaac tagaaggggc 2341 taaagagata gtccaaagac tattgtcaga gggaaaaata tcagaagcta gtgacaataa 2401 tccttataat tcacccatat ttgtaataaa aaagaggtct ggcaaatgga ggttattaca 2461 agatctgaga gaattaaaca aaacagtaca agtaggaacg gaaatatcca gaggattgcc 2521 tcacccggga ggattaatta aatgtaaaca catgactgta ttagatattg gagatgcata 2581 tttcactata cccttagatc cagagtttag accatataca gctttcacta ttccctccat 2641 taatcatcaa gaaccagata aaagatatgt gtggaattgt ttaccacaag gattcgtgtt 2701 gagcccatat atatatcaga aaacattaca ggaaatttta caacctttta gggaaagata 2761 tcctgaagta caattgtatc aatatatgga tgatttgttc gtgggaagta atggttctaa 2821 aaaacaacac aaagagttaa tcatagaatt aagggcaatc ttactggaag agggttttga 2881 gacaccagat gataaattac aagaagtgcc accttatagc tggctaggtt atcaactttg 2941 tcctgaaaat tggaaagtac aaaaaatgca attagacatg gtaaagaatc caacccttaa 3001 tgatgtgcaa aaattaatgg ggaatataac atggatgagc tcaggggtcc cagggttgac 3061 agtaaaacac atagcagcta ctactaaggg atgtttagag ttgaatcaaa aagtaatttg 3121 gacggaagag gcacaaaaag agttagaaga aaataatgag aagattaaaa atgctcaagg 3181 gttacaatat tataatccag aagaagaaat gttatgtgag gttgaaatta caaaaaatta 3241 tgaggcaact tatgttataa aacaatcaca aggaatccta tgggcaggta aaaagattat 3301 gaaggctaat aagggatggt caacagtaaa aaatttaatg ttactgttgc aacatgtggc 3361 aacagaaagt attactagag taggaaaatg tccaacgttt aaggtaccat ttaccaaaga 3421 gcaagtaatg tgggaaatgc aaaaaggatg gtattattct tggctcccag aaatagtata 3481 tacacatcaa gtagttcatg atgattggag aatgaaattg gtagaagaac ctacatcagg 3541 aataacaata tacactgatg ggggaaaaca aaatggagaa ggaatagcag cttatgtgac 3601 cagtaatggg agaactaaac agaaaaggtt aggacctgtc actcatcaag ttgctgaaag 3661 aatggcaata caaatggcat tagaggatac cagagataaa caagtaaata tagtaactga 3721 tagttattat tgttggaaaa atattacaga aggattaggt ttagaaggac cacaaagtcc 3781 ttggtggcct ataatacaaa atatacgaga aaaagagata gtttattttg cttgggtacc 3841 tggtcacaaa gggatatgtg gtaatcaatt ggcagatgaa gccgcaaaaa taaaagaaga 3901 aatcatgcta gcataccaag gcacacaaat taaagagaaa agagatgaag atgcagggtt 3961 tgacttatgt gttccttatg acatcatgat acctgtatct gacacaaaaa tcatacccac 4021 agatgtaaaa attcaagttc ctcctaatag ctttggatgg gtcactggga aatcatcaat 4081 ggcaaaacag gggttattaa ttaatggagg aataattgat gaaggatata caggagaaat 4141 acaagtgata tgtactaata ttggaaaaag taatattaaa ttaatagagg gacaaaaatt 4201 tgcacaatta attatactac agcatcactc aaattccaga cagccttggg atgaaaataa 4261 aatatctcag agaggggata aaggatttgg aagtacagga gtattctggg tagaaaatat 4321 tcaggaagca caagatgaac atgagaattg gcatacatca ccaaagatat tggcaagaaa 4381 ttataagata ccattgactg tagcaaaaca gataactcaa gaatgtcctc attgcactaa 4441 gcaaggatca ggacctgcag gttgtgtcat gagatctcct aatcattggc aggcagattg 4501 cacacatttg gacaataaga taatattgac ttttgtagag tcaaattcag gatacataca 4561 tgctacatta ttatcaaaag aaaatgcatt atgtacttca ttggctattt tagaatgggc 4621 aagattgttt tcaccaaagt ccttacacac agataacggc actaattttg tggcagaacc 4681 agttgtaaat ttgttgaagt tcctaaagat agcacatacc acaggaatac catatcatcc 4741 agaaagtcag ggtattgtag aaagggcaaa taggaccttg aaagagaaga ttcaaagtca 4801 tagagacaac actcaaacac tggaggcagc tttacaactt gctctcatta cttgtaacaa 4861 agggagggaa agtatgggag gacagacacc atgggaagta tttatcacta atcaagcaca 4921 agtaatacat gagaaacttt tactacagca agcacaatcc tccaaaaaat tttgttttta 4981 caaaatccct ggtgaacatg attggaaggg acctactagg gtgctgtgga agggtgatgg 5041 tgcagtagta gttaatgatg aaggaaaggg aataattgct gtaccattaa ccaggactaa 5101 gttactaata aagccaaatt gagtattgtt gcaggaagca agacccaact accattgtca 5161 gctgtgtttc ctgaggtctc taggaattga ttacctcgat gcttcattaa ggaagaagaa 5221 taaacaaaga ctgaaggcaa tccaacaagg aagacaacct caatatttgt tataaggttt 5281 gatatatggg attatttggt aaaggggtaa catggtcagc atcgcattct atggggggat 5341 cccaggggga atctcaaccc ctattaccca acagtcagaa aaatctaagt gtgaggagaa 5401 cacaatgttt caaccttatt gttataataa tgacagtaag aacagcatgg cagaatcgaa 5461 ggaagcaaga gaccaagaaa tgaacctgaa agaagaatct aaagaagaaa aaagaagaaa 5521 tgactggtgg aaaataggta tgtttctgtt atgcttagca ggaactactg gaggaatact 5581 ttggtggtat gaaggactcc cacagcaaca ttatataggg ttggtggcga tagggggaag 5641 attaaacgga tctggccaat caaatgctat agaatgctgg ggttccttcc cggggtgtag 5701 accatttcaa aattacttca gttatgagac caatagaagc atgcatatgg ataataatac 5761 tgctacatta ttagaagctt atcatagaga gataacattc atttataagt cttcttgcac 5821 agatagtgat cattgtcaag agtatcaatg taaaaaagtt aatcttaatt cctctgactc 5881 ctctaactct gtacgtgttg aggatgtaac gaacacagcg gaatattggg gatttaaatg 5941 gctagaatgt aatcaaacag aaaattttaa gactatatta gtacctgaaa atgaaatggt 6001 aaatatcaat gatactgata cctggatacc taaggggtgt aatgagacgt gggcaagagt 6061 gaaacgttgt cctatagata ttttatatgg gatacatcca atcaggctgt gtgtacagcc 6121 accatttttt ctggtacagg agaaagggat tgctgatact tctagaattg gcaattgtgg 6181 gccaacaata tttcttgggg ttttagaaga taataaggga gtagtacggg gggattatac 6241 agcttgcaat gtgcgtcgcc taaatataaa tagaaaggat tatacaggga tctatcaagt 6301 acctatattt tatacatgta ctttcactaa cataacttcc tgcaataatg agccaataat 6361 cagtgttatc atgtatgaaa caaaccaggt acaatattta ttgtgtaata ataataatag 6421 taataattat aattgtgtag tacaaagttt tggagttata ggacaggcac acttagaact 6481 gcctagacct aacaaaagaa taaggaacca aagctttaac caatataact gctctataaa 6541 taacaaaaca gaattagaaa catggaagtt agtaaagact tctggcgtaa ctcctttacc 6601 tatttcttct gaagctaaca ctggactaat tagacataag agagattttg gtataagtgc 6661 aatagtggca gctattgtag ccgctactgc tattgctgct agcgctacta tgtcttatgt 6721 tgctctaact gaggttaaca aaataatgga agtacaaaat catacttttg aggtagaaaa 6781 tagtactcta aatggtatgg atttaataga acgacaaata aagatattat atgctatgat 6841 tcttcaaaca catgcagatg ttcaactgtt aaaggaaaga caacaggtag aggagacatt 6901 taatttaatt ggatgtatag aaagaacaca tgtattttgt catactggtc atccctggaa 6961 tatgtcatgg ggacatttaa atgagtcaac acaatgggat gactgggtaa gcaaaatgga 7021 agatttaaat caagagatac taactacact tcatggagcc aggaacaatt tggcacaatc 7081 catgataaca ttcaatacac cagatagtat agctcaattt ggaaaagacc tttggagtca 7141 tattggaaat tggattcctg gattgggagc ttccattata aaatatatag tgatgttttt 7201 gcttatttat ttgttactaa cctcttcgcc taagatcctc agggccctct ggaaagtgac 7261 cagtggtgca gggtcctccg gcagtcgtta cctgaagaaa aaattccatc acaaacatgc 7321 atcgcgagaa gacacctggg accaggccca acacaacata cacctagcag gcgtgaccgg 7381 tggatcaggg gacaaatact acaagcagaa gtactccagg aacgactgga atggagaatc 7441 agaggagtac aacaggcggc caaagagctg ggtgaagtca atcgaggcat ttggagagag 7501 ctatatttcc gagaagacca aaggggagat ttctcagcct ggggcggcta tcaacgagca 7561 caagaacggc tctgggggga acaatcctca ccaagggtcc ttagacctgg agattcgaag 7621 cgaaggagga aacatttatg actgttgcat taaagcccaa gaaggaactc tcgctatccc 7681 ttgctgtgga tttcccttat ggctattttg gggactagta attatagtag gacgcatagc 7741 aggctatgga ttacgtggac tcgctgttat aataaggatt tgtattagag gcttaaattt 7801 gatatttgaa ataatcagaa aaatgcttga ttatattgga agagctttaa atcctggcac 7861 atctcatgta tcaatgcctc agtatgttta gaaaaacaag gggggaactg tggggttttt 7921 atgaggggtt ttataaatga ttataagagt aaaaagaaag ttgctgatgc tctcataacc 7981 ttgtataacc caaaggacta gctcatgttg ctaggcaact aaaccgcaat aaccgcattt 8041 gtgacgcgag ttccgcattt gtgacgcgtt aagttcctgt ttttacagta tataagtact 8101 tgtgttctga caattgggac tcagattctg cggtctgagt cccttctctg ctgggctgaa 8161 aaggcctttg taataaatat aattctctac tcagtccctg tctctagttt gtctgttcga 8221 gatcctacac aaaggattag caatcagaat aatattagac tactcaaaag caacacataa 8281 agccagaaga cagctgagtc atgttttcaa attctgacaa aaaattattt acaacttaga 8341 attc // LOCUS HIV2BEN 10359 bp ss-RNA VRL 04-AUG-1990 DEFINITION Human immunodeficiency virus type 2 (HIV-2), complete proviral genome. ACCESSION M30502 KEYWORDS complete genome. SOURCE Human immunodeficiency virus type 2 (isolate BEN) proviral DNA, clones MK[2,6]. ORGANISM Human immunodeficiency virus type 2 Viridae; ss-RNA enveloped viruses; Positive strand RNA virus; Retroviridae; Lentivirinae. REFERENCE 1 (sites; for [2]) AUTHORS Kirchhoff,F., Jentsch,K., D Bachmann,B., Stuke,A., Laloux,C., Lueke,W., Stahl-Henning,C., Schneider,J., Nieselt,K., Eigen,M. and Hunsmann,G. TITLE A novel proviral clone of HIV-2: Biological and phylogenetic relationship to other primate immunodeficiency viruses JOURNAL Virology 177, 305-311 (1990) STANDARD full staff_review REFERENCE 2 (bases 1 to 10359) AUTHORS Kirchhoff,F., Jentsch,K., D Bachmann,B., Stuke,A., Laloux,C., Lueke,W., Stahl-Henning,C., Schneider,J., Nieselt,K., Eigen,M. and Hunsmann,G. JOURNAL Unpublished (1989) STANDARD full staff_review COMMENT Kindly submitted prior to publication by Dr. F. Kirchhoff, DPZ, Gottingen, 0551/3851-0. HIV2BEN was isolated from a German AIDS patient(with predominantly neurological complications) who was probably infected in Mali (Klemm et al., J Neuro 235,304,1988). The sequence was determined from two clones: bases 1903 to 5057 and 7293 to 9131 were from MK2 and bases 1 to 1902, 5058 to 7292 and 9132 to 10359 were from MK6. FEATURES from to/span description pept 1103 2668 gag polyprotein pept < 2326 5754 pol polyprotein (NH2 terminus uncertain) pept 5423 6070 vif protein pept 5898 6239 vpx protein pept 6239 6502 vpr protein pept 6402 6697 tat protein, exon 2 (first expressed exon) 8861 8957 tat protein, exon 3 (AA at 8862) pept 6628 6697 rev protein, exon 2 (first expressed exon) 8861 9102 rev protein, exon 3 pept 6704 9286 env polyprotein pept 9120 9893 nef protein LTR 1 855 5' LTR LTR 9505 10359 3' LTR binding 859 875 primer (Lys-tRNA) binding site signal 10211 10216 poly-A signal BASE COUNT 3506 a 2132 c 2598 g 2123 t ORIGIN Left end of viral genome (5'LTR) 1 tgcaagggat gttttacagt aggaggagac atagaatcct agacatatac ctagaaaaag 61 aggaagggat aataccagat tggcagaatt atactcatgg gccaggagta aggtacccaa 121 tgtacttcgg gtggctgtgg aagctagtat cagtagaact ctcacaagag gcagaggaag 181 atgaggccaa ctgcttagta cacccagcac aaacaagcag acatgatgat gagcatgggg 241 agacattagt gtggcagttt gactccatgc tggcctataa ctacaaggcc ttcactctgt 301 acccagaaga gtttgggcac aagtcaggat tgccagagaa agaatggaag gcaaaactga 361 aagcaagagg gataccatat agtgaataac aggaacaacc atacttggtc aaggcaggaa 421 gtagctacta agaaacagct gaggctgcag ggactttcca gaaggggctg taaccaaggg 481 agggacatgg gaggagctgg tggggaacgc cctcatactt actgtataaa tgtacccgct 541 tcttgcattg tattcagtcg ctctgcggag aggctggcag atcgagccct gagaggttct 601 ctccagcact agcaggtaga gcctgggtgt tccctgctgg actctcacca gtacttggcc 661 ggtactgggc agacggctcc acgcttgctt gcttaaagac ctcttcaata aagctgccag 721 ttagaagcaa gttaagtgtg tgttcccatc tctcctagtc gccgcctggt cattcggtgt 781 tcatctgagt aacaagaccc tggtctgtta ggacccttct cgctttggga atccaaggca 841 ggaaaatccc tagcaggttg gcgcccgaac agggacttga agaggactga gaagccctgg 901 aactcggctg agtgaaggca gtaagggcgg caggaacaaa ccacgacgga gtgctcctag 961 aaaggcgcgg gccgaggtac caaaggcggc gtgtggagcg ggagtgaaag aggcctccgg 1021 gtgaaggtaa gtacctacac caaaaactgt agccagaaaa ggcttgttat cctaccttta 1081 gacaggtaga agattgtggg agatgggcgc gagaaactcc gtcttgagag ggaaaaaagc 1141 agacgaatta gaaaaagtta ggttacggcc cggcggaaag aaaaagtaca ggttaaaaca 1201 tattgtgtgg gcagcgaatg aattggataa attcggattg gcagagagcc tgttggagtc 1261 aaaagaaggt tgccaaaaga ttctcagagt tttagatcca ttagtaccaa cagggtcaga 1321 aaatttaaaa agccttttta ataccgtctg cgtcatttgg tgcttgcacg cagaagagaa 1381 agtgaaagat actgaggaag caaagaaact agcacagaga catctagtgg cagaaactgg 1441 aactgcagag aaaatgccaa atacaagtag accaacagca ccacctagtg ggaaaagagg 1501 aaactacccc gtgcaacaag cgggtggcaa ctatgtccat gtgccactga gcccccgaac 1561 tctaaatgca tgggtaaaat tagtggagga aaagaagttc ggggcagaag tagtgccagg 1621 atttcaggca ctctcagaag gctgcacgcc ctatgatatt aatcaaatgc ttaattgtgt 1681 gggcgatcac caagcagcta tgcaaataat cagagagatt attaatgaag aagcagcaga 1741 ctgggattcg cagcacccaa taccaggccc cttaccagca ggacagctca gagacccaag 1801 agggtctgac atagcaggaa caacaagcac agtagatgaa cagatccagt ggatgtatag 1861 gccacaaaat cccgtaccgg tagggaacat ctacagaaga tggatccaaa tagggctgca 1921 aaagtgtgtc agaaagtaca acccaactaa catcttagac ataaaacagg gaccaaaaga 1981 accgttccaa agctatgtag acaggttcta caaaagcttg agggcagaac aaacagaccc 2041 agcagtaaaa aattggatga cccaaacgct gctaatacag aatgccaacc cagactgcaa 2101 gttagtacta aaaggactgg ggatgaatcc caccctagaa gagatgctaa ccgcctgcca 2161 gggggtaggc ggaccaggcc agaaagccag gctaatggct gaagccctaa aagaggctat 2221 gggaccaagc cctatcccat ttgcagcagc ccaacaaaga aaggcaatta ggtattggaa 2281 ctgtggaaag gagggacact cggcaagaca gtgccgagca cctagaagac agggctgctg 2341 gaagtgtggc aagccaggac acatcatggc aaactgcccg gaaagacagg caggtttttt 2401 agggttgggc ccacggggaa agaagcctcg caacttcccc gtgacccaag cccctcaggg 2461 gctgatacca acagcacctc cggcagatcc agcagcggaa ctgttggaga gatatatgca 2521 gcaagggaga aagcagaggg agcagaggga gagaccatac aaagaggtga cggaggactt 2581 gctgcacctc gagcagagag agacacctca cagagaggag acagaggact tgctgcacct 2641 caattctctc tttggaaaag accagtagtc acagcgtaca tcgaggatca gccggtagaa 2701 gtcttactag acacaggggc tgatgactca atagtagcag gaatagaatt aggggacaat 2761 tacactccaa aaatagtagg gggaataggg ggatttataa acaccaaaga atacaaaaat 2821 gtagaaataa aagtactaaa taaaagagta agagccacca taatgacagg agatacccca 2881 atcaacatct ttggcagaaa tattctgaca gccttaggca tgtcattaaa tttaccagtt 2941 gccaagatag agccaataaa agtaacattg aagccaggga aagatggacc aaggctgaaa 3001 caatggcccc taacaaaaga gaaaatagaa gcactaaaag agatctgtga aaaaatggaa 3061 aaagagggcc agctagaaga ggcacctcca actaatcctt ataatacccc cacatttgca 3121 attaagaaaa aggacaagaa caaatggagg atgctgatag attttagaga actaaataag 3181 gtgactcaag atttcacaga aattcagcta ggaattccac acccggcagg actagccaaa 3241 aagaaaagga tctctatatt agatgtaggg gatgcctatt tttccatacc actacatgaa 3301 gattttaggc agtatactgc atttacccta ccagcagtaa acaatatgga accaggaaaa 3361 agatatatat ataaagtctt gccacaagga tggaagggat caccagcaat ttttcaatac 3421 acaatgaggc aagtcttaga acctttcaga aaagcaaacc cagatgtcat tctcatccag 3481 tacatggatg atatcttaat agctagtgac aggacaggtt tagagcatga caaagtggtc 3541 ctgcagctaa aagaacttct aaatggccta gggttttcta ctccagatga gaagttccaa 3601 aaagaccctc catttcaatg gatgggctgt gaactatggc caactaaatg gaagctgcag 3661 aaactacaac tgccccagaa agacatatgg acagtcaatg acatccaaaa gctagtggga 3721 gtcttaaatt gggcggcaca aatctattca ggaataaaaa ccaaacactt atgtagacta 3781 attagaggaa aaatgacact cacagaagaa gtgcagtgga cagaactagc agaagcagag 3841 ctagaagaaa acaaaattat cttgagccag gaacaagaag gatattatta ccaagaagaa 3901 aaagaattag aggcaacaat ccaaaaaagc caaggacatc aatggacata caaaatacac 3961 caggaagaga aaatcctaaa agtaggaaag tatgcaaaga taaaaaatac ccataccaat 4021 ggggtcagat tactagcaca ggtagttcag aaaataggaa aagaggcact agtcatttgg 4081 ggacggatac caaaatttca cctgccagtg gagagagaga cctgggagca gtggtgggat 4141 aactactggc aagtgacatg gatcccagag tgggactttg tatctacccc accactggtc 4201 aggttaacat ttaacctagt aggagatcct ataccaggcg cagagacctt ctacacagat 4261 ggatcatgca atagacagtc aaaagaggga aaagcaggat atgtaacaga tagaggaaaa 4321 gacaaagtaa aagtattaga acaaactacc aatcagcagg cagaattaga agtctttcgg 4381 atggcactgg cagactcagg cccaaaggtt aatatcatag tagattcaca gtatgtaatg 4441 gggatagtag caggccagcc aacagagtca gaaaatagaa tagtgaacca gatcatagaa 4501 gaaatgataa agaaggaagc agtctatgtt gcatgggtcc cagcccataa aggcatagga 4561 ggaaaccagg aagtagacca tttagtaagt caaggcatca gacaagtatt attcctggaa 4621 aagatagagc ccgctcaaga ggaacatgaa aaatatcata gcattataaa agaactaacc 4681 cataaatttg gaatacccct tctagtagca agacagatag taaactcatg tgcccaatgc 4741 caacagaaag gagaagccat acatgggcaa gtaaatgcag aaataggcgt ttggcaaatg 4801 gactacacac acttagaagg aaaaatcatt atagtagcag tacatgttgc aagtggattc 4861 atagaagcag aagtcatccc acaggaatca ggaaggcaga cagcactctt cctattaaaa 4921 ctggccagta ggtggccaat aacgcacttg cacacagaca atggccccaa cttcacttca 4981 caggaagtga agatggtggc atggtgggta ggtatagaac aatcctttgg agtaccttac 5041 aacccacaaa gccagggagt agtagaagca atgaatcacc acctaaagaa tcagataagt 5101 agaattagag aacaggcaaa tacaatagaa acaatagtac tgatggcagt tcattgcatg 5161 aattttaaaa gaaggggagg aataggggat atgaccccag cagaaagact aatcaacatg 5221 attaccacag aacaagaaat acaattcctc caaagaaaaa attcaaattt taaaaatttc 5281 caggtctatt acagagaagg cagagatcag ctgtggaaag gacctggtga actactgtgg 5341 aagggagaag gagcagtcat agtcaaggta gggacagaca taaaagtagt accaagaagg 5401 aaggccaaga ttatcaggga ctatggagga agacaggaac tggatagtag tccccacctg 5461 gagggtgcca gggaggatgg agaaatggca tgcccttgtc aagtacctga aatacagaac 5521 aaaagaccta gaggaggtgc gctatgttcc ccaccacaag gtgggatggg catggtggac 5581 ttgcagcagg gtaatattcc cactacaagg aaaaagtcat ctagaaatac aggcatattg 5641 gaacctaaca ccagaaaaag gatggctctc ctctcatgca gtaagattaa cctggtatac 5701 agaaaagttc tggacagatg ttaccccaga ctgtgcagac atcctaatac atagcactta 5761 tttctcttgc tttacggcag gtgaagtaag aagagccatc agaggggaaa agttattgtc 5821 ctgctgcaac tatccccaag ctcataaagc acaggtacca tcacttcaat acctagccct 5881 agtagtagta caacaaaatg acagacccca gagaaagggt accgccagga aacagtggag 5941 aagagaccat tggagaggcc ttcgagtggc tagagaggac catagaagcc ttaaacaggg 6001 aggcagtgaa ccatctgccc cgagagctca ttttccaggt gtggcaaagg tcctggagat 6061 attggcatga tgaacaaggg atgtcagcaa gctacacaaa gtatagatat ttgtgcctaa 6121 tgcaaaaagc tatatttaca catttcaaga gagggtgcac ttgctggggg gaggacatgg 6181 gccgggaagg attggaagac caaggacctc cccctcctcc ccctccaggt ctagtctaat 6241 gactgaagca ccaacagagt ttcccccaga agatgggacc ccacggaggg acttagggag 6301 tgactgggta atagaaactc tgagggaaat aaaggaagaa gccttaagac attttgatcc 6361 ccgcttgcta attgctcttg gctactatat ccataataga catggagaca cccttgaagg 6421 cgccagagag ctcattaaaa ccctacaacg agccctcttc gtgcacttca gagcgggatg 6481 taaccgctca agaattggct aaacaaggag aagaactcct tgcccagctg caccgacccc 6541 tagagccatg cactaacaaa tgctattgta agcgatgcag tttccattgc cagctgtgtt 6601 tctcgaaaaa ggggctcgga atatcatatg agcgaaaggg cagacgaaga aggactccaa 6661 ggaaaactaa gactccttcg ccttctgcac cagacaagtg agtatggagc ctggtaggaa 6721 tcagctgttt gttgtcattt tactaacaag tgcttgctta gtatattgta gccagtatgt 6781 gactgttttc tatggcatac ccgcgtggaa aaatgcatct attcccttat tttgtgcaac 6841 taaaaataga gacacttggg ggaccataca gtgcttgcca gacaatgatg attatcagga 6901 aataatttta aatgtgacag aggcttttga tgcatggaat aatacagtga cagaacaagc 6961 agtagaagat gtctggcatc tatttgagac atcaataaaa ccatgtgtca agctaacacc 7021 tctatgtgtg gcaatgaatt gtagcagggt tcaagggaat accacgaccc cgaatcccag 7081 gacctcgagt tccacaacct cgagaccacc cacatccgca gcctccataa taaatgaaac 7141 ttctaactgc atagaaaaca acacatgcgc aggattaggg tatgaggaga tgatgcaatg 7201 tgagttcaat atgaaggggt tagaacaaga taagaaaagg aggtataagg acacatggta 7261 tttagaagat gtggtttgtg acaacacaac agctggcaca tgttacatga gacattgcaa 7321 cacatcaatc atcaaagagt catgtgataa gcactattgg gatgctatga ggtttagata 7381 ctgtgcacca ccgggctttg ccctattaag atgtaatgat accaactatt caggctttga 7441 acctaagtgc actaaagtag tagctgcttc atgcacaagg atgatggaaa cgcaaacttc 7501 tacttggttt ggctttaatg gcactagagc agaaaataga acatatatct attggcatgg 7561 cagagataat aggactatca ttagcttaaa caagtattat aatctcacaa tgcgttgtaa 7621 gagaccagga aataagacag ttttaccaat aacacttatg tcaggattag tgtttcactc 7681 tcagccaatc aacacaaggc ctaggcaggc atggtgccgg tttggaggca gatggaggga 7741 agccatgcag gaggtgaagc aaacccttgt acaacatccc agatacaaag gaatcaatga 7801 tacagggaaa attaacttta cgaaaccggg agcaggctca gacccggaag tggcatttat 7861 gtggactaac tgcagaggag aatttctcta ctgtaacatg acttggttcc tcaattgggt 7921 agaagacaag aaccaaacac ggcgcaacta ttgccatata aagcagataa ttaatacctg 7981 gcataaagta gggaaaaatg tatatttgcc tcctagggaa ggggagttgg cctgtgaatc 8041 aacagtaacc agcataattg ctaacattga catagataaa aatcggactc ataccaacat 8101 tacctttagt gcagaagtgg cagaactgta ccgattagaa ctgggagact acaaattaat 8161 agaaataaca ccaattggct tcgcacctac agatcagaga aggtactcct caactccagt 8221 gaggaacaaa agaggtgtgt tcgtgctagg gttcttgggt tttctcgcga cagcaggttc 8281 tgcaatgggc gcgcggtccc tgacgctgtc agcccagtcc cggactttac tggccgggat 8341 agtgcagcaa cagcaacagc tgttggacgt agtcaagaga caacaagaaa tgttgcgact 8401 gaccgtctgg ggaacgaaaa acctccaggc aagagtcact gctatcgaga agtacctaaa 8461 gcatcaggca cagctaaatt catggggatg tgcgtttaga caggtctgcc acactactgt 8521 accgtgggta aatgactctt tatcgcctga ctggaaaaat atgacatggc aggagtggga 8581 gaaacaagtc cgctacctag aggcaaatat cagtcaaagt ttagaagaag cccaaattca 8641 acaagaaaag aatatgtatg aattacaaaa attaaatagc tgggatattc ttggcaactg 8701 gtttgactta acctcctggg tcaagtatat tcaatatgga gtgcatatag tagtgggaat 8761 aatagcttta agaatagcaa tctatgtagt gcaattgtta agtagattta gaaagggcta 8821 taggcctgtt ttctcttccc cccccggtta tctccaacag atccatatcc acaaggaccg 8881 gggacagcca gccaacgaag gaacagaaga agacgtcgga ggcgacagtg gttacgactt 8941 gtggccttgg ccaataaact atgtgcagtt cctgatccac ctactgactc gcctcttgat 9001 cgggctatac aacatctgca gagacttact atccaagaac tccccgaccc gccgactgat 9061 ctcccagagt ctaacagcaa tcagggactg gctgagactt aaggcggccc aactgcaata 9121 tgggtgcgag tggatccaag aagctttcca agcattcgcg aggactacga gagagactct 9181 tgcgggcgcg tggggatggt tatgggaagc agcgcgacgc atcgggaggg gaatactcgc 9241 agttccaaga agaatcaggc agggagcaga actcgccctc ctgtgaggga cagcagtatc 9301 agcagggaga gtacatgaac agcccatgga gaaacccagc aacagaaaga cagaaagatt 9361 tgtataggca gcaaaatatg gatgatgtag attctgatga tgatgaccta ataggagttc 9421 ctgttacacc aagagtacca cggagagaaa tgacctataa attggcaata gatatgtcac 9481 attttataaa agaaaaaggg ggactgcaag ggatgtttta cagtaggagg agacatagaa 9541 tcctagacat atacctagaa aaagaggaag ggataatacc agattggcag aattatactc 9601 atgggccagg agtaaggtac ccaatgtact tcgggtggct gtggaagcta gtatcagtag 9661 aactctcaca agaggcagag gaagatgagg ccaactgctt agtacaccca gcacaaacaa 9721 gcagacatga tgatgagcat ggggagacat tagtgtggca gtttgactcc atgctggcct 9781 ataactacaa ggccttcact ctgtacccag aagagtttgg gcacaagtca ggattgccag 9841 agaaagaatg gaaggcaaaa ctgaaagcaa gagggatacc atatagtgaa taacaggaac 9901 aaccatactt ggtcaaggca ggaagtagct actaagaaac agctgaggct gcagggactt 9961 tccagaaggg gctgtaacca agggagggac atgggaggag ctggtgggga acgccctcat 10021 acttactgta taaatgtacc cgcttcttgc attgtattca gtcgctctgc ggagaggctg 10081 gcagatcgag ccctgagagg ttctctccag cactagcagg tagagcctgg gtgttccctg 10141 ctggactctc accagtactt ggccggtact gggcagacgg ctccacgctt gcttgcttaa 10201 agacctcttc aataaagctg ccagttagaa gcaagttaag tgtgtgttcc catctctcct 10261 agtcgccgcc tggtcattcg gtgttcatct gagtaacaag accctggtct gttaggaccc 10321 ttctcgcttt gggaatccaa ggcaggaaaa tccctagca // LOCUS HIV2D205 7817 bp ss-RNA VRL 04-AUG-1990 DEFINITION Human immunodeficiency virus type 2 (HIV-2), gag, pol, vif, vpX, vpR, partial env. ACCESSION X16109 KEYWORDS . SOURCE Human immunodeficiency virus type 2 (isolate D205) viral RNA. ORGANISM Human immunodeficiency virus type 2 Viridae; ss-RNA enveloped viruses; Positive strand RNA virus; Retroviridae; Lentivirinae. REFERENCE 1 (bases 1 to 7817) AUTHORS Dietrich,U., Adamski,M., Kreutz,R., Seipp,A., Kuhnel,H. and Rubsamen-Waigmann,H. TITLE A highly divergent HIV-2-related isolate JOURNAL Nature 342, 948-950 (1990) STANDARD full staff_review COMMENT This isolate, also called HIV2-ALT by the authors, was taken from a healthy Ghanian woman. Tree analysis reveals that it likely arose prior to the branching of HIV2s and SIVs, i.e. the sooty mangabey and macaque isolates. [1] calls attention to an extension of the gag-pol overlap region resulting in a dilated (by 20 residues) pol orf. The env cds begins at 6671; tat-exon1 at 6375 and rev-exon1 at 6601, with the possible donor splice site at 6665. FEATURES from to/span description pept 1079 2644 gag polyprotein pept < 2299 5475 pol polyprotein (NH2 terminus uncertain) pept 5396 6046 vif protein pept 5877 6212 vpX protein pept 6212 6517 vpR protein pept 6375 / 6664 tat protein, exon 2 (first expressed exon) pept 6601 / 6664 rev protein, exon 2 (first expressed exon) pept 6744 > 6875 env polyprotein site 6876 6878 env polyprotein in-frame stop codon BASE COUNT 2705 a 1602 c 1930 g 1580 t ORIGIN 5'-terminus of 5'LTR 1 tggaagggat gtattatagt gagagaagac acagaatatt agacacatat tttgagaatg 61 aagaaggcat tgtgtctggc tggcaaaact atactcatgg gccagggata aggcatccca 121 aatactttgg ttggctgtgg aagctggtac cagtagaggt gccagcagcg acccgagagg 181 aggaggaaac ccattgccta atgcacccgg cacagatctc ctcatgggat gacatccatg 241 gggagactct tatctggcag tttgattccc tcctggcata tgattatgtg gctttcaata 301 ggtttccaga agagttcggg taccagtcag ggctaccaga ggaggagtgg aaggctagac 361 taaaagcaag agggatacct acagattagg caagagacag cagcataaac aggaactagc 421 tgacactgca caagaaggaa actagcagac actgcaggga ctttccaaaa gaggctgtaa 481 catgggagga gcaagggagg ggttggttgg gaacgcccat taaatctctg tataaatgta 541 cccgcttctc gcattgtatt cagtcgctct gcggagaggc tggcagattg agccctggga 601 ggttctctcc agcactagca ggtagagcct gggtgttccc tgctagactc tcaccagcac 661 taggccagtg ttgggcagac ggctccacgc ttgcttgctt aaaaccctct taataaactg 721 ccaattagaa gcaagtaagt gtgtgttccc atctctccta gtcgcgcctg gtcactcggt 781 gctccactga tagaccctgg tctgttagga cccttcttgc tttgggaaac cgaagcggga 841 aaatccctag cagattggcg cccgaacagg gacctgggaa gagtgagagt cttcaagagc 901 acagctgagt gaaggcagta agggcggcag gaatcaacca cgacggagag ctcctgtaaa 961 agcgcaggcc ggtaccaggc agcgtgagga gcgggaggag aagaggctcc gggagtaagg 1021 taagtactcc aaattttgca ttgtctactc taaagaggga gtagggcata agtgggagat 1081 gggcgcgaga ggctccgtcc tgtcagggaa gaaaacagat gaattagaaa aagttaggtt 1141 acggcccggc ggaaagaaaa agtatatgct aaaacatgta gtatgggcag tgaacgaatt 1201 agacagattt ggattagcag agagcctgtt ggaatcaaaa gaaggatgtc aaaaaattct 1261 taaagtctta gctccattag taccaacagg gtcagaaaat ttaaaaagcc tttttaatat 1321 cgtctgcgtc attttttgcc tgcacgcaga agagaaagtg aaagatacag aggaagcaaa 1381 aaagatagca cagagacatc tagcggcgga cacagaaaaa atgccagcta caaataaacc 1441 aacagcacca cctagcggcg gaaattatcc agtgcagcaa ctggctggca actacgtcca 1501 cctgccgcta agcccccgaa ccttaaatgc ttgggtaaag ttagtagaag aaaagaagtt 1561 cggggcagaa gtagtaccag gatttcaggc actatcagaa ggatgcaccc cttatgatat 1621 aaatcagatg ctaaattgtg taggagaaca tcaggcagcc atgcaaatta ttagagaaat 1681 aatcaatgag gaagcagcag actgggacca gcaacacccg tcaccaggcc caatgccggc 1741 aggacaactt agggacccaa gagggtcaga tatagcagga accaccagca cagtagagga 1801 acagatacag tggatgtaca gggcccaaaa tcctgtccca gtgggaaaca tttatagaag 1861 atggattcaa ttaggattgc agaaatgtgt ccgaatgtac aatcctacca acatattaga 1921 cataaagcag ggaccaaagg agcccttcca aagctatgta gatagattct acaaaagctt 1981 acgggcagaa caaacagacc cagcagtgaa aaattggatg acacaaacac tgctgattca 2041 gaatgctaac ccagattgca agttagtgct taagggcttg ggaatgaatc ccaccttaga 2101 ggaaatgcta acggcctgcc aagggatagg aggcccaggg cagaaggcaa ggctaatggc 2161 cgaagcctta aaagaggccc taacacctgc acccataccg tttgctgccg ttcaacaaaa 2221 agcagggaag agagggacag tgacatgctg gaactgtggc aaacagggac acacagccag 2281 gcaatgcagg gcccctagaa gacagggatg ctggaaatgt ggaaaaacag gacacatcat 2341 gtcaaaatgc ccagaaagac aggcgggttt tttagggtta ggaccctggg gaaagaagcc 2401 tcgcaacttc cccatgaccc aagtgcctca gggagtgaca ccatctgcac ccccgatgaa 2461 cccagcagag ggcatgacac ctcggggggc gacaccatct gcgccccctg cagatccagc 2521 agtggagatg ctgaaaagtt acatgcagat ggggagacaa cagagagaga gccgagagag 2581 accctacaag gaggtgacag aggatttgct gcacctcaat tctctctttg gagaagacca 2641 gtagtcaaag catgtatcga gggtcagtca gtagaagtat tactagacac aggagttgac 2701 gactcaatag tagcagggat agaattaggt agcaattaca ccccaaaaat agtaggaggg 2761 ataggagggt tcataaatac caaagaatac aaagatgtag aaatagaagt agtgggaaaa 2821 agagtaaggg caactataat gacaggagat accccaataa acatttttgg cagaaatatt 2881 ttaaatacct tgggcatgac tttaaatttc ccagtggcaa aggtagaacc agtaaaagtt 2941 gagttaaaac ctggaaaaga tgggccaaag atcagacaat ggcctctatc cagggaaaag 3001 atactagccc tcaaagaaat ctgtgaaaaa atggaaaagg agggacaatt agaagaagca 3061 ccccctacta atccatacaa cacacccacc tttgccataa agaagaaaga taagaacaaa 3121 tggagaatgc taatagactt tagagagtta aacaaggtaa cccaggactt cacagaagtc 3181 aactgggtat tccccacccg gcaggtggca gaaaaaagga gaataacagt aatagatgtg 3241 ggagatgcct acttcagtat cccactagat ccaaacttca gacagtatac agcattcacc 3301 ttgccatcag taaacaatgc agagccagga aagagataca tttataaagt tctaccacaa 3361 gggtggaagg ggtcccagtc aatttgtcaa tactccatga gaaaggtatt agatcctttt 3421 agaaaggcca acagcgatgt cattataatt cagtacatgg atgacatcct tatagcaagt 3481 gacagaagtg atctggagca cgacagggta gtgtcccaac taaaagagtt attaaatgac 3541 atgggattct ctaccccaga agaaaagttc caaaaagacc ctccgttcaa atggatgggt 3601 tatgagctct ggccaaaaaa gtggaaactg caaaaaatac aactgccaga aaaagaagtt 3661 tggacagtga atgcaattca aaaactggta ggagtattaa actgggcagc tcaactcttt 3721 cctggaatta agacaaggca catatgcaaa ctaattaggg gaaagatgac cctaacagaa 3781 gaagtacagt ggacagaact agcagaagca gagctacagg agaataaaat catcttagaa 3841 caggaacaag aaggatccta ctacaaggaa agggtaccgc tagaagcaac agtacagaaa 3901 aacctagcaa atcagtggac atacaaaatt catcagggaa ataaagtcct aaaagtagga 3961 aaatatgcaa aggttaaaaa cacgcacacc aacggggtaa gactactggc acatgtagtt 4021 cagaaaatag gcaaagaagc cctagtcatc tggggagaga taccagtgtt ccatctgcca 4081 gtagaaagag agacatggga ccagtggtgg acagattact ggcaagtaac ctggatccca 4141 gagtgggact ttgtctcgac cccaccatta ataagactag cctacaacct agtcaaagac 4201 cccctagaag ggagagaaac ctactacaca gatgggtcct gcaatagaac ctcaaaggaa 4261 ggaaaagcag gatatgtcac tgacagggga aaagataagg ttaaagtgtt agaacagaca 4321 acaaaccaac aagcagaact tgaagcattt gcattagcat taacagactc agaaccacaa 4381 gttaacatca tagtagattc acaatatgtc atgggaataa tagctgcaca gccaacagaa 4441 acagaatcac caatagtagc aaaaataatt gaagaaatga tcaaaaaaga ggcagtatat 4501 gtaggatggg taccagctca caagggactg ggtggtaatc aggaagtaga ccacctagta 4561 agtcaaggaa tcagacaggt cttgttccta gaaaaaatag aaccagccca ggaagagcat 4621 gaaaaatatc atggcaatgt aaaagaactg gtccataaat tcggaattcc acaattagtg 4681 gcaaaacaga tagtaaattc ctgtgataaa tgccaacaaa aaggggaagc tattcatgga 4741 caggtaaatg cagacctagg gacatggcag atggactgta cacatttaga aggaaaaatt 4801 ataatagtgg cagtccatgt agccagtggg tttatagaag cagaggtaat accccaagag 4861 acaggaagac agacagctct cttcctacta aagttggcca gcagatggcc tatcacacac 4921 ctacacacag acaacggtgc caacttcacc tcaccaagtg taaagatggt agcctggtgg 4981 gtaggaatag aacaaacttt tggagtaccc tataacccac aaagtcaagg agtagtggaa 5041 gcaatgaacc atcacctgaa aaatcaaata gacagactca gagaccaagc agtatcaata 5101 gagacagttg tactaatggc aactcactgc atgaatttta aaagaagggg aggaataggg 5161 gatatgaccc ctgcagaaag actagttaac atgataacca cagagcaaga aatacagttc 5221 ttccaagcaa aaaatttaaa atttcaaaat ttccaggtct attacagaga aggcagagat 5281 caactctgga agggacctgg tgaactattg tggaaagggg aaggagcagt catcataaag 5341 gtagggacag aaatcaaagt agtacccagg agaaaagcaa aaattataag gcactatgga 5401 ggaggaaaag gattggattg tagtgccgac atggaggata ccaggcaggc tagagagatg 5461 gcacagtctg attaagtatc ttaagtatag aacaggagag ttgcaacagg tctcttatgt 5521 ccctcaccac aaggtaggat gggcttggtg gacttgcagt agaataatat ttcccctaaa 5581 caaaggagca tggctagaag tccaaggata ttggaaccta accccagaaa ggggattctt 5641 gagctcctat gctgtaagac taacatggta tgagaggaac ttttatacag atgtaacacc 5701 tgatgtggca gaccagctac tgcatgggtc ttatttctct tgcttttcag ccaatgaagt 5761 aaggagagcc atcaggggag aaaagatatt gtcctactgc aactatccat cagctcacga 5821 agggcaggta ccaagcttac agtttctagc cctaagggtc gtacaggaag gaaaaaatgg 5881 atcccaggga gagagtgcca ccaggaaaca gcgacgaaga aacagtagga gaagcattcg 5941 cttggctaga aagaacaata acagagctca acagggtagc ggtcaaccat ttgccccgag 6001 aacttatttt ccaggtctgg cagaggtctt gggcatactg gcgtgaggaa cagggcatgt 6061 caattagcta taccaaatat agatacttgt tgctaatgca gaaagcaatg tttgtgcact 6121 atacaaaggg ctgtaggtgc ctgcaggagg gccatgggcc agggggatgg agatcaggac 6181 ctcctcctcc tcctccccca ggcctggcct aatggcagaa gcagccccag agatccctcc 6241 agagaacgag aacccacaaa gagaaccgtg ggaagagtgg ataggggaga tcctggagga 6301 aataaagcaa gaagccttaa agcattttga tcctcgcttg ctaactgcgc ttggtaactt 6361 tatctacagt aggcatggag atacccttgc aggagcagga gagctcatta aaatcctcca 6421 acgagccctc ttcctccact tcagagccgg ttgtcaacac tcaaggattg gacaatcagg 6481 gggaggaaat cctctctcaa ctataccgcc cccttaaggc atgcgataat acatgctact 6541 gtaagaaatg ctgctaccat tgccagcttt gttttcttaa aaagggtctt gggatatgtt 6601 atgaccgctc gagaaggaga tctgcaaaaa gagctaagac tactgcacct tctgcaccag 6661 acaagtgagt atggcatatt ttagcagccg cctgcctatt gcgctcctgc ttataggtat 6721 cagtgggttt gtatgtaaac aatatgttac tgtcttctat ggcatacccg catggaggaa 6781 cgcaacagtt cccctcattt gtgcaaccac aaacagagac acctggggaa ctgtacagtg 6841 tctcccagac aatggtgact acactgagat caggctaaac ataacagagg cttttgatgc 6901 atgggataat acagtgacac aacaggcagt agatgatgtg tggagactct ttgaaacctc 6961 cataaaacca tgtgtcaaac taaccccact gtgtgtggca atgaactgta gtaaaaccga 7021 aacaaaccca gggaatgcca gtagtactac caccactaag cctactacca cctctcgtgg 7081 gctgaaaacg attaacgaaa cagacccatg cataaaaaat gacagctgca caggactagg 7141 agaagaggaa ataatgcaat gtaattttag tatgacggga ctaagaagag atgagctaaa 7201 acaatataaa gacacctggt actcagaaga tttagagtgt aataatacca ggaagtatac 7261 cagcagatgc tatataagaa cctgcaacac aacaattatc caagagtcat gtgacaaaca 7321 ttattgggac agcttaaggt ttaggtattg tgctcccccg gggttttttc tactaagatg 7381 taatgatacc aactattcag gcttcatgcc caactgcagt aaggtagtag cgtcctcctg 7441 cacaagaatg atggaaacac agtcctctac atggtttggc ttcaatggta caagggcaga 7501 gaacaggaca tatatatatt ggcatgaaaa agacaatagg accatcataa gcttaaatac 7561 atactataat ttgtcaatac actgtaagag gccaggaaac aagacggttg taccaataag 7621 aaccgtgtca ggactacttt tccattcaca gcctatcaat aagagaccca gacaagcttg 7681 gtgctggttt aagggaaact ggacagaagc cataaaggag gtgaaaagga ccatcataaa 7741 acatcccagg tataaaggag gtgcaaaaaa tatcacaagc gtaaagttag tatcagaaca 7801 tggaaaaggt tcagatc // LOCUS HIV2GH1 9480 bp ds-DNA VRL 04-AUG-1990 DEFINITION Human immunodeficiency virus type 2 proviral DNA, gag, pol, vif, vpX, vpR, tat, rev, env and nef genes. ACCESSION M30895 KEYWORDS . SOURCE Human immunodeficiency virus type 2 MOLT-4 cell (isolate GH1) proviral DNA, clone 8. ORGANISM Human immunodeficiency virus type 2 Viridae; ss-RNA enveloped viruses; Positive strand RNA virus; Retroviridae; Lentivirinae. REFERENCE 1 (bases 1 to 9480) AUTHORS Hasegawa,A., Tsujimoto,H., Maki,N., Ishikawa,K.-I., Miura,T., Fukasawa,M., Miki,K. and Hayami,M. TITLE Genomic divergence of HIV-2 from Ghana JOURNAL AIDS Res. Hum. Retroviruses 5, 593-604 (1989) STANDARD full staff_review COMMENT GH1 was isolated from a Ghanian AIDS patient; virus was cloned in MOLT-4 clone 8 cells. FEATURES from to/span description pept 544 2112 gag polyprotein pept < 1770 4937 pol polyprotein pept 4867 5514 vif protein pept 5342 5680 vpX protein pept 5680 5997 vpR protein pept 5843 6138 tat protein, exon 2 (first expressed exon) 8281 8377 tat protein, exon 3 (AA at 8282) pept 6069 6138 rev protein, exon 2 (first expressed exon) 8281 8522 rev protein, exon 3 (AA at 8283) pept 6148 8364 env polyprotein (see COMMENT) pept 8540 9307 nef protein BASE COUNT 3257 a 1957 c 2349 g 1917 t ORIGIN 1 cagtcgctct ggcagaggct ggcagattga gccctgggag gttctctcca gcactagcag 61 gtagagcctg ggtgttccct gctagactct caccagtgct tggccggcac tgggcagacg 121 gctccacgct tgcttgctta aagacctctt aataaagctg ccaattagaa gcaagttaag 181 tgtgtgttcc catctctcct agtcgccgcc tggtcattcg gtgttcatct gaataacaag 241 accctggtct gttaggaccc ttccgctttg ggaatccaag gcaggaaaat ccctagcagg 301 ttggcgccca acagggactt gaaagggact gagaagccct ggaacacggc tagtgaaggt 361 agtaaggcgg caggaacaaa ccacgacgga gtgctcctag aaaagcgcgg gccgaggtac 421 caaaggcggc gtgtggagcg ggagtgaaag aggcctccgg gtgaaggtaa gtgcctacac 481 cgaaaaccgt agccagaaaa ggctagttat cctatctttt agacaggtag aagattgtgg 541 gagatgggcg cgagaaactc cgtcttgaga gggaaaaaag cagatgaatt agaaaagatt 601 aggttacggc ccagcggaaa gaaaaagtac aggctaaaac atattgtgtg ggcagcgaac 661 gaattggaca aattcggatt ggcagagagc ctgttggagt caaaagaagg ttgccaaaag 721 attctcacag ttttagatcc attagtacca acagggtcag aaaatttaaa aagccttttt 781 aacacagtct gcgtcatttg gtgcttgcac gcagaagaga aagtgaaaga tactgaagaa 841 gcaaaaaaac tagtacagag acatctaggg gcagaaacag gaactgcaga aaaaatgcca 901 agtacaagta gaccaacagc accacctagc gggagaggaa gaaacttccc tgtacaacag 961 acaggcggtg gcaactatat ccacgtgcca ctgagccccc gaactctaaa tgcttgggta 1021 aaattagtag aggacaagaa gttcggggca gaagtagtgc caggatttca agcactctca 1081 gaaggctgca cgccctatga tatcaaccaa atgcttaatt gtgtgggcga tcaccaagca 1141 gctatgcaaa taatcagaga gattatcaat gacgaagcag cagattggga tgcacagcac 1201 ccaataccag gccccttacc agcagggcag cttagagacc caagggggtc tgacatagca 1261 ggaacaacaa gcacagtaga agaacagatc cagtggatgt ataggccaca aaatcccgtg 1321 ccggtaggga acatctacag aagatggatc cagatagggc tacagaagtg tgtcaggatg 1381 tacaacccaa ctaacatctt agacgtaaag cagggaccaa aggaaccgtt ccagagctat 1441 gtggacaggt tctataaaag cttgagggca gaacaaacag atccggcagt aaagaactgg 1501 atgacccaaa cgctgctaat acagaatgcc aacccagact gcaagttagt actaaaagga 1561 ctggggatga atcccaccct agaagagatg ctgactgcct gtcagggggt aggtggacca 1621 ggccaaaaag ccagactaat ggccgaagcc ctgaaggagg ctttgacacc accccctatc 1681 ccatttgcag cagcccaaca aagaaaggta attaggtgtt ggaactgcgg aaaggaggga 1741 cactcggcga gacagtgccg agcgcctaga agacagggct gctggaaatg tggcaagaca 1801 ggacacgtca tggcaaaatg cccagaaaga caggcaggtt ttttagggat gggtccatgg 1861 ggaaagaagc cccgcaactt ccccgtggcc caagctcctc cggggctgat accaacagca 1921 cccccagcag atccagcagt ggatctattg gaaagatata tgcagcaggg gagagagcag 1981 agggagcaga gggagagacc atacaaagag gtgacggaag acttactgca cctcgagcag 2041 ggaaaagcac ctcacagaga ggcgacagag gacttgctgc acctcaattc tctctttgga 2101 aaagaccagt agtcacagca tacatcgagg tacagccggt agaagtctta ttagacacag 2161 gggctgatga ctcaatagtg gcaggaatac agttagggga caattacgtt ccaaaaatag 2221 tggggggaat agggggattc ataaacacca aagagattaa aaatatagaa ataaaagtac 2281 taaataaaag agtaagagcc accataatga caggagatac cccaatcaac atctttggca 2341 gaaatatcct gacagcctta ggcatgtcat taaatttacc aatcgccaag atagaaccaa 2401 taaaagtaac gttgaagcca gggaaagatg gaccaagact gagacaatgg cccctaacaa 2461 aagagaaaat agaagcacta agagagatct gcgaaaaaat ggaaaaggag ggccaactag 2521 aagaggcacc tccaactaat ccttataata cccccacatt tgcaattaag aaaaaggaca 2581 agaacaaatg gaggatgcta atagacttta gagaactaaa cagggtgact caagatttca 2641 cagaaattca gctaggaatt ccacacccgg caggattagc caaaaagaaa aggatcactg 2701 tactagatgt aggggatgcc tacttttcca taccgctaca tgaagatttt aggcagtata 2761 ctgcatttac cctaccatca gtaaacaatg cagaaccagg aaaaagatat atatataagg 2821 tcttaccaca aggatggaag ggatcaccag caatttttca acacacaatg aggcaagtct 2881 tagaaccttt cagaaaagca aacccagatg tcattctcat ccaatacatg gatgatattt 2941 taatagctag tgacaggaca ggtttagagc atgacaaagt agtcctgcag ctaaaagaac 3001 ttctgaatgg cctagggttc tctaccccag atgagaagtt ccaaaaagac cctccgcttc 3061 aatggatggg ctatgaacta tggccaacta aatggaagct gcagaaatta caactgcccc 3121 agaaagaaat atggacagtc aacgacatcc agaagctggt gggagtctta aattgggcgg 3181 cacaaatcta tccaggaata aaaaccaaac atttgtgtag gctaatcaaa ggaaaaatga 3241 cactcacaga agaagtgcag tggacagagt tagcagaagc agagctagaa gaaaacaaaa 3301 ttatcttaag ccaggaacaa gaaggatact attaccaaga agaaaaagaa ttagaggcaa 3361 caatccaaaa aaatcaagac aatcagtgga catacaaaat acaccaggaa gagaaaatcc 3421 taaaagtagg gaaatatgca aagataaaaa atacccatac caatggggtc agattactag 3481 cacaggtagt tcagaaaata ggaaaagaag cactagtcat ctggggacgg ataccaaaat 3541 tccacttacc ggtggagaga gagacctggg agcagtggtg ggataactac tggcaagtga 3601 catggatccc agagtgggac ttcgtatcta ccccaccact agtcaggtta acatttaacc 3661 tagtaggaga tcctatacca ggcgcagaga ccttctacac agatggatca tgcaatagac 3721 agtcaaagga aggaaaagca cgatatgtga cagatagagg aagagacaag gtaagagtac 3781 tagaacgaac taccaatcag caagcagaat tagaagcctt tgcgatgaca ctgacagact 3841 cgggtccaaa agttaatatc atagtagact cacagtatgt aatgggaata gtagtaggcc 3901 aaccaacaga gtcagaaagt agaatagtaa accaaatcat tgaggacatg ataaagaaag 3961 aagcagtcta tgtcgcgtgg gtcccagccc ataaaggcat aggaggaaac caggaagtag 4021 accatttagt aagtcagggc atcagacaag tattgttcct agaaaggata gagcccgccc 4081 aagaagaaca tgaaaaatat catagcaata tgaaagaact aacccataaa tttggaatac 4141 cccaactagt agcaagacag atagtaaaca catgtgccca atgccaacag aaaggagagg 4201 ccatacatgg gcaggtaaat gcagaaatag gtgtctggca aatggactgc acacacttag 4261 aaggaaagat cattatagta gcagtacatg ttgcaagtgg attcatagaa gcagaagtta 4321 tcccacagga atcaggaagg cagacagcgc tcttcctatt aaaactggcc agtaggtggc 4381 caataacaca cttacacaca gacaatggct ccaacttcac ttcacaggaa gtgaagatgg 4441 tggcatggtg gataggtata gagcaatcct ttggagtacc ttacaaccca caaagccagg 4501 gagtagtaga agcaatgaat caccacttaa agaatcagat aagtagaatt agagaacagg 4561 caaatacaat agaaaccata gtactaatgg cagttcattg catgaatttt aaaagaaggg 4621 gaggaatagg ggatatgacc ccagcagaaa gactaatcaa catgattacc acagaacaag 4681 aaatacaatt cctccaaaga aaaaattcaa attttaaaaa cttccaggtc tattacagag 4741 aaggcagaga tcagctgtgg aaaggacctg gagaactact gtggaaggga gacggagcag 4801 tcatagtcaa ggtaggggca gacataaaag taataccaag aaggaaggcc aagattatca 4861 gagactatgg aggaaggcaa gaactggata gttcccacct ggagggtgcc agggaggagg 4921 atggagaggt ggcatagtct tgtcaagtac ctaaaataca gaacaagaga cctagaggag 4981 gtgcgctatg ttccccacca caaggtggga tgggcgtggt ggacttgcag cagggtaata 5041 ttccctctaa aaggagaaag tcatctagag atacaggcat attggaacct aacgccagaa 5101 aaaggatggc tctcctctca ttcagtgaga ataacctggt atacagaaag gttctggaca 5161 gatgttaccc cagactatgc agacatccta atacatagca cttatttctc ttgctttacg 5221 gcaggtgaag taagaagagc catcagaggg gaaaagttat tgtcctgctg caactatccc 5281 caagctcata aagtacaggt accatcactt caatacctag ccctagtggt agtgcaacaa 5341 aatgacagac cccagagaaa gggtaccgcc aggaaacagt ggagaagaga ccattggaga 5401 ggccttcgag tggctagaca ggactataga agccttaaac agagaggcag tgaaccatct 5461 gccccgagag ctcattttcc aggtgtggca aaggtcctgg agatactggc atgatgacca 5521 agggatgtca ccaagctaca caaagtatag atatttgtgc ctgatgcaga aagctgtatt 5581 tatacatttc aagagagggt gcacttgcct ggggggagga catggaccgg gaggatggag 5641 atcaggacct ccccctcctc cccctccagg tctagtctaa tgactgaagc accaacagag 5701 tttcccccag aagatgggac cccacggagg gaactagggg gtgactgggt aataagaatt 5761 ctgggagaaa taaaggaaga agccttaaag cattttgatc cccgcttgct aatcgctctt 5821 ggcaactata tccatagtag acatggagac acacctgaag gcgccagaga gctcattaga 5881 atcctacaac gagccctctt cgtgcacctc agagcagggt gtaaccgctc aagaattagc 5941 caaacaagga gaagaactcc tttcccagct gcaccgaccc ctagaggcat gtactaactc 6001 atgctattgt aagcagtgca gcttccattg ccagctgtgt ttcttgaaaa aagggctcgg 6061 gatatggtat gcacgaaaaa gcagacggag aagaactcca agaaagacta agactcattc 6121 gtcttctgca tcagacaagt gagtaagatg tgtggtaaga gtctactatg tgttgccagc 6181 ttgctagcta gtgcttactt agtatattgc acccaatatg tgactgtttt ctatggcgtg 6241 cccgtgtgga gaaatgcatc cattcccctc ttttgtgcaa ctaaaaacag agatacttgg 6301 ggaaccatac agtgcaagcc agacaatgat gattatcaag aaataacctt gaatgtgaca 6361 gaggctttcg acgcatggga taatacagta acagaacaag cagtagaaga tgtctggagt 6421 ctatttgaga catcaataaa accatgcgtc aaactaacac ctctatgtgt agcaatgagt 6481 tgtaacagca ccactaacaa cacaaccaca acagggagca caacagggat gtcagagata 6541 aatgaaactt ctccatcata ctcagacaac tgcacaggat tggggaagga agagatagtc 6601 aattgtcagt tctatatgac aggattagag agagataaga aaaaacagta taatgaaaca 6661 tggtactcaa aagatgtagt ttgtgaatca aataatacca aagatgggaa aaacagatgt 6721 tacatgaacc attgcaacac atcagtcatc acagagtcat gtgataagca ctattgggat 6781 gctataaagt ttagatactg tgcaccaccg ggttatgccc tactaagatg caatgatacc 6841 aattattcag gcttcgagcc aaagtgttct aaagtagtag cttctacatg cacaaggatg 6901 atggaaacgc aaacctccac ttggtttggc tttaatggca ctagggcaga gaatagaaca 6961 tatatctatt ggcatggcag agataatagg actatcatca gcttaaacaa gtattataac 7021 ctttctatac attgtaagag accaggaaac aagacagttg taccaataac actcatgtca 7081 gggttagtgt ttcactccca gccaatcaat acaagaccta ggcaagcatg gtgctggttc 7141 aaaggcaagt ggagggaagc catgcaggag gtaaagcaaa cccttataaa acatcccagg 7201 tataaaggaa ccaacgacac aaagaatatt aactttacaa aaccaggaag aggctcagac 7261 ccagaagtgg catacatgtg gactaactgc agaggagaat ttctctactg caacatgact 7321 tggttcctca attgggtaga aaacagaccg aatcagacac agcacaatta tgcgccgtgc 7381 catataaggc aaataattaa tacctggcac aaggtaggga aaaatgtata tttgcctcct 7441 agagaaggac agttgacctg caactcaaca gtgaccagca taattgctaa cattgacgta 7501 aatagtaacc agacaaatat tacctttagt gcagaggtgg cagaactata ccgattagaa 7561 ttgggagatt ataaattaat agaagtaaca ccaattggct tcgcacctac gagagagaaa 7621 agatattcct cggctccagt gaggaacaaa agaggtgtgt ttgtgctagg gttcttgggt 7681 tttctcgcga cagcaggttc tgcaatgggc gcggcgtccc tgacgctgtc ggctcagtcc 7741 cggactttac tggccgggat agtgcagcaa cagcaacagc tgttggacgt agtcaagaga 7801 caacaagaaa tgttgcgact gaccgtctgg ggaacgaaaa atctccaggc aagagtcact 7861 gctatcgaga agtacttaaa ggaccaggca cagttaaatt catgggggtg tgcgtttagg 7921 caggtctgcc acactactgt accatgggta aatgattctt tatcgcctga ctggaacaat 7981 atgacatggc aggaatggga aaaacaagtc cgctacctag aggcaaatat cagtcaaagt 8041 ttagaacagg cacaaattca acaagaaaag aatatgtatg aattacaaaa gctaaatagc 8101 tgggatgtct ttggcaactg gtttgattta acctcctgga tcaagtatat tcaatatgga 8161 gtttatatag tagtaggagt gatagtttta agaatagcaa tatatatagt gcaattgtta 8221 agtagactta gaaagggcta taggcctgtt ttctcttccc ctcccggtta tctccaacag 8281 atccatatcc acacggaccg gggacagcca gccaacgaag gaacagaaga agacgacaga 8341 gacgacgatg gttacgactt gtagccttgg ccgataaact atatacattt cctgatccac 8401 ctactgactc gcctcttgac cgggctatac aagatctgca gagacttact atccacgaac 8461 tccccgaccc accgactgat ctcccagaat ctaacagcaa tcagggactg gctgagactt 8521 aaagcggcct acctgcaata tgggggcgag tggatccaag aagcattcca agcattcgca 8581 aagactacga gagagactct tgcgagcgca tggggggggt tatgtgcagc agtgcaacgc 8641 gtcgggaggg gaatactcgc agtcccaaga aggatcaggc aaggggcaga aatcgccctc 8701 ctgtgaggga cagcagtatc ggcagggaga tttcatgaac accccatgga gaaccccggc 8761 aatagaagga cagaaaaagt tatacaagca gcaaaatatg gatgatatag attctagtga 8821 tgatgaccta gtgggagttc ctgttacacc aagagtacca ctaagagcaa tgacctataa 8881 attggcagta gatatgtcac attttataaa gaaaagggga ctggatggga tgttttacag 8941 tagggataga catagaatcc tagacttgta cctagaaaag gaggaaggga taataccaga 9001 ttggcagaat tatactcatg ggccaggagt aaggtaccca atgtgtttcg ggtggctatg 9061 gaagctagta ccagtagatg tctcacaaga ggcagaggac gatgagacca actacttaac 9121 acacccagca caaacaagca gacatgatga cgagcatgga gaaacactac tttggcggtt 9181 tgaccccacg ctggcctatg attacaaggc tttcattctg catccagaag agtttgggca 9241 caagtcagga ttgccagaga aagagtggaa ggcaaaactg aaagcaagag ggataccata 9301 tagttaacaa caggaacagc catacttggt caaggcagga agtagctact gagaacagct 9361 gaggctgcag ctgcagggac tttccagaag gggctgtaac caagggaggg acatgggagg 9421 agctggtggg gaacgccctc atacttactg tataaatgta ctcgcttctt acattgtatt // LOCUS HIV2NIHZA 9431 bp ss-RNA VRL 04-AUG-1990 DEFINITION Human immunodeficiency virus type 2 (HIV-2) proviral gag, pol, vif, vpX, vpR, tat, rev, env and nef genes. ACCESSION J03654 KEYWORDS . SOURCE Human immunodeficiency virus type 2 (isolate FG) proviral DNA, clone NIHZ. ORGANISM Human immunodeficiency virus type 2 Viridae; ss-RNA enveloped viruses; Positive strand RNA virus; Retroviridae; Lentivirinae. REFERENCE 1 (bases 1 to 9431) AUTHORS Zagury,J.F., Franchini,G., Reitz,M., Collalti,E., Starcich,B., Hall,L., Fargnoli,K., Jagodzinski,L., Guo,H.G., Laure,F. and et,al. TITLE Genetic variability between isolates of human immunodeficiency virus (HIV) type 2 is comparable to the variability among HIV type 1 JOURNAL Proc. Natl. Acad. Sci. U.S.A. 85, 5941-5945 (1988) STANDARD full staff_review COMMENT Hard copy of sequence kindly provided by G. Franchini (August 1987). The NIHZ clone has a large deletion in the nef cds and LTR relative to other HIV2 sequences; nevertheless, it is biologically active. FEATURES from to/span description pept 547 2106 gag polyprotein pept < 1824 4931 pol polyprotein (NH2 terminus uncertain; AA at 1824) pept 4861 5508 vif protein pept 5336 5674 vpX protein pept 5674 5991 vpR protein pept 5837 6132 tat protein, exon 2 (first expressed exon) 8272 8368 tat protein, exon 3 (AA at 8273) pept 6063 6132 rev protein, exon 2 (first expressed exon) 8272 8525 rev protein, exon 3 (AA at 8274) pept 6139 8709 env polyprotein pept 8543 9085 nef protein LTR < 1 301 5' LTR LTR 8928 > 9431 3' LTR rpt < 1 177 R repeat 5' copy rpt 9254 9431 R repeat 3' copy binding 305 321 primer (Lys-tRNA) binding site signal 9404 9409 poly-A signal BASE COUNT 3213 a 1913 c 2348 g 1957 t ORIGIN Cap site of genomic RNA. 1 agtcgctctg gcagaggctg gcagattgag ccctgggagg ttctctccag cactagcagg 61 tagagcctgg gtgttccctg ctagactctc accagtgctt ggccggcgct gggcagacgg 121 ctccacgctt gcttgcttaa agacctctca ataaagctgc caattagaag caagttaagt 181 gtgtgctccc atctctccta gtcgccgcct ggtcatccgg tgttcacctg tataacaaga 241 ccctggtctg ttaggaccct tcttgctttg ggaaaccgag gcaggcggaa aatccctagc 301 aggttggcgc ccgaacagga cttgaagaag actgagaagc cttggaacaa gggctgggtg 361 aaggcagtaa gggcggcagg aacaaaccag cacggagtgc tcctagaaag gcgcgggccg 421 aggtaccaaa gcggcgtgtg gacgggagtc gagaggcctc cgggtgaagg gtaagtacct 481 acaccaaaaa ctgtagccaa agagggcttg ttatcctacc tttagacagg tagaagattg 541 tgggagatgg gcgcgagaaa ctccgtcttg agagggaaaa aagcagatga attagaaaaa 601 attaggttac ggcccggcgg aaagaaaaag tacaagttaa aacatattgt gtgggcagcg 661 aatgaattgg acagattcgg attagcagag agcctgttgg agtcaaaaga aggttgccaa 721 aaaattctta cagttttaga tccattagta ccgacaggtt cagaaaattt aaaaagcctt 781 tttaatactg tctgcgtcat ctggtgcata cacgcagaag agaaagtgaa agatactgaa 841 ggagcaaaac aaatagtaca gagacatcta gtggcagaaa caggaactgc agagaaaatg 901 ccaaatacaa gtagaccaac agcaccacct agcgggaaaa acttccccgt gcagcaagta 961 gctggcaact atacccatat accactgagt cccggaactc taaatgcttg ggtaaaatta 1021 gtggaggaaa agaaattcgg ggcagaagta gtgccaggat ttcaggcact ctcagaaggc 1081 tgcacgccct atgatattaa tcagatgctt aattgtgtgg gcgaccatca agcagctatg 1141 caaataatca gggaaattat taatgaggaa gcagcagact gggatgttgc acatcccata 1201 ccaggcccct taccagcggg gcagcttaga gagccaagag ggtctgacat agcagggaca 1261 acaagcacag tagaagagca gatccagtgg atgtttagac cacaaaatcc tgtaccagta 1321 ggcaacatct ataggagatg gattcagata ggactgcaaa agtgcgtcag gatgtacaat 1381 ccaaccaaca tcctagacat aaatcaagga ccaaaggaac cattccaaag ctatgtagat 1441 aggttctaca aaagcttgag ggcagaacaa acagatccag cagtaaaaaa ttggatgacc 1501 caaacactgc taatacagaa tgccaaccca gattgtaaat tagtactaaa aggattagga 1561 atgaatccta ccttagaaga aatgctgacc acctgtcagg gggtaggggg accaggccag 1621 aaagctagat taatggcaga agccctaaag gaggtcatgg caccagcccc catcccattt 1681 gcagcagccc agcagagaaa gacattcaag tgctggaatt gtggaaagga aggacactcg 1741 gcaagacaat ggagcgcgcc tagaagacag ggctgctgga agtgtggtaa gtcaggacat 1801 gtcatggcaa attgcccaga tagacaggct ggttttttag ggattggcca ctggggaaag 1861 aaggcccgca acttccccgc ggcccaagtc ccgcaggggc taacaccaac agcaccccca 1921 ttggatccag cagtggacct actggagaaa tatatgcagc aaggaaaaag gcaaagggag 1981 cagagagaga gaccgtacaa ggaagtgaca gaggacttac tgcgtttcga gcaggcagag 2041 acaccatgca gggagacgac agaggacttg ctgcacctca attctctctt tggaaaagac 2101 cagtagtcac agcacacatt gagggtcagc cagtagaagt tttgttagac acaagggcta 2161 acgactcaat agtagcagga atagagttag ggagcaatta tagtccaaaa atagtaggag 2221 gaataggggg attcataaat accaaggaat ataaaaatgt agaaatagaa gtcctaggta 2281 aaagggtaaa agccaccata atgacaggtg ataccccgat caacattttt ggcagaaatg 2341 ttctgacagc tttaggcatg tcattaaacc tgccagttgc caagatagaa ccaataaaaa 2401 taatgctaaa gccagggaaa gatggaccaa gactaaaaca atggccttta acaaaagaaa 2461 aaatagaagc actaaaagaa atctgtgaaa aaatggaaaa agaaggccag ctagaagaag 2521 cacctccaac taatccttat aataccccca catttgcaat caagaaaaag gacaaaaaca 2581 aatggagaat gctaatagac tttagggagc taaacaaggt aactcaggat ttcacagaaa 2641 ttcagttagg aattccacac ccagcaggat tggccaaaaa gaggagaatt actgtactag 2701 atgtagggga tgcttacttt tccataccac tacatgagga ctttagacag tatactgcat 2761 tcactctacc atcagtaaac aatgcagaac caggaaaaag atatatatac aaagtcctac 2821 cacaggggtg gaaggggtca ccagcaattt ttcaatacac aatgaggcag atcttagaac 2881 cattcagaaa agcaaacgag gatgtcatta tcattcagta catggatgat atcttaatag 2941 ccagcgacag gacagactta gaacatgaca aagtggtcct gcagctaaaa gaacttctaa 3001 atggactagg attttccacc ccagatgaga aattccagaa agaccctcca tatcgctgga 3061 tgggctatga attatggcca actaaatgga agttgcaaaa aatacagctg ccccaaaaag 3121 aagtatggac agtcaatgac atccaaaagc tagtgggtgt cctaaattgg gcagcacaaa 3181 tctacccagg gataaagacc aaacacctat gtaggctaat tagaggaaaa atgacactca 3241 cagaagaggt acagtggaca gaattagcag aagcagagct agaggaaaac agaattatct 3301 taagccagaa acaagaagga cactattacc aggaagaaaa aaagttagaa gcaacagtcc 3361 agaaagatca agacaatcag tggacatata aggtacacca gggggagaaa attctcaaag 3421 gtgggaaaat atgcaaagat aaaaaatacc cataccaacg ggtcagattg ttagcacagg 3481 tagttcaaaa aataggaaaa gaagcactag tcatttgggg acggatacca aaatttcacc 3541 taccagtaga gagagatacc tgggagcagt ggtgggataa ctactggcaa gtaacatgga 3601 tcccagactg ggactttgta tctaccccac cactggtcag gctagcattt aacctggtag 3661 gagagcctgt accaggcgca gaaactttct acacagatgg atcctgcaat aggcagtcaa 3721 aagaagggaa agcaggatat ataacagata gagggagaga cagggtaaaa gtattagagc 3781 aaactaccaa tcagcaagca gaattagaag cctttgcaat ggcactaaca gactcaggtc 3841 caaaagctaa tattatagta gactcacaat atgtaatggg gatagtagca ggccaaccaa 3901 cagagtcaga gaatagaata gtaaatcaga tcatagaaga aatgataaag aaagaagcaa 3961 tctatgttgc gtgggtccca gcccacaaag gcataggagg aaaccaggaa gtagatcatt 4021 tagtaagtca gggcattaga caagtattat tcctagagaa aatagaaccc gcgcaggaag 4081 aacatgaaaa atatcatagc aatataaaag aactgtccca taaatttgga atacccaagc 4141 tagtggcaag acaaatagta aacacatgtg cccatgtgca acagaaaggg gaggctatac 4201 atgggcaagt aaatgcagaa ctaggcactt ggcaaatgga ctgcacacat ttagaaggaa 4261 aagtcattat agtagcagta catgttgcaa gtggatttat agaggcagaa gttatcccac 4321 aggaatcagg aaggcaaaca gcactgttct tactgaaact ggccagtagg tggccaataa 4381 cacacctgca cacagataat ggtgccaact tcacttcaca ggaagtgaaa atggtagcat 4441 ggtgggtagg tatagaacaa acctttggag tgccttacaa tccacaaagc caaggagtag 4501 tagaagcaat gaatcatcat ctaaaaaatc agatagacag aattagagag caggcaaata 4561 cagtagaaac aatagtatta atggcagttc attgcatgaa ttttaaaaga aggggaggaa 4621 taggggatat gaccccagca gaaagaataa tcaatatgat caccacagaa caagaaatac 4681 aattcctcca agcaaaaaat tcaaaattaa aaaattttcg ggtctatttc agagaaggca 4741 gagatcagct gtggaaagga cctggggaac tgctgtggaa gggagacgga gcagtcatag 4801 tcaaggtagg gacagaaata aaagtagtac caagaaggaa agccaagatc attaaagact 4861 atggaggaag gcaagagatg gatagtggtt cccatttgga gggtgccagg gaggatggag 4921 agatggcata gcttggtcaa gtacttgaaa tacagaacaa aagacttaga gaaggtatgc 4981 tatgttcccc atcataaggt ggggtgggca tggtggactt gcagcagggt aatattccca 5041 ctgaaagaga acagtcatct ggagatacaa gcatattgga acttaacacc agaaaaagga 5101 tggctctcct ctcactcagt gagaataact tggtatacag aaaagttctg gacagatgtt 5161 accccagact gtgcagatac cctaatacat agcacttatt tctcttgctt tacggcaggt 5221 gaagtaagaa gagccatcag aggggaaaaa ttattgtcct gctgcaaata cccccgagcg 5281 catagatccc aggtaccgtc acttcaattt ctggccttag tggtagtgca acaaaatgac 5341 agatcccagg gaaacagtgc caccaggaaa cagcggcgag gagactatcg aagaggcctt 5401 cgcatggcta gacaggacag tagaggctat aaacagagag gcagtgaatc accacccacg 5461 agagctcatt ttccaggtct ggcagaggtc ctggagatac tggcatgatg aacaagggat 5521 gtcaacaagc tacacaaaat atagatattt gtgcttaata cagatggcaa tgtacatgca 5581 tgctaagaga gatggcactt gcctgggggg aggcatgggc cagaagggtg gagatcaggg 5641 acctccccct cctccacctc caggtttagt ctaatgactg aagcaccaac agaactcccc 5701 ccggaggata ggaccccacc gagggagcca ggggatgcat gggtaataga aatcctgaga 5761 gaaatagaag aagaagcttt aaggcatttt gaccctcgct tgctaactgc tcttggcaga 5821 tatatctata ctaggcatgg agacaccctt gaaggcgcca gagagctcat tagaatcctg 5881 caacgagccc tcttcgcgca cttcagagca ggatgtggcc actcaagaat tggccagaca 5941 aggggaggaa atcctctctc agctataccg acccctagag gcatgcacta actcctgcta 6001 ttgcaagaaa tgctgctacg attgccagct gtgtttttta caaaaggggc tcgggatatg 6061 gtatgaccga aagggcagac gaagaaggac tccaaagaaa actaaggctc atccgtcttc 6121 tgcatcagac aagtgagtat gaagggtagt aagaatcaac tgctgattgc tattgtacta 6181 gctagtgctt acctaataca ttgcaagcaa tttgtgactg ttttctacgg catacccgcg 6241 tggaggaatg catccattcc cctgttttgt gcaaccaaaa atagagatac ttggggaaca 6301 atacagtgct taccagacaa tgatgattat caggaaataa ctctaaatgt gacagaggct 6361 ttcgatgcat ggaataatac agtaacagaa caagcagtag aggatgtctg gaatctattt 6421 gagacatcaa taaaaccatg tgtcaaatta acacccttat gtgtagcaat gaactgtaca 6481 aggaacatga ccacatggac agggaggaca gacacccaaa atatcacaat tataaatgac 6541 acttcccatg cgcgtgcaga caactgcaca ggattaaagg aggaagaaat gatcgactgt 6601 cagtttagta tgacaggatt agagagagac aagagaaaac agtatactga agcatggtac 6661 tcaaaagatg tggtttgtga caataacacc tcaagtcaga gcaagtgtta catgaaccat 6721 tgcaacacat cagtcatcac agagtcatgt gataagcact attgggatgc tatgaggttt 6781 agatactgtg caccaccggg ttttgcctta ctaagatgca atgataccaa ttattcaggc 6841 tttgcaccta actgctctaa agtagtagct gctacatgca ccagaatgat ggaaacgcaa 6901 acttctacat ggtttggctt taatggcact agagcagaaa atagaacata tatctattgg 6961 catggtaaag ataacagaac tattatcagc ttaaataact tttataatct cactatgcat 7021 tgtaagaggc cgggaaataa gacagtgtta ccaataacat ttatgtcagg gtttaagttt 7081 cactcccagc cagtcatcaa taaaaaaccc aggcaagcat ggtgttggtt cgaaggccaa 7141 tggaaggaag ccatgcagga ggtgaaggag acccttgcga aacatcccag atataaaggg 7201 aacaggagcc gcacagagaa tattaaattt aaagcaccag ggagaggctc agacccagaa 7261 gtaacataca tgtggactaa ctgcagagga gaatctctct actgcaacat gacttggttc 7321 ctcaattggg tagagaacag gacgggtcag aaacagcgca attatgcacc gtgccgtata 7381 aggcaaataa ttaatacctg gcacagggta gggaaaaatc tatatttgcc tcccagggaa 7441 ggggagttga cctgcaactc aacggtgacc agcataattg ccaacattga tgcgggagat 7501 caaacaaata ttacctttag tgcagaggcg gcagaactat accgattgga attgggggat 7561 tacaaattag tagaaatcac accaattggc ttcgcaccta catcagtaaa gagatactcc 7621 tctgctcacc agagacatac aagaggtgtg tttgtgctag ggttcctagg ttttctcgca 7681 acagcaggtt ctgccatggg cgcggcgtcc ttgacgctgt cagctcagtc tcggacttta 7741 ttggccggga tagtgcagca acagcaacag ctgttggatg tggtcaagag acaacaagaa 7801 atgttgcgac tgaccgtctg gggaacaaaa aatctccagg caagagtcac tgctatagag 7861 aagtacctaa aggaccaggc gcagctaaat tcatggggat gtgcgtttag acaagtctgc 7921 cacacttctg taccatgggt aaatgatacc ttgacacctg attggaacaa tatgacgtgg 7981 caggaatggg aacaaaaagt ccgctacctg gaggcaaata tcagtcaaag tctagaacaa 8041 gcacaaattc agcaagaaaa gaatatgtat gagctacaaa aattaaatag ctgggatgtt 8101 tttaccaatt ggcttgactt cacctcctgg gtcaggtata ttcaatatgg agtttatgta 8161 gtagtaggaa tagtagcttt aagaatagta atatatatag tacagatgtt aagtagactt 8221 agaaagggct ataggcctgt tttctcttcc ccccccggtt acatccaaca gatccatatc 8281 cacaaggacc aggaacagcc agccagagaa gaaacagaag aagacgttgg aagcaacggt 8341 ggagacagat cttggccttg gccgatagca tatatacatt tcctgatccg cctgctgatt 8401 cgcctcttga ccgggctata caacatctgc agggacttac tatccaggat ctccccgatc 8461 ctccaaccaa tcttccagag tctccagaga gcactaacag caatcagaga ctggctgagg 8521 cttaaagcag cctacctgca gtatgggtgc gagtggatcc aagaagcgtt ccaagccctt 8581 gcaaggacta caagagagac tcttgcaggc gcggggagag acttgtggag ggcgctgcaa 8641 cgaatcggga gggggatact tgcagtccca cgaaggatca ggcagggagc agaactcgcc 8701 ctcctgtgag ggacagcgat atcagcaggg agattttgta aataccccat ggagaacccc 8761 agcagcagaa agggagaaag aattgtacaa acagcaaaat atggatgatg tagatctaga 8821 tgatgatgac caagtaggat tccctgtcac accaagagta ccattaagac caatgacatt 8881 caaattggca gtagatatgt ctcattttat aaaagaaaaa gggggactgg aagggctgtt 8941 ttatagtcag agaagacata gaatcttaga cttatactta gacaaggctt ttactctgta 9001 cccagaggaa tttgggcata attcaggact gccagagaaa gagtggaagg cgagactgaa 9061 agcaagggga ataccattta gttaaaaaca ggaacagcta tacttggtca gggcaggaag 9121 tagctactga gaacagctga gactgcaggg actttccaga aggggctgta accaggagag 9181 ggacatggga ggagctggtg gggaacgccc ttcatacttt ctgtataaat gtacccgctg 9241 cttgcattgt attcagtcgc tctggcagag gctggcagat tgagccctgg gaggttctct 9301 ccagcactag caggtagagc ctgggtgttc cctgctagac tctcaccagt gcttggccgg 9361 cgctgggcag acggctccac gcttgcttgc ttaaagacct ctcaataaag ctgccaatta 9421 gaagcaagtt a // LOCUS HIV2RODZ 9671 bp ss-RNA VRL 04-AUG-1990 DEFINITION Human immunodeficiency virus type 2, complete proviral genome. ACCESSION M15390 KEYWORDS complete genome. SOURCE Human immunodeficiency virus type 2 (isolate ROD) proviral DNA, clones lambda-ROD[27,35]. ORGANISM Human immunodeficiency virus type 2 Viridae; ss-RNA enveloped viruses; Positive strand RNA virus; Retroviridae; Lentivirinae. REFERENCE 1 (bases 9361 to 9671) AUTHORS Clavel,F., Guyader,M., Guetard,D., Salle,M., Montagnier,L. and Alizon,M. TITLE Molecular cloning and polymorphism of the human immunodeficiency virus type 2 JOURNAL Nature 324, 691-695 (1986) STANDARD full staff_review REFERENCE 2 (bases 1 to 9671) AUTHORS Guyader,M., Emerman,M., Sonigo,P., Clavel,F., Montagnier,L. and Alizon,M. TITLE Genome organization and transactivation of the human immuno- deficiency virus type 2 JOURNAL Nature 326, 662-669 (1987) STANDARD full staff_review COMMENT The ROD isolate was obtained from offshore Senegal in 1985. Its sequence was determined from two clones: bases 1 to 2658 are from ROD 27 and bases 2659 to 9671 are from ROD 35. An infectious clone of this virus has been constructed by Keith Peden, Molecular Biology and Genetics, Johns Hopkins University School of Medicine, Baltimore, MD. 21205 (301)955-3652. FEATURES from to/span description pept 546 2114 gag polyprotein pept < 1829 4939 pol polyprotein (NH2 terminus uncertain; AA at pept 4869 5516 vif protein pept 5344 5682 vpX protein pept 5682 5999 vpR protein pept 5845 6140 tat protein, exon 2 (first expressed exon) 8307 8403 tat protein, exon 3 (AA at 8308) pept 6071 6140 rev protein, exon 2 (first expressed exon) 8307 8539 rev protein, exon 3 (AA at 8309) pept 6147 8723 env polyprotein pept 8557 9327 nef protein LTR < 1 299 5' LTR LTR 8942 > 9671 3' LTR rpt < 1 173 R repeat 5' copy rpt 9497 9671 R repeat 3' copy binding 303 319 primer (Lys-tRNA) binding site signal 9649 9654 poly-A signal BASE COUNT 3314 a 1972 c 2401 g 1984 t ORIGIN Cap site of genomic RNA. 1 ggtcgctctg cggagaggct ggcagattga gccctgggag gttctctcca gcactagcag 61 gtagagcctg ggtgttccct gctagactct caccagcact tggccggtgc tgggcagacg 121 gccccacgct tgcttgctta aaaacctctt aataaagctg ccagttagaa gcaagttaag 181 tgtgtgctcc catctctcct agtcgccgcc tggtcattcg gtgttcacct gagtaacaag 241 accctggtct gttaggaccc ttcttgcttt gggaaaccga ggcaggaaaa tccctagcag 301 gttggcgcct gaacagggac ttgaagaaga ctgagaagtc ttggaacacg gctgagtgaa 361 ggcagtaagg gcggcaggaa caaaccacga cggagtgctc ctagaaaggc gcgggccgag 421 gtaccaaagg cagcgtgtgg agcgggagga gaagaggcct ccgggtgaag gtaagtacct 481 acaccaaaaa ctgtagccga aagggcttgc tatcctacct ttagacaggt agaagattgt 541 gggagatggg cgcgagaaac tccgtcttga gagggaaaaa agcagatgaa ttagaaagaa 601 tcaggttacg gcccggcgga aagaaaaagt acaggctaaa acatattgtg tgggcagcga 661 ataaattgga cagattcgga ttagcagaga gcctgttgga gtcaaaagag ggttgtcaaa 721 aaattcttac agttttagat ccaatggtac cgacaggttc agaaaattta aaaagtcttt 781 ttaatactgt ctgcgtcatt tggtgcatac acgcagaaga gaaagtgaaa gatactgaag 841 gagcaaaaca aatagtgcgg agacatctag tggcagaaac aggaactgca gagaaaatgc 901 caagcacaag tagaccaaca gcaccatcta gcgagaaggg aggaaattac ccagtgcaac 961 atgtaggcgg caactacacc catataccgc tgagtccccg aaccctaaat gcctgggtaa 1021 aattagtaga ggaaaaaaag ttcggggcag aagtagtgcc aggatttcag gcactctcag 1081 aaggctgcac gccctatgat atcaaccaaa tgcttaattg tgtgggcgac catcaagcag 1141 ccatgcagat aatcagggag attatcaatg aggaagcagc agaatgggat gtgcaacatc 1201 caataccagg ccccttacca gcggggcagc ttagagagcc aaggggatct gacatagcag 1261 ggacaacaag cacagtagaa gaacagatcc agtggatgtt taggccacaa aatcctgtac 1321 cagtaggaaa catctataga agatggatcc agataggatt gcagaagtgt gtcaggatgt 1381 acaacccgac caacatccta gacataaaac agggaccaaa ggagccgttc caaagctatg 1441 tagatagatt ctacaaaagc ttgagggcag aacaaacaga tccagcagtg aagaattgga 1501 tgacccaaac actgctagta caaaatgcca acccagactg taaattagtg ctaaaaggac 1561 tagggatgaa ccctacctta gaagagatgc tgaccgcctg tcagggggta ggtgggccag 1621 gccagaaagc tagattaatg gcagaggccc tgaaagaggt cataggacct gcccctatcc 1681 cattcgcagc agcccagcag agaaaggcat ttaaatgctg gaactgtgga aaggaagggc 1741 actcggcaag acaatgccga gcacctagaa ggcagggctg ctggaagtgt ggtaagccag 1801 gacacatcat gacaaactgc ccagatagac aggcaggttt tttaggactg ggcccttggg 1861 gaaagaagcc ccgcaacttc cccgtggccc aagttccgca ggggctgaca ccaacagcac 1921 ccccagtgga tccagcagtg gatctactgg agaaatatat gcagcaaggg aaaagacaga 1981 gagagcagag agagagacca tacaaggaag tgacagagga cttactgcac ctcgagcagg 2041 gggagacacc atacagggag ccaccaacag aggacttgct gcacctcaat tctctctttg 2101 gaaaagacca gtagtcacag catacattga gggtcagcca gtagaagtct tgttagacac 2161 aggggctgac gactcaatag tagcaggaat agagttaggg aacaattata gcccaaaaat 2221 agtaggggga atagggggat tcataaatac caaggaatat aaaaatgtag aaatagaagt 2281 tctaaataaa aaggtacggg ccaccataat gacaggcgac accccaatca acatttttgg 2341 cagaaatatt ctgacagcct taggcatgtc attaaatcta ccagtcgcca aagtagagcc 2401 aataaaaata atgctaaagc cagggaaaga tggaccaaaa ctgagacaat ggcccttaac 2461 aaaagaaaaa atagaagcac taaaagaaat ctgtgaaaaa atggaaaaag aaggccagct 2521 agaggaagca cctccaacta atccttataa tacccccaca tttgcaatca agaaaaagga 2581 caaaaacaaa tggaggatgc taatagattt cagagaacta aacaaggtaa ctcaagattt 2641 cacagaaatt cagttaggaa ttccacaccc agcagggttg gccaagaaga gaagaattac 2701 tgtactagat gtaggggatg cttacttttc cataccacta catgaggact ttagaccata 2761 tactgcattt actctaccat cagtgaacaa tgcagaacca ggaaaaagat acatatataa 2821 agtcttgcca cagggatgga agggatcacc agcaattttt caacacacaa tgagacaggt 2881 attagaacca ttcagaaaag caaacaagga tgtcattatc attcagtaca tggatgatat 2941 cttaatagct agtgacagga cagatttaga acatgatagg gtagtcctgc agctcaagga 3001 acttctaaat ggcctaggat tttctacccc agatgagaag ttccaaaaag accctccata 3061 ccactggatg ggctatgaac tatggccaac taaatggaag ttgcagaaaa tacagttgcc 3121 ccaaaaagaa atatggacag tcaatgacat ccagaagcta gtgggtgtcc taaattgggc 3181 agcacaactc tacccaggga taaagaccaa acacttatgt aggttaatca gaggaaaaat 3241 gacactcaca gaagaagtac agtggacaga attagcagaa gcagagctag aagaaaacag 3301 aattatccta agccaggaac aagagggaca ctattaccaa gaagaaaaag agctagaagc 3361 aacagtccaa aaggatcaag agaatcagtg gacatataaa atacaccagg aagaaaaaat 3421 tctaaaagta ggaaaatatg caaaggtgaa aaacacccat accaatggaa tcagattgtt 3481 agcacaggta gttcagaaaa taggaaaaga agcactagtc atttggggac gaataccaaa 3541 atttcaccta ccagtagaga gagaaatctg ggagcagtgg tgggataact actggcaagt 3601 gacatggatc ccagactggg acttcgtgtc taccccacca ctggtcaggt tagcgtttaa 3661 cctggtaggg gatcctatac caggtgcaga gaccttctac acagatggat cctgcaatag 3721 gcaatcaaaa gaaggaaaag caggatatgt aacagataga gggaaagaca aggtaaagaa 3781 actagagcaa actaccaatc agcaagcaga actagaagcc tttgcgatgg cactaacaga 3841 ctcgggtcca aaagttaata ttatagtaga ctcacagtat gtaatgggga tcagtgcaag 3901 ccaaccaaca gagtcagaaa gtaaaatagt gaaccagatc atagaagaaa tgataaaaaa 3961 ggaagcaatc tatgttgcat gggtcccagc ccacaaaggc atagggggaa accaggaagt 4021 agatcattta gtgagtcagg gtatcagaca agtgttgttc ctggaaaaaa tagagcccgc 4081 tcaggaagaa catgaaaaat atcatagcaa tgtaaaagaa ctgtctcata aatttggaat 4141 acccaattta gtggcaaggc aaatagtaaa ctcatgtgcc caatgtcaac agaaagggga 4201 agctatacat gggcaagtaa atgcagaact aggcacttgg caaatggact gcacacattt 4261 agaaggaaag atcattatag tagcagtaca tgttgcaagt ggatttatag aagcagaagt 4321 catcccacag gaatcaggaa gacaaacagc actcttccta ttgaaactgg caagtaggtg 4381 gccaataaca cacttgcata cagataatgg tgccaacttc acttcacagg aggtgaagat 4441 ggtagcatgg tggataggta tagaacaatc ctttggagta ccttacaatc cacagagcca 4501 aggagtagta gaagcaatga atcaccatct aaaaaaccaa ataagtagaa tcagagaaca 4561 ggcaaataca atagaaacaa tagtactaat ggcaattcat tgcatgaatt ttaaaagaag 4621 ggggggaata ggggatatga ctccatcaga aagattaatc aatatgatca ccacagaaca 4681 agagatacaa ttcctccaag ccaaaaattc aaaattaaaa gattttcggg tctatttcag 4741 agaaggcaga gatcagttgt ggaaaggacc tggggaacta ctgtggaaag gagaaggagc 4801 agtcctagtc aaggtaggaa cagacataaa aataatacca agaaggaaag ccaagatcat 4861 cagagactat ggaggaagac aagagatgga tagtggttcc cacctggagg gtgccaggga 4921 ggatggagaa atggcatagc cttgtcaagt atctaaaata caaaacaaag gatctagaaa 4981 aggtgtgcta tgttccccac cataaggtgg gatgggcatg gtggacttgc agcagggtaa 5041 tattcccatt aaaaggaaac agtcatctag agatacaggc atattggaac ttaacaccag 5101 aaaaaggatg gctctcctct tattcagtaa gaataacttg gtacacagaa aagttctgga 5161 cagatgttac cccagactgt gcagatgtcc taatacatag cacttatttc ccttgcttta 5221 cagcaggtga agtaagaaga gccatcagag gggaaaagtt attgtcctgc tgcaattatc 5281 cccgagctca tagagcccag gtaccgtcac ttcaatttct ggccttagtg gtagtgcaac 5341 aaaatgacag accccagaga gacagtacca ccaggaaaca gcggcgaaga gactatcgga 5401 gaggccttcg cctggctaaa caggacagta gaagccataa acagagaagc agtgaatcac 5461 ctaccccgag aacttatttt ccaggtgtgg cagaggtcct ggagatactg gcatgatgaa 5521 caagggatgt cagaaagtta cacaaagtat agatatttgt gcataataca gaaagcagtg 5581 tacatgcatg ttaggaaagg gtgtacttgc ctggggaggg gacatgggcc aggagggtgg 5641 agaccagggc ctcctcctcc tccccctcca ggtctggtct aatggctgaa gcaccaacag 5701 agctcccccc ggtggatggg accccactga gggagccagg ggatgagtgg ataatagaaa 5761 tcttgagaga aataaaagaa gaagctttaa agcattttga ccctcgcttg ctaattgctc 5821 ttggcaaata tatctatact agacatggag acacccttga aggcgccaga gagctcatta 5881 aagtcctgca acgagccctt ttcacgcact tcagagcagg atgtggccac tcaagaattg 5941 gccagacaag gggaggaaat cctctctcag ctataccgac ccctagaaac atgcaataac 6001 tcatgctatt gtaagcgatg ctgctaccat tgtcagatgt gttttctaaa caaggggctc 6061 gggatatgtt atgaacgaaa gggcagacga agaaggactc caaagaaaac taagactcat 6121 ccgtctccta caccagacaa gtgagtatga tgaatcagct gcttattgcc attttattag 6181 ctagtgcttg cttagtatat tgcacccaat atgtaactgt tttctatggc gtacccacgt 6241 ggaaaaatgc aaccattccc ctcttttgtg caaccagaaa tagggatact tggggaacca 6301 tacagtgctt gcctgacaat gatgattatc aggaaataac tttgaatgta acagaggctt 6361 ttgatgcatg gaataataca gtaacagaac aagcaataga agatgtctgg catctattcg 6421 agacatcaat aaaaccatgt gtcaaactaa cacctttatg tgtagcaatg aaatgcagca 6481 gcacagagag cagcacaggg aacaacacaa cctcaaagag cacaagcaca accacaacca 6541 cacccacaga ccaggagcaa gagataagtg aggatactcc atgcgcacgc gcagacaact 6601 gctcaggatt gggagaggaa gaaacgatca attgccagtt caatatgaca ggattagaaa 6661 gagataagaa aaaacagtat aatgaaacat ggtactcaaa agatgtggtt tgtgagacaa 6721 ataatagcac aaatcagacc cagtgttaca tgaaccattg caacacatca gtcatcacag 6781 aatcatgtga caagcactat tgggatgcta taaggtttag atactgtgca ccaccgggtt 6841 atgccctatt aagatgtaat gataccaatt attcaggctt tgcacccaac tgttctaaag 6901 tagtagcttc tacatgcacc aggatgatgg aaacgcaaac ttccacatgg tttggcttta 6961 atggcactag agcagagaat agaacatata tctattggca tggcagagat aatagaacta 7021 tcatcagctt aaacaaatat tataatctca gtttgcattg taagaggcca gggaataaga 7081 tagtgaaaca aataatgctt atgtcaggac atgtgtttca ctcccactac cagccgatca 7141 ataaaagacc cagacaagca tggtgctggt tcaaaggcaa atggaaagac gccatgcagg 7201 aggtgaagga aacccttgca aaacatccca ggtatagagg aaccaatgac acaaggaata 7261 ttagctttgc agcgccagga aaaggctcag acccagaagt agcatacatg tggactaact 7321 gcagaggaga gtttctctac tgcaacatga cttggttcct caattggata gagaataaga 7381 cacaccgcaa ttatgcaccg tgccatataa agcaaataat taacacatgg cataaggtag 7441 ggagaaatgt atatttgcct cccagggaag gggagctgtc ctgcaactca acagtaacca 7501 gcataattgc taacattgac tggcaaaaca ataatcagac aaacattacc tttagtgcag 7561 aggtggcaga actatacaga ttggagttgg gagattataa attggtagaa ataacaccaa 7621 ttggcttcgc acctacaaaa gaaaaaagat actcctctgc tcacgggaga catacaagag 7681 gtgtgttcgt gctagggttc ttgggttttc tcgcaacagc aggttctgca atgggcgcgg 7741 cgtccctgac cgtgtcggct cagtcccgga ctttactggc cgggatagtg cagcaacagc 7801 aacagctgtt ggacgtggtc aagagacaac aagaactgtt gcgactgacc gtctggggaa 7861 cgaaaaacct ccaggcaaga gtcactgcta tagagaagta cctacaggac caggcgcggc 7921 taaattcatg gggatgtgcg tttagacaag tctgccacac tactgtacca tgggttaatg 7981 attccttagc acctgactgg gacaatatga cgtggcagga atgggaaaaa caagtccgct 8041 acctggaggc aaatatcagt aaaagtttag aacaggcaca aattcagcaa gagaaaaata 8101 tgtatgaact acaaaaatta aatagctggg atatttttgg caattggttt gacttaacct 8161 cctgggtcaa gtatattcaa tatggagtgc ttataatagt agcagtaata gctttaagaa 8221 tagtgatata tgtagtacaa atgttaagta ggcttagaaa gggctatagg cctgttttct 8281 cttccccccc cggttatatc caacagatcc atatccacaa ggaccgggga cagccagcca 8341 acgaagaaac agaagaagac ggtggaagca acggtggaga cagatactgg ccctggccga 8401 tagcatatat acatttcctg atccgccagc tgattcgcct cttgaccaga ctatacagca 8461 tctgcaggga cttactatcc aggagcttcc tgaccctcca actcatctac cagaatctca 8521 gagactggct gagacttaga acagccttct tgcaatatgg gtgcgagtgg atccaagaag 8581 cattccaggc cgccgcgagg gctacaagag agactcttgc gggcgcgtgc aggggcttgt 8641 ggagggtatt ggaacgaatc gggaggggaa tactcgcggt tccaagaagg atcagacagg 8701 gagcagaaat cgccctcctg tgagggacgg cagtatcagc agggagactt tatgaatact 8761 ccatggaagg acccagcagc agaaagggag aaaaatttgt acaggcaaca aaatatggat 8821 gatgtagatt cagatgatga tgaccaagta agagtttctg tcacaccaaa agtaccacta 8881 agaccaatga cacatagatt ggcaatagat atgtcacatt taataaaaac aaggggggga 8941 ctggaaggga tgttttacag tgaaagaaga cataaaatct taaatatata cttagaaaag 9001 gaagaaggga taattgcaga ttggcagaac tacactcatg ggccaggagt aagataccca 9061 atgttctttg ggtggctatg gaagctagta ccagtagatg tcccacaaga aggggaggac 9121 actgagactc actgcttagt acatccagca caaacaagca agtttgatga cccgcatggg 9181 gagacactag tctgggagtt tgatcccttg ctggcttata gttacgaggc ttttattcgg 9241 tacccagagg aatttgggca caagtcaggc ctgccagagg aagagtggaa ggcgagactg 9301 aaagcaagag gaataccatt tagttaaaga caggaacagc tatacttggt cagggcagga 9361 agtaactaac agaaacagct gagactgcag ggactttcca gaaggggctg taaccaaggg 9421 agggacatgg gaggagctgg tggggaacgc cctcatattc tctgtataaa tatacccgct 9481 agcttgcatt gtacttcggt cgctctgcgg agaggctggc agattgagcc ctgggaggtt 9541 ctctccagca gtagcaggta gagcctgggt gttccctgct agactctcac cagcacttgg 9601 ccggtgctgg gcagacggcc ccacgcttgc ttgcttaaaa acctccttaa taaagctgcc 9661 agttagaagc a // LOCUS HIV2ST 9672 bp ss-RNA VRL 04-AUG-1990 DEFINITION Human immunodeficiency virus type 2 (HIV-2), complete proviral genome. ACCESSION M31113 KEYWORDS complete genome. SOURCE Human immunodeficiency virus type 2 (isolate HIV-2/ST) proviral DNA, clone JSP4-27. REFERENCE 1 (bases 1 to 9672) AUTHORS Kumar,P., Hui,H., Kappes,J.C., Haggarty,B.S., Hoxie,J.A., Arya,S.K., Shaw,G.M. and Hahn,B. TITLE Molecular characterization of an attenuated human immunodeficiency virus type 2 isolate JOURNAL J. Virol. 64, 890-901 (1990) STANDARD full staff_review REFERENCE 2 (sites; corrections bases 6952, 7003, 7926) AUTHORS Hahn,B. JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Kindly provided in computer-readable form by B. Hahn, University of Alabama, Birmingham. The corrections of nts 6952, 7003 and 7926, from 'c' to 't' result in amino acid changes. This isolate, obtained from an asymptomatic Senegalese woman, has low pathogenicity, infecting cells at a relatively slow rate and causing little cell death or fusion. FEATURES from to/span description pept 548 2113 gag polyprotein pept < 1717 1767 pol polyprotein (NH2 terminus uncertain; AA at 1717) 1771 4938 pol polyprotein pept 4868 5515 vif protein pept 5343 5681 vpX protein pept 5681 5776 vpR protein 5780 5998 vpR protein pept 5844 6139 tat protein, exon 2 (first expressed exon) 8291 8387 tat protein, exon 3 (AA at 8292) pept 6070 6139 rev protein, exon 2 (first expressed exon) 8291 8544 rev protein, exon 3 (AA at 8293) pept 6149 8728 env polyprotein site 1768 1770 pol in-frame stop codon site 5777 5779 vpR in-frame stop codon revision 6952 1 t in [2]; c in [1] revision 7003 1 t in [2]; c in [1] revision 7926 1 t in [2]; c in [1] pept 8562 9329 nef protein pre-msg 1 9672 genomic mRNA pre-msg 1 9672 tat, rev, nef subgenomic mRNA rpt 9499 9672 R repeat 3' copy binding 305 321 primer (Lys-tRNA) binding site signal 9650 9655 poly-A signal BASE COUNT 3304 a 1991 c 2412 g 1965 t ORIGIN 1 agtcgctctg cggagaggct ggcagattga gccctgggag gttctctcca gcactagcag 61 gtagagcctg ggtgttccct gctagactct caccagtgct tggccggcac tgggcagacg 121 gctccacgct tgcttgctta aaagacctct taataaagct gccagttaga agcaagttaa 181 gtgtgtgctc ccatctctcc tagtcgccgc ctggtcattc ggtgttcatc taaagtaaca 241 agaccctggt ctgttaggac cctttctgct ttgggaaacc aaggcaggaa aatccctagc 301 aggttggcgc ccgaacaggg acttgaagaa gactgagaag ccttggaaca cggctgagtg 361 aaggcagtaa gggcggcagg aacaaaccac gacggagtgc tcctagaaaa gcgcaggccg 421 aggtaccaag ggcggcgtgt ggagcgggag tgaaagaggc ctccgggtga aggtaagtgc 481 ctacaccaaa tacagtagcc agaagggctt gttatcctac ctttagacgg gtagaagatt 541 gtgggagatg ggcgcgagaa actccgtctt gagagggaaa aaagcagacg aattagaaaa 601 gattaggtta cggcccggcg gaaagaaaaa atataggcta aaacatattg tgtgggcagc 661 gaatgaattg gacagattcg gattggcaga gagcctgttg gagtcaaaag agggttgcca 721 aaaaattctt acagttttag atccattagt accgacaggg tcagaaaatt taaaaagcct 781 ttttaatact gtctgcgtca tttggtgtat acacgcagaa gagaaagcga aagatactga 841 agaagcaaaa caaaaggtac agagacatct agtggcagaa acaaaaacta cagaaaaaat 901 gccaagtaca agtagaccaa cagcaccacc tagcgggaac ggaggaaact tccccgtaca 961 acaagtggcc ggcaactata cccatgtgcc actaagtccc cgaaccctaa atgcttgggt 1021 aaaactagta gaggaaaaga agttcggggc agaagtagtg ccaggatttc aggcactctc 1081 agaaggctgc acgccctatg atattaatca aatgcttaat tgtgtgggcg accatcaagc 1141 agctatgcaa ataatcaggg aaattattaa tgaagaagca gcagattggg acgcacaaca 1201 cccaatacca ggccccttac cagcggggca gctcagggag ccaaggggat ctgacatagc 1261 agggacaaca agcacagtag aagagcagat ccagtggatg tttaggccac aaaatcctgt 1321 accagtagga agcatctata gaagatggat ccagataggg ctacagaagt gcgtcaggat 1381 gtacaaccca accaacatcc tagacataaa acagggacca aaggagccat tccagagtta 1441 tgtagataga ttctacaaga gcttgagggc agaacaaaca gatccagcag taaaaaattg 1501 gatgacccaa acactgctag tgcagaatgc caacccagac tgtaagttag tactaaaagg 1561 actagggata aatcctacct tagaagaaat gctaaccgcc tgtcaggggg taggtggacc 1621 aggccagaaa gccagattaa tggcagaagc cttaaaggag gccatggcac cagcccccat 1681 cccatttgca gcagcccaac agagaaggac aattaagtgc tggaattgcg gaaaggaagg 1741 gcactcggca agacaatgcc gagcacctag aagacaaggc tgctggaaat gtggcaaggc 1801 aggacacatc atggcaaaat gcccagaaag acaggcgggt tttttagggt tgggcccatg 1861 gggaaagaag ccccgcaatt tccctgtggc ccaaatcccg caggggctga caccaacagc 1921 acccccgata gacccagtag aggacctact agagaagtac atgcagcaag ggaaaaggca 1981 gagagagcag agagagaggc catacaaaga agtgacagag gacttcctgc agctcgagaa 2041 acaagagaca ccatgcagag agacgacaga ggacttgctg cacctcaatt ctctctttgg 2101 aaaagaccag tagtcacagc acatgttgag ggccagccag tagaagtttt gctagacaca 2161 ggggctgacg actcaatagt agcaggcgta gagttaggga gcaattatag tccaaagata 2221 gtagggggaa tagggggatt cataaatacc aaagaatata aaaatgtaga aataagagta 2281 ttaaataaaa gagtaagagc caccataatg acaggtgata ccccaatcaa catttttggc 2341 agaaacattc tgacagcctt aggcatgtca ttaaatctac cagtcgccaa gatagaacca 2401 ataaaaataa tgctgaagcc aggaaaggat ggaccaaaac tgagacaatg gcccttaaca 2461 aaagaaaaaa tagaggcact aaaagagatc tgtgagaaaa tggaaagaga gggccagcta 2521 gaggaggcac ctccaactaa tccttataat acccccacat ttgcaatcaa gaaaaaggac 2581 aaaaacaaat ggagaatgct aatagatttt agagaactaa acaaggtaac tcaagacttc 2641 acagaaatcc agttaggaat tccacaccca gcaggactag ccaagaagaa acgaattact 2701 gtcctagatg taggggatgc ttacttttcc ataccactac atgaggattt tagacagtat 2761 actgcattta ctctaccatc aataaacaat gctgaaccag gaaaaagata catatataaa 2821 gtctcaccac agggatggaa gggatcacca gcaatttttc agtacacaat gaggcaggtc 2881 ttagaaccat tcagaaaagc aaacccggat atcattctca ttcagtacat ggatgatatc 2941 ttgatagcca gcgacaggac agatttagaa catgacagag tggttctgca gctaaaggaa 3001 cttctaaatg gcctgggatt ttccacccca gatgagaagt tccaaaaaga ccctccatac 3061 caatggatgg gctatgaact gtggccaact aaatggaagc tgcaaagaat acaattgccc 3121 caaaaggaag tatggacagt caatgacatc caaaaactgg tgggtgtcct aaattgggca 3181 gcacaaatct acccagggat aaagaccaga aacttatgta ggttaatcag aggaaaaatg 3241 acactcacag aagaggtaca gtggacagaa ttagcagaag cggaactaga agaaaacaaa 3301 atcatcttaa gccaggaaca agaaggatgc tattaccaag aggaaaagga gctagaagca 3361 acagtccaaa aagatcaaga caatcagtgg acatataaga tacaccaggg aggaaaaatt 3421 ctaaaagtag gaaaatatgc aaaggtaaaa aatacccaca ccaacggagt cagactccta 3481 gcacaagtag ttcaaaaaat aggaaaagaa gcactagtca tttggggacg aataccaaaa 3541 tttcacctac cagtagaaag agatacctgg gaacagtggt gggataacta ctggcaagtg 3601 acatggatcc cagactggga cttcatatct accccgccac tggtcagatt agtatttaac 3661 ctggtgaaag atcccatact aggcgcagaa accttctaca cagatggatc ctgcaataag 3721 caatcaagag aaggaaaagc aggatacata acagatagag gaagagacaa ggtgaggcta 3781 ttagagcaaa ccaccaatca gcaagcagaa ttagaagcct ttgcgatggc agtaacagac 3841 tcaggtccaa aggccaacat tatagtagac tcacaatatg taatgggaat agtagcaggc 3901 caaccaacag agtcagagag taaaatagta aatcaaatca tagaagaaat gataaaaaag 3961 gaagcaatct atgttgcatg ggtcccagcc cataaaggca taggaggaaa tcaggaggta 4021 gatcacttag taagtcaggg catcagacaa gtattattcc tagagaaaat agaacccgct 4081 caggaggaac atgaaaaata tcatagcaat gtaaaagaac tatcccataa atttggactg 4141 cccaaattag tggcaagaca aatagtaaac acatgcaccc aatgtcagca gaaaggggag 4201 gctatacatg ggcaagtaaa tgcagaatta ggcacttggc aaatggactg cacacactta 4261 gaaggaaaaa tcattatagt agcagtacat gttgcaagtg gatttataga agcagaagtc 4321 atcccacagg aatcaggaag gcaaacggca ctcttcctac taaaactggc cagtaggtgg 4381 ccaataacac atttgcacac agacaatggt gccaacttca cttcacagga agtaaagatg 4441 gtggcatggt ggataggtat agaacaatcc ttcggagtac cttacaatcc acaaagccaa 4501 ggagtagtgg aagcaatgaa tcaccaccta aaaaatcaga taagcagaat tagagagcag 4561 gcaaacacag tagaaacaat agtactaatg gcagttcatt gcatgaattt taaaaggagg 4621 ggaggaatag gggatatgac cccagcagaa agactaatca atatggtcac tgcagaacag 4681 gaaatacaat tcctccaagc aaaaaattca aaattacaaa attttcgggt ctatttcaga 4741 gaaggcagag atcagctgtg gaaaggacct ggggaactac tgtggaaggg ggacggagca 4801 gtcatagtca aggtaggggc tgacataaaa ataataccaa gaaggaaagc taagatcatc 4861 aaagactatg gaggaaggca agagatggat agcggttcca acttggaggg tgccagggag 4921 gatggagagg tggcatagcc ttatcaagta tctaaaatac agaacaggag atctagagaa 4981 ggtgtgctat gttccccacc ataaggtggg atgggcgtgg tggacttgca gcagggtaat 5041 attcccatta aaaggagaaa gtcatctgga gatacaggca tactggaacc taacaccaga 5101 aaaaggatgg ctctcctcct attcagtaag actaacttgg tatacagaaa aattctggac 5161 agatgttacc ccagactgtg cggactccct aatacatagc acttatttct cttgctttac 5221 ggcaggcgaa gtaagaagag ccatcagagg ggaaaagcta ttatcctgct gcaactaccc 5281 ccaagcccat aagtaccagg taccgtcact ccagtttctg gccttagtgg tagtgcaaca 5341 aaatggcagg ccccagagag acaataccac caggaaacag tggcgaagaa actatcggag 5401 aggccttcga gtggctagac aggacggtag aagccataaa cagagaggca gtgaaccacc 5461 tgccccgaga gcttattttc caggtgtggc aaaggtcctg gagatactgg catgatgaac 5521 aaggaatgtc aataagttac acaaagtata gatatttgtg cctaatgcag aaagctatgt 5581 tcatacattc taagagaggg tgcacttgcc tggggggagg acatgggccg ggaggatgga 5641 gatcaggacc tccccctcct ccccctccag gtctagtcta atgactgaag caccaacaga 5701 gtctcccccg gaggatagga ccccaccgag ggagccaggg gatgagtggg taatagaaac 5761 cctgagagag ataaaataag aagctttaaa gcactttgac cctcgcttgc taattactct 5821 tggcaactat atctatgcta gacatggaga cacccttgaa ggcgccagag ggctcattag 5881 gatcctacaa cgagccctcc tcttgcactt cagagcagga tgcggccgct caaggattgg 5941 tcagcccagg ggacgaaatc ctttatcagc tataccaacc cctagaggca tgcgataaca 6001 aatgttactg taaaaagtgc tgctaccatt gccagatgtg ttttttaaac aaggggctcg 6061 ggatatggta tgaacgaaag ggcagaagaa gaagaactcc gaagaaaact aaggctcatt 6121 cgtcttctgc atcagacaag tgagtaagat gtgtggtagg aatcaactat ttgttgccag 6181 cttgctagct agtgcttgct taatatattg cgtccaatat gtgactgttt tctatggcgt 6241 gcccgtgtgg agaaatgcat ccattcccct cttttgtgca actaaaaata gagatacttg 6301 gggaaccata cagtgcttgc cagacaatga tgactatcag gaaatagctt taaatgtgac 6361 agaggccttc gacgcatgga ataatacagt aacagaacaa gcagtagaag atgtctggag 6421 tctatttgag acatcaataa aaccatgcgt caaactaaca cccttatgtg tagcaatgcg 6481 ttgtaacagc acaactgcaa aaaacacaac ctccacacca acaaccacca caacagcaaa 6541 cacaacaata ggagagaatt cttcatgcat acgcacagac aactgcacag ggttgggaga 6601 agaagagatg gtcgactgtc agttcaatat gacaggatta gagagggata agaaaaaact 6661 atataatgaa acatggtact caaaagatgt agtctgtgaa tcaaatgaca ccaagaaaga 6721 gaaaacatgt tacatgaacc actgcaacac atcagtcatc acagagtcat gtgacaagca 6781 ctattgggat actatgaggt ttagatattg tgcaccaccg ggttttgccc tgctaagatg 6841 caatgatacc aattattcag gctttgagcc caattgttct aaggtagtag ctgctacatg 6901 tacaaggatg atggaaacgc aaacctccac ttggtttggc tttaatggca ctagggcaga 6961 aaatagaaca tatatctatt ggcatggtag ggataataga actatcatta gcttaaacaa 7021 gttttataat ctcaccgtac attgtaagag gccaggaaac aagacagttg taccaataac 7081 actcatgtca gggttagtgt ttcactccca gccaatcaat agaagaccca ggcaagcatg 7141 gtgctggttc aaaggcgagt ggaaggaagc catgaaggag gtgaagctaa cccttgcaaa 7201 acatcccagg tataaaggaa ccaacgacac agaaaaaatt cgttttatag cgccaggaga 7261 acgctcagac ccagaagtgg catacatgtg gactaactgc agaggagaat ttctctactg 7321 caatatgact tggttcctca attgggtaga aaacagaacg aatcagacac agcacaatta 7381 tgtgccatgc catataaagc aaataattaa tacctggcac aaggtaggga aaaatgtata 7441 tttgcctcct agggaaggac agttaacctg caactctaca gtgaccagca taattgctaa 7501 cattgacgga ggagagaacc agacaaatat tacctttagt gcagaggtgg cagaactata 7561 ccgattagaa ttgggggatt ataaattgat agaagtaaca ccaattggct ttgcacctac 7621 accagtaaaa agatactcct ctgctccagt gaggaataaa agaggtgtat tcgtgctagg 7681 gttcttaggt tttctcacga cagcaggagc tgcaatgggc gcggcgtcct tgacgctgtc 7741 ggctcagtct cggactttat tggccgggat agtgcagcaa cagcaacagc tgttggacgt 7801 ggtcaagaga caacaagaaa tgttgcgact gaccgtctgg ggaacaaaaa atctccaggc 7861 aagagtcact gctatcgaga aatacttaaa ggaccaggcg caactaaatt catggggatg 7921 tgcgtttaga caagtctgcc acactactgt accatgggta aatgacacct taacgcctga 7981 ttggaacaac atgacatggc aggaatggga gcaacgaatc cgcaacctag aggcaaatat 8041 cagtgaaagt ttagaacagg cacaaatcca gcaagaaaag aacatgtatg aactacaaaa 8101 attaaatagc tgggatgttt ttggcaactg gtttgattta acctcctgga tcaaatatat 8161 tcagtatgga gtttatatag tagtaggaat aatagtttta agaatagtaa tatatgtagt 8221 acaaatgtta agtagactta gaaagggcta taggcctgtt ttctcttccc cccccgctta 8281 cttccaacag atccatatcc acaaggaccg ggaacagcca gccagagaag aaacagaaga 8341 agacgttgga aacagcgttg gagacaattg gtggccctgg ccgataagat atatacattt 8401 cctgatccgc cagctgattc gcctcttgaa cagactatac aacatctgca gggacttact 8461 atccaggagc ttccagaccc tccaactaat ctcccagagt cttcggagag cattgacagc 8521 agtcagagac tggctgagat ttaacacagc ctacctgcaa tatgggggcg agtggatcca 8581 agaagcgttc cgagccttcg cgagggctac gggagagact cttacaaacg cctggagagg 8641 cttctggggg acactgggac aaattgggag gggaatactt gcagtcccaa gaaggatcag 8701 gcagggggca gaaatcgccc tcctgtgagg gacggcggta tcaacaggga gattttatga 8761 ataccccatg gagagcccca gcagaagggg agaaaggctc gtacaagcaa caaaatatgg 8821 atgatgtaga ttcagatgat gatgacctag taggggtccc tgtcacacca agagtaccat 8881 taagagaaat gacatatagg ttggcaagag atatgtcaca tttgataaaa gaaaaggggg 8941 gactggaagg gctgtattac agtgatagga gacgtagagt cctagacata tacttagaaa 9001 aggaagaggg aataattgga gactggcaga actatactca tggaccagga gtaaggtatc 9061 caaagttctt tgggtggtta tggaagctag taccagtaga tgtcccacaa gagggagatg 9121 acagtgagac tcactgctta gtgcatccag cacaaacaag caggtttgat gacccgcatg 9181 gagaaacatt agtttggagg tttgacccca cgctagcttt tagctacgag gcctttattc 9241 gatacccaga ggagtttggg tacaagtcag gcctgccaga ggatgaatgg aaggcaagac 9301 tgaaagcaag agggataccg tttagctaaa aacaggaaca gctatacttg gtcagggcag 9361 gaagtaacta acagaaaaca gctgagactg cagggacttt ccagaagggg ctgttaccag 9421 gggagggaca tgggaggagc cggtggggaa cgccctcata ctttctgtat aaatgtaccc 9481 gctactcgca ttgtattcag tcgctctgcg gagaggctgg cagattgagc cctgggaggt 9541 tctctccagc actagcaggt agagcctggg tgttccctgc tagactctca ccagtgcttg 9601 gccggcactg ggcagacggc tccacgcttg cttgcttaaa agacctctta ataaagctgc 9661 cagttagaag ca // LOCUS HIVBOS1 253 bp ss-RNA VRL 04-AUG-1990 DEFINITION Human immunodeficiency virus type 1, envelope region. ACCESSION M27910 KEYWORDS . SOURCE Human immunodeficiency virus type 1 (isolate 1153), clone 1153-1. ORGANISM Human immunodeficiency virus type 1 Viridae; ss-RNA enveloped viruses; Positive strand RNA virus; Retroviridae; Lentivirinae. REFERENCE 1 (bases 1 to 253) AUTHORS Ou,C.-Y.et.al. JOURNAL Unpublished (1989) STANDARD full staff_review COMMENT [1] kindly submitted in computer-readable form by C.-Y. Ou, CDC, Atlanta. The sequence below, designated 1153-1, appears to be a major form of 11 sibling sequences taken from a Boston isolate. BASE COUNT 98 a 46 c 54 g 55 t ORIGIN Approximately 1170 nt from env start (BRU coordinates). 1 ttcaacacaa ctgtttaata gtacttggac ttggaatggt actgaaaggt caaataacac 61 tgaaggaaat atcacactcc catgcagaat aaaacaaatt ataaacatgt ggcaggaagt 121 aggaaaagca atgtatgccc ctcccatcag aggacaaatt agatgttcat caaatattac 181 agggctacta ttaacaagag atggtggtaa tgagaccaac gagaccgaga ccttcagacc 241 tggaggagga aat // LOCUS HIVJFL 2553 bp ss-RNA VRL 04-AUG-1990 DEFINITION Human immunodeficiency virus type 1 proviral env gene, 5' end. ACCESSION M31451 KEYWORDS envelope protein. SOURCE Human immunodeficiency virus type 1 (isolate JFL) proviral DNA, clone I-BL2. ORGANISM Human immunodeficiency virus type 1 Viridae; ss-RNA enveloped viruses; Positive strand RNA virus; Retroviridae; Lentivirinae. REFERENCE 1 (bases 1 to 2553) AUTHORS McNearney,T., Westervelt,P., Thielan,B., Trowbridge,D.B., Garcia,J., Whittier,R. and Ratner,L. TITLE Limited sequence heterogeneity among biologically distinct human immunodeficiency virus type 1 isolates from individuals involved in a clustered infectious outbreak JOURNAL Proc. Natl. Acad. Sci. U.S.A. 87, 1917-21 (1990) STANDARD full staff_review COMMENT This sequence was kindly provided prior to publication by Dr. Lee Ratner, Washington University School of Medicine, St. Louis. Clone I-BL2 and twelve other clones sequenced from seven isolates, taken from one donor and two TA-AIDS recipients, all have pre-mature stop codons in env. The isolate itself is described as monocytropic. FEATURES from to/span description pept 1 1851 envelope polyprotein 1855 1992 envelope polyprotein 1996 > 2553 envelope polyprotein site 1852 1854 envelope polyprotein in-frame stop codon site 1993 1995 envelope polyprotein in-frame stop codon BASE COUNT 890 a 435 c 603 g 625 t ORIGIN Env start codon. 1 atgagagtga aggagatcag gaagaattgt cagcacttat ggagatgggg caccatgctc 61 cttggaatgt tgatgatctg tagtgctgca ggaaaattgt gggtcacagt ctattatggg 121 gtacctgtat ggaaagaagc aaccaccact ctattttgtg catcagacgc taaagcatat 181 gacacagagg tacataatgt ttgggccaca catgcctgtg tacccacaga ccctaaccca 241 caagaagtaa aattggaaaa tgtgacagaa aattttaaca tgtggaaaaa tgacatggta 301 gaacagatgc atgaggatat aatcagttta tgggatcaaa gcctaaagcc atgtgtaaaa 361 ttaaccccac tctgtgttac tttaaattgc attaatttga agaatgttac taatatcaat 421 aatagttggg gaaagatgga ggaaggagaa ataaaaaact gctctttcaa taccaccaca 481 agcataaaaa ataagatgca gagagaatat gcactttttt ataaacttga tgtagtacca 541 atagataagg ataatacaag ctatacattg ataaattgta acacctcaac cattacacag 601 gcctgtccaa aggtatcctt tgaaccaatt ccaatacatt attgtacccc ggctggtttt 661 gcgattctaa agtgtaatga taagaagttc aatggaaaag gaccatgtaa aaatgtcagc 721 acagtacaat gtacacatgg aattaggcca gtagtgtcaa ctcacttgct gttaaatggc 781 agtctagcag aaggagaggt agtaattaga tctgaaaatt tcatggacaa tgctaaaacc 841 ataatagtac agctgaataa atctatagca attaattgta caagacctaa caacaataca 901 agaaaaagta tcactctagg accagggagg gcattttata caacaggaga cataatagga 961 gacataagac aagcacattg taaccttagt agagtacaat ggaataacac tttaaaacag 1021 atagttataa agttaagaga acagtttggg aataaaacaa tagtctttaa gcaatcctca 1081 ggaggggacc cagaaattgt aatgcacact tttaattgtg gaggggaatt tttctactgt 1141 aattcaacac agctgtttaa tagtacttgg ctgtccaatg gtacttggaa tgatactgaa 1201 gagtcaaaag acactagaga acctatcata ctcccatgca gaataaaaca aattataaac 1261 aggtggcagg aagtaggaaa agcaatgtat gcccctccca tcaacggaca aattagatgt 1321 tcatcaaata ttacagggct actattaaca agagatggtg gtaaagagaa tggggccgag 1381 atcttcagac ctggaggagg aaatatgagg gacaattgga gaagtaaatt atataaatat 1441 aaagtagtaa aaattgagcc attaggagta gcacccacca aggcaaagag aagagtggtg 1501 cagagaaaaa aaagagcagt gggaacgata ggagctatgt tccttgggtt cttgggagca 1561 gcaggaagca ctatgggcgc agcgtcgatg acgctgacgg tacaggccag actattattg 1621 tctggtatag tgcaacagca aaacaatttg ctgagggcta ttgaggcgca acagcatctg 1681 ttgcaactca cagtctgggg catcaagcag ctccaggcaa gagtcctggc tgtggaaaga 1741 tacctacagg atcaacagct cctggggatt tggggttgct ctggaaaact catttgcacc 1801 actactgtgc cttggaatgc tagttggagt aataaatctc tggatgagat ttaaaataac 1861 atgacctgga tgcagtggga gagagaaatt agcaattaca caagcctaat atacacctta 1921 attgaagaat cgcagaacca acaagaaaag aatgaactag aattattgga attggataag 1981 tgggcaagtt tgtgaaattg gtttgacata acaaactggc tgtggtctat aaaaatattc 2041 ataatgatag tagcaggttt agtaggttta agaatagttt ttgctgtgct ttctatagtg 2101 aatagagtta ggcagggata ctcaccattg tcattccaga cccgcctccc aaccccgagg 2161 ggacccgaca ggcccggagg aatcgaagaa gaaggtggag agagagacag agacagatcc 2221 ggttacttag tgactggatt cttagcactt atctgggaag acctgtggag cctgtgcctc 2281 ttcagctacc accgcttgag agacttactc ttgattgtaa cgaggattgt ggaactcctg 2341 ggacgcaggg ggtgggaagc cctcaaatat tggtggaatc tcctacagta ttggagtcag 2401 gaactaaaga atagtgctgt tagtttgctt aacgctacag ctatagcagt agctgagggg 2461 acagatagga ttatagaagt agtacaaaga attggtagag ctatcctcca catacctgta 2521 agaataagac agggcttaga aagggctttg cta // LOCUS RATPKATA1 4644 bp ds-DNA ROD 04-AUG-1990 DEFINITION Rat peroxisomal 3-ketoacyl-CoA thiolase (EC 2.3.1.16) A gene, exons 1-5. ACCESSION D90055 J05269 KEYWORDS 3-ketoacyl-CoA thiolase; fatty acid beta-oxidation; peroxisomal protein. SEGMENT 1 of 4 SOURCE Rat (Sprague-Dawley) liver DNA, clone PT147. ORGANISM Rattus norvegicus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 4644) AUTHORS Hijikata,M., Wen,J.-K., Osumi,T. and Hashimoto,T. TITLE Rat Peroxisomal 3-ketoacyl-CoA Thiolase Gene: Occurrence of twO closely related but differentially regulated genes JOURNAL J. Biol. Chem. 265, 4600-4606 (1990) STANDARD full staff_entry COMMENT These data kindly submitted in computer readable form by: Makoto Hijikata Virology Division, National Cancer Center Research Institute Tsukiji 5-1-1 Chuo-ku, Tokyo 104 Japan Phone: (03)542-2511 Fax: (03)545-3567 The sequence 1487-1594 encodes presumptive leader peptide. FEATURES from to/span description pept 1487 1687 3-ketoacyl-CoA thiolase, exon 1 1827 1920 3-ketoacyl-CoA thiolase, exon 2 2800 2857 3-ketoacyl-CoA thiolase, exon 3 4064 4143 3-ketoacyl-CoA thiolase, exon 4 4381 + 4423 3-ketoacyl-CoA thiolase, exon 5 pre-msg 1373 > 4644 3-ketoacyl-CoA thiolase mRNA and introns IVS 1688 1826 3-ketoacyl-CoA thiolase, intron A IVS 1921 2799 3-ketoacyl-CoA thiolase, intron B IVS 2858 4063 3-ketoacyl-CoA thiolase, intron C IVS 4144 4380 3-ketoacyl-CoA thiolase, intron D signal 1223 1228 GC box signal 1263 1268 GC box signal 1295 1300 GC box signal 1346 1351 GC box signal 1202 1206 CAAT box signal 1217 1221 CAAT box signal 1327 1331 CAAT box signal 1238 1244 enhancer core signal 1464 1470 enhancer core BASE COUNT 1013 a 1303 c 1238 g 1090 t ORIGIN 1 ggatcccacg cggggcctcc cgcagacatg gcaagcaacc ctgggccccg gttctgtacg 61 gtgcgctcgc ttgttgagct tcctacttcc cttaaggctc ctgccaaccc ctcctccttt 121 cgctttccga gaagctccgc cccttgctag aatctggaat ccggacccac aacacgcagc 181 ctgaaattca acccacttga gttctcacct ccccttgaag gtgtgagagc ttatagaaac 241 cgcctctgac gtccgccttg gcccaccgac ctgtcctggg cgatctggta gtggtggcta 301 tgcgtggtga agtgattcag ccttgtttag gacttgacag gagttacagc tcttccgcac 361 agtaatttag attctggtaa tagggaggga agagacggac tggcttcgta agcagtaacc 421 tgaaggctct gtctacctta gccatctggg acaagtggtg accgtgcttt ttaaagggaa 481 agtccacgac cttgcccgag aatagacatt tagggcatcc tgtcctttgt tatcttagac 541 agacaacact accttccgag ggtctgcaaa aagcacacgg acctatcact ttactcttta 601 acgttgggct tagaccaagg aagagtgttt cgttatttcg ctcggttaat ttacttaggt 661 gtcaatcacc agtactagat ctcccccttc tcctcccccg ggcttagtag aattccttgg 721 ggaaagtggg cattaatggc gttatcccaa acgaacgtag aacaggctta agacatgtgg 781 ggcagtgcca cagttagaac ctgcctgcca ttcagcacct tcacaccagc cggacaccca 841 aaattaattt gttttctgag agagcttctg tgacagccca atagggttgg ggttgaacta 901 agtacctcaa cattatctag gtcctcggtt ctgggttttg agacccgggg ccgagcacag 961 caaattttga ggagaaacaa gtgtcttccc tggctcttcc agcaaaggga gaagccctga 1021 gtcgagctcc gccttccacg ccgctgcgcg gtgacctcag aaggagggac tccctcgaga 1081 ctgagttcct cttaggaaac cagaagtttg gggcgcctcc tttccgtgag tctaccaaag 1141 gttgtgcaaa cactagtttc cagtctgcct ttctagggga ggattaagag tacgccttat 1201 gattggcttc cgtaagattg gcgggcgggg ctttgcgtgg aaagagtctt aggacgggtc 1261 ttgggcgggg ctctggggca aggggtgagt gggtgggcgg ccctcaagca gacacccctt 1321 aatctaattg gttgtgtcta agagtgggcg ggccctctaa agaccccatc gcaccactct 1381 cagcattatc ctacccgagc tctgatccac tttcaggcct cgtgaggtag agggctggcc 1441 tgcgcctgcg cctgccatca ttttggtttg ttaagcaagg cagagcatga gcgagtcggt 1501 gggacgcacc tccgcgatgc atcggctgca ggtagtgctg ggccacctgg ccggccgacc 1561 cgagtcgagc tccgcgctgc aagccgcgcc ctgctccgct accttcccgc aggcttcggc 1621 ctccgacgtg gtggtggtgc acggacggcg cacccccatc ggccgcgccg gccgcggcgg 1681 cttcaaggtg aggcccctcg ggcgggcgag aggagcagaa ggcggccaga gtgggtggga 1741 tcctgccctc gggggtccgg tgagcgcaga gacggcgggt ggggatgaag gagttgggct 1801 caccgctgct gcacctcgcc ctgcaggaca ccacccccga cgagcttctg tcggccgtgt 1861 tgaccgcggt tctccaggac gtgaagctaa agcctgagtg tttgggagac atctctgtgg 1921 gtgagctccc ctgagggtcc tgcaccaccc cacccagcct ctctttctta ccagaacctg 1981 ggtttcactg agcaagacct acggattcca ccgatttcag tgggatggtc aaagggccct 2041 tttacagggt tcgcctaaga tcattgggaa acacagacat ctacattatg gttcataaca 2101 gtagcagaat tacagttctg aagtagcaat gaaaaataat tttgtggttg ggggtcacca 2161 cagcatgagg aactgtatta aaagggtcac agcactaaga tagttgagaa ccattgatct 2221 aaagcccaga agttcctgga gacttccagg aggtgagaaa acgcagttct ctacctacat 2281 gaacatcctg acccgggtac gggctgcaga tgggttcacc ccaccctgcg ctatccttga 2341 gttacaacag agtataggac atggttttgt ccccaggtgc cagcctgctc ctcgtgcctg 2401 caccatcgca gggtttggag gaggatactt ggggaggtga acagttgcag gatctgacct 2461 ctttcaccca cagcccgtct tctcacagcg ttcaggctgt gcactgtgtc tcttacccta 2521 gtgtctctcc tactaccgag caagcacccc actccccacc ccaaccgtga tttagaagag 2581 cataccttgg ccccttcagc actgggctgg tagatgactg gctgttgggt agcttggctc 2641 cccattttcc agactccaga cttctatagc ttcacacgag cctgggaaaa tcctaaagtg 2701 atccgttaga aagatctgga atgtttcctc tctggaactc acagctgaac ctcagtgtgc 2761 agaacactac ttggtcctga ataaggtttc tatctacagg taacgtactt gagccaggag 2821 ccggagcagt catggcgcgc attgcccaat ttctgaggta acccttctac ctctggtggg 2881 ggattctacc accatgaacc agtccgtagg tgacaggtta tcaggttgga cagtagatag 2941 caagtcagcc agaatccttg gacccaacag tgctcctact gggttaacca acccagaagc 3001 aggataccca ggctttggga ccagagccaa ggtgaccaga tgcttcccag ggccgaactt 3061 ttgttaccct tggaagcatg gactctgacc tcatatgtca gataaataat cacgtgacca 3121 cttgatcagg atctttactt catttgctac ttgagctgcc tgagaaccct gggctcagtc 3181 agttatcaga agcttgacgg tagtggggtg ggggaagcag ggtctggaca tcttggccct 3241 tcaggtaggg aagcagtagc tcatttgcag gattaagagc agcagtagcc agagtccaaa 3301 ccggggaaaa ccttttgtct agagacacgg aaagggtcag acttagcctt cctgtccctc 3361 ttgtaacttt cctgttcatc tctcacctgc ttcttcattt tgactttccc ttctctgctg 3421 tactggccac cctctcccta tacactcgct ctgcctattc tgtcccaata gagagctctc 3481 tccctgctca cctagctctg gccagtccct gcccagacac ctcggaagct ctcaagcatc 3541 agatgagtga ccttcttgtc caccaagtca ctatgtgcct ccctgtcgct aagacacttg 3601 caatggaggg tctggaaaca catgcctgct accttgagct tctaagctca ggcaggggga 3661 gctcgtttgg tccacccaca ggcactcttc ctctccttca gaagttttga caaaggaaag 3721 gtagacatct ttgcccacaa agagactagc caccagccta agagctaggt ctcatgggca 3781 aggcttccaa ccatctcctc agttgcaagt ggctgtagcg acagggtttg cctagtagtt 3841 gacccctctg catctatcta ccctagtggg agcaaggcca gctgttacct cctggtctgt 3901 acactgctgt gggccataat gggaccctaa agactcgtgg gtcctgcttt ccatgacaca 3961 gggtcgagtc cggggctgac cacacatcca aggatgacta accaccatct cccccaggga 4021 tggctgcaca tccaggctga ccgccatctc ccctctgttt cagtggcatc ccagagaccg 4081 tgcctctgtc agcagtcaac agacagtgtt catcgggact gcaggcagtg gccaacattg 4141 ctggtaagta gtgattcctg ggtgcccatg agtggaccct gaccttggtc cttcctcagg 4201 accctaggca ggaagagtgg aggggcaccc attgccacac cccaagcctt tttgggtgct 4261 cccgggtctt ccatgaggct ccgttgaatg cacccctgag aacagggagg gaacgggctg 4321 tgctgctccc agggcgggcc cgctccggcc aaggataatt atggagtgct ttcgttgcag 4381 gtggcatcag aaatgggtct tacgacattg gcatggcctg tgggtaagat ggccttccgc 4441 cgggccttat taaccaacag caataatcaa actaatttct ctctcagttc ggaggggccg 4501 tgattccagg cttctccccg cccacgaaga cagctgaata gggtgggctt ccagaatgtg 4561 tctaggagtg gcccaaccct ggctgctcag gaatgggcca atcttagcac aagcagtaac 4621 cggaattggg tcctctaagg tacc // LOCUS RATPKATA2 1205 bp ds-DNA ROD 04-AUG-1990 DEFINITION Rat peroxisomal 3-ketoacyl-CoA thiolase (EC 2.3.1.16) A gene, exon 6. ACCESSION D90056 J05269 KEYWORDS 3-ketoacyl-CoA thiolase; fatty acid beta-oxidation; peroxisomal protein. SEGMENT 2 of 4 SOURCE Rat (Sprague-Dawley) liver DNA, clone PT147. ORGANISM Rattus norvegicus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 1205) AUTHORS Hijikata,M., Wen,J.-K., Osumi,T. and Hashimoto,T. TITLE Rat Peroxisomal 3-ketoacyl-CoA Thiolase Gene: Occurrence of two closely related but differentially regulated genes JOURNAL J. Biol. Chem. 265, 4600-4606 (1990) STANDARD full staff_entry COMMENT These data kindly submitted in computer readable form by: Makoto Hijikata Virology Division, National Cancer Center Research Institute Tsukiji 5-1-1, Chuo-ku Tokyo 104 Japan Phone: (03)542-2511 Fax: (03)545-3567 FEATURES from to/span description pept + 283 + 381 3-ketoacyl-CoA thiolase, exon 6 pre-msg < 1 > 1205 3-ketoacyl-CoA thiolase mRNA and introns IVS < 1 282 3-ketoacyl-CoA thiolase intron D IVS 382 > 1205 3-ketoacyl-CoA thiolase intron E BASE COUNT 241 a 351 c 302 g 311 t ORIGIN 1 ggtaccagta ggctgtgcta gggactgcct gtttcctgca tctgccctgg gagtgtgtct 61 ggtagagtgc cgggtcctaa gcttgaggga aagaaggctc ggctaccttg taacctccag 121 ctgcccttct caggcaggct tcctagagag cttacctcac tccccaaggg ctccctgggc 181 ctttattccc tagcacttgt cacacagatg gctgtcccta gggcttccta tgaaaactga 241 gagctcattc cttcttcttc tccccctctt cctcctcctc agggtggagt ccatgtccct 301 gtctaacaga gggaaccctg ggaatatttc ctcccgcctg ctggagagtg acaaagccag 361 agactgcctg attcctatgg ggtgagtgtt catgggaagc tgtgtatgta tccactcacc 421 gttctgcctg tccttcacca gacagcagca tactcttctc tgactggcag tactcagctt 481 cctcttcttc ctctgactgt ctcatgctcg acgggaactg ggtatgagat caatgctgcc 541 cttgacctcg ctttcctgag gtccccccta tacaacacac acacacacac acacacacac 601 acacacacac acactcgttc ctctccaact ctacaaatgt ccaaatgcac ctcactctac 661 attactgtgc ctgaggtgcc tgtgtcttag ggaaggcagt gggagtcctc aaagccttgg 721 gaggcctcaa tagttgaaga cacggggagc ttggcttttg ttattgcggg ggagggctcg 781 ggctctgaga taccttgtaa gaagtctgtt ccatctctcc agtgctttat aattctgcat 841 ccgtgtctct gagctctcct cagtttgtcc aagggcaagg cattctggga ggctcaccat 901 aggagaacca taggctggcc tggctttctc cacatcctgt gtgtttccaa gagcagccta 961 gaagccgcag agaaactggg gccagatagt gtccccagca tcctcctgcc tatgtcttat 1021 gggcagagac gactcacccg gggggaacag ttccttctgt aggccccacg ttcaggctgc 1081 tcttcattct ttgaggtgtg agtctttgag catccgtgga catgggcgct cagtatttaa 1141 gatgagccag gagtccctgg ggcgtggact gaagctctgt ctctgacgtc gacgccagag 1201 gtcca // LOCUS RATPKATA3 470 bp ds-DNA ROD 04-AUG-1990 DEFINITION Rat peroxisomal 3-ketoacyl-CoA thiolase (EC 2.3.1.16) A gene, exon 7. ACCESSION D90057 J05269 KEYWORDS 3-ketoacyl-CoA thiolase; fatty acid beta-oxidation; peroxisomal protein. SEGMENT 3 of 4 SOURCE Rat (Sprague-Dawley) liver DNA, clone PT147. ORGANISM Rattus norvegicus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 470) AUTHORS Hijikata,M., Wen,J.-K., Osumi,T. and Hashimoto,T. TITLE Rat Peroxisomal 3-ketoacyl-CoA Thiolase Gene: Occurrence of two closely related but differentially regulated genes JOURNAL J. Biol. Chem. 265, 4600-4606 (1990) STANDARD full staff_entry COMMENT These data kindly submitted in computer readable form by: Makoto Hijikata Virology Division, National Cancer Center Research Institute Tsukiji 5-1-1, Chuo-ku Tokyo 104 Japan Phone: (03)542-2511 Fax: (03)545-3567 FEATURES from to/span description pept + 80 + 160 3-ketoacyl-CoA thiolase, exon 7 pre-msg < 1 > 470 3-ketoacyl-CoA thiolase mRNA and introns IVS < 1 79 3-ketoacyl-CoA thiolase intron E IVS 161 > 470 3-ketoacyl-CoA thiolase intron F BASE COUNT 120 a 91 c 154 g 105 t ORIGIN 1 cgcaagccta gtccttcatc atgtccctgg ccctcaggca ggcagaccca cggcctctgc 61 tttcctgtgt gtgcactagg ataacctcgg agaatgtggc tgagcggttt ggcatctcac 121 ggcagaagca agatgccttc gcgctggcct ctcagcagaa gtgagtgcag agggatgggc 181 tgagggtggg gctggagagt ggatcgggta gagttttgcc aagaaacaaa ctgtaggttg 241 atgtccttat gcaggtgaag gcaggtacag gatagatcga agcctgtcgt tagatgagaa 301 ggaaggatga gcaggagaaa agttttaggg aggaggagac cggagagagg aaagagcagc 361 caggagaact tggaggcaga tgttaacatt tctctctgca catttacagg tcgttatgac 421 tgttcttaag ggatggatac agggctttgt atgtctcggt gaacaataac // LOCUS RATPKATA4 2264 bp ds-DNA ROD 04-AUG-1990 DEFINITION Rat peroxisomal 3-ketocayl-CoA thiolase (EC 2.3.1.16) A gene, exons 8-12. ACCESSION D90058 J05269 KEYWORDS 3-ketoacyl-CoA thiolase; fatty acid beta-oxidation; peroxisomal protein. SEGMENT 4 of 4 SOURCE Rat (Sparague-Dawley) liver DNA, clone PT147. ORGANISM Rattus norvegicus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 2264) AUTHORS Hijikata,M., Wen,J.-K., Osumi,T. and Hashimoto,T. TITLE Rat Peroxisomal 3-ketoacyl-CoA Thiolase Gene: Occurrence of two closely related but differentially regulated genes JOURNAL J. Biol. Chem. 265, 4600-4606 (1990) STANDARD full staff_entry COMMENT These data kindly submitted in computer readable form by: Makoto Hijikata Virology Division, National Cancer Center Research Institute Tsukiji 5-1-1, Chuo-ku Tokyo 104 Japan Phone: (03)542-2511 Fax: (03)545-3567 FEATURES from to/span description pept + 301 491 3-ketocayl-CoA thiolase, exon 8 633 812 3-ketocayl-CoA thiolase, exon 9 1089 1144 3-ketocayl-CoA thiolase, exon 10 1230 1375 3-ketocayl-CoA thiolase, exon 11 1696 1771 3-ketocayl-CoA thiolase, exon 12 pre-msg < 1 2030 3-ketocayl-CoA thiolase mRNA and introns IVS < 1 300 3-ketoacyl-CoA thiolase intron F IVS 492 632 3-ketocayl-CoA thiolase intron G IVS 813 1088 3-ketocayl-CoA thiolase intron H IVS 1145 1229 3-ketocayl-CoA thiolase intron I IVS 1376 1695 3-ketocayl-CoA thiolase intron J signal 2012 2017 polyadenylation signal BASE COUNT 446 a 615 c 703 g 500 t ORIGIN 1 gagtctgctc agtgaaaccc aggttctggt aagaaagaga ggctgggtgg ggtggtgcag 61 gaccctcagg ggagctcacc cacagagatg tctcccaaac actcaggctt tagcttcacg 121 tcctggagaa ccgcggtcaa cacggccgac agaagctcgt cgggggtggt gtcctgcagg 181 gcgaggtgca gcagcggtga gcccaactcc ttcatcccca cccgccgtct ctgcgctcac 241 cggacccccg agggcaggat cccaaacatg cctatgtctt actgcccacg gtgctggcag 301 ggcagcaagt gcccagagca aaggctgctt ccgtgctgag atcgtacctg tgacaaccac 361 tgtcctcgat gacaagggtg acaggaaaac catcaccgtg tctcaggatg agggtgtccg 421 ccccagcacc accatggagg gcctggccaa gctgaagcct gccttcaagg atggaggctc 481 taccacggct ggtgagcgtg gccaggagta gggccaatct cagtggcttg ttctactctg 541 agacctggag atgaccaggc cgcctgggaa ggtgatggtg gggggctggc tctcctcagg 601 cctgctgacg tctcctggtc caccttctcc aggaaactcc agtcaggtga gtgatggagc 661 agccgccgtc ctgctggccc ggaggtccaa ggctgaagaa ctgggcctcc ccatccttgg 721 cgtcctgagg tcctatgcag tggtcggggt ccctcctgac atcatgggca tcggacctgc 781 ctatgccatc cctgcggcct tgcagaaagc aggtgaggtg gctccttctc accctgtgct 841 tggatccttc attacctgga tctgggactg gggagtgctg gggtttgacc ttctgggtgc 901 tgtggatgga gcgggctagc accctacacc caagaccttc ctggcactac agctggctgg 961 gtctcactct cttgtcccta gaagacaaac atcagactcc tgggatctag gcttgggata 1021 gagtcctggg ggagtccaaa gtcctgccca gtggttgttg tcaattctgg cctatgcctc 1081 acccacaggg ctgactgtga atgacataga catctttgag atcaatgagg cctttgcaag 1141 tcaggtgagc ctgggtgttg tggtaggatt gactgagcca gcagttcaga gcagatggag 1201 actttagctc ctgcgcttgc tccttccagg ccctctactg tgtggagaag ctgggaattc 1261 ctgcagagaa ggtgaacccc ctggggggtg caatagccct gggccacccc ctgggctgca 1321 ccggagcaag gcaggtggtc acgctgctca atgagctgaa gcgccgaggc agacggtaag 1381 gctgctcctc gtggggtgca gtgggggtta ttagagctgg ggtgtctgat agctggggcg 1441 tggggaggtg agcacacacg caggggttct gaacttggga atggaggggt gagagcctgt 1501 cttccagaag ccttgcttca aaaacagcat aaccatgcat gccacctggg tgtgtgtctg 1561 tccggggctc ggttcccaca gatctgctgt gctggcttgg actctgccca tgaacccttg 1621 tgactgcagc ctgcctgatg ggagcatggg ttggggtttg gctgactcga gccaacactg 1681 gttctttttc cccagggctt atggcgtggt gtccatgtgc attgggactg ggatgggagc 1741 cgctgctgtc tttgaatacc ctgggaactg aggccctgac tgcaggcact acccagagag 1801 tcctatagta gtgtctggag agggatggta caggagccat cttcgtggga cactcagcag 1861 tggagggatt tgtcacagca cttcaattca gaagatgtag tcgatgttgg aacaggaggt 1921 ggaactgccc tgtcaagtac cccaagccat gctaaagtga gcatgggaca cccaggttgc 1981 aaagccatct gtacctctga cggatgggtg tagtaaatgt gtgctgtctc attttggtgg 2041 ctgtcatatc cttcgcaggc agctttcgcc tcctggattc tgtgtataaa tgattaagaa 2101 ccagggtgtc tccggtgaga tctagctgta tgcaggctct ggggacctgt catgaaagac 2161 ctcagctgac accattgccc tagaagaggc tcccccatcc caggagtcat ctagccccta 2221 gccacccccc cccccgctct ccagctgtgg gctatgcaga attc // LOCUS RATPKATB1 2234 bp ds-DNA ROD 04-AUG-1990 DEFINITION Rat peroxisomal 3-ketoacyl-CoA thiolase (EC 2.3.1.16) B gene, exons 1-3. ACCESSION D90059 J05269 KEYWORDS 3-ketoacyl-CoA thiolase; fatty acid beta-oxidation; peroxisomal protein. SEGMENT 1 of 5 SOURCE Rat (Sprague-Dawley) liver DNA, clone PTB1. ORGANISM Rattus norvegicus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 2234) AUTHORS Hijikata,M., Wen,J.-K., Osumi,T. and Hashimoto,T. TITLE Rat Peroxisomal 3-ketoacyl-CoA Thiolase Gene: Occurrence of two closely related but differentially regulated genes JOURNAL J. Biol. Chem. 265, 4600-4606 (1990) STANDARD full staff_entry COMMENT These data kindly submitted in computer readable form by: Makoto Hijikata Virology Division, National Cancer Center Research Institute Tsukiji 5-1-1, Chuo-ku Tokyo 104 Japan Phone: (03)542-2511 Fax: (03)545-3567 The sequence, 1251-1328 encodes leader peptide. FEATURES from to/span description pept 1251 1421 peroxisomal 3-ketoacyl-CoA thiolase, exon 1 1562 1655 peroxisomal 3-ketoacyl-CoA thiolase, exon 2 1985 + 2042 peroxisomal 3-ketoacyl-CoA thiolase, exon 3 pre-msg 1223 > 2234 peroxisomal 3-ketoacyl-CoA thiolase mRNA and introns IVS 1422 1561 peroxisomal 3-ketoacyl-CoA thiolase intron A IVS 1656 1984 peroxisomal 3-ketoacyl-CoA thiolase intron B signal 848 854 enhancer core signal 1037 1045 enhancer core signal 1114 1118 CAAT box signal 1163 1167 CAAT box signal 1192 1198 TATA box homolog site 1099 1108 NFK-B site BASE COUNT 460 a 614 c 647 g 513 t ORIGIN 1 agtgctctta accactgagc tatctctcca gccctccact ttattttctg aggcggtatc 61 tctaaacacg gtgatttgac tagggtaggg agccactctg cccctcggat cctctgcctt 121 ccccacctaa gcactaagga agcctgccta gcatcgatgt gtgcagtggg gatctgatct 181 ctggtctccc acttatgggg tcgagcactt taccctctga gccctctcgc actcccccct 241 cctttcgagt tttgggctct cctggggact gagaaaaccc tgggtgattg tcgtggtttg 301 gggctgacct gtcactgtga ttggggaacc cagccttact ctttaaggtt ggtaaaacat 361 tctagatcca gggagcccca attcatccac tcagcaggaa accattgacc agctaatgta 421 tcaggcacac tgcagacacc tctgctcagg ggggaggaga gtgggcgctg caggggtgtc 481 catagatgag agcatttcaa atgggcatcc agaggtgtca agacccgcag gacactctca 541 gagacctttg aaccacttca gtgctttaat aaaaccggtt cttccagcct gttagctgcc 601 aaagattagg cctgagaatc tcaattactc aagtatccag cttggggcca gggacccagg 661 agagacaaga ggcaaatgga acagaaaaca gagtcgggac ctctatctcc attcatagtg 721 taaagctgtt accatggctc cagaacctcc tgccttttgt gtcatagtgt gcgatggtca 781 tatttgaggg agggtatcct ggttcatgtt cagaaaccca gaagacttcg aggtcctacc 841 tacagtgtgg attgcaaaat gctagaaacg acaggaaagt tccagacact ggaactggga 901 ggggtttcaa aagtgcgcac aaactttttt ttaaattata tttgaaaaaa aaatcttatt 961 ttgtagccca gactggcctt gaacttgcgc agaaaccact tgcaccatca caggtgcaga 1021 cggatgagct taaggagtgg ttttgaagtt tatttgacca gcattgagaa gctgcgaggt 1081 ggtggtggga tggatggtgg gactttcctt ctgattggct gtagaatggg taggcgggac 1141 ttatgaaaaa tgcaaggctc ccattggctc tgccctgtgc cagccggtgt cttttattct 1201 ctggggtcag cacccagagc ccaggctaat cggtgggacg cacctccgcg atgcatcggc 1261 tgcaggtagt gctgggccac ctggccggcc ggtccgagtc gagctccgcg ctgcaagccg 1321 cgccctgctc tgcgggcttc ccgcaggcct cggcctccga cgtggtggtg gtgcacggac 1381 ggcgcacccc catcggccgc gccggccgcg gcggcttcaa ggtgaggccc ctcgggcggg 1441 cgaggggagc agaaggcggc catagtgggt gggatcctgg ccctcggggg tccggtgacc 1501 gcagagacgc cgggtgggaa tgaaggagct gggctcaccg ctgttgcgcc tcgccctgca 1561 ggacaccacc cccgacgagc ttctgtcggc cgtgttgacc gcggttctcc aggatgtgaa 1621 gctaaagcct gagtgtttgg gagacatctc tgtgggtgag ctcccctgag ggtcctgcac 1681 cggctcttcc ccgccctcct ccttcccggg atccgtctct cacctcacag agcagagtta 1741 ctgaagggca ggagtctgga cctctgaggt ctgaaccagg ctgcccccga tggccagtct 1801 caagcctaga accttgggcg ccccctcgtg gcacatctgc caactgagag gcgctggaaa 1861 gcgggaacct ctctcgaaag gctgtgattc ttggagagat ctggactgtc tcctgtctgg 1921 ggcatgtttt acttcaggcc tcgggacacc acctggcact gagtcaggac tcttctgtcc 1981 acaggcaatg tgcttcagcc cggggccgga gccgcaatgg cacgcatcgc ccagttcctg 2041 aggtaatctt tctggatccg tcttagttgt ccaccatgac tgccgaaggc ccttggcact 2101 gtgtgtgtca cggtggaaga tcaggttgaa tgtagagact gaggcaactg gagtccactg 2161 acctgacctc tgctcgataa ccttttccct acttagaggg agtgtgttgc tggcccatgg 2221 ccaaggcgcc cggg // LOCUS RATPKATB2 913 bp ds-DNA ROD 04-AUG-1990 DEFINITION Rat peroxisomal 3-ketoacyl-CoA thiolase (EC 2.3.1.16) B gene, exon 4 and 5. ACCESSION D90060 J05269 KEYWORDS 3-ketoacyl-CoA thiolase; fatty acid beta-oxidation; peroxisomal protein. SEGMENT 2 of 5 SOURCE Rat (Sprague-Dawley) liver DNA, clones PTB[1 and 2]. ORGANISM Rattus norvegicus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 913) AUTHORS Hijikata,M., Wen,J.-K., Osumi,T. and Hashimoto,T. TITLE Rat Peroxisomal 3-ketoacyl-CoA Thiolase Gene: Occurrence of two closely related but differentially regulated genes JOURNAL J. Biol. Chem. 265, 4600-4606 (1990) STANDARD full staff_entry COMMENT These data kindly submitted in computer readable form by: Makoto Hijikata Virology Division, National Cancer Center Research Institute Tsukiji 5-1-1, Chuo-ku Tokyo 104 Japan Phone: (03)542-2511 Fax: (03)545-3567 FEATURES from to/span description pept + 353 432 peroxisomal 3-ketoacyl-CoA thiolase, exon 4 673 + 715 peroxisomal 3-ketoacyl-CoA thiolase, exon 5 pre-msg < 1 > 913 peroxisomal 3-ketoacyl-CoA thiolase mRNA and introns IVS 433 672 peroxisomal 3-ketoacyl-CoA thiolase intron D BASE COUNT 173 a 234 c 275 g 231 t ORIGIN 1 tgtctcagat gagcacaaga aatgctgttc catctcagag tcttcaagat agaggaaaga 61 ggttgtctgg gcccacggca ggcctccctg gtccctcttg agaactatgg gacagatgag 121 aatgtgcttg ggggctcttg tgtgtagcta cggagacaga tctgcacagt agctgcttct 181 gctcttgtgg atctgctcca gtgggggttg cctgttgtcc tgagctatat gcacactgct 241 ggggctcttg tgggggccca gaagtctggt ggggcctgct ttccatgaga gagggtcatg 301 tccagggatg gctgcacgtc cagggctgac cgccatctcc cctctgtttc agtggcatcc 361 cagagaccgt gcctctgtca gcagtcaaca gacagtgttc atcgggactg caggcagtgg 421 ccaacattgc tggtaagtga tgattcctgg gtgcccatga gtggaccctg accttggtcc 481 ttgaccctca ggaccctagg caggaagagt ggagggacac ccattgccat accccaagcc 541 tttttgggtg ctcccggggt cttccgtgag gctccgttga atgcactccc gagaacaggg 601 agggaacggg ctgtgctgct ccagggcggg cccgttccgg ccaaggataa ttatggagtg 661 ctcttggtgc aggtggcatc agaaatgggt cttacgacat tggcatggcc tgtgggtaag 721 atggccttcc cgccgggctt attaaccaat gcaataatca aactaaaatt tttctctcct 781 tgactgtgat tctccttgcc cactgtgaca gctggctggg ctgggctccc agggatgcag 841 cccaggttgt ggctctgagt atcgtgacct cttttgttct cacaactttg cttgtcacat 901 tggggatctg cag // LOCUS RATPKATB3 145 bp ds-DNA ROD 04-AUG-1990 DEFINITION Rat peroxisomal 3-ketoacyl-CoA thiolase (EC 2.3.1.16) B gene, exon 6. ACCESSION D90061 J05269 KEYWORDS 3-ketoacyl-CoA thiolase; fatty acid beta-oxidation; peroxisomal protein. SEGMENT 3 of 5 SOURCE Rat (Sprague-Dawley) liver DNA, clones PTB[1 and 2]. ORGANISM Rattus norvegicus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 145) AUTHORS Hijikata,M., Wen,J.-k., Osumi,T. and Hashimoto,T. TITLE Rat Peroxisomal 3-ketoacyl-CoA Thiolase Gene: Occurrence of two closely related but differentially regulated genes JOURNAL J. Biol. Chem. 265, 4600-4606 (1990) STANDARD full staff_entry COMMENT These data kindly submitted in computer readable form by: Makoto Hijikata Virology Division, National Cancer Center Research Institute Tsukiji 5-1-1, Chuo-ku Tokyo 104 Japan Phone: (03)542-2511 Fax: (03)545-3567 FEATURES from to/span description pept + 22 + 120 peroxisomal 3-ketoacyl-CoA thiolase, exon 6 pre-msg < 1 > 145 peroxisomal 3-ketoacyl-CoA thiolase mRNA and introns BASE COUNT 32 a 38 c 41 g 34 t ORIGIN 1 gagctcactc tttctcctca gggtggagtc catgaccctg tctgagagag gcaaccctgg 61 gaatatttcc tcccgtcttc tggagaacga gaaggccaga gactgcctga ttcctatggg 121 gtgagtggcg acaggaatct ctcaa // LOCUS RATPKATB4 369 bp ds-DNA ROD 04-AUG-1990 DEFINITION Rat peroxisomal 3-ketoacyl-CoA thiolase (EC 2.3.1.16) B gene, exon 7. ACCESSION D90062 J05269 KEYWORDS 3-ketoacyl-CoA thiolase; fatty acid beta-oxidation; peroxisomal protein. SEGMENT 4 of 5 SOURCE Rat (Sprague-Dawley) liver DNA, clones PTB[1 and 2]. ORGANISM Rattus norvegicus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 369) AUTHORS Hijikata,M., Wen,J.-K., Osumi,T. and Hashimoto,T. TITLE Rat Peroxisomal 3-ketoacyl-CoA Thiolase Gene: Occurrence of two closely related but differentially regulated genes JOURNAL J. Biol. Chem. 265, 4600-4606 (1990) STANDARD full staff_entry COMMENT These data kindly submitted in computer readable form by: Makoto Hijikata Virology Division, National Cancer Center Research Institute Tsukiji 5-1-1, Chuo-ku Tokyo 104 Japan Phone: (03)542-2511 Fax: (03)545-3567 FEATURES from to/span description pept + 242 + 322 peroxisomal 3-ketoacyl-CoA thiolase, exon 7 pre-msg < 1 > 369 peroxisomal 3-ketoacyl-CoA thiolase mRNA and introns BASE COUNT 66 a 94 c 123 g 86 t ORIGIN 1 ggggggaaca gttccttctg taggccccac gttcaggctg ctcttcactc tttgaggtgt 61 gagtctttga gcatccgtgg acatgggcgc tcagtattta agatgagcca ggagtccctg 121 gggcgtggac tgaagctctg tctctgacgt ggacgcccag aggtgcaagt ctagtccttc 181 atcgtgtccc tggccctcag gccagcagac ccatggcctc tgctttcctg tgtcgcacta 241 ggataacctc ggagaatgtg gctgagcggt ttggcatctc acggcagaag caggatgcct 301 tcgcgctggc ctctcagcag aagtgagtgc agagggatgg gctgaggtgg gcagagatgc 361 tgagctggg // LOCUS RATPKATB5 2787 bp ds-DNA ROD 04-AUG-1990 DEFINITION Rat peroxisomal 3-ketoacyl-CoA thiolase (EC 2.3.1.16) B gene, exons 8-12. ACCESSION D90063 J05269 KEYWORDS 3-ketoacyl-coA thiolase; fatty acid beta-oxidation; peroxisomal protein. SEGMENT 5 of 5 SOURCE Rat (Sprague-Dawley) liver DNA, clone PTB2. ORGANISM Rattus norvegicus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 2787) AUTHORS Hijikata,M., Wen,J.-k., Osumi,T. and Hashimoto,T. TITLE Rat Peroxisomal 3-Ketoacyl-CoA Thiolase Gene: Occurrence of two closely related but differentially regulated genes JOURNAL J. Biol. Chem. 265, 4600-4606 (1990) STANDARD full staff_entry COMMENT These data kindly submitted in computer readable form by: Makoto Hijikata Virology Division, National Cancer Center Research Institute 5-1-1 Tsukiji, Chuo-ku Tokyo 104 Japan Phone: 03-542-2511 Fax: 03-545-3567 FEATURES from to/span description pept + 987 1177 peroxisomal 3-ketoacyl-CoA thiolase, exon 8 1326 1505 peroxisomal 3-ketoacyl-CoA thiolase, exon 9 1782 1837 peroxisomal 3-ketoacyl-CoA thiolase, exon 10 1923 2068 peroxisomal 3-ketoacyl-CoA thiolase, exon 11 2387 2462 peroxisomal 3-ketoacyl-CoA thiolase, exon 12 pre-msg < 1 2742 peroxisomal 3-ketoacyl-CoA thiolase mRNA and introns IVS 1178 1325 peroxisomal 3-ketoacyl-CoA thiolase intron H IVS 1506 1781 peroxisomal 3-ketoacyl-CoA thiolase intron I IVS 1838 1922 peroxisomal 3-ketoacyl-CoA thiolase intron J IVS 2069 2386 peroxisomal 3-ketoacyl-CoA thiolase intron K signal 2724 2729 polyadenylation signal BASE COUNT 572 a 722 c 864 g 629 t ORIGIN 1 ctcacggtgg ggcagagatg gcctgagctg gggctctcac tgtggtacta aggtgttggt 61 tagccccatg actttagctg ggaggtggaa tggttcagcc cggagtgtgg ggtgggtctt 121 cccagttctt aactattcac tggggtcggt ctctctagag acctcctcct cctctaccct 181 gcttcttagg acagaaggac ccagttctga acagacatgg tggtcactgg tgatccttag 241 gcctggagtg tggggtcagt ggtgctaaga ccggatcccc ctcctaccac aagatgaaga 301 cctgggtttg cagcggtggc ctgaggccat cattccagaa aaccagagcc ttaggcccgt 361 gggaaacaga agggcaggtt ggccactcaa atcatggatt gagaacagag ggaggatgga 421 gtgtaagcca ggtggcctca ccttttagag ttggccagca gtgtgcgttc cctggaagcc 481 gtgcgttcct gcccacaggc atgactctac ccacttgttc tgttcaccct gacaacagcg 541 tgggaatcag gaaataaatc caggaggtgc acctcgcttt taaatttaag aggggaagca 601 ggtgaagttt cgcttttagc agtcaaacat gggccttgaa gaatccggca cagacattcc 661 caggggtgca gcctcgttgt ggaagagacc tctgagcagg ctaagcagct agcggattaa 721 gaatcaggag acagaatagc agctctgaac tccctgctgt tggctgggga tgagtactgc 781 caaaccggga aaatcaggag ctgctgctga gtctgggaga gcttcgggtg cgcctttccc 841 tccctgggga atcttagctg tcactcagag ctccatcagg gccacaaagg gctgtggttg 901 cattgccttt ctgagttagc caagcccagg gactgggagg aaagactaca aaggggcccg 961 actctcgctg tccatggtgc tggcagggca gcaagcgccc agagcaaagg ctgcttccgt 1021 gctgagatcg tgcctgtgac aaccactgtc ctcgatgaca agggtgacag gaaaaccatc 1081 accgtgtctc aggatgaggg tgtccgcccc agcaccacca tggagggcct ggccaagctg 1141 aagcctgcct tcaaggatgg aggctctacc acggctggtg agcgtgggcc aggattaggg 1201 ccgatctcgg tggcttgttc tactctgaga cctggagctg accaggccgc ctgggaaggt 1261 gatagtcagg ggctggctct ctcaggccct gcctgacgtc tcctcgtccc acccttcctc 1321 cccaggaaac tccagtcagg tgagtgatgg agcagccgcc gtcctgctgg cccggaggtc 1381 caaggctgaa gaactgggcc tccccatcct tggcgtcctg aggtcctatg cagtggtcgg 1441 ggtccctcct gacatcatgg gcatcggacc tgcctatgcc atccctgcgg ccttgcagaa 1501 agcaggtgag gtggctcctt ctcaccctgt gcttggatcc ttcattacct ggatctggga 1561 ctggggagtg ctggggtttg accttctggg tgctgtggat ggagcgggct agcaccctac 1621 acccaagacc ttcctggcac tacagctggc tgggtctcac tctcttgtcc ctagaagaca 1681 aacatcagac tcctgggatc taggcttggg atagagtcct gggggagtcc aaagtcctgc 1741 ccagtgattg ttgtcaattc tggcctatgc ctcacccaca gggctgactg tgaatgacat 1801 agacatcttt gagatcaatg aggcctttgc aagtcaggtg agcctgggtg ttgtggtagg 1861 attgactggg ccagcagttc agagcagatg gagactttag ctcctgcgct tgctccttcc 1921 aggccctcta ctgtgtggag aaggtgggaa ttcctgcaga aaaggtgaac cccctggggg 1981 gtgcaatagc cctgggccac cccctgggct gcaccggagc aaggcaggtg gtcacgctgc 2041 tcaatgagct gaagcgccga ggcagacggt aaggctgctc ctcgtggggt gctgtggggg 2101 ttattagagc tggggtgtcc aatagctggg ggcgtgggga ggtgagcaca cacacagggg 2161 ttctgaactg ggaatggagg ggcgagacct gtcttccaga agccttgctt caaaaacagc 2221 atgactgtgc ctgccactgg gtgtgtgtct gtccggggct cggttcccac agatctgctg 2281 tgctggcttg gactctgccc atgaaccctt gtggctgcag ccctggctgg tgggagcatg 2341 ggttggggtt tggctactcg agccaacact ggttcttttt ccccagggct tatggcgtgg 2401 tgtccatgtg catcgggact gggatgggag ccgctgctgt ctttgaatac cctgggaact 2461 gaggccctga ctgcaggcac tacccagaga gtcctatagt agtgtctgga gagggatggt 2521 acagaagcca tcttcgtggg acactcagca gtggagggat ttgtcacagc actttaattt 2581 agaaaatgta atcgcaatac gggcacaagg gcatccaatc cactcctctc agttgaacca 2641 aagacagcgc acgtcacacg tgaccacagg acaacatgtt tctgagggac agtggcctca 2701 tgaatagtgg gcatgagggg catagtaaat gtatgttatc tcatcttgct tgcagtcatt 2761 tcccccaggc cggctcttgc tttgttg // LOCUS NPOCAP 2102 bp ds-DNA VRL 04-AUG-1990 DEFINITION Orgyia pseudotsugata polyhedrosis virus capsid protein (P39) gene, complete cds. ACCESSION M21042 M23429 KEYWORDS capsid protein; structural protein. SOURCE Orgyia pseudotsugata polyhedrosis virus DNA, clone cosmid #58. ORGANISM Orgyia pseudotsugata polyhedrosis virus Viridae; ds-DNA enveloped viruses; Baculoviridae; Baculovirus; Nuclear polyhedrosis viruses. REFERENCE 1 (bases 1 to 2102) AUTHORS Blissard,G.W., Quant-Russell,R.L., Rohrmann,G.F. and Beaudreau,G.S. TITLE Nucleotide sequence, transcriptional mapping, and temporal expression of the gene encoding p39, a major structural protein of the multicapsid nuclear polyhedrosis virus of Orgyia pseudotsugata JOURNAL Virology 168, 354-362 (1989) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly provided by G.W.Blissard, 03-NOV-1988. The p39 protein is found in both the PDV and BV virion phenotypes; P39 mRNAs are approximately 2600 nt in length, are expressed late (24-48 hpi), and have two major transcription initiation sites. At each major transcription start site, two transcripts are initiated within the ATAAG consensus. Transcripts initiate at positions 2 and 3 of the ATAAG consensus. FEATURES from to/span description pept 124 1179 capsid protein mRNA 27 > 1179 capsid protein mRNA (alt.) mRNA 28 > 1179 capsid protein mRNA (alt.) mRNA 75 > 1179 capsid protein mRNA (alt.) mRNA 76 > 1176 capsid protein mRNA (alt.) BASE COUNT 493 a 607 c 585 g 417 t ORIGIN Map position 57-58. 1 ccggcgcagc gcaccgtttc tttatataag aatctaaaat tagatacaag aggtcttggc 61 taaaattgat tgtataagag atttcttgct ctatcaaaac gtcggccccc atcgccgcag 121 aaaatggcgc tcgtgtcacc cggggtgtcg tcgcgacggt cgaccaacca ctgcatcttt 181 ggcgcgattg aaccgttcga ctcgtgcgtc acgtacaggt cgccatgctc gtcggacgcg 241 tcggtggacg acggctggtt catctgcgac taccacctga agttgcgctt caaaatgtcc 301 aaaatggtgc tgcccattta cgacgaagac gacaaccagt acaagcgcac catagcgcgg 361 catttggtgg ggcacaaaga aaggggcgta aagcgcattc tggtgcccac cagagccaac 421 tacatgaccg tgttcaattt gccgggcatg atgctggccg agcagctgat ttttcatttg 481 atctacgaca accgtttgga agtcaatcgc atttgcgcct cgttgaaaaa caacgaaaac 541 ttcattgaca acacgtacag cgttgtggag agcgtgtaca gcgcgacgcg caacattctg 601 tcgctcacag acccgcaagc gtactgctcg cgcgtggcca acgacgacgt gcgctttttc 661 gacgcgaacg tagtcgacaa caactaccag gcgggcaacg gcgacaccgt gtttaacaac 721 atgcccgggt ttttgcgcaa cctaatcagg cgcgccgtgg cgcccgaaac gctgcagatc 781 gacagcgaag acctgcgcct gcgaaactgc aacacctgtg tgattaacaa caccggcttg 841 gtagcaaccg tgacaaacac cgagctgtac aacccggtgc gcagctcgga catcattaaa 901 acgcggccca atcgcttgca gattagaaac gttttgaagt ttgagggcga cacacgcgcg 961 ttggagcgca ccttgggccg ctacgaggaa taccccatgt acgtgccctt gtttttgggc 1021 taccagctgg tcaacctgca aaacgacatt ttgcgcgcca acaacttttt acccgcgccg 1081 tttggcgtgc cgcaggccgt aaacaacctg gaggcccaag cgcccgcagc gcccgcaccc 1141 gcagcgcccg cgcccgcgcc cgcagcgccc gttgtttaat tccgttgcca tggacacggt 1201 gaggctgcag tgccacattt gctgctcggt gggcgaaatc aaaaattatt ttttgcagcc 1261 cgtcgacgca atcaccattt tgcccattgt ggagctgcac acgtgccggc accaattgtg 1321 cgtcatgtgc gtgcgcaaaa ttgcgcagcg cggccgagac aagcgcgtcg agtgccccat 1381 gtgccgccgc aaaaacgcgc actttaacgt gtacagcgtg aaccgtaatt cggtggatgt 1441 gttgcgctgc agcgtggccg acgtgcgcga gcacgggcgc tttggcggat tggcggacgc 1501 cgcgtcgctg gctcgtgggc tgtttgagcc cagcttgctt gaagccgagc ccgcgccgga 1561 caactcgttt ggcccaaacg agttacaact tgtgctaaaa cgtctaaagg cgcaaattga 1621 ggcgcaaaca agaactaatt atgatttgca actgcaagca actgctttgg agcgaacaat 1681 cgaagaagca aacgaccgct tgggtaaaag ccgcggcgac tacagcgacg cgtgcaagct 1741 gatggacgag cttcgcggcg acaggctgcg cgcggagcgc gcggtcaagg cgctcgccga 1801 cgcgcacgcc cagtgggcgg acaagaacgc caaaatgcgg cgcgagaacg acaggttgac 1861 aaatgaaaac attggtttga ttcgggacaa caatttattt aaacaaaaca ctgcacgcaa 1921 acgtaaaatc gctccgtaac cgcctttgtt tgcgcgtcaa agtccacctc gtagacgtgt 1981 cgctcgttta cgctgccgca gcggtcccag ttgtccgtcg tgcaccacgc cgcattgtgt 2041 cggcggcaca aatacagcgc ttcgttgaat tgcgccggcg gcagattgaa gcggccgagc 2101 tc // LOCUS PT2REGA 462 bp ds-DNA PHG 04-AUG-1990 DEFINITION Bacteriophage T2 translational repressor (regA) gene, complete cds. ACCESSION M27737 KEYWORDS RNA binding protein; translational repressor. SOURCE Bacteriophage T2, cDNA to viral mRNA. ORGANISM Bacteriophage T2 Viridae; ds-DNA nonenveloped viruses; Myoviridae. REFERENCE 1 (bases 1 to 462) AUTHORS Miller,E.S. and Jozwik,C.E. TITLE Sequence analysis of conserved regA and variable orf43.1 genes in T4-like bacteriophages JOURNAL J. Bacteriol. (1990) In press STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by E.S.Miller, 08-SEP-1989. The T2 RegA protein coding sequence corresponds to positions 9839-10207 of the T4 regA sequence contained in the entry with accession number M10160. The complete sequence reported herein would correspond to positions 9804-10266. An RNA secondary structure is detected by primer extension reactions on mRNA templates at positions 407-424. An RNA nuclease sensitive site is noted at position 431. FEATURES from to/span description pept 36 404 translational repressor protein (RegA) BASE COUNT 169 a 66 c 88 g 139 t ORIGIN 1 gaaacaactc aaaaaactag cattggaatg gtaaaatgat tgaaattact cttaaaaaac 61 ctgaagattt tctgaaagta aaagaaactt tgactcgtat gggaattgct aataataaag 121 ataaagttct gtatcagtcc tgtcatattc ttcagaaaaa aggactatac tatatcgttc 181 attttaaaga aatgcttcgt atggatggtc gccaagttga aatgacagaa gaagatgaag 241 ttcgtcgtga ttcgattgca tggctattag aagattgggg actaattgaa atcgttcctg 301 gtcaaagaac ttttatgaaa gatttaacta ataacttccg agttatttct tttaaacaaa 361 aacatgaatg gaaactcgtt cctaaatata cgattggtaa ttaagcaagg ggcttcggcc 421 ccttatttgg agtataatat atcaagagcc taataactcg gg // LOCUS PT4G55G43 13062 bp ds-DNA PHG 04-AUG-1990 DEFINITION Bacteriophage T4 genes 55, alpha-gt, 47, 46, 45, 44, 62, regA, and 43. ACCESSION M10160 J02510 X00769 KEYWORDS DNA polymerase; DNA polymerase accessory protein; alpha-glucosyltransferase; exonuclease; polymerase; translational repressor. SOURCE Bacteriophage T4 DNA. ORGANISM Bacteriophage T4 Viridae; ds-DNA nonenveloped viruses; Myoviridae. REFERENCE 1 (bases 7513 to 8269) AUTHORS Spicer,E.K., Noble,J.A., Nossal,N.G., Konigsberg,W.H. and Williams,K.R. TITLE Bacteriophage T4 gene 45: Sequences of the structural gene and its protein product JOURNAL J. Biol. Chem. 257, 8972-8979 (1982) STANDARD full staff_review REFERENCE 2 (bases 7513 to 10735) AUTHORS Spicer,E.K. and Konigsberg,W.H. TITLE ; JOURNAL (in) Mathews,C.K., Kutter,E.M., Mosig,G. and Berget,P.B. (Eds); Bacteriophage T4: 299-301; American Society for Microbiology, Washington, DC (1983) STANDARD full staff_review REFERENCE 3 (bases 9747 to 10265) AUTHORS Trojanowska,M., Miller,E.S., Karam,J., Stormo,G. and Gold,L. TITLE The bacteriophage T4 regA gene: Primary sequence of a translational repressor JOURNAL Nucleic Acids Res. 12, 5979-5993 (1984) STANDARD full staff_review REFERENCE 4 (bases 8257 to 9272) AUTHORS Spicer,E.K., Nossal,N.G. and Williams,K.R. TITLE Bacteriophage T4 gene 44 DNA polymerase accessory protein: Sequences of gene 44 and its protein product JOURNAL J. Biol. Chem. 259, 15425-15432 (1984) STANDARD full staff_review REFERENCE 5 (bases 1 to 7084) AUTHORS Gram,H. and Rueger,W. TITLE Genes 55, alpha-gt, 47, and 46 of bacteriophage T4: The genomic organization as deduced by sequence analysis JOURNAL EMBO J. 4, 257-264 (1985) STANDARD full staff_review REFERENCE 6 (bases 6985 to 7512) AUTHORS Hsu,T. and Karam,J. JOURNAL Unpublished (1985) Med U of South Carolina, Charleston, SC. STANDARD full staff_review REFERENCE 7 (bases 9781 to 10291) AUTHORS Adari,H.Y., Rose,K., Williams,K.R., Konigsberg,W.H., Lin,T.-C. and Spicer,E.K. TITLE Cloning, nucleotide sequence, and overexpression of the bacteriophage T4 regA gene JOURNAL Proc. Natl. Acad. Sci. U.S.A. 82, 1901-1905 (1985) STANDARD full staff_review REFERENCE 8 (bases 7513 to 12985; revises [1],[2],[4],[7]) AUTHORS Spicer,E.K. JOURNAL Unpublished (1986) Yale U School of Med, New Haven, CT. STANDARD full staff_review REFERENCE 9 (bases 7513 to 12985; revises [1],[2],[4],[7],[8]) AUTHORS Spicer,E.K. JOURNAL Unpublished (1987) Yale U School of Med, New Haven, CT. STANDARD full staff_review REFERENCE 10 (bases 10205 to 13062) AUTHORS Spicer,E.K., Rush,J., Fung,C., Reha-Krantz,L.J., Karam,J.D. and Konigsberg,W.H. TITLE Primary Structure of T4 DNA Polymerase: Evolutionary relatedness to eucaryotic and other procaryotic DNA polymerases JOURNAL J. Biol. Chem. 263, 7478-7486 (1988) STANDARD full staff_entry REFERENCE 11 (bases 9184 to 9903) AUTHORS Rush,J., Lin,T.-C., Quinones,M., Spicer,E.K., Douglas,I., Williams,K.R. and Konigsberg,W.H. TITLE The 44P subunit of the T4 DNA polymerase accessory protein complex catalyzes ATP hydrolase JOURNAL J. Biol. Chem. 264, 10943-10953 (1989) STANDARD full staff_entry REFERENCE 12 (bases 10212 to 10215) AUTHORS Andrake,M., Guild,N., Hsu,T., Gold,L., Tuerk,C. and Karam,J. TITLE DNA polymerase of bacteriophage T4 is an autogenous translational repressor JOURNAL Proc. Natl. Acad. Sci. U.S.A. 85, 7942-7946 (1988) STANDARD full staff_entry COMMENT Gene 45 protein is required for both T4 DNA replication and T4 late gene transcription [1]. Gene 44 protein is a DNA polymerase accessory protein required for T4 DNA replication [4]. Gene 55 protein associates with the modified host RNA polymerase and is necessary for late RNA transcription [5]. Gene 43 encodes a T4 DNA polymerase [8]. [5] notes several open reading frames not included in the FEATURES table. A draft entry and computer-readable copy of the sequence in [8],[9] were kindly provided by E.K.Spicer 13-MAR-1986 and 27-JUL-1987. FEATURES from to/span description pept 25 582 g55 protein pept 2356 3558 alpha-glucosyltransferase (alpha-gt) pept 3735 4754 exonuclease g47 pept 5198 6880 exonuclease g46 (gtg start codon) pept 7579 8262 DNA polymerase accessory protein g45 pept 8313 9272 DNA polymerase accessory protein g44 pept 9274 9837 DNA polymerase accessory protein g62 pept 9839 10207 translation repressor protein (regA) pept 10286 12982 DNA polymerase (g43) revision 8707 8707 g in [8]; c in [2],[4] revision 8835 8835 c in [8]; g in [2],[4] revision 8950 8950 g in [8]; c in [2],[4] revision 9135 9136 cg in [8]; gc in [2],[4] revision 9484 9484 c in [8]; g in [2] revision 9929 9929 g in [2],[3],[8]; aaag in [7] revision 10212 10215 gggg in [12]; gg in revision 10551 10552 ct in [8]; tc in [2] revision 10807 10807 a in [9]; c in [8] revision 12597 12599 ttg in [9]; tg in [8] revision 12605 12607 aaa in [9]; aa in [8] revision 12609 12610 tt in [9]; t in [8] revision 12612 12613 tt in [9]; t in [8] revision 12618 12619 aa in [9]; a in [8] revision 12644 12646 aaa in [9]; aa in [8] BASE COUNT 4471 a 1954 c 2512 g 4125 t ORIGIN About 45.5 kb on genomic map. 1 tatgaattga gctaagaggt gaaaatgtca gaaactaagc ctaaatataa ttacgtaaac 61 aataaagagc ttttacaagc tattattgat tggaaaacag aattagcaaa taataaagac 121 ccaaataaag tagttcgtca gaatgatact atcggattag ccattatgct tattgcagaa 181 ggcttatcta aacgtttcaa cttttcagga tacacccagt cttggaaaca agaaatgatt 241 gcagatggta tagaagcttc tattaagggg cttcacaatt ttgatgaaac gaaatataaa 301 aacccacatg cgtatataac tcaagcttgt tttaatgcat tcgtccaacg tattaaaaaa 361 gaacgtaagg aagttgcaaa gaaatatagt tacttcgttc acaatgtcta tgacagccgt 421 gacgacgata tggttgcgtt agtagatgaa acttttattc aagacatcta tgataaaatg 481 acgcattacg aagaatcaac ctatagaaca ccgggggctg aaaagaaaag tgttgtagat 541 gattctccta gtttggattt tttatatgag gctaacgatt aacctctccg gattcttgga 601 agaaatacct gaagttgaag ctattcccta tttacttaaa atgtatctca gggaagtttt 661 agctcttgac attgatattg atccagaaaa tccgtatgat accgctttta aatctaatgg 721 tgtagaatta aactatcggt atcatttaac agatgatgat ttttatttta tattagagaa 781 ataatatgac tgataaaccc gaaattaatg atgaagtgga aaagcttatt tcttctattg 841 aagaaaagaa ccgtcttgaa gcagaaagaa aagcaaataa gttattgtct aaaaacaaac 901 gcgaactgaa tcgtctttat aagcacgctc agatcgcagc tgaaaataat aattttgctc 961 aatacgaata tgctatcaag aaaagtcggg atattctaaa acagccatat aacgatgaac 1021 tcatcagtat tctttggaag actactagat cgcagattga ggatatgatt gatgcttaca 1081 cacgtaaaat tcaagcgtct taaaattaat gcaggattta ctgaatcttt gaatggtcat 1141 ctttgcgtga aaatttctga aaaagaatac cgtgatagtt caatcaaaga agttaatcct 1201 cctattgtaa gagcagaccc taatatgaaa gtgtgggttg attcttatca agtcaaaaaa 1261 tggtggcaac tgtgaaagat gaacatccga cttctgaaat agattataat aaaattcgtt 1321 cctctagaga ggaaatgatg agacgcttta aagaggctca tgataaagct aaagcagaag 1381 gaactataac atataagcgc ataaaattta aaagttctaa cgagcctctg tatggcgtat 1441 tatgtggata ggagcttcgg ctcctatatt gctttataaa ttttttggta aaataaacta 1501 aaatgaatta tttaaacata tataatttac tgatcgaaaa aggactaaac agaggcaatg 1561 ataaaagtct tttgacatac tatactgaaa cacatcatat aattcctaga tgtatgggtg 1621 gtacagatga taaaactaat ttagttcttt tgacaccaga agaacatttt accgcgcatt 1681 tattattatt taaaatttat agacttccta aattggcttt agctattcgt atgatgtgtt 1741 atagttctga cggaacaaga ctaaataata aaatgtatgg atggattaaa acagctgtat 1801 cttcttcaat atcggaaagc atgaaagagt tctggaaaga tgatgataat aaaaaataca 1861 tgtctaacgc tagacgaaat gctggaaaac caatttatca atatgattta aatggaaatt 1921 ttattcgcaa atatagatgt attacagatg ctgcagaaga tatgtcatat agctgttcaa 1981 cttcaattaa acaatgtgta gatggtaaaa ggaaaaccgc tggaggattt caatggaaat 2041 attattattc tgataacata ggaaagccct ctaggatgtc taatgcaact aaacaaaaaa 2101 tgtctaaatc taaacgtggg ataactcaga aaagaaatgt gccagtattt caatatgata 2161 ctactggaaa attattacga gtatttccta gaattaaaga cgcagctgta tccgttaaag 2221 gatgcatgtc taatattaaa aagtgcatat ctggaaaaag caaaattgct tatggatatg 2281 tttgggccta ttcgtagcgt tttatagaaa ataaaatatt atttacatga tttattaaat 2341 gaaaagagga aaactatgcg tatttgcatt tttatggctc gaggtcttga aggttgtggt 2401 gtaacaaaat tctcactcga gcaacgtgat tggtttatta aaaatggtca tgaagtaact 2461 ttggtttatg ctaaagataa atcatttact cgtacaagtt ctcatgacca caaatcattt 2521 tcaattccag ttattttagc taaagaatac gataaagcac ttaagctagt aaatgattgt 2581 gatattctaa ttattaattc tgttcctgct acttccgttc aagaagctac gattaataac 2641 tataaaaaac ttttagataa tattaaacct tctattcgtg ttgtagttta tcagcatgat 2701 cattctgttc tttctttgcg tcgaaatttg ggattagaag aaactgttcg tcgagctgat 2761 gttattttta gccattctga taatggtgat tttaataaag ttctgatgaa agaatggtat 2821 ccagaaactg tttctctgtt tgatgatatt gaagaagcac cgacagtata taattttcag 2881 cctcctatgg atattgtgaa ggttcggtca acttattgga aagatgtttc tgaaattaac 2941 atgaatatca accgttggat tggtcgtacg actacatgga aaggttttta ccagatgttt 3001 gattttcatg aaaaattctt aaaacctgct ggtaaatcca ctgtaatgga aggtctggaa 3061 cgttcccctg cttttattgc aattaaggaa aaaggtattc cgtatgaata ttacggtaat 3121 cgtgagattg ataaaatgaa tctcgcgccg aatcaaccgg cacaaatcct agattgttat 3181 attaatagtg aaatgcttga acgaatgagt aaatctggct ttggatatca gttgagtaaa 3241 cttaaccaga aatacttaca acgctcactc gaatatactc atctcgagct tggtgcatgt 3301 ggaacaattc cggtattttg gaaatctact ggcgaaaatt taaaattccg tgttgataat 3361 actcctttga cctcgcatga tagcggtatc atttggtttg atgaaaatga tatggaatca 3421 acatttgaac gtattaaaga actgtcatct gaccgagctc tttatgaccg tgagcgagaa 3481 aaagcatatg aatttttgta tcagcatcaa gattcaagct tctgctttaa agaacagttt 3541 gacattatta caaaataaag ggcttcggcc ctttagcttt atacggagtt tgatataatg 3601 atatttcttg gatatgtgat actttttctt gcattttatc tattcactag agcatgttgg 3661 attgggttct ttagcactcc agatggattt atttcaataa ttttattttg catttcaatg 3721 acggttcttg atatatgaaa attttaaatt taggtgattg gcatttaggc gttaaagctg 3781 atgatgagtg gattcgcggt attcaaattg atggaattaa gcaagcgata gaatattcta 3841 agaaaaatgg aattactacc tggattcaat acggtgatat ttttgatgtg cgaaaagcga 3901 tcacacataa aactatggag tttgcccgtg aaatagttca aacgcttgat gatgctggaa 3961 ttacattaca cactattgta ggaaatcacg atctccacta taaaaatgta atgcatccaa 4021 atgcttctac tgagcttttg gctaaatatc ctaatgttaa agtgtatgat aagcctacta 4081 cagtagattt tgacgggtgt ttgattgatt tgattccttg gatgtgcgaa gaaaatactg 4141 gtgaaattct tgagcatatc aaaacttcat ctgcttcttt ttgtgttggt cactgggaac 4201 tgaatggatt ttatttttat aaaggaatga aatctcatgg tcttgaaccc gatttcctta 4261 agacttataa agaagtgtgg tctggtcact tccatactat ttctgaggct gcaaacgtca 4321 gatatattgg gacaccatgg acactaaccg caggtgacga gaatgaccct cgtgggttct 4381 ggatgtttga tacagaaaca gaacgaacgg aatttattcc aaacaatact acctggcatc 4441 gtagaattca ttatccattt aaaggaaaaa ttgactataa agattttaca aatctatcag 4501 tacgtgttat agtaactgaa gtagacaaaa atctgacgaa gttcgaatct gaactagaaa 4561 aagttgtgca ttcattacga gttgtgtcaa agattgataa ctctgtcgag tcagatgaca 4621 gtgaagaagt tgaagttcaa tcgcttcaga cattaatgga agagtatatt aatgcaattc 4681 cagacatcac tgattctgac cgtgaagcac ttattcaata tgcaaatcag ctatatgtag 4741 aggcaacaca atgacttttg atgaatttaa aaatgttatg atgagtcagc attttaaatg 4801 tgaagtaaaa gatgatattg gtcataaaga aattattgaa tactggtttg aaccgttaga 4861 ggttgaagat aattgtatta aaaaggttac tgtctgcact gactgggctg tatcttttaa 4921 cttcaacatt ttagataatg acacacctaa atcattacga gatatggctg tatcttgtat 4981 taaggatgca tactgtgaag ttttcgacat ttgacattaa tgatgaattc atagcaaata 5041 ttgattatac cgaagaagat tctagatatg ttggaataat ttatatcaca tcaaaaacag 5101 cacaaggcgt tgtttgcatg gctgaatttg atgaatactt tttagattat gatgatatga 5161 tagaatggtc taaaagatac attaaaagga atcttttgtg aagaatttta aacttaatag 5221 agtaaaatat aaaaatataa tgtcagttgg tcaaaatggg attgacattc aattagataa 5281 ggttcaaaaa actcttatta ctggacgaaa tggcggtggt aagtctacta tgctagaagc 5341 catcacattt gggctttttg gtaagccatt tcgcgatgta aagaaaggtc aattaataaa 5401 cagcacaaat aagaaagaac ttttagttga actgtggatg gaatatgatg agaaaaagta 5461 ctatatcaaa agaggacaaa aaccaaacgt tttcgaaatc accgttaacg gtacacgtct 5521 taatgaatct gccagcagta aagatttcca agcagaattt gaacagctta tcggaatgtc 5581 atatgccagt ttcaagcaga ttgttgtcct tggtacagca gggtataccc ctttcatggg 5641 tttgtcgacc cctgcgcgaa gaaagcttgt ggaagacctg cttgaggtag gaacattagc 5701 tgaaatggat aagcttaata aagcactaat acgtgaatta aattcgcaga accaagtgct 5761 tgatgttaaa aaagatagta ttatccaaca aattaaaata tataatgata atgttgaacg 5821 ccagaaaaaa ttaacgggtg acaaccttac tcgtctacaa aatatgtatg atgatttggc 5881 aaaagaagct agaacgctaa aatcggaaat agaagaagct aacgaaagat tagttaatat 5941 tgttttggac gaagacccga ctgatgcatt taataaaatc ggtcaagaag catttttaat 6001 taaatcaaaa attgactcgt ataataaagt cattaatatg tatcacgaag gtggattatg 6061 tccaacctgt ttgtcacaat tgagttccgg tgataaagtt gtttctaaaa ttaaagataa 6121 agtttctgaa tgcacacatt cgtttgaaca gctttcaaca catcgtgata atttaaaagt 6181 tcttgttgat gaataccgag ataatattaa aacccagcag tcgttggcaa atgatattcg 6241 caataaaaag caatctctga tcgcagcagt agataaagct aaaaaagtta aagcggctat 6301 agaaaaagca tcttctgagt ttattgacca tgctgatgaa atagcactgc ttcaagaaga 6361 acttgataaa attgttaaga caaaaactaa tttagtaatg gaaaaatacc accgaggaat 6421 tttgactgat atgctcaaag attctggtat taaaggtgct attattaaaa agtacattcc 6481 attatttaat aagcagatta accattatct taaaataatg gaagcggatt atgtgtttac 6541 attagatgaa gaatttaatg agacaattaa atcccgtggt cgtgaagatt ttagttatgc 6601 ttcatttagt gaaggtgaaa aggcacgaat cgatattgct cttttattta cttggcgtga 6661 tattgcttct atcgtatctg gtgttagtat tagtacatta attcttgatg aagtgtttga 6721 tgggtcattt gatgccgaag gtattaaagg tgtagctaat attataaatt caatgaaaaa 6781 cactaacgtt tttataattt cgcataaaga ccatgacccg caagaatatg gtcagcatct 6841 tcaaatgaag aaagttggtc gatttactgt aatggtttaa tttataagag attatgcttt 6901 aatttattag agtataatct ctatggagga aaaacatgga atattcaact ggacagcatc 6961 tattaactat tcctgaaata aaacgatata ttctgagaaa taatttttct aatgaagagc 7021 atatagttac tgaatctatg cttaggaatg catttaaagc agaatataca aaaataatgt 7081 ccaatagaaa tgaagcttgg actgttactg attattatga ctaaaggtgt attatgacta 7141 aaattactgt gaattatact gttgatgtaa aagatattca gccaaaacac gtgcgttctg 7201 aatcaaatcc acaaaaccaa aataaaattc gtcgagcatg tgttttgtct ctttctgata 7261 acgcaatgga agttattcag aacaaaatta aatctgcacc tgctcgtcat gcgtattatg 7321 aagctatcga tcgtgaagta agtaataaat ggattgaact aatgcgcaaa catactacag 7381 aatccctaaa cgccggtgct aaatttatta tgacttcatg tggtgaaacg ccttgaagat 7441 gattattgcg gtaatgcaga tgaacgtcta attgttgctg ctcaaattgt tgcggaaaca 7501 attgcggctg attttaacgt taattgcttt attaaattag ttataaaatt aaatctcatt 7561 tgaattgaag gaaattacat gaaactgtct aaagatacta ctgctctgct taaaaatttc 7621 gctactatta actctggtat tatgcttaaa tccggtcaat ttattatgac tcgcgcagtt 7681 aatggtacaa cttatgcgga agcaaatatt tctgacgtta ttgattttga tgtagcaatt 7741 tacgatttga acggttttct cggtattctg tctttagtta atgatgcaga aatttcccag 7801 tcagaagatg gaaatattaa aattgctgat gcccgctcaa caattttttg gcgagcagcc 7861 gatccgagta cagatgttgc tcctaataaa ccaattccat tcccggtagc atctgctgtt 7921 actgaaatta aagctgaaga ccttcaacag ctgttgcgtc tatctcgtgg tctgcaaatt 7981 gatacaattg ctatcacggt aaaagaaggt aaaatcgtaa ttaacggttt taataaagta 8041 gaagattctg ctctgacccg tgttaaatat tctttgactc ttggtgatta tgatggtgaa 8101 aatacattta atttcattat caatatggca aatatgaaaa tgcaaccagg aaattataaa 8161 cttctgcttt gggcaaaagg taaacaaggt gctgctaaat ttgaaggtga acacgcgaat 8221 tatgtggtag ctcttgaagc tgattctacc cacgattttt aatagagggc ttcggccttt 8281 ataatttaca ctaaaacttg aatgaggaaa ttatgattac tgtaaatgaa aaagaacaca 8341 ttcttgaaca gaaatatcgt ccatctacta tcgatgaatg tattcttccc gcttttgata 8401 aagaaacctt taaatctatt acaagtaaag gtaagattcc acatattatt cttcattctc 8461 cttctccagg aacaggtaaa acaactgtag caaaagcatt atgtcatgat gtaaatgctg 8521 atatgatgtt tgtgaatggg tcagattgta aaattgattt cgttcgtggt cctttgacta 8581 attttgccag cgccgcttca tttgatggtc gtcaaaaagt aatcgttatt gatgaatttg 8641 accgttcagg gttagcagag tctcagcgac atcttcgttc ctttatggaa gcttatagtt 8701 caaactgtag tattattatt actgctaata atattgatgg tattattaaa ccgcttcagt 8761 cacgctgccg agttattaca ttcggtcaac caactgatga agataaaatt gaaatgatga 8821 agcagatgat tcgtcgattg actgaaatct gcaagcatga aggaattgct atagctgata 8881 tgaaagttgt agcagctttg gttaaaaaga attttcctga ttttcgtaaa actattggcg 8941 agctcgatag ttattcgtct aaaggtgttt tggatgctgg tattttatca ctggttacta 9001 acgatcgtgg tgctattgat gatgttcttg agtctctcaa aaataaagat gttaaacaac 9061 tcagagcttt agcaccaaaa tatgcggctg attattcgtg gttcgtgggt aaacttgccg 9121 aagaaatcta ttcacgtgta actccacaaa gtattattcg tatgtacgaa attgtcggcg 9181 aaaataatca gtatcatggt attgcagcta atactgaatt gcatttagct tatcttttca 9241 ttcaattagc atgcgaaatg cagtggaagt gatatgagct tatttaaaga tgatattcaa 9301 ttaaacgagc atcaagttgc ttggtattca aaagattgga cagctgtcca atccgctgct 9361 gattctttta aagaaaaagc agaaaatgaa ttttttgaaa taattggagc tattaataat 9421 aaaactaaat gctctattgc tcaaaaagat tattcaaaat tcatggttga aaatgcatta 9481 tcacaatttc cagagtgtat gccagctgta tatgctatga atttaattgg atcaggctta 9541 agtgatgaag ctcattttaa ttatctaatg gctgcagttc ctcgtggtaa aagatatggt 9601 aaatgggcaa aactggttga agattccacc gaagtattga ttattaagtt acttgctaag 9661 cggtatcaag ttaatacaaa tgatgcaatt aactataaat caattcttac taaaaatgga 9721 aaacttcctt tagtattaaa agaactaaaa ggtttagtca cggatgattt tttgaaagaa 9781 gtgactaaga acgtaaaaga acagaaacaa ctcaaaaaac tagcattgga atggtaaaat 9841 gattgaaatt actcttaaaa aacctgaaga ttttctgaaa gtaaaagaaa ctttgactcg 9901 tatgggaatt gctaataata aagataaagt tctatatcag tcctgtcata ttcttcagaa 9961 aaaaggacta tactatatcg ttcattttaa agaaatgctt cgtatggatg gtcgtcaagt 10021 tgaaatgaca gaagaagatg aagttcgtcg tgattcgatt gcatggctat tagaagattg 10081 gggactaatt gaaatcgttc ctggtcaaag aacttttatg aaagatttaa ctaataactt 10141 ccgagttatt tcttttaaac aaaaacatga atggaagctc gttcctaaat atacgattgg 10201 taattaagca aggggcttcg gccccttatt tggagtataa tatatcaaga gcctaataac 10261 tcgggctata aactaaggaa tatctatgaa agaattttat atctctattg aaacagtcgg 10321 aaataacatt gttgaacgtt atattgatga aaatggaaag gaacgtaccc gtgaagtaga 10381 atatcttcca actatgttta ggcattgtaa ggaagagtca aaatacaaag acatctatgg 10441 taaaaactgc gctcctcaaa aatttccatc aatgaaagat gctcgagatt ggatgaagcg 10501 aatggaagac atcggtctcg aagctctcgg tatgaacgat tttaaactcg cttatataag 10561 tgatacatat ggttcagaaa ttgtttatga ccgaaaattt gttcgtgtag ctaactgtga 10621 cattgaggtt actggtgata aatttcctga cccaatgaaa gcagaatatg aaattgatgc 10681 tatcactcat tacgattcaa ttgacgatcg tttttatgtt ttcgaccttt tgaattcaat 10741 gtacggttca gtatcaaaat gggatgcaaa gttagctgct aagcttgact gtgaaggtgg 10801 tgatgaagtt cctcaagaaa ttcttgaccg agtaatttat atgccattcg ataatgagcg 10861 tgatatgctc atggaatata tcaatctttg ggaacagaaa cgacctgcta tttttactgg 10921 ttggaatatt gaggggtttg acgttccgta tatcatgaat cgtgttaaaa tgattctggg 10981 tgaacgtagt atgaaacgtt tctctccaat cggtcgggta aaatctaaac taattcaaaa 11041 tatgtacggt agcaaagaaa tttattctat tgatggcgta tctattcttg attatttaga 11101 tttgtacaag aaattcgctt ttactaattt gccgtcattc tctttggaat cagttgctca 11161 acatgaaacc aaaaaaggta aattaccata cgacggtcct attaataaac ttcgtgagac 11221 taatcatcaa cgatacatta gttataacat cattgacgta gaatcagttc aagcaatcga 11281 taaaattcgt gggtttatcg atctagtttt aagtatgtct tattacgcta aaatgccttt 11341 ttctggtgta atgagtccta ttaaaacttg ggatgctatt atttttaact cattgaaagg 11401 tgaacataag gttattcctc aacaaggttc gcacgttaaa cagagttttc cgggtgcatt 11461 tgtgtttgaa cctaaaccaa ttgcacgtcg atacattatg agttttgact tgacgtctct 11521 gtatccgagc attattcgcc aggttaacat tagtcctgaa actattcgtg gtcagtttaa 11581 agttcatcca attcatgaat atatcgcagg aacagctcct aaaccgagtg atgaatattc 11641 ttgttctccg aatggatgga tgtatgataa acatcaagaa ggtatcattc caaaggaaat 11701 cgctaaagta tttttccagc gtaaagactg gaaaaagaaa atgttcgctg aagaaatgaa 11761 tgccgaagct attaaaaaga ttattatgaa aggcgcaggg tcttgttcaa ctaaaccaga 11821 agttgaacga tatgttaagt tcagtgatga tttcttaaat gaactatcga attacaccga 11881 atctgttctc aatagtctga ttgaagaatg tgaaaaagca gctacacttg ctaatacaaa 11941 tcagctgaac cgtaaaattc tcattaacag tctttatggt gctcttggta atattcattt 12001 ccgttactat gatttgcgaa atgctactgc tatcacaatt ttcggccaag tcggtattca 12061 gtggattgct cgtaaaatta atgaatatct gaataaagta tgcggaacta atgatgaaga 12121 tttcattgca gcaggtgata ctgattcggt atatgtttgc gtagataaag ttattgaaaa 12181 agttggtctt gaccgattca aagagcagaa cgatttggtt gaattcatga atcagttcgg 12241 taagaaaaag atggaaccta tgattgatgt tgcatatcgt gagttatgtg attatatgaa 12301 taaccgcgag catctgatgc atatggaccg tgaagctatt tcttgccctc cgcttggttc 12361 aaagggcgtt ggtggatttt ggaaagcgaa aaagcgttat gctctgaacg tttatgatat 12421 ggaagataag cgatttgctg aaccgcatct aaaaatcatg ggtatggaaa ctcagcagag 12481 ttcaacacca aaagcagtgc aagaagctct cgaagaaagt attcgtcgta ttcttcagga 12541 aggtgaagag tctgtccaag aatactacaa gaacttcgag aaagaatatc gtcaacttga 12601 ctataaagtt attgctgaag taaaaactgc gaacgatata gcgaaatatg atgataaagg 12661 ttggccagga tttaaatgcc cgttccatat tcgtggtgtg ctaacttatc gtcgagctgt 12721 tagcggttta ggtgtagctc caattttgga tggaaataaa gtaatggttc ttccattacg 12781 tgaaggaaat ccatttggtg acaagtgcat tgcttggcca tcgggtacag aacttccaaa 12841 agaaattcgt tctgatgtgc tatcttggat tgaccactca actttgttcc aaaaatcgtt 12901 tgttaaaccg cttgcgggta tgtgtgaatc ggctggcatg gactatgaag aaaaagcttc 12961 gttagacttc ctgtttggct gatagaataa atctaggacc tccaggtccc tttttcatac 13021 aagtaatata aatctatact tatgaaaaag agatgattct gg // LOCUS PT6REGA 462 bp ss-mRNA PHG 04-AUG-1990 DEFINITION Bacteriophage T6 translational repressor protein (regA), complete cds. ACCESSION M27738 KEYWORDS RNA binding protein; translational repressor. SOURCE Bacteriophage T6, cDNA to viral RNA. ORGANISM Bacteriophage T6 Viridae; ds-DNA nonenveloped viruses; Myoviridae. REFERENCE 1 (bases 1 to 462) AUTHORS Miller,E.S. and Jozwik,C.E. TITLE Sequence analysis of conserved regA and variable orf43.1 genes in T4-like bacteriophages JOURNAL J. Bacteriol. (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable copy of sequence [1] kindly submitted by E.S. Miller, 08-SEP-1989. The T2 RegA protein coding sequence corresponds to positions 9839-10207 of the T4 regA sequence contained in the entry with accession number M10160. The complete sequence reported herein would correspond to positions 9804-10266. An RNA secondary structure is detected by primer extension reactions on mRNA templates at positions 407-424. An RNA nuclease sensitive site is noted at position 431. Author address: E.S.Miller Dept. of Microbiology Box 7615 North Carolina State University Raleigh, NC 27695 E-mail: esm@cgc.ncsu.edu FEATURES from to/span description pept 36 404 translational repressor protein (regA) site 431 431 RNA endonuclease site BASE COUNT 170 a 64 c 89 g 139 t ORIGIN 1 gaaacaactc aaaaaactag cattggaatg gtaaaatgat tgaaattact cttaaaaaac 61 ctgaagattt tctgaaagta aaagaaactt tgactcgtat gggaattgct aataataaag 121 ataaagttct gtatcagtcc tgtcatattc ttcagaaaaa aggattatac tatatcgttc 181 attttaaaga aatgcttcgt atggatggtc gccaagttga aatgacagaa gaagatgaag 241 ttcgtcgtga ttcaatagca tggttgttag aagattgggg actgattgaa atcgtacctg 301 gtcaaagaac ttttatgaaa gatttaacta ataacttccg agttatttct tttaaacaaa 361 aacatgaatg gaaactcgtt cctaaatata cgattggtaa ttaagcaagg ggcttcggcc 421 ccttatttgg agtataatat atcaagagcc taataactcg gg // LOCUS RB18REGA 674 bp ds-DNA PHG 04-AUG-1990 DEFINITION Bacteriophage RB18 translational repressor protein (regA) and Orf43.1, complete cds. ACCESSION M33533 M27739 KEYWORDS RNA binding protein; translational repressor. SOURCE Bacteriophage RB18, cDNA to viral mRNA. ORGANISM Bacteriophage RB18 Viridae; ds-DNA nonenveloped viruses; Myoviridae. REFERENCE 1 (bases 1 to 674) AUTHORS Miller,E.S. and Jozwik,C.E. TITLE Sequence analysis of conserved regA and variable orf43.1 genes in T4-like bacteriophages JOURNAL J. Bacteriol. (1990) In press STANDARD full staff_review COMMENT Draft entry and computer readable sequence for [1] kindly submitted by E.S.Miller, 03-APR-1990. The T2 RegA protein coding sequence corresponds to positions 9839-10207 of the T4 regA sequence contained in the entry with accession number M10160. The complete sequence reported herein would correspond to positions 9815-10255. Author address: E.S.Miller Dept. of Microbiology Box 7615 North Carolina State University Raleigh, NC 27695 E-mail: esm@cgc.ncsu.edu FEATURES from to/span description pept 25 393 translational repressor protein (regA) pept 395 616 Orf43.1 peptide site 643 643 RegB endonuclease site BASE COUNT 251 a 106 c 115 g 202 t ORIGIN 1 aaaaactagc attggaatgg taaaatgatt gaaattactc ttaaaaaacc tgaagatttt 61 ctgaaagtaa aagaaacttt gactcgtatg ggaattgcta ataataaaga taaagttctg 121 tatcagtcct gtcatattct tcagaaaaaa ggactatact atatcgttca ttttaaagaa 181 atgcttcgta tggatggtcg ccaggttgaa atgacagaag aagatgaagt tcgtcgtgat 241 tcgattgcat ggctattaga agattgggga ctgattgaaa tcgtccctgg tcaaagaact 301 tttatgaaag atttaactaa taacttccga gttatttctt ttaaacaaaa acatgaatgg 361 aaactcgttc ctaaatatac gattggtaat taatatgact gctataactc cacaagaata 421 catggcgtct cttaaagaaa aatataatct ttctgcaaca gaaacacttt ttgatttacc 481 agaaaacctc caactaaaat ttcaggtaga atttcaaaaa ttaattcatc cagaacaaaa 541 acactttact gcagtcgtta agtcaattaa tgcagatgga atgacaattt ttcaccgaca 601 aatagtacta atttaagcaa ggggcttcgg ccccttattt ggagtataat atatcaagag 661 cctaataact cggg // LOCUS RB51REGA 700 bp ss-mRNA PHG 04-AUG-1990 DEFINITION Bacteriophage RB51 translational repressor protein (regA) and Orf43.1, complete cds. ACCESSION M33532 KEYWORDS RNA binding protein; translational repressor. SOURCE Bacteriophage RB51, cDNA to viral mRNA. ORGANISM Bacteriophage RB51 Viridae; ds-DNA nonenveloped viruses; Myoviridae. REFERENCE 1 (bases 1 to 700) AUTHORS Miller,E.S. and Jozwik,C.E. TITLE Sequence analysis of conserved regA and variable orf43.1 genes in T4-like bacteriophages JOURNAL J. Bacteriol. (1990) In press STANDARD full staff_review COMMENT Draft entry and computer readable sequence for [1] kindly submitted by E.S.Miller, 03-APR-1990. The T2 RegA protein coding sequence corresponds to positions 9839-10207 of the T4 regA sequence contained in the entry with accession number M10160. The complete sequence reported herein would correspond to positions 9815-10255. Author address: E.S.Miller Dept. of Microbiology Box 7615 North Carolina State University Raleigh, NC 27695 E-mail: esm@cgc.ncsu.edu FEATURES from to/span description pept 25 393 translational repressor protein (regA) pept 404 640 Orf43.1 peptide site 671 671 RegB endonuclease site BASE COUNT 255 a 114 c 125 g 206 t ORIGIN 1 aaaaactagc attggaatgg taaaatgatt gaaattactc ttaaaaaacc tgaagatttt 61 ctgaaagtaa aagaaacttt gactcgtatg ggaattgcta ataataaaga taaagttctg 121 tatcagtcct gtcatattct tcagaaaaaa ggactatact atatcgttca ttttaaagaa 181 atgcttcgta tggatggccg tcaagttgaa atgacagaag aagatgaagt tcgtcgtgat 241 tcgattgcat ggctgttaga agattgggga ctgattgaaa tcgttcctgg tcaaagaact 301 tttatgaaag atttaactaa taacttccga gttatttctt ttaaacaaaa acatgaatgg 361 aaactcgttc ctaaatatac gattggtaat taaggtaaat tatatgacac aatcagagat 421 tcagtttacc gaagcttttc taatattcga aaatctaaaa caaacatatg gtctgtcttc 481 tacccaactg gtttttgacc tcccggaaga taaacaaata gaattccaaa aagaattcca 541 acgattagtt tacccacgcc aacgcgcatt tcatgcgatt atcaaaacca caaataaaga 601 tggcaagtcc gttatatcgc gttgtacaat agaaatttaa agcaaagggg cttcggcccc 661 ttatttggag tataatatat caagagccta ataactcggg // LOCUS HUMQBPC1 235 bp ds-DNA PRI 04-AUG-1990 DEFINITION Human mitochondrial ubiquinone-binding protein gene, 5' flank with an LTR-like sequence. ACCESSION M35760 J05438 KEYWORDS ubiquinone-binding protein. SEGMENT 1 of 6 SOURCE Human DNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 235) AUTHORS Suzuki,H., Hosokawa,Y., Toda,H., Nishikimi,M. and Ozawa,T. TITLE Common protein binding sites in the 5'-flanking regions of human genes for cytochrome c1 and for ubiquinone-binding protein JOURNAL J. Biol. Chem. 265, 8159-8163 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly provided by H.Suzuki, 08-AUG-1989. FEATURES from to/span description site 26 33 Mt3 element site 103 110 Mt1 element site 114 122 Mt4 element BASE COUNT 52 a 59 c 56 g 68 t ORIGIN 1 tagccaaccg ggttcggatt gtgcagtcca actccagcca atggagtcag gacacagtag 61 cagggacaag ctgcgttaga cataaaaacc tctgctttcc tttgtttagg gtgctctcgt 121 ggcaaccaga cttaccagga gctctattct gcaaaagtaa atttgccttg ctgagagacc 181 ctttgtcctt tggctcagtg ttggttcttc tttgcagcac cgagcatttg tttcc // LOCUS HUMQBPC2 1567 bp ds-DNA PRI 04-AUG-1990 DEFINITION Human ubiquinone-binding protein (QP) gene, exon 1. ACCESSION M35761 M26705 J05438 KEYWORDS ubiquinone-binding protein. SEGMENT 2 of 6 SOURCE Human DNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 1567) AUTHORS Suzuki,H., Hosokawa,Y., Toda,H., Nishikimi,M. and Ozawa,T. TITLE Common protein-binding sites in the 5'-flanking regions of human genes for cytochrome c-1 and ubiquinone-binding protein JOURNAL J. Biol. Chem. 265, 8159-8163 (1990) STANDARD full staff_review REFERENCE 2 (bases 888 to 1232) AUTHORS Suzuki,H., Hosokawa,Y., Toda,H., Nishikimi,M. and Ozawa,T. TITLE Isolation of a single nuclear gene encoding human ubiquinone-binding protein in complex III of mitochondrial respiratory chain JOURNAL Biochem. Biophys. Res. Commun. 161, 371-378 (1989) STANDARD simple staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly provided by H.Suzuki, 08-AUG-1989. FEATURES from to/span description pept 1151 + 1169 ubiquinone-binding protein (QP), exon 1 IVS 1170 > 1567 ubiquinone-binding protein intron A site 425 432 Mt3 element site 525 533 Mt1 element site 559 567 Mt4 element BASE COUNT 442 a 339 c 339 g 447 t ORIGIN 1 cttcctgatt agtagtgtag cagtatagca tatgacccta gaattaagag ccccaaattc 61 gatctggatt ttcagattct ggaagaagaa cctggactct tctagtcatg tgctcttgga 121 caagttattc aacctttctg agcctcagct ttctcctctc tacaatgaga taattacagt 181 acttttagaa ttaaaagata gtttagggga gtgagagatc acctccaacc ctccctcccc 241 tttgctctta tgtgggctct ttggtgaatc taggaaccaa attgacacat ggcagattaa 301 caagagaaaa gcatactaat tttactagtt ttacatgtac atagggatcc gcacaaaaga 361 gtgaagtggc caatgtaaga tgcctttata ctttttacac gaactacaaa tcagaagaaa 421 ggggatctgg ctaggggcag taaatttctc gcggagtcat taggagaaaa gtgtgtgtgt 481 tggggtgtaa acttagtgga agataacggt tacttcggaa agtatattca ggtccaaagc 541 agcccccagt ttcgtctctg gtgataggtg ctattttctc cctctggtaa gggacgggta 601 ctccagagga atcctggagg aagctttatg cgcttgctgc atgcaggaag agataggtca 661 tcaaaccttt tctgaaacta caatttctcc aatattttca actcgaaata atcaatgtac 721 caattccata tatttttggg atggcacatc catcagtcct tcaatagaaa gtaagacgat 781 tgcccttaaa acatttagct tcctgcctgg cacaaagagc tgaattaata ctgtccacgc 841 tcccatagct attaacagca aataacggga acccagcttt aagccttggt ctcctaattc 901 ctgacctgtg ttcttcccat cacatcacaa ccaatacttc ttagaatgta gcacgcaaaa 961 tgccccgcct actccattgg cctcaccctg gccgagaggt gcttcagcag gtgctgaacg 1021 gcagtcgtca gaactgcgcc tgcgcaagcg gcctttctct gttcgcgatg tgacgtaacg 1081 cgcctgcgga ctgggcccag cttgtcctct atgacttacc cagaaggcaa cgcttctctt 1141 tctggtcaaa atggctggta agcaggccgg taagtaactg ggggtcttct tggccggcat 1201 cgtcgcgctt caccgtttat ttttgcctgc aggtcaacgg atcacaggaa ataattgtaa 1261 acttttcatt tcaagagagc tggagttgta gggaaaagac agtgagattt cagcatagac 1321 tggcaggtct aaatgatcct catcctctaa aatttttact cgatattaca gttatttcat 1381 ttggccacat tctttaaaaa tgtatgtatt ttggggagga ttttgatacc attagtcatt 1441 taaacattaa tatggattcg taaaagaagt ccatttacct cctcctaagc aactaagttc 1501 ccccaaggga atggatattg aaaaaggatg agaaataggt aattttaaaa ctgtttgggg 1561 aacaggg // LOCUS HUMQBPC3 236 bp ds-DNA PRI 04-AUG-1990 DEFINITION Human mitochondrial ubiquinone-binding protein gene, intron A with an LTR-like sequence. ACCESSION M33945 J05438 KEYWORDS ubiquinone-binding protein. SEGMENT 3 of 6 SOURCE Human DNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 236) AUTHORS Suzuki,H., Hosokawa,Y., Toda,H., Nishikimi,M. and Ozawa,T. TITLE Common protein binding sites in the 5'-flanking regions of human genes for cytochrome c1 and for ubiquinone-binding protein JOURNAL J. Biol. Chem. 265, 8159-8163 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly provided by H.Suzuki, 08-AUG-1989. FEATURES from to/span description site 26 33 Mt3 element site 103 110 Mt1 element site 114 122 Mt4 element BASE COUNT 57 a 55 c 54 g 70 t ORIGIN 1 aaccgggttc agcttagatt gtacggtcca actctagtca atggagtcag gatacagtag 61 cagggacaag ctgcgttaga cataaaaacc tctgctttcc tttgttcggg gtgctctcgt 121 ggcaaccgaa cctatgagga gcaaccttat gcaaaagtaa atttgccttg ctgagagacc 181 ctttgtcctt tgtctcagtg ctagttcttc tttgcagcac caagcatttg ttttca // LOCUS HUMQBPC4 114 bp ds-DNA PRI 04-AUG-1990 DEFINITION Human mitochondrial ubiquinone-binding protein (QP) gene, exon 2. ACCESSION M26706 KEYWORDS ubiquinone-binding protein cytochrome bc1 complex subunit QP. SEGMENT 4 of 6 SOURCE Human DNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 114) AUTHORS Suzuki,H., Hosokawa,Y., Toda,H., Nishikimi,M. and Ozawa,T. TITLE Isolation of a single nuclear gene encoding human ubiquinone-binding protein in complex III of mitochondrial respiratory chain JOURNAL Biochem. Biophys. Res. Commun. 161, 371-378 (1989) STANDARD simple staff_review FEATURES from to/span description pept + 23 + 94 ubiquinone-binding protein (QP), exon 2 IVS < 1 22 ubiquinone-binding protein intron A IVS 95 > 114 ubiquinone-binding protein intron B BASE COUNT 34 a 17 c 29 g 34 t ORIGIN 1 ttaacagtgt gacaattatc agtttcagca tcaggcaagt ggctggatgg tattcgaaaa 61 tggtattaca atgctgcagg attcaataaa ctgggtaagc agctgtgctg ttta // LOCUS HUMQBPC5 208 bp ds-DNA PRI 04-AUG-1990 DEFINITION Human mitochondrial ubiquinone-binding protein (QP) gene, exon 3. ACCESSION M26707 KEYWORDS QP gene; ubiquinone-binding protein. SEGMENT 5 of 6 SOURCE Human DNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 208) AUTHORS Suzuki,H., Hosokawa,Y., Toda,H., Nishikimi,M. and Ozawa,T. TITLE Isolation of a single nuclear gene encoding human ubiquinone-binding protein in complex III of mitochondrial respiratory chain JOURNAL Biochem. Biophys. Res. Commun. 161, 371-378 (1989) STANDARD simple staff_review FEATURES from to/span description pept + 21 + 188 ubiquinone-binding protein (QP), exon 3 IVS < 1 20 ubiquinone-binding protein intron B IVS 189 > 208 ubiquinone-binding protein intron C BASE COUNT 73 a 36 c 48 g 51 t ORIGIN 1 aactttgtca tctatcatag ggttaatgcg agatgataca atatacgagg atgaagatgt 61 aaaagaagcc ataagaagac ttcctgagaa cctttataat gacaggatgt ttcgcattaa 121 gagggcactg gacctgaact tgaagcatca gatcttgcct aaagagcagt ggaccaaata 181 tgaagagggt agcacagctt attactcc // LOCUS HUMQBPC6 233 bp ds-DNA PRI 04-AUG-1990 DEFINITION Human mitochondrial ubiquinone-binding protein (QP) gene, exon 4. ACCESSION M26730 KEYWORDS QP gene; ubiquinone-binding protein. SEGMENT 6 of 6 SOURCE Human DNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 233) AUTHORS Suzuki,H., Hosokawa,Y., Toda,H., Nishikimi,M. and Ozawa,T. TITLE Isolation of a single nuclear gene encoding human ubiquinone-binding protein in complex III of mitochondrial respiratory chain JOURNAL Biochem. Biophys. Res. Commun. 161, 371-378 (1989) STANDARD simple staff_review FEATURES from to/span description pept + 21 97 ubiquinone-binding protein (QP), exon 4 IVS < 1 20 ubiquinone-binding protein intron C BASE COUNT 82 a 23 c 49 g 79 t ORIGIN 1 atatttcatt cattctttag aaaatttcta ccttgaaccg tatctgaaag aggttattcg 61 ggaaagaaaa gaaagagaag aatgggcaaa gaagtaatca tgtagttgaa gtctgtggat 121 gcagctgtta tgaagatggt taaacttgaa acaaacaatt ttaagaatta tttggtctga 181 agatgtttta ctttaaataa atgtctattg taatggctgg agtttttgaa ttc // LOCUS HUMQBPCA 498 bp ss-mRNA PRI 04-AUG-1990 DEFINITION Human mitochondrial ubiquinone-binding protein mRNA, complete cds. ACCESSION M26700 KEYWORDS ubiquinone-binding protein. SOURCE Human fibroblast SV40 transformed cell line GM637, cDNA to mRNA, clones 2 and 6. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 498) AUTHORS Suzuki,H., Hosokawa,Y., Toda,H., Nishikimi,M. and Ozawa,T. TITLE Isolation of a single nuclear gene encoding human ubiquinone-binding protein in complex III of mitochondrial respiratory chain JOURNAL Biochem. Biophys. Res. Commun. 161, 371-378 (1989) STANDARD simple staff_review FEATURES from to/span description pept 53 388 ubiquinone-binding protein BASE COUNT 172 a 72 c 122 g 132 t ORIGIN 1 agcttgtcct ctatgactta cccagaaggc aacgcttctc tttctggtca aaatggctgg 61 taagcagggc gtttcagcat caggcaagtg gctggatggt attcgaaaat ggtattacaa 121 tgctgcagga ttcaataaac tggggttaat gcgagatgat acaatatacg aggatgaaga 181 tgtaaaagaa gccataagaa gacttcctga gaacctttat aatgacagga tgtttcgcat 241 taagagggca ctggacctga acttgaagca tcagatcttg cctaaagagc agtggaccaa 301 atatgaagag gaaaatttct accttgaacc gtatctgaaa gaggttattc gggaaagaaa 361 agaaagagaa gaatgggcaa agaagtaatc atgtagttga agtctgtgga tgcaggtgtt 421 atgaagatgg ttaaacttga aacaaacaat tttaagaatt atttggtctg aagatgtttt 481 actttaaata aatgtcta // LOCUS HUMQBPIC 75 bp ds-DNA PRI 04-AUG-1990 DEFINITION Human mitochondrial ubiquinone-binding protein (HQPI) gene, exon 2. ACCESSION M37387 KEYWORDS ubiquinone-binding protein. SOURCE Human leukocyte DNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 75) AUTHORS Suzuki,H., Hosokawa,Y., Toda,H., Nishikimi,M. and Ozawa,T. TITLE Isolation of a single nuclear gene encoding human ubiquinone-binding protein in complex III of mitochondrial respiratory chain JOURNAL Biochem. Biophys. Res. Commun. 161, 371-378 (1989) STANDARD simple staff_review FEATURES from to/span description pept / 3 / 75 ubiquinone-binding protein (HQP1), exon 2 (AA at 5) BASE COUNT 22 a 11 c 21 g 21 t ORIGIN 1 agtttcagca tcggcaagtg gctggatggt attcgaaaat ggtattacaa tgctgcagga 61 ttcaataaac tgggt // LOCUS HUMQBPIIC 172 bp ds-DNA PRI 04-AUG-1990 DEFINITION Human mitochondrial ubiquinone-binding protein (HQPII) gene, complete cds. ACCESSION M26701 KEYWORDS ubiquinone-binding protein. SOURCE Human leukocyte DNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 172) AUTHORS Suzuki,H., Hosokawa,Y., Toda,H., Nishikimi,M. and Ozawa,T. TITLE Isolation of a single nuclear gene encoding human ubiquinone-binding protein in complex III of mitochondrial respiratory chain JOURNAL Biochem. Biophys. Res. Commun. 161, 371-378 (1989) STANDARD simple staff_review COMMENT Intron A starts after nucleotide 98. FEATURES from to/span description pept 80 / 170 ubiquinone-binding protein (HQPII), exon 1 IVS 171 > 172 HQPII intron A site 98 99 site of 2.3 kb intron BASE COUNT 42 a 40 c 47 g 43 t ORIGIN 1 gactgaacgc gcctgcggac tgggcccagc ttgtcctcta tgacttaccc agaaggcaac 61 gcttctcttt ctggtcaaaa tggctggtaa gcaggccgtt tcagcatcag gcaagtggct 121 ggatggtatt cgaaaatggt attacaatgc tgcaggattc aataaactgg gt // LOCUS HUMQBPIIIC 100 bp ds-DNA PRI 04-AUG-1990 DEFINITION Human mitochondrial ubiquinone-binding protein (HQPIII) gene, complete cds. ACCESSION M26702 KEYWORDS ubiquinone-binding protein. SOURCE Human leukocyte DNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 100) AUTHORS Suzuki,H., Hosokawa,Y., Toda,H., Nishikimi,M. and Ozawa,T. TITLE Isolation of a single nuclear gene encoding human ubiquinone-binding protein in complex III of mitochondrial respiratory chain JOURNAL Biochem. Biophys. Res. Commun. 161, 371-378 (1989) STANDARD simple staff_review FEATURES from to/span description pept 80 / 98 ubiquinone-binding protein (HQPIII), exon 1 IVS 99 > 99 HQPIII intron BASE COUNT 20 a 30 c 27 g 23 t ORIGIN 1 gactgaacgc gcctgcggac tgggcccagc ttgtcctcta tgacttaccc acaaggcaac 61 gcttctcttt ctggtcaaaa tggctggtaa gcaggccggt // LOCUS HUMQBPIVC 465 bp ds-DNA PRI 04-AUG-1990 DEFINITION Human mitochondrial ubiquinone-binding protein (HQPIV) pseudogene, complete cds. ACCESSION M26703 KEYWORDS pseudogene; ubiquinone-binding protein. SOURCE Human leukocyte DNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 465) AUTHORS Suzuki,H., Hosokawa,Y., Toda,H., Nishikimi,M. and Ozawa,T. TITLE Isolation of a single nuclear gene encoding human ubiquinone-binding protein in complex III of mitochondrial respiratory chain JOURNAL Biochem. Biophys. Res. Commun. 161, 371-378 (1989) STANDARD simple staff_review FEATURES from to/span description pept.ps 80 392 pseudo-ubiquinone-binding protein (HQPIV), exon 1 BASE COUNT 180 a 66 c 110 g 109 t ORIGIN 1 aaaataaaat aaaataaatt agataaataa ataaataaaa agagagaggg agaaggcaac 61 atttcacgtt tgggtcaaga tggctggcag gctggctacc gcagcatcag acaggtagct 121 ggatggtatt tgaaaatggt atgacaatgc tgcaggattc gagaaactaa agtaaatgtg 181 agatgagaca atatatgaag atgaagatgt aaaggaagcc ataagacggc tttctaacgg 241 caggatgttt cgcattagaa cactggacct atgagacagc agatcttgcc ttaatagcac 301 ggaccaaaca tgaagaggat aaattctacc atgagccata tctgaaaccg cttatttggg 361 aaagagaaga atgagaatgg tcaaagaagt catcacatat ttgaagttcg tggatgtagc 421 tgcctcaagt atttttatga aaatggttaa acctgaaaca tataa // LOCUS HUMQBPVC 357 bp ds-DNA PRI 04-AUG-1990 DEFINITION Human mitochondrial ubiquinone-binding protein (HQPV) pseudogene, complete cds. ACCESSION M26704 KEYWORDS pseudogene; ubiquinone-binding protein. SOURCE Human leukocyte DNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 357) AUTHORS Suzuki,H., Hosokawa,Y., Toda,H., Nishikimi,M. and Ozawa,T. TITLE Isolation of a single nuclear gene encoding human ubiquinone-binding protein in complex III of mitochondrial respiratory chain JOURNAL Biochem. Biophys. Res. Commun. 161, 371-378 (1989) STANDARD simple staff_review FEATURES from to/span description pept.ps 71 169 pseudo-ubiquinone-binding protein (HQPV) BASE COUNT 159 a 51 c 64 g 83 t ORIGIN 1 catttttagt tcccaagagc cgggcataaa caagagacct gagggaatcc tcctctctta 61 aaaaacaaag cagtggacca actatgaaga ggaaaatttc tgccttgaac tgtatctgaa 121 agatgttatt tgggaaagaa aagaaagaga agaatgggca aagaagtaat cacatggttg 181 aagtctgtgg atgcaactgt tatgaagatg gttaaacttg aaataaacaa ttttaagaat 241 tatttggtct gcagatattt tactttaaat aaatgtctat tgtaatggta aaaaaaacaa 301 aaacaaaaac aaaacaaaac aaaaacaaaa acaaaacaaa accaaaaaaa cctcagg //