Path: utzoo!utgpu!news-server.csri.toronto.edu!bonnie.concordia.ca!uunet!bionet!will
From: GenBank-Updates@genbank.bio.net
Newsgroups: bionet.molbio.genbank.updates
Subject: Herpes simplex virus type 1 (HSV-1) dbp/pol genes
Message-ID:
Date: 28 May 91 18:42:11 GMT
Sender: will@genbank.bio.net
Distribution: bionet
Lines: 202
Approved: lear@genbank.bio.net
Checksum: 39202 14

LOCUS       HSVHSV1PO    8400 bp ds-DNA             VRL       28-MAY-1991
DEFINITION  Herpes simplex virus type 1 (HSV-1) dbp/pol genes
ACCESSION   X03181
KEYWORDS    DNA binding protein; DNA polymerase; origin of replication;
            overlapping genes; unidentified reading frame.
SOURCE      Herpes simplex virus DNA.
  ORGANISM  Herpes simplex virus
            Viridae; ds-DNA enveloped viruses; Herpesviridae;
            Alphaherpesvirinae.
REFERENCE   1  (bases 1 to 8400)
  AUTHORS   Quinn,J.P. and McGeoch,D.J.
  TITLE     DNA sequence of the region in the genome of herpes simplex virus
            type 1 containing the genes for DNA polymerase and the major DNA
            binding protein
  JOURNAL   Nucleic Acids Res. 13, 8143-8163 (1985)
  STANDARD  full automatic
COMMENT     SWISS-PROT; P04293; DPOL$HSV11.
            SWISS-PROT; P04296; DNBI$HSV11.
            From EMBL entry HEHSV1PO; dated 12-APR-1990.
FEATURES             Location/Qualifiers
     misc_feature    123..123
                     /note="put. mRNA 3' terminus (dbp gene)"
     misc_feature    complement(148..153)
                     /note="polyadenylation signal (dbp gene)"
     CDS             complement(205..3792)
                     /note="DNA binding protein (dbp gene) (aa 1-1196)"
                     /codon_start=3792
     misc_feature    4058..4058
                     /note="mRNA 5' terminus (dbp gene)"
     promoter        complement(4077..4082)
                     /note="pot. TATA box (dbp gene)"
     repeat_region   4124..4131
                     /note="direct repeat 1"
     misc_feature    4143..4286
                     /note="Ori L palindrome"
     repeat_unit     4143..4224
                     /note="inverted repeat A"
     repeat_unit     4225..4286
                     /note="inverted repeat A'"
     repeat_region   4272..4279
                     /note="direct repeat 1"
     promoter        4317..4321
                     /note="pot. TATA sequence (pol gene)"
     promoter        4345..4350
                     /note="pot. TATA sequence (pol gene)"
     CDS             4546..8250
                     /note="DNA polymerase (pol gene) (aa 1-1235)"
                     /codon_start=4546
     misc_feature    8092..8092
                     /note="put. 3' terminus (URF)"
     misc_feature    complement(8116..8121)
                     /note="put. polyadenylation signal (URF)"
     CDS             complement(8201..>8400)
                     /note="unidentified reading frame"
                     /codon_start=8400
     misc_feature    8287..8292
                     /note="polyadenylation signal (pol gene)"
     misc_feature    8316..8316
                     /note="put. 3' terminus (pol gene)"
BASE COUNT     1440 a   2806 c   2746 g   1408 t
ORIGIN
        1 cggcctcccg cgggcccgcc gaccggcaag ccgggagtcg gcggcgcgtg cgtttctgct
       61 ctattcccag acaccgcgga gaggaatcac ggcccgccca gagatataga cacggaacac
      121 aaacaagcac ggatgtcgta gcaataattt attttacaca cattccccgc cccgccctag
      181 gttcccccac cccccaaccc ctcacagcat atccaacgtc aggtctccct ttttgtcggg
      241 gggcccctcc ccaaacgggt catccccgtg gaacgcccgt ttgcggccgg caaatgccgg
      301 tcccggggcc cccgggccgc cgaacggcgt cgcgttgtcg tcctcgcagc caaaatcccc
      361 aaagttaaac acctccccgg cgttgccgag ttggctgact agggcctcgg cctcgtgcgc
      421 cacctccagg gccgcgtccg tcgaccactc gccgttgccg cgctccaggg cacgcgcggt
      481 cagctccatc atctcctcgc ttaggtactc gtcctccagg agcgccagcc agtcctcgat
      541 ctgcagctgc tgggtgcggg gccccaggct tttcacggtc gccacgaaca cgctactggc
      601 gacggccgcc ccgccctcgg agataatgcc ccggagctgc tcgcacagcg agctttcgtg
      661 cgctccgccg ccgaggcttg aggccgcgca cacaaacccg gcccggggac aggccaggac
      721 gaacttgcgg gtgcggtcaa aaataaggag cgggcacgcg tttttgccgc ccatcaggct
      781 ggcccagttc ccggcctgaa acacacggtc gttgccggcc atgccgtagt acttgctgat
      841 gctcaacccc aacacgacca tggggcgcgc cgccatgacg ggccgcagca ggttgcagct
      901 ggcgaacatg gacgtccacg cgcccggatg cgcgtccacg gcgtccatca gcgcgcgggc
      961 cccggcctcc aggcccgccc cgccctgcgc ggaccacgcg gccgcagcct gcacgctggg
     1021 gggacggcgg gaccccgcga tgatggccgt aagggtgttg atgaagtatg tcgagtgatc
     1081 gcagtaccgc agaatctggt ttgccatgta gtacatcgcc agctcgctca cgttgttggg
     1141 ggccaggtta ataaagttta tcgcgccgta gtccagggaa aactttttaa tgaacgcgat
     1201 ggtctcgatg tcctcgcgcg acaggagccg ggcgggaagc tggttgcgtt ggagggccgt
     1261 ccagaaccac tgcgggttcg gctggttgga ccccgggggc ttgccgttgg ggaagatggc
     1321 cgcgtggaac tgcttcagca gaaagcccag cggtccgagg aggatgtcca cgcgcttgtc
     1381 gggcttctgg taggcgctct ggaggctggc gacccgcgcc ttggcggcct cggacgcgtt
     1441 ggcgctcgcg cccgcgaaca acacgcggct cttgacgcgc agctccttgg gaaaccccag
     1501 ggtcacgcgg gcaacgtcgc cctcgaagct gctctcggcg ggggccgtct ggccggccgt
     1561 taggctgggg gcgcagatag ccgccccctc cgagagcgcg accgtcagcg ttttggccga
     1621 cagaaacccg ttgttaaaca tgtccatcac gcgccgccgc agcaccggtt ggaattgatt
     1681 gcgaaagttg cgcccctcga ccgactgccc ggcgaacacc ccgtggcact gactcagggc
     1741 caggtcctgg tacacggcga ggttggatcg ccgcccgaga agctgaagca gggggcacgg
     1801 cccgcacgcg tacgggtcca gcgtcaggga catggcgtgg ttggcctcgc ccagaccgtc
     1861 gcgaaacttg aagttcctcc cctccaccag gttgcgcatc agctgctcca cctcgcggtc
     1921 cacgacctgc ctgacgttgt tcaccaccgt atgcagggcc tcgcggttgg tgatgatggt
     1981 ctccagccgc cccatggccg tggggaccgc ctggtccacg tactgcaggg tctcgagttc
     2041 ggccatgacg cgctcggtcg ccgcgcggta cgtctcctgc atgatggtcc gggcggtctc
     2101 ggatccgtcc gcgcgcttca gggccgagaa ggcggcgtag tttcccagca cgtcgcagtc
     2161 gctgtacatg ctgttcatgg tcccgaagac gccgatggct ccgcgggcgg cgctggcgaa
     2221 ctttggatgg cgcgcccgga ggcgcatgag cgtcgtgtgt acgcaggcgt ggcgcgtgtc
     2281 gaaggtgcat aggttacagg gcacgtcggt ctggttggag tccgcgacgt atcgaaacac
     2341 gtccatctcc tggcgcccga cgatcacggc gccgtcgcag cgctccaggt aaaacagcat
     2401 cttggccagc agcgccgggg aaaacccaca cagcatggcc aggtgctcgc cggcaaattc
     2461 ctgggttccg ccgacgaggg gcgcggtggg ccgaccctcg aacccgggca ccacgtgtcc
     2521 ctcgcggtcc acctgtgggt tggccgccac gtgggtcccg ggcacgagga agaagcggta
     2581 aaaggagggt ttgctgtggt cctttgggtc cgccgggccg gcgtcgtcca cctcggtgag
     2641 atggagggcc gagttggtgc taaataccat ggcccccacg agtcccgcgg cgcgcgccag
     2701 gtacgccccg acggcgttgg cgcgggccgc ggccgtgtcc tggccctcga acagcggcca
     2761 cgcggagatg tcggtgggcg gctcgtcaaa gacggccatc gacacgatag actcgagggc
     2821 cagggcggcg tctccggcca tgacggaggc caggcgctgt tcgaacccgc ccgccgcgcc
     2881 cttgccgccg ccgtcgcgcc cgccccgcgg ggtcttaccc tggctggctt cgaaggccgt
     2941 gaacgtaatg tcggcgggga gggcggcgcc ctcgtggttt tcgtcaaacg ccaggtgggc
     3001 ggccgcgcgg gccacggcgt ccacgtttcg gcatcgcagt gccacggcgg cgggtcccac
     3061 gaccgcctcg aacaggaggc ggttgagggg gcggttaaaa aacggaagcg ggtaggtaaa
     3121 tttctccccg atcgatcggt ggttggcgtt gaacggctct gcgatgacac ggctaaaatc
     3181 cggcatgaac agctgcaacg ggtacacggg tatgcggtgc acctccgccc cgcctatggt
     3241 taccttgtcc gagcctccca ggtgcagaaa ggtgttgttg atgcacacgg cctccttgaa
     3301 gccctcggta acgaccagat acaggagggc gcggtccggg tccaggccga ggcgctcaca
     3361 cagcgcctcc cccgtcgtct cgtgtttgag gtcgccgggc cggggggtgt agtccgaaaa
     3421 gccaaaatgg cggcgtgccc gctcgcaaag tcgcgtcagg ttcggggcct gggtgctggg
     3481 gtccaggtgc cggccgccgt gaaagacgta cacggacgag ctgtagtgcg agggcgtcag
     3541 tttcagggac accgcggtac ccccgagccc cgtcgtgcga gaacccacga ccacggccac
     3601 gttggcctca aagccgctct ccacggtcag gcccacgacc aggggcgcca cggcgacgtc
     3661 ggaatcgccg ctgcgtgccg acagtaacgc cagaagctcg atgccttcgg acggacacgc
     3721 gcgagcgtac acgtatccca ggggcccggg ggggaccttg atggtggttg ccgtcttggg
     3781 ctttgtctcc atgtcctttt gtcaatcggt ccgcgaacgg aggtaatccc ggcacgacga
     3841 cggacgcccg acaaggtatg tctcccgagc gtcaaaatcc gggggggggc ggcgacggtc
     3901 aaggggaggg ttggagaccg gggttgggga atgaatccct ccccttcacc gacaaccccc
     3961 cgggtaacca cggggtcgcc gatgaacccc ggcggccggc aacgcggggt ccctgcgaga
     4021 ggcacagatg cttacggtca ggtgctccgg gtcgggtgcg tctggtatgc ggttggtata
     4081 tgtacacttt acctgggggc gtgccggtcc gccccagccc ctcccacgcc ccgcgcgtca
     4141 tcagccggtg ggcgtggccg ctattataaa aaaagtgaga acgcgaagcg ttcgcacttt
     4201 gtcctaataa tatatatatt attaggacaa agtgcgaacg cttcgcgttc tcactttttt
     4261 tataatagcg gccacgccca ccggctacgt cactctcctg tcggccgccg gcggtccata
     4321 agcccggccg gccgggccga cgcgaataaa ccgggccgcc ggccggggcg ccgcgcagca
     4381 gctcgccgcc cggatccgcc agacaaacaa ggcccttgca catgccggcc cgggcgagcc
     4441 tgggggtccg gtaattttgc catcccaccc aagcggcttt ttgggttttt ctcttccccc
     4501 ctccccacat tcccctcttt aggggttcgg gtgggaacaa ccgcgatgtt ttccggtggc
     4561 ggcggcccgc tgtcccccgg aggaaagtcg gcggccaggg cggcgtccgg gttttttgcg
     4621 cccgccggcc ctcgcggagc cagccgggga cccccgcctt gtttgaggca aaacttttac
     4681 aacccctacc tcgccccagt cgggacgcaa cagaagccga ccgggccaac ccagcgccat
     4741 acgtactata gcgaatgcga tgaatttcga ttcatcgccc cgcgggtgct ggacgaggat
     4801 gcccccccgg agaagcgcgc cggggtgcac gacggtcacc tcaagcgcgc ccccaaggtg
     4861 tactgcgggg gggacgagcg cgacgctcct ccgcgtcggg tcgggcggct tctggccgcg
     4921 gcgtcgcgcc tgtggggcgg cgtggaccac gccccggcgg ggttcaaccc caccgtcacc
     4981 gtctttcacg tgtacgacat cctggagaac gtggagcacg cgtacggcat gcgcgcggcc
     5041 cagttccacg cgcggtttat ggacgccatc acaccgacgg ggaccgtcat cacgctcctg
     5101 ggcctgactc cggaaggcca ccgggtggcc gttcacgttt acggcacgcg gcagtacttt
     5161 tacatgaaca aggaggaggt cgacaggcac ctacaatgcc gcgccccacg agatctctgc
     5221 gagcgcatgg ccgcggccct gcgcgagtcc ccgggcgcgt cgttccgcgg catctccgcg
     5281 gaccacttcg aggcggaggt ggtggagcgc accgacgtgt actactacga gacgcgcccc
     5341 gctctgtttt accgcgtcta cgtccgaagc gggcgtgtgc tgtcgtacct gtgcgacaac
     5401 ttctgcccgg ccatcaagaa gtacgagggt ggggtcgacg ccaccacccg gttcatcctg
     5461 gacaaccccg ggttcgtcac cttcggctgg taccgtctca aaccgggccg gaacaacacg
     5521 ctagcccagc cggcggcccc gatggccttc gggacatcca gcgacgtcga gtttaactgt
     5581 acggcggaca acctggccat cgaggggggc atgagcgacc taccggcata caagctcatg
     5641 tgcttcgata tcgaatgcaa ggcggggggg gaggacgagc tggcctttcc ggtggccggg
     5701 cacccggagg acctggtcat ccagatatcc tgtctgctct acgacctgtc caccaccgcc
     5761 ctggagcacg tcctcctgtt ttcgctcggt tcctgcgacc tccccgaatc ccacctgaac
     5821 gagctggcgg ccaggggcct gcccacgccc gtggttctgg aattcgacag cgaattcgag
     5881 atgctgttgg ccttcatgac ccttgtgaaa cagtacggcc ccgagttcgt gaccgggtac
     5941 aacatcatca acttcgactg gcccttcttg ctggccaagc tgacggacat ttacaaggtc
     6001 cccctggacg ggtacggccg catgaacggc cggggcgtgt ttcgcgtgtg ggacataggc
     6061 cagagccact tccagaagcg cagcaagata aaggtgaacg gcatggtgaa catcgacatg
     6121 tacgggatta taaccgacaa gatcaagctc tcgagctaca agctcaacgc cgtggccgaa
     6181 gccgtcctga aggacaagaa gaaggacctg agctatcgcg acatccccgc ctactacgcc
     6241 gccgggcccg cgcaacgcgg ggtgatcggc gagtactgca tacaggattc cctgctggtg
     6301 ggccagctgt tttttaagtt tttgccccat ctggagctct cggccgtcgc gcgcttggcg
     6361 ggtattaaca tcacccgcac catctacgac ggccagcaga tccgcgtctt tacgtgcctg
     6421 ctgcgcctgg ccgaccagaa gggctttatt ctgccggaca cccaggggcg atttaggggc
     6481 gccggggggg aggcgcccaa gcgtccggcc gcagcccggg aggacgagga gcggccagag
     6541 gaggaggggg aggacgagga cgaacgcgag gagggcgggg gcgagcggga gccggagggc
     6601 gcgcgggaga ccgccggcag gcacgtgggg taccaggggg ccagggtcct tgaccccact
     6661 tccgggtttc acgtgaaccc cgtggtggtg ttcgactttg ccagcctgta ccccagcatc
     6721 atccaggccc acaacctgtg cttcagcacg ctctccctga gggccgacgc agtggcgcac
     6781 ctggaggcgg gcaaggacta cctggagatc gaggtggggg ggcgacggct gttcttcgtc
     6841 aaggctcacg tgcgagagag cctcctcagc atcctcctgc gggactggct cgccatgcga
     6901 aagcagatcc gctcgcggat tccccagagc agccccgagg aggccgtgct cctggacaag
     6961 cagcaggccg ccatcaaggt cgtgtgtaac tcggtgtacg ggttcacggg agtgcagcac
     7021 ggactcctgc cgtgcctgca cgttgccgcg acggtgacga ccatcggccg cgagatgctg
     7081 ctcgcgaccc gcgagtacgt ccacgcgcgc tgggcggcct tcgaacagct cctggccgat
     7141 ttcccggagg cggccgacat gcgcgccccc gggccctatt ccatgcgcat catctacggg
     7201 gacacggact ccatctttgt gctgtgccgc ggcctcacgg ccgccgggct gacggccgtg
     7261 ggcgacaaga tggcgagcca catctcgcgc gcgctgtttc tgccccccat caaactcgag
     7321 tgcgaaaaga cgttcaccaa gctgctgctg atcgccaaga aaaagtacat cggcgtcatc
     7381 tacgggggta agatgctcat caagggcgtg gatctggtgc gcaaaaacaa ctgcgcgttt
     7441 atcaaccgca cctccagggc cctggtcgac ctgctgtttt acgacgatac cgtctccgga
     7501 gcggccgccg cgttagccga gcgccccgcg gaggagtggc tggcgcgacc cctgcccgag
     7561 ggactgcagg cgttcggggc cgtcctcgta gacgcccatc ggcgcatcac cgacccggag
     7621 agggacatcc aggactttgt cctcaccgcc gaactgagca gacacccgcg cgcgtacacc
     7681 aacaagcgcc tggcccacct gacggtgtat tacaagctca tggcccgccg cgcgcaggtc
     7741 ccgtccatca aggaccggat cccgtacgtg atcgtggccc agacccgcga ggtagaggag
     7801 acggtcgcgc ggctggccgc cctccgcgag ctagacgccg ccgccccagg ggacgagccc
     7861 gccccccccg cggccctgcc ctccccggcc aagcgccccc gggagacgcc gtcgcctgcc
     7921 gaccccccgg gaggcgcgtc caagccccgc aagctgctgg tgtccgagct ggccgaggat
     7981 cccgcatacg ccattgccca cggcgtcgcc ctgaacacgg actattactt ctcccacctg
     8041 ttgggggcgg cgtgcgtgac attcaaggcc ctgtttggga ataacgccaa gatcaccgag
     8101 agtctgttaa aaaggtttat tcccgaagtg tggcaccccc cggacgacgt ggccgcgcgg
     8161 ctccggaccg cagggttcgg ggcggtgggt gccggcgcta cggcggagga aactcgtcga
     8221 atgttgcata gagcctttga tactctagca tgagcccccc gtcgaagctg atgtccctca
     8281 ttttacaata aatgtctgcg gccgacacgg tcggaatctc cgcgtccgtg ggtttctctg
     8341 cgttgcgccg gaccacgagc acaaacgtgc tctgccacac gtgggcgacg aaccggtacc
//