Path: utzoo!utgpu!watserv1!watmath!uunet!cs.utexas.edu!usc!apple!bionet!daemon From: GenBank-Updates@genbank.bio.net Newsgroups: bionet.molbio.genbank.updates Subject: Database Update Message-ID: <9004091910.AA09315@haploid.lanl.gov.LANL.GOV> Date: 9 Apr 90 19:10:53 GMT Sender: daemon@genbank.BIO.NET Distribution: bionet Lines: 1361 Approved: lear@genbank.bio.net Checksum: 40449 91 LOCUS BLCNNS 961 bp ss-RNA VRL 15-MAR-1990 DEFINITION Bunyamwera virus small RNA segment, N and NSs protein. ACCESSION D00353 KEYWORDS N protein; NSs protein; SRNA; nonstructural protein; nucleocapsid protein; small RNA. SOURCE Bunyamwera virus, cDNA to viral RNA, clones pBU[NS14,N3/59,N308, N309,N93]. ORGANISM Bunyamwera virus Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses; Bunyaviridae; Bunyavirus. REFERENCE 1 (bases 1 to 961) AUTHORS Elliott,R.M. TITLE Nucleotide sequence analysis of the small(S) RNA segment of Bunyamwera virus, the prototype of the family Bunyaviridae JOURNAL J. Gen. Virol. 70, 1281-1285 (1989) STANDARD full staff_entry COMMENT Submitted in computer readable form by R.M. Elliott on 18-Jan-1989. The virus contains the negative sense strand; the positive strand is shown below. FEATURES from to/span description pept 86 787 N protein pept 105 410 NSs protein BASE COUNT 298 a 187 c 215 g 261 t ORIGIN 1 agtagtgtac tccacactac aaacttgcta ttgttgaaaa tcgctgtgct attaaatcca 61 acagaaggtc attaaaggct ctttaatgat tgagttggaa tttcatgatg tcgctgctaa 121 caccagcagt acttttgacc cagaggtcgc atacgctaac tttaagcgtg tccacaccac 181 tgggcttagt tatgaccaca tacgaatctt ctacattaaa ggacgcgaga ttaaaactag 241 tctcgcaaaa agaagtgaat gggaagttac acttaacctt gggggctgga agattactgt 301 atataatacg aattttcctg gcaaccggaa caacccagtt cctgacgatg gtcttaccct 361 ccaccgcctc agtggattcc ttgccaggta cctacttgag aagatgctga aagtcagtga 421 accagagaaa ttgattatta aatcaaaaat aatcaaccct ttggctgaaa agaatgggat 481 cacttggaat gatggagagg aagtttatct ctctttcttc ccaggatcag agatgttctt 541 aggaactttc agattctacc ccttagcaat cgggatctac aaagttcagc gcaaggaaat 601 ggaaccaaaa taccttgaga aaacaatgcg gcagaggtac atgggactag aagcagcaac 661 ttggactgtt agtaaattga cagaagttca gtctgcactg acagttgtct ctagcttagg 721 ttggaagaaa accaatgtta gtgcagctgc cagggacttc cttgctaaat tcggaatcaa 781 catgtaagca gggatgcatt tttaatcggg ctaaagtcat ctgttttaat ttggctaaaa 841 gggttgtttc aacccacaaa ataacagctg cttgggtggg tggttgggga cagaaagaca 901 gcgggctaaa tcaacattat attgttaatg gtattttaag ttttaggtgg agcacactac 961 t // LOCUS DEN2NGC 2357 bp ss-RNA VRL 15-MAR-1990 DEFINITION Dengue virus type 2 (New Guinea C strain), cDNA to genomic RNA. ACCESSION D00346 KEYWORDS E protein; M protein; prM protein; structural protein. SOURCE Dengue virus type 2 (New Guinea C strain), cDNA to genomic RNA. ORGANISM Dengue virus type 2 Viridae; ss-RNA enveloped viruses; Positive strand RNA virus; Flaviviridae; Flavivirus (arbovirus group B). REFERENCE 1 (bases 1 to 2357) AUTHORS Gruenberg,A., Woo,W.S., Biedrzycka,A. and Wright,P.J. TITLE Partial Nucleotide Sequence and Deduced Amino Acid Sequence of the Structural Proteins of Dengue Virus Type 2, New Guinea C and PUO-218 Strains; JOURNAL J. Gen. Virol. 69, 1391-1398 (1988) STANDARD simple staff_review COMMENT Nucleotide 1 in the NGC sequence corresponds to nucleotide 77 counting from the 5'end of the DEN-2(JAM) sequence. FEATURES from to/span description pept 21 > 2357 viral polyprotein matp 21 362 C protein matp 363 860 prM protein matp 636 860 M protein matp 861 2345 E protein matp 2346 2357 NS 1 protein (amino end) BASE COUNT 782 a 471 c 595 g 509 t ORIGIN 20bp upstream from the C protein amino terminal end 1 aattagagag cagatctctg atgaataacc aacgaaaaaa ggcgagaaat acgcctttca 61 atatgctgaa acgcgagaga aaccgcgtgt cgactgtaca acagctgaca aagagattct 121 cacttggaat gctgcaggga cgaggaccat taaaactgtt catggccctg gtggcgttcc 181 ttcgtttcct aacaatccca ccaacagcag ggatactgaa gagatgggga acaattaaaa 241 aatcaaaagc cattaatgtt ttgagagggt tcaggaaaga gattggaagg atgctgaaca 301 tcttgaacag gagacgcaga actgcaggca tgatcattat gctgattcca acagtgatgg 361 cgttccattt aaccacacgt aacggagaac cacacatgat cgtcagtaga caagagaaag 421 ggaaaagtct tctgtttaaa acagaggatg gtgtgaacat gtgtaccctc atggccatgg 481 accttggtga attgtgtgaa gatacaatca cgtacaagtg tccttttctc aggcagaatg 541 aaccagaaga catagattgt tggtgcaact ctacgtccac atgggtaact tatgggacgt 601 gtaccaccac aggagaacac agaagagaaa aaagatcagt ggcactcgtt ccacatgtgg 661 gaatgggact ggagacacga actgaaacat ggatgtcatc agaaggggcc tggaaacatg 721 cccagagaat tgaaacttgg atcttgagac atccaggctt taccataatg gcagcaatcc 781 tggcatacac cataggaacg acacatttcc aaagagccct gattttcatc ttactgacag 841 ctgtcgctcc ttcaatgaca atgcgttgca taggaatatc aaatagagac tttgtagaag 901 gggtttcagg aggaagctgg gttgacatag tcttagaaca tggaagctgt gtgacgacga 961 tggcaaaaaa caaaccaaca ttggattttg aactgataaa aacagaagcc aaacaacctg 1021 ccactctaag gaagtactgt atagaggcaa agctgaccaa cacaacaaca gattctcgct 1081 gcccaacaca aggagaaccc agcctaaatg aagagcagga caaaaggttc gtctgcaaac 1141 actccatggt ggacagagga tggggaaatg gatgtggatt atttggaaaa ggaggcattg 1201 tgacctgtgc tatgttcaca tgcaaaaaga acatgaaagg aaaagtcgtg caaccagaaa 1261 acttggaata caccattgtg ataacacctc actcagggga agagcatgca gtcggaaatg 1321 acacaggaaa acatggcaag gaaatcaaaa taacaccaca gagttccatc acagaagcag 1381 agttgacagg ctatggcact gtcacgatgg agtgctctcc gagaacgggc ctcgacttca 1441 atgagatggt gttgctgcaa atggaaaata aagcttggct ggtgcacagg caatggttcc 1501 tagacctgcc gttgccatgg ctgcccggag cggacacaca aggatcaaat tggatacaga 1561 aagagacatt ggtgactttc aaaaatcccc atgcgaagaa acaggatgtt gttgttttgg 1621 gatcccaaga aggggccatg cacacagcac tcacaggggc cacagaaatc cagatgtcat 1681 caggaaactt actgttcaca ggacatctca agtgcaggct gaggatggac aaactacagc 1741 tcaaaggaat gtcatactct atgtgcacag gaaagtttaa agttgtgaag gaaatagcag 1801 aaacacaaca tggaacaata gttatcagag tacaatatga aggggacggt tctccatgta 1861 agatcccttt tgagataatg gatttggaaa aaagacatgt tttaggtcgc ctgattacag 1921 tcaacccaat cgtaacagaa aaagatagcc cagtcaacat agaagcagaa cctccattcg 1981 gagacagcta catcatcata ggagtagagc cgggacaatt gaagctcaac tggtttaaga 2041 aaggaagttc tatcggccaa atgattgaga caacaatgag gggagcgaag agaatggcca 2101 ttttaggtga cacagcttgg gattttggat ccctgggagg agtgtttaca tctataggaa 2161 aggctctcca ccaagttttc ggagcaatct atggggctgc cttcagtggg gtctcatgga 2221 ctatgaaaat cctcatagga gtcattatca catggatagg aatgaattca cgcagcacct 2281 cactttctgt gtcactagta ttggtgggag tcgtgacgct gtatttggga gttatggtgc 2341 aggccgatag tggttgc // LOCUS HS1IRLULR 13052 bp ds-DNA VRL 15-MAR-1990 DEFINITION Herpes simplex virus type 1 (HSV-1) genome, rightmost part of the long unique region (UL) and all of the internal long repeat region (IRL). ACCESSION D00374 KEYWORDS IE110; IE63; UL54; UL55; UL56; immediate-early protein; internal long repeat region; long repeat region; long unique region; transcriptional activator; transcriptional modulating protein. SOURCE HSV-1 (strain 17) DNA, clones BamHI b, XhoI c, BamHI k, HpaI s plus v. ORGANISM Herpes simplex virus type 1 Viridae; ds-DNA enveloped viruses; Herpesviridae; Alphaherpesvirinae. REFERENCE 1 (bases 1 to 13052) AUTHORS Perry,L.J. and McGeoch,D.J. TITLE The DNA sequences of the long repeat region and adjoining parts of the long unique region in the genome of herpes simplex virus type 1 JOURNAL J. Gen. Virol. 69, 2831-2846 (1988) STANDARD full staff_entry COMMENT There were two small divergences within the two versions of the UL proximal part of RL (discussed in [1]). FEATURES from to/span description pept 413 1951 immediate-early transcriptional modulating protein IE63 (gene UL54) ORF 2175 2735 ORF of gene UL55 ORF 3602 3009 (c) ORF of gene UL56 pept 10787 10731 (c) IE110 exon 1 9965 9299 (c) IE110 exon 2 9162 7559 (c) IE110 exon 3 mRNA 275 1974 IE63 mRNA pre-msg 10935 7350 (c) IE110 mRNA and introns IVS 10730 9967 (c) IE110 intron 1 IVS 9298 9163 (c) IE110 intron 2 rpt 3837 4017 reiteration set 1 rpt 4224 4244 reiteration set 2 rpt 4465 4496 reiteration set 3 rpt 7170 7317 reiteration set 4 rpt 10422 10583 reiteration set 5 rpt 12007 12060 reiteration set 6 rpt 12730 12952 reiteration set 7 refnumbr 1 1 numbered 113322 in [1] signal 247 251 TATA box signal 3792 3786 (c) TATA box signal 10962 10958 (c) TATA box signal 1956 1961 polyadenylation signal signal 2777 2782 polyadenylation signal signal 2880 2875 (c) polyadenylation signal signal 2884 2879 (c) polyadenylation signal signal 7372 7367 (c) polyadenylation signal signal 7412 7407 (c) polyadenylation signal variant 1055 1062 eight c residues in HpaI s plus v clone; seven c residues in BamHI b clone site 3836 3836 end of UL BASE COUNT 1933 a 4879 c 4243 g 1997 t ORIGIN 1 bp upstream of BamHI site. 1 ggatcccaac gaccccgccc atgggtccca attggccgtc ccgttaccaa gaccaaccca 61 gccagcgtat ccacccccgc ccgggtcccc gcggaagcgg aacggggtat gtgatatgct 121 aattaaatac atgccacgta cttatggtgt ctgattggtc cttgtctgtg ccggaggtgg 181 ggcgggggcc ccgcccgggg ggcggaacga ggaggggttt gggagagccg gccccggcac 241 cacgggtata aggacatcca ccacccggcc ggtggtggtg tgcagccgtg ttccaaccac 301 ggtcacgctt cggtgcctct ccccgattcg ggcccggtcg ctcgctaccg gtgcgccacc 361 accagaggcc atatccgaca ccccagcccc gacggcagcc gacagcccgg tcatggcgac 421 tgacattgat atgctaattg acctcggcct ggacctctcc gacagcgatc tggacgagga 481 cccccccgag ccggcggaga gccgccgcga cgacctggaa tcggacagca gcggggagtg 541 ttcctcgtcg gacgaggaca tggaagaccc ccacggagag gacggaccgg agccgatact 601 cgacgccgct cgcccggcgg tccgcccgtc tcgtccagaa gaccccggcg tacccagcac 661 ccagacgcct cgtccgacgg agcggcaggg ccccaacgat cctcaaccag cgccccacag 721 tgtgtggtcg cgcctcgggg cccggcgacc gtcttgctcc cccgagcagc acgggggcaa 781 ggtggcccgc ctccaacccc caccgaccaa agcccagcct gcccgcggcg gacgccgtgg 841 gcgtcgcagg ggtcggggtc gcggtggtcc cggggctgcc gatggtttgt cggacccccg 901 ccggcgtgcc cccagaacca atcgcaaccc tgggggaccc cgccccgggg cggggtggac 961 ggacggcccc ggcgcccccc atggcgaggc gtggcgcggc agtgagcagc ccgacccacc 1021 cggaggccag cggacacggg gcgtgcgcca agcacccccc ccgctaatga cgctggcgat 1081 tgcccccccg cccgcggacc cccgcgcccc ggccccggag cgaaaggcgc ccgccgccga 1141 caccatcgac gccaccacgc ggttggtcct gcgctccatc tccgagcgcg cggcggtcga 1201 ccgcatcagc gagagctttg gccgcagcgc acaggtcatg cacgacccct ttggggggca 1261 gccgtttccc gccgcgaata gcccctgggc cccggtgctg gcgggccaag gagggccctt 1321 tgacgccgag accagacggg tctcctggga aaccttggtc gcccacggcc cgagcctcta 1381 tcgcactttt gccggcaatc ctcgggccgc atcgaccgcc aaggccatgc gcgactgcgt 1441 gctgcgccaa gaaaatttca tcgaggcgct ggcctccgcc gacgagacgc tggcgtggtg 1501 caagatgtgc atccaccaca acctgccgct gcgcccccag gaccccatta tcgggacgac 1561 cgcggctgtg ctggataacc tcgccacgcg cctgcggccc tttctccagt gctacctgaa 1621 ggcgcgaggc ctgtgcggcc tggacgaact gtgttcgcgg cggcgtctgg cggacattaa 1681 ggacattgca tccttcgtgt ttgtcattct ggccaggctc gccaaccgcg tcgagcgtgg 1741 cgtcgcggag atcgactacg cgacccttgg tgtcggggtc ggagagaaga tgcatttcta 1801 cctccccggg gcctgcatgg cgggcctgat cgaaatccta gacacgcacc gccaggagtg 1861 ttcgagtcgt gtctgcgagt tgacggccag tcacatcgtc gcccccccgt acgtgcacgg 1921 caaatatttt tattgcaact ccctgtttta ggtacaataa aaacaaaaca tttcaaacaa 1981 atcgcccctc gtgttgtcct tctttgctca tggccggcgg ggcgtgggtc acggcagatg 2041 gcgggggtgg gcccggcgta cggcctgggt gggcggaggg aactaaccca acgtataaat 2101 ccgtccccgt tccaaggccg gtgtcatagt gcccttagga gcttcccgcc cgggcgcatc 2161 cccccttttg cactatgaca gcgacccccc tcaccaacct gttcttacgg gccccggaca 2221 taacccacgt ggccccccct tactgcctca acgccacctg gcaggccgaa acggccatgc 2281 acaccagcaa aacggactcc gcttgcgtgg ccgtgcggag ttacctggtc cgcgcctcct 2341 gtgagaccag cggcacaatc cactgctttt tctttgcggt atacaaggac acccaccaca 2401 cccctccgct gattaccgag ctccgcaact ttgcggacct ggttaaccac ccgccggtcc 2461 tacgcgaact ggaggataag cgcggggtgc ggctgcggtg tgcgcggccg tttagcgtcg 2521 ggacgattaa ggacgtctct gggtccggcg cgtcctcggc gggagagtac acgataaacg 2581 ggatcgtgta ccactgccac tgtcggtatc cgttctcaaa aacatgctgg atgggggcct 2641 ccgcggccct acagcacctg cgctccatca gctccagcgg catggccgcc cgcgcggcag 2701 agcatcgacg cgtcaagatt aaaattaagg cgtgatctcc aaccccccca tgaatgtgtg 2761 taaccccccc caaaaaaata aagagccgta acccaaccaa accaggcgtg gtgtgagttt 2821 gtggacccaa agccctcaga gacaacgcga caggccagta tggaccgtga tacttttatt 2881 tattaactca caggggcgct taccgccaca ggaataccag aataatgacc accacaatcg 2941 cgaccacccc aaatacagca tggcgccaca ccacgccaca acagccctgt cgccggtatg 3001 gggcatgatc agacgagccg cgccgcgcgt tgggccctgt acagctcgcg cgaattgacc 3061 ctaggaggcc gccacgcgcc cgagttttgc gttcgtcgct ggtcgtcggg cgccaaagcc 3121 ccggacggct gttcggtcga acgaacggcc acgacagtgg cataggttgg ggggtggtcc 3181 gacatagcct cggcgtacgt cgggaggccc gacaagaggt cccttgtgat gtcgggtggg 3241 gccacaagcc tggtttccgg aagaaacagg ggggttgcca ataacccgcc agggccaaaa 3301 ctccggcgct gcgcacgtcg ttcggcgcgg cgccgggcgc gccgagcggc tcgctgggcg 3361 gcttggcgtg agcggccccg ctccgacgcc tcgccctctc cggaggaggt tggcggaatt 3421 ggcacggaca acaggggccc agcagagtac ggtggaggtg ggtccgtggg ggtgtccaga 3481 tcaataacga caaacggccc ctcgttccta ccagacaagc tatcgtaggg gggcggggga 3541 tcagcaaacg cgttccccgc gctccataaa cccgcgtcgg gttgcgccgc ctccgaagcc 3601 atggatgcgc cccaaagcca cgactcccgc gcgctaggtc cttggggtaa tggaaaaggc 3661 cctactcccc atccaagcca gccaagttaa cgggctacgc cttcgggaat gggactggca 3721 ccccggcgga ttttgttggg ctggcatgcg tcgcccaacc gagggccgcg tccacgggac 3781 gcgcctttta taaccccggg ggtcattccc aacgatcaca tgcaatctaa ctggctcccc 3841 tctccccccc tctcccctct ccccccctct cccctctccc cccctctccc ctctcccccc 3901 ctctcccctc tccccccctc tcccctctcc ccccctctcc cctctccccc cctctcccct 3961 ctccccccct ctcccctctc cccccctctc ccctctcccc ccctctcccc tctcccctct 4021 gctctttccc cgtgacaccc gacgctgggg gcgtggctgc cgggaggggc cgcggatggg 4081 cgggcctact tggtttcccg cccccccccc ccccccccga accgccccgc cggctttgcc 4141 cccctttgat cccctgctac ccccaacccg tgctggtggt gcgggttggg gggggatgtg 4201 ggcgggggtg cgcgggaggt gtcggtggtg gtggtggtgg tggtagtagg aatggtggtg 4261 aggggggggg ggcgctggtt ggtcaaaaaa gggagggacg ggggccggca gaccgacggc 4321 gacaacgctc cccggcggcc gggtcgcggc tcttacgagc ggcccggccc gcgctcccac 4381 cccccgggcc gtgtccttgc tttccccccg tctccccccc ccccgccttc tcctcctcct 4441 cctcgttttt ccaaaccccg cccacccggc ccggcccggc ccggcccggc ccggccaccg 4501 ccgcccaccc acccacctcg ggatacccag ccccggtccc ccgttccccg ggggccgtta 4561 tctccagcgc cccgtccggc gcgccgcccc ccgccgctaa accccatccc gcccccggga 4621 ccccacatat aagcccccag ccacacgcaa gaacagacac gcagaacggc tgtgtttatt 4681 taaataaacc aatgtcggaa taaacaaaca caaacacccg cgacgggggg acggagggga 4741 cggagggagg gggtgacggg ggacgggaac agacacaaaa acaaccacaa aaaacaacca 4801 cccaccgaca cccccacccc agtctcctcg ccttctccca cccaccccac gcccccactg 4861 agcccggtcg atcgacgagc acccccgccc acgcccccgc ccctgccccg gcgacccccg 4921 gcccgcacga tcccgacaac aataacaacc ccaacggaaa gcggcggggt gttgggggag 4981 gcgaggaaca accgagggga acgggggatg gaaggacggg aagtggaagt cctgataccc 5041 atcctacacc cccctgcctt ccaccctccg gccccccgcg agtccacccg ccggccggct 5101 accgagaccg aacacggcgg ccgccgcagc cgccgcagcc gccgccgaca ccgcagagcc 5161 ggcgcgcgca ctcacaagcg gcagaggcag aaaggcccag agtcattgtt tatgtggccg 5221 cgggccagca gacggcccgc gacacccccc ccccgcccgt gtgggtatcc ggccccccgc 5281 cccgcgccgg tccattaagg gcgcgcgtgc ccgcgagata tcaatccgtt aagtgctctg 5341 cagacagggg caccgcgccc ggaaatccat taggccgcag acgaggaaaa taaaattaca 5401 tcacctaccc acgtggtgct gtggcctgtt tttgctgcgt catctcagcc tttataaaag 5461 cgggggcgcg gccgtgccga tcgcgggtgg tgcgaaagac tttccgggcg cgtccgggtg 5521 ccgcggctct ccgggccccc ctgcagccgg ggcggccaag gggcgtcggc gacatcctcc 5581 ccctaagcgc cggccggccg ctggtctgtt ttttcgtttt ccccgtttcg ggggtggtgg 5641 gggttgcggt ttctgtttct ttaacccgtc tggggtgttt ttcgttccgt cgccggaatg 5701 tttcgttcgt ctgtcccctc acggggcgaa ggccgcgtac ggcccgggac gaggggcccc 5761 cgaccgcggc ggtccgggcc ccgtccggac ccgctcgccg gcacgcgacg cgaaaaaggc 5821 cccccggagg cttttccggg ttcccggccc ggggcctgag atgaacactc ggggttaccg 5881 ccaacggccg gcccccgtgg cggcccggcc cggggccccg gcggacccaa ggggccccgg 5941 cccggggccc cacaacggcc cggcgcatgc gctgtggttt ttttttcctc ggtgttctgc 6001 cgggctccat cgcctttcct gttctcgctt ctcccccccc ccttcttcac ccccagtacc 6061 ctcctccctc ccttcctccc ccgttatccc actcgtcgag ggcgccccgg tgtcgttcaa 6121 caaagacgcc gcgtttccag gtaggttaga cacctgcttc tccccaatag agggggggga 6181 cccaaacgac agggggcgcc ccagaggcta aggtcggcca cgccactcgc gggtgggctc 6241 gtgttacagc acaccagccc gttcttttcc ccccctccca cccttagtca gactctgtta 6301 cttacccgtc cgaccaccaa ctgccccctt atctaagggc cggctggaag accgccaggg 6361 ggtcggccgg tgtcgctgta accccccacg ccaatgaccc acgtactcca agaaggcatg 6421 tgtcccaccc cgcctgtgtt tttgtgcctg gctctctatg cttgggtctt actgcctggg 6481 gggggggagt gcgggggagg gggggtgtgg aaggaaatgc acggcgcgtg tgtacccccc 6541 ctaaagttgt tcctaaagcg aggatacgga ggagtggcgg gtgccggggg accggggtga 6601 tctctggcac gcgggggtgg gaagggtcgg gggagggggg gatggagtac cggcccacct 6661 ggccgcgcgg gtgcgcgtgc ctttgcacac caaccccacg tcccccggcg gtctctaaga 6721 agcaccgccc cccctccttc ataccaccga gcatgcctgg gtgtgggttg gtaaccaaca 6781 cgcccatccc ctcgtctcct gtgattctct ggctgcaccg cattcttgtt ttctaactat 6841 gttcctgttt ctgtctcccc cccccccacc cctccgcccc accccccaac acccacgtct 6901 gtggtgtggc cgaccccctt ttgggcgccc cgtcccgccc cgccacccct cccatccttt 6961 gttgccctat agtgtagtta accccccccg ccctttgtgg cggccagagg ccaggtcagt 7021 ccgggcgggc aggcgctcgc ggaaacttaa cacccacacc caacccactg tggttctggc 7081 tccatgccag tggcaggatg ctttcgggga tcggtggtca ggcagcccgg gccgcggctc 7141 tgtggttaac accagagcct gcccaacatg gcacccccac tcccacgcac ccccactccc 7201 acgcaccccc actcccacgc acccccactc ccacgcaccc ccactcccac gcacccccac 7261 tcccacgcac ccccactccc acgcaccccc actcccacgc acccccactc ccacgcatcc 7321 ccgcgataca tccaacacag acagggaaaa gatacaaaag taaaccttta tttcccaaca 7381 gacagcaaaa atcccctgag ttttttttta ttagggccaa cacaaaagac ccgctggtgt 7441 gtggtgcccg tgtctttcac ttttcccctc cccgacacgg attggctggt gtagtgggcg 7501 cggccagaga ccacccagcg cccgaccccc ccctccccac aaacacgggg ggcgtccctt 7561 attgttttcc ctcgtcccgg gtcgacgccc cctgctcccc ggaccacggg tgccgagacc 7621 gcaggctgcg gaagtccagg gcgcccacta gggtgccctg gtcgaacagc atgttcccca 7681 cgggggtcat ccagaggctg ttccactccg acgcgggggc cgtcgggtac tcggggggca 7741 tcacgtggtt acccgcggtc tcggggagca gggtgcggcg gctccagccg gggaccgcgg 7801 cccgcagccg ggtcgccatg tttcccgtct ggtccaccag gaccacgtac gccccgatgt 7861 tccccgtctc catgtccagg atgggcaggc agtcccccgt gatagtcttg ttcacgtaag 7921 gcgacagggc gaccacgcta gagacccccg agatgggcag gtagcgcgtg aggccgcccg 7981 cggggacggc cccggaagtc tccgcgtggc gcgtcttccg ggcacacttc ctcggccccc 8041 gcggcccaga agcagcgcgg gggccgaggg aggtttcctc ttgtctccct cccagggcac 8101 cgacggcccc gcccgaggag gcggaagcgg aggaggacgc ggccccggcg gcggaagagg 8161 cggcccccgc gggggtcggg gccgaggagg aagaggcaga ggaggaagag gcggaggccg 8221 ccgaggacgt caggggggtc ccgggcccac cctggccgcg cccccccggc cctgagtcgg 8281 agggggggtg cgtcgccgcc ctcttggccc ctgccggcgc gaggggggga cgcgtggact 8341 ggggggaggg gttttcctgg cccgacccgc gcctcttcct cggacgcacc gccgcctcct 8401 gctcgacaga ggcggcggag gggagcgggg cggcgccgga gggggcggcg ccgcgggagg 8461 gcccgtgccc accctccacg cccggccccc ccgagccgcg cgccaccgtc gcacgcgccc 8521 ggcacagact ctgttcttgg ttcgcggcct gagccaggga cgagtgcgac tggggcacac 8581 ggcgcgcgtc cgcggggcgg gcggccggct ccgccccggg ggccggggcg cgggggccgg 8641 gccccggagg cggcgctcgc acgcacgggg ccacggccgc gcgggggcgc gcgggtcccg 8701 acgcggccgc ggacgcgggg ggcccggggc ggggggcgga gcctggcatg ggcgccgcgg 8761 ggggcctgtg gggagaggcc gggggggagt cgctgatcac tatggggtct ctgttgtttg 8821 caaggggggc gggtctgttg acaagggggc ccgtccggcc cctcggccgc cccgcctccg 8881 cttcaacaac cccaacccca accccaaccc ccccggaggg gccagacgcc ccccgcggcg 8941 ccgcggctcg cgactggcgg gagccgccgc cgccgctgct gttggtggtg gtgttggtgt 9001 tactgctgcc gtgtggcccg atgggcgccg aggggggcgc tgtccgagcc gcggccggct 9061 ggggggctgc gtgagacgcc ccgcccgtca cggggggcgc ggcggcgcct ctgcgtgggg 9121 gggcgcgggg cgtccggcgg ggggcgggcg gtacgtagtc tgctgcaaga gacaacgggg 9181 ggcgcgatca ggttacgccc cctccccggc ccgccctttc ctcgcccgcc cgcctattcc 9241 tccctccccc cccctcctcc tcctcctccc ccagggtcct tgccgccccc cgcctcaccg 9301 tcgtccaggt cgtcgtcatc ctcgtccgtg gtgggctccg ggtgggtggg cgacagggcc 9361 ctcaccgtgt gcccccccag ggtcaggtac cgcggggcga accgctgatt gcccgtccag 9421 ataaagtcca cggccgtgcc cgccctgacg gcctcctcgg cctccatgcg ggtctggggg 9481 tcgttcacga tcgggatggt gctgaacgac ccgctgggcg tcacgcccac tatcaggtac 9541 accagcttgg cgttgcacag cgggcaggtg ttgcgcaatt gcatccaggt tttcatgcac 9601 gggatgcaga agcggtgcat gcacgggaag gtgtcgcagc gcaggtgggg cgcgatctca 9661 tccgtgcaca cggcgcacac gtcgccctcg tcgctccccc cgtcctctcg agggggggcg 9721 cccccgcaac tgccggggtc ttcctcgcgg ggggggctcc cccccgagac cgccccccca 9781 tccacgccct gcggccccag cagccccgtc tcgaacagtt ccgtgtccgt gctgtccgcc 9841 tcggaggcgg agtcgtcgtc atggtggtcg gcgtcccccc gcccccccac ttcggtctcc 9901 gcctcagagt cgctgctgtc cggcaggtct cggtcgcagg gaaacaccca gacatccggg 9961 gcgggctaag gggaaaaaag gggggcgggt aagaatgggg ggggatttcc cgcgtcaatc 10021 agcacccacg agttccccct ctcccccccc cgcctcacaa agtcctgccc ccctgctggc 10081 ctcggaagag gggggagaaa ggggtctgca accaaaggtg gtctgggtcc gtcctttgga 10141 tcccgacccc tcttcttccc tcttctcccg ccctccagac gcaccggagt cgggggtccc 10201 acggcgtccc ccaaatatgg cgggcggctc ctccccaccc ccctagatgc gtgtgagtaa 10261 ggggggcctg cgtatgagtc agtggggacc acgcccccaa cacggcgacc ccggtccttg 10321 tgtgtttgtt gtgggggcgt gtctctgtgt atgagtcagg gggtcccacg gcgaccccgg 10381 gccctgcgtc tgagtcaaag gggccatgtg tatgtgttgg gggtctgtat atataaagtc 10441 agggggtcac atggcgaccc ccaacagggc gaccccggtc cctgtatata tagggtcagg 10501 gggttccgca ccccctaaca tggcgccccc ggtccctgta tatatagtgt cacggggttc 10561 cacgccccct aacatggcgc cccaacatgg cgcccggctc ccgtgtatga gtgggggtcc 10621 cccaacatgg cggccggttc cagtgtaagg gtcgggggtc ccccaacatg gcgcccccca 10681 atatggcgcc ccccaatatg gcgccccaga catggcgccc ggcccctcac ctcgcgctgg 10741 gggcggccct caggccggcg ggtactcgct ccggggcggg gctccatggg ggtcgtatgc 10801 ggctggaggg tcgcggacgg agggtccctg ggggtcgcaa cgtaggcggg gcttctgtgg 10861 tgatgcggag agggggcggc ccgagtctgc ctggctgctg cgtctcgctc cgagtgccga 10921 ggtgcaaatg cgaccagact gtcgggccag ggctaactta taccccacgc ctttcccctc 10981 cccaaagggg cggcagtgac gattccccca atggccgcgc gtcccagggg aggcaggccc 11041 accgcggggc ggccccgtcc ccggggacca acccggcgcc cccaaagaat atcattagca 11101 tgcacggccc ggcccccgat ttgggggccc aacccggtgt cccccaaaga accccattag 11161 catgcccctc ccgccgacgc aacaggggct tggcctgcgt cggtgccccg gggcttcccg 11221 ccttcccgaa gaaactcatt accatacccg gaaccccagg ggaccaatgc gggttcattg 11281 agcgacccgc gggccaatgc gcgaggggcc gtgtgttccg ccaaaaaagc aattagcata 11341 acccggaacc ccaggggagt ggttacgcgc ggcgcgggag gcggggaata ccggggttgc 11401 ccattaaggg ccgcgggaat tgccggaagc gggaagggcg gccggggccg cccattaatg 11461 agtttctaat taccataccg ggaagcggaa caaggcctct tgcaagtttt taattaccat 11521 accgggaagt gggcggcccg gcccattggg cggtaactcc cgcccaatgg gccgggcccc 11581 gaagactcgg cggacgctgg ttggccgggc cccgccgcgc tggcggccgc cgattggcca 11641 gtcccgcccc cgaggcggcc cgccctgtga gggcgggctg gctccaagcg tatatatgcg 11701 cggctcctgc catcgtctct ccggagagcg gcttggtgcg gagctcccgg gagctccgcg 11761 gaagacccag gccgcctcgg gtgtaacgtt agaccgagtt cgccgggccg gctccgcggg 11821 ccagggcccg ggcacgggcc tcgggcccca ggcacggccc gatgaccgcc tcggcctccg 11881 ccacccggcg ccggaaccga gcccggtcgg cccgctcgcg ggcccacgag ccgcggcgcg 11941 ccaggcgggc ggccgaggcc cagaccacca ggtggcgcac ccggacgtgg ggcgagaagc 12001 gcacccgcgc gggggtcgcg ggggtcgcgg gggtcgcggg ggtcgcgggg gtcgcggggg 12061 gctccggcgc cccctccccg cccgcgcgtc gcaggcgcag gcgcgccagg tgctccgcgg 12121 tgacgcgcag gcggagggcg aggcgcggcg gaaggcggaa ggggcgcgag ggggggtggg 12181 aggggtcagc cccgcccccc gggcccacgc cgggcggtgg gggcccgggg ggcggggggc 12241 ggcggcggtg ggccgggcct ctggcgccga ctcgggcggg gggctgtccg gccagtcgtc 12301 gtcatcgtcg tcgtcggacg cggactcggg aacgtggagc cactggcgca gcagcagcga 12361 acaagaaggc gggggcccac cggcgggggg cggcggcggg gcggccgcgg gcgcgctcct 12421 gaccgcgggt tccgagttgg gcgtggaggt tacctgggac tgtgcggttg ggacggcgcc 12481 cgtgggcccg ggcggccggg ggcggcgggg gccgcgatgg cggcggcggc gggccatgga 12541 gacagagagc gtgccggggt ggtagagttt gacaggcaag catgtgcgtg cagaggcgag 12601 tagtgcttgc ctgtctaact cgctagtctc ggccgcgggg ggcccgggct gcccgccgcc 12661 accgctttaa agggccgcgc gcgacccccg gggggtgtgt tttggggggg gcccgttttc 12721 ggcgtctggc cgctcctccc cccgctcctc cccccgctcc tccccccgct cctccccccg 12781 ctcctccccc cgctcctccc cccgctcctc cccccgctcc tccccccgct cctccccccg 12841 ctcctccccc cgctcctccc cccgctcctc cccccgctcc tccccccgct cctccccccg 12901 ctcctccccc cgctcctccc cccgctcctc cccccgctcc tccccccgct cccgcggccc 12961 cgccccccac gcccgccgcg cgcgcgcacg ccgcccggac cgccgcccgc cttttttgcg 13021 cgcgcgcgcg cccgcggggg gcccgggctg cc // LOCUS HS5IE5KB1 2520 bp ds-DNA VRL 15-DEC-1989 DEFINITION Human cytomegalovirus genome, BamHI-HindIII fragment (5'-terminal part of the 5 kb transcript from the immediate-early region). ACCESSION D00328 KEYWORDS immediate-early gene; transforming region. SOURCE Human cytomegalovirus (strain AD169) genomic DNA, clone pAT153 provided by J. D. Oram and R. G. Downing. ORGANISM Human cytomegalovirus Viridae; ds-DNA enveloped viruses; Herpesviridae; Betaherpesvirinae. REFERENCE 1 (bases 1 to 2520) AUTHORS Kouzarides,T., Bankier,A.T. and Barrell,B.G. TITLE Nucleotide sequence of the transforming region of human cytomegalovirus JOURNAL Mol. Biol. Med. 1, 47-58 (1983) STANDARD full staff_entry REFERENCE 2 (sites; 5 kb RNA start site) AUTHORS Plachter,B., Traupe,B., Albrecht,J. and Jahn,G. TITLE Abundant 5 kb RNA of human cytomegalovirus without a major translational reading frame JOURNAL J. Gen. Virol. 69, 2251-2266 (1988) STANDARD full staff_entry COMMENT In [2], the 5' end of the 5 kb RNA was determined by primer extension. In [1], the BamHI-HindIII fragment was described as the sequence containing the region capable of transforming NIH3T3 cells. FEATURES from to/span description site 1664 1664 5 kb RNA start site BASE COUNT 647 a 713 c 589 g 571 t ORIGIN 1 bp upstream of BamHI site. 1 ggatcccgca gcagtccgtt ggcggagtcc gaggagtgct gaccgccgct cccgccgccg 61 ccaccgccac caccagcgcc gccgcctcca ccaccaccgg cagacgagga cgactttttg 121 cgccgttctt cgtgacgctg ttcctgcctt cgccgttgct gttcctccgc agaagggccg 181 tcgcgagtcc cgccgctgcc acccagcgga ggacacgcag acggcggaag cggtagacgc 241 ggcgccgcaa ccaccgcctc cgctggagga ttcgccgtgg tttttcaggt aatgccgcac 301 gtaagtcact tgcaaattac cgttctcgga aatcatggtg agcagcgcgc tctcattggg 361 tccgctggag cccaccaccg aggagacgga tttgttgaag acgataccgc cgcgtacaaa 421 gaggtgctcc tgcagctccc cgtcgcccgt aatgtcaata gacatgaagc cctgctgcgt 481 cttggcgccg gccgaagcct cgccgtgctg cataatggta gcgcagagcc agcccttgtt 541 gaggtgcagc accttgccat cgccgtccac gcagttgacc agacgcgcgg tatcgaagac 601 gaactggcgc acgtcgaaag tctgattgac gctttgatgc aggatgcgat taggattcgc 661 aaaagtccag tattttcgca cgacggtagt agggagatcc atgacgcggc ggcgcaaagc 721 gcgagcgcaa cgctcgtcgg aggccgtgga gcgagtgccg ccgcagccgg cagagcgccg 781 aaccccgtcg cagactctat ttatacatca tctttccagc ccgcctagca acacccacaa 841 acaacgtcac gacgcaacgt ggttaaacag tacgtttatt aaagtaactg ggtgaacgac 901 accggagcgg actgcaaatc gcaacgctac tttctcgagt gcagatactc ttcgagacgg 961 ctaaacaacg tgtccaactc gctgagacct ttccgcgtgc cgttatccga tttcctttcc 1021 gcctcctgag acagccgact aatcacggac ttatctccgc aacctaacag aggctgggag 1081 cccgacaaaa gtaaaacagc gtgctatgaa cacgttgtta cctctgtgcg gacagcgccg 1141 ccacagagac acttacacat tgccgcatgt ctttgtagat ggattctagc gtcgagcgca 1201 tactatgcaa ttccgtcttg agtccgggat agacgtggtc gcctgcggga aacacgatct 1261 ccagataccg cctcaacaac cagtccatga cgctgcatcc ccaacagcct ttgaccaccg 1321 taccgtcgag ccacacggag tagtcgtcct cacgttgcta caagaggaaa actacgtcac 1381 ccgacacgcg gaaaagaaag accgtcgcaa taaaccgtac ctacgtgacc taccaacgta 1441 ggttttactc gatgaaaggt gacgcggaga tcttgcaatc tggtcgcgta atcctctgga 1501 cgacactgcg gctttgtatt ctttatcgtc gtcgtcgccg gcttcgcctc ctcggaagcg 1561 cctagaaaaa agacgatcag gaccagagag gaagagacca tcaccgacag catcgccgca 1621 gcatgccgtc ccagtccgcc gcccaactgc gcgtcccagg taggtggtcc tttatgttat 1681 gatgtttttg tcaatttttt ttttcaattt ctttcttccg cggttagaat agtttctgta 1741 ggaaccaatt atcaatctga cgggttatcg tcaccacttg atggcaaaac gaaatttttt 1801 ttttcattgc cttgaagtct ctcccgccac caccaccacc gccgttgtct ccggctggag 1861 atcaagacga aattcctcct ctctaaaaaa aaaggtggtg ggcttaattg atcatggcaa 1921 gaagaaaaac tatactgaat aaactgtgtg caaaactact agtaacaaca aaaatagcga 1981 ctagatacac cacggacaat ctcagcagat actctctcaa aagaaaaaaa agacgccgta 2041 acgtcggaga atctggtatc tactgcctga cgaatttttt tttcgtccat gtatgtgatt 2101 acgagtagta gtggtatgta gaacaagaag aaaaatcgta gtccccaaaa ggataataaa 2161 aataacactc atagagaatc acagattttc tctagacaac tctctatcca aataacgaat 2221 gtgaagcgta caaagtaaga tattcaaaga atagcacctt catagattca tttcagcttt 2281 ctactccttg taatttaaag ttgcactaaa caaagctctt aaagaaggtt cgagccgctc 2341 tcgatcactc atcgatcacg cgagtcttat tattccacca caacgtaaca ttcttcactt 2401 tgtagagaca ctttatcgta gagtaaccct cgatttccta gctgttgttt tttgattatt 2461 ttgttcgctc taagagagat actcgaaatc ctacttacac caaggaccct acatcatcgc // LOCUS HS5IE5KB2 1291 bp ds-DNA VRL 15-DEC-1989 DEFINITION Human cytomegalovirus genome, 3'-terminal part of the 5 kb transcript from the immediate-early region. ACCESSION D00327 KEYWORDS immediate-early gene. SOURCE Human cytomegalovirus (strain AD169) genomic DNA, clones pGJ0.1, pGJ0.2, pGJ0.3, pGJ0.5, pGJ0.6, and pGJ0.7. ORGANISM Human cytomegalovirus Viridae; ds-DNA enveloped viruses; Herpesviridae; Betaherpesvirinae. REFERENCE 1 (bases 1 to 1291) AUTHORS Plachter,B., Traupe,B., Albrecht,J. and Jahn,G. TITLE Abundant 5 kb RNA of human cytomegalovirus without a major translational reading frame JOURNAL J. Gen. Virol. 69, 2251-2266 (1988) STANDARD full staff_entry COMMENT One of the predominant transcripts from the immediate early region is a 5 kb RNA. This sequence analysis revealed multiple stop codons throughout the AT-rich potential coding region. FEATURES from to/span description RNA < 1 1267 5 kb RNA (3'-terminal part) (alt.) RNA < 1 1280 5 kb RNA (3'-terminal part) (alt.) BASE COUNT 384 a 333 c 275 g 299 t ORIGIN 15 bp upstream of EcoRI site. 1 ctttttattt tttcgaattc atgttcgaaa acacaagctt ccataacaag aacccgtacc 61 gaagaaaagt tccatcgact aaaaagaaaa aagaaaacga agcaagacct cgacgacaac 121 aacacatcaa agaaagacga ccagctgatt atgttcttag aattccacac acccgcgagc 181 cgatccgcaa acgtcgtgcg aggcgcgctt tctctggctc gacacaatga tcacaccgca 241 cgctatagac acgtcgtcgt ggacgacgat gacctcaggc cacgaatgac aaccaacatg 301 ggcaaagtcc aattagccaa aaagacgacg attctaagaa ttgatgaatc ctcgatatac 361 gcctatcgat aggtttcaat tgtgtcatat acatcaaatg aaaaacagga cgcacgataa 421 aagcttcctt acagcataac tgtaacatac gatcatggaa catctcctca catacctttc 481 tcctctcaca taggaaaaca aaactctttt ttttctttcc tgtcaaggaa aaaatcaatg 541 taccaccaca tcactttctc ctcggtcccg gcgacggatg ggcgcgcacg cggacaaaga 601 cccaccggcc acttccactt attttttgtt gttaatcgtc ttctcccccg cacgcggacg 661 accaccaacg ctagctgctc attccgtcaa ccagtcacac cgcgcacgga gaaggggccg 721 gggtccgcgg gcacccgcgg cggaggcgcg gttccctctc tctaattccc tggaaaacaa 781 gtaatgacaa acaaaaagac gacaaaaggt ctctattctg ctacatgaga gaaattatag 841 ctgttggcaa tttttcaaaa tacatgttat aaggcatcct ctctgccaca cgcgcagtca 901 cggataggat cagtgcgtat tcattataaa aaaaaacaca aacaacccat atatgtgaag 961 cagaatgatg accgaccgca cggagcgacg ccgtcgactg tcagcctcgc gaggagacac 1021 cgcggaccgg ggaaacggat aagtttacga acagaaatct caaaagacgc tgacccgata 1081 agtaccgtca cggagacacg gtggtttttt attgaatttc cagtgtatcg agccaccgtg 1141 atgcaggtac ggtggtttta tgtaaagtgc cgctatctat aggcgatgtg ttcctgacgg 1201 tgtgtgtttt tttggggata gacaacgtgg ttcttgtacg tggtttttac cctgctcaat 1261 aaagtcacgt tttccttaca ggtgttgtgt c // LOCUS HSE1GB 4283 bp ds-DNA VRL 15-DEC-1989 DEFINITION Equine herpesvirus 1(EHV1) glycoprotein B (gB) gene and 3' end of an overlapping upstream gene with homology to the HSV1 ICP18.5 gene. ACCESSION D00401 KEYWORDS gB gene; glycoprotein; glycoprotein B. SOURCE Equine herpesvirus 1 (isolate HVS 25A) genomic DNA, clones pMAC[209, 221]. ORGANISM Equine herpesvirus type 1 Viridae; ds-DNA enveloped viruses; Herpesviridae; Alphaherpesvirinae. REFERENCE 1 (bases 1 to 4283) AUTHORS Whalley,J.M., Robertson,G.R., Scott,N.A., Hudson,G.C., Bell,C.W. and Woodworth,L.M. TITLE Identification and nucleotide sequence of a gene in equine herpesvirus 1 analogous to the herpes simplex virus gene encoding the major envelope glycoprotein gB JOURNAL J. Gen. Virol. 70, 383-394 (1989) STANDARD full staff_entry COMMENT Submitted in computer readable form by Whalley,J.M. on 19-Nov-1988. The EHV1 gB ORF appears to be overlapped at its5' end by 135 nt of the 3' end of an upstream ORF the potential translation product of which has approximately 50% identity with HSV gene ICP 18.5 and VZV gene 30 product. FEATURES from to/span description ORF 951 3893 equivalent to the gB glycoprotein gene of HSV. ORF < 1 1089 analogous ORF to HSV1 ICP18.5. sigp 951 1205 signal peptide. signal 719 723 putative CAT box. signal 802 806 putative TATA box. signal 3902 3907 putative polyA signal. site 823 831 similar sequence to putative HSV1 mRNA start site. BASE COUNT 1090 a 1168 c 1118 g 907 t ORIGIN map position aprox. 0.41-0.44 unit. 1 ctgcagaggc tcacggaccc agacaccagc aacagagagg ccctcaagca gctgctgggt 61 cgcatagggg tggataccga cgacggggcc ggcgagttgg gggacgcctt agacgtggat 121 ttggataatc taggtggggc ccctcctgtc aacagcaccc cctgtggtga ggacgccctc 181 tgtcgaaccg tttccgagga acgcccgtgg gacaaacttt tagagcgggc gactgcggat 241 gcttcgcagc gcaggcgcat gtacgcggag cgtctgtcaa agcgttccat cgccagtttg 301 gggcgctgcg tgcgcgaaca gcgaagagaa ctagaaaaaa ccctgagagt taacgtgtat 361 ggcgaagtgc tgctacatac gtacgtatcg tcctacaacg ggttttgcgc caggcgcggg 421 ttttgcgcgg cggtgagtcg agcgggtacc atcatagata accgctctag cacgtccgcg 481 ttcgactcgc atcagttcat gaaggcggcg ctgcttcgcc accccattga ccagtcgctc 541 atgccgtcca taacacacaa gtttttcgag ctgatcaacg ggcccgtgtt tgacaacgct 601 ggccacaact ttgcgcagcc gccaaacacg gcattatatt acagcgttga aaacgttggg 661 ttgttaccgc atctcaagga ggaactagct cggtttatga ttactgcggc taaaggtgat 721 tggtcaatta gcgagtttca aaggttttat tgctttgagg gagtgacagg tgtgacggcc 781 acgcagcggc tggcgtggaa atatatcggg gagctcatcc tagccgccgc agtattctcc 841 tcggttttcc actgtggaga ggtgcgcctc ctgcgcgcag atcgtaccta cccggactcc 901 agcggcgcac agcgctgcgt gagcggcatt tacataacct acgaggcgtc atgtcctctg 961 gttgccgttc tgtcggcggc tccacatggg gcaattggcg cggagacggt ggtgatttac 1021 gacagcgacg tgttctctct cctgtatgca gtgctccagc agctggctcc tggatcggga 1081 gccaactagg caatgttgga aacttactcg ccacccccca cccgctggga aagccggcat 1141 catcgagggt gggcacaata gttctagcct gtttgttgct ttttggaagc tgtgttgtta 1201 gagccgtacc caccacgcca agccccccaa ctagtactcc cacttccatg tcaacgcact 1261 cccatgggac agtagaccct acgctgctcc ccacagaaac gcccgaccca ctcagactgg 1321 ctgtgcgcga gtccggtata ctcgctgagg atggagactt ttacacctgc ccaccgccta 1381 ccggatccac cgtcgtacgc atcgaaccac ctagaacttg ccccaagttt gaccttggga 1441 gaaacttcac ggaggggatt gctgttattt ttaaggaaaa catcgctccc tacaaattca 1501 gggcaaacgt atactacaag gacatcgttg taacacgtgt gtggaaagga tacagccata 1561 cgtccctgtc cgacagatac aatgacaggg ttccggtttc ggtggaggag atcttcggtc 1621 tcatcgacag taagggaaaa tgttcgtcaa aggccgagta cctcagagat aacatcatgc 1681 accacgcgta ccacgacgac gaggacgagg tggagcttga tttgtgccgt ccaagtttgc 1741 aactccgggg ggccagagcc tggcagacca ccaacgatac tacgtcttac gtggggtgga 1801 tgccatggag gcactacacg tcaacgtctg tcaactgcat cgtcgaggag gtggaggcgc 1861 ggtccgtcta cccctacgac tccttcgccc tgtccaccgg tgatattgtg tacgcgtctc 1921 cgttttacgg cctgagggct gccgctcgca tagagcacaa tagctacgcg caggagcgtt 1981 tcaggcaagt tgaagggtac aggccccgcg acttagacag taaactacaa gccgaagagc 2041 cggttaccaa aaattttatc actaccccgc atgtcaccgt cagctggaac tggaccgaga 2101 agaaagtcga ggcgtgtacg ctgaccaaat ggaaagaggt cgacgaactc gtcagggacg 2161 agttccgcgg gtcctacaga tttactattc gatccatctc gtctacgttt atcagtaaca 2221 ctactcaatt taagttggaa agtgcccccc ttactgaatg tgtatccaaa gaagcaaagg 2281 aagccataga ctcgatatac aaaaagcagt acgagtctac gcacgtcttt agcggtgatg 2341 tggaatatta cctggcacgc ggggggttct taattgcatt cagacctatg ctctccaacg 2401 aactcgccag gctgtacctg aacgagcttg tgagatctaa ccgcacctac gacctaaaaa 2461 atctattgaa ccccaatgca aacaataaca ataacaccac gcgaagacgc aggtctctcc 2521 tgtcagtacc agaacctcag ccaacccaag atggtgtgca tagagaacaa attctacatc 2581 gcttgcacaa acgagcagtg gaggcaacgg caggtaccga ttcttccaac gtcaccgcca 2641 aacagctgga gctcatcaaa accacgtcgt ctatcgagtt tgccatgcta cagtttgcat 2701 acgatcacat ccaatcccac gtcaatgaaa tgctaagtag aatagcaact gcgtggtgta 2761 ccctccaaaa caaagagcgg accctatgga acgaaatggt gaagattaac ccgagcgcca 2821 tagtctccgc aacccttgac gagcgagttg cagcgagggt cctgggggac gtgatagcta 2881 taacgcactg cgccaaaata gagggcaacg tgtacttgca aaactccatg cgctcgatgg 2941 acagtaacac gtgctactcc cgcccccccg taacatttac aattactaag aatgcaaaca 3001 acagagggtc gatagaaggc cagctgggag aggagaacga gattttcacg gagcgcaagc 3061 tgatcgagcc gtgcgccctc aatcagaagc gctactttaa gtttggcaaa gagtacgttt 3121 actacgagaa ctacacgttc gtccgcaaag tgccccccac ggaaatcgag gttatcagca 3181 cgtacgttga actaaacttg acccttttgg aagaccgcga gtttctgccc ctggaggtgt 3241 acacgcgggc tgagctggag gacaccggcc tgctagacta cagcgaaata cagcgccgca 3301 accagctcca cgctctcagg ttttacgaca tcgacagcgt ggtcaacgtg gacaataccg 3361 cagtgattat gcaggggatc gccagctttt tcaagggcct gggtaaagtg ggggaggccg 3421 tgggaacgct cgttctcggc gccgccggcg ctgttgtttc aaccgtatct ggaatagctt 3481 cgtttttaaa caacccattt ggggggctag ccatcggcct gctggtaatc gccggcctgg 3541 tagctgcgtt ttttgcttac agatatgtaa tgcagatccg cagtaacccc atgaaagctc 3601 tataccccat aacaacaaag gccttgaaaa acaaagccaa aacttcctac ggccagaacg 3661 aggaggacga tgggagcgac tttgatgagg ccaagcttga agaggctcgc gaaatgatca 3721 aatacatgtc tatggtttcg gccctggaaa agcaggaaaa gaaagctata aagaaaaaca 3781 gtggggttgg cctgatcgcc agtaacgtct caaagctggc cctgcgaagg cgcggtccca 3841 aatatacccg actccaacag aacgatacca tggaaaatga aaaaatggtt taaacatgtt 3901 taataaatat tatgacacgt actcaaagtg tgacctcata tttgcataac cactttctag 3961 ttccggcccc aaggatattt aagcctagta tctccgccga ggtttcatcc tcattcacca 4021 actcacactt agagttgacg cttcctcttg cgcctttgct ctcgccgctc ctgtgttagc 4081 gtatactgcc caagaaatgg attctccacg cggtatctcc acagctaccg gtgatgccca 4141 cgccgaggcc gcggtttccc cagccgcgaa atccagataa aaacgaagcc cccgatgtag 4201 acggaccaga agccactact gagtgtttag accacaccta cacccaacag acaagcgggg 4261 gtgatggcct agatgctatc gat // LOCUS HSEIEP 8174 bp ds-DNA VRL 15-DEC-1989 DEFINITION Equine herpesvirus type 1 immediate-early protein gene, complete cds. ACCESSION J04366 KEYWORDS immediate-early protein; nonstructural protein; regulatory gene. SOURCE Equine herpesvirus type 1 (strain Kentucky A) DNA. ORGANISM Equine herpesvirus type 1 Viridae; ds-DNA enveloped viruses; Herpesviridae; Alphaherpesvirinae. REFERENCE 1 (bases 1 to 8174) AUTHORS Grundy,F.J., Baumann,R.P. and O'Callaghan,D.J. TITLE DNA sequence and comparative analyses of the equine herpesvirus type 1 immediate early gene JOURNAL Virology 172, 223-236 (1989) STANDARD full staff_review COMMENT Draft entry and computer-readable copy of sequence [1] kindly submitted by D.J.O'Callaghan, 13-JUN-1989. FEATURES from to/span description pept 988 5451 immediate-early protein signal 334 339 TATA box signal 5735 5740 polyA signal BASE COUNT 1171 a 2937 c 2790 g 1276 t ORIGIN 1 cccggggagg agacgcatgc agatgagatg tgcatcgagg tgtcatggcg tccaggggcg 61 ttcaccttta tgcatatgag aggcgctatt cggcatcccg ttggcgcgac gcgcttccct 121 gggaggagac atacgcaaat tagaaacgac acacgggttc taattggttg gagcgggggg 181 gaggcgaaaa gcgcatgcaa atgcaaagcg cgggaccggg ccccataggc tagagccgct 241 acacgcccac cgcccatcat caacggccaa tcacaatcga tagtgtgggc tggccactcc 301 cactaggggg aaggcaaaac tccctcgtag tagtataaag cacctgttgc ttacccatcg 361 tagcatcgcg gactagagag cctttcagct cactggacca gccagccttc gaggactatc 421 gatcgcatct tggaaagctt acccgctctt ggcactcctt cttcggcttg cggaggtaag 481 agctccccgg ggacacgacc ggcttcgatc tgcttcttct cccggggaga gcgttagaga 541 acggggcgag tgccaaaaag gccatggaac ccctccaaca acgatgtccc gagggggtgg 601 ctccgaggcc cgcttcgacc tagcggtcga agcgcggtgg ggatacttac ctcgaagccg 661 gcgaaggcta taccttcccc gggcagaccc gggcggcttc tgcctcggcg gagctcggcg 721 cggaagcctg gatatctgac ggggcgtggt taccacccaa gcgggggaga ggcccgggcc 781 gcccgcgttc ccttttacca ttcggctccg ctccaactca acatcttttc cgcctctgct 841 tttccagggt agagaagcgg cgcccgtcgt ccgagcgccc gccgcggaac cccgccaccg 901 gccacccgcc aaccttccct tctcggtctt ccgagcgagc cttctcgtgc ggttggttct 961 cgaccccgaa gccggagcta gcacgccatg gccagccagc gcagcgactt cgccccggac 1021 ctctacgact tcatcgagag caacgacttc ggcgaggacc ccctcatccg cgcagccagc 1081 gcggccgaag aggggttcac ccagcccgcc gcgcccgacc tgctgtacgg cagccagaac 1141 atgttcgggg tggacgacgc tccgctctcc accccggtgg tggtcatccc tccgccgtct 1201 ccggctcccg agccccgcgg agggaaggcg aagcggtcgc cctcggccgc cggcagcggc 1261 ggtcctccta ccccggcggc tgccgcccag ccggcgtccc cggcacccag cccggctccg 1321 gggctcgccg cgatgctgaa gatggtccac tcctccgtgg ccccggggaa cggtcgccgg 1381 gccacgggct cctcatcacc cggcggtggg gacgcggccg acccggtcgc cctcgacagc 1441 gataccgaga cctgcccggg gtccccgcag cccgagtttc catcctcggc ctccccgggc 1501 ggagggtccc cggcaccccg ggtccggagc atctccatct catcgtcgtc ctcgtcctcg 1561 tcctcgatgg acgaggacga ccaggcggat ggtgccgggg cgagtagctc ctcttcgtcg 1621 tcctccgacg acagcgacag cgacgaaggc ggcgaggagg agacccctcg cccgcggcac 1681 tcgcagaacg ccgcgaagac cccgtcggcc gccggctctc ccgggccgtc ctccggaggg 1741 gatcgcccgg ccgctggggc cgccaccccg aagagctgcc gctccggcgc cgcttccccc 1801 ggcgcacccg ctccggctcc agcttcggcg cccgctccca gccgcccggg aggaggcctc 1861 ctccctccgg gggctcgcat tttagagtac ctggagggcg tccgcgaggc caatctggcc 1921 aagacgctgg agaggcccga accgcccgcg gggatggctt ctccgccggg ccggagccct 1981 caccggctcc ccaaggacca gcgtccgaaa tcggctctgg cgggagcgtc gaagcgcaag 2041 cgggccaacc ccagacccag accccagacc cagacccagg caccggccga ggaggccccg 2101 cagacggccg tgtgggactt gctggacatg aactcatccc aggctaccgg ggcggcggca 2161 gcagcagcat cggccccggc ggcggcttcg tgcgccccgg gcgtctacca gcgcgagccg 2221 cttctcaccc cgtccgggga cccctggccc gggtcggatc caccaccgat ggggagggtg 2281 cgatacgggg ggaccgggga ctcgcgggac gggctgtggg acgaccccga gatagtcctg 2341 gccgcctcgc gctacgccga ggcgcaggcc ccagtaccgg tcttcgtgcc ggagatgggg 2401 gactccacca agcagtacaa cgctctggtc cgcatggtgt tcgagagccg cgaagccatg 2461 tcctggctgc agaactctaa gctcagcggg caagaccaga acctggcgca gttctgccag 2521 aagttcatcc acgctccgcg cggacacggg tccttcatca ccgggagcgt ggccaacccc 2581 ctgccccaca tcggggacgc catggcggcc gggaacgcgc tctgggccct gccacacgcg 2641 gccgcctcgg tggccatgag ccgccgctac gatcgcactc agaagagctt catcctccag 2701 agcctccggc gcgcctacgc ggacatggcc tacccgagag acgaggcggg gaggccggac 2761 tcactcgccg ccgtggccgg ctgcccggcc caggccgccg ctgccgcggc cagccagcaa 2821 cagcccgagg ccccggcgcc ctcggtccgc gtccgcgaag cgtacacccg ggtctgcgcg 2881 gccctcgggc cccgacgcaa ggctgccgcg gccgcggccg ctccggggac cagggcgccc 2941 aggccgtccg ccttcagact cagggagctc ggggacgcct gcgtgctggc ctgccaggcc 3001 gtcttcgagg ccctcctgcg cctccgcggc ggggcgtccg ccgtccccgg actggacccc 3061 agcgagatcc cctctcccgc ctgccctccc gaggcgctgt gctccaaccc ggccgggctg 3121 gagacggcgg ccctctccct ctacgaactc agggacctgg tcgagcgggc caggctcctc 3181 ggggactctg accctaccca ccgcctgggc tccgacgagc tgcgcctcgc ggtgcgcgcc 3241 gttctggtgg tggcccggac cgtggcgccg ctggtgcgct acaacgccga gggggcccgg 3301 gcccgggcct cggcctggac cgtcacccag gccgtgttca gcatacccag cctggtcggg 3361 gggatgttgg gggaggccgt gtccctgctg gccccaccga ctcggtccca gcagccctca 3421 tcgtcctcgc ccggcggcga gcccttctcc ggctccgcgg ccgcggaggg gagccttcag 3481 accctgccgc ccctgtggcc caccgtcccc gggaagcagt ccgcgacggt cccctcgtcc 3541 cactcccagt ccccccagca ctcccagagc ggcggaggcg ccggggctac gaccgccacc 3601 tgctgccggg ccacccagac aaacgcccgc tcccgggggc agcagcacca gccgcagaag 3661 gcccgctccc ctcaggcggc cgcctccccg gcccacctca gccaggaggc gatgcccggc 3721 tcctcctcgg acgaccgtgc catccacggg cgccccaggg gcaagagcgg caagcggcgc 3781 tccgagcccc tggagccggc ggcccaggcc ggagcctcgg cctccttctc ctcgtccgcc 3841 cgggggtacg atccctcggg gccggtcgac agccctccgg cccccaagcg cagggtggcc 3901 accccgggcc accaggctcc ccgggccctg ggacccatgc cagccgaggg ccccgaccgt 3961 cggggcggat tcaggcgcgt tccccgcgga gactgccaca ctccgcggcc cagcgacgcg 4021 gcttgcgcgg cctactgtcc ccccgagctg gtggcggagc tcatcgacaa ccagctgttc 4081 cccgaggcct ggcgcccggc gctcaccttc gatccccagg ccctggccac catcgcggcc 4141 cgctgcagcg gccccccggc ccgggacggc gcgcgcttag gggagctggc ggccagcggc 4201 ccgctgagac ggagggccgc ctggatgcac cagatccccg accccgagga cgtgaaggtg 4261 gtggtcctct actccccgct ccaggacgag gacctgctgg gcggactccc ggcctcccgc 4321 cccggcggct ctcggcgcga gcccctctgg tccgacctca aggggggact ctcggcgctg 4381 ctggcggccc tggggaaccg catcctcacc aagcggtccc acgcctgggc cggcaactgg 4441 accggggccc cggacgtctc ggccctcaac gcccaggggg tcctgctgct gtcgaccggg 4501 gacctggcct tcaccggctg cgtcgagtac ctctgcctgc gcctgggctc cgccaggcgc 4561 aagctcctgg tgctggacgc ggtctccacc gaggattggc cccaggacgg tcccgcgatc 4621 agccagtacc acatctacat gcgggccgcc ctgactccgc gggtcgcctg cgccgtgcgc 4681 tggcccgggg agcgccacct cagccgcgcg gtcctcacct ccagcaccct cttcgggccc 4741 ggactgttcg cgagggccga ggccgcgttc gcgcgcctgt acccggactc tgcgcccctg 4801 aggctgtgcc gctcctccaa cgtggcctac acggtggaca ctcgcgccgg cgagcgcacc 4861 cgcgttcccc tggctccgag ggagtaccgc cagcgcgtcc tgcccgacta cgacggctgc 4921 aaggacatgc gggcccaggc cgagggcctc gggttccacg acccggactt tgaggagggc 4981 gccgcgcaga gccaccgcgc ggccaaccga tggggactcg gggcctggct gcgccccgtg 5041 tacctcgcct gcggccggcg cggcgctggg gccgtggagc cctcggagct tctgatcccc 5101 gagctgctga gcgagttctg ccgggtggcg ctgctggagc ccgacgccga ggccgagccc 5161 ctggtgctgc ccatcaccga ggctccccgc cgccgagccc cgcgggtcga ctgggagccc 5221 gggttcggct ctcgctccac ctcggtcctg cacatggggg ccacggagct gtgcctgccg 5281 gagcccgacg acgagctcga gatcgacggg gccggcgatg tggagctggt ggttgagcac 5341 cccggcccga gccccggcgt ggcccaggcc ctccgccgcg ctcccatcaa gatcgaggtg 5401 gtgtcggacg acgaggacgg aggagactgg tgcaatccgt acctctcctg aacacgatgg 5461 agcgcctccc tgcggccgaa aacaagaaaa atcagtacat ccacaactat gtgtccgccc 5521 agcacaacgc agactccgcc tagactcccg cctccatccg ctgacgctga accccgcccc 5581 gccctctgct gacgcgaaga caaggccctc cccggacgac atgtgaggaa cgaagggggc 5641 gttgtatcta gcagcccacg ttccttattg ctcacatgtc tgcccaatcg gtgggcactt 5701 ccaggctttc ccctatcgct gagtggttgt ttttaataaa gtttttttta aattttgatt 5761 gaccgcgtgg tctttgttta ctgggcgggt tgatgggcgg gttgatgggc gggttgatgg 5821 gcgggttgat gggcgggttg atgggcgggt tgatgggcgg gttgatgggc gggttgatgg 5881 gcgggttgat gggcgggttg atgggcgggt tgatgggcgg gttgatgggc gggttgatgg 5941 gcgggttgat gggcgggttg atgggcgggt tgatgggcgg gttgatgggc gggttgatgg 6001 gcgggttgat gggcgggttg atggttcctg ctcctcccct tcctgctcct ccccttcctg 6061 ctcctcccct tcctgctcct ccccttcctg ctcctcccct tcctgctcct ccccttcctg 6121 ctcctcccct tcctgctcct ccccttcctg ctcctcccct tcctgctcct ccccttcctg 6181 ctcctcccct tcctgctcct ccccttccgc tacgtcacta ccgcctacgt cactaccgga 6241 ctcctcccct tccgcttccg gccacgcccc ttccggtgag ccccagcata gcagtgagcc 6301 ccagcatagc agtgacgtca ctttgacccc cccccttaga ccacgccccc ctattcaaat 6361 gcggggggga gacgcgggct gggggggcca ggctctctct cgggcgcggg cccgtgaccc 6421 ttgaccagat atggcccggg gccaggctct ctctcgggcg cgggcccgtg acccttgacc 6481 agatatggcc cggggccagg ctctctctcg ggcgcgggcc cgtgaccctt gaccagatat 6541 ggcccggggc caggctctct ctcgggcgcg ggcccgtgac ccttgaccag atatggcccg 6601 gggccaggct ctctctcggg cgcgggcccg tgacccttga ccagatatgg cccggggcca 6661 ggctctctct cgggcgcggg cccgtgaccc ttgaccagat atggcccggg gccaggctct 6721 ctctcgggcg cgggcccgtg acccttgacc agatatggcc cggggccagg ctctctctcg 6781 ggcgcgggcc cgtgaccctt gaccagatat ggcccggggc caggctctct ctcgggcgcg 6841 ggcccgtgac ccttgaccag atatggcccg gggccaggct ctctctcggg cgcgggcccg 6901 tgacccttga ccagatatgg cccggggcca ggctctctct cgggcgcggg cccgtgaccc 6961 ttgaccagat atggcccggg gccaggctct ctctcgggcg cgggcccgtg acccttgacc 7021 agatatggcc cggggccagg ctctctctcg ggcgcgggcc cgtgaccctt gaccagatat 7081 ggcccggggc caggctctct ctcgggcgcg ggcccgtgac ccttgaccag atatggcccg 7141 gggccaggct ctctctcggg cgcgggcccg tgacccttga ccagatatgg cccggggcca 7201 ggctctctct cgggcgcggg cccgtgaccc ttgaccagat atggcccggg gccaggctct 7261 ctctcgggcg cgggcccgtg acccttgacc agatatggcc cggggccagg ctctctctcg 7321 ggcgcgggcc cgtgaccctt gaccagatat ggcccggggc caggctctct ctcgggcgcg 7381 ggcccgtgac ccttgaccag atatggcccg gggccaggct ctctctcggg cgcgggcccg 7441 tgacccttga ccagatatgg cccgggtaga gagagactgg gttcagaaga gccagagtgg 7501 gtctgtaaag acaagggagt gggacgcggg tggtgggaag tggctcaaca ccgtggccgg 7561 agatggttgg ggagggggaa aatgggggaa atatagtaaa ctagtttact actggtacta 7621 ttccacggtt atagcatttc taagctggtc cgaggaggag agtagaaagg actcaatgtg 7681 tccatttgtg tgatatatag tctgtgaccc ctagtaacac tactgccatt agtttctccc 7741 cactatatgc tcagcttgtc tataccgcgc tcacactcag gaggttaggt gtgctaatag 7801 gccaatcggg gggggggggg ggtgtggtgg taaatagcgg catcccccct agagcagata 7861 aactggagtt taatagggct agggcagggg gctagggcag ggggctaggg cagggggcta 7921 gggcaggggg ctagggcagg gggctagggc agggggctag ggcagggggc tagggcaggg 7981 ggctagggca gggggctagg gcagggggct agggcagggg gctagggcag ggggctaggg 8041 cagggggcta gggcaggggg ctagggcagg gggctagggc agggggctag ggcagggggc 8101 tagggcaggg ggctagggca gggggctagg gcagggggct agggcagggg gctagggcag 8161 ggggctaggg cagg // LOCUS MLVCASBRE 3335 bp ds-DNA VRL 15-MAR-1989 DEFINITION Murine leukemia virus (Cas-Br-E MuLV), 3' end of proviral genome, encoding pol polyprotein, partial cds, and env polyprotein, complete cds. ACCESSION M14702 KEYWORDS env gene; glycoprotein; pol gene; provirus. SOURCE Murine leukemia virus (isolate pBR-NE-8) proviral DNA, from mouse brain. ORGANISM Murine leukemia virus Viridae; ss-RNA enveloped viruses; Positive strand RNA virus; Retroviridae; Oncovirinae; Type C oncovirus group; Mammalian type C oncoviruses; Murine leukemia viruses. REFERENCE 1 (bases 1 to 3335) AUTHORS Rassart,E., Nelbach,L. and Jolicoeur,P. TITLE Cas-Br-E murine leukemia virus: Sequencing of the paralytogenic region of its genome and derivation of specific probes to study its origin and the structure of its recombinant genomes in leukemic tissues JOURNAL J. Virol. 60, 910-919 (1986) STANDARD simple staff_review FEATURES from to/span description pept < 1 850 pol polyprotein pept 790 2775 env polyprotein matp 2185 2772 Prp15E glycoprotein LTR 2817 3335 3' long terminal repeat rpt 3192 3259 3' LTR R region rpt 2817 2828 inverted terminal repeat copy A rpt 3324 3335 inverted terminal repeat copy B BASE COUNT 851 a 928 c 798 g 758 t ORIGIN 84 bp upstream of HindIII site. 1 gatcgatttc accgaggtaa aacctagatt gtatggctat aagtatcttt tagtttttgt 61 agatactttc tctggctgga tagaagcttt cccaaccaag aaagaaaccg ccaaggtcgt 121 gactaagaaa ctgctagaag agatcttccc taggttcggc atgccgcagg tattgggaac 181 tgacaatggg cctgccttcg tctccaaggt gagtcagaca gtggccgatc tgttggggat 241 tgattggaaa ttacattgtg catacagacc ccaaagctca ggtcaggtag aaagaatgaa 301 taggaccatc aaggagactt taactaaatt aacgcttgca actggctcta gagactgggt 361 cctcctactc cccttagccc tgtaccgagc ccgcaacacg ccgggccccc atggcctcac 421 cccatatgag atcttatatg gggcaccccc gccccttgta aacttccctg accctgacat 481 gaccagagtt actaacagcc cctctctcca agctcactta caggctctct acttagtcca 541 gcacgaagtt tggagaccac tggcggcagc ttaccaagaa caactggacc ggccggtggt 601 gcctcaccct taccgggtcg gcgacaccgt gtgggtccgc cgacatcaaa ccaagaacct 661 agaacctcgc tggaaaggac cttacacagt cctgctgacc acccccaccg ctctcaaagt 721 ggacggcatc tctgcgtggg tacacgccgc tcacgtaaag gcagcaacga cttctccggc 781 cagaacagca tggaaggtcc agcgttctca aaatccccta aagataagac tatcgagaga 841 gccttcctag gggttttggg gatcttattc gtgacaggag ggttagcgag cagagacaac 901 ccccaccagg tatataatat aacttgggaa gtaacaaatg gagaacaaga cactgtgtgg 961 gcagtaaccg gcaaccaccc cttgtggact tggtggccag acctcacacc agacctttgt 1021 atgctggccc tacatggccc aactcattgg ggcctagaca accaccctcc atattcctct 1081 cccccggggc ccccttgttg ttcaggagat gcaggggctg tgtcaggctg tgctagagac 1141 tgtgatgagc ccttgacctc ttactccccc cggtgcaata cagcctggaa tagactgaaa 1201 ctggcccggg taacacatgc acctaaagag ggattttata tctgccctgg gtcacatcgc 1261 cccaggtggg ctcggtcgtg cgggggtcta gacgcctatt attgtgcctc ctgggggtgc 1321 gaaactacag gccgagcagc ctggaaccca acttcatctt gggactatat cacagtaagc 1381 aataatttaa cttcctcaca ggccaccaaa gcctgcaaaa ataatggctg gtgcaacccc 1441 cttgtcatac gattcacggg tccaggaaaa agggccacct cctggactac aggtcatttc 1501 tggggactgc gcctgtacat ctctggacat gacccagggc tcacttttgg gattcggcta 1561 aaagtgacag atctgggacc tagagttcca atagggccaa atcctgtctt gtcagatcag 1621 cgaccgccct cccggcctgt acctgccaga cctcccccac cttcagcctc accttccact 1681 cccaccatac ctccacagca ggggaccggg gacaggttac ttaatctggt ccagggagcc 1741 tacctcacac tcaatatgac tgatcccacc agaacccagg agtgttggtt atgcctagtc 1801 tccgagcctc cgtattatga aggggtggcc gtgttgagag agtacactag tcatgagacg 1861 gcacctgcta actgctcctc cggatcccaa cataagctga ccttatctga ggtaactgga 1921 cagggaagat gtctaggaac ggttcccaaa actcaccagg ctctatgcaa ccgcaccgag 1981 cccaccgtaa gtggttccaa ttacttggtg gctcccgaag gtaccctctg ggcatgcagc 2041 accgggctca ctccctgtct gtctactact gtgctcaact taaccactga ttactgtgtc 2101 ctagttgaac tctggccaaa ggtgacctac cactcccctg actatgtcta tactcagttt 2161 gaaccagggg ccagattccg aagagagccg gtgtcgctga ccctcgccct gctaccagaa 2221 ggtctcacca tgggtggaat tgccgcagga gtagggacag ggacaactgc cctggtcgcc 2281 acccaacagt ttcaacaact tcaggctgct atgcacaacg acctcaagga agttgaaaaa 2341 tcaattacta atctagaaaa gtctctgacc tcgctgtcag aagtggtttt gcagaaccgc 2401 agaggcctag atctactatt tctaaaagag ggaggccttt gcgcggctct aaaagaagag 2461 tgctgctttt atgcagacca cacaggatta gtgagagata gcatggccaa acttagagaa 2521 agactaaacc agagacaaaa attgtttgaa tcaggacaag ggtggtttga aggactgttt 2581 aataggtccc catggttcac aaccctgata tccactatta tgggccctct gatagtactt 2641 ttattaatcc tacttttcgg accctgcatt ctcaatcgat tggtccaatt tgttaaagac 2701 aggatctcag tggtccaggc tctggttttg actcagcaat atcaccagct aaaacccata 2761 gagtacgagc cgtgaataaa ataaaagatt ttatttagtt tccagaaaaa ggggggaatg 2821 aaagacccac catcaggttt agcaagctag cttaagtaac gccatttatt ttgcaaggcc 2881 tggaaaaata ccgaactgag aatagggaag ttcggatcaa ggtcaggaac agaaaaacag 2941 ctgaagttgg gccaaacagg atatctgtgg taagcagttt cggccccggc ccgaggccag 3001 aacagatggt ccccagatat ggcccaatcc tcagcagttt ctagggaccc atcagatgtt 3061 ttcaggctgc cccaaagacc tgaagtgacc ctgtgcctta tttgaactaa ccaatcagct 3121 cgcttctcgc ttcggtttgc gcgcttctgc tccccgagct ctataaaaga gcacacaacc 3181 cctcactcgg cgcgccagtc ctccgataga ctgagtcgcc cgggtacccg tgtatccaat 3241 aaaccctctt gctgttgcat ccgactggtg gtctcgctgt tccttgggag ggtctcctca 3301 gagtgattga ctacccgcct cgggggtctt tcatt // LOCUS PPMCG 5089 bp ds-DNA VRL 15-MAR-1990 DEFINITION Monkey B-lymphotropic papovavirus complete genome. ACCESSION M30540 KEYWORDS complete genome; large T-antigen; small T-antigen; viral protein. SOURCE Monkey C-lymphotropic papovavirus DNA, clone pL02, passed in human B-lymphoblastoid cell line BJA-B. ORGANISM Monkey B-lymphotropic papovavirus Viridae; ds-DNA nonenveloped viruses; Papovaviridae; Papillomavirus. REFERENCE 1 (bases 1 to 5089) AUTHORS Furuno,A., Kanda,T. and Yoshiike,K. TITLE Monkey B-lymphotropic papovavirus genome: The entire DNA sequence and variable regions JOURNAL Jpn J Med Sci Biol 39, 151-161 (1986) STANDARD simple staff_entry FEATURES from to/span description pept 152 721 small T-antigen pept 152 388 large T antigen, exon 1 744 2600 large T antigen, exon 2 pept 3770 2664 (c) VP-1 pept 4362 3649 (c) VP-3 pept 4719 3649 (c) VP-2 rpt 4724 4783 repeat copy A rpt 4784 4843 repeat copy B BASE COUNT 1516 a 1015 c 1054 g 1504 t ORIGIN 695 bp upstream of HindIII site. 1 cccctagcct cctcctcttc tttcaacaaa gagagaggct ttggaggctt ttccaaaaac 61 tcattaggta agctgccctg agatattttc ccatataatt aagtattaag gccacctagg 121 taattaaatt tattccattt tattcacagc catggaccaa acgctgtcta aggaggagag 181 aaatgagctt atggatttat tgcaaataac tagagctgca tggggaaatc tttctatgat 241 gaaaaaagcc tataaaaatg tctccaagct ctaccatcct gataaaggag gagattcagc 301 taaaatgcag cggctcaatg aattatttca aagggtccag gttaccttga tggagataag 361 gagtcaatgt ggatcctctt cttcccaggt agcttggttt ttttgggatg agaattttag 421 aaccctagga gcttttctag gagaaaaatt taatgaaaaa attattggac tctaccctac 481 ttgcactaaa tttgtaagag ctaattgtaa ttgtatagta tgtctgctaa aaaagcagca 541 tgcaggtaca aaaaaaaatt taaaaaagcc atgtttagtc tggggagaat gttggtgcta 601 caaatgttat ttagtatggt ttggctttcc tgaggatttc acctcttttc gctactggac 661 ccttcttatg gcaaatatgg atttatctat gctcaagctt tggacggaac tgggattcta 721 atgtaagtat ttttattttc tagggttact tcagtgagga cttctacttt gggcctacca 781 cctttcaata tagccctatg gatcgagatg cagttcggga ggatcttcca aatccagggg 841 aagggtcttg ggggaaatgg tggagagagt ttgttaatag gcaatgttgt gatgatttgt 901 tttgctcaga aacaatgagt agttcaagtg atgaagacac ccccccagcg gcgcaacctc 961 ctcctcctcc tgccccttcc ccagaagaag aggatgaaat agaatttgta gaagagaccc 1021 caagttcctg tgatggatct tcttctcaaa gctcctacac ctgcaccccc cctaaaagga 1081 agaaaactga agaaaagaag ccagatgatt ttcctgtatg tttatattcc tttttaagtc 1141 atgcaattta tagtaataag actatgaata gttttttaat atatactact ttggagaaag 1201 ccaggcaact gtataaaact gtggaaaaat ctaaaattgt agttgatttt aaggctagtt 1261 tttcttatca ggatgaggaa ggggaggggt gtttgctgtt tttaattact ttaggaaaac 1321 atagagtgtc tgctgttaag catttttgtg tatcccaatg tacttttagt tttattcatt 1381 gtaaagctgt tgttaaacct ctagagttat ataagacctt aagtaaacca ccttttaagt 1441 tgttggaaga gaacaaaccg ggtgtatcca tgtttgagtt ccaagaggag aaggaacagt 1501 ctgttaattg gcaagaaata tgtaactttg caaatgaggc caacatttct gatgtcttat 1561 tgttgcttgg catctacata gattttgcag tggaacctgg caaatgtggc aagtgtgaaa 1621 aaaagcagca caaattccac tataattatc acaaagcaca tcatgccaat gcttgcctct 1681 tcttggagag tagagcccaa aaaaacattt gccaacaagc agttgaccag gtcctagcag 1741 ctaaaaggtt aaaattagta gaatgcagta gaattgaatt attagaagag agatttttgc 1801 agctttttga tgaaatggat gacttcctgc atggtgagat agaaattcta agatggatgg 1861 cgggtgtggc ctggtacacc attttactag ataattcttg ggatgttttt caaaatatcc 1921 tacaattaat aactaccagc caacccaaaa aaaggaatgt cctgataaag ggaccaatta 1981 acagtggtaa aactactttg gcttctgctt tcatgcattt ttttgatggc aaagctctaa 2041 atataaattg tcctgcagat aaactgtcct ttgaacttgg ctgtgctatt gatcaattct 2101 gtgttttgtt agatgatgtg aagggccaaa taaccttaaa taagcacttg caaccaggtc 2161 aaggggtaaa taatcttgat aacctgagag atcatcttga tggaacaatt aaagttaatt 2221 tagaaaagaa acatgtaaac aaaaggagtc aaatttttcc cccggttatt atgactatga 2281 atgagtactt gttgcctcct accataggag ttagatttgc tcttcatctg catttaaaac 2341 ctaaggctta tcttaaacaa agcctggaaa aaagtgacct ggtagccaaa agaatattaa 2401 attcaggata tactattttg ctccttttgt tatggtacaa tcctgtggat tcttttactc 2461 caaaagtgca agaaaaagtg gtgcaatgga aagaaaccct tgaaaaatat gtgtcaatta 2521 ctcagtttgg taatattcag caaaatatca ttgatggaaa agaccccttg catggaattg 2581 taattgaaga acaaatgtaa ataatgtaat catcattttc tgttttattt ctggtacaat 2641 aaagtcttac aatgcattca gcctcacata tcatttgaga cagggagaac agtctggttc 2701 tgacaaaatt tatcaacata tctattaagg tcagggtccc ctgggagtcc ttctgttccc 2761 tcaaatattc tgacttcttc cacttgtcct gagacccctt ccattggttg tccctgaatt 2821 tggggcataa gaccagagaa gaagctattt agaagagagc tgacaggata aggattttta 2881 acaatccttt tcctgagggt cacattgaaa tatctgggaa gccccctcca actttgggtt 2941 tcagaatagt tggtatgaac tccagcaata tcagcacaag acagaaacag tttgtcccct 3001 ttacaaagag gcccaactcc attttcatcc agcagcacag ttgtgacaga attagtgaac 3061 tgcataactg gtggggtggt ggctccccct gtaaaactcc cataatatct agtattttca 3121 tttttagagg ggtcagggca ccacacctcc actgggtact ttccatcttt atccagcaag 3181 gctttggcct ttggatctag gccttggttt cctggtttca tatttttaat agcaactaca 3241 tcatcaggat aggtagctgt agagctagca actaggcctt ggagttccag gggctctcct 3301 ccaacagcaa acatgtgata ggtagtgccc tgcacgggga cacaccctga ggatgaccca 3361 tagatgtact ttcctccctg gtgcaaatta actagtgagg aaattccaac aacttcagtc 3421 tttacagaca ctgcttccca catcaaaatg gtgtcacagg tcatgtcttc atttaggagg 3481 gggagtttaa taacagctac tgaataacaa ggaagggtgc ctttgttggg ggtgtcagag 3541 gccttactga aagcagtatt tatagaatta ctatatccat acaagtcctc agaaggaata 3601 ttatttccca ttctaggatt aagataggcc tcaatttggg taatagcatc aggccctgtt 3661 cttacttcta gcacctctac tcctcctttt actaggagcc tggggacggg agcgggaata 3721 gggcatgttt ttttgcatgc tccgtcttgc ctttttcttt gaggggccat cttcttcttt 3781 ctccaattta ttaagctcca cttcccaggt gggagttata tcaccatata aacctagaat 3841 tagaggaagc atccagtctt gagttactct ttggtgtgcc cctcctggag cagtataatg 3901 ttctacatac tgaccagacc taggttcatc ataacccagt tctcttcttg ccctttgacc 3961 ttccctattt tcttgatatt caaaatcagc tctacttgga ggaggctctc ctctgtttct 4021 atattcttgt cttagctgga tgggatttct agcaggtaga tacctgtaat aatcttggac 4081 actggaataa atatggacag gcccactggt caaagcccac ctagcatttt cagcaatttg 4141 agccaaggtg tgactcagtt cattggtact tctgacagcc acagcccttg tagcttgacc 4201 aatttgcaaa gtggcctgtc tcatcaaatg cctccatact tctctgccca cagcatgaaa 4261 caatgattca ccccagtcaa gtacagcatt caggtagtag ctaaaagagg taaatcccgg 4321 gaacaaataa tcaacttgag gaaaccaagg cacaagagcc atattaacaa ctggtacttc 4381 tttggagtat ccaaaagttg tcactcctgc agcaaccaca gcactggcac ctgaaacagt 4441 ttgaaaaaaa actcctattc ctatggcatt gttgagagct gttgggatag cacttaggag 4501 ggaaaactgc tctgttgtaa gtccagtaag agacaaggcc tctagagtac taagtccagc 4561 aagatccact gcttctattt caatgagcca ggctgcctca gtacttacag cagcaaaagc 4621 ctccccagta aggatagcat caactgtaaa tccagtactt aagcttaatt cagcagcaat 4681 ttcagaaata ttaaacaaaa gagataatac accccccatt tcttaccaaa tggcgggcta 4741 atttaaaaaa ggcgggcttc ttggcggcgc tgatgtaaat gagtaccaaa tggcgggcta 4801 atttaaaaaa ggcgggcttc ttggcggcgc tgatgtaaat gagtaacttc ctctacttga 4861 ggttgctaag taggttgcta agcgccacct agcaactaga ccgcagaaca gttgtttgtc 4921 acttatcagg aaatgtcaca aaaagtcccc gggcggtgcg gtgagcgagt ctaaccacag 4981 cttcctctat cagttgattc tgcaaaaaca acctgttatt gaagtctgca agtctgcaaa 5041 atcactatgg caaccctagt tttttttacc tggtataaga ggccagggg // LOCUS PVYAAA 9704 bp ss-RNA VRL 15-MAR-1990 DEFINITION Potato virus Y (N strain) genomic RNA, complete. ACCESSION D00441 KEYWORDS 38K protein; HC protein; NIa protein; NIb protein; Vpg protein; capsid protein; genome-linked protein; helper component protein; inclusion protein; polymerase; polyprotein; protease. SOURCE Potato virus Y (N strain), 5'end of genomic RNA and cDNA to genomic RNA. ORGANISM Potato virus Y Viridae; ss-RNA nonenveloped viruses; Rod-shaped ss-RNA viruses; Potyvirus. REFERENCE 1 (bases 1 to 9704) AUTHORS Robaglia,C., Durand-Tardif,M., Tronchet,M., Boudazin,G., Astier-Manifacier,S. and Casse-Delbart,F. TITLE Nucleotide sequence of potato virus Y (N strain) Genomic RNA JOURNAL J. Gen. Virol. 70, 935-947 (1989) STANDARD full staff_entry COMMENT Most of the sequence was obtained from a shotgun cloning procedure. The 150 nucleotides at the 5'end were directly sequenced on the virul RNA. FEATURES from to/span description virion 1 9704 genomic RNA pept 185 9376 polyproteins matp 185 1009 putative extreme 5'protein matp 1010 2656 putative helper component protein, HC matp 2657 3655 putative 38K protein matp 3656 5557 cytoplasmic inclusion protein matp 5558 5713 putative genome linked protein, Vpg matp 5714 7009 putative nuclear inclusion protein NIa matp 7010 8572 putative nuclear inclusion protein NIb matp 8573 9376 putative capsid protein BASE COUNT 3004 a 1818 c 2273 g 2609 t ORIGIN putative 5'end of RNA genome. 1 aattaaaaca actcaataca acataagaaa aacaacgcaa aaacactcat aaacgctcat 61 tctcactcaa gcaacttgct aagtttcagt ttaaatcatt tccttgcaat tctctagaac 121 aatattggaa accatttcaa ctcaacaagc aatttcatca cttccaacca atttcagatc 181 ctcaatggca acttacatgt caacaatctg ttttggttcg tttgaatgca agctaccata 241 ctcaccagcc tcttgcgagc atattgtgaa ggaacgagaa gtgccggctt ccgttgatcc 301 tttcgcagat ctggaaacac aacttagtgc acgattgctc aagcaaaaat atgctactgt 361 tcgtgtgctc aaaaacggta cttttacgta ccgatacaag actgatgccc agataatgcg 421 cattcagaag aaactggaga ggaaggatag ggaagaatat cacttccaaa tggccgctcc 481 tagtattgtg tcaaaaatta ctatagctgg cggagatcct ccatcaaagt ctgagccaca 541 agcaccaaga gggatcattc atacaactcc aaggatgcgt aaagtcaaga cacgccccat 601 aataaagttg acagaaggcc agatgaatca cctcattaag cagataaaac agattatgtc 661 ggagaaaaga gggtctgtcc acttaattag taagaaaacc actcatgttc aatataagaa 721 gatacttggt gcatactccg cagcggttcg aactgcacat atgatgggtt tgcgacggag 781 agtggactcc gatgtgatat gtggacagtt ggacttttgc aacgtctcgc tcggacggac 841 aaatggttcc aatcaagtcc gcactatcaa catacgaagg ggtgatagtg gagtcatctt 901 gaacacaaaa agcctcaaag gccactttgg tagaagttca ggaggcttgt tcatagtgcg 961 tggatcacac gaagggaaat tgtatgatgc acgttctaga gttactcaga gtattttaaa 1021 ctcaatgatc cagttttcga atgccgacaa tttttggaag ggtctggacg gtaattgggc 1081 acgaatgaga tatccttcgg atcacacatg tgtagctggt ttacctgtcg aagattgtgg 1141 tagggtagct gcattgatgg cacacagtat ccttccgtgc tataagataa cttgccccac 1201 ctgtgctcaa cagtatgcca gcttgccagt tagcgatctg tttaagctat tgcataaaca 1261 tgcaagagat ggtttgaatc gattgggagc ggataaagac cggtttatac atgttaataa 1321 gttcttgata gcgttagagc atctaactga accggtggac ctgaatctcg agcttttcaa 1381 tgagatattt aaatccatag gggagaaaca gcaagcaccg ttcaagaatt taaatgtctt 1441 aaataatttc ttcctgaaag gaaaagaaaa tacagctcat gaatggcagg tagctcaatt 1501 gagtttgctc gaattagcaa ggttccagaa gaacagaact gataacatca agaaaggtga 1561 tatatctttc ttcagaaata aattatctgc caaggcaaac tggaatctgt atttgtcgtg 1621 cgacaaccag ctggataaaa atgcaaactt cctctgggga caaagggagt atcatgctaa 1681 gcggtttttc tcaaacttct ttgaggaaat tgatccagca aagggatact cagcatatga 1741 aatccgcaag catccaagtg gaacaaggaa gctctcaatt ggtaacttag ttgtcccact 1801 tgatttagct gagtttaggc agaagatgaa aggtgactat aggaaacaac caggggtcag 1861 caaaaagtgc acgagttcga aagatggtaa ttatgtgtat ccctgttgtt gcacaacact 1921 tgatgatggt tcagccattg aatcaacatt ctatccacca actaaaaagc accttgtaat 1981 tggcaatagt ggtgaccaaa aatttgttga tttaccaaaa ggggattcgg agatgttata 2041 cattgccaag cagggttatt gttatattaa cgtgtttctt gcaatgctga ttaacattag 2101 cgaggaggat gcaaaggatt tcacaaagaa agttcgcgac atgtgtgtgc caaagcttgg 2161 aacctggcca actatgatgg atttggcgac cacttgtgct caaatgagaa tattctatcc 2221 tgacgtacat gatgcagaat tgcccagaat attggttgac catgacactc aaacgtgtca 2281 tgtggttgac tcatttggct cgcagacaac tggatatcat attctaaaag catccagcgt 2341 gtctcaactt atcttgtttg caaatgatga attagaatct gatataaaac attatagagt 2401 tggtggtgtt cctaatgcta gccctgaact tgggtccaca atatcacctt tcagagaagg 2461 aggagttata atgtctgagt cggcagcgct gaaactgctt ttgaagggaa tttttagacc 2521 taaggtgatg agacagttgc tgttagatga gccttacctg ttgattctat caatactatc 2581 ccctggcata ctgatggcta tgtataataa tgggattttt gaacttgcgg tgaggttgtg 2641 gattaatgag aaacaatcca tagctatgat agcatcgcta ctatcagctt tagccctacg 2701 agtgtcagcg gcagaaacac tcgtcgcaca gaggattata attgatgctg cagctacaga 2761 cctccttgat gctacgtgtg atgggttcaa cctacatcta acgtacccca ctgcattgat 2821 ggtgttgcaa gttgttaaga atagaaatga atgtgatgat accctattca aggcgggttt 2881 tccaagttac aacacgagcg tcgtacagat tatggaaaaa aattatctaa atctcttgaa 2941 cgatgcttgg aaagatttaa cttggcgaga aaattatccg caacatggta ctcatacaga 3001 gcaaaacgct ctatccactc ggtacataaa acccacagaa aaggcagatt tgaaagggtt 3061 atacaacata tcaccacaag cgttcttggg ccgaagcgcc caggtggtca aaggcactgc 3121 ctcaggattg agcgagcgat ttaataatta tttcaatact aagtgtgtaa atatttcatc 3181 ctttttcatt cgtagaatct ttaggcgttt gccaaccttt gtcacttttg ttaactcatt 3241 attagttatt agtatgttaa ccagcgtagt ggcagtgtgt caggcaataa ttttagatca 3301 gaggaagtat aggagagaaa tcgagttgat gcagatagag aagaatgaga ttgtctgcat 3361 ggagctatat gcaagtttac agcgcaaact tgaacgcgat ttcacatggg atgagtacat 3421 tgagtatttg aagtcagtaa accctcagat agttcagttt gctcaagcgc agatggaaga 3481 atatgatgtg cgacaccagc gttccacacc agttgttaaa aatttggaac aagtggtagc 3541 atttatggct ttagtcatca tggtgtttga tgctgaaagg agtgattgcg tgttcaaaac 3601 tctcaataaa tttaagggtg tcctttcctc actggattat gaagttagac atcagtcctt 3661 agacgatgtg atcaagaatt ttgatgagag gaatgagatt attgattttg aattgagtga 3721 ggacacaatt cgaacttcat cagtgctaga tacaaagttt agtgattggt gggatcgaca 3781 aatccagatg ggacatacac ttccacatta cagaactgag gggcacttca tggaatttac 3841 aagagcaact gctgttcaag tggctaatga cattgcccat agcgaacacc tagacttttt 3901 agtacgggga gctgttgggt ctggaaagtc aactgggttg cctgttcatc ttagtgtggc 3961 cggatctgtg cttttaattg aaccaacgcg accactagcg gagaacgttt tcaaacagct 4021 atctagtgaa ccattcttca agaagccaac actgcgtatg cgtggaaata gtatatttgg 4081 ctcttctcca atctccgtca tgactagcgg atttgcgcta cactacttcg ccaataatcg 4141 ctctcaatta gctcagttca actttgtaat atttgatgag tgtcatgttc tggatccttc 4201 cgcgatggcg ttccgcagtc tgctgagtgt ttatcatcaa gcatgcaaag tattaaaagt 4261 gtcagctact ccagtgggaa gagaggttga attcacaaca cagcagccag tcaagttaat 4321 agtggaggac acagtgtctt tccaatcatt tgttgatgca caaggttcta aaactaatgc 4381 tgatgttgtt cagtttggtt caaacgtact tgtgtacgtg tcgagctaca atgaagttga 4441 caccttggcc aagctcctaa cagacaagaa tatgatggtc acaaaggttg atggcagaac 4501 aatgaagcac ggttgcctag aaattgtcac aaaaggaacc agtgcgagac cacattttgt 4561 tgtagcaacc aacataattg agaatggagt gactttggac atagacgtgg ttgtggactt 4621 tgggttgaaa gtctcaccgt tcttggacat tgacaatagg agcattgctt acaataaggt 4681 gagtgttagc tatggtgaga gaattcaaag gctgggtcgt gttggacgct tcaagaaagg 4741 agtagcattg cgcattggac acactgaaaa gggaattatt gaaattccaa gcatggtcgc 4801 tactgaggcg gctcttgctt gctttgcata taacttgcca gtgatgacag gaggcgtttc 4861 aactagtctg attggcaatt gtactgtgcg ccaggttaaa acaatgcagc aatttgaatt 4921 gagtcccttc tttatccaga atttcgttgc tcatgatgga tcaatgcatc ctgtcataca 4981 tgacattctt aaaaagtata aacttcgaga ttgtatgacg cctttgtgcg atcagtctat 5041 accatacagg gcatcgagta cttggttatc ggttagtgaa tatgagcgac ttggagtggc 5101 cttagaaatt ccaaagcaag tcaaaattgc attccatatc aaagagatcc ctcctaagct 5161 ccacgaaatg ctttgggaaa cggttgtcaa gtacaaagac gtttgcttat ttccaagcat 5221 tcgagcatcg tccatcagca aaatcgcata cacattgcgt acagatctct tcgccatccc 5281 aagaactcta atattggtgg agagattgct tgaagaggag cgagtgaagc agagccaatt 5341 cagaagtctc atcgatgaag ggtgctcaag catgttttca attgttaact taaccaacac 5401 tctcagagct agatatgcaa aagattacac cgcagagaac atacaaaaac ttgagaaggt 5461 gagaagtcaa ctaaaagaat tctcaaattt ggatggttct gcatgtgagg agaatttaat 5521 aaagaggtat gagtcgttgc agttcgttca tcaccaagct gcgacgtcac ttgcaaagga 5581 tctcaagttg aaggggattt ggaacaagtc attagtggct aaagacttga tcatagcagg 5641 cgctgttgca attggtggaa taggactcat atatagttgg ttcacacaat cagttgagac 5701 tgtgtctcat caagggaaaa ataaatccaa aagaatccaa gccttgaagt ttcgccatgc 5761 tcgtgacaaa agggctggct ttgaaattga caacaatgat gacacaatag aggaattctt 5821 cggatctgca tacaggaaaa agggaaaagg taaaggtacc acagttggta tgggtaagtc 5881 aagcaggagg ttcatcaaca tgtatgggtt tgatccaaca gagtactcat tcatccaatt 5941 cgttgatcca ctcactgggc ggcaaataga agaaaatgtc tatgctgaca ttagagatat 6001 tcaagagaga tttagtgaag tgcgaaagaa aatggttgag aatgatgaca ttgaaatgca 6061 agccttgggt agtaacacga ccatacatgc atacttcagg aaagattggt gtgataaagc 6121 tttgaagatt gatttaatgc cacataaccc actcaaagtt tgtgacaaaa caaatggcat 6181 tgccaaattt cctgagagag agctcgaact aaggcagact gggccagctg tagaagtcga 6241 tgtgaaggac ataccagcac aggaggtgga gcatgaagct aaatcgctca tgagaggctt 6301 gagagacttc aacccaattg cccaaacagt ttgtaggctg aaagtatctg ttgaatatgg 6361 ggcatcagag atgtacggtt ttggatttgg agcatacata gtagcgaacc accatttatt 6421 taggagttac aatggttcca tggaggtgca atccatgcac ggtacattca gggtgaagaa 6481 tctacacagt ttgagcgttc tgccaattaa aggtagggac atcatcctca tcaaaatgcc 6541 gaaagatttc cctgtctttc cacagaaatt gcatttccga gctcctacac agaatgaaag 6601 aatttgttta gttggaacca acttccaaga gaagtatgct tcgtcgatca tcacagaaac 6661 aagcactact tacaatatac caggcagcac attctggaag cattggattg aaacagataa 6721 tggacattgt ggactaccag tggtgagcac cgccgatgga tgtatagtcg gaattcacag 6781 tctggcaaac aatgcacaca ccacgaacta ctactcagcc ttcgatgaag attttgaaag 6841 caagtacctc cgaaccaatg agcacaatga atgggtcaag tcttgggttt ataatccaga 6901 cacagtgttg tggggcccgt tgaaacttaa agacagcact cccaaagggt tattcaaaac 6961 aacaaagctt gtgcaagatc taatcgatca tgatgtagtg gtggagcaag ctaagcattc 7021 tgcatggatg tttgaagcct tgacaggaaa tttgcaagct gtcgcaacaa tgaagagcca 7081 attagtaacc aagcatgtag ttaaaggaga gtgtcgacac ttcacagaat ttctgactgt 7141 ggatgcagag gcagaggcag aggcattctt caggcctttg atggatgcgt atgggaaaag 7201 cttgctaaat agagatgcgt acatcaagga cataatgaag tattcaaaac ctatagatgt 7261 tggtgtcgtg gatcggatgc atttgaggaa gccatcaata gggttatcat ctacctgcaa 7321 tgtgcacggc ttcaagaagt gtgcatatgt cactgatgag caagaaattt tcaaagcgct 7381 caacatgaaa gctgcagtcg gagccagtta tgggtgcaaa aagaaagact attttgagca 7441 tttcactgat gcagataagg aagaaatagt catgcaaagc tgtctgcgat tgtataaagg 7501 tttgcttggc atttggaacg gatcattgaa ggcagagctc cggtgtaagg agaagatact 7561 tgcaaataag acgaggacgt tcactgctgc acctctagac actttgctgg gtggtaaagt 7621 gtgtgttgat gacttcaata atcaatttta ttcaaagaat attgaatgct gttggacagt 7681 tgggatgact aagttttatg gtggttggga taaactgctt cggcgtttac ctgagaattg 7741 ggtatactgt gatgctgatg gctcacagtt tgatagttca ctaactccat acctaatcaa 7801 tgctgttctc accatcagaa gcacatacat ggaagactgg gatgtggggt tgcagatgct 7861 gcgcaattta tacactgaga ttgtttacac accaatttca actccagatg gaacaattgt 7921 caagaagttt agaggtaata atagtggtca accttctacc gttgtggata attctctcat 7981 ggttgtcctt gctatgcatt acgctctcat taaggagtgc gttgagtttg aagaaatcga 8041 cagcacgtgt gtattctttg ttaatggtga tgacttattg attgctgtga atccggagaa 8101 agagagcatt ctcgatagaa tgtcacaaca tttctcagat cttggtttga actatgattt 8161 ttcgtcgaga acaagaagga aggaggaatt gtggttcatg tcccatagag gcctgctaat 8221 cgagggtatg tacgtgccaa agcttgaaga agagagaatt gtatccattc tgcaatggga 8281 tagagctgat ctgccagagc acagattaga agcgatttgc gcagctatga tagagtcctg 8341 gggttattct gaactaacac accaaatcag gagattctac tcatggttat tgcaacagca 8401 accttttgca acaatagcgc aggaagggaa ggctccttat atagcaagca tggcactaag 8461 gaaactgtat atggataggg ctgtggatga ggaagagcta agagccttca ctgaaatgat 8521 ggtcgcatta gatgatgagt ttgagcttga ctcttatgaa gtacaccatc aagcaaatga 8581 cacaattgat gcaggaggaa gcaacaagaa agatgcaaaa ccagagcagg gcagcatcca 8641 gccaaacccg aacaaaggaa aggataagga tgttaatgca ggcacatctg ggacacatac 8701 tgtgccgaga atcaaggcta tcacgtccaa aatgagaatg cccacaagca agggagcaac 8761 cgtgccaaac ttagaacatt tgcttgagta tgctccacaa caaattgata tttcaaatac 8821 tcgggcaact caatcacagt ttgatacgtg gtatgaggca gtgcggatgg catacgacat 8881 aggagaaact gagatgccaa ctgtgatgaa tgggcttatg gtttggtgca ttgaaaatgg 8941 aacctcgcca aatgtcaacg gagtttgggt tatgatggat gggaatgaac aagttgagta 9001 cccgttgaaa ccaatcgttg agaatgcaaa accaaccctt aggcaaatca tggcacattt 9061 ctcagatgtt gcagaagcgt atatagaaat gcgcaacaaa aaggaaccat atatgccacg 9121 atatggttta attcgaaatc tgcgggatat gggtttagcg cgttatgcct ttgactttta 9181 tgaggtcaca tcacgaacac cagtgagggc tagggaagcg cacattcaaa tgaaggccgc 9241 agcattgaaa tcagcccaac ctcgactttt cgggttggac ggtggcatca gtacacaaga 9301 ggagaacaca gagaggcaca ccaccgagga tgtctctcca agtatgcata ctctacttgg 9361 agtcaagaac atgtgatgta gtgtctctcc ggacgatata taagtattta catatgcagt 9421 aagtattttg gcttttcctg tactactttt atcataatta ataatcgttt gaatattact 9481 ggcagatagg ggtggtatag cgattccgtc gttgttagtg accttagctg tcggttctgt 9541 attattaagt cttagataaa aagtgccggg ttgttgttgt gtgactgatc tatcgattag 9601 gtgatgctgt gattctgtca tagcagtgac tatgtctgga tttagttact tgggtgatgc 9661 tgtgattctg tcatagcagt gactgtaaac ttcaatcagg agac // LOCUS ROBTRFVP2 2687 bp ss-RNA VRL 15-MAR-1990 DEFINITION Bovine rotavirus mRNA for RNA binding protein VP2. ACCESSION X14057 X14507 KEYWORDS RNA binding protein. SOURCE Bovine rotavirus. ORGANISM Bovine rotavirus Viridae; ds-RNA nonenveloped viruses; Reoviridae. REFERENCE 1 (bases 1 to 2687; enum. 1 to 2687) AUTHORS Cohen,J. TITLE ; JOURNAL Unpublished (1989) see COMMENT for author address STANDARD simple automatic REFERENCE 2 (bases 1 to 2687; enum. 1 to 2687) AUTHORS Kumar,A., Charpilienne,A. and Cohen,J. TITLE Nucleotide sequence of the gene encoding for the RNA binding protein (VP2) of RF bovine rotavirus JOURNAL Nucleic Acids Res. 17, 2126-2126 (1989) STANDARD simple automatic COMMENT *source: strain=RF; Data kindly reviewed (21-APR-1989) by Cohen J. [1] Author address Cohen J., INRA, Station de Virologie et d'Immunologie Moleculaires , INRA, CRJ, Domaine de Vilvert, 78350 Jouy en Josas, France. Submitted (16-JAN-1989) on tape to the EMBL data library FEATURES from to/span description pept 17 2659 VP2 protein (AA 1-880) site 1622 1637 leucine zipper (AA 536-557) site 2009 2074 leucine zipper (AA 665-686) BASE COUNT 997 a 425 c 520 g 745 t ORIGIN 1 ggctattaaa ggttcaatgg cgtacaggaa acgtggagcg cgccgtgagg cgaatataaa 61 taataatgac cgaatgcaag agaaagatga cgagaaacaa gatcaaaaca atagaatgca 121 gttgtctgat aaagtacttt caaagaaaga ggaagtcgta accgacagtc aagaagaaat 181 taaaattgct gatgaagtga agaaatcgac gaaagaagaa tctaaacaat tgcttgaagt 241 tttgaaaaca aaagaagagc accaaaaaga gatacaatat gaaattttgc aaaaaacgat 301 accaacattt gaaccaaaag agtcaatatt gaaaaaattg gaggatatca aaccggaaca 361 agcgaagaag cagactaagc tatttagaat atttgaaccg agacagctac caatttatag 421 agcgaatggt gaaaaagagt tgcgtaacag atggtattgg aagctgaaga aagatacttt 481 accagatgga gattatgatg ttagagaata ctttctaaat ttgtatgatc aggttcttac 541 tgaaatgcca gattatttac tattaaaaga tatggcagtt gaaaataaaa attcgagaga 601 tgccggtaaa gttgttgatt ctgaaacagc aagtatctgt gatgctatat ttcaagatga 661 ggaaacagaa ggtgcagtga gacgattcat tgcggagatg agacagcgcg tacaagctga 721 cagaaacgtt gtcaattacc catcaatatt gcatccaata gattacgctt ttaatgagta 781 ttttttgcaa caccaattag ttgaaccatt gaataatgat ataatattca attacattcc 841 tgaaaggata aggaatgacg ttaactatat acttaatatg gacagaaatc tgccatcaac 901 agctagatat ataagaccta atttactaca agacagactg aatttgcatg acaattttga 961 atccttgtgg gatacaataa caacttcaaa ctatattctg gcaagatcgg tagtaccaga 1021 tttaaaggaa ttagtttcaa ccgaagcgca aattcaaaaa atgtcacaag acttgcaact 1081 agaagcatta acaatacagt cagaaacgca gtttttaaca ggtataaact cacaagcagc 1141 aaatgactgt ttcaaaactc tgattgcagc aatgttaagt caacgaacca tgtcgcttga 1201 tttcgtgact acaaattata tgtcattaat ttcaggcatg tggttactaa ctgtagtgcc 1261 aaatgacatg ttcataaggg aatcattggt tgcatgtcaa ctggctatag tgaatacaat 1321 aatatatcca gcgttcggaa tgcaacgaat gcattataga aacggagacc cacaaagacc 1381 atttcagata gcagaacaac aaatacaaaa ttttcaagta gcgaattggc tgcattttgt 1441 caataacaat caatttagac aagtagttat tgatggtgta ttgaatcagg tgctgaatga 1501 caatattaga aatggacatg tcattaatca attgatggaa gctttaatgc aactatcacg 1561 acaacagttt ccaacaatgc ctgttgatta taagaggtca atccagcgtg gaatattatt 1621 gctatcaaat aggcttggtc aattagttga tttaactagg ttattagctt acaactacga 1681 aacactaatg gcatgtgtta cgatgaatat gcaacatgtt cagactttga caacagaaaa 1741 attacagtta acttcagtca catcgttgtg tatgcttatt ggaaatgcaa ccgttatacc 1801 cagcccgcag acattgtttc actattataa tgttaatgtt aattttcatt caaattataa 1861 tgaaagaatt aatgatgcag tggccataat aactggagct aatagactaa atttatatca 1921 gaaaaagatg aaggcaatag ttgaagattt tttaaaaaga ttacatattt tcgatgtagc 1981 tagagttcca gatgatcaaa tgtatagatt aagggataga ctacgactat tgccagtaga 2041 agtaagacga ttggatattt ttaatttgat actgatgaac atggatcaga tagaacgcgc 2101 atcagataaa attgcgcaag gtgttattat tgcgtaccgc gatatgcaat tggaaagaga 2161 cgaaatgtat ggctacgtga atatagctag aaatttagat gggttccagc aaataaacct 2221 agaagaattg atgagaacag gcgattatgc acaaataact aacatgctct tgaataatca 2281 accagtagcg ctagttggag ctcttccatt tgttacagac tcgtcagtca tatcgttgat 2341 agcgaacgtt gacgctacag tttttgccca aatagttaaa ttacggaaag ttgatacctt 2401 gaaaccaata ttgtataaaa taaattcaga ttcgaatgac ttttacctag ttgccaacta 2461 tgattgggtg cctacttcaa ccacaaaagt atataagcaa gttccacagc aatttgattt 2521 cagaaattcg atgcatatgt taacatcaaa tcttactttc actgtttact ctgatctgct 2581 tgcattcgta tcggccgata cagtagaacc tataaatgca gttgcatttg ataatatgcg 2641 catcatgaac gagttgtaaa cgccaacccc actgtggaga tatgacc // LOCUS VACH3K 4536 bp ds-DNA VRL 15-DEC-1989 DEFINITION Vaccinia virus, HindIII K fragment. ACCESSION D00382 KEYWORDS nonessential gene; serine protease inhibitors. SOURCE Vaccinia virus HindIII K fragment originally from vaccinia virus strain WR, cloned in pBR322, was a gift from R. Wittek, transformed into Escherichia coli strain TG1. ORGANISM Vaccinia virus Viridae; ds-DNA enveloped viruses; Poxviridae; Orthopoxvirus. REFERENCE 1 (bases 1 to 4536) AUTHORS Boursnell,M.E.G., Foulds,I.J., Campbell,J.I. and Binns,M.M. TITLE Non-essential genes in the vaccinia virus HindIII K fragment: a gene related to serine protease inhibitors and a gene related to the 37K vaccinia virus major envelope antigen JOURNAL J. Gen. Virol. 69, 2995-3003 (1988) STANDARD full staff_entry COMMENT One gene, predicted to encode a 42.2K protein, is highly related to the family of serine protease inhibitors. It shows approximately 25% identity to human antithrombin III and 19% identity to the cowpox virus 38K protein gene which is also related to serine protease inhibitors. The product of another gene shows a similar high level of identity to the 37K vaccinia virus major envelope antigen. The existance of viable deletion mutants and recombinants containing foreign DNA inserted into both these genes indicates that they are non-essential. FEATURES from to/span description ORF 50 < 1 (c) ORF KO, amino end. ORF 1381 272 (c) ORF K1 ORF 1697 1431 (c) ORF K2 ORF 3023 1749 (c) ORF K3 ORF 3185 3051 (c) ORF K4 ORF 3604 3200 (c) ORF K5 ORF 3818 3573 (c) ORF K6 ORF 3957 4406 ORF K7 ORF 4235 4041 (c) ORF8 BASE COUNT 1454 a 789 c 712 g 1581 t ORIGIN 1 bp upstream of HindIII site 1 aagcttttca gctgcttaga cttccaagta ttaattcgtg acagatccat gtctgaaacg 61 agacgctaat tagtgtatat tttttcattt tttataattt tgtcatattg caccagaatt 121 aataatatct ctaatagatc tgattagtag atacatggct atcgcaaaac aacatataca 181 catttaataa aaataatatt tattaagaaa attcagattt cacgtaccca tcaatataaa 241 taaaataatg attccttaca ccgtacccat attaaggaga ttccacctta cccataaaca 301 atataaatcc agtaatatca tgtctgatga tgaacacaaa tggtgtatta aattccagtt 361 tttcaggaga tgatctcgcc gtagctacca taatagtaga tgcctctgct acagttcctt 421 gttcgtcgac atctatcttt gcattctgaa acattttata aatatataat gggtccctag 481 tcatatgttt aaacgacgca ttatctggat taaacatact aggagccatc atttcggcta 541 tcgacttaat atccctctta ttttcgatag aaaatttagg gagtttaaga ttgtacactt 601 tattccctaa ttgaaacgac caatagtcta attttgcagc cgtaatagaa tctgtgaaat 661 gggtcatatt atcacctatt gccaggtaca tactaatatt agcatcctta tacggaaggc 721 gtaccatatc atattcttcg tcatcgattg tgattgtatt tccttgcaat ttagtaacta 781 cgttcatcat gggaaccgtt ttcgtaccgt acttattagt aaaactagca ttgcgtgttt 841 tagtgatatc aaacggatat tgccatatac ctttaaaata tatagtatta atgattgccc 901 atagagtatt attgtcgagc atattagaat ctactacatt agacataccg gatctacgtt 961 ctactataga attaatttta ttaaccgcat ctcgtctaaa gtttaatcta tataggccga 1021 atctatgata ttgttgataa tacgacggtt taatgcacac agtattatct acgaaacttt 1081 gataagttag atcagtgtac gtatatttag atgttttcag cttagctaat cctgatatta 1141 attctgtaaa tgctggaccc agatctcttt ttctcaaatc catagtcttc aataattcta 1201 ttctagtatt acctgatgca ggcaatagcg acataaacat agaaaacgaa taaccaaacg 1261 gtgagaagac aatattatca tcttgaatat ttttatacgc tactataccg gcattggtaa 1321 atccttgcag acgataggta gacactgaac acgttaacga tagtatcaat aacgcaatca 1381 tgattttatg gtattaataa ttaaccttat ttttatgttc ggtataaaaa ttattgatgt 1441 ctacacatcc ttttgtaatt gacatctata tatccttttg tataatcaac tctaatcact 1501 ttaactttta cagttttccc taccagttta tccctatatt caacatatct atccatatgc 1561 atcttaacac tctctgccaa gatagcttca aagtgaggat agtcaaaaag ataaatatat 1621 agagcataat ccttctcgta tactctgccc tttattacat cacccgcatt gggcaacgaa 1681 taacaaaatg caagcatctt gttaacgggc tcgtaaattg ggataaaaat tatgttttta 1741 tatctatttt attcaagaga atattcagga atttcttttt ccggttgtat ctcatcgcag 1801 tatatatcat ttgtacattg tttcatattt tttaatagtc tacacctttt agtaggacta 1861 gtatcgtaca attcatagct gtattttgaa ttccaatcac gcataaaaat atcttccaat 1921 tgttgacgaa gacctaatcc atcatccggt gtaatattaa tagatgctcc acatgtatcc 1981 gtaaagtaat ttcctgtcca atttgaggta cctatatacg ccgttttatc ggttaccata 2041 tatttggcat ggtttaccct agaatacgga atgggaggat cagcatctgg tacaataaat 2101 agctttactt ctatatttat gtttttagat tttagcatag cgatagatct taaaaagttt 2161 ctcatgataa acgaagatcg ttgccagcaa ctaatcaata gcttaactga cacttgtctg 2221 tctatagcgg ctcttcttaa ttcatcttct atataaggcc aaaacaaaat attgcctgcc 2281 ttcgaataaa taatagggat aaagttcata acagatacat aaacgaattt actcgcattt 2341 ctgatacatg acaataaagc ggttaaatca ttggttcttt ccatagtaca tagttgttgc 2401 ggtgcagaag caataaatac agagtgtgga acgccgctta cgttaatact aagaggatga 2461 tctgtattat aatacgacgg ataaaagttt ttccaattat atggtagatt gttaactcca 2521 agataccagt atacctcaaa aatttgagtg agatccgctg ccaagttcct attattgaag 2581 atcgcaatac ccaattcttt gacctgagtt agtgatctcc aatccatgtt agcgcttcct 2641 aaataaatat gtgtattatc agatatccaa aattttgtat gaagaactcc tcctaggata 2701 tttgtaatat ctatgtatcg tacttcaact ccggccattt gtagtctttc aacatccttt 2761 aatggtttgt tagatttatt gacggctact ctaactcgta ctcctctttt gggtaattgt 2821 acaatcttgt ttaatattat cgtgccgaaa ttcgtaccca cttcatccga taaactccaa 2881 taaaaagatg atatatctag tgtttttgtg gtattggata gaatttccct ccacatgtta 2941 aatgtagaca aatatacttt atcaaattgc atacctatag gaatagtctc tgtaatcact 3001 gcgattgtat tatccggatt cattttattt gttaaaagaa taatcctata tcacttcact 3061 ctattaaaaa tccaagtttc tatttctttc atgactgatt ttttaacttc atccgtttcc 3121 ttatgaagat gatgtttggc accttcataa atttttattt ctctattaca atttgcatgt 3181 tgcatgaaat aatatgcacc taaaacatcg ctaatcttat tgtttgttcc ctggagtatg 3241 agagtcgggg ggtgttaatc ttggaaatta tttttctaac cttgttggta gccttcaaga 3301 cctgactagc aaatccagcc ttaatttttt catgattgat taatgggtcg tattggtatt 3361 tataaacttt atccatatct ctagatactg attctggaca tagctttccg actggcgcat 3421 ttagtgtgat ggttcccata agtttggcag ctagcagatt cagttttgaa acagcatctg 3481 cattaactag aggagacatt agaatcattg ctgtaaacaa gtttggatta tcgtaagagg 3541 ctagctccca tggaatgacc caataagtag atttaatagt taccacgtgc tgtaccaaag 3601 tcatcaatca tcattttttc accattactt cttccatgtc caatatgatc atgtgagaat 3661 actaaaattc ctaacgatga tatgttttca gctagttcgt cataacgtcc agaatgttta 3721 ccagctccat gacttatgaa tactaatgcc ttaggatatg taataggttt ccaatatatg 3781 taatcattgt ccagattgaa catacagttt gcactcatga ttcacgttat ataactatca 3841 atattaacag ttcgtttgat gatcatatta tttttatgtt ttattgataa ttgtaaaaac 3901 atacaattaa atcaatatag aggaaggaga cggctactgt cttttgtgag atagtcatgg 3961 cgactaaatt agattatgag gatgctgttt tttactttgt ggatgatgat aaaatatgta 4021 gtcgcgactc catcatcgat ctaatagatg aatatattac gtggagaaat catgttatag 4081 tgtttaacaa agatattacc agttgtggaa gactgtacaa ggaattgatg aagttcgatg 4141 atgtcgctat acggtactat ggtattgata aaattaatga gattgtcgaa gctatgagcg 4201 aaggagacca ctacatcaat tttacaaaag tccatgatca ggaaagttta ttcgctacca 4261 taggaatatg tgctaaaatc actgaacatt ggggatacaa aaagatttca gaatctagat 4321 tccaatcatt gggaaacatt acagatctga tgaccgacga taatataaac atcttgatac 4381 tttttctaga aaaaaaattg aattgatgat ataggggtct tcataacgca taattattac 4441 gttagcattc tatatccgtg ttaaaaaaaa ttatcctatc atgtatttga gagttttata 4501 tgtagcaaac atgatagctg tgatgccaat aagctt //