Path: utzoo!attcan!uunet!snorkelwacker!bionet!lear
From: GenBank-Updates@genbank.bio.net
Newsgroups: bionet.molbio.genbank.updates
Subject: Database Update
Message-ID:
Date: 15 Aug 90 21:12:44 GMT
Sender: lear@genbank.BIO.NET
Distribution: bionet
Lines: 4007
Approved: lear@genbank.bio.net
Checksum: 24265 242

LOCUS       THARGA        635 bp ds-DNA             BCT       11-AUG-1990
DEFINITION  T.acidophilum 5S rRNA gene.
ACCESSION   M32297
KEYWORDS    5S ribosomal RNA.
SOURCE      T.acidophilum (strain 122-1B2) DNA, clone pT5PH-23.
  ORGANISM  Thermoplasma acidophilum
            Prokaryota; Bacteria; Mendosicutes; Archaeobacteria;
            Thermoplasmales.
REFERENCE   1  (bases 1 to 635)
  AUTHORS   Ree,H.K. and Zimmermann,R.A.
  TITLE     Organization and expression of the 16S, 23S and 5S ribosomal RNA
            genes from the archaebacterium Thermoplasma acidophilum
  JOURNAL   Nucleic Acids Res. 18, 4471-4478 (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly
            submitted by H.K.Ree 23-FEB-1990.
FEATURES       from  to/span    description
    rRNA        213    335      5S rRNA
    RNA         211    346      5S RNA ( 3' end +/- 4 bp)
    site        184    191      put. promoter
    site        333    354      put. terminator
BASE COUNT      175 a    140 c    140 g    180 t
ORIGIN
        1 ctgcagcctt atcatgggaa aatgggcatc tgctggagac gactatgtac ttcatgcatc
       61 catattatct tatgatatta aaatgctaaa gcgcatgatc gcttcggaag gaatatcgac
      121 agggcgagat tcgctgtctc ataatcgccc ggatctgcgc tgaagccgta aaatcacgaa
      181 aatcttatat agatgtgttc tatatagtgt tcggcaacgg tcatagcagc agggaaacac
      241 cagatcccat tccgaactcg acggttaagc ctgctgcgta ttgcgttgta ctgtatgccg
      301 cgagggtacg ggaagcgcaa tatgctgtta ccacttttga aatgaaagtt ttttacatct
      361 atctttaatt cagtggcaat gattaatcct tggtcatcat ctgatttctt cgactatgag
      421 aggctaaaga gaggaattcg gcataggcga tatgaacgtt cccttcgatc attttctctt
      481 caggacttat acttggccag cgatccatag actatatcga ctacgcttta aagcatcaca
      541 tgaaattcaa cgttatgact ggcctcatgc cttcaggaga gatgcatctt ggcaacaaga
      601 gcgccataga ccaggtcatc ttcttccaga agctt
//
LOCUS       THARGG       4154 bp ds-DNA             BCT       11-AUG-1990
DEFINITION  T.acidophilum 23S ribosomal RNA gene.
ACCESSION   M32298
KEYWORDS    23S ribosomal RNA.
SOURCE      T.acidophilum (strain 122-1B2) DNA, clones pTH1-1, pL8 and
            pTH3-7.
  ORGANISM  Thermoplasma acidophilum
            Prokaryota; Bacteria; Mendosicutes; Archaeobacteria;
            Thermoplasmales.
REFERENCE   1  (bases 1 to 4154)
  AUTHORS   Ree,H.K. and Zimmermann,R.A.
  TITLE     Organization and expression of the 16S, 23S and 5S ribosomal RNA
            genes from the archaebacterium Thermoplasma acidophilum
  JOURNAL   Nucleic Acids Res. 18, 4471-4478 (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly
            submitted by H.K.Ree 23-FEB-1990.
FEATURES       from  to/span    description
    rRNA        521   3426      23S RNA
    RNA         429   3473      23S RNA gene (3' end +/- 5 bp)
BASE COUNT     1097 a    903 c   1177 g    977 t
ORIGIN
        1 tggtgttaag gtggatgcac tatatctatg agagctaccg tgagatggcc ataagggaga
       61 tacgcgatat gatagagtag ctgcataagg caattcggaa taacagacgc ggcggttgtt
      121 catcgcatag gtagagtaaa tctcacagag gactccgttg ttatagcggt atctgcgcct
      181 catagaggac aggcgtttga cgcatgcaga tacataatcg accagataaa ggagagagcg
      241 cccatatgga aacgtgacat cgtgcaggga tctcctgcga gctggcactg aacccatttt
      301 ttacggtatc agcaagaaaa agttaattaa aatatatttc aaattctgaa aagtgagtat
      361 tataaaatta ttggataatt gggcttccgg atcaaaatgc ttatatccct cttaatgata
      421 tagtccatac acgcttacaa tgcacctttt aatgcatcac attaaaaggg aagatgtcga
      481 attttgatga gactagatga gagatggcac aatggattgt gctctgctaa tctgcctaga
      541 ggatggcttg gttcgggcgc cgaagaagga cgtgccaagc tgcgataagc cttggggagg
      601 cgcatggagc cttagatcca aggatctccg aatgggactt cctgccgtaa ggcactccga
      661 aaggagaggg aacccgggga attgaaacat cttagtaccc ggaggaaaag aaatcaattg
      721 agataccgtt agtaaaggcg atcgaaagcg gtagaaggca aaccgaatag cccttcgaaa
      781 gaagggaaag atgtggagtt tggtcttcct ctaatgcctc ctgaagcgag atgaatcttc
      841 tggaaagaag agccttagaa ggtgatagcc ctgtaatcga agcttcagaa gctacaaggg
      901 gaagtaacca gagtaccatg cgtcgttttt cgcgtgggaa tttgggtggc actaacatcc
      961 aaccttaaat acgtcccgag tccgatagcg aacaaagtac cgtgagggaa agctgaaaag
     1021 aaacccggaa gggtggtgaa aagagcctga aactaggcag agataaactt atagggcagt
     1081
taagaggtga agtcgttaac tacgatggat cgctgtccta ttgtccgtgt tgaagaacgg 1141 gccagggagt tctgacgagt ggcaaggtta atcctgaaag gagtagccgt agcgaaagca 1201 actacccgca cagcaatggg ggggtggcgt ggtaaacgcg tttagtcact cgtgagagac 1261 ccgaagccgg tcgatctaca cctgagtagg ttgaagctca gtgaaagctg ggtggaggac 1321 cgaacctatt ctgatgtgca aatcgtttgg atgacttggg tgtaggggtt aaaggccaat 1381 ctaggccggc aatagcgggt tccccccgat actacccgca ggtagacctc gatggagatt 1441 ctcggcgagg tagagcgacc gattggttgg taagcagtcg aaaggctgcg ccgacttgtc 1501 aaactccgaa cttgtcgaga tcgtagaagt cgggtgctag ggggcaggga taagctttgc 1561 ttccgtgatg ggaacaacca agacgagggt taaggtccct aagttctagt taagtgcact 1621 aaataggttt gtggccaaag acagtgggga ggtaggctca gaagcagcca tccttcaaag 1681 agtgcgtaac agctcactca ccgaggtcac atgcctagaa gatggaaggg gctaaaacta 1741 gacaccgaga ccttcgagca ccgaaaggtg atctggtagg ggggcgtgcc atgtggatag 1801 aagtctcccc gagaggaggg atggaccgca tggtatcgcg gatcctggtg aaagtagcag 1861 agaagaaccg tgagaatcgg ttccgccgaa agggctaggg ttccttggca atgttcgtca 1921 gccgagggtt agtcgatcct aaggccatac ctaacaggat atggtcgaag gggaagccgg 1981 ttaatattcc ggcacactga acgttttgcc ctgtatgaga aggttcaggg taggggcggt 2041 acgggtgcca acgtatttat gctcataagc ggatggagag tcgtaatgac gagaagttcg 2101 cgaaagagcg tatgttcccg tttgggaatc gcctcgatcc ccggatccca tgaaaatcat 2161 gcaggggtca ggttcagtat cgtaccaaga accgacactg gtgcccctag gtgagaagcc 2221 taaggcgttt tgggataatg gacgcgaggg aaatcggcaa aatagctccg tatcttcggt 2281 ataaggagtg cctattccgt gagggaatag gtcgcagtga cgaagggact ccgactgttt 2341 accacaaaca cagatcgctg ctagtccgta aggatgtgta tagcggttga aacctgccca 2401 gtgctggtac ctgaaagccc cgtacaaggg gaagaagggc cagtaaacgg cgggggtaac 2461 tatgaccctc ttaaggtagc gtaataccta gccgcttaat tggcggcttg catgaaggtt 2521 caacgtgggt cccactgtcc ccgcgttcag cccagtgaaa ttgatgtact ggtgcacaat 2581 ccagtctctc ccacgtgaaa gcgaagtccc cgtggagctt tactgcagcc tgtagctgtg 2641 gtgcgatcct gactgcgtag tgtaggaagg agccgtcgaa gctctggttt cggccggagt 2701 ggaggcgcca atgaaacact tccctctcgg gattgcgtca ctaacctctt cggagggaca 2761 actattggtg 
ggcagtttgg gtggggcgcc acgcccctaa caacgtaaca ggggccccca
     2821 aaggtcagct caggagggtc agaaatcctc cgtagagtgt aaaagcaaaa gctggcttga
     2881 ctgtgttgca gacaactagc aacgcagatg cgaaagcagg gtttagcgaa ccacccagtt
     2941 cctccttagt gggggcgggt gataagagag aagttacccc agggataact gagtcgtcct
     3001 cggcaagagt acacatcgac ccgagggttt gctacttcga tgtcgtctgt tcctatcctg
     3061 gtgctgcata aggtgccaag ggtggggctg ttcgcccatt aaaagggatc ctgagatggg
     3121 ttcactacgt cgcgagacag tagggttgct tctccgtggg agtgctcgat gtctgagggg
     3181 aaggggcctt tagtacgaga ggaacaaggg ctcgtgacct ctagtttacc ggttgtctgg
     3241 caaggcatcg ccgggtagcc acgtcatacg cggataagag ctgaaagcat ctaagctcga
     3301 agccgccccc gaaaatagac atcgtcatca gatcgcctct agaagagagg tttgatagag
     3361 ccgggatgta aggatcgagc ttcggcgaga ttttaagtcc acggctacta aagatcgaag
     3421 gcacaatcca tgctaaattc tcgtctagtt gaagtcataa gcgtgtttga tttttgtttt
     3481 ttattttttt gccgttgtag tgtgaactat gatgaaaaat tatattgata taaaagagaa
     3541 atagaaaaaa tgatcactgc tttatatttt ctatctgagc ctctttcagt atctgatcgg
     3601 ttatgctgtc gaaaatctca cgtatcttat catttataac gacggcaggt attccgttgt
     3661 caccgttttc tgctatttct ggtattagag gtatttttcc caggaacggg aggttgtatt
     3721 cctttgcgga tttctccccg ccgccgttct tgaatatgta tgtgagctta ccgcagtgcg
     3781 cacacaaaac cgctcatgtt ctctattatg ccaaggacag ggagcttcag ctgttttgca
     3841 aagtttattg ccttctttgc atcgagaagg gcgacatcct gcggcgttac gactatgaca
     3901 acgccatttg attctggtat cagctgggcc acgctcagtg caacgtctcc ggtgccaggc
     3961 ggcatgtcaa gcacaacaac atccgtatcc ttccagttta cgtcttccag aaactgctgt
     4021 attgccttgt gcatcagcga accacgccat attatcggcg tatcctctgt cggtatgagg
     4081 aaacccatgg acaccacgcg aacaccgtat cttgtctctg ccggtattat tccctcctcg
     4141 tcagcgtaaa gctt
//
LOCUS       HIVANT70      583 bp ss-RNA             VRL       11-AUG-1990
DEFINITION  Human immunodeficiency virus type 1 (HIV-1) proviral LTR region.
ACCESSION   M31171
KEYWORDS    .
SOURCE      Human immunodeficiency virus type 1 (isolate ANT70) proviral DNA.
  ORGANISM  Human immunodeficiency virus type 1
            Viridae; ss-RNA enveloped viruses; Positive strand RNA virus;
            Retroviridae; Lentivirinae.
REFERENCE   1  (bases 1 to 583)
  AUTHORS   De Leys,R., Vanderborght,B., Haesevelde,M.V., Heyndrickx,L.,
            van Geel,A., Wauters,C., Bernaerts,R., Saman,E., Nijs,P.,
            Willems,B., Taelman,H., van der Groen,G., Piot,P., Tersmette,T.,
            Huisman,J.G. and van Heuverswyn,H.
  TITLE     Isolation and partial characterization of an unusual human
            immunodeficiency retrovirus from two persons of west-central
            African origin
  JOURNAL   J. Virol. 64, 1207-16 (1990)
  STANDARD  full staff_review
COMMENT     Kindly provided prior to publication in computer-readable form by
            R. De Leys, Innogenetics, Antwerp.
BASE COUNT      162 a    133 c    153 g    135 t
ORIGIN      5'-terminus of 5'LTR.
        1 tggaagggtt aatttactcc cataaaagag cagaaatcct ggatctttgg gtgtataaca
       61 ctcagggatt cttccctgat tggcagaact acacaccagg accaggaacc aggttcccac
      121 tgacatttgg atggttgttt aaactagtac cagtgtcaga agaagaggca gaaagactag
      181 gaaatacatg tgagagggct aatctcctgc atccagcatg tgcccatgga tttgaagata
      241 cacataaaga aatactgatg tggaagtttg atagatctct aggcaacacc catgttgcta
      301 tgataactca cccagagctc ttccagaagg actaaaaact gctgacctga agattgctga
      361 cactgtggaa ctttccagca aagactgctg acactgcggg gactttccag tgggagggac
      421 agggggcggt tcggggagtg gctaaccctc agaagctgca tataagcagc cgctttctgc
      481 ttgtaccggg tctcggttag aggaccaggt ctgagcccgg gagctccctg gcctctagct
      541 gaacccgctc gttaacgctc aataaagctt gccttgagtg aga
//
LOCUS       HIVBAN1        59 bp ss-RNA             VRL       11-AUG-1990
DEFINITION  Human immunodeficiency virus type 1 (HIV-1) proviral gag region.
ACCESSION   M27953
KEYWORDS    .
SEGMENT     1 of 2
SOURCE      Human immunodeficiency virus type 1 (isolate BAN) proviral DNA.
  ORGANISM  Human immunodeficiency virus type 1
            Viridae; ss-RNA enveloped viruses; Positive strand RNA virus;
            Retroviridae; Lentivirinae.
REFERENCE   1  (bases 1 to 59)
  AUTHORS   Goodenow,M., Huet,T., Saurin,W., Kwok,S., Sninsky,J. and
            Wain-Hobson,S.
  TITLE     HIV-1 isolates are rapidly evolving quasispecies: Evidence for
            viral mixtures and preferred nucleotide substitutions
  JOURNAL   J. Acquir. Immune Defic. Syndr. 2, 344-52 (1989)
  STANDARD  full staff_review
COMMENT     [1] kindly submitted in computer-readable form by M. Goodenow.
            Sibling sequences are available (see 1989: I-A-172).
BASE COUNT       26 a      7 c     10 g     16 t
ORIGIN
        1 ctataaaaga tggataatcc tgggattaaa taaaatagta agaatgtata gccctacta
//
LOCUS       HIVBAN2       250 bp ss-RNA             VRL       11-AUG-1990
DEFINITION  Human immunodeficiency virus type 1 (HIV-1) proviral gag region.
ACCESSION   M27944
KEYWORDS    .
SEGMENT     2 of 2
SOURCE      Human immunodeficiency virus type 1 (isolate BAN) proviral DNA.
  ORGANISM  Human immunodeficiency virus type 1
            Viridae; ss-RNA enveloped viruses; Positive strand RNA virus;
            Retroviridae; Lentivirinae.
REFERENCE   1  (bases 1 to 250)
  AUTHORS   Goodenow,M., Huet,T., Saurin,W., Kwok,S., Sninsky,J. and
            Wain-Hobson,S.
  TITLE     HIV-1 isolates are rapidly evolving quasispecies: Evidence for
            viral mixtures and preferred nucleotide substitutions
  JOURNAL   J. Acquir. Immune Defic. Syndr. 2, 344-52 (1989)
  STANDARD  full staff_review
COMMENT     Sibling sequences are available (see 1989: I-A-173).
BASE COUNT       96 a     40 c     42 g     72 t
ORIGIN
        1 aaattaaccc cactctgtgt tactttaaat tgcactaagt tgaagaatgt tacttatgcc
       61 aataatagta gtagggaaaa agaagaaatg aaaaactgct ctttcaccac aagaataggg
      121 aataaggtgc agaaagaata tgcacttttt tataaacttg atgtagtacc aatagataag
      181 gataatacaa gctttatatt gatacattgt aatacctcga ccattacaca ggcttgtcca
      241 aaggtatcct
//
LOCUS       HIVBRU       9229 bp ss-RNA             VRL       11-AUG-1990
DEFINITION  Human immunodeficiency virus type 1 (HIV-1) proviral complete
            genome.
ACCESSION   K02013
KEYWORDS    complete genome.
SOURCE      Human immunodeficiency virus type 1 (HIV-1), isolate BRU proviral
            DNA clone lambda-J19.
  ORGANISM  Human immunodeficiency virus type 1
            Viridae; ss-RNA enveloped viruses; Positive strand RNA virus;
            Retroviridae; Lentivirinae.
REFERENCE   1  (bases 1 to 9229)
  AUTHORS   Wain-Hobson,S., Sonigo,P., Danos,O., Cole,S.
            and Alizon,M.
  TITLE     Nucleotide sequence of the AIDS Virus, LAV
  JOURNAL   Cell 40, 9-17 (1985)
  STANDARD  full staff_review
REFERENCE   2  (bases 1713 to 1748; revision of [1])
  AUTHORS   Alizon,M., Wain-Hobson,S., Montagnier,L. and Sonigo,P.
  TITLE     Genetic variability of the AIDS virus: Nucleotide sequence
            analysis of two isolates from African patients
  JOURNAL   Cell 46, 63-74 (1986)
  STANDARD  full staff_review
COMMENT     The original LAV, sometimes called LAV-1 to distinguish it from
            HIV2 (LAV-2), is now referred to as HIV-1bru. An infectious clone
            of this virus has been constructed by Keith Peden, Molecular
            Biology and Genetics, Johns Hopkins University School of
            Medicine, Baltimore, MD 21205 (301) 955-3652. HIVNL43 is also an
            infectious clone having for its 3' half a clone of the BRU
            isolate. The tat sequence for the L20 clone is available (see
            1989: I-A-54).
FEATURES       from  to/span    description
    pept        336   1874      gag polyprotein
    pept     < 1631   4678      pol polyprotein (NH2-terminus uncertain; AA
                                at 1631)
    pept       4623   5201      vif protein
    pept       5141   5431      vpr protein
    pept       5412   5626      tat protein, exon 2 (first expressed exon)
               7972   8017      tat protein, exon 3 (AA at 7973)
    pept       5551   5626      rev protein, exon 2 (first expressed exon)
               7972   8246      rev protein, exon 3 (AA at 7974)
    pept       5643   5888      vpu protein
    pept       5803   8388      envelope polyprotein
    pept       8390   9010      nef protein
    IVS         290   5358      tat, rev, nef subgenomic mRNA intron 1
    IVS        5627   7971      tat cds intron 2
    IVS        5627   7971      rev cds intron 2
    LTR         < 1    180      5' LTR
    LTR        8679 > 9229      3' LTR
    rpt         < 1     97      R repeat 5' copy
    rpt        9133   9229      R repeat 3' copy
    binding     182    199      primer (Lys-tRNA) binding site
    revision   1712   1749      atttcttcagagcagaccagagccaacagccccaccag in
                                [2]; ag in [1]
    signal     9205   9210      poly-A signal
BASE COUNT     3289 a   1656 c   2232 g   2052 t
ORIGIN      Cap site of genomic RNA.
1 ggtctctctg gttagaccag atttgagcct gggagctctc tggctaacta gggaacccac 61 tgcttaagcc tcaataaagc ttgccttgag tgcttcaagt agtgtgtgcc cgtctgttgt 121 gtgactctgg taactagaga tccctcagac ccttttagtc agtgtggaaa atctctagca 181 gtggcgcccg aacagggact tgaaagcgaa agggaaacca gaggagctct ctcgacgcag 241 gactcggctt gctgaagcgc gcacggcaag aggcgagggg aggcgactgg tgagtacgcc 301 aaaaattttg actagcggag gctagaagga gagagatggg tgcgagagcg tcagtattaa 361 gcgggggaga attagatcga tgggaaaaaa ttcggttaag gccaggggga aagaaaaaat 421 ataaattaaa acatatagta tgggcaagca gggagctaga acgattcgca gttaatcctg 481 gcctgttaga aacatcagaa ggctgtagac aaatactggg acagctacaa ccatcccttc 541 agacaggatc agaagaactt agatcattat ataatacagt agcaaccctc tattgtgtgc 601 atcaaaggat agagataaaa gacaccaagg aagctttaga caagatagag gaagagcaaa 661 acaaaagtaa gaaaaaagca cagcaagcag cagctgacac aggacacagc agccaggtca 721 gccaaaatta ccctatagtg cagaacatcc aggggcaaat ggtacatcag gccatatcac 781 ctagaacttt aaatgcatgg gtaaaagtag tagaagagaa ggctttcagc ccagaagtga 841 tacccatgtt ttcagcatta tcagaaggag ccaccccaca agatttaaac accatgctaa 901 acacagtggg gggacatcaa gcagccatgc aaatgttaaa agagaccatc aatgaggaag 961 ctgcagaatg ggatagagtg catccagtgc atgcagggcc tattgcacca ggccagatga 1021 gagaaccaag gggaagtgac atagcaggaa ctactagtac ccttcaggaa caaataggat 1081 ggatgacaaa taatccacct atcccagtag gagaaattta taaaagatgg ataatcctgg 1141 gattaaataa aatagtaaga atgtatagcc ctaccagcat tctggacata agacaaggac 1201 caaaagaacc ctttagagac tatgtagacc ggttctataa aactctaaga gccgagcaag 1261 cttcacagga ggtaaaaaat tggatgacag aaaccttgtt ggtccaaaat gcgaacccag 1321 attgtaagac tattttaaaa gcattgggac cagcagctac actagaagaa atgatgacag 1381 catgtcaggg agtgggagga cccggccata aggcaagagt tttggctgaa gcaatgagcc 1441 aagtaacaaa ttcagctacc ataatgatgc aaagaggcaa ttttaggaac caaagaaaga 1501 ttgttaagtg tttcaattgt ggcaaagaag ggcacatagc cagaaattgc agggccccta 1561 ggaaaaaggg ctgttggaaa tgtggaaagg aaggacacca aatgaaagat tgtactgaga 1621 gacaggctaa ttttttaggg aagatctggc cttcctacaa gggaaggcca gggaattttc 1681 ttcagagcag accagagcca 
acagccccac catttcttca gagcagacca gagccaacag 1741 ccccaccaga agagagcttc aggtctgggg tagagacaac aactccctct cagaagcagg 1801 agccgataga caaggaactg tatcctttaa cttccctcag atcactcttt ggcaacgacc 1861 cctcgtcaca ataaagatag gggggcaact aaaggaagct ctattagata caggagcaga 1921 tgatacagta ttagaagaaa tgagtttgcc aggaagatgg aaaccaaaaa tgataggggg 1981 aattggaggt tttatcaaag taagacagta tgatcagata ctcatagaaa tctgtggaca 2041 taaagctata ggtacagtat tagtaggacc tacacctgtc aacataattg gaagaaatct 2101 gttgactcag attggttgca ctttaaattt tcccattagt cctattgaaa ctgtaccagt 2161 aaaattaaag ccaggaatgg atggcccaaa agttaaacaa tggccattga cagaagaaaa 2221 aataaaagca ttagtagaaa tttgtacaga aatggaaaag gaagggaaaa tttcaaaaat 2281 tgggcctgaa aatccataca atactccagt atttgccata aagaaaaaag acagtactaa 2341 atggagaaaa ttagtagatt tcagagaact taataagaga actcaagact tctgggaagt 2401 tcaattagga ataccacatc ccgcagggtt aaaaaagaaa aaatcagtaa cagtactgga 2461 tgtgggtgat gcatattttt cagttccctt agatgaagac ttcaggaagt atactgcatt 2521 taccatacct agtataaaca atgagacacc agggattaga tatcagtaca atgtgcttcc 2581 acagggatgg aaaggatcac cagcaatatt ccaaagtagc atgacaaaaa tcttagagcc 2641 ttttagaaaa caaaatccag acatagttat ctatcaatac atggatgatt tgtatgtagg 2701 atctgactta gaaatagggc agcatagaac aaaaatagag gagctgagac aacatctgtt 2761 gaggtgggga cttaccacac cagacaaaaa acatcagaaa gaacctccat tcctttggat 2821 gggttatgaa ctccatcctg ataaatggac agtacagcct atagtgctgc cagaaaaaga 2881 cagctggact gtcaatgaca tacagaagtt agtgggaaaa ttgaattggg caagtcagat 2941 ttacccaggg attaaagtaa ggcaattatg taaactcctt agaggaacca aagcactaac 3001 agaagtaata ccactaacag aagaagcaga gctagaactg gcagaaaaca gagagattct 3061 aaaagaacca gtacatggag tgtattatga cccatcaaaa gacttaatag cagaaataca 3121 gaagcagggg caaggccaat ggacatatca aatttatcaa gagccattta aaaatctgaa 3181 aacaggaaaa tatgcaagaa cgaggggtgc ccacactaat gatgtaaaac aattaacaga 3241 ggcagtgcaa aaaataacca cagaaagcat agtaatatgg ggaaagactc ctaaatttaa 3301 actacccata caaaaggaaa catgggaaac atggtggaca gagtattggc aagccacctg 3361 gattcctgag tgggagtttg tcaatacccc 
tcctttagtg aaattatggt accagttaga 3421 gaaagaaccc atagtaggag cagaaacgtt ctatgtagat ggggcagcta gcagggagac 3481 taaattagga aaagcaggat atgttactaa tagaggaaga caaaaagttg tcaccctaac 3541 tgacacaaca aatcagaaga ctgagttaca agcaattcat ctagctttgc aggattcggg 3601 attagaagta aatatagtaa cagactcaca atatgcatta ggaatcattc aagcacaacc 3661 agataaaagt gaatcagagt tagtcaatca aataatagag cagttaataa aaaaggaaaa 3721 ggtctatctg gcatgggtac cagcacacaa aggaattgga ggaaatgaac aagtagataa 3781 attagtcagt gctggaatca ggaaagtact atttttagat ggaatagata aggcccaaga 3841 tgaacatgag aaatatcaca gtaattggag agcaatggct agtgatttta acctgccacc 3901 tgtagtagca aaagaaatag tagccagctg tgataaatgt cagctaaaag gagaagccat 3961 gcatggacaa gtagactgta gtccaggaat atggcaacta gattgtacac atttagaagg 4021 aaaagttatc ctggtagcag ttcatgtagc cagtggatat atagaagcag aagttattcc 4081 agcagaaaca gggcaggaaa cagcatactt tcttttaaaa ttagcaggaa gatggccagt 4141 aaaaacaata catacagaca atggcagcaa tttcaccagt actacggtta aggccgcctg 4201 ttggtgggcg ggaatcaagc aggaatttgg aattccctac aatccccaaa gtcaaggagt 4261 agtagaatct atgaataaag aattaaagaa aattataggc caggtaagag atcaggctga 4321 acatcttaag acagcagtac aaatggcagt attcatccac aattttaaaa gaaaaggggg 4381 gattgggggg tacagtgcag gggaaagaat agtagacata atagcaacag acatacaaac 4441 taaagaatta caaaaacaaa ttacaaaaat tcaaaatttt cgggtttatt acagggacag 4501 cagagatcca ctttggaaag gaccagcaaa gctcctctgg aaaggtgaag gggcagtagt 4561 aatacaagat aatagtgaca taaaagtagt gccaagaaga aaagcaaaga tcattaggga 4621 ttatggaaaa cagatggcag gtgatgattg tgtggcaagt agacaggatg aggattagaa 4681 catggaaaag tttagtaaaa caccatatgt atgtttcagg gaaagctagg ggatggtttt 4741 atagacatca ctatgaaagc cctcatccaa gaataagttc agaagtacac atcccactag 4801 gggatgctag attggtaata acaacatatt ggggtctgca tacaggagaa agagactggc 4861 atctgggtca gggagtctcc atagaatgga ggaaaaagag atatagcaca caagtagacc 4921 ctgaactagc agaccaacta attcatctgt attactttga ctgtttttca gactctgcta 4981 taagaaaggc cttattagga catatagtta gccctaggtg tgaatatcaa gcaggacata 5041 acaaggtagg atctctacaa tacttggcac tagcagcatt 
aataacacca aaaaagataa 5101 agccaccttt gcctagtgtt acgaaactga cagaggatag atggaacaag ccccagaaga 5161 ccaagggcca cagagggagc cacacaatga atggacacta gagcttttag aggagcttaa 5221 gaatgaagct gttagacatt ttcctaggat ttggctccat ggcttagggc aacatatcta 5281 tgaaacttat ggggatactt gggcaggagt ggaagccata ataagaattc tgcaacaact 5341 gctgtttatc catttcagaa ttgggtgtcg acatagcaga ataggcgtta ctcaacagag 5401 gagagcaaga aatggagcca gtagatccta gactagagcc ctggaagcat ccaggaagtc 5461 agcctaaaac tgcttgtacc acttgctatt gtaaaaagtg ttgctttcat tgccaagttt 5521 gtttcacaac aaaagcctta ggcatctcct atggcaggaa gaagcggaga cagcgacgaa 5581 gacctcctca aggcagtcag actcatcaag tttctctatc aaagcagtaa gtagtacatg 5641 taatgcaacc tatacaaata gcaatagcag cattagtagt agcaataata atagcaatag 5701 ttgtgtggtc catagtaatc atagaatata ggaaaatatt aagacaaaga aaaatagaca 5761 ggttaattga tagactaata gaaagagcag aagacagtgg caatgagagt gaaggagaaa 5821 tatcagcact tgtggagatg ggggtggaaa tggggcacca tgctccttgg gatattgatg 5881 atctgtagtg ctacagaaaa attgtgggtc acagtctatt atggggtacc tgtgtggaag 5941 gaagcaacca ccactctatt ttgtgcatca gatgctaaag catatgatac agaggtacat 6001 aatgtttggg ccacacatgc ctgtgtaccc acagacccca acccacaaga agtagtattg 6061 gtaaatgtga cagaaaattt taacatgtgg aaaaatgaca tggtagaaca gatgcatgag 6121 gatataatca gtttatggga tcaaagccta aagccatgtg taaaattaac cccactctgt 6181 gttagtttaa agtgcactga tttggggaat gctactaata ccaatagtag taataccaat 6241 agtagtagcg gggaaatgat gatggagaaa ggagagataa aaaactgctc tttcaatatc 6301 agcacaagca taagaggtaa ggtgcagaaa gaatatgcat ttttttataa acttgatata 6361 ataccaatag ataatgatac taccagctat acgttgacaa gttgtaacac ctcagtcatt 6421 acacaggcct gtccaaaggt atcctttgag ccaattccca tacattattg tgccccggct 6481 ggttttgcga ttctaaaatg taataataag acgttcaatg gaacaggacc atgtacaaat 6541 gtcagcacag tacaatgtac acatggaatt aggccagtag tatcaactca actgctgttg 6601 aatggcagtc tagcagaaga agaggtagta attagatctg ccaatttcac agacaatgct 6661 aaaaccataa tagtacagct gaaccaatct gtagaaatta attgtacaag acccaacaac 6721 aatacaagaa aaagtatccg tatccagagg ggaccaggga gagcatttgt 
tacaatagga 6781 aaaataggaa atatgagaca agcacattgt aacattagta gagcaaaatg gaatgccact 6841 ttaaaacaga tagctagcaa attaagagaa caatttggaa ataataaaac aataatcttt 6901 aagcaatcct caggagggga cccagaaatt gtaacgcaca gttttaattg tggaggggaa 6961 tttttctact gtaattcaac acaactgttt aatagtactt ggtttaatag tacttggagt 7021 actgaagggt caaataacac tgaaggaagt gacacaatca cactcccatg cagaataaaa 7081 caatttataa acatgtggca ggaagtagga aaagcaatgt atgcccctcc catcagcgga 7141 caaattagat gttcatcaaa tattacaggg ctgctattaa caagagatgg tggtaataac 7201 aacaatgggt ccgagatctt cagacctgga ggaggagata tgagggacaa ttggagaagt 7261 gaattatata aatataaagt agtaaaaatt gaaccattag gagtagcacc caccaaggca 7321 aagagaagag tggtgcagag agaaaaaaga gcagtgggaa taggagcttt gttccttggg 7381 ttcttgggag cagcaggaag cactatgggc gcacggtcaa tgacgctgac ggtacaggcc 7441 agacaattat tgtctggtat agtgcagcag cagaacaatt tgctgagggc tattgaggcg 7501 caacagcatc tgttgcaact cacagtctgg ggcatcaagc agctccaggc aagaatcctg 7561 gctgtggaaa gatacctaaa ggatcaacag ctcctgggga tttggggttg ctctggaaaa 7621 ctcatttgca ccactgctgt gccttggaat gctagttgga gtaataaatc tctggaacag 7681 atttggaata acatgacctg gatggagtgg gacagagaaa ttaacaatta cacaagctta 7741 atacattcct taattgaaga atcgcaaaac cagcaagaaa agaatgaaca agaattattg 7801 gaattagata aatgggcaag tttgtggaat tggtttaaca taacaaattg gctgtggtat 7861 ataaaaatat tcataatgat agtaggaggc ttggtaggtt taagaatagt ttttgctgta 7921 ctttctatag tgaatagagt taggcaggga tattcaccat tatcgtttca gacccacctc 7981 ccaaccccga ggggacccga caggcccgaa ggaatagaag aagaaggtgg agagagagac 8041 agagacagat ccattcgatt agtgaacgga tccttagcac ttatctggga cgatctgcgg 8101 agcctgtgcc tcttcagcta ccaccgcttg agagacttac tcttgattgt aacgaggatt 8161 gtggaacttc tgggacgcag ggggtgggaa gccctcaaat attggtggaa tctcctacag 8221 tattggagtc aggaactaaa gaatagtgct gttagcttgc tcaatgccac agccatagca 8281 gtagctgagg ggacagatag ggttatagaa gtagtacaag gagcttgtag agctattcgc 8341 cacataccta gaagaataag acagggcttg gaaaggattt tgctataaga tgggtggcaa 8401 gtggtcaaaa agtagtgtgg ttggatggcc tactgtaagg gaaagaatga gacgagctga 
     8461 gccagcagca gatggggtgg gagcagcatc tcgagacctg gaaaaacatg gagcaatcac
     8521 aagtagcaat acagcagcta ccaatgctgc ttgtgcctgg ctagaagcac aagaggagga
     8581 ggaggtgggt tttccagtca cacctcaggt acctttaaga ccaatgactt acaaggcagc
     8641 tgtagatctt agccactttt taaaagaaaa ggggggactg gaagggctaa ttcactccca
     8701 acgaagacaa gatatccttg atctgtggat ctaccacaca caaggctact tccctgattg
     8761 gcagaactac acaccagggc caggggtcag atatccactg acctttggat ggtgctacaa
     8821 gctagtacca gttgagccag ataaggtaga agaggccaat aaaggagaga acaccagctt
     8881 gttacaccct gtgagcctgc atggaatgga tgaccctgag agagaagtgt tagagtggag
     8941 gtttgacagc cgcctagcat ttcatcacgt ggcccgagag ctgcatccgg agtacttcaa
     9001 gaactgctga catcgagctt gctacaaggg actttccgct ggggactttc cagggaggcg
     9061 tggcctgggc gggactgggg agtggcgagc cctcagatgc tgcatataag cagctgcttt
     9121 ttgcctgtac tgggtctctc tggttagacc agatttgagc ctgggagctc tctggctaac
     9181 tagggaaccc actgcttaag cctcaataaa gcttgccttg agtgcttca
//
LOCUS       HIVELI       9176 bp ss-RNA             VRL       11-AUG-1990
DEFINITION  Human immunodeficiency virus type 1 (HIV-1) proviral complete
            genome.
ACCESSION   K03454
KEYWORDS    complete genome.
SOURCE      Human immunodeficiency virus type 1, isolate ELI, proviral DNA,
            clone E-H12.
  ORGANISM  Human immunodeficiency virus type 1
            Viridae; ss-RNA enveloped viruses; Positive strand RNA virus;
            Retroviridae; Lentivirinae.
REFERENCE   1  (bases 1 to 9176)
  AUTHORS   Alizon,M., Wain-Hobson,S., Montagnier,L. and Sonigo,P.
  TITLE     Genetic variability of the AIDS virus: Nucleotide sequence
            analysis of two isolates from African patients
  JOURNAL   Cell 46, 63-74 (1986)
  STANDARD  full staff_review
COMMENT     Sibling sequences of the ELI isolate are available (see 1989:
            I-179)
FEATURES       from  to/span    description
    pept        336   1838      gag polyprotein
    pept     < 1634   4642      pol polyprotein (NH2-terminus uncertain; AA
                                at 1634)
    pept       4587   5165      vif protein
    pept       5105   5395      vpr protein
    pept       5376   5590      tat protein, exon 2 (first expressed exon)
               7915   7999      tat protein, exon 3 (AA at 7916)
    pept       5515   5590      rev protein, exon 2 (first expressed exon)
               7915   8195      rev protein, exon 3 (AA at 7917)
    pept       5607   5852      vpu protein
    pept       5770   8331      envelope polyprotein precursor
    pept       8333   8953      nef protein
    IVS        5591   7914      tat cds intron 2
    IVS        5591   7914      rev cds intron 2
    LTR         < 1    180      5' LTR
    LTR        8625 > 9176      3' LTR
    rpt         < 1     97      R repeat 5' copy
    rpt        9078   9175      R repeat 3' copy
    binding     182    199      primer (Lys-tRNA) binding site
    signal     9151   9156      mRNA polyadenylation signal
BASE COUNT     3333 a   1632 c   2179 g   2032 t
ORIGIN      Cap site of genomic RNA.
1 ggtctctctg gttagaccag atttgagcct gggagctctc tggctagcta gggaacccac 61 tgcttaagcc tcaataaagc ttgccttgag tgcttcaagt agtgtgtgcc cgtctgttgt 121 gtgactctgg taactagaga tccctcagac ccctttagtc agagtggaaa atctctagca 181 gtggcgcccg aacagggacc tgaaagcgaa agtagaacca gaggagctct ctcgacgcag 241 gactcggctt gctgaagcgc gcacggcaag aggcgagggg cagcgactgg tgagtacgct 301 aaaatttttg actagcggag gctagaagga gagagatggg tgcgagagcg tcagtattaa 361 gcgggggaaa attagataaa tgggaaaaaa ttcggttacg gccaggagga aagaaaaaat 421 atagactaaa acatatagta tgggcaagca gggagctaga acgatatgca cttaatcctg 481 gccttttaga aacatcagaa ggctgtaaac aaataatagg gcagctacaa ccagctattc 541 agacaggaac agaagaactt agatcattat ataatacagt agcaaccctc tattgtgtac 601 ataaaggaat agatgtaaaa gacaccaagg aagctttaga aaagatggag gaagagcaaa 661 acaaaagtaa gaaaaaggca cagcaagcag cagctgacac aggaaacaac agccaggtca 721 gccaaaatta tcctatagtg cagaacctac aggggcaaat ggtacatcag gccatatcac 781 ctagaacttt gaacgcatgg gtaaaagtaa tagaagaaaa ggctttcagc ccagaagtaa 841 tacccatgtt ttcagcatta tcagaaggag ccaccccaca agatttaaac accatgctaa 901 acacagtggg gggacatcaa gcagccatgc aaatgctaaa agagaccatc aatgaagaag 961 ctgcagaatg ggataggtta catccagtgc atgcagggcc tattgcacca ggccagatga 1021 gagaaccaag gggaagtgat atagcaggaa ctactagtac ccttcaggaa caaatagcat 1081 ggatgacaag taacccacct atcccagtag gagaaatcta taaaagatgg ataattgtgg 1141 gattaaataa aatagtaaga atgtatagcc ctgtcagcat tttggacata agacagggac 1201 caaaggaacc ttttagagac tatgtagacc ggttctataa aactctaaga gccgagcaag 1261 cttcacagga tgtaaaaaat tggatgacag aaaccttgtt ggtccaaaat gcaaacccag 1321 attgcaagac tatcttaaaa gcattgggac cacaggctac actagaagaa atgatgacag 1381 catgtcaggg agtggggggg cccagccata aagcaagagt tctggctgag gcaatgagcc 1441 aagcaacaaa ttcagttact acagcaatga tgcagagagg caattttaag ggcccaagaa 1501 aaattattaa gtgtttcaat tgtggcaaag aagggcacat agcaaaaaat tgcagggccc 1561 ctaggaaaaa gggctgttgg agatgtggaa aggaaggaca ccaactaaaa gattgcactg 1621 agagacaggc taatttttta gggagaattt ggccttccca caagggaagg ccggggaact 1681 ttctccaaag cagaccagag 
ccaacagccc caccagcaga gagcttcggg tttggggaag 1741 agataacccc ctctcaaaaa caggagcaga aagacaagga actgtatcct ttaacttccc 1801 tcaaatcact ctttggcaac gaccccttgt cgcaataaaa atagggggac agctaaagga 1861 agctctatta gatacaggag cagatgatac agtattagaa gaaatgaatt tgccaggaaa 1921 atggaaacca aaaatgatag ggggaattgg aggttttatc aaagtaagac agtatgatca 1981 aatacccata gaaatctgtg gacagaaagc tataggtaca gtattagtag gacctacgcc 2041 tgtcaacata atcggaagaa atttgttgac ccagattggc tgcactttaa attttccaat 2101 tagtcctatt gaaactgtac cagtaaaatt aaagccagga atggatggcc caaaagttaa 2161 acaatggcca ttgacagaag aaaaaataaa agcattaaca gaaatttgta cagatatgga 2221 aaaggaagga aaaatttcaa gaattgggcc tgaaaatcca tacaatactc caatatttgc 2281 cataaagaaa aaagacagta ccaagtggag aaaattagta gatttcagag aacttaataa 2341 gagaactcaa gatttctggg aagttcaatt aggaataccg catcctgcag ggctgaaaaa 2401 gaaaaaatca gtaacagtac tggatgtggg tgatgcatat ttttcagttc ccttagatga 2461 agattttagg aaatataccg cctttaccat atctagtata aacaatgaga caccagggat 2521 tagatatcag tacaatgtgc ttccacaggg atggaaagga tcaccggcaa tattccaaag 2581 tagcatgaca aaaatcttag agccctttag aaaacaaaat ccagaaatgg ttatctatca 2641 atacatggat gatttgtatg taggatctga cttagaaata gggcagcata ggacaaaaat 2701 agagaaatta agagaacatc tattgaggtg gggatttacc agaccagata aaaaacatca 2761 gaaagaaccc ccatttcttt ggatgggtta tgaactccat cctgataaat ggacagtaca 2821 gtctataaaa ctgccagaaa aggagagctg gactgtcaat gatatacaga acttagtgga 2881 gagattaaac tgggcaagcc agatttatcc aggaattaaa gtaagacaat tatgtaaact 2941 ccttagggga accaaagcac taacagaagt aataccacta acagaagaag cagaattaga 3001 actggcagaa aacagggaaa ttttaaaaga accagtacat ggagtgtatt atgacccatc 3061 aaaagactta atagcagaaa tacagaaaca agggcacggc caatggacat accaaattta 3121 tcaagaacca tttaaaaatc tgaaaacagg aaagtatgca agaatgaggg gtgcccacac 3181 taatgatgta aagcaattag cagaggcagt gcaaagaata tccacagaaa gcatagtgat 3241 atggggaagg actcctaaat ttagactacc catacaaaag gaaacatggg aaacatggtg 3301 ggcagagtat tggcaagcca cttggattcc tgagtgggaa tttgtcaata cccctccttt 3361 agtaaaatta tggtaccagt tagagaagga 
acccataata ggagcagaaa ctttctatgt 3421 agatggggca gctaatagag agactaaatt aggaaaagca ggatatgtta ctgacagagg 3481 aagacagaaa gttgtccctt tgactgacac gacaaatcag aagactgagt tacaagcaat 3541 taatctagcc ttgcaggatt cgggattaga agtaaacata gtaacagatt cacaatatgc 3601 attaggaatc attcaagcac aaccagataa gagtgaatca gagttagtca atcaaataat 3661 agagcagtta ataaaaaagg aaaaggttta cctggcatgg gtaccagcac acaaaggaat 3721 tggaggaaat gaacaagtag ataaattagt cagtcaagga atcaggaaag tactattttt 3781 ggatggaata gataaggctc aagaagaaca tgagaaatat cacaacaatt ggagagcaat 3841 ggctagtgat tttaacctac cacccgtggt agcaaaagaa atagtagcta gctgtgataa 3901 atgtcagcta aaaggagaag ccatgcatgg acaagtagac tgtagtccag gaatatggca 3961 attagattgt acacacttag aaggaaaagt tatcctggta gcagttcatg tagccagtgg 4021 ctatatagaa gcagaagtta ttccagcaga aacagggcag gaaacagcat attttctttt 4081 aaaattagca ggaagatggc cagtaaaagt agtacataca gacaatggca gcaatttcac 4141 cagtgctgca gttaaggccg cctgttggtg ggcaggtatc aaacaggaat ttggaattcc 4201 ctacaatccc caaagtcaag gagtagtaga atctatgaat aaagaattaa agaaaattat 4261 aggacaggta agagatcaag ctgaacatct taagacagca gtacaaatgg cagtattcat 4321 ccacaatttt aaaagaagaa gggggattgg gggatacagt gcaggggaaa gaataataga 4381 cataatagca acagacatac aaactaaaga attacaaaaa caaattataa aaattcaaaa 4441 ttttcgggtt tattacagag acagcagaga tccaatttgg aaaggaccag caaagctcct 4501 ctggaaaggt gaaggggcag tagtaataca agacaagagt gacataaagg tagtaccaag 4561 aagaaaagta aagattatta gggattatgg aaaacagatg gcaggtgatg attgtgtggc 4621 aagtagacag gatgaggatt aaaacatgga aaagtttagt aaaacaccat atgtatgttt 4681 caaagaaagc taacagatgg ttttatagac atcactatga aagcccccac ccaaaaataa 4741 gttcagaagt acacatccca ctaggagaag ctagactggt aataaaaaca tattggggtc 4801 tgcatacagg agaaagagaa tggcatctgg gtcagggagt ctccatagaa tggaggaaaa 4861 ggagatatag cacacaagta gaccctggcc tggcagacca actaattcat atgtattatt 4921 ttgattgttt ttcagaatct gctataagaa aagccatatt aggagatata gttagtccta 4981 ggtgtgagta tcaagcagga cataacaagg taggatccct acagtatttg gcactaacag 5041 cattaatagc accaaaacag ataaagccac ctttgcctag 
tgttaggaag ctaacagaag 5101 atagatggaa caagccccag cagaccaggg gccacagagg gagccataca atgaatgggc 5161 attagagctt ttagaggagc ttaagagtga agctgttaga cattttccta ggatatggct 5221 ccatagctta ggacaacata tttatgaaac ttatggggat acctgggtag gagttgaagc 5281 tataataaga atactgcaac aattactgtt tattcatttc agaattgggt gtcaacatag 5341 cagaataggc attattcgac agagaagagc aagaaatgga tccagtagat cctaacctag 5401 agccctggaa ccatccagga agtcagccta ggactccttg taacaagtgt cattgtaaaa 5461 agtgttgcta tcattgccca gtttgcttct taaacaaagg cttaggcatc tcctatggca 5521 ggaagaagcg gagacagcga cgaggacctc ctcaaggcgg tcaggctcat caagttccta 5581 taccaaagca gtaagtagta catgtaatgc aacctttagg gataatagca atagcagcat 5641 tagtagtagc aataatacta gcaatagttg tgtggaccat agtattcata gaatatagaa 5701 ggataaaaaa gcaaaggaga atagactgtt tacttgatag aataacagaa agagcagaag 5761 acagtggcaa tgagagcgag ggggatagag agaaattgtc aaaactggtg gaaatggggc 5821 atcatgctcc ttgggatatt gatgacctgt agtgctgcag acaatctgtg ggtcacagtt 5881 tattatgggg tgcctgtatg gaaggaagca accaccactc tattttgtgc atcagatgct 5941 aaatcatatg aaacagaggc acataatatc tgggccacac atgcctgtgt acccacggac 6001 cccaacccac aagaaatagc actggaaaat gtgacagaaa actttaacat gtggaaaaat 6061 aacatggtgg aacagatgca tgaggatata atcagtttat gggatcaaag cctaaaacca 6121 tgtgtaaaat taaccccact ctgtgtcact ttaaactgta gtgatgaatt gaggaacaat 6181 ggcactatgg ggaacaatgt cactacagag gagaaaggaa tgaaaaactg ctctttcaat 6241 gtaaccacag tactaaaaga taagaagcag caagtatatg cactttttta tagacttgat 6301 atagtaccaa tagacaatga tagtagtacc aatagtacca attataggtt aataaattgt 6361 aatacctcag ccattacaca ggcttgtcca aaggtatcct ttgagccaat tcccatacat 6421 tattgtgccc cagctggttt tgcgattcta aagtgtagag ataagaagtt caatggaaca 6481 ggcccatgca caaatgtcag cacagtacaa tgtacacatg gaattaggcc agtggtgtca 6541 actcaactgc tgttgaatgg cagtctagca gaagaagagg tcataattag atccgaaaat 6601 ctcacaaaca atgctaaaaa cataatagca catcttaatg aatctgtaaa aattacctgt 6661 gcaaggccct atcaaaatac aagacaaaga acacctatag gactagggca atcactctat 6721 actacaagat caagatcaat aataggacaa gcacattgta atattagtag 
agcacaatgg 6781 agtaaaactt tacaacaagt agctagaaaa ttaggaaccc ttcttaacaa aacaataata 6841 aagtttaaac catcctcagg aggggaccca gaaattacaa cacacagttt taattgtgga 6901 ggggaattct tctactgtaa tacatcagga ctgtttaata gtacatggaa tattagtgca 6961 tggaataata ttacagagtc aaataatagc acaaacacaa acatcacact ccaatgcaga 7021 ataaaacaaa ttataaagat ggtggcaggc aggaaagcaa tatatgcccc tcctatcgaa 7081 agaaacattc tatgttcatc aaatattaca gggctactat tgacaagaga tggtggtata 7141 aataatagta ctaacgagac ctttagacct ggaggaggag atatgaggga caattggaga 7201 agtgaattat ataaatataa ggtagtacaa attgaaccac taggagtagc acccaccagg 7261 gcaaagagaa gagtggtgga aagagaaaaa agagcaatag gattaggagc tatgttcctt 7321 gggttcttgg gagcagcagg aagcacgatg ggcgcacggt cagtgacgct gacggtacag 7381 gccagacaat taatgtctgg tatagtgcaa cagcaaaaca atttgctgag ggctatagag 7441 gcgcaacagc atctgttgca actcacggtc tggggcatta aacagctcca ggcaagaatc 7501 ctggctgtgg aaagatacct aaaggatcaa cagctcctag gaatttgggg ttgctctgga 7561 aaacacattt gcaccactaa tgtgccctgg aactctagtt ggagtaatag atctctaaat 7621 gagatttggc agaacatgac ctggatggag tgggaaagag aaattgacaa ttacacaggc 7681 ttaatatata gcttaattga ggaatcgcag acccagcaag aaaagaatga aaaagaattg 7741 ttggaattgg acaagtgggc aagtttgtgg aattggttta gcataacaca atggctgtgg 7801 tatataaaaa tattcataat gataatagga ggcttgatag gtttaagaat agtttttgct 7861 gtgctttctt tagtaaatag agttaggcag ggatactcac ctctgtcgtt tcagaccctc 7921 ctcccagccc cgaggggacc cgacaggccc gaaggaacag aagaagaagg tggagagcga 7981 ggcagagaca gatccgtgag attgctgaac ggattctcgg cacttatctg ggacgacctg 8041 cggagcctgt gcctcttcag ctaccaccgc ttgagagact taatcttaat tgcagtgagg 8101 attgtagaac ttctgggacg cagggggtgg gacatcctca aatatctgtg gaatctccta 8161 cagtattgga gtcaggaact gaggaacagt gctagtagct tgtttgatgc catagcaata 8221 gcagtagctg aggggacaga tagagttata gaaataatac aaagagcttg cagagctgtt 8281 cttaacatac ccagaagaat aagacagggc ttagaaaggt ctttacttta aaatgggtgg 8341 caaatggtca aaaagtagta tagtgggatg gcctgctata agggaaagaa taagaagaac 8401 taatccagca gcagatgggg taggagcagt atctcgagac ctggaaaaac atggggcaat 
8461 cacaagtagc aatacagcaa gtactaatgc tgactgtgcc tggctagaag cacaagaaga 8521 gagcgacgag gtgggctttc cagtcagacc ccaggtacct ttaagaccaa tgacttacaa 8581 agaagctcta gatctcagcc actttttaaa agaaaagggg ggactggaag ggctaatttg 8641 gtccaaaaag agacaagaga tccttgatct ttgggtctac aacacacaag gcatcttccc 8701 tgattggcaa aactacacac cagggccagg gatcagatat ccactaacct ttggatggtg 8761 ctacgagcta gtaccagttg atccacagga ggtagaagaa gacactgaag gagagaccaa 8821 cagcttgtta caccctatat gccagcatgg aatggaggac ccggagagac aagtgttaaa 8881 atggagattt aacagcagac tagcatttga gcacaaggcc cgagagatgc atccggagtt 8941 ctacaaaaac tgatgacacc gagctttcta caagggactt tccgctgggg actttccagg 9001 gaggcgtgga ctgggcggga ctggggagtg gctaaccctc agatgctgca tataagcagc 9061 tgctttttgc ctgtactggg tctctctggt tagaccagat ttgagcctgg gagctctctg 9121 gctagctagg gaacccactg cttaagcctc aataaagctt gccttgagtg cttcaa // LOCUS HIVHXB2 9718 bp ss-RNA VRL 11-AUG-1990 DEFINITION Human immunodeficiency virus type 1 (HIV-1) proviral complete genome. ACCESSION K03455 KEYWORDS . SOURCE Human immunodeficiency virus type 1 (isolate HXB2) proviral DNA. ORGANISM Human immunodeficiency virus type 1 Viridae; ss-RNA enveloped viruses; Positive strand RNA virus; Retroviridae; Lentivirinae. REFERENCE 1 (bases 1 to 653) AUTHORS Starcich,B., Ratner,L., Josephs,S.F., Okamato,T., Gallo,R.C. and Wong-Staal,F. TITLE Characterization of long terminal repeat sequences of HTLV-III JOURNAL Science 227, 538-540 (1985) STANDARD full staff_review REFERENCE 2 (bases 493 to 674; 9577 to 9718) AUTHORS Ratner,L., Haseltine,W., Patarca,R., Livak,K.J., Starcich,B., Josephs,S.J., Doran,E.R., Rafalski,J.A., Whitehorn,E.A., Baumeister,K., Ivanoff,L., Petteway,S.R.Jr., Pearson,M.L., Lautenberger,J.A., Papas,T.S., Ghrayeb,J., Chang,N.T., Gallo,R.C. and Wong-Staal,F. 
TITLE Complete nucleotide sequence of the AIDS virus, HTLV-III JOURNAL Nature 313, 277-284 (1985) STANDARD full staff_review REFERENCE 3 (bases 1 to 9718) AUTHORS Ratner,L., Fisher,A., Jagodzinski,L.L., Mitsuya,H., Liou,R.-S., Gallo,R.C. and Wong-Staal,F. TITLE Complete nucleotide sequences of functional clones of the AIDS virus JOURNAL AIDS Res. Hum. Retroviruses 3, 57-69 (1987) STANDARD full staff_review REFERENCE 4 (bases 5611 to 5611; revises [3]) AUTHORS Ratner,L. JOURNAL Unpublished (1987) Washington U. Med School, St. Louis, MO STANDARD full staff_review REFERENCE 5 (bases 6224 to 8794) AUTHORS Reitz,M.S. JOURNAL Unpublished (1988) STANDARD full staff_review COMMENT Submitted in computer-readable form by L. Ratner. A second envelope gene sequence for the HXB2D clone has been independently determined by Reitz and co-workers ([5]); the seven or so differences from the Ratner sequence ([3] and [4]) have not been resolved nor annotated herein. The vpU cds annotated below does not possess a start codon in the normal position (bases 6061 to 6063; "ACG"). As there are no nearby in-frame upstream or downstream start codons, HXB2 may not possess a functional vpU cds. This clone is infectious [3]. Sibling sequences are available (see 1989: I-A-47). FEATURES from to/span description pept 789 2291 gag polyprotein pept < 2084 5095 pol polyprotein (NH2-terminus uncertain; AA at 2084) pept 5040 5618 vif protein pept 5558 5794 vpr protein pept 5830 6044 tat protein, exon 2 (first expressed exon) 8378 8423 tat protein, exon 3 (AA at 8379) pept 5969 6044 rev protein, exon 2 (first expressed exon) 8378 8652 rev protein, exon 3 (AA at 8380) pept < 6061 6309 vpu protein (AA at 6061, put.) 
pept 6224 8794 envelope polyprotein pept 8796 9167 nef protein (premature termination) IVS 743 5776 tat, rev, nef mRNA intron 1 IVS 6045 8377 tat cds intron 2 IVS 6045 8377 rev cds intron 2 IVS 6045 8377 tat, rev, nef mRNA intron 2 LTR 1 634 5' LTR LTR 9085 9718 3' LTR rpt 454 551 R repeat 5' copy rpt 9538 9635 R repeat 3' copy binding 636 653 primer (Lys-tRNA) binding site signal 9611 9616 poly-A signal BASE COUNT 3411 a 1773 c 2370 g 2164 t ORIGIN 5' terminus of 5' LTR-U3. 1 tggaagggct aattcactcc caacgaagac aagatatcct tgatctgtgg atctaccaca 61 cacaaggcta cttccctgat tagcagaact acacaccagg gccagggatc agatatccac 121 tgacctttgg atggtgctac aagctagtac cagttgagcc agagaagtta gaagaagcca 181 acaaaggaga gaacaccagc ttgttacacc ctgtgagcct gcatggaatg gatgacccgg 241 agagagaagt gttagagtgg aggtttgaca gccgcctagc atttcatcac atggcccgag 301 agctgcatcc ggagtacttc aagaactgct gacatcgagc ttgctacaag ggactttccg 361 ctggggactt tccagggagg cgtggcctgg gcgggactgg ggagtggcga gccctcagat 421 cctgcatata agcagctgct ttttgcctgt actgggtctc tctggttaga ccagatctga 481 gcctgggagc tctctggcta actagggaac ccactgctta agcctcaata aagcttgcct 541 tgagtgcttc aagtagtgtg tgcccgtctg ttgtgtgact ctggtaacta gagatccctc 601 agaccctttt agtcagtgtg gaaaatctct agcagtggcg cccgaacagg gacctgaaag 661 cgaaagggaa accagagctc tctcgacgca ggactcggct tgctgaagcg cccgcacggc 721 aagaggcgag gggcggcgac tggtgagtac gccaaaaatt ttgactagcg gaggctagaa 781 ggagagagat gggtgcgaga gcgtcagtat taagcggggg agaattagat cgatgggaaa 841 aaattcggtt aaggccaggg ggaaagaaaa aatataaatt aaaacatata gtatgggcaa 901 gcagggagct agaacgattc gcagttaatc ctggcctgtt agaaacatca gaaggctgta 961 gacaaatact gggacagcta caaccatccc ttcagacagg atcagaagaa cttagatcat 1021 tatataatac agtagcaacc ctctattgtg tgcatcaaag gatagagata aaagacacca 1081 aggaagcttt agacaagata gaggaagagc aaaacaaaag taagaaaaaa gcacagcaag 1141 cagcagctga cacaggacac agcaatcagg tcagccaaaa ttaccctata gtgcagaaca 1201 tccaggggca aatggtacat caggccatat cacctagaac tttaaatgca tgggtaaaag 1261 tagtagaaga gaaggctttc agcccagaag tgatacccat 
gttttcagca ttatcagaag 1321 gagccacccc acaagattta aacaccatgc taaacacagt ggggggacat caagcagcca 1381 tgcaaatgtt aaaagagacc atcaatgagg aagctgcaga atgggataga gtgcatccag 1441 tgcatgcagg gcctattgca ccaggccaga tgagagaacc aaggggaagt gacatagcag 1501 gaactactag tacccttcag gaacaaatag gatggatgac aaataatcca cctatcccag 1561 taggagaaat ttataaaaga tggataatcc tgggattaaa taaaatagta agaatgtata 1621 gccctaccag cattctggac ataagacaag gaccaaagga accctttaga gactatgtag 1681 accggttcta taaaactcta agagccgagc aagcttcaca ggaggtaaaa aattggatga 1741 cagaaacctt gttggtccaa aatgcgaacc cagattgtaa gactatttta aaagcattgg 1801 gaccagcggc tacactagaa gaaatgatga cagcatgtca gggagtagga ggacccggcc 1861 ataaggcaag agttttggct gaagcaatga gccaagtaac aaattcagct accataatga 1921 tgcagagagg caattttagg aaccaaagaa agattgttaa gtgtttcaat tgtggcaaag 1981 aagggcacac agccagaaat tgcagggccc ctaggaaaaa gggctgttgg aaatgtggaa 2041 aggaaggaca ccaaatgaaa gattgtactg agagacaggc taatttttta gggaagatct 2101 ggccttccta caagggaagg ccagggaatt ttcttcagag cagaccagag ccaacagccc 2161 caccagaaga gagcttcagg tctggggtag agacaacaac tccccctcag aagcaggagc 2221 cgatagacaa ggaactgtat cctttaactt ccctcaggtc actctttggc aacgacccct 2281 cgtcacaata aagatagggg ggcaactaaa ggaagctcta ttagatacag gagcagatga 2341 tacagtatta gaagaaatga gtttgccagg aagatggaaa ccaaaaatga tagggggaat 2401 tggaggtttt atcaaagtaa gacagtatga tcagatactc atagaaatct gtggacataa 2461 agctataggt acagtattag taggacctac acctgtcaac ataattggaa gaaatctgtt 2521 gactcagatt ggttgcactt taaattttcc cattagccct attgagactg taccagtaaa 2581 attaaagcca ggaatggatg gcccaaaagt taaacaatgg ccattgacag aagaaaaaat 2641 aaaagcatta gtagaaattt gtacagagat ggaaaaggaa gggaaaattt caaaaattgg 2701 gcctgaaaat ccatacaata ctccagtatt tgccataaag aaaaaagaca gtactaaatg 2761 gagaaaatta gtagatttca gagaacttaa taagagaact caagacttct gggaagttca 2821 attaggaata ccacatcccg cagggttaaa aaagaaaaaa tcagtaacag tactggatgt 2881 gggtgatgca tatttttcag ttcccttaga tgaagacttc aggaagtata ctgcatttac 2941 catacctagt ataaacaatg agacaccagg gattagatat cagtacaatg 
tgcttccaca 3001 gggatggaaa ggatcaccag caatattcca aagtagcatg acaaaaatct tagagccttt 3061 tagaaaacaa aatccagaca tagttatcta tcaatacatg gatgatttgt atgtaggatc 3121 tgacttagaa atagggcagc atagaacaaa aatagaggag ctgagacaac atctgttgag 3181 gtggggactt accacaccag acaaaaaaca tcagaaagaa cctccattcc tttggatggg 3241 ttatgaactc catcctgata aatggacagt acagcctata gtgctgccag aaaaagacag 3301 ctggactgtc aatgacatac agaagttagt ggggaaattg aattgggcaa gtcagattta 3361 cccagggatt aaagtaaggc aattatgtaa actccttaga ggaaccaaag cactaacaga 3421 agtaatacca ctaacagaag aagcagagct agaactggca gaaaacagag agattctaaa 3481 agaaccagta catggagtgt attatgaccc atcaaaagac ttaatagcag aaatacagaa 3541 gcaggggcaa ggccaatgga catatcaaat ttatcaagag ccatttaaaa atctgaaaac 3601 aggaaaatat gcaagaatga ggggtgccca cactaatgat gtaaaacaat taacagaggc 3661 agtgcaaaaa ataaccacag aaagcatagt aatatgggga aagactccta aatttaaact 3721 gcccatacaa aaggaaacat gggaaacatg gtggacagag tattggcaag ccacctggat 3781 tcctgagtgg gagtttgtta atacccctcc cttagtgaaa ttatggtacc agttagagaa 3841 agaacccata gtaggagcag aaaccttcta tgtagatggg gcagctaaca gggagactaa 3901 attaggaaaa gcaggatatg ttactaatag aggaagacaa aaagttgtca ccctaactga 3961 cacaacaaat cagaagactg agttacaagc aatttatcta gctttgcagg attcgggatt 4021 agaagtaaac atagtaacag actcacaata tgcattagga atcattcaag cacaaccaga 4081 tcaaagtgaa tcagagttag tcaatcaaat aatagagcag ttaataaaaa aggaaaaggt 4141 ctatctggca tgggtaccag cacacaaagg aattggagga aatgaacaag tagataaatt 4201 agtcagtgct ggaatcagga aagtactatt tttagatgga atagataagg cccaagatga 4261 acatgagaaa tatcacagta attggagagc aatggctagt gattttaacc tgccacctgt 4321 agtagcaaaa gaaatagtag ccagctgtga taaatgtcag ctaaaaggag aagccatgca 4381 tggacaagta gactgtagtc caggaatatg gcaactagat tgtacacatt tagaaggaaa 4441 agttatcctg gtagcagttc atgtagccag tggatatata gaagcagaag ttattccagc 4501 agaaacaggg caggaaacag catattttct tttaaaatta gcaggaagat ggccagtaaa 4561 aacaatacat actgacaatg gcagcaattt caccggtgct acggttaggg ccgcctgttg 4621 gtgggcggga atcaagcagg aatttggaat tccctacaat ccccaaagtc aaggagtagt 
4681 agaatctatg aataaagaat taaagaaaat tataggacag gtaagagatc aggctgaaca 4741 tcttaagaca gcagtacaaa tggcagtatt catccacaat tttaaaagaa aaggggggat 4801 tggggggtac agtgcagggg aaagaatagt agacataata gcaacagaca tacaaactaa 4861 agaattacaa aaacaaatta caaaaattca aaattttcgg gtttattaca gggacagcag 4921 aaattcactt tggaaaggac cagcaaagct cctctggaaa ggtgaagggg cagtagtaat 4981 acaagataat agtgacataa aagtagtgcc aagaagaaaa gcaaagatca ttagggatta 5041 tggaaaacag atggcaggtg atgattgtgt ggcaagtaga caggatgagg attagaacat 5101 ggaaaagttt agtaaaacac catatgtatg tttcagggaa agctagggga tggttttata 5161 gacatcacta tgaaagccct catccaagaa taagttcaga agtacacatc ccactagggg 5221 atgctagatt ggtaataaca acatattggg gtctgcatac aggagaaaga gactggcatt 5281 tgggtcaggg agtctccata gaatggagga aaaagagata tagcacacaa gtagaccctg 5341 aactagcaga ccaactaatt catctgtatt actttgactg tttttcagac tctgctataa 5401 gaaaggcctt attaggacac atagttagcc ctaggtgtga atatcaagca ggacataaca 5461 aggtaggatc tctacaatac ttggcactag cagcattaat aacaccaaaa aagataaagc 5521 cacctttgcc tagtgttacg aaactgacag aggatagatg gaacaagccc cagaagacca 5581 agggccacag agggagccac acaatgaatg gacactagag cttttagagg agcttaagaa 5641 tgaagctgtt agacattttc ctaggatttg gctccatggc ttagggcaac atatctatga 5701 aacttatggg gatacttggg caggagtgga agccataata agaattctgc aacaactgct 5761 gtttatccat tttcagaatt gggtgtcgac atagcagaat aggcgttact cgacagagga 5821 gagcaagaaa tggagccagt agatcctaga ctagagccct ggaagcatcc aggaagtcag 5881 cctaaaactg cttgtaccaa ttgctattgt aaaaagtgtt gctttcattg ccaagtttgt 5941 ttcataacaa aagccttagg catctcctat ggcaggaaga agcggagaca gcgacgaaga 6001 gctcatcaga acagtcagac tcatcaagct tctctatcaa agcagtaagt agtacatgta 6061 acgcaaccta taccaatagt agcaatagta gcattagtag tagcaataat aatagcaata 6121 gttgtgtggt ccatagtaat catagaatat aggaaaatat taagacaaag aaaaatagac 6181 aggttaattg atagactaat agaaagagca gaagacagtg gcaatgagag tgaaggagaa 6241 atatcagcac ttgtggagat gggggtggag atggggcacc atgctccttg ggatgttgat 6301 gatctgtagt gctacagaaa aattgtgggt cacagtctat tatggggtac ctgtgtggaa 6361 
ggaagcaacc accactctat tttgtgcatc agatgctaaa gcatatgata cagaggtaca 6421 taatgtttgg gccacacatg cctgtgtacc cacagacccc aacccacaag aagtagtatt 6481 ggtaaatgtg acagaaaatt ttgacatgtg gaaaaatgac atggtagaac agatgcatga 6541 ggatataatc agtttatggg atcaaagcct aaagccatgt gtaaaattaa ccccactctg 6601 tgttagttta aagtgcactg atttgaagaa tgatactaat accaatagta gtagcgggag 6661 aatgataatg gagaaaggag agataaaaaa ctgctctttc aatatcagca caagcataag 6721 aggtaaggtg cagaaagaat atgcattttt ttataaactt gatataatac caatagataa 6781 tgatactacc agctatagct tgacaagttg taacacctca gtcattacac aggcctgtcc 6841 aaaggtatcc tttgagccaa ttcccataca ttattgtgcc ccggctggtt ttgcgattct 6901 aaaatgtaat aataagacgt tcaatggaac aggaccatgt acaaatgtca gcacagtaca 6961 atgtacacat ggaattaggc cagtagtatc aactcaactg ctgttaaatg gcagtctagc 7021 agaagaagag gtagtaatta gatctgtcaa tttcacggac aatgctaaaa ccataatagt 7081 acagctgaac acatctgtag aaattaattg tacaagaccc aacaacaata caagaaaaag 7141 aatccgtatc cagagaggac cagggagagc atttgttaca ataggaaaaa taggaaatat 7201 gagacaagca cattgtaaca ttagtagagc aaaatggaat aacactttaa aacagataga 7261 tagcaaatta agagaacaat tcggaaataa taaaacaata atctttaagc aatcctcagg 7321 aggggaccca gaaattgtaa cgcacagttt taattgtgga ggggaatttt tctactgtaa 7381 ttcaacacaa ctgtttaata gtacttggtt taatagtact tggagtactg aagggtcaaa 7441 taacactgaa ggaagtgaca caatcaccct cccatgcaga ataaaacaaa ttataaacat 7501 gtggcagaaa gtaggaaaag caatgtatgc ccctcccatc agtggacaaa ttagatgttc 7561 atcaaatatt acagggctgc tattaacaag agatggtggt aatagcaaca atgagtccga 7621 gatcttcaga cttggaggag gagatatgag ggacaattgg agaagtgaat tatataaata 7681 taaagtagta aaaattgaac cattaggagt agcacccacc aaggcaaaga gaagagtggt 7741 gcagagagaa aaaagagcag tgggaatagg agctttgttc cttgggttct tgggagcagc 7801 aggaagcact atgggcgcag cctcaatgac gctgacggta caggccagac aattattgtc 7861 tggtatagtg cagcagcaga acaatttgct gagggctatt gaggcgcaac agcatctgtt 7921 gcaactcaca gtctggggca tcaagcagct ccaagcaaga atcctagctg tggaaagata 7981 cctaaaggat caacagctcc tagggatttg gggttgctct ggaaaactca tttgcaccac 8041 tgctgtgcct 
tggaatgcta gttggagtaa taaatctctg gaacagatct ggaatcacac 8101 gacctggatg gagtgggaca gagaaattaa caattacaca agcttaatac actccttaat 8161 tgaagaatcg caaaaccagc aagaaaagaa tgaacaagaa ttattggaat tagataaatg 8221 ggcaagtttg tggaattggt ttaacataac aaattggctg tggtatataa aattattcat 8281 aatgatagta ggaggcttgg taggtttaag aatagttttt gctgtacttt ctatagtgaa 8341 tagagttagg cagggatatt caccattatc gtttcagacc cacctcccaa tcccgagggg 8401 acccgacagg cccgaaggaa tagaagaaga aggtggagag agagacagag acagatccat 8461 tcgattagtg aacggatcct tggcacttat ctgggacgat ctgcggagcc tgtgcctctt 8521 cagctaccac cgcttgagag acttactctt gattgtaacg aggattgtgg aacttctggg 8581 acgcaggggg tgggaagccc tcaaatattg gtggaatctc ctacagtatt ggagtcagga 8641 actaaagaat agtgctgtta gcttgctcaa tgccacagcc atagcagtag ctgaggggac 8701 agatagggtt atagaagtag tacaaggagc ttgtagagct attcgccaca tacctagaag 8761 aataagacag ggcttggaaa ggattttgct ataagatggg tggcaagtgg tcaaaaagta 8821 gtgtgattgg atggcttact gtaagggaaa gaatgagacg agctgagcca gcagcagatg 8881 gggtgggagc agcatctcga gacctggaaa aacatggagc aatcacaagt agcaacacag 8941 cagctaccaa tgctgcttgt gcctggctag aagcacaaga ggaggaggag gtgggttttc 9001 cagtcacacc tcaggtacct ttaagaccaa tgacttacaa ggcagctgta gatcttagcc 9061 actttttaaa agaaaagggg ggactggaag ggctaattca ctcccaaaga agacaagata 9121 tccttgatct gtggatctac cacacacaag gctacttccc tgattgacag aactacacac 9181 cagggccagg ggtcagatat ccactgacct ttggatggtg ctacaagcta gtaccagttg 9241 agccagataa gatagaagag gccaataaag gagagaacac cagcttgtta caccctgtga 9301 gcctgcatgg gatggatgac ccggagagag aagtgttaga gtggaggttt gacagccgcc 9361 tagcatttca tcacgtggcc cgagagctgc atccggagta cttcaagaac tgctgacatc 9421 gagcttgcta caagggactt tccgctgggg actttccagg gaggcgtggc ctgggcggga 9481 ctggggagtg gcgagccctc agatcctgca tataagcagc tgctttttgc ctgtactggg 9541 tctctctggt tagaccagat ctgagcctgg gagctctctg gctaactagg gaacccactg 9601 cttaagcctc aataaagctt gccttgagtg cttcaagtag tgtgtgcccg tctgttgtgt 9661 gactctggta actagagatc cctcagaccc ttttagtcag tgtggaaaat ctctagca // LOCUS HIVMAB 59 bp ss-RNA 
VRL 11-AUG-1990 DEFINITION Human immunodeficiency virus type 1 (HIV-1) gag region. ACCESSION M27955 KEYWORDS . SOURCE Human immunodeficiency virus type 1 (isolate MAB), clone 01. ORGANISM Human immunodeficiency virus type 1 Viridae; ss-RNA enveloped viruses; Positive strand RNA virus; Retroviridae; Lentivirinae. REFERENCE 1 (bases 1 to 59) AUTHORS Goodenow,M., Huet,T., Saurin,W., Kwok,S., Sninsky,J. and Wain-Hobson,S. TITLE HIV-1 isolates are rapidly evolving quasispecies: Evidence for viral mixtures and preferred nucleotide substitutions JOURNAL J. Acquir. Immune Defic. Syndr. 2, 344-52 (1989) STANDARD full staff_review COMMENT [1] kindly submitted in computer-readable form by S. Wain-Hobson. Sibling sequences are available (see 1989: I-A-184). FEATURES from to/span description pept < 1 > 59 gag protein (AA at 2) BASE COUNT 24 a 7 c 12 g 16 t ORIGIN Within gag cds. 1 ctataaaaga tggataatcc tggggttaaa taaaatagta agaatgtata gccctgtca // LOCUS HIVMN 9738 bp ss-RNA VRL 11-AUG-1990 DEFINITION Human immunodeficiency virus type 1 (HIV-1) proviral complete genome. ACCESSION M17449 KEYWORDS complete genome. SOURCE Human immunodeficiency virus type 1 (isolate MN), proviral DNA (non-infectious). ORGANISM Human immunodeficiency virus type 1 Viridae; ss-RNA enveloped viruses; Positive strand RNA virus; Retroviridae; Lentivirinae. REFERENCE 1 (bases 1 to 9738) AUTHORS Gurgo,C., Guo,H.-G., Franchini,G., Aldovini,A., Collalti,E., Farrell,K., Wong-Staal,F., Gallo,R.C. and Reitz,M.S.Jr. TITLE Envelope sequences of two new United States HIV-1 isolates JOURNAL Virology 164, 531-536 (1988) STANDARD full staff_review COMMENT Sequence kindly provided in computer-readable form by M. Reitz, N.C.I., Bethesda, MD. 20892 U.S.A. The MN isolate was taken from a pediatric AIDS patient in 1984. The pol coding sequence shows an in-frame stop codon at 3783; the nef protein is prematurely truncated at position 9357. The vpU protein is prematurely truncated at position 6142.
FEATURES from to/span description pept 787 2307 gag polyprotein pept < 2091 3782 pol polyprotein (NH2-terminus uncertain; AA at 2091; in-frame stop codon at 3783) 3786 5111 pol polyprotein pept 5056 5634 vif protein pept 5574 5864 vpr protein pept 5845 6059 tat protein, exon 2 (first expressed exon) 8396 8486 tat protein, exon 3 (AA at 8397) pept 5984 6059 rev protein, exon 2 (first expressed exon) 8396 8667 rev protein, exon 3 (AA at 8398) pept 6076 6144 vpu protein (premature termination) pept 6239 8809 envelope polyprotein pept 8811 9359 nef protein (premature termination at 9357 relative to other HIV-1 sequences) LTR 1 633 5' LTR LTR 9106 9738 3' LTR rpt 453 550 R repeat 5' copy rpt 9558 9655 R repeat 3' copy binding 635 652 primer (Lys-tRNA) binding site site 3783 3785 pol cds in-frame stop codon signal 9631 9636 poly-A signal BASE COUNT 3463 a 1769 c 2344 g 2162 t ORIGIN Left end of viral genome. 1 tggatgggtt aatttactcc caaaagagac aagacatcct tgatctgtgg gtctaccaca 61 cacaaggcta cttccctgat tggcagaact acacaccagg gccagggatc agatatccac 121 tgacctttgg atggtgcttc aagctagtac cagttgagcc agagaagata gaagaggcca 181 ataaaggaga gaacaactgc ttgttacacc ctatgagcca gcatggatgg atgacccgga 241 gagagaagtg ttagtgtgga agtctgacag ccacctagca tttcagcatt atgcccgaga 301 gctgcatccg gagtactaca agaactgctg acatcgagct atctacaagg gactttccgc 361 tggggacttt ccagggaggt gtggcctggg cgggaccggg gagtggcgag ccctcagatc 421 gtgcatataa gcagctgctt tctgcctgta ctgggtctct ctggttagac cagatctgag 481 cctgggagct ctctggctaa ctagggaacc cactgcttaa gcctcaataa agcttgcctt 541 gagtgcttca agtagtgtgt gcccgtctgt tatgtgactc tggtagctag agatccctca 601 gatcctttta ggcagtgtgg aaaatctcta gcagtggcgc ccgaacaggg acttgaaagc 661 gaaagaaaaa ccagagctct ctcgacgcag gactcggctt gctgaagcgc gcacggcaag 721 aggcgagggg cggcgactgg tgagtacgcc aaaaattctt gactagcgga ggctagaagg 781 agagagatgg gtgcgagagc gtcggtatta agcgggggag aattagatcg atgggaaaac 841 attcggttaa ggccaggggg aaagaaaaaa tataaattaa aacatgtagt atgggcaagc 901 agggagctag aacgattcgc agtcaatcct 
ggcctgttag aaacatcaga aggctgtaga 961 caaatactgg gacagctaca accatccctt cagacaggat cagaagaact taaatcatta 1021 tataatacag tagcaaccct ctattgtgtg catcaaaaga tagagataaa agacaccaag 1081 gaagctttag agaaaataga ggaagagcaa aacaaaagta agaaaaaagc acagcaagca 1141 gcagctgaca caggaaacag aggaaacagc agccaagtca gccaaaatta ccccatagtg 1201 cagaacatcg aggggcaaat ggtacatcag gccatatcac ctagaacttt aaatgcatgg 1261 gtaaaagtag tagaagagaa ggctttcagc ccagaagtaa tacccatgtt ttcagcatta 1321 tcagaaggag ccaccccaca agatttaaac accatgctaa acacagtggg gggacatcaa 1381 gcagccatgc aaatgttaaa agagaccatc aatgaggaag ctgcagaatg ggatagattg 1441 catccagtgc atgcagggcc tattacacca ggccagatga gagaaccaag gggaagtgac 1501 atagcaggaa ctactagtac ccttcaggaa caaataggat ggatgacaaa taatccacct 1561 atcccagtag gagaaatcta taaaagatgg ataatcctgg gattaaataa aatagtaagg 1621 atgtatagcc cttccagcat tctggacata agacaaggac caaaggaacc ctttagagac 1681 tatgtagacc ggttctataa aactctaaga gccgagcaag cttcacagga ggtaaaaaac 1741 cggacgacag aaaccttgtt ggtccaaaat gcgaacccag attgtaagac tattttaaaa 1801 gcattgggac cagcagctac actagaagaa atgatgacag catgtcaggg agtgggagga 1861 cctggtcata aagcaagagt tttggcggaa gcgatgagcc aagtaacaaa ttcagctacc 1921 ataatgatgc agagaggcaa ttttaggaat caaagaaaga ttatcaagtg cttcaattgt 1981 ggcaaagaag ggcacatagc caaaaattgc agggccccta ggaaaagggg ctgttggaaa 2041 tgtggaaagg aaggacacca aatgaaagat tgtactgaga gacaggctaa ttttttaggg 2101 aagatctggc cttcctgcaa gggaaggcgg aattttcctc agagcagaac agagccaaca 2161 gccccaccag aagagagctt caggtttggg gaagagacaa caactcccta tcagaagcag 2221 gagaagaagc aggagacgat agacaaggac ctgtatcctt tagcttccct caaatcactc 2281 tttggcaacg acccattgtc acaataaaga taggggggca actaaaggaa gctctattag 2341 atacaggagc agatgataca gtattaggag aaatgaattt gccaagaaga tggaaaccaa 2401 aaatgatagg gggaattgga ggttttatca aagtaagaca gtatgatcag ataaccatag 2461 gaatctgtgg acataaagct ataggtacag tattagtagg acctacacct gtcaacataa 2521 ttggaagaaa tctgttgact cagcttgggt gcactttaaa ttttcccatt agtcctattg 2581 aaactgtacc agtaaaatta aagccaggaa tggatggccc 
aaaagttaaa caatggccat 2641 tgacagaaga aaaaataaaa gcattaatag aaatttgtac agaaatggaa aaggaaggga 2701 aaatttcaaa aattgggcct gaaaatccat acaatactcc agtatttgcc ataaagaaaa 2761 aagacagtac taaatggaga aaattagtag atttcagaga acttaataag aaaactcaag 2821 acttctggga agttcaatta ggaataccac atcctgcagg gttaaaaaag aaaaaatcag 2881 taacagtact ggatgtgggt gatgcatatt tttcagttcc cttagataaa gacttcagga 2941 agtatactgc atttaccata cctagtataa acaatgaaac accagggatt agatatcagt 3001 acaatgtgct tccacaggga tggaaaggat caccagcaat attccaaagt agcatgacaa 3061 aaatcttaga gccttttaga aaacaaaatc cagacatagt tatctatcaa tacatggatg 3121 atttgtatgt aggatctgac ttagaaatag ggcagcatag agcaaaaata gaggaactga 3181 gacgacatct gttgaggtgg ggatttacca caccagacaa aaaacatcag aaagaacctc 3241 cattcctttg gatgggttat gaactccatc ctgataaatg gacagtacag cctatagtgc 3301 taccagaaaa agacagctgg actgtcaatg acatacagaa gttagtggga aaattgaatt 3361 gggcaagtca gatttacgca gggattaaag taaagcaatt atgtaaactc cttagaggaa 3421 ccaaagcact aacagaagta ataccactaa cagaagaagc agagctagaa ctggcagaaa 3481 acagggaaat tctaaaagaa ccagtacatg gagtgtatta tgacccatca aaagacttaa 3541 tagcagaagt acagaagcag gggcaaggcc aatggacata tcaaatttat caagagccat 3601 ttaaaaatct gaaaacaggc aaatatgcaa gaatgagggg tgcccacact aatgatgtaa 3661 aacaattaac agaggcagtg caaaaaatag ccacagaaag catagtaata tggggaaaga 3721 ctcctaaatt tagactaccc atacaaaaag aaacatggga aacatggtgg acagagtata 3781 cgtaagccac ctggattcct gagtgggagg ttgtcaatac ccctccctta gtgaaattat 3841 ggtaccagtt agagaaagaa cccatagtag gtgcagaaac tttctatgta gatggggcag 3901 ctaacaggga gactaaaaaa ggaaaagcag gatatgttac taacagagga agacaaaagg 3961 ttgtctccct aactgacaca acaaatcaga agactgagtt acaagcaatt catctagctt 4021 tgcaagattc agggttagaa gtaaacatag taacagactc acaatatgca ttaggaatca 4081 ttcaagcaca accagataaa agtgaatcag agttagtcag tcaaataata gagcagttaa 4141 taaaaaagga aaaggtctat ctggcatggg taccagcaca caaaggaatt ggaggaaatg 4201 aacaagtaga taaattagtc agtgctggaa tcaggaaagt actattttta gatggaatag 4261 ataaggccca agaagaccat gagaaatatc acagtaattg gagagcaatg 
gctagtgact 4321 ttaacctacc acctatagta gcaaaagaaa tagtagccag ctgtgataaa tgtcagctaa 4381 aaggagaagc catgcatgga caagtagact gtagtccagg aatatggcaa ctagattgta 4441 cacatttaga aggaaaagtt atcctggtag cagttcatgt agccagtgga tacatagaag 4501 cagaagttat tccagcagag acagggcagg agacagcata ctttctctta aaattagcag 4561 gaagatggcc agtaaaaaca atacatacag acaatggccc caatttcacc agtactacgg 4621 ttaaggccgc ctgttggtgg acgggaatca agcaggaatt tggcattccc tacaatcccc 4681 aaagtcaagg agtaatagaa tctatgaata aagaattaaa gaaaattata ggacaggtaa 4741 gagatcaggc tgaacatctt aagagagcag tacaaatggc agtattcatc cacaatttta 4801 aaagaaaagg ggggattggg gggtacagtg caggggaaag aatagtaggc ataatagcaa 4861 cagacataca aactaaagaa ctacaaaaac aaattacaaa aattcaaaat tttcgggttt 4921 attacaggga cagcagagat ccactttgga aaggaccagc aaagcttctc tggaaaggtg 4981 aaggggcagt agtaatacaa gataataatg acataaaagt agtgccaaga agaaaagcaa 5041 aggtcattag ggattatgga aaacagacgg caggtgatga ttgtgtggca agcagacagg 5101 atgaggatta gaacatggaa aagtttagta aaacaccata tgtatatttc aaagaaagct 5161 aaaggacggt tttatagaca tcactatgaa agcactcatc caagaataag ttcagaagta 5221 cacatcccac taggggatgc tagattggta ataacaacat attggggtct gcatacagga 5281 gaaagagact ggcatttagg tcagggagtc tccatagaat ggaggaaaaa gagatatagc 5341 acacaagtag accctgacct agcagaccac ctaattcatc tgcattactt tgattgtttt 5401 tcagactctg ccataagaaa ggccatatta ggacatagag ttagtcctat ttgtgaattt 5461 caagcaggac ataacaaggt aggacctcta cagtacttgg cactaacagc attaataaca 5521 ccaaaaaaga taaagccacc tttgcctagt gttaagaaac tgacagagga tagatggaac 5581 aagccccaga agaccaaggg ccacagaggg agccatacaa tcaatgggca ctagagcttt 5641 tagaggagct taagaatgaa gctgttagac attttcctag gatatggctc catggcttag 5701 ggcaacatat ctatgaaact tatggggata cttgggcagg agtggaagcc ataataagaa 5761 ttctacaaca actgctgttt attcatttca gaattgggtg tcgacatagc agaataggca 5821 ttattcgaca gaggagagca agaaatggag ccagtagatc ctagactaga gccctggaag 5881 catccaggaa gtcagcctaa gactgcttgt accacttgct attgtaaaaa gtgttgcttt 5941 cattgccaag tttgtttcac aaaaaaagcc ttaggcatct cctatggcag gaagaagcgg 
6001 agacagcgac gaagagctcc tgaagacagt cagactcatc aagtttctct accaaagcag 6061 taagtagtac atgtaatgca acctttagta atagcagcaa tagtagcatt agtagtagca 6121 ggaataatag caatagttgt gtgatccata gtattcatag aatataggaa aataagaaga 6181 caaagaaaaa tagacaggtt aattgataga ataagcgaaa gagcagaaga cagtggcaat 6241 gagagtgaag gggatcagga ggaattatca gcactggtgg ggatggggca cgatgctcct 6301 tgggttatta atgatctgta gtgctacaga aaaattgtgg gtcacagtct attatggggt 6361 acctgtgtgg aaagaagcaa ccaccactct attttgtgca tcagatgcta aagcatatga 6421 tacagaggta cataatgttt gggccacaca agcctgtgta cccacagacc ccaacccaca 6481 agaagtagaa ttggtaaatg tgacagaaaa ttttaacatg tggaaaaata acatggtaga 6541 acagatgcat gaggatataa tcagtttatg ggatcaaagc ctaaagccat gtgtaaaatt 6601 aaccccactc tgtgttactt taaattgcac tgatttgagg aatactacta ataccaataa 6661 tagtactgct aataacaata gtaatagcga gggaacaata aagggaggag aaatgaaaaa 6721 ctgctctttc aatatcacca caagcataag agataagatg cagaaagaat atgcacttct 6781 ttataaactt gatatagtat caatagataa tgatagtacc agctataggt tgataagttg 6841 taatacctca gtcattacac aagcttgtcc aaagatatcc tttgagccaa ttcccataca 6901 ctattgtgcc ccggctggtt ttgcgattct aaaatgtaac gataaaaagt tcagtggaaa 6961 aggatcatgt aaaaatgtca gcacagtaca atgtacacat ggaattaggc cagtagtatc 7021 aactcaactg ctgttaaatg gcagtctagc agaagaagag gtagtaatta gatctgagaa 7081 tttcactgat aatgctaaaa ccatcatagt acatctgaat gaatctgtac aaattaattg 7141 tacaagaccc aactacaata aaagaaaaag gatacatata ggaccaggga gagcatttta 7201 tacaacaaaa aatataatag gaactataag acaagcacat tgtaacatta gtagagcaaa 7261 atggaatgac actttaagac agatagttag caaattaaaa gaacaattta agaataaaac 7321 aatagtcttt aatcaatcct caggagggga cccagaaatt gtaatgcaca gttttaattg 7381 tggaggggaa tttttctact gtaatacatc accactgttt aatagtactt ggaatggtaa 7441 taatacttgg aataatacta cagggtcaaa taacaatatc acacttcaat gcaaaataaa 7501 acaaattata aacatgtggc aggaagtagg aaaagcaatg tatgcccctc ccattgaagg 7561 acaaattaga tgttcatcaa atattacagg gctactatta acaagagatg gtggtaagga 7621 cacggacacg aacgacaccg agatcttcag acctggagga ggagatatga gggacaattg 7681 
gagaagtgaa ttatataaat ataaagtagt aacaattgaa ccattaggag tagcacccac 7741 caaggcaaag agaagagtgg tgcagagaga aaaaagagca gcgataggag ctctgttcct 7801 tgggttctta ggagcagcag gaagcactat gggcgcagcg tcagtgacgc tgacggtaca 7861 ggccagacta ttattgtctg gtatagtgca acagcagaac aatttgctga gggccattga 7921 ggcgcaacag catatgttgc aactcacagt ctggggcatc aagcagctcc aggcaagagt 7981 cctggctgtg gaaagatacc taaaggatca acagctcctg gggttttggg gttgctctgg 8041 aaaactcatt tgcaccacta ctgtgccttg gaatgctagt tggagtaata aatctctgga 8101 tgatatttgg aataacatga cctggatgca gtgggaaaga gaaattgaca attacacaag 8161 cttaatatac tcattactag aaaaatcgca aacccaacaa gaaaagaatg aacaagaatt 8221 attggaattg gataaatggg caagtttgtg gaattggttt gacataacaa attggctgtg 8281 gtatataaaa atattcataa tgatagtagg aggcttggta ggtttaagaa tagtttttgc 8341 tgtactttct atagtgaata gagttaggca gggatactca ccattgtcgt tgcagacccg 8401 ccccccagtt ccgaggggac ccgacaggcc cgaaggaatc gaagaagaag gtggagagag 8461 agacagagac acatccggtc gattagtgca tggattctta gcaattatct gggtcgacct 8521 gcggagcctg ttcctcttca gctaccacca cagagactta ctcttgattg cagcgaggat 8581 tgtggaactt ctgggacgca gggggtggga agtcctcaaa tattggtgga atctcctaca 8641 gtattggagt caggaactaa agagtagtgc tgttagcttg cttaatgcca cagctatagc 8701 agtagctgag gggacagata gggttataga agtactgcaa agagctggta gagctattct 8761 ccacatacct acaagaataa gacagggctt ggaaagggct ttgctataag atgggtggca 8821 aatggtcaaa acgtgtgact ggatggccta ctgtaaggga aagaatgaga cgagctgaac 8881 cagctgagct agcagcagat ggggtgggag cagcatcccg agacctggaa aaacatggag 8941 cactcacaag tagcaataca gcagctacca atgctgattg tgcctggcta gaagcacaag 9001 aggaggagga agtgggtttt ccagtcaaac ctcaggtacc tttaagacca atgacttaca 9061 aagcagcttt agatcttagc cactttttaa aagaaaaggg gggactggat gggttaattt 9121 actcccaaaa gagacaagac atccttgatc tgtgggtcta ccacacacaa ggctacttcc 9181 ctgattggca gaactacaca ccagggccag ggatcagata tccactgacc tttggatggt 9241 gcttcaagct agtaccagtt gagccagaga agatagaaga ggccaataaa ggagagaaca 9301 actgcttgtt acaccctatg agccagcatg gatggatgac ccggagagag aagtgttagt 9361 gtggaagtct 
gacagccacc tagcatttca gcattatgcc cgagagctgc atccggagta 9421 ctacaagaac tgctgacatc gagctatcta caagggactt tccgctgggg actttccagg 9481 gaggtgtggc ctgggcggga ccggggagtg gcgagccctc agatcgtgca tataagcagc 9541 tgctttctgc ctgtactggg tctctctggt tagaccagat ctgagcctgg gagctctctg 9601 gctaactagg gaacccactg cttaagcctc aataaagctt gccttgagtg cttcaagtag 9661 tgtgtgcccg tctgttatgt gactctggta gctagagatc cctcagatcc ttttaggcag 9721 tgtggaaaat ctctagca // LOCUS HIVNDK 9143 bp ds-DNA VRL 11-AUG-1990 DEFINITION Human immunodeficiency virus type 1 (HIV-1) proviral complete genome. ACCESSION M27323 KEYWORDS complete genome. SOURCE Human immunodeficiency virus type 1 (isolate NDK), proviral DNA infectious clone. ORGANISM Human immunodeficiency virus type 1 Viridae; ss-RNA enveloped viruses; Positive strand RNA virus; Retroviridae; Lentivirinae. REFERENCE 1 (bases 1 to 9143) AUTHORS Spire,B., Sire,J., Zachar,V., Rey,F., Barre-Sinoussi,F., Galibert,F., Hampe,A. and Chermann,J.-C. TITLE Nucleotide sequence of HIV1-NDK a highly cytopathic strain of the human immunodeficiency virus HIV1 JOURNAL Gene 81, 275-84 (1989) STANDARD full staff_review COMMENT Kindly provided prior to publication by J.-C. Chermann, Pasteur Institute, Marseille. This is an infectious molecular clone of an isolate estimated to be 10,000 times more cytopathic in vitro than a prototypical HIV-1. [1] reports that only minor sequence differences appear to be responsible for the "acute biological effect". 
FEATURES from to/span description pept 334 1827 gag polyprotein pept < 1623 4631 pol polyprotein (NH2-terminus uncertain) pept 4576 5154 vif protein pept 5094 5384 vpR protein pept 5365 5579 tat protein, exon 2 (first expressed exon) 7883 7928 tat protein, exon 3 (AA at 7884) pept 5504 5579 rev protein, exon 2 (first expressed exon) 7883 8157 rev protein, exon 3 (AA at 7885) pept 5596 5841 vpU protein pept 5759 8299 env polyprotein pept 8301 8924 nef protein pre-msg 1 9143 genomic mRNA BASE COUNT 3307 a 1617 c 2193 g 2026 t ORIGIN 5'-terminus of 5' R repeat (putative mRNA start). 1 ggtctctctg gttagaccag atttgagcct gggagctctg gctaattagg gaacccactg 61 cttaagcctc aataaagctt gccttgagcg cttcaagtag tgtgtgcccg tctgttgtgt 121 gactctggta actagagatc cctcagatcc gtttagtccg agtgaaaaat ctctagcagt 181 ggcgcccgaa cagggacctg aaagcgaaag tagaaccaga gaagatctct cgacgcagga 241 ctcggcttgc tgaagcgcgc acagcaagag gcgaggggca gcgactggtg agtacgctga 301 aatttttgac tagcggaggc tagaaggaga gagatgggtg cgagagcgtc agtattaagc 361 gggggaaaat tagatacatg ggaaagaatt cggttacggc caggaggaaa gaaaaaatat 421 gcactaaaac atttgatatg ggcaagcagg gagctagaac gatttacact taatcctggc 481 cttttagaga catcagaagg ctgtaaacaa ataataggac agctacaacc atctattcaa 541 acaggatcag aagaaattag atcattatat aatacagtag caaccctcta ttgtgtacat 601 gaaaggatag aggtaaaaga caccaaagaa gctgtagaaa agatggagga agaacaaaac 661 aaaagtaaga aaaagacaca gcaagcagca gctgatagca gccaggtcag ccaaaattac 721 cctatagtgc agaacctaca ggggcaaatg gtacatcagg ccatatcacc tagaactttg 781 aacgcatggg taaaagtaat agaagaaaag gccttcagcc cggaagtaat acccatgttt 841 tcagcattat cagaaggagc caccccacaa gatttaaaca ccatgctaaa cacagtgggg 901 ggacatcaag cagctatgca aatgctaaaa gagaccatca atgacgaagc tgcagaatgg 961 gacagattac atccagtgca tgcagggcct gttgcaccag gccaaatgag agaaccaagg 1021 ggaagtgata tagcaggaac tactagtacc cttcaggaac aaatagcatg gatgacaagc 1081 aacccaccta tcccagtagg agaaatctat aaaagatgga taatcctggg attaaataaa 1141 atagtaagaa tgtatagccc tgtcagcatt ttggacataa gacagggacc aaaggaacct 1201 
tttagagact atgtagaccg gttctataaa actctaagag ccgagcaagc ttcacaggat 1261 gtaaaaaact ggatgacaga aaccttgttg gtccaaaatg caaacccaga ttgtaaaact 1321 atcttaaaag cattgggacc acaggctaca ctagaagaaa tgatgacagc atgccaggga 1381 gtgggggggc ccggccataa agcaagagtt ttggctgagg caatgagcca agtaacaggt 1441 tcagctactg cagtaatgat gcagagaggc aattttaagg gcccaagaaa aagtattaag 1501 tgtttcaact gtggcaagga agggcacaca gcaaaaaatt gcagggcccc tagaaaaaag 1561 ggctgttgga aatgcggaag ggaaggacac caaatgaaag attgcactga aagacaggct 1621 aattttttag ggaagatttg gccttcccac aagggaaggc cggggaattt tcttcagagc 1681 agaccagagc caacagcccc accagcagag agcttcgggt ttggggagga gataaccccc 1741 tctcagaaac aggagcagaa agacaaggaa ctgtatcctt tagcttccct caaatcactc 1801 tttggcaacg acccctcgtc acaataaaga tagggggaca gctaaaggaa gctctattag 1861 atacaggagc agatgataca gtattagaag aaataaattt gccaggaaaa tggaagccaa 1921 aaatgatagg gggaattgga ggttttatca aagtaagaca gtatgatcaa atactcatag 1981 aaatctgtgg atataaagct atgggtacag tattagtagg acctacacct gtcaacataa 2041 ttggaagaaa tttgttgacc cagattggct gcactttaaa ttttccaatt agtcctattg 2101 aaactgtacc agtaaaatta aagccaggaa tggatggccc aaaagttaaa caatggccat 2161 tgacagaaga aaaaataaaa gcattaacag aaatttgtac agaaatggaa aaggaaggaa 2221 aaatttcaag aattgggcct gaaaatccat ataatactcc aatatttgcc ataaagaaaa 2281 aagacagtac caagtggaga aaattagtag atttcagaga acttaataag agaactcaag 2341 atttctggga ggttcaatta ggaataccgc atcctgcagg gctgaaaaag aaaaaatcag 2401 taacagtact ggatgtgggt gatgcatatt tctcagttcc cttagatgaa gattttagga 2461 aatataccgc atttaccata cctagtataa acaatgagac accagggatt agatatcagt 2521 acaatgtgct cccacaggga tggaaaggat caccggcaat attccaaagt agcatgacaa 2581 aaatcttaga gccctttaga aaacaaaatc cagaaatagt tatctatcaa tacatggatg 2641 atttgtatgt aggatctgac ttagaaatag ggcagcatag aacaaaaata gaggaattaa 2701 gagaacatct attgaggtgg ggatttacca caccagataa aaaacatcag aaagaacctc 2761 catttctttg gatgggttat gaactccatc ctgataaatg gacagtacag cctataaacc 2821 tgccagaaaa agaaagctgg actgtcaatg atatacagaa gttagtgggg aaattaaact 2881 gggcaagcca 
gatttatgca ggaattaaag taaagcaatt atgtaaactc cttaggggaa 2941 ccaaagcact aacagaagta gtaccactaa cagaagaagc agaattagaa ctggcagaaa 3001 acagggaaat tctaaaagaa ccagtacatg gagtgtatta tgacccatca aaagacttaa 3061 tagcagaact acagaaacaa ggggacggcc aatggacata ccaaatttat caagaaccat 3121 ttaaaaatct aaaaacagga aagtatgcaa gaacgagggg tgcccacact aatgatgtaa 3181 aacaattaac agaggcagtg caaaaaatag ccacagaaag catagtgata tggggaaaga 3241 ctcctaaatt taaactaccc atacaaaagg aaacatggga aacatggtgg atagagtatt 3301 ggcaagccac ctggattcct gagtgggaat ttgtcaatac ccctccttta gtaaaattat 3361 ggtaccagtt agagaaggaa cccataatag gagcagaaac tttctatgta gatggggcag 3421 ctaatagaga gactaaatta ggaaaagcag gatatgttac tgacagagga agacagaaag 3481 ttgtcccttt cactgacacg acaaatcaga agactgagtt acaagcaatt aatctagctt 3541 tacaggattc gggattagaa gtaaacatag taacagattc acaatatgca ctaggaatca 3601 ttcaagcaca accagataag agtgaatcag agttagtcag tcaaataata gagcagctaa 3661 taaaaaagga aaaggtttac ctggcatggg taccagcaca caaaggaatt ggaggaaatg 3721 aacaagtaga taaattagtc agtcagggaa tcaggaaagt actatttttg gatggaatag 3781 ataaggctca ggaagaacat gagaaatatc acaacaattg gagagcaatg gctagtgatt 3841 ttaacctacc acctgtggta gcgaaagaaa tagtagctag ctgtgataaa tgtcagctaa 3901 aaggagaagc catgcatgga caagtagact gtagtccagg aatatggcaa ttagattgta 3961 cacatctgga aggaaaagtt atcctggtag cagttcatgt agccagtggc tatatagaag 4021 cagaagttat tccagcagaa acggggcaag aaacagcata ctttctctta aaattagcag 4081 gaagatggcc agtaaaagta gtacatacag ataatggcag caatttcacc agtgctacag 4141 ttaaggccgc ctgttggtgg gcagggatca aacaggaatt tggaattccc tacaatcccc 4201 aaagtcaagg agtagtagaa tctatgaata aagaattaaa gaaaattata ggacaggtaa 4261 gagatcaagc tgaacatctt aagacagcag tacaaatggc agtatttatc cacaatttta 4321 aaagaaaagg ggggattggg ggatacagtg caggggaaag aataatagac ataatagcaa 4381 cagacataca aactagagaa ttacaaaaac aaatcataaa aattcaaaat tttcgggttt 4441 attacaggga cagcagagat ccaatttgga aaggaccagc aaagcttctc tggaaaggtg 4501 aaggggcagt agtaatacaa gacaatagtg acataaaggt agtaccaaga agaaaagtaa 4561 agatcattag ggattatgga 
aaacagatgg caggtgatga ttgtgtggca agtagacagg 4621 atgaggatta acacatggaa aagtttagta aaataccata tgtatgtttc aaagaaagct 4681 aacagatggt tttatagaca tcactatgac agccaccacc caaaaataag ttcagaagta 4741 cacatcccac taggagaagc tagactggta gtaacaacat attggggtct gcatacagga 4801 gaaaaagaat ggcatctggg tcagggagtc tccatagaat ggaggaaaag gagatatagc 4861 acacaagtag accctggcct ggcagaccaa ctaattcata tgtattattt tgattgtttt 4921 gcagaatctg ctataagaaa agccatatta ggacatatag ttagtcctag ttgtgagtat 4981 caagcaggac ataacaaggt aggatcctta cagtatttgg cactagcagc attaatagca 5041 ccaaaaaaga taaagccacc tttgcctagt gttaggaagc taacagaaga tagatggaac 5101 aagccccaga agaccaaggg ccgcagaggg agccatacaa tgaatggaca ttagagcttt 5161 tagaggagct taagagtgaa gctgtcagac attttcctag gatatggctc catagcttag 5221 gacaacatat ctatgaaact tatggggata cctgggcagg tgttgaagct ataataagaa 5281 ttctgcaaca actactgttt attcatttca gaattgggtg tcaacatagc agaataagta 5341 ttactcgaca gagaagagca agaaatggat ccagtagatc ctaatctaga gtcctggaac 5401 catccaggaa gtcagcctag gactgcttgt aataagtgtc attgtaaaaa gtgttgctat 5461 cattgccaag tttgcttcat aacgaaaggc ttaggcatct cctatggcag gaagaagcgg 5521 agacagcgac gaaaacctcc tcaaggcgat caggctcatc aagttcctat accagagcag 5581 taagtattcc atgtaatgca acctttagtg ataatagcaa tagcagcatt agtagtagca 5641 ataataatag caatagttgt gtggaccata gtatacatag aatataggag gataaaaagg 5701 caaagaaaaa tagactgttt aattgataga ataagagaaa gagcagaaga cagtggcaat 5761 gagagcgagg gagaaagaga ggaattgtca aaacttgtgg aaatggggca tcatgctcct 5821 tgggatgttg atgacctgta gtgctgcaga agatttgtgg gtcacagttt attatggggt 5881 gcctatatgg aaggaagcaa ctaccactct attttgtgca tcagatgcta aagcatataa 5941 aaaagaggca cataatatct gggccacaca tgcctgtgta cccacggacc ccaacccaca 6001 agaaatagaa ttggaaaatg tgacagaaaa ctttaacatg tggaaaaata acatggtgga 6061 acaaatgcat gaggatataa tcagtttatg ggatcaaagc ctaaaaccat gtgtaaaatt 6121 aactccactc tgtgtcactt taaactgcac tgatgaattg aggaacagca agggcaatgg 6181 gaaggtagaa gaggaggaaa aaaggaaaaa ctgctctttc aatgtaaggg ataagaggga 6241 gcaagtatat gcactttttt ataaacttga 
tatagtgcca atagacaata ataataggac 6301 caatagtact aattataggt taataaattg tgatacctca accattacac aggcttgtcc 6361 aaagatatcc tttgaaccaa ttcccataca tttttgtgcc ccagctggtt ttgcaattct 6421 aaagtgtaga gataagaagt tcaatgggac aggcccatgc tcaaatgtca gcacagtaca 6481 atgtacacat ggaattagac cagtggtgtc aactcaactg ctgttgaatg gcagtctagc 6541 agaagaagag atcataatta gatctgaaaa tctcacaaac aatgttaaaa ccataatagt 6601 acagcttaat gcatctatag taattaattg tacaaggccc tacaaatata caaggcaaag 6661 gacatcgata ggactaaggc aatcactcta tacaataaca ggaaaaaaga agaaaacagg 6721 atacatagga caagcacatt gtaaaattag cagagcagaa tggaataaag ctttacaaca 6781 ggtagctaca aaactaggaa accttcttaa caaaacaaca ataactttta agccatcctc 6841 aggaggggac ccagaaatta catcacacat gcttaattgt ggaggggact tcttctactg 6901 taatacatca agactgttta atagtacatg gaatcagact aatagtacag ggttcaataa 6961 tggcacagtc acactcccat gcagaataaa acaaattgta aacttgtggc agagagtagg 7021 aaaagcaatg tatgcccctc ccatcgaagg actaattaaa tgttcatcaa atattacagg 7081 gctactatta acaagagatg gtggtgcaaa taatagttct catgagacca tcagacctgg 7141 aggaggagat atgagggaca attggagaag tgaattatat aaatataaag tagtaaaaat 7201 tgaaccaata ggagtagcac ccaccaaggc aaggagaaga gtggtggaaa gagaaaaaag 7261 agcaatagga ctaggagctg tgttccttgg gttcttggga gcagcaggaa gcacgatggg 7321 cgcagcgtca gtgacgctga cggtacaggc cagacagtta atgtctggta tagtgcacca 7381 gcaaaacaat ttgctgaggg ctatagaggc gcaacaacat ctgttgcaac tcacggtctg 7441 gggcattaaa cagctccagg caagagtcct ggctgtggaa agatacctaa gggatcaaca 7501 gctcctagga atttggggtt gctctggaag gcacatttgc accactaatg taccctggaa 7561 ctctagttgg agtaatagat ctctagatga gatttggcag aacatgacct ggatggagtg 7621 ggaaagagaa attgacaatt acacaggctt aatatacagc ttaattgaag aatcgcagat 7681 ccagcaagaa aagaatgaaa aagaattatt ggaattggac aaatgggcaa gtttgtggaa 7741 ttggtttagc ataacaaaat ggctgtggta tataaaatta ttcataatga tagtaggagg 7801 cctgataggt ttaagaatag tttttgctgt gctttctgta gtaaatagag ttaggcaggg 7861 atactcacct ctgtcatttc agaccctcct cccagtcccg aggggacccg acaggcccga 7921 agaaatagaa gaagaaggtg gagagcgagg cagagacaga 
tccattcgat tggtgaacgg 7981 attatttgca cttttctggg acgacctgag gaacctgtgc ctcttcagct accaccgctt 8041 gagagactca atcttaattg cagcgaggat tgtggaactt ctgggacgca gggggtggga 8101 agccctcaaa tacctgtgga acctcctgca gtattggagt caggaactga ggaatagtgc 8161 tagtagcttg cttgatacca tagcaatagc agtagctgag aggacagata gggttataga 8221 agtagtacaa agagcttgca gagctattct taacgtaccc agaagaataa gacagggctt 8281 ggaaaggctt ttgctataaa atgggtggca aatggtcaaa aagtagttta gttggatggc 8341 ctgctataag ggaaagaata agaaaaactg atccagcagc agatggggta ggagcagtat 8401 ctcgagacct ggaaaaacat ggggcaatca caagtagcaa tacagcaagt actaatgata 8461 cctgtgcatg gctggaagca caagaggaga gcgaggaggt gggctttcca gttagacctc 8521 aggtaccttt aagaccaatg acttacaaag aagctgttga tcttagccac tttttaaaag 8581 aaaagggggg actggaaggg ctaatttggt ccaagaaaag acaagagatc cttgatcttt 8641 gggtctacaa cacacaaggc atcttccccg attggcagaa ctacacacca gggccaggga 8701 tcagataccc actgaccttt ggatggtgct tccagctagt accagttgat ccacaggagg 8761 tagaagaggc cactgaaaga gaggacaact gcttgttaca ccctatgtgc cagcaaggaa 8821 tggaggaccc ggagagacaa gtgttaatgt ggagatttaa cagcagacta gcacttgagc 8881 acaaggcccg agagctgcat ccggagttct acaaagactg ctgacatcga gtttcctaca 8941 agggactttc cgctggggac tttccaggga ggcgtggcct gggcgggact tgggagtggc 9001 taaccctcag atgctgcata taagcagctg cttttcgcct gtactgggtc tctctggtta 9061 gaccagattt gagcctggga gctctctggc taattaggga acccactgct taagcctcaa 9121 taaagcttgc cttgagtgct tca // LOCUS HIVOYI 9190 bp ss-RNA VRL 11-AUG-1990 DEFINITION Human immunodeficiency virus type 1 (HIV-1) proviral DNA, complete genome. ACCESSION M26727 KEYWORDS complete genome. SOURCE Human immunodeficiency virus type 1 (isolate OYI) proviral DNA. ORGANISM Human immunodeficiency virus type 1 Viridae; ss-RNA enveloped viruses; Positive strand RNA virus; Retroviridae; Lentivirinae. REFERENCE 1 (bases 1 to 9190) AUTHORS Wain-Hobson,S. and Pasteur Institute,Paris. TITLE ; JOURNAL AIDS Res. Hum. 
Retroviruses (1990) In press STANDARD full staff_review COMMENT Kindly provided in computer-readable form by Simon Wain-Hobson, Pasteur Institute, Paris. OYI, designated elsewhere as isolate 397, was taken from a healthy Gabonese presenting an atypical Western Blot. Tree analysis (see Part III) reveals that the sequence is closely related to the North American isolate SF2 and the European virus HAN (across the genome). This is the first report of a virus from Africa that clusters with North American rather than Zairean viruses: OYI and SF2 differ by approximately 7% in envelope. The single C -> S substitution at residue 22 of the OYI tat protein renders it inactive, but may not account for the avirulence of the virus. The pattern of env termination and nef start is not what is typically encountered. Sibling sequences for OYI(397) are available (see 1989:I-A-181). FEATURES from to/span description pept 336 1835 gag polyprotein pept < 1628 4639 pol polyprotein pept 4584 5162 vif protein pept 5102 5392 vpR protein ORF 5599 5721 vpu ORF pept 5373 5587 tat protein, exon 2 (first expressed exon) 7918 8008 tat protein, exon 3 (AA at 7919) pept 5512 5587 rev protein, exon 2 (first expressed exon) 7918 8192 rev protein, exon 3 (AA at 7920) pept 5767 8334 env polyprotein pept 8336 8971 nef protein binding 182 199 primer (Lys-tRNA) binding site pre-msg 1 9190 genomic mRNA signal 9166 9171 poly-A signal BASE COUNT 3304 a 1630 c 2202 g 2054 t ORIGIN 1 ggtctctcta gctagaccag atctgagccc gggagctctc tggctaacta gggaacccac 61 tgcttaagcc tcaataaagc ttgccttgag tgcttcaagt agtgtgtgcc cgtctgttgt 121 gtgactctgg taactagaga tccctcagac ccttttagtc agtgtggaaa atctctagca 181 gtggcgcccg aacagggacc ttaaagtgaa agtggaacca gaggagctct ctcgacgcag 241 gactcggctt gcttagcgcg cacggcaaga ggcgaggggc ggcgactggt gagtacgcca 301 aaaatttttg actagcggag gctagaagga gagagatggg tgcgagagcg tcagtattaa 361 gcgggggaga attagataaa tgggaaaaaa ttcggttaag gccaggggga aagaaaaaat 421 atcaactaaa acatatagta tgggcaagca gggagctaga
acgattcgca attaatcctg 481 gcctgttaga aacatcagaa ggttgtagac aaatactggg acagctacaa ccatccctta 541 agacaggatc agaagaaatt agatcattat ataatacagt agcaactctt tattgtgtgc 601 atcaaaagat agaggtaaaa gacaccaagg aagctttaga taagatagag gaagagcaaa 661 acaaaagtaa gaaaaaagca cagcaaacag cagctgacac aggaaacagc agccaggtca 721 gccaaaatta ccctatagta cagaaccttc aggggcaaat ggtacatcag cccatatcac 781 ctagaacttt aaatgcatgg gtaaaagtag tagaagagaa ggctttcagc ccagaagtaa 841 tacccatgtt ttcagcatta gcagaaggag ccaccccaca agatctaaac accatgctaa 901 acacagtggg gggacatcaa gcagccatgc aaatgttaaa agagaccatc aatgaggaag 961 ctgcagaatg ggatagattg catccagtac atgcagggcc tattgcacca ggccagatga 1021 gagaaccaag gggaagtgac atagcaggaa ctactagtac ccttcaggaa caaataggat 1081 ggatgacaaa taatccacct atcccagtag gagaaatcta taaaagatgg ataatcctgg 1141 gattaaataa aatagtaaga atgtatagcc ctaccagcat tctggacata agacaaggac 1201 caaaggaacc ctttagagac tatgtagacc ggttctataa aactctaaga gccgagcaag 1261 cttcacagga tgtaaaaaat tggatgacag aaaccttgtt ggtccaaaat gcaaacccag 1321 attgtaagac tattttaaaa gcattgggac cagcagctac actagaagaa atgatgacag 1381 catgtcaggg agtgggggga cccggccata aagcaagagt tttggctgaa gcaatgagcc 1441 aagtaaattc agtcaccgta atgatgcaga aaggcaattt taagaaccaa agaaagactg 1501 ttaagtgttt caattgtggc aaagaagggc acatagccaa aaattgcagg gctcctagga 1561 aaaagggctg ttggaaatgt ggaagggaag gacaccaaat gaaagattgt actgagagac 1621 aggctaattt tttagggaag atctggcctt cccacaaggg aaggccaggg aattttcttc 1681 agaacagacc agagccaaca gccccaccag cagagagctt cgggtttggg gaagagacaa 1741 caactccccc tcagaagcag gagccgatag acaagggact gtatccttta acctccctca 1801 gatcactctt tggcaacgac ccatcgtcac aataaagata ggggggcaac taaaagaagc 1861 tctattagat acaggagcag atgatacagt attagaagaa atgaatttgc caggaagatg 1921 gaaaccaaaa atgatagggg gaattggagg ttttatcaaa gtaagacagt atgatcagat 1981 actcatagaa atctgtggac ataaagctat aggtacagta ttagtaggac ctacacctgt 2041 caacataatt ggaagaaatc tgttgactca gcttggttgt actttaaatt ttcccattag 2101 tcctattgaa actgtaccag taaaattaaa gccaggaatg gatggcccaa 
aagttaaaca 2161 atggccattg acagaagaga aaataaaagt attaatagaa atttgtacag aaatggaaaa 2221 ggaagggaaa atttcaaaag ttgggcctga aaatccatac aatactccag tatttgccat 2281 aaagaaaaaa gacagtacta aatggagaaa attagtagat ttcagagaac ttaataagag 2341 aactcaggac ttctgggaag tccaattagg aataccacat ccagcagggt taaaaaagaa 2401 aaaatcagta acagtactgg atgtgggtga tgcatacttt tcagttccct tagataaaga 2461 cttcagaaag tatactgcat ttaccatacc tagtataaac aatgagacac cagggattag 2521 atatcagtac aatgtgcttc cacagggatg gaaaggatca ccagcgatat tccaaagtag 2581 tatgacaaaa atcttagagc cttttagaaa acaaaatcca gacatagtta tctatcaata 2641 catggatgat ttgtatgtag gatctgactt agaaataggg cagcatagaa caaaaataga 2701 ggaactgaga caacatctgt tgaggtgggg atttaccaca ccagacaaaa aacatcagaa 2761 agaacctcca ttcctttgga tgggttatga actccatcct gataaatgga cagtacagcc 2821 tataatgctg ccagaaaaag acagctggac tgtcaatgac atacagaagt tagtgggaaa 2881 attaaattgg gcaagtcaga tttatgcagg gattaaagta aagaacttat gtaaactcct 2941 taggggaacc aaagcactaa cagaagtaat accactaaca gaagaagcag agctagaact 3001 ggcagaaaac agggagattc taaaagaacc agtacatgga gtgtattatg acccatcaaa 3061 agacttagta gcagaattac agaaacaggg acaaggccaa tggacatatc aaatttatca 3121 agagccattt aaaaatctga aaacaggaaa gtatgcaaga atgaggggtg cccacactaa 3181 tgatgtaaaa cagttaacag aggcagtgca aaaaataacc caagaaagca tagtaatatg 3241 gggaaagact cctaaattta aactacccat acaaaaagaa acatgggaag catggtggac 3301 ggagtattgg caagccacct ggattcctga atgggagttt gtcaataccc ctcccttagt 3361 aaaattatgg taccagttag agaaagaccc catagtagga gcagaaactt tctatgtaga 3421 tggggcagct aatagggaga ctaaattagg aaaagcagga tatgttactg acagaggaag 3481 acagaaagtt gtctccctaa ctgacacaac aaatcagaag actgaattac aagcaattca 3541 tctagctctg caggattcgg gattagaagt aaacatagta acagactcac aatatgcatt 3601 aggaatcatt caagcacaac cagataaaag tgaatcagag ttagtcagtc aaataataga 3661 gcagttaata aaaaaggaaa aggtctacct ggcatgggta ccagcacaca aaggaattgg 3721 aggaaatgaa caagtagata aattagtcag tgctggaatc aggaaagtac tatttttgga 3781 tggaatagat aaggcccaag aggaacatga gaaatatcac agtaactgga gagcaatggc 
3841 tagtgatttt aacctgccac ctgtagtagc aaaagaaata gtagccagct gtgataaatg 3901 ccagctaaaa ggagaagcca tgcatggaca agtagactgt agtccaggaa tatggcaact 3961 agattgtaca catttagaag gaaaaattat cctggtagca gttcatgtag ccagtggata 4021 tatagaagca gaagttattc cagcagagac agggcaggaa acagcatact ttatcttaaa 4081 attagcagga agatggccag taaaaacaat acatacagac aatggcagca atttcaccag 4141 tactacggtt aaggccgcct gttggtgggc agggatcaag caggaatttg gcattcccta 4201 caatccccaa agtcaaggag tagtagaatc tatgaataat gaattaaaga aaattatagg 4261 acaagtaaga gatcaggctg aacatcttaa gacagcagta caaatggcag tatttatcca 4321 caattttaaa agaaaagggg ggattggggg gtacagtgca ggggaaagaa tagtagatat 4381 aatagctaca gacatacaaa ctaaagaact acaaaaacaa attacaaaaa ttcaaaattt 4441 tcgggtttat tacagggaca gcagagaacc actttggaaa ggaccagcaa agcttctttg 4501 gaaaggtgaa ggggcagtag taatacaaga taatagtgac ataaaagtag tgccaagaag 4561 gaaagcaaag atcattaggg attatggaaa acagatggca ggtgatgatt gtgtggcaag 4621 tagacaggat gaggattaga acatggaaaa gtttagtaaa acaccatatg tatgtttcaa 4681 agaaagctaa gggatggttt tatagacatc actatgaaag cactcatcca agaataagtt 4741 cagaagtaca catcccacta ggggatgcta ccttggtagt aacaacatat tggggtctgc 4801 atacaggaga aagagaatgg catttgggcc agggagcctc tatagaatgg aggaaaaaga 4861 gatatagcac acaagtagac cctggcctag cagaccaact aattcataca tattattttg 4921 attgtttttc agaatctgct ataagaaatg ccatactagg aaatatagtt agtcctaggt 4981 gtgaatatcc agcaggacat aacaaggtag gatctctaca atacttggca ctagcagcat 5041 tgataaaacc aaaaaagata aagccacctt tgcctagtgt tacgaaacta acagaggata 5101 gatggaacaa gccccagaag accaagggcc acagagggag ccatacaatg aatggacact 5161 agagctttta gaggagctta agagtgaagc tgttagacat tttcctagga tatggctcca 5221 tagcttagaa caatatatct atgaaactta tggggatact tgggaaggag tggaagccat 5281 aataagaatt ctgcaacaac tgctgtttat tcatttcaga attgggtgtc aacatagcag 5341 aataggcatt actcgacaga gaagagcaag aaatggagcc agtagatcct agactagagc 5401 cctggaagca tccaggaagt cagcctaaga ctgctagtaa caattgctat tgtaaaaggt 5461 gttgccttca ctgtcaagtt tgtttcacaa aaaaaggctt aggcatttcc tatggcagga 5521 
agaagcggag acagcgacga agagctcctc aagacagtaa gactcatcaa gtttctctat 5581 caaagcagta agtaatacat gtaatatact ctttacaaat attagcaata gtagcattag 5641 tagtagtaac aataatagca atagttgtgt ggaccatagt actcttagaa tataggaaaa 5701 tattaagaca aagaaaaata gacaggttaa ttgatagaat aagagaaaga gcagaagaca 5761 gtggcaatga cagcgagggg gaccaggaag aattatcagc gcttgtggag atggggcacc 5821 atgctccttg ggatgttgat gatatgtagt gctgcagaaa atttgtgggt cacagtctat 5881 tatggggtac ctgtgtggaa agaagcaacc accactctat tctgtgcatc agatgctaga 5941 gcatatgcta cagaggtaca taatgtttgg gccacacatg cctgtgtacc cacagacccc 6001 aacccacaag aagtagtatt gggaaatgtg acagaaaatt ttgacatgtg gaaaaataat 6061 atggtagaac aaatgcaaga agatataatc agtttatggg atcaaagcct aaagccatgt 6121 gtaaaattaa ccccactctg tgttacttta gattgcactg atgttaatac cactagtagt 6181 agtttgagga atgctactaa taccacaagt agtagttggg aaacgatgga gaaaggagaa 6241 ttaaaaaact gctctttcaa taccaccaca agcataagag ataagatgca ggaacaatat 6301 gcactttttt ataaacttga tgtattacca atagataaga atgatactaa atttaggtta 6361 atacattgta acacctcaac cattacacag gcctgtccaa agatatcctt tgagccaatt 6421 cccatgcatt attgtactcc ggctggtttt gcgattctaa agtgtaatga taagaagttc 6481 aatggaacag gaccatgtac aaatgtcagc acagtacaat gtacacatgg aattaagcca 6541 gtagtgtcaa ctcaactgct gttaaatggc agcctagcag aagaagaggt aataattaga 6601 tctagcaatt tcacaaacaa tgctaaaatc ataatagtac agctgaataa atctgtagaa 6661 attaattgta caagacccaa caacaataca agaaacagga tatcaatagg accagggaga 6721 gcatttcata caacaaaaca aataatagga gatataagac aagcacattg taaccttagt 6781 agagcaacat gggagaaaac tttagaacag atagctacaa aattaagaaa acaatttagg 6841 aataaaacaa tagcctttga tcgatcctca ggaggggatc cagaaattgt aatgcacagt 6901 tttaattgtg gaggggaatt tttctactgt aatacatcac aactgtttaa tagtacttgg 6961 aatgatacta caagggcaaa tagcactgaa gtaactatca cactcccatg tagaataaaa 7021 caaattgtaa acatgtggca ggaagtagga aaagcaatgt atgcccctcc catcagtgga 7081 caaattagat gttcatcaaa gattacaggg ctgctattaa caagagatgg tggtaagaac 7141 accacgaacg ggatcgaaat cttcagacct gcaggaggag acatgaggga caattggaga 7201 agtgaattat 
ataaatataa agtagtaaaa attgaaccat taggagtagc acccaccaag 7261 gcaaggagaa gagtggtgca gagagaaaaa agagcagtgg gaatgctagg agctatgttc 7321 cttgggttct tgggagcagc aggaagcact atgggcgcac ggtcgatgac gctgacggta 7381 caggccagac aattattgtc tggtatagtg cagcagcaga acaatctgct gagggctatt 7441 gaggcgcaac agcacctgtt gcaactcaca gtctggggca tcaagcagct ccaggcaaga 7501 gtcctggctg tggaaagata cctaaaggat caacagctcc tagggatttg gggttgctct 7561 ggaaagctca tttgcaccac tactgtgcct tggaatgcta gttggagtaa taaatctctg 7621 aatgagattt gggataacat gacctggatg cagtgggaaa gagaaattga caattacaca 7681 cacttaatat acaccttaat tgaagaatcg cagaaccaac aggaaaagaa tgaacaagaa 7741 ttattggaat tggataagtg ggcaggtttg tggagttggt ttagcataac aaactggctg 7801 tggtatataa gaatattcat aataatagta ggaggcttgg taggtttaag aatagttttt 7861 gctgtacttt ctatagtgaa tagagttagg cagggatact caccattatc gtttcagacc 7921 cgcctcccaa cccagagggg acccgacagg cccgaaggaa tcgaagaaga aggtggagag 7981 agagacagag acagatccgg aagattagtg gatggattct tagcacttat ctgggacgat 8041 ctgcggagcc tgtgcctctt cagctaccac cgcttgagag acttaatctt gattgtagcg 8101 aggattgtgg aacttctggg acgcaggggg tgggaagtcc tgaaatattg gtggaatctc 8161 ctgcagtatt ggagccagga actaaagaat agtgtcatta gcttgctcaa cgccacagct 8221 atagcagtag ctgaggggac agatagggtt atagaaatag tacaaagagc ttatagagct 8281 tttctcaata tacctagaag aataagacag ggcttggaaa gggctttgct ataagatggg 8341 tggcaagtgg tcaaaatgta gtatgaaggg atggcctact ataagggaaa gaatgaagcg 8401 agctgagcta cagccacctg agccagcagc agaaggggtg ggagcagcat ctcgagacct 8461 ggaaaaacat ggagcaatca ctagtagcaa tacagcagct actaatgctg attgtgcctg 8521 gctagaagca caagaggatg aagaggtggg ttttccagtc agacctcagg tacctttaag 8581 accaatgact tacaagggag ctttagatct tagccacttt ttaaaagaaa aggggggact 8641 ggaagggcta atttactccc agaaaagaca agatattctt gatctgtggg tctatcacac 8701 acaaggctat ttccctgatt ggcagaacta cacaccaggg ccagggatca gatatccact 8761 gtgctttgga tggtgcttca agttagtacc aatggatcca gaccaggtag aagaggccaa 8821 cgaaggagag aacaacagct tgttacatcc tataagcctg catgggatgg atgacccaga 8881 gaaagaagtg ttagtgtgga 
agtttgacag ccgcctagca tttcgtcaca tggcccgaga 8941 ggtgcatccg gagtactaca aagactgctg acatcgagct ttctacaggg gactttccgc 9001 tggggacttt ccagggaggc gtggcctggg cgggaccggg gagtggcgag ccctcagatg 9061 ctgcatataa gcagctgctt tttgcctgta ctgggtctct ctagctagac cagatctgag 9121 cccgggagct ctctggctaa ctagggaacc cactgcttaa gcctcaataa agcttgcctt 9181 gagtgcttca // LOCUS HIVPHIL1 259 bp ss-RNA VRL 11-AUG-1990 DEFINITION Human immunodeficiency virus type 1 (HIV-1), envelope region. ACCESSION M27911 KEYWORDS . SOURCE Human immunodeficiency virus type 1 (isolate 799), clone B7. ORGANISM Human immunodeficiency virus type 1 Viridae; ss-RNA enveloped viruses; Positive strand RNA virus; Retroviridae; Lentivirinae. REFERENCE 1 (bases 1 to 259) AUTHORS Ou,C.-Y. and colleagues CDC Atlanta GA, JOURNAL Unpublished (1989) STANDARD full staff_review COMMENT [1] kindly submitted in computer readable form by C.-Y. Ou, CDC, Atlanta. The sequence below, designated 799-B7, appears to be a major form of the sibling sequences 799, taken from a hemophilia patient in Philadelphia. FEATURES from to/span description pept < 1 > 259 env region (AA at 2) BASE COUNT 99 a 45 c 56 g 59 t ORIGIN Approximately 1172 nt from env start (BRU coordinates). 1 ttcaacacaa ctgtttaata gtacttggaa tgttactgaa gggttaaata acactgcaaa 61 taacagtgaa ggaactatca cactcccatg cagaataaaa caaattataa acatgtggca 121 ggaagtagga aaagcaatgt atgcccctcc catcagagga caaattagat gttcatcaaa 181 tattacaggg ctgctattaa caagagatgg tggtaatgaa actaacggga ccgagatctt 241 cagacctgga ggaggagat // LOCUS HIVSWB881 59 bp ss-RNA VRL 11-AUG-1990 DEFINITION Human immunodeficiency virus type 1 (HIV-1), gag region. ACCESSION M27952 KEYWORDS . SEGMENT 1 of 3 SOURCE Human immunodeficiency virus type 1 (isolate SWB88), clone 01. ORGANISM Human immunodeficiency virus type 1 Viridae; ss-RNA enveloped viruses; Positive strand RNA virus; Retroviridae; Lentivirinae. REFERENCE 1 (bases 1 to 59) AUTHORS Goodenow,M., Huet,T., Saurin,W., Kwok,S., Sninsky,J. and Wain-Hobson,S. 
TITLE HIV-1 isolates are rapidly evolving quasispecies: Evidence for viral mixtures and preferred nucleotide substitutions JOURNAL J. Acquir. Immune Defic. Syndr. 2, 344-52 (1989) STANDARD full staff_review COMMENT [1] kindly submitted in computer-readable form by S. Wain-Hobson. B88 is a so-called "slow virus" isolated in Sweden. Sibling sequences available (see 1989: I-A-174). FEATURES from to/span description pept < 1 > 59 gag region (AA at 2) BASE COUNT 26 a 8 c 10 g 15 t ORIGIN 1 ctataaaaga tggataatcc tgggattaaa taaaatagta agaatgtata gccctacca // LOCUS HIVSWB882 314 bp ss-DNA VRL 11-AUG-1990 DEFINITION Human immunodeficiency virus type 1 (HIV-1), tat first expressed exon of isolate L1.01. ACCESSION M31462 KEYWORDS . SEGMENT 2 of 3 SOURCE Human immunodeficiency virus type 1 (isolate B88). REFERENCE 1 (bases 1 to 314) AUTHORS Meyerhans,A., Cheynier,R., Albert,J., Seth,M., Kwok,S., Sninsky,J., Morfeldt-Manson,L., Asjo,B. and Wain-Hobson,S. TITLE Temporal fluctuations in HIV quasispecies in vivo are not reflected by sequential HIV isolations JOURNAL Cell 58, 901-910 (1989) STANDARD full staff_review COMMENT Kindly submitted in computer-readable form by S. Wain-Hobson, Pasteur Institute, Paris. The B88 isolate is considered to be a "slow-low" viral form. Sibling sequences from pbl's and viruses propagated on pbl's are available. This sequence corresponds to the earliest sample major form, L1.01.
FEATURES from to/span description pept 30 / 244 tat exon 2 (first expressed exon) BASE COUNT 111 a 57 c 70 g 76 t ORIGIN 1 cattattcaa cagaggagag caagaagaaa tggagccagt agatcctaga ctagagccct 61 ggaagcatcc aggaagtcag cctaagactg cttgtaacaa ttgctattgt aaaaagtgtt 121 gctttcattg ccaagtttgt ttcacaaaaa aaggcttagg catctcctat ggcaggaaga 181 agcggagaca gcgacgaaga gctcctcagg acagtgagac tcatcaagtt tctctatcaa 241 agcagtaagt agtacatgta atgcaatctt tagaaatatt agcaatagta gcattagtag 301 tagcaataat acta // LOCUS HIVSWB883 250 bp ss-RNA VRL 11-AUG-1990 DEFINITION Human immunodeficiency virus type 1 (HIV-1), envelope region. ACCESSION M27947 KEYWORDS . SEGMENT 3 of 3 SOURCE Human immunodeficiency virus type 1 (isolate SWB88), clone 01. REFERENCE 1 (bases 1 to 250) AUTHORS Goodenow,M., Huet,T., Saurin,W., Kwok,S., Sninsky,J. and Wain-Hobson,S. TITLE HIV-1 isolates are rapidly evolving quasispecies: Evidence for viral mixtures and preferred nucleotide substitutions JOURNAL J. Acquir. Immune Defic. Syndr. 2, 344-52 (1989) STANDARD full staff_review COMMENT [1] kindly submitted in computer-readable form by S. Wain-Hobson. This stretch comes from the first hypervariable region of env. Sibling sequences are available (see 1989: I-A-175). FEATURES from to/span description ORF < 3 245 env ORF (AA at 3) BASE COUNT 96 a 40 c 42 g 72 t ORIGIN 1 aaattaaccc cactctgtgt tactttaaat tgcactaagt tgaagaatgt tacttatgcc 61 aataatagta gtagggaaaa agaagaaatg aaaaactgct ctttcaccac aagaataggg 121 aataaggtgc agaaagaata tgcacttttt tataaacttg atgtagtacc aatagataag 181 gataatacaa gctttatatt gatacattgt aatacctcga ccattacaca ggcttgtcca 241 aaggtatcct // LOCUS ALFLBA 578 bp ss-mRNA PLN 11-AUG-1990 DEFINITION Alfalfa leghemoglobin mRNA, complete cds. ACCESSION M36100 KEYWORDS leghemoglobin. SOURCE Alfalfa 15 day old nodule, cDNA to mRNA, clone pNL549. ORGANISM Medicago sativa Eukaryota; Plantae; Embryobionta; Magnoliophyta; Magnoliopsida; Rosidae; Rosales; Fabaceae.
REFERENCE 1 (bases 1 to 578) AUTHORS Barker,D.G., Gallusci,P., Lullien,V., Khan,H., Gherardi,M. and Huguet,T. TITLE Identification of two groups of leghemoglobin genes in alfalfa (Medicago sativa) and a study of their expression during root nodule development JOURNAL Plant Mol. Biol. 11, 761-772 (1988) STANDARD simple staff_review FEATURES from to/span description pept 7 450 leghemoglobin BASE COUNT 189 a 91 c 126 g 172 t ORIGIN 1 agaaatatga gtttcaccga taagcaggag gctttagtca atagctcatg ggaggcattc 61 aaacaaaacc ttcctcgtta tagtgttttt ttctacactg tcatattaga gaaagcacct 121 gcagcaaaag gtttgttctc ttttcttaag aactcagctg aagtacaaga tagtcctcaa 181 ctccaagctc acgctgaaaa ggtttttgga ctggtgcgtg actcagctgt tcaacttcga 241 gcaacagggg gggtagtttt gggagatgct acgttgggtg ctatccacgt tcgaaaggga 301 gttgttgatc ctcattttgt ggtggttaaa gaggctttgc tcaaaacaat aaaagaagca 361 gctggagaca aatggagtga agagcttaac actgcttggg aagtagctta tgatgcactg 421 gcaactgcaa ttaaaaaagc aatgagttaa acctgtgatg atctattatc ataaaaagat 481 taataaataa aatatgtatt actaaaactt gttaaacaag ttcctataat aaattatgat 541 aaaaatattt atgaaatatt gttagtgttt atgccggt // LOCUS ALFLBB 521 bp ss-mRNA PLN 11-AUG-1990 DEFINITION Alfalfa leghemoglobin mRNA, complete cds. ACCESSION M36101 KEYWORDS leghemoglobin. SOURCE Alfalfa 15 day old nodule, cDNA to mRNA, clone pNL154. ORGANISM Medicago sativa Eukaryota; Plantae; Embryobionta; Magnoliophyta; Magnoliopsida; Rosidae; Rosales; Fabaceae. REFERENCE 1 (bases 1 to 521) AUTHORS Barker,D.G., Gallusci,P., Lullien,V., Khan,H., Gherardi,M. and Huguet,T. TITLE Identification of two groups of leghemoglobin genes in alfalfa (Medicago sativa) and a study of their expression during root nodule development JOURNAL Plant Mol. Biol.
11, 761-772 (1988) STANDARD simple staff_review FEATURES from to/span description pept > 1 396 leghemoglobin BASE COUNT 175 a 77 c 112 g 157 t ORIGIN 1 aaatcattta aacaaaaccc tggtaatagt gttttgttct acactataat attggagaaa 61 gcccctgcag caaagggcat gttctctttt cttaaggact cagctggagt acaggatagc 121 cctaaactcc aaagccatgc tgaaaaagtt tttggaatgg tgcgcgattc agctgctcaa 181 ctccgagcaa caggaggagt agttttggga gatgctacat tgggtgcaat ccacattcag 241 aaaggagttg ttgatcctca ttttgcggtg gttaaagaag ctctgctgaa aacaataaag 301 gaagtatcag gagataaatg gagcgaagaa ttgaacactg cttgggaagt agcctatgat 361 gcattggcaa ctgcaattaa gaaggcaatg gtttaaattt gggatgatct atagttatta 421 ctcaatttaa taaataaaaa atgtattact aaaacttgtt taacaagttt ctataataaa 481 tgttgttgaa atgactatta aatattattt gagtttatgt t // LOCUS CLTPHYC 1077 bp ds-DNA BCT 11-AUG-1990 DEFINITION Calothrix sp. phycocyanin alpha (PC2-alpha) and phycocyanin beta (PC2-beta) genes, complete cds. ACCESSION M36276 KEYWORDS phycocyanin. SOURCE Calothrix sp. (strain PCC 7601 (Fremyella diplosiphon UTEX 481)) DNA. ORGANISM Calothrix sp. Prokaryota; Bacteria; Gracilicutes; Oxyphotobacteria; Cyanobacteria; Nostocales; Rivulariaceae. REFERENCE 1 (bases 1 to 1077) AUTHORS Capuano,V., Mazel,D., Tandeau de Marsac,N. and Houmard,J. TITLE Complete nucleotide sequence of the red-light specific set of phycocyanin genes from the cyanobacterium Calothrix PCC 7601 JOURNAL Nucleic Acids Res. 
16, 1626-1626 (1988) STANDARD full staff_review FEATURES from to/span description pept 1 519 phycocyanin beta (PC2-beta) pept 589 1077 phycocyanin alpha (PC2-alpha) BASE COUNT 276 a 259 c 242 g 300 t ORIGIN 1 atgcttgacg cttttactaa agtagtttcc caggctgata ctagaggcgc atacattagc 61 gatgcagaaa tcgatgcatt gaagacaatg gttgctgctg gttctaagcg catggatgtt 121 gttaaccgca ttaccggcaa tgcctccaca atcgttgcta atgcagctcg cgctttattt 181 gaagaacaac cccagttgat tgcacctggt ggtaatgctt acaccaaccg tcgtatggct 241 gcttgcttac gcgacatgga aattatctta cgctatgtaa cctacgcagt atttgcaggt 301 gatgctagcg ttttagacga tcgctgcttg aacggtcttc gtgaaacata tcaagcatta 361 ggcgttcctg gcgcttccgt atctactggc gttcaaaaga tgaaagaagc tgcgatcgct 421 attgctaacg accccagcgg tgttacccgt ggtgactgta gctccttaat gtctgaattg 481 ggtagctact tcgatcgcgc tgctgctgca gttggctagt aaatcgctag ttaattattt 541 ctactttctg acgcataact ttagatattt gttgagagaa taatcagcat gaaaacacct 601 ttaaccgaag ccgtagctac cgcagattcc caaggtcggt ttttgagctc cacagaactt 661 caagttgctt ttggtcgttt ccgtcaagca tctgctagct tggatgcagc taaagcatta 721 agcagcaaag ctaacagctt ggctcaaggt gccgttaacg cagtttacca aaagttcccc 781 tacaccaccc aaatgcaagg taagaacttc gcttctgacc aacgcggtaa agacaagtgt 841 gcacgcgaca tcggttacta cattcgcatc gttacctact gtctagttgc tggtggtaca 901 ggccctctag atgactacct aattggtggt ttggcagaaa tcaaccgcac ctttgatctg 961 tctcccagct ggtacgttga agctctcaaa tacatcaaag ctaatcatgg tttgagtggc 1021 gatcctgctg ttgaagcaaa ctcctacatt gactacgcaa tcaacgccct aagctag // LOCUS PINACT 745 bp ds-DNA PLN 11-AUG-1990 DEFINITION P.contorta (lodgepole pine) actin gene, last two exons. ACCESSION M36171 KEYWORDS actin. SOURCE P.contorta (var. latifolia) seed DNA. ORGANISM Pinus contorta Eukaryota; Plantae; Embryobionta; Pinophyta; Pinicae; Pinatae; Pinaceae. REFERENCE 1 (bases 1 to 745) AUTHORS Kenny,J.R., Dancik,B.P., Florence,L.Z. and Nargang,F.E. TITLE Nucleotide sequence of the carboxyl-terminal portion of a lodgepole pine actin gene JOURNAL Can. J. For. Res. 
18, 1595-1602 (1988) STANDARD simple staff_review FEATURES from to/span description pept < 1 420 actin, exon X (AA at 1) 555 620 actin, exon X + 1 IVS 421 554 actin intron A BASE COUNT 196 a 124 c 172 g 253 t ORIGIN 1 aagcttgctt atgtagccct tgactatgag caagaacttg aaacatcgaa gagtagttct 61 tcacaagaaa agaattatga gcttcctgat ggacaggtca ttaccgttgg tgctgagcgt 121 ttcagatgtg ctgaggtttt gttccagcca tctctcattg gaatggaagc agcgggtatc 181 catgagacta catacaattc cattatgaaa tgtgatgtgg atatcagaaa agatctctat 241 ggaaacattg ttcttagtgg tggttctacc atgtttcctg gtatagctga tcgtatgagc 301 aaagagatca ctgcacttgc tcccagtagc atgaaaatca aagttgtagc accccctgag 361 aggaagtaca gtgtttggat cggaggttct atcttggcat ctcttagcac atttcagcag 421 gttattctca actttctaca tgccggaact tgaggttctt gtttggtttg gtggtattta 481 atgtgctttg caatatataa atattatttc tttgattgta tatttaacca tgccttgatt 541 gtttttgtat gcagatgtgg attgccaaga ccgagtatga ggagtcaggt ccctcaattg 601 tccacagaaa gtgtttttaa acagagtgct ctattgatta tgctttatga ttggccttta 661 ggtatattgt gagggtggaa gacacgatta ccgagatttt gatttagtat catgctttag 721 gatttgcatc tgtatagttg aattc // LOCUS RATMHCI 1355 bp ss-mRNA ROD 11-AUG-1990 DEFINITION Rat MHC class I RT1.C/E mRNA, 3' end. ACCESSION X16979 KEYWORDS cell surface antigen; cell surface glycoprotein; class I gene; integral membrane protein; major histocompatibility complex. SOURCE Rat liver, cDNA to mRNA, clone RT.BM1. ORGANISM Rattus norvegicus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 1355) AUTHORS Parker,K.E., Carter,C.A. and Fabre,J.W. 
TITLE A rat class I cDNA clone with an Alu-like sequence and mapping to two genes in RT1.C/E JOURNAL Immunogenetics 31, 211-214 (1990) STANDARD simple staff_review FEATURES from to/span description pept < 1 857 MHC RT1.C/E (AA at 3) mRNA < 1 1355 MHC RT1.C/E mRNA rpt 1210 1349 type 2 Alu-like repeat BASE COUNT 322 a 334 c 387 g 312 t ORIGIN 1 attgggagcg ggagactcgg aaagccaggg acaccgggag gaacttcaaa gtgaatctga 61 ggaccctgct ccgctactac aatcagagtg acgacgaatc tcacacgctg cagtggatgt 121 acggctgcga cgtggggccc gatgggcacc tgctccgcgg ttattgtcag gaggcttatg 181 acggccggga ttatatctcc ttgaacgagg acctgcgctc ctggaccgcg actgacatgg 241 cctcacaggc ctctaagatc aagtcagagg aagttggtga ggcccatcac cagagggcct 301 acctgcaagg tccttgcgtg gagtggctcc acacatacct acatctggga aaggagacgc 361 tgctgcgctc agacccccca aaggcacatg tgacccttca ccccagacct gaaggtgatg 421 tcaccctgag gtgctgggcc ctgggcttct accctgctga catcaccttg acctggcaat 481 tgaatgggga ggacctgacc caggacatgg agcttgtgga gaccaggcct gcaggggatg 541 gaaccttcca gaagtgggca gctgtggtag tgccttctgg ggaggagctg aagtacacat 601 gccatgtgga gcatgaggga ctgcctgaac ctctcgccct gagatgggag ccttctccat 661 tcaccgactc cagcatgcca gtgattgttg tccttggagc tgtggccatc attggagctg 721 tggccatcat tggagctgtg gccatcattg gagctgttgt gaggaggagg aagagaaaca 781 caggtgaaaa aggcagctat gctcatgttc taggcagcaa ggctttccag atctctgact 841 ggcctcagaa ggcatgaaaa gacagttgcc tcgtggggct gagcgacaag ttctgactaa 901 acatctgagg gagctgcttc ccctgaggac aatgtgaaga actggcaatc ccagaccttc 961 ttgcctccct ggacacctga cttcacaaca gaaaataatc tgactctggg ctgggtgttt 1021 cctattgctt ttatctgcac tgagtggaaa tgcggtttcc aacagtgtaa cgcagtaccg 1081 ttgtgtggtg gtagtgacat tgcacacagg tttgtgagct gggggtgtga tgtgcttttg 1141 ttactgtagg cagaacaagc tagaagtcca ccattcctct tatggtcatt gttttaaaac 1201 aaaacaaatg ctggagagat ggctcagtgt taagagcact gacttctctt acagaggtcc 1261 tggttcaaat cccagcaacc acatggtggc tcacaaccat atgtaatgag atctgatgct 1321 ctcttctgat gtgtctgaac acagctatat atgat // LOCUS SCOMTCOIII 1120 bp ds-DNA ORG 11-AUG-1990 DEFINITION 
S.commune mitochondrial cytochrome oxidase III (COIII) gene, complete cds. ACCESSION M36270 KEYWORDS cytochrome oxidase III. SOURCE S.commune mitochondrial DNA. ORGANISM Mitochondrion Schizophyllum commune Eukaryota; Plantae; Thallobionta; Basidiomycotina; Hymenomycetes; Agaricales; Schizophyllaceae; Schizophyllum commune. REFERENCE 1 (bases 1 to 1120) AUTHORS Phelps,L.G., Burke,J.M., Ullrich,R.C. and Novotny,C.P. TITLE Nucleotide base sequence of the mitochondrial COIII gene of Schizophyllum commune JOURNAL Curr. Genet. 14, 401-403 (1988) STANDARD simple staff_review FEATURES from to/span description pept 147 953 cytochrome oxidase III (COIII) BASE COUNT 387 a 155 c 158 g 420 t ORIGIN 1 ggatcttaca aagaatatta tctttcaact ttcttttaaa taataaaatt acttgttaat 61 taattcaaat tacaagtgta aagccatagt ataaaacaaa atttataaat aaataaataa 121 aaaataacaa ataataaata acaaaaatga caagtattaa attctatcaa tcatttagtg 181 cacacttagt acaacattca ccttggccaa ttttagttag tttcagtctt tttaatttag 241 ctattggtac agtattaact atgcatggat acagtcacag ttcaacaact tttgatttag 301 gtttagcagt aactgttgga tcaattttat tatggactag agatattgta attgaaggat 361 ctttcttagg agatcacaca aaacaagtac aagaaggatt aatcatagga tttatcttat 421 ttattataag tgaagtattc gcttttattt cagtattctg ggcttatttc catagtgctt 481 tatcaccagc tgtagaatta ggatcaacat ggccaccagt aggaataatt ccattagata 541 ccttttcatt accattattt aatacaataa ttttattatc tagtggagct tttgttacct 601 atggacacca tgctattttc tcaggaaaaa gattagattc aatcatagga ttatttttaa 661 cagtagcctt agcattaatc tttagttact ttcaagcatt tgaatatatt catgctggat 721 tttccatgtc cgatagtgta tttggaacag tattttttgc aagtacagga ctacatggga 781 tacatgttat gcttgggact ctttttcttt tcgtatcttt tcttagacaa gttaactatc 841 aaactacaaa agaacacaat attggacttg aaacaagtat tctatattgg cactttgttg 901 atcttgtgtg gttattctta tttcttgttg tttacttttg gggtggtgca taattatttc 961 ttttctttta ttgaaagtaa taactattca ttagaactta taaactatgt gcaatttcca 1021 attttattcg gatttatcaa tacaggggga aaaacttctc ccttagaaaa tacaagaaat 1081 tgaagacaaa gaattaaatg taataaataa 
aataattaca // LOCUS STYHSQ 135 bp ds-DNA SYN 11-AUG-1990 DEFINITION S.potsdam and S.typhimurium hsdS recombinant DNA. ACCESSION M15940 KEYWORDS DNA binding protein; type I restriction and modification system. SOURCE S.potsdam and S.typhimurium DNA, clone lambda-hsd SQ. ORGANISM Artificial gene Artificial sequences; Genes. REFERENCE 1 (bases 1 to 135) AUTHORS Fuller-Pace,F.V. and Murray,N.E. TITLE Two DNA recognition domains of the specificity polypeptides of a family of type I restriction enzymes JOURNAL Proc. Natl. Acad. Sci. U.S.A. 83, 9368-9372 (1986) STANDARD full staff_review COMMENT The hsdS genes of the SB (S.typhimurium) and SP (S.potsdam) have a conserved sequence of around 100 bp flanked by two nonhomologous variable regions of around 500 bp. Recombination between the hsdS genes of SB and SP generates a system (SQ) with a different recognition specificity. The crossover point lies in the central conserved region. Concomitant with the generation of a new combination of flanking variable regions is the recombination of minor differences in the central conserved region. A polypeptide domain encoded on the 5' side of the crossover dictates recognition of the trinucleotide component "aac" (positions 25-27) of the target sequence, and a second domain, encoded on the 3' side of the crossover (positions 50-54), governs recognition of the tetra- or penta-nucleotide component. Parts of the variable regions are implicated in the determination of the specificity of interaction between protein and DNA. FEATURES from to/span description pept < 1 > 135 hsdS specificity protein (AA at 1) recomb 32 101 SP DNA end/SB DNA start (uncertain within span) BASE COUNT 46 a 39 c 24 g 26 t ORIGIN Unreported. 1 ataccaatcc cgtcacttgc tgaacaaaaa atcatcgccg aaaaactcga tacgctgctg 61 gcgcaggtag acagcaccaa agcacgtctt gagcaaatcc cacaaatcct gaaacgtttt 121 cgccaatcag tgata // LOCUS RATCYP2A3A 14315 bp ds-DNA ROD 11-AUG-1990 DEFINITION Rat cytochrome P450 II A3 (CYP2A3) gene, complete cds. 
ACCESSION M33190 KEYWORDS coumarin 7-hydroxylase; cytochrome P450 II A3. SOURCE Rat (Sprague-Dawley) DNA. ORGANISM Rattus norvegicus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 14315) AUTHORS Ueno,T. and Gonzalez,F. TITLE Complete sequence of the rat CYP2A3 gene specifically transcribed in lung JOURNAL Nucleic Acids Res. 18, 4623-4623 (1990) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by F.Gonzalez 23-MAR-1990, for release after publication. FEATURES from to/span description pept 3446 3625 cytochrome P450 II A3, exon 1 3892 4054 cytochrome P450 II A3, exon 2 4736 4885 cytochrome P450 II A3, exon 3 5184 5344 cytochrome P450 II A3, exon 4 6879 7055 cytochrome P450 II A3, exon 5 8457 8598 cytochrome P450 II A3, exon 6 9049 9236 cytochrome P450 II A3, exon 7 9806 9947 cytochrome P450 II A3, exon 8 11123 11304 cytochrome P450 II A3, exon 9 pre-msg 3422 11497 cytochrome P450 II A3 mRNA and introns IVS 3626 3891 intron A IVS 4055 4735 intron B IVS 4886 5183 intron C IVS 5345 6878 intron D IVS 7056 8456 intron E IVS 8599 9048 intron F IVS 9237 9805 intron G IVS 9948 11122 intron H signal 11466 11471 poly-A signal BASE COUNT 3819 a 3801 c 2926 g 3769 t ORIGIN Chromosome 7. 
1 gaattccaac agagacccca cttgtgcacc cgaaatgata cacccagaag aaggtacaag 61 cagagctaag cgaggcagag gctcccatca taacctacta atcaaaacga tagcagcgtg 121 aacatgttca caagcatgga ggacttgggg acaacagtgg ctcgctaagg aggagaaagc 181 aaacatgagt cgatagtcta cagtgagaga tgatagacaa caaggcaaag tcctgtgcag 241 gcttcagctc catccagagc aatctgaaca tctaacgctg atcagaaatt gcccaacagg 301 gcaccaagcc tccaggataa acttcctact cacagatcct tgagtatggt ggaagagggc 361 actgatgctt ggctttgatg aagaagcact ggggtgggag gcttctcaca caccaccagc 421 ttctttaggg ttgtcctgga ggcatttcaa aatgagacag gccttgcttg cccaccgtaa 481 ctgaatgtta ccaaagacac caggagcttt ggctacagga catagaaata tcaatctcaa 541 actgaccagg aaattgtctc cttagtggtt ctgggagtat cattggtctg accaagtgtg 601 gaccctctcg ctacaataaa aacaagaggc tcccaaaggc agattgaagt tcagaggaac 661 atcaatccat atcctttttt cttgtcaagt aataagatgt gataataaca gatgaataaa 721 atagctattt attgacttta gcattaactc aaccattact accaagaaga agattcacct 781 ataaagttca gtttgttatt gaagtcagct cttcaaggac acacaacaga caagatctca 841 gggacactga ttggattgcc caggagttgg tatctgccca ggttagggtt ccattctatg 901 ctgtgccatt gctctcaatc acggaaccag aaagggagac tggaaccatc agagggagac 961 ttcccaaggt tgctgcttat tcactctgga aacctgtctg agatgcttgc tgctgttccc 1021 acatgtctct ggcatgtgac cgctgtcacc gatccttcat tcacgcccct gtgtctctcc 1081 tgcataactc ctgttcctgc tctgtcattc atgtgcccgg aaaaaaaaat caataaggac 1141 aaatacactt gatctctcct gaactctcca agctctgttc ccattctggc ccccgtgccg 1201 aacatgagaa catgcctgac cactcagggt ccttctgggt gcctcctcca ctgagaaaca 1261 gattctctaa attacccctt cctcctattc aagtggaaac aaggggagag gagaggtccg 1321 tgctgggtat agtgcaggtg caggagagat tgatgcctgc caagcctagg ctaatcccat 1381 tccccccccc gccccttcct cctccaaaat caacaaacaa cctttctctc tgctttcttc 1441 tcggcccttc gtttcctccc caaccggtct cagagtccaa cagcctaaaa tgagatgagc 1501 taattgccac tcatgagctc caacctcacc caatggtcta atgctgggat accatcatgg 1561 gagggaccca tccatcaaag cctgccagca tccacccaag gtccctccca gggtctcaag 1621 gaagccaggg tcagcaagca cctagtgttt atatcagagc ctgctggaac caaatccata 1681 cccagcatgg tgccacactt 
atgccttcac tcaaacaatt ccagtgacca cctgccgccc 1741 ctctcaaggt ctctctcagg gtaagaatat caccaatatc cttttggtgt caagctccgt 1801 gctaactcct ttttcacctt agtgcttgcc tgggggctat ctcaatgcca accccatcct 1861 cctccctcct gggagcccat cctgcagcta agcttaattc caggtgtctc cagggtgagc 1921 tgagctggat aatgaggagg aaagagaatc aaatccctgc tcctttatga cccggtcatc 1981 tcaatgaggt tggtgatgca ccaactgcac aatgactaag attccacaga tgggatacac 2041 atcatcttcc tgggctacgg gttcatcaaa actgtccaga ttattaaaga agacttacaa 2101 tgtgagagaa cagtagggtc attttccctt ctccatagcc accctcagca tcccagggtc 2161 ctcttctgtt ccccatgctg gacgctcatc taaatctctc cacctcagcc cactgataac 2221 atccaggcat cagcacacat acccaccccc caccaaatat gacctttatt tccccaatgc 2281 tctcttctca ctggtgggaa agatgtctag acagggaaag ctaaaagtat gagcaagagt 2341 agatgttaat gcctctgact ttcgggcttt ccatgtatgt gccccccatg ccacccctat 2401 gttttgttca cgtgcaccca tagaaaacat gtgtaaatta cagcatcatt gagccagtga 2461 ctctgatggc ctcctctaca aatctcgata ggaatccagt ggggctttcg taaattagaa 2521 ctttgctacc atattggagt tgtgattatc atttctcaag acctagacag acccccagat 2581 attggctcta gtactgtgcc aaattccagt gaaatatgac tctggtcttc cctacctttc 2641 tcagatctta tgttccagca atccatccta gactcagaga cttctctggg ttctgagcat 2701 taagagacag catgaagcca taaccaaagt gcatccacgc cggggcacag gtgttgactc 2761 caagatcctc ctgcagtttg tgtccttcca ggcactcccc atccacagcc tttcttgact 2821 caagatccat aacccatagc cctggagtcg ccaattctcg gttccttaca ttcaggaccc 2881 acttcattga aagtctccca gggttggctg tggaagtgta ggtctgtaat cccagcactc 2941 agagcagaca ggtggatctc tgtagtctat gtacagactc cggactaccc agaactacat 3001 agtgaaaact tgtctcaaac aagaaagaaa agtgatcttt caacttcata gccattcatc 3061 ctatttcgaa caaatgaatt atagggttct tctctcctaa gaccctaata tggctatcac 3121 ttgtgccctt tctcaaagga agggtctcta gtatacatgt catgaaggat ccagttctgg 3181 cttctctcta ctgtcctctg taatgcatag ttttggagca cttcctcctt agacaccaca 3241 aactcagtct actccctgtt aatctgtcat ggagccctcc tcctccttga gtgttggcta 3301 tgtcccaaac taggtgcatg acgggggtag tctggaggta aaattagaca gttgcataat 3361 caagaccaaa ggccagcctt ctatctcaga 
atgtataaaa acaagctaat tcagccatca 3421 ctgtgtctgt catcccatcg ccaccatgct ggcctcagga ctccttctgg tggcctcagt 3481 ggccttcctc agtgtcctgg tcttgatgtc tgtctggaag cagaggaagc tctcagggaa 3541 gctgcctcct ggacccaccc cattgccctt catcgggaac tacctccagc tgaacacaga 3601 gaaaatgtac agctctctca tgaaggtgcc actgggcatg gagatgggtc gaatgggggc 3661 gtggagggat gcccaatttt gattgaggtt ttgtggtagg tattgaccag atgggagtgt 3721 aattttaggc caggggaatt gtgaagaaat cagcactttc taggccagtg gtatcacatc 3781 ttgggacacc cagtctgata actattagaa ctctagagag atgaatccca gaccggatat 3841 caaagaatga accatcccct cttaccttct ctgatcattc tcctccatca gatcagccaa 3901 cgttacggtc ctgtattcac catccacctg ggacctcgcc gagttgtggt gctgtgcgga 3961 caggaggcag tcaaggaggc tctggtggac caagctgagg aattcagtgg tcggggagag 4021 caggccacct tcgactggct tttcaaaggc tatggtgagg ggaagaccca ggggaatgtg 4081 gcagaggaca tttgagggca tcagtttccc cagatttctc cctgactgtc tatcccctcc 4141 aagcaatctt ctccccacat ccctgtctca gactggtctc tctgccagtg tttcctgact 4201 gccccacctc cacccatgcc tccttttcca cctagcaatt gctgtcggaa tcctgccgcc 4261 tctaagggga tccgaagctt ccctgcaacc cacaccatca ttttcagctt ctgcgattgc 4321 tttttctctt cctctcctct gctttccagt gtcagacatc acagaacgtt tgttcctagg 4381 ttctatctac cagcaacgat ttcttctctt tccagactct ctttgtgtct ctctctttat 4441 ttccctctct ttttttgctc aattacactt ctttcacaat tgtatccacc ctcagctctc 4501 tgcagtctcc tgtctctcac ctctgtctct gcacctctgt ctccctcttc ttactgtcca 4561 gtaacttgct cttttacctg tactttcgtg gtgccttctt gtgttttgca tttgcatttc 4621 caaggatctg tgtgcatctt tttccttctt gaccgcctga aggggaacag cgcaagggtg 4681 gtcaatggcc agccacgccc cctgacttct cctggcttct cattcacctt cccaggcgta 4741 gccttcagca gcggggagcg agccaaacag ctaaggcgct tctccatcgc cacgctgcgg 4801 gacttcggcg tgggcaagcg tggcatcgag gagcgtatcc aagaggaggc gggctttctc 4861 atcgagtcat ttcgaaagac gaacggtaag cggagcaccg ttctccggac ctaccaaggg 4921 aaagcagcac catgtcccag gatagaaact aggctaggga agtgtatgct ggtcccctac 4981 tgtgggatct ggtctcagcg tttttggttc agcgctgcag ccagtgccct tcttgagttc 5041 tcagattcaa agtatctgat gctaacaccc aggcgatgct 
ttttgcctag ttccaatgtc 5101 tccctttctt tcctcctcct cctccccttc ctccccctcc tccccctcct cctcctccct 5161 ctcctcctct tccccatcca caggtgccct cattgacccc accttctatc tgagccggac 5221 agtctccaat gtcattagct caatagtctt cggggaccgc ttcgactatg aggacaaaga 5281 gttcctgtca ctgcttcgaa tgatgctggg aagcttccag ttcacagcta cctccacggg 5341 gcaggtaacc tgtttcaacc ttgtcctgtc atacctcctg accttgactg ccagctcact 5401 cccaaactgc ttccagcagg caccttcaca ctagcccttg cagttttgca agaacaaata 5461 aatgtcaata tgtagacatc ctcaacagct gcccctgaaa tcaaaagctg gcagtctatc 5521 tccctttccc cctccctttt ccccttcccc tttccccctc cccttccccc tcccctctcc 5581 ctcccctccc cttccctctc atgtacacac acacacacac acacacacac acaccacaca 5641 cacacacaca gccactcagc tccttcagct gggcacatgt taccaccaca atttgtttta 5701 atttctaaca caagattgcc ataggaaggg tccgctgaaa acaccacccc tcttcagttc 5761 ctgaacatct ggcactgcag aaaacacaga gctagcccac tggagtatgt agatgagtgc 5821 cccttactca gccacggaac taagcaaacg tctgaggctc caagcgaaat ttctgcacag 5881 atgtccccca aacatccaag taagggtctt tcaacacagc tcattcacac tcctaagcac 5941 gaccatctgt gaagctgttt gataacaaag caggtgaaga tatggatatc agcactactt 6001 aaaatagaaa taataagaca attttggaca agcatacatt gtgtattgag ccttttcagc 6061 cccatacccc ccaccttccc tgttctcaca cttcctcctc ttcctcctcc tcctcttcct 6121 cttcctcctc ttcctcctcc tccttctgtt agtcctgttt gtccctgaga cagttttatt 6181 tctacttcta agtgatacac atgtacatat atatgcatat gtatgtacaa cgtgatttga 6241 attaatacca cagattgaaa gaacacatac acgggttctc taatactggt taaatttatt 6301 tgatatgatc tcccattgtg tctgctttcc tgaaaatttc cataccattg tttttaatgg 6361 ctgaaaaatt ttacacacac acacacacac acatacacat atacatatat acatatacat 6421 atatatctca cattttctct ttcattacct tgtctttgga cagctagttt gagtgcataa 6481 ttcagttact gtgactaccc aggactggtt ggtttaatac tggaaaacct gcgaaccttc 6541 accatccaaa ccctgctagt atttgaacag ctgcctctag ctaatgtcac ccaaatacct 6601 aagcatctac agatgaaact tcagcccaac aactgaatgt cattcaagcc agccatgcgc 6661 aagaactcca aacctatata tgcgaccaaa tccacctcac ttaaaatctt ggacaatagc 6721 cctctcactt aaactataag aaaactatat tgcctccttg ccctaccagg 
ttacctgaac 6781 ctcaggacag gtgacagaaa agtcccttgc ctgaaaacag ccaccacacc tgttggattt 6841 tatgcagtct tggtttctcc tcctttttcc accctcagct ctatgagatg ttctcttctg 6901 tgatgaaaca cctgccaggc ccccagcaac aggcctttaa ggagctgcag gggctggagg 6961 acttcataac caagaaggtg gaacagaatc agcgcacgct ggatcccaat tccccaaggg 7021 acttcatcga ctctttcctc atccgaatgc tggaggtaaa cccagaagtt agggagcatg 7081 gcgggaggtc caaagccaag caggatgaaa tcacgcttgg ggaatggggg taatggtgga 7141 ccacttaaat cagtcctctc aatataaacc tcacatatta ccactcagca ccactgttcc 7201 tttcacttct cagtctcagg aggatggaca gactgagatc acagccagga tgtggtgctg 7261 ccatatgcca ggcaccattt taaaccattt taaaggctta gtgatatcca cttatttagt 7321 attcacacct tggacgaggt tgtgttatga ctcccatttt acaaatgagt aggctgcagc 7381 tctgagaata taggcaactc gtcctgcatc acagctgtct caggctgaca cacaacttaa 7441 cgttgcctgt cctcattgtc attcacgtct gtggcacagg tgacctagct ttattgtgac 7501 cttctctcac ctatatggaa cagaggcagg ggtgttttgt gataggctgt agccctagtt 7561 ggcctgaaac ttgctctgca caccaggcaa gtctcaaacc tgccgtggtc ctcctgaggc 7621 tgctttctga tggcttggat tatcaacatg tggtatccgg ctgagatgga acttcatgta 7681 gtctgaatgc aaaggattga gggatgagat tcccctagaa aatattaagg ttcgtaatga 7741 agagatggga tttattgcct agcattcatt gtctgagggt tggaaagatg ggtcagtggt 7801 taagagcact ggctgctctt ccagagtgcc agtgggttca atcccagcac atgcatggca 7861 gctcacaacc atgtcatttc agtcccagta gatccaatgc cctctttggg cctccatggt 7921 tactgaacgc atgtagtgct ctgacacaca tgcaggcgaa acactcatac actcaaaata 7981 atttttaaaa agctctgtat caggatgggg atttagctca gtggtagagc acttgcctag 8041 gaagcgcaag gccctgggtt tggtccccag ctccgaaaaa aagaacaaaa aaaaagctct 8101 gtatctacct ctgaaaaacc atctcaagtt ttccctgact tcctccaaca tgacgtcagt 8161 accattgttt cttgggctct cttcagcctt cctttcctaa tttcgaccat taaataactg 8221 acgactaagt tagtaacaat cagacaagca ctaagtcacg aatgacagaa gagcatttat 8281 ggtcccaggt cacttccaaa tgtctgggcc ccggagagaa ctcagcatag ccagggttcc 8341 tcttctaagc cttaggtctg gcccaaagag aagtgagttc aagttgggaa aaaaaagcta 8401 gaagggtacc aagagtgtaa ggaatggatt tgacgtgaat atttccctcc ctgcaggaaa 
8461 agaagaaccc caatactgag ttctacatga agaacttggt gctgactacc ctaaatctct 8521 tctttgccgg cacagagacc gtcagcacca ccctgcgtta cggctttctg ttgctcatga 8581 agcacccgga tattgagggt aacgaaggct tcaagggaca ggaaagcagg ggttgagggg 8641 agcataacct ggatatagat tctaaatttg catcttttat ctgccacaat gtcccctacc 8701 tttctcaaga aaccggactc tcaaacagct agctaccccc tgctttgcct agacagagcc 8761 acagctggtg ggcaccagtc tctagttgct taaccaactg gtgtgctgct cgccacactt 8821 cagaggagtt acttccccgt ctcccgggtc tcctcgcaac tcccgtacaa ttatggtatg 8881 atccatccag ctcacgggaa acttcttcac caatgcagga gttgcacata ttgaaaatga 8941 tctacaatgc catacactgt tccatggcct ccatcccgtc ttagtttctg aactatcatc 9001 tctactgtgg cgttcttgga ttcctaaacc ggtccctgtt cttctcagct aaggttcacg 9061 aggagattga ccgggtgatt ggcaggaacc ggcaggccaa gtatgaggac cgaatgaaga 9121 tgccctacac ggaggctgtg atccacgaga tccagagatt tgcagacatg atccccatgg 9181 gcctggctcg cagggtcacc aaggacacca agtttcgaga gttcctcctc cccaaggtgc 9241 tgattcacct agcctagagt gacctctaga cctcttcatg ggcctccagg acctcctgct 9301 cattagaaac cacccggccc ctctcttggt ttcttagtga caggctttct ctgtcaccac 9361 atcccactgc cctcctactt agagactcct atatcttgtc ttcaccccag actttcccac 9421 tcaggaattt taactccatg cctcttaaga agtataaact taatttatct agatcttgac 9481 tgtgttggag ataggagtct catgtctgct atatatcctg tttggggcta cacactttca 9541 atacccccaa atcttgactt gagaggcaca atttgcccca gaagacaaag cctgtattac 9601 tcatctgtcc tggctactag aacatgagtc ccacatcccc catatcttaa ggataacctc 9661 acattcggca aagctcctcc ctcaaaggtc ccctaactca ctatctttac cttcacttgg 9721 ataccctagc tcaccctttt aacccctgtg ggccagcagg acattcatcc aaacatacct 9781 catggagtct cttcttcatc accagggtac tgaagtattt cctatgctgg gctctgtact 9841 gaaagaccct aagttcttct ccaaccccaa cgacttcaac ccaaagcact tcctagatga 9901 caagggacag tttaagaaga gtgatgcctt tgtgcccttt tccattggta agaacccact 9961 gccgagtgcc agaccactgc tcacaccaac agataagctt ctccattgct tctgtctgat 10021 gcagccaaat atttctctct attttggaat attctcttgc atctaactct cggctagtta 10081 gctgtagctc ccataattcc caagcaccat gtatctgaac cctggagacg tgatggtcaa 10141 
actttagtga taggtgaagg gaatggaacc tcaaactaag ccagaggtct ttctcaggtc 10201 acacaaccag ttcttaaaaa ggatgtggcc acatggcagc cataccattc cctatagctg 10261 tagggaaagg tgtaaaagta ttttttaatc tcctgatgta aatgtctccg aggcttactg 10321 cctccctctg ctaacctagg cctagccctg gaagcttcta gcctctgaac aatcttatct 10381 agcctagaat gttttcagcc tctgagactt gctgctaaat aagctcacac cttcctagtt 10441 ccctctgatc tctggatggc tggttcaact cagctgtttt ggctcaaact cctctaacag 10501 attcagtctc tctctttgcc tctcctgagc tgctctactt ggctccatgc taactttggc 10561 aatctgttct gatcttctgt ctccttctca ttctctggct tgttctgtct tcacctgggc 10621 ctagctcgct ctctcagtaa cctgcctcta tataactgtc ccagttaaac tgcccccttt 10681 ctccatgttt ctctctcttc actgttctct ttgtctgcca ctcaattaga cgtcattttc 10741 aaacatgggt gctttcttct acaaactaac tttatcttca ttgtttggaa tttaaagtgt 10801 gtaccaaggg tttgcattcc agccagaggg attaaaggtt gtgtgctaag gctgagccac 10861 tccacaccta gaaacaggtt tttcagtaaa taacacaatc tcggggttca cagtgtgatc 10921 gaatatcctg caacagaagg gaatccacat ttctcattgt tacagcctgg tatggatcaa 10981 cccactacaa gtacaaatac aactgaggct caggagttag agcgttcctt aaacgtctca 11041 gcacttcctc ccatgaagaa tgcagccagg gatttgctat cggcaaaaac tggcttcacc 11101 ttcccctgtc ttgtctcttt aggaaaacgg tattgtttcg gggaaggact ggcaaggatg 11161 gaactctttc tcttcctcac aaacatcatg cagaacttct gcttcaaatc cccacaggca 11221 ccccaggaca tcgatgtgtc tcctagactt gtgggctttg ccacaatccc accaaactac 11281 actatgagtt tcttgtcccg ttgagccagc attgcgtgag agtaaaggga agaatggggg 11341 ttgaaactta gaaagggtcg gatgtactga agaaacagag gaaaaagagt agacctgctg 11401 aagatatata tttcaaaggt agagccagag aagggggaaa tatctactat actgtgaata 11461 ggaataataa aataataaaa gtagatatta tttatggcat aagtagtctt tgtcaagttt 11521 atgctaaggg gtcttacaat ctcacttaat actcagaagt gcatgggaag gggaagagag 11581 ttcatgctca ttttattagg gtcacattct tgatcttatg tttccctagt ctaagtttac 11641 caaagtctca cagaatccaa gagctcacta atatttggac aaagatctgt gttcattggc 11701 cttgggcaga ttttcaaata tcaccacctc tcagataaaa catgaaagct caagtctatt 11761 gccacatcat cctgggcaga tctttgggtg ggcaacctgc caggctctac 
cacttgcctg 11821 gccacactct gatctactta cagagaaaac tcagggcctg caccatgaca tgaatgtgtg 11881 ctgatttatg gatgcccacg taagtgtggt gcagggtttc ctcaagttat gcacgagtat 11941 attttctgtc atgagttggc ttccaaaggc caagtgtttg ctcacagata agagtcagtt 12001 aaagcagttg aaaagatgtg ctgtggtaca ctggtgtaaa tctggtgagt agatgcctgt 12061 gcgatgcagt ctcaacagca gtggacacag cagccagcac tttctaggct tcttactttt 12121 atgatgtcag tattagtcat acccaggagg aagggacaga tataaggata tcaaggtgac 12181 atctttgaga actgtattgc aagccctccc tggtagtaaa tttcaaccct gcaagctgca 12241 taagcttcct gggataatga agagaagaag gagggaacag gggtgaggat ggctggggag 12301 gggatggggt agagtaggga gaatgactca attagaaaag tgcttgccaa gcaataatga 12361 ggacccgagt ttggatcctc agcacccctc aaaagcccag cgtaatggca caagcctgtc 12421 accctgatgc tgtgaggtgc gtgaaaattg cctggtccct gaagatcact gggcagccca 12481 gtccagtcaa atggtgaact cgggatttag agaccctgtc tcaaagaaca aagtggagac 12541 aggaagatca acagagtcaa ctaacctggg ggctagctca ctagaccctt ggggctctca 12601 aagactgagc caccaaccaa ggtgcacacg tggcctggac agaggccccc agcacatacg 12661 tagcatatgt acatctcaat ctttatgtgg gttccccaac aactggagca ggagctatca 12721 taaagctatt gcctgtcagt ggaatccatt cccctaactg ggctgccttg tctcacctcc 12781 gtgggagagg gtgcacttag ccctacagag acttgatgca caaggaaggg aatataccca 12841 ggtgggcccc atcctctcag aagagaaggg gatgggagat gggggagagg tttgtaagga 12901 gggaacagga gttgggcatt gttcaggatg gaaaatgaat aaataaatta attaatgggg 12961 aaaactggag agaaactaaa gaagacatgt actgttgacc tgtggcctct acacacccat 13021 gtacacatgc acctgcatgc acatacaaaa atacacacac acacacacac acacacacac 13081 acacacacac acagactagc tcatttttag caactggcaa gaaagccttt gaaagtgagg 13141 gagggaaaaa aaactaacac attctttaat gctgaaaaac attaaaataa caaacaaatg 13201 aaaagctcct aaagacgtga cctgggctgc tgagacaggc cacattcaac caacgatgtt 13261 gcattacata gcagcagagc ttgcagcctt ctgaagggca tccgtgacct tcttcaggct 13321 aagctgtatg ttagggttca aagtttccca tcactttatg aatattagcc taaccagagt 13381 aactttgatg aacgagatgg ccagaaacca acagcaaagt caaaaccaca aaagccatgg 13441 gtatgtggaa attaagagac gtgccatcaa 
aattgtttct tttcaaataa aacttaggca 13501 tataccgagg tagtagagag acagttcaat ggttaaaagc acagaatgct catgcagagg 13561 acttgagttg ggtccctggc actcatgtct ggcagcctaa agtcagctgt aactaactaa 13621 ctaactaact aactccaggg gatccaatgc catcctctgg attccatggg gcctgcatgt 13681 atatatgcat atgcacacac acacacacac acacacacac acacaaagac acacacaaag 13741 atacacacaa agacacatga ctttaaaaat aagtcttttt aaaaatataa ctattaatga 13801 aattaatgaa gtgcaactaa tgttttccat acatgtataa attatgcaat gactgaatca 13861 tagcaacaaa aaatgtagac ttatacaaca gtggttcaaa gaaatattac atggtaaatc 13921 acaaaacact taaataatgg aacaaaaata aatacacaca gacttacagg atgagaacag 13981 ccacagcagg agatgcaggt aaactcatag catggttttt aaagaaatgt caaattcata 14041 aacggtgttt taaattttta cccactagat tgaaagactc actaaactga tgtttttttt 14101 tctctttttt gtgattatac atacttattt tactgggtat tttttattta catttcaaat 14161 gtgattccct ttcgaggttt cccggccata tgcactctat cacaactccc tccccttctt 14221 ctataagggt gttccctctc cccaaccacc ccaccttcct gccccccacc gacattcccc 14281 tgcactgggg gtccagcctc agcagggatc aaggg // LOCUS ACARRDX 119 bp ss-rRNA RNA 11-AUG-1990 DEFINITION A.castellani 5S ribosomal RNA. ACCESSION M28984 KEYWORDS 5S ribosomal RNA. SOURCE A.castellani ribosomal RNA from Viluy encephalomyelitis patient KPN.. ORGANISM Acanthamoeba castellanii Eukaryota; Animalia; Protozoa; Sarcomastigophora; Sarcodina; Rhizopoda; Lobosa; Gymnamoeba; Amoebida; Acanthopodina; Acanthamoebidae. REFERENCE 1 (bases 1 to 119) AUTHORS Chumakov,K.M. and Karavanov,A.S. TITLE Preliminary characterization of an organism isolated from a case of Viluy encephalomyelitis indicates a protozoal, rather than viral, aetiology JOURNAL J. Gen. Microbiol. 
132, 1127-1133 (1986) STANDARD simple staff_review FEATURES from to/span description rRNA 1 119 5S ribosomal RNA BASE COUNT 26 a 36 c 37 g 20 t ORIGIN 1 ggatacggcc atactgcgca gaaagcaccg cttcccatcc gaacagcgaa gttaagctgc 61 gcgaggcggt gttagtactg gggtgggcga ccacccggga atccaccgtg ccgtatcct // LOCUS BMOCHORB 538 bp ss-mRNA INV 11-AUG-1990 DEFINITION B.mori early chorion protein mRNA, 3' end. ACCESSION M36291 KEYWORDS early chorion protein; eggshell protein. SOURCE B.mori, cDNA to mRNA, clone m2G12. ORGANISM Bombyx mori Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta; Pterygota; Neoptera; Holometabola; Lepidoptera; Ditrysia; Bombycoidea; Bombycidae. REFERENCE 1 (bases 1 to 538) AUTHORS Eickbush,T.H., Rodakis,G.C., Lecanidou,R. and Kafatos,F.C. TITLE A complex set of early chorion DNA sequences from Bombyx mori JOURNAL Dev. Biol. 112, 368-376 (1985) STANDARD simple staff_review FEATURES from to/span description pept < 1 482 early chorion protein precursor (AA at 3) sigp < 1 44 early chorion protein (AA at 1) matp 45 479 early chorion protein signal 518 523 poly-A signal BASE COUNT 104 a 128 c 159 g 147 t ORIGIN 1 tcctcatctg tgcttcggca ttattcgtcc agtctgctgt cagtcaatat ctgagtcgcg 61 tcggctgtgg ttgtggaggg gttggcaatg gcttgggcta cggtggtatc ggctataacg 121 gctatgtcgg taacgatatc ggcgctgctg gagctctagg agcttccaat ggagggtgtc 181 tcaacgtcgt cagctcctct gccgctccta ctagtttagg tgtagcctct gagaactctt 241 acgagggaac tgtcggtgtc tgcggtaatc tgcccctctt gggtactgca atcgtcaccg 301 gcgagttctc tactggcggc cttggtggta ttaactacgg ctgcggtaat ggcgctgttg 361 ggatcaccgc tgaagatcgc gccggtatcg gctacgccgg cggcctcggc tacggatcag 421 gctacggact cggctacgga ggctacgccg gcaatggatg tggatgcggt ggcgcctatt 481 gaatgctaga atatatataa aaaaacacgt tttttttaat aaattccttt aaaaaaat // LOCUS BMODNA 586 bp ss-mRNA INV 11-AUG-1990 DEFINITION B.mori early chorion protein mRNA, 3' end. ACCESSION M36290 KEYWORDS early chorion protein; eggshell protein. SOURCE B.mori, cDNA to mRNA, clone m6F6. 
ORGANISM Bombyx mori Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta; Pterygota; Neoptera; Holometabola; Lepidoptera; Ditrysia; Bombycoidea; Bombycidae. REFERENCE 1 (bases 1 to 586) AUTHORS Eickbush,T.H., Rodakis,G.C., Lecanidou,R. and Kafatos,F.C. TITLE A complex set of early chorion DNA sequences from Bombyx mori JOURNAL Dev. Biol. 112, 368-376 (1985) STANDARD simple staff_review FEATURES from to/span description pept < 1 483 early chorion protein precursor (AA at 1) sigp < 1 39 early chorion protein (AA at 1) matp 40 480 early chorion protein signal 575 580 poly-A signal BASE COUNT 117 a 133 c 166 g 170 t ORIGIN 1 ttgtctggat gtgctctcgt ggctcagttc acagttggcc agtacatcgg ccgcgtgaac 61 aatggttgtg gatgcgggag tttcgacaac cgtggcctcg gttacaccgc tggctgtggt 121 ctcaccgctg ctagttctct tgcagcctcc catggaggag ggttattcgt cgtcacctcc 181 tctgccgcgc ctgctggtct cggcatagct tccgagaaca gatacgaagg cgctgtcgat 241 gtgtgcggca acattccatt cttgggcacc gctgatgtcg caggcgagtt ccccactgcg 301 ggcattggtg agatcaacta cggctgcggc gatggagcag tcgccattac cgctgaaggt 361 ggtctcggct acgctggagg acttgactac actggtggac tcggctatgc tagtggactt 421 ggctacggct taggctatgg agaatacgtt ggatgcagtt gtggttgtgg tgacatctac 481 tagaatggat attgttatat taatcttgta caagtatgta cattcaggtt gtaattttta 541 caagaattaa attttatttt tccgaataaa ctctggtata taattt // LOCUS BTHINSECT 977 bp ds-DNA BCT 11-AUG-1990 DEFINITION B.thuringiensis 27.3 Kd cytolytic insecticidal protein gene, complete cds. ACCESSION M35968 KEYWORDS 27.3 Kd cytolytic insecticidal protein. SOURCE B.thuringiensis morrisoni (mosquitocidal isolate PG-14) DNA. ORGANISM Bacillus thuringiensis Prokaryota; Bacteria; Firmicutes; Endospore-forming rods and cocci; Bacillaceae. REFERENCE 1 (bases 1 to 977) AUTHORS Galjart,N.J., Sivasubramanian,N. and Federici,B.A. TITLE Plasmid location, cloning, and sequence analysis of the gene encoding a 27.3-kilodalton cytolytic protein from Bacillus thuringiensis subsp. morrisoni (PG-14) JOURNAL Curr. Microbiol. 
16, 171-177 (1987) STANDARD simple staff_review FEATURES from to/span description pept 67 816 27.3 Kd cytolytic insecticidal protein BASE COUNT 349 a 161 c 149 g 318 t ORIGIN 1 aagtttaaaa tcataattta atgttgaaag gccactattc taattaactt aaggagttgt 61 ttatttatgg aaaatttaaa tcattgtcca ttagaagata taaaggtaaa tccatggaaa 121 acccctcaat caacagcaag ggttattaca ttacgtgttg aggatccaaa tgaaatcaat 181 aatcttcttt ctattaacga aattgataat ccgaattata tattgcaagc aattatgtta 241 gcaaatgcat ttcaaaatgc attagttccc acttctacag attttggtga tgccctacgc 301 tttagtatgg caaaaggttt agaaatcgca aacacaatta caccgatggg tgctgtagtg 361 agttatgttg atcaaaatgt aactcaaacg aataaccaag taagtgttat gattaataaa 421 gtcttagaag tgttaaaaac tgtattagga gttgcattaa gtggatctgt aatagatcaa 481 ttaactgcag cagttacaaa tacgtttaca aatttaaata ctcaaaaaaa tgaagcatgg 541 attttctggg gcaaggaaac tgctaatcaa acaaattaca catacaatgt cctgtttgca 601 atccaaaatg cccaaactgg tggcgttatg tattgtgtac cagttggttt tgaaattaaa 661 gtatcagcag taaaggaaca agttttattt ttcacaattc aagattctgc gagctacaat 721 gttaacatcc aatctttgaa atttgcacaa ccattagtta gctcaagtca gtatccaatt 781 gcagatctta ctagcgctat taatggaacc ctctaatctt agtagctata tttattaaag 841 atggtaatat cacaagtata aatacttgtg gtattaccta ccattcttaa attatatcca 901 aaatcatgcg ttaatctaca ttcccctttc tctaaaattt gttcttcaca catccacatt 961 tttcgataaa acaatct // LOCUS ECOPYRI 133 bp ds-DNA BCT 11-AUG-1990 DEFINITION E.coli ATCase regulatory protein (pyrI) gene, 3' end. ACCESSION M28578 KEYWORDS ATCase regulatory protein. SOURCE E.coli DNA. ORGANISM Escherichia coli Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Facultatively anaerobic rods; Enterobacteriaceae. REFERENCE 1 (bases 1 to 133) AUTHORS Cunin,R., Jacobs,A., Charlier,D., Crabeel,M., Herve,G., Glansdorff,N. and Pierard,A. TITLE Structure-function relationship in allosteric aspartate carbamoyltransferase from Escherichia coli: I. Primary structure of a pyrI gene encoding a modified regulatory subunit JOURNAL J. Mol. Biol. 
186, 707-713 (1985) STANDARD simple staff_review FEATURES from to/span description pept < 1 81 ATCase regulatory protein (pyrI) (EC 2.1.3.2) signal 100 129 transcription termination signal BASE COUNT 41 a 27 c 29 g 36 t ORIGIN 1 cgaaaacgcg ccaatgatat cgcgctcaaa tgcaaatact gtgaaaaaga gttttcccat 61 aatgtggtgc tggccaatta attgcggttg gtaataaaag ttggctccct ataatgagcc 121 agacttttta ccg // LOCUS HUMBHD 1269 bp ss-mRNA PRI 11-AUG-1990 DEFINITION Human placental 17-beta-hydroxysteroid dehydrogenase mRNA, complete cds. ACCESSION M36263 KEYWORDS placental 17-beta-hydroxysteroid dehydrogenase. SOURCE Human placenta, cDNA to mRNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 1269) AUTHORS Peltoketo,H., Isomaa,V., Maeentausta,O. and Vihko,R. TITLE Complete amino acid sequence of human placental 17-beta-hydroxysteroid dehydrogenase deduced from cDNA JOURNAL FEBS Lett. 239, 73-77 (1988) STANDARD simple staff_review FEATURES from to/span description pept 10 996 placental 17-beta-hydroxysteroid dehydrogenase precursor (EC 1.1.1.62) matp 13 993 placental 17-beta-hydroxysteroid dehydrogenase mRNA < 1 1269 placental 17-beta-hydroxysteroid dehydrogenase mRNA BASE COUNT 220 a 378 c 437 g 234 t ORIGIN 1 agtctcacca tggcccgcac cgtggtgctc atcaccggct gttcctcggg catcggcctg 61 cacttggccg tacgtctggc ttcagatcca tcccagagct tcaaagtgta tgccacgttg 121 agggacctga aaacacaggg ccggctgtgg gaggcggccc gggccctggc atgccctccg 181 ggatccctgg agacgttgca gctggacgta agggactcaa aatccgtggc cgctgcccgg 241 gaacgcgtga ctgagggccg cgtggacgtg ctggtgtgta acgcaggcct gggcctgctg 301 gggccgctgg aggcgctggg ggaggacgcc gtggcctctg tgctggacgt gaatgtagta 361 gggactgtgc ggatgctgca ggccttcctg ccagacatga agaggcgcgg ttcgggacgc 421 gtgttggtga ccgggagcgt gggaggattg atggggctgc ctttcaatga cgtttattgc 481 gccagcaagt tcgcgctcga aggcttatgc gagagtctgg cggttctgct gctgcccttt 541 ggggtccact tgagcctgat cgagtgcggc ccagtgcaca 
ccgccttcat ggagaaggtg 601 ttgggcagcc cagaggaggt gctggaccgc acggacatcc acaccttcca ccgcttctac 661 caatacctcg cccacagcaa gcaagtcttt cgcgaggcgg cgcagaaccc tgaggaggtg 721 gcggaggtct tcctcaccgc tttgcgcgcc ccgaagccga ccctgcgcta cttcaccacc 781 gagcgcttcc tgcccctgct gcggatgcgc ctggacgacc ccagcggctc caactacgtc 841 accgccatgc accgggaagt gttcggcgac gttccggcaa aggccgaggc tggggccgag 901 gctgggggcg gggccgggcc tggggcagag gacgaggccg ggcgcagtgc ggtgggggac 961 cctgagctcg gcgatcctcc ggccgccccg cagtaaaggc ttcctcagcc gctgtctccc 1021 gcgcccttct ttgtcccctg ggtctgtgtg gtccctgggg atggggcggc ggtagcagct 1081 gtgggtggct aattaagata gatcgcgtta gccagtttta ccagcgcagc taggcgcgat 1141 ggcgtcgcct gtaatgccag cgctttggga ggcggaggca ggaggatcgc tcaagccccg 1201 gagttggaga ccagccagag caacacagtg agacccccat ctctacaaaa ataaagaaaa 1261 tttaaaaat // LOCUS HUMC1Q 84 bp ss-mRNA PRI 11-AUG-1990 DEFINITION Human complement component 1, beta-chain (C1QB) mRNA, partial cds. ACCESSION M36278 KEYWORDS complement component 1. SOURCE Human liver, cDNA to mRNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 84) AUTHORS Reid,K.B.M., Bentley,D.R. and Wood,K.J. TITLE Cloning and characterization of the complementary DNA for the B chain of normal human serum C1q JOURNAL Philos. Trans. R. Soc. Lond., B, Biol. Sci. 306, 345-354 (1984) STANDARD simple staff_review FEATURES from to/span description pept < 1 > 84 complement 1-q subcomponent beta-chain (AA at 1) /hgml_locus_uid="LR0054Y" /nomgen="C1QB" /map="1p" BASE COUNT 18 a 25 c 22 g 19 t ORIGIN 1 gccaccgaca agaactcact actgggcatg gagggtgcca acagcatctt ttccgggttc 61 ctgctctttc cagatatgga ggcc // LOCUS HUMGAPDA 197 bp ss-mRNA PRI 11-AUG-1990 DEFINITION Human glyceraldehyde-3-phosphate dehydrogenase mRNA, 3' flank. ACCESSION M36164 KEYWORDS glyceraldehyde-3-phosphate dehydrogenase. 
SOURCE Human pancreatic adenocarcinoma cell line HPAF, cDNA to mRNA, clone pII5B. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 197) AUTHORS Schek,N., Hall,B.L. and Finn,O.J. TITLE Increased glyceraldehyde-3-phosphate dehydrogenase gene expression in human pancreatic adenocarcinoma JOURNAL Cancer Res. 48, 6354-6359 (1988) STANDARD simple staff_review FEATURES from to/span description signal 175 180 poly-A signal /hgml_locus_uid="LM0055R" /nomgen="GAPD" /map="12p13" BASE COUNT 50 a 69 c 46 g 31 t 1 others ORIGIN 1 ccctggacca ccagccccag caagagcaca agaggaagag agagaccctc actgctgggg 61 agtccctgcc acactcagtc ccccaccaca ctgaatctcc cctcctcaca gttgccatgt 121 agaccccttg aagaggggag gggcctaggg agccgcacct tgtcatgtac catcaataaa 181 gtaccctgtg ctnaacc // LOCUS HUMGYPAA 445 bp ss-mRNA PRI 11-AUG-1990 DEFINITION Human glycophorin A mRNA, partial cds. ACCESSION M36281 KEYWORDS glycophorin A. SOURCE Human, cDNA to mRNA, clone pHGpA-3. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 445) AUTHORS Siebert,P.-D. and Fukuda,M. TITLE Molecular biological study of the structure and expression of human glycophorin A JOURNAL Rev. Fr. Transfus. Immunohematol. 
24, 251-266 (1986) STANDARD simple staff_review FEATURES from to/span description pept < 1 > 445 glycophorin A precursor (AA at 2) /hgml_locus_uid="LW0071X" /nomgen="GYPA" /map="4q28-q31" sigp < 1 67 glycophorin A signal peptide (AA at 1) matp 68 > 445 glycophorin A BASE COUNT 135 a 103 c 84 g 123 t ORIGIN 1 tgatctcagg atgtatggaa aaataatctt tgtattacta ttgtcagcaa ttgtgagcat 61 atcagcatca agtaccactg gtgtggcaat gcacacttca acctcttctt cagtcacaaa 121 gagttacatc tcatcacaga caaatgatac gcacaaacgg gacacatatg cagccactcc 181 tagagctcat gaagtttcag aaatttctgt tagaactgtt taccctccag aagaggaaac 241 cggagaaagg gtacaacttg cccatcattt ctctgaacca gagataacac tcattatttt 301 tggggtgatg gctggtgtta ttggaacgat cctcttaatt tcttacggta ttcgccgact 361 gataaagaaa agcccatctg atgtaaaacc tctcccctca cctgacacag acgtgccttt 421 aagttctgtt gaaattgaga acccc // LOCUS HUMGYPCAA 915 bp ss-mRNA PRI 11-AUG-1990 DEFINITION Human glycophorin C mRNA, complete cds. ACCESSION M36284 KEYWORDS glycophorin C; syaloglycoprotein. SOURCE Human reticulocytes and fetal liver, cDNA to mRNA, clones pGCW5 and pGCF22. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 915) AUTHORS Cartron,J.-P., Colin,Y., Le Van Kim,C., Rahuel,C., Blanchard,D., Bloy,C. and London,J. TITLE Structure of human erythrocyte glycophorin C deduced from cDNA analysis JOURNAL Rev. Fr. Transfus. Immunohematol. 24, 267-285 (1986) STANDARD simple staff_review COMMENT The first 6 nucleotides at the 5' end were not present in the cDNA clones but were deduced from the known protein sequence. 
FEATURES from to/span description pept 1 387 glycophorin C /hgml_locus_uid="LM0084R" /nomgen="GYPC" /map="2q14-q21" mRNA < 1 915 glycophorin C mRNA BASE COUNT 228 a 249 c 262 g 176 t ORIGIN 1 atgtggtcga cgagaagccc caacagcacg gcgtggcctc tcagcctcga gcctgatccg 61 gggatggcct ctgcctccac cacaatgcat actaccacca ttgcagagcc tgatccaggg 121 atgtctggat ggccggatgg cagaatggag acctccaccc ccaccataat ggacattgtc 181 gtcattgcag gtgtgattgc tgctgtggcc atcgtcctag tctccctcct cttcgtcatg 241 ctgcgctaca tgtaccggca caagggcacg taccacacca atgaggccaa gggcacggag 301 tttgctgaga gtgcagatgc agccctgcag ggagaccctg ccctccaaga tgctggtgat 361 agcagcagaa aggagtactt tatttgaggg acaacagact tcacttccct gaatgcctcc 421 cccatctcca tcaggaaaaa tacaccccat cgcccagtat ccccgtcgat accaccagac 481 agagagagag agtacacttg atttcttccc gagatagtta cctagaaaca ctaggtgcct 541 gcccaaggag gaacggagga ggactcgcgc tacaagaggc cactcccagg gacccaggga 601 ggcgatggcc accccagagg ccaccttttg ctccacggag gtgggagaga atctgggcac 661 atggggcccc ctagggcagt gcaggacaac atcagctcac tggcaggaaa gtccttgttg 721 agggtgaggg ggtgctgggg tacccggggg ctggggaagc aaggaaataa gtcatctgta 781 tgctgactgg ggataatggc atcaatgtca gtccttgact ttggggggaa cagcaggtgc 841 cagagctaaa aggtaccttt gtctgccatt gatccagcta agaacgattg gaaataaatt 901 ggaaatgtaa ccgag // LOCUS HUMIGDD 102 bp ds-DNA PRI 11-AUG-1990 DEFINITION Human Ig rearranged heavy-chain gene V-D-J region, partial cds. ACCESSION M36090 KEYWORDS diversity region; immunoglobulin heavy-chain; joining exon; processed gene; variable region. SOURCE Human acute lymphoblastic leukemia B-cell line DNA, from patient AW. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 102) AUTHORS Bird,J., Galili,N., Link,M., Stites,D. and Sklar,J. TITLE Continuing rearrangement but absence of somatic hypermutation in immunoglobulin genes of human B cell precursor leukemia JOURNAL J. Exp. Med. 
168, 229-245 (1988) STANDARD simple staff_review FEATURES from to/span description pept < 1 > 102 Ig H-chain VDJ-region (AA at 2) /hgml_locus_uid="LR0220V" /nomgen="IGH@" /map="14q32.33" BASE COUNT 26 a 23 c 36 g 17 t ORIGIN 1 gagagctgag gacacggctg tgtattactg tgcgaaagat cgagcagcaa cggacgggct 61 actactacgg tatggacgtc tggggccaag ggaccacggt ca // LOCUS HUMIGDE 100 bp ds-DNA PRI 11-AUG-1990 DEFINITION Human Ig rearranged heavy-chain gene V-D-J region, partial cds. ACCESSION M36091 KEYWORDS diversity region; immunoglobulin heavy-chain; joining exon; processed gene; variable region. SOURCE Human acute lymphoblastic leukemia B-cell line DNA, from patient AW. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 100) AUTHORS Bird,J., Galili,N., Link,M., Stites,D. and Sklar,J. TITLE Continuing rearrangement but absence of somatic hypermutation in immunoglobulin genes of human B cell precursor leukemia JOURNAL J. Exp. Med. 168, 229-245 (1988) STANDARD simple staff_review FEATURES from to/span description pept < 1 > 100 Ig H-chain V-D-J region (AA at 1) /hgml_locus_uid="LR0220V" /nomgen="IGH@" /map="14q32.33" BASE COUNT 22 a 24 c 30 g 24 t ORIGIN 1 cctgcagctg aactctgtaa ctcccgagga cacggctgtg tattactgtg caagaggact 61 gggtggcttt tgatatctgg ggccaaggga caatcgtcac // LOCUS HUMIGDF 101 bp ds-DNA PRI 11-AUG-1990 DEFINITION Human Ig rearranged heavy-chain gene V-D-J region, partial cds. ACCESSION M36092 KEYWORDS diversity region; immunoglobulin heavy-chain; joining exon; processed gene; variable region. SOURCE Human acute lymphoblastic leukemia B-cell line DNA, from patient AW. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 101) AUTHORS Bird,J., Galili,N., Link,M., Stites,D. and Sklar,J. 
TITLE Continuing rearrangement but absence of somatic hypermutation in immunoglobulin genes of human B cell precursor leukemia JOURNAL J. Exp. Med. 168, 229-245 (1988) STANDARD simple staff_review FEATURES from to/span description pept < 1 > 101 Ig heavy-chain V-D-J region (AA at 3) /hgml_locus_uid="LR0220V" /nomgen="IGH@" /map="14q32.33" BASE COUNT 17 a 27 c 34 g 23 t ORIGIN 1 ccgccgcgga cacggccgtg tattactgtg ccagggtagg atattgtact aatggtgttt 61 ccgggatggt tcgacccctg gggccaggga accctggtca c // LOCUS HUMIGDG 100 bp ds-DNA PRI 11-AUG-1990 DEFINITION Human Ig rearranged heavy-chain gene V-D-J region, partial cds. ACCESSION M36093 KEYWORDS diversity region; immunoglobulin heavy-chain; joining exon; processed gene; variable region. SOURCE Human acute lymphoblastic leukemia B-cell line DNA, from patient TD. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 100) AUTHORS Bird,J., Galili,N., Link,M., Stites,D. and Sklar,J. TITLE Continuing rearrangement but absence of somatic hypermutation in immunoglobulin genes of human B cell precursor leukemia JOURNAL J. Exp. Med. 168, 229-245 (1988) STANDARD simple staff_review FEATURES from to/span description pept < 1 > 100 Ig heavy-chain V-D-J region (AA at 1) /hgml_locus_uid="LR0220V" /nomgen="IGH@" /map="14q32.33" BASE COUNT 22 a 34 c 25 g 19 t ORIGIN 1 ccatcacact cctacctgca tggagcacct aaggcctcgg acaccgccat gtattactgt 61 gcgaccattg actactgggg ccagggaacc ctggtcacgg // LOCUS HUMIGDH 100 bp ds-DNA PRI 11-AUG-1990 DEFINITION Human Ig rearranged heavy-chain gene V-D-J region, partial cds. ACCESSION M36094 KEYWORDS diversity region; immunoglobulin heavy-chain; joining exon; processed gene; variable region. SOURCE Human acute lymphoblastic leukemia B-cell line DNA, from patient TD. 
ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 100) AUTHORS Bird,J., Galili,N., Link,M., Stites,D. and Sklar,J. TITLE Continuing rearrangement but absence of somatic hypermutation in immunoglobulin genes of human B cell precursor leukemia JOURNAL J. Exp. Med. 168, 229-245 (1988) STANDARD simple staff_review FEATURES from to/span description pept < 1 > 100 Ig heavy-chain V-D-J region (AA at 1) /hgml_locus_uid="LR0220V" /nomgen="IGH@" /map="14q32.33" BASE COUNT 23 a 30 c 24 g 23 t ORIGIN 1 ccatcacact cctacctgca tggagcacct aaggcctcgg acaccgccat gtattactgt 61 gcgagggggg atggttacta ctactactac atggacgtct // LOCUS HUMIGDI 100 bp ds-DNA PRI 11-AUG-1990 DEFINITION Human Ig rearranged heavy-chain gene V-D-J region, partial cds. ACCESSION M36095 KEYWORDS diversity region; immunoglobulin heavy-chain; joining exon; processed gene; variable region. SOURCE Human acute lymphoblastic leukemia B-cell line DNA, from patient TD. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 100) AUTHORS Bird,J., Galili,N., Link,M., Stites,D. and Sklar,J. TITLE Continuing rearrangement but absence of somatic hypermutation in immunoglobulin genes of human B cell precursor leukemia JOURNAL J. Exp. Med. 168, 229-245 (1988) STANDARD simple staff_review FEATURES from to/span description pept.ps < 1 > 100 Ig heavy-chain V-D-J region (AA at 3) /hgml_locus_uid="LR0220V" /nomgen="IGH@" /map="14q32.33" BASE COUNT 25 a 22 c 30 g 23 t ORIGIN 1 aaatgaacag cctgagagct gaggacacgg ctgtgtattg aacccacggt ggtagctgct 61 actctgcttt tgatatctgg ggccaaggga caatggtcac // LOCUS HUMIGDJ 100 bp ds-DNA PRI 11-AUG-1990 DEFINITION Human Ig rearranged heavy-chain gene V-D-J region, partial cds. 
ACCESSION M36096 KEYWORDS diversity region; immunoglobulin heavy-chain; joining exon; processed gene; variable region. SOURCE Human acute lymphoblastic leukemia B-cell line DNA, from patient TD. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 100) AUTHORS Bird,J., Galili,N., Link,M., Stites,D. and Sklar,J. TITLE Continuing rearrangement but absence of somatic hypermutation in immunoglobulin genes of human B cell precursor leukemia JOURNAL J. Exp. Med. 168, 229-245 (1988) STANDARD simple staff_review FEATURES from to/span description pept < 1 > 100 Ig heavy-chain V-D-J region (AA at 1) /hgml_locus_uid="LR0220V" /nomgen="IGH@" /map="14q32.33" BASE COUNT 24 a 23 c 30 g 23 t ORIGIN 1 tctgagagcc gaggacacgg ctgtgtatta ctgtgcaaca gggccgtcac tggttattat 61 aaactactac tacggtatgg acgtctgggg ccaagggcca // LOCUS HUMIGDK 100 bp ds-DNA PRI 11-AUG-1990 DEFINITION Human Ig rearranged heavy-chain gene V-D-J region, partial cds. ACCESSION M36097 KEYWORDS diversity region; immunoglobulin heavy-chain; joining exon; processed gene; variable region. SOURCE Human acute lymphoblastic leukemia B-cell line DNA, from patient TD. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 100) AUTHORS Bird,J., Galili,N., Link,M., Stites,D. and Sklar,J. TITLE Continuing rearrangement but absence of somatic hypermutation in immunoglobulin genes of human B cell precursor leukemia JOURNAL J. Exp. Med. 
168, 229-245 (1988) STANDARD simple staff_review FEATURES from to/span description pept < 1 > 100 Ig heavy-chain V-D-J region (AA at 2) /hgml_locus_uid="LR0220V" /nomgen="IGH@" /map="14q32.33" BASE COUNT 26 a 24 c 33 g 17 t ORIGIN 1 ggcgtatctg caaatgaaca gcctgaaaac cgaggacacg gccgtgtatt actgtactag 61 acgggacggt atggacgtct ggggccaagg gaccacggtc // LOCUS HUMIGDL 100 bp ds-DNA PRI 11-AUG-1990 DEFINITION Human Ig rearranged heavy-chain gene V-D-J region, partial cds. ACCESSION M36098 KEYWORDS diversity region; immunoglobulin heavy-chain; joining exon; processed gene; variable region. SOURCE Human acute lymphoblastic leukemia B-cell line DNA, from patient TD. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 100) AUTHORS Bird,J., Galili,N., Link,M., Stites,D. and Sklar,J. TITLE Continuing rearrangement but absence of somatic hypermutation in immunoglobulin genes of human B cell precursor leukemia JOURNAL J. Exp. Med. 168, 229-245 (1988) STANDARD simple staff_review FEATURES from to/span description pept < 1 > 100 Ig heavy-chain V-D-J region (AA at 1) /hgml_locus_uid="LR0220V" /nomgen="IGH@" /map="14q32.33" BASE COUNT 21 a 18 c 33 g 28 t ORIGIN 1 acggccgtgt attactgtgc gagagatagg ggatttctac gagtagtggt ggtagctgct 61 actctgcttt tgatatctgg ggccaaggga caatggtcac // LOCUS HUMIGDM 100 bp ds-DNA PRI 11-AUG-1990 DEFINITION Human Ig rearranged heavy-chain gene D-J region, partial cds. ACCESSION M36099 KEYWORDS diversity region; immunoglobulin heavy-chain; joining exon; processed gene. SOURCE Human acute lymphoblastic leukemia B-cell line DNA, from patient TD. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 100) AUTHORS Bird,J., Galili,N., Link,M., Stites,D. and Sklar,J. 
TITLE Continuing rearrangement but absence of somatic hypermutation in immunoglobulin genes of human B cell precursor leukemia JOURNAL J. Exp. Med. 168, 229-245 (1988) STANDARD simple staff_review FEATURES from to/span description pept < 1 > 100 Ig heavy-chain D-J region (AA at 1) /hgml_locus_uid="LR0220V" /nomgen="IGH@" /map="14q32.33" BASE COUNT 24 a 15 c 29 g 32 t ORIGIN 1 ggtaggttta gaatgaggtc tgtgtcactg tggtattacg atattttgac tggttattat 61 aaactactac tacggtatgg acgtctgggg ccaagggcca // LOCUS LAMPYRIB 135 bp ds-DNA PHG 11-AUG-1990 DEFINITION Bacteriophage lambda mutant truncated regulatory protein (pyrI) gene from E.coli, 3' end. ACCESSION M28579 KEYWORDS ATCase regulatory protein. SOURCE Bacteriophage lambda (strain lambda OTC3) DNA. ORGANISM Bacteriophage lambda Viridae; ds-DNA nonenveloped viruses; Siphoviridae. REFERENCE 1 (bases 1 to 135) AUTHORS Cunin,R., Jacobs,A., Charlier,D., Crabeel,M., Herve,G., Glansdorff,N. and Pierard,A. TITLE Structure-function relationship in allosteric aspartate carbamoyltransferase from Escherichia coli: I. Primary structure of a pyrI gene encoding a modified regulatory subunit JOURNAL J. Mol. Biol. 186, 707-713 (1985) STANDARD simple staff_review FEATURES from to/span description pept < 1 75 ATCase regulatory protein BASE COUNT 54 a 24 c 22 g 35 t ORIGIN 1 cgaaaacgcg ccaatgatat cgcgctcaaa tgcaaatact gtgaaaaaga gttttatact 61 aagttggcat tataaaaaag cattgcttat caatttgttg caagcaacag gtcactatca 121 gtcaaaataa aatca // LOCUS MUSBGALA 240 bp ss-mRNA ROD 11-AUG-1990 DEFINITION Mouse beta-1,4-galactosyltransferase mRNA, 5' end. ACCESSION M36289 KEYWORDS beta-1,4-galactosyltransferase. SOURCE Mouse, cDNA to mRNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 240) AUTHORS Shaper,J.H., Hollis,G.F. and Shaper,N.L. 
TITLE Evidence for two forms of murine beta-1,4-galactosyltransferase based on cloning studies JOURNAL Biochimie 70, 1683-1688 (1988) STANDARD simple staff_review FEATURES from to/span description pept 52 > 240 beta-1,4-galactosyltransferase (long form) precursor sigp 52 111 beta-1,4-galactosyltransferase (long form) signal peptide (put.) matp 112 > 240 beta-1,4-galactosyltransferase (long form) pept 91 > 240 beta-1,4-galactosyltransferase (short form) BASE COUNT 27 a 91 c 77 g 45 t ORIGIN Chromosome 4. 1 gggccgtcct ctcagccgta gcccaccccc tcttaaagcc gcggcgggaa gatgaggttt 61 cgtgagcagt tcctgggcgg cagcgccgcg atgccgggcg cgaccctgca gcgggcctgc 121 cgcctgctcg tggccgtctg cgcgctgcac ctcggcgtca ccctcgtcta ttacctctct 181 ggccgcgatc tgagccgcct gccccagttg gtcggagtct cctctacact gcagggcggc // LOCUS MUSIGHADN 348 bp ds-DNA ROD 11-AUG-1990 DEFINITION Mouse rearranged Ig heavy-chain gene V-D-J3 region, partial cds. ACCESSION M36103 KEYWORDS immunoglobulin heavy-chain; joining exon; processed gene; variable region. SOURCE Mouse hybridoma GA733 DNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 348) AUTHORS Caton,A.J. 
TITLE Comparative sequences analysis of CO17-1A antigen-specific monoclonal antibodies JOURNAL Hybridoma 5, 11-16 (1986) STANDARD simple staff_review FEATURES from to/span description pept < 1 > 348 Ig heavy-chain V-J region (AA at 1) recomb 294 303 V-region end/D-region start recomb 303 304 D-region end/J3-region start BASE COUNT 91 a 83 c 91 g 83 t ORIGIN 1 caggttcagc tacagcagtc tggagctgag gtgatgaggc ctggggcctc agtgaagata 61 tcctgcaagg ctactggcta cacattcact aggtactaca tacaatgggg taaaaacaga 121 gggcctggac atggccttga gtggattgga gagattttac ctggaactct tactaattac 181 aatgagaaat tcaagggcaa ggccgcattc actgcagata gatcctccaa cacagcctac 241 atgcaactca gcagccttac atctgaggac tctgccgtct attactgtgc aagagatggt 301 ccctggtttg cttactgggg ccaagggact ctggtcactg tctctgca // LOCUS MUSIGHADO 348 bp ds-DNA ROD 11-AUG-1990 DEFINITION Mouse rearranged Ig heavy-chain gene V-D-J3 region, partial cds. ACCESSION M36104 KEYWORDS immunoglobulin heavy-chain; joining exon; processed gene; variable region. SOURCE Mouse hybridoma CO17A-1A DNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 348) AUTHORS Caton,A.J. 
TITLE Comparative sequences analysis of CO17-1A antigen-specific monoclonal antibodies JOURNAL Hybridoma 5, 11-16 (1986) STANDARD simple staff_review FEATURES from to/span description pept < 1 > 348 Ig heavy-chain V-J region (AA at 1) recomb 294 303 V-region end/D-region start recomb 303 304 D-region end/J3-region start BASE COUNT 81 a 79 c 101 g 87 t ORIGIN 1 caggtccagt tgcagcagtc tttagctgag ctggtaaggc ctgggacttc agtgaaggtg 61 tcctgcaagg cttctggata cgccttcact aattacttga tagagtgggt aaagcagagg 121 cctggacagg gccttgagtg gattggggtg attaatcctg gaagtggtgg tactaactac 181 aatgagaagt tcaagggcaa ggcaacactg actgcagaca aatcctccag cactgcctac 241 atgcagctca gcagcctgac atctgatgac tctgcggtct atttctgtgc aagagatggt 301 ccctggtttg cttactgggg ccaagggact ctggtcactg tctctgca // LOCUS RICGICR 524 bp ds-DNA PLN 11-AUG-1990 DEFINITION O.sativa Gly-tRNA gene. ACCESSION M35993 KEYWORDS transfer RNA-Gly. SOURCE O.sativa DNA. ORGANISM Oryza sativa Eukaryota; Plantae; Embryobionta; Magnoliophyta; Liliopsida; Commelinidae; Cyperales; Poaceae. REFERENCE 1 (bases 1 to 524) AUTHORS Reddy,P.S. and Padayatty,J.D. TITLE Effects of 5' flanking sequences and changes in the 5' internal control region on the transcription of rice Gly-tRNA-GCC gene JOURNAL Plant Mol. Biol. 
11, 575-583 (1988) STANDARD simple staff_review FEATURES from to/span description tRNA 381 451 Gly-tRNA anticdn 413 415 Gly-tRNA anticodon cgg BASE COUNT 166 a 72 c 98 g 187 t 1 others ORIGIN 1 gcgcatgttt gtttcagctt aagattattg taatntagat tatttaaatc aagattactt 61 taagtttgga ttataataag ctgatataaa ataagctaca agttgtttgt ttttctagat 121 tattaaaggc atctaagggt agtgggtttt tagccacttc aattaatctg gaaaaagctc 181 atatagaaaa gattattgga ttatagtaat ctggcttata gttataataa tctagcatca 241 ataatctact tgtttgtttc agcttacttc taataattta gtttataata attctgaaac 301 tgaattaaac agggtcttag acatatgagt tggatttgtg tagagctgaa attagtttac 361 tcgtagaatc aagaacgaaa gcaccagtgg tctagtggta gaatagtacc ctgccacggt 421 acagacccgg gttcgattcc cggctggtgc attattaatt tttggctctt ccattttaga 481 taaagacaat tggcaacttg atgcatttat gaacaaatct gcgc // LOCUS M27244 1413 bp ds-DNA UNA 11-AUG-1990 DEFINITION Figure 2. The sequence and predicted coding capacity of the HOX-2.5 h14 cDNA. ACCESSION M34857 M27244 REFERENCE 1 (bases 1 to 1413) AUTHORS Bogarad,L.D., Utset,M.F., Awgulewitsch,A., Miki,T., Hart,C.P. and Ruddle,F.H. TITLE The developmental expression pattern of a new murine homeo box gene: hox-2.5 JOURNAL Dev. Biol. 
133, 537-549 (1989) STANDARD unannotated staff_entry COMMENT FEATURES from to/span description BASE COUNT 366 a 386 c 311 g 350 t ORIGIN 1 gcgccggggg agctgctcaa acagggcacg cccgagtaca gtttggaaac ttcggcgggc 61 agggaggctg tcctgtctaa tcaaagagct ggctacgggg acaataaaat ttgcgaagga 121 agcgaggaca aagagaggcc gggtcaaacc aacccctctg ccaactggct gcacgcccgc 181 tcttcccgga aaaagcgctg tccctacacc aaataccaga cgctggagct agagaaggag 241 tttctgttca atatgtacct caccagggac cgcaggcatg aagtggccag actcctcaat 301 ctgagtgaga gacaagtcaa aatctggttt cagaaccggc ggatgaaaat gaagaagatg 361 aataacggag caggcaaaga gtaaagatga ccaccccccc cttccccagc tcactcttat 421 tatttatgtg atggtcaaaa agccactgct gtctgggtgt acccaacgtg agtggggaag 481 agtatctcct ctttaaaatc cctcatctgc acctggagcc tgtctcctct cctttgcgcc 541 tctctctctc tctctctctc tctctctctc tctctctctc tctctctcca ctctcctggt 601 gttaagagga aattttatta agttagaagc tagctatact tggttcctag aaaagtgatg 661 agcctcgctc cgaggaaaga gaccctagtc gagctcctag catccctgct aatttttctg 721 gaaaacccca gcccctctct tccagtctgc ctgttccctc cctgtaacct agccagacgc 781 cacccaggta cagtttaact gcacacagcc cagtggatgc ccacagacat agagtttgga 841 gtctatgcta atcatcgtgg agaaaggaca tcaagacgtt gttcatcaaa ccaagcaggg 901 cagctctcag atcaggttca ttttggttgt gccctgatcc aaacacacac taggaagatg 961 tctagagatg ccctcaatga ggcactccag tcccaaagtc tcaacatccc aatatgtatc 1021 ctatggaatt gccagctaga cccatatctg gggaggcctg tgggagatgt ggccattcct 1081 gtccccaagc tgttgtgagc tagaaagggc aagaaaaagc agggagtggt tttatgaagg 1141 taccattagt ggagtcccct cgtattttcc ctgcaggaag tgtttaattc tgcatgtttc 1201 ctttccttcc aactaaagga ggtcaaagtg ggttgtagag ccctgacagt gctgtttccc 1261 cattcataca ttcctatcct catttgacag aatgtaactt ttcttgctga ccacagtctc 1321 agatttctct ctatcctcca actttgaaat caactctcct ttccaccaaa cccatcttgc 1381 accagcaaag acaagtctcc cattgtaaaa aaa // LOCUS CHKBGLOA 168 bp ds-DNA VRT 11-AUG-1990 DEFINITION Chicken beta-3-globin gene, exon N. ACCESSION M33204 KEYWORDS beta-3-globin. SOURCE Chicken DNA. 
ORGANISM Gallus gallus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Aves; Neornithes; Neognathae; Galliformes; Phasianidae. REFERENCE 1 (bases 1 to 168) AUTHORS Dolan,M., Sugarman,B.J., Dodgson,J.B. and Engel,J.D. TITLE Chromosomal arrangement of the chicken beta-type globin genes JOURNAL Cell 24, 669-677 (1981) STANDARD simple staff_review FEATURES from to/span description pept / 7 / 168 beta-3-globin, exon N (AA at 7) IVS < 1 6 beta-3-globin intron N (no splice consensus) BASE COUNT 37 a 58 c 38 g 35 t ORIGIN 1 tacaggctgc tgatcgtcta cccctggacc cagaggttct ttgcgtcctt tgggaacctc 61 tccagcccca ctgccatcct tggcaacccc atggtccgcg cccacggcaa gaaagtgctc 121 acctcctttg gggatgctgt gaagaaccta gacaacatca agaacacc // LOCUS CHKBGLOB 90 bp ds-DNA VRT 11-AUG-1990 DEFINITION Chicken beta-1-globin gene, exon N. ACCESSION M33205 KEYWORDS beta-1-globin. SOURCE Chicken DNA. ORGANISM Gallus gallus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Aves; Neornithes; Neognathae; Galliformes; Phasianidae. REFERENCE 1 (bases 1 to 90) AUTHORS Dolan,M., Sugarman,B.J., Dodgson,J.B. and Engel,J.D. TITLE Chromosomal arrangement of the chicken beta-type globin genes JOURNAL Cell 24, 669-677 (1981) STANDARD simple staff_review FEATURES from to/span description pept / 1 / 90 beta-1-globin, exon N (AA at 1) BASE COUNT 22 a 29 c 20 g 19 t ORIGIN 1 aggttctttg ataacttcgg gaacctctcc agccccaccg ccatcattgg taaccccaag 61 gtccgtgctc acggcaaaaa agtgctgagc // LOCUS CHKBGLOC 234 bp ds-DNA VRT 11-AUG-1990 DEFINITION Chicken beta-2-globin gene, exon N. ACCESSION M33206 KEYWORDS beta-2-globin. SOURCE Chicken DNA. ORGANISM Gallus gallus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Aves; Neornithes; Neognathae; Galliformes; Phasianidae. REFERENCE 1 (bases 1 to 234) AUTHORS Dolan,M., Sugarman,B.J., Dodgson,J.B. and Engel,J.D. 
TITLE Chromosomal arrangement of the chicken beta-type globin genes JOURNAL Cell 24, 669-677 (1981) STANDARD simple staff_review FEATURES from to/span description pept / 7 / 228 beta-2-globin, exon N (AA at 7) IVS < 1 6 beta-2-globin intron N (no splice consensus) IVS 229 > 234 beta-2-globin intron N+1 BASE COUNT 53 a 68 c 65 g 48 t ORIGIN 1 tgcaggctgc tgatcgtcta cccctggacc cagaggttct ttgcgtcctt tgggaacctg 61 tccagtgcca ccgccatcat tggcaacccc atggttcgcg cccacggcaa gaaagtgctg 121 agctcctttg gggaggctgt aaagaacctg gacaacatca agaagagctt tgcccagctg 181 agcaaactgc actgtgacaa gctgcacgtg gaccccgaga acttcagggt gagt // LOCUS CHKBGLOD 121 bp ds-DNA VRT 11-AUG-1990 DEFINITION Chicken beta-4-globin gene, exon N. ACCESSION M33207 KEYWORDS beta-4-globin. SOURCE Chicken DNA. ORGANISM Gallus gallus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Aves; Neornithes; Neognathae; Galliformes; Phasianidae. REFERENCE 1 (bases 1 to 121) AUTHORS Dolan,M., Sugarman,B.J., Dodgson,J.B. and Engel,J.D. TITLE Chromosomal arrangement of the chicken beta-type globin genes JOURNAL Cell 24, 669-677 (1981) STANDARD simple staff_review FEATURES from to/span description pept / 1 / 121 beta-4-globin, exon N (AA at 1) BASE COUNT 19 a 43 c 29 g 30 t ORIGIN 1 ctccttgggg acatcctgat catcgtcctg gcctcccact ttcccaggga tttcactcct 61 gcctgccaat ttgcctggca gaagctggtc aacgttgtgg cccatgccca ggcccgcaag 121 t // LOCUS ECOTRYA 87 bp ss-RNA RNA 11-AUG-1990 DEFINITION E.coli Tyr-tRNA. ACCESSION M35994 KEYWORDS transfer RNA-Tyr. SOURCE E.coli. ORGANISM Escherichia coli Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Facultatively anaerobic rods; Enterobacteriaceae. REFERENCE 1 (bases 1 to 87) AUTHORS Doctor,B.P., Loebel,J.E., Sodd,M.A. and Winter,D.B. 
TITLE Nucleotide sequence of Escherichia coli tyrosine transfer ribonucleic acid JOURNAL Science 163, 693-695 (1969) STANDARD simple staff_review FEATURES from to/span description tRNA 1 87 Tyr-tRNA anticdn 36 38 Tyr-tRNA anticodon rta modified 8 8 p modified 18 18 gm modified 20 20 q modified 41 41 p modified 66 66 p BASE COUNT 19 a 27 c 24 g 16 t 1 others ORIGIN 1 gggtgggttc cccgagcggg caaagggagc agactrtata tctgccggtc acagacttcg 61 aaggttcgaa tccttcccca cccacca // LOCUS HUMMHDCBA 142 bp ss-mRNA PRI 11-AUG-1990 DEFINITION Human MHC class II HLA-DC-beta chain mRNA, 5' end. ACCESSION M35982 KEYWORDS cell surface glycoprotein; class II gene; integral membrane protein; major histocompatibility complex. SOURCE Human, cDNA to mRNA, clone pII-beta-5. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 142) AUTHORS Gustafsson,K., Wiman,K., Larhammar,D., Rask,L. and Peterson,P.A. TITLE Signal sequences distinguish class II histocompatibility antigen beta chains of different loci JOURNAL Scand. J. Immunol. 19, 91-97 (1984) STANDARD simple staff_review FEATURES from to/span description pept < 1 > 142 MHC HLA-DC-beta chain precursor (AA at 2) sigp < 1 79 MHC HLA-DC-beta signal peptide matp 80 > 142 MHC HLA-DC-beta BASE COUNT 27 a 40 c 44 g 31 t ORIGIN 1 aggggggatc cccggagacc ttcgggtagc aactgtcacc ttgatgctgg cgatcctgag 61 ctcctcactg gctgagggca gagactctcc cgaggatttc gtgtaccagt ttaagggcct 121 gtgctacttc accaacggga cg // LOCUS HUMMHDRAA 157 bp ss-mRNA PRI 11-AUG-1990 DEFINITION Human MHC class II HLA-DR-alpha chain mRNA, 5' end. ACCESSION M35979 KEYWORDS cell surface glycoprotein; class II gene; integral membrane protein; major histocompatibility complex. SOURCE Human, cDNA to mRNA, clone pII-alpha-4. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. 
REFERENCE 1 (bases 1 to 157) AUTHORS Gustafsson,K., Wiman,K., Larhammar,D., Rask,L. and Peterson,P.A. TITLE Signal sequences distinguish class II histocompatibility antigen beta chains of different loci JOURNAL Scand. J. Immunol. 19, 91-97 (1984) STANDARD simple staff_review FEATURES from to/span description pept 8 > 157 HLA-DR-alpha chain precursor sigp 8 82 HLA-DR-alpha chain signal peptide matp 83 > 157 HLA-DR-alpha chain BASE COUNT 43 a 31 c 40 g 43 t ORIGIN 1 gaagaaaatg gccattagtg gagtccctgt gctaggattt ttcatcatag ctgtgctgat 61 gagcgctcag gaatcatggg ctatcaaaga agaacatgtg atcatccagg ccgagttcta 121 tctgaatcct gaccaatcag gcgagtttat gtttgac // LOCUS HUMMHDRBB 157 bp ss-mRNA PRI 11-AUG-1990 DEFINITION Human MHC class II HLA-DR-beta chain mRNA, 5' end. ACCESSION M35980 KEYWORDS cell surface glycoprotein; class II gene; integral membrane protein; major histocompatibility complex. SOURCE Human, cDNA to mRNA, clone pII-beta-3. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 157) AUTHORS Gustafsson,K., Wiman,K., Larhammar,D., Rask,L. and Peterson,P.A. TITLE Signal sequences distinguish class II histocompatibility antigen beta chains of different loci JOURNAL Scand. J. Immunol. 19, 91-97 (1984) STANDARD simple staff_review FEATURES from to/span description pept 8 > 157 MHC HLA-DR-beta chain precursor sigp 8 94 MHC HLA-DR-beta chain signal peptide matp 95 > 157 MHC HLA-DR-beta chain BASE COUNT 26 a 42 c 46 g 43 t ORIGIN 1 ctccagcatg gtgtgtctga ggctccctgg aggctcctgc atggcagttc tgacagtgac 61 actgatggtg ctgagctccc cactggcttt ggctggggac accagaccac gtttcttgga 121 gtactctacg tctgagtgtc atttcttcaa tgggacg // LOCUS HUMMHDRBC 154 bp ss-mRNA PRI 11-AUG-1990 DEFINITION Human MHC class II HLA-DR-beta chain mRNA, 5' end. ACCESSION M35981 KEYWORDS cell surface glycoprotein; class II gene; integral membrane protein; major histocompatibility complex. 
SOURCE Human, cDNA to mRNA, clone pII-beta-4. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 154) AUTHORS Gustafsson,K., Wiman,K., Larhammar,D., Rask,L. and Peterson,P.A. TITLE Signal sequences distinguish class II histocompatibility antigen beta chains of different loci JOURNAL Scand. J. Immunol. 19, 91-97 (1984) STANDARD simple staff_review FEATURES from to/span description pept 5 > 154 MHC HLA-DR-beta chain precursor sigp 5 91 MHC HLA-DR-beta chain signal peptide matp 92 > 154 MHC HLA-DR-beta chain BASE COUNT 27 a 37 c 49 g 41 t ORIGIN 1 cagcatggtg tgtctgaggc tccctggagg ctcctgcatg gcagttctga cagtgacact 61 gatggtgctg agctccccac tggctttggc tggggacacc agaccacgtt tcttggagga 121 ggttaagttt gagtgtcatt tcttcaacgg gacg // LOCUS MUSIAPA 494 bp ss-mRNA ROD 11-AUG-1990 DEFINITION Mouse intracisternal A particle mRNA, 5' end. ACCESSION M36322 KEYWORDS intracisternal A particle. SOURCE Mouse endogenous A type retrovirus embryonal carcinoma cell line PCC3, cDNA to mRNA, clone EC26. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 494) AUTHORS Morgan,R.A., Christy,R.J. and Huang,R.C.C.
TITLE Murine A type retroviruses promote high levels of gene expression in embryonal carcinoma cells JOURNAL Development 102, 23-30 (1988) STANDARD simple staff_review FEATURES from to/span description mRNA 477 > 494 intracisternal A particle mRNA BASE COUNT 111 a 124 c 125 g 134 t ORIGIN 1 ctgcagctag aaaggctctg cggcatctct gagcctattc tagggagaca tgtcatcttt 61 caagaaggtt gagtgtccaa gtgtccttct ctgcaggcaa aacgacaagg gagcaggtca 121 gggttcctct gggtaaaagc ctgtgagcct aagagcctaa tcctgtacat ggctccttaa 181 cctacacatt ggggatttga cctctatctc cactctcatt aatatgggtg gcctatttgc 241 ctcttattaa aaggaaaggg ggagatgttg ggagccgccc ccacattcgc cgttacaaga 301 tggcgctgac agctgtgttc taagtggtaa acaaataatc tgcgcatgtg ccgagggtgg 361 ttctccactc catgtctctg ccttccccgt gacgtcaact cggccatggc tgcagccaat 421 cagggagtga cacgtcctag gcgaaggata attctcctta atagggacgg gttttcgttt 481 tttctctctc tctt // LOCUS MUSIAPB 492 bp ss-mRNA ROD 11-AUG-1990 DEFINITION Mouse intracisternal A particle mRNA, 5' end. ACCESSION M36323 KEYWORDS intracisternal A particle. SOURCE Mouse endogenous A type retrovirus embryonal carcinoma cell line PCC3, cDNA to mRNA, clone EC32. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 492) AUTHORS Morgan,R.A., Christy,R.J. and Huang,R.C.C.
TITLE Murine A type retroviruses promote high levels of gene expression in embryonal carcinoma cells JOURNAL Development 102, 23-30 (1988) STANDARD simple staff_review FEATURES from to/span description mRNA 475 > 492 intracisternal A particle mRNA BASE COUNT 111 a 118 c 129 g 134 t ORIGIN 1 ctgcagtaga aaggctctgc ggcatattga gcctattcta gggagacatg tcatctttca 61 agaaggttga gtgtccaagt gtccttctct gcaggcaaaa cgacacggga gcaggtcagg 121 gttgcctctg ggtaaaagcc tgtgagccta agagctaatc ctgtaatggc tccttaacct 181 acacattggg gatttgacct cttctccact ctcattaata tgggtggcct atttgcctct 241 tattaaaagg aaagggggag atgttgggag ccgcccccac attcgccgtt acaagatggc 301 gtgacagctg tgttctaagt ggtaaacaaa taatctgcgc atgtgccgag ggtggttctc 361 cactccatgt gctctgcctt cccggtgacg tcaactcggc catggctgca gccaatcagg 421 gagtgacacg tcctaggcga aggataattc tccttaatag ggacgggttt tatcgttttt 481 tctcgctctc tt // LOCUS PT4SB 122 bp ds-DNA PHG 11-AUG-1990 DEFINITION Bacteriophage T4 mutant Ser-tRNA suppressor gene. ACCESSION M35988 KEYWORDS serine transfer-RNA. SOURCE Bacteriophage T4 (strain R24) DNA. ORGANISM Bacteriophage T4 Viridae; ds-DNA nonenveloped viruses; Myoviridae. REFERENCE 1 (bases 1 to 122) AUTHORS McClain,W.H. TITLE Specific duplications fostered by a DNA structure containing adjacent inverted repeat sequences JOURNAL J. Mol. Biol. 204, 27-40 (1988) STANDARD simple staff_review FEATURES from to/span description tRNA < 1 > 122 Ser-tRNA BASE COUNT 25 a 35 c 36 g 26 t ORIGIN 1 ggaggcgtgg cagagtggtt taatgcaccg gtctttaaaa ccggcagtcg ccggagcgac 61 tgccggtctt taaaaccggc agtcgctccg gcgactcata ggttcaaatc ctatcgcctc 121 cg // LOCUS PT4SC 121 bp ds-DNA PHG 11-AUG-1990 DEFINITION Bacteriophage T4 mutant Ser-tRNA suppressor gene. ACCESSION M35989 KEYWORDS serine transfer-RNA. SOURCE Bacteriophage T4 (strain R66) DNA. ORGANISM Bacteriophage T4 Viridae; ds-DNA nonenveloped viruses; Myoviridae. REFERENCE 1 (bases 1 to 121) AUTHORS McClain,W.H. 
TITLE Specific duplications fostered by a DNA structure containing adjacent inverted repeat sequences JOURNAL J. Mol. Biol. 204, 27-40 (1988) STANDARD simple staff_review FEATURES from to/span description tRNA < 1 > 121 Ser-tRNA BASE COUNT 24 a 36 c 34 g 27 t ORIGIN 1 ggaggcgtgg cagagtggtt taatgcaccg gtctttaaaa ccggcagtcg ctccggcgac 61 tccggtcttt aaaaccggca gtcgctccgg cgactcatag gttcaaatcc tatcgcctcc 121 g // LOCUS PT4SEA 122 bp ds-DNA PHG 11-AUG-1990 DEFINITION Bacteriophage T4 mutant Ser-tRNA suppressor gene. ACCESSION M35987 KEYWORDS serine transfer-RNA. SOURCE Bacteriophage T4 (strain R51) DNA. ORGANISM Bacteriophage T4 Viridae; ds-DNA nonenveloped viruses; Myoviridae. REFERENCE 1 (bases 1 to 122) AUTHORS McClain,W.H. TITLE Specific duplications fostered by a DNA structure containing adjacent inverted repeat sequences JOURNAL J. Mol. Biol. 204, 27-40 (1988) STANDARD simple staff_review FEATURES from to/span description tRNA < 1 > 122 tRNA-Ser BASE COUNT 24 a 35 c 36 g 27 t ORIGIN 1 ggaggcgtgg cagagtggtt taatgcaccg gtctttaaaa ccggcagtcg ctccggcgac 61 tgccggtttt aaagaccggc agtcgctccg gcgactcata ggttcaaatc ctatcgcctc 121 cg // LOCUS VFARDNAA 1234 bp ds-DNA PLN 11-AUG-1990 DEFINITION V.faba ribosomal spacer DNA repeat. ACCESSION M36318 KEYWORDS direct repeat; spacer DNA. SOURCE V.faba DNA, clone VER 1. ORGANISM Vicia faba Eukaryota; Plantae; Embryobionta; Magnoliophyta; Magnoliopsida; Rosidae; Rosales; Fabaceae. REFERENCE 1 (bases 1 to 1234) AUTHORS Kato,A., Yakura,K. and Tanifuji,S. TITLE Repeated DNA sequences found in the large spacer of Vicia faba rDNA JOURNAL Biochim. Biophys.
Acta 825, 411-415 (1985) STANDARD simple staff_review FEATURES from to/span description rpt 78 228 direct repeat copy A rpt 1064 1123 direct repeat copy A BASE COUNT 320 a 329 c 304 g 281 t ORIGIN 1 agacaagtcc atgcaaagga ctaggcaaga acgcattgcg taggccatgg tttgcgttgc 61 ttccgcctca tggcccaccg acacgcgaaa agccgaaaaa catgaaggga tcgcccgaat 121 ggtacgaaac cttgcattcc atctcaattt ggtgtcagat taatggatga aaaaattcag 181 accggaactc attgcgtagc ccgacgttgg cttgctttcc gccccttgga gaaaccgcac 241 acgcacgcgt tccggtcgga aattgaaagt agaccgaggg cttgtcgaaa cgctacgaaa 301 ctccttggac actctcaatg tgttatttgg gaagtgcatg caaaaatcca tcgggagatt 361 attcccctaa ggcgcgtttg cctccgctcc cgtcggaaaa accgcacggg ggcccgttcg 421 ggccggaaat tcaaaattgt cgaaacgcta cgaaactttg cagatccctt ctatgtgtta 481 tttgggactt ccatgcaaaa attcatcggg agattattcc cctaaggcgc gtttgcctcc 541 gctcccgttc atggagaaac cgcacacgca cgcgttccgg tcggaaattg aaagtagacc 601 gagggcttgt cgaaacgcta cgaaactcct tggacactct caatgtgtta tttgggaagt 661 gcatgcaaaa atccatcggg agattattcc cctaaggcgc gtttgcctcc gctcccgtcg 721 gaaaaaccgc acgggggccc gttcgggccg gaaattcaaa attgtcgaaa cgctacgaaa 781 ctttgcagat cccttctatg tgttatttgg gacttccatg caaaaattga tcgggagatt 841 attcccctaa ggcgcgtttg cctccgctcc cgttcatgga gaaaccgcac acgcacgcgt 901 tccggtcgga aattgaaagt agaccgaggg cttgtcgaaa cgctacgaaa ctccttggac 961 actctcaatg tgttatttgg gaagtgcatg caaaaatcca tcgggagatt attcccctaa 1021 ggcgcgtttg cctccgctcc cgtcggaaaa accgcacggg ggcccgacac gcgaaaagcc 1081 gaaatatatc gagggatcgc ccgaatggca tgaaacttgg catgccatct caatttggtg 1141 gcagattaat gcatgaaaaa ttccagaccg gaactcattc cgtagcccga cgtttgtttc 1201 cttccgcccc ttgaaaaaaa ctcaattgca cgcg // LOCUS VFARDNAB 149 bp ds-DNA PLN 11-AUG-1990 DEFINITION V.faba ribosomal spacer DNA repeat. ACCESSION M36319 KEYWORDS . SOURCE V.faba DNA, clone VER 6. ORGANISM Vicia faba Eukaryota; Plantae; Embryobionta; Magnoliophyta; Magnoliopsida; Rosidae; Rosales; Fabaceae. REFERENCE 1 (bases 1 to 149) AUTHORS Kato,A., Yakura,K. and Tanifuji,S.
TITLE Repeated DNA sequences found in the large spacer of Vicia faba rDNA JOURNAL Biochim. Biophys. Acta 825, 411-415 (1985) STANDARD simple staff_review BASE COUNT 39 a 40 c 35 g 35 t ORIGIN 1 ccgacacgcg aaaagccgaa atatatcgag ggatcgcccg aatggcatga aacttggcat 61 gccatctcaa tttggtggca gattaatgca tgaaaaattc cgaccggaac tcattccgta 121 gcccgacgtt tgtttgcttc cgccccttg // LOCUS XEBEBNA1 185 bp ds-DNA VRT 11-AUG-1990 DEFINITION X.borealis external transcribed spacer and 18S rRNA gene, 5' end. ACCESSION M36006 KEYWORDS 18S ribosomal RNA. SEGMENT 1 of 2 SOURCE X.borealis DNA, clone pXbr101. ORGANISM Xenopus borealis Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Amphibia; Lissamphibia; Anura; Archeobatrachia; Pipoidea; Pipidae. REFERENCE 1 (bases 1 to 185) AUTHORS Furlong,J.C. and Maden,B.E.H. TITLE Patterns of major divergence between the internal transcribed spacers of ribosomal DNA in Xenopus borealis and Xenopus laevis, and of minimal divergence within ribosomal coding regions JOURNAL EMBO J. 2, 443-448 (1983) STANDARD simple staff_review FEATURES from to/span description rRNA 23 > 185 18S ribosomal RNA site 1 22 external transcribed spacer BASE COUNT 43 a 44 c 46 g 52 t ORIGIN 1 ggttcccccc ggagccgagg gctacctggt tgatcctgcc agtagcatat gcttgtctca 61 aagattaagc catgcacgtg taagtacgca cggccggtac agtgaaactg cgaatggctc 121 attaaatcag ttatggttcc tttgatcgct ccatctgtta cttggataac tgtggtaatt 181 ctaga // LOCUS XEBEBNA2 1589 bp ds-DNA VRT 11-AUG-1990 DEFINITION X.borealis 18S rRNA gene, 3' end, 5.8S rRNA gene, complete, ITS2, and 28S ribosomal RNA gene, 5' end. ACCESSION M36008 M36010 M36012 M36014 M36016 KEYWORDS 18S ribosomal RNA; 28S ribosomal RNA; 5.8S ribosomal RNA. SEGMENT 2 of 2 SOURCE X.borealis DNA, clone pXbr101. ORGANISM Xenopus borealis Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Amphibia; Lissamphibia; Anura; Archeobatrachia; Pipoidea; Pipidae. REFERENCE 1 (bases 1 to 1589) AUTHORS Furlong,J.C. and Maden,B.E.H.
TITLE Patterns of major divergence between the internal transcribed spacers of ribosomal DNA in Xenopus borealis and Xenopus laevis, and of minimal divergence within ribosomal coding regions JOURNAL EMBO J. 2, 443-448 (1983) STANDARD simple staff_review FEATURES from to/span description rRNA < 1 1 18S ribosomal RNA site 401 954 internal transcribed spacer 1 (ITS1) rRNA 955 1116 5.8S ribosomal RNA site 1117 1452 internal transcribed spacer 2 (ITS2) rRNA 1453 > 1589 28S ribosomal RNA BASE COUNT 287 a 560 c 546 g 196 t ORIGIN About 1500 bp after segment 1. 1 gaattcccag taagtgcggg tcataagctc gcgttgatta agtccctgcc ctttgtacac 61 accgcccgtc gctactaccg attggatggt ttagtgaggt cctcggatcg gccccgccgg 121 ggtcggcaac ggccctggcg gagcgccgag aagacgatca aacttgacta tctagaggaa 181 gtaaaagtcg taacaaggtt tccgtaggtg aacctgcgga aggatcatta accgcccgtc 241 gctactaccg attggatggt ttagtgaggt cctcggatcg gccccgccgg ggtcggcaac 301 ggccctggcg gagcgccgag aagacgatca aacttgacta tctagaggaa gtaaaagtcg 361 taacaaggtt tccgtaggtg aacctgcgga aggatcatta acgagagagg gcgagaggct 421 cacggtgcga gagcgcggct ctgccgcacc cccgccaccg tcccggctcc ccccgcggac 481 gacgacaggt ccgggcgcgg gctcccgccc gccccgacac gaggagagag cggaaaagga 541 acaacaacaa aaaaaacgag cccgggtacc tagccggggg ggggcgaggg cggggacggg 601 gggcagaggg ccctccgacc ccccctccct cctcccgggc ggttcgaaga ccccggccca 661 cggccggcgg cggcgacgag gggcgagccg tccggaggcg gggggcgggc gcggaggggg 721 gacggggggg aggcggaggt cacggccgga cgcggccgcg tccctccgcc cgcccccccc 781 ccgtctgtcc gcgccccgcc ccgcgcgccg gtcgcgtcag tccccgccgc cccgtcggcg 841 gcgccccggg gcgcccgggg cccgggggcc cgccccgtcc agcttccacg gcgcccgcgc 901 gcgccggtac ccctgccgcg aggcaacgag agagaaaaac aaacgaaaaa aatctcgcga 961 ctcttagcgg tggatcactc ggctcgtgcg tcgatgaaga acgcagctag ctgcgagaat 1021 tagtgtgaat tgcaggacac attgatcatc gacacttcga acgcaccttg cggccccggg 1081 ttcctcccgg ggctacgcct gtctgagggt cgctccgacg tccatcgccc ccgccgggtc 1141 cccgacccgg ggaaaacggg cggcgcggct ggggccgtcg caggggtcga gggacgcccc 1201 cccaccgccc ttccggggga ggtgaggggg 
ccccgcgacc ccttcgtccc cccaaggcca 1261 gaccccgacc atcccggcgg ccccccccag ggccccggcg cggctgtctg tgggaaaccg 1321 atttcacggc tgccggcgcc gcgggcgctg gggacccgcc ggggcgggaa gagcccccgg 1381 gcgggccggg gccggggaag gcgcgcttga acccgcgccc ccccgcgtcc cccgcccgcc 1441 cccccgctcg actcagacct cagatcagac gtggcgaccc gctgaattta agcatattac 1501 taagcggagg aaaagaaact aaccaggatt cccccagtaa cggcgagtga agagggaaga 1561 gcccagcgcc gaatccccgc ccgcccggc // LOCUS BOVTGFB 1117 bp ss-mRNA MAM 11-AUG-1990 DEFINITION Bovine transforming growth factor-beta-1 (TGF beta-1) mRNA, 3' end. ACCESSION M36271 KEYWORDS transforming growth factor-beta-1. SOURCE Bovine fibropapilloma, cDNA to mRNA, (library of Okayama and Berg). ORGANISM Bos taurus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Artiodactyla; Ruminantia; Pecora; Bovidae. REFERENCE 1 (bases 1 to 1117) AUTHORS Van Obberghen-Schilling,E., Kondaiah,P., Ludwig,R.L., Sporn,M.B. and Baker,C.C. TITLE Complementary deoxyribonucleic acid cloning of bovine transforming growth factor-beta-1 JOURNAL Mol. Endocrinol. 
1, 693-698 (1987) STANDARD simple staff_review FEATURES from to/span description pept < 1 950 transforming growth factor-beta-1 precursor (AA 75 at 3) matp 612 947 transforming growth factor-beta-1 BASE COUNT 244 a 379 c 301 g 193 t ORIGIN 1 aggccatact ggccctttac aacagtaccc gcgaccgggt ggccggggaa agtgccgaaa 61 cggagcctga gccagaggcg gactactacg ccaaggaggt cacccgcgtg ctaatggtgg 121 aatacggcaa caaaatctat gacaaaatga agtctagctc gcacagcata tatatgttct 181 tcaacacgtc cgagctccgg gaagcggtgc ccgaacctgt gttgctctct cgggcagacg 241 tgcgcctgct gaggctcaag ttaaaagtgg agcagcacgt ggagctgtac cagaaatata 301 gcaacaattc ctggcgctac ctcagcaacc ggctgctcgc ccccagcgac tcaccggagt 361 ggctgtcctt tgacgtcact ggagttgtgc ggcagtggct gacccgcaga gaggaaatag 421 agggctttcg cctcagtgcc cactgttcct gtgacagtaa agataacacg cttcaagtgg 481 acattaacgg gttcagttcc ggccgccggg gtgacctcgc caccattcac ggcatgaacc 541 ggcccttcct gctcctcatg gccacccctc tggagagggc ccagcacctg cacagctccc 601 gccaccgccg agccctggac accaactact gcttcagctc cacagaaaag aactgctgtg 661 ttcgtcagct ctacattgac ttccggaagg acctgggctg gaagtggatt catgaaccca 721 aggggtacca cgccaatttc tgcctggggc cctgccctta catctggagc ctggatacac 781 agtacagcaa ggtcctggcc ctgtacaacc agcacaaccc gggcgcttcg gcggcgccgt 841 gctgcgtgcc tcaggcgctg gagcccctgc ccatcgtgta ctacgtgggc cgcaagccca 901 aggtggagca gttgtccaac atgatcgtgc gctcctgcaa gtgcagctga ggccccgtcc 961 caccccaaca gcccccgccc cgtagccccg ccacccggca gcccggcccc gcccccgccc 1021 gcctcaccag gactgtattt aaggacacag caccaccccc ccccccccac tcccatcaag 1081 cccacctggg gtccattaaa ggtggcgaga ggaaaaa // LOCUS FSBPROLA 1148 bp ss-mRNA VRT 11-AUG-1990 DEFINITION O.keta (chum salmon) prolactin mRNA, complete cds. ACCESSION M36267 KEYWORDS prolactin. SOURCE O.keta pituitary gland, cDNA to mRNA. ORGANISM Oncorhynchus keta Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Osteichthyes; Actinopterygii; Salmoniformes; Salmonoidei; Salminidae. REFERENCE 1 (bases 1 to 1148) AUTHORS Song,S., Trinh,K.-Y., Hew,C.L., Hwang,S.-J., Belkhode,S. and Idler,D.R. 
TITLE Molecular cloning and expression of salmon prolactin cDNA JOURNAL Eur. J. Biochem. 172, 279-285 (1988) STANDARD simple staff_review FEATURES from to/span description pept 19 654 prolactin precursor sigp 19 87 prolactin signal peptide matp 88 651 prolactin mRNA 1 1148 prolactin mRNA BASE COUNT 319 a 284 c 239 g 306 t ORIGIN 1 gtccgaaaag aaggaaagat ggctcgccga tcccagggta ccaaactcca cttagcagtt 61 ctgtgtctag ttgtgtcctg tcatgccatt ggccttagtg acctaatgga gagagcttcc 121 cagcgatcag acaagcttca ctcactcagc acttccctca ccaaggacct ggactctcac 181 ttcccaccaa tgggacgagt gatgatgcca cgcccgtcta tgtgtcacac ctcctcgctc 241 cagacaccca aggacaagga gcaagcactc aaggtatcgg agaatgagct gatctccctg 301 gctcgctacc tcctcctggc ctggaacgat cccctgctgc tgctctcctc agaggcgccg 361 actctgcccc acacgccgtc caatggcgac atcagcagta agatcaggga actgcaggac 421 tactccaaga gcctgggaga tggactggac ataatggtca acaagatggg accctcctcc 481 cagtacattt cttcaatccc cttcaagggt ggagacctcg gcaatgacaa gacctcccgc 541 ctcatcaact tccacttcct catgtcttgc ttccgcaggg actcccacaa aatcgacagt 601 ttcctcaagg tccttaggtg ccgggctaca aatatgcgac cagaaacatg ttaggagaaa 661 atggcaggat tttggttctg gattgttcca ttttcaaact gatagtcaaa atggggtagg 721 aattcaagag agtgattgtc ttgttgtagt ttgattttgt gaaataacta atgctgccac 781 ctacatcaca tttggactat tcatagatct atacagttag tagtcatcct ggttatctga 841 taccgacatg tttacctata catatcttta taggtagttt atgttcggta atgcattatg 901 catcagaaaa atgatgcaac ctaagtattg gtcacttata tgatcatatt caatagattc 961 atatatagat aacaaaggat cctcctagtc attcttagta gatatacagg atggctgata 1021 ggtagatgcc atgaaccgtc atgattaaca atctagaact tgaatatagg taactcagcc 1081 tttgtaaact attgataatt gcttccaaga gttctgttta cacaataaat caagtaagat 1141 tttgcaat // LOCUS PETIRAB 183 bp ds-DNA ORG 11-AUG-1990 DEFINITION P.hybrida chloroplast rps19' and rp12 genes, partial cds. ACCESSION M35955 KEYWORDS inverted repeat; rp12; rps19'. SOURCE Petunia hybrida chloroplast DNA. 
ORGANISM Chloroplast Petunia hybrida Eukaryota; Plantae; Embryobionta; Magnoliophyta; Magnoliopsida; Asteridae; Solanales; Solanaceae; Petunia hybrida. REFERENCE 1 (bases 1 to 183) AUTHORS Aldrich,J., Cherney,B.W., Williams,C. and Merlin,E. TITLE Sequence analysis of the junction of the large single copy region and the large inverted repeat in the petunia chloroplast genome JOURNAL Curr. Genet. 14, 487-492 (1988) STANDARD simple staff_review FEATURES from to/span description pept > 183 143 (c) rp12 gene product (AA at 181) pept 91 < 1 (c) rps19' gene product (gtg start codon) BASE COUNT 58 a 28 c 26 g 71 t ORIGIN 1 cgcccatacc ataattcaga attcatcaaa tattagataa atttaataaa tgattggcta 61 caaaggggtt tttttttagt gaacgtgtca caagcttact cctatttttt ttttttttgt 121 aaagtttttg taaaacgaag aattaaattc gatttcctct cctatttact atggcgacga 181 aga // LOCUS PETIRABX 162 bp ds-DNA ORG 11-AUG-1990 DEFINITION P.hybrida chloroplast rps19 and rp12 genes, partial cds. ACCESSION M37322 KEYWORDS inverted repeat; rp12; rps19. SOURCE Petunia hybrida chloroplast DNA. ORGANISM Chloroplast Petunia hybrida Eukaryota; Plantae; Embryobionta; Magnoliophyta; Magnoliopsida; Asteridae; Solanales; Solanaceae; Petunia hybrida. REFERENCE 1 (bases 1 to 162) AUTHORS Aldrich,J., Cherney,B.W., Williams,C. and Merlin,E. TITLE Sequence analysis of the junction of the large single copy region and the large inverted repeat in the petunia chloroplast genome JOURNAL Curr. Genet. 14, 487-492 (1988) STANDARD simple staff_review FEATURES from to/span description pept < 1 41 rp12 gene product (AA at 3) pept 92 > 162 rps 19 gene product (gtg start codon) BASE COUNT 70 a 26 c 21 g 45 t ORIGIN 1 tcttcgtcgc catagtaaat aggagaggaa atcgaattta attcttcgtt ttacaaaaac 61 ttacaaaaaa aaaaaaaata ggagtaagct tgtgacacgt tcactaaaaa aaaacccctt 121 tgtagccaat catttattaa ataaaattga taagcttaac ac // LOCUS RATATROPO 1633 bp ds-DNA ROD 11-AUG-1990 DEFINITION Rat alpha-tropomyosin and N5 genes, 5' flank. ACCESSION J05467 KEYWORDS alpha-tropomyosin. 
SOURCE Rat (Sprague-Dawley) adult liver DNA. ORGANISM Rattus norvegicus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 1633) AUTHORS Herrera,V.L.M. and Ruiz-Opazo,N. TITLE Regulation of alpha-tropomyosin and N5 genes by a shared enhancer JOURNAL J. Biol. Chem. 265, 9555-9562 (1990) STANDARD full staff_review COMMENT Draft entry and printed sequence for [1] kindly submitted by N.Ruiz-Opazo, 04-JUN-1990. FEATURES from to/span description signal 569 772 enhancer BASE COUNT 424 a 509 c 405 g 294 t 1 others ORIGIN 1 bp upstream of HindIII site. 1 aagcttaaag agatgggggg ggggggtgtc ctcctgcctt cagtgcagac ctggctggcc 61 cctttccctc tgcagagttc accctacgga ttgaaccatt tccgtagcta atttgtagca 121 tgttttcagc accagcaccc aagtcgctaa cagagaaaag aaggacnccc accccccacc 181 cccaatagat ccgagcaagg aaaaaaatga aaagctagat ctgaagggat tttaggaatc 241 cactaccctc tgaatttaga tatgaggaaa cctgaactaa caagatggat ggctcagcag 301 gtaacggagc ttgctatcaa gctcaaagac ctgtgcacgg atcgaacacc cacatggtag 361 aaacaactga ttcttaaaga gttgctctct gaccgccaca gcaccaccac gccacgccca 421 cgcccctata agtaaacgac taaagtaagg gaagctacag catggaagct gtttctaatc 481 acagctacac aatgtagctg ttagctagct aatgactgaa gaccagaccc ctcccactcc 541 ccacgatgtt aactgagtct aaactcaaca cacacacaca cacacacaca cacacacaca 601 cacacacctt ggcacactcc ttctaccctt tccaaagcac agcacaggca cataactcag 661 caaaccacac acacacacac acacacacac acacacacac acacacacac acaccttggc 721 acactccttc taccctttcc aaagcacagc acaggcacat aactcagcaa accgggccct 781 aagttcctaa tgtattttac tgcccacctc cagcttgggt gaaacgattg ggagacttcc 841 tgtgtgtatc attatgccca ccatcacggg cagagagaca gggcctcagg aatggccccc 901 acaccttcct ccacctctat ttctctgcgt gggttgaaat ttccagactg tgagagccca 961 ggactatgac tggtgggtgg actcagagct gctagaagga cgcagctagg tactgagaac 1021 tcccaggccc agatcgaagg tgcggtggct gtgactggca gcccccggac cttacagtga 1081 gcaccccagg acccggaggc caagtgagat ggtgcgcaca caccacctgc cctgttcccg 1141 ggattccctg tgactcccga 
cagactcttc tggggaccct gaccgccagg cgacggctct 1201 gggaggtggc tagagactcc tcgccagtac cgggctcctc tccagcgagg tccgggagct 1261 ctttcccagg cgtgcaggtt tagaaattaa ggggagcctc gcccggccgc ggtctcggag 1321 gctcgggtcc ccatatatag tcatatccac cgtcgactgg gagcggcagg cggcagcgaa 1381 tggcgtgagg ccccgcggga ggagcgagga gggggcatgg ggcggaggca ggagactggg 1441 aaagggggaa ggaggaacaa gcttttccaa aaaagtattg gctgacctaa ggaatgcggt 1501 cgccccctgg gaaagtacat atcagggagc agcaggcacg tccgcgctcg caccttccgg 1561 cctctgccac cctaccgcgc tcgcgtctcc tctggctcct ctcgctaagg gcccctcgcc 1621 accgccacca tgg // LOCUS STMXYLI 1201 bp ds-DNA BCT 11-AUG-1990 DEFINITION S.violaceoniger xylose isomerase gene, complete cds. ACCESSION M36269 KEYWORDS glucose isomerase; xylose isomerase. SOURCE S.violaceoniger DNA. ORGANISM Streptomyces violaceoniger Prokaryota; Bacteria; Firmicutes; Streptomycetaceae. REFERENCE 1 (bases 1 to 1201) AUTHORS Drocourt,D., Bejar,S., Calmels,T., Reynes,J.P. and Tiraby,G. TITLE Nucleotide sequence of the xylose isomerase gene from Streptomyces violaceoniger JOURNAL Nucleic Acids Res. 
16, 9337-9337 (1988) STANDARD simple staff_review FEATURES from to/span description pept 24 1193 xylose isomerase BASE COUNT 187 a 455 c 402 g 157 t ORIGIN 1 cgtacggaac aaggagcagc gccatgagct tccagcccac ccccgaggac aagttcacct 61 tcggtctgtg gaccgtcggc tggcagggaa gggacccgtt cggcgacgcc acccgccctg 121 ccctcgaccc ggtcgagacc gtgcagcgcc tggccgagct gggcgcctac ggagtgacct 181 tccacgacga cgacctgatc cccttcgggt cgtccgacac cgagcgcgag tcgcacatca 241 agcggttccg ccaggccctg gacgccactg gcatgacggt gccgatggcc accacgaacc 301 tcttcaccca ccccgtcttc aaggacggcg gtttcaccgc caacgaccgc gacgtgcgcc 361 gctacgcgct gcgcaagacg atccgcaaca tcgacctggc ggccgagctg ggcgccaaga 421 cgtacgtcgc ctggggcggc cgtgagggcg ccgagtccgg tggcgccaag gacgtgcgcg 481 acgccctcga ccgcatgaag gaggcgttcg acctcctcgg cgagtacgtc accgcccagg 541 gctacgacct ccgcttcgcc atcgagccca agcccaacga gccccgcggc gacatcctcc 601 tgcccaccgt cggccacgcc ctggccttca tcgagcgcct ggagcgcccg gagctgtacg 661 gcgtcaaccc ggaggtcggc cacgagcaga tggccggcct gaacttcccg cacggcatcg 721 cgcaggccct gtgggcgggc aagctcttcc acatcgacct caacggccag tccggcatca 781 agtacgacca ggacctgcgg ttcggcgccg gcgacctgcg ggcggcgttc tggctggtcg 841 acctcctgga gagcgccggt tacgagggcc cgcggcactt cgacttcaag ccgccgcgga 901 ccgaggactt cgacggcgtg tgggcctcgg ccgagggctg catgcgcaac tacctgatcc 961 tcaaggagcg cgcggccgcc ttccgcgccg acccggaggt gcaggaggcc ctgcgcgccg 1021 cgcgtctgga ccagctggcc cagccgaccg cggcggacgg cctggaggcc ctgctcgccg 1081 accgcaccgc gttcgaggac ttcgacgtgg aggcggccgc cgcgcgcgcg gcatggccgt 1141 tcgaacgcct cgaccagctg gcgatggacc acctgctggg cgcgcgcggc tgacggtcgc 1201 g // LOCUS BNAALS 3315 bp ds-DNA PLN 11-AUG-1990 DEFINITION B.napus chloroplast acetolactate synthase (ALS) gene, complete cds. ACCESSION M20498 KEYWORDS acetolactate synthase. SOURCE B.napus (cv. Westar) 3 week old leaf DNA. ORGANISM Brassica napus Eukaryota; Plantae; Embryobionta; Magnoliophyta; Magnoliopsida; Dilleniidae; Capparales; Brassicaceae. 
REFERENCE 1 (bases 1 to 3315) AUTHORS Wiersma,P.A., Schmiemann,M.G., Condie,J.A., Crosby,W.L. and Moloney,M.M. TITLE Isolation, expression and phylogenetic inheritance of an acetolactate synthase gene from Brassica napus JOURNAL Mol. Gen. Genet. 219, 413-420 (1989) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by W.L.Crosby, 06-DEC-1988, for release after publication. FEATURES from to/span description pept 784 2697 acetolactate synthase (ALS) precursor (EC 4.1.3.18) sigp 784 1011 acetolactate synthase (ALS) signal peptide matp 1012 2694 acetolactate synthase (ALS) signal 2111 2116 transcriptional stop sequence (put.) signal 446 459 promoter element (put.) signal 665 668 CAAT box signal 705 709 TATA box signal 2894 2899 polyA signal BASE COUNT 954 a 712 c 702 g 947 t ORIGIN 1 tctagatata aaagtatttt attacttaaa aagtatattt ttttgttaat taaagatatc 61 ttttagatgg actattacta ttttgtgaac tttctttttt ttctgaaatc atattatata 121 tacatatttt tttgttttta aattattttt taaaacttta taaactttcc ttttttattt 181 atattgttat ttgaaaaatt ttaagaaagt aaagaataat atttaaatgt acttttttta 241 tttcattagg gatatcatag taatcaacca ccgtgagagt taacgtgagt gcgacaccta 301 aaaaagtgac ttctcaaata atattatgga gatgtatcta caagaaaaat aaacattgtt 361 gtgtaaattt tgaataaatg actgagggca tccacatggt aaaatttact aagagtatct 421 gaacaaaaat attaaaataa aaataaaaag aaagaaagag aaatcagaaa acaagagaag 481 ccattggtta agaagagact cacacattct tttgacagag tttttctacg gatgacattt 541 attgaatgag acaacaaatt ttacagcatt aagttttatt catctaatta acctattaat 601 aattttatgt taagagacat ggataaagaa acttattgct gcagacgctc tgagatgggg 661 ttggcaatat tagttacgag ccaacttgaa gtgtcacatg catgtataaa ttttagttct 721 ctctcacaga gtttatttac tcattcaaat taagcaattt ctcgcaacac tccattttgc 781 accatggctt cgttttcgtt cttcggcacc attccgtcgt ctcccacaaa agcttccgtc 841 ttctccctgc cggtgtcggt aactacgctc ccgtccttcc cgcgccgccg tgctactcgt 901 gtctccgttt ccgccaactc gaagaaagac caagaccgca cagcttcacg tcgagagaat 961 ccgagcacat tcagctccaa atacgctccc 
aacgtgcccc gcagtggcgc agacatcctg 1021 gtcgaagccc tggagcgtca aggagtggac gtagtcttcg cttacccagg aggcgcatca 1081 atggagatcc atcaagccct aactcgctcc aacacaatcc gaaacgtcct tccccgtcac 1141 gaacaaggag gtatcttcgc cgccgagggt tacgctcgtt cctccggtaa acccggaatc 1201 tgcatcgcca cttccggtcc aggagctatg aatctcgtca gcggattagc cgacgccctg 1261 tttgacagcg tacccctcat cgcaatcaca ggacaggtcc ctcgccggat gattggtacc 1321 atggcgttcc aggagacacc cgttgttgag gtaacgagga ctataacgaa acataactat 1381 cttgttatgg aagttgatga tatacctagg atcgttcgag aagctttctt tctagctact 1441 tcggttagac cgggaccggt tcttatagac gtccccaaag atgttcagca acagtttgcg 1501 attcctaact gggaacagcc tatgcgctta cctctttaca tgtctacgat gcctaaaccc 1561 cccaaagttt ctcacttaga gcagattctt aggttggttt cggagtctaa gaggcctgtc 1621 ttgtacgttg gaggtggttg tctgaactcg agtgaggaac tgcgcagatt tgtggaactt 1681 actggcatcc ctgttgctag tacgttcatg ggacttggat cgtatccttg tgacgatgaa 1741 gagttctctc tgcaaatgct aggaatgcat ggaacagtgt acgctaatta cgctgtcgag 1801 tatagcgatc ttctgcttgc ttttggggtt aggtttgacg accgtgtgac cggaaagctt 1861 gaggcctttg ctagccgggc caagatcgtg catattgata ttgattctac cgaaatcggg 1921 aagaacaaga cacctcatgt gtcggtgtgt tgtgatgttc agctagcctt gcaagggatg 1981 aacgaggttc ttgagaaccg acgagatgtg cttgacttcg gggaatggag atgtgaattg 2041 aacgaacaga gactaaagtt ccctctccgc tacaagacgt ttggggaaga gattcctcca 2101 cagtacgcca ttcaactact tgacgagcta accgacggga aggcaattat cactactggt 2161 gtcgggcaac accagatgtg ggccgcccaa ttctacagat tcaagaaacc ccgccaatgg 2221 ctgtcttcag gaggcctagg agccatgggg ttcggtcttc ctgcagccat gggagccgct 2281 atagccaacc cgggagcagt ggttgtcgac attgatgggg atggtagctt catcatgaac 2341 attcaagaac tggcaaccat cagggttgag aatcttccag tcaaggtttt gctgattaat 2401 aatcagcacc tcggaatggt ccttcagtgg gaagaccact tctacgcagc taacagagcc 2461 gattcttttc tgggagaccc ggcgaaccct gaggcggtat tcccggatat gctgttgttc 2521 gccgcatcgt gcggtatacc agccgccagg gtcaccagaa gggaggacct ccgagaggca 2581 atccagacaa tgctggacac acctggacca ttcttgttgg atgtggtctg tcctcaccag 2641 gaccatgtgt taccactcat ccctagtggc ggcaccttca 
aggacattat tgtgtagtac 2701 tactttaatg tttgattatt ctatgctctc ttctgttttg taataacaaa gtaaaaccag 2761 tttgcaatat cagaagatct tctgattatc aatatcatgc tactctcata atgaaaaacg 2821 caaacttcac tctttattta tatgagaata ataactgtag tgtttacata acaatttata 2881 agtatgaaag actaataaaa aaaaagttat atacttgata aaaaggcaga agtaagttta 2941 caaatgtagt aggccatcat aacgatctga taaggaattt ttgtataggg actaatacct 3001 ctatgaagga gaacatgtat cgcaacgagt tgacatatgt gtgaacctca gatggactcg 3061 atgttaagtt ctacgttcct gcatgaaaaa aaaaaaacat acaatttttt tcaagcatgg 3121 gtaaccttat tgaatacgaa caatccagac gaaaatgggg caaaactacc ttgagatctg 3181 cacctgaaca gaaaactcca ggaaccagac ctgtgatcat cacaactcta aaagctgaac 3241 tatcctgctg tatggtctca aatgtgttct gcatctcctt atcaatagca tttttcggct 3301 actggcctgt ctaga // LOCUS HUM5AR 2102 bp ss-mRNA PRI 11-AUG-1990 DEFINITION Human steroid 5-alpha-reductase mRNA, complete cds. ACCESSION M32313 KEYWORDS dihydrotestosterone; steroid 5-alpha-reductase. SOURCE Human adult prostate, cDNA to mRNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 2102) AUTHORS Andersson,S. and Russell,D.W. TITLE Structural and biochemical properties of cloned and expressed human and rat steroid 5-alpha-reductases JOURNAL Proc. Natl. Acad. Sci. U.S.A. 87, 3640-3644 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence [1] kindly submitted by D.W. Russell, 23-FEB-1990, for release after publication. 
FEATURES from to/span description pept 31 810 steroid 5-alpha-reductase (EC 1.3.99.5) mRNA < 1 2102 steroid 5-alpha-reductase mRNA BASE COUNT 470 a 482 c 486 g 664 t ORIGIN 1 gggcatggag cacgctgccc agccctggcg atggcaacgg cgacgggggt ggcggaggag 61 cgcctgctgg ccgcgctcgc ctacctgcag tgcgccgtgg gctgcgcggt cttcgcgcgg 121 aatcgtcaga cgaactcagt gtacggccgc cacgcgctgc ccagccacag gctccgagtg 181 ccggcgcggg ccgcctgggt ggtgcaggag ctgccctcgc tggccctgcc gctctaccag 241 tacgccagcg agtccgcccc gcgtctccgc agcgcgccca actgcatcct cctggccatg 301 ttcctcgtcc actacgggca tcggtgctta atttacccgt ttctgatgcg aggaggaaag 361 cctatgccac tgttggcatg tacaatggcg attatgttct gtacctgtaa cggctatttg 421 caaagcagat acttgagcca ttgtgcagtg tatgctgatg actgggtaac agatccccgt 481 tttctaatag gttttggctt gtggttaaca ggcatgttga taaacatcca ttcagatcat 541 atcctaagga atctcagaaa accaggagat actggataca aaataccaag gggaggctta 601 tttgaatacg taactgcagc caactatttt ggagaaatca tggagtggtg tggctatgcc 661 ctggccagct ggtctgtcca aggcgcggct tttgctttct tcacgttttg ttttttatct 721 ggtagagcaa aagagcatca tgagtggtac ctccggaaat ttgaagagta tccaaagttc 781 agaaaaatta taattccatt tttgttttaa gtgcgttttt catgaaatta tcttcaactt 841 gaagctttcc aatggcgctt ctctatggac tttgtaaata agttatatct ttgtaatttt 901 cctgctactt tatcattttc aagatgtcct ctaggaattt tttttctagt aattttgcaa 961 tctacctaat aagtacctaa atacgctgaa atggaggttg aatatcctac tgtgtaacag 1021 gtcagaattt caagctctgg gtaataactg ctgatatttt ttctaatttc aaatttacct 1081 cttttggcta tgtcttgcca agtgtgtatg agactagact ttacaactgt ctttgatggc 1141 attttcagaa caataaatgt cacaatccct tctatagccc cctacagtga tctcttcaag 1201 gtcaactgca gtgttgcttc cctcccccta tagggctgga atctgtctag gagccctctc 1261 tcggaggcca cagaggctgg gggtagccat tgtgcagtca tggcccgggg gaaacttgcc 1321 aaccttcgtg tcaggtgctg tgtgtaagtg gagaacttgg ggatagagga ggaagctcct 1381 cgtggccctt ccaaggtgag gcaaaggcat ctggacttgt tccagcccag cccaccgggt 1441 gacatcaccg ggcagggagg ggtgctggtg gtggttcata cggagtaagc tgctctgcct 1501 gtgtgagtgg ctcctgggcc ctaaacaggc acctttaggc catgggtcac tcaccgtgag 
1561 ccatcaatgt gctctggtct gacatggttt ctctctgtct tctagtctag acctagtttt 1621 tttgttctgt tccccacgta tggatatagt agagattgtt gtctgtgaaa tttctctttt 1681 gtagattttg agttttccct tgtagtgtaa agaatgatca ctttctgtaa caataacaag 1741 accacttttt aagatttatc ctgtttgttc tttgttgatt gaaacataat aattgttaaa 1801 attctctaca gccttctttt tcttccatag ctaatcttcc ttctaatagt ttttgctttc 1861 tgttttgctg ttgttgcttt gcaaagcttt cccctcatag cctgtacctg ttatcaatat 1921 aaaataatct tcctgttgaa tgcttcatga cttgaattct actttgataa aaacattgcc 1981 atactgcttt ttatcttgat gaattcatct ggcattgctt tgccttatca tctcatctgg 2041 agtttttaaa tgccatttgt ttcagttgtc tttaacaaca taataaatag actttgccat 2101 tt // LOCUS SPIBADH 1797 bp ss-mRNA PLN 11-AUG-1990 DEFINITION Spinach betaine-aldehyde dehydrogenase (BADH) mRNA, complete cds. ACCESSION M31480 KEYWORDS betaine-aldehyde dehydrogenase. SOURCE Spinach salt-stressed leaf, cDNA to mRNA. ORGANISM Spinacia oleracea Eukaryota; Plantae; Embryobionta; Magnoliophyta; Magnoliopsida; Caryophyllidae; Caryophyllales; Chenopodiaceae. REFERENCE 1 (bases 1 to 1797) AUTHORS Weretilnyk,E.A. and Hanson,A.D. TITLE Molecular cloning of a plant betaine-aldehyde dehydrogenase, an enzyme implicated in adaptation to salinity and drought JOURNAL Proc. Natl. Acad. Sci. U.S.A. 87, 2745-2749 (1990) STANDARD full staff_review COMMENT Draft entry and printed sequence for [1] kindly submitted by A.D.Hanson, 27-MAR-1990, for release after publication. 
FEATURES from to/span description pept 68 1561 betaine-aldehyde dehydrogenase (BADH) (EC 1.2.1.8) BASE COUNT 504 a 324 c 443 g 526 t ORIGIN 1 cgttgcgtgc tcgccttacc ctctcaactc aatttcttca acccaatttc ttcgcattta 61 accaagaatg gcgttcccaa ttcctgctcg tcagctattc atcgacggag agtggagaga 121 acccattaaa aaaaatcgca tacccgtcat caatccgtcc actgaagaaa tcatcggtga 181 tattccggca gccacggctg aagatgtgga ggttgcggtg gtggcagctc gaagagcctt 241 taggaggaac aattggtcag caacatctgg ggctcatcgt gccacatact tgcgtgctat 301 tgctgctaag ataacagaaa aaaaagatca tttcgttaaa ctggaaacca ttgattctgg 361 gaaacctttt gatgaagcag tgctggacat tgatgacgtt gcttcatgtt ttgaatattt 421 tgccggacaa gcagaagctc ttgatggtaa acaaaaggct ccagtcaccc tgcctatgga 481 aaggttcaaa agtcatgttc tcaggcagcc ccttggtgtt gttggattaa tatccccatg 541 gaattaccca cttctaatgg ctacatggaa aattgctcca gcacttgctg ctgggtgtac 601 agctgtactt aagccatccg agttggcatc tgtgacttgt ctagaattcg gtgaagtttg 661 caacgaagtg ggacttcctc caggcgtgtt gaatatcttg acaggattag gtccagatgc 721 tggtgcacca ttagtatcac accccgatgt tgacaagatt gcctttactg ggagtagtgc 781 cactggaagc aaggttatgg cttctgctgc ccaattggtt aagcctgtta cattagaact 841 tgggggtaaa agtcctattg tagtgtttga agatgttgat attgataaag ttgtggaatg 901 gactattttt ggctgtttct ggacaaatgg tcaaatatgt agtgcaacgt ctagactgct 961 tgtgcatgaa agtattgcag ctgagtttgt tgataagctt gtaaaatgga cgaaaaacat 1021 taaaatttct gacccatttg aagaaggatg ccggcttggc cctgttatta gtaaaggaca 1081 gtacgacaaa attatgaagt tcatatcaac agcaaagagt gagggggcaa ctattttgta 1141 tggaggttcc cgtcctgagc atttgaagaa aggttattac attgaaccca ccattgtaac 1201 tgatatctcc acatccatgc aaatatggaa agaggaagtt tttggccctg tcttgtgtgt 1261 taaaacattt agttccgaag atgaagccat tgcattggca aatgatacag agtacggttt 1321 agctgctgct gtgttttcta atgatcttga aagatgtgag aggataacga aggctctaga 1381 agttggagct gtttgggtta attgctcaca accatgcttt gttcaagctc cttggggagg 1441 catcaagcgt agtggttttg gacgtgaact tggagaatgg ggtatccaga attacttgaa 1501 tatcaagcag gtgactcaag atatttctga tgaaccatgg ggatggtaca agtctccttg 1561 aagctatgat caaatttgaa 
tgacggtgtt gtttttgtta agtgagcagc ggttggactg 1621 taccttgaaa tggttcgcag agaaggtcga gtttacagta aaaatggatt gaataaaggg 1681 ttggttgatg cagaagtcca acaagcatag cttaattttg ttgtatcatg taatagtgtg 1741 tattatttca gacaattgag ttggatcaga aataagatga tacagagttt gacaatg // LOCUS ECORTSA 1407 bp ds-DNA BCT 11-AUG-1990 DEFINITION E.coli rts gene, complete cds. ACCESSION M36321 KEYWORDS rts gene. SOURCE E.coli DNA. ORGANISM Escherichia coli Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Facultatively anaerobic rods; Enterobacteriaceae. REFERENCE 1 (bases 1 to 1407) AUTHORS Flamm,J.A., Friesen,J.D. and Otsuka,A.J. TITLE The nucleotide sequence of the Escherichia coli rts gene JOURNAL Gene 74, 555-558 (1988) STANDARD simple staff_review FEATURES from to/span description pept 294 1304 rts gene product (gtg start codon) BASE COUNT 383 a 316 c 298 g 410 t ORIGIN 1 gcgcgcattc tatggagaca tgcgagttca tgcaactaaa aaattgcata atttgtttta 61 ttggtcacat tttatgcgac acgatgaaga aacagccgaa cggagggcag attagcacac 121 tttttcaaca tcattgtgct caacaatgcg ctcctgctaa accataattc tttttatcag 181 atggaatatc tgtcacattg cttttcaacg atagcttcct ggcagagatt ttttcttatt 241 attcctcccc atctggtgtt accctcctgc ccattaaccc attcaacaga actgtgacgc 301 gccatggcaa atatcgcttt gccgatagag ctatgaccgc cagaaacatg cttatgagta 361 taaaagagca aacgttaatg acgccttacc tacagtttga ccgcaaccag tgggcagctc 421 tgcgtgattc cgtacctatg acgttatcgg aagatgagat cgcccgtctc aaaggtatta 481 atgaagatct ctcgttagaa gaagttgccg agatctattt acctttgtca cgtttgctga 541 acttctatat aagctcgaat ctgcgccgtc aggcagttct ggaacagttt cttggtacca 601 acgggcaacg cattccttac attatcagta ttgctggcag tgtcgcggtg gggaaaagta 661 caaccgcccg tgtattgcag gcgctattaa gccgttggcc ggaacatcgt cgtgttgaac 721 tgatcactac agatggcttc cttcacccta atcaggttct gaaagaacgt ggtctgatga 781 agaagaaagg cttcccggaa tcgtatgata tgcatcgcct ggtgaagttt gtttccgatc 841 tcaaatccgg cgtgccaaac gttacagcac ctgtttactc acatcttatt tatgatgtga 901 tcccggatgg agataaaacg gttgttcagc ctgatatttt aattcttgaa gggttaaatg 961 tcttacagag cgggatggat 
tatccacacg atccacatca tgtatttgtt tctgattttg 1021 tcgatttttc gatatatgtt gatgcaccgg aagacttact tcagacatgg tatatcaacc 1081 gttttctgaa attccgcgaa ggggctttta ccgacccgga ttcctatttt cataactacg 1141 cgaaattaac taaagaagaa gcgattaaga ctgccatgac attgtggaaa gagatcaact 1201 ggctgaactt aaagcaaaat attctaccta ctcgtgagcg cgccagttta atcctgacga 1261 aaagtgctaa tcatgcggta gaagaggtca gactacgcaa ataatttgca ggggagcgaa 1321 tactcccctt tcttattttt ctgcactacg cagggatatt tcaccgccca tccagggttt 1381 tattattcca tcctgctcaa gtaataa // LOCUS MUSBCLQ 1056 bp ss-mRNA ROD 11-AUG-1990 DEFINITION Mouse complement component subunit Clq B-chain mRNA, complete cds. ACCESSION M36293 KEYWORDS complement component C1q. SOURCE Mouse Englebreth-Holm-Swarm tumor, cDNA to mRNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 1056) AUTHORS Wood,L., Pulaski,S. and Vogeli,G. TITLE cDNA clones coding for the complete murine B chain of complement Clq: Nucleotide and derived amino acid sequences JOURNAL Immunol. Lett. 
17, 59-62 (1988) STANDARD simple staff_review FEATURES from to/span description pept 157 918 complement component subunit Clq B-chain precursor sigp 157 231 complement component subunit Clq B-chain signal peptide matp 232 915 complement component subunit Clq B-chain BASE COUNT 236 a 320 c 285 g 215 t ORIGIN 1 acagcaaggg aaggttcttc cttcctctag ggacccagac ttccgctttc tgaggacagc 61 agcaggctct gggctctggg aatccactgc tgtccggcct agaagcatca cagaacacca 121 ggattccata cacaggaagc ccctgaggct gagctgatga agacacagtg gggtgaggtc 181 tggacacacc tgttactgct gcttctaggt tttctccatg tgtcctgggc ccaaagcagc 241 tgcaccgggc cccctggcat ccctggcatc cctggggtcc ctggggttcc tggctctgat 301 ggccaacctg gcactccagg gattaaaggg gagaaagggc tccctggact ggctggagac 361 cttggtgagt ttggagagaa aggggaccca gggatccctg ggactccagg caaagttggc 421 cctaagggtc ccgtcggccc taagggtact ccaggcccct ctggaccccg cggtcccaaa 481 ggcgattctg gggactacgg ggctacacag aaagtcgcct tctctgccct gaggaccatc 541 aacagcccct tgcgaccgaa ccaggtcatt cgcttcgaaa aggtgatcac caacgcgaac 601 gagaactatg agccacgcaa cggcaagttc acctgcaagg tgcctggcct ctactacttc 661 acctatcatg ccagctcccg gggcaacctg tgtgtgaatc tcgttcgtgg ccgcgatcgg 721 gacagcatgc agaaagtagt caccttctgt gactatgccc agaacacctt ccaggtgacc 781 acaggtgggg tagtcttgaa gctagagcaa gaggaggttg ttcacctgca ggccacagac 841 aagaactccc tcctgggcat tgagggtgcc aacagcatct tcactggctt tctgcttttc 901 cctgacatgg atgcgtaatc acggggtcaa attacaccta tccaacacca tcttcctgct 961 cctgcagcaa tcctccctgg acccctgaca tcaccccctt gactgcctga aacccagacc 1021 agagccctgt agatgttaca gaacgaatgg gtcaat // LOCUS MUSHOX23R 3542 bp ss-mRNA ROD 11-AUG-1990 DEFINITION Mouse Hox2.3 protein mRNA, complete cds. ACCESSION X06762 M36183 KEYWORDS Hox2.3 protein; homeo box. SOURCE Mouse plasmacytoma cell line, cDNA to mRNA, clones pRO.8, pR1.2, and genomic clones L13 and L2. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. 
REFERENCE 1 (bases 1 to 3542) AUTHORS Meijlink,F., Verrijzer,P., De Laaf,L., De Graaff,W. and Deschamps,J. TITLE A mouse homeobox containing gene on chromosome 11: Sequence and tissue-specific expression JOURNAL Nucleic Acids Res. 15, 6773-6786 (1987) STANDARD simple staff_review REFERENCE 2 (bases 1184 to 2206; revises [1]) AUTHORS Verrijzer,P., De Graaff,W., Deschamps,J. and Meijlink,F. TITLE Nucleotide sequence of the Hox2.3 gene region JOURNAL Nucleic Acids Res. 16, 2729-2729 (1988) STANDARD simple staff_review FEATURES from to/span description pept 2187 2840 Hox2.3 polypeptide mRNA 2100 3373 Hox2.3 mRNA site 2595 2774 homeo box site 3371 3371 hox2.2 mRNA BASE COUNT 738 a 924 c 951 g 890 t 39 others ORIGIN Chromosome 11. 1 gagctcggcc ccgcgagcga cgcaggagct gggcctccca cagcagcgtc ccccgccgcg 61 ccagtccccg ctagtggtag tatctcgtaa tagcttctgt gtgtgagcta ccgtggatct 121 ccttcccttc tcttgggggt ccggggggaa aaaaagaaaa ggattttaag caaggactcc 181 ctcgtcctgc gagggtgatc gactgcggcc tggcagaacc ccctcgcccc cgccccatgt 241 aaaaaagcct ccttgtgcaa tggtctgttt cctttgaacg tgcttctttg taatgaccga 301 ggtaccgatt tctgctaagt tttcccaaca acatgaaact gcctattcac gccgtaattc 361 tttctgtctc ccgctcactt tctctctttc tctcgctctc tttctctcac cgcgtcccca 421 tctttcctcg caaccccctc tccccgctgc cctccctagc tggctttctc tcttgcttct 481 ctcttttcct cctgtccccc cacccccacc ccctttggtt tgacaatttt gtcttaagtg 541 tttctcaaaa gagattactt tagttagcat gcgcgctgtg agcattgtta aaagtgttct 601 taggtttact gtgaagagaa tgtatcctgt atctgtgaat tgctttatgg gggggaggga 661 gggctaatta tatattttgt tgttcctcta tactttgttc tgttgtctgc gcctgaaaag 721 ggcggaagag ttacaataaa gtttacaagc agagaacccg agnnnnnnnn nnnnnnnnnn 781 nncccgggcc gccgctcctc attcgctcct aggcgccttg cagggctggg ggtggggggg 841 agctggtcag caggctcctg ggctggccta ggctaggtcg ctgagaggag ggggcggggg 901 cgggggctgg aagcaggtgg tgcgagtccc tgggcccagg ggcgcagggg gtgagggagg 961 cggctgaacg tgattggagg agagaggatc gagggagggg agccaagaga aaccccctcc 1021 ccttgcattc tgaggctgaa ggaccaggga gactccagcg cccaggccgc tcttgggaag 1081 
agatctaccc aggctggtgg ctagtgtccc ccgccgcttt tctctttgtt tccgtgtgtg 1141 tgtgtgtgtg tgtgtgtgtg tgtgtgtgtg tgtgtgtgtg tttggggnnn nnnnnnnnnn 1201 nnnnnngggg tgagaaagat gcagcgcaag acttctgagt ttgtatttcc attttttcct 1261 tggactcagg ttggggaaac aggagcagag ggaagcggtt cctccctacc tcccctctct 1321 gggacgtcgt cactttctcc gacgtttcta ggcctcggct tgccgcagcc ttgcttcctt 1381 cgttgcttct gccttcctgg cagccacgct ccagtgagtg aggcatccgc cttccggaac 1441 cgggaaagca aagagccgga cccaagcctc cttcctccct tcctttttct cccagccccc 1501 attccattct ttttaaattt tgtatatctt tttattgtca tcagaaatct gagcgtccaa 1561 cgccttattg ggttggatct ctgccttagg gacgccttgg tcacatctag ttactacaac 1621 tggggcacta agacaaccgg gaggccaggc tcgcctcctc ttgggagaag agcagcagct 1681 cggatgaatt aacccaaatt aataaatatt cggccagcac ccaccgacca agttgcgaac 1741 attcaatccc tgcgtctctc tggctctgta accggctggg ggaaatgggt gggggatgac 1801 aacacggttc cctcagaggt tatttatttt ctcttccact caattccttc ttccccaaat 1861 ctcgcctgca agctgcctcc agcccgcggg ggtcgacagc ggcccttaag cccccagccc 1921 caatccgcag agctcggcct tcccattcat tattgatcat attttataaa tcgaacgcga 1981 cacaattttt tccacattac tgggagcctc cgggaggccg tcatagcatt ggccgagggg 2041 atatcacgtg ggccggggtc acgtggtcag aagaggaaaa agggggtcct tttggtgtaa 2101 atctggactc taattctgta atatatcaag gaatctcgta aaaccgacac taaaacgtcc 2161 ccgactacaa atcatccggc caaattatga gttcattgta ttatgcgaat gctttatttt 2221 ctaaatatcc agccgcaagt tcggttttcg ctccaggagc cttccccgaa caaacttctt 2281 gcgcctttgc ttccaacccc cagcgcccgg gctatggagc aggtccgggc gctcctttct 2341 ccgcctcggt gcagggtctg tactccggcg ggggggccat ggcgggccag agcgcggctg 2401 gcgtctatgc ggccggctac gggctcgaac cgagttcctt caacatgcac tgcgcgccct 2461 ttgagcagaa cctctccggg gtgtgtccgg gcgacgccgc caaggccggt ggcgccaagg 2521 agcagaggga ctcggacttg gcggccgaga gtaacttccg gatctacccc tggatgcgaa 2581 gctcagggcc tgaccgaaag cgaggccgcc agacctacac gcgctaccag accctggagc 2641 tggagaaaga atttcactac aatcgctacc tgactcggcg gaggcgcatc gagatcgcgc 2701 acacgctctg cctcaccgaa agacagatca agatctggtt tcagaaccgg cgcatgaagt 2761 ggaaaaagga 
gaacaaaacc tcaggaccgg gaaccaccgg ccaggacaag gcggaaggcg 2821 aggaggagga ggaagagtga gggacagaga aagccaagag gaggaaagcg aagagaggga 2881 gaacccaatt atgggaactg aagcatgaaa ctcaaataaa ggggcaaact atttaaatga 2941 agaggtctaa aaataataag gaaagagaag ggtgaaattt gggtttctta acactgtaaa 3001 aaaatatact acctatggga aagtgtgttg tctgtttttg tacagtatcg gaaggacatt 3061 atctacctgt tctgtggctt tctggaatgt gcctcccctt ttctatgttg ctagtaaggt 3121 ctttgtaaaa tcttgctgtt ttgtaagccc tctttgaagc tgtctttgtg aactgtggtt 3181 ccagatgaac agattagtgc ggttccttac ctaccccata cttcccagta gcagtactag 3241 aggggttgta tggagccctg aggatccact agcttctgcg tctggtgcat tttggctgct 3301 gtttctaggt cctattcacg agcctctttc tgtatatctg aaggatggaa aataaaacag 3361 gattaaatat caacagacac gctgtgtttt ctgtctccct ctggctgggg gaggtgtaag 3421 ggttggtagt gggcgagtgt cccagagccc ctctgctggt ctttttctgt cggtgctggg 3481 tttgctttta ttttccagat agcctctgct atagagactg ccagctagtg aatccccaac 3541 at // LOCUS POLVP1A 900 bp ss-RNA VRL 11-AUG-1990 DEFINITION Poliovirus type 3 VP1 gene, partial cds. ACCESSION M37321 M36161 M36162 KEYWORDS . SOURCE Poliovirus type 3 (P3/Leon/1947) RNA. ORGANISM Poliovirus type 3 Unclassified. REFERENCE 1 (bases 1 to 900) AUTHORS Minor,P.D., Evans,D.M.A., Schild,G.C., Ferguson,M. and Almond,J.W. TITLE Identification of an antigenic site in the neutralization of type 3 poliovirus JOURNAL Rev Infect Dis 6, 516-518 (1984) STANDARD simple staff_review REFERENCE 2 (bases 34 to 900) AUTHORS Almond,J.W., Cann,A.J., Minor,P.D., Reeve,P., Schild,G.C., Hauptmann,R. and Stanway,G. 
TITLE Nucleotide sequence from neurovirulent and attenuated strains of type 3 poliovirus JOURNAL Rev Infect Dis 6, 487-493 (1984) STANDARD simple staff_review FEATURES from to/span description pept < 1 > 900 VP1 (AA at 1) BASE COUNT 238 a 250 c 206 g 206 t ORIGIN 1 ggtattgaag atttgatttc tgaagttgca cagggcgccc taactttgtc actcccgaag 61 caacaggata gcttacctga tactaaggcc agtggcccgg cgcattccaa ggaggtacct 121 gcactcactg cagtcgagac tggagccacc aatcctctgg caccatccga cacagttcaa 181 acgcgccacg tagtccaacg acgcagcagg tcagagtcca caatagaatc attcttcgca 241 cgcggggcgt gcgtcgctat tattgaggtg gacaatgaac aaccaaccac ccgggcacag 301 aaactatttg ccatgtggcg cattacatac aaagatacag tgcagttgcg ccgtaagttg 361 gagtttttca catactctcg ttttgacatg gaattcacct tcgtggtaac cgccaacttc 421 accaacgcta ataatgggca tgcactcaac caggtgtacc agataatgta catcccccca 481 ggggcaccca caccaaagtc atgggacgac tacacttggc aaacatcttc caacccgtcc 541 atattttaca cctatggggc tgccccggcg cgaatctcag tgccatacgt ggggttagcc 601 aatgcttact cgcactttta cgacggcttc gccaaggtgc cattgaagac agatgccaat 661 gaccagattg gtgattcctt gtacagcgcc atgacagttg atgactttgg tgtattggca 721 gttcgtgttg tcaatgatca caaccccact aaagtaacct ccaaagtccg catttacatg 781 aaacccaaac acgtacgtgt ctggtgccct agaccgccgc gcgcggtacc ttattatgga 841 ccaggggtgg actataagaa caacttggac cccttatctg agaaaggttt gaccacatat // LOCUS YSCATRP3A 1240 bp ds-DNA PLN 11-AUG-1990 DEFINITION S.cerevisiae glutamine amidotransferase (TRP3) gene, 3' end. ACCESSION M36300 KEYWORDS TRP3 gene; glutamine amidotransferase. SOURCE S.cerevisiae DNA. ORGANISM Saccharomyces cerevisiae Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes; Endomycetales; Saccharomycetaceae. REFERENCE 1 (bases 1 to 1240) AUTHORS Aebi,M., Furter,R., Prantl,F., Niederberger,P. and Huetter,R. TITLE Structure and function of the TRP3 gene of Saccharomyces cerevisiae: Analysis of transcription, promoter sequence, and sequence coding for a glutamine amidotransferase JOURNAL Curr. Genet. 
8, 165-172 (1984) STANDARD simple staff_review FEATURES from to/span description pept 401 > 1240 glutamine amidotransferase (TRP3) mRNA 309 > 1240 TRP3 mRNA (alt.) mRNA 314 > 1240 TRP3 mRNA (alt.) mRNA 320 > 1240 TRP3 mRNA (alt.) BASE COUNT 352 a 267 c 259 g 362 t ORIGIN 1 ttcacttttt ttgcttctta taggcctttt tgaactattt tctgttaggg gcttaaagaa 61 agcgcagcaa atggctgatt ttaaatgctg ttttctcgaa agcaatgctg atgatgtacc 121 acaaagtagt atttgcgctt agcttgtcga tataagtatg cctttgagcc agtattaaac 181 ttcctactat ctcttctgga cctcaccgtg ctattcatat gactgacgac ggaaaatgcc 241 tcttgtagta atcattgaat gatcatcaat ggctcgttga ctcattctaa tctatagcag 301 atattaaggt acgcaaaaag ttcgacaagg agctggtcag tattaacaca ttaccaaatc 361 tgtttggttc tacatagaac gccataaaag taagaaaaaa atgtctgtgc acgctgcaac 421 aaacccaatc aataagcatg tggttctaat tgacaactac gattccttta cctggaacgt 481 ttacgagtac ttgtcccagg agggcgccaa agtgagcgtc taccgtaacg atgcaattac 541 agttccagaa attgccgcct tgaatcccga cacattgctt atctcgcttg gactaggcca 601 cccaaagaca gattctggca tttcaagaga ctgtatccgg tattttactg ggaaaattcc 661 tgtatttgga atctgtatgg gccagcaatg catgtttgac gtatttggtg gtgaagttgc 721 ctacgctggt gagattgtcc acggtaaaac gtccccaatc tctcacgaca actgtggaat 781 tttcaagaac gtgccgcaag gtattgctgt gacaagatac cattcattgg ccgggacaga 841 atcgtcccta ccatcctgct tgaaggttac tgcgagtacc gaaaatggaa ttatcatggg 901 tgtaagatac aagaagtaca ctgtagaagg tgtgcaattt catccggaat ccatcttgac 961 cgaggaaggt catctgatga tcaggaacat tttaaatgtc agtggaggca cttgggagga 1021 aaacaaatca tctccttcaa attctatttt ggaccgtatc tatgctcggc gtaaaataga 1081 cgtcaatgag cagtctaaaa tcccaagttt cacctttcaa gacttacaat ctaactatga 1141 tttaggtctt gccccaccgt tacaggattt ctacacggtg ttgtcatcat cccataaaag 1201 agccgttgtt cttgctgaag tcaagcgtgc ctctccatcg //