Path: utzoo!attcan!uunet!munnari.oz.au!uokmax!apple!snorkelwacker!bionet!root From: GenBank-Updates@genbank.bio.net Newsgroups: bionet.molbio.genbank.updates Subject: Database Update Message-ID: Date: 15 Aug 90 12:00:25 GMT Sender: root@genbank.BIO.NET Distribution: bionet Lines: 5936 Approved: lear@genbank.bio.net Checksum: 12444 365 LOCUS RHPNIFDK 3500 bp ds-DNA PLN 15-AUG-1990 DEFINITION Parasponia rhizobium nifD and nifK genes coding for the alpha- and beta-subunits of the Mo-Fe protein of nitrogenase, complete cds. ACCESSION X01139 KEYWORDS nitrogenase. SOURCE Parasponia rhizobium (strain ANU289) DNA, clones pR289nif-[3,4,5]. ORGANISM Parasponia rhizobium Eukaryota; Plantae; Embryobionta; Magnoliophyta; Magnoliopsida; Hamamelidae; Urticales; Ulmaceae. REFERENCE 1 (bases 1 to 3500) AUTHORS Weinman,J.J., Fellows,F.F., Gresshoff,P.M., Shine,J. and Scott,K.F. TITLE Structural analysis of the genes encoding the molybdenum-iron protein of nitrogenase in the Parasponia rhizobium strain ANU289 JOURNAL Nucleic Acids Res. 12, 8329-8344 (1984) STANDARD simple staff_review COMMENT EMBL features not translated to GenBank features: key from to description PRM 108 124 consensus promotor sequence SITE 135 135 transcription start RBS 163 168 pot. ribosome binding site RBS 1753 1758 pot. ribosome binding site SITE 3322 3355 pot. stem-loop structure FEATURES from to/span description pept 176 1678 Mo-Fe protein alpha-subunit pept 1767 3308 Mo-Fe protein beta-subunit BASE COUNT 826 a 957 c 992 g 725 t ORIGIN 1 bp upstream of EcoRI site. 1 gaattctccg tgcaaagcgc gatgtcgcct tcgcaacaac aaccagcccc atcggacgaa 61 acgcgctaac tgtttttatt tattctgctt tttgtgctcg cgccgcgctg gcatgctcgt 121 tgcagtcttg ttcaagaagc tgctcccgca cagttaattc ttgaaggaca tcagcatgag 181 tctcgccacg acccagagca tcgcagaaat cagggctcgc aataaagagc tgatcgagga 241 ggtgctgaaa gtctatccgg agaagaccgc gaaacggcgt gccaagcacc tcaacgttca 301 ccaagccggc aagtcggact gcggggtcaa gtccaacatc aaatcaatac ctggtgtgat 361 gacaatcaga ggctgcgcct atgcaggatc caaaggggtg gtctggggac cgatcaagga 421 catggtccat atcagccatg gcccggtcgg ctgtggtcag tattcgtggg gctcgcgtcg 481 caactattat gttggcacga cgggcgtcga tagtttcgtg accctgcagt tcacctccga 541 cttccaggaa aaggacatcg tatttggcgg cgacaagaag ctgatcaaag tccttgacga 601 aatccaggag ctgttcccgc tcaacaacgg catcaccatc caatcggaat gcccgatcgg 661 actgatcggg gacgacatcg aggctgtgtc aagatcgaaa tccaaagaat acggcggcaa 721 gaccatcgtg cctgttcgct gtgagggctt tcgcggcgtg tcgcaatcgc ttggccacca 781 cattgccaat gacgcggtgc gcgattggat cttcgacaag ctagagcccg agggcgaacc 841 aaagttccag ccgacgccct acgacgttgc gatcatcgga gactacaata ttggcggcga 901 tgcctggtca tcgcgcattc tgctggaaga aatgggcttg cgggtgattg cgcagtggtc 961 cggcgacggt tccctcgccg aactcgaagc aacgccgaag gcaaagctca atattctgca 1021 ttgctaccgt tccatgaact acatctcccg ccacatggag gagaagtttg gcatcccctg 1081 gtgcgagtac aacttcttcg gaccgtcgaa gatcgcagaa tcgctgcgca agattgcggg 1141 ctatttcgac gacaagatca aggaaggcgc cgagcgagta attgaaaaat accagccact 1201 ggtggacgcc gtaatcgcaa aatatcgccc ccgcctggag ggcaagactg tgatgctgta 1261 cgtcggcggg cttcgtccac gtcatgtgat tggcgcgtac gaggatctcg gcatggaagt 1321 cgtgggcacc ggatacgagt tcggccacaa cgacgattat cagcgcaccg cccagcacta 1381 cgttaaggac agcacgctca tctacgacga cgtcaatggc tatgaattcg agcgcttcgt 1441 cgaaaaggtc caaccagatc tggttggctc gggcatcaag gagaaatacg ttttccaaaa 1501 gatgggtgtg ccgttcccgg agatgcattc ctgggactat tccggcccat atcacggcta 1561 tgacggcttt gcgatcttcg cgcgggacat ggacatggct gtcaactcgc cgatctggaa 1621 gaagacgaag gccccctgga aggaagctgc gaagccgaag ctcttggctg cagaataaca 1681 agcacttggt tccacaatag agcgatcaat cccgctctct gcggagagct ggggcgacat 1741 catttcgata gtgaaggatc ttaacaatgg cgcagagtgc agaccatgtg ctcgatcatc 1801 tcgaactgtt ccgcggtcca gaataccaac aaatgctggc cgacaagaag atgttcgaga 1861 atccccgcga tcctgccgag gtcgaacgta tccgagcagt gacgaaaacg cccgaatatc 1921 gcgagaagaa ttttgcggag gcgcttgcgg taaatccggc caaggcttgc cagccgcttg 1981 gcgccgtatt cgtctcggtt ggttttgaag gcacgctgcc cttcgtccat ggctcgcagg 2041 gctgcgtggc ctattaccgc agccatctgt cgcggcactt caaggagccg agctcctgcg 2101 tgtcttcgtc gatgacggaa gacgccgctg tattcggggg gctgaacaat atgatcgatg 2161 gcctcgccaa cagctacaac atgtacaaac ccaagatgat ttgctcgacg acctgcatgg 2221 ccgaggtgat cggcgatgac ctgaacgcct tcatcaagac atcaaaagaa aaaggctcgg 2281 ttcggcggag ttcgactcct ttcgcgcaca ctccagcgtt cgtcggcagc cacgtcaccg 2341 gctatgacaa cgcactcaag ggcattctcg agcacttttg gaacggcaag gccggaacgg 2401 cgccgaagct ggagcgcaaa ccaaacgagg caatcaacat catcggcggt ttcgatggca 2461 ataccgttgg aaaccttcgt gagatcaagc gaatcttagc gttgatgggc atcaaacaca 2521 cgattctcgc cgataactct gaagtcttcg ataccccgac tgatggcgag ttccggatgt 2581 atgacggcgg tacccacgtg gaggacacgg ccaacgcgat tcacgccaag gcgacaatct 2641 ccatgcagca atggtgtacg gaaaaaacgc tgccgttcgt gtccgagcat ggacaggacg 2701 ttgtgtcttt caattacccg gtaggtgtat ccgcgacgga tgatcttctc gtggccttgt 2761 cacgcatcag cggcaaggag attccggagc aactcgcgcg agagcgtggc cgcttggttg 2821 atgccatcgc ggattccagc gcgcatatcc atggcaagaa gttcgcgatc tacggcgatc 2881 cggatctctg ctatgggttg gctgcctttc tgctcgaact cggcgccgag cctactcatg 2941 tgctgtccac caacggcaac aacgtggcag gagaaaatgc gacgctgttt gcaggctcgc 3001 catttggaga acttccagcc tatccgggac gagacctctg gcacatgcgc tcgctcttgt 3061 tcacagagcc ggttgacttt ctgattggca acacccatgg caagtacctg gagcgtgaca 3121 ctggaacgcc attgatccgc atcggctttc caatttttga tcggcatcac catcaccgct 3181 tccctgtatg gggctatcag ggcggcctga atgtgctggt gaagatcctc gacaagatct 3241 tcgacgaaat cgacaagaag accagcgttc ttggcaaaac tgactacagt ttcgacatca 3301 ttcgttgatg acgggcagtg cgcgtgggct cgccgaaaca gcggcgagcc cacgctgggc 3361 actggttgac attgaaattt tcttccgctg agaggaaaat gctgatgagt tcgtctagtc 3421 ggccacggtc cagggtattt tcaggcgaac cgggctgccg aagaatggaa gtaagtcgga 3481 ggctgagcgc aagaagggct // LOCUS RHPNIFH 2030 bp ds-DNA PLN 15-AUG-1990 DEFINITION Parasponia rhizobium nitrogenase (nifH) gene, iron protein component. ACCESSION K00487 KEYWORDS nifH gene; nitrogenase; unidentified reading frame. SOURCE Parasponia rhizobium (strain ANU289) DNA. ORGANISM Parasponia rhizobium Eukaryota; Plantae; Embryobionta; Magnoliophyta; Magnoliopsida; Hamamelidae; Urticales; Ulmaceae. REFERENCE 1 (bases 1 to 2030) AUTHORS Scott,K.F., Rolfe,B.G. and Shine,J. TITLE Nitrogenase structural genes are unlinked in the nonlegume symbiont Parasponia rhizobium JOURNAL DNA 2, 141-148 (1983) STANDARD full staff_review COMMENT [1] states the iron protein subunit is encoded on a separate operon from other components of the nitrogenase enzyme complex, unlike previously studied nitrogen-fixing prokaryotes. FEATURES from to/span description pept 576 1460 nifH (nitrogenase iron protein) mRNA 421 > 1460 nifH mRNA BASE COUNT 430 a 572 c 618 g 410 t ORIGIN 5 bp upstream of PstI site 1 ctgcagggcc cttgtaaggc gcttcttgct gcctttaagc tcatgcgcac cgatctgatc 61 agctggatca atcgggaggt cagccgcaca attgatctcg tcatcctcga ccacgaaccc 121 catcgccggc cacttgcctt gaggttctga cctcgacctg catattgctc tccgcggatt 181 gccgccactg gcttgcaaga agaggagcaa gtcccgttcc agttgaggaa atcgaaccag 241 atcatgccaa accggcgttt tccggttgat gggtgtggcc gttgttcgtt ttctgacagc 301 cgcgcagatc ctgtccggtg caaacctccc tggggtagct cagcggctcg ttggcttttt 361 agagcgtaat caagaagctt aataagcgcg gacagtgttg gcatggcgat tgctgttgag 421 ttgcagcaac actgagtgag ggctgggtgc acgccgacgc gtaagacgag cgatgcgctc 481 cttcccttga acccgtgtgc cccgtttctg agagagaaac aagctcgcgt gtcggaagca 541 cgcaactttt ggcaaatcgg ttgatggaga acaacatgtc ttcactgaga caaatcgcgt 601 tctacggaaa gggcggcatc ggcaagtcga ccacgtccca gaatacgttg gcggcactgg 661 ccgagatggg ccagaaaatc ctgatcgtgg gatgcgatcc taaggcggac tcgacgcgcc 721 tcatcctgca cgcgaaggcg caggacacga ttttgagcct tgcagcgagc gctggcagcg 781 tggaagacct cgaactcgag gacgtgatga aggtcggcta caaggacatc cgatgcgtgg 841 agtccggtgg tcccgagccg ggtgtcggct gcgcgggccg cggcgtcatc acctcgatca 901 atttcctgga ggagaacggc gcctatgaga acattgacta tgtctcatat gacgtgctcg 961 gcgacgtcgt ttgcggtggc tttgcgatgc cgatccggga aaacaaggcg caggagatct 1021 atatcgtgat gtctggagaa atgatggcaa tgtatgccgc aaacaatatc tccaaaggta 1081 tcctgaaata cgccaactct ggcggcgtgc ggctgggcgg cctgatctgc aacgagcggc 1141 agaccgataa ggagctggag ctggcggagg cgctggccaa gaagttaggt actcagctga 1201 tctacttcgt gccgcgcgac aatgtggtgc agcatgccga gctacggcgc atgacggtgc 1261 tggagtatgc ccctgagtcg cagcaggccg atcactatcg caatcttgcg accaaggttc 1321 acaacaatgg cggcaaaggc atcattccga ctccgatctc catggatgag ctcgaggaca 1381 tgctgatgga gcatggcatt atgaagcccg tcgacgaatc catcgtcggc aagaccgccg 1441 ccgaactcgc ggcctcgtaa aggtcgcggg tcgcggcctt gtgaaggcgc gcgacggatg 1501 ccggtctccc tcacccccca tccggggaga ccggcattct gacgattatc tgaccagcca 1561 gagtggagct ggcaaccgtg accgctatgg gaacccaaaa catcatgaca ggagcgcact 1621 tccttccgct tatggcttct tgcgccgtcg aggcgagcag caaggtgcaa agaggaattg 1681 cgacctaccg agcgctcact ggcgtcctcc tgaagaggcc gacattgcga ccgacagcaa 1741 tttcgattgc catgtcctgg cgtcaatcct ggcggccgct cgatggatgg tggcccgctt 1801 cccgagcgcc ctgtccgcca ccagctggcg accctgctcg cagcaatttc catcggttga 1861 ggtcgatatc tcggagcagc tcctggcgtc taagtgcgat gagaatgacg agatcgcgat 1921 ggtgcgcgat cttttgctca agcaacgctc gacggacggg catattcggg ctggctagcc 1981 gcgatgattg cgcgccgcgc catagagcca gatcacctgt gggaagatct // LOCUS RHPHBEM 1520 bp ds-DNA PLN 15-AUG-1990 DEFINITION P.andersonii haemoglobin gene, complete cds. ACCESSION M36509 KEYWORDS haemoglobin. SOURCE P.andersonii DNA. ORGANISM Parasponia andersonii Eukaryota; Plantae; Embryobionta; Magnoliophyta; Magnoliopsida; Hamamelidae; Urticales; Ulmaceae. REFERENCE 1 (bases 1 to 1520) AUTHORS Landsmann,J., Dennis,E.S., Higgins,T.J.V., Appleby,C.A., Kortt,A.A. and Peacock,W.J. TITLE Common evolutionary origin of legume and non-legume plant haemoglobins JOURNAL Nature 324, 166-168 (1986) STANDARD simple staff_review FEATURES from to/span description pept 198 313 haemoglobin, exon 1 436 550 haemoglobin, exon 2 877 993 haemoglobin, exon 3 1153 1293 haemoglobin, exon 4 IVS 314 435 haemoglobin intron A IVS 551 876 haemoglobin intron B IVS 994 1152 haemoglobin intron C BASE COUNT 470 a 281 c 276 g 493 t ORIGIN 1 ttatcttact aaaaagaaaa cgaaaataaa aaacccaaag atatggctcc ccaataccct 61 gaagagttac acacgatccc cattttttct actatatata cagagtgcct tcaccagatt 121 ttccaaacac actccaacat atcccattgc ccaaataaaa atttctcagc ttttagtccc 181 ctcaacccac agaagccatg agcagctcag aagttaacaa agttttcaca gaggagcagg 241 aagctctggt ggtgaaagca tgggctgtaa tgaagaagaa ctctgctgaa ctgggtcttc 301 aattcttcct caagtaagtc aaaattatat atagtacact ttttatttac tttgcttctt 361 ttatagacca agtttttgaa taaaagggta ctattttttt ttcctgaaaa aaattggttg 421 attgaaactt tgcaggatat ttgagattgc accgtctgcc aagaacttgt tctcttattt 481 gaaggactct ccggttcctt tggagcagaa cccaaagctc aagccccatg ctacgactgt 541 cttcgttatg gtaaagccaa cttttgttct cctattccct tatcctaatt ttacaagaat 601 ctaatgttaa taaaatagta ttttgcctat ttaaacaacc aaaaatttag acacaactat 661 ataaaacatt taaattcttg tggtttatga taccttgatc tacaatgatt ccaacttccc 721 gtgttgcatt tatgagttgt gctagcaaca gtcgcatcac agtcgtctat tccagaaagg 781 acgactgtga ctcttgagac atatcaaagc aaagctcagc aatttttatg tttctcactt 841 gctctgttct ttttctctgg tacttgtcct ggaaagacat gtgagtctgc ggttcaactt 901 cggaaagccg gaaaagtgac agtgaaagaa tcagacttga aaagaattgg ggctatccac 961 ttcaaaactg gcgtagttaa tgaacatttt gaggtactac cctggccact tagtagatat 1021 aattccctaa gtgtaatcca aacatttgtt gtttagagtc aaattattat tattctgtat 1081 ggtggttctt gaataatcga tcttattatg gtatttacta attatattat gcatgggaaa 1141 aacgatttgt aggtcacaag gtttgcactt ttggagacca taaaggaagc agtaccagaa 1201 atgtggtcac ctgagatgaa gaacgcatgg ggagtagctt atgatcagtt ggttgctgcc 1261 atcaagttcg aaatgaaacc ctccagtact tgagaatttt tatagttctt ggaacaattg 1321 ggtttgaata atgtgacaaa acttatactt aattacgttt gcatgagaga gaggtaataa 1381 ttgcatagtg tataacttgc atatgtatca tagtgtgacg caatctctcc acttgtgttg 1441 ttcatcttgt tcaaaaggaa ttagtctttc actttacatt ttgggtggaa gtatggaatg 1501 aaatcagagt ttcattgatt // LOCUS PT7RNAA 266 bp ds-DNA PHG 15-AUG-1990 DEFINITION Bacteriophage T7 RNA polymerase gene 1, 3' end. ACCESSION M24964 M24965 ORGANISM Bacteriophage T7 Viridae; ds-DNA nonenveloped viruses; Podoviridae. REFERENCE 1 (bases 1 to 266) AUTHORS Osterman,H.L. and Coleman,J.E. TITLE T7 ribonucleic acid polymerase-promoter interactions JOURNAL Biochemistry 20, 4884-4892 (1981) STANDARD simple staff_review FEATURES from to/span description pept < 1 201 RNA polymerase (gene 1; AA at 1) mRNA < 1 266 gene 1 mRNA BASE COUNT 72 a 66 c 63 g 65 t ORIGIN 1 ccggctgacg ctgcgaacct gttcaaagca gtgcgcgaaa ctatggttga cacatatgag 61 tcttgtgatg tactggctga tttctacgac cagttcgctg accagttgca cgagtctcaa 121 ttggacaaaa tgccagcact tccggataaa ggtaacttga acctccgtga catcttagag 181 tcggacttcg cgttcgcgta acgccaaatc aatacgactc actatagagg gacaaactca 241 aggtcattcg caagagtggc ctttat // LOCUS PT7RNAB 139 bp ds-DNA PHG 15-AUG-1990 DEFINITION Bacteriophage T7 class III RNA polymerase promoter L1 fragment. ACCESSION M24966 ORGANISM Bacteriophage T7 Viridae; ds-DNA nonenveloped viruses; Podoviridae. REFERENCE 1 (bases 1 to 139) AUTHORS Osterman,H.L. and Coleman,J.E. TITLE T7 ribonucleic acid polymerase-promoter interactions JOURNAL Biochemistry 20, 4884-4892 (1981) STANDARD simple staff_review FEATURES from to/span description mRNA 58 > 139 L1 mRNA BASE COUNT 44 a 25 c 28 g 42 t ORIGIN 1 cggtatttaa ttaaatattc tccctgtggt ggctcgaaat taatacgact cactataggg 61 agaacaatac gactacggga gggttttctt atgatgacta taagacctac taaaagtaca 121 gactttgagg tattcactc // LOCUS PT7RNAC 141 bp ds-DNA PHG 15-AUG-1990 DEFINITION Bacteriophage T7 L2 nonpromoter fragment. ACCESSION M24967 ORGANISM Bacteriophage T7 Viridae; ds-DNA nonenveloped viruses; Podoviridae. REFERENCE 1 (bases 1 to 141) AUTHORS Osterman,H.L. and Coleman,J.E. TITLE T7 ribonucleic acid polymerase-promoter interactions JOURNAL Biochemistry 20, 4884-4892 (1981) STANDARD simple staff_review BASE COUNT 36 a 35 c 35 g 35 t ORIGIN 1 cggaagtgct ggcattttgt ccaattgaga ctcgtgcaac tggtcagcga actggtcgta 61 gaaatcagcc agtacatcac aagactcata tgtgtcaacc atagtttcgc gcactgcttt 121 gaacaggttc gcagcgtcag c // LOCUS SIVSMMM7 1210 bp ss-RNA VRL 15-AUG-1990 DEFINITION Simian immunodeficiency virus (SIV) pol region. ACCESSION M27256 KEYWORDS . SOURCE Simian immunodeficiency virus (isolate SMM-M7). ORGANISM Simian immunodeficiency virus Viridae; ss-RNA enveloped viruses; Positive strand RNA virus; Retroviridae; Lentivirinae. REFERENCE 1 (bases 1 to 1210) AUTHORS Li,Y. JOURNAL Unpublished (1989) STANDARD full staff_entry COMMENT This sequence corresponds to the 3' third of the pol gene. Kindly provided in computer-readable form by Yen Li. Author address:Y.Li New England Regional Primate Research Center Southborough, Massachusetts 01772 (508-481-0400). BASE COUNT 478 a 207 c 269 g 256 t ORIGIN 1 gcccggccag taatccgccc accattgctc ccgaatttcg acccctcctc tagtcagatt 61 agtgttcaat ttggtaaagg atcccatcga agaaatagga acattttatg tggatggctc 121 ttgcaataaa cagtcaaaag agggaaaagc aggatacata acagacagaa ggaggagcaa 181 aataaagttc ttagaacaga ctaccaatca gcgagcagaa ttagaagcct ttctcatggc 241 agtaacagat tcaggagcag aggcaaatat tatagtagat tctcaatatg tgatggggat 301 agtgacaagg caacccactg aatcagaaag taaaatagta aatcagataa tagaagaaat 361 gatcaaaaag acagcagtat atgtgacata ggtaccagct cataaaggtc taggaagaaa 421 tcaagaaata gaccatttag ttagtcaaag gattaggcaa gtcttgttcc tagaaaagat 481 agaaccagcc caagaagagc acgaaaaata tcacagcaat gtaaaagaat tggtctttaa 541 atttaggata ccaagattag tagcaaaaca gatagtagat acctgtgata aatgccagca 601 gaaaggagaa gctatacata gacaggtaaa cacagagtta agaatttggc aaatagactg 661 cacacaccta gagggcaaag ttgttatagt agcagtacat gtggctagtg gattcataga 721 ggcagaagta atcccacaag aaacaggaag acagacagca ttgttcctgt taaaattagc 781 tagcaggtgg cccatcacac acctgcacac agataatggt gctaactttg cttcgcaaga 841 agtaaagatg gtagcctagt gggcagatat agaacacacc tttaaggtac catataatcc 901 acaaagtcaa agagtagtag aagcaatgaa tcatcaccta aagaatcaga tagagagaat 961 tagagagcag gcaaattcag tagaaacaat agtgctcatg gcagttcatt gcatgaattt 1021 taaaagaagg ggaggaatag gggatatgac cccagcagaa agattaatta atatgatcac 1081 cacagaacaa gaaatacaat tccaacaatc aaaaaattca aaatttaaaa attttcgggt 1141 ctatttcaga gaaggcagag accaactgtg gaaaggaccc ggtgaattac tgtggaaagg 1201 ggaaggagca // LOCUS ADEAD5A 180 bp ds-DNA VRL 15-AUG-1990 DEFINITION Adenovirus type 5 packaging domain region. ACCESSION M36423 KEYWORDS . SOURCE Adenovirus type 5 (strain dl309) DNA. ORGANISM Mastadenovirus h5 Viridae; ds-DNA nonenveloped viruses; Adenoviridae. REFERENCE 1 (bases 1 to 180) AUTHORS Graeble,M. and Hearing,P. TITLE Adenovirus type 5 packaging domain is composed of a repeated element that is functionally redundant JOURNAL J. Virol. 64, 2047-2056 (1990) STANDARD simple staff_review FEATURES from to/span description site 1 156 packaging domain BASE COUNT 48 a 27 c 52 g 53 t ORIGIN 1 gtgtacacag gaagtgacaa ttttcgcgcg gttttaggcg gatgttgtag taaatttggg 61 cgtaaccgag taagatttgg ccattttcgc gggaaaactg aataagagga agtgaaatct 121 gaataatttt gtgttactca tagcgcgtaa tatttgtcta gggccgcggg gcatttgacc // LOCUS CAJFJAAB 1932 bp ds-DNA BCT 15-AUG-1990 DEFINITION C.coli flagellin (flaB) gene, complete cds. ACCESSION M35141 KEYWORDS flaB gene; flagellin. SOURCE C.coli (strain VC167, serogroup LIO 8) DNA. ORGANISM Campylobacter coli Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Aerobic/microaerophilic, motile, helical/vibrioid bacteria. REFERENCE 1 (bases 1 to 1932) AUTHORS Guerry,P., Logan,S.M., Thornton,S. and Trust,T.J. TITLE Genomic organization and expression of Campylobacter flagellin genes JOURNAL J. Bacteriol. 172, 1853-1860 (1990) STANDARD simple staff_review FEATURES from to/span description pept 211 1932 flagellin (flaB) mRNA 185 > 1932 flagellin mRNA BASE COUNT 638 a 325 c 387 g 582 t ORIGIN 1 taacaaatcc aagcctagta gtaatactag gcttttttat ttctaaataa aacttggaac 61 attctttagc gtttactgta atttatacaa atccaagcct agtagtaata ctaggctttt 121 tttatttcta aataaaattt caatttgaat caaaacttgg aacacttctt gctttaatct 181 tttcgatgca atattttgaa aggatttaaa atgggtttta gaataaacac caacatcggt 241 gcattgaacg cacatgcaaa ttcagttgtt aatgctaggg agcttgacaa gtctttaagt 301 agacttagct caggtcttag aatcaactcc gcagcagatg atgcttcagg gatggcgata 361 gcagattctt tgcgttcaca agcagcaact ttaggtcaag ctataaacaa tggtaatgat 421 gctataggta tcttgcaaac tgcagataag gctatggatg agcaacttaa aatcttagat 481 accatcaaga ctaaagcgac tcaagctgct caagatggtc aaagcttaaa aacaagaact 541 atgcttcaag cagacatcaa ccgtttgatg gaagaacttg ataatatcgc aaataccact 601 tcatttaatg gcaaacaact tttaagtggt ggttttacca atcaagaatt ccaaatcggt 661 tcaagttcaa atcaaactat taaagcaagt ataggagcaa ctcagtcttc taaaatcggt 721 gtaacaagat ttgaaacagg ttcacaaagt ttttcttcag gcactgtagg acttactatt 781 aaaaactaca acggtatcga agattttaaa tttgatagtg tagtgatttc tacttcagta 841 ggaacaggtc ttggagcttt ggctgaagag atcaacagaa atgcagataa aacaggaatt 901 cgtgcaactt ttgatgtaaa atctgtagga gcctatgcaa taaaagcagg aaatacttct 961 caggattttg ctatcaatgg ggttgttatc ggacaaataa attataatga cggtgataac 1021 aatggtcaac ttatctcagc tatcaatgct gtaaaagata caactggtgt tcaagcctct 1081 aaagatgaaa atggtaaact tgttcttact tcggccgatg gtagagggat taaaatcaca 1141 ggtagcatag gtgtaggagc tggtatattg cacactgaaa attatggaag gttatcttta 1201 gttaaaaatg atggtagaaa tatcaatata agtggaacag gtctttcagc tataggtatg 1261 ggtgctacag acatgatttc tcaatcttca gtatctctaa gagagtcaaa agggcaaatt 1321 tcagcagcca atgctgatgc tatgggcttt aatgcttata atggcggcgg cgctaagcaa 1381 attattttcg cttctagtat tgcaggattt atgtctcagg ctggttcagg cttctctgct 1441 ggttcgggat tttcagtagg tagtggtaaa aattattcag ccattttatc agcttctata 1501 cagatagtat ctagcgcagc ttctatcagt agcacctatg ttgtttctac tggttcaggt 1561 ttctctgctg gttcaggtaa ttctcaattt gcagctttaa gaataagtac agtaagtgct 1621 catgatgaaa ctgcaggtgt aactacactt aagggtgcaa tggctgtgat ggatatagca 1681 gaaactgcta ttaccaattc tgatcaaatc agagcggata taggtgctgt gcaaaatcag 1741 ctccaagtaa cgataaataa tattaccgta acccaggtaa atgttaaagc agcagaatca 1801 accataagag atgtggattt cgctgcagaa agtgcaaatt tttctaagta caatatcctt 1861 gcgcagtcgg gttcatatgc tatgagccaa cgtaacgctg tgcaacaaaa tgtcttaaaa 1921 cttttacaat aa // LOCUS CAJFLA 1719 bp ds-DNA BCT 15-AUG-1990 DEFINITION C.coli flagellin gene, complete cds. ACCESSION M26945 KEYWORDS flagellin. SOURCE C.coli (strain VC167) DNA. ORGANISM Campylobacter coli Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Aerobic/microaerophilic, motile, helical/vibrioid bacteria. REFERENCE 1 (bases 1 to 1719) AUTHORS Logan,S.M., Trust,T.J. and Guerry,P. TITLE Evidence for posttranslational modification and gene duplication of Campylobacter flagellin JOURNAL J. Bacteriol. 171, 3031-3038 (1989) STANDARD simple staff_entry FEATURES from to/span description pept 1 1719 flagellin BASE COUNT 563 a 284 c 365 g 507 t ORIGIN 1 atgggatttc gtattaacac aaatgttgca gcattaaatg ctaaagcaaa ttcggatcta 61 aacagcagag cattagatca atcactttca agactcagtt caggtcttag aatcaactcc 121 gcagcagatg tagcttcagg gatggcgata gcagatagtt taagatctca ggcaaatact 181 ttgggtcagg ctatatctaa tggtaatgat gctttaggta tcttgcaaac tgcagataag 241 gctatggatg agcaacttaa aatcttagat accatcaaga ctaaagcgac tcaagctgct 301 gaagatggtc aaagcttaaa aacaagaact atgcttcaag cagacatcaa ccgtttgatg 361 gaagaacttg ataatatcgc aaataccact tcatttaatg gcaaacaact tttaagtggt 421 ggttttacca atcaagaatt ccaaatcggt tcaagttcaa atcaaactat taaagcaagt 481 ataggagcaa ctcagtcttc taaaatcggt gtaacaagat tgaacaggtt cacaaagttt 541 tcttcaggca ctgtagggct tactatcaaa aactacaacg gtatcgaaga ttttaaattt 601 gatagtgtag tgatttctac ttcagtagga acaggtcttg gagctttggc tgaagagatc 661 aacagaaatg cagataaaac aggaattcgt gcaacttttg atctaaaatc tgtaggagcc 721 tatgcaataa aagcaggaaa tacttctcag gattttgcta tcaatggggt tgttataggt 781 aaggttgatt attcagatgg tgatgagaat ggttctttaa tttcagctat caatgctgta 841 aaagatacaa ctggtgttca agcctctaaa gatgaaaatg gtaaacttgt tcttacttcg 901 gccgatggta gagggattaa aatcacaggt agcataggtg taggagctgg tatattgcac 961 actgaaaatt atggaaggtt atctttagtt aaaaatgatg gtagagatat caatataagt 1021 ggaacaggtt tttcagctat aggtatgggt gctacagaca tgatttctca atcttcagta 1081 tctctaagag agtcaaaagg gcaaatttca gcagccaatg ctgatgctat gggctttaat 1141 gcttataatg gcggcggcgc taagcaaatt attttcgctt ctagtattgc agggtttatg 1201 tctcaggctg gttcaggctt ctctgctggt tcgggatttt cagtaggtag tggtaaaaat 1261 tattcagcca ttttatcagc ttctatacag atagtatcta gcgcagcttc tatcagtagc 1321 acctatgttg tttctactgg ttcaggtttc tctgctggtt caggtaattc tcaatttgca 1381 gctttaagaa taagtacagt aagtgctcat gatgaaactg caggtgtaac tacacttaag 1441 ggtgcaatgg ctgtgatgga tatagcagaa actgctatta ccaatcttga tcaaatcaga 1501 gcggatatag gttctgtgca aaatcaaatc acatcgacta taaacaacat tactgtaacc 1561 caggtaaatg ttaaatcagc agaatcacaa atcagagatg tagattttgc aagcgagagt 1621 gcaaattact ctaaagcaaa tatattggct caaagtggtt cttatgctat ggctcaagca 1681 aattcaagcc agcaaaatgt tttaagatta ctacagtag // LOCUS CHKLNKPA1 215 bp ds-DNA VRT 15-AUG-1990 DEFINITION Chicken cartilage link protein gene, exon 2. ACCESSION M35035 KEYWORDS cartilage link protein. SEGMENT 1 of 5 SOURCE Chicken (domesticus, strain White Leghorn) 9-day embryo DNA, clones lambda gLP532 and lambda gLP12.1. ORGANISM Gallus gallus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Aves; Neornithes; Neognathae; Galliformes; Phasianidae. REFERENCE 1 (bases 51 to 65 and 182 to 195) AUTHORS Kiss,I., Deak,F., Mestric,S., Delius,H., Soos,J., Dekany,K., Argraves,W.S., Sparks,K.J. and Goetinck,P. TITLE Structure of the chicken link protein gene: Exons correlate with the protein domains JOURNAL Proc. Natl. Acad. Sci. U.S.A. 84, 6399-6403 (1987) STANDARD simple staff_review REFERENCE 2 (bases 1 to 215) AUTHORS Kiss,I., Deak,F., Mestric,S., Delius,H., Soos,J., Dekany,K., Argraves,W.S., Sparks,K.J. and Goetinck,P. TITLE Structure of the chicken link protein gene: Exons correlate with the protein domains JOURNAL Unpublished (1990) STANDARD simple staff_review COMMENT Draft entry and computer-readable sequence for [1,2] kindly submitted by I.Kiss, 04-JUN-1990. FEATURES from to/span description pept 87 + 186 cartilage link protein, exon 2 (first expressed exon pre-msg < 1 > 215 cartilage link protein mRNA and introns IVS < 1 60 cartilage link protein intron A IVS 187 > 215 cartilage link protein intron B BASE COUNT 63 a 43 c 47 g 62 t ORIGIN 1 gaattccata aagggttcca aaaaattgat gagcctttct gttatgtgat gcccttacag 61 tgaagaagat tcttgtgact gtgaagatga caagtctact ctttctggtg ctgatttctg 121 tctgctgggc agaacctcat cctgacaact caagcctgga gcatgagagg attattcaca 181 tccaaggtaa ggaaatacat cagaaaacgc ctttt // LOCUS CHKLNKPA2 460 bp ds-DNA VRT 15-AUG-1990 DEFINITION Chicken cartilage link protein gene, exon 3. ACCESSION M35036 KEYWORDS cartilage link protein. SEGMENT 2 of 5 SOURCE Chicken (domesticus, strain White Leghorn) 9-day embryo DNA, clones lambda gLP39.13 and lambda gLP33.7. ORGANISM Gallus gallus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Aves; Neornithes; Neognathae; Galliformes; Phasianidae. REFERENCE 1 (bases 49 to 63 and 429 to 442) AUTHORS Kiss,I., Deak,F., Mestric,S., Delius,H., Soos,J., Dekany,K., Argraves,W.S., Sparks,K.J. and Goetinck,P. TITLE Structure of the chicken link protein gene: Exons correlate with the protein domains JOURNAL Proc. Natl. Acad. Sci. U.S.A. 84, 6399-6403 (1987) STANDARD simple staff_review REFERENCE 2 (bases 1 to 460) AUTHORS Kiss,I., Deak,F., Mestric,S., Delius,H., Soos,J., Dekany,K., Argraves,W.S., Sparks,K.J. and Goetinck,P. TITLE Structure of the chicken link protein gene: Exons correlate with the protein domains JOURNAL Unpublished (1990) STANDARD simple staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by I.Kiss, 04-JUN-1990. FEATURES from to/span description pept + 59 + 433 cartilage link protein, exon 3 pre-msg < 1 > 460 cartilage link protein mRNA and introns IVS < 1 58 cartilage link protein intron B IVS 434 > 460 cartilage link protein intron C BASE COUNT 145 a 90 c 113 g 112 t ORIGIN 1 tctgtaaaag gtggagtgca gactaattct cctttttgtt tttctccttg aattgtagaa 61 gaaaatggac cccgcctact tgtggtagca gaacaagcta agatcttctc tcagcgaggt 121 ggcaacgtca cactgccttg taaattttac catgaacaca catcaacagc tggctcagga 181 acccacaaaa tccgggtcaa gtggaccaaa ctcacctcag attacctcaa agaagtggat 241 gtctttgtcg caatgggaca ccacagaaag agctacggaa agtatcaggg cagagtgttt 301 ctgagggaaa gcagtgagaa cgatgcctct cttataatca cgaatataat gctggaggat 361 tatgggagat acaagtgcga agtgattgaa ggattagagg acgacacagc agtggtagct 421 ctgaatttgg aaggtaggta acatctaatg tagacttaaa // LOCUS CHKLNKPA3 427 bp ds-DNA VRT 15-AUG-1990 DEFINITION Chicken cartilage link protein gene, exon 4. ACCESSION M35037 KEYWORDS cartilage link protein. SEGMENT 3 of 5 SOURCE Chicken (domesticus, strain White Leghorn) 9-day embryo DNA, clones lambda gLP33.7 and lambda gLP10.1. ORGANISM Gallus gallus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Aves; Neornithes; Neognathae; Galliformes; Phasianidae. REFERENCE 1 (bases 44 to 58 and 352 to 365) AUTHORS Kiss,I., Deak,F., Mestric,S., Delius,H., Soos,J., Dekany,K., Argraves,W.S., Sparks,K.J. and Goetinck,P. TITLE Structure of the chicken link protein gene: Exons correlate with the protein domains JOURNAL Proc. Natl. Acad. Sci. U.S.A. 84, 6399-6403 (1987) STANDARD simple staff_review REFERENCE 2 (bases 1 to 427) AUTHORS Kiss,I., Deak,F., Mestric,S., Delius,H., Soos,J., Dekany,K., Argraves,W.S., Sparks,K.J. and Goetinck,P. TITLE Structure of the chicken link protein gene: Exons correlate with the protein domains JOURNAL Unpublished (1990) STANDARD simple staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by I.Kiss, 04-JUN-1990. FEATURES from to/span description pept + 54 + 356 cartilage link protein, exon 4 pre-msg < 1 > 427 cartilage link protein mRNA and introns IVS < 1 53 cartilage link protein intron C IVS 357 > 427 cartilage link protein intron D BASE COUNT 99 a 108 c 105 g 115 t ORIGIN 1 aaaaaccctt ctagtgggga ttacccccag ctcacctctt tttgccattt caggtgttgt 61 tttcccctat tctccacgtc tgggtcgtta caacctaaac ttccatgagg ctcagcaagc 121 ttgcctggac caggactcca tcattgcctc cttcgaccag ctctacgagg cctggaggtc 181 agggctggac tggtgcaatg ctggctggct cagtgatggt tcagtgcagt accctatcac 241 caagcccaga gagccctgtg gagggaagaa tacggtgccc ggtgtcagaa actatggctt 301 ctgggataaa gagaggagcc gatatgatgt tttctgcttt acttcaaact tcaatggtaa 361 gaacctggtt tacatttacc ttgcaagggt ctttttccat gctttaaaaa gaaagagatg 421 ccagcgg // LOCUS CHKLNKPA4 826 bp ds-DNA VRT 15-AUG-1990 DEFINITION Chicken cartilage link protein gene, exon 5. ACCESSION M35038 KEYWORDS cartilage link protein. SEGMENT 4 of 5 SOURCE Chicken (domesticus, strain White Leghorn) 9-day embryo DNA, clones lambda gLP10.1 and lambda gLP39.23. ORGANISM Gallus gallus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Aves; Neornithes; Neognathae; Galliformes; Phasianidae. REFERENCE 1 (bases 15 to 29) AUTHORS Kiss,I., Deak,F., Mestric,S., Delius,H., Soos,J., Dekany,K., Argraves,W.S., Sparks,K.J. and Goetinck,P. TITLE Structure of the chicken link protein gene: Exons correlate with the protein domains JOURNAL Proc. Natl. Acad. Sci. U.S.A. 84, 6399-6403 (1987) STANDARD simple staff_review REFERENCE 2 (bases 1 to 826) AUTHORS Kiss,I., Deak,F., Mestric,S., Delius,H., Soos,J., Dekany,K., Argraves,W.S., Sparks,K.J. and Goetinck,P. TITLE Structure of the chicken link protein gene: Exons correlate with the protein domains JOURNAL Unpublished (1990) STANDARD simple staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by I.Kiss, 04-JUN-1990. FEATURES from to/span description pept + 25 314 cartilage link protein, exon 5 pre-msg < 1 > 826 cartilage link protein mRNA and introns IVS < 1 24 cartilage link protein intron D signal 786 792 AATAAA sequence BASE COUNT 262 a 166 c 162 g 236 t ORIGIN 1 atggctccct ccgtctctcc ccaggtcgtt tttactacct aatacaccca accaagctga 61 cctatgatga agccgtgcag gcctgcctga aggatggcgc tcagattgcc aaggttgggc 121 agatattcgc tgcctggaag ctccttggtt atgaccgctg tgatgccggc tggctggcag 181 acggcagcgt ccgctacccc atctccagac ccagaaagcg ctgcagcccc aacgaggctg 241 ccgtccgctt tgtaggcttt cctgataaaa agcacaagct gtatggtgtc tactgtttca 301 gagcttacaa ctgaaaatac ctagagctgc aacagtcttt aattcattaa gaacatgtga 361 aatatttcga tatgaactcg tgcaagttac caaaactgtg ataaaccttt cttacttact 421 gtagagtcat tttcataaac caaaaccatt aatttgtttt tgtttctgtt taaatatttt 481 tgtaaaagta tcattccata gatatttaaa aataatataa gtttaatgga agctctaggt 541 aagaagagcc aaattcttta agctacgtca tcccaacaaa atataatttt catgaatggg 601 gcatgcaata gagcttgaca attgctagga cacaattatg gaatgtaagg ctactcaaag 661 cagaagcttt taaaagcaca aattttacat gtttgtaccc gtttgagata cacagcaaat 721 tgattgtatc tggagttttg aattaagatg tttttgttta taggggtcag tgaggttttg 781 caaaaaataa aaattaaaaa aaaaaaaaaa aaaaaaaaag gccgcc // LOCUS CHKLNKPA5 217 bp ds-DNA VRT 15-AUG-1990 DEFINITION Chicken cartilage link protein gene, exon 6. ACCESSION M35039 KEYWORDS cartilage link protein. SEGMENT 5 of 5 SOURCE Chicken (domesticus, strain White Leghorn) 9-day embryo DNA, clones lambda gLP10.1 and lambda gLP39.23. ORGANISM Gallus gallus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Aves; Neornithes; Neognathae; Galliformes; Phasianidae. REFERENCE 1 (bases 1 to 217) AUTHORS Kiss,I., Deak,F., Mestric,S., Delius,H., Soos,J., Dekany,K., Argraves,W.S., Sparks,K.J. and Goetinck,P. TITLE Structure of the chicken link protein gene: Exons correlate with the protein domains JOURNAL Unpublished (1990) STANDARD simple staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by I.Kiss, 04-JUN-1990. FEATURES from to/span description pre-msg < 1 217 cartilege link protein mRNA and intron signal 44 49 poly-A signal signal 98 103 poly-A signal BASE COUNT 102 a 24 c 22 g 69 t ORIGIN 1 tataatattt aatatttctt aagctattta cacatcacaa gaaaataaaa aattggaaaa 61 aaaaatcaaa tgatcaagtc ttagaagaag attattgaat aaaatctgaa accagctatt 121 aaggtttaga agagaagaag tactttattt ccttacatct tatctgtatc taaatataca 181 tctgtttttt aaactatcaa tgaaaaaaaa aaaaaaa // LOCUS CHTCRPA 3012 bp ds-DNA BCT 15-AUG-1990 DEFINITION C.trachomatis 9-kD and 60-kD cysteine-rich and 15 kD serine-rich outer membrane protein genes, complete cds. ACCESSION M35148 M23180 M35161 KEYWORDS cysteine-rich outer membrane protein; serine-rich outer membrane protein. SOURCE C.trachomatis (serovar L1) DNA. ORGANISM Chlamydia trachomatis Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Rickettsias and Chlamydias; Chlamydiales; Chlamydiaceae. REFERENCE 1 (bases 1 to 753 and 1715 to 2577) AUTHORS Lambden,P.R., Everson,J.S., Ward,M.E. and Clarke,I.N. TITLE Sulfur-rich proteins of Chlamydia trachomatis: Developmentally regulated transcription of polycistronic mRNA from tandem promoters JOURNAL Gene 87, 105-112 (1990) STANDARD simple staff_review REFERENCE 2 (bases 483 to 3012) AUTHORS Clarke,I.N., Ward,M.E. and Lambden,P.R. TITLE Molecular cloning and sequence analysis of a developmentally regulated cysteine-rich outer membrane protein from Chlamydia trachomatis JOURNAL Gene 71, 307-314 (1988) STANDARD simple staff_review FEATURES from to/span description pept 185 451 9-kDa cysteine-rich outer membrane protein pept 703 2259 60-kDa cysteine-rich outer membrane protein precursor sigp 703 735 60-kD serine-rich outer membrane protein signal peptide matp 736 2256 60-kDa cysteine-rich outer membrane protein pept 2437 2889 15-kDa serine-rich outer membrane protein mRNA 93 2296 CrP operon mRNA (alt.) mRNA 159 2296 CrP operon mRNA (alt.) mRNA 160 2296 CrP operon mRNA (minor alt.) mRNA 2406 2965 SrP mRNA BASE COUNT 898 a 537 c 678 g 899 t ORIGIN 1 tttgtttgct ttgatttgct aattacctgt tattagacga tttgttttaa aaaacaattg 61 atataatttt tattttataa tgtaatattg tctatgaggg ctagtttctt ttattattaa 121 aagaattgct tttatcgata aaagaaactt caagagccct tttctagaaa ggagtctgga 181 agttatgaaa aaaactgctt tactcgctgc tttatgtagt gttgtttctt taagtagttg 241 ttgtcgtatc gttgactgtt gcttcgaaga tccatgcgca cctatccaat gttcaccttg 301 tgaatctaag aagaaagacg tagacggtgg ttgcaactct tgtaacgggt atgtcccagc 361 ttgcaaacct tgcggagggg atacgcacca agatgctgaa catggccctc aagctagaga 421 aattccagtt gacggcaaat gcagacaata ggtagcgcaa gttaagagcc tacccacaac 481 agatgtagtt agtaaggaag ttggcttcct tactaactat ttcggctaac aagaaaatgt 541 tgagggtaaa agttagttaa taacaatttc tacccgatgg cagacaaaaa ataatctatg 601 cgaataggag atcctatgaa caaactcatc agacgagcag tgacgatctt cgcggtgact 661 agtgtggcga gtttatttgc tagcggggtg ttagagacct ctatggcaga gtttatctct 721 acaaacgtta ttagcttagc tgacaccaaa gcgaaagaca acacttctca taaaagcaaa 781 aaagcaagaa aaaaccacag caaagagact cccgtaaacc gtaaaaaggt tgctccggtt 841 catgagtcta aagctacagg acctaaacag gattcttgct ttggcagaat gtatacagtc 901 aaagttaatg atgatcgtaa tgttgaaatc acacaagctg ttcctaaata tgctacggta 961 ggatctccct atcctgttga aattactgct acaggtaaaa gggattgtgt tgatgttatc 1021 attactcagc aattaccatg tgaagcagag ttcgtacgca gtgatccagc gacaactcct 1081 actgctgatg gtaagctagt ttggaaaatt gaccgcttag gacaaggcga aaagagtaaa 1141 attactgtat gggtaaaacc tcttaaagaa ggttgctgct ttacagctgc aacagtatgc 1201 gcttgtccag agatccgttc ggttacaaaa tgtggacaac ctgctatctg tgttaaacaa 1261 gaaggcccag agaatgcttg tttgcgttgc ccagtagttt acaaaattaa tgtagtgaac 1321 caaggaacag caacagctcg taacgttgtt gttgaaaatc ctgttccgga tagttacgct 1381 cattcttctg gacagcgtgt actaacgttt actcttggag atatgcaacc tggagagcac 1441 agaacaatta ctgtagagtt ttgtccgctt aaacgtggtc gtgctaccaa tatagcaatg 1501 gtttcttact gtggaggaca taaaaataca gcaagcgtaa caactgtgat caacgagcct 1561 tgcgtacaag taagtattgc aggagcagat tggtcttatg tttgtaagcc tgtagaatat 1621 gtgatctccg tttccaatcc tggagatctt gtgttgcgag atgtcgtcgt taaagacact 1681 ctttctcccg gagtcacagt tcttgaagct gcaggagctc aaatttcttg taataaagta 1741 gtttggactg tgaaagaact gaatcctgga gagtctctac agtataaagt tctagtaaga 1801 gcacaaactc ctggacaatt cacaaataat gttgttgtga agagctgctc tgactgtggt 1861 acttgtactt cttgcgcaga agcgacaact tactggaaag gagttgctgc tactcatatg 1921 tgcgtagtag atacttgtga ccctgtttgt gtaggagaaa atactgttta ccgtatttgt 1981 gtcaccaaca gaggttctgc agaagataca aatgtttctt taatgcttaa attctctaaa 2041 gaactgcaac ctgtatcctt ctctggacca actaaaggaa cgattacagg caatacagta 2101 gtattcgatt cgttacctag attaggttct aaagaaactg tagagttttc tgtaacattg 2161 aaagcagtat cagctggaga tgctcgtggg gaagcgattc tttcttccga tacattgact 2221 gttccagttt ctgatacaga gaatacacac atctattaat ctttgatttt atcgatgtgt 2281 aggtgccgtc cagggattcc tgggcggctt tttttgttat ctatatgaaa ataaaagagt 2341 tcattttcgt tctcagagca tattctagat gggtttttga aaaaaataag tgtttgtgta 2401 gactccctgc tcacaaccaa aaaaggaatg taaaatatga gcactgtacc cgttgttcaa 2461 ggagctggat cttccaattc ggcacaggat atttccacta gttctgtacc attaacactg 2521 caagggcgta tatcgaatct tctatcttcc actgcattta aggtgggatt agtggtgatg 2581 ggactacttt tagtgatggc tacgatattc ctagtttcgg cagcttcgtt tgtaaatccc 2641 atctatctag ctattcctgc tattgtggga tgcgtgaata tctgcgtagg aattttatcc 2701 atggaaggat actgttctcc ggagagatgg agcttatgta agaaggtatt aaaggcttca 2761 gaagatatca tcgatgatgg gcagataaac aactctaata aagtgtttac tgatgagagg 2821 ttgaatgcca taggtggggt agtggaatct ctatctagaa gaaatagtct ggtggatcag 2881 acccaatgat aagagattgc tctataggca aaagatgata gcggcagttt ttatggatga 2941 tctgctgaca gatgatgtat ggaaagggag gaggaaagag tcctcctccc agattttatt 3001 gagctggagt tt // LOCUS DDIGP80A 1545 bp ss-mRNA INV 15-AUG-1990 DEFINITION D.discoideum membrane-associated glycoprotein (gp80) mRNA, complete cds. ACCESSION M36545 KEYWORDS gp80 gene; membrane-associated glycoprotein. SOURCE D.discoideum, cDNA to mRNA. ORGANISM Dictyostelium discoideum Eukaryota; Animalia; Protozoa; Sarcomastigophora; Sarcodina; Rhizopoda; Eumycetozoa; Dictyostelia; Dictyosteliida; Dictyosteliidae. REFERENCE 1 (bases 1 to 1545) AUTHORS Siu,C.-H., Wong,L.M., Lam,T.Y., Kamboj,R.K., Choi,A. and Cho,A. TITLE Molecular mechanisms of cell-cell interaction in Dictyostelium discoideum JOURNAL Biochem. Cell Biol. 66, 1089-1099 (1988) STANDARD simple staff_review FEATURES from to/span description pept 1 1545 membrane-associated glycoprotein (gp80) precursor sigp 1 48 membrane-associated glycoprotein (gp80) signal peptide matp 49 1542 membrane-associated glycoprotein (gp80) BASE COUNT 502 a 332 c 209 g 502 t ORIGIN 1 atgaaatttt tattagtatt gataatatta tataatattt taaatagtgc acattcagct 61 ccaacaataa cagctgtttc aaatggaaaa tttggtgttc caacatatat taccattaca 121 ggtactggat ttacaggaac tccagttgta actattggtg gccagacctg tgatccagtt 181 attgtagcca ataccgcatc gttacaatgc caattttctg ctcaattagc tccaggaaat 241 tcaaattttg atgttattgt aaaggttggt ggtgtaccat ctacaggtgg taatggtctt 301 tttaaatata cacctccaac tctttcaaca atatttccaa ataatggaag aattggtatg 361 attttagttg atggaccatc caatatatct ggatacaaat taaatgtgaa cgactctatt 421 aactctgcta tgttatctgt tactgctgat tcagtatccc caacaattta tttcctcgtg 481 ccaaatacaa tcgctggtgg tctacttaat cttgaactca ttcaaccatt tggcttttca 541 acaattgtaa cttccaaatc agtgttttct ccaaccatta catcaatcac cccattagct 601 tttgatctca caccaaccaa tgtaaccgtc actggtaaat actttgttac tacagctagt 661 gttacaatgg gaagtcatat ctatacagga ttgactgttc aagatgatgg aacaaattgt 721 catgttattt ttactactcg ttcagtttat gaatcatcaa atactataac tgctaaagct 781 tcaacaggtg tcgatatgat ttatttagac aatcaaggta atcaacaacc aataactttt 841 acatataacc caccaaccat tacttcaaca aaacaagtca atgactctgt tgagatctca 901 acaaccaata ctggtactga tttcactcaa atttctttaa ccatgggaac ctcaagccca 961 acaaaccttg taatcactgg tacaaatgaa aagattgtta taactcttcc acatgctctt 1021 ccagaaggtg aaattcaatt caatttgaaa gctggtatct caaatgttgt cacatcaact 1081 ttattagtta ctccggttat aaatagtgtc actcaagcac ctcacaatgg tggaagtatt 1141 acaatttcag gtatcttttt aaacaatgcc catgtttcga ttgttgttga ccaaaatact 1201 actgatatag tttgtgctcc agattcaaat ggtgaatcaa tcatttgtcc agttgaagct 1261 ggtagtggta ctattaattt agtcgttaca aactataaaa actttgcttc agatccaact 1321 attaaaactg aagccacaac ctctacaacc tatacaattc cagacactcc aactccaact 1381 gatacagcca ccccatctcc aactccaact gaaacagcca ccccatctcc aactccaaaa 1441 ccaaccagca caccagaaga aactgaagca ccttcatcag caacaactct tatttcacca 1501 ttatctttaa ttgttatttt catttctttt gttttattaa tttaa // LOCUS ECOMANXF 1474 bp ds-DNA BCT 15-AUG-1990 DEFINITION E.coli enzyme III-Man function protein (manX (ptsL)) gene, complete cds, and manY (pel) gene, 5' end. ACCESSION M36404 KEYWORDS enzyme III-Man function protein; manX gene; manY gene; pel gene; ptsL gene. SOURCE E.coli (strain K12) DNA. ORGANISM Escherichia coli Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Facultatively anaerobic rods; Enterobacteriaceae. REFERENCE 1 (bases 1 to 1474) AUTHORS Saris,P.E.J., Liljestroem,P. and Palva,E.T. TITLE Nucleotide sequence of manX (ptsL) encoding the enzyme III-Man (II-a-Man) function in the phosphotransferase system of Escherichia coli K-12 JOURNAL FEMS Microbiol. Lett. 49, 69-73 (1988) STANDARD simple staff_review FEATURES from to/span description pept 258 1205 enzyme III-Man function protein (manX (ptsL)) pept 1268 > 1474 manY (pel) gene product mRNA 120 > 1474 manXYZ operon mRNA (5' end put.) BASE COUNT 411 a 319 c 376 g 368 t ORIGIN 1 cctttgcaaa cgaatgtgac aaggatattt tacctttcga aatttctgct aatcgaaagt 61 taaattacgg atcttcatca cataaaataa ttttttcgat atctaaaata aatcgcgaaa 121 cgcaggggtt tttggttgta gcccttatct gaatcgattc gattgtggac gacgattcaa 181 aaatacatct ggcacgttga ggtgttaacg ataataaagg aggtagcaag tgaccattgc 241 tattgttata ggcacacatg ggttggggct gcagagcagg ttgcttaaaa cggcagaaag 301 tgctgttagg cgagcaggaa aacgtcggct ggatcaattt cgttccaggt gaaaatgccg 361 aaacgctgat tgaaaagtac aacgctcagt tggcaaaact cgacaccact aaaggcgtgc 421 tgtttctcgt tgatacatgg ggaggcagcc cgttcaatgc tgccagccgc attgtcgtcg 481 acaaagagca ttatgaagtc attgcaggcg ttaacattcc aatgctcgtg gaaaggttaa 541 tggcccgtga tgatgaccca agctttgatg aactggtggc actggcagta gaaacaggcc 601 gtgaaggcgt gaaagcactg aaagccaaac cggttgaaaa agccgcgcca gcacccggtg 661 ccgcagcacc aaaagcggct ccaactccgg caaaaccaat ggggccaaac gactacatgg 721 ttattggcct tgcgcgtatc gacgaccgtc tgattcacgg tcaggtcgcc acccgctgga 781 ccaaagaaac caatgtctcc cgtattattg ttgttagtga tgaagtggct gcggataccg 841 ttcgtaagac actgctcacc caggttgcac ctccgggcgt aacagcacac gtagttgatg 901 ttgccaaaat gattcgcgtc tacaacaacc cgaaatatgc tggcgaacgc gtaatgctgt 961 tatttaccaa cccaacagat gtagagcgtc tcgttgaagg cggcgtgaaa atcacctctg 1021 ttaacgtcgg tggtatggca ttccgtcagg gtaaaaccca ggtgaataac gcggtttcgg 1081 ttgatgaaaa agatatcgag gcgttcaaga aactgaatgc gcgcggtatt gagctggaag 1141 tccgtaaggt ttccaccgat ccgaaactga aaatgatgga tctgatcagc aaaatcgata 1201 agtaacgtat tgtgttgatt atcactcagt tttcacactt aagtcttacg taaacaggag 1261 aagtacaatg gagattacca ctcttcaaat tgtgctggta tttatcgtag cctgtatcgc 1321 aggtatggga tcaatcctcg atgaatttca gtttcaccgt cctctaatcg cgtgtaccct 1381 ggtgggctat cgttcttggg gatatgaaaa ccggtattat tatcggtggt acgctggaaa 1441 tgatcgcgct gggctggatg aacatcggtg ctgc // LOCUS FSCCKPA 1428 bp ss-mRNA VRT 15-AUG-1990 DEFINITION T.californica creatine kinase mRNA, complete cds. ACCESSION M36427 KEYWORDS creatine kinase. SOURCE T.californica electric organ, cDNA to mRNA, clone CK52g8. ORGANISM Torpedo californica Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Chondrichthyes; Elasmobranchii; Euselachii; Neoselachii; Squalomorphii; Torpediniformes; Torpedinoidea; Torpedinidae. REFERENCE 1 (bases 1 to 1428) AUTHORS West,B.L., Babbitt,P.C., Mendez,B. and Baxter,J.D. TITLE Creatine kinase protein sequence encoded by a cDNA made from Torpedo californica electric organ mRNA JOURNAL Proc. Natl. Acad. Sci. U.S.A. 81, 7007-7011 (1984) STANDARD simple staff_review FEATURES from to/span description pept 90 1235 creatine kinase (E.C. 2.7.3.2) BASE COUNT 348 a 398 c 394 g 288 t ORIGIN 1 ggtcacccac accagcggta gttccagcac caagcaggac aaggtccaga gtggttcacc 61 gtgcgccagg agtcagccaa cctccaacca tgcctttcgg aaacactcac aataaatgga 121 agctgaacta ttcggcggcg gaagaattcc ccgacctcag caagcacaac aaccacatgg 181 ccaaggcttt aaccctggac atctacaaga aacttcggga caaggagact ccaagtggct 241 tcaccctcga tgatatcatc cagacaggag tggacaaccc aggtcacccc ttcatcatga 301 ccgtgggctg cgtggctggc gatgaggaat gctacgaggt tttcaaggac ctgttcgatc 361 ccgtcattga ggaccgccac ggtggctaca aaccaactga caagcacaag actgacctga 421 accaggagaa cctgaagggc ggcgatgacc tcgacccgaa ttacgtcctg agcagccggg 481 tgcgcactgg ccgcagcatc aagggcatcg ccctgcctcc tcactgcagc cgcggggagc 541 gccgtctggt tgagaagctc tgcatagacg gtctcgccac cttgacgggc gagttccagg 601 gcaagtacta ccccctctcc tccatgtctg atgcagagca gcagcagctg atcgatgacc 661 acttcctgtt tgacaaaccc atctctcctc tgcttctcgc ctctggcatg gctcgggact 721 ggcccgatgg ccggggcatt tggcataaca acgacaagac cttcctggtc tgggtcaacg 781 aggaggacca cctccgagtc atctcgatgc agaaaggtgg caacatgaag gaggtcttca 841 ggcgcttctg cgttggtctg aagaagatcg aggacatttt cgtgaaggct ggccgtggct 901 tcatgtggaa cgagcacctg ggctacgtcc tgacctgccc gtccaacctg ggcactggcc 961 tccgtggtgg tgtccacgtg aaaatccctc acctctgcaa gcacgagaag ttcagcgagg 1021 tcctcaagag aacgaggctg cagaaacgtg ggacaggtgg agtggatacc gcagcggttg 1081 gcagcatcta tgacatctcc aacgccgacc gtctgggctt ctccgaggtg gaacaggtcc 1141 agatggtggt ggacggtgtg aagctgatgg tcgagatgga gaagaggctg gaaaatggga 1201 aaagcatcga tgacctgatg ccggctcaga agtagacctt gggttggctg ggtgcctgcc 1261 actctgagat gccttgaaat atcacaggtc gcgaactttg aactttccca ctccaatctt 1321 tcttggccac agatctcgtg tctcaaatga ggaagcagaa ggtttggttt catcacattc 1381 agatttgcta gacacaattt taaccttgat gacacattaa taaaatat // LOCUS HUMLBPP2A 1541 bp ss-mRNA PRI 15-AUG-1990 DEFINITION Human phosphatase 2A-beta catalytic subunit mRNA, complete cds. ACCESSION M36511 KEYWORDS phosphatase 2A-beta catalytic subunit. SOURCE Human lung fibroblast, cDNA to mRNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 1541) AUTHORS Hemmings,B.A., Wernet,W., Mayer,R., Maurer,F., Hofsteenge,J. and Stone,S.R. TITLE The nucleotide sequence of the cDNA encoding the human lung protein phosphatase 2A-beta catalytic subunit JOURNAL Nucleic Acids Res. 16, 11366-11366 (1988) STANDARD simple staff_review FEATURES from to/span description pept 22 951 phosphatase 2A-beta catalytic subunit BASE COUNT 436 a 296 c 327 g 482 t ORIGIN 1 ccgagcccca gcccggccgc catggacgac aaggcgttca ccaaggagct ggaccagtgg 61 gtcgagcagc tgaacgagtg taagcagctg aacgagaacc aagtgcggac gctgtgcgag 121 aaggcaaagg aaattttaac aaaagaatca aatgtgcaag aggttcgttg ccctgttact 181 gtctgtggag atgtgcatgg tcaatttcat gatcttatgg aactctttag aattggtgga 241 aaatcaccgg atacaaacta cttattcatg ggtgactatg tagacagagg atattattca 301 gtggagactg tgactcttct tgtagcatta aaggtgcgtt atccagaacg cattacaata 361 ttgagaggaa atcacgaaag ccgacaaatt acccaagtat atggctttta tgatgaatgt 421 ctgcgaaagt atgggaatgc caacgtttgg aaatatttta cagatctctt tgattatctt 481 ccacttacag ctttagtaga tggacagata ttctgcctcc atggtggcct ctctccatcc 541 atagacacac tggatcatat aagagccctg gatcgtttac aggaagttcc acatgagggc 601 ccaatgtgtg atctgttatg gtcagatcca gatgatcgtg gtggatgggg tatttcacca 661 cgtggtgctg gctacacatt tggacaagac atttctgaaa cctttaacca tgccaatggt 721 ctcacactgg tttctcgtgc ccaccagctt gtaatggagg gatacaattg gtgtcatgat 781 cggaatgtgg ttaccatttt cagtgcaccc aattactgtt atcgttgtgg gaaccaggct 841 gctatcatgg aattagatga cactttaaaa tattccttcc ttcaatttga cccggcgcct 901 cgtcgtggtg agcctcatgt tacacggcgc accccagact acttcctata aatttctcct 961 gggaaacctg cctttgtatg tggaagtata cctggctttt taaaatatat gtatttaaaa 1021 acaaaaagca acagtaatct atgtgtttct gtaacaaatt gggatctgtc ttggcattaa 1081 accacatcat ggaccaaatg tgccatacta atgatgagca tttagcacaa tttgagactg 1141 aaatttagta cactatgttc tagataggtc agtctaacag tttgcctgct gtatttatag 1201 taaccatttt cctttggact gttcaagcaa aaaaggtaac taactgcttc atctcctttt 1261 gcgcttattt ggaaatttta gttatagtgt ttaactggca tggattaata gagttggagt 1321 tttattttta agaaaaattc acaagctaac ttccactaat ccattatcct ttattttatt 1381 gaaatgtata attaacttaa ctgaagaaaa ggttcttctt gggagtatgt tgtcataaca 1441 tttaaagaga tttcccttca tttaaactaa attactgttt tatgttgatc tgcatatttc 1501 tgtatatttg tcatgacagt gcttgcatcc tatttggtgt g // LOCUS HUMPDEGA 978 bp ss-mRNA PRI 15-AUG-1990 DEFINITION Human cGMP phosphodiesterase gamma-subunit (PDEG) mRNA, complete cds. ACCESSION M36476 KEYWORDS cGMP phosphodiesterase gamma-subunit. SOURCE Human retina, cDNA to mRNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 978) AUTHORS Tuteja,N., Danciger,M., Klisak,I., Tuteja,R., Inana,G., Mohandas,T., Sparkes,R.S. and Farber,D.B. TITLE Isolation and characterization of cDNA encoding the gamma-subunit of cGMP phosphodiesterase in human retina JOURNAL Gene 88, 227-232 (1990) STANDARD simple staff_review FEATURES from to/span description pept 102 365 cGMP phosphodiesterase gamma-subunit (PDEG) mRNA < 1 978 PDEG mRNA BASE COUNT 213 a 341 c 257 g 167 t ORIGIN 1 ccgcactcac agcacagccc cctgagaccc gccctgcact tgaccgcagc aggagggagt 61 ccaggagcca aggttgccgc ggtgtctccg tcagcctcac catgaacctg gaaccgccca 121 aggctgagtt ccggtcagcc accagggtgg ccgggggacc tgtcaccccc aggaaagggc 181 cccctaaatt taagcagcga cagaccaggc agttcaagag caagccccca aagaaaggcg 241 ttcaagggtt tggggacgac atccctggaa tggaaggcct gggaacagac atcacagtca 301 tctgcccttg ggaggccttc aaccacctgg agctgcacga gctggcccaa tatggcatca 361 tctagcacga ggcccctgct gaagtccaga ccctccccct cctgcccact atgctaaacc 421 ctgctcagga ttcctgttga ggagatgacc tccctagccc cagatggcac ctggacacca 481 ggatgggact gcaacctcag gtctccccct acatattaat accagtcacc aggagcccac 541 cacctccctc taggatgccc cctcagggtg gccaggccct gctcaacatc tggagacaca 601 ggcccacccc tcagtcctgc ccacagagag gcttggtcgg tctccactcc cagggagaac 661 gggaagtgga ccccagcccg ggagcctgct ggaccccaga tcgtcccctc ctcccagctg 721 gaaagctagg gcaggtctcc ccagagtgct tctgcacccc agccccctgt cctgcctgta 781 aggggataca gagaagctcc ccgtctctgc atcccttccc aggggggtgc ccttagtttg 841 gacatgctgg gtagcaggac tccagggcgt gcacggtgag cagatgaggc cccaagctca 901 tcacaccagg gggccatcct tctcaataca gcccgccctt gcagtcccta tttcaaaata 961 aaattagtgt gtccttgc // LOCUS HUMSON3A 1449 bp ds-DNA PRI 15-AUG-1990 DEFINITION Human son3 protein gene, partial cds. ACCESSION M36428 KEYWORDS son3 protein. SOURCE Human placenta DNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 1449) AUTHORS Berdichevskii,F.B., Chumakov,I.M. and Kiselev,L.L. TITLE Determination of the nucleotide sequence of the son3 fragment of the human genome: Identification of a new protein with an unusual structure and homology with DNA-binding proteins JOURNAL Mol. Biol. 22, 639-646 (1988) STANDARD simple staff_review FEATURES from to/span description pept < 1 > 1449 son3 protein (AA at 1) BASE COUNT 487 a 348 c 329 g 285 t ORIGIN 1 cgggctctgc tcagccctaa agaaagtagt ggaggagaaa aagaagtacc tccccctcct 61 aaagagacac tgcctgattc aggattttct gccaatattg aggatattaa tgaagcagat 121 ttagtgagac cgttacttcc taaggacatg gaacgtctta caagccttag agctggcatt 181 gaaggacctt tacttgcaag tgatgttgga cgtgacagat ctgctgccag cccggttgta 241 agtagtatgc cagaaagagc ttcagagtct tcttcagagg aaaaagatga ttatgaaatt 301 tttgtaaaag ttaaggacac tcacgaaaaa agcaagaaaa ataagaaccg tgataagggg 361 gagaaagaga agaaaagaga tcctcattta agatctcgaa gtaagcgttc caaatcttct 421 gaacacaaat cacgcaagcg taccagtgaa tctcgttcta gggcaagaaa gagatcatct 481 aagtccaagt ctcatcgctc tcagacacgt tcacggtcac gttcaagacg caggaggaga 541 agcagcagat caagatcaaa gtctagagga agaagatctg tatcaaaaga gaagcgcaaa 601 agatctccaa agcacagatc caagtctagg gaaagaaaaa gaaaaagatc aagctccagg 661 gataaccgaa agacagttag agctcgaagt cgaaccccaa gtcgtcggag tcggagtcat 721 actccaagtc gtcgacgaag gtctagatct gtgggtagaa gaaggagctt tagcatttcc 781 ccaagccgcc gcagccgcac ccccagccgc cgcagccgca cccccagccg ccgcagccgc 841 acccccagcc gccgcagccg cacccccagc cgccggagcc gcacccctag ccgtcggagc 901 cgcaccccaa gccgccggag aagatcaagg tctgtggtaa gaagacgaag cttcagtatc 961 tcaccagtca gattaaggcg atcaagaaca cccttaagaa gaaggtttag cagatctccc 1021 atccgtcgta aaagatccag gtcttctgaa cgaggcagat cacccaaacg tctgacagat 1081 ttggataagg ctcaattact tgaaatagcc aaagctaatg cagctgccat gtgtgctaag 1141 gctggtgtcc ctttaccacc aaacctaaag cctgcacctc cacctactat agaagagaaa 1201 gttgctaaaa agtcaggagg agctactata gaagaactaa ctgagaaatg taaacagatc 1261 gcacagagta aagaagatga tgatgtaata gtgaataaac ctcatgtttc ggatgaagag 1321 gaagaagaac ctccttttta tcatcatccc tttaaactca gtgaacccaa acctattttt 1381 ttcaatctga atattgctgc agcaaaacca actccaccaa aaagccaggt aacattaaca 1441 aaagaattc // LOCUS MYXGFA 2269 bp ds-DNA VRL 15-AUG-1990 DEFINITION Myxoma virus growth factor and M-T9 genes, complete cds. ACCESSION M15806 M35234 KEYWORDS M-T9 gene product; growth factor. SOURCE Myxoma virus (strain Lausanne) DNA, clone pMYH-1. ORGANISM Myxoma virus Viridae; ds-DNA enveloped viruses; Poxvirinae; Leporipoxvirus. REFERENCE 1 (bases 1 to 1421) AUTHORS Upton,C., Macen,J.L. and McFadden,G. TITLE Mapping and sequencing of a gene form myxoma virus that is related to those encoding epidermal growth factor and transforming growth factor alpha JOURNAL J. Virol. 61, 1271-1275 (1987) STANDARD full staff_review REFERENCE 2 (bases 584 to 2269) AUTHORS Upton,C., Macen,J.L., Wishart,D.S. and McFadden,G. TITLE Myxoma virus and malignant rabbit fibroma virus encode a serpi protein important for virus virulence JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Computer-readable sequence for [1] kindly provided by C.Upton, 09-MAY-1987. Draft entry and computer-readable sequence for [2] kindly submitted by C.Upton, 14-JUN-1990. Author address: C.Upton University of Alberta Dept of Biochemistry 471 Med Sci Bldg Edmonton Alberta, CANADA T6G 2H7 email: USERCU11@ualtamts FEATURES from to/span description pept 204 461 growth factor pept 717 2246 M9-R gene product BASE COUNT 685 a 441 c 540 g 603 t ORIGIN 1239 bp upstram of DdeI site; about 13 kb from 3' viral end. 1 ttaaacaaga tacaacatac ggacgcggct atgttctcgg aagtcataga cggtattgtc 61 gcggaagaac agcaggtgat tggatttatt cagaaaaaat gtaaatataa cacgacatac 121 tacaatgtac gtagcggcgg gtgtaaaata tccgtctatc taaccgcggc agttgttggc 181 tttgtcgcat acggaatact aaaatggtac cgagggacct agtcgcaact ctcttatgtg 241 cgatgtgtat tgtacaggca acgatgcctt cgttggataa ttatctgtat attattaaac 301 gtattaaact atgtaacgac gactataaaa actattgtct aaataacgga acctgtttca 361 ccgtagcatt aaacaatgtt tcacttaacc cgttttgtgc gtgtcatatt aactacgtgg 421 gaagccgatg tcagtttatt aatctaatta ccattaagta acccgtttta catgtataat 481 aatacatacg tatttttaga taactttaat aaataacatt gtataaactt acttatcata 541 tacggtacac ataacgaata acactacatg tttttatata tacataggtt tggaaaaaac 601 ttaatcacga acgtatcatt agacaatgac tccatctagg aggggttttg ggaactacgt 661 acacgatata ttcacatcgc gaaaacataa ataataattt tttacaacga ttcacgatgt 721 cgcgcacttt attgagattt ctggaagatg gtgcaatgag cgacgtaaca gtcgtcgccg 781 gggactcgac gtttctcggg cataaagtta ttttatctct tcactcggat tacttctatc 841 gtctgtttaa tggagacttt acctcgcccg atacggttac gctggacgcg acggacgatg 901 ccgttcgtac ggtgtttacg tatatgtacg cgggatgtga cgggttaaac gatcgtacga 961 tagacgattt acaatccatt atcgtattgg cggactacct gggtataacg aaactggtgg 1021 acgaatgcgt acgtcgtatc gtatctaaag tggacgtatt aaactgcgta ggggtatata 1081 cgtttgcgga gacgtatcat ataacggact tgcagcgggc ggccaaaacg tttttaacag 1141 aactactggg gtctaaagaa gcgttcgaag aactatccca agacgatgcg gttatcgcgt 1201 taagggaaac gcgtaacatt gtcgatagac gatccattct tagagcgatc ctgttatggg 1261 ttcgaaaatg tccagatcgt atcgaacaac taaaggtgtt agtcgccgcc gtagacgacg 1321 tagacgacga tgacaacgta tatacgatct acgagagata cgctgaagaa ctaaaggata 1381 tgatcgcgtg tccattatcc tataattgcg tcgttgtggt cgacagagat agatacgttc 1441 gcctcattaa cccagacacc ctatggagta aacgcgtgac gtacatacgt aaacgcgcca 1501 taggcgatcg attcaccgtc gtttgtatga acaacgttct atactgttta gggggtacgt 1561 tagacggggc acccacgtgt gacgtgttgg cctacgatct actgacgaac gaatacagtt 1621 taatgccgga gatgggacac tatagacgta atgcgtcggc gtgtatcgta aatggatata 1681 tatacgtcgt aggaggcgta gacgaagaaa acagattaat cggttccgta gagtactggc 1741 aacccggaat ggaggaatgg cacgacgctc cttatctaca ggcgaacgta gaaacggcta 1801 cggtgtgtta caggaacgag ttgtggatcg taggaggcac cgtggactta tatcatccca 1861 cgtttataag cgcagttaag aaattaacag acaatcgatg gatgtcgatg gaacctcttc 1921 ccgaaccacg atcgggtgct acgaccgtcg tgtataataa tcgattatac tgcataggcg 1981 gaaggataca cggtggcgcg tacacaaatc acgtctacaa ctatttagac gagtcacgta 2041 cgtgggaacg ggtaggggat atggcgaacg tacgcagaaa tcccagttgt tgtgtgtaca 2101 ataaggcgat ttacgtattg ggagggaata caaacgccgt agagaaatac aacgggtgga 2161 agtggcaaga ggtaggtaat atatccacgt atcccgcgtg taataatacc gcgtatccat 2221 ttttttatac caacgacgag atataaaacg agtatgatat acaagtcgt // LOCUS MYXMAP1A 2204 bp ds-DNA VRL 15-AUG-1990 DEFINITION Myxoma virus MAP1 gene, complete cds, and M-T8 gene, 5' end. ACCESSION M35233 KEYWORDS M-T8 gene product; MAP1 gene product serpi protein. SOURCE Myxoma virus (strain Lausanne) DNA, clone pBU-3. ORGANISM Myxoma virus Viridae; ds-DNA enveloped viruses; Poxvirinae; Leporipoxvirus. REFERENCE 1 (bases 1 to 2204) AUTHORS Upton,C., Macen,J.L., Wishart,D.S. and McFadden,G. TITLE Myxoma virus and malignant rabbit fibroma virus encode a serpi protein important for virus virulence JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by C.Upton, 14-JUN-1990. Author address: C.Upton University of Alberta Dept of Biochemistry 471 Med Sci Bldg Edmonton Alberta, CANADA T6G 2H7 email: USERCU11@ualtamts FEATURES from to/span description pept 363 1472 MAP1 gene product pept 1450 > 2204 M-T8 gene product BASE COUNT 592 a 520 c 569 g 523 t ORIGIN 1 ggatccgtaa caacacgtgt gtcgtagcgt atacataatg ccgtaaatga cagtcataaa 61 accatcgagt cgtcccaggc cgaggaaaaa caaaaatata aaagtaaata catacagaac 121 gagcgccatg gatctctctc cgggaagtgt ccacgagggt atcgtatatt ttaaagacgg 181 aatattcaaa gtccgcctac tcggatacga gggacacgag tgtattcttt tggactatct 241 gaactacagg caagacacgt tggatcggtt gaaggaacga ctcgtgggac gcgtgattaa 301 aacgcgagtc gttcgcgcgg acggtttata cgtggacctg cgacgttttt tttgagggtt 361 aaatgaagta tctggtcctc gtcttatgtt taacgtcgtg cgcgtgtcga gatatcggac 421 tatggacgtt ccgatacgtc tacaacgaaa gcgacaacgt cgtgttctca ccgtacggct 481 tgacctccgc gttgtccgtg ttacggatcg cggcgggcgg taacacgaaa cgagaaatag 541 acgtccccga atccgtcgtg gaggactccg acgcctttct cgcgttacgg gagttgttcg 601 tagacgcatc cgttccgtta cgtcccgagt ttacggcgga gttctcctcg cgattcaata 661 cctccgtgca acgcgtgacg tttaactcgg agaacgtcaa agacgtcatt aactcgtacg 721 ttaaggataa gacgggagga gacgtcccac gcgtattgga cgcctcccta gaccgagata 781 ctaaaatgct gctattgagc tccgttcgta tgaagacgag ctggagacac gtattcgacc 841 cttcgttcac gacggatcaa cctttttatt ccggaaacgt cacatacaag gtacgtatga 901 tgaataaaat agatacgttg aaaacggaga cgtttacgct tagaaacgtg ggatactccg 961 taacggaact gccgtataaa cggcgtcaaa cggccatgtt gctcgtcgtt ccggacgact 1021 tgggagagat cgtgcgggcc ctcgatcttt ctctagtacg cttctggata cgcaacatga 1081 ggaaagacgt gtgtcaggtg gtaatgccca agttctccgt cgaatcggtc ctggatctga 1141 gggacgccct ccagagactg ggggtgcgag acgcgttcga tccatcccgg gcggacttcg 1201 gtcaggcgtc cccgtcgaac gatctatacg tcacgaaggt gttacagacg tccaagatag 1261 aggcggacga acggggaacg acggcgtcga gcgacacagc catcaccctc atccccagga 1321 acgccctcac ggcgatcgtg gcgaacaaac cgtttatgtt tctcatctat cacaagccta 1381 caacgaccgt gttgtttatg ggaacgataa caaagggtga aaaagtaata tacgatacgg 1441 agggtcgaga tgatgtcgta tcctctgtat aaactctttt tgaagggtaa actatgcgac 1501 gtcgaaatcg tcgcggaagg caaaagcatc cgagcgcatc ggttggtgct ttccgcgtat 1561 tctaaatact tttacaactt gtttaatggg aatttcttag aaaaaaacgt agacgtaatc 1621 gacttagaag cggattataa aaccgtattt gacgtgattt attacatgta tacagaatcg 1681 atagaattac acaaagggaa taccgaatcc attttctcat tggttcatta cctacagatt 1741 aaacccctga ttaaaaaatg tatctacgag tttaacagca tcgtgaacga agaaaactgt 1801 atacgtctgt ttaagttcgc cgaattatac gacctgtccg agttgaaacg cagggcgcga 1861 tggcttatgc ccagtctcgt tatgaatgag aaagatcgcc tgcgggagat gtccttggac 1921 gacctatccc tgatgttagt ccagatacgg aacacggtcg atcgaagtat cgctttgtcg 1981 gcgatcacgg aatggataca gacaaacgtt cgcgaacgta ggagacacgc cgtccatctg 2041 gcgacgtgtt taggggatgt cccaggaacc gcatcctcca gagccgtata caaacactac 2101 atgtcggaac tacgtattcg ggttacggaa tttcaaccgg cgtatcacaa ctgcgtcgtg 2161 tacctgggag gatcgatgaa aggtcgcgtc accgccctgg atcc // LOCUS MZEMT2BATP 2054 bp ss-mRNA ORG 15-AUG-1990 DEFINITION Maize mitochondrial F-1-ATPase subunit-2 mRNA, complete cds. ACCESSION M36087 KEYWORDS ATPase subunit-2. SOURCE Maize (inbred line A188) embryo kernel mitochondrion, cDNA to mRNA. ORGANISM Mitochondrion Zea mays Eukaryota; Plantae; Embryobionta; Magnoliophyta; Liliopsida; Commelinidae; Cyperales; Poaceae; Zea mexicana. REFERENCE 1 (bases 1 to 2054) AUTHORS Ehrenshaft,M. and Brambl,R. TITLE Respiration and mitochondrial biogenesis in germinating embryos of maize JOURNAL Plant Physiol. 93, 295-304 (1990) STANDARD simple staff_review FEATURES from to/span description pept 6 1667 F-1-ATPase subunit-2 BASE COUNT 421 a 539 c 558 g 536 t ORIGIN 1 cggccatggc gtcccgccgg gtcgtctcct cgctcctccg ctccgcgtcc cgcctgcggg 61 ccgcctcgcc cgctgctcca cgaccgcgcg cgccaccgca ccgcccgtcc ccggccgggt 121 acctcttcaa ccgcgctgcc gcctacgcct cttccgccgc ggcccaggcg gcacctgcca 181 ccccgccgcc ggccaccggg aagaccgggg ggggcaagat caccgacgag ttcaccggcg 241 ctggcgccat cggccaggtg tgccaggtga tcggcgccgt cgttgacgtg cgcttcgatg 301 agggcctccc gcccatcctc acggcgctcg aggtgctcga caacaacatc cgcctcgtgc 361 tcgaggtggc gcagcacctt ggcgagaaca tggtgcgcac catcgctatg gacggcacgg 421 aggggctcgt ccgcggccag cgcgtcctca acactggctc ccccatcacc gtgcctgttg 481 gcagggctac ccttggacgc atcataaatg ttattggtga accgattgat gagaagggtg 541 acataaagac aaaccacttc ctccctattc atcgtgaagc ccctgctttt gttgagcagg 601 ccactgagca gcaaattctt gttactggaa tcaaggtcgt ggatcttctt gcaccctacc 661 aaaggggtgg aaagattggt ctcttcggtg gtgcaggagt gggtaaaact gtgctcatta 721 tggagttgat caacaatgtt gctaaggccc atggtggttt ctctgtgttt gctggtgttg 781 gagaacgtac ccgtgaaggt aatgatctgt acagggaaat gattgaaagt ggtgtcatta 841 agctagatga caagcagagc gaaagcaagt gtgctcttgt ttacgggcag atgaatgagc 901 ccccgggtgc tcgtgctcgt gttgggttga ctggtttgac tgttgctgaa catttccgtg 961 atgctgaagg acaagatgtg cttctgttta ttgacaacat tttccgtttt actcaggcaa 1021 actctgaggt gtctgctctt cttggacgta tcccatctgc tgtgggatac cagccaaccc 1081 ttgccactga tcttggagga ctgcaagagc gtattacgac aacaaagaag ggttctatta 1141 catctgtgca ggccatctac gtgcctgccg atgacttgac ggatcctgct cctgctacta 1201 cctttgccca tcttgatgct acaactgtgt tgtcacgaca gatctctgag cttggtattt 1261 atcctgctgt tgatccactg gattccacat caagaatgct ttctccccac gtgctgggtg 1321 aggatcacta caacactgct cgtggtgtgc agaaggttct tcagaactac aaaaatcttc 1381 aggatattat tgctatcttg ggtatggatg agctcagtga ggatgacaag ctgacagtcg 1441 cccgtgcaag aaagattcag cgtttcctga gccagccttt ccatgtcgct gaagttttca 1501 cgggtgctcc aggaaagtat gtggagctga aggaaagcgt gaagagtttc cagggtgttt 1561 tggatgggaa gtatgatgac ctccctgagc agtcattcta catggttggt ggcattgagg 1621 aagtcattgc taaggctgag aaaattgcca aggagtctgc ttcataagga ggcttcttgc 1681 ttgttcaacc ctgtacaagt tccatttttg gattttaagc gtttatttat gcttttccca 1741 gttaggcatg acgagctgga gagtccatct cctgctgaga gatgtttgtt ttacccttct 1801 ttgcttcctc caccttacac ccaaataagc aactgcagtg ccgttggttt tggctgcacc 1861 caaactacat gactgaagaa acttgtggcc tgtgtaacgc gaatccatca gaacgccaaa 1921 gttatggctt ctggttgtgg caaattatgg ttcctccctg ttcggttgag tggttgcatt 1981 ctggaggtat tgttctggac tcaggctaat gattgtgcgt gcaactgttt cggagtcatt 2041 tcaaagggtt atcc // LOCUS PFAMTSSU 935 bp ds-DNA ORG 15-AUG-1990 DEFINITION P.falciparum mitochondrial small subunit rRNA gene. ACCESSION M23443 KEYWORDS small subunit ribosomal RNA. SOURCE P.falciparum (strain C10) mitochondrial DNA. ORGANISM Mitochondrion Plasmodium falciparum Eukaryota; Animalia; Protozoa; Microspora; Microsporea; Microsporida; Haemosporina; Plasmodiidae; Plasmodium falciparum. REFERENCE 1 (bases 1 to 935) AUTHORS Gardner,M.J., Bates,P.A., Ling,I.T., Moore,D.J., McCready,S., Gunasekera,M.B.R., Wilson,R.J.M. and Williamson,D.H. TITLE Mitochondrial DNA of the human malarial parasite Plasmodium falciparum JOURNAL Mol. Biochem. Parasitol. 31, 11-18 (1988) STANDARD simple staff_review FEATURES from to/span description rRNA < 1 > 935 small subunit ribosomal RNA BASE COUNT 376 a 91 c 132 g 336 t ORIGIN 1 aagcttgata aagtaatatt tcttttagga agacagtatt attaaaatat tgtaaacttt 61 ttattttatt tttaaatatt gataaaaata aaaaatagta tttgctattt tctgtgccag 121 cagcagcggt aatacagaaa tgcaagcgtt attcatttta ttaggcgtaa agcgttttaa 181 ggttttatat taattttatg tttaaatatt taaattaaat ttaaaataaa ttaataaata 241 ataatataat agagtattat aaaagtatta agaatttttt gagaagtagt gaaatacaat 301 gatacaaaaa agaatatcaa aggcggaagc ataatactat ataattactg acacttaaaa 361 acgaaagcta aggtagcaaa taggattaga taccctagta gtcttagctg taaactatga 421 atattttata tttatatttt ataaatataa taactaacgt gataaatatt ccgcctgagt 481 agtatattcg caagaatgaa attcaaagga attgacggga gcttatacaa gtggtggaac 541 atgtggctta attcgatgca acacgataaa ccttaccaaa atttaacaat atttttaata 601 ttaagaaatt aatattttaa taaaatatat aggtagtgca tggctgtcgt cagttcgtgc 661 tgtgaagtgt taattttagt attataacga acgtaacctt ttataaaaaa aatttttata 721 ataaataata ataaagatta cgtcaagtca ttatgctcct tatattttgg gctgctcacg 781 tgttacataa aatattacaa tattttatta tatgttaaat ataataatta aaatatattt 841 atagttcaga ttataaattg aaactcattt atataaagat ggaatcacta gtaatcgcta 901 atcagaatta tagcggtgaa taagttctta agctt // LOCUS PSEALGR3A 120 bp ds-DNA BCT 15-AUG-1990 DEFINITION P.aeruginosa alginate synthesis regulatory protein (algR3) gene, 5' end. ACCESSION M35259 KEYWORDS alginate synthesis regulatory protein. SOURCE P.aeruginosa (strain 8882) DNA. ORGANISM Pseudomonas aeruginosa Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Pseudomonadaceae. REFERENCE 1 (bases 1 to 120) AUTHORS Kato,J., Misra,T.K. and Chakrabarty,A.M. TITLE AlgR3, a protein resembling eukaryotic histone H1, regulates alginate synthesis in Pseudomonas aeruginosa JOURNAL Proc. Natl. Acad. Sci. U.S.A. 87, 2887-2891 (1990) STANDARD simple staff_review FEATURES from to/span description pept 70 > 120 alginate synthesis regulatory protein (algR3) mRNA 45 > 120 algR3 mRNA BASE COUNT 24 a 39 c 37 g 20 t ORIGIN 1 cgaacccgtt ggcgagaggg ggtttgcggg tctagtatgg gcgcaaccac gtccgcctgg 61 aggcacgtca tgtcggccaa caagaagccc gtcaccaccc ccttgcacct gttgcagcaa // LOCUS STYOMPH 992 bp ds-DNA BCT 15-AUG-1990 DEFINITION S.typhimurium cationic 16 kD outer membrane protein (ompH) gene, complete cds. ACCESSION J05101 M36486 KEYWORDS ompH gene; outer membrane protein. SOURCE S.typhimurium (strain LT2 subline, isolate SH5014) DNA, clones pUCHS[14,16]. ORGANISM Salmonella typhimurium Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Facultatively anaerobic rods; Enterobacteriaceae. REFERENCE 1 (bases 1 to 992) AUTHORS Koski,P., Rhen,M., Kantele,J. and Vaara,M. TITLE Isolation, cloning, and primary structure of a cationic 16 kDa outer membrane protein of Salmonella typhimurium JOURNAL J. Biol. Chem. 264, 18973-18980 (1989) STANDARD full staff_entry REFERENCE 2 (bases 1 to 992) AUTHORS Koski,P., Hirvas,L. and Vaara,M. TITLE Complete sequence of the ompH gene encoding the 16-kDa cationic outer membrane protein of Salmonella typhimurium JOURNAL Gene 88, 117-120 (1989) STANDARD simple staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by M.S.Vaara 02-SEP-1989. FEATURES from to/span description pept 311 796 cationic outer membrane protein precursor (gtg start codon) sigp 311 370 cationic outer membrane protein signal peptide matp 371 793 cationic outer membrane protein signal 142 147 -35 region signal 165 170 -10 region signal 854 879 transcription termination signal binding 293 305 ribosome binding site BASE COUNT 281 a 224 c 260 g 227 t ORIGIN 334 bp upstream of PstI site. 1 gatccgtcat ctgcgccgtc agatgtaccg gattacagcg atccaggcaa catccgtatg 61 tccgcgggta tcgcattaca atggatgtcc cattggggcc gttggtcttc tcctacgccc 121 agccgtttaa aaagtacgat ggagacaaag ccgagcagtt ccagtttaac attggtaaaa 181 cctggtaatt gttcactgca aaggaatgca ttggtagtgt agcgatgact tttggcgatg 241 cccccaggga tcgccaggcc acgcaaagag ctgtaccttc gggtgcaaat gggatggtaa 301 ggagtttatt gtgaaaaagt ggttattagc tgcaggtctt ggtttggcga tggtaacgtc 361 cgcacaggct gctgacaaaa ttgcaatcgt caacatgggt aatctgttcc aacaggttgc 421 gcagaagacg ggtgtatcca atacactgga aaacgaattt aaaggccgtg cggctgaact 481 gcaaaaaatg gaaaccgatc tgcaatctaa aatgcagcgt ctgcaatcca tgaaagcagg 541 tagcgatcgt actaagctgg aaaaagacgt gatgtctcag cgccagactt tcgcacaaaa 601 agcgcaggct tttgagaaag atcgcgctcg tcgttccaac gaagaacgca acaaactggt 661 gactcgtatc cagactgcgg tgaaaaaagt ggctaacgac cagagtatcg atctggtggt 721 agacgcaaac accgttgctt acaacagcag cgatgtgaaa gacatcaccg ctgacgtact 781 gaaacaggtt aaataagtaa tgcccttcaa ttcgactggc tgacttagca gaacagttgg 841 atgcagaatt acacggtgat ggcgatatcg tcatcaccgg cgttgcgtcc atgcaatgtg 901 caacaacagg ccacattacg tttatggtga atcctaagta ccgtgaacac ttaggtttat 961 gccaggcttc tgcggttgtc atgacgcagg ac // LOCUS SIVAGM155 9794 bp ds-DNA VRL 15-AUG-1990 DEFINITION Simian immunodeficiency virus (SIV), complete genome. ACCESSION M29975 KEYWORDS . SOURCE Simian immunodeficiency virus (isolate 155) proviral DNA, clone 4. ORGANISM Simian immunodeficiency virus Viridae; ss-RNA enveloped viruses; Positive strand RNA virus; Retroviridae; Lentivirinae. REFERENCE 1 (bases 1 to 9794) AUTHORS Johnson,P.R., Fomsgaard,A., Allan,J., Gravell,M., London,W.T., Olmstead,R.A. and Hirsch,V.M. TITLE Simian immunodeficiency viruses from African green monkeys display unusual genetic diversity JOURNAL J. Virol. 64, 1086-92 (1990) STANDARD full staff_entry COMMENT Kindly submitted prior to publication and in a computer-readable form by Phillip Johnson, Georgetown University, Rockville MD (301- 496-2976). The 155 isolate is from a monkey imported from Kenya. FEATURES from to/span description pept 931 2493 gag polyprotein pept 2199 5342 pol polyprotein pept 5260 5958 vif protein pept 5741 6100 vpx protein pept 6051 6268 tat protein, exon 2 (first expressed exon) 8492 8633 tat protein, exon 3 (AA at 8493) pept 6208 6268 rev protein, exon 2 (first expressed exon) 8492 8700 rev protein, exon 3 (AA at 8494) pept 6275 8581 env polyprotein pept 8724 9416 nef protein LTR 1 726 5' LTR LTR 9070 9794 3' LTR rpt 1 625 R repeat 5' copy rpt 9578 9794 R repeat 3' copy binding 727 744 primer (Lys-tRNA) binding site signal 9675 9680 poly-A signal BASE COUNT 3321 a 1905 c 2450 g 2118 t ORIGIN 1 tggatgggat ttattactcc gataggagaa ataagatcct taatctgtat gccctcaatg 61 aatggggaat cattgatgat tggaacgcat ggtcaaaagg acctgggata agatacccga 121 ggtgctttgg cttctgcttc aagctagtac cggttgccct gcatgaggaa gcagaaacat 181 gtgaaaggca ttgcttggta cacccagcac aactgcatga agaccctgat ggtataaatc 241 atggagaaat attggcatgg aagtttgatc caatgttggc tgttcagtac gacccctcaa 301 gggagtactt tacagactta tattcaacag ttggtacagg aaactagccg accacaggct 361 tgcggtttcc tggttgccta ggagatgaca ttaagaactg ctgacgggac tttccagcac 421 gggactttcc aaggcgggac atgggcggta cggggagtgg ctttaccctc agagctgcat 481 aaaagcagat gctcgctggc ttgtaactca gtctcttact aggagaccag cttgagcctg 541 ggtgttcgct ggttagccta acctggttgg ccaccagggg taaggactcc ttggcttaga 601 aagctaataa acttgcctgc attagagctt atctgagtca agtgccctca ttaatgcctc 661 actcttgaac gggagaagtt ccttactggg ttctctctca aacccaggcg agagaaactc 721 cagcatggcg cccgaacagg gacttgagtg aaggcacgta cagctgagaa gacgtcggac 781 gcgaaggaac cgcggggtgc gacgtgaccg agaagggctc ggtgagtagg cttctcgagt 841 gccgggaaaa agctcgagcc tagttagagg actaggaagg gccgtagccg taactactct 901 gggcaagtag ggcaggcgga cgggtacgta atgggggcgg ctacctcagc actgaatagg 961 agacaattag atgaatttga gcatatacga cttcgcccga acggaaagaa aaagtatcaa 1021 attaaacatt taatatgggc aggcaagaag atggaccgct tcggcctcca tgagaagtta 1081 ttggagacag aggaaggttg taaaaagatc atagaagttc tctctcccct agaaccaaca 1141 gggtcggaag gaatgaaaag tctgtataat ctggtgtgcg tattgctttg cgtccaccaa 1201 gaaaagaaag tgaaagacac agaggaagct ttagcaatag taagacaatg ctgccaccta 1261 gtggacaaag aaaaaactgc agttacgcca cctggtggac agcagaaaaa taacacagga 1321 ggaacagcga cacctggtgg cagccaaaat tttcccgcac aacagcaagg gaatgcatgg 1381 gtgcatgtac cactttcacc tcgcacccta aatgcatggg taaaagcagt agaagagaaa 1441 aaatttgggg cagaaatagt acccatgttc caagccctct cagaaggctg caccccatat 1501 gacatcaatc agatgcttaa tgtcttagga gatcatcagg gggccttgca aatagtgaaa 1561 gaaataatta atgaggaagc agcccagtgg gatgtaaccc acccaccgcc ggcaggcccc 1621 ttgccagcgg gacagctcag ggatccgggg ggatcagata tagcagggac cactagtaca 1681 gtgcaagagc agctagagtg gatctatact gctaacccaa gggtagatgt aggggccatc 1741 tatcgaagat ggatcatcct agggttacaa aaatgtgtaa aaatgtacaa tccagtgtct 1801 gttttagata tcagacaagg gcccaaagaa ccattcaaag attatgtaga cagattctat 1861 aaagcaataa gagcagaaca agcttcagga gaagtcaaac aatggatgac agaatctttg 1921 ctcattcaga atgccaaccc agattgcaaa gtaattttga agggcctagg gatgcacccc 1981 actcttgaag aaatgctgac agcctgtcaa ggggtgggag gcccaagtta caaagccaaa 2041 gtcatggcag aaatgatgca gaacctgcag agtcagaaca tggtacagca gggaggtgga 2101 aggggaagac caagaccccc gccaaagtgt tacaactgtg gaaaatttgg ccacatgcag 2161 aggcagtgtc ctgagccaag aaaaataaaa tgtcttaaat gtggaaagcc agggcactta 2221 gcaaaagact gcaggggaca ggtgaatttt ttagggtatg gccggtggat ggggacaaaa 2281 ccaagaaatt ttcccgcagc cactcttggg gcggaaccaa gtgcgccccc tccaccgaac 2341 aactctacac cttacgaccc agcaaagaag ctcctgcagc agtatgcaga gaaagggaaa 2401 caaatgagaa atcagaacag aaacccccca gcgaacaatc cagattggaa cgagggatat 2461 tctttgaact ccctctttgg agaagaccaa taaggacctg tataatagga ggaactgccg 2521 ttaaggcatt attagataca ggggcagatg acactataat aaaggataca gatttacaat 2581 taaggggatc atggagacca aaaatagtag gaggaattgg gggagggtta aacgtaaaag 2641 aatatgataa tgtagaagta caattggaag acaagatatt aagaggaaca gtcctcatag 2701 gagcaactcc catcaatatc ataggaagaa actttttagc ccaggcagga gccaaattag 2761 tgatggggca attgtcgcag acaataccaa tcaccccggt acgcttaaag gaaggggcca 2821 gaggaccacg attgaagcaa tggccactct ctaaagaaaa aataatagcc ctgcaagaaa 2881 tttgcaaaac attagaggaa gaaggaaaat taagcagggt agggggagac aatgcataca 2941 atacaccagt attctgtata aggaaaaaag acaaatcaca gtggagaatg ctggtagatt 3001 tcagggaact caacaaagct acacaagact tctttgaagt ccaattaggt ataccccatc 3061 cagcagggtt aaagaaaatg aagcaaataa ccattataga tgtgggggat gcatattata 3121 gcataccact ggatcctgag tttagaaaat acacagcttt caccatccct acggtaaaca 3181 atgagggacc aggcataaga tatcaattta attgcctacc gcagggctgg aagggatccc 3241 cgacaatttt ccaaaacaca gcatcaaaaa ttctagaaga aataaagaaa gaattaaaac 3301 agctgacgat tgtccagtac atggatgacc tctgggtagg atcacaagaa gagggtccaa 3361 agcatgatca gctagtacaa acacttagga atagattgca agaatgggga ttagaaacac 3421 cagagaaaaa ggtgcaaaga gaacctccct ttgagtggat gggatataaa ttatggcctc 3481 ataaatggaa gttacaaagt atagaattag agaagaaaga acaatggaca gtgaatgatc 3541 ttcagaaatt ggtagggaaa ttaaattggg cagcacaatt atatccagga ttgagaacaa 3601 aaaatatctg taagctactt agaggaaaga aaaatttatt agacgtggta gaatggaccc 3661 cagaggcaga agcagagtac gaagaaaaca aggagatcct aaaaacagag caagaaggta 3721 cttattatgc accagaaaaa ccccttaggg cagcagtaca gaaattagga gatgggcaat 3781 ggtcatacca attcaagcag gaaggaaaaa tcttaaaggt agggaagttc gccaaacaga 3841 aagctactca caccaatgag ttgcgtgtac tagcaggagt agtacagaaa atagggaaag 3901 aggccctagt aatttgggga caattaccca cttttgaact cccagtggag agggacacat 3961 gggaacaatg gtgggcagac tattggcaag tcagttggat acccgaatgg gactttgtca 4021 gtgttccgcc cttagtaact ttgtggtata cactgactaa ggaacccatc ccgggagagg 4081 atgtctacta tgtagatgga gcctgtaata gacagtcgaa agagggaaaa gcaggctaca 4141 taacccaaca aggcaaacaa agagtacaac agctagaaaa cacaacaaat caacaagctg 4201 aactgacagc cataaaaatg gccttggagg atagcggccc taaagtcaat atagtaacag 4261 attcacaata tgcgatgggc atattgacag cacagcccac acagagtgac tccccactag 4321 tagaacaaat aatagcacag atggtacaga aagaagccat ctatctgcaa tgggtacctg 4381 ctcataaagg tatagggggc aatgaagaaa tagacaaatt agtaagcaag ggagttagaa 4441 gaatattgtt cattggcagg atagaagaag cacaagaaga acatgatagg tatcacagta 4501 actggagaaa tctagcagac acatttggat tgccacaaat agtagctaaa gaaattgtag 4561 caatgtgccc aaaatgtcaa gtaaaagggg aaccaataca tggacaagta gatgcttcac 4621 caggagtgtg gcagatggac tgcacacata tagaaggaaa aatagtgata gtagcggtcc 4681 atgtagccag tgggtttata gaagcagagg ttatccctag ggaaacagga aaagagacag 4741 caaagttctt gttaaaaata ataggaagat ggcccatcac tcacctccat acagataatg 4801 gaccaaattt cacttctcag gaagtagctg ctatgtgctg gtggggaaag gtagaacaca 4861 caacgggggt accatataat ccacagtccc agggatctat agaaagtatg aacaaacaat 4921 tgaaagagat aattggaaaa ataagagatg actgtcaata tacagaaaca gcagtactta 4981 tggcctgcca cattcacaat tttaaaagaa agggaggaat aggggggcta acagctgcag 5041 agagactaat aaatatgata acaacacaat tagaaatcaa cactctacaa accaaaatcc 5101 aaaaaatttt gaattttaga gtctactaca gagaaggcag agatccagtg tggaagggac 5161 ctgctcgcct gatctggaaa ggagaaggcg cggtagttct caaggaaggt gaagaactga 5221 aggtagttcc gagaaggaaa gcaaaaatca taaaagacta tgagccaaga aaaacattgg 5281 gtgatgagac tcacctggaa ggtgcaggag gaagtgatca ccaaatggca ggggatagtt 5341 agatattgga tgaataaaag gaatctgaaa tgggaataca aaatgcatta tcaaatcact 5401 tgggcatggt acactatgag cagatatgta atacccctcc caggaagtgg agaaatccat 5461 gtggatatct attggcattt agctccaaaa caaggatggc tctcaactta tgcagtagga 5521 atacaatatg ttagcctagt aaatgataaa tatagaacag aattagatcc caatacagca 5581 gactccatga tacattgtca ttattttacc tgttttacag atagagccat ccaacaggca 5641 ctaaggggaa acaggttcat cttctgtcaa tttccaggag gacataaact aacaggtcag 5701 gtaccctcct tgcaatattt agcattacta gcccatcaaa atggcctcag gaagagatcc 5761 cagagaggag agaccaggag gactagaaat ttgggatctc agcagggagc cgtgggacga 5821 atggctcaga gatatggtag aagaaatcaa caacgaagcc aaactgcatt ttggccgaga 5881 actcctatac caagtatgga attattgtca ggaggaaggg gagagacagg gaagacccat 5941 agcggaaagg gcatataagt attatcgctt agttcagaaa gctctctttg tgcatttccg 6001 gtgtggatgt cgcaggagac aaccctttga gccatacgag gagaggagaa atggacaagg 6061 gggaggaaga ccaggacgtg tcccaccagg acttgattaa acaatacagg aaaccccttg 6121 agacatgtac aaataaatgc ttttgcaaaa aatgctgtta tcattgccaa ttctgcttct 6181 tacggaaagg actaggtatt acctatcatg cctttaggac cagaagaaag aagattgctt 6241 cggctgatcg cattcctgta ccgcagcagt aagtatgaca aagttcttag gaatttttat 6301 agtattagga atagggatag gaatagggat aagtacaaaa cagcagtgga taacagtgtt 6361 ctatggagta ccagtatgga aaaacagctc agtccaagct ttttgcatga cacctactac 6421 taggttgtgg gcaactacta attgcatacc agatgatcat gactatacag aagtaccact 6481 gaatataaca gagccatttg aagcatgggc agacagaaat cccttagtag cacaagcagg 6541 aagtaacatt cacctgctgt ttgaacagac attaaagccc tgtgtaaagc tatcacctct 6601 atgtatcaaa atgaattgtg tagagttaaa aggctccgca acctctaccc cagcaacctc 6661 tactacggca ggaaccaaac taccctgtgt tagaaataaa acagactcca acctacagtc 6721 atgcaacgac accatcatag aaaaggagat gaatgacgag gcagcgtcaa actgcacctt 6781 tgctatggct gggtacatta gggaccaaaa gaagaattac tcagtagtat ggaatgatgc 6841 agaaatcttt tgtaagcgta gtacatcgca taatgggaca aaagagtgct atatgatcca 6901 ctgtaatgat tcagttataa aggaagcttg tgataagaca tattgggatg aattaagact 6961 aagatattgt gctccagcag gatacgcttt gcttaaatgt aatgattggg attatgcagg 7021 atttaagcca gaatgttcta atgtttcagt agtgcattgc acaactttaa tgaatacaac 7081 agtaaccact ggtctgttat tgaatggaag ctattcagaa aatcgaaccc agatctggca 7141 aaaacatgga gtgagcaatg actcagtgtt aatcttgctc aataagcatt ataacctgac 7201 agttacatgc aaaaggccag ggaataagac agtcttgcca gtaacgataa tggcaggatt 7261 agtcttccac tcacagaagt ataatacaag actaaggcag gcctggtgcc acttccaggg 7321 caattggaaa ggagcttgga aggaagtaca agaggaaata gtaaaattac caaaagaacg 7381 gtaccaaggc accaatgata caaacaaaat ctttttgcaa agacaatttg gagacccaga 7441 agcagcaaat ctatggttca actgtcaagg ggaattcttc tactgtaaaa tggactggtt 7501 tttaaattat ctgaataatt taacagtgga tgctgatcat aatcattgta aaaacaacgc 7561 agggaaaggt cgaagtccag gtccctgtgt acagagaact tatgttgcct gccatatccg 7621 atctgtcata aatgattggt atactatatc aaagaaaaca tatgctccac caagagaagg 7681 acatttgcag tgcacgtcca cagttactgg gatgacagta gagctaaact ataataacca 7741 gaacaggaca aatgtaacat tgagtcccca gatagaaacc atctgggcgg cagaattggg 7801 cagatacaaa ttggtagaga ttacaccaat tggatttgca cccacagaag tcaggcgata 7861 cacgggaggc caagagaggc aaaaacgagt cccgttcgtg ctagggttcc taggcttctt 7921 gggagctgct gggactgcaa tgggagcagc ggcgacagcc ctgacggtcc agtctcagca 7981 tttacttgct gggatattgc agcagcagaa gaatctgctg gcggctgtgg gagctcaaca 8041 gcagatgttg aagctgacca tttggggtgt gaaaaacctc aatgcccgcg tcacagctct 8101 tgagaagtac ctggcggatc aggcacggtt aaacgcttgg gggtgcgcgt ggaaacaagt 8161 atgtcataca acagtaccct ggacgtggaa taatacacca gagtggaata atatgacctg 8221 gttggagtgg gaaaaacaga tagaaggatt ggagggcaac ataacaaaac aattggaaca 8281 ggcaagggaa caagaggaaa agaatttgga tgcttatcaa aagttgtcag actggtcgag 8341 tttttggtct tggttcgatt tttcaaaatg gctgaacatt ttaaagatag gctttttggc 8401 agtaataggc gttatagggt taagattgct ttacacatta tatacttgca tagctagggt 8461 taggcagggt tactctcctt tatctcctca gatccatatc catccgtgga agggacagcc 8521 agacaacgca ggagagccag aagaaggtgg aagaacaggc aaaagcaaat ctacgcatta 8581 gcagaaagaa tttgggggac gagacaagag gaccagttgg tgcaggcaat tgaccaattg 8641 gttcttgaca ctcagcatct ggttacacaa cagctgcctg accctccttc tcaagcttag 8701 aagcgcctgg cagtacttac aatatgggct tggggagctc aaagccgcag cacaagaagc 8761 agttaaccat ctggcgagct ttgcacgcaa cgcggcacac cagatatggc ttgcttgcag 8821 atccgcttat cgggcaatca tcaactctcc aagaagagtg cgacaagggc ttgaggaagt 8881 ccttaattag gaagagaaat ggcaacatga ctccagaagg aagacgtcta caggacgggg 8941 accaatggga tgaatggtca gatgaagaag atgaagtggg atttccagta agaccaagag 9001 tgccactaag acaaataaca tacaaacttg cagtagattt ttcgcacttt ttaaaagaaa 9061 agggaggact ggatgggatt tattactccg ataggagaaa taagatcctt aatctgtatg 9121 ccctcaatga atggggaatc attgatgatt ggaacgcatg gtcaaaagga cctgggataa 9181 gatacccgag gtgctttggc ttctgcttca agctagtacc ggttgccctg catgaggaag 9241 cagaaacatg tgaaaggcat tgcttggtac acccagcaca actgcatgaa gaccctgatg 9301 gtataaatca tggagaaata ttggcatgga agtttgatcc aatgttggct gttcagtacg 9361 acccctcaag ggagtacttt acagacttat attcaacagt tggtacagga aactagccga 9421 ccacaggctt gcggtttcct ggttgcctag gagatgacat taagaactgc tgacgggact 9481 ttccagcacg ggactttcca aggcgggaca tgggcggtac ggggagtggc tttaccctca 9541 gagctgcata aaagcagatg ctcgctggct tgtaactcag tctcttacta ggagaccagc 9601 ttgagcctgg gtgttcgctg gttagcctaa cctggttggc caccaggggt aaggactcct 9661 tggcttagaa agctaataaa cttgcctgca ttagagctta tctgagtcaa gtgccctcat 9721 taatgcctca ctcttgaacg ggagaagttc cttactgggt tctctctcaa acccaggcga 9781 gagaaactcc agca // LOCUS SIVAGM3 9625 bp ds-RNA VRL 15-AUG-1990 DEFINITION Simian immunodeficiency virus (SIV) proviral, complete genome. ACCESSION M30931 KEYWORDS complete genome. SOURCE Simian immunodeficiency virus (isolate AGM3) from African Green monkey proviral genomic DNA. ORGANISM Simian immunodeficiency virus Viridae; ss-RNA enveloped viruses; Positive strand RNA virus; Retroviridae; Lentivirinae. REFERENCE 1 (bases 1 to 9625) AUTHORS Baier,M., Garber,C., Mueller,C., Cichutek,K. and Kurth,R. TITLE Complete nucleotide sequence of a simian immunodeficiency virus from African green monkeys: A novel type of intragroup divergence JOURNAL Unpublished (1990); Paul-Ehrlich-Institute, 6070 Langen 1, Germany STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by M.Baier 20-DEC-1989. This sequence was taken from an infectious molecular clone (used for heterologous infection of the pigtail macaque). The 3' LTR sequence does not appear to match the 5' LTR sequence. FEATURES from to/span description pept 431 1996 gag polyprotein pept > 1687 4827 pol polyprotein (NH2 terminus uncertain) pept 4763 5461 vif protein pept 5244 5603 vpX protein pept 5554 5771 tat protein, exon 2 (first expressed exon) 8013 8154 tat protein, exon 3 (AA at 8014) pept 5711 5771 rev protein, exon 2 (first expressed exon) 8013 8221 rev protein, exon 3 (AA at 8015) pept 5778 8411 env polyprotein pept 8245 8934 nef protein BASE COUNT 3324 a 1827 c 2383 g 2091 t ORIGIN 1 cagtctctta ctaggagacc agcttgagcc tgggtgttcg ctggttagcc taacctggtt 61 ggccaccagg ggtaaggact ccttggctta gaaagctaat aaatcttcgc tgcattagag 121 cttctctgag tcaagtgccc tcattgacgc ctcactcttg aacgggtaaa acttccttac 181 tgggttctct ctcaacccag gcgagagaaa ctccagcagt ggcgcccgaa cagggacttg 241 acttgagtga aggcacgtac agctgagaag acgtcggacg cgaaggaagg cgcggggtgc 301 gacgtgacca agaagggctt ggtgagtagg cttctcgagt gccgggaaaa agctcgagcc 361 tagttagagg actaggaagg gccgtagcca taactactct gggcaagtag ggcaggcgga 421 cgggtacgca atgggggcgg ctacctcagc actaaatagg agacaattag acaaatttga 481 gcatatacga cttcgcccga ccggaaagaa aaagtaccaa attaaacatt taatatgggc 541 aggcaaggaa atggagcgct tcggcctcca tgagagatta ctagaatcag aagaaggatg 601 taagaagatc atagaagtac tctacccgct agaaccaaca gggtcggagg gcttaaaaag 661 tctgtttaac cttgtgtgcg tattgttttg cgtacacaaa gataaggaag tgaaagacac 721 agaagaagca gtagcaatag taagacaatg ctgccatcta gtggagaaag aaagaaatgc 781 agaaagaaat acaacagaga catctagtgg acaaaagaaa aatgacaagg gagtaacagt 841 gccacctggt ggcagtcaaa atttcccagc acaacaacag ggaaatgcat ggatacatgt 901 gcccttgtca ccacgcacct taaatgcgtg ggtaaaagca gtagaggaga aaaaattcgg 961 agcagaaata gtgcccatgt tccaggcttt atcagaaggg tgcacaccct atgacatcaa 1021 tcaaatgctt aatgtcctgg gagaccatca aggggcgcta caaatagtaa aagaaatcat 1081 caatgaggaa gcagcccagt gggatatagc tcacccacca ccagcaggac cattaccagc 1141 aggacaactc agagacccta gaggctctga catagcagga accaccagca cagtgcaaga 1201 acagctggaa tggatataca cagccaatcc cagagtagat gtgggtgcca tctatagaag 1261 gtggattatc ctggggttgc aaaaatgtgt aaaaatgtac aacccagtgt ctgtcttaga 1321 cataagacag gggcccaaag aagcattcaa agactacgta gataggttct acaaagcaat 1381 aagagctgag caggcctcag gagaagtaaa acagtggatg acagaatcat tactcattca 1441 gaatgctaat ccagactgta aagtcatcct aaagggcctg ggaatgcatc ccactctaga 1501 agaaatgtta actgcctgtc aaggagtggg aggaccaagt tacaaagcaa aagtgatggc 1561 agaaatgatg caaaatatgc aaagccagaa catgatgcaa cagggcggtc agagaggaag 1621 accaagaccc ccagtaaagt gttacaattg tggaaaattt ggccatatgc aaagacaatg 1681 ccctgaacca agaaagatga gatgcttgaa atgtgggaaa ccagggcatt tagcaaaaga 1741 ttgcagagga caggtaaatt ttttagggta tggccggtgg atgggagcga aacccagaaa 1801 ttttcccgcc gctactcttg gggtggagcc aactgcgccc cctccaccga gtccatacga 1861 ccctgcaaag aagctcctgc agcaatatgc agacaagggg aagcagttga gggaacaaag 1921 gaaaaaacca ccagcagtga atcccgattg gacagaggga tattctttga actccctctt 1981 tggagaagac caataaaaac agtttacata gaaggggtcc ccatcagagc attattagat 2041 acgggggcag atgataccat tataaaagaa gcagatttac aattatcagg aacatggaaa 2101 ccaaaaataa tagggggcat tggaggggga ctcaatgtaa aagagtatag tgatagggaa 2161 gtaagattgg aagacaaaat tttgagaggg accatattga taggaagcac tcccataaac 2221 ataattggaa gaaatatatt agcaccagca ggagccaaat tagtaatggg tcaactgtca 2281 gaacaaattc ccattacccc tgtgaaatta aaagaagggg ctagaggacc tttcttaaaa 2341 caatggcccc tctccaaaga aaaaataaaa gccttacagg aaatatgtga ccaattagag 2401 aaagaaggaa aaattagcaa gataggagga gagaatgcat acaacactcc agtgttttgc 2461 ataaagaaaa aagacaagtc acaatggaga atgttagtag attttaggga actaaacaaa 2521 gcaacacaag attttttcga agtacagtta ggcatacctc atccatcagg gttcgaaaag 2581 atgacggaaa taacagtatt agacataggg gatgcctatt attcaatacc attagaccca 2641 gagtttagaa agtataccgc ttttaccatt ccatcagtaa ataatcaagg gccaggtact 2701 agatatcagt tcaactgtct tccacaagga tggaagggat ccccaactat ttttcagaac 2761 acagcagctt ccattctaga agaaataaaa aaggagttaa aacccctaac cattgtgcaa 2821 tacatggatg acctatgggt agggtctcag gaagatgaat acacgcatga tcggttggta 2881 gaacaactaa gaatgaaatt aagtgcctgg ggattagaaa caccagacaa gaaagtacag 2941 aaaaaaccac cttatgagtg gatgggatac aaattgtggc cacacaagtg gcagataagc 3001 agcatagaat tagaagacaa agaagaatgg actgtaaatg atatacaaag actagtgggg 3061 aaactaaatt gggcagcaca gctttaccca ggactcagaa ctaaaaactt gtgtaaatta 3121 atcagaggaa aaaagaactt actagaaaca gtaacctgga cagaggaagc agaagcagaa 3181 tatgcagaaa acaaagagat cttaaaaacg gaacaggaag ggacctacta caaaccagga 3241 agacccatca gagcagcagt gcaaaaacta gaaggaggtc aatggagtta ccaattcaag 3301 caagagggac aagtattaaa agtaggtaaa tacacaaagc agaaaaacac tcataccaat 3361 gagttccgtg tattggcagg attagtacaa aaactttgta aagaatcttt agttatatgg 3421 ggagagttgc cagtccttga actcccaata gagagggaag tatgggaaca atggtgggct 3481 gattactggc aggtaagttg gattccagac tgggaatttg tcagtacccc acccctagta 3541 aaattatggt ataccctgac aaaagaaccc ataccaaagg aagatgtcta ctatgtggat 3601 ggagcttgta atagaaattc aagggaagga aaagcaggat atatcacaca atatgggaaa 3661 caaagggtgg aaaaattaga aaatacaaca aaccagcaag cagaattaat ggccataaaa 3721 atggcactag aagatagtgg gcctaatgta aacatagtaa cagattcaca atatgcaatg 3781 ggaatattaa ctgcccaacc cacacagagt gactcaccct taatagaaca aattatagca 3841 ctaatggtac aaaaacatca gatatacttg caatgggtac cagcagacaa agggatagga 3901 ggcaatgaag agatagataa actagtaagt caagggatga ggaaaatttt atttttagaa 3961 aaaatagaag aagcccagga ggaacatgaa aggtaccata ataattggag gaacttagca 4021 gacacttatg ggctaccaca aattgtggca aaagaaatag tagccatgtg tccaaaatgt 4081 cagataaaag gggaaccagt ccatgggcaa gtagatgcct cgccaggggt atggcaaatg 4141 gactgtacac atttagaagg caaggtaatc atagtagcag tccatgtagc cagtggattc 4201 atagaagcag aagttatacc tagagaaaca gggaaagaaa cagcaaaatt tttattaaag 4261 atactaagta gatggcccat aacccaactg catacagaca atggacccaa ttttacgtct 4321 caagaagtag cagcaatgtg ttggtgggga aaaatagaac acaccacagg tgtaccctat 4381 aaccctcaat cacaaggctc tatagagagt atgaataaac agttaaaaga aataattggg 4441 aaaataagag atgactgtca atacacagaa acagcagtac ttatggcatg ccacatccac 4501 aattttaaaa gaaagggagg aatagggggg ttaacaccgg cagagagatt aatcaatatg 4561 attactacac aattagaatt acaacaccta caaaccaaaa ttcaaaaaat tttaaatttt 4621 agagtctact acagagaagg gagagatcct gtctggaaag gaccaggaca gttaatttgg 4681 aaaggggaag gtgcagtggt catcaaagga ggtgtggaat taaaagaata cccaagaagg 4741 aaagcaaaaa ttataaagga ttatgaacca agaaaaagaa tgggtgatga gagtaacttg 4801 gaaggtgccg gaggagctga taactaaatg gcaagggata gtgaggtact ggatgaggac 4861 tagaaaatta gactggaaat atcgaatgca ctaccaaatt acatgggcat ggtacacaat 4921 gagtagatat gagatacccc tagggcaaca tggaagtata catgtagatc tatattggca 4981 tctgacacca gaaaagggat ggctatcaac atatgctgag gggatacagt atctaagcaa 5041 tagggatcct tggtatagga cagaattgga tcctgcaaca gcagatagcc tgatacatac 5101 ccattatttt acttgtttta cagaaagggc catcaggaaa gccctattgg gacagaggtt 5161 caccttctgt cagttccccg agggacacaa gaaaacagga caggtaccct ctttgcaata 5221 cttagctctc cttgcacacc aaaatggcct caggcagaga tcccagagaa gcaagaccgg 5281 gggaactaga aatatgggat ttgagcaggg agccgtggga cgaatggcta agagacatgc 5341 tagaagatat caatcaggaa gccaagatgc attttgggcg cgagctcctg ttccaagtat 5401 ggaactattg tcaggaggag ggagaaagga atcgcactcc catgctagaa agggcttata 5461 aatattataa attggtgcaa aaagctctct ttgtgcattt ccggtgtgga tgccgcagaa 5521 gacaaccctt tgaaccatac gaagaaagga gggatggaca agggggagga cgagcagggc 5581 gcgtaccacc aggacttgat tgaacaactc aaagcacccc tgaagcggtg tacaaacaag 5641 tgctattgta aatgttgctg ttatcactgt cagctttgct ttttacaaaa gggattaggt 5701 gttacctatc atgcccctag gatcagaaga aagaagattg ctccgcttga tcgctttcct 5761 gaacaaaaac agtgagtatg aagctgacat tactgatagg gatactatta atagggatag 5821 gagtagtgct taatacaagg caacaatggg tcacagtatt ttatggagta ccagtatgga 5881 aaaacagctc agtacaggct ttctgcatga cacccaccac cagactatgg gcaactacta 5941 actcgatacc agatgatcat gactacacag aggtaccatt aaacatcact gaaccatttg 6001 aagcatgggc tgacagaaac cccttagtag cacaagcagg aagtaatata cacctgctat 6061 ttgagcagac tctgaagcca tgtgtaaaat tatcaccttt gtgcattaaa atgtcctgtg 6121 tagaattgaa ctcctctgag cctaccacca ctcctaaaag taccacggcc tcaacaacca 6181 atatcacagc ctcaacaacc actttgccgt gtgtccagaa caagacaagt actgtgttag 6241 aatcatgtaa tgaaacaatc atagaaaagg aattaaatga agagcctgct tctaattgta 6301 catttgcaat ggcagggtat gtaagagatc agaaaaagaa gtattcagtg gtgtggaatg 6361 atgcagaaat catgtgtaag aagggtaaca attctaacag agaatgttat atgattcatt 6421 gtaatgattc agttataaaa gaagcctgtg ataaaacata ttgggatgag ttaagattaa 6481 ggtactgtgc cccggcaggg tttgctttat taaaatgcaa cgattatgat tatgcagggt 6541 ttaagacaaa ctgttctaat gtttcagtgg tgcattgtac taacttgata aatacaacag 6601 tgactactgg actgttgttg aatgggagct actcagagaa tcgaacccag atatggcaga 6661 aacatagagt aagcaatgac tcagtgttag tgttatttaa taaacattac aatctaacag 6721 ttacttgcaa aagaccagga aacaaaacag tcttaccagt aacaatcatg gcagggctag 6781 tgtttcattc tcagaggtac aatacaaggc tgagacaagc ttggtgtcac ttccagggca 6841 actggagagg agcctggaaa gaagtaaaaa atgaaatagt aaaattacca aaagatagat 6901 accaaggaac caatgatact gaagagattt atctgcagag actatttgga gatccagaag 6961 cagcaaattt atggtttaat tgtcaggggg aattcttcta ttgtaaaatg gattggtttc 7021 taaattacct gaataatcgt acagtagatc cggaccataa tccgtgtaat ggtacgaagg 7081 gaaaaggtaa ggcaccagga ccctgtgcac aaagaacata tgttgcttgc catatacgat 7141 ctgtcattaa tgattggtac acactatcaa ggaaaaccta tgcaccgcca agagaagggc 7201 acttgcaatg cacatccacg gtaacgggta tgtcagtgga gctaaattac aatagtaaga 7261 acaggactaa tgtaacatta agtccccaga tagaaaccat ctgggcagca gaattgggca 7321 ggtacaaatt agtagaaatt acaccaattg gcttcgcacc cacagaagta agaaggtata 7381 cgggaggtca tgacagaaca aagcgagtcc cgttcgtgct agggttccta ggcttcttag 7441 gagctgctgg gactgcaatg ggagcagcgg cgacagccct gacggtccag tctcagcatt 7501 tacttgctgg gatactgcag cagcagaaga atctgctggc ggctgtggag gctcaacagc 7561 agatgttgaa gctgaccatt tggggtgtga aaaacctcaa tgcccgcgtc acagctcttg 7621 agaagtacct agaggaccag gcgcggttga atgcttgggg gtgcgcatgg aagcaagtct 7681 gtcatacaac cgtaccgtgg cagtggaata ataggacccc tgattggaat aatatgactt 7741 ggctggaatg ggaaagacag atatcgtatt tggaaggtaa cataacaaca caattagagg 7801 aagccagagc acaggaggag aagaatttgg atgcatacca aaaattaagt agttggtcag 7861 atttctggtc ttggttcgat ttctcaaagt ggctgaacat tctaaaaata ggatttttgg 7921 atgtactagg tattatagga ttaagattgc tttatacagt atattcttgc atagctaggg 7981 ttaggcaggg ttactctcct ctttctccac agatccatat ccacccgtgg aagggacagc 8041 cagacaacgc agaagggcca ggagaaggtg gagacaagcg caagaacagc tccgagcctt 8101 ggcagaaaga atctggcaca gcagagtgga agagcaactg gtgcaagcga ttgaccaatt 8161 ggtgctcgat cagcagcatc tggctataca acagttgcct gaccctccta gttcatctta 8221 ggagcgcttt ccagtacata caatatgggc ttggggaact caaagccgca gcacaagaag 8281 cagttgtcgc tttggcacgc cttgcacaaa acgcgggcta ccagatatgg cttgcttgca 8341 gatccgctta tagggcaatc atcaactctc caagaagagt gcgacaaggc cttgaaggaa 8401 tccttaatta ggaagagaaa tggtaaaatg actccagaag gaagaaaatt acaagaagga 8461 gataaatggg atgaatggtc tgatgaagaa gatgaagtag gatttccagt aagaccaaga 8521 gtgccgctaa gacaaatgac ctataaatta gcggtggact tttcgcactt tttaaaagaa 8581 aaggggggac tggatgggat ttattactcc gacaggagga atcagatcct aaacctgtac 8641 gccctcaatg agtggggaat cattgatgat tggaatgctt ggtcagaagg accaggaatc 8701 agatacccaa gatgcttcgg cttctgcttt aaattggtac cagtagacct gcatgaggaa 8761 gcagagactt gtgagagaca ttgcctggtg catccagcac aagtgaggga agaccctgat 8821 ggaatcaacc atggagaagt cttggtctgg aagtttgatc ccatgttagc agtccaatat 8881 gaccctaaca gaaaatatct cactgacatg catgatcttg gcaagaggaa gtagctaacc 8941 gcaggcttgt ggttaagcac atcaccatgg tgatgacatt aagaactgct gacgggactt 9001 tccagcaagg gactttccag ggcgggtcat gggcggtacg gggagtggct ttaccctcag 9061 agctgcataa aagcagatgc tcgctggctt gtaactcagt ctcttactag gagaccagct 9121 tgagcctggg tgttcgctgg ttagcctaac ctggttggcc accaggggta aggactcctt 9181 ggcttagaaa gctaataaat cttcgctgca ttaggcagag acttgtgaga gacattgcct 9241 ggtgcatcca gcacaagtga gggaagaccc tgatggaatc aaccatggag aagtcttggt 9301 ctggaagttt gatcccatgt tagcagtcca atatgaccct aacagaaaat atctcactga 9361 catgcatgat cttggcaaga ggaagtagct aaccgcaggc ttgtggttaa gcacatcacc 9421 atggtgatga cattaagaac tgctgacggg actttccagc aagggacttt ccagggcggg 9481 tcatgggcgg tacggggagt ggctttaccc tcagagctgc ataaaagcag atgctcgctg 9541 gcttgtaact cagtctctta ctaggagacc agcttgagcc tgggtgttcg ctggttagcc 9601 taacctggtt ggccaccagg ggtaa // LOCUS SIVAGM691 683 bp ss-RNA VRL 15-AUG-1990 DEFINITION Simian immunodeficiency virus (SIV) long terminal repeat. ACCESSION M33719 KEYWORDS . SEGMENT 1 of 2 SOURCE Simian immunodeficiency virus (isolate ver-1 (692)) from African green monkey proviral DNA. ORGANISM Simian immunodeficiency virus Viridae; ss-RNA enveloped viruses; Positive strand RNA virus; Retroviridae; Lentivirinae. REFERENCE 1 (bases 1 to 683) AUTHORS Johnson,P.R., Fomsgaard,A., Allan,J., Gravell,M., London,W.T., Olmstead,R.A. and Hirsch,V.M. TITLE Simian immunodeficiency viruses from African green monkeys display unusual genetic diversity JOURNAL J. Virol. 64, 1086-92 (1990) STANDARD full staff_entry COMMENT Kindly submitted prior to publication in computer-readable form by Phillip Johnson. The ver-1 isolate is from a monkey imported from Ethiopia. Author address:Phillip Johnson Georgetown University Rockville, MD (301-496-2976) FEATURES from to/span description LTR 1 683 long terminal repeat BASE COUNT 174 a 149 c 187 g 173 t ORIGIN 1 tggatgggat ttattactcc gaaagaaggg aaaagatttt gaacctgtat gcattaaatg 61 aatggggaat catagatgat tggcaagctt atactccagg tccaggcatc agatatccaa 121 gatgctttgg gttctgtttt gaattagtgc cagtggacct tagtgaggaa gcgcaaggat 181 gtgaaaggca ctgtctggtc catcctgctc aattacagga ggatccagat ggtatctggc 241 atggagaaac attggtctgg agattcaatc ccatgctagc atgcaaggcc atgccaggag 301 tgttcaatga catgcatgca acagtgggga agtagcttgc ggttagcgcg tccgggacct 361 gtgtaccaac cagcatagca accatgctaa tgagctaggg actttccaga aggggagtgg 421 tttaaccctc agatattgta tataagcaga tgctcttggg cttgtaactc agtgctctta 481 ctaggagcca gctagagcct gggtgttcgc tggtagccta acctggactg gccctccagg 541 ggtaagagcc tccacggctt gaatgcttaa taaaccttgc ctgcattaga agtacttcga 601 gtcgtgtggt cccattgccg cctccgttca cgggaatcct caatactggg ttctctcttg 661 cccaggggag agaaactcca gca // LOCUS SIVAGM692 1542 bp ss-DNA VRL 15-AUG-1990 DEFINITION Simian immunodeficiency virus (SIV) gag gene, complete cds. ACCESSION M29974 KEYWORDS . SEGMENT 2 of 2 SOURCE Simian immunodeficiency virus (isolate ver-1 (692)) from African green monkey proviral DNA, clone ver-1(692). ORGANISM Simian immunodeficiency virus Viridae; ss-RNA enveloped viruses; Positive strand RNA virus; Retroviridae; Lentivirinae. REFERENCE 1 (bases 1 to 1542) AUTHORS Johnson,P.R., Fomsgaard,A., Allan,J., Gravell,M., London,W.T., Olmstead,R.A. and Hirsch,V.M. TITLE Simian immunodeficiency viruses from African green monkeys display unusual genetic diversity JOURNAL J. Virol. 64, 1086-92 (1990) STANDARD full staff_entry COMMENT Kindly submitted prior to publication in computer-readable form by Phillip Johnson. The ver-1 isolate is from a monkey imported from Ethiopia. Author address:Phillip Johnson Georgetown University Rockville, MD (301-496-2976) FEATURES from to/span description pept 1 1542 gag polyprotein BASE COUNT 532 a 299 c 405 g 306 t ORIGIN 1 atgggttcgg gttcctcagc actgtcaggg agaaaattag accaatttga acatatacgt 61 cttcgcccga acggaaagaa aaagtaccaa ttgaaacatt taatatgggc aggcaaggaa 121 atggagcgct ttggcctcca tgaaaagttg ttagaaacag aagaggggtg taaaaagatc 181 atagaagtat tgcttccctt agaaccaacc gggtcggaag gtttaaaaag cctgttcaat 241 ttgacctgcg tcatttgctg cattcatcag gaagcgaaag tgaaagacac agaggaagca 301 gtaataagaa taaagcaaca gtgccatcta gtggacaaag gtgagaatgc agccaaagga 361 atagataaga caacaccgac acctagtggt aggagtcaaa attacccggc acaacagcag 421 aataatgtat gggtacatgt gccacttagc cccagaacat taaatgcttg ggtaaaagta 481 attgaagaaa agaaatttgg agcagagata gttcccatgt ttcaggccct gtcagaagga 541 tgtaccccat atgatgtgaa ccaaatgttg aatgttctag gagaccatca gggggccctg 601 cagatagtga aagaggtcat caatgaagaa gctgcccagt gggacattac acatccccca 661 ccagcagggc cgctcccagc agggcaattg agagatccaa gggggtcaga catagcaggg 721 actactagta ccattcaaga acaactagaa tggatttaca cagccaaccc aagaatagac 781 gtgggagcta tctataggag atgggtaata gcagggctgc aaaaatgtgt cagaatgtat 841 aatccaacag gggttctgga tataagacaa ggaccaagag aatcttttag cgattatgta 901 gatagattct acaaggccct gagagcagaa caagcctctc aggatgttaa gaattggatg 961 acagacactc tgttgattca aaatgctaac ccagagtgta aggtcattct gaaagggcta 1021 ggcatgcacc ctaccttgga agaaatgctt acggcatgcc agggagtagg gggaccccaa 1081 tacaaagcca aattgatggt agaaatgatg aatcaaatgc agggggtcaa catggtacag 1141 caagcaggaa taggaggtag agggagagga agaccagtta aatgctacaa atgtggaaaa 1201 tttgggcatg tgcagaaaaa ttgcactcaa aaagggccag tagtatgcct gaaatgtgga 1261 aaacctggcc attttgctcg agattgcaga ggagcagtaa attttttagg gtatggcagg 1321 tggatgggag caaaaccaaa aaatttttta gaacacagag cagcagtccc ctccgcccct 1381 ccaccgccgc acaacccagg ggcgtacgac gaagccactc ggcttctgga gaaatatacc 1441 caagagggag cccaacaaag gagaaaagta gagaagagct cccaagcggg gagggaggaa 1501 gaggattatt ccttgaaatc cctctttgga gaagaccaat aa // LOCUS SIVAGM90 723 bp ss-RNA VRL 15-AUG-1990 DEFINITION Simian immunodeficiency virus (SIV) long terminal repeat. ACCESSION M33718 KEYWORDS . SOURCE Simian immunodeficiency virus (isolate 90) from African green monkey proviral DNA, PCR clone 03F. ORGANISM Simian immunodeficiency virus Viridae; ss-RNA enveloped viruses; Positive strand RNA virus; Retroviridae; Lentivirinae. REFERENCE 1 (bases 1 to 723) AUTHORS Johnson,P.R., Fomsgaard,A., Allan,J., Gravell,M., London,W.T., Olmstead,R.A. and Hirsch,V.M. TITLE Simian immunodeficiency viruses from African green monkeys display unusual genetic diversity JOURNAL J. Virol. 64, 1086-92 (1990) STANDARD full staff_entry COMMENT Kindly submitted in computer-readable form prior to publication by Phillip Johnson, Georgetown University, Rockville, MD (301-496- 2976). Author address:Phillip Johnson Georgetown University Rockville, MD (301-496-2976) FEATURES from to/span description LTR 1 723 long terminal repeat BASE COUNT 187 a 160 c 202 g 174 t ORIGIN 1 tggatgggat ttattactcc gaaaggagga atagaatcct caacctatat gctcttaatg 61 aatggggaat cattgatgat tggaatgcat ggtcagcagg accaggcata agatatcccc 121 gctgctttgg cttttgcttc aagttagtac cggtagagat gcatgaagag gcagaaacct 181 gtgagagaca ttgcttggtg catcctgcac aagtaaaaga ggaccccgat ggcatcagtc 241 atggagagac cttggtctgg aagtttgacc cctatgttag cagtgcagta tgacccaaac 301 agacagtatt tagaagacat gcatgcactg gtgaagagga agtagctaac cgcaggcttg 361 tggttaagcc gttgccgggg agatgacatt tgaaactgct gacaagggac tttccaaggg 421 actttccagg gcgggccatg ggcggtacgg ggagtggttt taccctcaga gctgcataaa 481 agcagatgct cgctggcttg taactcagtc tcttactagg agaccagctt gagcctgggt 541 gttcgctggt tagcctaacc tggttggcca ccaggggtaa ggactccttg gcttggaaag 601 ctaataaaca ttgcctgcat tagagcttat ccgagtcaag tgccctcatt gacgcctcac 661 tcaagcaggg gaaccgttcc ttactgggtt ctctctctga cccaggcgag agaaactcca 721 gca // LOCUS SIVMNDGB1 9215 bp ss-RNA VRL 15-AUG-1990 DEFINITION Simian immunodeficiency virus (SIV) gag, pol, vif, vpR, tat, rev, env and nef genes. ACCESSION M27470 X15781 KEYWORDS . SOURCE Simian immunodeficiency virus (isolate GB1) from African mandrill. ORGANISM Simian immunodeficiency virus Viridae; ss-RNA enveloped viruses; Positive strand RNA virus; Retroviridae; Lentivirinae. REFERENCE 1 (bases 1 to 9215) AUTHORS Tsujimoto,H., Hasegawa,A., Maki,N., Fukasawa,M., Miura,T., Speidel,S., Cooper,R.W., Moriyama,E.N., Gojobori,T. and Hayami,M. TITLE Sequence of a novel simian immunodeficiency virus from a wild-caught African mandrill JOURNAL Nature 341, 539-541 (1989) STANDARD full staff_entry COMMENT The mandrill virus is distinct from all other primate immuno- deficiency viruses, thus it can be regarded as a type 4 virus. There is neither a vpX nor a vpU coding sequence. The splice sites and coding regions for tat and rev are tentative. FEATURES from to/span description pept 450 1958 gag polyprotein pept < 1745 4774 pol (NH2-terminus uncertain; AA at 1745) pept 4728 5246 vif pept 5227 5541 vpR protein pept 5471 5730 tat protein, exon 2 (first expressed exon) 7950 8037 tat protein, exon 3 (AA at 7951) pept 5590 5677 rev protein, exon 2 (first expressed exon) 7950 8167 rev protein, exon 3 (AA at 7952) pept 5661 8126 env polyprotein pept 8170 8814 nef binding 275 291 primer (Lys-tRNA) binding site signal 9191 9196 poly-A signal BASE COUNT 3323 a 1478 c 2196 g 2218 t ORIGIN 1 ggagtctcta ctacagaggc taagggttgt atctctgagc agatcccctt agagcaagga 61 ccagagtcct gagtgactgg gtctgagcac ctcactcggg gctgatcacc tcgaggtagt 121 ggaactcctt gcttgcttgc tattgtcttc aataaagtaa cttagaatta gagcaagtga 181 gtaagtgtta tccattgtgc gcctctcttc taaacctgtt gtgttctcat ttagagaaca 241 gaaggacttc tagttaaccc tagaagcctt tcagtggcgc ccgaacagga cttgaagaga 301 ggcactgaca cttgaggcag agcactccgc ctggaagaag caggttgaag gagagtggac 361 tggtctgaag acgccaggag gtgagtcagt gggactgact ttacaagaat tagttgtacc 421 ctagtgtaag gggcagcata gtcagagcaa tgggtaatgg gaactctgcc ttgttaggga 481 ctgatttgga taaatttgag aaaataagat taaagagagg tggtaaaaaa tgttatagat 541 tgaaacacct ctgttggtgt aaaggtgaat tagatagatt tggcttatcg gataaactcc 601 ttgaaacaca gcaaggatgt gaaaaaatcc tctcagtatg ttggccatta tatgaccaag 661 gatcagataa tctaaaagct ttggtaggga cagtctgtgt tgtagcctgc atacacgcag 721 gtatagaaat taagagcaca caagatgctt taaaaaaatt aaaagtcata acaagaaagg 781 aagaaaagca ggaggatgaa agtaagaatt tccctgtaca aagggatgca gcaggacagt 841 atcagtatac tccaataagt cctaggatta tacagacatg ggtaaaaaca gtggaagaaa 901 agaagtggaa accggaggtc atccctctat tctcagcatt gacagaagga gcaatcagtc 961 atgatttgaa tatcatgctg aatgcagtag gagatcatca gggagcaatg caagtcttaa 1021 aagatgtaat taatgagcaa gcagcagaat gggatctaac acatcctcaa caacaaccag 1081 cacaaccagg aggaggatta aggacccctt caggctctga tatagcagga actacttcta 1141 cagtggaaga acaattggca tggatgaata tgcaacaaaa tgcaatcaat gtaggaacaa 1201 tctataagag ttggattata ctgggcatga atagattggt aaaaagtcat tgtccaataa 1261 gtataacaga tgtaagacag ggaccaaagg aagcttttaa agactatgta gatagattct 1321 acaatgtaat gagagcagaa caagcttcag gagaagtaaa gatgtggatg cagcagcatc 1381 tgcttataga aaatgcaaac ccagaatgca agcagatttt gagaagctta gggaaaggag 1441 caactttaga ggaaatgttg gaagcatgtc agggagtagg tgggccacaa cataaagcca 1501 gattaatggc agaaatgatg agaacagtgg taggacaatc acaaaatttt gtgcagcaga 1561 gagggcctca aagaggacca gttagacaac ctactggaag gaaacctatc tgcttcaact 1621 gtaataaaga agggcatgta gcaaggttct tcaaggcccc tagaaggaaa gggtgctgga 1681 attgtggagc aatggatcat cagaaagctc aatgccctaa gccagctcag cagcagaggg 1741 ttaatttttt agggtatggc ccttggggtc cctccaaacc ggggaattat ccggcacaag 1801 aggtgactcc aacagctcca ccattagagg agaaacctct gcagaaaact ctgagcactt 1861 atcagaaatt agggagaggg ctcaggcaga agatgaagga ggagaagaga gaggaggatt 1921 ttcattccct gagtactctc tttcaagaag accaatagaa gaggtctcag tggatggtgt 1981 cactataaga gctctactag atacaggagc tgatgatacc atctttaatg aaagaaatat 2041 aaaattaaaa ggaaattggc agccaaaaat tataggggga ataggtggaa acttaagagt 2101 aaaacagtat gataatgtat atgtagaaat aagagggaag ggaacatttg ggacagtatt 2161 gataggacct actccaatag atataatagg gagaaacata atggaaaaat taggaggaaa 2221 attaatattg gcacaattgt ctgataaaat accaataaca aaagtgaaat taaaaccagg 2281 agtagatgga cccagaataa aacaatggcc tttaagtaaa gagaaaatag ttggtcttca 2341 gaaaatatgt gatagattag aggaggaagg aaaaattagt agggtagatc caggaaataa 2401 ttacaataca cctatctttg ccataaagaa gaaggataaa aatgaatgga gaaaattaat 2461 agactttaga gaattaaaca agttaacaca ggattttcat gaattacagt taggtatacc 2521 tcacccagca ggaataaaaa agtgtaaaag aataacagtc ctagatatag gggatgccta 2581 ttttagtata cctctggatc cagattatag accctatact gcctttacgg taccatcagt 2641 taataatcaa gcaccaggaa aaagatacat gtataatgtt cttcctcaag ggtggaaggg 2701 aagtccatgt atctttcaag ggacagtagc atcactgctg gaggtattta gaaagaacca 2761 tccaacagta cagttatatc aatacatgga tgatttgttt gtagggtcag actatacagc 2821 agaagagcat gagaaagcta tagtagaatt aagggcttta ttaatgacat ggaacttaga 2881 aacacctgaa aagaaatatc agaaagaacc tccctttcat tggatggggt atgagttaca 2941 cccagataag tggaagatag aaaaggttca actaccagaa ttagcagaac agccaacagt 3001 aaatgaaata cagaaattgg taggtaaatt aaattgggct gcacagttat atcctgggat 3061 caaaacaaaa caactgtgca agctaataag aggaggacta aacataacag agaaagtcac 3121 aatgacagaa gaagcaagac tggaatatga acaaaataaa gagatcttgg ctgaagaaca 3181 agaagggtct tattatgatc ctaataagga attatatgta agatttcaga aaacaacagg 3241 aggagatata tcatttcaat ggaagcaagg aaataaggtt ttaagagcag ggaaatatgg 3301 gaaacagaaa acagcacata gtaatgacct catgaaattg gcaggtgcta cgcagaaggt 3361 aggaagagaa agtatagtaa tctggggttt tgtaccaaaa atgcagatac ccactacaag 3421 ggagatatgg gaagattggt ggcatgagta ttggcagtgt acatggatac cagaagtaga 3481 atttatcagc acacctatgt tagaaaggga atggtatagc ttgtccccag aacctctaga 3541 gggggtagaa acatattatg ttgatggagc agctaacagg gacagtaaaa tgggaaaagc 3601 aggatatatt acagatagag gttttcaaag ggtagaagaa tatctaaata ccaccaatca 3661 gcagacagaa ttacatgcag taaaactagc tctagaagat agtggaagtt atgttaacat 3721 agtaacagat tcacaatatg tagtaggtat actagcaagc agacctactg aaacagatca 3781 ccccatagta aaggaaataa tagaattaat gaaaggaaaa gaaaaaattt atttaagttg 3841 gctaccagca cacaaaggga taggagggaa tgagcaaata gataagctag taagttcagg 3901 aatcagaaaa gtcttattcc tacaaaatat agaaccagca caggaagaac atgagaaata 3961 tcatagcaat gaagcacaat taagagagaa attccactta ccagctctag tagccaaaca 4021 gattgtgcaa agttgcagta agtgctgtca tcatggagag cccataaagg gacagacaga 4081 tgcttcactt ggagtctggc agatagattg cacacatctg gaaaatcaaa ttattatagt 4141 agcagtgcat gtagcttcag gcttcatgaa ggcagaagtt ataacagcag aaactggaaa 4201 aaagacagca gagtttctgt taaagttagc agcacaatgg cctattagta aactacacac 4261 agataatggg cctaacttta ctagtcagga agtagaaacc atgtgttggt ggttagggat 4321 agaacacaca tttggaatac cctataaccc acaaagtcag ggggtagtgg aaaataaaaa 4381 taagtatcta aaagaattga ttgagaaaat aagagaagat tgcaaagaat taaaaacagc 4441 agtagccatg gccacattca ttcataattt taaacaaagg ggaggactag gggggatgac 4501 agcaggagag agaatagtaa atatgatcaa tacagaatta gaatatcaat atcaacaaaa 4561 tcaaatttca aaaaatttaa attttaaggt ttacttcaga gaaggaagag atcagctgtg 4621 gaaaggacct ggtatccttt tgtggaaagg agaaggggca gtagttttaa aatatcaaga 4681 agagataaag atagtaccta gaagaaagtg taaaataata aaagattatg gagagagtgg 4741 aaagaatagt caggttaact tggaaagtgt ctagtcagag aatagaaaag tggcactggt 4801 tagtaagaag acagatggca tgggccactg caaataatga ggaaggatgt tggtggctgt 4861 atcctcattt tatggcttat aatgaatggt atacttgcag taaagtagtg attataataa 4921 atagggacat aagattaata gttagaagct attggcattt gcaaatagag gtaggatgct 4981 taagtactta tgcagtaagc atagaagcag tagttagacc gccacccttt gagaaagagt 5041 ggtgtacaga gataactcca gaggtagcag atcatctaat acatttacat ttttatgact 5101 gcttcatgga cagtgcagtt atgaaagcca tcaggggaga agaagtgtta aaagtttgta 5161 gatttccagc tggccataaa gcacaaggtg ttctctcttt gcagtttctc tgcttgagag 5221 tcatctatgg gccagaagag agatgagcaa gtatcagaag atcaaggacc tcccagagag 5281 ccatacaatc agtggctagc agatactatg gaggaaataa aggaagaagc aagaaagcac 5341 ttccctctca ttatcctaaa tgcagtatca gaatattgtg tgcaaaacac agggagtgag 5401 gaagaggcct gtgagaaatt tattacctta atgaatagag ccatttgggt ccacctagct 5461 caagggtgtg atggaacctt cagggaaaga agaccacaac tgcccccctc aggattcagg 5521 ccaagaggag atagattata agcaactgct agaagagtat tatcagcctt tgcaagcttg 5581 tgagaataaa tgctggtgca agaaatgctg ctttcattgt atgctttgct ttcaaaagaa 5641 gggtttagga ataaggtacc atgtctacag gaaacgtgta ccaggaacta ataagaagat 5701 acctggtagt ggtgaagaag ctatacgaag gtaagtatga agtgtccagg tctttttctt 5761 atactatgtt tagcctacta gtaggtatta taggaaaaca atatgtgaca gtcttctatg 5821 gagtaccagt atggaaggaa gctaaaacac atttgatttg tgctacagat aattcaagtc 5881 tctgggtaac cactaattgc ataccttcat tgccagatta tgatgaggta gaaattcctg 5941 atataaagga aaattttaca ggacttataa gggaaaatca gatagtttat caagcatggc 6001 atgctatggg aagtatgtta gataccatac ttaagccatg tgtaaagatt aacccatatt 6061 gtgttaagat gcaatgtcag gaaacagaaa atgtatcagc aacaacagct aagcctataa 6121 ctacacctac tactacatct acagttgcaa gtagtacaga gatttactta gatgtagata 6181 aaaataatac agaagaaaag gtagagagga atcatgtatg taggtataac ataacaggac 6241 tatgcaggga ttcgaaggaa gaaatagtaa caaattttag aggggatgat gtgaaatgtg 6301 aaaataatac ttgctatatg aatcattgta atgagtcagt taatacagaa gactgtcaga 6361 agggactttt gataagatgt attttaggtt gtgtgcctcc aggatatgtc atgttaagat 6421 ataatgagaa gttaaataat aataaattgt gtagcaatat atcagcagtg cagtgtactc 6481 agcacttagt agccacagta agtagctttt ttggctttaa tggaactatg cataaggaag 6541 gagaattgat acccatagat gataaatata ggggcccaga ggaatttcat caaaggaagt 6601 ttgtctataa ggtgccagga aaatatggct taaagataga atgtcacaga aaaggaaata 6661 ggtcagtagt gagtactcca tcagctacag gattattatt ttatcatggg ttagaacctg 6721 gaaagaattt aaagaaaggc atgtgcacct tcaaaggacg ttgggggtta gcactttgga 6781 gtctagctaa agaactaaat aaattaaatg actccatcaa agtgaaccag acctgtaaaa 6841 attttactag cactggagag gagaacaaac aaaacacgga caagcaaaag gagtttgcca 6901 aatgcataaa gactcttaag atagataatt atactacatc aggagataga gcagcagaaa 6961 tgatgatgat gacatgtcaa ggtgaaatgt tcttctgtaa tgtaacaaga atcatgaggg 7021 catggaatga tcctaatgag aagaagtggt atccttatgc ctcatgtcaa attaggcaaa 7081 tagtagatga ctggatgcaa gtaggaagaa agatatattt accacctaca tcaggattta 7141 ataatcacat aaggtgtaca catagggtaa cagaaatgta ctttgaaatg caaaagatag 7201 atagtaatga aacaaaaatg caaattaaat tcttgcctcc cagtgaaacc tccaatcaat 7261 ttgttgctta tggagctcat tataaattag tcaaaataat gccaattggc atagcaccta 7321 cagatgtgaa aagacacact ttacctgaac atcataaaga gaagagagga gcagtaatac 7381 ttggtatcct tggtctgctc tcgctggcag gatccgcgat gggctcagtg tcggtggcac 7441 tgactgtcca atctcagtct ttggtgactg ggatagtgga acaacaaaaa cagttgttga 7501 agctcataga gcaacagtct gaactcttaa aactcaccat atggggagta aagaatttac 7561 agactcgcct gaccagtttg gagaattata tcaaggacca agctttgctg tctcaatggg 7621 ggtgttcatg ggcacaggtg tgtcatactt ctgtagagtg gactaataca agcatcactc 7681 caaattggac atcagaaact tggaaggaat gggagacaag aactgattat ctgcaacaaa 7741 acattacaga aatgttaaaa caggcatatg atcgagagca aagaaacaca tatgaattac 7801 agaagttagg agaccttaca tcttgggcaa gttggtttga ctttacttgg tgggttcaat 7861 acttaaaatg gggagttttc ttagtgttag gaattatagg attaagaatt ttgttagcct 7921 tatggaatac aataagtagg tttaggcagg gctatcgacc tgtcttttca caggactgcc 7981 agcagaacct ataccgcaaa cggccagaca acggagaaga agaaagcaac agcttagaac 8041 taggagagca caactccgag aacttgaagg aagaatcctt aaacagatcc ttgatagagg 8101 acctgaccag ctttgccagg gagtgaccaa tttggctttg gctgaaaaat ctgagagcag 8161 caattgaata tgggttcctc gcagtccaag aagcgatcag aagcttgggt tcgctactcg 8221 tcagctttgc ggcaattagt tggagggccg gttacaccgg atggctacaa gcaaatagaa 8281 tcttcacagg gtgcagagaa gcaatcattg ctgcggggac gtgcatatgg cacatactca 8341 gaaggattag acaaagtgca gaacgacccc ttaactaaag atgagaaact tgacttaaca 8401 cagcaggatc cagaagagga ggaagaagtt ggatttcctg tgtgtcgcca agtttcctta 8461 agagtgccat catacaaaga tctgatagac ttctctcatt ttataaaaga aaagggggga 8521 ctgggaggga tatattatag caggagaaga gaagaaatcc tagatctcta tgcagagaat 8581 gagtggggat ttgaacctgg atggcaacag tatacgacag gtccaggaac cagatatcct 8641 aagacatttg gattcctgtt taagctggaa ccagtgagca gagctatagg agatgagtat 8701 gcagctaaca atcatctgtt acactcctcc cagttatgtc ctcaggaaga tccagaagga 8761 gagaccctca tgtggtctgg gaccctcatc ttgcctatga ctttgcagca ttaacatatc 8821 accctgagtg tttcaataag gctaagagta ttgaacatct gccattttgg aagaggaagt 8881 agcctaaccg caaaaccaca tcctactgca gaactgtagt tgcttggcaa cctgcttagc 8941 aacctggact ggcgcttgcg cgctaggaag ggactttcca aacagggagg gggaggctcg 9001 ccccatgctg ctatataagc agctgcattt cgcttgttcg ggagtctcta ctacagaggc 9061 taagggttgt atctctgagc agatcccctt agagcaagga ccagagtcct gagtgactgg 9121 gtctgagcac ctcactcggg gctgatcacc tcgaggtagt ggaactcctt gcttgcttgc 9181 tattgtcttc aataaagtaa cttagaatta gagca // LOCUS SIVMNE 9628 bp ss-RNA VRL 15-AUG-1990 DEFINITION Simian immunodeficiency virus (SIV) complete proviral genome. ACCESSION M32741 KEYWORDS complete genome. SOURCE Simian immunodeficiency virus from captive Macaque nemestrina proviral DNA, clone 8. ORGANISM Simian immunodeficiency virus Viridae; ss-RNA enveloped viruses; Positive strand RNA virus; Retroviridae; Lentivirinae. REFERENCE 1 (bases 1 to 9628) AUTHORS Benveniste,R.E., Heidecker,G., Greenwood,J. and Gonda,M.A. TITLE ; JOURNAL Unpublished (1990) STANDARD full staff_entry COMMENT Kindly submitted in computer-readable form by R. Benveniste. The gag protein sequence was reported in J. Virol. 62, 2587-2595, 1988. This molecular clone, after transfection into T-cell lines, produces infectious viral particles. In particular, clone 8 has been inoculated intravenously into two pig-tailed macaques causing CD4 lymphocyte depletion; see J. Virol. 62, 2091-2101, 1988. The env cds is truncated as is the case with MM251 and MM142. Author address:R.Benveniste National Cancer Institute Frederick, MD (301-698-5836) FEATURES from to/span description pept 533 2053 gag polyprotein pept < 1708 4878 pol polyprotein (NH2-terminus uncertain) pept 4808 5452 vif protein pept 5280 5618 vpX protein pept 5619 5924 vpR protein pept 5770 6065 tat protein, exon 2 (first expressed exon) 8280 8376 tat protein, exon 3 (AA at 8281) pept 5996 6065 rev protein, exon 2 (first expressed exon) 8280 8533 rev protein, exon 3 (AA at 8282) pept.ps 6072 8779 env protein (premature stop codon) pept 6072 8276 env protein 8280 8717 env protein pept 8551 9342 nef protein site 8277 8279 env protein in-frame stop codon BASE COUNT 3294 a 1807 c 2379 g 2148 t ORIGIN 5' end of 5' LTR R region (putative mRNA start). 1 agtcgctctg cggagaggct ggcagattga gccctgggag gttctctcca gcactagcag 61 gtagagcctg ggtgttccct gctagactct caccagcact tggccggtgc tgggcagagt 121 ggctccacgc ttgcttgctt aaagacctct tcaataaagc tgccttttag aagtaagcca 181 gtgtgtgctc ccatctctcc tagtcgccgc ctggtcaact cggtactcga taataagaag 241 accctggtct gttaggaccc tttctgcttt gggaaaccga agcaggaaaa tccctagcag 301 attggcgccc gaacagggac ttgaaggaga gtgagagact cctgagtacg gctgagtgaa 361 ggcagtaagg gcggcaggaa ccaaccacgg cggagtgctc ctagaaaggc gcgggtcggt 421 accagacggc gtgaggagcg ggagagaaga ggcctccggt tgcaggtaag tgcaacacaa 481 aaaagagata gctgtctttt atccaggaag ggataataag atagagtggg agatgggcgc 541 gagaaactcc gtcttgtcag ggaagaaagc agatgaatta gaaaaaatta ggctacgacc 601 cggcgggaag aaaaagtaca tgttgaagca tgtagtatgg gcagcaaatg aattagatag 661 atttggatta gcagaaagcc tgttggagaa caaagaagga tgtcaaaaaa tactttcggt 721 cttagctcca ttagtgccaa caggctcaga aaatttaaag agcctttata atactgtctg 781 cgtcatctgg tgcattcacg cagaagagaa agtgaaacac actgaggaag caaaacagat 841 agtgcagaga cacctagtgg tggaaacagg aacagcagaa actatgccaa aaacaagtag 901 accaacagca ccatctagtg gcagaggagg aaattaccca gtacaacaag taggtggtaa 961 ctatacccac ctaccattaa gcccgagaac attaaatgcc tgggtaaaat tgatagagga 1021 gaagaaattt ggagcagaag tagtgccagg atttcaggca ctgtcagaag gctgcacccc 1081 ctatgacatt aatcagatgt taaattgtgt gggagaacat caagcagcta tgcagattat 1141 cagagaaatt ataaacgagg aggctgcaga ttgggacttg cagcacccac aacaagctcc 1201 acaacaagga cagcttaggg agccgtcagg atcagacatt gcaggaacaa ctagtacagt 1261 agatgaacaa atccagtgga tgtacagaca acagaacccc ataccagtag gcaacattta 1321 caggagatgg atccaactgg ggttgcaaaa atgtgtcaga atgtataacc caacaagcat 1381 tctagatgta aaacaagggc caaaagagcc atttcagagc tatgtagaca ggttctacaa 1441 aagcttaaga gcagaacaaa cagatccagc agtaaagaat tggatgactc aaacactgct 1501 gattcaaaat gctaacccag attgcaagct agtgctgaag gggctgggta tgaatcccac 1561 cctagaagaa atgctgacgg cttgtcaagg agtaggagga ccaggacaaa aggcaagatt 1621 aatggcagaa gccctgaaag aggcccttgc accagggcca ctcccttttg cagcagccca 1681 acagaaggga ccaagaaagc caattaagtg ttggaattgt gggaaagagg gacactctgc 1741 aaggcaatgc agaaccccaa gaagacaggg ctgctggaaa tgtggacaaa tgggccatgt 1801 tatggccaaa tgcccagaca gacaggcagg ttttttaggc tttggcccat ggggaaagaa 1861 gccccgcaat ttccccatgg cccaaatgca tcaggggctg acgccaactg ctcccccaga 1921 ggacccagct gtggatctgc taaaaaacta catgcagttg ggcaaacagc agagagaaag 1981 caaaaggaag ccttacaagg aggtgacaga ggatttgctg cacctcaatt ctctctttgg 2041 agaagaccag tagtcactgc tcatattgag ggacagcctg cagaagtatt attagataca 2101 ggggctgatg attctattgt agcaggaata gagttaggtc cacattatac cccaaaaata 2161 gtaggaggaa taggaggttt tattaatact aaagaataca aaaatgtaaa aatagaagtt 2221 ttaggcaaaa ggattaaagg gacaatcatg acaggggaca ccccgattaa catttttggt 2281 agaaatttgc taacagctct ggagatgtct ctaaatttcc ccatagctaa ggtagagcct 2341 gtaaaagtca ccttaaagcc aggaaaagat ggaccaaaat tgaggcagtg gccattatca 2401 aaagaaaaga tagttgcatt aagagaaatc tgtgaaaaga tggaaaagga tggtcagttg 2461 gaggaagctc ccccgaccaa tccatacaac acccccacat ttgccataaa gaaaaaggac 2521 aagaacaaat ggagaatact gatagatttt agggaactaa ataaggtcac tcaggacttt 2581 acagaagtcc aattgggaat accacaccct gcaggactag caaaaaggaa gaggatcaca 2641 gtactggatg taggtgacgc atatttctcc atacctctag atgaagaatt taggcagtac 2701 actgctttta ctttaccatc agtaaataat gcagaaccag gaaaacgata catttataag 2761 gttctgcctc aggggtggaa ggggtcacca gccatcttcc aacacactat gagaaatgtg 2821 ctggaaccct tcaggaaggc aaatccagat gtgaccttag tccagtatat ggatgacatc 2881 ttagtagcta gtgacaggac agacctggaa catgacaggg tagttttaca gttaaaggaa 2941 ctcttaaata gcatagggtt ttctacccca gaagagaagt tccaaaaaga tcccccattt 3001 caatggatgg ggtatgaatt gtggccaaca aaatggaagt tgcaaaagat agagttgcca 3061 caaaaagaga cctggacagt gaatgatata cagaagttag taggagtatt aaattgggca 3121 gctcaaattt atccaggtat aaaaaccaaa catctctgta ggttaattag aggaaaaatg 3181 actctaacag aggaagttca gtggactgag atggcagagg cagaatatga ggaaaataaa 3241 ataattctca gtcaggaaca agaaggatgt tattaccaag aaggcaagcc attagaggcc 3301 acggtaataa agaatcagga caatcagtgg tcttataaga ttcaccaaga agacaaaata 3361 ctaaaagtag gaaaatttgc aaagataaaa aatacacata ccaatggagt tagactatta 3421 gcacatgtaa tacagaaaat aggaaaggaa gcaatagtga tctggggaca ggtcccaaaa 3481 ttccacttac cagttgagaa agatgtatgg gaacagtggt ggacagacta ttggcaggta 3541 acctggatac cgaaatggga ttttatctca acaccaccac tagtaagatt agtcttcaat 3601 ctggtaaagg accctataaa gggagaagaa acctattatg tagatggatc atgtaataaa 3661 cagtcaaaag aagggaaagc aggatatatc acagataggg gcaaagacaa agtaaaagtc 3721 ttagaacaga ctactaatca acaagcagaa ttggaagcat ttctcatggc attggcagac 3781 tcagggccaa aggcaaatat tatagtagat tcacaatatg ttatgggaat aataacagga 3841 tgccctacag aatcagagag caggctagtt aaccaaataa tagaagaaat gattaaaaag 3901 acagaaattt atgtagcatg ggtgccagca cacaaaggta taggaggaaa ccaagaaata 3961 gaccacctag ttagtcaagg gattagacaa gttctcttct tggaaaagat agagccagca 4021 caagaagaac atgataaata ccatagtaat gtaaaagaat tggtattcaa atttggatta 4081 cccagactag tggccaaaca gatagtagac acatgtgata aatgtcatca gaaaggagaa 4141 gctatacatg ggcaggtaaa ttcagatcta gggacttggc aaatggattg tacccatcta 4201 gagggaaaaa taatcatagt tgcagtacat gtagctagtg gattcataga agcagaagta 4261 attccacaag agacaggaag acagacagca ctatttctgt taaaattggc aagcagatgg 4321 cctattacgc atctacacac agataatggt gccaactttg cttcgcaaga agtaaagatg 4381 gttgcatggt gggcagggat agagcacacc tttggggtac catacaatcc acagagtcag 4441 ggagtagtgg aagcaatgaa tcaccatcta aaaaatcaaa tagatagaat cagggaacaa 4501 gcaaattcaa tggaaaccat agtattaatg gcagttcatt gcatgaattt taaaagaagg 4561 ggaggaatag gggatatgac tccagcagaa agattactta acatgatcac tacagaacaa 4621 gaaatacaat tccaacaatc aaaaaactca aaatttaaaa attttcgggt ctattacaga 4681 gaaggcagag atcagctgtg gaaaggacct ggtgagctat tgtggaaagg ggaaggagca 4741 gtcgtcttaa aggtagggac agacattaag gtagtaccca gaagaaaggc taagattatc 4801 aaagattatg gaggaggaaa agaggtggat agcagttccc acatggagga taccggagag 4861 gctagagagg tggcatagcc tcataaaata tctgaaatat aaaactaaag atctacaaaa 4921 ggtttgctat gtgccccatc ataaggtcgg atgggcatgg tggacctgca gcagagtaat 4981 cttcccacta caagaaaaaa gccaattaga agtacaaggg tattggaatt tgacaccaga 5041 aagagggtgg ctcagtactc atgcagtgag aataacctgg tactcaagga acttttggac 5101 agatgtaaca ccagactgtg cagacatttt actgcatagc acttatttcc cttgctttac 5161 agcgggagaa gtgagaaggg ccatcagggg agaacaactg ctgtcttgct gcaggttccc 5221 gagagctcat aagacccagg taccaagtct acagtactta gcactgagag tagtaagtta 5281 tgtcagatcc cagagagaga atcccacctg gaaacagtgg agaagagaca ataggagaag 5341 ccttcgaatg gctaaacaga acagtagagg agataaacag agaggcagta aaccacctac 5401 caagggagtt gattttccag gtttggcaaa ggtcttggga atactggcat gatgaacaag 5461 ggatgtcgca aagctatgta aagtacagat acttgtgttt aatacaaaag gctttattta 5521 tgcattgcaa gaaaggctgt agatgtctag gggaaggaca tggggcaggg ggatggagac 5581 caggacctcc tcctcctccc cctccaggac tagcataaat ggaagaaaga cctccagaag 5641 atgaaggccc acaaagggaa ccatgggatg aatgggtagt ggaggttctg gaggaactga 5701 aagaagaagc tttaaaacat tttgatcctc gcttgctaac tgcgcttggt aatcatatct 5761 ataatagaca tggagacacc cttgagggag caggagaact cattaaaatc ctccaacggg 5821 cgctcttcat gcacttcaga ggcggctgca cccactctag aatcggccaa tctggaggag 5881 gaaatcctct ctcaactata ccgccctcta gaagaatgct ataacacatg ctattgcaaa 5941 aagtgttgct accattgcca gttttgtttt cttaaaaagg gcttggggat atgttatgag 6001 cagtcacgca gaagaagaag aactccgaag aaggctaagg ctaatacatc ttctgcatca 6061 aacaagtaag tatgggatgt cttgggaatc agctgcttat cgccatcttg tttctaagtg 6121 cctatgggat ctattgcatt caatatgtca cagtctttta tggtgtacca gcttggagga 6181 atgcgacaat tcccctcttc tgtgtaacca ggaataggga tacttgggga acaactcagt 6241 gcctaccaga taatgatgat tattcagaat tggcccttaa tattacagaa agctttgatg 6301 cttgggagaa tacagtcaca gaacaggcaa tagaggatgt atggcatctc tttgagacct 6361 caataaagcc ttgtgtaaaa ttaaccccat tatgcattac tatgaaatgc aacaaaagtg 6421 agacagataa atggggattg acaaaatcat caacaacaac agcaccaaca gcaataccaa 6481 caaaagcaga ggcaataaaa gtggtcaatg agaatagtcc ttgtataaat catgataatt 6541 gcacaggctt ggaacaagag ccaatgataa gctgtaaatt caacatgaca gggttaaaaa 6601 gagacaagag aagagagtac aatgaaactt ggtactctgc agatttggtt tgtgaacaag 6661 gtaatagcac tgaaaatgaa agtagatgtt acatgaatca ctgtaacact tctgttattc 6721 aagaatcttg tgacaaacat tattgggatg ctattagatt taggtattgt gcacctccag 6781 gttatgcttt gcttagatgt aatgacacaa attattcagg ctttatgcct aactgttcta 6841 aggtggtggt ctcttcatgc acaagaatga tggagacaca gacttctact tggtttggct 6901 ttaatggaac tagagcagaa aatagaactt atatttactg gcatagcaaa gataatagga 6961 ctataattag tttgaataag tattataatc taacaatgaa atgtagaaga ccaggaaata 7021 agacagtttt accagtcacc atcatgtctg gattggtttt ccactcacaa ccaatcaatg 7081 ataggccaaa acaggcatgg tgtaggtttg aaggaaattg gaaggaggca ataaaagagg 7141 taaagcagac cattgtcaaa catcccaggt atactggaac taacaatact gataaaatca 7201 atttgacggc tcctggagga ggagatccgg aagttacctt catgtggaca aattgcagag 7261 gagagtttct ctactgtaaa atgaattggt ttctaaattg ggtagaagat aagaatctga 7321 ctggaactac ccagaagcca caggaacggc ataaaaggaa ttacgtgcca tgtcatatta 7381 gacaaataat caacacttgg cataaagtag gcagaaatgt ttatttgcct ccaagagagg 7441 gagacctcac gtgtaattcc acagtgacca gtctcatagc aaacatagat tggattgatg 7501 gaaaccaaac taatatcacc atgagtgcag aggtggcaga actgtatcga ttggaattgg 7561 gagattataa attagtagag atcactccaa ttggcttggc ccccacaaat gtgaagaggt 7621 acactactgg tggcacctca agaaataaaa gaggggtctt tgtgctaggg ttcttaggtt 7681 ttctcgcaac ggcaggttct gcaatgggcg cggcgtcgtt gacgctgacc gctcagtccc 7741 ggactttatt ggctgggata gtgcagcaac agcaacagct gttggacgtg gtcaagagac 7801 aacaagaatt gttgcgactg accgtctggg gaacaaagaa cctccagact agagtcactg 7861 ccatcgagaa gtacttaaag gaccaggcgc agctaaatgc ttggggatgt gcatttagac 7921 aagtctgcca tactactgta ccatggccaa atgcaaatct aacaccaaat tggaacaatg 7981 agacttggca agagtgggag cgaaaggttg acttcttgga ggaaaatata acggcccttt 8041 tagaagaggc acaaattcaa caagaaaaga acatgtatga attacaaaag ttgaatagct 8101 gggatgtgtt tggcaattgg tttgaccttg cttcttggat aaggtatata caatacggag 8161 tttatatagt tgtaggagta atactgttaa gaatagtgat ctatatagta caaatgctag 8221 ctaagttaag gcaagggtat aggccagtgt tctcttcccc accttcttat ttccagtaga 8281 cccatatccg acaggaccag gcactgccaa ccaaagaagg aacagaagga gacggtggag 8341 gcagcggtgg caacagctcc tggccttggc agatagaata tattcatttc ctgatccgcc 8401 aactaatacg cctcttgact tggttattca gcaactgcag aaccttgcta tcgagagcat 8461 accagatcct ccaaccaata ttccagagat tctccacgac cctacagaga atccgagaag 8521 tcctcaggac tgaactaacc tacctacaat atgggtggag ctacttccaa gaggcggtcc 8581 aagtcgcctg gagatctgcg acagagactc ttgcgggcgc gtggggagac ttatgggaga 8641 ctctgggaag agttggaaga tggatactcg caatccctag gaggatcaga caagggctcg 8701 agcttactct cttgtgaggg acagaaatac aatcagggac agtttatgaa tactccatgg 8761 aaaaacccag ctggagagag ggaaaaatta gcatacagaa aacaaaatat agatgatata 8821 gatgaagaag ataatgactt ggtaggggta ccagtgaggc cacgagttcc cttaagaata 8881 ataagttaca aattggcagt agatatgtct cattttataa aagaaaaggg gggactggaa 8941 gggatttatt acagtgaaag aagacataaa atcttagaca tgtacttaga aaaggaagaa 9001 ggcatcatgc cagattggca gaattacacc tcgggaccag gacctagata cccaaagaca 9061 tttggctggc tatggaaatt agtccctgta aatgtatcag atgaggcaca ggagggtgag 9121 gagaattatt tactgcatcc agctcaaact tcccagtggg atgacccttg gggagaggtt 9181 ctagtatgga agtttgatcc aactctagcc tacacttatg aggcatatat tagataccca 9241 gaagagtttg gaagcaagtc aggcctgtca gaggaagagg ttagaagaag gctaaccgca 9301 agaggcctct taaaaatggc tgacaagagg gaaactagct gagacagcag ggactttcca 9361 taaggggatg tcatggggag gtactgggga ggagccggtc gggaacaccc actttcttga 9421 tgtataaata tcactgcatt tcgctctgta ttcagtcgct ctgcggagag gctggcagat 9481 tgagccctgg gaggttctct ccagcactag caggtagagc ctgggtgttc cctgctagac 9541 tctcaccagc acttggccgg tgctgggcag agtggctcca cgcttgcttg cttaaagacc 9601 tcttcaataa agctgccttt tagaagta // LOCUS SIVAGM677 2438 bp ss-RNA VRL 15-AUG-1990 DEFINITION Simian immunodeficiency virus LTR and gag gene, complete cds. ACCESSION M29973 KEYWORDS . SOURCE Simian immunodeficiency virus (isolate 677,(gri-1)) from African green monkey. ORGANISM Simian immunodeficiency virus Viridae; ss-RNA enveloped viruses; Positive strand RNA virus; Retroviridae; Lentivirinae. REFERENCE 1 (bases 1 to 2438) AUTHORS Johnson,P.R., Fomsgaard,A., Allan,J., Gravell,M., London,W.T., Olmstead,R.A. and Hirsch,V.M. TITLE Simian immunodeficiency viruses from African green monkeys display unusual genetic diversity JOURNAL J. Virol. 64, 1086-92 (1990) STANDARD full staff_entry COMMENT Kindly submitted prior to publication by P. Johnson, Georgetown University, Rockville MD (301-496-2976). The remainder of this complete genomic sequence will become available later in 1990. The gri-1 isolate is from a monkey imported from Ethiopia. Author address:P.Johnson Georgetown University Rockville, MD (301-496-2976) FEATURES from to/span description pept 897 2438 gag polyprotein LTR 1 688 5' LTR rpt 461 588 R repeat 5' copy binding 689 706 primer (Lys-tRNA) binding site BASE COUNT 752 a 503 c 689 g 494 t ORIGIN 1 tggatgggat atattactct gaaagaagag aaaagatcct gaatttgtat gccttgaacg 61 agtggggaat aatagatgat tggcaagctt actcaccagg cccggggata aggtacccga 121 gagtctttgg cttctgcttt aagctagtcc cagtggacct gcatgaggag gcacgcaact 181 gtgagagaca ctgtctgatg catccagcac agatggggga agatcctgat ggaatagatc 241 atggagaagt cttggtctgg aagtttgacc cgaagttggc ggtggagtac cgcccggaca 301 tgtttaagga catgcacgaa catgcaaagc gctagtgtca gcactttgcg gttgggactt 361 tccgccaggg actttccaca gtgggtggat cggaggcggt acaggggcgg tactgggagt 421 ggctttcccc tcagagctgc ataaaagcag atgctcgctg gcttgtaact cagtctctta 481 ctaggagacc agctagagcc tgggtgttcg ctggttagcc taacccggtt ggccaccggg 541 ggtaaggact ccttggcttc atatagctca ataaacctgc tcgcttagtc gctatattgg 601 agtcaagtgc tcattgctgc gccgagcctc tagaggtgaa cctctcttac tgggttctcc 661 tgtacccagg tgggagaaac tccagcagtg gcgcccgaac agggacttga gaagaggcat 721 cggcaccgac cgctgagttg ctgagcgtcg gagagggacg actcaggtag ggtgagagcc 781 tacgagtttt ttgctaccta gtcagcgaga aaggctaggc cgcgacaggg gcgcgggtcc 841 cattagtggc aaccaaccca gttggacgaa gggttggtag gggacgggtc ggagcaatgg 901 gcgggggtca ctcagcactg tcagggagaa gcctcgacac gttcgagaag attaggctac 961 gtccgaacgg gaaaaagaag taccaaatta aacatttaat atgggcagga aaagaaatgg 1021 aacgatttgg gttacatgag aaacttttag aaacaaaaga aggctgtcaa aaaatcatag 1081 aagttttaac cccgttggaa ccgacaggct ccgaggggct aaaagctctg tttaatttgt 1141 gctgcgtcat ttggtgcatt cacgcagaac agaaagtgaa agacacagag gaagctgtag 1201 taacagttaa gcaacactac catctagtgg acaaaaatga gaaagcagct aaaaagaaaa 1261 atgagacaac agcgccacct ggtggcgaat caagaaatta cccagtagta aatcagaata 1321 atgcctgggt acaccagcct ttgtctccgc gcacgttaaa tgcgtgggtc aaatgcgtgg 1381 aggaaaaaag gtggggagca gaagtagtcc ccatgttcca agcactctca gagggatgtc 1441 tctcctatga tgtaaatcag atgctcaatg taataggaga ccatcagggg gcattacaaa 1501 ttcttaagga agtcattaat gaagaagcag cagagtggga caggacacac agaccaccag 1561 ctggcccgtt accagcaggg cagctaagag acccgacagg gtcagatata gcaggaacta 1621 ccagctcaat tcaggaacaa atagagtgga ccttcaatgc caatccaaga atagacgtag 1681 gggcacaata cagaaaatgg gttattttgg gcttacaaaa ggtagtgcag atgtacaatc 1741 cccaaaaggt cctagacatt cgacagggac ctaaagaacc cttccaggac tatgtagaca 1801 gattctataa agccctgaga gcagaacaag caccacagga tgttaaaaat tggatgacac 1861 aaactttgct tatccagaat gccaatccgg attgtaaatt gattctgaaa ggattgggaa 1921 tgaatccaac cttggaggaa atgctaatag cttgccaggg agtaggaggg ccacaacata 1981 aggctaagct aatggtagaa atgatgagta atggacagaa tatggtccaa gtgggacctc 2041 agaaaaaggg cccccgaggg ccgctaaaat gctttaattg tggcaaattt ggacatatgc 2101 aaagggaatg caaggcacca agacagatca aatgctttaa gtgcggcaaa attggccata 2161 tggcaaaaga ctgcaagaat ggacaggcaa attttttagg gtatggccat tggggaggag 2221 cgaaaccaag aaattttgtg caatacagag gagacacagt tggtctggaa ccaacagccc 2281 ccccaatgga aacagcttac gatccagcaa agaagctcct ccagcagtat gcagagaagg 2341 gacagcgcct gagagaggag agagaacaga caaggaaaca gaaggagaaa gaagtggagg 2401 atgtttcctt gagctccctc tttggaggag accaatga // LOCUS BOVMHDQBQ1 624 bp ds-DNA MAM 15-AUG-1990 DEFINITION Bovine MHC class II DQ-beta gene, exon 2. ACCESSION M30008 KEYWORDS cell surface glycoprotein; class II gene; integral membrane protein; major histocompatibility protein. SEGMENT 1 of 2 SOURCE Bovine (Holstein individual 2042) DNA, clone Q1. ORGANISM Bos taurus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Artiodactyla; Ruminantia; Pecora; Bovidae. REFERENCE 1 (bases 1 to 624) AUTHORS Groenen,M.A.M., Van der Poel,J.J., Dijkhof,R.J.M. and Giphart,M.J. TITLE The nucleotide sequence of bovine MHC class II DQB and DRB genes JOURNAL Immunogenetics 31, 37-44 (1990) STANDARD full staff_review COMMENT Draft entry and printed sequence for [1] kindly submitted by M.A.M.Groenen, 20-NOV-1989. FEATURES from to/span description pept / 172 + 438 MHC DQ-beta cell surface glycoprotein, exon 2 (AA at 174) pre-msg < 1 > 624 MHC DQ-beta mRNA and introns IVS < 1 171 MHC DQ-beta intron A IVS 439 > 624 MHC DQ-beta intron B BASE COUNT 103 a 192 c 243 g 86 t ORIGIN Chromosome 23. 1 cccgggttca cagcgggagg cgcagggccg ggctggagcg caacaggggt tgagaggcgg 61 cgggtttcag gtttagggac cctctggcgg cggcggcacc tccccatctg gccgagcggc 121 gccgcgtggg gctgtggggc tgagcctgac cgagcggctg tctccccgca gaggatttcg 181 tggtccagtt taagggcctg tgttacttca ccaacgggac ggagcgagtg cggctcgtgg 241 tcagacacat ctacaaccgg gaggagtacg cgcggtttga cagcgacgtg aacgagtacc 301 gggcggtgac ctctggggcg ccgcacgccg agtactggaa cagccagaag gacctcctgg 361 agcagaggcg ggccgaggtg gacagggtgt gcagacacaa ctaccaggtg gctgccccct 421 tcacctggca gcggctaggt gagtacgggc tgccctccgc gggcccgccc tccacccgag 481 actcagcgcg ggagggggcc gggtctccag ggcggggttc ccaggcccgc atagggacag 541 ggaggccggg gcttcgcgga ggggcaggga ccgacgctcc gcggaaatgg acactcgcag 601 ccctggacct ctccccgcag aggc // LOCUS BOVMHDQBQ2 1151 bp ds-DNA MAM 15-AUG-1990 DEFINITION Bovine MHC class II DQ-beta gene, exons 3 and 4. ACCESSION M30007 KEYWORDS cell surface glycoprotein; class II gene; integral membrane protein; major histocompatibility protein. SEGMENT 2 of 2 SOURCE Bovine (Holstein individual 2042) DNA, clone Q1. ORGANISM Bos taurus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Artiodactyla; Ruminantia; Pecora; Bovidae. REFERENCE 1 (bases 1 to 1151) AUTHORS Groenen,M.A.M., Van der Poel,J.J., Dijkhof,R.J.M. and Giphart,M.J. TITLE The nucleotide sequence of bovine MHC class II DQB and DRB genes JOURNAL Immunogenetics 31, 37-44 (1990) STANDARD full staff_review COMMENT Draft entry and printed sequence for [1] kindly submitted by M.A.M.Groenen, 20-NOV-1989. FEATURES from to/span description pept + 113 394 MHC DQ-beta cell surface glycoprotein, exon 3 870 / 980 MHC DQ-beta cell surface glycoprotein, exon 4 pre-msg < 1 > 980 MHC DQ-beta mRNA and introns IVS < 1 112 MHC DQ-beta intron B IVS 395 869 MHC DQ-beta intron C BASE COUNT 243 a 310 c 324 g 274 t ORIGIN Chromosome 23, about 3.7 kb after segment 1. 1 tggaatccgg ggatcttcct actctggaac cgaggaagga ctcttctcca tgggagacgt 61 gctgtgcggt ctcatgtctc actgtgtctt ttcctgtctg ttcctccctc agtggaacct 121 acagtgacca tctccccgtc caggactgag gctctaaacc accacaacct gctggtctgc 181 tcggtgacag atttctatcc gggccagatc aaggttcggt ggttccggaa tgaccgggag 241 gagacagctg gtgttgtgtc cacccctctt attaggaacg gggactggac cttccagatc 301 ctcgtgatgc tggaaatgac cccccagcga ggagatgtct acacctgccg cgtggagcac 361 cccagcctcc agagtcccat ctcagtggag tggcgtaagg gcacttggtc tcctttcact 421 gtgggcccta caggataggg cagacagagc ttcccgggtt catcccatct cacctctagt 481 ccccagcatc cctactgaaa tcagaggaca caagagtgct catacctcat agcaggggca 541 ttggaagagc ctagttacat tgtctttcca gatacgggag ctcactcaca caccatggcc 601 ccagagcccc acccagggag ctctgcagga gtgacaggtc caaggttatg catgtgtcct 661 tgaggggcag ggattggctt tctctgctta ttcaccttcc cagtctgtcc aaggatcttt 721 tgctgggtcc ctcacctggg ggtggttaga atgaagaact gagttcccct ggtacttcca 781 cttcctgtac ctcagactgg acttcaggat tctcaaggga cactgtggga tgtggagaca 841 aatgctgaca ctcaggctct gctccccagg ggcgcagtct gaatctgccc agagcaagat 901 gctgagtggt gttgggggct tcgtgctggg gctgatcttc ctcgggctgg gcctcattat 961 ccgtcacagg agccagaagg gtaaggagct ctggggacat ggggaagact ttgactggga 1021 ccttcttctc agggaggctc tagatgtagc tcttttccct gaccctgaca taaaggaggt 1081 taaggtggtg gcaggaagaa acaagcaacc tagggagaga ctgaagtctt actttactga 1141 ttgaaaggta g // LOCUS BOVMHDQBY1 779 bp ds-DNA MAM 15-AUG-1990 DEFINITION Bovine MHC class II DQ-beta gene, exon 1. ACCESSION M30006 KEYWORDS cell surface glycoprotein; class II gene; integral membrane protein; major histocompatibility protein. SEGMENT 1 of 4 SOURCE Bovine (Holstein individual 2042) DNA, clone Y1. ORGANISM Bos taurus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Artiodactyla; Ruminantia; Pecora; Bovidae. REFERENCE 1 (bases 1 to 779) AUTHORS Groenen,M.A.M., Van der Poel,J.J., Dijkhof,R.J.M. and Giphart,M.J. TITLE The nucleotide sequence of bovine MHC class II DQB and DRB genes JOURNAL Immunogenetics 31, 37-44 (1990) STANDARD full staff_review COMMENT Draft entry and printed sequence for [1] kindly submitted by M.A.M.Groenen, 20-NOV-1989. FEATURES from to/span description pept 467 + 575 MHC DQ-beta cell surface glycoprotein, exon 1 pre-msg 467 > 779 MHC DQ-beta mRNA and introns IVS 576 > 779 MHC DQ-beta intron A signal 351 357 CAAT box signal 384 391 TATA box site 292 304 X box site 324 333 Y box BASE COUNT 204 a 179 c 164 g 232 t ORIGIN Chromosome 23. 1 ggatcctgaa gggctacagt ccatggggtc gtaaagagta gaacacaact cattaattaa 61 cactttcact tttattttcc catacctcaa attctaagaa caacaggttt taaataaata 121 tcacagaaat atctactctt gaatcatttt ttttcattat ttaaactcct aaggcattca 181 atattcagat attttataac tgagagaaca ttttcatctc tatccagtgt aatttgatta 241 ggacacagtg ccaggcatta gattaagaac cttcaaaaaa aaaatgtcta cccagaaaca 301 gatgaagttt ttccgctcca ctgctgattg gtcccttttc tagggactct ccaatcttgc 361 catacatgga agctctcata ggctttttat tctgtgaagt aggctcacca gatccactgt 421 gtttgagctg tgttgactac cattagttct tcctttgttc tcaattatgt ttgggatggt 481 ggctctgcgg atccccagag ccctctggac agcagttgtg atggtgaccc tggtgatgct 541 gagcacccca ggggctgagg gcagagactc accaagtaag tgcagggcag ctgctccctg 601 gagccaccac actggggagc aggctctgag ggacccttgg gctggggtgt gatcttggga 661 tactgtcttt tatcacacat ttcctcccat tgggaatgag ggctatgtta cattctcatt 721 tccaccctct aaggacaagg tgaggacaat tcccctccca caggtttaac cctgggaat // LOCUS BOVMHDQBY2 977 bp ds-DNA MAM 15-AUG-1990 DEFINITION Bovine MHC class II DQ-beta gene, exon 2. ACCESSION M30005 KEYWORDS cell surface glycoprotein; class II gene; integral membrane protein; major histocompatibility protein. SEGMENT 2 of 4 SOURCE Bovine (Holstein individual 2042) DNA, clone Y1. ORGANISM Bos taurus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Artiodactyla; Ruminantia; Pecora; Bovidae. REFERENCE 1 (bases 1 to 977) AUTHORS Groenen,M.A.M., Van der Poel,J.J., Dijkhof,R.J.M. and Giphart,M.J. TITLE The nucleotide sequence of bovine MHC class II DQB and DRB genes JOURNAL Immunogenetics 31, 37-44 (1990) STANDARD full staff_review COMMENT Draft entry and printed sequence for [1] kindly submitted by M.A.M.Groenen, 20-NOV-1989. FEATURES from to/span description pept + 559 + 825 MHC DQ-beta cell surface glycoprotein, exon 2 pre-msg < 1 > 977 MHC DQ-beta mRNA and introns IVS < 1 558 MHC DQ-beta intron A IVS 826 > 977 MHC DQ-beta intron B BASE COUNT 191 a 264 c 338 g 182 t 2 others ORIGIN Chromosome 23, about 0.9 kb after segment 1. 1 actggcgcaa ctgttggaag gcgatcggtg cgggcctctt cgctattagc cagctggacg 61 aaagggggat gtgctgcaag gcgattaagt tgggtaacgc cagggttttc ccagtcacga 121 cgttgtaaaa cgacgccagt gccaagctta attctacagg tcctttctca tcccttgaac 181 tctcctgttg tcgtttgtct ctgaggttcc caggagttca gggtaaaatg ggatttaatg 241 tgagaatctt ttaagtatag agatggatgc aaaatcaacc tgccgccctg tttacttgat 301 tctgagcctc tagggatcac aggtcctagg gctctctcag cgtcaggcct cctcacatcc 361 tgggagccct cagagggggc ggnaagcccg ggttcacagc gggaggcgca gggccgggct 421 ggagcggaac agggtttgag aggcggctgg tttcaggttt aaagaccccg tggcggcggc 481 ggcacctccc catctggccg agcggcgccg cgtggggctg tggggctgag cctgacagag 541 cggctgtctc ccccgcagag gatttcgtgg tccagtttat gggccagtgt tatttcacca 601 acgggacgga gcgggtgcgg tacgtgacca gatacatcta caaccaggag gagtacgcgc 661 gcttcgacag cgactggggc gagtaccggg cgctgacccg ctggcggccg gccgccgagt 721 actggaacag ccagaaggac atcctggagc agacgtgggc cgaggtggac agggtgtgca 781 gaaacaacta ccaggtggaa gcccccttca cctggcagcg gcaaggtgag tgccggnctc 841 tccgcggggc cgccctccac ccgccaggac ttcgcgcagg gagggactga gtcctccgag 901 gcggtcccca gaccctcgaa tgggacagag gggcgctgag ggacagggga ccgagggcac 961 agcgtatggg gcggggg // LOCUS BOVMHDQBY3 1199 bp ds-DNA MAM 15-AUG-1990 DEFINITION Bovine MHC class II DQ-beta gene, exons 3 and 4. ACCESSION M30004 KEYWORDS cell surface glycoprotein; class II gene; integral membrane protein; major histocompatibility protein. SEGMENT 3 of 4 SOURCE Bovine (Holstein individual 2042) DNA, clone Y1. ORGANISM Bos taurus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Artiodactyla; Ruminantia; Pecora; Bovidae. REFERENCE 1 (bases 1 to 1199) AUTHORS Groenen,M.A.M., Van der Poel,J.J., Dijkhof,R.J.M. and Giphart,M.J. TITLE The nucleotide sequence of bovine MHC class II DQB and DRB genes JOURNAL Immunogenetics 31, 37-44 (1990) STANDARD full staff_review COMMENT Draft entry and printed sequence for [1] kindly submitted by M.A.M.Groenen, 20-NOV-1989. FEATURES from to/span description pept + 129 + 410 MHC DQ-beta cell surface glycoprotein, exon 3 pept + 907 + 1017 MHC DQ-beta cell surface glycoprotein, exon 4 pre-msg 467 > 1199 MHC DQ-beta mRNA and introns IVS < 1 128 MHC DQ-beta intron B IVS 411 906 MHC DQ-beta intron C IVS 1018 > 1199 MHC DQ-beta intron D BASE COUNT 263 a 338 c 321 g 277 t ORIGIN Chromosome 23, about 3.7 kb after segment 2. 1 atctaaatcc aagccttgga atccaacgat ctttccactc tggtatcaag gaatgactcc 61 tgcccatggg agacatgctg tgcggtctca tgtctcactg tgtcttttcc tgtctgttcc 121 tccctcagtg gaacctacag tgaccatctc cccgtccagg acagaggctc taaaccacca 181 caacctgctg gtctgctcgg tgacggattt ctatccgggc cagatcaagg ttcggtggtt 241 ccggaatgac cgggaggaga cagccggcgt tgtgtccacc cctcttatag ggaatgggga 301 ctggaccttc cagatcctcg tgatgctgga aatgaccccc cagcgaggag atgtctacac 361 ctgccgcgtg gagcacccca gcctccagag ccccatcatg gtggagtggc gtaagggcac 421 ttggtttcct ttcactgtgg gcctaccgga cagggcagac agagcttccc ctgtccatgc 481 cctctcatcc cttgtcccca gcatcactac tgaactggaa atcacaggac acaagagtgc 541 tcatgcctcc tagcacaggc atcagaagag ccaaatcaca ttgtcttttc acatacaggg 601 agctcactgt acacatcatg gccccagagc ccagcctggt agctctgtag aactgactgg 661 tgaccatagt cttaaggtct aaggttatgg aagtgtccct gagagcaggg atccactttc 721 accttctctc acctgcccac tgtgtccaaa gatctgttgg tgggtccctc ccctggggtg 781 gtcagaatgg agagccacgt tcccctgaca cctccacctc ctgtacctca gactagacct 841 caagcttcct aaaggaatac catgagatgt ggggacaaac gctgacactc gggctctgct 901 ccccaggggc acagtctgaa tctgcccaga gcaagatgct gagtggtgtt gggggcttcg 961 tgctggggct gatcttcctc gggctgggcc tcattatccg tcacaggagc cagaagggta 1021 aggaactctg gggaaatggg aagatgggct gtgattcaga ccctctgttc agatcagcct 1081 ctgcctctga atgtagctct ttcctcctga tcctgaaacg gggaggcggg gctggggatg 1141 ggaggaaatg aacaacctag ggagacattg gagtttgact ttactagttt gaaagggta // LOCUS BOVMHDQBY4 883 bp ds-DNA MAM 15-AUG-1990 DEFINITION Bovine MHC class II DQ-beta gene, exon 5. ACCESSION M30003 KEYWORDS cell surface glycoprotein; class II gene; integral membrane protein; major histocompatibility protein. SEGMENT 4 of 4 SOURCE Bovine (Holstein individual 2042) DNA, clone Y1. ORGANISM Bos taurus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Artiodactyla; Ruminantia; Pecora; Bovidae. REFERENCE 1 (bases 1 to 883) AUTHORS Groenen,M.A.M., Van der Poel,J.J., Dijkhof,R.J.M. and Giphart,M.J. TITLE The nucleotide sequence of bovine MHC class II DQB and DRB genes JOURNAL Immunogenetics 31, 37-44 (1990) STANDARD full staff_review COMMENT Draft entry and printed sequence for [1] kindly submitted by M.A.M.Groenen, 20-NOV-1989. FEATURES from to/span description pept + 521 534 MHC DQ-beta cell surface glycoprotein, exon 5 pre-msg < 1 841 MHC DQ-beta mRNA and introns IVS < 1 520 MHC DQ-beta intron D site 263 276 MHC DQ-beta g/t cluster implicated to contribute additional information to polyadenylation BASE COUNT 200 a 201 c 226 g 256 t ORIGIN Chromosome 23, about 0.3 kb after segment 3. 1 tttgtgtcat gagatctttt gtagacattg tgacccctag cagaaggtgc tctatttctg 61 ttctgtgtca gtgggattgt gggacaggta aaggagggaa gggtgtgaga tgagtgtgcc 121 tgggcgcagt gtctcattca tgacctgttc cctgctatgg aatcaagagt tagggaagaa 181 gtttctgtag gaggttctgt aggaagctcc tgaggttgtt ccccagaacc aggccataac 241 tttgatggca cctttctgtg aaacttggag ccagagctct ggtttgaaag atagacacca 301 ggatatcacc tactttgtgc cacatgttgg tgcctactgc ctgtgggcat ttataagtga 361 ttgaatgtgg tagaaagaag gtgaactatc actgcaattt actaaaaaat tgaaatcttc 421 atatccctca gaaggacaac agctgcttcc tggcttccca tgcctccttg ttaggttgaa 481 tgtgcgtgcc tgtgtgctga tcactctctc tcttctacag ggctcatgcg ctgactcctg 541 aggatatttt gggattggtg tttgctcttc tataatgtgt gcctgatctt gcccggaatt 601 cccagattcc tgtcagcctg tcccactctg agatcagagt caggtcacca ggtcatttcc 661 cgtggccatc ccccaaccac ggatctggct gtgatgctgc ttcctccact gaccctggaa 721 tctctgcctg tgcgttgtca gctgaatcta ctcagatccc aaaagcttct gacatagaca 781 tcagaagggg gacggagagt gtccccgcta gtctttagcc cagtgtttag aagctattaa 841 tcagataaga gagacacctc aaggttgatg gagtttcacc agg // LOCUS BOVMHDRB1 459 bp ds-DNA MAM 15-AUG-1990 DEFINITION Bovine MHC class II DR-beta gene, exon 2. ACCESSION M30012 KEYWORDS cell surface glycoprotein; class II gene; integral membrane protein; major histocompatibility protein. SEGMENT 1 of 3 SOURCE Bovine (Holstein individual 2042) DNA, clone A1. ORGANISM Bos taurus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Artiodactyla; Ruminantia; Pecora; Bovidae. REFERENCE 1 (bases 1 to 459) AUTHORS Groenen,M.A.M., Van der Poel,J.J., Dijkhof,R.J.M. and Giphart,M.J. TITLE The nucleotide sequence of bovine MHC class II DQB and DRB genes JOURNAL Immunogenetics 31, 37-44 (1990) STANDARD full staff_review COMMENT Draft entry and printed sequence for [1] kindly submitted by M.A.M.Groenen, 20-NOV-1989. FEATURES from to/span description pept / 21 + 290 MHC DR-beta cell surface glycoprotein, exon 2 (AA at 23) pre-msg 21 290 MHC DR-beta mRNA and introns IVS < 1 20 MHC DR-beta intron A IVS 291 > 459 MHC DR-beta intron B BASE COUNT 108 a 92 c 169 g 90 t ORIGIN Chromosome 23. 1 gatctatcct ctctctgcag cacatttcct ggagtattct aagagcgagt gtcatttctt 61 caacgggacc gagcgggtgc ggttcctgga cagatactac actaatggag aagagaccgt 121 gcgcttcgac agcgactggg gcgagttccg ggcggtgacc gagctggggc cgcaggaccg 181 cgagtactgg aacagccaga aggacttcct ggaggagaag cgggccgagg tggacagggt 241 gtgcagacac aactacgggg gtatggagag tttcactgtg cagcggcgag gtgagcgcgg 301 gggtggactg gccagtgtgg agcagtgtgt gtgtgtgtgt gtgtgtgtgt gtgtgtgtgt 361 gtgtgagaga gagagagaga gacagagaca gagacagaga cagagataga cagacagaaa 421 cagagatact tcactcactc tggtcgagtg tgtaccgac // LOCUS BOVMHDRB2 427 bp ds-DNA MAM 15-AUG-1990 DEFINITION Bovine MHC class II DR-beta gene, exon 3. ACCESSION M30013 KEYWORDS cell surface glycoprotein; class II gene; integral membrane protein; major histocompatibility protein. SEGMENT 2 of 3 SOURCE Bovine (Holstein individual 2042) DNA, clone A1. ORGANISM Bos taurus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Artiodactyla; Ruminantia; Pecora; Bovidae. REFERENCE 1 (bases 1 to 427) AUTHORS Groenen,M.A.M., Van der Poel,J.J., Dijkhof,R.J.M. and Giphart,M.J. TITLE The nucleotide sequence of bovine MHC class II DQB and DRB genes JOURNAL Immunogenetics 31, 37-44 (1990) STANDARD full staff_review COMMENT Draft entry and printed sequence for [1] kindly submitted by M.A.M.Groenen, 20-NOV-1989. FEATURES from to/span description pept + 71 + 352 MHC DR-beta cell surface glycoprotein, exon 3 pre-msg < 1 > 427 MHC DR-beta mRNA and introns IVS < 1 70 MHC DR-beta intron B IVS 353 > 427 MHC DR-beta intron C BASE COUNT 95 a 124 c 107 g 101 t ORIGIN Chromosome 23, about 2.7 kb after segment 1. 1 ctgaaaggca gctaaccaag gagacttact ctgttgtcct cactgattcc ctccaccttt 61 tctctcctag tggagcctac agtgactgtg tatcctgcaa agactcagcc cctgcagcac 121 cacaacctcc tggtctgctc tgtgaacggt ttctacccag gccacattga agtcaggtgg 181 ttccggaacg cccatgaaga ggaggctggg gtgatctcca caggcctgat ccagaatgga 241 gactggacct tccagaccat ggtgatgctt gaaacagttc ctcagagtgg agaggtctac 301 acctgccaag tggatcaccc cagccggacg agccctatca cagtagaatg gagtgagctt 361 tctgatctca taaatccctc acccactgtg gagggggctt gctttcctct gagtgtcccc 421 tgagtgt // LOCUS BOVMHDRB3 276 bp ds-DNA MAM 15-AUG-1990 DEFINITION Bovine MHC class II DR-beta gene, exon 4. ACCESSION M30014 KEYWORDS cell surface glycoprotein; class II gene; integral membrane protein; major histocompatibility protein. SEGMENT 3 of 3 SOURCE Bovine (Holstein individual 2042) DNA, clone A1. ORGANISM Bos taurus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Artiodactyla; Ruminantia; Pecora; Bovidae. REFERENCE 1 (bases 1 to 276) AUTHORS Groenen,M.A.M., Van der Poel,J.J., Dijkhof,R.J.M. and Giphart,M.J. TITLE The nucleotide sequence of bovine MHC class II DQB and DRB genes JOURNAL Immunogenetics 31, 37-44 (1990) STANDARD full staff_review COMMENT Draft entry and printed sequence for [1] kindly submitted by M.A.M.Groenen, 20-NOV-1989. FEATURES from to/span description pept + 116 / 226 MHC DR-beta cell surface glycoprotein, exon 4 pre-msg < 1 > 226 MHC DR-beta mRNA and introns IVS < 1 115 MHC DR-beta intron C BASE COUNT 61 a 63 c 68 g 84 t ORIGIN Chromosome 23, about 0.35 kb after segment 2. 1 attctgattc ttccgggtag ccttctttcc tcattcccat agttcacaat ttcagcatca 61 caattagaga agagaatttg ggataaaaat gactaaaact ggcttctttt ctcaggggca 121 cggtctgact ctgctcagag caagatgatg agtggagtcg ggggcttcgt tctgggtctg 181 ctcttccttg ccgtggggct cttcatctac ttcaggaatc agaaaggtaa ggagcttgtt 241 ctttggacag ctgagcctcc ccactgactt ttggag // LOCUS BOVMHDRBE1 483 bp ds-DNA MAM 15-AUG-1990 DEFINITION Bovine MHC class II DR-beta pseudogene, exon 1. ACCESSION M30011 KEYWORDS cell surface glycoprotein; class II gene; integral membrane protein; major histocompatibility protein; pseudogene. SEGMENT 1 of 4 SOURCE Bovine (Holstein individual 2042) DNA, clone E4. ORGANISM Bos taurus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Artiodactyla; Ruminantia; Pecora; Bovidae. REFERENCE 1 (bases 1 to 483) AUTHORS Groenen,M.A.M., Van der Poel,J.J., Dijkhof,R.J.M. and Giphart,M.J. TITLE The nucleotide sequence of bovine MHC class II DQB and DRB genes JOURNAL Immunogenetics 31, 37-44 (1990) STANDARD full staff_review COMMENT Draft entry and printed sequence for [1] kindly submitted by M.A.M.Groenen, 20-NOV-1989. FEATURES from to/span description pept.ps 167 + 236 pseudo-MHC DR-beta, exon 1 pre-msg < 167 > 483 pseudo-MHC DR-beta mRNA and introns IVS 237 > 483 pseudo-MHC DR-beta intron A signal 9 14 CAAT box BASE COUNT 114 a 100 c 124 g 145 t ORIGIN Chromosome 23. 1 gagctcaccc aatccaggaa caaagatatg agccatttgt tggtatcact tggaatgtgg 61 gtggaggagg gctcatgtct ttactgagtg agacttccct gctcccccac accttgtctt 121 ttcctgttct ccagcatggt gtgactgttt ccccagaggc tcctggatgg cagctctgac 181 agtgatactg atggtgatga accctcccct ggcttgggcc agggacaccc acataagtgc 241 gtacctttcc ggcgggggtg aggggggtga gctatcatgg gatgggggga aggaagggag 301 ctagctttgt cactgtattc aggccatgtc ccttaaaatt gtgacatatt cttcatacta 361 tatatagtgg ctaagctgag tctgaataat tggtaacatt ttctgatgtt catatgtaac 421 atcagtgtac cttatggtat atttcaatat ataggggaat ttattcattc acattatatt 481 gaa // LOCUS BOVMHDRBE2 929 bp ds-DNA MAM 15-AUG-1990 DEFINITION Bovine MHC class II DR-beta pseudogene, exon 2. ACCESSION M30010 KEYWORDS cell surface glycoprotein; class II gene; integral membrane protein; major histocompatibility protein; pseudogene. SEGMENT 2 of 4 SOURCE Bovine (Holstein individual 2042) DNA, clone E4. ORGANISM Bos taurus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Artiodactyla; Ruminantia; Pecora; Bovidae. REFERENCE 1 (bases 1 to 929) AUTHORS Groenen,M.A.M., Van der Poel,J.J., Dijkhof,R.J.M. and Giphart,M.J. TITLE The nucleotide sequence of bovine MHC class II DQB and DRB genes JOURNAL Immunogenetics 31, 37-44 (1990) STANDARD full staff_review COMMENT Draft entry and printed sequence for [1] kindly submitted by M.A.M.Groenen, 20-NOV-1989. FEATURES from to/span description pept.ps + 228 + 500 pseudo-MHC DR-beta, exon 2 pre-msg < 1 > 929 pseudo-MHC DR-beta mRNA and introns IVS < 1 227 pseudo-MHC DR-beta intron A IVS 501 > 929 pseudo-MHC DR-beta intron B BASE COUNT 203 a 209 c 302 g 215 t ORIGIN Chromosome 23, about 5.4 kb after segment 1. 1 gtcgaccact gaagccactt ggagacctga ggggtctcct ctgcccacct tcgcctccct 61 gcactgtagg cagatgaaag aagggcccgt ggtagttcag gggtgcctgt ggagccaatg 121 agggagccct agtggccttc ctgtgcttgg gcagccctca ttggtggccg tcacatcagt 181 tccttcctgg gagcccacca ggtgaccgaa tcctggtgtg cccacagcac atttgatggt 241 gcagggcaag tccgagtgtc atttctccat ccggactgag caggtacgat tcttggccag 301 atacttctat aaccagaagg agttggtgca ttttgtcagc aacgatgtgg gtgagttcag 361 ggcagtgacc gagcggggca ggctcttcgc tgagagttgg aatcatcaga aggacttagt 421 ggagtgaacg caggctgtgg tggacacgtt ctgcagatac aactactgga ttggggagag 481 cttcatcctg cagcagcaag gtgagcacag gggtgggcgg ccaggggact ggggacagtg 541 tgtgtgtgtg tgtgtgtgtg tgagagagag agagagagac aaagagatag agagactgag 601 tcccggtgaa tgtgttgtat tatgagcaag tatgcttaag gagagttcct gtgagagcat 661 gttgcctgga gaaatgacac ttggacttgc cctgcaccat gaaatttgct gtgggaacag 721 caggattcgg tcaccctggt gggctcccag gaaggaactg atgtgacggc caccaatgac 781 gggctgccca agcacaggag ggccactagt gctccctcat tggctttaca ggcacccctc 841 aactaccatg ggttcttctt tcatctgcct gtatgacttt gtcagttatt gtgaaggaag 901 agacagtgtg tgtggtgggg ggagtacct // LOCUS BOVMHDRBE3 548 bp ds-DNA MAM 15-AUG-1990 DEFINITION Bovine MHC class II DR-beta pseudogene, exon 3. ACCESSION M30002 KEYWORDS cell surface glycoprotein; class II gene; integral membrane protein; major histocompatibility protein; pseudogene. SEGMENT 3 of 4 SOURCE Bovine (Holstein individual 2042) DNA, clone E4. ORGANISM Bos taurus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Artiodactyla; Ruminantia; Pecora; Bovidae. REFERENCE 1 (bases 1 to 548) AUTHORS Groenen,M.A.M., Van der Poel,J.J., Dijkhof,R.J.M. and Giphart,M.J. TITLE The nucleotide sequence of bovine MHC class II DQB and DRB genes JOURNAL Immunogenetics 31, 37-44 (1990) STANDARD full staff_review COMMENT Draft entry and printed sequence for [1] kindly submitted by M.A.M.Groenen, 20-NOV-1989. FEATURES from to/span description pept.ps + 12 + 293 pseudo-MHC DR-beta, exon 3 pre-msg < 1 > 548 pseudo-MHC DR-beta mRNA and introns IVS < 1 11 pseudo-MHC DR-beta intron B IVS 294 > 548 pseudo-MHC DR-beta intron C site 425 427 in-frame stop codon BASE COUNT 122 a 139 c 134 g 153 t ORIGIN Chromosome 23, about 5.8 kb after segment 2. 1 tttcctccta gtggaggatc ctacagtgac tgtgtatcct gcaaagaccc agcctctgca 61 gcaccacaac ctcctggtct gctctgtgaa tggtttctat ccaggacacg ttgaagtcag 121 gtggttccag aacggccatg aagaggctgg agtgatctcc acaggcctga tccagaatgg 181 agactggacc ttccagaccg tggtgatgct tgaaacagtt cctcagagtg gagaggtcta 241 cgcctgccaa gtggagcacc ccagccggac gagccctctc acagtggaat ggagtgagaa 301 gctttctgat ctcgtaagtt cctcacccac caagaagggg gcttgctcac ctctgagtgt 361 caggtttctc ctctctccat accatatttt ttatttgctt catgctcttt ctttcttagc 421 acaaattgtt ggggagtagc tctgtgatag cctgtgttag aaatcctctg atagtttaca 481 gatatcgttt gatagtttct atcaatacct atacctgctg gtgagacagt tcttcctggc 541 aggcagag // LOCUS BOVMHDRBE4 206 bp ds-DNA MAM 15-AUG-1990 DEFINITION Bovine MHC class II DR-beta pseudogene, exon 4. ACCESSION M30009 KEYWORDS cell surface glycoprotein; class II gene; integral membrane protein; major histocompatibility protein; pseudogene. SEGMENT 4 of 4 SOURCE Bovine (Holstein individual 2042) DNA, clone E4. ORGANISM Bos taurus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Artiodactyla; Ruminantia; Pecora; Bovidae. REFERENCE 1 (bases 1 to 206) AUTHORS Groenen,M.A.M., Van der Poel,J.J., Dijkhof,R.J.M. and Giphart,M.J. TITLE The nucleotide sequence of bovine MHC class II DQB and DRB genes JOURNAL Immunogenetics 31, 37-44 (1990) STANDARD full staff_review COMMENT Draft entry and printed sequence for [1] kindly submitted by M.A.M.Groenen, 20-NOV-1989. FEATURES from to/span description pept.ps + 86 / 196 pseudo-MHC DR-beta, exon 4 pre-msg < 1 > 196 pseudo-MHC DR-beta mRNA and introns IVS < 1 85 pseudo-MHC DR-beta intron C BASE COUNT 47 a 42 c 54 g 63 t ORIGIN Chromosome 23, about 0.35 kb after segment 3. 1 cttccaggca accttcttct cccatcctca aaagcttagg gaagttggat tgggataaga 61 tcactgaaac ttacttcttt tctaggggca tgatctgact ctgctcagag caggatgatg 121 agtggagtca ggggctttgt tgtgggtctg ctcttccttg ggatcaggtt gttcatctac 181 tttaggaatc agaaaggtaa ggatcc // LOCUS VECPCE30 143 bp ds-DNA SYN 15-AUG-1990 DEFINITION Expression vector pCE30, partial sequence. ACCESSION M36426 KEYWORDS expression vector. SOURCE Synthetic DNA. ORGANISM Cloning vector Artificial sequences; Cloning vehicles. REFERENCE 1 (bases 1 to 143) AUTHORS Elvin,C.M., Thompson,P.R., Argall,M.E., Hendry,P., Stamford,N.P.J., Lilley,P.E. and Dixon,N.E. TITLE Modified bacteriophage lambda promoter vectors for overproduction of proteins in Escherichia coli JOURNAL Gene 87, 123-126 (1990) STANDARD simple staff_entry BASE COUNT 37 a 35 c 40 g 31 t ORIGIN 1 agggcagcat tcaaagcaga aggctttggg gtgtgtgata cgaaacgaag cattgggatc 61 cccgggaatt cactggccgt cgttttacaa cgtcgtgact gggaaaaccc tggcgttacc 121 caacttaatc gccttgcagc aca // LOCUS CHKPPPTH 1723 bp ss-mRNA VRT 15-AUG-1990 DEFINITION Chicken parathyroid hormone mRNA, complete cds. ACCESSION M36522 KEYWORDS parathyroid hormone. SOURCE Chicken parathyroid gland, cDNA to mRNA, clones cPTH-[11,12,3]. ORGANISM Gallus gallus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Aves; Neornithes; Neognathae; Galliformes; Phasianidae. REFERENCE 1 (bases 1 to 1723) AUTHORS Khosla,S., Demay,M., Pines,M., Hurwitz,S., Potts,J.T.Jr. and Kronenberg,H.M. TITLE Nucleotide sequence of cloned cDNAs encoding chicken preproparathyroid hormone JOURNAL J. Bone Miner. Res. 3, 689-698 (1988) STANDARD simple staff_review FEATURES from to/span description pept 128 487 parathyroid hormone precursor sigp 128 202 parathyroid hormone signal peptide matp 221 484 parathyroid hormone BASE COUNT 626 a 311 c 331 g 455 t ORIGIN 1 ttttaaagtt agatttaagg gatccactaa accaattcag tagtctttaa atatacttga 61 catcaagaca cagccatctg ctgacatacc ccaaccagaa aactgttaag gacaatatct 121 gataaaaatg acttctacaa aaaatctggc caaggccata gtgattttat atgctatatg 181 tttttttaca aactctgatg gaagaccaat gatgaagaga tcggtgagtg agatgcaatt 241 aatgcataac cttggagagc atcgacacac tgtggagaga caggactggc ttcagatgaa 301 gctgcaggat gtgcacagtg cccttgagga tgccaggacc cagaggcctc gaaacaagga 361 ggatattgtc ctgggggaga taagaaaccg gaggctgctc cctgagcatt tgcgggcagc 421 agtgcagaag aaatccattg acctggacaa agcttacatg aatgtactct ttaaaactaa 481 gccatgatga aaagaccaag agcattataa ctgtccaagt aagcacatgt ctgtagatca 541 ctgaccagtt agggcatttt atttattatt ttttttttaa ctcaaactat gataaggatt 601 aaaggctcca tgccagactg tagccccact gagatgggta tttcacaact aaatagtaaa 661 gtgtatttat aggccaccca tggccattgc tgctaactcc caggtatctt ttaaatggct 721 aatgtaactc attaacttcc aggagaatta aaaacaaatg gcaaaacaaa aaacaacaaa 781 gaccacctgc aatagaataa gaaagttgaa aaacatttaa gaccagttct accactccta 841 tatggagagc atttgtctgt aatctttaga cctactagta ctgtaaacta acaacgtaat 901 ataggcataa ctgcattatg cctagggtta aacttcaagt ttgtcctaat gaaaggaacg 961 caaacttaaa tccactctta ctttcccaag aaggcctaaa gccagaccaa tgtcagtaac 1021 atagacaaag ctgcatgata ataacttagg attaaagagt gcgaacatga aaaatagaag 1081 gaacccaaag cttaagatta aagtagaatg aaataaattg tgcatgaaaa agaagaacga 1141 agttttacaa gatactgaaa tgaaagggag gtttattaac tttccctctt aattatgagc 1201 tgtcaccttt tggaactgca ggaacagtga gagcagagat tgtagcatat atgtatgcaa 1261 agccctaact atagaactgg gaaatggttc aacacgagat aaaaacaaga cttgtttcaa 1321 ttgttatcat ctctccttca gtcaataatc tatgagtttc tgtatattgt gcttaggcca 1381 catgggtaag tggctcacat aaaattactc atcttcacat gtgcacttat acagaattgg 1441 gatttcagtt tgttaaaacc ctgaaattac aaccattaaa atatagaaat caaaacctgg 1501 gaaccatcag ttaaaatata agcaggattc agaaagaatt tgacaggaac atggatggga 1561 gaaaatgatg ataataatat agaaaagaaa gcagcaaata taaaatgatt ttgaattgta 1621 tagacaagta tgtgcttatg acctcgacca cttctgaata ataagaatat ttcccctgta 1681 gaagtgacag cagtttcctc ccaatgttcc actgtgagaa ttc // LOCUS CUC11SGB 1684 bp ss-mRNA PLN 15-AUG-1990 DEFINITION Pumpkin 11-S globulin beta-subunit mRNA, complete cds. ACCESSION M36407 KEYWORDS 11-S globulin beta-subunit. SOURCE Pumpkin (cv. Kurokawa Amakuri Nankin) cotyledon mRNA, clone pPG-beta-2. ORGANISM Cucurbita pepo Eukaryota; Plantae; Embryobionta; Magnoliophyta; Magnoliopsida; Dilleniidae; Violales; Cucurbitaceae. REFERENCE 1 (bases 1 to 1684) AUTHORS Hayashi,M., Mori,H., Nishimura,M., Akazawa,T. and Hara-Nishimura,I. TITLE Nucleotide sequence of cloned cDNA for pumpkin 11-S globulin beta- subunit JOURNAL Eur. J. Biochem. 172, 627-632 (1988) STANDARD simple staff_review FEATURES from to/span description pept 31 1473 11-S globulin beta-subunit precursor sigp 31 93 11-S globulin beta-subunit signal peptide matp 94 918 11-S globulin beta-subunit gamma-chain matp 919 1470 11-S globulin beta-subunit delta-chain BASE COUNT 457 a 406 c 463 g 358 t ORIGIN 1 ctaatagccc ttctcttctc cataccagca atggctcgct cttctctttt taccttttta 61 tgtttagcag ttttcatcaa tggctgcctc tctcagattg agcagcagag cccctgggaa 121 ttccaaggca gcgaagtatg gcaacagcac cgctaccaat ctcctagagc ctgtcgtctt 181 gagaatcttc gagctcaaga ccccgttcgc cgggctgagg cggaggcgat cttcactgaa 241 gtctgggacc aggacaacga tgagttccag tgcgccggcg tcaatatgat ccgccataca 301 atccggccca aaggtctgct tcttcctggt ttctctaatg ctcctaaact catcttcgtc 361 gcccaaggct tcggtattcg cggcattgca atccccggct gtgcagagac ttaccagact 421 gatttacgaa gatcgcaatc ggccggatct gcgttcaaag accagcatca gaagatccgc 481 cccttcagag agggagatct cctcgtcgtc ccggccggag tttctcactg gatgtataat 541 cgaggacagt ccgatctcgt tttgatcgta ttcgctgaca ctcgcaacgt cgcaaaccaa 601 atcgatccct acctcagaaa attctacctt gccggaaggc cagagcaggt agaaagaggc 661 gtagaggaat gggaaagaag tagccgaaag ggatcttccg gcgagaaatc aggcaatata 721 ttcagcggat ttgcagacga atttctagag gaagctttcc agatcgacgg tggactggtt 781 aggaagctaa agggagaaga cgacgagaga gacagaatcg tgcaggtcga cgaagatttc 841 gaggtgcttc taccggagaa agatgaagaa gagagatcga gaggaagata catcgaatca 901 gaatcagaat cggagaatgg cttagaagaa accatttgca cactccgatt aaagcaaaac 961 atcggccgat ctgttcgcgc cgacgtgttc aacccacgcg gcggccgaat ctccacggcc 1021 aactaccata ccctccccat tctccgccaa gtccgcctta gcgccgaacg aggagtcctc 1081 tacagcaacg cgatggtggc gccgcactac acagtgaaca gtcactcagt gatgtacgcg 1141 acgagaggca acgcgagagt gcaggtggtg gacaacttcg ggcagtcagt gttcgacggc 1201 gaggtccggg aaggacaggt actgatgatt ccgcagaact tcgtggtgat taaacgagca 1261 agcgacagag gattcgagtg gatcgcattc aagacgaacg acaacgcaat cacgaatctg 1321 ctggcggggc gagtgtcgca gatgaggatg ttgccgctgg gagtgctgtc gaacatgtac 1381 cggatctcga gagaggaggc gcagaggctg aagtacgggc agcaggagat gagggtgctc 1441 agccccggaa ggtcgcaggg aagaagagag tgaaaatgaa gaagtgggta gtgggtaatg 1501 ggtaatggga aatatatata tatggtagta gtaatctaat gtaatttagt gaataaagag 1561 cgagctttca ggtgatgccg ccgacgagcc ctgcttgtta ccggccggaa aaaatggaga 1621 aatctctcag aaagacaccg agttttaata ataaaagtaa taatattcgc ctcttttttc 1681 cttc // LOCUS DROKINLA 2175 bp ds-DNA INV 15-AUG-1990 DEFINITION D.melanogaster kinesin-like protein (nod) gene, complete cds. ACCESSION M36195 KEYWORDS kinesin-like protein; nod gene. SOURCE D.melanogaster DNA. ORGANISM Drosophila melanogaster Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta; Pterygota; Neoptera; Holometabola; Diptera; Brachycera; Cyclorrhapha; Schizophora; Drosophiloidea; Drosophilidae. REFERENCE 1 (bases 1 to 2175) AUTHORS Zhang,P., Brodeur,B.A., Goldstein,L.S.B. and Hawley,R. TITLE A kinesin-like protein required for the distributive chromosome segregation in Drosophila JOURNAL Unpublished (1990) STANDARD simple staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by P.Zhang, 06-JUL-1990. Author address: P.Zhang Albert Einstein College of Medicine Molecular Genetics Dept. 1300 Morris Park Avenue Bronx, NY 10461 FEATURES from to/span description pept 72 2072 kinesin-like protein (nod) mRNA 1 2175 nod mRNA BASE COUNT 557 a 594 c 561 g 463 t ORIGIN 1 caaagtaaaa taattacggt gaatgcaagc caattgtgca ttattcaaac aacttcaatt 61 cttcaatctg catggagggc gccaaattaa gcgcagttcg gattgcggtc cgcgaggcgc 121 cgtaccgcca gttcttgggg cgtcgggagc ccagcgtcgt ccagtttccg ccatggagcg 181 acggaaagtc gttaatagtg gatcagaatg aattccactt cgatcacgcc tttcccgcga 241 ccatcagcca ggatgagatg taccaggcgc tgatcttgcc gctggtggac aagctgctcg 301 agggattcca gtgcactgca ctcgcctacg gccagacggg aacgggcaag agctactcaa 361 tgggcatgac acctccggga gagatactgc ccgagcacct gggtattctg cctcgcgccc 421 tgggcgacat ttttgagcgc gtgaccgccc ggcaggagaa caacaaggat gcgattcagg 481 tgtacgcctc cttcatagag atctacaatg agaaaccctt cgatctgctg ggctccacgc 541 cacatatgcc catggtggcg gcgcgttgcc agcgatgcac ctgccttcct ttgcacagcc 601 aggcggatct gcatcacatc ttggagctag gcactcgcaa tcgacgcgtt cgtcccacca 661 atatgaattc caatagttcg cgatcccatg ccatagtcac cattcacgtg aagagtaaaa 721 cccatcactc gcggatgaat attgtggatc tggccggttc agaaggcgtg cggcgaactg 781 ggcacgaggg cgtggccagg caggagggcg tcaacatcaa tctgggcctg ttgagcatca 841 acaaggtggt gatgtccatg gcggcgggcc acacagtgat accataccgc gacagcgtcc 901 ttaccacagt tctgcaggcc tcgctaaccg cgcagtcgta tctgaccttt ctggcctgca 961 tcagtccgca tcaatgcgat ctcagcgaga cgttgtccac cctgcgtttt ggcaccagtg 1021 ccaagaagct tcggctgaat ccgatgcaag tggcgcgcca gaagcaatcg ctggccgcac 1081 ggacaacaca cgtcttccgc caagcgctat gcacctcgac ggccatcaag tcaaacgcag 1141 ccaatcataa tagcatagtg gttccaaaat ccaaatatag cacaaccaag ccgctgagcg 1201 ccgtgctcca tcgaactcgc tccgaacttg gcatgacgcc caaagctaag aaaagggctc 1261 gcgagctatt ggagctggag gagaccacgc tggagctctc gtctatacac attcaggaca 1321 gcagtctgag tctgttgggt ttccatagcg atagcgataa ggataggcat ttaatgcctc 1381 ccccaacagg gcaagagcca aggcaagcca gcagccagaa ctctacgcta atgggcattg 1441 tcgaagagac cgagcccaag gaatcgtcaa aggtgcaaca gtcaatggtt gcccccacgg 1501 tgcccacaac tgtacgctgc cagctgttca acaccaccat cagtcccatc agtctacggg 1561 catccagctc tcagcgagaa cttagcggca tccagccaat ggaggagaca gtagtggctt 1621 cgccacagca gccatgcctt cgtcgttccg tgcgtctagc gagtagcatg cgttcgcaga 1681 actatggagc cattcccaag gttatgaatt tgcggcgcag cacgcggctg gcgggaatcc 1741 gggaacatgc cacctccgtt gttgtgaaaa acgagacgga tgcgataccg caccttcgaa 1801 gtacagtgca aaaaaaacgt acgcgaaacg tgaaacctgc gcccaaggcc tggatggcca 1861 ataatacaaa atgttttctg gacctgctta acaatggaaa cgttaagcaa ttgcaggaga 1921 ttccagggat cggtccaaag tccgccttta gtttggcctt gcacagatcc cgcctgggtt 1981 gcttcgagaa tctttttcaa gtcaaatccc tgcccatttg gtcgggaaat aaatgggaac 2041 gattttgtca aattaactgt ctcgacactt gatacaatta ctaattaaat agcattttaa 2101 ttcgaatata gtatagtgat tgttatttat gtggcatata ctttgatttt acaactatag 2161 taggagtaaa aaaag // LOCUS HAMCADCA 3902 bp ss-mRNA ROD 15-AUG-1990 DEFINITION Hamster carbamoyl-phosphate synthetase mRNA, partial cds. ACCESSION J05503 KEYWORDS carbamoyl-phosphate synthetase. SOURCE Hamster cell line 165-28, cDNA to mRNA, clone pCAD142. ORGANISM Mesocricetus auratus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Cricetidae; Cricetinae; Cricetini. REFERENCE 1 (bases 1 to 3902) AUTHORS Simmer,J.P., Kelly,R.E., Rinker,A.G.Jr., Scully,J.L. and Evans,D.R. TITLE Mammalian carbamyl phosphate synthetase (CPS): cDNA sequence and evolution of the CPS domain of the Syrian hamster multifunctional protein CAD JOURNAL J. Biol. Chem. 265, 10395-10402 (1990) STANDARD simple staff_review FEATURES from to/span description pept < 1 > 3902 carbamoyl-phosphate synthetase (E.C.6.3.5.5; AA at 3) BASE COUNT 829 a 1056 c 1135 g 882 t ORIGIN 1 tcaggcccct ggcaccagag gtttctatta agaccccacg ggtattcaat gcagggggtg 61 cccctcggat ctgtgccttg gactgcggcc tcaagtataa tcagatcaga tgtctctgcc 121 agcttggggc tgaggttact gtggtgccct ggaaccacga attagacagt cagaagtatg 181 atggcctttt tctgagtaat ggacctggcg atcctgcctc ttatcctggt gtggtagcca 241 cactgaaccg cgtcttgtct gagcccaatc cccgacctgt gtttggaatc tgccttggac 301 accagctgtt ggctttagcc attggggcca aaacttacaa aatgaggtat ggaaaccgag 361 gccacaacca gccctgttta ctggtgggca ccgggcgctg ctttctgacg tctcagaatc 421 acgggtttgc cgtggatgca gactcgctgc cagcaggctg gactccgctc ttcaccaatg 481 ccaacgactg ttccaacgaa ggcattgtac atgacagcct gccctttttc agtgtccagt 541 ttcacccaga gcaccgagct ggcccttcag atatggaact gctttttgat gtatttctgg 601 agactgtgag agaggctgta gctgggaacc ccgggggcca gacagttaaa gagcggttgg 661 tgcagcgcct ctgtccccct gggcttctca ttcctggttc tgggcttcca ccaccacgga 721 aggttctgat cctaggctct gggggcctct ccattggcca ggctggagaa tttgactact 781 caggctctca ggccattaaa gccctgaagg aggagaacat ccagacgctg ctgatcaacc 841 ccaacattgc tacagtgcag acctcgcagg ggctggcaga caaggtctac ttccttccca 901 ttacacctca ctacgtaacc caggtgattc ggaatgaacg cccagatggt gtgttactga 961 cttttggggg ccaaacagcc cttaactgcg gtgtagaact gaccaaagcc ggagtgctag 1021 ctcggtatgg ggttcgggtc ttgggtacac ctgtggagac cattgaactg actgaggacc 1081 gacgagcctt cgcggccagg atggctgaga tcggagagca tgtagccccc agcgaagcgg 1141 caaattctct tgaacaggct caggcagctg ctgagcgact gggctaccct gtgctggtgc 1201 gtgcagcctt tgccctgggt ggtcttggtt ctggctttgc ttccaccaaa gaggaactct 1261 cagctcttgt ggctccagct ttcgcccata ccagccaggt gctgatagac aagtctctga 1321 agggctggaa ggagattgaa tatgaggtgg tgagagacgc ctatggcaac tgtgtgacgg 1381 tatgtaacat ggagaactta gacccactgg gcatccacac tggtgagtcc atagtggtgg 1441 cgcccagcca gacgctgaat gacagagagt accaacttct gcgacggaca gctatcaaag 1501 tcacccagca cctggggatc gtcggggagt gcaacgtgca gtatgccttg aacccggagt 1561 ctgagcagta ttacatcatt gaagtaaatg ccaggctgtc tcgaagctct gccctggcca 1621 gtaaggccac aggctatcct ctagcctatg tggcagccaa gctggcgttg ggcattcccc 1681 tgccggagct caggaactct gtcactgggg gaacagcagc ctttgagcct agcctggact 1741 actgtgtggt aaagattcct cgatgggacc tcagcaagtt cttgcgtgtc agtacgaaga 1801 ttgggagctg tatgaagagt gttggtgaag tcatgggcat tggacgctca tttgaagagg 1861 ccttccaaaa ggccctgcgc atggtggatg agaactgtgt gggcttcgac catacagtga 1921 agccagtcag tgatgtggag ttggagacac caacagataa gcggatcttt gtggtggctg 1981 ctgctctgtg ggctggctac tcggtggagc gcctgtatga gctcacacgc atcgactgct 2041 ggttcctgca tcgaatgaag cgtatcgtga cccacgccca gttgctggaa caacaccgag 2101 gacagccgtt gtctcaagac ctgctgcacc aggccaagtg cctcggcttc tcagacaaac 2161 aaattgccct tgcagtcctg agcacagagc tggcggttcg aaagctacgt caggaactgg 2221 gaatctgccc tgcagtgaaa cagattgaca cagttgcggc tgagtggcca gcacagacca 2281 attacctgta cctgacatac tggggcaaca cccatgacct cgactttcga actcctcacg 2341 tcctggtcct tggctctggt gtctaccgca tcggctccag tgttgagttt gactggtgtg 2401 ccgtcggctg catccagcag ctccggaaga tgggttataa gaccatcatg gtgaactaca 2461 acccagagac agtcagcaca gactatgaca tgtgcgaccg actctacttt gatgagatct 2521 cctttgaggt ggtgatggac atctatgagc tggagaaccc cgacggcgtg atcctgtcca 2581 tgggtggaca gctgcccaac aacatggcca tggctctgca tcggcagcag tgccgagtgc 2641 tgggcacctc cccggaagcg atcgattcag ctgagaaccg gttcaagttc tcccggcttc 2701 tagataccat cggcatcagc cagcctcagt ggcgtgaact cagtgacctc gagtctgctc 2761 gccagttctg ccagactgtg gggtacccct gtgtggtgcg cccctcctat gtgctcagcg 2821 gtgccgctat gaatgtggcc tacactgatg gggacctgga gcgcttcctg agcagtgcgg 2881 ccgctgtctc caaggagcac cccgtggtca tctccaaatt catccaggaa gcaaaggaga 2941 ttgatgtgga cgctgtggcc tgcgatggcg tcgtgtcagc cattgccatc tccgagcacg 3001 tggagaatgc aggtgtgcat tcaggggatg ctacgctggt caccccccca caagacatca 3061 cccccaaaac tctggagcgg atcaaagcca ttgtgcatgc cgtggggcag gaactacagg 3121 tcacagggcc cttcaatctg cagctcattg ccaaggatga ccagctgaaa gttattgagt 3181 gcaatgtgcg tgtctctcgc tccttcccct tcgtgtctaa gacgctgggt gttgacctag 3241 tggccttggc cacgaggatc atcatgggag agaaggtaga acccatcgga ctcatgacgg 3301 gctctggagt cgtgggagta aaggtgcctc agttctcctt ctcgcgcttg gcgggtgctg 3361 atgtggtgct gggcgtggag atgaccagta ctggagaagt agctggcttt ggagagagcc 3421 gttgtgaggc ctacctcaaa gccatgctta gcactggctt taagatcccc aagaagaaca 3481 tcctgctgac catcggcagc tacaagaaca aaagtgagct gctcccgact gtgcggttgc 3541 tggagagcct gggctatagc ctctacgcca gcctgggtac ggcggacttc tacactgagc 3601 acggggtcaa ggtgacagct gtggactggc actttgaaga ggctgtggat ggcgagtgcc 3661 cgccacagcg gagcatcttg gatcagctgg ctgagaatca ctttgagtta gtgattaacc 3721 tgtcaatgcg tggggccggg ggtcgacggc tttcctcctt cgtcaccaag ggctaccgca 3781 cgcggcgcct ggctgctgac ttctctgtgc ctctcatcat cgacatcaag tgcaccaaac 3841 tcttcgtgga ggccctgggt cagattggcc ccgccccgcc tttgaaggtt catgtagact 3901 gc // LOCUS LEIKPDNP 376 bp ds-DNA ORG 15-AUG-1990 DEFINITION L.aethiopica kinetoplast DNA. ACCESSION M36194 KEYWORDS . SOURCE Kinetoplast L.aethiopica (strain 1467/85) promastigote, clone R3,. ORGANISM Kinetoplast Leishmania aethiopica Eukaryota; Animalia; Protozoa; Sarcomastigophora; Mastigophora; Kinetoplastida; Trypanosomatina; Trypanosomatidae. REFERENCE 1 (bases 1 to 376) AUTHORS Laskay,T., Kiessing,R., Rinke de Wit,T.F. and Wirth,D.F. TITLE Generation of species-specific DNA probes for Leishmania aethiopica JOURNAL Unpublished (1990) STANDARD simple staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by T.F.Rinke de Wit, 06-JUL-1990. Author address: T.F.Rinke de Wit Leiden University Hospital Rijnsbugerweg 10 2300 RC Leiden THE NETHERLANDS email:WBLGIPHAR@HLERUL52.BITNET BASE COUNT 113 a 99 c 70 g 94 t ORIGIN 1 ctctaatagc ccaggaccta tcgtcgccac tctccgaact atagaaagac ccgcgctgta 61 ggcacaatag gaccaactgt actacctgca gtggctagac cactactggc aaatcaatag 121 aactattacc tttaactata agtgatttaa ctttaaccta taatagaaca ttattcgtcg 181 ctcattcccg ggccccacgt agcctttccc atgaagttcg tataccgact ctacggttca 241 agtttatata ccggttcact ccgttgcacc atggtgacct tacgtcacta gatacaattg 301 atattaataa ttaaatacag ccaagatagg cggcatgtgc cacagagtag cggcaggaag 361 ccagccaatg agcata // LOCUS LMIB19KP 938 bp ss-mRNA INV 15-AUG-1990 DEFINITION L.migratoria basic 19kD hemolymph protein mRNA, complete cds. ACCESSION M36206 KEYWORDS basic 19k protein. SOURCE L.migratoria adult female fat body, cDNA to mRNA, clone lambda-LmF2. ORGANISM Locusta migratoria Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta; Pterygota; Neoptera; Holometabola; Orthoptera; Caelifera; Acrididea; Acridoidea; Acrididae. REFERENCE 1 (bases 1 to 938) AUTHORS Kanost,M.R., Bradfield,J.Y., Cook,K.E., Locke,J., Wells,M.A. and Wyatt,G.R. TITLE Gene structure, cDNA sequence, and developmental regulation of a low molecular weight hemolymph protein from Locusta migratoria JOURNAL Arch. Insect Biochem. Physiol. 8, 203-217 (1988) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by M.R.Kanost, 06-JUL-1990. FEATURES from to/span description pept 54 572 basic 19k protein precursor sigp 54 95 basic 19k protein signal peptide matp 108 569 basic 19k protein BASE COUNT 231 a 267 c 222 g 218 t ORIGIN 1 agctctgctg tctcctgtcc actccacacc acaggctcag taccaggatc aggatgaagc 61 tggtggtggc tgcagttctc gcgatggccg cgtcgcggtg gcggcgcctg tcggcccacg 121 gccaggtgcc gtccagcacg tgcgccgaca tgctgcccgt gcacggcaac gcaatgccca 181 gcacagccct gccctacacc atcaccgtgt cgcccacctc cgtcaacggc ggcgacaccg 241 tcagagtgca catctcgggc acggaggagt tccgcggcgt ctacctgcag cgaggagggg 301 ccaagagcag taggagagtt cctgctgccc gccggagaga acaacaagat cgccctgtcc 361 gactgcccgc cggacacaac aacgccttct catacatttc gcgcacaccc ctggacacac 421 tggacatcga ctggaaggca ccatacacca gcgatgaaat cgttttcagg gctactttcg 481 tcaagagctt ctccgagttc tgggtcggcg ttgagtcacc gaagatcaca ttgggaccgc 541 tacgtcaact tgacaacgca gttgctgctt agtgactgaa gtcgccatat tcatatacga 601 gcacatccag tactgatgtc ctagtttatc acaacatcgc cgcaccacca ctttcacgtt 661 ctctactact aaaatggtag ataaatcgct tattacagct gttagctgca tataagagaa 721 gcgtttcaaa acgagaaact ctttttgatt ttgtactgag ggaattcaag taaagatttg 781 acaggcagac gtcaccatct tgttcaagac ttggcatcca gtttgcctgt ctgctgtgtg 841 tttgtagatg ctcacacttc ttgtgatatt tactaccaca aattttgtac tcaagacttg 901 aagaattgaa atatattctc taattaatat aaaaaaaa // LOCUS MUSALDAA 8190 bp ds-DNA ROD 15-AUG-1990 DEFINITION Mouse aldolase A gene, complete cds. ACCESSION J05517 KEYWORDS aldolase A. SOURCE Mouse (strain RIII S/J and Blue Spruce (outbred Swiss Webster)) adult DNA, clone lambda 16. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 8190) AUTHORS Stauffer,J.K., Colbert,M.C. and Ciejek-Baez,E. TITLE Nonconservative utilization of aldolase A alternative promoters JOURNAL J. Biol. Chem. 265, 11773-11782 (1990) STANDARD simple staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by J.K.Stauffer, 15-JUN-1990. FEATURES from to/span description pept 4301 4412 aldolase A, exon 4 (E.C. 4.1.2.13) (first expressed exon) 4493 4704 aldolase A, exon 5 5125 5179 aldolase A, exon 6 5278 5438 aldolase A, exon 7 5727 5810 aldolase A, exon 8 5908 6082 aldolase A, exon 9 6222 6421 aldolase A, exon 10 6519 6614 aldolase A, exon 11 pre-msg 1700 > 6614 aldolase A mRNA and introns (alt.) pre-msg 1740 > 6614 aldolase A mRNA and introns (alt.) IVS 1804 1951 aldolase A intron A (put.) IVS 2023 4278 aldolase A intron B (put.) IVS 2205 4278 aldolase A intron C (alt.) IVS 3256 4278 aldolase A intron C (alt.) IVS 4413 4492 aldolase A intron D IVS 4705 5124 aldolase A intron E IVS 5180 5277 aldolase A intron F IVS 5439 5726 aldolase A intron G IVS 5811 5907 aldolase A intron H IVS 6083 6221 aldolase A intron I (no splice consensus) IVS 6422 6518 aldolase A intron J signal 3032 3036 CAAT box signal 1673 1676 TATA box signal 2235 2240 TATA box signal 3089 3094 TATA box signal 3132 3137 TATA box BASE COUNT 1676 a 1884 c 2061 g 1876 t 693 others ORIGIN 1 gatccttgct ttttgaagcc ttagaatgaa gccagcattc ctggccttgg gagggcaggc 61 acgggagact ccaaggcctg gggaaagcaa ctctagtcca aaccagtttc tcttgctggt 121 tgtagtcttt tgggcaaacc actgagtttc tatctcatta ttttgtgatg agccccccac 181 gagtgtgacc cccattcaag gtggctcaga agcagagtgc ttgccttgtg tttgtgacat 241 cccaagttca attcatcact gaggaaaccc ctccctttaa gatttatctt atctctgaac 301 gttttcccga ttgtatgact cgtatgtatc tgaggaagtc agaagaaatg tcagatcccc 361 caggatcttg ggatctggag tcgtgatggc tgtgagtcac tgtatatatg tgctggagct 421 gaactcaggt cctctggaac agccattgct cttaaccact gagccatggt ccggacacct 481 ggcttagaca gggtcccttt ctgtcagtgg ttctcaacct gtgggttatg gccctttgtg 541 ggggtggagg tgggtattaa cttatacagg gctgacctaa ggttataaaa acccagatat 601 ttatgattca taacagcaaa attacaggtg taaagtagca acaaaaattc ttttttggtt 661 gggagtacca caacatgggg aactgtatta aaaggtagca ttaggaaggt tgggcaccac 721 tgctctcgta gccctggcta tcctagaact caaatagtag atcaggctgg tccaaactga 781 cagagatcta tctctgccag cgtcagcact aggaagtgag taaattccat gatagccagg 841 ccatacagtg aaaccctgtc tcaaaacagg acaagaggaa ccccagtact tagtaggttg 901 aagtaaggat tgtcattttt tttgaggcca gcttgggttt catggctctt gactagtctg 961 agctgtagag ggagagcctg tctcacgagg aagcttagga gggagatatt atagtttggt 1021 ttatgccagc aagaaagtcc aaagtcccag aaattatctt catgaggatt gaaacatgtt 1081 ttctggtcct gacttcctct aggttgcata gggctttgag agtatagtat acctactatg 1141 tgcgcataca cacacgcgcg cgcgcgcgca cgcgacacac acaggaccca gtgggacaga 1201 tactttatca ctgctgctgt tcagcatgga gggagcttct ttccagtgct ttgtctctcc 1261 gtccactggg cctggtgggt gggtgctcct cagccctctg cttacccacc tctctcttct 1321 cctttagggt tgggcccctc gatgccctgg cctgctgccc actgtgtgac tgtgcctgtg 1381 cctgccagct cccagactgc cagagcctca actgcctctg tttcgagatc aagctcagat 1441 gaaagatggg gctggggacg ttgttctttg gggagtggcc agtccccagg gccccctcta 1501 tgatcctcag gacatcatta tactggagct atggatggca ggcccagcct aattacctgg 1561 gttccttgag ttctctgaaa ggcaggattc tgagagccct tggaccgctg aaaagggcct 1621 gatgctctgg ccagtgcccc tgcctttctt cctctccctt ccctgataaa ctattgtatg 1681 tgaggtagga tcgagacatt gctcacccag gcaacagtgt gggaggtttc tgccaacctg 1741 gactatcagg ataaagggat ggccagccac accctgcctt tagactcctg gttattttaa 1801 gaggtgagta tcctgcctga ctctgctctc ctttggaaaa aaaaaaaaag ttcaaccacc 1861 agcaggcacc agagtcaagg gaggagggaa ccagaggagg gcagtgggag gcaatatcta 1921 gatgttttcc cttcttgttc tgccttaaca gatcctggac ctgagactga tttcttgact 1981 aatttcactg tatttccaag gaagaggttc ctctaaagac cggtgagtga gcagtggcac 2041 ctcctcctct caaggcaaac caaagctgcc tcttcttcac cccccacgca gggatgaatg 2101 tcaggagcct caggtttccc taaatatagg tcccggccgc gggattcgtg gtggggaaag 2161 ggcaggggtt accgagaagg tctgggacac tggtgcgggg gtgtgtaggg gaggggtggg 2221 gagtaggagc tgccttaaaa cccagccctg gactgccggg ctcactctct gctgaccggg 2281 ctctgcggct tctgtcactg cgccacaggt gggccgctat ccggattgca ggatgggaat 2341 gggggttgcg gattgggacc tgaggaaact gactgctctg agagttacag ggtgacaaga 2401 gagctccgag acggattttt ttattttgga gaaggaaatc aggttcggga aagacctgtc 2461 tggcttgggc cagtccttgt cggtcatttc ctcaaactgg gtgtgtttag ctcgcgggtg 2521 gtgcctcccg ccaatctgct aggcaacgcc aggcctggat acgccactca gttccgatgt 2581 ggccggcaca ctagttctgg gaggttttgc ctgcgtacca tgtcactcgc cgtgctctgg 2641 ccagggagag atggaatgng ccctgcattt tagtcaagcg acgaagcagg caggcaggga 2701 ggctccgaag ctctgcgttc ttagcagtga cgtcaggctg caactacaca gccggaagcc 2761 tgggtcttgg aggagaggcc agccaccatc tcactctgac cccctcccta ctcttcgcca 2821 acccacattc cggctgagtc acatgttccg cgcgcgccag gcaggggttg gggggggggg 2881 tgttgggggg ggggggtggt gacctgcggg atgtggctcg agtcacgtcc tagcggggcg 2941 gaggagggat cgtgttctag ccgcttgtct cctccccagt gccgcctcct atcggagcat 3001 cttggggcgg tctgcgcaca gtgcccacct tcaattgacg gttcccgtcc ctgcaaggga 3061 aaaaacctgc agagggcgga gcggcgcctt taaatgtccg gggccccgcc tccggtcccc 3121 cccaacccag ctgaataggc tgggttctct tggaacgcgc agcagaacca ggttctggtg 3181 accctagccg ttcgctcctt agtcctttcg cctacccacc ggcgtaccag gcagacccac 3241 cccgtcctgt gccaggtgag cgccatttac acgtgctcgg ggaagggtct atggggttag 3301 gatcttgggc cggtggcggg cagtgcagag ccgtcttccc cacggcccct cacttctcct 3361 ttttctaccc ccacgcttgc ccccagcccn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 3421 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 3481 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 3541 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 3601 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 3661 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 3721 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 3781 nnnnnnnnnc tggttctctc ttaactctcg cctttgggtt gctatgtggc tttgagcaca 3841 gatcatttct ttcttgggct ctttcagatg agggtattag gctcctgccc tattcgtgat 3901 ccttaaattc taaaatatcc cggttcaatt ttgtttctag gcaaggtgac ccatggcaac 3961 gcgcaggcca gatgggtcag cttcaacatg accgctgtcc tggctctggc ttcttcttcc 4021 ccagttggcc agtgagcgaa cccactctga gctgggcaac acccagcaac agacagagtt 4081 aggaaaggta caggaagagg caggtctagt atagggaagt cgggagtagg ggagagctct 4141 gggacaggaa gtatcccagg accctcaggg agtggggcag gggaggtggg ggctagtgcc 4201 ctggcctcca ggaagctttg taccggggag accatgggat ggtccaacta agcgctggtc 4261 tctgcctccc tcacccagga aagcaactgc caccggcacc atgccccacc catacccagc 4321 actgaccccg gagcagaaga aggagctgtc tgacatcgct caccgcattg tggctccggg 4381 caagggcatc ctggctgcag atgagtccac cggtgcggta caggagaaga aagggaggag 4441 gacccaggtt ggagctagca ggctgatccc ttatctccat catgactttt aggaagcatt 4501 gccaagcgcc tgcagtccat tggcaccgag aacaccgagg agaacaggcg cttctaccgc 4561 cagctgctgc tgactgcaga cgaccgtgtg aatccctgca ttgggggggt gatcctcttc 4621 cacgagacac tgtaccagaa ggcagatgat ggacgtccct tcccccaagt tatcaagtcc 4681 aagggtggtg ttgtgggcat taaggtaaga gggcagactc tggggggggg gtaagattag 4741 aggaggatct cggagaaagg gattaatagg tagggagggg gtaatatggc tagcaggcct 4801 agagactcag gtggatgtat cagcataatt ttttttcagt gtttggggtg aacttaggtc 4861 cttgtgcatg tcggcaagcg cgctgttgcc aacttaatgg ttccctgtga tacaagaagg 4921 tgatttcatg gtgaagaagt gaaaaggttt tctcagtgtg cagtagcacc aggtccctct 4981 agtccagtta acattctctc aaatatacac atcttttctc ataaatatgt gcaagccatg 5041 agaggctaca gtgaaaggtg aagtttgggc ctgggtagag gagacagggg ccataaagct 5101 gactgctggt ctcctccctg gcaggtagat aagggtgtgg tgcccctggc aggaaccaat 5161 ggcgagacaa ctacccaggg taagaatgat ctgcctgcct ccttcccttc tccaccagct 5221 catcagagtt ccagagtgag tctgatcaaa agccttctct ttattcttcc ccttcagggc 5281 tggatgggct gtctgaacgc tgtgcccagt ataagaagga tggagccgac tttgccaagt 5341 ggcgctgtgt gctaaagatt ggggaacata ctccctcggc cctggccatc atggaaaatg 5401 ccaatgttct ggcccgttat gccagcatct gccagcaggt gggattggac tacttcctaa 5461 cacattgatg cagcgcgggc tagctttctg tctatctgcc aggatatctg cctcctcaga 5521 gcagctgctc tcaatacccg ctgtggccag gtcttgagtg gaggtctgca atgtagaggt 5581 ggcaacaggt gtacaggcag attgatagga ttgcttgtcc cctgtaaact gctgaggcct 5641 ttgaagcctg ggtctctgtc atcaagttaa tggtgaggag gctcctagtc aggaggcctt 5701 gcctcattac cctgtccctc ccacagaatg gcattgtacc cattgtggag cctgaaattc 5761 tccctgatgg ggaccatgac ttgaagcgct gccagtatgt tactgagaag gtagtgccat 5821 ctgctgtaga tagtgtgtgc tgcgcgtagt atcgtttcac ttctcgtctg cnnnnnnnnn 5881 nnnnnnnnnn nccctgctgt cttccaggtc ctggcggctg tctacaaggc tctgagcgac 5941 caccatgtct atctggaagg cacattgctg aagcccaaca tggtcacccc tggccatgct 6001 tgcacccaga aattttccaa tgaggagatt gccatggcaa cggtcacagc acttcgtcgc 6061 acagtgcccc ctgctgtcac tggtgaggcc actcctcatc ttggtggtga ggtggatgca 6121 ccatcacatt tnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 6181 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 6241 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 6301 nnngccatgg gccttgactt tctcctatgg tcgagccctg caggcctctg ctctaaaggc 6361 ctggggtggg aagaaggaga acctgaaggc agcccaggag gagtacatca agcgcgccct 6421 ggtaaggcag gcaggcaggc gtggaagtgt gaacaggtgc ctgggcgggg tggggaggga 6481 ctcaagaaga gaattcctct gattcctctt ccttttaggc caacagcctc gcttgtcaag 6541 gaaagtatac cccaagtggc cagtctggag ccgcagccag tgaatctctc ttcatctcta 6601 accatgccta ctaaccagag ctgaactaag gctgctccat caacactcca ggcccctgcc 6661 tacccacttg ctattgaaga ggggtcttca ggctctttcc catcactctt gctgccctcg 6721 tgtgcggtgt tgtctgtgaa tgctaaatct gccatccctt ccagcccact gccaataaac 6781 aactatttaa gggggagtct gttgttcatg tcttgtaggg tataggggag ggctgaggaa 6841 agagctactt gggttcttct tcttggacag taaaaggaag gggttttttg accagagctt 6901 tgagaaaggc atagtattat gggatgttct ttgcctacat ctaattgaag gtaactttta 6961 cactaattaa tattcagttt aagccaacca agggcttatg aatacttggc aaggattgta 7021 tcagggctaa cacatttatg cgttttgggg actatggagc tttggagacg agatctctct 7081 gcagtgacat aggtatacag ctcactgcag aactcttggg ttccaggttg agaatggagc 7141 ctcagagctg ctgatgttcc ctggtgatag aataagaagc acatcaaacc atgggccact 7201 gtatcttgcc acattatatt gagtgtagtc ggtgtgctag tgcacacttt aatccagcac 7261 tcaggaggca gaggcaggca ggaggcaact ggaactcaca aagtgagttc caggacagcc 7321 agggctatac agagaaaccc tgtcttgaaa aaaaaaaatt ctggcctaaa tgaatggata 7381 cagtgtatct gcctttggag gccaaaaggc gtgtatcaag tgctagcttc tggcaagata 7441 agaaacctta aggagtaggg cttcgactat actcagtagc agagtcttgc atggtactca 7501 tggttgtgag cacatgtggt gctaactgct gagtctctct cagtccatca tactctagta 7561 tatagtcaga gactctagat actgacgact agactagact cgtcgtctnn nnnnnnnnnn 7621 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 7681 nnnnnnnnnn nnnnnnnnnn nnnnnnnntt ccttcccaag catctttttc tttgacactt 7741 tcgttttcag tgatctgcgt agaattgtct tactaggagt atcaaagcat agtctccact 7801 gtcctaatat tcccatgtat tggccaatag tcaaagctat gcgcaggctg tggatagagc 7861 ccagtggctg agtacccaaa gctctggttc cttccccagt gctgcaaggg aaaactcaaa 7921 tccctatgct tccccaaact tcagcctccc attttactgc tcatcacgta cttgtagcct 7981 tgctctctag aattctgtag cccacactgg ccttgaactc tcaagatctg ctttccaagt 8041 actgggatga aaggcatgtg ctattctcct agcttctatg aggcgatcct ttttatttta 8101 tatacattgg tattaactga atgtgtgtat gtgtgtagtg tgatccggta cgagctcgag 8161 cgtatagtga gtcgatacat catgcgcgct // LOCUS MUSCR2AA 2102 bp ss-mRNA ROD 15-AUG-1990 DEFINITION Mouse complement receptor (Cr2) gene, 5' end. ACCESSION M36470 KEYWORDS Cr2 gene; complement receptor. SOURCE Mouse (strain Balb/c) spleen, cDNA to mRNA, clone 31-1. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 2102) AUTHORS Kurtz,C.B., O'Toole,E., Christensen,S.M. and Weis,J.H. TITLE The murine complement receptor gene family. IV. Alternative splicing of Cr2 gene transcripts predicts two distinct gene products that share homologous domains with both human Cr2 and Cr1 JOURNAL J. Immunol. 144, 3581-3591 (1990) STANDARD simple staff_review FEATURES from to/span description pept 67 > 2102 complement receptor (Cr2) BASE COUNT 590 a 472 c 452 g 588 t ORIGIN 1 ctcttcctct ccttgctaca ggctcacaac tcacagagcg caacctgcca ttggactgct 61 gcacacatgg gatccttggg ttcgctctgg gttttcttca ctctcatcac tccaggagtt 121 cttggtcagt gtaagttgct gccaaagtat tcttttgcta aaccttctat tgtgagtgat 181 aaatctgagt ttgccattgg aacaacttgg gaatacaaat gtcgccctgg gtattttagg 241 aagtcattta ttatcacctg cttagaaacc tccaagtggt cagatgctca gcagttctgt 301 aaacgtaaac catgtatgaa tcctcaagaa cccctccatg gttctgtgca tataaacacg 361 ggtatcgagt ttgggtcaac aattacgtat tcttgtaatc aaggatatcg actcattggt 421 gactcgtctg ctacatgtat tgtatcagac aatactgtaa tgtgggataa tgatatgcct 481 ctttgtgaat ctattccttg tgagtcacct ccagccatct ccaatggaga cttctacagc 541 agcagcagag acagcttttt ctatgggatg gtagtaactt attattgcca taccggaaag 601 aatagggaaa aactgtttga tctggtgggt gagaagtcaa tatattgtac cagcaaagac 661 aatcaagttg gcatctggaa tagtccacct cctcagtgta ttcctagagt caagtgccca 721 atgccagaaa ttgaaaatgg actagtggag tctggattta aacactcctt cttcttaaat 781 gatacagtaa tatttaagtg caaatctggc tttaccatga aaggcagcag aatagcatgg 841 tgccagccaa acagcaaatg gagccctcca ttgccaacat gcttcatggg atgtctacca 901 cctcaaaata tcctccatgg tgattataac aaaaaggatg agttcttctc tgttggccag 961 aaagtgtcat atacgtgtaa ccctggctat actctcattg gaactaacct cgtggagtgt 1021 acatccttgg gaacctggag caatacagtc ccgacatgtg aagtgaaatc atgtgatgca 1081 attccaaacc atcttctcca tggccgtgtg tttcttcccc ctaatctcca gcttggggca 1141 gaggtttcct ttgtttgtga cttagggttc cagttaaaag gcaaaccttc tagtcagtgt 1201 atcccagaag gagagacagt aatctggaat aataagtttc ctgtctgtga acagatttct 1261 tgtgaccctc ctcctgaagt caaaaatgct cggaaaccct attattctct tcccatagtt 1321 cctggaactg ttctgaggta cacttgttca cctagctacc gcctcattgg agaaaaggct 1381 atcttttgta taagtgaaaa tcaagtgcat gccacctggg ataaagctcc tcctatatgt 1441 gaatctgtga ataaaaccat ttcttgctca gatcccatag taccaggggg attcatgaat 1501 aaaggatcta aggcaccatt cagacatggt gattctgtga catttacctg taaagccaac 1561 ttcaccatga aaggaagcaa aactgtctgg tgccaggcaa atgaaatgtg gggaccaaca 1621 gctctgccag tctgtgagag tgatttccct ctggagtgcc catcacttcc aacgattcat 1681 aatggacacc acacaggaca gcatgttgac cagtttgttg cggggttgtc tgtgacatac 1741 agttgtgaac ctggctattt gctcactgga aaaaagacaa ttaagtgctt atcttcagga 1801 gactgggatg gtgtcatccc gacatgcaaa gaggcccagt gtgaacatcc aggaaagttt 1861 cccaatgggc aggtaaagga acctctgagc cttcaggttg gcacaactgt gtacttctcc 1921 tgtaatgaag ggtaccaatt acaaggacaa ccctctagtc agtgtgtaat tgttgaacag 1981 aaagccatct ggactaagaa gccagtatgt aaagaaattc tctgcccacc acctccacct 2041 gttcgtaatg gaagtcatac aggcagcttt tcagaaaatg taccatatgg aagcacagtt 2101 ac // LOCUS NEUALCA 1639 bp ds-DNA PLN 15-AUG-1990 DEFINITION N.crassa allantoicase (alc) gene, complete cds. ACCESSION J02927 KEYWORDS allantoicase. SOURCE N.crassa (strain Oak Ridge), clone pALC-1. ORGANISM Neurospora crassa Eukaryota; Plantae; Thallobionta; Eumycota; Ascomycotina; Pyrenomycetes; Sordariales; Sordariaceae. REFERENCE 1 (bases 1 to 1639) AUTHORS Lee,H., Fu,Y.-H. and Marzluf,G.A. TITLE Nucleotide sequence and DNA recognition elements of alc, the structural gene which encodes allantoicase, a pirine catabolic enzyme of Neurospora crassa JOURNAL Biochemistry (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by G.A.Marzluf, 12-JUL-1990. FEATURES from to/span description pept 250 340 allantoicase (alc), exon 1 413 1386 allantoicase (alc), exon 2 IVS 341 412 alc intron A signal 129 135 TATA box BASE COUNT 383 a 441 c 443 g 372 t ORIGIN 1 cgttgcagat cgaatacgac ggttaggtac gacgaagaag gaccacgatt gtcgttgctg 61 ttacgtactt tgacctcctc aacgcactat cttgcttaag ctatcgctct tgtctgtcgc 121 tgtggtgata taaattctgc gcctgctctt ggtttattcc gaggacgctc gttccatctc 181 tgtttttttt ttctctctgt gacatcgagg actgaagtct cacttattca aatacacatt 241 tccctcacca tgaccgacat cgattacaag ctcgaggctg ttccggccac tcggattgcc 301 gccgatgata tcgacaagac tttccgttcc agcaccatcg gtccgtagca tccatctcac 361 caaacatggc aacccaaacc tttcaactaa cggaagtcga gctgggatac agatcttatc 421 tcaggggctc tcggtggcaa ggtttccggt ttctcggacg aatggttcgc cgaagcagcc 481 aacctcctca ctcctacagc cccaatccgc cagccgggaa agatggttta caccggcgcc 541 tggtatgacg gatgggagac aaggagacac aaccctgccg agttcgactg ggttgtgatc 601 cgtctgggcg tcgcctcggg taccgtcgag ggtgtcgaga ttgacacggc tttcttcaac 661 ggcaaccatg cgcccgccat ctcggtcgag ggttgcttca gccaaaacga cgatgaggtt 721 ctgtcatgga agggcgagct gggtggatgg gagactattc ttggcgttca agagtgcggc 781 ccttcgcaga gattctgctg gaaactcgag aaccctacca agaagcagta cacccatgtg 841 cgactaaaca tgtaccccga cggcggcatt gccaggttcc gtctgtttgg acacgccgta 901 ccggtcttcc ccgacaatac ggatgccatc tttgacttgg cggctgccca gaacggcgga 961 gttgcgatct cctgcagtga ccagcacttt ggtaccaagg acaaccttat ccttccgggc 1021 cgcggcaagg acatgggcga cggttgggag acagcacgct cgcgcaccaa gggccacgtc 1081 gactggacca tcatcagact cggcgcgccc ggctacattc agaatttcat ggtcgacacg 1141 gctcacttcc gcggtaacta cccccagcag gtcaagctgc aacgtatcga gtggaagagc 1201 gagggcaggc cgggagcgga ttctgagggc tggacagagg ttgttgagcc catcaagtgc 1261 ggtcccgatc aggaacaccc tgtcgagagc ttggtgaagg acaagccgtt cacccacgtc 1321 aagctcatca ttgtgcctga cggcggagtg aaaagactgc gggtgtttgc gaagagggct 1381 gtttaagaaa ttaccaagct atatatctga aggcaattat tcggtgagag cagcatttac 1441 ggggagccat caacagcgag cgatccacat aaaaaggggg aggacctcat ttagtatgat 1501 gggcaacgag tgcagtcatt tagccgcgaa gaatcgaaat ctctcagatc tttgattgtc 1561 tgcgcttaag taacaaagtc taattctcaa tcagctttcg tcgtagagta aaattagaag 1621 gatgcacggc tgcccacga // LOCUS RATINHA 1561 bp ss-mRNA ROD 15-AUG-1990 DEFINITION Rat inhibin alpha-subunit mRNA, complete cds. ACCESSION M36453 KEYWORDS inhibin. SOURCE Rat female (strain Sprague-Dawley) ovary, cDNA to mRNA, clone rINA-13. ORGANISM Rattus rattus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 1561) AUTHORS Woodruff,T.K., Meunier,H., Jones,P.B.C., Hsueh,A.J.W. and Mayo,K.E. TITLE Rat inhibin: Molecular cloning of alpha- and beta-subunit complementary deoxyribonucleic acids and expression in the ovary JOURNAL Mol. Endocrinol. 1, 561-568 (1987) STANDARD simple staff_review FEATURES from to/span description pept 256 1356 inhibin alpha-subunit precursor sigp 256 954 inhibin alpha-subunit signal peptide matp 955 1353 inhibin alpha-subunit mRNA < 1 1561 inhibin alpha-subunit mRNA BASE COUNT 308 a 465 c 440 g 348 t ORIGIN 1 ggacactaga atgctgtgtt gttagaggag tggagagagg aagatgtgct aagtgtagca 61 gtacacacct ataatcctag cacttgagag gttgaaggca ggaggatgag acattcaggt 121 cattcttagc tacatgaaga gtttaaggcc agcacggatt acaggatatc tgtttctggg 181 gaaaaaggag gggaagagag agaggaaagg gcaaagggca gagtgtgggc tccctgtcgt 241 cagggcaaga gaactatggt gatccagccg tctctgctgc tccttttgct gttgactcta 301 caggatgtgg acagctgcca ggggccagaa cttgtccggg agcttgtcct ggccaaagtg 361 aaggcactat tcctagatgc cttggggccc ccagcaatgg atggggaagg tgggggtcct 421 ggaataaggc ggctgcctcg aagacatgcc cttgggggct tcatgcacag gacctctgaa 481 ccagaggagg aggatgtctc ccaggccatc cttttcccag ccacaggtgc cacctgtgag 541 gatcaggcag ctgctggagg gcttgcccag gagcctgagg aaggtctctt cacttatgta 601 ttccggccat cccaacacat acgcagccac caggtgactt cagcccagct gtggttccac 661 acggggctcg acaggaagag cacagcagcc tccaatagct ctaggcccct gctagatctt 721 ctggtgctgt catctggggg gcccatggct gtgcctgtgt ccttgggaca gagcccccca 781 cgctgggctg tcctgcacct ggcggcctcc gctttccctc tgttgaccca ccccatcctc 841 gtgttgctgc tgcggtgccc actctgttct tgctcaggcc ggcctgagac cactcctttc 901 ctggtggccc acactagggc tcgagccccc agtgcggggg agagggctcg acgttcagct 961 ccctcgatgc cttggccttg gtctcctgca gccttgcgtt tgctgcagag gcctccagag 1021 gaaccctctg cccatgcctt ctgccatcga gctgccctca acatctcctt ccaggagctg 1081 ggctgggacc gctggatcgt acaccctccc agcttcattt tccactactg ccatggtagc 1141 tgcgggatgc ccacatctga tctgcccctg ccagtccctg gggctccccc taccccggct 1201 cagcccctgt ttttggtgcc aggggccaag ccctgctgtg cagctctacc agggagcatg 1261 aggtccctac gcgtccgaac cacctcagat ggaggctact ctttcaagta tgagatggta 1321 ccgaacctca ttacacaaca ctgtgcttgt atctaaaagc acctcgtctc ctcctccaca 1381 gccactggcc accatcacct caccatccca cggtcggtcg gtcggtcggt cgtcagctag 1441 gaggaaggtg ggtgtggaaa gtagacagtt tccacttcct tttcccttca tctttctgtc 1501 tgaggcttcc acaccccact ccacccaggt cctgtggata acaataaaga aggaagtgtg 1561 t // LOCUS RATINHB 1543 bp ss-mRNA ROD 15-AUG-1990 DEFINITION Rat inhibin beta-A-subunit mRNA, complete cds. ACCESSION M37482 KEYWORDS inhibin. SOURCE Rat female (strain Sprague-Dawley) granulosa cell, cDNA to mRNA, clone rINB-5. ORGANISM Rattus rattus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 1543) AUTHORS Woodruff,T.K., Meunier,H., Jones,P.B.C., Hsueh,A.J.W. and Mayo,K.E. TITLE Rat inhibin: Molecular cloning of alpha- and beta-subunit complementary deoxyribonucleic acids and expression in the ovary JOURNAL Mol. Endocrinol. 1, 561-568 (1987) STANDARD simple staff_review FEATURES from to/span description pept 163 1437 inhibin beta-A-subunit precursor sigp 163 1086 inhibin beta-A-subunit signal peptide matp 1087 1434 inhibin beta-A-subunit mRNA < 1 1543 inhibin beta-A subunit mRNA BASE COUNT 435 a 356 c 454 g 298 t ORIGIN 1 ctctgacctc atgagacaag agccggctgg caaaacagaa gggacccgaa agagaatttg 61 ctgaagagga gaaggaaaaa agtccaaaaa acctgtacgt gaggggtggg gaggaaaagc 121 agggccttta aagaaggcaa ccacacgact tttgctgcca ggatgccctt gctttggctg 181 agaggatttc tgttggcaag ttgctggatt atagtgagga gttcccccac cccaggatcc 241 gaggggcacg gcgcagcccc ggactgcccg tcctgtgcgc tggccaccct tccgaaggat 301 ggacctaact ctcagccaga gatggtagag gctgtcaaga agcacatctt aaacatgctg 361 cacttgaaga agagacccga tgtcacccag ccggtaccca aggcggcgct tctcaacgcg 421 atcagaaagc ttcatgtggg taaagtgggg gaaaacgggt atgtggagat agaggacgac 481 attggcagga gggccgaaat gaatgaactc atggagcaga cctcggagat catcaccttt 541 gccgagtcag gcacagccag gaagacactg cattttgaga tttccaagga aggcagtgac 601 ctgtcagtcg tggagcgtgc agaagtctgg ctcttcctga aagtccccaa ggccaacagg 661 accaggacca aagtcaccat ccgtctgttt cagcagcaga agcatccaca gggcagcttg 721 gacatggggg atgaggccga ggaaatgggc ttgaaggggg agaggagtga actgttgcta 781 tcagagaaag tggtagatgc tcggaagagc acttggcaca tcttcccagt gtctagcagc 841 atccagcgcc tgctggacca ggggaagagt tccctggatg tgcggattgc ttgtgaacag 901 tgccaggaga gcggtgccag cctagtgctc ctgggcaaga agaagaagaa agaggtggat 961 ggagacggga agaagaaaga cggaagtgac ggagggctgg aagaggaaaa agaacagtca 1021 cacagacctt tcctcatgct gcaggctagg cagtctgaag accatcctca ccgcaggcgt 1081 aggcggggct tggagtgtga tggcaaggtc aacatttgct gtaagaaaca gttctttgtc 1141 agcttcaagg atattggctg gaatgactgg atcattgctc cctctggcta tcatgccaac 1201 tattgtgagg gtgagtgccc aagccacata gcaggcacct ctgggtcctc actctccttc 1261 cactcaacag tcattaacca ctaccgcatg aggggtcaca gcccctttgc caaccttaag 1321 tcatgctgtg tgcccaccaa gctgagaccc atgtccatgc tgtattatga tgatggtcaa 1381 aacattatca aaaaggacat tcagaacatg attgtggagg agtgtggctg ctcctagagt 1441 tgccaggtcc cagagcaaat ggatctaggg tgtccaggaa aagacagtgg caaatgaaga 1501 aaaatatata agatttctgc ctaaacaaga caaccagaaa aat // LOCUS RSBMNP 1201 bp ss-RNA VRL 15-AUG-1990 DEFINITION Bovine syncytial virus major nucleocapsid protein (N) mRNA, complete cds. ACCESSION M35076 KEYWORDS major nucleocapsid protein. SOURCE Bovine syncytial virus (strain A51908) MDBK cell, cDNA to mRNA. ORGANISM Bovine syncytial virus Viridae; ss-RNA enveloped viruses; Positive strand RNA virus; Spumavirinae. REFERENCE 1 (bases 1 to 1201) AUTHORS Samal,S.K., Zamora,M., McPhillips,T.H. and Mohanty,S.B. TITLE Molecular cloning and sequence analysis of bovine respiratory syncytial virus mRNA encoding the major nucleocapsid protein JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by S.K.Samal, 12-JUL-1990. Author address: S.K.Samal Univ Maryland at College Park Dept. Veterinary Medicine College Park, MD 20742 FEATURES from to/span description pept 16 1191 major nucleocapsid protein mRNA 1 1200 major nucleocapsid protein mRNA BASE COUNT 434 a 196 c 270 g 301 t ORIGIN 1 ggggcaaata caaaaatggc tcttagcaag gtcaaactaa atgacacttt caacaaggat 61 caactgttat caaccagcaa atatactatt caacgtagta caggtgacaa cattgatata 121 cccaattatg atgtacaaaa acatctcaat aagttgtgtg gtatgctact aataacagaa 181 gatgccaatc ataaatttac aggattgata ggtatattat atgctatgtc ccgattgggg 241 agagaagata cccttaaaat actcaaagat gcaggctacc aagtaagggc caatggggtt 301 gatgtgataa cacatcgaca ggatgtgaat ggaaaagaaa tgaaatttga agtgctaaca 361 ttagtcagct taacatcaga agttcaaggc aatatagaaa tagagtcaag gaagtcttac 421 aaaaagatgc taaaagagat gggagaggta gccccagaat acagacatga ctctcctgat 481 tgtggtatga tagtgctatg tgttgctgct ttggttataa caaaattagc agcaggtgat 541 agatcaggcc tcactgcagt cattaggaga gccaacaatg tactaaggaa tgaaatgaaa 601 cgatacaaag gacttatccc gaaagatata gctaacagct tctatgaagt gattgaaaag 661 taccctcatt acatagatgt attcgtacat tttggcattg ctcaatcctc aactagagga 721 ggtagtaggg tagaaggaat ctttgcaggg ttattcatga atgcatatgg agcaggtcaa 781 gtgatgttaa gatggggtgt attagccaaa tcagtcaaga acattatgct tggtcatgcc 841 agcgtgcaag cagaaatgga acaggttgta gaggtctatg aatatgcaca aaagttaggt 901 ggagaagctg gtttttatca catattgaac aaccctaaag catcactgtt atccttaaca 961 caattcccca acttctctag tgtagtccta ggcaatgctg caggactagg tataatgggt 1021 gagtatagag gtacaccaag aaaccaagac ttgtatgatg ctgccaaagc atatgcggaa 1081 caattaaaag agaatggggt catcaattac agtgtattag atctgactac agaggaacta 1141 gaggcaatca agaaccaatt gaatcccaaa gacaatgatg tggaactgtg agttaataaa 1201 a // LOCUS URELOCAB 558 bp ds-DNA BCT 15-AUG-1990 DEFINITION U.urealyticum urease locus proteins A and B, complete cds. ACCESSION M36190 KEYWORDS urease locus-encoded protein. SOURCE U.urealyticum (serotype 8) DNA. ORGANISM Ureaplasma urealyticum Prokaryota; Bacteria; Tenericutes; Mollicutes; Mycoplasmas; Mycoplasmatales; Mycoplasmataceae. REFERENCE 1 (bases 1 to 558) AUTHORS Willoughby,J.J., Russell,W.C., Thirkell,D. and Burdon,M.G. TITLE PCR primers that detect Ureaplasma species and a study of the urease locus by 'PCR walking' JOURNAL Unpublished (1990) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by J.J.Willoughby, 27-JUN-1990. Author address: J.J.Willoughby University of St. Andrews Biochemistry and Microbiology North Street St. Andrews, Fife KY16 9AL SCOTLAND FEATURES from to/span description pept 23 349 urease locus protein A pept 436 522 urease locus protein B BASE COUNT 209 a 78 c 111 g 160 t ORIGIN 1 tttataagga gataatgatt atatgtcagg atcatcaaat caattcactc caggtaaatt 61 agtaccagga gcaattaact tcgctgaagg cgaaaatgtg atgaacgaag gtagagaagc 121 aaaagtaatc agcattaaaa atactggtga ccgtcctatc caagttggat cacatttgca 181 cttatttgaa acaaatagtg cattagtatt ctttgatgaa aaaggaaacg aagacaaaga 241 acgtaaagtt gcttatggac gtcgtttcga tattctcagt actgctattc gttttgaacc 301 aggagacaaa aaagaagttt cagttattga tttagtcgga acacgttgaa gtttgaggtg 361 taaacggctt agttaacggc aaaaccttaa aaaataatct atttacaagt ttctatatag 421 acgaagggga acattatgtt taaaatttca agaaaaaatt actcagatct atatggtatc 481 acaactggtg atagcgttag attaggagac acaaatcttt gagttaaagt tgaaaaagac 541 ttaactactt atggcgaa // LOCUS YSCFUR1A 2123 bp ds-DNA PLN 15-AUG-1990 DEFINITION S.cerevisiae uracil phosphoribosyltransferase (FUR1) gene, complete cds. ACCESSION M36485 KEYWORDS uracil phosphoribosyltransferase. SOURCE S.cerevisiae (strain FL100, ATCC 28383) DNA. ORGANISM Saccharomyces cerevisiae Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes; Endomycetales; Saccharomycetaceae. REFERENCE 1 (bases 1 to 2123) AUTHORS Kern,L., de Montigny,J., Jund,R. and Lacroute,F. TITLE The FUR1 gene of Saccharomyces cerevisiae: Cloning, structure and expression of wild-type and mutant alleles JOURNAL Gene 88, 149-157 (1990) STANDARD simple staff_review FEATURES from to/span description pept 895 1650 uracil phosphoribosyltransferase (FUR1) mRNA 886 1791 FUR1 mRNA (alt.) mRNA 888 1791 FUR1 mRNA (alt.) signal 841 848 TATA box signal 1896 1901 poly-A signal BASE COUNT 659 a 427 c 392 g 645 t ORIGIN 1 atcgataaaa gaactaatgt ttcccaaaga aataggaaaa agggaataaa gaataatagg 61 ccccacaaag acataaacag cagtcctgac tggggcaact gcacagagga accgattggc 121 agagcgaaaa agcaaacggc atgaacaggg ccaagaactc tcggaatttt accactaata 181 ttaaattgca gcgacaacat tttggcgaag aaatacaagg tggccagcca gccttgtgat 241 atctacaaat tcagatgctt cagataaatt gttaatgcta ttcaacctaa ctttgggagt 301 aaaccaagaa aacttgaaaa atgttctgga aaacatttct caggtgcaga tagctcaaat 361 tagggttaga gacctgcctt caggatctgc caccgctaag gtccgtctgg catatcctac 421 aacacagtct ttggagaagg taagaaaact gttccatggc gctctagttg atggaaggcg 481 catccaagtg gtgattgcat ctgatgaatc gtcccacttg tcgtattaga gtttgtcaac 541 gacactcaca aggtatttaa tcagcaaaat ccccgccaca aactattttt ttgaagacat 601 gctttctcat gactgcctaa taacaatacc tcattctact agtaatcgac ctatgtaatt 661 atttcataaa ctataaagca ggtcattgca ataacagaaa ggccggtttt tctataagct 721 tatctcatcg cataaaaaat cgacagttgt aattatctcc ggcggacttt tccctttccg 781 tctttttttt caaaattttt ttttttttca cttcttcttt caaagctgcc tcaaaagaga 841 tatatatatt ggtaagaatc ctcttccaat actagcttca tttcttcttg aaccatgaac 901 ccgttattct ttttggcttc tccattcttg taccttacat atcttatata ttatccaaac 961 aaagggtctt tcgttagcaa acctagaaat ctgcaaaaaa tgtcttcgga accatttaag 1021 aacgtctact tgctacctca aacaaaccaa ttgctgggtt tgtacaccat catcagaaat 1081 aagaatacaa ctagacctga tttcattttc tactccgata gaatcatcag attgttggtt 1141 gaagaaggtt tgaaccatct acctgtgcaa aagcaaattg tggaaactga caccaacgaa 1201 aacttcgaag gtgtctcatt catgggtaaa atctgtggtg tttccattgt cagagctggt 1261 gaatcgatgg agcaaggatt aagagactgt tgtaggtctg tgcgtatcgg taaaatttta 1321 attcaaaggg acgaggagac tgctttacca aagttattct acgaaaaatt accagaggat 1381 atatctgaaa ggtatgtctt cctattagac ccaatgctgg ccaccggtgg tagtgctatc 1441 atggctacag aagtcttgat taagagaggt gttaagccag agagaattta cttcttaaac 1501 ctaatctgta gtaaggaagg gattgaaaaa taccatgccg ccttcccaga ggtcagaatt 1561 gttactggtg ccctcgacag aggtctagat gaaaacaagt atctagttcc agggttgggt 1621 gactttggtg acagatacta ctgtgtttaa ataaatcaca cccgaacacc atcttgaagg 1681 ttcagaacgg ctgaagccat atcaactttg ggtttctact gttttaaatt tcctttctcg 1741 ttttaaactt ttgttgccgt ctcttctact atcaattttt gttgttcatg catgtttaat 1801 tacctttttt gtaaaaataa tataaacgta ccaatggtca tttataacaa atatgcttga 1861 aaaatctaac gactctgttt cttacattag gttcgaataa acacggtaca tgtcctctag 1921 ccaatctgac atttttggtc caaagtcttt gaaaggtaga taaccccgtt aaaatagaac 1981 caccaatcca tgtagtatat tttctttctg aaggggctat aatctttatc taggatgttc 2041 ctttggttaa tgcctccaaa tcccatagca ttcggtctcc aaagccttta agcgttgtag 2101 ttccgccact taggattatc gat // LOCUS YSCMET16A 1986 bp ds-DNA PLN 15-AUG-1990 DEFINITION S.cerevisiae 3'-phosphoadenylyl sulfate reductase (MET16) gene, complete cds. ACCESSION J05591 KEYWORDS 3'-phosphoadenylyl sulfate reductase. SOURCE S.cerevisiae DNA. ORGANISM Saccharomyces cerevisiae Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes; Endomycetales; Saccharomycetaceae. REFERENCE 1 (bases 1 to 1986) AUTHORS Thomas,D., Barbey,R. and Surdin-Kerjan,Y. TITLE Gene-enzyme relationship in the sulfate assimilation pathway of Saccharomyces cerevisiae: Study of the 3'-phosphoadenylylsulfate (PAPS) reductase structural gene JOURNAL J. Biol. Chem. (1990) In press STANDARD simple staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by Y.Surdin-Kerjan, 28-JUN-1990. FEATURES from to/span description pept 792 1562 3'-phosphoadenylyl sulfate reductase (MET16) signal 686 692 TATA-box site 643 648 cis-acting element in general control of AA synthesis site 612 618 UAS (methionine metabolism) BASE COUNT 670 a 368 c 379 g 569 t ORIGIN 1 atgcatcttg cctctttgat attggttgga tcttcttatg gcttccacga actctcttgt 61 gtaaatatct ggatttctac cgtcctcaat gtattgaaca acttccaagg gaatgtccac 121 cttagacaag ctggattgag gatcgttgct tctcacgttc agcttgtaca agcgatccac 181 atttctttgc aagttggtga tcattccctt ggtggcttct ggagtaccag gaaaatcata 241 tatcgagaca cctaattcaa cgaaggactc aataatcgaa gccacttggt cttgagtagt 301 ggccagttct tgctgcaatt gttcattgtt agtgctgttt ccattcatct tatcggttta 361 tttttctata tatttgcctc tttctcaaac aggagttagt agttaaaagt acgaagttct 421 tgttctttaa tgcgcgctga caaaagaatt ggataaaaga gaatggtggg gggacaagaa 481 ggaaatttgt cctagtttaa catgaatggc atcttgttac cgggtggaca tcacctattg 541 attctaaata tctttacggt ttatcatact gttctttatt ccgtcgttat tctttttatt 601 tttatcatca tttcacgtgg ctagtaaaag aaaagccaca acatgactca gcaaatctcg 661 acaaagtaaa agctcataga gatagtatta tattgatata aaaaaagtat actgtactgt 721 ttgtaacctt ttcaatgctt taagatcaaa actaaggcca gcaaaggtat caacccatag 781 caactcataa aatgaagacc tatcatttga ataatgatat aattgtcaca caagaacagt 841 tggatcattg gaatgaacaa ctaatcaagc tggaaacgcc acaggagatt attgcatggt 901 ctatcgtaac gtttcctcac cttttccaaa ccactgcatt tggtttgact ggcttggtta 961 ctatcgatat gttgtcaaag ctatctgaaa aatactacat gccagaacta ttatttatag 1021 acactttgca ccatttccca caaactttaa cactaaaaaa cgagattgag aaaaaatact 1081 accagcctaa aaatcaaacc attcacgtat ataagccgga tggatgtgaa tcggaggcag 1141 attttgcctc gaaatacggg gatttcttat gggagaaaga tgatgacaag tacgattatc 1201 tggccaaagt ggaacctgca catcgtgcct acaaagagct acatataagt gctgtgttta 1261 ctggtagaag aaaatcacaa ggttctgccc gctcccaact gtcgattatt gaaatagacg 1321 aacttaatgg aatcttaaaa ataaatccat tgatcaattg gacgttcgag caggttaaac 1381 agtatataga tgcaaacaat gtaccataca acgaactttt ggaccttgga tatagatcca 1441 ttggtgatta ccattccaca caacccgtca aggaaggtga agatgagaga gcaggaagat 1501 ggaagggcaa ggcaagaccg agtgtggaat tcatgaagcc agccgattcg cgcaattttt 1561 aaagcaagat gcctagatag atagagtacg atatataacc atatgtatgt gactaattat 1621 ttattcctta ataacaccaa tgattacaac tttctaaagc tggcggagaa ttcgcgctgt 1681 acgagaaaag agcgaaaaca gaggaatatt caaactaaga accaaactgc gataaagagg 1741 attgaaagga aaaacgaaag aaaaggtaaa ctgacaaata tatacattaa ccgatgggta 1801 atttcagatt tcctataaaa accaagctac caccagggtt tatcaatgct cgcatactta 1861 gggataactt caaaagacaa caatttaaag agaatgaaat ccttgttaaa tctttgaaat 1921 tcatcgctag aaatatgaac cttccaacaa aactgaggtt ggaggctcag ttaaaactaa 1981 atgcat // LOCUS CHT59KD 2429 bp ds-DNA BCT 15-AUG-1990 DEFINITION C.trachomatis 59-kDa immunogenic protein (SK59) gene, complete cds. ACCESSION M31119 KEYWORDS antigen; immunogenic protein. SOURCE C.trachomatis L2 (strain LGV-2 434BU) elementary body DNA, clone beta-1. ORGANISM Chlamydia trachomatis Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Rickettsias and Chlamydias; Chlamydiales; Chlamydiaceae. REFERENCE 1 (bases 1 to 2429) AUTHORS Kahane,S., Weinstein,Y. and Sarov,I. TITLE Cloning, characterization and sequence of a novel 59-kDa protein of Chlamydia trachomatis JOURNAL Gene 90, 61-67 (1990) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by Y.Weinstein, 05-JAN-1990, for release after publication. FEATURES from to/span description pept 466 2043 59-kDa immunogenic protein BASE COUNT 700 a 626 c 417 g 686 t ORIGIN 1 ggatcccgaa ttgggtaact ctcagaccca cacataaggc catatgctcg agtacgtgag 61 ccactccact agaatcttgc gggcaagtcc gaaaagaaat attgaacaca ttttcatcat 121 catcattcac gatcatcatg atcgttgccc cggtcggagt atgttccact tcaattagct 181 tgctctcgat ctcgggaaga tcctgactca acttgactac aaaatttcta taggtatccc 241 agttttcata tcccactcaa tcttctataa tagagaagct tgttgcatct ccctattttc 301 gattcaccta acatagaaga cagctactgt gagctcttat atccacacaa atattctttc 361 tgaaggcttc tcttattaaa aaaaaagacg ggactcgatt gagtccccat actagactag 421 cttcctaaaa tataaggcca ggactactcg tctgatttca agacgatgaa tcgcaccaca 481 tctcccttga gaaaccataa ggagaacatt ctctcccttt cgagtttttc aaaacctgat 541 ttaactcttc aacggaagcg acctcgctgc ctattcaccg ctaagataag ctgtccagga 601 gcgacgcctg cagaagctgc aggcgagctg cctccacagc aactaccaga atcctcgggt 661 atctgctgcc aatccgagtt tcttacaaat ttctggagta atgtaagtct cacggactcc 721 catcttctgg caacgctgaa acgccatcct ctgttgggtc tgtgtaaccg tcacaggtat 781 ctcgcttgtt ttcccttcac gacgattttt aaaataacac gagtccctgg catcattagg 841 gaaatggcat tacgcaacgc actcaaagac tctacttctt tttccattgt aagccacaat 901 gacatcttct tccagccccg ctttttctgc tggagaacct ttaacaacat ccgtcaccaa 961 acgttccgta cacttttcca atttgtaaca agtagccaat tcagaatcta tcggttgcaa 1021 ggtaactccc aaaaagcctc ttgttacctg cccatcacta atcaattgat caatgactcg 1081 tttagccatc aagctaggaa tagcaaaccc tattccaata tatcccccgc taccactgac 1141 aatggcagta ttaaccccga taacttgacc attgattgtt taacaatgga ccgccctgaa 1201 ttcccaggat taatggcagc atctgttgta acaaatcttc gaaatctaca atatgtagat 1261 gatttcttcc tttcagcact aacgaccccg atagtgaccg ttgcttgcaa tccaaaagga 1321 tttccaatag caatagccca gtcacctatc tgcagtcgat cagaattccc aaaagtcaaa 1381 aatggtaatt tctctgctgt aattttgatc acagcaagat ctgtttttgg atctaacccc 1441 acgatcttag gctgtgtatt tttgtccatc gtggagagta acatgaattt ttcctgcatc 1501 ctcgactaca tgatggttag taacaacata accattcgaa ttcagaaaca tagaacccag 1561 ttcctcttac agcatcacgc cgctgctgcg gacgctgctg ctctctatcg aaggcaaccc 1621 aaaaaatcga ttaaaaaatt cgtcattaaa ataatcaaaa acaaaagggt tctcttgaag 1681 cctcttttgt ttcctggaga agcaatagcc tggttccctg ttttaggaaa attttcatat 1741 atatcaactc caggacgttg ccttagacgc gcgacccgag taaaacctcg ggatacttct 1801 ttaggagatc ttcttgtgaa acctcttgat ctccgtgagg atactgcaag acaaatatca 1861 gccattaaga atctttcttt gacgcactat agcctagcat tggcgaagag aaaacgtgat 1921 gtcgatagca acacacataa taataatctt ttcaatcatc ttttccttga taagcgatct 1981 gcgtctagcc cggtttttca tttatgcacc ataacaagca gatatgcagc atacaaaatc 2041 taatgatgca aatcaaggag actactctga tgattctcca atctaaaaaa ctaacgtggt 2101 tttagaacgg atgcaaccgg cctctccaat cagtgcagga gattctacaa cggtaacccc 2161 tgcctgtctc aaagcttctt gtttgctaaa agcatcccca cttttccctg aaataatagc 2221 tcctgcatgt cccatacgtt tccctttggg gagccgtagc tcctgcaata aatgcaatca 2281 caggcttact actatgttga cgtatccaat ctgcagcttc ctcttcagcg cttccaccaa 2341 tctccaatca taagaacagc ttctgtttgg ctatcctttt caaactcttg gagaggcatc 2401 gataaaagat gtgccacttt aaaggatcc // LOCUS AFAAZU 810 bp ds-DNA BCT 15-AUG-1990 DEFINITION A.denitrificans azurin (azu) gene, complete cds. ACCESSION M30388 KEYWORDS azurin. SOURCE A.denitrificans (strain NCTC8582) DNA. ORGANISM Alcaligenes denitrificans Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Aerobic rods and cocci; Alcaligenaceae. REFERENCE 1 (bases 1 to 810) AUTHORS Hoitink,C.W.G., Woudt,L.P., Turenhout,J.C.M., van De Kamp,M. and Canters,G.W. TITLE Isolation and sequencing of the Alcaligenes denitrificans azurin-encoding gene: Comparison with the genes encoding blue copper proteins from Pseudomonas aeruginosa and Alcaligenes denitrificans JOURNAL Gene 90, 15-20 (1990) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by G.W.Canters, 01-DEC-1989, for release after publication. FEATURES from to/span description pept 307 756 azurin (azu) precursor sigp 307 366 azurin signal peptide matp 367 753 azurin site 190 203 fnr-box site 251 267 ntrA-box binding 296 299 ribosome binding site signal 778 800 terminator BASE COUNT 160 a 238 c 245 g 167 t ORIGIN 1 cccgccgctg tgctgccttg catgctcgaa ctctacttgt ttgcaattgt ttgcaggcat 61 cctacgaaga tggaagaccc ttcgtattgc ggtttgtcaa tgggcacggt ttcggtgcgc 121 cggatgggcc aataccccta tgcggcatgg ggatttcccc tgtttttggg catctgaacg 181 gggtgggatt gatgtccgtc aatagcgcgc ttttttcgcc gtcttagact tgtgcgtggc 241 ggcagcgacg caggcatgtg cctggcgcga gtcgaagaat ggccgccctg tttacggaga 301 gtctccatgc tggcaaaagc caccctagct atcgttctgt ccgcagccag cctgcccgtg 361 ctggctgctc aatgcgaagc aaccatcgaa agcaacgacg ccatgcagta caacctgaag 421 gaaatggtcg ttgacaaaag ctgcaagcag ttcacggtgc acctcaagca cgtcggcaag 481 atggccaagg tcgccatggg ccacaactgg gtgctgacca aggaagccga caagcagggc 541 gtcgccactg acggcatgaa cgccggcctg gcgcaggact acgtgaaggc gggcgatacc 601 cgtgtcatcg cgcacaccaa ggtcatcggc ggcggcgaat cggattcggt aacgttcgac 661 gtgtccaagc tgaccccggg cgaagcctat gcctacttct gctcgttccc cggccactgg 721 gccatgatga agggcacgct caagctgagc aactgacccc gccctagcgc gcagataccg 781 gcccagggcc ggtttttttt gtcttggggc // LOCUS PSEAZU 1287 bp ds-DNA BCT 15-AUG-1990 DEFINITION P.aeruginosa azurin (azu) gene, complete cds. ACCESSION M30389 KEYWORDS azurin. SOURCE P.aeruginosa (strain CIT135) DNA. ORGANISM Pseudomonas aeruginosa Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Pseudomonadaceae. REFERENCE 1 (bases 1 to 1287) AUTHORS Hoitink,C.W.G., Woudt,L.P., Turenhout,J.C.M., Van De Kamp,M. and Canters,G.W. TITLE Isolation and sequencing of the Alcaligenes denitrificans azurin-encoding gene: Comparison with the genes encoding blue copper proteins from Pseudomonas aeruginosa and Alcaligenes denitrificans JOURNAL Gene 90, 15-20 (1990) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by G.W.Canters, 01-DEC-1989, for release after publication. FEATURES from to/span description pept 213 < 1 (c) ORF1 pept 489 935 azurin (azu) precursor sigp 489 548 azurin signal peptide matp 549 932 azurin pept > 1287 985 (c) ORF2 (AA at 1287) site 318 333 ntrA-box site 403 416 fnr-box signal 958 985 terminator (bidirectional azu and ORF2) binding 476 481 ribosome binding site binding 224 220 (c) ribosome binding site (ORF1) BASE COUNT 229 a 423 c 428 g 207 t ORIGIN 1 ctgcaggctc tgcgggatga tcccgatcac ttcgctgccg gcggccaatg cggcgtccgc 61 cacggtgccc atcagaccga ccgcgccgcc accgtagacc agggtcaggc cgcgctcggc 121 caggtgccgg ccgagggcca cggcggcttc ctggtagacc ggggaagcgc cggggctggc 181 gccacagaat acgcagacgg aacgcaaggt catgatcgac tcctgtcggg ggtggaaaaa 241 ggcgcacagg gtagcggctg ggagcgcttc gaccaagccg tgcgaagcgt tgccggacgt 301 tgcgtcgcag gcgcgaagcg gcacatctgt gctaaaacag gagttccccg tagtaaacgc 361 cgggcagatc ccgctcgatg ccccgccacg tccggttcgg gtttgacctg aatcagtgga 421 actcggtgcc cgatcgggca gtctgctctt tcaggattca tcgcccaacc tgcctaggag 481 gctgctccat gctacgtaaa ctcgctgccg tatccctgct gtccctgctc agtgcgccgc 541 tgctggctgc cgagtgctcg gtggacatcc agggtaacga ccagatgcag ttcaacacca 601 atgccatcac cgtcgacaag agctgcaagc agttcaccgt caacctgtcc caccccggca 661 acctgccgaa gaacgtcatg ggccacaact gggtactgag caccgccgcc gacatgcagg 721 gcgtggtcac cgacggcatg gcttccggcc tggacaagga ttacctgaag cccgacgaca 781 gccgcgtcat cgcccacacc aagctgatcg gctcgggcga gaaggactcg gtgaccttcg 841 acgtctccaa gctgaaggaa ggcgagcagt acatgttctt ctgcaccttc ccgggccact 901 ccgcgctgat gaagggcacc ctgaccctga agtgatgcgc gagcgatccg ctgcatgaaa 961 aagcccggcc gctgccgggc tttttcatgg gcgcgcgccg ggctcagcgc gcgtagctgc 1021 cgccatcgcc tcgccggcca gttggtgcac gcgccgggtc ggatgccact cgtcccagaa 1081 gtagtactgg tccgggttgg cgcaggccgg gcggacgctg ggctgggtcg gctggcaggg 1141 cgcgtccagc tccaccaggc catagcgcgc cgggttgcgc cgcaagtggc ggctgaaggt 1201 gagatggtcg aaccagctca gctccaggcc gcgggtcttg cgcagggcgg cgagctggat 1261 cggcaggctg gcgttgactg cctgcag // LOCUS MZEADH1CM 6167 bp ds-DNA PLN 15-AUG-1990 DEFINITION Z.mays alcohol dehydrogenase (ADH-1 C-m allele) gene, complete cds. ACCESSION M32984 KEYWORDS alcohol dehydrogenase. SOURCE Z.mays DNA. ORGANISM Zea mays Eukaryota; Plantae; Embryobionta; Magnoliophyta; Liliopsida; Commelinidae; Cyperales; Poaceae. REFERENCE 1 (bases 1 to 6167) AUTHORS Osterman,J.C. and Dennis,E.S. TITLE Molecular analysis of the ADH1-Cm allele of maize JOURNAL Plant Mol. Biol. 13, 203-212 (1989) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by J.C.Osterman, 18-MAR-1990. FEATURES from to/span description pept 1217 1250 alcohol dehydrogenase, exon 1 (ADH-1) (EC 1.1.1.1) 1785 1921 alcohol dehydrogenase, exon 2 2019 2065 alcohol dehydrogenase, exon 3 2482 2807 alcohol dehydrogenase, exon 4 2894 2976 alcohol dehydrogenase, exon 5 3070 3145 alcohol dehydrogenase, exon 6 3487 3548 alcohol dehydrogenase, exon 7 3636 3731 alcohol dehydrogenase, exon 8 3823 3984 alcohol dehydrogenase, exon 9 4085 4201 alcohol dehydrogenase, exon 10 pre-msg 1110 > 4201 ADH-1 mRNA and introns IVS 1251 1784 ADH-1 intron A IVS 1922 2018 ADH-1 intron B IVS 2066 2481 ADH-1 intron C IVS 2808 2893 ADH-1 intron D IVS 2977 3069 ADH-1 intron E IVS 3146 3486 ADH-1 intron F IVS 3549 3635 ADH-1 intron G IVS 3732 3822 ADH-1 intron H IVS 3985 4084 ADH-1 intron I BASE COUNT 1574 a 1335 c 1378 g 1880 t ORIGIN 1 bp upstream of BamHI site. 1 ggatccaata ggctagtcac ttttacttta gcttctgaga tccaaacagt cacttaggac 61 atgtttggaa gcacaccagt ttttaaaaaa ctttttccta tcctcaattt ctagaaaatg 121 gtttatgaaa aaaaatttgg gtgggatgtt tgtaacccag tttctagttt tttttataaa 181 gagagtagct tcttggtttt agttagagga gagtagcttc ttggttttta agaaactggg 241 aatccagttt ctataaactg gaacataaat aagtatattt ggaatcactt tagtttgtac 301 aaaccgattt cttagaaatt ggatgcttat aaataggccc tcaatgtcct tgttgggttt 361 atgaaattta catctattac cacattttta aaaatagagg aagagtatgc tagtagttat 421 gtataaaaaa actagaaact gtttttttta aaaaaaaact gagttccagt ttcctttatc 481 taattctttt ataagctatt ttttagaaaa ggatagaaac tgtttttaaa aaaactggtg 541 tgcttctgtt taactcttcg taagaacagt gttacgtccc gtgtctatat tttgcttttg 601 ttgaaagcca tcgtaagtac atgcttgcgt gggtgaaatg ccatcgcaat gctacaactt 661 ttcggctccc tcctgcttcg gtgcttccac atgccctgca cggcgtctag aaaccctaat 721 gattcagcag cacacctgtc cgcctagccg cctacgcgta cacagaaaac aaattttttg 781 tccacacacg cgcgcgctcc gagccgcaga tccgagctag cgcggcgcat ccgacggcca 841 cgacagcgcg gtgccgtcct ccgccgccac cgcttggcgc ttgtccgcac cccccaccag 901 tccaccacct cccccacgag cgaaaaccac ggtccacgga ccacggctat gttccactcc 961 aggtggaggc tgcagccccg gtttcgcaag ccgcgccgtg gtttgcttgc ccacaggcgg 1021 ccaaaccgca ccctccttcc cgtcgtttcc catctcttcc tcctttagag ctaccactat 1081 ataaatcagg gctcattttc tcgctcctca caggctcgtc tcgctttgga tcgattggtt 1141 tcgtaagtgg tgagggactg agggtctcgg agtggattga tttgggattc tgttcgaaga 1201 tttgcggagg ggggcaatgg cgaccgcggg gaaggtgatc aagtgcaaag gtccgccttg 1261 tttctcctct gtctcttgat ctgactaatc ttggtttatg attcgttgag taattttggg 1321 gaaagcttcg tccacagttt tttttttcga tgaacagtgc cgcagtggcg ctgatcttgt 1381 atgctatcct gcaatcgtgg tgaacttatt tcttttatat ccttcactcc catgaaaggc 1441 tagtaatctt tctcgatgta acatcgtcca gcactgctat taccgtgtgg tccatccgac 1501 agtctggctg aacacatcat acgatattga gcaaagatct atcttccctg ttctttaatg 1561 aaagacgtca ttttcatcag tatgatctaa gaatgttgca acttgcaagg aggcgtttct 1621 ttctttgaat ttaactaact cgttgagtgg ccctgtttct cggacgtaag gcctttgctg 1681 ctccacacat gtccattcga attttaccgt gtttagcaag ggcgaaaagt ttgcatcttg 1741 atgatttagc ttgactatgc gattgctttc ctggacccgt gcagctgcgg tggcatggga 1801 ggccggcaag ccactgtcga tcgaggaggt ggaggtagcg cctccgcagg ccatggaggt 1861 gcgcgtcaag atcctcttca cctcgctctg ccacaccgac gtcgacttct gggaggccaa 1921 ggtatctaat cagccatccc atttgtgatc tttgtcagta gatatgatac aacaactcgc 1981 ggttgacttg cgccttcttg gcggcttatc tgtcttaggg gcagactccc gtgttccctc 2041 ggatctttgg ccatgaggct ggagggtatg ttctattccc cgatttactt cactatgttg 2101 ctgactatat atgtgctgtg tttatatttt gcatatttat tatgtttttg cgtctgaatt 2161 tatgggtatg gttggtggtc tttgtttact gttttactag atgcatgtgg aagagtcaga 2221 agaaatagtt tttgtttgaa atggtatacc aacggttgga tattatctgt gtggacatca 2281 gatgttctgg gttactggca gtggactttt gacagattta tctatgattc tttcattagc 2341 agtttcttcg gctaatttac tcttactatt ttttcagtat acaaaggcac gtacagcttg 2401 gattgtgtag aatcatttta gatctgttat ctgaggcaaa tttgcttatt ctagccgcct 2461 gaaaattctt gattttgcca gtatcataga gagtgttgga gagggtgtga ctgacgtagc 2521 tccgggcgac catgtccttc ctgtgttcac tggggagtgc aaggagtgcg ctcactgcaa 2581 gtcggcagag agcaacatgt gtgatctgct caggatcaac accgaccgcg gtgtgatgat 2641 tgccgatggc aagtcgcggt tttcaatcaa tgggaagcct atctaccact ttgttgggac 2701 ttccaccttc agcgagtaca ctgtcatgca tgtcggttgt gttgcaaaga tcaatcctca 2761 ggctcccctt gataaagttt gcgtccttag ctgtggtatt tctaccggta agttcattta 2821 ctacattttg gtgtggatgc tggagtacat ttatcttgag atgctgagtt acacaaattc 2881 tttatctgtt taggtcttgg tgcatcaatt aatgttgcaa aacctccgaa gggttcgaca 2941 gtggctgttt tcggtttagg agccgttggt cttgccgtaa gtgttgaaac gatttgcttg 3001 ttctatgacc tttcaattgc aatgagaacg tgtgttgggt ttgcatctga ttaccctgcg 3061 catggttagg ctgcagaagg tgcaaggatt gctggagcgt caaggatcat tggtgtcgac 3121 ctgaacccca gcagattcga agaaggtaca gtacacacac atgtatatat gtatgatgta 3181 tcccttcgat cgaaggcatg ccttggtata atcactgagt agtcatttta ttactttgtt 3241 ttgacaagtc agtagttcat ccatttgtcc cattttttca gcttggaagt ttggttgcac 3301 tggccttggt ctaataactg agtagtcatt ttattacgtt gtttcgacaa gtcagtagct 3361 catccatctg tcccattttt tcagctagga agtttggttg cactggcctt ggactaataa 3421 ctgattagtc attttattac attgtttcga caagtcagta gctcatccat ctgtcccatt 3481 tttcagctag gaagttcggt tgcactgaat ttgtgaaccc aaaagaccac aacaagccgg 3541 tgcaggaggt ctgtttcttt acccaaggca acaaaaggtt atcacagctt atgctgaact 3601 tggccataac attcaataat tcctttatgg tctaggtact tgctgagatg accaacggag 3661 gggtcgaccg cagcgtggaa tgcactggca acatcaatgc tatgatccaa gctttcgaat 3721 gtgttcatga tgtaagtata tgtatacact ctcagctact ttcattctcc aggttccctt 3781 catccagaca tgcatgttct aaccgccgcc ctcgtgatcc agggctgggg tgttgccgtg 3841 ctggtgggtg tgccgcataa ggacgctgag ttcaagaccc acccgatgaa cttcctgaac 3901 gaaaggaccc tgaaggggac cttctttggc aactataagc cacgcactga tctgccaaat 3961 gtggtggagc tgtacatgaa aaaggtaaat tgcaaagtgc tgttccttcg gtttccttac 4021 cagccgagct tttgctgaaa aactgttaag aatcgttcct gcaattctgc ttggctctgc 4081 acaggagctg gaggtggaga agttcatcac gcacagcgtc ccgttcgccg agatcaacaa 4141 ggcgttcgac ctgatggcca agggggaggg catccgctgc atcatccgca tggagaacta 4201 gatttcgctg tctagtttgt gatctggctg ggcttggggt taataaagga ggcaatgcta 4261 gcctgccctt tcgatgagga ggtacataca cgctggcgat ggaccgcgct tgtgtgtcgc 4321 gttcagtttg gcttttgcca agcagtaggg tagcttcccg tgtcggtaat tatatggtat 4381 gaaccatcac cttttggcgc aatacatggt atgaacgtaa gatacaaatt ccaactacct 4441 ctagctcgct tgtgtgctat atgtatctct ctcgacggat gacacaagat cgcttctata 4501 tccgaagtga aactaaaagg agaaggaaaa gaaggtaaca gaataggaac cggtttggtg 4561 agaattggag aggattcatg aaagagaaaa tcccttttca ttaaatttta aatagcaagt 4621 gatttactct ctcatgatct cctccagttt ccatttcatc aaaacaaacc ttattcattt 4681 tcccctctaa tctctttctt gtcaccaccg gtggagcaag gtgattaaag agactaaatt 4741 attattcaat gaatagtagg ggttttagcc cctcaattcc tccaatacct ttgctcccaa 4801 ataagggggt gtttggtttc tagggactaa ttcctccaat acctttgctc ccaaataagg 4861 gggtgtttgg tttctaggga ctaatgttta gtcccatcat ttttttttct attttagtct 4921 ataaattgct aaatatagaa actaaaataa attaaaatat agttttagtt tctatatttg 4981 acaattttag aactaaaatg gaataaaatg tagggactaa aaattagtct agaaatcaaa 5041 caccccctaa atccctaaga gccgaggaag gggattaaaa aggataaaat cttctttgtg 5101 ttcaatttta aataggactc gccgtatcgg taaggccttg ttcgtttaca ttggattgca 5161 cctggaatcg ttccggctaa tcaaagttta tataaattag agaagcaatc cggatcggaa 5221 tcgttccgac ccaccaatcc gacgcaaacg aacaaggcct aaggcttcgc ggcggggctc 5281 gcagtccgga cgccggagag ggggagtgga gatggagaat gacaaggggg tgttctggaa 5341 agtttccttt ccaagagtaa gggtggttgg tttcgtacac taatttttaa gagcgtttgg 5401 ttaagaaaca gagaaaaatg gagtaactct attcttattt tttatgttta gttttcatta 5461 aaaaaggagc agaataccac ttgaagttct tatatagaaa tttatcataa atagttaaaa 5521 tgctctcact ccataaaaac aatcggatgc tagcgctctt cttcctatcc taccctctat 5581 attcatatga ctctttaacc aaacagagaa cggagcggct ccgctctatt ttactcttca 5641 accaaataaa aaaggagcaa ctctgtttgt catacgcgga atagaacgga tttatcctca 5701 aaaactagaa tggagcccct ctattttagt cgattctcca accaaacgca tagtgtctcc 5761 atttcattct attttagtct ctaaattgac aaatacataa actaaattat attttaagtt 5821 ttcgtattta atcaatccct accaaccaaa cactccctaa tttcgcatat cagccccaaa 5881 tcaagagtgg ttgacccatc gagacgttat cggcggatca aaggcatgcc ccgctaagca 5941 ataagtgtct aaactaacgt gccgtcgatc tcattaaaca gcaccacgag ctaaacagaa 6001 tgccaacctc aaaatcaaac atcacctgga tgctggatct gacatccgac ctaggtgcta 6061 ggcaacgatt gtgcgtagtg ctgaccatat ttgagatttt cactttattt attaaaaaaa 6121 agaggccagc agggtgggcc gctacccggc ctggtggccg agctaga // LOCUS CFICMCASE 1828 bp ds-DNA BCT 15-AUG-1990 DEFINITION C.uda endoglucanase gene, complete cds. ACCESSION M36503 KEYWORDS endoglucanase. SOURCE C.uda CB4 DNA. ORGANISM Cellulomonas uda Prokaryota; Bacteria; Firmicutes; Irregular asporogenous rods. REFERENCE 1 (bases 1 to 1828) AUTHORS Nakamura,K., Misawa,N. and Kitamura,K. TITLE Sequence of a cellulase gene of Cellulomonas uda CB4 JOURNAL J. Biotechnol. 4, 247-254 (1986) STANDARD simple staff_entry FEATURES from to/span description pept 105 1184 endoglucanase BASE COUNT 348 a 542 c 557 g 381 t ORIGIN 1 ctgcagagtc agggaggcag cgctcacgta atattgcagc gtgaccgcgt gttctctgtc 61 tctgacgttc agtttcttta ctaccatcca taatgagtga atttatgccc ctgcgtgctt 121 tagtggcggt gatagtgaca acggcagtaa tgctggtgcc ccgggcgtgg gcgcagacgg 181 cctgggagcg ttataaggcc cgttttatga tgccggacgc gcgtatcatt gataccgcca 241 atggcaatgt gtcgcatacg gaaggccagg gcttcgccat gctcctggcg gtggcgaata 301 acgatcgccc ggcgttcgac aagctgtggc agtggacgga cagcaccctg cgcgacaagt 361 ctaacgggct gttttactgg cgctataacc cggtggcgcc ggacccgatc gccgataaaa 421 acaacgccac cgatggcgat accctgatcg cctgggcgct gctgcgcgcg caaaagcagt 481 ggcaggacaa gcgctacgcc acggcctccg atgccatcac cgcctccctg ctgaaatata 541 cggtggtgac tttcgccggt cgccaggtga tgctcccggg cgtgaagggg tttaaccgca 601 acgaccacct gaaccttaac ccctcctatt tcatcttccc ggcctggcgg gcctttgcgg 661 agcggacgca cctgaccgcc tggcggacat tgcagagtga cgggcaggcg ctgctggggc 721 aaatgggctg ggggaaatcg catctgccca gcgactgggt ggcgctgcgg gcggatggca 781 agatgctgcc ggccaaagag tggccgccgc ggatgagttt cgatgcgatc cgtatcccgc 841 tgtatatctc gtgggtcgat ccgcacagcg ccttgctcgc accgtggaaa gcctggatgc 901 agagttaccc gcgcctgcaa actccggcgt ggatcaacgt tagcaccaac gaggtcgccc 961 cgtggaatat ggccggcggc ctgctggcgg tgcgtgattt aacgcttggc gaaccgctgg 1021 aacgccgcag attgacgaca aggatgatta ttactccgcc agcctcaagc tgctggtctg 1081 gctggcgaaa caggatcagc gctagcgctg tgatggcttt gcaggtttct cagcccgtat 1141 gcctgcgggc tgagagaaaa gagcaggaac gtctcacgat gtaaggccgc cagaataggc 1201 ggccttgtcg cttattgcgg ataaggcacc caactgccgc cattcagctg gacataaggc 1261 ttgccctgat actggataac gatggcgttg gcgttttcgg acaccgccgc gctctgcggc 1321 aggttggcga catactgctg ccagttgacg ctgtcttcgc tgaacatttt gccgtcgagg 1381 gcgcgcgcac caccagctcc gacaccgcca ggtagctgct gggctgatcg atgataattg 1441 gcgcgccttc atgtggcgcc ttcatgccga agaatttcac cgccgtcggg acgttagtga 1501 tcgacgggct cgggatatcc cgcaggccag acacctgcat cttatcgccc ttcagcgcgc 1561 cgccgtgttc cggcaccacc accaccatca ccttacgccc cgatttttcc agttcggtga 1621 agaagttatc caggtcgtca aacagcttct gcgcccgcac tttgtagtcc gcggttttgc 1681 tttgccccgg gaagtgattg ccgtcatgca gcggcagggt gttatagaac gtggcgctcc 1741 gcggattgct gctggcctct tcggtcttca gccacgggtt gagaaccgcg agatcctcat 1801 acactggcga accatcaaat gcctgcag // LOCUS HUMTAPA1 1496 bp ss-mRNA PRI 15-AUG-1990 DEFINITION Human 26-kDa cell surface protein TAPA-1 mRNA, complete cds. ACCESSION M33680 KEYWORDS 26-kDa cell surface protein TAPA-1; target of antiproliferative antibody. SOURCE Human cell line OCI-LY8, cDNA to mRNA, clones 7-3 and 8-1. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 1496) AUTHORS Oren,R., Takahashi,S., Doss,C., Levy,R. and Levy,S. TITLE TAPA-1, the target of an anti-proliferative antibody, defines a new family of transmembrane proteins JOURNAL Mol. Cell. Biol. 10, 4007-4015 (1990) STANDARD full staff_entry COMMENT Draft entry and computer readable sequence for [1] kindly submitted by S.Levy, 10-APR-1990, for release after publication. FEATURES from to/span description pept 239 949 26-kDa cell surface protein TAPA-1 signal 1455 1460 Poly-A signal BASE COUNT 257 a 504 c 413 g 322 t ORIGIN 1 ccattgtgct ggaaaggcgc gcaacggcgg cgacggcggc gaccccaccg cgcatcctgc 61 caggcctccg cgcccagccg cccacgcgcc cccgcgcccc gcgccccgac cctttcttcg 121 cgcccccgcc cctcggcccg ccaggccccc ttgccggcca cccgccaggc cccgcgccgg 181 cccgcccgcc gcccaggacc ggcccgcgcc ccgcaggccg cccgccgccc gcgccgccat 241 gggagtggag ggctgcacca agtgcatcaa gtacctgctc ttcgtcttca atttcgtctt 301 ctggctggct ggaggcgtga tcctgggtgt ggccctgtgg ctccgccatg acccgcagac 361 caccaacctc ctgtatctgg agctgggaga caagcccgcg cccaacacct tctatgtagg 421 catctacatc ctcatcgctg tgggcgctgt catgatgttc gttggcttcc tgggctgcta 481 cggggccatc caggaatccc agtgcctgct ggggacgttc ttcacctgcc tggtcatcct 541 gtttgcctgt gaggtggccg ccggcatctg gggctttgtc aacaaggacc agatcgccaa 601 ggatgtgaag cagttctatg accaggccct acagcaggcc gtggtggatg atgacgccaa 661 caacgccaag gctgtggtga agaccttcca cgagacgctt gactgctgtg gctccagcac 721 actgactgct ttgaccacct cagtgctcaa gaacaatttg tgtccctcgg gcagcaacat 781 catcagcaac ctcttcaagg aggactgcca ccagaagatc gatgacctct tctccgggaa 841 gctgtacctc atcggcattg ctgccatcgt ggtcgctgtg atcatgatct tcgagatgat 901 cctgagcatg gtgctgtgct gtggcatccg gaacagctcc gtgtactgag gccccgcagc 961 tctggccaca gggacctctg cagtgccccc taagtgaccc ggacacttcc gagggggcca 1021 tcaccgcctg tgtatataac gtttccggta ttactctgct acacgtagcc tttttacttt 1081 tggggttttg tttttgttct gaactttcct gttacctttt cagggctgat gtcacatgta 1141 ggtggcgtgt atgagtggag acgggcctgg gtcttgggga ctggagggca ggggtccttc 1201 tgcccctggg gtcccagggt gctctgcctg ctcagccagg cctctcctgg gagccactcg 1261 cccagagact cagcttggcc aacttggggg gctgtgtcca cccagcccgc ccgtcctgtg 1321 ggctgcacag ctcaccttgt tccctcctgc cccggttcga gagccgagtc tgtgggcact 1381 ctctgccttc atgcacctgt cctttctaac acgtcgcctt caactgtaat cacaacatcc 1441 tgactccgtc atttaataaa gaaggaacat caggcatgct aaaaaaaaaa aaaaaa // LOCUS DROSYNCL 3727 bp ds-DNA SYN 15-AUG-1990 DEFINITION Synthetic cloning vector encoding heat-shock protein 82/neomycin phosphotransferase fusion protein (hsp82-neo) gene, complete cds. ACCESSION M32616 KEYWORDS heat-shock protein 82; neomycin phosphotransferase. SOURCE Synthetic, D.pseudoobscura, D.melanogaster and bacterial DNA, clone pHS85. ORGANISM Cloning vector Artificial sequences; Cloning vehicles. REFERENCE 1 (bases 1 to 3727) AUTHORS Sass,H. TITLE P-transposable vectors expressing a constitutive and thermoinducible hsp82-neo fusion gene for Drosophila germline transformation tissue-culture transfection JOURNAL Gene 89, 179-186 (1990) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by H.Saas 06-MAR-1990, for release after publication. FEATURES from to/span description pept 2068 2925 heat-shock protein 82/neomcyn phosphotransferase fusion protein (hsp82-neo) IVS 1005 2067 hsp82 intron A pre-msg 6 3610 hsp82-neo fusion protein mRNA and intron site 6 868 D.pseudoobscura hsp82 gene 5' flank site 869 1004 D.pseudoobscura heat-shock protein 82, exon 1 site 2068 2126 D.pseudoobscura hsp82 truncated exon 2 site 2127 2142 coding linker site 2143 3269 neomycin phosphotransferase coding sequence site 3270 3610 non-coding 3'flank of D.melanogaster hsp82 gene with Poly-A signal site 3620 3726 multiple cloning site (MCS) BASE COUNT 950 a 866 c 882 g 1029 t ORIGIN 23 on XR. 1 ggatccgatg gatttttacc atattattat tatttctagc cacgttgcaa ctctatgtca 61 gtaccggaaa tagcagccct ggagtctctt agcctctaga aacggctaga acattctacg 121 cttgtggttg gttttcattg aaagcaggcg tcttttatat actttacggt atatagctac 181 atgtatataa tggtatactt catcaatatc atcaatctat gaattttaat ttttaagagt 241 acatatataa attaacatgg gggatatagt tctcaatacc caagtatttg aattttccat 301 ctctcatcgg gggtaattca tgaaccggtt ccagccgaaa aatgaacgaa attcatgaga 361 gattattttt tcgggattgc ttgccaatac atttcggaaa aacaaaatgt actacatttt 421 tgtcatctca gggtgctcca attaattatg aatgctacga cactacaaag cagcttggaa 481 atccgaattt taacaataat taaaggaaat agggtatagc gtatataggg tatcatagct 541 gaaacgggta taccaacaat aatgacgcag cacttacgtt tcactccgta ctcacttacg 601 atttatgctt ataatttttg ttcacctctt ttacttaaac ctcactttaa aaacaatcaa 661 ataaatggga gtatttatgt atatttctaa gattacggcg gtattgttct gctgtctgcg 721 gtcacactgg ttttcagcct cggtgcaact ctgtttcagt accggaaata gcagccctgg 781 attctcgtag cctctagaaa cgtctagaaa attctacgct tggggttggt ttgctataaa 841 agcaggcggg ccgactgttg ccggctcgag tcttgaaaaa tttttgtcca gtgaaggtgc 901 gtttgcttag agcgcagtgc aacaaagtga atttattcta cacaaatcga agtgaaaata 961 tatatatatt tttatctctg ctgttaaatt aaaacacata caaggtaagc gttaacaatg 1021 aaagtgcatt tatttaacaa aatgtaaaga tctgctgtgg tgcaatgctt gctgcgcgtc 1081 tgctgatgaa aagttcttga cccaaatgca gaaaatcaat agaatctgtg aaatcttcta 1141 taatcttaaa attagattaa agttctattt ttttgcccga gtttgtaacc acgggcgata 1201 aaaagtagct ttacgcctcg cacaccaata cacgaacaga aaaattatgc cggctgtaat 1261 atgagctcgg cgcgaaattt ctagatgacc ggttcttaga acatcaacct tgcatgtcca 1321 acaaatgctg gttaattaaa gacgtgcctt aacttaattt tcttggcaca cgtgcttatt 1381 tgaattcagt cttttgcact tgccatgcac acagccacac atatgtgaat ttgcgaattt 1441 gccactcatg catacactca tgtatgttcc atcatcgaga aaattcgaaa atcgtgaatc 1501 aaacttcggc atgaatcaaa tttcaaagag gtctttgttt ccacctggtt ctagaagttt 1561 cctttcgcgt gcttggatac ctatcttatg cataaacggt ttctgcacat gtaacttgaa 1621 cacatacaca cttgcaaaca tatgtatgta catatgcata ccctgaccac aaaattttca 1681 gcaaacttta gccgtacatc aaaccaccaa agagctgtgc tgttgtcaag gagaattttc 1741 ttccagaaag cttcaattag attgtttatc tgggggtgat gtacgcattg gacaacccta 1801 tgcgctctag aaacttccag taaatgttaa ctggatgtac aatgggtaca tccctaagcg 1861 tgcgagtgta tgcgtgttcg ctaactgtaa tgtatgtgtg ttcgtgtgcg aaagagaaaa 1921 ggatgagaag tctgccattt tgaaataaaa agattttgtg ctaggggggt ggggaaatat 1981 gattatcgaa aatgggcagt gaacaatgca gctgcatatt taatgagttg tgactaattc 2041 tcgtgtggta ttttcttgct cttccagatg cccgaagaag ctgagacttt cgcattccag 2101 gctgagattg ctcagcttat gtcgttgatc cggccaagct tggatggatt gcacgcaggt 2161 tctccggccg cttgggtgga gaggctattc ggctatgact gggcacaaca gacaatcggc 2221 tgctctgatg ccgccgtgtt ccggctgtca gcgcaggggc gcccggttct ttttgtcaag 2281 accgacctgt ccggtgccct gaatgaactg caggacgagg cagcgcggct atcgtggctg 2341 gccacgacgg gcgttccttg cgcagctgtg ctcgacgttg tcactgaagc gggaagggac 2401 tggctgctat tgggcgaagt gccggggcag gatctcctgt catctcacct tgctcctgcc 2461 gagaaagtat ccatcatggc tgatgcaatg cggcggctgc atacgcttga tccggctacc 2521 tgcccattcg accaccaagc gaaacatcgc atcgagcgag cacgtactcg gatggaagcc 2581 ggtcttgtcg atcaggatga tctggacgaa gagcatcagg ggctcgcgcc agccgaactg 2641 ttcgccaggc tcaaggcgcg catgcccgac ggcgaggatc tcgtcgtgac ccatggcgat 2701 gcctgcttgc cgaatatcat ggtggaaaat ggccgctttt ctggattcat cgactgtggc 2761 cggctgggtg tggcggaccg ctatcaggac atagcgttgg ctacccgtga tattgctgaa 2821 gagcttggcg gcgaatgggc tgaccgcttc ctcgtgcttt acggtatcgc cgctcccgat 2881 tcgcagcgca tcgccttcta tcgccttctt gacgagttct tctgagcggg actctggggt 2941 tcgaaatgac cgaccaagcg acgcccaacc tgccatcacg agatttcgat tccaccgccg 3001 ccttctatga aaggttgggc ttcggaatcg ttttccggga cgccggctgg atgatcctcc 3061 agcgcgggga tctcatgctg gagttcttcg cccaccccgg gctcgatccc ctcgcgagtt 3121 ggttcagctg ctgcctgagg ctggacgacc tcgcggagtt ctaccggcag tgcaaatccg 3181 tcggcatcca ggaaaccagc agcggctatc cgcgcatcca tgcccccgaa ctgcaggagt 3241 ggggaggcac gatggccgct ttggtcgatc gatgataaac ataaaaccaa ataaacaaca 3301 agcaaatgtg ttttaaaaat ctaacttctg agcgagtatt tattgggggg aataaacaat 3361 ctatgaatcg gattctttgc gcagcagctg ctcaatggcc tccaccgtgg acactccgtt 3421 ggttatcatt attatcttgt ttcgcgatcg agatcccttg tccaaagaaa cgtcgctctt 3481 tcgaagacct agaactttcg acagaaactt gaccagttcg gcgttagctt ctccctcgct 3541 gggcggagcg gcgatttgga cgcccactcc ttcaaagcca attcctgtga ttccgttctg 3601 cttagccccc ccggaattgg gtacccccac cgcggtggcg gccgctctag aactagtgga 3661 tcccccgggc tgcaggaatt cgatatcaag cttatcgata ccgtcgacct cgaggggggg 3721 cccggta // LOCUS ECOARGD 1221 bp ds-DNA BCT 15-AUG-1990 DEFINITION E.coli acetylornithine aminotransferase (argD) gene, complete cds. ACCESSION M32796 KEYWORDS acetylornithine aminotransferase. SOURCE E.coli (K12) DNA. ORGANISM Escherichia coli Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Facultatively anaerobic rods; Enterobacteriaceae. REFERENCE 1 (bases 1 to 1221) AUTHORS Heimberg,H., Boyen,A., Crabeel,M. and Glansdorff,N. TITLE Escherichia coli and Saccharomyces cerevisiae acetylornithine aminotransferases: Evolutionary relationship with ornithine aminotransferases JOURNAL Gene 90, 69-78 (1990) STANDARD full staff_entry COMMENT Draft entry and computer readable sequence for [1] kindly submitted by A.H.T.Boyen 13-MAR-1990, for release after publication. FEATURES from to/span description pept 1 1221 acetylornithine aminotransferase (argD) (EC 2.6.1.11) BASE COUNT 261 a 290 c 376 g 294 t ORIGIN 73 minutes. 1 atggcaattg aacaaacagc aattacacgc gcgactttcg atgaagtgat cctgccgatt 61 tatgctccgg cagagtttat tccggtaaaa ggtcagggca gccgaatctg ggatcagcaa 121 ggcaaggagt atgtcgattt cgcgggtggc attgcagtta cggcgttggg ccattgccat 181 cctgcgctgg tgaacgcgtt aaaaacccag ggcgaaactc tgtggcatat cagtaacgtt 241 ttcaccaatg aaccggcgct gcgtcttggg cgtaaactga ttgaggcaac gtttgccgaa 301 cgcgtggtgt ttatgaactc cggcacggaa gctaacgaaa ccgcctttaa actggcacgc 361 cattacgcct gtgtgcgtca tagcccgttc aaaaccaaaa ttattgcctt ccataacgct 421 tttcatggtc gctcgctgtt taccgtttcg gtgggtgggc agccaaaata ttccgacggc 481 tttgggccga aaccggcaga catcatccac gttcccttta acgatctcca tgcagtgaaa 541 gcggtgatgg atgatcacac ctgtgcggtg gtggttgagc cgatccaggg cgagggcggt 601 gtgacggcag cgacgccaga gtttttgcag ggcttgcgcg agctgtgcga tcaacatcag 661 gcattattgg tgtttgatga agtgcagtgc gggatggggc ggaccggcga tttgtttgct 721 tacatgcact acgcgttagc gccggatatt ctgacctctg cgaaagcgtt aggcggcggc 781 ttcccgatta gcgccatgct gaccacggcg gaaattgctt ctgcgtttca tcctggttct 841 cacggttcca cctacggcgg taatcctctg gcctgtgcag tagcgggggc ggcgtttgat 901 atcatcaata cccctgaagt gctggaaggc attcaggcga aacgccagcg ttttgttgac 961 catctgcaga agatcgatca gcagtacgat gtatttagcg atattcgcgg tatggggctg 1021 ttgattggcg cagagctgaa accacagtac aaaggtcggg cgcgtgattt cctgtatgcg 1081 ggcgcagagg ctggcgtaat ggtgctgaat gccggaccgg atgtgatgcg ttttgcaccg 1141 tcgctggtgg tggaagatgc ggatatcgat gaagggatgc aacgtttcgc ccacgcggtg 1201 gcgaaggtgg ttggggcgta a // LOCUS YSCARG8 1272 bp ds-DNA PLN 15-AUG-1990 DEFINITION S.cerevisiae acetylornithine aminotransferase (ARG8) gene, complete cds. ACCESSION M32795 KEYWORDS acetylornithine aminotransferase. SOURCE S.cerevisiae FL100 DNA. ORGANISM Saccharomyces cerevisiae Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes; Endomycetales; Saccharomycetaceae. REFERENCE 1 (bases 1 to 1272) AUTHORS Heimberg,H., Boyen,A., Crabeel,M. and Glansdorff,N. TITLE Escherichia coli and Saccharomyces cerevisiae acetylornithine aminotransferases: Evolutionary relationship with ornithine aminotransferases JOURNAL Gene 90, 69-78 (1990) STANDARD full staff_entry COMMENT Draft entry and computer readable sequence for [1] kindly submitted by A.H.T.Boyen 13-MAR-1990, for release after publication. FEATURES from to/span description pept 1 1272 acetylornithine aminotransferase (ARG8) (EC 2.6.1.11) BASE COUNT 404 a 230 c 283 g 355 t ORIGIN 1 atgtttaaaa gatatttatc cagtacgtca tcaagaagat ttacaagcat tttagaggaa 61 aaggcctttc aagtgaccac ttactctaga cctgaagatc tatgtataac tagaggtaaa 121 aatgcaaagc tgtatgatga cgtgaatggt aaagaatata tcgatttcac cgcaggtatt 181 gcggtgaccg cattaggcca tgcaaatcct aaagtggcag aaattctgca ccatcaggct 241 aacaaactgg ttcattcctc caacctttac ttcactaagg aatgtttgga tttaagtgaa 301 aagattgttg aaaagaccaa gcaattcggt ggtcaacacg acgcctcaag agtattttta 361 tgtaattctg gtacggaagc aaatgaagct gctttgaagt ttgcaaagaa acatggtata 421 atgaaaaatc ctagcaagca aggcattgtt gcatttgaga actcttttca tggccgtact 481 atgggcgctt tatctgtcac ttggaatagt aaatatagaa ctccttttgg ggatttggtt 541 ccccatgtct cattcttaaa tttgaatgac gaaatgacca aactacaaag ttatatcgag 601 accaaaaagg acgagattgc tggtttaatt gtcgagccca tacaaggtga aggtggggtt 661 tttcccgtag aagttgaaaa gctaaccgga ttgaagaaaa tatgtcaaga taatgatgtg 721 attgtcattc atgatgaaat tcaatgcggt ttgggccgtt caggtaaact atgggctcat 781 gcttatttac caagtgaggc tcatccggat atttttacat ctgccaaagc attgggaaat 841 ggcttcccca tcgctgccac catcgtcaat gaaaaagtta ataatgcttt gagagttggt 901 gaccacggca ccacgtatgg tggtaatccg ctggcctgtt ctgtaagcaa ctatgttttg 961 gataccatag cagacgaagc ttttttgaaa caagtctcta agaagagtga tatcttacaa 1021 aagcgcttgc gcgaaattca agccaaatat ccaaatcaaa taaagactat cagaggaaaa 1081 ggtttgatgc ttggtgctga gttcgtcgaa ccacccaccg aggtcatcaa aaaggccaga 1141 gaattgggac ttttgatcat taccgctggt aagagtaccg ttagatttgt tcccgcatta 1201 acgattgaag acgaactaat cgaagaaggg atggatgctt ttgaaaaggc tattgaagcg 1261 gtttacgctt aa // LOCUS MZEMTMINI 1445 bp ds-DNA ORG 15-AUG-1990 DEFINITION Maize mitochondrion 1.4 kb minicircle DNA open reading frame. ACCESSION M36398 KEYWORDS . SOURCE Maize mitochondrion 1.4 kb minicircle DNA. ORGANISM Mitochondrion Zea mays Eukaryota; Plantae; Embryobionta; Magnoliophyta; Liliopsida; Commelinidae; Cyperales; Poaceae; Zea mexicana. REFERENCE 1 (bases 1 to 1445) AUTHORS Smith,A.G. and Pring,D.R. TITLE Nucleotide sequence and molecular characterization of a maize mitochondrial plasmid-like DNA JOURNAL Curr. Genet. 12, 617-623 (1987) STANDARD simple staff_entry FEATURES from to/span description pept 120 353 ORF 1 pept 1240 1356 ORF 2 pept 1245 1403 ORF 3 BASE COUNT 375 a 327 c 320 g 423 t ORIGIN 1 gaattccttc ctttggtcgg actactcttt ttaggttatt gccttcggtc aaccctaaat 61 aagttgattg tcaaattgcg ctgtaactgc attcagttga atatgcggat attttatcaa 121 tgaatctcga tatcctgttg ataaagattg gatttcttgc gattctgatc gttttatcaa 181 tccaaatcat cgatgaatat ttccataaag tgatctgtga tcctttagtc tcaatatcag 241 ttgtttcctg ccgggataac ttgggttatg ctagccacct acttctacaa acaggtgaga 301 tccacctggg tgggttcgaa tcccatctgc tagatgcgtg gtcatggaat tgaaacctct 361 atggctggcc caagggaacc ggtcttgtcg attgacctag cttaggaaga gcccagtgaa 421 cctatccaca agtcaacccc cagggataat ggaaaacctc attcgcccat tggcaaacac 481 ttaaatatga ggacattcct ctggcaagac aggttagaga cttgagagac taaagacaag 541 aaggcacagg ttgtagtttt cttccaaggc caaaagcccc gcatggtgga agaagctact 601 ggtaagtccg agggggggct taactgcgat agttgaccga cgcgacgcta taccggaaag 661 gccttcgggg tgttgaaagt atggaacttt tattctcgca tagcttggga aagggtatcc 721 ggtgaaactc cccttaaaag ggtttttccc ccgtaccccc ttttcccaaa aaatttttta 781 aaaaaagtgg atcagtgaac ctatctttat ctgattaaat cagtggttag gttcactact 841 atttatagat aacaacccta gccttggggg gacaccccct ccccccaatc ccccctgtct 901 ggttttgttt taaaccaagt ttgcagggcg agcttgtttt gttatttata attagttatt 961 tcatgtttga tccgagcttc gggataggga acctctcttg tcagaaaggc ttccctctcc 1021 cttggtctct tgaaacagga cttttattca ctcagctatg cttcccggaa atccggatta 1081 aagaataaag acttctatac ctttccggga agcagagcag agggaaacgg agccctcgcc 1141 ccggagggga atcaattctc tggtttatcg ttcttatgct gttgcggtta taacgatagg 1201 aattactaga taacatcctc taggaattac tagataacaa tggaatggtt gagcctacta 1261 tctcaagtgt tggaaggctc aacctacttg cttgtccctc tccactatcg ttccggtctt 1321 accttccctc gagtccgatc tcgggaaggc gcttaggcag gggccccaag actaagcagg 1381 taatacaata cctatattta tagagggctt ttacctcgat aaatgagggc gcttcctata 1441 atgtg // LOCUS NGOTEM1A 1199 bp ds-DNA BCT 15-AUG-1990 DEFINITION N.gonorrhoeae plasmid pFA7 beta-lactamase (TEM-1) gene, 3' end. ACCESSION M36543 KEYWORDS beta-lactamase. SOURCE N.gonorrhoeae plasmid pFA7 DNA. ORGANISM Neisseria gonorrhoeae Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Neisseriaceae. REFERENCE 1 (bases 1 to 1199) AUTHORS Sanchez-Pescador,R., Stempien,M.S. and Urdea,M.S. TITLE Rapid chemiluminescent nucleic acid assays for detection of TEM-1 beta-lactamase-mediated penicillin resistance in Neisseria gonorrhoeae and other bacteria JOURNAL J. Clin. Microbiol. 26, 1934-1938 (1988) STANDARD simple staff_entry FEATURES from to/span description pept < 1 21 beta-lactamase (TEM-1) (AA at 1) BASE COUNT 344 a 237 c 178 g 440 t ORIGIN 1 tcactgatta agcattggta actgtcagac caagtttact catatatact ttagattgat 61 ttaaaacttc atttttaatt taaaaggatc taggtgaaga tcctttttga taatctcatg 121 accaaaatcc cttaacgtga gttttcgttc cactgagcgt cagaccccta tctataaact 181 cttggcttgg ttctaatccc tctaaacgat tattatcaat agccgctcta accgcttttt 241 ctcggcttaa tttttctgtc tctgttataa aattgcttat tcattcttgt tcttctttca 301 aaaaaaagtt aagtaaaata cctacctaaa tttttactag ttcgcaatct acgagcttat 361 aacctcgttt tttcaattca tttaaaaaat cagattttga gcctaatttg atctattgct 421 atcgttaccc gctagaaata cccagtaatt acgcaaatct tcattggtaa ctttcgtaat 481 atctgtgtaa tgatcttcga gtatttttaa gcaatctcta gcccataaac cgtactcgtg 541 attgctcatc ttagggtttt gcttatcgag tttgacgaac ttcccatact tgtttttatg 601 tggaaatact ggccgtttgc aacttcttca attttttgag ctgttcgttt tttactacca 661 atcacaaaat ttaaagagtg aatagtacgc ccacgcttga tttgttcaac ctcaacgact 721 aaatcagatt tctcgttaat ctcagttatt gcaggttcca aaacacgttg atttaatgaa 781 ttaaatctag gtattattca acctgaagcc attctttagt tttctactgt aatttcacga 841 ctaccaacag agcgatattg tgtaattagc tcataaattc gaattgaatg tacactgttg 901 aaataagcga tatgtttgag ttgatattgc gtgaattgcc ctttaagttg cgttaggtat 961 ggcataactt catcagtcat tgcaattcta aaacgcccct ctttctgaaa tatgttctag 1021 aggaaaccca acgaaattca gttacacggt ctttatcttc agttttaaca cttcggtcat 1081 aaatccgttt tatagccgcc tgaatttgct tataggcgtt atcttggctt atttctggaa 1141 actcacggac aaaatcagcc accgtaaaat caaaaatttt ttgattagat ttcggatcc // LOCUS FLANAX 1461 bp ss-RNA VRL 15-AUG-1990 DEFINITION Influenza A/Chile/1/83 (H1N1), neuraminidase (seg 6), cDNA to mRNA. ACCESSION M24783 M33023 KEYWORDS neuraminidase. SOURCE Influenza virus type A, cDNA to viral RNA. ORGANISM Influenza virus type A Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses; Orthomyxoviridae; Influenzavirus; Influenza A viruses. REFERENCE 1 (bases 1 to 1461) AUTHORS Schreier,E., Roeske,H., Driesel,G., Kuenkel,U., Petzold,D.R., Berlinghoff,R. and Michel,S. TITLE Complete nucleotide sequence of the neuraminidase gene of the human influenza virus A/Chile/1/83 (H1N1) JOURNAL Arch. Virol. 99, 271-276 (1988) STANDARD simple staff_entry FEATURES from to/span description pept 21 1433 neuraminidase BASE COUNT 466 a 263 c 343 g 388 t 1 others ORIGIN 1 agcaaaagca ggagtttaaa atgaatccaa atcagaaaat aataaccatt ggatcaatct 61 gtatgacaat cggaataatt agtctaatat tgcaaatagg aaatattatt tcaatatggg 121 ttagccactc aatccaaact ggaagtcaaa accacactgg aatatgcaac caaagaatca 181 ttacttatga aaatagcacc tgggtaaatc aaacatatgt caatattaac aacactaacg 241 ttgttgctgg aaaggacaca acttcagtga cattagccgg caattcatct ctttgtccta 301 tccgtgggtg ggctatatac agcaaagaca acagcataag aattggttcc aaaggagatg 361 tttttgtcat aagagaacct tttatatcat gttctcactt ggaatgcaga accttttttc 421 tgacccaagg tgctctatta aatgacaagc attcaaatgg gaccgttaag gacagaagcc 481 cttatagggc cttaatgagc tgtcctatag gtgaagctcc gtctccatac aattcaaggt 541 ttgaatcagt tgcttggtca gcaagcgcat gtcatgatgg catgggctgg ctaacaatcg 601 gaatttctgg tccagatgat ggagcagtgg ctgtactaaa atacaacggc ataataactg 661 aaaccataaa aagttggagg aagcgaatat taagaacaca agagtctgaa tgtgtctgtg 721 taaacggttc atgttttacc ataatgaccg atggcccgag taatggacct gcctcgtaca 781 gaatcttcaa aatcgagaag gggaagatta ctaaatcaat adagttggat gcacccaatt 841 ctcattacga ggaatgttcc tgttacccag acaccggcac agtgatgtgt gtgtgcagag 901 acaattggca tggttcgaat cgaccttggg tgtcttttaa tcaaaacctg gattatcaaa 961 taggatacat ctgcagtggg gttttcggtg acaatccgcg tcccaaagat ggaaaaggca 1021 gctgtgatcc agtaactgtt gatggagcag acggagtaaa ggggttttca tacaggtatg 1081 gtaatggtgt ttggatagga aggactaaaa gtaacagctc cagaaaggga tttgagatga 1141 tttgggatcc taatggatgg acagataccg atagtaattt cttagtgaaa caggatgtag 1201 tggcaatgac tgattggtca gggtacagcg gaagtttcgt tcaacatcct gagctaacag 1261 gattggactg tatgaggcct tgcttctggg ttgaattaat cagaggacga cctagagaaa 1321 agacaacaat ctggactagt gggagcagca tttctttttg tggcgtgaat agtgatactg 1381 caaattggtc ttggccagac ggtgccgagt tgccattcac cattgacaag tagtccgttg 1441 aaaaaactcc ttgtttctac t // LOCUS YSPURA4 1764 bp ds-DNA PLN 15-AUG-1990 DEFINITION S.pombe orotidine-5'-phosphate decarboxylase (ura4) gene. ACCESSION M36504 KEYWORDS orotidine-5'-phosphate decarboxylase. SOURCE S.pombe DNA. ORGANISM Schizosaccharomyces pombe Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes; Endomycetales; Saccharomycetaceae. REFERENCE 1 (bases 1 to 1764) AUTHORS Grimm,C., Kohli,J., Murray,J. and Maundrell,K. TITLE Genetic engineering of Schizosaccharomyces pombe: A system for gene disruption and replacement using the ura4 gene as a selectable marker JOURNAL Mol. Gen. Genet. 215, 81-86 (1988) STANDARD simple staff_entry FEATURES from to/span description pept 534 1328 orotidine-5'-phosphate decarboxylase (ura4) mRNA 484 > 1328 ura4 mRNA BASE COUNT 550 a 274 c 349 g 591 t ORIGIN 1 aagcttagct acaaatccca ctggctatat gtatgcattt gtgttaaaaa agtttgtata 61 gattatttaa tctactcagc attctttctc taaataggaa tttgttactt aatggagaaa 121 aaaatgtttc gatttaccta gtgtatttgt ttgtatactc acgtttaatt tcaaacatcc 181 attctatctt gtgtaatttt tggcatggtg aaaaagataa tcagccttat aatctttaca 241 aaagtaagaa attctgtaaa taagccttaa tgcccttgct ttaaattaaa atggttcttt 301 ttcatgataa tgtttgcact ttgtgaatat attttagata gttctgtgag gtataattaa 361 gatgttttag agacttatac aattttgtct ttataaattc ttaattgatt ttaccatccc 421 agtttaacta tgcttcgtcg gcatctctgc acatgtcgtg ttttcttacc gtattgtcct 481 accaagaacc tcttttttgc ttggatcgaa attaaaggtt taaaagcaaa gttatggatg 541 ctagagtatt tcaaagctat tcagctagag ctgaggggat gaaaaatccc attgccaagg 601 aattgttggc tttgatggaa gaaaagcaaa gcaacttgtc agtcgcggtc gatttgacga 661 agaaatccga aatcttagaa ttggtagata aaattggacc ctatgtctgt gttatcaaga 721 cacatattga cgttgtcgag gatttcgacc aggatatggt agaaaaactg gtggccttag 781 gtaaaaagca tcgttttctt atctttgagg atcgcaaatt cgcagacatt ggaaataccg 841 tcaagctaca atatgcatct ggtgtgtaca aaattgcttc ttgggctcat atcacaaatt 901 gccatacagt gccaggcgag ggtattatac aaggcctcaa agaagttggt ttacctttgg 961 gacgtggtct cttgcttttg gctgaaatgt cttccaaagg ctctttggct actggttcct 1021 acacagagaa aaccttagaa tggtttgaga agcataccga tttttgcttt ggctttatag 1081 ctggtcgtcg atttcctaac cttcaaagcg actacataac tatgtcccct ggtatcggct 1141 tggatgttaa aggagacggg ctgggacagc aatatcgtac tcctgaagaa gtgattgtaa 1201 actgcggtag cgatatcatc attgttggtc gtggagtcta tggagctggt cgtaatcctg 1261 ttgtcgaagc caagagatat agagaagctg gttggaaggc atatcagcaa agactttctc 1321 agcattaaaa aaagactaat gtaaaatttt tttggttggt tattgaaaaa gtcgatgcct 1381 tgtttgcgtt tgttttccta ggcgttttat gtcagaaggc atttagaatt agtatacaag 1441 tactctttgg taaaatttta tgtagcgact aaaatattaa ctattataga taaacacctt 1501 gggaataaaa agtaatttgc tatagtaatt tattaaacat gctcctacaa cattaccaca 1561 atcttttctc ttggattgac attgaataag aaaagagtga atttttttag acttgtaatg 1621 ataactatgt acaaagccaa tgaaagatgt atgtagatga atgtaaaata ccatgtagac 1681 aaacaagata aaacttggtt ataaacattg gtgttggaac agaataaatt agatgtcaaa 1741 aagtttcgtc aatatcacaa gctt // LOCUS BMEGDH1 2834 bp ds-DNA BCT 15-AUG-1990 DEFINITION B. megaterium glucose dehydrogenase gene and ORFs. ACCESSION D90043 KEYWORDS glucose dehydrogenase. SOURCE Bacillus megaterium (strain IAM1030) DNA. ORGANISM Bacillus megaterium Prokaryota; Bacteria; Firmicutes; Endospore-forming rods and cocci; Bacillaceae. REFERENCE 1 (bases 1 to 2834) AUTHORS Mitamura,T., Ebora,R.V., Nakai,T., Makino,Y., Negoro,S., Urabe,I. and Okada,H. TITLE Active and silent isozyme genes of glucose dehydrogenase from Bacillus megaterium IAM1030 JOURNAL J. Bacteriol. (1990) In press STANDARD full staff_entry COMMENT These data kindly submitted in computer readable form by: Toshihide Mitamura, Osaka University 2-1 Yamada-oka Suita, Osaka 565 Japan. FEATURES from to/span description pept 1964 2749 glucose dehydrogenase (EC 1.1.1.47) ORF 181 867 ORF1 ORF 1086 1946 ORF2 signal 89 94 put. -35 region for ORF1 signal 116 121 put. -10 region for ORF1 signal 167 172 ORF2 ribosome binding site signal 893 928 termination signal signal 972 977 put. -35 region for ORF2 signal 997 1003 put. -10 region for ORF2 signal 1070 1076 ORF2 ribosome binding site signal 1949 1954 glucose dehydrogenase ribosome binding site signal 2751 2783 termination signal for glucose dehydrogenase BASE COUNT 896 a 442 c 644 g 852 t ORIGIN 1 gatcaggtag cgagaatctt tgatgaaggt ttttcaacca aagcaaagga aaatagagga 61 attggtttgc atttagtaaa acaaattgtt gaaaaaggaa acggtcagat cgaagtagag 121 tcagaattag atgttggaac gacttttatc attacattct ttttataggg ggagtgggaa 181 atgaataaaa aagcatggac cgtgcttctc atagaagacg atcctatggt acaagaagtg 241 aaccgccaat ttattgaaca agttgaaggg ttcactgtta tcgctgcagc ttcgaatggt 301 ttagaggggg tacagctcat taaacagcat cagcctgatt taacgattat tgatatgtat 361 atgcctagtc aagatggctt aaccacctta cagcaaattc gagcaaatgg ctataaaaca 421 gacgtgatag cagttacggc tgcaagtgat attgaaaccg tacgcaaagt tcttcaatat 481 ggcgctgtgg attatattat gaaaccgttc aagtttgaac gaatgaagca agcgcttgag 541 cagtatcgtt cgtttcaagt taaaataagt caaaaagaac atattactca gtctgaatta 601 gattctatgc tgtttcagca attcgaagaa aaagccgatt tgcttcccaa ggggctaaat 661 gcggttacgt taaggaggat acaacaatat ctttccgaac aaaatcatcc aatttctgct 721 gaagaagtgg cggacggcgt aggaattgcg cgtgttacag caagaaggta tttagagttt 781 ttagaacagg aaaacgagct gaaattatca gttgaatacg gcagagtggg gagacctatt 841 aatcgctata tgttaaaaat aaattaaatc atacagaaca gcttttattt ggaaaagctg 901 tttttttgcg ttagaaagta tatctttttc tctcctagaa caaattaagg tatacagttt 961 tcgctaccca aagaatattt cgtgcggtca ttaatccata aaatgtccct gaaaaggatt 1021 aatggcggaa aaattgggga atatgcactt tgacatttaa ttttaacaca ggaaggtttt 1081 gaaacatgga catattttta gccgtcttac cagccatatt ttggggaagc attgtgcttt 1141 ttaatgtgaa actaggcgga ggaccttata gtcaaacgct tggaaccaca ttgggagctt 1201 taattttctc catcggtatt tatatttttg tacaccctac gtttacacct ttaatctttg 1261 gggttggagt tgtttcgggg ctattttggg cagttggaca aagtaatcag ctgaaaagta 1321 ttgatttaat tggagtttct aaaacgatgc ctatttcaac ggggcttcag ttagtttcca 1381 cttcattatt tggagtaatt gtgtttcacg agtggtctac aaaaacttca atcattcttg 1441 gtgtgctcgc tcttatcttt attattgtag ggattgtttt agcatcactt caaagcaaag 1501 aagagaaaga ggctgaagaa ggaaaaggaa acttcaaaaa aggaattgtt attttattaa 1561 tttcaaccgt tggttattta gtttatgttg tagtagcccg tctatttaat gtagacggat 1621 ggtcggcttt attacctcaa gcaattggta tggttattgg aggagtattg ctgacgttca 1681 agcataagcc atttaataaa tatgcaattc gcaacattat cccaggtctt atttgggccg 1741 ctggtaatat gtttttattc atctcacaac ctaaagtagg cgtagcgaca agcttttcgc 1801 tttctcaaat gggaatcgtc atttcaacat taggcgggat cattatttta ggtgagaaga 1861 aaacgaagcg tcagttagtt gggattatta ttgggattat actgatcatc atagcaggag 1921 tcatgttagg gctcgccaaa agctaactag gaggttatta acaatgtata aagatttaga 1981 agggaaagta gttgtcataa caggttcatc taccggttta ggaaaagcaa tggcgattcg 2041 ttttgcgaca gaaaaagcta aagtagttgt gaattatcgt tctaaagaag aagaagctaa 2101 cagcgtttta gaagaaatta aaaaagtcgg cggagaggca attgccgtta aaggtgacgt 2161 aacagttgag tctgacgtga tcaatttagt tcaatcttct attaaagaat ttggaaagtt 2221 agacgttatg attaataacg caggaatgga aaatccggtt tcatctcatg aaatgtcttt 2281 aagcgattgg aataaagtaa ttgatacgaa cttaacggga gcatttttag gcagccgtga 2341 agcgattaaa tattttgtgg aaaatgatat taagggaaca gttattaaca tgtcgagtgt 2401 tcacgagaaa attccttggc cattatttgt tcattacgca gcaagtaaag gcggaatgaa 2461 gctcatgacc gaaacacttg cattagaata cgctccaaaa ggtattcgtg taaataacat 2521 tggaccggga gcgattaata caccgattaa cgctgagaaa tttgctgatc ctgagcagcg 2581 tgcggatgta gaaagcatga ttccaatggg atacattgga gagccggaag aaattgcagc 2641 ggttgctgca tggctagctt cttcagaggc aagttatgta acagggatta cgctctttgc 2701 tgacggcggt atgacccagt acccatcatt ccaagcagga cgcggataag aaaaaacgca 2761 ctctataata gagtgcgttt tttagtttcc ctgagctttt ttttggttct taggagctga 2821 ctggtgttga attc // LOCUS BMEGDH2 1202 bp ds-DNA BCT 15-AUG-1990 DEFINITION B. megaterium glucose dehydrogenase (EC 1.1.1.47) gene. ACCESSION D90044 KEYWORDS glucose dehydrogenase; isozyme. SOURCE Bacillus megaterium (strain IAM1030) DNA. ORGANISM Bacillus megaterium Prokaryota; Bacteria; Firmicutes; Endospore-forming rods and cocci; Bacillaceae. REFERENCE 1 (bases 1 to 1202) AUTHORS Mitamura,T., Ebora,R.V., Nakai,T., Makino,Y., Negoro,S., Urabe,I. and Okada,H. TITLE Active and silent isozyme genes of glucose dehydrogenase from Bacillus megaterium IAM1030 JOURNAL J. Bacteriol. (1990) In press STANDARD full staff_entry COMMENT These data kindly submitted in computer readable form by: Toshihide Mitamura Department of Fermentation Technology Osaka University 2-1 Yamada-oka Suita, Osaka 565 Japan Phone: 06-877-5111 x4373 Fax: 06-876-9036 FEATURES from to/span description pept 125 910 glucose dehydrogenase signal 27 32 put. -35 region signal 49 55 put. -10 region signal 111 116 SD sequence signal 928 959 termination signal BASE COUNT 427 a 190 c 268 g 317 t ORIGIN 1 tgaatgacag tttgagaaag aagagataga aaaatgttta ttcccttctt aaaacttaaa 61 ctgtatctgt aattagtaca gtataacaag acatatcagg cagaaaaagt aggaggactt 121 caagatgtat acagatttaa aagataaagt agtagttgta acaggtggat caaaagggtt 181 gggtcgcgcc atggccgttc gttttggtca agagcagtca aaagtagttg taaactaccg 241 cagcaatgaa gaggaagcgc tagaagtgaa aaaagaaatt gaagaagctg gcggtcaagc 301 tattattgtt cgaggcgacg ttacaaaaga agaagacgtt gtgaaccttg tagagacagc 361 tgttaaagaa tttggttcat tagacgttat gattaataat gcaggtgttg aaaacccggt 421 tccttctcat gaattatcat tagaaaactg gaaccaagtg attgatacaa acttaacagg 481 ggcattttta ggaagccgtg aagcaattaa atatttcgtc gaaaatgaca ttaaaggaaa 541 cgttattaac atgtccagcg ttcacgaaat gattccttgg ccattatttg ttcactatgc 601 agcaagtaaa ggcggtatga aattaatgac ggaaacattg gctcttgaat atgcgccaaa 661 aggtatccgc gtaaataaca ttggaccagg tgcaatcgat acgccaatca acgctgaaaa 721 attcgcagat ccggaacagc gtgcagacgt agaaagcatg attccaatgg gctatatcgg 781 caaaccggaa gaaatcgcat cagttgcagc attcttagca tcatcacaag caagctatgt 841 aacaggtatt acattatttg ctgatggcgg tatgacaaaa tatccttctt tccaagcggg 901 aagaggttaa taaataaagc taaaaggaaa aagacctcgg aatattccga ggtctttttt 961 gtattgtcat aaatgtacgg attatttacc gaatattgaa acttttattg aagtgttacg 1021 tatataagct aacgacgaat aaaggacgtg ttgatatgct acccgaaacg attcaacaaa 1081 aagtagatca gtatagaggt ttttatatca gcttaaaaaa tgaactcaaa tggaaagtgg 1141 cagatcccaa gcagtttatg gctatcgctt ctatgtatgc agtgaaaggt aaatcgctcg 1201 ag // LOCUS BMOPTTHP1 1023 bp ss-mRNA INV 15-AUG-1990 DEFINITION B.mori PTTH mRNA. ACCESSION D90082 KEYWORDS PTTH; preproPTTH. SOURCE B.mori (Kinshu X Showa strain) 5th-instar larva brain, cDNA to mRNA, clones P1, P2, C2, C9 and C19. ORGANISM Bombyx mori Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta; Pterygota; Neoptera; Holometabola; Lepidoptera; Ditrysia; Bombycoidea; Bombycidae. REFERENCE 1 (bases 1 to 1023) AUTHORS Kawakami,A., Kataoka,H., Oka,T., Mizoguchi,A., Kimura-Kawakami,M., Adachi,T., Iwami,M., Nagasawa,H., Suzuki,A. and Ishizaki,H. TITLE Molecular cloning of the Bombyx mori prothoracicotropic hormone JOURNAL Science 247, 1333-1335 (1990) STANDARD full staff_entry COMMENT These data kindly submitted in computer readable form by: Hironori Ishizaki Department of Biology, School of Science, Nagoya University Chikusa-ku Nagoya 464-01 Japan Phone: 052-781-5111 x2472 Fax: 052-783-0719 Telex: SCUNAG J: 447-7323 FEATURES from to/span description pept 34 708 preproPTTH matp 379 708 PTTH subunit signal 768 773 polyadenylation signal signal 826 831 polyadenylation signal signal 997 1002 polyadenylation signal signal 1004 1009 polyadenylation signal BASE COUNT 358 a 179 c 186 g 300 t ORIGIN 1 atcgttcagt tgagttatcc agcattccca atcatgatta ctcgaccgat tatattagtc 61 attttgtgtt acgctattct tatgatagtg cagtcattcg tgcctaaagc ggtagcgctg 121 aaaagaaaac cagacgtggg tggttttatg gtagaagacc aacgcacaca taaaagtcac 181 aactacatga tgaaaagagc aagaaatgac gttttgggag ataaagaaaa cgtcaggccg 241 aatccttact acacggagcc ttttgaccca gacacgagcc cagaagaatt gtccgcttta 301 atagttgatt acgccaatat gattaggaac gatgttattc tgttggataa ttccgttgaa 361 acgagaactc gaaaaagggg aaacattcaa gttgaaaacc aagctattcc ggatccacct 421 tgcacttgca aatacaagaa agaaatagaa gacttgggcg aaaactctgt tccacgcttc 481 attgaaacca gaaactgtaa taaaacacaa cagccgactt gtcgaccccc ctacatttgc 541 aaagaaagtt tatacagtat aactatttta aaaagaaggg aaactaaatc gcaggagtct 601 ctcgagatac cgaatgaatt gaaatatcga tgggtggcgg aatctcaccc cgtcagcgtg 661 gcgtgtttgt gtacaagaga ctaccaacta cgatataata ataattaatt gttttgactt 721 acgcctgatg atttgttccg aatcgaattt atttaattac tttatacaat aaagcttata 781 ttaaaaatta atgataatca attttaatta aaccaaattg aaaaaaataa aaatttcctc 841 cgattttttg tttttagtgg tggtacattc agcgaagcac tgttttgcta ggccagatgt 901 tagtagatca atacagtttt gatgcttacc ttgaaagctg tgctcttatt atactattca 961 aataagatta tatagttaaa tatattatgt atatctatta aatattaaaa gacacaattt 1021 aaa // LOCUS BMOPTTHP4 944 bp ss-mRNA INV 15-AUG-1990 DEFINITION B.mori preproPTTH mRNA. ACCESSION D90083 KEYWORDS PTTH; preproPTTH. SOURCE B.mori (Kinshu X Showa strain) 5th instar larva brain, cDNA to mRNA, clone P4 and C21. ORGANISM Bombyx mori Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta; Pterygota; Neoptera; Holometabola; Lepidoptera; Ditrysia; Bombycoidea; Bombycidae. REFERENCE 1 (bases 1 to 944) AUTHORS Kawakami,A., Kataoka,H., Oka,T., Mizoguchi,A., Kimura-Kawakami,M., Adachi,T., Iwami,M., Nagasawa,H., Suzuki,A. and Ishizaki,H. TITLE Molecular cloning of the Bombyx mori prothoracicotropic hormone JOURNAL Science 247, 1333-1335 (1990) STANDARD full staff_entry COMMENT These data kindly submitted in computer readable form by: Hironori Ishizaki Department of Biology, School of Science, Nagoya University Chikusa-ku Nagoya 464-01 Japan Phone: 052-781-5111 x2472 Fax: 052-783-0719 Telex: SCUNAG J: 447-7323 FEATURES from to/span description pept < 1 631 preproPTTH matp 302 631 PTTH subunit signal 691 696 polyadenylation signal signal 749 754 polyadenylation signal signal 918 923 polyadenylation signal signal 925 930 polyadenylation signal BASE COUNT 337 a 163 c 177 g 267 t ORIGIN 1 tcttatgata gtgcagtcat tcgtgcctaa agcggtagcg ctgaaaagaa aaccagacgt 61 gggtggtttt atggtagaag accaacgcac acataaaagt cacaactaca tgatgaaaag 121 agcaagaaat gacgttttgg gagataaaga aaacgtcagg ccgaatcctt actacacgga 181 gccttttgac ccagacacga gcccagaaga attgtccgct ttaatagttg attacgccaa 241 tatgattagg aatgatgtta ttctgttgga taattccgtt gaaacgagaa cgcgaaaaag 301 gggaaacatt caagttgaaa accaagctat tccggaccca ccttgcactt gcaaatacaa 361 gaaagaaata gaagacttgg gcgaaaactc tgttccacgc ttcattgaaa ccagaaactg 421 taataaaaca caacagccga cctgtcgacc cccctacatt tgcaaagaaa gtttatacag 481 tataactatt ttaaaaagaa gggaaactaa atcgcaggag tctctcgaga taccgaatga 541 attgaaatat cgatgggtgg cggaatctca ccccgtcagc gtggcgtgtt tgtgtaccag 601 agactaccaa ctacgatata ataataatta attgttttga ctcacgcctg atgatttgtt 661 ccgaatcgaa tttatttaat tactttatac aataaagctt atattaaaaa ttaatgataa 721 tcaattttaa ttaaaccaaa ttgaaaaaaa taaaaatttc ctcagatttt tggtttttag 781 tgctggtaca ttcagggaag tactgttttg ctaggccaga tgttagtaga tcaatagagt 841 ttttatgctt gccttgaaag ctgtgctctt attatattat gctattcaaa taagattata 901 tagttaaata tatatctatt aaatattaaa agacacaatt taaa // LOCUS HUMMTSDHB 958 bp ss-mRNA ORG 15-AUG-1990 DEFINITION Human mitochondrial succinate-ubiquinone oxidoreductase (EC 1.3.99.1) iron sulfur subunit (sdh B) mRNA. ACCESSION D90047 KEYWORDS Ip; complex II; iron sulfur subunit; sdh B; succinate-ubiquinone oxidoreductase. SOURCE Human liver mitochondrion, cDNA to mRNA. ORGANISM Mitochondrion Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae; Homo sapiens. REFERENCE 1 (bases 1 to 958) AUTHORS Kita,K., Oya,H., Gennis,R.B., Ackrell,B.A.C. and Kasahara,M. TITLE Human complex II(succinate-ubiquinone oxidoreductase): cDNA cloning of iron sulfur(Ip) subunit of liver mitochondria JOURNAL Biochem. Biophys. Res. Commun. (1990) In press STANDARD full staff_entry COMMENT These data kindly submitted in computer readable form by: Kiyoshi Kita Department of Parasitology Juntendo University 2-1-1,Hongo Bunkyo-ku, Tokyo 113 Japan Phone: 03-813-3111 x3542 Fax: 03-814-9300 FEATURES from to/span description pept < 1 789 succinate-ubiquinone oxidoreductase (sdh B) (AA at 1) site 74 95 iron-sulfur binding site I site 167 179 iron-sulfur binding site II site 224 236 iron-sulfur binding site III BASE COUNT 319 a 212 c 204 g 223 t ORIGIN 1 tggcggacgt gcctgcaggc ctcccgagga gcccagacag ctgcagccac agctccccgt 61 atcaagaaat ttgccatcta tcgatgggac ccagacaagg ctggagacaa acctcatatg 121 cagacttata aggttgacct taataaatgt ggccccatgg tattggatgc tttaatcaag 181 attaagaatg aagttgactc tactttgacc ttccgaagat catgcagaga aggcatctgt 241 ggctcttgtg caatgaacat caatggaggc aacactctag cttgcacccg aaggattgac 301 accaacctca ataaggtctc aaaaatctac cctcttccac acatgtatgt gataaaggat 361 cttgttcccg atttgagcaa cttctatgca cagtacaaat ccattgagcc ttatttgaag 421 aagaaggatg aatctcagga aggcaagcag cagtatctgc agtccataga agagcgtgag 481 aaactggacg ggctctacga gtgcattctc tgtgcctgct gtagcaccag ctgccccagc 541 tactggtgga acggagacaa atatctgggg cctgcagttc ttatgcaggc ctatcgctgg 601 atgattgact ccagagatga cttcacagag gagcgcctgg ccaagctgca ggacccattc 661 tctctatacc gctgccacac catcatgaac tgcacaagga cctgtcctaa gggtctgaat 721 ccagggaaag ctattgcaga gatcaagaaa atgatggcaa cctataagga gaagaaagct 781 tcagtttaac tgtttccatg ctaaacatga tttataacca gctcagagct gaacataatt 841 tatatctaat ttgagttcct ttaaagatct tggttttcca tgaatacagc atgtataata 901 aaaattttaa gaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaa // LOCUS HUMNCAW 2287 bp ss-mRNA PRI 15-AUG-1990 DEFINITION Human nonspecific cross-reacting antigen (NCA-W272) mRNA. ACCESSION D90064 KEYWORDS CEA; CEA gene family; PI-anchored membrane protein. SOURCE Human white blood cells, cDNA to mRNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 2287) AUTHORS Arakawa,F., Kuroki,M., Misumi,Y., Oikawa,S., Nakazato,H. and Matsuoka,Y. TITLE Characterization of a cDNA clone encoding a new species of the nonspecific cross-reacting antigen (NCA), a member of the CEA gene family JOURNAL Biochem. Biophys. Res. Commun. 166, 1063-1071 (1990) STANDARD full staff_entry COMMENT These data kindly submitted in computer readable form by: Fumiko Arakawa First Department of Biochemistry School of Medicine Fukuoka University 7-45-1 Nanakuma Jonan-ku Fukuoka 814-01 Japan Phone: 092-801-1011 x2892 Fax: 092-801-3600 FEATURES from to/span description ORF 87 1136 nonspecific cross-reacting antigen ORF BASE COUNT 618 a 593 c 453 g 623 t ORIGIN 1 ggacagcaca gctgacagcc gtgctcagaa agtttctgga tcccaggctc atctccacag 61 aggagaacac gcaggcagca gagaccatgg ggcccatctc agccccttcc tgcagatggc 121 gcatcccctg gcaggggctc ctgctcacag cctcactttt caccttctgg aacccgccca 181 ccactgctca gctcactatt gaagctgtgc catccaatgc tgcagagggg aaggaggttc 241 ttctacttgt ccacaatctg ccccaggacc ctcgtggcta caactggtac aaaggggaaa 301 cagtggatgc caaccgtcga attataggat atgtaatatc aaatcaacag attaccccag 361 ggcctgcata cagcaatcga gagacaatat accccaatgc atccctgctg atgcggaacg 421 tcaccagaaa tgacacagga tcctacaccc tacaagtcat aaagctaaat cttatgagtg 481 aagaagtaac tggccagttc agcgtacatc cggagactcc caagccctcc atctccagca 541 acaactccaa ccccgtggag gacaaggatg ctgtggcctt cacctgtgaa cctgagactc 601 agaacacaac ctacctgtgg tgggtaaatg gtcagagtct cccggtcagt cccaggctgc 661 agctgtccaa tggcaacagg accctcactc tactcagtgt cacaaggaat gacgtaggac 721 cctatgaatg tgaaatacag aacccagcga gtgcaaactt cagtgaccca gtcaccctga 781 atgtcctcta tggcccagat gcccccacca tttccccttc agacacctat taccatgcag 841 gggtaaatct caacctctcc tgccatgcgg cctctaatcc accctcacag tattcttggt 901 ctgtcaatgg cacattccag caatacacac aaaagctctt tatccccaac atcactacaa 961 agaacagcgg atcctatgcc tgccacacca ctaactcagc cactggccgc aacaggacca 1021 cagtcaggat gatcacagtc tctgatgctg tagtacaagg aagttctcct ggcctctcag 1081 ctagagccac tgtcagcatc atgattggag tactggccag ggtggctctg atatagtagc 1141 tctggtgtag tttctgcatt tcaagaagac tggcagacag ttgtttttat tcttcctcaa 1201 agcatttgca atcagctacc attcaaaatt gcttcttctt caagatttat ggaaaatact 1261 ctgacgagta ctcttgaaca caagttcctg ataactttaa gatcacgcca ctggactgtc 1321 tatgaacttg caaacaggct gatacctttg tgaagttgcc caccaaaaca cagaaggaaa 1381 aaaacatgaa tttcattgaa ctaaataata atgaggataa tgtttttaag attttttttt 1441 tttttttttt tgagatggaa tctcgctctg tcgcccaggc tggagtgcag tggcacgatc 1501 tcaactcact gcaacgtccg cctcctgggt tcacaccatt ctcctgcctc agcctcctga 1561 gtagctggga ctacaggcgc ctgccacaac gcccggctaa ttttttgtat ttttagtaga 1621 gacggggttt cactgtggtc tcaatctcct gacttcatgg tccgcctgcc tcagcctccc 1681 aaagttctgg gattacaggt gtgagccacc gcgcccagcc cgtttttaag attttttatt 1741 tgaaaaattg ccaattcttt aagtgttttc tttttcagat ttatgaattt ctttatcttt 1801 taagctatct ataccttact gcaatttggt aaagcagact tttgtgaaca aaaattataa 1861 catttacttt tgctccctac ctgactgcca cagaactggg caactattca tgagtattca 1921 tatgtttatg gtaattcagt tatttgcaca agttcagtga gaatctgctg tctttataat 1981 gggatatagt ttaaaacatt ggttatatta ccaaggcttt gattgggatg ttatatttga 2041 gaaaatacag agaatgatag attaacggag tgtctaatct atcgtgtcaa ccccaaattt 2101 ttacgtatga gatcctttag tccacccaat ggctgacagt aacagcatct ttaacacaac 2161 tctttgttca aatgtactat ggtctctttt agagtcagac tcctagactc acttgttctc 2221 actgtctgtt ttaatttaac ccaggcatgc aatgctagat aataaaattg ctccctattg 2281 gctgatc // LOCUS PIGDESTN 1666 bp ss-mRNA MAM 15-AUG-1990 DEFINITION Porcine destrin mRNA. ACCESSION D90053 J05290 KEYWORDS actin-binding protein; cofilin; destrin. SOURCE Pig adult brain, cDNA to mRNA, clone PD2. ORGANISM Sus scrofa Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Artiodactyla; Suiformes; Suidae. REFERENCE 1 (bases 1 to 1666) AUTHORS Moriyama,K., Nishida,E., Yonezawa,N., Sakai,H., Matsumoto,S., Iida,K. and Yahara,I. TITLE Destrin, a mammalian actin-depolymerizing protein, is closely related to cofilin: Cloning and expression of porcine brain destrin cDNA JOURNAL J. Biol. Chem. 265, 5768-5773 (1990) STANDARD full staff_entry COMMENT These data kindly submitted in computer readable form by: Kenji Moriyama Department of Biophysics and Biochemistry Faculty of Science University of Tokyo 7-3-1 Hongo Bunkyoku Tokyo 113 Japan Phone: 03-821-2111 x4408 FEATURES from to/span description pept 54 551 destrin mRNA < 1 1666 destrin mRNA signal 1645 1660 polyadenylation signal BASE COUNT 492 a 322 c 372 g 480 t ORIGIN 1 actcggctcc ggccggctcg gtctcccgcg cttctgcgac cgccgaggcg aacatggctt 61 caggagtgca agttgctgat gaagtatgtc gcatttttta tgacatgaaa gttcggaagt 121 gctccacacc agaagaaatc aagaaaagaa agaaggctgt cattttttgt ctcagtgcag 181 acaaaaagtg catcattgta gaagaaggca aagagatctt agttggagat gttggtgtaa 241 ccataaccga tcctttcaag catttcgtgg ggatgcttcc tgagaaagat tgtcgctatg 301 ctttgtatga tgcaagcttt gaaaccaagg aatccagaaa agaggagttg atgttttttc 361 tgtgggcacc agaactagca cctctgaaaa gtaaaatgat ctatgccagc tccaaggacg 421 caatcaaaaa gaaatttcaa ggcataaaac atgaatgtca agcaaatggg ccagaagacc 481 tcaatcgggc ttgtattgct gaaaagctag gtggatcctt aattgtagcc tttgaaggat 541 gccctgtgta gatgatcatt cagtgccaca gatcgaaagc ttccgtgttc aatgttatcc 601 tcttgctata taagtaaagc aaacactgag gccagggact cactgagggg agctgtcttg 661 tcatttgtta gagtaaacta actattctat gaacatgtgc acatggccct aaatcaatct 721 aaactctact ttttttgggg gtgtgtgtga aagtcttatt ggccaaaata tctattttga 781 tgagtctgct tgtagagatt tttgttaagc tcatgatttt taatcgtttc aacgtgtggt 841 tcattaaaca atgcaaggcc agatgaagag aattattgca tctttgttaa cttcagcagt 901 tactttgttt cttttgctta gagaattggt cataatcagt tatattggtc atataatttt 961 ggcccaaatt cttgagtctc tgctgagcta acctgaataa tggaaaataa ttctactcac 1021 aacaggtaac agcactaata tgctaactac agtaagatta aatcaggcca gattctacca 1081 gacgtggata ctgcctccaa aactgtgtgc acttagaacc agcgctgagc ttgcaaagca 1141 ctatttcaag cacgtagttg aaacacagca aacagctcct gcacttgaag tgagctgctt 1201 gctcactagt cagaaggctg tacagagagt gaccttgcat cttggaaatc agaacatgta 1261 ctgtcttgta ccaactaatt agagtacaaa ttagggctcc gttgtaatat gctttattag 1321 tggaaatggt aagatggtat atcaacaagc tgggtaccta tgctatcttt aatttatctc 1381 ctttggaact gtgttgcttc tggtacagta aggtgtagaa gaacattctg tttactctgg 1441 ggcctgggag aacctcttta ccttcctaga gcagtttgcc gactgtatgt gatacgggga 1501 ccagctatga cggcagcatc cacaggaagc cactgcctga tgacacttgg aagtgattgt 1561 ctttaacatc acaggcataa cactctgaac agtatagaga tgcaccaaca gttgaattta 1621 gaagtagcag tactggcttt acgtaataaa ggaaccattt taactt // LOCUS RATPMP70X 3324 bp ss-mRNA ROD 15-AUG-1990 DEFINITION Rat liver 70-kDa peroxisomal membrane protein (PMP70) mRNA. ACCESSION D90038 J05256 KEYWORDS PMP70; peroxisomal membrane protein. SOURCE Rat(Wistar) liver, cDNA to mRNA, clones lambda-cPM[36,102,156,181, 189,201]. ORGANISM Rattus norvegicus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 3324) AUTHORS Kamijo,K., Taketani,S., Yokota,S., Osumi,T. and Hashimoto,T. TITLE The 70-kDa Peroxisomal Membrane Protein Is a Member of the Mdr(P-Glycoprotein)-Related ATP-binding Protein Superfamily JOURNAL J. Biol. Chem. (1990) In press STANDARD full staff_entry COMMENT These data kindly submitted in computer readable form by: Keiju Kamijo Department of Biochemistry Shinshu University School of Medicine Matsumoto 390 Japan Phone: 263-35-4600 x5182 Fax: 263-33-6458 FEATURES from to/span description pept 36 2015 peroxisomal membrane protein (PMP70) signal 3271 3276 polyadenylation signal (put.) BASE COUNT 921 a 658 c 773 g 972 t ORIGIN 1 gaattccagt gcggctcgct cgccctgccg gtgccatggc ggccttcagc aagtacttga 61 cggcgcggaa ctcctcgctg gcgggggccg cgttcctgct gttctgcctg ctccacaagc 121 ggcgtcgcgc cctcggcctg cacggtaaga aaagtggaaa accgccatta cagaataatg 181 agaaagaagg aaagaaagag cgagctgtgg tggacaaagt gtttttatca aggctctcac 241 agatcctaaa aattatggtc cctagaacat tttgtaaaga gacagggtac ttgatactta 301 ttgctgttat gctggtatct cgaacatact gtgatgtttg gatgattcaa aatggcacac 361 tgattgaaag tggcatcatt ggtcgtagca gtaaagattt caagagatac ttattcaact 421 tcatcgctgc catgcctctt atctctctgg ttaataactt cttgaagtat gggttaaatg 481 agctcaaact gtgcttccgt gtgcggctca ctagatacct ctatgaggag tatctccaag 541 ccttcaccta ctataaaatg ggcaacctgg ataacagaat agcaaaccca gaccagctgc 601 ttacacaaga tgtagaaaag ttttgtaaca gtgtagttga tctttattcg aatcttagta 661 agccattttt agacatagtt ttgtatattt tcaagttaac aagtgcaatt ggagctcagg 721 gcccggcaag catgatggcc tacttgcttg tttctgggct attcctaact cgactcagaa 781 gacccatcgg taaaatgacg attatggagc agaagtatga aggagaatat agattcgtta 841 attcacggct tatcactaat agtgaagaaa ttgcctttta caatgggaat aaacgagaaa 901 agcagacaat ccactctgtc ttccgaaaac tggtggaaca cctacataat ttcattttct 961 tccggttttc tatgggtttc attgatagca tcattgccaa atatattgcc actgtagttg 1021 ggtacctggt tgtcagtcgc ccgttcctag acctggcgca tccgcgacac cttcacagca 1081 cccactcaga gctgctggag gattactacc aaagtggaag aatgcttttg agaatgtctc 1141 aagctttggg gcggatagtt ttggctgggc gtgaaatgac tagattggct ggttttacgg 1201 ctcggattac ggaattaatg caagtactaa aggatttaaa tcatggcaaa tatgaacgta 1261 caatggtgtc acaacaggat aagggtattg aaggagcaca agctagtccc ttgatacctg 1321 gtgctggaga aatcatcaat gcagacaaca ttataaagtt tgatcatgtt cctttagcaa 1381 caccaaatgg agatatcttg atccaagacc ttagttttga agttcgatct ggggccaacg 1441 ttctcatttg tggtccaaat ggctgtggaa agagctccct cttccgtgtt cttggtgaat 1501 tatggcctct ctttggagga catcttacta aacctgagag aggaaagtta ttttatgttc 1561 ctcagcgacc ctatatgacc ctgggaacac tgagagacca agtaatatat ccagatggaa 1621 aggaggatca gaagaagaag gggatatctg accaagtgct gaaggggtac ttggacaatg 1681 tacagttggg ccatatcctt gagcgggaag gaggctggga cagtgttcag gactggatgg 1741 atgtactcag cggaggagaa aaacaaagaa tggcgatggc aagattgttt tatcataaac 1801 cccagtttgc cattctggat gagtgcacaa gtgcagttag tgtggatgtg gaagactaca 1861 tttacagcca ctgtcggaag gttggcatca ccctcttcac tgtctcacac aggaaatccc 1921 tttggaaaca ccacgagtac tacctgcaca tggatggcag aggcaattat gaattcaaaa 1981 agatcacaga agacacagtt gagttcggat catagagacc atctggagaa cttcacactt 2041 cacaagagaa tgaatgaaca gaatgcattt gtaaacaacg tgcattgtaa aataaagtta 2101 agcttgtttt ttttaaaaaa acaaagctac aaattgacta gatataggat aattgaaaca 2161 tgttaaaaca tttaatattg tataggatat tgctaattgt gtatatgttg gtttaattat 2221 taattatgta ctaagaatgt ccttattctt gtggttaaaa aacctgcctg aattaaattg 2281 ggcttaaatc agtgtaacct gattcatggg atgtaaacca tttgaagtca gctaatttga 2341 cttttatagc tctgtctttt tctttaatga agaaccctat ttaaaactgg gtcattagct 2401 gtttattcta acaaagtagt cttgagttcc tttttgggtt tttttttttt tttttttttt 2461 tttttttttg tgccccatgg tagtgggaac caaaccaatc acaatgtttt attggaacat 2521 attccatcat cacaggatag catttattaa acagtggcgg atttctctag ctgctacatt 2581 tattctcatt cctcatacat accttgaggt gcatttgatt ccaggagagc catttgggtt 2641 ttctttagct aaataataaa tgtacccgtc tcagtctttt ggactgagtc gttctgaagg 2701 ctctcgtgtg gacagcagtg tgtgcagtct cttacagtcc gtgcctgctc cacatggtac 2761 cagtcttacc agtgcttgag agctcagaca caccctgctg catgaagttg gaggtctcgg 2821 gagggtttta gattttgtga cgggaaccgg aaaggctcgt cagagtgtgg ctgtgtcatg 2881 gtgagcacca cgtggctgta gaggcccgac atgaggtaat gcactgagca cacaacgcca 2941 ctgctgctgt ctgtggctgt gggttcttaa aagtgctgga ctttgtcatg ctcgtgggcc 3001 aatgacattt cctaggagcg gcctctgact cctgtgcagc tgcgtctgtg tcagctctgg 3061 ctccctggaa ccacgagtga ctttgcacaa aggagggctg agagcggact tgatcagtaa 3121 gtcgtcgtga atcagtttgc ttgagtgggc tcggaatggg ccttatcacg atggttttgt 3181 ttcttcgtaa ctcataatca ctggctacca ggataaccct gatgtattga ttccgtgaat 3241 acatcacatt caatcttacc atgtctcctt agcaaacgtg tgtacttatt ttctgttcag 3301 attaaaaaaa aaaaaaagga attc // LOCUS VACSANT 1525 bp ds-DNA VRL 15-AUG-1990 DEFINITION Vaccinia virus surface (S) antigen gene. ACCESSION D90076 KEYWORDS S gene; surface antigen. SOURCE Vaccinia virus DNA. ORGANISM Vaccinia virus Viridae; ds-DNA enveloped viruses; Poxvirinae; Orthopoxvirus. REFERENCE 1 (bases 1 to 1525) AUTHORS Ueda,Y., Morikawa,S. and Matsuura,Y. TITLE Identification and nucleotide sequence of the gene encoding a surface antigen induced by Vaccinia virus JOURNAL Virology 177, 588-594 (1990) STANDARD full staff_entry COMMENT These data kindly submitted in computer readable form by: Yoshiaki Ueda National Institute of Health Gakuen, Musashimurayama Tokyo 190-12 Japan Phone: 0425-61-0771 Fax: 0425-65-3315 FEATURES from to/span description pept 382 1437 surface antigen S BASE COUNT 568 a 229 c 258 g 470 t ORIGIN 1 tctagacact acactatatg cagttttaag atgccataat tcgaaaaagt taagaagata 61 cctcaacgag ttaaaaaaat ataataacga taagtccttt aaaatatatt ctaatattat 121 gaatgagaga taccttaatg tatattataa agatatgtac gtgtcaaagg tatatgataa 181 actatttcct gttttcacag ataaaaattg tctactaaca ttactacctt cagaaattat 241 atacgaaata ttatacatgc tgacaattaa cgatctttat aatatatcgt atccacctac 301 caaagtatag ttgtattttt ctcatgcgat gtgtgtaaaa aaactgatat tatataaata 361 ttttagtgcc gtataataaa gatgacgatg aaaatgatgg tacatatata tttcgtatca 421 ttattgttat tgctattcca cagttacgcc atagacatcg aaaatgaaat cacagaattc 481 ttcaataaaa tgagagatac tctaccagct aaagactcta aatggttgaa tccagcatgt 541 atgttcggag gcacaatgaa tgatatagcc gctctaggag agccattcag cgcaaagtgt 601 cctcctattg aagacagtct tttatcgcac agatataaag actatgtggt taaatgggaa 661 aggctagaaa aaaatagacg gcgacaggtt tctaataaac gtgttaaaca tggtgattta 721 tggatagcca actatacatc taaattcagt aaccgtaggt atttgtgtac cgtaactaca 781 aagaatggtg actgtgttca gggtatagtt agatctcata ttaaaaaacc tccttcatgc 841 attccaaaaa catatgaact aggtactcat gataagtatg gcatagactt atactgtgga 901 attctttacg caaaacatta taataatata acttggtata aagataataa ggaaattaat 961 atcgacgata ttaagtattc acaaacggga aagaaattaa ttattcataa tccagagtta 1021 gaagatagtg gaagatacaa ctgttacgtt cattacgacg acgttagaat caagaatgat 1081 atcgtagtat caagatgtaa aatacttacg gttataccgt cgcaagacca caggtttaaa 1141 ctaatactag atccaaaaat caacgtaacg ataggagaac ctgccaatat aacatgcact 1201 gctgtgtcaa cgtcattatt gattgacgat gtactgattg aatgggaaaa tccatccgga 1261 tggcttatag gattcgattt tgatgtatac tctgttttaa ctagtagagg cggtatcacc 1321 gaggcgacct tgtactttga aaatgttact gaagaatata taggtaatac atataaatgt 1381 cgtggacaca actattattt tgaaaaaacc cttacaacta cagtagtatt ggagtaaata 1441 cacaatgcat ttttatatac attactgaat aattattatt attatttata tcgtatttgt 1501 gctatagaat gaatgaggat acgcg // LOCUS YSCA1 881 bp ds-DNA PLN 15-AUG-1990 DEFINITION S. cerevisiae acidic ribosomal protein A1 (YSCA1). ACCESSION D90072 X13682 KEYWORDS acidic ribosomal protein; ribosomal protein. SOURCE S. cerevisiae (strain IFO-40028) DNA. ORGANISM Saccharomyces cerevisiae Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes; Endomycetales; Saccharomycetaceae. REFERENCE 1 (bases 1 to 881) AUTHORS Mitsui,K. and Tsurugi,K. TITLE Identification of A1 protein as the fourth member of 13 kDa-type acidic ribosomal protein family in yeast Saccharomyces cerevisiae JOURNAL Unpublished (1990) STANDARD full staff_entry REFERENCE 2 (bases 1 to 315; 631 to 881) AUTHORS Mitsui,K. and Tsurugi,K. TITLE Identification of A1 protein as the fourth member of 13 kDa-type acidic ribosomal protein family in yeast Saccharomyces cerevisiae JOURNAL Biochem. Biophys. Res. Commun. 161, 1001-1006 (1989) STANDARD full staff_entry REFERENCE 3 (bases 277 to 742) AUTHORS Tsurugi,K. and Mitsui,K. TITLE cDNA and deduced amino acid sequence of acidic ribosomal protein A1 from Saccharomyces cerevisiae JOURNAL Nucleic Acids Res. 16, 3574-3574 (1988) STANDARD simple automatic COMMENT These data kindly submitted in computer readable form by: Kazuhiro Mitsui Department of Biochemistry Yamanashi Medical college Tamaho, Nakakoma-gun Yamanashi 409-38 Japan Phone: 0552-73-1111 x2257 FEATURES from to/span description pept 313 633 acidic ribosomal protein A1 signal 125 135 UASrpg box1 signal 182 192 UASrpg box2 signal 716 721 poly(A) signal variant 303 303 a in [1]; g in [3] variant 684 685 tt in [1]; t in [3] variant 719 719 a in [1]; t in [3] BASE COUNT 270 a 165 c 157 g 289 t ORIGIN 1 gatcttatta aactctagta tcttgtctaa tacttcattt aaaagaagcc ttaaccctgt 61 agcctcatct atgtctgcta catatcgtga ggtacgaata tcgtaagatg ataccacgca 121 actttgtaat gatttttttt ttttcatttt ttaaagaatg cctttacatg gtattgaaaa 181 aaatatctat aactttgcga tcctccttct gttctgaata atttttagta aaagaaatca 241 aaagaataag aaatagtccg ctttgtccaa tacaacagct taaaccgatt atctctaaaa 301 taacaagaag aaatgtctac tgaatccgct ttgtcttacg ccgccttgat tttggctgac 361 tctgaaatcg aaatctcttc tgaaaagttg ttgactttga ctaacgctgc caatgtccca 421 gatgaaaata tctgggctga tatttttgct aaggctttgg acggccaaaa cttgaaggac 481 ttattggtca acttcagcgc tggtgctgct gccccagctg gtgtcgctgg tggtgtcgct 541 ggtggtgaag ccggtgaagc cgaagctgaa aaggaagaag aagaagctaa agaagaatcc 601 gatgacgaca tgggtttcgg tttatttgat tagaagtgcc gcactgttta gaagaaattg 661 catattctaa catttaaaat tttttataat ttttctatat agtcgctttt aatacaataa 721 gacagtactt tctttttgtt caataccatc tttcgcatct cttctatgct atatataatg 781 ccacgttgtg ctcgaaggaa aagcctgcaa acctgactac tactaataca ataatgttcc 841 atcatatcaa gaaaactgcg ctaacttgta aaaatactgt c // LOCUS YSCCDC23X 3107 bp ds-DNA PLN 15-AUG-1990 DEFINITION S. cerevisiae CDC23 protein gene. ACCESSION D90081 KEYWORDS CDC23 protein. SOURCE Saccharomyces cerevisiae (strain X2180-1A or X2180-1B; cell line D22) DNA, clone YX34. ORGANISM Saccharomyces cerevisiae Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes; Endomycetales; Saccharomycetaceae. REFERENCE 1 (bases 1 to 3107) AUTHORS Doi,A. and Doi,K. TITLE Cloning and nucleotide sequence of the CDC23 gene of Saccharomyces cerevisiae JOURNAL Gene (1990) In press STANDARD full staff_entry COMMENT These data kindly submitted in computer readable form by: Kenji Doi The Institute of Scientific and Industrial Research Osaka University 8-1 Mihogaoka Ibaraki Osaka 567 Japan Phone: 06-877-5111 Fax: 06-877-4977 FEATURES from to/span description ORF 765 2645 ORF for CDC23 site 153 161 calcium-binding site BASE COUNT 1010 a 595 c 629 g 873 t ORIGIN 1 tcgagaatac cctgaagttt ctcagatgga acccatttat ccatttcata cactgtcact 61 gatggatcag acacttccac ctgctttgct aaatcaacag aaagtcgctt cagtaaattt 121 atgtacctta aagtatccct attcaaatgt tcgaaagtag aatagtactc gctaatactc 181 ttaggattct gtactcccgc tgcaacgtcc cttccagttt ttgtatcctc caaaagctgt 241 gcttctcttg tttgatattt atcgtatcgc aggcggatgg aactatttat cagctccctg 301 tgtaaatcag gcaacttctt gagggattca gtaagcagat catcagatga tctagggtct 361 gccaatactg ataatatatc taaaatattt aataagtggg tttggctttc ctgcaaactt 421 tgttcctcct cgcagagaga ttcaaaatac gtacgacctt cttcctttgt catgctatga 481 acttgataac ttgagcagtg taaacctgat aaactagtcg ctgttgtttc ttactgtaag 541 atactgcact tctgcagctt cttaagtatt ctacttacca agtttctatt atttttcaat 601 gcgcgtacat aaaaagcact tcgggtaaaa caaacacttc ataatagcag accaagtact 661 gcggtactca catcaaatta agaggaagaa gggagtatta gcgagcggaa aactgaaatc 721 tggatatata ctgatcagaa tcagattgtg aagcatttag aaccatgaat gacgacagcc 781 aggataaaat aatacatgat atacgtattc agctacgaaa ggctgccaca gaattatcac 841 gatggaagct atacggctcc tcaaagtggg cagcagaggc gctagcaggt cttgcagaag 901 ctattgatgt tgatcaaaca cactctttag ccgatgaatc gccactaaga aataaacaag 961 gtgtaccgaa acagatgttt gaaataccac aaaacgggtt tggcctatca gagactgagt 1021 atgacctgta cctccttggt tctacgttgt ttgatgctaa agagtttgat cgatgcgttt 1081 tttttctaaa agatgtcact aatccatacc ttaagttctt aaaattatac agtaaatttc 1141 tatcgtggga taagaaaagc caggaaagta tggaaaatat cttaactaca gggaagttta 1201 cggacgaaat gtacagagct aacaaagatg gggatggtag tgggaatgag gatataaatc 1261 aaagtgggca ccaacgcgcc aatttaaaaa tggtcagcaa tgagcatgag tcacaatcga 1321 acatatcatc tattttgaag gaaattaaca catttctgga gtcttatgaa ataaagatag 1381 acgatgatga ggccgattta gggttagcac tgttgtatta tttacgaggg gtcatcttaa 1441 agcaagagaa gaatatttct aaggcaatgt cgtcattctt gaaatctctg agttgctact 1501 cctttaactg gtcctgctgg ctggagttaa tggactgttt acaaaaggtt gacgatgcat 1561 tgcttttaaa taattatcta tatcaaaatt tccaattcaa attttctgaa aatcttggta 1621 gtcaacgaac gatagaattt aatataatga tcaaattttt caagctaaaa gtgtttgagg 1681 agcttaatgg ccagttagag gactactttg aagatttaga gtttttgtta caagttttcc 1741 ccaatttcac ttttttaaag gcttacaatg ctactattag ttacaacaat ttggattatg 1801 ttaccgcaga aagccgattt gatgacatcg ttaaacaaga tccgtaccgt ctcaacgatt 1861 tggaaaccta ctccaatatt ctatacgtca tgcagaagaa ttcaaaatta gcctatttgg 1921 cgcaattcgt ctcccaaata gatagattta gaccggaaac atgttgtatc atagcgaact 1981 attacagtgc ccgacaggaa catgaaaaat ctatcatgta tttccgtcga gcactaactt 2041 tggataaaaa aacaacaaac gcatggactt tgatgggtca cgaatttgtt gaactaagca 2101 attcacatgc cgcaatagaa tgctatcgtc gggccgtaga tatatgccct cgagacttca 2161 aagcatggtt tggtttgggc caggcttatg ctctcctgga catgcattta tattctcttt 2221 actacttcca gaaagcttgc actttgaaac cttgggatcg tcggatttgg caagtattgg 2281 gagaatgtta tagtaagacg ggaaataagg tagaagctat aaaatgctac aaaagatcca 2341 taaaagcttc acaaacggtc gatcaaaata cttcaatata ttaccggtta gcgcaactat 2401 atgaagaact tgaagacttg caagaatgta agaagttcat gatgaaatgt gtagatgtgg 2461 aagaacttct ggaaggtata gtaacagatg aaaccgtgaa ggctaggctt tggctggcaa 2521 tatttgagat taaggcagga aactaccaat tggcttatga ttatgccatg ggggtatcta 2581 gtggaacgtc tcaagagatt gaagaggctc gtatgctggc tcgggagtgc agaaggcata 2641 tgtagtgaag tgaacataca catagctatt cgtactaaat gatatgaaat ttttataaat 2701 gccaggctat atagctattt aaagtgacca tggcagaagg atgaaccgag gtaatacggc 2761 tagtacaaaa gcaacaaagt taggaataca atttgagaaa cgaagaccat agaaaatact 2821 tgtgcgattg aacttccttc caaaaaaaaa atagcgtcaa agaaagatga gtggactacc 2881 gcccccacct cctggttttg aagaggacag cgacttagca cttccaccac caccaccacc 2941 accgcctgga tacgaaatcg aagaactgga taatccgatg gtgccatcat cggtaaatga 3001 ggatacattc cttccgcctc caccacctcc tccaagcaac ttcgaaataa acgctgaaga 3061 aattgtggac ttcacattac caccgccacc accccctcca ggtctag // LOCUS BSPRSDA 2996 bp ds-DNA BCT 15-AUG-1990 DEFINITION Bacillus sp. raw-starch-digesting amylase gene. ACCESSION D90112 KEYWORDS alpha amylase; raw-starch-digesting amylase. SOURCE Bacillus sp.(strain B1018) DNA. ORGANISM Bacillus sp. Prokaryota; Bacteria; Firmicutes; Endospore-forming rods and cocci; Bacillaceae. REFERENCE 1 (bases 1 to 2996) AUTHORS Itokor,P., Tsukagoshi,N. and Udaka,S. TITLE Nucleotide sequence of the raw-starch-digesting amylase gene from Bacillus sp. B1018 and its strong homology to the cyclodextrin glucanotransferase genes JOURNAL Biochem. Biophys. Res. Commun. 166, 630-636 (1990) STANDARD full staff_entry COMMENT These data kindly submitted in computer readable form by: Shigezo Udaka Department of Food Science and Technology, Faculty of Agriculture Nagoya University Furo-cho, Chikusa-ku Nagoya 464 Japan Phone: 052-782-5111 x6356 Fax: 052-781-4447 FEATURES from to/span description pept 313 2454 raw-starch-digesting amylase precursor (EC 3.2.1.1) sigp 313 393 raw-starch-digesting amylase signal peptide matp 394 2454 raw-starch-digesting amylase mature peptide binding 302 306 ribosome binding site signal 101 106 -35 region signal 125 130 -10 region rpt 2567 2580 inverted repeat rpt 2585 2598 inverted repeat BASE COUNT 764 a 852 c 774 g 606 t ORIGIN 10 bp upstream of RsaI site. 1 ttatttgagt acattttatg tattcccaca ttgcgcccga tatctacgct tagaaaaaaa 61 tcgtcggaaa agcgccccaa aaaattttta ttgttattta ttgacagttg tattcgcttt 121 catctacaat gatggaggaa cgcaatactc gatataattt aagggccatg cattccgtga 181 ccgcacaccc ggtatggaac aaccccggta tctcgatgga gaagccgggg ttttttgtcg 241 ccctttttta ggaggtgatc cggcgacagc ggatcaagcc tggaattcaa ataattacat 301 aggaggtata acatgaagaa atttctgaaa atgacagccg cgttttccct gggattatcc 361 ctggcgttcg ggcttttcag ccccgcccag gccgcgccgg atacctcggt atccaacaag 421 caaaatttca gcaccgacgt catctatcaa attttcaccg acaggttttc ggacggcaat 481 cccgccaaca atccgaccgg cgcggcgttt gacggaacct gcacgaacct ccggctgtat 541 tgcggcggcg actggcaggg catcatcaac aaaatcaacg acggttacct gaccgggatg 601 ggcgttaccg ccatctggat ctcccagccg gtcgaaaaca tctacagcat catcaattat 661 tccggcgtca acaacacggc ctatcacggc tactgggccc gggacttcaa gaagacgaat 721 ccggcctacg gcacgattgc ggacttccag aacctgatcg ccgccgcgca tgccaaaaac 781 atcaaagtca ttatcgactt cgccccgaac catacgtcgc ccgcctcgtc cgaccagcct 841 tcctttgcgg aaaacggccg gctgtacgat aacggcacgc tgctcggggg atacacgaac 901 gatacgcaga acctgttcca ccataacggc ggcacggact tttccacgac cgaaaacggc 961 atctacaaaa acctgtacga tctcgccgac ctgaaccata acaacagcac gtcggacgtc 1021 tacttgaagg acgcgatcaa aatgtggctg gatctcggca tcgacggcat ccgcatggat 1081 gcggtgaagc atatgccgtt cggctggcag aagagcttta tggctgccgt caacaactat 1141 aagccggtct ttaccttcgg cgaatggttc ctgggcgtaa acgaagtagg cccggaaaac 1201 cataagtttg ccaacgaatc cggcatgagc ctgcttgatt tccgttttgc ccaaaaggtg 1261 cggcaggtgt tccgggacaa caccgacaat atgtacggcc tgaaggcgat gctggagggc 1321 tccgcagccg attacgccca ggtggatgac caggtgacgt tcatcgacaa ccatgacatg 1381 gagcgtttcc acgcaagcaa tgcaaaccgc cggaagctgg agcaagcgct ggcgttcacg 1441 ctgatcctcg cgcgcgtccc cgccatttat tacggcaccg agcagtacat gtcgggtggg 1501 accgatccgg acaaccgggc gcggatccct tccttctcca cgtcgacgac cgcctatcaa 1561 gtcattcaaa agctggcgcc gctgcgcaag tccaacccgg ccatcgccta cggatcgacg 1621 caggagcgct ggatcaacaa cgacgtgctc atttatgagc gcaaattcgg cagcaacgtt 1681 gccgtcgttg ccgtcaaccg caatttgaac gcgccggctt ccatttcggg acttgtcact 1741 tccctgccgc aaggcagcta caatgacgtc cttggcggcc ttctgaacgg caacacgtta 1801 acggtaggct ccggcggagc cgcctccaat ttcacgcttg cggccggcgg cacggcggtg 1861 tggcagtaca ccgcggcaac ggcgacgccg accatcgggc atgtcgggcc gatgatggcc 1921 aagccgggcg tgacgatcac gatcgacggc cgcggcttcg gctctagcaa aggcaccgtc 1981 tacttcggca cgacggcggt gagcggcgcc aacatcacgt cttgggaaga cacgcagatc 2041 aaagtgaaaa ttccggccgt cgcaggcggc atctacaaca ttaaagtcgc aaacgccgcc 2101 ggaacggcaa gcaacgtgta cgacaacttc gaggtattgt ccggagacca ggtcagcgtc 2161 cgcttcgtgg tcaacaacgc gacaacggcc cttgggcaaa atctctacct gacgggcaat 2221 gtcagcgagc tggggaactg ggacccggca aaagcgatcg ggccgatgta caaccaggtc 2281 gtttaccaat atccgaactg gtattatgac gtcagcgttc cggccggcaa aacgatcgag 2341 ttcaagtttt tgaaaaaaca aggctccacc gtcacgtggg aaggcggcag caaccacacc 2401 ttcaccgcgc cgtccagcgg caccgcgacc attaacgtga attggcagcc ataaggcgtg 2461 agggataggc ggctggcatt cattggaaaa ggcggactat atgacgtccg ttccgtgagc 2521 aacgctcatc gctccgttca aaccgccaca aggctgatct tcagccaaaa aaagagggga 2581 cctttcccct ctttttttat ttccgttgac taacggtatt cccaaaaatt acattggggg 2641 ataagctccc tcccctctaa tagcaataac aagagcgtaa acccaaccag gtgatccata 2701 gcgtgcggtc gcctttaatc ccggtatcaa aatgtatcct accttacaaa aatgatcgga 2761 tcatacaaaa tagtgcgtac tactcaacga aatagaacct acatacagaa cgatcgatcc 2821 agatttcaac gaacggcacg gtcgtttaaa aaaatggtgt gcggggtgcg agaatatgca 2881 agaatatcaa ctgactttga aagataagcg gatcgtatgg gggaaggcga tcgaccttga 2941 gcctctcatt ggcaaatatc ctggcgactc gattagacag ggcatgaacg aagctt // LOCUS HUMALPL 3101 bp ds-DNA PRI 15-AUG-1990 DEFINITION Human alkaline phosphatase (EC 3.1.3.1) gene. ACCESSION D90054 KEYWORDS alkaline phosphatase. SOURCE Human liver DNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 3101) AUTHORS Matsuura,S., Kishi,F. and Kajii,T. TITLE Characterization of a 5'-flanking region of the human liver/bone/ kidney alkaline phosphatase gene: Two kinds of mRNA from a single gene JOURNAL Biochem. Biophys. Res. Commun. 168, 993-1000 (1990) STANDARD full staff_entry COMMENT These data kindly submitted in computer readable form by: Fumio Kishi Department of Pediatrics Yamaguchi University School of Medicine Ube, Yamaguchi 755 Japan Phone: 0836-22-2258 Fax: 0836-22-2696 FEATURES from to/span description pre-msg 2130 3101 alkaline phosphatase mRNA and intron IVS 2341 3101 alkaline phosphatase intron rpt 375 664 Alu sequence rpt 2631 2926 Alu sequence BASE COUNT 775 a 752 c 692 g 882 t ORIGIN chromosome 1; map position p34-36.1. 1 aagctttctc cagcgagtat gatggtttct gcaggttctt ggcataaagc ctttatcaga 61 ttaaggaaat tcttttcaat acctggtttg ctgagggctt ctgtcacatc gttttctgtg 121 accccattcc ctctccctag gtgagcacgt caagtttgat cagggtgtta aactgccacc 181 cctgtgccta tgattcccaa atttatactc taacccagac ttctttttca aatgccagag 241 ccaaatattc agctgcctcc ttagtgtctc cacttctaaa agacatctcc aactcaacat 301 atccaaaaac aagttcctga ttgtctccac ctcatgcctc aaaagaccac cccaaacgcc 361 gaaaggctga atgctttttt ctttttcttt tttttttttt tctgagatgg agtctcactc 421 tgttgcccag gctggactgc agtgatgcga tctcagctca ctgcaaactc tgcttcctgg 481 gttcaagtga ttctcctacc tcagcctctc aggtagctgg gactacaggt gcacaccacc 541 atgcccagct aatttttgta gagagagttt caccatgttg gccaggctgg tctcaaacac 601 ctgaccttaa gggatccacc cgcctcagcc tctcaaagtg ctgggattac aggtgtgagc 661 catcgcactt ggctcggtag tatatggctc agaaacattg ccatttacaa tagttcccca 721 aaaagcaaaa ttcttaggta taaatctgga ttcagagtcc agaatgctaa ccattacacg 781 atggaacccg taggtataaa tctaagaaaa catatccaag atctacaggc tgaagactac 841 agagtgctga taaaaccgaa gaactctgac tgaatgagtg gagagacgtg gtgtcttcat 901 gactgggcaa ctccatgtgg tatagacgta aaccctccca cattgatctg tggatttaat 961 accataccta tcaaaaacac agtggtggag gacagatcag ggatcgccag gtttagggat 1021 ggggggattg tgtaactata aagaacgcaa gagagatttt tggggtggca gagctgttct 1081 gggtcctgac ggtggcggtg gtggttacat aaatctatcc atgtgtcaaa cgtcagaaca 1141 ctcattttac acttgggggc aacagaaatc cctccctctg gagggggtga ctgatggtaa 1201 cctgattgct aattctggaa tcaggagccc tgtggtcagg tttctgctct gcaacttcct 1261 gttggtaacc ttgggcaagt ctccgtccag agccttggtt ttctcatctg taaaaggaga 1321 tgataggtcc ttttctgtcc actgcatagc tgattagtga aacatcatgg tgaaattctt 1381 tatgaactat ggagtgcagc acatagactt gctttcattt tgtcagtatc ctttatagat 1441 tgttcatgta agctcccaaa gagtagtatt tattttattg aaataaaatg cacgtagaga 1501 aaaatgtgtg tatcatacat tgacagctga acccaccgtg taaccagcac ccacccaccc 1561 agatcaatca taaaccgaac cgcaccagca ccccagcagc ccgttcccgt ttccgtaccc 1621 tccacgtgga gcctccgttc tgtctcccaa cgccctgggt tagtttttat actttctgtc 1681 atcggaatca cactgtaagt gctcttgggt ttagcttcct ttgctcaagc ttaccttgtg 1741 cgattcattc atgttgttgt gaggagctgt ggatcatcca ttctccttgc tgtctgtggt 1801 ggtttctgtg ttgtgaacac acacaatgta ttatccagcc tgccgtagat ggaggcagtt 1861 ttgaagccat tataaacagg gctgatgtgc acattctgct ggagagaaac gggtcccagg 1921 gtacaggtag gatgatcagc ttcggtagat cctgccggtt ttcccatgcg ctgtgcctgt 1981 ctgcactcca ccaacggcga gcggaccttc cggtagttaa acatcttcac gaactcttgg 2041 actttcctgc acacacagag aagataattt tggatggctc ttcccttccc cccacaacct 2101 tccttagggc actggctttc aactgatgta aatatttact atgccaagca ctaggagggc 2161 agagacaaac aagacaaagt cctcacactt agaaactccc ggtgtggcag ctgagatggc 2221 ccaggaaaga actatattac cttcaaaaag agaggtacat gcgatgtttg aggtggcatg 2281 aagctcagtg gtgttatatt ggaatgagtg agtgaccatc ctggagcctt cctgaaagag 2341 gtgacttcat ttttaagtga ttttaaataa tagtttaatg aattagtatt tcgtattcag 2401 ttaataacat ttttctgatt ttaggatttg ctatagaaat atttggaaac cgtaaagtag 2461 aacaaaaaaa aaatgtagga atcatctgaa attccaaatt ctaccactca cagttaagtg 2521 ttgttagatg ttagatgtgg gatattgcct tttaatttcc actctgcgcc gctaccccca 2581 gcccctaccc cagagccgtc acttctggca ctggagcgca gcttgcgtgg tttttttttt 2641 tttttttttt tttttgagac agagtcctgc ctgtcgccca ggctggagtg cagtggcgcg 2701 atctcggctc actgcaactc cccctcccgg gttcacgcca ttctcctgcc tcagcctcct 2761 gagtagctgg gactacaggc gcccgctacc tctcccggct aattttttgt atttttagta 2821 gagacggggt ttcactgtgt tagccaggat ggtctcgatc tcctgacttc gtgatccgcc 2881 cgcctcggcc tcccaaagtg ggcagatcac ctgaggtaga gagttcgaga ccagacctga 2941 ccaacatgga ccccatctct actaaaaata caaaattggc cagggcatgg tggcgcaagc 3001 tgctaatccc agccactcag ggaggctgag gctggaaaat tgcttgaacc cgacctgcag 3061 gcatgcaagc ttggcgtaat catggtcata gctgttttcc t // LOCUS RATCNRAA 2337 bp ss-mRNA ROD 15-AUG-1990 DEFINITION Rat calcineurin A alpha mRNA, complete cds. ACCESSION D90035 KEYWORDS calcineurin; calcineurin A alpha; calmodulin binding protein; calmodulin-dependent protein phosphatase; isoform. SOURCE Rat brain, cDNA to mRNA. ORGANISM Rattus norvegicus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 2337) AUTHORS Ito,A., Hashimoto,T., Hirai,M., Takeda,T., Shuntoh,H., Kuno,T. and Tanaka,C. TITLE The Complete Primary Structure of Calcineurin A, a Calmodulin Binding Protein Homologous with Protein Phosphatases 1 and 2A JOURNAL Biochem. Biophys. Res. Commun. 163, 1492-1497 (1989) STANDARD full staff_entry COMMENT These data kindly submitted in computer readable form by: Takayoshi Kuno Department of Pharmacology Kobe University School of Medicine 7-5-1 Kusunoki-cho, Chuo-ku Kobe 650 Japan Phone: 078-341-7451 x3273 Fax: 078-351-6531 Peptides, 78-329 and 391-414, seem to be putative catalytic domain and calmodulin binding domain, respectively. FEATURES from to/span description ORF 208 1773 calcineurin A alpha signal 1944 1950 polyadenylation signal BASE COUNT 649 a 523 c 596 g 569 t ORIGIN 1 cgggaggagg agtgaaggcg gcggcggcgg aggagggacg cgcggagccg gcagtaactt 61 tcgagccagc ccagagcccg gagctccagc cgagcggttt gcagcgcggc ggcgcggcgc 121 tgagtgtctg gcccgccggt gcggtcgggg tgtgcagtcg gacgggacca gcagcgcgtc 181 gctgtccccc cctcccggtg actggagatg tccgagccca aggcgattga tcccaagttg 241 tcgactacgg acagggtggt gaaagccgtt ccatttccgc caagtcaccg gctgacagca 301 aaggaagtgt ttgataacga tgggaagcct cgtgtggata tcttaaaagc acatctcatg 361 aaggaaggca ggctggaaga aagtgtcgcg ttgagaataa taacagaggg tgcttcgatt 421 ctccgacagg aaaaaaactt gctggatatt gatgccccag tcacagtttg cggggacatc 481 catggacaat tctttgactt gatgaagctc tttgaagtgg gaggatctcc tgccaacact 541 cgctacctct tcttagggga ctatgttgac agagggtact tcagtatcga atgtgtgctg 601 tatttgtggg ccttgaaaat tctttacccc aaaacactgt ttttacttcg tggaaaccat 661 gaatgtaggc acctaacaga gtatttcacg tttaaacaag aatgtaaaat aaagtattca 721 gaacgcgttt atgacgcctg tatggatgcc ttcgactgcc ttcccctggc tgcgctgatg 781 aaccaacaat tcctgtgtgt acacggtggt ttgtctccag agattaacac tctagatgac 841 atcagaaaat tagaccgatt caaagaacca cctgcttatg ggcctatgtg tgacatcttg 901 tggtcagacc ccctggagga ctttggaaat gagaagactc aggaacattt cactcacaac 961 acagtcaggg gttgttcgta cttctacagt tacccggctg tatgtgactt cctgcagcac 1021 aataatttgt tgtccatact ccgagcccac gaagcccagg acgcagggta ccgcatgtac 1081 aggaaaagcc aaacaactgg cttcccgtct ctaattacga tcttctcggc accaaattac 1141 ttagatgtgt acaataataa agctgcagtg ttgaagtacg agaacaacgt gatgaacatc 1201 aggcagttca actgctcccc ccatccgtac tggctcccaa atttcatgga tgttttcacc 1261 tggtcgctgc catttgttgg ggagaaagtg actgagatgc tggtaaacgt cctgaacatc 1321 tgctcagatg atgaactggg gtcagaagaa gatggatttg acggagccac ggctgcagcc 1381 cggaaggagg tcatcaggaa caagatccga gcaataggca aaatggccag agtattctca 1441 gttctcagag aagagagtga gagcgttcta actctgaagg gcctgacccc gactggcatg 1501 ctccccagcg gagtgctctc tggcgggaaa caaactctgc aaagcgctac tgttgaggcc 1561 attgaggctg atgaagccat caaaggattc tcaccacaac ataagattac cagcttcgag 1621 gaggccaagg gcttagaccg aattaacgag aggatgccgc ctcgcagaga cgccatgcct 1681 tccgacgcca accttaactc catcaacaag gctctcgcct cagagactaa cggcacagac 1741 agcaacggca gtaatagcag caatattcag tgaccacttc ctgttcactt tttttttttg 1801 agctgcaggg catgatgggt ttgctgcatc tcagcagttg gatgttcttg cctctgacgg 1861 tagcttgttt gctctggggg ggccaggaat tggattcagt ttacactatc atgaaaaaaa 1921 aaaagaggga gagagagaga gataataaaa ctatattttg gtgagggtgg tgattaaaca 1981 cctcttttgg gtatgccttt aaaaatgctt ctaggaaaaa aaaagtttta aaaagaaagc 2041 taatgctagt ctatacttca atgttagggg aatgaacacg ttttcctagc gcactgggga 2101 cttttagata ggttaatgaa aggcctttta ttctgttact ggacacgaaa actttgtcta 2161 atttcttata ctctattgta cgtttacagt cgcagcacta aaaatggatg acatcaaaca 2221 tttttaaaca gaaaaaaaag atgtacaaac taaataagga ctatttattg ataatgtttt 2281 gctactcttg tcagacaatg gctataaact gaattaggca gtcttaaaaa aaaaccg // LOCUS PHALPO 5710 bp ds-DNA PLN 15-AUG-1990 DEFINITION P.chrysosporium lignin peroxidase genes, complete cds. ACCESSION M37701 M22720 KEYWORDS lignin peroxidase. SOURCE P.chrysosporium (strain BKM-F-1767 (ATCC 24725)) DNA. ORGANISM Phanerochaete chrysosporium Eukaryota; Plantae; Thallobionta; Basidiomycotina; Hymenomycetes; Agaricales; Corticiaceae. REFERENCE 1 (bases 3402 to 5365) AUTHORS Walther,I., Kaelin,M., Reiser,J., Suter,F., Fritsche,B., Saloheimo,M., Leisola,M., Teeri,T., Knowles,J.K.C. and Fiechter,A. TITLE Molecular analysis of a Phanerochaete chrysosporium lignin peroxidase gene JOURNAL Gene 70, 127-137 (1988) STANDARD full staff_entry REFERENCE 2 (bases 1 to 3543; 5096 to 5710) AUTHORS Huoponen,K., Ollikka,P., Kaelin,M., Walther,I., Maentsaelae,P. and Reiser,J. TITLE Characterization of lignin peroxidase-encoding genes from lignin-degrading basidiomycetes JOURNAL Gene 89, 145-150 (1990) STANDARD simple staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly provided by J.Reiser, 22-FEB-1989. FEATURES from to/span description pept 652 712 lignin peroxidase lpoB, exon 1 770 923 lignin peroxidase lpoB, exon 2 976 1032 lignin peroxidase lpoB, exon 3 1087 1301 lignin peroxidase lpoB, exon 4 1354 1395 lignin peroxidase lpoB, exon 5 1474 1552 lignin peroxidase lpoB, exon 6 1603 2026 lignin peroxidase lpoB, exon 7 2079 2143 lignin peroxidase lpoB, exon 8 2197 2218 lignin peroxidase lpoB, exon 9 pept 5098 5038 (c) lignin peroxidase lpoA, exon 1 4977 4824 (c) lignin peroxidase lpoA, exon 2 4770 4714 (c) lignin peroxidase lpoA, exon 3 4659 4445 (c) lignin peroxidase lpoA, exon 4 4391 4350 (c) lignin peroxidase lpoA, exon 5 4296 4218 (c) lignin peroxidase lpoA, exon 6 4167 3744 (c) lignin peroxidase lpoA, exon 7 3689 3625 (c) lignin peroxidase lpoA, exon 8 3562 3541 (c) lignin peroxidase lpoA, exon 9 IVS 713 769 lpoB intron A IVS 924 975 lpoB intron B IVS 1033 1086 lpoB intron C IVS 1302 1353 lpoB intron D IVS 1396 1473 lpoB intron E IVS 1553 1602 lpoB intron F IVS 2027 2078 lpoB intron G IVS 2144 2196 lpoB intron H IVS 2219 769 lpoB intron I IVS 5037 4978 (c) lpoA intron A IVS 4823 4771 (c) lpoA intron B IVS 4713 4660 (c) lpoA intron C IVS 4444 4392 (c) lpoA intron D IVS 4349 4297 (c) lpoA intron E IVS 4217 4168 (c) lpoA intron F IVS 3743 3690 (c) lpoA intron G IVS 3624 3563 (c) lpoA intron H BASE COUNT 1204 a 1613 c 1594 g 1299 t ORIGIN 1 agctcacttt acctatacac atctgcattc agtccttcca gttctctgac cctaacatcc 61 ggtaaatgta ccttcagtga tcgggacgga aggtatgggc ctttcgcata ggtgggtaat 121 ctgcgactgt atgttttgta tggtaccctg agacagtcac ttactgtttc tgctcgctcc 181 aggtaccatt gtcccgcctc tgcgtgattt ccgaggctgg actggcccat ctctgcccac 241 cctgtcctca tctgccaaga gccatcggaa tgccaagccg tgaccactcc aaccggtccc 301 gttctctcag ccactgcgca agtttcttac aggagggctg cttcgccgtt cattcgcggc 361 ctccggatag ctagcgagct tcgatgctcg tggccaatta tggaagcagt cgttgatcgc 421 accggtcccg tactgccttc gctcacaagc cgtgttgttg cgagactctc attcgctggc 481 tcagggtatt gtgcctgttt gctgaggcac agtgcagtca atacacactt gtctcgtcag 541 gacgcggttt gacattccgt ggtgcgtgaa acggtataaa agggatacgc gatttgcagc 601 atatcctcag gccattcgtc ttctacagcc caagttccaa gtcaaacggt catggccttc 661 aagcagctcg tcgcagcgat ttccctcgca ctctcgctca ccactgccaa tggtacgcac 721 cgcttctgca tgctgtgata acgggccccg actaacgcct ccgctgcagc cgccgtggtc 781 aaggagaagc gcgccacctg ctccaacggc gccaccgttg gcgacgcgtc ctgctgtgct 841 tggttcgatg tcctcgacga catacagcag aacctgttcc aaggaggcca gtgcggcgct 901 gaggcccacg agtctatccg tctgtaagtc aatacgctgg tgttgcgcca aggtcataga 961 ttcactttgc tgcagcgtgt tccacgatgc tattgccatc tctcctgcta tggaggccca 1021 gggcaagttc gggtatgtct ttccggcatg gcaatatttt acagcagaca ctgagatatt 1081 gcgcagcggt ggtggtgctg acggctccat catgatcttc gacgacatcg agcccaactt 1141 ccaccctaac attggcctcg acgagattat caacctccag aagccgttcg tccagaagca 1201 cggtgtcacc cctggtgact tcatcgcctt cgccggtgct gtcgcgctca gcaactgccc 1261 gggtgcccca cagatgaact tcttcactgg tcgtcgtcct ggtacgtctc ctctacgaat 1321 cgatctcgac acctcattca tatcgcctta tagctaccca gcccgcaccc gatggtctcg 1381 ttcccgagcc tttccgtgag tttgcagacc acttcatcgc atagttctta gctgacctct 1441 tcatcgcata gttcttagct gacttcagca cagacaccgt cgaccagatc atcgctcgtg 1501 ttaacgatgc cggcgagttc gacgagctcg agcttgtctg gatgctttcc gcgtaagtga 1561 ctgccgcctc gaatttccat cccgacttac accccgattc agccactccg ttgctgcagt 1621 caacgacgtg gacccgaccg tccagggcct gcccttcgac tccacccccg gaatcttcga 1681 ctcgcagttc ttcgtcgaga ctcagttccg tggtatcctc ttccccggct ccggtggcaa 1741 ccagggtgag gtcgagtccg gtatggctgg cgagatccgc atccagaccg accacactct 1801 cgcccgcgac tcccgcaccg cttgcgagtg gcagtcgttc gtcaacaacc agtccaagct 1861 cgtctccgac ttccagttca tcttccacgc cctcacccag ctcggccagg acccgaacgc 1921 gatgaccgac tgctcggatg tcatcccgat ctcgaagccc atccccggca accttccgtt 1981 ctcgttcttc ccccctggca agagcatgaa ggatgttgag caggctgtag tatccgattc 2041 agtccttgtc gcagagctta tgctgacggc ttctgcagtg cgccgagacc cccttcccca 2101 gcctcgtcac tctccccggc cccgcgacct ctgtcgctcg catgtgagta tctccgacgg 2161 tctatgaagc ccccagctga catattcctc ttccagcccc ccgccgccgg gtgcttaagt 2221 cattctatcg gtcatctttg gctgaaacgg agtatttgga atacggctca ctcgtaacgg 2281 taacttgcgc tcaagtgttt agaaatgtct cctttgtatc tacgcgattg gtccgctttt 2341 gacgatagat cgttactgtg ttcattgaaa ttctcgtccg cgcgccctgg agcgaaccgg 2401 ttagcattgc cacacgagag ctcttccgtt gctccaactc gagctgtaat ggtccaacgc 2461 tccacgctac atcaatttaa cctctcatgg gtacggtgta ttcggcaagt ttatctcaca 2521 taataagagg cacgctatca ttcgacgata caagaacatg agccttcgct tcgtttatga 2581 tattggttca ctgtcgagct aatttctgag ggttagcgct ctgacatgat cagctacagg 2641 aacggaggcc gtaccttgaa tgtgcccata aacccgctgt cttattcttc tcaaattgat 2701 tcttcatgtt tgaatcacgt ttgcaggtgc attcgtgtac ctgcggcgcg tacacgcggt 2761 atgtattggt cgcaaatcgc atcatggtga gatcttgctc ttcactcttg aagttgctac 2821 cgtataccac catgtgcagg aattctcgta catccctgtt tctcctcgaa tgtatgtgga 2881 gccagggaaa ccctaacccc ggattctgct gagatgcgtc gatgcacgca gccgtagcgg 2941 aggtccgtga ggtccgctcc ggccacgaag caggggccgt cctgaccggt cgaaggtcat 3001 gtcgtgcgac atagtcggct tccaggagga cgatatcgac caatacgtcg aaaggaggag 3061 actgcgggtc taggctggac gctgtttgcg agggcccggg ggagaacgag gccattggga 3121 gtcagcgaga ttattgaata gtcgaagggt attcattgag tcactaaggg aaacacttct 3181 gagccgctgg tagtacttgt gtatgcccgg gttctgcgcc tgataattag cctcgctcct 3241 ccgttgacgt tgggttttgg caataggaca tcaccacttt caccacgcgg acgcaatgcg 3301 aagggcacga gtggtatctc aatagctagt taccttccaa gaccctcaat catgatcgga 3361 agaagaggat gtgcaccgat atttcataag cccacggcag atatcgtaag agagtagacg 3421 aatgagattc gtagttaggt gcagagatac gatgaatgaa atctagtaaa gccgaagttc 3481 cgtcacgagt tagccggcca ccgttacagt cggtttgagg agtattctgt atggcatcat 3541 ttaagcaccc ggaggcggag ggctggagaa ggagcatgtc agcccagatt gcatttcctg 3601 aaagatctca tggattgtac tcacatgcgc tggacggacg tctcggggcc cgggagagtg 3661 gtgagagtcg ggaagggggt ctccgcacac tgtcatgcga tgttcagcag ccactctact 3721 gcatggtggg gtgaaatacg caccgcctgc tcaacgtcct tgatggtctt gccagcgggg 3781 aagaacgaga atgggaggtt gccagggatg ggcttggact gcgggataac atccgagcag 3841 tcggtcatcg cgttcgggtc ctggccgagc tgggtgaggg cgaggaagat gaactggaag 3901 tcatcgacga gcttggactg gttgttgacg aaggactgcc attcacacgc cgtgcgcgag 3961 tcgcgggcga tagtgtggtc ggactggatg cgaatttcgc cagggagcgg cgactcgacc 4021 tcgccttggt tgccaccaga gccggggaag gcggtaccac gaagctgagt ctcgacgaag 4081 aactgggagt cgaagattcc gggggtcgag tcaaagggca gaccctggac ggtcgggtcg 4141 acgtcgttca ccgctgcgac ggagtgcctg tcgaggtctc aggaagggag tgtcgaagtc 4201 aacagtgagt gacttacgcg gagagcatcc agacaagctc gagctcatcg aactcgcctg 4261 cgtcgttgac acggttgatg atttggtcga cagtgtctgc atgctagtca gtatagaccg 4321 cacctaactg cttggataag accacttacg gaagggctcg gggacaaggc catcaggagc 4381 gggctgggta gctaaagcag acagttagtt cgtaccatcc gcaaagcgag ttttgcaggt 4441 ataccaggtg cacgaccagt gaagaagttc atctgcgggg caccagggca gttgctgagc 4501 gcgacacgac cagcgaaggc gatgaagtca ccaggggtga caccgtgctt ctgaacgaat 4561 ggcttctgga gcttgacgat ctcgtcgaga ccgatgttag ggtggaacgc agtctcgata 4621 tcgtcgaaga tcatgatgga gccgtcagca ccaccgccgc tgcaaggagg gatcagcaaa 4681 cgactaggtg gcgcaacgcg ggtggcaact tacccgaact tgccctgtgc ctccatggcg 4741 ggcgaaattg cgatggagtc gtggaagacg ctgggcgggg tgttcaaaca tgcatagcag 4801 gagatcgcga cgggatcact cacagacgaa tcgactcgtg cgcctcagcg ccgcactggc 4861 cgccgtggaa caggttctgc tggatatcat ccaggacgtc gaaccaagcg cagcacgacg 4921 catcgccgac ggtcttgccg ttggaacagg tggcgcgctt ctcgatcacc gcagccgctg 4981 cacaagacga cgttcagcat gcagtccact ggtcaacgct aactgcgatg ggcataccgt 5041 tcgcagccga gagcaagaga gcgagagaga tagctgcgaa gagctgcttg aaggccatgt 5101 ccgctgtgtt gctggtgctg agtgggactg aagagactgg atgtctgagg gactgcggtg 5161 gtcctgtcgc ccttttatac cctaggcgtg gtcgacgtcc tggtattgtt cgccgtagaa 5221 cagtgtcgaa tcgacgtgac gcggtgcgcg gacatgcacg acactgcgcc agccaatgag 5281 gacgctgcca aaacgcagcc tgtgagcgag ttggtgcggt gccggcaacc atcaccgact 5341 cgtctcacat ttgggccact gcgtcgagcg cagttcgcgc cggcaccgct gttgaatagc 5401 acgcgagctc tgcaagaaag aatagggcgg cccatgagaa cagaaatccg agtcagagga 5461 attaactgcg cgtgccgatg agtcttgaca tgaggatgat ctaacgaaga gaccttgcat 5521 tgagccgttt ccagtgctgc caggggtaat cagtcggcat tactgccaag tccggggatg 5581 tactgctagc tcactcccat cgcaatatgt caccgagtat tgcctttgtg aacataccat 5641 tgattcggtc ccgatcatgc acgaacgact cccgcaaagt ggggccagtg actatcacgt 5701 ccgtgctcag // LOCUS XANXCAA 2333 bp ds-DNA BCT 15-AUG-1990 DEFINITION X.campestris major extracellular endoglucanase (engXCA) gene, complete cds. ACCESSION M32700 KEYWORDS major extracellular endoglucanase. SOURCE X.campestris DNA. ORGANISM Xanthomonas campestris Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Pseudomonadaceae. REFERENCE 1 (bases 1 to 2333) AUTHORS Gough,C.L., Dow,J.M., Keen,J., Henrissat,B. and Daniels,M.J. TITLE Nucleotide sequence of the engXCA gene encoding the major endoglucanase of Xanthomonas campestris pv. campestris JOURNAL Gene 89, 53-59 (1990) STANDARD simple staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by C.L.Gough, 09-MAR-1990. FEATURES from to/span description pept 383 1864 major extracellular endoglucanase (engXCA) precursor sigp 383 457 major extracellular endoglucanase signal peptide (put.) matp 458 1861 major extracellular endoglucanase (put.) BASE COUNT 444 a 800 c 734 g 355 t ORIGIN 1 gaattcccgg ggatcacaaa cgacgcgaac aagccgacct gcgggtccac gcctgcgacg 61 aacgcaaagg cgatgacttc gggaatcagg gcgaacgtgg caacggcgcc agccatcagt 121 tcgcgcgcag gcgaggcgcc attgcgccag ttcggtgcgc aggaaggaca tgggggacac 181 tccagggaca agaacgacat gcctgcggac agcgcgcagg gggcactagt gtgcgggaaa 241 cggccgctcc cgcagccgcg atgtgatcgg tgcggcaatg gtgttttctg tggggacgat 301 cacaccacgc gacgcgcgca cagaccaaga tgcccgcctt accgcgctcg ggtgtcgagc 361 ccggttctct agggagatca ccatgtccat attcaggacc gcaagcacgc tcgctttggc 421 caccgccctc gcactggccg ccgggccggc cttcagctat tccatcaaca acagcaggca 481 gatcgtcgac gacagcggca aggtcgtgca gctcaagggt gtgaacgtgt tcggcttcga 541 aaccggcaac cacgtgatgc atggcctgtg ggcacgcaac tggaaggaca tgatcgtgca 601 gatgcagggc ctgggcttca acgccgtgcg cctgccgttc tgcccggcca cgctgcgtag 661 cgacaccatg ccggccagca tcgactacag ccgcaacgcc gacctgcagg gcctgacctc 721 gctgcagatc ctcgacaagg tgatcgccga attcaatgcg cgcggcatgt atgtgctgct 781 ggatcaccac acccccgatt gcgccggcat ttccgagctc tggtacaccg gctcctatac 841 cgaggcacag tggctggccg acctgcgctt tgtggccaac cgctacaaga acgtgccgta 901 tgtactcggc ctggatctga agaacgaacc gcacggcgcc gccacctggg gtaccggcaa 961 cgccgccacc gattggaaca aggctgccga gcgcggctcg gccgcggtgt tggcggtcgc 1021 gccgaagtgg ctgatcgcgg tggaaggcat caccgacaac ccggtgtgct ccaccaacgg 1081 cggcatcttc tggggcggca acctgcagcc gctggcctgc accccgctca acatcccggc 1141 caaccgcctg ctgctggccc cgcacgtgta cggcccggac gtgttcgtgc agtcgtactt 1201 caacgacagc aacttcccca acaacatgcc cgccatctgg gaacgccatt tcggtcagtt 1261 cgccggcacg catgcgctgt tgctgggcga gttcggtggc aagtacggcg aaggcgacgc 1321 acgcgacaag acctggcagg acgcgctggt gaagtacctg cgcagcaagg gcatcaacca 1381 gggcttctac tggtcgtgga atcccaacag cggcgacacc ggcggcatcc tgcgcgatga 1441 ctggaccagc gtgcgccagg acaagatgac cctgctgcgc acgctgtggg gcaccgccgg 1501 caataccacg ccgacgccga ctcccacacc tacgcccaca ccgacaccga cgcctacccc 1561 cacgccgacg cccaccccgg gcaccagcac cttcagcacc aaggtgatcg cctcgccggt 1621 ggtggggtcg gcagcgcgaa aactgccggc ggcatcgcgg ctggcttgcc attggccggc 1681 cagcagcacg ggttggagag tctgggtcat cgcggcacct tcggttacgt ggaagcgccc 1741 gcacgcagca cgggcgatcg aacggcggat gagggtaacg cgcctgcgac gtgccacccg 1801 tttgaatcgt ggaccactac cggcaccggc ccatacaacg cagcacgcac cgcggctgcg 1861 ctaaacaagg ccgcgcgacg gcggtggcgc gtgctcagtg caggctgggc gcggtggcga 1921 tggcgtggtc gatcaccttc agcgctgcct cgcgctcggc accgtccacc accaggcgtg 1981 gcgcacggac acgctcgctg cccaggccca ccttttcctg caccagtttg atcagctgca 2041 cgaacttggg cacggtatcc aggcgcagca gcggcaggaa ccagtcgtac agttccttgg 2101 cggcggggta accgccgtcg cgtgccagtt cgaacaggcg taccgactcc ttcggactac 2161 tgtacttgac cagcccggcg atccacccct tggcgcccat gctcaggcct tcgacgatgg 2221 cgtcgtccat gccgaccagc agcgccagac gatcgcccag caattcctgc agcgcggcga 2281 agcggcgcac atcgccggaa gattccttta ctgcctgcag gattggggaa ttc // LOCUS FLAHANENJ8 1458 bp ss-RNA VRL 15-AUG-1990 DEFINITION Influenza virus A/NJ/8/76 (H1N1) hemagglutinin/neuraminidase (seg 4) gene, complete cds. ACCESSION M27970 KEYWORDS hemagglutinin/neuraminidase. SOURCE Influenza virus A/NJ/8/76, cDNA to viral RNA, clones pNA[6,28], passed in embryonated eggs. REFERENCE 1 (bases 1 to 1458) AUTHORS Miki,T., Nishida,Y., Hisajima,H., Miyata,T., Kumahara,Y., Nerome,K., Oya,A., Fukui,T., Ohtsuka,E., Ikehara,M. and Honjo,T. TITLE The complete nucleotide sequence of the influenza virus neuraminidase gene of A/NJ/8/76 strain and its evolution by segmental duplication and deletion JOURNAL Mol. Biol. Med. 1, 401-413 (1983) STANDARD simple staff_entry FEATURES from to/span description pept 21 1430 hemagglutinin/neuraminidase precursor sigp 21 125 hemagglutinin/neuraminidase signal peptide matp 126 1427 hemagglutinin/neuraminidase BASE COUNT 462 a 257 c 343 g 396 t ORIGIN 1 agcaaaagca ggagtttaaa atgaatacaa atcaaagaat aataaccatt gggacaatct 61 gtctaatagt tggaataatt agtctattat tgcagatagg aaatataatc ttgttatgga 121 tgagccattc aattcagact ggagaaaaaa gccatcctaa ggtatgcaac caaagtgtca 181 ttacctatga aaacaacaca tgggtgaacc agacttatgt aaacattagc aataccaata 241 ttgctgctgg acagggtgtg actccaataa tactagccgg caattcctct ctttgcccaa 301 tcagtgggtg ggctatatac agcaaagaca atagcataag gattggttcc aaaggagaca 361 tttttgtcat gagagagcca ttcatttcat gctctcactt ggaatgcaga accttttttc 421 tgacccaagg cgctttgctg aatgacaggc attctaatgg aaccgtcaag gacaggagtc 481 cttatagaac cttaatgagc tgccccatcg gtgaagctcc atctccgtac aattcaaggt 541 tcgaatcagt tgcttggtca gcaagtgcat gccatgatgg aatgggatgg ctaacaatcg 601 ggatttccgg tccagataat ggagcagtgg ctgttttaaa atacaatggt ataataacag 661 atacaataaa aagttggaga aacaaaatat taagaacaca agagtctgaa tgtgtttgta 721 taaacggttc gtgttttact ataatgactg acggcccaag caatgggcaa gcctcgtaca 781 aattattcaa aatggagaaa gggaagatta ttaagtcaat tgagctggat gcacctaatt 841 accactatga ggaatgctcc tgttaccctg atacaggcaa agtggtgtgt gtgtgcagag 901 acaattggca tgcttcgaat cgaccatggg tctctttcga tcagaatctt gattatcaaa 961 tagggtacat atgcagtggg gttttcggtg ataatccgcg ttctaatgat gggaaaggca 1021 attgtggccc agtactttct aatggagcaa atggagtgaa ggggttttca tttagatatg 1081 gcaatggtgt ttggatagga agaactaaaa gtatcagctc tagacgtgga tttgagatga 1141 tttgggatcc taatggatgg acagaaactg atagtagttt ctctatgaag caagatatta 1201 tagcattaac tgattggtcg ggatacagtg gaagttttgt ccaacatcct gaattaacag 1261 gaatgaactg cataaggcct tgtttctggg tagagttaat cagagggcaa cccaaggaga 1321 gcacaatctg gactagtgga agcagcattt ctttctgtgg cgtgaacagt ggcactgcaa 1381 gctggtcatg gccagacgga gctgatctgc cattcaccat tgacaagtag tttatccaaa 1441 aaactccttg tttctact // LOCUS HUMHIS3PRM 1125 bp ds-DNA PRI 15-AUG-1990 DEFINITION Human histone H3 gene, complete cds. ACCESSION M26150 KEYWORDS histone. SOURCE Human HeLa cell DNA, clone pST519. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 1125) AUTHORS Marashi,F., Helms,S., Shiels,A., Silverstein,S., Greenspan,D.S., Stein,G. and Stein,J. TITLE Enhancer-facilitated expression of prokaryotic and eukaryotic genes using human histone gene 5' regulatory sequences JOURNAL Biochem. Cell Biol. 64, 277-289 (1986) STANDARD simple staff_entry FEATURES from to/span description pept 557 964 histone H3 /hgml_locus_uid="LV0006C" /nomgen="H3F2" /map="1q21" mRNA 520 > 964 histone H3 mRNA (5' end + / - 4 bp) signal 422 425 CAAT box signal 463 468 CAAT box signal 485 492 TATA box BASE COUNT 298 a 283 c 267 g 277 t ORIGIN 1 gcagcggcgt gataacagct cactgtaacc tcgaactcgg gctcaagcga tcctcatcga 61 cagccttctg agtagctggg attacaggcg agagcgccac gcccgactaa gagcattttc 121 taattgccca cacttcttat gcgacaccca gaaaaataca attttaaata aagcgcatat 181 gcaaataacc ctaatcgtct ccaatattca ctgatttctt ttttatattt taactagaaa 241 caattggagg tttccgcgtt gctttgtgtg gttgtaaatt ttaagacttc aggaaacttt 301 tccagtacaa gacttgtcca acagtggata tagcagctaa ggggttaaca aaatgacgtc 361 agagtagcta cggtaatggg caggagcctc tcttaatctg caaccaagca cagagatgga 421 ccaatccagg aagggcgcgg ggatttttga atttacttgg gtccaatggt tggtggtctg 481 actctataaa agaagagtag ctctttcctt tcctccacag acgtctctgc aggcaaagct 541 tttctgtggt tttgccatgg ctcgtactaa acagacagct cggaaatcca ccggcggtaa 601 agcgccacgc aagcagctgg ctaccaaggc tgctcgcaag agcgcgccgg ctaccggggg 661 cgtgaaaaag cctcaccgtt accgcccggg cactgtggct ctgcgcgaga tccgccgcta 721 ccaaaagtcg accgagttgc tgattcggaa gctgccgttc cagcgcttgg tgcgagaaat 781 cgcccaagac ttcaagaccg atcttcgatt ccagagctcg gcggtgatgg cgctgcagga 841 ggcttgtgag gcctacttgg tagggctctt tgaggacaca aacctttgcg ccatccatgc 901 taagcgagtg actattatgc ccaaagacat ccagctcgct cgccgcattc gcggagaagc 961 gtaaatgtaa agtcactttt tcatcagtct taaaacccaa aggctctttt cagagccacc 1021 cacttattcc aacgaaagta gctgtgataa ttttttgttg tcttaacaga acaaatttct 1081 aaggaccccc ccggaaagca ttagactatg gcttaaagtt gatac // LOCUS MUSTUBMA1 786 bp ss-mRNA ROD 15-AUG-1990 DEFINITION Mouse alpha-tubulin gene M-alpha-1, 3' end. ACCESSION M28729 KEYWORDS alpha-tubulin. SOURCE Mouse 15-21 day old brain, and 18 day old embryo, cDNA to mRNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 786) AUTHORS Lewis,S.A., Lee,M.G.-S. and Cowan,N.J. TITLE Five mouse tubulin isotypes and their regulated expression during development JOURNAL J. Cell Biol. 101, 852-861 (1985) STANDARD simple staff_entry FEATURES from to/span description pept < 1 597 alpha-tubulin (AA at 1) signal 773 778 poly-A signal BASE COUNT 186 a 187 c 210 g 203 t ORIGIN 1 gaattccaga ccaacctggt accctaccct cgtatccact tccctctggc cacttatgcc 61 cctgtcatct ctgctgagaa agcctaccac gagcagcttt ctgtagcaga gatcaccaat 121 gcctgctttg agccagccaa ccagatggtg aaatgtgacc ctcgccatgg taaatacatg 181 gcttgctgcc tgctgtaccg tggtgatgtg gttcccaaag atgtcaatgc tgccattgcc 241 accatcaaga ccaagcgtac catccagttt gtggactggt gccccactgg cttcaaggtt 301 ggcattaact accagcctcc cactgtggta cccggtggtg acctggccaa ggtgcagaga 361 gctgtgtgca tgctgagcaa caccacagcc attgctgagg cctgggctcg cctagatcac 421 aagtttgatc tgatgtatgc caagcgtgcc tttgtgcact ggtatgtggg tgagggcatg 481 gaggagggtg agttctctga ggcccgtgag gacatggctg ccctagagaa ggattatgag 541 gaggttggtg tggattctgt ggaaggcgag ggggaggaag aaggagagga atactaaatt 601 aaatgtcaca aggtgctgct tccacaggga tgtttattgt gttccaacac agaaagttgt 661 ggtctgatca gttaatttct atgtggcaat gtgtgctttc atacagttac tgacttatga 721 atgattgatt ttgacagaga ccccaagctg cccatttcac ttatgggttt taaataaaat 781 actccc // LOCUS MUSTUBMA2 1198 bp ss-mRNA ROD 15-AUG-1990 DEFINITION Mouse alpha-tubulin gene M-alpha-2, 3' end. ACCESSION M28727 KEYWORDS alpha-tubulin. SOURCE Mouse 15-21 day old brain, and 18 day old embryo, cDNA to mRNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 1198) AUTHORS Lewis,S.A., Lee,M.G.-S. and Cowan,N.J. TITLE Five mouse tubulin isotypes and their regulated expression during development JOURNAL J. Cell Biol. 101, 852-861 (1985) STANDARD simple staff_entry FEATURES from to/span description pept < 1 1059 alpha-tubulin (AA at 1) BASE COUNT 259 a 329 c 303 g 307 t ORIGIN 1 gcaaataact atgcccgtgg ccactacacc attggcaagg agatcattga ccttgtcctg 61 gacaggattc gcaagctggc tgaccagtgc acgggtctcc agggcttgtt cgttttccac 121 agctttggcg ggggaactgg ctctggcttc acctccctgc tgatggagcg gctctctgtg 181 gattacggaa agaagtccaa gctggagttc tccatttacc cagcccccca ggtttccact 241 gctgtggttg agccctacaa ttccatcctc accacccaca ccaccctgga gcactctgat 301 tgtgccttca tggtagacaa tgaggccatc tatgacatct gtcgtagaaa cctcgacatt 361 gagcgcccaa cctacaccaa ccttaaccgc cttattagcc agattgtgtc ttccatcact 421 gcttccctca gatttgatgg ggccctcaat gttgatctga cagaattcca gaccaacctg 481 gtaccctacc ctcgcatcca cttccctctg gccacttatg cccctgtcat ctctgctgag 541 aaagcctacc atgagcagct ttctgtagca gagatcacca atgcctgctt tgagccagcc 601 aaccagatgg tgaaatgtga ccctcgccat ggtaaataca tggcttgctg cctgctatac 661 cgtggtgatg tggttcccaa agatgtcaat gctgccattg ccaccatcaa gaccaagcgc 721 acgatccagt ttgtagactg gtgccccact ggcttcaagg ttggcattaa ttaccagcct 781 cccactgtgg tacccggtgg tgacctggcc aaggtgcaga gagctgtgtg catgctgagc 841 aacaccacag ccattgctga ggcctgggct cgcctagatc acaagtttga tctgatgtat 901 gccaagcgtg cctttgtgca ctggtatgtg ggtgagggca tggaggaggg tgagttctct 961 gaggcccgtg aggacatggc tgccctagag aaggattatg aggaggttgg tgtggattct 1021 gtggaaggcg agggggagga agaaggagag gagtactaag tccattcctt gagccccctg 1081 tgtcgtcaaa tgctccagta ttagttgcag gcacctgatg cttctgtgct gtttccattc 1141 tgtgatcatg tcttctccat gttgtacctc ttaagttttc catgatgtct caaactaa // LOCUS MUSTUBMB2 488 bp ss-mRNA ROD 15-AUG-1990 DEFINITION Mouse beta-tubulin gene M-beta-2, 3' end. ACCESSION M28739 KEYWORDS alpha-tubulin. SOURCE Mouse 15-21 day old brain, and 18 day old embryo, cDNA to mRNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 488) AUTHORS Lewis,S.A., Lee,M.G.-S. and Cowan,N.J. TITLE Five mouse tubulin isotypes and their regulated expression during development JOURNAL J. Cell Biol. 101, 852-861 (1985) STANDARD simple staff_entry FEATURES from to/span description pept < 1 303 beta-tubulin (AA at 1) signal 472 477 poly-A signal BASE COUNT 121 a 116 c 131 g 120 t ORIGIN 1 cccaacaacg tcaagacggc cgtgtgtgac atccctcctc gtggcctcaa gatgtcagcc 61 accttcattg gcaacagcac tgccatccag gagctgttca agcgcatctc ggagcagttc 121 actgccatgt tccggcgcaa ggctttcctg cactggtaca cggctgaggg catggacgag 181 atggagttca ccgaggcgga gagcaacatg aatgacctgg tgtctgagta ccagcagtac 241 caggatgcca cggccgatga gcagggcgag ttcgaggagg aggagggtga agatgaggct 301 tgagaacttc tcagatacag tgtgcaccct tagtgaactt ctgttgtcct ccagcattgg 361 tctttctatt tgtaaattat ggtgctcagt ttgcctctgt cagaaattca ctgttgatgt 421 aatagtgtga acctctttca agatcacagt attgtctcag aaatctatat gaataaaaaa 481 gcatgtgg // LOCUS MUSTUBMB4 1454 bp ss-mRNA ROD 15-AUG-1990 DEFINITION Mouse beta-tubulin gene M-beta-4, 3' end. ACCESSION M28730 KEYWORDS alpha-tubulin. SOURCE Mouse 15-21 day old brain, and 18 day old embryo, cDNA to mRNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 1454) AUTHORS Lewis,S.A., Lee,M.G.-S. and Cowan,N.J. TITLE Five mouse tubulin isotypes and their regulated expression during development JOURNAL J. Cell Biol. 101, 852-861 (1985) STANDARD simple staff_entry FEATURES from to/span description pept < 1 1140 beta-tubulin (AA at 1) BASE COUNT 327 a 456 c 376 g 295 t ORIGIN 1 gtcgacctgg aacccggcac catcgactct gtccgctccg gcccttttgg ccagatcttt 61 cggccagaca actttgtatt tggtcaatcc ggagcaggca acaactgggc caagggtcac 121 tacaccgagg gcgcgcagtt agtggatgcc gtcctggacg tggtgcgcaa agaggcggaa 181 agctgcgact gtctccaggg cttccagctc acccactcgc tcggaggtgg caccggctca 241 ggcatgggga ccttgctcat cagcaagatc cgagaggagt ttccagacag gatcatgaat 301 acgttcagcg tggtgccatc acccaaggtg tctgacacgg tggtggagcc ctacaatgcc 361 acactgtctg tgcatcagct ggtggagaac actgatgaga cctactgcat cgacaacgag 421 gccctgtacg acatctgctt ccgtacgctc aagctgacca cgcccacgta cggggacctc 481 aaccacctcg tgtcagccac catgagtgga gtcaccacct gcctacgttt cccgggccag 541 ctcaatgcag acctacgcaa gctggctgtg aacatggtgc cattcccccg tctccacttc 601 ttcatgccag gattagcacc cttgaccagc aggggcagcc agcagtaccg ggccctcacc 661 gtccctgagc tgacccaaca ggtgttcgat gctaagaaca tgatggctgc gtctgacccg 721 agacacggtc gctacctgac tgtggctgct gtcttccggg gacggatgtc catgaaggag 781 gtagacgagc agatgttaag tgtgcagagc aagaacagca gttacttcgt tgagtggatc 841 cccaacaatg tgaaggcagc cgtatgtgac atcccgcccc gcggcctgaa gatggcagcc 901 accttcatcg gcaacagcac tgccatccag gagctgttca agcgcatctc ggagcagttc 961 accgccatgt tcagacgcaa ggccttcctg cactggtaca cggccgaagg catggacgag 1021 atggagttta cggaagcaga gagcaatatg aacgacctgg tgtccgagta ccagcagtac 1081 caggatgcca ctgctgaaga gggcgagttc gaagaggagg ctgaagagga ggtggcttaa 1141 gtctcctgcc atcactctgt ccctggggcc caccagcaaa gctttgaccc taagcatcac 1201 acccctgcac ctagttgcct cattccctag gaccccatga gcatcttcac catgaggcca 1261 agcccaggtt gcttctattt gcttcacctt taactcctaa accccactgt ctctccaacc 1321 tgccagggaa gggctcttct agttcccatg agcgcccctc aacacatgta cacacgcaca 1381 cacactccac cttcttagat cttgaaaatc ctttccttta tgccctgtcc cttccccagc 1441 actcctgaac cgat // LOCUS MUSTUBMB5 542 bp ss-mRNA ROD 15-AUG-1990 DEFINITION Mouse beta-tubulin gene M-beta-5, 3' end. ACCESSION M28732 KEYWORDS alpha-tubulin. SOURCE Mouse 15-21 day old brain, and 18 day old embryo, cDNA to mRNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 542) AUTHORS Lewis,S.A., Lee,M.G.-S. and Cowan,N.J. TITLE Five mouse tubulin isotypes and their regulated expression during development JOURNAL J. Cell Biol. 101, 852-861 (1985) STANDARD simple staff_entry FEATURES from to/span description pept < 1 363 beta-tubulin (AA at 1) signal 524 529 poly-A signal BASE COUNT 126 a 136 c 149 g 131 t ORIGIN 1 gaggtggatg agcagatgct caatgtgcag aacaagaata gcagctactt cgtggaatgg 61 atccccaaca atgtcaagac agctgtctgt gacatcccac cgcgtggcct caagatggca 121 gtcaccttca ttggaaacag cacagccatc caggagctgt tcaagcgcat ctctgagcag 181 tttacggcta tgttccgccg gaaggctttc ctccactggt acacggctga gggcatggac 241 gagatggagt tcaccgaggc tgagagcaac atgaacgacc tggtgtctga gtaccagcag 301 taccaggatg ccaccgctga agaggaagag gatttcggag aggaggcaga agaggaggcc 361 taacggcaga gagccctgca tcagctcagg ctgcttagac tccctcagcc tttctccaac 421 tgccctttgt cctccagttt ctttctgctg cctctgtctt gtatttgttt tgcttctgtt 481 ttctcattgg gggtaaatgg tgcctggcac atggcaggca ctcaataaat atttgtttgt 541 gg // LOCUS XELPAL 353 bp ss-mRNA VRT 15-AUG-1990 DEFINITION X.laevis parvalbumin mRNA, 3' end. protein. ACCESSION M28644 KEYWORDS parvalbumin. SOURCE X.laevis tadpole, cDNA to mRNA, clone lambda-PV1. ORGANISM Xenopus laevis Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Amphibia; Lissamphibia; Anura; Archeobatrachia; Pipoidea; Pipidae. REFERENCE 1 (bases 1 to 353) AUTHORS Kay,B.K., Shah,A.J. and Halstead,W.E. TITLE Expression of the Ca2+ -binding protein, parvalbumin, during embryonic development of the frog, Xenopus laevis JOURNAL J. Cell Biol. 104, 841-847 (1987) STANDARD simple staff_entry FEATURES from to/span description pept < 1 339 parvalbumin (AA at 1) BASE COUNT 92 a 86 c 90 g 85 t ORIGIN 1 agatttacta tggcattcgg tggtatcctg agtgaggctg acatctctgc tgccctgcag 61 aactgccaag ctgctgactc cttcaacttc aaaactttct ttgcccagtc tggtctgagc 121 agcaagtccg cagatgatgt gaaaaacgtc tttgccatcc tcgaccagga caggagcggc 181 ttcattgagg aagaggaact gaagttgttc ctccagaact tcagcgcaag tgccagggca 241 ctgactgatg ctgaaaccaa ggccttcctg gcagctggtg actctgatgg tgatggcaaa 301 attggagttg aagaattcca gtccctagtc aaaccttgaa gaagtaagac caa // LOCUS RATMLVI4 100 bp ss-mRNA ROD 15-AUG-1990 DEFINITION Rat Moloney murine leukemia provirus Mlvi-4 mRNA, partial sequence. ACCESSION M36432 KEYWORDS provirus. SOURCE Rat Moloney murine leukemia virus-induced T-cell lymphoma, cDNA to mRNA. ORGANISM Rattus norvegicus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 100) AUTHORS Tsichlis,P.N., Lee,J.S., Bear,S.E., Lazo,P.A., Patriotis,C., Gustafson,E., Shinton,S., Jenkins,N.A., Copeland,N.G., Huebner,K., Croce,C., Levan,G. and Hanson,C. TITLE Activation of multiple genes by provirus integration in the Mlvi-4 locus in T-cell lymphomas induced by Moloney murine leukemia virus JOURNAL J. Virol. 64, 2236-2244 (1990) STANDARD simple staff_entry FEATURES from to/span description mRNA < 1 > 100 Mlvi-4 mRNA recomb 73 74 Rat DNA end/provirus DNA start BASE COUNT 24 a 29 c 19 g 28 t ORIGIN 1 ttactggaag ccctcctcat catgggattt catcacagta aacaacaatc tcacctctga 61 ccaggctgtc caggattctc ctcatggttt gtcgaaggtc //