Path: utzoo!attcan!uunet!bionet!root
From: GenBank-Updates@genbank.bio.net
Newsgroups: bionet.molbio.genbank.updates
Subject: Database Update
Message-ID:
Date: 14 Aug 90 12:02:27 GMT
Sender: root@genbank.BIO.NET
Distribution: bionet
Lines: 17725
Approved: lear@genbank.bio.net
Checksum: 26064 1048

LOCUS       RATNCAMC 1195 bp ds-DNA ROD 14-AUG-1990
DEFINITION  Rat neural cell adhesion molecule (NCAM-C) gene, exon 1.
ACCESSION   M32612
KEYWORDS    neural cell adhesion molecule.
SOURCE      Rat (strain Sprague-Dawley) DNA.
  ORGANISM  Rattus norvegicus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1 (bases 1 to 1195)
  AUTHORS   Chen,A., Reyes,A. and Akeson,R.
  TITLE     Transcription initiation sites and structural organization of
            the extreme 5' region of the rat neural cell adhesion molecule
            gene
  JOURNAL   Mol. Cell. Biol. 10, 3314-3324 (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly
            submitted by R.Akeson 06-MAR-1990, for release after
            publication.
FEATURES    from to/span description
    pept    1142 / 1193  neural cell adhesion molecule, exon 1 (NCAM-C)
    pre-msg 947 > 1193   NCAM-C mRNA and introns (alt.)
    pre-msg 948 > 1193   NCAM-C mRNA and introns (alt.)
    pre-msg 950 > 1193   NCAM-C mRNA and introns (alt.)
    pre-msg 951 > 1193   NCAM-C mRNA and introns (alt.)
    pre-msg 952 > 1193   NCAM-C mRNA and introns (alt.)
    pre-msg 955 > 1193   NCAM-C mRNA and introns (alt.)
    pre-msg 956 > 1193   NCAM-C mRNA and introns (alt.)
    IVS     1194 > 1195  NCAM-C intron 1.
site 307 484 homopurine-homopyrimidine region BASE COUNT 324 a 275 c 333 g 263 t ORIGIN 1 gaattcttgc tcttgctcct gtgccgtaat acacaccgaa aagccagggc agcagtggag 61 ggagaggaag agaggattcc aatacttgca agcagtttct ggtaatttct gaatactact 121 tctaaatgtc tctgcatttt cttctaactg cccccccgca cccccccccc acttctgaga 181 tctgagttgt gctgctgtga gttgctccct ctgtgggaag gtaaacttca taggtaagag 241 aaaacaaaca acaaagcaaa cctccacaat cctcatttct ctccaaattt ccaatttcct 301 tcacttagga ggaagaggag gaagaggagg aagagggtga ggaggaggaa gaggaagaag 361 aggaagaaga ggaagaagag gaagaagagg aagaagagga agaagaggat gagggaaagg 421 aggagaagag gaggaggagg aggaggaaga aaaggagaag aaggagaagg agaaaaggag 481 gaagccattt gcttctattg caaggacttt agtaactaca tggggtgggt gtgggggctc 541 tacaactttg aaaagcgagc tgaatctaaa aattctttct ttttttttcc taattattta 601 aacgttcaaa ttcccgatta aggaacgctg gagggcagaa ggatctccaa aggcttgagt 661 gaaggaaaag ggtgaaaaga agaaatccca gcttctgcct ggaaggttct gtgtgaaaga 721 gcccggctcc cttggtgact ccaggccgtg ttttgcaggc ggccgcatct gcctcccctg 781 tctcttacct cctggatgtt aggaactatt tgtggtcggc gtggtggaag gacacagtga 841 ggttctcacc tccgcccccc gcccgtcgct cgcatcccca gttccatcaa agccaacccg 901 ggccagcgca gggatctccg agttgcgagt gtgctgaggc tggggctgtc actcattctc 961 cgctcagcgg tgaacgcagc tcggcagtgg ctggcaagaa acaattctgc aaaaataatc 1021 atacccagcc tggcaattgt ctgctcctcg gtccattgct ccgccgccgt ccacagtcgc 1081 ttgcaaggga aggcactgaa tttaccgcgg ccagaacatc cctcccagcc ggcagtttac 1141 aatgctgcga actaaggatc tcatctggac tttgtttttc ctgggaactg caggt // LOCUS YSCTEC1A 2308 bp ds-DNA PLN 14-AUG-1990 DEFINITION S.cerevisiae TEC1 protein gene, complete cds. ACCESSION M32797 KEYWORDS TEC1 protein. SOURCE S.cerevisiae (strain GRF18) haploid cell DNA, clone pBC43-2. ORGANISM Saccharomyces cerevisiae Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes; Endomycetales; Saccharomycetaceae. REFERENCE 1 (bases 1 to 2308) AUTHORS Laloux,I., Dubois,E., Dewerchin,M. and Jacobs,E. 
TITLE TEC1, a gene involved in the activation of Ty1 and Ty1-mediated gene expression in Saccharomyces cerevisiae: Cloning and molecular analysis JOURNAL Mol. Cell. Biol. 10, 3541-3550 (1990) STANDARD full staff_review COMMENT Draft entry and computer readable sequence for [1] kindly submitted by I.M.Laloux 13-MAR-1990. FEATURES from to/span description pept 395 1855 TEC1 protein (TEC1) signal 289 294 TATA box BASE COUNT 781 a 513 c 410 g 604 t ORIGIN Chromosome 2, right arm. 1 tgcgtgtttc agattgcgtt tgcttattat tctgggaaac tctccggaaa caagctgagc 61 tggactccac gaatggtccc tcacaaggct cttggctcat gcacagttaa aaagcacctg 121 tagctaccat cagcaatcca tggtgctgga gtttctctcg atggtacggc ccatagctgg 181 agaaaatagc ttattaccaa aaaagtaatt cccccgcaga ggtatatatt aagtttgaaa 241 caaaaggctt gcttaatttt ttacatatga caaccctttt cttactttta taaatgttta 301 tgaaacagga gattcttcga ctggtgccag attcctccca tatcgacaag aagaataatc 361 cacctatttc aacaattctg atacctgttt aaccatgagt cttaaagaag acgactttgg 421 caaggataat tctagaaata tagaatcata tactggtaga atttttgacg tatatataca 481 aaaagattcg tattcacagt cggccttgga tgatatgttt ccagaagccg tagtttcaac 541 cgccgcttgt gtgaaaaatg aagcggagga taacatcaat ctcatagaca cgcatcctca 601 attcgaactg gtaaatactg gactgggtgc taaatcggac gatttgaaat ctccatcagc 661 aaaggctacg ttcactgaca agcagaggaa gaatgaagta ccaaatatat ctgtgagcaa 721 ctactttccc ggacaaagta gcgaaacgtc gtcaacaacg gaatcttgga ctatcggttg 781 tgataagtgg tcagaaaagg tagaagaggc attccttgag gcacttagac tgataatgaa 841 aaatgggacc acaaaaataa aaataagaaa tgccaatttt ggaagaaacg agctgatttc 901 attatatatc aagcacaaaa ccaacgagtt cagaaccaaa aagcaaattt cttcccatat 961 tcaagtctgg aagaagacca tacaaaacaa aatcaaggac tcgctgaccc tatcatcaaa 1021 ggagaaggag cttctacacc ttatcgaaca tggcgctgaa caaactactg aaaactcaaa 1081 cctgttttat gacatatttg aagaaattat cgactctcta ccttcagtca gtgattctgg 1141 aagtttaacc cctaaaaacc tctatgtaag taataatagc agtggattgt cagtacattc 1201 aaaactgctt acgccaatca ctgcttccaa cgagaaaaaa attgaaaatt tcataaaaac 1261 taatgctgca tctcaagcca aaacccccct 
catttacgct aagcacattt atgaaaacat 1321 agacggctac aagtgcgttc cgtcaaagag gcctcttgaa caactttccc ccacggaact 1381 ccaccaggga gatcgcccca ataaggctag cttttccaac aagaaggcaa tcctggagag 1441 tgcaaaaaaa atcgaaatag agcagagaaa gataatcaac aaataccaaa gaatttcccg 1501 catacaagaa catgaaagta atcctgagtt cagttccaat tccaattccg gttcagagta 1561 cgaatcggag gaagaagtag tcccaagatc agccacagtc acacaactcc aaagcagacc 1621 agtgccatac tacaagaata atggaatgcc ctactcactc tccaaagtac gaggaaggcc 1681 catgtatcca agacctgctg aagatgctta caatgccaat tatattcaag gtctgcccca 1741 gtaccaaaca tcttattttt cgcagctgtt attatcatca ccccagcatt acgaacattc 1801 tccacatcaa aggaacttta cgccatccaa ccaatcgcat gggaactttt attaaatgtc 1861 tacatacata catacatctc gtacataaat acgcatacgt atcttcgtag taagaaccgt 1921 cacagatatg attgagcacg gtacaattat gtattagtca aacattacca gtctcgaaca 1981 aaagctactc ctgcaacact cttctatcgc acatgtatgg ttcttattgt ttcccgagtt 2041 cttttttact gacgcgccag aacgagtaag aaagttctct agcgccatgc tgaaattttt 2101 ttcacttcaa cggacagcga ttttttttct ttttcctccg aaataatgtt gcagcggttc 2161 tcgatgcctc aagaattgca gaagtaaacc agccaataca catcaaaaaa caactttcat 2221 tactgtgatt ctctcagtct gttcatttgt cagatattta aggctaaaag gaaatgttgt 2281 cgagactatc tttattgagt aactcgag // LOCUS YSCACO1A 3130 bp ds-DNA PLN 14-AUG-1990 DEFINITION S.cerevisiae aconitase (ACO1) gene, complete cds. ACCESSION M33131 KEYWORDS aconitase; isocitrate hydro-lyase. SOURCE S.cerevisiae (strain S288C) DNA, clone SG7-1. ORGANISM Saccharomyces cerevisiae Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes; Endomycetales; Saccharomycetaceae. REFERENCE 1 (bases 1 to 3130) AUTHORS Gangloff,S.P., Marguet,D. and Lauquin,G.J.-M. TITLE Molecular cloning of the yeast mitochondrial aconitase gene (ACO1) and evidence of a synergistic regulation of expression by glucose plus glutamate JOURNAL Mol. Cell. Biol. 
10, 3551-3561 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by G.J.-M.Lauquin 21-MAR-1990, for release after publication. FEATURES from to/span description pept 648 2987 aconitase (ACO1) (EC 4.2.1.3) binding 357 364 HAP2-HAP3 binding site signal 503 513 TATA box BASE COUNT 907 a 604 c 668 g 951 t ORIGIN 1 gaattccaaa ggctcacaaa tggcattccg tactgatata cttcgcactt tacatatgct 61 tatataaaaa tgccagtttc ccatcgtcaa ttacgcagag gtagccttcg tttattttct 121 ctctttttgt atatcattgt attaatcata atccattgta ttttactttg tcttatctgg 181 ctttaatgaa tgacgccggt cacacgcggg tgcctgattc tcgattgtgc caagccattt 241 gggcacggtg tcaaattacc taaaaaatgg ccgagagccg caaaagggag gtccgcgggg 301 ccgggcaata ccctttgttt ttcgagcatt tcggcgccga aatcggaaag gtcctgacca 361 atcaatagag aaattagtgc atacgagaaa aatttgaacc cttcagttgt tctccgcagg 421 cgactttaac catcaaacct ccaatcgctg ccggttttcg gaaaggcaag cacaaaaagg 481 gaggacaagg aaaatctttt gttatatata taatcttccg gttttagagg ttaattagga 541 ggagtatgtt gcttaattcc gttgtccctt gttctgttca ctctttctta gttattacat 601 agtagaacga aggtaataaa tactatcatt attactatag atatacaatg ctgtctgcac 661 gttctgccat caagagaccc attgttcgtg gtcttgcgac agtctccaac ttgactagag 721 attcaaaagt caaccaaaac ttattagaag atcattcttt tattaactac aagcagaatg 781 tggaaacgct ggatatcgta agaaaaagat taaacaggcc atttacctac gcggaaaaga 841 ttttgtacgg tcacttggat gaccctcatg gtcaagatat tcagagaggt gtttcatacc 901 taaaattaag accagatcgt gttgcctgtc aagatgctac tgctcaaatg gctattttac 961 aatttatgtc cgctggttta ccacaggttg ctaagccagt cactgtccac tgtgaccatt 1021 tgattcaagc acaagttggt ggtgaaaaag atttgaagag agctatagat ctaaacaagg 1081 aagtttatga tttcttggcc tctgccactg cgaaatataa catgggtttc tggaagccag 1141 gttccggtat cattcaccaa attgttctgg aaaactacgc tttcccaggt gctttgatca 1201 ttggtactga ctcccataca ccaaatgctg gtggtttagg tcaattggct attggtgttg 1261 gtggtgctga tgccgttgat gttatggcag gtcgtccatg ggaattgaag gctccaaaga 1321 tcttaggtgt taagttgact ggtaagatga acggttggac ttctccaaag gatattattt 1381 tgaaattggc 
tggtatcaca actgtcaaag gtggtactgg taaaattgtt gaatattttg 1441 gtgatggtgt tgacactttc tccgctactg gtatgggtac catttgtaat atgggtgctg 1501 aaatcggtgc taccacatct gttttcccat tcaacaaatc tatgattgaa tatttggaag 1561 caactggtcg tggtaagatc gctgactttg ctaaattata ccacaaggat ctattatctg 1621 ctgataagga tgctgaatac gatgaggtcg tcgaaattga cttgaacact ctggaaccat 1681 acatcaatgg gccatttacc cccgatttgg ctactccagt ttctaagatg aaggaagttg 1741 ctgttgctaa taactggcca ttggatgtca gagtcggttt gatcggttct tgtaccaatt 1801 cctcttatga agatatgtct cgttcagcat ccattgtcaa ggacgctgct gctcatggtt 1861 tgaaatccaa gaccattttc actgttactc caggttctga acaaatcaga gccactattg 1921 aacgtgatgg ccaattagaa accttcaaag aatttggtgg tatcgttttg gcaaacgcct 1981 gtggcccatg tattggtcaa tgggatcgta gagatatcaa gaaaggtgac aagaatacta 2041 ttgtttcctc ttacaacaga aatttcactt ctagaaatga tggtaaccca caaactcatg 2101 cttttgttgc atctccagaa ttagtaactg cgttcgccat tgcgggtgat ttgagattca 2161 accctctaac agacaaatta aaggacaagg atggtaatga gttcatgttg aaaccaccac 2221 atggtcgatg gtttgcctcg aaagaggtta tgatgctggt gagaacactt accaagctcc 2281 acctgcagac cgtagccacc gttgaagtta aagtttctcc aacttcagac cgtctacaac 2341 tgttgaaacc attcaaacct tgggatggta aggatgctaa agacatgcca atcttgatta 2401 aggccgtcgg taagacaact actgatcata tttctatggc tggtccatgg ttgaaataca 2461 gaggtcattt agaaaacatt tctaataact atatgattgg tgctattaat gctgaaaaca 2521 agaaggctaa ctgtgttaaa aatgtatata ctggtgaata caaaggtgtt ccagacactg 2581 ctagagatta cagagaccaa ggtatcaagt gggttgttat tggtgatgaa aactttggtg 2641 aaggttcctc tcgtgaacac gctgctttgg aaccaagatt cttgggcggt ttcgctatca 2701 tcacaaagtc tttcgctcgt atccatgaaa ctaacttgaa aaaacaaggt ctattgccat 2761 tgaacttcaa gaacccagct gactatgaca agatcaaccc tgatgacaga atcgatattc 2821 tgggtctagc tgaattggct ccaggtaagc ctgtaacaat gagagttcat ccaaagaatg 2881 gtaagccatg ggatgctgtg ttgacccata ctttcaacga tgagcaaatt gaatggttca 2941 aatatggttc tgccttaaat aaaattaagg ccgatgagaa gaaataatga aaacattgtt 3001 ataatctttt aaaggttatt atttattttg tcttctgtac acgtaccctt gtttatcttt 3061 tctgccttaa atttaatgac 
gttcggctgg agaagtcaag actatgaaat atatctcgta 3121 atttatgatc // LOCUS HIVTRA1 59 bp ss-RNA VRL 14-AUG-1990 DEFINITION Human immunodeficiency virus type 1 (HIV-1), gag region. ACCESSION M27956 KEYWORDS . SEGMENT 1 of 2 SOURCE Human immunodeficiency virus type 1 (isolate TRA), clone 01. ORGANISM Human immunodeficiency virus type 1 Viridae; ss-RNA enveloped viruses; Positive strand RNA virus; Retroviridae; Lentivirinae. REFERENCE 1 (bases 1 to 59) AUTHORS Goodenow,M., Huet,T., Saurin,W., Kwok,S., Sninsky,J. and Wain-Hobson,S. TITLE HIV-1 isolates are rapidly evolving quasispecies: Evidence for viral mixtures and preferred nucleotide substitutions JOURNAL J. Acquir. Immune Defic. Syndr. 2, 344-352 (1989) STANDARD full staff_review COMMENT [1] kindly submitted in computer-readable form by S. Wain-Hobson. Isolated in Paris. Sibling sequences are available (see 1989: I-A-169). FEATURES from to/span description pept < 1 > 59 gag protein (AA at 2) BASE COUNT 26 a 8 c 10 g 15 t ORIGIN 1 ctataaaaga tggataatcc tgggattaaa taaaatagta agaatgtata gccctacca // LOCUS HIVTRA2 268 bp ss-RNA VRL 14-AUG-1990 DEFINITION Human immunodeficiency virus type 1 (HIV-1), envelope region. ACCESSION M27948 KEYWORDS . SEGMENT 2 of 2 SOURCE Human immunodeficiency virus type 1 (isolate TRA), clone 01. ORGANISM Human immunodeficiency virus type 1 Viridae; ss-RNA enveloped viruses; Positive strand RNA virus; Retroviridae; Lentivirinae. REFERENCE 1 (bases 1 to 268) AUTHORS Goodenow,M., Huet,T., Saurin,W., Kwok,S., Sninsky,J. and Wain-Hobson,S. TITLE HIV-1 isolates are rapidly evolving quasispecies: Evidence for viral mixtures and preferred nucleotide substitutions JOURNAL J. Acquir. Immune Defic. Syndr. 2, 344-352 (1989) STANDARD full staff_review COMMENT Sibling sequences are available (see 1989: I-A-170).
FEATURES from to/span description pept < 1 > 268 env protein (AA at 1) BASE COUNT 95 a 46 c 54 g 73 t ORIGIN 1 caattaaccc cactctgtgt tactttaaat tgcactgatt atttggggaa tgctactaat 61 accactagta gtagcggggg agggatggag agaggagaaa taaaaaactg ctctttcaat 121 atcacgacaa gcataagaga taagatgcag agagaatatg cactttttta taaacttgat 181 gtagtaccaa cagataatga taatactagc tataggttga taagttgtaa cacctcagtc 241 attacacagg cctgtccaaa agtatcct // LOCUS FLAH1424 1749 bp ss-RNA VRL 14-AUG-1990 DEFINITION Influenza A/Mallard/Gurjev/263/82 hemagglutinin subtype H14 gene. ACCESSION M35997 KEYWORDS hemagglutinin. SOURCE Influenza A/H14/263/82 RNA. ORGANISM Influenza virus type A Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses; Orthomyxoviridae; Influenzavirus; Influenza A viruses. REFERENCE 1 (sites) AUTHORS Kawaoka,Y., Yamnikova,S., Chambers,T., Lvou,D. and Webster,R. TITLE Molecular characterization of a new hemagglutinin, subtype H14, of influenza A virus JOURNAL Unpublished (1990) STANDARD full staff_review REFERENCE 2 (bases 1 to 1749) AUTHORS Kawaoka,Y., Yamnikova,S., Chambers,T., Lvou,D. and Webster,R. JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable [or printed] sequence for [1] kindly submitted by Y.Kawaoka, 27-JUN-1990. Author address: Y.Kawaoka, St. Jude Children's Research Hospital, 332 N. Lauderdale, P.O.
Box 318, Memphis, TN 38101 BASE COUNT 575 a 360 c 412 g 402 t ORIGIN 1 agcaaaagca ggggaaaatg attgcactca tattggttgc actggctctg agccacactg 61 cttattctca gatcacaaat gggacaacag gaaaccccat tatatgcttg gggcatcatg 121 cagtggaaaa cggcacatct gttaaaacac taacagacaa tcacgtagaa gttgtgtcag 181 ctaaagaatt agttgagacg aaccacactg atgaactgtg cccaagcccc ttgaagcttg 241 tcgacgggca agactgccac ctcatcaatg gtgcattggg gagtccaggc tgtgaccgtt 301 tgcaggacac cacttgggat gtcttcattg aaaggcccac tgcagtagac acatgttatc 361 cattcgacgt cccagattac cagagtctca gaagcatcct agcaagcagt gggagtttgg 421 agttcatcgc cgaacaattc acctggaatg gtgtcaaagt tgacggatca agcagtgctt 481 gtttgagggg cggtcgcaac agcttcttct cccgactaaa ctggctaacc aaagcaacaa 541 atggaaacta tggacctatt aacgtcacta aagaaaatac gggctcttat gtcaggctct 601 atctctgggg agtgcatcac ccatcaagcg ataatgagca aacggatctc tacaaggtgg 661 caacagggag agtaacagta tctacccgct cggaccaaat cagtattgtt cccaatatag 721 gaagtagacc gagggtaagg aatcagagcg gcaggataag catctactgg accctagtaa 781 acccagggga ctccatcatt ttcaacagta ttgggaattt gattgcacca agaggccact 841 acaaaataag caaatctact aagagcacag tgcttaaaag tgacaaaagg attgggtcat 901 gcacaagccc ttgcttaact gataaaggtt cgatccaaag tgacaaacct tttcagaatg 961 tatcaaggat tgctatagga aactgcccga aatatgtaaa gcaagggtcc ctgatgttag 1021 caactggaat gcgcaacatc cctggcaaac aggcaaaggg cttatttggg gcaattgctg 1081 gattcattga aaatggttgg caaggcctga ttgatgggtg gtatggattc aggcaccaaa 1141 atgctgaagg aacaggaact gctgcagacc tgaagtcaac tcaggcagcc attgatcaga 1201 taaatggcaa gctgaacaga ttgatagaga agacaaatga aaaatatcac caaatagaaa 1261 aggaattcga acaggtggaa ggaagaatac aagaccttga gaagtacgtt gaggacacta 1321 agattgattt gtggtcatac aatgctgaat tgctagtagc actagagaat cagcacacaa 1381 tagatgtcac agactccgaa atgaacaagc tttttgaaag agtaagaagg caattaagag 1441 agaatgcaga agatcaaggc aacggttgtt tcgagatatt ccatcagtgt gacaacaatt 1501 gtatagaaag cattagaaac ggaacttatg accacaacat ctacagggat gaagccatca 1561 acaatcgaat caaaataaat cctgtcactt tgacgatggg gtacaaggac ataatcctgt 1621 ggatttcttt ctccatgtca 
tgctttgtct tcgtggcact gattctggga tttgttctat 1681 gggcttgtca aaacgggaat atccgatgcc aaatctgtat ataaagaaaa aacacccttg 1741 tttctactc // LOCUS FLAH14244 1716 bp ss-RNA VRL 14-AUG-1990 DEFINITION Influenza A/Mallard/Gurjev/244/82 hemagglutinin subtype H14 gene. ACCESSION M35996 KEYWORDS hemagglutinin. SOURCE Influenza A/H14/244/82 RNA. ORGANISM Influenza virus type A Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses; Orthomyxoviridae; Influenzavirus; Influenza A viruses. REFERENCE 1 (sites) AUTHORS Kawaoka,Y., Yamnikova,S., Chambers,T., Lvou,D. and Webster,R. TITLE Molecular characterization of a new hemagglutinin, subtype H14, of influenza A virus JOURNAL Unpublished (1990) STANDARD full staff_review REFERENCE 2 (bases 1 to 1716) AUTHORS Kawaoka,Y., Yamnikova,S., Chambers,T., Lvou,D. and Webster,R. JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable [or printed] sequence for [1] kindly submitted by Y.Kawaoka, 27-JUN-1990. Author address: Y.Kawaoka, St. Jude Children's Research Hospital, 332 N. Lauderdale, P.O.
Box 318, Memphis, TN 38101 BASE COUNT 562 a 346 c 405 g 403 t ORIGIN 1 ttggttgcac tggctctgag ccaaactgct tattctcaga tcacaaatgg gacaacagga 61 aaccccatta tatgcttggg gcaccatgca gtggaaaacg gcacatctgt taaaacacta 121 acagacaatc acgtggaagt tgtgtcagct aaagaattag ttgagacgaa acacactgat 181 gaactgtgcc caagcccctt gaagcttgtc gacgggcaag actgcgacct catcaatggt 241 gcattgggga gtccaggctg tgatcgtttg caggacacca cttgggatgt cttcattgaa 301 aggcccactg cagtagacac atgttatcca ttcgacgtcc cagattacca gagtctcaga 361 agcattctag caagcagtgg gagtctggag ttcatcgccg aacaattcac ctggaatggt 421 gtcaaagttg acggatcaag cagtgcttgt ttgaggggcg gtcgcaacag cttcttctcc 481 cggctaaact ggctaaccaa agcaacaaat ggaaactatg gacctattaa cgtcactaaa 541 gaaaatacgg gctcttatgt caggctctat ctctggggag tgcatcaccc atcaagcgat 601 aatgagcaaa cggatctcta caaggtggca acagggagag tgacagtatc tacccgctcg 661 gatcaaatca gtattgttcc caatatagga agtagaccga gagtaaggaa tcagagcggc 721 aggataagca tctactggac cctagtaaac ccaggggact ccatcatttt caacagtatt 781 gggaatttga ttgcaccaag aggccattac aaaataagta aatctactaa gagcacagtg 841 cttaaaagtg acaaaaggat tgggtcatgc acaagccctt gcttaactga taaaggttcg 901 atccaaagtg acaaaccttt tcagaatgta tcaaggattg ctataggaaa ctgcccgaag 961 tatgtaaagc aagggtcctt gatgttagca actggaatgc gcaacatccc tggcaaacag 1021 gcaaagggct tatttggggc aattgctgga ttcattgaaa atggttggca aggcctgatt 1081 gatgggtggt atggattcag gcaccaaaat gctgaaggaa caggaactgc tgcagacctg 1141 aagtcaactc aggcagccat tgatcagata aatggcaagc tgaacagatt aatagaaaaa 1201 acaaatgaaa aatatcacca aatagaaaag gaattcgaac aggtggaagg aagaatacaa 1261 gaccttgaga agtacgttga ggacactaag attgatttgt ggtcatacaa tgctgaattg 1321 ctagtcgcac tagagaatca gcacacaata gatgtcacag actccgaaat gaacaagctt 1381 tttgaaagag taagaaggca attaagagag aatgcagaag atcaaggcaa cggttgtttc 1441 gagatattcc atcagtgtga caacaattgt atagaaagta ttagaaacgg aacttatgac 1501 cacaacatct acagggatga agccatcaac aatcgaatca aaataaatcc tgtcactttg 1561 acgatggggt acaaggacat aatcctgtgg atttctttct ccatgtcatg ctttgtcttc 1621 gtggcactga ttttgggatt 
tgttctatgg gcttgtcaaa acgggaatat ccgatgccaa 1681 atctgtatat aaagaaaaaa cacccttgtt tctact // LOCUS DOGCPLA01 458 bp ds-DNA MAM 14-AUG-1990 DEFINITION Canine pancreatic lipase gene, exon 1 (non-coding). ACCESSION M28140 J04961 KEYWORDS pancreatic lipase. SEGMENT 1 of 12 SOURCE Canine spleen DNA, clones L[13,20,4]. ORGANISM Canis familiaris Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Carnivora; Caniformia; Canoidea; Canidae. REFERENCE 1 (bases 1 to 458) AUTHORS Mickel,F.S., Weidenbach,F., Swarovsky,B., LaForge,K.S. and Scheele,G.A. TITLE Structure of the canine pancreatic lipase gene JOURNAL J. Biol. Chem. 264, 12895-12901 (1989) STANDARD simple staff_review FEATURES from to/span description pre-msg 386 > 458 pancreatic lipase mRNA and intron IVS 419 > 458 pancreatic lipase intron A signal 274 277 CAAT box signal 351 354 TATA box BASE COUNT 133 a 94 c 99 g 132 t ORIGIN 1 ttggagggca tttccaccta ttctaagtga gaaatactac cctccattat tggattttgc 61 tgctgtgttc ttactggaac ttgtaaacta tttgatgata acatagaaaa tgggaaagag 121 gggccgcatt ggctcagaat tacatgcaca ggtacatcgg tacaaaggta gggatctttt 181 acctcctttt aatcaagcat tctttcatct tcccagactg tgggaagatc tgactactat 241 aattagtctt gttttgtgta tttgagcaga ttacaatatt aagtgcacac agcgtgttct 301 atcatcacct gctgtttcct tgaagcggcc tccaaatgat aaggctccag tataaaaaaa 361 aaaaaaaagg cctgggtact tgattgaaag ggagggaagg cccagaacat cccacagggt 421 aagtcacctc ttctgtcctc ccaccagggt tgaagcag // LOCUS DOGCPLA02 129 bp ds-DNA MAM 14-AUG-1990 DEFINITION Canine pancreatic lipase gene, exon 2. ACCESSION M28141 J04961 KEYWORDS pancreatic lipase. SEGMENT 2 of 12 SOURCE Canine spleen DNA, clones L[13,20,4]. ORGANISM Canis familiaris Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Carnivora; Caniformia; Canoidea; Canidae. REFERENCE 1 (bases 1 to 129) AUTHORS Mickel,F.S., Weidenbach,F., Swarovsky,B., LaForge,K.S. and Scheele,G.A. TITLE Structure of the canine pancreatic lipase gene JOURNAL J. Biol. Chem. 
264, 12895-12901 (1989) STANDARD simple staff_review FEATURES from to/span description pept 41 + 89 pancreatic lipase precursor, exon 2 (EC 3.1.1.3) (first expressed exon) sigp 41 + 89 pancreatic lipase signal peptide IVS < 1 40 pancreatic lipase intron A IVS 90 > 129 pancreatic lipase intron B BASE COUNT 36 a 26 c 37 g 30 t ORIGIN About 38 bp after segment 1. 1 aataatcggt gagactgaga atgttccgaa tttattgtag atggtgagca tctggacaat 61 cgcgctgttt ctgctgggag cagccaaagg taggaaacgt cacaccggcc cctcgggaag 121 gacttaaat // LOCUS DOGCPLA03 235 bp ds-DNA MAM 14-AUG-1990 DEFINITION Canine pancreatic lipase gene, exon 3. ACCESSION M28142 J04961 KEYWORDS pancreatic lipase. SEGMENT 3 of 12 SOURCE Canine spleen DNA, clones L[13,20,4]. ORGANISM Canis familiaris Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Carnivora; Caniformia; Canoidea; Canidae. REFERENCE 1 (bases 1 to 235) AUTHORS Mickel,F.S., Weidenbach,F., Swarovsky,B., LaForge,K.S. and Scheele,G.A. TITLE Structure of the canine pancreatic lipase gene JOURNAL J. Biol. Chem. 264, 12895-12901 (1989) STANDARD simple staff_review FEATURES from to/span description pept + 41 + 195 pancreatic lipase precursor, exon 3 sigp + 41 42 pancreatic lipase signal peptide matp 43 + 195 pancreatic lipase IVS < 1 40 pancreatic lipase intron B IVS 196 > 235 pancreatic lipase intron C BASE COUNT 59 a 66 c 59 g 51 t ORIGIN About 506 bp after segment 2. 1 ggggagctaa gccgtgagca gagtcatcct tcctctacag caaaggaagt ttgctatgaa 61 caaatcggat gcttttctga cgcggagccc tgggccggga cagcaatcag gcctctgaaa 121 gttctcccct ggagccccga gagaatcggc acccgcttcc tgctctacac caacaagaac 181 ccaaacaact ttcaagtgag acctctgtca ttttagtgcc attgtgactg gagtg // LOCUS DOGCPLA04 206 bp ds-DNA MAM 14-AUG-1990 DEFINITION Canine pancreatic lipase gene, exon 4. ACCESSION M28143 J04961 KEYWORDS pancreatic lipase. SEGMENT 4 of 12 SOURCE Canine spleen DNA, clones L[13,20,4]. 
ORGANISM Canis familiaris Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Carnivora; Caniformia; Canoidea; Canidae. REFERENCE 1 (bases 1 to 206) AUTHORS Mickel,F.S., Weidenbach,F., Swarovsky,B., LaForge,K.S. and Scheele,G.A. TITLE Structure of the canine pancreatic lipase gene JOURNAL J. Biol. Chem. 264, 12895-12901 (1989) STANDARD simple staff_review FEATURES from to/span description pept + 41 + 166 pancreatic lipase precursor, exon 4 matp + 41 + 166 pancreatic lipase IVS < 1 40 pancreatic lipase intron C IVS 167 > 206 pancreatic lipase intron D BASE COUNT 55 a 60 c 44 g 47 t ORIGIN About 445 bp after segment 3. 1 ggctacaaaa taatgcgacc ctttcccact tttatcacag actctccttc cctctgatcc 61 atcaacgatt gaggcatcaa attttcaaac agacaagaag acccggttca tcatccatgg 121 cttcatagac aagggagaag agaactggct gctggatatg tgcaaggtag gggccggctc 181 ccaggctcac agtctgccct gctccc // LOCUS DOGCPLA05 215 bp ds-DNA MAM 14-AUG-1990 DEFINITION Canine pancreatic lipase gene, exon 5. ACCESSION M28144 J04961 KEYWORDS pancreatic lipase. SEGMENT 5 of 12 SOURCE Canine spleen DNA, clones L[13,20,4]. ORGANISM Canis familiaris Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Carnivora; Caniformia; Canoidea; Canidae. REFERENCE 1 (bases 1 to 215) AUTHORS Mickel,F.S., Weidenbach,F., Swarovsky,B., LaForge,K.S. and Scheele,G.A. TITLE Structure of the canine pancreatic lipase gene JOURNAL J. Biol. Chem. 264, 12895-12901 (1989) STANDARD simple staff_review FEATURES from to/span description pept + 41 + 175 pancreatic lipase precursor, exon 5 matp + 41 + 175 pancreatic lipase IVS < 1 40 pancreatic lipase intron D IVS 176 > 215 pancreatic lipase intron E BASE COUNT 44 a 58 c 67 g 46 t ORIGIN About 2090 bp after segment 4. 
1 gtgcccctgt catgacatcc tggtgtctct ccaacatcag aacatgttca aggttgagga 61 ggtgaactgc atctgcgtgg actggaagaa aggttcccaa acctcatata cacaggctgc 121 caacaacgtg cgggtggtgg gcgcccaggt ggcccagatg ctcagcatgc tctcggtgag 181 tctgccggct gggctcctgc tgaggtaggc agtgc // LOCUS DOGCPLA06 189 bp ds-DNA MAM 14-AUG-1990 DEFINITION Canine pancreatic lipase gene, exon 6. ACCESSION M28145 J04961 KEYWORDS pancreatic lipase. SEGMENT 6 of 12 SOURCE Canine spleen DNA, clones L[13,20,4]. ORGANISM Canis familiaris Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Carnivora; Caniformia; Canoidea; Canidae. REFERENCE 1 (bases 1 to 189) AUTHORS Mickel,F.S., Weidenbach,F., Swarovsky,B., LaForge,K.S. and Scheele,G.A. TITLE Structure of the canine pancreatic lipase gene JOURNAL J. Biol. Chem. 264, 12895-12901 (1989) STANDARD simple staff_review FEATURES from to/span description pept + 41 + 149 pancreatic lipase precursor, exon 6 matp + 41 + 149 pancreatic lipase IVS < 1 40 pancreatic lipase intron E IVS 150 > 189 pancreatic lipase intron F BASE COUNT 49 a 55 c 49 g 36 t ORIGIN About 957 bp after segment 5. 1 accacaaaaa tgttctgaaa atatgatctt ccttctccag gcaaactaca gctactcacc 61 ttcccaagtc cagctcatcg gccacagcct gggagcccac gtggctgggg aggcagggag 121 caggactcca ggtctgggca ggattacagg taagacccaa ggggcaggct ccagattttg 181 tccccagta // LOCUS DOGCPLA07 380 bp ds-DNA MAM 14-AUG-1990 DEFINITION Canine pancreatic lipase gene, exon 7. ACCESSION M28146 J04961 KEYWORDS pancreatic lipase. SEGMENT 7 of 12 SOURCE Canine spleen DNA, clones L[13,20,4]. ORGANISM Canis familiaris Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Carnivora; Caniformia; Canoidea; Canidae. REFERENCE 1 (bases 1 to 380) AUTHORS Mickel,F.S., Weidenbach,F., Swarovsky,B., LaForge,K.S. and Scheele,G.A. TITLE Structure of the canine pancreatic lipase gene JOURNAL J. Biol. Chem.
264, 12895-12901 (1989) STANDARD simple staff_review FEATURES from to/span description pept + 1 + 120 pancreatic lipase precursor, exon 7 matp + 1 + 120 pancreatic lipase IVS 121 > 380 pancreatic lipase intron G BASE COUNT 99 a 89 c 101 g 91 t ORIGIN About 1340 bp after segment 6. 1 ggttggatcc tgtagaagca agtttccagg gcactcctga agaggttcga cttgatccca 61 ctgatgctga ctttgttgat gtgattcaca cagatgcagc tcccctgatc ccattcctgg 121 gtgagcccaa tgacgcttta gttgcaaaca cgtgcaataa tgtttgaacc aaaaagagcc 181 ctgtgtggtg aaaagctcat tgctgctcca aacatttcag gttttggaac aagccaacag 241 atgggtcacc ttgacttctt ccccaatggt ggagaggaaa tgccaggatg caagaagaat 301 gccctgtcac agatcgtgga cctagatggc atttgggaag gtaaagccca agcagaacgg 361 ggcgtcactt ccctggggtg // LOCUS DOGCPLA08 198 bp ds-DNA MAM 14-AUG-1990 DEFINITION Canine pancreatic lipase gene, exon 8. ACCESSION M28147 J04961 KEYWORDS pancreatic lipase. SEGMENT 8 of 12 SOURCE Canine spleen DNA, clones L[13,20,4]. ORGANISM Canis familiaris Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Carnivora; Caniformia; Canoidea; Canidae. REFERENCE 1 (bases 1 to 198) AUTHORS Mickel,F.S., Weidenbach,F., Swarovsky,B., LaForge,K.S. and Scheele,G.A. TITLE Structure of the canine pancreatic lipase gene JOURNAL J. Biol. Chem. 264, 12895-12901 (1989) STANDARD simple staff_review FEATURES from to/span description pept + 41 + 159 pancreatic lipase precursor, exon 8 matp + 41 + 159 pancreatic lipase IVS < 1 40 pancreatic lipase intron G IVS 160 > 198 pancreatic lipase intron H BASE COUNT 38 a 61 c 40 g 59 t ORIGIN About 1366 bp after segment 7. 1 ccccaaccac ccgtgcagac ttccttgttt ctttctccag gaactcggga cttcgtggct 61 tgcaatcacc tgagaagtta caagtattac tcagagagca tcctcaaccc tgatggattt 121 gcttcctacc cctgtgcttc ctacagggcc tttgaatctg taagctatcg tccttccttg 181 agtccctcgt ggacggtt // LOCUS DOGCPLA09 211 bp ds-DNA MAM 14-AUG-1990 DEFINITION Canine pancreatic lipase gene, exon 9. ACCESSION M28148 J04961 KEYWORDS pancreatic lipase. 
SEGMENT 9 of 12 SOURCE Canine spleen DNA, clones L[13,20,4]. ORGANISM Canis familiaris Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Carnivora; Caniformia; Canoidea; Canidae. REFERENCE 1 (bases 1 to 211) AUTHORS Mickel,F.S., Weidenbach,F., Swarovsky,B., LaForge,K.S. and Scheele,G.A. TITLE Structure of the canine pancreatic lipase gene JOURNAL J. Biol. Chem. 264, 12895-12901 (1989) STANDARD simple staff_review FEATURES from to/span description pept + 42 + 171 pancreatic lipase precursor, exon 9 matp + 42 + 171 pancreatic lipase IVS < 1 41 pancreatic lipase intron H IVS 172 > 211 pancreatic lipase intron I BASE COUNT 55 a 56 c 41 g 59 t ORIGIN About 675 bp after segment 8. 1 gatttacatc ttacagctgg ctctctctct tttctcctta gaacaagtgc ttcccctgcc 61 cagatcaagg gtgcccacag atgggtcact atgctgataa atttgctgtc aagacaagtg 121 atgagacaca gaaatacttc ctgaacaccg gagattccag caattttgct cgtaagtttt 181 accttgaccc tgctcaccca tgtcaaggag a // LOCUS DOGCPLA10 189 bp ds-DNA MAM 14-AUG-1990 DEFINITION Canine pancreatic lipase gene, exon 10. ACCESSION M28149 J04961 KEYWORDS pancreatic lipase. SEGMENT 10 of 12 SOURCE Canine spleen DNA, clones L[13,20,4]. ORGANISM Canis familiaris Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Carnivora; Caniformia; Canoidea; Canidae. REFERENCE 1 (bases 1 to 189) AUTHORS Mickel,F.S., Weidenbach,F., Swarovsky,B., LaForge,K.S. and Scheele,G.A. TITLE Structure of the canine pancreatic lipase gene JOURNAL J. Biol. Chem. 264, 12895-12901 (1989) STANDARD simple staff_review FEATURES from to/span description pept + 41 + 149 pancreatic lipase precursor, exon 10 matp + 41 + 149 pancreatic lipase IVS < 1 40 pancreatic lipase intron I IVS 150 > 189 pancreatic lipase intron J BASE COUNT 47 a 38 c 43 g 61 t ORIGIN About 1871 bp after segment 9. 
1 gtgcatgagt caattgtact cttttctatc ccctgtgcag gctggagata cggggtttct 61 ataacattgt ctgggaaaag agccactggt caggctaaag ttgctttgtt tggaagtaag 121 ggaaatactc atcaattcaa tatcttcaag taagtttcct ctttcagcgt cacgtctcac 181 tggatgcta // LOCUS DOGCPLA11 248 bp ds-DNA MAM 14-AUG-1990 DEFINITION Canine pancreatic lipase gene, exon 11. ACCESSION M28150 J04961 KEYWORDS pancreatic lipase. SEGMENT 11 of 12 SOURCE Canine spleen DNA, clones L[13,20,4]. ORGANISM Canis familiaris Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Carnivora; Caniformia; Canoidea; Canidae. REFERENCE 1 (bases 1 to 248) AUTHORS Mickel,F.S., Weidenbach,F., Swarovsky,B., LaForge,K.S. and Scheele,G.A. TITLE Structure of the canine pancreatic lipase gene JOURNAL J. Biol. Chem. 264, 12895-12901 (1989) STANDARD simple staff_review FEATURES from to/span description pept + 41 + 208 pancreatic lipase precursor, exon 11 matp + 41 + 208 pancreatic lipase IVS < 1 40 pancreatic lipase intron J IVS 209 > 248 pancreatic lipase intron K BASE COUNT 72 a 42 c 59 g 75 t ORIGIN About 756 bp after segment 10. 1 gtttctatgc atgagaatgt ttctatctct ttatttcaag ggggattctc aaaccaggct 61 ctactcattc caatgagttt gatgcaaagc ttgatgttgg aacaattgag aaagtcaagt 121 ttctttggaa taacaacgtg gtaaacccaa cctttcccaa agtgggtgca gccaagatca 181 ccgtgcaaaa gggagaggag aaaacagtgt atgtatatct ttgctggatt gtgtctaagg 241 atgttggt // LOCUS DOGCPLA12 184 bp ds-DNA MAM 14-AUG-1990 DEFINITION Canine pancreatic lipase gene, exon 12. ACCESSION M28151 J04961 KEYWORDS pancreatic lipase. SEGMENT 12 of 12 SOURCE Canine spleen DNA, clones L[13,20,4]. ORGANISM Canis familiaris Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Carnivora; Caniformia; Canoidea; Canidae. REFERENCE 1 (bases 1 to 184) AUTHORS Mickel,F.S., Weidenbach,F., Swarovsky,B., LaForge,K.S. and Scheele,G.A. TITLE Structure of the canine pancreatic lipase gene JOURNAL J. Biol. Chem.
264, 12895-12901 (1989) STANDARD simple staff_review FEATURES from to/span description pept + 37 100 pancreatic lipase precursor, exon 12 matp + 37 97 pancreatic lipase pre-msg < 1 156 pancreatic lipase mRNA and intron IVS < 1 36 pancreatic lipase intron K signal 135 140 poly-A signal BASE COUNT 44 a 60 c 39 g 41 t ORIGIN About 3005 bp after segment 11. 1 tcacaacact tacaaagctc tcattttggt ctgcaggcac agcttctgca gcgaaagcac 61 cgtgagggag gatgttctgc tcaccctcac gccctgttaa cgtccaggcg caacccgggc 121 accgtgttaa cagcaataaa aaccactgat gcatttaccc actcccacgt ctgtcgctgc 181 tgtc // LOCUS MZECPSECX 395 bp ds-DNA ORG 14-AUG-1990 DEFINITION Maize chloroplast 50S ribosomal protein (secX) gene, complete cds. ACCESSION M35956 KEYWORDS 50S ribosomal protein. SOURCE Maize chloroplast DNA. ORGANISM Chloroplast Zea mays Eukaryota; Plantae; Embryobionta; Magnoliophyta; Liliopsida; Commelinidae; Cyperales; Poaceae; Zea mexicana. REFERENCE 1 (bases 1 to 395) AUTHORS Markmann-Mulisch,U., von Knoblauch,K., Lehmann,A. and Subramanian,A.R. TITLE Nucleotide sequence and linkage map position of the secX gene in maize chloroplast and evidence that it encodes a protein belonging to the 50S ribosomal subunit JOURNAL Biochem. Int. 15, 1057-1067 (1987) STANDARD simple staff_review FEATURES from to/span description pept 165 278 50S ribosomal protein (secX) binding 151 154 secX ribosomal binding site (put.) BASE COUNT 153 a 55 c 74 g 113 t ORIGIN 1 gaagcgtacc gaagactcga aggatactga agatttgaag gataccaaag attcaaagga 61 ttaggttttt ctggggtaat aacttccaag tttcaaaatt taagtgaaga gacttatttt 121 ttccaaaaga atagattcat agtttaagaa aggaataccc atatatgaaa ataagagctt 181 ccgttcgtaa aatttgtaca aaatgtcgac tgattcgcag gcgtgggcga attagagtta 241 tttgttccaa tccgaagcat aaacaaagac aggggtaatc tttcgaaaaa gaagcttttc 301 tttctaatag cttttctttc taattaaagt aaaaagtacc cacggaaatg cccaaattgc 361 aaataaaaaa aatgaaagta aaggatatat tttaa // LOCUS ATHTGYA 210 bp ds-DNA PLN 14-AUG-1990 DEFINITION A.thaliana tRNA-Tyr gene. 
ACCESSION M35957 KEYWORDS transfer RNA-Tyr. SOURCE A.thaliana leaf DNA, clone pAtT1. ORGANISM Arabidopsis thaliana Eukaryota; Plantae; Embryobionta; Magnoliophyta; Magnoliopsida; Dilleniidae; Capparales; Brassicaceae. REFERENCE 1 (bases 1 to 210) AUTHORS Stange,N., Gross,H.J. and Beier,H. TITLE Wheat germ splicing endonuclease is highly specific for plant pre-tRNAs JOURNAL EMBO J. 7, 3823-3828 (1988) STANDARD simple staff_review FEATURES from to/span description tRNA 71 107 Tyr-tRNA, exon 1 120 155 Tyr-tRNA, exon 2 RNA 71 181 precursor Tyr-tRNA (+/- 5 bp) IVS 108 119 Tyr-tRNA intron (no splice consensus) BASE COUNT 59 a 37 c 44 g 70 t ORIGIN 1 ctaacttaat acaaatcatt gtgttcatag aatatttaga tcagtacaca tgcatgaaat 61 agaatacaat ccgaccttag ctcagttggt agagcggagg actgtagtag acgcagatta 121 tccttaggtc actggttcga atccggtagg tcggaatttg ctcccacatg agagcttttt 181 atttttcttt cgttgtgaca ttaaggtttt // LOCUS ATHTGYB 210 bp ds-DNA PLN 14-AUG-1990 DEFINITION A.thaliana tRNA-Tyr gene. ACCESSION M35958 KEYWORDS transfer RNA-Tyr. SOURCE A.thaliana leaf DNA, clone pAtT3. ORGANISM Arabidopsis thaliana Eukaryota; Plantae; Embryobionta; Magnoliophyta; Magnoliopsida; Dilleniidae; Capparales; Brassicaceae. REFERENCE 1 (bases 1 to 210) AUTHORS Stange,N., Gross,H.J. and Beier,H. TITLE Wheat germ splicing endonuclease is highly specific for plant pre-tRNAs JOURNAL EMBO J. 7, 3823-3828 (1988) STANDARD simple staff_review FEATURES from to/span description tRNA 71 107 Tyr-tRNA, exon 1 120 155 Tyr-tRNA, exon 2 RNA 71 178 precursor Tyr-tRNA (+/- 5 bp) IVS 108 119 Tyr-tRNA intron (no splice consensus) BASE COUNT 66 a 34 c 41 g 69 t ORIGIN 1 tcataagttt aaatttgatt aaaaagcttc tcttgaatca tcaacatgca taaagtgtta 61 taatacaaaa ccgaccttag ctcagttggt agagcggagg actgtagttg acgcagataa 121 tccttaggtc actggttcga atccggtagg tcggatcatt aaaattgaaa gttttttttt 181 tccatttcgt ttcaaagtgc ttagacaaga // LOCUS STMCHOPA 3104 bp ds-DNA BCT 14-AUG-1990 DEFINITION Streptomyces sp. 
cytochrome-P450-like protein (choP) gene and cholesterol oxidase (choA) gene, complete cds. ACCESSION M31939 J03356 KEYWORDS cholesterol oxidase; cytochrome-P450-like protein. SOURCE Streptomyces sp. (strain SA-COO) DNA, clone pCO-[1,3]. ORGANISM Streptomyces sp. Prokaryota; Bacteria; Firmicutes; Streptomycetaceae. REFERENCE 1 (bases 963 to 3104) AUTHORS Ishizaki,T., Hirayama,N., Shinkawa,H., Nimi,O. and Murooka,Y. TITLE Nucleotide sequence of the gene for cholesterol oxidase from a Streptomyces sp JOURNAL J. Bacteriol. 171, 596-601 (1989) STANDARD full staff_review REFERENCE 2 (bases 1 to 1450) AUTHORS Horii,M., Ishizaki,T., Paik,S.-Y., Manome,T. and Murooka,Y. TITLE An operon containing the gene for cholesterol oxidase and a cytochrome P-450-like protein from Streptomyces sp JOURNAL J. Bacteriol. 172, 3644-3653 (1990) STANDARD full staff_review COMMENT Draft entry and clean copy of sequence for [1] kindly provided by Y.Murooka, 01-DEC-1989. Draft entry and printed sequence for [2] kindly submitted by Y.Murooka, 02-FEB-1990, for release after publication. FEATURES from to/span description mRNA 100 > 1283 choP mRNA pept 141 1286 cytochrome-P450-like protein (choP) pept 1361 3001 cholesterol oxidase precursor (choA, EC 1.1.3.6) sigp 1361 1486 cholesterol oxidase signal peptide matp 1487 2998 cholesterol oxidase mRNA 1384 > 3104 choA mRNA signal 65 70 -35 region signal 89 94 -10 region binding 129 134 choP ribosomal binding site (put.) binding 1340 1345 choA ribosomal binding site (put.) 
BASE COUNT 518 a 1234 c 932 g 420 t ORIGIN 1 ggtacccccg gtaaccgaac cccgtgcccg tacccgcagg tagcccgccg ggggaacgat 61 ctcgttgaca gccttcacat cgcctccata cggtcatttc gacggcgccg aacatgccga 121 tcactcgtga aagggcatac atgacccagg ctgcgccggt gaccttctcg acagtccgtg 181 agaattactt cggtccgccg gcggagatgc aggcactgcg gcacaaggca cccgtgacgc 241 gcacggcctt cgccgacggc cgccccggct ggctcgtcac cggatattcc gccgcgcgtg 301 cggtgttatc cgattcccgt ttcaccgccc gcggagaacg cgagcacccc gccgtgccgc 361 gcgccgcgac cctggaggac gagcgctgcc gcaggctgat cgccggacag ttcaccgccc 421 gccggatgcg gcagctcacc gggcggaccg agcggatcgt ccgcgaacac ctcgacgcca 481 tggagcacat gggctccccg gccgacctcg tcgagcactt cgccctcccc gtcccctcgc 541 tcgtcatcgc cgaactgctc ggcgtgcccc cccccgaccg ggagcacttc cagcacgaca 601 ccctccggtg gggcggcttc ggccggtcca ccgaggaggt caccgaggcc ttcgtctccc 661 tcggcggcca actgcagcgc ctcgtgcggc tcaagcgcac cgagccgggc gacgacctcc 721 tcagcggcct gatcgccgcc gaccccgcgc tcaccgacga ggaactcgcc tcgatcgcct 781 tcctgttgct ggtcgccggc cacgggacca ccgcccacca gatcgccctc ggcgccttcc 841 tgctgctcga acaccccgac cagctggccg cgctgcgcgc cgaccccgcc ctgacggaga 901 gcgcggtcga ggagctgctg cgccacctga gcgtcgtcca ccacggcccc acccgtgccg 961 cgctgcagga cgccgacatc gagggcaccc ccgtcaaggc gggcgaggtc gtcgtggtct 1021 cgctcggcgc cgccaaccgc gaccccgcac gcttcgagcg gcccgacgcc gtggatgtca 1081 cccgcgagga caccggccac ctcgccttcg ggcacggcat gcaccagtgc ctcggccggc 1141 aactggcccg catcgaactg cgcgtcgccc tcaccgccct cctcgaacgc ttcccgcacc 1201 tgcgcctggc ctgccccgcg gcggagatcc cgctccgcca cgacatgcag gtctacggcg 1261 ccgaccggct gccggtcgcc tggtgaccgt ccccgtcacc cccccgaacc accttcccca 1321 cccctgaaga aacgtctcgg aaaggtcagg tagaccccgc gtgactgcac aacagcacct 1381 gtcccgccgc cgcatgctcg gcatggccgc cttcggcgcc gccgccctcg ccgggggcac 1441 caccatcgcc gccccccgtg cggccgccgc cgccaagtcc gcggcggaca acggcggtta 1501 cgtccccgcc gtcgtcatcg gcaccggcta cggcgcggcc gtctccgcgc tgcgcctcgg 1561 cgaggcgggt gtgcagaccc tgatgctgga gatgggccag ctgtggaacc agcccggccc 1621 cgacggcaac atcttctgcg gcatgctcaa cccggacaag cggtccagct 
ggttcaagaa 1681 ccgcaccgag gccccgctcg gcagcttcct ctggctcgac gtcgtcaacc ggaacatcga 1741 cccctacgcg ggtgtcctgg accgtgtgaa ctacgaccag atgtcggtct acgtgggccg 1801 cggcgtcggc ggcggctcgc tcgtcaacgg cggcatggcc gtggagccca agcgctcgta 1861 cttcgaggag atcctcccgc gggtcgactc ctccgagatg tacgaccgct acttcccccg 1921 cgccaactcc atgctccgcg tcaaccacat cgacaccaag tggttcgagg acaccgagtg 1981 gtacaagttc gcccgcgtct cgcgcgagca ggcgggcaag gccggtctcg gcaccgtctt 2041 cgtccccaac gtctacgact tcggctacat gcagcgcgag gccgcgggcg aggtgcccaa 2101 gtccgccctg gcgaccgagg tcatctacgg caacaaccac ggcaagcaga gcctggacaa 2161 gacctacctg gccgccgcac tcggcaccgg caaggtcacc atccagaccc tgcaccaggt 2221 caagacgatc cgtcagacga aggacggcgg ctacgcgctg accgtcgagc agaaggacac 2281 cgacggcaag ctcctggcca ccaaggagat ctcctgccgc tacctgttcc tcggcgcggg 2341 cagcctcggc tccaccgaac tgctggtgcg cgcccgcgac accggcaccc tgccgaacct 2401 caactccgag gtgggcgcgg gctggggccc caacggcaac atcatgaccg cccgggccaa 2461 ccacatgtgg aaccccaccg gcgcccacca gtcctccatc cccgccctcg gcatcgacgc 2521 gtgggacaac agcgactcct cggtcttcgc ggagatcgcc cccatgccgg ccggcctgga 2581 gacgtgggtc agcctctacc tcgcgatcac caagaacccc cagcgcggca ccttcgtgta 2641 cgacgccgcg acggaccgcg cgaagctcaa ctggacccgt gaccagaacg cccccgcggt 2701 caacgcagcc aaggcgctgt tcgaccggat caacaaggcg aacggcacga tctaccggta 2761 cgacctcttc ggcacccagc tgaaggcctt cgccgacgac ttctgctacc acccgctcgg 2821 cggctgcgtc ctgggcaagg cgacggacga ctacggccgc gtcgccggtt acaagaacct 2881 ctacgtgacc gacggttcgc tgatcccggg ttccgtcggc gtcaacccgt tcgtgaccat 2941 cacggcgctg gccgagcgga acgtcgagcg catcatcaag caggacgtca cggcgtcgta 3001 atcacgcacc gaccgggccg gaagggaaga acggcccggt caccgcacca tcccggaagg 3061 aacgcaccat tcccgtgaac gccgacgaca acagcctgtg gatc // LOCUS STMCEFDA 2494 bp ds-DNA BCT 14-AUG-1990 DEFINITION S.clavuligerus isopenicillin N epimerase (cefD) gene and deacetoxycephalosporin C synthetase (DAOCS) gene, complete cds. ACCESSION M32324 M24140 KEYWORDS deacetoxycephalosporin C synthetase; isopenicillin N epimerase. 
SOURCE S.clavuligerus DNA, clone pOW380. ORGANISM Streptomyces clavuligerus Prokaryota; Bacteria; Firmicutes; Streptomycetaceae. REFERENCE 1 (bases 1520 to 2494) AUTHORS Kovacevic,S., Weigel,B.J., Tobin,M.B., Ingolia,T.D. and Miller,J.R. TITLE Cloning, characterization, and expression in Escherichia coli of the Streptomyces clavuligerus gene encoding deacetoxycephalosporin C synthetase JOURNAL J. Bacteriol. 171, 754-760 (1989) STANDARD full staff_review REFERENCE 2 (bases 1 to 1706) AUTHORS Kovacevic,S., Tobin,M.B. and Miller,J.R. TITLE The beta-lactam biosynthesis genes for isopenicillin N epimerase and deacetoxycephalosporin C synthetase are expressed from a single transcript in Streptomyces clavuligerus JOURNAL J. Bacteriol. 172, 3952-3958 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by J.R.Miller, 26-FEB-1990, for release after publication. FEATURES from to/span description pept 284 1480 isopenicillin N epimerase (cefD) pept 1559 2494 deacetoxycephalosporin C synthetase (DAOCS) mRNA 152 > 1706 cefD mRNA BASE COUNT 387 a 931 c 791 g 385 t ORIGIN 1 ccatgggtga taactcctgt ggagcgggac aaaaggtcga aagattcaaa tgccgagcag 61 cccggacggt ttactgcata atgtgggcta cctatgctgg acgacaggac tcttgaagtg 121 ctcttcggct ggtcttcaga atctcttcgc tatttttccg aagcggacag aaaatcttcc 181 tgttcgagat cgagtcatga ctgctacgct gcatcgggcc gatcgaattt cgatggctgc 241 actgaattga ctgttcttct ggatttgcgg gagatgcgtt gacatggcgg tagccgactg 301 ggaagaagcc cgcggccgta tgctgctcga ccccaccgtc gtcaacctca acaccggctc 361 cgggggaccg ctgccgcgca gcgccttcga gcgggtcacc ggcttccgcg cccatctcgc 421 ggccgagccg atggacttcc tgctccgcga ggtccccgca ctgctgtggc aggcgcggga 481 gagcctcgcc cgcctcatcg gcggggaccc gctgcgcctc gccctggcca ccaacgtcac 541 cgccgccgtc aacctcgtcg cgtcgtcact gcgcctcgaa gcgcccggcg agatcctgct 601 cagcgacgac gagtacacgc ccatgcgctg gtgctgggag cgggtcgccc ggcggcacgg 661 cctggagctg aggacgttcc ggctgcccga gctgccctcg gacccggccg agatcaccgc 721 ggcggcggtc gccgcgatgg
gaccgcgcac ccggctgttc ttcttcagcc atgtcgtctc 781 cacgaccggg ctgatcctgc ccgccgccga actgtgcgag gaggcccgcg cacggggcat 841 caccaccgtg gtcgacggcg cccacgcacc cggcttcctc gacctcgacc tctcccggat 901 cccctgcgac ttctacgcgg gcagcggcca caaatggctg ctcgccccca ccggggtcgg 961 ctttctccac ctcgcccccg gccgcctgga agaactggag cccacccagg tgagctgggc 1021 gtacgagccc ccggagggca gcggcccgcc ggccgcgcgc gaccgcttcg gcagcacacc 1081 cgggctgcgc agactcgaat gcgaggggac ccgggacatc tgcccctggc tcgccacacc 1141 ggagtcgatc gacttccagg ccgagctggg ccccggggcg atccgcgccc gccgccgcga 1201 gctgacggac cacgcgcgcc gcctgctcgc cgaccgcccg ggccgcacgc tcctcacccc 1261 cgactccccg gagctgtccg gcggcatggt ggcctaccgg ctgcccccgg gaaccgacgc 1321 ggccgagctg cgccgggggc tctgggagcg cttccggatc gaggccgcgg tggccgagca 1381 gccgcccggg ccggtgctcc ggatctcggc gaacttctac accaccgaag aggagatcga 1441 ccgcctggcg gacgcgctgg acgcgctgac gggcgagtga tccccccggc tcgcggaccg 1501 cctcccccgc gctgttgacc acccggttca cggattacga gaggatcagt gagagttgat 1561 ggacacgacg gtgcccacct tcagcctggc cgaactccag cagggcctgc accaggacga 1621 gttccgcagg tgtctgaggg acaagggcct cttctatctg acggactgcg gtctgaccga 1681 caccgagctg aagtcggcca aggacctcgt catcgacttc ttcgagcacg gcagcgaggc 1741 ggagaagcgc gccgtcacct cgcccgtccc caccatgcgc cgcggcttca ccgggctgga 1801 gtcggagagc accgcccaga tcaccaatac cggcagctac tccgactact cgatgtgcta 1861 ctcgatgggc accgcggaca acctcttccc gtccggtgac ttcgagcgga tctggaccca 1921 gtacttcgac cgccagtaca ccgcctcccg cgcggtcgcc cgggaggtcc tgcgggcgac 1981 cgggaccgag cccgacggcg gggtcgaggc cttcctcgac tgcgagccgc tgctgcggtt 2041 ccgctacttc ccgcaggtcc ccgagcaccg cagcgccgag gagcagcccc tgcggatggc 2101 gccgcactac gacctgtcga tggtcaccct catccagcag acaccctgcg ccaacggctt 2161 cgtcagcctc caggccgagg tcggcggcgc gttcacggac ctgccctacc gtccggacgc 2221 cgtcctcgtc ttctgcggcg ccatcgcgac cctggtgacc ggcggccagg tcaaggcccc 2281 ccggcaccat gtcgcggccc cccgcaggga ccagatagcg ggcagcagcc gcacctccag 2341 tgtgttcttc ctccgtccca acgcggactt caccttctcc gtcccgctgg cgcgcgagtg 2401 cggcttcgat gtcagcctgg acggcgagac 
cgccacgttc caggattgga tcgggggcaa 2461 ctacgtgaac atccgccgca catccaaggc atag // LOCUS AMTMTCYB 225 bp ds-DNA ORG 14-AUG-1990 DEFINITION Salamander (A.tigrinum) mitochondrial cytochrome b, partial cds. ACCESSION M26230 M28046 KEYWORDS cytochrome; cytochrome b. SOURCE A.tigrinum liver mitochondrial DNA. ORGANISM Mitochondrion Ambystoma tigrinum Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Amphibia; Lissamphibia; Caudata; Ambystomatidae; Ambystoma tigrinum. REFERENCE 1 (bases 1 to 225) AUTHORS Kocher,T.D., Thomas,W.K., Meyer,A., Edwards,S.V., Paeaebo,S., Villablanca,F.X. and Wilson,A.C. TITLE Dynamics of mitochondrial DNA evolution in animals: Amplification and sequencing with conserved primers JOURNAL Proc. Natl. Acad. Sci. U.S.A. 86, 6196-6200 (1989) STANDARD full staff_review REFERENCE 2 (bases 1 to 225) AUTHORS Kocher,T.D. and White,T.J. TITLE Evolutionary analysis via PCR JOURNAL (in) Erlich,H. (Ed.); PCR Technology: Princip. and Appl. for DNA amplification: 137-147; Stockton Press, New York (1989). STANDARD full staff_review COMMENT Draft entry and computer-readable copy of sequence [1],[2] kindly submitted by T.D.Kocher, 18-JUL-1989. Bernie 12-SEP-1989 initial full entry [1] FEATURES from to/span description pept < 1 > 225 cytochrome b (AA at 1) BASE COUNT 68 a 39 c 35 g 83 t ORIGIN 1 acagggttat ttttagctat acattataca gcagatacat catcagcatt ctcatccgta 61 gcccacattt gccgagatgt aaattatggt tgacttatac gaaatattca cgcaaacggc 121 gcttcattct tttttatttg tatctttctt catattggtc gaggaatata ttacggctca 181 tatatgttca aagaaacatg aaacattgga gtaattttat tattt // LOCUS BLYLEC 972 bp ss-mRNA PLN 14-AUG-1990 DEFINITION Barley root-specific lectin mRNA, complete cds. ACCESSION M29280 M24846 KEYWORDS lectin. SOURCE Barley (var. Betzes) embryo, cDNA to mRNA, clone BLc3. ORGANISM Hordeum vulgare Eukaryota; Plantae; Embryobionta; Magnoliophyta; Liliopsida; Commelinidae; Cyperales; Poaceae. REFERENCE 1 (bases 1 to 972) AUTHORS Lerner,D.R. and Raikhel,N.V. 
TITLE Cloning and characterization of root-specific barley lectin JOURNAL Plant Physiol. 91, 124-129 (1989) STANDARD simple staff_review COMMENT Draft entry and sequence for [1] kindly submitted by N.Raikhel, 16-MAY-1989. FEATURES from to/span description pept 16 654 lectin precursor sigp 16 93 lectin signal peptide (put.) matp 94 606 lectin BASE COUNT 197 a 285 c 291 g 199 t ORIGIN 1 cagaaaacaa gaaggatgaa gatgatgagc accagggccc tcgctctcgg cgcggccgcc 61 gtcctcgcct tcgcggcggc gaccgcgcac gcccagaggt gcggcgagca gggcagcaac 121 atggagtgcc ccaacaacct ctgctgcagc cagtacgggt actgcggcat gggcggcgac 181 tactgcggca agggctgcca gaacggcgcc tgctacacca gcaagcgctg cggcactcag 241 gccggcggca agacatgccc taacaaccac tgctgcagcc agtggggtta ctgcggcttc 301 ggcgccgagt actgcggcgc cggctgccag ggcggcccct gccgcgccga catcaagtgc 361 ggcagccagg ccggcggcaa gctttgcccc aacaacctct gctgcagcca gtggggttac 421 tgcggcctcg gctccgagtt ctgcggcgag ggctgccagg gcggtgcttg cagcaccgac 481 aagccgtgcg gcaaggccgc cggcggcaaa gtttgcacca acaactactg ctgcagcaag 541 tggggatcct gtggcatcgg cccgggctac tgcggcgcag gttgccagag cggcggctgc 601 gacggtgtct tcgccgaggc catcgccgcc aactccactc ttgtcgcaga atgatgatct 661 tgctaatggc agtattattg caacgacgaa taatccgtgg cagttttgtt gccacgtacg 721 gtctcccttc acttactttt agcactagtc cttaataatt ctccagcctt gcaatatgac 781 gtgcaggttg ctacatgcat ggacatattg cagtgagaag tactgtgtgg caatataggg 841 tgtactattg ttgccacaaa tttagttctt tcttgttacg tacgtacagt tgtcaggatg 901 catgcatccc cgttgtaatg ttggagtact ccatgatttc gttgcaatat atatattgcc 961 atgagtctaa ag // LOCUS CHKMTCYB 225 bp ds-DNA ORG 14-AUG-1990 DEFINITION Chicken mitochondrial cytochrome b, partial cds. ACCESSION M26231 M28048 KEYWORDS cytochrome; cytochrome b. SOURCE Chicken mitochondrial DNA. ORGANISM Mitochondrion Gallus gallus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Aves; Neornithes; Neognathae; Galliformes; Phasianidae; Gallus gallus. REFERENCE 1 (bases 1 to 225) AUTHORS Kocher,T.D., Thomas,W.K., Meyer,A., Edwards,S.V., Paeaebo,S., Villablanca,F.X. 
and Wilson,A.C. TITLE Dynamics of mitochondrial DNA evolution in animals: Amplification and sequencing with conserved primers JOURNAL Proc. Natl. Acad. Sci. U.S.A. 86, 6196-6200 (1989) STANDARD full staff_review REFERENCE 2 (bases 1 to 225) AUTHORS Kocher,T.D. and White,T.J. TITLE Evolutionary analysis via PCR JOURNAL (in) Erlich,H. (Ed.); PCR Technology: Princip. and Appl. for DNA amplification: 137-147; Stockton Press, New York (1989). STANDARD full staff_review COMMENT Draft entry and computer-readable copy of sequence [1],[2] kindly submitted by T.D.Kocher, 18-JUL-1989. Bernie 12-SEP-1989 initial full entry [1] Mia 13-AUG-1990 citation update FEATURES from to/span description pept < 1 > 225 cytochrome b (AA at 1) BASE COUNT 56 a 84 c 35 g 50 t ORIGIN 1 accgccctac tactagccat gcactacaca gcagacacat ccctagcctt ctcctccgta 61 gcccacactt gccggaacgt acaatacggc tgactcatcc ggaatctcca cgcaaacggc 121 gcctcattct tcttcatctg tatcttcctt cacatcggac gaggcctata ctacggctcc 181 tacctctaca aggaaacctg aaacacagga gtaatcctcc tcctc // LOCUS HUMMUC18A 2940 bp ss-mRNA PRI 14-AUG-1990 DEFINITION Human MUC18 gene mRNA (3' variant), complete cds. ACCESSION M29277 KEYWORDS MUC18 glycoprotein; cell adhesion molecule; immunoglobulin-like protein; integral membrane glycoprotein. SOURCE Human (isolate JuSo) melanoma (cell line Mel JuSo), cDNA to mRNA, clone zapy1, drop 1, drop 4, drop 4.2, drop 4.7, drop 4.11. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 2940) AUTHORS Lehmann,J.M., Riethmueller,G. and Johnson,J.P. TITLE MUC18, a marker of tumor progression in human melanoma, shows sequence similarity to the neural cell adhesion molecules of the immunoglobulin superfamily JOURNAL Proc. Natl. Acad. Sci. U.S.A. 86, 9891-9895 (1989) STANDARD full staff_review COMMENT Draft entry and printed sequence for [1] kindly submitted by J.P.Johnson, 06-OCT-89. 
FEATURES from to/span description pept 8 1819 MUC18 glycoprotein precursor sigp 8 91 MUC18 glycoprotein signal peptide matp 92 1816 MUC18 glycoprotein site 122 373 immunoglobulin-like (V set) domain I site 464 691 immunoglobulin-like (V set) domain II site 793 985 immunoglobulin-like (C2 set) domain site 1073 1246 immunoglobulin-like (C2 set) domain site 1334 1522 immunoglobulin-like (C2 set) domain BASE COUNT 668 a 861 c 856 g 555 t ORIGIN 1 gggaagcatg gggcttccca ggctggtctg cgccttcttg ctcgccgcct gctgctgctg 61 tcctcgcgtc gcgggtgtgc ccggagaggc tgagcagcct gcgcctgagc tggtggaggt 121 ggaagtgggc agcacagccc ttctgaagtg cggcctctcc cagtcccaag gcaacctcag 181 ccatgtcgac tggttttctg tccacaagga gaagcggacg tcatcttccg tgtgcgccag 241 ggccagggcc agagcgaacc tggggagtac gagcaagcgg ctcagcctcc aggacagagg 301 ggctactctg gccctgactc aagtcacccc ccaagacgag cgcatcttct tgtgccaggg 361 caagcgcctc ggtcccagga gtaccgcatc cagctccgcg tctacaaagc tccggatgcc 421 aaacatccag gtcaaccccc tgggcatccc tgtgaacagt aaggagcctg aggaggtcgc 481 tacctgtgta gggaggaacg ggtaccccat tcctcaagtc atctggtaca agaatggccg 541 gcctctgaag gaggagaaga accgggtcca cattcagtcg tcccagactg tggagtcgag 601 tggtttgtac accttgcaga gtattctgaa ggcacagctg gttaaagaag acaaagatgc 661 ccagttttac tgtgagctca actaccggct gcccagtggg aaccacatga aggagtccag 721 ggaagtcacc gtccctgttt tctacccgac agaaaaagtg tggctggaag tggagcccgt 781 gggaatgctg aaggaagggg accgcgtgga aatcaggtgt ttggctgatg gcaaccctcc 841 accacacttc agcatcagca agcagaaccc cagcaccagg gaggcagagg aagagacaac 901 caacgacaac ggggtcctgg tgctggagcc tgcccggaag gaacacagtg ggcgctatga 961 atgtcaggcc tggaacttgg acaccatgat atcgctgctg agtgaaccac aggaactact 1021 ggtgaactat gtgtctgacg tccgagtgag tcccgcagcc cctgagagac aggaaggcag 1081 cagcctcacc ctgacctgtg aggcagagag tagccaggac ctcgagttcc agtggctgag 1141 agaagagaca gaccaggtgc tggaaagggg gcctgtgctt cagttgcatg acctgaaacg 1201 ggaggcagga ggcggctatc gctgcgtggc gtctgtgccc agcatacccg gcctgaaccg 1261 cacacagctg gtcaagctgg ccatttttgg ccccccttgg atggcattca aggagaggaa 1321 ggtgtgggtg
aaagagaata tggtgttgaa tctgtcttgt gaagcgtcag ggcacccccg 1381 gcccaccatc tcctggaacg tcaacggcac ggcaagtgaa caagaccaag atccacagcg 1441 agtcctgagc accctgaatg tcctcgtgac cccggagctg ttggagacag gtgttgaatg 1501 cacggcctcc aacgacctgg gcaaaaacac cagcatcctc ttcctggagc tggtcaattt 1561 aaccaccctc acaccagact ccaacacaac cactggcctc agcacttcca ctgccagtcc 1621 tcataccaga gccaacagca cctccacaga gagaaagctg ccggagccgg agagccgggg 1681 cgtggtcatc gtggctgtga ttgtgtgcat cctggtcctg gcggtgctgg gcgctgtcct 1741 ctatttcctc tataagaagg gcaaggctgc cgtgcaggcg ctcagggaag caggagatca 1801 cgctgccccc gtctcgtaag accgaacttg tagttgaagt taagtcagat aagctcccag 1861 aagagatggg cctcctgcag gcagcagcgg tgacaagagg gctccgggag accagggaga 1921 gaaatacatc gatctgaggc attagccccg aatcacttca gctcccttcc ctgcctggac 1981 cattcccagc tccctgctca ctcttctctc agccaaagct caaagggact agagagaagc 2041 ctcctgctcc cctcgcctgc acaccccctt tcagagggcc actgggttag gacctgagga 2101 cctcacttgg ccctgcaagg cccgcttttc agggaccagt ccaccaccat ctcctccacg 2161 ttgagtgaag ctcatcccaa gcaaggagcc ccagtctccc gagcgggtag gagagtttct 2221 tgcagaacgt gttttttctt tacacacatt atgctgtaaa tacgctcgtc ctgccagcag 2281 ctgagctggg tagcctctct gagctggttt cctgccccaa aggctggcat tccaccatcc 2341 aggtgcacca ctgaagtgag gacacaccgg agccaggcgc ctgctcatgt tgaagtgcgc 2401 tgttcacacc cgctccggag agcaccccag cagcatccag aagcagctgc agtgcaagct 2461 tgcatgcctg cgtgttgctg caccaccctc ctgtctgcct cttcaaagtc tcctgtgaca 2521 ttttttcttt ggtcagaggc caggaactgt gtcattcctt aaagatacgt gccggggcca 2581 ggtgtggctc acgcctgtaa tcccagcact ttgggaggcc gaggcggcgg atcacaaagt 2641 cagacgagac catcctggct aacacggtga aaccctgtct ctactaaaaa tacaaaaaaa 2701 aattagctag gcgtagtggt tggcacctat agtcccagct actcggaagg ctgaagcagg 2761 agaatggtat gaatccagga ggtggagctt gcagtgagcc gagaccgtgc cactgcactc 2821 cagcctgggc aacacagcga gactccgtct cgagccggcc ggttgcgcgg gccctcggac 2881 cctcagagag gcgagggttc gagggcacga gttcgaggcc aacctggtcc acatgggttg // LOCUS HUMTELB 262 bp ds-DNA PRI 14-AUG-1990 DEFINITION Human DNA from the end of the 
chromosomal telomere. ACCESSION M19947 J04078 KEYWORDS repetitive sequence. SOURCE Human placental DNA, clone pHuR 143. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 262) AUTHORS Moyzis,R.K., Buckingham,J.M., Cram,L.S., Dani,M., Deaven,L.L., Jones,M.D., Meyne,J., Ratliff,R.L. and Wu,J.R. TITLE A highly conserved repetitive DNA sequence, (TTAGGG)n, present at the telomeres of human chromosomes JOURNAL Proc. Natl. Acad. Sci. U.S.A. 85, 6622-6626 (1988) STANDARD full staff_review COMMENT Draft entry and computer readable sequence [1] kindly submitted by S.Jett 27-JUL-88. The (ttaggg)n telomeric sequence is found at the ends of all human chromosomes as well as in some other higher eukaryotic genomes. FEATURES from to/span description site 80 81 rare insertion/substitution site 243 244 rare insertion/substitution BASE COUNT 44 a 0 c 131 g 87 t ORIGIN Unreported. 1 tttagggtta gggttagggt tagggttagg gttagggtta gggttagggt tagggttagg 61 gttagggtta gggttaggga gtagggttag ggttagggtt agggttaggg ttagggttag 121 ggttagggtt agggttaggg ttagggttag ggttagggtt agggttaggg ttagggttag 181 ggttagggtt agggttaggg ttagggttag ggttagggtt agggttaggg ttagggttag 241 ggtgttaggg ttagggttag gg // LOCUS LCOLHICDH 1320 bp ds-DNA BCT 14-AUG-1990 DEFINITION L.confusus L-2-hydroxyisocaproate dehydrogenase (L-HicDH) gene, complete cds. ACCESSION M31425 M28050 KEYWORDS L-2-hydroxyisocaproate dehydrogenase. SOURCE L.confusus (strain DSM20196) DNA. ORGANISM Lactobacillus confusus Prokaryota; Bacteria; Firmicutes; Regular asporogenous rods; Lactobacillaceae. REFERENCE 1 (bases 1 to 1320) AUTHORS Lerch,H.-P., Frank,R. and Collins,J. 
TITLE Cloning, sequencing and expression of the L-2-hydroxyisocaproate dehydrogenase-encoding gene of Lactobacillus confusus in Escherichia coli JOURNAL Gene 83, 263-270 (1989) STANDARD simple staff_entry FEATURES from to/span description pept 328 1260 L-2-hydroxyisocaproate dehydrogenase ORF < 1 243 ORF (AA at 1) mRNA 301 1304 LDH mRNA (3' end put.) signal 1267 1303 transcription termination signal binding 318 323 ribosome binding site signal 267 272 -35 region signal 290 295 -10 region BASE COUNT 333 a 263 c 353 g 371 t ORIGIN 1 ttctttttgt acaccatgac aatgtttgct gctaagacat cttcgccagt tgaaacggcc 61 cgtttgtcag gtatggcgca agctggtggt tactttatgt ctgcattcgg tccaatgctt 121 tatggtatgg cttttaccgc aaatccgaat ggggtcatcc aaaacgtcgt ttatttggta 181 ttggtgattg tgatgattgt cgctgccgtg atgatggcga tgactaagca tttgtttgac 241 taaaataatg atgagatgtt gttcagtggt gaacaacgtc tcttttttgt ataatgaaca 301 tacgacaaaa acaatttggg ggatattatg gcacgtaaga ttggaattat cggccttgga 361 aacgttgggg ctgcagtagc gcacggattg attgcacaag gtgtagccga cgactacgtc 421 tttattgatg caaacgaagc aaaggtgaag gctgatcaaa ttgatttcca agacgcaatg 481 gcgaacttgg aagcgcacgg taacattgtg attaacgatt gggcagcctt ggctgatgct 541 gatgttgtga tttcaacact ggggaacatc aagttgcaac aagacaaccc aaccggtgac 601 cgttttgctg agttgaagtt taccagcagc atggtgcaat cagtcggcac aaacttgaag 661 gaatctggtt tccacggcgt attggtcgtg atttcaaacc cggtcgacgt gattacggcc 721 ttgttccaac acgtgactgg tttcccagct cacaaggtta tcggaaccgg tactttgctt 781 gacacggcgc gtatgcaacg tgcagttggt gaggcgtttg atttggatcc acgttctgtt 841 tcaggttaca acttgggtga gcacggtaac tcacaattcg tagcttggtc aacggtgcgc 901 gtgatgggtc aaccaatcgt gacgttggct gatgccggcg atattgactt ggcggccatc 961 gaagaggaag cacgtaaggg tggcttcacg gtcttgaatg gtaagggcta cacgagttat 1021 ggtgttgcaa cgtcagcaat ccgcattgcc aaggctgtta tggctgacgc gcatgctgaa 1081 ttggttgtct caaatcgtcg cgatgacatg ggaatgtact tgtcataccc agcgattatt 1141 ggtcgcgatg gtgtcttggc agaaacgacg cttgatttga cgacggatga gcaagaaaag 1201 cttttgcaat cacgtgacta catccaacaa cgtttcgacg aaattgtgga 
tacactctaa 1261 aaacacaaaa agtggcttca atctggtgat tggaccactt ttttctaatt gtgcatgaaa // LOCUS SIVAGM266 300 bp ss-RNA VRL 14-AUG-1990 DEFINITION Simian (African green monkey) immunodeficiency virus, isolate 266, partial pol cds. ACCESSION M22974 KEYWORDS . SOURCE Simian (African green monkey) immunodeficiency virus proviral DNA, isolate 266. ORGANISM Simian immunodeficiency virus Viridae; ss-RNA enveloped viruses; Positive strand RNA virus; Retroviridae; Lentivirinae. REFERENCE 1 (bases 1 to 300) AUTHORS Li,Y., Naidu,Y.M., Daniel,M.D. and Desrosiers,R.C. TITLE Extensive genetic variability of simian immunodeficiency virus from African green monkeys JOURNAL J. Virol. 63, 1800-1802 (1989) STANDARD full staff_review COMMENT Hard copy of sequence kindly provided by R.C. Desrosiers, 25-JAN-1989. FEATURES from to/span description pept < 1 > 300 pol polyprotein (AA at 1) pre-msg < 1 > 300 genomic mRNA pre-msg < 1 > 300 tat, rev, nef subgenomic mRNA BASE COUNT 108 a 59 c 74 g 59 t ORIGIN 1 gcaggactac tggcaggcag ttggatacct gactggacct ttgtaagcgt cccacctctg 61 gtaacattgt ggtacacctt aacaaaggaa cccatcccag gagaagatgt gtattatgta 121 gatggagctt gcaacaggaa ctcaagagaa ggaaaagcag ggtatataac tcagcaaggg 181 aaacagagag tggagaagct agaaaatacc acaaatcagc aagcagaatt aacagctata 241 aaaatggcct tagaagacag tggtcccagg gtaaacattg ttacagactc tcaatatgca // LOCUS STYPOL3E 454 bp ds-DNA BCT 14-AUG-1990 DEFINITION S.typhimurium DNA polymerase III epsilon subunit gene, 5' end. ACCESSION M26045 M29726 KEYWORDS polymerase III. SOURCE S.typhimurium (strain LT2) DNA, clone pFF16. ORGANISM Salmonella typhimurium Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Facultatively anaerobic rods; Enterobacteriaceae. REFERENCE 1 (bases 1 to 454) AUTHORS Lancy,E.D., Lifsics,M.R., Kehres,D.G. and Maurer,R. TITLE Isolation and characterization of mutants with deletions in dnaQ, the gene for the editing subunit of DNA polymerase III in Salmonella typhimurium JOURNAL J. Bacteriol. 
171, 5572-5580 (1989) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly provided by R.A.Maurer, 10-JUL-1989. FEATURES from to/span description pept 59 > 454 polymerase III epsilon (editing) subunit BASE COUNT 107 a 90 c 96 g 104 t 57 others ORIGIN 1 gtagacttcc tgtaattgaa atcgataaca aaacgcaagt ctgacataaa tgaccgatat 61 gagcactgca attacacgac agatcgtcct cgataccgaa accaccggta tgaatcagat 121 aggcgcgcac tatgaaggtc acaagattat tgagatcggt gcggttgagg tgataaaccg 181 tcgtctgacc ggcaacaatt ttcatgttta cctgaagccc gatcgccttg tcgatccaga 241 ggcttttggc gtacacggta ttgccgatga gtttctgctg gataagccgg tttttgctga 301 tgtggtcgat gagtttcttg attatatnnn nggcgcggag ctggtcatcc ataacgcatn 361 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nncctaaaac 421 caatactttc tgcaaagtta ccgacagcct ggcg // LOCUS RHMSYRMA 2636 bp ds-DNA BCT 14-AUG-1990 DEFINITION R.meliloti syrM gene, complete cds. ACCESSION M33495 KEYWORDS syrM gene. SOURCE R.meliloti (strain 1021) DNA, clones pMB[1,2]. ORGANISM Rhizobium meliloti Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Rhizobiaceae. REFERENCE 1 (bases 1 to 2636) AUTHORS Long,S.R. JOURNAL Unpublished (1990) STANDARD full staff_review REFERENCE 2 (bases 850 to 1990) AUTHORS Barnett,M.J. and Long,S.R. TITLE DNA sequence and translational product of a new nodulation-regulatory locus: SyrM has sequence similarity to NodD protein JOURNAL J. Bacteriol. 172, 3695-3700 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by S.R.Long, 02-APR-1990, for release after publication.
FEATURES from to/span description pept 991 1971 SyrM protein BASE COUNT 534 a 769 c 792 g 541 t ORIGIN 1 gcatgcgcga cgagctgctg aacgagagtc tgttcttcgg tctcagccat gccagcaggg 61 ccatttccaa atggtcgacg actgcaatac gttccggccg cactcgtcgc tcggataccg 121 caccccggca gcctatcctg ggatcatcgc cgcaaccgag ccagttggtg tatcgaaaac 181 ggtcgaggct ctaatcgccg ctggatgaaa agttcagtgg caggtccgaa aggccacgta 241 ttcggagcca acctcctcta gcgatcgctc tctcctgaaa acttgacagg gcgtcccccg 301 accgccttgc cgaggcggcg cgtgcccgac ggcggcctcg acagcaagcc ggtctcgacg 361 gtcttctatc ggcgctgcag gcaatgggca tttgaaaccg tttcaagagt ctccttaaca 421 gagctcgcgc cgggtgaaga ccctgtatga tggatagtga agtgcagggg cggctgctta 481 ccggccgttg agtgaggcta tgtatccaaa acgtggatga cagtcatgcg aaaacgcgat 541 ttctcctgtc tcgccgaatg ccattacaag aacgttagat ataaccggag ggcagaagtg 601 ggcagcgaga tcaattccga cagactcctt tgcaatgaaa gcatgatttg cgaccagggc 661 ctcagatgtg ttgaggcagt tttcttcagt aagtttcagg cggggccact ttggcgagat 721 cattttgtgg cttgctcgtt ggcactgaaa tatgatgtga gccggcaatt cctcgaaagc 781 attctggacc agcgggtctt cgacatccat caggctaggc ctgcgaggcc cggggggcgt 841 cacagccaag tacttctgat taacggaaaa tcacgacgct tcccatttgt cggcgggcca 901 ggaaagtctt tggcacgcac acccatattg tgacgacctg gatcttcggg atgtggcgcg 961 ggaacattac ctgccgagga gaatagatcc atggatcagc ccacttggaa gcggccgcat 1021 agggccaaat ttgccggtgt cagtgacgcc gcacaacagc ggcaaatgcc gaaccttgcg 1081 tccatcgacc taaacctact ggtggatctt gaggctctgc tgcagtatcg gcacattact 1141 caagcggccc agcacgtcgg ccgtagccaa ccggcgatga gcagagcctt gtcaaggctg 1201 cgcggcatgt taaaggacga tcttctggtt gccggctcta gaggcttggt cctgacgcct 1261 ctggctgaat gcttgaccca gatgctgcct tcagtactag acgcgattcg ccagatgatg 1321 aacctcagct tggctccggc gcaacggcga tggaaggtga caatggctat gcccgatcac 1381 caagcggtgg ttctgctgcc acatctgttg ccgcggttac acgagcgcgc ccctcatctc 1441 gacattgtga ccgatccgct tttgggcggc gcgctggggt tacttgaaca aggtgagatc 1501 gacgtggtcg tcgggcagat gggcgccgct ccgctcggct acttgcggcg cagactctac 1561 gccgacagct tcacctgcgt gctacgccac aatcacccgg ccttggcgca ggagtggacg 
1621 atcgaggctt tcgcggcctt gcgccacgtc gccattgcct cggaacctga cgagcttttc 1681 ggtcagatct atgacaggtt aaccaaactc ggactgcagc gtggtgatcc gatggtggtt 1741 tccaccgtac tgaccgcggc ggtcttgatc gcggcgactg actcagtgct ggttgtacca 1801 agccgcgtgg cgacccgtgt cgccgccatg ctgtcgctcg cggtgattcc cccgcctgtg 1861 gaactgagac catacgaagt cgcgctgatc tggcacgagc ggtgccatcg agacccagag 1921 catcgctggc tgcgtggcga aatcgccgcc gcagcgtcca cggccggtta gacgaagact 1981 agcgaatagg cgaggatggg tgagcataat gaatgacgac ggtttgatga ttgactatcc 2041 cggggcggtg gaggacccgg tcgatgcgcg atttgcccga tgactcacac ccgggagggg 2101 gtgagtccga gcgcgcctac cacatgcggc tgggtccaga caccgccgtt ttcgtcaacc 2161 ggccgaagtc gcaagggggc acgagcgagg atcgagttgc aacgatgagc cgtcatagcg 2221 tcatagcggc tgtgcacgcc ccaatcccgg cgggttcgcg gcgttgtgtc aaatgctcct 2281 ctcatcgcta ccgatgacaa tacgcgaccc tgctccgaac gtaggtgttc cgtcagtcgc 2341 cccacgtcgc cagcgcggac tgcgatccgc tgagacggaa ctatgcttag ccgccgcatt 2401 ccgtcgcggt gtgggaaatt gcacgcgcag cagagtcttg acgatgctaa aaccatgagg 2461 agcccttgga ggcaccggtc ttttcagagg tgctctctct gctgcgcgcc atccatactt 2521 tggacgatct gcaccgtcga gaacgcaata acgctcctgg cttctgccta acgatacggt 2581 ccgacccggc gtgaatcgac atgagcttag cgtacccatc gtcgcaaggt gcatgc // LOCUS ECCCPL 615 bp ss-mRNA INV 14-AUG-1990 DEFINITION E.granulosus cyclophilin mRNA, 3'end. ACCESSION J04664 M24616 KEYWORDS cyclophilin. SOURCE E.granulosus, cDNA to mRNA, clone EA21. ORGANISM Echinococcus granulosus Eukaryota; Animalia; Eumetazoa; Platyhelminthes; Cestoda; Eucestoda; Cyclophyllidea; Taeniidae. REFERENCE 1 (bases 1 to 615) AUTHORS Lightowlers,M.W., Haralambous,A. and Rickard,M.D. TITLE Amino acid sequence homology between cyclophilin and a cDNA-cloned antigen of Echinococcus granulosus JOURNAL Mol. Biochem. Parasitol. 36, 287-290 (1989) STANDARD simple staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by M.Lightowlers, 02-MAY-1989. This E.granulosus antigen exhibits 71% amino acid homology with human cyclophilin. 
FEATURES from to/span description pept < 1 488 antigen ORF (cyclophilin) (AA at 3) mRNA < 1 615 CPL mRNA BASE COUNT 145 a 135 c 166 g 169 t ORIGIN 1 tgggcgtgaa gtgcttcttc gacatctcca ttggcggtaa gccagcgggt cgcatcgtct 61 ttgccctgtt tgatgacgtc ccaaagacag tagagaactt ccgtgcactg tgcaccggtg 121 agaagggttt cggatacaag ggctcgaagt ttcaccgaat tattcccggt tttatgtgcc 181 aaggtggtga ttttactgcc gggaatggta ccggtggcaa gagcatatac gggagcaaat 241 ttgaggatga aaatttcaat cacaagcaca gcaagccgat gatgctctcg atggcgaatg 301 cgggtaagaa caccaatggc tcgcaattct tcatcactac cgccgtcacc agctggcttg 361 atgggaagca tgttgtcttt ggtgaggtag aaagtggcga agatgttgtt aaggatatgg 421 aggccgtcgg tagcagtagt ggcaagactt cccaagaagt actgattact gactgcggtc 481 agctgtaaat caatcggccg aacgttcact ctattccctt tattaattgt atcgcgctaa 541 tcctgctcct cattgcttct ttacccgttt gtggccgtaa cgtgtttcaa caatgaagtc 601 gttgtgcgta ttttg // LOCUS ECOINTDLP 2269 bp ds-DNA BCT 14-AUG-1990 DEFINITION E.coli ArgU-tRNA gene overlap with prophage DLP12 integrase gene, complete cds. ACCESSION M27155 KEYWORDS insertion element IS3; integrase; transfer RNA-ArgU. SOURCE E.coli (strain K12) DNA, clones pDC1 and pDM1614. ORGANISM Escherichia coli Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Facultatively anaerobic rods; Enterobacteriaceae. REFERENCE 1 (bases 1 to 2269) AUTHORS Lindsey,D.F., Mullin,D.A. and Walker,J.R. TITLE Characterization of the cryptic lambdoid prophage DLP12 of Escherichia coli and overlap of the DLP12 integrase gene with the tRNA gene ArgU JOURNAL J. Bacteriol. 171, 6197-6205 (1989) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly provided by D.F.Lindsey, 16-AUG-1989. A homologous region to bacteriophage P22 excisionase is located at positions 1423-1205. 
Positions 1727-1422 are 97.4 % identical to phage lambda positions 31627-31322 encoding the C-terminal end of exonuclease, and bases 1726-2126 are 92 % identical to phage lambda bases 39972-40372 encoding the C-terminal 102 residues of the acid P gene product and the N-terminal 31 residues of the ren gene product. FEATURES from to/span description tRNA 73 149 Arg-tRNA tRNA 48 227 Arg-tRNA precursor (alt.) tRNA 48 237 Arg-tRNA precursor (alt.) pept 1328 165 (c) DLP12 integrase trns 2127 > 2269 insertion element IS3 (alpha-4/beta-4) signal 14 19 -35 region signal 36 41 -10 region BASE COUNT 539 a 596 c 512 g 622 t ORIGIN 1 agatcaaaag ccattgactc agcaagggtt gaccgtataa ttcacgcgat tacaccgcat 61 tgcggtatca acgcgccctt agctcagttg gatagagcaa cgaccttcta agtcgtgggc 121 cgcaggttcg aatcctgcag ggcgcgccat tacaattcaa tcagttacgc cttctttata 181 tcctccataa tttcagagtg ggacatattt gggacattat caccaaaaat gtcgtctatt 241 ttcctcgcat gctctgtcaa atgattaggc gcaaggtgag catacctacg aaccatttct 301 atggactccc atccgcccat ttcctgaagc actgataatg ggacgcctga ctgaatcagc 361 cagcttgccc aggtgtgtct gaggtcatgg aaacggaaat cttcaattcc tgcacgacga 421 caagctgata gccatgatgt cttgctgtcg atgcgcatct tcctgaccgc aggcgttgat 481 gttccatctg ctcgcttagc cgccttggta tgtacaaaca cccatttgtg atgcttgcct 541 atttgatcac gcaacacttt acaggcggta tcgttcagcg ccacaccaat ggcgcggttt 601 gatttgctct cttctggatt cacccaggca actcgtcgct gcatgtcgat ttgttgccat 661 tccagattta tgatgttcga ctttctcaga ccagttgcca gcgcaaactt gacgacagat 721 ttcagtggtt cggggcactc atcaataagg cgttttgctt cctccttttc cagccatctg 781 actcgcttgt ttctgaccgc tggtatcttg atgacaggcg ctttttccag ccacttccag 841 tcgcgttctg cagcacggag aatggccttt atcatggcaa gatgctttgc ctttgtctga 901 gttgatactg gctttggttc ataaacaggc agttctttac ctttcctgat ggcggcctga 961 actttctgtt tccatatttc tttcgtcttt ctgttatgca ttctgcttac agcagagtaa 1021 atctttgcct ccgagatatc tttaagcctt ataccctcaa aatgttcaag ccagaactca 1081 atccggcttt tatctgaatc gagagatttt ttatcagctt tttcctcaag ccatcttagg 1141 caggcctctt caaaagtgac atcaggtaaa 
tcccctagct tttctactcg ccagagttct 1201 gcttttcgct tgtcgtgcaa ctcctgagct tgccgtttgt cctttgtgcc aagagattcc 1261 ttaattcgtt tcccgcccgg gagcgaatac gaggcatacc atatttcatt tctgcggaag 1321 agtgacattt tctttcctct gttatgccat cacccgcgct cacctggaca gtatgcagcg 1381 gagactgaag cgccgcaatg caggcttgcc gtgttgtgag gtacccggat attatcgtga 1441 ggatgcgtca tcgccattgc tccccaaata caaaaccaat ttcagccagt gcctcgtcca 1501 ttttttcgat gaactccggc accatctcgt caaaactcgc catgtacttt tcattccgct 1561 caatcacgac ataatgcagg ccttcacgct tcatgcgcgg gtcatagttg gcaaagtacc 1621 aggcatcttt tcgcgtcacc cacatgctgt actgcacctg ggccatgtaa gccgatttta 1681 ttgcctcgaa accaccgagc cggaatttca tgaaatcccg ggaggtacga gtattgccgg 1741 aagcgtggcc tgtatccgga tgcagagtct tatccgtgga aatcgaacgc gcattactgg 1801 ttggttacca acttgtacca gaacatgcgg gccaatgcgc tggctgacgc ggaattacgg 1861 cgcaaggctg ccgatgaact gacctgtatg acagcgcgaa ttaaccgtgg tgagacgata 1921 cctgaaccag taaaacaact tcctgttatg ggcggtagac ctctaaatcg tgcacaggct 1981 ctggcgaaga tcgcagaaat taaagctaag ttcggactga aaggagcaag tgtatgacgg 2041 gcaaagaggc aattattcat tacctgggga cgcataagag cttctgtgca caggacgttg 2101 ccgcggtaac aggcgcaacc gtaatctgat cttacccagc aatagtggac acgcggctaa 2161 gtgagtaaac tctcagtcag aggtgactca catgacaaaa acagtatcaa ccagtaaaaa 2221 accccgtaaa cagcattcgc ctgaatttcg cagtgaagcc ctgaagctt // LOCUS MHVJHM2CA 49 bp ss-RNA VRL 14-AUG-1990 DEFINITION Mouse hepatitis virus leader RNA, clone JHM-a. ACCESSION M22059 KEYWORDS . SOURCE MHV mouse hepatitis virus (strain JHM) RNA, clone JHM-a. ORGANISM Murine hepatitis virus Viridae; ss-RNA enveloped viruses; Positive strand RNA virus; Coronaviridae. REFERENCE 1 (bases 1 to 49) AUTHORS Makino,S. and Lai,M.M.C. TITLE Evolution of the 5'-end of genomic RNA of murine coronaviruses during passages in vitro JOURNAL Virology 169, 227-232 (1989) STANDARD simple staff_entry COMMENT Draft entry kindly provided by S.Makino, 03-JAN-1989. BASE COUNT 16 a 10 c 4 g 19 t ORIGIN 47 bp upstream of 5' end of genomic RNA. 
1 cttgtagttt aaatctaatc taatctaaac tttataaacg gcacttcct // LOCUS MHVJHM2CB 44 bp ss-RNA VRL 14-AUG-1990 DEFINITION Mouse hepatitis virus leader RNA, clone JHM-b. ACCESSION M23059 KEYWORDS . SOURCE MHV mouse hepatitis virus (strain JHM) RNA, clone JHM-b. ORGANISM Murine hepatitis virus Viridae; ss-RNA enveloped viruses; Positive strand RNA virus; Coronaviridae. REFERENCE 1 (bases 1 to 44) AUTHORS Makino,S. and Lai,M.M.C. TITLE Evolution of the 5'-end of genomic RNA of murine coronaviruses during passages in vitro JOURNAL Virology 169, 227-232 (1989) STANDARD simple staff_entry COMMENT Draft entry kindly provided by S.Makino, 03-JAN-1989. BASE COUNT 14 a 9 c 4 g 17 t ORIGIN 47 bp upstream of 5' end of genomic RNA. 1 cttgtagttt aaatctaatc taaactttat aaacggcact tcct // LOCUS MHVJHM2CC 49 bp ds-DNA VRL 14-AUG-1990 DEFINITION Mouse hepatitis virus leader RNA, clone B1-c. ACCESSION M23060 KEYWORDS . SOURCE MHV mouse hepatitis virus (strain B1) RNA, clone B1-c. ORGANISM Murine hepatitis virus Viridae; ss-RNA enveloped viruses; Positive strand RNA virus; Coronaviridae. REFERENCE 1 (bases 1 to 49) AUTHORS Makino,S. and Lai,M.M.C. TITLE Evolution of the 5'-end of genomic RNA of murine coronaviruses during passages in vitro JOURNAL Virology 169, 227-232 (1989) STANDARD simple staff_entry COMMENT Draft entry kindly provided by S.Makino, 03-JAN-1989. BASE COUNT 16 a 10 c 4 g 19 t ORIGIN 47 bp upstream of 5' end of genomic RNA. 1 cttgtagttt aaatctaatc taatctaaac tttataaacg gcacttcct // LOCUS MHVJHM2CD 44 bp ds-DNA VRL 14-AUG-1990 DEFINITION Mouse hepatitis virus leader RNA, clone B1-d. ACCESSION M23061 KEYWORDS . SOURCE MHV mouse hepatitis virus (strain B1) RNA, clone B1-d. ORGANISM Murine hepatitis virus Viridae; ss-RNA enveloped viruses; Positive strand RNA virus; Coronaviridae. REFERENCE 1 (bases 1 to 44) AUTHORS Makino,S. and Lai,M.M.C. 
TITLE Evolution of the 5'-end of genomic RNA of murine coronaviruses during passages in vitro JOURNAL Virology 169, 227-232 (1989) STANDARD simple staff_entry COMMENT Draft entry kindly provided by S.Makino, 03-JAN-1989. BASE COUNT 14 a 9 c 4 g 17 t ORIGIN 47 bp upstream of 5' end of genomic RNA. 1 cttgtagttt aaatctaatc taaactttat aaacggcact tcct // LOCUS MHVJHM2CE 45 bp ds-DNA VRL 14-AUG-1990 DEFINITION Mouse hepatitis virus leader RNA, clone JHM-2c-e. ACCESSION M23062 KEYWORDS . SOURCE MHV mouse hepatitis virus (strain JHM-2c) RNA, clone JHM-2c-e. ORGANISM Murine hepatitis virus Viridae; ss-RNA enveloped viruses; Positive strand RNA virus; Coronaviridae. REFERENCE 1 (bases 1 to 45) AUTHORS Makino,S. and Lai,M.M.C. TITLE Evolution of the 5'-end of genomic RNA of murine coronaviruses during passages in vitro JOURNAL Virology 169, 227-232 (1989) STANDARD simple staff_entry COMMENT Draft entry kindly provided by S.Makino, 03-JAN-1989. BASE COUNT 14 a 10 c 4 g 17 t ORIGIN 47 bp upstream of 5' end of genomic RNA. 1 cttgtagttt aaatctaatc taatctaatc taaacggcac ttcct // LOCUS MHVJHM2CF 40 bp ds-DNA VRL 14-AUG-1990 DEFINITION Mouse hepatitis virus leader RNA, clone JHM-2c-f. ACCESSION M23063 KEYWORDS . SOURCE MHV mouse hepatitis virus (strain JHM-2c) RNA, clone JHM-2c-f. ORGANISM Murine hepatitis virus Viridae; ss-RNA enveloped viruses; Positive strand RNA virus; Coronaviridae. REFERENCE 1 (bases 1 to 40) AUTHORS Makino,S. and Lai,M.M.C. TITLE Evolution of the 5'-end of genomic RNA of murine coronaviruses during passages in vitro JOURNAL Virology 169, 227-232 (1989) STANDARD simple staff_entry COMMENT Draft entry kindly provided by S.Makino, 03-JAN-1989. BASE COUNT 12 a 9 c 4 g 15 t ORIGIN 47 bp upstream of 5' end of genomic RNA. 1 cttgtagttt aaatctaatc taatctaaac ggcacttcct // LOCUS MUSPDGFA 906 bp ss-mRNA ROD 14-AUG-1990 DEFINITION Mouse platelet-derived growth factor A chain (PDGFA) mRNA, complete cds. 
ACCESSION M29464 KEYWORDS platelet-derived growth factor; platelet-derived growth factor A chain. SOURCE Mouse F9 embryonal carcinoma cell line, cDNA to mRNA, clone F9A5. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 906) AUTHORS Mercola,M. JOURNAL Unpublished (1989) STANDARD full staff_review REFERENCE 2 (sites for [1]) AUTHORS Mercola,M., Wang,C., Kelly,J., Brownlee,C., Jackson-Grusby,L., Stiles,C. and Bowen-Pope,D. TITLE Selective expression of PDGF A and its receptor during early mouse embryogenesis JOURNAL Dev. Biol. 138, 114-122 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by M.Mercola, 26-OCT-1989. FEATURES from to/span description pept 63 653 platelet-derived growth factor A chain BASE COUNT 217 a 250 c 270 g 169 t ORIGIN 1 cgggctggag cgcccgcccc ggccctcgcc tccctgccga gcttccagcg cctcgggacg 61 cgatgaggac ctgggcttgc ctgctgctcc tcggctgcgg atacctcgcc catgccctgg 121 ccgaggaagc cgagataccc cgggagttga tcgagcggct ggctcgaagt cagatccaca 181 gcatccggga cctccagcga ctcttggaga tagactccgt aggggctgag gatgccttgg 241 agacaagtct gagagcccat gggtcccatg ccattaacca tgtgcccgag aagcggcctg 301 tgcccattcg caggaagaga agtattgagg aagccattcc tgcagtttgc aagaccagga 361 cggtcattta cgagatacct cggagccagg tggaccccac atcggccaac ttcctgatct 421 ggcccccatg tgtggaggtg aagcgctgca ctggctgttg taacaccagc agcgtcaagt 481 gccagccttc acgggtccac caccgcagtg tcaaggtggc caaagtggag tatgtcagga 541 agaagccaaa attgaaagag gtccaggtga ggttagagga acacctggag tgtgcatgtg 601 cgacctccaa cctgaaccca gaccatcggg aggaggagac agatgtgagg tgagatagag 661 ccggccgccc ctctcccggg gacacgatgt gcgtggcgtg tgacattcct gaacatacta 721 tgtatggtgc ttcattgcca atgtgcgtgc ggtctttgtt ctcctccgtg aaaaaccgtg 781 tccgaggaca aagagacagt gtccgtttgt tcagtgtgac atcaaagcaa gtactgtagc 841 actcagagag acagtgagaa agcttccttg tcacagagag cgaaaacaaa accacaaaaa 901 agaaaa // LOCUS ORFPRTPS 
660 bp ds-DNA VRL 14-AUG-1990 DEFINITION Orf virus homologue of retroviral pseudoprotease gene, complete cds. ACCESSION M30023 J04371 KEYWORDS pseudoprotease. SOURCE Orf virus (strain NZ2) DNA. ORGANISM Orf virus Viridae; ds-DNA enveloped viruses; Poxvirinae; Parapoxvirus. REFERENCE 1 (bases 1 to 660) AUTHORS Mercer,A.A., Fraser,K.M., Stockwell,P.A. and Robinson,A.J. TITLE A homologue of retroviral pseudoproteases in the parapoxvirus, orf virus JOURNAL Virology 172, 665-668 (1989) STANDARD full staff_entry COMMENT Draft entry and clean copy of sequence [1] kindly submitted by A.A.Mercer 12-JUL-1989. FEATURES from to/span description pept 100 579 pseudoprotease homologue BASE COUNT 157 a 170 c 166 g 167 t ORIGIN 1 gctggacttg cgccgcgcca tgctgacccg cccctgattt ttctggagaa agtgtaaatt 61 gtacaccccg tagtcgatcg gccgctcgcc accctagcca tggagttctg ccacacggaa 121 actcttcagg ttgttcgcct cagccaaaac gccactatac cagccagagg atctcctggt 181 gccgcaggat tagacctatg tagtgcatac gactgtgtca tcccttcgca ttgcagtcgc 241 gtagtattta ctgaccttct gataaagccc ccttctggtt gctacggacg catagcacca 301 cgttctggcc ttgcagtcaa acactttatc gacgtgggcg ctggcgtgat cgacgaagac 361 taccgaggca acgtcggagt ggtgctgttt aactttggaa acagtgactt tgaggttaag 421 aagggagaca ggatagctca gctaatatgc gaacgcatat cgtgcccagc agttcaagag 481 gttaactgct tggacaacac tgaccgcggc gactctggat ttggttctac tggatcgggt 541 gcttgtggtg gccgcgatac agcgtggtac atcagctaat caaaattgta aaatgtagct 601 tctttttatt cgagagtctc gcacagttgc gtagataaca ctaattacaa cacactttaa // LOCUS SP8ALU156 192 bp ds-DNA PHG 14-AUG-1990 DEFINITION Bacteriophage SP82 Alu156 promoter region DNA. ACCESSION M22133 M20661 J03904 KEYWORDS promoter. SOURCE Bacteriophage SP82 (in B.subtilis) DNA, clone Alu156. ORGANISM Bacteriophage SP82 Viridae; ds-DNA nonenveloped viruses; Myoviridae. REFERENCE 1 (bases 1 to 192) AUTHORS McAllister,C.F. and Achberger,E.C. TITLE Effect of polyadenine-containing curved DNA on promoter utilization in Bacillus subtilis JOURNAL J. Biol. Chem. 
263, 11743-11749 (1988) STANDARD simple staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly provided by E.C.Achberger, 14-SEP-1988. The nucleotide sequence contains a promoter from bacteriophage SP82 that is recognized by the major B.subtilis RNA polymerase, the sigma-43-containing enzyme. DNA with this sequence displays sequence-dependent DNA curvature. FEATURES from to/span description site 131 136 -35 region site 147 152 -10 region mRNA 159 > 174 Bacteriophage SP82 mRNA start mRNA 156 > 192 Alu156 mRNA BASE COUNT 62 a 45 c 38 g 47 t ORIGIN 1 gaattccccg gatccgtcct tccaagaaag atatcctaac agcacaagag cggaaacacg 61 ttttgttcta catccagaac aacctctgct aaaattcctg aaaaattttg caaaaagttg 121 ttgactttct ctacgaggtg tggcataata atcttaacaa cagcaggacg ctaggacgga 181 tccggggaat tc // LOCUS SP8BAL129 156 bp ds-DNA PHG 14-AUG-1990 DEFINITION Bacteriophage SP82 Bal129 promoter region DNA. ACCESSION M22134 M20660 J03904 KEYWORDS promoter. SOURCE Bacteriophage SP82 (in B.subtilis) DNA, clone Bal129. ORGANISM Bacteriophage SP82 Viridae; ds-DNA nonenveloped viruses; Myoviridae. REFERENCE 1 (bases 1 to 156) AUTHORS McAllister,C.F. and Achberger,E.C. TITLE Effect of polyadenine-containing curved DNA on promoter utilization in Bacillus subtilis JOURNAL J. Biol. Chem. 263, 11743-11749 (1988) STANDARD simple staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly provided by E.C.Achberger, 14-SEP-1988. The nucleotide sequence contains a promoter from bacteriophage SP82 that is recognized by the major B.subtilis RNA polymerase, the sigma-43-containing enzyme. This is the site of transcription initiation for a major early mRNA for Bacteriophage SP82. 
FEATURES from to/span description site 60 65 -35 region site 83 88 -10 region mRNA 95 > 126 Bacteriophage SP82 mRNA start mRNA 95 > 156 Bal129 mRNA BASE COUNT 60 a 29 c 29 g 38 t ORIGIN 1 gaattccccg gatccgtcac ccctaagaaa aaatatctac agaaaatatg aaaaagttgt 61 tgacatttct tcccatccat gctataataa agtcatagag aacaacacta tcaaatgaat 121 ggagagattg atatgatgaa tggggatgac ggatcc // LOCUS XELBGAI 2027 bp ds-DNA VRT 14-AUG-1990 DEFINITION X.laevis adult beta-globin gene (beta-AI), 5' flank. ACCESSION M27933 KEYWORDS beta-globin; globin. SOURCE X.laevis adult DNA. ORGANISM Xenopus laevis Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Amphibia; Lissamphibia; Anura; Archeobatrachia; Pipoidea; Pipidae. REFERENCE 1 (bases 1 to 2027) AUTHORS Meyerhof,W., Stalder,J., Koester,M., Wirthmueller,U. and Knoechel,W. TITLE Sequence analysis of the upstream regions of Xenopus laevis beta-globin genes and arrangement of repetitive elements within the globin gene clusters JOURNAL Mol. Biol. Rep. 14, 17-26 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable copy of sequence [1] kindly submitted by W.Knoechel, 13-SEP-1989. 
BASE COUNT 792 a 266 c 284 g 685 t ORIGIN 1 agatcttgat acgttaactt tactagaaaa taatttaaac cccaatagcc tggttttgct 61 tccaatatgg tttaattata ccttagtttt caggataatg gatctttctg taatttggat 121 cttcatgcct taactgtacc agaaaatcat ttaaacttta aataaaccca atttgcttcc 181 agtacagttt aattatatct tagtttggat aagtacaagg tactgtttta ttattacagt 241 gaaaaaggta atcattttaa aaaaaaaata tatattattt ggataaaatg gagtctatgt 301 gtgatggcct ttccgtaatt ctcggtttct ggcaaacgga tctcatacct gtaataggta 361 tataaaaaac acacattaaa aaatactaca tatatattta tattcttttt tttttttaaa 421 gtgtgtaaat tcatgtcttt aaaataataa aatgtattta tatatatata tatatatata 481 tatatatata tatatatata tatatatata tatatatata tacttcaaca aaaaatttgc 541 caaattcata catacaaaaa aaataaaata ataattttaa ataattgaat ctgtctagct 601 gtttatattc tctgctctgc tggatctgac tcctgaaaaa atgtgcagaa gccatttgat 661 ttacagagct ggaggagaat ggctacatta gtttaaaagc cagaaccagg agaggatgca 721 ggcaacaaaa atggatacac acaaattaac gtctattaca attatattta caaataacct 781 taaagccaac ttttttaaaa attattatat attgtaaagt tgcttagaaa ccaatttttt 841 acttataggg agcaaaaaat agggagatcc tgtaaaacag aagctgcacc aaacatagat 901 caagctatcg agctttccat acgtatacat ttatttgaaa ggcactgtta aggagccacg 961 gtgctgtaca gtgcataaaa gtacaatata tatatataaa agtatacaca gggaagacaa 1021 atcacacaat gaatatacac agagctcata tcagaacaaa cagcttaagt gctttgtggt 1081 aagagacaca gtgggaagga ggtccctgtc ccgtagagct tacagtctta cagagctcag 1141 ttcaatccat atgactcaac catttaatta ataaaataat ttgcctttta atcattaatt 1201 aattccacac ttccatgtat aaatggaata tatatgtaag atttatatgt aatagctata 1261 tatgtaagat ttgatatttt ttttgtagga ataaaatgaa aatcaggcaa ataaaaaaca 1321 acatatatat gtttaaaaac ggtgttaatt tctatgcaac atgacatgaa aaagactttt 1381 caatattttt acatatgtat acataagata tatgattgaa ccatttgaat aataaaatag 1441 cttttgcctt accatcatta attattctac catactgtat gtaaaaagca ttgctatatg 1501 taagatttga ttatattttg ttgtaggaat aaaatgaatt ccaggcatat aaaaacacat 1561 ttataaaaaa catttataaa aaacactaca tatacatata catatatata tatatatata 1621 tatatatata tatatatatg tatatatata agtttaaaaa gtgtgttaat 
ttataatgtc 1681 tttctggaaa tagaatttca cacttcattg tatacaaaat tattaatatt tgtaatattt 1741 gattatatta tgttgtaggg ataaaatgaa taccaggcat ataaaaacac actttaaaaa 1801 aaaaaaaata catagataat aatttaataa tttgtattta ttttttctta atattctagc 1861 tctgctgtaa taaaaaaaac atgcatctaa aagtggtgcc aaatgggagg gtacaaatgg 1921 gctgggcaaa tgtaacgtgt gcttatccta gccaatcaac aggcagagtg gaaaggggca 1981 gtgcatcctt acagctacat aaagtctgat ggatggagaa ttagagc // LOCUS XELBGLI 6777 bp ds-DNA VRT 14-AUG-1990 DEFINITION X.laevis larval beta-globin gene (beta-LI), 5' flank. ACCESSION M27934 KEYWORDS beta-globin; globin. SOURCE X.laevis larval DNA. ORGANISM Xenopus laevis Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Amphibia; Lissamphibia; Anura; Archeobatrachia; Pipoidea; Pipidae. REFERENCE 1 (bases 1 to 6777) AUTHORS Meyerhof,W., Stalder,J., Koester,M., Wirthmueller,U. and Knoechel,W. TITLE Sequence analysis of the upstream regions of Xenopus laevis beta-globin genes and arrangement of repetitive elements within the globin gene clusters JOURNAL Mol. Biol. Rep. 14, 17-26 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable copy of sequence [1] kindly submitted by W.Knoechel, 13-SEP-1989. 
BASE COUNT 2121 a 1247 c 1228 g 2181 t ORIGIN 1 gaattcaaag cttttttttt tattaaacag ttttattgta ttttcaaacg aaaaacaagc 61 agaggtaaga cagtcaacag gttaacatta atgctgcgtg aagggtacta tacattgtgt 121 ttacattaca acttgttgga cattgatatg tcacttctgt gaatttgaag ctttacattt 181 aactaaaatt tgtgatggaa tgtctaacct gcatcccagt ccaaagaaat ttcaaagtag 241 aagatgacat aatgttggta gagatagtga tgagcggatt ttttgccagg tatggatttt 301 ggcaaaattc cgcgcttctt cgtctgcaat tttttttccc aaaactgcag caaaaatcca 361 ccataaccaa aaagtcacaa agacaaaatt gtcgcagaga caagaaagtc acagtaagac 421 ttgatgctcc tgattcactt gcactgacca caccactgta ttaaagggca gagaggggac 481 tataatgcag agacccatgg tccaggctcc tatgaccatg gggtctgctg tatagctgag 541 ctagttacac cagtaaggcc aaaacaaaat ggacttgcat gctggaataa tatgcaaaac 601 tgctgcagtg cctgttttta atctgctggg taaagagtgc aattacaaat gttaggataa 661 ttgcttatca tactctgctg catacactta ggggcccatt tacttagctc gagtgaagga 721 atagaggaaa aaaacttaga atttcgattg ttttttttgg ctacttcgac ttcgacctta 781 gacttcgaat cgaacgattc aaactaaaaa tcgtttgact attcgaccat tcgatagtca 841 aagtactgtc tctttaagaa aaaactttga ccacctagtt cgccacatta aagctaccga 901 agtcaatgtt agcctatggg gaaggtcccc atatgctttg ctagcttttt ttggtcaaaa 961 ataaaccatt cgatcgatgg attaaaatcc ttcgaatcga tcgaacgaat aatgctaaat 1021 cctttgactt cgatattcga actcgaagga tttaacttcg acagtcgaaa atcgagggtt 1081 aattaaccct cgatattcga ccttaagtaa atttgcactt attattgcaa atatttgggt 1141 ccatgacaga gtcatctgta tataatgtga aattacaaat actggtgcct cccctgtttt 1201 actttgctct atgtgagaaa aataatggag tcagtgccat acatatcctt gtgtgtatgg 1261 tggaaattgt agatgtcttg ggggcaaatt tactaaaggg cgaagtggct aacgctaggg 1321 aaaattcgcc agcgttacgt caatttgcca cttcgacaat ttagtttacg gttaccatgg 1381 cgaaaattcg ctagcaatgt aaatagacca gcgcaacttc acaccctaac gctggcgaag 1441 tcaggatgcc cacattcccc ctacatttcc taacatatgg cacctgaatt atactagggg 1501 cacatgtgta gggctttttt taagtttccc tgggcctctg tagtgttatg tatttgctgc 1561 agcaatatac atgtatacaa atttccaatc ggtagcgtaa cctcgaaccg ctgatcgtaa 1621 catcactagc gcaacttcgc aaatgattgg taacttgtgt 
gcaacttcgg atcttcgtga 1681 atttgcgcag ccactgcgaa gctatgcctg gcgaagtgcg gcgaatgcaa gtctcgggat 1741 ctccgcaggt aagtaaattt gccccatggt cagaggcaag gccagattat gtactaggtg 1801 acctaagaat caatactgtc cattctaaaa gtgcaagttc ataagtgccc gcaactacag 1861 aaacaatagg ggagaactaa caatctgttg taaacaacat tacaaggttg gctccctcat 1921 tgtttatatt atagctgtat aactgtaatg atgagtacga tctaagatat aatgaatctt 1981 attgcaggca aaacaatcct gttgattaat taatgcttaa attatcagaa attacaaaaa 2041 cctcaggtcc tgtgcattct ggataacagg tcccatacct gtactaaaac atgggaccag 2101 ggtgtctgca ttgatcaatc acctctttta tgattgtttt gggccatcac tctacttcaa 2161 gatgctgatg atatattacc aataaatgtt atattatata cttaaaaatc ttaattgaat 2221 taatatagtc aaatccttga tggagacaga cctagtagta tcatggataa taaaactagc 2281 aacagcaagc attggcccga cttgccatct tggagtcttg aaggaatctt ccacctttga 2341 ggaaaattgg agacagcttg tctatttttc aacctcttct aatatctaat tgaagaagat 2401 ccttacatac tgtatgtggt ggaaaatgca tgtttcttta aagatatgct gattgttgca 2461 ccaatctttg ctcaaagatc ttataagaaa tctttaagca tgactgtctg caactatgac 2521 tattataaaa tcctttccat gtagagtttt catccttttt gtgggtcaaa ggctgcccct 2581 cagcaatatc aggggaatga aattaaagtc acaaagagca aaacaattcg caccaatagg 2641 actaaaaatc cacatctcgc aatgcaatat tgttccttaa actgttattg taattgcgaa 2701 ttttaattgg ccattgcgga ttttaattgc gcactcttaa gaagtgcttg aagttgtcgt 2761 aatcttttgg agcaaacata acgacttttt cattaagagg tttaattaca ttgacgcatt 2821 ggcgcaaact ataaaatttg caaatggtct tccactgtcg gaagtggtcg caaaacagtt 2881 tctgggctcg caaaagctat attaaatttg cgaaagcaaa atgtgttcgc gcaaaggtat 2941 aacttttgca ttgcgaatag ttttccgtta gcaactttta ttgcattccc ctgtaaatat 3001 ctaataagca tggcctcgag cccaaaagac acctttttag gtaaagaaat aaatggggat 3061 ttcattctat aagtaattga atttgcacta aatattagta agtcggtttc ttgccctact 3121 ccaaccaaac tcaagaactt tcatttatta aagcacaaga aactctaact cacatattaa 3181 caaatagtta tagttggtca aattgtagct cagttaaggg tatattatat atttctgttt 3241 gttcgggtgt ggtgggccag tttttgaaac agtcaactgt tttacttaca gcagatgtcc 3301 aggtggcttg ccatatcttt gtcaaaaaca aatatattgt cagtattgtt 
ttttcaacat 3361 ctgccttagt tagataagaa ttgacaatat agaaccaagg gtatctaaaa atgctgctct 3421 gccttgcaat ctaatggtgg gtgggtccaa tgatttgtgt atttgcctga aaaaaaggga 3481 atattgttct ctcccttacc ttttttccca aagaaattgt ttcttttaat gtgtccaaaa 3541 tacagcaact tcagtcttgt gatttgagct tcaagtgaga tcagagacat gatttgctca 3601 gggatccatt tgtttgtgtt cctttcttcc acagtattct caaaagtctt tactaaaacc 3661 aaaatttatt agtacatttc cttgtactgc cacatttaca tctattaaga gtgacatcaa 3721 atactataac tggacaattc ccaaagtaac ctcagtacat gttaaaatat cgttgacgtc 3781 ttccatgtct cattctaagt gtcaatctgc tacttgacta taagattttt gttgtttata 3841 agtgacccag taaggcaaaa gctatacata actagctgcc cacaaactgg ccaatataaa 3901 gggagaagga aaattgttgt tcccactgga gttgttcccc tggttgggga aaaaatacta 3961 ttttgtatac aaaatgctgt tctggggtca ccaggagaga gcttctgatg ttcagggcca 4021 ggtagtgaca taagcctgag aataagactt aggggcacat tcaagctcgg gtgaatgaat 4081 agagggaaaa aaactcctcg actatcgaat tggcgtaaat tcgcctgagt agaatgattc 4141 aaatagattg agcgaaaaaa cgctgcgact attcgcccat cgatagtcga agtattgtct 4201 cttttaaaaa tcatttgact gcctacttcg ccagataaaa cctaccgaat tgctttaaaa 4261 gcctatggga aagtcccata ggcttctttt ctacgttttt gatcgaataa aaaggcattc 4321 gatcgaatat tcgatcgaat gaaaatcctt cgattgaata ttcgatcgtg cccattcgat 4381 tattcgccag cgcgtaaatt tgcccgaatt ccctattcga ttccattctc cagtcgaatt 4441 tcgagggatt taacccctcg aaattcgacc cttgatacat ctgcccctta gtgtgccaac 4501 ttgctcattg tgtgcatgtg tgtgacatgc cataaggctc tcttattaag cgcatgtatg 4561 tgatgaaaca taaccatccc cactgggagc tccttcatgg tttagcagaa tagcgctcac 4621 taccagcttt ttattcaaaa actgatattg tttccctcaa ccagagtata agctctatta 4681 gcttgcacca tcagtggggg aatttttttt cccctattag gtttccttta agctgcaaac 4741 ttgacctctc cttcccatct gcagtatatt gaccaatata agggaccaac cccacagtaa 4801 gatatctatt gtgtatgttt caaaatccca ttaggtaagg acagtacatt tatgtggtcc 4861 ctataggccc tcattatgat ctaattattg ggtcaatccg tcgtttttgg tacagtggtg 4921 ccagccttga actagagtgg taaaagaggg ctttgttggc tctttgagca tatcatagag 4981 ccttcagcaa aagttcactt tttaaatgta caccaatgaa tggagatttt tgaggccccc 
5041 aaaattgtat tgctgtagat cctgcaacag ccaatgatcc ctttatctgc tctgaaatct 5101 tttttgtcgc tgctgctgct actggttaaa tacagtatag ttgaaaaaat ataggctttg 5161 agaataaaac ctgatgttca tttgcttttt aattattact ttacatcccc tttaaaaata 5221 tatacacatc actattccat gcattacact catttttaat tagacaaatc tataagaaat 5281 tctgcgagat gacacttttc atgataagca ttttgtaaaa ttgtaatatg ttcagttttt 5341 ttttttaaaa gttcaatgcc acactttatt tcaaaatgta ttaaggtgca gtaattatat 5401 taaataaatg tattgtaggg tacatgaata tatgtaacat ttaaaatgtg tgtttatgca 5461 cttctttcaa gtacagtaca tttgcactgt gatcaaatat taatttgaac tttaacagtc 5521 ctatctctac acctttatct tgtcctgggg atcagtctgt tttttagtga tatcttgtaa 5581 cacagaactt taaacaaaag ggctccgttt tgcacgtaga cctgtttgtg aatccatggc 5641 aattctgcca cctaaagcat acataacatt tagcatcttt ttttggtgtt ttttagacag 5701 atgatggtat agccatttgt gcaaataaaa tcagatattt tatcccaaat tatttgtgct 5761 gttagttgta tagggtttca acaaaatatc ttatttatca tttagagcaa atacttatgt 5821 gttacagtat ctgcaagtag tcaagtttga gcttaaaatt cccataattc ataattaagg 5881 ggatggctta gtataaaaaa acgtggaaaa aaaaacgtgt acagttatgc ttttatattg 5941 ccttgtaagt tcttttttat actattatta ttttaatgac cacgttttga attattgcat 6001 ggatttatga aaaccagttt aattgcaaag aggctcctaa aaattattta ttataagtta 6061 aaatttagta tatgcgtgca tgtatatgta acaatgcact ctcatatcta gtaaaaatca 6121 aagttgaagt aaagtgtata actaagtttg acctttctca ggcattaatg atcccagagg 6181 aaggccacac tatgtgacca aaacattgga ctacatttat taaatacatt taccttgatt 6241 tcttcaacac aatttgaaag ttcctccatg agctaatata aatttataaa gagagagagt 6301 gagtaaaaca tttttatcag aaaacagtgg cagagtaaat tctttcatac ttacaaaaga 6361 gtgctactat gcgcaacatt aacttgacat ttttgaattg tacctaatgc aattcatgat 6421 atttaaattg aatacattaa ttttaattat ttaattgtcc tgaaatctct acaggttcaa 6481 aaaaataatt ccatttatta catttatttt gtacacttaa ttatctactg ttaagtgtca 6541 caattgccct catttgatgt gggtttaagt ttcatgttgt tataaagaat caactttaca 6601 atttaagaac tatatggcat tccacatata caaaagatat attagcttaa ggttaaaaat 6661 ttattttgaa ggcaataggg tggggtggag gaaaaaaaat atgacacagc agaaatgcac 6721 
aatgggtgtg actcagcatg gccatataaa gcaaggccaa caactcaaag gaacagc // LOCUS XELBGLII 910 bp ds-DNA VRT 14-AUG-1990 DEFINITION X.laevis larval beta-globin gene (beta-LII), 5' flank. ACCESSION M27932 KEYWORDS beta-globin; globin. SOURCE X.laevis larval DNA. ORGANISM Xenopus laevis Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Amphibia; Lissamphibia; Anura; Archeobatrachia; Pipoidea; Pipidae. REFERENCE 1 (bases 1 to 910) AUTHORS Meyerhof,W., Stalder,J., Koester,M., Wirthmueller,U. and Knoechel,W. TITLE Sequence analysis of the upstream regions of Xenopus laevis beta-globin genes and arrangement of repetitive elements within the globin gene clusters JOURNAL Mol. Biol. Rep. 14, 17-26 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable copy of sequence [1] kindly submitted by W.Knoechel, 13-SEP-1989. BASE COUNT 322 a 152 c 128 g 308 t ORIGIN 1 ggatcccttg tctggaaacc agttatccag agggctccaa attatggaaa ggccatctcc 61 catagactca attttaatca aattattatt attttttttt tacaaattaa tgcctttttc 121 aatgtattaa taaaacagta ccttgatccc aaaattggag gcaaaacaat ccagtttgtt 181 ttatttaatg tttaaatatt ttttttaata ttgttttgat ccaaattaca gaaaggcccc 241 ttatccagaa aaacctccat ttaggataag gataacaggt ccaatacatt cataccctgt 301 acaaatctat gctatgttta attacttata aatagatcca catttcaatg gatatttcta 361 gaatatcgta ataacggtat atacttgttc aaagacaaac acatttaatg acctatgcct 421 aactggaata acagtcaagg aaatttaatg gaataatagg tatttcggag ctttccattt 481 attaacccta caaacaacta gttgttgttt caggaaacag cagtagttct atttggctta 541 catcttgaac aaaagcaaag ttgctatagt tttctttttc gtgtaaggaa agaaatgact 601 tgtgtcttta tctctacatt aaaaatgtat ctgccacaca gaatactttc tttttttaac 661 ttatctatag ataacgtatg tgcacccaaa ttgtagctgt gttacatcag cataattaag 721 tgcacacatg aagaaaaaaa atgacagatt gacaaaatgt tatattatat ggtaaggtct 781 cttggataat agcccttatc agtcataact ggttacaaat acagaaaaaa tgaggtgaca 841 cagcataaat gatatgaata cgtcactaac ttacacccct ataaatcaca aggttaaaat 901 attttttttt // LOCUS YSCADK1 2367 bp ds-DNA PLN 14-AUG-1990 DEFINITION 
S.cerevisiae adenylate kinase (ADK1) gene, complete cds, and histone (H2A-1) gene, 3' end. ACCESSION M18455 KEYWORDS adenylate kinase; histone. SOURCE S.cerevisiae DNA. ORGANISM Saccharomyces cerevisiae Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes; Endomycetales; Saccharomycetaceae. REFERENCE 1 (bases 1 to 2367) AUTHORS Konrad,M. JOURNAL Unpublished (1988) STANDARD full staff_entry REFERENCE 2 (sites for [1]) AUTHORS Konrad,M. TITLE Analysis and in vivo disruption of the gene coding for adenylate kinase (ADK1) in the yeast Saccharomyces cerevisiae JOURNAL J. Biol. Chem. 263, 19468-19474 (1988) STANDARD simple staff_entry COMMENT Draft entry and computer readable sequence for [1] kindly provided by M.Konrad 21-JUL-1988. FEATURES from to/span description pept 917 1585 adenylate kinase (EC 2.7.4.3) pept < 1 359 histone H2A-1 protein (AA at 3) ORF 2009 > 2367 ORF BASE COUNT 777 a 473 c 435 g 682 t ORIGIN 1 bp upstream of HindIII site; chromosome 4. 1 aagcttctca atctagatct gctaaggctg gtttgacatt cccagtcggt agagtgcaca 61 gattgctaag aagaggtaac tacgcccaaa gaattggttc tggtgctcca gtctacttga 121 ctgctgtctt ggaatatttg gccgctgaaa ttttagaatt agctggtaat gctgctaggg 181 ataacaagaa gaccagaatt attccaagac atttgcaatt ggctatcaga aatgatgacg 241 aattgaacaa gctattgggt aacgttacca ttgcccaagg tggtgttttg ccaaacatcc 301 atcaaaactt gttgccaaag aagtctgcca aggctaccaa ggcttctcaa gaattataag 361 atcggttctg gtattttaaa gaaggcggaa ggaactaaac tgcttctcca ttgtaactat 421 attaagtgtc tattagacct ttctgttttt cctttttatt tttctatata taatttattg 481 ggtatataat taacggtaac atatatcatg catgatataa attttttgtt gtgtcttcct 541 gttttctctg ttgctctgtt taggttcatt gggcactgtt gcttacttga acatgattga 601 gtagcaaaca cattctgtgc tttgttcact ttgatagtgt gacgaaattt tcacaaccaa 661 aggaaaaaag ctcacgattg cgcgatcctg tcacggagaa ctgtccgcag cagcccgcgg 721 cttctgtcct tattcgccca tttttttttg attttcgact ttttcactct ggctagtttt 781 attacgcata tttgtatata tatatacgca taaatttctg aaatggatta cctggtttat 841 cagaaacgag agctttaata aagaaacatt aacgtttctc 
tggtaaagtc accacacagc 901 atcaaatata acagtaatgt ctagctcaga atccattaga atggtcctaa ttggcccacc 961 tggtgccggt aaaggtactc aagctccaaa tttgcaagag cgtttccatg ccgctcactt 1021 ggccactggt gacatgttga gatctcaaat cgcaaagggc actcaattag gtttggaagc 1081 aaagaaaatt atggaccaag gtggtttagt ctctgatgac attatggtta acatgatcaa 1141 ggatgaattg accaacaatc cagcttgtaa gaatgggttc atcttggacg gtttcccaag 1201 aaccattcct caggctgaaa aattggacca aatgttgaaa gaacaaggaa ctcctttgga 1261 aaaagccatc gaattgaagg ttgatgatga attgttggtt gccagaatta ccggtagatt 1321 aattcaccca gcctctggca gatcctacca caagatcttt aacccaccaa aggaagacat 1381 gaaggatgac gtcaccggtg aagctttagt tcaaagatct gatgacaatg cagacgcctt 1441 gaagaagaga ttagctgctt accatgctca aaccgaacca attgttgact tttacaaaaa 1501 gaccggtatc tgggctggtg ttgatgcttc ccaacctcct gctactgttt gggctgacat 1561 cttgaacaag ctaggtaagg attaatgacc ttgcgcaatg tcttctaaat atcttttctt 1621 ttttttaaat taatgaaaat ataataaatt tttgtatgtg agtacatata tccgcagtat 1681 actataaatg aaataataac ataagtgttt gtaactggct ataacgtcct taaacatgtg 1741 cactgccatt aagtacttag tcaatatttt tatcgttgag aacgaacgca aatcatatta 1801 ctaaatagat gaacaaaagc gcatcacata aacaaaaacg gtataacagt aatagtgaag 1861 ctttttataa acctataacg aaataagcgc ttttcaggtt ttataattca gtcagagttc 1921 taactggaca tcgttttgca ggggataaaa aaaaaaagga agcttcaacc cacaatacca 1981 aaaaagcgaa gaaaacagcc aatgccaaat gacaataaga cacccaaatg ggtccagcac 2041 tcccaagttt actaaaaaac cagtaacccc gaatgataaa attcctgaaa gagaagaaaa 2101 atccaatgaa gtgaagacac ctaaaattcc attattcacg tttgccaaaa gcaaaaacta 2161 ttcaaggccg tcaaccgcaa ttcatacctc acctcatcaa ccaagtgatg taaaaccgac 2221 ttcccataaa cagttgcaac agccaaaatc ctcaccactg aaaaaaaata actataattc 2281 ttttcctcac tcaaatctgg aaaaaataag caacagcaaa ctactctccc ttcttcggtc 2341 caaaacgtca gcaggaagaa ttgaatc // LOCUS YSCSSB1A 1842 bp ss-mRNA PLN 14-AUG-1990 DEFINITION Yeast (S.cerevisiae) heat shock protein 70 (YG101, SSB1) mRNA, complete cds. ACCESSION M25395 KEYWORDS heat shock protein. 
SOURCE Yeast (S.cerevisiae, strain GRF-18) DNA, and cDNA to mRNA, clone F430. ORGANISM Saccharomyces cerevisiae Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes; Endomycetales; Saccharomycetaceae. REFERENCE 1 (sites for [2]) AUTHORS Normington,K., Kohno,K., Kozutsumi,Y., Gething,M.-J. and Sambrook,J. TITLE S.cerevisiae encodes an essential protein homologous in sequence and function to mammalian BiP JOURNAL Cell 57, 1223-1236 (1989) STANDARD full staff_entry REFERENCE 2 (bases 1 to 1842) AUTHORS Normington,K.D. JOURNAL Unpublished (1989) Univ. of Texas SW Medical Center, Dallas, Texas STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly provided by K.D.Normington, 09-JUN-1989. FEATURES from to/span description pept 1 1842 heat shock protein 70 BASE COUNT 510 a 385 c 418 g 529 t ORIGIN 1 atggctgaag gtgttttcca aggtgctatc ggtatcgatt taggtacaac ctactcttgt 61 gttgctactt acgaatcctc cgttgaaatt attgccaacg aacaaggtaa cagagtcacc 121 ccatctttcg ttgctttcac tccagaagaa agattgattg gtgatgctgc caagaaccaa 181 gctgctttga acccaagaaa cactgtcttc gatgctaagc gtttgattgg tagaagattc 241 gacgacgaat ctgttcaaaa ggacatgaag acctggcctt tcaaggttat cgacgtcgat 301 ggtaacccag tcatcgaagt ccaatacttg gaagaaacca agactttctc cccacaagaa 361 atttccgcta tggttttgac caagatgaag gaaattgctg aagctaagat tggtaagaag 421 gttgaaaagg ccgtcattac tgtcccagct tactttaacg acgctcaaag acaagctacc 481 aaggatgccg gtgccatttc tggtttgaac gttttgcgta tcatcaacga acctactgcc 541 gctgctattg cttacggtct aggtgctggt aagtccgaaa aggaaagaca tgttttgatt 601 ttcgatttgg gtggtggtac tttcgatgtt tccttgttgc acattgctgg tggtgtttac 661 actgttaaat ctacttccgg taacactcac ttgggtggtc aagatttcga caccaacttg 721 ttggaacact tcaaggccga attcaagaag aagactggtt tggacatctc cgacgatgcc 781 agagctttga gaagattgag aactgctgct gaaagagcta agagaacctt atcttctgtc 841 actcaaacta ccgttgaagt tgactctttg tttgacggtg aagatttcga atcctctttg 901 actagagcta gatttgaaga cttgaacgcc gcattgttca agtctacttt ggaacctgtt 961 gaacaagttt tgaaggatgc taagatctct 
aagtctcaaa tcgacgaagt tgtcttggtt 1021 ggtggttcca ccagaattcc aaaggtccaa aagttgttgt ctgacttctt tgacggtaag 1081 caattggaaa aatctattaa cccagatgaa gctgttgctt acggtgctgc tgttcaaggt 1141 gctatcttga ccggccaatc cacatctgac gaaaccaagg acttgttgtt gttagatgtt 1201 gctccattat ctctaggtgt tggtatgcaa ggtgacatgt tcggtatcgt tgttccaaga 1261 aacactactg ttccaaccat caagagaaga acctttacta catgtgctga caaccaaacc 1321 accgttcaat tcccagtcta ccaaggtgaa cgtgttaact gtaaagaaaa cactttgttg 1381 ggtgaattcg acttgaagaa catcccaatg atgccagctg gtgaaccagt cttggaagct 1441 atcttcgaag ttgatgctaa cggtatcttg aaggttactg ccgtcgaaaa gtctaccggt 1501 aagtcttcta acatcactat ctctaacgct gttggtagat tgtcttctga agaaattgaa 1561 aagatggtta accaagctga agagttcaag gctgccgatg aagcttttgc caagaagcac 1621 gaagctagac aaagattgga atcctacgtt gcctccatcg aacaaactgt cactgaccca 1681 gtcttgtctt ctaaattgaa gagaggttcc aagtccaaga ttgaagctgc tttgtccgat 1741 gctttggctg ctttgcaaat cgaagaccca tctgctgatg aattgagaaa ggctgaagtt 1801 ggtttgaaga gagttgtcac caaggccatg tcttctcgtt aa // LOCUS ATHGPA1 2030 bp ss-mRNA PLN 14-AUG-1990 DEFINITION A.thaliana G protein alpha subunit 1 (GP-alpha-1) mRNA, complete cds. ACCESSION M32887 KEYWORDS G protein. SOURCE A.thaliana (strain Landsberger) flow DNA and, cDNA to mRNA. ORGANISM Arabidopsis thaliana Eukaryota; Plantae; Embryobionta; Magnoliophyta; Magnoliopsida; Dilleniidae; Capparales; Brassicaceae. REFERENCE 1 (bases 1 to 2030) AUTHORS Ma,H., Yanofsky,M.F. and Meyerowitz,E.M. TITLE Molecular cloning and characterization of GPA1, a G protein alpha subunit gene from Arabidopsis thaliana JOURNAL Proc. Natl. Acad. Sci. U.S.A. 87, 3821-3825 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by E.M.Meyerowitz, 16-MAR-1990, for release after publication. FEATURES from to/span description pept 600 1751 G protein alpha subunit 1 (GP-alpha-1) pept 490 552 ORF1 pept 568 591 ORF2 mRNA 401 1885 GP-alpha-1 mRNA mRNA 401 1921 GP-alpha-1 mRNA (alt.) signal 312 318 pot. 
TATA box rpt 17 36 inverted repeat 1A rpt 188 207 inverted repeat 1B rpt 323 339 inverted repeat 2A rpt 341 357 inverted repeat 2B BASE COUNT 631 a 346 c 413 g 640 t ORIGIN Chromosome 2 near erecta. 1 gttaacttaa tagtatataa aataaaaatg catataggtt ccgtaattaa tctcttatcg 61 tcacgagagg cacatctttt tccaacattt gaccactctc tctctctctc tcaggacctt 121 tcggcgtaat ttcgtcttcc cctttgctta acattttctt tctttctttt tgaccaaata 181 ttaaaaatat atccattttt attttatttt taattaaatt cataatttgc atttgtaata 241 gaaaaaaaaa gaagagaata aatccaaaag agtgaagcaa aaacattaaa gcggaaagaa 301 agtggtaaaa caataataga aacaggagaa gcagaagtac tacttcttct tcttctgctc 361 tcttctcaga ccttgttttg tactttcttc ttcttcttct ttcttcttct tgtttgcgaa 421 ctccgatatc ttcttcacta cctttgactc catttctttt tcttcttcag gtgtaggcat 481 tgtcttgtta tgagaagcaa ctgtagctgg aagctcaagt atttgttttt agctgtggag 541 cttgaatctt gatagttttc gacttctatg ttattacctg tggggatata gaaacaatca 601 tgggcttact ctgcagtaga agtcgacatc atactgaaga tactgatgag aatacacagg 661 ctgctgaaat cgaaagacgg atagagcaag aagcaaaggc tgaaaagcat attcggaagc 721 ttttgctact tggtgctggg gaatctggaa aatctacaat ttttaagcag ataaaacttc 781 tattccaaac gggatttgat gaaggagaac taaagagcta tgttccagtc attcatgcca 841 atgtctatca gactataaaa ttattgcatg atggaacaaa ggagtttgct caaaatgaaa 901 cagattctgc taaatatatg ttatcttctg aaagtattgc aattggggag aaactatctg 961 agattggtgg taggttagac tatccacgtc ttaccaagga catcgctgag ggaatagaaa 1021 cactatggaa ggatcctgca atccaggaaa cttgtgctcg tggtaatgag cttcaggttc 1081 ctgattgtac gaaatatctg atggagaact tgaagagact atcagatata aattatattc 1141 caactaagga ggatgtactt tatgcaagag ttcgcacaac tggtgtcgtg gaaatacagt 1201 tcagccctgt gggagagaat aaaaaaagtg gtgaagtgta ccgattgttt gacgtgggtg 1261 gacagagaaa tgagaggagg aaatggattc atctgtttga aggtgtaaca gctgtgatat 1321 tttgtgctgc catcagcgag tacgaccaaa cgctctttga ggacgagcag aaaaacagga 1381 tgatggagac caaggaatta ttcgactggg tcctgaaaca accctgtttt gagaaaacat 1441 ccttcatgct gttcttgaac aagttcgaca tatttgagaa gaaagttctt gacgttccgt 1501 tgaacgtttg cgagtggttc agagattacc 
aaccagtttc aagtgggaaa caagagattg 1561 agcatgcata cgagtttgtg aagaagaagt ttgaggagtt atattaccag aacacggcgc 1621 cggatagagt ggacagggta ttcaaaatct acaggacgac ggctttggac cagaagcttg 1681 taaagaaaac gttcaagctc gtagatgaga cactaagaag gagaaattta ctggaggctg 1741 gccttttatg accttattat tacatatctc tagtaaatta cctctcctta ttattataag 1801 aaaaactcga aaactgaatg accgtgtaat ttatctttcg ggacaaaaga cttagcgatt 1861 caaaatctaa tgtgtctcga tggctacgac tagtttctat tttatcattg tttttgttaa 1921 cattcctctg tctttgactt cttatttttt ttctcatcaa aaacatctca ttttgatctt 1981 gtttttgggg ttatattatt attaaaatga ggcatccaca tcccgaaatc // LOCUS BLYABA 998 bp ss-mRNA PLN 14-AUG-1990 DEFINITION Barley abscisic acid (ABA) mRNA, complete cds. ACCESSION M36000 KEYWORDS abscisic acid. SOURCE Barley (H.vulgare L. cv. Himalaya) seed, cDNA to mRNA, clone pHV A1-1. ORGANISM Hordeum vulgare Eukaryota; Plantae; Embryobionta; Magnoliophyta; Liliopsida; Commelinidae; Cyperales; Poaceae. REFERENCE 1 (bases 1 to 998) AUTHORS Hong,L., Uknes,S.J. and Ho,T.-h.D. TITLE Cloning and characterization of a cDNA encoding a mRNA rapidly- induced by ABA in barley aleurone layers JOURNAL Plant Mol. Biol. 
11, 495-506 (1988) STANDARD simple staff_review FEATURES from to/span description pept 120 761 abscisic acid (ABA) mRNA < 1 998 ABA mRNA BASE COUNT 249 a 284 c 323 g 142 t ORIGIN 1 gtgccggtag taaatcatga gcatctcttg cgactcgaaa cgtagtacag caacagccta 61 aagcgagtcc gagtggtgat tccagttcgt gtttgtttga gctagatcgt gagacgaaga 121 tggcctccaa ccagaaccag gggagctacc acgccggcga gaccaaggcc cgcaccgagg 181 agaagaccgg gcagatgatg ggcgccacca agcagaaggc ggggcagacc accgaggcca 241 ccaagcagaa ggccggcgag acggccgagg ccaccaagca gaagaccggc gagacggccg 301 aggccgccaa gcagaaggcc gccgaggcca aggacaagac ggcgcagacg gcgcaggcgg 361 ccaaggacaa gacgtacgag acggcgcagg cggccaagga gcgcgccgcc cagggcaagg 421 accagaccgg cagcgccctc ggcgagaaga cggaggcggc caagcagaag gccgccgaga 481 cgacggaggc ggccaagcag aaggccgccg aggcaaccga ggcggccaag cagaaggcgt 541 ccgacacggc gcagtacacc aaggagtccg cggtggccgg caaggacaag accggcagcg 601 tcctccagca ggccggcgag acggtggtga acgccgtggt gggcgccaag gacgccgtgg 661 caaacacgct gggcatggga ggggacaaca ccagcgccac caaggacgcc accaccggcg 721 ccaccgtcaa ggacaccacc accaccacca ggaatcacta gacgcatgcg ttcgcgctta 781 atttccgttc ctttagtcgt gtttggtcgt tcgagggcct tctacatatt tcatatttgt 841 atgtttccac tctttcatga tttccgctca tttagtgtaa agtttgcctc cgatttgatg 901 tactcgtctc tggttctgta atgagttata atccatgggc tttggtgtaa atggataacg 961 aggacactcg aaggcggcaa taaagttgta tgtgatcg // LOCUS MVARRNAG 1466 bp ds-DNA BCT 14-AUG-1990 DEFINITION M.vannielii 16S ribosomal RNA gene. ACCESSION M36507 KEYWORDS 16S rRNA; small subunit ribosomal RNA. SOURCE M.vannielii DNA. ORGANISM Methanococcus vannielii Prokaryota; Bacteria; Mendosicutes; Archaeobacteria; Methanococcales; Methanococcaceae. REFERENCE 1 (bases 1 to 1466) AUTHORS Jarsch,M. and Boeck,A. TITLE Sequence of the 16S ribosomal RNA gene from Methanococcus vannielii: Evolutionary implications JOURNAL Syst. Appl. Microbiol. 
6, 54-59 (1985) STANDARD simple staff_review FEATURES from to/span description rRNA 1 1466 16S ribosomal RNA BASE COUNT 335 a 372 c 473 g 286 t ORIGIN 1 attccggttg atcccgccgg aggctactgc tattggggtt cgactaagcc atgcgagtct 61 atggtttcgg ccatggcgga cggctcatta acacgtggtt aacttaacct caggtggagc 121 ataaccttgg gaaactgagg ataattctcc ataagaaaag cagtctggaa cgattctttt 181 ctgaaagcat atgcgcccga ggataggact gcgctcgatt aggtagttgg tggggtaatg 241 gcccaccaag cctacgatcg atacgggcct tgagagaggg agcccggaga tggggactga 301 gacacggccc caggccctac ggggcgcagc aggcgcgaaa cctccgcaat gcacgaaagt 361 gcgacggggg gaccccaagt gctcatgcac agcatgggct tttatcaagt gtaaacagct 421 tgaggaataa gggctgggca agttcggtgc cagcagccgc ggtaataccg acggcccgag 481 tggtagccac tcttattggg cctaaagcgt ccgtagccgg tccagtaagt ccctgtttaa 541 attctctggc ttaaccagag gactggcagg gatactgctg gacttgggac cgggagagga 601 caagggtact ccaggggtag cggtgaaatg tgttgatcct tggaggacca cctatggcga 661 aggcacttgt ctggaacggg tccgacggtg agggacgaaa gccaggggcg cgaaccggat 721 tagatacccg ggtagtcctg gccgtaaact ctgcgaacta ggtgtcacct gggcctcgag 781 cccaggtggt gccgaaggga agccgttaag ttcgccgcct ggggagtacg gtcgcaagac 841 tgaaacttaa aggaattggc gggggagcac cacaacgggt ggagcctgcg gtttaattgg 901 attcaacgcc gggcatctca ccacgagcga cagcatgatg acggccaggt tgacgacctt 961 gcctgaagcg ctgagaggtg gtgcatggcc atcgtcagct cgtaccgcga ggcgtcctgt 1021 taagtcaggt aacgagcgag acccgtgccc tatgttgcga ctactttctc cggaaggtaa 1081 gcactcatag gggaccgcta gcgctaagct agaggaagga gcgggcaacg ataggtccgc 1141 atgccccgaa tctcctgggc tacacgcggg ctacaatggc taggacaatg ggctgctacc 1201 ctgaaaaggg acgcgaatct ccgaaaccta gtcgtagttc ggatcgtggg ctgtaactcg 1261 cccacgtgaa gctggaatcc gtagtaatcg cagttcataa tactgcggtg aatgtgtccc 1321 tgctccttgc acacaccgcc cgtcacacca cccgagttgg gttcaggtga ggccttggcc 1381 tttggctagg gtcgaacctg ggctcagcga ggggggtgaa gtcgtaacaa ggtagccgta 1441 ggggaacctg cggctggatc acctcc // LOCUS TOM2A11 526 bp ss-mRNA PLN 14-AUG-1990 DEFINITION Tomato fruit-specific protein mRNA, complete cds. 
ACCESSION M21775 X13741 KEYWORDS fruit-specific protein. SOURCE Tomato (strain UC82B) red fruit stage, cDNA to mRNA, clone 2A11. ORGANISM Lycopersicon esculentum Eukaryota; Plantae; Embryobionta; Magnoliophyta; Magnoliopsida; Asteridae; Solanales; Solanaceae. REFERENCE 1 (bases 1 to 526) AUTHORS Pear,J.R., Ridge,N., Rasmussen,R., Rose,R.E. and Houck,C.M. TITLE Isolation and characterization of a fruit-specific cDNA and the corresponding genomic clone from tomato JOURNAL Plant Mol. Biol. 13, 639-651 (1989) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by R.E.Rose, 09-DEC-1989, for release after publication. FEATURES from to/span description pept 34 324 fruit-specific protein BASE COUNT 163 a 97 c 92 g 174 t ORIGIN 1 tgctcatcaa ttagcaatta atccaaaacc attatggctg ccaaaaattc agagatgaag 61 tttgctatct tcttcgttgt tcttttgacg accactttag ttgatatgtc tggaatttcg 121 aaaatgcaag tgatggctct tcgagacata cccccacaag aaacattgct gaaaatgaag 181 ctacttccca caaatatttt gggactttgt aacgaacctt gcagctcaaa ctctgattgc 241 atcggaatta ccctttgcca attttgtaag gagaagacgg accagtatgg tttaacatac 301 cgtacatgca acctgttgcc ttgaacaata tcaatgatct atcgatcgat ctatctatct 361 atttatctgt ctctgcgcgt atagtgttgt ctgtaccttt ggtgtgaaga atatgaataa 421 agggatacat atatctagat atattctagg taatgtccta ttgtatttaa aatttgtagc 481 aatgattgtt tgaataaaaa cataccatga gtgaaataat tattcc // LOCUS TOM3H11 774 bp ss-mRNA PLN 14-AUG-1990 DEFINITION Tomato fruit-specific protein mRNA, complete cds. ACCESSION M21776 X13742 KEYWORDS fruit-specific protein. SOURCE Tomato (strain UC82B) red fruit stage, cDNA to mRNA, clone 3H11. ORGANISM Lycopersicon esculentum Eukaryota; Plantae; Embryobionta; Magnoliophyta; Magnoliopsida; Asteridae; Solanales; Solanaceae. REFERENCE 1 (bases 1 to 774) AUTHORS Pear,J.R., Ridge,N., Rasmussen,R., Rose,R.E. and Houck,C.M. 
JOURNAL Unpublished (1989) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by R.E.Rose, 09-DEC-1989, for release after publication. Author address: R.E.Rose, Calgene Inc., 1920 Fifth St., Davis, CA 95616 FEATURES from to/span description pept 214 504 fruit-specific protein BASE COUNT 247 a 143 c 119 g 265 t ORIGIN 1 tttttttgag caaagggcaa ctcagatatc caaagatgaa tccaacatat agcttacagc 61 tgggagaaca ttgtctaact cttctgaaat ttaaatgtta tccagaatcc ttcatcataa 121 aataatatca aaatgcaaat ctattttttc tactcttgtc tagcttcaac tttcttcttc 181 tgctcatcaa ttagcaatta atccaaaacc attatggctg ccaaaaattc agagatgaag 241 tttgctatct tcttcgttgt tcttttgacg accactttag ttgatatgtc tggaatttcg 301 aaaatgcaag tgatggctct tcgagacata cccccacaag aaacattgct gaaaatgaag 361 ctacttccca caaatatttt gggactttgt aacgaacctt gcagctcaaa ctctgattgc 421 atcggaatta ccctttgcca attttgtaag gagaagacgg accagtatgg tttaacatac 481 cgtacatgca acctgttgcc ttgaacaata tcaatgatct atcgatcgat ctatctatct 541 atttatctgt ctctgcgcgt atagtgttgt ctgtaccttt ggtgtgaaga atatgaataa 601 agggatacat atatctagat atattctagg taatgtccta ttgtatttaa aatttgtagc 661 aatgattgtt tgaataaaaa cataccatga gtgaaataat tattccacat taattcacgt 721 atttatttca cttatgatac gtatttttgt tcctttcgcg taaaaaaaaa aaaa // LOCUS TOMFSPG 4654 bp ds-DNA PLN 14-AUG-1990 DEFINITION Tomato fruit-specific protein gene, complete cds. ACCESSION M21777 X13743 KEYWORDS fruit-specific protein. SOURCE Tomato (strain VFNT Cherry) DNA. ORGANISM Lycopersicon esculentum Eukaryota; Plantae; Embryobionta; Magnoliophyta; Magnoliopsida; Asteridae; Solanales; Solanaceae. REFERENCE 1 (bases 1 to 4654) AUTHORS Pear,J.R., Ridge,N., Rasmussen,R., Rose,R.E. and Houck,C.M. TITLE Isolation and characterization of a fruit-specific cDNA and the corresponding genomic clone from tomato JOURNAL Plant Mol. Biol. 
13, 639-651 (1989) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by R.E.Rose, 09-DEC-1989, for release after publication. FEATURES from to/span description pept 1379 1445 fruit-specific protein, exon 1 2160 2383 fruit-specific protein, exon 2 pre-msg 1326 2640 mRNA and introns IVS 1446 2159 fruit-specific protein intron signal 1064 1067 TATA-box (put.) signal 1135 1138 TATA-box (put.) signal 1190 1193 TATA-box (put.) signal 1291 1297 TATA box (put.) signal 2640 2640 polyA signal BASE COUNT 1709 a 723 c 642 g 1580 t ORIGIN 1 ctcgagccct ttaaaaagta tagtcaatat ttacggtgac cgtgaatttc ttaattatga 61 tatataattt aaaagaaatc atgatcacat tctactgatg agaacatgtg ctaatcaagg 121 gaaaacatgg atgtgaaaaa tactttttgt taaaagtaaa aaaaaatgtg aaattttgtt 181 agttatttac tacctataca ttatttgagc atgtgcaaac tttacaaata cctaatagaa 241 gattttcacc tgcctgtata tatgtaaatt aattataatg aacactctca cataaaataa 301 ttatcagtat atacattaat acttgccctc cacaatgaat taaataaaat gtagaacatg 361 atctacactt caataaaact aagaccataa agaataattt caaaatatac acatgtcaac 421 aataaattat ttgcatatta tattaactta ctaaacaatc tttacttttg aaatataaaa 481 ataatcaagt tataagtctg ctcaaagtaa agcacttgtt agactcatct gattttgaga 541 aggtaagcaa attgatggtg cataatagtc acaagtaaaa tataaaatag atttcattag 601 taaaattgtt ttttactttc tttatatata attatcaata tccttcaatg gtaggttaat 661 tatattgtta acttcttgtt gaattaaagc aataagacaa gaatattaaa gataaaagaa 721 caataaaaat agaaagacta agagataaga gttttcttat tcttctttca ataagtatca 781 tcaagtgtat acaatataaa tttttgtatt tttgatctat ctatttataa tgttatatat 841 aagcatacaa aagatcagtc ataaatatga ctttaatcat gaaaataatg aaagagatta 901 tgaaggcgta aggttactag aataatagtc attaaaaaaa ggggttatct ttataattga 961 ataattgatg aagtaatgga gataattagt gagcataaat ttttttaaaa aaatggacat 1021 ttacactata atattttata acactttccc ttaaacatct aggtataaat aatgagtctt 1081 gtcaaaatct tagtaggaaa aattctgtga aattttttta gtgaaaacaa atgatataaa 1141 tatcttgaat actcattatt tgttgtctca ttaaaaatct tatctgacct ataaaataaa 1201 
ttatttgctc aactcaaaat agtttttcat tctaaaatta gtataattat tagtgaatat 1261 ttaattaaca taattgtata ctaaggggcc tataaattgg attcttctca aagaaaaata 1321 aaatcaccac acaactttct tcttctgctc atcaattagc aattaatcca aaaccattat 1381 ggctgccaaa aattcagaga tgaagtttgc tatcttcttc gttgttcttt tgacgaccac 1441 tttaggttca caacacttct cccttatttt gttttcttaa tttcttggaa gtcatatgca 1501 tgtgtttggt atcatggtat atatataaag gaaaatattt ttcttaatta ctggttttct 1561 aatgtttggt aggtaatcgg aaattattat gagataatga acttgcaaag tcattattat 1621 ataacttttt ttttatactt tgatttaaga attcattttt ctcattttat ataaacttat 1681 ttttcaacag aaaatatttt tcgaactatt caaacacacc ctaagacatt acatatatat 1741 atatatacac cctccgtttt atattactta atgcctattg agttggccca ccctttaaga 1801 atgattcaat tagagatatg ttttactaaa ttaacctatg ctttaagact ctaaatttgg 1861 ctattactat tttacgttgt aatttaatga caaacatttc ataatgacta tagtctgaac 1921 ttaattagac agacgtatct atagtttgct tactaatgat tcatagctat atatttggag 1981 aggagagaga caaacgatat taagaaaggg aggagagagg cgaggtaaat ctgaaataga 2041 gaagagaaag gcaaccaatt ttgatcatct atcatacttt tgattattat ttttattata 2101 tgtacgttta cattacagtt ttcgaattct tacattaatc ttaatcataa tatatacagt 2161 tgatatgtct ggaatttcga aaatgcaagt gatggctctt cgagacatac ccccacaaga 2221 aacattgctg aaaatgaagc tacttcccac aaatattttg ggactttgta acgaaccttg 2281 cagctcaaac tctgattgca tcggaattac cctttgccaa ttttgtaagg agaagacgga 2341 ccagtatggt ttaacatacc gtacatgcaa cctgttgcct tgaacaatat caatgatcta 2401 tcgatcgatc tatctatcta tttatctgtc tctgcgcgta tagtgttgtc tgtacctttg 2461 gtgtgaagaa tatgaataaa gggatacata tatctagata tattctaggt aatgtcctat 2521 tgtatttaaa atttgtagca atgattgttt gaataaaaac ataccatgag tgaaataatt 2581 attccacatt aattcacgta tttatttcac ttatgatacg tatttttgtt cctttcgcgt 2641 agatttttga tccttttccc ttttgaatat taaacattaa acacaaataa tgtttattaa 2701 attaagttaa tatttttatt tagctattta tatttttatt tgaaatcaaa cttgataaat 2761 atttataaag ataattaaca agtaatgtga cactaacacc atgtaatatt atcttgtcgt 2821 tatttatgat aatattttaa aattataatt tcagttaaaa aattattaaa aaaacatact 2881 tttaaaaagt 
gagttagcct ccgctaccca catacttatg aattggacta gttgtttttt 2941 gacccacaaa aagaatgggc taattaaacc tgacctatca aatttcagaa tctgcataga 3001 ttagtccgaa cgaaatgagt cagcccgtat tgaacaaaat atcaacaagg acgttatgta 3061 aagatgttta agaaggaaaa aagatttcta atacatatgg actttcaata tcccaacttt 3121 gtctggcgat ctgaaccctg cttagtttgt tgatcattaa cttgtcttgc tatgtattta 3181 agatttaaac tttatatgtt taaacttaca gaaaatacat ataaatctct caagacttgg 3241 caacataatt tactttagta cttaaactac atgaaaattt aaatatcctt ttaacatctt 3301 tgaagtgaat taaattatca caatccgagc ctacaccttg gacgtggccg gcactcaaga 3361 accagtgctg gtccccaagc taaccctcat cctgactgac tacaagcgga aggctaactt 3421 aagtatacaa aagcttaaaa ctgaataaaa taaactttac aaggttttaa cacaaatgaa 3481 caactttgaa gaaaataata tattcaacta gccataaaat agacaacttt agtctttaaa 3541 acatttaata aaataaatgc aaaatataga ctccttaact aaactgacta tctatggagc 3601 ctctaattga taaagatgga agtcgggaca agaccacgac atcctgacta aactgagaag 3661 taaataaaat cccccggaaa aaaaggagcc tcaccatggc taactcgaac tcggggatat 3721 atcaatgaag ctcctgttga tgatcttgaa gacatgtctc tgcatcatca aaagatgcag 3781 gccaaatggc tcagtacgta aaatgtacga gtatgtaagg gaaattctaa agtataacat 3841 aagcttgata cttgaataaa aggaaacata cttacctctt ttcaactcaa ctcaaattaa 3901 gaataagata ctcaactcaa agattaggta ttcaacgcaa atatggcact ctactcaatg 3961 aagtacaaat taactcagga tactcgactt aagatactca actcccgaca ctcaactgaa 4021 ctcatttcaa tataaagcag cttaaaacaa gttcagtata aagtaaagtt gtttaaaaac 4081 atgatgtcaa ctctgtgtgt ataataagga tacaacataa ctttgaaatg tatataaaaa 4141 tacaattaac tgatgtatat aaaaatacat taatctatgg gagattctct aaccgacaac 4201 catcacttaa gggctaagat gatgatatag cgatctaccg cacgctgcca tcgcatctta 4261 tacccggcca aaggtataag acctgaactg cctaatgaat ccactaataa actgttaaaa 4321 ggaatcatct aaaaagtatg acccttttct acccatagtg gctaacatgg tttatggggg 4381 ctgtgagtta tctgaactct cccccatatc ggtgctcaat actactccaa aaaatatact 4441 gctcttatgt ttaaaaacat actgattctg tggtttgaaa ttattgctta aagcttagat 4501 ttttgaaaag ctctcttttg aaaatcgtag tttccttttt cttctattaa agctagacat 4561 aggctatgta gaactctagc 
ttaccttcct tctcaaaagt ttgaaaacat ttgcttagat 4621 tcttagggac tacttagttc ccttgttgga attc // LOCUS ACLP322P 1337 bp ds-DNA SYN 14-AUG-1990 DEFINITION Synthetic plasmid pWH1266 origin of replication (ori) region. ACCESSION M36473 KEYWORDS . SOURCE A.calcoaceticus lwoffi plasmid and pBR322 DNA, clone pWH1266. ORGANISM Cloning vector Artificial sequences; Cloning vehicles. REFERENCE 1 (bases 1 to 1337) AUTHORS Hunger,M., Schmucker,R., Kishan,V. and Hillen,W. TITLE Analysis and nucleotide sequence of an origin of DNA replication in Acinetobacter calcoaceticus and its use for Escherichia coli shuttle plasmids JOURNAL Gene 87, 45-51 (1990) STANDARD simple staff_entry FEATURES from to/span description orgrpl 310 337 origin of replication BASE COUNT 447 a 229 c 251 g 410 t ORIGIN 1 gatcgtagaa atatctatga ttatcttgaa gaacgcaacc ctatagcagc tattgaaatt 61 gatgatttaa ttgaagaaaa gacagattta gttgttgata atcgactgat ggggcgcaca 121 ggcagacaga aagatactag ggagttagtg atacatccgc attatgtggt tgtatatgac 181 atcactgata taatacggat actcagagtg ctacacacat cgcaggagtg gtcatgactt 241 actcatgtac tttggattat ttagtgttat aaaatcctga tttataaatt ttttttgtta 301 aaaaagataa aagccccttg caattgcttg gggctttacc gtaatttatg gggtacagat 361 cttcgatact gacatatcgg caatcgaaag cattaaggtt tgacgaccgc taatgatttc 421 accacagggg cttaatgtac ctgtcttaaa ttctaaggtt ttaactcgct ttgtcaagca 481 tagaccccaa aaatttagcc aatgtctgta actcaatctg tccatgtgtg ggtgatgagg 541 tacagtgacg ctagcacaca tcggaaaaac gctattacta ggggaactga acagagtagc 601 ggacgcaatg agtagtcatt taattggcgg ttatgagcgt gttcaggcgg tgctatcaat 661 cgtaatcata acagtggcag cttgatacag tgatgtcatc cctgatgcga aagcgaccga 721 ccgacggtac atcgaatggg aatactttag ggtgattttt aagaatcgct ctagggtgag 781 tatttcccat tcagctctgc tccctccctc tggtacttta atcaaaagca ctactaaaca 841 tatgttttta aataaaaaat attgatatag agataatatt agtaagaata attaaacaat 901 tgaatataga taaatcattg ttaaataaag attaattatt aaaatgaatg tatacttata 961 tataaatcaa tgatttaaaa tatttgataa agaaaacttt tcaaaaaaaa tataattgag 1021 attgtgtcat ttcggtcaat 
tcttaatatg ttccacgcaa gttttagcta tggtgctaaa 1081 cagaaatttg ctgaaaaaga acttttcact gaactggtta aaatgtaagc agcctgagag 1141 ccgccaaaaa ttttaaaaac aaaccgcctt aatcatcttc aaaaaatacc tctaaaacct 1201 caccatttgc gttttaagac ccatatttca tcctgccctt atgttcccat gctgatagct 1261 ataaagtgtc tgtaatcgct tcctatgacg ttctaggctg ttgataactt ttggaacaac 1321 gcaaaatgtt aaaatcc // LOCUS ATHUXP1A 1078 bp ds-DNA PLN 14-AUG-1990 DEFINITION A.thaliana ubiquitin extension protein (UBQ1) gene, complete cds. ACCESSION J05507 KEYWORDS gene UBQ1; ubiquitin extension protein. SOURCE A.thaliana (variety Columbia) DNA. ORGANISM Arabidopsis thaliana Eukaryota; Plantae; Embryobionta; Magnoliophyta; Magnoliopsida; Dilleniidae; Capparales; Brassicaceae. REFERENCE 1 (bases 1 to 1078) AUTHORS Callis,J., Raasch,J.A. and Vierstra,R.D. TITLE Ubiquitin extension proteins of Arabidopsis thaliana: Structure, localization, and expression of their promoters in transgenic tobacco JOURNAL J. Biol. Chem. 265, 12486-12493 (1990) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by J.Callis, 08-MAY-1990, for release after publication. 
FEATURES from to/span description pept 44 146 ubiquitin extension protein (UBQ1) 234 320 ubiquitin extension protein 406 508 ubiquitin extension protein 888 954 ubiquitin extension protein 1031 1057 ubiquitin extension protein pre-msg < 1 > 1078 UBQ1 mRNA and introns IVS 147 233 UBQ1 intron A IVS 321 405 UBQ1 intron B IVS 509 887 UBQ1 intron C IVS 955 1030 UBQ1 intron D BASE COUNT 295 a 195 c 214 g 374 t ORIGIN 1 ccttgtttag ggtttctacg tgagagaaga cgaaacacaa aagatgcaga tcttcgtgaa 61 aaccttgacc ggcaagacca tcactctcga ggtcgagagc agcgacacca tcgacaatgt 121 caaggccaag atccaagaca aagaaggtat cattcttcct cactcaatct ggattcttct 181 ctttagcttt ttgaaattca gatctcttat catttacttg tttctccttt aaggaatccc 241 tccggatcag cagagattga tcttcgccgg aaagcagctc gaagatggcc gtactttggc 301 tgactacaac atccagaaag gtacgaaatc atccgaatcc ttctgttgat catttcgatg 361 atctgattgt ataaactcta atggattgtt atcatttgta aacagaatct acacttcatc 421 ttgtgttgag gcttagagga ggtattattg agccttcctt gatgatgctt gctcgtaagt 481 acaatcagga taagatgatt tgccgcaagt aagtttcaat ttctccctaa gattgtagtt 541 ttgttgatta atagttcaaa gtaccaaatg tttattagtg tgttagggtt ttagttgttt 601 taatgggaat taacttagct tctcaagcct tatacaatgc gtgttgaggt tataaatgtt 661 cccaatatca aacatttcca ttctctaaac tttgtcagct ggtcaaattt ctcatctttc 721 cagatgattt tcttgaatgg tacatttttg caaatggttt tgaaaatgtg ttgaagcttg 781 tatttttgat tgctgaaaca tcttgtttgt tggatagttt gtcatctatg gtccgatctg 841 atgtctcaat gagattattc taatgtaaat ttcgcatctt ttggtaggtg ctatgctcgt 901 cttcacccaa gagctgtcaa ctgcaggaag aagaagtgtg gtcacagcaa ccaggtttgt 961 ttttgttact tgtttctttt tggtgtttat cagctgaaat gatatctgaa caaacttttt 1021 ttatttgcag ttgaggccta agaagaagat caagtagaga gactcttatc aagaatcc // LOCUS ATHUXP2A 976 bp ds-DNA PLN 14-AUG-1990 DEFINITION A.thaliana ubiquitin extension protein (UBQ2) gene, complete cds. ACCESSION J05508 KEYWORDS gene UBQ2; ubiquitin extension protein. SOURCE A.thaliana (variety Columbia) DNA. 
ORGANISM Arabidopsis thaliana Eukaryota; Plantae; Embryobionta; Magnoliophyta; Magnoliopsida; Dilleniidae; Capparales; Brassicaceae. REFERENCE 1 (bases 1 to 976) AUTHORS Callis,J., Raasch,J.A. and Vierstra,R.D. TITLE Ubiquitin extension proteins of Arabidopsis thaliana: Structure, localization, and expression of their promoters in transgenic tobacco JOURNAL J. Biol. Chem. 265, 12486-12493 (1990) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by J.Callis, 08-MAY-1990, for release after publication. FEATURES from to/span description pept 62 164 ubiquitin extension protein (UBQ2) 243 329 ubiquitin extension protein 433 535 ubiquitin extension protein 771 837 ubiquitin extension protein 929 955 ubiquitin extension protein pre-msg < 1 > 976 UBQ2 mRNA and introns IVS 165 242 UBQ2 intron A IVS 330 432 UBQ2 intron B IVS 536 770 UBQ2 intron C IVS 838 928 UBQ2 intron D BASE COUNT 262 a 164 c 209 g 341 t ORIGIN 1 cttcattttt tacactgcaa acgccgtttc gagacttccg gcgaagaaga cgaacgcaaa 61 gatgcagatt ttcgtgaaaa cgctaaccgg caagaccatc accctcgagg tcgagagcag 121 tgacaccatc gacaatgtca aggccaagat ccaggacaaa gaaggtatat tttctaatgc 181 tttttaaaaa ttgtaacttt gaaattgtga tctcttagtg attcgttttt gctctgtttc 241 aggaattcct ccggaccagc agaggttgat cttcgccgga aaacagcttg aagatggtcg 301 taccttggct gactacaaca tccagaaagg tctctttctc gaatttgaat gttttggatt 361 ctttagaaaa tgaaactagt atagctatga ttgttttgtt gatccaattc tgttttgttt 421 actataaaac agagtcgact ttgcatcttg ttctgaggct taggggaggt atcattgagc 481 cttcattgat gatgttggct cgtaagtata accaagacaa aatgatttgt cgcaagtaag 541 atctctgatc ttttctcttt gttgttctaa gtgtaccatg tccatgtgtg atgattatag 601 taaaaatggg gaattaactt aggggtgaac tattattcat tttctacgtg gattatgatt 661 ctcttagttt gctgataaat gtattgaaat ctatctagtt ctttcactgt aacactatgg 721 ttgtgtgaaa actgtgtctc atgtttgagt tgttaacctt gttggtttag gtgttatgct 781 cgtctccacc caagagctgt gaactgcagg aagaagaagt gtggccatag caaccaggtt 841 tggttttttg tctctttaga ttttaaaatt ccacagaatt 
tgtttgattt gttagttgtt 901 aagtgtgatt ttctgtttat ttttgcagtt gaggccaaag aagaagatca agtgatcttt 961 tgttcaccaa aagctt // LOCUS ATHUXP5A 720 bp ds-DNA PLN 14-AUG-1990 DEFINITION A.thaliana ubiquitin extension protein (UBQ5) gene, complete cds. ACCESSION J05539 KEYWORDS gene UBQ5; ubiquitin extension protein. SOURCE A.thaliana (variety Columbia) DNA. ORGANISM Arabidopsis thaliana Eukaryota; Plantae; Embryobionta; Magnoliophyta; Magnoliopsida; Dilleniidae; Capparales; Brassicaceae. REFERENCE 1 (bases 1 to 720) AUTHORS Callis,J., Raasch,J.A. and Vierstra,R.D. TITLE Ubiquitin extension proteins of Arabidopsis thaliana: Structure, localization, and expression of their promoters in transgenic tobacco JOURNAL J. Biol. Chem. 265, 12486-12493 (1990) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by J.Callis, 08-MAY-1990, for release after publication. FEATURES from to/span description pept 112 585 ubiquitin extension protein (UBQ5) site 339 340 ubiquitin-extension protein junction BASE COUNT 209 a 159 c 164 g 188 t ORIGIN 1 aagcccaaat aaaataaggg tagtaacgta attgagctaa gaaaccctag aggtctcctt 61 cgcctacaaa atccccattt tgataatctt cagcagccgt tgcctcaaaa gatgcagatc 121 ttcgtgaaaa ccctaacggg gaagaccata acccttgagg ttgaatcatc cgacaccatc 181 gacaatgtga aggcgaagat ccaagacaag gaaggaatcc caccggacca gcagcgattg 241 attttcgccg ggaaacagct tgaagacggc cgtaccctcg ccgactacaa catccagaag 301 gaatcgacgc ttcatctcgt cctccgtctc cgtggtggtg ctaagaagag gaagaagaag 361 acttacacca agccgaagaa gatcaagcac aagcataaga aggttaagct cgctgttctt 421 cagttttaca aaatcgatgg atctggaaag gttcagcgtt tgaggaagga gtgccctaac 481 gcaacctgtg gagctgggac tttcatggcg agtcattttg atcgccatta ctgtggtaag 541 tgtggactca cctacgttta ccagaaagaa ggagttgaag cttgatctgt tgtagcggta 601 gatcgatcct tctctttctc tcttatcttt taaaaactgc atttctattt gggaattttg 661 taagctcttt aatttgagtt atcatggatt ctatgttgaa catctttcgt tctggattat // LOCUS ATHUXP6A 681 bp ds-DNA PLN 14-AUG-1990 DEFINITION A.thaliana ubiquitin 
extension protein (UBQ6) gene, complete cds. ACCESSION J05540 KEYWORDS gene UBQ6; ubiquitin extension protein. SOURCE A.thaliana (variety Columbia) DNA. ORGANISM Arabidopsis thaliana Eukaryota; Plantae; Embryobionta; Magnoliophyta; Magnoliopsida; Dilleniidae; Capparales; Brassicaceae. REFERENCE 1 (bases 1 to 681) AUTHORS Callis,J., Raasch,J.A. and Vierstra,R.D. TITLE Ubiquitin extension proteins of Arabidopsis thaliana: Structure, localization, and expression of their promoters in transgenic tobacco JOURNAL J. Biol. Chem. 265, 12486-12493 (1990) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by J.Callis, 08-MAY-1990, for release after publication. FEATURES from to/span description pept 102 575 ubiquitin extension protein (UBQ6) site 329 330 ubiquitin-extension protein junction BASE COUNT 191 a 154 c 153 g 183 t ORIGIN 1 gagctccaaa accctagttt ctcaaccact actcttttat ttcttctcac cacttaaaga 61 gtttccccag aaattttctt ccgccgtaaa agcaaaaaaa gatgcagatc ttcgtgaaaa 121 ccctaacggg gaagacgatc actctcgagg tcgagtcctc tgacaccatc gacaatgtca 181 aggccaagat ccaagacaag gaaggaatcc caccggacca gcagcgattg attttcgccg 241 gaaagcagct cgaagacgga cgtaccttag ccgattacaa catccagaag gaatcaacgc 301 ttcaccttgt ccttcgtctc cgtggaggtg ctaagaagag gaagaagaag acctacacca 361 agcctaagaa gatcaagcac aagcacaaga aggtcaagct cgctgttctt cagttctaca 421 aggttgatgg ttcaggtaag gttcagcgtt tgaggaagga gtgccctaac gcaacctgtg 481 gtgctgggac tttcatggcg agtcatttcg atcgtcacta ctgtggtaag tgtggtctca 541 cctacgttta ccagaaagaa ggagctcagg aatgattttc atctcgatct ctatcatttt 601 gaattgaata ctgctttttt tttggaattt ggaagttgtt tttggatgtt gtggatctta 661 tgttgaactt gtttgaattt c // LOCUS BLYG1HORDA 1614 bp ds-DNA PLN 14-AUG-1990 DEFINITION Barley gamma-1 hordein storage protein gene, complete cds. ACCESSION M36378 KEYWORDS gamma-hordein; storage protein. SOURCE Barley (cv. Carina) leaf DNA, clone lambda-hor-gamma-1. 
ORGANISM Hordeum vulgare Eukaryota; Plantae; Embryobionta; Magnoliophyta; Liliopsida; Commelinidae; Cyperales; Poaceae. REFERENCE 1 (bases 1 to 1614) AUTHORS Cameron-Mill,N. and Brandt,A. TITLE A gamma-hordein gene JOURNAL Plant Mol. Biol. 11, 449-461 (1988) STANDARD simple staff_entry FEATURES from to/span description pept 379 1296 gamma-1 hordein precursor sigp 379 435 gamma-1 hordein signal peptide matp 436 1293 gamma-1 hordein storage protein signal 1372 1377 polyA signal signal 1428 1433 polyA signal BASE COUNT 516 a 433 c 271 g 394 t ORIGIN 1 agatccagtt tatccctcac aacttttaaa acatgacaaa gttagtttta gaaggattgg 61 aagatagata cttatgtgag atgtaaagtg aataagatga gtcagcacct attatcaacc 121 tcatcgtgca cttagaattt atgtacaacc acaaagcaac tttgatgatg tgtccaaaat 181 tacctttgta tgtagtgcca ccaaacacaa acattccaaa tgatcagttc gataagcacg 241 aacgcacatt aaaagaggga agaaaccatg aaaaggaacc atgatatgac atctataaag 301 aggcctacat tttgtagatc acataccatc atccatcctt tacattacta agaaaggttt 361 agctaacaca aatccaccat gaagatctta atcatcctta caatccttgc catggcgact 421 accttcgcca cctctgagat gcaagtcaac cctagtgtcc aagtacaacc gacacaacaa 481 caaccatatc ccgagtctca gcaaccattc atcagtcaat cacaacaaca atttccccag 541 ccacaacaac cattccctca gcaaccacaa cagccatttc ctcaatccca gcagcaatgc 601 ctccagcaac cacaacacca atttccccaa cccacacaac aattccccca acggccacta 661 ctaccattta ctcacccgtt cctaacattc cccgaccaac tactccctca gccaccacac 721 caatcattcc cccagccccc acaatcatac cctcagccac cgttacaacc ttttcctcag 781 cccccgcaac aaaaatatcc cgagcaacca caacaaccat ttccttggca acaaccaact 841 attcagctat atctccaaca acaactgaat ccgtgcaagg agtttctctt gcaacagtgc 901 aggcctgtgt cattgttatc gtacatctgg tcaaagatcg tgcaacagag cagttgtcgg 961 gtgatgcagc aacaatgttg cctgcaattg gcacagatcc ctgaacaata caaatgcaca 1021 gcaatcgata gcattgtgca tgccatcttc atgcagcaag gccaaagaca aggcgtgcag 1081 atcgtgcaac aacaacctca gccgcaacag gtgggtcaat gtgtgctcgt ccagggtcag 1141 ggcgtcgtcc aaccgcaaca actagctcaa atggaagcaa tcagaacgtt ggtgttgcag 1201 tccgttccat ccatgtgcaa ctttaatgtc 
ccacctaact gctccaccat caaagcacca 1261 tttgttggtg tagtcaccgg tgttggagga caatgaaaag atgagagcta taataattag 1321 tacatggatc atcgttgcat aatcgatgga ccaatcgatg taataacgat aaataaagtg 1381 tcgtgcacca tgatgtgtga ttcgaccagg acttgtccaa tcgtgggaat aaaagtcata 1441 aaaagttctt ctaagtgctt catttcttat actttttccc attcattgtg tatgtaaaag 1501 tcgatacaaa actggaatcg taagtctatc cgatctatga tttagtttag acaactagat 1561 taatgttcct tcggtgttag tcgtgcattg tagtaggttc aatctggtgg tacc // LOCUS BOLSLSGRA 1286 bp ss-mRNA PLN 14-AUG-1990 DEFINITION B.campestris S-locus-specific glycoprotein mRNA, partial cds. ACCESSION M36301 KEYWORDS S-glycoprotein; S-locus-specific glycoprotein; glycoprotein. SOURCE B.campestris, cDNA to mRNA. ORGANISM Brassica campestris Eukaryota; Plantae; Embryobionta; Magnoliophyta; Magnoliopsida; Dilleniidae; Capparales; Brassicaceae. REFERENCE 1 (bases 1 to 1286) AUTHORS Takayama,S., Isogai,A., Tsukamoto,C., Ueda,Y., Hinata,K., Okazaki,K. and Suzuki,A. TITLE Sequences of S-glycoproteins, products of the Brassica campestris self-incompatibility locus JOURNAL Nature 326, 102-105 (1987) STANDARD simple staff_entry FEATURES from to/span description pept < 1 1257 S-6-glycoprotein BASE COUNT 345 a 278 c 316 g 347 t ORIGIN 1 ggatccgtcg tcttgattct attttgtcct gccttttcga tcaacacttt gtcgtctaca 61 gaatctctta gaatctcaag caacagaaca cttgtatctc caggtaataa cttcgaactc 121 ggcttcttcc gaaccaactc aagttctcgt tggtatctcg ggatatggta caagaaattg 181 ctcgacagaa cctatgtatg ggttgccaac agagataacc cactctccaa tgccattgga 241 accctcaaaa tctcaggcaa taatcttgtc ctccttggtc acaccaataa atctgtttgg 301 tcgacgaatc ttactagagg aaatgagaga cttccggtgg tggcagacgt tctctctaat 361 ggaaacttcg tgatgcgaga ctccagtaac aacgacgcaa gtgaatactt gtggcaaagt 421 ttcgattacc ctacggatac tttgcttcca gagatgaaac tgggttacga cctcaaaaca 481 gggttgaaca ggttccttac atcatggaga agttcagatg atccatcaag cggggatttc 541 tcgtacaagc tcgaaacccg aagccttcct gagttttatc tatggcatgg gatctttcca 601 atgcatcgga gtggtccatg gaatggagtc cgatttagtg gcataccaga ggaccaaaag 661 
ctgagttaca tggtgtacaa cttcacagag aatagtgaag aggtcgctta tacattccga 721 atgaccaaca acagcatcta ctcgagattg acactaagtt ccgaaggcta ttttcagcga 781 cttacgtgga atccgtcaat agggatatgg acagcgttct ggtcttctcc agtggacccc 841 cagtgcgata catacataat gtgcgggcct tacgcttact gtggcgtgaa cacatcacct 901 gtttgtaact gtatccaagg gttcaatccc cggaatatac agcagtggga tcagagagtc 961 tgggcaggtg ggtgtataag gaggacgcgg cttagctgca gtggagatgg ttttacaagg 1021 atgaagaaca tgaagctgcc agaaactacg atggcgattg tcgaccgcag tattggtgtg 1081 aaagaatgtg agaagaggtg ccttagcgat tgtaattgta ctgcttttgc aaatgcggat 1141 atccggaatg gtgggacggg ttgtgtgatt tggaccggac ggcttgacga tatgcggaat 1201 tacgttgctc acggtcaaga tctttatgtc agattggctg ttgctgacct tgtttagctc 1261 tttctcttaa aataaagcac ggatcc // LOCUS BOVFPKPA 1835 bp ss-mRNA MAM 14-AUG-1990 DEFINITION Bovine fructose 6-phosphate, 2-kinase: fructose 2,6-bisphosphatase mRNA, complete cds. ACCESSION M34241 KEYWORDS bifunctional enzyme; fructose 2,6-bisphosphatase; fructose 6-phosphate,2-kinase. SOURCE Bovine heart, cDNA to mRNA, clones BH[9G,7H]. ORGANISM Bos taurus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Artiodactyla; Ruminantia; Pecora; Bovidae. REFERENCE 1 (bases 1 to 1835) AUTHORS Sakata,J. and Uyeda,K. TITLE Bovine heart fructose 6-phosphate, 2-kinase/fructose 2,6-bisphosphatase: Complete amino acid sequence and localization of phosphorylation sites JOURNAL Proc. Natl. Acad. Sci. U.S.A. 87, 4951-4955 (1990) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by K.Uyeda, 11-MAY-1990, for release after publication. 
FEATURES from to/span description pept / 30 1622 fructose 6-phosphate,2-kinase:fructose 2,6-bisphosphatase (EC 2.7.1.105 and EC 3.1.3.46) BASE COUNT 466 a 493 c 493 g 383 t ORIGIN 1 caaccgacat ctcctgaaga gttgccatgt ccgggaatcc tgcctcttcc tcagaacaga 61 acaacaatag ctatgaaacc aaagcaagtc tccgaatatc agagaagaaa tgttcatggg 121 catcttacat gacaaactct ccaaccctca tcgttatgat tggcttgcca gcccgtggca 181 aaacctacgt gtccaagaaa cttacacgct acctcaactg gattggggtg cccaccaaag 241 tgtttaatct tggggtgtat cggcggcaag cagtcaagtc ctataagtcc tatgacttct 301 tccggcatga caacgaggag gccatgaaga ttcgcaaaca gtgtgccctg gtggcactga 361 aagacgtgaa ggcgtatctc acggaggaga gcgggcagat tgcggtgttt gatgctacca 421 ataccactcg agagaggagg gacttgattt tgaactttgc cgaggagaat tctttcaagg 481 tgttctttgt ggagtccgtg tgcgatgatc ctgatgtcat tgctgccaac atcctggagg 541 taaaggtgtc gagccccgac taccctgaaa ggaacaggga gaatgtgatg gatgacttcc 601 taaagaggat tgagtgctac aaagtcacct atcaacccct tgacccagac agccatgaca 661 aggatctttc tttcatcaag gtgataaacg tgggccagag atttctcgtg aacaaagtcc 721 aggactacat ccagagcaag atcgtctact acctcatgaa tatccacgtc caccctcgca 781 ccatctacct ttgccggcat ggagagagcg agttcaacct cttggggaag attgggggtg 841 actcaggcct ctcagtgcga ggaaaacagt ttgcccaggc tctaaggaag tttctggagg 901 aacaggagat agcagacctc aaagtgtgga cgagccagtt gaaaaggact atccagaccg 961 cagaatctct gggggtgacc tacgagcagt ggaagattct gaacgagatt gatgctggcg 1021 tgtgcgagga gatgacttac gcggagattc aggagcagta tccggatgag tttgcgcttc 1081 gagatgaaga gaaatatctg taccgatatc ctggagggga gtcataccag gacctcgtgc 1141 agcggttgga gccagtcatc atggagctgg agcgccaggg caatgtcctc gttatctccc 1201 accaggctgt catgcgctgc ctcctggcct acttcttgga caagggtgca gatgagctac 1261 catacctgag gtgccctctc cataccatct tcaaacttac tcctgtggcc tacgggtgca 1321 aagtggaaac aattaaactc aatgtggagg ctgtgaacac gcaccgtgac aagccaacta 1381 acaactttcc caagagccaa acccctgtaa ggatgagaag gaacagcttt acgcctctgt 1441 ccagttcgaa tacaatcagg cgtccaagaa attacagtgt tgggagccgg cccctccagc 1501 ccctcagccc gctccgtgcc ctggacacgc aagaaggggc cgaccagccg 
aagacccaag 1561 cagagacctc gcgggctgca cacaggctcc cgtctccagc gccccccacg tcgccctcct 1621 gatggcgtgg aggctgaggc cagaacgctc ccccggagac tggggtctgc tgaagacctg 1681 ggcgtgtcag ctccactggg gctggagcag gaagtcccgt ggggctcgga cattcggggc 1741 cacccgagtg agcctggaaa agctgccctc catttcttcc cctcatgcct aatgaaaacc 1801 gtgaaaagct cggcttccct gtacctggcc cctcg // LOCUS BOVTHYRA 2045 bp ds-DNA MAM 14-AUG-1990 DEFINITION Bovine thyroglobulin gene, 5' end. ACCESSION M35823 KEYWORDS thyroglobulin. SOURCE Bovine DNA, clone CBT1. ORGANISM Bos taurus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Artiodactyla; Ruminantia; Pecora; Bovidae. REFERENCE 1 (bases 1 to 2045) AUTHORS Ledent,C., Parmentier,M. and Vassart,G. TITLE Tissue-specific expression and methylation of thyroglobulin-chloramphenicol acetyltransferase fusion gene in transgenic mice JOURNAL Proc. Natl. Acad. Sci. U.S.A. (1990) In press STANDARD full staff_entry COMMENT Draft entry and computer-readable [or printed] sequence for [1] kindly submitted by C.Ledent, 27-JUN-1990. FEATURES from to/span description mRNA 2037 > 2045 thyroglobulin mRNA, exon 1 signal 2006 2011 TATA box rpt 826 1025 bovine monomeric repeat rpt 1712 1816 bovine monomeric repeat BASE COUNT 467 a 508 c 579 g 491 t ORIGIN Chromosome 14q11-q15. 
1 gatccaggag ggagaggtga gagggccaga ggtgaggagc ctgcagagag aatggagacc 61 caagccagga gcgggctcac ggaggccaga ccaaggagtt ggtgctgagt ttccgtagag 121 catcctcctg gctcacgagg catcagatga tggtaggagc tgagtcatcg tgagaaagag 181 gttttcacgg tgtggaggca gcgcctggca tggctctctc caagctcatc gcttcattgc 241 tggagtggag agaagcacct ctccttccag gtacactgtt ttgactcaga aggaagcccg 301 ggcggctctc accctcactg actctgctgt gcctcttccc aggggccagg ccagggagaa 361 tgactcagca cacaagctgg cctgtgcaag accctgcctg catgcatggg tgggggaagc 421 atgaggagtg ggctttctgg tagacctcag tcatggtctc agcattattt tttaaaagga 481 ttttatttat ttgttttgat gtggaccagt tttaaaatct ttattgaatt tgttaccatg 541 ttgcttctgt tttatgtttt ggttttttgg ccacgaggca tgggggatgt tagctccccg 601 accagggact gaacccttgc cccctgcatg gaaaggtcaa gtcttaactt cttgactgcc 661 taggaagtcc cggtttcaac attttagaaa attcatctgc tttagaacac ctggacctgg 721 agagctggca agggctgcac atggccaggc tcacggtaat attacagcgt ctccatagcc 781 tgggcagaac ttggcttgtg tgattggggc tgtgggctcc ccaggtggtg ctagtggtaa 841 agaacccttc tgccagtgca ggagacataa gagatgcggg ttctatccct ggatcaggaa 901 gatcccctgg agaaggaaat ggcaacctgc tccaggattc ttgcctggag aatcccaagg 961 acagaagagc ctggcgggcc gtagtccatg gcgttgcaaa gagtcagaca caaccgaagc 1021 gacttagcac ccaatggggg ctggagaggg ccaagctggg ggattctaat tctccagtaa 1081 atgccagcat gtcttctcaa atcatcttta cctaaaacct aaggctcatt gtgatctccc 1141 tcactcccaa gtgctctccc acacctgccc acctaccttc tgaacaccca tggagcctgt 1201 ctccatcccc caccccttta atgggagaag ccactggaga aagctcactg atgtacaggc 1261 tttatgtgga agacaagctt cctgctgcct tttggttgtc tgacgtcctg ggacagaggg 1321 gaaaggggga tgactacgag tatgactgtg cgtgtgtttg gcttatctca tcaaaatctc 1381 tacattctgt gttaatggat ctgcctgttt tgttccctgc catatcctca tggcctagaa 1441 tagtgtctgc ttctctatca gactctaaag aaacattgct aggagggaag gaaggagcat 1501 ggatgaggag ggagggagca ttgtgtttct ctcacggtgg gcctgaacgt gtggcccacc 1561 aagttgttaa ctttggcctt tacccctgaa gatgaattat gaagccacac ccccagttct 1621 tccttggtgg ctcagatggt caagaatcca cctgcaatgc gggagacctg ggtttgatcc 1681 ctgggttggg aagatcccct 
ggagaaggga atggctaccc actccagtat tctggcctgg 1741 agaatcccat ggacagagga gcctggcggg atgcagtcca tggggtctca gagagtcaga 1801 tgtgactgag cgactttcac acacattcgt ccctggttct gctcccctac agcctccaca 1861 agattttcac cccacactgg ccacatgagt gtcctccagg ggaacagacg caggtggagg 1921 acctccttgt gaccagcaga gaaaacaggg tgggcactgc ttccctgagt gcctgtgggt 1981 gggggctaag tacccacagc agtgctataa aggctccttg gccagagccc taaggtgggc 2041 agcag // LOCUS BRPTUBBA 4571 bp ds-DNA INV 14-AUG-1990 DEFINITION B.pahangi beta-tubulin gene, complete cds. ACCESSION M36380 KEYWORDS beta-tubulin. SOURCE B.pahangi DNA, clone pBTY. ORGANISM Brugia pahangi Eukaryota; Animalia; Metazoa; Nemata; Secernentea; Spiruria; Spirurida; Spirurina; Filarioidea; Gnathostomatidae. REFERENCE 1 (bases 1 to 4571) AUTHORS Guenette,S., Prichard,R.K., Klein,R.D. and Matlashewski,G. TITLE Characterization of a beta-tubulin gene and beta-tubulin gene products of Brugia pahangi JOURNAL Biochem. Pharmacol. (1990) In press STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by S.Guenette, 11-JUL-1990. 
FEATURES from to/span description pept 396 452 beta-tubulin, exon 1 891 999 beta-tubulin, exon 2 2115 2341 beta-tubulin, exon 3 2424 2552 beta-tubulin, exon 4 2718 2880 beta-tubulin, exon 5 3002 3192 beta-tubulin, exon 6 3299 3394 beta-tubulin, exon 7 3630 3816 beta-tubulin, exon 8 4047 4234 beta-tubulin, exon 9 pre-msg < 395 > 4400 beta-tubulin mRNA and introns IVS 453 890 beta-tubulin intron A IVS 1000 2114 beta-tubulin intron B IVS 2342 2423 beta-tubulin intron C IVS 2553 2717 beta-tubulin intron D IVS 2881 3001 beta-tubulin intron E IVS 3193 3298 beta-tubulin intron F IVS 3395 3629 beta-tubulin intron G IVS 3817 4046 beta-tubulin intron H signal 66 69 reverse CAAT box signal 90 95 weak TATA box signal 4395 4400 polyA signal BASE COUNT 1278 a 783 c 910 g 1600 t ORIGIN 1 ggtgtgcaga tagcacgagg aaatgagagg agcagagaag tagggcgcag tcagacagtt 61 ggatgattgc gcctcccgtg tcacttccct atataccttt gcaaggagga catgacaaag 121 ttcgaggtta tttctcgacg cgtctctctg tatactcaca tgtctgtctg tctgtctgtt 181 tgtccgtctg cctgtctgcc ttggttgcta ggaagcccgt cgactttctc cacagtttag 241 tgagtgtttg tgtgatggat aggatggttg gcgcgcggat gggtcggcca aacagttctg 301 tgactgattg tagttgccgg atgacttgat cagtaattac cggattgatt gcaggtctca 361 tttcggtcga caagatttca ttaagtgttt aagctatgag agaaattgtc cacgttcaag 421 ctggtcaatg tggcaaccag attggtgcca aggtattgat tttctgctct ttctctttat 481 cctttaggaa gatgactcat cagagggtat ctttccgcag aaattttccc gagatataat 541 tcccacttcc cctcattgaa attgttttgg tatttaggta tcatcgtcaa aatattccat 601 ttcatttctt aatgtgctat cgtttatcat ctgctcttcg attgtatctt ctcccttctt 661 tgcttcattg attcctaatg caatacataa tacctaccat ctatcacggt taataaaatt 721 cgatgttaga tcacaaagat caagttgtga taatctttcg tataataagt tgttgttttc 781 aacaatctgc agttaggtaa gatgtctgag gagcaagcaa aatgtggatg taggggatta 841 ctgaagttaa aaattgaaaa ttttaaacat aacttgagtt tttttttcag ttctgggaag 901 taatatcgga tgaacatggt gttcaacctg atggtacata caaaggtgat tcagacctgc 961 aaattgaacg aatcaacgtc tactataatg aagcgaatgg ttggtccatt attggagttt 1021 tttttcttct ttgctttgga 
atgttctttt ttgttttctc tactagttgt tgccactcgt 1081 tttcacacac attatacata gttctacata cacatgcata ccacacatag tcccatatat 1141 aaacaattgt tgtctgacat cgaacattag ttcggcggat gaggttcatc agggcgtcct 1201 caaaataatg tgttcgagaa ttaaatgaag aatgtctgcg tctcttttca aggtccatcg 1261 tggtgaaatc atttgagaca taaggagatg gatggactgt ttaaattaga taaatttaaa 1321 tacaagtttc agaataaccg acataatcaa agatgttttt gttacagaaa tatttgggaa 1381 tgattatagc aagattgtaa aacagatatt ttgacatcta cctgtctttg atcacaacta 1441 ttgtttatga tatttttggg atttttgcaa tgtaatgaca cttattttta gaatatcgta 1501 aacactttct gacctaactt gtgatttgtt cgaagcctgg tagtgaaatt tatttggcgg 1561 ttgtgaaaat acagtaaata accgcactat ataaggaaag aataacattg tcactttaac 1621 caaacaaatc tcaaattgtc gtcacttcaa ccaaaacctt atcaaacagt gtttttgacc 1681 tattctaact tctaacccca gtgtttcttg attgtcttga ttgttattct ataacaccta 1741 caaagaataa gaattccttc aattcaaact tttcacgagt tattcgtctt tgtcagtcta 1801 cactatgagg acttcgtcgt caacttattc caagataaaa aactttcagt gcaacagtct 1861 tctttttttt ctcatctctt cgtaatggaa ttaaaaccaa acaagacgtt cgtcatatgt 1921 tagcttgtca gtaatgtttg taatacatct ttattttcgt ctttttgttt ttcctctctt 1981 cattatgcag tgaaaaggag tatttttctt taggttttct gtaaagatat acttgataat 2041 agattgccat aatgaaccaa gcatcgttgt aaatcttcac ttcgtctcaa gcgatattaa 2101 ttttttgtgt tcagggggca aatatgtgcc acgagcagtc cttgttgatt tggaaccagg 2161 taccatggat tctattcgag gaggtgagtt cgggcaacta ttccgacctg acaattttgt 2221 ttttgggcaa agtggagctg gcaacaactg ggctaaggga cattatacgg aaggtgcgga 2281 actagttgat aatgtgttgg acgtgatacg aaaagaagct gagggatgcg attgtcttca 2341 ggtacggatt gccatagttt tataagacat tttaattgtg gatgtgtttt tattttaaag 2401 aaaaaattaa aaaatatttt cagggatttc aactaacgca ttcacttggt ggtggtaccg 2461 gttccggcat gggaacattg ctgatctcga aaattcgtga ggagtatccg gatcgaatta 2521 tgagctcttt ttcggttgtg ccatcgccca aagtatgtat ttgaatttct gattcttatt 2581 tttaacatct gttatatcta ttttaaacat agctccataa cttttaagtg gcggttgtct 2641 cacaaagtgg tttttcggct atcatttcat tttcatagtt gaagaaaaag tatgctagac 2701 gtattaatga tttccaggta tcagatgttg 
tgttggaacc ctacaatgca acattatcag 2761 tccaccaact agttgaaaac actgacgaaa ctttctgcat tgataacgag gctttgtatg 2821 acatctgctt ccgaacgttg aagttggcaa atccaactta cggtgacctc aaccatttgg 2881 gttcgtcctt taatttattc ttcgaactgt ccggtttaaa tgcattctat ttaactattt 2941 ataaaattcg gatttaattg taatttattc aactcttcta aaggttaaaa tttctattta 3001 gtgtctgtga caatgtcggg agtaacaact tgcttacgtt tccctggaca gttgaacgcc 3061 gatctccgta aacttgccgt caatatggtg ccattcccac ggttgcattt ctttatgcca 3121 ggatttgctc ctctctctgc tcgtgatgct gctgcttatc gagccctcaa tgttgctgaa 3181 cttactcaac aggtctctgt ttatttttat ggtcagggtc tctttttttg aaagtaatcc 3241 atttataatc gctttaaatg tctcctgttt tatgtttttt aatatcttat ttcttcagat 3301 gtttgatgcc aaaaatatga tggcagcatg tgatccgcgt catggtcgtt acctaaccgt 3361 agctgccatg ttccgaggta gaatgtctat gcgggtaagt gaattttgat aatctccatg 3421 ttaaactcag ccgctgaagc atgaagattt taatccatgt gatgtcggtt tttcttgacc 3481 tgaacttagt ggtgtatcac ttggtcttag ttatgttctt cctatttcca gagtgcttgc 3541 ttaactgcaa ttcttttgtg gaagcttgtt tttgatgtag ggtttttttc ttcatcgctg 3601 tttcaaagtt tttaataagt aggtcttagg aagtagacga gcaaatgatg caagtacaga 3661 ataagaattc atcgtatttc gttgaatgga ttccaaataa cgtaaagaca gctgtttgcg 3721 acattccacc acgtggatta aagatgagcg caacatttat tggaaataca acagctatac 3781 aagaactttt caagcgaatt tccgaacagt ttactggtga acttattaat tccatctcat 3841 ttgagcttag tttgatagct aaaaactaaa tttctggcaa gattgaaaag agatacaatt 3901 gggtatgtgt aggttacatg atttttttgc catattatga gttttatgca aaaggtgaac 3961 atttttaata tgtttgttcc gaatcgtctg aacttgcgtt ctatggaggt tatacttaag 4021 tcgatccgat tttttaatga tttcagccat gttccgacgt aaagcattct tgcattggta 4081 tactggcgaa ggtatggatg aaatggaatt cacggaagcg gagagtaata tgaatgactt 4141 ggtgtccgaa tatcaacaat atcaggatgc gacggctgat gaagaaggtg atcttcagga 4201 aggtgaatcg gaatacattg aacaggaaga gtgagcacaa agtgtcagct ttgtgaaaaa 4261 ggaactttta cagtatttta tttatcgtct tttccatgtt tttatgttgt tacattttat 4321 tttgttactc gctttcgaca tatattagca agttttaaag aattatagcg gcaaatctca 4381 ttgttgtact tctcaataaa ttacaatcaa aagttaatga 
ttcctaatgc aaagtcgtaa 4441 ggcagaaatc acgtgaaaaa cagtaatcat gattctctga aagttccaat ttgaaataaa 4501 gtactaattt tatgtgggca gtggacggta ttccgtgaca gaagatgcta cagaagattg 4561 cagcagtttt t // LOCUS BSPCELALKA 3498 bp ds-DNA BCT 14-AUG-1990 DEFINITION Bacillus sp. alkaline cellulase gene, complete cds. ACCESSION M27420 KEYWORDS alkaline cellulase; endo-beta-1,4-glucanase. SOURCE Bacillus sp. (strain KSM-635) DNA, clone pBC100. ORGANISM Bacillus sp. Prokaryota; Bacteria; Firmicutes; Endospore-forming rods and cocci; Bacillaceae. REFERENCE 1 (bases 1 to 3498) AUTHORS Ozaki,K., Shikata,S., Kawai,S., Ito,S. and Okamoto,K. TITLE Molecular cloning and nucleotide sequence of a gene for alkaline cellulase from Bacillus sp. KSM-635 JOURNAL J. Gen. Microbiol. 136, 1327-1334 (1990) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly provided by S.Ito, 30-AUG-1989, for release after publication. FEATURES from to/span description pept 605 3430 alkaline cellulase precursor (EC 3.2.1.4) sigp 605 691 alkaline cellulase signal peptide matp 692 3427 alkaline cellulase binding 530 536 SD1 ribosomal binding site binding 595 600 SD2 ribosomal binding site signal 471 476 -35 region signal 494 499 -10 region BASE COUNT 1133 a 605 c 704 g 1056 t ORIGIN 1 ggcacctgag gaagtagcga atcctgttct tgatgcacat ccatacttaa cttctggatt 61 tgccttcatg agtcgtgatg aaaacggaag tgcaccatta catggactgt ttgcatttaa 121 ttattcggca ctgattagct gtggcatttc cgcttctgct ctttctggaa tgaagtacgg 181 ggtcccaaga cttgtcactg ccattgccga tcagttattc caagatgatc gagacgagat 241 tctaaaggac ttctttgagt atgatgagaa ggagtttgtc ggaaactggc ctttaaacgt 301 ctaaatgaac ataatagcga aagggcttaa ccaaaaatat gaattgaacc cacataaatt 361 tgtgggtttt tattaatcaa aaaaatggta aataaaccta ttttaacaat gcttataacc 421 atttttctat ttattgcata aaaaaatcag taaaaaaatt catttatatg tagacgtaaa 481 ttaacaaata ttatattata tatacgaaag cggtttcgaa aaatagagga aggaggagag 541 tttttagttt ttgttgtttg tttattgtaa gcgtttacta ttaatacatt tctgggaggt 601 tattatgaaa 
ataaagcaaa ttaaacaatc tttatctttg cttttaatca tcacactcat 661 tatgtcacta tttgttccta tggcttcagc aaacacaaat gagtctaagt ctaatgcatt 721 tcctttttct gatgttaaaa aaacttcttg gtcttttcca tatataaagg atttatatga 781 gcaagaagtt attacaggaa catctgcaac aacgttctct ccaacagatt ccgttactcg 841 tgcacaattt acagtgatgc ttacccgtgg tcttggacta gaagcatctt ctaaagatta 901 cccttttaaa gatcgtaaaa actgggctta caaagaaatt caagctgcat atgaagctgg 961 aattgtaact gggaaaacaa acggtgaatt tgcaccaaat gaaaacatta ctcgtgaaca 1021 aatggctgct atggccgtac gtgcttatga atacttagaa aatgagctat ctttaccaga 1081 agagcaaaga gaatataatg actcttcttc tatttcaacc tttgctcaag atgctgttca 1141 aaaagcatac gtattagagc taatggaagg aaatacagat ggatattttc aaccaaaaag 1201 aaactctact agagaacagt ctgctaaagt tatctctact ttactttgga aagtagctag 1261 tcatgattat ttataccata cagaagctgt taaaagccct tcagaagctg gtgcgcttca 1321 gttagtagaa ctaaacggac aattaacact agctggtgaa gatggtactc ccgttcaatt 1381 acgtggaatg agtacacatg gcctacaatg gttcggtgaa atcgtaaacg aaaacgcttt 1441 cgtagcacta tcgaatgatt ggggatctaa catgattcgt ctcgctatgt acattggcga 1501 aaatggatat gcaacaaacc ctgaagtaaa agatttagtt tatgaaggaa ttgaattagc 1561 gtttgagcac gatatgtatg taattgttga ctggcatgta catgctcctg gtgatcctag 1621 agcggatgta tactcaggtg cttatgattt cttcgaagaa attgctgatc attacaaaga 1681 tcatccgaaa aaccattata tcatttggga actagcaaac gaaccaagtc caaataataa 1741 cggtggacct ggattaacaa atgatgaaaa aggttgggaa gctgtaaaag aatatgcaga 1801 gccaatcgtt gaaatgttgc gtgaaaaagg tgacaacatg attttagttg gaaatcctaa 1861 ctggagccaa cgtcctgact tatcagctga caacccaatt gatgcagaaa atatcatgta 1921 ttctgttcac ttctacacag gctcacatgg cgcttctcac attggttacc ctgaaggaac 1981 accaagctct gaacgttcta atgttatggc taacgttcgt tatgctctag acaatggcgt 2041 tgctgtgttt gcgacagagt ggggtacgag tcaagcgaat ggagatggag gaccttattt 2101 tgatgaagct gatgtttggc ttaatttctt aaacaaacat aacattagct gggcaaactg 2161 gtcgttaacg aacaaaaatg agatttctgg agcatttaca ccttttgagc ttggtagaac 2221 agatgctaca gatcttgatc caggtgctaa tcaagtatgg gcacccgagg aactaagttt 2281 atctggtgaa tatgttcgtg 
ctcgtattaa aggaattgag tatacaccta tcgaccgcac 2341 aaaattcaca aagcttgttt gggattttaa cgatggaaca acacaaggat tccaagttaa 2401 tggagacagc cctaacaaag aaagcattac tttaagtaat aataatgatg cattacaaat 2461 tgaaggatta aatgtaagta atgatatttc tgaaggaaac tactgggata atgtacgcct 2521 gtcagctgat ggctggagtg aaaatgtaga tattttaggt gctacagagc ttacaattga 2581 tgttatcgtt gaagaaccga caacagtttc aattgctgct attccacaag gacctgctgc 2641 tggctgggct aacccgacta gagcaattaa agtaactgaa gacgatttcg aatctttcgg 2701 agatggatac aaagctctcg taactattac ttctgaagat tcaccttcac ttgaaaccat 2761 tgcaactagt cctgaagaca atacaatgag caatatcatt ctatttgtag gtactgaaga 2821 tgcagatgtt atttctttag ataatatcac ggtttctggt actgagattg aaattgaagt 2881 tattcacgat gaaaaaggaa cagcaacact tccttctact tttgaagatg gaactcgcca 2941 aggctgggat tggcatacag aatcaggagt taagacagct cttacaattg aagaagctaa 3001 tggatctaac gctctttcat gggaatatgc gtatcctgaa gtaaaaccaa gtgatggttg 3061 ggctactgct cctcgtctag acttctggaa agacgaacta gttcgtggca caagcgacta 3121 tattagtttt gacttttaca tcgatgcagt tcgtgcttct gaaggtgcta tatcaattaa 3181 cgccgttttc caaccacctg caaacgggta ttggcaagaa gttccaacta catttgaaat 3241 tgatttaaca gagcttgatt ctgcaactgt aacttctgat gagttgtatc attatgaagt 3301 aaaaattaac attagagaca ttgaggctat tacagacgat acagagcttc gtaacttatt 3361 actaatcttt gctgatgaag acagtgattt tgctggtaga gtttttgttg ataatgtaag 3421 atttgaataa tttaaaaaca gtagatagag agactctcta tctacctgtt tattgcttac 3481 tattcgtctt ccactttt // LOCUS BSUPHOA3A 743 bp ds-DNA BCT 14-AUG-1990 DEFINITION B.subtilis alkaline phosphatase (phoAIII) gene, complete cds. ACCESSION M33634 KEYWORDS alkaline phosphatase. SOURCE B.subtilis (strain 168) DNA. ORGANISM Bacillus subtilis Prokaryota; Bacteria; Firmicutes; Endospore-forming rods and cocci; Bacillaceae. REFERENCE 1 (bases 1 to 743) AUTHORS Bookstein,C., Edwards,C.W., Kapp,N.V. and Hulett,F.M. TITLE The Bacillus subtilis 168 alkaline phosphatase III gene: Impact of a phoAIII mutation on total alkaline phosphatase synthesis JOURNAL J. Bacteriol. 
172, 3730-3737 (1990) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by F.M.Hulett, 06-APR-1990, for release after publication. FEATURES from to/span description pept 503 > 743 alkaline phosphatase precursor (phoAIII, EC 3.1.3.1) sigp 503 598 alkaline phosphatase signal peptide matp 599 > 743 alkaline phosphatase BASE COUNT 221 a 149 c 131 g 242 t ORIGIN 1 gatccgtaaa catggctgct tccattttat ttctgtatgt aaatttggat atttattttt 61 tatccaagtt aacacctctt ccgttctttc tcggtggaac gggtcagcaa tccccgctaa 121 gtactcagaa aaaacgtcca tgttgttccc tcctaaaata aaatgatgcc actccatatg 181 aactgctcag acatgtgctt acacaatacc tcttgccaac gtaatacttt aaattgttca 241 agatgtaagc gggttgcttc tatgcgacta tagcagattt cagtccgcct atccacggcc 301 agtatctatt cctccaattt atttctttag tactatcgct ctttccttct aaaacttctc 361 ataaaagaat aaccattatt taagggtgcc agttcattat tcttgtaaat ccaatcttta 421 aaatcgatta atactagctt aacagtttaa aaatataatt gggttgtcat tgagattcat 481 ctatatttag gaggttatcc agttgaaaaa attcccaaag aaattactgc ctatcgcggt 541 tttatcatca attgcgttca gcagcttagc cagcggcagt gtgcctgaag ccagcgccca 601 ggaaaagaaa aaggggaacc aagacgaaat taaaaatgtt attgttctga ttggtgatgg 661 tatgggtgtg tcttatacgt ctgcttatcg atacttaaag gataataaaa agacaaaagt 721 tgtggagcct acggcttttg atc // LOCUS CHTSIGAA 2180 bp ds-DNA BCT 14-AUG-1990 DEFINITION C.trachomatis sigma-A protein (sigA) gene, complete cds. ACCESSION M36475 KEYWORDS sigma factor; sigma-70 homolog. SOURCE C.trachomatis (strain MoPn) DNA. ORGANISM Chlamydia trachomatis Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Rickettsias and Chlamydias; Chlamydiales; Chlamydiaceae. REFERENCE 1 (bases 1 to 2180) AUTHORS Engel,J.N. and Ganem,D. TITLE A polymerase chain reaction-based approach to cloning sigma factors from eubacteria and its application to the isolation of a sigma-70 homolog from Chlamydia trachomatis JOURNAL J. Bacteriol. 
172, 2447-2455 (1990) STANDARD simple staff_entry FEATURES from to/span description pept 360 2075 sigma-A protein (sigA) BASE COUNT 679 a 380 c 517 g 603 t 1 others ORIGIN 1 tctagaggga tttgcccaat gggagcgaga tttattaaca gaatctcgaa cagaacgctc 61 tggcaaaatc gtaattaaaa cgaaagcttt tggagccact ccgttctact attgtgtagt 121 aacgttagat aaaggacctt tggcagaaca tgtattaggg gttttatacc ctgcaaaagc 181 aagttttttt acaaatcttt cctatattta atgataaata gggatgtcgg tgtaggaagt 241 ttttctaggg ccgaatctga tcagctggtt ttttatctaa aaatccctag atttcttgtt 301 ttccttaaga taaactgtca ctttctaggc cgatttttcc ttagttttaa tttgttagca 361 tgcgcatcca tacgctagat agtcaacgtg ctgatgcagc ccaagaagaa gaaatccaaa 421 gaaagttaga agagctagtc actcttgcta aggatcaagg gttcatcacg tatgaagaaa 481 ttaatgaaat tcttccccct tctttcgatt cgccagaaca gatagatcaa gttttaattt 541 ttctggcggg gatggacgtt caagtcttaa accaagcaga cgtagagcgg cagaaagaaa 601 gaaaaaaaga agctaaagag ctagaagggt tggctaagcg ttctgaggga acgcctgatg 661 atccagtgcg tatgtatctg aaggaaatgg gtacagttcc tcttcttaca agagaagagg 721 aggtggagat ttctaaaagg atagaaaaag ctcaggtaca aatagaaaga attattttac 781 gctttcgtta ttcgactaaa gaagcggttt ctattgcgca atacttaatt aatggtaagg 841 aacgatttga taagatcgtt tccgnaaaag aggtggaaga taagacacat tttcttaatc 901 ttttgccgaa gttaatctct ttgctgaaag aagaggactc ttacttagaa gagcggcttt 961 tggctttgaa agatccggca ctgtccaagc aagatcaagc aaagttgaac gatgagcttg 1021 aaaaatgccg tattcgaaca caagcttatc tgagatgttt tcattgtcgt cacaatgtca 1081 cggaagattt tggagaagtt gtttttaaag cttatgactc gttcttacag ttggaacaac 1141 aaatcaatga tttaaaggtt cgtgctgaaa gaaataagtt tgctgcagca aaactagctg 1201 cagcccgacg caagctttac aaacgagaag ttgcagcggg acgtactctt gaagagttta 1261 aaaaagatgt gcggatgttg cagcgttgga tggataaaag ccaggaagca aagcaagaga 1321 tggtggaatc caacttacgt ttggtgatct ctatcgcgaa aaaatatacc aaccgagggc 1381 tgtctttctt ggatttgatt caagaaggaa atatgggctt aatgaaagct gttgaaaaat 1441 ttgagtatcg ccggggttat aaattttcaa cttatgccac ttggtggatt cgtcaggctg 1501 tgacgcgagc tattgccgat caggcaagaa ccattcgaat tcctgttcat atgatcgaga 
1561 ccattaataa ggtgcttcgt ggagccaaga aattaatgat ggaaactgga aaagagccta 1621 cgcccgaaga actcggagag gaactaggtt tcactccaga ccgtgttcga gaaatttata 1681 agatcgctca gcatccgatt tctttacagg ctgaggttgg agatagtgga gaaagctctt 1741 ttggagattt cttggaagat acagctgttg aatctccggc agaggcaaca ggctactcca 1801 tgttgaaaga caaaatgaag aaagtgctaa aaacgcttac tgatcgcgaa cgttttgttt 1861 tgatccatcg gtttggcctt ttagatggcc gtcccaaaac tttggaagag gtaggctccg 1921 cgttcaacgt gacacgagag cggattcgac aaatcgaagc caaagctttg cgaaaaatgc 1981 gtcatcctat ccgttccaaa cagctacgag cattcttgga tttattggaa gaagagaaga 2041 ctggttcggg caagattaag agttataaga attaaggatt ccttttcata aggagtatgc 2101 ttgtatcggt tagacgtaac gaactttcgt gtttgggtat ctataggggt ctcagagcag 2161 gaacgctatc acaagcaacc // LOCUS CUSRGSBHA 1137 bp ds-DNA PLN 14-AUG-1990 DEFINITION Melon 17S rRNA, 5.8S rRNA, and 25S rRNA gene region. ACCESSION M36377 KEYWORDS 17S ribosomal RNA; 25S ribosomal RNA; intraribosomal spacer region. SOURCE C.melo 4-wk old cotyledon DNA, clone pML17. ORGANISM Cucumis melo Eukaryota; Plantae; Embryobionta; Magnoliophyta; Magnoliopsida; Dilleniidae; Violales; Cucurbitaceae. REFERENCE 1 (bases 1 to 1137) AUTHORS Kavanagh,T.A. and Timmis,J.N. TITLE Structure of melon rDNA and nucleotide sequence of the 17-25S spacer region JOURNAL Theor. Appl. Genet. 
76, 673-680 (1988) STANDARD simple staff_entry FEATURES from to/span description rRNA < 1 259 17S rRNA rRNA 476 638 5.8S rRNA rRNA 876 > 1137 25S rRNA BASE COUNT 253 a 333 c 324 g 227 t ORIGIN 1 atagatcatt gcaattgttg gtcttcaacg aggaattcct agtaagcgcg agtcatcagc 61 tcgcgttgac tacgtccctg ccctttgtac acaccgcccg tcgctcctac cgattgaatg 121 gtccggtgaa gtgttcggat cgcggcgacg tgggcggttc gctgcccgcg acgtcgcgag 181 aagtccactg aaccttatca tttagaggaa ggagaagtcg taacaaggtt tccgtaggtg 241 aactgcggaa ggatcattgt cgatgcctaa acatcaaacg acccgcgaac gcgtttaaaa 301 acaaactgtt cgcgttaggg gcggggggaa gcatgctctt tggctgcctc ctccccttcc 361 aacgcgttta aacaaaaccc cgcgcaggtc gcgccaagga acttgaaatg aattcgcctg 421 tcccctgccc cggcctcggc gtgcggggga tggagcattc tagtcgtatt actaacaacg 481 actctcggca acggatatct cggctctcgc atcgatgaag aacgtagcga aatgcgatac 541 ttggtgtgaa ttgcaggatc ccgcgaacca ccgagtcttt gaacgcaagt tgcgcccgga 601 gccttctggc cgagggcacg tctgcctggg cgtcacgcat cgctgccccc accacacaac 661 tctccccatg cggggtcgtt gtgaaggcag ggacacacac tggcctcccg tacgcaccgt 721 cgtgcggatg gcttaaattt gagtcctcga tgctcgtcgt cgcgacacta cggtggttga 781 ttcaactcgg tgacgcgtct cgacctcgac gtcgacttca cggactcctt cacgaccctt 841 cgaacgccgc cccttaaaag gacgacgctc tcgacgcgac cccaggtcag gcgggactac 901 ccgtgagagt ttaagcatat caataagcgg aggaaaagaa acttacaagg attcccctag 961 taacggcgag cgaaccggga agagcccagc ttgagaatcg ggcgtcctcg acgtccgaat 1021 tgtagtctgg agaagcgtcc tcagcggcgg accgggcaca agtccctgga aggggcgcca 1081 gagagggtga gagagcccgt tgcgctcgga ccctgtcgca ccacgaggcg ctgtcaa // LOCUS DOGFIXA 3080 bp ss-mRNA MAM 14-AUG-1990 DEFINITION Canine blood clotting factor IX mRNA, complete cds. ACCESSION M33826 KEYWORDS blood clotting cascade protein; blood clotting factor IX; protease. SOURCE Dog hemophilic liver, cDNA to mRNA, clone dF9.28. ORGANISM Canis familiaris Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Carnivora; Caniformia; Canoidea; Canidae. 
REFERENCE 1 (bases 1 to 3080) AUTHORS Axelrod,J.H., Read,M.S., Brinkhous,K.M. and Verma,I.M. TITLE Phenotypic correction of factor IX deficiency in skin fibroblasts of hemophilic dogs JOURNAL Proc. Natl. Acad. Sci. U.S.A. 87, 5173-5177 (1990) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by J.H.Axelrod, 23-APR-1990, for release after publication. FEATURES from to/span description pept 22 1380 factor IX protein precursor sigp 22 138 factor IX signal peptide matp 139 1377 factor IX blood circulatory protein matp 577 1377 factor IX activated protein matp 673 1377 factor IX activated protein mRNA < 1 3080 factor IX mRNA BASE COUNT 878 a 640 c 677 g 885 t ORIGIN Chromosome X. 1 aagcagcgcc tgaataggat catggcagaa gcatcgggcc tcgtcaccgt ctgcctttta 61 ggatatctac tcagtgccga atgtgcagtt tttcttgatc gtgaaaatgc caccaaaatt 121 ctgagtcggc caaagaggta taattcaggt aaactggaag agtttgttcg agggaacctt 181 gagagagaat gtatagaaga aaagtgcagt tttgaagaag cacgggaagt ttttgaaaac 241 actgaaaaaa ccactgaatt ttggaagcaa tatgttgatg gagatcaatg tgaatccaat 301 ccatgtttaa atgacggtgt atgcaaggat gacattaatt cctatgaatg ttggtgtcga 361 gctggatttg aaggaaagaa ctgtgaatta gatgtaacat gcaacattaa gaatggcaga 421 tgcaagcagt tttgtaaatt gggccccgat aacaaggtgg tttgttcctg tactacggga 481 taccaacttg cggaagacca aaggtcctgt gaaccagcag tgccatttcc atgtggaaga 541 gtttctgtcc ctcacatttc tatgacacgc acccgtgctg aaactctttt ttccaatatg 601 gactatgaaa attcaactga agtggaaaaa attttggata acgtcaccca accgcttaac 661 gacttcactc gagttgttgg tggaaaagat gccaaaccag gtcaattccc ttggcaggtc 721 cttttgaatg ggaaagttga tgcattctgc ggaggttcca tcatcaatga aaaatgggtg 781 gtaactgcag cccactgtat tgagcctgat gttaaaatta ccatagttgc aggtgagcat 841 aacaccgaga agagggaaca tacagagcag aagcgaaacg tgattcgcac tattcttcac 901 cacagctata atgcaactat taataagtac aaccatgaca tcgcccttct ggaactggat 961 gagcccttaa cgctgaacag ctatgtaaca cctatttgca ttgctgacag ggaatactcg 1021 aacatcttcc tcaaatttgg gtctggctat gtgagtggct gggggagagt cttcaacaaa 1081 
gggcgatcgg cttcaattct tcaatacctt aaagttccac ttgttgaccg agccacgtgc 1141 cttcggtcca cgaagttcac catttataac aacatgttct gtgctggctt ccatgaggga 1201 ggtaaagatt catgccaggg cgatagtggg ggaccccatg tcaccgaagt agaaggcata 1261 agtttcttaa ctgggattat tagctggggt gaagagtgtg cgatgaaagg gaagtatgga 1321 atatatacca aggtgtcccg gtatgtcaac tggattaaag aaaagacgaa gctcacctaa 1381 agaataatgt atttccaagg ttgacacgtt tagggtagaa aatggacaag gtcctttact 1441 aactaatcac tttttttatc tctttagatt tgcactatat acattctctg catactgctt 1501 tttctctttc tggggagaaa tctatctaga attcctattt tactagacta agtgaattag 1561 aaaatgtaat cactacggga atgtactgtg atgggacttg tgaccactcc cacaggtcta 1621 gcccttggca ccattgtgag gttaggttat tcctccgtcc cgtcagtgtt aagtttctcc 1681 actggggcaa ctccctgatt ctccctcctt ggcagcattc catgttccag acctttctta 1741 cctttcccat ggaaagcatc atcaaaatgt gttagatcta catccaggat atttgatcta 1801 gttcagaaca agtctaacac ctcacttgtt aaggaagaac acaggagcaa ctgacaggtt 1861 gcaactcacc agaaaacact atttcctttt ctgtatgctt attcctgctt ccttatctct 1921 tctgtttcct aatcctgaaa tcagtgtctc tctttctctt tctctctctc tttttcctac 1981 agaggtttaa aggagggaag ggacacatca tgctgtttta ctactgtcca cagttataca 2041 tgtctatcaa acccagactt gctttcagtt tggtctttga cttgcttttc ggagcatagg 2101 gatgaagcca ggtgcctgaa gaccttcagg aaaatttgtc tgaaagagtc atgttactga 2161 aatacatggc atcgtggaag gaatgaccga cccatcagaa tacgtccaat aagcatttgt 2221 aattgtgttg tgatagaggt aactaagaag agtgacatca actcctgcgt ctcatccccc 2281 atgtgaaaaa actgtgaact aaaagagaac agccagtgtg caacgtagaa ctagtagagt 2341 cttaaggaag aattcattgg tgggtctcca gtactggccg gagctaagga agaagttgcc 2401 ctcgaccaaa gaacatgagc accctgtctc ttaaactagc atgtccccac agtggagaag 2461 gatgtactgg tggcttaaag gcatgagtca ggggtgtctg ggtggctcag tggttgagcc 2521 tttggctcag ggcatgatcc cagcgtcctg ggattgagtc ccgcatcagg cttcctgcag 2581 ggggcctgct tctccctctg cttatgtctc tgcttctctg tgtgtgtgtg tgtgtctctc 2641 atgaaaaaaa taaataaaat ctttaaaaaa aaaaaaggta tgagtcattc cagtgagcca 2701 tctcactccc ccttttctgg tttcaggttc tctaggcaac cttttgatta taattaggcc 2761 ttccatattg 
aattttctaa agagttgctg accaacctct gcatgtactt tagtgtgtta 2821 ggctggctcc ctttatgaat taataaactg ttgttctggt tgatactgta gctttttgtg 2881 aatgtggctg atgtgaatca gtcatcccac atgttgttat atatgagact actgacaaaa 2941 tcatgtttgg gaccactttg acctgaccaa gctgctgcct tcacctcacc cccagccagg 3001 catttacttt agtccttcaa atattaagta ccaataaatg tcttttcaaa tttcaaaaaa 3061 aaaaaaaaaa aaaaaaaaaa // LOCUS ENHSREHPA 722 bp ss-mRNA INV 14-AUG-1990 DEFINITION E.histolytica serine rich protein (SREHP) mRNA, complete cds. ACCESSION M34438 KEYWORDS serine rich protein. SOURCE E.histolytica (strain HM1:IMSS), cDNA to mRNA. ORGANISM Entamoeba histolytica Eukaryota; Animalia; Protozoa; Sarcomastigophora; Sarcodina; Rhizopoda; Lobosa; Gymnamoeba; Schizopyrenida; Tubulina; Entamoebidae. REFERENCE 1 (bases 1 to 722) AUTHORS Stanley,S.L.Jr., Becker,A., Kunz-Jenkins,C., Foster,L. and Li,E. TITLE Molecular cloning and expression of a membrane antigen of Entamoeba histolytica possessing multiple tandem repeats JOURNAL Proc. Natl. Acad. Sci. U.S.A. 87, 4976-4980 (1990) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by S.L.Stanley,Jr., 22-MAY-1990, for release after publication. 
FEATURES from to/span description pept 4 705 serine rich protein precursor sigp 4 42 serine rich protein signal peptide matp 43 702 serine rich protein ORF 188 688 ORF2 site 247 582 multiple tandem repeats BASE COUNT 315 a 121 c 118 g 168 t ORIGIN 1 tcaatgttcg catttttatt gtttattgca ttcactagtg caactaatat cattcttgat 61 ttggatcaag aagtaaaaga tactaatatc tacggtgtat ttttaaagaa cgaagctagt 121 cctgaaaagc ttgaagaagc tgaagaaaaa gaaaaaagta gctcagcaaa accagaatca 181 agttcaaatg aagataatga agatgatgaa gatgaaaaag caagttcaag tgataactca 241 gaatcaagct caagtgataa accagataat aaaccagaag caagttcaag tgataaacca 301 gaagcaagtt caagtgataa accagataat aaaccagaag caagttcaag tgataaacca 361 gataataaac cagaagcaag ttcaagtgat aaaccagata ataaaccaga agcaagttca 421 agtgataaac cagataataa accagaagca agctcaagtg ataaaccaga taataaacca 481 gaagcaagct caactaataa accagaagca agctcaacta ataaaccaga agcaagctca 541 actaataaac cagaagcaag ctcaactaat aaaccagaag caagctcaac tagtaattca 601 aatgataaat caggaagtag ttcagataac gataataata accttgatgc tgcatcaagt 661 ccattcattg ttttctgtgc tatcattata gctatcatct tctaaatgag ttatttgact 721 tt // LOCUS EUBBAIOA 6212 bp ds-DNA BCT 14-AUG-1990 DEFINITION Eubacterium sp. bile acid-inducible operon (baiBCDEA2F), complete cds. ACCESSION M36292 KEYWORDS 7-alpha-dehydroxylation; bile acid-inducible operon. SOURCE Eubacterium sp. DNA. ORGANISM Eubacterium sp. Prokaryota; Bacteria; Firmicutes; Irregular asporogenous rods. REFERENCE 1 (bases 1 to 6212) AUTHORS Mallonee,D.H., White,W.B. and Hylemon,P.B. TITLE Cloning and sequencing of a bile acid-inducible operon from Eubacterium sp. VPI-12708 JOURNAL Unpublished (1990) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by D.Mallonee, 10-JUL-1990. Medical College of Virginia-VCU Dept. 
of Microbiology Box 678 MCV Station Richmond, VA 23298-0678 FEATURES from to/span description pept 24 1586 baiB protein pept 1634 3256 baiC protein pept 3235 3552 baiD protein pept 3597 4097 baiE protein pept 4130 4879 baiA2 protein pept 4932 6212 baiF protein BASE COUNT 1864 a 1324 c 1698 g 1326 t ORIGIN 1 aaaagatatt aagcattaag aaaatgcaca aaaaatcagc gtgtgagagg gagggcaagg 61 agttgaagcg tgactttttt aacaagttta atttggggac atcgaacttt gtcacgccgg 121 gaaaacagtt ggaatacgtt tcggaatgca agccagattc tactgcggtc atttgcttag 181 ataaagaaca gaactgttcc gttattactt ggcatcagct gcacgtctat tccagccagc 241 tggcatggta ccttatagaa aatgagattg gcccggggtc gatcgtactt acaatgtttc 301 cgaacagcat cgagcacatt attgcggtat ttgcaatctg gaaggcgggc gcctgctata 361 tgcccatgtc ctataaggcg gcggaatccg agatcaggga ggcctgcgat accatccacc 421 cgaatgcggc ttttgcggaa tgcaagattc caggattaaa attctgcctt agcgcagacg 481 agatatatga ggcgatggaa ggaagatcca aggagatgcc ttcggaccgt ctggccaatc 541 cgaacatgat atccttatca ggcggaacca gcggaaagat gaagttcatc cgtcagaacc 601 ttccatgcgg gctggacgat gagacgatca gaagctggtc tttgatgtct ggaatgggat 661 ttgagcagcg ccagctgctg gtaggcccgc tgtttcatgg cgcgcctcac tccgcggcgt 721 ttaatggact gttcatgggc aacaccctgg tactgaccag gaacctttgc ccgggaaata 781 tcctgaacat gattaagaaa tataagattg aatttataca gatggtgccg accctgatga 841 accggcttgc caaactggag ggagtcggaa aagaagactt tgcatccctg aaggcgctgt 901 gccatacagg gggcgtctgt tctccctggc ttaagcagat ctggatcgac ctgctggggc 961 ctgaaaagat ctatgagatg tattccatga cggaatgcat cggccttacc tgcatccggg 1021 gagacgagtg ggtgaagcat ccgggaagca tcggacggcc agtgggcgat agcaaggtgt 1081 ctatccggga tgagaatggc aaggaagttg cgccttttga gattggcgag atctatatga 1141 cagcgccggc ctcctatctg gttaccgagt acatcaattg ggaaccgctg gaagtgaaag 1201 agggaggctt ccgaagcgta ggggatatcg gctacgtgga tgagcagggc tatctgtact 1261 tttctgaccg gcgcagcgac atgctggtat caggcggaga aaacgtgttc gccaccgaag 1321 tcgagacggc gcttttgaga tataaggata tcctggacgc tgtagtggta gggataccgg 1381 atgaagatct ggggcgaagg ctccatgcgg tcattgagac agggaaagag ataccggcag 1441 
aggaactgaa aacattcctg agaaagtatc tgactccata taagatacca aagacgttcg 1501 agttcgtaag gagcatacga aggggagaca atggaaaggc cgacaggaag cggatcctgg 1561 aagattgtat tgcccgcggg ggatgattct ataaatgcaa agaaaacaaa ttatataaag 1621 gaggagtaac aaaatgagtt acgaagcact tttttcacca ttcaaggtca gaggactgga 1681 acttaaaaac cgtatcgtcc tgcctggaat gaacaccaag atggcaaaga acaagcacga 1741 cataggcgag gatatgatag cctaccatgt tgccagggca aaagcgggat gcgcgttaaa 1801 tatatttgaa tgcgtagcat tatgtccggc gcctcacgct tatatgtata tggggcttta 1861 tacggaccat catgtagaac agcttaagaa attgacggat gcagtccatg aagcaggcgg 1921 caagatgggc atccagctgt ggcatggagg attcagcccg cagatgttct ttgacgagac 1981 caacaccctg gaaactccgg acactcttac ggtagagagg attcatgaga tcgtagaaga 2041 attcggacgc ggcgcaagga tggctgttca ggctggattt gacgcagtag aattccatgc 2101 ggctcacagt tatctgcctc acgagttctt aagccctgga atgaacaaac gtacggatga 2161 gtacggcgga agttttgaga accgctgcag attctgttat gaagtcgttc aggcaatccg 2221 ttccaatatc ccggatgaca tgccattctt tatgcgtgca gactgcatcg acgaattaat 2281 ggaacagacc atgacagagg aagagatcgt tacatttatc aataagtgcg cagaacttgg 2341 cgtggatgtg gcagaccttt cccgtggaaa cgcgacttca ttcgcaaccg tatatgaagt 2401 tccgccattc aacctggctc atggcttcaa catagagaat atttacaaca tcaaaaagca 2461 gatcaatatc ccggttatgg gagttggccg tatcaataca ggagagatgg caaacaaggt 2521 cattgaagaa ggcaagtttg acctggtagg catcggacgc gcccagcttg cagatccaaa 2581 ctggatcacc aaagtaagag aaggcaaaga agacctgatc cgccactgta tcggatgtga 2641 ccagggatgc tatgacgcag tcatcaatcc aaagatgaag catatcacct gcacccacaa 2701 tccaggattg tgcttagagt atcagggaat gccaaagaca gacgctccta agaaagtcat 2761 gatcgtagga ggcggaatgg caggcatgat cgctgcggaa gtattaaaga ccagaggcca 2821 taacccggta atcttcgagg catccgacaa gcttgcagga cagttcaggc tggcaggcgt 2881 agcgccgatg aagcaggatt gggcagatgt tgcagaatgg gaagcaaaag aagtagagcg 2941 ccttggaatc gaagtacgtc tgaataccga agtgactgca gagaccatca aggaattcaa 3001 tccggataat gtcatcatcg cagtaggctc tacctatgcg ctgcctgaga ttccgggaat 3061 cgacagccca agcgtatact cccagtatca ggtactgaaa ggggaagtaa atccgacagg 3121 ccgtgtagcc 
gttatcggat gcggactggt tggtacggaa gtcgcagaac ttctggcatc 3181 cagaggcgca caggtaatcg cgatcgagag gaagggcgta ggtaccggcc ttagatgctt 3241 cgcagaatgt tcatgaaccc ggaattcaaa tattacaaga tcgccaagat gtccggaaca 3301 aatgtcaccg ctttagagca gggcaaggtt cactacatca tgacagacaa gaagaccaaa 3361 gaagtgacgc agggagtcct ggaatgcgac gctaccgtta tctgtacagg aattaccgca 3421 cgtccaagcg atgggcttaa ggcaagatgc gaagaacttg gaatcccggt tgaggtgatc 3481 ggagacgctg ctggcgcaag agactgcacg atcgcgacac gcgaaggcta tgacgcagga 3541 atggcaatct agaaaatcag aacttatcaa tcttacatat agaaaggatg atacatatga 3601 cattagaaga gagagttgaa gcattagaaa aagaattgca ggagatgaag gatattgagg 3661 caatcaagga actgaaagga aagtatttcc gctgcctgga cggaaagatg tgggatgagc 3721 tggagaccac cctgtcacca aatatcgtaa cctcttattc caacgggaaa ctggtattcc 3781 atagcccgaa ggaagttacc gattacttaa agagctcgat gccaaaagaa gagatcagca 3841 tgcatatggg ccacacgccg gagatcacca ttgacagcga gactacggct acgggcagat 3901 ggtatctgga agatagactg atctttacgg acggtaagta caaagacgta ggaatcaatg 3961 gcggcgcgtt ctatacagac aaatatgaga agatagacgg ccagtggtac atccttgaaa 4021 ccggctatgt acgaatctat gaagaacatt tcatgcgtga tccaaagatc catatcacga 4081 tgaacatgca caaataagaa tattgtaaaa gaaaggcagg agtaagagta tgaatctcgt 4141 acaagacaaa gttacgatca tcacaggcgg cacaagaggt attggattcg ccgctgccaa 4201 aatatttatc gacaatggcg caaaagtatc catcttcgga gagacgcagg aagaagtaga 4261 tacagcgctt gcacagttaa aagaacttta tccggaagaa gaggttctgg gattcgcgcc 4321 ggatcttaca tccagagacg cagttatggc agcggtaggc caggtagcac agaaatatgg 4381 cagactggat gtcatgatca acaatgcagg aattaccagc aacaacgtat tctccagagt 4441 gtctgaagaa gagttcaagc atattatgga catcaacgta acaggcgtat tcaacggcgc 4501 atggtgcgca taccagtgca tgaaggatgc caaaaagggc gttatcatca acacggcatc 4561 cgttacaggc atcttcggat cactctcagg cgtaggatat ccggccagca aggcaagcgt 4621 gatcggactc acccatggac ttggaagaga gatcatccgc aagaatatcc gtgtagtagg 4681 agtggctcct ggagttgtga acacggatat gaccaatggc aatcctccgg agatcatgga 4741 aggatatctg aaggcgcttc cgatgaagag aatgcttgag ccggaagaga tcgctaatgt 4801 atacctgttc ctggcatctg 
acttggcaag cggcattacg gctactacgg tcagcgtaga 4861 cggggcttac agaccataat tttaattttt actaagtaga atatgtgata tagaaaagga 4921 gatataaaaa catggctgga ataaaagatt ttccaaaatt cggagctctt gcagggctta 4981 agatacttga cagcggatct aacatcgccg gacctttagg cggaggcctt ctggcagaat 5041 gcggagcaac ggtcatccat tttgaaggac caaagaaacc tgataaccag agaggatggt 5101 acggctatcc acagaatcac cgtaatcagc tgtctatggt agcagacatc aaatctgaag 5161 aaggaagaaa gatcttcctt gatctgatca aatgggcaga tatctgggta gagtcatcca 5221 aaggcggaca gtatgacagg ctgggacttt ccgatgaagt catctgggaa gtaaatccta 5281 agattgccat cgtgcacgta tccggatatg gacagacagg agacccgtct tacgttacac 5341 gtgcatccta tgacgcagta ggccaggcat tcagcggcta tatgtcactg aacggaacaa 5401 cggaagcgct gaagatcaat ccttatctga gcgatttcgt atgcggactt accacatgct 5461 gggctatgct tgcctgctat gtaagcacca ttcttaccgg aaaaggcgaa tctgttgacg 5521 ttgcacagta cgaagcgctg gcacgtatca tggacggacg tatgatccag tacgctacag 5581 acggcgtgaa gatgccaaga accggcaata aggatgcgca ggctgccctg ttcagcttct 5641 acacctgtaa agacggacgt acgatcttta tcggaatgac tggcgcggaa gtatgtaaga 5701 gaggcttccc gatcatcgga cttccggtac ctggaaccgg agacccggac ttcccggaag 5761 gcttcacagg ctggatgatc tatactcctg taggacagag aatggaaaag gctatggaga 5821 agtatgtatc tgagcatacg atggaagaag tagaggctga gatgcaggca caccagattc 5881 catgccagag agtatacgag ctggaagact gcctgaacga tcctcactgg aaagcacgtg 5941 gaactattac ggagtgggat gacccgatga tgggacatat cacaggcctt ggactgatca 6001 acaagttcaa gagaaatcct tccgaaatct ggagaggcgc tccgctgttc ggtatggata 6061 accgcgatat cctgaaagac ctgggatatg acgatgcaaa gatcgatgaa ctctatgagc 6121 agggcatcgt caatgaattc gaccttgaca ctactatcaa acgctataga ctggatgaag 6181 taattccaca tatgagaaag aaagaggagt aa // LOCUS FDICPECDA 2371 bp ds-DNA BCT 14-AUG-1990 DEFINITION F.diplosiphon phycoerythrin-associated linker protein (cpeCD) gene, complete cds. ACCESSION M33832 KEYWORDS cpeC gene; cpeD gene; phycoerythrin-associated linker protein. SOURCE F.diplosiphon DNA. 
ORGANISM Fremyella diplosiphon Prokaryota; Bacteria; Gracilicutes; Oxyphotobacteria; Cyanobacteria. REFERENCE 1 (bases 1 to 2371) AUTHORS Federspiel,N.A. and Grossman,A.R. TITLE Characterization of the light-regulated operon encoding the phycoerythrin-associated linker proteins from the cyanobacterium Fremyella diplosiphon JOURNAL J. Bacteriol. 172, 4072-4081 (1990) STANDARD full staff_entry COMMENT Authorin copy for [1] kindly submitted by N.A.Federspiel, 24-APR-1990, for release after publication. FEATURES from to/span description pept 454 1314 phycoerythrin-associated linker protein (cpeC) pept 1355 2104 phycoerythrin-associated linker protein (cpeD) mRNA 267 > 2371 cpeCD mRNA binding 440 444 cpeC ribosomal binding site binding 1339 1343 cpeD ribosomal binding site BASE COUNT 679 a 529 c 514 g 649 t ORIGIN 1 gatctgctca agcagtagat ttattaatca aagcaggggc gacggagtga agtggggatt 61 ggggactggg gatgaggagg atgaggggga tgaggaggat gagggggatg aggggaaata 121 accattaccc attccccatt ccccattacc caatttccca tgcccattac aaatagtttg 181 tgcaaattga gtgcaaaatt ctctcatttc atcaagattt acaaatcttg atgtacatct 241 ttccgcgctc ataagaaaat gatcataaca aagcgtataa aacgcctaaa accttgagca 301 tctaggaaat cacagctttc atgcttatgg aagctggtaa tttgggttgc aacaaatcaa 361 tccaaagaat tgatgagcct ggatgagcaa tctgaaattt atttacaaat cgaaacaaat 421 ttcttaaact ttcgttaaca ggagaaacaa ttaatgccat ttggaccagc ttcacgcttg 481 ggagtcagcc tatttgatga aactcctccc gttgagtggg taccaggtcg ctcacaagaa 541 gaagcagaaa caatcattcg ggcaatctat cggcaagtat taggtaatgc ctatgtgatg 601 gaaagtgagc ggcttgctgt gcctgaatcc cagtttaagc ggggtgagtt gagcgtccgc 661 gagtttgtca gagcagtggc taaatctgaa ctatatcgtt ctcgcttttt caccagttgt 721 gcgcgctacc gagccattga actcaacttc cgccatctat tgggtcgtcc accactagat 781 ttagaagaaa tgcgctccca cagcacaatc cttgatactc aagggtttga agctgagatt 841 gattcttata tcgatggtga tgagtatcag tctacttttg gcgagaacat tgtaccttac 901 atccgaggct ataaaaccga agcgcttcag agcatggtgc aatttactca taccttccaa 961 ctggtacgag gtgcttctag cagcagcctg aagggtgact tatctggcaa 
ggctcctaag 1021 ctgaatgcat tagtgattca aagcacacca acagcagtaa tttcacctgc tagcgctgga 1081 gcaaccttct cgacaccacc tactggtgcc cgtacccgtc ttggagtcga tgctagtgct 1141 ggtggcaaag tttaccgcat tgaagttaca ggttatcgtg ccaaaacctt caataatatt 1201 tccaagtttc gccgttccaa tcaagtcttt ctggtgccct acgaaaagct ctctcaagag 1261 tatcaacgga ttcaccagca aggcggcgtg atcgcaagta tcactcctgt ataaattagg 1321 tgcaaactta aaaaattgag gagcagaaat tttaatggca tcccagacaa ttcttgaact 1381 ttggccctct agtagcttag aagaagttca aactattatc cgtgcagttt acaaacaggt 1441 tttaggcaac cctcatgtta tggagagtga gcggttggtg acagcagaat cacaattatg 1501 cgatcgctcc atcaccgtgc gggaatttgt ccgcagcgtt gccaagtctg atttttatcg 1561 caaccgctac ttccaatcct gcgctcccta ccgatttgta gaacttaact tcttacattt 1621 gcttggtcgc gcaccccagg atcaaagaga agtttccgaa cacatcgttc gtactgtagc 1681 tgaaggctac gatgctgaaa ttgactccta tatcgatagt agtgaatatg aagcagcctt 1741 tggtgaaaac gtagtgcctt actatcgtgg tagaagtagc gaagccaact ccaagcaagt 1801 aggcttcaac cgcatatttg cccttgatcg cggccctgcc caaattgata gtgcagttaa 1861 atcggctcaa ttggtctatg ctgttgctac taacagcgcc aacgcgatca aagcctcttc 1921 atccacagtc attggctctg gaactgaaaa acgattcaaa atcttggtgc aaggttccaa 1981 attcgacagt ccccgacgca tcagtaccac tgagtacatt gttccagcta gtaagatgac 2041 tccccaaatt cagcggatta atcgtacttc tggcaaaatc gtcagcatta ctgaaattgt 2101 ctaaccttta acagggtggg cattaatact tgtctgttct aaatgttgaa ccgattagta 2161 ttaacaactc gtaattcgta gttactaatt cgtaattacg ttacggatag ggatttagct 2221 cccacctgta acggcatgat tttatagaag tcggagactc aaaccctcag cttaattacg 2281 aattacgaat tacgaattac gaattagtaa tttaggttga gacattgccc gccccagatc 2341 attcactttt ttaatttgta attttcaaaa a // LOCUS HIVPRBAC 582 bp ds-DNA SYN 14-AUG-1990 DEFINITION Synthetic HIV type-I protease (syn BAC) gene, complete cds. ACCESSION M36450 KEYWORDS HIV type-1 protease; HIV-PR gene. SOURCE Synthetic oligonucleotide DNA, clone pET3AM. ORGANISM Artificial gene Artificial sequences; Genes. 
REFERENCE 1 (bases 1 to 582) AUTHORS Cheng,Y.-S.E., McGowan,M.H., Kettner,C.A., Schloss,J.V., Erickson-Viitanen,S. and Yin,F.H. TITLE High-level synthesis of recombinant HIV-1 protease and the recovery of active enzyme from inclusion bodies JOURNAL Gene 87, 243-248 (1990) STANDARD simple staff_entry FEATURES from to/span description pept 7 573 HIV type-I protease (BAC) BASE COUNT 120 a 125 c 170 g 167 t ORIGIN 1 gaattcatgg aagatctggc tttcctgcaa ggtaaggctc gtgagttctc ttctgaacag 61 actcgtgcta actctccgac tcgtcgtgag ctgcaggttt ggggtcgtga taacaactct 121 ccgtctgagg ctggcgccga tcgtcagggt accgtttctt ttaactttcc gcaggttact 181 ctgtggcagc gtccactggt tactattaag attggtggtc agctgaagga agctctgctg 241 gataccggtg ctgatgatac tgtactggag gagatgtctc tcccgggccg ttggaagccg 301 aagatgatcg gtggtattgg tggtttcatt aaggtacgtc agtatgatca gattctgatt 361 gagatctgtg gccataaggc gatcggtacc gttctggtgg gcccgactcc ggttaacatt 421 atcggccgta acctgctgac tcagatcggt tgtactctga actttccgat ttctccgatt 481 gagactgttc cggttaagct gaagccgggt atggatggtc cgaaggttaa gcagtggcca 541 ctgactgagg agaagattaa ggctctggtt taatagaagc tt // LOCUS HNNMTPIT 1336 bp ds-DNA ORG 14-AUG-1990 DEFINITION H.annuus mitochondrial plasmid pIT DNA. ACCESSION M36422 KEYWORDS mitochondrial plasmid. SOURCE H.annuus (strain CANP3) mitochondrial plasmid pIT DNA, clone pITC20. ORGANISM Mitochondrion Helianthus annuus Eukaryota; Plantae; Embryobionta; Magnoliophyta; Magnoliopsida; Asteridae; Asterales; Asteraceae; Compositae; Helianthus annuus. REFERENCE 1 (bases 1 to 1336) AUTHORS Perez,C., Dujon,B., Heizmann,P. and Berville,A. TITLE Sequence of a mitochondrial plasmid of sunflower (Helianthus annuus) and its relationship to other mitochondrial plasmids JOURNAL Plant Sci. 58, 59-69 (1988) STANDARD simple staff_entry BASE COUNT 382 a 314 c 290 g 350 t ORIGIN 1 bp upstream of EcoRI site. 
1 gaattctttt gttccctcaa caagcaacta atcacaagga aagaaaacaa aagacaaaga 61 aaggcggttt tgagacataa aaaataaatg ctaaacttgc tttttaatct caaaaggtcg 121 atcttgttgt tttgagaact caaatgcagc tatcattatc ggggtgctcc gccacccctc 181 gtttccatca tccctcgact gcagtcgagg gtacgtctcc cttccctaaa aaaagatcga 241 gctttttact tattacagaa gagagagagc accttacaga agagagacac atcgaccgtt 301 ttttacttat tacagaagag agatagcacc ttacagagag atagcacatc gaccgttttt 361 tacttattac agaagagaga gacctaaaaa ggttcaaatg agaaatatct taagtgatcg 421 tggaaactga gctttgtaaa ggagtagaga gtgtaagacg gggttgcgag tctttgcgag 481 ttgggtagtc acctctccaa cccattccgg acgttgcccc aagcccccta gcagcgctct 541 cgaaactcta caatgcgcag acaagaaggc gattgagtta acggaagcgt ctcaaaaaca 601 ttctttttga agaacaagaa gaagactcag cgctgggtaa gtggtttaag acttttcccg 661 accctcggct ccttggggag tagtgccccc aagacacagc cacgccaatc cgacgcggct 721 ccatacccct agtttagtat atagagctgt ttttcttttt ttctctgagg agcaaagaaa 781 aaagtaaaaa atttttttat ttttggacgg ccagcccggt aggaaggtga gataaatctc 841 cccctccccg gcctccccct ctgttaaggt cgagtcctta ctcaattcat cggatgaacg 901 gactcggctg gacttgttag ctgcataaaa gcatagaaca tagcccggag aatgaagatc 961 acccgaactt cattcccctg gctacaaccg gcagcaggag aagctcttct ctttgctggc 1021 caagttatcg ccatggggcc cccccccctc ccatggcttg ctttgttgct gctcttttac 1081 tgcctatagc tgacaagtcc gctctccccg acttaacacc gagaaaaaca aaaaacagct 1141 ctatatacta ggggaggaga gcggagtgag tgcaggccct tcaacgagag cattgttttg 1201 tggtccgggt tcggtattga attatgaaac tcctttctta ttgatgattt acggcctttt 1261 aaaacagttg ttcagtctca aatggtaccc tatattgatc ttgaaagaaa tcattttgtc 1321 tgtcaagatg gtcata // LOCUS HSE1GBA 3958 bp ds-DNA VRL 14-AUG-1990 DEFINITION Equine herpesvirus type 1 glycoprotein B (gB) gene, complete cds. ACCESSION M36298 KEYWORDS antigen; glycoprotein B; structural protein. SOURCE Equine herpesvirus type 1 (strain Ab1) DNA, from COS cells. ORGANISM Equine herpesvirus type 1 Viridae; ds-DNA enveloped viruses; Herpesviridae; Alphaherpesvirinae. 
REFERENCE 1 (bases 1 to 3958) AUTHORS Bonass,W.A., Elton,D.M., Stocks,J.-M., Killington,R.A., Meredith,D.M. and Halliburton,I.W. TITLE Molecular analysis of the Equine herpesvirus type-1 strain Ab1, glycoprotein B gene and its expression in COS cells JOURNAL Unpublished (1990) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by W.A.Bonass, 10-JUL-1990. Leeds University Dept. of Microbiology Leeds LS2 9JT, UNITED KINGDOM FEATURES from to/span description pept 603 3545 glycoprotein B precursor (gB) sigp 603 860 glycoprotein B signal protein matp 603 3542 glycoprotein B mRNA 475 > 3958 glycoprotein B mRNA signal 370 373 CAT box signal 453 458 TATA box signal 3554 3560 polyA signal site 2165 2166 internal cleavage site BASE COUNT 1019 a 1075 c 1010 g 854 t ORIGIN 1 gttaacgtgt atggcgaagt gctgctacat acgtacgtat cgtcctacaa cgggttttgc 61 gccaggcgcg ggttttgcgc ggcggtgagt cgagcgggta ccatcataga taaccgctct 121 agcacgtccg cgttcgactc gcatcagttc atgaaggcgg cgctgcttcg ccaccccatt 181 gaccagtcgc tcatgccgtc cataacacac aagtttttcg agctgatcaa cgggcccgtg 241 tttgacaacg ctggccacaa ctttgcgcag ccgccaaaca cggcattata ttacagcgtt 301 gaaaacgttg ggttgttacc gcatctcaag gaggaactag ctcggtttat gattactgcg 361 gctaaaggtg attggtcaat tagcgagttt caaaggtttt attgctttga gggagtgaca 421 ggtgtgacgg ccacgcagcg gctggcgtgg aaatatatcg gggagctcat cctagccgcc 481 gcagtattct cctcggtttt ccactgtgga gaggtgcgcc tcctgcgcgc agatcgtacc 541 tacccggact ccagcggcgc acagcgctgc gtgagcggca tttacataac ctacgaggcg 601 tcatgtcctc tggttgccgt tctgtcggcg gctccacatg gggcaattgg cgcggagacg 661 gtggtgattt acgacagcga cgtgttctct ctcctgtatg cagtgctcca gcagctggct 721 cctggatcgg gagccaacta ggcaatgttg gaaacttact cgccaccccc cacccgctgg 781 gaaagccggc atcatcgagg gtgggcacaa tagttctagc ctgtttgttg ctttttggaa 841 gctgtgttgt tagagccgta cccaccacgc caagcccccc aactagtact cccacttcca 901 tgtcaacgca ctcccatggg acagtagacc ctacgctgct ccccacagaa acgcccgacc 961 cactcagact ggctgtgcgc gagtccggta tactcgctga ggatggagac 
ttttacacct 1021 gcccaccgcc taccggatcc accgtcgtac gcatcgaacc acctagaact tgccccaagt 1081 ttgaccttgg gagaaacttc acggagggga ttgctgttat ttttaaggaa aacatcgctc 1141 cctacaaatt cagggcaaac gtatactaca aggacatcgt tgtaacacgt gtgtggaaag 1201 gatacagcca tacgtccctg tccgacagat acaatgacag ggttccggtt tcggtggagg 1261 agatcttcgg tctcatcgac agtaagggaa aatgttcgtc aaaggccgag tacctcagag 1321 ataacatcat gcaccacgcg taccacgacg acgaggacga ggtggagctt gatttggtgc 1381 cgtccaagtt tgcaactccg ggggccagag cctggcagac caccaacgat actacgtctt 1441 acgtggggtg gatgccatgg aggcactaca cgtcaacgtc tgtcaactgc atcgtcgagg 1501 aggtggaggc gcggtccgtc tacccctacg actccttcgc cctgtccacc ggtgatattg 1561 tgtacgcgtc tccgttttac ggcctgaggg ctgccgctcg catagagcac aatagctacg 1621 cgcaggacag tttcaggcaa gttgaagggt acaggccccg cgacttagac agtaaactac 1681 aagccgaaga gccggttacc aaaaatttta tcactacccc gcatgtcacc gtcagctgga 1741 actggaccga gaagaaagtc gaggcgtgta cgctgaccaa atggaaagag gtcgacgaac 1801 tcgtcaggga cgagttccgc gggtcctaca gatttactat tcgatccatc tcgtctacgt 1861 ttatcagtaa cactactcaa tttaagttgg aaagtgcccc ccttactgaa tgtgtatcca 1921 aagaagcaaa ggaagccata gactcgatat acaaaaagca gtacgagtct acgcacgtct 1981 ttagcggtga tgtggaatat tacctggcac gcggggggtt cttaattgca ttcagaccta 2041 tgctctccaa cgaactcgcc aggctgtacc tgaacgagct tgtgagatct aaccgcacct 2101 acgacctaaa aaatctattg aaccccaatg caaacaataa caataacacc acgcgaagac 2161 gcaggtctct cctgtcagta ccagaacctc agccaaccca agatggtgtg catagagaac 2221 aaattctaca tcgcttgcac aaacgagcag tggaggcaac ggcaggtacc gattcttcca 2281 acgtcaccgc caaacagctg gagctcatca aaaccacgtc gtctatcgag tttgccatgc 2341 tacagtttgc atacgatcac atccaatccc acgtcaatga aatgctaagt agaatagcaa 2401 ctgcgtggtg taccctccaa aacaaagagc ggaccctatg gaacgaaatg gtgaagatta 2461 acccgagcgc catagtctcc gcaacccttg acgagcgagt tgcagcgagg gtcctggggg 2521 acgtgatagc tataacgcac tgcgccaaaa tagagggcaa cgtgtacttg caaaactcca 2581 tgcgctcgat ggacagtaac acgtgctact cccgcccccc cgtaacattt acaattacta 2641 agaatgcaaa caacagaggg tcgatagaag gccagctggg agaggagaac gagattttca 
2701 cggagcgcaa gctgatcgag ccgtgcgccc tcaatcagaa gcgctacttt aagtttggca 2761 aagagtacgt ttactacgag aactacacgt tcgtccgcaa agtgcccccc acggaaatcg 2821 aggttatcag cacgtacgtt gaactaaact tgaccctttt ggaagaccgc gagtttctgc 2881 ccctggaggt gtacacgcgg gctgagctgg aggacaccgg cctgctagac tacagcgaaa 2941 tacagcgccg caaccagctc cacgctctca ggttttacga catcgacagc gtggtcaacg 3001 tggacaatac cgcagtgatt atgcagggga tcgccagctt tttcaagggc ctgggtaaag 3061 tgggggaggc cgtgggaacg ctcgttctcg cggccgccgg cgctgttgtt tcaaccgtat 3121 ctggaatagc ttcgttttta aacaacccat ttggggggct agccatcggc ctgctggtaa 3181 tcgccggcct ggtagctgcg ttttttgctt acagatatgt aatgcagatc cgcagtaacc 3241 ccatgaaagc tctatacccc ataacaacaa aggccttgaa aaacaaagcc aaaacttcct 3301 acggccagaa cgaggaggac gatgggagcg actttgatga ggccaagctt gaagaggctc 3361 gcgaaatgat caaatacatg tctatggttt cggccctgga aaagcaggaa aagaaagcta 3421 taaagaaaaa cagtggggtt ggcctgatcg ccagtaacgt ctcaaagctg gccctgcgaa 3481 ggcgcggtcc caaatatacc cgactccaac agaacgatac catggaaaat gaaaaaatgg 3541 tttaaacatg tttaataaat attatgacac gtactcaaag tgtgacctca tatttgcata 3601 accactttct agttccggcc ccaaggatat ttaagcctag tatctccgcc gaggtttcat 3661 cctcattcac caactcacac ttagagttga cgcttcctct tgcgcctttg ctctcgccgc 3721 tcctgtgtta gcgtatactg cccaagaaat ggattctcca cgcggtatct ccacagctac 3781 cggtgatgcc cacgccgagg ccgcggtttc cccagccgcg gaaatccaga taaaaacgga 3841 agcccccgat gtagacggac cagaagccac tactgagtgt ttagaccaca cctacaccca 3901 acagacaagc gggggtgatg gcctagatgc tatcgatacg gacgatctct ggagatgg // LOCUS HSE1GPEIA 4707 bp ds-DNA VRL 14-AUG-1990 DEFINITION Equine herpes virus type 1 glycoproteins gE, gI and 10 kDa protein genes, complete cds. ACCESSION M36299 KEYWORDS glycoprotein gE; glycoprotein gI. SOURCE Equine herpesvirus type 1 (strain Ab1) DNA, clone pUC19. ORGANISM Equine herpesvirus type 1 Viridae; ds-DNA enveloped viruses; Herpesviridae; Alphaherpesvirinae. REFERENCE 1 (bases 1 to 4707) AUTHORS Elton,D.M., Bonass,W.A., Killington,R.A., Meredith,D.M. and Halliburton,I.W. 
TITLE DNA sequence analysis of the 4.7Kb BamHI to EcoRI fragment of the Equine herpesvirus type-1 short unique region JOURNAL Unpublished (1990) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by D.M.Elton, 10-JUL-1990. University of Leeds Dept of Microbiology Leeds LS2 9JT, UNITED KINGDOM FEATURES from to/span description pept 594 1868 glycoprotein I precursor (gI) sigp 594 701 glycoprotein I precursor (gI) matp 702 1865 glycoprotein I precursor (gI) pept 2101 3753 glycoprotein E precursor (gE) sigp 2101 2208 glycoprotein E precursor (gE) matp 2209 3750 glycoprotein E precursor (gE) pept 4478 > 4707 10 kDa protein ORF < 1 484 ORF1 ORF 3750 4140 ORF4 signal 544 550 gI TATA box signal 1924 1929 gI polyA signal signal 1955 1958 gE CAT box signal 4139 4144 gE polyA signal BASE COUNT 1204 a 1291 c 1107 g 1105 t ORIGIN 1 gaattcaact tcccacaagg agagcatatg acatggttga agttctggtt cgtctacgat 61 ggtggaaacc taccagtgca gttttatgaa gcccaggcat tcgcaagacc cgtgcctccg 121 gataaccacc ctggatttga ttctgttgag tcggagatta cacaaaataa aacagacccg 181 aaaccaggcc aggcggaccc caaacccaat cagcctttta agtggcccag catcaaacac 241 ttggccccaa gactcgatga ggtggatgag gtcatagagc ccgtaacaaa gcccccaaaa 301 acgtctaaga gcaactctac gtttgtgggc atcagcgtcg gtttgggtat cgccggccta 361 gtattggtgg gcgtcattct atacgtctgc ttgcgtcgga agaaggaact gaaaaagtct 421 gcacagaacg gcttgactcg cctacgctcg acctttaagg atgttaaata tacccagctt 481 ccgtaaacag tgttgcgtaa cctgctggga ggtgtccacg gccttaaagc ttcgcggttt 541 ggagatataa cgcacaacct acaacaaacg cgacacagca agtagtagtc gctatggcca 601 aactcactgg gatgttcagc gctgcgatat tactgtctat ggctatatgc tcaaccgcaa 661 tcatatatcg cggagaacat atgagcatgt acctaaacgc cagctctgag tttgccgtgt 721 accccactga tcagtccctt gttttggttg gccacttgct ctttctcgac ggacaacgct 781 tacccaccac caactatagt gggctgatcg aattgattca ttacaactac tccagcgttt 841 gctacactgt tatccaaacg atatcgtatg aatcatgccc gcgtgtagcc aacaatgctt 901 tcagatcgtg cctccacaaa acttctaagc actaccacga ctatttccga gtcaatgcct 
961 ctgttgaaac caacgttctc ttaaacatca caaagccaca gcctacagat tccggggcgt 1021 atatccttcg cgtaaaactt gaccacgcgc caaccgcaga tgtttttgga gtttccgcct 1081 ttgtttacga tctaaaatct aaaacggtcc ccgatccaat gcccaccaca caaacggtag 1141 aacctacaac gagctatgtg tcgactccca catacgacta taccgatgac gtaaccaccg 1201 aaactgaatc cacatcaaca tctacccaac aggcgatgac ctccactcaa acccctagcg 1261 ctacatgggg aacccagcta accacagagc tgccgacaaa cgaaactgtg gttattggtc 1321 aggaggccct gttatgccat tggttccagc catcgacaag ggtgccgacc ctgtatctgc 1381 atctgttggg acgcactggc aatctcccgg aagatgttct actggtcgaa gactctgagt 1441 ttcttcgtac cacatcgcct gcacataggc cttctgcatc acccgctgac ggtgatgatt 1501 ttaaacagac aaactcaact tcccttaagg cgcgcaacaa gatcgtcgca atggtggtta 1561 tcccgaccgc gtgtgtacta atgctcctgt tggtggttgt cggtgccatc ataaacggtg 1621 ccgtgcgcaa acatttattg agttgcgcaa gccgcaggat ctaccgctcc ggacaggggg 1681 gcgcatcggc ggccgaacgg agacggctga cttgcggtcc tactttagcc gcgtcatcgg 1741 agtcgctggc cgacgataca acgtcatcac ctccaacccc caaaccttcg aagaaaacca 1801 agttggagac cgatccgctt atggaacagc tgaaccggaa actggaggcc atcaaagaag 1861 aatcatagtt gtgggggtag atggggttgg tattaaagtt tgtgtattat cgattttata 1921 tttattaaaa tttgtgaaac ataaacatct tgtgcaatgt ttacattatt tgtgattggg 1981 acggtccact gggaggtggt acaactcggg tttaaagctc tggatgtttg gtaggaaact 2041 cacagttctc cactttggcg tcaaagcaat cagacgtcta attcgaagta gaacgtcaca 2101 atggagctgt tggccgcaag tcgcgcttgt atattttttg ggctagtaac agtactcgat 2161 gcgtggggag tccaacaagt tgaactttcc gagggggctt gggctatgat cgacggaagg 2221 gacgttttaa cccctactaa cacaactact cgggtcacaa aggcctggac gtttttggaa 2281 acccctcccg gttgcgctgg cgacatatca gttaagaagg tgtgcgtgag ccatagtctg 2341 tgcgaagata acattataat aggaaagcac tgtaacctct taactgggga acatggcatt 2401 gcgttggccg agtttaacgt agtaaacgga tcgctgcgca gaacagacga tgtgtacttt 2461 gtgaatggta cagtctttcc aatccttgcc gaaacccgca gcgtcctaca aatccatagg 2521 gcaaccccct ctatcgcagg ggtttacacc ctccacgttt ccatcgacgg aatgatgaaa 2581 cactccgtcg tgctgctcac cgtcaagaag ccgcccaaac aaccgcaacc acgcttgcgc 2641 
gttaagaccc cgccacccgt aaccgttcct caggttcccg taaagaccca cacggatttt 2701 gtggtgcacg gataccactc gcgcgtgtac gctgatggcg aatctttcga gctgtcggtg 2761 aacctggagt cacatatcgt agagcccagc ttcagcgcgg agattcagtg gtactatatg 2821 aatacatcat cgtcatcatg cgatctattt cgagttttcg aaacctgcat ctttcacccg 2881 acagccatgg cctgcctgca cccggaacaa cacacctgca gcttcacatc ccccatcaga 2941 gcgaccaaga tcctacaccg ggtgtatgga aactgcagcg atcatggaaa ttcgtggcct 3001 tctaggtgcc atagcactct gctgggcaat cgtctatact ttattcaacc agcacagaac 3061 agagtggacc tgttgttcaa agacactccc gcgtcggcta ccgggctgta tgtgtttgta 3121 ttattgtaca acggacatcc ggaggcgtgg acgtatacgc tgctgtcaac cgcaaatcac 3181 tttatgaatg tgcttactga cgtgacccgc ccacggctag gagagcactt ttatacggac 3241 ctcgggcaca aaatcatcac tcctcatcca tctgtagcta ccactgaaga gttgggagct 3301 tggactcgac actacctcgc ctttttgctg gttattatct gcacgtgcgc ggcgctgcta 3361 gttgcattgg tggtgtgggg ctgtattctc tacatccgaa gcaaccgtaa gccgtatgaa 3421 gtgctgaacc cctttgaaac ggtttacacg agcgttccaa gcaacgaccc ctcggacgag 3481 gtcttggtgt ttgagcgcct agcttcggac tctgacgact ccttcgactc tgattcagac 3541 gaagagttgg aatacccacc acctcccaaa ccagctccac agctcccacc ataccagttt 3601 gtagacgggg gagacgcccc tagcggcagg tccggattca aggtttggtt ccgcgataca 3661 cccgaggcgt ccccggttcc tcttcataaa ccaacgctac agggtccaga ctacagccgg 3721 gtagcgtcga agctaaagtc gatactaaaa tgagcagcaa cagcgataac acagagtgct 3781 tcgggggagt caactatgcc gagggaatgc gcaacggtaa acgcaaccct gtcagaaaca 3841 gcacctttca agagtatctc gacgcgcgta acgcgcgtta tcccagatcc ggctcaacct 3901 ccgattccga cgaggactac acaaccagat caaagtacga gtcagatgtc agcgagttta 3961 aaaaaatgat ggatctggaa actctacctc ccccaaaggc tgagccgcaa gctcagaagg 4021 ccgagcctga tgctgcgaag gaggagccag tcagcaccac tagctacatc ttaaacgaat 4081 gggtggctcc tatgattggg cattttctgg caatgtgtat gtatgagttg cttttcaaat 4141 aaaaacaaac attaacccct gtaaacatcc gtttgtctac tgtgtatgat agagttaaac 4201 ccaaccctag agagttatgt atttaatgcc ctgggacccc gcggaagtca tatatccctc 4261 ggccccctca tttgggcgca cattgcctgc ccggcggcag tcttactccc ttagctcgcc 4321 ctcttgcata 
agataaacta ttcccctccc agctagtttc acccaccaga ttaagcgagg 4381 ttttccctct cagcgatcac ttttcaccac cgaagaacag gccctcatcg gtttccctcc 4441 gtgttttccc atccatctat ccaaccacta cattttcatg gagaaggcgg aggctgccgc 4501 agttgttata cccctgtcag tttccaaccc cagctaccgt ggaagcggta tgtccgacca 4561 agaagtaagc gaagaacaat ctgctggaga tgcctgggtg tctgcagcaa tggcagccgc 4621 agaggcggtg gctgctgccg ctacctccac cggaattgat aacactaacg actacacgta 4681 caccgctgct tctgagaatg gggatcc // LOCUS HUMADRA2RA 2072 bp ds-DNA PRI 14-AUG-1990 DEFINITION Human alpha-2-adrenergic receptor (alpha-2 c2) gene, complete cds. ACCESSION M34041 KEYWORDS G-protein coupled receptor protein; alpha-2-adrenergic receptor; plasma membrane protein. SOURCE Human placenta DNA, clone alpha-2 C2. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 2072) AUTHORS Lomasney,J.W., Lorenz,W., Allen,L.F., King,K., Regan,J.W., Yang-Feng,T.L., Caron,M.G. and Lefkowitz,R.J. TITLE Expansion of the alpha-2-adrenergic receptor family: Characterization of a human alpha-2-adrenergic receptor subtype, the gene for which is located on chromosome 2 JOURNAL Proc. Natl. Acad. Sci. U.S.A. 87, 5094-5098 (1990) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by J.W.Lomasney, 03-MAY-1990, for release after publication. FEATURES from to/span description pept 413 1765 alpha-2-adrenergic receptor (alpha-2 C2) /hgml_locus_uid="LY0100G" /nomgen="ADRA2RL1" /map="2" BASE COUNT 316 a 705 c 660 g 391 t ORIGIN Chromosome 2. 
1 ctgcaggcgc ggctggggcg aaacgtgcga gctgagcggg cgcaaggtcc tccgcgcctc 61 ctttaagaac cggcccagcc cggcccgcgc ccccagagcg tacggcatcc gcgtggcgga 121 gggcgcgact ttctccggtc ccgggcggga cggggacggc ggcgggacaa cttgggaaac 181 ttctctgggg cggacggcag ggaccccggg caccgtggag gaggatgtag gagggcggct 241 gctggtcctg ggtgttcccg acctcctagg ccccgctcgt ccaggccatg gggctccagc 301 gccctcgcgg cgccggaggg gcgacgctct tgtctagccg agccgggcag cgctgtcgtc 361 cacggtgcgc actgggcggg cagcgctccc tctgcccacc tcccgccccg tcatggacca 421 ccaggacccc tactccgtgc aggccacagc ggccatagcg gcggccatca ccttcctcat 481 tctctttacc atcttcggca acgctctggt catcctggct gtgttgacca gccgctcgct 541 gcgcgcccct cagaacctgt tcctggtgtc gctggccgcc gccgacatcc tggtggccac 601 gctcatcatc cctttctcgc tggccaacga gctgctgggc tactggtact tccggcgcac 661 gtggtgcgag gtgtacctgg cgctcgacgt gctcttctgc acctcgtcca tcgtgcacct 721 gtgcgccatc agcctggacc gctactgggc cgtgagccgc gcgctggagt acaactccaa 781 gcgcaccccg cgccgcatca agtgcatcat cctcactgtg tggctcatcg ccgccgtcat 841 ctcgctgccg cccctcatct acaagggcga ccagggcccc cagccgcgcg ggcgccccca 901 gtgcaagctc aaccaggagg cctggtacat cctggcctcc agcatcggat ctttctttgc 961 tccttgcctc atcatgatcc ttgtctacct gcgcatctac ctgatcgcca aacgcagcaa 1021 ccgcagaggt cccagggcca agggggggcc tgggcagggt gagtccaagc agccccgacc 1081 cgaccatggt ggggctttgg cctcagccaa actgccagcc ctggcctctg tggcttctgc 1141 cagagaggtc aacggacact cgaagtccac tggggagaag gaggaggggg agacccctga 1201 agatactggg acccgggcct tgccacccag ttgggctgcc cttcccaact caggccaggg 1261 ccagaaggag ggtgtttgtg gggcatctcc agaggatgaa gctgaagagg aggaagagga 1321 ggaggaggag gaggaagagt gtgaacccca ggcagtgcca gtgtctccgg cctcagcttg 1381 cagccccccg ctgcagcagc cacagggctc ccgggtgctg gccaccctac gtggccaggt 1441 gctcctgggc aggggcgtgg gtgctatagg tgggcagtgg tggcgtcgaa gggcgcacgt 1501 gacccgggag aagcgcttca ccttcgtgct ggctgtggtc attggcgttt ttgtgctctg 1561 ctggttcccc ttcttcttca gctacagcct gggcgccatc tgcccgaagc actgcaaggt 1621 gccccatggc ctcttccagt tcttcttctg gatcggctac tgcaacagct cactgaaccc 1681 tgttatctac accatcttca 
accaggactt ccgccgtgcc ttccggagga tcctgtgccg 1741 cccgtggacc cagacggcct ggtgagcccg cctgcgctgc ccctgtgggt tggtgcggtg 1801 gcgccggggt caccctgctt cttgccctgc tgtgtgtggc tgcctcccct gggctttctg 1861 ctccctgccc agatcctgta ggcctcatct taggaacccc ttgggagggg tgggcagggg 1921 ggctgctagc aagggtccca gtgaagcttc cccttgccgg cttagctgtg ggggacccct 1981 tctccaccct ctccctgagc acaggccgat ggaggtggtt caaatcctct ggaacatagc 2041 caagaccagg agaagagaga gcactttctt cc // LOCUS HUMARF1BA 1816 bp ss-mRNA PRI 14-AUG-1990 DEFINITION Human ADP-ribosylation factor (ARF1) mRNA, complete cds. ACCESSION M36340 M25203 KEYWORDS ADP-ribosylation factor. SOURCE Human foreskin fibroblast, cDNA to mRNA, clone pCD2. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 29 to 866) AUTHORS Bobak,D.A., Nightingale,M.S., Murtagh,J.J., Price,S.R., Moss,J. and Vaughan,M. TITLE Molecular cloning, characterization, and expression of human ADP-ribosylation factors: Two guanine nucleotide-dependent activators of cholera toxin JOURNAL Proc. Natl. Acad. Sci. U.S.A. 86, 6101-6105 (1989) STANDARD full staff_entry REFERENCE 2 (bases 1 to 1816) AUTHORS Kahn,R.A., Kern,F.G., Clark,J., Gelmann,E.P. and Rulka,C. TITLE Human ADP-ribosylation factors (ARF's): A functionally conserved family of GTP-binding proteins JOURNAL Unpublished (1990) STANDARD full staff_entry COMMENT Draft entry and computer copy of sequence [1] kindly submitted by D.Bobak, 02-JUN-1989. Draft entry and computer-readable sequence for [2] kindly submitted by R.A.Kahn, 10-JUL-1990. National Cancer Institute Lab of Biol. Chem. DCT Bldg. 
37, RM 5D-02 Bethesda, MD 20892 FEATURES from to/span description pept 78 623 ADP-ribosylation factor (ARF1) BASE COUNT 405 a 512 c 469 g 430 t ORIGIN 1 ggggaaaacc aacgcctggc tcggagcagc agcctctgag gtgtccctgg ccagtgtcct 61 tccacctgtc cacaagcatg gggaacatct tcgccaacct cttcaagggc ctttttggca 121 aaaaagaaat gcgcatcctc atggtgggcc tggatgctgc agggaagacc acgatcctct 181 acaagcttaa gctgggtgag atcgtgacca ccattcccac cataggcttc aacgtggaaa 241 ccgtggagta caagaacatc agcttcactg tgtgggacgt gggtggccag gacaagatcc 301 ggcccctgtg gcgccactac ttccagaaca cacaaggcct gatcttcgtg gtggacagca 361 atgacagaga gcgtgtgaac gaggcccgtg aggagctcat gaggatgctg gccgaggacg 421 agctccggga tgctgtcctc ctggtgttcg ccaacaagca ggacctcccc aacgccatga 481 atgcggccga gatcacagac aagctggggc tgcactcact acgccacagg aactggtaca 541 ttcaggccac ctgcgccacc agcggcgacg ggctctatga aggactggac tggctgtcca 601 atcagctccg gaaccagaag tgaacgcgac ccccctccct ctcactcctc ttgccctctg 661 ctttactctc atgtggcaaa cgtgcggctc gtggtgtgag tgccagaagc tgcctccgtg 721 gtttggtcac cgtgtgcatc gcaccgtgct gtaaatgtgg cagacgcagc ctgcggccag 781 gctttttatt taatgtaaat agtttttgtt tccaatgagg cagtttctgg tactcctatg 841 caatattact cagctttttt tattgtaaaa agaaaaatca actcactgtt cagtgctgag 901 aggggatgta ggcccatggg cacctggcct ccaggagtcg ctgtgttggg agagccggcc 961 acgcccttgg cttagagctg tgttgaaatc cattttggtg gttggtttta acccaaactc 1021 agtgcatttt ttaaaatagt taagaatcca agtcgagaac acttgaacac acagaaggga 1081 gaccccgcct agcatagatt tgcagttacg gcctggatgc cagtcgccag cccagctgtt 1141 cccctcggga acatgaggtg gtggtggcgc agcagactgc gatcaattct gcatggtcac 1201 agtagagatc cccgcaactc gcttgtcctt gggtcaccct gcattccata gccatgtgct 1261 tgtccctgtg ctcccacggt tcccaggggc caggctggga gcccacagcc accccactat 1321 gccgcaggcc gccctaccca ccttcaggca gcctatggga cgcaggcccc atctgtccct 1381 cggtccgcgt gtggccagag tggtccgtcg tccccaacac tcgtgctcgc tcagacactt 1441 tggcaggatg tctggggcct caccagcagg agcgcgtgca agccgggcag gcggtccacc 1501 tagacccaca gcccctcggg agcaccccac ctctgtgtgt gatgtagctt tctctccctc 1561 agcctgcaag 
ggtccgattt gccatcgaaa aagacaacct ctactttttt cttttgtatt 1621 ttgataaaca ctgaagctgg agctgttaaa tttatcttgg ggaaacctca gaactggtct 1681 atttggtgtc gtaggaacct cttactgctt tcaatacacg attagtaatc aactgttttg 1741 tatacttgtt ttcagttttc atttcgacaa acaagcactg taattatagc tattagaata 1801 aaatctctta actatt // LOCUS HUMARF2A 1529 bp ss-mRNA PRI 14-AUG-1990 DEFINITION Human ADP-ribosylation factor 2 (ARF2) mRNA, complete cds. ACCESSION M36341 M31890 KEYWORDS ADP-ribosylation factor 2. SOURCE Human breast cancer cell line MDA-MB 231, cDNA to mRNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 766) AUTHORS Monaco,L., Murtagh,J.J., Newman,K.B., Tsai,S.-C., Moss,J. and Vaughan,V. TITLE Selective amplification of an mRNA and related pseudogene for a human ADP-ribosylation factor, a guanine nucleotide-dependent protein activator of cholera toxin JOURNAL Unpublished (1990) STANDARD full staff_entry REFERENCE 2 (bases 66 to 1529) AUTHORS Kahn,R.A., Kern,F.G., Clark,J., Gelmann,E.P. and Rulka,C. TITLE Human ADP-ribosylation factors (ARF's): A functionally conserved family of GTP-binding proteins JOURNAL Unpublished (1990) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence [1] kindly submitted by L.Monaco, 02-FEB-1990. Draft entry and computer-readable sequence for [2] kindly submitted by R.A.Kahn, 10-JUL-1990. National Cancer Institute Lab of Biol. Chem. DCT Bldg. 
37, RM 5D-02 Bethesda, MD 20892 FEATURES from to/span description pept 130 672 ADP-ribosylation factor 2 (ARF2) BASE COUNT 427 a 294 c 295 g 513 t ORIGIN 1 ctgcctccct ctttcttcct ccgctctttc tcttccctct cgtttagttt gcctggagct 61 tgaaaggaga aagcacgggg tcgccccaaa ccccttctgc ttctgcccat cacaagtgcc 121 actaccgcca tgggcctcac tatctcctcc ctcttctccc gactatttgg caagaagcag 181 atgcgcattt tgatggttgg attggatgct gctggcaaga caaccattct gtataaactg 241 aagttagggg agatagtcac caccattcct accattggtt ttaatgtgga aacagtagaa 301 tataagaaca tttgtttcac agtatgggat gttggtggtc aagatagaat taggcctctc 361 tggaagcatt acttccagaa tacccagggt cttatttttg tggtagatag caacgatcgt 421 gaaagaattc aggaagtagc agatgagctg cagaaaatgc ttctggtaga tgaattgaga 481 gatgcagtgc tgctactttt tgcaaacaaa caggatttgc caaatgctat ggccatcagt 541 gaaatgacag ataaactagg gcttcagtct cttcgtaaca gaacatggta tgttcaagcc 601 acttgtgcaa cacaaggaac tggtctgtat gaaggacttg actggctgtc aaatgagctt 661 tcaaaacgtt aaatgaaatt ggatatctaa ccaaggacat gtttgataaa attggtctag 721 gcttgttaca acaaaattag tttgtatctt ggttattaaa cagtatctgg gactggtttg 781 ggcagaatat taaacttatt ttgttgccaa ttattgttta ccgagtataa tgttgctatt 841 tagcaatgtg cttggtttta aagaaattct ccttgggaaa aaagtatcct cttttaattt 901 tacttcccat aagcgtaaat gcctggacat agctcttgtg aacctttaaa taaattgttt 961 gagtgttttt gagccccaga caaataatgt tttaaagtta tcccttgcta ctttactgat 1021 acctttatca ttcctgagac agtttgctaa tttaaaaatg tagcattcca tttgtattta 1081 tttctctccc ttgccaaaaa gattttctaa tactgcttgt accagccaga gaaagatcca 1141 aaacactact cagctctctt gcactgagga aatttttccc cctacattga ctcctggcct 1201 acatcagcca aacttaacct tggtggggtt tggatttgat agccaattag ttctgtgctg 1261 gttgcaaaga attgatattt agatggtttt taatactcag cagattgtct tcccatattg 1321 tgtctttttt atgttgcatg ttgcttttgt tatcagcctg attttttgct cagtatatga 1381 tagttctgct gatgttttgt ttattgggca gacatatctt cattaagagt ttttggaaaa 1441 ctcatcaaat tcgatgaata cattttcttc ataacccatt tggaattatt cctaataaaa 1501 tgataaaata cgtaaaaaaa aaggaattc // LOCUS HUMCDC25HS 2055 bp ss-mRNA PRI 14-AUG-1990 
DEFINITION Human cdc25Hs mRNA, complete cds. ACCESSION M34065 KEYWORDS mitotic inducer. SOURCE Human HeLa cell line D98/AH-2, cDNA to mRNA, clone BSK1. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 2055) AUTHORS Sadhu,K., Reed,B.I., Richardson,H. and Russell,P. TITLE Human homolog of fission yeast cdc25 mitotic inducer is predominantly expressed in G-2 JOURNAL Proc. Natl. Acad. Sci. U.S.A. 87, 5139-5143 (1990) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by P.Russell, 08-MAY-1990, for release after publication. FEATURES from to/span description pept 211 1632 CDC25Hs ORF mRNA < 1 2055 cdc25Hs mRNA BASE COUNT 576 a 472 c 477 g 530 t ORIGIN 1 caggaagact ctgagtccga cgttggccta cccagtcgga aggcagagct gcaatctagt 61 taactacctc ctttccccta gatttccttt cattctgctc aagtcttcgc ctgtgtccga 121 tccctatcta ctttctctcc tcttgtagca agcctcagac tccaggcttg agctaggttt 181 tgtttttctc ctggtgagaa ttcgaagacc atgtctacgg aactcttctc atccacaaga 241 gaggaaggaa gctctggctc aggacccagt tttaggtcta atcaaaggaa aatgttaaac 301 ctgctcctgg agagagacac ttcctttacc gtctgtccag atgtccctag aactccagtg 361 ggcaaatttc ttggtgattc tgcaaaccta agcattttgt ctggaggaac cccaaaatgt 421 tgcctcgatc tttcgaatct tagcagtggg gagataactg ccactcagct taccacttct 481 gcagaccttg atgaaactgg tcacctggat tcttcaggac ttcaggaagt gcatttagct 541 gggatgaatc atgaccagca cctaatgaaa tgtagcccag cacagcttct ttgtagcact 601 ccgaatggtt tggaccgtgg ccatagaaag agagatgcaa tgtgtagttc atctgcaaat 661 aaagaaaatg acaatggaaa cttggtggac agtgaaatga aatatttggg cagtcccatt 721 actactgttc caaaattgga taaaaatcca aacctaggag aagaccaggc agaagagatt 781 tcagatgaat taatggagtt ttccctgaaa gatcaagaag caaaggtgag cagaagtggc 841 ctatatcgct ccccgtcgat gccagagaac ttgaacaggc caagactgaa gcaggtggaa 901 aaattcaagg acaacacaat accagataaa gttaaaaaaa agtatttttc tggccaagga 961 aagctcagga agggcttatg tttaaagaag acagtctctc 
tgtgtgacat tactatcact 1021 cagatgctgg aggaagattc taaccagggg cacctgattg gtgatttttc caaggtatgt 1081 gcgctgccaa ccgtgtcagg gaaacaccaa gatctgaagt atgtcaaccc agaaacagtg 1141 gctgccttac tgtcggggaa gttccagggt ctgattgaga agttttatgt cattgattgt 1201 cgctatccat atgagtatct gggaggacac atccagggag ccttaaactt atatagtcag 1261 gaagaactgt ttaacttctt tctgaagaag cccatcgtcc ctttggacac ccagaagaga 1321 ataatcatcg tgttccactg tgaattctcc tcagagaggg gcccccgaat gtgccgctgt 1381 ctgcgtgaag aggacaggtc tctgaaccag tatcctgcat tgtactaccc agagctatat 1441 atccttaaag gcggctacag agacttcttt ccagaatata tggaactgtg tgaaccacag 1501 agctactgcc ctatgcatca tcaggaccac aagactgagt tgctgaggtg tcgaagccag 1561 agcaaagtgc aggaagggga gcggcagctg cgggagcaga ttgcccttct ggtgaaggac 1621 atgagcccat gataacattc cagccactgg ctgctaacaa gtcaccaaaa agacactgca 1681 gaaaccctga gcagaaagag gccttctgga tggccaaacc caagattatt aaaagatgtc 1741 tctgcaaacc aacaggctac caacttgtat ccaggcctgg gaatggatta ggtttcagca 1801 gagctgaaag ctggtggcag agtcctggag ctggctctat aaggcagcct tgagttgcat 1861 agagatttgt attggttcag ggaactctgg cattcctttt cccaactcct catgtcttct 1921 cacaagccag ccaactcttt ctctctgggc ttcgggctat gcaagagcgt tgtctacctt 1981 ctttctttgt attttccttc tttgtttccc cctctttctt ttttaaaaat ggaaaaataa 2041 acactacaga atgag // LOCUS HUMDBTG 1401 bp ds-DNA PRI 14-AUG-1990 DEFINITION Human delta-beta-thalassemia gene region DNA. ACCESSION M36375 KEYWORDS delta-beta-thalassemia. SOURCE Human 30 yr old japanese female blood cell DNA, clone Jp-thal. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 1401) AUTHORS Shiolawa,S., Yamada,H., Takihara,Y., Matsunaga,E., Ohba,Y., Yamamoto,K. and Fukumaki,Y. 
TITLE Molecular analysis of japanese delta-beta-thalassemia JOURNAL Blood 72, 1771-1776 (1988) STANDARD simple staff_entry BASE COUNT 390 a 341 c 237 g 433 t ORIGIN 1 gagctcttta gtttaattag attccattgg tcaatttttg cttttgctgc aattgctttt 61 cacgctttca tcatgaaatc tgtgcccgtg tttatatcat gaatagtatt gccttgattt 121 ttttctaggc tttttatagt ttggggtttt tcatttaagt ctctaatagc tcatagaatc 181 tttcttcaac tcctgcaatc aaagggaaac gtaataggat tcttgtaaca gagggaactc 241 tattagcaaa catgggtcca agaacagctc cccccaccac caccaccacc acccgccccc 301 accatccctg ctcctggagc tggagtgcaa tggcacaatc acctactcct cccaggttca 361 agcaattccc ttgcctcagc ctcccgagta gctaggacta caggtacatg ccaccacacc 421 cagctaattt tttgtatttt agtagagatg gggtttcacc atgttggcca ggatggtctc 481 gatctcctga cctcgtgatc tgcccacttc agcctcccaa agtgctggga ttacaggcgt 541 gagccactgc tcccggccaa taatagcttt ttaaaaaaaa ttctgtgatt cttaaaccag 601 tcttgagttg tcctgcgcta aaactaatga gcaaatatta aactcttctt aaacagttct 661 ctttaagtgt ctctttctcc cccactgaga attcttccat gacagagcct ataacttaaa 721 catccctatt accagactac caggaacatc taaatagaca gccctcacaa caatgcttca 781 cattcattac tatcacaaca ccattattgg gtattaataa ctccccttta tgggtgagaa 841 agcagggtaa gattagataa tcgagtctca gaggaaacat acgacgttct caaattcacc 901 caacaagcca atggcagatt caggaattga gctctgttct tactccaaac gccatgctct 961 cgcaaactcc ttatttaaat ttttttttct ttttttactg tggcatgaac aaacacatgc 1021 tgatgactgt agtctatatt tggaataact attttgaacg ctacaacgat atttttgagt 1081 tgacataacg tatgaaaata cactactgag ttcatgctca catttactct gaatacttcc 1141 gcgtttgttt catatactat ctgaaggtta tacagtacct aatttattta taactgtctt 1201 caatgcacta caacactgac cttgtgcaca gaattcttgg gtctggaagg tgaggttatt 1261 tgatcctgcc ccctgctatt tgaaggatca tcccacacag cagagaatct attaatgcaa 1321 agcaagtgag agagagttat ctcaacctca ctaacttact tgcagaaagg ttctgtacta 1381 ctatagctgc ggaatcaccc c // LOCUS HUMHER3A 4975 bp ss-mRNA PRI 14-AUG-1990 DEFINITION Human epidermal growth factor receptor (HER3) mRNA, complete cds. 
ACCESSION M34309 KEYWORDS HER3 gene; epidermal growth factor receptor. SOURCE Human cell line MDA-MB-361, cDNA to mRNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 4975) AUTHORS Plowman,G.D., Whitney,G.S., Neubauer,M.G., Green,J.M., McDonald,V.L., Todaro,G.J. and Shoyab,M. TITLE Molecular cloning and expression of another epidermal growth factor receptor-related gene JOURNAL Proc. Natl. Acad. Sci. U.S.A. 87, 4905-4909 (1990) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by G.D.Plowman, 16-MAY-1990, for release after publication. FEATURES from to/span description pept 199 4227 HER3 protein precursor sigp 199 255 HER3 signal peptide matp 256 4224 HER3 protein BASE COUNT 1195 a 1309 c 1327 g 1144 t ORIGIN Chromosome 12q13. 1 ctctcacaca cacacacccc tcccctgcca tccctccccg gactccggct ccggctccga 61 ttgcaatttg caacctccgc tgccgtcgcc gcagcagcca ccaattcgcc agcggttcag 121 gtggctcttg cctcgatgtc ctagcctagg ggcccccggg ccggacttgg ctgggctccc 181 ttcaccctct gcggagtcat gagggcgaac gacgctctgc aggtgctggg cttgcttttc 241 agcctggccc ggggctccga ggtgggcaac tctcaggcag tgtgtcctgg gactctgaat 301 ggcctgagtg tgaccggcga tgctgagaac caataccaga cactgtacaa gctctacgag 361 aggtgtgagg tggtgatggg gaaccttgag attgtgctca cgggacacaa tgccgacctc 421 tccttcctgc agtggattcg agaagtgaca ggctatgtcc tcgtggccat gaatgaattc 481 tctactctac cattgcccaa cctccgcgtg gtgcgaggga cccaggtcta cgatgggaag 541 tttgccatct tcgtcatgtt gaactataac accaactcca gccacgctct gcgccagctc 601 cgcttgactc agctcaccga gattctgtca gggggtgttt atattgagaa gaacgataag 661 ctttgtcaca tggacacaat tgactggagg gacatcgtga gggaccgaga tgctgagata 721 gtggtgaagg acaatggcag aagctgtccc ccctgtcatg aggtttgcaa ggggcgatgc 781 tggggtcctg gatcagaaga ctgccagaca ttgaccaaga ccatctgtgc tcctcagtgt 841 aatggtcact gctttgggcc caaccccaac cagtgctgcc atgatgagtg tgccgggggc 901 tgctcaggcc ctcaggacac agactgcttt 
gcctgccggc acttcaatga cagtggagcc 961 tgtgtacctc gctgtccaca gcctcttgtc tacaacaagc taactttcca gctggaaccc 1021 aatccccaca ccaagtatca gtatggagga gtttgtgtag ccagctgtcc ccataacttt 1081 gtggtggatc aaacatcctg tgtcagggcc tgtcctcctg acaagatgga agtagataaa 1141 aatgggctca agatgtgtga gccttgtggg ggactatgtc ccaaagcctg tgagggaaca 1201 ggctctggga gccgcttcca gactgtggac tcgagcaaca ttgatggatt tgtgaactgc 1261 accaagatcc tgggcaacct ggactttctg atcaccggcc tcaatggaga cccctggcac 1321 aagatccctg ccctggaccc agagaagctc aatgtcttcc ggacagtacg ggagatcaca 1381 ggttacctga acatccagtc ctggccgccc cacatgcaca acttcagtgt tttttccaat 1441 ttgacaacca ttggaggcag aagcctctac aaccggggct tctcattgtt gatcatgaag 1501 aacttgaatg tcacatctct gggcttccga tccctgaagg aaattagtgc tgggcgtatc 1561 tatataagtg ccaataggca gctctgctac caccactctt tgaactggac caaggtgctt 1621 cgggggccta cggaagagcg actagacatc aagcataatc ggccgcgcag agactgcgtg 1681 gcagagggca aagtgtgtga cccactgtgc tcctctgggg gatgctgggg cccaggccct 1741 ggtcagtgct tgtcctgtcg aaattatagc cgaggaggtg tctgtgtgac ccactgcaac 1801 tttctgaatg gggagcctcg agaatttgcc catgaggccg aatgcttctc ctgccacccg 1861 gaatgccaac ccatgggggg cactgccaca tgcaatggct cgggctctga tacttgtgct 1921 caatgtgccc attttcgaga tgggccccac tgtgtgagca gctgccccca tggagtccta 1981 ggtgccaagg gcccaatcta caagtaccca gatgttcaga atgaatgtcg gccctgccat 2041 gagaactgca cccaggggtg taaaggacca gagcttcaag actgtttagg acaaacactg 2101 gtgctgatcg gcaaaaccca tctgacaatg gctttgacag tgatagcagg attggtagtg 2161 attttcatga tgctgggcgg cacttttctc tactggcgtg ggcgccggat tcagaataaa 2221 agggctatga ggcgatactt ggaacggggt gagagcatag agcctctgga ccccagtgag 2281 aaggctaaca aagtcttggc cagaatcttc aaagagacag agctaaggaa gcttaaagtg 2341 cttggctcgg gtgtctttgg aactgtgcac aaaggagtgt ggatccctga gggtgaatca 2401 atcaagattc cagtctgcat taaagtcatt gaggacaaga gtggacggca gagttttcaa 2461 gctgtgacag atcatatgct ggccattggc agcctggacc atgcccacat tgtaaggctg 2521 ctgggactat gcccagggtc atctctgcag cttgtcactc aatatttgcc tctgggttct 2581 ctgctggatc atgtgagaca acaccggggg gcactggggc 
cacagctgct gctcaactgg 2641 ggagtacaaa ttgccaaggg aatgtactac cttgaggaac atggtatggt gcatagaaac 2701 ctggctgccc gaaacgtgct actcaagtca cccagtcagg ttcaggtggc agattttggt 2761 gtggctgacc tgctgcctcc tgatgataag cagctgctat acagtgaggc caagactcca 2821 attaagtgga tggcccttga gagtatccac tttgggaaat acacacacca gagtgatgtc 2881 tggagctatg gtgtgacagt ttgggagttg atgaccttcg gggcagagcc ctatgcaggg 2941 ctacgattgg ctgaagtacc agacctgcta gagaaggggg agcggttggc acagccccag 3001 atctgcacaa ttgatgtcta catggtgatg gtcaagtgtt ggatgattga tgagaacatt 3061 cgcccaacct ttaaagaact agccaatgag ttcaccagga tggcccgaga cccaccacgg 3121 tatctggtca taaagagaga gagtgggcct ggaatagccc ctgggccaga gccccatggt 3181 ctgacaaaca agaagctaga ggaagtagag ctggagccag aactagacct agacctagac 3241 ttggaagcag aggaggacaa cctggcaacc accacactgg gctccgccct cagcctacca 3301 gttggaacac ttaatcggcc acgtgggagc cagagccttt taagtccatc atctggatac 3361 atgcccatga accagggtaa tcttgggggg tcttgccagg agtctgcagt ttctgggagc 3421 agtgaacggt gcccccgtcc agtctctcta cacccaatgc cacggggatg cctggcatca 3481 gagtcatcag aggggcatgt aacaggctct gaggctgagc tccaggagaa agtgtcaatg 3541 tgtagaagcc ggagcaggag ccggagccca cggccacgcg gagatagcgc ctaccattcc 3601 cagcgccaca gtctgctgac tcctgttacc ccactctccc cacccgggtt agaggaagag 3661 gatgtcaacg gttatgtcat gccagataca cacctcaaag gtactccctc ctcccgggaa 3721 ggcacccttt cttcagtggg tctcagttct gtcctgggta ctgaagaaga agatgaagat 3781 gaggagtatg aatacatgaa ccggaggaga aggcacagtc cacctcatcc ccctaggcca 3841 agttcccttg aggagctggg ttatgagtac atggatgtgg ggtcagacct cagtgcctct 3901 ctgggcagca cacagagttg cccactccac cctgtaccca tcatgcccac tgcaggcaca 3961 actccagatg aagactatga atatatgaat cggcaacgag atggaggtgg tcctgggggt 4021 gattatgcag ccatgggggc ctgcccagca tctgagcaag ggtatgaaga gatgagagct 4081 tttcaggggc ctggacatca ggccccccat gtccattatg cccgcctaaa aactctacgt 4141 agcttagagg ctacagactc tgcctttgat aaccctgatt actggcatag caggcttttc 4201 cccaaggcta atgcccagag aacgtaactc ctgctccctg tggcactcag ggagcattta 4261 atggcagcta gtgcctttag agggtaccgt cttctcccta ttccctctct 
ctcccaggtc 4321 ccagcccctt ttccccagtc ccagacaatt ccattcaatc tttggaggct tttaaacatt 4381 ttgacacaaa attcttatgg tatgtagcca gctgtgcact ttcttctctt tcccaacccc 4441 aggaaaggtt ttccttattt tgtgtgcttt cccagtccca ttcctcagct tcttcacagg 4501 cactcctgga gatatgaagg attactctcc atatcccttc ctctcaggct cttgactact 4561 tggaactagg ctcttatgtg tgcctttgtt tcccatcaga ctgtcaagaa gaggaaaggg 4621 aggaaaccta gcagaggaaa gtgtaatttt ggtttatgac tcttaacccc ctagaaagac 4681 agaagcttaa aatctgtgaa gaaagaggtt aggagtagat attgattact atcataattc 4741 agcacttaac tatgagccag gcatcatact aaacttcacc tacattatct cacttagtcc 4801 tttatcatcc ttaaaacaat tctgtgacat acatattatc tcattttaca caaagggaag 4861 tcgggcatgg tggctcatgc ctgtaatctc agcactttgg gaggctgagg cagaaggatt 4921 acctgaggca aggagtttga gaccagctta gccaacatag taagaccccc atctc // LOCUS HUMMEHA 959 bp ss-mRNA PRI 14-AUG-1990 DEFINITION Human microsomal epoxide hydrolase (HmEH) mRNA, partial cds. ACCESSION M36374 KEYWORDS microsomal epoxide hydrolase. SOURCE Human liver, cDNA to mRNA, clone HmEH7. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 959) AUTHORS Craft,J.A., Jackson,M.R. and Burchell,B. TITLE Partial nucleotide sequence of a cloned cDNA for human liver microsomal epoxide hydrolase JOURNAL Biochem. Soc. Trans. 
15, 708-709 (1987) STANDARD simple staff_entry FEATURES from to/span description pept < 1 > 959 microsomal epoxide hydrolase (AA at 3) (EC 3.3.2.3) BASE COUNT 217 a 272 c 251 g 219 t ORIGIN 1 cttcagtgct gggctttgcc atctactggt tcatctcccg ggacaaagag gaaactttgc 61 cacttgaaga tgggtggtgg gggccaggca cgaggtccgc agccagggag gacgacagca 121 tccgcccttt caaggtggaa acgtcagatg aggagatcca cgacttacac cagaggatcg 181 ataagttccg tttcacccca cctttggagg acagctgctt ccactatggc ttcaactcca 241 actacctgaa gaaagtcatc tcctactggc ggaatgaatt tgactggaag aagcaggtgg 301 agattctcaa caagtaccct cacttcaaga ctaaaattga agggctggac atccacttca 361 tccacgtgaa gcccccccag ctgcccgcag gccatacccc gaagcccttg ctgatggtga 421 acggctggcc cggctctttc tacgagtttt ataagatcat cccactcctg actgacccca 481 agaaccatgg cctgagcgac gagcacgttt ttgaagtcat ctgcccttcc atccctggct 541 atggcttctc agaggcatcc tccaagaagg ggttcaactc ggtggccacc gccaggatct 601 tttacaagct gatgctgcgg ctgggcttcc aggaattcta cattcaagga ggggactggg 661 ggtccctgat ctgcactaat atggcccagc tggtgcccag ccacctgaaa ggcctgcact 721 tgaacatggc tttggtttta agcaacttct ctaccctgac cctcctcctg ggacagcgtt 781 tcgggaggtt tcttggcctc actgagaggg atgtggagct gctgtacccc gtcaaggaga 841 aggtattcta cagcctgatg agggagagcg gctacatgca catccagtgc accaagcctg 901 acaccgtagg ctctgctctg aatgactctc ctgtgggtct ggctgcctat attctagag // LOCUS HUMRNPSMBA 847 bp ss-mRNA PRI 14-AUG-1990 DEFINITION Human small nuclear ribonucleoprotein particle SmB mRNA, 3' end. ACCESSION M34081 KEYWORDS lupus autoantigen; small nuclear ribonucleoprotein particle. SOURCE Human HeLa cell, cDNA to mRNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 847) AUTHORS Elkon,K.B., Hines,J.J., Chu J,-L. and Parnassa,A.P. TITLE Epitope mapping of recombinant HeLa SmB and B' peptides obtained by the polymerase chain reaction JOURNAL J. Immunol. 
145, 636-643 (1990) STANDARD full staff_entry COMMENT Authorin copy for [1] kindly submitted by K.B.Elkon 23-APR-1990, for release after publication. FEATURES from to/span description pept < 1 675 small nuclear ribonucleoprotein particle (SmB) mRNA < 1 847 snRNP SmB mRNA BASE COUNT 170 a 251 c 252 g 174 t ORIGIN 1 aagatgctgc agcatattga ttacaggatg aggtgcatcc tgcaggacgg ccggatcttc 61 attggcacct tcaaggcttt tgacaagcac atgaatttga tcctctgtga ctgtgatgag 121 ttcagaaaga tcaagccaaa gaactccaaa caagcagaaa gggaagagaa gcgagtcctc 181 ggtctggtgc tgctgcgagg ggagaatctg gtctcaatga cagtagaggg acctcctccc 241 aaagatactg gtattgctcg agttccactt gctggagctg ccgggggccc agggatcggc 301 agggctgctg gcagaggaat cccagctggg gttcccatgc cccaggctcc tgcaggactt 361 gctgggccag tccgtggggt tggcgggcca tcccaacagg tgatgacccc acaaggaaga 421 ggtactgttg cagccgctgc agctgctgcc acagccagta ttgccggggc tccaacccag 481 tacccacctg gccgtggggg tcctccccca cctatgggcc gaggagcacc ccctccaggc 541 atgatgggcc cacctcctgg tatgagacct cctatgggtc ccccaatggg gatcccccct 601 ggaagaggga ctccaatggg catgccccct ccgggaatgc ggcctcctcc ccctgggatg 661 cgaggccttc tttgaccctt ggccacagag tatggaagta gctccgcaga ggcgtgggct 721 cgattcctca gggccacgtt accacagacc tgtttgtttc ttatgctgtt gttcgtggag 781 tctcatggga ttgtctggtt tccttacagg gccccctccc ccgggaatgc gcccaccaag 841 gccctag // LOCUS HUMRNPSMBB 702 bp ss-mRNA PRI 14-AUG-1990 DEFINITION Human small nuclear ribonucleoprotein particle SmB' mRNA, 3' end. ACCESSION M34082 KEYWORDS lupus autoantigen; small nuclear ribonucleoprotein particle. SOURCE Human HeLa cell, cDNA to mRNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 702) AUTHORS Elkon,K.B., Hines,J.J., Chu J,-L. and Parnassa,A.P. TITLE Epitope mapping of recombinant HeLa SmB and B' peptides obtained by the polymerase chain reaction JOURNAL J. Immunol. 
145, 636-643 (1990) STANDARD full staff_entry COMMENT Authorin copy for [1] kindly submitted by K.B.Elkon 23-APR-1990, for release after publication. FEATURES from to/span description pept < 1 702 small nuclear ribonucleoprotein particle (SmB') BASE COUNT 147 a 215 c 212 g 128 t ORIGIN 1 aagatgctgc agcatattga ttacaggatg aggtgcatcc tgcaggacgg ccggatcttc 61 attggcacct tcaaggcttt tgacaagcac atgaatttga tcctctgtga ctgtgatgag 121 ttcagaaaga tcaagccaaa gaactccaaa caagcagaaa gggaagagaa gcgagtcctc 181 ggtctggtgc tgctgcgagg ggagaatctg gtctcaatga cagtagaggg acctcctccc 241 aaagatactg gtattgctcg agttccactt gctggagctg ccgggggccc agggatcggc 301 agggctgctg gcagaggaat cccagctggg gttcccatgc cccaggctcc tgcaggactt 361 gctgggccag tccgtggggt tggcgggcca tcccaacagg tgatgacccc acaaggaaga 421 ggtactgttg cagccgctgc agctgctgcc acagccagta ttgccggggc tccaacccag 481 tacccacctg gccgtggggg tcctccccca cctatgggcc gaggagcacc ccctccaggc 541 atgatgggcc cacctcctgg tatgagacct cctatgggtc ccccaatggg gatcccccct 601 ggaagaggga ctccaatggg catgccccct ccgggaatgc ggcctcctcc ccctgggatg 661 cgagggcccc ctcccccggg aatgcgccca ccaaggccct ag // LOCUS HUMTPMYOC 1191 bp ss-mRNA PRI 14-AUG-1990 DEFINITION Human skeletal muscle beta-tropomyosin mRNA, complete cds. ACCESSION X06825 M36268 KEYWORDS actin-binding protein; beta-tropomyosin. SOURCE Human skeletal muscle, cDNA to mRNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 1191) AUTHORS Liautard,J.P. JOURNAL Unpublished (1988) CRBM du CNRS, 34033 Montpelier Cedex, France STANDARD simple automatic REFERENCE 2 (bases 1 to 1191) AUTHORS Widada,J.S., Ferraz,C., Capony,J.-P. and Liautard,J.-P. TITLE Complete nucleotide sequence of the adult skeletal isoform of human skeletal muscle beta-tropomyosin JOURNAL Nucleic Acids Res. 
16, 3109-3109 (1988) STANDARD simple staff_entry FEATURES from to/span description pept 107 961 beta-tropomyosin mRNA < 1 1191 beta-tropomyosin mRNA BASE COUNT 321 a 320 c 368 g 182 t ORIGIN 1 cccgctccgt cctcctcgcc tgccaccggt gcacccagtc cgctcaccca gcccagtccg 61 tccggtcctc accgcctgcc ggccggccca ccccccaccg caggccatgg acgccatcaa 121 gaagaagatg cagatgctga agctggacaa ggagaacgcc atcgaccgcg ccgagcaggc 181 cgaagccgac aagaagcaag ctgaggaccg ctgcaagcag ctggaggagg agcagcaggc 241 cctccagaag aagctgaagg ggacagagga tgaggtggaa aagtattctg aatccgtgaa 301 ggaggcccag gagaaactgg agcaggccga gaagaaggcc actgatgctg aggcagatgt 361 ggcctccctg aaccgccgca ttcagctggt tgaggaggag ctggaccggg cccaggagcg 421 cctggctaca gccctgcaga agctggagga ggccgagaag gcggctgatg agagcgagag 481 aggaatgaag gtcatcgaaa accgggccat gaaggatgag gagaagatgg aactgcagga 541 gatgcagctg aaggaggcca agcacatcgc tgaggattca gaccgcaaat atgaagaggt 601 ggccaggaag ctggtgatcc tggaaggaga gctggagcgc tcggaggaga gggctgaggt 661 ggccgagagt aaatgtgggg acctagagga ggagctgaaa attgttacca acaacttgaa 721 atccctggag gcccaggcgg acaagtattc caccaaagaa gataaatatg aagaggagat 781 caaactgttg gaggagaagc tgaaggaggc tgagacccga gcagagtttg ccgagaggtc 841 tgtggcaaag ttggagaaaa ccatcgatga cctagaagat gaagtctatg cccagaagat 901 gaagtacaag gccattagcg aggaactgga caacgcactc aatgacatca cctccctctg 961 agccccacgc ccagcgtgcc acctcagctc tcttctctcc tctcctttcc attctctcta 1021 tggggagggg agagcaggca ggaggagcag aaattgccaa cattgcacag ccaggctggg 1081 agcagcctag ggagagcccc catcatgccc accacccact ctggcactgg cttcatcctt 1141 tacctatccc cttccaccct cctttgcttg cttaataaat tctgaacttg g // LOCUS HUMTRHA 1322 bp ss-mRNA PRI 14-AUG-1990 DEFINITION Human thyroid releasing hormone (TRH) mRNA, complete cds. ACCESSION M36317 KEYWORDS thyroid releasing hormone. SOURCE Human brain, cDNA to mRNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 1322) AUTHORS Mandel,G. and Goodman,R.H. 
TITLE Using the brain to screen cloned genes JOURNAL Trends Neurosci. 10, 101-104 (1987) STANDARD simple staff_entry FEATURES from to/span description pept 103 870 thyroid releasing hormone precursor (TRH) sigp 103 174 thyroid releasing hormone signal peptide matp 175 867 thyroid releasing hormone BASE COUNT 315 a 344 c 372 g 291 t ORIGIN 1 tccttggatt cgggagtatt gcaaactcta cccagccagt ttgcactctt cagctcagca 61 tcttggaaag ctctgcagag tctccacttc gcagactcca ggatgccggg accttggttg 121 ctgctggctc tggctttgat cttcacccta actggtatcc ctgaatcctg cgccttgccg 181 gaggcagccc aggaggaagg tgcagtgact cctgaccttc ctggcctgga gaatgttcag 241 gtccggccag aacgtcgatt cttgtggaaa gacctccagc gggtgagagg ggacctcggt 301 gctgccttag actcctggat cacaaaacgc cagcatccag gcaaaaggga ggaggaggaa 361 aaagacattg aagctgaaga gaggggagac ttgggagaag ggggagcctg gagactccac 421 aaacgacagc accccggccg acgtgccaac caggacaagt attcatgggc agatgaggag 481 gacagtgact ggatgccacg gtcctggtta ccagatttct ttctggattc ctggttctca 541 gatgtccccc aagtcaagcg gcagcaccct ggcaggcgat ccttcccctg gatggagtct 601 gatgtcacca agaggcaaca tccaggccgg aggttcatag atcccgagct ccaaagaagc 661 tgggaagaaa aagagggaga gggtgtctta atgcctgaga aacgccagca tcctggcaaa 721 agggcattgg gtcatccctg tgggccccag gggacttgtg gtcaaacagg cctgctccag 781 cttctaggtg acctgagcag gggtcaggag accctggtga agcaaagccc acaagtggaa 841 ccctgggaca aggaacctct ggaggagtaa ggccagagtc aggctttagg tctaggatga 901 tgtaagccct gtattcccta tcctgttccc ttcactagct gtcctctctt agatgctaac 961 cctgggccct ctgtacatcg tccacccaaa ccccttcctt accgacttca gagactttag 1021 aaagccagtc aggaagttaa aaccctactt atcccttcaa gcaagggggt gggagtcaca 1081 ccccttcagc actggccaag atggttcttt cacaccttct aagggacctc ctcagaaagg 1141 aagggtagaa ttgaaatgtt ttggtgttaa aacttctgta atctgcccca tgtggtaaga 1201 ctgacctggt atagcttcag cgcatcctcc aaggttgggg tccctgagca gtttgggaga 1261 tgtttagata tgtccttggg gtggggggac gcatcttgta tataaccctg ctcttgggaa 1321 tc // LOCUS MAARNA1A 163 bp ss-RNA VRL 14-AUG-1990 DEFINITION Alfalfa mosaic virus (strain AlMV-S) 5' end of 
RNA-1. ACCESSION M35975 KEYWORDS RNA-1 leader sequence. SOURCE Alfalfa mosaic virus (strain AlMV-S) genomic RNA. ORGANISM Alfalfa mosaic virus Viridae; ss-RNA nonenveloped viruses; Alfamoviridae. REFERENCE 1 (bases 1 to 163) AUTHORS Ravelonandro,M., Godefroy-Colburn,T. and Pinck,L. TITLE Structure of the 5'-terminal untranslated region of the genomic RNAs from two strains of alfalfa mosaic virus JOURNAL Nucleic Acids Res. 11, 2815-2826 (1983) STANDARD simple staff_entry FEATURES from to/span description pept 103 > 163 115 kDa protein ORF 92 > 163 ORF1 ORF 107 > 163 ORF2 BASE COUNT 39 a 38 c 24 g 62 t ORIGIN 5' end of RNA-1. 1 ggtttttatc ttacacacgc ttgtgcaaga tagttaatcc atttattttt tccttgtatt 61 ttccacagta ttacgttcat tcaatactgt gatgaattca ctatgaatgc tgacgcccta 121 tccaccgatt cctgccttag tatgcgagaa cctttatccc atg // LOCUS MAARNA1B 115 bp ss-RNA VRL 14-AUG-1990 DEFINITION Alfalfa mosaic virus (strain AlMV-B) 5' end of RNA-1. ACCESSION M35976 KEYWORDS RNA-1 leader sequence. SOURCE Alfalfa mosaic virus (strain AlMV-B) genomic RNA. ORGANISM Alfalfa mosaic virus Viridae; ss-RNA nonenveloped viruses; Alfamoviridae. REFERENCE 1 (bases 1 to 115) AUTHORS Ravelonandro,M., Godefroy-Colburn,T. and Pinck,L. TITLE Structure of the 5'-terminal untranslated region of the genomic RNAs from two strains of alfalfa mosaic virus JOURNAL Nucleic Acids Res. 11, 2815-2826 (1983) STANDARD simple staff_entry FEATURES from to/span description pept 101 > 115 115 kDa protein ORF 105 > 115 ORF1 BASE COUNT 29 a 25 c 18 g 43 t ORIGIN 5' end of RNA-1. 1 ggtttttatc ttacacacgc ttgtgcaaga tagttaatcc atttattttc cttgcgcttt 61 ccacagcatt acgttcattc aatactgtga agaattcatt atgaatgctg acgcc // LOCUS MAARNA2A 108 bp ss-RNA VRL 14-AUG-1990 DEFINITION Alfalfa mosaic virus (strain AlMV-S) 5' end of RNA-2. ACCESSION M36389 KEYWORDS RNA-2 leader sequence. SOURCE Alfalfa mosaic virus (strain AlMV-S) genomic RNA. ORGANISM Alfalfa mosaic virus Viridae; ss-RNA nonenveloped viruses; Alfamoviridae. 
REFERENCE 1 (bases 1 to 108) AUTHORS Ravelonandro,M., Godefroy-Colburn,T. and Pinck,L. TITLE Structure of the 5'-terminal untranslated region of the genomic RNAs from two strains of alfalfa mosaic virus JOURNAL Nucleic Acids Res. 11, 2815-2826 (1983) STANDARD simple staff_entry FEATURES from to/span description pept 56 > 108 100 kDa protein BASE COUNT 26 a 19 c 19 g 44 t ORIGIN 5' end of RNA-2. 1 ggtttttatc ttttcgcgat tcaaaagata agtttcagag tctaatcttt tcaatatgtt 61 cactcttttg agatgtcttg gatttggcgt caacgacctt accgatat // LOCUS MAARNA2B 109 bp ss-RNA VRL 14-AUG-1990 DEFINITION Alfalfa mosaic virus (strain AlMV-B) 5' end of RNA-2. ACCESSION M36390 KEYWORDS RNA-2 leader sequence. SOURCE Alfalfa mosaic virus (strain AlMV-B) genomic RNA. ORGANISM Alfalfa mosaic virus Viridae; ss-RNA nonenveloped viruses; Alfamoviridae. REFERENCE 1 (bases 1 to 109) AUTHORS Ravelonandro,M., Godefroy-Colburn,T. and Pinck,L. TITLE Structure of the 5'-terminal untranslated region of the genomic RNAs from two strains of alfalfa mosaic virus JOURNAL Nucleic Acids Res. 11, 2815-2826 (1983) STANDARD simple staff_entry FEATURES from to/span description pept 57 > 109 100 kDa protein BASE COUNT 25 a 19 c 20 g 45 t ORIGIN 5' end of RNA-2. 1 ggtttttatc ttttcgcgat tgaaaagata agtttcctga gtctaatctt ttcaatatgt 61 tcactctttt gagatgtctt ggatttggcg tcaacgacct taccgatat // LOCUS MAARNA3B 305 bp ss-RNA VRL 14-AUG-1990 DEFINITION Alfalfa mosaic virus (strain AlMV-S) 5' end of RNA-3. ACCESSION M36391 KEYWORDS RNA-3 leader sequence. SOURCE Alfalfa mosaic virus (strain AlMV-S) genomic RNA. ORGANISM Alfalfa mosaic virus Viridae; ss-RNA nonenveloped viruses; Alfamoviridae. REFERENCE 1 (bases 1 to 305) AUTHORS Ravelonandro,M., Godefroy-Colburn,T. and Pinck,L. TITLE Structure of the 5'-terminal untranslated region of the genomic RNAs from two strains of alfalfa mosaic virus JOURNAL Nucleic Acids Res. 
11, 2815-2826 (1983) STANDARD simple staff_entry FEATURES from to/span description pept 259 > 305 35 kDa protein BASE COUNT 87 a 57 c 48 g 113 t ORIGIN 5' end of RNA-3. 1 ggttttcatc ttacacacgc ttgtgcaaga tagttaatca ttccaattca actcaattaa 61 cgttttttac agtgtaattc gtacttttcg taagtaagtt tctgtaaaag cgtttcttgt 121 tttaatttgg cctaacacgt aattcgtact cttcgtgagt aagttgtgtt agccatacct 181 atcctttaaa ttcctgtcaa tttaaaaaga aaatcattcc cattcgcgta attcgtactc 241 ttcgtgagta agttgcaaat ggagaataca aaaacaaatg cctcgagttc tggaatgttt 301 tcttc // LOCUS MAARNA3C 290 bp ss-RNA VRL 14-AUG-1990 DEFINITION Alfalfa mosaic virus (strain AlMV-B) 5' end of RNA-3. ACCESSION M36392 KEYWORDS RNA-3 leader sequence. SOURCE Alfalfa mosaic virus (strain AlMV-B) genomic RNA. ORGANISM Alfalfa mosaic virus Viridae; ss-RNA nonenveloped viruses; Alfamoviridae. REFERENCE 1 (bases 1 to 290) AUTHORS Ravelonandro,M., Godefroy-Colburn,T. and Pinck,L. TITLE Structure of the 5'-terminal untranslated region of the genomic RNAs from two strains of alfalfa mosaic virus JOURNAL Nucleic Acids Res. 11, 2815-2826 (1983) STANDARD simple staff_entry FEATURES from to/span description pept 244 > 290 35 kDa protein BASE COUNT 90 a 52 c 41 g 107 t ORIGIN 5' end of RNA-3. 1 ggttttaata ccattttcaa aatattccaa ttcaactcaa ttaacccttt tacagtgtaa 61 ttcgtacttt tcgtaagtaa gtttctgtaa aagcgtttct tgttttaatt tggcctaaca 121 cgtaattcgt actcttcgtg agtaagttgt gttagccata tctatccttt aaatttttgt 181 caaatttaaa aaagaaaatc attcccattc gcgtaattcg tactcttcgt gagtaagttg 241 caaatggaga atacaaaaac aaatgcctcg agaactggaa tatcttcttc // LOCUS MBFRRDA 1476 bp ds-DNA RNA 14-AUG-1990 DEFINITION M.formicicum 16S rRNA. ACCESSION M36508 KEYWORDS 16S ribosomal RNA. SOURCE M.formicicum (strain DSM 1312) rRNA. ORGANISM Methanobacterium formicicum Prokaryota; Bacteria; Mendosicutes; Archaeobacteria; Methanobacteriales; Methanobacteriaceae. REFERENCE 1 (bases 1 to 1476) AUTHORS Lechner,K., Wich,G. and Boeck,A. 
TITLE The nucleotide sequence of the 16S rRNA gene and flanking regions from Methanobacterium formicicum: The phylogenetic relationship between methanogenic and halophilic archaebacteria JOURNAL Syst. Appl. Microbiol. 6, 157-163 (1985) STANDARD simple staff_entry FEATURES from to/span description rRNA 1 1476 16S rRNA BASE COUNT 348 a 355 c 472 g 301 t ORIGIN 1 agtccgtttg atcctggcgg aggccactgc tattgggttt cgattaagcc atgcaagtcg 61 aacgatcctt cggggtcgtg gcgtacggct cagtaacacg tggataacct aaccttagga 121 ctgggataac cctgggaaac tggggataat accggatatg tagggctgcc tggaatggtt 181 ccctattgaa atgttccgac gcctaaggat ggatctgcgg cagattaggt agttggcggg 241 gtaaatgccc accaagccag taatctgtac gggttgtgag agcaagagcc cggagatgga 301 acctgagaca aggttccagg ccctacgggg cgcagcaggc gcgaaacctc cgcaatgcac 361 gaaagtgcga cgggggaaac ccaagtgcca ctcttaacgg ggtggctttt cttaagtgta 421 aaaagctttt ggaataagag ctgggcaaga ccggtgccag ccgccgcggt aacaccggca 481 gctcaagtgg tggccgtttt tattgggcct aaagcgttcg tagccggctt gataagtctc 541 tggtgaaatc tcacggctta accgtgagaa ttgctggaga tactattagg cttgaggccg 601 ggagaggtta gcggtactcc cggggtaggg gtgaaatcct ataatcccgg gaggaccacc 661 tgtggcgaag gcggctaact ggaacgggcc tgacggtgag taacgaaagc caggggcgcg 721 aaccggatta gatacccggg tagtcctggc cgtaaacgat gtggacttgg tgttgggatg 781 gctccgagct gccccagtgc cgaagggaag ctgttaagtc caccgcctgg gaagtacggt 841 cgcaagactg aaacttaaag gaattggcgg gggagcacca caacgcgtgg agcctgcggt 901 ttaattggat tcaacgccgg acatctcacc aggggcgaca gcagaatgat agccaggttg 961 atgaccttgc ttgacaagct gagaggaggt gcatggccgc cgtcagctcg taccgtgagg 1021 cgtcctgtta agtcaggcaa cgagcgagac ccacgccctt agttaccagc ggatccttcg 1081 ggatgccggg cacactaagg ggaccgccag tgataaactg gaggaaggag tggacgacgg 1141 taggtccgta tgccccgaat cccctgggct acacgcgggc tacaatggtt aggacaatgg 1201 gttccgacac tgaaaggtgg aggtaatctc ctaaacctgg ccttagttcg gattgagggc 1261 tgtaactcgc cctcatgaag ctggaatgcg tagtaatcgc gtgtcataac cgcgcggtga 1321 atacgtccct gctccttgca cacaccgccc gtcacgccac ccaaaaaggg tttggatgag 1381 gccatagtct 
ttggttatgg tcgaatctag gttctttgag gagggcgaag tcgtaacaag 1441 gtagccgtag gggaacctgc ggctggatca cctcct // LOCUS MSHLTRERA 190 bp ss-RNA VRL 14-AUG-1990 DEFINITION Harvey murine sarcoma virus LTR 75-bp direct-repeat enhancer region. ACCESSION M26322 KEYWORDS enhancer region; long terminal repeat. SOURCE Harvey murine sarcoma virus, cDNA to viral RNA, clone pM14-1. ORGANISM Harvey murine sarcoma virus Viridae; ss-RNA enveloped viruses; Positive strand RNA virus; Retroviridae; Oncovirinae; Type C oncovirus group; Mammalian type C oncoviruses; Murine sarcoma viruses. REFERENCE 1 (bases 1 to 190) AUTHORS Ostrowski,M.C., Huang,A.L., Kessel,M., Wolford,R.G. and Hager,G.L. TITLE Modulation of enhancer activity by the hormone responsive regulatory element from mouse mammary tumor virus JOURNAL EMBO J. 3, 1891-1899 (1984) STANDARD simple staff_entry FEATURES from to/span description rpt 21 95 75 bp direct-repeat enhancer rpt 96 170 75 bp direct-repeat enhancer BASE COUNT 31 a 54 c 48 g 57 t ORIGIN 1 ggctggaccg catctgggga ccatctgttc ttggccctga gccggggcag gaactgtcta 61 ccacagatat cctgtttggc ccatattcag ctgttctatc tgttcttggc cctggagccg 121 ggcaggaact gtctaccaca gatatcctgt ttggcccata ttcagctgtt tctctgttcc 181 tgaccttgat // LOCUS MUSKERDA 1381 bp ss-mRNA ROD 14-AUG-1990 DEFINITION Mouse keratin D mRNA, complete cds. ACCESSION M36376 KEYWORDS keratin D. SOURCE Mouse F9 teratocarcinoma stem cell, cDNA to mRNA, clone B53. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 1381) AUTHORS Alonso,A., Weber,T. and Jorcano,J.L. TITLE Cloning and characterization of keratin D, a murine endodermal cytoskeletal protein induced during in vitro differentiation of F9 teratocarcinoma cells JOURNAL Roux's Arch. Dev. Biol. 
196, 16-21 (1987) STANDARD simple staff_entry FEATURES from to/span description pept 50 1321 keratin D mRNA < 1 1381 keratin D mRNA site 28 42 TCTCC motif BASE COUNT 366 a 373 c 401 g 241 t ORIGIN 1 aactcctgtt ctggtctctc gcttcgctct cctctcctct ccagacaaga tgagcttcac 61 aacccgctcc accaccttct ccaccaacta ccggtccctg ggctctgtgc ggactcccag 121 ccagcgggtc cggcctgcca gcagcgcagc cagcgtctat gcaggtgctg ggggctccgg 181 gtcacggata tccgtgtccc gctccgtctg gggtggctct gtggggtccg caggcctggc 241 gggaatgggt ggaatccaga ccgagaagga gaccatgcaa gacctgaacg atcgcctggc 301 cagctaccta gacaaggtga agagtctgga aactgagaac aggagactgg agagcaaaat 361 ccgggaacat ctggagaaga aggggcccca gggcgtcaga gactggggcc actacttcaa 421 gatcatcgaa gacctgaggg ctcagatcct tgcgaattct gtggacaatg cccgcatcgt 481 cttgcagatc gacaatgccc gccttgccgc cgatgacttt agagtcaagt atgagacaga 541 actagccatg cgccagtctg tggagagcga catccatgga ctccgcaagg tggtagatga 601 caccaacatc acaaggctgc agctggagac agaaatcgag gcactcaagg aagaacttct 661 gttcatgaag aagaatcatg aagaggaagt ccaaggtctg gaagcccaga ttgccagctc 721 tggattgact gtggaagtgg atgcccccaa atctcaggac ctcagcaaga tcatggcgaa 781 catccgcgcc cagtatgaag cgctgggcca gaagaaccgc gaggaactgg acaagtactg 841 gtctcagcag attgaggaga gtaccacagt tgtcaccacc aagtctgccg aaatcaggga 901 cgctgagacc acactcacgg agctgagacg caccctccag accttggaga ttgacttgga 961 ctccatgaaa aaccagaaca tcaacttgga gaacagcctc ggggatgtgg aggcccgata 1021 caaggcacag atggagcagc tcaatggggt ccttctgcat ctggagtcag agctggcaca 1081 aactcgggca gagggccagc gccaggccca ggaatatgaa gccctcttga acatcaaggt 1141 gaagcttgag gcagagattg ccacctaccg ccgcttgctg gaggatggag aagatttcag 1201 tctcaacgat gccctagact cgagcaactc catgcaaact gtgcagaaga caactacccg 1261 taagatcgtg gatggcagag tggtgtccga gactaatgac accagagttc tgaggcactg 1321 aggcagagaa ggagggaacc cctaggaact gagggaccaa taaaagttga gagctcactg 1381 g // LOCUS MUSMBPMLD1 792 bp ds-DNA ROD 14-AUG-1990 DEFINITION Mouse myelin basic protein (shi-mld) gene, exon 1. ACCESSION M36273 M36274 KEYWORDS myelin basic protein; shi-mld gene. 
SEGMENT 1 of 2 SOURCE Mouse mld mutated DNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 792) AUTHORS Okano,H., Tamura,T., Miura,M., Aoyama,A., Ikenaka,K., Oshimura,M. and Mikoshiba,K. TITLE Gene organization and transcription of duplicated MBP genes of myelin deficient (shi-mld) mutant mouse JOURNAL EMBO J. 7, 77-83 (1988) STANDARD simple staff_entry FEATURES from to/span description mRNA 713 > 792 shi-mld mRNA signal 623 628 GC box signal 630 634 CAAT box signal 680 685 TATA box BASE COUNT 194 a 234 c 206 g 158 t ORIGIN Chromosome 18. 1 cctccagcgg aatccgggag agggattgct cagtgccctg cttccctgga ctgtaagctg 61 cagaaagatg tgggaagtcc tgttctccac tgagaacact aaaagcacct tttgtcaaac 121 gaccgcttca catctggggc ttgtgcactg gtggcctttt aaaccagaga caacccacaa 181 gatacctaac ctgcggggct ctctggtaca gtgagcaact caggaaatgc tttggcttga 241 ttgctgtggg ctctcaggcc atcgccctct ggagtggttc ttttaatgag aacctgaaga 301 ttggcccctg agccatgtat accaagcaag ctcaatccag gttagctccc tctggttggg 361 gcaagctaac gtgctccttg ggccccgcgc gtaactgtgc gttttatagg agacagctag 421 ttcaagaccc caggaagaaa gcggctttgt ccccctctag gcctcgtaca ggcccacatt 481 catatctcat tgttgttgca ggggaggcag atgcgatcca gaacaatggg acctcggctg 541 aggacacggc ggtgacagac tccaagcaca cagcagaccc aaagaataac tggcaaggcg 601 cccacccagc tgacccaggg aaccgccccc acttgatccg cctcttttcc cgagatgccc 661 cgggaaggag gacaacacct tcaaagacag gccctcagag tccgacgagc ttcagaccat 721 ccaagaagac cccacagcag cttccggagg cctggatgtc atggcatcac agaagagacc 781 ctcacagcga tc // LOCUS MUSMBPMLD2 200 bp ds-DNA ROD 14-AUG-1990 DEFINITION Figure 7. Nucleotide sequences near the inverted exon 3. ACCESSION M36275 KEYWORDS myelin basic protein; shi-mld gene. SEGMENT 2 of 2 SOURCE Mouse mld mutated DNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. 
REFERENCE 1 (bases 1 to 200) AUTHORS Okano,H., Tamura,T., Miura,M., Aoyama,A., Ikenaka,K., Oshimura,M. and Mikoshiba,K. TITLE Gene organization and transcription of duplicated MBP genes of myelin deficient (shi-mld) mutant mouse JOURNAL EMBO J. 7, 77-83 (1988) STANDARD simple staff_entry FEATURES from to/span description pept / 43 / 144 shi-mld myelin basic protein, exon 3 (AA at 43) pre-msg < 1 > 200 shi-mld mRNA and introns IVS < 1 42 shi-mld intron B IVS 145 > 200 shi-mld intron C BASE COUNT 54 a 62 c 40 g 44 t ORIGIN Chromosome 18; about 11 kbp after segment 1. 1 ctacgccctc tccatcctca gctgctcgct tctctctttc aggactcaca cacgagaact 61 acccattatg gctccctgcc ccataattcg cagcacggcc ggacccaaga tgaaaaccca 121 gtagtccatt tcttcaagaa cattgtaagt gactgtgcac aggggaacca agaaatcatc 181 aagggaaagg ttgtggatcc // LOCUS MUSMIGP4A 1284 bp ss-mRNA ROD 14-AUG-1990 DEFINITION Mouse monokine induced by gamma interferon (MIG) mRNA, complete cds. ACCESSION M34815 KEYWORDS MIG protein; gamma interferon-induced; monokine induced by gamma interferon; platelet factor 4-related protein; secreted protein. SOURCE Mouse (strain BAB/14) macrophage cell line RAW 264.7, cDNA to mRNA, clone 119/24. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 1284) AUTHORS Farber,J.M. TITLE A macrophage mRNA selectively induced by gamma-interferon encodes a new member of the platelet factor 4 family of cytokines JOURNAL Proc. Natl. Acad. Sci. U.S.A. 87, 5238-5242 (1990) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by J.M.Farber, 04-JUN-1990, for release after publication. 
FEATURES from to/span description pept 95 475 gamma interferon induced monokine precursor (MIG) sigp 95 157 gamma interferon induced monokine signal peptide matp 158 472 gamma interferon induced monokine signal 6 16 TATA region BASE COUNT 401 a 316 c 247 g 320 t ORIGIN 1 tttcctaaat aaatatgatc cccaagaaca tgctctctaa agacattctc ggacttcact 61 ccaacacagt gactcaatag aactcagctc tgccatgaag tccgctgttc ttttcctttt 121 gggcatcatc ttcctggagc agtgtggagt tcgaggaacc ctagtgataa ggaatgcacg 181 atgctcctgc atcagcacca gccgaggcac gatccactac aaatccctca aagacctcaa 241 acagtttgcc ccaagcccca attgcaacaa aactgaaatc attgctacac tgaagaacgg 301 agatcaaacc tgcctagatc cggactcggc aaatgtgaag aagctgatga aagaatggga 361 aaagaagatc aaccaaaaga aaaagcaaaa gagggggaaa aaacatcaaa agaacatgaa 421 aaacagaaaa cccaaaacac cccaaagtcg tcgtcgttca aggaagacta cataagagac 481 cattacttta ccaacaagca ccctgaatct taatgggttt tagattgtac tgaaaagcct 541 tccctggcag agcagccttt aatacatagg cttttaatac attaactcaa ctacaaaaca 601 taaagtgtta atttgaaatt ataactaact ttaggaagtt aattgcaaaa ctccaatagt 661 aacaattgct agaggcaaaa actctgtgtt ctacacagcc aacaaaattt catcacgccc 721 ttgagcctag tcgtgataac atcagatctg ggcaagtgtc cctttccttc atagctatcc 781 aatgcacaac agctgtctgg cttccagagc cacacatttg gcagcctccg gagacttctg 841 aggctcacgt caccaagtcc caggcctgtc tgtttgctgg tgagctagat agacctcacc 901 aagctggaga ggccctcggc agctgcattt gggtcagcct agagcccctg cacacattgt 961 gtctcagaga tggtgctaat ggttttgggg ttctacagtg gagaccacca gagttggcct 1021 tcagaacctc ccacgtagct ttcgagacca tgggatttca ttattaactt gatcccatct 1081 tcagagctta ttctaagttt gcctcttcaa taaaactctc ctagaaggtt gtggctgtag 1141 cttagtggca gaacacttgg tgttgcaggg accaggtcct tcactaacag tgcaaaaact 1201 taaccaattt aaagaacatt ttctggctac tcaaattctc ttaaatttat tcctgtttca 1261 caagtaaaca cttcgctgct atct // LOCUS MUSZFPB1 561 bp ds-DNA ROD 14-AUG-1990 DEFINITION Mouse zinc finger (Zfp-35) gene, exon 1. ACCESSION J04770 KEYWORDS zinc finger protein. SEGMENT 1 of 3 SOURCE Mouse (strain 129) cosmid library pcos2EMBL DNA. 
ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 561) AUTHORS Cunliffe,V., Williams,S. and Trowsdale,J. TITLE Genomic analysis of a mouse zinc finger gene Zfp-35, which is unregulated during spermatogenesis JOURNAL Genomics (1990) In press STANDARD full staff_entry COMMENT Draft entry and printed sequence for [1] kindly submitted by Cunliffe,V., 06-JUN-1990. FEATURES from to/span description pre-msg 233 > 561 zinc finger protein (Zfp-35) mRNA and introns IVS 286 > 561 Zfp-35 intron A BASE COUNT 141 a 135 c 149 g 136 t ORIGIN Chromosome 18 bands B3-C. 1 ccctagaagt tttgacatcg aaccacagca aagataagtt tgagtcaagc gggtccccct 61 atgctaaagc tgggagtcgc tttagggaac ttcctgttcc taattcacag agcgcgtact 121 tgggagggtg caccatagag ctgtgatcct cccacattcc tagagcggcc aagcgaggtc 181 gcattgcatc caaacggatg ttgaaggaat tgtgggtggc ggcgtttgac cggtagtttc 241 ttcatcgtcg gagagtaagg gaggcttctt gggcatggag agcaggtggg ttcttggcct 301 agtgttctgt ggacagcaag ttcgtgctcc ctgggccttt cattcaactg catgaacgtc 361 cgagatgttg gggctggtgg aataacgaag aggatgagcc cacactaatc agcctgcaga 421 gctggaaaag agatacctga cttgggttag aatcgaagga acgagagaga ataacatact 481 cttaattttt aaagccagta ttaccccccc cccccccaca cacacacacc gaattatcgt 541 tccatttacg cctctacttt a // LOCUS MUSZFPB2 340 bp ds-DNA ROD 14-AUG-1990 DEFINITION Mouse zinc finger (Zfp-35) gene, exon 2. ACCESSION M36145 KEYWORDS zinc finger protein. SEGMENT 2 of 3 SOURCE Mouse (strain 129) cosmid library pcos2EMBL DNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 340) AUTHORS Cunliffe,V., Williams,S. and Trowsdale,J. 
TITLE Genomic analysis of a mouse zinc finger gene Zfp-35, which is unregulated during spermatogenesis JOURNAL Genomics (1990) In press STANDARD full staff_entry COMMENT Draft entry and printed sequence for [1] kindly submitted by V.Cunliffe, 06-JUN-1990. FEATURES from to/span description pept 231 + 279 zinc finger protein (Zfp-35) pre-msg < 1 > 340 Zfp-35 mRNA and introns IVS < 1 66 Zfp-35 intron A IVS 280 > 340 Zfp-35 intron B BASE COUNT 90 a 79 c 66 g 105 t ORIGIN Chromosome 18 bands B3-C; unknown number of bps after segment 1. 1 gtcaccaaga ggaaacgcac atgaaagtta aatatatttt ttctccttta cgtttttcat 61 atctagaaac tctagtctca gaaatgacat ctctcctctg tacaatggtt ctcatctgag 121 ctcagggcta ccaagttgcc tgctggcttc aaaccattat gtggacaaaa ggatcctaca 181 gaagaaatgg taattctcca gttgcttcct gaggtgtcca gttgtgccct atggagatcc 241 aattcagtta cgaatctcaa gaccatcact ttctgtcagg tgagcctcag tgcatttcag 301 gttacgtggg tgatagcagt attcttcact gtctacactc // LOCUS MUSZFPB3 2268 bp ds-DNA ROD 14-AUG-1990 DEFINITION Mouse zinc finger (Zfp-35) gene, exon 3. ACCESSION M36146 KEYWORDS zinc finger protein. SEGMENT 3 of 3 SOURCE Mouse (strain 129) cosmid library pcos2EMBL DNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 2268) AUTHORS Cunliffe,V., Williams,S. and Trowsdale,J. TITLE Genomic analysis of a mouse zinc finger gene Zfp-35, which is unregulated during spermatogenesis JOURNAL Genomics (1990) In press STANDARD full staff_entry COMMENT Draft entry and printed sequence for [1] kindly submitted by V.Cunliffe, 06-JUN-1990. FEATURES from to/span description pept + 87 1780 zinc finger protein (Zfp-35) pre-msg < 1 2166 Zfp-35 mRNA and introns IVS < 1 86 Zfp-35 intron B signal 2125 2130 polyA signal signal 2148 2153 polyA signal BASE COUNT 685 a 488 c 526 g 569 t ORIGIN Chromosome 18 bands B3-C; unknown number of bps after segment 2. 
1 ctgcagggaa atactcattt gttctacctg gagctttctt acaacatgga gcgcattttc 61 catttagatt ttgtttatta ttccagatgg tgagactaag atcaagattg gagagccagc 121 tacagaggag gaaatgacag gaaagattgg aacagtgact gaagagtccg gcagccttga 181 ggaggatgtt ccccatgact ccagaggcaa agaattccgt gaatttggag aagaattgaa 241 cgatcagatg cttttcagaa gaagacagta taactgtgat gaatgtgacc aaagctttgc 301 ttggagtaca ggtctcatta ggcaccaaag aactcattgg aaaccttacg aatgtgagga 361 atgtggaaag gcctttcgga tgagctcagc cctggttctg catcagagaa ttcatactgg 421 agagaagccc tatccttgta gttggtgtat taaaagtttt agtcggagct cagaccttat 481 taaacatcaa agagtccaca ctggtgaaaa accttacaag tgtgatgagt gtggcaaggc 541 cttcagtcag agctcagatc ttatgataca tcagagaatc cacacaggag aaaaacccta 601 ccaatgtagt cattgtagta aaagctttag ccagcactca ggcatggtta aacatctgag 661 aatccacact ggagagaagc cttatatgtg taaccattgt tacaaacatt tcagtcagag 721 ctctgatctt ataaaacatc aaaggatcca cactggggag aaaccataca agtgtgatgt 781 gtgtgggaag gcctttagtc agagctctga tcgtattctc catcagagaa tccacactgg 841 ggagaagcca tatccgtgtg ctcagtgtaa caaaagtttt agtcaaaatt cagaccttat 901 taaacacaga aggatccaca ctggagagaa accgtataaa tgtagtgaat gtgggaaggc 961 ttttaaccag agctcagtcc ttattctgca tcaaagaatt cataccggag agaaacccta 1021 tccgtgtaac cagtgtacca aaagctttag taggctttca gatcttatta atcatcaacg 1081 aattcacacc ggagagaagc cttacccatg tagtcagtgc agtaaaatgt ttagtagaag 1141 gtcacacctt gttaaacatt acagaatcca cactggggag aagccctatg agtgtgataa 1201 gtgtggcaaa actttcagtc agagttccaa ccttattctc caccagagaa tccacactgg 1261 agagaaacca tacccgtgca acagctgttc taaaagcttc agtcgcggtt cagatctcat 1321 aaagcatcag agggtccaca ctggagaaaa accatacacg tgcaatctgt gcagtaagag 1381 tttcagtcaa agttcagacc tcactaaaca tcagagagtg cattctgggg agaagcccta 1441 ccactgtagt agttgtaaca aagcctttcg tcagagttct gaccttattc tccaccacag 1501 agttcacacg ggagaaagac catatgcgtg tacacagtgc cctagaagtt tcagtcaaaa 1561 gtcagacctg attaaacacc agagaattca cactggagag aagccatata aatgtatgtg 1621 tgggaaggct ttcagtcagt gctcagcctt tacccttcat cagagaatcc acactggaga 1681 gaaaccgtat ccctgtgctc 
agtgtggcaa aagcttcagt cagcgctctg atctggttaa 1741 ccatcaaaga gttcacgctg atcaaaaact acaaatgtga ctgatacctg tgtgaagcct 1801 tcagcactgt gaccaatctt actggatatc agaggatctg caggagccgt ttgcacctct 1861 ctgaaacgag aaaaaggaag agtgcctccc aggcggaatc aaagcaggtc agttggcaga 1921 gttcccggct tgctgatgca tggaacaatg agaagaggtg tgcttgaggc tcaagtcaag 1981 ggctgaatct tctgttgacc ttggtggaag gaacggggct tactggatgg gaacagatgt 2041 cttgtgtgta tattgcagca aattgttaat actggcttgt ggctttgctc cccatggtac 2101 tgtagtataa aaaaggctat gagaaataca ctcactgctg gcgtggtaat gacctgcagc 2161 cctcccatgt cagtctcttg catcgttttt ctgtctttcc tataatcatc ctcacttccc 2221 cctcaggggc tgttcgactt cacgccggcc ggactagttc aaggatcc // LOCUS PP1PAR1 285 bp ds-DNA PHG 14-AUG-1990 DEFINITION Bacteriophage miniplasmid P1 parA gene, 5' end. ACCESSION M36424 KEYWORDS centromeric-like DNA. SEGMENT 1 of 2 SOURCE Bacteriophage P1 plasmid prophage DNA, from E.coli. ORGANISM Bacteriophage P1 Viridae; ds-DNA nonenveloped viruses; Myoviridae. REFERENCE 1 (bases 1 to 285) AUTHORS Austin,S. and Abeles,A. TITLE The partition functions of P1, P7, and F miniplasmids JOURNAL Basic Life Sci. 30, 215-226 (1985) STANDARD simple staff_entry FEATURES from to/span description pept 232 > 285 parA protein binding 218 222 parA ribosomal binding site signal 149 154 parA -35 region signal 171 176 parA -10 region BASE COUNT 98 a 57 c 60 g 70 t ORIGIN 1 aagcttggct gcccattgaa gcaggaaatc accaaaatga ttcaggctac aacctgaacg 61 tagaagaaat ccgcgtcctt tatgcgtgga ggatgccaaa gcatgttgtg acacacttgg 121 caaaggagta agcatgcaga gaatgctatg tacaagcatc tacgcataca ttattatttt 181 atgcagcatt tttaattaaa ttcaaaaata cagcataaag gatgactttc gatgagtgat 241 tccagccagc ttcacaaggt tgctcaaaga gcaaacagaa tgctc // LOCUS PP1PAR2 178 bp ds-DNA PHG 14-AUG-1990 DEFINITION Bacteriophage miniplasmid P1 parB gene, 3' end. ACCESSION M36425 KEYWORDS centromeric-like DNA. SEGMENT 2 of 2 SOURCE Bacteriophage P1 plasmid prophage DNA, clone pALA161, from E.coli. 
ORGANISM Bacteriophage P1 Viridae; ds-DNA nonenveloped viruses; Myoviridae. REFERENCE 1 (bases 1 to 178) AUTHORS Austin,S. and Abeles,A. TITLE The partition functions of P1, P7, and F miniplasmids JOURNAL Basic Life Sci. 30, 215-226 (1985) STANDARD simple staff_entry FEATURES from to/span description pept < 1 51 parB protein BASE COUNT 62 a 42 c 21 g 53 t ORIGIN 2150 bp after segment 1. 1 tcgataaaaa gccgaagcct taaactttcg ccattcaaat ttcactatta actgactgtt 61 tttaaagtaa attactctaa aatttcaagg tgaaatcgcc acgatttcac cttggatttt 121 accttcctcc cctcctcccg aaaaaaataa aaaaattgct tgtcacgaga aagtcaac // LOCUS PP7C4RNA 612 bp ds-DNA PHG 14-AUG-1990 DEFINITION Bacteriophage P7 c4-RNA (c4, orfx, ant1) gene, 5' end. ACCESSION M35139 KEYWORDS antirepressor protein; antisense RNA; c4 repressor. SOURCE Bacteriophage P7 (strain c1.48ts) DNA, from E.coli (strain C600). ORGANISM Bacteriophage P7 Viridae; ds-DNA nonenveloped viruses; Myoviridae. REFERENCE 1 (bases 1 to 612) AUTHORS Citron,M. and Schuster,H. TITLE The c4 repressors of bacteriophages P1 and P7 are antisense RNAs JOURNAL Cell (1990) In press STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by H.Schuster, 06-JUL-1990. 
FEATURES from to/span description pept 362 583 orfx pept 580 > 612 ant1 protein binding 346 354 orfx ribosomal binding site mut 278 278 c in P7c1.48ts; t in P7c4.2ts signal 46 74 promoter 51a signal 86 114 promoter 51b site 31 47 Op 51 site 146 152 presumptive c4 target site a1 site 160 167 presumptive c4 target site b1 site 271 278 c4 element b' site 285 291 c4 element a' site 315 321 c4 target site a2 site 344 351 c4 target site b2 BASE COUNT 147 a 158 c 128 g 179 t ORIGIN 1 acaaataata tatctattgc acgacactaa tctgctctaa taaatttgta tttttaagtc 61 acgaatgcta tcttttcgca tcatattgac cttttaatcg ttcaggctta tagttccgcc 121 gtcgtagcaa attctgcgac cgggtttagc agcctgaatg attgtgcgga cagccgcaga 181 tatccgatat tgcggtattt ttgtgtccgt aaaaccgcgt tacgcccaaa ttatggtggg 241 gcgtgatggg gaggcttcgg cctgctggtt tcacaatcgc cagtctgcta accccgtcac 301 gtcctgccac ctgtttagca gcgggtagca ggttgttaaa cctgattgtg aggccgtaac 361 tatggttaat gccaatcctt gcgcacgcca agaattcatc tggcgtttct attcctgtaa 421 aaaacaccac tatcacttcg tcatcgcagc aacagaagac gaagcacgct ctcaattgcc 481 tgatggcccc tgcattttta ctgcccgttt ttcaactaac tcgcgcaatt cacttagtta 541 ctggaacctc cccttctctg ccgacgttca ggggggttta tgaaaaaacc tctcgtcacc 601 cgtaatgaca ta // LOCUS PSEPGLAA 1986 bp ds-DNA BCT 14-AUG-1990 DEFINITION P.solanacearum polygalacturonase A (PGA, pglA) gene, complete cds. ACCESSION M33692 KEYWORDS alpha-1,4-endopolygalacturonase A. SOURCE P.solanacearum Race 1 (strain AW) DNA, clone pJE8. ORGANISM Pseudomonas solanacearum Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Pseudomonadaceae. REFERENCE 1 (bases 1 to 1986) AUTHORS Huang,J. and Schell,M.A. JOURNAL Unpublished (1990) STANDARD full staff_entry REFERENCE 2 (bases 1 to 1715) AUTHORS Huang,J. and Schell,M.A. TITLE DNA sequence analysis of pglA and mechanism of export of its polygalacturonase product from Pseudomonas solanacearum JOURNAL J. Bacteriol. 
172, 3879-3887 (1990) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by M.A.Schell, 13-APR-1990, for release after publication. FEATURES from to/span description pept 100 1689 polygalacturonase A precursor (pglA) sigp 100 162 polygalacturonase A signal peptide matp 163 1686 polygalacturonase A mRNA 58 > 1986 pglA mRNA signal 27 59 promoter region signal 27 31 -30 Pseudomonas promoter consensus binding 89 92 pglA ribosomal binding site (put.) BASE COUNT 351 a 685 c 616 g 334 t ORIGIN 1 ctgcagcacg cccttcgcgc agacgaaggc gtgccgctgc ttcgctgatc ccggcgcatt 61 gcgtcctcac aaaagacctc tcttttcgga gaattgacct tgaatcaccg ctacacactc 121 ctcgcactcg ccgccgcagc cctgtctgcc ggagcgcacg caaccggtac gtccgtcacg 181 gccccgtggg gcgaggtggc cgagcccagc ctgccggcgg attcggcggt ctgcaagacg 241 ctgtcggcct cgatcacgcc gatcaaaggc tccgtcgatt cggtggacgg caaccccgcc 301 aactcccagc cggacgcgag ccgcatccag tccgccatcg acaactgccc ggccggccag 361 gccgtcaagc tcgtgaaggg cagcgccggc gaatccggct tcctgagcgg ctcgctgaag 421 ctcaagtcgg gcgtgacgct gtggatcgac accggcgtga ccctgttcgc ctcgcgcaac 481 ccggccgact acgacaatgg cctgggcacc tgcggcaccg ccaccaccag caacgacaag 541 tcgtgcaacg cgctgatcgt ggcgcgcgat acggcgggca gcggcatcgt cggcgccggc 601 gccatcgatg gccgcggcgg cagcctggtc accagcggcc cgaacgcgaa ccggctcacc 661 tggtgggaca tcgcttacct gaacaagacc aaggggctga accagcagaa cccgcgcctg 721 atccagacgt ataacggcag cgcgttcacg ctgtacggcg tgacagtgca gaactcgccc 781 aacttccata tcgtcaccac cggcacgtcg ggcgtaacgg catggggcat caagatcgtg 841 acgccgagcc tggcctacgc cgtggcgggc tacaagtgcc cgtccggctc cacgcccgac 901 aaggtcacgc ccgcgacctg cttcacgccc gagacggtca agaacaccga cggcttcgac 961 ccggggcaat cgaccaacgt ggtgctggcc tattcctaca tcaacacggg tgacgaccac 1021 gtcgcggtca aggccagcag cggcccgacg cgcaacctgc tcttcgcgca caaccatttc 1081 tactatgggc acggcctgtc gatcggcagc gaaaccaata ccggcgtgag caacatgctg 1141 gtgaccgacc tgacgatgga cggcaacgac agctcggccg gcaacggcct gcgcatcaag 1201 tccgatgcct cgcgcggcgg caaggtcacc aacatcgtct 
acgacggcat ctgcatgcgc 1261 aacgtcaagg agccgctggt gttcgacccc ttctacagca gcgtgaaggg ttcgctctat 1321 ccgaacttca ccaacatcgt cgtcaagaac ttccatgacc tgggcagcgc caagtccatc 1381 aagcgcacga tgacgttcct gggctacaaa gccaacaagc agaagaaccc gctgacgatc 1441 acgctcgaca acgtggtctt cgacggcacg ctgccggcct tcgaggggtc ccactacggc 1501 gggccggctt cgcccaacgg ggtgcacttc accttcggcg gcaccggtcc ggtgagcttt 1561 gccgacgcca tcgtcacgtc gtcgaccacc gatgtgacgg tgaccggcac gccgggcacg 1621 gccgcggcgg tggattgcag caaggccttc gtgccgctga agtccgtggc accgacctcg 1681 ccgatctgat cgggcgactc cgctcagcgg tacccgttct gccgcttgta ctggcggatg 1741 gcttcgtcgt ggttggggtc ctcgtgtgca ttgtgcgcgc gctgatcttc gaactcacgg 1801 atggcattgg cctgagcctc cgtcagtttg ggcagcggcg gcttgcgttg gggtgacggg 1861 gtggtgctgt acgtcaccgg ctggctgtcg gcggagttcg tcttcgtagg cgatgtggtg 1921 tccgcccttt tcaccgtggt agtcgttgcg cgcttcggtc tcgcggtcca gttgggtccg 1981 ctgcag // LOCUS RATCATHHA 1362 bp ss-mRNA ROD 14-AUG-1990 DEFINITION Rat cathepsin H mRNA, complete cds. ACCESSION M36320 Y00708 KEYWORDS cathepsin H; cysteine protease. SOURCE Rat (strain Wistar) kidney, cDNA to mRNA, clone lambda-H44. ORGANISM Rattus norvegicus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 1362) AUTHORS Ishidoh,K. JOURNAL Unpublished (1987) Tokyo Metro Inst of Med Sci,Tokyo 113, Japan STANDARD simple staff_entry REFERENCE 2 (bases 1 to 1362) AUTHORS Ishidoh,K., Imajoh,S., Emori,Y., Ohno,D., Kawasaki,H., Minami,Y., Kominami,E., Katunuma,N. and Suzuki,K. TITLE Molecular cloning and sequencing of cDNA for rat cathepsin H JOURNAL FEBS Lett. 226, 33-37 (1987) STANDARD simple staff_entry COMMENT EMBL features not translated to GenBank features: key from to description SITE 873 874 processing site SITE 211 213 pot. glycosylation site SITE 298 301 pot. glycosylation site SITE 685 688 pot. 
glycosylation site FEATURES from to/span description pept 4 1005 cathepsin H pre-pro-peptide sigp 4 342 cathepsin H signal peptide matp 343 1002 cathepsin H mRNA < 1 1362 cathepsin H mRNA signal 1320 1325 poly A signal BASE COUNT 369 a 341 c 343 g 309 t ORIGIN 1 gcgatgtgga ctgcgctgcc cctgctgtgc gctggggcct ggctgctgag tgctggggcc 61 accgctgagc tgaccgtgaa tgccatagaa aagtttcact ttacgtcatg gatgaaacag 121 catcaaaaga cgtacagctc gagggagtac agccacaggc tgcaggtgtt tgccaacaac 181 tggaggaaga ttcaagccca caaccagagg aaccacacat ttaaaatggg attgaaccag 241 ttttcagata tgagctttgc cgaaataaaa cacaaatacc tctggtcaga gcctcagaat 301 tgctcagcca ccaaaagtaa ctacctccgt ggtactggcc cctacccatc ctccatggac 361 tggaggaaga aaggaaatgt cgtttcacca gtgaagaacc agggggcctg tggcagctgc 421 tggactttct caaccactgg ggccctagag tcagctgtgg ctattgccag tgggaaaatg 481 atgaccttgg ctgagcagca gctggtggat tgtgcccaga acttcaacaa tcatggctgc 541 caaggaggtc tccccagcca ggccttcgag tacatcctgt acaacaaggg catcatggga 601 gaggacagct acccttacat aggcaagaat ggtcagtgca aattcaaccc agaaaaggcc 661 gtcgcgttcg tcaagaatgt tgtcaacatc acactcaatg atgaggctgc aatggtagag 721 gctgtggccc tatacaatcc tgtgagcttt gcctttgagg tgactgaaga ttttatgatg 781 tataaaagtg gtgtctactc cagtaactcc tgtcataaaa ctccagataa agtaaaccat 841 gcagtcctgg ctgttggcta tggagaacag aatggattac tctactggat tgtgaaaaac 901 tcttggggct ccaactgggg gaacaatggg tacttcctca ttgagcgtgg aaagaacatg 961 tgtggcctgg ctgcctgtgc ctcctacccc atccctcagg tataagccac ggctgcacag 1021 gccaactgct tggcagacaa agggaggaac tggtcctacg atgagaatgc cgccctggag 1081 aaagttgttc agaaatccac ccagaggccc tctcactcct gagtctagac gcctaaagac 1141 aagtaaggaa gaacttgacc agcaacaagc ccgcccatgt gacgacatca ccagccatac 1201 gctttgtttg aatatggttt ttaaatgacc caaaaccatg tggacctaga atcttctctt 1261 ttcagctctc ttcatatagg gaaagctccg atggcttacc ttttctatgt tgtatattca 1321 ataaatgcag tgaacgcctg cctgggctgg gcctagaccc ct // LOCUS RATDRTKCA 471 bp ss-mRNA ROD 14-AUG-1990 DEFINITION Rat delayed-rectifier-type K+ channel mRNA, complete cds. 
ACCESSION M36461 KEYWORDS delayed-rectifier-type K+ channel. SOURCE Rat neonatal heart and ovariectomized, diethylstilbestrol-primed uterus, cDNA to mRNA. ORGANISM Rattus norvegicus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 471) AUTHORS Folander,K., Smith,J.S., Antanavage,J., Bennett,C., Stein,R.B. and Swanson,R. TITLE Cloning and expression of the delayed-rectifier I-s-K channel from neonatal rat heart and diethylstilbestrol-primed rat uterus JOURNAL Proc. Natl. Acad. Sci. U.S.A. 87, 2975-2979 (1990) STANDARD simple staff_entry FEATURES from to/span description pept 41 433 delayed-rectifier-type K+ channel BASE COUNT 110 a 132 c 122 g 107 t ORIGIN 1 agagttttgc tccacatcag ggaaaccgtg aagccccagg atggccctgt ccaattccac 61 aactgttctg ccttttctgg cctcgctgtg gcaggaaaca gatgagccgg gtggcaatat 121 gtcggcggac ttggctcgta ggtcccagct ccgagatgac agcaagctgg aggctctcta 181 tatcctcatg gtgctgggtt tcttcggctt cttcaccctg ggcatcatgc tgagttacat 241 ccgatccaag aagctggaac actcgcacga ccctttcaac gtgtacatcg agtcggacgc 301 ctggcaggag aaaggcaagg ccctcttcca ggcccgcgtt ctggagagct tcagagcttg 361 ctatgtcatt gaaaaccagg cggccgtaga acaacctgcc acacaccttc ctgaactgaa 421 gccactgtca tgaaccccat agttaattaa tagacaagtg ataagtgggt c // LOCUS RATPAPA 1603 bp ss-mRNA ROD 14-AUG-1990 DEFINITION Rat prostatic acid phosphatase (rPAP) mRNA, complete cds. ACCESSION M32397 KEYWORDS prostatic acid phosphatase. SOURCE Rat (strain Sprague-Dawley) prostate, cDNA to mRNA, clone rPAP-[145,343A]. ORGANISM Rattus norvegicus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 1603) AUTHORS Roiko,K., Jaenne,O.A. and Vihko,P. 
TITLE Primary structure of rat secretory acid phosphatase and comparison to other acid phosphatases JOURNAL Gene 89, 223-229 (1990) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by P.Vihko, 02-MAR-1990, for release after publication. FEATURES from to/span description pept 41 1186 prostatic acid phosphatase precursor (rPAP; EC 3.1.3.2) sigp 41 133 prostatic acid phosphatase signal peptide matp 134 1183 prostatic acid phosphatase mRNA < 1 1603 rPAP mRNA signal 1583 1588 polyA signal BASE COUNT 434 a 414 c 387 g 368 t ORIGIN 1 cacctcctac agctgccaca agcagctctc ttcccccacc atgagagctg tccctctgca 61 cctcgtcggg acagcaagcc tcacccttgg cttcttgctc ctgctatctc tccgcctgga 121 cccaggccaa gccaaggagt tgaagtttgt gacattggtg ttccggcatg gagaccgagg 181 tcccatcgag acctttccta atgaccccat taaggaatcc tcgtggccac aaggatttgg 241 ccaactcacc aagtggggca tgggacagca ctacgaactc ggaagttata taaggagaag 301 atacgggaga ttcttgaaca actcctataa acatgaccag gtttatatcc gaagcacaga 361 tgttgacagg actctgatga gcgctatgac aaacctcgca gccctgtttc cccctgaggg 421 gatcagcatc tggaatccca gactgctctg gcagcccatc ccagtgcaca ccgtgtctct 481 ctctgaggat cggttgctat acctgccttt cagggactgt cctcgctttc aagaactcaa 541 gagtgagact ttaaaatctg aggagttcct gaagaggctt caaccatata aaagcttcat 601 agacaccttg ccatcgctgt cgggattcga ggaccaggat ctttttgaaa tctggagtag 661 gctttacgac cctttatatt gcgagagtgt tcacaatttc accttccgca cctgggccac 721 agaggacgcc atgactaagt tgaaggagtt gtcagaatta tctctgttat ctctttatgg 781 aattcacaag cagaaagaga aatctagact ccaggggggc gtcctggtca atgaaattct 841 caagaacatg aagcttgcaa ctcaaccaca gaaggccagg aagttgatca tgtattctgc 901 atatgacact actgtgagtg gcctgcagat ggcgctagag ctttataatg gacttctacc 961 tccctacgct tcctgccaca taatggaatt gtaccaggat aatgggggga ccttcgtgga 1021 gatgtactac cggaatgaga cccagaacga gccctaccca ctcacgctgc cgggctgtac 1081 ccacagctgc cctctggaga agtttgcaga gctactggac cccgtgatcc cccaggactg 1141 ggccacagag tgtatgggca caagcaacca ccaagcgtcg ctgtaaaaga acaaccagtt 1201 gccagtgtaa 
tcaggaaaag agttaggtca aacttgtggg tctgccgggg atccatgatt 1261 agcacaaaga actctattgc tgaaggattc gaagattttg cttgagcggg tgagtaaagc 1321 tgccctggga atgtctgatg taaaaacagg tttcagacga agcctaccaa gttagcgtgc 1381 aagcatgaaa aagaagacac ttatcccagg ctaatgggca aaggacctga ttcacctgac 1441 aacaacgctc acctgttccc ctgccttctc tcacgaggca aaaagatgag acaggaggac 1501 acctggagga gttttccata cccagcctag aacgatcaga ttttatatgt atgtcaaaag 1561 acgctgggta gcagaactga ataataaaag cctcttggtg tcc // LOCUS RHLNIFHA 524 bp ds-DNA BCT 14-AUG-1990 DEFINITION R.leguminosarum nitrogen fixation protein nifH gene, 5' end. ACCESSION M36435 KEYWORDS nitrogen fixation protein. SOURCE R.leguminosarum DNA, clone pGB5. ORGANISM Rhizobium leguminosarum Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Rhizobiaceae. REFERENCE 1 (bases 1 to 524) AUTHORS Roelvink,P.W., Harmsen,M., van Kammen,A. and van den Bos,R.C. TITLE The nifH promoter region of Rhizobium leguminosarum: Nucleotide sequence and promoter elements controlling activation by NifA protein JOURNAL Gene 87, 31-36 (1990) STANDARD simple staff_entry FEATURES from to/span description pept 435 > 524 nifH protein mRNA 364 > 524 nifH mRNA binding 420 426 nifH ribosomal binding site BASE COUNT 124 a 139 c 137 g 124 t ORIGIN 1 gttaacccta tgcgggcacc agacgcggca tggcgcttgt catcgccgga gacagcgctt 61 tgcggcaagt gatcagaatt tgttacggcc ggtctggtga tggctgctct aacaaagtcc 121 tagaagctac gcatggataa atgagacatc ctgccggttg gtcagatatt gcacaaacgg 181 ccaccattca ttggccctca gagggtgaca atctatttcg acattgtcat cacctttgtc 241 ggttacttga caagcgtttg tgttctcaat cccgtatttt acgcccacat agctggaatc 301 gcggcgaatt attgttcgtc agctcaatcg gccgcttggc acgaatcttg agagctattg 361 agaggcagcg gaacggccgc cgcattccgt tgcgggtaac caaattgctt cgaacacatg 421 aaggaacgcc aagcatggca gctctgcgtc agatcgcatt ctatggaaaa ggcggaattg 481 gcaagtccac tacgtcccaa aacacgctgg ccgcccttgt cgac // LOCUS STYAHPCFA 2608 bp ds-DNA BCT 14-AUG-1990 DEFINITION S.typhimurium alkyl hydroperoxide reductase (ahpCF) gene, complete cds. 
ACCESSION J05478 KEYWORDS alkyl hydroperoxide reductase; oxidoreductase. SOURCE S.typhimurium (strain TN1379) DNA. ORGANISM Salmonella typhimurium Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Facultatively anaerobic rods; Enterobacteriaceae. REFERENCE 1 (bases 1 to 2608) AUTHORS Tartaglia,L.A., Storz,G., Brodsky,M.H., Lai,A. and Ames,B.N. TITLE Alkyl hydroperoxide reductase from Salmonella typhimurium: Sequence and homology to thioredoxin reductase and other flavoprotein disulfide oxidoreductases JOURNAL J. Biol. Chem. 265, 10535-10540 (1990) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by L.Tartaglia, 05-JUN-1990. FEATURES from to/span description pept 166 720 alkyl hydroperoxide reductase C22 protein (ahpC) pept 969 2534 alkyl hydroperoxide reductase F52a protein (ahpF) mRNA 142 > 2608 ahpCF mRNA binding 155 160 ahpC ribosomal binding site (put.) binding 959 964 ahpF ribosomal binding site (put.) signal 105 110 ahpCF -35 region signal 129 134 ahpCF -10 region BASE COUNT 652 a 655 c 724 g 577 t ORIGIN 13 minutes on S.typhimurium map. 
1 tcaggcggcc tttttacttt agatggctga cagggcgcag tgttcacaat gtcgcccgaa 61 tcgggttgtt agttaacgct tattgatttg ataatggaaa cgcattagcc gaatcagcaa 121 aatttcgtta acttactcct caacgaaaac acggaggaag tatagatgtc cttaattaac 181 accaaaatca aacctttcaa aaaccaggcg ttcaaaaacg gtgaattcat tgaagtcacc 241 gagaaagata ccgaaggccg ctggagcgtc ttcttcttct acccggccga ttttaccttt 301 gtttgcccga ctgaactggg tgacgttgcc gaccattacg aagaactgca gaagctgggc 361 gtagacgttt attccgtctc aaccgatact cacttcacgc acaaagcatg gcacagcagc 421 tctgaaacta tcgcaaaaat caaatatgcg atgatcggcg acccgactgg cgccctgacc 481 cgtaacttcg acaacatgcg tgaagatgaa ggtctggcag acgtgcgacc tttgttgttg 541 acccgcaggg tatcatccag gcgatcgaag ttaccgctga aggtatcggc cgtgacgcgt 601 ctgacctgct gcgtaaaatt aaagcagcac agtacgttgc cgctcaccca ggcgaagtat 661 gcccggcgaa atggaaagaa ggcgaacgac tctggctcca tccttagacc tggtcggtaa 721 aatctaaatt tccgtcgtct ttcacgccat agctgcgttg gcgtcgcctg ctcaccccgg 781 tcacttactt cagtaagctc ccggggattc tcaggctagc cgccttactc tgacgtgaaa 841 tactttggaa attgtttcag tcatgggcgc acccgcgccc atttcatttc agcaccatga 901 cgcaagttgc attaatgcag tcccgcgaag tcggcttgca tgatgatgtt ttcagagagg 961 gaagaagaat gctcgacaca aatatgaaaa cccagctcag ggcttacctt gagaaactga 1021 ccaaacctgt tgagctgatt gccacgctgg atgacagcgc taaatcggcg gaaatcaagg 1081 aactgctggc tgaaatcgcc gagctgtccg acaaagtcac gtttaaagaa gacaatactc 1141 tgccggtgcg caagccgtct ttcctgatta cgaatccagg ttctcagcaa gggccgcgct 1201 ttgccggttc tccgctgggc cacgaattta cctcgctggt tttggcgctg ctgtggaccg 1261 gcggtcatcc gtcaaaagaa gcgcagtcgc tgctggagca gattcgcgat attgacggcg 1321 attttgaatt cgaaacttat tattcactct cctgccataa ctgcccggac gtagtgcagg 1381 cgctgaacct gatggcggtg ctgaacccgc gtattaaaca tacggcaatt gatggcggca 1441 ccttccagaa cgaaatcacc gaacgtaacg tgatgggcgt tccggcggta tttgttaacg 1501 gtaaagagtt cggccaggga cgtatgactc tgaccgagat tgtcgctaaa gtggatactg 1561 gcgcagaaaa acgcgcggcg gaagcgctga acaaacgcga tgcgtatgac gtactgattg 1621 tcggctccgg cccggcgggc gcggcggcgg cggtctactc ggcgcgcaaa ggcatccgta 1681 ccggtctgat gggcgagcgc 
ttcggcggtc aggtgctgga taccgtggat atcgaaaact 1741 atatctcggt gccgaaaacc gaaggccaga aactggcggg cgcgctgaaa gcgcatgtca 1801 gcgattacga cgttgacgtt atcgacagcc agagcgcaag taaactggtt ccggccgcaa 1861 cagaaggcgg cttgcatcag attgaaacgg cgtccggcgc agtattgaaa gcgcgtagta 1921 tcatcatcgc caccggcgcg aaatggcgca atatgaatgt gccgggcgag gatcagtatc 1981 gcaccaaagg cgtcacctat tgtccgcact gcgacggtcc gttatttaaa gggaagcgtg 2041 tggcggtcat cggcggcggc aactctggcg ttgaagcggc tatcgatctg gcaggtattg 2101 tagaacatgt tacgttgctg gagttcgcgc cggagatgaa ggcggatcag gtgctacagg 2161 acaaagtgcg tagtctgaaa aatgtcgata tcattttgaa tgcgcaaacg acagaagtga 2221 agggcgacgg ttcgaaagtg gtgggactgg agtatcgtga ccgtgtcagc ggcgatattc 2281 atagcgtcgc gctggcaggg atttttgtgc agattggtct gttgccgaat acccactggc 2341 tggaaggcgc gcttgagcgt aaccgcatgg gcgagatcat tattgacgcc aaatgtgaaa 2401 ccagcgttaa gggcgtcttc gccgcaggcg attgcaccac cgttccttac aaacagatta 2461 tcatcgccac gggcgaaggg gcgaaagcct cgttgagcgc ctttgattat ctgattcgca 2521 ccaaaatcgc ataaaaagaa gtaagcacac ctgcaaggcc agttaactgg ccatcgtaaa 2581 tggccgatag tgtaagctat tcgggcct // LOCUS TMGCG 6355 bp ss-RNA VRL 14-AUG-1990 DEFINITION Tobacco mild green mosaic virus complete genome. ACCESSION M34077 M22483 KEYWORDS coat protein. SOURCE Tobacco mild green mosaic virus (strain U2-TMV), cDNA to viral RNA, from N.tabacum cv. Samsum. ORGANISM Tobacco mild green mosaic virus Viridae; ss-RNA nonenveloped viruses; Rod-shaped ss-RNA viruses; Tobamovirus. REFERENCE 1 (bases 6127 to 6355) AUTHORS Garcia-Arenal,F. TITLE Sequence and structure at the genome 3' end of the U2-strain of Tobacco mosaic virus, a histidine-accepting tobamovirus JOURNAL Virology 167, 201-206 (1988) STANDARD full staff_entry REFERENCE 2 (bases 1 to 6355) AUTHORS Solis,I. and Garcia-Arenal,F. 
TITLE The complete nucleotide sequence of the genomic RNA of the tobamovirus tobacco mild green mosaic virus JOURNAL Virology 177, 553-558 (1990) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by F.Garcia-Arenal, 10-FEB-1989. The RNA appears to have a tRNA-like, L-shaped structure at the 3' terminus, linked to a quasi-continuous double-helical stalk, with five pseudoknots involved in the formation of the whole structure. However, the structure of U2-TMV RNA is less stringently conserved than the 3' termini of "vulgare" and other histidine-accepting tobamoviruses. Draft entry and computer-readable sequence for [1] kindly submitted by F.Garcia-Arenal, 08-MAY-1990, for release after publication. FEATURES from to/span description pept 71 3406 126 kDa protein pept 71 3403 183 kDa protein 3407 4900 183 kDa protein pept 4890 5660 28.5 kDa protein pept 5666 6145 coat protein BASE COUNT 1933 a 1112 c 1489 g 1821 t ORIGIN 1 gatgttttaa tagttttcga caacaacaat taaaacaaaa acaacatatt acaaacaaca 61 aacaacaaca atggcacaca tacaatctat aattagcaac gcccttcttg aaagcgtgag 121 tggtaaaaac actctcgtta atgaccttgc aagaaggcgc atgtacgata cggccgtgga 181 agaatttaac gcccgcgacc gtagaccaaa ggtcaacttt tccaaaacta ttagcgaaga 241 gcaaacgctt ctagtctcca acgcgtaccc ggagttccag attacctttt ataatactca 301 aaatgccgta cacagtttgg ctggaggttt gagagcatta gaattggaat atctgatgct 361 acaagttccc tatggatcgc cgacatatga tataggtggg aactttgcag cacatttgtt 421 caaaggcagg gattacgtgc attgctgtat gcccaatctg gacatacgag atataatgag 481 gcacgaagga caaaaggact caattgagat gtatttgtcc agattgtctc gttctaacaa 541 ggtaattcct gagtttcaaa gggaggcttt taacaggtat gcagaagctc ccaacgaagt 601 ctgctgctct aaaacttttc aggattgtcg aatacatccg ccagagaata gtggtagaag 661 atacgctgtt gctctgcaca gtttgtatga tattcctgtg catgagtttg gagctgcgtt 721 aatatctaag aatatacatg tatgttatgc agcttccatt ttggcagaag cattattact 781 agaccagacg gaggttacgc ttaatgaaat aggcgcaact ttcaaaagag aaggtgatga 841 tgtttctttt ttctttgctg atgaaagtac 
tttaaattat agtcataaat acaaaaatat 901 cttgcattat gtagttaaat cttactttcc tgcttctagt agaatagttt actttaagga 961 atttttagtc actagggtta atacttggtt ttgtaaattt accaaagtag atacctatat 1021 tctgtacaag agtgttagac aagtagggtg tgatagtgat cagttctatg aggcgatgga 1081 agacgccttt gcttacaaga aaaccttggc catgttcaac actgaaagag caatctttag 1141 agacacggct tcggttaact tttggttccc taagatgaag gacatggtga tagtaccgct 1201 gtttgagggt tctattacca gcaaaaagat gacaaggagt gaggtcattg ttaatcgtga 1261 cttcgtttac acagtgctta atcatatcag aacatatcaa gccaaagcgt taacttacca 1321 gaacgtatta tctttcgtgg agtctataag atcccgcgtg ataatcaatg gtgttactgc 1381 taggtctgaa tgggatgtag ataaagcaat tcttcaaccc ttgtcaatga ctttcttctt 1441 gcagactaag ctggctgcgc ttcaagacga tatagtaatg ggaaagtttc ggtgcttgga 1501 taagaccact tctgaactta tttgggatga ggtgggcaaa ttttttggaa acgttttccc 1561 cactatcaaa gagagattgg tgagcaggaa aattctggat gtaagtgaga atgctctgaa 1621 gatcaagatc ccagatctgt atgtcacatg gaaagacagg ttcgtagctg aatacaccaa 1681 gtctgaggag ttaccgcatc tagatatcaa gaaggactta gaagaagctg agcaaatgta 1741 cgacgcgtta tcagaattat ctatccttaa gggtgctgat aatttcgata tcgcgaagtt 1801 caaagacatg tgcaaggctt tagatgttag tcctgatgtg gcagcacgag taatcgttgc 1861 agtggccgag aatagaagcg gtttaactct tacttttgat aagccaaccg aggagaatgt 1921 ggctaaggct cttaaaagca cggcgtctga ggccgtggta tgtcttgaac cgacatccga 1981 agaggtgaac gtaaataaat tttctattgc tgagaaaggg agattgcctg tgtgtgcaga 2041 aagtcatggt ttgacgaatg ctaacttaga gcaccaggag ttggagtccc tcaacgattt 2101 ccataaggct tgcgtggata gtgtgattac aaagcaaatg gcatcggttg tctacactgg 2161 ctcactcaaa gttcaacaaa tgaagaacta tgtggacagt ttggcagctt cgttgtccgc 2221 cactgtatca aatctatgca agtcactaaa ggatgaagtc gggtatgatt ctgattccag 2281 ggagaaagtt ggtgtttggg atgtcacttt gaaaaagtgg ctcctcaaac ctgcggccaa 2341 aggtcattca tggggagttg tcctggatta caaggggaaa atgtttactg cacttctatc 2401 ttatgaagga gatagaatgg tgactgagag cgactggagg agggtggctg tatcatctga 2461 tacaatggta tattctgata ttgcaaagct ccaaaatctg aggaaaacaa tgagagacgg 2521 tgaaccccac gaacctactg caaagatggt acttgtggat 
ggggtgcctg gttgtggaaa 2581 gtacaaagga gattttgaaa gatttgatct tgatgaggat ttgatcttgg ttcctggaaa 2641 acaagctgct gctatgatca gaagaagggc taattcatct ggactgataa gagccacaat 2701 ggacaatgtg agaacggtag attcacttct aatgcatcca aaaccgcgat cacacaagag 2761 gctttttatt gatgaagggt tgatgctgca caccggttgt gttaacttcc tggtgcttat 2821 ctctggttgc gacatcgcat acatttacgg agatacacag cagattcctt tcattaacag 2881 agttcagaat ttcccgtatc ccaaacattt tgagaagctg caagtggatg aagttgagat 2941 gaggaggacc acactgagat gcccaggtga tgtgaatttt ttcctacaat cgaagtacga 3001 aggagcggtg acaaccactt caactgtaca acgatcggtc tcatctgaga tgataggcgg 3061 taagggagta ctaaacagtg tttccaaacc actaaaaggg aaaattgtaa ctttcactca 3121 ggctgataaa tttgagttag aggagaaggg ctataagaat gtgaacaccg ttcatgagat 3181 ccaaggagaa acctttgaag atgtgtcgct ggtcagattg acggcaactc cactgactct 3241 gatttccaag tcttccccgc atgttctagt cgctctgact agacacacaa agagcttcaa 3301 atattacacc gtagtgttag atcctttagt acagataatt agtgatttgt cttctttaag 3361 ctccttcctt ttagaaatgt atatggtaga agcaggtagt agatagcaat tacagatgga 3421 tgcagtgttc aaaggtcata atctctttgt ggcaacacct aaatcaggag actttccaga 3481 tctacagttc tattacgatg tatgcctccc tggtaatagt actatactta acaagtatga 3541 tgctgttacc atgaggttac gtgataatag tcttaatgtg aaggattgtg ttcttgattt 3601 ttccaaaagt attccgatgc caaaggaggt gaaaccatgt ctagagccag ttttgcgtac 3661 cgcggcggaa ccgccaaggg ctgcaggact actcgaaaat ctggttgcaa tgattaaaag 3721 aaatttcaac gcaccagacc tgacggggac gattgacatt gagagcaccg catctgttgt 3781 agtagataag ttttttgata gctattttat taaaaaagaa aaatacacaa aaaatattgc 3841 tggagtgatg acgaaggatt caatgatgag atggttggaa aacaggaaag aagtactatt 3901 ggacgacttg gctaactaca attttacaga tctgccggcc atcgatcagt acaagcacat 3961 gatcaaggct caaccaaaac agaaattgga cctttcaatt cagaatgaat accctgctct 4021 gcaaacaatt gtctaccatt cgaagcagat caacggtatt ttggccggtt tctcagagct 4081 tacaaggttg ctgctcgagg catttgattc taagaagttt cttttcttta ctaggaaaac 4141 tccagaacag attcaagaat ttttctcgga tctcgactcg cacgttccta tggatgtgtt 4201 agaactggat atttctaagt atgataagtc acagaacgag tttcattgtg 
ctgtagagta 4261 tgaaatatgg aaaagattgg gtctcaatga gtttttggcc gaagtgtgga aacaagggca 4321 caggaaaaca actttgaagg attacattgc tggaatcaag acatgtctgt ggtatcaaag 4381 gaaaagcggt gatgtgacta ctttcatcgg caatactgtt ataatagcag cttgcttggg 4441 ttcaatgtta ccgatggaaa aggtcataaa aggtgctttt tgtggagacg attccgtttt 4501 gtattttcca aagggtttgg atttccctga cattcagtca tgtgctaatc tcatgtggaa 4561 ttttgaggcc aaactgtata gaaagaggta cggttacttt tgtggtagat acatcataca 4621 ccatgataag ggagcaatag tgtattatga tcctttgaag ttgatctcca aacttggggc 4681 aaaacatatc aaggattatg atcacttaga agagttaagg gtgtctttgt gcgatgttgc 4741 ttgttcgctc ggaaactggt gcttaggctt tccgcagctg aacgcagcta tcaaggaggt 4801 tcataaaacc gcgattgatg gttcgtttgc ttttaattgt gttaacaaat ttttgtgtga 4861 taaattttta tttagaactt tgtttttaaa tggctgttag tctcagagat actgtcaaaa 4921 ttagcgagtt cattgatctt tcgaaacagg atgagatact tccggcattc atgactaagg 4981 tcaagagtgt tagaatatcg actgtggaca agattatggc tgttaagaat gatagtcttt 5041 ctgatgtaga tttacttaaa ggtgttaagt tagttaagaa agggtatgtg tgcttagctg 5101 atttggtagt gtctggggag tggaatctcc cggataactg ccgtggtggt gtcagtgttt 5161 gtattgtaga taagagaatg aaaaggagta aggaagcaac gctgggtgcg tatcacgccc 5221 ctgcttgcaa aaagaatttt tcttttaagc taatccctaa ttattcaata acatccgagg 5281 atgctgagaa gcacccgtgg caagtgttag tgaatatcaa aggagtggct atggaagaag 5341 gatactgtcc tttatctttg gagttcgttt caatttgtgt agtacataaa aataatgtaa 5401 gaaaaggttt gagggaacgt attttgagtg tgacagacgg ctcgccaatt gaactcactg 5461 aaaaggttgt tgaggagttc gtggatgaag taccaatggc tgtgaaactc gaaaaggttc 5521 cggaaaacaa aaaagaaatg gtaggtaata atgttaataa taagaaaata aataacagtg 5581 gtaagaaggg ttttaaaatt gaggaaattg aggataatgt aagtgatgac gagtctatcg 5641 cgtcatcgag tacgttttaa tcaatatgcc ttatacaatc aactctccga gccaatttgt 5701 ttacttatct tccgcttacg cagatcctgt gcagctgatc aatctgtgta caaatgcatt 5761 gggtaaccag tttcaaacgc aacaagctag gacaacagtc caacagcaat ttgcggatgc 5821 ctggaaacct gtgcctagta tgacagtgag atttcctgca tcggatttct atgtgtatag 5881 atataattcg acgcttgatc cgttgatcac ggcgttatta aatagctttg atactagaaa 
5941 tagaataata gaggttgata atcaacccgc accgaatact actgaaatcg ttaacgcgac 6001 tcagagggta gacgatgcta ctgtagctat aagggcttca atcaataatt tggctaatga 6061 actggttcgt ggaactggca tgttcaatca agcaggcttt gagactgcta gtggacttgt 6121 ctggaccaca actccggcta cttagctatt gttgtgagat ttcctaaaat aaagtcgctg 6181 aagacttaaa attcagggtg gctgatacca aaatcagcag tggttgttcg tccacttaaa 6241 tataacgatt gtcatatctg gatccaacag ttaaaccatg tgatggtgta tactgtggta 6301 tggcgtaaaa catcggagag gttcgaatcc tcccctaacc gccggtagcg gccca // LOCUS TRNTNPIA 1333 bp ds-DNA BCT 14-AUG-1990 DEFINITION Transposon Tn21 integrase (tnpI) gene, complete cds, and adenyltransferase (aadA), 5' end. ACCESSION M33633 KEYWORDS integrase. SOURCE Transposon Tn21 DNA, clone pMON90. ORGANISM Transposon Tn21 Prokaryota; Bacteria. REFERENCE 1 (bases 1 to 1333) AUTHORS Mercier,J., Lachapelle,J., Couture,F., Lafond,M., Vezina,G., Boissinot,M. and Levesque,R.C. TITLE Structural and functional characterization of tnpI a recombinase locus in Tn21 and related transposons JOURNAL J. Bacteriol. 172, 3745-3757 (1990) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by R.C.Levesque, 06-APR-1990, for release after publication. 
FEATURES from to/span description ORF 209 613 potential ORF1 ORF 665 1060 potential ORF2 ORF 1150 137 (c) potential ORF3 ORF 1298 > 1333 adenyltransferase (aadA) site 1288 1291 integration site GTTA BASE COUNT 286 a 396 c 408 g 243 t ORIGIN 1 ggatccatca ggcaacgacg ggctgctgcc ggccatcagc ggacgcaggg aggactttcc 61 gcaaccggcc gttcgatgcg gcaccgatgg ccttcgcgca ggggtagtga atccgccagg 121 attgacttgc gctgccctac ctctcactag tgaggggcgg cagcgcatca agcggtgagc 181 gcactccggc accgccaact ttcagcacat gcgtgtaaat catcgtcgta gagacgtcgg 241 aatggccgag cagatcctgc acggttcgaa tgtcgtaacc gctgcggagc aaggccgtcg 301 cgaacgagtg gcggagggtg tgcggtgtgg cgggcttcgt gatgcctgct tgttctacgg 361 cacgtttgaa ggcgcgctga aaggtctggt catacatgtg atggcgacgc acgacaccgc 421 tccgtggatc ggtcgaatgc gtgtgctgcg caaaaaccca gaaccacggc caggaatgcc 481 cggcgcgcgg atacttccgc tcaagggcgt cgggaagcgc aacgccgctg cggccctcgg 541 cctggtcctt cagccaccat gcccgtgcac gcgacacgtg ctcgcgcagg ctgggtgcca 601 agctctcggg taacatcaag gcccgatcct tggagccctt gccctcccgc acgatgatcg 661 tgccgtgatc gaaatccaga tccttgaccc gcagttgcaa accctcactg atccgcatgc 721 ccgttccata cagaagctgg gcgaacaaac gatgctcgcc ttccagaaaa ccgaggatgc 781 gaaccacttc atccggggtc agcaccaccg gcaagcgccg cgacggccga ggtcttccga 841 tctcctgaag ccagggcaga tccgtgcaca gcaccttgcc gtagaagaac agcaaggccg 901 ccaatgcctg acgatgcgtg gagaccgaaa ccttgcgctc gttcgccagc caggacagaa 961 atgcctcgac ttcgctgctg cccaaggttg ccgggtgacg cacaccgtgg aaacggatga 1021 aggcacgaac ccagtggaca taagcctgtt cggttcgtaa gctgtaatgc aagtagcgta 1081 tgcgctcacg caactggtcc agaaccttga ccgaacgcag cggtggtaac ggcgcagtgg 1141 cggttttcat ggcttgttat gactgttttt tggggtacag tctatgcctc gggcatccaa 1201 gcagcaagcg cgttacgccg tgggtcgatg tttgatgtta tggagcagca acgatgttac 1261 gcagcagggc agtcgcccta aaacaaagtt aaacatcatg agggaagcgg tgatcgccga 1321 agtatcgact caa // LOCUS TTERRDA 1504 bp ss-rRNA RNA 14-AUG-1990 DEFINITION T.tenax 16S rRNA. ACCESSION M35966 KEYWORDS 16S ribosomal RNA. SOURCE T.tenax RNA. 
ORGANISM Thermoproteus tenax Prokaryota; Bacteria; Mendosicutes; Archaeobacteria; Thermoproteales; Thermoproteaceae. REFERENCE 1 (bases 1 to 1504) AUTHORS Leinfelder,W., Jarsch,M. and Bock,A. TITLE The phylogenetic position of the sulfur-dependent archaebacterium Thermoproteus tenax: Sequence of the 16S rRNA gene JOURNAL Syst. Appl. Microbiol. 6, 164-170 (1985) STANDARD simple staff_entry FEATURES from to/span description rRNA 1 1504 16S rRNA BASE COUNT 297 a 448 c 563 g 196 t ORIGIN 1 aaaccggttg atcctgccgg acctgaccgc tatcggggtg gggctaagcc atgcgagtcg 61 cgcgcccggg gcgccgggcg cggcgcacgg ctcagtaaca cgtacccaac ctaacctcgg 121 gagggggaca accccgggaa actggggctg atcccccata ggggaagggc gctggaaggc 181 cccttcctcc aaagggatcg cgggcgatct cccgcggtcc gcccgagggt gggggtacgg 241 cccatcaggt tgttggcggg gtaacggccc gccaagccga agacgggtag gggcggtgag 301 agccgtgagc cccgagatgg gcactgagac aagggcccag gccctacggg gtgcagcagg 361 cgcgaatact ccgcaatgcg ggcaaccgcg acggggccac cccgagtgcc gggcgaagag 421 cccggctttt gcccggtgta aggagccggg cgaataagcg gggggtaagt ctggtgtcag 481 ccgccgcggt aataccagcc ccgcgagtgg tcagggtgat tactgggctt aaagcgcccg 541 tagccggccc ggcaagtcgc tcctgaaatc cccaggctca acctgggggc agggggcgat 601 actgccgggc tagggggcgg gagaggccgc cggtactccg ggggtagggg cgaaatccta 661 taatccccgg aggaccacca gtggcgaaag cgggcggcca gaacgcgccc gacggtgagg 721 ggcgaaagcc gggggagcaa aggggattag atacccctgt agtcccggcc gtaaacgatg 781 cgggctagct gtcggccggg cttagggccc ggccggtggc gtagggaaac cgttaagccc 841 gccgcctggg gagtacggcc gcaaggctga aacttaaagg aattggcggg ggggcaccac 901 aaggggtgaa gcttgcggct taattggagt caacgccgga aaccttaccc ggggcgacag 961 caggatgaag gccaggctaa cgaccttgcc ggacgagctg agaggaggtg catggccgtc 1021 gtcagctcgt gccgtgaggt gtccggttaa gtccggcaac gagcgagacc cccaccccta 1081 gttgctaccc cgctcttcgg ggcggggggc acactagggg gactgccggc gtaagccgga 1141 ggaaggaggg ggcgacggca ggtcagtatg ccccgaaacc ccggggctgc acgcgagctg 1201 caatggcggg gacagcggga tccgaccccg aaagggggag gcaatcccgt aaaccccgcc 1261 ccagtaggga tcgagggctg caactcgccc 
tcgtgaacgt ggaatcccta gtaaccgcgt 1321 gtcaccaacg cgcggtgaat acgtccctgc cccttgcaca caccgcccgt cgcaccaccc 1381 gagggagttc tctgcgaggc ccctcgcttg gggcaaccca ggtgggggga cgagcagaga 1441 actcccgagg ggggtgaagt cgtaacaagg tagccgtagg ggaacctgcg gttggatcac 1501 ctcc // LOCUS VACPOLRDA 15212 bp ds-DNA VRL 14-AUG-1990 DEFINITION Vaccinia virus RNA polymerase (rpo30), polyA polymerase large subunit and DNA polymerase genes, complete cds. ACCESSION M36339 KEYWORDS DNA polymerase; RNA polymerase; polyA polymerase large subunit. SOURCE Vaccinia virus (strain WR) DNA. ORGANISM Vaccinia virus Viridae; ds-DNA enveloped viruses; Poxvirinae; Orthopoxvirus. REFERENCE 1 (bases 1 to 15212) AUTHORS Gershon,P., Jones,E.V., Moss,B. and Ahn,B.Y. TITLE Eukaryotic transcriptional elongation factor homolog is an integral subunit of the vaccinia virus encoded RNA polymerase JOURNAL Unpublished (1990) STANDARD full staff_entry REFERENCE 2 (bases 2041 to 8280) AUTHORS Gershon,P., Jones,E.V., Moss,B. and Ahn,B.Y. TITLE Eukaryotic transcriptional elongation factor homolog is an integral subunit of the vaccinia virus encoded RNA polymerase JOURNAL Unpublished (1990) STANDARD full staff_entry COMMENT Authorin copy for [1] kindly submitted by B.Y.Ahn, 10-JUL-1990. 
National Institutes of Health NIAID/LVD 9000 Rockville Pike Bethesda, Maryland 20892 FEATURES from to/span description pept 1621 182 (c) polyA polymerase large subunit pept 3831 1618 (c) ORF1 pept 4530 3958 (c) ORF2 pept 5365 4586 (c) RNA polymerase subunit (rpo30) pept 5414 6439 ORF3 pept 6576 8279 ORF4 pept 8361 8861 ORF5 pept 8986 9807 ORF6 pept 12834 9814 (c) DNA polymerase pept 12866 13153 ORF7 pept 13537 13148 (c) ORF8 BASE COUNT 4992 a 2561 c 2400 g 5259 t ORIGIN 1 aagcttcctg cgtactcaaa gtagataaac cctcatcgcc cgcgtgtgag agaagacctt 61 cgtccccgtc cagatgcgag agaatgaata accctggaaa acaagttccg tttatgagga 121 cggacatgct acaaaatatg ttcgcggcta atcgcgacaa cgtggcgtcg agacttttga 181 actaaaatac aattatatcc ttttcgatat taataaatcc gtgtcgtcca ggttttttat 241 ctctttcagt atgtgaatag ataggtattt tatctctatt catcatcgaa tttaagagat 301 ccgataaaca ttgtttgtat tctccagatg tcagcatctg atacaacaat atatgtgcac 361 ataaacctct ggcacttatt tcatgtacct tccccttatc actaaggaga atagtatttg 421 agaaatatgt atacatgata ttatcatgaa ttagatatac agaatttgta acactctcga 481 aatcacacga tgtgtcggcg ttaagatcta atatatcact cgataacaca ttttcatcta 541 gatacactag acatttttta aagctaaaat agtctttagt agtgacagta actatgcgat 601 tattttcatc gatgatacat ttcatcggca tattattacg cttaccatca aagactatac 661 catgtgtata tctaacgtat tctagcatgg ttgccatacg cgcattaaac ttttcaggat 721 ctttggatag atcttccaat ctatctattt gagaaaacat ttttatcatg ttcaatagtt 781 gaaacgtcgg atccactata tagatattat ctataaagat tttaggaact acgttcatgg 841 tatcctggcg aatattaaaa ctatcaatga tatgattatc gttttcatct tttatcacca 901 tatagtttct aagatatggg attttactta atataatatt atttcccgta ataaatttta 961 ttagaaatgc caaatctata agaaaagttc tagaattagt ctgaagaata tctatatcgc 1021 cgtatagtat atttggatta attagatata gagaatatga tccgtaacat atacaacttt 1081 tattatggcg tctaagatat tcttccatca acttattaac atttttgact agggaagata 1141 cattatgacg tcccattact tttgccttgt ctattactgc gacgttcata gaatttagca 1201 tatctcttgc caattcttcc attgatgtta cattataaga aattttagat gaaattacat 1261 ttggagcttt aatagtaaga actcctaata tgtccgtgta 
tgtggtcact aatacagatt 1321 gtagttctat aatcgtaaat aatttaccta tattatatgt ttgagtctgt ttagaaaagt 1381 agctaagtat acgatctttt atttctgatg cagatgtatc aacatcggaa aaaaatcttt 1441 ttttattctt ttttactaaa gatacaaata tgtctttgtt aaaaacagtt attttctgaa 1501 tatttctagc ttgtaatttt aacatatgat attcgttcac actaggtact ctgcctaaat 1561 aggtttctat aatctttaat gtaatattag gaaaagtatt ctgatcagga ttcctattca 1621 ttttgaggat ttaaaactct gattattgtc taatatggtc tctacgcaaa ctttttcaca 1681 gagcgataga gtttttgata actcgttttt cttaagaaat ataaaactac tgtctccaga 1741 gctcgctcta tcttttattt tatttaattc gatacaaact cctgatactg gttcagaaag 1801 taattcatta attttcagtc ctttatagaa gatatttaat atagataata caaaattttc 1861 agttcttgat atcgatctga ttgatcctag aactagatat attaataacg tgctcattag 1921 gcagtttatg gcagcttgat aattagatat agtatattcc agttcatatt tattagatac 1981 cgcattgccc agattttgat attctatgaa ttcctctgaa aataaatcca aaataactaa 2041 acattctatt ttttgtggat tagtgtactc tcttccctct atcatgttca ctactggtgt 2101 ccacgatgat aaatatctag agggaatata atatagtcca taggatgcca atctagcaat 2161 gtcgaataac tgtaatttga ttcttcgttc ttcattatga attgattctt gaggtataaa 2221 cctaacacaa attatattat tagacttttc gtatgtaatg tctttcatgt tataagtttt 2281 taatcctgga atagaatcta ttttaatgag gcttttaaac gcagagttct ccaacgagtc 2341 aaagcataat actctgttgt ttttcttata tacgatgtta cgattttctt ctttgaatgg 2401 aataggtttt tgaattagtt tataattaca acataataga taaggaagtg tgcaaatagt 2461 acgcggaaaa aacataatag ctcccctgtt ttcatccatg gttttaagta aatgatcact 2521 ggcttcttta gtcaatggat attcgaacat taaccgtttc atcatcattg gacagaatcc 2581 atatttctta atgtaaagag tgatcaaatc attgtgttta ttgtaccatc ttgttgtaaa 2641 tgtgtattcg gttatcggat ctgctccttt ttctattaaa gtatcgatgt cgatctcgtc 2701 taagaattca actatatcga catatttcat ttgtatacac ataaccatta ctaacgtaga 2761 atgtatagga agagatgtaa cgggaacagg gtttgttgat tcgcaaacta ttctaataca 2821 taattcttct gttaatacgt cttgcacgta atctattata gatgccaaga tatctatata 2881 attattttgt aagatgatgt taactatgtg atctatataa gtagtgtaat aattcatgta 2941 tttcgatata tgttccaact ctgtctttgt gatgtctagt ttcgtaatat 
ctatagcatc 3001 ctcaaaaaat atattcgcat atattcccaa gtcttcagtt ctatcttcta aaaaatcttc 3061 aacgtatgga atataataat ctattttacc tcttctgata tcattaatga tatagttttt 3121 gacactatct tctgtcaatt gattcttatt cactatatct aagaaacgga tagcgtccct 3181 aggacgaact actgccatta atatctctat tatagcttct ggacataatt catctattat 3241 accagaatta atgggaacta ttccgtatct atctaacata gttttaagaa agtcagaatc 3301 taagacctga tgttcatata ttggttcata catgaaatga tctctattga tgatagtgac 3361 tatttcattc tctgaaaatt ggtaactcat tctatatatg ctttccttgt tgatgaagga 3421 tagaatatac tcaatagaat ttgtaccaac aaactgttct cttatgaatc gtatatcatc 3481 atctgaaata atcatgtaag gcatacattt aacaattaga gacttgtctc ctgttatcaa 3541 tatactattc ttgtgataat ttatgtgtga ggcaaatttg tccacgttct ttaattttgt 3601 tatagtagat atcaaatcca atggagctac agttcttggc ttaaacagat atagtttttc 3661 tggaacgaat tctacaacat tattataaag gactttgggt agataagtgg gatgaaatcc 3721 tattttaatt aatgcgatag ccttgtcctc gtgcagatat ccaaacgctt ttgtgatagt 3781 atggcattca ttgtctagaa acgctctacg aatatctgtg acagatatca tctttagaga 3841 atatactagt cgcgttaata gtactacaat ttgtattttt taatctatct caataaaaaa 3901 attaatatgt atgattcaat gtataactaa actactaact gttattgata actagaatca 3961 gaatctaatg atgacgtaac caagaagttt atctactgcc aatttagctg cattattttt 4021 agcatctcgt ttagattttc catctgcctt atcgaatact cttccgtcga tgtctacaca 4081 ggcataaaat gtaggagagt tactaggccc cactgattca atacgaaaag accaatctct 4141 cctagtaatt tggcagtact cattaataac ggtgacaggg ttagcacctt tccaatcaat 4201 aattttttta gccggaataa catcatcaaa agacttatga tcctctctca ttgatttttc 4261 gcgggataca tcatctatta tgacgtcagc catagcatca gcatccggct tatccgcctc 4321 cgttgtcata aaccaacgag gaggaatatc gtcggagctg tacaccatag cactacgttg 4381 aagatcgtac agagctttat taacttctcg cttctccata ttaagttgtc tagttagttg 4441 tgcagcagta gctccttcga ttccaatggt tttaatagcc tcacacacaa tctctgcgtt 4501 agaacgctcg tcgatataga ttttagacat ttttagagag aactaacgca atcagtaata 4561 aaactaattt attttatcat tttttttatt catcatcctc tggtggttcg tcgtttctat 4621 cgaatgtagc tctgattaac ccgtcatcta taggtgatgc tggttctgga gattctggag 
4681 gagatggatt attatctgga agaatctctg ttatttcctt gttttcatgt atcgattgcg 4741 ttgtaacatt aagattgcga aatgctctaa atttgggagg cttaaagtgt tgtttgcaat 4801 ctctacacgc gtgtctaact agtggaggtt cgtcagcggc tctagtttga atcatcatcg 4861 gcgtagtatt cctactttta cagttaggac acggtgtatt gtatttctcg tcgagaacgt 4921 taaaataatc gttgtaactc acatccttta ttttatctat attgtattct actcctttct 4981 taatgcattt tataccgaat aagagatagc gaaggaattc tttttcggtg ccgctagtac 5041 ccttaatcat atcacatagt gttttatatt ccaaatttgt ggcaatagac ggtttatttc 5101 tatacgatag tttgtttctg gaatcctttg agtattctat accaatatta ttctttgatt 5161 cgaatttagt ttcttcgata ttagattttg tattacctat attcttgatg tagtactttg 5221 atgatttttc catggcccat tctattaagt cttccaagtt ggcatcatcc acatattgtg 5281 atagtaattc tcggatatca gtagcggtta ccgccattga tgtttgttca ttggatgagt 5341 aactactaat gtatacattt tccatttata acacttatgt attaactttg ttcatttata 5401 ttttttcatt attatgttga tattaacaaa agtgaatata tatatgttaa taattgtatt 5461 gtggttatac ggctacaatt ttataattag tgaaagtcag tgtccgatga tcaatgacga 5521 tagctttact ctgaaaagaa agtatcaaat cgatagtgcg gagtcaacaa taaaaatgga 5581 taagaagagg acaaagtttc agaatagagc caaaatggta aaagaaataa atcagacaat 5641 aagagcagca caaactcatt acgagacatt gaaactagga tacataaaat ttaagagaat 5701 gattaggact actactctag aagatatagc accatctatt ccaaataatc agaaaactta 5761 taaactattc tcggacattt cagccatcgg caaagcatca cggaatccaa gtaagatggt 5821 atatgctctg ctgctttaca tgtttcccaa tttgtttgga gatgatcata gattcattcg 5881 ttatagaatg catccaatga gtaaaatcaa acacaagatc ttctctcctt tcaaacttaa 5941 tcttattaga atattagtgg aagaaagatt ctataataat gaatgcagat ctaataaatg 6001 gagaataatt ggaacacaag ttgataaaat gttgatagct gaatctgata aatatacaat 6061 agatgcaagg tataacctaa aacccatgta tagaatcaag ggaaaatctg aagaagatac 6121 cctctttatc aaacagatgg tagaacaatg tgtgacatcc caggaattgg tggaaaaagt 6181 gttgaagata ctgtttagag atttgttcaa gagtggagaa tacaaagcgt acagatacga 6241 tgatgatgta gaaaatggat ttattggatt ggatacacta aaattaaaca ttgttcatga 6301 tatagttgaa ccatgtatgc ctgttcgtag gccagtggct aagatactgt gtaaagaaat 6361 
ggtaaataaa tactttgaga atccgctaca tattattggt aaaaatcttc aagagtgcat 6421 tgactttgtt agtgaatagg catttcatct ttctccaata ctaattcaaa ttgttaaatt 6481 aataatggat agtataaata gttattagtg ataaaatagt aaaaataatt attagaataa 6541 gagtgtagta tcatagataa ctctcttcta taaaaatgga ttttattcgt agaaagtatc 6601 ttatatacac agtagaaaat aatatagatt ttttaaagga tgatacatta agtaaagtaa 6661 acaattttac cctcaatcat gtactagctc tcaagtatct agttagcaat tttcctcaac 6721 atgttattac taaggatgta ttagctaata ccaatttttt tgttttcata catatggtac 6781 gatgttgtaa agtgtacgaa gcggttttac gacacgcatt tgatgcaccc acgttgtacg 6841 ttaaagcatt gactaagaat tatttatcgt ttagtaacac aatacaatcg tacaaggaaa 6901 ccgtgcataa actaacacaa gatgaaaaat ttttagaggt tgccaaatac atggacgaat 6961 taggagaact tataggcgta aattatgact tagttcttaa tccattattt cacggagggg 7021 aacccatcaa agatatggaa atcatttttt taaaactgtt taagaaaaca gacttcaaag 7081 ttgttaaaaa attaagtgtt ataagattac ttatttgggc ttacctaagc aagaaagata 7141 caggcataga gtttgcggat aatgatagac aagatatata tactctattt caacaaactg 7201 gtagaatcgt ccatagcaat ctaacagaaa cgtttagaga ttatatcttt cccggagata 7261 agactagcta ttgggtgtgg ttaaacgaaa gtatagctaa tgatgcggat attgttctta 7321 atagacacgc cattaccatg tatgataaaa ttcttagtta tatatactct gagataaaac 7381 agggacgcgt taataaaaac atgcttaagt tagtttatat ctttgagcct gaaaaagata 7441 tcagagaact tctgctagaa atcatatatg atattcctgg agatatccta tctattattg 7501 atgcaaaaaa cgacgattgg aaaaaatatt ttattagttt ttataaagct aattttatta 7561 acggtaatac atttattagt gatagaacgt ttaacgagga cttattcaga gttgttgttc 7621 aaatagatcc cgaatatttc gataatgaac gaattatgtc tttattctct acgagtgctg 7681 cggacattaa acgatttgat gagttagata ttaataacag ttatatatct aatataattt 7741 atgaggtgaa cgatatcaca ttagatacaa tggatgatat gaagaagtgt caaatcttta 7801 acgaggatac gtcgtattat gttaaggaat acaatacata cctgtttttg cacgagtcgg 7861 atcccatggt catagagaac ggaatactaa agaaactgtc atctataaaa tccaagagta 7921 gacggctgaa cttgtttagc aaaaacattt taaaatatta tttagacgga caattggctc 7981 gtctaggtct tgtgttagat gattataaag gagacttgtt agttaaaatg ataaaccatc 8041 ttaagtctgt 
ggaggatgta tccgcattcg ttcgattttc tacagataaa aaccctagta 8101 ttcttccatc gctaatcaaa actattttag ctagttataa tatttccatc atcgtcttat 8161 ttcaaaggtt tttaagagat aatctatatc atgtagaaga attcttggat aaaagcatcc 8221 atctaaccaa gacggataag aaatatatac ttcaattgat aagacacggt agatcataga 8281 acagaccaaa tatattatta ataatttgta tatacataga tataattatc acatattaaa 8341 aattcacaca tttttgataa atgggaactg ctgcaacaat tcagactccc accaaattaa 8401 tgaataaaga aaatgcagaa atgattttgg aaaaaattgt tgatcatata gttatgtata 8461 ttagtgacga atcaagtgat tcagaaaata atcctgaata tattgatttt cgtaacagat 8521 acgaagacta tagatctctc attataaaaa gtgatcacga gtttgtaaag ctatgtaaaa 8581 atcatgcaga gaaaagttct ccagaaacgc aacaaatgat tatcaaacac atatacgaac 8641 aatatcttat tccagtatct gaagtactat taaaacctat aatgtccatg ggtgacataa 8701 ttacatataa cggatgtaaa gacaatgaat ggatgctaga acaactctct accctaaact 8761 ttaacaatct ccgcacatgg aactcatgta gcataggcaa tgtaacgcgt ctgttttata 8821 cattttttag ttatctgatg aaagataaac taaatatata agtataatcc cattctaata 8881 ctttaacctg atgtattagc atcttattag aatattaacc taactaaaag acataacata 8941 aaaactcatt acatagttga taaaaagcgg taggatataa atattatggc tgccaccgtt 9001 ccgcgttttg acgacgtgta caaaaatgca caaagaagaa ttctagatca agaaacattt 9061 tttagtagag gtctaagtag accgttaatg aaaaacacat atctatttga taattacgcg 9121 tatggatgga taccagaaac tgcaatttgg agtagtagat acgcaaactt agatgcaagt 9181 gactattatc ccatttcgtt gggattactt aaaaagttcg agtttctcat gtctctatat 9241 aaaggtccta ttccagtata cgaagaaaaa gtaaatactg aattcattgc taatggatcg 9301 ttctctggta gatacgtatc atatcttcga aagttttctg ctcttccaac aaacgagttt 9361 attagttttt tgttactgac ttccattcca atctataata tcttgttctg gtttaaaaat 9421 actcagtttg atattactaa acacacatta ttcagatacg tctatacaga taatgccaaa 9481 cacctggcgt tggctaggta tatgcatcaa acaggagact ataagccttt gtttagtcgt 9541 ctcaaagaga attatatatt taccggtccc gttccaatag gtatcaaaga tataaatcac 9601 cctaatctta gtagagcaag aagtccatcc gattatgaga cattagctaa tattagtact 9661 atattgtact ttaccaagta tgatccggta ttaatgtttt tattgtttta cgtacctggg 9721 tattcaatta ctacaaaaat 
tactccagcc gtagaatatc taatggataa actgaatcta 9781 acaaagagcg acgtacaact gttgtaaatt attttatgct tcgtaaaatg taggttttga 9841 accaaacatt ctttcaaaga atgagatgca taaaacttta ttatccaata gattgactat 9901 ttcggacgtc aatcgtttaa agtaaacttc gtaaaatatt ctttgatcac tgccgagttt 9961 aaaacttcta tcgataattg tttcatatgt tttaatattt acaagttttt tggtccatgg 10021 tacattagcc ggacaaatat atgcaaaata atatcgttct ccaagttcta tagtttctgg 10081 attattttta ttatattcag taaccaaata catattaggg ttatctgcgg atttataatt 10141 tgagtgatgc attcgactca acataaataa ttctagagga gacgatctac tatcaaattc 10201 ggatcgtaaa tctgtttcta aagaacggag aatatctata catacctgat tagaattcat 10261 ccgtccttca gacaacatct cagacagtct ggtcttgtat gtcttaatca tattcttatg 10321 aaacttggaa acatctcttc tagtttcact agtaccttta ttaattctct caggtacaga 10381 ttttgaattc gacgatgccg agtatttcat cgttgtatat ttcttcttcg attgcataat 10441 cagattctta tataccgcct caaactctat tttaaaatta ttaaacaata ctctattatt 10501 aatcagtcgt tctaactcct ttgctatttc tatggactta tctacatctt gactgtctat 10561 ctctgtaaac acggagtcgg tatctccata cacgctacga aaacgaaatc tgtaatctat 10621 aggcaacgat gttttcacaa tcggattaat atctctatcg tccatataaa atggattact 10681 taatggattg gcaaaccgta acataccgtt agataactct gctccattta gtaccgattc 10741 tagatacaag atcattctac gtcctatgga tgtgcaactc ttagccgaag cgtatgagta 10801 tagagcacta tttctaaatc ccatcagacc atatactgag ttggctacta tcttgtacgt 10861 atattgcatg gaatcataga tggccttttc agttgaactg gtagcctgtt ttaacatctt 10921 tttatatctg gctctctctg ccaaaaatgt tcttaatagt ctaggaatgg ttccttctat 10981 cgatctatcg aaaattgcta tttcagagat gaggttcggt agtctaggtt cacaatgaac 11041 cgtaatatat ctaggaggtg gatatttctg aagcaagagc tgattattta tttcttcttc 11101 caatctattg gtactaacaa cgacaccgac taatgtttcc ggagatagat ttccaaagat 11161 acacacatta ggatacagac tgttataatc aaagattaat acattattac taaacatttt 11221 ttgttttgga gcaaatacct taccgccttc ataaggaaac ttttgttttg tttctgatct 11281 aactaagata gttttagttt ccaacaatag ctttaacagt ggacccttga tgactgtact 11341 cgctctatat tcgaatacca tggattgagg aagcacatat gttgacgcac ccgcgtctgt 11401 
ttttgtttct actccataat actcccacaa atactgacac aaacaagcat catgaataca 11461 gtatctagcc atatctaaag ctatgtttag attataatcc ttatacatct gagctaaatc 11521 aacgtcatcc tttccgaaag ataatttata tgtatcatta ggtaaagtag gacataatag 11581 tacgacttta aatccatttt cccaaatatc tttacgaatt actttacata taatatcctc 11641 atcaacagtc acataattac ctgtggttaa aacctttgca aatgcagcgg ctttgccttt 11701 cgcgtccgta gtatcgtcac cgatgaacgt catttctcta actcctctat ttaatacttt 11761 acccatgcaa ctgaacgcgt tcttggatat agaatccaat ttgtacgaat ccaatttttc 11821 agatttttga atgaatgaat atagatcgaa aaatatagtt ccattattgt tattaacgtg 11881 aaacgtagta ttggccatgc cgcctactcc cttatgacta gactgatttc tctcataaat 11941 acagagatat acagcttcct ttttgtccgg agatctaaag ataatcttct ctcctgttaa 12001 taactctaga cgattagtaa tatatctcag atcaaagtta tgtccgttaa aggtaacgac 12061 atagtcgaac gttagttcca acaattgttt agctattcgt aacaaaacta tttcagaaca 12121 taaaactagt tctcgttcgt aatccatttc cattagtgac tgtatcctca aacatcctct 12181 atcgacggct tcttgtattt cctgttccgt taacatctct tcattaatga gcgtaaacaa 12241 taatcgttta ccacttaaat cgatataaca gtaacttgta tgcgagattg ggttaataaa 12301 tacagaagga aacttcttat cgaagtgaca ctctatatct agaaataagt acgatcttgg 12361 gatatcgaat ctaggtattt ttttagcgaa acagttacgt ggatcgtcac aatgataaca 12421 tccattgtta atctttgtca aatattgctc gtccaacgag taacatccgt ctggagatat 12481 cccgttagaa atataaaacc aactaatatt gagaaattca tccatggtgg cattttgtat 12541 gctgcgtttc tttggctctt ctatcaacca catatctgcg acggagcatt ttctatcttt 12601 aatatctaga ttataactta ttgtctcgtc aatgtctata gttctcatct ttcccaacgg 12661 cctcgcatta aatggaggag gagacaatga ctgatatatt tcgtccgtca ctacgtaata 12721 aaagtaatga ggaaatcgta taaatacggt ctcaccattt cgacatctgg atttcagata 12781 taaaaatctg ttttcaccgt gactttcaaa ccaattaatg caccgaacat ccatttatag 12841 aatttagaaa tatattttca tttaaatgaa tcccaaacat tggggaagag ccgtatggac 12901 cattattttt atagtacttt cgcaagcggg tttagacggc aacatagaag cgtgtaaacg 12961 aaaactatat actatagtta gcactcttcc atgtcctgca tgtagacggc acgcgactat 13021 cgctatagag gacaataatg tcatgtctag cgatgatctg aattatattt 
attatttttt 13081 catcagatta tttaacaatt tggcatctga tcccaaatac gcgatcgatg tgacaaaggt 13141 taacccttta taaacttaac ccattataaa acttatgatt agtcacgact gaaataaccg 13201 cgtgattatt ttttggtata attctacacg gcatggtttc tgtgactatg aattcaaccc 13261 ccgttacatt agtgaaatct ttaacaaaca gcaagggttc gtcaaagaca taaaactcat 13321 tgtttacaat cgaaatagac cccctatcac acttaaaata aaaaatatcc ttatccttta 13381 ccaccaaata aaattctgat tggtcaatgt gaatgtattc acttaacagt tccacaaatt 13441 tatttattaa ctccgaggca catacatcgt cggtattttt tatggcaaac tttactcttc 13501 cagcatccgt ttctaaaaaa atattaacga gttccattta tatcatccaa tattattgaa 13561 atgacgttga tggacaaatg atacaaataa gaaggtacgg tacctttgtc caccatctcc 13621 tccaattcat gctctatttt gtcattaact ttaatgtatg aaaacagtac gccacatgct 13681 tccatgacag tgtgtaacac tttggataca aaatgtttga cattagtata attgttcaag 13741 actgtcaatc tataatagat agtagctata atatattcta tgatggtatt gaagaagatg 13801 acaaccttgg catattgatc atttaacaca gacatggtat caacagatag cttgaatgaa 13861 agagaatcag taattggaat aagcgtcttc tcgatggagt gtccgtatac caacatgtct 13921 gatattttga tgtattccat taaattattt agttttttct ttttattctc gttaaacagc 13981 atttctgtca acggacccca acatcgttga ccgattaagt tttgattgat ttttccgtgt 14041 aaggcgtatc tagtcagatc gtatagccta tccaataatc catcgtctgt gtgtagatca 14101 catcgtacac tttttaattc tctatagaag agcgacagac atctggagca attacagaca 14161 gcaatttctt tattctctac agatgtaaga tacttgaaga cattcctatg atgatgcaga 14221 attttggata acacggtatt gatggtatct gttaccataa ttcctttgat ggctgatagt 14281 gtcagagcac aagatttcca atctttgaca atttttagca ccattatctt tgttttgata 14341 tctatatcag acagcatggt gcgtctgaca acacaaggat taagacggaa agatgaaatg 14401 attctctcaa catcttcaat ggataccttg ctattttttc tggcattatc tatatgtgcg 14461 agaatatcct ctagagaatc agtatccttt ttgatgatag tggatctcaa tgacatggga 14521 cgtctaaacc ttcttattct atcaccagat tgcatggtga tttgtcttct ttcttttatc 14581 ataatgtaat ctctaaattc atcggcaaat tgtctatatc taaaatcata atatgagatg 14641 tttacctcta caaatatctg ttcgtccaat gttagagtat ttacatcagt tttgtattcc 14701 aaattaaaca tggcaacgga tttaatttta 
tattcctcta ttaagtcctc gtcgataata 14761 acagaatgta gataatcatt taatccatcg tacatggttg gaagatgctt gttgacaaaa 14821 tctttaattg tcttgatgaa ggtgggacta tatctaacat cttgattaat aaaatttata 14881 acattgtcca taggatactt tgtaactagt tttatacaca tctcttcatc ggtaagccca 14941 gacagaatat cgtgaacagg tggtatatta tattcatcag atatacgaag aacaatgtcc 15001 aaatctatat tgtttaatat attatataga tgtagtgtag ctcctacagg aatatcttta 15061 actaagtcaa tgatttcatc aaccgttaga tctattttaa agttaatcat ataggcattg 15121 atttttaaaa ggtatgtagc cttgactaca ttctcattaa ttaaccattc caagtcactg 15181 tgtgtaagaa gattatattc tatcataagc tt // LOCUS YSCERD2A 3327 bp ds-DNA PLN 14-AUG-1990 DEFINITION S.cerevisiae ER lumen protein retaining receptor (ERD2) gene, complete cds. ACCESSION M34777 KEYWORDS ER lumen protein retaining receptor; HDEL receptor. SOURCE S.cerevisiae (strain SEY2102) DNA. ORGANISM Saccharomyces cerevisiae Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes; Endomycetales; Saccharomycetaceae. REFERENCE 1 (bases 1 to 3327) AUTHORS Semenza,J.C., Hardwick,K.G., Dean,N. and Pelham,H.R.B. TITLE ERD2, a yeast gene required for the receptor-mediated retrieval of luminal ER proteins from the secretory pathway JOURNAL Cell 61, 1349-1357 (1990) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by J.C.Semenza, 01-JUN-1990, for release after publication. 
FEATURES from to/span description pept 1337 1358 ER lumen protein retaining receptor, exon 1 (ERD2) 1456 2093 ER lumen protein retaining receptor, exon 2 ORF 2492 2960 ORF1 IVS 1359 1455 ERD2 intron A site 1359 1364 consensus splice donor site 1416 1422 branchpoint sequence site 1453 1455 acceptor sequence BASE COUNT 996 a 639 c 574 g 1118 t ORIGIN 1 aagcttcctg acggaatgtt aattctcgtt gaccctgagc aggctgttga gcccggggat 61 ttctgcatag ccagacttgg gggtgatgag tttaccttca agaaactgat ccatccaaac 121 catttttggg tttagttgct gcttccgctg gtatccttca agatgtcatt gaaggtaagt 181 acgatcttga ggccggcgaa aacaaattca acttttaaat caaaatttta gaataaaagt 241 ctactgcatt aacaaagata aatttctaat ttttttatgt atattatatt ttttggaaga 301 gaaaatattt gtagtgcttc tctacactgt tttttttttt tcgttcatgt tgtctttcct 361 ctcctgaata tatttaattt aataaatcta tcttatctga catatagaag ctgtttcgat 421 agagttttcg actggccaac agttgttact gactggataa aaatttacga tttttcctcc 481 aatcttggct ttcaaaatga gaaataagag tcttgaaatt caaagaatgt atctatgtat 541 gaatatagaa tatgctcaaa tatttagttt tgtataagga agttctaaat aattttccat 601 ggcgtgcttt taccaaagta ctgaacaggg agattattta tcattttttt tcttactctt 661 cgagttgaag tcgattgtag gtgatgacat gaacattaaa taaaatgtac ttgaacaaat 721 cagaccagca aacacccaca tcgcaatggt actgggtttt ctcgttattc tgtggaagaa 781 ttcaattacc ggattattgt taaaaaagcc cattagaagt atcctaacga ttgattaaag 841 taaattgtta ttcacgctac tatcgcaatc gtccaactga accttattct ttattgtata 901 taatctctgc cgagaaattc ttatacttat gtatctggga attcaccagg ttagaatacc 961 ttcccctcct caatttctaa tttttttttc tacccttatt tttctctctt gcttcatcgc 1021 tctcaaagct aaaagctaat acagcaaaca aaagtgttat taggaattca attgagtcgt 1081 gaaagaaaaa taatattcgt cttacttgag gactcacttt ggtcaattga gattaaagtg 1141 tagtctgcac cactgaattc ctcgttttcc tttaaagatc cgggatggct tatctgcggt 1201 acccggactg aaaacgacgt tttagctaaa tattgaacac gttaaaaata agttcaaatt 1261 tcgaggcaga tgaatttgct gagggcactg ttcaagagtg agatatacgc gcaagttgag 1321 gcataatcag taagcaatga atccgtttag aatcttaggt atgttactat ttggagtttc 1381 atgaggcttt tcccgccgta
gatcgaaccc aatcttacta acagagaaag ggctttttcc 1441 cgaccatcaa gacaggtgat ttatcacatc taaccagtat actgatcctg attcataata 1501 tcaagaccac aaggtacatt gaaggtattt ctttcaagac ccaaacgttg tacgctttgg 1561 ttttcataac acgatacttg gatctcttga cttttcactg ggtatcccta tacaatgctc 1621 taatgaaaat atttttcatt gtatctaccg cttacattgt agtgctatta caagggtcta 1681 aaagaaccaa caccattgcg tataatgaaa tgcttatgca tgataccttt aagatccagc 1741 atttactaat tgggagtgct ctaatgagtg tttttttcca tcacaagttc acttttcttg 1801 aattagcatg gagtttttct gtatggttgg agagtgtggc tattctacct caattgtaca 1861 tgctatctaa gggtgggaag actagaagtc taactgttca ttatattttt gccatgggat 1921 tatacagagc attgtatatt cctaactgga tttggaggta cagcacggaa gataaaaaat 1981 tggacaagat tgccttcttc gcgggacttt tgcaaactct gttgtactct gatttctttt 2041 acatttacta cactaaagtc atcagaggaa agggtttcaa actgccaaaa taaaaaaaaa 2101 acctgtttta aatagtgttc gcagcaatat atacaaattt caatatatca atgctatacc 2161 aaaagaaaaa aaaaatacat aaaattgatg ttgatattgg tctatttctt tttatactat 2221 gatatgtatg cattaatctc tttttagctc ataaaattct ttccttacac catccttggt 2281 gacaataagg atttccagcc catcacccac ttgtatatgt ctttctgtag cagaagtgaa 2341 cgagtctctc accagtttga tgacttcttc cacggacaag tatttcaaag gctttttgac 2401 tttaccgttt gtacctggct catattgaat ttttgaaatt aacctgattg tccaaaaatg 2461 gcatgatcaa tgatgccgca gcaccacctg ctctacactg ttctctttcg taggagccaa 2521 actgggtcga acgaatagac agcgccctta ccatcttcgt caagacccgc aatgatcgta 2581 tgaacgtagt aagggaaaaa cctcttcccg tacagaagat gttgaatgtt ccttgctgca 2641 gagtttatag atagtttttt gtcgttgtgg tcgaaatggt accattttac actatttttg 2701 aatcttttta ctaaagcgtc gccgtctgct gcaaatccat tcgccgacat gactatgtta 2761 tcaccacaat caaaaacctt gggttcataa cgagaattaa ttgagtaatc ggtgatattt 2821 cttgtatcgc ctgctaacac tgcgaaatct tcacctgcaa tgcccaggat tgtaccacca 2881 ttatcaccgt aaggattgaa ttgatgttca atgggtgtat ttgacgcctc cgaagagtat 2941 tctgatgcaa tagtggccat attttgttta actttatagt tcaatagtct tggctactct 3001 ctttccaact cagttcacct tgttattata ccgcttgttt ttgccaccct ttgagtttct 3061 cgatccttta agttggaaaa gataaaaaca 
attatcgctc cgtacgaaca ccaaatctct 3121 gcgacgcaga gacggcaata caaaatcaga aaaaaaaaac aatttatcct atgcttcact 3181 agtcacaggc tctacgaaac attctatcgc ctcactgcct ttacatcact taagcaagaa 3241 gttttaaata ttacggcaat tcctttgcag caataccttt actttatttc gagagaaaat 3301 agatcgtaaa atgcctactc caagctt // LOCUS YSCLYS2A 4980 bp ds-DNA PLN 14-AUG-1990 DEFINITION S.cerevisiae alpha-aminoadipate reductase (LYS2) gene, complete cds. ACCESSION M36287 M14967 M14968 KEYWORDS LYS2 gene; alpha-aminoadipate reductase. SOURCE S.cerevisiae (strain S288C) DNA, clone pDP6. ORGANISM Saccharomyces cerevisiae Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes; Endomycetales; Saccharomycetaceae. REFERENCE 1 (bases 1 to 746 and 3921 to 4980) AUTHORS Fleig,U.N., Pridmore,R.D. and Philippsen,P. TITLE Construction of LYS2 cartridges for use in genetic manipulations of Saccharomyces cerevisiae JOURNAL Gene 46, 237-245 (1986) STANDARD simple staff_entry REFERENCE 2 (bases 1 to 4632) AUTHORS Morris,M.E. and Jinks-Robertson,S. TITLE Nucleotide sequence of the LYS2 gene of Saccharomyces cerevisiae JOURNAL Unpublished (1990) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by S.Jinks-Robertson, 10-JUL-1990. Emory University Dept. of Biology 1510 Clifton Road N.E. Atlanta, GA 30322 FEATURES from to/span description pept 297 4475 alpha-aminoadipate reductase (LYS2; EC 1.2.1.31) BASE COUNT 1523 a 942 c 1026 g 1489 t ORIGIN Chromosome II. 
1 tctagaggca tcgcacagtt ttagcgagga aaactcttca atagttttgc cagcggaatt 61 ccacttgcaa ttacataaaa aattccggcg gtttttcgcg tgtgactcaa tgtcgaaata 121 cctgcctaat gaacatgaac atcgcccaaa tgtatttgaa gacccgctgg gagaagttca 181 agatatataa gtaacaagca gccaatagta taaaaaaaaa tctgagttta ttacctttcc 241 tggaatttca gtgaaaaact gctaattata gagagatatc acagagttac tcactaatga 301 ctaacgaaaa ggtctggata gagaagttgg ataatccaac tctttcagtg ttaccacatg 361 actttttacg cccacaacaa gaaccttata cgaaacaagc tacatattcg ttacagctac 421 ctcagctcga tgtgcctcat gatagttttt ctaacaaata cgctgtcgct ttgagtgtat 481 gggctgcatt gatatataga gtaaccggtg acgatgatat tgttctttat attgcgaata 541 acaaaatctt aagattcaat attcaaccaa cgtggtcatt taatgagctg tattctacaa 601 ttaacaatga gttgaacaag ctcaattcta ttgaggccaa tttttccttt gacgagctag 661 ctgaaaaaat tcaaagttgc caagatctgg aaaggacccc tcagttgttc cgtttggcct 721 ttttggaaaa ccaagatttc aaattagacg agttcaagca tcatttagtg gactttgctt 781 tgaatttgga taccagtaat aatgcgcatg ttttgaactt aatttataac agcttactgt 841 attcgaatga aagagtaacc attgttgcgg accaatttac tcaatatttg actgctgcgc 901 taagcgatcc atccaattgc ataactaaaa tctctctgat caccgcatca tccaaggata 961 gtttacctga tccaactaag aacttgggct ggtgcgattt cgtggggtgt attcacgaca 1021 ttttccagga caatgctgaa gccttcccag agagaacctg tgttgtggag actccaacac 1081 taaattccga caagtcccgt tctttcactt atcgcgacat caaccgcact tctaacatag 1141 ttgcccatta tttgattaaa acaggtatca aaagaggtga tgtagtgatg atctattctt 1201 ctaggggtgt ggatttgatg gtatgtgtga tgggtgtctt gaaagccggc gcaacctttt 1261 cagttatcga ccctgcatat cccccagcca gacaaaccat ttacttaggt gttgctaaac 1321 cacgtgggtt gattgttatt agagctgctg gacaattgga tcaactagta gaagattaca 1381 tcaatgatga attggagatt gtttcaagaa tcaattccat cgctattcaa gaaaatggta 1441 ccattgaagg tggcaaattg gacaatggcg aggatgtttt ggctccatat gatcactaca 1501 aagacaccag aacaggtgtt gtagttggac cagattccaa cccaacccta tctttcacat 1561 ctggttccga aggtattcct aagggtgttc ttggtagaca tttttccttg gcttattatt 1621 tcaattggat gtccaaaagg ttcaacttaa cagaaaatga taaattcaca atgctgagcg 1681 gtattgcaca tgatccaatt 
caaagagata tgtttacacc attattttta ggtgcccaat 1741 tgtatgtccc tactcaagat gatattggta caccgggccg tttagcggaa tggatgagta 1801 agtatggttg cacagttacc catttaacac ctgccatggg tcaattactt actgcccaag 1861 ctactacacc attccctaag ttacatcatg cgttctttgt gggtgacatt ttaacaaaac 1921 gtgattgtct gaggttacaa accttggcag aaaattgccg tattgttaat atgtacggta 1981 ccactgaaac acagcgtgca gtttcttatt tcgaagttaa atcaaaaaat gacgatccaa 2041 actttttgaa aaaattgaaa gatgtcatgc ctgctggtaa aggtatgttg aacgttcagc 2101 tactagttgt taacaggaac gatcgtactc aaatatgtgg tattggcgaa ataggtgaga 2161 tttatgttcg tgcaggtggt ttggccgaag gttatagagg attaccagaa ttgaataaag 2221 aaaaatttgt gaacaactgg tttgttgaaa aagatcactg gaattatttg gataaggata 2281 atggtgaacc ttggagacaa ttctggttag gtccaagaga tagattgtac agaacgggtg 2341 atttaggtcg ttatctacca aacggtgact gtgaatgttg cggtagggct gatgatcaag 2401 ttaaaattcg tgggttcaga atcgaattag gagaaataga tacgcacatt tcccaacatc 2461 cattggtaag agaaaacatt actttagttc gcaaaaatgc cgacaatgag ccaacattga 2521 tcacatttat ggtcccaaga tttgacaagc cagatgactt gtctaagttc caaagtgatg 2581 ttccaaagga ggttgaaact gaccctatag ttaagggctt aatcggttac catcttttat 2641 ccaaggacat caggactttc ttaaagaaaa gattggctag ctatgctatg ccttccttga 2701 ttgtggttat ggataaacta ccattgaatc caaatggtaa agttgataag cctaaacttc 2761 aattcccaac tcccaagcaa ttaaatttgg tagctgaaaa tacagtttct gaaactgacg 2821 actctcagtt taccaatgtt gagcgcgagg ttagagactt atggttaagt atattaccta 2881 ccaagccagc atctgtatca ccagatgatt cgtttttcga tttaggtggt cattctatct 2941 tggctaccaa aatgattttt accttaaaga aaaagctgca agttgattta ccattgggca 3001 caattttcaa gtatccaacg ataaaggcct ttgccgcgga aattgacaga attaaatcat 3061 cgggtggatc atctcaaggt gaggtcgtcg aaaatgtcac tgcaaattat gcggaagacg 3121 ccaagaaatt ggttgagacg ctaccaagtt cgtacccctc tcgagaatat tttgttgaac 3181 ctaatagtgc cgaaggaaaa acaacaatta atgtgtttgt taccggtgtc acaggatttc 3241 tgggctccta catccttgca gatttgttag gacgttctcc aaagaactac agtttcaaag 3301 tgtttgccca cgtcagggcc aaggatgaag aagctgcatt tgcaagatta caaaaggcag 3361 gtatcaccta tggtacttgg aacgaaaaat 
ttgcctcaaa tattaaagtt gtattaggcg 3421 atttatctaa aagccaattt ggtctttcag atgagaagtg gatggatttg gcaaacacag 3481 ttgatataat tatccataat ggtgcgttag ttcactgggt ttatccatat gccaaattga 3541 gggatccaaa tgttatttca actatcaatg ttatgagctt agccgccgtc ggcaagccaa 3601 agttctttga ctttgtttcc tccacttcta ctcttgacac tgaatactac tttaatttgt 3661 cagataaact tgttagcgaa gggaagccag gcattttaga atcagacgat ttaatgaact 3721 ctgcaagcgg gctcactggt ggatatggtc agtccaaatg ggctgctgag tacatcatta 3781 gacgtgcagg tgaaaggggc ctacgtgggt gtattgtcag accaggttac gtaacaggtg 3841 cctctgccaa tggttcttca aacacagatg atttcttatt gagatttttg aaaggttcag 3901 tccaattagg taagattcca gatatcgaaa attccgtgaa tatggttcca gtagatcatg 3961 ttgctcgtgt tgttgttgct acgtctttga atcctcccaa agaaaatgaa ttggccgttg 4021 ctcaagtaac gggtcaccca agaatattat tcaaagacta cttgtatact ttacacgatt 4081 atggttacga tgtcgaaatc gaaagctatt ctaaatggaa gaaatcattg gaggcgtctg 4141 ttattgacag gaatgaagaa aatgcgttgt atcctttgct acacatggtc ttagacaact 4201 tacctgaaag taccaaagct ccggaactag acgataggaa cgccgtggca tctttaaaga 4261 aagacaccgc atggacaggt gttgattggt ctaatggaat aggtgttact ccagaagagg 4321 ttggtatata tattgcattt ttaaacaagg ttggattttt acctccacca actcataatg 4381 acaaacttcc actgccaagt atagaactaa ctcaagcgca aataagtcta gttgcttcag 4441 gtgctggtgc tcgtggaagc tccgcagcag cttaaggttg agcattacgt atgatatgtc 4501 catgtacaat aattaaatat gaattaggag aaagacttag cttcttttcg ggtgatgtca 4561 cttaaaaact ccgagaataa tatataataa gagaataaaa tattagttat tgaataagaa 4621 ctgtaaatca gctggcgtta gtctgctaat ggcagcttca tcttggttta ttgtagcatg 4681 aatcatattt gccttttttt cctgtaattc aatgattctt gcttctatac tatcctcaat 4741 gcaaaacctt gtgatcttca caggtcgata ctgaccaatt ctatgaactc tatcaccact 4801 ttgccattca acactagggt tccaccatgg gtctaaaatg aatacttgcg aagcttcaca 4861 aagattcaaa gcaacaccgc ccgcctttaa actgaccaag aaaacctcgc attgaatgtt 4921 gttcatgaaa tacttgatgg tttcatctct ttgcgtcggt gacatactac cctgaagctt // LOCUS YSCMTATPSA 2820 bp ds-DNA ORG 14-AUG-1990 DEFINITION Yeast mitochondrial oxi3 gene, 3' end, ATPase subunit 3 
(aap1) and ATPase subunit 6 (oli2) gene, complete cds. ACCESSION M36379 KEYWORDS oli2 region. SOURCE Yeast (strain J69-1B) mitochondrial DNA, clone. ORGANISM Mitochondrion Saccharomyces cerevisiae Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes; Endomycetales; Saccharomycetaceae; Saccharomyces cerevisiae. REFERENCE 1 (bases 1 to 2820) AUTHORS Novitski,C.E., Macreadie,I.G., Maxwell,R.J., Lukins,H.B., Linnane,A.W. and Nagley,P. TITLE Biogenesis of mitochondria: Genetic and molecular analysis of the oli2 region of mitochondrial DNA in Saccharomyces cerevisiae JOURNAL Curr. Genet. 8, 135-146 (1984) STANDARD simple staff_entry FEATURES from to/span description pept < 1 63 cytochrome c oxidase subunit I (oxi3) pept 1019 1165 ATPase subunit 3 (aap1) pept 1872 2651 ATPase subunit 6 (oli2) binding 701 709 ribosomal binding site C (put.) binding 1814 1821 ribosomal binding site A (put.) binding 1835 1845 ribosomal binding site B (put.) BASE COUNT 1159 a 200 c 183 g 1278 t ORIGIN 1 gaattcttat taacttctcc accagctgta cactcattta atacaccagc tgtacaatct 61 taagttataa tatttaatta tttacttaat aattaaaaaa aaaagtaaat attatatcta 121 aaacttaata atataataat aatattctta taaaaaaaaa tatataaaaa aaaaatatat 181 aaaatttatt aaaatatctt aatatattat cctttttggg aactatataa ataaatacta 241 atataatcct attatatata tatatatata aaataatata tatatataat taatataaat 301 aatatttata attatttttt taataatata tataatttaa tatattaatt aatattatat 361 aattattaaa tatattataa tattattatt attttataat aaaaatattt ttaatactaa 421 ttattattta ttatttataa atatataaat agtatgttta atattattaa tactaaaaaa 481 aagtataatt ataattagga tctacaatac atttatctga taaatattaa tattaatatt 541 aatatttata ttaataaagg gattaaatta attgtatcca atttaattaa attatagata 601 tattatttat aatattaata tattgtttta ttaaaaaggt aaaaatagtt tttattttat 661 atataaatat aggatataaa taaatatatt atatatatat taagaatata tttatatttt 721 acatataatt atttataata taaatatctc cgcaaagccg gattaatgta attatttaat 781 aattttattt aataatttat taaaataaat atttacattt aataatattt atattatgtc 841 agttatttta tattaatgtt 
taatctatta taatattttt ttttataaat atattattta 901 tattaataaa tatatatatt atttttataa tatatatata tttttattaa atatttatta 961 aatatttatt aaattattat aatgttgtta ttaatcttat taaaaaatat atataaaaat 1021 gccacaatta gttccatttt attttatgaa tcaattaata tatggtttct tattaatgat 1081 tctattatta attttattct cacaattctt tttacctatg atcttaagat tatatgtatc 1141 tagattattt atttctaaat tataatatat attattaatt tatttattca tataaatatt 1201 attattatat ataaatatta ataatattta tacttattta ataataataa taaaataaaa 1261 aaaataatta taatttaata tatttaatat atttccttac agactatata tttatatata 1321 tatattaaat acaatttaat ttaatttaat tatgttattt attaaataaa gttatattat 1381 aatataataa caatattata tattattata taattataat atattttaat ataattatca 1441 aaagaaataa taaaaaatat taataagaat ataatttaat aattattaaa aaaattctta 1501 tttatagtcc ggtccgcccc cgcggggcgg accccaaagg aggagtaata aaaattatta 1561 aatacaaata ttatatatat ataattcatt atatatatat atatataata attaatctta 1621 ttttttatat atttatttat atatctattt atatttatat tttatatata tttatttata 1681 tatctaaggg gtttggtccc tcccccgtaa gtataagtat acgggggagg gtccctcact 1741 atttatattt atatttatat ttatatttat atttatattt tatatatttt atatatttat 1801 aaataaagta taataagata taattatgat taattattta taagttatag ttttataaat 1861 ttataattat tatgtttaat ttattaaata catatattac atcaccatta gatcaatttg 1921 agattagact attatttggt ttacaatcat catttattga tttaagttgt ttaaatttaa 1981 caacattttc attatatact attattgtat tattagttat tacaagttta tatctattaa 2041 ctaataataa taataaaatt attggttcaa gatgattaat ttcacaagaa gctatttatg 2101 atactattat aaatatgctt aaaggacaaa ttggaggtaa aaattgaggt ttatatttcc 2161 ctatgatctt tacattattt atgtttattt ttattgctaa tttaattagt atgattccat 2221 attcatttgc attatcagct catttagtat ttattatctc tttaagtatt gttatttgat 2281 taggtaatac tattttaggt ttatataaac atggttgagt attcttctca ttattcgtac 2341 ctgctggtac accattacca ttagtacctt tattagttat tattgaaact ttatcttata 2401 ttgctagagc tatttcatta ggtttaagat taggttctaa tatcttagct ggtcatttat 2461 taatggttat tttagctggt ttactattta attttatgtt aattaattta tttactttag 2521 tattcggttt tgtaccttta gctatgatct 
tagctattat gattttagaa ttcgctattg 2581 gtattatcca atcttatgtt tgacttatct taacagcatc atacttaaaa gatacattat 2641 acttacatta aattataaaa taaaaattat ataaaaaaaa tataataata ataataatat 2701 aaaataaaaa ataaaaaata aaacaatgaa aaaacaaaat ttaaattcta ttttattaat 2761 gtatattaat tatattatta attattttaa taatattcat aaaaatcaat taaaaaaaga // LOCUS YSCSPO7A 1846 bp ds-DNA PLN 14-AUG-1990 DEFINITION S.cerevisiae SPO7 gene, complete cds, and FUN14 and FUN37 genes, 5' end. ACCESSION M36073 KEYWORDS antimutator phenotype; premeiotic DNA replication. SOURCE S.cerevisiae (strain AB972) DNA, clone lambda-G12b and pLF31. ORGANISM Saccharomyces cerevisiae Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes; Endomycetales; Saccharomycetaceae. REFERENCE 1 (bases 1 to 1846) AUTHORS Whyte,W., Koepp,L.H., Lamb,J., Crowley,J.C. and Kaback,D.B. TITLE Molecular cloning of chi I DNA from S.cerevisiae JOURNAL Gene (1990) In press STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by D.B.Kaback, 05-JUL-1990. FEATURES from to/span description pept 564 1343 SPO7 protein pept 1624 > 1832 FUN14 protein pept 375 < 1 (c) FUN37 protein BASE COUNT 556 a 308 c 415 g 567 t ORIGIN Chromosome I. 
1 gtcgactgtg gtgttgtcac tactcaacgt attcgcacta ctaacactga aattgaggtt 61 tggttgcaat tgtctgtatg tttcggtggc atcttgtaat gggatatcag tagagttgcg 121 catgaatttc tccaattgct gtgcatcgga gtataaataa ctcagagaac catttatcct 181 ggacctcgta gaaaaatcta aagaattgaa tgtattggga gtagatttgt tggaaatttg 241 caggtgtatt gctgagggaa ttcggaaatc taataatgtt ctcgatgtgg ccgttatatc 301 ctcgtagcta ttttgcgtac tccaatgggt gctctgataa aatgccctta gtacttggtc 361 catatagggt agcatcaaga tcggtcttct ctgttcgtgt ctttttccta acgtatattt 421 gctttgtttc ttcactcaac aataaagtca aagtaaaatt aaatactaat tattcttaaa 481 agggaagatg cgaaatttag cgaaaatcta ttgattatac acacaaagga agaaaggtag 541 tggaaagcta aataaaggag gtcatggagc cagagagcat aggcgatgtg gggaaccatg 601 cccaggatga tagtgccagt atagtgtccg ggcctcgcag gcgttctact agcaagacat 661 ccagtgcgaa gaatatacgg aactccagta atatctctcc agcatcgatg attttcagga 721 atttgttgat actggaggat gatttaagac gccaagctca cgaacaaaag atactgaagt 781 ggcaattcac tttgttctta gcgtctatgg ccggtgtagg cgcatttacc ttctacgaac 841 tttatttcac ttcagattat gtcaagggcc tccatagggt tattttgcaa ttcactcttt 901 ctttcatttc cattactgta gttctttttc atatcagtgg acaatataga agaactatcg 961 tcattccaag aagatttttt acctctacta ataaagggat taggcagttt aatgtgaagc 1021 tagttaaagt acagtctacg tgggacgaga aatacacaga ttcagtaaga tttgtgagtc 1081 gaacaattgc ttattgtaat atttattgtt tgaaaaaatt tctgtggctt aaagacgata 1141 atgccattgt gaaattttgg aaaagtgtca cgatacaatc ccaaccgagg atcggagctg 1201 tggatgtgaa attagtcctc aaccccagag catttagtgc agagattaga gaaggatggg 1261 agatttatag agacgagttt tgggccaggg aaggtgctag aagacgcaaa caagcgcacg 1321 aactccgacc taaatcagaa tgaaagagtt ggagggcttc ttccttcgaa taagaggtca 1381 tatttaccta tgtaaaattg taaccatcta tgttcacaca taaattatat tttatacatt 1441 attagaagtg aagctgttgt gtcgtgaaaa ttttacaaat ccgtcatttc atatttaagt 1501 tttccaacaa gtgctagaaa acctaggggt tgttgaaatt ggttaaacaa ggcatcttat 1561 tatacataca acagcataac gctagagggg caagaaggaa gaacttaaaa taataggtgt 1621 aaaatgactt tggcttttaa tatgcaacgg ttggtgtttc gtaatttgaa tgttgggaag 1681 cgcatgttca agaacgtccc 
cttatggagg tttaatgtcg ccaataaatt aggaaagccc 1741 ttaactcgct ctgtagggtt aggcggtgct ggcatagttg ctggtggctt ttacttgatg 1801 aatcgccagc cttctaagtt gatattcaat gattctttag gggcag // LOCUS YSKERD2A 1248 bp ds-DNA PLN 14-AUG-1990 DEFINITION K.lactis ER lumen protein retaining receptor (ERD2) gene, complete cds. ACCESSION M34844 KEYWORDS ER lumen protein retaining receptor; HDEL receptor. SOURCE K.lactis (SEY2102) DNA. ORGANISM Kluyveromyces lactis Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes; Endomycetales; Saccharomycetaceae. REFERENCE 1 (bases 1 to 1248) AUTHORS Lewis,M.J., Sweet,D.J. and Pelham,H.R.B. TITLE The ERD2 gene determines the specificity of the luminal ER protein retention system JOURNAL Cell 61, 1359-1363 (1990) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by M.Lewis, 06-JUN-1990, for release after publication. FEATURES from to/span description pept 390 414 ER lumen protein retaining receptor, exon 1 (ERD2) 572 1206 ER lumen protein retaining receptor, exon 2 IVS 415 571 ERD2 intron A site 415 420 consensus splice donor site 528 534 branchpoint sequence site 569 571 acceptor sequence BASE COUNT 368 a 235 c 202 g 443 t ORIGIN 1 gaattcttca ctcttcaccc ctcctctcta acgagactcg gataaacaga ctgtaacgat 61 gacgacgacg acgatgatga taatgattat gatgatgatg attgaatgtg ccatttgacg 121 tttttgacct gtaaacccct tttgtaaata gcctctgtcc tttttaagta cttatgcggt 181 gtctgttaat acaatctttg tggctcacgt gatttattat ttttctcaat gtgggccaaa 241 attgcaatct cttaactctg aaaaataaaa caaggcagta ctaatcgctt cttaaacact 301 tgttctttaa agattgaaat aaactaccga catttgtcgt tctaagttaa attcaaatca 361 aaacagcttt caattgagtg tgtcacatta tgttgaacgt tttcagaata gcaggtatgt 421 gaaaagaact gattttatat cattgttgga cgtcaacctc gatcggaacg ttatatcttg 481 gttctgaatg atgatttcca tcggcttatc tctactggca gcatctttac taacattgtt 541 attaccaact attttgtaat cgagtgttta ggtgatttct ctcatttggc tagtatcatc 601 attttgatac aatcgatcac aacatctaac tcagttgatg gtatctcatt gaaaactcaa 661 
ctgctataca ccttggtctt tatcacacgt tatttgaacc tatttaccaa atggacctcc 721 ttgtacaact tcttaatgaa aattgttttc atttcatctt cggtttacgt cattgtgtta 781 atgcgccaac aaaaatttaa aaaccctgtc gcatatcaag acatgatcac cagagatcaa 841 tttaaaatca agtttttaat agtaccatgc attctcctag gattaatttt caattatcgt 901 ttcagtttta tacaaatatg ctggtccttc tctctatggt tggaaagtgt tgcaatcctt 961 cctcaattgt ttatgttgac taaaacaggt aaagcaaaac aattgacatc tcattatatt 1021 tttgcattgg ggttataccg tgccttgtac attccaaatt ggatatggag gtattatacc 1081 gaagaaagat tcgataaatt atcagttttc accggtgtga ttcaaacttt agtgtactct 1141 gatttcttct acatctatta tcagaaggtt atcaaactcg gtggagattt agaattacca 1201 caatagatga caggatttgc atctccactt gcagtcagtt cgaagctt // LOCUS CELCKINII 3180 bp ds-DNA INV 14-AUG-1990 DEFINITION C.elegans casein kinase II-alpha gene, complete cds. ACCESSION J05274 KEYWORDS casein kinase II-alpha. SOURCE C.elegans DNA. ORGANISM Caenorhabditis elegans Eukaryota; Animalia; Metazoa; Nemata; Secernentea; Spiruria; Spirurida; Spirurina; Filarioidea; Filariidae. REFERENCE 1 (bases 1 to 3180) AUTHORS Hu,E. and Rubin,C.S. TITLE Casein kinase II from Caenorhabditis elegans: Properties and developmental regulation of the enzyme; Cloning and sequence analyses of cDNA and the gene for the catalytic subunit JOURNAL J. Biol. Chem. 
265, 5072-5080 (1990) STANDARD simple staff_review FEATURES from to/span description pept 250 288 casein kinase II-alpha, exon 1 533 703 casein kinase II-alpha, exon 2 763 890 casein kinase II-alpha, exon 3 940 1024 casein kinase II-alpha, exon 4 1084 1380 casein kinase II-alpha, exon 5 1838 1962 casein kinase II-alpha, exon 6 2018 2099 casein kinase II-alpha, exon 7 2301 2456 casein kinase II-alpha, exon 8 pre-msg 211 3073 casein kinase II-alpha mRNA and introns IVS 289 532 casein kinase II-alpha intron A IVS 704 762 casein kinase II-alpha intron B IVS 891 939 casein kinase II-alpha intron C IVS 1025 1083 casein kinase II-alpha intron D IVS 1381 1837 casein kinase II-alpha intron E IVS 1963 2017 casein kinase II-alpha intron F IVS 2100 2300 casein kinase II-alpha intron G BASE COUNT 974 a 580 c 544 g 1082 t ORIGIN 1 actttcgcag cgggaatatc tcaccacgca cgagttcaca tttggttttt tctctgcgct 61 ctctcgcatt tgcgtactct tttcatgttc tggcatgtat tttcaaaatt attcgggttt 121 tatcagtttt cccgagattt tactatttct actgaaaatt tcgttctaat aattcaatta 181 taacagccgt tttgtcaaaa cttgctcaaa aataacgtgt tacttgtttc agatcgttat 241 tcttgggaaa tgcctccgat tcctagccgg gcgagggtct acgccgaggt gagcaagttt 301 gtatagccag gagttaatta ttgaacagaa tatattgctc atacataact cacaatgacg 361 gaattaacgt tatccgtttt agttgtggaa tactgtgaaa cactgtacgc cacatttgtt 421 agattttata aatgttcata ggaatttata gtctcgtaaa gaaaaactat ttgtaaaagt 481 cactaaatta tgtttttatg agtttatata ttataaatca caattgtttc aggtgaatcc 541 gtcgagacca cgcgagtact gggactatga ggctcacatg atcgaatggg gacaaatcga 601 tgattatcag ctggtacgca agttgggacg tggaaaatat tcggaggttt ttgaaggatt 661 caagatgagc actgacgaga aagttgttgt caagattctc aaggtaagtt ttacaaaaat 721 aattgtaata aagctttcaa aatgtaattt gtttgttttc agccagtcaa aaagaagaag 781 atcaagcgtg agattaaaat tttggagaat cttcgtggcg ggacaaacat catcaccctc 841 ctcgacgtcg tcaaggatcc tatttcgaga actccggctc ttattttcga gttggtttac 901 tatttgaaga tttttagaat taataatttc tatttgaagg catgttaaca attcggactt 961 caagcaattg tatcaaaccc tcagtgacta tgatatccgt 
tactacctct acgagctctt 1021 gaaggtactt ttttcgtctg tccattaatt tttgaattca taacgtttca atttattctc 1081 caggctctcg acttctgcca ttctcaagga attatgcaca gagatgttaa gccgcacaat 1141 gtgatgattg acgccgaaaa acgggaactt cgattaatag attggggatt agccgaattc 1201 tatcatccaa gacaggacta taatgttcgt gttgcttcta gatatttcaa gggacccgag 1261 cttcttgtcg attatcaatg ttacgattat tcgttggata tgtggagttt gggatgcatg 1321 ttggctagta tgatcttcag gaaggaacca ttcttccacg gccatgataa ctacgatcag 1381 gtaagcactt taagctgaag tagcaaagaa ccgcaaatcc acaaatttca ttaaaatcca 1441 atattggaat atagaaaaca tttgatttat ctgattatgg cttttatttc agtgagaaca 1501 atggggaaaa acgcttaaaa aattaccgga aagtttaaaa tttccgaaaa aaaccgaaaa 1561 ttttccaatt ttgagcattt ttcaattgtt ccagctgcaa aaaactattt tcagataaag 1621 aaattatttt tttgaattcc atagctaaaa ttttactgat aagtttggct tattccgtag 1681 aatttcttgc cggcgattga ttcgcacccc tgagctgaaa tttaaatgaa taaataatat 1741 tgtcagggta tttattaaat attcatcaat aatttttatg atttttaata tttataaatt 1801 cacgaaatct gattttctta taaaaagttt tttgcagcta gtccgcattg caaaggtatt 1861 gggaactgat gagctttacg agtacattgc gagatatcac attgatctcg atccacgctt 1921 caacgatatt ctcgggcgac actcaagaaa acgctgggaa aggtatattt tgatcgaagc 1981 gaagtttcat gtttttctta aaattcattt gttgtagatt tatccacgct gaaaatcaac 2041 atcttgtcac cccagaggct cttgatttcc ttgataaact tcttcgctac gatcacgcgg 2101 taagaatcgg aaaaaagaga aattacacca aattagttac tgaaaaccag tttaacaaga 2161 acacagagaa ataaaaacaa ttttcacttg ttttattgaa aaaaaagaca tttgatactc 2221 gaaactaaac tataaaatat cttttatttg ttaaattaat ttatttccag aatcttttaa 2281 ttaaatttca ccatattcag gaacgcctaa ctgctcaaga ggccatgggt cacgaatact 2341 tccgcccagt tgttgaagca catgctcgtg ccaatggaac tgaacaagct gacggacagg 2401 gtgcatcgaa ttccgcttct tctcaatcct ccgatgccaa aattgacggc gcttaaaatt 2461 tctttctatt ttttttttaa ttttcctgtt gactatttcc ttttttatta atttccacac 2521 cactcctctc catcattatt cctttgctct aaaaattggc taagatatta aatcttcccc 2581 gcttctccga aaatttccct ctacatacca caaaagtgcc taatcttttc gtgagaaacg 2641 ttctttgcaa attctatttt taggattttt ttgtgtaatt catgcctctg 
aaattccgtt 2701 tcaattttaa atcttttttg tcccatttca tattgttctc cttcttctcc cctttcctct 2761 tcctcaaaca accacaatca tcatttttat aaatcgaatc acagttcgtg agaaaaaaaa 2821 ctgttacttg tatatcaata ttgtctttgg ttttgaaaaa cctactattg ttcctatgtt 2881 gagttgtgaa ctatgttcac ttttagagtt ttagatttaa aatgctcaac ggacgagtgc 2941 tccttcgatc gattgctcgc cggacaatgg catcgaaaat tgagccagga tctcgcagcc 3001 gatttatttg gatgtccagg ctaccgcgcc aatggtaatt aaatggaaat aataaagcat 3061 gattaaaaat aatataaaaa acttgtatct ggaaaaaatt aacgaaactt ttcagataca 3121 tttagatttc gaatgagttt ttttctaact aggcacagtt tagctagtag gcattctaaa // LOCUS DMORGX 1650 bp ds-DNA BCT 14-AUG-1990 DEFINITION D.mobilis ribosomal 16S RNA gene. ACCESSION M36474 KEYWORDS ribosomal 16S RNA. SOURCE D.mobilis DNA. ORGANISM Desulfurococcus mobilis Prokaryota; Bacteria; Mendosicutes; Archaeobacteria; Thermoproteales; Desulfurococcaceae. REFERENCE 1 (bases 1 to 1650) AUTHORS Kjems,J., Garrett,R.A. and Ansorge,W. TITLE The sequence of the 16S RNA gene and its flanking region from the archaebacterium Desulfurococcus mobilis JOURNAL Syst. Appl. Microbiol. 
9, 22-28 (1987) STANDARD simple staff_review FEATURES from to/span description rRNA 71 1565 16S ribosomal RNA BASE COUNT 323 a 492 c 602 g 233 t ORIGIN 1 ccgacgaggg ggagggagcc acttaagccg agcgctccag aaaccccggc cccaacccca 61 agcggccgta actccggttg atcctgccgg tcccgaccgc tatcggggtg gggctaagcc 121 atgggagtcg cacgctccgc cgctgcgggg cgtggcggac ggctgagtaa cacgtggcta 181 acctaccctc gggaggggga taacaccggg aaactggtgc taatccccca taggggagga 241 ggcctggaag ggttcctccc cgaaagggtg tggcaggggt taacgctgct acaccgcccg 301 aggatggggc tacggcccat taggttgttg gcggggtaac ggcccgccaa gccgataatg 361 ggtaggggcc gtgagagcgg gagcccccag atgggcactg agacaagggc ccaggcccta 421 cggggcgcac caggcgcgaa acctccgcaa tgcgggaaac cgtgacgggg ccaccccgag 481 tgccccctta cgggggcttt tccccgctgt aggaaggcgg gggaataagc ggggggcaag 541 tctggtgtca gccgccgcgg taataccagc cccgcgagtg gtcgggacga ttattgggcc 601 taaagcgccc gtagccggcc cggcaagtcc cctcctaaat tcccgggctc aacccgggga 661 ctggagggga tactgccggg ctagggggtg ggagaggccg agggtactcc cggggtaggg 721 gcgaaatcct ataatcccgg gaggaccacc agtggcgaag gcgctcggct ggaacacgcc 781 cgacggtgag gggcgaaagc cgggggagcg aaccggatta gatacccggg tagtcccggc 841 tgtaaacgat gcgggctagg tgttgggtgg gcttagagcc cacccagtgc cgcagggaag 901 ccgttaagcc cgccgcctgg ggagtacggc cgcaaggctg aaactcaaag gaattggcgg 961 gggagcacca caaggggtgg agcctgcggt tcaattggag tcaacgccgg gaatctcacc 1021 gggggagaca gcaggatgac ggccaggtta aaggccttgc ctgacgcgct gagaggaggt 1081 gcatggccgt cgccagctcg tgctgtgaag tgtccggtta agtccggaaa cgagcgagac 1141 ccccacccct agttgctacc cggggctacg gctccggggc acactagggg gactgccgcc 1201 gtttaaggcg gaggaaggag ggggccacgg caggtcagca tgccccgaac cccccgggct 1261 acacgcgggc tacaatggcg gggacagcgg gatccgaccc cgaaaggggg aggcaatccc 1321 tcaaaccccg ccgtggttgg gatcgagggc tgcaactcgc cctcgtgaac gaggaatccc 1381 tagtaaccgc gcgtcaacat cgcgcggtga atacgtccct gctccttgca cacaccgccc 1441 gtcgctccac ccgaggggag ggggagtgag gcccggcccc ttgggtcggg tcgaactccc 1501 cctccctgag gggggagaag tcgtaacaag gtagccgtac cggaaggtgc ggctggatca 1561 cctcctgcct 
caggccgggg ctggggccgg ggctggaatg cgctaaggct taagtggctc 1621 ccgcctcctc tattcattga tgcaacacgg // LOCUS ECOHBB221 3703 bp ds-DNA BCT 14-AUG-1990 DEFINITION E.coli beta-globin gene, complete cds. ACCESSION M34058 KEYWORDS beta-globin. SEGMENT 1 of 2 SOURCE E.coli (strain HB101) DNA, isolated from patient GD. ORGANISM Escherichia coli Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Facultatively anaerobic rods; Enterobacteriaceae. REFERENCE 1 (bases 1 to 3703) AUTHORS Thein,S.L., Hesketh,C., Taylor,P., Temperley,I.J., Hutchinson,R.M., Old,J.M., Wood,W.G., Clegg,J.B. and Weatherall,D.J. TITLE Molecular basis for dominantly inherited inclusion body beta-thalassemia JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable [or printed] sequence for [1] kindly submitted by S.L.Thein, 07-MAY-1990. Author address: S.L.Thein Institute of Molecular Medicine, MRC Molecular Haematology Unit John Radcliffe Hospital Oxford, OX3 9DU U.K. FEATURES from to/span description pept 1612 1703 beta-globin, exon 1 1834 2056 beta-globin, exon 2 2907 + 3015 beta-globin, exon 3 IVS 1704 1833 beta-globin intron A IVS 2057 2906 beta-globin intron B IVS 3016 > 3703 beta-globin intron C signal 1484 1492 CAAT box signal 1532 1538 TATA box BASE COUNT 1055 a 707 c 736 g 1205 t ORIGIN 1 tctatttatt tagcaataat agagaaagca tttaagagaa taaagcaatg gaaataagaa 61 atttgtaaat ttccttctga taactagaaa tagaggatcc agtttctttt ggttaaccta 121 aattttattt cattttattg ttttatttta ttttatttta ttttatttta ttttgtgtaa 181 tcgtagtttc agagtgttag agctgaaagg aagaagtagg agaaacatgc aaagtaaaag 241 tataacactt tccttactaa accgacatgg gtttccaggt aggggcagga ttcaggatga 301 ctgacagggc ccttagggaa cactgagacc ctacgctgac ctcataaatg cttgctacct 361 ttgctgtttt aattacatct tttaatagca ggaagcagaa ctctgcactt caaaagtttt 421 tcctcacctg aggagttaat ttagtacaag gggaaaaagt acagggggat gggagaaagg 481 cgatcacgtt gggaagctat agagaaagaa gagtaaattt tagtaaagga ggtttaaaca 541 aacaaaatat aaagagaaat aggaacttga atcaaggaaa tgattttaaa 
acgcagtatt 601 cttagtggac tagaggaaaa aaataatctg agccaagtag aagacctttt cccctcctac 661 ccctactttc taagtcacag aggctttttg ttcccccaga cactcttgca gattagtcca 721 ggcagaaaca gttagatgtc cccagttaac ctcctatttg acaccactga ttaccccatt 781 gatagtcaca ctttgggttg taagtgactt tttatttatt tgtatttttg actgcattaa 841 gaggtctcta gttttttatc tcttgtttcc caaaacctaa taagtaacta atgcacagag 901 cacattgatt tgtatttatt ctatttttag acataattta ttagcatgca tgagcaaatt 961 aagaaaaaca acaacaaatg aatgcatata tatgtatatg tatgtgtgta catatacaca 1021 tatatatata tatttttttt cttttcttac cagaaggttt taatccaaat aaggagaaga 1081 tatgcttaga actgaggtag agttttcatc cattctgtcc tgtaagtatt ttgcatattc 1141 tggagacgca ggaagagatc catctacata tcccaaagct gaattatggt agacaaaact 1201 cttccacttt tagtgcatca atttcttatt tgtgtaataa gaaaattggg aaaacgatct 1261 tcaatatgct taccaagctg tgattccaaa tattacgtaa atacacttgc aaaggaggat 1321 gtttttagta gcaatttgta ctgatggtat ggggccaaga gatatatctt agagggaggg 1381 ctgagggttt gaagtccaac tcctaagcca gtgccagaag agccaaggac aggtacggct 1441 gtcatcactt agacctcacc ctgtggagcc acaccctagg gttggccaat ctactcccag 1501 gagcagggag ggcaggagcc agggctgggc ataaaagtca gggcagagcc atctattgct 1561 tacatttgct tctgacacaa ctgtgttcac tagcaacctc aaacagacac catggtgcac 1621 ctgactcctg aggagaagtc tgccgttact gccctgtggg gcaaggtgaa cgtggatgaa 1681 gttggtggtg aggccctggg caggttggta tcaaggttac aagacaggtt taaggagacc 1741 aatagaaact gggcatgtgg agacagagaa gactcttggg tttctgatag gcactgactc 1801 tctctgccta ttggtctatt ttcccaccct taggctgctg gtggtctacc cttggaccca 1861 gaggttcttt gagtcctttg gggatctgtc cactcctgat gctgttatgg gcaaccctaa 1921 ggtgaaggct catggcaaga aagtgctcgg tgcctttagt gatggcctgg ctcacctgga 1981 caacctcaag ggcacctttg ccacactgag tgagctgcac tgtgacaagc tgcacgtgga 2041 tcctgagaac ttcagggtga gtctatggga cccttgatgt tttctttccc cttcttttct 2101 atggttaagt tcatgtcata ggaaggggag aagtaacagg gtacagttta gaatgggaaa 2161 cagacgaatg attgcatcag tgtggaagtc tcaggatcgt tttagtttct tttatttgct 2221 gttcataaca attgttttct tttgtttaat tcttgctttc tttttttttc ttctccgcaa 2281 
tttttactat tatacttaat gccttaacat tgtgtataac aaaaggaaat atctctgaga 2341 tacattaagt aacttaaaaa aaaactttac acagtctgcc tagtacatta ctatttggaa 2401 tatatgtgtg cttatttgca tattcataat ctccctactt tattttcttt tatttttaat 2461 tgatacataa tcattataca tatttatggg ttaaagtgta atgttttaat atgtgtacac 2521 atattgacca aatcagggta attttgcatt tgtaatttta aaaaatgctt tcttctttta 2581 atatactttt ttgtttatct tatttctaat actttcccta atctctttct ttcagggcaa 2641 taatgataca atgtatcatg cctctttgca ccattctaaa gaataacagt gataatttct 2701 gggttaaggc aatagcaata tttctgcata taaatatttc tgcatataaa ttgtaactga 2761 tgtaagaggt ttcatattgc taatagcagc tacaatccag ctaccattct gcttttattt 2821 tatggttggg ataaggctgg attattctga gtccaagcta ggcccttttg ctaatcatgt 2881 tcatacctct tatcttcctc ccacagctcc tgggcaacgt gctggtctgt gtgctggccc 2941 atcactttgg caaagaattc accccaccag tgcagccaca cctatcagtg gtgtggctaa 3001 tgccctggcc cacaagtatc actaagctcg ctttcttgct gtccaatttc tattaaaggt 3061 tcctttgttc cctaagtcca actactaaac tgggggatat tatgaagggc cttgagcatc 3121 tggattctgc ctaataaaaa acatttattt tcattgcaat gatgtattta aattatttct 3181 gaatatttta ctaaaaaggg aatgtgggag gtcagtgcat ttaaaacata aagaaatgaa 3241 gagctagttc aaaccttggg aaaatacact atatcttaaa ctccatgaaa gaaggtgagg 3301 ctgcaaacag ctaatgcaca ttggcaacag ccctgatgcc tatgccttat tcatccctca 3361 gaaaaggatt caagtagagg cttgatttgg aggttaaagt tttctatgct gtattttaca 3421 ttacttattg ttttagctgt cctcatgaat gtcttttcac tacccatttg cttatcctgc 3481 atctctcagc cttgactcca ctcagttctc ttgcttagag ataccacctt tcccctgaag 3541 tgttccttcc atgttttacg gcgagatggt ttctcctcgc ctggccactc agccttagtt 3601 gtctctgttg tcttatagag gtctacttga agaaggaaaa acagggggca cggtttgact 3661 gtcctgtgag cccttcttcc ctgcctcccc cactcacagt gac // LOCUS ECOHBB222 552 bp ds-DNA BCT 14-AUG-1990 DEFINITION E.coli beta-globin gene, complete cds. ACCESSION M34059 KEYWORDS beta-globin. SEGMENT 2 of 2 SOURCE E.coli (strain HB101) DNA. ORGANISM Escherichia coli Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Facultatively anaerobic rods; Enterobacteriaceae. 
REFERENCE 1 (bases 1 to 552) AUTHORS Thein,S.L., Hesketh,C., Taylor,P., Temperley,I.J., Hutchinson,R.M., Old,J.M., Wood,W.G., Clegg,J.B. and Weatherall,D.J. TITLE Molecular basis for dominantly inherited inclusion body beta-thalassemia JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable [or printed] sequence for [1] kindly submitted by S.L.Thein, 07-MAY-1990. Author address: S.L.Thein Institute of Molecular Biology, MRC Molecular Haematology Unit John Radcliffe Hospital, Oxford, OX3 9DU, U.K. FEATURES from to/span description pept + 169 272 beta-globin, exon 4 IVS < 1 168 beta-globin intron C BASE COUNT 153 a 113 c 108 g 178 t ORIGIN 1 tctgcatata aattgtaact gatgtaagag gtttcatatt gctaatagca gctacaatcc 61 agctaccatt ctgcttttat tttatggttg ggataaggct ggattattct gagtccaagc 121 taggcccttt tgctaatcat gttcatacct cttatcttcc tcccacagct cctgggcaac 181 gtgctggtct gtgtgctggc ccatcacttt ggcaaataat tcaccccacc agtgcaggct 241 gcctatcaga aagtggtggc tggtgtggct aatgccctgg cccacaagta tcactaagct 301 cgctttcttg ctgtccaatt tctattaaag gttcctttgt tccctaagtc caactactaa 361 actgggggat attatgaagg gccttgagca tctggattct gcctaataaa aaacatttat 421 tttcattgca atgatgtatt taaattattt ctgaatattt tactaaaaag ggaatgtggg 481 aggtcagtgc atttaaaaca taaagaaatg aagagctagt tcaaaccttg ggaaaataca 541 ctatatctta aa // LOCUS HUMETN1 682 bp ds-DNA PRI 14-AUG-1990 DEFINITION Human endothelin (ETN) gene, complete cds. ACCESSION M25377 J04819 J05489 KEYWORDS endothelin. SEGMENT 1 of 4 SOURCE Human leukocyte DNA and human umbilical vein endothelial cell cDNA to mRNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 682) AUTHORS Bloch,K.D., Friedrich,S.P., Lee,M.-E., Eddy,R.L., Shows,T.B. and Quertermous,T. TITLE Structural organization and chromosomal assignment of the gene encoding endothelin JOURNAL J. Biol. Chem. 
264, 10851-10857 (1989) STANDARD simple staff_review REFERENCE 2 (bases 2 to 421) AUTHORS Lee,M.-E., Bloch,K.D., Clifford,J.A. and Quertermous,T. TITLE Functional analysis of the endothelin-1 gene promoter: Evidence for an endothelial cell-specific cis-acting sequence JOURNAL J. Biol. Chem. 265, 10446-10450 (1990) STANDARD simple staff_review COMMENT Draft entry and computer readable copy of sequence [1] kindly submitted by K.D.Bloch (06-AUG-89). FEATURES from to/span description pept 519 + 582 endothelin prepropeptide, exon 1 /nomgen="EDN1" /map="unassigned" /hgml_locus_uid="LU0066V" pre-msg 252 > 682 ETN mRNA and introns (5' end put.) IVS 583 > 682 ETN intron A BASE COUNT 121 a 178 c 169 g 213 t 1 others ORIGIN Chromosome 6. 1 ggcaggttta gcaaaggtct ctaatgggna ttttattttt cttagccctg cccccgaatt 61 gtcagacggc ggcgtctgcc tctgaagtta gcagtgattt ctttcgggcc tggccttatc 121 tccggctgca cgttgcctgt tggtgactaa taacacaata acattgtctg gggctggaat 181 aaagtcggag ctgtttaccc ccactctaat aggggttcaa tataaaaagc cggcagagag 241 ctgtccaagt cagacgcgcc tctgcatctg cgccaggcga acgggtcctg cgcctcctgc 301 agtcccagct ctccaccgcc gcgtgcgcct gcagacgctc cgctcgctgc cttctctcct 361 ggcaggcgct gcttttctcc ccgttaaagg gcacttgggc tgaaggatcg ctttgagatc 421 tgaggaaccc gcagcgcttt gagggacctg aagctgtttt tcttcgtttt cctttgggtt 481 cagtttgaac gggaggtttt tgatcccttt ttttcagaat ggattatttg ctcatgattt 541 tctctctgct gtttgtggct tgccaaggag ctccagaaac aggtaggcac gctcgttgac 601 ttgtaagtct cggaattaca agttagtgtg ttcttatcca ccttcatgct tttcttgctt 661 ctatttttcc ccgttctttt ta // LOCUS MUSKER19 5422 bp ds-DNA ROD 14-AUG-1990 DEFINITION Mouse keratin 19 gene, complete cds. ACCESSION M36120 KEYWORDS keratin 19. SOURCE Mouse (strain C34/HeNCrlBR) embryo fibroblast DNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 5422) AUTHORS Lussier,M., Filion,M., Compton,J.G., Nadeau,J.H., Lapointe,L. and Royal,A. 
TITLE The mouse keratin 19-encoding gene: Sequence, structure and chromosomal assignment JOURNAL Gene (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable [or printed] sequence for [1] kindly submitted by M.Lussier, 06-JUL-1990. FEATURES from to/span description pept 375 803 keratin 19, exon 1 3728 3810 keratin 19, exon 2 4029 4185 keratin 19, exon 3 4553 4714 keratin 19, exon 4 4832 4957 keratin 19, exon 5 5070 5324 keratin 19, exon 6 pre-msg 304 > 5422 keratin 19 mRNA and introns IVS 804 3727 keratin 19 intron A IVS 3811 4028 keratin 19 intron B IVS 4186 4552 keratin 19 intron C IVS 4715 4831 keratin 19 intron D IVS 4958 5069 keratin 19 intron E signal 184 188 CAAT box signal 278 284 ATA box signal 5402 5408 poly-A signal BASE COUNT 1255 a 1429 c 1431 g 1307 t ORIGIN 1 gatatcgtca ctccggaagt tagagttgtc taagcctgtg tagtaagggg ctgaagggcc 61 agaaaaggga cgtgacatgt tggcagtagc tttggagtgg gctggggcgg ggcagctctg 121 ggaaggactg agacctctgg ctcctgggag gggagaggta ggagcagaat cgccaggaat 181 tgaccaatgg ggaaagagcc catatttgca ctctgggagc ttggaaattt ctgatacccg 241 ccccttcaac atctccatcc cccttcccgc cccgggcata aaaagccaca ggtgagggcc 301 ttgtcactcc tcctgcggcc agcagttctc agacctgcgt ccctttttcc ttcgctctgg 361 tctccctcct catcatgact tcctatagct atcgccagac ctcagctatg tcttcctttg 421 ggggtacggg cgggggttca gtacgcattg ggtcaggggg tgttttccgc gcacccagca 481 tccacggggg ctcaggtggc cgcggcgtgt ccgtgtcctc cacccgcttc gtgacctcgt 541 cctccgggag ctatggcgga gtccgcggtg gaagttttag tgggaccctg gctgtgtctg 601 atgggctgct gtctggcaat gagaagatca ccatgcaaaa cctcaatgat cgtctcgcct 661 cctacttgga caaggtgcgc gccctagagc aggccaatgg cgagctggag gtgaagatcc 721 gcgactggta ccagaagcag ggacccggac cctcccgaga ttacaaccac tactttaaga 781 ccatcgagga cttgcgcgac aaggtgggta gcggccaggc cctgggaggt gcacctgttg 841 gggtggtgcg gcagggcggg gaggttgtgg aaactcagcc cgcgggtgga caagcccgtt 901 actacgggcc gggcaggaac ccgcagggtg gggcacaaag tctgggctgg accctgggat 961 tcgagtggag gcggggtgga agggaactga gacaggttaa gtgggtgggg ctttcggaag 
1021 cggtggggac ctgcttggat agtcagggat atctatccaa agagagcatt taacacagct 1081 ttgggctgcc cagagatgaa gataccgagc cgcagcctca gctgggggtg gggtgttggg 1141 gagggctggg gagctgccgg atattagtct gtaggctcag aggtcacctg agcctgtggc 1201 cctcttctgg gactcttgag cattgctttc tccagttgtg tgggttgttc ctgtctggtt 1261 ttctttacag cagctgtcct ctcactctcc tcctctccag cccaggggac cacctaggct 1321 cacacggagc tacgagaaga atgtaaaacc tgggggctgc acagtaaaac cagatcttaa 1381 aaaaaaaaac ttccagtgag gtgtggtagc tcaggcctgt aatcccagga gcaggtggat 1441 gagttcatag ctatgcccag ctacagttaa cttgaatcta gtcagggcta catgagtctc 1501 taggctccca cccggccgtc aaacagcccg aacagacaga cagacagaca gacagacaga 1561 cagacagaca gacaacaact ctaacacgaa acaaaagaaa aataaaaaca attattttag 1621 taagtaagag ggaggtggaa gtcttgaccc tctgcttctg tggacctctc tcctggagtc 1681 ccccggtcca gggaagacaa cctgcaagtc tctgtgggct ccaacatcca acaggctggt 1741 tgggtggggc attgccctca ctcctaggga agagctatgt cctgaagaaa tgtaccagag 1801 accaggcagg cgtttaatag cgtcctctcg ctagtgctcc caacatgcag ttctcaagag 1861 cattttccag ggctgtgtga acttagcatg gagctagaca cacaacacag caagcagcta 1921 gagaggtgtt tttttgaccc agtgatggac agctagaatc ctaggcggta cacatctttc 1981 tttctcttgc tttgtccctt ccccttccac acactcccag cacgcatatc tagaagcaat 2041 tcttactccg ctgtgaatgc tctcctccgg gtttttatat tctgttgttt tcgtcaacaa 2101 ccccccaccc ccaaactatc tatgtagctg agcgtggtgg ctgatacgtg gtggcttatc 2161 cagcattgag aaactgaggc agaaagaagg acgatgaagt taatgccagt tgaaccatac 2221 agtgagactc tgtctcaaaa tgtcaaacca gagctggggc gcagtggcac acacctttaa 2281 tcccagcact tggggaggca gagaaggtgg atctctgtga attcaaggcc agtctacaga 2341 gtgagtccag gacagccagg gctacaaaga gaaatcctgt ctcgaacccc ctagtccttc 2401 acccccaaag tctaacaaac cagaggaaac caacccaacc caacccaacc caacccaacc 2461 caacccaacc caacccaacc aaaaacatta aaccaaaata atcctgcaga ttccaacctg 2521 gtgacaggat aagtggccat gaccttaatc tttgaaaaac actgaaccct ggtactgaat 2581 tggtggtaaa cttaatgatt tgagtgaacc cctcaaacca gagtcttcat attgggtaaa 2641 ctcaggattt ccagtcacgt acagtggcag tactaaccta ggagtgaggc tgagttacca 2701 
tgaagagagt acaaggtgcc tggcacatca ccgaagcaga atgtgctcta agtgtgggac 2761 acagacaggg ataagtgttg cttttctttt ctttaggatt tatttattta cttatctatt 2821 aatttattta tatgtatatt ggtgttttgc ttgcatgtat gtctgtgtga ggaacttgaa 2881 ttgcaaacag ttgtgagctg ccacgagggt gctgcaaatt gaactctgtt cctttggaag 2941 aatatccagg gctcttaatc tccaagttac ctctccagtc cctgggataa atgatccatc 3001 catccttcct tccttccttc cttccttcct tccttccttc cttccttcct tccgtctttc 3061 ctctttcttt cttttttttt tttttttggt ttttggtttt tggttttttg agacagagtt 3121 tctctgtgta gccctggctg gctcaaactc agaaatcctt ctgcctctgc ctcccaagtg 3181 gtggggttaa aggcatgtgc caccactgcc tggagggata aatgattctt ttttttttaa 3241 gatttattta tttattatat ctaagtacac tgtagctgtc ttcagacact ctggaagaga 3301 gcgtcagatc tagtttcgga tggttgtgag ccaccatgtg gttgctggga tttgaactca 3361 ggaccttagg aagagcagtc agtgctctta accgctgagc catctctcca gcccaggata 3421 aatgattctt aaagcttcat gtttctaggt ccttttcaga tcacttagag tttcacctgc 3481 acaggtcttg gccatgccct tctggaattg cctctttgcc tgcctccctt cacccaacct 3541 cctttacagt gtgttgtgct ccagctctgg ggctgtggat cgatgccaaa gtctcccaga 3601 gagcagacta gtgggtagtg tcaagggagt gggggctatc cattcctctc tccttctctg 3661 ggaagaaagt agattgataa tactggcttg ccaaactggt gccaactgcc agcctcctcc 3721 cttctagatt cttggtgcca ccattgacaa ctccaagatt gtcctacaga ttgacaatgc 3781 tcgcctggct gcagatgact tcagaaccaa gtaagtgtcc ctgtattggg ggctgcggaa 3841 gccagggcaa gtgagaggag agcagcggca ggtcttatag tgcttgatga agtagaggcc 3901 tggactctgc ccttctccac ttgagctggc tcaaagcaat gccccccctc cccgactcct 3961 aaaccctggg aaagttggca gagacacaga tccagcacta gcctgactgc aactatttct 4021 tcggaaaggt ttgagacaga acacgccttg cgtctgagcg tggaggccga catcaacggc 4081 ctgcgccggg tgctggatga gctgactctg gccaggactg acctggagat gcagattgag 4141 agcctgaagg aggagctggc ctacctgaag aagaaccatg aggaggtggg tcaagctggg 4201 ccttctgtcc attttgtccc aggattggga gaccctatta tcttgggctt actgaccatg 4261 acctcatctc ttggccctgg ctttgatcat gcctctggca gtctcagcag ggccatgaca 4321 tcatcactat gatgacctta ctgaccatga catcatcact atggccctgg ctttgatcat 4381 gcctctcgca 
gtctcagcag ggcatggtaa ccatgacatc tactttccag tgctggctcc 4441 tagatttgta cctccaaacc agccgtcttc agctcagggc ttgggggagt gacctgatcc 4501 aagttcagca tgtgtaggtc ttgacatgcc cttgactgtg gtgtcttcac aggaaattac 4561 tgccctgagg agccaggtgg gtggccaggt cagtgtggag gtggattcca ctcccggtgt 4621 cgacctagcc aagatcctga gtgagatgag aagtcagtat gagatcatgg ccgagaagaa 4681 ccggaaggat gctgaagcca cctaccttgc tcgggtatgc agaggatgct ggtatcccgt 4741 gagggtggcg aggagagacc ctgaaccctc acccatccac ccgcagagag aagcttgttg 4801 cttattttct ccttaacttg tcttgttaca gattgaggag ctgaacaccc aggtcgccgt 4861 ccactctgag cagatccaga taagcaagac cgaagtcacg gaccttcgac ggaccctcca 4921 gggccttgag attgagctgc agtcccagct cagcatggta cgtgtctcca ccctacggcc 4981 tgcacacttg tgccctggca acctcggtgc ctgtgcccgt gttcaggaac ctgcctctgc 5041 tgagtcacgc cccctttccc tcccctcaga aagctgccct ggaaggcacg ctggcagaga 5101 cggaggcccg ttatggagtc cagctgtcac agatccagag cgtgatcagc ggttttgaag 5161 cccagctgag cgacgtgcgt gccgacatag agcgccagaa ccaggagtat aagcagctca 5221 tggacatcaa gtccaggctg gagcaggaga tcgccaccta ccgcagcctg ctggagggcc 5281 aggaagccca ctacaacaat ctgcccaccc ccaaggccat ctgagctacc agcgagactc 5341 ccctgggaag gggcctgact ggggtgataa aagtttactc taacccctcc ctcgacttgt 5401 caataaaact atcctccaag gg // LOCUS VIBHEMOX 1586 bp ds-DNA BCT 14-AUG-1990 DEFINITION V.parahaemolyticus thermolabile hemolysin (TL) gene, complete cds. ACCESSION M36437 KEYWORDS thermolabile hemolysin. SOURCE V.parahaemolyticus DNA. ORGANISM Vibrio parahaemolyticus Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Facultatively anaerobic rods; Vibrionaceae. REFERENCE 1 (bases 1 to 1586) AUTHORS Taniguchi,H., Hirano,H., Kubomura,S., Higashi,K. and Mizuguchi,Y. TITLE Comparison of the nucleotide sequences of the genes for the thermostable direct hemolysin and the thermolabile hemolysin from Vibrio parahaemolyticus JOURNAL Microb. Pathog. 
1, 425-432 (1986) STANDARD simple staff_review FEATURES from to/span description pept 124 1380 thermolabile hemolysin precursor (TL) sigp 124 183 thermolabile hemolysin signal peptide matp 184 1377 thermolabile hemolysin BASE COUNT 466 a 378 c 361 g 381 t ORIGIN 1 tcgactgtct ggagtattta ctcagaatta gagggtttag tgagaaaaat tctcatctaa 61 acaatgttat agccaagtat tttttcaatg tgcttgggtc aataaccatt acaagaagag 121 tgaatgatga aaaaaacaat cacactatta actgcattac tcccgcttgc ttctgcagtt 181 gccgaagagc caaccttatc accagaaatg gtttcagcgt ctgaagtgat cagcacgcaa 241 gaaaaccaaa cctataccta tgttcgctgt tggtatcgca ccagctactc gaaagatgat 301 ccagcgaccg attgggaatg ggcaaaaaac gaagatggta gctacttcac cattgacggc 361 tactggtgga gctccgtttc atttaaaaac atgttctaca ccaacacgtc gcaaaacgtt 421 atccgtcagc gttgtgaagc aacattagat ttggcgaacg agaacgcaga cattacgttc 481 ttcgccgctg acaatcgctt ctcatacaac cacacgatct ggagcaacga cgcagcaatg 541 cagccagatc aaatcaacaa agtggttgca ctcggtgaca gcttgtctga tacaggcaac 601 atctttaacg catcacaatg gcgcttccct aacccgaaca gctggttctt aggtcacttc 661 tccaacggtt ttgtgtggac agaatacatt gccaaagcga agaaccttcc gctctacaac 721 tgggcagttg gcggcgcggc tggtgagaac caatacatcg cgctaacagg ggttggtgat 781 caagtttctt cgtacttaac ctacgcaaaa ctggcgaaga actacaaacc agcaaacacc 841 ttgtttacgc ttgagtttgg tttgaatgac ttcatgaact acaaccgtgg cgttccagaa 901 gtgaaagcgg attatgcaga agcactgatt cgtttgacgg acgcaggtgc gaagaacttc 961 atgttgatga cactgccaga tgcgacgaaa gcgcctcagt ttaagtactc aacacaagaa 1021 gagatcgaca aaattcgtgc gaaagtgctt gagatgaacg agttcatcaa ggcacaagcg 1081 atgtactaca aagcgcaagg ttacaacatc acgttgtttg atactcacgc cttgttcgag 1141 acgctaactt ctgcgcccga agagcacggt ttcgtgaacg cgagcgatcc ttgtttggac 1201 atcaaccgct catcgtctgt cgattacatg tacacccacg cattgcgctc tgagtgtgca 1261 gcgtctggtg ctgagaagtt tgtgttctgg aatgtcacgc atccaacaac agcaactcac 1321 cgctatgttg cagagaaaat gctagaaagt agcaacaact tagccgagta ccgtttctaa 1381 ccggacacgg cttctgagtt gaaaccttat cttcgtacac acgttgataa cgaacacatc 1441 gtggccattt ttatcgaagg aacgttgtgg tcacagcagt 
cacaacgcta aacaagttac 1501 agtggcgcga cgtcggttcc ccctaaagac tggtaaagcg taacctgagt gagaaactgg 1561 ttgtagcggt tctccaacaa agaagc // LOCUS CELCAPKCC1 195 bp ss-mRNA INV 14-AUG-1990 DEFINITION C.elegans cAMP-dependent protein kinase catalytic subunit C gene, exon 1. ACCESSION M37114 J05289 M35424 KEYWORDS cAMP-dependent protein kinase catalytic subunit C. SEGMENT 1 of 7 SOURCE C.elegans (strain Bristol N2), cDNA to mRNA and DNA, clones lambda-CC[2,3,5]. ORGANISM Caenorhabditis elegans Eukaryota; Animalia; Metazoa; Nemata; Secernentea; Spiruria; Spirurida; Spirurina; Filarioidea; Filariidae. REFERENCE 1 (bases 1 to 195) AUTHORS Gross,R.E., Bagchi,S., Lu,X. and Rubin,C.S. TITLE Cloning, characterization, and expression of the gene for the catalytic subunit of cAMP-dependent protein kinase in Caenorhabditis elegans JOURNAL J. Biol. Chem. 265, 6896-6907 (1990) STANDARD simple staff_entry FEATURES from to/span description pept 121 + 190 cAMP-dependent protein kinase catalytic subunit C, exon 1 pre-msg 1 > 195 cAMP-dependent protein kinase catalytic subunit C mRNA and introns IVS 191 > 195 intron A BASE COUNT 63 a 47 c 48 g 37 t ORIGIN 1 gtttgaggta acttaaacag aaagcagcga agaccacaca agagaagact gcaagtgtct 61 tctgaagtgt gacaacgtcc catccagtcc cgatcgtggt cttcgtctac cagcaaaaca 121 atgctcaagt ttctgaaacc aaaatcgtcg gacgagggaa gctcgaagga caacaagaac 181 tcggcgtcgc gtatg // LOCUS CELCAPKCC2 75 bp ss-mRNA INV 14-AUG-1990 DEFINITION C.elegans cAMP-dependent protein kinase catalytic subunit C gene, exon 2. ACCESSION M37115 J05289 M35424 KEYWORDS cAMP-dependent protein kinase catalytic subunit C. SEGMENT 2 of 7 SOURCE C.elegans (strain Bristol N2), cDNA to mRNA and DNA, clones lambda-CC[2,3,5]. ORGANISM Caenorhabditis elegans Eukaryota; Animalia; Metazoa; Nemata; Secernentea; Spiruria; Spirurida; Spirurina; Filarioidea; Filariidae. REFERENCE 1 (bases 1 to 75) AUTHORS Gross,R.E., Bagchi,S., Lu,X. and Rubin,C.S. 
TITLE Cloning, characterization, and expression of the gene for the catalytic subunit of cAMP-dependent protein kinase in Caenorhabditis elegans JOURNAL J. Biol. Chem. 265, 6896-6907 (1990) STANDARD simple staff_entry FEATURES from to/span description pept + 9 + 70 cAMP-dependent protein kinase catalytic subunit C, exon 2 pre-msg < 1 > 75 cAMP-dependent protein kinase catalytic subunit C mRNA and introns IVS < 1 8 intron A IVS 71 > 75 intron B BASE COUNT 26 a 17 c 18 g 14 t ORIGIN 1 attttcagtc aaagaattcc tggacaaggc acgcgaagac ttcaaacaac gatgggagaa 61 tccggctcag gtaca // LOCUS CELCAPKCC3 142 bp ss-mRNA INV 14-AUG-1990 DEFINITION C.elegans cAMP-dependent protein kinase catalytic subunit C gene, exon 3. ACCESSION M37116 J05289 M35424 KEYWORDS cAMP-dependent protein kinase catalytic subunit C. SEGMENT 3 of 7 SOURCE C.elegans (strain Bristol N2), cDNA to mRNA and DNA, clones lambda-CC[2,3,5]. ORGANISM Caenorhabditis elegans Eukaryota; Animalia; Metazoa; Nemata; Secernentea; Spiruria; Spirurida; Spirurina; Filarioidea; Filariidae. REFERENCE 1 (bases 1 to 142) AUTHORS Gross,R.E., Bagchi,S., Lu,X. and Rubin,C.S. TITLE Cloning, characterization, and expression of the gene for the catalytic subunit of cAMP-dependent protein kinase in Caenorhabditis elegans JOURNAL J. Biol. Chem. 265, 6896-6907 (1990) STANDARD simple staff_entry FEATURES from to/span description pept + 9 + 137 cAMP-dependent protein kinase catalytic subunit C, exon 3 pre-msg < 1 > 142 cAMP-dependent protein kinase catalytic subunit C mRNA and introns IVS < 1 8 intron B IVS 138 > 142 intron C BASE COUNT 44 a 27 c 35 g 36 t ORIGIN 1 attttcagaa cacggcgtgt cttgacgact ttgataggat aaaaaccctt ggaaccggat 61 cattcgggcg cgtgatgctt gtgaagcata agcagtcggg caactattat gccatgaaaa 121 ttctagacaa acaaaaggtt gt // LOCUS CELCAPKCC4 113 bp ss-mRNA INV 14-AUG-1990 DEFINITION C.elegans cAMP-dependent protein kinase catalytic subunit C gene, exon 4. 
ACCESSION M37117 J05289 M35424 KEYWORDS cAMP-dependent protein kinase catalytic subunit C. SEGMENT 4 of 7 SOURCE C.elegans (strain Bristol N2), cDNA to mRNA and DNA, clones lambda-CC[2,3,5]. ORGANISM Caenorhabditis elegans Eukaryota; Animalia; Metazoa; Nemata; Secernentea; Spiruria; Spirurida; Spirurina; Filarioidea; Filariidae. REFERENCE 1 (bases 1 to 113) AUTHORS Gross,R.E., Bagchi,S., Lu,X. and Rubin,C.S. TITLE Cloning, characterization, and expression of the gene for the catalytic subunit of cAMP-dependent protein kinase in Caenorhabditis elegans JOURNAL J. Biol. Chem. 265, 6896-6907 (1990) STANDARD simple staff_entry FEATURES from to/span description pept + 9 + 108 cAMP-dependent protein kinase catalytic subunit C, exon 4 pre-msg < 1 > 113 cAMP-dependent protein kinase catalytic subunit C mRNA and introns IVS < 1 8 intron C IVS 109 > 113 intron D BASE COUNT 30 a 23 c 27 g 33 t ORIGIN 1 attttcaggt tgtaaaactc aagcaagtgg agcacacgtt gaacgaaaag cgcattctcc 61 aggctatcga ttttccgttt ttggtgaaca tgacgttctc tctcaagggt gag // LOCUS CELCAPKCC5 222 bp ss-mRNA INV 14-AUG-1990 DEFINITION C.elegans cAMP-dependent protein kinase catalytic subunit C gene, exon 5. ACCESSION M37118 J05289 M35424 KEYWORDS cAMP-dependent protein kinase catalytic subunit C. SEGMENT 5 of 7 SOURCE C.elegans (strain Bristol N2), cDNA to mRNA and DNA, clones lambda-CC[2,3,5]. ORGANISM Caenorhabditis elegans Eukaryota; Animalia; Metazoa; Nemata; Secernentea; Spiruria; Spirurida; Spirurina; Filarioidea; Filariidae. REFERENCE 1 (bases 1 to 222) AUTHORS Gross,R.E., Bagchi,S., Lu,X. and Rubin,C.S. TITLE Cloning, characterization, and expression of the gene for the catalytic subunit of cAMP-dependent protein kinase in Caenorhabditis elegans JOURNAL J. Biol. Chem. 
265, 6896-6907 (1990) STANDARD simple staff_entry FEATURES from to/span description pept + 9 + 217 cAMP-dependent protein kinase catalytic subunit C, exon 5 pre-msg < 1 > 222 cAMP-dependent protein kinase catalytic subunit C mRNA and introns IVS < 1 8 intron D IVS 218 > 222 intron E BASE COUNT 50 a 60 c 44 g 68 t ORIGIN 1 tatttcagac aactccaact tgtacatggt gctcgaattc atctcgggag gtgagatgtt 61 ctcacattta cgccgaatcg gacgtttctc cgagccccac tcgcgtttct acgccgctca 121 aattgtgctt gcctttgagt accttcactc cttagatctc atctatcgtg atctgaaacc 181 cgagaacttg cttatcgatt ccactggata tctaaaggta gg // LOCUS CELCAPKCC6 397 bp ss-mRNA INV 14-AUG-1990 DEFINITION C.elegans cAMP-dependent protein kinase catalytic subunit C gene, exon 6. ACCESSION M37119 J05289 M35424 KEYWORDS cAMP-dependent protein kinase catalytic subunit C. SEGMENT 6 of 7 SOURCE C.elegans (strain Bristol N2), cDNA to mRNA and DNA, clones lambda-CC[2,3,5]. ORGANISM Caenorhabditis elegans Eukaryota; Animalia; Metazoa; Nemata; Secernentea; Spiruria; Spirurida; Spirurina; Filarioidea; Filariidae. REFERENCE 1 (bases 1 to 397) AUTHORS Gross,R.E., Bagchi,S., Lu,X. and Rubin,C.S. TITLE Cloning, characterization, and expression of the gene for the catalytic subunit of cAMP-dependent protein kinase in Caenorhabditis elegans JOURNAL J. Biol. Chem. 
265, 6896-6907 (1990) STANDARD simple staff_entry FEATURES from to/span description pept + 9 + 392 cAMP-dependent protein kinase catalytic subunit C, exon 6 pre-msg < 1 > 397 cAMP-dependent protein kinase catalytic subunit C mRNA and introns IVS < 1 8 intron E IVS 393 > 397 intron F BASE COUNT 119 a 73 c 99 g 106 t ORIGIN 1 attttaaggt aaccgatttc ggatttgcga aacgtgtcaa aggacgaacg tggacattgt 61 gcggcacacc agaatacttg gcacccgaga ttattttgtc aaaaggatac aataaggctg 121 tcgattggtg ggcacttggt gtgttgatct acgaaatggc cgccggttat ccgccgttct 181 ttgccgatca accgattcaa atttatgaga agattgtgtc aggaaaagtg aaattcccgt 241 cgcacttctc gaatgagttg aaagacttgc tgaagaattt gctccaagtt gatcttacaa 301 agcgatacgg taacttgaaa aatggagttg ccgatatcaa aaatcacaaa tggtttggaa 361 gtaccgactg gattgccatc tatcagaaga aggtgag // LOCUS CELCAPKCC7 729 bp ss-mRNA INV 14-AUG-1990 DEFINITION C.elegans cAMP-dependent protein kinase catalytic subunit C gene, exon 7. ACCESSION M37120 J05289 M35424 KEYWORDS cAMP-dependent protein kinase catalytic subunit C. SEGMENT 7 of 7 SOURCE C.elegans (strain Bristol N2), cDNA to mRNA and DNA, clones lambda-CC[2,3,5]. ORGANISM Caenorhabditis elegans Eukaryota; Animalia; Metazoa; Nemata; Secernentea; Spiruria; Spirurida; Spirurina; Filarioidea; Filariidae. REFERENCE 1 (bases 1 to 729) AUTHORS Gross,R.E., Bagchi,S., Lu,X. and Rubin,C.S. TITLE Cloning, characterization, and expression of the gene for the catalytic subunit of cAMP-dependent protein kinase in Caenorhabditis elegans JOURNAL J. Biol. Chem. 
265, 6896-6907 (1990) STANDARD simple staff_entry FEATURES from to/span description pept + 9 134 cAMP-dependent protein kinase catalytic subunit C, exon 7 pre-msg < 1 729 cAMP-dependent protein kinase catalytic subunit C mRNA and introns IVS < 1 8 intron F BASE COUNT 197 a 197 c 86 g 249 t ORIGIN 1 aatttcagat cgaagcaccg ttccttccaa aatgccgtgg ccctggcgac gcatctaact 61 ttgacgacta cgaggaagag ccgcttcgta tctcgggcac cgagaaatgc gccaaagagt 121 ttgccgagtt ttaattcatt atctttgtcg attccttcca taagcttttt gtttcaaatt 181 tcactctctc tctctctctt ttccatattc ctcaaaattg tactagtcca aaacaaaacc 241 ctgtccctat tgtccggtgg tttgtcttcc ctcacccaaa ccttctcatt ctcagccaaa 301 ttgaattcaa tattgttgcg acaattgaaa atgatgaaga tgcttactat ttcttccctt 361 ccttctgaac cccttatttt ttaatgttca tacattttct tgccccgatc ccgtccccct 421 aataataatt tgtaaaaata atagattgga atttgctcta atatatataa ttattatata 481 tcaaaatttt ctagtcaacc cctgactttc cctcctccaa ttcagcctcc cgcaaacttt 541 tttctagaaa agttgacttc tgtctcaaca aagaaatgga atcccacaaa accattaatt 601 gtatgttttt ttctcatttt ccttttcctc tctcatgtat acatatatat gtattttaaa 661 cacttttcac cccccccccc cccacacaca ctcaccgcaa aaatccccca gtcgtcataa 721 aaagtagtt // LOCUS DRODEC1A 3784 bp ds-DNA INV 14-AUG-1990 DEFINITION D.melanogaster defective chorion-1 fc125 (dec-1) gene, complete cds. ACCESSION M35887 M35688 X07697 KEYWORDS dec-1 protein; eggshell protein. SOURCE D.melanogaster ovary, cDNA to mRNA and genomic DNA. ORGANISM Drosophila melanogaster Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta; Pterygota; Neoptera; Holometabola; Diptera; Brachycera; Cyclorrhapha; Schizophora; Drosophiloidea; Drosophilidae. REFERENCE 1 (bases 1 to 121) AUTHORS Hawley,R.J. and Waring,G.L. TITLE Cloning and analysis of the dec-1 female-sterile locus, a gene required for proper assembly of the Drosophila eggshell JOURNAL Genes Dev. 2, 341-349 (1988) STANDARD simple automatic REFERENCE 2 (bases 44 to 3784) AUTHORS Waring,G.L., Hawley,R.J. and Schoenfeld,T. 
TITLE Multiple proteins are produced from the dec-1 eggshell gene in Drosophila by alternative RNA splicing and proteolytic cleavage events JOURNAL Dev. Biol. (1990) In press STANDARD simple staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by G.L.Waring, 28-JUN-1990. EMBL features not translated to GenBank features: key from to description PRM 14 18 put. TATA-box SITE 44 44 transcription initiation site FEATURES from to/span description pept 119 3490 fc125 protein precursor (dec-1) sigp 119 187 fc125 protein signal peptide matp 188 3487 fc125 protein BASE COUNT 1103 a 999 c 1004 g 678 t ORIGIN Chromosome 7C-X. 1 gtggcaaaaa atctatttgg accaccaacc ggctgcagtc ggcagtcagt cggccaacga 61 gctccggcga acacagatcg atctttagcg atttttccga gttttgccca aatacaggat 121 gagattgttt agccttctgc cgctcctggc gcttcttgtc gtccaggttg ccggacaaag 181 tgaagtaacc tctgatgatc ctgcaacgga tgctggatca acgaccaatt ccaccacgga 241 caccaaacca aggattccca gtcaggatga gatccttggc cagatgccgt ccattaatcc 301 catccgcact ggcaatcccc agatggacgc attttacatg atgttcccgg cgttgggcag 361 cctgctcaag tggggcagcc ttttcccagc ctactcgatt ctgggcgcca ttcccgacaa 421 tctacagcct acagcggcag cctccaaggt ggtccttgtc ctggccgatg atgcgacggc 481 caagactcga gttgcccgtc aaaatccgcc accaaatcca cttggtcagc taatgaattg 541 gcccgctctg ccgcaggact tccaactgcc ttccatggat ctgggaccgc aagtgggctc 601 gtttttggcc caactgcctg ctatgcctac cgtgcctggt cttctgggtg ccgctgctcc 661 agttcctgct ccagctcctg ctcccgctgc tgctcctcct ccggcgccag ctccagctgc 721 agatcctccg gcagcacctg ttccagatgc accccaacca gccatactgg gagaagccgc 781 tctgcagaac gctttcacct tctttaaccc ggctaacttt gatgcctcca gtcttctggg 841 ccagagtgta cccacatttg ctcctccaaa tcttgatttc gtcgctcaaa tgcaaaggca 901 attcttccca ggaatgacac cggcccaacc tgcggccgct ggcacggatg cccaggcctc 961 cgacatttcc gaggtaaggg tacgtcctga ggatccgtat tcgcaagagg cacagatgaa 1021 gatcaaatcg gcacttgaaa tggagcagga gaggcaacaa caggctcagg tcaaggatca 1081 ggagcaagtg cctctcctct ggttccgaat gcccacaact cagaatcagg atgcgactga 1141 
agaaaagact ctggagcatc tgcgggtcga ggccaaattg agggcattcg agcgccaggt 1201 gatagccgaa ttgagaatgc tgcagaagat cgaacttatg gctaagcaga tgaggtccag 1261 caccgcagct caaaacggag attcacccta caggatcagt tatccactga gtcggacacc 1321 cattcacaag atcacccgtg ctgatattga gcaagccctt cgggatgatt atgtccgccg 1381 attggtcaac aaggaagcgc aacgaagggc tagaaattcc ggcataaaca cccagaaggc 1441 aaatgccttg aagcgacagg ccaaatccca ggatcagact ctgtccaagg aggatatcgt 1501 ccagattatg gcgtatgcct atcgcatggc caacgaacag atggagagcg agaagggcaa 1561 gcaggacaag gtttacgcgg cctacaggac ggaacagaat ccaatgatga tgcagcaaag 1621 acaatggtcg gaggagcagg ccaagatcca acagaatcaa cagcagatcc agcaaaatcc 1681 aatgatgatg cagcaacgac aatggtcgga ggagcaggcc aagatccaac agaatcaaca 1741 gcagatccag caaaatccaa tgatgatgca gcaacgacaa tggtcggagg agcaggccaa 1801 gatccaacag aatcaacagc agatccagca aaatccaatg atgatgcagc aaagacaatg 1861 gtcggaggag caggctaaga tccaacagaa tcaacagcag atccagcaaa atccaatgat 1921 ggtgcagcaa agacaatggt cggaggagca ggccaagatc caacagaatc aacagcagat 1981 ccagcaaaat ccaatgatga tgcagcaaag acaatggtcg gaggagcagg ccaagattca 2041 gcatgatcaa cagatggcac aacagatggc acagcaaggt ctcatgatga cggagcagag 2101 gcaaaggcag tggtcagaag atcaggccaa aattcagcag gctcaacaga tggcccaaca 2161 gacacccatg atgatgccac agatgcaaca aaggcagtgg acagaggatc cccaaatggt 2221 tcagcagatg caacagaggc agtgggcaga agatcagacc aggatgcaaa tggcccaaca 2281 gaatccaatg atgcaacagc agcgacagat ggcagagaat ccgcaaatga tgcagcaaag 2341 gcaatggtcc gaggagcaga ccaaaatcga acaggctcag cagatggcac aacagaatca 2401 aatgatgatg cagcagatgc agcaaagaca atggtcggag gatcaggccc agattcaaca 2461 gcagcagaga cagatgatgc agcagacccc catgatgatg aaggagcgcc agtgggcgga 2521 ggagaacccc cagtctgtcc agcagcaagg gcccatgatg atgcagcagc agatgccatc 2581 gatgatgcag cgtgaagtag aggacgaaga taataaggca gaggatgact tagttggcga 2641 ggcgggaccc caaatggccg gagaacgaag gcaccgccag gcacaaagtc gatgccctgg 2701 gagttggcgg caacaagcgc aagaagtcca agtccaagtc ggcgccgcaa cggtaattaa 2761 ctattactat gcggcaccca gcgtccggtg gttcagagtt acggaacaag ctacggcgga 2821 aggtggctat 
ggatcgaatg cctatggtgt tcctcgtcca gtcaattcgt atcaaagtca 2881 aggctacagg gctgccgtgg gtaacgatga ggtagatgag atgttgcgcc agcaccagac 2941 aatggccagg acaataaacc cgaagcaacc aggcgaagtc ggtggatcgg agagccagaa 3001 gagcaactct aatcctccaa cgacgttgac accagctcca caagagcaac cgcaagagca 3061 tcgagtccac aaaagtccat catcagcacc atccgaaacc gaaatcgaaa acgcaccatc 3121 atctgacccc caagtgggtt ccattttcac ctacggcgag ggattgctgc atccgtttat 3181 gggtctgctg ccggtggaga ggcccgatga tccgtggaac cagaagccat acgatcctca 3241 ccatcctctc tacacgggcg gtgggagcta cgatgcctat ctgagggacg gtcgccatcg 3301 aagggacacg cacatcatgg gtcaaggtac gcagcatggc atcctgacac ccggcatgtt 3361 ggagagactt cttcggatca agatggattt ccagcgtaga tttcccccat ttatacaagg 3421 gtatgctgaa tcaccatacg aatcttaccc gtgtagaggt acagcctccg gttctgggca 3481 agatttctaa gccgaaaact aagaccaagc ccaagaatga ggacgaaccc gtcttcgaat 3541 tgggagccgc tgagcgtagc ctgttcgagg acgagaccaa cgattcttta gagaaagatc 3601 ccgaaccaga accggatgag gaggatgacc gggatgtgga agagcccagc gagagcagtg 3661 aacctagggg attcagcagc aaaaaaagcc gggatgagaa cgacatcgat tacttcaatt 3721 tcgatgacga ttgatgttga tgattgactt ttattataat tgccaataaa attaatatta 3781 taaa // LOCUS DRODEC1B 3798 bp ds-DNA INV 14-AUG-1990 DEFINITION Drosophila melanogaster defective chorion-1 fc106 (dec-1) gene, complete cds. ACCESSION M35888 M35688 X07697 KEYWORDS dec-1 protein; eggshell protein. SOURCE D.melanogaster ovary, cDNA to mRNA and genomic DNA. ORGANISM Drosophila melanogaster Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta; Pterygota; Neoptera; Holometabola; Diptera; Brachycera; Cyclorrhapha; Schizophora; Drosophiloidea; Drosophilidae. REFERENCE 1 (bases 1 to 121) AUTHORS Hawley,R.J. and Waring,G.L. TITLE Cloning and analysis of the dec-1 female-sterile locus, a gene required for proper assembly of the Drosophila eggshell JOURNAL Genes Dev. 2, 341-349 (1988) STANDARD simple automatic REFERENCE 2 (bases 44 to 3798) AUTHORS Waring,G.L., Hawley,R.J. and Schoenfeld,T. 
TITLE Multiple proteins are produced from the dec-1 eggshell gene in Drosophila by alternative RNA splicing and proteolytic cleavage events JOURNAL Dev. Biol. (1990) In press STANDARD simple staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by G.L.Waring, 28-JUN-1990. EMBL features not translated to GenBank features: key from to description PRM 14 18 put. TATA-box SITE 44 44 transcription initiation site FEATURES from to/span description pept 119 2971 fc106 protein precursor (dec-1) sigp 119 187 fc106 protein signal peptide matp 188 2968 fc106 protein BASE COUNT 1106 a 1003 c 1008 g 681 t ORIGIN Chromosome 7C-X. 1 gtggcaaaaa atctatttgg accaccaacc ggctgcagtc ggcagtcagt cggccaacga 61 gctccggcga acacagatcg atctttagcg atttttccga gttttgccca aatacaggat 121 gagattgttt agccttctgc cgctcctggc gcttcttgtc gtccaggttg ccggacaaag 181 tgaagtaacc tctgatgatc ctgcaacgga tgctggatca acgaccaatt ccaccacgga 241 caccaaacca aggattccca gtcaggatga gatccttggc cagatgccgt ccattaatcc 301 catccgcact ggcaatcccc agatggacgc attttacatg atgttcccgg cgttgggcag 361 cctgctcaag tggggcagcc ttttcccagc ctactcgatt ctgggcgcca ttcccgacaa 421 tctacagcct acagcggcag cctccaaggt ggtccttgtc ctggccgatg atgcgacggc 481 caagactcga gttgcccgtc aaaatccgcc accaaatcca cttggtcagc taatgaattg 541 gcccgctctg ccgcaggact tccaactgcc ttccatggat ctgggaccgc aagtgggctc 601 gtttttggcc caactgcctg ctatgcctac cgtgcctggt cttctgggtg ccgctgctcc 661 agttcctgct ccagctcctg ctcccgctgc tgctcctcct ccggcgccag ctccagctgc 721 agatcctccg gcagcacctg ttccagatgc accccaacca gccatactgg gagaagccgc 781 tctgcagaac gctttcacct tctttaaccc ggctaacttt gatgcctcca gtcttctggg 841 ccagagtgta cccacatttg ctcctccaaa tcttgatttc gtcgctcaaa tgcaaaggca 901 attcttccca ggaatgacac cggcccaacc tgcggccgct ggcacggatg cccaggcctc 961 cgacatttcc gaggtaaggg tacgtcctga ggatccgtat tcgcaagagg cacagatgaa 1021 gatcaaatcg gcacttgaaa tggagcagga gaggcaacaa caggctcagg tcaaggatca 1081 ggagcaagtg cctctcctct ggttccgaat gcccacaact cagaatcagg atgcgactga 1141 
agaaaagact ctggagcatc tgcgggtcga ggccaaattg agggcattcg agcgccaggt 1201 gatagccgaa ttgagaatgc tgcagaagat cgaacttatg gctaagcaga tgaggtccag 1261 caccgcagct caaaacggag attcacccta caggatcagt tatccactga gtcggacacc 1321 cattcacaag atcacccgtg ctgatattga gcaagccctt cgggatgatt atgtccgccg 1381 attggtcaac aaggaagcgc aacgaagggc tagaaattcc ggcataaaca cccagaaggc 1441 aaatgccttg aagcgacagg ccaaatccca ggatcagact ctgtccaagg aggatatcgt 1501 ccagattatg gcgtatgcct atcgcatggc caacgaacag atggagagcg agaagggcaa 1561 gcaggacaag gtttacgcgg cctacaggac ggaacagaat ccaatgatga tgcagcaaag 1621 acaatggtcg gaggagcagg ccaagatcca acagaatcaa cagcagatcc agcaaaatcc 1681 aatgatgatg cagcaacgac aatggtcgga ggagcaggcc aagatccaac agaatcaaca 1741 gcagatccag caaaatccaa tgatgatgca gcaacgacaa tggtcggagg agcaggccaa 1801 gatccaacag aatcaacagc agatccagca aaatccaatg atgatgcagc aaagacaatg 1861 gtcggaggag caggctaaga tccaacagaa tcaacagcag atccagcaaa atccaatgat 1921 ggtgcagcaa agacaatggt cggaggagca ggccaagatc caacagaatc aacagcagat 1981 ccagcaaaat ccaatgatga tgcagcaaag acaatggtcg gaggagcagg ccaagattca 2041 gcatgatcaa cagatggcac aacagatggc acagcaaggt ctcatgatga cggagcagag 2101 gcaaaggcag tggtcagaag atcaggccaa aattcagcag gctcaacaga tggcccaaca 2161 gacacccatg atgatgccac agatgcaaca aaggcagtgg acagaggatc cccaaatggt 2221 tcagcagatg caacagaggc agtgggcaga agatcagacc aggatgcaaa tggcccaaca 2281 gaatccaatg atgcaacagc agcgacagat ggcagagaat ccgcaaatga tgcagcaaag 2341 gcaatggtcc gaggagcaga ccaaaatcga acaggctcag cagatggcac aacagaatca 2401 aatgatgatg cagcagatgc agcaaagaca atggtcggag gatcaggccc agattcaaca 2461 gcagcagaga cagatgatgc agcagacccc catgatgatg aaggagcgcc agtgggcgga 2521 ggagaacccc cagtctgtcc agcagcaagg gcccatgatg atgcagcagc agatgccatc 2581 gatgatgcag cgtgaagtag aggacgaaga taataaggca gaggatgact tagttggcga 2641 ggcgggaccc caaatggccg gagaacgaag gcaccgccag gcacaaagtc gatgccctgg 2701 gagttggcgg caacaagcgc aagaagtcca agtccaagtc ggcgccgcaa cggtaattaa 2761 ctattactat gcggcaccca gcgtccggtg gttcagagtt acggaacaag ctacggcgga 2821 aggtggctat 
ggatcgaatg cctatggtgt tcctcgtcca gtcaattcgt atcaaagtca 2881 aggctacagg gctgccgtgg gtaacgatga ggtagatgag atgttgcgcc agcaccagac 2941 aatggccagg gcgacgcatt tcagacaata aacccgaagc aaccaggcga agtcggtgga 3001 tcggagagcc agaagagcaa ctctaatcct ccaacgacgt tgacaccagc tccacaagag 3061 caaccgcaag agcatcgagt ccacaaaagt ccatcatcag caccatccga aaccgaaatc 3121 gaaaacgcac catcatctga cccccaagtg ggttccattt tcacctacgg cgagggattg 3181 ctgcatccgt ttatgggtct gctgccggtg gagaggcccg atgatccgtg gaaccagaag 3241 ccatacgatc ctcaccatcc tctctacacg ggcggtggga gctacgatgc ctatctgagg 3301 gacggtcgcc atcgaaggga cacgcacatc atgggtcaag gtacgcagca tggcatcctg 3361 acacccggca tgttggagag acttcttcgg atcaagatgg atttccagcg tagatttccc 3421 ccatttatac aagggtatgc tgaatcacca tacgaatctt acccgtgtag aggtacagcc 3481 tccggttctg ggcaagattt ctaagccgaa aactaagacc aagcccaaga atgaggacga 3541 acccgtcttc gaattgggag ccgctgagcg tagcctgttc gaggacgaga ccaacgattc 3601 tttagagaaa gatcccgaac cagaaccgga tgaggaggat gaccgggatg tggaagagcc 3661 cagcgagagc agtgaaccta ggggattcag cagcaaaaaa agccgggatg agaacgacat 3721 cgattacttc aatttcgatg acgattgatg ttgatgattg acttttatta taattgccaa 3781 taaaattaat attataaa // LOCUS DRODEC1C 5740 bp ds-DNA INV 14-AUG-1990 DEFINITION Drosophila melanogaster defective chorion fc177 (dec-1) gene, complete cds. ACCESSION M35889 M35688 X07697 KEYWORDS dec-1 protein; eggshell protein. SOURCE D.melanogaster ovary, cDNA to mRNA and genomic DNA. ORGANISM Drosophila melanogaster Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta; Pterygota; Neoptera; Holometabola; Diptera; Brachycera; Cyclorrhapha; Schizophora; Drosophiloidea; Drosophilidae. REFERENCE 1 (bases 1 to 121) AUTHORS Hawley,R.J. and Waring,G.L. TITLE Cloning and analysis of the dec-1 female-sterile locus, a gene required for proper assembly of the Drosophila eggshell JOURNAL Genes Dev. 2, 341-349 (1988) STANDARD simple automatic REFERENCE 2 (bases 44 to 5740) AUTHORS Waring,G.L., Hawley,R.J. and Schoenfeld,T. 
TITLE Multiple proteins are produced from the dec-1 eggshell gene in Drosophila by alternative RNA splicing and proteolytic cleavage events JOURNAL Dev. Biol. (1990) In press STANDARD simple staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by G.L.Waring, 28-JUN-1990. EMBL features not translated to GenBank features: key from to description PRM 14 18 put. TATA-box SITE 44 44 transcription initiation site FEATURES from to/span description pept 119 4888 fc177 protein precursor (dec-1) sigp 119 187 fc177 protein signal peptide matp 188 4885 fc177 protein BASE COUNT 1716 a 1481 c 1496 g 1046 t 1 others ORIGIN Chromosome 7C-X. 1 gtggcaaaaa atctatttgg accaccaacc ggctgcagtc ggcagtcagt cggccaacga 61 gctccggcga acacagatcg atctttagcg atttttccga gttttgccca aatacaggat 121 gagattgttt agccttctgc cgctcctggc gcttcttgtc gtccaggttg ccggacaaag 181 tgaagtaacc tctgatgatc ctgcaacgga tgctggatca acgaccaatt ccaccacgga 241 caccaaacca aggattccca gtcaggatga gatccttggc cagatgccgt ccattaatcc 301 catccgcact ggcaatcccc agatggacgc attttacatg atgttcccgg cgttgggcag 361 cctgctcaag tggggcagcc ttttcccagc ctactcgatt ctgggcgcca ttcccgacaa 421 tctacagcct acagcggcag cctccaaggt ggtccttgtc ctggccgatg atgcgacggc 481 caagactcga gttgcccgtc aaaatccgcc accaaatcca cttggtcagc taatgaattg 541 gcccgctctg ccgcaggact tccaactgcc ttccatggat ctgggaccgc aagtgggctc 601 gtttttggcc caactgcctg ctatgcctac cgtgcctggt cttctgggtg ccgctgctcc 661 agttcctgct ccagctcctg ctcccgctgc tgctcctcct ccggcgccag ctccagctgc 721 agatcctccg gcagcacctg ttccagatgc accccaacca gccatactgg gagaagccgc 781 tctgcagaac gctttcacct tctttaaccc ggctaacttt gatgcctcca gtcttctggg 841 ccagagtgta cccacatttg ctcctccaaa tcttgatttc gtcgctcaaa tgcaaaggca 901 attcttccca ggaatgacac cggcccaacc tgcggccgct ggcacggatg cccaggcctc 961 cgacatttcc gaggtaaggg tacgtcctga ggatccgtat tcgcaagagg cacagatgaa 1021 gatcaaatcg gcacttgaaa tggagcagga gaggcaacaa caggctcagg tcaaggatca 1081 ggagcaagtg cctctcctct ggttccgaat gcccacaact cagaatcagg atgcgactga 
1141 agaaaagact ctggagcatc tgcgggtcga ggccaaattg agggcattcg agcgccaggt 1201 gatagccgaa ttgagaatgc tgcagaagat cgaacttatg gctaagcaga tgaggtccag 1261 caccgcagct caaaacggag attcacccta caggatcagt tatccactga gtcggacacc 1321 cattcacaag atcacccgtg ctgatattga gcaagccctt cgggatgatt atgtccgccg 1381 attggtcaac aaggaagcgc aacgaagggc tagaaattcc ggcataaaca cccagaaggc 1441 aaatgccttg aagcgacagg ccaaatccca ggatcagact ctgtccaagg aggatatcgt 1501 ccagattatg gcgtatgcct atcgcatggc caacgaacag atggagagcg agaagggcaa 1561 gcaggacaag gtttacgcgg cctacaggac ggaacagaat ccaatgatga tgcagcaaag 1621 acaatggtcg gaggagcagg ccaagatcca acagaatcaa cagcagatcc agcaaaatcc 1681 aatgatgatg cagcaacgac aatggtcgga ggagcaggcc aagatccaac agaatcaaca 1741 gcagatccag caaaatccaa tgatgatgca gcaacgacaa tggtcggagg agcaggccaa 1801 gatccaacag aatcaacagc agatccagca aaatccaatg atgatgcagc aaagacaatg 1861 gtcggaggag caggctaaga tccaacagaa tcaacagcag atccagcaaa atccaatgat 1921 ggtgcagcaa agacaatggt cggaggagca ggccaagatc caacagaatc aacagcagat 1981 ccagcaaaat ccaatgatga tgcagcaaag acaatggtcg gaggagcagg ccaagattca 2041 gcatgatcaa cagatggcac aacagatggc acagcaaggt ctcatgatga cggagcagag 2101 gcaaaggcag tggtcagaag atcaggccaa aattcagcag gctcaacaga tggcccaaca 2161 gacacccatg atgatgccac agatgcaaca aaggcagtgg acagaggatc cccaaatggt 2221 tcagcagatg caacagaggc agtgggcaga agatcagacc aggatgcaaa tggcccaaca 2281 gaatccaatg atgcaacagc agcgacagat ggcagagaat ccgcaaatga tgcagcaaag 2341 gcaatggtcc gaggagcaga ccaaaatcga acaggctcag cagatggcac aacagaatca 2401 aatgatgatg cagcagatgc agcaaagaca atggtcggag gatcaggccc agattcaaca 2461 gcagcagaga cagatgatgc agcagacccc catgatgatg aaggagcgcc agtgggcgga 2521 ggagaacccc cagtctgtcc agcagcaagg gcccatgatg atgcagcagc agatgccatc 2581 gatgatgcag cgtgaagtag aggacgaaga taataaggca gaggatgact tagttggcga 2641 ggcgggaccc caaatggccg gagaacgaag gcaccgccag gcacaaagtc gatgccctgg 2701 gagttggcgg caacaagcgc aagaagtcca agtccaagtc ggcgccgcaa cggtaattaa 2761 ctattactat gcggcaccca gcgtccggtg gttcagagtt acggaacaag ctacggcgga 2821 
aggtggctat ggatcgaatg cctatggtgt tcctcgtcca gtcaattcgt atcaaagtca 2881 aggctacagg gctgccgtgg gtaacgatga ggtagatgag atgttgcgcc agcaccagac 2941 aatggccagg acaataaacc cgaagcaacc aggcgaagtc ggtggatcgg agagccagaa 3001 gagcaactct aatcctccaa cgacgttgac accagctcca caagagcaac cgcaagagca 3061 tcgagtccac aaaaggttag cacatttcca caggtttggg caagaggcag ggttaaatgc 3121 gaccacatcc aagggttgcg gatgtggcag gttggattgt ctgtgcggca ggagttgtcg 3181 atgcggaaga agaggcttgg aatctagagt ggtttctagt agaacttcgg gaacttgcca 3241 gtgcaaagcc agtcaccgga acaagcgttg tgaatatgga accttggaga ccatcgatga 3301 gggttctctt aacgaactta gaagagaata taaactggga ctaaaagaga tcacgctgag 3361 tccagatgaa gatcccgccg aggccctgat gcgctacaat gcggcctcca ttcgcgaagc 3421 tttggagcga gccagtatgg aaccgctgga gatcggtggg gatcagtatg aagaggatgc 3481 gcaacaggag cccatggagg aggagcaact acagcatgat cccaacacag agccccagta 3541 caatcataag gactttgtgc ggctaaccac ttccacagca tctccgatca caagtaccac 3601 agaagctgcc actccaacag gcagtgactc tacttcagaa gcaactgtga cacctgaagt 3661 caccaccacc actagcacat ccacaacaac aacaaccgaa agcaccaaag atgagggact 3721 ggacatgcag caagactcac aggctgaggc tgagagttct catgtcacca agtcgatatc 3781 gaagcaggag gccganattc atcagctgca cagtattgtg gaggagctga agaatgagat 3841 ccttaaactg aatttgcgtt gcagcacgat aatatcaaac aatgtggcca aagaaccggt 3901 cacagagaag aacccacccg tggtggagga accatccaag caggaggata agccaaaggt 3961 cgaggagaaa gtgattgccg aagagcaagc cccagtggag cccgaggaag atctggaaga 4021 ggacgaggat agtacctcta ttagtacgac cacagagact ccttcaccaa gtggtagtta 4081 ctcaacaaaa cctggacttt cccttggatc gcctagagtg gatgaacaaa gtggctcgtc 4141 caataaactg gactacgacg acgataacaa ttggcaacgc attctggcca atcgtggcta 4201 tgacacggac tacctaacca aatcccatga gcgtcagttt gcccagggtc aaaacctgga 4261 gatgcccaag aactgcaatt acgatggaaa tggcagccaa gagtatggcc cctatccgga 4321 gttccaagca gatgaaccca gcacggatac ggagggcaag gctaaaaggg ctttaagtgt 4381 gaaacaacag gcgcagctgc tcaatgcggc cctaaatgat agtggaagcg attcgagcga 4441 tggcaccacc accaccacca caccctctcc gtacgcgatg aggggaaagt ttgtgagacg 4501 gaggagcacg 
gcaagaaggg ttcccattcc aaagatagga aaggcgagcg acgaggtatg 4561 ggtacgatct cctcgtcaag cgaaaatgcc ccaacgaccc aagaagtcca tgtccaagcc 4621 gaaaaagcag agctcgcagg tgactacgca ggccaccgtg agcagcacta agctggacag 4681 tttggtggat gtgctcaagg atctggtgcg cctgcaaatt caaaaggaga aaaaatccag 4741 cctattaaga acccagagca ataacctttc gaaaaccaaa cccaaatcta ttaagcccgt 4801 caaagtcatc aagcgaaaaa ggttgcgcag gaggcaacac aagtctattg ccacgaccat 4861 caggagtccc attcaaacga aagcataaga tttcagccca tttaactaat tcacagatcc 4921 caactaatcc tatacataat ttagatatag tataagcata ttcactattc gttcgcaatt 4981 cattcatatc catccatctt tcatttgcat atccctcgtc catatcgcca gtccatcatc 5041 agcaccatcc gaaaccgaaa tcgaaaacgc accatcatct gacccccaag tgggttccat 5101 tttcacctac ggcgagggat tgctgcatcc gtttatgggt ctgctgccgg tggagaggcc 5161 cgatgatccg tggaaccaga agccatacga tcctcaccat cctctctaca cgggcggtgg 5221 gagctacgat gcctatctga gggacggtcg ccatcgaagg gacacgcaca tcatgggtca 5281 aggtacgcag catggcatcc tgacacccgg catgttggag agacttcttc ggatcaagat 5341 ggatttccag cgtagatttc ccccatttat acaagggtat gctgaatcac catacgaatc 5401 ttacccgtgt agaggtacag cctccggttc tgggcaagat ttctaagccg aaaactaaga 5461 ccaagcccaa gaatgaggac gaacccgtct tcgaattggg agccgctgag cgtagcctgt 5521 tcgaggacga gaccaacgat tctttagaga aagatcccga accagaaccg gatgaggagg 5581 atgaccggga tgtggaagag cccagcgaga gcagtgaacc taggggattc agcagcaaaa 5641 aaagccggga tgagaacgac atcgattact tcaatttcga tgacgattga tgttgatgat 5701 tgacttttat tataattgcc aataaaatta atattataaa // LOCUS DROGLDYYR 6458 bp ds-DNA INV 14-AUG-1990 DEFINITION D.melanogaster glucose dehydrogenase (GLD) gene, complete cds. ACCESSION M29298 X07358 X13581 X13582 KEYWORDS YYRR box; glucose dehydrogenase. SOURCE D.melanogaster (strain OregonR) DNA, clone pEG25D20. ORGANISM Drosophila melanogaster Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta; Pterygota; Neoptera; Holometabola; Diptera; Brachycera; Cyclorrhapha; Schizophora; Drosophiloidea; Drosophilidae. REFERENCE 1 (bases 1460 to 1531) AUTHORS Cavener,D.R. 
JOURNAL Unpublished (1988) Vanderbilt Univ,Dept Mol Bio,Nashville,TN 37235 STANDARD full staff_entry REFERENCE 2 (bases 1460 to 1531) AUTHORS Cavener,D.R., Feng,Y., Foster,B., Krasney,P., Murtha,M., Schonbaum,C. and Xiao,X. TITLE The YYRR box: A conserved dipyrimidine-dipurine sequence element in Drosophila and other eukaryotes JOURNAL Nucleic Acids Res. 16, 3375-3390 (1988) STANDARD full staff_entry REFERENCE 3 (bases 78 to 1142 and 3792 to 4208) AUTHORS Whetten,R., Organ,E., Krasney,P., Cox-Foster,D. and Cavener,D. TITLE Molecular structure and transformation of the glucose dehydrogenase gene in Drosophila melanogaster JOURNAL Genetics 120, 475-484 (1988) STANDARD full staff_entry REFERENCE 4 (bases 1 to 6458) AUTHORS Krasney,P.A., Carr,C.M. and Cavener,D.R. TITLE Evolution of the glucose dehydrogenase gene in Drosophila JOURNAL Mol. Biol. Evol. 7, 155-177 (1990) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by D.R.Cavener, 20-OCT-1990. FEATURES from to/span description pept 3838 4005 glucose dehydrogenase (GLD), exon 2 (EC 1.1.99.10) 4079 4199 glucose dehydrogenase, exon 3 4539 6088 glucose dehydrogenase, exon 4 pre-msg 502 6458 glucose dehydrogenase mRNA and introns IVS 837 3828 GLD intron A IVS 4006 4078 GLD intron B IVS 4200 4538 GLD intron C signal 394 400 GLD TATA box site 1457 1532 YYRR box tetranucleotide repeat region BASE COUNT 1782 a 1464 c 1418 g 1785 t 9 others ORIGIN Chromosome 3-84C8. 
1 tatctgtctg acattaatat ccaaaaatgt aataaattca catccacaaa ttgtgaaaaa 61 aaatcaacaa cgcagttctg cagccgttcg actttatttt gccagtgctt cttaacttgg 121 ctggaaatcg ttaaactcgc aggcgcacga gcaagcagct ttttgtgtgg gttgtttagc 181 cgaaagcggt ggttgaagaa acctgtgacg tcttagccga agtcaggggt gcttaaagaa 241 agttttacaa cactagacca tattcatgag taaagggttg agtaataaaa tacataaaac 301 gtaagaaata ataataatac agattctaaa agttattagg taaaatttag accaatttag 361 acctactcat tgcaaacact caaaagctcc cgattcagac caagtttcag agagcgcagc 421 tttgcggcca gctttaagct gtctttcgtt gagttcgagc ttttcgtcag tttaaaaaga 481 ctggcgcctg ctggtcagaa gctgagtcgg taacggtctg cgtctgcgcg cagttcgaac 541 aagttgagaa agagaccaac agaaagccca tccaagtgca gtgatcaata cggtaactga 601 caaaaaccct agaagtcagg gcttaaaaac gattttgcag cgctgccagt gggtttttgt 661 gtgataaaaa aaaagcggct cagaaaaact tgctgacagc agatagcaca cacgtttttg 721 tcgtcttcgg tccattgaaa aattttcccg aggcattttc tataaggaat aaacaattaa 781 ttcaatattt aaagcataga aggaaactag acaccacatc accggactct acgatcgtaa 841 gttgatgcaa tcgtctttat ttctattatt ttctgccttt tcggtttttg cacaacccca 901 aaatccaaaa attcgcgatg tccgtttctg gcattgagga agctcaaaag atttggacag 961 ctttttggcc cgaagtctgc tggaaatttg ccattgtcat aagcccaaga aacggaattt 1021 tgttcggtct tctgatggct cacggcgata aatttactgc actttgtttg caatagctcc 1081 aattcgattc ggtttgtttt tctggcacag agcacagtat gcctcacgat ttctgctcta 1141 gactaaaact ataatcccaa atatgaaaac cgaattagct aattagttaa ggctaaagtc 1201 tgaactggaa tcttctgact aagatcccac cccttccaaa cgtggacctt acacaattca 1261 ctcaaaattc gcatagcttt atctggcgcc tttgtggcac tcataaattg cattacttta 1321 tgattattat tctaatatat gcacggttcc acattaataa gcacccgaaa aactatgcct 1381 aaacatatgg cagaagcggc aacgagataa ctgtataaca aggcccaaaa tcaagccggt 1441 atctacggta ttgttcacac tgactggctg actggctgac tggctgactg actgcctgat 1501 tgagtgactg actgactgac tgactaactg acaggcagct caaagaaacc ttcaaacgca 1561 gcaacgaaga caatgccaag ctatatcatt gaagttatcg tgcgagtgca aaacaaacca 1621 ccagaagaca actgcagctc gcgtcggttt tattcgacac ccgtgtcgct tcagtgaaca 1681 agggattatg caccatgctt 
tacatccgat aaagttgtta attccaaaaa aaaaagtcta 1741 attatgacag cgccattgat aatcgtgtca gataagttac acaggaagtg cgagtaatca 1801 aatgataaca agcaaaaatg tcgttaggct gattaaataa ttgtcttcat gttctcacca 1861 ttaaacaaaa aaatatttta aaagtcattg gtttctctga attttttatt taaattaaaa 1921 attctccatc tgctataaca catattttta tttttttaac gaaacattat tttaacttaa 1981 ctggctaaac tgcagataaa atcagaagtc caagtcctcc aaaaatctgt atattagaga 2041 tcatttaacc ctaataaacc aggcggaaaa taattaagtc ggggaaaatg tattgcggct 2101 atcccctttt cgtatcccta ggattttccc accaatttcc acgagccgcc acgcagaagt 2161 ggacctaaat ctgttagcta ggcaaaactt gccccgaaca acttttgatt ttctgagtgc 2221 tgcacgactt ttgcacgcgt ttttctcggc gtttgaagtt cgtctgtggc cacatttgtt 2281 gcatggcttt tactaaactg ctggaaaacg ggctgcgagg tcaactgcct ctcttttcga 2341 ttttccacgt tcgcgtgagn tgcgtatgtt ctgcgacttt gttttgactg tttttttgtt 2401 tttctatttt tttgtgggct cgcgctctgc ttcactttcc tggcatcctt caactttaaa 2461 agcgaaatgc aattgaaatg aatgaaaaat cgtgttccaa atgtcagatg cgagccccat 2521 tgtggcatcc aaaacagacg aactgtagca attgcatggt cttctcgatc tccgcttagg 2581 tgattctctt gtttcactct cggtttcagc ttgcgtttct gtttcaatta cttttctgct 2641 ttggttttgg cttcggtttc tgtttctgtt tctgtttctg ttttcggatg gatacaaatt 2701 gcaatttgat tgcttcaagc tgttcgagtt aattgaggtc aaaagcgctg tcagcatatt 2761 ccaaaaatgc actgatttcg gggatcggtt cggtttgggt ttcggtttca attttgggcg 2821 cctaattaac cgcccaaaaa ggagtgtgaa gagcaatatt ggaggcaagc aattaagatg 2881 gtttcttcac ataaaaggag cttagaacgg atcccgtttc agagctgctc gggagtcaat 2941 ctgatcagac ccgggggtca taagtcgacc accaacgaga cgaacaaatg acataattac 3001 attgcacaat actcaaccgt ggttggctac cgtaattcca tgcagcctca ttgcttctgt 3061 tttaggcatt tgctcgtcaa tttcgaaatt ccaaatagga aagaaaaggg aagccagcca 3121 gacatttttg gccgctcggg gctaaggtca tgtgaaacta tgcagaagct gccgccgacg 3181 gaacaaaagc caagctcgga tgatgagcta tgacttctgt ccgtccggcg ttctttttag 3241 acctttcgta atcgccaatg cagggggcag ttgccgtttg tttgcaaatg gcaagatata 3301 gttgccgntt gtttgcaaat ggcnnagata taaaaaatat cttttttgnt ttcaatttca 3361 tttcgntttt ttcctttcgg ccgcttatgc 
cgctgtgtgc atgggacaga cgacgcccat 3421 tgtctctcgg tgagtactct tattaattac agtactgctg ctggagcgtt actatgataa 3481 tgtttagtct ttgtctcagt ggacaaggtt aaacagaagt tcatgttgcg gtcttcggaa 3541 aggtttatga ctttcccaat ggctttcaat taatcagctg gaaattaaat aattaaatga 3601 ccatatcatt ttgcgattgc ttttaataaa aagagtatat tgctgacaac gtctttatcg 3661 ttctaacttt gtaaaacttt aaacaagcta gaaagagact tagaacacga acttgtagct 3721 ttatttanaa aaaaagttcg tgctctaagn cntttgtatt ttgaaacttg atcatagttt 3781 tttacctatt taatacttaa aataagaaat tcccattgat ttttttagtc tatcaacatg 3841 tccgccagcg cctcagcctg cgattgtttg gtgggcgtac ccactgggcc caccctggcc 3901 tccacatgtg gtggtagcgc cttcatgctg ttcatgggcc tcctggaggt ctttatccgc 3961 tcccagtgtg atctcgagga tccctgcgga agggccagca gtcgggtaag ctgtaaatga 4021 cgaaggccac taagagtggc tacccatatc tcaatataga tttccaacat tattacagtt 4081 tcgatcggag ccggactacg agtacgattt cattgtcatt ggcggcggct cagcgggctc 4141 tgtggtggcc tctcgactgt ccgaggtgcc ccaatggaag gtgcttttga ttgaagccgg 4201 taagggatgt gccccagcaa ctcataccca tacacatacc tacatactta agttacactg 4261 ctccggcaat tagaggcacg ttatgtggca tttgtttaga agcaattgaa gcttaacagc 4321 ggcataaact gcaacaatgg caccaacact aatgccactt gggcaagact cggcggcaaa 4381 gaagcaaccg caactgtcac gagcattatg gagcgataat ttctagtcct aatatagaac 4441 cgacaaacta atcatacact tatataaact acaaaaaact ataacaacat gatatttttt 4501 aattatgtat ttaacagtat cttaaccgtc ttacacaggt ggcgatgaac ccgtgggagc 4561 ccagataccc tcgatgttcc tcaactttat tggcagcgac attgactacc gctacaacac 4621 ggaacctgag ccaatggcct gtctgtcctc catggagcag cgctgctact ggccacgtgg 4681 caaggtactt ggcggaacct cggtattgaa cggaatgatg tatgtccgcg gcaaccggga 4741 ggactatgat gattgggcag cggatggaaa tcctggttgg gcctacaacg acgtgcttcc 4801 gttcttcaaa aagtcagagg acaacctgga tctggatgaa gtgggcacgg aatatcacgc 4861 caagggtggc ctcctgccag tgggcaagtt cccatacaac ccacctctat cctacgctat 4921 tctgaaggcc ggcgaggagt tgggcttctc cgttcacgat ctaaatggtc agaactctac 4981 tggatttatg attgcccaga tgacggcccg aaatggcatc agatatagct cagcccgggc 5041 tttcctgcgt ccggctcgca tgcgcaacaa cctgcatatc 
ctgctgaaca ccacggccac 5101 caagatcctt attcatccgc acactaagaa tgttctggga gtggaagtca gtgaccaatt 5161 cggcagcacc cgaaaaatcc tggtcaagaa ggaggtggtt ttaagtgctg gcgctgtgaa 5221 ctctccgcat atcctactac taagtggagt gggtcccaag gacgaactgc agcaggtgaa 5281 tgtgaggacc gtgcacaacc taccaggtgt gggcaagaat ctacacaatc acgtgaccta 5341 ctttaccaac ttcttcattg acgatgcgga cacggcgcca cttaactggg ccacggctat 5401 ggagtacttg ctctttcggg atggtctcat gtccggcact ggcatttcgg atgtgaccgc 5461 aaagctggcc actcgctatg ccgacagtcc cgaacgtccc gatctccagc tgtactttgg 5521 cggctatctg gccagttgtg cccgtacagg acaggtgggc gagttgctct cgaataactc 5581 tcggtccatt cagatcttcc cagctgtttt gaaccctcga tcgcggggat ttattggcct 5641 gcgatctgca gatcctctgg agccgccgcg tattgtggcc aattacctaa cacacgagcg 5701 ggatgtgaag acgctggtgg agggcattaa gtttgtcatc cgattgtcgc aaacgacgcc 5761 gttgaagcag tacggcatgc ggctggacaa gacggtggtc aagggatgcg aggcccatgc 5821 ctttggtagc gatgcctact gggagtgcgc cgtgaggcag aacacgggtc cggagaatca 5881 ccaggctggc tcctgcaaaa tgggtcccag ccacgacccc atggcggtgg tcaaccacga 5941 gttgcgggtc cacggaattc gcggactccg agtcatggac accagcataa tgcccaaggt 6001 cagctcggga aacacccatg cccccgccgt gatgatcgct gagaagggcg cctacctgct 6061 gaagagggcc tggggcgcca aggtctgacg cgtggatgcg acgtggacgt tgcatagagt 6121 aatttaatca gaatccaagc gaaaattaca gcgagtaaaa ttttaagcga aattacactt 6181 atggcacctc agatccaaaa ctaaactatt ttaccctaat ccaatcttat ttaatccgat 6241 cctattatga aggtttctca ctctaggcac accgcgttgc gcaatttcca ttaaagttat 6301 ctataagtta acaggctccc ttaaaactaa ctactgttac ccactgaata tgccgtcaaa 6361 cttcttgaac agcacactca gctcaaatta aagacaaata aaacttatta ggtatctagt 6421 aaatttacta aaattccaaa ttaaatttat gtgaattc // LOCUS DRPGLDYYR 7382 bp ds-DNA INV 14-AUG-1990 DEFINITION D.pseudoobscura glucose dehydrogenase (GLD) gene, complete cds. ACCESSION M29299 X07359 KEYWORDS YYRR box; glucose dehydrogenase. SOURCE D.pseudoobscura DNA, clone pBy2.1. 
ORGANISM Drosophila pseudoobscura Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta; Pterygota; Neoptera; Holometabola; Diptera; Brachycera; Cyclorrhapha; Schizophora; Drosophiloidea; Drosophilidae. REFERENCE 1 (bases 2131 to 2198) AUTHORS Cavener,D.R. JOURNAL Unpublished (1988) Vanderbilt Univ,Dept Mol Bio,Nashville,TN 37235 STANDARD full staff_entry REFERENCE 2 (bases 2131 to 2198) AUTHORS Cavener,D.R., Feng,Y., Foster,B., Krasney,P., Murtha,M., Schonbaum,C. and Xiao,X. TITLE The YYRR box: A conserved dipyrimidine-dipurine sequence element in Drosophila and other eukaryotes JOURNAL Nucleic Acids Res. 16, 3375-3390 (1988) STANDARD full staff_entry REFERENCE 3 (bases 1 to 7382) AUTHORS Krasney,P.A., Carr,C.M. and Cavener,D.R. TITLE Evolution of the glucose dehydrogenase gene in Drosophila JOURNAL Mol. Biol. Evol. 7, 155-177 (1990) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by D.R.Cavener, 20-OCT-1990. FEATURES from to/span description pept 3959 4126 glucose dehydrogenase (GLD), exon 2 (EC 1.1.99.10) 4203 4323 glucose dehydrogenase, exon 3 5530 7079 glucose dehydrogenase, exon 4 pre-msg 777 > 7382 GLD mRNA and introns IVS 1136 3948 GLD intron A IVS 4127 4202 GLD intron B IVS 4324 5529 GLD intron C site 2129 2200 YYRR box tetranucleotide repeat region BASE COUNT 1983 a 1667 c 1611 g 2121 t ORIGIN Chromosome 3-84C8. 
1 gaattccaat catattaatt ttaatatcat gaagcacacc aggagttttc ttatctattt 61 gttttttttt ttcaaagcat acatgactct acttttcttt ggtttcatat cactttttgg 121 aagttctcaa ctctcaaata caaagtatat tctgaacaag tattattaca tattactatc 181 ttactttttt atcaaaacct ttacaaatat ttttccactt tcaatgccca ggcttggcga 241 aaacaacaat actagaacgt agttccactc agaggtactt tattttacct ctgttcggtc 301 cgcttggaaa tcattaaacc acgcagggcc tctccgcgga aagagtgcct ttttgtgtgg 361 gctgtgtgct gtctgagagc gggggttgaa gaaacctgtg acgtcttatt gaagcatagg 421 ggtggctggc tgccataatc tggagttttg cagcactgtc aaagctgtat gagtcatcaa 481 atattcagta aagttcttat agaactaaag cttttaatgc atagttttta ttttgattgt 541 tatttttaga agttttagaa tgaattgtat ttgtttttac aaaaagaaat ctctcttttg 601 attaacagca aagcccaact taacgaagcc aaacgacgca gcgcgctagc aagcttgcca 661 accagcttta agctgccgct tcgtcaaatg aaaccttcgt taagccaaaa gcttacctca 721 gtggcgcagc gcggcttttg ggaagcttaa aaagcctcgc gctcgccggc cagagactga 781 gtcggtaacg gtctgcgtct gcgcgcagtt caaaacaggt tgaaaaaaac gaggcccagt 841 gcttaaaaga agggaacact aaaaacccta gaaggcagcc caacaaatac cgttgtacgc 901 atgagtgtct ttgtgttgtg tccaagaaaa taagaagaag aagaaactgc aacaagaaac 961 gtttaaataa attgtttaca attgtgcatt ttttacgagt cgaacagttc gtttttatca 1021 tctccatgcc attgagcaat tggcctagag gcatttctat ttttcaataa ggaataaata 1081 attaaatttt caaaaattta acaaacagaa aaggaaacct gaaccgacat agaccgtaag 1141 ttgatgcaat cgcctgatgc ccgatacctg atggctgatg cctgatgcct gagtttttct 1201 tttatttcgc tattttctgc gtgtttcggc catgtccgtt ttggcattga ggaagctaaa 1261 cgaaaactgt tttggccagc ctctgcctct tggactgctg gaaattttac cattgtcata 1321 agcccaagac ccaagaaacg gaacttcaac agagtcttgt gttgtgtcta ttggtcttct 1381 gatgcctcat ggcgataaat tgagttgcac ttttattgta tttttgtttt ttgcaatagc 1441 tccaattcca attcgattcg ttttgtttgc cgtcagtgta catgtctcac gattttactg 1501 cctctagaag accaaaaccc gttccagttc atagttggag gtccattggc caaatgaatg 1561 actacaattt gccgcaatgt gtttgagcat taaatttgct atagttttta tccactgatt 1621 atgttccagt ttttaagcgg aggggaagtt gaaaattgct ggtctgcaaa ttgactccaa 1681 agtagatgtg aaatttcttg 
aaatgatgtg aatttttctc agtttaattg aatgatggat 1741 tgccaatgga tgccaatgaa gttcttcaaa ttattgggtg tgaaagtctc acaaattttg 1801 tgaatttgat ggggaagcta ttattatgac aggtccaaag atctcacccc atccaaagag 1861 ccaaagacag agcccttaca caatccccgc actataatca catggtttta tctggcgcct 1921 ttgtcatggc acgcataaat tgcattactt tatgattata attctaataa atgcacagtt 1981 cccacattaa taagcgccct cgccctcccc tactcagaaa ccatcgctaa acatatggca 2041 aggaggtaca actaaaactc caaccaaacc caaaaaaact ctatctctgt ctgcggtatt 2101 gttcacaccc ctgagcacaa gaccgacata ctgactgact gactgactgg ctgactgact 2161 gactcactaa ctgactgatt gactgactgg cagactgaca actcaaagaa accttcaaat 2221 gcaacggcag caatgaagac aatgccaagc gatatcaata caaaaaccac aagaagacaa 2281 ctgcaacgca actacgatta taatcgacat gcgggatcgg atcgggttgg atcgcttcag 2341 tgcacaagat attatgtaca tccgataaaa ttgttaatac aaaaaaatca agctgttcat 2401 aattatgaaa aaaaactcac agataacatt gttgtcgcaa aaggaagtgt gaataggcca 2461 aagaagaggc acaaaaagtg tcattaatta attgaataat tacagagaca tgacagagag 2521 cagcacttgc ttaggatttc ccttaagttt tccatagatt tccatgccac aatactggaa 2581 aagcaaaact tgtcccgaac aacttttaat tttctgattg ctgcacgact tttgcacgcg 2641 tttttctttg gcttttgtga agttcgcctg tacttgtacc cgtagccaaa gcctgcctgg 2701 aaaacttgtt gcgaggtcaa ctgcctctct taaagccaca gttgcgtatg cactctggct 2761 ttagtttggc tttgctttac tttcgtggaa tccttcaact ttaaaacaag aaaaggacac 2821 caaaatccca ttcaaattga atgaaaaatc gtgtttcaaa tgtcaaatgc gaacaccaaa 2881 gaaacaaaaa aaaaacagac gaactgtagc aattgtagca tgatcttctc tggtgattct 2941 ctggctctga ttttttgggt ttcattttca gtttcgggct ttggtggttg tttttgtttt 3001 tggcttccac gtttggtttg gttgcaaatt gcaatttgat tgctttaagc cgtttcgatt 3061 taattgaggt caaaaacgtt gtcggattct tcccaaaaat gtgctgattt gtgggttggg 3121 ttttgggctt taaaagtaca tatttgtcta attgtcatgc ctggatgtgg tatgtagcaa 3181 ctttatcggt ctccccaaaa ctctcactta aaaatcactc aaatctcacc cgcccagagg 3241 tcatcaaaac gtttgcaaat gacataatta taatggacat tactcaatct ctcattgctt 3301 cttgttttag gcagtaggcc ccagcagaga gtacgagtaa tttgaaattt caaataggaa 3361 aagagccaga gcccgagcct tggccgctcg 
tccggggcta aggtcatgtg aaactatgca 3421 gctgactgaa caaaagccaa gagctaagac ccaaccaaga tgatgagctc tgtctgtctg 3481 acttggcctt ttggtaattg ccaatgctgc tgctgctgct ggccagttgc cgtttgccaa 3541 atgcccaacg atatcaaaaa agtatctttt tctgatttta attccatttt attcagcttt 3601 tgtttttact tcttcgactg cgatgacgcc cattgtctgc tgcggttggt acttttatta 3661 attgcagtac tgctgctgtt gctggagaat tacttagatg attttttgtc tttgccatgc 3721 cccatggatc ggcaaggtta aacagaagtt tggaattgcg gaatttggaa ttatatttat 3781 gattttgccc tttggctttg gttttgtgtt taatcaacgg caaattatta aaacaaacaa 3841 aaatacgagt agatacattt aaatattgaa ttataatttc tgaagctttg tttctcaagt 3901 tcgtttctta aatctttcgt tttctgctct tttcattcca cagcacagcc ttatcataat 3961 ggccaccagt ccatcctcct gcgattgttt ggtgggcgta cccacggggc ccaccctcgc 4021 ctccacctgc ggcggcagcg ccttcatgct gttcatgggc cttttggagg tcttcatacg 4081 ctcccaatgc gacctcgagg atccttgcgg acgggccagc acgcgggtaa gccaaagatc 4141 cgtatatggt caagacgtgt ggctaaccag atcacattca atccaatcca ttttgttggc 4201 agtttcgttc ggagccggat tacgagtacg atttcattgt cattggcggc ggctcggccg 4261 gctctgtggt tgcctcacgt ctctccgagg tgccccagtg gaaggtgttg ctgatcgaag 4321 ccggtaagtg ttgttgtttc gtctctcgcg tgtgtgtggg tgtgtgtgag tgttgcatgc 4381 aacatgttgc cagccgcagt gtgctcacgg gcaattagat gcacgttatg tatgtgcatt 4441 gtttagaagt atattgaagc ctaacagcac cagcagcagc agcagcacca gcagcagcta 4501 agacttgggc aaccgcaact ctgccacgag cattatgtaa actaaacgag ctgctgccac 4561 tgccacctag agcagcggct ccattgctgc ccccaaggca gttccccatg caaatgtctt 4621 cttgctatat aggatttgct gcatgccccc gccaatgcct catggaggtt ttttacgttc 4681 cgcttgactt gacttttttg cattggaata attgttggtt ttttgtacct tacatttttt 4741 tatgaatgaa ttataatgca tgaacaatac ggggaacgtg gaacttcgga acgagtgtgt 4801 agggcattca cttttcctta acgagaagca aatatgcggt aaactctgct gaagaggctt 4861 tttagcactg aataatttta catttaattt gcgatatttg aaaactggat ctatagcacc 4921 aggttttaga ggaaagcatt agcaaatcac ttaaatcttt gatggaatat attacaatcg 4981 tttaaaaccc aaaaaacaaa acgaaaatcc ctctatttgg tcggagcacc aaaaaacacc 5041 tcattaaaca tcataatttt tcgccattca tcactaataa 
cttaattagt tattcaggtg 5101 tctagaaaat cacgttaata cctctctcta tcgctgggaa aacattctca tagatcccat 5161 tagacccgtc ttatcagcat tcttctttct ggttggctta ctttgtgggt taccgtctat 5221 tcgtcgtttt caattgtata cgtgacaatt tttatgccca gtcactaatt acactaattg 5281 cataatgaaa ccctcgagga aagagaaaca ttttcaaagt tgtcaatcgg gttggcggta 5341 cttatacaat gccccagtca ggctgcagtt aagtgcaatt agttgtggcc attacttggc 5401 ttattgtcta tgaaaactac actttcaaag actccaaact catgctcatc acaagtatgg 5461 tttaatcttg ggttgacatt agacatgcaa ttcatcaaat aaaaattcgt attgtttgta 5521 ccatttcagg tggcgatgag ccggtgggtg cacagatccc ctccatgttc ctgaatttca 5581 ttggcagtga cattgactat cgctacaata cggagcccga gaggatggcc tgcttgtcct 5641 ccatggagca gcgttgctac tggcctcgcg gcaaggtcct aggcggcact tcggtgatga 5701 atggcatgat gtacatacgc ggcaatcgcg aggactacga cgaatgggcg gcccagggaa 5761 atccgggctg gtcctaccag gatgtcctgc cgttcttcaa gaaatcagag gataatctcg 5821 aattggatgc cgtgggcacc gaataccatg ccaagggcgg ccttttgcca gtgggcaagt 5881 tcccctacaa tcctccactg tcttatgccc tgctgaaggc cggcgaggag atgggcttct 5941 cggtgcagga tctaaacgga cagaattcga caggcttcat gattgcccag atgacggcca 6001 ggaatggcat tcgctatagt tcggccaggg ctttcctaag acccgctcgc atgcgtaaca 6061 atctgcacat attgctgaac accactgtga ccaaggtgct catccatccg ggaaccaaaa 6121 acgttgtggg tgtggaggtt agcgatcagt ttggcagcat gcgcaagatc cttgtcaaga 6181 aggaggtgat cgtcagtggt ggtgcggtga actcgccaca gatcctgctg ctgagtggcg 6241 taggacccaa agaggatctg caaaaggtga atgtccgtcc agtccatcat ctcccgggcg 6301 tgggaaagaa tctccagaac catgtggcgt acttcacgaa cttcttcatc gatgatgcgg 6361 atacggcgcc cctcaactgg gccacggcca tggagtactt gctcttccgc gatggcctga 6421 tgtcgggcac tggcatctcc gatgtcaccg caaagatggc cacccgctgg gccgatcgcc 6481 ccaatctgcc cgatctgcag ctctattttg gcggctacct ggccagctgt gcccgcaccg 6541 gacaggtggg agaactcctc tcgaacaact ctcgagccat acagatattc ccggctgtac 6601 tcaaccccaa gtctcgtggt tacattaccc tgcgctcggc cgatcccctg gatccccccc 6661 gcatctttgc caactatctg acggatgagc gcgatgttaa gaccctggtg gagggcatca 6721 agttcgccat aaggctgtcg cagacgtcac ccttgaagca gtacggcatg 
cggctggaca 6781 agaccgtggt gaagggctgt gagtcgcacg ccttcgcgag cgatgcctac tgggagtgcg 6841 ctgtgcgcca gaacaccggg ccggagaatc atcaggcggg ctcctgtaaa atgggtccct 6901 cccacgatcc catggcggtg gttaatcacg agctgagggt ccatggcgtg cggggactgc 6961 gcgtgatgga caccagcatt atgcccaagg tgacggcggg caatacccat gcaccggcgg 7021 tgatgattgc ggagaagggc gcctatctgc tgaagcgggc ctggggtgcc aaggtctgac 7081 gcgtggatgc gacgtggacg ttgcatagag taatttaatc aaattccaac gaaaattact 7141 acaaattcaa atttaagcga aaattattat taaggaagct ctgaactttt tctataccta 7201 ttataaaacc tttttggccc gatccaatcc gatctgaacc catgccccac tctaagcact 7261 ctgcgttgcg caatctacga ctcgacaatc gctctcgata agtttaacca tctaacagta 7321 ctattattta aactaattct tgttaaaaac ccattgaaat atcgaaattt cgaatatcac 7381 tc // LOCUS DRVGLDYYR1 945 bp ds-DNA INV 14-AUG-1990 DEFINITION D.virilis glucose dehydrogenase (GLD) gene, 5' end. ACCESSION M29300 KEYWORDS glucose dehydrogenase. SEGMENT 1 of 2 SOURCE D.virilis DNA, clone pCG79. ORGANISM Drosophila virilis Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta; Pterygota; Neoptera; Holometabola; Diptera; Brachycera; Cyclorrhapha; Schizophora; Drosophiloidea; Drosophilidae. REFERENCE 1 (bases 1 to 945) AUTHORS Krasney,P.A., Carr,C.M. and Cavener,D.R. TITLE Evolution of the glucose dehydrogenase gene in Drosophila JOURNAL Mol. Biol. Evol. 7, 155-177 (1990) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by D.R.Cavener, 20-OCT-1990. FEATURES from to/span description pept 943 > 945 glucose dehydrogenase (GLD; EC 1.1.99.10) mRNA 545 > 945 GLD mRNA BASE COUNT 325 a 163 c 189 g 268 t ORIGIN Chromosome 3-84C8.
1 gcatcctcgg taaaaagtgc acttcgagga tattttttgg tgatatttaa attttttcgg 61 atttgcttca aacgtattat ttaaagctat acataaaaac gttatgaata cgtgttttaa 121 cgatttggaa aatttaaacc ggcaacattg gtaaatttgg ggaaaacgtt tgtatgaaag 181 ttgtttactt accgtccgat cagttttaaa atcattttca aagctatacc cgttagttta 241 atatatatta ttgatcgttt tcattcaaag tatatatttc gttaacgctc gtaactttaa 301 tgtttgcatg ttaataaata gtcatttcaa tagttagaat atgtatttat attaaattaa 361 aaaaataaat attatgccag caagcaattg gagagcgaca agcgagcttt aagctacaca 421 acgtgaggcg cagctggaac cagttggctc ccgaaccaac tgaaaaacga accagccaac 481 agctcaagct tcgtattggc gtttttgggg ggagtttaaa aagacgcgcg ctgccggcct 541 ttcactgagt cgacgccggt ctgcgattgc gcgcagttca aaaccaagtt gctaaaatca 601 tagccaggcc taaactgcat gcagacttgt gagctgcccg ttaagccaaa gaaggcaaaa 661 ctaagtactg tgcgtgtgtt actgtgctac tgtgtgtgct aacgaatgaa aaagaaaaaa 721 aatcaccaaa aaataaaccg aaaaacattt aaaaaatagt ttttgtcaaa ttattgtgca 781 cgtcttcggt ggttgtgcaa tttgccaaaa aggcattacc gataaggaga ggcacaataa 841 gaaaataaaa gccaaaagag ctaattgttt gttttgcggc aaaccgaaaa ggaaactggc 901 caacaactgg acacgggcat agaaaacttg aattaataca acatg // LOCUS DRVGLDYYR2 50 bp ds-DNA INV 14-AUG-1990 DEFINITION D.virilis YYRR box located in intron-1 of Gld. ACCESSION X07360 KEYWORDS YYRR box; glucose dehydrogenase. SEGMENT 2 of 2 SOURCE D.virilis DNA, clone pCG79. ORGANISM Drosophila virilis Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta; Pterygota; Neoptera; Holometabola; Diptera; Brachycera; Cyclorrhapha; Schizophora; Drosophiloidea; Drosophilidae. REFERENCE 1 (bases 1 to 50; enum. 1 to 50) AUTHORS Cavener,D.R. JOURNAL Unpublished (1988) Vanderbilt Univ,Nashville,TN 37235 STANDARD simple automatic REFERENCE 2 (bases 1 to 50) AUTHORS Cavener,D.R., Feng,Y., Foster,B., Krasney,P., Murtha,M., Schonbaum,C. and Xiao,X. TITLE Title is missing from this entry JOURNAL Nucleic Acids Res. 
(1988) In press STANDARD simple automatic COMMENT *source: library=Charon 30; clone=pCG79; EMBL features not translated to GenBank features: key from to description RPT 3 50 tetranucleotide repeat SITE 3 50 CTGA motif BASE COUNT 15 a 14 c 11 g 10 t ORIGIN About 2 kbp downstream of segment 1. 1 aactgactca ctgactcact gactgactga ctgacggact gacagactga // LOCUS HUMSYB1A1 727 bp ds-DNA PRI 14-AUG-1990 DEFINITION Human synaptobrevin 1 (SYB1) gene, exon 1. ACCESSION M36196 J05611 KEYWORDS intrinsic membrane protein; synaptic vesicle protein; synaptobrevin 1. SEGMENT 1 of 5 SOURCE Human DNA, clone pcVP18-[1,5]. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 727) AUTHORS Archer,B.T.III., Ozcelik,T., Jahn,R., Francke,U. and Sudhof,T.C. TITLE structure and chromosomal localizations of two human genes encoding synaptobrevins 1 and 2 JOURNAL J. Biol. Chem. (1990) In press STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by T.C.Sudhof, 06-JUL-1990. FEATURES from to/span description pept 721 + 722 synaptobrevin 1 (SYB1), exon 1 IVS 723 > 727 synaptobrevin 1 intron A signal 544 549 TATA box BASE COUNT 130 a 217 c 257 g 123 t ORIGIN Chromosome 12. 
1 aagcttcgtg cttgctttgg tgggacggaa ccccgcatcc ggcatctggc accgggagct 61 tgtggctgtg tctcgaggct cctgcaccct ctcagagcgt tagtcagagg cggggcggtg 121 gtgcggagcg agggccagag cagggcacca gggctgggcg cgcaccagga agccgccgcc 181 tccagctgaa tgcgcgagtc ccggcgcggg gtggggcagg ggcgggcacg gtcttcagta 241 actgcctagc gattggttgg gaggcaccac aaatgcctaa caaactgctg aagcgccctc 301 cgaagggaca gacctcatct ccaccccgcc ccgtccctcc gccagactgg acgcaaacta 361 tggtcgcatt gatcactgaa ggatcctcac agcaaccgct cctttccgga gtcggatgag 421 aggagagttg tgactggcaa ttggcagggg cggggcgggc taggcctgta gcgctgggcg 481 accgtcctgg gcatggattg ggccgcgggg ttgtcaccgt tatccgggag gcgtggtcag 541 cactaataaa ggcggaggcc ggcgcggcag ctgcagtaag ttccagcgca cgtagaccgc 601 ggggtggtcg gcgcgaggcg gagcttggca gttccgtcca cttcagccgc agcgtccctc 661 gccgggtgtc tcgccgcagc ctccggagag gaacagaccc tcactctctc tgtcagaaaa 721 atgtgag // LOCUS HUMSYB1A2 153 bp ds-DNA PRI 14-AUG-1990 DEFINITION Human synaptobrevin 1 (SYB1) gene, exon 2. ACCESSION M36197 J05611 KEYWORDS intrinsic membrane protein; synaptic vesicle protein; synaptobrevin 1. SEGMENT 2 of 5 SOURCE Human DNA, clone pcVP18-[1,5]. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 153) AUTHORS Archer,B.T.III., Ozcelik,T., Jahn,R., Francke,U. and Sudhof,T.C. TITLE structure and chromosomal localizations of two human genes encoding synaptobrevins 1 and 2 JOURNAL J. Biol. Chem. (1990) In press STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by T.C.Sudhof, 06-JUL-1990. FEATURES from to/span description pept + 22 + 148 synaptobrevin 1 (SYB1), exon 2 pre-msg < 1 > 153 SYB1 mRNA and introns IVS < 1 21 SYB1 intron A IVS 149 > 153 SYB1 intron B BASE COUNT 37 a 49 c 42 g 25 t ORIGIN Chromosome 12. 
1 tctgcacatg tctttccaca ggtctgctcc agctcagcca cctgctgaag ggacagaagg 61 gactgcccca ggtgggggtc cccctggccc tcctcctaac atgaccagta acagacgact 121 acagcaaacc caggcacaag tggaggaggt agg // LOCUS HUMSYB1A3 186 bp ds-DNA PRI 14-AUG-1990 DEFINITION Human synaptobrevin 1 (SYB1) gene, exon 3. ACCESSION M36198 J05611 KEYWORDS intrinsic membrane protein; synaptic vesicle protein; synaptobrevin 1. SEGMENT 3 of 5 SOURCE Human DNA, clone pcVP18-[1,5]. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 186) AUTHORS Archer,B.T.III., Ozcelik,T., Jahn,R., Francke,U. and Sudhof,T.C. TITLE structure and chromosomal localizations of two human genes encoding synaptobrevins 1 and 2 JOURNAL J. Biol. Chem. (1990) In press STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by T.C.Sudhof, 06-JUL-1990. FEATURES from to/span description pept + 23 + 181 synaptobrevin 1 (SYB1), exon 3 pre-msg < 1 > 186 SYB1 mRNA and introns IVS < 1 22 SYB1 intron B IVS 182 > 186 SYB1 intron C BASE COUNT 50 a 36 c 60 g 40 t ORIGIN Chromosome 12. 1 tttgtttctg tgccctcctc aggtggtgga catcatacgt gtgaacgtgg acaaggtcct 61 ggagagggac cagaagctgt cagagctgga tgaccgagct gatgccttgc aggcaggagc 121 atcacaattt gagagcagtg ctgccaagct aaagaggaag tattggtgga aaaactgcaa 181 ggtgaa // LOCUS HUMSYB1A4 79 bp ds-DNA PRI 14-AUG-1990 DEFINITION Human synaptobrevin 1 (SYB1) gene, exon 4. ACCESSION M36199 J05611 KEYWORDS intrinsic membrane protein; synaptic vesicle protein; synaptobrevin 1. SEGMENT 4 of 5 SOURCE Human DNA, clone pcVP18-[1,5]. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 79) AUTHORS Archer,B.T.III., Ozcelik,T., Jahn,R., Francke,U. and Sudhof,T.C. 
TITLE structure and chromosomal localizations of two human genes encoding synaptobrevins 1 and 2 JOURNAL J. Biol. Chem. (1990) In press STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by T.C.Sudhof, 06-JUL-1990. FEATURES from to/span description pept + 23 + 74 synaptobrevin 1 (SYB1), exon 4 pre-msg < 1 > 79 SYB1 mRNA and introns IVS < 1 22 SYB1 intron C IVS 75 > 79 SYB1 intron D BASE COUNT 16 a 14 c 19 g 30 t ORIGIN Chromosome 12. 1 ttttttgtcc atgtttcctc agatgatgat catgctggga gccatctgtg ccatcatcgt 61 ggtagttatt gtaagtaag // LOCUS HUMSYB1A5 428 bp ds-DNA PRI 14-AUG-1990 DEFINITION Human synaptobrevin 1 (SYB1) gene, exon 5. ACCESSION M36200 J05611 KEYWORDS intrinsic membrane protein; synaptic vesicle protein; synaptobrevin 1. SEGMENT 5 of 5 SOURCE Human DNA, clone pcVP18-[1,5]. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 428) AUTHORS Archer,B.T.III., Ozcelik,T., Jahn,R., Francke,U. and Sudhof,T.C. TITLE structure and chromosomal localizations of two human genes encoding synaptobrevins 1 and 2 JOURNAL J. Biol. Chem. (1990) In press STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by T.C.Sudhof, 06-JUL-1990. FEATURES from to/span description pept + 22 38 synaptobrevin 1 (SYB1), exon 5 pre-msg < 1 > 428 SYB1 mRNA and introns IVS < 1 21 SYB1 intron D BASE COUNT 85 a 139 c 89 g 115 t ORIGIN Chromosome 12. 
1 cttcctgatt cctgtgtcca gtctactttt ttacttgaga atgtacccac cccttccctg 61 ttgtccattg ccatccacat tcatgtcctc tgccctctgt ttgctctctc aacacacttc 121 cccacccacc gtcctccatt ccagcccagg cttctccatc acccattcct cctttttcgt 181 tgcgttcatt tgcactctgt ccctcaacac tagaaatgct gctcgtggca caatctaagt 241 cattacccga agagcaacag tggcgcctcc tccctgcctg ctttttctgt actctcaagt 301 tcccccaaag ccccaaagag ttggaggcca agggaagggg cagggagggg agtggctgag 361 gcgaagtacc catgaagctg cccagacttg ggaggagaag agtatcggtg cccatggtga 421 cttctaga // LOCUS HUMSYB2A1 954 bp ds-DNA PRI 14-AUG-1990 DEFINITION Human synaptobrevin 2 (SYB2) gene, exon 1. ACCESSION M36201 J05611 KEYWORDS intrinsic membrane protein; synaptic vesicle protein; synaptobrevin 1. SEGMENT 1 of 5 SOURCE Human DNA, clone pcVP18-[1,5]. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 954) AUTHORS Archer,B.T.III., Ozcelik,T., Jahn,R., Francke,U. and Sudhof,T.C. TITLE structure and chromosomal localizations of two human genes encoding synaptobrevins 1 and 2 JOURNAL J. Biol. Chem. (1990) In press STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by T.C.Sudhof, 06-JUL-1990. FEATURES from to/span description pept 948 + 949 synaptobrevin 2 (SYB2), exon 1 IVS 950 > 954 SYB2 intron A BASE COUNT 125 a 361 c 305 g 163 t ORIGIN Chromosome 17. 
1 ctgcagtcgg gtcgctgacc taagcctggg gcaaagctag gtcttggcca gtcagcagcc 61 accacgtgaa gcagcccctg cccccatcac ccccggcagc cttgtgccct cctcctcggg 121 gtcctcagct caccccacct cagtcgggcc tgctccagcg gttaagtacc cctagtctga 181 accttaatcg tgaggcgggg cagccccctc cccacaccct gtagacactc ccccgtcctc 241 cgccccctgc ttttcttcct accctcctac ccagtggcca gggaccccag gcttcctctg 301 agacatccac cagcggtgat gggttgggac caatctcccc cactcctctc caccctagca 361 gtaaggtccc cgctctcgat actgccccag cacctgcact ggccctctct gatttcccaa 421 ctccagccca ttccaagtgc tgaaggcggg tcgctaggtg ctgggacctg ggcgcggggc 481 aagaccttgg gctgtgtggg gtgcaggctg ggtttactgg gttcacgatt gagggagtgg 541 tctgggggtt aggtgtaggg ccaggctgcg ttgtgctcta aggggcaggc ggcaggcggt 601 ttgggggggc tgtggtcaga cagggtgggg tggcggtggg tttggggaag ccgcagcagc 661 ccgccttggg gtcccgcgag tgccgaggga gctgccgggg catggtggcg cggggcccgt 721 ggtcctgccc ggcacgaccc ccgccccgca gcgccccgca gcgccccgcc cccgcggccc 781 cgctcgcgac gcgtctcccg cggcgccccg cccccgcccg cagtgcccgg atgcgggtga 841 cgcgcggccg ccatctttcc gtcccgggca gccagcgcca gtcggagcca gcgcgagcgc 901 cgccgccatc actgccgctg ccaagtcctc cacccgctgc ccccgccatg tgag // LOCUS HUMSYB2A2 148 bp ds-DNA PRI 14-AUG-1990 DEFINITION Human synaptobrevin 2 (SYB2) gene, exon 2. ACCESSION M36202 J05611 KEYWORDS intrinsic membrane protein; synaptic vesicle protein; synaptobrevin 1. SEGMENT 2 of 5 SOURCE Human DNA, clone pcVP18-[1,5]. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 148) AUTHORS Archer,B.T.III., Ozcelik,T., Jahn,R., Francke,U. and Sudhof,T.C. TITLE structure and chromosomal localizations of two human genes encoding synaptobrevins 1 and 2 JOURNAL J. Biol. Chem. (1990) In press STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by T.C.Sudhof, 06-JUL-1990. 
FEATURES from to/span description pept + 23 + 143 synaptobrevin 2 (SYB2), exon 2 pre-msg < 1 > 148 SYB2 mRNA and introns IVS < 1 22 SYB2 intron A IVS 144 > 148 SYB2 intron B BASE COUNT 25 a 58 c 41 g 24 t ORIGIN Chromosome 17. 1 ttaatccgtc cctgctcctc aggtctgcta ccgctgccac ggccccccct gctgccccgg 61 ctggggaggg tggtccccct gcaccccctc caaacctcac cagtaacagg agactgcagc 121 agacccaggc ccaggtggat gaggtgag // LOCUS HUMSYB2A3 186 bp ds-DNA PRI 14-AUG-1990 DEFINITION Human synaptobrevin 2 (SYB2) gene, exon 3. ACCESSION M36203 J05611 KEYWORDS intrinsic membrane protein; synaptic vesicle protein; synaptobrevin 1. SEGMENT 3 of 5 SOURCE Human DNA, clone pcVP18-[1,5]. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 186) AUTHORS Archer,B.T.III., Ozcelik,T., Jahn,R., Francke,U. and Sudhof,T.C. TITLE structure and chromosomal localizations of two human genes encoding synaptobrevins 1 and 2 JOURNAL J. Biol. Chem. (1990) In press STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by T.C.Sudhof, 06-JUL-1990. FEATURES from to/span description pept + 23 + 181 synaptobrevin 2 (SYB2), exon 3 pre-msg < 1 > 186 SYB2 mRNA and introns IVS < 1 22 SYB2 intron B IVS 182 > 186 SYB2 intron C BASE COUNT 47 a 49 c 61 g 29 t ORIGIN Chromosome 17. 1 gcctcgtggg cccttctccc aggtggtgga catcatgagg gtgaacgtgg acaaggtcct 61 ggagcgagac cagaagctgt cggagctgga cgaccgtgca gatgcactcc aggcgggggc 121 ctcccagttt gaaacaagcg cagccaagct caagcgcaaa tactggtgga aaaacctcaa 181 ggtaag // LOCUS HUMSYB2A4 77 bp ds-DNA PRI 14-AUG-1990 DEFINITION Human synaptobrevin 2 (SYB2) gene, exon 4. ACCESSION M36204 J05611 KEYWORDS intrinsic membrane protein; synaptic vesicle protein; synaptobrevin 1. SEGMENT 4 of 5 SOURCE Human DNA, clone pcVP18-[1,5]. 
ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 77) AUTHORS Archer,B.T.III., Ozcelik,T., Jahn,R., Francke,U. and Sudhof,T.C. TITLE structure and chromosomal localizations of two human genes encoding synaptobrevins 1 and 2 JOURNAL J. Biol. Chem. (1990) In press STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by T.C.Sudhof, 06-JUL-1990. FEATURES from to/span description pept + 21 + 72 synaptobrevin 2 (SYB2), exon 4 pre-msg < 1 > 77 SYB2 mRNA and introns IVS < 1 20 SYB2 intron C IVS 73 > 77 SYB2 intron D BASE COUNT 16 a 24 c 15 g 22 t ORIGIN Chromosome 17. 1 tcctcgcctc tcacccccag atgatgatca tcttgggagt gatttgcgcc atcatcctca 61 tcatcatcat aggtgag // LOCUS HUMSYB2A5 146 bp ds-DNA PRI 14-AUG-1990 DEFINITION Human synaptobrevin 2 (SYB2) gene, exon 5. ACCESSION M36205 J05611 KEYWORDS intrinsic membrane protein; synaptic vesicle protein; synaptobrevin 1. SEGMENT 5 of 5 SOURCE Human DNA, clone pcVP18-[1,5]. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 146) AUTHORS Archer,B.T.III., Ozcelik,T., Jahn,R., Francke,U. and Sudhof,T.C. TITLE structure and chromosomal localizations of two human genes encoding synaptobrevins 1 and 2 JOURNAL J. Biol. Chem. (1990) In press STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by T.C.Sudhof, 06-JUL-1990. FEATURES from to/span description pept + 27 43 synaptobrevin 2 (SYB2), exon 5 pre-msg < 1 > 146 SYB2 mRNA and introns IVS < 1 26 SYB2 intron D BASE COUNT 24 a 65 c 19 g 38 t ORIGIN Chromosome 17. 
1 tcatcttctt ccccctctct ccacagttta cttcagctct taaatccccg aggagtctgc 61 cctgcctaga gaagggcctc tcccccaacc ctcagccgtt cctccacctc tcagccatat 121 ctttcagccc ccactcccct ggatcc // LOCUS SPICPATP 4066 bp ds-DNA ORG 14-AUG-1990 DEFINITION Spinach plastid genes atpI-H-F for ATP synthase CF(O) subunits IV, III and I. ACCESSION X03775 M35501 KEYWORDS ATP synthase proteolipid; ATP synthetase; ATP synthetase CFo genes; atpA gene; atpF gene; atpH gene; atpI gene; synthetase. SOURCE Spinach chloroplast DNA. ORGANISM Chloroplast Spinacia oleracea Eukaryota; Plantae; Embryobionta; Magnoliophyta; Magnoliopsida; Caryophyllidae; Caryophyllales; Chenopodiaceae; Spinacia oleracea. REFERENCE 1 (bases 1793 to 2048) AUTHORS Alt,J., Winter,W., Sebald,W., Moser,J.G., Schedel,R., Westhoff,P. and Herrmann,R.G. TITLE Localization and nucleotide sequence of the gene for the ATP synthase proteolipid subunit on the spinach plastid chromosome JOURNAL Curr. Genet. 7, 129-138 (1983) STANDARD simple staff_entry REFERENCE 2 (bases 1 to 4066) AUTHORS Hennig,J. and Herrmann,R.G. TITLE Chloroplast ATP synthase of spinach contains nine nonidentical subunit species, six of which are encoded by plastid chromosomes in two operons in a phylogenetically conserved arrangement JOURNAL Mol. Gen. Genet. 203, 117-128 (1986) STANDARD simple staff_entry COMMENT EMBL features not translated to GenBank features: key from to description OPR 1369 1414 pot. stem-loop structure OPR 2049 2079 pot. stem-loop structure OPR 3775 3799 stem-loop structure pot. transcription terminator INVREP 2319 2326 inverted repeat A INVREP 2334 2341 inverted repeat A' INVREP 2522 2534 inverted repeat B INVREP 3348 3359 inverted repeat B' SITE 2602 2604 pot. translation stop codon SITE 3312 3314 pot. translation stop codon Between positions 2584 and 3340 six stem-loop structures I(1)-I(6) are described. 
They designate secondary structures resembling those of class II introns FEATURES from to/span description pept 357 1100 CFo subunit IV (atpI) pept 1793 2038 CFo subunit III (atpH) proteolipid pept 2434 2578 CFo subunit I (atpF) 3343 3752 CFo subunit I (atpF) pept 3818 > 4066 CF1 subunit alpha (atpA) pre-msg 2434 3749 primary transcript of atpF gene IVS 2579 3342 put. intron (no splice consensus) binding 348 351 atpI ribosome binding site binding 1780 1783 atpH ribosome binding site binding 2423 2426 atpF ribosome binding site binding 3810 3813 atpA ribosome binding site signal 206 211 atpI -35 region signal 241 246 atpI -10 region signal 1693 1698 atpH -35 region signal 1710 1715 atpH -35 region signal 1724 1728 atpH -10 region signal 1744 1749 atpH -10 region BASE COUNT 1362 a 591 c 777 g 1336 t ORIGIN 1 ctatttgttt aattgataca aattgtaacc cggatctcgc agctctttcg attccggcaa 61 atgatgacgc tatagcttca attcgattaa ttcataccaa attagtattt gcaatttgtg 121 aaggccgctc tagctatata agaaatcctt gattcataat aaaatcaaaa attgacttcc 181 taaattctct atcggttact agattttgaa tctcaaaaac tagttcaaaa taacagggga 241 tattatgtaa ttaatcagta tccgaaatag aaaattttaa ttaaagtaga caagtcgaga 301 aagagctgat tgaatcaaaa taattttttt aagttatatt tctgtaagag gacaatatga 361 atgttctatc atattcaatc aacccgctaa aggggttata tgctatatcg ggtgtggaag 421 taggtcaaca tttttattgg caaataggag gtttccaaat ccatgacaag gcccttataa 481 cttcttgggt tgtaattgct atcttattag gttcagctgc tatagctgtt cggagtccgc 541 aaacaattcc gactggtggt caaaattttt ttgaatatgt ccttgaattc atccgagacg 601 tgagcaaaac tcaaattggc gaagaatatc gcccgtgggt tccctttatt gggactatgt 661 ttctatttat ttttgtttct aattggtcag gggctctttt accttggaaa atcatacagt 721 tacctcatgg ggagttagcc gcacccacga acgatataaa tactactgtt gctttagctt 781 tactcgcgtc agtagcctat ttctatgcag gtcttacaaa aaaaggatta ggttattttg 841 gtaaatacat tcaaccaact ccaattcttt tacccattaa catcttagaa gatttcacaa 901 aaccgctatc acttagtttt cgacttttcg gaaatatatt agcggatgaa ttagtagttg 961 ttgttcttgt ttctttagta cctttagtgg ttcctatacc tgtcatgttt 
cttggcttat 1021 ttacaagtgg tattcaggct cttatttttg caactttagc cgcagcttat ataggcgaat 1081 ccctggaagg tcatcattga tttgtcttag gaagagttta tctcctagtt tagatatatg 1141 tgtatgtgtg gctcaagata ctctataaag ataatctatt tagagcatat aaatatccaa 1201 atacatacag tctagtggta atagaaaaaa cgatattcga gaagtgtaaa aaaaaaagac 1261 gttggttagt cgagagggga tacccctgtt atatggaatc taatgactat aagctaattc 1321 ttgcagattc gatgtttcga agaatgttca aaaattcgat tgaatttcaa atataatagg 1381 cggtttacgt tatgtaagaa acctatgtat attttatatt agatattgac aagttatata 1441 tgaaagaata tttaatttgc cctacttgaa tttggataga gacaccaacc gacgaagtcc 1501 tttcctattc gtttatgact gcgaattgaa tggataaaca gacaaaatat aaaaaaaggt 1561 cgaagaatga ttagaaaaag aaaaaggaaa tggaaaaact caagttgtat tgattcagaa 1621 agactcaaca atataggaac taaaaaaatc aagtctttct aattatttaa taatattaat 1681 tattatttca actggataaa tattagcaat ggaataatta agtcataatg cattggttga 1741 ttgtatcatt aaccatttct tttttttttt tgtgtgtgtg aggaacttta tcatgaatcc 1801 actgattgct gccgcatccg ttattgctgc tggattggct gtagggttgg cttctattgg 1861 acctggagtt ggtcaaggta ctgctgcggg acaagctgta gaaggtattg cgagacagcc 1921 cgaagcagaa ggaaaaatac gaggtacttt attacttagt ttagcgttta tggaagcttt 1981 aacaatttat ggattggttg tagcattagc gcttttattt gcgaatcctt ttgtttaatc 2041 cgaaaaggaa aagaaatagg agaaatacac atttctttta tagtcttgaa cttgcaggtt 2101 gctttttcac atttataaga aaatatcgct cctacacaat tacttattcg ttgagaaaat 2161 aatacacgga aaggacttaa tttgaggatg aagaattcgt gttacccact cggtttcttc 2221 tttcctcccc ctttttagtc cgaaggagaa gtgttgcaac aaagagagta tttcgaaatt 2281 cacatgaaac ctagtaccta attagtaatt ctataaatcc aataagtatt attcttattg 2341 ggaatctcaa taaaaaaaat aaaattcatt tagaagtagc aaacaattga aataatacaa 2401 cgattttttt gtttatctat aagaggagat catatgaaaa atgtaaccga ttctttcgtt 2461 ttcttgggtc actggccatc cgccgggagt ttcgggttta ataccgatat tttagcaaca 2521 aatctaataa atctcagtgt agtgcttggt gtattgatct tttttggaaa gggagtgtgt 2581 gcgggttgtt tatttcaaaa ataggttgga ttcaaccaac tgtaccgctt tttttaatta 2641 gggcgaaagg tgcatgattt cacgaatgac ttctgaaatt aataataaag aaatcatatg 
2701 taagaactag agcatttcgt gatttgttgg taaatatact ttgattctct atcaaccaat 2761 aatgtgggac cataacatgg ttaaaactta aattgtttga agtccaggca cagcagggta 2821 ttctttctac caccatgtta atacttaaat ataccgaata ccgaggctaa aattaaataa 2881 agtaaagaaa catagttaca aatttttcga tatataacac tcatgtcgat aaaattgttg 2941 gaatctctta tttattgtta taggaaaaat gtttcggctt tttttttatt aagtaaataa 3001 atgccaaagg ctgagtcgat gacctacgta taaaataaga aacatttttg gatttgaata 3061 aaaaaaaaac aactttgctg acaattactt atatattttt ttttgggggg tcagaagagt 3121 tctccgaata ttctagtctt gattagtgat gggtttccat tttttgtcga ctatgaacag 3181 agaagagagg ataagttcat tacattcaaa aaagatatgg aaatttgcca taaaaaattg 3241 aagtaattga gcgtgagagc caaatgaatt gaaaaattca cgtttggttc gggaagggat 3301 catgaatgtt ttgaaatgaa tggaaagata atctactttc attaagtgat ttattagata 3361 atcgaaaaca gaggatcttg aatactattc gaaattcaga agaactacgc ggaaaggcca 3421 ttgaacagct ggaaaaagcc cgggctcgct taaagaaagt cgaaatggac gcggatcagt 3481 ttcgagtgaa tggatattct gaaatagaac gagaaaagat gaatttgatt aattcaactt 3541 ataaaacttt agaacaattt gaaaattaca aaaacgaaac cattcagttt gaacaacaaa 3601 aagcgattaa tcaagtccga cagcgggttt tccaacaagc cttacaagga gctctaggaa 3661 ctctgaatag ttgtttgaac aatgagttac atttacggac catcaatgct aatattggca 3721 tgtttggtgc gatgaacgaa ataactgatt agtccttcta tttaattgta ggtattattt 3781 ttttgtttca aaaaaaaaaa gaatttaaga aagactcatg gcaaccattc gagcagatga 3841 aattagcaaa attatccgtg aacgtattga aggatataat cgagaagtaa aggttgtaaa 3901 taccggtaca gtgcttcaag tgggtgacgg cattgctcgt attcacggtc ttgatgaagt 3961 aatggcaggt gaattagtag aatttgaaga gggaacaata ggtattgctc tgaatttgga 4021 atcaaataat gttggtgttg tattaatggg tgacgggttg atgata // LOCUS DRVSEV1 4245 bp ds-DNA INV 14-AUG-1990 DEFINITION D.virilis sevenless gene, exon 2. ACCESSION M34543 KEYWORDS sevenless protein. SEGMENT 1 of 3 SOURCE D.virilis DNA. ORGANISM Drosophila virilis Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta; Pterygota; Neoptera; Holometabola; Diptera; Brachycera; Cyclorrhapha; Schizophora; Drosophiloidea; Drosophilidae. 
REFERENCE 1 (sites) AUTHORS Michaeil,W.M., Bowtell,D.D.L. and Rubin,G.M. TITLE Comparison of the sevenless genes of Drosophila virilis and Drosophila melanogaster JOURNAL Proc. Natl. Acad. Sci. U.S.A. 87, 5351-5353 (1990) STANDARD full staff_entry REFERENCE 2 (bases 1 to 4245) AUTHORS Michaeil,W.M., Bowtell,D.D.L. and Rubin,G.M. JOURNAL Unpublished (1990) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [2] kindly submitted by G.M.Rubin, 24-MAY-1990, for release after publication. FEATURES from to/span description pept / 2804 + 3629 sevenless protein, exon 2 (AA at 2805) (5' end is putative) IVS 3630 > 4245 sevenless protein intron A BASE COUNT 1103 a 957 c 922 g 1261 t 2 others ORIGIN 1 tataagtgtg tcgtattgtt ggccaataat tagccgagaa tgtcaaatca cttgctgtta 61 attgatgttt atcgagagta gagtcgaact aatcgttgtt acagcttggc tctgcttgtg 121 ctttgaatag tgtactttga gctgcataaa ttcagctgat agagttggcg cattttcggt 181 ggcacttcaa gcattatgtg taactattcc agatagatac ctttgtgctc tggaatcgcc 241 catgacgctg ggtaaatggg gctagctggc tacgtcaacc gaagtttcgt cgcatctttg 301 tcgttgttga tcgtcttgct ctggcttgcg acttcggctg gggttgtttt tatactctgt 361 acacatggaa attggacaaa tggggtataa tgggcagacg gtagcaactc gtaaaccatg 421 aatttcccat atgtatttct gatcgggaca aagagctgcg tcggcctggc tatgtctgtc 481 cgtccgttaa gataatgaaa attcaatact taacaaacgc agttccagct tttcttggaa 541 gtttttaaga aatggaagaa acttttatat ttgtgtgcgg ctagagtctt acgcttgggc 601 ttacacacac ttcttatgct ttatattaag cggcgaaatt tcataacgat cggactataa 661 ataccgaagt tctgccaaaa cacgttgact ttgatgtggg tgctaggcaa gcaagcggca 721 tgctttgggg tgtttgtttt tttttgtgta caatctttgt aaacgacagc ttgtcgcttt 781 gtccagggta acgcttagtc ggtcaattct gatcagagca cttacttgtt tttggagctg 841 tcgcagtcgt tgtcgcagtc gcaatcgccg gttgccgctt gtcgtttgtc gtttgccgtt 901 tgccgtttgc cggttccatc agttaagcga cgatcgcggc acggttaagt acgcagcgct 961 cagatataaa tacatagcca caggctgcag atacatttgt ggatataaag atacggctgt 1021 gacatattag ccgcgcgact ctctgaagta gcttttgaag ttcaatcccg tcccgcactg 1081 
tcccgcttcg cccaatcttg caaaaagggc tacaataaaa tgcaattaag ctgctcgaat 1141 caattctcag caaattgaat taaagcgcgt gaagagctcc tctctacaca cacacacccc 1201 ccctccaccc ctccacccac acccataagc aataacaaca agcccctgat atgtgggcga 1261 gtgttatccg aaactcgaag tccatagcct cgagctcgag aactttgaga actttgccca 1321 cggagtgctg tgaaaatata caaaatgcag atgggcataa acagaaatta acgccaaacg 1381 gaattgctcg ctcatagaat aatgcgaata attaattaat tacgggcaat tttttgcaca 1441 attttttttt ttttttttgt tttttgatgc aattaaaaat gttttgtgtg catattttct 1501 ctggcatttt cgcgcgctta attagtgatt tatttatttt atttatgcat gtttttcttt 1561 tttttttttt tggttttcat ttttctcttg tgctgtgtgt gtttttggag ttaattaata 1621 aagcggcatt ttaacatttt ttgttatttt atgtgcgaat cggcaaatca attgcatttc 1681 attaatatat tcctccgacg gaatttatag cgattccatg cggtttcttc ttaatctgca 1741 tttatgcatt tgtgcggaga aagtgatccc cgtcccaccc aaagacccgc gctctatcac 1801 tcgcacctaa atatagtttt tcccataagg cttaaattgt tgagggggga cttttgtagt 1861 ttttgtggtt tgttgtgatt atttgatgat tattttcata tttattgatt atacacccaa 1921 aaatgttgac aggaatctcc cgatttgccc gattgtattt cactttcccc ttattcagct 1981 tgcaatcgaa gctagaacct tgtgcttgga tatgtgcaca tatttgggtg tcatatttca 2041 gtctcgaaan tttcatatag atcgggcaat atgcagtgca gttaacagag aacggggacc 2101 gaagccaagc cgtggtgtgt ctctataggc agtagctaga gtctgatccg aactccaatc 2161 tgtgttctct atataaaaag ttctccgatt ctttgccaag tatctgaatc aattgcaagc 2221 tttttttttt tatgctcagc agttgcgagc aggagacaaa ccaaacttta agccgggctt 2281 ctatttcagg agattccaat tccaattgtt tgcgtagatc gtaacagatc aaaaataggt 2341 acattgaact gttttggggc aattggttta gtttattatt gttgcgtgtt tgtcgattaa 2401 cataatatgt taattcattt atatgcatgt gccgagacgt ttctgatttt tcgagttttt 2461 cgttttcaat ttcagcaaat tgtgcgctaa aatgtaaatg caaaaacgtc gagccgagct 2521 tcttcaaatc gaaatcgaac gacaacaaat tagaaaccag cccagagaca gtcgcacaca 2581 cacacacaca aacatacact cgagcacaca cacacactcg gacacgcaca accattcgca 2641 cacacacaca cacacacata ccgtggtcaa agtcagagtc ggacggacgg cagctccggc 2701 cagttccgac cgccggaacg cagagtccca cttgccactt ggcattgccg tggcgtaggt 2761 cgagcttaat 
ttttgtgctt cgatgaacat ttaaattgtg gactatgttt tggcgagaag 2821 atgcagcaca gcagcagcag caacagcaac aacaacaaca gcaacagcag cagcagcaac 2881 agccgccgca tccacccaaa cgcttgagct ttagtttcaa tgttaaaatc gctgtcaatg 2941 tgaacaccaa gatgtccacg actcacatca accaggagcg ctccaagcag caaacaacaa 3001 ccggatctcg atctcgatct cgttccaatt ccaattccag tgtcagttgc aaaggcgacg 3061 gcgatcgccg tgttcggcgg cacacaacgc ggcttgtggg cctgcgtcag cagctgttgc 3121 atctgggacg tcagctaaac ccgggccaat tcttggtcac gggtcacggc ggcatctcca 3181 cgatactaat tgccaatttg ctgctgctgc tgctgctctc gctctgctgc aatgtctgtt 3241 gccgctccca catcgagccg gatcaaaact tgacaccaac aacaacatca ccagctgctg 3301 ttgcagttgt tccgatgttg ctgcccctgg cccagaccca tatgcgaccc cagctggaca 3361 gcgatgtggt cgagaaggtg gccgtttgga caaagcatgt cggtgcggcg ccacccagca 3421 tcgccgaggg cattgccatc agcagcgtgg tcagaatgcc acccagcatc caaaccccaa 3481 ccgaaactgt gcgcaggcaa gaacaacaac gacaacagca gcaacagcaa caagaggcgg 3541 ctgcggctgc ggctgccgac gccgccatcg atgagcgcat cgtactggag cgcgtcaccc 3601 gcgactgtgt gcagcgctgc attgtggagg tgagtcgcta gttgattaac gggcggttta 3661 aactataact aaatggaatt gttattgtta ttgttattgt gaccttgagc aacgtgcttg 3721 tgcaccgtaa gtacagtttg gtatgtccgt tattccatac ttcctggcaa agcgcatgtt 3781 agttaacaga atccttcaac ttatcgtaac ttacttgctt tgggtgtccc aatgcacttg 3841 acttgccttg acagttcttt aatactctta aggactttct agttattgta gagctttgct 3901 tgcactaaac taacttgttc agatcgttct aaaatctttc tggcaaagct catgttagtg 3961 aactacagca cacttatcat aaagctgtag ctctggcaag caaggcttta agggcttaag 4021 acttaaagag tttctagtgc tggtatatca atgaagccat aaagcttttt tataaactta 4081 acttgttccg acatttacat tatacttcct ggcaaagcgt atgttagtaa aatgcagcat 4141 tttaaacgta tcgtaacttg ctctggatat caacatacgt ggctttaggc catagattgt 4201 gctagcaagc ttaagaattt aacattaacg ctccganaaa aagtt // LOCUS DRVSEV2 5687 bp ds-DNA INV 14-AUG-1990 DEFINITION D.virilis sevenless gene, exon 3,4,5,6 and 7. ACCESSION M34544 KEYWORDS sevenless protein. SEGMENT 2 of 3 SOURCE D.virilis DNA. 
ORGANISM Drosophila virilis Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta; Pterygota; Neoptera; Holometabola; Diptera; Brachycera; Cyclorrhapha; Schizophora; Drosophiloidea; Drosophilidae. REFERENCE 1 (sites) AUTHORS Michaeil,W.M., Bowtell,D.D.L. and Rubin,G.M. TITLE Comparison of the sevenless genes of Drosophila virilis and Drosophila melanogaster JOURNAL Proc. Natl. Acad. Sci. U.S.A. (1990) In press STANDARD full staff_entry REFERENCE 2 (bases 1 to 5687) AUTHORS Michaeil,W.M., Bowtell,D.D.L. and Rubin,G.M. JOURNAL Unpublished (1990) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [2] kindly submitted by G.M.Rubin, 24-MAY-1990. FEATURES from to/span description pept + 1897 1971 sevenless protein (sev), exon 3 4050 4115 sevenless protein, exon 4 4177 4512 sevenless protein, exon 5 4574 4810 sevenless protein, exon 6 4867 + 5186 sevenless protein, exon 7 IVS < 1 1896 sev intron B IVS 1972 4049 sev intron C IVS 4116 4176 sev intron D IVS 4513 4573 sev intron E IVS 4811 4866 sev intron F IVS 5187 + 5687 sev intron G BASE COUNT 1665 a 1222 c 1168 g 1629 t 3 others ORIGIN 1 ccaacagata taaccatgtc cctctgtcct tttatctgtc accaagcgta ctagtctctt 61 agttttaaag ctattgtctc gaaacctctc tgttgcaagc agtatatatg ttagtactgg 121 atcggattgg atcgaagttc ttgtaagaaa aacttctttt caagatatct tgaccaaaac 181 cgtcctctac ttgttaaact atgctccttc tatataagct caattatatg taatttccat 241 ccttaagata ggtaaatgat caaaatgcga tgcccaaatg tttataagct ataaacttat 301 ctaaagtctg gcattttgta gacatttgag ctgtttataa gttcagttta agctgagata 361 atatgaaaac ttgaatataa tttagtattc ggttggactt tgtattaagt tttgacataa 421 ataagggctt tttttatagc ttgttgtttc gcagctttaa tgctgagaca gaccataacc 481 tgcgcttgtc tgttagaaat ttgtctgtta aagacctttg atccattatt ttactatgta 541 ttactaggtt aaccaagatt cgagaagctc tcgagttgtt tgccttcgct ctgaattggc 601 atgtcgctat ttttggctgc ccctcgccgc aacaacaatg aatcgatcgt ttgcggaccg 661 agtcggagat ctctttacga tcgatcacgt gctaaaagcg aaaagaaaga gaaaaaaagg 721 cagaaactcg tttggcgtga 
cgccgttaaa tgtcgatttg gtcaatagat ttatgcattc 781 gtaatgcgcg aatatctcag aggttgttaa aatgcctcta aagtcaataa tcaggtctat 841 gtttgagttg gcggctgtca cttgagaaaa gtagctaaat caaatcatca aaatagctat 901 aaaaagctgc attacctaag cagagtgatc tacaaaagcg ctgctgcttt ttgcggagtt 961 ttcttttcaa aaaaaagtta gactaaataa attagctaca aaatagccag aactgcaggc 1021 aatggcaaat atgacaagtt tcgtgtgaaa aacagcccat tctagcttgt aaatgacagc 1081 acaaaggtcc cagctctatt tcggcagtaa gccagaaaaa gggcacttct ttgttttctc 1141 gctgaaaata gccaaaatct agttttgaaa tagccataac tgactataaa atagccaaac 1201 tagcagacac ttgcaaatgt cagctattat caagttgggt agttgggtaa aaaaactcta 1261 gctagaaaat ggcaaaattt gttcgttgcg gatagggaga atagccaagt ctcgctataa 1321 aatagctaaa ttgacaacac aactctagcc tttttttttc aataaagcta gaacaagggc 1381 taaatcagct aaaaaagaat ttccggtatg caaatagaaa attctagtta aataaagcag 1441 aaatagtaat atactagctc tagctctttg ctctatttag cttgaaaatg gccaagtata 1501 cccaaaaaaa aaaatagctg atttctttct ttgctgtaat cagttagaaa atagtctagt 1561 ctagctagaa aatagtgaat tgctaacata ctgatccaag ctcactctct ctgttttgct 1621 tatatttagc ttgaaaatag ctatcttggc aatattggct gtagccaatt ctcgccctgc 1681 ctgctaccga tctgcctttt cacattgctt ttatctataa atgttaaaga ctgtgaattg 1741 attcattgcc attgacattg aattgaaaat tcttcccacg atttggattt cccttactct 1801 ttctttcctt gcccctgctt gctgcggcat ttccgttgca aatcggggct cttttgtgct 1861 aattgatttt cacttggatt tctttctttt ttccaggagg atctgtttct ggacgagttt 1921 ggcattaaat gcgagaaggc cgacaacagt gacaagtgct ataaaacacg agtaagtctg 1981 ccccccaaaa accaacacgt ccccctcgcc gaaacggtgt caagtgtttt gacaataacc 2041 ccccctcaag catgccccat tatccccaac taagcaaaag gccacaaagt cgcgtaattg 2101 gaaaccacct ttcaacttta ccagacaacg tgacgacgga ccgagcgacc gaccgaccga 2161 atgacccata aaaaaaaaca agaacaaatt cccagaaacc agaaacgaaa cgtatttacg 2221 gctcaaaaac cctgacaaaa aggaatatac ttaagcaaac gcacgaaaag aatggcaatg 2281 tctgtcttca ttgagccgaa gattaaatac ccttttatac agctcgtaat tgatgagaat 2341 atcttgcaat ggattctacc ttaccatagg atctatatga tatagtagtc caatgataac 2401 agaaagcaag gatatctctc acnagtatag 
aaaaagattc tctagctgtg cgatataaat 2461 ataaatagaa ncccaacaca aataactttt gcctgtgaca tggctattca caaatgccat 2521 atgccctttt gatacacact tgcaacagtt acagggtatc aggagtggca agcgagggat 2581 agccagcggt aaggccatca ggcacttgtt aaggaaaaat cataaacaca ttcgatatcc 2641 aagccagttc tgcctaacat gtatttcgga ttaaccacag ccagttgagg ttgggagtag 2701 ggagtgggat cgacctttga tgtgattaag ttaccaagca acacagctga gcagcttttt 2761 actggctttt aattgtcgcc atgctcgtga gtttagctag ttgcatacga actcaaagtg 2821 catcaggcgc ggccaatcgt gctccagctc cagacagcaa aggaaaccgt cgaacaacaa 2881 ttgtaaacgc tgctgtcggc tctcaattag tttgcttgca aggcatcgga tccggcggtt 2941 aatttccgtt tgagttttac agcaatccat cagggtacaa tagtttatct agaaattagc 3001 tgtgagcttg gctagcgcta agagacttac agactcgtac ttacctaata ataacatgga 3061 attttgctat cacgaaatac aagagtagtt attgaacata aaatatattt tcagagcata 3121 gtctaacagc tcactagtta tctgaattgt gtacgcccaa aggtaaatag tgttgagtgt 3181 gaaatagttt aaatatgggg aacagttggc tctgcttttt atacccttgc agagggtatt 3241 ataattttgt cgtgaaatgt gtaacgcata gaaggagaca tctctgaccc cataaagtat 3301 atatattctt aatcagcatc aacagccgag tcgatatagc catgtccgtc tgttcgtctg 3361 tccgtctgtc tgtttccatg tcgaactagt ccctcagttt taaagctatc ttaatgaaac 3421 tttgcagaac tccctctttc tgttgcactc agcacatatg tgaaaaccag ctggatcgga 3481 ccactatatc atatagctgc cataggaacg atcggtcgaa aattaagttt ttgtatgaaa 3541 aacattttgt tatcaagata tcttgaccaa actcggcatt tattagtctt actatgctcc 3601 tcatatatat gcaaaatctt attaagatcg gaccactata tcatatagct gccataggaa 3661 cgatcggtcg aaaattaagt tgtatgaaaa aacattttgt ttatcaagat atcttgacca 3721 aactcggcat ttactatttt cccggtactt cttagatagg ggcaaagcac tatgagcatt 3781 atgaaaaggt tgggtctgca agggtattag atctttggcg tgccgaagat agcccttctt 3841 tctcgttttt actggataag caaacgagtg cgaggcaaac gagggtatct cctagactac 3901 ttttgtagtg cactgatatt aatagctttt taaggctgat taaactttca gctggcttta 3961 gataagcctc actttactat gcacctcctg tcttttccct gatgggccaa ctcaacctca 4021 actaatctga tcgtaatctc tctttgcagt gcaacaaggg ctgtgcgcaa tggtatcgcg 4081 cgctgaagga gatcgagccc tgccaggagg cctgtgtgag 
tgatatctga cagctgccgt 4141 cttaatgata gcttcttaac tgcattattt tgataggctt ccacacagtt ttatccgtac 4201 gatatgccct gcattggagc ctgcgagacg gcgcagcgcg attattggca tatgcagcgg 4261 ctggccatgg ctcggttggt ggagacaacg cagccgcagc tgttggagat gacggatgag 4321 tcgagcactt tgaccatcaa atgggccatg cagttcccgg agaactattt ggccagtcga 4381 ccctttaata tacaatacca gcaagtggac aatcaatcgg aacctgaatg gcacaatctg 4441 gcggactacg actgcgatga atactacgta tgcgaaattc tggaggcgct tgtgccctac 4501 acccgctaca aggtatgtcc atcatatatt aagtgaaata tctctggaga tttaaatttg 4561 aactcttttt cagtttcgct tcgagctgcc atttggggag agcagcgaag acgtgctcta 4621 ttcgccagca acgcctgttt acgaaacgcc gatggagggc gcccccatct ctgcgcccat 4681 tatcgtggca ctgctggcgc tggatgagca tcatgtgttc gtgcactggc gtcccggacg 4741 ctatagcaat gcgcccattg aaggctatcg cgtgctcttg accagcgcag gcaacaccag 4801 ccgcgaacag gtacgtaacg agctccaact gccaagctgc caaatctaaa cgctcatctc 4861 taccagctgc tgcctgctca acgtactagc tgcatctttg cccagctgca gccgctcacc 4921 aactacaccg tggcgctgac catgatcaac aaacagggcg agggaccgtc tacggtggtc 4981 agcattgtga ccaagtcgcc cctggagcca cagcagctgc agagcgtgct gcttgcgagc 5041 gagcacagca tcatctggca atcactggag ccagcgggcg agacacgcct cctctacacc 5101 tctgagccgg cggccattag cgattttacc ttttcgcagc gcgagcagcg gctctggctg 5161 ctggacgaat tgggtcagtt gcacaggtga gcaattggcc atttcagtcg ctcttcaatt 5221 atcttttcga gagtgaagat tatttttttt tgtaggattt cgaggtgaaa ttacgtaaag 5281 tttctacagt aggaaaagct ttctagctag atcatcagtc agtcagtcat atatggatgg 5341 ggagctgcag ttaagctcta gccaagacca gattttgttc agcctcttta gtcacagtta 5401 aatcagtcag tcagtcaatc actcagtcag tcgaatgaat atataattta gtacgatcaa 5461 ttgtgactgt tccttgatca tcagtcagtt agttattcaa tcngtttaat caaaaatcaa 5521 tcagtaaagt aaattggcta ctacagctac aggttgtcca aaaaaatgga tagtttagga 5581 tacattcaat cagccgtcaa tcaatcagtc agtcataata tgatatactt ctattatgcc 5641 tctctatgag ttcagctaca gctaagacca actggggtgt gtgttga // LOCUS DRVSEV3 8779 bp ds-DNA INV 14-AUG-1990 DEFINITION D.virilis sevenless gene, exons 8, 9, 10, 11 and 12. 
ACCESSION M34545 KEYWORDS sevenless protein. SEGMENT 3 of 3 SOURCE D.virilis DNA. ORGANISM Drosophila virilis Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta; Pterygota; Neoptera; Holometabola; Diptera; Brachycera; Cyclorrhapha; Schizophora; Drosophiloidea; Drosophilidae. REFERENCE 1 (sites) AUTHORS Michaeil,W.M., Bowtell,D.D.L. and Rubin,G.M. TITLE Comparison of the sevenless genes of Drosophila virilis and Drosophila melanogaster JOURNAL Proc. Natl. Acad. Sci. U.S.A. 87, 5351-5353 (1990) STANDARD full staff_entry REFERENCE 2 (bases 1 to 8779) AUTHORS Michaeil,W.M., Bowtell,D.D.L. and Rubin,G.M. JOURNAL Unpublished (1990) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [2] kindly submitted by G.M.Rubin, 24-MAY-1990. FEATURES from to/span description pept + 340 4973 sevenless protein (sev), exon 8 5836 6113 sevenless protein (sev), exon 9 6174 6365 sevenless protein (sev), exon 10 7286 7722 sevenless protein (sev), exon 11 7791 8175 sevenless protein (sev), exon 12 IVS < 1 339 sev intron H IVS 4974 5835 sev intron I IVS 6114 6173 sev intron J IVS 6366 7285 sev intron K IVS 7723 7790 sev intron L BASE COUNT 1930 a 2456 c 2435 g 1958 t ORIGIN 1 tctggtagat gagcgttaga ttggcagctt ggcagttgga gctctcaagc tctatatata 61 tataccattt gtcctgactg attgactgat tgacggactg attggggatc taaatttccc 121 actcccataa ctcggccaaa actcaaccga ttttcatgag gcttggcttt ttattcatgg 181 tttggcctgt atatcaattt ggcattttca aagatccttg agagaattca ttttataggc 241 gccacgggca agggcgggtc taccggctgg tccgtcataa gtacatcgcc caatctctaa 301 gctctacttc agtttagact tgcaattttc tgtttacagc cagctgctgg atgagacgac 361 gacaagcgct gcccggcgtc tgcggctgga gctgccctca aatgggagca gccagtggac 421 gccacgcaag ctgagcctgg actggttgca gcgacgcctc tacatagcag cgcaggcgaa 481 cagcagcgac ggggcagaag gaggatttga gctgtttagc agcaatctgg agggcggcga 541 tgtgcagatg gcgggcgtgc agctgggcct ggtcgtggag cagctggagc tggacgccct 601 gaacggctgg ctcttctggt gcgatgcaga cagcctctgg cgtctcgatc tgagcagcaa 661 gcagcagctg cgccttacac 
agcccgcagg agcacccgga cgtttcatgt tggagccgca 721 gcgctggctg ctgcacgtgc tgctgccgca ggagaatcag ctgctggagc tgagctacga 781 tggtgggcac aagcatgcgc tggcactgtc caacgacagc tggcgcggat ttgcctggtc 841 cagcgatcag gcacagctgc tactggcgaa tgagacacag ttgcagctgc tggacggaca 901 gacgctggtg ccgcttgcca actggtcgcc ggatggcggc tgctgcgctt tgctgcccct 961 ggaacgtcgc cgccagccgc tgtcactgga gccgccggcg ccgcgcgagt tgcgcgccct 1021 gctaggagcg cagggcgcgc acataacatg gcagccgccg gcagctaatc cctaccagac 1081 tgcaacggcg gccgcgcgca atttcagcta cgagctggag gtgctcgacg tggccagcca 1141 aagcgcctac aacatacgca acatacgcgt gccccacttt ggcctggagc gcctgcaggc 1201 ggacaatctc tatcagctgc gcgtgcgcgc aaacaatgcg gccggccgtg cgggcgtgtg 1261 gacagcgccg ctggcgacgc gcacctggcc cctgggcgat catcgtctgc gctgggccac 1321 gcagcgcggc tcgctgtaca ccacgaacga gctgggcggc cagctgcagc cgctgcccgt 1381 gcagctggca tcgtcgcccg ggccgctagc cctggtcaat gccagcgtgg cgtactatgt 1441 cagcgggcgg gagcaatcgc tgcactgcgt caatttgctg cagccgcagc tgagctgcac 1501 cgacgagcgg ctggagcatg taggcgccgt tgcctatgac tggcgcggcg gtctgctcta 1561 ctggacggat ctggcgcgcg attgtgtcca gcgtctggat ccgttcagcg gtgagcgcga 1621 gctgttgccc atctttggcg cccgccattt ggccctcgac tcggcccagg gacatctcta 1681 ctactcgagc tcggcgcatt tggcgcgccg ttcgctcagc gctctctcca cccatcagcc 1741 ggagctggag tactatcatg tcaatggact ggccggccag attagcggct tctgtctgga 1801 tctgccgcag cgacacatct actggcttgt cgccggcaac tcggcgctgc atctctatcg 1861 cacagcattg tcggcgggcg gcagccaggc cgccgtccca ctgcagctgc tgaccacgct 1921 gccggcggcg gacgcactgc cgcatacgct gcagcatctg gcgccgctgg gtgcgctgct 1981 ctggctggcg gcggatggcc ggggcgcgca tctgctgcgg ctcgcggccc aactggagac 2041 ggacacggat actatgcgtc tgctgccgga gggcctggtc gagccgctca gcgccgtgca 2101 gctgctggag cgttcggctg ggccgccgcc accgccgccg gacgagggcg tgcgtccgct 2161 cgccgtgccg ccggacagtg tgcacatcga cgagggcggc cactggaatg actttcgtgt 2221 gcgctggcag cctgctgcca gtggcggcaa tcacagcgtc tgctacaagc tgctgctgga 2281 gcacggcagc gagcgtctga tcacgctgga gctcctcacg ccctttgccc gcatcacaca 2341 actggcccag gcgccgctcg gtctgcgcat 
cagcattacg ccgcacaccg cctggcgcgc 2401 cggatccacc acccgggtgc agctggacac gcccgttgcg gcgcccacgc agccgcggcg 2461 gctgcgcgtc tttgtcgagc gtcaggcggc gccgctgcag ttggcgccga atgtaagcgc 2521 tctgctgcgc tgggatgtgc ccgaggagca tgcgggcagt cagtcgctgc agtatcgcat 2581 cagctgctgg cgcggcagcg agttgcattc ggagctgctg ctcaatcaga gcacattgga 2641 ggcgcgcgtc gagcatctgc agccggagga gacgtatcgt tttcaggtgc aggcccatgt 2701 ggccgccacg ggcctggccg ccggcgccac cagccatgcg ctgcacgttt cgcccgaggt 2761 gcaatcggtg ccgcgtctgc tctatgccaa tgccgagcac attggcgaac tggatctgga 2821 tacgggccat cgcaagcagc tggtgcacac cgccagtcct gtcgagcatt tggttgtgct 2881 gcagggcgag cagcgtttgc tttgggtcaa cgagcacgtc gagctgttga gccatgtgcc 2941 gggtaaggcg ccggccaagc tggcacgcat gcgtgccgag gtgctggccc tgaccgtcga 3001 ctgggtgcag cgcatcgtct actgggccga gctggatgcc gcggacggcg gctgtgtcat 3061 ctatagcctg gatctctgtc gcttcgatgg acgcattctg cagggcgaac gtctctggag 3121 cacgccgcgc ggccagctgc tgcgcgatct tgtcgccctg ccgcatgccc gccagcttgt 3181 ctggctgcag cacgatctgg actcgcgcaa tgccacgctg cagggtcgca gcctggccaa 3241 tggcagcgcg ctcacctttg agggcgtcac gttaccgctg tggcgcctct tcgagggcag 3301 ccaggagccg ctggccgaga cccttaatct ggtggatcat ttgggccgtc tgtgcgttta 3361 ccatgtggcc cggcaactgt gcacgtccag cgcactgcga gcccagctca atctgctgaa 3421 cgatgatatc ggccagctgg cccaggatcc gggctatctg tatgctctgc gcaatggtag 3481 tgtgcgcgca tatggccgcc gccggcagca actggaattt ctgctggaac tgcagccgga 3541 cgaggtacgt ctgctgcgcg cctacaacta tcaggcgtat cccagtcgtc gctgcctgct 3601 gctgcccacc acggctgccg ccctcgaatc aacgccgtcc agctgcgagg agacgcagtg 3661 cagcctccag ctgccggcgc tcagcgccgc accagactgc ccgctgccgg tgcccgggct 3721 taactatcag ctgaatctta gcagcagcag cagaagcgca cagctggagc tgcgctcgct 3781 gcactccgcc gccggcctga cgctcaacat aagtcaactg cagccctatc aggcctacga 3841 gctgcgcgct caggtcggca gctactatca gcaacaactg ggccaggagc cgctgcagct 3901 gcccgtcctg acgctgcaca cggctgcggc cacgcccagt gcgccacgaa atttcagcgg 3961 acgggcactg agtcccagcg aactggagct cagctggctg gcgccgctgg agctgcgcag 4021 cgcgagtgtc tattatacgc tgcactggca gctgcagctg 
gaggataccg aggagcagtc 4081 acaggagcag cccgcccagg agcagcgcgt ggagacggcg ggcgtgcagc ggctgacggg 4141 tctgcagccc gcacgactct atcaggtctg gctgcaggcg catgccacgc ccagcaaata 4201 caatagcagc gggcggctgc tgatccgcag ctatgcgccg ctgccgccgc tgcagctgat 4261 cgagctgaac gcgtacggaa tgacactcgc ctggccgggc acaccggatg cgctcagttc 4321 gctcaccctg gaatgtcagt cgctgcgcga acagctccag ttcaatgtgg ccggaaatca 4381 tacacaaatg cggctggcac cgctgcagcc gaagacgcgc tactcgtgcc gcctggccct 4441 ggcgtatgcc gccacgccgg gcgcacccat ttactttggt cccagtcatg agtacgaaac 4501 gctgggtgat gcgcccagcg cgccgggtcg gccgcagctg gagcatatag ccggcgaaat 4561 ctttcgcgtc agctggacgc cagctctgga caacggttca cccattttgc tatacaatct 4621 ggaggctctg caggcccggc gcaccaatcg tcgtcggcgg cggcgccgag agactacgct 4681 ttccctattg ccctgggccg aggagccgct ggtcatcgag gatcagtggc tggacttttg 4741 taataccacc gagctgagtt gcattgtgcg cgaattgcat acccgtcgcc tgctgctgtt 4801 tcgcgtccgt gcccgcaatc ggccgcacgg ctggggaccc tacagcgagg acagtgaacg 4861 catcgctgag ccctttgtct cgcccgagaa gcgcggctcc ctcgtcttgg ccatcattgc 4921 gcccgctgcc attgtttcca gctgtgtgct cgccttggtg ctcgtgcgca aatgtaagtt 4981 caaaatgaat atttttccgc ccgacaatca gtcaatcatt caactagtcg tcggtcagtt 5041 tgtcaatctt tcagtcgaag aatcaatcga taagaaaatg agtggctaaa taagtaagtt 5101 agtcaatcaa atagtctgtc tttttgtcaa tcagtcgatc aatcaatcgc tcaatccatc 5161 tggtattgcc attcagccaa ttaaccagtc aatccagcaa tcattcaatt agtagtccgt 5221 aggttatcag tcaactatga taaccaatca atccgtcagc ttgataattt gtcgtttaac 5281 tagtcaagaa atcaatgtta atcaagtaat tggccaatta atcagtcagt tcagcagact 5341 gtctagtata actgtgaatt gaaggttcat gaatctgtca gtcaaacggc caatcaaatc 5401 aatcatttgg aaaatttttg atcacccaat ctcaaagtaa ttaagtgagg caagcagtca 5461 attaggcaga taaacagtta atcaggatgt cagtcaatca atcaatcata aatagtctgc 5521 cacctaatcc atcatgattc aatcagtcaa tcagccaata ctcacgtagt caattgatat 5581 tccggacaat cagtcagcca gtctgaaatt aatcagtaaa tcaatcgaat atgagtcagt 5641 caatcaatca attaacacgt aatcaatcaa tcaatcagtc aatggctcat taaatcaata 5701 agcagatcag aaaaatgttt aatcattcac tcaatcagtc agcctaatca 
atcatgaccc 5761 tttcattcaa tcaatcaatc aatcagtcag tcacatacac aagactcgct aattagtaaa 5821 tctgcgattt tgcagtgcaa aagcgtcgtc atcgcgccaa aaagctgctg cagcagagcc 5881 gcccaagcat ctggagcaat ctttcggctc tgcagacgca gcagcagctg ctggccgcac 5941 ggagtcgcac cttctcgatg tcgctgagcg atgcggacat tgcgctgctg ccgcagatca 6001 actggaaccg gctgacgctg ttgcgtttcc tgggcagcgg cgcctttggc gaggtgtacg 6061 agggtcagct gcaggccgag gacgaggcgc agccgcagcg tgttgccatc aaggtgagtg 6121 gagcagcagt tttttttttt gccacaattc aaggattctc ctcgccttta cagagtctgc 6181 gcaagggcgc cagcgagttc gcggagctgc tgcaggaagc gcagctgatg agcaatttca 6241 agcacgaaaa catcgtttgc ctgattggca tctgctgtga cacggactcc atatcgctga 6301 ttatggagca catggaggcc ggcgatttgc tcagctattt gcgtgcggca aggcccagtt 6361 cacaggtgag cgagcaagca ttaatattta ttatacgctg tagccattgc aaatgggtaa 6421 agggtaaact gtttttgtgc atatgcatgt gacgcacagt atgtccatgc ctttaatctt 6481 gatcaggacg cactgcgttc gtctaggcag gttcctcggt ctctcctttc gttttccaaa 6541 tatcgtaaaa tatcgaaaca aggttttttt tatttacttt atttaattta tttgttaatg 6601 gtcgacaaac acgagtgtct ttctaattat ttaatacatc aaatcatgaa aattaaatac 6661 taaaaaccta aaagcatcac cgactagata tataaagcgt taagtagata ggataaagag 6721 ataccatatg gatatattta gagctcatag cgggaaatag tcagtgagtt cccgaaaata 6781 tatgtacatg ttgaagatta taaatgctag atacaccaca atatggatga ggattaagcg 6841 cctttttgga tgcctccaat caaaactatt ccaaaatcaa aaggatttag aggttccgca 6901 ttcagctttt gtatgtacca acattttgat gccatattta cacgctgaaa gttttacaaa 6961 gatcggacaa taaacaccaa agttcctcga gaatgccata actgaaaatg tggaaccaaa 7021 gaaaagtaga cagaagagag acaaatgtga gtctgttgtg gcagccggat acaggatatc 7081 tctagtcgag cacttgaatt gagcactctg atgccatatt tatacgctga aagtttcatt 7141 atgatcggac cataaacacc gaagatcctc gagaattcca tatctgaaaa tgtggaatca 7201 agcaatagta gagattcagc tagaatgtgt gtgtggctac agggaatatt ctagtcgagc 7261 actcttatgt atggttcgtt tgcaggaagc gctatccaag ctgcagctgc cggagctgtt 7321 gagcatgtgc ctggacgtgg ccaatggctg cagctatatg gaggatatgc actttgtgca 7381 tcgcgatctc gcctgccgca attgtctggt gtcggatgga gccgcaattg ggggacggcg 
7441 cattgttaaa attggcgact ttgggctggc acgcgacatc tacaagagcg actattaccg 7501 gaaggagggc gagggactgc tgcccgtgcg ctggatggcg ctcgagagcc tggtggatgg 7561 cctatttagc acacagtcag atgtgtgggc cttcggtgtg ctctgctggg agatattcac 7621 gctcggccag caaccgtatg cggcgcgcaa caacttcgag gtgctcgccc atgtcaagga 7681 gggcggacgg ctgcagcagc cggagcgctg tcccgagaag ctgtaagtgg gccagctctt 7741 ccagttaact tctctctact caatcactgt cccatttatg ctgtccacag ttatgcgctg 7801 ctcttgcaat gctggcgctc tgagccctgg gagagaccca gctttaagcg ctgcctcagc 7861 acgctgcagg ccctcagctc ggatttgcgg cgcactgaaa tgctggccac ggatgagacg 7921 ccgttggttt ctgcattgtg cgctttcaag ccggacgcca aggtgcgttt cgatgatgcg 7981 ccgcagcgtc ttacgctgca cttggacgcc aaagatacgg ttagcacaac cgatgccgat 8041 acgacaggaa gcccaaccac acccaccgca ccaaccacgc ccaccacgac caccagcaca 8101 atcgcagtcg taagcacagc tccatcgtcc gaaaatggcc agctctatgc aaatgagggc 8161 atttcgggcc tttgaaatcc cctttcattt ggatctgccg tcgcaagttg cggcgccaca 8221 aattgatgaa cattcgcatt cgcatgtttg ggcgcactca accatgtaac gcccacttgc 8281 cgaaacaccg cccacacgcg catacacgat tgtgggctta gaattatggt ttctatttaa 8341 tcgatttact attctgtact gaatccaaaa gctaaacaag accaaaagat tgacctcaac 8401 aagtcctaca tgtaaatcaa atggttctgt ttagtttttc ttttttgttt ttgttgtttt 8461 tttttttttt ttggtatacc caacaagacg agtatgctta ttcgaatata caagtagagc 8521 ctaaagtcga atcaataaat atcatgatat ttttgtagaa atagccttta tcattaaatt 8581 attatatttt gcctcaatca ggctgattga cggttgatga tcaatcaaat ataaggtatg 8641 cgctactcga aaatattata aataaatgaa ctcaacagta aatgaagata ttcttgtaaa 8701 accaggctgc atgttttagt ctgcctgatt gattagctaa cggactgact gatcatcagt 8761 caatcttttc gagtatgcg // LOCUS ECOLAMBAA 186 bp ds-DNA BCT 14-AUG-1990 DEFINITION E.coli lambda receptor (lamB) gene, 5' end. ACCESSION M24997 KEYWORDS lamB; lambda receptor. SOURCE E.coli (K-12) DNA. ORGANISM Escherichia coli Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Facultatively anaerobic rods; Enterobacteriaceae. REFERENCE 1 (bases 1 to 186) AUTHORS Clement,J.-M., Hedgpeth,J., Hofnung,M. and Jacob,F. 
TITLE Sequence de l'ADN correspondant a la region du peptide signal pour le recepteur de lambda chez E. coli K12 JOURNAL C. R. Acad. Sci., D, Sci. Nat. 289, 1033-1036 (1979) STANDARD simple staff_entry FEATURES from to/span description pept 82 > 186 lambda receptor precursor sigp 82 156 lambda receptor signal peptide matp 157 > 186 lambda receptor BASE COUNT 51 a 46 c 48 g 41 t ORIGIN 1 ccggcgttta agcaccccac aaaacacaca aagctgtcac aggtgatgtg aaaaaagaaa 61 agcaatgact caggagatag aatgatgatt actctgcgca aacttcctct ggcggttgcc 121 gtcgcagcgg gcgtaatgtc tgctcaggca atggctgttg atttccacgg ctatgcacgt 181 tccggt // LOCUS HUMCRPR 1797 bp ss-mRNA PRI 14-AUG-1990 DEFINITION Human cysteine-rich peptide mRNA, complete cds. ACCESSION M33146 KEYWORDS cysteine-rich protein. SOURCE Human normal term placenta, cDNA to mRNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 1797) AUTHORS Liebhaber,S.A., Emery,J.G., Urbanek,M., Wang,X. and Cooke,N.E. TITLE Characterization of a human cDNA encoding a widely expressed and highly conserved cysteine-rich protein with an unusual zinc-finger motif JOURNAL Nucleic Acids Res. 18, 3871-3879 (1990) STANDARD simple staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by S.A.Liebhaber, 22-MAR-1990, for release after publication. 
FEATURES from to/span description pept 36 617 cysteine-rich protein BASE COUNT 405 a 507 c 493 g 392 t ORIGIN 1 cctgccgccc ctgcgccgcc gagccagctg ccagaatgcc gaactgggga ggaggcaaga 61 aatgtggggt gtgtcagaag acggtttact ttgccgaaga ggttcagtgc gaaggcaaca 121 gcttccataa atcctgcttc ctgtgcatgg tctgcaagaa gaatctggac agtaccactg 181 tggccgtgca tggtgaggag atttactgca agtcctgcta cggcaagaag tatgggccca 241 aaggctatgg ctacgggcag ggcgcaggca ccctcagcac tgacaagggg gagtcgctgg 301 gtatcaagca cgaggaagcc cctggccaca ggcccaccac caaccccaat gcatccaaat 361 ttgcccagaa gattggtggc tccgagcgct gcccccgatg cagccaggca gtctatgctg 421 cggagaaggt gattggtgct gggaagtcct ggcataaggc ctgctttcga tgtgccaagt 481 gtggcaaagg ccttgagtca accaccctgg cagacaagga tggcgagatt tactgcaaag 541 gatgttatgc taaaaacttc gggcccaagg gctttggttt tgggcaagga gctggggcct 601 tggtccactc tgagtgaggc caccatcacc caccacaccc tgcccactcc tgcgcttttc 661 atcgccattc cattcccagc agctttggag acctccagga ttatttctct gtcagccctg 721 ccacatatca ctaatgactt gaacttgggc atctggctcc ctttggtttg ggggtctgcc 781 tgaggtccca ccccactaaa gggctcccca ggcctgggat ctgacaccat caccagtagg 841 agacctcagt gttttgggtc taggtgagag caggcccctc tccccacacc tcgccccaca 901 gagctctgtt cttagcctcc tgtgctgcgt gtccatcatc agctgaccaa gacacctgag 961 gacacatctt ggcacccaga ggagcagcag caacaggctg gagggagagg gaagcaagac 1021 caagatgagg aggggggaag gctgggtttt ttggatctca gagattctcc tctgtgggaa 1081 agaggttgag cttcctggtg tccctcagag taagcctgag gagtcccagc ttagggagtc 1141 actattggag gcagagaggc atgcaggcgg ggtcctagga gcccctgctt ctccaggcct 1201 cttgcctttg agtctttgtg gaatggatag cctcccacta ggactgggag gagaataacc 1261 caggtcttaa ggaccccaaa gtcaggatgt tgtttgatct tctcaaacat ctagttccct 1321 gcttgatggg aggatcctaa tgaaatacct gaaacatata ttggcattta tcaatggctc 1381 aaatcttcat ttatctctgg ccttaaccct ggctcctgag gctgcggcca gcagagccca 1441 ggccagggct ctgttcttgc cacacctgct tgatcctcag atgtggaggg aggtaggcac 1501 tgcctcagtc ttcatccaaa cacctttccc tttgccctga gacctcagaa tcttcccttt 1561 aacccaagac cctgcctctt ccactccacc cttctccagg gacccttaga 
tcatcactcc 1621 acccctgcca ggccccaggt taggaatagt ggtgggagga aggggaaagg gctgggcctc 1681 accgctccca gcaactgaaa ggacaacact atctggagcc acccactgaa agggctgcag 1741 gcatgggctg tacccaagct gatttctcat ctggtcaata aagctgttta gaccaga // LOCUS HUMCSPBB 910 bp ss-mRNA PRI 14-AUG-1990 DEFINITION Human cytotoxin serine protease-C mRNA, complete cds. ACCESSION M36118 KEYWORDS cytotoxin serine protease-C. SOURCE Human lymphokine activated cell, cDNA to mRNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 910) AUTHORS Klein,J.L., Selvakumar,A., Trapani,J.A. and Dupont,B. TITLE Characterization of a novel human cytotoxic lymphocyte specific serine protease cDNA clone (CSP-C) JOURNAL Tissue Antigens (1900) In press STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by J.Klein, 06-JUL-1990. FEATURES from to/span description pept 53 793 cytotoxin serine protease-C precursor sigp 53 157 cytotoxin serine protease-C signal peptide matp 158 790 cytotoxin serine protease-C BASE COUNT 231 a 245 c 250 g 184 t ORIGIN 1 ttcgatatcg agctcctcca acagctctga cctgggcagc cttcctgaga aaatgcagcc 61 attcctcctc ctgttggcct ttcttctgac ccctggggct gggacagagg agatcatcgg 121 gggccatgag gccaagcccc actcccgccc ctacatggcc tttgttcagt ttctgcaaga 181 gaagagtcgg aagaggtgtg gcggcatcct agtgagaaag gactttgtgc tgacagctgc 241 tcactgccag ggaagctcca taaatgtcac cttgggggcc cacaatatca aggaacagga 301 gcggacccag cagtttatcc ctgtgaaaag acccatcccc catccagcct ataatcctaa 361 gaacttctcc aacgacatca tgctactgca gctggagaga aaggccaagt ggaccacagc 421 tgtgcggcct ctcaggctac ctagcagcaa ggcccaggtg aagccagggc agctgtgcag 481 tgtggctggc tggggttatg tctcaatgag cactttagca accacactgc aggaagtgtt 541 gctgacagtg cagaaggact gccagtgtga acgtctcttc catggcaatt acagcagagc 601 cactgagatt tgtgtggggg atccaaagaa gacacagacc ggtttcaagg gggactccgg 661 ggggcccctc gtgtgtaagg acgtagccca aggtattctc 
tcctatggaa ataaaaaagg 721 gacacctcca ggagtctaca tcaaggtctc acacttcctg ccctggataa agagaacaat 781 gaagcgcctc taacagcagg catgagacta accttcctct gggcctgacc atctctggga 841 cagaggcaag aatccccaag ggtgggagtc ggggttgcag gactgtaata aatggatcga 901 gcacgatatc // LOCUS HUMFKBP 1532 bp ss-mRNA PRI 14-AUG-1990 DEFINITION Human FK506-binding protein (FKBP) mRNA, complete cds. ACCESSION M34539 KEYWORDS FK506-binding protein; immunosuppressive agent FK506-binding protein; peptidylprolyl cis-trans isomerase. SOURCE Human peripheral blood T-cell, cDNA to mRNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 1532) AUTHORS Maki,N., Sekiguchi,F., Nishimaki,J., Miwa,K., Hayano,T., Takahashi,N. and Suzuki,M. TITLE Complementary DNA encoding the human T-cell FK506-binding protein, a peptidylprolyl cis-trans isomerase distinct from cyclophilin JOURNAL Proc. Natl. Acad. Sci. U.S.A. 87, 5440-5443 (1990) STANDARD full staff_entry COMMENT Draft entry and printed sequence for [1] kindly submitted by N.Takahashi, 16-MAY-1990, for release after publication.
FEATURES from to/span description pept 79 405 FK506-binding protein (FKBP) signal 1507 1512 Poly-A signal BASE COUNT 341 a 386 c 339 g 466 t ORIGIN 1 gaattcgggc cgccgccagg tcgctgttgg tccacgccgc ccgtcgcgcc gcccgcccgc 61 tcagcgtccg ccgccgccat gggagtgcag gtggaaacca tctccccagg agacgggcgc 121 accttcccca agcgcggcca gacctgcgtg gtgcactaca ccgggatgct tgaagatgga 181 aagaaatttg attcctcccg ggacagaaac aagcccttta agtttatgct aggcaagcag 241 gaggtgatcc gaggctggga agaaggggtt gcccagatga gtgtgggtca gagagccaaa 301 ctgactatat ctccagatta tgcctatggt gccactgggc acccaggcat catcccacca 361 catgccactc tcgtcttcga tgtggagctt ctaaaactgg aatgacagga atggcctcct 421 cccttagctc cctgttcttg gatctgccat ggagggatct ggtgcctcca gacatgtgca 481 catgagtcca tatggagctt ttcctgatgt tccactccac tttgtataga catctgccct 541 gactgaatgt gttctgtcac tcagctttgc ttccgacacc tctgtttcct cttccccttt 601 ctcctcgtat gtgtgtttac ctaaactata tgccataaac ctcaagttat tcattttatt 661 ttgttttcat tttggggtga agattcagtt tcagtctttt ggatataggt ttccaattaa 721 gtacatggtc aagtattaac agcacaagtg gtaggttaac attagaatag gaattggtgt 781 tggggggggg gtttgcaaga atattttatt ttaatttttt ggatgaaatt tttatctatt 841 atatattaaa cattcttgct gctgcgctgc aaagccatag cagatttgag gcgctgttga 901 ggactgaatt actctccaag ttgagagatg tctttgggtt aaattaaaag ccctacctaa 961 aactgaggtg gggatgggga gagcctttgc ctccaccatt cccacccacc ctccccttaa 1021 accctctgcc tttgaaagta gatcatgttc actgcaatgc tggacactac aggtatctgt 1081 ccctgggcca gcagggacct ctgaagcctt ctttgtggcc tttttttttt ttcatcctgt 1141 ggtttttcta atggactttc aggaattttg taatctcata actttccaag ctccaccact 1201 tcctaaatct taagaacttt aattgacagt ttcaattgaa ggtgctgttt gtagacttaa 1261 cacccagtga aagcccagcc atcatgacaa atccttgaat gttctcttaa gaaaatgatg 1321 ctggtcatcg cagcttcagc atctcctgtt ttttgatgct tggctccctc tgctgatctc 1381 agtttcctgg cttttcctcc ctcagcccct tctcacccct ttgctgtcct gtgtagtgat 1441 ttggtgagaa atcgttgctg cacccttccc ccagcaccat ttatgagtct caagttttat 1501 tattgcaata aaagtgcttt atgcccgaat tc // LOCUS HUMINTA2 272 bp ds-DNA PRI 14-AUG-1990 
DEFINITION Human interferon alpha-2' gene, partial cds. ACCESSION M28028 KEYWORDS interferon alpha-2'. SOURCE Human leukocytes DNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 272) AUTHORS Ovchinnikov,Y.A., Sverdlov,E.D., Monastyrskaya,G.S., Tsarev,S.A., Zaitseva,E.M., Arsenyan,S.G., Chakhmakhcheva,O.G., Novokhatskii,A.S., Aspetov,R.D. and Kuznetsov,V.P. TITLE Expression of mutant human interferon alpha-2 in Escherichia coli cells JOURNAL Mol. Biol. 18, 36-45 (1984) STANDARD simple staff_entry FEATURES from to/span description pept < 1 > 272 interferon alpha-2' (AA at 2) BASE COUNT 63 a 76 c 66 g 67 t ORIGIN 1 actggtggcc ctcctggtgc tcagctgcaa gtcaagctgc tctgtgggct gtgatctgcc 61 tcaaacccac aacctgggta gcaggaggac cttgatgctc ctggcacaga tgaggagaat 121 ctctcttttc tcctgcttga aggacagaca tgactttgga tttccccagg aggagtttgg 181 caaccagttc caaaaggctg aaaccatccc tgtcctccat gagatgatcc agcagatctt 241 caatctcttc agcacaaagg actcatctgc tg // LOCUS HUMMLC1AA 836 bp ss-mRNA PRI 14-AUG-1990 DEFINITION Human embryonic myosin alkali light chain (MLC1) mRNA, complete cds. ACCESSION M36172 KEYWORDS embryonic myosin alkali light chain. SOURCE Human embryo, cDNA to mRNA, clone GT14. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 836) AUTHORS Arnold,H.-H., Lohse,P., Seidel,U. and Bober,E. TITLE A novel human myosin alkali light chain is developmentally regulated: Expression in fetal cardiac and skeletal muscle and in adult atria JOURNAL Eur. J. Biochem. 
178, 53-60 (1988) STANDARD simple staff_entry FEATURES from to/span description pept 57 650 embryonic myosin alkali light chain (MLC1) BASE COUNT 199 a 238 c 232 g 167 t ORIGIN 1 cagtctctcg gtttcttctc agatcactcc tctgccaaag atcccaacaa gacaacatgg 61 ctcccaagaa gcctgagcct aagaaggagg cagccaagcc agctccagct ccagctccag 121 cccctgcacc agcccctgcc ccagctcctg aggctcccaa ggaacctgcc tttgacccca 181 agagtgtaaa gatagacttc actgccgacc agattgaaga gttcaaagag gccttttcat 241 tgtttgaccg gaccccgact ggagagatga agatcaccta cggccagtgc ggggatgtac 301 tgcgggccct gggccagaac cctaccaatg ccgaggtgct gcgtgtgctg ggcaagccca 361 agcctgaaga gatgaatgtc aagatgctgg actttgagac gttcttgccc atcctgcagc 421 acatttcccg caacaaggag cagggcacct atgaggactt cgtggagggc ctgcgtgtct 481 ttgacaagga gagcaatggc acggtcatgg gtgctgagct tcggcacgtc cttgccaccc 541 tgggagagaa gatgactgag gctgaagtgg agcagctgtt agctgggcaa gaggatgcca 601 atggctgcat caattatgaa gcctttgtca agcacatcat gtcagggtga agcagagtct 661 tccaggtgcc tggcccttgg ctttagccat accagggtga gttaaagaga ggccccggct 721 gggtgagctg agatggagtc ctcgacttat caccacacca ctgccccaag gaccttacag 781 gccctccctg ttaataaaca gctctaacac ggccaggctg ggctctggga ttctga // LOCUS MUSAIC2B 3113 bp ss-mRNA ROD 14-AUG-1990 DEFINITION Mouse interleukin 3 receptor-like protein (AIC2B) mRNA, complete cds. ACCESSION M34397 KEYWORDS IL-3 receptor; interleukin 3 receptor. SOURCE Mouse, cDNA to mRNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 3113) AUTHORS Gorman,D.M., Itoh,N., Kitamura,T., Schreurs,J., Yonehara,S., Yahara,I., Arai,K.-I. and Miyajima,A. TITLE Cloning and expression of a novel interleukin 3 receptor-like gene: Identification of another member of the cytokine receptor gene family JOURNAL Proc. Natl. Acad. Sci. U.S.A.
87, 5459-5463 (1990) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by D.M.Gorman, 22-MAY-1990, for release after publication. FEATURES from to/span description pept 288 2978 interleukin 3 receptor-like protein (AIC2B) precursor sigp 288 353 interleukin 3 receptor-like protein signal peptide matp 354 2975 interleukin 3 receptor-like protein signal peptide BASE COUNT 727 a 955 c 806 g 625 t ORIGIN 1 gagctgactg ggagatggtg gggagcgagc tactctggca gaactaaatg tcatggggca 61 cagataaata ggaagagcct gcaactcact ggcacttgga ggctcccgaa ggaggctgcc 121 tgtcgcccaa gcacagagcc acaaaggatg cagtctagga gggaagaatc acaagccctg 181 taagatgagt ggagccaaac ccccagccaa gcaccaatac agaaccccgg gacaatgagg 241 acacccccct gcccatagct tccagtgcag ccaccaaaag tgccaaaatg gaccagcaaa 301 tggcactcac atgggggctg tgctacatgg cactggtggc tctctgttgg ggacacgggg 361 tgacagaggc agaagaaacg gtccctctga agactctgca gtgctacaat gactacacca 421 accacatcat ctgcagctgg gcggacacag aggatgccca ggggctaatc aacatgaccc 481 tctatcacca gctagagaaa aaacagccag tgtcctgtga gctcagtgag aaactcatgt 541 ggtcagagtg cccgtcatcc caccgctgtg tgcccagaag atgtgtcatc ccctatacac 601 gattttccat cacaaacgaa gactactact ccttccggcc agatagtgat ctgggcatcc 661 agctcatggt gccacttgcc cagaatgtgc agccaccact tcccaagaac gtcagcatca 721 gctcctctga ggatcgtttc ctgctggagt ggagtgtgtc ccttggggat gcccaggtct 781 cctggctttc atcaaaggac atagagtttg aggtggctta taagcggctt caggactcct 841 gggaggatgc ctacagtctc cacactagca aatttcaggt gaatttcgag ccaaagctat 901 tcctacccaa cagcatctat gcgccccgtg tgcgcactcg gctgtacccg ggttcaagct 961 tgtctgggag acccagcaga tggagcccag aggctcactg ggactcccag ccaggggaca 1021 aggcccagcc acagaacctt caatgcttct ttgatgggat ccagtccctc cactgctcct 1081 gggaggtgtg gacccagacg actggctctg tttcctttgg gctcttctat cgccccagcc 1141 ctgtagctcc ggaggagaaa tgctctccgg tggtgaagga gccgccgggg gccagtgtct 1201 acacccgcta ccattgcagt ctacctgtgc ctgagcccag tgcacacagc cagtacacag 1261 tctctgttaa gcacctggaa caagggaagt tcatcatgag ctataaccac atccagatgg 
1321 agcctccaac cctcaacctg accaagaaca gagacagcta cagcctgcat tgggaaactc 1381 agaagatggc ttactcattc attgagcaca cattccaggt ccagtacaag aagaaatcgg 1441 acagctggga ggacagcaag acagagaacc tagatcgagc ccatagcatg gacctctccc 1501 agctggagcc agacacctca tactgcgcca gggtgagggt caagcccatc tctaactacg 1561 atgggatctg gagcaagtgg agcgaagagt acacttggaa gactgactgg gtgatgccca 1621 cgctgtggat agtcctcatc ctggtctttc tcatcctcac cttgctcctg atccttcgct 1681 ttggctgtgt ctctgtatac aggacgtaca ggaagtggaa ggaaaagatc cccaacccca 1741 gcaagagcct cctgttccag gatggaggta aaggtctctg gcctcctggc agcatggcag 1801 ccttcgccac taagaacccc gctctccagg ggccacagag caggcttctt gctgagcaac 1861 agggggagtc atatgcacat ttggaagaca acaacgtgtc acctctcact atagaggacc 1921 ctaatataat tcgagttcca ccatccgggc ctgatacaac cccagctgcc tcatccgaat 1981 ccacagagca acttcccaat gttcaagtag agggaccaac tcctaacaga cctaggaagc 2041 aattacccag ctttgacttc aatgggccct acctggggcc tccccaatcc cactctctgc 2101 ctgatctccc agaccagctg ggttcccccc aggtgggtgg gagcctgaag ccagcactgc 2161 caggctcctt ggagtacatg tgtctggccc ctggaggtca agtgcaactg gttccattgt 2221 cccaggtgat ggggcagggc caggctatgg atgtgcagtg tgggtccagc ctggagacct 2281 cagggagccc ttctgtggag ccaaaggaga accctccagt tgagctgagc atggaggaac 2341 aggaggcacg ggacaaccca gtgactctgc ccataagctc tgggggccct gagggcagta 2401 tgatggcctc tgattatgtc actcctggag atccggtgct cactctgccc acagggcccc 2461 tgtctacctc tctgggcccc tctctagggt tgccctcagc ccaaagcccc agtctctgtc 2521 ttaagctgcc cagggtcccc tctggaagcc cagctctagg gccaccaggg tttgaggact 2581 atgtggagct gcctccaagt gtgagccagg ctgccaagtc ccctccaggc catcctgctc 2641 ctcctgtggc aagcagcccc acagtgatcc caggagagcc cagggaggaa gtgggcccag 2701 catccccaca tcccgaaggc ctccttgttc ttcagcaggt tggggactac tgcttcctcc 2761 ctggcctggg acctggctcc ctctcaccac acagtaagcc accctctcca agtctgtgtt 2821 ctgagactga ggacctagtc caggacttgt ctgtcaaaaa gtttccctat cagcccatgc 2881 cccaggcgcc agccattcag tttttcaagt ccctaaagca tcaggactac ctgtccctgc 2941 ccccttggga caatagccag tctgggaagg tgtgctgagt ctgtctcctc ccaatctcac 3001 
cagcagcctg gcaccgcagc ctgtggtcct cagcctgagc atcaccacag aagcctctct 3061 gagttcacac tcctccttgc tcccagccct gacatggcaa tacccccacc tgt // LOCUS MUSIGHADP 363 bp ss-mRNA ROD 14-AUG-1990 DEFINITION Mouse Ig heavy-chain mRNA V region, partial cds. ACCESSION M36209 KEYWORDS immunoglobulin heavy-chain; processed gene; variable region. SOURCE Mouse (strain BALB/c), cDNA to mRNA, from hybridoma H130-6. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 363) AUTHORS Kavaler,J., Caton,A.J., Staudt,L.M., Schwartz,D. and Gerhard,W. TITLE A set of closely related antibodies dominates the primary antibody response to influenza virus hemagglutinin JOURNAL J. Immunol. (1990) In press STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by J.Kavaler, 06-JUL-1990. FEATURES from to/span description pept < 1 > 363 Ig heavy-chain V region (AA at 1) BASE COUNT 88 a 89 c 98 g 88 t ORIGIN 1 caggtccagt tgcagcagtc tggagctgag ctggtaaggc ctgggacttc agtgaagata 61 tcctgcaagg cttctggcta caccttcact aactactggc taggttgggt aaagcagagg 121 cctggacatg gacttgagtg gattggagat atttaccctg gaggtggtta tactaactac 181 aatgagaagt tcaagggcaa ggccacactg actgcagaca catcctccag cactgcctac 241 atgcagctca gtagcctgac atctgaggac tctgctgtct atttctgtgc aagaaatcgc 301 tactatggta actactggta cttcgatgtc tggggcgcag ggaccacggt caccgtctcc 361 tca // LOCUS MUSIGHADQ 357 bp ss-mRNA ROD 14-AUG-1990 DEFINITION Mouse Ig heavy-chain mRNA V region, partial cds. ACCESSION M36210 KEYWORDS immunoglobulin heavy-chain; processed gene; variable region. SOURCE Mouse (strain BALB/c), cDNA to mRNA, from hybridoma H146-24B1. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 357) AUTHORS Kavaler,J., Caton,A.J., Staudt,L.M., Schwartz,D. and Gerhard,W. 
TITLE A set of closely related antibodies dominates the primary antibody response to influenza virus hemagglutinin JOURNAL J. Immunol. (1990) In press STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by J.Kavaler, 06-JUL-1990. FEATURES from to/span description pept < 1 > 357 Ig heavy-chain V region (AA at 1) BASE COUNT 88 a 83 c 99 g 87 t ORIGIN 1 caggtccagt tgcagcagtc tggagctgag ctggtaaggc ctgggacttc agtgaagata 61 tcctgcaagg cttctggcta caccttcact aactactggc taggttgggt aaagcagagg 121 cctggacatg gacttgagtg gattggagat atttaccctg gaggtggtta tactaactac 181 aatgagaagt tcaagggcaa ggccacactg actgcagaca catcctccag cactgcctac 241 atgcagctca gtagcctgac atctgaggac tctgctgtct atttctgtgc aagaggaaac 301 tataggtacg acgagtttgc ttactggggc caagggactc tggtcactgt ctctgca // LOCUS MUSIGHADR 336 bp ss-mRNA ROD 14-AUG-1990 DEFINITION Mouse Ig heavy-chain mRNA V region, partial cds. ACCESSION M36211 KEYWORDS immunoglobulin heavy-chain; processed gene; variable region. SOURCE Mouse (strain BALB/c), cDNA to mRNA, from hybridoma H147-25H1. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 336) AUTHORS Kavaler,J., Caton,A.J., Staudt,L.M., Schwartz,D. and Gerhard,W. TITLE A set of closely related antibodies dominates the primary antibody response to influenza virus hemagglutinin JOURNAL J. Immunol. (1990) In press STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by J.Kavaler, 06-JUL-1990. 
FEATURES from to/span description pept < 1 > 336 Ig heavy-chain V region (AA at 1) BASE COUNT 81 a 77 c 94 g 84 t ORIGIN 1 caggtccagt tgcagcagtc tggagctgag ctggtaaggc ctgggacttc agtgaagata 61 tcctgcaagg cttctggcta caccttcact aactactggc taggttgggt aaagcagagg 121 cctggacatg gacttgagtg gattggagat atttaccctg gaggtggtta tactaactac 181 aatgagaagt tcaagggcaa ggccacactg actgcagaca catcctccag cactgcctac 241 atgcagctca gtagcctgac atctgaggac tctgctgtct atttctgtgc aagatggggc 301 tactatggta acctactggt acttcgatgt ctgggg // LOCUS MUSIGHADS 354 bp ss-mRNA ROD 14-AUG-1990 DEFINITION Mouse Ig heavy-chain mRNA V region, partial cds. H163-130F2. ACCESSION M36212 KEYWORDS immunoglobulin heavy-chain; processed gene; variable region. SOURCE Mouse (strain BALB/c), cDNA to mRNA, from hybridoma H163-130F2. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 354) AUTHORS Kavaler,J., Caton,A.J., Staudt,L.M., Schwartz,D. and Gerhard,W. TITLE A set of closely related antibodies dominates the primary antibody response to influenza virus hemagglutinin JOURNAL J. Immunol. (1990) In press STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by J.Kavaler, 06-JUL-1990. FEATURES from to/span description pept < 1 > 354 Ig heavy-chain V region (AA at 1) BASE COUNT 86 a 83 c 97 g 88 t ORIGIN 1 caggtccagt tgcagcagtc tggagctgag ctggtaaggc ctgggacttc agtgaagata 61 tcctgcaagg cttctggcta caccttcact aactactggc taggttgggt aaagcagagg 121 cctggacatg gacttgagtg gattggagat atttaccctg gaggtggtta tactaactac 181 aatgagaagt tcaagggcaa ggccacactg actgcagaca catcctccag cactgcctac 241 atgcagctca gtagcctgac atctgaggac tctgctgtct atttctgtgc aagatactat 301 aggtacgaca ggtttgctta ctggggccaa gggactctgg tcactgtctc tgca // LOCUS MUSIGHADT 357 bp ss-mRNA ROD 14-AUG-1990 DEFINITION Mouse Ig heavy-chain mRNA V region, partial cds. H163-130H9. 
ACCESSION M36213 KEYWORDS immunoglobulin heavy-chain; processed gene; variable region. SOURCE Mouse (strain BALB/c), cDNA to mRNA, from hybridoma H163-130H9. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 357) AUTHORS Kavaler,J., Caton,A.J., Staudt,L.M., Schwartz,D. and Gerhard,W. TITLE A set of closely related antibodies dominates the primary antibody response to influenza virus hemagglutinin JOURNAL J. Immunol. (1990) In press STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by J.Kavaler, 06-JUL-1990. FEATURES from to/span description pept < 1 > 357 Ig heavy-chain V region (AA at 1) BASE COUNT 86 a 91 c 102 g 78 t ORIGIN 1 caggtccagc tgcagcagtc tggagctgag ctggtaaggc ctgggacttc agtgaagatg 61 tcctgcaagg ctgctggata caccttcact aactactgga taggttgggt aaagcagagg 121 cctggacatg gccttgagtg gattggagat atttaccctg gaggtggtta tactaactac 181 aatgagaagt tcaagggcaa ggccacactg actgcagaca catcctccag cacagcctac 241 atgcagctca gcagcctgac atctgaggac tctgccatct attactgtgc aaggggctac 301 ggtagccact ggtacttcga tgtctggggc gcagggacca cggtcaccgt ctcctca // LOCUS MUSIGHADU 331 bp ss-mRNA ROD 14-AUG-1990 DEFINITION Mouse Ig heavy-chain mRNA V region, partial cds. H2-4C2. ACCESSION M36214 KEYWORDS immunoglobulin heavy-chain; processed gene; variable region. SOURCE Mouse (strain BALB/c), cDNA to mRNA, from hybridoma H2-4C2. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 331) AUTHORS Kavaler,J., Caton,A.J., Staudt,L.M., Schwartz,D. and Gerhard,W. TITLE A set of closely related antibodies dominates the primary antibody response to influenza virus hemagglutinin JOURNAL J. Immunol. 
(1990) In press STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by J.Kavaler, 06-JUL-1990. FEATURES from to/span description pept < 1 > 331 Ig heavy-chain V region (AA at 2) BASE COUNT 94 a 79 c 83 g 75 t ORIGIN 1 acctgagctg aagaagcctg gagagacagt caagatctcc tgcaaggctt ctggatatac 61 cttcacaaac tatggaatga attgggtgaa gcaggctcca ggaaagggtt taaagtggat 121 gggctggata aacacctaca ctggagagcc aacatatgct gatgacttca agggacggtt 181 tgccttccct ttggaaacct ctgccagcac tgcctatttg cagatcaaca acctcaaaaa 241 tgaggacacg gctacatatt tctgtgcaag atatgactac aactactggt acttcgatgt 301 ctggggcgca gggaccacgg tcaccgtctc c // LOCUS MUSIGHADV 354 bp ss-mRNA ROD 14-AUG-1990 DEFINITION Mouse Ig heavy-chain mRNA V region, partial cds. H220-1VH. ACCESSION M36215 KEYWORDS immunoglobulin heavy-chain; processed gene; variable region. SOURCE Mouse (strain BALB/c), cDNA to mRNA, from hybridoma H220-1. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 354) AUTHORS Kavaler,J., Caton,A.J., Staudt,L.M., Schwartz,D. and Gerhard,W. TITLE A set of closely related antibodies dominates the primary antibody response to influenza virus hemagglutinin JOURNAL J. Immunol. (1990) In press STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by J.Kavaler, 06-JUL-1990. 
FEATURES from to/span description pept < 1 > 354 Ig heavy-chain V region (AA at 1) BASE COUNT 88 a 86 c 99 g 81 t ORIGIN 1 caggtccagc tgcagcagtc tggagctgag ctggtaaggc ctgggacttc agtgaagatg 61 tcctgcaagg ctgctggata caccttcact aactactgga taggttgggt aaagcagagg 121 cctggacatg gccttgagtg gattggagat atttaccctg gaggtggtta tactaactac 181 aatgagaagt tcaagggcaa ggccacactg actgcagaca catcctccag cacagcctac 241 atgcagctca gcagcctgac atctgaggac tctgccatct attactgtgc aagtcagggg 301 tatggttacg ttgctatgga ctactggggt caaggaacct cagtcaccgt ctcc // LOCUS MUSIGHADW 360 bp ss-mRNA ROD 14-AUG-1990 DEFINITION Mouse Ig heavy-chain mRNA V region, partial cds. H220-10VH. ACCESSION M36216 KEYWORDS immunoglobulin heavy-chain; processed gene; variable region. SOURCE Mouse (strain BALB/c), cDNA to mRNA, from hybridoma H220-10. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 360) AUTHORS Kavaler,J., Caton,A.J., Staudt,L.M., Schwartz,D. and Gerhard,W. TITLE A set of closely related antibodies dominates the primary antibody response to influenza virus hemagglutinin JOURNAL J. Immunol. (1990) In press STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by J.Kavaler, 06-JUL-1990. FEATURES from to/span description pept < 1 > 360 Ig heavy-chain V region (AA at 1) BASE COUNT 89 a 90 c 92 g 89 t ORIGIN 1 caggtccagt tgcagcagtc tggagctgag ctggtaaggc ctgggacttc agtgaagata 61 tcctgcaagg cttctggcta caccttcact aactactggc taggttgggt aaagcagagg 121 cctggacatg gacttgagtg gattggagat atttaccctg gaggtggtta tactaactac 181 aatgagaagt tcaagggcaa ggccacactg actgcagaca catcctccag cactgcctac 241 atgcagctca gtagcctgac atctgaggac tctgctgtct atttctgtgc aagatcttac 301 tacggtagta gctactactt tgactactgg ggccaaggca ccactctcac agtctcctca // LOCUS MUSIGHADX 360 bp ss-mRNA ROD 14-AUG-1990 DEFINITION Mouse Ig heavy-chain mRNA V region, partial cds. H220-22VH. 
ACCESSION M36217 KEYWORDS immunoglobulin heavy-chain; processed gene; variable region. SOURCE Mouse (strain BALB/c), cDNA to mRNA, from hybridoma H220-22. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 360) AUTHORS Kavaler,J., Caton,A.J., Staudt,L.M., Schwartz,D. and Gerhard,W. TITLE A set of closely related antibodies dominates the primary antibody response to influenza virus hemagglutinin JOURNAL J. Immunol. (1990) In press STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by J.Kavaler, 06-JUL-1990. FEATURES from to/span description pept < 1 > 360 Ig heavy-chain V region (AA at 1) BASE COUNT 91 a 89 c 96 g 84 t ORIGIN 1 caggtgcagc tgaaggagtc aggacctggc ctggtggcgc cctcacagag cctgtccatc 61 acttgcactg tctctgggtt ttcattaacc agctatggtg tacactgggt tcgccagcct 121 ccaggaaagg gtctggagtg gctgggagta atatgggctg gtggaagcac aaattataat 181 tcggctctca tgtccagact gagcatcagc aaagacaact ccaagagcca agttttctta 241 aaaatgaaca gtctgcaaac tgatgacaca gccatgtact actgtgccag agggcactat 301 ggttacgacg gttactatgc tatggactac tggggtcaag gaacctcagt caccgtctcc // LOCUS MUSIGHADY 342 bp ss-mRNA ROD 14-AUG-1990 DEFINITION Mouse Ig heavy-chain mRNA V region, partial cds. H220-23VH. ACCESSION M36218 KEYWORDS immunoglobulin heavy-chain; processed gene; variable region. SOURCE Mouse (strain BALB/c), cDNA to mRNA, from hybridoma H220-23. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 342) AUTHORS Kavaler,J., Caton,A.J., Staudt,L.M., Schwartz,D. and Gerhard,W. TITLE A set of closely related antibodies dominates the primary antibody response to influenza virus hemagglutinin JOURNAL J. Immunol. 
(1990) In press STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by J.Kavaler, 06-JUL-1990. FEATURES from to/span description pept < 1 > 342 Ig heavy-chain V region (AA at 1) BASE COUNT 89 a 83 c 92 g 78 t ORIGIN 1 gaggtccagc ttcagcagtc aggacctgag ctggtgaaac ctggggcctc agtgaagata 61 tcctgcaagg cttctggata cacattcact gactacaaca tgcactgggt gaagcagagc 121 catggaaaga gccttgagtg gattggatat atttatcctt acaatggtgg tactggctac 181 aaccagaagt tcaagagcaa ggcaacattg actgtagaca attcctccag cacagcctac 241 atggagctcc gcagcctgac atctgaggac tctgcagtct attactgtgc aagagggggg 301 ctacgccctt ttgcttactg gggccaaggg actctggtca ct // LOCUS MUSIGHADZ 357 bp ss-mRNA ROD 14-AUG-1990 DEFINITION Mouse Ig heavy-chain mRNA V region, partial cds. H220-5VH. ACCESSION M36219 KEYWORDS immunoglobulin heavy-chain; processed gene; variable region. SOURCE Mouse (strain BALB/c), cDNA to mRNA, from hybridoma H220-5. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 357) AUTHORS Kavaler,J., Caton,A.J., Staudt,L.M., Schwartz,D. and Gerhard,W. TITLE A set of closely related antibodies dominates the primary antibody response to influenza virus hemagglutinin JOURNAL J. Immunol. (1990) In press STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by J.Kavaler, 06-JUL-1990. 
FEATURES from to/span description pept < 1 > 357 Ig heavy-chain V region (AA at 1) BASE COUNT 87 a 88 c 100 g 82 t ORIGIN 1 caggtccagc tgcagcagtc tggagctgag ctggtaaggc ctgggacttc agtgaagatg 61 tcctgcaagg ctgctggata caccttcact aactactgga taggttgggt aaagcagagg 121 cctggacatg gccttgagtg gattggagat atttaccctg gaggtggtta tactaactac 181 aatgagaagt tcaagggcaa ggccacactg actgcagaca catcctccag cacagcctac 241 atgcagctca gcagcctgac atctgaggac tctgccatct attactgtgc aagatgccct 301 tatggtaact acgtttggta cttcgatgtc tggggcgcag ggaccacggt caccgtc // LOCUS MUSIGHAEA 366 bp ss-mRNA ROD 14-AUG-1990 DEFINITION Mouse Ig heavy-chain mRNA V region, partial cds. H220-7VH. ACCESSION M36220 KEYWORDS immunoglobulin heavy-chain; processed gene; variable region. SOURCE Mouse (strain BALB/c), cDNA to mRNA, from hybridoma H220-7. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 366) AUTHORS Kavaler,J., Caton,A.J., Staudt,L.M., Schwartz,D. and Gerhard,W. TITLE A set of closely related antibodies dominates the primary antibody response to influenza virus hemagglutinin JOURNAL J. Immunol. (1990) In press STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by J.Kavaler, 06-JUL-1990. FEATURES from to/span description pept < 1 > 366 Ig heavy-chain V region (AA at 1) BASE COUNT 90 a 92 c 94 g 90 t ORIGIN 1 gaggtgaagc tggtggagtc tggaggaggc ttggtacagc ctgggggttc tctgagactc 61 tcctgtgcaa cttctgggtt caccttcact gattactaca tgagctgggt ccgccagcct 121 ccaggaaagg cacttgagtg gttgggtttt attagaaaca aagctaatgg ttacacaaca 181 gagtacagtg catctgtgaa gggtcggttc accatctcca gagataattc ccaaagcatc 241 ctctatcttc aaatgaacac cctgagagct gaggacagtg ccacttatta ctgtgcaaga 301 gacccccctt actacggcta cgctatggac tactggggtc aaggaacctc agtcaccgtc 361 tcctca // LOCUS MUSIGHAEB 328 bp ss-mRNA ROD 14-AUG-1990 DEFINITION Mouse Ig heavy-chain mRNA V region, partial cds. 
H230-2VH. ACCESSION M36221 KEYWORDS immunoglobulin heavy-chain; processed gene; variable region. SOURCE Mouse (strain BALB/c), cDNA to mRNA, from hybridoma H230-2. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 328) AUTHORS Kavaler,J., Caton,A.J., Staudt,L.M., Schwartz,D. and Gerhard,W. TITLE A set of closely related antibodies dominates the primary antibody response to influenza virus hemagglutinin JOURNAL J. Immunol. (1990) In press STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by J.Kavaler, 06-JUL-1990. FEATURES from to/span description pept < 1 > 328 Ig heavy-chain V region (AA at 2) BASE COUNT 87 a 86 c 80 g 75 t ORIGIN 1 acctggcctg gtggcgccct cacagagcct gtccatcaca tgcactgtct cagggttctc 61 attaaccagc tatggtgtaa gctgggttcg ccagcctcca ggaaagggtc tggagtggct 121 gggagtaata tggggtgacg ggagcacaaa ttatcattca gctctcatat ccagactgag 181 catcagcaag gataactcca agagccaagt tttcttaaaa ctgaacagtc tgcaaactga 241 tgacacagcc acgtactact gtgccaaacc ttatgattac gaagggtact actttgacta 301 ctggggccaa ggcaccactc tcacagtc // LOCUS MUSIGHAEC 331 bp ss-mRNA ROD 14-AUG-1990 DEFINITION Mouse Ig heavy-chain mRNA V region, partial cds. H230-5VH. ACCESSION M36222 KEYWORDS immunoglobulin heavy-chain; processed gene; variable region. SOURCE Mouse (strain BALB/c), cDNA to mRNA, from hybridoma H230-5. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 331) AUTHORS Kavaler,J., Caton,A.J., Staudt,L.M., Schwartz,D. and Gerhard,W. TITLE A set of closely related antibodies dominates the primary antibody response to influenza virus hemagglutinin JOURNAL J. Immunol. (1990) In press STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by J.Kavaler, 06-JUL-1990. 
FEATURES from to/span description pept < 1 > 331 Ig heavy-chain V region (AA at 2) BASE COUNT 74 a 88 c 86 g 83 t ORIGIN 1 ccctgggata ttgcagccct cccagaccct cagtctgact tgttctttct ctgggttttc 61 actgagcact tctggtatgg gtgtgagctg gattcgtcag ccttcaggaa agggtctgga 121 gtggctggca cacatttact gggatgatga caagcactat aacccatccc tgaagagccg 181 gctcacaatc tccaaggata cctccagcaa ccaggtattc ctcaagatca ccagtgtgga 241 cactgcagat actgccacat actactgtgc tcgaagagat ggttacgacg aggggtttgc 301 ttactggggc caagggactc tggtcactgt c // LOCUS MUSIGHAED 334 bp ss-mRNA ROD 14-AUG-1990 DEFINITION Mouse Ig heavy-chain mRNA V region, partial cds. H250-6VH. ACCESSION M36223 KEYWORDS immunoglobulin heavy-chain; processed gene; variable region. SOURCE Mouse (strain BALB/c), cDNA to mRNA, from hybridoma H250-6. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 334) AUTHORS Kavaler,J., Caton,A.J., Staudt,L.M., Schwartz,D. and Gerhard,W. TITLE A set of closely related antibodies dominates the primary antibody response to influenza virus hemagglutinin JOURNAL J. Immunol. (1990) In press STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by J.Kavaler, 06-JUL-1990. FEATURES from to/span description pept < 1 > 334 Ig heavy-chain V region (AA at 2) BASE COUNT 75 a 81 c 95 g 83 t ORIGIN 1 gggaggctta gtgaagcctg gagggtccct gaaactctcc tgtgcagcct ctggattcac 61 tttcagtagc tatgccatgt cttgggttcg ccagtctcca gagaagaggc tggagtgggt 121 cgcagaaatt agtagtggtg gtagttacac ctactatcca gacactgtga cgggccgatt 181 caccatctcc agagacaatg ccaagaacac cctgtacctg gaaatgagca gtctgaggtc 241 tgaggacacg gccatgtatt actgtgcaag ggatctcgac tatggtaact cctggtttgc 301 ttactggggc caagggactc tggtcactgt ctct // LOCUS MUSIGHAEE 354 bp ss-mRNA ROD 14-AUG-1990 DEFINITION Mouse Ig heavy-chain mRNA V region, partial cds. H280-10VH. 
ACCESSION M36224 KEYWORDS immunoglobulin heavy-chain; processed gene; variable region. SOURCE Mouse (strain BALB/c), cDNA to mRNA, from hybridoma H280-10. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 354) AUTHORS Kavaler,J., Caton,A.J., Staudt,L.M., Schwartz,D. and Gerhard,W. TITLE A set of closely related antibodies dominates the primary antibody response to influenza virus hemagglutinin JOURNAL J. Immunol. (1990) In press STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by J.Kavaler, 06-JUL-1990. FEATURES from to/span description pept < 1 > 354 Ig heavy-chain V region (AA at 1) BASE COUNT 88 a 90 c 90 g 86 t ORIGIN 1 caggtccagt tgcagcagtc tggagctgag ctggtaaggc ctgggacttc agtgaagata 61 tcctgcaagg cttctggcta caccttcact aactactggc taggttgggt aaagcagagg 121 cctggacatg gacttgagtg gattggagat atttaccctg gaggtggtta tactaactac 181 aatgagaagt tcaagggcaa ggccacactg actgcagaca catcctccag cactgcctac 241 atgcagctca gtagcctgac atctgaggac tctgctgtct atttctgtgc aagatctaag 301 atcccgtact actttgacta ctggggccaa ggcaccactc tcacagtctc ctca // LOCUS MUSIGHAEF 354 bp ss-mRNA ROD 14-AUG-1990 DEFINITION Mouse Ig heavy-chain mRNA V region, partial cds. H280-15VH. ACCESSION M36225 KEYWORDS immunoglobulin heavy-chain; processed gene; variable region. SOURCE Mouse (strain BALB/c), cDNA to mRNA, from hybridoma H280-15. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 354) AUTHORS Kavaler,J., Caton,A.J., Staudt,L.M., Schwartz,D. and Gerhard,W. TITLE A set of closely related antibodies dominates the primary antibody response to influenza virus hemagglutinin JOURNAL J. Immunol. 
(1990) In press STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by J.Kavaler, 06-JUL-1990. FEATURES from to/span description pept < 1 > 354 Ig heavy-chain V region (AA at 1) BASE COUNT 95 a 88 c 89 g 82 t ORIGIN 1 caggtccagc ttcagcagtc tggggctgaa ctggcaaaac ctggggcctc agtgaagatg 61 tcctgcaagg cttctggcta cacctttact agctactgga tgcactgggt aaaacagagg 121 cctggacagg gtctggaatg gattggatac attaatccta gcactggtta tactgagtac 181 aatcagaagt tcaaggacaa ggccacattg actgcagaca aatcctccag cacagcctac 241 atgcaactga gcagcctgac atctgaggac tctgcactgt attactgtgc aagatgggtc 301 tattactatg ctatggacta ctggggtcaa ggaacctcag tcaccgtctc ctca // LOCUS MUSIGHAEG 360 bp ss-mRNA ROD 14-AUG-1990 DEFINITION Mouse Ig heavy-chain mRNA V region, partial cds. H35-C10VH. ACCESSION M36226 KEYWORDS immunoglobulin heavy-chain; processed gene; variable region. SOURCE Mouse (strain BALB/c), cDNA to mRNA, from hybridoma H35-C10. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 360) AUTHORS Kavaler,J., Caton,A.J., Staudt,L.M., Schwartz,D. and Gerhard,W. TITLE A set of closely related antibodies dominates the primary antibody response to influenza virus hemagglutinin JOURNAL J. Immunol. (1990) In press STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by J.Kavaler, 06-JUL-1990. 
FEATURES from to/span description pept < 1 > 360 Ig heavy-chain V region (AA at 1) BASE COUNT 84 a 87 c 100 g 89 t ORIGIN 1 caggtccagt tgcagcagtc tggagctgag ctggtaaggc ctgggacttc agtgaagata 61 tcctgcaagg cttctggcta caccttcact aactactggc taggttgggt aaagcagagg 121 cctggacatg gacttgagtg gattggagat atttaccctg gaggtggtta tactaactac 181 aatgagaagt tcaagggcaa ggccacactg actgcagaca catcctccag cactgcctac 241 atgcagctca gtagcctgac atctgaggac tctgctgtct atttctgtgc aagagggcga 301 actatgattc ccgcctggtt tgcttactgg ggccaaggga ctctggtcac tgtctctgca // LOCUS MUSIGHAEH 345 bp ss-mRNA ROD 14-AUG-1990 DEFINITION Mouse Ig heavy-chain mRNA V region, partial cds. H35-C7VH. ACCESSION M36227 KEYWORDS immunoglobulin heavy-chain; processed gene; variable region. SOURCE Mouse (strain BALB/c), cDNA to mRNA, from hybridoma H35-C7. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 345) AUTHORS Kavaler,J., Caton,A.J., Staudt,L.M., Schwartz,D. and Gerhard,W. TITLE A set of closely related antibodies dominates the primary antibody response to influenza virus hemagglutinin JOURNAL J. Immunol. (1990) In press STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by J.Kavaler, 06-JUL-1990. FEATURES from to/span description pept < 1 > 345 Ig heavy-chain V region (AA at 1) BASE COUNT 85 a 78 c 96 g 83 t 3 others ORIGIN 1 caggttcagc tgcagcagtc tggagctgaa ctggtaaagc ctggggcttc agtgaagttg 61 tcctgcaagg cttctggcta caccttcaca agctatgata taaactgggt gaggcagagg 121 cctgaacagg gacttgagtg gattggatgg atttttcctg gagatggtag tactaagtac 181 aatgagaagt tcaagggcaa ggccacactg actacagaca aatcctccag cacagcctac 241 atgcagctca gcaggctgac atctgaggac tctgctgtct atttctgtgc aaactggnac 301 gggtttgctt actggnncca agggactctg gtcactgtct ctgca // LOCUS MUSIGHAEI 354 bp ss-mRNA ROD 14-AUG-1990 DEFINITION Mouse Ig heavy-chain mRNA V region, partial cds. H61-15VH. 
ACCESSION M36228 KEYWORDS immunoglobulin heavy-chain; processed gene; variable region. SOURCE Mouse (strain BALB/c), cDNA to mRNA, from hybridoma H61-15. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 354) AUTHORS Kavaler,J., Caton,A.J., Staudt,L.M., Schwartz,D. and Gerhard,W. TITLE A set of closely related antibodies dominates the primary antibody response to influenza virus hemagglutinin JOURNAL J. Immunol. (1990) In press STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by J.Kavaler, 06-JUL-1990. FEATURES from to/span description pept < 1 > 354 Ig heavy-chain V region (AA at 1) BASE COUNT 90 a 89 c 93 g 82 t ORIGIN 1 caggtgcagc tgaaggagtc aggacctggc ctggtggcgc cctcacagag cctgtccatc 61 acttgcactg tctctgggtt ttcattaacc agctatggtg tacactgggt tcgccagcct 121 ccaggaaagg gtctggagtg gctgggagta atatgggctg gtggaagcac aaattataat 181 tcggctctca tgtccagact gagcatcagc aaagacaact ccaagagcca agttttctta 241 aaaatgaaca gtctgcaaac tgatgacaca gccatgtact actgtgccag agggcattac 301 tacggctact atgctatgga ctactggggt caaggaacct cagtcaccgt ctcc // LOCUS MUSIGHAEJ 351 bp ss-mRNA ROD 14-AUG-1990 DEFINITION Mouse Ig heavy-chain mRNA V region, partial cds. H9-A15VH. ACCESSION M36229 KEYWORDS immunoglobulin heavy-chain; processed gene; variable region. SOURCE Mouse (strain BALB/c), cDNA to mRNA, from hybridoma H9-A15. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 351) AUTHORS Kavaler,J., Caton,A.J., Staudt,L.M., Schwartz,D. and Gerhard,W. TITLE A set of closely related antibodies dominates the primary antibody response to influenza virus hemagglutinin JOURNAL J. Immunol. 
(1990) In press STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by J.Kavaler, 06-JUL-1990. FEATURES from to/span description pept < 1 > 351 Ig heavy-chain V region (AA at 1) BASE COUNT 96 a 82 c 91 g 81 t 1 others ORIGIN 1 gttcagctgc agcagtctgg agctgagctg atgaagcctg ggncctcagt gaagatatcc 61 tgcaaggcta ctggctacag attcagtagc tactggatgg agtgggtaaa gcagaggcct 121 ggacatggcc ttgagtggat tggagagctt ttacctggaa atagtaatac taattacaat 181 gagaagttca agggcaaggc cacattcgct gcagatacat cctccaacac agcctacatg 241 caactcagca gcctgacatc tgaggactct gccgtctatt actgtgcaag agatctgaaa 301 aatggtgact cgttctactt tgactactgg ggccaaggca ccactctcac a // LOCUS MUSIGHAEK 333 bp ss-mRNA ROD 14-AUG-1990 DEFINITION Mouse Ig heavy-chain mRNA V region, partial cds. H9-D3VH. ACCESSION M36230 KEYWORDS immunoglobulin heavy-chain; processed gene; variable region. SOURCE Mouse (strain BALB/c), cDNA to mRNA, from hybridoma H9-D3. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 333) AUTHORS Kavaler,J., Caton,A.J., Staudt,L.M., Schwartz,D. and Gerhard,W. TITLE A set of closely related antibodies dominates the primary antibody response to influenza virus hemagglutinin JOURNAL J. Immunol. (1990) In press STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by J.Kavaler, 06-JUL-1990. 
FEATURES from to/span description pept < 1 > 333 Ig heavy-chain V region (AA at 1) BASE COUNT 95 a 73 c 84 g 80 t 1 others ORIGIN 1 cagatccagt tggtccagtc tgnacctgag ctgaagaagc ctggagagac agtcaagatc 61 tcctgcaagg cttctgggta taccttcaca aactatggaa tgaactgggt gaagcaggct 121 ccaggaaagg atttaaagtg gatgggctgg ataaacacct atactggaga gccaacattt 181 gctgatgact tcaagggacg gtttgccttc tctttggaaa cctctgccag cactgcctat 241 ttgcagatca acaacctcaa aaatgaggac acggctacat atttctgtgc aagaagagaa 301 aattcctact ggtatttcga tgtctggggc gca // LOCUS MUSIGHAEL 334 bp ss-mRNA ROD 14-AUG-1990 DEFINITION Mouse Ig heavy-chain mRNA V region, partial cds. IC5-3A4HV. ACCESSION M36231 KEYWORDS immunoglobulin heavy-chain; processed gene; variable region. SOURCE Mouse (strain BALB/c), cDNA to mRNA, from hybridoma IC5-3A4. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 334) AUTHORS Kavaler,J., Caton,A.J., Staudt,L.M., Schwartz,D. and Gerhard,W. TITLE A set of closely related antibodies dominates the primary antibody response to influenza virus hemagglutinin JOURNAL J. Immunol. (1990) In press STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by J.Kavaler, 06-JUL-1990. FEATURES from to/span description pept < 1 > 334 Ig heavy-chain V region (AA at 2) BASE COUNT 86 a 69 c 95 g 84 t ORIGIN 1 gggagactta gtgaagcctg gagggtccct gaaactctcc tgtgcagcct ctggattcac 61 tttcagtagt tatggcatgt cttggtttcg ccagattccg gagaagaggc tggagtgggt 121 cgcaaccatt agtggtggtg gtagttatat ttactatgta gacagtgtga aggggcgatt 181 caccatctcc agagacaatg ccaagaacaa cctgtacctg caaatgagca gtctgaggtc 241 tgaggacacg gccttgtatt attgtgcaag agggaatata atataccacg aggactatgc 301 tatggactac tggggtcaag gaacctcagt cacc // LOCUS MUSIGHAEM 328 bp ss-mRNA ROD 14-AUG-1990 DEFINITION Mouse Ig heavy-chain mRNA V region, partial cds. L11-2E2HV. 
ACCESSION M36232 KEYWORDS immunoglobulin heavy-chain; processed gene; variable region. SOURCE Mouse (strain BALB/c), cDNA to mRNA, from hybridoma L11-2E2. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 328) AUTHORS Kavaler,J., Caton,A.J., Staudt,L.M., Schwartz,D. and Gerhard,W. TITLE A set of closely related antibodies dominates the primary antibody response to influenza virus hemagglutinin JOURNAL J. Immunol. (1990) In press STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by J.Kavaler, 06-JUL-1990. FEATURES from to/span description pept < 1 > 328 Ig heavy-chain V region (AA at 2) BASE COUNT 85 a 78 c 96 g 69 t ORIGIN 1 ggcagagctt gtgaagccag gggcctcagt caagttgtcc tgcacagctt ctggcttcaa 61 cattaatgac acctatatgc actgggtgaa gcagaggcct gaacagggcc tggagtggat 121 tggaaggatt gatcctgcga atggtaatac taaatatgac ccgaagttcc agggcaaggc 181 cactataaca gcagacacat cctccaacac agcctacctg cagttcagca gcctgacatc 241 tgaggacact gccgtctatt actgtgatag agaggactac ggtagtaggt acggggggtg 301 gtacttcgat gtctggggcg cagggacc // LOCUS MUSIGHAEN 331 bp ss-mRNA ROD 14-AUG-1990 DEFINITION Mouse Ig heavy-chain mRNA V region, partial cds. L2-10C1HV. ACCESSION M36233 KEYWORDS immunoglobulin heavy-chain; processed gene; variable region. SOURCE Mouse (strain BALB/c), cDNA to mRNA, from hybridoma L2-10C1. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 331) AUTHORS Kavaler,J., Caton,A.J., Staudt,L.M., Schwartz,D. and Gerhard,W. TITLE A set of closely related antibodies dominates the primary antibody response to influenza virus hemagglutinin JOURNAL J. Immunol. (1990) In press STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by J.Kavaler, 06-JUL-1990. 
FEATURES from to/span description pept < 1 > 331 Ig heavy-chain V region (AA at 2) BASE COUNT 90 a 78 c 89 g 74 t ORIGIN 1 acctgagctg aagaagcctg gagagacagt caagatctcc tgcaaggctt ctgggtatac 61 cttcacaaac tatggaatga actgggtgaa gcaggctcca ggaaagggtt taaagtggat 121 gggctggata aacacctaca ctggagagcc aacatatgct gatgacttca agggacggtt 181 tgccttctct ttggaaacct ctgccagcac tgcctatttg cagatcaaca acctcaaaaa 241 tgaggacacg gctacatatt tctgtgcaag atgggactac gggtactggt acttcgatgt 301 ctggggcgca gggaccacgg tcaccgtctc c // LOCUS MUSIGHAEO 328 bp ss-mRNA ROD 14-AUG-1990 DEFINITION Mouse Ig heavy-chain mRNA V region, partial cds. L2-5D2HV. ACCESSION M36234 KEYWORDS immunoglobulin heavy-chain; processed gene; variable region. SOURCE Mouse (strain BALB/c), cDNA to mRNA, from hybridoma L2-5D2. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 328) AUTHORS Kavaler,J., Caton,A.J., Staudt,L.M., Schwartz,D. and Gerhard,W. TITLE A set of closely related antibodies dominates the primary antibody response to influenza virus hemagglutinin JOURNAL J. Immunol. (1990) In press STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by J.Kavaler, 06-JUL-1990. FEATURES from to/span description pept < 1 > 328 Ig heavy-chain V region (AA at 2) BASE COUNT 76 a 85 c 82 g 83 t 2 others ORIGIN 1 ccctgggata ttgcagccct cccagaccct cagtctgact tgttctttct ctgggttttc 61 actgagcact tctggtatgg gtgnnagctg gattcgtcag ccttcaggaa agggtctgga 121 gtggctggca cacatttact gggatgatga caagcgctat aacccatccc tgaagagccg 181 gctcacaatc tccaaggata cctccagaaa ccaggtattc ctcaagatca ccagtgtgga 241 cactgcagat actgccacat actactgtgc tcggtcgtat ggtaatgggg attactatgc 301 tatggactac tggggtcaag gaacctca // LOCUS MUSIGHAEP 324 bp ss-mRNA ROD 14-AUG-1990 DEFINITION Mouse Ig heavy-chain mRNA V region, partial cds. H120-2HV. 
ACCESSION M36235 KEYWORDS immunoglobulin heavy-chain; processed gene; variable region. SOURCE Mouse (strain BALB/c), cDNA to mRNA, from hybridoma H120-2. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 324) AUTHORS Kavaler,J., Caton,A.J., Staudt,L.M., Schwartz,D. and Gerhard,W. TITLE A set of closely related antibodies dominates the primary antibody response to influenza virus hemagglutinin JOURNAL J. Immunol. (1990) In press STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by J.Kavaler, 06-JUL-1990. FEATURES from to/span description pept < 1 > 324 Ig heavy-chain V region (AA at 1) BASE COUNT 83 a 89 c 78 g 74 t ORIGIN 1 gaaattgtgc tcacccagtc tccagcactc atggctgcat ctccagggga gaaggtcacc 61 atcacctgca gtgtcagctc aagtataagt tccagcaact tgcactggta ccagcagaag 121 tcagaaacct cccccaaacc ctggatttat ggcacatcca acctggcttc tggagtccct 181 gttcgcttca gtggcagtgg atctgggacc tcttattctc tcacaatcag cagcatggag 241 gctgaagatg ctgccactta ttactgtcaa cagtggagta gttacccgta cacgttcgga 301 ggggggacca agctggaaat aaaa // LOCUS MUSIGHAEQ 317 bp ss-mRNA ROD 14-AUG-1990 DEFINITION Mouse Ig heavy-chain mRNA V region, partial cds. H130-5VK. ACCESSION M36236 KEYWORDS immunoglobulin heavy-chain; processed gene; variable region. SOURCE Mouse (strain BALB/c), cDNA to mRNA, from hybridoma H130-5. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 317) AUTHORS Kavaler,J., Caton,A.J., Staudt,L.M., Schwartz,D. and Gerhard,W. TITLE A set of closely related antibodies dominates the primary antibody response to influenza virus hemagglutinin JOURNAL J. Immunol. (1990) In press STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by J.Kavaler, 06-JUL-1990. 
FEATURES from to/span description pept < 1 > 317 Ig heavy-chain V region (AA at 1) BASE COUNT 91 a 76 c 71 g 79 t ORIGIN 1 gatatccaga tgacacagac tacatcctcc ctgtctgcct ctctgggaga cagagtcacc 61 atcagttgca gtgcaagtca gggcattagc aattatttaa actggtatca gcagaaacca 121 gatggaactg ttaaactcct gatctattac acatcaagtt tacactcagg agtcccatca 181 aggttcagtg gcagtgggtc tgggacagat tattctctca ccatcagcaa cctggaacct 241 gaagatattg ccacttacta ttgtcagcag tatagtaagc ttccgtacac gttcggaggg 301 gggaccaagc tggaaat // LOCUS MUSIGHAER 324 bp ss-mRNA ROD 14-AUG-1990 DEFINITION Mouse Ig heavy-chain mRNA V region, partial cds. H130-6VK. ACCESSION M36237 KEYWORDS immunoglobulin heavy-chain; processed gene; variable region. SOURCE Mouse (strain BALB/c), cDNA to mRNA, from hybridoma H130-6. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 324) AUTHORS Kavaler,J., Caton,A.J., Staudt,L.M., Schwartz,D. and Gerhard,W. TITLE A set of closely related antibodies dominates the primary antibody response to influenza virus hemagglutinin JOURNAL J. Immunol. (1990) In press STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by J.Kavaler, 06-JUL-1990. FEATURES from to/span description pept < 1 > 324 Ig heavy-chain V region (AA at 1) BASE COUNT 78 a 92 c 78 g 76 t ORIGIN 1 gaaattgtgc tcacccagtc tccagcactc atggctgcat ctccagggga gaaggtcacc 61 atcacctgca gtgtcagctc aagtataagt tccagcaact tgcactggta ccagcagaag 121 tcagaaacct cccccaaacc ctggatttat ggcacatcca acctggcttc tggagtccct 181 gttcgcttca gtggcagtgg atctgggacc tcttattctc tcacaatcag cagcatggag 241 gctgaagatg ctgccactta ttactgtcaa cagtggagta gttacccgct cacgttcggt 301 gctgggacca agctggagct gaaa // LOCUS MUSIGHAES 324 bp ss-mRNA ROD 14-AUG-1990 DEFINITION Mouse Ig heavy-chain mRNA V region, partial cds. H146-24B1VK. ACCESSION M36238 KEYWORDS immunoglobulin heavy-chain; processed gene; variable region. 
SOURCE Mouse (strain BALB/c), cDNA to mRNA, from hybridoma H146-24B1. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 324) AUTHORS Kavaler,J., Caton,A.J., Staudt,L.M., Schwartz,D. and Gerhard,W. TITLE A set of closely related antibodies dominates the primary antibody response to influenza virus hemagglutinin JOURNAL J. Immunol. (1990) In press STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by J.Kavaler, 06-JUL-1990. FEATURES from to/span description pept < 1 > 324 Ig heavy-chain V region (AA at 1) BASE COUNT 83 a 90 c 75 g 76 t ORIGIN 1 gaaattgtgc tcacccagtc tccagcactc atggctgcat ctccagggga gaaggtcacc 61 atcacctgca gtgtcagctc aagtataagt tccagcaact tgcactggta ccagcagaag 121 tcagaaacct cccccaaacc ctggatttat ggcacatcca acctggcttc tggagtccct 181 gttcgcttca gtggcagtgg atctgggacc tcttattctc tcacaatcag cagcatggag 241 gctgaagatg ctgccactta ttactgtcaa cagtggagta gttacccact cacgttcggc 301 tcggggacaa agttggaaat aaaa // LOCUS MUSIGHAET 324 bp ss-mRNA ROD 14-AUG-1990 DEFINITION Mouse Ig heavy-chain mRNA V region, partial cds. H147-25H1VK. ACCESSION M36239 KEYWORDS immunoglobulin heavy-chain; processed gene; variable region. SOURCE Mouse (strain BALB/c), cDNA to mRNA, from hybridoma H147-25H1. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 324) AUTHORS Kavaler,J., Caton,A.J., Staudt,L.M., Schwartz,D. and Gerhard,W. TITLE A set of closely related antibodies dominates the primary antibody response to influenza virus hemagglutinin JOURNAL J. Immunol. (1990) In press STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by J.Kavaler, 06-JUL-1990. 
FEATURES from to/span description pept < 1 > 324 Ig heavy-chain V region (AA at 1) BASE COUNT 84 a 89 c 77 g 74 t ORIGIN 1 gaaattgtgc tcacccagtc tccagcactc atggctgcat ctccagggga gaaggtcacc 61 atcacctgca gtgtcagctc aagtataagt tccagcaact tgcactggta ccagcagaag 121 tcagaaacct cccccaaacc ctggatttat ggcacatcca acctggcttc tggagtccct 181 gttcgcttca gtggcagtgg atctgggacc tcttattctc tcacaatcag cagcatggag 241 gctgaagatg ctgccactta ttactgtcaa cagtggagta gttacccact aacgttcgga 301 ggggggacca agctggaaat aaaa // LOCUS MUSIGHAEU 324 bp ss-mRNA ROD 14-AUG-1990 DEFINITION Mouse Ig heavy-chain mRNA V region, partial cds. H163-130H9. ACCESSION M36240 KEYWORDS immunoglobulin heavy-chain; processed gene; variable region. SOURCE Mouse (strain BALB/c), cDNA to mRNA, from hybridoma H163-130-F2. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 324) AUTHORS Kavaler,J., Caton,A.J., Staudt,L.M., Schwartz,D. and Gerhard,W. TITLE A set of closely related antibodies dominates the primary antibody response to influenza virus hemagglutinin JOURNAL J. Immunol. (1990) In press STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by J.Kavaler, 06-JUL-1990. FEATURES from to/span description pept < 1 > 324 Ig heavy-chain V region (AA at 1) BASE COUNT 78 a 92 c 78 g 76 t ORIGIN 1 gaaattgtgc tcacccagtc tccagcactc atggctgcat ctccagggga gaaggtcacc 61 atcacctgca gtgtcagctc aagtataagt tccagcaact tgcactggta ccagcagaag 121 tcagaaacct cccccaaacc ctggatttat ggcacatcca acctggcttc tggagtccct 181 gttcgcttca gtggcagtgg atctgggacc tcttattctc tcacaatcag cagcatggag 241 gctgaagatg ctgccactta ttactgtcaa cagtggagta gttacccgct cacgttcggt 301 gctgggacca agctggagct gaaa // LOCUS MUSIGHAEV 324 bp ss-mRNA ROD 14-AUG-1990 DEFINITION Mouse Ig heavy-chain mRNA V region, partial cds. H163-130H9. 
ACCESSION M36241 KEYWORDS immunoglobulin heavy-chain; processed gene; variable region. SOURCE Mouse (strain BALB/c), cDNA to mRNA, from hybridoma H163-130H9. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 324) AUTHORS Kavaler,J., Caton,A.J., Staudt,L.M., Schwartz,D. and Gerhard,W. TITLE A set of closely related antibodies dominates the primary antibody response to influenza virus hemagglutinin JOURNAL J. Immunol. (1990) In press STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by J.Kavaler, 06-JUL-1990. FEATURES from to/span description pept < 1 > 324 Ig heavy-chain V region (AA at 1) BASE COUNT 78 a 92 c 78 g 76 t ORIGIN 1 gaaattgtgc tcacccagtc tccagcactc atggctgcat ctccagggga gaaggtcacc 61 atcacctgca gtgtcagctc aagtataagt tccagcaact tgcactggta ccagcagaag 121 tcagaaacct cccccaaacc ctggatttat ggcacatcca acctggcttc tggagtccct 181 gttcgcttca gtggcagtgg atctgggacc tcttattctc tcacaatcag cagcatggag 241 gctgaagatg ctgccactta ttactgtcaa cagtggagta gttacccgct cacgttcggt 301 gctgggacca agctggagct gaaa // LOCUS MUSIGHAEW 321 bp ss-mRNA ROD 14-AUG-1990 DEFINITION Mouse Ig heavy-chain mRNA V region, partial cds. H2-4C2. ACCESSION M36242 KEYWORDS immunoglobulin heavy-chain; processed gene; variable region. SOURCE Mouse (strain BALB/c), cDNA to mRNA, from hybridoma H2-4C2. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 321) AUTHORS Kavaler,J., Caton,A.J., Staudt,L.M., Schwartz,D. and Gerhard,W. TITLE A set of closely related antibodies dominates the primary antibody response to influenza virus hemagglutinin JOURNAL J. Immunol. (1990) In press STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by J.Kavaler, 06-JUL-1990. 
FEATURES from to/span description pept < 1 > 321 Ig heavy-chain V region (AA at 1) BASE COUNT 93 a 79 c 69 g 80 t ORIGIN 1 aatatccaga tgacacagac tacatcctcc ctgtctgcct ctctgggaga cagagtcacc 61 atcagttgca gtgcaagtca gggcattagc aattatttaa actggtatca gcagaaacca 121 gatggaactc ttaaactcct gatctattac acatcaagtt tacactcagg agtcccatca 181 aggttcagtg gcagtgggtc tgggacagat tattctctca ccatcagcaa cctggaacct 241 gaagatattg ccacttacta ttgtcagcac tatagtgagc ttccgtggac gttcggtgga 301 ggcaccaagc tggaaatcaa a // LOCUS MUSIGHAEX 324 bp ss-mRNA ROD 14-AUG-1990 DEFINITION Mouse Ig heavy-chain mRNA V region, partial cds. H220-1. ACCESSION M36243 KEYWORDS immunoglobulin heavy-chain; processed gene; variable region. SOURCE Mouse (strain BALB/c), cDNA to mRNA, from hybridoma H220-1. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 324) AUTHORS Kavaler,J., Caton,A.J., Staudt,L.M., Schwartz,D. and Gerhard,W. TITLE A set of closely related antibodies dominates the primary antibody response to influenza virus hemagglutinin JOURNAL J. Immunol. (1990) In press STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by J.Kavaler, 06-JUL-1990. FEATURES from to/span description pept < 1 > 324 Ig heavy-chain V region (AA at 1) BASE COUNT 83 a 90 c 75 g 76 t ORIGIN 1 gaaattgtgc tcacccagtc tccagcactc atggctgcat ctccagggga gaaggtcacc 61 atcacctgca gtgtcagctc aagtataagt tccagcaact tgcactggta ccagcagaag 121 tcagaaacct cccccaaacc ctggatttat ggcacatcca acctggcttc tggagtccct 181 gttcgcttca gtggcagtgg atctgggacc tcttattctc tcacaatcag cagcatggag 241 gctgaagatg ctgccactta ttactgtcaa cagtggagta gttacccact cacgttcggc 301 tcggggacaa agttggaaat aaaa // LOCUS MUSIGHAEY 324 bp ss-mRNA ROD 14-AUG-1990 DEFINITION Mouse Ig heavy-chain mRNA V region, partial cds. H220-10. ACCESSION M36244 KEYWORDS immunoglobulin heavy-chain; processed gene; variable region. 
SOURCE Mouse (strain BALB/c), cDNA to mRNA, from hybridoma H220-10. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 324) AUTHORS Kavaler,J., Caton,A.J., Staudt,L.M., Schwartz,D. and Gerhard,W. TITLE A set of closely related antibodies dominates the primary antibody response to influenza virus hemagglutinin JOURNAL J. Immunol. (1990) In press STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by J.Kavaler, 06-JUL-1990. FEATURES from to/span description pept < 1 > 324 Ig heavy-chain V region (AA at 1) BASE COUNT 83 a 90 c 75 g 76 t ORIGIN 1 gaaattgtgc tcacccagtc tccagcactc atggctgcat ctccagggga gaaggtcacc 61 atcacctgca gtgtcagctc aagtataagt tccagcaact tgcactggta ccagcagaag 121 tcagaaacct cccccaaacc ctggatttat ggcacatcca acctggcttc tggagtccct 181 gttcgcttca gtggcagtgg atctgggacc tcttattctc tcacaatcag cagcatggag 241 gctgaagatg ctgccactta ttactgtcaa cagtggagta gttacccact cacgttcggc 301 tcggggacaa agttggaaat aaaa // LOCUS MUSIGHAEZ 303 bp ss-mRNA ROD 14-AUG-1990 DEFINITION Mouse Ig heavy-chain mRNA V region, partial cds. H220-22. ACCESSION M36245 KEYWORDS immunoglobulin heavy-chain; processed gene; variable region. SOURCE Mouse (strain BALB/c), cDNA to mRNA, from hybridoma H220-22. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 303) AUTHORS Kavaler,J., Caton,A.J., Staudt,L.M., Schwartz,D. and Gerhard,W. TITLE A set of closely related antibodies dominates the primary antibody response to influenza virus hemagglutinin JOURNAL J. Immunol. (1990) In press STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by J.Kavaler, 06-JUL-1990. 
FEATURES from to/span description pept < 1 > 303 Ig heavy-chain V region (AA at 1) BASE COUNT 82 a 78 c 73 g 70 t ORIGIN 1 tctcacaaat tcatgtccac atcagtagga gacagggtca gcatcacctg caaggccagt 61 caggatgtgg gtactgctgt agcctggtat caacagaaac cagggcaatc tcctaaacta 121 ctgatttact gggcatccac ccggcacact ggagtccctg atcgcttcac aggcagtgga 181 tctgggacag atttcactct caccattagc aatgtgcagt ctgaagactt ggcagattat 241 ttctgtcagc aatatagcag ctatcctcgg acgttcggtg gaggcaccaa gctggaaatc 301 aaa // LOCUS MUSIGHAFA 303 bp ss-mRNA ROD 14-AUG-1990 DEFINITION Mouse Ig heavy-chain mRNA V region, partial cds. H220-23. ACCESSION M36246 KEYWORDS immunoglobulin heavy-chain; processed gene; variable region. SOURCE Mouse (strain BALB/c), cDNA to mRNA, from hybridoma H220-23. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 303) AUTHORS Kavaler,J., Caton,A.J., Staudt,L.M., Schwartz,D. and Gerhard,W. TITLE A set of closely related antibodies dominates the primary antibody response to influenza virus hemagglutinin JOURNAL J. Immunol. (1990) In press STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by J.Kavaler, 06-JUL-1990. FEATURES from to/span description pept < 1 > 303 Ig heavy-chain V region (AA at 1) BASE COUNT 78 a 71 c 70 g 83 t 1 others ORIGIN 1 tctccatcct ccttatctgc ctctctggga gaaagagtca gtctcacttg tcgggcaagt 61 caggacattg gtagtagctt aaactggctt cagcaggaac cagatggaac tattaaacgc 121 ctgatctacg ccacatccag tttagattct ggtgtcccca aaaggttcag tggcagtagg 181 tctgggtcag attattctct caccatcagc agccttgagt ctgaagattt tgtagactat 241 tactgtctac aatatgctag ttctccgtac acgttcggag gggggaccaa gctgnaaata 301 aaa // LOCUS MUSIGHAFB 324 bp ss-mRNA ROD 14-AUG-1990 DEFINITION Mouse Ig heavy-chain mRNA V region, partial cds. H220-5. ACCESSION M36247 KEYWORDS immunoglobulin heavy-chain; processed gene; variable region. 
SOURCE Mouse (strain BALB/c), cDNA to mRNA, from hybridoma H220-5. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 324) AUTHORS Kavaler,J., Caton,A.J., Staudt,L.M., Schwartz,D. and Gerhard,W. TITLE A set of closely related antibodies dominates the primary antibody response to influenza virus hemagglutinin JOURNAL J. Immunol. (1990) In press STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by J.Kavaler, 06-JUL-1990. FEATURES from to/span description pept < 1 > 324 Ig heavy-chain V region (AA at 1) BASE COUNT 79 a 92 c 77 g 76 t ORIGIN 1 gaaattgtgc tcacccagtc tccagcactc atggctgcat ctccagggga gaaggtcacc 61 atcacctgca gtgtcagctc aagtataagt tccagcaact tgcactggta ccagcagaag 121 tcagaaacct cccccaaacc ctggatttat ggcacatcca acctggcttc tggagtccct 181 gttcgcttca gtggcagtgg atctgggacc tcttattctc tcacaatcag cagcatggag 241 gctgaagatg ctgccactta ttactgtcaa cagtggagta gttacccact cacgttcggt 301 gctgggacca agctggagct gaaa // LOCUS MUSIGHAFC 297 bp ss-mRNA ROD 14-AUG-1990 DEFINITION Mouse Ig heavy-chain mRNA V region, partial cds. H220-7. ACCESSION M36248 KEYWORDS immunoglobulin heavy-chain; processed gene; variable region. SOURCE Mouse (strain BALB/c), cDNA to mRNA, from hybridoma H220-7. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 297) AUTHORS Kavaler,J., Caton,A.J., Staudt,L.M., Schwartz,D. and Gerhard,W. TITLE A set of closely related antibodies dominates the primary antibody response to influenza virus hemagglutinin JOURNAL J. Immunol. (1990) In press STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by J.Kavaler, 06-JUL-1990. 
FEATURES from to/span description pept < 1 > 297 Ig heavy-chain V region (AA at 1) BASE COUNT 78 a 72 c 73 g 69 t 5 others ORIGIN 1 aaattcatgt ccacatcagt aggaganagg gtcagcatca cctgcaaggc cagtcaggat 61 gtgagtactg ctgtagcctg gtatcaacag aaaccaggac aatctcctaa actactgatt 121 tactcggcat cctaccggta cactggagtc cctgatcgct tcactggcag tggatctggg 181 nnggatttca ctttcaccat cagcagtgtg caggctgaag acctggcagt ttattactgt 241 cagcnacatn atagtactcc gtggacgttc ggtggaggca ccaagctgga aatcaaa // LOCUS MUSIGHAFD 315 bp ss-mRNA ROD 14-AUG-1990 DEFINITION Mouse Ig heavy-chain mRNA V region, partial cds. H230-2. ACCESSION M36249 KEYWORDS immunoglobulin heavy-chain; processed gene; variable region. SOURCE Mouse (strain BALB/c), cDNA to mRNA, from hybridoma H230-2. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 315) AUTHORS Kavaler,J., Caton,A.J., Staudt,L.M., Schwartz,D. and Gerhard,W. TITLE A set of closely related antibodies dominates the primary antibody response to influenza virus hemagglutinin JOURNAL J. Immunol. (1990) In press STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by J.Kavaler, 06-JUL-1990. FEATURES from to/span description pept < 1 > 315 Ig heavy-chain V region (AA at 1) BASE COUNT 83 a 80 c 75 g 77 t ORIGIN 1 gacattgtga tgacccagtc tcacaaattc atgtccacat cagtaggaga cagggtcagc 61 atcacctgca aggccagtca ggatgtgagt actgctgtag cctggtatca acagaaacca 121 ggacaatctc ctaaactact gatttactcg gcatcctacc ggtacactgg agtccctgat 181 cgcttcactg gcagtggatc tgggacggat ttcactttca ccatcagcag tgtgcaggct 241 gaagacctgg cagtttatta ctgtcagcaa cattatagta ctccattcac gttcggctcg 301 gggacaaagt tggaa // LOCUS MUSIGHAFE 303 bp ss-mRNA ROD 14-AUG-1990 DEFINITION Mouse Ig heavy-chain mRNA V region, partial cds. H230-5. ACCESSION M36250 KEYWORDS immunoglobulin heavy-chain; processed gene; variable region. 
SOURCE Mouse (strain BALB/c), cDNA to mRNA, from hybridoma H230-5. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 303) AUTHORS Kavaler,J., Caton,A.J., Staudt,L.M., Schwartz,D. and Gerhard,W. TITLE A set of closely related antibodies dominates the primary antibody response to influenza virus hemagglutinin JOURNAL J. Immunol. (1990) In press STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by J.Kavaler, 06-JUL-1990. FEATURES from to/span description pept < 1 > 303 Ig heavy-chain V region (AA at 1) BASE COUNT 77 a 77 c 75 g 74 t ORIGIN 1 gacattgtga tgacccagtc tcacaaattc atgtccacat cagtaggaga cagggtcagc 61 atcacctgca aggccagtca ggatgtgagt actgctgtag cctggtatca acagaaacca 121 ggacaatctc ctaaactact gatttactcg gcatcctacc ggtacactgg agtccctgat 181 cgcttcactg gcagtggatc tgggacggat ttcactttca ccatcagcag tgtgcaggct 241 gaagacctgg cagtttatta ctgtcagcaa cattatagta ctccgtggac gttcggtgga 301 ggc // LOCUS MUSIGHAFF 318 bp ss-mRNA ROD 14-AUG-1990 DEFINITION Mouse Ig heavy-chain mRNA V region, partial cds. H250-6. ACCESSION M36251 KEYWORDS immunoglobulin heavy-chain; processed gene; variable region. SOURCE Mouse (strain BALB/c), cDNA to mRNA, from hybridoma H250-6. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 318) AUTHORS Kavaler,J., Caton,A.J., Staudt,L.M., Schwartz,D. and Gerhard,W. TITLE A set of closely related antibodies dominates the primary antibody response to influenza virus hemagglutinin JOURNAL J. Immunol. (1990) In press STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by J.Kavaler, 06-JUL-1990. 
FEATURES from to/span description pept < 1 > 318 Ig heavy-chain V region (AA at 1) BASE COUNT 83 a 81 c 77 g 76 t 1 others ORIGIN 1 gacattgtga tgacccagtc tcacaaattc atgtccacat cagtaggaga cagggtcagc 61 atcacctgca aggccagtca ggatgtgagt actgctgtag cctggtatca acagaaacca 121 ggacaatctc ctaaactact gatttactcg gcatcctacc ggtacactgg agtccctgat 181 cgcttcactg gcagtggatc tgggacggat ttcactttca ccatcagcag tgtgcaggct 241 gaagacctgg cagtttatta ctgtcagcaa cattatagta ctccgtgnac gttcggtgga 301 ggcaccaagc tggaaatc // LOCUS MUSIGHAFG 324 bp ss-mRNA ROD 14-AUG-1990 DEFINITION Mouse Ig heavy-chain mRNA V region, partial cds. H280-10. ACCESSION M36252 KEYWORDS immunoglobulin heavy-chain; processed gene; variable region. SOURCE Mouse (strain BALB/c), cDNA to mRNA, from hybridoma H280-10. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 324) AUTHORS Kavaler,J., Caton,A.J., Staudt,L.M., Schwartz,D. and Gerhard,W. TITLE A set of closely related antibodies dominates the primary antibody response to influenza virus hemagglutinin JOURNAL J. Immunol. (1990) In press STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by J.Kavaler, 06-JUL-1990. FEATURES from to/span description pept < 1 > 324 Ig heavy-chain V region (AA at 1) BASE COUNT 79 a 92 c 77 g 76 t ORIGIN 1 gaaattgtgc tcacccagtc tccagcactc atggctgcat ctccagggga gaaggtcacc 61 atcacctgca gtgtcagctc aagtataagt tccagcaact tgcactggta ccagcagaag 121 tcagaaacct cccccaaacc ctggatttat ggcacatcca acctggcttc tggagtccct 181 gttcgcttca gtggcagtgg atctgggacc tcttattctc tcacaatcag cagcatggag 241 gctgaagatg ctgccactta ttactgtcaa cagtggagta gttacccact cacgttcggt 301 gctgggacca agctggagct gaaa // LOCUS MUSIGHAFH 312 bp ss-mRNA ROD 14-AUG-1990 DEFINITION Mouse Ig heavy-chain mRNA V region, partial cds. H280-15. 
ACCESSION M36253 KEYWORDS immunoglobulin heavy-chain; processed gene; variable region. SOURCE Mouse (strain BALB/c), cDNA to mRNA, from hybridoma H280-15. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 312) AUTHORS Kavaler,J., Caton,A.J., Staudt,L.M., Schwartz,D. and Gerhard,W. TITLE A set of closely related antibodies dominates the primary antibody response to influenza virus hemagglutinin JOURNAL J. Immunol. (1990) In press STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by J.Kavaler, 06-JUL-1990. FEATURES from to/span description pept < 1 > 312 Ig heavy-chain V region (AA at 1) BASE COUNT 72 a 88 c 79 g 73 t ORIGIN 1 gaaaatgtgc tcacccagtc tccagcaatc atgtctgcat ctccagggga aaaggtcacc 61 atgacctgca gggccagctc aagtgtaagt tccagttact tgcactggta ccagcagaag 121 tcaggtgcct cccccaaact ctggatttat agcacatcca acttggcttc tggagtccct 181 gctcgcttca gtggcagtgg gtctggcacc tcttactctc tcacaatcag cagtgtggag 241 gctgaagatg ctgccactta ttactgccag cagtacagtg gttacccgta cacgttcgga 301 ggggggacca ag // LOCUS MUSIGHAFI 324 bp ss-mRNA ROD 14-AUG-1990 DEFINITION Mouse Ig heavy-chain mRNA V region, partial cds. H35-C10. ACCESSION M36254 KEYWORDS immunoglobulin heavy-chain; processed gene; variable region. SOURCE Mouse (strain BALB/c), cDNA to mRNA, from hybridoma H35-C10. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 324) AUTHORS Kavaler,J., Caton,A.J., Staudt,L.M., Schwartz,D. and Gerhard,W. TITLE A set of closely related antibodies dominates the primary antibody response to influenza virus hemagglutinin JOURNAL J. Immunol. (1990) In press STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by J.Kavaler, 06-JUL-1990. 
FEATURES from to/span description pept < 1 > 324 Ig heavy-chain V region (AA at 1) BASE COUNT 79 a 92 c 77 g 76 t ORIGIN 1 gaaattgtgc tcacccagtc tccagcactc atggctgcat ctccagggga gaaggtcacc 61 atcacctgca gtgtcagctc aagtataagt tccagcaact tgcactggta ccagcagaag 121 tcagaaacct cccccaaacc ctggatttat ggcacatcca acctggcttc tggagtccct 181 gttcgcttca gtggcagtgg atctgggacc tcttattctc tcacaatcag cagcatggag 241 gctgaagatg ctgccactta ttactgtcaa cagtggagta gttacccact cacgttcggt 301 gctgggacca agctggagct gaaa // LOCUS MUSIGHAFJ 302 bp ss-mRNA ROD 14-AUG-1990 DEFINITION Mouse Ig heavy-chain mRNA V region, partial cds. H35-C7. ACCESSION M36255 KEYWORDS immunoglobulin heavy-chain; processed gene; variable region. SOURCE Mouse (strain BALB/c), cDNA to mRNA, from hybridoma H35-C7. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 302) AUTHORS Kavaler,J., Caton,A.J., Staudt,L.M., Schwartz,D. and Gerhard,W. TITLE A set of closely related antibodies dominates the primary antibody response to influenza virus hemagglutinin JOURNAL J. Immunol. (1990) In press STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by J.Kavaler, 06-JUL-1990. FEATURES from to/span description pept < 1 > 302 Ig heavy-chain V region (AA at 1) BASE COUNT 75 a 76 c 77 g 74 t ORIGIN 1 gcttctttgg ctgtgtctct agggcagagg gccaccatat cctgcagagc cagtgaaagt 61 gttgatagtt atggcaatag ttttatgcac tggtaccagc agaaaccagg acagccaccc 121 aaactcctca tctatcgtgc atccaaccta gaatctggga tccctgccag gttcagtggc 181 agtgggtcta ggacagactt caccctcaca attaatcctg tggaggctga tgatgttgca 241 acctattact gtcagcaaag taatgaggat ccgctcacgt tcggtgctgg gaccaagctg 301 ga // LOCUS MUSIGHAFK 321 bp ss-mRNA ROD 14-AUG-1990 DEFINITION Mouse Ig heavy-chain mRNA V region, partial cds. H61-15. ACCESSION M36256 KEYWORDS immunoglobulin heavy-chain; processed gene; variable region. 
SOURCE Mouse (strain BALB/c), cDNA to mRNA, from hybridoma H61-15. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 321) AUTHORS Kavaler,J., Caton,A.J., Staudt,L.M., Schwartz,D. and Gerhard,W. TITLE A set of closely related antibodies dominates the primary antibody response to influenza virus hemagglutinin JOURNAL J. Immunol. (1990) In press STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by J.Kavaler, 06-JUL-1990. BASE COUNT 95 a 75 c 78 g 71 t 2 others ORIGIN 1 gacatcaaga tgacccagtc tccatcctcc atgtatgcnt cgctgggaga gagagtcact 61 atcacttgca aggcgagtca ggacattaaa agctatttaa gctggtacca gcagaaacca 121 tggaaatctc ctaagaccct gatctattat gcaacaagct tggcagatgg ggtcccatca 181 agattcagtg gcagtggatc tgggcaanat tattctctaa ccatcagcag cctggagtct 241 gacgatacag caacttatta ctgtctacag catggtgaga gcccgtacac gttcggaggg 301 gggaccaagc tggaaataaa a // LOCUS MUSIGHAFL 324 bp ss-mRNA ROD 14-AUG-1990 DEFINITION Mouse Ig heavy-chain mRNA V region, partial cds. H9-A15. ACCESSION M36257 KEYWORDS immunoglobulin heavy-chain; processed gene; variable region. SOURCE Mouse (strain BALB/c), cDNA to mRNA, from hybridoma H9-A15. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 324) AUTHORS Kavaler,J., Caton,A.J., Staudt,L.M., Schwartz,D. and Gerhard,W. TITLE A set of closely related antibodies dominates the primary antibody response to influenza virus hemagglutinin JOURNAL J. Immunol. (1990) In press STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by J.Kavaler, 06-JUL-1990. 
FEATURES from to/span description pept < 1 > 324 Ig heavy-chain V region (AA at 1) BASE COUNT 72 a 94 c 76 g 82 t ORIGIN 1 caaattgttc tttcccagtc tccagcagtc atgtctgcat ctcctgggga gaaggtcacc 61 ttgacctgca gtgccagctc aagtgtaagt tccacctact tgtactggta ccagcagaag 121 ccaggatcct cccccaaact ctggatttat agcacatcca acctggcttc tggagtccct 181 gctcgcttca gtggcagtgg gtctgggacc tcttactctc tcacaatcag cagcatggag 241 gctgaagatg ctgcctctta tttctgccat cagtggagta gttacccacc gacgttcggt 301 ggaggcacca agctggaaat caaa // LOCUS MUSIGHAFM 300 bp ss-mRNA ROD 14-AUG-1990 DEFINITION Mouse Ig heavy-chain mRNA V region, partial cds. H9-D3. ACCESSION M36258 KEYWORDS immunoglobulin heavy-chain; processed gene; variable region. SOURCE Mouse (strain BALB/c), cDNA to mRNA, from hybridoma H9-D3. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 300) AUTHORS Kavaler,J., Caton,A.J., Staudt,L.M., Schwartz,D. and Gerhard,W. TITLE A set of closely related antibodies dominates the primary antibody response to influenza virus hemagglutinin JOURNAL J. Immunol. (1990) In press STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by J.Kavaler, 06-JUL-1990. FEATURES from to/span description pept < 1 > 300 Ig heavy-chain V region (AA at 1) BASE COUNT 81 a 69 c 63 g 87 t ORIGIN 1 cagattacat cctccctgtc tgtctctctg ggagacagag tcaccatcag ttgcagtgca 61 agtcagggca ttaacagtta tttaaactgg tatcagcaga aaccggatgg aactgttaaa 121 ctcctgatct atttcacatc aattttatat tcaggagtcc catcaaggtt cagtggcagt 181 gggtctggga cagattattc tctcaccatc agcaacttgg aacctgaaga tattgccact 241 tactattgtc agcagtatac tattcttcca ttcacgttcg gctcggggac aaagttggaa // LOCUS MUSIGHAFN 318 bp ss-mRNA ROD 14-AUG-1990 DEFINITION Mouse Ig heavy-chain mRNA V region, partial cds. IC5-3A4. ACCESSION M36259 KEYWORDS immunoglobulin heavy-chain; processed gene; variable region. 
SOURCE Mouse (strain BALB/c), cDNA to mRNA, from hybridoma IC5-3A4. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 318) AUTHORS Kavaler,J., Caton,A.J., Staudt,L.M., Schwartz,D. and Gerhard,W. TITLE A set of closely related antibodies dominates the primary antibody response to influenza virus hemagglutinin JOURNAL J. Immunol. (1990) In press STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by J.Kavaler, 06-JUL-1990. FEATURES from to/span description pept < 1 > 318 Ig heavy-chain V region (AA at 1) BASE COUNT 87 a 80 c 75 g 76 t ORIGIN 1 gacattgtga tgacccagtc tcacaaattc atgtccacat cagtaggaga cagggtcagc 61 atcccctgca aggccagtca ggatgtaggt actgctgtag cctggtatca acagaaacca 121 ggacaatctc ctaaaatact gatttactgg gcatccaacc ggcaaattgg agtccctgat 181 cgcttcacag gcagtggatc tgggacagat ttcactctca ccattagcta tgtgcagtct 241 gaagacttgg cagattattt ctgtcagcaa tatagcacct atcccacgtt cggtgctggg 301 accaagctgg agctgaaa // LOCUS MUSIGHAFO 312 bp ss-mRNA ROD 14-AUG-1990 DEFINITION Mouse Ig heavy-chain mRNA V region, partial cds. L11-2E2. ACCESSION M36260 KEYWORDS immunoglobulin heavy-chain; processed gene; variable region. SOURCE Mouse (strain BALB/c), cDNA to mRNA, from hybridoma L11-2E2. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 312) AUTHORS Kavaler,J., Caton,A.J., Staudt,L.M., Schwartz,D. and Gerhard,W. TITLE A set of closely related antibodies dominates the primary antibody response to influenza virus hemagglutinin JOURNAL J. Immunol. (1990) In press STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by J.Kavaler, 06-JUL-1990. 
FEATURES from to/span description pept < 1 > 312 Ig heavy-chain V region (AA at 1) BASE COUNT 92 a 76 c 67 g 77 t ORIGIN 1 gacatccaga tgacacagtc tccatcctca ctgtctgcat ctctgggagg caaagtcacc 61 atcacttgca aggcaagcca agacattaac aagtatatag cttggtacca acacaagcct 121 ggaaaaggtc ctaggctgtt catacattat acatctacat tacagccagg catcccatca 181 aggttcagtg gaagtgggtc tgggagacat tattccttca gcatcagcaa cctggagcct 241 gaagatattg caacttatta ttgtctacag tatgataatc ttctgacgtt cggtggaggc 301 accaagctgg aa // LOCUS MUSIGHAFP 321 bp ss-mRNA ROD 14-AUG-1990 DEFINITION Mouse Ig heavy-chain mRNA V region, partial cds. L2-10C1. ACCESSION M36261 KEYWORDS immunoglobulin heavy-chain; processed gene; variable region. SOURCE Mouse (strain BALB/c), cDNA to mRNA, from hybridoma L2-10C1. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 321) AUTHORS Kavaler,J., Caton,A.J., Staudt,L.M., Schwartz,D. and Gerhard,W. TITLE A set of closely related antibodies dominates the primary antibody response to influenza virus hemagglutinin JOURNAL J. Immunol. (1990) In press STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by J.Kavaler, 06-JUL-1990. FEATURES from to/span description pept < 1 > 321 Ig heavy-chain V region (AA at 1) BASE COUNT 93 a 77 c 71 g 80 t ORIGIN 1 gatatccaga tgacacagac tacatcctcc ctgtctgcct ctctgggaga cagagtcacc 61 atcagttgca gtgcaagtca gggcattagc aattatttaa actggtatca gcagaaacca 121 gatggaactg ttaaactcct gatctattac acatcaagtt tacactcagg agtcccatca 181 aggttcagtg gcagtgggtc tgggacagat tattctctca ccatcagcaa cctggaacct 241 gaagatattg ccacttacta ttgtcagcag tatagtaagc ttccgtggac gttcggtgga 301 ggcaccaagc tggaaatcaa a // LOCUS MUSIGHAFQ 288 bp ss-mRNA ROD 14-AUG-1990 DEFINITION Mouse Ig heavy-chain mRNA V region, partial cds. L2-5D2. ACCESSION M36262 KEYWORDS immunoglobulin heavy-chain; processed gene; variable region. 
SOURCE Mouse (strain BALB/c), cDNA to mRNA, from hybridoma L2-5D2. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 288) AUTHORS Kavaler,J., Caton,A.J., Staudt,L.M., Schwartz,D. and Gerhard,W. TITLE A set of closely related antibodies dominates the primary antibody response to influenza virus hemagglutinin JOURNAL J. Immunol. (1990) In press STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by J.Kavaler, 06-JUL-1990. FEATURES from to/span description pept < 1 > 288 Ig heavy-chain V region (AA at 1) BASE COUNT 78 a 65 c 68 g 73 t 4 others ORIGIN 1 cagtctccag cctccctgtc tgcatctgtg ggagaaactg tcaccatcac atgtcgagca 61 agtgggaata ttcacaatta tttagcatgg tatcagcaga nacagggaaa atctcctcag 121 ctcctggtct ataatgcaaa aaccttagca gatggtgtgc catcaaggtt cagtggcagt 181 ggatcaggaa cacaatattc tctcaagatc aacagcctgc agcctgaaga ttttgggagt 241 tattactgtc aacatttttg gnnnactccg tacacgttcg gagggggg // LOCUS MUSIGHAFR 363 bp ss-mRNA ROD 14-AUG-1990 DEFINITION Mouse Ig heavy-chain mRNA V region, partial cds. ACCESSION M36207 KEYWORDS immunoglobulin heavy-chain; processed gene; variable region. SOURCE Mouse (strain BALB/c), cDNA to mRNA, from hybridoma H120-2. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 363) AUTHORS Kavaler,J., Caton,A.J., Staudt,L.M., Schwartz,D. and Gerhard,W. TITLE A set of closely related antibodies dominates the primary antibody response to influenza virus hemagglutinin JOURNAL J. Immunol. (1990) In press STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by J.Kavaler, 06-JUL-1990. 
FEATURES from to/span description pept < 1 > 363 Ig heavy-chain V region (AA at 1) BASE COUNT 85 a 88 c 101 g 89 t ORIGIN 1 caggtccagt tgcagcagtc tggagctgag ctggtaaggc ctgggacttc agtgaagata 61 tcctgcaagg cttctggcta caccttcact aactactggc taggttgggt aaagcagagg 121 cctggacatg gacttgagtg gattggagat atttaccctg gaggtggtta tactaactac 181 aatgagaagt tcaagggcaa ggccacactg actgcagaca catcctccag cactgcctac 241 atgcagctca gtagcctgac atctgaggac tctgctgtct atttctgtgc gaggatttac 301 tacggtagta gctactggta cttcgatgtc tggggcgcag ggaccacggt caccgtctcc 361 tca // LOCUS MUSIGHAFS 348 bp ss-mRNA ROD 14-AUG-1990 DEFINITION Mouse Ig heavy-chain mRNA V region, partial cds. ACCESSION M36208 KEYWORDS immunoglobulin heavy-chain; processed gene; variable region. SOURCE Mouse (strain BALB/c), cDNA to mRNA, from hybridoma H130-5. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 348) AUTHORS Kavaler,J., Caton,A.J., Staudt,L.M., Schwartz,D. and Gerhard,W. TITLE A set of closely related antibodies dominates the primary antibody response to influenza virus hemagglutinin JOURNAL J. Immunol. (1990) In press STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by J.Kavaler, 06-JUL-1990. FEATURES from to/span description pept < 1 > 348 Ig heavy-chain V region (AA at 1) BASE COUNT 96 a 80 c 93 g 78 t 1 others ORIGIN 1 cagatccagt tggtccagtc tgnacctgag ctgaagaagc ctggagagac agtcaagatc 61 tcctgcaagg cttctgggta taccttcaca aactatggaa tgaactgggt gaagcaggct 121 ccaggaaagg gtttaaagtg gatgggctgg ataaacacct acactggaga gccaacatat 181 gctgatgact tcaagggacg gtttgccttc tctttggaaa cctctgccag cactgcctat 241 ttgcagatca acaacctcaa aaatgaggac acggctacat atttctgtgc aagaggagat 301 ggtaactact ggtacttcga tgtctggggc gcagggacca cggtcacc // LOCUS MUSMHEA2A1 478 bp ds-DNA ROD 14-AUG-1990 DEFINITION Mouse MHC class II A-beta 2 gene, exon 2. 
ACCESSION M30158 KEYWORDS cell surface glycoprotein; class II gene; integral membrane protein; major histocompatibility complex. SEGMENT 1 of 2 SOURCE Mouse (haplotype SAX2) liver DNA. ORGANISM Mus saxicola Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 478) AUTHORS Cam,P., Jouvin-Marche,E., Leguern,C. and Marche,P.N. TITLE Structure of class II genes in wild mouse Mus saxicola: Functional and evolutionary implications JOURNAL Eur. J. Immunol. 20, 1337-1343 (1990) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by P.Cam, 27-NOV-1989, for release after publication. FEATURES from to/span description pept / 154 + 423 A-beta 2 chain, exon 2 (AA at 156) IVS < 1 153 A-beta 2 chain intron A IVS 424 > 478 A-beta 2 chain intron B BASE COUNT 118 a 99 c 132 g 129 t ORIGIN 1 aaataaatgc caataagcat taaatgacat tgactttgtc tcttaatgtt gggcttttga 61 gcacattgct atctgactta gtgtaccagg tacaggcttt ccctgaggtg cggctggttc 121 ctaactggtt cctaatcctt ttatggtttc cagagaattt tgtgattcag gcaaaggcag 181 actgttactt caccaatggg acagaaaagg ttcacttcct ggtcaggttc atcttcaacc 241 tggaggagta tttacatttc gacagtgacc tggggatgtt cgtggcgctg acggagctgg 301 gggaacctga cgctgaccag tggaacaaac gcctagatct cctggagatg agcagagccg 361 ctgtgaacat ggtctgcagg cagaagtaca agctgggggc ccccttcatc gtggagagaa 421 atggtgggta acagcaaggt ctaggggttc tccgtgagaa ccttgtaata ggcagttt // LOCUS MUSMHEA2A2 526 bp ds-DNA ROD 14-AUG-1990 DEFINITION Mouse MHC class II A-beta 2 gene, exon 3. ACCESSION M30159 KEYWORDS cell surface glycoprotein; class II gene; integral membrane protein; major histocompatibility complex. SEGMENT 2 of 2 SOURCE Mouse (haplotype SAX2) liver DNA. ORGANISM Mus saxicola Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 526) AUTHORS Cam,P., Jouvin-Marche,E., Leguern,C. and Marche,P.N. 
TITLE Structure of class II genes in wild mouse Mus saxicola: Functional and evolutionary implications JOURNAL Eur. J. Immunol. 20, 1337-1343 (1990) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by P.Cam, 27-NOV-1989, for release after publication. FEATURES from to/span description pept + 49 / 330 A-beta 2 chain, exon 3 (AA at 49) IVS < 1 48 A-beta 2 chain intron B IVS 331 > 526 A-beta 2 chain intron C BASE COUNT 115 a 135 c 136 g 140 t ORIGIN 1 tctagatgct cctgtccttg ctgtgccttt tgtttcttct cttgccagtg cctccagagg 61 tgacagtgta cccagagagg accccattgc tgcagcagca caacctgctg ctctgctctg 121 tgacaggctt ctaccccagg gacataagtg tcaagtggtt ccggaatgga caggaggagc 181 ggtctggggt catgtccact ggccttgtta ggaatggaga ctggaccttc cagacaacag 241 taatgctgga aatgatccca gagcttggtg acatctacag ctgcctcgtg gagcaccccg 301 gcctcctgag acctgtttct gtggcatgga gtgagcttca tttagttttc tgtatttctg 361 gaccttgacc tgtaggagct atcatttctg tgttcatctt gatgacaaga caagttcccc 421 tgatctaaga atcctagtgc cagggcagga gaaaacagga caggcatcca ttctctgaat 481 gtccaagaga ggctgttaac ccacaacttg cctgtaatcc tgggac // LOCUS MUSMHEAII1 491 bp ds-DNA ROD 14-AUG-1990 DEFINITION Mouse MHC class II A-beta 2 gene, exon 2. ACCESSION M30156 KEYWORDS cell surface glycoprotein; class II gene; integral membrane protein; major histocompatibility complex. SEGMENT 1 of 2 SOURCE Mouse (haplotype SAX1) liver DNA. ORGANISM Mus saxicola Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 491) AUTHORS Cam,P., Jouvin-Marche,E., Leguern,C. and Marche,P.N. TITLE Structure of class II genes in wild mouse Mus saxicola: Functional and evolutionary implications JOURNAL Eur. J. Immunol. 20, 1337-1343 (1990) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by P.Cam, 27-NOV-1989, for release after publication. 
FEATURES from to/span description pept / 158 + 427 A-beta 2 chain, exon 2 (AA at 160) IVS < 1 157 A-beta 2 chain intron A IVS 428 > 491 A-beta 2 chain intron B BASE COUNT 123 a 97 c 135 g 136 t ORIGIN 1 aaataaatgt caataagcat taaatgacat taagtgactt tgtctcttaa tgttgggctt 61 ttgagcacat tgctatctga cttagtgtac caggtacagg ctttcccaga ggtgcggctg 121 gttcctaact ggttcctaat ccttttatgg tttccagaga attttgtgat tcaggcaaag 181 gcggactgtt acttcaccaa tgggacagaa aaggttcact tcctggtcag gttcatcttc 241 aacctggagg aatatttaca tttcgacagt gacctgggga tgtttgtggc gctgacggag 301 ctgggggaac ctgacgctga ccagtggaac aaacgcctgg atctcctgga gacgagcaga 361 gccgctgtga acatggtctg caggcagaag tacaatctgg gggccccctt cattgtggag 421 agaaatggtg ggtaacagca aggtctaggg gttctccgtg agaaccttgt aataggcatt 481 tgaaaggttt t // LOCUS MUSMHEAII2 592 bp ds-DNA ROD 14-AUG-1990 DEFINITION Mouse MHC class II A-beta 2 gene, exon 3. ACCESSION M30157 KEYWORDS cell surface glycoprotein; class II gene; integral membrane protein; major histocompatibility complex. SEGMENT 2 of 2 SOURCE Mouse (haplotype SAX1) liver DNA. ORGANISM Mus saxicola Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 592) AUTHORS Cam,P., Jouvin-Marche,E., Leguern,C. and Marche,P.N. TITLE Structure of class II genes in wild mouse Mus saxicola: Functional and evolutionary implications JOURNAL Eur. J. Immunol. 20, 1337-1343 (1990) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by P.Cam, 27-NOV-1989, for release after publication. 
FEATURES from to/span description pept + 49 / 330 A-beta 2 chain, exon 3 (AA at 50) IVS < 1 48 A-beta 2 chain intron B IVS 331 > 592 A-beta 2 chain intron C BASE COUNT 138 a 147 c 148 g 159 t ORIGIN 1 tctagatgct cctgtccttg ctgtgccttt tgtttcttct cttgccagtg tctccagagg 61 tgacagtgta cccagagagg accccattgc tgcagcagca caacctgctg ctctgctctg 121 tgacaggctt ctatcccggg aacataagtg tcaagtggtt ccggaatgga caggaggaga 181 ggtctggggt catgtccact ggccttgtta ggaatggaga ctggaccttc cagacaacag 241 taatgctgga aataatccca gagcttggtg acatctacag ctgcctcgtg gagcaccccg 301 gcctcctgag acctgtttct gtggcatgga gtgagcttta ttagttttct gtaattctgg 361 accttgacct gtaggagctg tcatttctgt gtcttgatga caagacaagt tcccctgatc 421 taagaatcct aatgccaggg gcaggagaaa acaggacagg catccattct ctgaatgtcc 481 aagagataga taaggaggct gttaacccac aacttgcctc taaccctggg acctagtatc 541 ctttagcact ttaatcttta cagacactgc cactggctat aagtcaggga tc // LOCUS MUSMHEB2B1 518 bp ds-DNA ROD 14-AUG-1990 DEFINITION Mouse MHC class II E-beta 2 gene, exon 2. ACCESSION M30160 KEYWORDS cell surface glycoprotein; class II gene; integral membrane protein; major histocompatibility complex. SEGMENT 1 of 2 SOURCE Mouse (haplotype SAX) liver DNA. ORGANISM Mus saxicola Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 518) AUTHORS Cam,P., Jouvin-Marche,E., Leguern,C. and Marche,P.N. TITLE Structure of class II genes in wild mouse Mus saxicola: Functional and evolutionary implications JOURNAL Eur. J. Immunol. 20, 1337-1343 (1990) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by P.Cam, 27-NOV-1989, for release after publication. 
FEATURES from to/span description pept / 92 + 358 E-beta 2 chain, exon 2 (AA at 94) IVS < 1 91 E-beta 2 chain intron A IVS 359 > 518 E-beta 2 chain intron B BASE COUNT 120 a 94 c 151 g 153 t ORIGIN 1 ctgcaggtgt cacagttcct ttcactgact gccattctgg agcattgtct gtcctcacag 61 acatcctgta attgtgttca tgtacccaca gcacgttttc tggagcagtt gaaggctgag 121 tgtcactact tcaatgggaa ggagcgtgtg tggagtgtga ccagattcat ctataaccag 181 gaagagtttg tccgctttaa cactgacttt gggaagttcc tggcagtgac tgagctgggg 241 cggcccatag ctgagtactt gaacacccag aaggacatgc tggacaatta tcgtgcctca 301 gtggacaggt gcagaaataa ctatgacctt gtggatatct tcatgttgaa cttaaaaggt 361 aagcattaga tagagagtag atgggttgag gggtgtgtgt gtgtgtgtgt gtgtgtgtgt 421 gtgtgtgtgt gtgtgtaaga gaggatagtt ctgtgtcacg tgtgtggaga tcataagaca 481 ggttactgat tctcgccatc tatcctatgg tcctgaga // LOCUS MUSMHEB2B2 573 bp ds-DNA ROD 14-AUG-1990 DEFINITION Mouse MHC class II E-beta 2 gene, exon 3. ACCESSION M30161 KEYWORDS cell surface glycoprotein; class II gene; integral membrane protein; major histocompatibility complex. SEGMENT 2 of 2 SOURCE Mouse (haplotype SAX) liver DNA. ORGANISM Mus saxicola Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 573) AUTHORS Cam,P., Jouvin-Marche,E., Leguern,C. and Marche,P.N. TITLE Structure of class II genes in wild mouse Mus saxicola: Functional and evolutionary implications JOURNAL Eur. J. Immunol. 20, 1337-1343 (1990) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by P.Cam, 27-NOV-1989, for release after publication. 
FEATURES from to/span description pept + 97 / 378 E-beta 2 chain, exon 3 (AA at 97) IVS < 1 96 E-beta 2 chain intron B IVS 379 > 573 E-beta 2 chain intron C BASE COUNT 138 a 155 c 143 g 137 t ORIGIN 1 gagatcttta gcctgggtgg atggaggcag gtaggcagcc cacagggaga acagctctat 61 cctcagggct ctctctcacc tccctttctc ctctagctga gcccaaggtg actgtgtatc 121 ctgcaaagac acagcctctg gaacaccaca acctcctggt ctgctctgtg aatgacttct 181 accctggcag cattgaagtc agatggttcc ggaatggcga ggaggagaag actggagttg 241 tgtccaccgg actgatccaa aatggagatt ggacctacca gaccctgttg atgctggaga 301 tggttcctca gggtggagag gtttacacct gccaggtgga gcatcccagc ctgaccagcc 361 ctgtcacagt ggagtggagt gagtgggaaa cccttgactc tgcaaatgcc caaccaccat 421 ataataggag tgactttctc tgcctgttcc ctttctctga ccctgtaaat ccctgcctac 481 cagggaagca catgggtgac tccacaggaa aaatgaatac aatttccaag ttactagctt 541 ttattcctca cctagttcaa catctatgcc agt // LOCUS MUSMHEBI1 445 bp ds-DNA ROD 14-AUG-1990 DEFINITION Mouse MHC class II E-beta 1 gene, exon 2. ACCESSION M30154 KEYWORDS cell surface glycoprotein; class II gene; integral membrane protein; major histocompatibility complex. SEGMENT 1 of 2 SOURCE Mouse (haplotype SAX) liver DNA. ORGANISM Mus saxicola Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 445) AUTHORS Cam,P., Jouvin-Marche,E., Leguern,C. and Marche,P.N. TITLE Structure of class II genes in wild mouse Mus saxicola: Functional and evolutionary implications JOURNAL Eur. J. Immunol. 20, 1337-1343 (1990) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by P.Cam, 27-NOV-1989, for release after publication. 
FEATURES from to/span description pept / 11 + 280 E-beta 1 chain, exon 2 (AA at 13) IVS < 1 10 E-beta 1 chain intron A IVS 281 > 445 E-beta 1 chain intron B BASE COUNT 101 a 108 c 161 g 75 t ORIGIN 1 gtctccacag cacggttttt ggaacaagct aaacatgagt gtcatttcta caacggaacg 61 cagcgcgtgc ggtttctgtt gagacaaatc cacaacaggg aggagaacgt gcgcttcgac 121 agcgacctgg gcgagttccg cgcagtgaac gagctggggc ggccagacgc cgagtacttc 181 aacagccaga aggaggtcct ggaggatttg cgggccgctg tggacacgta ctgcagacac 241 aactacgaga tcttagataa cttccttgtg ccgcggagag gtgagacagg aggcggtggg 301 tggggcggaa ccacggtgag ggtggggctg tggggagcag caggaggcgg tgcgcacgtg 361 cgcaggagcc gcaggggatg ctgggttccc tgccgctgga gccacaggcg gctttaagca 421 gcctcatggc aggagaacag aattc // LOCUS MUSMHEBI2 525 bp ds-DNA ROD 14-AUG-1990 DEFINITION Mouse MHC class II E-beta 1 gene, exon 3. ACCESSION M30155 KEYWORDS cell surface glycoprotein; class II gene; integral membrane protein; major histocompatibility complex. SEGMENT 2 of 2 SOURCE Mouse (haplotype SAX) liver DNA. ORGANISM Mus saxicola Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 525) AUTHORS Cam,P., Jouvin-Marche,E., Leguern,C. and Marche,P.N. TITLE Structure of class II genes in wild mouse Mus saxicola: Functional and evolutionary implications JOURNAL Eur. J. Immunol. 20, 1337-1343 (1990) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by P.Cam, 27-NOV-1989, for release after publication. 
FEATURES from to/span description pept + 102 / 383 E-beta 1 chain, exon 3 (AA at 102) IVS < 1 101 E-beta 1 chain intron C IVS 384 > 525 E-beta 1 chain intron D BASE COUNT 116 a 144 c 141 g 124 t ORIGIN 1 ggatccagtc tggatggatg gaggcaggca ggcaggcagg cagcctacag gaaggacagc 61 tccaccctca tggcaccttc tcacctctct ttctcctcta gttgagccta cagtgactgt 121 gtaccctaca aagacacagc ccctggaaca ccacaacctc ctggtctgct ctgtgagtga 181 cttctaccca ggcagcattg aagtcagatg gttccggaat ggtgaggagg agaagacggg 241 aattgtgtcc actggcctga tccgaaatgg agactggact ttccagatgc tggtgatgct 301 ggagacggtt cctcaggtgg aagaggttta cacctgccag gtggagcatc ccagcctgac 361 cagccctgtc acagtccagt ggagtgagtg ataacttcca tactctgtga atgtcattcc 421 accagggagg gggcgtggct tatccctgcc tgtcagcttt ctctgcccac atactctttc 481 cactggtttt atgctgtcct gactttcacc acagcttaaa gggaa // LOCUS MUSMMTVA 448 bp ds-DNA ROD 14-AUG-1990 DEFINITION Mouse mammary tumor virus long terminal repeat. ACCESSION M35959 KEYWORDS long terminal repeat. SOURCE Mouse lymphoma cell line MLA DNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 448) AUTHORS Yanagawa,S.-I., Murakami,A. and Tanaka,H. TITLE Extra mouse mammary tumor proviruses in DBA/2 mouse lymphomas acquire a selective advantage in lymphocytes by alteration in the U3 region of the long terminal repeat JOURNAL J. Virol. 
64, 2474-2483 (1990) STANDARD simple staff_entry BASE COUNT 150 a 70 c 95 g 133 t ORIGIN 1 aatccaatag gtagattatt aattactatg ttaagaaatg aatcattatc ttttagtact 61 atttttactc aaattcagaa gttagaaatg ggaatagaaa atagaaagag acgctcagcc 121 tcagttaaag aacaggtgca agagctaacg gccgcaggcc tagaagtaaa aaagggaaag 181 aagagtgtgc ttgtcaaaat aagacacagg tggtggcaac cagggactta taggggacct 241 tacatctaca gaccaacaga tgcccccttt cttaaaacaa ggatgtgaga caggtggttt 301 cctgagtcgg tttggtatca aatgttctaa tctgagctct tagtgttcta ttttcctgtg 361 ttcttttgga atctatccaa gtcttatgta aatgcttatg taaaccatga tataaaagag 421 tgctgatttt tttgagtaaa tttgcaac // LOCUS MUSMMTVB 527 bp ds-DNA ROD 14-AUG-1990 DEFINITION Mouse mammary tumor virus long terminal repeat. ACCESSION M35960 KEYWORDS long terminal repeat. SOURCE Mouse lymphoma cell line DL8 DNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 527) AUTHORS Yanagawa,S.-I., Murakami,A. and Tanaka,H. TITLE Extra mouse mammary tumor proviruses in DBA/2 mouse lymphomas acquire a selective advantage in lymphocytes by alteration in the U3 region of the long terminal repeat JOURNAL J. Virol. 
64, 2474-2483 (1990) STANDARD simple staff_entry BASE COUNT 169 a 82 c 120 g 156 t ORIGIN 1 aatccaatag gtagattatt aattactatg ttaagaaatg aatcattatc ttttagtact 61 atttttactc aaattcagaa gttagaaatg ggaatagaaa atagaaagag acgctcagcc 121 tcagttgaag aacaggtgca agagctaacg gccgcaggcc tagaagtaaa aaagggaaag 181 aagagtgtgc ttgtcaaaat aggagacagg tggtggcaac cagggactta taggggacct 241 tacatctaca gacggttagc ttggcttccc agggttcaaa ctgttcttaa aacaaggatg 301 tgagacaagt gcgcaacggt tagcttggct tcccagggtt caaactgttc ttaaaacaag 361 gatgtgagac aagtggtttc ctgagttggt ttggtatcaa atgttctaat ctgagctctt 421 agtgttctat tttcctatgt tcttttggaa tctatccaag tcttatgtaa atgcttatgt 481 aaaccatgat ataaaagagt gctgattttt ttgagtaaac ttgcaac // LOCUS PFDSYN 85 bp ds-DNA SYN 14-AUG-1990 DEFINITION Bacteriophage fd synthetic G2 promoter region DNA. ACCESSION M24953 KEYWORDS promoter. SOURCE Synthetic DNA. ORGANISM Artificial gene Artificial sequences; Genes. REFERENCE 1 (bases 1 to 85) AUTHORS Ovchinnikov,Y.A., Efimov,V.A. and Chakhmakcheva,O.G. TITLE Synthesis of a polynucleotide corresponding to the promoter region of bacteriophage fd DNA JOURNAL FEBS Lett. 100, 341-346 (1979) STANDARD simple staff_entry FEATURES from to/span description mRNA < 1 14 G2 mRNA BASE COUNT 34 a 17 c 15 g 19 t ORIGIN 1 aaatcagtct ttaccctgtc tattatcgtc agaagcaaag cgaattgcat caaaaagatt 61 aagaggaagc ccgaaagact tcaaa // LOCUS PRMSPCA 200 bp ds-DNA BCT 14-AUG-1990 DEFINITION P.vulgaris spc r-protein operon L24 gene, 3' end, and L5 gene, 5' end. ACCESSION M36264 KEYWORDS r-protein. SOURCE P.vulgaris DNA. ORGANISM Proteus vulgaris Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Facultatively anaerobic rods; Enterobacteriaceae. REFERENCE 1 (bases 1 to 200) AUTHORS Mattheakis,L.C., Vu,L. and Nomura,M. TITLE Nucleotide sequence of the L24/L5 intercistronic region of Proteus vulgaris, Serratia marcescens and Salmonella typhimurium JOURNAL J. Mol. Biol. 
204, 326-329 (1988) STANDARD simple staff_entry REFERENCE 2 (sites) AUTHORS Cerretti,D.P., Mattheakis,L.C., Kearney,K.R., Vu,L. and Nomura,M. TITLE Translational regulation of the spc operon in Escherichia coli: Identification and structural analysis of the target site for S8 repressor protein JOURNAL J. Mol. Biol. 204, 309-329 (1988) STANDARD simple staff_entry FEATURES from to/span description pept < 1 57 r-protein L24 pept 71 > 200 r-protein L5 BASE COUNT 64 a 36 c 47 g 53 t ORIGIN 1 ttcgaagacg gcaaaaaagt tcgtttcttc aagtctaata aagaaactat caagtaattt 61 ggagtatacg atggcgaaac tgcatgatta ctacaaagac gaggtagtcc aaaaactgat 121 gtctcagttt ggttaccatt ctgtcatgca agtccctcgg gtcgagaaga tcaccctgaa 181 tatgggtgtt ggtgaagcga // LOCUS PSCTETR 1110 bp ds-DNA BCT 14-AUG-1990 DEFINITION Plasmid pSC101 tetracycline resistance element repressor protein (tetR) gene, complete cds. ACCESSION M36272 KEYWORDS tetracycline resistance element repressor protein. SOURCE Plasmid pSC101 DNA. ORGANISM Plasmid pSC101 Prokaryota; Bacteria. REFERENCE 1 (bases 1 to 1110) AUTHORS Brow,M.A.D., Pesin,R. and Sutcliffe,J.G. TITLE The tetracycline repressor protein of pSC101 JOURNAL Mol. Biol. Evol. 
2, 1-12 (1985) STANDARD simple staff_entry FEATURES from to/span description pept 42 701 tetracycline resistance element repressor protein (tetR) BASE COUNT 235 a 298 c 351 g 226 t ORIGIN 1 aagcttatcg atgataagct gtcaaacatg agaattcgcg aatgaacaag ctccaacgcg 61 aggccgtgat ccgaaccgcg ctcgaactgc ttaacgacgt gggcatggaa ggtctaacga 121 cgcgccgact ggctgagcgc ctcggggtgc aacagccagc gctctactgg catttcaaga 181 acaagcgtgc gttgctcgac gcacttgccg aagccatgct gacgataaat cacacgcatt 241 cgacgccaag ggatgacgac gactggcgtt cgttcctgaa gggcaatgca tgcagttttc 301 gacgggcgtt gctcgcttat cgcgatggcg cgcgtattca tgccgggacg cggccagccg 361 cgccgcagat ggaaaaagcc gacgcgcagc ttcgcttcct ttgcgatgct ggcttttcgg 421 caggtgacgc gacctatgcg ttgatggcaa tcagctactt caccgtcggc gctgttcttg 481 agcagcaagc tagcgaggca gacgccgagg agcggggcga agatcagttg accacctcag 541 cgtctacgat gccggcgcgc ctacagagcg cgatgaaaat cgtctacgaa ggcggtccgg 601 acgcggcatt cgagcgaggc ctggctctca tcatcggcgg tcttgaaaaa atgaggctca 661 ctacgaacga cattgaggtg ctgaagaatg ttgacgaatg acagggggcg gcaggtgcgg 721 agggcgcggt tgcttcgtca tatgaagcaa agtcacctag ctgaattaat gggtgtggat 781 caggcaaccg tgtcgcgctg ggagcggggc acccttgcat tgtcggatgg gaggtggtca 841 gcggttcttc aattgcttac cgggccttcc gattcatcgt acgacgctgc gctgaagcgt 901 ctggtgcaat cctccgccca caaagtccat ctggtagcga ccggacacat tgtttgctcg 961 cggcatctcc ggccaggcaa agggaattgc ggattgacct agccgaactc cttggtaaat 1021 cgctgcgtgt ttatgcgtcc ccgagatagt tgcggccgac tctgcgctta atgggctcgg 1081 ttggcatgag gggcggctgg ggtcactcga // LOCUS PTVCOMPL 359 bp ss-RNA VRL 14-AUG-1990 DEFINITION Potato spindle tuber viroid complete sequence. ACCESSION M36163 KEYWORDS . SOURCE Potato spindle tuber viroid RNA. ORGANISM Potato spindle tuber viroid Viridae; Nonclassified viruses. REFERENCE 1 (bases 1 to 359) AUTHORS Gross,H.J. and Riesner,D. TITLE Viroids: A class of subviral pathogens JOURNAL Angew Chem. Int. Ed. Engl. 
19, 231-243 (1980) STANDARD simple staff_entry BASE COUNT 73 a 108 c 102 g 76 t ORIGIN 1 cggaactaaa ctcgtggttc ctgtggttca cacctgacct cctgagcaga aaagaaaaaa 61 gaaggcggct cggaggagcg cttcagggat ccccggggaa acctggagcg aactggcaaa 121 aaaggacggg ggggagtgcc cagcggccga caggagtaat tcccgccgaa acagggtttt 181 cacccttcct ttcttcgggt gtccttcctc gcgcccgcag gaccacccct cgcccccttt 241 gcgctgtcgc ttcggctact acccggtgga aacaactgaa gctcccgaga accgcttttt 301 ctctatctta cttgcttcgg ggcgagggtg tttagccctt ggaaccgcag ttggttcct // LOCUS RATCBFB 1415 bp ss-mRNA ROD 14-AUG-1990 DEFINITION Rat CCAAT binding transcription factor-B subunit (CBF-B) mRNA, complete cds. ACCESSION M34238 KEYWORDS CCAAT binding transcription factor-B subunit; heteromeric mammalian transcription factor; nuclear protein; transcription activator. SOURCE Rat (strain Sprague-Dawley) adult liver, cDNA to mRNA, clone pCBF-B1. ORGANISM Rattus norvegicus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 1415) AUTHORS Maity,S.N., Vuorio,T. and de Crombrugghe,B. TITLE The B subunit of a heteromeric rat CCAAT-binding transcription factor shows a striking sequence identity with the yeast Hap2 transcription factor JOURNAL Proc. Natl. Acad. Sci. U.S.A. 87, 5378-5382 (1990) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by S.N.Maity, 11-MAY-1990, for release after publication. 
FEATURES from to/span description pept 171 1196 CCAAT binding transcription factor-B subunit (CBF-B) BASE COUNT 371 a 371 c 417 g 256 t ORIGIN 1 cagcggcagt ggcggcggcg cggcggctgg agcctctgat tgggtttcgg ggtccggtac 61 tggagccaat cagcgcgggc agcgaaccgg gggagcgagg cacggagtcc ctacctggca 121 gctgcctggg atctgtagag tgaagcttca ggacacaacg tggccgggcc atggagcagt 181 atacagcaaa tagcaatagt tccacagagc agatcgtggt gcaggctggc cagattcagc 241 agcagcagca gggtggtgtc actgctgtcc agctgcagac tgaggcccag gtggcatccg 301 cctcaggcca gcaagtccag accctccagg tagttcaggg gcagccatta atggtgcaag 361 tcagtggagg ccagctgatt acatcaactg gccaacccat catggtacag gctgtgcctg 421 gtggacaagg ccaaactatc atgcaagtac ctgtgtctgg aacacaaggt ttacagcaga 481 tacagttggt accccctgga cagatccaga tccagggcgg gcaggctgtg caggtgcaag 541 gccagcaggg acagacccag cagatcatca ttcagcagcc acagaccgcg gtcaccgctg 601 gccagactca gacacaacaa cagattgctg tccagggaca gcaagtggcc cagactgctg 661 aagggcagac tattgtctac cagccagtta atgcagatgg cacaatcctc cagcaaggca 721 tgatcaccat cccagcagcc agtttggcag gggcacagat cgttcagaca ggagccaata 781 ccaacacaac cagtagtgga caagggactg tcactgtgac actgccagtg gcagggaatg 841 tggtcaactc aggaggaatg gtcatgatgg taccaggggc tggctctgtg cctgctatcc 901 aaagaatccc tttacctggg gcagagatgc tggaagaaga gccgttgtat gtgaatgcca 961 aacagtatca ccgcatcctt aagaggagac aagcgagggc caagctagag gcagaaggga 1021 agatcccaaa ggaaagaagg aaatacctcc atgagtctcg gcaccgacac gccatggcac 1081 ggaagcgtgg ggaagggggc cgcttcttct ctccaaaaga aaaggacagt cctcacatgc 1141 aggatccaaa ccaagctgat gaagaagcca tgacacagat catccgagtt tcctaaccac 1201 aggagggtgg cagagcggat gagtcacagt cctctccact gttcctggaa atggactgac 1261 ttccagttgg gcactgacag tctcactctg ccctttctca ggacagaaac tacttagctc 1321 agtattacag ctgcagtgat ggctgacaaa ctgaagaggc aagcttttgt ctctcccttt 1381 cattgaggac ctgttcagac tgttggtgac attga // LOCUS RATRTG2 902 bp ss-mRNA ROD 14-AUG-1990 DEFINITION Rat thyroglobulin (rTg-2) mRNA, complete cds. ACCESSION M35965 KEYWORDS thyroglobulin. SOURCE Rat (strain Fisher) thyrocyte, cDNA to mRNA. 
ORGANISM Rattus norvegicus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 902) AUTHORS Graves,P.N. and Davies,T.F. TITLE A second thyroglobulin messenger RNA species (rTg-2) in rat thyrocytes JOURNAL Mol. Endocrinol. 4, 155-161 (1990) STANDARD simple staff_entry FEATURES from to/span description pept 1 732 thyroglobulin (rTg-2) mRNA < 1 902 thyroglobulin mRNA BASE COUNT 239 a 216 c 234 g 213 t ORIGIN 1 atgatgacct tggtcttgtg ggtctcgact ttgttgagct cagtctgcct ggtagcagcc 61 aacatctttg agtaccaagt ggatgcacag ccactccgcc cctgtgagct gcaaagggag 121 aaggcctttc tgaagcagga tgaatatgtt ccacagtgct ctgaagatgg aagtttccag 181 acagttcagt gccaaaatga tggccaatct tgctggtgtg tggattctga tggcacggaa 241 gtacctggca gcaggcagct gggaaggcca acagcatgtc tgtccttctg ccagctacac 301 aagcaacgaa tcctgctgag cagctacatt aacagcacag atgctctcta ccttcctcag 361 tgccaggatt cagggaacta tgcaccagtg cagtgcgacc tgcagcaagt gcaatgctgg 421 tgtgtagaca cagaggggat ggaggtgtat ggcacccgcc agcaggggag gccaacacgg 481 tgtccaagga gctgtgagat aagaagccgc cgtctcctcc atggagtagg agacaaatcg 541 ccaccccagt gcgatgcaga tggggagttt atgcctgtcc agtgcaagtt tgtcaacacc 601 acagacatga tgatttttga tctgattcac aactacaaca gcaacaaacc tcacctcaaa 661 tgccctagta catctgttat actgtggcaa gagtcaagag cacctttaac atgtgctgaa 721 gaatcatcct aattggaata tagaagacta tggggaaatc ctgagaggca gatcgtctag 781 tcacctgcaa gaccacctat ggtggccaat ttgacttgtg tgtattgtta tgctgtgtat 841 agctttctgc caaagactgc ctgccctggc ccaggagaga ataaacacac tgttaattta 901 tt // LOCUS SMASPCA 201 bp ds-DNA BCT 14-AUG-1990 DEFINITION S.marcescens spc operon r-protein L24 gene, 3' end, and L5 gene, 5' end. ACCESSION M36265 KEYWORDS r-protein. SOURCE S.marcescens DNA. ORGANISM Serratia marcescens Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Facultatively anaerobic rods; Enterobacteriaceae. REFERENCE 1 (bases 1 to 201) AUTHORS Mattheakis,L.C., Vu,L. and Nomura,M. 
TITLE Nucleotide sequence of the L24/L5 intercistronic region of Proteus vulgaris, Serratia marcescens and Salmonella typhimurium JOURNAL J. Mol. Biol. 204, 326-329 (1988) STANDARD simple staff_entry REFERENCE 2 (sites) AUTHORS Cerretti,D.P., Mattheakis,L.C., Kearney,K.R., Vu,L. and Nomura,M. TITLE Translational regulation of the spc operon in Escherichia coli: Identification and structural analysis of the target site for S8 repressor protein JOURNAL J. Mol. Biol. 204, 309-329 (1988) STANDARD simple staff_entry FEATURES from to/span description pept < 1 57 r-protein L24 pept 72 > 201 r-protein L5 BASE COUNT 65 a 40 c 46 g 50 t ORIGIN 1 ttcgaagacg ctaaaaaagt ccgtttcttc aagtctaaca gcgaaactat caagtaattt 61 ggagtaatac gatggcgaaa ctgcatgatt actacaaaga cgaggtagtc aaacaactga 121 tgtctcagtt tgattacaac tctgtcatgc aagtccctcg ggtcgagaag atcaccctga 181 acatgggtgt tggtgaagcg a // LOCUS STYSPCA 201 bp ds-DNA BCT 14-AUG-1990 DEFINITION S.typhimurium spc operon r-protein L24 gene, 3' end, and L5 gene, 5' end. ACCESSION M36266 KEYWORDS r-protein. SOURCE S.typhimurium DNA. ORGANISM Salmonella typhimurium Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Facultatively anaerobic rods; Enterobacteriaceae. REFERENCE 1 (bases 1 to 201) AUTHORS Mattheakis,L.C., Vu,L. and Nomura,M. TITLE Nucleotide sequence of the L24/L5 intercistronic region of Proteus vulgaris, Serratia marcescens and Salmonella typhimurium JOURNAL J. Mol. Biol. 204, 326-329 (1988) STANDARD simple staff_entry REFERENCE 2 (sites) AUTHORS Cerretti,D.P., Mattheakis,L.C., Kearney,K.R., Vu,L. and Nomura,M. TITLE Translational regulation of the spc operon in Escherichia coli: Identification and structural analysis of the target site for S8 repressor protein JOURNAL J. Mol. Biol. 
204, 309-329 (1988) STANDARD simple staff_entry FEATURES from to/span description pept < 1 57 r-protein L24 pept 72 > 201 r-protein L5 BASE COUNT 67 a 38 c 46 g 50 t ORIGIN 1 ttcgaagacg gtaaaaaagt ccgtttcttc aagtctaaca gcgaaactat caagtaattt 61 ggagtagtac gatggcgaaa ctgcatgatt actacaaaga cgaagtagtt aacaaactca 121 tgactgagtt taactacaat tctgtcatgc aagtccctcg ggtcgagaag atcaccctga 181 acatgggtgt tggtgaagcg a // LOCUS GCOADDA 1771 bp ds-DNA PLN 14-AUG-1990 DEFINITION G.verrucosa 18S ribosomal RNA gene. ACCESSION M33638 KEYWORDS 18S ribosomal RNA. SOURCE G.verrucosa (isolate Norway) male tetrasporangial DNA. ORGANISM Gracilaria verrucosa Eukaryota; Plantae; Thallobionta; Rhodophycota; Rhodophyceae; Florideophycideae; Gracilariaceae; Gracilariaceae. REFERENCE 1 (bases 1 to 1771) AUTHORS Bird,C.J., Rice,E.L., Murphy,C.A., Liu,Q.Y. and Ragan,M.A. TITLE Nucleotide sequences of the 18S ribosomal RNA gene from the red algae Gracilaria tikvahiae McLachlan, Gracilaria verrucosa (Hudson) Papenfuss and Gracilariopsis sp JOURNAL Nucleic Acids Res. 18, 4023-4024 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by M.A.Ragan, 08-APR-1990. Atlantic Res. 
Lab., National Research Council of Canada, 1411 Oxford St., Halifax, Nova Scotia, CANADA B3H 3Z1 BASE COUNT 454 a 366 c 498 g 453 t ORIGIN 1 caacctggtt gatcctgcca gtggtatatg cttgtttaaa ggactaagcc atgcaagtgc 61 aagtatgagt gaattgtaca acgaaactgc gaatggctcg gtaaaacagc tataatttct 121 tcggtgctaa atactactcg gatacccgta gtaattctag agctaatacg tgcctccaaa 181 acgacgcaag tcgtggtaca aattagagat acaagccaac ttgttggtga ttctagattt 241 tttttctgat cgcaattatt gcgacgcacc gttcaaattt ctgacctatc aactttggat 301 ggtaaggtat tggcttacca tggttgtgac gggtaacgga ccgtgggtgc gggattccgg 361 agagggagcc tgagagacgg ctaccacatc caaggaaggc agcaggcgcg caacttaccc 421 aatccggaca ccgggaggta gtgacaagaa atatcaatag agggcccgat gggttttcta 481 attggaatga gaacaaggta aacagcttat cgaggagcca gcagagggca agtctggtgc 541 cagcagccgc ggtaattcca gctctgtaag cgtataccaa agttgttgca gttaaaacgc 601 tcgtagtcgg attttggtgt ctgacttggg tcgtcctcgc ggacgctctc aggttgggcg 661 cctttgtgga tgggagctag gtggtgctta attggatcac ctagctgccg ccaccgttta 721 ctgtgaaaaa aatagagtgt tcaaagcagg cgattgccct gaatacatta gcatggaata 781 atagaatagg acccggtcct attttgttgg tttgcttgaa tcgggtaatg attaagaggg 841 acggttgggg gcattcgtat tccgacgtca gaggtgaaat tcttggattg tcggaagacg 901 aacagctgcg aaagcgtctg ccaaggacgt tttcattgat caagaacgaa agtaagggga 961 tcgaagacga tcagataccg tcgtagtctt tactataaac gatgaggact ggagatcgga 1021 taagactgat atatggctta tccggcatcc ttcgagaaat caaagtgttt gctttctggg 1081 gggagtatgg tcgcaaggct gaaacttaaa ggaattgacg gaagggcatc accgggtgtg 1141 gagcctgcgg cttaatttga ctcaacacgg gaaaacttac caggtcagga catagtaagg 1201 attgacagat tgagagctct ttcttgattc tatggttggt ggtgcatggc cgttcttagt 1261 tggtggagtg atctgtctgg ttaattccgt taacgagcga gacctgggcg tgctagctag 1321 gcgccgttac tatttttggt agcgaggctt gccttcctag acggactgtg ggcgtctagc 1381 ccacggaagc tccaggcaat aacaggtctg agatgccctt agatgtcctg ggccgcacgc 1441 gtgctacact gaacgggtca acgagttagg atatgcgaaa gcatttccca atctctaaat 1501 ccgttcgtga tggggatcga cggttgcaat tttccgtcgt caacgaggaa taccttgtaa 1561 gcgcgggtca tcatcccgcg 
ctgaatacgt ccctgccctt tgtacacacc gcccgtcgct 1621 cctaccgatt gagtggtccg gtgaggcctt gggagagcta gatgaactga ttattcagat 1681 cttttggctt gaacttggtc aaaccttatc acttagagga aggagaagtc gtaacaaggt 1741 ttccgtaggt gaacctgcag aaggatcaga a // LOCUS GRO18SAA 1782 bp ds-DNA PLN 14-AUG-1990 DEFINITION Gracilariopsis sp. 18S ribosomal RNA gene. ACCESSION M33639 KEYWORDS 18S ribosomal RNA. SOURCE Gracilariopsis sp. (isolate England-1) diploid tetrasporophyte DNA. ORGANISM Gracilariopsis sp. Eukaryota; Plantae; Thallobionta; Rhodophycota; Rhodophyceae; Florideophycideae; Gracilariaceae; Gracilariaceae. REFERENCE 1 (bases 1 to 1782) AUTHORS Bird,C.J., Rice,E.L., Murphy,C.A., Liu,Q.Y. and Ragan,M.A. TITLE Nucleotide sequences of the 18S ribosomal RNA gene from the red algae Gracilaria tikvahiae McLachlan, Gracilaria verrucosa (Hudson) Papenfuss and Gracilariopsis sp JOURNAL Nucleic Acids Res. 18, 4023-4024 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by M.A.Ragan, 08-APR-1990. 
BASE COUNT 443 a 372 c 515 g 452 t ORIGIN 1 caacctggtt gatcctgcca gtggtatatg cttgtctata ggactaagcc atgcaagtgc 61 aagtatgagt gtattataca acgaaactgc gaatggctct gtaaaacagc tataatttct 121 tcggtgctaa atactactcg gataaccgta gtaattctag agctaatacg tgcctacaca 181 gcgacgcaag tcgtggtaca aattagatac acaagccagc ttttttggtg attctagatt 241 ttttttctga tcgcacattg ttgcgacgca ccgttcaaat ttctgaccta tcaactttgg 301 atggtaaggt aatggcttac catggttgtg acgggtaacg gaccgtgggt gcgggactcc 361 ggagagggag cctgagagac ggctaccaca tccaaggaag gcagcaggcg cgcaacttac 421 ccaatccgga caccgggagg tagtgacaag aaatagcaat agagggcccg atgggttttc 481 taattggaat gagaacaaga taaacagctt atcgaggagc cagcagaggg caagtctggt 541 gccagcagcc gcggtaattc cagctctgta agcgtatacc aaagttgttg cagttaaaac 601 gctcgtagtc gaattttggc gtgtgatttg ggtcgttctc gcggacgcgc tcaggttgtg 661 cgcctttgtg gagagggggt gtggtggtgc ttgagtgcgc tgccatgctg ccgccaccgt 721 ttactgtgaa aaaattagag tgttcaaagc aggcgattgc cctgaataca ttagcatgga 781 ataatagaat aggacccggt tctattttgt tggtttgttt gaatcaggta atgattaaca 841 gggacggttg ggggcattcg tattccggcg ctagaggtga aattcttgga ttgtcggaag 901 acgaacagct gcgaaagcgt ctgccaagga cgttttcatt gatcaagaac gaaagtaagg 961 ggatcgaaga cgatcagata ccgtcgtagt ctttactata aacgatgagg actggagatc 1021 ggacaagact gatttatggc ttgttcggca tccttcgaga aatcaaagtg tttgctttct 1081 ggggggagta tggtcgcaag gctgaaactt aaaggaattg acggaagggc atcaccgggt 1141 gtggagcctg cggcttaatt tgactcaaca cgggaaaact taccaggtca ggacatagta 1201 aggattgaca gattgagagc tctttcttga ttctatggtt ggtggtgcat ggccgttctt 1261 agttggtgga gtgatctgtc tggttaattc cgttaacgag cgagacctgg gcgtgctagc 1321 taggcgccgt taccaattga gtttggtagc gaggcggtgc cttcctagac ggactgtggg 1381 cgtctagccc acggaagctc caggcaataa caggtctgag atgcccttag atgtcctggg 1441 ccgcacgcgt gctacactga acggcccaac gagtggagga tgtgcgagag catttccgaa 1501 tctccaaatc cgttcgtgat ggggatcgac ggttgcaatt ttccgtcgtc aacgaggaat 1561 accttgtaag cgcgggtcat catcccgcgc tgaatacgtc cctgcccttt gtacacaccg 1621 cccgtcgctc ctaccgattg agtggtccgg tgaggccttg ggagggctgg 
atgcgctgtt 1681 tttatgcaga cattttggcc tgaacctggt caaaccttat cacttagagg aaggagaagt 1741 cgtaacaagg tttccgtagg tgaacctgca gaaggatcag aa // LOCUS REO1SIG1 1463 bp ds-RNA VRL 14-AUG-1990 DEFINITION Reovirus type 1 sigma-1 protein (S1) gene, complete cds. ACCESSION M35963 KEYWORDS cell-attachment protein; hemagglutinin; sigma-1 protein. SOURCE Reovirus type 1 (strain Lang) RNA. ORGANISM Reovirus type 1 Viridae; ds-RNA nonenveloped viruses; Reoviridae. REFERENCE 1 (bases 1 to 1463) AUTHORS Nibert,M.L., Dermody,T.S. and Fields,B.N. TITLE Structure of the reovirus cell-attachment protein: A model for the domain organization of sigma-1 JOURNAL J. Virol. 64, 2976-2989 (1990) STANDARD simple staff_entry FEATURES from to/span description pept 14 1426 sigma-1 protein BASE COUNT 426 a 291 c 370 g 376 t ORIGIN 1 gctattcgcg cctatggatg catctctcat tacagagata cggaaaatag tactccaact 61 atctgtatca agcaatggct cccagtcaaa agaaatcgag gaaatcaaga aacaagtcca 121 ggtcaacgtt gatgatatca gggctgccaa tattaaactc gacggacttg gaagacagat 181 tgctgacatc agcaatagca tctcaaccat tgagtcaaga ttgggtgaga tggataatcg 241 acttgtgggt atctcgagtc aggtcacgca attatctaac tcagttagcc agaacactca 301 gagcatatcc tcattgggtg acagaatcaa tgctgtcgaa ccacgagttg acagtctgga 361 tacggtcacg tctaatctca ctggacgaac atccactttg gaggcagatg ttggaagctt 421 acggacagaa ctagcagcgc taacaacacg ggtgacaact gaggttacaa ggttagatgg 481 tctaatcaat agtggccaga attcgattgg tgagctatcc acaagactat ccaatgtgga 541 gacgtctatg gtgacgacgg ctggacgggg actgcagaaa aacggaaaca ccttgaacgt 601 cattgtaggt aatggaatgt ggtttaatag ttctaatcaa ttgcagctcg acctttcggg 661 gcaatcaaaa ggggtgggat ttgtcggcac aggaatggtg gttaagattg atactaatta 721 ttttgcttac aatagtaatg gagagattac attggtgagt caaatcaatg aattgccatc 781 gcgcgtatca acactggaat cagcgaaaat cgattcagtt ttacctccat taaccgtacg 841 cgaagcgagc ggcgtacgta ccctgagctt tggttatgat acgagcgatt ttacaatcat 901 caactccgta ctgtcgttac ggtcacgttt gactcttccg acatacaggt accctctgga 961 gctcgacaca gcaaataata gagtgcaggt ggcagatcgt tttggcatgc gcacgggtac 1021 
ttggacggga caattgcaat atcagcaccc acaattgagt tggagagcaa atgtcacttt 1081 gaatttgatg aaggtggatg attggttggt gttgagcttt tctcagatga cgactaactc 1141 aataatggca gatgggaaat ttgtgattaa ttttgtgtct gggttatctt ctggatggca 1201 gacgggggat actgaaccat cgtcaactat tgatccatgg tctacgacat ttgccgcggt 1261 ccaatttcta aataacggtc aacgcattga tgcgtttagg atcatgggag tatcggaatg 1321 gacggatgga gaattagaga ttaagaatta tggtggcaca tacaccggtc atactcaagt 1381 atattgggct ccgtggacga tcatgtatcc atgcaatgtg aggtgaatct agcgcgaacc 1441 ctcggcacaa ggggtcaatc atc // LOCUS REO2SIG1 1440 bp ds-RNA VRL 14-AUG-1990 DEFINITION Reovirus type 2 sigma-1-s protein (S1) gene, complete cds. ACCESSION M35964 KEYWORDS cell-attachment protein; hemagglutinin; sigma-1-s protein. SOURCE Reovirus type 2 (strain Jones) RNA. ORGANISM Reovirus type 2 Viridae; ds-RNA nonenveloped viruses; Reoviridae. REFERENCE 1 (bases 1 to 1440) AUTHORS Nibert,M.L., Dermody,T.S. and Fields,B.N. TITLE Structure of the reovirus cell-attachment protein: A model for the domain organization of sigma-1 JOURNAL J. Virol. 
64, 2976-2989 (1990) STANDARD simple staff_entry FEATURES from to/span description pept 14 1402 sigma-1-s protein BASE COUNT 384 a 314 c 383 g 359 t ORIGIN 1 gctattcgca ctcatgtcgg atctagtgca gctcataaga agggagatct tactgttaac 61 tgggaatgga gaatcagcca actcgaaaca cgagatcgag gaaattaaga aacaaattaa 121 agacatctct gctgatgtca acaggatcag taacatcgtt gattcaatcc aaggacaact 181 gggtggatta tctgtacgcg tgtcagccat tgaatcggga gttagtgaga acggcaatcg 241 aattgataga ctcgagcgag atgtctccgg catatcggct agcgttagcg gaatcgattc 301 gcgtttatcc gagctgggtg accgagtcaa tgttgcagaa cagcgaattg gccagttgga 361 tacagtcacg gataatctcc ttgagcgagc atcaagactg gaaactgaag tatcagccat 421 tactaatgac cttggatcat tgaatacgag ggtgacgact gaattgaacg atgtccgcca 481 aactattgct gcgatagaca cgcgtctcac gacactggag accgatgccg tgacgtcggt 541 tggtcaaggg cttcagaaga ctgggaactc gattaaggtt attgtgggta cggggatgtg 601 gttcgaccgc aataatgttc tgcagttatt cgtatcgaac cagcagaaag ggttgggatt 661 catagacaat ggaatggtag tgaaaataga tacccagtat ttcagcttcg atagcaatgg 721 caacataact ctgaacaaca acataagtgg tctgccggcg cgaacaggtt ccctcgaggc 781 atctcgtatc gatgtggtag cgccaccgct tgtgatacag tctactggta gcactcggct 841 actgcgtctc atgtacgagg ctgtggactt cgtggttact aacaacgttc tcacactgag 901 aaatcgatcg gtcacgccaa cattcaagtt tcctctggag ttgaatagtg ctgataactc 961 agtgagcatt catagaaatt accgcattag acttgggcaa tggtcaggtc aattggaata 1021 tcacacgccg agtttgcgtt ggaatgctcc cgtcacggtt aatttgatgc gagtagacga 1081 ttggctcatt ttgagtttta ctcggttttc gacgagcggc atcttagcgt caggaaagtt 1141 tgtattgaac ttcgtaactg gtttgtctcc agggtgggcg actgggagta ccgagccctc 1201 gacaactact aacccactgt caacgacgtt tgctgcaatt cagttcatca atgggtcatc 1261 tcgcgtagac gcctttagaa tcttgggagt cgcagagtgg aatgccgggg aactagagat 1321 cacgaattat ggcggaacat atacagcgca taccaatgtc gactgggcgc cgatgaccat 1381 tatgtaccca tgtctgggct gaggatccgg gtgctccact cggcacagtg gcgactcatc // LOCUS MARCMYC 100 bp ds-DNA ROD 14-AUG-1990 DEFINITION Woodchuck mutant c-myc gene, intron A and breakpoint region. 
ACCESSION M30385 KEYWORDS c-myc oncogene. SOURCE Woodchuck (individual W64, chronically infected with woodchuck hepatitis virus [WHV]) hepatocellular carcinoma (HCC) DNA. ORGANISM Marmota monax Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Sciuromorpha; Sciuridae; Sciurinae; Marmotini. REFERENCE 1 (bases 1 to 100) AUTHORS Moeroey,T., Marchio,A., Etiemble,J., Trepo,C., Tiollais,P. and Buendia,M.-A. TITLE Rearrangement and enhanced expression of c-myc in hepatocellular carcinoma of hepatitis virus infected woodchucks JOURNAL Nature 324, 276-279 (1986) STANDARD simple staff_review FEATURES from to/span description IVS < 1 > 100 c-myc intron A recomb 48 49 cellular DNA of unknown function end/c-myc DNA start BASE COUNT 24 a 26 c 29 g 21 t ORIGIN 3 bp upstream of BglII site. 1 ggagatctca ggaagtaagg cactcagaga gacagtgggc gattcgtagg attccccacg 61 gcgattccaa ctcgcccccg ctctttttaa gaagttggcg // LOCUS MARWHV1 120 bp ds-DNA ROD 14-AUG-1990 DEFINITION Woodchuck DNA/woodchuck hepatitis virus DNA, 5' junction. ACCESSION M23563 KEYWORDS c-myc oncogene. SEGMENT 1 of 2 SOURCE Woodchuck (individual W74) hepatocellular carcinoma DNA. ORGANISM Marmota monax Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Sciuromorpha; Sciuridae; Sciurinae; Marmotini. REFERENCE 1 (bases 1 to 120) AUTHORS Hsu,T.-y., Moeroey,T., Etiemble,J., Louise,A., Trepo,C., Tiollais,P. and Buendia,M.-A. 
TITLE Activation of c-myc by woodchuck hepatitis virus insertion in hepatocellular carcinoma JOURNAL Cell 55, 627-635 (1988) STANDARD simple staff_entry FEATURES from to/span description pept < 1 6 c-myc oncogene recomb 92 95 woodchuck DNA end/virus DNA start BASE COUNT 41 a 23 c 25 g 31 t ORIGIN 1 gcctaagttg aactgttgga gggaggaact ggaatctcct aaattctcac tccttactaa 61 gggaaagcaa ggaaaaaggt tccttctgac tgaaggcata aaaaaccatt atcctaattt // LOCUS MARWHV2 40 bp ds-DNA ROD 14-AUG-1990 DEFINITION Woodchuck DNA/woodchuck hepatitis virus DNA, 3' junction. ACCESSION M23564 KEYWORDS c-myc oncogene. SEGMENT 2 of 2 SOURCE Woodchuck (individual W74) hepatocellular carcinoma DNA. ORGANISM Marmota monax Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Sciuromorpha; Sciuridae; Sciurinae; Marmotini. REFERENCE 1 (bases 1 to 40) AUTHORS Hsu,T.-y., Moeroey,T., Etiemble,J., Louise,A., Trepo,C., Tiollais,P. and Buendia,M.-A. TITLE Activation of c-myc by woodchuck hepatitis virus insertion in hepatocellular carcinoma JOURNAL Cell 55, 627-635 (1988) STANDARD simple staff_entry FEATURES from to/span description recomb 15 16 virus DNA end/woodchuck DNA start BASE COUNT 5 a 14 c 7 g 14 t ORIGIN About 900 bp after segment 1. 1 agggcagcat tgatctacgc ttcctgcttc cctctttctc // LOCUS BSUPAIA 1367 bp ds-DNA BCT 14-AUG-1990 DEFINITION B.subtilis negative regulator of protease synthase and sporulation (pai) genes, complete cds. ACCESSION M36471 KEYWORDS regulatory protein. SOURCE B.subtilis (strain DB104) DNA, clone pPAI121. ORGANISM Bacillus subtilis Prokaryota; Bacteria; Firmicutes; Endospore-forming rods and cocci; Bacillaceae. REFERENCE 1 (bases 1 to 1367) AUTHORS Honjo,M., Nakayama,A., Fukazawa,K., Kawamura,K., Ando,K., Hori,M. and Furutani,Y. TITLE A novel Bacillus subtilis gene involved in negative control of sporulation and degradative-enzyme production JOURNAL J. Bacteriol. 
172, 1783-1790 (1990) STANDARD simple staff_review FEATURES from to/span description pept 100 618 negative regulator pai 1 pept 640 1263 negative regulator pai 2 BASE COUNT 504 a 205 c 283 g 375 t ORIGIN 1 tttttggaat ggtaacaagg tcagtatgtg ttccgtgttt aaatatataa aagggacaca 61 agacaattga attttaaata aaggggctga atccgaacaa tgagtgtaaa aatgaaaaaa 121 tgcagccggg aagatttaca aacacttcaa caattgagta ttgaaacatt caatgacatt 181 tttaaagaac agaactcacc tgaaaatatg aaagcctatt tagaaagcgc atttaacact 241 gagcagctgg aaaaagagtt atctaatatg tcttcgcaat tcttttttat ttactttgat 301 catgaaatcg ctggatatgt aaaggtcaat atcgatgatg ctcagtctga agaaatgggt 361 gctgaatcac ttgaaatcga gagaatttat ataaagaaca gctttcaaaa acatgggctt 421 ggcaaacatc tgctgaataa agcgatagaa attgcgctgg aacgtaataa aaagaacatt 481 tggctaggtg tgtgggaaaa aaatgaaaat gccattgcct tttataagaa aatggggttt 541 gttcagaccg gcgcccactc attttatatg ggtgatgaag aacaaacgga tttaatcatg 601 gctaaaacac tcatataata tttttcgaag gggatgatga tgtatattcc aaaatatttt 661 aaagtcacaa atgctgaaga aatttggaat tttgtacaag aaaactcctt tggtacggtt 721 gtcacaacag aacaaggaaa accaattgcc acacatttgc cgttaggatt caataaaaaa 781 gacgatcatt actatattac ggggcatttc gcttatggaa atcctcagtg gagaacgttt 841 gaagcctgtg aagatgtgct tgtgatgttt cagggaccgc acgcttatat ttcttcttcc 901 tggtattcgc gcgaaaatgt tccgacgtgg aattatcagg ctgtccacat gtatggaaaa 961 gcaagtatgt tagagaaaga tgaattagca gaagaattaa caatcatgtt ggagaaatac 1021 gaaaaacatc gtgacaatcc tgtgttatgg gataagcttt ctccaaaact cttagaaagt 1081 gaattgaaag gtattgttgg atttaagatt aaggtggaag acattcaagc ggcatataaa 1141 ttaagccaga accgaaatga aacggattat atgaacgtca ttgaacagtt gcaaaatgaa 1201 gaaaatccaa atgcaaaaca aatggcagaa ctgatggaag ataaactcaa aaagcaaata 1261 taagcttact ttgctgaaga agtagcagaa aacgccaact caagcggaga accctggcac 1321 ttggcgtttt tttatgatta atatttatcc cattaaaaat ggaattc // LOCUS HUMA2MGL 2041 bp ss-mRNA PRI 14-AUG-1990 DEFINITION Human alpha-2-macroglobulin mRNA, 3' end. ACCESSION M36501 KEYWORDS alpha-2-macroglobulin. 
SOURCE Human liver, cDNA to mRNA, clone pha2ml. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 2041) AUTHORS Bell,G.I., Rall,L.B., Sanchez-Pescador,R., Merryweather,J.P., Scott,J., Eddy,R.L. and Shows,T.B. TITLE Human alpha-2-macroglobulin gene is located on chromosome 12 JOURNAL Somat. Cell Mol. Genet. 11, 285-289 (1985) STANDARD simple staff_review FEATURES from to/span description pept < 1 1932 alpha-2-macroglobulin (AA at 1) /hgml_locus_uid="LH0012N" /nomgen="A2M" /map="12p13.3-p12.3" BASE COUNT 561 a 528 c 481 g 471 t ORIGIN 1 cccgccttcc tagctgtccc agtggagaag gaacaagcgc ctcactgcat ctgtgcaaac 61 gggcggcaaa ctgtgtcctg ggcagtaacc ccaaagtcat taggaaatgt gaatttcact 121 gtgagcgcag aggcactaga gtctcaagag ctgtgtggga ctgaggtgcc ttcagttcct 181 gaacacggaa ggaaagacac agtcatcaag cctctgttgg ttgaacctga aggactagag 241 aaggaaacaa cattcaactc cctactttgt ccatcaggtg gtgaggtttc tgaagaatta 301 tccctgaaac tgccaccaaa tgtggtagaa gaatctgccc gagcttctgt ctcagttttg 361 ggagacatat taggctctgc catgcaaaac acacaaaatc ttctccagat gccctatggc 421 tgtggagagc agaatatggt cctctttgct cctaacatct atgtactgga ttatctaaat 481 gaaacacagc agcttactcc agagatcaag tccaaggcca ttggctatct caacactggt 541 taccagagac agttgaacta caaacactat gatggctcct acagcacctt tggggagcga 601 tatggcagga accagggcaa cacctggctc acagcctttg ttctgaagac ttttgcccaa 661 gctcgagcct acatcttcat cgatgaagca cacattaccc aagccctcat atggctctcc 721 cagaggcaga aggacaatgg ctgtttcagg agctctgggt cactgctcaa caatgccata 781 aagggaggag tagaagatga agtgaccctc tccgcctata tcaccatcgc ccttctggag 841 attcctctca cagtcactca ccctgttgtc cgcaatgccc tgttttgcct ggagtcagcc 901 tggaagacag cacaagaagg ggaccatggc agccatgtat ataccaaaga cctgctggcc 961 tatgcttttg ccctggcagg taaccaggac aagaggaagg aagtactcaa gtcacttaat 1021 gaggaagctg tgaagaaaga caactctgtc cattgggagc gccctcagaa acccaaggca 1081 ccagtggggg atttttacga accccaggct ccctctgctg aggtggagat gacatcctat 1141 
gtgctcctcg cttatctcac ggcccagcca gccccaacct cggaggacct gacctctgca 1201 accaacatcg tgaagtggat cacgaagcag cagaatgccc agggcggttt ctcctccacc 1261 caggacacag tggtggctct ccatgctctg tccaaatatg gagcagccac atttaccagg 1321 actgggaagg ctgcacaggt gactatccag tcttcaggga cattttccag caaattccaa 1381 gtggacaaca acaaccgcct gttactgcag caggtctcat tgccagagct gcctggggaa 1441 tacagcatga aagtgacagg agaaggatgt gtctacctcc agacatcctt gaaatacaat 1501 attctcccag aaaaggaaga gttccccttt gctttaggag tgcagactct gcctcaaact 1561 tgtgatgaac ccaaagccca caccagcttc caaatctccc taagtgtcag ttacacaggg 1621 agccgctctg cctccaacat ggcgatcgtt gatgtgaaga tggtctctgg cttcattccc 1681 ctgaagccaa cagtgaaaat gcttgaaaga tctaaccatg tgagccggac agaagtcagc 1741 agcaaccatg tcttgattta ccttgataag gtgtcaaatc agacactgag cttgttcttc 1801 acggttctgc aagatgtccc agtaagagat ctgaaaccag ccatagtgaa agtctatgat 1861 tactacgaga cggatgagtt tgcaattgct gagtacaatg ctccttgcag caaagatctt 1921 ggaaatgctt gaagaccaca aggctgaaaa gtgctttgct ggagtcctgt tctcagagct 1981 ccacagaaga cacgtgtttt tgtatcttta aagacttgat gaataaacac tttttctggt 2041 c // LOCUS HUMPPA1CO 1500 bp ss-mRNA PRI 14-AUG-1990 DEFINITION Human alpha-1 collagen type I mRNA, 5' end. ACCESSION M36546 KEYWORDS alpha-1 collagen type I. SOURCE Human, cDNA to mRNA, clone pHUCI. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 1500) AUTHORS Tromp,G., Kuivaniemi,H., Stacey,A., Shikata,H., Baldwin,C.T., Jaenisch,R. and Prockop,D.J. TITLE Structure of a full-length cDNA clone for the prepro-alpha1(I) chain of human type I procollagen JOURNAL Biochem. J. 
253, 919-922 (1988) STANDARD simple staff_review FEATURES from to/span description pept 84 > 1500 alpha-1 collagen type I /hgml_locus_uid="LG0047H" /nomgen="COL1A1" /map="17q21.3-q22" sigp 84 149 alpha-1 collagen type I signal peptide matp 567 > 1500 alpha-1 collagen type I matp 618 > 1500 alpha-1 collagen type I alpha chain BASE COUNT 257 a 488 c 488 g 267 t ORIGIN 1 ggcacgcgga gtgtgaggcc acgcatgagc ggacgctaac cccctcccca gccacaaaga 61 gtctacatgt ctagggtcta gacatgttca gctttgtgga cctccggctc ctgctcctct 121 tagcggccac cgccctcctg acgcacggcc aagaggaagg ccaagtcgag ggccaagacg 181 aagacatccc accaatcacc tgcgtacaga acggcctcag gtaccatgac cgagacgtgt 241 ggaaacccga gccctgccgg atctgcgtct gcgacaacgg caaggtgttg tgcgatgacg 301 tgatctgtga cgagaccaag aactgccccg gcgccgaagt ccccgagggc gagtgctgtc 361 ccgtctgccc cgacggctca gagtcaccca ccgaccaaga aaccaccggc gtcgagggac 421 ccaagggaga cactggcccc cgaggcccaa ggggacccgc aggcccccct ggccgagatg 481 gcatccctgg acagcctgga cttcccggac cccccggacc ccccggacct cccggacccc 541 ctggcctcgg aggaaacttt gctccccagc tgtcttatgg ctatgatgag aaatcaaccg 601 gaggaatttc cgtgcctggc cccatgggtc cctctggtcc tcgtggtctc cctggccccc 661 ctggtgcacc tggtccccaa ggcttccaag gtccccctgg tgagcctggc gagcctggag 721 cttcaggtcc catgggtccc cgaggtcccc caggtccccc tggaaagaat ggagatgatg 781 gggaagctgg aaaacctggt cgtcctggtg agcgtgggcc tcctgggcct cagggtgctc 841 gaggattgcc cggaacagct ggcctccctg gaatgaaggg acacagaggt ttcagtggtt 901 tggatggtgc caagggagat gctggtcctg ctggtcctaa gggtgagcct ggcagccctg 961 gtgaaaatgg agctcctggt cagatgggcc cccgtggcct gcctggtgag agaggtcgcc 1021 ctggagcccc tggccctgct ggtgctcgtg gaaatgatgg tgctactggt gctgccgggc 1081 cccctggtcc caccggcccc gctggtcctc ctggcttccc tggtgctgtt ggtgctaagg 1141 gtgaagctgg tccccaaggg ccccgaggct ctgaaggtcc ccagggtgtg cgtggtgagc 1201 ctggcccccc tggccctgct ggtgctgctg gccctgctgg aaaccctggt gctgatggac 1261 agcctggtgc taaaggtgcc aatggtgctc ctggtattgc tggtgctcct ggcttccctg 1321 gtgcccgagg cccctctgga ccccagggcc ccggcggccc tcctggtccc aagggtaaca 1381 gcggtgaacc 
tggtgctcct ggcagcaaag gagacactgg tgctaaggga gagcctggcc 1441 ctgttggtgt tcaaggaccc cctggccctg ctggagagga aggaaagcga ggagctcgag // LOCUS MACTRIIS1 280 bp ds-DNA PRI 14-AUG-1990 DEFINITION M.mulatta triosephosphate isomerase gene, exon 1. ACCESSION M37561 M35724 KEYWORDS triosephosphate isomerase. SEGMENT 1 of 7 SOURCE M.mulatta DNA. ORGANISM Macaca mulatta Unclassified. REFERENCE 1 (bases 1 to 280) AUTHORS Old,S.E. and Mohrenweiser,H.W. TITLE Nucleotide sequence of the triosephosphate isomerase gene from Macaca mulatta JOURNAL Nucleic Acids Res. 16, 9055-9055 (1988) STANDARD simple staff_review FEATURES from to/span description pept 156 + 270 triosephosphate isomerase, exon 1 (E.C. 5.3.1.1) IVS 271 > 280 triosephosphate isomerase intron A signal 46 49 CAAT box signal 93 98 TATA box BASE COUNT 50 a 77 c 111 g 42 t ORIGIN 1 gcaggagggc gggcgggggg cagggtctcc gggggaccgg tcgggccatg gcggaggacg 61 gcgaggaggc ggagttccac ttctcggcgc tctatataag cgggcagtgg cccggctgcg 121 cgcagacact gaccttcagc gtctcgctcc atgccatggc gccttccagg aagttcttcg 181 ttggggggaa ctggaagatg aacgggcgga agcagaatct gggggagctc atcggcactc 241 tgaacgcggc caaggtgccg gcagacaccg gtaagcctcg // LOCUS MACTRIIS2 144 bp ds-DNA PRI 14-AUG-1990 DEFINITION M.mulatta triosephosphate isomerase gene, exon 2. ACCESSION M37562 M35724 KEYWORDS triosephosphate isomerase. SEGMENT 2 of 7 SOURCE M.mulatta DNA. ORGANISM Macaca mulatta Unclassified. REFERENCE 1 (bases 1 to 144) AUTHORS Old,S.E. and Mohrenweiser,H.W. TITLE Nucleotide sequence of the triosephosphate isomerase gene from Macaca mulatta JOURNAL Nucleic Acids Res. 16, 9055-9055 (1988) STANDARD simple staff_review FEATURES from to/span description pept + 11 + 134 triosephosphate isomerase, exon 2 (E.C. 
5.3.1.1) IVS < 1 10 triosephosphate isomerase intron A IVS 135 > 144 triosephosphate isomerase intron B BASE COUNT 33 a 38 c 39 g 34 t ORIGIN 1 ttgccctcag aagtggtttg tgctcccccc actgcctata tcgacttcgc ccggcagaag 61 ctagatccca agattgctgt ggctgcacag aactgctaca aagtgactaa tggggccttt 121 actggggaga tcaggtgaga tcga // LOCUS MACTRIIS3 105 bp ds-DNA PRI 14-AUG-1990 DEFINITION M.mulatta triosephosphate isomerase gene, exon 3. ACCESSION M37563 M35724 KEYWORDS triosephosphate isomerase. SEGMENT 3 of 7 SOURCE M.mulatta DNA. ORGANISM Macaca mulatta Unclassified. REFERENCE 1 (bases 1 to 105) AUTHORS Old,S.E. and Mohrenweiser,H.W. TITLE Nucleotide sequence of the triosephosphate isomerase gene from Macaca mulatta JOURNAL Nucleic Acids Res. 16, 9055-9055 (1988) STANDARD simple staff_review FEATURES from to/span description pept + 11 + 95 triosephosphate isomerase, exon 3 (E.C. 5.3.1.1) IVS < 1 10 triosephosphate isomerase intron B IVS 96 > 105 triosephosphate isomerase intron C BASE COUNT 23 a 23 c 35 g 24 t ORIGIN 1 cttcctttag ccctggcatg atcaaagact gcggagccac gtgggtggtc ctggggcact 61 cagagagaag acatgtcttt ggggagtcag atgaggttag tagcc // LOCUS MACTRIIS4 153 bp ds-DNA PRI 14-AUG-1990 DEFINITION M.mulatta triosephosphate isomerase gene, exon 4. ACCESSION M37564 M35724 KEYWORDS triosephosphate isomerase. SEGMENT 4 of 7 SOURCE M.mulatta DNA. ORGANISM Macaca mulatta Unclassified. REFERENCE 1 (bases 1 to 153) AUTHORS Old,S.E. and Mohrenweiser,H.W. TITLE Nucleotide sequence of the triosephosphate isomerase gene from Macaca mulatta JOURNAL Nucleic Acids Res. 16, 9055-9055 (1988) STANDARD simple staff_review FEATURES from to/span description pept + 11 + 143 triosephosphate isomerase, exon 4 (E.C. 
5.3.1.1) IVS < 1 10 triosephosphate isomerase intron C IVS 144 > 153 triosephosphate isomerase intron D BASE COUNT 39 a 29 c 49 g 36 t ORIGIN 1 ttctcaacag ctgattgggc agaaagtggc ccatgctctg gctgagggac tcggagtaat 61 cgcctgtatt ggggagaagc tagatgaaag ggaagctggc atcactgaga aggttgtttt 121 cgagcagaca aaggtcatcg caggtatctc ttg // LOCUS MACTRIIS5 102 bp ds-DNA PRI 14-AUG-1990 DEFINITION M.mulatta triosephosphate isomerase gene, exon 5. ACCESSION M37570 M35724 KEYWORDS triosephosphate isomerase. SEGMENT 5 of 7 SOURCE M.mulatta DNA. ORGANISM Macaca mulatta Unclassified. REFERENCE 1 (bases 1 to 102) AUTHORS Old,S.E. and Mohrenweiser,H.W. TITLE Nucleotide sequence of the triosephosphate isomerase gene from Macaca mulatta JOURNAL Nucleic Acids Res. 16, 9055-9055 (1988) STANDARD simple staff_review FEATURES from to/span description pept + 7 + 92 triosephosphate isomerase, exon 5 (E.C. 5.3.1.1) IVS < 1 6 triosephosphate isomerase intron D IVS 93 > 102 triosephosphate isomerase intron E BASE COUNT 26 a 25 c 31 g 20 t ORIGIN 1 ctgcagataa tgtgaaagac tggagcaagg ttgtcctggc ctatgagcct gtgtgggcca 61 ttggtactgg caagactgca acgccccaac aggtaaccga cg // LOCUS MACTRIIS6 108 bp ds-DNA PRI 14-AUG-1990 DEFINITION M.mulatta triosephosphate isomerase gene, exon 6. ACCESSION M37571 M35724 KEYWORDS triosephosphate isomerase. SEGMENT 6 of 7 SOURCE M.mulatta DNA. ORGANISM Macaca mulatta Unclassified. REFERENCE 1 (bases 1 to 108) AUTHORS Old,S.E. and Mohrenweiser,H.W. TITLE Nucleotide sequence of the triosephosphate isomerase gene from Macaca mulatta JOURNAL Nucleic Acids Res. 16, 9055-9055 (1988) STANDARD simple staff_review FEATURES from to/span description pept + 11 + 98 triosephosphate isomerase, exon 6 (E.C. 
5.3.1.1) IVS < 1 10 triosephosphate isomerase intron E IVS 99 > 108 triosephosphate isomerase intron F BASE COUNT 26 a 24 c 32 g 26 t ORIGIN 1 cttgttctag gcccaggaag tacatgagaa gctccgagga tggcttaagt ccaacgtctc 61 tgaagcagtg gctcagagca cccgtatcat ttatggaggt gagtggct // LOCUS MACTRIIS7 632 bp ds-DNA PRI 14-AUG-1990 DEFINITION M.mulatta triosephosphate isomerase gene, exon 7. ACCESSION M37572 M35724 KEYWORDS triosephosphate isomerase. SEGMENT 7 of 7 SOURCE M.mulatta DNA. ORGANISM Macaca mulatta Unclassified. REFERENCE 1 (bases 1 to 632) AUTHORS Old,S.E. and Mohrenweiser,H.W. TITLE Nucleotide sequence of the triosephosphate isomerase gene from Macaca mulatta JOURNAL Nucleic Acids Res. 16, 9055-9055 (1988) STANDARD simple staff_review FEATURES from to/span description pept + 11 129 triosephosphate isomerase, exon 7 (E.C. 5.3.1.1) IVS < 1 10 triosephosphate isomerase intron F BASE COUNT 122 a 201 c 154 g 155 t ORIGIN 1 cccttcccag gctctgtgac tggggcaacc tgcaaggagc tggccagcca gcctgacgtg 61 gatggcttcc ttgtgggtgg tgcttccctc aagcccgaat tcgtggacat catcaatgcc 121 aaacaatgag ccccatccac cttccctacc cttcctgcca agccagggac taagcagccc 181 agaagcccag tcactgcccc ttccccgcac atgcttctga tggtgtcgcc tgctccttcc 241 tgtggcctca tccaaactgt accttccttt actatcttca gcctgtaatg gttgggacca 301 ggccaatccc ttctccactt actataatgg ttggaactaa atgtcaccaa ggtggcttct 361 cctcggctga gagatggaag gggtgggatt tgctcctggg ttccctaggc cctagtgagg 421 gcaggagaga agccatcctc tcccttctta cacagtgagg ccgagatcct cccctccgaa 481 ggcaggagtg ccgccctctc ccatggtgcc catgcctgtg tgctgtgtac gtgaaccacc 541 cacatgtgag ggaataaaca cctagcacta ggtcttgtgg tctgtctgcc ttcactggac 601 ttgcccagat aatcttcctt tttgaggcag ct // LOCUS MUS2NF3MK 2470 bp ss-mRNA ROD 14-AUG-1990 DEFINITION Mouse zinc finger protein (mkr3) mRNA, 3' end. ACCESSION M36514 KEYWORDS mkr3 gene; zinc finger protein. SOURCE Mouse 8.5 day cell line C57BL embryo, cDNA to mRNA, clone 15. 
ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 2470) AUTHORS Chowdhury,K., Rohdewohld,H. and Gruss,P. TITLE Specific and ubiquitous expression of different Zn finger protein genes in the mouse JOURNAL Nucleic Acids Res. 16, 9995-10011 (1988) STANDARD simple staff_review FEATURES from to/span description pept < 1 1287 zinc finger protein (mkr3; AA at 1) BASE COUNT 695 a 501 c 488 g 786 t ORIGIN 1 ataaaaactc acactggtga gaagcccttc cggtgtaaag tatgtgcaag gacttttaga 61 aattcctcat gccttaagac taactttcga attcacactg gaataaaacc atacaaatgt 121 aattactgtt ggaaagcctt cactgcacgc tcaggcctta ctaagcatgt actaattcac 181 aatggtgaga agccctatga gtgtaaggag tgtgggaaag ccttcagtac atcttccggc 241 cttgttgaac atataagaat tcatacagga gagaagccct ttgaatgtta ccagtgtggg 301 aaagccttgg tccattcctc atcccttgtt ggacatttaa gaactcacac tggagagaaa 361 ccctttgagt gtaatcagtg tgacaaaact tttacacgat cttcttatct tcgtattcat 421 atgcgaactc acaccggaga gaaaccatat gaatgtaaag agtgtgggaa aactttccct 481 gagcgctcat gccttactaa acacataagg acacacactg gtgaaaggcc ctatgaatgt 541 aaggagtgtg acaaaggctt tattagcttt gctcagctta ctgtacacat aaaaactcat 601 agttctgaga gaccctttca gtgtaaggta tgcacaaaat cttttagaaa ctcttcatcc 661 cttgagaccc acttccgaat tcacactgga gtaaaaccct ataaatgcag ttactgtggg 721 aaagccttca ctgctcgttc aggccctacc atccatttac gcaatcacac tggggagaaa 781 tcctatgcat gccaagagtg tggaaaggcc tttagcactt cctcaggcct tattgcacat 841 ataagaagtc acaaaggaga gaaacccttt gaatgtgacc actgtgggaa ggcctttcct 901 tcttcatctt atcttaatgt gcatttgaaa attcacactg gggaaaaacc ctttcggtgt 961 acggtatgtg ggaaaacatt tacgtgttct tcttaccttc cggttcacat gcgaactcac 1021 actggaggga ggccttttcg gtgtataata tgtggaaggt catttttgtg gtcctcgtac 1081 cttagagttc acatgcgaat tcacactgga gagaaaccct atgtatgtca gtactgtgga 1141 aaagctttta cagagcactc aggccttaat aaacatttac ggaaacacac aggagagaaa 1201 ccatatgaat ataaggaatg tggggaaaac ttcactactt ctgctgatgc taatgaacat 1261 
gaaactcccc actggggaga cctttgaatg taagggatca gaaaattgat agccgttgac 1321 catcatttgc aatcaaaatt tatagtctat acaagtctta ttgataagca acgtataaaa 1381 aacattcggt tgctatgtgc tgtaaaaaca actcattctg aggatgctct gagtttaagg 1441 attgtgggag aaatcaaaat ttcatgatat atgtatgaac tgaaggtttg taatataggg 1501 aacgaaaacc tttagtgttt ctatgagtgt tgatgcaaga catggagaaa tgtttctatg 1561 ttcacgtgtg gtctcacagt tcgagaaatg tctttctgaa gtttgggaag taactttaca 1621 tttgaacaca agctacatgc gtgtaagagg ggatttgctt cagtgtcttg gatggcaaca 1681 cttggctttc ttgggtgctt ctctatgaac tttgaatatt tctccatttg tgtattttta 1741 tgacctcaat tccattgtta gttttccttt ttaaaatatt cctgtttgat gtgattgaaa 1801 cttgttttta tctagtatct gttatatgtc ccatgttggc attgtgaacc taatgacaca 1861 aactcccatc tcctcactac ttgttagttt atttgttgtt gtttttttcc cctcttttct 1921 ttggagacag tctcactttc tggcctagta taacatgaaa gttatgtagc ctagactggt 1981 ccctagctca tggcagtcta gacccagcct tctgagtgca tagattacag gtgcaagcca 2041 ccatacctgc tctttctact gttttaagta taaacttcac tagcactgaa aaaaccatca 2101 tgttttacaa ttgtgttcag gttcttaaca caagtaggtt tggatctgac caatacaatc 2161 ccttactctg agttctaagt ctttggtagc cattccattc tctgcttgga gttcagcttt 2221 tctttaggct ttgcttctta gatcatgtag tatctgtatt aactctactt agctttttta 2281 tttttgtgac agggtcttgc tgtagctcaa actgacctca aactcgcaaa cctttttgcc 2341 tcagacttac tgctacagtt ataggcatgt gccttcatgc ctagctggcg tatatcaatc 2401 tacataatgt tctcaaggtg gaactgtgtt atggtcagtt attttaagac atctaattat 2461 cctttggcct // LOCUS MUS2NF4MK 1258 bp ss-mRNA ROD 14-AUG-1990 DEFINITION Mouse zinc finger protein (mkr4) mRNA, partial cds. ACCESSION M36515 KEYWORDS mkr3 gene; zinc finger protein. SOURCE Mouse 8.5 day cell line C57BL embryo, cDNA to mRNA, clone 14. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 1258) AUTHORS Chowdhury,K., Rohdewohld,H. and Gruss,P. 
TITLE Specific and ubiquitous expression of different Zn finger protein genes in the mouse JOURNAL Nucleic Acids Res. 16, 9995-10011 (1988) STANDARD simple staff_review FEATURES from to/span description pept < 1 > 1258 zinc finger protein (mkr4; AA at 1) BASE COUNT 356 a 335 c 321 g 246 t ORIGIN 1 aaagtccttt ttccaagtat cttctctgtt caggcaccgg agatccacac tggagaaaag 61 ctctacgact gtagccattg tgggaaaggc ttctcttaca actcagacct caggatacat 121 cagaagatcc atacaggaga gaaacgccat ggctgcgtcg actgtggcaa agccttcaca 181 caaaagtcca cactgaggat gcaccagaag atccacacgg gcgagagggc ctatgtctgt 241 attgagtgtg gacaggcttt tatccagaag acacacctgg tcgcacaccg aagaattcac 301 tcaagagtga ggccctgtgt atctctcgac cgtgcgaaac ccttcagcag cgctcccaac 361 ctccttccgc gtaagaaagt tcaaatgaga gagaaatcgt ccatctgcgc ggagtgtggg 421 aaggccttta cctacaggtc ggagctaatc attcatcaga gaactcacac gggagagaag 481 ccttatcagt gcggggactg tggcaaagcc ttcacccaga agtcagccct caccgtgcac 541 cgaagaatcc acacggggga gaagtcgtat gtgtgtgtga agtgcgggct agccttcgtg 601 caaagggcac acttggatgc acaccaggtg attcacactg gagaaaaacc ttaccagtgt 661 ggtcattgtg ggaaattctt cacttccaag tcgcaactcc acgtgcacaa gcggattcac 721 acgggggaaa aaccctatgt gtgcagtaac tgtgggaagg cctttgccaa caggtcaaat 781 ctcatcacac atcagaaaac tcatacaggg gagaaagcct atgtctgtgc aaggtgtggg 841 aaagcattca ctcagaggtc agaacctgtt acacaccaga ggatacatac cggagagaag 901 ccatatgggt gtagaccctg cgggaaagcc tttacccaga aatcacacct cagtatacac 961 gagaagattc acaccggaga gagacagtac ggatgccgag actgtgggaa agccttcaac 1021 cagaaatcca tactcatcgt gcaccagaaa atccacacgg gggagaaacc gcaggtgtgt 1081 gccgagtgcg ggcgagcttt catccgcaag tccaacttca tcacccatca gcgcatccat 1141 actggcgaga aaccttatgg atgcaccgac tgcggcaagt cctttacctc caagtcgcag 1201 ctcctagtgc accggccaat ccacaccggc gagaaacctt acgtgtgtgc cgagtgtg // LOCUS MUS2NF5MK 2433 bp ss-mRNA ROD 14-AUG-1990 DEFINITION Mouse zinc finger protein (mkr5) mRNA, 3' end. ACCESSION M36516 KEYWORDS mkr3 gene; zinc finger protein. 
SOURCE Mouse 8.5 day cell line C57BL embryo, cDNA to mRNA, clone 10. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 2433) AUTHORS Chowdhury,K., Rohdewohld,H. and Gruss,P. TITLE Specific and ubiquitous expression of different Zn finger protein genes in the mouse JOURNAL Nucleic Acids Res. 16, 9995-10011 (1988) STANDARD simple staff_review FEATURES from to/span description pept < 1 1845 zinc finger protein (mkr5; AA at 1) BASE COUNT 752 a 516 c 536 g 629 t ORIGIN 1 cagcagtctg tacaggaagc ccaggatctg cttccaaggc aggactcaca tgctgaacgg 61 gtaacaggca gaacttggag cactaaactt gagtgctcca ctttcagaga tcaggattct 121 gagtgtacgt ttgaaaggaa tgagcaagag acagtcactc cgaacagagc cttctcagag 181 gggagagatg gtatgtgtat cgaatctgga agatggtttc atttgaacag ttcagacgag 241 agaagtcata actgtgactc aggtaaaagt ttctcctcaa atccagtagt tgtaaaggaa 301 accggaatct gttcaggaaa aaaacttttc caatgtaatg aatgtaagaa aactttcacc 361 cagagctcct cccttaccgt tcaccagaga attcatactg gagagaagcc ctataaatgc 421 aatcagtgtg ggaaggcctt cagtgacggg tcctccttcg cacggcacca gaggtaccat 481 acaggcaaga agccgtacga gtgcccggag tgtggcaaag ctttcataca gaacacctcg 541 ctcgttcgtc actggaggta ctatcacact ggggagaaac ccttcgattg catcgactgt 601 gggaaagcct tcagtgacca catagggctt aatcaacaca ggaggattca cactggagag 661 aaaccataca cgtgtgaagt gtgtcacaaa tcctttaggt acggctcatc cctcactgtg 721 caccaaagga ttcatactgg agagaaaccg tacgagtgtg agatttgcag aaaagccttc 781 agccaccatg catccctcac tcagcatcaa cgagtgcatt ctggagaaaa gccttttaaa 841 tgtaaagagt gtgggaaagc ttttaggcag aatatacacc ttgctagtca ctggaggatc 901 cataccgggg agaagccctt tgagtgtggg gaatgtggga aatctttcag catcagctca 961 cagcttgcca cgcatcagag aattcataca ggagagaagc cctttgagtg taaggtttgt 1021 aggaaagctt ttaggcagaa tatacacctt gctagtcact ggagaatcca taccggggag 1081 aagccctttg agtgtgggga atgtgggaaa tctttcagca tcagctcaca gcttgccacg 1141 catcagagaa ttcatacagg agagaagcct tatgaatgta aggtttgtag gaaagcattt 1201 acgcagaagg 
ctcaccttgc acaacatcag aagactcata cgggggagaa gccatacgag 1261 tgcaaggaat gtggcaaggc cttcagccag accactcacc tcattcaaca ccagagggtt 1321 catactggag agaagcccta taaatgcctg gaatgtggga aggcctttgg tgataactca 1381 tcctgtactc aacatcagag acttcacact ggcctgaggc cttatgagtg tgtggagtgt 1441 ggaaagacat tcaagaccaa gtcatccctt atctgttatt gtagatgtca cacgggagag 1501 aaaccttatg agtgcagtgc atgtggtaaa gcctttagcc actggcagtc ccttagtgta 1561 catcagagaa ttcattcggg gaagaaaccg tatgaatgta aggaatgtcg gaaaaccttc 1621 atccagattg gacaccttaa ccaacataag agagtccata ccggggagag aacgtacaac 1681 tacaagaagg gcaggagggc cttcaggcag actgcacact ttgctcacca tcagcagatt 1741 cattccggaa agtcacctgc tcaccactct ttgccttcca catcgaatcc tgtggatctc 1801 ttctccaaat tcgtctggaa tccatcctca ctgccatcat cataacctca tattttcacc 1861 acacttgttt atcaacttgt ctctttgccc ctttgttcat tcttgtccct tatgagtcct 1921 tcatattaca accaggttgg attatttttg gagtgtaaat gtaattaatt tgctcattta 1981 aagcttatat tcactgggga aaaaaaaaga agttagctga aacataaaga aacctcatga 2041 atctaagaaa tgttccatat cttgctctgg gtagtatatc tgtaaatctt agcaagccct 2101 gtatgtagta ttttaagaac tttctgactt tgttatcctt taatttttaa tgctgttata 2161 ttaagaaaaa cacatttata ataaatatgg tattttaaat aaaaagtatt atatcttata 2221 gctatttttc agataaacaa aaacaaattt gatacaaagt aaaatatgta taaggttggt 2281 gttccagaac aaaatctgca ggaatattta cagcatggta taaagcaacg aaaccaggtc 2341 atccaccaac attttgcctt agattttcta atatttcagc atatgagtag cacagtatta 2401 aaacactata ggtagtagca tttcacagaa ttc // LOCUS ORCHRCOM 1579 bp ds-DNA MAM 14-AUG-1990 DEFINITION O.orca highly repetitive element DNA. ACCESSION M36451 KEYWORDS repetitive element. SOURCE O.orca DNA. ORGANISM Orcinus orca Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Odontoceta; Delphinidae; Orcininae. REFERENCE 1 (bases 1 to 1579) AUTHORS Widegren,B., Arnason,U. and Akusjaervi,G. TITLE Characteristics of a conserved 1,579-bp highly repetitive component in the killer whale, Orcinus orca JOURNAL Mol. Biol. Evol. 
2, 411-419 (1985) STANDARD simple staff_review FEATURES from to/span description rpt 1 1579 highly repetitive element BASE COUNT 368 a 358 c 333 g 520 t ORIGIN 1 atcttatgat gtttcttttt ttgtgtgact tatttcatgt agaatcatcg tacctgaatc 61 cactcattat gctgctacgg gcctgatgac atagatttca ttgctgagtg atattgcatt 121 gtacgtaagt accacaaagt ctttatccat ttttcgcttt ctgcgatact gaacttgtac 181 cataaacgag gttcttgtaa acagagccgt cacaaacttt gggatggctg tgtctttttc 241 attttaattt caactaagct ataggaccat aagtggaagt gccctaggct ctgttgcttt 301 gttgtttaga tgtttcagga aacaccatac acttctccag agtggctgtt ggcaatttac 361 atcccgccca tcagcataac aaggctccca attctccatg gcctgtcccg cctttctgga 421 ttttacactt ttttcagatg gcccttttga ccgtggggca gtgagacttc atgtagtgca 481 gatttccttt gcaagcttgc ttggttggcc aaaaagttcg tatgcgtttc tttctgaata 541 tattcaggaa aaaacgcata agaccttttt ggccaagtgc atcattgtgg acgttctgcc 601 tgttttccta tgctttacat gcaattccag tctacctcct gaaatcggtg tctggcaatt 661 ttgccccgct ttcaagtcct cttggcagcc ttacttcaat atatttttgg acgatagctg 721 tcatttataa ctctgcaggt ttgtgaatca cagtgcccct gagctccttt cttcaactgg 781 ctttcttgtg agctggctgc aacaccgcag gattgcttga ggccctcgtg tggttccggc 841 agggcacgct gagccttttg ttaattcctc ttcctggtgg gaaatgaggg ttaaatttgc 901 ccgtccagac acctccagct agtctctcat tgattctccc tattcctgtt cattttccgc 961 agaaattgca aactgggcca aacaggaggt taaaggcact gactctccaa gtggggagag 1021 tgttagtaaa gcgtctggaa tcttgcaccc gagtaccagg ggacgaaaac tgagacacat 1081 ttgaacacgt ttcccgatca cacggtggat catactctgg gttccacatg catgttttag 1141 ctgaaggaag aatcccttaa acctggagag ttgagaccca tggaatgggt accatgcaat 1201 atgacttcaa agggtctgta tttgctaact gaaactcact aatcctatca ctgctgcgtt 1261 tatgccactg tacacacgct tgattccctt tcggagacat ataagtccac aggttttaag 1321 attcttacta gtcaggtata ttcttaggag tttaatatgg agtgttgagt ccactcgatc 1381 agcaaggagt agctcttgtc tattaaatat ttggcttatg gaacggtatc tgtgctgatt 1441 tcaatctctg gttttatgca gcacctcaac tcacctttcc ccttaagcaa gcataagttg 1501 ctgttctaca tttgagaccc tgttctgttt tgtaattcag ttcctgtgta gccaagttta 1561 
cattccgtgt agtagtgat // LOCUS RICADH2A 3779 bp ds-DNA PLN 14-AUG-1990 DEFINITION Rice alcohol dehydrogenase (adh2) gene, complete cds. ACCESSION M36469 KEYWORDS adh2 gene; alcohol dehydrogenase. SOURCE Rice DNA. ORGANISM Oryza sativa Eukaryota; Plantae; Embryobionta; Magnoliophyta; Liliopsida; Commelinidae; Cyperales; Poaceae. REFERENCE 1 (bases 1 to 3779) AUTHORS Xie,Y. and Wu,R. TITLE Molecular analysis of an alcohol dehydrogenase-encoding genomic clone (adh2) from rice JOURNAL Gene 87, 185-191 (1990) STANDARD simple staff_review FEATURES from to/span description pept 639 669 alcohol dehydrogenase (adh2), exon 1 768 904 alcohol dehydrogenase, exon 2 1017 1063 alcohol dehydrogenase, exon 3 1672 1998 alcohol dehydrogenase, exon 4 2114 2195 alcohol dehydrogenase, exon 5 2301 2376 alcohol dehydrogenase, exon 6 2471 2529 alcohol dehydrogenase, exon 7 2628 2723 alcohol dehydrogenase, exon 8 2854 3012 alcohol dehydrogenase, exon 9 3149 3247 alcohol dehydrogenase, exon 10 IVS 670 767 adh2 intron A IVS 905 1016 adh2 intron B IVS 1064 1671 adh2 intron C IVS 1999 2113 adh2 intron D IVS 2196 2300 adh2 intron E IVS 2377 2470 adh2 intron F IVS 2530 2627 adh2 intron G IVS 2724 2853 adh2 intron H IVS 3013 3148 adh2 intron I BASE COUNT 1002 a 799 c 867 g 1111 t ORIGIN 1 taatcattac caacagatat cttagaaaaa aaagctatta ttcctcaaaa ggaaaaaaaa 61 ggctattatt ccttacaaga aaaaagaaga aggcaatgca ttatgctttt ctaaatgtct 121 agattaattc atagttggac agatatgacc gcatatcaac gttaattact gccgatttga 181 ctgttgatta gacatatctt agcactaaac ctcactgaaa attcggagaa tttactattg 241 gagagaagaa aaggcatcca tcctccttgt cttcactctg caatctgcat catcatccgc 301 ggaaaccgga aaaccaccga agctcgaaaa ccggcggagg caagagcagc tagcagcatg 361 cagcagacgc agagccttcc tcttcttcca agcaaaaaat attcagatga cgcctcctcc 421 tctttccccc ctcctcccac cttgcttggt tcctttcagg cgtacttgcc cgcgagactc 481 ctaccactct ctgatcccta tatatacacc ccctcctccc ctctcgtctc ctcaccagaa 541 ttccaagcaa cgaactgcga gtgattcaag aaaaaagaaa acctgagctt tcgatctcta 601 cggagtggtt tcttgttctt 
tgaaaaagag ggggattaat ggcgacaggg aaggtgatca 661 agtgcaaagg tccgttcttg ctgttcttgg agcggttcgt gagaaaatta gtttgatgtt 721 ttgtggggag gagaagctga ttagatcggg ttttggggtt tttgcagcgg cggtggcatg 781 ggaggccggg gaagcgctgt cgatcgagga ggtggaggtt gcgccgccgc agcgcatgga 841 ggtccgcgtc aagatcctct acaccgccct ctgccacacc gacgtctact tctgggaggc 901 caaggtaatt aatcagcatc tgccatggct gagctttctc accttctcaa gcttgcaaga 961 atgatcagtg actaactaac ttggtgtgtg tgtggtttct gctttgatct tcgcaggggc 1021 aaacacctgt tttccctagg atcttgggcc atgaagctgg agggtatgtg taatctgaac 1081 caaccctgtc caattcttgt tcttcttcta ctacactgtt tagtggttca gtctattctg 1141 ttcttgtatt gttttttttt tgttgaaaat ggtttttggt gtttagtgga atttagtgtg 1201 tacttgtacg tcataggttg tttggtttat ggtattcggt tactaactat agttagtgtg 1261 cacgttgggg agtcctgatt agtcagtggc attttcatta agttcagtaa agaaatctac 1321 caagtgtcca attgtgcatg atattttacc tggtttggat aaatcatatg aaggaatcaa 1381 agaacagcgt cattgatgac atatacaacg acaattggat tcctaggata atatactttt 1441 cttcatgtgc tcctctatgc tagtgcttta ttactttctc tcctttgatg ttttgaggta 1501 ctcaaaggaa aaagacacac tgaatcatta tccatcttat gaactgttta aacttgactt 1561 acttatcact tctctgctca aatggacccc tttttttttt catttccatg gaagatatat 1621 gtgtaatcct aacaatcaaa ttgatctaac aaaacaattt tttgctcaca gcattgtgga 1681 gagtgtggga gagggtgtga ccgaactcgc cgcgggcgac catgtcctcc cggtgttcac 1741 cggcgagtgc aaggagtgtg atcactgcaa atcggaggag agcaacatgt gtgacctcct 1801 caggatcaac gtcgaccgcg gcgtcatgat cggcgacggc aagtcccgat tcaccatcaa 1861 ggggaagccc atcttccact ttgttggcac ttccaccttc agcgagtaca ccgtcatcca 1921 tgtcggctgc ctcgcgaaga tcaacccgga ggcgcctctc gacaaggtct gcattctcag 1981 ctgcggtttc tcgtccaggt aagatgatgg atggcttcat gattatgatg acgtagtagt 2041 tttcacagtg aaatttggaa aggtcttttt ttttttggat gcattaagtt gatgggtttt 2101 gctgttcttg caggtttggt gcgacagtga atgtcgcgaa accgaaaaag ggccagaccg 2161 tagctatttt cggtcttgga gctgtaggtc ttgctgtaat gggcacacgc acaaagatat 2221 attcatttcc atttgatttg cagattgttg agatcttgaa actcaaactg atgaatgttt 2281 gtaaaattac atgattatag gctatggaag 
gtgccaggct gtctggagca tcgaggatca 2341 ttggtgtgga cctgaaccct gcgaaattcg aacaaggtac aatcttttat catctttgag 2401 tgaaaattac atgtttggta tatgttctgg attgaacact gaactctgca acttgttttc 2461 atctcttcag ctaagaagtt tggctgcact gacttcgtaa accccaagga ccacagcaag 2521 ccagtgcagg tacatcacat tgcctcacta atttctctta ttatcacaca ctgttcatga 2581 catggagtgt acatacttaa acatcaactc tgcaatggtg atctcaggtg ctgattgaga 2641 tgaccaacgg tggactcgac agggccgttg aatgcacggg caacatcaac gccatgatat 2701 cctgcttcga atgtgtccat gatgtcagtc tctttcttac aaacatgaaa catgatagta 2761 acaaattgaa gatgatcatt tcagcaatgc ctccagttct gctgtaaact gtatcctgaa 2821 aactaagagt ggcaatctga aacctgaatc cagggatggg gcgttgcggt gctggtcggc 2881 gtgaaggatg atgtgttcaa gacccacccg atgaacttcc tcaacgagaa gacgctgaag 2941 gggacgttga tcttcggtaa ctacaagccg cgcaccgacc tgcccaacgt cgtcgagctg 3001 tacatgaaga aggtaaaaat aattgcagaa cagaatcaga tttgcaacct gaatgtgcat 3061 tgttcagaaa tctatggtca gtgattccaa tgagccactg attgatcact attttatgct 3121 aatgatttct ggtgtttgat gtctgaagga gctggagctg gagaagttca tcacgagcgt 3181 gccgttctcg gagatcaaca cggcgttcga tctcatgctc aaggggagag tctccgctgc 3241 gtcatgagta tgtatgagta gagatgatga tgatctatct ctctctctct ctctctgcag 3301 tttcagtgca tacaagaaac caaaaatgaa agcaaaaaat ggatgagaag tttgaggttt 3361 tccatttaag tttaataatg gtcgtgagac agggcttgtg tggagtcagc tctgtttact 3421 ccctcctttc atggcttgat cttgtttatt gtcagcattg tggcttagca gattttcacc 3481 agttttaccc tgtaaattag taccattctg atcaatcgta ataaactact agcagtgtga 3541 ggtactattg ctggaactga attgattgtg cagtaaagag gtgaatagca gccagtatga 3601 aggttttaca atttgaacat cgaaacgaaa tccacacgaa attcataaat ttcatcgcat 3661 cttcgtgaag cttaccagat tctatcgatt cgatctagct ccaataggtc gatgggagca 3721 tctattgatt ttttcgacag acgaaaattt tgcaagggac tagctaggga gttgtcgac // LOCUS YSCCTA1A 2785 bp ds-DNA PLN 14-AUG-1990 DEFINITION S.cerevisiae catalase A (CTA1) gene, complete cds. ACCESSION M36510 KEYWORDS catalase A. SOURCE S.cerevisiae DNA. 
ORGANISM Saccharomyces cerevisiae Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes; Endomycetales; Saccharomycetaceae. REFERENCE 1 (bases 1 to 2785) AUTHORS Cohen,G., Rapatz,W. and Ruis,H. TITLE Sequence of the Saccharomyces cerevisiae CTA1 gene and amino acid sequence of catalase A derived from it JOURNAL Eur. J. Biochem. 176, 159-163 (1988) STANDARD simple staff_review FEATURES from to/span description pept 820 2367 catalase A (CTA1) BASE COUNT 917 a 529 c 571 g 768 t ORIGIN 1 bp upstream of EcoRI site. 1 gaattcttag aaggtgaaga aatagtacta gattcatatg attgttataa taatggtgaa 61 ttgttgcctc aactaatact tttggtccaa atcttgacaa ttctttgcca aattccaggt 121 ttatgcaaac tggacataaa agcaatggaa aggcaagtgg agagaattgt aaagaagtgt 181 ttacaattga tagaaggtgc ccgcgccact acaaactgta gtgccacatg gaaacgttgt 241 attatgaagc gtctagccga ttaccccata aaaaagtgcg tttctatcga aaaaccttcg 301 aaaggaaact cattaacaag ggaagaacta agagatgtta tggctcggag agttttgaaa 361 agcgaaatag attcgctgca agtttgtgaa gaaaccatcg acaagaatta caaggttatt 421 cctgatgaaa agctgctaac taatatttta aagagaaagt tgacagagga agaaaaaagc 481 tctgtcaaac gtccttgcgt gaagaagtga gcggttgttc taaccactat ttaaagccgc 541 aattagtaat gcaaaaagtt ggccggaatt agccgcgcaa gttggtgggg tcccttaatc 601 cgaaaaagga cggctttaac aaatataaac tccgaaaatc cccacagtga cagaattgga 661 gaaacaacca gttttgatat cgccatacat ataaagagat gtagaaagca ttcttcactg 721 taatgtccaa atcgtacatt tgaatttctt gtaggtttat ttaaaaggta agttaaataa 781 atataatagt acttacaaat aaatttggaa ccctagaaga tgtcgaaatt gggacaagaa 841 aaaaatgaag taaattactc tgatgtaaga gaggatagag ttgtgacaaa ctccactggt 901 aatccaatca atgaaccatt tgtcacccaa cgtattgggg aacatggccc tttgcttttg 961 caagattata acttaattga ttctttggct catttcaaca gggaaaatat tcctcaaagg 1021 aatccacatg ctcatggttc tggtgccttc ggctattttg aagtaaccga tgacattact 1081 gatatctgcg ggtctgctat gtttagtaaa attgggaaaa gaacgaaatg tctaacaaga 1141 ttttcgactg tgggtggtga taaaggtagt gccgacacgg ttcgtgatcc aagggggttt 1201 gccaccaaat tctacactga agaaggtaat ttagattggg tctacaataa tacaccggta 1261 
ttctttatca gagacccttc caagttccct cactttatcc acacacagaa gagaaaccca 1321 caaaccaacc taagggatgc tgacatgttt tgggatttcc tcaccactcc tgaaaatcag 1381 gtggccattc atcaagtaat gatccttttt tcagaccgtg gtacccctgc caactaccgt 1441 agtatgcatg gttattctgg tcatacctat aaatggtcca ataaaaacgg agattggcat 1501 tatgtgcaag ttcatatcaa aaccgatcaa ggaataaaga atttgaccat agaagaggct 1561 accaaaattg cgggatccaa tccagattac tgccagcagg atttatttga ggctattcag 1621 aatggaaact atccttcctg gacagtttat attcaaacaa tgaccgaacg cgatgccaaa 1681 aaattaccat tttcagtctt tgatttgact aaagtatggc ctcaggggca attcccttta 1741 cggcgtgtgg gtaagattgt tttgaacgag aatccactga acttcttcgc acaggtggaa 1801 caagctgcct tcgcccccag taccacggtt ccttaccaag aagcaagcgc tgatccagta 1861 ttacaggccc gtttgttttc atatgcggat gctcatagat acaggctagg tcctaacttc 1921 catcaaatac ccgtaaactg tccatatgca tctaaatttt tcaatcccgc tatcagagat 1981 ggaccgatga atgttaacgg caacttcggc tcagaaccta catatttggc caacgataaa 2041 tcgtacacgt atatccaaca ggacagaccc attcaacaac accaagaggt atggaatggg 2101 ccagctatcc cttatcattg ggcaacatcc ccaggtgatg tagatttcgt gcaagcaaga 2161 aatctctacc gcgttttggg taaacaacct ggacagcaaa agaacttggc atataacatc 2221 ggcattcatg tagaaggcgc ctgtcctcaa atacagcagc gcgtttatga tatgtttgct 2281 cgtgttgata agggactatc tgaggcaatt aaaaaagtag ctgaggcaaa acatgcttct 2341 gagctttcga gtaactccaa attttgaaac gctcaagtaa caaatgagtg gcgttgtttc 2401 cacgacaatt atttatgata gtgtgtattt ttaacacatt ttatttatta caatttatgt 2461 attttgttat gaattattta tttatacgac taataggtga tgctcatatt ctcgtgttag 2521 aaagttaaaa aaattatcat ttcacacata ggaaagctcg tcgcgccggg gaaaaagctg 2581 aggaatctct attattaggg gtaaagttca acacattcag tatgagataa gtgtgtcttc 2641 aagagagatg cagcactgag tagggaacca agaaacgatg tctgaattac tagatagctt 2701 tgagacagag tttgcgaaat tttataccga cagcaatctg gaagagacaa accttcaaaa 2761 atgtcttgat catactcatg aattc // LOCUS YSCILV1A 2766 bp ds-DNA PLN 14-AUG-1990 DEFINITION S.cerevisiae threonine deaminase (ILV1) gene, complete cds. ACCESSION M36383 KEYWORDS ILV1 gene; threonine deaminase. 
SOURCE S.cerevisiae DNA. ORGANISM Saccharomyces cerevisiae Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes; Endomycetales; Saccharomycetaceae. REFERENCE 1 (bases 1 to 2766) AUTHORS Kielland-Brandt,M.C., Holmberg,S., Petersen,J.G.L. and Nilsson-Tillgren,T. TITLE Nucleotide sequence of the gene for threonine deaminase (ILV1) of Saccharomyces cerevisiae JOURNAL Carlsberg Res. Commun. 49, 567-575 (1984) STANDARD simple staff_review FEATURES from to/span description pept 661 756 ORF1 pept 778 2508 threonine deaminase (ILV1) BASE COUNT 797 a 569 c 510 g 890 t ORIGIN 1 gaattcgttc tcattacatg gccgcccacc atgcaccttt acatttcgta tttctttcct 61 ctctgcgcta tatctttggg aaagatttgc aatcaaggca aacacccgct tgtccccttt 121 caattcttgt gttatttcaa aaaccatcct ttcataatat atttgtattt ctttcgccct 181 ggactctacc agttcatatt tagaaccttc cacatatcgt ccaattctgc agcccacatc 241 tttttccacc acgatacggg aaacagaatg ggtccttgga ttctcgctaa acagtctctc 301 tattcccctg ttcaaaaccc tcaagatatt tgtttcccgc aacagctgca attgcaattg 361 atcaatccta tgcgaaaatg ccgagtttat gttattcaag acgcatttta aaaaattcac 421 tagcggctcc ttgaaattca ttatgtctga tgaatatgaa aaccttttcc tgactaccaa 481 gactctttaa ctcttctctc tttattgcat attatctctg ctattttgtg acgttcaatt 541 ttaattgacg cgaaaaagaa aaaataagaa gggcaaaaag aaaaagcgca gcgggtagca 601 aatttggaat cgcataaaaa gaaaaaaaaa atatcaaaga aaaagagtca tctcaaacat 661 atgtctgcag atacttcatt atcagctttg aaaacttttg ttgttgctgc tttgagttct 721 ttcttgtgtg agtgctacaa gccacattta aactaagtca attacacaaa gttagtgatg 781 tcagctactc tactaaagca accattatgt acggttgttc ggcaaggtaa acagtccaaa 841 gtgtctggat tgaacctttt gagactaaag gctcatttgc acagacaaca cctgtcacct 901 tccttgataa aactacactc tgaattgaaa ttggatgagc tgcaaactga taacacccct 961 gattacgtcc gtttagtttt aaggtcctct gtatacgatg ttattaatga atctccaatc 1021 tctcaaggtg taggtttgtc ttcccgtcta aacacgaatg tcatcttgaa aagagaagat 1081 ctattgcctg ttttctcttt caagcttcgt ggtgcctata acatgattgc caagttggac 1141 gattctcaaa gaaaccaggg tgttattgcc tgttcagctg ggaatcatgc ccaaggtgtg 1201 gcctttgctg 
ctaaacactt gaaaatacct gctactatcg ttatgcctgt ttgtacacca 1261 tctattaagt atcaaaatgt ctcgagatta gggtctcaag tcgtcctata tggtaacgat 1321 tttgacgagg ctaaggctga atgtgccaaa ttggctgaag agcgtggctt gacgaacatt 1381 cctcctttcg atcatcctta tgtcattgcc ggtcaaggta ctgtagctat ggaaatccta 1441 agacaagtac gtaccgctaa taagatcggt gctgtctttg ttcccgtcgg cggtggtggt 1501 ttaattgctg gtattggtgc ttatttgaaa agggttgctc ctcatatcaa aactattggt 1561 gttgaaactt acgatgcggc cactttacat aattccttgc aacgcaacca gagaactcct 1621 ttacctgtgg tgggtacttt tgccgatggt acgtctgtgc gtatgattgg tgaagaaaca 1681 tttagagtcg cccaacaagt ggttgatgaa gttgttcttg ttaacactga cgaaatctgt 1741 gctgcagtaa aggatatttt tgaagatact agaagtattg tagaaccatc tggtgccctt 1801 tcagtagccg gtatgaagaa atacatctct accgtacatc cagaaattga ccacactaaa 1861 aacacctatg ttcccatcct ttctggtgct aacatgaact ttgatagatt aagatttgtt 1921 tccgaacgtg ctgttcttgg tgaaggaaag gaagtcttca tgttagttac tttacccgac 1981 gtccctggtg cgttcaagaa aatgcaaaag atcatccacc caagatctgt cactgaattc 2041 tcttaccgtt acaatgaaca tcgtcatgag tcctctagtg aagtgcccaa ggcttacatt 2101 tacacttctt tcagcgtcgt tgacagagaa aaggaaatca agcaagttat gcaacagttg 2161 aatgctttag gttttgaagc tgtggatatc tccgataacg aattggctaa atctcatggt 2221 agatacttgg ttggtggtgc ttctaaggtt cctaatgaaa gaattatttc atttgaattc 2281 cctgaaagac caggtgcctt gactaggttc cttggaggcc taagcgattc ttggaatctt 2341 actttattcc attatagaaa ccatggtgcc gatatcggta aggttttagc tggtatttcc 2401 gttcctccaa gggaaaactt aaccttccaa aaattcttgg aagatttagg ctacacttat 2461 catgatgaaa ctgataacac tgtttatcaa aaattcttga aatattaaaa ccgacaattt 2521 actttataaa tttacgcaac aacttgttag gtaaaatgaa cgtcgtcata atttaaatac 2581 tagacagcgc gtgtatgtta ctcatattta tactcatatc atgtgaacct ataaaccaat 2641 tcaatacaag atttacagtt ctcttgtttt cttcttccag ttttcccctt ttatttcctg 2701 tagaataaac ttaaacagtg atcattaatt tataattttc tgctttgtaa atacggcggt 2761 aatgga // LOCUS YSPRPS6A 1940 bp ds-DNA PLN 14-AUG-1990 DEFINITION S.pombe ribosomal protein S6 (rps6) gene, complete cds. 
ACCESSION M36382 KEYWORDS ribosomal protein S6. SOURCE S.pombe DNA. ORGANISM Schizosaccharomyces pombe Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes; Endomycetales; Saccharomycetaceae. REFERENCE 1 (bases 1 to 1940) AUTHORS Gross,T., Nischt,R., Gatermann,K., Swida,U. and Kaeufer,N.F. TITLE Primary structure of the ribosomal protein gene S6 from Schizosaccharomyces pombe JOURNAL Curr. Genet. 13, 57-63 (1988) STANDARD simple staff_review FEATURES from to/span description pept 495 1214 ribosomal protein S6 (rps6) precursor sigp 495 614 ribosomal protein S6 signal peptide matp 615 1211 ribosomal protein S6 mRNA 478 1240 rps6 mRNA (alt.) mRNA 478 1260 rps6 mRNA (alt.) BASE COUNT 592 a 371 c 355 g 622 t ORIGIN 1 gatcatctct atacgctagc gttttgttga gtaaataatt ctgttaatga tgtacaaacc 61 attgagttca gctagttatt taaaaaatgc aacgagtata tcttataaaa ttaagtcata 121 caagacatca atttcctcgt aacgatgata aatcacatct atggaaacat agcagactat 181 gactttaata ttaattacct ttctccataa attaaacgaa aattaaagat gagatgaaaa 241 gtgataatta tagtaacttg taagttatat tcaacagtaa aatcctaatt tccttttcaa 301 ttttctgatt cacaatgtat atattagcta cccgcgtgtt tgtttactaa aaatagtgca 361 ttaatcatcc atcgcaaaat atttataacg agagatataa agacttgtaa atctatccca 421 aattgtgact gctcttagag gagaaactta gctagtcaaa aatacaacat tactgccggg 481 aacctcaacg caaaatgaag ttaaacattt cttatccagc aaatggtaca caaaaattga 541 ttgaaatcga cgatgatcgt cgtcttcgag tctttatgga gaagagaatg ggacaagagg 601 ttccaggtga ctctgtaggc ccggagtttg ctggatatgt cttcaaaatt accggtggta 661 acgacaaaca aggatttccc atgtttcaag gcgtcttgct tcctcaccgt gttcgtcttt 721 tgcttcgtgc tggccaccct tgctatcgcc cacgccgtga tggtgagcgc aagcgtaaat 781 ctgtccgtgg ttgcatcgtc ggccaagatt tggctgtcct tgctcttgcc attattaaac 841 aaggtgaaca agatatccct ggtttgaccg atgttaccgt ccccaagcgt cttggtccaa 901 agcgtgcctc taagatccgc cgcttcttta acttgtccaa ggaagatgat gtccgtcaat 961 tcgttattcg tcgtgaggtt gttcccaaaa aggaaggaaa gaagccttat accaaggccc 1021 ctaagattca acgtcttgtt actcctagaa ctcttcaaca caagcgccac cgctttgctc 1081 
ttaagcgcag acaagctgaa aagaaccgtg aagaggctgc tgagtttgct caacttatgg 1141 ctaagcgtgt cgctgaggct aagcaaaaga gagaggtcgt caaggctcgc cgtgcctctt 1201 ctttgaaaaa gtaaataggc atttttaagt aactttggtc atctccatat tattattttg 1261 ctggaacttt tcttatgggg tttattgttg ttgaataaac gtagttagta acattctcta 1321 agcatcgttt gtattcactt atgagtagga atgttaactg taattaggat tatcttcacg 1381 aagctgcttt ttgtggtaat ccttcatgct tttaaataat tttaggatct atatagtaga 1441 ggtaattaat gatattctaa tcaatctagc ttcaacacac acatgatcaa tttttatacg 1501 ttttttaatc attcatatac actacgctaa aaccttttac agattgtttt actaaaataa 1561 ttattatctg cgaaacactc cttctatttt cttgaggata tgctaggaga aaatattata 1621 ctataagtat aagagggggt ttgataaaac tatttaatta cctctgctca ctagagtttt 1681 tatcctttat gttcctatct aagtcgccta aatctcaaca atttggcgca gacaaaagtt 1741 gtttgcgttg cgctagtttc tacaaccaca ccactaatac caactagcaa gaaccagaat 1801 ttctgagcta ctttaggtag taaaataaga ggtgaattgt gagccccggt ttgctttttt 1861 agacataagg cgaacacgag ttagtgctgt ctgttgccaa attattttaa ttcggcttta 1921 tcaaaacaat atagcatcga // LOCUS CNAGMDHA 1334 bp ss-mRNA PLN 14-AUG-1990 DEFINITION C.vulgaris glyoxysomal malate dehydrogenase mRNA, complete cds. ACCESSION M33148 KEYWORDS glyoxysomal malate dehydrogenase. SOURCE C.vulgaris (variety Sugar Baby) cotyledon, cDNA to mRNA. ORGANISM Citrullus vulgaris Eukaryota; Plantae; Embryobionta; Magnoliophyta; Magnoliopsida; Dilleniidae; Violales; Cucurbitaceae. REFERENCE 1 (bases 1 to 1334) AUTHORS Gietl,C. TITLE Glyoxysomal malate dehydrogenase from watermelon is synthesized with an amino-terminal transit peptide JOURNAL Proc. Natl. Acad. Sci. U.S.A. 87, 5773-5777 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by C.Gietl, 22-MAR-1990. 
FEATURES from to/span description pept 70 1140 glyoxysomal malate dehydrogenase precursor (EC 1.1.1.37) sigp 70 180 glyoxysomal malate dehydrogenase signal peptide matp 181 1137 glyoxysomal malate dehydrogenase mRNA < 1 1334 glyoxysomal malate dehydrogenase mRNA BASE COUNT 340 a 275 c 340 g 379 t ORIGIN 1 caacgctaag ttcccaaagg tttctgatct tgaagcggtt ggtttgtttt tctgtttgtc 61 aaactaatta tgcagccgat tccggatgtt aaccagcgca ttgctcgaat ctctgcgcat 121 cttcatcctc ccaagtctca gatggaggag agttcagctt tgaggagggc gaattgccgg 181 gctaaaggcg gagctcccgg gttcaaagtc gcaatacttg gcgctgccgg tggcattggc 241 cagccccttg cgatgttgat gaagatgaat cctctggttt ctgttctaca tctatatgat 301 gtagtcaatg cccctggtgt caccgctgat attagccaca tggacacggg tgctgtggtg 361 cgtggattct tggggcagca gcagctggag gctgcgctta ctggcatgga tcttattata 421 gtccctgcag gtgttcctcg aaaaccagga atgacgaggg atgatctgtt caaaataaac 481 gcaggaattg tcaagactct gtgtgaaggg attgcaaagt gttgtccaag agccattgtc 541 aacctgatca gtaatcctgt gaactccacc gtgcccatcg cagctgaagt tttcaagaag 601 gctggaactt atgatccaaa gcgacttctg ggagttacaa tgctcgacgt agtcagagcc 661 aatacctttg tggcagaagt attgggtctt gatcctcggg atgttgatgt tccagttgtt 721 ggcggtcatg ctggtgtaac cattttgccc cttctatctc aggtgaagcc tccaagttct 781 ttcacacaag aagagattag ttacctgact gataggattc aaaatggtgg aacagaagtt 841 gtcgaggcca aagcaggagc tggctcagca actctctcaa tggcttatgc tgccgttaag 901 tttgcagatg catgcctcag gggcttaaga ggagatgctg gtgtcattga atgcgcgttt 961 gtgtcttctc aggtgactga acttccattc tttgcatcaa aagtacgact tggtcgcaat 1021 ggtatcgaag aagtatactc ccttggcccg ctaaatgagt atgagaggat tggattggag 1081 aaagcgaaga aagagttggc aggaagcatt gagaagggag tttccttcat cagaagctga 1141 agagatgcca attaccatta gttttaatag aaacattcca tctcttatag attacttgtg 1201 ctcaatgttt tcctggagat tgaagttgat tgaaatgata ccacaccacg tatttttata 1261 ctaataaaac tatatcgcca tcatgtcgat atttaatgca caaccaaaag ggttggatta 1321 gagtaccttt tatg // LOCUS HAMACAPT 1837 bp ss-mRNA ROD 14-AUG-1990 DEFINITION Hamster N-acetylglucosamine-1-phosphate transferase (GPT) mRNA, 
partial cds. ACCESSION J05590 M22755 KEYWORDS N-acetylglucosamine-1-phosphate transferase; glycosyl transferase; tunicamycin-resistance. SOURCE Chinese hamster ovary cell line CHO-K1, cDNA to mRNA, clone TRG-[4,22]. ORGANISM Cricetulus longicaudatus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Cricetidae; Cricetinae; Cricetini. REFERENCE 1 (bases 889 to 960) AUTHORS Lehrman,M.A., Zhu,X. and Khounlo,S. TITLE Amplification and molecular cloning of the hamster tunicamycin- sensitive N-acetylglucosamine-1-phosphate transferase gene: The hamster and yeast enzymes share a common peptide sequence JOURNAL J. Biol. Chem. 263, 19796-19803 (1988) STANDARD full staff_review REFERENCE 2 (bases 1 to 1837) AUTHORS Zhu,X. and Lehrman,M.A. TITLE Cloning, sequence and expression of a cDNA encoding hamster UDP-GlcNAc: dolichol phosphate N-acetylglucosamine-1 phosphate transferase JOURNAL J. Biol. Chem. (1990) In press STANDARD full staff_entry COMMENT Draft entry and sequence for [1] kindly submitted by M. Lehrman, 04-APR-1989. Draft entry and computer-readable sequence for [1] kindly submitted by M.Lehrman, 12-JUN-1990. 
FEATURES from to/span description pept 148 1374 N-acetylglucosamine-1-phosphate transferase (GPT) binding 67 79 dolichol binding site binding 222 234 dolichol binding site BASE COUNT 363 a 515 c 418 g 541 t ORIGIN 1 tcaggtcata gttgggtttt gttcctgctg gcgcggaata ctcttgcgtg taggaagcgg 61 ctctaccgac tgccgaggcc tctgggattg tgccgggttg ctagttagct cagttggcgg 121 cggagcggcg gcccctggcg ggtcaccatg tgggccttcc cggagttgcc gctgccgctg 181 ctggtgaatt tgttcggctc gctgctggga tttgtggcta ctgtgaccct catccctgcc 241 ttccgtagcc actttatcgc cgcgcgcctc tgtggccagg acctcaacaa gctcagccgg 301 cagcagatcc cagaatccca gggagtgatc tgcggtgctg ttttccttat catcctcttc 361 tgcttcatcc ctttcccctt cctgaactgc tttgtggagg agcagtgtaa ggcattcccc 421 caccatgaat ttgtggccct gataggtgcc ctccttgcca tctgctgcat gatcttcctg 481 ggcttcgctg atgatgtact caatctgccg tggcgccata agctgctgct gcccacagct 541 gcctctctac ctctcctcat ggtttacttc actaactttg gcaatacaac cattgtggta 601 cccaagccct tccgctggat tcttggcctg catttggact tgggaatcct atactatgtc 661 tacatgggac tgcttgcggt gttctgtacc aatgccatca acatcctagc aggaattaat 721 ggcctagagg ctggtcagtc actagtcatc tctgcttcta tcattgtctt caacctggta 781 gagctggaag gtgattatcg ggatgatcat gtcttttccc tctacttcat gataccattt 841 ttttttacca ccttgggatt gctataccat aactggtacc catcacaggt gtttgtggga 901 gataccttct gttattttgc tggcatgacc tttgccgtgg tgggaatctt gggacacttc 961 agcaagacca tgctactctt ctttattcca caagtgttca atttcctcta ctcgctgcct 1021 cagctccttc acgccatccc ctgccctcga caccgcatac ccagactcaa tccgaagacg 1081 ggcaaactgg agatgagcta ttccaagttc aagaccaaga acctctcttt cttgggcacc 1141 tttattttaa aggtagcaga gcgcctccag ctagtgacag ttcaccgagg cgagagtgag 1201 gatggtgcct tcactgaatg taacaacatg accctcatca acttgctact caaaatcttt 1261 gggcccatac atgagagaaa cctcacactg ctcctgctgc ttttgcagat cctgagcagc 1321 gctgtcacct tctccattcg ataccagctt gtccgactct tctatgatgt ctgagttccc 1381 tgaagactga ctgcccatag cctccgttgg acctcaccca ggaccaacct gtctggtcca 1441 agactgcctt ctggtccagg cctctccaac tgttggtttt ttcagttttg ttttcagctt 1501 ctcccatcac ctgtaatgat 
tattggcctg gaccttattg gactttaaag tccattagtt 1561 gcactttgcc cctggctttc ttcagcttgc tactcttccc ctttctgtcc catctgcagc 1621 accataaggg ggatgtagca gctcttatgc aaatacagct caactttcag agccctgcat 1681 ttgctggacc ttgagagaga acctgggcta tgtgctagag ttagggccct atactccaag 1741 gtaacctcac atttgacttt aaaattaagt gttctgatta ggaagatcag aggcagggcc 1801 atgtgctcaa aatggtgaca ataaaggatt gtctttt // LOCUS HS4NA3ABC 10089 bp ds-DNA VRL 14-AUG-1990 DEFINITION Epstein-Barr virus nuclear antigen EBNA-3A, EBNA-3B, and EBNA-3C genes, complete cds. ACCESSION M34440 KEYWORDS nuclear antigen. SOURCE Epstein-Barr virus type 2 (isolate AG876) DNA. ORGANISM Epstein-Barr virus Viridae; ds-DNA enveloped viruses; Herpesviridae; Gammaherpesviridae. REFERENCE 1 (bases 1 to 10089) AUTHORS Sample,J., Young,L., Martin,B., Chatman,T., Kieff,E., Rickinson,A. and Kieff,E. TITLE Epstein-barr virus type 1 (EBV-1) and 2 (EBV-2) differ in their EBNA-3A, EBNA-3B and EBNA-3C genes JOURNAL Unpublished (1990) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by J.T.Sample, 23-MAY-1990. 
Kieff Laboratory Harvard Medical School 75 Francis Street Boston, MA 02115 FEATURES from to/span description pept 189 524 nuclear antigen EBNA-3A, exon 1 610 3051 nuclear antigen EBNA-3A, exon 2 pept 3237 3593 nuclear antigen EBNA-3B, exon 1 3672 6155 nuclear antigen EBNA-3B, exon 2 pept 6277 6636 nuclear antigen EBNA-3C, exon 1 6711 9560 nuclear antigen EBNA-3C, exon 2 IVS 525 609 EBNA-3A intron IVS 3594 3671 EBNA-3B intron IVS 6637 6710 EBNA-3C intron BASE COUNT 2495 a 2837 c 2619 g 2138 t ORIGIN 1 gaattccaga atctcaacat tgaaaaaacc aggatcatca cgcgtgagtt ggataaggct 61 ctggatggtg tactgacaca caagcaaggc tgcctccatt gtctcagcac cgatttctag 121 gcagcaccct ctttaatagg tgcaaggggg gtgcggtgtt ggtgagtcac actttcgttg 181 cagacaaaat ggacaaggac aggccgggtc tcccggcccc ggatgacaac atagaagaag 241 taccatctac ctcgggtgtt caggaacggg cgtctgaggg agattgggaa aatgtcctca 301 tagagatatc agatagcagc tcagaagagg aagcagaaga tgcccacctg gagccatccc 361 agaggggtaa aaagagaaaa cgggtcgatg atgatgccgg tggttcagct ccagcacaac 421 acgtgccccc ccctcagctg gatcaccctg gtcgagaagc cattctctac aggtttccgc 481 tagatttaag aaggtttatt caagcaattg gagccgcagc tacggtgagc ttccctatgg 541 cccaggtgtg tgatgtgtgt ttttgcccat cgcacaacaa ggtaagtgat ttgttgccgt 601 tggtttcagc accccgacac gcgagccata gaccagtttt tcggatccca gatttcaaat 661 accgacctgt acgtaatgta tgccatggcc attcgacagg ccattagaga tcgtcggagg 721 aatccagctt ctcgtagaag tcaggtcaaa tggagaatga ccaccctggc cgctggttgg 781 cctatgggtt accaggcata cagtagctgg atgtacagct acaccgatcc ccaggtgact 841 gccacgatca tacatctgca ggcgactctt gggtgcgcaa gtggccgtag gtgtcatgtg 901 accttttctg ccggcacctt taggccgccg cgatgtagtc ccggggatcg ccagtggttg 961 tatgttcaga gccgcgtggg tgaccttgtg cagagttcta atccatgcta cagtattttc 1021 tttgactaca tggccataca taggagcctc acgaaaatct gggatgaagt ggtaacacct 1081 gatcagcgtg ttacatttat ggaattcctg ggctttttgc aacgcacgga gttggtctac 1141 atcaagagct ttgtcagcta tgccttgggc accactagta tcgaaacacc gtggatggat 1201 gagaatccta gcacagagac ggcacaggct tggaatgccg gcttgctccg ggggcgtgcg 1261 tacgggcaag 
acttgcttag aactgaagga gaacatggcg aaggtgctac ctgtgaaaca 1321 cgggaagaaa gtgaggacac agagagcgat ggggatgatg aagaacttcc tcgtgtagtg 1381 tccagggatg gaactaagca cagacgaccc cctatatttt taagacgcct gcacaggttg 1441 ctgttgatga gagcgggcaa aggaaaggaa cgggccaggg agacactggc gaaggcccct 1501 aggcgcactt atggcacacc taggccgcca gttcagaaac caagaccaga ggtcccgcaa 1561 agctatgaga cagctaccag tcacgggtcg gcgcaagtcc cagaaccccc acccacccac 1621 ccattacatc agcaacacag catggccccg tgtatggtag ctcagaaccc acgtgcaccc 1681 ttaggggacc aactcccagg tgttcctaaa gatggacgag gggcgtgtgc accggtaccc 1741 gccctggccg ggcctattgt ccggccctgg gagtcatccc tgttacagtc tccgggaagg 1801 gcctttgcac ccgttagccc acaacccatg ccagtagaac ccgtccctgt ccctactgtg 1861 gcacttgagc gaccagtttg tcccgcacct cctgagattg ctatgcaggg cccgggggaa 1921 ccttctggca ttaaacgcac acgggagcgt tggaggcccg caccgtggac gccaaaccca 1981 ccccgctctc ccagtcagat gtccgtgcgt gaccgtctgg ctcgtttgcg tgctgaggca 2041 caggcccgtc aggctagtgt tgaggtgcag cccacccagt tgacccaagt atcccctcag 2101 caaccaatgg agaggccgtt ggaaccagag cagcagatgt tccctggttc cccctttagc 2161 caggtcgctg atgttgcccg ggaatctggg gtacctgcaa tgcagcctca gtactttgac 2221 ctccccttaa ctcaacccat tagccagggg gcacccgcgg ccccgttgag ggctagtatg 2281 ggcccggtac ctccggtacc ggcaacacag ccacagtatt ttgacatccc cttaactgaa 2341 cccattaacc agggggcatc cgcggcccat tttctccctc agcaaccaat ggaggggccg 2401 ttggtacccg agcggtggat gttccaaggt gccaccctga gccagagtgt taggccaggg 2461 gtagcgcagt cacaatattt tgacctcccc ttaactcaac ccattaacca tggggcaccc 2521 gcagcccatt tcctccatca gccaccaatg gaggggccgt gggtacccga gcagtggatg 2581 ttccaaggtg ccccccctag ccaaggcact gacgtggtcc aacatcagct ggatgatttg 2641 gggtatccac tccatgatct caaccatccc ggggttcccg tgtctcctgc cgttaaccaa 2701 tatcatttca gccaggctgc ctttgggtta cctattgatg aggatgagag tggcgagagg 2761 tccgatacct ccgagccgta tgaagctctt gatttgtcaa tccatggcag gccctgccct 2821 caggcccccg aatggcctgt tcaaggggag ggtggccagg atgccaccga ggttcttgat 2881 ttgtcaatcc atggcaggcc ccgccctcgg acccccgagt ggcctgttca aggggagagt 2941 ggccagaatg tcacagacca 
tgaacctaga agggtggtgg tgtcagctat tgttcacatg 3001 tgtcaggatg acgagtttcc ggatctacaa gatcctccag atgaggccta agcaaaggtg 3061 tagaagtgtg tccccctcca ttccacccac tgatatacgc ccgacaataa agttgatgat 3121 attgaattcc acacctactc gtgtttgtga ttttatttca tattccatga gagagacctc 3181 gcatatttgc agagggtcac tgaaacattt tatcttaaaa cagttacacc tgaaaaatga 3241 agaaagcgtg gctcagcaga gcacagcaag ccgatgccgg gggggcatct ggctccgagg 3301 acccaccaga ttatggagat caaggtaatg tgcaacaggt gggatctgat cctatttcac 3361 ctgcgattgg cccctttgaa ctctctgcgg ccagtgagga tgatcctcaa tctgggccag 3421 tggaagagaa tttagatgcc gctgcaagag aggaagagga acctgatgag caggagcaca 3481 atggtggtga tgatcccttg gaagtccata ctcggcagcc tagatttgtg gatgtgaacc 3541 caacgcaggc tccagtgatc caactagtcc atgctgttta tgattccatg ttggtaagag 3601 gcacctagaa catttccaga tgtttagctt ggatattttg gccagtctta atttattgtc 3661 attggtttca gcaatcggac ctccggtctc taggcagttt attccttgag caaaacctga 3721 acatcgaaga gtttatatgg atgtgcatga cagtgcgtca cagatgtcag gccatcagac 3781 aaaaaccatt accgattgat aagcagaggc ggtggaagct cctgtcacct tacagaacct 3841 ggcgtatggg ttaccgtacg caaaccctca atgtaaacag ttttgagaca gggggagata 3901 aagtccaccc actccttgtg actgctacgc taggatgtga agagggcctg cggcatgcaa 3961 taacttacag tgctggcatt gtacagctac cacgaatgtc agaccaaaac caaaagatag 4021 aaacagcctt cttgatggca cgtcgtgcta ggtcactttc ggcagaaaga tatactttgt 4081 tctttgattt agtatcctct ggaaacaccc tgtatgccat atggattggg ctgggcacga 4141 gaaaccgagt tgcatttgtc gagtttgtag gatggctatg taagaaggac cacactcata 4201 tacgtgaatg gttccgccag tgcaccggga gaccctcacc atccaagcca tggatgagag 4261 cgcatcccgt cgccgttcct tatgacgatc cattaacaag tgaggagact gacctggcct 4321 atgcccgtgg actggccatg agtatcgagg ctgctagact gccagatgat ccaataattg 4381 ttgaggatga cgatgaaagt gaggaaattg aagataaatg tgataaggat gaagaggaaa 4441 gtggaacgga agatgttaca agcataccgc aaacactgcc gcacagtcca acagtatacg 4501 gcaggccctc ggtgttttac cgaaagccag atactaaatc aaccaaaaaa tgcagggcca 4561 tagtgactga ccttagtata atcaaggtca ttgaagatga acacagaaag aagaagacag 4621 ccagaacaga gcaaccaaga gccaagcctg 
attcccctgc ccccacagtg gtccttcggc 4681 gaccacccac gcaaaaggtg actggccctg ccggttcact gagtgtccag gctcagctgg 4741 agccatggca acctttgtcc tggccacatg agacaagagt tatacttcac ggaccaccca 4801 cgcagggtga ccaagcacac ggttccatgc tagaccttct tgaaaaggac gaccagcaca 4861 tggagcagca ggttatggca accctactgc caccagaacc acaccagccc cggtctggga 4921 gaagagcccc ttgtgtctac accgctgacc tagacataga aagtgatgag cccgccacgt 4981 cagagccggt tcttgatcag ctactgcccg ccccaggact tggacctctt gcaattcaac 5041 cattaacgtc ccccaccacg tctcaactcc gcagttcagc accgagccac gcacaaactc 5101 catggccggt tacccaccca agtcagactc caggtggccc aacgacacag tccctggcac 5161 cggaaacaga agccccgcgc cagtggccaa tgccactgcg acctatccct ctgcacccct 5221 tgcggatgca gccaatatca tttaatcctg cagtgagacc cactccccat cagccacctc 5281 aggtggagcc cactttctat cagtccactt gggtgaaacc ccctcaacaa taccagcctc 5341 agatggggca cattccatat cagccccgac caacgggtca ctctactatg ctccggcccc 5401 agtgggcacc caccaccatg cagccaccac caagggcgcc cactcccatg ccgccacctc 5461 aggggccacc caccgctatg cagaggcctc agggggcgcc cactcccatg ccgccacctc 5521 aggggccacc caccgccatg cagaggcctc ggggggcgcc cactcccatg ccgccacctc 5581 aggggccacc caccgccatg cagctgtcac caagggcact taccggccag aaggggccag 5641 caaagcacat tctgcgccag ttgttaacgg ggggcgtcaa gagtgggaga ccatcactta 5701 agtttaaggc tgcccttgag cgtcaagccg ctgcgggctt gcgaccttca ccagggtctg 5761 gaacgggtgc caagattgtg caggcacctg ttttctatcc acccgtccta cagcccatac 5821 aagttatgtg gcaagtgggt tcctcaaagg ccgtggccgc ctcaacggtg acacaggcac 5881 ccacggaata taccggggaa aggaggttag gggggcctat gtctcccact gatattccgc 5941 cgtctaaacg ggtgaagaaa aaggcctatc cagagcgcaa gacgccgcat ggggggccct 6001 cacactcttc caccgttatg tgggagaatg tcagccaggg acaacagcag actctggagt 6061 gcggaggaac tgataaacag gaaaggaaca tgttggggat gggggacatt gcagtttctt 6121 ccccttcctc ttctgaaaca tcgaatgatg agtgatttca accacgtaac aagaactggg 6181 atgaaccctc gggcagtatc agactgcggg gaggggcagt gataagtcat gacaatttta 6241 gatgaggtag acattttgca tattttcaga cccaccatgg aatcatttga aggagagggg 6301 gactctatac agtcacctga caatgcgcgg ggagatgatg 
tacagaatac tggtgagcat 6361 attcaggacc ccgggccggg gccttcaacc ggcggggctt ctgagggatt ggtgcagaac 6421 gagccggact caagagatca acagtcccgg gggcagagaa ggggtgatga aaacagaggc 6481 tggatgcagc gcatcaggcg aaggcggaga agacgggccg cattgtccgg ccatctttta 6541 gacatggaag acaatgtgcc gccgtggttt cctccacacg atatcacacc atatgtcgca 6601 aggaatatca gggacgctgc ctgccaggct gtcaaggtga gcatgcctct aactgggttc 6661 atgggggcca tctaaggccc acgtgtgacc catgtttcca ttaattttag cactcgcacc 6721 tgcaagcgct atcaaacctg atactcgata gtggtttaga cacacaacac ctcttgtgct 6781 tcgtgatggc agccaggcag cgtcttcagg acattcgacg tggacccttg gttgtagagg 6841 gaggtgttgg ttggcgacat tggcttctga catctcccag ccgatcctgg tccatgggat 6901 atcgcacagc aacactacgc acattaactc ccgtgcctaa cagggttggg gctgacagca 6961 tcatgttaac tgccacgttt ggatgccaaa atggggcact agctataaac accttctccg 7021 ccaccgtgtg gataccaccc cctgctggac caagagagca agaaagatac gctcgggaag 7081 ccgaggtgcg cttccttcgt ggtaaatggc agaggcggtt ccgaagaatc tttgatttga 7141 tagaactgtg tggctctctg caccacgtct ggcaaaacat gctgcagacc gaggagaacc 7201 ttttagattt cgtgcgtttc atgggtgtca tgtccagctg caatagttca tctgtgaatt 7261 actggtttca caagacaatc ggaaacttta agccatatta cccgtggaat gcaccaccta 7321 atgaaaatcc atatcacgca cggagaggca taaaagaaca agtaatccag aaagcatttc 7381 taaaggcaca aagacagggt ttatcaatgt tagcaacggg aggtggaccc agaggtgatg 7441 ctactagtga aacgagcagc gatgaggata ccggtagaca gggttcggat gtggagctag 7501 agtcatcgga cgatgagctg ccatatatcg atcccaacat ggagccagtt cagcagaggc 7561 ccgtcatgtt tgtgagccgt gtgcctgtaa ggaaaccgag gacactgccg tggcctacac 7621 ccaagacgca cccagtgaag cgcacaattg ttaagacctc ctatagatct gatgaggcag 7681 aagaagcaca gagcacccct gaaaggccgg gcccttccaa acaaccatca gagcccgtgg 7741 agcccgccca cacaacccca gcggggaggt caacggtgat tctccacgaa ccacctcgag 7801 agcccgaagc tgtttccttc aagcctccgc caccaccttc ccggaggaga aggggagcgt 7861 gtgttgtata tgacgatgat atcatagagg tgattgatgt tgaaaccacc gaagaggaga 7921 cgacgtcaat gcaaagacag cctccgctcg ggcaacaacc gccccccccc gtgatttcta 7981 ccgggagtgc catgtcttcc agtcacacgg atccatcagt aacacagcca 
agtaagccac 8041 atcggaaacc tcaagacgga tttcaacgtt caggccgacg tcaaaaacga gccatgcctc 8101 ccccagtgag tccttctgac gctgggcctc cttccaccag gcctcgtgtc atggcgcctc 8161 cttccaccgg gcctcgtgtc atggcgactc cttccaccgg gcctcgtgac atggcgcctc 8221 cttccaccgg gcctcgtgac atggcgcctc cttccaccgg gcctcgtgac atggcgcctc 8281 cttccaccgg gcctcgtgac atggcgccta ccgtcgtaca tatgtttacg agggagcgcc 8341 tgctcaccca gtccaccggc cctgcacctc ggagcttctg ggaaatgcgg gccggccgtg 8401 atgctcccaa aattcaacaa gaaccaagtt cacaacagca gccagccact cagtctacac 8461 cgccttgcca atcatgggtc ccatctgtct atgtcctccc ggcagtggat gctggtaatg 8521 cccagcccct acaaatatca cacttgagct ccatgtcgcc cacgcagccg atatcgcatg 8581 aagaacaacc ccggtatgag gatcctgaca ctcctctgga tttgagttta catccagata 8641 ctgcaactct gccgcccacc caggatttat accctggacg cgaggatctg caggccaccc 8701 aggctccata cccgggatac gaggagccac ggccccccca ggctcctttt gtaggtgact 8761 atggttttgt tcaaattccc tcggctcaat gggagccaca cccctctcag ggcacatacc 8821 agggccacat tgatccccag cttccggctg ctctagattt agggccagag cagccccggt 8881 tcccacaaga tccatatgtg tattccgggg gccaattgtc ttcatgtcca ggttatgcag 8941 gtccctggcc ctcgaggcct caacatccaa gatataggca caccttggca ttgtggcctc 9001 gagaacccag gcatgggcac tctcagggcc catggaaacc ctggtcagca catctcccac 9061 ctcagtggga tggatctgca ggacatggcc aggatcaggt ctcccagttc ccacatctgc 9121 attcggagac aggcccacca cgtcttcaac tttcatcggt gccacaggtg ctgtacccgc 9181 agccactggt ctcatcctct gcaccatcgt ggtcatctcc ccagccccga gcccccatac 9241 gccccattcc aacaagattc ccccctcccc ctatgccgtt acaagatagc atggccgtgg 9301 ggtgcgactc atcaggtaca gcatgcccaa gcatgccctt tgccagtgat tacagtcaag 9361 gtgcatttac cccactggac attaatgccc ccacgccaaa aagtcctcga gtagaagaaa 9421 gttctcacgg acccgcccgg tgttcccaag ctacttctga agcacaggag attcttagtg 9481 acaattctga gatctccgtg ttcccaaaag atgcaaagca gactgactat gatgcatcca 9541 ctgaaagtga gctagattaa ggggatccaa ggtgacccct gttagctatt tgatctttga 9601 ctgacacata aacatggttt aaggaatgaa cactcatggt gtgagactgg aactgtacta 9661 aatttgctga catatgtaca atgagagcca aaaatttgat aaaccttaaa agtcccccca 
9721 tctaatgatg tccagttccc ttctcccacc ctgtacaccc cgacccaaag ggactcaatg 9781 gcattcagat ttctagttac cacaggtaga atatcgggcg ttggcccata aaaataagtg 9841 catggatata gctctgcaca ggcttggaaa cacccattcc aggtgtgctt ctttttggtg 9901 aaataaaaac agcgttcttt atatgaaaat gtgtattctc ttgtgttgca gtatgtacag 9961 ttagctttgg tatagttttt gggtacctga aacgtgtgca gggtgggtgt ccaatgtggc 10021 agttttatct ctttgtcccc atactcctgc tcggccgtct tgttaaagtt aaccggcggt 10081 ggaggatcc // LOCUS HUMPAPB 3061 bp ss-mRNA PRI 14-AUG-1990 DEFINITION Human prostatic acid phosphatase mRNA, complete cds. ACCESSION M34840 KEYWORDS glycoprotein; prostatic acid phosphatase; secretory protein. SOURCE Human prostate, cDNA to mRNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 1199) AUTHORS Van Etten,R.L., Davidson,R., Stevis,P.E., MacArthur,H. and Moore,D.L. TITLE Covalent structure, disulfide bonding and identification of reactive surface and active site residues of human prostatic acid phosphatase JOURNAL Unpublished (1990) STANDARD full staff_entry REFERENCE 2 (bases 1 to 3061) AUTHORS Van Etten,R.L., Davidson,R., Stevis,P.E., MacArthur,H. and Moore,D.L. TITLE Covalent structure, disulfide bonding and identification of reactive surface and active site residues of human prostatic acid phosphatase JOURNAL Unpublished (1990) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by R.L.Van Etten, 08-JUN-1990. Chemistry Department Purdue University West Lafayette, IN 47907 FEATURES from to/span description pept 15 1175 prostatic acid phosphatase precursor (EC 3.1.3.2) /hgml_locus_uid="LA0168F" /nomgen="ACPP" /map="3q21-qter" sigp 15 110 prostatic acid phosphatase signal peptide matp 111 1172 prostatic acid phosphatase BASE COUNT 917 a 642 c 681 g 821 t ORIGIN Chromosome 3q21-qter. 
1 cggctctcct caacatgaga gctgcacccc tcctcctggc cagggcagca agccttagcc 61 ttggcttctt gtttctgctt tttttctggc tagaccgaag tgtactagcc aaggagttga 121 agtttgtgac tttggtgttt cggcatggag accgaagtcc cattgacacc tttcccactg 181 accccataaa ggaatcctca tggccacaag gatttggcca actcacccag ctgggcatgg 241 agcagcatta tgaacttgga gagtatataa gaaagagata tagaaaattc ttgaatgagt 301 cctataaaca tgaacaggtt tatattcgaa gcacagacgt tgaccggact ttgatgagtg 361 ctatgacaaa cctggcagcc ctgtttcccc cagaaggtgt cagcatctgg aatcctatcc 421 tactctggca gcccatcccg gtgcacacag ttcctctttc tgaagatcag ttgctatacc 481 tgcctttcag gaactgccct cgttttcaag aacttgagag tgagactttg aaatcagagg 541 aattccagaa gaggctgcac ccttataagg attttatagc taccttggga aaactttcag 601 gattacatgg ccaggacctt tttggaattt ggagtaaagt ctacgaccct ttatattgtg 661 agagtgttca caatttcact ttaccctcct gggccactga ggacaccatg actaagttga 721 gagaattgtc agaattgtcc ctcctgtccc tctatggaat tcacaagcag aaagagaaat 781 ctaggctcca agggggtgtc ctggtcaatg aaatcctcaa tcacatgaag agagcaactc 841 agataccaag ctacaaaaaa cttatcatgt attctgcgca tgacactact gtgagtggcc 901 tacagatggc gctagatgtt tacaacggac tccttcctcc ctatgcttct tgccacttga 961 cggaattgta ctttgagaag ggggagtact ttgtggagat gtactatcgg aatgagacgc 1021 agcacgagcc gtatcccctc atgctacctg gctgcagccc tagctgtcct ctggagaggt 1081 ttgctgagct ggttggccct gtgatccctc aagactggtc cacggagtgt atgaccacaa 1141 acagccatca aggtactgag gacagtacag attagtgtgc acagagatct ctgtagaaag 1201 agtagctgcc ctttctcagg gcagatgatg ctttgagaac atactttggc cattaccccc 1261 cagctttgag gaaaatgggc tttggatgat tattttatgt tttagggacc cccaacctca 1321 ggcaattcct acctcttcac ctgaccctgc ccccacttgc cataaaactt agctaagttt 1381 tgttttgttt ttcagcgtta atgtaaaggg gcagcagtgc caaaatataa tcagagataa 1441 agcttaggtc aaagttcata gagttcccat gaactatatg actggccaca caggatcttt 1501 tgtatttaag gattctgaga ttttgcttga gcaggattag ataagtctgt tctttaaatt 1561 tctgaaatgg aacagatttc aaaaaaaatt cccacaatct agggtgggaa caaggaagga 1621 aagatgtgaa taggctgatg gggaaaaaac caatttaccc atcagttcca gccttctctc 1681 aaggagaggc aaagaaagga 
gatacagtgg agacatctgg aaagttttct ccactggaaa 1741 actgctacta tctgttttta tatttctgtt aaaatatatg aggctacaga actaaaaatt 1801 aaaacctctt tgtgtccctt ggtcctggaa catttatgtt ccttttaaag aaacaaaaat 1861 caaactttac agaaagattt gatgtatgta atacatatag cagctcttga agtatatata 1921 tcatagcaaa taagtcatct gatgagaaca agctatttgg gcacaacaca tcaggaaaga 1981 gagcaccacg tgatggagtt tctccagaag ctccagtgat aagagatgtt gactctaaag 2041 ttgatttaag gccaggcatg gtggtttacg cctataatcc cagcattttg ggactccgag 2101 gtgggcagat cacttgagct caggagctca agatcagcct gggcaacatg gtgaaacctt 2161 gtctctacat aaaatacaaa aacttagatg ggcatggtgc tgtgtgccta tagtccacta 2221 cttgtggggc taaggcagga ggatcacttg agccccggag gtcgaggcta cagtgaccca 2281 agagtgcact actgtactcc agccagggca agagagcgag accctgtctc aataaataaa 2341 taaataaata aataaataaa taaataaaaa caaagttgat taagaaagga agtataggcc 2401 aggcacagtg gctcacacct gtaatccttg cattttggaa ggctgaggca ggaggatcac 2461 tttaggcctg gtgtgttcaa gaccagcctg gtcaacatag tgagacactg tctctaccaa 2521 aaaaaggaag gaagggacac atatcaaact gaaacaaaat tagaaatgta attatgttat 2581 gttctaagtg cctccaagtt caaaacttat tggaatgttg agagtgtggt tacgaaatac 2641 gttaggagga caaaaggaat gtgtaagtct ttaatgccga tatcttcaga aaacctaagc 2701 aaacttacag gtcctgctga aactgcccac tctgcaagaa gaaatcatga tatagctttc 2761 catgtggcag atctacatgt ctagagaaca ctgtgctcta ttaccattat ggataaagat 2821 gagatggttt ctagagatgg tttctactgg ctgccagaat ctagagcaaa gccatccccc 2881 ctcctggttg gtcacagaat gactgacaaa gacatcgatt gatatgcttc tttgtgttat 2941 ttccctccca agtaaatgtt tgtccttggg tccattttct atgcttgtaa ctgtcttcta 3001 gcagtgagcc aaatgtaaaa tagtgaataa agtcattatt aggaagttca aaaaaaaaaa 3061 a // LOCUS LDV3ECPA 1064 bp ss-RNA VRL 14-AUG-1990 DEFINITION Lactate dehydrogenase-elevating virus capsid protein gene, complete cds. ACCESSION M34856 KEYWORDS viral capsid protein. SOURCE Lactate dehydrogenase-elevating virus, cDNA to viral RNA, clones A16, B24, B63, B90, B103, C44, dt4. 
ORGANISM Lactate dehydrogenase-elevating virus Viridae; ss-RNA enveloped viruses; Positive strand RNA virus; Togaviridae. REFERENCE 1 (bases 1 to 1064) AUTHORS Godeny,E.K., Speicher,D.W. and Brinton,M.A. TITLE Map location of lactate dehydrogenase-elevating virus (LDV) capsid protein (Vp1) gene JOURNAL Virology (1990) In press STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by E.K.Godeny, 06-JUN-1990. FEATURES from to/span description pept 637 984 capsid protein pept 135 650 unidentified ORF BASE COUNT 267 a 223 c 265 g 309 t ORIGIN 3' terminus region. 1 gtttttcgga accgcagttc cgtccttgtt gagcagcacg gaaaagtatt gctgcaaggc 61 caacccattg aggtgaagac agttgtgctc gacggtgtca aagctgtccg ggctaaaacc 121 gttccagccg aaaaatggga ggcctagaat tttgtgatca aacttcctgg tatcagattt 181 tcattgcctt tagtttgact tataccccaa ttgccatcta tagcttaaaa gtttttcgtg 241 gcactttggc aggtattgta aatattttta tttttattaa ttgttgtgta agttttgtgt 301 atttgatgta tcatcattct gtcaccaaca caattgcttt gtcattgggt gctgtgattg 361 ctcttgtgtg gggcatttac acccttgtga agatagttga ttggttggtg atcaggtgcc 421 ggttgtgctt cctaggtaga agttacatcc tagccccacc cagccacgtg gacacctctg 481 acgggcgtca gagcctaacc acatcgttaa caaccgcctt tgtggttagg aagccaggta 541 gtacccttgt aaacgggcag ttggtcccgg actttcaaag actcgtgctt gggggcaaga 601 aggccgtgtc aaaaggtgcg gttaaccttt tgaaatatgt ctcaaaataa gaagaagggc 661 ggccagaata aaggggcaaa ccagcaactt aatcagctga ttagtgcgtt gcttcgcaat 721 gccggacaaa ataaaggaaa ggggcaaaag aaaaagaagc agcccaagct gcatttcccc 781 atggcaggtc catctgacct gcggcatgtt atgactccca acgaggttca gatgtgtcga 841 tcttcattgg tgaccctttt taatcagggc ggaggtcagt gcacattggt tgactctggt 901 ggaataaatt tcaccgtgag tttcatgttg ccaacgcatg ccacagtccg gttaattaat 961 gcttctgcta attcatctgc ttaggagtta caatgtaagt catgtcagtc agatgcagcg 1021 actcagcctt ttgtaattaa ttgcgatttg gctgggccgg aatt // LOCUS MBIFRHADGB 3903 bp ds-DNA BCT 14-AUG-1990 DEFINITION M.thermoautotrophicum hydroxy-5-deazaflavin-reducing hydrogenase (frhADGB) gene, 
complete cds. ACCESSION J02914 KEYWORDS 8-hydroxy-5-deazaflavin-reducing hydrogenase. SOURCE M.thermoautotrophicum (strain delta-H) DNA. ORGANISM Methanobacterium thermoautotrophicum Prokaryota; Bacteria; Mendosicutes; Archaeobacteria; Methanobacteriales; Methanobacteriaceae. REFERENCE 1 (bases 1 to 3903) AUTHORS Alex,L.A., Reevell,J.N., Orme-Johnson,W.H. and Walsh,C.T. TITLE Cloning, sequence determination, and expression of the genes encoding the subunits of the Ni containing 8-hydroxy-5-deazaflavin reducing hydrogenase from Methanobacterium thermoautotrophicum delta-H JOURNAL Biochemistry (1990) In press STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by L.A.Alex, 25-MAY-1990. FEATURES from to/span description pept 298 1515 F420 hydrogenase alpha-subunit (frhA) pept 1524 2000 F420 hydrogenase delta-subunit (frhD) pept 2114 2824 F420 hydrogenase gamma-subunit (frhG) pept 2837 3682 F420 hydrogenase beta-subunit (frhB) binding 284 293 ribosomal binding site (put.) binding 1514 1520 ribosomal binding site (put.) binding 2100 2106 ribosomal binding site (put.) binding 2823 2832 ribosomal binding site (put.) 
BASE COUNT 1201 a 878 c 965 g 859 t ORIGIN 1 ctgcagcaac agctccaatt atcaggacta tcgccaccga tgctagaacc tttgagttta 61 tcatcaactc accttcctgg tgttaacaac acatttagta tgtgggtcaa gatatataac 121 actcagtatt actgaacaaa aaattaatat tataaactct gttttctact aataataagt 181 gatctataaa atgatataaa tagttttcac tttttttata attgaaaaac ttataacatt 241 taaaaatagt tataatattg aaggtttttt gaactttaaa aacaagaggt gatacatttg 301 agcgaaagga ttgttatatc gccgacatca cgacaagaag gacatgcaga acttgtcatg 361 gaagtcgatg atgagggaat cgtgacaaag gggcgatact tcagtattac tcctgtcagg 421 ggccttgaga aaatagtgac aggtaaagca cctgaaacag cacctgtcat cgtccagagg 481 atatgtggag tatgccctat accacacacc ctggcttccg ttgaggcaat agacgactcc 541 cttgacattg aggttccaaa ggccggaaga cttctccgtg aacttacact tgcagcacac 601 cacgtaaaca gccacgcaat acatcacttc ctcatagcac cggacttcgt gcctgaaaac 661 ctcatggcag acgccataaa ctcagtctcc gagataagga aaaacgcaca gtacgtagtt 721 gacatggttg caggtgaggg tatacaccca tccgatgtca ggataggtgg aatggccgac 781 aacataacgg aactcgcaag gaaaaggctg tatgcgaggt taaagcagct caaaccaaaa 841 gtcgacgaac acgttgaact catgatagga ttaatcgagg ataagggcct tccaaagggc 901 cttggagtcc acaaccagcc aacacttgca agccaccaga tctacggtga caggacaaaa 961 ttcgatctgg acaggttcac cgaggtaatg cctgagagct ggtacgatga ccctgaaata 1021 gccaagaggg cctgctcaac aataccactc tacgatggaa gaaacgtgga ggtcggtcca 1081 agagcaagga tggttgaatt ccagggattc aaggagaggg gtgttgttgc acagcacgtt 1141 gcaagggcac ttgagatgaa aacggcactc gcaagggcaa ttgaaatcct tgatgaactg 1201 gacacatccg caccggtaag ggcagacttc gatgagagag gcacaggtaa actgggtgta 1261 ggtgcaattg aaggtccaag gggactggac gttcacatgg cccaggttga aaatggtaag 1321 atacagttct acagtgcact cgtcccaaca acctggaaca tcccaacaat gggtcctgca 1381 actgaaggat tccaccatga atacgggcca cacgtcatac gcgcttacga cccctgtctc 1441 tcatgcgcaa cacacgtgat ggttgttgat gatgaggaca gatcagtaat cagggatgaa 1501 atggtcagac tctagggaat aacatgccat acgacgctga gattctagtg gtgggctgtg 1561 gaaacatcct tttcaaggac gatggattcg gcccagaggt tatcaaggcc cttgaagagt 1621 acttcaagga tagggaaaaa ccagataatg ttatgttcat tgatgctggg 
actggcggtc 1681 cgcactttgt cttctccctc ccccatgagg agtggaagaa gatgatagtt gtggatgttg 1741 ttgaattcaa cgcagaaccc ggaacactca ggaaattcga tgtcactgag attccaaagg 1801 gatcctatga aaatatgcat acatggccag tgagccagcc cctccatgaa ctcagtgaaa 1861 agatcgatgt tgtggtaata gggtgtaaac ccaaggagat atcagctccc aatgtggaaa 1921 tgggcctcac acccccagta aaaaaggcta ttcccagagc cattcagatg atcttagatg 1981 agattggggt ttctaaatga gcttaattgc ccgcatcaaa agatttttag gattggaggc 2041 tgaagctaag agggaagaac ccgaaaaaga aaaatcggaa cctgttggag cttcaaaaga 2101 ggaggttgaa aaagtggctg aagaaaatgc aaaaccaaga ataggttaca ttcacctcag 2161 tggatgtacc ggagatgcca tgtcgttaac tgaaaattac gacattctag cagaattact 2221 caccaacatg gtggacatag tatacggaca gaccctggtg gatctctggg agatgccaga 2281 gatggatctg gcccttgttg agggatctgt ctgtctgcag gacgaacaca gcctgcacga 2341 actcaaagaa ctgagggaga aggcaaaact cgtctgcgcc ttcggttcat gcgcacagac 2401 aggctgcttc acaaggtact caaggggcgg acagcaggca cagccatcac acgagtcctt 2461 tgtaccaata gcagacctca tagatgtgga cctcgccatt ccagggtgcc caccatcacc 2521 tgagataata gcaaaggcag tcgttgcact cctcaacaat gacatggagt acctccagcc 2581 aatgctggac cttgcaggct acacagaggc atgcggatgc gacctccaga caaaggttgt 2641 aaaccagggt ctctgcactg gatgtggaac atgtgcaatg gcctgccaga caagggccct 2701 tgacatgacc aacggaagac ccgaactcaa cagcgaccgc tgtataaaat gtggaatctg 2761 ctatgtgcag tgcccaagaa gctggtggcc agaagaacag atcaaaaagg agttagggct 2821 ataggaggct ggaaaaatgg ttttaggtac ttacaaggaa atagtttccg ccagatcaac 2881 tgacagagag attcagaaat tagcccagga tggaggaata gttacaggtc ttctagcata 2941 tgcccttgac gagggcataa tcgaaggcgc agttgttgca ggacccggaa aggagttctg 3001 gaagccagaa ccaatggtcg ccatgacctc agatgaactc aaggcggctg caggtaccaa 3061 gtacacattc tcaccaaacg tcctgatgct caagaaggca gtgaggcagt acggtataga 3121 gaaactgggt acagttgcaa taccctgcca gaccatgggt ataaggaagg cccagaccta 3181 cccattcggt gtcaggtttg ttgcagataa gataaaactc ctggtcggta tctactgcat 3241 ggagaacttc ccatacacat cactgcagac cttcatctgt gaaaaactcg gattaaacat 3301 ggagctcgtt gagaagatgg acataggtaa aggaaaattc tgggtctaca cccaggacga 
3361 tgtctacaca ctccccctca aggagaccca tggatacgaa caggcaggat gcaagatctg 3421 caaggactat gtggctgagc tggcagacgt atcaacaggt tcagtgggat caccggatgg 3481 ctggtcaaca gtcataacaa ggaccgattc aggggactca atattcaagc aggctgttga 3541 ggcaggtata ttcgagacca aaccaataga ggaagtcaaa cctggcctgg gacttcttga 3601 aaaactctct gcacagaaga aggaaaaggc agagaagaac atcgctgcaa ggaaagagat 3661 gggattacca acaccatact gatcccatcc ctaaattttc tatttttcag aacattcagt 3721 ccattcaact gcatcctcaa gttaactatt ctgttataac cgtgcagccg tcctcctcaa 3781 caatgactgt atgttcccac tgggcgacca ttgcaccgct cttttccctc agcacatggt 3841 aggggtatat ggcccttgac tgtatcagca ttctcatgga ggcattcagt ctctttgcat 3901 cga // LOCUS MUSLRCNA 1751 bp ds-DNA ROD 14-AUG-1990 DEFINITION Mouse loricrin gene, complete cds. ACCESSION M34398 KEYWORDS cell envelope protein; loricrin. SOURCE Mouse (strain BALB/c) new born epidermal keratinocyte, cDNA to mRNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 1751) AUTHORS Mehrel,T., Hohl,D., Rothnagel,J.A., Longley,M.A., Bundman,D., Cheng,C., Lichti,U., Bisher,M.E., Steven,A.C., Steinart,P.M., Yuspa,S.H. and Roop,D.R. TITLE Identification of a major keratinocyte cell envelope protein, loricrin JOURNAL Cell 61, 1103-1112 (1990) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by J.A.Rothnagel, 22-MAY-1990. 
FEATURES from to/span description pept 71 1516 loricrin signal 1740 1745 polyA signal BASE COUNT 223 a 493 c 706 g 329 t ORIGIN 1 acatcagcat cacctccttc cctcactcat cttccctggt gcttcagggt tccccttctc 61 cttaaacaag atgtctcacc agaaaaagca gcccactccc tgccctcctg tgggttgtgg 121 aaagacctct ggtggaggag gaggcggcgg cggctattat agcggtggcg gctctggctg 181 cggaggcggc tcatctggag gaggctctag ctgtggaggc ggaggcggtg gttcctatgg 241 aggtggttcc agctgcggcg gtggaggcgg ctccggtggg ggcgtcaagt actccggagg 301 cggcggtggc tctagctgcg gcggcggcta ctccggaggc ggtggtggct ctagctgcgg 361 cggtggctac tctgggggcg gcggcggctc cagctgcgga ggtggctact ccggaggcgg 421 cggcggctcc agctgcggcg gcggcagcta ctccgggggt ggctccagct gtggaggcgg 481 tggcggctct ggtgggggcg tcaagtactc cggaggtggt ggcggcggcg gctctagctg 541 cggcggcggc tcctccgggg gcggcggcgg cggctccagc tgcggaggcg gatcaggagg 601 cggcggctcc tactgcggag gctcctctgg aggcggcagc tccggtggct gcggcggcgg 661 ttccggaggc ggcaagtact ctggtggcgg cggtggctcc agctgcggag gcggctattc 721 cggcggcggt ggaagcagcg gcggctctag ctgtggcggc ggctactcag gtggcggtgg 781 atccagctgc ggcggcggcg gcggctattc cggtggcggc ggcacgagct gcggaggtgg 841 ttcctccggt ggcggcggcg gcggatcgtc ccaacagtat cagtgccaga gctacggagg 901 cggttctagc ggtggctcca gctgcggcgg cggctactcc gggggcggag gctccagctg 961 cggtggcggc tactccgggg gcggaggctc tagctgcgga ggcggctcct ctggtggtgg 1021 ctccagttgc ggcggcagcg gcggcggcgg ctattccggt ggtggcggtg gcagctgcgg 1081 cggcggctcc tctggcggcg gagggggcta ttactcctct cagcagacca gtcagacctc 1141 ctgcgccccc cagcagagct acggaggggg ctcttccgga ggaggtggta gctgtggagg 1201 tggctcctct ggcggcggtg gcggcggtgg ctgctactcc agcggtggtg gcggcagcag 1261 cggtggctgc ggtggaggct actccggagg cggcggtggc tgtggcggcg gctcttccgg 1321 gggcagcggc ggtggctgcg gaggtggctc ttccggaggc agcggcggtg gctgcggagg 1381 aggctactcc ggaggcggag gcggtggctc cagctgcgga ggcggctcct ctggtggcgg 1441 ctctggaggt ggcaagggtg tgccagtctg ccaccagacc cagcagaagc aggcgcctac 1501 ctggccgtgc aagtaaggtc accgggttgc aacggagaca acagagctgg aagagttctc 1561 cgtgggcgcc gatgggctta actttctcat 
gaatttgcct gaggtttcca aacccttcac 1621 attttaagcg ccccttcccc cagaagaagc cattgagtcg ctcaaggtgt atcctgttct 1681 gcagattttt catcttggtt tctgaatgac tacctcccaa ttctagtgtc tcctcagtca 1741 ataaatttgc t // LOCUS PFAGIPA 1906 bp ds-DNA INV 14-AUG-1990 DEFINITION P.falciparum glucosephosphate isomerase (Pfggi.R1) gene, complete cds. ACCESSION J05544 KEYWORDS glucosephosphate isomerase. SOURCE P.falciparum (strain 7G8) DNA. ORGANISM Plasmodium falciparum Eukaryota; Animalia; Protozoa; Microspora; Microsporea; Microsporida; Haemosporina; Plasmodiidae. REFERENCE 1 (bases 1 to 1906) AUTHORS Kaslow,D.C. and Hill,S. TITLE Cloning metabolic pathway genes by complementation in Escherichia coli: Isolation and expression of Plasmodium falciparum glucosephosphate isomerase JOURNAL J. Biol. Chem. 265, 12337-12341 (1990) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by D.C.Kaslow, 04-JUN-1990. FEATURES from to/span description pept 28 1803 glucosephosphate isomerase (EC 5.3.1.9) BASE COUNT 782 a 174 c 289 g 661 t ORIGIN Chromosome 14. 
1 tttttttttt tttattaata tttttctatg aatatggaga ttacaaattt gaagagctat 61 aaggaacttg tcaccttaag cgctgaagaa aaaacaaagg atctaaagga ctatttaaat 121 gataagaata gatctgaatc gttaataaag aagtttaaga atttttatat ggatttatct 181 cgtcaaaggt atagcgaaaa gactttaaac aaattagttg aatatgctga agaggtagaa 241 ttaaagaaga aggttgaaaa aacatttatg ggagaaaaag ttaatatgac agagaatcgc 301 agtgttttac atacagcttt aagaataccc atagagaaaa taaatacaca taaaataata 361 attgataata agaatgtgtt agaagacgta catggagtat taaaaaaaat agagaaatat 421 tcagatgaca ttagaaatgg ggtaataaaa acttgtaaaa atacaaaatt taaaaatgtg 481 atttgtattg gtattggagg ttcatattta ggtactgaat ttgtttatga agctatgaaa 541 tactattatt ataacatgga attaaataag aatgagaaag atcaagtaaa taattttaat 601 aataattatg atcaagataa tgtatttaat gtaaggtttt tagctaatgt agatccaaat 661 gatgtaaata gagctattca aaatttggat cagtatgata ctttagttat tataatatct 721 aaaactttta ctactgctga aactatgtta aatgctaggt caataaaaaa atggttaagt 781 ttaaaaataa aagatgatga aaatttaagt aaacatatgg tagctgtaag tactaatttg 841 aaattaactg atgaatttgg aatatcaaga gataatgtat ttgaattttg ggattgggta 901 ggaggtcgat tttctgtaac tagttctgtt ggtatattac ccttatcaat tgcttttgga 961 tataaaaata tgagaaattt tttaaatggt tgtcatgata tggatgaaca ttttttacat 1021 gcagatttaa aagaaaacat tcctgtctta ttagcattaa ctagttttta taatagtcat 1081 ttttttgatt ataaaaatgt tgctatatta ccatatttcc aaaatttatt aaaattttca 1141 gcacatattc aacaattatc tatggaaagt aatggaaaat cagtagatag aaataatcaa 1201 ccaatccatt ataatacttg tcaagtatat tttggtgaac ctggtacaaa tggtcagcat 1261 agtttttatc aattaataca tcaaggacaa gttatacctg ttgaattaat tggatttaaa 1321 cattctcatt tcccaattaa atttgataaa gaagtagtaa gtaatcatga tgaattaatg 1381 actaactttt ttgcacaagc tgatgcatta gctattggga aaacatacga acaagtaaaa 1441 gaagaaaatg aaaaaaataa aatgtctcca gaattattaa ctcataaagt ttttaacggt 1501 aatagaccat caaccttatt attatttgat gaattaaatt tctatacatg tggtttatta 1561 ttatccttat atgaaagtag aattgtcgct gaaggatttc tattaaatat taacagcttt 1621 gatcaatggg gggtagagtt gggtaaggtt ctagcaaaag aagtaagaaa ttattttaat 1681 gacacaagaa atcaaaagaa 
atcagataat acctataatt ttaatgaatc tacaaaaaat 1741 tttattaaat tattacttgt ccaaataaaa aaaaaaaaaa aaattaacac aaatttaaag 1801 taatatatgt aaatatatat atataaatat ataaatatat atatatatat atatatactt 1861 atatatttat gtgtgaaagt caaatatggg tttcttttct tctttt // LOCUS RATGADA 3198 bp ss-mRNA ROD 14-AUG-1990 DEFINITION Rat glutamic acid decarboxylase (GAD) mRNA, complete cds. ACCESSION M34445 KEYWORDS glutamic acid decarboxylase. SOURCE Rat brain and hypothalamus, cDNA to mRNA, (libraries of D.Chikaraishi and S.Heinemann) clones RB10.7, ZAPG, pCR24RI and pCR22HP. ORGANISM Rattus rattus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 3198) AUTHORS Wyborski,R.J., Bond,R.W. and Gottlieb,D.I. TITLE Characterization of a cDNA coding for rat glutamic acid decarboxylase JOURNAL Mol. Brain Res. (1990) In press STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by D.I.Gottlieb, 23-MAY-1990. 
FEATURES from to/span description pept 184 1965 glutamic acid decarboxylase mRNA < 1 3198 glutamic acid decarboxylase mRNA signal 3174 3179 polyA signal BASE COUNT 869 a 742 c 767 g 820 t ORIGIN 1 gaattccgaa gcagccccgg ggtgacaccc agcacgtacg tctgtggcag agcaaagcca 61 agcgggggac gttcgcggag gagtcgcggg agggtccagc tccctgtggc tgaatcgagc 121 ccgttcctgc gcccagaccg cgggggacac ttgaacagta gagaccccaa gaccaccgag 181 ctgatggcat cttccacgcc ttcgcctgca acctcctcga acgcgggagc ggatcctaat 241 actaccaacc tgcgtcctac aacatatgat acttggtgtg gcgtagccca tggatgcacc 301 agaaaactgg gcctgaagat ctgtggtttc ttgcaaagga ccaatagcct ggaagagaag 361 agtcgtcttg tgagcgcctt cagggagagg caggcctcca agaacctgct ttcctgtgaa 421 aacagtgacc ctggtgcccg cttccggcgc acagagacgg acttctccaa cctgtttgct 481 caagatctgc ttccagctaa gaacggggag gagcaaactg tgcagttctt actggaggtg 541 gttgacatac tcctcaacta tgtccgcaag acgtttgatc gctccaccaa ggttttggac 601 ttccaccacc cacaccagtt gctggaaggc atggaaggtt ttaatttgga gctgtctgac 661 caccccgagt ctctggagca gatcctggtt gactgtagag acaccctaaa gtacggggtt 721 cgcacaggtc accctcggtt tttcaaccag ctctctactg gtttggatat cattggttta 781 gctggcgaat ggctgacatc aactgccaat accaatatgt tcacatatga aattgcaccc 841 gtgtttgttc ttatggaaca gatcacactt aagaagatgc gagagatcat tggatggtca 901 aataaagatg gtgatgggat attttctcct gggggagcca tatccaacat gtacagcatc 961 atggcggctc ggtacaagta ctttccagaa gtgaaaacaa aaggcatggc ggctgtgccc 1021 aaactggtcc tcttcacctc agaacacagt cactattcca taaagaaggc tggggctgcg 1081 cttggctttg gaactgacaa tgtgatcctg ataaagtgca atgaaagggg gaagataatt 1141 ccagctgatt tagaggcaaa aattcttgat gccaaacaaa agggatttgt tcccctttat 1201 gtcaatgcaa ccgcaggcac gactgtttat ggagcgtttg atccgatcca ggagattgca 1261 gacatatgtg agaaatacaa cctttggctg catgtggatg ctgcctgggg tggtgggctg 1321 ctcatgtccc ggaagcatcg ccacaaactc agcggcatag aaagggccaa ttcagtcacc 1381 tggaaccctc acaagatgat gggtgtgctg ctccagtgtt ctgccatcct ggtcaaggaa 1441 aagggtatac tccaaggatg caaccagatg tgtgcaggct acctcttcca gccagacaag 1501 cagtatgacg tctcctacga tacaggggac aaggcgattc 
agtgtggccg ccatgtggac 1561 atcttcaagt tctggctgat gtggaaagca aagggcactg tgggatttga aaatcagatc 1621 aacaaatgcc tggagctggc tgaatacctc tatgccaaga ttaaaaacag agaagagttt 1681 gagatggttt tcaatggtga gcctgagcac acaaatgtct gtttctggta cataccacaa 1741 agcctccgag gggttccaga cagccctgag cgacgagaaa aactacacag ggtggctccc 1801 aagatcaaag ccctgatgat ggagtcagga acaaccatgg ttggttacca gcctcaaggg 1861 gacaaggcca acttcttccg gatggtcatc tccaacccag ccgccaccca gtctgacatc 1921 gatttcctca tcgaggaaat agagaggttg ggtcaagatc tgtaatcccc ccccctttgc 1981 agaaccgtaa tcaccggcca tagccctgcc cctctggcac cctagaacgc agttctgtca 2041 gtagttgaca cacctaggcc atttcactga aggaaattat aatctcttaa agaatatttg 2101 tcacatctca cgtaagcttg tttgttagaa ttagcgggaa ataatgttct tttaaaaaaa 2161 ttgcacatta gaaacacagt atatatgtac agttatatat atatatatac ctctctctat 2221 gtactatgta tgtatagtga atgtggctta gtcctagacc acagcatgtc gctcgctccc 2281 agggaattag cctcaccccc agcaattact aagaggctaa ccatctcgca agcaactgca 2341 ggacagatgg tacccccact gcagtgtcct agggacccag ggaaaggccg ttgctgggag 2401 cctaccaccc tgctagagct gtccccacct gaatggatgg atgatggatg gcaggaagta 2461 ccagtagatg acaaatgtca caccctccct gttagtgccc tgctagggga tatagtagca 2521 gagtctttgt cacaattgtg ctgttgctgc gttttagaga ttaatctgtg taaactgtac 2581 attaccgttg tctatctttg ggcaggggga agtgcatata atgatttaat tgtacgtcag 2641 tgagatattt gcttatttat attcaaatat atatcatgtt aaagagacat cttgtatttt 2701 cttcccattt gtaatgtatc ttatttatat attaatggag taagttctgg atactgttta 2761 tggtattttc gtgcatttgt gagccaaaga gaaaagatga aaattagtga gacttgcatt 2821 tatatttgag tgcccttaac ataatgattt gaacacatgt gtactgtctg gaaacgaatt 2881 ctgatactgt acatagagtg gcgttatgga aatcttgctt cagtagcctt tgctcttctc 2941 tttcccccct caggctgtac gtcaggtgtt ctcaaagctt ttctagtaac tgttgaaaaa 3001 taatgactag atctcctgta attttgtagt agtatatgac caatctctgt gactcgctta 3061 gctgaaacct aaggctatgt ttccgaagat ctccgataca ctgaccagtc ccacaagtgt 3121 ttttgaagac atgaaaccca cactgtgcat ttagagtatg caagaagaat ataaataaaa 3181 taaaaaatat tctccatg // LOCUS RATTSHRA 5270 bp ss-mRNA 
ROD 14-AUG-1990 DEFINITION Rat thyroid stimulating hormone receptor mRNA, complete cds. ACCESSION M34842 KEYWORDS TSH receptor; thyroid stimulating hormone receptor; transmembrane protein. SOURCE Rat (strain Fisher) thyroid epithelial cell line FRTL-5, cDNA to mRNA. ORGANISM Rattus norvegicus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 5270) AUTHORS Akamizu,T., Ikuyama,S., Saji,M., Kosugi,S., Kozak,C., McBride,O.W. and Kohn,L.D. TITLE Cloning, chromosomal assignment, and regulation of the rat thyrotropin receptor by thyrotropin, agents which increase cAMP levels and thyroid autoantibodies JOURNAL Proc. Natl. Acad. Sci. U.S.A. 87, 5677-5681 (1990) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by L.D.Kohn, 05-JUN-1990. FEATURES from to/span description pept 55 2349 thyroid stimulating hormone receptor precursor sigp 55 117 thyroid stimulating hormone receptor signal peptide matp 118 2346 thyroid stimulating hormone receptor signal 5256 5261 poly-A signal BASE COUNT 1415 a 1343 c 1176 g 1336 t ORIGIN 1 gcggccgcgt tgacagcgcg caacgatgaa gtagcactgg aggtcccttg gaaaatgagg 61 ccagggtccc tgctccagct cactctgctg ctcgccctgc ccaggagcct ctggggcaga 121 gggtgtactt ctccaccctg cgaatgccac caggaggacg acttcagagt cacctgcaag 181 gaactccacc aaatccccag cctaccgccc agcacccaga ctctgaagct catcgagact 241 cacctgaaga ccattcccag tcttgccttt tcgagcctgc ccaatatttc caggatctat 301 ctatccatag atgccactct gcagcgactg gagccacatt ctttctacaa tttgagtaaa 361 atgactcaca tagaaatccg gaacaccaga agcttgactt acatagaccc tgatgcctta 421 acagagctcc ccttgctcaa gtttcttgga attttcaata ctggacttag aatattccct 481 gacttgacca aaatttattc cacggatgtg ttctttatac tagaaatcac agacaaccct 541 tacatgacgt cagtccctga aaacgcattc cagggactat gcaatgaaac cttgaccctg 601 aaactataca acaatggatt tacttcaatc caaggacatg ctttcaatgg aacaaagctg 661 gatgctgttt acctaaacaa gaataaatat ctgacagcta tagacaaaga 
tgcctttgga 721 ggagtataca gtggacccac tttgctagac gtgtcttcca ccagcgttac tgctcttcct 781 tccaaaggcc tggagcacct caaagagctg atcgcgaaga acacctggac tctcaaaaag 841 ctccccctgt ccttgagctt cctccacctc actcgggctg acctctctta cccaagtcac 901 tgctgtgctt ttaagaacca gaagaaaatc aggggaatcc tagagtcttt gatgtgtaat 961 gagagtagta tccggaacct gcgtcaaaga aagtcagtga acgtcatgag gggtcccgtc 1021 taccaggaat atgaagaagg tctgggtgac aaccatgttg ggtacaaaca aaactccaag 1081 ttccaggagg gcccaagcaa ctctcactat tacgtcttct ttgaagaaca agaggacgag 1141 atcatcggtt tcggccaaga gctcaaaaat cctcaggaag agactctcca agccttcgac 1201 agccactatg actacactgt gtgtggggac aacgaggaca tggtgtgtac ccccaagtca 1261 gacgagttta acccctgtga agatatcatg ggctacaagt tcctgagaat cgtggtatgg 1321 tttgtcagtc cgatggctct cctgggcaac gtctttgtcc tgttcgtcct gcttactagc 1381 cactacaaac tgaccgtgcc acgcttcctg atgtgcaacc tggcctttgc agatttctgc 1441 atgggggttt acctgcttct cattgcctcc gtagacctgt acacacacac tgagtactac 1501 aaccacgcca ttgactggca gacgggccct ggatgcaaca cggctggttt cttcactgtt 1561 tttgccagcg agttatcagt gtacacactg acggtcatca ccctggagcg atggtatgcc 1621 atcaccttcg ccatgcgcct ggataggaag atccgcctca ggcacgctta taccatcatg 1681 gcgggggggt gggtttcctg cttccttctc gccttgctcc caatggtggg aataagcagc 1741 tacgccaagg tcagcatctg cctgccgatg gacacagaca cccctcttgc cctggcgtac 1801 atcgcccttg ttctcctgct caacgttgtt gcctttgtta ttgtctgctc ctgctatgtg 1861 aagatttaca tcaccgtccg aaatccccag tacaaccctc gagataaaga caccaagatt 1921 gccaagagga tggccgtgtt gatcttcact gacttcatgt gcatggcgcc catctccttc 1981 tatgccctgt cagcgcttat gaacaagcct ctgatcactg tcactaactc cggggtcttg 2041 ttggttctct tctacccact caactcctgt gctaacccat ttctctatgc cattttcacc 2101 aaggccttcc agagggatgt gttcatccta ctcagcaagt ttggcctctg caaacaccag 2161 gcccaagcct atcaggccca gagagtctgt cccaacaaca acactggtat tcagatccaa 2221 aagatccccc aggacacaag gcagagtctc cccaatgtgc aagataccta tgaaccgctt 2281 ggaagctccc acctgactcc gaaactacag ggacggatct cagaagagta cacgcaaaca 2341 gccctgtaaa ggaagcgtac gctagtcatg gtgggactta caaaaggctg gtttctagaa 2401 
catgcactcc agttccagga cacgtgaatg cctaggtcca cggggtcaga gttcatctct 2461 agggagtatt gcctccaaga agggagagaa gctatcggcg tgactgagtc ctaggtgatc 2521 taaaagtcgt ccaatctctc tggaagattt gctcgatgct tgcctgtgat gccgggcaac 2581 gcgctccact gataaccagc tggactgttc ccacagctct ctactccatt ctacggagtg 2641 ttttatatga aagattcagc aagtggccag tgttcattta cgcgagcagc tccacattag 2701 ctcccctgag gtaaccaaag aaagtctttg ttccttaaaa ataaaatgcc atacttggga 2761 ggtggaggca gggggaccca gttcatactt aactacccag aaaattccaa gccaacctat 2821 gctccaacaa tagcactggt attcagatcc aaaagatccc ccaggacgca aggtagagtc 2881 cttctccatc tcaagagacc aaaagaaccc tgaaacgcca aacataactt gatgtcacat 2941 aattgctttt attaaactga aaaaagagaa agtctagttt tgagagttct ttttttcctt 3001 tgactctgaa acagttatct cttcaccggg cttgatgagc tcgatgtaac ccagctgtta 3061 cacacattgg ctagagaaac tgctaagatt tcagcttttg tggctgagcg aactaagaat 3121 tatcttctcg gacaagacat gagccctggg agagatttct aaggagcata tggtgaggac 3181 agggcgcact tagtcacctg ttgaataaga caactatccc cagttaaccg tcagctttga 3241 gcaagtccct ggacctctaa attctgtaga aatggggaga cttgatgccc tctcctactt 3301 aaaaaaaacc ctcagtatat ctctccctca aaagatagtg tctgtgacaa agagaaaaaa 3361 gcattactta agtggaatct aatttttttt ttctaccttg tgaggctgct ggctgcttct 3421 ttttggaacc gggacatgtc accgaaaaca ctttttcttt atttcatttt gatcatgcgt 3481 ctgagccaat actattgagt ggctatacgg agccagatct taattattca tctaattatt 3541 atattctgta ctgctctcaa gtccaaaaca taacagaact tagagaatct gttactaata 3601 cagcctggct tggtagtgta cgcctttaat ctcagcactg gggaggcaga ggctgaattt 3661 gaggccagcc tggtccaaga cagccaggtc tacacagaga aaccctgttt caaagaaaac 3721 aaaacaaaat agactaataa tggagttaac acgtgactgt gtaggctcac ttcaaaacat 3781 ctttttcttt tacttttttt tatagctctc tgggctcaca ccacactaac actgtcaaga 3841 gagaagtcgt ttcacccgac actgccagtt ccttctatcc ctgccagtca catgagacat 3901 gtgtgccctg atagtagcct tgcttccaac aggaactgta cctacaggtg ttccttctag 3961 gttgcctgtc tccttagctc cttaccatat tgttacggtg tccccccaag cagatctgcc 4021 atttgcttct cagtactcat gaaccaaaca ggaaagataa gggttttgcc accttaagtc 4081 tgtgaagtag 
ggtttatggg tcacctgaca tggctcctat ctgactggcc tggaactctg 4141 tcacagtcac gtaagtgtag gctttcccag gtgctcgctt gtaactactg aaataaataa 4201 atgtggtgcc ctgaggcatc cgtttctctc ttacgtcact tttggaatgt cacatctgga 4261 cagataaaat atttcatctc gagtagccga tctaatgctt tgaaaaccaa gtcctagaag 4321 agtgacttcg acaaaattgc tagtaaaatt caacttgaca gcatgaagta aagaaacctc 4381 ctgtagtaag atatatagta ttttttttta atttgtcaat ttcttttgag gttaaccttg 4441 tatgtgcatg tgttaaaagc aaaaagggaa catctgctgt ggcactcagg agagtagccc 4501 tgcccctccc cttggtcata cggtagatct gaccccggat gtgggagttg caggtgagct 4561 ggcccggagg acatgaattc agaagagcgg gccctgcttt gccttctgtg catgcagtgg 4621 catagactag ggaaagatgt ccttctaccc tctcttatta cttgtcacct atggcaggca 4681 ggagagctgt gcccaaagac atgagagcag gggtatggcc ctgtccctca ctatctgcaa 4741 cactctggag agcaggccct acaccttgcc tgggcagcag ggtagagctg gccctggtta 4801 tgggagtcac tggtgagcca gttctgaagg caggagaacc agtggactga ccagctcaga 4861 taccactcag cccagatcca gggctttaaa ttggctcact ctaacatcta cccatcagtg 4921 aactactgga gtacatgaag caggcagtcc tacagatcca aaactacaca caggatctcc 4981 atgatgcacg acaacaacag gatatccaag aagagccctg gtgaggttcc agcattgatg 5041 gtggagcaga aggcagaggg cttataccag aacaaggact cgctgcaatg aacatttgca 5101 agcaaaagaa gtgtggacaa aagggtatac tggacactac agcatctgca gtgaggtttt 5161 gtccctctgt tgggacagga agcatgcgtg gaggatggat atgaggggaa aagatgagtg 5221 gtttggggta cagggtgtaa agttcacaaa gaatcaataa aaaattttcg // LOCUS STMBLIPA 840 bp ds-DNA BCT 14-AUG-1990 DEFINITION S.clavuligerus beta-lactamase inhibitory protein gene, complete cds. ACCESSION M34538 KEYWORDS beta-lactamase inhibitory protein. SOURCE S.clavuligerus (strain NRRL 3585) DNA, clone pBLIP. ORGANISM Streptomyces clavuligerus Prokaryota; Bacteria; Firmicutes; Streptomycetaceae. REFERENCE 1 (bases 1 to 840) AUTHORS Doran,J.L., Leskiw,B.K., Aippersbach,S. and Jensen,S.E. 
TITLE Isolation and characterization of a beta-lactamase inhibitory protein from Streptomyces clavuligerus and cloning and analysis of corresponding gene JOURNAL Unpublished (1990) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by S.E.Jensen, 25-MAY-1990. Dept. of Microbiology University of Alberta Edmonton, Alberta Canada T6G 2E9 FEATURES from to/span description pept 67 672 beta-lactamase inhibitory protein precursor (bliP) sigp 67 174 beta-lactamase inhibitory protein signal peptide matp 175 669 beta-lactamase inhibitory protein binding 52 63 ribosomal binding site signal 686 724 transcription termination signal BASE COUNT 140 a 248 c 320 g 132 t ORIGIN 1 atcgatgcca gggttgagca ggccctcagc gagggcggag gcttggtcga cggaagggga 61 cgggcgatga ggacagtggg gatcggcgcg ggggttcgga ggctggggcg ggccgtggtc 121 atggcggcgg ccgtgggtgg gctggtgctc gggtcggcgg gggcttcgaa cgcggcgggg 181 gtgatgaccg gggcgaagtt cacgcagatc cagttcggga tgacacgtca gcaggtcctc 241 gacatagccg gtgcggagaa ctgtgagacc ggcgggtcgt tcggggacag catccactgc 301 cgggggcacg cggcagggga ctactacgcc tacgccacct tcggcttcac cagcgccgcc 361 gccgacgcga aggtggactc gaagagccag gagaagctgc tggccccgag cgccccgacg 421 ctcaccctcg ccaagttcaa ccaggtcacc gtggggatga ccagggccca ggtactggcg 481 accgtcgggc aggggtcctg caccacctgg agtgagtact acccggccta tccgtcgacg 541 gccggggtga ccctcagcct gtcctgcttc gatgtggacg gttactcgtc gacggggttc 601 taccgaggct cggcgcacct ctggttcacg gacggggtgc ttcagggcaa gcggcagtgg 661 gaccttgtat aaggacgcgg tttcactgtg cggggcggat cacccggtga tccgccccgc 721 acggccatga accaggggga tgggaaacat cagtacgtca gaaattctcg tcgccgaagg 781 agtcgacctc tcctacggcg atcagcccgc cgtgcgggat gcccggatct cggtggtacc // LOCUS TRBPARPA 763 bp ss-mRNA INV 14-AUG-1990 DEFINITION T.brucei procyclin (pPS-1) mRNA, complete cds. ACCESSION M34790 KEYWORDS procyclic acidic repetitive protein; procyclin. SOURCE T.brucei brucei (strain TREU 427, MITat 1.4), cDNA to mRNA, clone pPS-1. 
ORGANISM Trypanosoma brucei Eukaryota; Animalia; Protozoa; Sarcomastigophora; Mastigophora; Kinetoplastida; Trypanosomatina; Trypanosomatidae. REFERENCE 1 (bases 1 to 763) AUTHORS Dorn,P.L., Aman,R.A. and Boothroyd,J.C. TITLE Inhibition of protein synthesis results in super-induction of procyclin RNA JOURNAL Unpublished (1990) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by J.C.Boothroyd, 04-JUN-1990. Stanford University School of Medicine Dept of Microbiology and Immunology Sherman Fairchild Science Building Stanford, CA 94305-5402 USA FEATURES from to/span description pept 29 466 procyclin mRNA < 1 763 procyclin mRNA rpt 215 393 12 bp degenerate repeats BASE COUNT 192 a 215 c 156 g 200 t ORIGIN 1 ccaattacac caaaaagtaa aattcacaat ggcacctcgt tccctttatc tgctcgctct 61 tcttctgttc agccccaacc tcttcgctgg cgtgggattt gccgcagccg ctgaaggacc 121 agaagacaag ggtcttacta agggaggcaa aggcaaaggc gagaagggaa ccaaggtcgg 181 cgccgacgat accaatggca ctgaccccga ccctgaaccc gaacctgaac ccgaacctga 241 acctgaaccc gaacctgaac ccgaacctga acctgaacct gaacccgaac ccgaacctga 301 acctgaacct gaacctgaac ccgaacctga acctgaacct gaacctgaac ccgagccgga 361 acctgaacca gaacctgaac cagaacctga acctggtgct gcaacgctga aatccgttgc 421 acttccgttc gcaatcgcgg ctgctgctct cgttgccgca ttctaagcgg atgcaagcgt 481 gtaaagcgcc tcggaggaac gaaacccttt gaaaaggttc ctttcattta tatcgcctcc 541 atatggtgca tcgtgtttgt ttcctgctgt ttcttgtaaa acaagtgtgg acattcattt 601 aatatttttt cgttatattt ttttggtgac atcctttcta atgccttatt aaccatcgcc 661 tgagacccac agccctgtag atttctgtga tgtttcggtt gcgtattcca taattttaag 721 cgtttcactt ctattttttt tcattccttt gaatttggat ctt // LOCUS W10PPIBCA 4761 bp ds-DNA BCT 14-AUG-1990 DEFINITION Plasmid pWR100 (from S.flexneri) antigen ippI, ipaB, ipaC genes, complete cds, and ippD gene, 3' end. ACCESSION M34849 KEYWORDS antigen. SOURCE Plasmid pWR100 (from Shigella flexneri type 5, strain M90T) DNA, clones HB101/pHS[4108,5357]. ORGANISM Plasmid pWR100 Prokaryota; Bacteria. 
REFERENCE 1 (bases 1 to 4761) AUTHORS Baudry,B., Kaczorek,M. and Sansonetti,P.J. TITLE Nucleotide sequence of the invasion plasmid antigen B and C genes (ipaB and ipaC) of Shigella flexneri JOURNAL Microb. Pathog. 4, 345-357 (1988) STANDARD full staff_entry REFERENCE 2 (bases 1 to 538 and 3987 to 4761) AUTHORS Baudry,B., Kaczorek,M. and Sansonetti,P.J. JOURNAL Unpublished (1988) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by B.Baudry, 05-JUN-1990. Center for Vaccine Development University of Maryland at Baltimore 10 South Pine Street Baltimore, MD 21201 FEATURES from to/span description pept 606 1073 polypeptide I (ippI) pept 1079 2821 polypeptide B (ipaB) pept 2841 3932 polypeptide C (ipaC) pept 3983 > 4761 ipaD protein pept < 1 549 unidentified ORF binding 595 600 ippI ribosomal binding site binding 1065 1070 ipaB ribosomal binding site binding 2829 2834 ipaC ribosomal binding site binding 3971 3977 ipaD ribosomal binding site signal 556 561 ippI -10 region signal 2723 2728 ipaC -35 region signal 2745 2750 ipaC -10 region BASE COUNT 1769 a 952 c 805 g 1235 t ORIGIN 1 aagcttgtaa agaaaatatt atctgttttt aacttaaaac aacgctttcc tcagaagaat 61 tttggatgcc cggtgaatat aaataaaatc agggacagcg tcatagacaa aataaaagat 121 agtaattcgg ggaatcaact attttgttgg atgagccaag agcgaaccac ttatgtctct 181 tcaatgataa acagaagtat agatgaaatg gccatacaca atggagttgt tttgacctca 241 gataacaaaa aaaacatatt tgcagctatt gaaaaaaaat tcccggatat aaaacttgat 301 gaaaaatcag cacaaacatc tatcagtcat acagcactaa acgagattgc ctcatctggc 361 ctcagagcaa agattttaaa acgctatagc agtgacatgg atttgtttaa cacacaaatg 421 aaggatctta caaatctagt atccagttcc gtctacgata aaatatttaa tgaatcaaca 481 aaagttttac aaatagaaat aagcgcagag gttctaaagg ctgtataccg tcaaagcaat 541 acaaattaaa tcttttatat taaatcttat acttattaaa gtaaagcaga caaaaaggag 601 accttatgtc tttaaatatc accgaaaatg aaagcatctc tactgcagta attgatgcaa 661 ttaactctgg cgctacactg aaagatatta atgcaattcc tgatgatatg atggatgaca 721 tttattcata 
tgcttatgac ttttacaaca aaggaagaat agaggaagct gaagttttct 781 tcaggttttt atgtatatac gacttttaca atgtagacta cattatggga ctcgcagcta 841 tttatcagat aaaagaacag ttccaacaag cagcagacct ttatgctgtc gcttttgcat 901 taggaaaaaa tgactataca ccagtattcc atactggaca atgtcagctt cggttgaaag 961 cccccttaaa agctaaagag tgcttcgaac tcgtaattca acacagcaat gatgaaaaat 1021 taaaaataaa agcacaatca tacttggacg caattcagga tatcaaggag taattattat 1081 gcataatgta agcaccacaa ccactggttt tcctcttgcc aaaatattga cttccactga 1141 gcttggagac aatactatcc aagctgcaaa tgatgcagct aacaaattat tttctcttac 1201 aattgctgat cttactgcta accaaaatat taatacaact aatgcacact caacttcaaa 1261 tatattaatc cctgaactta aagcaccaaa gtcattaaat gcaagttccc aactaacgct 1321 tttaattgga aaccttattc aaatactcgg tgaaaaatct ttaactgcat taacaaataa 1381 aattactgct tggaagtccc agcaacaggc aagacagcaa aaaaacctag aattctccga 1441 taaaattaac actcttctat ctgaaactga aggactaacc agagactatg aaaaacaaat 1501 taataaacta aaaaacgcag attctaaaat aaaagaccta gaaaataaaa ttaaccaaat 1561 tcaaacaaga ttatccgaac tcgatccaga gtcaccagaa aagaaaaaat taagccggga 1621 agaaatacaa ctcactatca aaaaagacgc agcagttaaa gacaggacat tgattgagca 1681 gaaaaccctg tcaattcata gcaaacttac agataaatca atgcaactcg aaaaagaaat 1741 agactctttt tctgcatttt caaacacagc atctgctgaa cagctatcaa cccagcagaa 1801 atcattaacc ggacttgcca gtgttactca attgatggca acctttattc aactagttgg 1861 aaaaaataat gaagaatctt taaaaaatga tctggctcta ttccagtctc tccaagaatc 1921 aagaaaaact gaaatggaga gaaaatctga tgagtatgct gctgaagtac gtaaagcaga 1981 agaactcaac agagtaatgg gttgtgttgg gaaaatactt ggggcacttt taactatcgt 2041 tagtgttgtt gcagcagctt tttctggagg agcctctcta gcactggcag ctgttggttt 2101 agctcttatg gttacggatg ctatagtaca agcagcgacc ggcaattcct tcatggaaca 2161 agccctgaat ccgatcatga aagcagtcat tgaaccctta atcaaactcc tttcagatgc 2221 atttacaaaa atgctcgaag gcttgggcgt cgactcgaaa aaagccaaaa tgattggctc 2281 tattctgggg gcaatcgcag gcgctcttgt cctagttgca gcagtcgttc tcgtagccac 2341 tgttggtaaa caggcagcag caaaacttgc agaaaatatt ggcaaaataa taggtaaaac 2401 cctcacagac cttataccaa 
agtttctcaa gaatttttct tctcaactgg acgatttaat 2461 cactaatgct gttgccagat taaataaatt tcttggtgca gcgggtgatg aagtaatatc 2521 caaacaaatt atttccaccc atttaaacca agcagtttta ttaggagaaa gtgttaactc 2581 tgccacacaa gcgggaggaa gtgtcgcttc tgctgttttc cagaacagcg cgtcgacaaa 2641 tctagcagac ctgacattat cgaaatatca agttgaacaa ctgtcaaaat atatcagtga 2701 agcaatagaa aaattcggcc aattgcagga agtaattgca gatctattag cctcaatgtc 2761 caactctcag gctaatagaa ctgatgttgc aaaagcaatt ttgcaacaaa ctactgcttg 2821 atacaaataa ggagaatgtt atggaaattc aaaacacaaa accaacccag actttatata 2881 cagatatatc cacaaaacaa actcaaagtt cttccgaaac acaaaaatca caaaattatc 2941 agcagattgc agcgcatatt ccacttaatg tcggtaaaaa tcccgtatta acaaccacat 3001 taaatgatga tcaactttta aagttatcag agcaggttca gcatgattca gaaatcattg 3061 ctcgccttac tgacaaaaag atgaaagatc tttcagagat gagtcacacc cttactccag 3121 agaacactct ggatatttcc agtctttctt ctaatgctgt ttctttaatt attagtgtag 3181 ccgttctact ttctgctctc cgcactgcag aaactaaatt gggctctcaa ttgtcattga 3241 ttgcgttcga tgctacaaaa tcagctgcag agaacattgt tcggcaaggc ctggcagccc 3301 tatcatcaag cattactgga gcagtcacac aagtaggtat aacgggtatc ggtgccaaaa 3361 aaacgcattc agggattagc gaccaaaaag gagccttaag aaagaacctt gccactgctc 3421 aatctcttga aaaagagctt gcaggttcta aattagggtt aaataaacaa atagatacaa 3481 atatcacctc accacaaact aactctagca caaaattttt aggtaaaaat aaactggcgc 3541 cagataatat atccctgtca actgaacata aaacttctct tagttctccc gatatttctt 3601 tgcaggataa aattgacacc cagagaagaa cttacgagct caataccctt tctgcgcagc 3661 aaaaacaaaa cattggccgt gcaacaatgg aaacatcagc cgttgctggt aatatatcca 3721 catcaggagg gcgttatgca tctgctcttg aagaagaaga acaactaatc agtcaggcca 3781 gcagtaaaca agcagaggaa gcatcccaag tatctaaaga agcatcccaa gcgacaaatc 3841 aattaataca aaaattattg aatataattg acagcatcaa ccaatcaaag aattcggcag 3901 ccagtcagat tgctggtaac attcgagctt aatatatcca agagccataa taatatatgg 3961 ctcttcctgt aaggaaataa ccatgaatat aacaactctg actaatagta tttccacctc 4021 atcattcagt ccaaacaata ccaacggttc atcaaccgaa acagttaatt ctgatataaa 4081 aacaacgacc agttctcatc ctgtaagttc 
ccttactatg ctcaacgaca cccttcataa 4141 tatcagaaca acaaatcagg cattaaagaa agagctttca caaaaaacgt tgactaaaac 4201 atcgctagaa gaaatagcat tacattcatc tcagattagc atggatgtaa ataaatccgc 4261 tcaactattg gatattcttt ccaggaacga atatccaatt aataaagacg caagagaatt 4321 attacattca gccccgaaag aagccgagct tgatggagat caaatgatat ctcatagaga 4381 actgtgggct aaaattgcaa actccatcaa tgatattaat gaacagtatc tgaaagtata 4441 tgaacatgcc gttagttcat atactcaaat gtatcaagat tttagcgctg ttctttccag 4501 tcttgccggc tggatctctc ccggaggtaa cgacggaaac tccgtgaaat tacaagtcaa 4561 ctcgcttaaa aaggcattgg aagaactcaa ggaaaaatat aaagataaac cgctatatcc 4621 agcaaataat actgttagtc aggaacaagc aaataaatgg cttacagaat taggtggaac 4681 aatcggcaag gtatctcaaa aaaacggggg atatgttgtc agtataaaca tgaccccaat 4741 agacaatatg ttaaaaagct t // LOCUS YEPOMPHA 824 bp ss-mRNA BCT 14-AUG-1990 DEFINITION Y.enterocolitica cationic outer membrane protein (ompH) gene, complete cds. ACCESSION M34854 KEYWORDS cationic outer membrane protein; enterobacterial outer membrane protein; lipopolysaccharide binding protein; secretory protein; structural membrane protein. SOURCE Y.enterocolitica (strain 03 isolate EH822) DNA, clone pUCHS115. ORGANISM Yersinia enterocolitica Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Facultatively anaerobic rods; Enterobacteriaceae. REFERENCE 1 (bases 1 to 824) AUTHORS Hirvas,L., Koski,P. and Vaara,M. TITLE The ompH gene of Yersinia enterocolitica: Cloning, sequencing, and comparison to known ompH sequences JOURNAL Unpublished (1990) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by L.H.Hirvas, 06-JUN-1990. 
University of Helsinki FEATURES from to/span description pept 324 818 ompH protein precursor (gtg start codon) sigp 324 392 ompH protein signal peptide matp 393 815 ompH protein binding 308 317 ompH ribosomal binding site signal 120 125 ompH -35 region signal 143 148 ompH -10 region BASE COUNT 257 a 147 c 198 g 221 t 1 others ORIGIN 1 tattcctgat tacagtaaag ccagcaatat tcgtgtctct gccggtgtgg cactacaatg 61 gatgtctcca ttggggcctt tagtgttcgc atatgctaaa ccggttaaag attacgaagg 121 tgataagtca gagcaattcc agtttaacat ggtaaaactt ggtaattgat tggcaaaagt 181 agttgcaaat tttaaagaca gcactggtag ttgaaataac ctaaggtagc gctgatgagt 241 ttggcgctat tttaggcatt attcagtatt aagtgcactt caaggtgtct ctgacacaaa 301 caggtaatgg taaggagttt atagtgaaaa agtggttgtg tgcggcaagt cttggtttag 361 cattggcagc ttctgccagy gttcaagccg caaaaattgc tattgttaac gtttccagaa 421 ttttccaaca attacctgag agtgaaaccg tagctaaaca gttggagaat gaattcaaag 481 gccgagcaac cgaactgcaa ggaatggaga gcgatctgca gactaaaatg cagaaactgc 541 aacgtgacgg ttctaccatg aaagccagtg atcgtaccaa actggaaaat gacgtaatga 601 aacagcgtga aactttctct actaaagccc aggcttttga gcaagataat cgccgtcgtc 661 agatggaaga gcgtaataaa atcctgagcc gtattcagga tgctgttaaa tctgttgcta 721 gcaaaggcgg ttatgacgtg gtgattgatg caaatgctgt tgcatatgca gatccttcta 781 aagatatcac tgctgacgtg ctgaaacagg ttaaataaaa gtac // LOCUS YSCEXG1A 2056 bp ds-DNA PLN 14-AUG-1990 DEFINITION S.cerevisiae exo-1,3-B-glucanase/1,3-B-D-glucan glucanohydrolase (EXG1) gene, complete cds. ACCESSION M34341 KEYWORDS 1,3-glucan glucanohydrolase; exo-1,3-B-glucanase. SOURCE S.cerevisiae (strain AB320) DNA, clone YEp13. ORGANISM Saccharomyces cerevisiae Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes; Endomycetales; Saccharomycetaceae. REFERENCE 1 (bases 1 to 2056) AUTHORS Vazquez de Aldana,C.R., Correa,J., San Segundo,P., Bueno,A., Nebreda,A.R., Mendez,E. and del Rey,F. 
TITLE Nucleotide sequence of the exo-1,3-B-glucanase-encoding gene, EXG1, in the yeast Saccharomyces cerevisiae JOURNAL Unpublished (1990) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by F.del Rey, 17-MAY-1990. Instituto de Microbiologia bioquimica Facultad de Biologia CSIC/Universidad de Salamanca 37008 Salamanca, SPAIN FEATURES from to/span description pept 363 1709 exo-1,3-B-glucanase/1,3-B-D-glucan glucanohydrolase precursor (EXG1, EC 3.2.1.58) sigp 363 482 exo-1,3-B-glucanase/1,3-B-D-glucan glucanohydrolase signal peptide matp 483 1706 exo-1,3-B-glucanase/1,3-B-D-glucan glucanohydrolase mRNA 265 > 1709 EXG1 mRNA signal 153 159 TATA box BASE COUNT 610 a 395 c 396 g 655 t ORIGIN Chromosome XII-right arm. 1 ctcgagcacg ccgttataat tttagcgtgt tcctgtacct gtgtgcacat caataagcgg 61 tgtaacaaac ttgaacttgc catctcatat cgtcatatga gcagttgcag agaaaggcac 121 tttaaataaa aaggcgtgga tgataaaaaa tgtatataag ttggatggat ttttgggaaa 181 aagtaatgtt tttgcagacg ttttaaatac tccctccctt ttcttagtaa tttttattat 241 gtattgacta agtcaaaaat aactatagaa aactaaagtt tacgagagga cccaaaagtt 301 ttgaataaca cgtgcctttg attttttgtt tactttcttt ttctagttaa ttaccaacta 361 aaatgctttc gcttaaaacg ttactgtgta cgttgttgac tgtgtcatca gtactcgcta 421 ccccagtccc tgcaagagac ccttcttcca ttcaatttgt tcatgaggag aacaagaaaa 481 gatactacga ttatgaccac ggttccctcg gagaaccaat ccgtggtgtc aacattggtg 541 gttggttact tcttgaacca tacattactc catctttgtt cgaggctttc cgtacaaatg 601 atgacaacga cgaaggaatt cctgtcgacg aatatcactt ctgtcaatat ttaggtaagg 661 atttggctaa aagccgttta cagagccatt ggtctacttt ctaccaagaa caagatttcg 721 ctaatattgc ttcccaaggt ttcaaccttg tcagaattcc tatcggttac tgggctttcc 781 aaactttgga cgatgatcct tatgttagcg gcctacagga atcttaccta gaccaagcca 841 tcggttgggc tagaaacaac agcttgaaag tttgggttga tttgcatggt gccgctggtt 901 cgcagaacgg gtttgataac tctggtttga gagattcata caagtttttg gaagacagca 961 atttggccgt tactacaaat gtcttgaact acatattgaa aaaatactct gcggaggaat 1021 acttggacac tgttattggt atcgaattga 
ttaatgagcc attgggtcct gttctagaca 1081 tggataaaat gaagaatgac tacttggcac ctgcttacga atacttgaga aacaacatca 1141 agagtgacca agttatcatc atccatgacg ctttccaacc atacaattat tgggatgact 1201 tcatgactga aaacgatggc tactggggtg tcactatcga ccatcatcac taccaagtct 1261 ttgcttctga tcaattggaa agatccattg atgaacacat taaagtagct tgtgaatggg 1321 gtaccggagt tttgaatgaa tcccactgga ctgtttgtgg tgagtttgct gccgctttga 1381 ctgattgtac aaaatggttg aatagtgttg gcttcggcgc tagatacgac ggttcttggg 1441 tcaatggtga ccaaacatct tcttacattg gctcttgtgc taacaacgat gatatagctt 1501 actggtctga cgaaagaaag gaaaacacaa gacgttatgt ggaggcacaa ctagatgcct 1561 ttgaaatgag agggggttgg attatctggt gttacaagac agaatctagt ttggaatggg 1621 atgctcaaag attgatgttc aatggtttat tccctcaacc attgactgac agaaagtatc 1681 caaaccaatg tggcacaatt tctaactaag tcgccctcag tccgctcatt ttagctgaat 1741 tttctaatgt tatttttcat cagcaaaact taacagaacg ttaatttatc tacccctttt 1801 agttcattat ctctttttta tccaacattt tacagagatc tctcacttaa gtctaagtaa 1861 agacattatt ttatatggta cacttataga atatacgata ataataataa aaactatgta 1921 acataacctt cagaatttaa tattagtttc ctttttacct cattgcacta ataaaaaaat 1981 tctacagaat ctccgaaaaa gaaaatccag cttactcttt ttgttttctt cttcacacgt 2041 gagcttttcc gccggc // LOCUS CORDTXRA 2529 bp ds-DNA BCT 14-AUG-1990 DEFINITION C.diphtheriae toxin repressor (dtxR) gene, complete cds. ACCESSION M34239 KEYWORDS regulatory protein; toxin repressor. SOURCE C.diphtheriae (strain C7) DNA, clone pHH2500. ORGANISM Corynebacterium diphtheriae Prokaryota; Bacteria; Firmicutes; Irregular asporogenous rods. REFERENCE 1 (bases 1 to 1390) AUTHORS Boyd,J., Oza,M.N. and Murphy,J.R. TITLE Molecular cloning and DNA sequence analysis of the diphtheria tox iron-dependent regulatory element (dtxR) from Corynebacterium diphtheriae JOURNAL Proc. Natl. Acad. Sci. U.S.A. 87, 5968-5972 (1990) STANDARD full staff_entry REFERENCE 2 (bases 1 to 2529) AUTHORS Boyd,J., Oza,M.N. and Murphy,J.R.
JOURNAL Unpublished (1990) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by J.R.Murphy, 11-MAY-1990. University Hospital Dept Medicine-Rm E613 88 East Newton Street Boston, MA 02118-2393 FEATURES from to/span description pept 681 1361 toxin repressor protein (dtxR) pept 25 456 ORF1 pept 1384 2370 ORF3 signal 595 629 promoter sequence binding 664 669 ribosomal binding site (put.) binding 1200 1283 DNA-binding region (put.) BASE COUNT 691 a 615 c 593 g 630 t ORIGIN 1 bp upstream of HindIII site. 1 aagctttctc gaatcaaacg cgagatgtat caacagctcg gacgtgaagc aaccaacgaa 61 gagcttgcgg aagaatctgg cattgaagaa tccaaaattg aaatgttgct tcgccagtct 121 cgtgacccag taagcctaga tatgccagtt ggtgccgatg aagaggctcc acttggtgac 181 tttatcgaag attccgaggc gactgatgcc gaatcagctg tagttgcgtc gttgcgtcac 241 tctgatatca gaacagtctt ggacactttg gagcaacgtg aacaggacgt tattcgcctt 301 cgttatgggc tcgatgatgg agttccccgc actttagatc aaattggtcg tcaatttgga 361 ctgtctcgtg aaagagttcg ccaaattgaa cgcgaggtca tgtcgaaact gcgtgacggt 421 gcccgtgccg aaaaacttcg agcatacgca caataaagcg tttgcttaga tatgcctacc 481 aataaagaca taaacgccta ttaaaagcaa tctttagatt aggcgttata attaaagtct 541 catcgaaaaa cgcgctgcgg gactacaacg caacaagaaa actattccat atttttcacg 601 ctacaattcg ttgtagattg ataggaattg atcaccagca cacaacagtc tccatggcac 661 tataaggaaa gaggcttaca atgaaggact tagtcgatac cacagagatg tacttgcgta 721 ctatctatga gctggaagaa gagggagtca cccctcttcg cgctaggatc gctgagcgtc 781 tggaacaatc tggacctaca gttagccaaa ccgttgcccg tatggagcgc gatggacttg 841 tcgttgtcgc ctcagaccgc agtctacaaa tgacaccgac aggccgcact ttagcgactg 901 cagttatgcg taaacatcgc ttagctgagc gccttcttac cgatatcatt ggtctagata 961 tcaataaagt tcacgatgaa gcctgccgct gggaacacgt tatgagtgac gaagttgaac 1021 gcaggctcgt gaaagtattg aaagatgtca gtcggtcccc cttcggaaac ccaattccag 1081 gtctcgacga actcggcgta ggcaattctg acgcggcagc ccccggaact cgcgttattg 1141 acgctgccac cagcatgccc cgcaaagtac gcattgttca gattaacgaa atctttcaag 1201 ttgaaacgga tcagtttaca cagctcctcg 
atgctgacat ccgtgttgga tcagaagtcg 1261 aaattgtaga tagagacggc cacatcacgt tgagccacaa tggaaaagat gtcgaactcc 1321 tcgatgatct ggctcacact attcgtatcg aagaactcta aatactaaag gcggcgaaat 1381 tagatgaaac tcctcgttac cggtggcgcc ggatacgtag gaagtgtctg ttccactgtt 1441 ctgctcgaac aggggcacga agtaacaatt gtcgataatc ttacaaccgg caatcgcgat 1501 gctgttccgc taggagccac ttttgtcgag ggtgatatca aagacgttgc ggaaaacgtt 1561 ttatcatctg attcttttga tgcggttctt cacttcgcgg ctcgttcgct tgtaggcgaa 1621 tcagttgaaa agccagatga atattggcag cacaacatgg tgaccacact cgcacttctt 1681 gatgccatga aacgtaataa cgtgcgaaac attgtctttt cttctacagc tgctacatac 1741 ggcgaacccg aaacagtccc gattactgaa gacgctccta cccacccaac gaatccatac 1801 ggcgcaacca agctgtctat cgactacgcg atcacatcat acgcacatgc atatgggttt 1861 gctgcgacga gtctgcggta cttcaacgta gctggcgcat atggtttggt gggtgaaaat 1921 cgagaaatag aaacccatct cattccgctc gttttacaag tagcactggg acaccgtgac 1981 aagattttca tgtttggtga cgactggcca actgaagatg gaacccctat ccgcgattac 2041 attcatattc gtgatcttgc agatgcccat attttggcac ttcaaagtaa cgtcgaaggc 2101 agccaccgga tctttaatct cggatccggt gaaggatatt cggtcaaaca ggtcattgat 2161 acctgcagag aagttaccgg tcatcccatc cccgctgagg tagctcctcg acgggctggt 2221 gatcctgcag tgctcattgc atcatcggct aaggcgcaat ccgaactagg atggaagcca 2281 caacgcactg atctgcacac gattgtttct gatgcttggg cttttacgtc tcaactcgga 2341 gacaaagccc atagcgcttc tcgtggataa caacgagtag gtttatcttt tgcagtccgc 2401 ggtgtgatta cttaagtatt cacaccgcgg attttacatt tgttgactca ccaaaatgtt 2461 tctgagcttc ctgactccca ttgataacaa gatcaacaat ggatccagat agacctcgat 2521 aataagctt // LOCUS ECOANSBA 1643 bp ds-DNA BCT 14-AUG-1990 DEFINITION E.coli L-asparaginase II (ansB) gene, complete cds. ACCESSION M34234 KEYWORDS L-asparaginase II; ansB gene. SOURCE E.coli (strain K12 isolate JM108) DNA, clones lambda-ansB and pansB7. ORGANISM Escherichia coli Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Facultatively anaerobic rods; Enterobacteriaceae. REFERENCE 1 (bases 1 to 1643) AUTHORS Bonthron,D.T. 
TITLE L-asparaginase II of Escherichia coli K-12: Cloning and sequencing of the ansB gene and its position on the chromosomal map JOURNAL Gene (1990) In press STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by D.T.Bonthron, 11-MAY-1990. FEATURES from to/span description pept 346 1392 L-asparaginase II (ansB) (EC 3.5.1.1) sigp 346 411 L-asparaginase II signal peptide matp 412 1389 L-asparaginase II binding 335 339 ribosomal binding site (put.) site 1400 1425 transcription termination signal (put.) BASE COUNT 442 a 404 c 405 g 392 t ORIGIN 63.8 min on K12 map. 1 gcggtaaccc attacagaat gtgctgggaa gcctgggggg gctgcaatcc tcaatccaaa 61 ccgagtggaa aaagcaggaa aaagatttcc agcagtttgg caaagatgtt tgtagccgcg 121 ttgtgactct ggaagatagc cgcaaagccc tggtcgggaa tttaaaataa tcctctattt 181 taagacggca taatactttt ttatgccgtt taattcttcg ttttgttacc tgcctctaac 241 tttgtagatc tccaaaatat attcacgttg taaattgttt aacgtcaaat ttcccataca 301 gagctaaggg ataatgcgta gcgttcacgt aactggagga atgaaatgga gtttttcaaa 361 aagacggcac ttgccgcact ggttatgggt tttagtggtg cagcattggc attacccaat 421 atcaccattt tagcaaccgg cgggaccatt gccggtggtg gtgactccgc aaccaaatct 481 aactacacag tgggtaaagt tggcgtagaa aatctggtta atgcggtgcc gcaactaaaa 541 gacattgcga acgttaaagg cgagcaggta gtgaatatcg gctcccagga catgaacgat 601 aatgtctggc tgacactggc gaaaaaaatt aacaccgact gcgataagac cgacggcttc 661 gtcattaccc acggtaccga cacgatggaa gaaactgctt acttcctcga cctgacggtg 721 aaatgcgaca aaccggtggt gatggtcggc gcaatgcgtc cgtccacgtc tatgagcgca 781 gacggtccat tcaacctgta taacgcggta gtgaccgcag ctgataaagc ctccgccaac 841 cgtggcgtgc tggtagtgat gaatgacacc gtgcttgatg gccgtgacgt caccaaaacc 901 aacaccaccg acgtagcgac cttcaagtct gttaactacg gtcctctggg ttacattcac 961 aacggtaaga ttgactacca gcgtaccccg gcacgtaagc ataccagcga cacgccattc 1021 gatgtctcta agctgaatga actgccgaaa gtcggcattg tttataacta cgctaacgca 1081 tccgatcttc cggctaaagc actggtagat gcgggctatg atggcatcgt tagcgctggt 1141 gtgggtaacg gcaacctgta taaatctgtg ttcgacacgc 
tggcgaccgc cgcgaaaacc 1201 ggtactgcag tcgtgcgttc ttcccgcgta ccgacgggcg ctaccactca ggatgccgaa 1261 gtggatgatg cgaaatacgg cttcgtcgcc tctggcacgc tgaacccgca aaaagcgcgc 1321 gttctgctgc aactggctct gacgcaaacc aaagatccgc agcagatcca gcagatcttc 1381 aatcagtact aatcgcctcg ccccggtatc gtgccggggc tttttcactt cagactcacg 1441 tccattgcca attttaatta ccctaatgat aatcaccgga ataaattatt ccgcgtgagg 1501 tttttcgggt gaaaaagcaa tggattgttg gcacggcact gcttatgttg atgactggta 1561 atgcctgggc agatggcgaa ccgccaactg aaaatatctt aaaagatcaa ttcaaaaagc 1621 agtatcacgg cattctcaag ctt // LOCUS HUMADE5RC 1946 bp ds-DNA SYN 14-AUG-1990 DEFINITION Synthetic human/adenovirus type 5 recombination junction. ACCESSION M34061 KEYWORDS . SOURCE Human fetal bone marrow fibroblasts transformed with Adeno5/SV40 recombinant virus (cell line H13.1) DNA, clone pH13.1SV1. ORGANISM Artificial gene Artificial sequences; Genes. REFERENCE 1 (bases 1 to 1946) AUTHORS Romani,M., De Ambrosis,A., Alhadeff,B., Purrello,M., Gluzman,Y. and Siniscalco,M. TITLE Preferential viral integration at the highly recombinogenic chromosomal site 1p36 in human cells transformed with an Adeno5/SV40 hybrid JOURNAL Gene (1990) In press STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by M.Romani 08-MAY-1990. FEATURES from to/span description recomb 429 430 adenovirus type 5 end/Human DNA start rpt 430 1134 tandem Alu repeats BASE COUNT 492 a 548 c 510 g 396 t ORIGIN Chromosome 1p36. 
1 tttgacccgg aacgcggaaa acacctgaga aaaacacctg ggcgagtctc cacgtaaacg 61 gtcaaagtcc ccgcggccct agacaaatat tacgcgctat gagtaacaca aaattattca 121 gatttcactt cctcttattc agttttcccg cgaaaatggc caaatcttac tcggttacgc 181 ccaaaattta ctacaacatc cgcctaaaac cgcgcgaaaa ttgtcacttc ctgtgtacac 241 cggcgcacac caaaaacgtc acttttgcca catccgtcgc ttacatgtgt tccgccacac 301 ttgcaacatc acacttccgc cacactacta cgtcacccgc cccgttccca cgccccgcgc 361 cacgtcacaa actccacccc ctcattatca tattggcttc aatccaaaat aaggtatatt 421 attgatgata tataatatag aggctgaggc aggagggttg cttgagcctg ggaggctgcg 481 gctgcagtga gccatgatca caccactgca cttcagcctg ggcaacagag cgagactccg 541 tctcaaaaaa tagtaatagg ccaggcgcgg tggctcatgc cagtaatccc agcgctttgg 601 gaggctgagg caggcagatc acctgaggtc aggagttcaa gaccagcctg gccaacatgg 661 tgaaaccctg tctctactaa aaatacaaaa attagttggg catggtggcg gcgcacacca 721 gtaatcccag ctactcaaga ggctgagggc acttgggcag cagaggaggt tgtagtgagc 781 agagatcacg ccactgcact ctagcctggg caacagaggg agactctgtc tcaaaaaata 841 ataggccggg cacggtggct catgcctgta atcccagcac tttgggaggc cgaggcgggt 901 ggatcacctg agatcaggag tttaagacca gcctggccaa catggtgaaa ccccatctct 961 actaaaaata caaaaaatta gccaggcgtg gtggcgggcg tctgtaatcc cagctacttg 1021 ggaggctgaa gcaggagaat cacttgagcc caggaggcag aggttgcagt gagctgagat 1081 catgccatta cactccagcc tgggcaacaa gagtgaaact ccgtctcaaa aaataataat 1141 aataatgata ataataataa taattgcatc tattcactga gcgcctcctg tacaccaggc 1201 accatgtggc attaggggcc catgctgtca tcatattctg ccagccaccc atggggtggt 1261 ggggtaattc cccaggccac acctgtccac acctctcttc aggtactgcg tgggcatgtg 1321 cggagacggc gccaatgact gtggggccct gaaggcggct gatgtcggca tctcgctgtc 1381 ccaggcagaa gcctcagtgg tctcaccctt cacctcgagc atggccagta ttgagtgcgt 1441 gcccatggtc atcaggtaag gcaggcaggg accggtgggt gagggctcag cagggctggt 1501 cagccaacct gggtccccta atgtccgtgc cctgccaccc agggaggggc gctgttccct 1561 tgacacttcg ttcagcgtct tcaagtacat ggctctgtac agcctgaccc agttcatctc 1621 cgtcctgatc ctctacacgg tgagtatctg cagagctccg tacccagagc cgtcccagga 1681 ctagggaggg gacacagccc 
tgtgccttgg agctggcagg tcccagatag atgacgtggg 1741 cccagatcct gaccctgcca ctctccggct gtgtggttgg agaggtggca aggcttgggg 1801 gagatgagaa aaacaccagg cacggagcct ggtacccact aggtcctcac aaatggcagc 1861 agctgtcatg ggctaacatc ttgaccgcaa ctctgggagc tccccaatct gcagggaaga 1921 ctgaacggag gcagggccaa aaccct // LOCUS MCLPCCA 1701 bp ds-DNA BCT 14-AUG-1990 DEFINITION M.laminosus phycoerythrocyanin (PCC) alpha and beta subunit genes, complete cds, and linker protein, 5' end. ACCESSION M34254 KEYWORDS light-harvesting biliprotein; phycoerythrocyanin. SOURCE M.laminosus (strain Fischerella PCC 7603) DNA. ORGANISM Mastigocladus laminosus Prokaryota; Bacteria; Gracilicutes; Oxyphotobacteria; Cyanobacteria. REFERENCE 1 (bases 1 to 1701) AUTHORS Eberlein,M. and Kufer,W. TITLE Genes encoding both subunits of phycoerythrocyanin, a light-harvesting biliprotein from the cyanobacterium Mastigocladus laminosus JOURNAL Unpublished (1990) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by W.Kufer, 11-MAY-1990. Botanisches Institut Ludwig-Maximilians-Universitaet Muenchen Menzingerstr.
67 D-8000 Muenchen 19 Federal Republic of Germany FEATURES from to/span description pept 279 797 phycoerythrocyanin beta-subunit pept 884 1372 phycoerythrocyanin alpha-subunit pept 1528 > 1701 phycoerythrocyanin linker protein precursor matp 1531 > 1701 phycoerythrocyanin linker protein BASE COUNT 526 a 312 c 383 g 480 t ORIGIN 1 agctttacat ttaaggatat ggcgctcaaa gaatgggagt tttacagtta ttaaataaag 61 ttcataaaaa agaatacaag taatatataa aatacgaaac taaggtttga aagtaattgc 121 aaatatcaag gcaaacgaaa agataaaggc taattgttgc acgtaaaaat ggtatttttt 181 tgatttttag gtgtagttat tggcggattt tatctggtct cggtttgtct tgattaaaaa 241 tgcaatagct gccatgaaat cgaggagaaa atagaaaaat gcttgatgct ttttccagag 301 tagttgaaca agctgataaa aaaggtgctt acctaagcaa tgatgaaata aatgctttgc 361 aggctatagt tgcagatagc aacaaacgtt tggatgtagt gaatcggctt acaagcaacg 421 cctcctcaat tgtggcgaat gcttatcgtg ctttagtagc agaacgtccg caggtattta 481 atcctggtgg tccttgtttt caccatcgca accaagctgc ttgcatccgc gatttaggat 541 ttattctgcg ttatgttacg tactccgtac ttgctggtga cactagtgtc atggacgatc 601 gctgcttgaa tggtttgcgc gaaacctatc aagctcttgg tactcccggt gatgctgtag 661 cttctggaat taaaaaaatg aaagaagctg cacttaaaat tgctaacgat cccaatggca 721 ttaccaaagg tgattgcagt caattaatgt ctgaattagc aagttatttt gaccgagcag 781 cggctgctgt tgcttaatgc agtcaaaggc gcgactaagt gacaaaattt tcaaactctt 841 ttttagtaat taatttgaca agtttttgga gaagaagcga atcatgaaga ccccgttaac 901 cgaagcgatc gccgccgccg atcttcgcgg ttcttatttg agcaacacag aattgcaagc 961 agtatttggt cgtttcaatc gcgctcgtgc tggtttagaa gcagcaagag cattcgctaa 1021 caacggtaag aaatgggcgg aagcagcagc caatcacgta tatcaaaaat ttccctacac 1081 tacccaaatg caaggacctc agtatgcttc tactccagaa gggaaagcaa agtgcgtgcg 1141 ggatatcgac cattacttac gcacaattag ctactgctgt gtagttggcg gtactggtcc 1201 cttggatgac tatgtagttg caggcttaaa ggaatttaac agcgccctgg gtttatctcc 1261 cagctggtac atagctgctc tagaatttgt ccgcgacaat cacggtttga ccggtgatgt 1321 tgctggtgaa gccaacacct acatcaacta tgcaattaat gcattgagct aaatctaaac 1381 agcgaatgtc ctttgtattt gcttaatagc taatggctga 
tagctgatgg ctaatggcta 1441 atggaaaaat gcaattagca attagcagtg agttattggc cattaactgt gtttgagaaa 1501 aatgataaaa atggagcgat tgttaacatg agtacatcag tcgcagaacg cctcgcaatt 1561 aaagatgaag tagacaaaaa aattgaattg cgccctaatt ggagcgaaga cgagctgcaa 1621 attgttttta aaacagccta cgagcaagtt tttggacgtc agggattata tgccagtcaa 1681 agatttgcaa ctgccgaagc t // LOCUS YSCTSDNAA 238 bp ds-DNA PLN 14-AUG-1990 DEFINITION S.cerevisiae telomeric sequence DNA, clone YLP108CA-2-ii. ACCESSION M34310 KEYWORDS . SOURCE S.cerevisiae extrachromosomal plasmid DNA, clone YLP108CA-2-ii. ORGANISM Saccharomyces cerevisiae Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes; Endomycetales; Saccharomycetaceae. REFERENCE 1 (bases 1 to 238) AUTHORS Wang,S.-S. and Zakian,V.A. TITLE Sequencing of Saccharomyces telomeres cloned using T4 DNA polymerase reveals two domains JOURNAL Mol. Cell. Biol. 10, 4415-4419 (1990) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by S.-S.Wang, 16-MAY-1990. BASE COUNT 0 a 0 c 146 g 92 t ORIGIN 1 tgtgtgtggg tgtggtgtgg gtgtggtgtg tgggtgtggg tgtggtgtgg tgtgtgtgtg 61 ggtgtggtgt gggtgtggtg tgtgggtgtg gtgtgtgggt gtgtgggtgt gtgggtgtgg 121 gtgtggtgtg tgggtgtggt gtgtgggtgt ggtgtgggtg tgggtgtggt gtgtgtgggt 181 gtggtgtgtg ggtgtgggtg tggtgtggtg tgtgtgggtg tgtgggtgtg gtgtgtgt // LOCUS YSCTSDNAB 297 bp ds-DNA PLN 14-AUG-1990 DEFINITION S.cerevisiae telomeric sequence DNA, clone YLP108CA-2-i. ACCESSION M34311 KEYWORDS . SOURCE S.cerevisiae extrachromosomal plasmid DNA, clone YLP108CA-2-i. ORGANISM Saccharomyces cerevisiae Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes; Endomycetales; Saccharomycetaceae. REFERENCE 1 (bases 1 to 297) AUTHORS Wang,S.-S. and Zakian,V.A. TITLE Sequencing of Saccharomyces telomeres cloned using T4 DNA polymerase reveals two domains JOURNAL Mol. Cell. Biol. 
(1990) In press STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by S.-S.Wang, 16-MAY-1990. BASE COUNT 0 a 0 c 184 g 113 t ORIGIN 1 tgtgtgtggg tgtggtgtgg gtgtggtgtg tgggtgtggg tgtggtgtgg tgtgtgtgtg 61 ggtgtggtgt gggtgtggtg tgtgggtgtg gtgtgtgggt gtgtgggtgt gtgggtgtgg 121 gtgtggtgtg ggtgtgggtg tgggtgtggt gtgtgtgtgg gtgtggtgtg tgggtgtggg 181 tgtgtgtggg tgtgggtgtg ggtgtggtgt ggtgtgtggg tgtggtgtgt gggtgtggtg 241 tgtgtgggtg tggtgtgtgt gggtgtgggt gtgtgtgggt gtgggtgtgg tgtgtgg // LOCUS YSCTSDNAC 325 bp ds-DNA PLN 14-AUG-1990 DEFINITION S.cerevisiae telomeric sequence DNA, clone YLP108CA-4-ii. ACCESSION M34312 KEYWORDS . SOURCE S.cerevisiae extrachromosomal plasmid DNA, clone YLP108CA-4-ii. ORGANISM Saccharomyces cerevisiae Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes; Endomycetales; Saccharomycetaceae. REFERENCE 1 (bases 1 to 325) AUTHORS Wang,S.-S. and Zakian,V.A. TITLE Sequencing of Saccharomyces telomeres cloned using T4 DNA polymerase reveals two domains JOURNAL Mol. Cell. Biol. (1990) In press STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by S.-S.Wang, 16-MAY-1990. BASE COUNT 0 a 0 c 199 g 126 t ORIGIN 1 tgtgggtgtg tgtgtgggtg tggtgtgtgt gggtgtggtg tgtgggtgtg gtgtggtgtg 61 ggtgtgggtg tggtgtgggt gtggtgtgtg tgggtgtggt gtgtgtgggt gtggtgtgtg 121 gtgtgtgtgg gtgtgtgggt gtggtgtgtg gtgtgtgggt gtggtgtggg tgtggtgtgt 181 gggtgtgtgg gtgtgggtgt gtgtggtgtg tggtgtgtgt ggtgtgggtg tggtgtgtgg 241 gtgtgggtgt ggtgtgtggg tgtgggtgtg gtgtgtgtgt gtgggtgtgg tgtgtgggtg 301 tgggtgtggt gtgtgggtgt gtggg // LOCUS YSCTSDNAD 248 bp ds-DNA PLN 14-AUG-1990 DEFINITION S.cerevisiae telomeric sequence DNA, clone YLP108CA-4-i. ACCESSION M34313 KEYWORDS . SOURCE S.cerevisiae (strain XS595-6C) extrachromosomal plasmid DNA, clone YLP108CA-4-i. ORGANISM Saccharomyces cerevisiae Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes; Endomycetales; Saccharomycetaceae. 
REFERENCE 1 (bases 1 to 248) AUTHORS Wang,S.-S. and Zakian,V.A. TITLE Sequencing of Saccharomyces telomeres cloned using T4 DNA polymerase reveals two domains JOURNAL Mol. Cell. Biol. (1990) In press STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by S.-S.Wang, 16-MAY-1990. BASE COUNT 0 a 0 c 151 g 97 t ORIGIN 1 tgtgggtgtg tgtgtgggtg tggtgtgtgt gggtgtggtg tgtgggtgtg gtgtggtgtg 61 ggtgtgggtg tggtgtgggt gtggtgtgtg tgggtgtggt gtgtgtgggt gtggtgtgtg 121 gtgtgtgtgg gtgtgtgggt gtggtgtgtg tgtgtgtggg tgtggtgtgg gtgtggtgtg 181 tgggtgtgtg tgggtgtggt gtgtgtgggt gtgtgggtgt ggtgtgtgtg ggtgtggtgt 241 gtgggtgg // LOCUS PLYORIA 375 bp ds-DNA VRL 14-AUG-1990 DEFINITION Human polyomavirus BK (strain WWT) origin of replication. ACCESSION M34048 KEYWORDS origin of replication. SOURCE Polyomavirus BK (strain WWT) DNA, from human urine. ORGANISM Polyomavirus BK Viridae; ds-DNA nonenveloped viruses; Papovaviridae; Polyomaviruses. REFERENCE 1 (bases 1 to 375) AUTHORS Sundsfjord,A., Johansen,T., Flaegstad,T., Moens,U., Villand,P., Subramani,S. and Traavik,T. TITLE At least two types of control regions can be found among naturally occurring BK virus strains JOURNAL J. Virol. 64, 3864-3871 (1990) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by T.Johansen, 07-MAY-1990. University of Tromso P.O.
Box 977 N-9001, Tromso, NORWAY e-mail: terje.johansen@esuit.uit.no FEATURES from to/span description orgrpl 97 119 polyomavirus origin of replication site 1 375 regulatory region site 143 210 P block site 211 249 Q block site 250 302 R block BASE COUNT 123 a 86 c 95 g 71 t ORIGIN 1 ttttgcaaaa attgcaaaag aatagggatt tccccaaata gttttgctag gcctcagaaa 61 aagcctccac acccttacta cttgagagaa agggtggagg cagaggcggc ctcggcctct 121 tatatattat aaaaaaaaag gccacaggga ggagctgctt acccatggaa tgcagccaaa 181 ccatgacctc aggaaggaaa gtgcatgact gggcagccag ccagtggcag ttaatagtga 241 aaccccgccc ctgaaattct caaataaaca caagaggaag tggaaactgg ccaaaggagt 301 ggaaagcagc cagacagaca tgttttgcgg gcctaggaat cttggccttg tccccagtta 361 aactggacaa aggcc // LOCUS PLYORIB 424 bp ds-DNA VRL 14-AUG-1990 DEFINITION Human polyomavirus BK (strain UT) origin of replication. ACCESSION M34049 KEYWORDS origin of replication. SOURCE Human polyomavirus BK (strain UT) DNA, from human urine. ORGANISM Polyomavirus BK Viridae; ds-DNA nonenveloped viruses; Papovaviridae; Polyomaviruses. REFERENCE 1 (bases 1 to 424) AUTHORS Sundsfjord,A., Johansen,T., Flaegstad,T., Moens,U., Villand,P., Subramani,S. and Traavik,T. TITLE At least two types of control regions can be found among naturally occurring BK virus strains JOURNAL J. Virol. 64, 3864-3871 (1990) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by T.Johansen, 07-MAY-1990. University of Tromso P.O.
Box 977 N-9001, Tromso, NORWAY e-mail: terje.johansen@esuit.uit.no FEATURES from to/span description orgrpl 97 119 polyomavirus origin of replication site 1 424 regulatory region site 143 210 P block site 211 249 Q block site 250 261 R block site 263 315 P block site 316 349 Q block site 350 361 R block BASE COUNT 134 a 103 c 105 g 82 t ORIGIN 1 ttttgcaaaa attgcaaaag aatagggatt tccccaaata gttttgctag gcctcagaaa 61 aagcctccac acccttacta cttgagagaa agggtggagg cagaggcggc ctcggcctct 121 tatatattat aaaaaaaaag gccacaggga ggagctgctt acccatggaa tgcagccaaa 181 ccatgacctc aggaaggaaa gtgcatgact gggcagccag ccagtggcag ttaatagtga 241 aaccccgccc ctaaaattct ctcttaccca tggaatgcag ccaaaccatg acctcaggaa 301 ggaaagtgca tgactgggca gccagccagt ggcagttaat agtgaaaccg gaaagcagcc 361 agacagacat gttttgcgag cctaggaatc ttggccttgt ccccagttaa actggacaaa 421 ggcc // LOCUS MXAFRZE 2493 bp ds-DNA BCT 14-AUG-1990 DEFINITION M.xanthus gliding motility regulatory protein (frzE) gene, complete cds. ACCESSION M35192 KEYWORDS gliding motility regulatory protein. SOURCE M.xanthus (strain DZF1) DNA. ORGANISM Myxococcus xanthus Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Myxobacteria; Myxococcaceae. REFERENCE 1 (bases 1 to 2493) AUTHORS McCleary,W.R. and Zusman,D.R. TITLE FrzE of Myxococcus xanthus is homologous to both CheA and CheY of Salmonella typhimurium JOURNAL Proc. Natl. Acad. Sci. U.S.A. 87, 5898-5902 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by W.R.McCleary, 05-JUN-1990. 
FEATURES from to/span description pept 64 2397 gliding motility regulatory protein (frzE) BASE COUNT 375 a 848 c 926 g 344 t ORIGIN 1 gagctcatca agcggttcaa ggccgactag cgcttcctgg aaggggaggg gcgtacgccg 61 gccatggaca ccgaggctct caagaaatcc ctcctgaaga agttccagga ggtcaccgcc 121 gaccgcctcc agaagattca gctgggggta ttggacctgg agaaggagac cgcggaccag 181 gccgcggagg acgtcgcgcg cgagctgcac acgatgaagg gcgaggcccg catgttgggt 241 ctggccgcca tcgggcagct ggcgcacgcc gccgaggacg tcctgcgcgc agagcgcgag 301 ggcaagacgg ccaccgaggt ggccaccgac gtcctgctca gggcgtgcga cgtcctctcc 361 gatctcaacg aagacctgtc cggcgccaac acgggcaacc cggccagcga ggagatggtc 421 cgcatgctcg cggaagtctc cggacagacg ccgcccgcca tcgctggcgc acggcccgtg 481 gcgccgcctc ctgcaccgcc gccggcccct gtcgccgcgc ccgtggtgac accggcagcc 541 gtcgccgcgc cgcccgcgcc ggtgcaggct ccggtggctc cgcccccgac gcaggccccc 601 gtggccgagc cgggggcgca cgccgccgca gccgcgcctc acccggctgc cgcgcatggc 661 cgtgacgagg aggccccgag cgccgcgaag tccgcggtgg ccgaccggag catccgcgtc 721 aacgtggagg tgctcgacgc gttggggttg ctcgcgggcg acctgctggt ggagagcgcc 781 cgcggccggc tgcgcagctc ggagacggag gcgttgttcg agcgcttcag ccgcctgggg 841 gaccgcttcc tccggctggc ggaagagatc gacatctcga acgaggtgcg tgagcagttg 901 gaccgcgtgg agagcgacct ccacatgctg cgcgacgacg cgttccgctt cgtgcgccgc 961 aacgacgacg gcatcaacac gctgcacggc aacctggcga agatggcgga ccacgtggcc 1021 gaggcccggc tggtgccgct gtccaccgtg ttcgacgcct tcccgcgcgc cgttcgcgag 1081 atgtcgcgca cgcagggcaa ggaagtggac ctggtcatcg agaacgccga catcggcgtg 1141 gaccggtcca tgctgggcga cgtgcgagac gcgctggtgc acctgctgcg caactcggtg 1201 gaccacggcg tggagtcccc ggacacgcgc cagcagttgg gcaagccgct caacggccgc 1261 atccgcatcc gcgtgcgcgt ggacggcgac atgctccaca tcgaggtgga ggacgacggc 1321 cgcggcatcg acccggagcg gctgcgtcag gcggccatct ccaagcgcct catcaacgcg 1381 gtgcaggccg ccgcgctgtc ggagcgcgag gccatcgagc tcatcttccg ccccggcttc 1441 tccacccgcg accaggtcag cgagctgtct ggccgtggcg tgggcatgga cgtggtgaag 1501 cgcaaggtgg agacgctggg cggctcggtg ggcgtgagca gccgaatcgg ccgtggctcc 1561 accatcaccc tgcgcctgcc gcagtcgctg 
gcgttgatga aggtgctgct ggtgcgcctg 1621 ggggacgacg tctacggcat gcccgccgcg gacgtggaag ccgtcatgcg cgtcaagccg 1681 gatgaccggc tggagatctt cggcacgctg gccgtgcggc accgtggcaa gcccacggcg 1741 ctggtggcgc tggggccgct gttgggcctc aacggcggca atcgcttcga caagccaccc 1801 gcggtggtgg tgcgtcacgg cgaggaccac gcggcgctgg tggtggacgg cttcgtggac 1861 gagcgcgaag tggccgtgaa gccttgcggc ggcgagttcc tcaaggccgc gcccttcatc 1921 gccggcaccg cggcgctgga ggacgggcgc atcgccgtgc tgctccatgt cccggacatc 1981 atggcggagg tgcgacggat ggcccgcccc gtcacccagg cccccgccgc caagcgcctc 2041 cgggtgctgc tggtggacga ctcgcccatt gcccgcgcta cggaaggggc gctggtgaag 2101 gcgctggggc actcggtgga ggaagcccag gacggcgaag aggcctacgt gaaggtgcag 2161 aacaacacct acgacctcat cctcacggac gtgcagatgc ccaagctgga cgggttctcg 2221 ctggcgcggc ggctcaagtc gacgcccgcg gtggctcgca ttccggtcat catcctgtcg 2281 tcgctcgcct cgcccgagga caagcggcgc gggttggatg ccggcgcgga cgcgtacctc 2341 gtcaagggcg agctgggcgt ggaggttctc gcgcaggcca tcgaccggct gacctgagga 2401 gccaggcttg ggcggtggcg cggtagtcgc aggaatggcg tttcgggtgc tcatggtggg 2461 caaggggctg cgtgcgctcg cggcccgggg cct // LOCUS RATPCBB 463 bp ss-mRNA ROD 14-AUG-1990 DEFINITION Rat PCB-binding protein mRNA, complete cds. ACCESSION J05536 KEYWORDS PCB binding protein; binding protein. SOURCE Rat (strain Sprague Dawley) lung Clara cell, cDNA to mRNA, clone pPCB6. ORGANISM Rattus norvegicus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 463) AUTHORS Nordlund-Moeller,L., Andersson,O., Ahlgren,R., Schilling,J., Gillner,M., Gustafsson,J.-A. and Lund,J. TITLE Cloning, structure, and expression of a rat binding protein for polychlorinated biphenyls: Homology to the hormonally regulated progesterone-binding protein uteroglobin JOURNAL J. Biol. Chem. 265, 12690-12693 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence [1] kindly submitted by J.Lund, 15-JUN-1990. 
FEATURES from to/span description pept 58 348 PCB binding protein precursor sigp 58 114 PCB binding protein signal peptide matp 115 345 PCB binding protein BASE COUNT 117 a 156 c 84 g 106 t ORIGIN 1 cctcacatta caacatcagc ccacatctac agacagccca agcctccggc ctctaccatg 61 aagatcgcca tcacaatcac tgtgctcatg ctgtccatct gctgcagctc agcctcttcg 121 gacatctgcc caggatttct tcaagtcctt gaggccctcc tcctaggctc agagtctaat 181 tatgaggcag ccctgaagcc tttcaaccct gcctcagacc tgcaaaatgc aggaacccag 241 ctgaagaggc tggtggatac cctcccacag gagaccagaa taaacatcgt gaagctcacg 301 gagaagatcc taacaagtcc tctgtgtgag caagatttaa gagtctgaag ctcactggat 361 ttcagagata ttctacgcta aagccccctg tcattcccct ctccctcctt gacctcccta 421 caccaacccc gccttcctct ttgcaataaa cttcgagcat ctc // LOCUS YSBCPR 2990 bp ds-DNA PLN 14-AUG-1990 DEFINITION C.tropicalis NADPH-cytochrome P450 reductase gene, complete cds. ACCESSION M35199 KEYWORDS NADPH-cytochrome P450 reductase. SOURCE C.tropicalis (ATCC 750) DNA, clone pTS1. ORGANISM Candida tropicalis Eukaryota; Plantae; Thallobionta; Basidiomycotina; Deuteromycotina. REFERENCE 1 (bases 1 to 2990) AUTHORS Sutter,T.R., Sanglard,D. and Loper,J.C. TITLE Isolation and characterization of the alkane-inducible NADPH- cytochrome P450 oxidoreductase gene from Candida tropicalis: Identification of invariant residues within similar amino acid sequences of divergent flavoproteins JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence [1] kindly submitted by T.R.Sutter, 15-JUN-1990. Author name and address T.R.Sutter CIIT, P.O. 
Box 12137 Research Triangle Park, NC 27709 FEATURES from to/span description pept 554 2596 NADPH-cytochrome P450 reductase signal 198 204 TATA box BASE COUNT 988 a 442 c 558 g 1002 t ORIGIN 1 tgtcatgtct attcaatgct tcgatggggg gggttgcgta gagtaaaacc ccaaaaataa 61 tggtggtggt agcggtggtc ggtgtagcat tctgcttttg tctcactcta gctcttagtg 121 aataaatgat ccgatatcga aattgcaccg ataaatcctc ctgagataaa actctaatag 181 acgaaaagaa aggacagtta tatatacaat aataattata gagatgaaat ttggtctaag 241 ttggtacaca ttacctgact gtcagaaatg atacatattt caaagtcagt atggtcccag 301 tatcgtaata cataaacgac gttgggtccc gtaattgaac atcagaagag gaacaaaaaa 361 aaaaaagaat aaaaaatttc atacgacaat cttcaaacga actgaacgag agaaaaaata 421 aaaaaaaaga ttttaaacat ttcaaatttc caatttttct ttgttttttt tttatttcta 481 ccttttattc tttttctttt cttgatccat tactgctcta tcaactaaga ctattgtttc 541 aatcgctcaa gtcatggcat tagataagtt agatttatat gttattataa cattggtggt 601 tgcaattgca gcttattttg caaagaatca gtttcttgac caacaacaag ataccgggtt 661 ccttaatact gatagtggag atggtaattc aagagatatc ttacaagctt tgaagaagaa 721 caataaaaat acgttattat tatttggatc ccaaacaggt acagcagaag attatgccaa 781 caaattgtca agagaattgc attcaagatt tggtttgaaa accatggttg ctgatttcgc 841 tgattatgat ttcgaaaact tcggagatat tactgaagat atcttggttt tctttattgt 901 tgctacttat ggtgaaggtg aaccaaccga taatgctgac gaatttcaca cttggttgac 961 tgaagaagct gacaccttga gtactttgaa atatactgtt tttggtttgg gtaattcaac 1021 ttatgaattc ttcaatgcta ttggtagaaa atttgacaga ttgttgggag aaaaaggtgg 1081 tgacagattt gctgaatacg gtgaaggtga cgatggtact ggtactttag atgaagattt 1141 cttggcctgg aaggataacg tgtttgattc cttaaagaat gatttgaatt ttgaagaaaa 1201 agagttgaaa tacgaaccaa atgttaaatt gactgaaaga gatgatttat ctggcaatga 1261 tccagatgtc tccttgggtg aaccaaatgt caaatacatt aaatctgaag gtgttgactt 1321 aactaaaggt ccatttgatc atactcatcc atttttggct agaattgtta aaactaaaga 1381 attgtttact tctgaagaca gacattgtgt tcatgttgaa tttgatattt ctgaatcaaa 1441 cttgaaatat accaccggtg atcatcttgc aatctggcca tctaactctg atgaaaacat 1501 taagcaattt gccaaatgtt ttggtttaga agacaaactt 
gatactgtta ttgaattgaa 1561 agctttggat tccacttatt ccatcccatt ccctaatcca atcacttatg gagctgttat 1621 tagacaccat ttggaaattt caggtcctgt ttctagacaa tttttcttat ctattgctgg 1681 atttgcccct gatgaagaaa ctaaaaagtc atttactaga attggtggtg ataagcaaga 1741 atttgctagt aaagtcaccc gtagaaaatt caacattgcc gatgctttat tatttgcttc 1801 caacaacaga ccatggtccg atgttccatt cgaattcctt attgaaaatg tccaacactt 1861 aactcctcgt tattactcca tttcttcttc ctcattaagt gaaaagcaaa ccattaatgt 1921 tactgctgtt gttgaagccg aagaagaagc tgatggaaga ccagttactg gtgttgtcac 1981 caacttgttg aagaatattg aaattgaaca aaacaaaact ggtgaaaccc caatggttca 2041 ttatgatttg aatggtccaa gaggcaaatt tagcaagttc agattgccag ttcacgttag 2101 aagatctaat ttcaaattac caaagaatag cactacccca gttattttga ttggtccagg 2161 taccggtgtt gcaccattga gaggttttgt tagagaaaga gttcaacaag ttaaaaatgg 2221 tgttaatgtt ggtaagactg tattgtttta tggatgtaga aattccgaac aagatttctt 2281 gtacaaacaa gaatggagtg aatatgcctc agtattggga gaaaatttcg aaatgtttaa 2341 tgccttctca agacaagatc caactaagaa agtttatgtt caagataaga ttttagaaaa 2401 tagtgctctt gttgatgagt tattatctag tggagcaatt atttatgttt gtggtgatgc 2461 cagtagaatg gctagagatg ttcaagctgc aattgccaag attgttgcca aaagtagaga 2521 tatccacgaa gataaagctg ctgaattggt taaatcttgg aaagttcaaa atagatacca 2581 agaagatgtc tggtaaggaa aatatgaaca tttaattaaa gttaagatta gttttggtta 2641 atattttttt tttttttagt gattgggata tgttttattt gacgtatgta tatttcatga 2701 aatatatagt tgacacatat gtatatatgt actgaataat aataaaaaaa atgacaatat 2761 gaaagattaa tgatttctac tctttggctt gcactttgtc gtcattgttg ttgttggttc 2821 tgcctggata tttagtacaa tatctccaat ttggattgga actattgtat atggcttgta 2881 aatcattttc aattttcaac caatatttat acttcatttt ggacttcaag tccaaatcgt 2941 aaatattaca agtaagttaa aatttttcac cgtcaacgat ataaacatta // LOCUS HLTLYS 642 bp ss-mRNA INV 14-AUG-1990 DEFINITION H.rufescens sperm lysin mRNA, complete cds. ACCESSION M34388 KEYWORDS sperm lysin. SOURCE H.rufescens testis, cDNA to mRNA, clones r13 and r14. 
ORGANISM Haliotis rufescens Eukaryota; Animalia; Eumetazoa; Mollusca; Gastropoda; Prosobranchia; Archeogastropoda; Pleurotomariacea; Haliotidae. REFERENCE 1 (bases 1 to 642) AUTHORS Vacquier,V.D., Carner,K.R. and Stout,C.D. TITLE Species-specific sequences of abalone sperm lysin, the sperm protein that creates a hole in the egg envelope JOURNAL Proc. Natl. Acad. Sci. U.S.A. 87, 5792-5796 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable [or printed] sequence for [1] kindly submitted by V.D.Vacquier, 22-MAY-1990. FEATURES from to/span description pept 16 480 sperm lysin precursor sigp 16 69 sperm lysin signal peptide matp 70 477 sperm lysin mRNA < 1 642 sperm lysin mRNA signal 625 630 poly-A signal BASE COUNT 202 a 124 c 151 g 165 t ORIGIN 1 cgaacagatt acaagatgaa gctgttggtg ctttgcattt ttgcgatgat ggcgacattg 61 gcaatgtcac gcagctggca ttatgttgaa cccaagtttt taaacaaggc ttttgaagtt 121 gcactgaagg tacagatcat tgctggattc gatagagggc tggtcaaatg gcttagagtt 181 catggcagaa cattgtctac cgttcagaag aaggcgctgt acttcgtcaa tagacgttac 241 atgcagaccc actgggcgaa ttacatgctc tggataaata agaagatcga tgcactgggc 301 agaactccag ttgttggtga ctacacacgg ttaggtgccg aaattggaag acgtattgat 361 atggcttact tttacgactt cctgaaagat aagaacatga tcccaaaata tctgccatac 421 atggaagaaa ttaatagaat gcgcccggca gatgtcccag ttaaatacat gggcaagtga 481 aacccatgaa gaagacagct tcatgcaccg tgcatgtcaa cagatacaga aaacatcgtc 541 aaacacgggg agctatgaca ccatcctaac atgggttggc tatttgttgc acgattttta 601 aaacggaaag attgtacatt ccagaataaa tagatttact ac // LOCUS HLTLYS2 650 bp ss-mRNA INV 14-AUG-1990 DEFINITION H.corrugata sperm lysin mRNA, complete cds. ACCESSION M34389 KEYWORDS sperm lysin. SOURCE H.corrugata testis, cDNA to mRNA, clones p1 and p3. ORGANISM Haliotis corrugata Eukaryota; Animalia; Eumetazoa; Mollusca; Gastropoda; Prosobranchia; Archeogastropoda; Pleurotomariacea; Haliotidae. REFERENCE 1 (bases 1 to 650) AUTHORS Vacquier,V.D., Carner,K.R. and Stout,C.D. 
TITLE Species-specific sequences of abalone sperm lysin, the sperm protein that creates a hole in the egg envelope JOURNAL Proc. Natl. Acad. Sci. U.S.A. 87, 5792-5796 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable [or printed] sequence for [1] kindly submitted by V.D.Vacquier, 22-MAY-1990. FEATURES from to/span description pept 16 483 sperm lysin precursor sigp 16 69 sperm lysin signal peptide matp 70 480 sperm lysin mRNA 1 650 sperm lysin mRNA signal 635 640 poly-A signal BASE COUNT 194 a 134 c 162 g 160 t ORIGIN 1 ggaacagatt acaagatgaa gctgttggtg ctttgccttt ttgcgatgat ggcgacattg 61 gctgtgtcgc gccaccgctt ccgttttatt ccacataagt atatacgcaa ggagtttgaa 121 gttgcactga aggtagagat cattgccgga ttcgatagaa cgctggtcaa atggcttaga 181 gttcatggcg gacgtttgag tacagttcag aagaaggcgc tgtacttcgt caatagacgt 241 tacatgcaga cccactggca gaattacatg ctctggatag tgaggaagac cgatgcactg 301 ggcagacctc cagttgttgc tgactacagt cggttaggtg ccgaaattgg aagacgtatt 361 gatatggcat atttttacaa cttcctgaac gggaggaaca tgatcccaaa atatctgcca 421 tacatggagg aaattaatag aatgcgcccg gcagatgtcc cagttgcaaa cagaggcaag 481 tgaaaccaac gaagaagaaa gctgcatgca ccctgcatgt caacagatac agaaaacatc 541 gtccgtcaaa cacgaggagc tgtgacagca tccatcccaa catgggttgg ctatttgttg 601 cacgatttta acacagaaag attgtacatt ccagaataaa tagatttact // LOCUS HUMROSSA 1843 bp ss-mRNA PRI 14-AUG-1990 DEFINITION Human 52 kD ribonucleoprotein Ro/SSA, complete cds. ACCESSION M34551 KEYWORDS ribonucleoprotein. SOURCE Human thymocyte, cDNA to mRNA, clone FI18.1. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 1843) AUTHORS Itoh,K., Itoh,Y. and Frank,M.B. 
TITLE Heterogeneity in human Ro/SSA ribonucleoproteins: The 52 and 60 kd Ro/SSA proteins are encoded by separate genes JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable [or printed] sequence for [1] kindly submitted by M.B.Frank, 18-MAY-1990. Author address: M.B.Frank Oklahoma Medical Res Foundation Arthritis and Immunology program 825 WE 13th St, Oklahoma City, OK 73104 FEATURES from to/span description pept 39 1466 52 kD Ro/SSA ribonucleoprotein signal 1827 1832 poly-A signal BASE COUNT 474 a 458 c 488 g 423 t ORIGIN 1 bp upstream of EcoRI site. 1 gaattcgggc acactgctgt ttaacggcac acttgacaat ggcttcagca gcacgcttga 61 caatgatgtg ggaggaggtc acatgcccta tctgcctgga ccccttcgtg gagcctgtga 121 gcatcgagtg tggccacagc ttctgccagg aatgcatctc tcaggttggg aaaggtgggg 181 gcagcgtctg tcctgtgtgc cggcagcgct ttctgctcaa gaatctccgg cccaatcgac 241 agctagccaa catggtgaac aaccttaaag aaatcagcca ggaggccaga gagggcacac 301 agggggaacg gtgtgcagtg catggagaga gacttcacct gttctgtgag aaagatggga 361 aggccctttg ctgggtatgt gcccagtctc ggaaacaccg tgaccacgcc atggtccctc 421 ttgaggaggc tgcacaggag taccaggaga agctccaggt ggcattaggg gaactgagaa 481 gaaagcagga gttggctgag aagttggaag tggaaattgc aataaagaga gcagactgga 541 agaaaacagt ggaaacacag aaatctagga ttcacgcaga gtttgtgcag caaaaaaact 601 tcctggttga agaagaacag aggcagctgc aggagctgga gaaggatgag agggagcagc 661 tgagaatcct gggggagaaa gaggccaagc tggcccagca gagccaggcc ctacaggagc 721 tcatctcaga gctagatcga aggtgccaca gctcagcact ggaactgctg caggaggtga 781 taattgtcct ggaaaggagt gagtcctgga acctgaagga cctggatatt acctctccag 841 aactcaggag tgtgtgccat gtgccagggc tgaagaagat gctgaggaca tgtgcagtcc 901 acatcactct ggatccagac acagccaatc cgtggctgat actttcagaa gatcggagac 961 aagtgaggct tggagacacc cagcagagca tacctggaaa tgaagagaga tttgatagtt 1021 atcctatggt cctgggtgcc cagcactttc actctggaaa acattactgg gaggtagatg 1081 tgacaggaaa ggaggcctgg gacctgggtg tctgcagaga ctctgtgcgc aggaaggggc 1141 actttttgct tagttccaag agtggcttct ggacaatttg gttgtggaac aaacaaaaat 
1201 atgaggctgg cacctacccc cagactcccc tccaccttca ggtgcctcca tgccaagttg 1261 ggattttcct ggactatgag gctggcatgg tctccttcta caacatcact gaccatggct 1321 ccctcatcta ctccttctct gaatgtgcct ttacaggacc tctgcggccc ttcttcagtc 1381 ctggtttcaa tgatggagga aaaaacacag cccctctaac cctctgtcca ctgaatattg 1441 gatcacaagg atccactgac tattgatggc tttctctgga cactgccact ctccccattg 1501 gcaccgcttc tcagccacaa accctgcctc ttttccccat gaactctgaa ccacctttgt 1561 ctctgcagag gcatccggat cccagcaagc gagctttagc agggaagtca cttcaccatc 1621 aacattcctg ccccagatgg ctttgtgatt ccctccagtg aagcagcctc cttatatttg 1681 gcccaaactc atcttgatca accaaaaaca tgtttctgcc ttctttatgg gacttaagtt 1741 ttttttttct cctctccatc tctaggatgt cgtctttggt gagatctcta ttatatcttg 1801 tatggtttgc aaaagggctt cctaaaaata aaaacccgaa ttc // LOCUS TRBBS12 4004 bp ds-DNA INV 14-AUG-1990 DEFINITION T.brucei expression site-associated protein (BS1.1 and BS1.2) genes, 5' end and complete cds. ACCESSION M34520 KEYWORDS expression site-associated protein. SOURCE T.brucei (strain 427) bloodstream stage isolate MiTat 1.5 DNA, clone BS1.1-2. ORGANISM Trypanosoma brucei Eukaryota; Animalia; Protozoa; Sarcomastigophora; Mastigophora; Kinetoplastida; Trypanosomatina; Trypanosomatidae. REFERENCE 1 (bases 1 to 4004) AUTHORS Hobbs,M.R. and Boothroyd,J.C. TITLE An expression-site-associated gene family of trypanosomes is expressed in vivo and shows homology to a VSG gene JOURNAL Mol. Biochem. Parasitol. (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by M.Hobbs, 22-MAY-1990. FEATURES from to/span description pept < 1 129 expression site-associated protein (BS1.1) pept 1318 2151 expression site-associated protein (BS1.2) site 3083 3604 RIME sequence BASE COUNT 1181 a 650 c 1052 g 1121 t ORIGIN 1 bp upstream of EcoRI site. 
1 gaattcaact atactgcaga accggtccgg ggacatttca cgggagcggg gtccaacgcg 61 gcagcagtac atttgagtgt ttctaccgct gcactttgtt tttcagtttt attgttggga 121 gtgctgtgaa ggaaggatgc gacagaacgt gcgctgctta gtgtgaaaga ttatggtaat 181 ggagggttgt gaaagattgg gggaacaaaa acctatttct tttatttggg ggaacaaatg 241 ggcaaaagta acgtaagttt ccagtgggag tggtatgtgt gtgtgtatgg ggctggctaa 301 ggaaagatgt gagttcggca tgtggtatgt acaagctacg aaaacgtgtg aaacaaaacg 361 agatgtaagg ggaaaatgta acaaccaact atgttaaatt tcaggagact atttttcaaa 421 tttagttaca acaaagtaaa tgtcaaataa tgccaactat aaaggaaaac tagactaatt 481 ctactgttta atgtgacgag aattgtggcg tgaaacctag taactgctta tcatggcgtc 541 gaatttcact tctccagaat ctttatttta ggtgcgctat agatgatagt ccaaagagaa 601 ggaattacaa ttattataac ccatccaaag ctatgtcaga gactcctctt ctccgagtga 661 ttcttttcta tgtcacttgt gaaggtcgat cttggaacaa tatcgaacgt gtgaagcctg 721 aaatggggcg tttttttgct tttttttttg gtggttgtta gagttatcta ccgctatggt 781 atgatgcata gaggtaacgc taaaaataag tgacaaggtg ctttagttga gcgtatggta 841 ggtggaagcg gctaatgatt gctagcagca ccaacagagg aagttaattg ttgacgtagc 901 cgtttgtgaa tgagatagaa ggttattggg tggaacatta tcttggaaag agtttttctt 961 ccactaaaaa tgtgtagagt ggtggtttcg agaaaatggt ttagatattg aggaagaaac 1021 aaggatttga ggtgcaagtt gatgcgctgt atgatttgtc ttgccgttgc tgttacttat 1081 tgttgagttg gtatttgagg aaaatgtttt ttcagtgtgc gttgttatat acggctgtga 1141 gttggtgatg tttatttgaa tgatagcttg ataggataag atgaaataat actaagcatg 1201 tattcttctg gctgttcgcc gaaatggtaa tctatgttac ccatagcacg aaaaaagttt 1261 ttttactcag taactgaagc ggaaagtttt aaaaagcata atttaaaaaa gttgataatg 1321 atgaagtttt ggtttgtgct gttggctctt ttgggaaaag aaacacatgc atattatgaa 1381 aataaaagga atgcattaaa tgcaaccgcc gctaataaag tgtgtgggct atcgaactat 1441 cttaaaggaa tagcgcacag agtaaacagc gagagtgccg tggttacgaa aaaactatca 1501 gatttgaaaa tgagaagcat ccagttgcag ttatcagtaa tgcggaacag agtgccttct 1561 ggcgagcagg attgtaaaga catcaggaca ctcttgaaaa cagtattgag gaatgagttt 1621 acattccagc aggagttgga ggaaatgagg aagccatccg cgttagcagc agctgcagct 1681 gggatagcag ctggaagact 
ggaagagtgg atttttgtat ttgcacaggc agcaggcagg 1741 tcatcacagt tttgcataag tgtggggaag catatcccgg ctgagcacgg caacttgcaa 1801 gaatgttttg atggaataat tggacctgaa accctttaca aaattgagga ttcacgtgta 1861 aaagagtcgg cgcagaaaag cttgcaactc catgaagtgt tatcatccat ttccttcaat 1921 agcttgggtg cggaaaatat ccgaggagga aatgggaggc atggatgtaa tttggtgaga 1981 acagatactg acggtgtatt ggaagggggg tcggtgagac ggcacaactt aacgtggggt 2041 ggtggagtga tgaattttgg atcctatcaa aacggaagta tgtacgttga gggtggtgaa 2101 tacgtgatgc tacagagtac ggtgcagtac ggtggaccga ggatccagta aagtgagcat 2161 atttaaggat gtcatccgcc tgtttgcgcg cttccaagaa gcaaaaaatg aagtgatgaa 2221 taaaataaaa actactgtgg atgaattggc gaaatgcata gggcagaaag aggttgagct 2281 cactgatgat cagctttacg aggaatttat ctgggagacc atacacagat tggagctgtc 2341 aaagagagtg agtgaacaac tctctttagg tgaagaagag gagacgatcc tgaaatctaa 2401 ctatactgca gaaccggtcc gtggactttc acggtagcgg ggtccaacgc ggcagcagta 2461 catttgagtg tttctaccgc tgcactttgt ttttcagttt tattgttggg agtgctgtga 2521 aggaaggatg cgacagaagc tgcgctgctt agtgtgaaag attatggtaa tggagggttg 2581 tgaaagattg ggggaacaaa aacctatttc ttttatttgg gggaacaaat gggcaaaagt 2641 aacgtaagtt tccagtggga gtggtatgtg tgtgtgtatg gggctggcta aggaaagatg 2701 tgagttcggc atgtggtatg tacaagctac gaaaacgtgt gaaacaaaac gagatgtaag 2761 gggaaaatgt aacaaccaac tatgttaaat ttcaggagac tatttttcaa atttagttac 2821 aacaaagtaa atgtcaaata atgccaacta taaaggaaaa ctagactaat tctactgttt 2881 aatgtgacga gaattgtggc gtgaaaccta gtaactgctt atcatggcgt cgaatttcac 2941 ttctccagaa tctttatttt agtgcgctat agatgatagt ccaaagagaa ggaattacaa 3001 ttattataac ccatccaaag ctatgtcaga gactcctctt ctccgagtga ttcttttcta 3061 tgtctcttgt gaaggtcgat ctttttgcta ccccctggcg atgccggcca cctcaacgtg 3121 gtgccagggt ccagtgaccc cgtatcattg ggggaagcca agagccagca gcgttccttt 3181 catggggaac actgctgtag ctccggctac gcatcataca gcacagggat cagcaggcgt 3241 cttgctggga caccgttttt catttgtcgg tccctgggca cgtgccagcg tcgatcagca 3301 gtatcatccg cgctaagatg ctgctgtccg gtgatgtgga cctccaaaaa aggattgcca 3361 ttttggcatc ttttggagag tccagggtgg 
gaggcttctc gccccatctg ctgtattccg 3421 ttcaactgcg gacgtacaac aaaaaattat agagggtgtg ttaggatgaa tgaaaaaggg 3481 agactctgcc acagtcgcca gaccgatagc actctcaggg ctctacggtg ttggctgatg 3541 gccgcgcaga ggggggaaac tctcacgaag gcacgaagaa aattcaaaaa aaaatttttg 3601 ctaccacagg ctgtaaggat aacgctggag atggagccga ctgtggtttt tcttgtgaat 3661 gcgtatcttc gtaaccggtt gttaagaagg gatcggacga aggaatttgt ctgagtatga 3721 ggtaatgtcg aggttttctt tcattggatg tagtaataat ctccaaattt gttggaacat 3781 ataatattat tgcaataaat gcttgaagag tggaattatt gatggattga cttgagtatc 3841 atgattccaa tgaacactct tataatcctt cgaatagcaa acgtatgaac aaattaacgt 3901 agcgtcatgt agacatggac atttttgtaa tgatacgact gctattgcac atctgaaaat 3961 ttctatgacg gaccagtgcc gttaatcacc gtacaattga attc // LOCUS TRBBS134 2396 bp ds-DNA INV 14-AUG-1990 DEFINITION T.brucei expression site-associated protein (BS1.3 and BS1.4) genes, 3' and 5' ends. ACCESSION M34521 KEYWORDS expression site-associated protein. SOURCE T.brucei (strain 427) bloodstream stage isolate MiTat 1.5 DNA, clone BS1.3-4. ORGANISM Trypanosoma brucei Eukaryota; Animalia; Protozoa; Sarcomastigophora; Mastigophora; Kinetoplastida; Trypanosomatina; Trypanosomatidae. REFERENCE 1 (bases 1 to 2396) AUTHORS Hobbs,M.R. and Boothroyd,J.C. TITLE An expression-site-associated gene family of trypanosomes is expressed in vivo and shows homology to a VSG gene JOURNAL Mol. Biochem. Parasitol. (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by M.Hobbs, 22-MAY-1990. FEATURES from to/span description pept 1836 > 2396 expression site-associated protein (BS1.4) ORF < 1 1206 expression site-associated protein (BS1.3) BASE COUNT 744 a 357 c 642 g 653 t ORIGIN 1 bp upstream of HindIII site. 
1 aagcttgcaa ctccatgaag tgttatcatc catttccttc ggtagcttgg gtgtgaaaaa 61 tattcgagga ggaaatggga gggatggatg caatttggtg agaacagaca ctaacggtat 121 attaaatggg ggttcaccaa cacggcacaa cttaacgtgg ggtggtggag tgatgaattt 181 tggatcctat caaaacggaa gtatgtacgt tgagggtggt gaatacggtg atgctacaga 241 gtacggtgcg gtacggtgga ctgaagatcc cagtaaagtg agcatatttg aggatgtcat 301 tcgcctgttt gcgcgcttcc aagaagcaaa aaatgcagtg atgaggagaa taaaaactac 361 tgtggatgaa ttgacgaaat gtatagggca gaaggaggct gaactcacta atgatcagat 421 ttacgaggaa tttatctggg agaccataca cagattggag ctgtcaaaga gagtgagtga 481 acaaccctct ttaggtgaag aagaggagac aatcctgaaa tctaactata ctgcagaacc 541 tgtccgtgga cctttcacgg gagcggggtc caacacagta gcagtacatt tgagtgtttc 601 taccgctgca ctctgttgtt tagctttatt gttgggagtg ctgtgaggga aggatgcgac 661 agaaactgcg ctgcttagcg tgaaagatta tggtaatgga gggttgggaa agattggggg 721 aacaaaaacc tatttctttt atttggggga acaaatgggc aaaagtaacg tgagtttcca 781 gcgggagtgg tgtgtgtgtg catggggctg actaaggaaa gatgtaagtt cggaatgtgg 841 tatgtacaag ctacgaaaac gtgtgaaaca aaacgatatg taaggggaaa atgtaacaac 901 caactatgtt aaatttcaag agactatttt tcaaatttag ttacaacaaa gtaaatgtca 961 aataatgcca actataaagg aaaactagac taattctact gtttaatgtg acgagaattg 1021 tggcgtgaaa cctagtaacg accacgtcac gatgtggaat ttcacttctc cagaatcttt 1081 gttttagtgc gctatagatg atagtccaaa gagaaggaat tgcagttatt ataacccatc 1141 caaagataga tcaaagactc ctcttctccg agtgattctt ttctatgtca cttgtggaag 1201 gtcgatcttg gaacaatatc gaacgtgtga agcctgaaat gaggcgttat tattgcttct 1261 ctttttggtg gttgttagag ttatataccg ctatggtatg atgcacagag gtaacgctaa 1321 aaataagtga caaggcgttg tagtagggcg tatggtagtt ggaagcggct aatgattgct 1381 ggcctcacca acagaggaag ttaattgttg acgtagccgt ttgtgaatga gatagaaggt 1441 tatcgggtgg aacattatct tggaaagagt ttttcatcac ctaaaaatgt gtagagtggt 1501 ggtttggaga aaatggttta gatattgagg aggaaacaag gatttgaggt gcaagttgat 1561 gcgctgtaag atttgtatta ccgttggtgt tacttattgt tgagtgagta tttgagggaa 1621 atgttatggg aggtgcttcg ttgttacata cggctctgag tcagtggtgt tcatttgaat 1681 gaaatcttga taggataaga 
tgaaataata ctaagcatgt attcttctgt gctgccgccg 1741 aaatggtaat ctatcttacc tacagcacga aaaatttatt ttttttactc agtaacaaag 1801 cggaaagttt taaaagcata atttaaaaaa gtgatatgat gaggttttgg tttgtgttgt 1861 tggccctttt gggaaaaaaa acacatgcat attatgaaaa tgaaaggaat gcattaaacg 1921 caaccgccgc taataaagtt tgtgggctat cgaactatct taaaggagta gcgcacagag 1981 taaacagcga gagtgctgtg gttacggaaa aactatcaga tttgaaaatg agaagcatcc 2041 agttgcagct atcagtaatg cggaacagag tgccttctgg cgagaaggat tgtaaagaca 2101 taaggacact cttgaaaaca gtattgagga atgagtttac attccagcag gagttggagg 2161 aaatgaggaa cgcatccgcg ttagcagcag ctgcagctgg gttggcagct ggaagactgg 2221 aagagtggat ttttgtattt gcacaggcag cagacaggtc atcacagttt tgcataagtg 2281 tggggaagca tatcgcggct gagcacggca acttgcaaga atgttttgat ggaacaattg 2341 gacctgaaac cctttacaaa attgaggatt cacgtgtaaa agagtcggcg cagaaa // LOCUS TRBBS15 714 bp ss-mRNA INV 14-AUG-1990 DEFINITION T.brucei expression site-associated protein (BS1.5) gene, partial cds. ACCESSION M34522 KEYWORDS expression site-associated protein. SOURCE T.brucei (strain 427) bloodstream stage isolate MiTat 1.4, cDNA to mRNA, clone BS1.5. ORGANISM Trypanosoma brucei Eukaryota; Animalia; Protozoa; Sarcomastigophora; Mastigophora; Kinetoplastida; Trypanosomatina; Trypanosomatidae. REFERENCE 1 (bases 1 to 714) AUTHORS Hobbs,M.R. and Boothroyd,J.C. TITLE An expression-site-associated gene family of trypanosomes is expressed in vivo and shows homology to a VSG gene JOURNAL Mol. Biochem. Parasitol. (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by M.Hobbs, 22-MAY-1990. 
FEATURES from to/span description pept < 1 > 714 expression site-associated protein (BS1.5) (AA at 1) BASE COUNT 226 a 116 c 203 g 169 t ORIGIN 1 agcgagagtg ccgtggttac ggaaaaacta tcagatttga aaatgagaag catccagttg 61 cagttatcag taatgcggaa cagagtgcct tctggcgaga aggattgtaa agacatcagg 121 acactcttga aaacagtatt gaggaatgag tttacattcc agcaggagtt ggaggaaatg 181 aggaacgcat ccgcgttagc agcagctgca gctgggttgg cagctggaag actggaagag 241 tggatttttg tatttgcaca ggcagcagac aggtcatcac agttttgcat aagtgtgggg 301 aaaactatcc caccggaaca aaacaacttg caagaatgtt ttgatggaac aattggacct 361 gaaacccttt acaaaattga ggattcacgt gtaaaagagt ctgcaaagaa aagcttgcaa 421 ctccatgaag cgttatcatc catttccttc aatagtttgg gtgcggaaag tatccgagga 481 ggaaatggga aggatggatg taatttggtg agaacagaca ctgacggtat attaaatggg 541 ggttcaccaa cacggcacaa tttaacgtgg ggtggtggag tgatgaattt tggatcctat 601 caaaacggaa gtatgtacgt tgagggggga gaatacggtg atgctacaga gtacggtgca 661 gtacggtgga ctgaagatcc cagtaaagta agcatattta aggatgtcat tcgc // LOCUS ECOPOLBDA 4666 bp ds-DNA BCT 14-AUG-1990 DEFINITION E.coli DNA polymerase (polB) gene, 5' flank. ACCESSION M35371 KEYWORDS DNA polymerase; polB gene. SOURCE E.coli (strain W3110) DNA. ORGANISM Escherichia coli Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Facultatively anaerobic rods; Enterobacteriaceae. REFERENCE 1 (bases 1 to 4666) AUTHORS Chen,H., Sun,Y., Stark,T., Beattie,W. and Moses,R. TITLE Nucleotide sequence and deletion analysis of the polB gene of E.coli JOURNAL Unpublished (1990) STANDARD simple staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by H.Chen, 20-JUN-1990. 
Author address: H.Chen Baylor College of Medicine Cell Biology and IMG One Baylor Plaza Houston, TX 77030 email: hchen@vulture.bcm.tmc.edu FEATURES from to/span description signal 1209 1214 -35 region signal 1229 1234 -10 region binding 1250 1255 ribosome binding site site 1193 1208 lexA box BASE COUNT 1100 a 1278 c 1257 g 1031 t ORIGIN 1 aagcttgcat gcctgcaggt cgactctaga ggatcctcgc tggtggcgcg caccataccg 61 tcttcagcca tgcactgaac ctcaacgata tgcgccaatt cgccgagatg cacgacattg 121 aaatcacggt gattgataac gacacacgcc tgccagcgtt taaagacgcg ctgcgctgga 181 acgaagtgta ttacgggttt cgtcgctaag tagccgcatc cggtatgtaa cgcctgatgc 241 gacgctgacg cgtcttatct ggcctacacg ctgcgatttt gtaggccgga taagcaaagc 301 gcatccggca ttcaacgcct gatgcgacgc tggcgcgtct tatcaggcct acgcgctgcg 361 attttgtagg ccggataagc aaagcgcatc cggcattcaa cgcctgatgc gacgctggcg 421 cgtcttatca ggcctacacg ctgcgatttt gtaggccgga taagcaaagc gcatccggca 481 cgaaggagtc aacatgttag aagatctcaa acgccaggta ttagaagcca acctggcgct 541 gccaaaacac aacctggtca cgctcacatg gggcaacgtc agcgccgttg atcgcgagcg 601 cggcgtcttt gtgatcaaac cttccggcgt cgattacagc gtcatgaccg ctgacgatat 661 ggtcgtggtt agcatcgaaa ccggtgaagt ggttgaaggt acgaaaaagc cctcctccga 721 cacgccaact caccggctgc tctatcaggc attcccctcc attggcggca ttgtgcatac 781 gcactcgcgc cacgccacca tctgggcgca ggcgggtcag tcgattccag caaccggcac 841 cacccacgcc gactatttct acggcaccat tccctgcacc cgcaaaatga ccgacgcaga 901 aatcaacggc gaatatgagt gggaaaccgg taacgtcatc gtagaaacct ttgaaaaaca 961 gggtatcgat gcagcgcaaa tgcccggcgt tctggtccat tcccacggcc cgtttgcatg 1021 gggcaaaaat gccgaagatg cggtgcataa cgccatcgtg ctggaagagg tcgcttatat 1081 ggggatattc tgccgtcagt tagcgccgca gttaccggat atgcagcaaa cgctgctgga 1141 taaacactat ctgcgtaagc atggcgcgaa ggcatattac gggcagtaat gactgtataa 1201 aaccacagcc aatcaaacga aaccaggcta tactcaagcc tggttttttg atggattttc 1261 agcgtggcgc aggcaggttt tatcttaacc cgacactggc gggacacccc gcaagggaca 1321 gaagtctcct tctggctggc gacggacaac gggccgttgc aggttacgct tgcaccgcaa 1381 gagtccgtgg cgtttattcc cgccgatcag 
gttccccgcg ctcagcatat tttgcagggt 1441 gaacaaggct ttcgcctgac accgctggcg ttaaaggatt ttcaccgcca gccggtgtat 1501 ggcctttact gtcgcgccca tcgccaattg atgaattacg aaaagcgcct gcgtgaaggt 1561 ggcgttaccg tctacgaggc cgatgtgcgt ccgccagaac gctatctgat ggagcggttt 1621 atcacctcac cggtgtgggt cgagggtgat atgcacaatg gcactatcgt taatgcccgt 1681 ctgaaaccgc atcccgacta tcgtccgccg ctcaagtggg tttctataga tattgaaacc 1741 acccgccacg gtgagctgta ctgcatcggc ctggaagcgt gcgggcagcg catcgtttat 1801 atgctggggc cggagaatgg cgacgcctcc tcgcttgatt tcgaactgga atacgtcgcc 1861 agccgcccgc agttgctgga aaaactcaac gcctggtttg ccaactacga tcctgatgtg 1921 atcatcggtt ggaacgtggt gcagttcgat ctgcgaatgc tgcaaaaaca tgccgagcgt 1981 taccgtcttc cgctgcgtct tgggcgcgat aatagcgagc tggagtggcg cgagcacggc 2041 tttaaaaacg gcgtcttttt tgcccaggct aaaggtcggc taattatcga cggtatcgag 2101 gcgctgaaat ccgcgttctg gaatttctct tcattctcgc tggaaactgt cgctcaggag 2161 ctattaggcg aaggaaaatc tatcgataac ccgtgggatc gaatggacga aattgaccgc 2221 cgtttcgccg aagataaacc tgcgctggca acttataacc tgaaagattg cgagctggtg 2281 acgcagatct tccacaaaac tgaaatcatg ccatttttac tcgaacgggc aacggtgaac 2341 ggcctgccgg tggaccgaca cggcggttcg gtggcggcat ttggtcatct ctattttccg 2401 cgaatgcatc gcgctggtta tgtcgcgcct aatctcggcg aagtgccgcc gcacgccagc 2461 cctggcggct acgtgatgga ttcacggcca gggctttatg attcagtgct ggtgctggac 2521 tataaaagcc tgtacccgtc gatcatccgc acctttctga ttgatcccgt cgggctggtg 2581 gaaggcatgg cgcagcctga tccagagcac agtaccgaag gttttctcga tgcctggttc 2641 tcgcgagaaa aacattgcct gccggagatt gtgactaaca tctggcacgg gcgcgatgaa 2701 gccaaacgcc agggtaacaa accgctgtcg caggcgctga aaatcatcat gaatgccttt 2761 tatggcgtgc tcggcaccac cgcctgccgc ttcttcgatc cgcggctggc atcgtcgatc 2821 accatgcgtg gtcatcagat catgcggcaa accaaagcgt tgattgaagc acagggctac 2881 gacgttatct acggcgatac cgactcaacg tttgtctggc tgaaaggcgc acattcggaa 2941 gaagaagcgg cgaaaatcgg tcgtgcactg gtgcagcacg ttaacgcctg gtgggcggaa 3001 acgctgcaaa aacaacggct gaccagcgca ttagaactgg agtatgaaac ccatttctgc 3061 cgttttctga tgccaaccat tcgcggagcc gataccggca 
gtaaaaagcg ttatgccgga 3121 ctgattcagg agggcgacaa gcagcggatg gtgtttaaag ggctggaaac cgtgcgcacc 3181 gactggacgc cgctggccca gcagtttcag caggagctat acctgcgcat cttccgcaac 3241 gagccatatc aggaatatgt acgcgaaacc atcgacaaac tgatggcggg tgaactggat 3301 gcgcgactgg tttaccgtaa acgccttcgc cgtccgctga gcgagtatca gcgtaatgtg 3361 ccgcctcatg tacgcgccgc tcgccttgcc gatgaagaaa accaaaagcg tggtcgcccc 3421 ttgcaatatc agaatcgcgg caccattaag tacgtatgga ccaccacagg cccggagccg 3481 cctggactac caacgttcac cactggatta cgaacactat ctgacccgcc agctacaacc 3541 cgtggcggag ggaatactcc cttttattga ggataatttt gctacactta tgaccgggca 3601 acttgggcta ttttgagcaa aaaaaagagt tcgccagata ccattttgat gcgtgacgaa 3661 tgctttgcca tccagtacca tagcgccctt tccattcctg gacctgaata acaccactac 3721 ctcataagca cggtagcggg tggttattgc ctgcaattaa agatatagag ccgaacacat 3781 atgcctttta cacttggtca acgctggatc agcgatacag aaagcgaatt gggacttgga 3841 accgttgtcg cggtggatgc gcgaactgtc actttacttt tcccatctac tggtgaaaac 3901 cgtctgtacg cacgcagtga ttcccccgtg acccgcgtga tgttcaaccc tggtgatacc 3961 attaccagcc atgacggctg gcagatgcaa gtcgaagaag taaaagaaga aaatggcttg 4021 ctgacctata tcggtactcg cctggatact gaagaggtcc ggcgtagccc tgcgtgaagt 4081 tttccttgat agcaaactgg tgttcagcaa accgcaggca ccgtctgttt gccgggcaga 4141 ttgaccgtat ggaccgcttt gcgctgcgtt atcgcgcgcg taaatattcc agcgaacagt 4201 tccgtatgcc gtacagcggc ctgcgcggtc agcgtaccag cctgatccgc atcagctcaa 4261 catcgctcat gatgttggtc gccgccacgc gccgcgcgtc ctgctggctg acgaagtggg 4321 tttagggaaa accattgaag ccgggatgat cctgcatcag caactgctct ctggcgctgc 4381 tgaacgtgtg ctaattatcg tcccggaaac cttacagcat cagtggctgg tagaaatgct 4441 gcgccgtttc aacctgcgct ttgcgctatt tgatgatgag cgttatgccg aagctcagca 4501 cgatgcttac aacccgtttg acaccgtgaa gcggcgcacg aaaaacgcga aagcgtttca 4561 cgataaatgc gaaaacttta gctttcgcgc ttcaaatgaa acagatgtat taattactgc 4621 tttttattca ttacatgggg atccccgggt accgagctcg aattcc // LOCUS HUMBIGFII 1387 bp ss-mRNA PRI 14-AUG-1990 DEFINITION Human insulin-like growth factor binding protein 2 (IGFBP2) mRNA, complete cds. 
ACCESSION M35410 KEYWORDS insulin-like growth factor binding protein 2. SOURCE Human 67-year old retina, cDNA to mRNA, clone AS200. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 1387) AUTHORS Agarwal,N., Hsieh,C.-L., Sills,D., Swaroop,M., Desai,B., Francke,U. and Swaroop,A. TITLE Sequence analysis, expression and chromosomal localization of a gene, isolated from a subtracted human retina cDNA library, that encodes an insulin-like growth factor binding protein (IGFBP2) JOURNAL Exp. Eye Res. (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by A.Swaroop, 20-JUN-1990. FEATURES from to/span description pept 64 1050 insulin-like growth factor binding protein 2 (IGFBP2) precursor /hgml_locus_uid="LN0220S" /nomgen="IBP1" /map="7p13-p12" sigp 64 180 insulin-like growth factor binding protein 2 signal peptide matp 181 1047 insulin-like growth factor binding protein 2 mRNA < 1 1387 IGFBP2 mRNA signal 1362 1367 poly-A signal binding 175 197 ATP binding site BASE COUNT 232 a 455 c 477 g 223 t ORIGIN 1 gtgccacctg cccgcccgcc cgctcgctcg ctcgcccgcc gcgccgcgct gccgaccgcc 61 agcatgctgc cgagagtggg ctgccccgcg ctgccgctgc cgccgccgcc gctgctgccg 121 ctgctgccgc tgctgctgct gctactgggc gcgagtggcg gcggcggcgg ggcgcgcgcg 181 gaggtgctgt tccgctgccc gccctgcaca cccgagcgcc tggccgcctg cgggcccccg 241 ccggttgcgc cgcccgccgc ggtggccgca gtggccggag gcgcccgcat gccatgcgcg 301 gagctcgtcc gggagccggg ctgcggctgc tgctcggtgt gcgcccggct ggagggcgag 361 gcgtgcggcg tctacacccc gcgctgcggc caggggctgc gctgctatcc ccacccgggc 421 tccgagctgc ccctgcaggc gctggtcatg ggcgagggca cttgtgagaa gcgccgggac 481 gccgagtatg gcgccagccc ggagcaggtt gcagacaatg gcgatgacca ctcagaagga 541 ggcctggtgg agaaccacgt ggacagcacc atgaacatgt tgggcggggg aggcagtgct 601 ggccggaagc ccctcaagtc gggtatgaag gagctggccg tgttccggga gaaggtcact 661 gagcagcacc ggcagatggg 
caagggtggc aagcatcacc ttggcctgga ggagcccaag 721 aagctgcgac caccccctgc caggactccc tgccaacagg aactggacca ggtcctggag 781 cggatctcca ccatgcgcct tccggatgag cggggccctc tggagcacct ctactccctg 841 cacatcccca actgtgacaa gcatggcctg tacaacctca aacagtgcaa gatgtctctg 901 aacgggcagc gtggggagtg ctggtgtgtg aaccccaaca ccgggaagct gatccaggga 961 gcccccacca tccgggggga ccccgagtgt catctcttct acaatgagca gcaggaggct 1021 cgcggggtgc acacccagcg gatgcagtag accgcagcca gccggtgcct ggcgcccctg 1081 ccccccgccc ctctccaaac accggcagaa aacggagagt gcttgggtgg tgggtgctgg 1141 aggattttcc agttctgaca cacgtattta tatatggaaa gagaccagca ccgagctcgg 1201 cacctccccg gcctctctct tcccagctgc agatgccaca cctgctcctt cttgctttcc 1261 ccgggggagg aagggggttg tggtcgggga gctggggtac aggtttgggg agggggaaga 1321 gaaattttta tttttgaacc cctgtgtccc ttttgcataa gattaaagga aggaaaagta 1381 aagtgtg // LOCUS HUMLBPA 1431 bp ss-mRNA PRI 14-AUG-1990 DEFINITION Human lipopolysaccharide binding protein (LBP) mRNA, complete cds. ACCESSION M35533 KEYWORDS lipopolysaccharide binding protein. SOURCE Human liver, cDNA to mRNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (sites; for [2]) AUTHORS Schumann,R.R., Leong,S.R., Flaggs,G.W., Gray,P.W., Wright,S.D., Mathison,J.C., Tobias,P.S. and Ulevitch,R.J. TITLE Structure and function of lipopolysaccharide binding protein JOURNAL Science (1990) In press STANDARD full staff_review REFERENCE 2 (bases 1 to 1431) AUTHORS Schumann,R.R., Leong,S.R., Flaggs,G.W., Gray,P.W., Wright,S.D., Mathison,J.C., Tobias,P.S. and Ulevitch,R.J. JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by P.S.Tobias, 21-JUN-1990. Author address: P.S.Tobias Department of Immunology, IMM-12 10466 N. 
Torrey Pines Rd La Jolla, CA 92037 FEATURES from to/span description pept 1 > 1431 lipopolysaccharide binding protein (LBP) precursor sigp 1 75 lipopolysaccharide binding protein signal peptide matp 76 1431 lipopolysaccharide binding protein BASE COUNT 319 a 417 c 359 g 336 t ORIGIN 1 atgggggcct tggcaagagc cctgccgtcc atactgctgg cattgctgct tacgtccacc 61 ccagaggctc tgggtgccaa ccccggcttg gtcgccagga tcaccgacaa gggactgcag 121 tatgcggccc aggaggggct attggctctg cagagtgagc tgctcaggat cacgctgcct 181 gacttcaccg gggacttgag gatcccccac gtcggccgtg ggcgctatga gttccacagc 241 ctgaacatcc acagctgtga gctgcttcac tctgcgctga ggcctgtccc cggccagggc 301 ctgagtctca gcatctccga ctcctccatc cgggtccagg gcaggtggaa ggtgcgcaag 361 tcattcttca aactacaggg ctcctttgat gtcagtgtca agggcatcag catttcggtc 421 aacctcctgt tgggcagcga gtcctccggg aggcccacag gttactgcct cagctgcagc 481 agtgacatcg ctgacgtgga ggtggacatg tcgggagatt cggggtggct cttgaacctc 541 ttccacaacc agattgagtc caagttccag aaagtactgg agagcaggat ttgcgaaatg 601 atccagaaat cagtgtcctc cgatctacag ccttatctcc aaactctgcc agttacaaca 661 gagattgaca gtttcgccga cattgattat agcttagtgg aagcccctcg ggcaacagcc 721 cagatgctgg aggtgatgtt taagggtgaa atctttcatc gtaaccaccg ttctccagtt 781 accctccttg ctgcagctga ggaacacaac aaaatggtct actttgccat ctcggattat 841 gtcttcaaca cggccagcct ggtttatcat gaggaaggat atctgaactt ctccatcaca 901 gatgacatga taccgcctga ctctaatatc cgactgacca ccaagtcctt ccgacccttc 961 gtcccacggt tagccaggct ctaccccaac atgaacctgg aactccaggg atcagtgccc 1021 tctgctccgc tcctgaactt cagccctggg aatctgtctg tggaccccta tatggagata 1081 gatgcctttg tgctcctgcc cagctccagc aaggagcctg tcttccggct cagtgtggcc 1141 actaatgtgt ccgccacctt gaccttcaat accagcaaga tcactgggtt cctgaagcca 1201 ggaaaggtaa aagtggaact gaaagaatcc aaagttggac tattcaatgc agagctgttg 1261 gaagcgctcc tcaactatta catccttaac accctctacc ccaagttcaa tgataagttg 1321 gccgaaggct tcccccttcc tctgctgaag cgtgttcagc tctacgacct tgggctgcag 1381 atccataagg acttcctgtt cttgggtgcc aatgtccaat acatgagagt t // LOCUS HUMPEC12L 2344 bp ds-DNA PRI 
14-AUG-1990 DEFINITION Human cell 12-lipoxygenase gene, complete cds. ACCESSION M35418 KEYWORDS lipoxygenase. SOURCE Human platelet/erythroleukemia cell DNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 2344) AUTHORS Funk,C.D., Furci,L. and FitzGerald,G.A. TITLE Molecular cloning, primary structure, and expression of the human platelet/erythroleukemia cell 12-lipoxygenase JOURNAL Proc. Natl. Acad. Sci. U.S.A. 87, 5638-5642 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by C.D.Funk, 20-JUN-1990. FEATURES from to/span description pept 42 2033 12-lipoxygenase signal 2324 2329 poly-A signal mRNA < 1 2344 12-lipoxygenase mRNA BASE COUNT 514 a 692 c 642 g 496 t ORIGIN 1 ggaggacccg gctcccctcg cctaagctgc tggggggcgc catgggccgc taccgcatcc 61 gcgtggccac cggggcctgg ctcttctccg ggtcgtacaa ccgcgtgcag ctttggctgg 121 tcgggacgcg cggggaggcg gagctggagc tgcagctgcg gccggcgcgg ggcgaggagg 181 aggagtttga tcatgacgtt gcagaggact tggggctcct gcagttcgtg aggctgcgca 241 agcaccactg gctggtggac gacgcgtggt tctgcgaccg catcacggtg cagggccctg 301 gagcctgcgc ggaggtggcc ttcccgtgct accgctgggt gcagggcgag gacatcctga 361 gcctgcccga gggcaccgcc cgcctgccag gagacaatgc tttggacatg ttccagaagc 421 atcgagagaa ggaactgaaa gacagacagc agatctactg ctgggccacc tggaaggaag 481 ggttacccct gaccatcgct gcagaccgta aggatgatct acctccaaat atgagattcc 541 atgaggagaa gaggctggac tttgaatgga cactgaaggc aggggctctg gagatggccc 601 tcaaacgtgt ttacaccctc ctgagctcct ggaactgcct agaagacttt gatcagatct 661 tctggggcca gaagagtgcc ctggctgaga aggttcgcca gtgctggcag gatgatgagt 721 tgttcagcta ccagttcctc aatggtgcca accccatgct gttgagacgc tcgacctctc 781 tgccctccag gctagtgctg ccctcgggga tggaagagct tcaggctcaa ctggagaaag 841 aacttcagaa tggttccctg tttgaagctg acttcatcct tctggatgga attccagcca 901 acgtgatccg aggagagaag caatacctgg ctgcccccct cgttatgctg aagatggagc 961 ccaatgggaa 
gctgcagccc atggtcatcc agattcagcc tcccagcccc agctctccaa 1021 ccccaacact gttcctgccc tcagaccccc cacttgcctg gctcctggca aagtcctggg 1081 tccgaaattc agatttccaa ctgcacgaga tccagtatca cttgctgaac actcacctgg 1141 tggctgaggt catcgctgtc gccaccatgc ggtgcctccc aggactgcac cccatcttca 1201 agttcccgat cccccatatc cgctacacca tggaaatcaa cacccgggcc cggacccaac 1261 tcatctcaga tggaggaatt tttgataagg cagtgagcac aggtggaggg ggccatgtac 1321 agttgctccg tcgggcggca gctcagctga cctactgctc cctctgtcct cctgacgacc 1381 tggctgaccg gggcctgctg ggactcccag gtgctctcta tgcccatgat gctttacggc 1441 tctgggagat cattgccagg tatgtggagg ggatcgtcca cctcttctac caaagggatg 1501 acatagtgaa gggggaccct gagctgcagg cctggtgtcg ggagatcacg gaggtggggc 1561 tgtgccaggc ccaggaccga ggtttccctg tctccttcca gtcccagagt caactctgcc 1621 atttcctcac catgtgcgtc ttcacgtgca ctgcccagca tgccgccatc aaccagggcc 1681 agctggactg gtatgcctgg gtccctaatg ctccatgcac aatgcggatg cccccaccca 1741 ccaccaagga agatgtgacg atggccacag tgatggggtc actacctgat gtccggcagg 1801 cctgtcttca aatggccatc tcatggcatc tgagtcgccg ccagccagac atggtgcctc 1861 tggggcacca caaagaaaaa tatttctcag gccccaagcc caaagctgtg ctaaaccaat 1921 tccgaacaga tttggaaaag ctagaaaagg agattacagc ccggaatgag caacttgact 1981 ggccctatga atatctgaag cccagctgca tagagaacag tgtcaccatc tgagccctag 2041 agtgactcta cctgcaagat ttcacatcag ctttaggact gacatttcta tcttgaattt 2101 catgctttcc taaagtctct gctgctaagg ctctatttcc tcccccagtt aaacccctac 2161 attagtatcc cactagccca ggggagcagt aaactttctc tgcaaagact agatcctttt 2221 ttacgctttg cagaccgcat agtcactgtc tcaactactc agctctcctg ctgcagcatg 2281 aaggcagcca cagacaacat ggaaatgagt gtgactatgt tccaataaaa ctttatggac 2341 actg // LOCUS HUMRALBA 1327 bp ss-mRNA PRI 14-AUG-1990 DEFINITION Human GTP-binding protein (RALB) mRNA, complete cds. ACCESSION M35416 KEYWORDS GTP-binding protein. SOURCE Human retina, cDNA to mRNA, clone AS181. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. 
REFERENCE 1 (bases 1 to 1327) AUTHORS Hsieh,C.-L., Swaroop,A. and Francke,U. TITLE Chromosomal localization and cDNA sequence of human RALB, a GTP binding protein JOURNAL Somat. Cell Mol. Genet. (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by A.Swaroop, 20-JUN-1990. FEATURES from to/span description pept 171 791 GTP-binding protein (RALB) mRNA < 1 1327 RALB mRNA signal 1303 1308 poly-A signal BASE COUNT 368 a 273 c 373 g 313 t ORIGIN Chromosome cen-q13. 1 gagcccggca gctcaatgac aaatcggtgg aggacggctg gggtccggcc ccgggagggc 61 ccggggcgcg tttaagagct gcgggccggg tgcggacggc ggaggcggcg ggactggtcc 121 ctgctcttca gtgggtcatc tgtgtgtcac agcctcagaa gaccagcgag atggctgcca 181 acaagagtaa gggccagagc tccttggccc tccacaaggt gatcatggtt ggcagcggag 241 gcgttggcaa gtcagccctg acgcttcagt tcatgtatga cgagtttgta gaagactatg 301 aacctaccaa agctgacagt tatagaaaga aagtggttct tgatggggaa gaagttcaga 361 tagatattct ggacaccgct gggcaagagg actacgcagc cattcgagat aactactttc 421 ggagtgggga agggtttctt cttgtgttct caatcacaga acatgaatcc tttacagcaa 481 ctgccgaatt cagggaacag attctccgtg tgaaggctga agaagataaa attccactgc 541 tcgtcgtggg aaacaagtct gacctagagg agcggaggca ggtgcctgtg gaggaggcca 601 ggagtaaagc cgaagagtgg ggcgtgcagt acgtggagac gtcagcgaag acccgggcca 661 acgtggacaa ggtgttcttt gacctaatga gagaaatcag aacaaagaag atgtcagaaa 721 acaaagacaa gaatggcaag aaaagcagca agaacaagaa aagttttaaa gaaagatgtt 781 gcttactatg agtgtcaagg tgacggatga agccagctgc tcctaaggac acagggctgg 841 gttggtaaag agaaggctat ggttgacttc ttgcttgtgc ttcccactct ccccgacttc 901 attcactcaa acttctttaa atggggaaaa atatttgtga ctctgtggct ggcagaagaa 961 ataagcccat gcaagtggaa gggctgcttt gtcaggaggt tgtggaattt ctttcttctc 1021 cccttcttcc ctcccaaaag cttagctatg tataaagtgc cacagatagg aaacagctgt 1081 taattacaaa gagaaagaat tgtcatagca tcttattttg ttcctagttt tataacatta 1141 ccatccttcg ttttgaacta cagatgttgt agtgggtttt ggaggaggga gtggagtaag 1201 atgccctccc acttttatca gtttagtagt agtactgaga aaaatccctt 
cagctctaag 1261 aacactgaaa aatccaccga ttttttgggt aagcttcttg gcaataccct gtggatctga 1321 aacagct // LOCUS LACLACR 1332 bp ds-DNA BCT 14-AUG-1990 DEFINITION L.lactis lactose phosphotransferase system repressor (lacR) gene, complete cds. ACCESSION M35375 KEYWORDS lactose phosphotransferase system repressor; lactose repressor. SOURCE L.lactis (strain MG1820) DNA. ORGANISM Lactococcus lactis Prokaryota; Bacteria; Firmicutes; Regular asporogenous rods; Lactobacillaceae. REFERENCE 1 (bases 1 to 1332) AUTHORS Van Rooijen,R.J. and Devos,W.M. TITLE Molecular cloning, transcriptional analysis, and nucleotide sequence of LACR, a gene encoding the repressor of the lactose phosphotransferase system of Lactococcus lactis JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by R.J.Van Rooijen, 20-JUN-1990. Author address: R.J.Van Rooijen Netherlands Inst for Dairy Research Kernhemseweg 2 P.O. Box 20 6710 BA EDE THE NETHERLANDS FEATURES from to/span description pept 370 1155 lactose repressor (lacR; alt.) pept 388 1155 lactose repressor (lacR; alt.) 
mRNA 79 1245 lactose repressor mRNA signal 1215 > 1155 transcription termination signal binding 353 357 ribosomal binding site signal 45 51 -35 region signal 68 74 -10 region BASE COUNT 469 a 207 c 198 g 458 t ORIGIN 1 gatatcaaac attcaaacaa aacgcaacta tttttgttaa ttttttgttt ttttttattt 61 gtttttttaa aaaatagata acaccgttaa attattgttc atttttgttc atttaatcca 121 tcacaaaatg gacgtgaaat atctattcag gtattacaaa agtcttttac tttctataac 181 ttactgatta agaggtccta ctttattttc gtcttataca aaatctgacc taagctaata 241 tacgtcaatc ctctgttctt atttcatcat ctaacgtttg tttttgtttg aaattgtttg 301 ttttaccttg aaaatattat cttttatgat acaattaaaa gagaattatc tttggaaaaa 361 aattacttta tgaaagaaag tcttcatatg aacaaaaaac gacgattaga aaaaatttta 421 gatatgttaa agattgatgg gaccataacc ataaaagaaa taatagatga actagatatt 481 tccgatatga cagcccgtag agaccttgat gctctagaag ctgatggact tttaacacgt 541 actcatggtg gtgcacaatt gctttcctct aaaaagccac ttgaaaagac acatatcgag 601 aagaaaagtc taaatacaaa agaaaaaatt gacattgcta aaaaagcctg ctctttaatc 661 aaagatggcg atactatttt tattggaccc ggaactacac ttgtacaact ggcattagaa 721 ttgaaaggtc gtaaaggtta taaaattcgt gtcattacaa atagtctccc tgtgttcttg 781 attctaaatg atagcgaaac cattgattta ttgcttcttg gcggtgaata tagagaaata 841 actggagctt ttgtaggttc aatggcttcg acaaatttaa aagcaatgag atttgccaaa 901 gcttttgttc gtgcaaatgc tgttacccat aattctattg ctacatatag tgacaaggaa 961 ggtgtgattc aacaacttgc cctaaacaat gctgtagaaa aattcttatt agtagacagt 1021 actaaattcg atcgatacga tttctttaac ttctacaatc tagatcaact cgataccatc 1081 attacagata accagattag ccctcaacac ttagaggaat ttagccagta cactactatt 1141 ttaaaagcgg actagaatta tgacttataa aaatattgga ctactcttaa ataattagac 1201 ataaaaaaag caccgtatga atcaaacaat tctacggtgt ttttttgtta tttctaatgt 1261 atggtttgtc gaaaatatgt acacattatt taactttcca aaaaattgga gttttcttga 1321 taattggata tc // LOCUS MARCMYCA 1391 bp ds-DNA ROD 14-AUG-1990 DEFINITION Woodchuck c-myc protein gene, exon 1. ACCESSION M35498 KEYWORDS c-myc protein. SOURCE Woodchuck (Marmota monax) DNA. 
ORGANISM Marmota monax Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Sciuromorpha; Sciuridae; Sciurinae; Marmotini. REFERENCE 1 (bases 1 to 1391) AUTHORS Wei,Y., Hsu,T.Y., Tiollais,P., Buendia,M.A. and Etiemble,J. TITLE Evolutionary conservation of target sequences for cis-acting regulation in c-myc exon 1 and its upstream sequences JOURNAL Gene (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by M.A.Buendia, 21-JUN-1990. FEATURES from to/span description pre-msg 818 > 1391 c-myc mRNA and introns IVS 1378 > 1391 c-myc intron 1 site 159 220 P0 promoter site 790 794 TATA box site 964 968 TATA box BASE COUNT 317 a 385 c 383 g 306 t ORIGIN 1 ctcagcgatt agtgcgtctt gcgggaatag ccgcttccca cacccggccg ggtggaagtc 61 tgagcctgct gggcaaaacg agcgatatct gctgttttgg cagcaaacta ggggattcat 121 tctgggtggg aagtgcccaa tctagatagc tgtgcataca taatgcataa tgaattacac 181 tcacacaacc tcaagaaatg taataggtat gtattcataa cactctccaa gtatatgtgg 241 caaggcattg ctgcgttatt ttaattattc cagaaatcat tttcctccct acctcctctg 301 tcatttatcc ctaacactcc atatactgaa tgcgcactca taaatattcc ttctgcccgc 361 ctgtcttcat aagacttatt ttcaaaatgc tgctctttcc ccagccttag ggaggcgccc 421 ggccgcccgg gacgtgcgtg cgcggccgtg ggtacatggt gtattctcag tgttgagggt 481 gagggcagct gttccacctg tgttaattgg aacacgcagg acgagaatgc agtttgtcag 541 agtactgcgc cagaggagca gcagagaaag ggaaaggatt taaacaggag caaaagaaaa 601 tggtaggcgc gcgcagttaa ttcttgctgc gcccttatac tgtttacatc cgatagctgg 661 agtgccgggc tgcggggctg agtctcctcc ccttccctca ctcggcagtg cccctcccag 721 gttcccaaag ccgagggcgg ggagaaagaa aaaaaaaaga ttccgtggaa tccccgccca 781 ccagcccttt ataatacgag ggtctgcgcg cccgaggacc cctgagctgc gcttctcgtg 841 gccgccaaca tcgccgcgcc ccggcggccg ctcttggctc ccctcctgcc tagagaaggg 901 cagggcttct tagaggcttg gcgggaaaaa gaagcgaggg ggagggatcg cgcgtaacag 961 cagtataaaa gtcgttttcg gggctttatc tcactcgctg tagtaattcc agcgagagac 1021 agagggagtg agcgggcgaa cccgtgaggt ggaagaaccg 
agcagagctg ctccgggcgt 1081 cctgggaagg gaaacccgga gtgaaaggag acttagtctt ctgaccagcg cccccacccc 1141 agccctcccg cggagcccct ccagggtccg caaccgcgaa actttgccct ttgctgcggg 1201 cggacacttt gcactggaac ttaaaatacc cgatcgagga cgcgactctc cggagcgggg 1261 aggctatact gcctatttgg ggacactttt ccccgccttt acccaggacc cgctcctctg 1321 aaagcgctcc tggctgccgt ttgaaggctg gatttccttc gggtagttga aaacccggta 1381 agcaccagat c // LOCUS ONGOSTLE 214 bp ds-DNA INV 14-AUG-1990 DEFINITION O.volvulus recombinant antigen gene, 3' end. ACCESSION M35370 KEYWORDS T-cell epitope; recombinant antigen. SOURCE O.volvulus DNA. ORGANISM Onchocerca volvulus Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta; Pterygota; Neoptera; Holometabola; Diptera; Nematocera; Culicoidea; Similiidae. REFERENCE 1 (bases 1 to 214) AUTHORS Colina,K.F., Perler,F.B., Matsumura,I., Meda,M. and Nutman,T.B. TITLE The identification of an Onchocerca-specific recombinant antigen containing a T cell epitope JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by T.B.Nutman, 19-JUN-1990. Author address: T.B.Nutman Inst. Lab. of Parasitic Diseases NIAID National Institutes of Health Bldg 4, Rm 126 9000 Rockville Pike Bethesda, MD 20892 email: tbn@helix.nih.gov FEATURES from to/span description pept < 1 93 recombinant antigen (AA at 1) site 91 93 nematode splice junction BASE COUNT 81 a 37 c 31 g 65 t ORIGIN 1 gaattcagtg taagaagcag cagaacattt caatcattac gaagatatat atacaacatt 61 tctttcttct tcattcttga gttgcatatg taaattcaaa aataattacg atttaatgaa 121 ttgagcaagc ataacttttc ccagcaagta taacaaagtt ttgcgaggaa cgaactcaga 181 aaactttcac ttatgtaaaa ttgcgcacaa gacc // LOCUS PSEIAAL 2766 bp ds-DNA BCT 14-AUG-1990 DEFINITION P.syringae IAA-lysine synthetase (iaaL) gene, complete cds. ACCESSION M35373 KEYWORDS IAA-lysine synthetase. SOURCE P.syringae savastanoi DNA. ORGANISM Pseudomonas syringae Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Pseudomonadaceae.
REFERENCE 1 (bases 1 to 2766) AUTHORS Roberto,F.F., Klee,H., White,F., Nordeen,R. and Kosuge,T. TITLE Expression and fine structure of the gene encoding N-epsilon-(indole-3-acetyl)-L-lysine synthetase from Pseudomonas savastanoi JOURNAL Proc. Natl. Acad. Sci. U.S.A. 87, 5797-5801 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by H.J.Klee, 20-JUN-1990. FEATURES from to/span description pept 1100 2287 IAA-lysine synthetase (iaaL) pept 95 1003 ORF1 BASE COUNT 637 a 760 c 754 g 615 t ORIGIN 1 gaattccata gcgtgcgggg cttggaggag cgccgcggcc tgagtatctg tggctaaccc 61 ttgcggcttc ggtgctggtc gctgtcgagc agctatgcgt gcagatcctg cgcagtgcag 121 gcttcggcaa acaggcgatg tggaccctgc tgacggggac ggccgccgtt gccatcgcag 181 atcccctgct tattgtggcg ttcgatctgg gcctggtggg tgccggcatc gctacctgat 241 atcgagcctg gtatcggcct gtctggggtt ttactacgtt caccgagtcg cccatctgac 301 ctgtcgggtc agcctgaaga acctgtcagg tgacatcaga aatatcgggc gaaccgcctt 361 gccagcggtg attggcaacc tggcaactcc agtgggcatg gcctacgtaa tggctgcgat 421 ggcgccgttc ggatctcagg cgctggcgac tatcggggtg atcgacaggg tcattcaggt 481 tgctttttgt gtcgtgttcg ccttgcccgg tgcgctgatc ccgatactgg ggcaaaacct 541 gggcgcaatg aacactgctc gcgtgtctca agccataaag atgacgtacg gattgttgat 601 cggctacggc tcagtgacct cgctgttact cattctgctc gctgagccat tagccagctt 661 gtttcatctc gccgctgaac gccaagtcgt gttcttcgcg ttctgccgat ggggcggcgc 721 tctggacgct catcgggctg caattcattg ccacctcagt cttcctcagt atggggcgac 781 cggcgtacgt cacactgttc ggctggttcc gcgccacctg ggaaccatgc cgttcgtgtg 841 gtatggggca cataaatttg gcagcgtcgg ggtaatgctc gggcagttgc tgggtaacac 901 catagtggcc ttttgtgcct gcgtggctcg cgcatctgct catgaaaaag atgttggaca 961 tcgagatcca ttcaataggg aaccgatccc tccacaggag taactgataa tccacgtttt 1021 gcccaccctt ggctgtcgtc aggtgggcag gatgtccagg atgtccagga aatcaaaaaa 1081 cggactatag aggactcgca tgactgccta cgatatggaa aaggaatgga gtagaatttc 1141 cattactgcc gctaaaatcc accagaacaa cgattttgaa ggattcactt atcaggactt 1201 cagaacccac gtaccgatca tggacaaaga 
cggcttcgcg gcacagactg aacgctgtct 1261 agagcgcaat gagcgaaact gcctgatcgg ctttaccagt ggcaccagcg gcaacatcaa 1321 acgctgttat tactactacg actgcgaagt cgatgaagac agctccctct ccaacgtctt 1381 ccgcagcaac ggctttattc tgcccggtga tcgctgcgcc aacctgttca cgatcaacct 1441 gttttctgct ctgaacaaca cgattaccat gatggccggt aactgcggtg ctcacgtcgt 1501 gtccgtaggt gacatcaccc tggtgaccaa gagccatttc gaagcgctta actcgatcaa 1561 gctcaacgta ctgctcggcg tgccatccac tatcttgcag ttcatcaatg ccatgcaaca 1621 taacggtgtg cacatcaata tcgagaaggt tgtcttcacc ggcgagagcc tgaaaacttt 1681 ccagaagaaa atcatcaggc aagcctttgg cgaacaagtc tccatcgtcg gtgtgtatgg 1741 cagttccgag ggcggcattc tcggtttcac caacagccct tgccacactg aatacgagtt 1801 tctgtccgac aagtatttca tcgaaaaaga aggcgacagc atcctcatca cctcgctgac 1861 ccgagaaaac tttacgccgc tgctgaggta tcgcctagga gacaccgcaa ccctttcgat 1921 gaaaggcgac aagctctacc tgacagacat ccagcgggag gacatgagct tcaacttcat 1981 gggcaacctc atcgggctgg gcatcattca gcaaacgatt aaacagacac tgggccgatc 2041 gctggaaatc caggttcacc tgtcagtgac cgaagagcgc aaggaactgg tgaccgtttt 2101 cgttcaggcc tctgaagtcg atgaagacga acgcgtcaga atcgaaacag ccatcgccga 2161 tatccccgac atcaaagagg cgtatcagaa aaaccaaggc accgtgtcgg tcctgcgcaa 2221 ggatgccaga gactacgcgg tctcggagcg aggcaaaatg ctctacatca tcgaccgccg 2281 aaactgaatg gctgatgtga acgagtgagt agctgcaccg acggggcctt tggcggtgtc 2341 ggtgcagttt tttagaggat tcggaagcgc cagaggtcag agtccacgaa actggaacga 2401 actgggcagc ctgcggctgc aaattgtggg attttgaaat cggttatcat agccgaaatc 2461 gagtcgatcc ctcctcagca caggcttaca catggcgtca gagaccaaaa aacgtaaacg 2521 ggcgagccgg gcaaaagcca aggcaaagca gacccgtctc caacgcgccg ggcatactac 2581 cttcgtgccc gataccgact tttccttcga tatcgatcct ttcggtgatg tcgatctttg 2641 tagttgctgc cagacaacgt atctgaacga catgtttccc gacgcttctt gcgtaaggct 2701 ttagatgaga gaagggccag gcggattcgc atcaccgccg tcattcacca cgatgaggag 2761 ccgcct // LOCUS RABLPBA 1446 bp ss-mRNA MAM 14-AUG-1990 DEFINITION Rabbit lipopolysaccharide binding protein (LBP) protein mRNA, complete cds. 
ACCESSION M35534 KEYWORDS lipopolysaccharide binding protein. SOURCE Rabbit liver, cDNA to mRNA. ORGANISM Oryctolagus cuniculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Lagomorpha; Leporidae. REFERENCE 1 (sites; for [2]) AUTHORS Schumann,R.R., Leong,S.R., Flaggs,G.W., Gray,P.W., Wright,S.D., Mathison,J.C., Tobias,P.S. and Ulevitch,R.J. TITLE Structure and function of lipopolysaccharide binding protein JOURNAL Science (1990) In press STANDARD full staff_review REFERENCE 2 (bases 1 to 1446) AUTHORS Schumann,R.R., Leong,S.R., Flaggs,G.W., Gray,P.W., Wright,S.D., Mathison,J.C., Tobias,P.S. and Ulevitch,R.J. JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by P.S.Tobias, 21-JUN-1990. Author address: P.S.Tobias Department of Immunology, IMM-12 10466 N. Torrey Pines Rd La Jolla, CA 92037 FEATURES from to/span description pept 1 > 1446 lipopolysaccharide binding protein (LBP) precursor sigp 1 78 lipopolysaccharide binding protein signal peptide matp 79 1446 lipopolysaccharide binding protein BASE COUNT 316 a 454 c 391 g 285 t ORIGIN 1 atggggacct gggccagggc cctgctgggg tccaccctgc tgagcctgct gctcgcagct 61 gccccgggag ctctgggcac caaccccggc ctcatcacca ggatcaccga caaaggcctg 121 gagtacgcgg ccagggaggg gctgctggct ctgcagagaa agctcctgga agtcacgctg 181 ccggattccg atggggactt caggatcaaa catttcgggc gtgcacagta caagttctac 241 agtctgaaaa tccccagatt cgagctgctc cgtggcaccc tgaggcccct ccccggccag 301 ggcctgagtc tcgacatctc cgacgcctac atccacgtgc ggggcagctg gaaggtgcgc 361 aaggcgttcc tgagactgaa gaactccttt gacctgtatg tcaagggcct caccatttcc 421 gtccacctcg tgttgggcag cgagtcctcc gggaggccca cggtcaccac ctccagctgc 481 agcagcgaca tccagaacgt ggagttggac atagaggggg acctggagga gctgctgaac 541 ctcctccaaa gccagatcga tgccaggctg cgcgaagtgc tggagagcaa gatttgcagg 601 cagattgagg aagccgtgac ggcccacctg cagccttatc tacagacact gccagtcaca 661 acgcagatcg acagctttgc cggcattgac tacagcttga tggaggcccc 
ccgggcaaca 721 gctgggatgt tggatgtgat gtttaagggt gaaattttcc ctctggatca ccgcagccca 781 gtggacttcc ttgctccagc catgaacctc cccgaggctc acagccgaat ggtctacttt 841 tccatctccg attacgtctt caacaccgcc agcctggcct accacaagtc agggtactgg 901 aacttctcca tcacagacgc catggttccg gccgacctca acatccggcg gaccaccaag 961 tccttccgac ccttcgttcc cctgcttgcc aatctctacc ccaacatgaa cttggagctc 1021 caagggacag tgaactcgga acaactggtg aacctcagca ccgagaatct gttagaggaa 1081 cccgagatgg atattgaggc cttggtggtc ctgcccagct ctgccaggga gcctgtcttc 1141 cggctgggtg tggccactaa tgtgtctgcc acactgacct tgaacaccag gaagatcact 1201 gggttcctga agccgggaag gctacaggtg gaactgaaag aatccaaagt cggaggattc 1261 aatgtggagc tgttggaagc tctcctcaac tactacattc tcaacaacct ctaccccaag 1321 gtcaatgaga agttggccca ccgcttcccg ctccctctgc tgaggcacat tcagctctac 1381 gacctgcttc tccagaccca cgagaacttc ctgctcgtgg gcgccaacat ccagtacagg 1441 agagtt // LOCUS RATUKATPA 1484 bp ss-mRNA ROD 14-AUG-1990 DEFINITION R.norvegicus gastric (H+,K+)-ATPase beta-subunit mRNA, complete cds. ACCESSION M35535 KEYWORDS (H+,K+)-ATPase beta-subunit. SOURCE R.norvegicus gastric mucosa oxyntic cell, cDNA to mRNA, clone RG4. ORGANISM Rattus norvegicus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 1484) AUTHORS Canfield,V.A., Okamoto,C.T., Chow,D., Dorfman,J., Gros,P., Forte,J.G. and Levenson,R. TITLE Cloning of the H,K-ATPase beta subunit: Tissue-specific expression, chromosomal assignment, and relationship to Na,K-ATPase beta subunits JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by V.A.Canfield, 21-JUN-1990. Author address: V.A.Canfield Yale University School of Medicine Dept. of Cell Biology P.O. Box 3333 New Haven, CT 06510 email: levenson@YALEMED FEATURES from to/span description pept 176 1060 gastric (H+,K+)-ATPase beta-subunit (E.C. 
3.6.1.3) mRNA 1 1484 gastric (H+,K+)-ATPase beta-subunit BASE COUNT 363 a 457 c 362 g 302 t ORIGIN 1 ctgacttctg ggacagtgga ggacagatag cacgcaagcc ccagccctcc cttatgttta 61 tagaggcgat agcggagaac tgatagctgg ttctgatgcc tttggcctca cacagaggag 121 actataagcc ccagaggacg ctccctgggc ccagtccagg caagcaggag aggacatggc 181 agccctgcag gagaagaagt catgcagcca gcgcatggcc gaattccggc aatactgttg 241 gaacccggac actgggcaga tgctgggccg caccccagcc cggtgggtgt ggatcagcct 301 gtactatgca gctttctacg tggtcatgac tgggctcttt gccttgtgca tctatgtgct 361 gatgcagacc attgatccct acacccccga ctaccaggac cagttaaagt caccgggggt 421 aaccttgaga ccggatgtgt atggggaaag agggctgcag atttcctaca acatctctga 481 aaacagctcc tgggctggcc tcacacacac cctccacagc ttcttagcgg gctacacccc 541 agcatcccag caggacagca tcaactgttc gtctgaaaag tacttcttcc aggagacctt 601 ttctgctccg aaccatacca agttctcctg caagttcacg gcggacatgc tacagaattg 661 ctcaggcctg gtggacccca gtttcggctt tgaggaggga aagccctgct tcattattaa 721 aatgaacagg attgtcaagt tcctgcccag caacaacacg gctccccgag tggactgcac 781 cttccaggat gacccccaaa agccccggaa ggacattgaa cccctgcagg tccagtacta 841 tccccccaat ggtaccttca gtctccacta cttcccctac tacggcaaga aagcacagcc 901 ccactacagc aaccctctgg tggcggcaaa gttcctcaac gtccccaaaa acacgcaagt 961 cctcatcgtg tgcaagatca tggcggacca cgtgaccttc gacaaccccc acgaccccta 1021 tgaagggaag gtggagttca agctcacaat acagaagtaa ggagtaggcg tggctgtcca 1081 ccccagagcc tggtggaccc tgagggacca ctcttcctga ctgacatcat cggctggcca 1141 gcatgcacgg ccacttcatg gttcagagct gacaccactg cccatctgcc gacagcagga 1201 agtgctcctt cccagcactc cctgagcacc accagctttg aactgaaacc cgacgtgcgc 1261 acgcacgttt gcaatcccgt gcggttaaca caggaaccca gagtccggct accactaagg 1321 gacaacccat ctgtagggca tttctatcct gtgaccattt gtctgtcctg cactttgata 1381 tgaactatgg gtccacatca gtgtaacact ggtcaccccg gcctccagtt tgtgcttctg 1441 gggccacagc ccctaggtca ttaaaacaaa ctatagtaaa gtta // LOCUS YSCMYO2A 5675 bp ds-DNA PLN 14-AUG-1990 DEFINITION S.cerevisiae myosin-1 isoform (MYO2) gene, complete cds. ACCESSION M35532 KEYWORDS myosin-1. 
SOURCE S.cerevisiae (strain GRF88) DNA, clone 10-2B. ORGANISM Saccharomyces cerevisiae Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes; Endomycetales; Saccharomycetaceae. REFERENCE 1 (bases 1 to 5675) AUTHORS Johnston,G.C., Prendergast,J.A. and Singer,R.A. TITLE The S.cerevisiae MYO2 gene encodes an essential myosin for vectorial transport of vesicles JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by G.C.Johnston, 21-JUN-1990. Author address: G.C.Johnston Dalhousie University 7E Tupper Medical Bldg. Halifax, N.S. CANADA B3H 4H7 email: JOHNSTON@AC.DAL.CA FEATURES from to/span description pept 581 5305 myosin 1 isoform (MYO2) binding 138 171 ATP-binding site binding 443 523 actin-binding site rpt 926 981 heptad repeat rpt 1010 1086 heptad repeat BASE COUNT 1994 a 986 c 1110 g 1585 t ORIGIN Chromosome 15 right arm. 1 gatcaataaa taaataggct cgaagacgcc tcagaactcc ggtcactggt ttgtcttgtt 61 gatatacgat gtgccaagcg ccgtttctcg atgcttatct ggtttagttt acgctgttaa 121 aaccaaaacc ccaacagatt ttcgacccta acgtatgtag ggctaaaata gatattgagt 181 aggttacaat taattattgg caattgcacc tagtgacaca tttacgaaaa cgtagggcaa 241 aaactattac ccgacccagg gctattttgt gattttttcc ttttttttgt ttatgatcgc 301 gcttctcgaa aagccaaata tcagaaatcc caaacacgcc ttcatttgat acgattcgta 361 gcctgcgttt cagagatcta tcaactttgc aaggccaatc agagaacaaa aaagtctcgc 421 aaagtcattt cacttttctc gcttgaaatt attcgttcga tttctggctg cttgcttgtt 481 ttttgttttc taaggtacta ttcgacacca ttccattgga cagcgatact tataccattg 541 tacatatagg acataaaaac agcagatatt acagcgtata atgtcttttg aagtgggtac 601 acgatgctgg tatccccata aagaattggg ctggattggg gcggaagtaa tcaaaaatga 661 gttcaacgac ggcaagtacc acctggagtt acaattggaa gacgatgaaa tcgtgtccgt 721 ggacacaaaa gacttgaata acgataagga ccaatctcta ccgcttctta gaaaccctcc 781 cattttggaa gcaacggaag atttgacctc tttatcttac ttgaatgagc cagctgtttt 841 acatgccatc aaacagcgct attctcaatt gaatatctac acatactcgg gtattgttct 901 gattgctaca aacccttttg 
atcgtgtcga ccagctttat acacaagaca tgatccaagc 961 atatgcggga aagcgcagag gtgaactgga acctcacttg tttgccattg ccgaagaagc 1021 gtataggttg atgaaaaatg acaaacaaaa tcaaaccatt gtggtaagtg gtgaatctgg 1081 tgctggaaaa acggtttctg ccaagtatat tatgcgttat tttgcttctg tagaagagga 1141 aaattccgct actgtacaac atcaagtgga aatgtcggaa acagaacaaa agattctagc 1201 tacaaaccct atcatggaag catttggtaa tgctaagact accagaaatg acaattcttc 1261 cagatttggt aagtatctag aaattttatt cgataaggac acatctatta ttggagcaag 1321 gatccgcaca tacttgttgg aacggtccag attagtttac cagccgccaa ttgagagaaa 1381 ctaccacata ttttatcaat taatggctgg attaccagct caaaccaagg aggaattgca 1441 tcttaccgat gcctcagatt acttctacat gaaccaaggc ggtgacacca agatcaacgg 1501 tattgatgat gccaaagaat acaaaattac agtagatgca ttgacattag tcggaatcac 1561 caaggaaact caacaccaaa tatttaagat cttggccgca cttctgcata tcggtaacat 1621 agaaattaaa aaaactagaa atgatgcatc actatcagct gatgagccaa acctgaaact 1681 ggcgtgcgaa ttgctgggaa ttgatgccta caactttgcc aaatgggtca ccaaaaagca 1741 gatcattaca aggtcagaga aaattgtttc gaatctaaat tatagtcaag ctctggttgc 1801 caaagattcc gtggctaagt ttatttattc cgcccttttc gattggcttg tggaaaatat 1861 caacaccgtg ttatgcaacc cggctgtgaa cgaccaaatt agctcattta ttggtgttct 1921 ggatatttat gggtttgaac attttgaaaa aaattcattt gaacaatttt gtattaacta 1981 tgccaacgaa aaactacaac aagagttcaa ccaacatgtt ttcaaattag agcaagaaga 2041 atacgttaaa gaagaaattg aatggtcttt tatagagttt aatgataatc aaccttgtat 2101 tgatctgatt gaaaacaagt tgggtatttt atcactgctt gacgaagaaa gtaggttacc 2161 tgctggttcc gacgaatctt ggacccaaaa actttatcaa actttggata aatctcctac 2221 gaacaaagta ttttctaaac caagattcgg gcaaactaaa tttatcgtga gccattatgc 2281 tctagatgtc gcttatgatg tggaaggatt tattgaaaaa aatagagaca ccgtatctga 2341 cggacatttg gaagtgttga aggcttctac caacgagaca ctaataaata tcttagaggg 2401 attagaaaaa gctgccaaaa aactggaaga agcgaaaaag cttgaattag agcaggctgg 2461 cagtaaaaag ccaggtccga taagaacggt taacaggaaa cccactttag gttccatgtt 2521 taagcaatct ttgattgaac taatgaatac catcaactca actaatgttc attatattcg 2581 ttgtataaag cctaatgcag ataaagaagc 
ttggcaattt gataatttga tggtgttgtc 2641 tcaactcaga gcctgtggtg ttttggaaac tattagaata tcttgtgctg ggtttccttc 2701 taggtggact tttgaagaat ttgtattaag atattacatc ttgataccac atgagcagtg 2761 ggacctaatc ttcaaaaaaa aggaaactac agaagaagat atcatatcag tggttaaaat 2821 gatcctagat gctactgtaa aggacaaatc caagtaccag attggtaata caaaaatttt 2881 cttcaaagca ggtatgcttg catatctgga aaaacttaga agcaataaga tgcataattc 2941 aattgttatg atccagaaga aaattagagc taaatattac cgtaagcagt atttgcaaat 3001 atctcaggcc atcaagtatt tgcagaacaa catcaaaggt ttcatcattc gtcaacgcgt 3061 taatgatgaa atgaaagtta actgtgcaac tttattacag gccgcttaca ggggtcattc 3121 catccgtgcc aatgtgttca gcgtattgag aacaattaca aatttgcaaa agaaaattag 3181 aaaggaacta aaacaaagac aactgaaaca agaacatgaa tataatgctg cggtaactat 3241 tcaaagtaaa gttaggacct ttgagccgag atcgagattt ttacgcacta aaaaagacac 3301 tgttgttgtc caatctttga tcagaagaag agctgctcaa aggaaattga aacaattgaa 3361 ggcagacgct aaatcagtta atcatctgaa agaagtgagc tataaattag agaataaagt 3421 gattgaactg acgcagaatc tagcatccaa ggtcaaagaa aataaagaaa tgacagaaag 3481 aattaaagaa ctacaggttc aagtggaaga aagtgccaag ttacaagaga cattagaaaa 3541 tatgaaaaaa gagcacttaa tagatattga taatcagaaa tctaaggata tggaattaca 3601 aaaaactatt gagaacaatt tgcaatccac tgaacaaact ctaaaggacg ctcaattaga 3661 gttggaggac atggttaaac aacatgatga attgaaagaa gaatctaaaa agcaacttga 3721 agaattagag caaacaaaga aaacattggt tgaataccag acattaaacg gagacttgca 3781 aaacgaagtt aaatctttaa aggaagaaat tgctaggtta caaactgcca tgtcgctggg 3841 caccgttact actagtgtac tacctcaaac accattaaag gatgtaatgg gaggcggtgc 3901 ttcaaatttc aacaatatga tgcttgagaa ttccgactta tctcctaatg atttgaatct 3961 aaagtctaga tctactccat cgtccggaaa caaccacatt gattcattga gtgtcgatcg 4021 cgaaaatggt gtcaatgcta cacaaatcaa tgaagagtta tacaggttat tggaggacac 4081 tgaaattttg aatcaagaaa tcacggaagg cctgttaaag ggattcgaag taccggatgc 4141 tggtgtagct attcaactaa gtaaaagaga cgttgtttat ccggctagaa tactgattat 4201 agttttaagt gaaatgtgga gatttgggct gaccaagcaa agtgaaagct ttcttgccca 4261 agtattgact acaattcaaa aagttgtcac tcaattgaag 
ggtaacgatt taattccaag 4321 cggtgtattc tggttagcaa acgttagaga gttatactca tttgtggtgt ttgctctaaa 4381 ctctatttta accgaagaaa cgttcaaaaa cggcatgacc gatgaggagt ataaggagta 4441 tgtttcattg gtcacagaac taaaggatga tttcgaagct ctaagttata atatatataa 4501 catttggctg aagaaattgc agaagcaatt gcaaaaaaag gccatcaatg ctgtggtcat 4561 ctccgaatca ttaccaggtt tcagcgcggg agaaaccagc gggtttttga acaaaatttt 4621 tgctaacact gaagaatata caatggacga cattttgacc tttttcaaca gcatatactg 4681 gtgcatgaaa tcttttcata ttgagaatga agtgttccat gctgtagtca caaccttatt 4741 gaattatgtg gatgcaattt gttttaacga attaatcatg aaacgtaatt tcttgtcgtg 4801 gaaaaggggt cttcaattga actacaacgt tactagatta gaggaatggt gcaagacgca 4861 tggcttgaca gatggtactg agtgcttaca acatttgatt cagaccgcta agctactgca 4921 agtccgtaag tatactatcg aagacattga tatcttaaga ggaatttgtt attcgctaac 4981 acctgcacaa ttgcaaaaat tgatttcaca ataccaggtg gcagactatg agtctccaat 5041 tccacaggaa atcttaagat acgttgctga tatagttaag aaagaagctg cgttatcttc 5101 atcaggtaat gattctaagg gtcacgagca tagcagcagt atatttatca ctccagaaac 5161 aggtccattt actgacccat tcagtttgat aaagacaaga aaatttgacc aagtagaagc 5221 ctatatacca gcgtggttat ccttgccctc aactaagaga atagttgacc ttgttgccca 5281 acaagtcgtt caagacggcc actaaaactg atggcgcgag aaacaaaatt gtacatgaat 5341 gctaaaaaaa gaaatgacaa aaaaagagaa aaaaaaaaat gaaactacat agttaattaa 5401 taatagaagt atttgtcaat agtatgataa tgaaatcgat attatggaag atattaaccg 5461 cgcgccgtat tagtgtacac tatattaaac tacattttgc ttcttactga atttataaat 5521 tatgattata ttattattac tattatgact actgtatata tttttagaat tagatcggga 5581 accgatgagc gttagctgaa atggacgacg ataaggaacg ataattacca ctagtaaaat 5641 aataacaact aagaataaac acattctcat tttta // LOCUS BSURPOF 3078 bp ds-DNA BCT 14-AUG-1990 DEFINITION B.subtilis minor sigma-37 factor of RNA polymerase (rpoF, sigB), complete cds. ACCESSION M34995 KEYWORDS RNA polymerase; minor sigma-37 factor. SOURCE B.subtilis (strains 168 and Marburg), clones lambda-[21,gt11,1,11,21,31]. 
  ORGANISM  Bacillus subtilis
            Prokaryota; Bacteria; Firmicutes; Endospore-forming rods and
            cocci; Bacillaceae.
REFERENCE   1  (bases 368 to 2297)
  AUTHORS   Duncan,M.L., Kalman,S.S., Thomas,S.M. and Price,C.W.
  TITLE     Gene encoding the 37,000-dalton minor sigma factor of Bacillus
            subtilis RNA polymerase: Isolation, nucleotide sequence,
            chromosomal locus, and cryptic function
  JOURNAL   J. Bacteriol. 169, 771-778 (1987)
  STANDARD  full staff_review
REFERENCE   2  (bases 1 to 3078)
  AUTHORS   Kalman,S., Duncan,M.L., Thomas,S.M. and Price,C.W.
  TITLE     Similar organization of the sigB and spoIIA operons encoding
            alternate sigma factors of Bacillus subtilis RNA polymerase
  JOURNAL   J. Bacteriol. (1990) In press
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable sequence for [1] kindly
            submitted by C.W.Price, 18-FEB-1987.
            Draft entry and computer-readable sequence for [2] kindly
            submitted by C.W.Price, 07-JUN-1990.
FEATURES       from  to/span    description
    pept        200    529      ORF V
    pept        526   1008      ORF W
    pept        968   1762      37 kd minor sigma factor (rpoF, sigB; ttg
                                start codon)
    pept       1762   2361      ORF X
    mRNA        131   2368      sigB operon mRNA
    signal      131    168      sigB operon promoter
    signal     2368   2401      sigB transcription termination signal
    binding     188    194      ORF V ribosomal binding site
    binding     511    519      ORF W ribosomal binding site
    binding     954    961      sigB ribosomal binding site
    binding    1750   1753      ORF X ribosomal binding site
BASE COUNT      999 a    575 c    755 g    749 t
ORIGIN      40 degrees on B.subtilis map.
1 ctgcagaagc tcattgagga acatatgtgt tcctctgcgc aggaaatggt caaaaacatt 61 tatgacagcc tcctcaaatt gcaggatttt cagcttcacg atgattttac gttaattgtt 121 ttgcggagaa aggtttaacg tctgtcagac gagggtataa agcaactagt gatttgaagg 181 aaaatttgag gtgatacgaa tgaatataaa tgttgatgtg aagcaaaacg agaatgatat 241 acaagtaaac attgcaggag aaattgatgt atactcagcc ccggtgctta gagagaagct 301 cgttcctctg gcagaacaag gagctgactt aagaatttgc ctgaaagatg tcagctacat 361 ggacagtacc ggattgggcg tttttgtagg gacctttaaa atggtgaaaa aacaaggtgg 421 ttcgctgaaa cttgaaaatc tttctgaacg gctgatccga ctgtttgaca ttacaggctt 481 gaaggacatc attgatattt ctgcaaagtc agaaggtgga gtgcaatgaa gaataatgct 541 gattacatcg aaatgaaagt gccggcccaa cctgaatatg tgggaattat aagactgacg 601 ctgtcagggg tcgcaagcag aatgggctat acgtacgatg aaattgaaga cttgaaaatc 661 gcagtcagtg aggcgtgcac aaatgcggtt cagcacgctt acaaagaaga taaaaatggg 721 gaagtgtcaa tacgattcgg tgtgtttgaa gaccgtttag aggttattgt ggcggatgaa 781 ggagacagct ttgactttga tcaaaagcag caggatctag ggccgtacac accttcgcac 841 acagttgatc aattatcaga aggagggctc ggtctatatt taatggaaac gctcatggat 901 gaagtcagag tgcaaaacca ctccggcgtc accgtagcga tgacaaagta tttaaatggg 961 gagcgagttg atcatgacac aaccatcaaa aactacgaaa ctaactaaag atgaagtcga 1021 tcggctcata agcgattacc aaacaaagca agatgaacaa gcgcaggaaa cgcttgtgcg 1081 ggtgtataca aatctggttg acatgcttgc gaaaaaatac tcaaaaggca aaagcttcca 1141 cgaggatctc cgccaggtcg gcatgatcgg gctgctaggc gcgattaagc gatacgatcc 1201 tgttgtcggc aaatcgtttg aagcttttgc aatcccgaca atcatcggtg aaattaaacg 1261 tttcctcaga gataaaacat ggagcgttca tgtgccgaga cgaattaaag aactcggtcc 1321 aagaatcaaa atggcggttg atcagctgac cactgaaaca caaagatcgc cgaaagtcga 1381 agagattgcc gaattcctcg atgtttctga agaagaggtt cttgaaacga tggaaatggg 1441 caaaagctat caagccttat ccgttgacca cagcattgaa gcggattcgg acggaagcac 1501 tgtcacgatt cttgatatcg tcggatcaca ggaggacgga tatgagcggg tcaaccagca 1561 attgatgctg caaagcgtgc ttcatgtcct ttcagaccgt gagaaacaaa tcatagacct 1621 tacgtatatt caaaacaaaa gccaaaaaga aactggggac attctcggta tatctcaaat 1681 gcacgtctcg cgcttgcaac 
gcaaagctgt gaagaagctc agagaggcct tgattgaaga 1741 tccctcgatg gagttaatgt aatgatccag gttgaagaaa acgagcatat tcaaactcta 1801 gtatatcaac tgaataaaga agggaaatcg atttgcggtg acagtttttt tatgaaagct 1861 gatgataagg agttaatttg cgcggttgct gacggactgg gaagtggatc acttgctaac 1921 gaatcctctg cagccatcaa agacttagtg gaaaactatg cgagtgaaga cgtagaaagc 1981 attatcgaac gctgtaatca ggcgatgaaa aacaaaagag gcgctacagc atctatcctg 2041 aaaatcaatt ttgagcaaag gcagtttacg tactgctctg tcggaaatgt acggtttatt 2101 ctgcattccc cgtctggtga aagcttttat cctctgccga tttcgggcta tttatcaggc 2161 aagccgcaaa aatacaaaac gcacaccgcc acctatgaaa agggttcaaa gttcattata 2221 catacagatg gactcaacgt acctgatatc cgctcccatt tgaaaaaagg ccaatcggta 2281 gaagaaatat cgaattcgct gaaaatgtat acgacatcga gaaaagatga cctgacctat 2341 attctcggac agctgtccta aaaaaccaga aaaagaagct ggacatccgg cttctttttt 2401 ttgcggttga ctattaataa gtgcgcttat tataattaat aagtgaactt atcattctgg 2461 gagcttatgg gatgtggaat tatgaataca cagcgatagc tgatgtggat gtgggaaaac 2521 ctggaactgt tatcaggaaa cagaaacaat ggatcaatgg gagacgggta ttgcatcaga 2581 ttctttgaaa ggacaatttc aaaatggaac ggaaggaaca ttgcttctgc aaggaaaaac 2641 aattctctta tggcttgagc atgtggagaa acatcgggcg ttttcttgtc atggtcaaca 2701 tgccccgcat agcaatgtaa taatggagtt ttttcacgtg cttcaaaagc aaaaaagaag 2761 atttgtgtat cacatatgga attgtattag aaggcgctga agcaaaaaag tggggagaaa 2821 tcattgggac tgaactgtct aaagacatgc cgacggctgt cagcaggctt gtccatcttt 2881 acggaggagt gataaaatga acacaaatta ctggatcggc gttgtgtcag agcaacatgt 2941 attaaaaggc gctgccggcg gttttgcaca gctgtgccat ggaaaaaaag ccccgctcgc 3001 caaaatgaaa gagggagatt ggctgattta ttattctccg agagacgcat atccagacgg 3061 caaactgctg cgaagctt // LOCUS HUMERPR 1624 bp ss-mRNA PRI 14-AUG-1990 DEFINITION Human erythropoietin receptor mRNA, complete cds. ACCESSION M34986 KEYWORDS erythropoietin receptor. SOURCE Human cell line OCI-M1, cDNA to mRNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. 
REFERENCE 1 (bases 1 to 1624) AUTHORS Ehrenman,K. and St John,T. TITLE Sequence of the human erythropoietin gene JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by T.St.John, 07-JUN-1990. Author address: T.St.John Hutchinson Cancer Res Center 1124 Columbia St. Seattle, WA Email: TSTJOHN@FHCRCVAX FEATURES from to/span description pept 1 1527 erythropoietin receptor precursor sigp 1 72 erythropoietin receptor signal peptide matp 73 1524 erythropoietin receptor BASE COUNT 276 a 537 c 498 g 313 t ORIGIN 1 atggaccacc tcggggcgtc cctctggccc caggtcggct ccctttgtct cctgctcgct 61 ggggccgcct gggcgccccc gcctaacctc ccggacccca agttcgagag caaagcggcc 121 ttgctggcgg cccgggggcc cgaagagctt ctgtgcttca ccgagcggtt ggaggacttg 181 gtgtgtttct gggaggaagc ggcgagcgct ggggtgggcc cgggcaacta cagcttctcc 241 taccagctcg aggatgagcc atggaagctg tgtcgcctgc accaggctcc cacggctcgt 301 ggtgcggtgc gcttctggtg ttcgctgcct acagccgaca cgtcgagctt cgtgccccta 361 gagttgcgcg tcacagcagc ctccggcgct ccgcgatatc accgtgtcat ccacatcaat 421 gaagtagtgc tcctagacgc ccccgtgggg ctggtggcgc ggttggctga cgagagcggc 481 cacgtagtgt tgcgctggct cccgccgcct gagacaccca tgacgtctca catccgctac 541 gaggtggacg tctcggccgg caacggcgca gggagcgtac agagggtgga gatcctggag 601 ggccgcaccg agtgtgtgct gagcaacctg cggggccgga cgcgctacac cttcgccgtc 661 cgcgcgcgta tggctgagcc gagcttcggc ggcttctgga gcgcctggtc ggagcctgtg 721 tcgctgctga cgcctagcga cctggacccc ctcatcctga cgctctccct catcctcgtg 781 gtcatcctgg tgctgctgac cgtgctcgcg ctgctctccc accgccgggc tctgaagcag 841 aagatctggc ctggcatccc gagcccagag agcgagtttg aaggcctctt caccacccac 901 aagggtaact tccagctgtg gctgtaccag aatgatggct gcctgtggtg gagcccctgc 961 acccccttca cggaggaccc acctgcttcc ctggaagtcc tctcagagcg ctgctggggg 1021 acgatgcagg cagtggagcc ggggacagat gatgagggcc ccctgctgga gccagtgggc 1081 agtgagcatg cccaggatac ctatctggtg ctggacaaat ggttgctgcc ccggaacccg 1141 cccagtgagg acctcccagg gcctggtggc agtgtggaca tagtggccat ggatgaaggc 1201 
tcagaagcat cctcctgctc atctgctttg gcctcgaagc ccagcccaga gggagcctct 1261 gctgccagct ttgagtacac tatcctggac cccagctccc agctcttgcg tccatggaca 1321 ctgtgccctg agctgccccc taccccaccc cacctaaagt acctgtacct tgtggtatct 1381 gactctggca tctcaactga ctacagctca ggggactccc agggagccca agggggctta 1441 tccgatggcc cctactccaa cccttatgag aacagcctta tcccagccgc tgagcctctg 1501 ccccccagct atgtggcttg ctcttaggac accaggctgc agatgatcag ggatccaata 1561 tgactcagag aaccagtgca gactcaagac ttatggaaca gggatggcga ggcctctctc 1621 agga // LOCUS HUMGFP40H 4379 bp ds-DNA PRI 14-AUG-1990 DEFINITION Human P40 T-cell and mast cell growth factor (hP40) gene, complete cds. ACCESSION M30135 KEYWORDS T-cell growth factor; cytokine; mast cell growth factor; megakaryoblastic leukemia cell growth factor. SOURCE Human (cell line CESS) DNA, clone lambda-H40.3a1. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 4379) AUTHORS Renauld,J.C., Goethals,A., Houssiau,F., Merz,H., Van Roost,E. and Van Snick,J. TITLE Human P40/Il9: Expression in activated CD4+ cells, genomic organization, and comparison with the mouse gene JOURNAL J. Immunol. 144, 4235-4241 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by J.Van Snick, 12-DEC-1989. 
FEATURES from to/span description pept 796 909 P40 T-cell and mast cell growth factor precursor, exon 1 1022 1057 P40 T-cell and mast cell growth factor precursor, exon 2 1140 1172 P40 T-cell and mast cell growth factor precursor, exon 3 2461 2592 P40 T-cell and mast cell growth factor precursor, exon 4 4096 4215 P40 T-cell and mast cell growth factor precursor, exon 5 sigp 796 849 P40 T-cell and mast cell growth factor signal peptide matp 850 909 P40 T-cell and mast cell growth factor 1022 1057 P40 T-cell and mast cell growth factor 1140 1172 P40 T-cell and mast cell growth factor 2461 2592 P40 T-cell and mast cell growth factor 4096 4212 P40 T-cell and mast cell growth factor IVS 910 1021 hP40 intron A IVS 1058 1139 hP40 intron B IVS 1173 2460 hP40 intron C IVS 2593 4095 hP40 intron D signal 742 747 TATA box signal 4354 4359 polyA signal rpt 2965 3265 Alu repeat binding 370 376 AP-2 binding site binding 624 629 IRF-1 binding site binding 627 633 AP-1 binding site BASE COUNT 1208 a 921 c 904 g 1346 t ORIGIN 1 ctagtatgta gtaagttctc agtaaatgtt agctactata ctctttcaag tgctgggttt 61 ttacttgatg tcatacagtg ttatataaga tctccaaaga tactgaggag tcctcaaggc 121 caattttaac aagcatggtt gccgcattct tgtgcttata gttgaacatt tcttctttca 181 gacacttgca caaagggata cttctaagat gcatttgcat taggtggcaa acttcatcct 241 gggtatgaaa aacattgaga tttgggaata aagcatagta agactgaggt tgcaattact 301 aaaggaaaac cccaacagag ataagtgaag ttctgcaata tcatgcaccc tcccccaacc 361 cgctctgtct ccccaggccc cccttcgtta gaacacccat gactggctat attatatcag 421 catttcccat aatgtaaaaa gggaaaatac agacctgggc gttcatggaa agtattctaa 481 ctctcacaac cagaatccct gtctttgaat tttttttctt ggtttttaga tctttaactt 541 ttccttcagc atttcagtac tcaacttttt gaaaatcatc ttttctgagg aatgatattt 601 cctggcacag catcatctct gtcaagtgac tcagtttgat ttttttgttt gttagtataa 661 agtggcccca acttacagag aaaaagtggg ctcttggtat cagtttgatg tcagggtttt 721 tccgtgtttg agagggagct ttaaatacca ctcgatttga aggtgtctgc aagcgagctc 781 cagtccgctg tcaagatgct tctggccatg gtccttacct ctgccctgct 
cctgtgctcc 841 gtggcaggcc aggggtgtcc aaccttggcg gggatcctgg acatcaactt cctcatcaac 901 aagatgcagg taggctgcag ggggagccca tgggaaagac agctactgac aaagtgaaat 961 atgtatgagg atgaaaaaac tcggggctga ctaaaggttc ttatctctct atctacttta 1021 ggaagatcca gcttccaagt gccactgcag tgctaatgtg agtgaatgct ctttaagaac 1081 tttccaaatt aattttaatt ttcacatctg gaatcttcac tctgaaattt cccttgcagg 1141 tgaccagttg tctctgtttg ggcattccct ctgtaagtat agtgaaataa cataatgttg 1201 accttggatt tttttggttt gtttttaagt aaaaataagt tgctttattt aatatttaat 1261 gttatacatt gttgcttaat ttaattgtta cagattagta ttccctgtta aaaccacatt 1321 gttacaaatt attccctttt aaaactacga tcttgaaatc ctatattatg aacatttctt 1381 tgtatttaat taactttatg cctcttgaga agtttgaaca cttttcaaca ttaaaaaaag 1441 aatcctgaat atctttttag ataggtggcc atgtgcacaa ttaaataaaa ctggaactaa 1501 ggatataata attgctgtag ctcatatcat attgctttct aactcattta ctgataactc 1561 tagagttgtg aaacaatgta aataaaatga caactcctta tctttcatct gtcatgaatg 1621 atctatgcgc tatacctccc cctccctgcc tcctcccttc ctccccacca ccctgttgtc 1681 tgtctagctg attagagtga ctgttggttt gaatgctgcc ctctgggcag gtagaggatc 1741 tgaggttgtg agtggaagga gggcttccag agggccactg cccactacgg caggaaggat 1801 gggtggcagg aaagttctga ttcctaattc aaactcctgg ttagggtgag gaggaggcac 1861 ttctccaagg tgcagtgctt tattctttct catgcaaggc ctgggagaat ctgaagaatc 1921 tgagcttctt gccctggcta gggtaagaca tcgcacccat cgcggtccat ccattagatg 1981 agaagaggat agagtgcctt ctgggcagga accaggcaga cagcacagcc cctgtccctt 2041 ggagtacagt ccatgttttt agctgctgct gaaataccag ctgcattcaa ttgtcacatc 2101 ccattagctg gtgtgaaaag gcttttcctc actctgcact ttcagactta caagccttga 2161 agccgggaag cacccgttga aaagaacatt cagagccgac tatttcaggg cccagagccc 2221 tcatgtttcc tggatgtaac atacaggaag tctcctccag gggatgtcac tgtggaaaaa 2281 tggcatcccc tttaaatacg ggagatcact tcctacattg gcaagggacc tgtctaaaaa 2341 taatgcaagt ttgagtaatg gtgattaaat aaaaatcatc tctattatat tgctctttgt 2401 gatatatttc caaagctgtc ctcagaatat ttctttgaat aaatccttac tatttaccag 2461 gacaactgca ccagaccatg cttcagtgag agactgtctc agatgaccaa taccaccatg 
2521 caaacaagat acccactgat tttcagtcgg gtgaaaaaat cagttgaagt actaaagaac 2581 aacaagtgtc cagtaagttt gttttcatat gtgatatgtt cctgttggtg atttctatgt 2641 gaatggtgat gccaaccctg tttgaacgca aaaggatgat aaagttggaa ttggtagttc 2701 aaggttgata aaagacatct aagaatttta atcagaagta atataattaa agtgagatcc 2761 actgaaacaa tagaattaaa gtgagataga tcattgttcc tgacgaggcc atttacttct 2821 ctctactatg gaataatgaa agaatccttt ctgagtgtaa ttagaagcta caatctagag 2881 aatcagggat gtagctcaca taatactaaa ttatcctaga gattcaatgt actaactgaa 2941 tggatgttgt taacagggat ttttttttcc tgttggttaa ggaggttttg ttttgttttg 3001 gagacagagt cttgctctgt tgcccaggct ggagtgcagt ggtgccatct gagctcactg 3061 cagcctctgc ctcccgggtt caagtgatta tcctgcctca gcctcccgag tagctggcat 3121 tacaggtgcg tgccaccatg cctggctaat ttttgtattt ttaatagaga tggggtttca 3181 ccatgttggc caggttgctc tccaactcct gaactcaagt gatttgcccg ccttgacctc 3241 ccaaagtgct gggatgacag gtgtgagcca ccatgcctgg cctgcattaa ggaggtattt 3301 aaagggcaat gcacccaggt caaggtggaa gcttgctact catcctgaat gcccatccac 3361 acattctttt cttcagcata taccctagtc cctgacagca gactgggatg gcaagttggg 3421 tagaggtgac ctccctctgt tttttgggta ttagcatctc cacacaagat cctagaaggc 3481 tgaaagccct gagctcagct gtttagctgc atgcgtttct accatcaatg gcatctagtt 3541 ctaagtgctt aatatatgct gtctcactga ataaatacat accttaggga caattattca 3601 atttattact ctcagtgagg ttaactaatt tgcctaaggc tgcatatttg ataagtggca 3661 gagctgagat ttgaactcag gcctatatga cctcagagcc ccactcttag ccattgtact 3721 gtcaaatgac cttggaaaga caacctaaaa ggataatgat acaattttag gcctcaaaga 3781 gtccccagaa aaggctttct ctaatgcaga gatttagggc cacttaatag gggtgtgtgt 3841 gtgtgtgtgt gtgtgtgtgt gtgtgtgtgt gtaaagaccc ctgaaatcca atttgaggtc 3901 aaccacctat gctgtcttta caccacatga gctagcctgg acctgcccac ctatttgctc 3961 tgtgtctcaa gccacttccc ttcccatccc cacaatcctc accaccgact ctggctcttg 4021 gcaggtaggc ttctggggct gcttggctct acatcatttg agtcactctg tccttatcaa 4081 ctttcatccc cacagtattt ttcctgtgaa cagccatgca accaaaccac ggcaggcaac 4141 gcgctgacat ttctgaagag tcttctggaa attttccaga aagaaaagat gagagggatg 4201 
agaggcaaga tatgaagatg aaatattatt tatcctattt attaaattta aaaagctttc 4261 tctttaagtt gctacaattt aaaaatcaag taagctactc taaatcagta tcagttgtga 4321 ttatttgttt aacattgtat gtctttattt tgaaataaat acatatgtgg aaaaaacaa // LOCUS MEFDNABP 1710 bp ds-DNA BCT 14-AUG-1990 DEFINITION M.fervidus DNA binding protein HMfB gene, complete cds. ACCESSION M34778 KEYWORDS DNA binding protein HMfB. SOURCE M.fervidus DNA. ORGANISM Methanothermus fervidus Prokaryota; Bacteria; Mendosicutes; Archaeobacteria; Methanobacteriales; Methanothermaceae. REFERENCE 1 (sites; for [2]) AUTHORS Sandman,K., Krzycki,J.A., Dobrinski,B., Lurz,R. and Reeve,J.N. TITLE HMf, a DNA binding protein isolated from the hyperthermophilic archaeon Methanothermus fervidus, is most closely related to histones JOURNAL Proc. Natl. Acad. Sci. U.S.A. 87, 5788-5791 (1990) STANDARD full staff_review REFERENCE 2 (bases 1 to 1710) AUTHORS Sandman,K., Krzycki,J.A., Dobrinski,B., Lurz,R. and Reeve,J.N. JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by K.Sandman, 01-JUN-1990. Author address: K.Sandman Department of Microbiology Ohio State University 484 W. 12th Ave. Columbus, Ohio 43210 E-mail KENDRICK-K@OSU-20.IRCC.OHIO-STATE.EDU FEATURES from to/span description pept 578 787 DNA binding protein HMfB pept 1662 805 (c) ORF285 signal 450 457 promoter signal 523 530 promoter signal 790 814 transcription termination signal BASE COUNT 619 a 263 c 265 g 561 t 2 others ORIGIN 1 bp upstream of HindIII site. 
1 aagctttgag agcaatatat gaatccaatn gaactgccct aancgtcagt gatgaagaaa 61 tacttaaggc tcaaagacta ttagcaaaaa aaagaggtat aggtgttgag ccagcatcag 121 ctgcaagtat tgcagactaa tcaaattaat agatgatggc tatttagatc atggagaaag 181 agttgtatgc gtattgacag ggcatcttct aaaagatcca aaaattgcta taaatgcttg 241 taaaaagcct ttagaagtag aaccagattt taatttacta aaaaagatta ttggataaat 301 agaaaatttt attctataaa aaaacgcctt tgatacaata tctatattat ccatttatat 361 ggctatcaaa gcacaatata aactttttat ggttaataag aatttcacaa ccaccatcaa 421 aataaaacat caacgatgca aacaataaat ttatatagga taaatttgat aatattcttt 481 cgtaagaata gagaggaaaa catcaacgat gcaaacaata aatttatata ggataaattt 541 gataatattc tttcgtaaga atagagaggt ggtaagtatg gaattaccaa tagcaccaat 601 tggtagaatt ataaaagatg caggagccga aagagtcagt gatgatgcaa gaataactct 661 agcaaaaatt ttagaagaaa tgggtagaga cattgcttca gaagctataa aactagcaag 721 acacgccgga aggaaaacaa taaaagcaga ggacattgaa ctagcagttc gaagatttaa 781 gaaataatat ttttcttttt ccttttatat ttttattgca aataaaaaat catacaaaat 841 ttttgctgca ttcatcgctg cggggtcgcc tattctatct gttgcaactt caacaatgtc 901 aatgccaata attttttttc tagctatttt ttcaatgaat ttttccatat ggtagggagt 961 gatacctaca ggcgttggat ttccaacacc tggagcataa ccaggatcta atacatctat 1021 atccactgtt acatagaatg gtccatctaa tttatttatt tcatttaaaa ccatgttaaa 1081 tttttcaatt atgtctcttg atgtatagta tttgatattt tcattaagaa caaattcatg 1141 ttcttctttt gtacaagatc ttacaccaat ttggacaatt ttcttcggat gtaattcata 1201 tattcgtcgc atgactgttg catgcgaaaa ctttttacca gcataagtat ttgccatatc 1261 catatgtgca tcaaagtgta gaatagttac atcacttaga tcaaatgttt tcacaacacc 1321 atatgttatg ctatgttctc caccaaggac aataggaatc attccttttt ttaaaacttc 1381 ttgaactttt tcagaaatat tacgacaagt atttttaaag tcacctagaa ttacctcaat 1441 atcacctaaa tcaatgatag gaacatctag tttttttcta tagtgtaaac tatagttttc 1501 aaaattgtag gatgcttgac gaatcatcaa tggtccaaat cttgaccctg gcttatatgt 1561 ggatgtgcta tcaaaaggta ctcctaacaa tccaaaaact tttttattag aaaggtattt 1621 aagggaggaa gggtccttag aatatgcaaa ttttagaccc atattattta actcttaaaa 1681 gttttgtttt acccattgct 
tctaaatatt // LOCUS MUSINT1A 5607 bp ds-DNA ROD 14-AUG-1990 DEFINITION Mouse mammary proto-oncogene Wnt-1 (int-1), complete cds. ACCESSION K02593 M34750 KEYWORDS Wnt-1 oncogene; int-1 oncogene; oncogene; proto-oncogene. SOURCE Mouse (BALB/c) DNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1098 to 5606) AUTHORS Van Ooyen,A. and Nusse,R. TITLE Structure and nucleotide sequence of the putative mammary oncogene int-1; Proviral insertions leave the protein-encoding domain intact JOURNAL Cell 39, 233-240 (1984) STANDARD full staff_review REFERENCE 2 (bases 1 to 5607) AUTHORS Nusse,R., Theunissen,H., Wagenaar,E., Rijsewijk,F., Gennissen,A., Otte,A., Schuuring,E. and van Ooyen,A. TITLE The Wnt-1 (int-1) oncogene promoter and its mechanism of activation by insertion of proviral DNA of the mouse mammary tumor virus JOURNAL Mol. Cell. Biol. 10, 4170-4179 (1990) STANDARD full staff_review COMMENT Draft entry and computer readable sequence for [2] kindly submitted by R.Nusse, 31-MAY-1990. FEATURES from to/span description pept 1779 1882 Wnt-1 protein (exon 1) 2452 2705 Wnt-1 protein (exon 2) 3279 3544 Wnt-1 protein (exon 3) 4002 4490 Wnt-1 protein (exon 4) IVS 1883 2451 Wnt-1 cds intron A IVS 2706 3278 Wnt-1 cds intron B IVS 3545 4001 Wnt-1 cds intron C site 1133 1134 MMTV insertion site in tumor 17 [1] site 1394 1395 MMTV insertion site in tumor 35 [1] site 1631 1632 MMTV insertion site in tumor 102 [1] site 4495 4496 MMTV insertion site in tumor 53 [1] mRNA 1428 5398 Wnt-1 mRNA (alt.) mRNA 1595 5398 Wnt-1 mRNA (alt.) signal 1562 1566 TATA box signal 1573 1577 TATA box BASE COUNT 1188 a 1681 c 1519 g 1219 t ORIGIN Chromosome 15. 
1 atgtatgtat gtatgtatgt atgtatgtat acgtgcgtgc acctgtgtgt gcttggtgtc 61 agtggggctc agacatcacc tgattccctg gaactggagt tacaggtggc tataagccac 121 cacttgggtg ctgagaacag agtccgggcc tctggcagag cagtcagtgc ttttagccac 181 tgagccactc tcatcccccc aattatgttc atcttgagtt gggcaggtac ggtggcggaa 241 taggcctgta atcccagcag tcactggacc atcatgggtt ctacatatta aacctttatg 301 ttaggtaggg tcacacagca agatccggtc acaaaaccag caacaacaaa aaccaaaagg 361 agccagcttc ttcccacaag cattctttcc ctcaggtctt cagctccatc tgacagctac 421 tcggctggtg gtcctatcct ttctgagcct agttgccaga gaaacaagcc cggttcatct 481 tcatgactag cacatctaat gataagcaca ggttgactca aggtgccata gagtgacact 541 aggtacccag agcgacagaa tgacacctat gagtgcacgt cgttaatcac aaacacacac 601 acacacacac acacacacac acacacacac tcatgcaccc acctgcaaac acaattgcag 661 ccttctggac gtctcctgtc acagccccac ctccttcctg atacactgcg ttaagtggtg 721 actgtaacaa aatgacttca tgctctccct gtcctgagcc aaattacaca attatttgga 781 aagggctcaa aatgttcttc gttagaagtt tctggataca ccaatacaca ggagcgtgca 841 ccctcagaac acatgtacac tttgacttaa tctcacgggt gacacaccga cgcttacact 901 ccccctagcc cacagaggca aactgctggg cgcttctgag tttctcactg ccaccagctc 961 ggtttgctca gcctaccccc gcaccccgcg cccgggaatc cctgaccaca gctccaccca 1021 tgctctgtct ccttcttttc cttctctgtc cagccgtcgg ggttcctggg tgaggaagtg 1081 tctccacgga gtcgctggct agaaccacaa ctttcatcct gccattcaga atagggaaga 1141 gaagagacca cagcgtaggg gggacagagg agacggactt cgagaggaca gccccaccgg 1201 cgcgtgtggg ggaggcaatc caggctgcaa acaggttgtc cccagcgcat tgtccccgcg 1261 ccccctggcg gatgctggtc cccgacgggc tccggacgcg cagaagagtg aggccggcgc 1321 gcgtgggagg ccatcccaag gggaggggtc ggcggccagt gcagacctgg aggcggggcc 1381 accaggcagg gggcgggggt gagccccgac ggttagcctg tcagctcttt gctcagaccg 1441 gcaagagcca cagcttcgct cgccactcat tgtctgtggc cctgaccagt gcgccctggt 1501 gcttttagtg ccgcccgggc ccggaggggc agcctcttct cactgcagtc agcgccgcaa 1561 ctataagagg cctataagag gcggtgcctc ccgcagtggc tgcttcagcc cagcagccag 1621 gacagcgaac catgctgcct gcggcccgcc tccagactta ttagagccag cctgggaact 1681 cgcatcactg ccctcaccgc 
tgtgtccagt cccaccgtcg cggacagcaa ccacagtcgt 1741 cagaaccgca gcacagaacc agcaaggcca ggcaggccat ggggctctgg gcgctgctgc 1801 ccagctgggt ttctactacg ttgctactgg cactgaccgc tctgcccgca gccctggctg 1861 ccaacagtag tggccgatgg tggtaagtga gctagtacgg ggtccgccac ttgtcctggg 1921 gcaaagagcc aggcacgggc cttacccagc tcccacgctg tggggatcac caacctacag 1981 acccccctcg tgcattgtga cttcacatcc agggtgctca cacctagaac tagctctgct 2041 gaagtggggc acatcattgg catgcagaag cccagataca ccaggctcag agaccattcc 2101 catttaatac gaccccgttt ctgctgagca acaggtccca acctcgctgt ggtgggtgct 2161 caggtgtccc ttaggtcttg aaccaaaaaa aaaaaaaaaa aaaaaaaaaa accagatatt 2221 agctttgagg tgagggagtg gaattcctaa gtttttcaag gtgggcaagg ctgcaggtgg 2281 ggtttctcct cgggggctga cttgaagaaa ggaagagcta aggtagccat gccttttctg 2341 tccactcact agactctgga gctcagggcc aggcaaggat agggtggtac agcctgtatg 2401 gttaggatgc aggtcccctc ccctggactg aacccttatg catcccgcca ggggcatcgt 2461 gaacatagcc tcctccacga acctgttgac ggattccaag agtctgcagc tggtgctcga 2521 gcccagtctg cagctgctga gccgcaagca gcggcgactg atccgacaga acccggggat 2581 cctgcacagc gtgagtggag ggctccagag cgctgtgcga gagtgcaaat ggcaattccg 2641 aaaccgccgc tggaactgcc ccactgctcc ggggccccac ctcttcggca agatcgtcaa 2701 ccgaggtggg tgcccaggaa agcgacgctt ccgggattaa gggaaaagca gggtcatctc 2761 cagggcatag gcgggcgaag gcagggaaga catcccaggg ttatatgtga tcaaactgag 2821 aatcgcctgg tgccggcagt taccgtaggt cagcaccaga ttctttctag ccttgcgttg 2881 tgagcatgat ctttaacgtt gctggccact ggcccacaga aagggaattc cggatcgtgg 2941 gcgctgggcg acagctgttt ttccctagcc ttcctcaaag gtacctggga agctgatctc 3001 tgagggctag ctagggttgt gcttcgcacc cagcaaagtt tgcactgcca atactagtag 3061 cgatcttggc tatgcagatt tgttctactt gggaatctcc ccttggagct gctctgctag 3121 ggctctggag tctcagtaaa gcttagagag gagggcattc catgcttcgc acacatgact 3181 ccaaggatgt tggactgtag ggtaccaagt cttccaaaca gggtgctgag ttggccccac 3241 gccttctctc aactgatgcg gggtcgcttc acccacaggc tgccgagaaa cagcgttcat 3301 cttcgcaatc acctccgccg gggtcacaca ttccgtggcg cgctcctgct ccgaaggctc 3361 catcgagtcc tgcacctgcg actaccggcg 
gcgcggccct gggggccccg actggcactg 3421 ggggggctgc agtgacaaca tcgattttgg tcgcctcttt ggccgagagt tcgtggactc 3481 cggggagaag gggcgggacc tacgcttcct catgaacctt cacaacaacg aggcagggcg 3541 aacggtacgt cggtgtgtcc ggaaccaatg gcaggggaga tgtaagacag gtgcacgggg 3601 acagaggcac agggaggggc ttcccgagag agtgggactc taggagggaa gacagagaag 3661 aggtggtggt tgagggcaaa gaggttcctg agctgatgac agaacagaag agattagcag 3721 gctatcaaca cgtgggatgt attgagatgg ctccatggca cacttttgaa agataaaagt 3781 gacttgctgg cgtggagcag agtctggccg aatgtcccta tctcagcggg ccattttgca 3841 cttcctctct cccgagctta gtcacacctg gaccttggct gaagtttcca cagcatcgac 3901 gtgacccggg tggggtgggg gtggggaagt atgggtggtg gttcgtggga tgttggcttt 3961 gaccttttct tccctcctcc cctcgtcccc tcctccccca gaccgtgttc tctgagatgc 4021 gccaagagtg caaatgccac gggatgtccg gctcctgcac ggtgcgcacg tgttggatgc 4081 ggctgcccac gctgcgcgct gtgggcgacg tgctgcgcga ccgcttcgac ggcgcctccc 4141 gcgtccttta cggcaaccga ggcagcaacc gcgcctcgcg ggcggagctg ctgcgcctgg 4201 agcccgaaga ccccgcgcac aagcctccct cccctcacga cctcgtctac ttcgagaaat 4261 cgcccaactt ctgcacgtac agtggccgcc tgggcacagc tggcacagct ggacgagctt 4321 gcaacagctc gtctcccgcg ctggacggct gtgagctgct gtgctgtggc cgaggccacc 4381 gcacgcgcac gcagcgcgtc acggagcgct gcaactgcac cttccactgg tgctgccacg 4441 tcagctgccg caactgcacg cacacgcgcg ttctgcacga gtgtctatga ggtgccgcgc 4501 ctccgggaac gggaacgctc tcttccagtt ctcagacaca ctcgctggtc ctgatgtttg 4561 cccaccctac cgcgtccagc cacagtccca gggttcatag cgatccatct ctcccacctc 4621 ctacctgggg actcctgaaa ccacttgcct gagtcggctc gaaccctttt gccatcctga 4681 gggccctgac ccagcctacc tccctccctc tttgagggag actccttttg cactgccccc 4741 caatttggcc agagggtgag agaaagattc ttcttctggg gtgggggtgg ggaggtcaac 4801 tcttgaaggt gttgcggttc ctgatgtatt ttgcgctgtg acctctttgg gtattatcac 4861 ctttccttgt ctctcgggtc cctataggtc ccttgagttc tctaaccagc acctctgggc 4921 ttcaaggcct ttcccctccc acctgtagct gaagagtttc cgagttgaaa gggcacggaa 4981 agctaagtgg gaaaggaggt tgctggaccc agcagcaaaa ccctacattc tccttgtctc 5041 tgcctcggag ccattgaaca gctgtgaacc atgcctccct 
cagcctcctc ccaccccttc 5101 ctgtcctgcc tcctcatcac tgtgtaaata atttgcaccg aaatgtggcc gcagagccac 5161 gcgttcggtt atgtaaataa aactatttat tgtgctgggt tccagcctgg gttgcagaga 5221 ccaccctcac cccacctcac tgctcctctg ttctgctcgc cagtcctttt gttatccgac 5281 cttttttctc ttttacccag cttctcatag gcgcccttgc ccaccggatc agtatttcct 5341 tccactgtag ctattagtgg ctcctcgccc ccaccaatgt agtatcttcc tctgaggaat 5401 aaaatatcta tttttatcaa cgactctggt ccttgaatcc agaacacagc atggcttcca 5461 acgtcctctt cccttccaat ggacttgctt ctcttctcat agccaaacaa aagagataga 5521 gttgttgaag atctcttttc cagggcctga gcaaggaccc tgagatcctg acccttggat 5581 gaccctaaat gagaccaact agggatc // LOCUS MUSP40M 3808 bp ds-DNA ROD 14-AUG-1990 DEFINITION Mouse P40 T-cell and mast cell growth factor (mP40) gene, complete cds. ACCESSION M30136 KEYWORDS T-cell growth factor; cytokine; mast cell growth factor. SOURCE Mouse (strain C57B1/6; haplotype H2b) lymphoid T cell line L1210 DNA, clone lambda-S40.1a. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 3808) AUTHORS Renauld,J.C., Goethals,A., Houssiau,F., Merz,H., Van Roost,E. and Van Snick,J. TITLE Human P40/Il9: Expression in activated CD4+ cells, genomic organization, and comparison with the mouse gene JOURNAL J. Immunol. 144, 4235-4241 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by J.Van Snick, 21-NOV-1989. 
FEATURES from to/span description pept 644 757 P40 T-cell and mast cell growth factor precursor, exon 1 846 881 P40 T-cell and mast cell growth factor precursor, exon 2 965 997 P40 T-cell and mast cell growth factor precursor, exon 3 2184 2315 P40 T-cell and mast cell growth factor precursor, exon 4 3393 3512 P40 T-cell and mast cell growth factor precursor, exon 5 sigp 644 697 P40 T-cell and mast cell growth factor signal peptide matp 698 757 P40 T-cell and mast cell growth factor 846 881 P40 T-cell and mast cell growth factor 965 997 P40 T-cell and mast cell growth factor 2184 2315 P40 T-cell and mast cell growth factor 3393 3509 P40 T-cell and mast cell growth factor IVS 758 845 mP40 intron A IVS 882 964 mP40 intron B IVS 998 2183 mP40 intron C IVS 2316 3392 mP40 intron D signal 589 594 TATA box binding 227 233 AP-2 binding site binding 475 480 IRF-1 binding site binding 478 484 AP-1 binding site BASE COUNT 1012 a 903 c 869 g 1024 t ORIGIN Chromosome 13. 1 ggatcctcaa ggccaatgct agcaagcaca gttccagcat ccacgtactt acagaaaaca 61 actcatatac tcgcatagag cgacttctca gaggtgtatg tacgaggtag aacttttgtt 121 cctaggcctg acatactgtg agtatctagc aatcaagtaa actgaggctc caatagccag 181 aggaaaaccc caatgagtga aaggcaagtc ttgctttccc ctgatatccc cagtgtgacc 241 ccttcattac cacccctgta actcactgtc tatcagcatt tctcactaat gtggagggga 301 aaacacagac ctgggctttc atggaaagtg ttttgattct cacaaccaga attcctgctt 361 ttaaaggggg ttggggctag atctttaact tttcctttag tatttcagaa cccgactatt 421 tgaagagcat cttttctgag gaatggtatt tcctggcata agacagcctt tgtcaagtga 481 ctcagactga tttttttttc aatatctcaa ttggcctcaa cttacagaga aaaaagtggg 541 cactgggtat cagtttgatg tcagggtttt tcccggtttg aagagcttta aatacagcta 601 gactggaaga tgctggtaga ctgagttcca gactcccgtc aacatgttgg tgacatacat 661 ccttgcctct gttttgctct tcagttctgt gctgggccag agatgcagca ccacatgggg 721 catcagagac accaattacc ttattgaaaa tctgaaggta gatttagggg actggggaga 781 gtcatctgct ggaggatggc aaagggggct caactcactg ggggttatct ctttgcctac 841 tgtaggatga tccaccgtca aaatgcagct gcagcggcaa 
cgtgagtgaa taatctttag 901 aactttccag acttctctca gtgttcacag ctagaatgct catgctgaga tttctctctt 961 gcaggtgacc agctgcttgt gtctctccgt cccaactgta agtatagtcg gaaaaaggtc 1021 aggggacact gtttacgctt tgaagagaaa tcacgcaact catttgacat tctaaattct 1081 aaattactgt ttaagtggga ctgttaaagt atagacccca cctttaaaac ttatatttat 1141 atatgcatca tacacacaac tatatgttta agctagcatc atgcccttga gaaatctgga 1201 tatttttcag tgtctaaaac gtattgacta tcttctgaga taaatgggca tgtaaatgaa 1261 aaaagaaaag aatattgctt actggttgat ctgtggtaag tccagatttg tgaaacaaac 1321 aaaatagtaa catcaggcag ggatcatcta gatatcacaa acacacacac acacacacac 1381 acacacacac acacacacac acacacacct cctttctctg gcatcttcct actctttccc 1441 cttcccagtc tctatctccc cctccttctg cttctccttg ctggtcttta gtggctccca 1501 aatggttcca actgtcctct gggaagacag tatctaagtt tggtgccgaa tgggacccta 1561 agagagcaac tcccatcctg gcaggaaggg gaaggcagga ggctctggaa ctattctaac 1621 tcctggttat gatgagggag cagtcaccca ggcctgtagc cactgcaggg ctcactcttc 1681 aagttggacc agggaaaagc tgagcttaga accccatgga agtcagaagc cccatttgct 1741 ggagttctat ggctagggga gggtccaatc taccccctcc cttctcgatg cagggttgta 1801 gcttgcggct aacaggagtc ctatgcaaat ggcttgtccc atcagctggc atgagaaaat 1861 gcttttcctt tctctggact ttcaaacact cagccactgg acctggacag ggccaccagg 1921 tgcgagacag atgcagtacc acttttgcta gggcatcagg tctacatttc ctgaggagtc 1981 tcctcaggaa tgtcactggg aaactgacct cacctcaaat tacttaagat agtttcctcg 2041 ggtggccagg aagaagagca tgagtttggg cgacactggt taaattaagt tgctctttaa 2101 aatcctcttg gcgattcttc ctgaaagcag tcctaggcag tttttattat aaatcctttt 2161 tataagtcct tgttatttac caggatgatt gtaccacacc gtgctacagg gagggactgt 2221 tacagctgac caatgccaca cagaaatcaa gactcttgcc tgttttccat cgggtgaaaa 2281 ggatagttga agtcctaaag aacataacgt gtccggtgag ttttttgtct gagtgtgaca 2341 aaagtagggc ctctgggtca gagcctgtgg cagggggcct tgagagcaac aactgtctgc 2401 tcacatccaa gcaaatgaga gaggatgata aagttagagc aggtaggccc aggctggaag 2461 ggggtcaggc aaaaaatgta accaggaaca agatcactgc agtggaacag atctgctctg 2521 gggtctgtgg ttcccctggt taaatggaga ccgttgacac ccaagtggcc 
aatgtcttct 2581 ctccacctgt tggatagtga aagaattcac ctaattgtaa gcaaaagcca gactcaaaag 2641 agtcagactg taactcatgc agcactaaat catcctaaaa attcacttta ctaatcgagg 2701 atgtgggagc ctctgagtca ggtttgtgtg tgtgtgttgg tttgttgaat gactgaaaag 2761 ttttgtaaaa gaaaagacac caaggtctag gtaagtactt ccacctgccc tactgactat 2821 cagcatacac cctggttcct gaggcagtgg cccgtcgctg tccgttctcc ctgtgtgtcc 2881 tttcatagga gaacaccagc agtcccaagt gctgtggttg gccagcctag tcacatatct 2941 ctgcccctag tggcacctgc cttggctatg ttacgtgagc cattaccctg agcacctgcc 3001 ttagaggcca gctttagctc tgttcgccag ctccataggg aggaatgtga tgcccgggga 3061 aagttatttt tctgatgcta ttgtatttta taagagacaa cgctgtgact taaaccctgg 3121 tctatgtgaa tccaaagtct cactcagaac tacgtgtgtg tacgtgcaca gactcctgga 3181 atcgtgaccc caaattaaca cttacatggt cttaccaaat gctagctagc ccaggatcaa 3241 ccttcttatt tccctcccat ctcattcctc tctcccctgc ccttcaactg gccaccaggt 3301 ttgagggaag ctggtcctgg gctgctagga gtgtattgct gcaaagtggg ctgcggatgc 3361 tcaccctgtt ctgcccactt ctctccctgc agtccttttc ctgcgaaaag ccatgcaacc 3421 agaccatggc aggcaacaca ctgtcatttc tgaagagtct cctggggacg ttccagaaga 3481 cagagatgca aaggcagaaa agccgaccat gaagacagat gctatttatt ctatttattg 3541 aatttacaaa acctcccctc cttaactgtt acagtgaaga aataaactaa gctattctag 3601 accaaggccc ttttgtgtcc ttttgcttca cagttgcttt tggaactatg gggagggaga 3661 agtaaaaggt atctggaata aagatccacg gggagggaga gagacaagct gtctccagac 3721 cctttccccg tgcctactcc ttcctgttgc tcagagcaaa tgaagctccc ggtgttgcat 3781 aggtaggatg gagcaaagac ttggtaat // LOCUS PIGMHDQAA 1105 bp ss-mRNA MAM 14-AUG-1990 DEFINITION Pig MHC class II SLA-DQ alpha chain mRNA (haplotype c), complete cds. ACCESSION M29938 KEYWORDS cell surface glycoprotein; class II gene; integral membrane protein; major histocompatibility complex. SOURCE Miniature pig (haplotype c) adult spleen, cDNA to mRNA, clone ZQ30-1. ORGANISM Sus scrofa Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Artiodactyla; Suiformes; Suidae. 
REFERENCE 1 (bases 1 to 1105) AUTHORS Hirsch,F., Sachs,D.H., Gustafsson,K., Pratt,K., Germana,S. and LeGuern,C. TITLE Class II genes of miniature swine: III. Characterization of an expressed pig class II gene homologous to HLA-DQA JOURNAL Immunogenetics 31, 52-56 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by C.LeGuern, 17-NOV-1989. FEATURES from to/span description pept 13 777 MHC SLA-DQ alpha chain precursor sigp 13 81 MHC SLA-DQ alpha chain signal peptide matp 82 774 MHC SLA-DQ alpha chain mRNA < 1 1105 MHC SLA-DQ mRNA BASE COUNT 254 a 290 c 255 g 306 t ORIGIN 1 cttgagaaga gcatggtccc aggccgagtt ctgatgtggg gggccctcgc cctgaccacc 61 gtgatgagcg cctgtggagg tgaagacatt gcggccgacc atgttgcctc ctatggctta 121 aatgtctacc agtcttacgg tcccagaggc tactttaccc atgaatttga tggcgacgag 181 caattctacg tggacctgga gaagaaggag actgtctggc ggctgcctct gtttagtgaa 241 tttacaagtt ttgacccgca gggtgcactg aggaacatag ctacgttaaa acataacttg 301 aacattgtga ctaaacgctc caacaacact gcggctgtca ataaggttcc tgaggtgact 361 gtgttttcca agtctccagt gatactgggt cagcccaaca ccctcatctg tcatgtggac 421 agcatctttc ctcctgtgat caacatcacg tggttgaaga acgggcactc tgtcaaaggt 481 ttttctgaga ccagcttcct ctccaaaaat gatcattcct tcctcaagat cagttatctc 541 accttcctcc cttctgatga tgatttttat gactgcaaag tggagcactg gggcctggat 601 aagccacttc tgaaacactg ggaacctgag attccagccc ccatgtcaga gctgacagag 661 actgtggtct gcgccctggg attgatcgtg ggccttgtgg gcatcgtggt gggcactgtc 721 ttcatcattc aaggcctgcg ctcaggtggt ccctctagac accaagggtc cttgtgagtc 781 acactccaga agggaaggag cactgcccgc ctacaagagc tgaagagtgg atgtgctcaa 841 cgacctagaa ctattttctg gccaaattca tcatatacct tctctcttcc tacattcttc 901 ttctcacctc ttctctggaa cttaaggtgc tgtgtcatct cagaattcat atgcccttgg 961 aatcctcttc ctgaccttct aggtcttttt cttttctcag ttgttacctg ttacgggatc 1021 tctgagatat cccactcggc tacctgatcc ctcagtgacc ctaaactaat atgtacatgg 1081 aaacaataaa tcacttcatg aggtt // LOCUS PIGMHDQAB 1174 bp ss-mRNA MAM 14-AUG-1990 DEFINITION Pig MHC 
class II SLA-DQ alpha chain mRNA (haplotype d), complete cds. ACCESSION M29939 KEYWORDS cell surface glycoprotein; class II gene; integral membrane protein; major histocompatibility complex. SOURCE Miniature pig (haplotype d) adult spleen, cDNA to mRNA, clone 16.2. ORGANISM Sus scrofa Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Artiodactyla; Suiformes; Suidae. REFERENCE 1 (bases 1 to 1174) AUTHORS Hirsch,F., Sachs,D.H., Gustafsson,K., Pratt,K., Germana,S. and LeGuern,C. TITLE Class II genes of miniature swine: III. Characterization of an expressed pig class II gene homologous to HLA-DQA JOURNAL Immunogenetics 31, 52-56 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by C.LeGuern, 17-NOV-1989. FEATURES from to/span description pept 69 836 MHC SLA-DQ alpha chain precursor sigp 69 137 MHC SLA-DQ alpha chain signal peptide matp 138 833 MHC SLA-DQ alpha chain BASE COUNT 273 a 313 c 266 g 322 t ORIGIN 1 ctcagctcca tcctcatcat tgctctacaa ctccgaagag caacagctga gaccaccttg 61 agaagagcat ggtcccaggc cgagttctga tgtggggggc cctcgccctg accgccgtga 121 tgagcgcctg tggaggtgaa gacattgcgg ccgaccatgt tgcctcctat ggcttaaatg 181 tctaccagtc ttacggtccc agcggctatt atacccatga atttgatggc gacgaggaat 241 tctacgtgga cctggggaag aaggagactg tctggcagct gcctctgttt agcaaattta 301 gaagttttga cccacagggt gcactgagga acatagctac ggcaaaacat aatttgaaca 361 tcctgattaa acgttccaac aacaccgcgg ctgtcaatca ggttcctgag gtgactgtgt 421 ttcccaagtc tccagtgatg ctgggtcagc ccaacaccct catctgtcat gtggacaaca 481 tctttcctcc tgtgatcaac atcacgtggt tgaagaacgg gcactctgtc acagaaggtt 541 tttctgagac cagcttcctc tccaaaaatg atcattcctt cctcaagatc agttatctca 601 ccttcctccc ttctgatgat gatttttatg actgcaaagt ggagcactgg ggcctggata 661 agccacttct gaaacactgg gaacctgaga ttccagcccc catgtcagag ctgacagaga 721 cagtggtctg cgccctggga ttgatcgtgg gccttgtggg catcgtggtg ggcactgtct 781 tcatcattca aggcctacgc tcaggtggtc cctctagaca ccaagggtcc ttgtgagtca 841 cactccagaa 
gggaaggtgc actgcccgcc tacaagagct gaagagtgga tgtgctcaac 901 gacctagaac tattttctgg ccaaattcat catatacctt ctctcttcct acattcttct 961 tctcacctct tctctggaac ttaaggtgct gtgtcatctc agaattcata tgcccttgga 1021 atcctcttcc tgaccttcta ggtttttttt cttttctcag ttgttacctg ttatgggatc 1081 tctgagatat cccactcggc tacctgatcc ctcagtgacc ctaaactaat atgtacatgg 1141 aaacaataaa tcacttcatg aggtttttgt actg // LOCUS PIGMHDQBA 212 bp ds-DNA MAM 14-AUG-1990 DEFINITION Pig MHC class II SLA-DO beta chain gene (haplotype c/c), 3' end. ACCESSION M29944 KEYWORDS cell surface glycoprotein; class II gene; integral membrane protein; major histocompatibility complex. SOURCE Pig (haplotype c/c) adult spleen DNA, clone D26. ORGANISM Sus scrofa Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Artiodactyla; Suiformes; Suidae. REFERENCE 1 (bases 1 to 212) AUTHORS Pratt,K., Sachs,D.H., Germana,S., El-Gamil,M., Hirsch,F., Gustafsson,K. and LeGuern,C. TITLE Class II genes of miniature swine: II. Molecular identification and characterization of B (beta) genes from SLA-c haplotype JOURNAL Immunogenetics 31, 1-6 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by C.LeGuern, 17-NOV-1989. FEATURES from to/span description pept < 1 12 MHC SLA-DO beta chain BASE COUNT 38 a 58 c 52 g 64 t ORIGIN 1 cagccacact aaggtcctag gcgaacgtct ccccctggag cctgaagtgg aactgagtgg 61 cctgggccct ggacaaagtc aaggacgttc aggaggtcga tgttctggat gattcctttc 121 cacgagccct tggaggtgtc ctgaactgat tctagttttg tgttctgtga tcattcatct 181 cttacccttc tgcctttctt cctcctactt gt // LOCUS PP2LEND 266 bp ds-DNA PHG 14-AUG-1990 DEFINITION Bacteriophage P2 left end DNA. ACCESSION M34756 KEYWORDS . SOURCE Bacteriophage P2 DNA. ORGANISM Bacteriophage P2 Viridae; ds-DNA nonenveloped viruses; Myoviridae. REFERENCE 1 (bases 1 to 266) AUTHORS Ziermann,R. and Calendar,R. 
TITLE Characterization of the cos sites of Bacteriophages P2 and P4 JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by R.Ziermann, 05-JUN-1990. Author address: R.Ziermann University of California Dept. Mol. Cell Biology Berkeley, CA 94720 Email: calendar@garnet.berkeley.edu FEATURES from to/span description site 1 19 cohesive end of phage DNA site 20 266 left end of phage DNA BASE COUNT 41 a 58 c 99 g 68 t ORIGIN Map position 0.0-0.8%. 1 ggcgaggcgg ggaaagcact gcgcgctgac ggtggtgctg attgtatttt ttcagcgtct 61 cagcgcgtcg tgacggcact tagtctgccc gttgaggcgt tgtgtgtctg cggggtgttt 121 tgtgcggtgg tgagcgtgtg aggggggatg acggggtgta aaaaagccgc ccgcaggcgg 181 cgatgttcag tcgttgtcag tgtccagtga gtagttttta aagcggatga cctcctgacc 241 gagccagccg tttatctcgc ggatcc // LOCUS RCABCHH 314 bp ds-DNA BCT 14-AUG-1990 DEFINITION R.capsulata bacteriochlorophyll biosynthesis protein (bchH and F108) genes, 3' end, and 5' end respectively. ACCESSION M34843 KEYWORDS . SOURCE R.capsulata DNA. ORGANISM Rhodobacter capsulatus Prokaryota; Bacteria; Gracilicutes; Anoxyphotobacteria; Purple nonsulfur bacteria. REFERENCE 1 (bases 1 to 314) AUTHORS Yang,Z. and Bauer,C.E. TITLE Rhodobacter capsulatus genes involved in early steps of the bacteriochlorophyll biosynthetic pathway JOURNAL J. Bacteriol. (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by C.E.Bauer, 04-JUN-1990. FEATURES from to/span description pept < 1 79 bacteriochlorophyll biosynthesis protein (bchH) (AA at 2) pept 108 > 314 bacteriochlorophyll biosynthesis protein (F108) BASE COUNT 59 a 87 c 119 g 49 t ORIGIN 1 bp upstream of BamHI site. 
1 ggatccggcg acgctggcgg ctttgcaagc cgcggcggac gagctcgaag accggatgga 61 aggcgttgcg gcagaatgag atggacgact acgaaaagga gggccgcatg agcccgcgcg 121 acgatattcc cgacctgaag ggcttcgacg gtgacggcga gggctcggtg caggtgcacg 181 acagcgaaga catcggcctc gatgtcggcg gcgcgcgggt gttctcggtc tatggcaagg 241 gggggatcgg caaatcgacg acttcctcga acctctcggc cgcgttctcg cttctgggca 301 agcgcgtgct gcag // LOCUS TREANTAT 999 bp ds-DNA ORG 14-AUG-1990 DEFINITION T.evansi AnTat 3/3 kinetoplast DNA minicircle. ACCESSION M34848 KEYWORDS . SOURCE T.evansi AnTat 3/3 kinetoplast minicircle DNA. ORGANISM Kinetoplast Trypanosoma evansi Eukaryota; Animalia; Protozoa; Sarcomastigophora; Mastigophora; Kinetoplastida; Trypanosomatina; Trypanosomatidae; Trypanosoma evansi. REFERENCE 1 (bases 1 to 999) AUTHORS Bajyana Songa,E., Paindavoine,P., Viseshakul,N., Muyldermans,S., Steinert,M. and Hamers,R. TITLE Evidence for kinetoplast and nuclear DNA homogeneity in T.evansi JOURNAL Mol. Biochem. Parasitol. (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by E.Bajyana Songa, 05-JUN-1990. 
FEATURES from to/span description site 367 516 kinetoplast minicircle common region (ORI) BASE COUNT 391 a 78 c 203 g 327 t ORIGIN 1 gaatcagtgt cttttgaggg aagtaaagta atataataga tagaaacata ataataattt 61 aatttgatag tatatacata tcaacaacga caaagagtca gtgaaattag agataaagtt 121 attgtagtta tataattaaa ttaatctatc tattatttat tttcttttat agaggagagg 181 gaataagagg gaaaattcat ggagatacta gggtgagaga gttaatagag taattgtagt 241 tgggaatatg gagtagttat aattatattg gtgaaaggaa agggctaaaa gtcgtgtagt 301 agaatagagg ttgataggaa taagtggtgg aatttgtaga aggggttggt aaaaatctat 361 aaaaattgtt aaaattggct aaaaatcggg ctgaaaaaac ggaaaatctt atgggcggtg 421 caaattttca ccatacacaa acacgtgcta ttttcggggg tttttaggtc cgaggtactt 481 cgagaggggt tggtgtaata cacacacggt ttttctcagg gtttttgagg caattcgcag 541 ttttcctgag gttctcagta cacttaattt ggatttaatt gattctatag agaaaaatag 601 aataatagat aagtaagtaa ttcatgaata tagatatata taattgtaca tataccaaca 661 aacagaataa ctaatgcaca gtgatgatga tagttaatta attatatata aagttctaat 721 ctatctatta ttatatttaa ttgagtgatg tgagaataag gtgatatttc aatcctaaac 781 aaaagaaatg tgtaatagat agaagataat gagaagttaa ttataaatat atcatacaaa 841 ataacaatga tcacagataa tgagtgaata tagatagaga atttaattat tattattgtg 901 tatattgaat tacatattta ttattttatt ttagtatata ggacgcagaa atagcagtat 961 aaaataatgg ataaaagagt ttataggtga agttgaagt // LOCUS VIBSCRRE 1524 bp ds-DNA BCT 14-AUG-1990 DEFINITION V.alginolyticus Scr repressor protein gene (scrR), complete cds, and sucrose transport (scrA) protein, 5' end. ACCESSION M35009 KEYWORDS Scr repressor protein; sucrose transport protein. SOURCE V.alginolyticus DNA. ORGANISM Vibrio alginolyticus Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Facultatively anaerobic rods; Vibrionaceae. REFERENCE 1 (bases 1 to 1524) AUTHORS Blatch,G.L. and Woods,D.R. 
TITLE Nucleotide sequence and analysis of the Vibrio alginolyticus Scr repressor-encoding gene JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by G.L.Blatch, 10-JUN-1990. Author address: G.L.Blatch University of Cape Town Dept. Microbiology Private Bag Rondebosch 7700 SOUTH AFRICA Email: bltgre01.uctvax@f4.n494.z5.fidonet.org FEATURES from to/span description pept 36 < 1 (c) sucrose transport protein (scrA) pept 271 555 Scr repressor protein (scrR) pept 791 1090 ORF5 binding 264 267 scrR ribosomal binding site binding 780 785 ORF5 ribosomal binding site signal 178 183 scrA -10 region signal 203 208 scrA -35 region signal 190 195 scrR -10 region signal 162 167 scrR -35 region signal 762 767 ORF5 -10 region signal 742 747 ORF5 -35 region site 78 91 repressor operator site (put.) site 229 242 repressor operator site (put.) site 12 71 ScrR helix-turn-helix motif BASE COUNT 466 a 344 c 321 g 393 t ORIGIN 1 agttaacaac tcttttgcta ctgccggata attcatttca taccccattg gattattatc 61 tgagcttgca gaattttcgg gaacgtttgc aaaacagatt tttgtcaata aaaccaaatg 121 atcaaagata aagcagagag aatgtgagaa tgatcgtcaa attctctccc taagagcagt 181 ataggaggtt agaatcgcta cttatcaaag agcctttcat cttctctttg caaacaatcc 241 caaaaacgaa aaacagaatg gataggaaca atggcgagtc tgcatgatgt cgcccgtctg 301 gctggcgtat caaaatcaac agtgtcacgg gttataaacg atgagtatgg cgtaaaagaa 361 gcgacaaaac agaaggtccg ccaagcagta gcggagtgtg gctatgtgcc gaaccaagtc 421 gctaaagatc ttaaagaaga aagctctgcc attattgttc aagacaaaac aggcaaagtg 481 ctgtttgtaa aagaaggcgc acttgagcaa gatgaaattg caaaggtgat cgagctaatt 541 aaacagaaca tctagtctcg ttgcttggct ccttaatatc cgtaatacac aatataagga 601 gccatgcgac caaacttaaa gctgagattt ctgttgcccc tagttgaacg cttagtgatt 661 ccaccaagga gactgtctca gattcggcga catcacatgt gttgacgtaa attttcccac 721 tagccaacaa aagagtgaac ttttactcat tatgggtgtt atattataac agtcggtaag 781 aggagctcag atgtaccacc accagcaaaa gatacggaag cattggcatc gcactgtttt 841 atttttcagt gtcgcgttgc
tgatcgcttg gaactttgcg gtaatccttc atcaagttga 901 tctgactccc gaacaccaca cacaccatca ttgccagcta ttttctgggg ttcagcacgg 961 catagccaaa gctcaaccga ccctatcgac gccaacattt acgcgcatcc aataccatga 1021 tgtctttcag cgccttgtta atagtgaaga cattcgtggt gcagctcgtg ccccgcctta 1081 ttttgcttaa tttccaaact attccacact caacataatt tgtgcgtatt cctgcacctc 1141 acgtgtcagc gaatcgcacc ttagtaaata caaatctgat ccggtattgc tcgggttagt 1201 gccagcaaaa ttaggaaaca caaaatgtca tcgaaacata ttttagcgct tgtcattggt 1261 ctctctttat cgacagcagc aaatgcggac gaataccgcc agcacggtgc tcatgtgcat 1321 ggtcacgtcg aatttaatat cgcccaagat ggaaaggatc tgttagtgga gattactgct 1381 cccggtgccg atgtggtcgg tttcgagcat gctccagaaa atgaacaaca agagcaagca 1441 cttaaacaag ccattgccac cctagaagac agcaacacac tgtttgcgat taacgcgcaa 1501 gccaattgtg atattgaaga agct // LOCUS XELNRP1A 2431 bp ss-mRNA VRT 14-AUG-1990 DEFINITION X.laevis nervous system-specific RNA-binding protein (nrp-1A) mRNA, complete cds. ACCESSION M34894 KEYWORDS RNA-binding protein; nervous system-specific protein. SOURCE X.laevis stage 24, cDNA to mRNA, clones 24-39. ORGANISM Xenopus laevis Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Amphibia; Lissamphibia; Anura; Archeobatrachia; Pipoidea; Pipidae. REFERENCE 1 (bases 1 to 2431) AUTHORS Richter,K., Good,P.J. and Dawid,I.B. TITLE A developmentally regulated, nervous system-specific gene in Xenopus encodes a putative RNA-binding protein JOURNAL New Biol. (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by P.J.Good, 05-JUN-1990. FEATURES from to/span description pept 74 1117 pot. RNA-binding protein (nrp-1A) mRNA < 1 2431 nrp-1A mRNA signal 2407 2418 poly-A signal site 22 27 RNP2 consensus sequence site 58 68 RNP1 consensus sequence site 111 116 RNP2 consensus sequence site 147 157 RNP1 consensus sequence site 921 941 alternative exon (put.)
BASE COUNT 663 a 522 c 543 g 703 t ORIGIN 1 gtggctacac cgccccttcc aatcctcttg tccgcgggag ctgcctccga tctcaggagt 61 cgccagagac cgaatggaga cagaagcgcc ccagcccgga ctgtcgcccc cggactccgc 121 gcacgatccc tgcaaaatgt tcattggggg actgagctgg cagacaacac aagagggact 181 gcgggagtac ttcagtcatt ttggggatgt aaaagagtgt cttgtcatga gagatccatt 241 gaccaaaaga tcaaggggtt tcggatttgt aactttcatg gaccaagcgg gagtggacaa 301 agttttggct caatcacgac atgagctgga ctcaaagacg attgatccaa aagtagcatt 361 tcctcgtaga gctcaaccca agatggtaac acggacaaag aagatttttg tagggggctt 421 gtcagtgaat acaacagttg aagatgtgaa acagtatttt gagcaatttg ggaaggtgga 481 tgatgccatg ttaatgtttg ataaaacaac aaacaggcac agaggattcg ggtttgtgac 541 ttttgaagga gaggatattg tggagaaaat ttgcgatatt cacttccatg agatcaacaa 601 taagatggtt gagtgtaaga aggcccaacc caaggaggtg atgtcaccaa caggatctgt 661 gagaggccga tctcgggtca tgccctatgg aatggatgct ttcatgctgg gaataggcat 721 gctaggatac ccgggatttc aggcagctac atatgccagt cgcagctaca caggcattgc 781 tcctgggtat acttatcagt tcccagagtt tcgtgtagag agaacccctc tcccgggcgc 841 cccagtcctc ccagaactca cagctatccc cctgacggca tatggtcctg tagcggcagc 901 agctgcagca gtggtgagag gctccactcc aacccgcaca ggggggtttc ttggaacaag 961 cagcccagga ccaatggcag aactttatgg agcagccaat caagaatcgg cagttagcag 1021 ctatatcagt gcagcaagcc cagcacccag cacaggcttt ggacacagcc ttgggggtcc 1081 tctgattgcc actgctttca ccaacggcta ccactgaaac agcaagtcac tgcagaggct 1141 tgagtctgga agcatctctt gcacgattct gttcctcccc tctctgaata tcacatggct 1201 gtacctcacc ttgtgttcct gcacagtaac tggtgcctcc tgcacaacct tcacctagga 1261 atttcttatt tatttctatt tttgttttgg ggtttttttt tcattctcct tttattagat 1321 ttgtgttttg tccaaattag aaatcaggga gacatttcca taatcttcat ggttattctg 1381 ttctggtaag atgcagagaa tcctcggtac gaaacgattc tgcagagaga gaggggagat 1441 tgtatttcga ctgtctggaa atgtgttttt tttcttcttt tttttttata tttctatgta 1501 aatttcggct cttaaaaaca tgtaaaactg aaatttggag acaacagtgc acaagaacaa 1561 gacaaatgta tattttgcta aatgaaaaag aactatttgt agaacagaaa ttattttaat 1621 tttctcaaag ctggtttgaa gttttttttt ttctgtgtaa aatatgtaaa 
tattgcagac 1681 tgattgttct gccaccggaa aggtctcttt ggcccagatt atcttttttg cctacttggg 1741 cctgtgtttt agcaatattt tcctaaagcc tgaatggatt gtcgggtttt tttatacatc 1801 ttttattatg atggatgtca gtccacagga atatgatgag ttactctgat gcaggggaat 1861 gcagatgcca tgctgcaaaa cttcttactc tttagcctct gataattaac tctttaaggc 1921 tcacttgtcc tttgctcatg gatataatga gagggctata tggagggcta taaggtggga 1981 ctaacctcag agaaccagaa aagctacagt aaagtgctgt tacacactga catacttcct 2041 cagaaccctc agaaaagtaa gtgtatgccc agtcccgatt tatgaactgt ctgccctttg 2101 atctgctctg cactctttat agttttgggt cctataaatt aaatgcagat ctaccttttc 2161 tttatgtgca gtgtgtaagt ccacattttt tgttatgagg aatcttctta acaataaaat 2221 catgtggaat tagcccccaa ccaaaccccc cccacaaact gtctctcacg tgagcaaagg 2281 aaggtgctac actataaaga gtttcttatc actgggccac acacattact gcaatgtaca 2341 atgtactgca acgtcaaagc aataatcttt gttactgcaa ctggactttt ttaaaagata 2401 gttcaaaata aactatgaat tttattcatt t // LOCUS XELNRP1B 2324 bp ss-mRNA VRT 14-AUG-1990 DEFINITION X.laevis nervous system-specific RNA-binding protein (nrp-1B) mRNA, complete cds. ACCESSION M34895 KEYWORDS RNA-binding protein; nervous system-specific protein. SOURCE X.laevis stage 24, cDNA to mRNA. ORGANISM Xenopus laevis Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Amphibia; Lissamphibia; Anura; Archeobatrachia; Pipoidea; Pipidae. REFERENCE 1 (bases 1 to 2324) AUTHORS Richter,K., Good,P.J. and Dawid,I.B. TITLE A developmentally regulated, nervous system-specific gene in Xenopus encodes a putative RNA-binding protein JOURNAL New Biol. (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by P.J.Good, 05-JUN-1990. FEATURES from to/span description pept 29 1051 pot. 
RNA-binding protein (nrp-1B) mRNA < 1 2324 nrp-1B mRNA signal 2306 2311 poly-A signal site 22 27 RNP2 consensus sequence site 58 68 RNP1 consensus sequence site 111 116 RNP2 consensus sequence site 147 157 RNP1 consensus sequence BASE COUNT 647 a 467 c 520 g 690 t ORIGIN 1 tccgatctca ggagtcgcca gagaccgaat ggagacagaa gcgccccagc ccggactgtc 61 gcccccggac tccgcgcacg atccctgcaa aatgttcatt gggggactga gctggcagac 121 aacacaagag ggcctgcggg agtactttag tcattttggg gatgtaaaag agtgtcttgt 181 catgagagat ccattgacca aaagatcaag gggttttgga ttcgtaactt tcatggacca 241 agcaggagtg gacaaagttt tggctcaatc acgacatgag ctggactcaa agacgattga 301 cccaaaagta gcatttcctc ggagagctca acccaagatg gtaacacgga caaagaagat 361 ttttgtaggg ggcttgtcag tgaatacaac agttgaagat gtgaaacaat attttgagca 421 gtttgggaag gtggatgatg ccatgttaat gtttgataaa acaacaaaca ggcacagagg 481 gttcgggttt gtgacttttg aaggagagga tattgtggag aaagtttgtg atatacactt 541 ccatgagatc aacaataaaa tggttgagtg taagaaggcc cagcccaagg aagtgatgtc 601 accaacaggg tctgtgagag gccgatctcg ggtcatgcca tatggaatgg atgccttcat 661 gttgggcata ggcatgctag gatacccggg atttcaggct gctacatatg ccagtcgcag 721 ctacacaggc attgcacctg gatatactta tcagttccca gagtttcgtg tagagagaac 781 ccctctcccg ggcgccccag tcctcccaga actcacagct atccccctga ctgcatatgg 841 tcccgtagca gcagctgctg cagcagtggt gagagggggg tttcttggaa caagcagccc 901 agggccaatg gcagaacttt atggagcagc caatcaagaa tcagcagtta gcagctatat 961 cagtgccgca agcccagcac ccagcacagg ctttggacac agtcttgggg gccctctgat 1021 tgccactgct tttaccaacg gctaccactg aaacagcaag tcactgcaga ggcttgagtg 1081 tggaagcatc tcttgcacga ttctgttcct cccctctctg aatatcagat ggctgtacct 1141 caccttgtgt tcctgcacag taactggtgc ctcctgcaca accttcacct aggaatttat 1201 ttatttcttt tttttttttt tttaatcctt ctttattaga tttgtgtttt gtccaaatta 1261 gaaatcaggg agacatttcc gaaatattca tggtattttg ttttggtaag atgcggagaa 1321 tcctcggtac taaatgattt tgcagaggga gaggggagat tgtatttcgc ctgtctggaa 1381 atgtgttttt tttttttcct tctttcctat tttttttatc ttatattttt atgtaaatct 1441 cggctcttaa aaacatgtaa aactgaaatt 
tgaagacaac agtgcacaag aacaagacaa 1501 atgtatattt tgctaaatga aaaaaaaact atttgtagaa cagaaattat tttaattttc 1561 tcaaagctgg tttgaagttt tttttttttt tctttgtaaa atatgtaaat attgcagact 1621 gattgttctg ccaccggaag gtctctttgg cccagattat cttttgccta cttgggcctg 1681 tgttttagca atattttcct aaagcctgaa tggattgtca ggttttttta tacatctttt 1741 attataatgg atgtcagtcc acaggaatat gatgcagggg aatgcagaac ttcttactct 1801 ttagcctctg ataattaact ctttaagttt cacatgtcct ttgctcatgg atataaggag 1861 agggctgtaa agtggggcta acctcagaga atcagctcca gggcacagga tttcggtgag 1921 ctacaaagct gctgtaaaat accgttacac actgacatac ttaggaatcc tcagcaaagt 1981 aaatgtatga actgtctgct ctttgatccg ctctgcactc tttatagttt tggagtacta 2041 taaattaaat gcagatctag ttcttctata tgtgcagtgt gcaagtccat ttgtgggcac 2101 ccactctcat tatggggcat cttttcaaca ataaaatcat atagaaattc ccatgaagtt 2161 gtcgctcacg tgagataagg aagttgcaac actataaaga gtttcttatc actgggccat 2221 atacaatact gcaatgtaca atgtactgca aattcaaagc aataatcttt gttactgcaa 2281 ctggactttt ttaaagatag ttcaaaataa attgaatttt attc // LOCUS YSJSACDHY 2569 bp ds-DNA PLN 14-AUG-1990 DEFINITION Y.lipolytica saccharopine dehydrogenase (LYS5) gene, complete cds. ACCESSION M34929 KEYWORDS saccharopine dehydrogenase. SOURCE Y.lipolytica (strain W29) DNA. ORGANISM Yarrowia lipolytica Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes; Endomycetales; Saccharomycetaceae. REFERENCE 1 (bases 1 to 2569) AUTHORS Xuan,J.-W., Fournier,P.M., Declerck,N., Chasles,M. and Gaillardin,C. TITLE Overlapping reading-frames at the LYS5 locus in the yeast Yarrowia lipolytica JOURNAL Mol. Cell. Biol. (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by P.M.Fournier, 06-JUN-1990. ORF1 is probably not translated, since a lacZ fusion/expression experiment did not produce results. FEATURES from to/span description pept 649 1929 ORF1 pept 1764 655 (c) saccharopine dehydrogenase mRNA 582 2060 ORF1 mRNA (alt.) mRNA 598 2060 ORF1 mRNA (alt.)
mRNA 599 2060 ORF1 mRNA (alt.) mRNA 602 2060 ORF1 mRNA (alt.) mRNA 606 2060 ORF1 mRNA (alt.) mRNA 609 2060 ORF1 mRNA (alt.) mRNA 1807 639 (c) ORF2 mRNA (alt.) mRNA 1809 639 (c) ORF2 mRNA (alt.) mRNA 1811 639 (c) ORF2 mRNA (alt.) mRNA 1812 639 (c) ORF2 mRNA (alt.) mRNA 1814 639 (c) ORF2 mRNA (alt.) signal 521 528 TATA box signal 565 575 TATA box BASE COUNT 668 a 573 c 666 g 662 t ORIGIN 1 gagtctggaa cggtccctct tctcagaatt gtgaacgaaa ggaacaggtt caacgaacag 61 aacagattcc atagccacac ctttgcctat ggcttcacaa ccgaaggcaa ttcgagaggt 121 cgcgcttatg gaatcgactc gtataaagct gaagggaaag ggagacgttc cgagcgctca 181 gatgcaatag tcgtccagct aatgtggatt caaaaacaac cccaacagta atcttgaaaa 241 tttgaacgga tcaatctgaa cactcttgct ccaggtcatt cttctaacgc acatccccag 301 agtctagagg gagttgtgtt gtgaacatcc taataaacaa tgcaatggat tcgggatatc 361 ttctgtctcg ccccctactc gatgtcgagt aaaccgatca ccaactaaca atactcctcc 421 gcgttctgcc attgactctc aaacagacat cgctatcaac ggaacagcat attttagctt 481 cttaggacaa taaatattga taatgccggc tctccctcgg tatattaagc aatccattca 541 tacactcatt catcaggtta attttatata tataatttgt ctattcaaac accgtaaatt 601 actggtacca tcatctcctc cttttcaaat acacgtctat ttgcattaat gaaattactc 661 gccaattcgc agaacgtgtt tgtcgaacag agccttagct cgggtccaga caggagcagt 721 gtctcgctga ggaagctgca ggagagaagg cagcagagcc tcagaaaagg cctcggaagc 781 ctctcgcggc agaagagagg gcaggtggtc gatcgagcac acggacagct tggggccagc 841 agtggtctca acaggcacgg tgggatggtc gaacgtggta gcaattgtgt acacggggac 901 agggttgtgg gggttggtgg tgtcagcaga gacgtcgaca atgacactga gctttcgggt 961 ctccttgttg agcagatcgt agttgatgaa aggaggaatg ggctgagaca ggtagatgca 1021 gttgatgaag atatccgcgt cagcaatctc ttggaagggt ccacccttct tggtctcgtt 1081 catgtcccat cgaatgatgt tctcttcggg gataccgacc tttcgggcca gatcaatggc 1141 accggatcca catcggccaa gagcaccaat caccaggacg gtagggagct tggagccctt 1201 ctcaacagca gcggcaagat ccttcttaat cttgtcgaca agctcggtct cattgggata 1261 ggcagagaca ccgggcaggt tttcgctgtc ggggtgggtc tgctggaagg cccaagtctc 1321 gacaccgatg gcggcaccgg cgaatccagc gtggaagcca aaggcggcaa 
ctcgtcgtcc 1381 attgtcatcc tccaggaact caaggtcgta cagagttccg tttcctgcgg ggaatcggct 1441 cagaacgtcc ttccatccgc cctgatcctt gtagcagtga gcaaactgga tgtgctcgtg 1501 agacagaggg aaagattcct caggcagctc cttaagacca ataatcattc ggtcctcggg 1561 ggcagagacc caagagccct cctcgacaag agtggctccg acatcgacaa actcctggtc 1621 gtcgaagatt cgaaggggag acttctccac aaagacctcg aatccagcat caagcagctt 1681 tcgggtggta gtaggcgtga gagcagatcg gtgctcgaga ggcttggtct cggctcggag 1741 atggagcttc actggtgcgg tcattgtgtc tgtgttgttg tatgtggtga agtgacaatg 1801 gatggtgtgt cgtatcaggc tggggaacaa ttgtgcttaa gtatgctgca gttgagtaag 1861 agtcatcgat ccaccaaaat aaagtttgcc attagggttg gagagagaga tggtggctgg 1921 aagaattaaa tgacatcaag ctgaggattg tgggtgtgca ataacacatg ttaggggtga 1981 cctgtggctc gaaatctgat aattattttg taactttatg attattctta gattttttaa 2041 tattcctcta tataacacat aagtagctgt cgtctagttg ttcatagcct gactcctgca 2101 atagattagt gcagagtgat tttgtgcaat tgagagccac ggttgagtca agtgactttg 2161 tgtgtgaagt catcttacgt ttcaagtctc acaggttact caattggttg gttgtctgcc 2221 ctttacagat atttacagta cctgagcgta aagtcgttca tccacggaat gactgttcct 2281 gtcacgcagt catgatcatg gatgtggctg gtcaggaacc attttggata ggagacttag 2341 ggattggact attattgaaa aaactgagcc gaatatgata tagttctatt tgaatgcaga 2401 acttctgatg gtcaattcac ttatttcagg catatcggtc atggtggcag ctgccacgat 2461 gttatctcgt tggaaacctc gtccgccatt aaccggaaca tgcttctaga agatgagaat 2521 acagcatgta cttgtacaag attctcggca ttgatggtct caggcatgc // LOCUS HUMVINC 5102 bp ss-mRNA PRI 14-AUG-1990 DEFINITION Human vinculin mRNA, complete cds. ACCESSION M33308 KEYWORDS cytoskeletal protein; vinculin. SOURCE Human endothelial cells, cDNA to mRNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 5102) AUTHORS Weller,P.A., Ogryzko,E.P., Corben,E.B., Zhidkova,N.I., Patel,B., Price,G.J., Spurr,N.K., Koteliansky,V.E. and Critchley,D.R. 
TITLE Complete sequence of human vinculin and assignment of the gene to chromosome 10 JOURNAL Proc. Natl. Acad. Sci. U.S.A. 87, 5667-5671 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by P.A.Weller, 28-MAR-1990. FEATURES from to/span description pept 51 3251 vinculin BASE COUNT 1379 a 1248 c 1273 g 1202 t ORIGIN Chromosome 10. 1 gaattccact tctctgtcgc ccgcggttcg ccgccccgct cgccgccgcg atgccagtgt 61 ttcatacgcg cacgatcgag agcatcctgg agccggtggc acagcagatc tcccacctgg 121 tgataatgca cgaggagggc gaggtggacg gcaaagccat tcctgacctc accgcgcccg 181 tggccgccgt gcaggcggcc gtcagcaacc tcgtccgggt tggaaaagag actgttcaaa 241 ccactgagga tcagattttg aagagagata tgccaccagc atttattaag gttgagaatg 301 cttgcaccaa gcttgtccag gcagctcaga tgcttcagtc agacccttac tcagtgcctg 361 ctcgagatta tctaattgat gggtcaaggg gcatcctctc tggaacatca gacctgctcc 421 ttaccttcga tgaggctgag gtccgtaaaa ttattagagt ttgcaaagga attttggaat 481 atcttacagt ggcagaggtg gtggagacta tggaagattt ggtcacttac acaaagaatc 541 ttgggccagg aatgactaag atggccaaga tgattgacga gagacagcag gagctcactc 601 accaggagca ccgagtgatg ttggtgaact cgatgaacac cgtgaaagag ttgctgccag 661 ttctcatttc agctatgaag atttttgtaa caactaaaaa ctcaaaaaac caaggcatag 721 aggaagcttt aaaaaatcgc aattttactg tagaaaaaat gagtgctgaa attaatgaga 781 taattcgtgt gttacaactc acctcttggg atgaagatgc ctgggccagc aaggacactg 841 aagccatgaa gagagcattg gcctccatag actccaaact gaaccaggcc aaaggttggc 901 tccgtgaccc tagtgcctcc ccaggggatg ctggtgagca ggccatcaga cagatcttag 961 atgaagctgg aaaagttggt gaactctgtg caggcaaaga acgcagggag attctgggaa 1021 cttgcaaaat gctagggcag atgactgatc aagtggctga cctccgtgcc agaggacaag 1081 gatcctcacc ggtggccatg cagaaagctc agcaggtatc tcagggtctg gatgtgctca 1141 cagcaaaagt ggaaaatgca gctcgcaagc tggaagccat gaccaactca aagcagagca 1201 ttgcaaagaa gatcgatgct gctcagaact ggcttgcaga tccaaatggt ggaccggaag 1261 gagaagagca gattcgaggt gctttggctg aagctcggaa aatagcagaa ttatgtgatg 1321 atcctaaaga aagagatgac attctacgtt cccttgggga aatatctgct 
ctgacttcta 1381 aattagcaga tctacgaaga caggggaaag gagattctcc agaggctcga gccttggcca 1441 aacaggtggc cacggccctg cagaacctgc agaccaaaac caaccgggct gtggccaaca 1501 gcagaccggc caaagcagct gtacaccttg agggcaagat tgagcaagca cagcggtgga 1561 ttgataatcc cacagtggat gaccgtggag tcggtcaggc tgccatccgg gggcttgtgg 1621 ccgaagggca tcgtctggct aatgttatga tggggcctta tcggcaagat cttctcgcca 1681 agtgtgaccg agtggaccag ctgacagccc agctggctga cctggctgcc agaggggaag 1741 gggagagtcc tcaggcacga gcacttgcat ctcagctcca agactcctta aaggatctaa 1801 aagctcggat gcaggaggcc atgactcagg aagtgtcaga tgttttcagc gataccacaa 1861 ctcccatcaa gctgttggca gtggcagcca cggcgcctcc tgatgcgcct aacagggaag 1921 aggtatttga tgagagggca gctaactttg aaaaccattc aggaaagctt ggtgctacgg 1981 ccgagaaggc ggctgcggtt ggtactgcta ataaatcaac agtggaaggc attcaggcct 2041 cagtgaagac ggcccgagaa ctcacacccc aggtggtctc ggctgctcgt atcttactta 2101 ggaaccctgg aaatcaagct gcttatgaac attttgagac catgaagaac cagtggatcg 2161 ataatgttga aaaaatgaca gggctggtgg acgaagccat tgataccaaa tctctgttgg 2221 atgcttcaga agaagcaatt aaaaaagacc tggacaagtg caaggtagct atggccaaca 2281 ttcagcctca gatgctggtt gctggggcaa ccagtattgc tcgtcgggcc aaccggatcc 2341 tgctggtggc taagagggag gtggagaatt ccgaggatcc caagttccgt gaggctgtga 2401 aagctgcctc tgatgaattg agcaaaacca tctccccaat ggtgatggat gcaaaagctg 2461 tggctggaaa catttccgac cctggactgc aaaagagctt cctggactca ggatatcgga 2521 tcctgggagc tgtggccaag gtcagagaag ccttccaacc tcaggagcct gacttcccgc 2581 cgcctccacc agaccttgaa caactccgac taacagatga gcttgctcct cccaaaccac 2641 ctctgcctga aggtgaggtc cctccaccta ggcctccacc accagaggaa aaggatgaag 2701 agttccctga gcagaaggcc ggggaggtga ttaaccagcc aatgatgatg gctgccagac 2761 agctccatga tgaagctcgc aaatggtcca gcaagggcaa tgacatcatt gcagcagcca 2821 agcgcatggc tctgctgatg gctgagatgt ctcggctggt aagagggggc agtggtacca 2881 agcgggcact cattcagtgt gccaaggaca tcgccaaggc ctcagatgag gtgactcggt 2941 tggccaagga ggttgccaag cagtgcacag ataaacggat tagaaccaac ctcttacagg 3001 tatgtgagcg aatcccaacc ataagcaccc agctcaaaat cctgtccaca gtgaaggcca 
3061 ccatgctggg ccggaccaac atcagtgatg aggagtctga gcaggccaca gagatgctgg 3121 ttcacaatgc ccagaacctc atgcagtctg tgaaggagac tgtgcgggaa gctgaagctg 3181 cttcaatcaa aattcgaaca gatgctggat ttacactgcg ctgggttaga aagactccct 3241 ggtaccagta ggcacctggc tgagcctggc tggcacagaa acctctacta aaaagaagga 3301 aaatgatctg agtcccagga gctgcccaga gttgctggga gctgaaaaat cacatcctgg 3361 cctggcacat cagaaaggaa tgggggcctc ttcaaattag aagacattta tactcttttt 3421 tcatggacac tttgaaatgt gtttctgtat aaagcctgta ttctcaaaca cagttacact 3481 tgtgcaccct ctatcccaat aggcagactg ggtttctagc ccatggactt cacataagct 3541 cagaatccaa gtgaacacta gccagacact ctgctctgcc cttgttccct aggggacact 3601 tccctctgtt tctctttcct tggctcccat tcactcttcc agaatcccaa gacccagggc 3661 ccaggcaaat cagttactaa gaagaaaatt gctgtgcctc ccaaaattgt tttgagcttt 3721 ccatgttgct gccaaccata ccttccttcc ctgggctgtg ctacctgggt ccttttcaga 3781 agtgagcttt gctgctacag gggaaggtgg cctctgtgga gccccagcat atgggggcct 3841 ggattcattt cctgcccttc ctcagtttaa tccttctagt ttcccacaat ataaaactgt 3901 acttcactgt caggaagaaa tcacagaatc atatgattct gcttttacca tgcccctgag 3961 caatgtctgt gctagggaaa ctccccgtcc catatcctgc ctcagcccgc caaggtagcc 4021 atcccatgaa cacactgtgt cctggtgctc tctgccactg gaagggcaga gtagccaggg 4081 tgtggccctg ccatcttccc agcagggcca ctcccggcac tccatgctta gtcactgcct 4141 gcagaggtct gtgctgaggc cttatcattc attcttagct cttaattgtt cattttgagc 4201 tgaaatgctg cattttaatt ttaaccaaaa catgtctcct atatcctggt ttttgtagcc 4261 ttcctccaca tcctttctaa acaagatttt aaagacatgt aggtgtttgt tcatctgtaa 4321 ctctaaaaga tcctttttaa attcagtcct aagaaagagg agtgcttgtc ccctaagagt 4381 gtttaatggc aaggcagccc tgtctgaagg acacttcctg cctaagggag agtggtattt 4441 gcagactaga attctagtgc tgctgaagat gaatcaatgg gaaatactac tcctgtaatt 4501 cctacctccc tgcaaccaac tacaaccaag ctctctgcat ctactcccaa gtatggggtt 4561 caagagagta atgggtttca tatttcttat caccacagta agttcctact aggcaaaatg 4621 agagggcagt gtttcctttt tggtacttat tactgctaag tatttcccag cacatgaaac 4681 cttatttttt ccaaagccag aaccagatga gtaaaggagt aagaaccttg cctgaacatc 4741 
cttccttccc acccatcgct gtgtgttagt tcccaacatc gaatgtgtac aacttaagtt 4801 ggtcctttac actcaggctt tcactatttc ctttaaaatg aggatgatta ttttcaaggc 4861 cctcagcata tttgtatagt tgcttgcctg atataaatgc aatattaatg cctttaaagt 4921 atgaatctat gccaaagatc acttgttgtt ttactaaaga aagattactt agaggaaata 4981 agaaaaatca tgtttgctct cccggttctt ccagtggttt gagacactgg tttacacttt 5041 atgccggatg tgcttttctc caatatcagt gctcgagaca cagtgaagca aattaaaaaa 5101 aa // LOCUS HUMPSBGA01 134 bp ds-DNA PRI 14-AUG-1990 DEFINITION Human pregnancy-specific beta-1-glycoprotein 4 (PSG4) gene, exon 1. ACCESSION M32622 KEYWORDS pregnancy-specific beta-1-glycoprotein. SEGMENT 1 of 14 SOURCE Human placenta DNA, clones cosPSG4/5-[1,2]. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (sites) AUTHORS Thompson,J., Koumari,R., Wagner,K., Barnert,S., Schleussner,C., Schrewe,H., Zimmermann,W., Mueller,G., Schempp,W., Zaninetta,D., Ammaturo,D. and Hardman,N. TITLE The human pregnancy-specific glycoprotein genes are tightly linked on the long arm of chromosome 19 and are coordinately expressed JOURNAL Biochem. Biophys. Res. Commun. 167, 848-859 (1990) STANDARD full staff_review REFERENCE 2 (bases 1 to 134; for [1]) AUTHORS Thompson,J., Koumari,R., Wagner,K., Barnert,S., Schleussner,C., Schrewe,H., Zimmermann,W., Mueller,G., Schempp,W., Zaninetta,D., Ammaturo,D. and Hardman,N. JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1],[2] kindly submitted by J.Thompson, 06-MAR-1990. FEATURES from to/span description pept 50 + 113 pregnancy-specific beta-1-glycoprotein 4 (PSG4) precursor, exon 1 sigp 50 + 113 pregnancy-specific beta-1-glycoprotein 4 signal peptide IVS 114 > 134 PSG4 intron A BASE COUNT 33 a 44 c 36 g 21 t ORIGIN Chromosome 19q13.2-13.3. 
1 ggatcctagg cttatctcca cagaggagaa cacacaagca gcagagacca tggggcccct 61 ctcagcccct ccctgcacac acctcatcac ctggaagggg gtcctgctca caggtgagga 121 gagaacttcc tggg // LOCUS HUMPSBGA02 1010 bp ds-DNA PRI 14-AUG-1990 DEFINITION Human pregnancy-specific beta-1-glycoprotein 4 (PSG4) gene, exon 2. ACCESSION M32623 M22436 D51537 KEYWORDS pregnancy-specific beta-1-glycoprotein. SEGMENT 2 of 14 SOURCE Human placenta DNA, clones cosPSG4/5-[1,2] and hsCGM4. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 1010) AUTHORS Thompson,J.A., Mauch,E.-M., Chen,F.-S., Hinoda,Y., Schrewe,H., Berling,B., Barnert,S., von Kleist,S., Shively,J.E. and Zimmermann,W. TITLE Analysis of the size of the carcinoembryonic antigen (CEA) gene family: Isolation and sequencing of N-terminal domain exons JOURNAL Biochem. Biophys. Res. Commun. 158, 996-1004 (1989) STANDARD full staff_review REFERENCE 2 (sites) AUTHORS Thompson,J., Koumari,R., Wagner,K., Barnert,S., Schleussner,C., Schrewe,H., Zimmermann,W., Mueller,G., Schempp,W., Zaninetta,D., Ammaturo,D. and Hardman,N. TITLE The human pregnancy-specific glycoprotein genes are tightly linked on the long arm of chromosome 19 and are coordinately expressed JOURNAL Biochem. Biophys. Res. Commun. 167, 848-859 (1990) STANDARD full staff_review REFERENCE 3 (bases 1 to 1010; for [2]) AUTHORS Thompson,J., Koumari,R., Wagner,K., Barnert,S., Schleussner,C., Schrewe,H., Zimmermann,W., Mueller,G., Schempp,W., Zaninetta,D., Ammaturo,D. and Hardman,N. JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by J.A.Thompson, 07-FEB-1989; for [2],[3], 06-MAR-1990. 
FEATURES from to/span description pept + 468 + 833 pregnancy-specific beta-1-glycoprotein 4 (PSG4) precursor, exon 2 sigp + 468 505 pregnancy-specific beta-1-glycoprotein 4 signal peptide matp 506 + 833 pregnancy-specific beta-1-glycoprotein 4 IVS < 1 467 PSG4 intron A IVS 834 > 1010 PSG4 intron B BASE COUNT 281 a 225 c 267 g 237 t ORIGIN Chromosome 19q13.2-13.3, about 650 bp after segment 1. 1 gagctcacac agtcatcggg gggggaagac agacatgcta agtgatctag aatgtgagat 61 caggtgttga caagaaccct ggagggagga gagcagggaa aggtcagaaa gggaagaccc 121 agggtctctg aaggaggtat caggaaagaa gtctaaggat gccctgatgt gagcaggacc 181 tgagggcagt gtggaggggg ccgtgcggac cctggggaag aggaatccaa aaagaaaaat 241 gccaaggtca gaagtgttga aggaatgggg gtcatgctgc tgatcttgac ctagtgggac 301 agtaggacac acacacatac actcacgccc ctttagtgtg tgtatgtgtt tgtatgtgtg 361 tgtttgtgtg tcttcaaggc tgaggattga agagaccttc tcaggaccca tcttttcacc 421 ccaatacata ggtctcaata ttaactgatg ctctctgtac ctcctagcat cacttttaaa 481 cttctggaat ccgcccacaa ctgcccaagt cacgattgaa gcccagccac ccaaagtttc 541 tgaggggaag gatgttcttc tacttgtcca caatttgccc cagaatcttg ctggctacat 601 ttggtacaaa gggcaaatga catacctcta ccattacatt acatcatatg tagtagacgg 661 tcaaagaatt atatatgggc ctgcatacag tggaagagaa agagtatatt ccaatgcatc 721 cctgctgatc cagaatgtca cgcaggagga tgcaggatcc tacaccttac acatcataaa 781 gcgacgcgat gggactggag gagtaactgg acatttcacc ttcaccttac accgtgagtg 841 attccacatg atccctgggt gttgggggac aggggtcact tctacttcac acacacagga 901 ttctcaggcc tggactctgc ctgtgtccct ctctgcatta agtccatgct ggggtttggg 961 catttagtgc aggacacaca gaggagacaa atttcaacag atcagaattc // LOCUS HUMPSBGA03 430 bp ds-DNA PRI 14-AUG-1990 DEFINITION Human pregnancy-specific beta-1-glycoprotein 4 (PSG4) gene, exon 3. ACCESSION M32624 KEYWORDS pregnancy-specific beta-1-glycoprotein. SEGMENT 3 of 14 SOURCE Human placenta DNA, clones cosPSG4/5-[1,2]. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. 
REFERENCE 1 (sites) AUTHORS Thompson,J., Koumari,R., Wagner,K., Barnert,S., Schleussner,C., Schrewe,H., Zimmermann,W., Mueller,G., Schempp,W., Zaninetta,D., Ammaturo,D. and Hardman,N. TITLE The human pregnancy-specific glycoprotein genes are tightly linked on the long arm of chromosome 19 and are coordinately expressed JOURNAL Biochem. Biophys. Res. Commun. 167, 848-859 (1990) STANDARD full staff_review REFERENCE 2 (bases 1 to 430; for [1]) AUTHORS Thompson,J., Koumari,R., Wagner,K., Barnert,S., Schleussner,C., Schrewe,H., Zimmermann,W., Mueller,G., Schempp,W., Zaninetta,D., Ammaturo,D. and Hardman,N. JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1],[2] kindly submitted by J.Thompson, 06-MAR-1990. FEATURES from to/span description pept + 124 + 402 pregnancy-specific beta-1-glycoprotein 4 (PSG4) precursor, exon 3 matp + 124 + 402 pregnancy-specific beta-1-glycoprotein 4 IVS < 1 123 PSG4 intron B IVS 403 > 430 PSG4 intron C BASE COUNT 108 a 117 c 103 g 102 t ORIGIN Chromosome 19q13.2-13.3, about 5500 bp after segment 2. 1 ttccatgggc ttttaaggac ttgggtgggc tgagaggtgg gaaatgccaa ctctgattga 61 aaaatgcctt tggaagaatc aaagatgcca cacagggcaa acctctctct gttttctgca 121 cagtggagac tcccaagccc tccatctcca gcagcaactt aaatcccagg gaggccatgg 181 aggctgtgat cttaacctgt gatcctgcga ctccacccgc aagctaccag tggtggatga 241 atggtcagag cctccctatg actcacaggt tgcagctgtc caaaaccaac aggaccctct 301 ttatatttgg tgtcacaaag tatattgcag gaccctatga atgtgaaata cggaacccag 361 tgagtgccag ccgcagtgac ccagtcaccc tgaatctcct ccgtgagtat cctctgttcc 421 tctgtgagcc // LOCUS HUMPSBGA04 573 bp ds-DNA PRI 14-AUG-1990 DEFINITION Human pregnancy-specific beta-1-glycoprotein 4 (PSG4) gene, exon 4. ACCESSION M32625 KEYWORDS pregnancy-specific beta-1-glycoprotein. SEGMENT 4 of 14 SOURCE Human placenta DNA, clones cosPSG4/5-[1,2]. 
ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (sites) AUTHORS Thompson,J., Koumari,R., Wagner,K., Barnert,S., Schleussner,C., Schrewe,H., Zimmermann,W., Mueller,G., Schempp,W., Zaninetta,D., Ammaturo,D. and Hardman,N. TITLE The human pregnancy-specific glycoprotein genes are tightly linked on the long arm of chromosome 19 and are coordinately expressed JOURNAL Biochem. Biophys. Res. Commun. 167, 848-859 (1990) STANDARD full staff_review REFERENCE 2 (bases 1 to 573; for [1]) AUTHORS Thompson,J., Koumari,R., Wagner,K., Barnert,S., Schleussner,C., Schrewe,H., Zimmermann,W., Mueller,G., Schempp,W., Zaninetta,D., Ammaturo,D. and Hardman,N. JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1],[2] kindly submitted by J.Thompson, 06-MAR-1990. FEATURES from to/span description pept + 181 365 pregnancy-specific beta-1-glycoprotein 4 (PSG4) precursor, exon 4 369 + 435 pregnancy-specific beta-1-glycoprotein 4 (PSG4) precursor, exon 4 matp + 181 + 435 pregnancy-specific beta-1-glycoprotein 4 IVS < 1 180 PSG4 intron C IVS 436 > 573 PSG4 intron D site 366 368 internal stop codon BASE COUNT 146 a 149 c 132 g 146 t ORIGIN Chromosome 19q13.2-13.3, about 270 bp after segment 3. 
1 agcttgtgat gggagaaaca ggtgtaaggt ctcaggctcc agatcagtga acacagcggg 61 gatttggctg ggacttcagt gttgcgactt ggctcacagg gtcactgtgg cccttccaca 121 gaccaggatt ttcccttccc tctgacaatg tcacctgtga ctttattctc tttgctccag 181 atggcctgga tgcccccacc atttcttcct catacaccta ttaccataca ggggaattcc 241 ccaatctctc ctgcctcaca gactctcacc cactggcaga gcattcttgg ctgattgatg 301 ggaagttcca gcaatcagca caagtgtgct ttatccccca aatcactaaa acatatagag 361 gggtttaggt ctgtttcatc cataactcag ccactgatgg aacaaatctc ataatcaaga 421 ggatcatagt gcctggtaag tggatccctg gagcattggc aatatgtttt ccagtgaagc 481 tatctatgct atcagggaaa agccaccgtc cctcgtcaaa gggagaggga aaatcaaaaa 541 cccagcacag ggaatatgtt tcgtctccgg ggg // LOCUS HUMPSBGA05 647 bp ds-DNA PRI 14-AUG-1990 DEFINITION Human pregnancy-specific beta-1-glycoprotein 4 (PSG4) gene, exon 5. ACCESSION M32626 KEYWORDS pregnancy-specific beta-1-glycoprotein. SEGMENT 5 of 14 SOURCE Human placenta DNA, clones cosPSG4/5-[1,2]. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (sites) AUTHORS Thompson,J., Koumari,R., Wagner,K., Barnert,S., Schleussner,C., Schrewe,H., Zimmermann,W., Mueller,G., Schempp,W., Zaninetta,D., Ammaturo,D. and Hardman,N. TITLE The human pregnancy-specific glycoprotein genes are tightly linked on the long arm of chromosome 19 and are coordinately expressed JOURNAL Biochem. Biophys. Res. Commun. 167, 848-859 (1990) STANDARD full staff_review REFERENCE 2 (bases 1 to 647; for [1]) AUTHORS Thompson,J., Koumari,R., Wagner,K., Barnert,S., Schleussner,C., Schrewe,H., Zimmermann,W., Mueller,G., Schempp,W., Zaninetta,D., Ammaturo,D. and Hardman,N. JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1],[2] kindly submitted by J.Thompson, 06-MAR-1990. 
FEATURES from to/span description pept + 158 + 436 pregnancy-specific beta-1-glycoprotein 4 (PSG4) precursor, exon 5 matp + 158 + 436 pregnancy-specific beta-1-glycoprotein 4 IVS < 1 157 PSG4 intron D IVS 437 > 647 PSG4 intron E BASE COUNT 183 a 168 c 147 g 149 t ORIGIN Chromosome 19q13.2-13.3, about 1800 bp after segment 4. 1 gatcatcatg catctatctt gtgacagaag caccagctat tggctttcaa ggactcgggt 61 gggctgagag gtgggagatg ccaactctga ttgaagtatg cctgtggagg aatcaaaggt 121 gccacacagg acaatcttct ctctgttatc cacacagcaa agctgtccaa gccctacatc 181 acaatcaaca acttaaaccc cagagagaat aaggatgtct taaccttcac ctgtgaacct 241 aagagtaaga actacaccta catttggtgg ctaaatggtc agagcctccc tgtcagtccc 301 agggtaaagc gacccattga aaacaggatc ctcattctac ccaatgtcac gagaaatgaa 361 acaggacctt atcaatgtga aatacgggac cgatatggtg gcatccgcag tgacccagtc 421 accctgaatg tcctctgtga gtatcttttt gttcctctgt gggccaggac accagcttaa 481 atccaaacca ccagaggcca ggtctctcag tctctctcca gtccaagtat agacaccttt 541 acttctggac atccgaggtg gccatgactc cctgccctgg gaaaacctag gtaggcacag 601 cttaaccaag aatataaggg gaggggacgc tcttgtcatg gagactt // LOCUS HUMPSBGA06 558 bp ds-DNA PRI 14-AUG-1990 DEFINITION Human pregnancy-specific beta-1-glycoprotein 4 (PSG4) gene, exon 6. ACCESSION M32627 KEYWORDS pregnancy-specific beta-1-glycoprotein. SEGMENT 6 of 14 SOURCE Human placenta DNA, clones cosPSG4/5-[1,2]. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (sites) AUTHORS Thompson,J., Koumari,R., Wagner,K., Barnert,S., Schleussner,C., Schrewe,H., Zimmermann,W., Mueller,G., Schempp,W., Zaninetta,D., Ammaturo,D. and Hardman,N. TITLE The human pregnancy-specific glycoprotein genes are tightly linked on the long arm of chromosome 19 and are coordinately expressed JOURNAL Biochem. Biophys. Res. Commun. 
167, 848-859 (1990) STANDARD full staff_review REFERENCE 2 (bases 1 to 558; for [1]) AUTHORS Thompson,J., Koumari,R., Wagner,K., Barnert,S., Schleussner,C., Schrewe,H., Zimmermann,W., Mueller,G., Schempp,W., Zaninetta,D., Ammaturo,D. and Hardman,N. JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1],[2] kindly submitted by J.Thompson, 06-MAR-1990. FEATURES from to/span description pept + 45 / 299 pregnancy-specific beta-1-glycoprotein 4 (PSG4) precursor, exon 6 matp + 45 / 299 pregnancy-specific beta-1-glycoprotein 4 IVS < 1 44 PSG4 intron E IVS 300 > 558 PSG4 intron F BASE COUNT 161 a 144 c 103 g 150 t ORIGIN Chromosome 19q13.2-13.3, about 200 bp after segment 5. 1 ttcctcggat gacatcacct atggctttat tctctttgcg ccagatggtc cagacctccc 61 cagcatttac ccttcattca cctattaccg ttcaggagaa aacctctact tgtcctgctt 121 cgccgagtct aacccacggg cacaatattc ttggacaatt aatgggaagt ttcagctatc 181 aggacaaaag ctctctatcc cccaaataac tacaaagcat agtgggctct atgcttgctc 241 tgttcgtaac tcagccactg gcaaggaaag ctccaaatcc atcacagtca aagtctctgg 301 taagtggatc ccagcatcct tggcaatagg gatttaggtg gagtctatct ggcattcagg 361 gaagagtgag gaaaacattt ttattcccag cctgtgtccc atgggcacaa gcaaatccca 421 aattctactc ctgaacactc ccaatttgtc taagaacttc caaaacttta acaaacaggc 481 tgatatcttc ttaaaattcc cagcctagac caagcaggga gaacattgat ttcattgaaa 541 taattgacaa tattgagg // LOCUS HUMPSBGA07 621 bp ds-DNA PRI 14-AUG-1990 DEFINITION Human pregnancy-specific beta-1-glycoprotein 5 (PSG5) gene, exon 1. ACCESSION M32628 KEYWORDS pregnancy-specific beta-1-glycoprotein. SEGMENT 7 of 14 SOURCE Human placenta DNA, clones cosPSG4/5-[1,2]. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (sites) AUTHORS Thompson,J., Koumari,R., Wagner,K., Barnert,S., Schleussner,C., Schrewe,H., Zimmermann,W., Mueller,G., Schempp,W., Zaninetta,D., Ammaturo,D. and Hardman,N. 
TITLE The human pregnancy-specific glycoprotein genes are tightly linked on the long arm of chromosome 19 and are coordinately expressed JOURNAL Biochem. Biophys. Res. Commun. 167, 848-859 (1990) STANDARD full staff_review REFERENCE 2 (bases 1 to 621; for [1]) AUTHORS Thompson,J., Koumari,R., Wagner,K., Barnert,S., Schleussner,C., Schrewe,H., Zimmermann,W., Mueller,G., Schempp,W., Zaninetta,D., Ammaturo,D. and Hardman,N. JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1],[2] kindly submitted by J.Thompson, 06-MAR-1990. FEATURES from to/span description pept 360 + 423 pregnancy-specific beta-1-glycoprotein 5 (PSG5) precursor, exon 1 sigp 360 + 423 pregnancy-specific beta-1-glycoprotein 5 signal peptide IVS 424 > 621 PSG5 intron A BASE COUNT 176 a 135 c 193 g 113 t 4 others ORIGIN Chromosome 19q13.2-13.3, about 7000 bp after segment 6. 1 tatgtgtgtg actgtgtggg tctgtgtgtg tctcttctgt gtgtgtgtgt ctgcacaaag 61 tgtggttgag gtttggtgaa agaatcactg ctgaaaaatg cagaggcctc cacaattccc 121 agggacctga aacacagaca aaaggaaaaa cagaaggagg acaaggaggc aggactgaga 181 gaggagggga cagagaggtg tcctgggcct gaccccaccc atgagcctga gaagtgctcc 241 tgccctggag agaggctcag cacagaagga ggaaggacag cacagcctac agccgtgctc 301 aggaagtttc tggatcctag gctcagctcc acagaggaga acacgcaggc gcagagacca 361 tggggcccct ctcagcccct ccctgcacac agcacatcac ctggaagggg ctcctgctca 421 caggtgagga gagaacttcc tgggagagga caggaggagg aagaagagtg actggattgg 481 gtctcctgga gaggatnnnn ttctaaaaaa taaaagaagt ccagcacttt gggaggatga 541 ggtgggtgga ttatgagatc aggagttcaa ggtcagtcct gccaacacag tgaagccctg 601 tctctactaa aaatacaaaa a // LOCUS HUMPSBGA08 173 bp ds-DNA PRI 14-AUG-1990 DEFINITION Human pregnancy-specific beta-1-glycoprotein 5 (PSG5) gene, intron A. ACCESSION M32629 KEYWORDS pregnancy-specific beta-1-glycoprotein. SEGMENT 8 of 14 SOURCE Human placenta DNA, clones cosPSG4/5-[1,2]. 
ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (sites) AUTHORS Thompson,J., Koumari,R., Wagner,K., Barnert,S., Schleussner,C., Schrewe,H., Zimmermann,W., Mueller,G., Schempp,W., Zaninetta,D., Ammaturo,D. and Hardman,N. TITLE The human pregnancy-specific glycoprotein genes are tightly linked on the long arm of chromosome 19 and are coordinately expressed JOURNAL Biochem. Biophys. Res. Commun. 167, 848-859 (1990) STANDARD full staff_review REFERENCE 2 (bases 1 to 173; for [1]) AUTHORS Thompson,J., Koumari,R., Wagner,K., Barnert,S., Schleussner,C., Schrewe,H., Zimmermann,W., Mueller,G., Schempp,W., Zaninetta,D., Ammaturo,D. and Hardman,N. JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1],[2] kindly submitted by J.Thompson, 06-MAR-1990. FEATURES from to/span description IVS < 1 > 173 pregnancy-specific beta-1-glycoprotein 5 intron A BASE COUNT 56 a 29 c 62 g 26 t ORIGIN Chromosome 19q13.2-13.3, about 560 bp after segment 7. 1 gagctcacgc tgtcatgggg aggaagacag acatgcaaag agatctagaa tgtgaggtca 61 ggtgttgaca agaaccctgg agggagcaga gcagggaaag gtcagaaagg gaagacccag 121 ggtctctgaa ggacgtgtca ggaaagaagt ctagagatgc cctgatgtga gca // LOCUS HUMPSBGA09 547 bp ds-DNA PRI 14-AUG-1990 DEFINITION Human pregnancy-specific beta-1-glycoprotein 5 (PSG5) gene, exon 2. ACCESSION M32630 KEYWORDS pregnancy-specific beta-1-glycoprotein. SEGMENT 9 of 14 SOURCE Human placenta DNA, clones cosPSG4/5-[1,2]. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (sites) AUTHORS Thompson,J., Koumari,R., Wagner,K., Barnert,S., Schleussner,C., Schrewe,H., Zimmermann,W., Mueller,G., Schempp,W., Zaninetta,D., Ammaturo,D. and Hardman,N. 
TITLE The human pregnancy-specific glycoprotein genes are tightly linked on the long arm of chromosome 19 and are coordinately expressed JOURNAL Biochem. Biophys. Res. Commun. 167, 848-859 (1990) STANDARD full staff_review REFERENCE 2 (bases 1 to 547; for [1]) AUTHORS Thompson,J., Koumari,R., Wagner,K., Barnert,S., Schleussner,C., Schrewe,H., Zimmermann,W., Mueller,G., Schempp,W., Zaninetta,D., Ammaturo,D. and Hardman,N. JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1],[2] kindly submitted by J.Thompson, 06-MAR-1990. FEATURES from to/span description pept + 9 + 374 pregnancy-specific beta-1-glycoprotein 5 (PSG5) precursor, exon 2 sigp + 9 47 pregnancy-specific beta-1-glycoprotein 5 signal peptide matp 48 + 374 pregnancy-specific beta-1-glycoprotein 5 IVS < 1 8 PSG5 intron A IVS 375 > 547 PSG5 intron B BASE COUNT 149 a 139 c 117 g 142 t ORIGIN Chromosome 19q13.2-13.3, about 300 bp after segment 8. 1 cctcctagca tcacttttaa acttctggaa cctgcctatc actgctcaag tcacgattga 61 agccctgcca cccaaagttt ccgaggggaa ggatgttctt ctacttgtcc acaatttgcc 121 tcagaatctt gctggctaca tctggtacaa aggacaactg atggacctct accattacat 181 tacatcatat gtagtagacg gtcaaataaa tatatatggg cctgcataca ctggacgaga 241 aacagtatat tccaatgcat ccctgctgat ccagaatgtc acccgggaag acgcaggatc 301 ctacacctta cacatcataa agcgaggtga taggactaga ggagtaactg gatatttcac 361 cttcaactta taccgtgagt gattccacat gatccctggg tgttggggga caggggtcat 421 ttctacttca cacacacagg attgtcaggc ctgcactgtg cctgtgtccc tctctgcatt 481 atgtcccatg ctgcggtttg ggcatttagt gcaggacaca cacagaggag acaaatttca 541 acagatc // LOCUS HUMPSBGA10 458 bp ds-DNA PRI 14-AUG-1990 DEFINITION Human pregnancy-specific beta-1-glycoprotein 5 (PSG5) gene, exon 3. ACCESSION M32631 KEYWORDS pregnancy-specific beta-1-glycoprotein. SEGMENT 10 of 14 SOURCE Human placenta DNA, clones cosPSG4/5-[1,2]. 
ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (sites) AUTHORS Thompson,J., Koumari,R., Wagner,K., Barnert,S., Schleussner,C., Schrewe,H., Zimmermann,W., Mueller,G., Schempp,W., Zaninetta,D., Ammaturo,D. and Hardman,N. TITLE The human pregnancy-specific glycoprotein genes are tightly linked on the long arm of chromosome 19 and are coordinately expressed JOURNAL Biochem. Biophys. Res. Commun. 167, 848-859 (1990) STANDARD full staff_review REFERENCE 2 (bases 1 to 458; for [1]) AUTHORS Thompson,J., Koumari,R., Wagner,K., Barnert,S., Schleussner,C., Schrewe,H., Zimmermann,W., Mueller,G., Schempp,W., Zaninetta,D., Ammaturo,D. and Hardman,N. JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1],[2] kindly submitted by J.Thompson, 06-MAR-1990. FEATURES from to/span description pept + 167 + 445 pregnancy-specific beta-1-glycoprotein 5 (PSG5) precursor, exon 3 matp + 167 + 445 pregnancy-specific beta-1-glycoprotein 5 IVS < 1 166 PSG5 intron B IVS 446 > 458 PSG5 intron C BASE COUNT 117 a 122 c 108 g 111 t ORIGIN Chromosome 19q13.2-13.3, about 5200 bp after segment 9. 1 gagctctcag attgccatgc atctgtcttg taacacacac acctgccatg ggcttttaag 61 gacttgggtg ggctgacagg tgggaaatgc caactctgat tgaaaagtgc ctttggagga 121 atcaaaggtg ccacacaggg caatcttctc tctgttttct gcaaagtgga aactcccaag 181 ccctccatct ccagtagcaa cttaaacccc agggaggcca tggaggctgt gatcttaacc 241 tgtgatcctg cggcttcaga cacaagctac ctgtggtgga tgaatggtca gagcctccct 301 atgactcaca ggttgcagct gtccaaaacc aacaggaccc tctttatatt tggtgtcaca 361 aagtatattg caggacccta tgaatgtgaa atacggaacc cagtgagtgc cagccacagt 421 gacccagtca ccctgaatct cctccgtgag tatcctct // LOCUS HUMPSBGA11 359 bp ds-DNA PRI 14-AUG-1990 DEFINITION Human pregnancy-specific beta-1-glycoprotein 5 (PSG5) gene, exon 4. ACCESSION M32632 KEYWORDS pregnancy-specific beta-1-glycoprotein. 
SEGMENT 11 of 14 SOURCE Human placenta DNA, clones cosPSG4/5-[1,2]. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (sites) AUTHORS Thompson,J., Koumari,R., Wagner,K., Barnert,S., Schleussner,C., Schrewe,H., Zimmermann,W., Mueller,G., Schempp,W., Zaninetta,D., Ammaturo,D. and Hardman,N. TITLE The human pregnancy-specific glycoprotein genes are tightly linked on the long arm of chromosome 19 and are coordinately expressed JOURNAL Biochem. Biophys. Res. Commun. 167, 848-859 (1990) STANDARD full staff_review REFERENCE 2 (bases 1 to 359; for [1]) AUTHORS Thompson,J., Koumari,R., Wagner,K., Barnert,S., Schleussner,C., Schrewe,H., Zimmermann,W., Mueller,G., Schempp,W., Zaninetta,D., Ammaturo,D. and Hardman,N. JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1],[2] kindly submitted by J.Thompson, 06-MAR-1990. FEATURES from to/span description pept + 95 267 pregnancy-specific beta-1-glycoprotein 5 (PSG5) precursor, exon 4 272 + 347 pregnancy-specific beta-1-glycoprotein 5 (PSG5) precursor, exon 4 matp + 95 + 347 pregnancy-specific beta-1-glycoprotein 5 IVS < 1 94 PSG5 intron C IVS 348 > 359 PSG5 intron D site 268 270 internal stop codon BASE COUNT 89 a 108 c 65 g 97 t ORIGIN Chromosome 19q13.2-13.3, about 300 bp after segment 10. 1 acttggctca cagggtcact gtggcccttc cacagaccag gattttccct tccctctgac 61 aatgtcacct gtgactttat tctctttgct ccagatggcc tggatgcccc caccatttct 121 tcctcataca cctattacca tacaggggaa gtccccaatc tctcctgcct cacagactct 181 cacccactgg cagagcattc ttggctgatt gatgggaagt tccagcaatc agcacaagtg 241 tgctttattc cccaaatcac caaaacatag agggatctat gtctgtttca tccataactc 301 agccactgat ggaacaaatc tcataatcaa gaggatcata gtccctggta agtggatcc // LOCUS HUMPSBGA12 381 bp ds-DNA PRI 14-AUG-1990 DEFINITION Human pregnancy-specific beta-1-glycoprotein 5 (PSG5) gene, exon 5. 
ACCESSION M32633 KEYWORDS pregnancy-specific beta-1-glycoprotein. SEGMENT 12 of 14 SOURCE Human placenta DNA, clones cosPSG4/5-[1,2]. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (sites) AUTHORS Thompson,J., Koumari,R., Wagner,K., Barnert,S., Schleussner,C., Schrewe,H., Zimmermann,W., Mueller,G., Schempp,W., Zaninetta,D., Ammaturo,D. and Hardman,N. TITLE The human pregnancy-specific glycoprotein genes are tightly linked on the long arm of chromosome 19 and are coordinately expressed JOURNAL Biochem. Biophys. Res. Commun. 167, 848-859 (1990) STANDARD full staff_review REFERENCE 2 (bases 1 to 381; for [1]) AUTHORS Thompson,J., Koumari,R., Wagner,K., Barnert,S., Schleussner,C., Schrewe,H., Zimmermann,W., Mueller,G., Schempp,W., Zaninetta,D., Ammaturo,D. and Hardman,N. JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1],[2] kindly submitted by J.Thompson, 06-MAR-1990. FEATURES from to/span description pept + 33 + 311 pregnancy-specific beta-1-glycoprotein 5 (PSG5) precursor, exon 5 matp + 33 + 311 pregnancy-specific beta-1-glycoprotein 5 IVS < 1 32 PSG5 intron D IVS 312 > 381 PSG5 intron E BASE COUNT 111 a 105 c 81 g 84 t ORIGIN Chromosome 19q13.2-13.3, about 2000 bp after segment 11. 1 acaggacaat cttctctctg ttatccacac agtgaagctg cccaagccct acatcaccat 61 caacaactca aaacccaggg agaataagga tgtcttagcc ttcacctgtg aacctaagag 121 tgagaactac acctacattt ggtggctaaa tggtcagagc ctcccggtca gtcccagggt 181 aaagcgaccc attgaaaaca ggatcctcat tctacccagt gtcacgagaa atgaaacagg 241 accctatgaa tgtgaaatac gggaccgaga tggtggcatg cgcagtgacc cagtcaccct 301 gaatgtcctc tgtgagtatc ttttgttcct ctgtgggcca ggacaccagc ttaaatctaa 361 atgaccacaa gccagtcctc t // LOCUS HUMPSBGA13 540 bp ds-DNA PRI 14-AUG-1990 DEFINITION Human pregnancy-specific beta-1-glycoprotein 5 (PSG5) gene, exon 6. 
ACCESSION M32634 KEYWORDS pregnancy-specific beta-1-glycoprotein. SEGMENT 13 of 14 SOURCE Human placenta DNA, clones cosPSG4/5-[1,2]. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (sites) AUTHORS Thompson,J., Koumari,R., Wagner,K., Barnert,S., Schleussner,C., Schrewe,H., Zimmermann,W., Mueller,G., Schempp,W., Zaninetta,D., Ammaturo,D. and Hardman,N. TITLE The human pregnancy-specific glycoprotein genes are tightly linked on the long arm of chromosome 19 and are coordinately expressed JOURNAL Biochem. Biophys. Res. Commun. 167, 848-859 (1990) STANDARD full staff_review REFERENCE 2 (bases 1 to 540; for [1]) AUTHORS Thompson,J., Koumari,R., Wagner,K., Barnert,S., Schleussner,C., Schrewe,H., Zimmermann,W., Mueller,G., Schempp,W., Zaninetta,D., Ammaturo,D. and Hardman,N. JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1],[2] kindly submitted by J.Thompson, 06-MAR-1990. FEATURES from to/span description pept + 40 + 294 pregnancy-specific beta-1-glycoprotein 5 (PSG5) precursor, exon 6 matp + 40 + 294 pregnancy-specific beta-1-glycoprotein 5 IVS < 1 39 PSG5 intron E IVS 295 > 540 PSG5 intron F BASE COUNT 132 a 142 c 108 g 158 t ORIGIN Chromosome 19q13.2-13.3, about 300 bp after segment 12. 
1 ggatgacatc acctgtggct tcattcttct ttgctccaga tggtccagac ctccccagca 61 tttacccttc attcacctat taccgttcag gagaaaacct ctacttgtcc tgcttcgcgg 121 aatctaaccc accggcagag tatttttgga caattaatgg gaagtttcag caatcaggac 181 aaaagctctc tatcccccaa attactacaa agcatagagg gctctatact tgctctgttc 241 gtaactcagc tactggcaag gaaagctcca aatccatgac agtcgaagtc tctggtaagt 301 ggatcccagc atccttggca atagggtttt aggtggagtc tacctggctt tcagagaaga 361 gttaggaaaa catttttatt cccagcctgt gtcccatggg cacaagcaaa tcccaaattc 421 tcctcctgaa cccttccaat ttgtctctac aaactctctt ctccttgttt ttctgttttc 481 tcatggctga ccttgtgtct ggcctaagaa agttagggag ggggctttat cagccctgag // LOCUS HUMPSBGA14 320 bp ds-DNA PRI 14-AUG-1990 DEFINITION Human pregnancy-specific beta-1-glycoprotein 5 (PSG5) gene, exon 7. ACCESSION M32635 KEYWORDS pregnancy-specific beta-1-glycoprotein. SEGMENT 14 of 14 SOURCE Human placenta DNA, clones cosPSG4/5-[1,2]. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (sites) AUTHORS Thompson,J., Koumari,R., Wagner,K., Barnert,S., Schleussner,C., Schrewe,H., Zimmermann,W., Mueller,G., Schempp,W., Zaninetta,D., Ammaturo,D. and Hardman,N. TITLE The human pregnancy-specific glycoprotein genes are tightly linked on the long arm of chromosome 19 and are coordinately expressed JOURNAL Biochem. Biophys. Res. Commun. 167, 848-859 (1990) STANDARD full staff_review REFERENCE 2 (bases 1 to 320; for [1]) AUTHORS Thompson,J., Koumari,R., Wagner,K., Barnert,S., Schleussner,C., Schrewe,H., Zimmermann,W., Mueller,G., Schempp,W., Zaninetta,D., Ammaturo,D. and Hardman,N. JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1],[2] kindly submitted by J.Thompson, 06-MAR-1990. 
FEATURES from to/span description pept + 202 245 pregnancy-specific beta-1-glycoprotein 5 (PSG5) precursor, exon 7 matp + 202 245 pregnancy-specific beta-1-glycoprotein 5 IVS < 1 201 PSG5 intron F IVS 286 > 320 PSG5 intron G BASE COUNT 94 a 78 c 54 g 94 t ORIGIN Chromosome 19q13.2-13.3, about 4300 bp after segment 13. 1 ggatcaccat cctagaaaca tggtgagagg gaaaaaaaaa actgcaagaa tataatcatc 61 tcccatcaat ttcccaagag gaataatgta gtacttgaag aaacaattat agagtacctc 121 atgctacatg cctcttcctg aggctccccc atgtaaaata acatcacctt cattccttct 181 ttacttttct ttccatgaca gctccttcag gaataggacg tcttcctctc cttaatccaa 241 tatagcagcc gtgaagtcat ttctgtattt caggaagact ggcaggtatg atggcctttt 301 ctcttatcct ggtcctgcag // LOCUS HUMPSBGB1 237 bp ds-DNA PRI 14-AUG-1990 DEFINITION Human pregnancy-specific beta-1-glycoprotein 7 (PSG7) gene, exon 4. ACCESSION M32617 KEYWORDS pregnancy-specific beta-1-glycoprotein. SEGMENT 1 of 5 SOURCE Human leukocyte DNA, clone lambda-PSG6/7-1. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (sites) AUTHORS Thompson,J., Koumari,R., Wagner,K., Barnert,S., Schleussner,C., Schrewe,H., Zimmermann,W., Mueller,G., Schempp,W., Zaninetta,D., Ammaturo,D. and Hardman,N. TITLE The human pregnancy-specific glycoprotein genes are tightly linked on the long arm of chromosome 19 and are coordinately expressed JOURNAL Biochem. Biophys. Res. Commun. 167, 848-859 (1990) STANDARD full staff_review REFERENCE 2 (bases 1 to 237; for [1]) AUTHORS Thompson,J., Koumari,R., Wagner,K., Barnert,S., Schleussner,C., Schrewe,H., Zimmermann,W., Mueller,G., Schempp,W., Zaninetta,D., Ammaturo,D. and Hardman,N. JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1],[2] kindly submitted by J.Thompson, 06-MAR-1990. 
FEATURES from to/span description pept < 1 + 14 pregnancy-specific beta-1-glycoprotein (PSG7), exon 4 (AA at 2) IVS 15 > 237 PSG7 intron D BASE COUNT 76 a 52 c 50 g 59 t ORIGIN Chromosome 19q13.2-13.3. 1 gatcatagtc cctggtaagt ggatcctgga gcattggcaa tatgttttcc agggaagtct 61 atctggctat cagggaagag ccacctgccc tctgcagagg agagggaaaa tcaaaactca 121 ggaaagggaa tatatttctg ctccaaaacc accaggtttt cctgtacctc actctttcta 181 gatcattctt cagactatgc gcactaacaa tgaacaatct gaaaagaaat taagaaa // LOCUS HUMPSBGB2 435 bp ds-DNA PRI 14-AUG-1990 DEFINITION Human pregnancy-specific beta-1-glycoprotein 7 (PSG7) gene, exon 5. ACCESSION M32618 KEYWORDS pregnancy-specific beta-1-glycoprotein. SEGMENT 2 of 5 SOURCE Human leukocyte DNA, clone lambda-PSG6/7-1. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (sites) AUTHORS Thompson,J., Koumari,R., Wagner,K., Barnert,S., Schleussner,C., Schrewe,H., Zimmermann,W., Mueller,G., Schempp,W., Zaninetta,D., Ammaturo,D. and Hardman,N. TITLE The human pregnancy-specific glycoprotein genes are tightly linked on the long arm of chromosome 19 and are coordinately expressed JOURNAL Biochem. Biophys. Res. Commun. 167, 848-859 (1990) STANDARD full staff_review REFERENCE 2 (bases 1 to 435; for [1]) AUTHORS Thompson,J., Koumari,R., Wagner,K., Barnert,S., Schleussner,C., Schrewe,H., Zimmermann,W., Mueller,G., Schempp,W., Zaninetta,D., Ammaturo,D. and Hardman,N. JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1],[2] kindly submitted by J.Thompson, 06-MAR-1990. FEATURES from to/span description pept + 25 + 303 pregnancy-specific beta-1-glycoprotein 7 (PSG7), exon 5 IVS < 1 24 PSG7 intron D IVS 304 > 435 PSG7 intron E BASE COUNT 120 a 123 c 87 g 105 t ORIGIN Chromosome 19q13.2-13.3; about 1800 bp after segment 1. 
1 atcttctctc tgttattcac acagtgaagc tgcctaagcc ctacatcacc atcaataact 61 taaaccccag ggagaataag gatgtctcaa ccttcacctg tgaacctaag agtgagaact 121 acacctacat ttggtggcta aatggtcaga gcctcccggt cagtcccagg gtaaagcgac 181 gcattgaaaa caggatcctc attctaccca gtgtcacgag aaatgaaaca ggaccctatc 241 aatgtgaaat acgggaccga tatggtggca tccgcagtga cccagtcacc ctgaatgtcc 301 tctgtgagta tccttgagta tcctttcttt cctctgtggg ccaggacaca ccagcttaac 361 tccaaacgac cagaggccag tcctctcagt ctctctccgg tccaagtata gacatattta 421 cttctggaca tccga // LOCUS HUMPSBGB3 274 bp ds-DNA PRI 14-AUG-1990 DEFINITION Human pregnancy-specific beta-1-glycoprotein 7 (PSG7) gene, exon 6. ACCESSION M32619 KEYWORDS pregnancy-specific beta-1-glycoprotein. SEGMENT 3 of 5 SOURCE Human leukocyte DNA, clone lambda-PSG6/7-1. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (sites) AUTHORS Thompson,J., Koumari,R., Wagner,K., Bamert,S., Schleussner,C., Schrewe,H., Zimmermann,W., Mueller,G., Schempp,W., Zaninetta,D., Ammaturo,D. and Hardman,N. TITLE The human pregnancy-specific glycoprotein genes are tightly linked on the long arm of chromosome 19 and are coordinately expressed JOURNAL Biochem. Biophys. Res. Commun. 167, 848-859 (1990) STANDARD full staff_review REFERENCE 2 (bases 1 to 274; for [1]) AUTHORS Thompson,J., Koumari,R., Wagner,K., Bamert,S., Schleussner,C., Schrewe,H., Zimmermann,W., Mueller,G., Schempp,W., Zaninetta,D., Ammaturo,D. and Hardman,N. JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1],[2] kindly submitted by J.Thompson, 06-MAR-1990. FEATURES from to/span description pept + 8 / 262 pregnancy-specific beta-1-glycoprotein 7 (PSG7), exon 6 IVS < 1 7 PSG7 intron E IVS 263 > 274 PSG7 intron F BASE COUNT 72 a 78 c 52 g 72 t ORIGIN Chromosome 19q13.2-13.3; about 300 bp after segment 2. 
1 cctccagatg gtccagacct ccccagaatt tacccttcat tcacctatta ccattcagga 61 gaaaacctct acttgtcctg ttttgcggac tctaacccac cggcacagta ttcttggaca 121 attaatggga agtttcagct atcaggacaa aagctttcta tcccccagat tactacaaag 181 catagcgggc tctatgcttg ctctgttcgt aactcagcca ctggcaagga aagctccaaa 241 tccgtgacag tcagagtctc tggtaagtgg atcc // LOCUS HUMPSBGB4 206 bp ds-DNA PRI 14-AUG-1990 DEFINITION Human pregnancy-specific beta-1-glycoprotein 6 (PSG6) gene, exon 1. ACCESSION M32620 KEYWORDS pregnancy-specific beta-1-glycoprotein. SEGMENT 4 of 5 SOURCE Human leukocyte DNA, clone lambda-PSG6/7-1. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (sites) AUTHORS Thompson,J., Koumari,R., Wagner,K., Bamert,S., Schleussner,C., Schrewe,H., Zimmermann,W., Mueller,G., Schempp,W., Zaninetta,D., Ammaturo,D. and Hardman,N. TITLE The human pregnancy-specific glycoprotein genes are tightly linked on the long arm of chromosome 19 and are coordinately expressed JOURNAL Biochem. Biophys. Res. Commun. 167, 848-859 (1990) STANDARD full staff_review REFERENCE 2 (bases 1 to 206; for [1]) AUTHORS Thompson,J., Koumari,R., Wagner,K., Bamert,S., Schleussner,C., Schrewe,H., Zimmermann,W., Mueller,G., Schempp,W., Zaninetta,D., Ammaturo,D. and Hardman,N. JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1],[2] kindly submitted by J.Thompson, 06-MAR-1990. FEATURES from to/span description pept 50 + 113 pregnancy-specific beta-1-glycoprotein 6 (PSG6) precursor, exon 1 sigp 50 + 113 pregnancy-specific beta-1-glycoprotein 6 signal peptide IVS 114 > 206 PSG6 intron A BASE COUNT 59 a 50 c 66 g 31 t ORIGIN Chromosome 19q13.2-13.3; about 7800 bp after segment 3. 
1 ggatcctagg ctcatctcca caggggagaa cacacagaca gcagagacca tgggacccct 61 ctcagcccct ccctgcactc agcacatcac ctggaagggg ctcctgctca caggtgagga 121 gaggacttcc tgggagagga caggaggagg aagcagagtg actggatggg gtatctggag 181 aggatggggt tctaaaaaat aaaaga // LOCUS HUMPSBGB5 1026 bp ds-DNA PRI 14-AUG-1990 DEFINITION Human pregnancy-specific beta-1-glycoprotein 6 (PSG6) gene, exon 2. ACCESSION M32621 M22435 D51537 KEYWORDS pregnancy-specific beta-1-glycoprotein. SEGMENT 5 of 5 SOURCE Human leukocyte DNA, clones lambda-PSG6/7-1 and hsCGM3. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 1026) AUTHORS Thompson,J.A., Mauch,E.-M., Chen,F.-S., Hinoda,Y., Schrewe,H., Berling,B., Barnert,S., von Kleist,S., Shively,J.E. and Zimmermann,W. TITLE Analysis of the size of the carcinoembryonic antigen (CEA) gene family: Isolation and sequencing of N-terminal domain exons JOURNAL Biochem. Biophys. Res. Commun. 158, 996-1004 (1989) STANDARD full staff_review REFERENCE 2 (sites) AUTHORS Thompson,J., Koumari,R., Wagner,K., Bamert,S., Schleussner,C., Schrewe,H., Zimmermann,W., Mueller,G., Schempp,W., Zaninetta,D., Ammaturo,D. and Hardman,N. TITLE The human pregnancy-specific glycoprotein genes are tightly linked on the long arm of chromosome 19 and are coordinately expressed JOURNAL Biochem. Biophys. Res. Commun. 167, 848-859 (1990) STANDARD full staff_review REFERENCE 3 (bases 439 to 846; for [2]) AUTHORS Thompson,J., Koumari,R., Wagner,K., Bamert,S., Schleussner,C., Schrewe,H., Zimmermann,W., Mueller,G., Schempp,W., Zaninetta,D., Ammaturo,D. and Hardman,N. JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by J.A.Thompson, 07-FEB-1989; for [2],[3], 06-MAR-1990. 
This gene was named human carcinoembryonic antigen (hsCGM3) in [1], but [2] says it is actually one of the pregnancy-specific beta-1-glycoproteins. FEATURES from to/span description pept + 484 / 846 pregnancy-specific beta-1-glycoprotein 6 (PSG6) precursor, exon 2 sigp + 484 521 pregnancy-specific beta-1-glycoprotein 6 signal peptide matp 522 / 846 pregnancy-specific beta-1-glycoprotein 6 IVS < 1 483 PSG6 intron A IVS 847 > 1026 PSG6 intron B BASE COUNT 291 a 235 c 264 g 236 t ORIGIN Chromosome 19q13.2-13.3; about 1200 bp after segment 4. 1 gagctcacac tctcatgggg aggaagacag acatgcaaag agatatagaa tgtgaggtca 61 ggtgttgaca agaaccctag agggagcaga gcagggaaag gtcagaaagg gaagacccag 121 ggtctctgaa gcaggcatca ggaaagaagt ctaaggatgc cctgatgtga gcaggacctg 181 agggcagtgt ggagggggcc gtgcggaccc ctggggaaga ggattgcaaa cagaaaaatg 241 ccaaggtcag gagtgttgaa ggaatggggg tcatgctgct gaccttgacc tagtaggaca 301 gtaggacaca cacacataca cacacacaaa cacacatgcc cttttgtgtg tgtgtgtttg 361 tatgtgtgtg tgtgcatatc ttcaaggctg atgattgaag agaccttctc aggacacagg 421 gccccatctt ttcaccccaa tacataggtc caaatattaa ctgatgctgt ctctacctcc 481 tagcatcact tttaaacttc tggaacctgc ccaccactgc ccaagtaata attgaagcca 541 agccacccaa agtttccgag gggaaggatg ttcttctact tgtccacaat ttgccccaga 601 atcttactgg ctacatctgg tacaaagggc aaatgacgga cctctaccat tacattacat 661 catatgtagt acacggtcaa attatatatg ggcctgccta cagtggacga gaaacagtat 721 attccaatgc atccctgctg atccagaatg tcacacagga ggatgcagga tcctacacct 781 tacacatcat aaagcgaggc gatgggactg gaggagtaac tggatatttc actgtcacct 841 tatactgtga gtgattccgc atgatccctg ggtgttgggg ggcaggggtc atttctactt 901 cacacacaca gaattgtcag gcctggactc tgcctgtgtc actctctgca ttatgtccca 961 tgctggggtt tgggcattta gtgcaggaca cacacagagg agacacattt caacagatca 1021 gaattc // LOCUS MSQRTR1A 349 bp ds-DNA INV 14-AUG-1990 DEFINITION A.gambiae T1 retroposon, clone 16bs. ACCESSION M35448 KEYWORDS retroposon; retrotransposon. SOURCE A.gambiae (colony G3) DNA, clone 16bs. 
ORGANISM Anopheles gambiae Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta; Pterygota; Neoptera; Holometabola; Diptera; Nematocera; Culicoidea; Culicidae; Anophelelinae. REFERENCE 1 (bases 1 to 349) AUTHORS Besansky,N.J. TITLE Evolution of the T1 retroposon family in the Anopheles gambiae complex JOURNAL Mol. Biol. Evol. 7, 229-246 (1990) STANDARD simple staff_review FEATURES from to/span description trns 1 349 T1 retroposon BASE COUNT 76 a 78 c 64 g 131 t ORIGIN 1 gtgttttgtc catccgtgac cttggtatta tactcgacag tcgtcttaac tttaaactgc 61 agcttgatga ggttctacta aaagctaatc gatctcttgg gtttatttta cgttttacct 121 ctatttttag agatcaaagc ttcttaagaa acctttattg tgctctggta aggcctcttc 181 ttgaatatgc tagcatcatc tggaatcctc ctactattga tggctgttcg agaattgaaa 241 gcattcagcg cctttttacc agggttgctt ttcgtcgttt gttcggtgct gcctcactac 301 ctccctatga aacgcgattg cagttactca atcttcactc tttaagctt // LOCUS MSQRTR1B 348 bp ds-DNA INV 14-AUG-1990 DEFINITION A.gambiae T1 retroposon, clone 916. ACCESSION M35449 KEYWORDS retroposon; retrotransposon. SOURCE A.gambiae (colony G3) DNA, clone 916. ORGANISM Anopheles gambiae Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta; Pterygota; Neoptera; Holometabola; Diptera; Nematocera; Culicoidea; Culicidae; Anophelelinae. REFERENCE 1 (bases 1 to 348) AUTHORS Besansky,N.J. TITLE Evolution of the T1 retroposon family in the Anopheles gambiae complex JOURNAL Mol. Biol. Evol. 
7, 229-246 (1990) STANDARD simple staff_review FEATURES from to/span description trns 1 348 T1 retroposon BASE COUNT 79 a 75 c 63 g 131 t ORIGIN 1 gtgttttgtc catccgtgac cttggtatat actcgacagt cgtcttaact ttaaactgca 61 gcttgatgag gttctactaa aagctaatcg aactcttggg tttattttac gttttacctc 121 tatttttaga gattaaagct tcttaagaaa cctttattat gctctggtaa ggcctcttct 181 tgaatatgct agcatcatct ggaatcctcc tactattgat ggctgttcga gaattgaaag 241 cattcagcgc ctttttacca gggttgcttt tcgtcgtttg ttcggtgctg cctcactacc 301 tccctatgaa acgagattgc agttattcaa tcttcactct ttaagctt // LOCUS MSQRTR1C 349 bp ds-DNA INV 14-AUG-1990 DEFINITION A.gambiae T1 retroposon, clone 52a. ACCESSION M35450 KEYWORDS retroposon; retrotransposon. SOURCE A.gambiae (colony G3) DNA, clone 52a. ORGANISM Anopheles gambiae Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta; Pterygota; Neoptera; Holometabola; Diptera; Nematocera; Culicoidea; Culicidae; Anophelelinae. REFERENCE 1 (bases 1 to 349) AUTHORS Besansky,N.J. TITLE Evolution of the T1 retroposon family in the Anopheles gambiae complex JOURNAL Mol. Biol. Evol. 7, 229-246 (1990) STANDARD simple staff_review FEATURES from to/span description trns 1 349 T1 retroposon BASE COUNT 78 a 76 c 64 g 131 t ORIGIN 1 gtgttttgtc catccgtgac cttggtatta tactcgacag tcgtcttaac tttaaactgc 61 agcttgatga ggttctacta aaagctaatc gaactcttgg gtttatttta cgttttacct 121 ctatttttag agatcaaagc ttcttaagaa acctttattg tgctctggta aggcccattc 181 ttgaatatgc tagtatcatc tggaatcctc ctactattga tggctgttcg agaattgaaa 241 gcattcagcg cctttttacc agggttgctt ttcgtcgttt gttcggtgct gccctactac 301 ctccctatga aacgcgattg cagttattca atcttcactc tttaagctt // LOCUS MSQRTR1D 349 bp ds-DNA INV 14-AUG-1990 DEFINITION A.gambiae T1 retroposon, clone L34. ACCESSION M35451 KEYWORDS retroposon; retrotransposon. SOURCE A.gambiae (colony G3) DNA, clone L34. 
ORGANISM Anopheles gambiae Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta; Pterygota; Neoptera; Holometabola; Diptera; Nematocera; Culicoidea; Culicidae; Anophelelinae. REFERENCE 1 (bases 1 to 349) AUTHORS Besansky,N.J. TITLE Evolution of the T1 retroposon family in the Anopheles gambiae complex JOURNAL Mol. Biol. Evol. 7, 229-246 (1990) STANDARD simple staff_review FEATURES from to/span description trns 1 349 T1 retroposon BASE COUNT 77 a 77 c 64 g 131 t ORIGIN 1 gtgttttgtc catccgtgac cttggtatta tactcgacag tcgtcttaac tttaaactgc 61 agcttgatga ggttctacta aaagctaatc gaactcttgg gtttatttta cgttttacct 121 ctatttttag agatcaaagc ttcttaagaa acctttattg tgctctggta aggcctcttc 181 ttgaatatgc tagcatcatc tggaatcctc ctactattga tggctgttcg agaattgaaa 241 gcattcagcg cctttttacc agggttgctt ttcgtcgttt gttcggtgct gccctactac 301 ctccctatga aacgcgattg cagttattca atcttcactc tttaagctt // LOCUS MSQRTR1E 349 bp ds-DNA INV 14-AUG-1990 DEFINITION A.gambiae T1 retroposon, clone 914a. ACCESSION M35452 KEYWORDS retroposon; retrotransposon. SOURCE A.gambiae (colony G3) DNA, clone 914a. ORGANISM Anopheles gambiae Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta; Pterygota; Neoptera; Holometabola; Diptera; Nematocera; Culicoidea; Culicidae; Anophelelinae. REFERENCE 1 (bases 1 to 349) AUTHORS Besansky,N.J. TITLE Evolution of the T1 retroposon family in the Anopheles gambiae complex JOURNAL Mol. Biol. Evol. 
7, 229-246 (1990) STANDARD simple staff_review FEATURES from to/span description trns 1 349 T1 retroposon BASE COUNT 77 a 76 c 65 g 131 t ORIGIN 1 gtgttttgtc catccgtgac cttggtatta tactcgacag tcgtcttaac tttaaactgc 61 agcttgatga ggttctacta aaagctaatc gaactcttgg gtttatttta cgttttacct 121 ctatttttag agatcaaagc ttcttaagaa acctttattg tgctgtggta aggcctcttc 181 ttgaatatgc tagcatcatc tggaatcctc ctactattga tggctgttcg agaattgaaa 241 gcattcagcg cctttttacc agggttgctt ttcgtcgttt gttcggtgct gcctcactac 301 ctccctatga aacgcgattg cagttattca atcttcactc tttaagctt // LOCUS MSQRTR1F 343 bp ds-DNA INV 14-AUG-1990 DEFINITION A.gambiae T1 retroposon, clone 13bs. ACCESSION M35453 KEYWORDS retroposon; retrotransposon. SOURCE A.gambiae (colony G3) DNA, clone 13bs. ORGANISM Anopheles gambiae Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta; Pterygota; Neoptera; Holometabola; Diptera; Nematocera; Culicoidea; Culicidae; Anophelelinae. REFERENCE 1 (bases 1 to 343) AUTHORS Besansky,N.J. TITLE Evolution of the T1 retroposon family in the Anopheles gambiae complex JOURNAL Mol. Biol. Evol. 7, 229-246 (1990) STANDARD simple staff_review FEATURES from to/span description trns 1 343 T1 retroposon BASE COUNT 78 a 73 c 60 g 132 t ORIGIN 1 gtgttatgtc catccgtgac cttggtatta taatttacag tcgtcttaac tttaaactgc 61 agcttgatga gattctacta aaagctaatc gaactcttgg gtttatttta cgttttacct 121 ctatttttag agatcaaagc ttcttaagaa acctttattg tgctctggta aggcctcttc 181 ttgaatatgc tagtatcatc tggaattgtc ctactattga tggctgttcg agaattgaat 241 gccttcagcg cctctttacc aggattgctt ttcgtcgttt gttcggtgct gcctcactac 301 ctccctatga aacgcagtta ttcaatcttc actctttaag ctt // LOCUS MSQRTR1G 349 bp ds-DNA INV 14-AUG-1990 DEFINITION A.gambiae T1 retroposon, clone 12bs. ACCESSION M35454 KEYWORDS retroposon; retrotransposon. SOURCE A.gambiae (colony G3) DNA, clone 12bs. 
ORGANISM Anopheles gambiae Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta; Pterygota; Neoptera; Holometabola; Diptera; Nematocera; Culicoidea; Culicidae; Anophelelinae. REFERENCE 1 (bases 1 to 349) AUTHORS Besansky,N.J. TITLE Evolution of the T1 retroposon family in the Anopheles gambiae complex JOURNAL Mol. Biol. Evol. 7, 229-246 (1990) STANDARD simple staff_review FEATURES from to/span description trns 1 349 T1 retroposon BASE COUNT 78 a 79 c 62 g 130 t ORIGIN 1 gtgttttgtc catccgtgac cttggtatta tactcgacag tcgtcttaac tttaaactgc 61 agcttgatga ggttctacta aaagctaatc gaactcttgg gtttatttta cgttttacct 121 ctatttttag agatcaaagc ttcttaagaa acccttattg tgctctggta aggcctcttc 181 ttgaatatgc tagcatcatc tggaatcctc ctactattga tggctgttcg agaattgaaa 241 gcattcagcg cctttttacc agggttgctt ttcatcgttt tttcggtgct gcctcactac 301 ctccctatga aacgcgattg cagttactca atcttcactc tttaagctt // LOCUS MSQRTR1H 349 bp ds-DNA INV 14-AUG-1990 DEFINITION A.gambiae T1 retroposon, clone 9bs. ACCESSION M35456 KEYWORDS retroposon; retrotransposon. SOURCE A.gambiae (colony G3) DNA, clone 9bs. ORGANISM Anopheles gambiae Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta; Pterygota; Neoptera; Holometabola; Diptera; Nematocera; Culicoidea; Culicidae; Anophelelinae. REFERENCE 1 (bases 1 to 349) AUTHORS Besansky,N.J. TITLE Evolution of the T1 retroposon family in the Anopheles gambiae complex JOURNAL Mol. Biol. Evol. 
7, 229-246 (1990) STANDARD simple staff_review FEATURES from to/span description trns 1 349 T1 retroposon BASE COUNT 77 a 77 c 64 g 131 t ORIGIN 1 gtgttttgtc catccgtgac cttggtatta tactcgacag tcgtcttaac tttaaactgc 61 agcttgatga ggttctacta aaagctaatc gaactcttgg gtttatttta cgttttacct 121 ctatttttag agatcaaagc ttcttaagaa acctttattg tgctctggta aggcctcttc 181 ttgaatatgc tagcatcatc tggaatcctc ctactattga tggctgttcg agaattgaaa 241 gcattcagcg cctttttacc agggttgctt ttcgtcgttt gttcggtgct gcctcactac 301 ctccctatga aacgcgattg cagttattca atcttcactc tttaagctt // LOCUS MSQRTR1I 349 bp ds-DNA INV 14-AUG-1990 DEFINITION A.gambiae T1 retroposon, clone 6bs. ACCESSION M35457 KEYWORDS retroposon; retrotransposon. SOURCE A.gambiae (colony G3) DNA, clone 6bs. ORGANISM Anopheles gambiae Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta; Pterygota; Neoptera; Holometabola; Diptera; Nematocera; Culicoidea; Culicidae; Anophelelinae. REFERENCE 1 (bases 1 to 349) AUTHORS Besansky,N.J. TITLE Evolution of the T1 retroposon family in the Anopheles gambiae complex JOURNAL Mol. Biol. Evol. 7, 229-246 (1990) STANDARD simple staff_review FEATURES from to/span description trns 1 349 T1 retroposon BASE COUNT 78 a 76 c 64 g 131 t ORIGIN 1 gtgttatgtc catccgtgac cttggtatta tactcgacag tcgtcttaac tttaaactgc 61 agcttgatga ggttctacta aaagctaatc gaactcttgg gtttatttta cgttttacct 121 ctatttttag agatcaaagc ttcttaagaa acctttattg tgctctggta aggcctcttc 181 ttgaatatgc tagtatcatc tggaatcctc ctactattga tggctgttcg agaattgaaa 241 gcattcagcg cctttttacc agggttgctt ttcgtcgttt gttcggtgct gcctcactac 301 ctccctatga aacgcgattg cagttattca atcttcactc tttaagctt // LOCUS MSQRTR1J 349 bp ds-DNA INV 14-AUG-1990 DEFINITION A.gambiae T1 retroposon, clone 4bs. ACCESSION M35461 KEYWORDS retroposon; retrotransposon. SOURCE A.gambiae (colony G3) DNA, clone 4bs. 
ORGANISM Anopheles gambiae Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta; Pterygota; Neoptera; Holometabola; Diptera; Nematocera; Culicoidea; Culicidae; Anophelelinae. REFERENCE 1 (bases 1 to 349) AUTHORS Besansky,N.J. TITLE Evolution of the T1 retroposon family in the Anopheles gambiae complex JOURNAL Mol. Biol. Evol. 7, 229-246 (1990) STANDARD simple staff_review FEATURES from to/span description trns 1 349 T1 retroposon BASE COUNT 76 a 77 c 64 g 132 t ORIGIN 1 gtgttttgtc catccgtgac cttggtatta tactcgacag tcgtcttaac tttaaactgc 61 agcttgatga ggttctacta aaagctaatc gatctcttgg gtttatttta cgttttacct 121 ctatttttag agatcaaagc ttcttaagaa acctttattg tgctctggta aggcctcttc 181 ttgaatatgc tagcatcatc tggaatcctc ctactattga tggctgttcg agaattgaaa 241 gcattcagcg cctttttacc agggttgctt ttcgtcgttt gttcggtgct gcctcactac 301 ctccctatga aacgcgattg cagttattca atcttcactc tttaagctt // LOCUS MSQRTR1K 349 bp ds-DNA INV 14-AUG-1990 DEFINITION A.gambiae T1 retroposon, clone 1bs. ACCESSION M35462 KEYWORDS retroposon; retrotransposon. SOURCE A.gambiae (colony G3) DNA, clone 1bs. ORGANISM Anopheles gambiae Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta; Pterygota; Neoptera; Holometabola; Diptera; Nematocera; Culicoidea; Culicidae; Anophelelinae. REFERENCE 1 (bases 1 to 349) AUTHORS Besansky,N.J. TITLE Evolution of the T1 retroposon family in the Anopheles gambiae complex JOURNAL Mol. Biol. Evol. 
7, 229-246 (1990) STANDARD simple staff_review FEATURES from to/span description trns 1 349 T1 retroposon BASE COUNT 78 a 80 c 64 g 127 t ORIGIN 1 gtgttgtgtc catccgtgac cttggtataa tactcgacag tcgtcttaac tttaaactgc 61 agcttgatga ggtgctacta aaagctaatc gaacccttgg gtttatttta cgttttacct 121 ctatttttag agatcaaagc ttcctaagca acctttattg tgctttggta aggcctattc 181 ttgaatatgc tagtatcctc tggaatcctc ctactattga ttgctgttcg agaattgaaa 241 gcattcagcg cctctttacc aggattgctt ttcgtcgttt gttcggtgct gcctcactac 301 ctccctatga aacgcgattg cagttattca atcttcactc cttaagctt // LOCUS MSQRTR1L 349 bp ds-DNA INV 14-AUG-1990 DEFINITION A.gambiae T1 retroposon, clone 5bs. ACCESSION M35463 KEYWORDS retroposon; retrotransposon. SOURCE A.gambiae (colony G3) DNA, clone 5bs. ORGANISM Anopheles gambiae Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta; Pterygota; Neoptera; Holometabola; Diptera; Nematocera; Culicoidea; Culicidae; Anophelelinae. REFERENCE 1 (bases 1 to 349) AUTHORS Besansky,N.J. TITLE Evolution of the T1 retroposon family in the Anopheles gambiae complex JOURNAL Mol. Biol. Evol. 7, 229-246 (1990) STANDARD simple staff_review FEATURES from to/span description trns 1 349 T1 retroposon BASE COUNT 81 a 80 c 64 g 124 t ORIGIN 1 gtgttatgtc catccgtgac cttggtatta tacttgactg tcgtcttaac tttaaactgc 61 agcttgatga ggtgctacta aaagcaaatc gaacccttgg gtttatctta cgttttacct 121 ctatttttag agatcaaagc atcctaagaa tcctttattg tgctttggta aggcctattc 181 ttgaatatgc aagtatcatc tggaatcctc ccactattga tggctgttcg agaattgaaa 241 gcattcagcg cctctttacc aggattgcct ttcgtcgttt gttcggtgct gcctcactac 301 ctccctatga aacgcgattg cagttattca atcttcactc cttaagctt // LOCUS MSQRTR1M 349 bp ds-DNA INV 14-AUG-1990 DEFINITION A.gambiae T1 retroposon, clone 3bs. ACCESSION M35464 KEYWORDS retroposon; retrotransposon. SOURCE A.gambiae (colony G3) DNA, clone 3bs. 
ORGANISM Anopheles gambiae Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta; Pterygota; Neoptera; Holometabola; Diptera; Nematocera; Culicoidea; Culicidae; Anophelelinae. REFERENCE 1 (bases 1 to 349) AUTHORS Besansky,N.J. TITLE Evolution of the T1 retroposon family in the Anopheles gambiae complex JOURNAL Mol. Biol. Evol. 7, 229-246 (1990) STANDARD simple staff_review FEATURES from to/span description trns 1 349 T1 retroposon BASE COUNT 80 a 80 c 65 g 124 t ORIGIN 1 gtgttatgtc catccgtgac cttggtatta tacttgactg tcgtcttaac tttaaactgc 61 agcttgatga ggtgctacta aaagcaaatc gaacccttgg gtttatctta cgttttacct 121 ctatttttag agatcaaagc atcctaagaa tcctttattg tgctttggta aggcctattc 181 ttgaatatgc aagtatcatc tggagtcctc ccactattga tggctgttcg agaattgaaa 241 gcattcagcg cctctttacc aggattgcct ttcgtcgttt gttcggtgct gcctcactac 301 ctccctatga aacgcgattg cagttattca atcttcactc cttaagctt // LOCUS MSQRTR1N 349 bp ds-DNA INV 14-AUG-1990 DEFINITION A.gambiae T1 retroposon, clone 11bs. ACCESSION M35465 KEYWORDS retroposon; retrotransposon. SOURCE A.gambiae (colony G3) DNA, clone 11bs. ORGANISM Anopheles gambiae Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta; Pterygota; Neoptera; Holometabola; Diptera; Nematocera; Culicoidea; Culicidae; Anophelelinae. REFERENCE 1 (bases 1 to 349) AUTHORS Besansky,N.J. TITLE Evolution of the T1 retroposon family in the Anopheles gambiae complex JOURNAL Mol. Biol. Evol. 
7, 229-246 (1990) STANDARD simple staff_review FEATURES from to/span description trns 1 349 T1 retroposon BASE COUNT 82 a 80 c 64 g 123 t ORIGIN 1 gtgttatgtc catccgtgac cttggtatta tacttgactg tcgtcttaac tttaaactgc 61 agcttgatga ggtgctacta aaagcaaatc gaacccttgg gtttatctta cgttttacca 121 ctatttttag agatcaaagc atcctaagaa tcctttattg tgctttggta aggcctattc 181 ttgaatatgc aagtatcatc tggaatcctc ccactattga tggctgttcg agaattgaaa 241 gcattcagcg cctctttacc aggattgcct ttcgtcgttt gttcggtgct gcctcactac 301 ctccctatga aacgcgattg cagttattca atcttcactc cttaagctt // LOCUS MSQRTR1O 349 bp ds-DNA INV 14-AUG-1990 DEFINITION A.gambiae T1 retroposon, clone 14bs. ACCESSION M35466 KEYWORDS retroposon; retrotransposon. SOURCE A.gambiae (colony G3) DNA, clone 14bs. ORGANISM Anopheles gambiae Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta; Pterygota; Neoptera; Holometabola; Diptera; Nematocera; Culicoidea; Culicidae; Anophelelinae. REFERENCE 1 (bases 1 to 349) AUTHORS Besansky,N.J. TITLE Evolution of the T1 retroposon family in the Anopheles gambiae complex JOURNAL Mol. Biol. Evol. 7, 229-246 (1990) STANDARD simple staff_review FEATURES from to/span description trns 1 349 T1 retroposon BASE COUNT 81 a 82 c 64 g 122 t ORIGIN 1 gtgttatgtc catccgtgac cttggtatta tacttgactg tcgtcttaac tttaaactgc 61 agcttgatga ggtgctacta aaagcaaacc gaacccttgg gtttatctta cgttttacct 121 ctatttttag agatcaaagc atcctaagaa tcctttattg tgctttggta aggcctattc 181 ttgaatatgc aagtatcatc tggaatcctc ccactattga tggctgttcg agaattgaaa 241 gcattcagcg cctctttacc aggattgcct ttcgtcgtct gttcggtgct gcctcactac 301 ctccctatga aacgcgattg cagttattca atcttcactc cttaagctt // LOCUS MSQRTR1P 349 bp ds-DNA INV 14-AUG-1990 DEFINITION A.gambiae T1 retroposon, clone 31b. ACCESSION M35467 KEYWORDS retroposon; retrotransposon. SOURCE A.gambiae (colony G3) DNA, clone 31b. 
ORGANISM Anopheles gambiae Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta; Pterygota; Neoptera; Holometabola; Diptera; Nematocera; Culicoidea; Culicidae; Anophelelinae. REFERENCE 1 (bases 1 to 349) AUTHORS Besansky,N.J. TITLE Evolution of the T1 retroposon family in the Anopheles gambiae complex JOURNAL Mol. Biol. Evol. 7, 229-246 (1990) STANDARD simple staff_review FEATURES from to/span description trns 1 349 T1 retroposon BASE COUNT 80 a 80 c 65 g 124 t ORIGIN 1 gtgttatgtc catccgtgac cttggtatta tacttgactg tcgtcttaac tttaaactgc 61 agcttgatga ggtgctgcta aaagcaaatc gaacccttgg gtttatctta cgttttacct 121 ctatttttag agatcaaagc atcctaagaa tcctttattg tgctttggta aggcctattc 181 ttgaatatgc aagtatcatc tggaatcctc ccactattga tggctgttcg agaattgaaa 241 gcattcagcg cctctttacc aggattgcct ttcgtcgttt gttcggtgct gcctcactac 301 ctccctatga aacgcgattg cagttattca atcttcactc cttaagctt // LOCUS MSQRTR1Q 349 bp ds-DNA INV 14-AUG-1990 DEFINITION A.gambiae T1 retroposon, clone 15bs. ACCESSION M35468 KEYWORDS retroposon; retrotransposon. SOURCE A.gambiae (colony G3) DNA, clone 15bs. ORGANISM Anopheles gambiae Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta; Pterygota; Neoptera; Holometabola; Diptera; Nematocera; Culicoidea; Culicidae; Anophelelinae. REFERENCE 1 (bases 1 to 349) AUTHORS Besansky,N.J. TITLE Evolution of the T1 retroposon family in the Anopheles gambiae complex JOURNAL Mol. Biol. Evol. 
7, 229-246 (1990) STANDARD simple staff_review FEATURES from to/span description trns 1 349 T1 retroposon BASE COUNT 81 a 80 c 64 g 124 t ORIGIN 1 gtgttatgtc catccgtgac cttggtatta tacttgactg tcgtcttaac tttaaactgc 61 agcttgatga ggtgctacta aaagcaaatc gaacccttgg gtttatctta cgttttacct 121 ctatttttag agatcaaagc atcctaagaa tcctttattg tgctttggta aggcctattc 181 ttgaatatgc aagtatcatc tggaatcctc ccactattga tggctgttcg agaattgaaa 241 gcattcagcg cctctttacc aggattgcct ttcgtcgttt gttcggtgct gcctcactac 301 ctccctatga aacgcgattg cagttattca atcttcactc cttaagctt // LOCUS MSQRTR1R 349 bp ds-DNA INV 14-AUG-1990 DEFINITION A.gambiae T1 retroposon, clone 17bs. ACCESSION M35469 KEYWORDS retroposon; retrotransposon. SOURCE A.gambiae (colony G3) DNA, clone 17bs. ORGANISM Anopheles gambiae Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta; Pterygota; Neoptera; Holometabola; Diptera; Nematocera; Culicoidea; Culicidae; Anophelelinae. REFERENCE 1 (bases 1 to 349) AUTHORS Besansky,N.J. TITLE Evolution of the T1 retroposon family in the Anopheles gambiae complex JOURNAL Mol. Biol. Evol. 7, 229-246 (1990) STANDARD simple staff_review FEATURES from to/span description trns 1 349 T1 retroposon BASE COUNT 82 a 79 c 64 g 124 t ORIGIN 1 gtgttatgtc catccgtgac cttggtatta tacttgactg tcgacttaac tttaaactgc 61 agcttgatga ggtgctacta aaagcaaatc gaacccttgg gtttatctta cgttttacct 121 ctatttttag agattaaagc atcctaagaa tcctttattg tgctttggta aggcctattc 181 ttgaatatgc aagtatcatc tggaatcctc ccactattga tggctgttcg agaattgaaa 241 gcattcagcg cctctttacc aggattgcct ttcgtcgttt gttcggtgct gcctcactac 301 ctccctatga aacgcgattg cagttattca atcttcactc cttaagctt // LOCUS MSQRTR1S 349 bp ds-DNA INV 14-AUG-1990 DEFINITION A.gambiae T1 retroposon, clone 18bs. ACCESSION M35470 KEYWORDS retroposon; retrotransposon. SOURCE A.gambiae (colony G3) DNA, clone 18bs. 
ORGANISM Anopheles gambiae Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta; Pterygota; Neoptera; Holometabola; Diptera; Nematocera; Culicoidea; Culicidae; Anophelelinae. REFERENCE 1 (bases 1 to 349) AUTHORS Besansky,N.J. TITLE Evolution of the T1 retroposon family in the Anopheles gambiae complex JOURNAL Mol. Biol. Evol. 7, 229-246 (1990) STANDARD simple staff_review FEATURES from to/span description trns 1 349 T1 retroposon BASE COUNT 82 a 81 c 64 g 122 t ORIGIN 1 gtgttatgtc catccgtgac cttggtatta tacttgactg tcgtcttaac tttaaactgc 61 agcttgatga ggtgctacta aaagcaaatc gaacccttgg gtttatctta cgttttacct 121 ctatttttag agatcaaagc atcctaagaa tcctttattg tgctttggta aggcctattc 181 ttgaatatgc aagtatcatc tggaaccctc ccactattga tggctgttcg agaattgaaa 241 gcattcagcg cctctttacc aggattgcct ttcgtcgttt gttcggtgct gcctcactac 301 ctccctatga aacgcgatag cagttattca atcttcactc cttaagctt // LOCUS MSQRTR1T 349 bp ds-DNA INV 14-AUG-1990 DEFINITION A.gambiae T1 retroposon, clone 19bs. ACCESSION M35471 KEYWORDS retroposon; retrotransposon. SOURCE A.gambiae (colony G3) DNA, clone 19bs. ORGANISM Anopheles gambiae Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta; Pterygota; Neoptera; Holometabola; Diptera; Nematocera; Culicoidea; Culicidae; Anophelelinae. REFERENCE 1 (bases 1 to 349) AUTHORS Besansky,N.J. TITLE Evolution of the T1 retroposon family in the Anopheles gambiae complex JOURNAL Mol. Biol. Evol. 
7, 229-246 (1990) STANDARD simple staff_review FEATURES from to/span description trns 1 349 T1 retroposon BASE COUNT 81 a 82 c 64 g 122 t ORIGIN 1 gtgttatgtc catccgtgac cttggtatta tacttgactg tcgtcttaac tttaaactgc 61 agcttgatga ggtgctacta aaagcaaacc gaacccttgg gtttatctta cgttttacct 121 ctatttttag agatcaaagc atcctaagaa tcctttattg tgctttggta aggcctattc 181 ttgaatatgc aagtatcatc tggaatcctc ccactattga tggctgttcg agaattgaaa 241 gcattcagcg cctctttacc aggattgcct ttcgtcgtct gttcggtgct gcctcactac 301 ctccctatga aacgcgattg cagttattca atcttcactc cttaagctt // LOCUS MSQRTR1U 349 bp ds-DNA INV 14-AUG-1990 DEFINITION A.gambiae T1 retroposon, clone 28bs. ACCESSION M35472 KEYWORDS retroposon; retrotransposon. SOURCE A.gambiae (colony G3) DNA, clone 28bs. ORGANISM Anopheles gambiae Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta; Pterygota; Neoptera; Holometabola; Diptera; Nematocera; Culicoidea; Culicidae; Anophelelinae. REFERENCE 1 (bases 1 to 349) AUTHORS Besansky,N.J. TITLE Evolution of the T1 retroposon family in the Anopheles gambiae complex JOURNAL Mol. Biol. Evol. 7, 229-246 (1990) STANDARD simple staff_review FEATURES from to/span description trns 1 349 T1 retroposon BASE COUNT 79 a 80 c 64 g 126 t ORIGIN 1 gtgttatgtc catccgtgac cttggtatta tacttgactg tcgtcttaac tttaaactgc 61 agcttgatgt ggtgctacta aaagcaaatc gaacccttgg gtttatctta cgttttacct 121 ctatttttag agatcaaagc atcctaagaa tcctttattg tgctttggta aggcctattc 181 ttgaatatgc tagtatcatc tggaatcctc ccactattga tggctgttcg agaattgaaa 241 gcattcagcg cctctttacc aggattgcct ttcgtcgttt gttcggtgct gcctcactac 301 ctccctatga aacgcgattg cagttattca atcttcactc cttaagctt // LOCUS SHRMTRRDA 1136 bp ss-RNA RNA 14-AUG-1990 DEFINITION Brine shrimp mitochondrion 16S ribosomal RNA precursor. ACCESSION X12965 M21833 KEYWORDS 16S ribosomal RNA; ribosomal RNA. SOURCE Brine shrimp mitochondrion, cDNA to RNA, clones pArLSU-[1,2]. 
ORGANISM Mitochondrion Artemia salina Eukaryota; Animalia; Metazoa; Arthropoda; Crustacea; Branchiopoda; Sarsostraca; Anostraca; Artemiidae; Artemia salina. REFERENCE 1 (sites) AUTHORS Sastre,L. TITLE ; JOURNAL Unpublished (1988) see COMMENT for author address STANDARD simple automatic REFERENCE 2 (bases 1 to 1136; enum. 1 to 1136) AUTHORS Sastre,L. TITLE ; JOURNAL Unpublished (1988) see COMMENT for author address STANDARD simple automatic REFERENCE 3 (bases 518 to 1129) AUTHORS Palmero,I., Renart,J. and Sastre,L. TITLE Isolation of cDNA clones coding for mitochondrial 16S ribosomal RNA from the crustacean Artemia JOURNAL Gene 68, 239-248 (1988) STANDARD full staff_review REFERENCE 4 (bases 1 to 1136; revises [1],[2],[3]) AUTHORS Sastre,L. JOURNAL Unpublished (1989) STANDARD full staff_review COMMENT Author address: Sastre L., Instituto de Investigaciones Biomedicas del CSIC, Av Arzobispo Morcillo 4, 28029 Madrid, Spain. Submitted (22-SEP-1988) on tape to the EMBL data library. [1] Submitted (17-NOV-1988) to the EMBL data library. [2] Draft entry and computer-readable sequence for [3] kindly provided by L.Sastre, 01-MAR-1989; for [4], 12-MAY-1989. 
FEATURES from to/span description rRNA 1 1136 16S ribosomal RNA revision 548 550 cct in [2]; ct in [1] revision 555 557 act in [2]; at in [1] revision 778 780 gtc in [2]; gc in [1] revision 906 908 tta in [2]; ta in [1] revision 981 982 tc in [2]; ttc in [1] revision 1003 1007 cagcc in [2]; cc in [1] revision 1111 1112 ta in [4]; tca in [2] BASE COUNT 382 a 196 c 218 g 340 t ORIGIN 1 atttaaaaca atattatatt agtaaaagaa tcgaatttat tcaaggtcta agtactgaaa 61 aggaaagtct actaattaca aagaagttat tgtttgtacc ttttgtatca gggtttatta 121 aatttcaatc tttattgctg aatcccgaaa caaggtgatc tatattggtt aaagatgatt 181 gtttcataaa tctcttcaag accatatcgt tgcgaaaata tttcgtacct ttatatctgg 241 ttggcagaat actcctttag ggtactccgc aaggaagatt ttctagggga cgagctctag 301 aaaaaattat attataatta tcaagaaaaa catgactagg cttaaaagtt gcctcggttt 361 aataaccatt gttagtaaaa tgataatttc aaaataatta attaagactg gccgatgcta 421 attacataaa tttagcaatc acaatgatag aattagtaga actataaatg cacttaacac 481 aatattcctc acaaaatccc aaaggaattc ggcaaaactc atctccgcct gtttaacaaa 541 gacatcgcct cttaactttg ggaggtcgag cctgcccact gatgaattaa agggccgtgg 601 tatactgacc atgcgaaggt agcataatca ttagcctttt gatttgaggc tggaatgaat 661 ggtttgacga gagatggtct gtctcttcga ttaaattgaa gttaatcttt aagtgaaaaa 721 gcttaaatgt acttggaggg cgataagacc ctatagatct ttacatttaa ttcttttgtc 781 ttgcggtagg taattagaca gagtaaaaca atgttcggtt ggggcgacgg taagaacaga 841 ataaacactt acaacataaa cacatcaata aatgaccatt gatccttaga tgaataaaga 901 ccaagttacc ttagggataa cagcgtaatt ctttttgaga gttcaaatcg acaaaagagt 961 ttgcgagcct cgatgttggt tcagggaccc tactcggtgc agcagcctag agaggcagtc 1021 tgttcgactg ttaaacccct acgtgatctg agttcagacc gacgtaagcc aggttagttt 1081 ctacccccaa ggcaaatcca ctcatgatag tacgaaagga cctcttgaga ttttat // LOCUS RATI145TR 9852 bp ss-mRNA ROD 14-AUG-1990 DEFINITION Rat inositol-1,4,5-triphosphate receptor mRNA, complete cds. ACCESSION J05510 KEYWORDS inositol-1,4,5-triphosphate receptor. SOURCE Rat adult brain, cDNA to mRNA, clones pI16, pI17, pI19, and pI25. 
ORGANISM Rattus rattus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 9852) AUTHORS Mignery,G.A., Newton,C.L., Archer,B.T.III. and Suedhof,T.C. TITLE Structure and expression of the rat inositol-1,4,5-triphosphate receptor JOURNAL J. Biol. Chem. 265, 12679-12685 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by T.C.Suedhof, 19-APR-1990. FEATURES from to/span description pept 330 8579 inositol-1,4,5-triphosphate receptor (alt., clone pI16) pept 330 1280 inositol-1,4,5-triphosphate receptor, exon 1 (alt., clones pI17, pI19, and pI25) 1326 8579 inositol-1,4,5-triphosphate receptor, exon 2 (alt., clones pI17, pI19, and pI25) IVS 1281 1325 inositol-1,4,5-triphosphate receptor intron A (no splice consensus) signal 9033 9038 poly-A signal signal 9832 9837 poly-A signal variant 752 752 a in pI16; g in pI25 variant 802 802 g in pI16; a in pI17 variant 1923 1923 g in pI16; a in pI17 variant 2181 2181 g in pI16; a in pI17 variant 3340 3340 g in pI16; a in pI17 BASE COUNT 2688 a 2324 c 2498 g 2342 t ORIGIN 1 tggggtcact tccttcacac aggactttgg cggaaatctc tacacccgtg tctcgtcatc 61 cccttcgaga gcattaacta tttaaaacca ggagagaaag cgcacgccga gaggaggtgt 121 gggtgttccg ctcccatcct aacggaacga gctccctctt cgcggacatg ggattgccca 181 gcggctgcta acccctgtcc tggtcctgat ccctcaaacc ggcgtggctc cccggtcacc 241 aaggagctga ctacaaagga tttgcatcct tggctgggag tccattggct acagagtgcc 301 tgacctgggt caggcttttc aacaaggaca tgtctgacaa aatgtctagt ttcctacata 361 tcggagacat ttgttctctg tatgcagagg gatctacgaa tggatttatc agcaccttgg 421 gcttggttga tgaccgttgc gttgtacagc cagaagctgg ggaccttaac aatccaccca 481 agaaattcag agactgcctc tttaagctat gtcctatgaa tcgatattct gcacagaagc 541 agttctggaa agctgctaag cccggggcca acagtacaac agatgcggtg ctgctcaaca 601 aattgcatca tgctgcagac ttggaaaaga agcagaatga gacagaaaac aggaaattgc 661 tggggacggt catccagtat ggcaacgtga tccagctcct acatttgaaa 
agcaataaat 721 acttaactgt gaataagagg cttcctgccc tactggagaa gaacgccatg agagtgacgt 781 tggatgaggc tggaaatgaa gggtcttggt tttacattca gccgttctac aagctccgct 841 ccatcggaga cagtgtggtc ataggtgaca aggtagtttt gaaccctgtc aatgctgggc 901 agccgctaca tgccagcagc catcagctgg tggataaccc gggctgcaat gaggtcaact 961 ccgtcaactg taacacaagc tggaaaatag tgcttttcat gaaatggagt gataacaaag 1021 atgacattct aaaaggaggt gatgtggtaa gactcttcca tgctgagcaa gagaagtttc 1081 tcacgtgcga tgagcacagg aagaagcagc acgtcttcct gaggacaacc ggcagacagt 1141 cagccacgtc ggccaccagc tccaaagccc tgtgggaagt ggaggtagtc cagcatgacc 1201 catgtcgggg tggagctggg tactggaata gcctcttccg gttcaagcac ctggctacag 1261 ggcattactt ggcagcagag gtagaccctg actttgagga agaatgcctg gagtttcagc 1321 cctcagtgga ccctgatcag gacgcatctc ggagcaggtt gagaaatgcc caagaaaaaa 1381 tggtgtactc tctggtctct gtgcctgagg gcaacgacat ctcctccatc tttgagctag 1441 accccacaac tctacgtgga ggtgacagcc ttgtcccaag gaactcctat gtccggctca 1501 gacacctgtg tacaaacacc tgggttcaca gcacaaacat ccccattgac aaggaagagg 1561 aaaaacctgt catgctgaag attggtacct ctcccctgaa ggaggacaag gaagcttttg 1621 ccatagtccc cgtttctcct gctgaggttc gggacctgga ctttgccaat gatgccagca 1681 aggtgctggg ctccatcgct gggaagttgg aaaagggcac catcacccag aatgagagaa 1741 ggtctgtcac caagctgctg gaagacttgg tttactttgt cacgggtgga acgaactctg 1801 gccaagacgt gcttgaagta gtcttctcta agcccaaccg agagcggcag aagctgatga 1861 gggaacagaa tattctcaag cagatcttca agcttttgca ggcccccttc acggactgcg 1921 gggatggccc aatgcttcgg ctggaggagc tgggggacca gcggcacgct cctttcagac 1981 acatttgccg actctgctac agggtcctga gacactcaca gcaagactac aggaagaacc 2041 aggagtacat agccaagcag tttggcttca tgcagaagca gattggctat gatgtgctgg 2101 ccgaagacac catcactgcc ctgctccaca acaaccgaaa gctcctggaa aagcacatca 2161 ccgcagcaga gattgacacg tttgtcagcc tggtgcgaaa gaacagggag cccaggttct 2221 tggattacct ctctgacctc tgtgtatcca tgaataagtc aatccctgtg acccaggagc 2281 tcatttgtaa agctgtgctg aatcccacca atgctgacat cctgattgag accaagttgg 2341 ttctttctcg ttttgagttt gaaggcgttt ccaccggaga gaatgctctg gaagctgggg 2401 
aagatgagga agaggtgtgg ctgttctgga gggacagcaa caaagagatc cgtagcaaga 2461 gtgttaggga attggcacaa gatgctaaag agggacagaa ggaagacagg gatgtcctca 2521 gctactacag gtatcagctg aacctctttg caaggatgtg tctggaccgc cagtacctgg 2581 ccatcaatga aatatccggg cagctggatg ttgacctcat cctccgctgc atgtccgatg 2641 agaacctccc ctatgacctc agagcatcct tttgccgcct catgcttcac atgcatgtgg 2701 accgagatcc ccaggagcag gtgacccctg tgaaatacgc ccgtctgtgg tcagaaattc 2761 cctccgagat cgccattgac gactatgaca gcagtggagc atcgaaggat gaaatcaagg 2821 agaggtttgc tcagacgatg gagtttgtgg aggagtacct gagagatgtg gtttgtcaaa 2881 gattcccctt ttctgataag gagaaaaata agctcacatt tgaggttgtg aacttagcca 2941 ggaacctcat atactttggc ttctacaact tttctgacct tctccgatta accaagatcc 3001 tcttggcaat cttagactgt gttcatgtga caaccatctt ccccattagc aagatgacga 3061 aaggcgaaga gaataaaggc agcaacgtga tgagatctat ccatggagtt ggggagctca 3121 tgacccaggt ggtgcttcgg ggaggaggct ttttgcccat gactcctatg gctgctgccc 3181 ctgaaggaaa tgtgaagcag gcagagccag agaaagagga tatcatggtc atggacacca 3241 agttaaagat cattgaaata ctccagttta ttttgaatgt gagattggat tataggatct 3301 cctgcctcct gtgtatattt aagcgggagt ttgatgaaag caattcccag tcatcagaaa 3361 catcctccgg aaacagcagc caagaagggc caagtaatgt accaggtgct cttgactttg 3421 aacacattga agagcaagca gaaggcatct ttgggggaag tgaggagaac acacctctgg 3481 acttggatga tcatggtggc agaaccttcc tcagggtcct gctccacttg acaatgcatg 3541 attacccacc cctggtgtct ggggccctgc agctcctctt ccggcatttc agccagaggc 3601 aggaggtcct tcaggccttc aaacaggttc aactgctggt tactagccaa gatgtggaca 3661 actacaaaca gatcaagcaa gacttggacc agctaagatc catcgtggag aagtctgagc 3721 tctgggtata caaaggccag ggtcccgatg agcctatgga cggagcctct ggtgaaaacg 3781 aacataagaa aactgaggag gggaccagta agccactgaa gcacgaaagc accagcagct 3841 acaactaccg agtggtgaaa gagattctga ttcgacttag caagctctgt gttcaagaga 3901 gcgcgtcggt gaggaagagc aggaagcagc agcagcgact gctgaggaac atgggtgccc 3961 atgctgtggt gctggagctg ctgcagatcc cctatgagaa ggccgaagat accaagatgc 4021 aggagatcat gaggctggct cacgaatttc tgcagaattt ctgcgcaggc aaccagcaga 4081 accaagcttt 
gctacataaa cacataaacc tgtttctcaa cccagggatc ctggaagcag 4141 tgaccatgca gcacatcttc atgaacaact tccagctgtg cagtgagatc aacgagagag 4201 tggtccagca tttcgttcac tgcatagaga cccacggtcg aaatgtccag tatatcaagt 4261 ttctccagac tattgtcaag gccgaaggga aattcattaa aaagtgccaa gacatggtca 4321 tggctgagct tgtcaattct ggagaggacg tcctcgtgtt ctacaatgac agagcctctt 4381 tccagactct gatccagatg atgcggtccg agcgtgaccg gatggatgag aacagccctc 4441 tcttcatgta ccacatccat ctggtggagc tcttggctgt gtgcacagag ggcaagaatg 4501 tgtacacaga gatcaagtgc aactccctgc tcccgctcga tgacatcgtt cgtgtggtca 4561 ctcatgaaga ctgtatcccc gaggttaaga ttgcttacat caacttcctg aatcactgct 4621 atgtggacac tgaggtggag atgaaggaga tttacacaag taatcacatg tggaagttgt 4681 ttgagaattt cctcgtggac atctgcaggg cctgtaacaa cacaagtgac aggaagcatg 4741 cagactcggt tctggagaag tacgtcactg aaatcgtgat gagcattgtc accaccttct 4801 tcagctctcc cttctcagac cagagtacca ctctgcagac tcgccagcct gtctttgtgc 4861 agctcctgca aggcgtgttc cgagtttacc actgcaactg gttgatgccg agccagaaag 4921 cctcggtgga gagctgtatc cgcgtgctct ctgacgtagc caagagccgg gccatagcca 4981 ttcctgttga cctagacagc caagtcaaca acctcttcct gaagtcccac aacattgtgc 5041 agaaaacagc catgaactgg cggttatcag cccgcaacgc tgctcgtaga gactctgtcc 5101 tggcagcttc cagagactac cgaaatatta ttgagaggtt acaggacatc gtgtctgccc 5161 tggaggaccg actcaggccc ctggtccagg ccgagctgtc tgtgctcgtg gatgttctac 5221 acagaccgga actgcttttc cctgagaaca cggatgccag gaggaaatgc gaaagtggag 5281 gtttcatctg caagctcata aaacacacca agcagctgct agaggagaat gaagagaaac 5341 tgtgcattaa agtcttacag accctcaggg aaatgatgac caaagataga ggctatggag 5401 aaaagcaaat ttccattgat gaattggaaa atgccgagct gccacagcca ccggaagctg 5461 agaactccac agaggagctt gaaccaagtc cacccctgag gcaactggaa gaccataaaa 5521 ggggtgaggc actccggcaa attttggtca accgttacta tggaaacatc agaccttcag 5581 gaagaagaga gagccttacc agctttggca atggtccact atcaccagga ggacccagca 5641 agcctggggg aggaggggga ggtcctgggt ccggctccac aagccggggt gagatgagcc 5701 tggctgaggt tcaatgtcat ctcgacaagg aaggcgcctc caatctggtc atcgacctca 5761 taatgaatgc atccagcgac 
cgagtattcc acgaaagcat tctcctggcc atcgcacttc 5821 tggaaggagg caacaccact atccagcact catttttctg ccggctgacg gaagataaga 5881 aatcagagaa gttcttcaag gtcttttacg atcgaatgaa ggtggcccag caggaaatca 5941 aggcgacagt gaccgtgaac accagcgatt tgggaaacaa aaagaaagat gatgaagtgg 6001 accgggatgc cccatcccga aagaaagcca aagagcccac aacacagata acagaagagg 6061 tccgggatca gctcttggaa gcatctgctg ccaccagaaa agcctttacc accttccgga 6121 gggaggccga ccctgatgac cattaccaat ctggggaggg cacccaggct acaacggaca 6181 aagccaagga tgacctagag atgagtgcag tcatcaccat catgcagcct atcctgcgct 6241 tcctgcagct cctgtgtgaa aaccacaacc gcgatctgca gaatttcctt cgttgccaaa 6301 ataataagac caactacaac ttggtgtgtg agacactgca gtttctggac tgtatctgtg 6361 ggagcacaac tggaggcctt ggtcttctcg gactgtacat aaatgaaaag aatgtagctc 6421 ttatcaacca aaccctggag agtctgactg agtactgtca agggccttgc catgaaaacc 6481 agaactgcat cgccacccac gagtccaatg gcatcgatat catcacagcc ctcatcctta 6541 acgatatcaa ccctctgggg aagaaacgga tggacctggt gttagaactg aagaacaatg 6601 cttcaaagct actcctggcc atcatggaaa gcagacacga tagtgaaaat gcagagagga 6661 tcctgtacaa catgaggccc aaggagctgg tggaagtgat caagaaggcc tacatgcaag 6721 gcgaagtgga gtttgaggat ggggagaacg gtgaggatgg agcagcctcc cccaggaacg 6781 tgggccataa catctacatc ctcgctcacc agttggctcg gcataacaaa gaacttcaaa 6841 ccatgctgaa acctggaggc caggtggatg gcgatgaagc tctggagttc tatgcgaagc 6901 acacagcaca aattgagatt gtcagactgg accgaactat ggaacagatt gtcttccctg 6961 tgcccagcat ctgtgaattt ctgactaagg aatcgaaact tcgaatatat tacaccacag 7021 agagggatga gcaaggcagc aagatcaatg acttcttcct gcgctccgag gacctcttta 7081 atgagatgaa ctggcagaag aaacttcgag cccagcctgt cttgtactgg tgcgcccgaa 7141 acatgtcttt ctggagcagc atctccttca acctggccgt cctgatgaac ctgctggtgg 7201 cgtttttcta cccatttaaa ggagtgagag gaggaacact agagccgcac tggtccggcc 7261 tcctgtggac agccatgctc atctctctgg caattgtcat tgcgctgccc aagccccatg 7321 gcatccgggc cttaattgct tctacaatcc tacgactgat attttcagtt ggattacagc 7381 ccacactgtt tctgctggga gctttcaatg tctgcaataa aatcatcttc ctgatgagct 7441 ttgtgggcaa ctgtgggacc ttcaccagag 
gctaccgagc catggttctg gacgttgagt 7501 tcctctatca tttgctgtat ctactgatct gtgccatggg cctcttcgta catgagtttt 7561 tctatagttt gctgcttttt gatttagtgt acagagagga gactttgctt aatgtcatta 7621 aaagtgtcac ccgcaatgga cggcccatca tcttgacagc ggccctggct ctgattctgg 7681 tttacctgtt ctcaattgtg ggctacctgt tcttcaagga tgactttatc ttggaagtag 7741 acaggttgcc caatgagaca gctggtccag aaactggtga gagtttggcc aacgatttcc 7801 tgtactctga tgtgtgccgg gtagagacgg gggagaactg cacctctccg gcacccaaag 7861 aagagctgct ccctgtggaa gaaacggaac aggataagga gcacacgtgt gagacactgc 7921 tcatgtgcat cgtcaccgtt ctgagtcacg ggctgcggag tgggggcgga gtaggagatg 7981 tgctcaggaa gccatccaaa gaggagcctc tgtttgctgc aagggtgatc tatgacctcc 8041 tcttcttctt catggtcatc atcattgtcc tgaacctcat ttttggggtc atcatcgaca 8101 cctttgctga cctgaggagt gagaagcaga agaaggaaga gatcttaaaa accacgtgct 8161 tcatttgtgg cttggaaagg gacaagtttg acaataagac cgtcaccttt gaagagcaca 8221 ttaaggaaga acacaacatg tggcactatc tgtgcttcat cgtgctggtg aaagtgaagg 8281 actccaccga gtacaccggg cccgagagct acgtggcaga gatgatcagg gaaagaaacc 8341 ttgattggtt ccccaggatg agagccatgt ccctggtcag cagtgactct gaaggggaac 8401 agaacgagct gaggaacctg caggagaagc tggagtccac catgaagctg gtcaccaatc 8461 tttctggcca gctgtcagaa ctaaaggacc agatgacaga acaaaggaag cagaaacaaa 8521 gaattggcct tctaggacat cctcctcaca tgaatgtcaa cccacagcag ccagcctaag 8581 caaatgagac ggagggactg tactcaaccc tttttaaatt attgtcagcg tgggtctggc 8641 tcttggttct gatctgccct ctaaggttac atgtgcgctt agtacatttg taaatactcg 8701 gttttgtatt gtatgtatat gattgcaatt ctcagaggtt tggactttcg tattgtaatt 8761 agctctgttg gcatggtgac ttgtcactcc tgccaaaaat attaaaaatg ccttttttgg 8821 aaggactaca gaaagtacct gatttgcact tgaaccagat tatagattta aaaagtattt 8881 gacatgtatt ttgttattta aaactagaat agccagtatt tatgtttttt ataaaactgt 8941 gcaatacaaa ttatgcaatc accataactc tgtaactcct gagtgtctta agggagtgcg 9001 catctttgaa gctgatctgt taatacttat gtaataaatg gttaaatatc aaatgatgat 9061 gctgctgcca aaattatatt aatagcaagt ttctggcccc tgggcaattt tgtaccttgt 9121 aattatccta tggtgatgct gtttctcgtt gctaatggca 
ttagtgcctc gtatcctagt 9181 gataactcca ggtctgtgaa ccattcaaac aacattcatt tttgagacaa agcaacttta 9241 gtttcaaggg taattttaag cttcaaaatt aatcatttaa agtatttctt taagagagcc 9301 atgttagagg ctcacacgtt agcttgaaag gagtcgatga aataattttt ttaaagggaa 9361 ctttttacac gacgtttgga ataacagcat attgctgacc agtcagtgtc atctcccagg 9421 tgaattttga tgtcacgtta tagtcaaatg agctagctta tggtttctag attctcttca 9481 tctgaaccaa catgatgcag taggtgagaa gttattatgc atatatacac atatacatcc 9541 atataaggca aagtaggagc tgtcccctta ggatgcatag ctgccccctt agggtgcaga 9601 gctcaacact gacagtgctg ttctgagaga gccacgtttg ggttttattt ctttgtcaca 9661 tgatttcttt tctggatgtg tgcaagtata acaggaagtg ttttctacct gttgcttgtt 9721 ttgtacctgg gtctcgtttt actagaccat ctctgcacaa aagtttaaaa aatgaaccgt 9781 aagcagagtt ctgaagcaag tcaagtttgt aagtgcatac ctgaaattat ttaataaatg 9841 acgcagaata ct //