Path: utzoo!attcan!uunet!ns-mx!iowasp.physics.uiowa.edu!maverick.ksu.ksu.edu!zaphod.mps.ohio-state.edu!brutus.cs.uiuc.edu!apple!bionet!root
From: GenBank-Updates@genbank.bio.net
Newsgroups: bionet.molbio.genbank.updates
Subject: Database Update
Message-ID:
Date: 3 Jul 90 12:00:24 GMT
Sender: root@genbank.BIO.NET
Distribution: bionet
Lines: 5511
Approved: lear@genbank.bio.net
Checksum: 18905 333

LOCUS       DROANNIX     1104 bp ss-mRNA             INV       03-JUL-1990
DEFINITION  D.melanogaster annexin IX mRNA, 3' end.
ACCESSION   M34068 J05501
KEYWORDS    annexin IX.
SOURCE      D.melanogaster adult head, cDNA to mRNA, clone pD3-6.
  ORGANISM  Drosophila melanogaster
            Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta;
            Pterygota; Neoptera; Holometabola; Diptera; Brachycera;
            Cyclorrhapha; Schizophora; Drosophiloidea; Drosophilidae.
REFERENCE   1  (bases 1 to 1104)
  AUTHORS   Johnston,P.A., Perin,M.S., Reynolds,G.A., Wasserman,S.A. and
            Suedhof,T.C.
  TITLE     Two novel annexins from Drosophila melanogaster: Cloning,
            characterization and differential expression in development
  JOURNAL   J. Biol. Chem. (1990) In press
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable [or printed] sequence for [1]
            kindly submitted by T.C.Suedhof, 04-MAY-1990.
FEATURES from to/span description pept < 1 891 annexin IX (AA at 1) signal 1091 1096 poly-A signal BASE COUNT 281 a 300 c 306 g 217 t ORIGIN 1 attctgcgca aggcgatgaa gggcttcggc accgacgaga aggccatcat cgagatcctg 61 gccaggcgtg gcatcgtcca gcgtttggag atcgctgagg cgttcaagac ctcgtacggc 121 aaggatctga tctcggacct caagtccgag ctgggcggca agttcgagga tgttatcctg 181 gctctgatga cgccgctgcc ccagttctat gcccaggagc tgcacgacgc catctcggga 241 ctgggaaccg acgaggaggc catcatcgag atcctctgca cgctgtccaa ctacggcatt 301 aagaccattg cccagttcta cgagcagagc ttcggcaagt ccctagagtc cgacctaaag 361 ggcgacacca gtggccactt caagcggctg tgcgtctcgc tcgtccaggg caaccgggat 421 gagaaccagg gcgtggacga ggccgcggcc atcgccgatg cccaggctct gcacgacgcc 481 ggtgagggac agtggggcac agatgagtcc accttcaact cgatcctgat cacccgctcc 541 taccagcagc tgcgccagat cttcctcgaa tacgagaatc tgtcgggcaa cgacatcgag 601 aaggccatca agcgggagtt tagcggctcc gtggagaagg gtttcctggc catcgtcaag 661 tgctgcaagt ccaagatcga ctacttttcg gagcgcctgc acgactccat ggccggcttg 721 ggcaccaagg acaagacgct gatccgcatc atcgtcagcc ggtcggagat cgatctgggt 781 gacatcaagg aggcattcca gaacaagtac ggcaagagct tggagtcctg gatcaaggag 841 gatgccgaga ccgatattgg atacgtcctg gtcactctta cggcttggta gacggaagca 901 gccggaatat ccgaatatct atgagcaata ccccactgtt caagtagaaa atgccaaaaa 961 aaaaaacgtt gcatttcccc aaaaaaaagt ataacaaaag cgaagaacaa atggagttgg 1021 tctatataca gtagttgtga tgtgttctaa aaatccaatc tacaaaacgc ttagtatttt 1081 ccctctgtgc aataatcgga attc // LOCUS DROANNX 1192 bp ss-mRNA INV 03-JUL-1990 DEFINITION D.melanogaster annexin X mRNA, complete cds. ACCESSION M34069 J05501 KEYWORDS annexin X. SOURCE D.melanogaster adult head, cDNA to mRNA, clone pD3-16. ORGANISM Drosophila melanogaster Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta; Pterygota; Neoptera; Holometabola; Diptera; Brachycera; Cyclorrhapha; Schizophora; Drosophiloidea; Drosophilidae. REFERENCE 1 (bases 1 to 1192) AUTHORS Johnston,P.A., Perin,M.S., Reynolds,G.A., Wasserman,S.A. and Suedhof,T.C. 
TITLE Two novel annexins from Drosophila melanogaster: Cloning, characterization and differential expression in development JOURNAL J. Biol. Chem. (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable [or printed] sequence for [1] kindly submitted by T.C.Suedhof , 04-MAY-1990. FEATURES from to/span description pept 91 1056 annexin X signal 1175 1184 poly-A signal BASE COUNT 271 a 348 c 363 g 210 t ORIGIN Chromosome 93B or 19A-4,7. 1 gaattccaaa agtcccagga gaaagactga ttcgtgtgaa gtcgtctact gaagagccac 61 aaggaaccca aggaatcttc cagctgcata atggaataca aacccgtgcc cacggttaag 121 gacgcagctc ccttcgacgc ctcccaggac gcccaggtgc tgcgggcggc gatgaaggga 181 ttcggcaccg acgagcagga aatcatcgac gtgctcgtcg gcaggagcaa ccagcagagg 241 cagacgatca aggcggttta cgaagcggag ttcgagcgcg acctggtgga cgatcttaag 301 gacgagctgg gaggcaagtt cgaggacgtg atcgtgggtc taatgatgcc accagtggag 361 tacctgtgca agcaactgca cgccgccatg gcgggcatcg gaaccgagga ggccacgctc 421 gtcgagatcc tgtgcaccaa gaccaacgag gagatggccc agatcgtggc cgtctacgag 481 gagcgctacc agcgcccgct ggccgagcag atgtgcagcg agacctccgg ctttttccgc 541 cgcctgctca cgctgatcgt gaccggagta cgtgacggac tggacacgcc cgtcgacgtc 601 ggtcaggcca aggagcaggc cgcccagctc tactcggccg gcgaggccaa gctgggaacg 661 gacgaggagg tcttcaaccg gatcatgtcg cacgccagct tcccgcagct gcgacttgtc 721 ttcgaggagt acaaggtgct ctccgggcag accatcgagc aggccatcaa gcacgagatg 781 tccgacgagc tgcacgaggc catgatggcc atagttgagt gcgtccagtc accggcggcc 841 ttcttcgcca accgcctcta caaggccatg aatggcgccg gcaccgatga cgccacgctc 901 atccgcatca tcgtcagccg ctcggagatc gacctggaga ccattaagca ggagttcgag 961 cggatctaca accgtacgct gcacagcgcc gtggtggacg cggagacctc tggtgactac 1021 aagcgggccc tgacagccct acttggatcc gcctaggccc gaggatgtgg cagctggtcc 1081 gcccaatatt ttattcgtgt taatagcttt gatcgtagtg tgccttttag gaaaatcgct 1141 tttaatgtcg tctgcgcatg cgcacactgt tggcaataaa taaacggaat tc // LOCUS NEUMPPX 2038 bp ss-mRNA PLN 03-JUL-1990 DEFINITION N.crassa matrix processing peptidase (MPP) mRNA, complete cds. 
ACCESSION J05484 KEYWORDS matrix processing peptidase. SOURCE N.crassa, cDNA to mRNA. ORGANISM Neurospora crassa Eukaryota; Plantae; Thallobionta; Eumycota; Ascomycotina; Pyrenomycetes; Sordariales; Sordariaceae. REFERENCE 1 (bases 1 to 2038) AUTHORS Schneider,H., Arretz,M., Wachter,E. and Neupert,W. TITLE Matrix processing peptidase of mitochondria: Structure-function relationships JOURNAL J. Biol. Chem. (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable [or printed] sequence for [1] kindly submitted by W.Neupert, 17-APR-1990. FEATURES from to/span description pept 41 1774 matrix processing peptidase BASE COUNT 439 a 626 c 552 g 421 t ORIGIN 1 cccacattac gctgccgcat cacaattcct tgttgcagcc atgctgaatc gcttccggcc 61 agcgcggcta gtagcccaat cctccagatg cttgcccttg acgagggcgc gggcaggtcc 121 cttgcccgtt aacaatgcca ggactttggc tacgagagcc gctgctgtca acaccaagga 181 accgaccgaa cgcgacaaca tcaccactct ctccaatggt gtccgtgtcg cttccgagga 241 ccttcccgat gccttctccg gtgtaggtgt ctacatcgac gcggggtccc gatatgagaa 301 cgactatgtc cggggtgcca gtcacatcat ggaccggcta gccttcaagt ctacaagtgc 361 gaggactgcg gacgaaatgc tcgaaactgt tgagaagctc ggtggtaaca ttcagtgcgc 421 ttcttcgcgc gagtctatga tgtaccaggc ggccaccttc aacaaggcta ttcccaccgc 481 tgttgagctc atggccgaga ccatccgcga tcccaagctt acggacgagg agctggaggg 541 acagatcatg acggcgcaat atgaggtcaa cgagatctgg tccaaggccg aactgatcct 601 gcccgagttg gtgcacatgg ctgccttcaa ggacaacact cttggcaacc cgttgctttg 661 tcccaaggag aggttggatt acatcaaccg ggatgtcatc caaacatacc gcgacgcttt 721 ctacaggccc gagcgccttg ttgttgcctt tgctggtgtg cctcatgaga gggccgtcaa 781 gctcgcagag aagtactttg gtgatatgaa ggcctccgat gctcccggtc tctcgaggac 841 aggttccgaa acctccgtcg actcgctagt gtccgagtcc agcgaggcct cgagtgaatc 901 ttcatcatcc tcctcggact cttccgagtc gagtggcggg ctgctctcca agcttttctc 961 tcccaaggcc aagaaagcca cccccaaccc cttcctcacc cgggtaccta ttagcaccga 1021 agacttgact cggcctgctc actacacagg cggtttcctc accctcccat cacagccccc 1081 accgctcaac cccaaccttc ccacatttac tcacatacag 
ctcgccttcg agggcctcgc 1141 catctcggac gacgacatct acgccctcgc caccctgcag accctcctcg gcggcggcgg 1201 ctccttctct gccggcggtc ccggcaaggg catgtactcg cgtctctaca ctaacgttct 1261 caaccagcac ggctgggttg agtcctgcgt ggccttcaac cactcataca cggactcggg 1321 tctcttcggc atcgccgcct cgtgctaccc gggtcgcacc ctgcccatgc tccaggtcat 1381 gtgccgcgag ctgcacgccc tcaccaccga ccatggctac tcggccctgg gcgagctcga 1441 ggtttcgcgc gccaagaacc agctccgcag cagcctcctg atgaacctcg agagccgcat 1501 ggtcgagctc gaggatctgg gccgccaagt tcaggttcac ggtcgcaaga tcccggtccg 1561 cgagatgacg cgccgtatca acgagctgac ggtcaaggac ctccgaaggg tcgctaagcg 1621 cgtggttggt ggcatggcga ataacgccgg ccagggaagc ggtgcgccga cggtggtgct 1681 gcaggaggcg acggtgcaag gactcaagac tacggagctg gggtgggatc agatccagga 1741 tacaattgct cagtggaagc tcggtagacg gtaaacgttt gtcaagggga aaaaaagagt 1801 agggcgtgga gaagttatgt aagaggagcg ctgtattgaa cttggcgaca cgcacacacc 1861 ggaacgataa aggcgtttta ggttccccac gagcataggg aagaggctag atggttgctc 1921 tgtacaatcg caacttttct tggtgagtta tacaagatgt gtccaggtac atctttgcct 1981 taccatactg tacgatagca atgaagattt tctgatatat caaaagtcaa aagtcaaa // LOCUS HUMCYP2DG 5503 bp ds-DNA PRI 03-JUL-1990 DEFINITION Human debrisoquine 4-hydroxylase mutant allele (CYP2D6-MA1) gene, complete cds. ACCESSION M33189 KEYWORDS debrisoquine 4-hydroxylase. SOURCE Human individual MAGA DNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 5503) AUTHORS Gonzalez,F.J. JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by F.Gonzalez, 23-MAR-1990, for release after publication. Author address: F.Gonzalez National Cancer Institute Bldg. 37 Rm. 
3E-24 National Institutes of Health Bethesda, Md 20892
FEATURES       from  to/span    description
    pept        814      993    debrisoquine 4-hydroxylase, exon 1
               1696     1877    debrisoquine 4-hydroxylase, exon 2
               2419     2571    debrisoquine 4-hydroxylase, exon 3
               2661     2820    debrisoquine 4-hydroxylase, exon 4
               3254     3430    debrisoquine 4-hydroxylase, exon 5
               3621     3762    debrisoquine 4-hydroxylase, exon 6
               3970     4157    debrisoquine 4-hydroxylase, exon 7
               4612     4753    debrisoquine 4-hydroxylase, exon 8
               4852     5030    debrisoquine 4-hydroxylase, exon 9
    pre-msg     726     5103    debrisoquine 4-hydroxylase mRNA and introns
    IVS         994     1695    debrisoquine 4-hydroxylase intron A
    IVS        1878     2418    debrisoquine 4-hydroxylase intron B
    IVS        2572     2660    debrisoquine 4-hydroxylase intron C
    IVS        2821     3253    debrisoquine 4-hydroxylase intron D
    IVS        3431     3620    debrisoquine 4-hydroxylase intron E
    IVS        3763     3969    debrisoquine 4-hydroxylase intron F
    IVS        4158     4611    debrisoquine 4-hydroxylase intron G
    IVS        4754     4851    debrisoquine 4-hydroxylase intron H
    signal      689      702    TATA box
BASE COUNT     1066 a   1537 c   1851 g   1049 t
ORIGIN      Chromosome 22.
1 ggctgggaag tggggtactt ggtgccgggt ctgtatgtgt gtgtgactgg tgtgtgtgag 61 agagaatgtg tgccctaagt gtcagtgtga gtctgtgtat gtgtgaatat tgtctttgtg 121 tgggtgattt tctgcgtgtg taatcgtgtc cctgcaagtg tgaacaagtg gacaagtgtc 181 tgggagtgga caagagatct gtgcaccatc aggtgtgtgc atagcgtctg tgcatgtcaa 241 gagtgcaagg tgaagtgaag ggaccaggcc catgatgcca ctcatcatca ggagctctaa 301 ggccccaggt aagtgccagt gacagataag ggtgctgaag gtcactctgg agtgggcagg 361 tgggggtagg gaaagggcaa ggccatgttc tggaggaggg gttgtgacta cattagggtg 421 tatgagccta gctgggaggt ggatggccgg gtccactgaa accctggtta tcccagaagg 481 ctttgcaggc ttcaggagct tggagtgggg agagggggtg acttctccga ccaggcccct 541 ccaccggcct accctgggta agggcctgga gcaggaagca ggggcaagaa cctctggagc 601 agcccatacc cgccctggcc tgactctgcc actggcagca cagtcaacac agcaggttca 661 ctcacagcag agggcaaagg ccatcatcag ctccctttat aagggaaggg tcacgcgctc 721 ggtgtgctga gagtgtcctg cctggtcctc tgtgcctggt ggggtggggg tgccaggtgt 781 gtccagagga gcccatttgg tagtgaggca ggtatggggc tagaagcact ggtgcccctg 841 gccgtgatag tggccatctt cctgctcctg gtggacctga tgcaccggcg ccaacgctgg 901 gctgcacgct actcaccagg ccccctgcca ctgcccgggc tgggcaacct gctgcatgtg 961 gacttccaga acacaccata ctgcttcgac caggtgaggg aggaggtcct ggagggcggc 1021 agaggtgctg aggctcccct accagaagca aacatggatg gtgggtgaaa ccacaggctg 1081 gaccagaagc caggctgaga aggggaagca ggtttggggg acttcctgga gaagggcatt 1141 tatacatggc atgaaggact ggattttcca aaggccaagg aagagtaggg caagggcctg 1201 gaggtggagc tggacttggc agtgggcatg caagcccatt gggcaacata tgttatggag 1261 tacaaagtcc cttctgctga caccagaagg aaaggccttg ggaatggaag atgagttagt 1321 cctgagtgcc gtttaaatca cgaaatcgag gatgaagggg gtgcagtgac ccggttcaaa 1381 ccttttgcac tgtgggtcct cgggcctcac tgctcaccgg catggaccat catctgggaa 1441 tgggatgcta actggggcct ctcggcaatt ttggtgactc ttgcaaggtc atacctgggt 1501 gacgcatcca aactgagttc ctccatcaca gaaggtgtga cccccacccc cgccccagga 1561 tcaggaggct gggtctcctc cttccacctg ctcactcctg gtagccccgg gggtcgtcca 1621 aggttcaaat aggactagga cctgtagtct ggggggatcc tggcttgaca agaggccctg 1681 accctccctc tgcagttgcg 
gcgccgcttc ggggacgtgt tcagcctgca gctggcctgg 1741 acgccggtgg tcgtgctcaa tgggctggcg gccgtgcgcg aggcgatggt gacccgcggc 1801 gaggacacgg ccgaccgccc gcctgtgccc atcacccaga tcctgggttt cgggccgcgt 1861 tcccaaggca agcagcggtg gggacagaga cagatttccg tgggacccgg gtgggtgatg 1921 accgtagtcc gagctgggca gagagggcgc ggggtcgtgg acatgaaaca ggccagcgag 1981 tggggacagc gggccaagaa accacctgca ctagggaggt gtgagcatgg ggacgagggc 2041 ggggcttgtg acgagtgggc ggggccactg ccgagacctg gcaggagccc aatgggtgag 2101 cgtggcgcat ttcccagctg gaatccggtg tcgaagtggg gggcggggac cgcacctgtg 2161 ctgtaagctc agtgtgggtg gcgcggggcc cgcggggtct tccctgagtg caaaggcggt 2221 cagggtgggc agagacgagg tgggcaaagc cctgccccag ccaagggagc aaggtggatg 2281 cacaaagagt gggccctgtg accagctgga cagagccagg gactgcggga gaccaggggg 2341 agcatagggt tggagtgggt ggtggatggt ggggctaatg ccttcatggc cacgcgcacg 2401 tgcccgtccc acccccaggg gtgttcctgg cgcgctatgg gcccgcgtgg cgcgagcaga 2461 ggcgcttctc cgtctccacc ttgcgcaact tgggcctggg caagaagtcg ctggagcagt 2521 gggtgaccga ggaggccgcc tgcctttgtg ccgccttcgc caaccactcc ggtgggtgat 2581 gggcagaagg gcacaaagcg ggaactggga aggcggggga cggggaaggc gaccccttac 2641 ccgcatctcc cacccccaag acgccccttt cgccccaacg gtctcttgga caaagccgtg 2701 agcaacgtga tcgcctccct cacctgcggg cgccgcttcg agtacgacga ccctcgcttc 2761 ctcaggctgc tggacctagc tcaggaggga ctgaaggagg agtcgggctt tctgcgcgag 2821 gtgcggagcg agagaccgag gagtctctgc agggcgagct cccgagaggt gccggggctg 2881 gactggggcc tcggaagagc aggatttgcg tagatgggtt tgggaaagga cattccagga 2941 gaccccactg taagaagggc ctggaggagg aggggacatc tcagacatgg tcgtgggaga 3001 ggtgtgcccg ggtcaggggg caccaggaga ggccaaggac tctgtacctc ctatccacgt 3061 cagagatttc gattttaggt ttctcctctg ggcaaggaga gagggtggag gctggcactt 3121 ggggagggac ttggtgaggt cagtggtaag gacaggcagg ccctgggtct acctggagat 3181 ggctggggcc tgagacttgt ccaggtgaac gcagagcaca ggagggattg agaccccgtt 3241 ctgtctggtg taggtgctga atgctgtccc cgtcctcctg catatcccag cgctggctgg 3301 caaggtccta cgcttccaaa aggctttcct gacccagctg gatgagctgc taactgagca 3361 caggatgacc tgggacccag cccagccccc 
ccgagacctg actgaggcct tcctggcaga 3421 gatggagaag gtgagagtgg ctgccacggt ggggggcaag ggtggtgggt tgagcgtccc 3481 aggaggaatg aggggaggct gggcaaaagg ttggaccagt gcatcacccg gcgagccgca 3541 tctgggctga caggtgcaga attggaggtc atttgggggc taccccgttc tgtcccgagt 3601 atgctctcgg ccctgctcag gccaagggga accctgagag cagcttcaat gatgagaacc 3661 tgcgcatagt ggtggctgac ctgttctctg ccgggatggt gaccacctcg accacgctgg 3721 cctggggcct cctgctcatg atcctacatc cggatgtgca gcgtgagccc atctgggaaa 3781 cagtgcaggg gccgagggag gaagggtaca ggcgggggcc catgaacttt gctgggacac 3841 ccggggctcc aagcacaggc ttgaccagga tcctgtaagc ctgacctcct ccaacatagg 3901 aggcaagaag gagtgtcagg gccggacccc ctgggtgctg acccattgtg gggacgcatg 3961 tctgtccagg ccgtgtccaa caggagatcg acgacgtgat agggcaggtg cggcgaccag 4021 agatgggtga ccaggctcac atgccctaca ccactgccgt gattcatgag gtgcagcgct 4081 ttggggacat cgtccccctg ggtgtgaccc atatgacatc ccgtgacatc gaagtacagg 4141 gcttccgcat ccctaaggta ggcctggcgc cctcctcacc ccagctcagc accagcccct 4201 ggtgatagcc ccagcatggc tactgccagg tgggcccact ctaggaaccc tggccaccta 4261 gtcctcaatg ccaccacact gactgtcccc acttgggtgg ggggtccaga gtataggcag 4321 ggctggcctg tccatccaga gcccccgtct agtggggaga caaaccagga cctgccagaa 4381 tgttggagga cccagcgcct gcagggagag ggggcagtgt gggtgcctct gagaggtgtg 4441 actgcgccct gctgtggggt cggagagggt actgtggagc ttctcgggcg caggactagt 4501 tgacagagtc cagctgtgtg ccaggcagtg tgtgtccccc gtgtgtttgg tggcaggggt 4561 cccagcatcc tagagtccag tccccactct caccctgcat ctcctgccca gggaacgaca 4621 ctcatcacca acctgtcatc ggtgctgaag gatgaggccg tctgggagaa gcccttccgc 4681 ttccaccccg aacacttcct ggatgcccag ggccactttg tgaagccgga ggccttcctg 4741 cctttctcag caggtgcctg tggggagccc ggctccctgt ccccttccgt ggagtcttgc 4801 aggggtatca cccaggagcc aggctcactg acgcccctcc cctccccaca ggccgccgtg 4861 catgcctcgg ggagcccctg gcccgcatgg agctcttcct cttcttcacc tccctgctgc 4921 agcacttcag cttctcggtg cccactggac agccccggcc cagccaccat ggtgtctttg 4981 ctttcctggt gaccccatcc ccctatgagc tttgtgctgt gccccgctag aatggggtac 5041 ctagtcccca gcctgctccc tagccagagg ctctaatgta 
caataaagca atgtggtagt 5101 tccaactcgg gtcccctgct cacgccctcg ttgggatcat cctcctcagg gcaaccccac 5161 ccctgcctca ttcctgctta ccccaccgcc tggccgcatt tgagacaggg gtatgttgag 5221 gctgagcaga tgtcagttac ccttgcccat aatcccatgt cccccactga cccaactctg 5281 actgcccaga ttggtgacaa ggactacatt gtcctggcat gtggggaagg ggccagaatg 5341 ggctgactag aggtgtcagt cagccctgga tgtggtggag agggcaggac tcagcctgga 5401 ggcccatatt tcaggcctaa ctcagcccac cccacatcag ggacagcagt cctgccagca 5461 ccatcacaac agtcacctcc cttcatatat gacaccccaa aac // LOCUS CHKCOLCARB 1394 bp ss-mRNA VRT 03-JUL-1990 DEFINITION Chicken cartilage alpha-1(IX) collagen-proteoglycan mRNA, 5' end, clone 7 and 13. ACCESSION M28659 J05129 KEYWORDS IX collagen-proteoglycan; extracellular matrix protein. SOURCE Chicken 17 day old embryo cartilage, cDNA to mRNA, clones 7 and 13. ORGANISM Gallus domesticus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Aves; Neornithes; Neognathae; Galliformes; Phasianidae. REFERENCE 1 (bases 1 to 1394) AUTHORS Nishimura,I., Muragaki,Y. and Olsen,B.R. TITLE Tissue-specific forms of type IX collagen-proteoglycan arise from the use of two widely separated promoters JOURNAL J. Biol. Chem. 264, 20033-20041 (1989) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence [1] kindly submitted by B.R.Olsen, 12-SEP-1989, for release after publication. 
FEATURES from to/span description pept 147 > 1394 alpha(IX) collagen-proteoglycan BASE COUNT 372 a 341 c 358 g 323 t ORIGIN 1 tccctccccg ctgactgcgt ggggcaggag gagcattctg cacccattca tactctcgtt 61 aacaggactt atgacaggga accagagagt gtgaatatat acaccaaata ttcacatgtg 121 agacgtgaag aaaaccagca gagaagatga aaagcaactg gaaaattaca gctttcttgt 181 atatgtgtag ttttctgggg tctttcatct cagctaccta ccagcaacaa tcaagattgc 241 cagtcattct gggtgctcgt caaagaactg atctctgccc aacaatcagg attggcgaag 301 atgacttgcc aggctttgac ctgatttctc agttccagat agaaaaagct gcttctcaag 361 gaattgtcca gagagtagtg ggttctactg ctctacaagt ggcttataaa ttgggaccca 421 atgtagactt caggattcca accagtgcaa tatattccaa tggattgcct gatgaatact 481 cctttcttac tacttttcgg atgactggag ccacacttca gaaatactgg actatttggc 541 agattcagga ttcttcagga aaagaacaag ttggagtgaa tctcaatggt ccaatgaaaa 601 gcgttgagtt ttcttataaa ggagtggatg gaagtctcca gactgcatca tttttacatt 661 tgcctttctt gtttgattcc caatggcaca agcttatgat aagtgtggaa acaaccagcg 721 ttacactttt tattgactgt ataaaggtag aaaccctaaa cataaaacca aaggggaaaa 781 tcagtgttga tggcttctca gtgcttggaa gactcaaaaa taatcctcaa atttcagttc 841 cgtttgaagt ccagtggatg ccgattcact gcgatcccct gcggccccag agagaaggtt 901 gtggtgagct cccagcccgg ataagccaga cagtgattga gagaggtctt cctggtccac 961 caggcccccc aggtccacca gggccaccag gagttcctgg cattgatggc atcgatggag 1021 agagaggacc taacggcccc cccggtccac cgggtccgga cggcgacgca ggcaaagcgg 1081 gatccccggg cctgcctgga gagccaggag ctgatgggtt aacaggccct gatggatcac 1141 caggtgccac aggaccgaaa ggacagaagg gtgagccagg acctccaggt gctcgtggac 1201 ttccgggcaa gggtcttctt ggaccacccg gtccagctgg tgctgcagga cttcccggtg 1261 aagtaggccg tgctggccca cctggtgatc caggaaaaag gggaccacca ggaccaccag 1321 gaccaccagg ccctcgagga acaattggtc tgcaagacgg tgacccattg tgtcccaatg 1381 cttgtccacc tggc // LOCUS CHKCOLCARC 776 bp ss-mRNA VRT 03-JUL-1990 DEFINITION Chicken cartilage alpha-1(IX) collagen-proteoglycan mRNA, 5' end, clone YM43. ACCESSION M28660 J05129 KEYWORDS IX collagen-proteoglycan; extracellular matrix protein. 
SOURCE Chicken 17 day old embryo cartilage, cDNA to mRNA, clone YM43. ORGANISM Gallus domesticus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Aves; Neornithes; Neognathae; Galliformes; Phasianidae. REFERENCE 1 (bases 1 to 776) AUTHORS Nishimura,I., Muragaki,Y. and Olsen,B.R. TITLE Tissue-specific forms of type IX collagen-proteoglycan arise from the use of two widely separated promoters JOURNAL J. Biol. Chem. 264, 20033-20041 (1989) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence [1] kindly submitted by B.R.Olsen, 12-SEP-1989, for release after publication. FEATURES from to/span description pept 255 > 776 alpha(IX) collagen-proteoglycan BASE COUNT 119 a 319 c 235 g 103 t ORIGIN 1 gaattcccga cacccccacc tgcatcaccc cccccccatc tcgcagtccc tcgcccccat 61 caaagcccct ttgtgccacc tccgtcgcca cccggcccca gaatagcagc acgctcacct 121 gcaggggggg tcggagccag cgcctgccct cgtcccccgc tgctccatat taatcagccc 181 cttcctcctc ctcctcctcc tcctcctcct gccggtccct ccgcagtccg acacttacag 241 ccccgctccc ggccatggcc caccgcagcc ccgcgctctg cctgctgctc ctgcacgctg 301 cctgcctctg cctggcccag ctccgggggc caccaggaga gcccggccca cgagggcccc 361 caggtccgcc aggagtgccg ggagcggatg gcattgatgg tgacaaaggc tctcccggag 421 cccccggctc cccaggtgcc aaaggggagc ccggagcccc gggtccggat gggcctccag 481 ggaagccagg cttagacggt cttacgggag ccaaagggag ccggggccca tggggggggc 541 aaggactgaa gggtcagcct ggactgccgg ggccgccggg gctccccggt ccctcgctgc 601 caggaccacc cgggctgcca ggccaggtcg gactgcccgg ggagatcgga gtgccaggac 661 ccaagggcga tcctggaccc gatggcccac ggggcccccc gggtccccca gggaaacccg 721 gccccccagg acacatccaa ggagtggagg gaagcgcaga tttcttgtgc ccgacc // LOCUS CHKCOLCOR 602 bp ss-mRNA VRT 03-JUL-1990 DEFINITION Chicken cornea alpha-1(IX) collagen-proteoglycan mRNA, 5' end. ACCESSION M28658 J05129 KEYWORDS IX collagen-proteoglycan; extracellular matrix protein. SOURCE Chicken 8 day old embryo cornea, cDNA to mRNA, clone IN212. 
ORGANISM Gallus domesticus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Aves; Neornithes; Neognathae; Galliformes; Phasianidae. REFERENCE 1 (bases 1 to 602) AUTHORS Nishimura,I., Muragaki,Y. and Olsen,B.R. TITLE Tissue-specific forms of type IX collagen-proteoglycan arise from the use of two widely separated promoters JOURNAL J. Biol. Chem. 264, 20033-20041 (1989) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence [1] kindly submitted by B.R.Olsen, 12-SEP-1989, for release after publication. FEATURES from to/span description pept 89 > 602 alpha(IX) collagen-proteoglycan BASE COUNT 121 a 175 c 186 g 120 t ORIGIN 1 tgcagctgaa aggtgaactg ggctgtaagg cacattttgg atttctgtgt attgtagcac 61 ctaggtggct gcaaaatctg tccccactat ggcctgggct gcatggggcc ctctgcttct 121 cgggcttttc ttgcagattt tttgcctctg ccttgctcaa agaggtcttc ctggtccacc 181 aggcccccca ggtccaccag ggccaccagg agttcctggc attgatggca ttgatggaga 241 gagaggacct aacggccccc ccggtccacc gggtccggac ggcgacgcag gcaaagcggg 301 atccccgggc ctgcctggag agccaggagc tgatgggtta acaggccctg atggatcacc 361 aggtgccaca ggaccgaaag gacagaaggg tgagccagga cctccaggtg ctcgtggacc 421 tccgggcaag ggtcttcttg gaccacctgg tccagctggt gctgcaggac ttcccggtga 481 agtaggccct gctggcccac ctggtgatcc aggaaaaagg ggaccaccag gaccaccagg 541 accaccaggc cctcgaggaa caattggtct gcaagatggt gacccattgt gtcccaatgc 601 tt // LOCUS CHKCOLG1 840 bp ds-DNA VRT 03-JUL-1990 DEFINITION Chicken cartilage alpha-1(IX) collagen-proteoglycan gene, exon 1, and cornea alpha-1(IX) collagen-proteoglycan gene, 5' flank. ACCESSION M28662 J05129 KEYWORDS IX collagen-proteoglycan; extracellular matrix protein. SEGMENT 1 of 2 SOURCE Chicken DNA. ORGANISM Gallus domesticus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Aves; Neornithes; Neognathae; Galliformes; Phasianidae. REFERENCE 1 (bases 1 to 840) AUTHORS Nishimura,I., Muragaki,Y. and Olsen,B.R. 
TITLE Tissue-specific forms of type IX collagen-proteoglycan arise from the use of two widely separated promoters JOURNAL J. Biol. Chem. 264, 20033-20041 (1989) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence [1] kindly submitted by B.R.Olsen, 12-SEP-1989, for release after publication. FEATURES from to/span description pept 807 / 820 cartilage alpha(IX) collagen-proteoglycan, exon 1 pre-msg 661 > 840 cartilage alpha(IX) collagen-proteoglycan IVS 821 > 840 cartilage alpha(IX) collagen-proteoglycan intron A BASE COUNT 249 a 202 c 157 g 232 t ORIGIN 1 ccacccgtga gaattcctca agtgaaaatg caaatgaaca gaaattataa attgttcaga 61 aactgagtat atgttctcca aatttctctg aacgaggccc ctctctttgg aaagtataat 121 gtgtgtgtga ataacaactg aacaacagga gtcctcttag taatgcctat gtgcattcct 181 tgaaaaggtt caagtttaag cagtaaaagt ccttttaaat aattggtttt attcagaaga 241 atcaactagg acactaccag ataggcttct ccagagacct tctgatggat aaatcaacaa 301 gaactgaaaa tatcttcttt ataggactga tgttcttttc ttgtgaaagt ttttagcttt 361 aacaccacag tgaagccacc agtttccaca aaatcccttg gtacatgtta ttattctttt 421 atctgcctca ctgaacagtg cccctgccat ttggtgactg gcatcgctta actcatatag 481 tgttaatctt tctaccctga tgtcggcata agcagcaccc ctttcttcac tctcttggct 541 tctttatatt cagctggctc cagagatccg ccctcagacc ccaccaggat acagacgtct 601 gtccagcccc cacctccttc cctttgcaag attaaaacca acccagcagc ctgcacctcc 661 ctccccgctg agtcctgcgt ggggcaggag gagcattctg cacccattca tactctcgtt 721 aacaggactt atgacaggga accagagagt gtgaatatat acaccaaata ttcacatgtg 781 agacgtgaag aaaaccagca gagaagatga aaagcaactg gtaagagaac aagtgggatt // LOCUS CHKCOLG2 840 bp ds-DNA VRT 03-JUL-1990 DEFINITION Chicken cartilage alpha-1(IX) collagen-proteoglycan gene, exons 6 and 7, and cornea alpha-1(IX) collagen-proteoglycan gene, exon 1. ACCESSION M28661 J05129 KEYWORDS IX collagen-proteoglycan; extracellular matrix protein. SEGMENT 2 of 2 SOURCE Chicken DNA, clones 13 and 26. 
ORGANISM Gallus domesticus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Aves; Neornithes; Neognathae; Galliformes; Phasianidae. REFERENCE 1 (bases 1 to 840) AUTHORS Nishimura,I., Muragaki,Y. and Olsen,B.R. TITLE Tissue-specific forms of type IX collagen-proteoglycan arise from the use of two widely separated promoters JOURNAL J. Biol. Chem. 264, 20033-20041 (1989) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence [1] kindly submitted by B.R.Olsen, 12-SEP-1989, for release after publication. The first amino acid for the open reading frame for exon 6 which is indicated in the features as starting at nucleotide 9 could start at nucleotide 11. FEATURES from to/span description pept 636 / 707 cornea alpha(IX) collagen-proteoglycan, exon 1 pept / 9 86 cartilage alpha(IX) collagen-proteoglycan, exon 6 (AA at 9) 750 / 770 cartilage alpha(IX) collagen-proteoglycan, exon 7 pre-msg 541 > 840 cornea alpha(IX) collagen-proteoglycan mRNA and introns pre-msg < 1 > 840 cartilage alpha(IX) collagen-proteoglycan mRNA and introns IVS < 1 8 cartilage alpha(IX) collagen-proteoglycan intron E IVS 87 749 cartilage alpha(IX) collagen-proteoglycan intron F IVS 771 > 840 cartilage alpha(IX) collagen-proteoglycan intron G IVS 708 > 840 cornea alpha(IX) collagen-proteoglycan intron A BASE COUNT 181 a 214 c 222 g 223 t ORIGIN 1 cctaccagtt tgaagtccag tggatgctga ttcactgcga tcccctgcgg ccccagagag 61 aaggttgtgg tgagctccca gcccgggtga cccgcgttcc cagcctgaca gtgctgaact 121 gggctgccac taaatctatg aagttcacag gagcttcatt tttccccgtc tatgtccaga 181 gaagtctatt tcaccatacc tgactgaaat ttggtgcctt tagcaatcca gccccctgga 241 gtagcagcct tactttaact cttccatgcc ttcctatctt ttccttctca gccagtgcta 301 gggtcagagg cttttgaaag atatccctga cagcgaagag agactgctgt ctccttgcag 361 actcctgggc aacctgaggg agggaaaccc ttgcctggga ggtgagggag ggtgccaaaa 421 caacagcgag cagggcaaag ggttaaaggt actgctgtca ttcaatcctc ttcctcccag 481 ccttcagctc tcctccaatc ccacgaccct ctcccaggca gttaataagg aactgtgagg 541 ggtgccttgc 
agctgaaagg tgaactgggc tgtaaggcac attttggatt tctgtgtatt
      601 gtagcaccta ggtggctgca aaatctgtcc ccactatggc ctgggctgca tggggccctc
      661 tgcttctcgg gcttttcttg cagatttttt gcctctgcct tgctcaagta agtttattct
      721 gactttatac ctgtttttct cccttacaga taagccagac agtgattgag gtaagtgtga
      781 gggaagggat ggtgctgcat cgtaagggaa agggtttgga tgaagagggg ctgaaggctg
//
LOCUS       RATIRF1A     2078 bp ss-mRNA             ROD       03-JUL-1990
DEFINITION  Rat interferon regulatory factor 1 (IRF-1) mRNA, complete cds.
ACCESSION   M34253
KEYWORDS    interferon regulatory factor 1; transcription factor.
SOURCE      Rat cell line Nb2-11c T-cell, cDNA to mRNA, clones 25,4b.
  ORGANISM  Rattus rattus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
            Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae.
REFERENCE   1  (bases 1 to 2078)
  AUTHORS   Yu-Lee,L.-Y., Hrachovy,J.A., Stevens,A.M. and Schwarz,L.A.
  TITLE     Interferon regulatory factor 1 is an immediate-early gene under
            transcriptional regulation by prolactin in Nb2 T cells
  JOURNAL   Mol. Cell. Biol. 10, 3087-3094 (1990)
  STANDARD  full staff_review
COMMENT     Draft entry and computer-readable [or printed] sequence for [1]
            kindly submitted by L.-Y.Yu-Lee, 11-MAY-1990.
FEATURES from to/span description pept 198 1184 interferon regulatory factor 1 (IRF-1) BASE COUNT 542 a 544 c 540 g 452 t ORIGIN 1 ctcgacgaag gagtaggacg agctctcact gtctgagcca aaccgaaccg ggccgagctg 61 agccgaggtc agcggtggcc agaggaaccc agcatctcgg gcatcattcg ctccgtgcac 121 gcatcgtgta cctacaccgc aactccgtgc ctcattcccg ggtaccctct gtgactcgct 181 cctgcagcaa agccaccatg cctatcactc ggatgcgaat gagaccctgg ctagagatgc 241 agattaattc caaccaaatt ccagggctga gctggatcaa taaagaagag atgatcttcc 301 agatcccatg gaagcatgct gccttgcacg gttgggatat caacaaggat gcctgtctgt 361 tccggagctg ggccattcac acaggccgat acaaagctgg ggaaaaagag ccagatccca 421 agacttggaa ggcaaacttc cggtgtgcca tgaactccct accagacatc gaggaagtga 481 aggaccagag caggaacaag ggcagctctg ctgtacgcgt gtaccggatg ctgccacccc 541 tcaccaagaa ccagaggaaa gagagaaagt ccaagtccag ccgtgacact aagagcaaaa 601 ccaagaggaa gctgtgcgga gattctagcc ctgacacctt atctgacgga ctgagcagct 661 ctactctgcc tgatgaccac agcagttaca cagctcaggg atacctgggt caggacttgg 721 acatggacag ggacattacc ccagctctgt caccgtgcgt cgtcagcagc agtctctctg 781 agtggcatat gcagatggac atcatgccag acagcaccac tgatctgtac aacttgcagg 841 tgtcgcccat gccctccacc tctgaagctg caacagatga ggatgaggaa gggaagttac 901 ctgaggacat catgaagctc tttgaacagt ctgagtggca gccgacgcac gtggatggca 961 agggatactt gctcaatgaa ccaggagccc aactctctac tgtctatgga gacttcagct 1021 gcaaggagga accagagatc gacagccctg gaggggacat cgagataggc atacagcgtg 1081 tcttcacaga gatgaagaat atggaccccg tcatgtggat ggacaccctg ctgggcaact 1141 ctaccaggcc gccctccatt caggctattc cttgtgcacc ataatttggg tccctgaccc 1201 gttcttgccc tcctgagtga gctaggtcca gcatcatggt ggctgtgata caacataaag 1261 ctaaacttcc gtggacccct tgatgtggca aaacataatc ccattgccaa gcagggaagg 1321 gaccaaacca tcctccttgg gtcagtggac tgactcttca gagcttagga ggcagggtct 1381 aagtttttca agctggtcct gactcctagg aagatggatt ggcgttctga ggttagtgtg 1441 aggcagagga cctggacgga agttaccttc tagctctttg aaagcttcat tgcttagaga 1501 gggtctcacc actgggctgg cctgggggat agaccagcgc ccacagaaga gcattgcact 1561 ggccttaggg ctggctccac actgggagac 
aattgcacta agtcctattc ccaaagaact 1621 gctgcccttc ccaaccgagc cctgggatgg ttctagagcc agtgaaatgt gaaggaaaaa 1681 atggggtcct gtgagggttg tctcccttag cctcagaggg attctgcctc actccctgct 1741 ccagctgtgg ggctcaggaa aaaaaaatgg cactttctct gtggactttg ccacatttct 1801 gatcagaagt gtacactaac atttctccca agtcttggcc tttgcattta tttatatagt 1861 gccttgccct gtgcctgctg tctctcctca ggcctcagca gtcctcagca ggcccaggga 1921 gggggttgtg agcgccttgg cgtgactctg aacattggaa acgccaccta actactaagt 1981 tgtgtctgat ctcgtgtgga tctgtgtaaa tatgtatatt catcttttta taaaaaccta 2041 agttgtttaa aaaaaaaaaa aaaaaaaaaa aaaaaaaa // LOCUS HUMINTB1A 1146 bp ss-mRNA PRI 03-JUL-1990 DEFINITION Human integrin beta-1 subunit mRNA, 3' end (cytoplasmic domain). ACCESSION M34189 KEYWORDS integrin; integrin beta-1 subunit cytoplasmic domain. SOURCE Human placenta, cDNA to mRNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 1146) AUTHORS Altruda,F., Cervella,P., Tarone,G., Botta,C., Balzac,F., Stefanuto,G. and Silengo,L. TITLE A human integrin beta-1 subunit with a unique cytoplasmic domain JOURNAL Gene (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable [or printed] sequence for [1] kindly submitted by F.Altruda, 10-MAY-1990. 
FEATURES from to/span description pept < 1 117 integrin beta-1 subunit (AA at 1) (cytoplasmic domain) BASE COUNT 373 a 184 c 222 g 367 t ORIGIN 1 aagcttttaa tgataattca tgacagaagg gagtttgcta aatttgaaaa ggagaaaatg 61 aatgccaaat gggacacggt aagttacaaa acatccaaaa agcaaagtgg cttataaagt 121 aaatgtaata ctcctaagac ttatgtatta gctgtcaggc tgattattaa agtcctttct 181 aagtatttta ttcccccaaa agtttcttac tcaaggaatt tgcatttagt gaaaaacaga 241 aagcatccta aatatatccc attgaaacaa aacattgatt ataagcatgt atattctggt 301 tcatgtggcc gatattttta tttctttaat gattttgatc ctaaatctgc cttttcatct 361 aatgtgaagt agaatcctaa ataatgttat ctgtgtagca agctattcaa tgggaaagct 421 gcttctttct ttaaaacaaa caaacaaaaa aaaccttcag tggaaagcca aattccaaaa 481 ggttatatac caagcttgtc caactcgcag ctcgtcggcc aggacatgca gcccagaata 541 gctttgaatg tggccccaac acaaatttgt aaactttctt agaaattgta attattatta 601 ttattttttt ttggtaactt tttttaaagc tcatcagcta tcgttagtgt attttatgtg 661 tggcccaaga cagttcttct tcttgccagt gtggcccagg gaagccaaaa gattggacac 721 ccctgctata tactatatga ttccatttag aggacattct ggaaaagcaa aactgtaggg 781 gcaaaaatca gtggttgcta ggggctggaa tgggggaaag tgttgaccac agaggggcgt 841 aagggatctt ccttgggatg acttgattgt gggtggattt atgtatttga aaactcacag 901 aactatgtac tttaaaaaga tgtatgttcc tctatgaaaa ttatatctca gtaaactttg 961 gcttataaaa atcttaaaag ccctaagtga ccgaaaggtt atgttagcat tgagtgcttt 1021 gaaatatgga gtcagagggt ggggtaacca aatgttggcc tttgtgtatt catcttttga 1081 tacaagaaag caatgccaat cttcagtatt tttaaattgt aaatgaattt tgtagttccc 1141 gaattc // LOCUS NEUAMTR 5928 bp ds-DNA PLN 03-JUL-1990 DEFINITION N.crassa mating type protein gene, complete cds. ACCESSION M33876 KEYWORDS mating type protein. SOURCE N.crassa (strain 74-ORS-A) DNA. ORGANISM Neurospora crassa Eukaryota; Plantae; Thallobionta; Eumycota; Ascomycotina; Pyrenomycetes; Sordariales; Sordariaceae. REFERENCE 1 (bases 1 to 5928) AUTHORS Grotelueschen,J., Metzenberg,R.L. and Glass,N.L. TITLE The Neurospora crassa A mating type region JOURNAL Proc. Natl. Acad. Sci. U.S.A. 
(1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable [or printed] sequence for [1] kindly submitted by N.L.Glass, 26-APR-1990. FEATURES from to/span description pept 4121 4283 mating type protein, exon 1 4343 5046 mating type protein, exon 2 IVS 4284 4342 mating type protein intron A signal 3858 3906 GC signal signal 3945 3949 CAAT box signal 5319 5326 poly-A signal BASE COUNT 1470 a 1459 c 1526 g 1473 t ORIGIN 1 cgagaccgtt gttgcttgct gtatccatgc cggtgtcaaa gtcttgtcgt cgtatgcagg 61 agtctgaaaa ccaacccgag aagggtgggg caagtgactc tgcagtgatg tcaagactga 121 ggtccagctg ctgattgaaa tggctgatag aacagacgga ccaagactga aactgatgct 181 gagctgcgct gatggaatgt gccaagagaa tgaagctgtc cactgccggc gagcacgcct 241 agtgtgctgt gatttgagga cgggactccc tactcgtagg attgacgaga gattgaacag 301 agagccatcg acttatttgt gatgtcttgg ttgttgatca actgttgccg gctctccaaa 361 tgcgaagtcg gcgagtacga gcgttagtcc gtgaatgtgg gacagcggta gtgaatgaga 421 catgatctgg atcaatgtag tcgcaagcgt gagtaaagaa tcaggacgcc tgcttgagaa 481 ggaatcgcat ggagtcgtcc tcagtcatca tgaagtccgg gtcctggcgc cacggacagg 541 tcggtcgaca tgtcgataat gtcgataatg tggttaggct cctcccactc gaagtcgggg 601 aaagcgccga cctcctcggc ttgttgaggt tgaacaacat cgttaagtgc ggcttcttcg 661 gcagccaact gccgttcgac tcttcccaga caaagtcgag catccggcaa aattcgttgc 721 cgatcttcgg gcgagattac gcaaggatat ctcatgcgag gaggaacggg atcggtggtg 781 ggaaatcgcg gctggtcggg gtaaaggtga ggattctcag ctcgatgacg atgaacctcc 841 atttccgcca gttccctgaa gcgagctttt gccgcgggtg ctcgctgttc cacaagcccg 901 caacaatttg agctgtatag atatggttaa taaatgtcat gcaggacagg cagtttgttc 961 acatcaatat tgcgagctga aagactcgga tcttcggaga acagagtatc caacagccat 1021 tggtagtaga gaacgaattg gttgcgcggg cggctagtac cattgctttg agttaccgaa 1081 ctctgttcaa aaaagttgct gggcagagtc atgtcgatgg tatgagagct ttgctcttgc 1141 tgctcaacat tagcacctcc ttggtttgag atgcccaaga gctctcgctt ggaatggtgg 1201 gagttcgcgc tggaatgtct ggttagcttg agcaatgggg gcccaatgtt tggtgaactt 1261 acaagggggc gaaactgcga gtatgtccca gtttccccat tccatcatca tgagcccaaa 1321 
tgtgatcgtg cagatcgcga tgctggactc gtcgggggca accatgagca aggcctcttc 1381 gccaaacacg acactgaaag gagtcagcta tgagctataa gagaaacttt cctcgggcca 1441 acactcacac tgagttgtcc atcgcataga caagatcctc ttcagcaaat tcggccagat 1501 gggacctgaa cagcattacc tggatcctgc catagtgaat tgcagtcaca gggctgagac 1561 cgggtgcgat gtcgctgatt gaatcaacgt ctaaggcaga cattgtgata gaggggtgca 1621 gacggcgact acaggtgtgc ttggatgtgg ttatggaatg gatgggacag acgaagtgta 1681 agaagattga cgtatatgaa gatgaatgac aacgaggacc ggtagttggt ggaaaacgga 1741 attgtcgagt gttgagtttg gaggaaggaa gagggggtat ttgcgagaat ttgagccggt 1801 atttgtaggt gatacgacaa tctgctctgc gtgggttaat gtcaaggtga atgcaggaaa 1861 ggcccaatac ctcccgcagc tcgtcctcct attgttcgcg ggaaagggta cgcattttac 1921 tattgtttct gtggcttgcc agctggcgca ccttatgtga ttggtcaaat tgacgtttgc 1981 cctaaggtcg gccgggagaa caataggaag gacttgggat gaaatttggc atacgatgcc 2041 cctcaaatcg gcgagtgacc ttggctgatt ctcacaggag aacaatagga ataacttggg 2101 atgaatctca gcatgcagtg cccctcgtca agtaatctcc acctcaagtt tcacaggaga 2161 acaataggaa ggacctggat tggaaacctg ccaggcaatg tccctcgaaa gatattttgg 2221 aaccctgtgt ctttgttggt tcacttcttc gaaactccgt gtcaacaaaa cttctctcca 2281 tacttagcag tcgcatggca gctttctcaa gcgttcattg ttgaggtttc cttttcgtca 2341 gctgtcgaca tgaatcttct caacatgcaa cctaaaaggt cagagcaacc agctatgttc 2401 gaagaaaacc gtgcctctag ccaggaaggc caggatctcg aagtgatgta caaggtagca 2461 attcttctga cccggaaaca ctcgcttgct tgtcgctaat ggattggtca gaaactccat 2521 cagctacagg ctaggctttc ccgttcagtt ctttcagagg caatcaagga gttcgaagag 2581 aacttcggtg tcttttccat gaagccaagc tcttgctatg ctcaacgagt tcgaagtatc 2641 gccaaagctg gttcgggtct agcaacgagt tcggatctag cgacgagaga agaatcatca 2701 agacatcatg ctgcatcatt gagtcgacaa acacaattct taacttcctc tcatttcttg 2761 agaagaatcg aggattgcca ttcggtggag atcaaagact ccaacaagct gcctacaaag 2821 gccagcagtt tgcgttccgc ctccttcgct cacttacact tcacaaagct gctcaggagg 2881 ttccgggaaa ggactttggc ttggtctacg gaaaagatgt gtacgtactg aatggacata 2941 ttttgcacag gtcgaagcaa gagatcgtgg ggcaggcggg aggaagaaac tggcatgtcg 3001 accataccct 
ccatcctttg aggcgcgttc caggcacccc atggcacaag ttctttggca 3061 atcttgaagt tggcgacgac aagcaacttc gcctcttcga tgatgatgcg gccgtcgaca 3121 gttaccgagt cggtcctcag aagttctttg tggttattcc ggaaactgct gaatttattt 3181 tggacgaagt cagcagcgag catcagagag tcgctacaat tcacacagag gtaagtactt 3241 gaacgtgtct gaaaactaca aaatttgcac gactgactga aggtagaatg gacatgtcca 3301 gccgccagca ccgacatcca ttcagcaaga agtaagttct cctatctcga tttaatgtag 3361 gtaatcatca ctgacatcac ggcaggctct cctcaggaag ttggactttg ccatgacaac 3421 atcattgcct ggttatgttg tagaaggaca acctgagatt gtgtttcatc atgaacgtta 3481 cgccaggttc gtatgatcct gcttactttt cacggatgat gatgtgctaa caaccgatca 3541 acagatcccc gttgactaca gtcaggagcg cccacttagc attctctccc atgttttcac 3601 tcgacccgca ctttggggag agggtttgga gcttgctgat cacttcgacc cgcgagacgg 3661 tgtgcagcaa gaggagcaca tctattacat ttgatggata tggtagaatc cgtggctgca 3721 caaacaatgc tacttttaat ttaagaaaag tattattcga tcagagtggc tttacttttt 3781 tcttagaagt tcaacaaagc tgttatgtgt tatgtaatcc aagccctcgc tgaaagttgt 3841 gcccccaagg cagcaagccc cccccccccc cccccccccc ccccaccccc ctccctcctc 3901 tcccccgcgg tcgtcaagtg aagggagaga gaagccgctc cacccaaatt aaccaaccaa 3961 ccccatgtct cctatttaag aaagcccagt tcatcttttc caccttcacc caaacttccc 4021 accatctttc cccgaacatc aacttcgcaa ccaaaatctc ggcagcacta cctcacgtgt 4081 tcagtgctct ccaatcaata atccatccac cagaaacacg atgtcgggtg tcgatcaaat 4141 cgtcaagacg ttcgccgacc tcgctgagga cgaccgtgaa gcggcaatga gagctttctc 4201 aaggatgatg cgtagaggta ccgaacctgt tcgccgaatc cccgcggcaa agaagaaggt 4261 caacggcttc atgggtttca gatgtgagtc aaatctgaat caacattgtc gttgatccat 4321 ggctgattgc tcttcatttc agcgtactat tccccgctct tctctcagct cccgcaaaag 4381 gagagatcgc ccttcatgac tattctctgg cagcatgatc ccttccacaa tgagtgggat 4441 ttcatgtgct cggtgtattc gtcaatccgg acctaccttg agcaggagaa ggttactctg 4501 caactctgga ttcactatgc tgtcggccat ctgggagtga ttatccgcga caactacatg 4561 gcatcctttg gctggaacct cgtccgtttt cccaacggca ctcacgacct cgagcgcacg 4621 gctcttcctt tggttcagca caatctccag cccatgaacg gcttatgcct gctcaccaag 4681 tgcctcgaga gcggattgcc 
tcttgccaat cctcactctg tcatcgccaa gctttcagat 4741 cctagctacg acatgatctg gttcaacaag cgtcctcacc gtcagcaggg acacgccgtt 4801 caaactgatg aatctgaagt tggagtttcg gcgatgttcc ctcgcaatca cacggtcgct 4861 gcagaggtag atggcatcat caatcttcct ctctcccatt ggattcagca gggagaattc 4921 ggtaccgagt ctggatactc agctcagttt gagaccttgt tggattcaat tctcgagaat 4981 ggacacgcct ccagcaatga cccttacaac atggctctgg ctatcgatgt tcccatgatg 5041 ggttagtgga agatgaggta ccatcttgca aaactttacc cgtgtgctaa ccgattaaca 5101 ggatttaacg gaggagcata gaagcacggc gcagtcaccg ttttctttcc ttgtcacatc 5161 tggatttcgt gttacgggca tacaaagcga gggcgaaaag ggtctagtta ggtttctttg 5221 tgcatacatt gggcaatcat gagacttcag aatcgacggg gtggaatggg caattacacg 5281 gcaaggagac aggtacgcct agaaggcgaa agagtatcaa ataaaatcaa atcagcggcg 5341 tccaccatct gatccgggat ggccttcact actcgggggt tgcggttcgc ttttgtatgg 5401 ggagaggggg gaaaaagttt ggccagccaa aagcgacccg aatggaaccc tagtcaatca 5461 atacctatga acgcaagcgt ctgcggtgtc attgccggat ttgacatgtc gttgagataa 5521 agaaacaggc ccgccgctga cggcaacgct tatgcatgca accccgctgc gctgaatgct 5581 tcagccgcaa aactggggca atgcgggagc tgtggccccc gttcatgcta gtgtacaggg 5641 ttgctctgct tctaagatcc tgataagggt ccgctgatgt ttgtacatac tacatatcag 5701 tccctgtaag tttgctagtc tggttcctgc cccatatttt cttccaaggg ggtaatatgg 5761 ggactgtaag gcggactggt ctatctacga gtccgggtcc ccgcaggaac tgtacccttc 5821 agtgggtccc ggtcacgtat cctgcacgtt ccgtctcggc caggaatggc agctttcccc 5881 gttgattttc ggtttatcat cacataaagg ttttggttgc ttgtcgac // LOCUS HUMNCADH 3451 bp ss-mRNA PRI 03-JUL-1990 DEFINITION Human N-cadherin mRNA, complete cds. ACCESSION M34064 KEYWORDS N-cadherin; cell adhesion molecule; transmembrane protein. SOURCE Human muscle, cDNA to mRNA, clones lambda-[4-10,1-5,13,14]. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 3451) AUTHORS Walsh,F.S., Barton,C.H., Putt,W., Moore,S.E., Kesell,D., Spurr,N. and Goodfellow,P.N. 
TITLE The N-cadherin gene maps to human Chromosome 18 and is not linked to the E-cadherin gene JOURNAL J. Neurochem. (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable [or printed] sequence for [1] kindly submitted by C.H.Barton, 08-MAY-1990. FEATURES from to/span description pept < 1 2247 N-cadherin (AA at 1) BASE COUNT 1041 a 703 c 724 g 983 t ORIGIN Chromosome 18. 1 gactgggtca tccctccaat caacttgcca gaaaactcca ggggaccttt tcctcaagag 61 cttgtcagga tcaggtctga tagagataaa aacctttcac tgcggatacg tgtaactggg 121 ccaggagctg accagcctcc aactggtatc ttcattctca accccatctc gggtcagctg 181 tcggtgacaa agcccctgga tcgccagcag aatgcccggt ttcatttagg ggcacatgca 241 gtagatatta atggaaatca agtggagacc cccattgaca ttgtcatcaa tgttattgac 301 atgaatgaca acagacctga gttcttacac caggtttgga atgggacagt tcctgaggga 361 tcaaagcctg gaacatatgt gatgaccgta acagcaattg atgctgacga tcccaatgcc 421 ctcaatggga tgttgaggta cagaatcgtg tctcaggctc caagcacccc ttcacccaac 481 atgtttacaa tcaacaatga gactggtgac atcatcacag tggcagctgg acttgatcga 541 gaaaaagtgc aacagtatac gttaataatt caagctacag acatggaagg caatcccaca 601 tatggccttt caaacacagc cacggccgtc atcacagtga cagatgtcaa tgacaatcct 661 ccagagttta ctgccatgac gttttatggt gaagttcctg agaacagggt agacatcata 721 gtagctaatc taactgtgac cgataaggat caaccccata caccagcctg gaacgcagtg 781 tacagaatca gtggcggaga tcctactgga cggttcgcca tccagaccga cccaaacagc 841 aacgacgggt tagtcaccgt ggtcaaacca atcgactttg aaacaaatag gatgtttgtc 901 cttactgttg ctgcagaaaa tcaagtgcca ttagccaagg gaattcagca cccgcctcag 961 tcaactgcaa ccgtgtctgt tacagttatt gacgtaaatg aaaaccctta ttttgccccc 1021 aatcctaaga tcattcgcca agaagaaggg cttcatgccg gtaccatgtt gacaacattc 1081 actgctcagg acccagatcg atatatgcag caaaaatatt taagatacac taaattatct 1141 gatcctgcca attggctaaa aatagatcct gtgaatggac aaataactac aattgctgtt 1201 ttggaccgag aatcaccaaa tgtgaaaaac aatatatata atgctacttt ccttgcttct 1261 gacaatggaa ttcctcctat gagtggaaca ggaacgctgc agatctattt acttgatatt 1321 aatgacaatg cccctcaagt gttacctcaa gaggcagaga 
cttgcgaaac tccagacccc 1381 aattcaatta atattacagc acttgattat gacattgatc caaatgctgg accatttgct 1441 tttgatcttc ctttatctcc agtgactatt aagagaaatt ggaccatcac tcggcttaat 1501 ggtgattttg ctcagcttaa tttaaagata aaatttcttg aagctggtat ctatgaagtt 1561 cccatcataa tcacagattc gggtaatcct cccaaatcaa atatttccat cctgcgcgtg 1621 aaggtttgcc agtgtgactc caacggggac tgcacagatg tggacaggat tgtgggtgcg 1681 gggcttggca ccggtgccat cattgccatc ctgctctgca tcatcatcct gcttatcctt 1741 gtgctgatgt ttgtggtatg gatgaaacgc cgggataaag aacgccaggc caaacaactt 1801 ttaattgatc cagaagatga tgtaagagat aacattttaa aatatgatga agaaggtgga 1861 ggagaagaag accaggacta tgacttgagc cagctgcagc agcctgacac tgtggagcct 1921 gatgccatca agcctgtggg aatccgacga atggatgaaa gacccatcca cgccgagccc 1981 cagtatccgg tccgatctgc agccccacac cctggagaca ttggggactt cattaatgag 2041 ggccttaaag cggctgacaa tgaccccaca gctccaccat atgactccct gttagtgttt 2101 gactatgaag gcagtggctc cactgctggg tccttgagct cccttaattc ctcaagtagt 2161 ggtggtgagc aggactatga ttacctgaac gactgggggc cacggttcaa gaaacttgct 2221 gacatgtatg gtggaggtga tgactgaact tcagggtgaa cttggttttt ggacaagtac 2281 aaacaatttc aactgatatt cccaaaaagc attcagaagc taggctttaa ctttgtagtc 2341 tactagcaca gtgcctgctg gaggctttgg cataggctgc aaaccaattt gggctcagag 2401 ggaatatcag tgatccatac tgtttggaaa aacactgagc tcagttacac ttgaatttta 2461 cagtacagaa gcactgggat tttatgtgcc tttttgtacc tttttcagat tggaattagt 2521 tttctgttta aggctttaat ggtactgatt tctgaaacga taagtaaaag acaaaatatt 2581 ttgtggtggg agcagtaagt taaaccatga tatgcttcaa cacgcttttg ttacattgca 2641 tttgctttta ttaaaataca aaattaaaca aacaaaaaaa ctcatggagc gattttatta 2701 tcttggggga tgagaccatg agattggaaa atgtacatta cttctagttt tagactttag 2761 tttgtttttt ttttttttca ctaaaatctt aaaacttact cagctggttg caaataaagg 2821 gagttttcat atcaccaatt tgtagcaaaa ttgaattttt tcataaacta gaatgttaga 2881 cacattttgg tcttaatcca tgtacacctt tttatttctg tatttttcca cttcactgta 2941 aaaatagtat gtgtacataa tgttttattg gcatacgtct atggagaagt gcagaaactt 3001 cagaacatgt gtatgtatta tttggactat ggattcaggt tttttgcatg 
tttatatctt 3061 tcgttatgga taaagtattt acaaaacagt gacatttgat tcaattgttg agctgtagtt 3121 agaatactca atttttaatt tttttaattt ttttattttt tattttcttt ttggtttggg 3181 gagggagaaa agttcttagc acaaatgttt tacataattt gtaccaaaaa aaaaaaaaaa 3241 ggaaaggaaa gaaaggggtg gcctgacact ggtggcacta ctaagtgtgt gtttttttaa 3301 aaaaaaaatg gaaaaaaaaa agcctttaaa ctggagagac ttctgacaac agctttgcct 3361 ctgtattgtg taccagaata taaatgatac acctctgacc ccagcgttct gaataaaatg 3421 ctaattttgg ataacaaaaa aaggggaatt c // LOCUS MHVNSGII 870 bp ss-RNA VRL 03-JUL-1990 DEFINITION Murine hepatitis virus non-structural protein gene-2 (NS2). ACCESSION M34035 KEYWORDS non structural protein. SOURCE Murine hepatitis virus (strain MHV-JHM), cDNA to viral RNA. ORGANISM Murine hepatitis virus A59 Unclassified. REFERENCE 1 (bases 1 to 870) AUTHORS Schwarz,B., Routledge,E. and Siddell,S.G. TITLE The coronavirus MHV 30 kDa non-structural protein NS2 is not essential for virus replication in transformed murine cells JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable [or printed] sequence for [1] kindly submitted by S.G.Siddell, 04-MAY-1990. 
Author address: S.G.Siddell Inst of Virology Univ of Wuerzburg Versbacherstrasse 7 8700 Wuerzburg FEATURES from to/span description pept 40 837 non-structural protein-2 (NS2) BASE COUNT 269 a 147 c 187 g 267 t ORIGIN 1 gcgatagcct agtaaatgtt aaataaatct atacttgtca tggctgcgag aatggccttt 61 gctgacaagc ctaatcattt tataaacttt cctctagccc aatttagtgg ctttatgggt 121 aagtatttaa agcttcagtc tcaacttgtg gaaatgggtt tggactgtaa attacaaaag 181 gtaccacatg ttagtattac cctgcttgac attaaagcag accaatacaa acaggtggaa 241 tttgcaatac aagaaataat agatgatctg gcggcatatg agggagatat tgtctttgac 301 aaccctcata tgcttggcag atgtcttgtt cttgatgtta aaggatttga agagttgcat 361 gaagatattg ttgaaattct ccgcagaagg ggttgcactg cagatcaatc cagacaatgg 421 attccgcact gcactgtggc ccaatttgat gaagaaaaag aaataaaaga aatgcaattc 481 tattttaaat tgcccttcta tctcaagcat aacaacctac ttacggatgc taggcttgag 541 cttgtgaaga taggttcttc caaagtaggt gggttttatt gtagtgaact aagtatttgg 601 tgtggtgaga gactttgtta caagccccca acccccaaat tcagtgatat atttggctat 661 tgctgcatag ataaaatacg tggtgattta gaaataggag acctaccgcc agatgatgag 721 gaagcgtggg ccgagctaag ttaccactat caaagaaaca cctacttctt cagacatgtg 781 cacgataata gtatctattt tcgtaccgta tgtagaatga agggttgtat gtgttgattt 841 gtttttacac tattagtgta ataaacttat // LOCUS MCAMV6 1904 bp ds-DNA VRL 03-JUL-1990 DEFINITION Cauliflower mosaic virus (CaMV) gene six protein gene, complete cds. ACCESSION M23620 KEYWORDS gene six protein. SOURCE Cauliflower mosaic virus (strain D4) DNA. ORGANISM Cauliflower mosaic virus Viridae; ds-DNA nonenveloped viruses; Caulimovirus. REFERENCE 1 (sites) AUTHORS Daubert,S. and Routh,J. TITLE Determinants of symptomatology in the DNA sequence CaMV JOURNAL mol plant microb interact (1990) In press STANDARD full staff_review REFERENCE 2 (bases 1 to 1904; for [1]) AUTHORS Daubert,S. and Routh,J. JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence [1], [2] kindly submitted by S.Daubert, 08-AUG-1989, for release after publication. 
FEATURES from to/span description pept 1 1563 gene six protein signal 1629 1635 TATA box signal 1547 1550 CCAT enhancer 1 BASE COUNT 645 a 450 c 379 g 430 t ORIGIN bps 5774 to 7678 of genome. 1 atggagaaca tagaaaaact cctcatgcaa gagaaaatac taatgctaga gctcgatcta 61 gtaaaagcaa aaataagctt agcaagagct aacggctctt cgcaacaagg agaactctct 121 ctccaccgtg aaacaccgga aaaagaagaa gcagttcatt ctgcactggc cacttttacg 181 ccaacccaag taaaagctat tccagagcaa acggctcctg gtaaagaatc aacaaatccg 241 ttgatggcta gtatcttgcc aaaagatatg aattcagttc agactgaaat taggctcaaa 301 aggccatcgg acttcttacg tccttatcag ggaatttcaa tcccacaaaa atctgagctt 361 aacagcacag ttactcttca cggagtagaa tcgggtattc aacaccctca tatcaactac 421 tacgttgtgt ataacggtcc acacgccggt atatacgatg actggggttg tacaaaggcg 481 gcaacaaacg gcgttcccgg agttgcacaa aagaagtttg ccactattac agaggcaaga 541 gcagcagctg acgcatacac aacaagtcag caaacagaca ggttgaactt catccccaaa 601 ggagaagctc aactcaagcc caagagcttt gcgaaggcct taaccagccc atcaaagcaa 661 aaagcccact ggctcacgct aggaaccaaa aggcccagca gtgatccagc cccaaaagag 721 atctcctttg ccccggagat caccatggac gactttctct atctctacga tctaggaaga 781 aagttcgacg gagaaggtga cgataccatg ttcaccactg ataatgagaa gattagcctc 841 ttcaatttca gaaagaatgc tgacccacag atggttagag aggcctacgc agcaggtctc 901 atcaagacga tctacccgag caataatctc caggagatca aataccttcc caagaaggtt 961 aaagatgcag tcaaaagatt caggactaac tgcatcaaga acacagagaa agatatattt 1021 ctcaagatca gaagtactat tccagtatgg acgattcaag gcttgcttca taaaccaagg 1081 caagtaatag aaattggagt ctctaagaaa gtagttccta ctgaatcaaa ggccatggag 1141 tcaaaaattc agatcgagga tctaacagaa ctcgccgtga agactggcga acagttcata 1201 cagagtcttt tacgactcaa tgacaagaag aaaatcttcg tcaacatggt ggagcacgac 1261 actctcgtct actccaagaa tatcaaagat acagtctcag aagaccaaag ggctattgag 1321 acttttcaac aaagggtaat atcgggaaac ctcctcggat tccattgccc agctatctgt 1381 cacttcatcg aaaggacagt agaaaaggaa ggtggcacct acaaatgcca tcattgcgat 1441 aaaggaaagg ctatcattca agatgcctct accgacagtg gtcccaaaga tggaccccca 1501 cccacgagga gcatcgtgga aaaagaagac gttccaacca 
cgtcttcaaa gcaagtggat 1561 tgatgtgaca tctccactga cgtaagggat gacgcacaat cccactaccc ttcgcaagac 1621 ccttcctcta tataaggaag ttcatttcat ttggagagga cacgctgaaa tcaccagtct 1681 ctctctacaa gactatctct ctctattttc tccagaataa tgtgtgagta gtttcccgat 1741 aagggaatta gggttcttat agggtttcgc tcatgtgttg agcatataag aaacccttag 1801 tatgtatttg tatttgtaaa atacttctat caataaaatt tctaattcct aaaaccaaaa 1861 tccagtacta aaatccagat ctcctaaagt ccctatagat cttt // LOCUS CREAPCYN 577 bp ss-mRNA PLN 03-JUL-1990 DEFINITION C.reinhardtii apoplastocyanin (PC6-2) mRNA, complete cds. ACCESSION J05524 KEYWORDS apoplastocyanin. SOURCE C.reinhardtii (strain 2137) vegetative cell, cDNA to mRNA, clone PC6-2. ORGANISM Chlamydomonas reinhardtii Eukaryota; Plantae; Thallobionta; Chlorophycota; Chlorophyceae; Volvocales; Chlamydomonadaceae. REFERENCE 1 (bases 1 to 577) AUTHORS Merchant,S., Hill,K., Kim,J.H., Thompson,J., Zaitlin,D. and Bogorad,L. TITLE Isolation and characterization of a complementary DNA clone for an algal pre-apoplastocyanin JOURNAL J. Biol. Chem. (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable [or printed] sequence for [1] kindly submitted by K.Hill, 11-MAY-1990. FEATURES from to/span description pept 22 459 apoplastocyanin (PC6-2) precursor sigp 22 162 apoplastocyanin signal peptide matp 163 456 apoplastocyanin BASE COUNT 91 a 189 c 171 g 126 t ORIGIN 1 bp upstream of EcoRI site. 
1 gaattccgta tcactttaaa aatgaaggct actctgcgtg cccccgcttc ccgcgccagc 61 gctgtgcgcc ccgtcgccag cctgaaggcc gctgctcagc gcgtggcctc ggtcgccggt 121 gtgtcggttg cctctctggc cctgaccctg gctgcccacg ccgacgccac cgtcaagctg 181 ggcgctgact ctggtgctct ggagttcgtc cccaagaccc tgaccatcaa gtccggcgag 241 accgtgaact tcgtgaacaa cgctggcttc ccccacaaca tcgtcttcga cgaggatgcc 301 atcccctccg gcgtgaacgc tgatgccatc tcccgcgatg actacctgaa cgcccccggc 361 gagacctact cggtgaagct gaccgctgcc ggcgagtacg gctactactg cgagccccac 421 cagggcgctg gcatggtcgg caagatcatt gtccagtaaa ttgctggcgg ctgccttcat 481 tttgtgaccg tgtgtgtttc ggggtgtggg gtcgggggtt tttgcggcgt ccggatggac 541 gcagagagcg tgtagctctg taactttttc ggaattc // LOCUS RATSVPIIA 4161 bp ds-DNA ROD 03-JUL-1990 DEFINITION Rat seminal vesicle secretion II protein (SVS II) gene, complete cds. ACCESSION J05443 KEYWORDS seminal vesicle secretion II protein. SOURCE Rat (strain CHARLES RIVER) male seminal vesicle epithelial cell DNA. ORGANISM Rattus rattus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 4161) AUTHORS Harris,S.E., Harris,M.A., Johnson,C.M., Bean,M.F., Dodd,J.G., Matusik,R.J., Carr,S.A. and Crabb,J.W. TITLE Structural characterization of the rat seminal vesicle secretion II protein and gene JOURNAL J. Biol. Chem. (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable [or printed] sequence for [1] kindly submitted by S.E.Harris, 25-APR-1990. 
FEATURES from to/span description pept 2065 2137 seminal vesicle secretion II protein (SVS II) precursor, exon 1 2377 3548 seminal vesicle secretion II protein precursor, exon 2 sigp 2065 2130 seminal vesicle secretion II protein signal peptide matp 2131 3545 seminal vesicle secretion II protein signal 1934 1946 CAAT box signal 2013 2020 TATA box BASE COUNT 1243 a 902 c 822 g 1194 t ORIGIN 1 tttcgatcca atgtgtggat tactcaccaa gtgtctgtct ttctttcttt ctctctttct 61 ctctttcttc ctcccttcct tccttccttc cttccttcct tccttccttc cttccttcct 121 ttctgttcaa ttgctcgttt ctcccttcat ctctcgccag tataccgcac actcaaactt 181 aaattttcat ttcaatgcgt tctcttctgg cacgtgcagc ataattacac tcatgattgt 241 caactccgtg atctgtttgc acaccttacc ccccccccca aggttttatc tgcatttaaa 301 aaaaagggat tatcaagaaa tttattctta attcagaaat gtgatcaaag ttgtcagatc 361 cgttctttac tgcctcctgt tggaaaaaaa aaatatccag ttcctggatt tttctaaaac 421 acagaaaaga gacctgggac aggggtatag gattgagcag gcatggtgag caattttata 481 ctgaatagat tcattgtgac ttaccggtct cctgagggaa ataatcactt ttcccaggta 541 gagagcagcc tagcaagaga tcagagtgca agcataaaac ccatgtgctt tataagtgta 601 tttattttat gcattttctg tttataagga catgagtgga ctttttattt gtcccttcca 661 tacaggacta cctagactat tgggatggga tgactgaaaa tatgttttca agtagacttc 721 cttccggaac taccttcata tggttctgaa ggcaaagtgg aacactgcac gggtgtcctc 781 ttctcccaag aacttggcca tggcgtcgtc gttttgagtc tatgtctgag ccacgaatgc 841 cataacagcc cttcctgtta ctctcacagt ggcacagagc tgtttctaaa caagaaggaa 901 gtcttccatc ttgtgtcagg atgctaatga cgtcaccaat ggcagtaagt gttcaccaca 961 gcccgttgct aaggcaatta tgttatccct cctgtcagag tttcctgtat taaaatatac 1021 tgagtttaat tttatgtcgg attccatgac atacattcag caaggaaacc aacagtatct 1081 tttgttcttt caacagtgat ttcctgtcac catttaactg ttgtctcgcc cccattcttt 1141 aaaatgtctc tgcacctcac cttgcctccc agatacactc ccaaactcat ttccctggac 1201 acacttgaaa tgttgctgct agcaagccac agctaccacg tcttctctgt cagggttcta 1261 gacaactcat ctaaagcagc accaggtctc tcttagaaat cagacatcgg atgtcatggt 1321 catagtatac ctcacagcta ctttggacat tcatgggccc agtattattt 
tccagggctg 1381 aggtttaact caagagcctc atgctcacat ggctggtggt ctggccacac agctatgact 1441 cgtctccatt tattcttcaa acttttattc ggagctccgt tgtgttctgt tgtctcctgt 1501 gcctttctat atgtgtgact gctcctttgc ctgtaaatga gaagctatgt caaattcaac 1561 gtaaaaaagg caacttcatg ggcttctgtg agatagcatg ctaaacagtg tcagctccac 1621 tacactgtga ccaggaaaat ttgatcaggc cctggttact ctcggagcat aaaagaagaa 1681 aaaaaaatct cttccccgct ctactctgga ttttgtttga aaataaaagg tccaatctgt 1741 ccttataaaa catgcataga ataaatatac tagaaaacac actttgtttg caaagggtat 1801 gtgataaagt cagagggttg ataaagattt gctgaggctt atgacataga aaaggtccct 1861 gacattgcat ccctgtgcaa agtacctggg aacattacca atgtccccaa ctgtgcagag 1921 gggaggaagt tgacatttag agataatttt tttaaaaaag caggcagtgc ttttgtagtg 1981 tcagttatat ctgtaataca tccagctaga gatatataaa tgtgaaagtc agctcagctc 2041 tcagtgaagg tccttcttga caagatgaag tcctctgtct tcattctatc tctgttcctc 2101 cttctggaaa gacaggcagc tgtggttgga cagtatggtg agtagggaga tggtgactag 2161 agggaaagtc actcagggag aatgttttta agggtgctct gggagtagca gatcctttca 2221 taggggaatt tttttttaaa tgagacctaa ttcttctcta ctgaaaacca aaacccttgt 2281 gggaacatca atggttttat gaggaaattt tggaaatgag acttggaagg actgtgcaga 2341 tcatgtaact taaaccttcc tcctctcaat taccaggtgg gacaaaaggt cacttccaga 2401 gcagctcatc agggtttatg cttggtcaga aaggccacct caattttggg ctcaaaggag 2461 gaagtgagga agcagctgaa gaaagcattt tcatgcaatc acaacaccag atgttcggcc 2521 aggatggtgg tgacatggcg cagacaagtg tttcacaaga gcatacaggt gtaaaggggg 2581 ccgcgatttg tcgtaaagga caagtatccc aattgaaatc ccaagaatcc caaataaaat 2641 cctttagaca agtaaaatcc agtggacagc tgaaatctgg aggatcccaa ttaaaatcct 2701 ttggacaagt gaaatccagt gagtcccaat taaaatcctt tggccaagtg aaagccagtg 2761 ggtcccaatt aaaatccttc ggacaagtga aagccagtgg gtcccaatta aaatcctatg 2821 gacaaatgaa atccagtggg tcccaagtga aatcctttgg acaaatgaaa tccagtgggt 2881 cccaagtaaa atcctttgga caaatgaaag ccagtgagtc ccaaataaaa tcctttggac 2941 aaagaaaatc ccaaggtggt caactacaat cctatggcca aatgaaatcc tatgggcaga 3001 cgaaatccct agaatcccag gccaaatcct tcggacaagt aaagtcccaa agtggccaaa 
3061 tgaaatcctc ctatggtcag agaaaatcct atggtgaaga gactcaactg aagtctttcg 3121 accaagatgc ccaactaaaa tcctatggtc aacaaaaatc ccaaaaacaa tcctccttta 3181 gccaagtaaa atctcaaagt gcccaactaa agtcctttgg ccaacaaaaa tccctcaaag 3241 ggttttctca acaaactcaa cagaaaggat ttgccatgga tgaagatttg tcacaagtgc 3301 ggaaacaatt tgacgatgat gacctctctg tacaacagaa gtctacccaa cagatgaaaa 3361 cagaggaaga cttatcccaa tttggacaac aacgacaatt tggacaagaa cgctcccaat 3421 cctataaagg atatcttgca caatacagaa agaaattaca ggaacaacaa caacagaaaa 3481 attttaatca ggataacttt tttacaaagg gaggggcagg cctatatcag gctcaactta 3541 agggataaca tattcactga gcaactgaag accaagatca atgtcaaggt atgttccacc 3601 aagtaggaag atattatcca aatttacttg tggtatatag gaatcctgga tccattatgg 3661 attgataccc atttgttact atcagtagaa gtattgttac acacttttag aaggatgaag 3721 aacagaccct ggtaaaatga gtccttgtag agtaaaggca gagtaagcaa gctaagttaa 3781 caattggtcc tgaattacta cattcaggga gcacttttca gtgcttctct gagcacagac 3841 agtttatatt attaaatgtg taccacctat gcaatcatat ttaacatttc atgatggaat 3901 cttatttgtt cttacacttt gacttgataa aaaaaaaaga ttggtttctt gcttatattg 3961 gtataaggtg gtattgcagc tgagctcttt ctctacacca gtgcgtgttc ttgagtcccc 4021 tgggacctct gctttccatc acaatccatg gggttaagga ttagctgctt ttccatcaga 4081 tggaagattt ggttacaaag atctctgcct ggagcagaca ctatattcag ttgtatgtcc 4141 aatggtgacc ctgttgaatt c // LOCUS CODCPRRKA 94 bp ss-RNA RNA 03-JUL-1990 DEFINITION Codium fragile chloroplast 4.5S RNA, complete cds. ACCESSION M35276 M15192 KEYWORDS 4.5S RNA. SOURCE C.fragile chloroplast RNA. ORGANISM Chloroplast Codium fragile Eukaryota; Plantae; Thallobionta; Chlorophycota; Chlorophyceae; Brypsidales; Codiaceae; Codium fragile. REFERENCE 1 (bases 1 to 94) AUTHORS Francis,M.A., Balint,R.F. and Dudock,B.S. TITLE A novel variety of 4.5 S RNA from Codium fragile chloroplasts JOURNAL J. Biol. Chem. 
262, 1848-1854 (1987) STANDARD simple staff_review FEATURES from to/span description RNA 1 94 4.5 S RNA BASE COUNT 35 a 11 c 14 g 34 t ORIGIN 1 aagtcctagt tgctataaat tcttaaatca aattatgtca gatttttaat aaaaagcagc 61 atttgtattt gaaaattgtt taggaactag gcac // LOCUS HAMAPBRBD 2339 bp ds-DNA ROD 03-JUL-1990 DEFINITION Hamster apolipoprotein (apoB) gene, partial cds (LDL receptor-binding domain). ACCESSION M35187 KEYWORDS apolipoprotein B. SOURCE Hamster DNA. ORGANISM Mesocricetus auratus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Cricetidae; Cricetinae; Cricetini. REFERENCE 1 (bases 1 to 2339) AUTHORS Smith,T.J., Hautamaa,D. and Maeda,N. TITLE Sequence of the putative low-density lipoprotein receptor-binding regions of apolipoprotein B in mouse and hamster JOURNAL Gene 87, 309-310 (1990) STANDARD simple staff_review COMMENT Phone call to T.J.Smith on 26-JUN-1990 made sequence clarifications on line 4 and line 17 of printed sequence. The hamster sequence should be 1 bp to the left on these lines. FEATURES from to/span description pept < 1 > 2339 apolipoprotein (apoB) (LDL receptor-binding domain) (AA at 1) BASE COUNT 725 a 519 c 450 g 645 t ORIGIN 1 bp upstream of EcoRI site. 
1 gaattccagc ttcctcgcct ctcacacaca attgagatac ctgcttttgg cagacttcat 61 ggaatcctga aaatccagtc tcccctcttt atattagatg caaatgccaa catacagaat 121 gtaactactt tagagaacaa agcagagatt gtggcctcca tcgctgctac aggagagtcc 181 gaaattgaag ctctcaattt tgattttcaa gcacaagctc aattcttgga gctaaaccct 241 aatcctctga tcctgaagga atccatgaac ttctccagca agcatgcgag aatggagcat 301 gagggtgaga tactattttc tggaaagttc attgagggaa aattggacac ggtcgcaagt 361 ttacagacag agaaaaatat ggtggagttt aataatggta tgattgtcaa gataaacaat 421 ccaatcatcc ttgacagtca cacaaagtat tttcacaagt tgagtatccc caggctggac 481 ttctccagta aggcttcctt taacaatgaa atcaagatgc tattagaagc tggacatgta 541 gcatggactt cttcagggac tgggtcatgg aattgggcct gtcccaactt ctcagatgag 601 ggcacacatt cgtccaaaat tagcttcact gtagaaggac ccattgcttt ttttggcttg 661 tctaataaca tcaacggcaa acacctgagg gttatccaga aattggctta tgaatctggc 721 ttcctcaact attccatgtt ggaagttgag tcaaaagttg aatctcagca tgtgggttcc 781 agcattctaa ctggcaaggg aacggtactg ctcagggagg caaaggcaga aatgactggc 841 gagcacaatg ctgacttgaa tggaaaagtt attgggactt tgaaaaactc tctttccttt 901 tcagcacaac catttatgat tactgcatcc acaaataatg atgggaattt gaaagttagt 961 tttccactaa agttgactgg gaaaatagac ttcctgaata actatgcact atttttgagt 1021 cctcatgccc agcaagcaag ctggcaagtg agtgctaggt tcaatcagta caaatataat 1081 caaaattttt ctgctataaa caatgaacat aacatagaag cccatgtagg aatgaatgga 1141 gatgccaacc tggatttctt aaccatacct ctaacaattc ctgaagtgaa actaccttac 1201 atagggctca cgactccctt gctgaaggat ttctccatat gggaagaaac aggcttgaaa 1261 gaatttttga agacaacaaa gcaatcgttt gatttaagtg taaaagctca atataaaaag 1321 aacagagaca ggcattccat tgcgattcct ctgaatgggt tttatgagtt tattctcaac 1381 aatgtcgact ccgggatagg gaagattggg aaagtcagag acagcgcatt agactatctt 1441 atttcatcct ataatgaagc aaaaaacaag tttgaaaatt cccttattca gccctccagg 1501 acctttcaaa agcgtggata cactatccca tttgtcaaca ttgaagtgac tccattcact 1561 gtagagacac tggcctccag ccatgtgatc ccaaaagcaa taaatacccc cagtgttcac 1621 attctgggcc ctaatgtcat tgtgccttca tacaggttag tgctgccctc cctggagctg 1681 ccagtccttc gtgtccccag 
gaatctactc aagttttccc tcccagattt caaggaattg 1741 agaacaattg acaatattta tattccagct cttggcaatt ttacctatga tttttccttt 1801 aaatcaagtg tcatcacgct gaataccaac gttggacttt ataaccggtc agacatcgtt 1861 gctcatttcc tttcttcctc ttcatttgtc acggatgccc tgcagtacaa attagagggt 1921 acttcacgtc tgactcggaa aagaggattg aagctagcca cagccgactc tctcactaac 1981 aaatttgtaa agggcaatca tgatagcacc tttagcttaa ccaagaaaaa catggaagca 2041 tcagtgaaaa caactgcaaa cctccatgct cccattttaa caatgaactt caagcaggaa 2101 cttaatggaa atgccaagtc aaagcccatt gtctcatcat ccattgaact aaactatgac 2161 ttcaattcct caaagctgta ctctactgct aaaggaggtg ttgaccacaa gtttagctta 2221 gaaagtctca cttcctactt ttccattgag tcatccacca aaggaaatat caagggatct 2281 gtcctttccc aggaatattc aggaagtgtt gccagtgagg ccaacacata cctgaattc // LOCUS MUSAPBRBD 2354 bp ds-DNA ROD 03-JUL-1990 DEFINITION Mouse apolipoprotein (apoB) gene, partial cds (LDL receptor-binding domain). ACCESSION M35186 KEYWORDS apolipoprotein B. SOURCE Mouse DNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 2354) AUTHORS Smith,T.J., Hautamaa,D. and Maeda,N. TITLE Sequence of the putative low-density lipoprotein receptor-binding regions of apolipoprotein B in mouse and hamster JOURNAL Gene 87, 309-310 (1990) STANDARD simple staff_review FEATURES from to/span description pept < 1 > 2354 apolipoprotein (apoB) (LDL receptor-binding domain) (AA at 1) BASE COUNT 757 a 525 c 423 g 649 t ORIGIN 1 bp upstream of EcoRI site. 
1 gaattccaac ttcctcacct ctcacataca attgaaatac ctgcttttgg caaactgcat 61 agcatcctta agatccaatc tcctctcttt atattagatg ctaatgccaa catacagaat 121 gtaacaactt cagggaacaa agcagagatt gtggcttctg tcactgctaa aggagagtcc 181 caatttgaag ctctcaattt tgattttcaa gcacaagctc aattcctgga gttaaatcct 241 catcctccag tcctgaagga atccatgaac ttctccagta agcatgtgag aatggagcat 301 gagggtgaga tagtatttga tggaaaggcc attgagggga aatcagacac agtcgcaagt 361 ttacacacag agaaaaatga agtagagttt aataatggta tgactgtcaa agtaaacaat 421 cagctcaccc ttgacagtca cacaaagtac ttccacaagt tgagtgttcc taggctggac 481 ttctccagta aggcttctct taataatgaa atcaagacac tattagaagc tggacatgtg 541 gcattgacat cttcagggac agggtcatgg aactgggcct gtcccaactt ctcggatgaa 601 ggcatacatt cgtcccaaat tagctttact gtggatggtc ccattgcttt tgttggacta 661 tccaataaca taaatggcaa acacttacgg gtcatccaaa aactgactta tgaatctggc 721 ttcctcaact attctaagtt tgaagttgag tcaaaagttg aatctcagca cgtgggctcc 781 agcattctaa cagccaatgg tcgggcactg ctcaaggacg caaaggcaga aatgactggt 841 gagcacaatg ccaacttaaa tggaaaagtt attggaactt tgaaaaattc tctcttcttt 901 tcagcacaac catttgagat tactgcatcc acaaataatg aaggaaattt gaaagtgggt 961 tttccactaa agctgactgg gaaaatagac ttcctgaata actatgcatt gtttctgagt 1021 ccccgtgccc aacaagcaag ctggcaagcg agtaccagat tcaatcagta caaatacaat 1081 caaaactttt ctgctataaa caatgaacac aacatagaag ccagtatagg aatgaatgga 1141 gatgccaacc tggatttctt aaacatacct ttaacaattc ctgaaattaa cttgccttac 1201 acggagttca aaactccctt actgaaggat ttctccatat gggaagaaac aggcttgaaa 1261 gaatttttga agacaacaaa gcaatcattt gatttgagtg taaaggctca atataaaaag 1321 aacagtgaca agcattccat tgttgtccct ctgggtatgt tttatgaatt tattctcaac 1381 aatgtcaatt cgtgggacag aaaatttgag aaagtcagaa acaatgcttt acattttctt 1441 accacctcct ataatgaagc aaaaattaag gttgataagt acaaaactga aaattccctt 1501 aatcagccct ctgggacctt tcaaaatcat ggctacacta tcccagttgt caacattgaa 1561 gtatctccat ttgctgtaga gacactggct tccaggcatg tgatccccac agcaataagc 1621 accccaagtg tcacaatccc tggtcctaac atcatggtgc cttcatacaa gttagtgctg 1681 ccacccctgg agttgccagt 
tttccatggt cctgggaatc tattcaagtt tttcctccca 1741 gatttcaagg gattcaacac tattgacaat atttatattc cagccatggg caactttacc 1801 tatgactttt cttttaaatc aagtgtcatc acactgaata ccaatgctgg actttataac 1861 caatcagata tcgttgccca tttcctttct tcctcttcat ttgtcactga cgccctgcag 1921 tacaaattag agggaacatc acgtctgatg cgaaaaaggg gattgaaact agccacagct 1981 gtctctctaa ctaacaaatt tgtaaagggc agtcatgaca gcaccattag tttaaccaag 2041 aaaaacatgg aagcatcagt gagaacaact gccaacctcc atgctcccat attctcaatg 2101 aacttcaagc aggaacttaa tggaaatacc aagtcaaaac ccactgtttc atcatccatt 2161 gaactaaact atgacttcaa ttcctcaaag ctgcactcta ctgcaacagg aggcattgat 2221 cacaagttca gcttagaaag tctcacttcc tacttttcca ttgagtcatt caccaaagga 2281 aatatcaaga gttccttcct ttctcaggaa tattcaggaa gtgttgccaa tgaagccaat 2341 gtatatctga attc // LOCUS RATBPTT 1035 bp ss-mRNA ROD 03-JUL-1990 DEFINITION Rat beta-tachykinin mRNA, complete cds. ACCESSION M35277 M15191 KEYWORDS neurokinin A; substance P; tachykinin. SOURCE Rat (Sprague-Dawley) rostral portion of the caudate putamen, cDNA to mRNA. ORGANISM Rattus norvegicus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 1035) AUTHORS Krause,J.E., Chirgwin,J.M., Carter,M.S., Xu,Z.S. and Hershey,A.D. TITLE Three rat preprotachykinin mRNAs encode the neuropeptides substance P and neurokinin A JOURNAL Proc. Natl. Acad. Sci. U.S.A. 
84, 881-885 (1987) STANDARD simple staff_review FEATURES from to/span description pept 100 492 beta-preprotachykinin sigp 100 156 beta-tachykinin signal peptide (3' end could be 171) matp 172 204 substance P matp 294 321 neurokinin mRNA < 1 1035 beta-preprotachykinin mRNA BASE COUNT 289 a 212 c 245 g 289 t ORIGIN 1 tcgaccagct ccactccagc accgcggcgg aggagagcga ggacgcccag gcaagtgcgc 61 acctgcggag catcaccggg tccgaccgca aaatccaaca tgaaaatcct cgtggcggtg 121 gcggtctttt ttctcgtttc cactcaactg tttgcagagg aaatcggtgc caacgatgat 181 ctaaattatt ggtccgactg gtccgacagt gaccaaatca aggaggcaat gcccgagccc 241 tttgagcatc ttcttcagag aatcgcccga agacccaagc ctcagcagtt ctttggatta 301 atgggcaaac gggatgctga ttcctcaatt gaaaaacaag tggccctgtt aaaggctctt 361 tatgggcatg gtcagatctc tcacaaaagg cataaaacag attcctttgt tggactaatg 421 ggcaaaagag ctttaaattc tgtggcttat gaaagaagcg caatgcagaa ctacgaaaga 481 aggcgtaaat aaaccctgta acgcactatc tattcatctc catctgtgtc cgcgagcagt 541 gagcggtaaa ataaaaatgt gcgctatgag gaatgattat ttatttaata tcaaatgttg 601 ttatgagtga aaaactcaaa aaagtgttta ttttttcata ttgtgccaat aagcattgta 661 attctaatgt ggtgacctcc tcagacagaa gtagaaatta gttgtaactt cagcaaagca 721 cagtgttgat ggagttgtac aagtttgcca gcgatgcaag tctccaaaga cagaaaggct 781 gctgtgaggc agtgcaggcg gctgctgctg gaggcagaga aactcctgtg tgtcttgcgc 841 ttcccttggt tgcttttatc ctaatgatgt actgagagtt tggtatctga ctctatttgt 901 atcctagcag catgtttcct gtgttgtgac tatatagaga tgtttttaaa agtttcaatg 961 tacttctctg gtcttcagtc attgtatgat gtgttgtgat agctaccatt ttaaataaaa 1021 gaatgtatct tcagg // LOCUS CHPRGIT 1051 bp ds-DNA PRI 03-JUL-1990 DEFINITION Chimpanzee rRNA gene internal transcribed spacer 1 (ITS1). ACCESSION M30947 KEYWORDS internal transcribed spacer. SOURCE Chimpanzee DNA. ORGANISM Pan troglodytes Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Pongidae. REFERENCE 1 (bases 1 to 1051) AUTHORS Gonzalez,I.L., Sylvester,J.E., Smith,T.F., Stambolian,D. and Schmickel,R.D. 
TITLE Ribosomal RNA gene sequences and hominoid phylogeny JOURNAL Mol. Biol. Evol. 7, 203-219 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by I.L.Gonzalez, 20-DEC-1989. BASE COUNT 69 a 411 c 425 g 146 t ORIGIN Acrocentric chromosomes 14, 15, 17, 22, 23. 1 acggagccga aggggggcgc gaggccgcgg cggcgccgcc gcgcgcttcc ctccccccca 61 ccccgccgca acgcggcgcg tgcgcgggcg gggcccgtgt gccgttcgtt cgttcgttcg 121 ctgcccggcc ccgccgccgc gagagccgag aactcgggag ggcgacgttg gggggagagc 181 gagagagaca gaaagaaggg ggcgcgtgtt cgctgcgcgt gtcgtggggc cggcggggag 241 cggtccccgg cctcgggccc gacggacgtg tgtgtcggcg ggcgcggggg cggttctcgg 301 cggcgtcacg gcgggtttgg gggggggggt ctcggtgccc tcctccccgc cggggcccgt 361 cgtccggccc cgccgcgcgc cggctccccg tcgtcggggc cgggccggat tcccgtcgcc 421 gcctccgccg cgcgccgctc cgcgccaccg ggcacggccc cgctcgctct ccccggcctt 481 cccgctaggg cgtctcgagg gtcgggggcc ggacgccggt ccccccctcc tcgtccgccc 541 ccgccgtcca ggtacctagc gcgttccggc gcggaggttt aaagacccct tgggggatcg 601 cccgtccgcc cgcgggtcgg gggcggtggt gggcccgcgg gggagtcccg tcgggagggg 661 cccggcccct cccgcgcctc ccccgcggac tccgcccccg gccggggccg cgccgcctcg 721 ccggctcggg tcgcggcggc cgtcgggtgg gggctttacc cggcggccgt cgcgtgcgcg 781 cgtgccgcgc gtgtggcgtg cgccccgcgc cgtgggggcg ggaacccccc gggcgcctgt 841 ggggtggtgt ccgcgctcgc ccctgcgtgg gcggcgcgcg cctccccgtg gtgtgaaacc 901 ttccgacccc tctccggagt ccggtcccgt ttttgctgtc tctctggccg gcctgaggca 961 accccctctc ctctgggggg gggggacgtg ccgcgccagg agggcctccc ggtgtgtttg 1021 tcgggagcgc cctcgccaaa tcgacctcgt a // LOCUS CHPRGITX 2512 bp ds-DNA PRI 03-JUL-1990 DEFINITION Chimpanzee 28S ribosomal RNA gene fragment. ACCESSION M30950 KEYWORDS 28S ribosomal RNA. SOURCE Chimpanzee DNA. ORGANISM Pan troglodytes Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Pongidae. 
REFERENCE 1 (bases 193 to 261; 375 to 945; 1042 to 1079; 1334 to 1357; 1742 to 1958; 2190 to 2204) AUTHORS Gonzalez,I.L., Sylvester,J.E., Smith,T.F., Stambolian,D. and Schmickel,R.D. TITLE Ribosomal RNA gene sequences and hominoid phylogeny JOURNAL Mol. Biol. Evol. 7, 203-219 (1990) STANDARD full staff_review REFERENCE 2 (bases 1 to 2512) AUTHORS Gonzalez,I.L., Sylvester,J.E., Smith,T.F., Stambolian,D. and Schmickel,R.D. JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1],[2] kindly submitted by I.L.Gonzalez, 20-DEC-1989. FEATURES from to/span description rRNA < 1 > 2512 28S ribosomal RNA BASE COUNT 397 a 837 c 924 g 354 t ORIGIN Chromosomes 14, 15, 17, 22, 23. 1 gtcaacaagt accgtaaggg aaagttgaaa agaactttga agagagagtt caagagggcg 61 tgaaaccgtt aagaggtaaa cgggtggggt ccgcgcagtc cgcccggagg attcaacccg 121 gcggcgggtc cggccgtgtc ggcggcccgg cggatctttc ccgccccccg ttcctcccga 181 cccctccacc cgccctccct tccccccgcc gcccctcctc ctcctccccg gagggggcgg 241 gctccggcgg gtgcgggggt gggcgggcgg ggccgggggt ggggtcggcg ggggaccgtc 301 ccccgaccgg cgaccggccg ccgccgggcg catttccacc gcggcggtgc gccgcgaccg 361 gctccgggac ggctgggaag gcccggcggg gaaggtggct cggggggccc cgtccgtcct 421 cctcctcccc ccccgtctcc gccccccggc cccgcgtcct cccccgggag ggcgcgcggg 481 tcggggcggt ggcggcggcg gcggcggtgg cggcggtggc ggcgggaccg aaaccccccc 541 cgagtgttac agccccccgg cagcagcact cgccgaatcc cggggccgag ggagcgagac 601 ccgtcgccgc gctctccccc ctcccggcgc ccacccccgc ggggatatcc tccgcgaggg 661 gggtctcccc cgcgggggcg cgccggcgtc tcctcgtggg ggggccgggc cacccctccc 721 acggcgcgac cgctctccca cccctcctcc ccgcaacccc cctctcccgg cgacggggag 781 ggccgcgcgc gggtcggggg gcggggcgga ctgtccccag tgcgccccgg gcgggtcgcg 841 ccgtcgggcc cgggggaggt tctctcgggg ccacgcgcgc gtcccccgaa gagggggacg 901 gcggagccga gcgcacgggg tcggcggcga tgtcggccac ccacccgacc cgtcttgaaa 961 cacggaccaa ggagtctaac acgtgcgcga gtcgggggct cgcacgaaag ccgccgtggc 1021 gcaatgaagg tgaaggccgg cgcgctcgcc ggccgaggtg ggatccgagg cctctccagt 1081 
ccgccgaggg cgcaccaccg gcccgtctcg cccgccgcgc cggggaggtg gagcacgagc 1141 gcacgtgtta ggacccgaaa gatggtgaac tatgcctggg cagggcaagc cagaggaaac 1201 tctggtggag gtccgtagcg gtcctgacgt gcaaatcggt cgtccgacct gggtataggg 1261 gcgaaagact aatcgaacca tctagtagct ggttccctcc gaagtttccc tcaggatagc 1321 tggcgctctc gcagacccga cgcacacccc cccacgcagt tttatccggt aaagcgaatg 1381 attagaggtc ttggggccga aacgatctca acctattctc aaactttaaa tgggtaagaa 1441 gcccggctcg ctggcgtgga gccggggtgg aatgcgagtg cctagtgggc cacttttggt 1501 aagcagaact ggcgctgcgg gatgaaccga acgccgggtt aaggcgcccg atgccgacgc 1561 tcatcagacc ccagaaaagg tgttggttga tatagacagc aggacggtgg ccatggaagt 1621 cggaatccgc taaggagtgt gtaacaactc acctgccgaa tcaactagcc ctgaaaatgg 1681 atggcgctgg agcgtcgggc ccatacccgg ccgtcgccgg cagtcgagag tggacgggag 1741 cggcgggggc ggcgcgggcg tgtgcgcgcg cgcgtgtgtg cgtgtgtgtc ggagggcggc 1801 ggcggtggcg gcgggggtgg ggtcctcccc ctcccccacg ccgcctcccc tcctcccacc 1861 caccaccgcc gccgccaccc ccgctccccg cccccggagc cccgcggacg ctacgccgcg 1921 acgagtagga gggccgctgc ggtgagcctt gaagcctagg gcgcgggccc gggtggagcc 1981 gccgcaggtg cagatcttgg tggtagtagc aaatattcaa acgagaactt tgaaggccga 2041 agtggagaag ggttccatgt gaacagcagt tgaacatggg tcagtcggtc ctgagagatg 2101 ggcgagcgcc gttccgaagg gacgggcgat ggcctccgtt gccctcggcc gatcgaaagg 2161 gagtcgggtt cagatccccg aatccggagt ggcggagatg ggcgccgcga ggcgtccagt 2221 gcggtaacgc gaccgatccc ggagaagccg gcgggagccc cggggagagt tctcttttct 2281 ttgtgaaggg cagggcgccc tggaatgggt tcgccccgag agaggggccc gtgccttgga 2341 aagcgtcgcg gttccggcgg cgtccggtga gctctcgctg gcccttgaaa atccggggga 2401 gagggtgtaa atctcgcgcc gggccgtacc catatccgca gcaggtctcc aaggtgaaca 2461 gcctctggca tgttggaaca atgtaggtaa gggaagtcgg caagccggat cc // LOCUS GORRGIT 987 bp ds-DNA PRI 03-JUL-1990 DEFINITION Gorilla rRNA gene internal transcribed spacer 1 (ITS1). ACCESSION M30948 KEYWORDS internal transcribed spacer. SOURCE Gorilla DNA. 
ORGANISM Gorilla gorilla Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Pongidae. REFERENCE 1 (bases 1 to 987) AUTHORS Gonzalez,I.L., Sylvester,J.E., Smith,T.F., Stambolian,D. and Schmickel,R.D. TITLE Ribosomal RNA gene sequences and hominoid phylogeny JOURNAL Mol. Biol. Evol. 7, 203-219 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by I.L.Gonzalez, 20-DEC-1989. BASE COUNT 65 a 398 c 390 g 134 t ORIGIN Chromosomes 22 and 23. 1 acggagcgaa gggcgaggcc gcggcggtgg cgccgccgcg tgcttccctc ccccccaccg 61 acgcggcgcg tgcgcgggcg gggcccgtgc cgttcgttcg ttcgttcgtt cgctgcccgg 121 ccccgccgcc gcgagagccg aggactcggg agggagacgg ggggggagaa gagaaaggag 181 gcctgtccgt gtgtgcgtgt cgtggggccg gccgcgctgg tgagcggcgg cgaggcctcc 241 ccggccgcgg cccgacgacg tgtgtgtcgg cgggtgcggg ggcggttctc ggcggcgtca 301 cggcgggttt ggggcctcgg tgccctcctc cccgccgggg cccgtcgtcc ggccccgccg 361 ccggcccccc cgtcgtcggg gccggccggg ttcccgtcgc cgccgccgcc gccgccgtcg 421 tcgcctccgc cgcgccaccg ggaccggccc cgctcgctct ccccggcctt cccgctaggg 481 cgtctcgagg gtcgggggcc ggacgccggt ccccccctcc tcgtccgccc ctccccgccg 541 ttccaggtac ctagcgcgtt ccggcgcgga ggtttaaaga ccccttgggg gatcgcccgt 601 ccgccccgtg ggtcgggggc ggtgggcccg cgggggggtc ccgtcgggag gggcccggcc 661 cctcccgcgc ctccaccgcg gactccgccc cccggccggg gccgcggcgg ccgtcgggtg 721 ggggctttac ccggcggccg tgcgcccccg cgccgtgggg gcgggaaccc ccgggcgcct 781 gtggggcgtg tcagcgctcg cccccgcgtg ggcgccgcgc ctccccgtgg tgtgaaacct 841 tccgacccct ctccggagtc cggtcccgtt tgctgtccgt ctggccggcc tgaggcaacc 901 ccccctcctc cgtggggggg gggggacgtg ccgcgccagg agggccctcc cggtgtcggg 961 agcgccctcg ccaaatcgac ctcgtta // LOCUS GORRGITX 2467 bp ds-DNA PRI 03-JUL-1990 DEFINITION Gorilla 28S ribosomal RNA gene fragment. ACCESSION M30951 KEYWORDS 28S ribosomal RNA. SOURCE Gorilla DNA. 
ORGANISM Gorilla gorilla Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Pongidae. REFERENCE 1 (bases 193 to 261; 375 to 944; 1041 to 1079; 1334 to 1354; 1737 to 1913; 2145 to 2159) AUTHORS Gonzalez,I.L., Sylvester,J.E., Smith,T.F., Stambolian,D. and Schmickel,R.D. TITLE Ribosomal RNA gene sequences and hominoid phylogeny JOURNAL Mol. Biol. Evol. 7, 203-219 (1990) STANDARD full staff_review REFERENCE 2 (bases 1 to 2467) AUTHORS Gonzalez,I.L., Sylvester,J.E., Smith,T.F., Stambolian,D. and Schmickel,R.D. JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1],[2] kindly submitted by I.L.Gonzalez, 20-DEC-1989. FEATURES from to/span description rRNA < 1 > 2467 28S ribosomal RNA BASE COUNT 386 a 819 c 912 g 350 t ORIGIN Chromosomes 22 and 23. 1 gtcaacaagt accgtaaggg aaagttgaaa agaactttga agagagagtt caagagggcg 61 tgaaaccgtt aagaggtaaa cgggtggggt ccgcgcagtc cgcccggagg attcaacccg 121 gcggcgggtc cggccgtgtc ggcggcccgg cggatctttc ccgccccccg ttcctcccga 181 cccctccacc cgccctccct tcccccgccg cccctcctcc tcctccccgg agggggcggg 241 ctccggcggg tgcggggggt gggcgggcgg ggccgggggt ggggtcggcg ggggaccgtc 301 ccccgaccgg cgaccggccg ccgccgggcg catttccacc gcggcggtgc gccgcgaccg 361 gctccgggac ggctgggaag gcccggcggg gaaggtggct cggggggccc cgtccgtccg 421 tccgtccgtc ctcctccccc gtctccgccc cccggccccg cgtcctccct cgggaagggg 481 cgcgcgggtc ggggcggcgg cggcggcggt ggcggcggcg gcggcggcgg cgggaccgaa 541 acccccccga gtgttacagc cccccggcca gccagccatc gccgaatccc ggggccgagg 601 gagcgagacc cgtcgccgcg ctctcccccc tcccggcgcc cacccccgcg ggggtccccc 661 gcgagggggt cccccgcggg ggcgcgccgg cgtctcctcg tgggggggcc gggccacccc 721 tcccacggcg cgaccgctct cccacccctc gcttccccgc acactccccc ggcgacgggg 781 tgccgcgcgc gggtcggggg gcggggcgga ctgtccccag tgcgccccgg gcgggtcgcg 841 ccgtcgggcc cgggggaggt tctcccgggg ccacgcgcgc gtcccccgaa gagggggacg 901 gcggagcgag cgcacggggt cggcggcgat gtcggctacc cacccgaccc gtcttgaaac 961 
acggaccaag gagtctaaca cgtgcgcgag tcgggggctc gcacgaaagc cgccgtggcg 1021 caatgaaggt gaaggccggc gcgctcgccg gccgaggtgg gatcccgagg cctctccggt 1081 ccgccgaggg cgcaccaccg gcccgtctcg cccgccgcgc cggggaggtg gagcacgagc 1141 gcacgtgtta ggacccgaaa gatggtgaac tatgcctggg cagggcaagc cagaggaaac 1201 tctggtggag gtccgtagcg gtcctgacgt gcaaatcggt cgtccgacct gggtataggg 1261 gcgaaagact aatcgaacca tctagtagct ggttccctcc gaagtttccc tcaggatagc 1321 tggcgctctc gcagacccct cctccccccc acgcagtttt atccggtaaa gcgaatgatt 1381 agaggtcttg gggccgaaac gatctcaacc tattctcaaa ctttaaatgg gtaagaagcc 1441 cggctcgctg gcgtggagcc gggtggaatg cgagtgcctg tgggccactt ttggtaagca 1501 gaactggcgc tgcgggatga accgaacgcc gggttaaggc gcccgatgcc gacgctcatc 1561 agaccccaga aaaggtgttg gttgatatag acagcaggac ggtggccatg gaagtcggaa 1621 tccgctaagg agtgtgtaac aactcacctg ccgaatcaac tagccctgaa aatggatggc 1681 gctggagcgt cgggcccata cccggccgtc gccggcagtc gagagtggac gggagcggcg 1741 ggggcggcgc gcgcgcgcgc gtgtggggtc ggagggcggc gtgtgggcgg tggggtcctc 1801 gcccccctcc cccgcgcctc ccctcctccc acccccgctc cccgcccccg ggagccccgc 1861 ggacgctacg ccgcgacgag taggagggcc gctgcggtga gccttgaagc ctagggcgcg 1921 ggcccgggtg gagccgccgc aggtgcagat cttggtggta gtagcaaata ttcaaacgag 1981 aactttgaag gccgaagtgg agaagggttc catgtgaaca gcagttgaac atgggtcagt 2041 cggtcctgag agatgggcga gcgccgttcc gaagggacgg gcgatggcct ccgttgccct 2101 cggccgatcg aaagggagtc gggttcagat ccccgaatcc ggagtggcgg agatgggcgc 2161 cgcgaggcgt ccagtgcggt aacgcgaccg atcccggaga agccggcggg agccccgggg 2221 agagttctct tttctttgtg aagggcaggg cgccctggaa tgggttcgcc ccgagagagg 2281 ggcccgtgcc ttggaaagcg tcgcggttcc ggcggcgtcc ggtgagctct cgctggccct 2341 tgaaaatccg ggggagaggg tgtaaatctc gcgccgggcc gtacccatat ccgcagcagg 2401 tctccaaggt gaacagcctc tggcatgttg gaacaatgta ggtaagggaa gtcggcaagc 2461 cggatcc // LOCUS ORARGIT 1070 bp ds-DNA PRI 03-JUL-1990 DEFINITION Orangutan rRNA gene internal transcribed spacer 1 (ITS1). ACCESSION M30949 KEYWORDS internal transcribed spacer. SOURCE Orangutan DNA. 
ORGANISM Pongo pygmaeus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Pongidae. REFERENCE 1 (bases 1 to 1070) AUTHORS Gonzalez,I.L., Sylvester,J.E., Smith,T.F., Stambolian,D. and Schmickel,R.D. TITLE Ribosomal RNA gene sequences and hominoid phylogeny JOURNAL Mol. Biol. Evol. 7, 203-219 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by I.L.Gonzalez, 20-DEC-1989. BASE COUNT 60 a 438 c 438 g 134 t ORIGIN Chromosomes 11, 12, 13, 14, 15, 16, 17, 22, 23. 1 acggagcgaa gagcgaggcc cgcggcggcg ccgccgcggc gtccttcctc gtcggccggc 61 cggccgcgtt tctcccccgc ttcccgcggc gcgtgcgcgg gcggggcccg tgccgttcgc 121 gcgcacgcgc gggcgtgcgt gcgtgcgtcg cccggccccg ccggccgcga gagccggaga 181 acctcgggag ggagagagag gggggagaga gagagcggtg tgtgtgtgcg cgcgcgcgtg 241 tctcgggggc ggccggcgcg gcggggagcg gtccccggcc gcggccccga cgtgtgtgtc 301 ggcgggcgcg ggtgcggtcc tcggcggcgt cgcggcgggg tggggggtgt ctcggtgccc 361 ctccccgccg gggcccgtcg tcccgtcccc gacccgccgg ctccgcgtcg ggggccggcc 421 gggttcccgc cgcccccgtc gcctccgcca cgccgcgcca ccgggccggg ccggcccggc 481 ccgccccgct cgctctcccc ggccttcccg ctagggcgtc tcgagggtcg ggggccggac 541 gccggtcccc gcgcctcctc gtccgccccc ccctcccccc gccgtccagg tacctagcgc 601 gttccggcgc ggaggtttaa agaccccttg ggggatcgcc cgtccgcccg tgggtcgggg 661 gcggtgggcc cgcgtgggga gtcccgtcgg gaggggcccg gcccctcccg cgcctccacc 721 gcggactccg cccccccggc cggggcgctg ccgccgccgc cgcggtcgcg gcggccgtcg 781 ggtgggggct ttacccggcg gccgtcgtgc cgtccgtcgc gcgcgtgccc cgcgccgtgg 841 gggcgggaac cccccgggcg cctgtggggt ggtgtccgcg ctcgcccccg cgtgggcggc 901 gcgcgcctcc ccgtggtgtg cgacaccttc cgacccctct ccggagtccg gtcccgtttg 961 ccgtctgact ggccggcctg aggcgacccc cccctgcggg ggggaagtgc cgcgccaggg 1021 gcgagggcct cccggtgtgt cgggggcgcc ctcgcccgat cgagctcgta // LOCUS ORARGITX 2487 bp ds-DNA PRI 03-JUL-1990 DEFINITION Orangutan 28S ribosomal RNA gene fragment. ACCESSION M30952 KEYWORDS 28S ribosomal RNA. 
SOURCE Orangutan DNA. ORGANISM Pongo pygmaeus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Pongidae. REFERENCE 1 (bases 193 to 263; 377 to 985; 1084 to 1120; 1376 to 1394; 1780 to 1933; 2165 to 2179) AUTHORS Gonzalez,I.L., Sylvester,J.E., Smith,T.F., Stambolian,D. and Schmickel,R.D. TITLE Ribosomal RNA gene sequences and hominoid phylogeny JOURNAL Mol. Biol. Evol. 7, 203-219 (1990) STANDARD full staff_review REFERENCE 2 (bases 1 to 2487) AUTHORS Gonzalez,I.L., Sylvester,J.E., Smith,T.F., Stambolian,D. and Schmickel,R.D. JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1],[2] kindly submitted by I.L.Gonzalez, 20-DEC-1989. FEATURES from to/span description rRNA < 1 > 2487 28S ribosomal RNA BASE COUNT 393 a 814 c 921 g 359 t ORIGIN Chromosomes 11, 12, 13, 14, 15, 16, 17, 22, 23. 1 gtcaacaagt accgtaaggg aaagttgaaa agaactttga agagagagtt caagagggcg 61 tgaaaccgtt aagaggtaaa cgggtggggt ccgcgcagtc cgcccggagg attcaacccg 121 gcggcgggtc cggccgtgtc ggcggcccgg cggatctttc ccgccccccg ttcctcccga 181 cccctccacc cgccctccct cccccgccgc ccctcctcct cctccccgcg gggagggggc 241 gggctccggc gggtgcgggg gtgggcgggc ggggccgggg gtggggtcgg cgggggaccg 301 tcccccgacc ggcgaccggc cgccgccggg cgcatttcca ccgcggcggt gcgccgcgac 361 cggctccggg acggctggga aggcccggtg gggaaggtgg ctcggggggc cccgtccgtc 421 cgtccgtccg tcctcctccc tcctcccccc tcgtcttccc cccggccccg cgtcctccct 481 cgggagggcg cgcgggtcgg gggcggcggc gggggtggct gctgctgctg ctgcggcggc 541 ggcgggaccg aaccccccga gtgttacagc cccggcagca gcgctcgccg aacccggggc 601 cgagggagcg agacccgtcg ccgcgctctc ccccctcccg gcgcccaccc ccgcgggggt 661 cccccgcgag ggggtccccc ccgcgggggc gcgccggcgt ctcctcgcgt ggggggccgg 721 gccgcccctc ccacggcgcg accgctctcc cacccccccc ttccccgcgc acccccggcg 781 acgggggccc gcgcgggcgg ggggggcggg gcggactgtc cccagtgcgc cccgggcggg 841 tcgcgccgtc gggcccgggg aagagagagg gagaggaggg ggttctcctc ctcctcctcc 901 cctctcgggg ccacgcgcgc 
gtccctcgaa gagggggacg gcggagccga gcgcacgggg 961 tcggcggcga tgtcggccac ccacccgacc cgtcttgaaa cacggaccaa ggagtctaac 1021 acgtgcgcga gtcgggggct cgcacgaaag ccgccgtggc gcaatgaagg tgaaggccgg 1081 cgcgctcgcc ggccgaggtg ggatcccgag gcctctccag tccgccgagg gcgcaccacc 1141 ggcccgtctc gcccgccgcg ccggggaggt ggagcacgag cgcacgtgtt aggacccgaa 1201 agatggtgaa ctatgcctgg gcagggcgaa gccagaggaa actctggtgg aggtccgtag 1261 cggtcctgac gtgcaaatcg gtcgtccgac ctgggtatag gggcgaaaga ctaatcgaac 1321 catctagtag ctggttccct ccgaagtttc cctcaggata gctggcgctc tcgcagactc 1381 gaccgaccga ccgcagtttt atccggtaaa gcgaatgatt agaggtcttg gggccgaaac 1441 gatctcaacc tattctcaaa ctttaaatgg gtaagaagcc cggctcgctg gcgtggagcc 1501 gggcgtggaa tgcgagtgcc tagtgggcca cttttggtaa gcagaactgg cgctgcggga 1561 tgaaccgaac gccgggttaa ggcgcccgat gccgacgctc atcagacccc agaaaaggtg 1621 ttggttgata tagacagcag gacggtggcc atggaagtcg gaatccgcta aggagtgtgt 1681 aacaactcac ctgccgaatc aactagccct gaaaatggat ggcgctggag cgtcgggccc 1741 atacccggcc gtcgccggca gtcgagagtg gacgggagcg gcgggggcgg ggtgcgtgcg 1801 ggtgtggggg tgtgtgtggg ggggggtcct ccccccccgc cactcctcct cctcccaccc 1861 ctcccccgga gcagccccgc ggacgctacg ccgcgacgag taggagggcc gctgcggtga 1921 gccttgaagc ccagggcgcg ggcccgggtg gagccgccgc aggtgcagat cttggtggta 1981 gtagcaaata ttcaaacgag aactttgaag gccgaagtgg agaagggttc catgtgaaca 2041 gcagttgaac atgggtcagt cggtcctgag agatgggcga gcgccgttcc gaagggacgg 2101 gcgatggcct ccgttgccct cggccgatcg aaagggagtc gggttcagat ccccgaatcc 2161 ggagtggcgg agacgggcgc cgcgaggcgt ccagtgcggt aacgcgaccg atcccggaga 2221 agccggcggg agccccgggg agagttctct tttctttgtg aagggcaggg cgccctggaa 2281 tgggttcgcc ccgagagagg ggcccgtgcc ttggaaagcg tcgcggttcc ggcggcgtcc 2341 ggtgagctct cgctggccct tgaaaatccg ggggagaggg tgtaaatctc gcgccgggcc 2401 gtacccatat ccgcagcagg tctccaaggt gaacagcctc tggcatgttg gaacaatgta 2461 ggtaagggaa gtcggcaagc cggatcc // LOCUS ACCRRSAA 1536 bp ss-rRNA RNA 03-JUL-1990 DEFINITION A.calcoaceticus 16S ribosomal RNA. ACCESSION M34139 KEYWORDS 16S ribosomal RNA. 
SOURCE A.calcoaceticus (strain 33604) ribosomal RNA. ORGANISM Acinetobacter calcoaceticus Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Neisseriaceae. REFERENCE 1 (bases 1 to 1536) AUTHORS Woese,C.R. JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable [or printed] sequence for [1] kindly submitted by C.R.Woese, 09-MAY-1990. Author address: C.R.Woese University of Illinois Dept. Microbiology 131 Burrill Hall 407 S. Goodwin Ave. Urbana, IL 61801 (217) 333-9369 FEATURES from to/span description rRNA 1 1536 16S ribosomal RNA BASE COUNT 387 a 329 c 460 g 330 t 30 others ORIGIN 1 ttaactgaag agtttgatca tggctcagat tgaacgctgg cggcaggctt aacacatgca 61 agtcgagcgg ggaaggttgc ttcggtaact gactagcggc ggacgggtga gtaatgctta 121 ggaatctgcc atttagtggg ggacaacatt ccgaanggaa tgctaatacc gcatacgtcc 181 tacaggagaa agcaggggat ctccggacct tgcgctaaat gatgagccta agtcggatta 241 gctagttggt ggggtaaagg cctaccaagg cgacgatctg tagcgggtct gagaggatga 301 tccgccacac tgggactgag acacggccca gactcctacg ggaggcagca gtggggaata 361 ttggacaatg ggcgcaagcc ngatccagcc atgccgcgtg tgtgaagaag gccttttggt 421 tgtaaagcac tttaagcgag gaggaggctc tcttagttaa tacctaagat gagtggacgt 481 tactcgcaga ataagcaccg gctaactctg tgccagcagc cgcggtaata cagagngtgc 541 gagcgttaat cggatttact gggcgtaaag cgtgcgtagg cggcttttta agtcggatgt 601 gaaatccccg agcttaactt gggaattgca ttcgatactg ggaagctaga gtatgggaga 661 ggatggtaga attccaggtg tagcggtgaa atgcgtagag atctggagga ataccgatgg 721 cgaaggcagc catctggcct aatactgacg ctgaggtacg naagcatggg gagcaaacag 781 gattagatac cctggtagtc catgccgtaa acgatgtcta ctagccgttg gggcctttga 841 ggctttagtg gcgcagctaa cgcgataagt agactgcctg gggagtacgg tcgcaagact 901 aaaactcaaa tgaattgacg ggggcncgca caagcggtgg agcatgtggt ttaattcgat 961 gcaacgcgaa gaaccttacc tggccttgac atactagaaa ctttccagag atggattggt 1021 gccttcggga atctagatac aggtgctgca tggctgtcgt cagctcgtgt cgtgagatgt 1081 tgggttaagt cccgcaacga gcgcaaccct tttccttact tgccagcatt tcggatggga 1141 actttaagga tactgccagt gacaaactgg 
aggaaggcgg ggacgacgtc aagtcatcat 1201 ggcccttacg gctagggcta cacacgtgct acaatggtcg gtacaaaggg ttgctaccta 1261 gcgataggat gctaatctca aaaagccgat cgtagttcgg attggagtct gcaactcgac 1321 tccatgaagt cggaatcgct agtaatcgcg gatcagaatg ccgcggtgaa tacgttcccg 1381 ggccttgtac acaccgcccg tcacaccatg ggagtttgtt gcaccagaag tagctagcct 1441 aactgcaaag agggcggtta ccacggtgtg gccgatgact agggnnnnnn ngtaacaagn 1501 nnnnnnnnnn ngaacctgnn nnnngatcac ctcctt // LOCUS BDERRSAA 1553 bp ss-rRNA RNA 03-JUL-1990 DEFINITION B.stolpii 16S ribosomal RNA. ACCESSION M34125 KEYWORDS 16S ribosomal RNA. SOURCE B.stolpii (strain uki-2) ribosomal RNA. ORGANISM Bdellovibrio stolpii Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Aerobic/microaerophilic, motile, helical/vibrioid bacteria. REFERENCE 1 (bases 1 to 1553) AUTHORS Woese,C.R. JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable [or printed] sequence for [1] kindly submitted by C.R.Woese, 09-MAY-1990. Author address: C.R.Woese University of Illinois Dept. Microbiology 131 Burrill Hall 407 S. Goodwin Ave. 
Urbana, IL 61801 (217) 333-9369 FEATURES from to/span description rRNA 1 1553 16S ribosomal RNA BASE COUNT 381 a 320 c 429 g 316 t 107 others ORIGIN 1 agcatnnaga gtttgatcct ggctcagaac gaacgctggc ggcgtgccta acacatgcaa 61 gtcgaacgtg aaagtccttc gggatgagta aagtggcgca cgggtgagta acacgtaggt 121 gacctgcctt ttagagggga ataaccagaa gaaattttgg ctaatgccgc atacgaagca 181 cggttttaag actgtgcttg aaagaatgcc tctgcatatg ngcattcgct attagatggg 241 cctgcgggac attagctagt tggtggggta aaggcctacc aaggcgacga tgtctatccg 301 gtctgagagg atgatcggac acactggaac tgagacacgg tccagactcc tacgggaggc 361 agcagtgggg aatattgcgc aatgggggaa accctgacgc agcaacgccg cgtgagtgag 421 gaaggacttc ggtctgtaaa gctctgttaa tgtggaaaaa tggcagttgg tctaataggc 481 cnattgtttg atggtacaca tagaggaagc accggctaac ttcgtgccag cagccgcggt 541 aatacgaagg gtgcnagcgt tgttcggatt tattgggcgt aaagcgcgcg taggcggacc 601 tgcaagtcag atgtgaaatc tcggggctca acctcgaaac tgcgtctgaa actacaggtc 661 tagaatctcg gagggggaag gggaatatcg catgtagggg taaaatccgt agatatgcga 721 tggaacacca gaggcgaagg cgccttcctg gacgagtatt gacgctgagg cncnnaagcg 781 tggggatcaa acaggattag ataccctggt agtccacgct gtaaacgatg aacactagat 841 attggaggat ttgacccctt cagtgtcgta gctaacgcgt caagtgttcc gcctgggaag 901 tacggtcgca agactaaaac tcaaaggaat tgnnnnnnnn nngcacaagn nnnngattat 961 gnngtttaat tcgnngcaac gcgcagaacc ttacctaggc ttgaaatcct acgaatccct 1021 tttaaacgag ggagtgctct tcggagaatg tagtgacagg cgctgcatgg ctgtcgtcag 1081 ctcgtgtcgt gagatgttgg gttaagtctc gcaacgagcg caacccccat ttttagttgc 1141 cagcattaag ttgggcactc tagaaagact gcntgggcta accaggagga aggtggggat 1201 gacgtcaagt cctcatggcc cttatgtcta gggctacaca cgtaatacaa tggtcggtac 1261 aaagggatgc gaactcgcga gggggagcca atctcaaaaa accgatctca gtccggattg 1321 gagtctgcaa ctcgactcca tgaagttgga atcgcgagta atcgcggatc agcacgccgc 1381 ggtgaatacg ttcccgggcc ttgtacacac cgcccgtcac accatgggag ttgtttttac 1441 ctgaagnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 1501 nnnnnnngta acaagnnnnn nnnnnnngaa cctgnnnnnn gatcacctcc ttt // LOCUS PLTRRSAA 1525 bp 
ss-rRNA RNA 03-JUL-1990 DEFINITION P.staleyi 16S ribosomal RNA. ACCESSION M34126 KEYWORDS 16S ribosomal RNA. SOURCE P.staleyi (strain ATCC 27377) ribosomal RNA. ORGANISM Planctomyces staleyi Prokaryota; Bacteria; Eubacteriomycetes; Eubacteriales. REFERENCE 1 (bases 1 to 1525) AUTHORS Woese,C.R. and Oyalzu,H. JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by C.R.Woese, 15-MAY-1990. Author address: C.R.Woese University of Illinois Dept Microbiology 131 Burrill Hall 407 S. Goodwin Ave. Urbana, IL 61801 (217) 333-9369 FEATURES from to/span description rRNA 1 1525 16S ribosomal RNA BASE COUNT 376 a 348 c 482 g 315 t 4 others ORIGIN 1 caattgaaga gtttgatcct ggctcagaat gaacgttggc ggcatggatt aggcatgcaa 61 gtcgtgcgcg atatgtagca atacatggag agcggcgaaa gggagagtaa tacgtaggaa 121 cctaccttcg ggtctgggat agcggcggga aactgccggt aataccagat gatgtttccg 181 aaccaaaggt gtgattccgc ctgaagaggg gcctacgtcg tattagctag ttggtagggt 241 aatggcctac caaggcaaag atgcgtatgg ggtgtgagag catgccccca ctcactggga 301 ctgagacact gcccagacac ctacgggtgg ctgcagtcga gaatcttcgg caatgggcga 361 aagcctgacc gagcgatgcc gcgtgcggga tgaaggcctt cgggttgtaa accgctgtcg 421 taggggatga agtgctaggg ggttctccct ctagtttgag ctgaacctag gaggaagggc 481 cggctaatct cgtgccanna gccgcggtaa tacgagaggc ccaaacgtta ttcggattta 541 ctgggcttaa agagttcgta ggcggtcttg taagtggggt gtgaaatccc tcggctcaac 601 cgaggaactg cgctccaaac tacaagactt gagggggata gaggtaagcg gaactgatgg 661 tggagcggtg aaatgcgttg atatcatcag gaacaccgga ggcgaaggcg gcttactggg 721 tcctttctga cgctgaggaa cgaaagctag gggagcaaac gggattagat accccggtag 781 tcctagccgt aaacgatgag cactggaccg gagctctgca cagggtttcg gtcgtagcga 841 aagtgttaag tgctccgcct ggggagtatg gtcgcaaggc tgaaactcaa aggaattgac 901 gggggctcac acaagcggtg gaggatgtgg cttaattcga ggctacgcga agaaccttat 961 cctagtcttg acatgcttag gaatcttcct gaaagggagg agtgctcgca agagagcctt 1021 tgcacaggtg ctgcatggct gtcgtcagct cgtgtcgtga gatgtcgggt taagtccctt 1081 aacgagcgaa 
acccttgtcc ttagttacca gcgcgtcatg gcggggactc taaggagact 1141 gccggtgtta aaccggagga aggtggggat gacgtcaagt cctcatggcc tttatgatta 1201 gggctgcaca cgtcctacaa tggtgcacac aaagcgacgc aaactcgtga gagccagcta 1261 atcgcaaaaa atgtacctca gttcggattg caggctgcaa ctcgcctgca tgaagctgga 1321 atcgctagta atcgcgggtc agcataccgc ggtgaatntg ttcctgagcc ttgtacacac 1381 cgcccntcaa gccacgaaag tgggggggac ccaacagcgc tgccgtaacc gcaaggaaca 1441 aggcgcctaa ggtcaactcc gtgattggga ctaagtcgta acaaggtagc cgtaggggaa 1501 cctgcggctg gatcacctcc tttct // LOCUS RDCRRSAA 1478 bp ss-rRNA BCT 03-JUL-1990 DEFINITION R.purpureus 16S ribosomal RNA. ACCESSION M34132 KEYWORDS 16S ribosomal RNA. SOURCE R.purpureus (strain 6770) ribosomal RNA. ORGANISM Rhodocyclus purpureus Prokaryota; Bacteria; Gracilicutes; Anoxyphotobacteria; Purple nonsulfur bacteria. REFERENCE 1 (bases 1 to 1478) AUTHORS Woese,C.R. JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable [or printed] sequence for [1] kindly submitted by C.R.Woese, 09-MAY-1990. Author address: C.R.Woese University of Illinois Dept. Microbiology 131 Burrill Hall 407 S. Goodwin Ave. 
Urbana, IL 61801 (217) 333-9369 FEATURES from to/span description rRNA 1 1478 16S ribosomal RNA BASE COUNT 367 a 345 c 470 g 289 t 7 others ORIGIN 1 tgaactgaag agtttgatcc tggctcagat tgaacgctgg cggcatgcct tacacatgca 61 agtcgaacgg taacgggncc ttcgggcgcc gaacgagtgg cgaacgggtg agtaatgcat 121 cggaacatgc cctgaagtgg gggataacgt agcgaaagtt acgctaatac cgcatattct 181 gtgagcagga aagcagggga ccttcgggcc ttgcgctttg ggagtggccg atgtcggatt 241 agctagttgg tggggtaaaa gcctaccaag gcaacgatcc gtagcgggtc tgagaggatg 301 atccgccaca ctgggactga gacacggccc agactcctac gggaggcagc agtggggaat 361 tttggacaat gggcgaaagc ctgatccagc catgccgcgt gagtgaagaa ggccttcggg 421 ttgtaaagct ctttcggcgg ggaagaaatc gggtttccta atacggaacc cggatgacgg 481 tacccgaaga agaagcaccg gctaactacg tgccagcagc cgcggtaata cgtagggtgc 541 nagcgttaat cggaattact gggcgtaaag cgtgcgcagg cggttgtgta agacagacgt 601 gaaatccccg ggctcaacct gggaactgcg tttgtgactg cacagctaga gtacggcaga 661 ggggggtgga attccacgtg tagcagtgaa atgcgtagag atgtggagga acaccgatgg 721 cgaaggcagc cccctgggcc aatactgacg ctcatgcacg naagcgtggg gagcaaacag 781 gattagatac cctggtagtc cacgccctaa acgatgtcaa ctaggtgttg gtggggttaa 841 acccattagt gccgtagcta acgcgtgaag ttgaccgcct ggggagtacg gcggcaaggt 901 taaaactcaa aggaattgac gggganccgc acaagcggtg gatgatgtgg attaattcga 961 tgcaacgcga aaaaccttac ctacccttga catgtcagga atcctgagga gactcgggag 1021 tgcccgaaag ggnacctgaa cacaggtgct gcatggcngt cgtcagctcg tgtcgtgaga 1081 tgttgggtta agtcccgcaa cgagcgcaac ccttgtcatt aattgccatc attcagttgg 1141 gcactttaat gaaactgccg gtgacaaacc ggaggaaggt ggggatgacg tcaagtcctc 1201 atggccctta tgggtagggc ttcacacgtc atacaatggt cggtccatag ggttgcnaac 1261 ccgcgagggg gagctaatcc cagaaagccg atcgtagtcc ggattgcagt ctgcaactcg 1321 actgcatgaa gtcggaatcg ctagtaatcg cggatcagca tgtcgcggtg aatacgttcc 1381 cgggtcttgt acacaccgcc cgtcacacca tgggagcggg ttctgccaga agtagttagc 1441 ctaaccgcaa ggagggcgat taccacggca gcgttcgt // LOCUS HUMFGF2H 3365 bp ss-mRNA PRI 03-JUL-1990 DEFINITION Human fibroblast growth factor receptor (FGFr) transmembrane 
form mRNA, complete cds. ACCESSION M34185 KEYWORDS FGF receptor; fibroblast growth factor receptor; transmembrane tyrosine kinase. SOURCE Human umbilical vein endothelial cell line HUVEC, cDNA to mRNA, clone h2. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 3365) AUTHORS Johnson,D.E., Lee,P.L., Lu,J. and Williams,L.T. TITLE Diverse forms of a receptor for acidic and basic fibroblast growth factors JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable [or printed] sequence for [1] kindly submitted by D.E.Johnson, 10-MAY-1990. Author address: D.E.Johnson University of California San Francisco 4th and Parnassus Howard Hughes Medical Institute San Francisco, CA 94143 (415) 476-4297 FEATURES from to/span description pept 256 2457 fibroblast growth factor receptor (FGFr) transmembrane form BASE COUNT 786 a 962 c 917 g 700 t ORIGIN 1 gcaccgagcg ccgccgggag tcgagcgccg gccgcggagc tcttgcgacc ccgccaggac 61 ccgaacagag cccgggggcg gcgggccgga gccggggacg cgggcacacg cccgctcgca 121 caagccacgg cggactctcc cgaggcggaa cctccacgcc gagcgagggt cagtttgaaa 181 aggaggatcg agctcactgt ggagtatcca tggagatgtg gagccttgtc accaacctct 241 aactgcagaa ctgggatgtg gagctggaag tgcctcctct tctgggctgt gctggtcaca 301 gccacactct gcaccgctag gccgtccccg accttgcctg aacaagatgc tctcccctcc 361 tcggaggatg atgatgatga tgatgactcc tcttcagagg agaaagaaac agataacacc 421 aaaccaaacc gtatgcccgt agctccatat tggacatccc cagaaaagat ggaaaagaaa 481 ttgcatgcag tgccggctgc caagacagtg aagttcaaat gcccttccag tgggacccca 541 aaccccacac tgcgctggtt gaaaaatggc aaagaattca aacctgacca cagaattgga 601 ggctacaagg tccgttatgc cacctggagc atcataatgg actctgtggt gccctctgac 661 aagggcaact acacctgcat tgtggagaat gagtacggca gcatcaacca cacataccag 721 ctggatgtcg tggagcggtc ccctcaccgg cccatcctgc aagcagggtt gcccgccaac 781 aaaacagtgg ccctgggtag caacgtggag ttcatgtgta aggtgtacag tgacccgcag 841 ccgcacatcc 
agtggctaaa gcacatcgag gtgaatggga gcaagattgg cccagacaac 901 ctgccttatg tccagatctt gaagactgct ggagttaata ccaccgacaa agagatggag 961 gtgcttcact taagaaatgt ctcctttgag gacgcagggg agtatacgtg cttggcgggt 1021 aactctatcg gactctccca tcactctgca tggttgaccg ttctggaagc cctggaagag 1081 aggccggcag tgatgacctc gcccctgtac ctggagatca tcatctattg cacaggggcc 1141 ttcctcatct cctgcatggt ggggtcggtc atcgtctaca agatgaagag tggtaccaag 1201 aagagtgact tccacagcca gatggctgtg cacaagctgg ccaagagcat ccctctgcgc 1261 agacaggtaa cagtgtctgc tgactccagt gcatccatga actctggggt tcttctggtt 1321 cggccatcac ggctctcctc cagtgggact cccatgctag caggggtctc tgagtatgag 1381 cttcccgaag accctcgctg ggagctgcct cgggacagac tggtcttagg caaacccctg 1441 ggagagggct gctttgggca ggtggtgttg gcagaggcta tcgggctgga caaggacaaa 1501 cccaaccgtg tgaccaaagt ggctgtgaag atgttgaagt cggacgcaac agagaaagac 1561 ttgtcagacc tgatctcaga aatggagatg atgaagatga tcgggaagca taagaatatc 1621 atcaacctgc tgggggcctg cacgcaggat ggtcccttgt atgtcatcgt ggagtatgcc 1681 tccaagggca acctgcggga gtacctgcag gcccggaggc ccccagggct ggaatactgc 1741 tacaacccca gccacaaccc agaggagcag ctctcctcca aggacctggt gtcctgcgcc 1801 taccaggtgg cccgaggcat ggagtatctg gcctccaaga agtgcataca ccgagacctg 1861 gcagccagga atgtcctggt gacagaggac aatgtgatga agatagcaga ctttggcctc 1921 gcacgggaca ttcaccacat cgactactat aaaaagacaa ccaacggccg actgcctgtg 1981 aagtggatgg cacccgaggc attatttgac cggatctaca cccaccagag tgatgtgtgg 2041 tctttcgggg tgctcctgtg ggagatcttc actctgggcg gctccccata ccccggtgtg 2101 cctgtggagg aacttttcaa gctgctgaag gagggtcacc gcatggacaa gcccagtaac 2161 tgcaccaacg agctgtacat gatgatgcgg gactgctggc atgcagtgcc ctcacagaga 2221 cccaccttca agcagctggt ggaagacctg gaccgcatcg tggccttgac ctccaaccag 2281 gagtacctgg acctgtccat gcccctggac cagtactccc ccagctttcc cgacacccgg 2341 agctctacgt gctcctcagg ggaggattcc gtcttctctc atgagccgct gcccgaggag 2401 ccctgcctgc cccgacaccc agcccagctt gccaatggcg gactcaaacg ccgctgactg 2461 ccacccacac gccctcccca gactccaccg tcagctgtaa ccctcaccca cagcccctgc 2521 tgggcccacc acctgtccgt 
ccctgtcccc tttcctgctg gcaggagccg gctgcctacc 2581 aggggccttc ctgtgtggcc tgccttcacc ccactcagct cacctctccc tccacctcct 2641 ctccacctgc tggtgagagg tggcaaagag gcagatcttt gctgccagcc acttcatccc 2701 ctcccagatg ttggaccaac acccctccct gccaccaggc actgcctgga gggcagggag 2761 tgggagccaa tgaacaggca tgcaagtgag agcttcctga gctttctcct gtcggtttgg 2821 tctgttttgc cttcacccat aagcccctcg cactctggtg gcaggtgcct tgtcctcagg 2881 gctacagcag tagggaggtc agtgcttcgt gcctcgattg aaggtgacct ctgccccaga 2941 taggtggtgc cagtggctta ttaattccga tactagtttg ctttgctgac caaatgcctg 3001 gtaccagagg atggtgaggc gaaggccagg ttgggggcag tgttgtggcc ctggggccca 3061 gccccaaact gggggctctg tatatagcta tgaagaaaac acaaagtgta taaatctgag 3121 tatatattta catgtctttt taaaagggtc gttaccagag atttacccat cgggtaagat 3181 gctcctggtg gctgggaggc atcagttgct atatattaaa aacaaaaaag aaaaaaaagg 3241 aaaacgtttt taaaaaggtc atatattttt tgctactttt gctgttttat ttttttaaat 3301 tatgttctaa acctattttc agtttaggtc cctcaataaa aattgctgct gcttcaaaaa 3361 aaaaa // LOCUS HUMFGF3H 3503 bp ss-mRNA PRI 03-JUL-1990 DEFINITION Human fibroblast growth factor receptor (FGFr) transmembrane form mRNA, complete cds. ACCESSION M34186 KEYWORDS FGF receptor; fibroblast growth factor receptor; transmembrane tyrosine kinase. SOURCE Human umbilical vein endothelial cell line HUVEC, cDNA to mRNA, clone h3. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 3503) AUTHORS Johnson,D.E., Lee,P.L., Lu,J. and Williams,L.T. TITLE Diverse forms of a receptor for acidic and basic fibroblast growth factors JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable [or printed] sequence for [1] kindly submitted by D.E.Johnson, 10-MAY-1990. 
Author address: D.E.Johnson University of California San Francisco 4th and Parnassus Howard Hughes Medical Institute San Francisco, CA 94143 (415) 476-4297 FEATURES from to/span description pept 527 2722 fibroblast growth factor receptor (FGFr) transmembrane form BASE COUNT 777 a 1044 c 1005 g 677 t ORIGIN 1 gcggaaccca aggacttttc tccggtccga gctcggggcg ccccgcaccg ggacggtacc 61 cgtgctgcag tcgggcacgc cgcgggcccg ccgggggcct ccgcagggcg atggagccgg 121 tctgcaagga aagtgaggcg ccgccgctgc gttctggagg aggggggcac aaggtctgga 181 gaccccgggt ggcggacggg agccctcccc ccgccccgcc tccggggcac cagctccggc 241 tccattgttc ccgcccgggc tggaggcgcc gagcaccgag cgccgccggg agtcgagcgc 301 cggccgcgga gtcttgcgac cccgccagga cccgaacaga gcccgggggc ggcgggccgg 361 agccggggac gcgggcacac gcccgctcgc acaagccacg gcggactctc ccgaggcgga 421 acctccacgc cgagcgaggg tcagtttgaa aaggaggatc gagctcactg tggagtatcc 481 atggagatgt ggagccttgt caccaacctc taactgcaga actgggatgt ggagctggaa 541 gtgcctcctc ttctgggctg tgctggtcac agccacactc tgcaccgcta ggccgtcccc 601 gaccttgcct gaacaagatg ctctcccctc ctcggaggat gatgatgatg atgatgactc 661 ctcttcagag gagaaagaaa cagataacac caaaccaaac cccgtagctc catattggac 721 atccccagaa aagatggaaa agaaattgca tgcagtgccg gctgccaaga cagtgaagtt 781 caaatgccct tccagtggga ccccaaaccc cacactgcgc tggttggaaa atggcaaaga 841 attcaaacct gaccacagaa ttggaggcta caaggtccgt tatgccacct ggagcatcat 901 aatggactct gtggtgccct ctgacaaggg caactacacc tgcattgtgg agaatgagta 961 cggcagcatc aaccacacat accagctgga tgtcgtggag cggtcccctc accggcccat 1021 cctgcaagca gggttgcccg ccaacaaaac agtggccctg ggtagcaacg tggagttcat 1081 gtgtaaggtg tacagtgacc cgcagccgca catccagtgg ctaaagcaca tcgaggtgaa 1141 tgggagcaag attggcccag acaacctgcc ttatgtccag atcttgaaga ctgctggagt 1201 taataccacc gacaaagaga tggaggtgct tcacttaaga aatgtctcct ttgaggacgc 1261 aggggagtat acgtgcttgg cgggtaactc tatcggactc tcccatcact ctgcatggtt 1321 gaccgttctg gaagccctgg aagagaggcc ggcagtgatg acctcgcccc tgtacctgga 1381 gatcatcatc tattgcacag gggccttcct catctcctgc atggtggggt cggtcatcgt 1441 
ctacaagatg aagagtggta ccaagaagag tgacttccac agccagatgg ctgtgcacaa 1501 gctggccaag agcatccctc tgcgcagaca ggtaacagtg tctgctgact ccagtgcatc 1561 catgaactct ggggttcttc tggttcggcc atcacggctc tcctccagtg ggactcccat 1621 gctagcaggg gtctctgagt atgagcttcc cgaagaccct cgctgggagc tgcctcggga 1681 cagactggtc ttaggcaaac ccctgggaga gggctgcttt gggcaggtgg tgttggcaga 1741 ggctatcggg ctggacaagg acaaacccaa ccgtgtgacc aaagtggctg tgaagatgtt 1801 gaagtcggac gcaacagaga aagacttgtc agacctgatc tcagaaatgg agatgatgaa 1861 gatgatcggg aagcataaga atatcatcaa cctgctgggg gcctgcacgc aggatggtcc 1921 cttgtatgtc atcgtggagt atgcctccaa gggcaacctg cgggagtacc tgcaggcccg 1981 gaggccccca gggctggaat actgctacaa ccccagccac aacccagagg agcagctctc 2041 ctccaaggac ctggtgtcct gcgcctacca ggtggcccga ggcatggagt atctggcctc 2101 caagaagtgc atacaccgag acctggcagc caggaatgtc ctggtgacag aggacaatgt 2161 gatgaagata gcagactttg gcctcgcacg ggacattcac cacatcgact actataaaaa 2221 gacaaccaac ggccgactgc ctgtgaagtg gatggcaccc gaggcattat ttgaccggat 2281 ctacacccac cagagtgatg tgtggtcttt cggggtgctc ctgtgggaga tcttcactct 2341 gggcggctcc ccataccccg gtgtgcctgt ggaggaactt ttcaagctgc tgaaggaggg 2401 tcaccgcatg gacaagccca gtaactgcac caacgagctg tacatgatga tgcgggactg 2461 ctggcatgca gtgccctcac agagacccac cttcaagcag ctggtggaag acctggaccg 2521 catcgtggcc ttgacctcca accaggagta cctggacctg tccatgcccc tggaccagta 2581 ctcccccagc tttcccgaca cccggagctc tacgtgctcc tcaggggagg attccgtctt 2641 ctctcatgag ccgctgcccg aggagccctg cctgccccga cacccagccc agcttgccaa 2701 tggcggactc aaacgccgct gactgccacc cacacgccct ccccagactc caccgtcagc 2761 tgtaaccctc acccacagcc cctgctgggc ccaccacctg tccgtccctg tcccctttcc 2821 tgctggcagg agccggctgc ctaccagggg ccttcctgtg tggcctgcct tcaccccact 2881 cagctcacct ctccctccac ctcctctcca cctgctggtg agaggtggca aagaggcaga 2941 tcttttcact gccagccact tcatcccctc ccagatgttg gaccaacacc cctccctgcc 3001 accaggcact gcctggaggg cagggagtgg gagccaatga acaggcatgc aagtgagagc 3061 ttcctgagct ttctcctgtc ggtttggtct gttttgcctt cacccataag cccctcgcac 3121 tctggtggca 
ggtgccttgt cctcagggct acagcagtag ggaggtcagt gcttcgtgcc 3181 tcgattgaag gtgacctctg ccccagatag gtggtgccag tggcttatta attccgatac 3241 tagtttgctt tgctgaccaa atgcctggta ccagaggatg gtgaggcgaa ggccaggttg 3301 ggggcagtgt tgtggccctg gggcccagcc ccaaactggg ggctctgtat atagctatga 3361 agaaaacaca aagtgtataa atctgagtat atatttacat gtctttttaa aagggtcgtt 3421 accagagatt tacccatcgg gtaagatgct cctggtggct gggaggcatc agttgctata 3481 tattaaaaac aaaaaaaaaa aaa // LOCUS HUMFGF4H 2283 bp ss-mRNA PRI 03-JUL-1990 DEFINITION Human fibroblast growth factor receptor (FGFr) transmembrane form mRNA, complete cds. ACCESSION M34187 KEYWORDS FGF receptor; fibroblast growth factor receptor; transmembrane tyrosine kinase. SOURCE Human umbilical vein endothelial cell line HUVEC, cDNA to mRNA, clone h4. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 2283) AUTHORS Johnson,D.E., Lee,P.L., Lu,J. and Williams,L.T. TITLE Diverse forms of a receptor for acidic and basic fibroblast growth factors JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable [or printed] sequence for [1] kindly submitted by D.E.Johnson, 10-MAY-1990. 
Author address: D.E.Johnson University of California San Francisco 4th and Parnassus Howard Hughes Medical Institute San Francisco, CA 94143 (415) 476-4297 FEATURES from to/span description pept 417 1325 fibroblast growth factor receptor (FGFr) transmembrane form BASE COUNT 566 a 621 c 612 g 484 t ORIGIN 1 ggagcccggt ctgcaaggaa agtgaggcgc cgccgctgcg ttctggagga ggggggcaca 61 aggtctggag accccgggtg gcggacggga gccctccccc cgccccgcct ccggggcacc 121 agctccggct ccattgttcc cgcccgggct ggaggcgccg agcaccgagc gccgccggga 181 gtcgagcgcc ggccgcggag ctcttgcgac cccgccagga cccgaacaga gcccgggggc 241 ggcgggccgg agccggggac gcgggcacac gcccgctcgc acaagccacg gcggactctc 301 ccgaggcgga acctccacgc cgagcgaggg tcagtttgaa aaggaggatc gagctcactg 361 tggagtatcc atggagatgt ggagccttgt caccaacctc taactgcaga actgggatgt 421 ggagctggaa gtgcctcctc ttctgggctg tgctggtcac agccacactc tgcaccgcta 481 ggccgtcccc gaccttgcct gaacaagatg ctctcccctc ctcggaggat gatgatgatg 541 atgatgactc ctcttcagag gagaaagaaa cagataacac caaaccaaac cgtatgcccg 601 tagctccata ttggacatcc ccagaaaaga tggaaaagaa attgcatgca gtgccggctg 661 ccaagacagt gaagttcaaa tgcccttcca gtgggacccc aaaccccaca ctgcgctggt 721 tgaaaaatgg caaagaattc aaacctgacc acagaattgg aggctacaag gtccgttatg 781 ccacctggag catcataatg gactctgtgg tgccctctga caagggcaac tacacctgca 841 ttgtggagaa tgagtacggc agcatcaacc acacatacca gctggatgtc gtggagcggt 901 cccctcaccg gcccatcctg caagcagggt tgcccgccaa caaaacagtg gccctgggta 961 gcaacgtgga gttcatgtgt aaggtgtaca gtgacccgca gccgcacatc cagtggctaa 1021 agcacatcga ggtgaatggg agcaagattg gcccagacaa cctgccttat gtccagatct 1081 tgaaggtaat catggcacca gtcttcgtgg gccagtctac tgggaaggag accactgtct 1141 cgggggctca agttcctgtg ggcaggctca gttgcccccg aatgggatca ttcctcacgc 1201 ttcaggcaca cacactccat ctcagtaggg atctagccac atcccccagg actagtaaca 1261 gaggtcacaa agtggaggtg agctgggaac agagggctgc agggatgggt ggtgctggtc 1321 tgtaataagc tttgagagca acgtcactgg ggctttgggg tcagctacac aaggaaggca 1381 tttggacccc tgccttttca ttgcccgaaa ccagagcctt tccaccaagc gtttcccagt 1441 
cttagccctg tgttctgagt tacgtacgat ctttctggca aatggggtgc atgataagag 1501 catctcttac gaagagttgg aaaaacaaat gccatatata aattctaagc catatgagga 1561 cgaggagtaa tggcattttc ttcctttttc ctctcactcc cagacattca ttgtccctga 1621 atgctccatt aatccaggga aggtaattgc ctaaatctcc agtggatctc gcaacaggaa 1681 ggaaccagaa gctgggaaag ttgtttacct ctttgtccca gagttagacc tcatcctccc 1741 ctagcttagc tgtctcagag atatactggc cctcccttct cttctctttg ctgctggtgc 1801 taaaactgct ctgtaggtca ttggccactg tctccactca caacccctgc tccagtcctg 1861 gagggagtgg gttaaacaca aatagaacat tccatttgaa gcagtgattc tttttttttt 1921 tttttttttt taatcaaatg ctttggactt ttgaagtcca cttgttctgt acttgtaaaa 1981 gggaaagaag gccgggcgca gtcgtcacgc ctgtaatccc agcactttag atcacttgag 2041 gtcaggagtt tgagaccagc ccggccaaca tggtgaaacc ccatctctac taaaaataca 2101 aaaattagct gtgcatagtg gttggcacct gtagtcccag ctactcagga ggctgaggca 2161 agctaactgc ttgaacccag aaggcagagg ttgcagtgag ctgagatcac gccactgcac 2221 tccagcctgg gtgacagagt gagtgagact ctgcgttaaa aaaaaaaaaa aaaaaaaaaa 2281 aaa // LOCUS HUMFGF5H 1625 bp ss-mRNA PRI 03-JUL-1990 DEFINITION Human fibroblast growth factor receptor (FGFr) secreted form mRNA, complete cds. ACCESSION M34188 KEYWORDS FGF receptor; fibroblast growth factor receptor. SOURCE Human female placenta endothelial cell line HUVEC, cDNA to mRNA, clone h5. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 1625) AUTHORS Johnson,D.E., Lee,P.L., Lu,J. and Williams,L.T. TITLE Diverse forms of a receptor for acidic and basic fibroblast growth factors JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable [or printed] sequence for [1] kindly submitted by D.E.Johnson, 10-MAY-1990. 
Author address: D.E.Johnson University of California San Francisco 4th and Parnassus Howard Hughes Medical Institute San Francisco, CA 94143 (415) 476-4297 FEATURES from to/span description pept 523 1425 fibroblast growth factor receptor (FGFr) transmembrane form BASE COUNT 368 a 480 c 489 g 288 t ORIGIN 1 cggaacccaa ggacttttct ccggtccgag ctcggggcgc cccgcaggcg acggtacccg 61 tgctgcagtc gggcacgccg cgggcccggg gcctccgcag ggcgatggag cccggtctgc 121 aaggaaagtg aggcgccgcc gctgcgttct ggaggagggg ggcacaaggt ctggagaccc 181 cgggtggcgg acgggagccc tccccccgcc ccgcctccgg ggcaccagct ccggctccat 241 tgttcccgcc cgggctggag gcgccgagca ccgagcgccg ccgggagtcg agcgccggcc 301 gcggagctct tgcgaccccg ccaggacccg aacagagccc gggggcggcg ggccggagcc 361 ggggacgcgg gcacacgccc gctcgcacaa gccacggcgg actctcccga ggcggaacct 421 ccacgccgag cgagggtcag tttgaaaagg aggatcgagc tcactgtgga gtatccatgg 481 agatgtggag ccttgtcacc aacctctaac tgcagaactg ggatgtggag ctggaagtgc 541 ctcctcttct gggctgtgct ggtcacagcc acactctgca ccgctaggcc gtccccgacc 601 ttgcctgaac aagatgctct cccctcctcg gaggatgatg atgatgatga tgactcctct 661 tcagaggaga aagaaacaga taacaccaaa ccaaaccccg tagctccata ttggacatcc 721 ccagaaaaga tggaaaagaa attgcatgca gtgccggctg ccaagacagt gaagttcaaa 781 tgcccttcca gtgggacccc aaaccccaca ctgcgctggt tgaaaaatgg caaagaattc 841 aaacctgacc acagaattgg aggctacaag gtccgttatg ccacctggag catcataatg 901 gactctgtgg tgccctctga caagggcaac tacacctgca ttgtggagaa tgagtacggc 961 agcatcaacc acacatacca gctggatgtc gtggagcggt cccctcaccg gcccatcctg 1021 caagcagggt tgcccgccaa caaaacagtg gccctgggta gcaacgtgga gttcatgtgt 1081 aaggtgtaca gtgacccgca gccgcacatc cagtggctaa agcacatcga ggtgaatggg 1141 agcaagattg gcccagacaa cctgccttat gtccagatct tgaaggtaat catggcacca 1201 gtcttcgtgg gccagtctac tgggaaggag accactgtct cgggggctca agttcctgtg 1261 ggcaggctca gttgcccccg aatgggatca ttcctcacgc ttcaggcaca cacactccat 1321 ctcagtaggg atctagccac atcccccagg actagtaaca gaggtcacaa agtggaggtg 1381 agctgggaac agagggctgc agggatgggt ggtgctggtc tgtaataagc tttgagagca 1441 
acgtcactgg ggctttgggg tcagctacac aaggaaggca tttggacccc tgccttttca 1501 ttgcccgaaa ccagagcctt tccaccaagc gtttcccagt cttagccctg tgtcctgagt 1561 tacgtacgat ctttctggca aatggggtgc atgataagag catctcttac gaagagttgg 1621 aaaaa // LOCUS SYNLACZA 6476 bp ds-DNA SYN 03-JUL-1990 DEFINITION Cloning vector pPD1.27. ACCESSION M34296 KEYWORDS lacZ. SOURCE Cloning vector pPD1.27. ORGANISM Cloning vector Artificial sequences; Cloning vehicles. REFERENCE 1 (bases 1 to 6476) AUTHORS Fire,A.Z., Harrison,S. and Dixon,D. TITLE A modular set of lac-Z fusion vectors for studying gene expression in C.elegans JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable [or printed] sequence for [1] kindly submitted by A.Z.Fire, 11-MAY-1990. Author address: A.Z.Fire Carnegie Inst of Washington Dept Embryology 115 West Univ Parkway Baltimore, MD 21210 email: AZF@JHUIGF.BITNET FEATURES from to/span description recomb 57 58 pUC19 end/synthetic start recomb 102 103 synthetic end/E.coli trpS start recomb 186 187 E.coli trpS end/synthetic start recomb 190 191 synthetic end/E.coli lacZ start recomb 3500 3501 E.coli lacZ end/SV40 start recomb 3634 3635 SV40 end/synthetic start recomb 3657 3658 synthetic end/pUC19 start recomb 4076 4077 pUC19 end/C.elegans sup-7 start recomb 4442 4443 C.elegans sup-7 end/pUC19 start BASE COUNT 1560 a 1646 c 1676 g 1594 t ORIGIN 1 atgaccatga ttacgccaag cttgcatgcc tgcaggtcga ctctagagga tccccgggta 61 ccgagctcag aaaaaatgac tgctccaaag aagaagcgta aggtaccggt gggtgaagac 121 cagaaacagc acctcgaact gagccgcgat attgcccagc gtttcaacgc gctgtatggc 181 gagatcgatc ccgtcgtttt acaacgtcgt gactgggaaa accctggcgt tacccaactt 241 aatcgccttg cagcacatcc ccctttcgcc agctggcgta atagcgaaga ggcccgcacc 301 gatcgccctt cccaacagtt gcgcagcctg aatggcgaat ggcgctttgc ctggtttccg 361 gcaccagaag cggtgccgga aagctggctg gagtgcgatc ttcctgaggc cgatactgtc 421 gtcgtcccct caaactggca gatgcacggt tacgatgcgc ccatctacac caacgtaacc 481 tatcccatta cggtcaatcc gccgtttgtt cccacggaga atccgacggg 
ttgttactcg 541 ctcacattta atgttgatga aagctggcta caggaaggcc agacgcgaat tatttttgat 601 ggcgttaact cggcgtttca tctgtggtgc aacgggcgct gggtcggtta cggccaggac 661 agtcgtttgc cgtctgaatt tgacctgagc gcatttttac gcgccggaga aaaccgcctc 721 gcggtgatgg tgctgcgttg gagtgacggc agttatctgg aagatcagga tatgtggcgg 781 atgagcggca ttttccgtga cgtctcgttg ctgcataaac cgactacaca aatcagcgat 841 ttccatgttg ccactcgctt taatgatgat ttcagccgcg ctgtactgga ggctgaagtt 901 cagatgtgcg gcgagttgcg tgactaccta cgggtaacag tttctttatg gcagggtgaa 961 acgcaggtcg ccagcggcac cgcgcctttc ggcggtgaaa ttatcgatga gcgtggtggt 1021 tatgccgatc gcgtcacact acgtctgaac gtcgaaaacc cgaaactgtg gagcgccgaa 1081 atcccgaatc tctatcgtgc ggtggttgaa ctgcacaccg ccgacggcac gctgattgaa 1141 gcagaagcct gcgatgtcgg tttccgcgag gtgcggattg aaaatggtct gctgctgctg 1201 aacggcaagc cgttgctgat tcgaggcgtt aaccgtcacg agcatcatcc tctgcatggt 1261 caggtcatgg atgagcagac gatggtgcag gatatcctgc tgatgaagca gaacaacttt 1321 aacgccgtgc gctgttcgca ttatccgaac catccgctgt ggtacacgct gtgcgaccgc 1381 tacggcctgt atgtggtgga tgaagccaat attgaaaccc acggcatggt gccaatgaat 1441 cgtctgaccg atgatccgcg ctggctaccg gcgatgagcg aacgcgtaac gcgaatggtg 1501 cagcgcgatc gtaatcaccc gagtgtgatc atctggtcgc tggggaatga atcaggccac 1561 ggcgctaatc acgacgcgct gtatcgctgg atcaaatctg tcgatccttc ccgcccggtg 1621 cagtatgaag gcggcggagc cgacaccacg gccaccgata ttatttgccc gatgtacgcg 1681 cgcgtggatg aagaccagcc cttcccggct gtgccgaaat ggtccatcaa aaaatggctt 1741 tcgctacctg gagagacgcg cccgctgatc ctttgcgaat acgcccacgc gatgggtaac 1801 agtcttggcg gtttcgctaa atactggcag gcgtttcgtc agtatccccg tttacagggc 1861 ggcttcgtct gggactgggt ggatcagtcg ctgattaaat atgatgaaaa cggcaacccg 1921 tggtcggctt acggcggtga ttttggcgat acgccgaacg atcgccagtt ctgtatgaac 1981 ggtctggtct ttgccgaccg cacgccgcat ccagcgctga cggaagcaaa acaccagcag 2041 cagtttttcc agttccgttt atccgggcaa accatcgaag tgaccagcga atacctgttc 2101 cgtcatagcg ataacgagct cctgcactgg atggtggcgc tggatggtaa gccgctggca 2161 agcggtgaag tgcctctgga tgtcgctcca caaggtaaac agttgattga actgcctgaa 2221 
ctaccgcagc cggagagcgc cgggcaactc tggctcacag tacgcgtagt gcaaccgaac 2281 gcgaccgcat ggtcagaagc cgggcacatc agcgcctggc agcagtggcg tctggcggaa 2341 aacctcagtg tgacgctccc cgccgcgtcc cacgccatcc cgcatctgac caccagcgaa 2401 atggattttt gcatcgagct gggtaataag cgttggcaat ttaaccgcca gtcaggcttt 2461 ctttcacaga tgtggattgg cgataaaaaa caactgctga cgccgctgcg cgatcagttc 2521 acccgtgcac cgctggataa cgacattggc gtaagtgaag cgacccgcat tgaccctaac 2581 gcctgggtcg aacgctggaa ggcggcgggc cattaccagg ccgaagcagc gttgttgcag 2641 tgcacggcag atacacttgc tgatgcggtg ctgattacga ccgctcacgc gtggcagcat 2701 caggggaaaa ccttatttat cagccggaaa acctaccgga ttgatggtag tggtcaaatg 2761 gcgattaccg ttgatgttga agtggcgagc gatacaccgc atccggcgcg gattggcctg 2821 aactgccagc tggcgcaggt agcagagcgg gtaaactggc tcggattagg gccgcaagaa 2881 aactatcccg accgccttac tgccgcctgt tttgaccgct gggatctgcc attgtcagac 2941 atgtataccc cgtacgtctt cccgagcgaa aacggtctgc gctgcgggac gcgcgaattg 3001 aattatggcc cacaccagtg gcgcggcgac ttccagttca acatcagccg ctacagtcaa 3061 cagcaactga tggaaaccag ccatcgccat ctgctgcacg cggaagaagg cacatggctg 3121 aatatcgacg gtttccatat ggggattggt ggcgacgact cctggagccc gtcagtatcg 3181 gcggaattcc agctgagcgc cggtcgctac cattaccagt tggtctggtg tcaaaaataa 3241 taataaccgg gcaggccatg tctgcccgta tttcgcgtaa ggaaatccat tatgtactat 3301 ttaaaaaaca caaacttttg gatgttcggt ttattctttt tcttttactt ttttatcatg 3361 ggagcctact tcccgttttt cccgatttgg ctacatgaca tcaaccatat cagcaaaagt 3421 gatacgggta ttatttttgc cgctatttct ctgttctcgc tattattcca accgctgttt 3481 ggtctgcttt ctgacaaact cggaacttgt ttattgcagc ttataatggt tacaaataaa 3541 gcaatagcat cacaaatttc acaaataaag catttttttc actgcattct agttgtggtt 3601 tgtccaaact catcaatgta tcttatcatg tctggatcga caaagtcaaa gcggccgcct 3661 gatgcggtat tttctcctta cgcatctgtg cggtatttca caccgcatat ggtgcactct 3721 cagtacaatc tgctctgatg ccgcatagtt aagccagccc cgacacccgc caacacccgc 3781 tgacgcgccc tgacgggctt gtctgctccc ggcatccgct tacagacaag ctgtgaccgt 3841 ctccgggagc tgcatgtgtc agaggttttc accgtcatca ccgaaacgcg cgagacgaaa 3901 gggcctcgtg 
atacgcctat ttttataggt taatgtcatg ataataatgg tttcttagac 3961 gtcaggtggc acttttcggg gaaatgtgcg cggaacccct atttgtttat ttttctaaat 4021 acattcaaat atgtatccgc tcatgagaca ataaccctga taaatgcttc aataatacaa 4081 ttttcagaat acgttttttg tgggcttggg tatattgttt ttaatgttat acttgcagtc 4141 gtgaaatttg attttcaaat ttgtagaaaa atcaagaaaa taattgcaac attcgcttgt 4201 gtcaaaaacc aatttcaaca aattttcgtg tgagaaatac attaccagaa ggcatttttt 4261 cacacgatta gcattttgga ctactttatt aaatttttgc gtgtaatttt gaattaaatt 4321 gtattatatt actacttaaa aaacaaaaaa tttgaccact gagcggatcg aacgcccaac 4381 ctttcgatct agagtcgaac gcgctaccat tgcgccaagc agtcatgtta ttctctcttg 4441 tcattgaaaa aggaagagta tgagtattca acatttccgt gtcgccctta ttcccttttt 4501 tgcggcattt tgccttcctg tttttgctca cccagaaacg ctggtgaaag taaaagatgc 4561 tgaagatcag ttgggtgcac gagtgggtta catcgaactg gatctcaaca gcggtaagat 4621 ccttgagagt tttcgccccg aagaacgttt tccaatgatg agcactttta aagttctgct 4681 atgtggcgcg gtattatccc gtattgacgc cgggcaagag caactcggtc gccgcataca 4741 ctattctcag aatgacttgg ttgagtactc accagtcaca gaaaagcatc ttacggatgg 4801 catgacagta agagaattat gcagtgctgc cataaccatg agtgataaca ctgcggccaa 4861 cttacttctg acaacgatcg gaggaccgaa ggagctaacc gcttttttgc acaacatggg 4921 ggatcatgta actcgccttg atcgttggga accggagctg aatgaagcca taccaaacga 4981 cgagcgtgac accacgatgc ctgtagcaat ggcaacaacg ttgcgcaaac tattaactgg 5041 cgaactactt actctagctt cccggcaaca attaatagac tggatggagg cggataaagt 5101 tgcaggacca cttctgcgct cggcccttcc ggctggctgg tttattgctg ataaatctgg 5161 agccggtgag cgtgggtctc gcggtatcat tgcagcactg gggccagatg gtaagccctc 5221 ccgtatcgta gttatctaca cgacggggag tcaggcaact atggatgaac gaaatagaca 5281 gatcgctgag ataggtgcct cactgattaa gcattggtaa ctgtcagacc aagtttactc 5341 atatatactt tagattgatt taaaacttca tttttaattt aaaaggatct aggtgaagat 5401 cctttttgat aatctcatga ccaaaatccc ttaacgtgag ttttcgttcc actgagcgtc 5461 agaccccgta gaaaagatca aaggatcttc ttgagatcct ttttttctgc gcgtaatctg 5521 ctgcttgcaa acaaaaaaac caccgctacc agcggtggtt tgtttgccgg atcaagagct 5581 accaactctt tttccgaagg 
taactggctt cagcagagcg cagataccaa atactgtcct 5641 tctagtgtag ccgtagttag gccaccactt caagaactct gtagcaccgc ctacatacct 5701 cgctctgcta atcctgttac cagtggctgc tgccagtggc gataagtcgt gtcttaccgg 5761 gttggactca agacgatagt taccggataa ggcgcagcgg tcgggctgaa cggggggttc 5821 gtgcacacag cccagcttgg agcgaacgac ctacaccgaa ctgagatacc tacagcgtga 5881 gctatgagaa agcgccacgc ttcccgaagg gagaaaggcg gacaggtatc cggtaagcgg 5941 cagggtcgga acaggagagc gcacgaggga gcttccaggg ggaaacgcct ggtatcttta 6001 tagtcctgtc gggtttcgcc acctctgact tgagcgtcga tttttgtgat gctcgtcagg 6061 ggggcggagc ctatggaaaa acgccagcaa cgcggccttt ttacggttcc tggccttttg 6121 ctggcctttt gctcacatgt tctttcctgc gttatcccct gattctgtgg ataaccgtat 6181 taccgccttt gagtgagctg ataccgctcg ccgcagccga acgaccgagc gcagcgagtc 6241 agtgagcgag gaagcggaag agcgcccaat acgcaaaccg cctctccccg cgcgttggcc 6301 gattcattaa tgcagctggc acgacaggtt tcccgactgg aaagcgggca gtgagcgcaa 6361 cgcaattaat gtgagttagc tcactcatta ggcaccccag gctttacact ttatgcttcc 6421 ggctcgtatg ttgtgtggaa ttgtgagcgg ataacaattt cacacaggaa acagct // LOCUS SYNLACZB 6096 bp ds-DNA SYN 03-JUL-1990 DEFINITION Cloning vector pPD8.02. ACCESSION M34297 KEYWORDS lacZ. SOURCE Cloning vector pPD8.02. ORGANISM Cloning vector Artificial sequences; Cloning vehicles. REFERENCE 1 (bases 1 to 6096) AUTHORS Fire,A.Z., Harrison,S. and Dixon,D. TITLE A modular set of lac-Z fusion vectors for studying gene expression in C.elegans JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable [or printed] sequence for [1] kindly submitted by A.Z.Fire, 11-MAY-1990. 
Author address: A.Z.Fire Carnegie Inst of Washington Dept Embryology 115 West Univ Parkway Baltimore, MD 21210 email: AZF@JHUIGF.BITNET FEATURES from to/span description recomb 57 58 pUC19 end/synthetic start recomb 102 103 synthetic end/E.coli trpS start recomb 186 187 E.coli trpS end/synthetic start recomb 190 191 synthetic end/E.coli lacZ start recomb 3184 3185 E.coli lacZ end/synthetic start recomb 3275 3276 synthetic end/unknown DNA start recomb 3696 3697 unknown DNA end/C.elegans sup-7 end recomb 4062 4063 C.elegans sup-7 end/pUC19 start BASE COUNT 1458 a 1571 c 1616 g 1451 t ORIGIN 1 atgaccatga ttacgccaag cttgcatgcc tgcaggtcga ctctagagga tccccgggta 61 ccgagctcag aaaaaatgac tgctccaaag aagaagcgta aggtaccggt gggtgaagac 121 cagaaacagc acctcgaact gagccgcgat attgcccagc gtttcaacgc gctgtatggc 181 gagatcgatc ccgtcgtttt acaacgtcgt gactgggaaa accctggcgt tacccaactt 241 aatcgccttg cagcacatcc ccctttcgcc agctggcgta atagcgaaga ggcccgcacc 301 gatcgccctt cccaacagtt gcgcagcctg aatggcgaat ggcgctttgc ctggtttccg 361 gcaccagaag cggtgccgga aagctggctg gagtgcgatc ttcctgaggc cgatactgtc 421 gtcgtcccct caaactggca gatgcacggt tacgatgcgc ccatctacac caacgtaacc 481 tatcccatta cggtcaatcc gccgtttgtt cccacggaga atccgacggg ttgttactcg 541 ctcacattta atgttgatga aagctggcta caggaaggcc agacgcgaat tatttttgat 601 ggcgttaact cggcgtttca tctgtggtgc aacgggcgct gggtcggtta cggccaggac 661 agtcgtttgc cgtctgaatt tgacctgagc gcatttttac gcgccggaga aaaccgcctc 721 gcggtgatgg tgctgcgttg gagtgacggc agttatctgg aagatcagga tatgtggcgg 781 atgagcggca ttttccgtga cgtctcgttg ctgcataaac cgactacaca aatcagcgat 841 ttccatgttg ccactcgctt taatgatgat ttcagccgcg ctgtactgga ggctgaagtt 901 cagatgtgcg gcgagttgcg tgactaccta cgggtaacag tttctttatg gcagggtgaa 961 acgcaggtcg ccagcggcac cgcgcctttc ggcggtgaaa ttatcgatga gcgtggtggt 1021 tatgccgatc gcgtcacact acgtctgaac gtcgaaaacc cgaaactgtg gagcgccgaa 1081 atcccgaatc tctatcgtgc ggtggttgaa ctgcacaccg ccgacggcac gctgattgaa 1141 gcagaagcct gcgatgtcgg tttccgcgag gtgcggattg aaaatggtct gctgctgctg 
1201 aacggcaagc cgttgctgat tcgaggcgtt aaccgtcacg agcatcatcc tctgcatggt 1261 caggtcatgg atgagcagac gatggtgcag gatatcctgc tgatgaagca gaacaacttt 1321 aacgccgtgc gctgttcgca ttatccgaac catccgctgt ggtacacgct gtgcgaccgc 1381 tacggcctgt atgtggtgga tgaagccaat attgaaaccc acggcatggt gccaatgaat 1441 cgtctgaccg atgatccgcg ctggctaccg gcgatgagcg aacgcgtaac gcgaatggtg 1501 cagcgcgatc gtaatcaccc gagtgtgatc atctggtcgc tggggaatga atcaggccac 1561 ggcgctaatc acgacgcgct gtatcgctgg atcaaatctg tcgatccttc ccgcccggtg 1621 cagtatgaag gcggcggagc cgacaccacg gccaccgata ttatttgccc gatgtacgcg 1681 cgcgtggatg aagaccagcc cttcccggct gtgccgaaat ggtccatcaa aaaatggctt 1741 tcgctacctg gagagacgcg cccgctgatc ctttgcgaat acgcccacgc gatgggtaac 1801 agtcttggcg gtttcgctaa atactggcag gcgtttcgtc agtatccccg tttacagggc 1861 ggcttcgtct gggactgggt ggatcagtcg ctgattaaat atgatgaaaa cggcaacccg 1921 tggtcggctt acggcggtga ttttggcgat acgccgaacg atcgccagtt ctgtatgaac 1981 ggtctggtct ttgccgaccg cacgccgcat ccagcgctga cggaagcaaa acaccagcag 2041 cagtttttcc agttccgttt atccgggcaa accatcgaag tgaccagcga atacctgttc 2101 cgtcatagcg ataacgagct cctgcactgg atggtggcgc tggatggtaa gccgctggca 2161 agcggtgaag tgcctctgga tgtcgctcca caaggtaaac agttgattga actgcctgaa 2221 ctaccgcagc cggagagcgc cgggcaactc tggctcacag tacgcgtagt gcaaccgaac 2281 gcgaccgcat ggtcagaagc cgggcacatc agcgcctggc agcagtggcg tctggcggaa 2341 aacctcagtg tgacgctccc cgccgcgtcc cacgccatcc cgcatctgac caccagcgaa 2401 atggattttt gcatcgagct gggtaataag cgttggcaat ttaaccgcca gtcaggcttt 2461 ctttcacaga tgtggattgg cgataaaaaa caactgctga cgccgctgcg cgatcagttc 2521 acccgtgcac cgctggataa cgacattggc gtaagtgaag cgacccgcat tgaccctaac 2581 gcctgggtcg aacgctggaa ggcggcgggc cattaccagg ccgaagcagc gttgttgcag 2641 tgcacggcag atacacttgc tgatgcggtg ctgattacga ccgctcacgc gtggcagcat 2701 caggggaaaa ccttatttat cagccggaaa acctaccgga ttgatggtag tggtcaaatg 2761 gcgattaccg ttgatgttga agtggcgagc gatacaccgc atccggcgcg gattggcctg 2821 aactgccagc tggcgcaggt agcagagcgg gtaaactggc tcggattagg gccgcaagaa 2881 
aactatcccg accgccttac tgccgcctgt tttgaccgct gggatctgcc attgtcagac 2941 atgtataccc cgtacgtctt cccgagcgaa aacggtctgc gctgcgggac gcgcgaattg 3001 aattatggcc cacaccagtg gcgcggcgac ttccagttca acatcagccg ctacagtcaa 3061 cagcaactga tggaaaccag ccatcgccat ctgctgcacg cggaagaagg cacatggctg 3121 aatatcgacg gtttccatat ggggattggt ggcgacgact cctggagccc gtcagtatcg 3181 gcggaattcc aactgagcgc cggtcgctac cattaccaac ttgtctggtg tcaaaaataa 3241 taggcctact agtcggccgt acgggccctt aaggccgcct gatgcggtat tttctcctta 3301 cgcatctgtg cggtatttca caccgcatat ggtgcactct cagtacaatc tgctctgatg 3361 ccgcatagtt aagccagccc cgacacccgc caacacccgc tgacgcgccc tgacgggctt 3421 gtctgctccc ggcatccgct tacagacaag ctgtgaccgt ctccgggagc tgcatgtgtc 3481 agaggttttc accgtcatca ccgaaacgcg cgagacgaaa gggcctcgtg atacgcctat 3541 ttttataggt taatgtcatg ataataatgg tttcttagac gtcaggtggc acttttcggg 3601 gaaatgtgcg cggaacccct atttgtttat ttttctaaat acattcaaat atgtatccgc 3661 tcatgagaca ataaccctga taaatgcttc aataatacaa ttttcagaat acgttttttg 3721 tgggcttggg tatattgttt ttaatgttat acttgcagtc gtgaaatttg attttcaaat 3781 ttgtagaaaa atcaagaaaa taattgcaac attcgcttgt gtcaaaaacc aatttcaaca 3841 aattttcgtg tgagaaatac attaccagaa ggcatttttt cacacgatta gcattttgga 3901 ctactttatt aaatttttgc gtgtaatttt gaattaaatt gtattatatt actacttaaa 3961 aaacaaaaaa tttgaccact gagcggatcg aacgcccaac ctttcgatct agagtcgaac 4021 gcgctaccat tgcgccaagc agtcatgtta ttctctcttg tcattgaaaa aggaagagta 4081 tgagtattca acatttccgt gtcgccctta ttcccttttt tgcggcattt tgccttcctg 4141 tttttgctca cccagaaacg ctggtgaaag taaaagatgc tgaagatcag ttgggtgcac 4201 gagtgggtta catcgaactg gatctcaaca gcggtaagat ccttgagagt tttcgccccg 4261 aagaacgttt tccaatgatg agcactttta aagttctgct atgtggcgcg gtattatccc 4321 gtattgacgc cgggcaagag caactcggtc gccgcataca ctattctcag aatgacttgg 4381 ttgagtactc accagtcaca gaaaagcatc ttacggatgg catgacagta agagaattat 4441 gcagtgctgc cataaccatg agtgataaca ctgcggccaa cttacttctg acaacgatcg 4501 gaggaccgaa ggagctaacc gcttttttgc acaacatggg ggatcatgta actcgccttg 4561 atcgttggga 
accggagctg aatgaagcca taccaaacga cgagcgtgac accacgatgc 4621 ctgtagcaat ggcaacaacg ttgcgcaaac tattaactgg cgaactactt actctagctt 4681 cccggcaaca attaatagac tggatggagg cggataaagt tgcaggacca cttctgcgct 4741 cggcccttcc ggctggctgg tttattgctg ataaatctgg agccggtgag cgtgggtctc 4801 gcggtatcat tgcagcactg gggccagatg gtaagccctc ccgtatcgta gttatctaca 4861 cgacggggag tcaggcaact atggatgaac gaaatagaca gatcgctgag ataggtgcct 4921 cactgattaa gcattggtaa ctgtcagacc aagtttactc atatatactt tagattgatt 4981 taaaacttca tttttaattt aaaaggatct aggtgaagat cctttttgat aatctcatga 5041 ccaaaatccc ttaacgtgag ttttcgttcc actgagcgtc agaccccgta gaaaagatca 5101 aaggatcttc ttgagatcct ttttttctgc gcgtaatctg ctgcttgcaa acaaaaaaac 5161 caccgctacc agcggtggtt tgtttgccgg atcaagagct accaactctt tttccgaagg 5221 taactggctt cagcagagcg cagataccaa atactgtcct tctagtgtag ccgtagttag 5281 gccaccactt caagaactct gtagcaccgc ctacatacct cgctctgcta atcctgttac 5341 cagtggctgc tgccagtggc gataagtcgt gtcttaccgg gttggactca agacgatagt 5401 taccggataa ggcgcagcgg tcgggctgaa cggggggttc gtgcacacag cccagcttgg 5461 agcgaacgac ctacaccgaa ctgagatacc tacagcgtga gctatgagaa agcgccacgc 5521 ttcccgaagg gagaaaggcg gacaggtatc cggtaagcgg cagggtcgga acaggagagc 5581 gcacgaggga gcttccaggg ggaaacgcct ggtatcttta tagtcctgtc gggtttcgcc 5641 acctctgact tgagcgtcga tttttgtgat gctcgtcagg ggggcggagc ctatggaaaa 5701 acgccagcaa cgcggccttt ttacggttcc tggccttttg ctggcctttt gctcacatgt 5761 tctttcctgc gttatcccct gattctgtgg ataaccgtat taccgccttt gagtgagctg 5821 ataccgctcg ccgcagccga acgaccgagc gcagcgagtc agtgagcgag gaagcggaag 5881 agcgcccaat acgcaaaccg cctctccccg cgcgttggcc gattcattaa tgcagctggc 5941 acgacaggtt tcccgactgg aaagcgggca gtgagcgcaa cgcaattaat gtgagttagc 6001 tcactcatta ggcaccccag gctttacact ttatgcttcc ggctcgtatg ttgtgtggaa 6061 ttgtgagcgg ataacaattt cacacaggaa acagct // LOCUS SYNLACZC 7376 bp ds-DNA SYN 03-JUL-1990 DEFINITION Cloning vector pPD8.33. ACCESSION M34298 KEYWORDS lacZ. SOURCE Cloning vector pPD8.33. 
ORGANISM Cloning vector Artificial sequences; Cloning vehicles. REFERENCE 1 (bases 1 to 7376) AUTHORS Fire,A.Z., Harrison,S. and Dixon,D. TITLE A modular set of lac-Z fusion vectors for studying gene expression in C.elegans JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable [or printed] sequence for [1] kindly submitted by A.Z.Fire, 11-MAY-1990. Author address: A.Z.Fire Carnegie Inst of Washington Dept Embryology 115 West Univ Parkway Baltimore, MD 21210 email: AZF@JHUIGF.BITNET FEATURES from to/span description recomb 57 58 pUC19 end/synthetic start recomb 102 103 synthetic end/E.coli trpS start recomb 186 187 E.coli trpS end/synthetic start recomb 190 191 synthetic end/E.coli lacZ start recomb 3184 3185 E.coli lacZ end/synthetic start recomb 3244 3245 synthetic end/unknown DNA start recomb 3524 3425 unknown DNA end/C.elegans sup-7 end recomb 4555 4556 synthetic end/pUC19 start recomb 4976 4977 pUC19 end/C.elegans sup-7 start recomb 5342 5343 C.elegans sup-7 end/pUC19 start BASE COUNT 1863 a 1793 c 1814 g 1906 t ORIGIN 1 atgaccatga ttacgccaag cttgcatgcc tgcaggtcga ctctagagga tccccgggta 61 ccgagctcag aaaaaatgac tgctccaaag aagaagcgta aggtaccggt gggtgaagac 121 cagaaacagc acctcgaact gagccgcgat attgcccagc gtttcaacgc gctgtatggc 181 gagatcgatc ccgtcgtttt acaacgtcgt gactgggaaa accctggcgt tacccaactt 241 aatcgccttg cagcacatcc ccctttcgcc agctggcgta atagcgaaga ggcccgcacc 301 gatcgccctt cccaacagtt gcgcagcctg aatggcgaat ggcgctttgc ctggtttccg 361 gcaccagaag cggtgccgga aagctggctg gagtgcgatc ttcctgaggc cgatactgtc 421 gtcgtcccct caaactggca gatgcacggt tacgatgcgc ccatctacac caacgtaacc 481 tatcccatta cggtcaatcc gccgtttgtt cccacggaga atccgacggg ttgttactcg 541 ctcacattta atgttgatga aagctggcta caggaaggcc agacgcgaat tatttttgat 601 ggcgttaact cggcgtttca tctgtggtgc aacgggcgct gggtcggtta cggccaggac 661 agtcgtttgc cgtctgaatt tgacctgagc gcatttttac gcgccggaga aaaccgcctc 721 gcggtgatgg tgctgcgttg gagtgacggc agttatctgg aagatcagga tatgtggcgg 781 
atgagcggca ttttccgtga cgtctcgttg ctgcataaac cgactacaca aatcagcgat 841 ttccatgttg ccactcgctt taatgatgat ttcagccgcg ctgtactgga ggctgaagtt 901 cagatgtgcg gcgagttgcg tgactaccta cgggtaacag tttctttatg gcagggtgaa 961 acgcaggtcg ccagcggcac cgcgcctttc ggcggtgaaa ttatcgatga gcgtggtggt 1021 tatgccgatc gcgtcacact acgtctgaac gtcgaaaacc cgaaactgtg gagcgccgaa 1081 atcccgaatc tctatcgtgc ggtggttgaa ctgcacaccg ccgacggcac gctgattgaa 1141 gcagaagcct gcgatgtcgg tttccgcgag gtgcggattg aaaatggtct gctgctgctg 1201 aacggcaagc cgttgctgat tcgaggcgtt aaccgtcacg agcatcatcc tctgcatggt 1261 caggtcatgg atgagcagac gatggtgcag gatatcctgc tgatgaagca gaacaacttt 1321 aacgccgtgc gctgttcgca ttatccgaac catccgctgt ggtacacgct gtgcgaccgc 1381 tacggcctgt atgtggtgga tgaagccaat attgaaaccc acggcatggt gccaatgaat 1441 cgtctgaccg atgatccgcg ctggctaccg gcgatgagcg aacgcgtaac gcgaatggtg 1501 cagcgcgatc gtaatcaccc gagtgtgatc atctggtcgc tggggaatga atcaggccac 1561 ggcgctaatc acgacgcgct gtatcgctgg atcaaatctg tcgatccttc ccgcccggtg 1621 cagtatgaag gcggcggagc cgacaccacg gccaccgata ttatttgccc gatgtacgcg 1681 cgcgtggatg aagaccagcc cttcccggct gtgccgaaat ggtccatcaa aaaatggctt 1741 tcgctacctg gagagacgcg cccgctgatc ctttgcgaat acgcccacgc gatgggtaac 1801 agtcttggcg gtttcgctaa atactggcag gcgtttcgtc agtatccccg tttacagggc 1861 ggcttcgtct gggactgggt ggatcagtcg ctgattaaat atgatgaaaa cggcaacccg 1921 tggtcggctt acggcggtga ttttggcgat acgccgaacg atcgccagtt ctgtatgaac 1981 ggtctggtct ttgccgaccg cacgccgcat ccagcgctga cggaagcaaa acaccagcag 2041 cagtttttcc agttccgttt atccgggcaa accatcgaag tgaccagcga atacctgttc 2101 cgtcatagcg ataacgagct cctgcactgg atggtggcgc tggatggtaa gccgctggca 2161 agcggtgaag tgcctctgga tgtcgctcca caaggtaaac agttgattga actgcctgaa 2221 ctaccgcagc cggagagcgc cgggcaactc tggctcacag tacgcgtagt gcaaccgaac 2281 gcgaccgcat ggtcagaagc cgggcacatc agcgcctggc agcagtggcg tctggcggaa 2341 aacctcagtg tgacgctccc cgccgcgtcc cacgccatcc cgcatctgac caccagcgaa 2401 atggattttt gcatcgagct gggtaataag cgttggcaat ttaaccgcca gtcaggcttt 2461 ctttcacaga 
tgtggattgg cgataaaaaa caactgctga cgccgctgcg cgatcagttc 2521 acccgtgcac cgctggataa cgacattggc gtaagtgaag cgacccgcat tgaccctaac 2581 gcctgggtcg aacgctggaa ggcggcgggc cattaccagg ccgaagcagc gttgttgcag 2641 tgcacggcag atacacttgc tgatgcggtg ctgattacga ccgctcacgc gtggcagcat 2701 caggggaaaa ccttatttat cagccggaaa acctaccgga ttgatggtag tggtcaaatg 2761 gcgattaccg ttgatgttga agtggcgagc gatacaccgc atccggcgcg gattggcctg 2821 aactgccagc tggcgcaggt agcagagcgg gtaaactggc tcggattagg gccgcaagaa 2881 aactatcccg accgccttac tgccgcctgt tttgaccgct gggatctgcc attgtcagac 2941 atgtataccc cgtacgtctt cccgagcgaa aacggtctgc gctgcgggac gcgcgaattg 3001 aattatggcc cacaccagtg gcgcggcgac ttccagttca acatcagccg ctacagtcaa 3061 cagcaactga tggaaaccag ccatcgccat ctgctgcacg cggaagaagg cacatggctg 3121 aatatcgacg gtttccatat ggggattggt ggcgacgact cctggagccc gtcagtatcg 3181 gcggaattcc aactgagcgc cggtcgctac cattaccaac ttgtctggtg tcaaaaataa 3241 taggggccgc tgtcatcaga tcgccatctc gcgcccgtgc ctctgacttc taagtccaat 3301 tactcttcaa catccctaca tgctctttct ccctgtgctc ccacccccta tttttgttat 3361 tatcaaaaaa acttcttctt aatttctttg ttttttagct tcttttaagt cacctctaac 3421 aatgaaattg tgtagattca aaaatagaat taattcgtaa taaaaagtcg aaaaaaattg 3481 tgctccctcc ccccattaat aataattcta tcccaaaatc tacacaatgt tctgtgtaca 3541 cttcttatgt tttttttact tctgataaat tttttttgaa acatcataga aaaaaccgca 3601 cacaaaatac cttatcatat gttacgtttc agtttatgac cgcaattttt atttcttcgc 3661 acgtctgggc ctctcatgac gtcaaatcat gctcatcgtg aaaaagtttt ggagtatttt 3721 tggaattttt caatcaagtg aaagtttatg aaattaattt tcctgctttt gctttttggg 3781 ggtttcccct attgtttgtc aagagtttcg aggacggcgt ttttcttgct aaaatcacaa 3841 gtattgatga gcacgatgca agaaagatcg gaagaaggtt tgggtttgag gctcagtgga 3901 aggtgagtag aagttgataa tttgaaagtg gagtagtgtc tatggggttt ttgccttaaa 3961 tgacagaata cattcccaat ataccaaaca taactgttta aaattaaaca tttttctaaa 4021 ttttatatga tttcttttaa atttgcaaaa attacttaaa tttgaattcc cgcgcaaatg 4081 agtgacttca ttttctgcat tattgtgttt tccggctata ttaataggta tttgtttgtg 4141 tttttcttta ttttatgatt 
cgaactccaa tttgtaaatt ttcgaacata tttccctaaa 4201 gaaaaaatat gattaatctg gaaaaattgg aaaattattt ttcaaataaa aaacaaagaa 4261 aaaaatgaag aaaaacctat tagtttggcc ataaaacgca aaaatgtcga aaatgacgtc 4321 actcatctgc gcgggaaatc aagaataatt cggccttttt tatttttttg gaaaatcgta 4381 aaacatttag aaaaattttt taatagttat agtgggactg tattctgtca tttagggcaa 4441 aagccagaga cgctactcca ccgttaacat gaattatgaa tattattgcg acaagaccca 4501 aacattgata aaccgcaaat ctagcctact agtcggccgt acgggccctt aaggccgcct 4561 gatgcggtat tttctcctta cgcatctgtg cggtatttca caccgcatat ggtgcactct 4621 cagtacaatc tgctctgatg ccgcatagtt aagccagccc cgacacccgc caacacccgc 4681 tgacgcgccc tgacgggctt gtctgctccc ggcatccgct tacagacaag ctgtgaccgt 4741 ctccgggagc tgcatgtgtc agaggttttc accgtcatca ccgaaacgcg cgagacgaaa 4801 gggcctcgtg atacgcctat ttttataggt taatgtcatg ataataatgg tttcttagac 4861 gtcaggtggc acttttcggg gaaatgtgcg cggaacccct atttgtttat ttttctaaat 4921 acattcaaat atgtatccgc tcatgagaca ataaccctga taaatgcttc aataatacaa 4981 ttttcagaat acgttttttg tgggcttggg tatattgttt ttaatgttat acttgcagtc 5041 gtgaaatttg attttcaaat ttgtagaaaa atcaagaaaa taattgcaac attcgcttgt 5101 gtcaaaaacc aatttcaaca aattttcgtg tgagaaatac attaccagaa ggcatttttt 5161 cacacgatta gcattttgga ctactttatt aaatttttgc gtgtaatttt gaattaaatt 5221 gtattatatt actacttaaa aaacaaaaaa tttgaccact gagcggatcg aacgcccaac 5281 ctttcgatct agagtcgaac gcgctaccat tgcgccaagc agtcatgtta ttctctcttg 5341 tcattgaaaa aggaagagta tgagtattca acatttccgt gtcgccctta ttcccttttt 5401 tgcggcattt tgccttcctg tttttgctca cccagaaacg ctggtgaaag taaaagatgc 5461 tgaagatcag ttgggtgcac gagtgggtta catcgaactg gatctcaaca gcggtaagat 5521 ccttgagagt tttcgccccg aagaacgttt tccaatgatg agcactttta aagttctgct 5581 atgtggcgcg gtattatccc gtattgacgc cgggcaagag caactcggtc gccgcataca 5641 ctattctcag aatgacttgg ttgagtactc accagtcaca gaaaagcatc ttacggatgg 5701 catgacagta agagaattat gcagtgctgc cataaccatg agtgataaca ctgcggccaa 5761 cttacttctg acaacgatcg gaggaccgaa ggagctaacc gcttttttgc acaacatggg 5821 ggatcatgta actcgccttg atcgttggga 
accggagctg aatgaagcca taccaaacga 5881 cgagcgtgac accacgatgc ctgtagcaat ggcaacaacg ttgcgcaaac tattaactgg 5941 cgaactactt actctagctt cccggcaaca attaatagac tggatggagg cggataaagt 6001 tgcaggacca cttctgcgct cggcccttcc ggctggctgg tttattgctg ataaatctgg 6061 agccggtgag cgtgggtctc gcggtatcat tgcagcactg gggccagatg gtaagccctc 6121 ccgtatcgta gttatctaca cgacggggag tcaggcaact atggatgaac gaaatagaca 6181 gatcgctgag ataggtgcct cactgattaa gcattggtaa ctgtcagacc aagtttactc 6241 atatatactt tagattgatt taaaacttca tttttaattt aaaaggatct aggtgaagat 6301 cctttttgat aatctcatga ccaaaatccc ttaacgtgag ttttcgttcc actgagcgtc 6361 agaccccgta gaaaagatca aaggatcttc ttgagatcct ttttttctgc gcgtaatctg 6421 ctgcttgcaa acaaaaaaac caccgctacc agcggtggtt tgtttgccgg atcaagagct 6481 accaactctt tttccgaagg taactggctt cagcagagcg cagataccaa atactgtcct 6541 tctagtgtag ccgtagttag gccaccactt caagaactct gtagcaccgc ctacatacct 6601 cgctctgcta atcctgttac cagtggctgc tgccagtggc gataagtcgt gtcttaccgg 6661 gttggactca agacgatagt taccggataa ggcgcagcgg tcgggctgaa cggggggttc 6721 gtgcacacag cccagcttgg agcgaacgac ctacaccgaa ctgagatacc tacagcgtga 6781 gctatgagaa agcgccacgc ttcccgaagg gagaaaggcg gacaggtatc cggtaagcgg 6841 cagggtcgga acaggagagc gcacgaggga gcttccaggg ggaaacgcct ggtatcttta 6901 tagtcctgtc gggtttcgcc acctctgact tgagcgtcga tttttgtgat gctcgtcagg 6961 ggggcggagc ctatggaaaa acgccagcaa cgcggccttt ttacggttcc tggccttttg 7021 ctggcctttt gctcacatgt tctttcctgc gttatcccct gattctgtgg ataaccgtat 7081 taccgccttt gagtgagctg ataccgctcg ccgcagccga acgaccgagc gcagcgagtc 7141 agtgagcgag gaagcggaag agcgcccaat acgcaaaccg cctctccccg cgcgttggcc 7201 gattcattaa tgcagctggc acgacaggtt tcccgactgg aaagcgggca gtgagcgcaa 7261 cgcaattaat gtgagttagc tcactcatta ggcaccccag gctttacact ttatgcttcc 7321 ggctcgtatg ttgtgtggaa ttgtgagcgg ataacaattt cacacaggaa acagct // LOCUS SYNLACZD 5730 bp ds-DNA SYN 03-JUL-1990 DEFINITION Cloning vector pPD16.43. ACCESSION M34299 KEYWORDS lacZ. SOURCE Cloning vector pPD16.43. 
ORGANISM Cloning vector Artificial sequences; Cloning vehicles. REFERENCE 1 (bases 1 to 5730) AUTHORS Fire,A.Z., Harrison,S. and Dixon,D. TITLE A modular set of lac-Z fusion vectors for studying gene expression in C.elegans JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable [or printed] sequence for [1] kindly submitted by A.Z.Fire, 11-MAY-1990. Author address: A.Z.Fire Carnegie Inst of Washington Dept Embryology 115 West Univ Parkway Baltimore, MD 21210 email: AZF@JHUIGF.BITNET FEATURES from to/span description recomb 57 58 pUC19 end/synthetic start recomb 102 103 synthetic end/E.coli trpS start recomb 186 187 E.coli trpS end/synthetic start recomb 190 191 synthetic end/E.coli lacZ start recomb 3184 3185 E.coli lacZ end/synthetic start recomb 3264 3265 synthetic end/unknown DNA start recomb 3513 3514 unknown DNA end/synthetic start recomb 3524 3525 synthetic end/pUC19 start BASE COUNT 1348 a 1488 c 1580 g 1314 t ORIGIN 1 atgaccatga ttacgccaag cttgcatgcc tgcaggtcga ctctagagga tccccgggta 61 ccgagctcag aaaaaatgac tgctccaaag aagaagcgta aggtaccggt gggtgaagac 121 cagaaacagc acctcgaact gagccgcgat attgcccagc gtttcaacgc gctgtatggc 181 gagatcgatc ccgtcgtttt acaacgtcgt gactgggaaa accctggcgt tacccaactt 241 aatcgccttg cagcacatcc ccctttcgcc agctggcgta atagcgaaga ggcccgcacc 301 gatcgccctt cccaacagtt gcgcagcctg aatggcgaat ggcgctttgc ctggtttccg 361 gcaccagaag cggtgccgga aagctggctg gagtgcgatc ttcctgaggc cgatactgtc 421 gtcgtcccct caaactggca gatgcacggt tacgatgcgc ccatctacac caacgtaacc 481 tatcccatta cggtcaatcc gccgtttgtt cccacggaga atccgacggg ttgttactcg 541 ctcacattta atgttgatga aagctggcta caggaaggcc agacgcgaat tatttttgat 601 ggcgttaact cggcgtttca tctgtggtgc aacgggcgct gggtcggtta cggccaggac 661 agtcgtttgc cgtctgaatt tgacctgagc gcatttttac gcgccggaga aaaccgcctc 721 gcggtgatgg tgctgcgttg gagtgacggc agttatctgg aagatcagga tatgtggcgg 781 atgagcggca ttttccgtga cgtctcgttg ctgcataaac cgactacaca aatcagcgat 841 ttccatgttg ccactcgctt taatgatgat 
ttcagccgcg ctgtactgga ggctgaagtt 901 cagatgtgcg gcgagttgcg tgactaccta cgggtaacag tttctttatg gcagggtgaa 961 acgcaggtcg ccagcggcac cgcgcctttc ggcggtgaaa ttatcgatga gcgtggtggt 1021 tatgccgatc gcgtcacact acgtctgaac gtcgaaaacc cgaaactgtg gagcgccgaa 1081 atcccgaatc tctatcgtgc ggtggttgaa ctgcacaccg ccgacggcac gctgattgaa 1141 gcagaagcct gcgatgtcgg tttccgcgag gtgcggattg aaaatggtct gctgctgctg 1201 aacggcaagc cgttgctgat tcgaggcgtt aaccgtcacg agcatcatcc tctgcatggt 1261 caggtcatgg atgagcagac gatggtgcag gatatcctgc tgatgaagca gaacaacttt 1321 aacgccgtgc gctgttcgca ttatccgaac catccgctgt ggtacacgct gtgcgaccgc 1381 tacggcctgt atgtggtgga tgaagccaat attgaaaccc acggcatggt gccaatgaat 1441 cgtctgaccg atgatccgcg ctggctaccg gcgatgagcg aacgcgtaac gcgaatggtg 1501 cagcgcgatc gtaatcaccc gagtgtgatc atctggtcgc tggggaatga atcaggccac 1561 ggcgctaatc acgacgcgct gtatcgctgg atcaaatctg tcgatccttc ccgcccggtg 1621 cagtatgaag gcggcggagc cgacaccacg gccaccgata ttatttgccc gatgtacgcg 1681 cgcgtggatg aagaccagcc cttcccggct gtgccgaaat ggtccatcaa aaaatggctt 1741 tcgctacctg gagagacgcg cccgctgatc ctttgcgaat acgcccacgc gatgggtaac 1801 agtcttggcg gtttcgctaa atactggcag gcgtttcgtc agtatccccg tttacagggc 1861 ggcttcgtct gggactgggt ggatcagtcg ctgattaaat atgatgaaaa cggcaacccg 1921 tggtcggctt acggcggtga ttttggcgat acgccgaacg atcgccagtt ctgtatgaac 1981 ggtctggtct ttgccgaccg cacgccgcat ccagcgctga cggaagcaaa acaccagcag 2041 cagtttttcc agttccgttt atccgggcaa accatcgaag tgaccagcga atacctgttc 2101 cgtcatagcg ataacgagct cctgcactgg atggtggcgc tggatggtaa gccgctggca 2161 agcggtgaag tgcctctgga tgtcgctcca caaggtaaac agttgattga actgcctgaa 2221 ctaccgcagc cggagagcgc cgggcaactc tggctcacag tacgcgtagt gcaaccgaac 2281 gcgaccgcat ggtcagaagc cgggcacatc agcgcctggc agcagtggcg tctggcggaa 2341 aacctcagtg tgacgctccc cgccgcgtcc cacgccatcc cgcatctgac caccagcgaa 2401 atggattttt gcatcgagct gggtaataag cgttggcaat ttaaccgcca gtcaggcttt 2461 ctttcacaga tgtggattgg cgataaaaaa caactgctga cgccgctgcg cgatcagttc 2521 acccgtgcac cgctggataa cgacattggc gtaagtgaag 
cgacccgcat tgaccctaac 2581 gcctgggtcg aacgctggaa ggcggcgggc cattaccagg ccgaagcagc gttgttgcag 2641 tgcacggcag atacacttgc tgatgcggtg ctgattacga ccgctcacgc gtggcagcat 2701 caggggaaaa ccttatttat cagccggaaa acctaccgga ttgatggtag tggtcaaatg 2761 gcgattaccg ttgatgttga agtggcgagc gatacaccgc atccggcgcg gattggcctg 2821 aactgccagc tggcgcaggt agcagagcgg gtaaactggc tcggattagg gccgcaagaa 2881 aactatcccg accgccttac tgccgcctgt tttgaccgct gggatctgcc attgtcagac 2941 atgtataccc cgtacgtctt cccgagcgaa aacggtctgc gctgcgggac gcgcgaattg 3001 aattatggcc cacaccagtg gcgcggcgac ttccagttca acatcagccg ctacagtcaa 3061 cagcaactga tggaaaccag ccatcgccat ctgctgcacg cggaagaagg cacatggctg 3121 aatatcgacg gtttccatat ggggattggt ggcgacgact cctggagccc gtcagtatcg 3181 gcggaattcc aactgagcgc cggtcgctac cattaccaac ttgtctggtg tcaaaaataa 3241 taggcctact agtcggccgt acgggccctt tcgtctcgcg cgtttcggtg atgacggtga 3301 aaacctctga cacatgcagc tcccggagac ggtcacagct tgtctgtaag cggatgccgg 3361 gagcagacaa gcccgtcagg gcgcgtcagc gggtgttggc gggtgtcggg gctggcttaa 3421 ctatgcggca tcagagcaga ttgtactgag agtgcaccat atgcggtgtg aaataccgca 3481 cagatgcgta aggagaaaat accgcatcag gcggccttaa gggcctcgtg atacgcctat 3541 ttttataggt taatgtcatg ataataatgg tttcttagac gtcaggtggc acttttcggg 3601 gaaatgtgcg cggaacccct atttgtttat ttttctaaat acattcaaat atgtatccgc 3661 tcatgagaca ataaccctga taaatgcttc aataatattg aaaaaggaag agtatgagta 3721 ttcaacattt ccgtgtcgcc cttattccct tttttgcggc attttgcctt cctgtttttg 3781 ctcacccaga aacgctggtg aaagtaaaag atgctgaaga tcagttgggt gcacgagtgg 3841 gttacatcga actggatctc aacagcggta agatccttga gagttttcgc cccgaagaac 3901 gttttccaat gatgagcact tttaaagttc tgctatgtgg cgcggtatta tcccgtattg 3961 acgccgggca agagcaactc ggtcgccgca tacactattc tcagaatgac ttggttgagt 4021 actcaccagt cacagaaaag catcttacgg atggcatgac agtaagagaa ttatgcagtg 4081 ctgccataac catgagtgat aacactgcgg ccaacttact tctgacaacg atcggaggac 4141 cgaaggagct aaccgctttt ttgcacaaca tgggggatca tgtaactcgc cttgatcgtt 4201 gggaaccgga gctgaatgaa gccataccaa acgacgagcg tgacaccacg 
atgcctgtag 4261 caatggcaac aacgttgcgc aaactattaa ctggcgaact acttactcta gcttcccggc 4321 aacaattaat agactggatg gaggcggata aagttgcagg accacttctg cgctcggccc 4381 ttccggctgg ctggtttatt gctgataaat ctggagccgg tgagcgtggg tctcgcggta 4441 tcattgcagc actggggcca gatggtaagc cctcccgtat cgtagttatc tacacgacgg 4501 ggagtcaggc aactatggat gaacgaaata gacagatcgc tgagataggt gcctcactga 4561 ttaagcattg gtaactgtca gaccaagttt actcatatat actttagatt gatttaaaac 4621 ttcattttta atttaaaagg atctaggtga agatcctttt tgataatctc atgaccaaaa 4681 tcccttaacg tgagttttcg ttccactgag cgtcagaccc cgtagaaaag atcaaaggat 4741 cttcttgaga tccttttttt ctgcgcgtaa tctgctgctt gcaaacaaaa aaaccaccgc 4801 taccagcggt ggtttgtttg ccggatcaag agctaccaac tctttttccg aaggtaactg 4861 gcttcagcag agcgcagata ccaaatactg tccttctagt gtagccgtag ttaggccacc 4921 acttcaagaa ctctgtagca ccgcctacat acctcgctct gctaatcctg ttaccagtgg 4981 ctgctgccag tggcgataag tcgtgtctta ccgggttgga ctcaagacga tagttaccgg 5041 ataaggcgca gcggtcgggc tgaacggggg gttcgtgcac acagcccagc ttggagcgaa 5101 cgacctacac cgaactgaga tacctacagc gtgagcattg agaaagcgcc acgcttcccg 5161 aagggagaaa ggcggacagg tatccggtaa gcggcagggt cggaacagga gagcgcacga 5221 gggagcttcc agggggaaac gcctggtatc tttatagtcc tgtcgggttt cgccacctct 5281 gacttgagcg tcgatttttg tgatgctcgt caggggggcg gagcctatgg aaaaacgcca 5341 gcaacgcggc ctttttacgg ttcctggcct tttgctggcc ttttgctcac atgttctttc 5401 ctgcgttatc ccctgattct gtggataacc gtattaccgc ctttgagtga gctgataccg 5461 ctcgccgcag ccgaacgacc gagcgcagcg agtcagtgag cgaggaagcg gaagagcgcc 5521 caatacgcaa accgcctctc cccgcgcgtt ggccgattca ttaatgcagc tggcacgaca 5581 ggtttcccga ctggaaagcg ggcagtgagc gcaacgcaat taatgtgagt tagctcactc 5641 attaggcacc ccaggcttta cactttatgc ttccggctcg tatgttgtgt ggaattgtga 5701 gcggataaca atttcacaca ggaaacagct // LOCUS SYNLACZE 7010 bp ds-DNA SYN 03-JUL-1990 DEFINITION Cloning vector pPD16.51. ACCESSION M34300 KEYWORDS lacZ. SOURCE Cloning vector pPD16.51. ORGANISM Cloning vector Artificial sequences; Cloning vehicles. 
REFERENCE 1 (bases 1 to 7010) AUTHORS Fire,A.Z., Harrison,S. and Dixon,D. TITLE A modular set of lac-Z fusion vectors for studying gene expression in C.elegans JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable [or printed] sequence for [1] kindly submitted by A.Z.Fire, 11-MAY-1990. Author address: A.Z.Fire Carnegie Inst of Washington Dept Embryology 115 West Univ Parkway Baltimore, MD 21210 email: AZF@JHUIGF.BITNET FEATURES from to/span description recomb 57 58 pUC19 end/synthetic start recomb 102 103 synthetic end/E.coli trpS start recomb 186 187 E.coli trpS end/synthetic start recomb 190 191 synthetic end/E.coli lacZ start recomb 3184 3185 E.coli lacZ end/synthetic start recomb 4544 4545 synthetic end/pUC19 start recomb 4793 4794 pUC19 end/synthetic start recomb 3244 3245 synthetic end/unknown DNA start recomb 4524 4525 unknown DNA end/synthetic start recomb 4804 4805 synthetic end/pUC19 start BASE COUNT 1753 a 1710 c 1778 g 1769 t ORIGIN 1 atgaccatga ttacgccaag cttgcatgcc tgcaggtcga ctctagagga tccccgggta 61 ccgagctcag aaaaaatgac tgctccaaag aagaagcgta aggtaccggt gggtgaagac 121 cagaaacagc acctcgaact gagccgcgat attgcccagc gtttcaacgc gctgtatggc 181 gagatcgatc ccgtcgtttt acaacgtcgt gactgggaaa accctggcgt tacccaactt 241 aatcgccttg cagcacatcc ccctttcgcc agctggcgta atagcgaaga ggcccgcacc 301 gatcgccctt cccaacagtt gcgcagcctg aatggcgaat ggcgctttgc ctggtttccg 361 gcaccagaag cggtgccgga aagctggctg gagtgcgatc ttcctgaggc cgatactgtc 421 gtcgtcccct caaactggca gatgcacggt tacgatgcgc ccatctacac caacgtaacc 481 tatcccatta cggtcaatcc gccgtttgtt cccacggaga atccgacggg ttgttactcg 541 ctcacattta atgttgatga aagctggcta caggaaggcc agacgcgaat tatttttgat 601 ggcgttaact cggcgtttca tctgtggtgc aacgggcgct gggtcggtta cggccaggac 661 agtcgtttgc cgtctgaatt tgacctgagc gcatttttac gcgccggaga aaaccgcctc 721 gcggtgatgg tgctgcgttg gagtgacggc agttatctgg aagatcagga tatgtggcgg 781 atgagcggca ttttccgtga cgtctcgttg ctgcataaac cgactacaca aatcagcgat 841 ttccatgttg 
ccactcgctt taatgatgat ttcagccgcg ctgtactgga ggctgaagtt 901 cagatgtgcg gcgagttgcg tgactaccta cgggtaacag tttctttatg gcagggtgaa 961 acgcaggtcg ccagcggcac cgcgcctttc ggcggtgaaa ttatcgatga gcgtggtggt 1021 tatgccgatc gcgtcacact acgtctgaac gtcgaaaacc cgaaactgtg gagcgccgaa 1081 atcccgaatc tctatcgtgc ggtggttgaa ctgcacaccg ccgacggcac gctgattgaa 1141 gcagaagcct gcgatgtcgg tttccgcgag gtgcggattg aaaatggtct gctgctgctg 1201 aacggcaagc cgttgctgat tcgaggcgtt aaccgtcacg agcatcatcc tctgcatggt 1261 caggtcatgg atgagcagac gatggtgcag gatatcctgc tgatgaagca gaacaacttt 1321 aacgccgtgc gctgttcgca ttatccgaac catccgctgt ggtacacgct gtgcgaccgc 1381 tacggcctgt atgtggtgga tgaagccaat attgaaaccc acggcatggt gccaatgaat 1441 cgtctgaccg atgatccgcg ctggctaccg gcgatgagcg aacgcgtaac gcgaatggtg 1501 cagcgcgatc gtaatcaccc gagtgtgatc atctggtcgc tggggaatga atcaggccac 1561 ggcgctaatc acgacgcgct gtatcgctgg atcaaatctg tcgatccttc ccgcccggtg 1621 cagtatgaag gcggcggagc cgacaccacg gccaccgata ttatttgccc gatgtacgcg 1681 cgcgtggatg aagaccagcc cttcccggct gtgccgaaat ggtccatcaa aaaatggctt 1741 tcgctacctg gagagacgcg cccgctgatc ctttgcgaat acgcccacgc gatgggtaac 1801 agtcttggcg gtttcgctaa atactggcag gcgtttcgtc agtatccccg tttacagggc 1861 ggcttcgtct gggactgggt ggatcagtcg ctgattaaat atgatgaaaa cggcaacccg 1921 tggtcggctt acggcggtga ttttggcgat acgccgaacg atcgccagtt ctgtatgaac 1981 ggtctggtct ttgccgaccg cacgccgcat ccagcgctga cggaagcaaa acaccagcag 2041 cagtttttcc agttccgttt atccgggcaa accatcgaag tgaccagcga atacctgttc 2101 cgtcatagcg ataacgagct cctgcactgg atggtggcgc tggatggtaa gccgctggca 2161 agcggtgaag tgcctctgga tgtcgctcca caaggtaaac agttgattga actgcctgaa 2221 ctaccgcagc cggagagcgc cgggcaactc tggctcacag tacgcgtagt gcaaccgaac 2281 gcgaccgcat ggtcagaagc cgggcacatc agcgcctggc agcagtggcg tctggcggaa 2341 aacctcagtg tgacgctccc cgccgcgtcc cacgccatcc cgcatctgac caccagcgaa 2401 atggattttt gcatcgagct gggtaataag cgttggcaat ttaaccgcca gtcaggcttt 2461 ctttcacaga tgtggattgg cgataaaaaa caactgctga cgccgctgcg cgatcagttc 2521 acccgtgcac cgctggataa 
cgacattggc gtaagtgaag cgacccgcat tgaccctaac 2581 gcctgggtcg aacgctggaa ggcggcgggc cattaccagg ccgaagcagc gttgttgcag 2641 tgcacggcag atacacttgc tgatgcggtg ctgattacga ccgctcacgc gtggcagcat 2701 caggggaaaa ccttatttat cagccggaaa acctaccgga ttgatggtag tggtcaaatg 2761 gcgattaccg ttgatgttga agtggcgagc gatacaccgc atccggcgcg gattggcctg 2821 aactgccagc tggcgcaggt agcagagcgg gtaaactggc tcggattagg gccgcaagaa 2881 aactatcccg accgccttac tgccgcctgt tttgaccgct gggatctgcc attgtcagac 2941 atgtataccc cgtacgtctt cccgagcgaa aacggtctgc gctgcgggac gcgcgaattg 3001 aattatggcc cacaccagtg gcgcggcgac ttccagttca acatcagccg ctacagtcaa 3061 cagcaactga tggaaaccag ccatcgccat ctgctgcacg cggaagaagg cacatggctg 3121 aatatcgacg gtttccatat ggggattggt ggcgacgact cctggagccc gtcagtatcg 3181 gcggaattcc aactgagcgc cggtcgctac cattaccaac ttgtctggtg tcaaaaataa 3241 taggggccgc tgtcatcaga tcgccatctc gcgcccgtgc ctctgacttc taagtccaat 3301 tactcttcaa catccctaca tgctctttct ccctgtgctc ccacccccta tttttgttat 3361 tatcaaaaaa acttcttctt aatttctttg ttttttagct tcttttaagt cacctctaac 3421 aatgaaattg tgtagattca aaaatagaat taattcgtaa taaaaagtcg aaaaaaattg 3481 tgctccctcc ccccattaat aataattcta tcccaaaatc tacacaatgt tctgtgtaca 3541 cttcttatgt tttttttact tctgataaat tttttttgaa acatcataga aaaaaccgca 3601 cacaaaatac cttatcatat gttacgtttc agtttatgac cgcaattttt atttcttcgc 3661 acgtctgggc ctctcatgac gtcaaatcat gctcatcgtg aaaaagtttt ggagtatttt 3721 tggaattttt caatcaagtg aaagtttatg aaattaattt tcctgctttt gctttttggg 3781 ggtttcccct attgtttgtc aagagtttcg aggacggcgt ttttcttgct aaaatcacaa 3841 gtattgatga gcacgatgca agaaagatcg gaagaaggtt tgggtttgag gctcagtgga 3901 aggtgagtag aagttgataa tttgaaagtg gagtagtgtc tatggggttt ttgccttaaa 3961 tgacagaata cattcccaat ataccaaaca taactgttta aaattaaaca tttttctaaa 4021 ttttatatga tttcttttaa atttgcaaaa attacttaaa tttgaattcc cgcgcaaatg 4081 agtgacttca ttttctgcat tattgtgttt tccggctata ttaataggta tttgtttgtg 4141 tttttcttta ttttatgatt cgaactccaa tttgtaaatt ttcgaacata tttccctaaa 4201 gaaaaaatat gattaatctg gaaaaattgg 
aaaattattt ttcaaataaa aaacaaagaa 4261 aaaaatgaag aaaaacctat tagtttggcc ataaaacgca aaaatgtcga aaatgacgtc 4321 actcatctgc gcgggaaatc aagaataatt cggccttttt tatttttttg gaaaatcgta 4381 aaacatttag aaaaattttt taatagttat agtgggactg tattctgtca tttagggcaa 4441 aagccagaga cgctactcca ccgttaacat gaattatgaa tattattgcg acaagaccca 4501 aacattgata aaccgcaaat ctagcctact agtcggccgt acgggccctt tcgtctcgcg 4561 cgtttcggtg atgacggtga aaacctctga cacatgcagc tcccggagac ggtcacagct 4621 tgtctgtaag cggatgccgg gagcagacaa gcccgtcagg gcgcgtcagc gggtgttggc 4681 gggtgtcggg gctggcttaa ctatgcggca tcagagcaga ttgtactgag agtgcaccat 4741 atgcggtgtg aaataccgca cagatgcgta aggagaaaat accgcatcag gcggccttaa 4801 gggcctcgtg atacgcctat ttttataggt taatgtcatg ataataatgg tttcttagac 4861 gtcaggtggc acttttcggg gaaatgtgcg cggaacccct atttgtttat ttttctaaat 4921 acattcaaat atgtatccgc tcatgagaca ataaccctga taaatgcttc aataatattg 4981 aaaaaggaag agtatgagta ttcaacattt ccgtgtcgcc cttattccct tttttgcggc 5041 attttgcctt cctgtttttg ctcacccaga aacgctggtg aaagtaaaag atgctgaaga 5101 tcagttgggt gcacgagtgg gttacatcga actggatctc aacagcggta agatccttga 5161 gagttttcgc cccgaagaac gttttccaat gatgagcact tttaaagttc tgctatgtgg 5221 cgcggtatta tcccgtattg acgccgggca agagcaactc ggtcgccgca tacactattc 5281 tcagaatgac ttggttgagt actcaccagt cacagaaaag catcttacgg atggcatgac 5341 agtaagagaa ttatgcagtg ctgccataac catgagtgat aacactgcgg ccaacttact 5401 tctgacaacg atcggaggac cgaaggagct aaccgctttt ttgcacaaca tgggggatca 5461 tgtaactcgc cttgatcgtt gggaaccgga gctgaatgaa gccataccaa acgacgagcg 5521 tgacaccacg atgcctgtag caatggcaac aacgttgcgc aaactattaa ctggcgaact 5581 acttactcta gcttcccggc aacaattaat agactggatg gaggcggata aagttgcagg 5641 accacttctg cgctcggccc ttccggctgg ctggtttatt gctgataaat ctggagccgg 5701 tgagcgtggg tctcgcggta tcattgcagc actggggcca gatggtaagc cctcccgtat 5761 cgtagttatc tacacgacgg ggagtcaggc aactatggat gaacgaaata gacagatcgc 5821 tgagataggt gcctcactga ttaagcattg gtaactgtca gaccaagttt actcatatat 5881 actttagatt gatttaaaac ttcattttta atttaaaagg 
atctaggtga agatcctttt 5941 tgataatctc atgaccaaaa tcccttaacg tgagttttcg ttccactgag cgtcagaccc 6001 cgtagaaaag atcaaaggat cttcttgaga tccttttttt ctgcgcgtaa tctgctgctt 6061 gcaaacaaaa aaaccaccgc taccagcggt ggtttgtttg ccggatcaag agctaccaac 6121 tctttttccg aaggtaactg gcttcagcag agcgcagata ccaaatactg tccttctagt 6181 gtagccgtag ttaggccacc acttcaagaa ctctgtagca ccgcctacat acctcgctct 6241 gctaatcctg ttaccagtgg ctgctgccag tggcgataag tcgtgtctta ccgggttgga 6301 ctcaagacga tagttaccgg ataaggcgca gcggtcgggc tgaacggggg gttcgtgcac 6361 acagcccagc ttggagcgaa cgacctacac cgaactgaga tacctacagc gtgagcattg 6421 agaaagcgcc acgcttcccg aagggagaaa ggcggacagg tatccggtaa gcggcagggt 6481 cggaacagga gagcgcacga gggagcttcc agggggaaac gcctggtatc tttatagtcc 6541 tgtcgggttt cgccacctct gacttgagcg tcgatttttg tgatgctcgt caggggggcg 6601 gagcctatgg aaaaacgcca gcaacgcggc ctttttacgg ttcctggcct tttgctggcc 6661 ttttgctcac atgttctttc ctgcgttatc ccctgattct gtggataacc gtattaccgc 6721 ctttgagtga gctgataccg ctcgccgcag ccgaacgacc gagcgcagcg agtcagtgag 6781 cgaggaagcg gaagagcgcc caatacgcaa accgcctctc cccgcgcgtt ggccgattca 6841 ttaatgcagc tggcacgaca ggtttcccga ctggaaagcg ggcagtgagc gcaacgcaat 6901 taatgtgagt tagctcactc attaggcacc ccaggcttta cactttatgc ttccggctcg 6961 tatgttgtgt ggaattgtga gcggataaca atttcacaca ggaaacagct // LOCUS SYNLACZF 7088 bp ds-DNA SYN 03-JUL-1990 DEFINITION Cloning vector pPD18.32. ACCESSION M34301 KEYWORDS lacZ. SOURCE Cloning vector pPD18.32. ORGANISM Cloning vector Artificial sequences; Cloning vehicles. REFERENCE 1 (bases 1 to 7088) AUTHORS Fire,A.Z., Harrison,S. and Dixon,D. TITLE A modular set of lac-Z fusion vectors for studying gene expression in C.elegans JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable [or printed] sequence for [1] kindly submitted by A.Z.Fire, 11-MAY-1990. 
Author address: A.Z.Fire Carnegie Inst of Washington Dept Embryology 115 West Univ Parkway Baltimore, MD 21210 email: AZF@JHUIGF.BITNET FEATURES from to/span description recomb 57 58 pUC19 end/synthetic start recomb 180 181 synthetic end/E.coli trpS start recomb 264 265 E.coli trpS end/synthetic start recomb 268 269 synthetic end/E.coli lacZ start recomb 3262 3263 E.coli lacZ end/synthetic start recomb 4622 4623 synthetic end/pUC19 start recomb 4871 4872 pUC19 end/synthetic start recomb 3322 3323 synthetic end/unknown DNA start recomb 4602 4603 unknown DNA end/synthetic start recomb 4882 4883 synthetic end/pUC19 start BASE COUNT 1780 a 1724 c 1797 g 1787 t ORIGIN 1 atgaccatga ttacgccaag cttgcatgcc tgcaggtcga ctctagagga tccccgggat 61 tggccaaagg acccaaaggt atgtttcgaa tgatactaac ataacataga acattttcag 121 gaggaccctt ggagggtacc gagctcagaa aaaatgactg ctccaaagaa gaagcgtaag 181 gtaccggtgg gtgaagacca gaaacagcac ctcgaactga gccgcgatat tgcccagcgt 241 ttcaacgcgc tgtatggcga gatcgatccc gtcgttttac aacgtcgtga ctgggaaaac 301 cctggcgtta cccaacttaa tcgccttgca gcacatcccc ctttcgccag ctggcgtaat 361 agcgaagagg cccgcaccga tcgcccttcc caacagttgc gcagcctgaa tggcgaatgg 421 cgctttgcct ggtttccggc accagaagcg gtgccggaaa gctggctgga gtgcgatctt 481 cctgaggccg atactgtcgt cgtcccctca aactggcaga tgcacggtta cgatgcgccc 541 atctacacca acgtaaccta tcccattacg gtcaatccgc cgtttgttcc cacggagaat 601 ccgacgggtt gttactcgct cacatttaat gttgatgaaa gctggctaca ggaaggccag 661 acgcgaatta tttttgatgg cgttaactcg gcgtttcatc tgtggtgcaa cgggcgctgg 721 gtcggttacg gccaggacag tcgtttgccg tctgaatttg acctgagcgc atttttacgc 781 gccggagaaa accgcctcgc ggtgatggtg ctgcgttgga gtgacggcag ttatctggaa 841 gatcaggata tgtggcggat gagcggcatt ttccgtgacg tctcgttgct gcataaaccg 901 actacacaaa tcagcgattt ccatgttgcc actcgcttta atgatgattt cagccgcgct 961 gtactggagg ctgaagttca gatgtgcggc gagttgcgtg actacctacg ggtaacagtt 1021 tctttatggc agggtgaaac gcaggtcgcc agcggcaccg cgcctttcgg cggtgaaatt 1081 atcgatgagc gtggtggtta tgccgatcgc gtcacactac gtctgaacgt 
cgaaaacccg 1141 aaactgtgga gcgccgaaat cccgaatctc tatcgtgcgg tggttgaact gcacaccgcc 1201 gacggcacgc tgattgaagc agaagcctgc gatgtcggtt tccgcgaggt gcggattgaa 1261 aatggtctgc tgctgctgaa cggcaagccg ttgctgattc gaggcgttaa ccgtcacgag 1321 catcatcctc tgcatggtca ggtcatggat gagcagacga tggtgcagga tatcctgctg 1381 atgaagcaga acaactttaa cgccgtgcgc tgttcgcatt atccgaacca tccgctgtgg 1441 tacacgctgt gcgaccgcta cggcctgtat gtggtggatg aagccaatat tgaaacccac 1501 ggcatggtgc caatgaatcg tctgaccgat gatccgcgct ggctaccggc gatgagcgaa 1561 cgcgtaacgc gaatggtgca gcgcgatcgt aatcacccga gtgtgatcat ctggtcgctg 1621 gggaatgaat caggccacgg cgctaatcac gacgcgctgt atcgctggat caaatctgtc 1681 gatccttccc gcccggtgca gtatgaaggc ggcggagccg acaccacggc caccgatatt 1741 atttgcccga tgtacgcgcg cgtggatgaa gaccagccct tcccggctgt gccgaaatgg 1801 tccatcaaaa aatggctttc gctacctgga gagacgcgcc cgctgatcct ttgcgaatac 1861 gcccacgcga tgggtaacag tcttggcggt ttcgctaaat actggcaggc gtttcgtcag 1921 tatccccgtt tacagggcgg cttcgtctgg gactgggtgg atcagtcgct gattaaatat 1981 gatgaaaacg gcaacccgtg gtcggcttac ggcggtgatt ttggcgatac gccgaacgat 2041 cgccagttct gtatgaacgg tctggtcttt gccgaccgca cgccgcatcc agcgctgacg 2101 gaagcaaaac accagcagca gtttttccag ttccgtttat ccgggcaaac catcgaagtg 2161 accagcgaat acctgttccg tcatagcgat aacgagctcc tgcactggat ggtggcgctg 2221 gatggtaagc cgctggcaag cggtgaagtg cctctggatg tcgctccaca aggtaaacag 2281 ttgattgaac tgcctgaact accgcagccg gagagcgccg ggcaactctg gctcacagta 2341 cgcgtagtgc aaccgaacgc gaccgcatgg tcagaagccg ggcacatcag cgcctggcag 2401 cagtggcgtc tggcggaaaa cctcagtgtg acgctccccg ccgcgtccca cgccatcccg 2461 catctgacca ccagcgaaat ggatttttgc atcgagctgg gtaataagcg ttggcaattt 2521 aaccgccagt caggctttct ttcacagatg tggattggcg ataaaaaaca actgctgacg 2581 ccgctgcgcg atcagttcac ccgtgcaccg ctggataacg acattggcgt aagtgaagcg 2641 acccgcattg accctaacgc ctgggtcgaa cgctggaagg cggcgggcca ttaccaggcc 2701 gaagcagcgt tgttgcagtg cacggcagat acacttgctg atgcggtgct gattacgacc 2761 gctcacgcgt ggcagcatca ggggaaaacc ttatttatca gccggaaaac ctaccggatt 
2821 gatggtagtg gtcaaatggc gattaccgtt gatgttgaag tggcgagcga tacaccgcat 2881 ccggcgcgga ttggcctgaa ctgccagctg gcgcaggtag cagagcgggt aaactggctc 2941 ggattagggc cgcaagaaaa ctatcccgac cgccttactg ccgcctgttt tgaccgctgg 3001 gatctgccat tgtcagacat gtataccccg tacgtcttcc cgagcgaaaa cggtctgcgc 3061 tgcgggacgc gcgaattgaa ttatggccca caccagtggc gcggcgactt ccagttcaac 3121 atcagccgct acagtcaaca gcaactgatg gaaaccagcc atcgccatct gctgcacgcg 3181 gaagaaggca catggctgaa tatcgacggt ttccatatgg ggattggtgg cgacgactcc 3241 tggagcccgt cagtatcggc ggaattccaa ctgagcgccg gtcgctacca ttaccaactt 3301 gtctggtgtc aaaaataata ggggccgctg tcatcagatc gccatctcgc gcccgtgcct 3361 ctgacttcta agtccaatta ctcttcaaca tccctacatg ctctttctcc ctgtgctccc 3421 accccctatt tttgttatta tcaaaaaaac ttcttcttaa tttctttgtt ttttagcttc 3481 ttttaagtca cctctaacaa tgaaattgtg tagattcaaa aatagaatta attcgtaata 3541 aaaagtcgaa aaaaattgtg ctccctcccc ccattaataa taattctatc ccaaaatcta 3601 cacaatgttc tgtgtacact tcttatgttt tttttacttc tgataaattt tttttgaaac 3661 atcatagaaa aaaccgcaca caaaatacct tatcatatgt tacgtttcag tttatgaccg 3721 caatttttat ttcttcgcac gtctgggcct ctcatgacgt caaatcatgc tcatcgtgaa 3781 aaagttttgg agtatttttg gaatttttca atcaagtgaa agtttatgaa attaattttc 3841 ctgcttttgc tttttggggg tttcccctat tgtttgtcaa gagtttcgag gacggcgttt 3901 ttcttgctaa aatcacaagt attgatgagc acgatgcaag aaagatcgga agaaggtttg 3961 ggtttgaggc tcagtggaag gtgagtagaa gttgataatt tgaaagtgga gtagtgtcta 4021 tggggttttt gccttaaatg acagaataca ttcccaatat accaaacata actgtttaaa 4081 attaaacatt tttctaaatt ttatatgatt tcttttaaat ttgcaaaaat tacttaaatt 4141 tgaattcccg cgcaaatgag tgacttcatt ttctgcatta ttgtgttttc cggctatatt 4201 aataggtatt tgtttgtgtt tttctttatt ttatgattcg aactccaatt tgtaaatttt 4261 cgaacatatt tccctaaaga aaaaatatga ttaatctgga aaaattggaa aattattttt 4321 caaataaaaa acaaagaaaa aaatgaagaa aaacctatta gtttggccat aaaacgcaaa 4381 aatgtcgaaa atgacgtcac tcatctgcgc gggaaatcaa gaataattcg gcctttttta 4441 tttttttgga aaatcgtaaa acatttagaa aaatttttta atagttatag tgggactgta 4501 
ttctgtcatt tagggcaaaa gccagagacg ctactccacc gttaacatga attatgaata 4561 ttattgcgac aagacccaaa cattgataaa ccgcaaatct agcctactag tcggccgtac 4621 gggccctttc gtctcgcgcg tttcggtgat gacggtgaaa acctctgaca catgcagctc 4681 ccggagacgg tcacagcttg tctgtaagcg gatgccggga gcagacaagc ccgtcagggc 4741 gcgtcagcgg gtgttggcgg gtgtcggggc tggcttaact atgcggcatc agagcagatt 4801 gtactgagag tgcaccatat gcggtgtgaa ataccgcaca gatgcgtaag gagaaaatac 4861 cgcatcaggc ggccttaagg gcctcgtgat acgcctattt ttataggtta atgtcatgat 4921 aataatggtt tcttagacgt caggtggcac ttttcgggga aatgtgcgcg gaacccctat 4981 ttgtttattt ttctaaatac attcaaatat gtatccgctc atgagacaat aaccctgata 5041 aatgcttcaa taatattgaa aaaggaagag tatgagtatt caacatttcc gtgtcgccct 5101 tattcccttt tttgcggcat tttgccttcc tgtttttgct cacccagaaa cgctggtgaa 5161 agtaaaagat gctgaagatc agttgggtgc acgagtgggt tacatcgaac tggatctcaa 5221 cagcggtaag atccttgaga gttttcgccc cgaagaacgt tttccaatga tgagcacttt 5281 taaagttctg ctatgtggcg cggtattatc ccgtattgac gccgggcaag agcaactcgg 5341 tcgccgcata cactattctc agaatgactt ggttgagtac tcaccagtca cagaaaagca 5401 tcttacggat ggcatgacag taagagaatt atgcagtgct gccataacca tgagtgataa 5461 cactgcggcc aacttacttc tgacaacgat cggaggaccg aaggagctaa ccgctttttt 5521 gcacaacatg ggggatcatg taactcgcct tgatcgttgg gaaccggagc tgaatgaagc 5581 cataccaaac gacgagcgtg acaccacgat gcctgtagca atggcaacaa cgttgcgcaa 5641 actattaact ggcgaactac ttactctagc ttcccggcaa caattaatag actggatgga 5701 ggcggataaa gttgcaggac cacttctgcg ctcggccctt ccggctggct ggtttattgc 5761 tgataaatct ggagccggtg agcgtgggtc tcgcggtatc attgcagcac tggggccaga 5821 tggtaagccc tcccgtatcg tagttatcta cacgacgggg agtcaggcaa ctatggatga 5881 acgaaataga cagatcgctg agataggtgc ctcactgatt aagcattggt aactgtcaga 5941 ccaagtttac tcatatatac tttagattga tttaaaactt catttttaat ttaaaaggat 6001 ctaggtgaag atcctttttg ataatctcat gaccaaaatc ccttaacgtg agttttcgtt 6061 ccactgagcg tcagaccccg tagaaaagat caaaggatct tcttgagatc ctttttttct 6121 gcgcgtaatc tgctgcttgc aaacaaaaaa accaccgcta ccagcggtgg tttgtttgcc 6181 ggatcaagag 
ctaccaactc tttttccgaa ggtaactggc ttcagcagag cgcagatacc 6241 aaatactgtc cttctagtgt agccgtagtt aggccaccac ttcaagaact ctgtagcacc 6301 gcctacatac ctcgctctgc taatcctgtt accagtggct gctgccagtg gcgataagtc 6361 gtgtcttacc gggttggact caagacgata gttaccggat aaggcgcagc ggtcgggctg 6421 aacggggggt tcgtgcacac agcccagctt ggagcgaacg acctacaccg aactgagata 6481 cctacagcgt gagcattgag aaagcgccac gcttcccgaa gggagaaagg cggacaggta 6541 tccggtaagc ggcagggtcg gaacaggaga gcgcacgagg gagcttccag ggggaaacgc 6601 ctggtatctt tatagtcctg tcgggtttcg ccacctctga cttgagcgtc gatttttgtg 6661 atgctcgtca ggggggcgga gcctatggaa aaacgccagc aacgcggcct ttttacggtt 6721 cctggccttt tgctggcctt ttgctcacat gttctttcct gcgttatccc ctgattctgt 6781 ggataaccgt attaccgcct ttgagtgagc tgataccgct cgccgcagcc gaacgaccga 6841 gcgcagcgag tcagtgagcg aggaagcgga agagcgccca atacgcaaac cgcctctccc 6901 cgcgcgttgg ccgattcatt aatgcagctg gcacgacagg tttcccgact ggaaagcggg 6961 cagtgagcgc aacgcaatta atgtgagtta gctcactcat taggcacccc aggctttaca 7021 ctttatgctt ccggctcgta tgttgtgtgg aattgtgagc ggataacaat ttcacacagg 7081 aaacagct // LOCUS SYNLACZG 6563 bp ds-DNA SYN 03-JUL-1990 DEFINITION Cloning vector pPD21.28. ACCESSION M34302 KEYWORDS lacZ. SOURCE Cloning vector pPD21.28. ORGANISM Cloning vector Artificial sequences; Cloning vehicles. REFERENCE 1 (bases 1 to 6563) AUTHORS Fire,A.Z., Harrison,S. and Dixon,D. TITLE A modular set of lac-Z fusion vectors for studying gene expression in C.elegans JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable [or printed] sequence for [1] kindly submitted by A.Z.Fire, 11-MAY-1990. 
Author address: A.Z.Fire Carnegie Inst of Washington Dept Embryology 115 West Univ Parkway Baltimore, MD 21210 email: AZF@JHUIGF.BITNET FEATURES from to/span description recomb 57 58 pUC19 end/synthetic start recomb 180 181 synthetic end/E.coli trpS start recomb 264 265 E.coli trpS end/synthetic start recomb 268 269 synthetic end/E.coli lacZ start recomb 3262 3263 E.coli lacZ end/synthetic start recomb 4097 4098 synthetic end/pUC19 start recomb 4346 4347 pUC19 end/synthetic start recomb 3322 3323 synthetic end/unknown DNA start recomb 4077 4078 unknown DNA end/synthetic start recomb 4357 4358 synthetic end/pUC19 start BASE COUNT 1587 a 1650 c 1723 g 1603 t ORIGIN 1 atgaccatga ttacgccaag cttgcatgcc tgcaggtcga ctctagagga tccccgggat 61 tggccaaagg acccaaaggt atgtttcgaa tgatactaac ataacataga acattttcag 121 gaggaccctt ggagggtacc gagctcagaa aaaatgactg ctccaaagaa gaagcgtaag 181 gtaccggtgg gtgaagacca gaaacagcac ctcgaactga gccgcgatat tgcccagcgt 241 ttcaacgcgc tgtatggcga gatcgatccc gtcgttttac aacgtcgtga ctgggaaaac 301 cctggcgtta cccaacttaa tcgccttgca gcacatcccc ctttcgccag ctggcgtaat 361 agcgaagagg cccgcaccga tcgcccttcc caacagttgc gcagcctgaa tggcgaatgg 421 cgctttgcct ggtttccggc accagaagcg gtgccggaaa gctggctgga gtgcgatctt 481 cctgaggccg atactgtcgt cgtcccctca aactggcaga tgcacggtta cgatgcgccc 541 atctacacca acgtaaccta tcccattacg gtcaatccgc cgtttgttcc cacggagaat 601 ccgacgggtt gttactcgct cacatttaat gttgatgaaa gctggctaca ggaaggccag 661 acgcgaatta tttttgatgg cgttaactcg gcgtttcatc tgtggtgcaa cgggcgctgg 721 gtcggttacg gccaggacag tcgtttgccg tctgaatttg acctgagcgc atttttacgc 781 gccggagaaa accgcctcgc ggtgatggtg ctgcgttgga gtgacggcag ttatctggaa 841 gatcaggata tgtggcggat gagcggcatt ttccgtgacg tctcgttgct gcataaaccg 901 actacacaaa tcagcgattt ccatgttgcc actcgcttta atgatgattt cagccgcgct 961 gtactggagg ctgaagttca gatgtgcggc gagttgcgtg actacctacg ggtaacagtt 1021 tctttatggc agggtgaaac gcaggtcgcc agcggcaccg cgcctttcgg cggtgaaatt 1081 atcgatgagc gtggtggtta tgccgatcgc gtcacactac gtctgaacgt 
cgaaaacccg 1141 aaactgtgga gcgccgaaat cccgaatctc tatcgtgcgg tggttgaact gcacaccgcc 1201 gacggcacgc tgattgaagc agaagcctgc gatgtcggtt tccgcgaggt gcggattgaa 1261 aatggtctgc tgctgctgaa cggcaagccg ttgctgattc gaggcgttaa ccgtcacgag 1321 catcatcctc tgcatggtca ggtcatggat gagcagacga tggtgcagga tatcctgctg 1381 atgaagcaga acaactttaa cgccgtgcgc tgttcgcatt atccgaacca tccgctgtgg 1441 tacacgctgt gcgaccgcta cggcctgtat gtggtggatg aagccaatat tgaaacccac 1501 ggcatggtgc caatgaatcg tctgaccgat gatccgcgct ggctaccggc gatgagcgaa 1561 cgcgtaacgc gaatggtgca gcgcgatcgt aatcacccga gtgtgatcat ctggtcgctg 1621 gggaatgaat caggccacgg cgctaatcac gacgcgctgt atcgctggat caaatctgtc 1681 gatccttccc gcccggtgca gtatgaaggc ggcggagccg acaccacggc caccgatatt 1741 atttgcccga tgtacgcgcg cgtggatgaa gaccagccct tcccggctgt gccgaaatgg 1801 tccatcaaaa aatggctttc gctacctgga gagacgcgcc cgctgatcct ttgcgaatac 1861 gcccacgcga tgggtaacag tcttggcggt ttcgctaaat actggcaggc gtttcgtcag 1921 tatccccgtt tacagggcgg cttcgtctgg gactgggtgg atcagtcgct gattaaatat 1981 gatgaaaacg gcaacccgtg gtcggcttac ggcggtgatt ttggcgatac gccgaacgat 2041 cgccagttct gtatgaacgg tctggtcttt gccgaccgca cgccgcatcc agcgctgacg 2101 gaagcaaaac accagcagca gtttttccag ttccgtttat ccgggcaaac catcgaagtg 2161 accagcgaat acctgttccg tcatagcgat aacgagctcc tgcactggat ggtggcgctg 2221 gatggtaagc cgctggcaag cggtgaagtg cctctggatg tcgctccaca aggtaaacag 2281 ttgattgaac tgcctgaact accgcagccg gagagcgccg ggcaactctg gctcacagta 2341 cgcgtagtgc aaccgaacgc gaccgcatgg tcagaagccg ggcacatcag cgcctggcag 2401 cagtggcgtc tggcggaaaa cctcagtgtg acgctccccg ccgcgtccca cgccatcccg 2461 catctgacca ccagcgaaat ggatttttgc atcgagctgg gtaataagcg ttggcaattt 2521 aaccgccagt caggctttct ttcacagatg tggattggcg ataaaaaaca actgctgacg 2581 ccgctgcgcg atcagttcac ccgtgcaccg ctggataacg acattggcgt aagtgaagcg 2641 acccgcattg accctaacgc ctgggtcgaa cgctggaagg cggcgggcca ttaccaggcc 2701 gaagcagcgt tgttgcagtg cacggcagat acacttgctg atgcggtgct gattacgacc 2761 gctcacgcgt ggcagcatca ggggaaaacc ttatttatca gccggaaaac ctaccggatt 
2821 gatggtagtg gtcaaatggc gattaccgtt gatgttgaag tggcgagcga tacaccgcat 2881 ccggcgcgga ttggcctgaa ctgccagctg gcgcaggtag cagagcgggt aaactggctc 2941 ggattagggc cgcaagaaaa ctatcccgac cgccttactg ccgcctgttt tgaccgctgg 3001 gatctgccat tgtcagacat gtataccccg tacgtcttcc cgagcgaaaa cggtctgcgc 3061 tgcgggacgc gcgaattgaa ttatggccca caccagtggc gcggcgactt ccagttcaac 3121 atcagccgct acagtcaaca gcaactgatg gaaaccagcc atcgccatct gctgcacgcg 3181 gaagaaggca catggctgaa tatcgacggt ttccatatgg ggattggtgg cgacgactcc 3241 tggagcccgt cagtatcggc ggaattccaa ctgagcgccg gtcgctacca ttaccaactt 3301 gtctggtgtc aaaaataata ggggccgctg tcatcagatc gccatctcgc gcccgtgcct 3361 ctgacttcta agtccaatta ctcttcaaca tccctacatg ctctttctcc ctgtgctccc 3421 accccctatt tttgttatta tcaaaaaaac ttcttcttaa tttctttgtt ttttagcttc 3481 ttttaagtca cctctaacaa tgaaattgtg tagattcaaa aatagaatta attcgtaata 3541 aaaagtcgaa aaaaattgtg ctccctcccc ccattaataa taattctatc ccaaaatcta 3601 cacaatgttc tgtgtacact tcttatgttt tttttacttc tgataaattt tttttgaaac 3661 atcatagaaa aaaccgcaca caaaatacct tatcatatgt tacgtttcag tttatgaccg 3721 caatttttat ttcttcgcac gtctgggcct ctcatgacgt caaatcatgc tcatcgtgaa 3781 aaagttttgg agtatttttg gaatttttca atcaagtgaa agtttatgaa attaattttc 3841 ctgcttttgc tttttggggg tttcccctat tgtttgtcaa gagtttcgag gacggcgttt 3901 ttcttgctaa aatcacaagt attgatgagc acgatgcaag aaagatcgga agaaggtttg 3961 ggtttgaggc tcagtggaag gtgagtagaa gttgataatt tgaaagtgga gtagtgtcta 4021 tggggttttt gccttaaatg acagaataca ttcccaatat accaaacata actgtttcct 4081 actagtcggc cgtacgggcc ctttcgtctc gcgcgtttcg gtgatgacgg tgaaaacctc 4141 tgacacatgc agctcccgga gacggtcaca gcttgtctgt aagcggatgc cgggagcaga 4201 caagcccgtc agggcgcgtc agcgggtgtt ggcgggtgtc ggggctggct taactatgcg 4261 gcatcagagc agattgtact gagagtgcac catatgcggt gtgaaatacc gcacagatgc 4321 gtaaggagaa aataccgcat caggcggcct taagggcctc gtgatacgcc tatttttata 4381 ggttaatgtc atgataataa tggtttctta gacgtcaggt ggcacttttc ggggaaatgt 4441 gcgcggaacc cctatttgtt tatttttcta aatacattca aatatgtatc cgctcatgag 4501 
acaataaccc tgataaatgc ttcaataata ttgaaaaagg aagagtatga gtattcaaca 4561 tttccgtgtc gcccttattc ccttttttgc ggcattttgc cttcctgttt ttgctcaccc 4621 agaaacgctg gtgaaagtaa aagatgctga agatcagttg ggtgcacgag tgggttacat 4681 cgaactggat ctcaacagcg gtaagatcct tgagagtttt cgccccgaag aacgttttcc 4741 aatgatgagc acttttaaag ttctgctatg tggcgcggta ttatcccgta ttgacgccgg 4801 gcaagagcaa ctcggtcgcc gcatacacta ttctcagaat gacttggttg agtactcacc 4861 agtcacagaa aagcatctta cggatggcat gacagtaaga gaattatgca gtgctgccat 4921 aaccatgagt gataacactg cggccaactt acttctgaca acgatcggag gaccgaagga 4981 gctaaccgct tttttgcaca acatggggga tcatgtaact cgccttgatc gttgggaacc 5041 ggagctgaat gaagccatac caaacgacga gcgtgacacc acgatgcctg tagcaatggc 5101 aacaacgttg cgcaaactat taactggcga actacttact ctagcttccc ggcaacaatt 5161 aatagactgg atggaggcgg ataaagttgc aggaccactt ctgcgctcgg cccttccggc 5221 tggctggttt attgctgata aatctggagc cggtgagcgt gggtctcgcg gtatcattgc 5281 agcactgggg ccagatggta agccctcccg tatcgtagtt atctacacga cggggagtca 5341 ggcaactatg gatgaacgaa atagacagat cgctgagata ggtgcctcac tgattaagca 5401 ttggtaactg tcagaccaag tttactcata tatactttag attgatttaa aacttcattt 5461 ttaatttaaa aggatctagg tgaagatcct ttttgataat ctcatgacca aaatccctta 5521 acgtgagttt tcgttccact gagcgtcaga ccccgtagaa aagatcaaag gatcttcttg 5581 agatcctttt tttctgcgcg taatctgctg cttgcaaaca aaaaaaccac cgctaccagc 5641 ggtggtttgt ttgccggatc aagagctacc aactcttttt ccgaaggtaa ctggcttcag 5701 cagagcgcag ataccaaata ctgtccttct agtgtagccg tagttaggcc accacttcaa 5761 gaactctgta gcaccgccta catacctcgc tctgctaatc ctgttaccag tggctgctgc 5821 cagtggcgat aagtcgtgtc ttaccgggtt ggactcaaga cgatagttac cggataaggc 5881 gcagcggtcg ggctgaacgg ggggttcgtg cacacagccc agcttggagc gaacgaccta 5941 caccgaactg agatacctac agcgtgagca ttgagaaagc gccacgcttc ccgaagggag 6001 aaaggcggac aggtatccgg taagcggcag ggtcggaaca ggagagcgca cgagggagct 6061 tccaggggga aacgcctggt atctttatag tcctgtcggg tttcgccacc tctgacttga 6121 gcgtcgattt ttgtgatgct cgtcaggggg gcggagccta tggaaaaacg ccagcaacgc 6181 ggccttttta 
cggttcctgg ccttttgctg gccttttgct cacatgttct ttcctgcgtt 6241 atcccctgat tctgtggata accgtattac cgcctttgag tgagctgata ccgctcgccg 6301 cagccgaacg accgagcgca gcgagtcagt gagcgaggaa gcggaagagc gcccaatacg 6361 caaaccgcct ctccccgcgc gttggccgat tcattaatgc agctggcacg acaggtttcc 6421 cgactggaaa gcgggcagtg agcgcaacgc aattaatgtg agttagctca ctcattaggc 6481 accccaggct ttacacttta tgcttccggc tcgtatgttg tgtggaattg tgagcggata 6541 acaatttcac acaggaaaca gct // LOCUS SYNLACZH 6562 bp ds-DNA SYN 03-JUL-1990 DEFINITION Cloning vector pPD22.04. ACCESSION M34303 KEYWORDS lacZ. SOURCE Cloning vector pPD22.04. ORGANISM Cloning vector Artificial sequences; Cloning vehicles. REFERENCE 1 (bases 1 to 6562) AUTHORS Fire,A.Z., Harrison,S. and Dixon,D. TITLE A modular set of lac-Z fusion vectors for studying gene expression in C.elegans JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable [or printed] sequence for [1] kindly submitted by A.Z.Fire, 11-MAY-1990. Author address: A.Z.Fire Carnegie Inst of Washington Dept Embryology 115 West Univ Parkway Baltimore, MD 21210 email: AZF@JHUIGF.BITNET FEATURES from to/span description recomb 57 58 pUC19 end/synthetic start recomb 179 180 synthetic end/E.coli trpS start recomb 263 264 E.coli trpS end/synthetic start recomb 267 268 synthetic end/E.coli lacZ start recomb 3261 3262 E.coli lacZ end/synthetic start recomb 4096 4097 synthetic end/pUC19 start recomb 4345 4346 pUC19 end/synthetic start recomb 3321 3322 synthetic end/unknown DNA start recomb 4076 4077 unknown DNA end/synthetic start recomb 4356 4357 synthetic end/pUC19 start BASE COUNT 1587 a 1650 c 1722 g 1603 t ORIGIN 1 atgaccatga ttacgccaag cttgcatgcc tgcaggtcga ctctagagga tccccgggat 61 tggccaaagg acccaaaggt atgtttcgaa tgatactaac ataacataga acattttcag 121 gaggaccctt gagggtaccg agctcagaaa aaatgactgc tccaaagaag aagcgtaagg 181 taccggtggg tgaagaccag aaacagcacc tcgaactgag ccgcgatatt gcccagcgtt 241 tcaacgcgct gtatggcgag atcgatcccg tcgttttaca acgtcgtgac 
tgggaaaacc 301 ctggcgttac ccaacttaat cgccttgcag cacatccccc tttcgccagc tggcgtaata 361 gcgaagaggc ccgcaccgat cgcccttccc aacagttgcg cagcctgaat ggcgaatggc 421 gctttgcctg gtttccggca ccagaagcgg tgccggaaag ctggctggag tgcgatcttc 481 ctgaggccga tactgtcgtc gtcccctcaa actggcagat gcacggttac gatgcgccca 541 tctacaccaa cgtaacctat cccattacgg tcaatccgcc gtttgttccc acggagaatc 601 cgacgggttg ttactcgctc acatttaatg ttgatgaaag ctggctacag gaaggccaga 661 cgcgaattat ttttgatggc gttaactcgg cgtttcatct gtggtgcaac gggcgctggg 721 tcggttacgg ccaggacagt cgtttgccgt ctgaatttga cctgagcgca tttttacgcg 781 ccggagaaaa ccgcctcgcg gtgatggtgc tgcgttggag tgacggcagt tatctggaag 841 atcaggatat gtggcggatg agcggcattt tccgtgacgt ctcgttgctg cataaaccga 901 ctacacaaat cagcgatttc catgttgcca ctcgctttaa tgatgatttc agccgcgctg 961 tactggaggc tgaagttcag atgtgcggcg agttgcgtga ctacctacgg gtaacagttt 1021 ctttatggca gggtgaaacg caggtcgcca gcggcaccgc gcctttcggc ggtgaaatta 1081 tcgatgagcg tggtggttat gccgatcgcg tcacactacg tctgaacgtc gaaaacccga 1141 aactgtggag cgccgaaatc ccgaatctct atcgtgcggt ggttgaactg cacaccgccg 1201 acggcacgct gattgaagca gaagcctgcg atgtcggttt ccgcgaggtg cggattgaaa 1261 atggtctgct gctgctgaac ggcaagccgt tgctgattcg aggcgttaac cgtcacgagc 1321 atcatcctct gcatggtcag gtcatggatg agcagacgat ggtgcaggat atcctgctga 1381 tgaagcagaa caactttaac gccgtgcgct gttcgcatta tccgaaccat ccgctgtggt 1441 acacgctgtg cgaccgctac ggcctgtatg tggtggatga agccaatatt gaaacccacg 1501 gcatggtgcc aatgaatcgt ctgaccgatg atccgcgctg gctaccggcg atgagcgaac 1561 gcgtaacgcg aatggtgcag cgcgatcgta atcacccgag tgtgatcatc tggtcgctgg 1621 ggaatgaatc aggccacggc gctaatcacg acgcgctgta tcgctggatc aaatctgtcg 1681 atccttcccg cccggtgcag tatgaaggcg gcggagccga caccacggcc accgatatta 1741 tttgcccgat gtacgcgcgc gtggatgaag accagccctt cccggctgtg ccgaaatggt 1801 ccatcaaaaa atggctttcg ctacctggag agacgcgccc gctgatcctt tgcgaatacg 1861 cccacgcgat gggtaacagt cttggcggtt tcgctaaata ctggcaggcg tttcgtcagt 1921 atccccgttt acagggcggc ttcgtctggg actgggtgga tcagtcgctg attaaatatg 1981 
atgaaaacgg caacccgtgg tcggcttacg gcggtgattt tggcgatacg ccgaacgatc 2041 gccagttctg tatgaacggt ctggtctttg ccgaccgcac gccgcatcca gcgctgacgg 2101 aagcaaaaca ccagcagcag tttttccagt tccgtttatc cgggcaaacc atcgaagtga 2161 ccagcgaata cctgttccgt catagcgata acgagctcct gcactggatg gtggcgctgg 2221 atggtaagcc gctggcaagc ggtgaagtgc ctctggatgt cgctccacaa ggtaaacagt 2281 tgattgaact gcctgaacta ccgcagccgg agagcgccgg gcaactctgg ctcacagtac 2341 gcgtagtgca accgaacgcg accgcatggt cagaagccgg gcacatcagc gcctggcagc 2401 agtggcgtct ggcggaaaac ctcagtgtga cgctccccgc cgcgtcccac gccatcccgc 2461 atctgaccac cagcgaaatg gatttttgca tcgagctggg taataagcgt tggcaattta 2521 accgccagtc aggctttctt tcacagatgt ggattggcga taaaaaacaa ctgctgacgc 2581 cgctgcgcga tcagttcacc cgtgcaccgc tggataacga cattggcgta agtgaagcga 2641 cccgcattga ccctaacgcc tgggtcgaac gctggaaggc ggcgggccat taccaggccg 2701 aagcagcgtt gttgcagtgc acggcagata cacttgctga tgcggtgctg attacgaccg 2761 ctcacgcgtg gcagcatcag gggaaaacct tatttatcag ccggaaaacc taccggattg 2821 atggtagtgg tcaaatggcg attaccgttg atgttgaagt ggcgagcgat acaccgcatc 2881 cggcgcggat tggcctgaac tgccagctgg cgcaggtagc agagcgggta aactggctcg 2941 gattagggcc gcaagaaaac tatcccgacc gccttactgc cgcctgtttt gaccgctggg 3001 atctgccatt gtcagacatg tataccccgt acgtcttccc gagcgaaaac ggtctgcgct 3061 gcgggacgcg cgaattgaat tatggcccac accagtggcg cggcgacttc cagttcaaca 3121 tcagccgcta cagtcaacag caactgatgg aaaccagcca tcgccatctg ctgcacgcgg 3181 aagaaggcac atggctgaat atcgacggtt tccatatggg gattggtggc gacgactcct 3241 ggagcccgtc agtatcggcg gaattccaac tgagcgccgg tcgctaccat taccaacttg 3301 tctggtgtca aaaataatag gggccgctgt catcagatcg ccatctcgcg cccgtgcctc 3361 tgacttctaa gtccaattac tcttcaacat ccctacatgc tctttctccc tgtgctccca 3421 ccccctattt ttgttattat caaaaaaact tcttcttaat ttctttgttt tttagcttct 3481 tttaagtcac ctctaacaat gaaattgtgt agattcaaaa atagaattaa ttcgtaataa 3541 aaagtcgaaa aaaattgtgc tccctccccc cattaataat aattctatcc caaaatctac 3601 acaatgttct gtgtacactt cttatgtttt ttttacttct gataaatttt ttttgaaaca 3661 tcatagaaaa 
aaccgcacac aaaatacctt atcatatgtt acgtttcagt ttatgaccgc 3721 aatttttatt tcttcgcacg tctgggcctc tcatgacgtc aaatcatgct catcgtgaaa 3781 aagttttgga gtatttttgg aatttttcaa tcaagtgaaa gtttatgaaa ttaattttcc 3841 tgcttttgct ttttgggggt ttcccctatt gtttgtcaag agtttcgagg acggcgtttt 3901 tcttgctaaa atcacaagta ttgatgagca cgatgcaaga aagatcggaa gaaggtttgg 3961 gtttgaggct cagtggaagg tgagtagaag ttgataattt gaaagtggag tagtgtctat 4021 ggggtttttg ccttaaatga cagaatacat tcccaatata ccaaacataa ctgtttccta 4081 ctagtcggcc gtacgggccc tttcgtctcg cgcgtttcgg tgatgacggt gaaaacctct 4141 gacacatgca gctcccggag acggtcacag cttgtctgta agcggatgcc gggagcagac 4201 aagcccgtca gggcgcgtca gcgggtgttg gcgggtgtcg gggctggctt aactatgcgg 4261 catcagagca gattgtactg agagtgcacc atatgcggtg tgaaataccg cacagatgcg 4321 taaggagaaa ataccgcatc aggcggcctt aagggcctcg tgatacgcct atttttatag 4381 gttaatgtca tgataataat ggtttcttag acgtcaggtg gcacttttcg gggaaatgtg 4441 cgcggaaccc ctatttgttt atttttctaa atacattcaa atatgtatcc gctcatgaga 4501 caataaccct gataaatgct tcaataatat tgaaaaagga agagtatgag tattcaacat 4561 ttccgtgtcg cccttattcc cttttttgcg gcattttgcc ttcctgtttt tgctcaccca 4621 gaaacgctgg tgaaagtaaa agatgctgaa gatcagttgg gtgcacgagt gggttacatc 4681 gaactggatc tcaacagcgg taagatcctt gagagttttc gccccgaaga acgttttcca 4741 atgatgagca cttttaaagt tctgctatgt ggcgcggtat tatcccgtat tgacgccggg 4801 caagagcaac tcggtcgccg catacactat tctcagaatg acttggttga gtactcacca 4861 gtcacagaaa agcatcttac ggatggcatg acagtaagag aattatgcag tgctgccata 4921 accatgagtg ataacactgc ggccaactta cttctgacaa cgatcggagg accgaaggag 4981 ctaaccgctt ttttgcacaa catgggggat catgtaactc gccttgatcg ttgggaaccg 5041 gagctgaatg aagccatacc aaacgacgag cgtgacacca cgatgcctgt agcaatggca 5101 acaacgttgc gcaaactatt aactggcgaa ctacttactc tagcttcccg gcaacaatta 5161 atagactgga tggaggcgga taaagttgca ggaccacttc tgcgctcggc ccttccggct 5221 ggctggttta ttgctgataa atctggagcc ggtgagcgtg ggtctcgcgg tatcattgca 5281 gcactggggc cagatggtaa gccctcccgt atcgtagtta tctacacgac ggggagtcag 5341 gcaactatgg atgaacgaaa 
tagacagatc gctgagatag gtgcctcact gattaagcat 5401 tggtaactgt cagaccaagt ttactcatat atactttaga ttgatttaaa acttcatttt 5461 taatttaaaa ggatctaggt gaagatcctt tttgataatc tcatgaccaa aatcccttaa 5521 cgtgagtttt cgttccactg agcgtcagac cccgtagaaa agatcaaagg atcttcttga 5581 gatccttttt ttctgcgcgt aatctgctgc ttgcaaacaa aaaaaccacc gctaccagcg 5641 gtggtttgtt tgccggatca agagctacca actctttttc cgaaggtaac tggcttcagc 5701 agagcgcaga taccaaatac tgtccttcta gtgtagccgt agttaggcca ccacttcaag 5761 aactctgtag caccgcctac atacctcgct ctgctaatcc tgttaccagt ggctgctgcc 5821 agtggcgata agtcgtgtct taccgggttg gactcaagac gatagttacc ggataaggcg 5881 cagcggtcgg gctgaacggg gggttcgtgc acacagccca gcttggagcg aacgacctac 5941 accgaactga gatacctaca gcgtgagcat tgagaaagcg ccacgcttcc cgaagggaga 6001 aaggcggaca ggtatccggt aagcggcagg gtcggaacag gagagcgcac gagggagctt 6061 ccagggggaa acgcctggta tctttatagt cctgtcgggt ttcgccacct ctgacttgag 6121 cgtcgatttt tgtgatgctc gtcagggggg cggagcctat ggaaaaacgc cagcaacgcg 6181 gcctttttac ggttcctggc cttttgctgg ccttttgctc acatgttctt tcctgcgtta 6241 tcccctgatt ctgtggataa ccgtattacc gcctttgagt gagctgatac cgctcgccgc 6301 agccgaacga ccgagcgcag cgagtcagtg agcgaggaag cggaagagcg cccaatacgc 6361 aaaccgcctc tccccgcgcg ttggccgatt cattaatgca gctggcacga caggtttccc 6421 gactggaaag cgggcagtga gcgcaacgca attaatgtga gttagctcac tcattaggca 6481 ccccaggctt tacactttat gcttccggct cgtatgttgt gtggaattgt gagcggataa 6541 caatttcaca caggaaacag ct // LOCUS SYNLACZI 6567 bp ds-DNA SYN 03-JUL-1990 DEFINITION Cloning vector pPD22.11. ACCESSION M34304 KEYWORDS lacZ. SOURCE Cloning vector pPD22.11. ORGANISM Cloning vector Artificial sequences; Cloning vehicles. REFERENCE 1 (bases 1 to 6567) AUTHORS Fire,A.Z., Harrison,S. and Dixon,D. TITLE A modular set of lac-Z fusion vectors for studying gene expression in C.elegans JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable [or printed] sequence for [1] kindly submitted by A.Z.Fire, 11-MAY-1990. 
Author address: A.Z.Fire Carnegie Inst of Washington Dept Embryology 115 West Univ Parkway Baltimore, MD 21210 email: AZF@JHUIGF.BITNET FEATURES from to/span description recomb 57 58 pUC19 end/synthetic start recomb 184 185 synthetic end/E.coli trpS start recomb 268 269 E.coli trpS end/synthetic start recomb 272 273 synthetic end/E.coli lacZ start recomb 3266 3267 E.coli lacZ end/synthetic start recomb 4101 4102 synthetic end/pUC19 start recomb 4350 4351 pUC19 end/synthetic start recomb 3326 3327 synthetic end/unknown DNA start recomb 4081 4082 unknown DNA end/synthetic start recomb 4361 4362 synthetic end/pUC19 start BASE COUNT 1587 a 1651 c 1724 g 1605 t ORIGIN 1 atgaccatga ttacgccaag cttgcatgcc tgcaggtcga ctctagagga tccccgggat 61 tggccaaagg acccaaaggt atgtttcgaa tgatactaac ataacataga acattttcag 121 gaggaccctt gcttggaggg taccgagctc agaaaaaatg actgctccaa agaagaagcg 181 taaggtaccg gtgggtgaag accagaaaca gcacctcgaa ctgagccgcg atattgccca 241 gcgtttcaac gcgctgtatg gcgagatcga tcccgtcgtt ttacaacgtc gtgactggga 301 aaaccctggc gttacccaac ttaatcgcct tgcagcacat ccccctttcg ccagctggcg 361 taatagcgaa gaggcccgca ccgatcgccc ttcccaacag ttgcgcagcc tgaatggcga 421 atggcgcttt gcctggtttc cggcaccaga agcggtgccg gaaagctggc tggagtgcga 481 tcttcctgag gccgatactg tcgtcgtccc ctcaaactgg cagatgcacg gttacgatgc 541 gcccatctac accaacgtaa cctatcccat tacggtcaat ccgccgtttg ttcccacgga 601 gaatccgacg ggttgttact cgctcacatt taatgttgat gaaagctggc tacaggaagg 661 ccagacgcga attatttttg atggcgttaa ctcggcgttt catctgtggt gcaacgggcg 721 ctgggtcggt tacggccagg acagtcgttt gccgtctgaa tttgacctga gcgcattttt 781 acgcgccgga gaaaaccgcc tcgcggtgat ggtgctgcgt tggagtgacg gcagttatct 841 ggaagatcag gatatgtggc ggatgagcgg cattttccgt gacgtctcgt tgctgcataa 901 accgactaca caaatcagcg atttccatgt tgccactcgc tttaatgatg atttcagccg 961 cgctgtactg gaggctgaag ttcagatgtg cggcgagttg cgtgactacc tacgggtaac 1021 agtttcttta tggcagggtg aaacgcaggt cgccagcggc accgcgcctt tcggcggtga 1081 aattatcgat gagcgtggtg gttatgccga tcgcgtcaca ctacgtctga 
acgtcgaaaa 1141 cccgaaactg tggagcgccg aaatcccgaa tctctatcgt gcggtggttg aactgcacac 1201 cgccgacggc acgctgattg aagcagaagc ctgcgatgtc ggtttccgcg aggtgcggat 1261 tgaaaatggt ctgctgctgc tgaacggcaa gccgttgctg attcgaggcg ttaaccgtca 1321 cgagcatcat cctctgcatg gtcaggtcat ggatgagcag acgatggtgc aggatatcct 1381 gctgatgaag cagaacaact ttaacgccgt gcgctgttcg cattatccga accatccgct 1441 gtggtacacg ctgtgcgacc gctacggcct gtatgtggtg gatgaagcca atattgaaac 1501 ccacggcatg gtgccaatga atcgtctgac cgatgatccg cgctggctac cggcgatgag 1561 cgaacgcgta acgcgaatgg tgcagcgcga tcgtaatcac ccgagtgtga tcatctggtc 1621 gctggggaat gaatcaggcc acggcgctaa tcacgacgcg ctgtatcgct ggatcaaatc 1681 tgtcgatcct tcccgcccgg tgcagtatga aggcggcgga gccgacacca cggccaccga 1741 tattatttgc ccgatgtacg cgcgcgtgga tgaagaccag cccttcccgg ctgtgccgaa 1801 atggtccatc aaaaaatggc tttcgctacc tggagagacg cgcccgctga tcctttgcga 1861 atacgcccac gcgatgggta acagtcttgg cggtttcgct aaatactggc aggcgtttcg 1921 tcagtatccc cgtttacagg gcggcttcgt ctgggactgg gtggatcagt cgctgattaa 1981 atatgatgaa aacggcaacc cgtggtcggc ttacggcggt gattttggcg atacgccgaa 2041 cgatcgccag ttctgtatga acggtctggt ctttgccgac cgcacgccgc atccagcgct 2101 gacggaagca aaacaccagc agcagttttt ccagttccgt ttatccgggc aaaccatcga 2161 agtgaccagc gaatacctgt tccgtcatag cgataacgag ctcctgcact ggatggtggc 2221 gctggatggt aagccgctgg caagcggtga agtgcctctg gatgtcgctc cacaaggtaa 2281 acagttgatt gaactgcctg aactaccgca gccggagagc gccgggcaac tctggctcac 2341 agtacgcgta gtgcaaccga acgcgaccgc atggtcagaa gccgggcaca tcagcgcctg 2401 gcagcagtgg cgtctggcgg aaaacctcag tgtgacgctc cccgccgcgt cccacgccat 2461 cccgcatctg accaccagcg aaatggattt ttgcatcgag ctgggtaata agcgttggca 2521 atttaaccgc cagtcaggct ttctttcaca gatgtggatt ggcgataaaa aacaactgct 2581 gacgccgctg cgcgatcagt tcacccgtgc accgctggat aacgacattg gcgtaagtga 2641 agcgacccgc attgacccta acgcctgggt cgaacgctgg aaggcggcgg gccattacca 2701 ggccgaagca gcgttgttgc agtgcacggc agatacactt gctgatgcgg tgctgattac 2761 gaccgctcac gcgtggcagc atcaggggaa aaccttattt atcagccgga aaacctaccg 
2821 gattgatggt agtggtcaaa tggcgattac cgttgatgtt gaagtggcga gcgatacacc 2881 gcatccggcg cggattggcc tgaactgcca gctggcgcag gtagcagagc gggtaaactg 2941 gctcggatta gggccgcaag aaaactatcc cgaccgcctt actgccgcct gttttgaccg 3001 ctgggatctg ccattgtcag acatgtatac cccgtacgtc ttcccgagcg aaaacggtct 3061 gcgctgcggg acgcgcgaat tgaattatgg cccacaccag tggcgcggcg acttccagtt 3121 caacatcagc cgctacagtc aacagcaact gatggaaacc agccatcgcc atctgctgca 3181 cgcggaagaa ggcacatggc tgaatatcga cggtttccat atggggattg gtggcgacga 3241 ctcctggagc ccgtcagtat cggcggaatt ccaactgagc gccggtcgct accattacca 3301 acttgtctgg tgtcaaaaat aataggggcc gctgtcatca gatcgccatc tcgcgcccgt 3361 gcctctgact tctaagtcca attactcttc aacatcccta catgctcttt ctccctgtgc 3421 tcccaccccc tatttttgtt attatcaaaa aaacttcttc ttaatttctt tgttttttag 3481 cttcttttaa gtcacctcta acaatgaaat tgtgtagatt caaaaataga attaattcgt 3541 aataaaaagt cgaaaaaaat tgtgctccct ccccccatta ataataattc tatcccaaaa 3601 tctacacaat gttctgtgta cacttcttat gtttttttta cttctgataa attttttttg 3661 aaacatcata gaaaaaaccg cacacaaaat accttatcat atgttacgtt tcagtttatg 3721 accgcaattt ttatttcttc gcacgtctgg gcctctcatg acgtcaaatc atgctcatcg 3781 tgaaaaagtt ttggagtatt tttggaattt ttcaatcaag tgaaagttta tgaaattaat 3841 tttcctgctt ttgctttttg ggggtttccc ctattgtttg tcaagagttt cgaggacggc 3901 gtttttcttg ctaaaatcac aagtattgat gagcacgatg caagaaagat cggaagaagg 3961 tttgggtttg aggctcagtg gaaggtgagt agaagttgat aatttgaaag tggagtagtg 4021 tctatggggt ttttgcctta aatgacagaa tacattccca atataccaaa cataactgtt 4081 tcctactagt cggccgtacg ggccctttcg tctcgcgcgt ttcggtgatg acggtgaaaa 4141 cctctgacac atgcagctcc cggagacggt cacagcttgt ctgtaagcgg atgccgggag 4201 cagacaagcc cgtcagggcg cgtcagcggg tgttggcggg tgtcggggct ggcttaacta 4261 tgcggcatca gagcagattg tactgagagt gcaccatatg cggtgtgaaa taccgcacag 4321 atgcgtaagg agaaaatacc gcatcaggcg gccttaaggg cctcgtgata cgcctatttt 4381 tataggttaa tgtcatgata ataatggttt cttagacgtc aggtggcact tttcggggaa 4441 atgtgcgcgg aacccctatt tgtttatttt tctaaataca ttcaaatatg tatccgctca 4501 
tgagacaata accctgataa atgcttcaat aatattgaaa aaggaagagt atgagtattc 4561 aacatttccg tgtcgccctt attccctttt ttgcggcatt ttgccttcct gtttttgctc 4621 acccagaaac gctggtgaaa gtaaaagatg ctgaagatca gttgggtgca cgagtgggtt 4681 acatcgaact ggatctcaac agcggtaaga tccttgagag ttttcgcccc gaagaacgtt 4741 ttccaatgat gagcactttt aaagttctgc tatgtggcgc ggtattatcc cgtattgacg 4801 ccgggcaaga gcaactcggt cgccgcatac actattctca gaatgacttg gttgagtact 4861 caccagtcac agaaaagcat cttacggatg gcatgacagt aagagaatta tgcagtgctg 4921 ccataaccat gagtgataac actgcggcca acttacttct gacaacgatc ggaggaccga 4981 aggagctaac cgcttttttg cacaacatgg gggatcatgt aactcgcctt gatcgttggg 5041 aaccggagct gaatgaagcc ataccaaacg acgagcgtga caccacgatg cctgtagcaa 5101 tggcaacaac gttgcgcaaa ctattaactg gcgaactact tactctagct tcccggcaac 5161 aattaataga ctggatggag gcggataaag ttgcaggacc acttctgcgc tcggcccttc 5221 cggctggctg gtttattgct gataaatctg gagccggtga gcgtgggtct cgcggtatca 5281 ttgcagcact ggggccagat ggtaagccct cccgtatcgt agttatctac acgacgggga 5341 gtcaggcaac tatggatgaa cgaaatagac agatcgctga gataggtgcc tcactgatta 5401 agcattggta actgtcagac caagtttact catatatact ttagattgat ttaaaacttc 5461 atttttaatt taaaaggatc taggtgaaga tcctttttga taatctcatg accaaaatcc 5521 cttaacgtga gttttcgttc cactgagcgt cagaccccgt agaaaagatc aaaggatctt 5581 cttgagatcc tttttttctg cgcgtaatct gctgcttgca aacaaaaaaa ccaccgctac 5641 cagcggtggt ttgtttgccg gatcaagagc taccaactct ttttccgaag gtaactggct 5701 tcagcagagc gcagatacca aatactgtcc ttctagtgta gccgtagtta ggccaccact 5761 tcaagaactc tgtagcaccg cctacatacc tcgctctgct aatcctgtta ccagtggctg 5821 ctgccagtgg cgataagtcg tgtcttaccg ggttggactc aagacgatag ttaccggata 5881 aggcgcagcg gtcgggctga acggggggtt cgtgcacaca gcccagcttg gagcgaacga 5941 cctacaccga actgagatac ctacagcgtg agcattgaga aagcgccacg cttcccgaag 6001 ggagaaaggc ggacaggtat ccggtaagcg gcagggtcgg aacaggagag cgcacgaggg 6061 agcttccagg gggaaacgcc tggtatcttt atagtcctgt cgggtttcgc cacctctgac 6121 ttgagcgtcg atttttgtga tgctcgtcag gggggcggag cctatggaaa aacgccagca 6181 acgcggcctt 
tttacggttc ctggcctttt gctggccttt tgctcacatg ttctttcctg 6241 cgttatcccc tgattctgtg gataaccgta ttaccgcctt tgagtgagct gataccgctc 6301 gccgcagccg aacgaccgag cgcagcgagt cagtgagcga ggaagcggaa gagcgcccaa 6361 tacgcaaacc gcctctcccc gcgcgttggc cgattcatta atgcagctgg cacgacaggt 6421 ttcccgactg gaaagcgggc agtgagcgca acgcaattaa tgtgagttag ctcactcatt 6481 aggcacccca ggctttacac tttatgcttc cggctcgtat gttgtgtgga attgtgagcg 6541 gataacaatt tcacacagga aacagct // LOCUS SYNLACZJ 7242 bp ds-DNA SYN 03-JUL-1990 DEFINITION Cloning vector pPD26.77. ACCESSION M34305 KEYWORDS lacZ. SOURCE Cloning vector pPD26.77. ORGANISM Cloning vector Artificial sequences; Cloning vehicles. REFERENCE 1 (bases 1 to 7242) AUTHORS Fire,A.Z., Harrison,S. and Dixon,D. TITLE A modular set of lac-Z fusion vectors for studying gene expression in C.elegans JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable [or printed] sequence for [1] kindly submitted by A.Z.Fire, 11-MAY-1990. Author address: A.Z.Fire Carnegie Inst of Washington Dept Embryology 115 West Univ Parkway Baltimore, MD 21210 email: AZF@JHUIGF.BITNET FEATURES from to/span description recomb 57 58 pUC19 end/synthetic start recomb 102 103 synthetic end/E.coli trpS start recomb 186 187 E.coli trpS end/synthetic start recomb 190 191 synthetic end/E.coli lacZ start recomb 3184 3185 E.coli lacZ end/synthetic start recomb 4776 4777 synthetic end/pUC19 start recomb 5025 5026 pUC19 end/synthetic start recomb 3244 3245 synthetic end/unknown DNA start recomb 4756 4757 unknown DNA end/synthetic start recomb 5036 5037 synthetic end/pUC19 start BASE COUNT 1780 a 1801 c 1846 g 1815 t ORIGIN 1 atgaccatga ttacgccaag cttgcatgcc tgcaggtcga ctctagagga tccccgggta 61 ccgagctcag aaaaaatgac tgctccaaag aagaagcgta aggtaccggt gggtgaagac 121 cagaaacagc acctcgaact gagccgcgat attgcccagc gtttcaacgc gctgtatggc 181 gagatcgatc ccgtcgtttt acaacgtcgt gactgggaaa accctggcgt tacccaactt 241 aatcgccttg cagcacatcc ccctttcgcc agctggcgta atagcgaaga 
ggcccgcacc 301 gatcgccctt cccaacagtt gcgcagcctg aatggcgaat ggcgctttgc ctggtttccg 361 gcaccagaag cggtgccgga aagctggctg gagtgcgatc ttcctgaggc cgatactgtc 421 gtcgtcccct caaactggca gatgcacggt tacgatgcgc ccatctacac caacgtaacc 481 tatcccatta cggtcaatcc gccgtttgtt cccacggaga atccgacggg ttgttactcg 541 ctcacattta atgttgatga aagctggcta caggaaggcc agacgcgaat tatttttgat 601 ggcgttaact cggcgtttca tctgtggtgc aacgggcgct gggtcggtta cggccaggac 661 agtcgtttgc cgtctgaatt tgacctgagc gcatttttac gcgccggaga aaaccgcctc 721 gcggtgatgg tgctgcgttg gagtgacggc agttatctgg aagatcagga tatgtggcgg 781 atgagcggca ttttccgtga cgtctcgttg ctgcataaac cgactacaca aatcagcgat 841 ttccatgttg ccactcgctt taatgatgat ttcagccgcg ctgtactgga ggctgaagtt 901 cagatgtgcg gcgagttgcg tgactaccta cgggtaacag tttctttatg gcagggtgaa 961 acgcaggtcg ccagcggcac cgcgcctttc ggcggtgaaa ttatcgatga gcgtggtggt 1021 tatgccgatc gcgtcacact acgtctgaac gtcgaaaacc cgaaactgtg gagcgccgaa 1081 atcccgaatc tctatcgtgc ggtggttgaa ctgcacaccg ccgacggcac gctgattgaa 1141 gcagaagcct gcgatgtcgg tttccgcgag gtgcggattg aaaatggtct gctgctgctg 1201 aacggcaagc cgttgctgat tcgaggcgtt aaccgtcacg agcatcatcc tctgcatggt 1261 caggtcatgg atgagcagac gatggtgcag gatatcctgc tgatgaagca gaacaacttt 1321 aacgccgtgc gctgttcgca ttatccgaac catccgctgt ggtacacgct gtgcgaccgc 1381 tacggcctgt atgtggtgga tgaagccaat attgaaaccc acggcatggt gccaatgaat 1441 cgtctgaccg atgatccgcg ctggctaccg gcgatgagcg aacgcgtaac gcgaatggtg 1501 cagcgcgatc gtaatcaccc gagtgtgatc atctggtcgc tggggaatga atcaggccac 1561 ggcgctaatc acgacgcgct gtatcgctgg atcaaatctg tcgatccttc ccgcccggtg 1621 cagtatgaag gcggcggagc cgacaccacg gccaccgata ttatttgccc gatgtacgcg 1681 cgcgtggatg aagaccagcc cttcccggct gtgccgaaat ggtccatcaa aaaatggctt 1741 tcgctacctg gagagacgcg cccgctgatc ctttgcgaat acgcccacgc gatgggtaac 1801 agtcttggcg gtttcgctaa atactggcag gcgtttcgtc agtatccccg tttacagggc 1861 ggcttcgtct gggactgggt ggatcagtcg ctgattaaat atgatgaaaa cggcaacccg 1921 tggtcggctt acggcggtga ttttggcgat acgccgaacg atcgccagtt ctgtatgaac 1981 
ggtctggtct ttgccgaccg cacgccgcat ccagcgctga cggaagcaaa acaccagcag 2041 cagtttttcc agttccgttt atccgggcaa accatcgaag tgaccagcga atacctgttc 2101 cgtcatagcg ataacgagct cctgcactgg atggtggcgc tggatggtaa gccgctggca 2161 agcggtgaag tgcctctgga tgtcgctcca caaggtaaac agttgattga actgcctgaa 2221 ctaccgcagc cggagagcgc cgggcaactc tggctcacag tacgcgtagt gcaaccgaac 2281 gcgaccgcat ggtcagaagc cgggcacatc agcgcctggc agcagtggcg tctggcggaa 2341 aacctcagtg tgacgctccc cgccgcgtcc cacgccatcc cgcatctgac caccagcgaa 2401 atggattttt gcatcgagct gggtaataag cgttggcaat ttaaccgcca gtcaggcttt 2461 ctttcacaga tgtggattgg cgataaaaaa caactgctga cgccgctgcg cgatcagttc 2521 acccgtgcac cgctggataa cgacattggc gtaagtgaag cgacccgcat tgaccctaac 2581 gcctgggtcg aacgctggaa ggcggcgggc cattaccagg ccgaagcagc gttgttgcag 2641 tgcacggcag atacacttgc tgatgcggtg ctgattacga ccgctcacgc gtggcagcat 2701 caggggaaaa ccttatttat cagccggaaa acctaccgga ttgatggtag tggtcaaatg 2761 gcgattaccg ttgatgttga agtggcgagc gatacaccgc atccggcgcg gattggcctg 2821 aactgccagc tggcgcaggt agcagagcgg gtaaactggc tcggattagg gccgcaagaa 2881 aactatcccg accgccttac tgccgcctgt tttgaccgct gggatctgcc attgtcagac 2941 atgtataccc cgtacgtctt cccgagcgaa aacggtctgc gctgcgggac gcgcgaattg 3001 aattatggcc cacaccagtg gcgcggcgac ttccagttca acatcagccg ctacagtcaa 3061 cagcaactga tggaaaccag ccatcgccat ctgctgcacg cggaagaagg cacatggctg 3121 aatatcgacg gtttccatat ggggattggt ggcgacgact cctggagccc gtcagtatcg 3181 gcggaattcc aactgagcgc cggtcgctac cattaccaac ttgtctggtg tcaaaaataa 3241 taggcgaaac aaatcatctg acaccaccac cgtctgatgg atcgttctca tctccgtctc 3301 cacattatta tccgacgact acatcgacac cgaatcgaat ggaaacaagt ccggagtaca 3361 tgtttaacca tgaaatggtg ggtagatgat tattaaaatg tttaagaaaa ttaaataatt 3421 tgttttaggc accaccggtc aatgcgatgt ggtatactac accacctcct tatcaagatc 3481 caaactatcg tcatgtgcct ccaaatactg catttcaaaa tgcagagcaa atgaatggct 3541 ccttctactg ttaatctatt taattcatta atttttcatt tattgactgt atcccggatg 3601 tttcttgtcc tcccaacata tctcctaact gctcggttca ttttaaatat gctcatctca 3661 ctacatcacc 
cagacactgg tccccacaga gttttttgta tactatttcg ggtcattttt 3721 cttattctag actaatattg taagctataa gttgtagaat aattattgat ccaaatcaga 3781 ttaagagtat aagctttgtt ttttctcctt ttctttataa cttgttacaa tttttgaaat 3841 tccctttttt gacaggcttt tattacactg taactgtgtt tcttatcttg caaacattta 3901 atgaattgta attctttagt atcttgaggg ctttttgttt ttcgaattat tgaagctcaa 3961 agttccagtt ttactacgat ccagcgaatt ctcctcattt cgatccgatg caattgactt 4021 cagatcaata ttggttgcct gaaagaaata attgtgagca tttttgtcaa aaaacagaga 4081 actcaccatt ctcgaggctc ccgttccagg agcagtactt ggtgatggac acgtagattg 4141 attaaaccaa accaaaggtt ctttcagagt caacttacag cctcgagcgt agtccgtgat 4201 agcttctcgc agaacactga aaattggaaa tttattggaa taaaaacttt ttctgcactt 4261 tatagaataa aaaaatcatg aatttacccg aatttaacct ccgaatcgta gaccaaattg 4321 tccaagtaga tggaaatcac cttgaacatc ggatgttttt catatgctga aaataaatta 4381 atgaatttat gtaatttttt aaataattac ttttcaattt ggtgaacaat tcctgcttct 4441 ttgcataggc atctggacga gtgagtcctt tccaatcaat caatgtggtg tcgacctcga 4501 gggggggccc ggtacccagc ttttgttccc tttagtgagg gttaattccg agcttggcgt 4561 aatcatggtc atagctgttt cctgtgtgaa attgttatcc gctcacaatt ccacacaaca 4621 taggagccgg aagcataaag tgtaaagcct ggggtgccta atgagtgagg taactcacat 4681 taattgcgtt gcgctcactg cccgctttcc agtcgggaaa cctgtcgtgc cagctgcatt 4741 aatgaatcgg ccaacgccta ctagtcggcc gtacgggccc tttcgtctcg cgcgtttcgg 4801 tgatgacggt gaaaacctct gacacatgca gctcccggag acggtcacag cttgtctgta 4861 agcggatgcc gggagcagac aagcccgtca gggcgcgtca gcgggtgttg gcgggtgtcg 4921 gggctggctt aactatgcgg catcagagca gattgtactg agagtgcacc atatgcggtg 4981 tgaaataccg cacagatgcg taaggagaaa ataccgcatc aggcggcctt aagggcctcg 5041 tgatacgcct atttttatag gttaatgtca tgataataat ggtttcttag acgtcaggtg 5101 gcacttttcg gggaaatgtg cgcggaaccc ctatttgttt atttttctaa atacattcaa 5161 atatgtatcc gctcatgaga caataaccct gataaatgct tcaataatat tgaaaaagga 5221 agagtatgag tattcaacat ttccgtgtcg cccttattcc cttttttgcg gcattttgcc 5281 ttcctgtttt tgctcaccca gaaacgctgg tgaaagtaaa agatgctgaa gatcagttgg 5341 gtgcacgagt gggttacatc 
gaactggatc tcaacagcgg taagatcctt gagagttttc 5401 gccccgaaga acgttttcca atgatgagca cttttaaagt tctgctatgt ggcgcggtat 5461 tatcccgtat tgacgccggg caagagcaac tcggtcgccg catacactat tctcagaatg 5521 acttggttga gtactcacca gtcacagaaa agcatcttac ggatggcatg acagtaagag 5581 aattatgcag tgctgccata accatgagtg ataacactgc ggccaactta cttctgacaa 5641 cgatcggagg accgaaggag ctaaccgctt ttttgcacaa catgggggat catgtaactc 5701 gccttgatcg ttgggaaccg gagctgaatg aagccatacc aaacgacgag cgtgacacca 5761 cgatgcctgt agcaatggca acaacgttgc gcaaactatt aactggcgaa ctacttactc 5821 tagcttcccg gcaacaatta atagactgga tggaggcgga taaagttgca ggaccacttc 5881 tgcgctcggc ccttccggct ggctggttta ttgctgataa atctggagcc ggtgagcgtg 5941 ggtctcgcgg tatcattgca gcactggggc cagatggtaa gccctcccgt atcgtagtta 6001 tctacacgac ggggagtcag gcaactatgg atgaacgaaa tagacagatc gctgagatag 6061 gtgcctcact gattaagcat tggtaactgt cagaccaagt ttactcatat atactttaga 6121 ttgatttaaa acttcatttt taatttaaaa ggatctaggt gaagatcctt tttgataatc 6181 tcatgaccaa aatcccttaa cgtgagtttt cgttccactg agcgtcagac cccgtagaaa 6241 agatcaaagg atcttcttga gatccttttt ttctgcgcgt aatctgctgc ttgcaaacaa 6301 aaaaaccacc gctaccagcg gtggtttgtt tgccggatca agagctacca actctttttc 6361 cgaaggtaac tggcttcagc agagcgcaga taccaaatac tgtccttcta gtgtagccgt 6421 agttaggcca ccacttcaag aactctgtag caccgcctac atacctcgct ctgctaatcc 6481 tgttaccagt ggctgctgcc agtggcgata agtcgtgtct taccgggttg gactcaagac 6541 gatagttacc ggataaggcg cagcggtcgg gctgaacggg gggttcgtgc acacagccca 6601 gcttggagcg aacgacctac accgaactga gatacctaca gcgtgagcat tgagaaagcg 6661 ccacgcttcc cgaagggaga aaggcggaca ggtatccggt aagcggcagg gtcggaacag 6721 gagagcgcac gagggagctt ccagggggaa acgcctggta tctttatagt cctgtcgggt 6781 ttcgccacct ctgacttgag cgtcgatttt tgtgatgctc gtcagggggg cggagcctat 6841 ggaaaaacgc cagcaacgcg gcctttttac ggttcctggc cttttgctgg ccttttgctc 6901 acatgttctt tcctgcgtta tcccctgatt ctgtggataa ccgtattacc gcctttgagt 6961 gagctgatac cgctcgccgc agccgaacga ccgagcgcag cgagtcagtg agcgaggaag 7021 cggaagagcg cccaatacgc aaaccgcctc 
tccccgcgcg ttggccgatt cattaatgca 7081 gctggcacga caggtttccc gactggaaag cgggcagtga gcgcaacgca attaatgtga 7141 gttagctcac tcattaggca ccccaggctt tacactttat gcttccggct cgtatgttgt 7201 gtggaattgt gagcggataa caatttcaca caggaaacag ct // LOCUS SYNLACZK 6620 bp ds-DNA SYN 03-JUL-1990 DEFINITION Cloning vector pPD34.110. ACCESSION M34306 KEYWORDS lacZ. SOURCE Cloning vector pPD34.110. ORGANISM Cloning vector Artificial sequences; Cloning vehicles. REFERENCE 1 (bases 1 to 6620) AUTHORS Fire,A.Z., Harrison,S. and Dixon,D. TITLE A modular set of lac-Z fusion vectors for studying gene expression in C.elegans JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable [or printed] sequence for [1] kindly submitted by A.Z.Fire, 11-MAY-1990. Author address: A.Z.Fire Carnegie Inst of Washington Dept Embryology 115 West Univ Parkway Baltimore, MD 21210 email: AZF@JHUIGF.BITNET FEATURES from to/span description recomb 57 58 pUC19 end/synthetic start recomb 237 238 synthetic end/E.coli trpS start recomb 321 322 E.coli trpS end/synthetic start recomb 325 326 synthetic end/E.coli lacZ start recomb 3319 3320 E.coli lacZ end/synthetic start recomb 4154 4155 synthetic end/pUC19 start recomb 4403 4404 pUC19 end/synthetic start recomb 3379 3380 synthetic end/unknown DNA start recomb 4134 4135 unknown DNA end/synthetic start recomb 4414 4415 synthetic end/pUC19 start BASE COUNT 1592 a 1665 c 1731 g 1632 t ORIGIN 1 atgaccatga ttacgccaag cttgcatgcc tgcaggtcga ctctagagga tccccgggat 61 tggccaaagg acccaaaggt atgtttcgaa tgatactaac ataacataga acattttcag 121 gaggaccctt ggagggtacc tcgagaaagc tggcaaaggg ctcttgtcct gctaatcgta 181 ctactcttca tcgtcatctt cgttattact gttttgttcg tcataagatc taacaaggta 241 ccggtgggtg aagaccagaa acagcacctc gaactgagcc gcgatattgc ccagcgtttc 301 aacgcgctgt atggcgagat cgatcccgtc gttttacaac gtcgtgactg ggaaaaccct 361 ggcgttaccc aacttaatcg ccttgcagca catccccctt tcgccagctg gcgtaatagc 421 gaagaggccc gcaccgatcg cccttcccaa cagttgcgca gcctgaatgg cgaatggcgc 
481 tttgcctggt ttccggcacc agaagcggtg ccggaaagct ggctggagtg cgatcttcct 541 gaggccgata ctgtcgtcgt cccctcaaac tggcagatgc acggttacga tgcgcccatc 601 tacaccaacg taacctatcc cattacggtc aatccgccgt ttgttcccac ggagaatccg 661 acgggttgtt actcgctcac atttaatgtt gatgaaagct ggctacagga aggccagacg 721 cgaattattt ttgatggcgt taactcggcg tttcatctgt ggtgcaacgg gcgctgggtc 781 ggttacggcc aggacagtcg tttgccgtct gaatttgacc tgagcgcatt tttacgcgcc 841 ggagaaaacc gcctcgcggt gatggtgctg cgttggagtg acggcagtta tctggaagat 901 caggatatgt ggcggatgag cggcattttc cgtgacgtct cgttgctgca taaaccgact 961 acacaaatca gcgatttcca tgttgccact cgctttaatg atgatttcag ccgcgctgta 1021 ctggaggctg aagttcagat gtgcggcgag ttgcgtgact acctacgggt aacagtttct 1081 ttatggcagg gtgaaacgca ggtcgccagc ggcaccgcgc ctttcggcgg tgaaattatc 1141 gatgagcgtg gtggttatgc cgatcgcgtc acactacgtc tgaacgtcga aaacccgaaa 1201 ctgtggagcg ccgaaatccc gaatctctat cgtgcggtgg ttgaactgca caccgccgac 1261 ggcacgctga ttgaagcaga agcctgcgat gtcggtttcc gcgaggtgcg gattgaaaat 1321 ggtctgctgc tgctgaacgg caagccgttg ctgattcgag gcgttaaccg tcacgagcat 1381 catcctctgc atggtcaggt catggatgag cagacgatgg tgcaggatat cctgctgatg 1441 aagcagaaca actttaacgc cgtgcgctgt tcgcattatc cgaaccatcc gctgtggtac 1501 acgctgtgcg accgctacgg cctgtatgtg gtggatgaag ccaatattga aacccacggc 1561 atggtgccaa tgaatcgtct gaccgatgat ccgcgctggc taccggcgat gagcgaacgc 1621 gtaacgcgaa tggtgcagcg cgatcgtaat cacccgagtg tgatcatctg gtcgctgggg 1681 aatgaatcag gccacggcgc taatcacgac gcgctgtatc gctggatcaa atctgtcgat 1741 ccttcccgcc cggtgcagta tgaaggcggc ggagccgaca ccacggccac cgatattatt 1801 tgcccgatgt acgcgcgcgt ggatgaagac cagcccttcc cggctgtgcc gaaatggtcc 1861 atcaaaaaat ggctttcgct acctggagag acgcgcccgc tgatcctttg cgaatacgcc 1921 cacgcgatgg gtaacagtct tggcggtttc gctaaatact ggcaggcgtt tcgtcagtat 1981 ccccgtttac agggcggctt cgtctgggac tgggtggatc agtcgctgat taaatatgat 2041 gaaaacggca acccgtggtc ggcttacggc ggtgattttg gcgatacgcc gaacgatcgc 2101 cagttctgta tgaacggtct ggtctttgcc gaccgcacgc cgcatccagc gctgacggaa 2161 gcaaaacacc 
agcagcagtt tttccagttc cgtttatccg ggcaaaccat cgaagtgacc 2221 agcgaatacc tgttccgtca tagcgataac gagctcctgc actggatggt ggcgctggat 2281 ggtaagccgc tggcaagcgg tgaagtgcct ctggatgtcg ctccacaagg taaacagttg 2341 attgaactgc ctgaactacc gcagccggag agcgccgggc aactctggct cacagtacgc 2401 gtagtgcaac cgaacgcgac cgcatggtca gaagccgggc acatcagcgc ctggcagcag 2461 tggcgtctgg cggaaaacct cagtgtgacg ctccccgccg cgtcccacgc catcccgcat 2521 ctgaccacca gcgaaatgga tttttgcatc gagctgggta ataagcgttg gcaatttaac 2581 cgccagtcag gctttctttc acagatgtgg attggcgata aaaaacaact gctgacgccg 2641 ctgcgcgatc agttcacccg tgcaccgctg gataacgaca ttggcgtaag tgaagcgacc 2701 cgcattgacc ctaacgcctg ggtcgaacgc tggaaggcgg cgggccatta ccaggccgaa 2761 gcagcgttgt tgcagtgcac ggcagataca cttgctgatg cggtgctgat tacgaccgct 2821 cacgcgtggc agcatcaggg gaaaacctta tttatcagcc ggaaaaccta ccggattgat 2881 ggtagtggtc aaatggcgat taccgttgat gttgaagtgg cgagcgatac accgcatccg 2941 gcgcggattg gcctgaactg ccagctggcg caggtagcag agcgggtaaa ctggctcgga 3001 ttagggccgc aagaaaacta tcccgaccgc cttactgccg cctgttttga ccgctgggat 3061 ctgccattgt cagacatgta taccccgtac gtcttcccga gcgaaaacgg tctgcgctgc 3121 gggacgcgcg aattgaatta tggcccacac cagtggcgcg gcgacttcca gttcaacatc 3181 agccgctaca gtcaacagca actgatggaa accagccatc gccatctgct gcacgcggaa 3241 gaaggcacat ggctgaatat cgacggtttc catatgggga ttggtggcga cgactcctgg 3301 agcccgtcag tatcggcgga attccaactg agcgccggtc gctaccatta ccaacttgtc 3361 tggtgtcaaa aataataggg gccgctgtca tcagatcgcc atctcgcgcc cgtgcctctg 3421 acttctaagt ccaattactc ttcaacatcc ctacatgctc tttctccctg tgctcccacc 3481 ccctattttt gttattatca aaaaaacttc ttcttaattt ctttgttttt tagcttcttt 3541 taagtcacct ctaacaatga aattgtgtag attcaaaaat agaattaatt cgtaataaaa 3601 agtcgaaaaa aattgtgctc cctcccccca ttaataataa ttctatccca aaatctacac 3661 aatgttctgt gtacacttct tatgtttttt ttacttctga taaatttttt ttgaaacatc 3721 atagaaaaaa ccgcacacaa aataccttat catatgttac gtttcagttt atgaccgcaa 3781 tttttatttc ttcgcacgtc tgggcctctc atgacgtcaa atcatgctca tcgtgaaaaa 3841 gttttggagt atttttggaa 
tttttcaatc aagtgaaagt ttatgaaatt aattttcctg 3901 cttttgcttt ttgggggttt cccctattgt ttgtcaagag tttcgaggac ggcgtttttc 3961 ttgctaaaat cacaagtatt gatgagcacg atgcaagaaa gatcggaaga aggtttgggt 4021 ttgaggctca gtggaaggtg agtagaagtt gataatttga aagtggagta gtgtctatgg 4081 ggtttttgcc ttaaatgaca gaatacattc ccaatatacc aaacataact gtttcctact 4141 agtcggccgt acgggccctt tcgtctcgcg cgtttcggtg atgacggtga aaacctctga 4201 cacatgcagc tcccggagac ggtcacagct tgtctgtaag cggatgccgg gagcagacaa 4261 gcccgtcagg gcgcgtcagc gggtgttggc gggtgtcggg gctggcttaa ctatgcggca 4321 tcagagcaga ttgtactgag agtgcaccat atgcggtgtg aaataccgca cagatgcgta 4381 aggagaaaat accgcatcag gcggccttaa gggcctcgtg atacgcctat ttttataggt 4441 taatgtcatg ataataatgg tttcttagac gtcaggtggc acttttcggg gaaatgtgcg 4501 cggaacccct atttgtttat ttttctaaat acattcaaat atgtatccgc tcatgagaca 4561 ataaccctga taaatgcttc aataatattg aaaaaggaag agtatgagta ttcaacattt 4621 ccgtgtcgcc cttattccct tttttgcggc attttgcctt cctgtttttg ctcacccaga 4681 aacgctggtg aaagtaaaag atgctgaaga tcagttgggt gcacgagtgg gttacatcga 4741 actggatctc aacagcggta agatccttga gagttttcgc cccgaagaac gttttccaat 4801 gatgagcact tttaaagttc tgctatgtgg cgcggtatta tcccgtattg acgccgggca 4861 agagcaactc ggtcgccgca tacactattc tcagaatgac ttggttgagt actcaccagt 4921 cacagaaaag catcttacgg atggcatgac agtaagagaa ttatgcagtg ctgccataac 4981 catgagtgat aacactgcgg ccaacttact tctgacaacg atcggaggac cgaaggagct 5041 aaccgctttt ttgcacaaca tgggggatca tgtaactcgc cttgatcgtt gggaaccgga 5101 gctgaatgaa gccataccaa acgacgagcg tgacaccacg atgcctgtag caatggcaac 5161 aacgttgcgc aaactattaa ctggcgaact acttactcta gcttcccggc aacaattaat 5221 agactggatg gaggcggata aagttgcagg accacttctg cgctcggccc ttccggctgg 5281 ctggtttatt gctgataaat ctggagccgg tgagcgtggg tctcgcggta tcattgcagc 5341 actggggcca gatggtaagc cctcccgtat cgtagttatc tacacgacgg ggagtcaggc 5401 aactatggat gaacgaaata gacagatcgc tgagataggt gcctcactga ttaagcattg 5461 gtaactgtca gaccaagttt actcatatat actttagatt gatttaaaac ttcattttta 5521 atttaaaagg atctaggtga agatcctttt 
tgataatctc atgaccaaaa tcccttaacg 5581 tgagttttcg ttccactgag cgtcagaccc cgtagaaaag atcaaaggat cttcttgaga 5641 tccttttttt ctgcgcgtaa tctgctgctt gcaaacaaaa aaaccaccgc taccagcggt 5701 ggtttgtttg ccggatcaag agctaccaac tctttttccg aaggtaactg gcttcagcag 5761 agcgcagata ccaaatactg tccttctagt gtagccgtag ttaggccacc acttcaagaa 5821 ctctgtagca ccgcctacat acctcgctct gctaatcctg ttaccagtgg ctgctgccag 5881 tggcgataag tcgtgtctta ccgggttgga ctcaagacga tagttaccgg ataaggcgca 5941 gcggtcgggc tgaacggggg gttcgtgcac acagcccagc ttggagcgaa cgacctacac 6001 cgaactgaga tacctacagc gtgagcattg agaaagcgcc acgcttcccg aagggagaaa 6061 ggcggacagg tatccggtaa gcggcagggt cggaacagga gagcgcacga gggagcttcc 6121 agggggaaac gcctggtatc tttatagtcc tgtcgggttt cgccacctct gacttgagcg 6181 tcgatttttg tgatgctcgt caggggggcg gagcctatgg aaaaacgcca gcaacgcggc 6241 ctttttacgg ttcctggcct tttgctggcc ttttgctcac atgttctttc ctgcgttatc 6301 ccctgattct gtggataacc gtattaccgc ctttgagtga gctgataccg ctcgccgcag 6361 ccgaacgacc gagcgcagcg agtcagtgag cgaggaagcg gaagagcgcc caatacgcaa 6421 accgcctctc cccgcgcgtt ggccgattca ttaatgcagc tggcacgaca ggtttcccga 6481 ctggaaagcg ggcagtgagc gcaacgcaat taatgtgagt tagctcactc attaggcacc 6541 ccaggcttta cactttatgc ttccggctcg tatgttgtgt ggaattgtga gcggataaca 6601 atttcacaca ggaaacagct // LOCUS SYNLACZL 5808 bp ds-DNA SYN 03-JUL-1990 DEFINITION Cloning vector pPD16.01. ACCESSION M34307 KEYWORDS lacZ. SOURCE Cloning vector pPD16.01. ORGANISM Cloning vector Artificial sequences; Cloning vehicles. REFERENCE 1 (bases 1 to 5808) AUTHORS Fire,A.Z., Harrison,S. and Dixon,D. TITLE A modular set of lac-Z fusion vectors for studying gene expression in C.elegans JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable [or printed] sequence for [1] kindly submitted by A.Z.Fire, 11-MAY-1990. 
Author address: A.Z.Fire Carnegie Inst of Washington Dept Embryology 115 West Univ Parkway Baltimore, MD 21210 email: AZF@JHUIGF.BITNET FEATURES from to/span description recomb 57 58 pUC19 end/synthetic start recomb 180 181 synthetic end/E.coli trpS start recomb 264 265 E.coli trpS end/synthetic start recomb 268 269 synthetic end/E.coli lacZ start recomb 3262 3263 E.coli lacZ end/synthetic start recomb 3342 3343 synthetic end/unknown DNA start recomb 3591 3592 unknown DNA end/synthetic start recomb 3602 3603 synthetic end/pUC19 start BASE COUNT 1375 a 1502 c 1599 g 1332 t ORIGIN 1 atgaccatga ttacgccaag cttgcatgcc tgcaggtcga ctctagagga tccccgggat 61 tggccaaagg acccaaaggt atgtttcgaa tgatactaac ataacataga acattttcag 121 gaggaccctt ggagggtacc gagctcagaa aaaatgactg ctccaaagaa gaagcgtaag 181 gtaccggtgg gtgaagacca gaaacagcac ctcgaactga gccgcgatat tgcccagcgt 241 ttcaacgcgc tgtatggcga gatcgatccc gtcgttttac aacgtcgtga ctgggaaaac 301 cctggcgtta cccaacttaa tcgccttgca gcacatcccc ctttcgccag ctggcgtaat 361 agcgaagagg cccgcaccga tcgcccttcc caacagttgc gcagcctgaa tggcgaatgg 421 cgctttgcct ggtttccggc accagaagcg gtgccggaaa gctggctgga gtgcgatctt 481 cctgaggccg atactgtcgt cgtcccctca aactggcaga tgcacggtta cgatgcgccc 541 atctacacca acgtaaccta tcccattacg gtcaatccgc cgtttgttcc cacggagaat 601 ccgacgggtt gttactcgct cacatttaat gttgatgaaa gctggctaca ggaaggccag 661 acgcgaatta tttttgatgg cgttaactcg gcgtttcatc tgtggtgcaa cgggcgctgg 721 gtcggttacg gccaggacag tcgtttgccg tctgaatttg acctgagcgc atttttacgc 781 gccggagaaa accgcctcgc ggtgatggtg ctgcgttgga gtgacggcag ttatctggaa 841 gatcaggata tgtggcggat gagcggcatt ttccgtgacg tctcgttgct gcataaaccg 901 actacacaaa tcagcgattt ccatgttgcc actcgcttta atgatgattt cagccgcgct 961 gtactggagg ctgaagttca gatgtgcggc gagttgcgtg actacctacg ggtaacagtt 1021 tctttatggc agggtgaaac gcaggtcgcc agcggcaccg cgcctttcgg cggtgaaatt 1081 atcgatgagc gtggtggtta tgccgatcgc gtcacactac gtctgaacgt cgaaaacccg 1141 aaactgtgga gcgccgaaat cccgaatctc tatcgtgcgg tggttgaact gcacaccgcc 1201 
gacggcacgc tgattgaagc agaagcctgc gatgtcggtt tccgcgaggt gcggattgaa 1261 aatggtctgc tgctgctgaa cggcaagccg ttgctgattc gaggcgttaa ccgtcacgag 1321 catcatcctc tgcatggtca ggtcatggat gagcagacga tggtgcagga tatcctgctg 1381 atgaagcaga acaactttaa cgccgtgcgc tgttcgcatt atccgaacca tccgctgtgg 1441 tacacgctgt gcgaccgcta cggcctgtat gtggtggatg aagccaatat tgaaacccac 1501 ggcatggtgc caatgaatcg tctgaccgat gatccgcgct ggctaccggc gatgagcgaa 1561 cgcgtaacgc gaatggtgca gcgcgatcgt aatcacccga gtgtgatcat ctggtcgctg 1621 gggaatgaat caggccacgg cgctaatcac gacgcgctgt atcgctggat caaatctgtc 1681 gatccttccc gcccggtgca gtatgaaggc ggcggagccg acaccacggc caccgatatt 1741 atttgcccga tgtacgcgcg cgtggatgaa gaccagccct tcccggctgt gccgaaatgg 1801 tccatcaaaa aatggctttc gctacctgga gagacgcgcc cgctgatcct ttgcgaatac 1861 gcccacgcga tgggtaacag tcttggcggt ttcgctaaat actggcaggc gtttcgtcag 1921 tatccccgtt tacagggcgg cttcgtctgg gactgggtgg atcagtcgct gattaaatat 1981 gatgaaaacg gcaacccgtg gtcggcttac ggcggtgatt ttggcgatac gccgaacgat 2041 cgccagttct gtatgaacgg tctggtcttt gccgaccgca cgccgcatcc agcgctgacg 2101 gaagcaaaac accagcagca gtttttccag ttccgtttat ccgggcaaac catcgaagtg 2161 accagcgaat acctgttccg tcatagcgat aacgagctcc tgcactggat ggtggcgctg 2221 gatggtaagc cgctggcaag cggtgaagtg cctctggatg tcgctccaca aggtaaacag 2281 ttgattgaac tgcctgaact accgcagccg gagagcgccg ggcaactctg gctcacagta 2341 cgcgtagtgc aaccgaacgc gaccgcatgg tcagaagccg ggcacatcag cgcctggcag 2401 cagtggcgtc tggcggaaaa cctcagtgtg acgctccccg ccgcgtccca cgccatcccg 2461 catctgacca ccagcgaaat ggatttttgc atcgagctgg gtaataagcg ttggcaattt 2521 aaccgccagt caggctttct ttcacagatg tggattggcg ataaaaaaca actgctgacg 2581 ccgctgcgcg atcagttcac ccgtgcaccg ctggataacg acattggcgt aagtgaagcg 2641 acccgcattg accctaacgc ctgggtcgaa cgctggaagg cggcgggcca ttaccaggcc 2701 gaagcagcgt tgttgcagtg cacggcagat acacttgctg atgcggtgct gattacgacc 2761 gctcacgcgt ggcagcatca ggggaaaacc ttatttatca gccggaaaac ctaccggatt 2821 gatggtagtg gtcaaatggc gattaccgtt gatgttgaag tggcgagcga tacaccgcat 2881 ccggcgcgga 
ttggcctgaa ctgccagctg gcgcaggtag cagagcgggt aaactggctc 2941 ggattagggc cgcaagaaaa ctatcccgac cgccttactg ccgcctgttt tgaccgctgg 3001 gatctgccat tgtcagacat gtataccccg tacgtcttcc cgagcgaaaa cggtctgcgc 3061 tgcgggacgc gcgaattgaa ttatggccca caccagtggc gcggcgactt ccagttcaac 3121 atcagccgct acagtcaaca gcaactgatg gaaaccagcc atcgccatct gctgcacgcg 3181 gaagaaggca catggctgaa tatcgacggt ttccatatgg ggattggtgg cgacgactcc 3241 tggagcccgt cagtatcggc ggaattccaa ctgagcgccg gtcgctacca ttaccaactt 3301 gtctggtgtc aaaaataata ggcctactag tcggccgtac gggccctttc gtctcgcgcg 3361 tttcggtgat gacggtgaaa acctctgaca catgcagctc ccggagacgg tcacagcttg 3421 tctgtaagcg gatgccggga gcagacaagc ccgtcagggc gcgtcagcgg gtgttggcgg 3481 gtgtcggggc tggcttaact atgcggcatc agagcagatt gtactgagag tgcaccatat 3541 gcggtgtgaa ataccgcaca gatgcgtaag gagaaaatac cgcatcaggc ggccttaagg 3601 gcctcgtgat acgcctattt ttataggtta atgtcatgat aataatggtt tcttagacgt 3661 caggtggcac ttttcgggga aatgtgcgcg gaacccctat ttgtttattt ttctaaatac 3721 attcaaatat gtatccgctc atgagacaat aaccctgata aatgcttcaa taatattgaa 3781 aaaggaagag tatgagtatt caacatttcc gtgtcgccct tattcccttt tttgcggcat 3841 tttgccttcc tgtttttgct cacccagaaa cgctggtgaa agtaaaagat gctgaagatc 3901 agttgggtgc acgagtgggt tacatcgaac tggatctcaa cagcggtaag atccttgaga 3961 gttttcgccc cgaagaacgt tttccaatga tgagcacttt taaagttctg ctatgtggcg 4021 cggtattatc ccgtattgac gccgggcaag agcaactcgg tcgccgcata cactattctc 4081 agaatgactt ggttgagtac tcaccagtca cagaaaagca tcttacggat ggcatgacag 4141 taagagaatt atgcagtgct gccataacca tgagtgataa cactgcggcc aacttacttc 4201 tgacaacgat cggaggaccg aaggagctaa ccgctttttt gcacaacatg ggggatcatg 4261 taactcgcct tgatcgttgg gaaccggagc tgaatgaagc cataccaaac gacgagcgtg 4321 acaccacgat gcctgtagca atggcaacaa cgttgcgcaa actattaact ggcgaactac 4381 ttactctagc ttcccggcaa caattaatag actggatgga ggcggataaa gttgcaggac 4441 cacttctgcg ctcggccctt ccggctggct ggtttattgc tgataaatct ggagccggtg 4501 agcgtgggtc tcgcggtatc attgcagcac tggggccaga tggtaagccc tcccgtatcg 4561 tagttatcta cacgacgggg 
agtcaggcaa ctatggatga acgaaataga cagatcgctg 4621 agataggtgc ctcactgatt aagcattggt aactgtcaga ccaagtttac tcatatatac 4681 tttagattga tttaaaactt catttttaat ttaaaaggat ctaggtgaag atcctttttg 4741 ataatctcat gaccaaaatc ccttaacgtg agttttcgtt ccactgagcg tcagaccccg 4801 tagaaaagat caaaggatct tcttgagatc ctttttttct gcgcgtaatc tgctgcttgc 4861 aaacaaaaaa accaccgcta ccagcggtgg tttgtttgcc ggatcaagag ctaccaactc 4921 tttttccgaa ggtaactggc ttcagcagag cgcagatacc aaatactgtc cttctagtgt 4981 agccgtagtt aggccaccac ttcaagaact ctgtagcacc gcctacatac ctcgctctgc 5041 taatcctgtt accagtggct gctgccagtg gcgataagtc gtgtcttacc gggttggact 5101 caagacgata gttaccggat aaggcgcagc ggtcgggctg aacggggggt tcgtgcacac 5161 agcccagctt ggagcgaacg acctacaccg aactgagata cctacagcgt gagcattgag 5221 aaagcgccac gcttcccgaa gggagaaagg cggacaggta tccggtaagc ggcagggtcg 5281 gaacaggaga gcgcacgagg gagcttccag ggggaaacgc ctggtatctt tatagtcctg 5341 tcgggtttcg ccacctctga cttgagcgtc gatttttgtg atgctcgtca ggggggcgga 5401 gcctatggaa aaacgccagc aacgcggcct ttttacggtt cctggccttt tgctggcctt 5461 ttgctcacat gttctttcct gcgttatccc ctgattctgt ggataaccgt attaccgcct 5521 ttgagtgagc tgataccgct cgccgcagcc gaacgaccga gcgcagcgag tcagtgagcg 5581 aggaagcgga agagcgccca atacgcaaac cgcctctccc cgcgcgttgg ccgattcatt 5641 aatgcagctg gcacgacagg tttcccgact ggaaagcggg cagtgagcgc aacgcaatta 5701 atgtgagtta gctcactcat taggcacccc aggctttaca ctttatgctt ccggctcgta 5761 tgttgtgtgg aattgtgagc ggataacaat ttcacacagg aaacagct // LOCUS HUMMHDQ3L 967 bp ds-DNA PRI 03-JUL-1990 DEFINITION Human MHC class II HLA-DQ-LTR3 (DQ,w8) DNA fragment, long terminal repeat region. ACCESSION M33841 KEYWORDS major histocompatibility complex. SOURCE Human (pot. haplotype DQ,w8) lung carcinoma DNA, clone LC14. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 967) AUTHORS Kambhu,S., Falldorf,P. and Lee,J.S. 
TITLE Endogenous retroviral long terminal repeats (LTR) within the HLA DQ locus JOURNAL Proc. Natl. Acad. Sci. U.S.A. (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable [or printed] sequence for [1] kindly submitted by J.S.Lee, 25-APR-1990. FEATURES from to/span description rpt 1 7 inverted repeat A rpt 961 967 inverted repeat B rpt 560 810 R region signal 792 797 poly-A signal site 74 79 1/2 GRE site 80 87 enhancer core site 530 536 TATA box BASE COUNT 254 a 235 c 222 g 256 t ORIGIN Chromosome 6p21.3. 1 tgtggggaaa agcaagagag atcagattgt tactgtgtct gtgtagaaag aagtagacat 61 agagactcca ttttgttatg tactaagaga aattcttctg ccttgagatt ctgttaatct 121 ataaccttac ccccaacccc gtgctctctg aaacatgtgc tgtgtcaact cagagttgaa 181 tggattaagg gcggtgcaag atgtgctttg ttaaacagat gcttgaaggc agcatgctcc 241 ttaagagtca tcaccactcc ctaatctcaa gtacccaggg acacaaaaac tgcggaaggc 301 cgcagggacc tctgcctagg aaagccaggt attgtccaag gtttctcccc atgtgagagt 361 ctgaaatatg gcctcgtggg aagggaaaga cctgaccatc ccccagcccg acacccgtaa 421 agggtctgtg ctgaggagga ttagtaaaag aggaaggaat gcctctttca gttgagacaa 481 gaggaaggca tctgtctcct gcctgtccct gggcaatgga atgtctctgt ataaaacccg 541 attgtatgct ccatctactg agatagggaa aaactgcctt agggctggag gtgggacctg 601 cgggcagcaa tactgctttg taaagcattg agatgtttat gtgtatgcat atctaaaagc 661 acagcactta atcctttaca ttgtctatga tgcaaagacc tttgttcaca tgtttgtctg 721 ctgaccctct ccccacaatt gtcttgtgac cctgacacat ccccctcttc gagaaacacc 781 cacaaatgat caataaatac taagggaact cagaggctgg cgggatcctc catatgctga 841 acgctggttc cccgggtccc cttatttctt tctctatact ttgtctctgt gtctttttct 901 ttcctaagtc tctcgttcca ccttacgaga aacacccaca ggtgtggagg ggcaacccac 961 ccctaca // LOCUS HUMMHDQ5L 960 bp ds-DNA PRI 03-JUL-1990 DEFINITION Human MHC class II HLA-DQ-LTR5 (DQ,w8) DNA fragment, long terminal repeat region. ACCESSION M33842 KEYWORDS major histocompatibility complex. SOURCE Human (pot. haplotype DQ,w8) lung carcinoma DNA, clone LC14. 
ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 960) AUTHORS Kambhu,S., Falldorf,P. and Lee,J.S. TITLE Endogenous retroviral long terminal repeats (LTR) within the HLA DQ locus JOURNAL Proc. Natl. Acad. Sci. U.S.A. (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable [or printed] sequence for [1] kindly submitted by J.S.Lee, 25-APR-1990. FEATURES from to/span description rpt 1 7 inverted repeat A rpt 954 960 inverted repeat B rpt 555 803 R region signal 785 790 poly-A signal site 74 79 1/2 GRE site 80 87 enhancer core site 525 531 TATA box BASE COUNT 250 a 232 c 219 g 259 t ORIGIN Chromosome 6p21.3. 1 tgtggggaaa agaaagagag atcagattgt tactgtgtct gtgtagaaag aagtagacat 61 agagactcca ttttgttctg tactaagaca aattcttctg ccttgggatg ctgttaatct 121 ataaccttac ccccaaccct gtgctctctg aaacatgtgc tgtgtcaact cggggttaaa 181 tggattaagg gcggtgcaag atgtgctttg ttaaacagat gcttgaaggc agcatgctcc 241 ttaagagtca tcaccactcc ctaatctcaa gtacccaggg acacaaacag aaggccgcag 301 ggacctctgc ctaggaaagc caggtattgt ccaaggtttc tccccatgtg acagtctgaa 361 atatggcctc gtgggaaggg aaagacctga ccgtccccca gcctgacacc cgtaaagggt 421 ctgtgctgag gaggattagt ataagaggaa ggcatgcctc ttgcagttga gacaagagaa 481 aggcatctct ctcctgtccg tccctgggca atggaatgtc tcggtataaa acccgattgt 541 atgttccatc tactgagata aggaaaaccg ccttagggct ggaggtggga catgtgggca 601 acaatactgc tctgtaaggc attgagatgt ttatgtgtat gcatatctaa agcacagcac 661 ttaatccttt accttgtcta tgatgcagag agctttgttc acgtgtttat ctgctgacct 721 tctctccact attatcttat gaccctgcca catccccctc tctgagaaac acccaaaaat 781 gatcaataaa tactaaggga actcagaggc tagcgggatc ctccatatgc tgaatgctgg 841 tcccctgggc ccccttattt ctttctctat actttgtctc tgtgtctttt tcttttctaa 901 gtctctcatt ccacctaacg agaaacaccc acaggtgtgg aggggcaacc caccccttca // LOCUS MUSMHEBF1 573 bp ds-DNA ROD 03-JUL-1990 DEFINITION Mouse MHC class II E-beta-f gene, exon 1. 
ACCESSION M35677 M34123 KEYWORDS cell surface glycoprotein; class II gene; integral membrane protein; major histocompatibility complex. SEGMENT 1 of 3 SOURCE Mouse (inbred strain B10.M) DNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 573) AUTHORS Begovich,A.B., Vu,T.H. and Jones,P.P. TITLE Characterization of the molecular defects in the mouse E-beta-f and E-beta-q genes: Implications for the origin of MHC polymorphism JOURNAL J. Immunol. 144, 1957-1964 (1990) STANDARD full staff_review FEATURES from to/span description pept 301 + 394 MHC E-beta-f, exon 1 IVS 395 > 573 MHC E-beta-f intron A BASE COUNT 135 a 151 c 138 g 149 t ORIGIN Chromosome 17. 1 cagctgcctc tgcctcctga gtgctgggat atgaggcatg gccagcagcc cagactgtgt 61 atccatgtaa tgaagagaac tgcaagtttc agaagggaac ctgcaaactg aatctctaac 121 taggaactga tgatgctgaa cttctttgat gctgattggc tcccagcact ggccttaccc 181 aatccagtgg caaagcagtg aatgccctgt ctcttattat cttagcaatg agtaaagaga 241 ataaagttac agtctgaagc ttgccttccc ctctgactct cgtgtctcct ctcctgcagc 301 atgatgtggc tccccagagt tccctgtgtg gcagctgtga tcctgttgct gacagtgctg 361 agccctccag tggctttggt cagagactcc agacgtaaat gcacacctca ggtgctggga 421 tgctcggggt cggggaagga aggagctaac attctcactg tccagtccaa gtccctcgaa 481 actattgata tcttctgtga gcatgcacag tcctcacatg aactctaaac tatgtcccca 541 aacagacgcc tggatgtttg tgctctcaga tct // LOCUS MUSMHEBF2 495 bp ds-DNA ROD 03-JUL-1990 DEFINITION Mouse MHC class II E-beta-f gene, exon 2. ACCESSION M35678 M34123 KEYWORDS cell surface glycoprotein; class II gene; integral membrane protein; major histocompatibility complex. SEGMENT 2 of 3 SOURCE Mouse (inbred strain B10.M) DNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 495) AUTHORS Begovich,A.B., Vu,T.H. and Jones,P.P. 
TITLE Characterization of the molecular defects in the mouse E-beta-f and E-beta-q genes: Implications for the origin of MHC polymorphism JOURNAL J. Immunol. 144, 1957-1964 (1990) STANDARD full staff_review FEATURES from to/span description pept + 61 + 330 MHC E-beta-f, exon 2 IVS < 1 60 MHC E-beta-f intron A IVS 331 > 495 MHC E-beta-f intron B BASE COUNT 111 a 112 c 180 g 92 t ORIGIN About 3.0 kb after segment 1; chromosome 17. 1 cagctgagag ggactcgggc atcttgtcgg cagagaagaa gataattctt gtctccacag 61 catggttttt ggaatactgt aaatctgagt gtcatttcta caacgggacg cagcgcgtgc 121 ggtttctgaa aagatacttc tacaacctgg aggagaacct gcgcttcgac agcgacgtgg 181 gcgagttccg cgcggtgacc gagctggggc ggccagacgc cgagaactgg aacagccagc 241 cggagatcct ggaggatgcg cgggccgcgg tggacacgta ctgcagatac aactatgaga 301 tcttggataa attccttgtg cggcggagag gtgagacagg acagggtggg tggggcggaa 361 ccacggtgag ggtggggctg tggggagcag cagaaggcgg tgcgcatgtg cgcaggagcc 421 gcagggaatg ctgggttccc tgcagctgga gccacaggcg cttttaagca gcctcttggc 481 aggggaacgg aattc // LOCUS MUSMHEBF3 2155 bp ds-DNA ROD 03-JUL-1990 DEFINITION Mouse MHC class II E-beta-f gene, exons 3,4,5 and 6. ACCESSION M35679 M34123 KEYWORDS cell surface glycoprotein; class II gene; integral membrane protein; major histocompatibility complex. SEGMENT 3 of 3 SOURCE Mouse (inbred strain B10.M) DNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 2155) AUTHORS Begovich,A.B., Vu,T.H. and Jones,P.P. TITLE Characterization of the molecular defects in the mouse E-beta-f and E-beta-q genes: Implications for the origin of MHC polymorphism JOURNAL J. Immunol. 
144, 1957-1964 (1990) STANDARD full staff_review FEATURES from to/span description pept + 122 403 MHC E-beta-f, exon 3 971 1081 MHC E-beta-f, exon 4 1484 1507 MHC E-beta-f, exon 5 1802 1815 MHC E-beta-f, exon 6 IVS < 1 121 MHC E-beta-f intron B IVS 404 970 MHC E-beta-f intron C IVS 1082 1483 MHC E-beta-f intron D IVS 1508 1801 MHC E-beta-f intron E BASE COUNT 459 a 586 c 569 g 541 t ORIGIN About 3.9 kb after segment 2; chromosome 17. 1 gatccattct ggatggatag atggaggtag gcaggcaggc aggcaggcag gcaggcatgc 61 agacagccta caaggaggac agctccaccc tcatggctcc ttctcacctc tctttctcta 121 gttgagccta cggtgactgt gtaccccaca aagacgcagc ccctggaaca ccacaacctc 181 ctggtctgct ctgtgagtga cttctaccct ggcaacattg aagtcagatg gttccggaat 241 ggcaaggagg agaaaacagg aattgtgtcc acgggcctgg tccgaaatgg agactggacc 301 ttccagacac tggtgatgct ggagacggtt cctcagagtg gagaggttta cacctgccag 361 gtggagtatc ccagcctgac cgaccctgtc acggtcgagt ggagtgagtg gtaacttcca 421 gactctgtga atgcccgccc gggtgggtgt ggtttatccc tgcctgtcag ctttctccac 481 ccacacactc tttccactgg ctttgtgctg tcctgccttt caccatggct tacagtgtag 541 gtgcgtgaag cttctacaag cacagttgcc ccctgggaag cagttatgcc cccatagact 601 catctgagcc tgccagtgac ataacaggtc ctggaatctt cttggcccct gctgcagtct 661 ctgccgttgc tgggttgtgt tcctcctgct gctgctgctg ctgacgatgg acaaggagca 721 gtgcagggtc atgactgaac tcagggacat atagtcatag ctctgccttt gctacccctc 781 agagctcagc agcttcctgt cagctcggct caggcctgtt tggttggttt ctcaacatga 841 ccaggaatgt tgacagccag atcttctaga acacacttct tccttgggct caaagctccg 901 agtctcaggg gtccggagtg gaaatgggat ttgggctaaa accctccaaa cctttggctt 961 cctttctcag aagcacagtc cacatctgca cagaacaaga tgttgagtgg agttgggggc 1021 ttcgtgctgg gcctcctctt cctcggagcg gggctgttca tctacttcag gaaccagaaa 1081 ggtaaggagc ctggtgggag ccccaactcc atagcatttc agggaaaagc catggctttg 1141 ttctcaggat gccattggcc ctgtgacctc aggtttcatt ggattctgaa tgcaacagtc 1201 tgtggttact tgatttgacc ctgaggaggg ataacacatg ggagagttaa gttgattctg 1261 gcttgagacc tgaggacaga ggaaggctgg ggggagccat gggcactgcc ggtgactgaa 1321 gctccctaag 
cccctccctc tgtccatgct cctcttggtt ctgtgtgctc tgggcagtat 1381 taccagagga atctcaggtg gcagctcaga gtctggggac atgtgtctgg ggacagatct 1441 gccttcatgc atgtaagcat ctattttatt ctctcttttc taggacagtc tggacttcag 1501 ccaacaggta acacccattg tcttctctca gagacagatc tgctttccct acagtatggg 1561 ggctggggtg atggactcag ggcacaaaat ggggaagact gagatcccag ggttggccag 1621 gcagttagca ctgagccttg ctccctgcac ttactgaagc ctgtgctctg aagcagcaat 1681 gactcggggc atgagaagtt cctctctgct cactgccatg ctgtaaggag aggcctgaag 1741 cagtcagaga agccactgca gagtgaggtc tggaaacagc cctgtcccct gtgctctaca 1801 ggactcctga gctgagatga agtaacaagg ctgaaggaag gagttccccc ccgtgtctcc 1861 atgccatgaa aacatgtcct gcttggccca catccctcca gagacactgc tcttccagga 1921 cctggctcct cctgattctc caccctggag atctgtgctc ctgatggctg cttatccctg 1981 acccaggcct tgcagctccc agaacagagg ccccactctt cacatctcct gtcccctttt 2041 gtcccttgcc ttttgtctgg cacttctgag ccagtctgct gtcatatgct tttttacatt 2101 tttctcaaat aaacaaataa tgaaagtcat ctgcttcata gagtttcaag cagaa // LOCUS MUSMHEBQ1 574 bp ds-DNA ROD 03-JUL-1990 DEFINITION Mouse MHC class II E-beta-q gene, exon 1. ACCESSION M35680 M34124 KEYWORDS cell surface glycoprotein; class II gene; integral membrane protein; major histocompatibility complex. SEGMENT 1 of 3 SOURCE Mouse (inbred strain B10.M) DNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 574) AUTHORS Begovich,A.B., Vu,T.H. and Jones,P.P. TITLE Characterization of the molecular defects in the mouse E-beta-f and E-beta-q genes: Implications for the origin of MHC polymorphism JOURNAL J. Immunol. 144, 1957-1964 (1990) STANDARD full staff_review FEATURES from to/span description pept 301 + 394 MHC E-beta-q, exon 1 IVS 395 > 574 MHC E-beta-q intron A BASE COUNT 132 a 152 c 143 g 147 t ORIGIN Chromosome 17.
1 cagctgcctc tgcctcctga gtgctgggat atgaggcatg gccagcagcc cagactgagt 61 atccatgtaa tgaagagaac tgcaagtttc agaaggggac ctgcaaactg aatctctaac 121 tagcaactga tgatgctgga ctcctttgat gctgattggc tcccagcact ggccttaccc 181 aatccagtgg caaagcagtg aatgccctgt ctcttattat cttagcaatg agtaaagaga 241 ataaagttac agtctgaagc ttgccttccc ctctgactcc tgtgtctcct ctcctgcagc 301 atggtgtggc tccccagagt tccctgtgtg gcagctgtga tcctgttgct gacagtgctg 361 agccctccag tggctttggt cagagactcc agacgttaag tgcacacctc aggtgctggg 421 atgctcgggg tcggggaagg aaggagctaa cattctcact gtccaggcca agtccctcgg 481 aactattgat atcttctgtg agcatgcaca gtcctcacat gaactctaaa ctatgtcccc 541 aaacagaagc ctggatgttt gtgctctcag atct // LOCUS MUSMHEBQ2 495 bp ds-DNA ROD 03-JUL-1990 DEFINITION Mouse MHC class II E-beta-q gene, exon 2. ACCESSION M35681 M34124 KEYWORDS cell surface glycoprotein; class II gene; integral membrane protein; major histocompatibility complex. SEGMENT 2 of 3 SOURCE Mouse (inbred strain B10.G) DNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 495) AUTHORS Begovich,A.B., Vu,T.H. and Jones,P.P. TITLE Characterization of the molecular defects in the mouse E-beta-f and E-beta-q genes: Implications for the origin of MHC polymorphism JOURNAL J. Immunol. 144, 1957-1964 (1990) STANDARD full staff_review FEATURES from to/span description pept + 61 + 330 MHC E-beta-q, exon 2 IVS < 1 60 MHC E-beta-q intron A IVS 331 > 495 MHC E-beta-q intron B BASE COUNT 113 a 116 c 176 g 90 t ORIGIN About 3.0 kb after segment 1; chromosome 17.
1 cagctgagag ggactcgggc atcttgtcgg cagagaagaa gataattctt gtctccacag 61 catggttttt ggaatactgt aaatctgagt gtcatttcta caacgggacg cagcgcgtgc 121 ggtttctgaa aagatacttc tacaacctgg aggagaacct gcgcttcgac agcgacgtgg 181 gcgagttccg cgcggtgacc gagctggggc ggccagacgc cgagaactgg aacagccagc 241 cggagatcct ggagcaaaag cgggccgcgg tggacacgta ctgcagacac aactatgaga 301 tcttcgataa cttccttgtg cggcggagag gtgagacagg acagggtggc tggggcggaa 361 ccacggtgag ggtggggctg tggggagcag cagaaggcgg tgcgcatgtg cgcaggagcc 421 gcagggaatg ctgggttccc tgcagctgga gccacaggcg cttttaagca gcctcttggc 481 aggggaacgg aattc // LOCUS MUSMHEBQ3 2159 bp ds-DNA ROD 03-JUL-1990 DEFINITION Mouse MHC class II E-beta-q gene, exons 3,4,5 and 6. ACCESSION M35682 M34124 KEYWORDS cell surface glycoprotein; class II gene; integral membrane protein; major histocompatibility complex. SEGMENT 3 of 3 SOURCE Mouse (inbred strain B10.G) DNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 2159) AUTHORS Begovich,A.B., Vu,T.H. and Jones,P.P. TITLE Characterization of the molecular defects in the mouse E-beta-f and E-beta-q genes: Implications for the origin of MHC polymorphism JOURNAL J. Immunol. 144, 1957-1964 (1990) STANDARD full staff_review FEATURES from to/span description pept + 126 407 MHC E-beta-q, exon 3 975 1085 MHC E-beta-q, exon 4 1488 1511 MHC E-beta-q, exon 5 1806 1819 MHC E-beta-q, exon 6 IVS < 1 125 MHC E-beta-q intron B IVS 408 974 MHC E-beta-q intron C IVS 1086 1487 MHC E-beta-q intron D IVS 1512 1805 MHC E-beta-q intron E BASE COUNT 459 a 589 c 572 g 539 t ORIGIN About 3.9 kb after segment 2; chromosome 17.
1 gatccattct ggatggatag atggaggtag gcaggcaggc aggcaggcag gcaggcaggc 61 atgcagacag cctacaagga ggacagctcc accctcatgg ctccttctca cctctctttc 121 tctagttgag cctacggtga ctgtgtaccc cacaaagacg cagcccctgg aacaccacaa 181 cctcctggtc tgctctgtga gtgacttcta ccctggcaac attgaagtca gatggttccg 241 gaatggcaag gaggagaaaa caggaattgt gtccacgggc ctggtccgaa atggagactg 301 gaccttccag acactggtga tgctggagac ggttcctcag agtggagagg tttacacctg 361 ccaggtggag catcccagcc tgaccgaccc tgtcacggtc gagtggagtg agtggtaact 421 tccagactct gtgaatgccc gcccgggtgg gtgtggttta tccccgcctg tcagctttct 481 ccacccacac actctttcca ctggctttgt gctgtcctgc ctttcaccat ggcttacagg 541 gtaggtgcgt gaagcttcta caagcacagt tgccccctgg gaagcagtta tgcccccata 601 gactcatctg agcctgccag tgacataaca ggtcctggaa tcttcttggc ccctgctgca 661 gtctctgccg ttgctgggtt gtgttcctcc tgctgctgct gctgctgacg atggacaagg 721 agcagtgcag ggtcatgact gaactcaggg acatatagtc atagctctgc ctttgctacc 781 cctcagagct cagcagcttc ctgtcagctc ggctcaggcc tgtttggttg gtttctcaac 841 atgaccagga atgttgactg ccagatcttc tagaacacac ttcttccttg ggctcaaagc 901 tccgagtctc aggggtccgg agtggaaatg ggatttgggc taaaaccctc caaacctttg 961 gcttcctttc tcagaagcac agtccacatc tgcacagaac aagatgttga gtggagttgg 1021 gggcttcgtg ctgggcctcc tcttcctcgg agcggggctg ttcatctact tcaggaacca 1081 gaaaggtaag gagcctggtg ggagccccaa ctccatagca tttcagggaa aagccatggc 1141 tttgttctca ggatgccatt ggccctgtga cctcaggttt cattggattc tgaatgcaac 1201 agtctgtggt tacttgattt gaccctgagg agggataaca catgggagag ttaagttgat 1261 tctggcttga gacctgagga cagaggaagg ctggggggag ccatgggcac tgccggtgac 1321 tgaagctccc taagcccctc cctctgtcca tgctcctctt ggttctgtgt gctctgggca 1381 gtattaccag aggaatctca ggtggcagct cagagtctgg ggacatgtgt ctggggacag 1441 atctgccttc atgcatgtaa gcatctattt tattctctct tttctaggac agtctggact 1501 tcagccaaca ggtaacaccc attgtcttct ctcagagaca gatctgcttt ccctacagta 1561 tgggggctgg ggtgatggac tcagggcaca aaatggggaa gactgagatc ccagggttgg 1621 ccaggcagtt agcactgagc cttgctccct gcacttactg aagcctgtgc tctgaagcag 1681 caatgactcg gggcatgaga 
agttcctctc tgctcactgc catgctgtaa ggagaggcct 1741 gaagcagtca gagaagccac tgcagagtga ggtctggaaa cagccctgtc ccctgtgctc 1801 tacaggactc ctgagctgag atgaagtaac aaggctgaag gaaggagttc ccccccgtgt 1861 ctccatgcca tgaaaacatg tcctgcttgg cccacatccc tccagagaca ctgctcttcc 1921 aggacctggc tcctcctgat tctccaccct ggagatctgt gctcctgatg gctgcttatc 1981 cctgacccag gccttgcagc tcccagaaca gaggccccac tcttcacatc tcctgtcccc 2041 ttttgtccct tgccttttgt ctggcacttc tgagccagtc tgctgtcata tgctttttta 2101 catttttctc aaataaacaa ataatgaaag tcatctgctt catagagttt caagcagaa // LOCUS RATHPA1 3282 bp ds-DNA ROD 03-JUL-1990 DEFINITION Rat haptoglobin (Hp) gene, exons 1,2 and 3. ACCESSION M34230 KEYWORDS haptoglobin. SEGMENT 1 of 3 SOURCE Rat (strain Wistar) DNA. ORGANISM Rattus norvegicus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 3282) AUTHORS Marinkovic,S. and Baumann,H. TITLE Structure, hormonal regulation, and identification of the interleukin-6- and dexamethasone-responsive element of the rat haptoglobin gene JOURNAL Mol. Cell. Biol. 
10, 1573-1583 (1990) STANDARD simple staff_review FEATURES from to/span description pept 1065 1069 haptoglobin (Hp), exon 1 2797 2879 haptoglobin, exon 2 3125 + 3226 haptoglobin, exon 3 pre-msg 1032 > 3282 Hp mRNA and intron IVS 1070 2796 Hp intron A IVS 2880 3124 Hp intron B IVS 3227 > 3282 Hp intron C BASE COUNT 858 a 743 c 830 g 831 t 20 others ORIGIN 1 ctaatttaaa aacgtttttt aaaacgagtg aagccattta ggttgagcgg ctacattagc 61 gtgaacagca ttccagaaca ggtgtcgggc tgaaacattg gttttctcct gggctgcgaa 121 cacagcgagt ctgccattga ggactctgtc tctacactag catgtggtgt ggctttccgc 181 taacaacaat cagaggagac acagcaggct catttcactg atttcaaatc ggaagacttt 241 tagcaacagg aagatgtcct catgggtcgg gaagcaactg tgaaacggaa ccgatttctt 301 tttactgttc tgtgggcgag actgcaggaa tttctacact ggatttaagt gattccgaga 361 taagtccaga gaagggagcc agtacaaggg tcccatgtca gtctacctat agagctttag 421 tcactctgag attgaagagc agtggaccaa gacccaataa ctcagtctgc tgcctgcaaa 481 ttccagagct ctccacaccc aggagatggt catgcttggg caggagagtt gaaaaaagaa 541 aagacttctt ttatagtctg agttaagggc tgggtcacaa gggtgtttaa aaaaaaaaaa 601 aaagagggct ggggatttag ctcagtggta gagcgcttac ctaggaagca caaggcctgg 661 gttcggtccc agctcgaaaa aaagaccaaa aaaaaaaaaa aaaaaaaaaa aaaaagagag 721 gtctcgtccc tctcccagtt aagtatcaga ttaacagccc ctattccccg tcccactctc 781 tggggttatc acactgcggt gggtgggagg ggtcgtgaag ttgctagatt tcttcatgat 841 ttgtaaaata acaccacgag gagagccaag tatgaagcaa gagctcagct cttgaaaagg 901 ggtttgcttt gtggttactg gaacagtcac tgaccttagc aaggccgaca ttgtgcaaac 961 acagaaatgg aagaaaagga ggtggggtga aaccgaagca taaaaagggt gagcaggagt 1021 cagcacagcg cacgccttct ggaaagaggt gagagaggcc cacgatgagg tgagtccaca 1081 gtccacactt ttgggcacac aatgcagatg tctctgggag agtgagaaaa tgggatgcag 1141 gaacagggcc gatgggcacc gttctgtggg agttaagccc gcagcctgca ggcgcatatg 1201 gcgagggata gagctgtgga tgcattgcaa cacactgtaa acttacctga agcgttgtga 1261 gacttttttt tttttttttg gtcttttttt tcggagctgg ggaccgaacc caggccttgc 1321 tgccttccta ggcaaagtcc gctctaccac tgagctaaat tccccaaccc cgcgttgtga 1381 gacttttgtt ttataacttg 
actatgcagt ttgagtgtga attttgttgg gtgaagacct 1441 caggctgaaa tgtcaaaggc aggaagtgaa gggaccagtg acaaagcccc ttcctccctg 1501 tgtccatgag agatgggcag gacagacagg gctttctatc tctaaggagg atctttccca 1561 gtgagatgaa aggttttgtt ttttaccagg catgcagcag cttcctggga tgctggctgt 1621 gctgttaaca gacttcctgc ttttaaagga acaaagacaa tagtcacaca gtctagtggc 1681 accatcaagg catccccctt cctttttaaa atcaaaatat aaagactttg aaggttacaa 1741 aaagactaga agcatagtgt ccaaaaggaa ttcctaactg gccagaatct acagggaatt 1801 ggttaccgtt taagtgtggt ctgtgtacca atggtggcca caagtcatgc tgagaggaag 1861 ccagttttct ccaggtaact tctggtttga tacacaatcc ctttttttaa aattatttat 1921 ttatttattt gtttgtttgt ttctgtgagt acactgtcgc tgtctttaga cacaccagaa 1981 gagggcatcg gatctcatta cagaggttgt gagccaccat gtgttgctgg gaattgaact 2041 caggacctct ggaagagcag tcagtcgtct taaccgctgg gaattgaact caggacctct 2101 ggaagagcag tcagtgctct taaccgctga gccctctctc cagccctgat atataacctt 2161 aagaccaaat acttatgaag taataggagc aagcacatgt gagttatata catatgtata 2221 tatttgggtc atagtgcaca cccagggatt ctagagctga ggcagggtga agtctgggag 2281 ttcaggagtt gtgacagcta gaaagatgga ctgtgtctnn nnnnnnnnnn nnnnnnnnta 2341 accttttcat tttggaattc caaaaagaga agagccaaat aaattagagc catcatcttt 2401 aagttagcta cgatgtccta acaatgtctt catagctgga acttaatgat gcgtgcagag 2461 gcttcccctt gctgacgttg tggtcaccac cagaggcaga ggcagaggca gaggctcact 2521 ttgctctgtg cctcctcccc agttggttct tgttccacct cccactctcg ggcgggagac 2581 aggcacttgt tatgtagcac tacgtaaagc cccgatcctc ctgcctcaga gtggagagct 2641 ggggtagcac atatgcttcc acactggtgc tgctttcctt cgggtcatgg tgctcccttt 2701 ctaagcttct acaaaattcc ccagtgacac cttgcttgcg tgtaatgcac aaatgcaaga 2761 agaccaactc tactccttct tgccacttct ctacagagcc ctgggagctg tcgtcactct 2821 cctgctctgg ggtcagcttt ttgctgtgga attgggcaat gatgccacag acattgaagg 2881 tgagtctcag gggtttccca ggagctgtgc accccagcag gctgtggccc tgtctgacca 2941 catcagtccc gcactgtatt aaggaagacc cagacctcct ctcgcctaga ccctcggggc 3001 ctcccggcct cagcttccac tcggtgcaag ggagtctggt gttcagggca gctccgtctc 3061 ttctggcttt gcacggggag catctgatca 
ccacagccct ttcctcgctt ctttctcttg 3121 gcagatgaca gctgcccaaa gcccccagag attgcaaacg gctatgtgga acacttggtt 3181 cgttatcgct gccgacagtt ctacaaacta cagaccgaag gagatggtaa ggctgtttga 3241 gcgggtaggg ctaggctgtc acaccagaac ttaagtgctg ct // LOCUS RATHPA2 482 bp ds-DNA ROD 03-JUL-1990 DEFINITION Rat haptoglobin (Hp) gene, exon 4. ACCESSION M34231 KEYWORDS haptoglobin. SEGMENT 2 of 3 SOURCE Rat (strain Wistar) DNA. ORGANISM Rattus norvegicus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 482) AUTHORS Marinkovic,S. and Baumann,H. TITLE Structure, hormonal regulation, and identification of the interleukin-6- and dexamethasone-responsive element of the rat haptoglobin gene JOURNAL Mol. Cell. Biol. 10, 1573-1583 (1990) STANDARD simple staff_review FEATURES from to/span description pept + 296 + 370 haptoglobin (Hp), exon 4 IVS < 1 295 Hp intron C IVS 371 > 482 Hp intron D BASE COUNT 118 a 108 c 120 g 136 t ORIGIN 1 ttaacccgtg agccgtctcc agtccaggga gtgtagtcta tctacgactt tgtacagcct 61 acattcctga caatttctaa gagcttcatt gtgtctttaa agctcccgtg gttgtcatag 121 cctccttttg ggagagacac tctttaattc cattttttca atgaggaaac tgaggacgga 181 gatgccaagg tagcttgtga ggggaagagt cttgatctga actctgacct cttcctgtcc 241 aactctttca tcaggccaca ttcattttct ctgagctcac ctccttttgt ttcaggaatc 301 tacaccttaa acagtgagaa gcaatgggtg aacccagctg ctggcgataa actccccaag 361 tgtgaggcag gtgggtgttg aggtcttaaa gcatggggct aaaatggggc catgtttctc 421 ttgtgtgcct gagtgagtaa gacagggtca gagagacacg ctgcaaagga ggacaatgac 481 ta // LOCUS RATHPA3 1245 bp ds-DNA ROD 03-JUL-1990 DEFINITION Rat haptoglobin (Hp) gene, exon 5. ACCESSION M34232 KEYWORDS haptoglobin. SEGMENT 3 of 3 SOURCE Rat (strain Wistar) DNA. ORGANISM Rattus norvegicus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 1245) AUTHORS Marinkovic,S. and Baumann,H. 
TITLE Structure, hormonal regulation, and identification of the interleukin-6- and dexamethasone-responsive element of the rat haptoglobin gene JOURNAL Mol. Cell. Biol. 10, 1573-1583 (1990) STANDARD simple staff_review FEATURES from to/span description pept + 329 1107 haptoglobin (Hp), exon 5 IVS < 1 328 Hp intron D BASE COUNT 309 a 297 c 342 g 297 t ORIGIN 1 ctgcagaggc tctggaagaa tcagccacca ctgcttgcga aaccaacagt acaggaacac 61 tgcccttgcc acctgctccg tgtgtgtgtg tgtgtgtgtg tgtgtgtgtg tgtgtgtgtg 121 tgtgtgtgtg tgtgtacgtg tgtataaata tatatatgta tatacctaca tacatatgta 181 atcgtcatca cacatacaca ttccacaatc ctcttgaaag tcaatgacag acctgaaagc 241 tgtgtacatt tcattcttag acaaagttgc cctgcagggg cctggtgtga actgctgctc 301 acatcggtct ctcctcctcc ctccgcagtg tgtgggaagc ccaagcatcc tgtggaccag 361 gtacagcgca tcatcggtgg ttccatggac gccaaaggca gctttccttg gcaggccaag 421 atgatctcca gacatggact caccactggg gccacactga tcagtgacca gtggctgctg 481 accactgccc aaaacctctt cctgaatcac agtgagaatg cgacagccaa ggacattgcc 541 cctaccttaa cactctatgt ggggaaaaac cagctggtgg agattgagaa ggtagttctc 601 caccccgagc gctctgtggt ggatatcggg ctgatcaagc tcaaacagaa agtgcttgtc 661 actgagaaag tcatgcctat ctgcctgcct tccaaagact acgtagcgcc aggccgcatg 721 ctatgtgtcc ggttgggggc gcggaatgtc aactttagat ttactgaacg tctcaagtat 781 gtcatgctgc ctgtggctga ccaggagaag tgtgagctgc actatgagaa aagcacagtg 841 cctgagaaga aaggcgctgt aactcctgtt ggggtacagc ccatcttgaa taagcatacc 901 ttctgtgctg gccttaccaa gtatgaggaa gacacttgct atggtgacgc tggcagtgcc 961 tttgccgtcc atgacacgga ggaggacacc tggtatgcag ctgggatcct gagctttgac 1021 aagagttgtg ccgtagctga gtatggtgtg tatgtgaagg caactgatct gaaggactgg 1081 gtccaggaaa caatggccaa gaactagttc agggctgact agagggctgc acacagtggg 1141 gcagggcaat tcaccctgga agaggaagta gaagggttgg ggacataatc tgagggctgc 1201 tagccctgca ttgctcagtc aataataaaa aacgagcttt ggacc // LOCUS MUSTCAXL 331 bp ss-mRNA ROD 03-JUL-1990 DEFINITION Mouse T-cell receptor active alpha-chain mRNA V-region, partial cds, from hybridoma LD1. 
ACCESSION M34194 KEYWORDS T-cell receptor; T-cell receptor alpha-chain; variable region. SOURCE Mouse (strain BALB/c) T-cell hybridoma LD1, cDNA to mRNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 331) AUTHORS Taylor,A.H., Haberman,A.M., Gerhard,W. and Caton,A.J. TITLE Structurally diverse T cells can recognize an influenza antigen/MHC complex in the same common orientation JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable [or printed] sequence for [1] kindly submitted by A.H.Taylor, 16-MAY-1990. Author address: A.H.Taylor Wistar Institute of Anatomy and Biology 3601 Spruce Street Philadelphia, PA 19104 Tel: (215) 898-3839 FEATURES from to/span description pept < 1 > 331 T-cell receptor alpha-chain (AA at 1) BASE COUNT 71 a 86 c 89 g 85 t ORIGIN 1 cagtcagtga cgcagcccga tgctcgtgtc actgtctctg aaggagcctc tctgcagctg 61 agatgcaagt attcctcctc tgtgacacct tatctgttct ggtatgtcct gtacccgcgg 121 caggggctgc agctgctcct caagtactat tccggagacc cagtggttca aggagtgaat 181 ggctttgagg ctgagttcag caagagtaac tcttccttcc acctgcggaa agcctccgtg 241 cactggagcg actcggctgt gtacttctgt gctgtgagca tggatggaaa tgagaaaata 301 acttttgggg ctggaaccaa actcaccatt a // LOCUS MUSTCAXM 334 bp ss-mRNA ROD 03-JUL-1990 DEFINITION Mouse T-cell receptor active alpha-chain mRNA V-region, partial cds, from hybridoma LD3. ACCESSION M34196 KEYWORDS T-cell receptor; T-cell receptor alpha-chain; variable region. SOURCE Mouse (strain BALB/c) T-cell hybridoma LD3, cDNA to mRNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 334) AUTHORS Taylor,A.H., Haberman,A.M., Gerhard,W. and Caton,A.J. 
TITLE Structurally diverse T cells can recognize an influenza antigen/MHC complex in the same common orientation JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable [or printed] sequence for [1] kindly submitted by A.H.Taylor, 16-MAY-1990. Author address: A.H.Taylor Wistar Institute of Anatomy and Biology 3601 Spruce Street Philadelphia, PA 19104 Tel: (215) 898-3839 FEATURES from to/span description pept < 1 > 334 T-cell receptor alpha-chain (AA at 1) BASE COUNT 97 a 84 c 75 g 78 t ORIGIN 1 cagcaggtga gacaaagtcc ccaatctctg acagtctggg aaggagagac agcaattctg 61 aactgcagtt atgaggacag cacttttgac tacttcccat ggtaccgtct gttccctggg 121 gaaagccctg cactcctgat agccatacgt ccagtgtcca ataaaaagga agatggacga 181 ttcacaatct tcttcaataa aagggagaaa aagctctcct tgcacatcac agactctcag 241 cctggagact cagctaccta cttctgtgca gcaagaagta caggctttgc aagtgcgctg 301 acatttggat ctggcacaaa agtcattgtt ctac // LOCUS MUSTCAXN 327 bp ss-mRNA ROD 03-JUL-1990 DEFINITION Mouse T-cell receptor active alpha-chain mRNA V-region, partial cds, from hybridoma MT1-14. ACCESSION M34198 KEYWORDS T-cell receptor; T-cell receptor alpha-chain; variable region. SOURCE Mouse (strain BALB/c) T-cell hybridoma MT1-14, cDNA to mRNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 327) AUTHORS Taylor,A.H., Haberman,A.M., Gerhard,W. and Caton,A.J. TITLE Structurally diverse T cells can recognize an influenza antigen/MHC complex in the same common orientation JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable [or printed] sequence for [1] kindly submitted by A.H.Taylor, 16-MAY-1990. 
Author address: A.H.Taylor Wistar Institute of Anatomy and Biology 3601 Spruce Street Philadelphia, PA 19104 Tel: (215) 898-3839 FEATURES from to/span description pept < 1 > 327 T-cell receptor alpha-chain (AA at 1) BASE COUNT 92 a 80 c 85 g 70 t ORIGIN 1 gactcagtga ctcagacgga aggtcaagtg gccctctcag aagaggactt tcttacgata 61 cactgcaact actcagcctc agggtaccca gctctgttct ggtatgtgca gtatcccgga 121 gaagggccac agttcctctt tagagcctca agggacaaag agaaaggaag cagcagaggg 181 tttgaagcca catacaataa agaagccacc tccttccact tgcagaaagc ctcagtgcaa 241 gagtcagact cggctgtgta ctactgtgct ctgagtgatc agcgggggaa gcttatcttt 301 ggacagggaa ccaagttatc tatcaag // LOCUS MUSTCAXO 324 bp ss-mRNA ROD 03-JUL-1990 DEFINITION Mouse T-cell receptor active alpha-chain mRNA V-region, partial cds, from hybridoma MT1-27. ACCESSION M34200 KEYWORDS T-cell receptor; T-cell receptor alpha-chain; variable region. SOURCE Mouse (strain BALB/c) T-cell hybridoma MT1-27, cDNA to mRNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 324) AUTHORS Taylor,A.H., Haberman,A.M., Gerhard,W. and Caton,A.J. TITLE Structurally diverse T cells can recognize an influenza antigen/MHC complex in the same common orientation JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable [or printed] sequence for [1] kindly submitted by A.H.Taylor, 16-MAY-1990. 
Author address: A.H.Taylor Wistar Institute of Anatomy and Biology 3601 Spruce Street Philadelphia, PA 19104 Tel: (215) 898-3839 FEATURES from to/span description pept < 1 > 324 T-cell receptor alpha-chain (AA at 1) BASE COUNT 92 a 80 c 83 g 69 t ORIGIN 1 gactcagtga ctcagacgga aggtcaagtg gccctctcag aagaggactt tcttacgata 61 cactgcaact actcagcctc agggtaccca gctctgttct ggtatgtgca gtatcccgga 121 gaagggccac agttcctctt tagagcctca agggacaaag agaaaggaag cagcagaggg 181 tttgaagcca catacaataa agaagccacc tccttccact tgcagaaagc ctcagtgcaa 241 gagtcagact cggctgtgta ctactgtgct ctgaggagca actatcagtt gatctggggc 301 tctgggacca agctaattat aaag // LOCUS MUSTCAXP 297 bp ss-mRNA ROD 03-JUL-1990 DEFINITION Mouse T-cell receptor active alpha-chain mRNA V-region, partial cds, from hybridoma MT1-6. ACCESSION M34202 KEYWORDS T-cell receptor; T-cell receptor alpha-chain; variable region. SOURCE Mouse (strain BALB/c) T-cell hybridoma MT1-6, cDNA to mRNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 297) AUTHORS Taylor,A.H., Haberman,A.M., Gerhard,W. and Caton,A.J. TITLE Structurally diverse T cells can recognize an influenza antigen/MHC complex in the same common orientation JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable [or printed] sequence for [1] kindly submitted by A.H.Taylor, 16-MAY-1990. 
Author address: A.H.Taylor Wistar Institute of Anatomy and Biology 3601 Spruce Street Philadelphia, PA 19104 Tel: (215) 898-3839 FEATURES from to/span description pept < 1 > 297 T-cell receptor alpha-chain (AA at 1) BASE COUNT 85 a 73 c 74 g 65 t ORIGIN 1 gtgacattat ctgaaggaac ttctctgact gtgaactgtt cctatgaaac caaacagtac 61 ccaaccctgt tctggtatgt gcagtatccc ggagaaggtc cacagctcct ctttaaagtc 121 ccaaaggcca acgagaaggg aagcagcaga gggtttgaag ccacatacaa taaagaagcc 181 acctccttcc acttgcagaa agcctcagtg caagagtcag actcggctgt gtactactgt 241 gctctgagtg atcgggggac caatacaggc aaattaacct ttggggatgg gaccgtg // LOCUS MUSTCAXQ 193 bp ss-mRNA ROD 03-JUL-1990 DEFINITION Mouse T-cell receptor active alpha-chain mRNA V-region, partial cds, T-cell clone V2.1. ACCESSION M34204 KEYWORDS T-cell receptor; T-cell receptor alpha-chain; variable region. SOURCE Mouse (strain BALB/c) T-cell lymphoid clone V2.1, cDNA to mRNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 193) AUTHORS Taylor,A.H., Haberman,A.M., Gerhard,W. and Caton,A.J. TITLE Structurally diverse T cells can recognize an influenza antigen/MHC complex in the same common orientation JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable [or printed] sequence for [1] kindly submitted by A.H.Taylor, 16-MAY-1990. 
Author address: A.H.Taylor Wistar Institute of Anatomy and Biology 3601 Spruce Street Philadelphia, PA 19104 Tel: (215) 898-3839 FEATURES from to/span description pept < 1 > 193 T-cell receptor alpha-chain (AA at 2) BASE COUNT 56 a 45 c 49 g 43 t ORIGIN 1 cctctttaaa gtcccaaagg ccaacgagaa gggaagcagc agagggtttg aagccacata 61 caataaagaa gccacctcct tccacttgca gaaagcctca gtgcaagagt cagactcggc 121 tgtgtactac tgtgctctga gtggaggcaa taataagctg acttttggtc aaggaaccgt 181 tctgagtgtt ctg // LOCUS MUSTCAXR 333 bp ss-mRNA ROD 03-JUL-1990 DEFINITION Mouse T-cell receptor active alpha-chain mRNA V-region, partial cds, from hybridoma MT1-33. ACCESSION M34206 KEYWORDS T-cell receptor; T-cell receptor alpha-chain; variable region. SOURCE Mouse (strain BALB/c) T-cell hybridoma MT1-33, cDNA to mRNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 333) AUTHORS Taylor,A.H., Haberman,A.M., Gerhard,W. and Caton,A.J. TITLE Structurally diverse T cells can recognize an influenza antigen/MHC complex in the same common orientation JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable [or printed] sequence for [1] kindly submitted by A.H.Taylor, 16-MAY-1990. 
Author address: A.H.Taylor Wistar Institute of Anatomy and Biology 3601 Spruce Street Philadelphia, PA 19104 Tel: (215) 898-3839 FEATURES from to/span description pept < 1 > 333 T-cell receptor alpha-chain (AA at 1) BASE COUNT 84 a 96 c 78 g 74 t 1 others ORIGIN 1 gactccgtga cccagacaga aggcctggtc actgtcaccg aggggttgcc tgtgaagctg 61 aactgcacct atcagactac ttatttaact attgcctttt tctggtatgt gcaatatctc 121 aacgaagccc ctcaggtact cctgcggagc tccacagaca acaagaggac cgagcaccaa 181 gggttccacg ccactctcna taagagcagc agctccttcc atctgcagaa gtcctcagcg 241 cagctgtcag actctgccct gtactactgt gctctgagga atacaggagg tgcagataga 301 ctcacctttg ggaaaggaac tcagctgatc atc // LOCUS MUSTCAXS 339 bp ss-mRNA ROD 03-JUL-1990 DEFINITION Mouse T-cell receptor active alpha-chain mRNA V-region, partial cds, from hybridoma MT1-7. ACCESSION M34208 KEYWORDS T-cell receptor; T-cell receptor alpha-chain; variable region. SOURCE Mouse (strain BALB/c) T-cell hybridoma MT1-7, cDNA to mRNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 339) AUTHORS Taylor,A.H., Haberman,A.M., Gerhard,W. and Caton,A.J. TITLE Structurally diverse T cells can recognize an influenza antigen/MHC complex in the same common orientation JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable [or printed] sequence for [1] kindly submitted by A.H.Taylor, 16-MAY-1990. 
Author address: A.H.Taylor Wistar Institute of Anatomy and Biology 3601 Spruce Street Philadelphia, PA 19104 Tel: (215) 898-3839 FEATURES from to/span description pept < 1 > 339 T-cell receptor alpha-chain (AA at 1) BASE COUNT 86 a 103 c 75 g 75 t ORIGIN 1 gactcagtga cccagacaga aggcctggtc actctcaccg aggggttgcc tgtgatgctg 61 aactgcacct atcagactgc ttactcaact ttccttttct ggtatgtgca acatctcaat 121 gaagccccta aactactcct gaagagctcc acagacaaca agaggaccga gcaccaaggg 181 ttccacgcca ctctccataa gagcagcagc tccttccatc tgcagaagtc ctcagcgcag 241 ctgtcagact ctgccctgta ctactgtgct ctgagtgata agactggagc taacactgga 301 aagctcacgt ttggacacgg caccatcctt agggtccat // LOCUS MUSTCAXT 342 bp ss-mRNA ROD 03-JUL-1990 DEFINITION Mouse T-cell receptor active alpha-chain mRNA V-region, partial cds, from hybridoma P1F12C4. ACCESSION M34210 KEYWORDS T-cell receptor; T-cell receptor alpha-chain; variable region. SOURCE Mouse (strain BALB/c) T-cell hybridoma P1F12C4, cDNA to mRNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 342) AUTHORS Taylor,A.H., Haberman,A.M., Gerhard,W. and Caton,A.J. TITLE Structurally diverse T cells can recognize an influenza antigen/MHC complex in the same common orientation JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable [or printed] sequence for [1] kindly submitted by A.H.Taylor, 16-MAY-1990. 
Author address: A.H.Taylor Wistar Institute of Anatomy and Biology 3601 Spruce Street Philadelphia, PA 19104 Tel: (215) 898-3839 FEATURES from to/span description pept < 1 > 342 T-cell receptor alpha-chain (AA at 1) BASE COUNT 80 a 102 c 79 g 81 t ORIGIN 1 gactccgtga cccagacaga aggcctggtc actctcaacg aggggttgcc tgtgatgctg 61 aactgcacct atcagactat ttactcaaat gctttccttt tctggtatgt gcactatctc 121 aatgaatccc cttggctact cctgcggagc tccacagaca acaagaggac cgagcaccaa 181 gggttccacg ccactctcca taagagcagc agctccttcc atctgcagaa gtcctcagcg 241 cagctgtcag actctgccct gtactactgt gctttgagtg agaggtctgg agctaacact 301 ggaaagctca cgtttggaca cggcaccatc cttagggtcc at // LOCUS MUSTCAXU 324 bp ss-mRNA ROD 03-JUL-1990 DEFINITION Mouse T-cell receptor active alpha-chain mRNA V-region, partial cds, from hybridoma P1D3A6. ACCESSION M34212 KEYWORDS T-cell receptor; T-cell receptor alpha-chain; variable region. SOURCE Mouse (strain BALB/c) T-cell hybridoma P1D3A6, cDNA to mRNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 324) AUTHORS Taylor,A.H., Haberman,A.M., Gerhard,W. and Caton,A.J. TITLE Structurally diverse T cells can recognize an influenza antigen/MHC complex in the same common orientation JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable [or printed] sequence for [1] kindly submitted by A.H.Taylor, 16-MAY-1990. 
Author address: A.H.Taylor Wistar Institute of Anatomy and Biology 3601 Spruce Street Philadelphia, PA 19104 Tel: (215) 898-3839 FEATURES from to/span description pept < 1 > 324 T-cell receptor alpha-chain (AA at 1) BASE COUNT 82 a 80 c 81 g 81 t ORIGIN 1 cagcaagtgc agcagagccc cgcgtccttg gttctgcagg agggggagaa tgcagagctg 61 cagtgtaact tttccacatc tttgaacagt atgcagtggt tttaccaacg tcctgaggga 121 agtctcgtca gcctgttcta caatccttct gggacaaagc agagtgggag actgacatcc 181 acaacagtca tcaaagaacg tcgcagctct ttgcacattt cctcctccca gatcacagac 241 tcaggcactt atctctgtgc tatggaggct actggaggca ataataagct gacttttggt 301 caaggaaccg ttctgagtgt tata // LOCUS MUSTCAXV 210 bp ss-mRNA ROD 03-JUL-1990 DEFINITION Mouse T-cell receptor active alpha-chain mRNA V-region, partial cds, from hybridoma 1E1O. ACCESSION M34214 KEYWORDS T-cell receptor; T-cell receptor alpha-chain; variable region. SOURCE Mouse (strain BALB/c) T-cell hybridoma 1E1O, cDNA to mRNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 210) AUTHORS Taylor,A.H., Haberman,A.M., Gerhard,W. and Caton,A.J. TITLE Structurally diverse T cells can recognize an influenza antigen/MHC complex in the same common orientation JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable [or printed] sequence for [1] kindly submitted by A.H.Taylor, 16-MAY-1990. 
Author address: A.H.Taylor Wistar Institute of Anatomy and Biology 3601 Spruce Street Philadelphia, PA 19104 Tel: (215) 898-3839 FEATURES from to/span description pept < 1 > 210 T-cell receptor alpha-chain (AA at 1) BASE COUNT 54 a 53 c 50 g 53 t ORIGIN 1 gggggaagtc tcgtcagcct gttctacaat ccttctggga caaagcagag tgggagactg 61 acatccacta cagtcatcaa agaacgtcgc agctctttgc acatttcctc ctcccagaca 121 acagactcag gcacttatct ctgtgctatg gcggctactg gaggcaataa taagctgact 181 tttggtcaag gaaccgttct gagtgttata // LOCUS MUSTCAXW 234 bp ss-mRNA ROD 03-JUL-1990 DEFINITION Mouse T-cell receptor active alpha-chain mRNA V-region, partial cds, from hybridoma 7/6AH1. ACCESSION M34216 KEYWORDS T-cell receptor; T-cell receptor alpha-chain; variable region. SOURCE Mouse (strain BALB/c) T-cell hybridoma 7/6AH1, cDNA to mRNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 234) AUTHORS Taylor,A.H., Haberman,A.M., Gerhard,W. and Caton,A.J. TITLE Structurally diverse T cells can recognize an influenza antigen/MHC complex in the same common orientation JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable [or printed] sequence for [1] kindly submitted by A.H.Taylor, 16-MAY-1990. Author address: A.H.Taylor Wistar Institute of Anatomy and Biology 3601 Spruce Street Philadelphia, PA 19104 Tel: (215) 898-3839 FEATURES from to/span description pept < 1 > 234 T-cell receptor alpha-chain (AA at 1) BASE COUNT 55 a 59 c 58 g 62 t ORIGIN 1 atgcagtggt tttatcaacg tcctggggga agtctcgtca gcctgttcta caatccttct 61 gggacaaagc agagtgggag actgacatcc actacagtca tcaaagaacg tcgcagctct 121 ttgcacattt cctcctccca gacaacagac tcaggcactt atctctgtgc tatgggtgta 181 tctggtagct tcaataagtt gacctttgga gcagggacca gactggctgt gtgc // LOCUS MUSTCAXX 312 bp ss-mRNA ROD 03-JUL-1990 DEFINITION Mouse T-cell receptor active alpha-chain mRNA V-region, partial cds, from hybridoma 2B11. 
ACCESSION M34218 KEYWORDS T-cell receptor; T-cell receptor alpha-chain; variable region. SOURCE Mouse (strain BALB/c) T-cell hybridoma 2B11, cDNA to mRNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 312) AUTHORS Taylor,A.H., Haberman,A.M., Gerhard,W. and Caton,A.J. TITLE Structurally diverse T cells can recognize an influenza antigen/MHC complex in the same common orientation JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable [or printed] sequence for [1] kindly submitted by A.H.Taylor, 16-MAY-1990. Author address: A.H.Taylor Wistar Institute of Anatomy and Biology 3601 Spruce Street Philadelphia, PA 19104 Tel: (215) 898-3839 FEATURES from to/span description pept < 1 > 312 T-cell receptor alpha-chain (AA at 1) BASE COUNT 89 a 79 c 79 g 64 t 1 others ORIGIN 1 aatccgtggg ccctgagngt ccacgagggt gaaagtgtca cggtgaattg tagttacaag 61 acatccataa ctgccctaca gtggtacaga cagaagtcag gcgaaggccc tgcccagcta 121 atcttaatac gttcaaatga gagagagaag cgcaatggaa gactcagagc cacccttgac 181 acctccagcc agagcagctc cttgtccatc actgctactc ggtgtgaaga caccgctgtg 241 tacttctgtg ctactgagac aggcaatact agaaaacaca tctttgggct ggggacaact 301 ttgcaagtgc aa // LOCUS MUSTCBYAO 153 bp ss-mRNA ROD 03-JUL-1990 DEFINITION Mouse T-cell receptor active beta-chain mRNA V-J-region, partial cds, from hybridoma LD1. ACCESSION M34195 KEYWORDS T-cell receptor; T-cell receptor beta-chain; joining exon; variable region. SOURCE Mouse (strain BALB/c) T-cell hybridoma LD1, cDNA to mRNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 153) AUTHORS Taylor,A.H., Haberman,A.M., Gerhard,W. and Caton,A.J. 
TITLE Structurally diverse T cells can recognize an influenza antigen/MHC complex in the same common orientation JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable [or printed] sequence for [1] kindly submitted by A.H.Taylor, 16-MAY-1990. Author address: A.H.Taylor Wistar Institute of Anatomy and Biology 3601 Spruce Street Philadelphia, PA 19104 Tel: (215) 898-3839 FEATURES from to/span description pept < 1 > 153 T-cell receptor beta-chain (AA at 1) recomb 122 123 V-region end/J-region start BASE COUNT 39 a 37 c 36 g 41 t ORIGIN 1 caaataggag atgtccctga tgggtacaag gccaccagaa caacgcaaga agacttcttc 61 ctcctgctgg aattggcttc tccctctcag acatctttgt acttctgtgc cagcagtgta 121 ggttctggaa atacgctcta ttttggagaa gga // LOCUS MUSTCBYAP 111 bp ss-mRNA ROD 03-JUL-1990 DEFINITION Mouse T-cell receptor active beta-chain mRNA V-J-region, partial cds, from hybridoma LD3. ACCESSION M34197 KEYWORDS T-cell receptor; T-cell receptor beta-chain; joining exon; variable region. SOURCE Mouse (strain BALB/c) T-cell hybridoma LD3, cDNA to mRNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 111) AUTHORS Taylor,A.H., Haberman,A.M., Gerhard,W. and Caton,A.J. TITLE Structurally diverse T cells can recognize an influenza antigen/MHC complex in the same common orientation JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable [or printed] sequence for [1] kindly submitted by A.H.Taylor, 16-MAY-1990. 
Author address: A.H.Taylor Wistar Institute of Anatomy and Biology 3601 Spruce Street Philadelphia, PA 19104 Tel: (215) 898-3839 FEATURES from to/span description pept < 1 > 111 T-cell receptor beta-chain (AA at 1) recomb 73 74 V-region end/J-region start BASE COUNT 18 a 37 c 25 g 31 t ORIGIN 1 ttcctcctgc tggaattggc ttctccctct cagacatctt tgtacttctg tgccgcgtcc 61 ccgacaggga acaccgacta caccttcggc tcagggacca ggcttttggt a // LOCUS MUSTCBYAQ 321 bp ss-mRNA ROD 03-JUL-1990 DEFINITION Mouse T-cell receptor active beta-chain mRNA V-J-region, partial cds, from hybridoma MT1-14. ACCESSION M34199 KEYWORDS T-cell receptor; T-cell receptor beta-chain; joining exon; variable region. SOURCE Mouse (strain BALB/c) T-cell hybridoma MT1-14, cDNA to mRNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 321) AUTHORS Taylor,A.H., Haberman,A.M., Gerhard,W. and Caton,A.J. TITLE Structurally diverse T cells can recognize an influenza antigen/MHC complex in the same common orientation JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable [or printed] sequence for [1] kindly submitted by A.H.Taylor, 16-MAY-1990. 
Author address: A.H.Taylor Wistar Institute of Anatomy and Biology 3601 Spruce Street Philadelphia, PA 19104 Tel: (215) 898-3839 FEATURES from to/span description pept < 1 > 321 T-cell receptor beta-chain (AA at 1) recomb 292 293 V-region end/J-region start BASE COUNT 106 a 81 c 61 g 73 t ORIGIN 1 gacccgaaaa ttatccagaa accaaaatat ctggtggcag tcacagggag cgaaaaaatc 61 ctgatatgcg aacagtatct aggccacaat gctatgtatt ggtatagaca aagtgctaag 121 aagcctctag agttcatgtt ttcctacagc tatcaaaaac ttatggacaa tcagactgcc 181 tcaagtcgct tccaacctca aagttcaaag aaaaaccatt tagaccttca gatcacagct 241 ctaaagcctg atgactcggc cacatacttc tgtgccagca gccccaagac acgtcaaaac 301 accttgtact ttggtgcggg c // LOCUS MUSTCBYAR 210 bp ss-mRNA ROD 03-JUL-1990 DEFINITION Mouse T-cell receptor active beta-chain mRNA V-J-region, partial cds, from hybridoma MT1-27. ACCESSION M34201 KEYWORDS T-cell receptor; T-cell receptor beta-chain; joining exon; variable region. SOURCE Mouse (strain BALB/c) T-cell hybridoma MT1-27, cDNA to mRNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 210) AUTHORS Taylor,A.H., Haberman,A.M., Gerhard,W. and Caton,A.J. TITLE Structurally diverse T cells can recognize an influenza antigen/MHC complex in the same common orientation JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable [or printed] sequence for [1] kindly submitted by A.H.Taylor, 16-MAY-1990. 
Author address: A.H.Taylor Wistar Institute of Anatomy and Biology 3601 Spruce Street Philadelphia, PA 19104 Tel: (215) 898-3839 FEATURES from to/span description pept < 1 > 210 T-cell receptor beta-chain (AA at 1) recomb 167 168 V-region end/J-region start BASE COUNT 57 a 56 c 43 g 54 t ORIGIN 1 aagattatgt ttagctacaa taataagcaa ctcattgtaa acgaaacagt tccaaggcgc 61 ttctcacctc agtcttcaga taaagctcat ttgaatcttc gaatcaagtc tgtagagccg 121 gaggactctg ctgtgtatct ctgtgccagc agctatcgga caccccccta tgctgagcag 181 ttcttcggac cagggacacg actcaccgtc // LOCUS MUSTCBYAS 255 bp ss-mRNA ROD 03-JUL-1990 DEFINITION Mouse T-cell receptor active beta-chain mRNA V-J-region, partial cds, from hybridoma MT1-6. ACCESSION M34203 KEYWORDS T-cell receptor; T-cell receptor beta-chain; joining exon; variable region. SOURCE Mouse (strain BALB/c) T-cell hybridoma MT1-6, cDNA to mRNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 255) AUTHORS Taylor,A.H., Haberman,A.M., Gerhard,W. and Caton,A.J. TITLE Structurally diverse T cells can recognize an influenza antigen/MHC complex in the same common orientation JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable [or printed] sequence for [1] kindly submitted by A.H.Taylor, 16-MAY-1990. 
Author address: A.H.Taylor Wistar Institute of Anatomy and Biology 3601 Spruce Street Philadelphia, PA 19104 Tel: (215) 898-3839 FEATURES from to/span description pept < 1 > 255 T-cell receptor beta-chain (AA at 1) recomb 219 220 V-region end/J-region start BASE COUNT 57 a 68 c 77 g 53 t ORIGIN 1 cagtatccct ggatgagctg gtatcagcag gatctccaaa agcaactaca gtggctgttc 61 actctgcgga gtcctgggga caaagaggtc aaatctcttc ccggtgctga ttacctggcc 121 acacgggtca ctgatacgga gctgaggctg caagtggcca acatgagcca gggcagaacc 181 ttgtactgca cctgcagtgc ggggactggg ggggctacta acaccttgta ctttggtgcg 241 ggcacccgac tatcg // LOCUS MUSTCBYAT 213 bp ss-mRNA ROD 03-JUL-1990 DEFINITION Mouse T-cell receptor active beta-chain mRNA V-J-region, partial cds, T-cell clone V2.1. ACCESSION M34205 KEYWORDS T-cell receptor; T-cell receptor beta-chain; joining exon; variable region. SOURCE Mouse (strain BALB/c) T-cell lymphoid clone V2.1, cDNA to mRNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 213) AUTHORS Taylor,A.H., Haberman,A.M., Gerhard,W. and Caton,A.J. TITLE Structurally diverse T cells can recognize an influenza antigen/MHC complex in the same common orientation JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable [or printed] sequence for [1] kindly submitted by A.H.Taylor, 16-MAY-1990. 
Author address: A.H.Taylor Wistar Institute of Anatomy and Biology 3601 Spruce Street Philadelphia, PA 19104 Tel: (215) 898-3839 FEATURES from to/span description pept < 1 > 213 T-cell receptor beta-chain (AA at 1) recomb 171 172 V-region end/J-region start BASE COUNT 48 a 59 c 58 g 48 t ORIGIN 1 ctgaggctga tccattattc atatggtgct ggcagcactg agaaaggaga tatccctgat 61 ggatacaagg cctccagacc aagccaagag aacttctccc tcattctgga gttggctacc 121 ccctctcaga catcagtgta cttctgtgcc agcggtggcg gccgggggag ttatgctgag 181 cagttcttcg gaccagggac acgactcacc gtc // LOCUS MUSTCBYAU 207 bp ss-mRNA ROD 03-JUL-1990 DEFINITION Mouse T-cell receptor active beta-chain mRNA V-J-region, partial cds, from hybridoma MT1-33. ACCESSION M34207 KEYWORDS T-cell receptor; T-cell receptor beta-chain; joining exon; variable region. SOURCE Mouse (strain BALB/c) T-cell hybridoma MT1-33, cDNA to mRNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 207) AUTHORS Taylor,A.H., Haberman,A.M., Gerhard,W. and Caton,A.J. TITLE Structurally diverse T cells can recognize an influenza antigen/MHC complex in the same common orientation JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable [or printed] sequence for [1] kindly submitted by A.H.Taylor, 16-MAY-1990. 
Author address: A.H.Taylor Wistar Institute of Anatomy and Biology 3601 Spruce Street Philadelphia, PA 19104 Tel: (215) 898-3839 FEATURES from to/span description pept < 1 > 207 T-cell receptor beta-chain (AA at 1) recomb 163 164 V-region end/J-region start BASE COUNT 65 a 54 c 36 g 52 t ORIGIN 1 ctagagttca tgttttccta cagctatcaa aaacttatgg acaatcagac tgcctcaagt 61 cgcttccaac ctcaaagttc aaagaaaaac catttagacc ttcagatcac agctctaaag 121 cctgatgact cggccacata cttctgtgcc agcagcaaaa gggccaacga aagattattt 181 ttcggtcatg gaaccaagct gtctgtc // LOCUS MUSTCBYAV 156 bp ss-mRNA ROD 03-JUL-1990 DEFINITION Mouse T-cell receptor active beta-chain mRNA V-J-region, partial cds, from hybridoma MT1-7. ACCESSION M34209 KEYWORDS T-cell receptor; T-cell receptor beta-chain; joining exon; variable region. SOURCE Mouse (strain BALB/c) T-cell hybridoma MT1-7, cDNA to mRNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 156) AUTHORS Taylor,A.H., Haberman,A.M., Gerhard,W. and Caton,A.J. TITLE Structurally diverse T cells can recognize an influenza antigen/MHC complex in the same common orientation JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable [or printed] sequence for [1] kindly submitted by A.H.Taylor, 16-MAY-1990. 
Author address: A.H.Taylor Wistar Institute of Anatomy and Biology 3601 Spruce Street Philadelphia, PA 19104 Tel: (215) 898-3839 FEATURES from to/span description pept < 1 > 156 T-cell receptor beta-chain (AA at 1) recomb 118 119 V-region end/J-region start BASE COUNT 30 a 48 c 40 g 38 t ORIGIN 1 cctgatgggt acaaggccac cagaacaacg caagaagact tcttcctcct gctggaattg 61 gcttctccct ctcagacatc tttgtacttc tgtgccagca gtgtccgggt ctgggggcct 121 gaacagtact tcggtcccgg caccaggctc acggtt // LOCUS MUSTCBYAW 132 bp ss-mRNA ROD 03-JUL-1990 DEFINITION Mouse T-cell receptor active beta-chain mRNA V-J-region, partial cds, from hybridoma P1F12C4. ACCESSION M34211 KEYWORDS T-cell receptor; T-cell receptor beta-chain; joining exon; variable region. SOURCE Mouse (strain BALB/c) T-cell hybridoma P1F12C4, cDNA to mRNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 132) AUTHORS Taylor,A.H., Haberman,A.M., Gerhard,W. and Caton,A.J. TITLE Structurally diverse T cells can recognize an influenza antigen/MHC complex in the same common orientation JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable [or printed] sequence for [1] kindly submitted by A.H.Taylor, 16-MAY-1990. Author address: A.H.Taylor Wistar Institute of Anatomy and Biology 3601 Spruce Street Philadelphia, PA 19104 Tel: (215) 898-3839 FEATURES from to/span description pept < 1 > 132 T-cell receptor beta-chain (AA at 1) recomb 94 95 V-region end/J-region start BASE COUNT 35 a 39 c 28 g 30 t ORIGIN 1 ccaagccaag agaacttctc cctcattctg gagttggcta ccccctctca gacatcagtg 61 tacttctgtg ccagcggtgc cagacaggca aacacagaag tcttctttgg taaaggaacc 121 agactcacag tt // LOCUS MUSTCBYAX 303 bp ss-mRNA ROD 03-JUL-1990 DEFINITION Mouse T-cell receptor active beta-chain mRNA V-J-region, partial cds, from hybridoma P1D3A6. 
ACCESSION M34213 KEYWORDS T-cell receptor; T-cell receptor beta-chain; joining exon; variable region. SOURCE Mouse (strain BALB/c) T-cell hybridoma P1D3A6, cDNA to mRNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 303) AUTHORS Taylor,A.H., Haberman,A.M., Gerhard,W. and Caton,A.J. TITLE Structurally diverse T cells can recognize an influenza antigen/MHC complex in the same common orientation JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable [or printed] sequence for [1] kindly submitted by A.H.Taylor, 16-MAY-1990. Author address: A.H.Taylor Wistar Institute of Anatomy and Biology 3601 Spruce Street Philadelphia, PA 19104 Tel: (215) 898-3839 FEATURES from to/span description pept < 1 > 303 T-cell receptor beta-chain (AA at 1) recomb 259 260 V-region end/J-region start BASE COUNT 80 a 72 c 79 g 72 t ORIGIN 1 aaggtgacag taacaggagg aaacgtgaca ttgagctgtc gccagactaa tagccacaac 61 tacatgtact ggtatcggca ggacactggg catgggctga ggctgatcca ttactcatat 121 ggtgctggca accttcaaat aggagatgtc cctgatgggt acaaggccac cagaacaacg 181 caagaagact tcttcctcct gctggaattg gcttctccct ctcagacatc tttgtacttc 241 tgtgccagca gtgcaggagc tggaaatacg ctctattttg gagaaggaag ccggctcatt 301 gtt // LOCUS MUSTCBYAY 159 bp ss-mRNA ROD 03-JUL-1990 DEFINITION Mouse T-cell receptor active beta-chain mRNA V-J-region, partial cds, from hybridoma 1E1O. ACCESSION M34215 KEYWORDS T-cell receptor; T-cell receptor beta-chain; joining exon; variable region. SOURCE Mouse (strain BALB/c) T-cell hybridoma 1E1O, cDNA to mRNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 159) AUTHORS Taylor,A.H., Haberman,A.M., Gerhard,W. and Caton,A.J. 
TITLE Structurally diverse T cells can recognize an influenza antigen/MHC complex in the same common orientation JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable [or printed] sequence for [1] kindly submitted by A.H.Taylor, 16-MAY-1990. Author address: A.H.Taylor Wistar Institute of Anatomy and Biology 3601 Spruce Street Philadelphia, PA 19104 Tel: (215) 898-3839 FEATURES from to/span description pept < 1 > 159 T-cell receptor beta-chain (AA at 1) recomb 113 114 V-region end/J-region start BASE COUNT 35 a 40 c 39 g 45 t ORIGIN 1 gatgtccctg atgggtacaa ggccaccaga acaacgcaag aagacttctt cctcctgctg 61 gaattggctt ctccctctca gacatctttg tacttctgtg ccagcagtgt gggttctgga 121 aatacgctct attttggaga aggaagccgg ctcattgtt // LOCUS MUSTCBYAZ 321 bp ss-mRNA ROD 03-JUL-1990 DEFINITION Mouse T-cell receptor active beta-chain mRNA V-J-region, partial cds, from hybridoma 7/6AH1. ACCESSION M34217 KEYWORDS T-cell receptor; T-cell receptor beta-chain; joining exon; variable region. SOURCE Mouse (strain BALB/c) T-cell hybridoma 7/6AH1, cDNA to mRNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 321) AUTHORS Taylor,A.H., Haberman,A.M., Gerhard,W. and Caton,A.J. TITLE Structurally diverse T cells can recognize an influenza antigen/MHC complex in the same common orientation JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable [or printed] sequence for [1] kindly submitted by A.H.Taylor, 16-MAY-1990. 
Author address: A.H.Taylor Wistar Institute of Anatomy and Biology 3601 Spruce Street Philadelphia, PA 19104 Tel: (215) 898-3839 FEATURES from to/span description pept < 1 > 321 T-cell receptor beta-chain (AA at 1) recomb 275 276 V-region end/J-region start BASE COUNT 86 a 78 c 81 g 76 t ORIGIN 1 acccaaagcc ctagaaacaa ggtgacagta acaggaggaa acgtgacatt gagctgtcgc 61 cagactaata gccacaacta catgtactgg tatcggcagg acactgggca tgggctgagg 121 ctgatccatt actcatatgg tgctggcaac cttcaaatag gagatgtccc tgatgggtac 181 aaggccacca gaacaacgca agaagacttc ttcctcctgc tggaattggc ttctccctct 241 cagacatctt tgtacttctg tgccagcagt gtgggttctg gaaatacgct ctattttgga 301 gaaggaagcc ggctcattgt t // LOCUS MUSTCBYBA 339 bp ss-mRNA ROD 03-JUL-1990 DEFINITION Mouse T-cell receptor active beta-chain mRNA V-J-region, partial cds, from hybridoma 2B11. ACCESSION M34219 KEYWORDS T-cell receptor; T-cell receptor beta-chain; joining exon; variable region. SOURCE Mouse (strain BALB/c) T-cell hybridoma 2B11, cDNA to mRNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 339) AUTHORS Taylor,A.H., Haberman,A.M., Gerhard,W. and Caton,A.J. TITLE Structurally diverse T cells can recognize an influenza antigen/MHC complex in the same common orientation JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable [or printed] sequence for [1] kindly submitted by A.H.Taylor, 16-MAY-1990. 
Author address: A.H.Taylor Wistar Institute of Anatomy and Biology 3601 Spruce Street Philadelphia, PA 19104 Tel: (215) 898-3839 FEATURES from to/span description pept < 1 > 339 T-cell receptor beta-chain (AA at 1) recomb 294 295 V-region end/J-region start BASE COUNT 90 a 88 c 88 g 73 t ORIGIN 1 gaggctgcag tcacccaaag ccctagaaac aaggtgacag taacaggagg aaacgtgaca 61 ttgagctgtc gccagactaa tagccacaac tacatgtact ggtatcggca ggacactggg 121 catgggctga ggctgatcca ttactcatat ggtgctggca accttcaaat aggagatgtc 181 cctgatgggt acaaggccac cagaacaacg caagaagact tcttcctcct gctggaattg 241 gcttctccct ctcagacatc tttgtacttc tgtgccagca ggagacaggg gcctagtcaa 301 aacaccttgt actttggtgc gggcacccga ctatcggtg // LOCUS CHKATHA 188 bp ss-mRNA VRT 03-JUL-1990 DEFINITION Chicken avian thymic hormone mRNA, partial cds. ACCESSION M34330 KEYWORDS avian thymic hormone; parvalbumin. SOURCE Chicken thymus, cDNA to mRNA. ORGANISM Gallus gallus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Aves; Neornithes; Neognathae; Galliformes; Phasianidae. REFERENCE 1 (bases 1 to 188) AUTHORS Palmisano,W.A. and Henzl,M.T. TITLE Partial nucleotide sequence of the parvalbumin from chicken thymus designated "avian thymic hormone" JOURNAL Biochem. Biophys. Res. Commun. 167, 1286-1293 (1990) STANDARD simple staff_review FEATURES from to/span description pept < 1 > 188 avian thymic hormone (AA at 1) BASE COUNT 53 a 43 c 50 g 42 t ORIGIN 1 ccggatcaga tcaagaaggt ttttggaatc cttgatcagg acaagagcgg cttcattgaa 61 gaagaagagc ttcagctgtt tctgaagaac ttctcttcga gtgccagagt cctcacctct 121 gcggagacca aagctttcct ggctgcaggt gacaccgacg gcgacgacaa aataggcgta 181 gaagaatt // LOCUS DDISGSPA 1957 bp ds-DNA INV 03-JUL-1990 DEFINITION D.discoideum spore germination-specific protein (270-11) gene, complete cds. ACCESSION M33862 KEYWORDS spore germination-specific protein. SOURCE D.discoideum (strain AX-3) germinating spore, cDNA to mRNA, and DNA. 
ORGANISM Dictyostelium discoideum Eukaryota; Animalia; Protozoa; Sarcomastigophora; Sarcodina; Rhizopoda; Eumycetozoa; Dictyostelia; Dictyosteliida; Dictyosteliidae. REFERENCE 1 (bases 1 to 1957) AUTHORS Giorda,R., Ohmachi,T., Shaw,D.R. and Ennis,H.L. TITLE A shared internal threonine-glutamic acid-threonine-proline repeat defines a family of Dictyostelium discoideum spore germination-specific proteins JOURNAL Biochemistry (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by H.L.Ennis, 17-APR-1990. FEATURES from to/span description pept 171 228 spore germination-specific protein, exon 1 316 1856 spore germination-specific protein, exon 2 IVS 229 315 spore germination-specific protein intron A BASE COUNT 724 a 359 c 258 g 616 t ORIGIN 1 aaataatttt attattttct tttgtaaaag taattaaata aaaaaaaaaa taaaaaataa 61 ataaaattaa ataaagtcaa ttaaaaaaaa aaaaataata taaatatata taaaataaaa 121 aaaaaaaaac aaaacaataa tagtttatga tataaatttt taataataat atgaaaaata 181 tatatagttt attcttatta tttgcattaa taagtgcaac atttgcaagt aagttgaaaa 241 aaaaaaaaaa aaaattatat tgtaaatttt aaataaaaaa caatatacta attattaatt 301 ttaaaattaa attagataat gcatttattg tacattggaa ttcagattca atttcaaaaa 361 aattaacggg acaaattggt gatacaatct ctttttatac aagtgatgga aattctcatg 421 atgtaaaaag ttcagatggt tctgtttcgt caagtgtttt ctctggtagt cttacaaatc 481 ctggaatttt caaggtaaca cttactaaag aaggtaatat tgaatttacc agttcatatg 541 atgaaggtct ttctgcaaca atagtagttt cttctggtgg tcaaattccg attacaacaa 601 cttcatcaac tacaactgat ggtagttcaa ccccttccac tccaacttca acaacttcag 661 cctcaactac tacaagtggt ggtagtgcta caacaacaac aggagaacca attactgatg 721 gttctaatgg aggcgccagt tccacaactg gcaatagcgg gacgacaggt tctgctacca 781 ctactacttc ttcttcttcc gataattccg atggcagtgt aggtacttca actacaactt 841 caccagctat cacaacttca agtgggtcaa taatcgatcc aacttcacca cctacaactg 901 attcatcctc taatagtggt ggttatggtt catcatcttc aattgaaaat ggcgtagaat 961 gtttattaac aatcactcaa gatgcatttg attcttggac atatgataat attatttaca 
1021 ccgtttatca agtaaattta acaaatattg gtacactttc agttgagtct gttattctca 1081 ctccaaatga taactcttta atttaccata cttgggaatt ggtttatgat ggaacttcac 1141 tcactcttcc aacctataga aaagctggtc caatcaatcc agaggaaacc attatctttg 1201 gttatatctc tagaaatagt actgatgtta catttgcttt aagtccaaca tgttcagatt 1261 catcaagtcc aactccaact cctactgaga ctccaactga gactccaact gagactccaa 1321 ctgagactcc aactgagact ccaactgaaa ctccaactga aactccaact gaaactgaaa 1381 ctccaacacc aacaccatca agctcatcta gtgatgtaga tagtggttca tcatctgaaa 1441 ttgaaacccc aacaccaact gaaactgata ccccaacccc aacaccatca agttcttcaa 1501 gtgaaggaag tggatcatca tcagaaactc aaccaccaat tactccacca ccaaccactg 1561 gtacttcttg tttagcccaa gtccaacaaa aagttatcaa ctcatggatt aatggtgaag 1621 ttgatcatta tatacaagtt gaggctacta ttgttaacca aggttcaact ccaatttcat 1681 cttttaattt ttattctgat gctgaacaaa tttggtcagt tgaaaaaaca ggaaccaata 1741 cctataaatt accaagttgg ttctcaacaa ttccagttgg tgggtcccat acctttggtt 1801 atattgttaa atctgctgaa ttatctgacc tcgaaggagt tcaatataca tgttgatttt 1861 aaaactctct ttttgtaata ataaaaaaaa aaaaaaaatt ttttggaaat aaatttaatt 1921 ttcaaaaact agttttgatt tcactttatt taataat // LOCUS DDISGSPB 3655 bp ds-DNA INV 03-JUL-1990 DEFINITION D.discoideum spore germination-specific protein gene, complete cds. ACCESSION M33861 KEYWORDS endo-(1,4)-beta-D-glucanase; spore germination-specific protein. SOURCE D.discoideum (strain AX-3) germinating spore, cDNA to mRNA, and DNA. ORGANISM Dictyostelium discoideum Eukaryota; Animalia; Protozoa; Sarcomastigophora; Sarcodina; Rhizopoda; Eumycetozoa; Dictyostelia; Dictyosteliida; Dictyosteliidae. REFERENCE 1 (bases 1 to 3655) AUTHORS Giorda,R., Ohmachi,T., Shaw,D.R. and Ennis,H.L. TITLE A shared internal threonine-glutamic acid-threonine-proline repeat defines a family of Dictyostelium discoideum spore germination-specific proteins JOURNAL Biochemistry (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by H.L.Ennis, 17-APR-1990. 
FEATURES from to/span description pept 1346 1412 spore germination-specific protein, exon 1 1505 3555 spore germination-specific protein, exon 2 IVS 1413 1504 spore germination-specific protein intron BASE COUNT 1346 a 482 c 459 g 1368 t ORIGIN 1 tttttttttt ttaatatttt ttattttatt ttttttttta attattatta attattaatc 61 tttattataa acaaaatgca tatgtgttaa aattattata accaaaaatt aattaattta 121 aaaaactaag aactatagtt ctgagatttt caatagtttt tttcaaataa tatgatttct 181 ttttcaaggg tcattaaaat tatattatta gaactattta aaaaaaattc aaaagttaaa 241 tatttaactt ttgcattttt aaaaccatca attataataa ttaattattt tattattttt 301 tttttttttt tttttttttt aattattttt gttttttttt tttttttttt tttttttttt 361 ttttattaaa aaaactatga atactttaaa ttatagtttt tcattttttt attaactgat 421 cataatttaa tttaatttaa tttaatttat ttttttgtat ttaatactcg aaaaccacat 481 acccatgatt aattaaaaaa aataaaaaaa aataaaaaaa aaagaaaaag tactttttca 541 aataaaaaat gtttataaaa aaaaattttt ttttgaggcc aagttaatat ttttgggtag 601 ttaaaatact aagatttgtt ccaatttgga tttttaatgg tttttatttt taaaaataat 661 aatttaacat ttttctaatc aattttcaaa tttttttttt tataactgat ttcttttttt 721 tttattttaa ttttttttta attttttttt atttaaaaaa tatttcaagt tgtacatttc 781 cgttagaatt tcatttggaa gatattagat tttaatttaa aaacaatttt cctaaaaaat 841 aaaataaaaa atgcgaaatt taattttttt tttttattaa taattatttt gaattaaatt 901 tttttttttt tttttttttt ttcccagatt tccaatctta taaaaaggaa ttgtttttta 961 tttttttttt tttcattttc aaaaaactaa tttattagat ctttaaaaaa aaaaaaaaaa 1021 ataataataa taataaaaat aataatatta tctattatcc aaatttgttt ttgcaattaa 1081 tttcgttatt ttttttttta aaaaactcac cacatactta cacaccaaaa aataacaaaa 1141 ataataattc tattattata atcaatttat tgtagtataa gtttaacttt taaagttcta 1201 ttaaaaaaaa aaaaaaaaaa aaaaaaagaa aaaaaaaaat atataaaata ataaaacttt 1261 tgtttattat ttttatgtac tataaatttc aaattcctat atctaaattt ttaatatttc 1321 taaattttta taaattaaaa ccaatatgaa aatattgaaa aattgtatat tattaataat 1381 atttgggtta ttatcaactc aattaattaa tggtaaagta taaaaaaaaa aaaaaaaaaa 1441 aaatattata tttcttaaac aaaaaaaaaa acaaaatatt aattcttaat ttttttttta 
1501 ttagcggata ccgattattg ttcattactt gaaaatgcat taatgtttta taaaatgaat 1561 agagctggtc gtttaccaga taacgatata ccatggagag gtaattcagc attgaatgat 1621 gcaagtccaa attcagctaa agatgccaat ggtgatggta atttaagtgg tggttatttt 1681 gatgctggtg atggtgttaa atttggttta ccaatggctt attctatgac tatgttgggt 1741 tggtcattca ttgaatatga atccaatatt gctcaatgtg gtttgacaag tttatacctc 1801 gatacaatta aatatggtac cgactggctt attgcagcac atactgccga taatgaattt 1861 gcaggccaag ttggtgatgg taatgttgat cattcttggt ggggtcctcc agaagatatg 1921 acaatggctc gtccaactta tatgttaaca accgaagcac caggtactga aattgcaatg 1981 gaagcagcat cagcattagc tgcagcttca atagcattta aatcttcaaa cccaacatac 2041 gctgcaactt gcttagcaca tgctaaaact cttcataatt tcgggtacac ttatcgtggt 2101 gtttattcag attccattac gaatgctcaa gctttttata attcatggtc tggctataag 2161 gatgatttag tttggggtag catttggtta tataaagcaa ctcaagattc agattattta 2221 acaaaagccg ttgcagatta tgcatcaggt ggtgttggtg gaatggcaca aggtaattct 2281 cacgattggg ataataaagc accaggttgt tgtttattat tatctaaatt agttccaacc 2341 acaagtactt ataaaactga tttcgaaggt tggttaaatt attggttacc aggtggaggt 2401 gtcacttata ctccaggtgg tttagcatgg atcagacaat ggggtccagc tcgttatgct 2461 gccactgccg ctttccttgg ttctttagct ggtactgaaa aaggcacaga tttcactcaa 2521 aaacaagttg actatttaat tggtaataat ccaaatcaac aatcatttgt agttggtatg 2581 ggtccaaatt atccaattaa tccacatcat cgtgctgccc atcattctac aactaatgat 2641 ataaataatc cagttaataa tttatacctc ttaaaaggtg ctttagttgg tggaccaggt 2701 tcaaatgatg aatatactga tgatagaact gattatattt caaatgaagt tgcaactgat 2761 tataatgctg gtttcgttgg tgcattagct tctcttgtaa atccatcttc aacttctgtt 2821 ccaaccacaa ctccaacagt aactgaaacc ccaacagaga ctccaactga gactccaact 2881 gagactccaa ctgagactcc aacagagact ccaacagaaa ctccaacaga gactccaaca 2941 gaaactccaa cagagactcc aacagaaact ccaacagaaa ctccaacaga aactccaaca 3001 gaaactccaa cagaaactcc aacagaaact ccaaccgaga ctccaactga aactgttact 3061 ccaaccccaa cagtaacacc aactgaaact ccatcaagtg gagaatcttt atcaatctat 3121 aaaagtggat taaaaaatga tttccaagat tggtcatggg gtgagcattc attaactgat 3181 
acaacaaatg ttgaatctgg agaaaccaat tcaatttcat ttacaccaaa agcatatggt 3241 gcagtatttt taggatgttt cgaatgtatt gatactgata catacaataa tattgaattt 3301 gatattaatg gtggtagcag tggtgctcaa ttattaagaa taactgttgt taaaaatagt 3361 aaatctgttg gttccaaatt aattaccgat cttaatggtg gaactccaat cgaagcaaat 3421 tcatggacta aaattaaagc atcctttatt gatgacttta aagtatctgg taaagtcgat 3481 ggtatttgga ttcaagatat caaaggtgat acccaatcaa ctgtatacat aagtaatatt 3541 attgcaactg cttaaaaaaa tattaatatt aaatattaaa aaaagtataa ataaaataat 3601 cttaaattaa aaaaaataag tgttttcgaa attttctata gatatatatc taaaa // LOCUS ECOCYSXE 1396 bp ds-DNA BCT 03-JUL-1990 DEFINITION E.coli cysteine regulon 33 Kd (cysE) and 16 Kd protein (cysX) genes, complete cds. ACCESSION M34333 KEYWORDS cysE gene; cysX gene. SOURCE E.coli (strain K-12) DNA. ORGANISM Escherichia coli Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Facultatively anaerobic rods; Enterobacteriaceae. REFERENCE 1 (bases 1 to 1396) AUTHORS Tei,H., Murata,K. and Kimura,A. TITLE Structure and expression of cysX, the second gene in the Escherichia coli K-12 cysE locus JOURNAL Biochem. Biophys. Res. Commun. 
167, 948-955 (1990) STANDARD simple staff_review FEATURES from to/span description pept 221 1042 33 Kd protein (cysE) pept 919 527 (c) 16 Kd protein (cysX) BASE COUNT 325 a 362 c 393 g 316 t ORIGIN 1 cgcgaactgg cgcatcgctt cggcgttgaa atgccaataa ccgaggaaat ttatcaagta 61 ttatattgcg gaaaaaacgc gcgcgaggca gcattgactt tactaggtcg tgcacgcaag 121 gacgagcgca gcagccacta accccaggga acctttgtta ccgctatgac ccggcccgcg 181 cagaacgggc cggtcattat ctcatcgtgt ggagtaagca atgtcgtgtg aagaactgga 241 aattgtctgg aacaatatta aagccgaagc cagaacgctg gcggactgtg agccaatgct 301 ggccagtttt taccacgcga cgctactcaa gcacgaaaac cttggcagtg cactgagcta 361 catgctggcg aacaagctgt catcgccaat tatgcctgct attgctatcc gtgaagtggt 421 ggaagaagcc tacgccgctg acccggaaat gatcgcctct gcggcctgtg atattcaggc 481 ggtgcgtacc cgcgacccgg cagtcgataa atactcaacc ccgttgttat acctgaaggg 541 ttttcatgcc ttgcaggcct atcgcatcgg tcactggttg tggaatcagg ggcgtcgcgc 601 actggcaatc tttctgcaaa accaggtttc tgtgacgttc caggtcgata ttcacccggc 661 agcaaaaatt ggtcgcggta tcatgcttga ccacgcgaca ggcatcgtcg ttggtgaaac 721 ggcggtgatt gaaaacgacg tatcgattct gcaatctgtg acgcttggcg gtacgggtaa 781 atctggtggt gaccgtcacc cgaaaattcg tgaaggtgtg atgattggcg cgggcgcgaa 841 aatcctcggc aatattgaag ttgggcgcgg cgcgaagatt ggcgcaggtt ccgtggtgct 901 gcaaccggtg ccgccgcata ccaccgccgc tggcgttccg gctcgtattg tcggtaaacc 961 agacagcgat aagccatcaa tggatatgga ccagcatttc aacggtatta accatacatt 1021 tgagtatggg gatgggatct aatgtcctgt gatcgtgccg gatgcgatgt aatcatctat 1081 ccggcctaca gtaactaatc tctcaatacc gctcccggat accccaactg tcgccaggct 1141 tcatacacca ctaccgacac cgcattggac agattcatgc tgcggctgtc cggcaccatc 1201 ggaatgcgaa ttttttgttc agcgggcagg gcatcaagaa tgctcgctgg caggccgcgt 1261 gtttccgggc cgaacatcag ataatcgcca tcctgatagc ttacggcgct gtgagcaggt 1321 gtacctttcg tggtgagggc gaacaggcgc tgggattttc tgcttcgagg aacgcgcgat 1381 agtcatgatg acgcgt // LOCUS ECOTRPP 74 bp ds-DNA SYN 03-JUL-1990 DEFINITION Expression plasmid pDS20 derivative. ACCESSION M34334 KEYWORDS . SOURCE Synthetic DNA. 
ORGANISM Artificial gene Artificial sequences; Genes. REFERENCE 1 (bases 1 to 74) AUTHORS Latta,M., Philit,M., Maury,I., Soubrier,F., Denefle,P. and Mayaux,J.-F. TITLE Tryptophan promoter derivatives on multicopy plasmids: A comparative analysis of expression potentials in Escherichia coli JOURNAL DNA 9, 129-137 (1990) STANDARD simple staff_review BASE COUNT 18 a 18 c 15 g 23 t ORIGIN 1 ctcaaggcgc actcccgttc tggataatgt tttttgcgcc gacatcataa cggttctggc 61 aaatattctg aaat // LOCUS HUMCYTOK 1724 bp ss-mRNA PRI 03-JUL-1990 DEFINITION Human cytokeratin 8 mRNA, complete cds. ACCESSION M34225 KEYWORDS cytokeratin 8. SOURCE Human placenta, cDNA to mRNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 1724) AUTHORS Yamamoto,R., Kao,L.-C., McKnight,C.E. and Strauss,J.F.III. TITLE Cloning and sequence of cDNA for human placental cytokeratin 8. Regulation of the mRNA in trophoblastic cells by cAMP JOURNAL Mol. Endocrinol. 
            4, 370-374 (1990)
  STANDARD  simple staff_review
FEATURES       from  to/span    description
    pept         35     1486    cytokeratin 8
BASE COUNT      401 a    498 c    524 g    301 t
ORIGIN
        1 ttcggcaatt cctacctcca ctcctgcctc caccatgtcc atcagggtga cccagaagtc
       61 ctacaaggtg tccacctctg gcccccgggc cttcagcagc cgctcctaca cgagtgggcc
      121 cggttcccgc atcagctcct cgagcttctc ccgagtgggc agcagcaact ttcgcggtgg
      181 cctgggcggc ggctatggtg gggccagcgg catgggaggc atcaccgcag ttacggtcaa
      241 ccagagcctg ctgagcccct tgtccctgga ggtggacccc aacatccagg ccgtgcgcac
      301 ccaggagaag gagcagatca agaccctgaa caacaagttt gcctccttca tagacaaggt
      361 acggttcctg gagcagcaga acaagatgct ggagaccaag tggagcctcc tgcagcagca
      421 gaagacggct cgaagcaaca tggacaacat gttcgagagc tacatcaaca accttaggcg
      481 gcagctggag actctgggcc aggagaagct gaagctggag gcggagcttg gcaacatgca
      541 ggggctggtg gaggacttca agaacaagta tgaggatgag atcaataagc gtacagagat
      601 ggagaacgaa tttgtcctca tcaagaagga tgtggatgaa gcatacatga acaaggtaga
      661 gctggagtct cgcctggaag ggctgaccga cgagatcaac ttcctcaggc agctgtatga
      721 agaggagatc cgggagctgc agtcccagat ctcggacaca tctgtggtgc tgtccatgga
      781 caacagccgc tccctggaca tggagagcat cattgctgag gtcaaggcac agtacgagga
      841 tattgccaac cgcagccggg ctgaggctga gagcatgtac cagatcaagt atgaggagct
      901 gcagagcctg gctgggaagc acggggatga cctgcggcgc acaaagactg agatctcaga
      961 gatgaaccgg aacatcagcc ggctccaggc tgagattgag ggcctcaaag gccagagggc
     1021 ttccctggag gccgccattg cagatgccga gcagcgtgga gagctggcca ttaaggatgc
     1081 caacgccaag ttgtccgagc tggaggccgc cctgcagcgg gccaagcagg acatggcccg
     1141 gcagctgcgt gagtaccagg agctgatgaa cgtcaagctg gccctggaca tcgacatcgc
     1201 cacctacagg aagctgctgg agggcgagga gagcccgctg gagtctggga tgcagaacat
     1261 gagtattcat acgaagacca ccggcggcta tgcgggtggt ttgagctcgg cctatgggga
     1321 cctcacagac cccggcctca gctacagcct gggctccagc tttggctctg gcgcgggctc
     1381 cagctccttc agccgcacca gctcctccag ggccgtggtt gtgaagaaga tcgagacacg
     1441 tgatgggaag ctggtgtctg agtcctctga cgtcctgccc aagtgaacag ctgcggcagc
     1501 ccctcccagc ctacccctcc tgcgctgccc cagagcctgg gaaggaggcc gctatgcagg
     1561 gtagcactgg gaacaggaga cccacctgag gctcagccct agccctcagc ccacctgggg
     1621 agtttactac ctggggaccc cccttgccca tgcctccagc tacaaaacaa ttcaattgct
     1681 tttttttttt ttggtcccaa aataaaacct cagctagctc tgcc
//