Path: utzoo!utgpu!news-server.csri.toronto.edu!bonnie.concordia.ca!uunet!bionet!root
From: GenBank-Updates@genbank.bio.net
Newsgroups: bionet.molbio.genbank.updates
Subject: Rabbit alpha-1-globin gene to theta-1-globin pseudogene region
Message-ID:
Date: 22 May 91 12:10:35 GMT
Sender: root@genbank.bio.net
Distribution: bionet
Lines: 138
Approved: lear@genbank.bio.net
Checksum: 25217 9

LOCUS       RABATGL1     4028 bp ds-DNA             MAM       22-MAY-1991
DEFINITION  Rabbit alpha-1-globin gene to theta-1-globin pseudogene region
ACCESSION   X04751
KEYWORDS    alpha-1-globin; alpha-globin; globin; pseudogene; repetitive
            sequence; tandem repeat; theta-1-globin; theta-globin.
SOURCE      Oryctolagus cuniculus DNA.
  ORGANISM  Oryctolagus cuniculus
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
            Theria; Eutheria; Lagomorpha; Leporidae.
REFERENCE   1  (bases 1 to 4028)
  AUTHORS   Cheng,J.-F., Raid,L. and Hardison,R.C.
  TITLE     Isolation and nucleotide sequence of the rabbit globin gene
            cluster psi-zeta-alpha-1-psi-alpha: Absence of a pair of
            alpha-globin genes evolving in concert
  JOURNAL   J. Biol. Chem. 261, 839-848 (1986)
  STANDARD  full automatic
REFERENCE   2  (bases 1 to 4028)
  AUTHORS   Hardison,R.C.
  JOURNAL   Unpublished (1987)
  STANDARD  full automatic
COMMENT     SWISS-PROT; P01948; HBA$RABIT.
            Submitted data [2] include some corrections to the published
            sequence [1]. According to the authors, the sequence from pos.
            50 to 70 may not be completely accurate due to reading problems
            with the sequencing gels. Theta-1 pseudogene was formerly
            called psi alpha.
            Data kindly reviewed (15-Jun-1987) by Hardison R.C.
            From EMBL 26 entry OCATGL1; dated 22-APR-1990.
FEATURES             Location/Qualifiers
     precursor_RNA   150..861
                     /note="primary transcript of alpha-1-globin"
     mRNA            150..280
                     /note="exon 1"
     CDS             186..280
                     /note="alpha-1-globin (AA 1-32) (280 is 2nd base in codon)"
                     /codon_start=186
     intron          281..357
                     /note="intron I"
     mRNA            358..562
                     /note="exon 2"
     CDS             358..562
                     /note="alpha-1-globin (AA 33-100) (358 is 3rd base in codon)"
                     /codon_start=358
     intron          563..645
                     /note="intron II"
     mRNA            646..861
                     /note="exon 3"
     CDS             646..771
                     /note="alpha-1-globin (AA 101-142)"
                     /codon_start=646
     misc_feature    841..846
                     /note="put. polyA signal"
     polyA_site      861..861
                     /note="polyA site"
     repeat_region   1542..1675
                     /note="region of 5 x 25bp tandem repeat 1"
     repeat_region   3067..3133
                     /note="region of 7 tandem repeat 2 (9-10bp)"
     misc_feature    3139..3744
                     /note="pseudogene theta-1-globin"
     misc_feature    3803..3808
                     /note="put. polyA signal"
     polyA_site      3818..3818
                     /note="put. polyA site (found by homology to alpha-1)"
BASE COUNT      685 a   1359 c   1310 g    674 t
ORIGIN
        1 gcggggccgg gtcccaggca gacgccgcga gggcgccccc agcggtggcg gccgccgccg
       61 cgccccgccg cgccggccaa tgagcggggc cccgctgggc gtgcccgcag cacctcgggc
      121 ttaaaagcgc cgcgcagtct gggctccgca cacttctggt ccagtccgac tgagaaggaa
      181 ccaccatggt gctgtctccc gctgacaaga ccaacatcaa gactgcctgg gaaaagatcg
      241 gcagccacgg tggcgagtat ggcgccgagg ccgtggagag gtgaggaccc ccgccccgcc
      301 ccgccccgcc cgagcccgcc ggcgccgcgc cccgctcacg gcctcctgtc cccgcaggat
      361 gttcttgggc ttccccacca ccaagaccta cttcccccac ttcgacttca cccacggctc
      421 tgagcagatc aaagcccacg gcaagaaggt gtccgaagcc ctgaccaagg ccgtgggcca
      481 cctggacgac ctgcccggcg ccctgtctac tctcagcgac ctgcacgcgc acaagctgcg
      541 ggtggacccg gtgaatttca aggtgagccc gcagcccggc tgggagcgtc gcgggggtcg
      601 gcggtccccg accacaccca ccgacgtccg cccctctctc tgcagctcct gtcccactgc
      661 ctgctggtga ccctggccaa ccaccacccc agtgaattca cccctgcggt gcacgcctcc
      721 ctggacaagt tcctggccaa cgtgagcacc gtgctgacct ccaaatatcg ttaagctgga
      781 gcctgggagc cggcctggcc ctccgccccc cccacccccg cagcccaccc ctggtctttg
      841 aataaagtct gagtgagtgg ccgacagtgc ccgtggagtt ctcgtgacct gaggtgcagg
      901 gccggcctag ggacacgtcc gtgcacgtgc cgaggccccc tgtgcagctg caagggacag
      961 gagtgggcaa ccggctggtt ccttccttcc tgcttgcaag tccacgaggg gctgctgaaa
     1021 gaacccccca cacacacatg cacacactcg tgccactcgg ctgcctccag cctgggtccc
     1081 cggctccccc agatctcggg ggggcactgg ctctccctca gcctcccaaa cgtacccacc
     1141 cacccaccca cccacggtgc agacaaaacc ggaggtcgag tgcaggctgc agatcccagc
     1201 agcacccggg gacgctcact cctaagaccc ttaggtcgcg cttggggcca gtgaggccca
     1261 gtgcccacgt ggccaccctg gggctggcac ccctgccttg aggcagcggg ggcccggggt
     1321 ggacagtgcc cgcggcaggc ttccttcctg aagagggagg tttgccgtgc catccagccc
     1381 ctggctaaca ccagtgtcct ctcacgccca gtctggggct cctccttgga ggacaccgtg
     1441 gcagcccctt gggcacctcg ggggcagtgg gagccgtggg aaggggctgt cttcgctcct
     1501 tgagaggaag ggagacaggt gagggtgggg cgggacaggt gcacctgagc aggtgaatgg
     1561 gcagactgtg gtgccaccgt agccaggaat ggtggagcac cgccgtagcc gggaatggtg
     1621 gggcaccgcc gtagccggga atggtggggc accgccgtag ccgggaatgg tggggcacgg
     1681 ctgaacctgc aacactgcct gctgaggagc agccgggcgc aggagcccac ccactggggt
     1741 ggagaccccg cttctccaac cagacgccca gctccgtgca gctcaggttg gggagcagtg
     1801 gtcatcgatg accaggctgg agactcggct tcttagccgc tggcttgctt cctctgctcc
     1861 cgcctgggtt ttgtggtcag tcagcagaag ggcggggggg gggggctcca gtgcccaggt
     1921 ctgtgggagg ggtggaggca ctgtgagggg accacttggg ggtgcggctg gcagggcgtg
     1981 accccatgtg ctctgtgggt ctcctggagt tccattcagg gacgtggccc ccacaagtgc
     2041 cagggctcag cagtgggaga cacactgccc ggaggcggca cacccacatt aggtggacca
     2101 cagacgccag tcctctgctg gccccggctg tgtccggctt cccctgaccc ccgcgtgccc
     2161 tctcgggtct agggccacct ctgcagcaag cagaggcgct cacttgcctg agaatcacgg
     2221 caggccagtc ctgcttggtt taacccagag tggacactga taagtgtcat aagtagaaag
     2281 tatagctaat tggcgtcatg ggtatacagc tgctatttag taggttagga atttgtgtgt
     2341 gtggctgtct ctgtaattac aattacaacc tcagtgcctt aagtcatcaa cactcagctt
     2401 ataatgtctg tgtgcatctt gtttcataat tggataatga atctatattc aaattaatgt
     2461 aacgttgatt tctgtccaag aaaaataaat gcaagcattt aaaaaatcta tgactttttt
     2521 ttaaaagtcc acatgttgaa taatcccatt tattaaacac acacacacac acacaacaag
     2581 caaatccgtg gaaacagaga ggaggttggt gggctggagg aggggctgga ggcactgccc
     2641 cggcagtttg ggagtagagg tggggagggt cgcacgcgct ggcttgacag ctcagtgtgg
     2701 gagctgcaag gctcggctag gcactcagca ggtgcaggtg ttggccgccc gcaacggaac
     2761 tcctgctgcg agccaccccg accggccgcg cggcggccca gcccgggagt cgctgtcacc
     2821 atctcgcgca gcgcccgcgc tctgccgggg ttccgcgtcc tgtccaggtc tccctctgcg
     2881 cgtgtgcata acatgtgtct ccactgaatg tttcaaatgt gtgttttgct gaaaggcctg
     2941 gggttcagag cgagcccgaa agtggcggac cgagactgcg tgcgtgcgcg ggcctccggg
     3001 tgcgcgcggc ggcacacgtg tcgggaacgg gcctgcgcca cgcccccaga ggcccgcggg
     3061 gacccggccc gccgcgcccg ccgcgcccgc cgcgcccgcc gctgcccgcc gctgcccgcc
     3121 gctgcccgcc gctgcgggat ggcgctgtcg gcggcggagc gggcgctgct gcgcgccctg
     3181 tggaagaagc tggggagcaa cgtgggcgtc tacgcgaccg aggccctgga gaggtgcgca
     3241 ccgggagggc gcccccggcc cgccgcgccc cgcgccgcgg ggcccccaca cgcaccacat
     3301 ccccctcctc ccgcagaacc ttggaggcct tcccgcgcac caagatctac ttctcccaca
     3361 tggacctgag cccgggctcc gccaggtcag agcccacggc cgcaaggtgg ccgacgcgct
     3421 gaccctcgcc gcagaccacc tggacgacct gcccggcgcc ctgtccgctc tgagcgacct
     3481 gcacgtgcgc acgctgcgcg tggaccccca ccacttcggg gtgagcgccg ggaaccttcc
     3541 accggggagg gggctcccct aggcggggtg ggggaggaga atcgatggac cgcgagcggg
     3601 aacgacccct ccctgcagct gctgggccac tgtctgctgg tgaccctcgc ccggcactac
     3661 cctggagact tcggccccgc catgcacgcc tcggtggaca aattcctgca ccacgtgatc
     3721 tcggcgctga cctccaagta ccgctgaatg gagggtggga ggtcgtggga cgccccgccc
     3781 cccgtcgacg ccgtcggctt ggagtaaagc cccggggcag cagcctgaac cgagtgctcc
     3841 ctggggattg cgtgtgtggg gatggcctcg ggtccgcaaa ccaaggggct ggcgggtttg
     3901 gggcgtccag gtcccaaatt ccaattcctt ggccttggcc aggagggtgg caggcgggag
     3961 gtggtcgggg ggctgttgat gcccagtcca ggcccttcgc agtactgctc gcttagtcct
     4021 cctgactc
//
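
The CDS coordinates in the FEATURES table can be checked against the ORIGIN block with a few lines of Python. What follows is only a rough sketch and not part of the GenBank entry: the helper names (parse_origin, extract, cds_length) are made up for illustration, and the excerpt holds just the two ORIGIN lines covering positions 181..300, copied from the record. It assumes the ORIGIN lines are contiguous and that feature coordinates are 1-based and inclusive.

```python
import re

# CDS exon coordinates copied from the FEATURES table (1-based, inclusive).
CDS_EXONS = [(186, 280), (358, 562), (646, 771)]

# Two ORIGIN lines copied from the record, covering positions 181..300.
EXCERPT = """
181 ccaccatggt gctgtctccc gctgacaaga ccaacatcaa gactgcctgg gaaaagatcg
241 gcagccacgg tggcgagtat ggcgccgagg ccgtggagag gtgaggaccc ccgccccgcc
"""


def parse_origin(block):
    """Return (first_position, bases) for a run of contiguous ORIGIN lines."""
    first = int(re.search(r"\d+", block).group())
    # Position labels and whitespace are dropped; only base letters are kept.
    bases = "".join(re.findall(r"[acgt]+", block.lower()))
    return first, bases


def extract(first, bases, start, end):
    """Slice a 1-based inclusive range out of a parsed ORIGIN block."""
    if start < first or end > first + len(bases) - 1:
        raise ValueError("range lies outside the parsed block")
    return bases[start - first : end - first + 1]


def cds_length(exons):
    """Total number of bases covered by a list of (start, end) exons."""
    return sum(end - start + 1 for start, end in exons)


first, bases = parse_origin(EXCERPT)
exon1 = extract(first, bases, 186, 280)
total = cds_length(CDS_EXONS)

print(len(exon1), exon1[:3])         # 95 atg
print(total, total // 3, total % 3)  # 426 142 0
```

The first exon comes out at 95 bases (31 whole codons plus 2 bases), which agrees with the note "280 is 2nd base in codon", and the three exons together give 426 bases, that is 142 codons, matching "AA 1-142" when the stop codon is not counted in the CDS range.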