Path: utzoo!utgpu!watserv1!watmath!att!pacbell.com!ucsd!usc!apple!bionet!UB.CC.UMICH.EDU!Doug_Eernisse From: Doug_Eernisse@UB.CC.UMICH.EDU Newsgroups: bionet.molbio.evolution Subject: Re: Homology/similarity/identity: proper usage Message-ID: <7474208@ub.cc.umich.edu> Date: 1 Feb 91 21:40:26 GMT Sender: daemon@genbank.bio.net Lines: 57 This is another response to the recent query by David Steffen regarding the use of the term "homology." I thought I might as well throw my two cents worth in. Many molecular biologists commonly use informal arbitrary criteria as support for statements of the "homology" of two genes. For example, they might suggest that if two peptide sequences in the same organism were highly similar (e.g., 85 percent) then one could be confident that the proteins were "homologous", due to a gene duplication event, as opposed to similarity due to parallel evolution for similar function. It seems to me that hypotheses of homology are only relevant to phylogenetic inference at the level they are proposed to be synapomorphic (shared derived similarities) on a cladogram. Therefore, it is hopeless to try to provide evidence for homology by the comparison of two taxa or sequences. The only interesting evidence one can bring to bear on the issue of common ancestry is shared "special" similarity relative to one or more outgroup taxa or sequences. This issue is, I think, distinct from the issue of whether homology is used as many of us use the term synapomorphy, as a proposal of homology, or as the actual similarity due to common ancestry which is ultimately impossible to prove. With sequence data, there are also problems of specifying the level of homology. For example, Michael Ghiselin (Syst. Zool. 18: 148-149 (1969) uses the following hypothetical example: A Asp-Val-Glu-Met-Ala B Asp-Pro-Glu-Met-Ala C Asp-Pro-Thr-Met-Ala D Gly-Pro-Thr-Met-Ala E Gly-Pro-Thr-Tyr-Ala F Gly-Pro-Thr-Tyr-Ser Ghiselin argues that similarity is a relation between the peptides as wholes, which decreases from A to F, while homology is a relation between the parts. He argues, for example, that Asp is hypothesized to be homologous to Gly in A and F, respectively, given this alignment of the sequences. He also argues that the peptide sequence A could be homologous to F even though they are completely dissimilar. One can speak of the correspondence between nucleotides or amino acids in terms of their position in a sequence which is hypothesized to be homologous. Although Ghiselin doesn't consider this use of homology, one more normally may also speak of the shared similarity of D, E and F at site 1, relative to A, B and C, which could be a synapomorphy (hypothesis of homology), depending on the outgroup(s) one selects which in turn determines the cladogram topology. One can also hypothesize that peptide F is homologous to peptide A, or more precisely, hypothesize that the shared ancestor of A and F had single protein-coding gene which is traceable, by descent, to the genes in A and F which produced these peptides. Confusing, isn't it? Doug Eernisse usergdef@ub.cc.umich.edu usergdef@umichub.bitnet Museum of Zoology and Dept. of Biology University of Michigan