Relay-Version: version B 2.10 5/3/83; site utzoo.UUCP Posting-Version: version B 2.10.1 6/24/83; site alice.UUCP Path: utzoo!watmath!clyde!burl!ulysses!allegra!alice!rab From: rab@alice.UucP (Rick Becker) Newsgroups: net.math.stat Subject: Re: The function 'hist' in S. Message-ID: <4409@alice.UUCP> Date: Thu, 10-Oct-85 05:55:39 EDT Article-I.D.: alice.4409 Posted: Thu Oct 10 05:55:39 1985 Date-Received: Sat, 12-Oct-85 17:26:39 EDT References: <296@natmlab.OZ> Organization: Bell Labs, Murray Hill Lines: 29 Keywords: density > From: rolf@natmlab.OZ (Rolf Turner) > > I've noticed that the hist function in S does something which > seems wrong to me. When the argument 'scale' is TRUE, the > write-up says that hist produces counts on a "density scale". > It seems to me that if you're using the word "density" this > should imply that the histogram integrates to 1. Instead, > hist produces counts that SUM to 1. The "scale" argument to hist was not meant to produce a density scaling. It was defined so that the y-axis would remain comparable when looking at histograms that had varying numbers of observations from the same underlying distribution as well as remaining the same when the observations were changed by a scale factor. A density scale would have the first property but would change drastically if all of the observations were multiplied by 10, say. The idea was to decouple the y-axis scale from the x-axis scale with the "scale" argument. In the old (1981) documentation, the argument was called "density", even though it was computed in this way. We changed the name to "scale" to avoid giving people the impression that it meant true density scaling. In retrospect, it may have been a better idea to use the more familiar true density scaling. However, if you make the change that Rolf Turner suggested, it would probably be a good idea to name the argument "density" once again. -- Rick Becker alice!rab research!rab