Newsgroups: comp.compression Path: utzoo!utgpu!news-server.csri.toronto.edu!rpi!sarah!bingnews!kym From: kym@bingvaxu.cc.binghamton.edu (R. Kym Horsell) Subject: Re: theoretical compression factor Message-ID: <1991Apr2.224833.7907@bingvaxu.cc.binghamton.edu> Organization: State University of New York at Binghamton References: <20144@alice.att.com> <1991Apr2.034441.28170@bingvaxu.cc.binghamton.edu> <20159@alice.att.com> Date: Tue, 2 Apr 1991 22:48:33 GMT In article <20159@alice.att.com> jj@alice.UUCP (jj, like it or not) writes: [I wrote] >>Is p lg p a hard-and-fast bound or not? I still think it's an average. >It's ALWAYS the average rate. Given that you're using compression, >you can't measure any other thing, unless you have a fixed message >set, and that's all. (In which case you should enumerate your >messages, perhaps huffmanwise...) If you have any set of data >with the overall 'p' known, that is the average rate for THE WHOLE >SET. Parts of it may do better, because they may hit shorter >symbols (for instance), but if you look at the probabilities >of THAT PART, you should find that you don't do better than >the p log p chosen over your locality, without a model. Ta. -kym