Newsgroups: comp.archives Path: utzoo!news-server.csri.toronto.edu!ox.com!msen.com!emv From: loren@tristan.llnl.gov (Loren Petrich) Subject: [neural-nets] Anybody's Experience with Fahlman's Quickprop? (was Re: Are Conjugate Gradient algorithms any good?) Message-ID: <1991Mar16.051539.9517@ox.com> Followup-To: comp.ai.neural-nets Keywords: Conjugate Gradient algorithms, Back-propagation Sender: emv@msen.com (Edward Vielmetti, MSEN) Reply-To: loren@tristan.llnl.gov (Loren Petrich) Organization: Lawrence Livermore National Laboratory References: <47034@nigel.ee.udel.edu> <92992@lll-winken.LLNL.GOV> Date: Sat, 16 Mar 1991 05:15:39 GMT Approved: emv@msen.com (Edward Vielmetti, MSEN) X-Original-Newsgroups: comp.ai.neural-nets Archive-name: ai/neural-nets/fahlman-quickprop/1991-03-11 Archive: cheops.cis.ohio-state.edu:/pub/neuroprose/fahlman.quickprop-tr.ps.Z [128.146.8.62] Original-posting-by: loren@tristan.llnl.gov (Loren Petrich) Original-subject: Anybody's Experience with Fahlman's Quickprop? (was Re: Are Conjugate Gradient algorithms any good?) Reposted-by: emv@msen.com (Edward Vielmetti, MSEN) Having reviewed some Conjugate Gradient methods, I find them rather complicated. An alternative, due to Fahlman, is the Quickprop algorithm. It is described in some papers of his that can be found in the /pub/neuroprose directory of cheops.cis.ohio-state.edu, available by anonymous ftp. Basically, it works by remembering the previous gradient and the stepsize taken from there, and finding the new weight values by fitting a line from the current gradient to the previous gradient. This operation is done on each weight component separately. In effect, the Hessian is approximated as a diagonal matrix, but one where the nonzero elements are independent of each other. There are some fudge factors that have to be added here and there, such as adding a gradient-descent "starter" and keeping the stepsizes from growing too rapidly, but this algorithm is remarkably simple. I have found it to be a stable and fast algorithm for solving gradient-descent problems. Has anyone else had experience with Quickprop, and how does it compare with Conjugate Gradients and other such methods? $$$$$$$$$$$$$$$$$$$$$$$$$$$$$$$$$$$$$$$$$$$$$$$$$$$$$$$$$$$$$$$$$$$$$$$$$$$$$$$ Loren Petrich, the Master Blaster: loren@sunlight.llnl.gov Since this nodename is not widely known, you may have to try: loren%sunlight.llnl.gov@star.stanford.edu