Path: utzoo!utgpu!news-server.csri.toronto.edu!mailrus!uunet!mcsun!unido!gmdzi!al From: al@gmdzi.UUCP (Alexander Linden) Newsgroups: comp.ai.neural-nets Subject: Re: Using deltas at the input layer Summary: another reference Message-ID: <3166@gmdzi.UUCP> Date: 6 Aug 90 16:03:36 GMT References: <12039@sdcc6.ucsd.edu> <14739@csli.Stanford.EDU> Organization: GMD, Sankt Augustin, F. R. Germany Lines: 144 > In <12039@sdcc6.ucsd.edu> demers@beowulf.ucsd.edu (David Demers) writes: > > >Anyone else try anything using a normal backpropogation > >delta at the input layer? > >Dave DeMers > >demers@cs.ucsd.edu We did several experiments on the idea of backpropagating error information to input units. The most illustrative experiments can be found in Linden and Kindermann (1989) and much more detailled but in german in Linden (1990), where we did experiments on recognition of handwritten numerals. We showed that inversion (ie gradient descent in input space) can find input pattern which the neural network classifies as a "7" for example but are itself as close as possible at a "3". In other words this procedure can explicitly detect misclassfications. An very interesting idea now is to use these misclassifications as counterexamples and extending your training set with them (eg Hwang90EE22 or Linden90). A general scheme for detecting misclassifications is implicated through an extended error function: E = (O - T)^2 + (I - IT)^2 O is the output, T the target eg. "7", I is the input (at the beginning this may be used as a starting point for gradient descent in input space) IT is an input target eg the input presentation of some "3" When you calculate error and propagating them back through an already trained NN (leaving weights unchanged) you get error signals for input units. Adding these error values onto the input units (not trivial!, see Linden & Kindermann (1989) or better Kindermann & Linden (1990)) tries to get the input I as much as close to the input target IT and its induced output O as much as close to the given target T. Thrun Linden (1990) applied inversion to recurrent neural networks. In another internal paper we show (Thrun, Moeller, Linden (1990, internal paper)) that it is possible to generate actions from forward models via inversion. Consider a forward model, with current state and some action as input and an next state as output. An action is generated by performing gradient descent in action space in order to approximate the goal with the predicted next state. Very interesting results can be obtained by concatenating such forward models. This leads to an n-step-lookahead planner. But as common practice we have already done only some very small experiments. If you are interested of any paper of our group send your request to Alexander Linden | TEL. (49 or 0) 2241/14-2537 Research Group for Adaptive Systems | FAX. (49 or 0) 2241/14-2618 or -2889 GMD | TELEX 889469 gmd d P. O. BOX 1240 | / al@gmdzi.uucp D-5205 St. Augustin 1 | e-mail< al@zi.gmd.dbp.de Federal Republic of Germany | \ unido!gmdzi!al@uunet.uu.net ------------------------------------------------------------------------------- References to Inversion of Neural Networks: @UNPUBLISHED{Hwang90EE22, AUTHOR = {Hwang, J. N. and Choi, J. J. and Oh, S. and Marks, R. J.}, TITLE = {Query Learning Based on Boundary Search and Gradient Computation of Trained Multilayer Perceptrons}, NOTE = {to appear in Proceedings of IJCNN 90, San Diego, June 17-21, 1990}, YEAR = {1990}, KEYWORDS = {inversion}, REF = {EE22} } @INPROCEEDINGS{Hwang90EE23, AUTHOR = {Hwang, J. N. and Chan, C. H.}, TITLE = {Iterative Constrained Inversion of Neural Networks and its Applications}, BOOKTITLE = {24th Conference on Information Systems and Sciences, Priceton, March 1990}, YEAR = {1990}, KEYWORDS = {inversion}, REF = {EE23} } @ARTICLE{Kindermann90PC, AUTHOR = {Kindermann, J. and Linden, A.}, TITLE = {Inversion of Neural Nets}, YEAR = 1990, JOURNAL = {Parallel Computing}, NOTE = {(to appear)}, REF = {Map} } @INPROCEEDINGS{Linden89Inversion, AUTHOR = {Linden, A. and Kindermann, J.}, TITLE = {Inversion of Multilayer Nets}, BOOKTITLE = {Proceedings of the First International Joint Conference on Neural Networks, Washington, DC}, PUBLISHER = {IEEE}, ADDRESS = {San Diego}, YEAR = 1989, REF = {Inversion} } @MASTERSTHESIS{Linden90diplom, AUTHOR = {Linden, A.}, TITLE = {{U}ntersuchung von {B}ackpropagation in konnektionistischen {S}ystemen}, SCHOOL = {Universit"at Bonn}, YEAR = {1990}, ADDRESS = {Bonn}, REF = {diplom} } @UNPUBLISHED{Suddarth89Z4, AUTHOR = {Suddarth, S. C. and Bourrely, J. C.}, TITLE = {A Back-Propagation Associative Memory for Both Positive and Negative Learning}, YEAR = 1989, NOTE = {Poster Presentation at IJCNN-89}, KEYWORDS = {backpropagation | associative memory | negative training}, REF = {Z4} } @INPROCEEDINGS{Thrun90Inversion, AUTHOR = {Thrun, S. and Linden, A.}, TITLE = {Inversion in Time}, BOOKTITLE = {Proceedings of the EURASIP Workshop on Neural Networks, Sesimbra, Portugal, February 15-17}, ORGANIZATION = {EURASIP} , YEAR = 1990, REF = {Inversion} } @INPROCEEDINGS{Williams86Z23, AUTHOR = {Williams, R. J.}, TITLE = {Inverting a Connectionist Network Mapping by Backpropagation of Error}, BOOKTITLE = {8th Annual Conference of the Cognitive Science Society}, YEAR = 1986, PUBLISHER = {Lawrence Erlbaum}, ADDRESS = {Hillsdale, NJ}, KEYWORDS = {backpropagation | inversion | natural language}, REF = {Z23} }