Path: utzoo!utgpu!watserv1!watmath!att!rutgers!usc!zaphod.mps.ohio-state.edu!uakari.primate.wisc.edu!aplcen!haven!noc.sura.net!oleary From: oleary@noc.sura.net (dave o'leary) Newsgroups: comp.sys.proteon Subject: Re: Overview Problem Summary: Overview problem "fixed" Message-ID: <1990Oct19.200834.5767@noc.sura.net> Date: 19 Oct 90 20:08:34 GMT References: <9010170358.AA02833@umd5.UMD.EDU> <9010171142.AA08697@sayshell.umd.edu> Organization: Suranet, College Park, MD Lines: 36 Our Overview problems seem to be resolved for the moment. There were a few different things that we did - 1/ we watched the ethernet with a sniffer and saw that there were no broken ethernet packets, and no broken IP packets coming from Overview, even the UDP checksums were okay, however the SNMP part of the packets (i.e. the UDP data) was somehow munged. The gateways weren't responding to the broken queries, so they turned red on the monitor, even though they were up. Worse, some of the gateways were crashing with a NM_6B8 bughalt. Needless to say, this is less than ideal behavior. We also got an error on the monitor process of the gateway reporting a bad SNMP packet. As a hack to get around this we started pinging a bunch of the gateways instead of SNMP querying them - at least it kept Overview and the gateways from crashing. We also raised the time between queries on the various gateways. (hint to network management package implementors: it would have made our lives much easier if we could do this by changing all nodes at once rather than changing the hundred+ nodes one at a time). 2/ We got new software from Proteon - I'm not sure of the details, but it was four executables that replaced older versions. This seemed to help significantly. 3/ We backed up the hard disk, reformatted it, and reinstalled everything. This was completed at about 10 last night and things seem to have worked since last night. Thanks very much for all of your suggestions in our time of need, and for Proteon for the new software. Kim Long spearheaded out efforts here to get the problem resolved, so she may be able to provide more details about the various fixes. Thanks Kim !! dave o'leary SURAnet NOC Mgr.