Path: utzoo!utgpu!news-server.csri.toronto.edu!rpi!zaphod.mps.ohio-state.edu!ncar!csn!ccncsu!purdue!haven.umd.edu!uflorida!caen!uwm.edu!spool.mu.edu!munnari.oz.au!metro!cluster!rex
From: rex@cs.su.oz (Rex Di Bona)
Newsgroups: comp.sys.mips
Subject: 4.52 is it reliable?
Message-ID: <2338@cluster.cs.su.oz.au>
Date: 24 Apr 91 07:04:44 GMT
Sender: news@cluster.cs.su.oz.au
Reply-To: rex@cluster.cs.su.oz (Rex Di Bona)
Organization: Basser Dept of Computer Science, University of Sydney, Australia
Lines: 35

We have just installed the 4.52 kernel on our M120's and 3240 to try and
reduce the frequency of crashes. The symptoms we are/were having are:

1) The machine runs out of mbufs.
2) The machine panics and then takes a monitor exception
   Or.. it panics with 'clget: null client'
   Or.. the machine hangs the silent death
3) The machine double panics or fails to talk to the fuji scsi controller
   resulting in no kernel core dump...

The load on the machines is usually 2.5+ in the run queue, the machines
are swapping, but not really heavily.

The result... well, we used to have two crashes or so a day, now we only
have one (per machine :-)

Is this a known problem? The 4.52 release notes said a mbuf leak was fixed,
but there may still be another :-)

The configuration is:
4 machines (3 by M120, 1 by 3240) each with...
	20+ users, all on ncd-19 X terminals, 48MB of memory
	about 200 procs, 1100 inodes, 600 files,
	100+ process switches and 5000+ system calls a sec.
	60 or so ethernet packets a second doing X traffic.
	(running 4.51, then 4.51 upgraded with 4.52 kernels)

We have tried increasing the number of mbufs, but the machines still
eventually crash.

Any suggestions? Fixes?

P.s. we have 3230's with similar loads which have been running fine for weeks.
--------
Rex di Bona (rex@cs.su.oz.au)
Penguin Lust is NOT immoral