Relay-Version: version B 2.10 5/3/83; site utzoo.UUCP Posting-Version: version B 2.10 5/3/83; site utzoo.UUCP Path: utzoo!henry From: henry@utzoo.UUCP (Henry Spencer) Newsgroups: hacknews Subject: kernel-testing fiasco Message-ID: <4487@utzoo.UUCP> Date: Wed, 17-Oct-84 15:52:10 EDT Article-I.D.: utzoo.4487 Posted: Wed Oct 17 15:52:10 1984 Date-Received: Wed, 17-Oct-84 15:52:10 EDT Organization: U of Toronto Zoology Lines: 20 Well, we tested out a new kernel, with a couple of significant performance improvements (hashing in the inode table, and a freelist for the file table), plus other minor trivia. Looked fine single-user. When we came up multi-user, bizarre things started happening. Shut down again, came back up with the old /unix. Same bizarre problems. I noticed we were getting disk errors. Shut down again, started out to figure out just what was going on -- were our precious Eagles failing all of a sudden? -- and suddenly realized that I'd write-protected drive B during the standalone testing, and never un-protected it!! A flip of a switch, and we came back up perfectly. The real puzzle is, why was the system reacting so poorly to this? The proper response to something like this is a spew of console messages; there weren't any until I tinkered with the srm parameters. Even worse, there were indications that user programs weren't seeing errors either. I've seen some signs of error-handling problems in the rm driver before; it's time for a thorough investigation. -- Henry Spencer @ U of Toronto Zoology {allegra,ihnp4,linus,decvax}!utzoo!henry