Path: utzoo!attcan!uunet!mcsun!unido!ira.uka.de!rusux1!asterix.luftfahrt.uni-stuttgart.de!schmid From: schmid@asterix.luftfahrt.uni-stuttgart.de (Georg Schmid) Newsgroups: comp.sys.apollo Subject: more 'failing update SR10.1.p->SR10.2.p' Message-ID: <185@rusux1.rus.uni-stuttgart.de> Date: 10 Jul 90 12:29:04 GMT Sender: zrf80385@rusux1.rus.uni-stuttgart.de Reply-To: schmid@asterix.luftfahrt.uni-stuttgart.de (Georg Schmid) Organization: Inst. f. Statik & Dynamik der Luft- und Raumfahrtkonstruktionen Lines: 84 Because I received mail from some people which have the same problems running SR10.2.p on their DN10000's, I want to post somewhat more specifically what I found out since my last posting: (Sorry, this Article is quite long) After the installation (invol, diskless on other DN10000, install++) of the os and the configuration of /etc/rc.local the DN10000 hangs while booting tcpd and routed after doing some (successfull) shutdowns before. That means it seems to stop at "Starting standard daemons: tcpd routed" for about 20 minutes before reaching the dm (in fact everyting is only going a bit slowly). When it reaches the dm, every usage of programs like 'ping', 'telnet', 'netstat' causes the node to hang for several minutes with a subsequent error message like 'router/udp: unknown service' or 'icmp: unknown protocol'. When issuing any dm-command while hanging, the cursor disappears and nothing else happens for some minutes. Other people told me, that their machines don't reach the dm at all. The node boots correctly without the internet processes (I just removed tcpd, routed, inetd from '/etc/daemons'). Our second DN10000 (240F3) has been working for several months with SR10.2.p and the internet services. (I just had to issue things like 'mkdev /dev pty' from time to time) The problem why it doesn't boot seems to be located in the directory /sys/node_data/systmp, probably concerning the file 'tcp_data': - When I removed the directory 'systmp' within the Phase II - Shell and copied the directory //other-dn10000/sys/node_data.1b19c/systmp to node 1b19c, it booted successfully, and the internet services worked fine, but only for a few shutdowns.(I used our other DN10000 as a partner node while installing 1b19c, because the other DN10000 holds our AA for SR10.2.p) - When I pressed Ctrl-Return in service mode while hanging at "Starting standard...." the subsequent salvol told me: 'the vtoc trouble flag has been set for the following objects: /sys/node_data/systmp/tcp_data' and sometimes: 'the vtoc trouble flag has been set for the following objects: /sys/node_data/systmp/global_readonly' - When looking at the file tcp_data (after the salvol) from the Phase II Shell, its size was 528384 Bytes, whereas its normal size seems to be 1216512 Bytes. Every time the file has the former size, the internet seems to fail. Because there m u s t be some relevant difference I'm posting the exact configurations of our machines to be compared with machines having the same problem: Node 1B19C 2 CPU's, VS-Graphics, 16MB, 2x700MB Disks (sector striped) with two controllers, 1 Ethernet, 1 Apollo Token Ring controller. installed patches: p103, p108, p118, p119, p120, p124, p128, p130 Node 240F3 4 CPU's, 128MB, 4x700MB Disk (sector striped) with two controllers, 2 Apollo Token Ring controllers. installed patches: same as above At the moment I use a modified rc.local which removes the old systmp and copies a 'working' systmp to /sys/node_data before starting anything in rc.local. This seems to work for now (including TCP/IP) but doesn't really fix the problem, it's just a 'dirty hack' (I know). Some attempts to remove tcp_data within rc.local or to cat /dev/null on it failed. The real problem might be somewhere in the filesystem, something that corrputs things in systmp at shutdown, thus preventing the tcpd to reopen tcp_data and to start correctly (That's just a guess, I'm not an expert) P.S: I APR'ed on this via email, but didn't get any acknowledgement yet, perhaps it failed ? ----- Georg Schmid, ISD Uni Stuttgart, W.-Germany email: schmid@asterix.luftfahrt.uni-stuttgart.de voice: 0(049-)711-685-2053 fax: 0(049-)711-685-3706