Path: utzoo!attcan!utgpu!jarvis.csri.toronto.edu!cs.utexas.edu!tut.cis.ohio-state.edu!ucbvax!hayes.fai.alaska.edu!wisner From: wisner@hayes.fai.alaska.edu (Bill Wisner) Newsgroups: comp.protocols.tcp-ip Subject: Why do TCP connections hang? Message-ID: <8911301020.AA14662@hayes.fai.alaska.edu> Date: 30 Nov 89 10:20:15 GMT Sender: daemon@ucbvax.BERKELEY.EDU Organization: The Internet Lines: 35 I sent a message to this august assemblage recently complaining that FTP on our entire campus has an unpleasant habit of hanging in the middle of transfers. It now turns out that the problem is much more general. Any TCP connection may hang under certain unknown circumstances. Background: we have two subnets. The main subnet is populated by a VMS machine (with TWG WINS TCP) and many PCs and Macs running NCSA Telnet. It is connected with a Proteon router to the world at large. The other subnet has a another VAX (with TWG) and a slew of Sun workstaions; it is connected by a second Proteon router to the main subnet. TCP links from any campus machine to any machine off campus are prone to failure. (The same is true of connections from off-campus to on-.) I've used SunOS's etherfind utility to try to find the problem. I have hex dumps of an rlogin connection and an FTP transfer, both of which hung. (A certain message in my mailbox on a remote machine will cause my connection to hang every time if I merely read it. It contains no obvious clues -- it's just another normal looking message.) I could find nothing in the hex dumps. No packets were truncated; one packet arrived and the next one didn't. The hung connections seem to be data-specific. If I try transferring the same file twice, both FTP connections are likely to hang at the same point in the file. As an exercise, I took a file and split it into several small chunks, then tried transferring it. The connection still hung at the same point in the fragmented file. I have come to believe that perhaps the problem resides in the Proteon box that connects us to the Internet-at-large. It is, after all, the one common factor shared by all campus machines. But I am at a loss to figure out just what the problem might be. Any ideas or suggestions would be greatly appreciated. Brought to you by Super Global Mega Corp .com