Path: utzoo!utgpu!jarvis.csri.toronto.edu!cs.utexas.edu!oakhill!marvin From: marvin@oakhill.UUCP (Marvin Denman) Newsgroups: comp.sys.m88k Subject: Re: Information wanted on m88000 Risc workstations Message-ID: <2831@yogi.oakhill.UUCP> Date: 16 Jan 90 20:31:22 GMT References: <641@s5.Morgan.COM> <25A64468.11498@paris.ics.uci.edu> <648@s5.Morgan.COM> <1879@xyzzy.UUCP> <2811@yogi.oakhill.UUCP> <34446@mips.mips.COM> Reply-To: cs.utexas.edu!oakhill!marvin (Marvin Denman) Organization: Motorola Inc., Austin, Texas Lines: 52 In article <34446@mips.mips.COM> , earl@wright.mips.com (Earl Killian) writes: >Consider the application being discussed, >matrix multiply, which is highly vectorizable. If the original poster >is correct in that the 88100, with its pipelined floating-point units, >tops out in 6.7 mflop/s in single precision matrix multiplies, it >really proves this point. The MIPS R3000, with non-pipelined >floating-point units, can do matrix multiplies at > 25MHz 33MHz > single 11.8 mflop/s 15.7 mflop/s > double 7.8 mflop/s 10.4 mflop/s >This an example of why MIPS perfers low-latency to pipelined fp. >-- >UUCP: {ames,decwrl,prls,pyramid}!mips!earl >USPS: MIPS Computer Systems, 930 Arques Ave, Sunnyvale CA, 94086 It should be noted that the 88k numbers you repeated are apparently at 20Mhz and for the specific code fragment posted: DO 10 J = I,N 10 A(I,J) = A(I,J) + B(I,K) * C(K,J) The numbers you posted for the R3000 are PROBABLY for a slightly different code fragment: ( I am more conversant in C so I will translate) for (i=0 ; i