Path: utzoo!utgpu!news-server.csri.toronto.edu!rutgers!usc!julius.cs.uiuc.edu!ux1.cso.uiuc.edu!uiatma.atmos.uiuc.edu!bowman
From: bowman@uiatma.atmos.uiuc.edu
Newsgroups: comp.unix.aix
Subject: Re: RS/6000 Model 320 FP Performance
Keywords: BLAS
Message-ID: <1990Oct31.233855.1371@ux1.cso.uiuc.edu>
Date: 31 Oct 90 23:38:55 GMT
References:
Sender: Kenneth P. Bowman
Followup-To: bowman@uiatma.atmos.uiuc.edu
Distribution: usa
Organization: University of Illinois at Urbana
Lines: 26

In article mccalpin@perelandra.cms.udel.edu (John D. McCalpin) writes:
>
>Ooops, there must have been some typo in my code. I extracted the
>code from the tech report again and got the following absolutely
>phenomenal results!
>
>   IBM RS/6000 Model 320 Matrix Multiply Performance
>   Matrix Order    Time per MM    MFLOPS
>        32            .002        29.789
>        64            .019        27.594
. . .

The value of tailoring the algorithms to the architecture is apparent.
Is anyone, including IBM, planning or willing to produce a library of
basic linear algebra subroutines that are optimized for the 6000?
Think of the clock cycles that would be saved!

Prof. Kenneth P. Bowman
Department of Atmospheric Sciences
University of Illinois at Urbana-Champaign
105 S. Gregory Avenue
Urbana, IL 61801
217-328-3102
bowman@uiatma.atmos.uiuc.edu