Path: utzoo!utgpu!watserv1!watmath!att!att!linac!pacific.mps.ohio-state.edu!zaphod.mps.ohio-state.edu!wuarchive!udel!nigel.ee.udel.edu!mccalpin From: mccalpin@perelandra.cms.udel.edu (John D. McCalpin) Newsgroups: comp.graphics.visualization Subject: Re: more hardware blah blah Message-ID: Date: 20 Nov 90 02:02:49 GMT References: <1990Nov19.204135.16561@batcomputer.tn.cornell.edu> Sender: usenet@ee.udel.edu Organization: College of Marine Studies, U. Del. Lines: 25 Nntp-Posting-Host: perelandra.cms.udel.edu In-reply-to: andyrose@batcomputer.tn.cornell.edu's message of 19 Nov 90 20:41:35 GMT >On 19 Nov 90 20:41:35 GMT,andyrose@batcomputer.tn.cornell.edu(Andy Rose)said: Andy> At SuperComputing '90 last week in NY City, IBM had a Andy> RS-6000 550 mowing down linpacks at ~65 mflops. $140,000 to Andy> $500,000. Please note that this performance level is for the 1000x1000 "anything goes" LINPACK test, not the more commonly quoted 100x100 "you can't even modify the comments" version. I am anticipating that the latter (which IBM uses for their published FP performance numbers) will be about 19 MFLOPS, based on the speedup of the clock from the 540 to the 550 (41.? MHz vs 30 MHz). It is possible to get 1000x1000 LINPACK performance of about 36 MFLOPS on a Model 530 by careful coding (i.e. following the advice in Ron Bell's technical report on the RS/6000). I expect that IBM will release the fully optimized LAPACK package within a few months which should provide similar performance levels for most dense linear algebra stuff. Right now, only a few of the level-3 BLAS routines are optimized --- DGEMM (in libblas.a) runs at about 45 MFLOPS on the 530 or about 30-32 MFLOPS on the Model 320. -- John D. McCalpin mccalpin@perelandra.cms.udel.edu Assistant Professor mccalpin@vax1.udel.edu College of Marine Studies, U. Del. J.MCCALPIN/OMNET