Path: utzoo!utgpu!jarvis.csri.toronto.edu!mailrus!tut.cis.ohio-state.edu!unmvax!unm-la!lanl!cmcl2!yale!mfci!rodman From: rodman@mfci.UUCP (Paul Rodman) Newsgroups: comp.arch Subject: Re: When is RISC not RISC? Message-ID: <649@m3.mfci.UUCP> Date: 16 Feb 89 19:00:42 GMT References: <747@atanasoff.cs.iastate.edu> <28200275@mcdurb> Sender: rodman@mfci.UUCP Reply-To: rodman@mfci.UUCP (Paul Rodman) Organization: Multiflow Computer Inc., Branford Ct. 06405 Lines: 34 In article <28200275@mcdurb> aglew@mcdurb.Urbana.Gould.COM writes: > >>Why not have a *real* pipelined memory system and a compiler than can >>handle more than one miserable outstanding load in flight? Or have both. >> >> >> Paul Rodman > >Denelcore's HEP. > Yuk. Please don't bring up that pile of junk. I was referring more to our machine, where the compiler sees an exposed pipeline several beats long to main memory. In some cases a data cache is useful for reducing latency if the hit rate will be high enough. Of course, your compiler must do a good job about deciding when to go for the cache. ...I'm disgusted at these piddly RISC chips that brag about register scoreboarding of one lousy outstanding load at a time. Great for use with data caches, not much use for high performance computation on large data sets (where I'd much rather not use the cache, but want to get decent bandwidth). If you want to do a daxpy, for example, you need two loads and a store per 2 flops. What good are floating point chips if you can't feed 'em? The architecture is broken if I can't at least do 1 load/store (from memory,please) per flop. :-) Paul Rodman rodman@mfci.uucp