Path: utzoo!utgpu!news-server.csri.toronto.edu!rpi!dali.cs.montana.edu!uakari.primate.wisc.edu!sdd.hp.com!caen!uwm.edu!linac!midway!quads.uchicago.edu!rtp1 From: rtp1@quads.uchicago.edu (raymond thomas pierrehumbert) Newsgroups: comp.arch Subject: Re: RISC vs. CISC -- SPECmarks Message-ID: <1991May7.150724.18806@midway.uchicago.edu> Date: 7 May 91 15:07:24 GMT References: <1991May2.162909.9165@news.arc.nasa.gov> <819@cadlab.sublink.ORG> <1991May7.061500.7485@marlin.jcu.edu.au> Sender: news@midway.uchicago.edu (NewsMistress) Organization: University of Chicago Lines: 29 Rowan Hughes writes: >I'm a little puzzled by the discussions involving vector vs. risc s-scalar. Vector machines are not really SIMD; they have parallelization through the pipeline, but the bottom line is that the do not process a whole vector "at once". The pipeline only means that each pipe spits out a float result each cycle (or two, in the case of a mul-add). Super scalar machines can also produce a result per cycle. What I am confused about is why superscalar machines aren't seen as clearly superceding vector architectures. Like vector architetures, they use instruction overlap to produce a result (or two) per cycle. The difference is that the compiler or the programmer must arrange things so that the proper overlap is possible, whereas with the vector machines you just issue a single vector instruction, i.e. the particular kind of instruction overlap is hard-wired into the silicon (or GaAs). That would seem to make vector architectures clearly less versatile than superscalar. Perhaps the restrictions make the hardware easier to build; I imagine they certainly make the compilers easier to write, but progress seems to have been pretty good on RISC compilers. A notable exception is none of them I have ever seen will automatically do things like strip mining and unroll-and-jam for you. As far as I can see, insofar as current vector computers have some advantages over superscalar, the performance differences have more to do with memory bandwidth than processor architecture. I'd be happy to hear other comments on this, though. .