Path: utzoo!utgpu!jarvis.csri.toronto.edu!mailrus!uflorida!gatech!hubcap!dfk From: dfk@romeo.cs.duke.edu (David F. Kotz) Newsgroups: comp.parallel Subject: Very Large Data Sets - what are they? Keywords: your experience -- parallel I/O Message-ID: <4783@hubcap.UUCP> Date: 15 Mar 89 13:26:27 GMT Sender: fpst@hubcap.UUCP Lines: 24 Approved: parallel@hubcap.clemson.edu A number of you have or envision applications for computers that involve large amounts of data. Perhaps these are databases, matrices, images, VLSI circuit designs, etc. My research involves filesystems for parallel computers, and I am unsure what kinds and arrangements of data might be used by parallel computers. I am interested in the arrangement of the data on disk, and how the application reads it into memory. Does it read a lot or write a lot, or both? Does it read and write the same file? Does it read/write it all at once, or throughout the run of the application? Does it read/write sequentially or randomly? How might you modify the application for a parallel computer, and specifically, for parallel I/O? Can I discuss this with you? Do you have some data that I can analyze to determine, for example, what is the size of each row of the matrix, or cell of the chip, etc? Thanks, David Kotz Department of Computer Science, Duke University, Durham, NC 27706 ARPA: dfk@cs.duke.edu CSNET: dfk@duke UUCP: decvax!duke!dfk