Path: utzoo!utgpu!news-server.csri.toronto.edu!cs.utexas.edu!samsung!emory!mephisto!mcnc!thorin!unc!rooks From: rooks@unc.cs.unc.edu (Mark Rooks) Newsgroups: comp.text Subject: Re: where to get machine readable text? Keywords: machine readable text Message-ID: <16118@thorin.cs.unc.edu> Date: 18 Sep 90 20:36:15 GMT References: <2945@network.ucsd.edu> Sender: news@thorin.cs.unc.edu Reply-To: rooks@unc.cs.unc.edu (Mark Rooks) Organization: University Of North Carolina, Chapel Hill Lines: 53 In article <2945@network.ucsd.edu> pbiron@weber.ucsd.edu (Paul Biron) writes: >I am looking for places (hopefully free, via ftp) to get machine >readable text >e.g. > the Bible > the Talmud (sp?) > works of literature/philosophy (Shakespere, Aristotle, etc) > political/social documents (Bill of Rights, etc) > . > . > . > >I am interested in such texts solely for research purposes and >will not be reselling them. > >Thanx for any pointers, > >Paul Biron pbiron@ucsd.edu (619) 534-5758 Someone has already posted that the Bible is available through simtel. You could also pick up a computer shopper and order a KJB for not much more than the cost of the disks. A few of the major text projects ongoing: Oxford Text Archive: already mentioned by Ed. Some of the text little more than a raw scan. Very spotty collection in the humanities. Not free, though cheap. ARTFL project: Massive collection of 16th through 20th cent. French M-R texts; based at the Univ. of Chicago. Renaissance Textbase project: U. of Toronto. What the name implies. Elec. Text. Corp.: Published a CD (for only $250) last year with 50 MB of American lit. (from the Lib. of America series), the Riverside Shakespeare, a couple of Bibles, and a few other things. Georgetown Univ.: Publishing translations of Hegel. Additional material in the works. Thesaurus Lingua Grecae (TLG): Virtually the whole of ancient Greek. Based at UC-Irvine. Thesaurus Lingua Latinae (TLL): Comparable project to TLG, but Latin. Based in Germany. Most of the major works in philosophy (through the 19th cent.) should be available by this time next year. Mark Rooks