Path: utzoo!attcan!utgpu!jarvis.csri.toronto.edu!mailrus!tut.cis.ohio-state.edu!gem.mps.ohio-state.edu!apple!fox!portal!cup.portal.com!spage From: spage@cup.portal.com (S spage Page) Newsgroups: comp.lang.postscript Subject: Re: PS to ASCII? Message-ID: <24260@cup.portal.com> Date: 20 Nov 89 11:02:29 GMT References: <2323@draken.nada.kth.se> Organization: The Portal System (TM) Lines: 16 PostScript cranks out text mainly through the use of the `show' operator and its variants. This takes a PostScript string, which is ASCII characters in parentheses. Most programs which generate PostScript have a prologue which defines custom procedures, e.g. "J takes an offset on the current line and outputs text, M takes a vertical offset and prints text at the left margin." The result is that you can "see" the text pretty easily in the PostScript output file, and you might be able to write a filter which pulls out all the stuff in parentheses. If you really care you could try to convert special characters in the encoding like em-dashes and fancy quotes to plain ASCII. However, you'll probably have more trouble with spaces. Lots of programs output tiny text strings (no wonder their PS output prints slowly!) and it's hard to tell where the spaces should go. Good luck, your mileage will vary. =S