Relay-Version: version B 2.10 5/3/83; site utzoo.UUCP Path: utzoo!utgpu!utfyzx!sq!yuri From: yuri@sq.UUCP Newsgroups: comp.text Subject: Re: Need pointer to SGML info (LONG) Message-ID: <1987Apr8.081822.24753@sq.uucp> Date: Wed, 8-Apr-87 08:18:22 EST Article-I.D.: sq.1987Apr8.081822.24753 Posted: Wed Apr 8 08:18:22 1987 Date-Received: Sat, 11-Apr-87 05:28:12 EST References: <883@maynard.BSW.COM> Organization: SoftQuad Inc., Toronto Lines: 168 Keywords: SGML Checksum: 60246 In article <883@maynard.BSW.COM> campbell@maynard.UUCP (Larry Campbell) writes > I'm looking for pointers to SGML info. Is SGML an ANSI standard? > ISO? CCITT? Where can I find documentation on it? SGML is ISO standard 8879. The best source of information is the group that has spearheaded the campaign for the creation of a Standard Generalized Markup Language, The Graphic Communications Association, 1730 North Lynn Street, Suite 604, Arlington, VA, 22209-2095. The GCA organizes workshops and conferences around the world which generally include an SGML component (MarkUp '87, May 13 to 15, Torremolinos, Spain, with a two-day hands-on workshop beforehand, May 11 and 12; TechDoc Eleventh, Aug. 18 to 20, San Francisco; others), and sells documentation defining the standard. You can easily obtain information on becoming a member. Later this month, a group of serious SGML proponents will be launching a new publication entitled TAG, the SGML Newsletter. You may subscribe (one year for $135, bimonthly moving to monthly) or request a free copy of Issue #1 by writing to TAG c/o BDS, Inc. 105A Carpenter Drive Sterling, Virginia 22170 Also of interest might be the Electronic Manuscript Special Interest Group Association of American Publishers attn: Carol Risher 2005 Massachusetts Avenue NW Washington DC 20036 Tel: 202 232-3335 The AAP, working with a number of other groups, including the IEEE, the Council on Library Resources, American Society of Indexers, Library of Congress, American Chemical Society, American Institute of Physics, American Mathematical Society, Printing Industries of America, Association of American University Presses, and some 25 others, has created a publishing-specific implementation of SGML for book, manual and journal creation. That standard, known as the AAP standard, is currently undergoing the ANSI approval stages, and was recently the subject of a workshop at Microsoft's CD-ROM conference. (SGML, particularly the AAP standard, is the best way to store data on CD-ROM: format, device and formatting software independent.) _______________ SGML in spite of its growing presence as a hot topic in the area of document interchange and storage formats, is still a somewhat nebulous acronym. At the recent Seybold Seminars on Publishing at least six products - SoftQuad Publishing Software, Context, Intran, Interleaf, Omnipage and Texet - were described as "supporting SGML" or "SGML-like". Perhaps subscribers to this newsgroup would appreciate a brief introduction to a subject that need not be as complicated as people are making it. SGML is (to some degree) the joint creation of the Department of Defense and the GCA. First and foremost, it is a way of looking at and describing documents, and doing so by describing their structure in a way that is computer-independent, operating system independent and, most important, format independent. If I were to describe a chapter title in a format specific way, I might say, in sqtroff, for instance: .ps 42 \" size: 42 points .ffam BKL \" font family: Bookman light .ft I \" italic face .ce \" centered on the line This is a title In the style of SGML, I would say, simply: This is a title Elsewhere would be defined as having specific characteristics for each use to which the document is put. Note that SGML is NOT "Generic Coding": in Generic Coding, the "tags" can and generally do stand for specific formatting instructions, like macro-calls in most formatting languages. SGML tags, however, specify information about the structure of the document but absolutely nothing about its format: the binding of document elements to specific design specifications is done at a later stage of the publication process. For example, some readers may be reminded of the SCRIBE text formatter. SCRIBE is a "Generic Coding" system; its Basic Environment `tags' Center, Enumerate, FlushLeft, for example, each stand for a very specific (although reasonably device-independent) *format*. And its `Face Codes' like @i, @b, @r and so on are as format-specific as ditroff's analogous \fI, \fP and \fR. McGraw-Hill recently took a huge collection of SGML-encoded articles and produced a 16 volume Encyclopedia of Science and Technology and, simultaneously a CD-ROM containing the same materials and, simultaneously made the textual component available on-line via the Westlaw database. The same source files were used to create all three editions. This forward-looking project could not have taken place without an SGML approach. The Standard itself defines a set of rules for creating an SGML implementation. For instance, the Standard - it is an official ISO standard, by the way, and was officially adopted by the DOD in August, 1983 - says that the ELEMENTS are delimited by and that each element will be closed with an end-tag identical except that it begins with a slash: . The Standard also establishes the idea of ENTITY REFERENCES, strings of characters to stand for longer strings (or logotypes, for instance) or for special or pi characters; and the rules for creating a DOCUMENT TYPE DEFINITION, an ASCII file which sets out the elements, their structural relationships and the entity references. Most of the current SGML software development is in the area of parsers, spurred by everyone's enthusiasm for DOD contracts. The idea is that someone encodes a file using a set of angle bracketed , teaches the parser a specific set of rules by feeding it a DOCUMENT TYPE DEFINITION, and then validates the file by running it through the parser. There are many parsers but, to the best of my knowledge, none that should yet be described as commercially available. Interleaf has signed up a parser produced by a Belgian firm called Sobemap. That is perhaps the most public of parsers. The best one that we've seen - and the one that seems to be the very first to actually comply fully with the requirements of the standard, was produced by Software Exoterica, based in Ottawa. (Much of the discussion about compliance has been complicated by an early release of a DRAFT ISO standard: the later, non-draft version radically differs from the Draft. A number of parser-makers have had to do a lot of unexpected work to upgrade.) I can recommend two groups as consultants for anyone who is serious about SGML: Aspen Systems Corporation attn: Sperling Martin 1600 Research Boulevard Rockville, MD 20850 Tel: 301 251-5356 (These were the consultants to the AAP project, the ones who, quite literally, defined that standard.) Software Exoterica attn: Elizabeth MacKenzie 2447 Kaladar Avenue Ottawa Canada K1V 8B9 Tel: 613 526-4701 (Exoterica is in the final stages of implementing a very large publishing system based on a military specification of SGML for the Canadian Department of Defense.) If you're in Europe, a good source of information is SGML Users' Group attn: Joan Smith, President National Computing Centre Oxford Road, Manchester M1 7ED United Kingdom Tel: 44 61 228 6333 The group's annual meeting will be held in conjunction with the Torremolinos MarkUp '87 Conference: May 14, 1987 2 to 4pm. _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ Yuri Rubinsky, President SoftQuad Inc. 720 Spadina Avenue Toronto Canada M5S 2T9 Tel: (416) 963-8337 (utzoo!sq!yuri)