Path: utzoo!attcan!uunet!ncrlnk!ncrcae!hubcap!gatech!uflorida!novavax!proxftl!twwells!comparc From: comparc@twwells.uucp (comp.archives) Newsgroups: comp.archives Subject: Comp.archives database format Message-ID: <161@twwells.uucp> Date: 11 Nov 88 06:06:16 GMT Reply-To: comparc@twwells.UUCP (comp.archives) Organization: None, Ft. Lauderdale Lines: 354 Approved: bill@twwells.uucp (T. William Wells) This is the second attempt at the database structure. Changes are still possible, so send in any comments you might have. Here is a short summary of the changes from the previous version: Lines in the database beginning with # are ignored. The end of data in a DB: posting is signaled by a line containing @END. Everything in a DB: posting before the first line beginning with an @ is ignored. The time field in the CO line for ftp access has been changed. A TT line has been added to the site entry format; it contains a short title for the archive site. A TM line has been added to the site entry format; it specifies the best times to use the archives. A KW line has been added to the site entry format; it contains a list of keywords describing the archive. (The original description said that the keywords are separated by a semicolon, this is an error: they are separated by commas.) An IX line has been added to the site entry format; it contains information about the index files for the archive. The contents lines have a new field, containing the size of the file in K. Some field delimiters have been changed. The CO line now uses semicolons instead of colons. The SY line now uses semicolons instead of commas. --- Comments in the databases begin with a #. They are retained with the data but are otherwise ignored. In the line oriented databases, if there is a line that is to be left blank, that line should still be entered, but with everything but the keyword left blank. --- The site database contains a series of entries separated by blank lines. Each entry has the following lines: NM EN TM TT AD MA CO IX KW DE Lines from TT to DE may be repeated as a group as often as necessary to describe different archives at a single site. Each of the lines from AD to DE may be repeated as often as necessary to contain the data. Following is a detailed description of each line. NM This is a domain name. If you are a uucp site, you should write this as .uucp. EN @ () This says who the person is who entered the database entry. The is the output from the date command. TM