Path: utzoo!utgpu!news-server.csri.toronto.edu!rpi!zaphod.mps.ohio-state.edu!uwm.edu!linac!midway!gsbsun!valley
From: valley@gsbsun.uchicago.edu (Doug Dougherty)
Newsgroups: comp.binaries.ibm.pc.d
Subject: Re: Need a program to delete duplicate lines of text
Keywords: duplicate, text, deletes, lines, non-document, ascii file
Message-ID: <1991Jun14.131006.22487@midway.uchicago.edu>
Date: 14 Jun 91 13:10:06 GMT
References: <1991Jun14.012035.6708@disk.uucp>
Sender: news@midway.uchicago.edu (NewsMistress)
Distribution: usa
Organization: University of Chicago
Lines: 17

tony@disk.uucp (tony) writes:

>For the past weeks I have been creating dictionary lists for a friend's word
>game (on his BBS). I've been taking data files for various programs and
>converting them to word lists. Each word must be alone on its own line. Often
>I have many duplicate words in every file. I would love to find a program that
>will quickly delete all the duplicates. Deleting them manually or in blocks
>takes forever. I had thought about delimiting each word (") and importing
>the text into a dbase file, but that would be pretty time consuming too. Does
>anyone know of a program that could do the job I need done?

>Thanks!

There is a program called DO.COM (or .EXE, not sure) that DOes a lot of
things to files, including, I think, the eqv of Unix "uniq". You should
get it.
--
(Another fine mess brought to you by valley@gsbsun.uchicago.edu)
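
For anyone who would rather script the job than track down a utility, here is a minimal Python sketch of removing duplicate words from a one-word-per-line list. It is not how DO.COM works (its internals are not described above), and the names dedupe_lines and dedupe_file are made up for illustration. Note that Unix "uniq" by itself only collapses adjacent duplicate lines, so it is normally fed sorted input; the sketch instead remembers every word seen so far, which keeps the original order. It assumes comparison is case-sensitive and that blank lines can be dropped.

```python
def dedupe_lines(lines):
    """Return the words from `lines` with later duplicates dropped.

    Keeps first-seen order. Assumptions: surrounding whitespace is
    stripped, blank lines are skipped, comparison is case-sensitive.
    """
    seen = set()
    unique = []
    for line in lines:
        word = line.strip()
        if not word or word in seen:
            continue  # skip blanks and anything already emitted
        seen.add(word)
        unique.append(word)
    return unique


def dedupe_file(src_path, dst_path):
    """Read a word list from src_path, write the deduplicated list to dst_path."""
    with open(src_path) as src:
        words = dedupe_lines(src)
    with open(dst_path, "w") as dst:
        for word in words:
            dst.write(word + "\n")
```

Calling dedupe_file("words.txt", "unique.txt") (both file names hypothetical) would write each distinct word once, in the order it first appeared.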