Path: utzoo!utgpu!news-server.csri.toronto.edu!cs.utexas.edu!wuarchive!zaphod.mps.ohio-state.edu!samsung!think.com!masscomp!calvin!mark From: mark@calvin..westford.ccur.com (Mark Thompson) Newsgroups: comp.graphics Subject: Re: Help : Sperate Bitmap character Message-ID: <46012@masscomp.ccur.com> Date: 27 Aug 90 18:51:49 GMT References: <1990Aug27.172757.18703@ucselx.sdsu.edu> Sender: news@masscomp.ccur.com Reply-To: mark@calvin.westford.ccur.com (Mark Thompson) Organization: Concurrent Computer Corp. Westford MA. Lines: 42 In article <1990Aug27.172757.18703@ucselx.sdsu.edu> brian@ucselx.sdsu.edu (Brian Ho) writes: > My problem is when two characters are sperated less than one blank column, > I can not distinguish/sperate the two character. (P.S. the character has > unknown size) e.g. >000000000000000000000000000000000000000000000000000000000000000000 >000111111110000000000001110000000000000000000000000000000000000000 >000111000000000000000001110000000110000001101110000000000000000000 >000110000001111111000011111110011111100001111110000000000000000000 >000110000001111011100011110000110000110001110000000000000000000111 >000111111001110001100001110001111111110001100000000000000000011110 >000110000001110001100001100001111111110001100000000000000000001111 >000110000001110001100001110001110000000001100000000000000000000011 >000110000001110001100001110000111001110001100000000000000000000000 >000111111111110001100000111110011111100001100000000000000000000000 >000111111110100000000000000000000000000000000000000000000000000000 >000000000000000000000000000000000000000000000000000000000000000000 > > >The characters "En" and "te" are eventually appears side by side with the >other character. Well provided that none of your characters exhibit "breaks" in the bitmap (ofcourse with the exception of the letter "i"), a 3x3 neighborhood operation could clear up the "te" case by rejecting 1's that are not horizontally, vertically, or diagonally connected. The "En" case however is not so easily dealt with and if the character size is allowed to vary greatly within a single word, it may be impossible. I would try looking at horizontal run lengths and sudden changes in line gradients to attempt to determine a reasonable separation point for overlapping characters. Obviously you wouldn't resort to this unless no match could be made. Another alternative is to intelligently look at the word and context and make a guess at what the character(s) are. Many non-font oriented OCRs do this. +--------------------------------------------------------------------------+ | Mark Thompson | | mark@westford.ccur.com | | ...!{decvax,uunet}!masscomp!mark Designing high performance graphics | | (508)392-2480 engines today for a better tomorrow. | +------------------------------------------------------------------------- +