Keyword selection from word recognition results using definitional overlap

1994 
A method is presented to locate a set of potential keywords for a given document in the output of a word recognition algorithm. A clustering step locates words of significant length that occur several times. A word recognition algorithm is applied to these words to generate groups of visually similar alternatives for each image. A simulated annealing algorithm is then used to choose a st of keywords that contains at most one representative from each neighborhood such that an inter-word compatibility measurement is maximized. The compatibility measure is based on the similarity of subject and the definitional overlap of two words as measured by a dictionary. Experimental results are presented that illustrate the ability of the technique to operate in the presence of noise.
    • Correction
    • Source
    • Cite
    • Save
    • Machine Reading By IdeaReader
    0
    References
    2
    Citations
    NaN
    KQI
    []