In Search of Unstructured Documents Categorization

2008 
Transferring information from one part to another of the world is the main aim of communication. Now a day, the information is available in forms of documents or files created on requirements basis. The more the requirements the large the documents are. That is why; the way of creation which is random in nature as well as storage bends the documents unstructured in nature. The result is that, dealing with these documents becomes a headache. For the ease of process, the frequently required data should maintain certain pattern. But being unfortunate enough, most of the time we have to face problems like erroneous data retrieving or modification anomalies or even a large amount of time may be given for retrieving a single document. To overcome the situation, a solution has raised named unstructured document categorization. This field is a vast one containing all kind of solutions for various type of document categorization. Basically, the documents which are unstructured in nature will be categorized based on some given constraints. And through this paper we would like to highlight the most as well as popular techniques like text and data mining, genetic algorithm, lexical chaining, binarization methods in the field of unstructured document categorization so that we can reach the fulfillment of desired unstructured document categorization.
    • Correction
    • Source
    • Cite
    • Save
    • Machine Reading By IdeaReader
    7
    References
    0
    Citations
    NaN
    KQI
    []