Intelligence Analyst Associate (IAA) - CYC Knowledge Extraction

2003 
Abstract : The objective of this project was to assess the feasibility of leveraging the capabilities and strengths of the Intelligence Analyst Associate (IAA) and the Cyc Knowledge Base (KB) in order to help alleviate the textual data overload that intelligence analysts experience. IAA has capabilities for processing large volumes of unstructured text, extracting information relevant to intelligence analysts, such as entities (people, organizations, locations, dates, and times) and simple events (subject, verbs, and objects), storing the extracted information in a structured database, and enabling the use of visualization and analysis tools. The Cyc KB is a formalized representation of a vast quantity of fundamental human knowledge (facts, rules, of thumb, and heuristics) and consists of terms and assertions which relate those terms. By leveraging the Cyc KB, significant capabilities were exploited that greatly benefited the IAA and its end users. These included the ability to represent domain dependent facts in the Cyc KB to identify, classify, and specify knowledge concerning relevant entities as well as the ability to represent rules in the KB and use the Cyc KB inference engine to allow information to be derived from identified entities and entity classifications. A 'plug-in, plug-out' system framework was developed that served as the processing framework and testbed for the information extraction components, similar to the framework used for the IAA system. One of the main goals was aimed at reducing the time it takes to run a document through the IAA-Cyc system. The prototype system developed under a previous effort processed documents at a rate of 1.5 minutes per sentence. The new system processes documents at a rate of four seconds per document in which a document is typically comprised of 30-50 sentences.
    • Correction
    • Source
    • Cite
    • Save
    • Machine Reading By IdeaReader
    1
    References
    1
    Citations
    NaN
    KQI
    []