Named Entity Recognizer for Filipino Text Using Conditional Random Field

2013 
The study of a Named Entity Recognizer for Filipino Text Using Conditional Random Field (NERF-CRF) focused on creating a system that identifies and classifies the named entities present in a given corpus. Named entities were classified into four categories: person, place, date and org. Named entities that were identified but did not fall into any of the four categories were tagged as etc. Several modules were created to achieve the study's purpose, including a tokenizer and a part-of-speech tagger. The conditional random field approach was used to classify the identified named entities. Filipino biographies served as the corpus for testing the system. The results, based on the F-measure, indicate that the system is 83% accurate overall. It performs best on date entities, with a 0% error rate, but is unsatisfactory at distinguishing place and org entities, with error rates of 42% and 33% respectively.
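
As an illustration of how a per-category F-measure can be computed, the following is a minimal sketch. It is not the study's evaluation code: the abstract does not say whether scoring was done per token or per entity, so token-level scoring is assumed here, and the function name `per_class_f_measure` and the toy labels are hypothetical.

```python
from collections import Counter


def per_class_f_measure(gold, predicted, labels):
    """Return {label: (precision, recall, f1)} for aligned tag sequences.

    Assumes token-level scoring: gold[i] and predicted[i] tag the same token.
    """
    tp, fp, fn = Counter(), Counter(), Counter()
    for g, p in zip(gold, predicted):
        if g == p:
            tp[g] += 1          # correct tag
        else:
            fp[p] += 1          # predicted label was wrong
            fn[g] += 1          # gold label was missed

    scores = {}
    for label in labels:
        predicted_total = tp[label] + fp[label]
        gold_total = tp[label] + fn[label]
        precision = tp[label] / predicted_total if predicted_total else 0.0
        recall = tp[label] / gold_total if gold_total else 0.0
        denom = precision + recall
        f1 = 2 * precision * recall / denom if denom else 0.0
        scores[label] = (precision, recall, f1)
    return scores


# Hypothetical toy data, not taken from the study's corpus.
gold = ["person", "place", "date", "org", "place", "etc"]
pred = ["person", "org", "date", "org", "place", "etc"]
scores = per_class_f_measure(gold, pred, ["person", "place", "date", "org", "etc"])
```

In this toy example one place token is mislabeled as org, which lowers recall for place and precision for org. That is the same kind of place/org confusion the abstract reports.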