EKSTRAKSI KATA KUNCI PADA DOKUMEN TUNGGAL DENGAN PHRASE CHUNKING

I Nyoman Prayana Trisna,Arif Nurwidyantoro, S.Kom, M.Cs.

EKSTRAKSI KATA KUNCI PADA DOKUMEN TUNGGAL DENGAN PHRASE CHUNKING

2017

Kata kunci pada suatu dokumen membantu proses komputasi di dalam dokumen tersebut. Namun, pembentukan kata kunci secara manual membutuhkan pemahaman lebih, waktu yang banyak, dan sering terdapat unsur subjektivitas dalam pemilihan kata kunci. Dengan ekstraksi kata kunci secara otomatis, maka kata kunci dapat dibentuk dengan cepat dan objektif. Salah satu metode yang dapat digunakan adalah Phrase Chunking. Metode Phrase Chunking membutuhkan pola frase yang menjadi acuan dalam ekstraksi kandidat kata kunci, sehingga sebelum proses ekstraksi dibutuhkan pembelajaran terhadap pola frase yang sering muncul dalam kata kunci. Setelah kandidat berhasil diekstrak, maka kandidat diseleksi sesuai skenario-skenario yang berdasarkan teks acuan (abstrak atau badan teks), pola frase, dan bentuk pengurutan. Selain itu, pada penelitian ini dicoba untuk mereduksi kandidat yang diekstrak. Metode Phrase Chunking pada penelitian ini juga dibandingkan dengan metode TextRank. Hasil penelitian ini memberi nilai f-measure yang lebih baik apabila dibandingkan dengan metode TextRank. Penelitian ini menghasilkan nilai f-measure tertinggi ketika teks yang digunakan adalah abstrak, dengan reduksi kandidat dan pola frase yang terdiri dari dua atau tiga kata, dan pengurutan mengunakan frekuensi kandidat. Hasil dari penelitian ini juga memperlihatkan bahwa ekstraksi di abstrak lebih baik dibandingkan badan teks. Selain itu, kandidat yang baik dihasilkan ketika kata kunci yang diekstrak berupa frase dua atau tiga kata. Bentuk pengurutan terbaik ketika kandidat diurutkan berdasarkan hasil kali frekuensi kandidat dengan bobot pola frase. Namun, percobaan menggunakan reduksi kandidat ternyata tidak menghasilkan nilai f-measure yang lebih baik. Keywords in a document help the computing process in that document. However, manual assigment of keywords requires more effort to understand, take more time, and there is often an element of subjectivity in the selection of keywords. With automatic keyword extraction, keywords can be formed quickly and objectively. One of the methods that can be used is Phrase Chunking. Phrase Chunking method requires phrase patterns that become the reference in extraction of keyword candidates, so before the extraction process, learning phrase patterns that often appear in the keyword is needed. After the candidate has been successfully extracted, candidates are selected according to scenarios based on reference text, phrase patterns, and sorting. In addition, this study attempted to reduce the extracted candidate. Phrase Chunking method in this study also compared with TextRank method. The results of this study gave a better f-measure value when compared with the TextRank method. This study yielded the highest f-measure value when the text used was abstraction, with candidate reduction and phrase pattern consisting of two or three words, with sequencing using candidate frequency. The results of this study also show that the extraction in abstraction is better than the main body of the text. In addition, a good candidate is generated when the extracted is just a phrase consisting of two or three words. The best sorting form when the candidate is sorted by the frequency of the candidate with the weight of the phrase pattern. However, experiments using candidate reduction did not result in a better f-measure value.

Correction
Cite
Save
Machine Reading By IdeaReader

References

Citations