Restoring Hebrew Diacritics Without a Dictionary

2022 
We demonstrate that it is feasible to accurately diacritize Hebrew script without any human-curated resources other than plain diacritized text.We present Nakdimon, a two-layer character-level LSTM, that performs on par with much more complicated curation-dependent systems, across a diverse array of modern Hebrew sources.The model is accompanied by a training set and a test set, collected from diverse sources.
    • Correction
    • Source
    • Cite
    • Save
    • Machine Reading By IdeaReader
    0
    References
    0
    Citations
    NaN
    KQI
    []