Applying topic modeling techniques to degraded texts: Spanish historical press during the Transición (1977-1982)

2018 
Topic modeling techniques are applied in the field of Digital Humanities, specifically wit historical texts some often. However, digitizing documents often produces texts with poor readability. This is the case of the historical press, in which to the degrading of the support must be added the layout, the inclusion of advertisements, illustrations, etc. This paper describes the application of topic modeling to a specific Spanish newspaper with these difficulties; as well as the same application during the same period to another newspaper converted to text manually. The comparison of the results shows consistency between both newspapers
    • Correction
    • Source
    • Cite
    • Save
    • Machine Reading By IdeaReader
    11
    References
    0
    Citations
    NaN
    KQI
    []