An Assistant to Populate Repositories: Gathering Educational Digital Objects and Metadata Extraction

2016 
This paper presents an assistant to populate institutional repositories. This tool can detect all educational digital objects in a text format that are already published on institutional Websites and can be uploaded to a repository. This recopilation is a tedious task and is usually performed manually. In this paper, we propose a system architecture for automating this task of collecting text documents within a restricted domain in order to detect plausible documents that can be loaded into a repository. In addition, its metadata, such as language, category, title, authors, and their contact data, is automatically extracted. A prototype of this system was developed, and case studies in two different domains are analyzed.
    • Correction
    • Source
    • Cite
    • Save
    • Machine Reading By IdeaReader
    11
    References
    2
    Citations
    NaN
    KQI
    []