Machine accessibility of Open Access scientific publications from publisher systems via ResourceSync

2017 
In this poster, we outline the technical difficulties of harvesting metadata records and full-text content of millions of OA articles from publisher APIs, and present how we overcame them. We also show how we provide an interoperable layer over these data using ResourceSync. To achieve this, we have created a publisher connector, which harvests open access scientific papers from publishers and exposes the content through a standardised API. Our contribution can be summarised as: a) the creation of a seamless layer for accessing content across publishers, b) a generic, integrated access point to these data via ResourceSync, and c) a high-performance access interface that will be kept constantly updated. This is the first service to provide a harmonised access layer over non-standardised publisher APIs for retrieving gold and hybrid gold scholarly content, as well as the first implementation of ResourceSync to scale to millions of documents, with the potential for fast, near-real-time updates.
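To illustrate what machine access through ResourceSync can look like on the client side, the following is a minimal sketch of reading a ResourceSync Resource List, which is a Sitemap XML document extended with `rs:md` metadata elements. It is not the connector described above. The sample document, the `example.org` URLs, and the helper names `parse_resource_list` and `changed_since` are assumptions made for illustration only.

```python
import xml.etree.ElementTree as ET

# Namespaces used by ResourceSync documents (Sitemap plus ResourceSync terms).
NS = {
    "sm": "http://www.sitemaps.org/schemas/sitemap/0.9",
    "rs": "http://www.openarchives.org/rs/terms/",
}

# Hypothetical Resource List; a real one would be fetched from the source.
SAMPLE_RESOURCE_LIST = """<urlset xmlns="http://www.sitemaps.org/schemas/sitemap/0.9"
        xmlns:rs="http://www.openarchives.org/rs/terms/">
  <rs:md capability="resourcelist" at="2017-01-01T00:00:00Z"/>
  <url>
    <loc>https://example.org/articles/1.pdf</loc>
    <lastmod>2016-12-30T10:00:00Z</lastmod>
    <rs:md type="application/pdf" length="12345"/>
  </url>
  <url>
    <loc>https://example.org/articles/1.xml</loc>
    <lastmod>2016-12-31T09:30:00Z</lastmod>
    <rs:md type="application/xml" length="678"/>
  </url>
</urlset>
"""


def parse_resource_list(xml_text):
    """Return one dict per <url> entry in a Resource List."""
    root = ET.fromstring(xml_text)
    resources = []
    for url in root.findall("sm:url", NS):
        md = url.find("rs:md", NS)
        length = md.get("length") if md is not None else None
        resources.append({
            "loc": url.findtext("sm:loc", namespaces=NS),
            "lastmod": url.findtext("sm:lastmod", namespaces=NS),
            "type": md.get("type") if md is not None else None,
            "length": int(length) if length else None,
        })
    return resources


def changed_since(resources, timestamp):
    """Keep resources modified after `timestamp`.

    Assumes all timestamps share the same UTC format (YYYY-MM-DDThh:mm:ssZ),
    so plain string comparison orders them correctly.
    """
    return [r for r in resources if r["lastmod"] and r["lastmod"] > timestamp]


if __name__ == "__main__":
    for resource in changed_since(parse_resource_list(SAMPLE_RESOURCE_LIST),
                                  "2016-12-31T00:00:00Z"):
        print(resource["loc"], resource["type"])
```

A harvester built along these lines would compare each entry's `lastmod` against its last synchronisation time and download only the resources that changed, which is what makes incremental updates over a large collection feasible.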