When the Shanghai Ranking, also known as the Academic Ranking of World Universities (ARWU), was launched in 2003, Ton van Raan, director of CWTS at the time, sounded the alarm about the problematic way in which the ranking uses bibliometric data, for instance in attributing publications to universities.
Despite growing interest in Open Access (OA) to scholarly literature, there is an unmet need for large-scale, up-to-date, and reproducible studies assessing the prevalence and characteristics of OA. We address this need using oaDOI, an open online service that determines OA status for 67 million articles. We use three samples, each of 100,000 articles, to investigate OA in three populations: 1) all journal articles assigned a Crossref DOI, 2) recent journal articles indexed in Web of Science, and 3) articles viewed by users of Unpaywall, an open-source browser extension that lets users find OA articles using oaDOI. We estimate that at least 28% of the scholarly literature is OA (19M in total) and that this proportion is growing, driven particularly by growth in Gold and Hybrid. The most recent year analyzed (2015) also has the highest percentage of OA (45%). Because of this growth, and the fact that readers disproportionately access newer articles, we find that Unpaywall users encounter OA quite frequently: 47% of articles they view are OA. Notably, the most common mechanism for OA is not Gold, Green, or Hybrid OA, but rather an under-discussed category we dub Bronze: articles made free-to-read on the publisher website, without an explicit Open license. We also examine the citation impact of OA articles, corroborating the so-called open-access citation advantage: accounting for age and discipline, OA articles receive 18% more citations than average, an effect driven primarily by Green and Hybrid OA. We encourage further research using the free oaDOI service, as a way to inform OA policy and practice.
As scholars migrate into online spaces like Mendeley, blogs, Twitter, and more, they leave new traces of once-invisible interactions like reading, saving, discussing, and recommending. Observing these traces can inform new metrics of scholarly influence and impact -- so-called altmetrics. Stakeholders in academia are beginning to discuss how and where altmetrics can be useful towards evaluating a researcher's academic contribution. As this interest grows, libraries are in a unique position to help support an informed dialog on campus. We suggest that librarians can provide this support in three main ways: informing emerging conversations with the latest research, supporting experimentation with emerging altmetrics tools, and engaging in early altmetrics education and outreach. We include examples and lists of resources to help librarians fill these roles.
Today marks the official launch of Unpaywall, a web browser extension that links users directly to free full-text versions of research articles. Heather Piwowar and Jason Priem of Impactstory, the team behind Unpaywall, report on the successful pre-release phase, and explain how two decades of investment, a slew of new tools, and a flurry of new government mandates have helped build a powerful momentum behind green open access.
This is an open letter concerning the recent launch of the new open access journal, Science Advances. In addition to the welcome diversification in journal choices for authors looking for open access venues, there are many positive aspects of Science Advances: its broad STEM scope, its interest in cross-disciplinary research, and the offering of fee waivers. While we welcome the commitment of the Association to open access, we are also deeply concerned with the specific approach. Herein, we outline a number of suggestions that are in line with both the current direction that scholarly publishing is taking and the needs expressed by the open access community, which this journal aims to serve. The first of these issues concerns the licensing terms of the journal articles. The default choice of a non-commercial licence (CC BY-NC) places unnecessary restrictions on reuse and does not meet the standards set out by the Budapest Open Access Initiative. Many large funders, including Research Councils UK and the Wellcome Trust, do not recognise this as an open license. The adoption of CC BY-NC as the default license means that many researchers will be unable to submit to Science Advances if they are to conform to their funder mandates unless they pay for the upgrade to CC BY. There is little evidence that non-commercial restrictions provide a benefit to the progress of scholarly research, yet they have significant negative impact, limiting the ability to reuse material for educational purposes and advocacy. For example, NC-encumbered materials cannot be used on Wikipedia. The non-commercial clause is known to generate ambiguities and uncertainties (see for example, NC Licenses Considered Harmful) to the detriment of scholarly communication. Additionally, there is little robust evidence to suggest that adopting a CC-BY license will lead to income loss for your Association, and the $1,000 surcharge is difficult to justify or defend. The value of the CC BY license is outlined in detail by the Open Access Scholarly Publishers Association. We raise an additional issue with the $1,500 surcharge for articles more than 10 pages in length. In an online-only format, page length is an arbitrary unit that results from the article being read in PDF format. Can the AAAS explain what the additional costs associated with the increased length are that would warrant a 50% increase in APC for an unspecified number of additional digital pages? Other leading open access journals, such as PeerJ, the BMC series, and PLOS ONE, offer publication of articles with unlimited page lengths. The extra costs create constraints that may adversely incentivize authors to exclude important details of their study, preventing replication and hindering transparency, all of which are contrary to the aims of scholarly publication. Therefore it seems counterproductive to impose this additional charge; it discriminates against researchers' best effort to communicate their findings with as much detail as necessary. We feel that the proposed APCs and licencing scheme are detrimental to the AAAS and the global academic community. As such, we recommend that Science Advances: 1. Offers CC BY as standard for no additional cost, in line with leading open access publishers, so authors are able to comply with respective funding mandates; 2. Provides a transparent calculation of its APCs based on the publishing practices of the AAAS and explains how additional value created by the journal will measure against the significantly high prices paid by the authors; 3. Removes the surcharges associated with increased page number; 4. Releases all data files under CC0 (with CC BY optional), which has emerged as the community standard for data and is used by leading databases such as Figshare and DataDryad. We hope that you will consider the points raised above, keeping in mind how best to serve the scientific community, and use Science Advances to add the AAAS to the group of progressive and innovative open access scholarly publishers. We hope AAAS will collaborate with the academic community to facilitate the dissemination of scientific knowledge through a journal committed to fully embracing the principles of Open Access. We kindly request that you allow your response(s) to be made public along with this letter, and look forward to hearing your response soon. (Please note that the views expressed here represent those of the individuals and not the institutions or organization with which they are affiliated)
This is the old, deprecated author data for OpenAlex. In July, 2023, the OpenAlex dataset switched to a new author name disambiguation (V3), and deprecated all old author IDs. This is a data dump of those old author IDs. - Author metadata are included in `authors/**/*.gz` files. - Affiliations---mapping of OpenAlex work IDs to (old) Author IDs---are in the `affiliations_export_20230719T1259/*.gz` files. The files are all gzipped JSON-lines files. See the OpenAlex documentation for more information about OpenAlex and how you can use these files.
This is the old, deprecated author data for OpenAlex. In July, 2023, the OpenAlex dataset switched to a new author name disambiguation (V3), and deprecated all old author IDs. This is a data dump of those old author IDs. - Author metadata are included in `authors/**/*.gz` files. - Affiliations---mapping of OpenAlex work IDs to (old) Author IDs---are in the `affiliations_export_20230719T1259/*.gz` files. The files are all gzipped JSON-lines files. See the OpenAlex documentation for more information about OpenAlex and how you can use these files.
Author name disambiguation V3 initial clusters for the OpenAlex dataset. See https://openalex.org There are 633803287 rows, split into 4 CSV (comma-delimited) files (with headers). The CSV files have two columns: "work_author_id" and "author_id" "work_author_id": An OpenAlex Work ID and an author sequence number, joined with an underscore ("_") "author_id": An OpenAlex Author ID, representing a unique author in OpenAlex