in Europe and established by Denis de Sallo.
The two main ways in which OA affects academia are (i) through association with a higher documented impact of scholarly articles, as a result of availability and re-use, and (ii) through the possibility of non-restrictively allowing researchers to use automated tools to mine the scholarly literature. For the former, major arguments in favor of OA include the evidence that work that is openly available generates more academic citations, but also has more societal impact. In addition, appropriately-licensed OA works play a major role in academic education, including re-use in classes and for dissertations. The latter major argument involves non-restrictive access to the scholarly literature through appropriate licensing, making it possible to use automated tools to collect and analyze the entire body of scholarly literature in a legally sound framework and irrespective of copyright laws. The following sections cover these two effects of OA.
Academic impact. Academic impact is frequently measured through citation counts, and these remain fundamental as the ‘currency units’ for researchers, research groups, institutes and universities. Lawrence (2001) was the first to propose that OA would have a citation advantage. The utility and consistency of the citation advantage across different research fields has been intensively debated because its magnitude substantially varies depending on the discipline ( Table 2 ). However, the general tendency identified by studies to date indicates that there is at least some association between OA publishing and increased citation counts across most disciplines ( Hajjem et al. , 2006 ; Antelman, 2004 ) ( Figure 2 and Table 2 ). A comprehensive and annotated bibliography of studies documenting potential citation impacts was created by Steve Hitchcock ( eprints.soton.ac.uk/354006/1/oacitation-biblio-snapshot0613.html ) and has been managed by SPARC Europe since 2013 ( sparceurope.org/oaca/ ).
The majority concluded that there is a significant citation advantage for Open Access articles. Source: Data from The Open Access Citation Advantage Service, SPARC Europe, accessed March 2016.
R eference | D iscipline | C itation advantage | O rigin |
---|---|---|---|
Mathematics, Electrical Engineering, Political Science, Philosophy | +91%, +51%, +86%, +45% per discipline respectively | NA | |
Political Science | Statistically significant citation advantage | NA | |
Medicine, Biology, Agricultural Sciences, Chemistry and University Journals | +200% | NA | |
Mathematics | +35% | Quality advantage, no evidence of early advantage | |
(2008) | Physiology | -5% | NA |
Sciences, Social Sciences, and Humanities | +1% but statistically indistinguishable | No evidence of an early advantage | |
All | +8% for newly published articles; +16% for citations from developing countries | NA | |
Natural Sciences | +210 up to +290% | NA | |
Biology, Mathematics, Pharmacy and Pharmacology | No clear tendency towards an increase in impact | NA | |
(2010) | Engineering, Biology, Biomedicine, Chemistry, Psychology, Mathematics, Clinical Medicine, Health, Physics, Social Science, Earth Sciences | +?% to ?% depending on the discipline | Quality advantage is confirmed no evidence for selection bias |
Biology | No evidence of citation advantage | NA | |
(2010) | High Energy Physics | +200% | Early advantage confirmed |
(2006) | Biology, Psychology, Sociology, Health, Political Science, Economics, Education, Law, Business, Management | +36% to 172% | NA |
Physics | +250% to 580% | NA | |
(2006) | Astronomy and Physics | +200% | NA |
Agricultural Science | +621% but not to every journal | NA | |
(2005) | Astronomy | None | Selection bias and early advantage |
Astronomy | +200% | Early advantage confirmed | |
Opthalmology | No | NA | |
Computer Science | +157% up to +284% for top publication | NA | |
Ecology, Botany, Multidisciplinary Science and Biology | +8% | NA | |
Natural Sciences | 0-+50% in 2003 depending on field, negative citation advantage in 2000 | NA | |
Astronomy | +200% | NA | |
Solar Physics | +170% and +260% depending on the online repository | No evidence for selection bias | |
Condensed Matter Physics | NA | Confirm early access advantage and selection bias but no OA effect | |
(2008) | Ecology, Applied Mathematics, Sociology and Economics | +157% | NA |
(2005) | Medicine | +300% up to +450% | NA |
| Astronomy | +200% | Early advantage |
Environmental Science | Not significant | NA | |
(2015) | All | +111% up to 152% | NA |
| Economics | +35% up to 64% depending on the database used | NA |
(2011) | Humanities, Life Sciences, Mathematics & Physical Science, Medicine, Social Sciences | -49.24%-+87.73% | NA |
Communication Studies | +200% | NA |
Estimates for the open citation advantage range from +36% (Biology) to +600% (Agricultural Sciences) ( Swan, 2010 ; Wagner, 2010 ). In a longitudinal study, Eysenbach (2006) compared the bibliometric impact of a cohort of articles from a multi-disciplinary journal ( Proceedings of the National Academy of Sciences ) that offers both OA and non-OA publishing options. After adjusting for potentially confounding variables, the results indicated that non-OA papers were twice as likely to remain uncited six months after publication when compared to OA articles. Additionally, the average number of citations for OA articles was more than double than that of the non-OA articles. The study also differentiated the type of OA article, namely the self-archived (i.e., Green OA) and the publisher version of record (VOR) that is freely available (i.e., Gold OA). Gold OA was found to have a higher overall academic impact than Green OA.
Despite strong evidence for a citation advantage, the magnitude of this advantage remains variable. The substantial heterogeneity in observed citation advantages can be due to different academic cultures or could simply be spurious. For example, self-archiving prior to publication is a community standard in fields such as high energy physics or mathematics, but has yet to be widely adopted among the life sciences. Such ‘pre-prints’ have also been associated with an overall increase in the average number of citations, the total number of citations, and the speed of citation accumulation ( Aman, 2014 ; Gentil-Beccot et al. , 2010 ). Other studies could only replicate immense citation advantages (+600%) if relevant predictors were omitted ( McCabe & Snyder, 2014 ), which indicates a potential spurious effect. When taking into account these relevant predictors, the citation advantage became much smaller (+8%). When the citation advantage is low or non-existent, this could suggest that in those research fields there is a sufficient level of access to the literature such that OA confers no localised access advantage, or that adoption of OA has not yet reached a level where any such advantage has become statistically evident.
One alternative explanation for the existence of citation advantages could be that researchers choose to publish OA when a finding is more impactful, but empirical evidence contradicts this selection effect. Gargouri et al. (2010) compared citation counts for articles which were self-selected as OA or mandated as OA (e.g., by funders). The study concluded that both were cited significantly more than non-OA articles and showed no differences in citation rates. As such, these findings rule out a selection bias from authors as the cause for the citation advantage ( Gargouri et al. , 2010 ). However, research that is selected to merit funding by funding agencies may, in itself, be perceived to be more impactful than research that is not funded. Additionally, as no single OA mandate is ever 100% effective, it might be the simple case that authors are more likely to comply with a mandate for the research they perceive to be of higher impact. In a study of articles in the field of psychology, Anderson (2013a) found that publications with funding sources reported in the text were found to be more highly cited and connected to other highly-cited publications (this type of publication is called "generative" in the study) than publications with no reported funding sources. Furthermore, research that was privately funded was found to be more generative than publicly funded research. In a similar study in the Library and Information Sciences field done by Zhao (2010) , the citation counts for grant-funded publications were "substantially higher" than publications without grant funding. Although these studies indicate that grant funding is correlated with increased citation rates, the openness of articles was not addressed in either study. Future research will be required to demarcate the potential causality and to determine the conditions under which we could see whether or not OA has an effect on citation counts. For example, this could be conducted through a randomised controlled trial in which research articles from a particular funder are randomly assigned to OA and non-OA routes, with the citation counts assessed after a certain time.
In sum, evidence indicates that OA is broadly related to increased academic impact in terms of citations ( Figure 2 ; see also McKiernan et al. (2016) ), but given the large variability in results, further research should aim to synthesize these findings in a meta-analysis and try to explain the cause of this variability.
Broader societal impact. Scholarly articles also have a societal impact, such as when they are covered in news media or are discussed in social media channels; alternative metrics, or altmetrics, can be used as a guide to measure this mode of impact ( Liang et al. , 2014 ). Information such as social media usage, Mendeley readership, and media attention ( Piwowar, 2013 ) can be tracked by various altmetrics providers (e.g. ImpactStory, Plum Analytics, and Altmetric.com). As such, when an article generates discussions outside of the academic literature, altmetrics are capable of tracking this. Despite limitations (such as academics discussing their own research on platforms like Twitter), altmetrics provide a general view of the wider societal impact of research articles. Considering the increased pressure on researchers and research institutes to communicate research findings to the public, altmetrics can provide additional insight into which research drives public interest. A working group established by NISO is investigating the future role of altmetrics in research communication and assessment ( www.niso.org/topics/tl/altmetrics_initiative/ ).
OA articles would be expected to have an altmetrics advantage compared to the non-OA literature; if an article has fewer restrictions for journalists, citizens, businesses, and policy-makers, it seems logical that this would enable the research to be publicly re-used. Furthermore, those parties may be more likely to promote articles which are publicly accessible into different communication channels. In other words, increased access removes barriers to widespread societal engagement, whereas a relative lack of article access discourages engagement.
There is research showing evidence for an altmetrics advantage for OA articles, but this does not reflect itself in the most impactful articles. Wang et al. (2015) found evidence that OA articles receive more attention through social media. The authors compared social media attention (Twitter and Facebook) between OA and non-OA articles at Nature Communications and found that OA articles get 1.2–1.48 times as much social media attention as compared to non-OA articles (see also Adie, 2014 ). Nonetheless, of the top 100 articles of 2015 as presented by Altmetric.com, only 42 articles were OA ( www.altmetric.com/top100/2015/ ). This 42% is larger than the overall proportion of OA articles in the literature, which indicates that OA contributes relatively more impact per paper. However, it also indicates that the open impact advantage can be overshadowed by the intrinsic nature of the research published or by the traditionally prestigious journals with a larger and dedicated media apparatus (e.g., Nature, Science; Brembs et al. , 2013 ).
Allen et al. (2013) found that a social media announcement of the release of a research article increases the number of users who view or download that article, but does not translate to increases in the citation count in the field of clinical pain research. Costas et al. (2015) found a relatively weak correlation between social media activity and citation counts for the articles in their sample (over 1.5 million article records), while Mohammadi et al. (2015) found that the number of Mendeley readers with a status of graduate student or faculty correlated with citation counts. When OA to the articles is factored into an analysis, there is a potential recursive relation between citation counts and altmetrics due to OA. Eysenbach (2011) indicated that there is a moderate correlation (0.42–0.72) between the tweets and citations of articles from an OA journal ( Journal of Medical Internet Research ). Highly tweeted articles were eleven times more likely to be highly cited than less-tweeted articles, or vice versa (75% of highly tweeted articles were highly cited; 7% of less-tweeted articles were highly cited). However, it is difficult to assess causality in these cases: do research papers that have more academic impact make their way more frequently into societal discussions, or does online discourse increase their potential citation rates? Overall, this evidence implies that there is a general media advantage with OA (see also McKiernan et al. (2016) ), which can be used as a proxy or pathway to indicate greater societal impact.
Altmetrics themselves should not be conflated with citations when it comes to assessing impact, even though some providers such as Altmetric.com supply a single score that can be used to rank an article in a similar way to a journal’s Impact Factor. Each measure of altmetrics tells a different story about the impact of research, and a careful understanding of the altmetrics landscape in conjunction with citation-based metrics can lead to a clearer picture of societal impact of scientific research.
Traditionally, in order to publish a paper, researchers hand over their copyright via a Copyright Transfer Agreement. Copyright transfer as the default has far-reaching consequences on the ability of both the authors and others to re-use that published research, and many authors are not aware of the impact of these transfers on their ownership of the work. Academics frequently give the copyright to the publishers in exchange for the perceived prestige of publishing in one of their venues (e.g., Müller-Langer & Watt, 2010 ). In some cases, institutes adopt rights-retention OA policies that grant authors non-exclusive rights to their institutes before signing copyright agreements with publishers, which enables them to make articles OA without requiring permission from publishers ( cyber.law.harvard.edu/hoap/Good_practices_for_university_open-access_policies ). Essentially, copyright is a pre-digital tool wielded by traditional publishers to maintain revenues rather than fostering creativity, innovation, or protecting authors ( Okerson, 1991 ; Willinsky, 2002 ). For example, the Author’s Guild sued Google Books for copyright infringement because they provided freely available digital copies; the court rejected this suit in 2016, stating that Google Books served the public interest and that copyright’s "primary intended beneficiary is the public" ( EFF, 2015 ). In the digital age, copying is essential to perform necessary research tasks. These activities range from viewing the article (i.e., downloading requires copying) to re-using figures from an article in a book. The interaction of OA and copyright is complex and deserves extended research in itself (e.g., Scheufen, 2015 ). We will highlight how OA views copyright and relate this to its effects on text- and data-mining (TDM).
The majority of ‘born OA’ journals and publishers do not request or receive copyright from authors. Instead, publishers are granted non-exclusive rights to publish, and copyright is retained by authors through a Creative Commons license (typically CC-BY). Importantly, this represents a power shift from publisher-owned to author-owned rights to research. This model of author-retained copyright appears to be favoured by the majority (71%) of the research community ( Hoorn & van der Graaf, 2006 ). Shifting copyright to stay with the author, combined with appropriate open licensing, allows for wider re-use, including TDM, and forms the basis for a robust scholarly ecosystem.
As such, copyright in OA publications is non-restrictive and also allows machines to freely access it. In traditional publishing, human reading and computer reading are seen as two separate things which require different agreements, whereas OA publishing views them both in the same, non-restrictive manner. In other words, in order to mine OA journals, one only needs the technical skills to do so. In order to mine closed access journals, one needs to sign or negotiate access conditions, even if legitimate access to the articles has already been bought ( Bloudoff-Indelicato, 2015 ).
Automated extraction of information from scholarly research via TDM is a methodology that can be applied to investigate the scholarly literature at an enormous scale, creating new knowledge by combining individual findings. This has already proven to be useful for a large variety of applications (e.g., Glenisson et al. , 2005 ; Martone et al. , 2016 ; Swanson, 1987 ). Moreover, OA publishers facilitate TDM on a massive scale by allowing multiple options for collecting the literature needed. For example, PLOS is non-restrictive and allows users to scrape articles directly from the website or using its API. As a result, scraping tools can be used, such as rplos , an R package developed to search and download full-text scholarly papers ( Chamberlain et al. , 2016 ).
TDM is not only a knowledge-generation tool; it also allows for automated screening for errors and automated literature searches that renew scientific discovery. With TDM it becomes possible to easily compare one’s results with those of the published literature, identify convergence of evidence and enable knowledge discovery ( Natarajan et al. , 2006 ) or discover frequent tentative hypotheses that can be used for new research ( Malhotra et al. , 2013 ). It has already been used to make major advances in fields such as biomedicine ( Gonzalez et al. , 2016 ). TDM also allows for computer applications that can download all scholarly literature given certain search terms (e.g., ContentMine’s getpapers tool ; github.com/ContentMine/getpapers ), simplifying and shortening the tedious literature search. TDM can also serve a screening purpose similar to plagiarism scanners, helping to detect statistical errors in the scholarly literature (e.g., Nuijten et al. (2015) ). TDM can be used in various innovative ways and is an emerging and rapidly advancing field; non-restrictive licensing through OA certainly promotes its wider application.
Given the exponential increase in the number of scholarly publications, (semi-)automated methods to synthesize results have become increasingly important. TDM decreases the time dedicated to the search for relevant information in scholarly literature by categorizing information ( Leitner & Valencia, 2008 ), highlighting and annotating relevant results to specific users ( Shatkay et al. , 2008 ), and profiling research ( Porter et al. , 2002 ). Furthermore, TDM also prevents researchers and readers from wasting time on reinventing the wheel, simply because one can no longer keep up with the huge amount of published literature available ( Harmston et al. , 2010 ).
Because of traditional copyright transfers, TDM has often been stymied by traditional, closed access publishers who frequently see it as a copyright infringement. Researchers using software that harvests data from online publications have been (threatened to be) cut off from accessing the articles. These researchers found themselves trapped in negotiations to resume their research (even though their universities had paid subscription fees for access (e.g., Bloudoff-Indelicato, 2015 ; Van Noorden, 2012 )). Standard subscriptions do not permit systematic downloads because publishers fear that their content might be stolen and their revenue therefore lost ( Van Noorden, 2012 ). In 2014, Elsevier opened its papers for TDM via a proprietary API ( Van Noorden, 2014 ), but placed restrictions on the researchers using it; however, researchers are not legally required to comply with these restrictions in some countries (e.g., U.K., U.S.A., Handke et al. , 2015 ).
To make the enormous corpus of closed access papers retrospectively available to the public might be possible through legal action at an institutional or governmental level. The Dutch Government, for example, has recognized OA as a right, with Dutch citizens capable to make their scientific publications free to access after a ‘reasonable period of time’ ( Open Access NL, 2015 ). Such steps are further supported by Shavell (2010) and Eger & Scheufen (2012) who ascertained that transition towards an OA model could not be smooth without first undertaking the necessary legislative steps. The position of institutes regarding copyright transfer remains generally unclear. While academics themselves may have little power in broader debates regarding copyright, institutes could claim ownership of the work by invoking their rights under the work made-for-hire doctrine ( Denicola, 2006 ). However, OA policies at universities generally use a system of non-exclusive rights, presupposing that faculty are the owners of their work and can grant non-exclusive rights to the university for use (for examples of approaches and language used when drafting OA policies, see ( Shieber & Suber, 2016 )). Importantly, this means that OA through the ‘Green’ route does not always depend on permission from publishers, and increasingly is becoming dependent on rights retention by authors, through carefully-drafted and widely-supported university policies. While these are positive steps towards making research available for TDM, in light of the potential copyright problems for closed access articles and the fact that not all research is available through institutional Open Access policies, TDM will be easier and legally safer for OA journals. As a consequence, TDM is likely to be more readily applied to OA literature when compared to closed access literature.
The effect on publishers.
Any publisher has to cover operating costs, which are primarily made of (i) article processing charges (APCs), (ii) management and investment costs, and (iii) other costs. Article processing includes editing, proofreading and typesetting, among other things. Management and investment are instead the marginal costs needed to establish and keep the journal running. Other costs include promoting the journal, hosting and infrastructural services, sponsoring conferences, and other services that are extrinsic to research articles themselves. The average production cost for a single research article is estimated to be around $3500–$4000 ( Van Noorden, 2013 ) but these costs are highly depending on the publisher. For example, Philip Campbell (Editor-in-Chief of Nature ) stated that his journal’s internal costs were at $20,000–$30,000 per paper ( Van Noorden, 2013 ), due in part to the high selectivity and rejection rate at Nature (i.e., this is an average cost per published paper, and not the production costs associated with publishing a single accepted paper). However, these are at the high end of the cost spectrum, with other journals, such as the Journal of Machine Learning Research (JMLR) costing between $6.50–$10 per article ( blogs.harvard.edu/pamphlet/2012/03/06/anefficient-journal/ ). Other publishers are completely transparent about their direct and indirect production costs, such as Ubiquity Press, which levies an APCs of $500 ( ubiquitypress.com/site/publish/ ). One possible reason for such variation between journals and publishers is that it is generally unclear whether proposed costs relate to those directly involved in article processing or those required in order for a publisher to ‘break even’ if they receive zero subscription income for an article made OA.
In order to cover those costs and make a profit, closed access publishers charge for access via subscriptions, whereas many OA publishers or journals charge to publish. Due to increased subscription costs, closed access publishing is becoming an increasingly unsustainable business model ( Odlyzko, 2013 ) with prices estimated to have outpaced inflation at 250% in the past thirty years ( www.eff.org/issues/open-access ). This will slowly but surely diminish the scope of access to the scholarly literature as fewer organisations are able to pay such high costs. Only recently has any transparency into the detailed costs of subscriptions been gained by using Freedom of Information Requests to bypass non-disclosure agreements between libraries and publishers ( Bergstrom et al. , 2014 ; Lawson & Meghreblian, 2015 ). These requests provide the basis for understanding the economics of scholarly communication. For example, Bergstrom et al. (2014) found that commercial publishers, including Emerald, Sage, and Taylor and Francis, have prices of ten times the amount of non-profit publishers per citation for PhD-granting institutions. Two potential ways to prevent future retention of an unsustainable model is through decreasing the subscription prices, thereby lowering publishers’ profit margins and the financial burden on subscribers, or through switching to new OA-oriented business models and creating new value. Either way, price transparency will be essential for future bargaining efforts between academic libraries and publishers, and will be of interest to those involved in public policy and scholarly publishing. The concept of transitioning from a subscription-based model to one driven by APCs will be financially appealing to journals that operate with minimal profits or at a loss, and can be a pathway to achieve financial security and long-term journal sustainability. As such, increasing revenues is a strong incentive for OA ( osc.hul.harvard.edu/programs/journal-flipping/public-consultation/4/6/ , accessed 26/04/2016).
OA publishing has become associated with a ‘pay-to-publish’ model, whereas around 70% of peer-reviewed OA journals do not charge APCs, according to data from the Directory of Open Access Journals (DOAJ) (see blogs.harvard.edu/pamphlet/2009/05/29/what-percentage-of-open-access-journals-charge-publication-fees/ and citesandinsights.info/civ16i4.pdf ). However, approximately 50% of all articles published in peer-reviewed OA journals are published in APC-based venues ( Crawford, 2015 ; Laakso & Björk, 2012 ; Walters & Linvill, 2011 ). Authors paying to publish can be viewed as a fundamental conflict of interest for researchers. Nonetheless, this payment model has proven itself to function properly when editorial decisions are separated from the business-side of the publisher (i.e., editorial independence), removing the problem of ‘publication-bribery’. Additionally, many journals have always levied charges for to cover the costs of publishing regardless of OA; for example, PNAS charges $1225 per regular research article (with an additional $1350 for OA; pnas.org/site/authors/fees.xhtml ), and Cell charges $1000 for the first colour figure and $275 for each subsequent one ( cell.com/cell/authors ; as of April 2016). Therefore, equating OA with ‘pay-to-publish’ is actually a bit of a misnomer, as several closed journals charge to publish and many open journals do not. Furthermore, many publishers (e.g., PLOS , PeerJ ), as well as many learned societies, operate fee waiver schemes for researchers unable to obtain funds to cover publication fees.
For those OA publishers implementing a pay-to-publish model, around 68.8% of publishers offer fee waivers to low- and middle-income countries ( Lawson, 2015 ), while other journals offer fee discounts often given in lieu of total fee waivers. Solomon & Björk (2012) investigated the sources of funding used by authors for APCs, indicating that these are highly variable across academic disciplines. For example, while 45.5% of authors in Health Sciences, Biology and Life Sciences use grant or contract funding as source for APCs, only 10.4% use this in Business and Economics, with 45.8% coming from personal funds. Other sources include national funding bodies, and discretionary funds administered by institutions, as well as institutional funds specifically in place to support OA policies (see also Dallmeier-Tiessen et al. , 2011 ). Sources for APCs are also highly variable depending on the per capital GNP of the authors’ country, as well as the size of the APC ( Solomon & Björk, 2012 ). According to MacKie-Mason (2016) , one potential outcome of authors seeing the price of APCs and securing funding for them is that authors may begin to take the price of APCs into account (in addition to other factors such as prestige and topic) when selecting a journal for their research output, which may drive market competition and could as a consequence lower the price of APCs. However, a potential negative consequence of an increasingly APC-driven model of OA is that some researchers may struggle to procure funds in order to publish and conform to mandates at different levels. This might impact early-career researchers and those working in fields were research grants and publishing fees are more difficult to obtain.
Subscription-based publishers still frequently produce print versions of their journals, which increases production costs, potentially to justify charging for readership or to satisfy a small demographic who prefers this mode of reading. After all, subscriptions to print journals make sense and, if large-scale printing is still in place, simply transferring this idea to the digital versions creates continuity. Print versions are accompanied by logistical costs to print and ship each issue, but these are partially offset with reprint orders, additional charges for colour figures, and print-based advertising. For some of the largest subscription-oriented publishers the annual net profit on investment reaches up to 40 percent, which makes academic journal publishing highly lucrative for investors ( Satyanarayana, 2013 ), further increases investment to sustain this type of publishing model, and allows maintenance of an oligopoly ( Larivière et al. , 2015 ).
OA publishers only publish digitally and have opened up avenues for innovation. For example, PeerJ has introduced a wholly different OA business model, where readers pay nothing to access articles, but authors pay a membership fee once to publish for a lifetime. The Open Library of Humanities (OLH) is another innovative business model in which libraries pay a small fee to support OLH and scholars are able to publish for free (subscription for publishing rather than subscription for access); this support also enables the OLH to help journals transition from a subscription model to OA (for example, the recent case of Lingua ; timeshighereducation.com/research-intelligence/open-library-humanities-aims-flip-journals-open-access ). Library publishing has also developed in response to the OA movement; in this model, academic libraries begin publishing operations in the interest of providing added value to their patrons and contributing to the growth of knowledge ( librarypublishing.org ). In terms of innovating in the publishing platform itself, eLife have introduced the Lens as a novel way of viewing research articles online ( lens.elifesciences.org/about/ ), and F1000Research has introduced so-called ‘living figures’ to enable researchers to interact with data underlying research findings (e.g., Colomb & Brembs, 2015 ).
With this innovation comes massive scope for reducing the costs associated with publishing through implementing more efficient procedures. In this case, costs are reduced by eliminating the need for type-setting and copy-editing, with web-hosting costing only $15/year, and a total operating cost of between $6.50–$10.50 per article. Other platforms such as ARPHA offer an end-to-end XML-based publishing service, utilised by publishers including Pensoft, with a more efficient and integrated publishing workflow, which should highlight and reduce the real costs of publishing. In addition, OA has the potential to increase the speed of publication, as seen in journals like eLife and PeerJ ( blog.dhimmel.com/plos-and-publishing-delays/ ), which combined with ‘pre-print’ servers like biorXiv and platforms that offer post-publication peer review like Research Ideas and Outcomes ( riojournal.com/ ), F1000 Research , and ScienceOpen ( www.scienceopen.com/ ), can exceptionally accelerate the speed of research communication. Such innovations add value to the research communication process (contrary to services such as paying to print colour figures) and represent just several cases of innovation across the publishing ecosystem. One can imagine that publishing costs in OA journals become dependent on the value added on a per-article basis, which can help reshape and improve scholarly communication. As such, making publication costs dependent on the value added aligns the interests of publishers with those of scholars, where improving the quality of the process of scholarly communication is the end goal. The motivation behind this could come from the currently available data, which suggest that hybrid publishing options offered by traditional publishers, while being of higher cost due to supposed prestige, provide a much lower overall quality publishing process ( blog.wellcome.ac.uk/2016/03/23/wellcome-trust-and-coaf-open-access-spend-2014-15/ ). It is noteworthy that in spite of the higher costs of hybrid publishing compared to ‘pure’ or ‘born’ OA publishing, some reports, such as the highly influential and somewhat controversial Finch Report in the UK ( www.researchinfonet.org/publish/finch/ ), favoured the former model and high-priced Gold OA over a Green model.
The implementation of OA models has implications beyond the publishing industry in terms of economics. Research funding comes from multiple sources, including national funding agencies and industries, as well as private funders. Much primary research actually takes place outside of academia, inside R&D departments; if R&D in the private sector can access more research findings, this will ultimately benefit the public interest as well. A report from 2004 by Arzberger and colleagues into the scientific, social and economic development of access to research results concluded that access should be promoted to the largest extent possible. According to this report, access to research results can only be responsibly restricted in the case of national security, privacy, or those involving IP rights of the authors ( Arzberger et al. , 2004 ). A major principle underlying this is the ownership of research results: publicly funded research and data are public goods and because they have been produced in the public interest they should be considered and maintained as such. Indeed, such a principle has become one of the focal rallying points of the global OA movement. Appropriate licensing and accessibility can influence re-use through commercialization, and can empower citizens and industry to recognize great economic benefits. This apparently resonates with many organisations, as indicated by the increased numbers of OA policies on a global basis (see Figure 3 ).
Figures are given at the beginning of each year. Source: ROARMAP, accessed March 2016.
With access to scholarly articles, entrepreneurs and small businesses can accelerate innovation and discovery, which is advantageous for advancing the ‘entrepreneurial state’ ( Mazzucato, 2011 ). Access to research results has clear advantages for a range of industries and can help stimulate regional and global economies. Increased access to research results has been associated with considerable increases of return on financial investment ( Beagrie & Houghton, 2014 ). Furthermore, OA facilitates collaborations between publishers and industrial partners to leverage the potential of structured information networks for advanced data mining projects, such as that recently announced between IBM Watson and PLOS ( Denker, 2016 ). One of the major driving forces behind the development of OA in the UK on a national level, the ‘Finch Report’, also concluded that OA was an essential source for information and innovation to the civil service, commercial sectors, small- and medium-sized enterprises (SMEs), and the general public ( www.researchinfonet.org/publish/finch/ ).
Taking UK cancer research as one high impact case study, there is substantial evidence for the economic benefit of OA. In 2011–12 prices, the total expenditure on research relating to cancer in the period of 1970–2009 was £15 billion ( Glover et al. , 2014 ). 5.9 million quality adjusted life years were gained from the prioritized interventions in 1991–2010, of which the net-monetary benefit was an estimated £124 billion (i.e., eight-fold return on investment). However, only 17% of the annual net-monetary benefit was estimated to be attributable to research performed in the UK ( Glover et al. , 2014 ), suggesting that 83% of the economic return on cancer research is drawn from research from non-UK sources. Another example is from the area of environmental impact assessments, where Vickery (2011) has shown that OA to R&D results could result in recurring gains of around €6 billion per year. As such, opening up research for global access rather than localized and restricted use has the potential to increase the economic return, as demonstrated with the cases on cancer research and environmental impact assessments.
The question of the current publication cost is difficult and confounded by estimates of the total global publishing costs and revenue. Data provided by Outsell, a consultant in Burlingame, California, suggest that the science publishing industry generated $9.4 billion in revenue in 2011 and published around 1.8 million English-language articles. This equates to an approximate average revenue per article of $5,000. A white paper produced by the Max Planck Society estimated costs at €3,800–€5,000 per paper through subscription spending, based on a total global spending of €7.6 billion across 1.5-2 million articles per year in total ( Schimmer et al. , 2015 ). Other estimates suggest that the total spending on publishing, distribution and access to research is around £25 billion per year, with an additional £34 billion spent on reading those outputs, a sum which equates to around one third of the total annual global spending on research (£175 billion; Research Information Network, 2008 ).
Such high costs are at odds with alternative estimates of the cost of OA publishing. For example, the Scientific Electronic Library Online ( SciELO ) is a pan-Latin American bibliographic database, digital library, and co-operative electronic publishing model of OA journals. It is estimated that their costs are between $70 and $600 per OA article depending on the services provided ( Brembs, 2015 ). OA now dominates the Latin American publishing landscape, with an estimated 72–85% of articles now with full text OA articles publicly available ( www.sparc.arl.org/news/open-access-latin-america-embraced-key-visibility-research-outputs ). Furthermore, in countries such as Brazil, higher quality journals are more likely to be published OA ( Neto et al. , 2016 ), implying that low-cost, high quality, and OA can all co-exist. Even more extreme estimates of the cost of OA come from Standard Analytics, who suggested the absolute minimum per-article costs of publishing could fall to between $1.36 and $1.61 with sufficient cloud-based infrastructure ( Bogich et al. , 2016 ). However, it is likely that this estimate under-emphasizes marginal costs that are beyond a per-article cost basis. However, what is clear from these analyses is that OA has the opportunity to become a cost-reducing mechanism for scholarly publishing. Open Journals System (OJS), an open source software available for anyone to use and download without charge, is another example of this. Additionally, researcher-led initiatives such as the recently launched Discrete Analysis have costs that average around $30 per article, with no cost to authors or readers, and utilise the infrastructure offered by the arXiv to keep costs low ( discreteanalysisjournal.com ).
In her article, Sutton (2011) argued that current scholarly journals are digital products and that as such they are driven by very different economic principles and social forces than their print ancestors. Based on Anderson (2013b) , the author made the case that changes in both the delivery of scientific content and in publishers’ business models was inevitable when journals moved online. Sutton (2011) considered that scientific literature is no different from other digital products with respect to distribution costs and as such it is no exception to the ‘zero is inevitable’ rule of pricing.
OA to the scholarly literature does not just benefit academics, but also has wider impacts on other domains in society. It makes research available to anyone with an Internet connection who has the ability to search and read the material. Therefore, it transcends academic affiliation and supports sustainable lifelong learning. Examples of groups who might benefit most from OA include citizen scientists, medical patients and their supporting networks, health advocates, NGOs, and those who benefit from translation and transformation (e.g., sight-impaired people). In theory, OA affects anyone who uses information, and opens up possibilities for knowledge to be used in unexpected, creative and innovative ways, far beyond the mainstream professional research.
Access to knowledge has been called a human rights issue, considering it is included in Article 27 of the United Nations Declaration of Human Rights. Willinsky (2006) has argued that " Access to knowledge is a human right that is closely associated with the ability to defend, as well as to advocate for, other rights. ". This is not only true for access to knowledge from research that could save human lives, but also, as argued by Jacques Derrida, to the right of access to philosophy and the humanities disciplines that stem from it. Derrida writes about the field of Philosophy, " No one can forbid access to it. The moment one has the desire or will for it, one has the right to it. The right is inscribed in philosophy itself " ( Derrida, 2002 ).
Society’s ability to make research publicly accessible supports the long-term interest and investment in research. Citizens support research through taxes and therefore one could argue that efforts to support public access should be a fundamental part of the research process. While OA is not a solution to all aspects of research accessibility (e.g., filtering and language barriers, connectivity barriers and disability access remain continuing issues to be addressed; cyber.law.harvard.edu/hoap/Open_Access_(the_book) ), it most certainly increases accessibility greatly and at the same time allows innovations to remove other barriers (e.g., OA articles can be freely translated to address language barriers and can be changed to different formats to accommodate screen readers). Anecdotal evidence suggests that public access to research is required from a range of public spheres ( whoneedsaccess.org/ ). Nonetheless, the fact that access to knowledge continues to be prohibited in fields like public health should be of major concern to all stakeholders engaged in academic publishing.
In addition to professional research by, for example, academics, there is the dimension of citizen science. In citizen science, the broader public participates in the research process itself and will have an increased interest in accessing previous research. Numerous projects such as Galaxy Zoo, Zooniverse, Old Weather, Fold It, Whale FM, Bat Detective, and Project Discovery, are all different initiatives in which citizens publicly and openly engage with research. These initiatives introduce new ways of knowledge creation and these groups also require thorough access to actually be able to do non-redundant research. Citizen science forms part of the societal case for OA, because it clearly indicates that anyone can be actively engaged with research, and not only professional scientists.
Some traditional publishers and some academics have argued that public access to research is not required because research papers cannot be understood by non-specialists ( cyber.law.harvard.edu/hoap/Open_Access_(the_book) - see Section 5.5.1). However, citizen science initiatives already indicate the general public is interested in and understands the research. Whereas this understanding and engagement is highly variable, and strongly dependent on a range of extrinsic and intrinsic factors, the fact that a high level of public interest in science already exists is of relevance. These publishers and academics argue that specialization is a sufficient reason for confining access to professional research bodies through subscriptions. Such statements conflate a lack of desire or need for access with the denial of opportunity to access research, and makes false presumptions about the demand in access to the literature (i.e., unmet and unknown demand). Importantly, OA provides access to everyone who potentially needs or wants it, without making explicit and patronising statements or guesswork about who needs or deserves it. As Peter Suber says in his 2012 book: "The idea [of OA] is to stop thinking of knowledge as a commodity to meter out to deserving customers, and to start thinking of it as a public good, especially when it is given away by its authors, funded with public money, or both" (page 116). Isolated incidents such as the crashing of servers of Physical Reviews Letters upon the ‘Gravitational Waves’ announcement and OA publication (Feb, 2016; Abbott et al. , 2016 ) indicate that there are cases of extreme public interest in science that closed access would only impede. Moreover, one out of four people seeking medical information have hit a paywall at least once ( pewinternet.org/2013/01/15/information-triage/ ). Claims that only experts can and should read research articles does little to break down the ‘ivory tower’ perception that still pervades academia, and undermines the enormous amounts of resources invested in science communication and public engagement activities. Such perceptions run counter to the idea of access to knowledge as a right, retaining it as a privilege based on financial or academic status.
The arguments outlined above form the basis for democratic and equal access to research, which come to light even stronger in the developing world. For low- and middle-income countries (LMIC), OA publishing breaks traditional financial barriers and allows unrestricted, equal access to scholarly information to people all over the globe. Due to the high prices of journal subscriptions, developing countries struggle with access just as in developed countries, but to a greater extent and consequently with greater negative repercussions. For example, a research paper from 1982 that indicated why Liberia should be included in the Ebola endemic zone was unknown to Liberian officials in the 2014 Ebola outbreak ( Knobloch et al. , 1982 ); the paper was published behind a paywall, drastically reducing its discoverability. Even though the result is available in the abstract of the paywalled article, assessing the truth of the result certainly requires access to the full research article. In general, lack of access can have major deleterious consequences for students and researchers, in that they do not have sufficient material to conduct their own primary research or education.
OA provides a mechanism to level the playing field between developed and developing countries. This increases fair competition and the scientific potential of the developing world ( Chan et al. , 2005 ). This aspect is linked to the wider issue of open licensing, which is essential for effective marketing of medicines and medical research in developing countries ( Flynn et al. , 2009 ), and justifies the necessity of OA in the wider context of social welfare. Developing countries clearly acknowledge the need for access and as such have launched many repositories to increase access with self-archiving of research articles. In 2014, over 100 institutions in Africa launched a network of over 25 fully-operational OA repositories in Kenya, Tanzania and Uganda ( www.ubuntunet.net/april2014#researchrelevant ). Such developments suggest that African nations are leaning more towards a Green model of OA adoption.
The shift from a ‘reader pays’ to a pre-publication fee model (often conflated with ‘author pays’; see subsection ‘The effect on publishers’) with OA potentially limits its adoption in developing countries. The pay-to-publish system is a potentially greater burden for authors in developing countries, considering that they are not used to paying publication costs, and funding systems for OA are not as well-established as those in the Western world. Publication fees present an even greater relative burden ( Matheka et al. , 2014 ) given that they can often exceed a monthly salary for researchers. This has been at least partially mitigated with waiver fees for authors from developing countries and additional provisions in research grants, and around 70% of peer reviewed OA journals are fee-free. In November 2015, Research4Life ( research4life.org ) and DOAJ announced a working partnership that will help to ensure that the Research4Life users will have access to the largest possible array of OA journals from publishers with a certain quality standard. While Research4Life does not directly cover OA publication costs, a lot of publishers propose full or partial waivers if they are based in countries eligible by Research4Life. However, determining which countries qualify for access to scientific journals through these programs, and which journals they are provided access to, is a fairly closed process. They are also not entirely stable, as publishers can opt out of the initiative, or be selective about which countries they choose to serve. In 2011, publishers withdrew free access to 2500 health and biomedical journals for Bangladesh ( Kmietowicz, 2011 ) through the HINARI programme. While access was subsequently reinstated, this demonstrates that such initiatives are not an adequate replacement for full OA ( Chatterjee et al. , 2013 ). Despite these programs purporting to provide essential articles to researchers in poor nations, they exclude some developing countries (e.g., India) and limit access to researchers who work in registered institutions.
Initiatives such as the Journals Online Project developed by INASP (International Network for the Availability of Scientific Publications; inasp.info/en/ ) has helped to develop a number of online OA platforms in the Global South. These were launched in 1998 with the African Journals Online (AJOL) platform, a project currently managed in South Africa. More recently, INASP have set up Latin American Journals Online (LAMJOL) which hosts journals in El Salvador, Honduras, and Nicaragua. In Asia, Bangladesh Journals Online (BanglaJOL), Nepal Journals Online (NepJOL), and Sri Lankan Journals Online (SLJOL), all facilitated through INASP, continue to develop and now around 95% of their articles are full-text Open Access. As mentioned previously, improved access should not be limited to professional researchers only, considering that there is also global interest from the broader public, including health professionals.
One negative effect of OA comes from entities that attempt to profit by exploiting the pay-to-publish system used by many OA publishers. These publishers operate a sub-category of OA journals known as vanity presses, predatory publishers ( Beall, 2012 ) or pseudo-journals ( McGlynn, 2013 ). These journals, referred to in this work as ‘deceptive publishers’, seem to be in the scholarly publishing business primarily to collect publication fees (i.e., APCs) in exchange for rapid publication without formal peer-review. Beall (2015) has defined a list of criteria for identifying deceptive publishers and an index of publishers and individual journals that meet these criteria is continuously updated ( scholarlyoa.com ).
While not all scholars and advocates agree with the criteria proposed by Jeffrey Beall (who controversially describes the OA movement as "an anti-corporatist movement that wants to deny the freedom of the press to companies it disagrees with" ( Beall, 2013 )), there are several factors that many agree on to identify a deceptive publisher, but these factors are not clear-cut indicators of deceptive publishing. One such indicator is that deceptive publishers tend to charge low publication fees ( Xia, 2015 ), most below $100 and few charge more than $200. However, while this is a trait of almost all deceptive publishers, the reverse is not necessarily the case. For example, a single-authored paper with PeerJ would cost $99, but this is not a deceptive publisher. On the contrary, the average publication fee of journals indexed in the Directory of Open Access Journals (DOAJ) is around $900–$1,000 ( Solomon & Björk, 2012 ) and leading universities in the UK and Germany pay on average $1,200–$1,300 per article ( Schimmer et al. , 2015 ). The editorial and peer-review aspects of deceptive publishers are either non-existent or suspect; they also falsely claim to have ratings such as a Journal Impact Factor and to be indexed in major databases such as Scopus ( Djuric, 2015 ). Editors from these journals solicit articles that have no relation to the topic of their journal and do not send the manuscripts out to be properly peer-reviewed ( Bowman, 2014 ).
The problem of deceptive publishers in OA seems to highly affect countries where the academic evaluation strongly favors international publication without further quality checks ( Shen & Björk, 2015 ). Xia et al. (2015) collected and analyzed the publication record, citation count, and geographic location of authors from the various groups of journals. Statistical analyses verified that deceptive and non-deceptive journals have distinct author populations: authors who publish in deceptive journals tend to be early-career researchers from developing countries with still little publishing experience. The spatial distribution of both the deceptive publishers and those authors who submit in pseudo-journals is highly skewed: Asia and Africa contributes three quarters of authors ( Xia et al. , 2015 ) and Indian journals form the overwhelming proportion of deceptive publishers ( Xia, 2015 ). An interesting finding is the very low involvement of South America, both among deceptive publishers (0.5%) and corresponding authors in deceptive journals (2.2%). The OA infrastructure in Latin America is different compared to other developing countries, which reveals a possible reason for this asymmetric situation. Latin American journals and universities are engaged in OA publication models at a higher degree than other regions ( Alperin et al. , 2011 ). As a result, scholars from this region are not only more aware of OA issues, but they have more options for publishing OA than those from other regions ( Alperin et al. , 2011 ). Moreover, SciELO ( Packer, 2009 ) and the creation of Latin American databases ( Alonso-Gamboa & Russell, 2012 ) have played a tremendous part in this process by bringing recognition and a good reputation to publishing outlets in Latin America.
Considerable attention is given to the subject of deceptive publishers, who have become conflated with the OA movement in general to the detriment of genuine OA publishers. For example, a ‘sting’ operation that outed bad peer-review instead got misinterpreted as bad peer-review in OA journals ( Bohannon, 2013 ), but was probably more indicative of issues to do with the traditional closed and over-burdened system of peer review ( scilogs.com/communication_breakdown/jon-tennant-oa/ ). Overall, the deceptive publisher phenomenon is one major negative aspect that spawns many misconceptions and misgivings about publishing OA. Recently launched industry-led initiatives such as "Think, Check, Submit" ( thinkchecksubmit.org ) provide a checklist to help researchers identify trustworthy journals, and will likely be a pivotal tool in combating deceptive publishers.
OA exists in a constantly evolving scholarly research ecosystem and the proliferation of "open" as a description of scientific activities has caused some confusion about what the term "open" means (for a more comprehensive discussion, see Pomerantz & Peek (2016) ). As such, it is important to note how it is interconnected to other facets of the scholarly communication system. Here, we discuss the implications that the transition to OA has on developments in the broader context of Open Science (or Open Research).
The overall movement of OA has become conjoined with the drive for Open Data. Data sharing is fundamental to scientific progress, because data lead to the knowledge generated in research articles. Furthermore, data sharing has recently become a common requirement, together with OA, for both research funding and publication. The data sharing policy from PLOS illustrates the high degree of overlap between OA and Open Data; authors of articles published in PLOS are required to share the data except if they have valid reasons not to (i.e., an opt-out system; journals.plos.org/plosone/s/data-availability ). Many publishers, NGOs, and research funders have recently come together to commit to free research sharing in times of public health emergency, catalysed by the current Zika health threat ( http://www.wellcome.ac.uk/About-us/Policy/Spotlight-issues/Data-sharing/Public-health-emergencies/index.htm ). It is noteworthy that some of the largest publishers, including Wiley, Taylor and Francis, and Elsevier (with the exception of the journal The Lancet ) did not commit to research sharing during ongoing or future public health crises (as of May, 2016).
The benefits of Open Data are diverse, including a citation advantage. Combined with the citation advantage for OA articles, providing data alongside publications can increase citations on average by 30% ( Piwowar & Vision, 2013 ) and up to 69% ( Piwowar et al. , 2007 ), but this evidence is entirely field-dependent (e.g., Dorch et al. , 2015) ). Below we cover seven additional benefits of Open Data.
First, data sharing enhances reproducibility, a crucial aspect in a time where some scientific domains appear to have problems with reproducibility (e.g., Open Science Collaboration, 2015 ). Several factors could form the basis for this ‘crisis’, such as an overemphasis on novelty instead of rigour, selective reporting of results, an overemphasis on statistical significance, and insufficient documentation of the research methods. Publicly sharing data, code, and materials can certainly alleviate issues with reproducibility. This is especially pertinent in the modern sciences, where a substantial proportion of published results draw on quantitative experiments and computer simulations. As such, it is largely impossible to reproduce these experiments as they become more complex and the associated datasets increase in complexity. When full access to the data, metadata, and the code used to produce the ultimate results are provided alongside publication, this greatly improves reproducibility.
Second, publicly available data can be used to stimulate innovations, such as new analytical methods. An excellent example of this is provided by the neuroimaging OpenfMRI project, where shared data have been used to examine the effects of different processing pipelines on analysis outcomes ( Carp, 2012 ) and test new methods to characterize different cognitive tasks ( Turner & Laird, 2012 ). Another good example is the Protein Data Bank (PDB) ( Berman et al. , 2000 ), a project which has enabled the re-use of the primary structural data and opened up new avenues of research, despite the latter not being expected.
Third, data sharing enables new research questions that can only be answered by combining datasets which now remain separated. Analyzing vast volumes of data can yield novel and perhaps surprising findings. This allows for integrated research hypotheses on the underlying processes behind the original data and observations. Exploratory approaches to large datasets can be seen as hypothesis generating tools, which later drives experimental testing to confirm or disprove these hypotheses ( Wagenmakers et al. , 2012 ).
Fourth, the realization that data will ultimately be shared and visible to the community provides a strong incentive for researchers to ensure they engage in better data documentation and, therefore, research methods. For example, the willingness to publicly share data has been associated with fewer statistical errors in the final research article ( Wicherts et al. , 2011 ).
Fifth, public data sharing provides a digital backup for datasets, protecting valuable scientific resources. Moreover, a considerable amount of data produced every day does not ultimately lead to publication and often remain hidden. Such data might remain in a hidden file-drawer despite being valid, creating a systematic bias in the information available. Public data sharing opens this file-drawer and, consequently, allows independent assessments of whether the data are valid or not.
Sixth, sharing data can certainly reduce the cost of performing research. A file-drawer has been indicated to greatly reduce the efficiency of research in detecting effects ( van Assen et al. , 2014 ). Open Data, as such, discourages redundant data collection (i.e., data that have been already collected but never made publicly accessible) and simultaneously allows researchers to better approximate what is happening in their fields. This will have a large effect on research costs, resulting in savings that can be then be used for more productive research goals.
Finally, and tightly connected with the sixth point, Open Data potentially has a great economic value. For example, Open Data creates jobs for analysis and re-use of these data Capgemini (2015) , and contributes to additional value of products and services in major sectors ( Manyika et al. , 2013 ), ad well as benefits users of these data rich services ( Stott, 2014 ).
Beyond OA and Open Data lies a more integrated approach to research, referred to more broadly as Open Science (i.e., Science 2.0, Open Scholarship). According to the European Commission’s Horizon 2020 programme, Open Science is defined as " The transformation, opening up and democratisation of science and research through ICT, with the objectives of making science more efficient, transparent and interdisciplinary, of changing the interaction between science and society, and of enabling broader societal impact and innovation ". Consequently, we see OA as only one of the multiple challenges currently facing the ‘open transformation’ of the scholarly publishing system ( Watson, 2015 ), and OA should therefore be considered in the wider contexts and complimentary domains of research transparency and open source.
As Kriegeskorte et al. (2012) pointed out, OA is now widely accepted as desirable and becoming a reality in many academic spheres. However, the second essential complementary element to research, evaluation, has received less attention despite the large amount of research that has been done to document its current limitations ( Benos et al. , 2007 ; Birukou et al. , 2011 ; Ioannidis, 2005 ; Ioannidis, 2012a ; Ioannidis, 2012b ; John et al. , 2012 ; Nosek & Bar-Anan, 2012 ; Simmons et al. , 2011 ).
Open evaluation, an ongoing post-publication process of transparent peer review and rating of papers, promises to address the problems of the current assessment systems Kriegeskorte et al. (2012) , as well as increasing the overall quality of the peer review process. As such, ongoing assessments of the development of OA must also consider the broader impact and concurrent changes to the peer review system ( van Rooyen et al. , 1999 ; Wicherts, 2016 ; Leek et al. , 2011 ). Some assessment methods, such as the Research Excellence Framework (REF) in England and administered by HEFCE, have already made OA a core feature of evaluation in that all research papers submitted to the REF must be archived in an institutional or subject repository ( www.hefce.ac.uk/pubs/year/2014/201407/ ). While it is too early to evaluate the impact of this policy, by tying OA compliance with research evaluation we might expect to see a national shift towards large-scale OA adoption. At the very least, such a combination is generating increasing interest and awareness about OA among researchers, increasing usage of institutional repositories, and increasing demand for funding for APCs ( Tate, 2015 ).
Future research regarding better ways to improve scholarly communication will be instrumental in providing evidence to support the transformation of the publishing system and design new alternatives ( Buttliere, 2014 ; Ghosh et al. , 2012 ; Kriegeskorte et al. , 2012 ; Pöschl, 2012 ), which will draw heavily upon on open publishing framework driven by developments and newly emerging models in OA. Finally, consideration of Open Science and OA will be important inclusions in evolving research standards such as the Transparency and Openness Promotion (TOP) guidelines ( https://cos.io/top/ ) .
Conclusions
This article provides an evidence-based review of the impact of OA on academy, economy and society. Overall, the evidence points to a favorable impact of OA on the scholarly literature through increased dissemination and re-use. OA has the potential to be a sustainable business venture for new and established publishers, and can provide substantial benefits to research- and development-intensive businesses, including health organisations, volunteer sectors, and technology. OA is a global issue, highlighted by inequalities between developing and developed nations, and largely fueled by financial disparity. Current levels of access in the developing world are insufficient and unstable, and OA has the potential to foster the development of stable research ecosystems. While deceptive publishing remains an ongoing issue, particularly in the developing world, increasing public engagement, development of OA policies, and discussion of sustainable and ethical publishing practices can remove this potential threat.
For libraries, universities, governments, and research institutions, one major benefit of lowering the cost of knowledge is the availability of extra budget that can be reallocated for other purposes. For researchers themselves, OA can increase their audience and impact by delivering wider and easier access for readers. For publishers, promoting OA is an answer to the desires and the needs of their research communities. Furthermore, subscription-based publishers have (partly) answered the call of the increasing global demand for OA, by giving their green light to author self-archiving ( Harnad et al. , 2008 ), as well as by establishing numerous ‘hybrid’ OA options. In an author survey, Swan & Brown (2004) reported that the vast majority of their sample indicated that they would self-archive willingly if their employer (or funding body) required them to do so. Similarly, in a study by Swan & Brown (2005) the vast majority of researchers (81%) indicated that they would comply with mandates that made OA a condition of funding or employment. There is evidence that many funders and research organisations are moving in this direction: since 2005, the number of policies supporting OA publishing increased steadily, and there is similar growth in the number of institutional rights-retention policies. Consequently, it is now the responsibility of researchers to ensure OA to their publications either by choosing the Green or the Gold road, and for public research funders to employ policies that are in the best interests of the wider public while considering the financial sustainability of the scholarly publishing ecosystem.
The fact that OA impacts upon such a diverse range of stakeholders, often with highly polarised and emotional viewpoints, highlights the ongoing need for evidence-informed discussion and engagement at all levels. This is especially the case for research communities, who have exceptionally diverse perspectives about OA and in particular how it interacts with ‘quality’ and ‘prestige’ in publishing ( Schroter & Tite, 2006 ; Schroter et al. , 2005 ). As Peter Suber, a leading voice in the OA movement, stated ( dash.harvard.edu/handle/1/4391169 ).
Therefore, OA supporters should focus their efforts on working for new models and systems rather than trying to undermine or punish the existing ones. OA remains only one of the multiple challenges that the scholarly publishing system is currently facing. As highlighted in this review, the empirical evidence for OA is overwhelmingly positive, but further research is certainly required to move from investigating the effects of OA to researching the broader effects of Open Science. In particular, OA must be considered in the future to more broadly regarding the adverse effects of a system of journal-based research assessment ( Brembs et al. , 2013 ), and the development of scholarly communication systems that are sustainable for, and in the best interests of, the commons.
We would like to collectively acknowledge the OpenCon community for inspiring this paper, and for providing continuous discussion about the various aspects of Open Access. In particular, we are grateful to Brett Buttliere, Audrey Risser, Sarah Barkla, and April Clyburne-Sherin for contributing resources to the development of this paper, and Tracey Depellegrin Connelly, Matt Menzenski, and Joseph McArthur for helpful comments on an earlier draft. We also thank Neil Saunders who provided the base code to extract data from PubMed Central. We would also like to thank Andy Nobes for drawing our attention to the work of INASP. PM would like to thank Lennart Martens for insightful discussions on Open Science. For comments on the first published version of this manuscript, we would like to thank Philip Young, Ross Mounce, Anna Sharman, and David Wojick for their helpful comments. Reviews from Gwilym Lockwood, Peter Suber, Paige Brown Jarreau, Anne Tierney and Chris Chambers were supportive, constructive, and greatly improved the content and balance of this article.
[version 3; referees: 3 approved
This research was partly funded by the Belgian National Fund for Scientific Research through a FRIA grant. PM acknowledges support from the European Commission Horizon 2020 Programme under Grant Agreement 634107 (PHC32-2014) ‘MULTIMOT’.
The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.
Peter suber.
1 Berkman Center for Internet & Society, Harvard University, Cambridge, MA, USA
I'm commenting on this sentence from the first paragraph of Version 3 of the article:
The Green route is also enabled through author rights retention, in which authors pre-emptively grant non-exclusive rights to their institutions before publishing any works. The institution then has the ability to make articles by these authors OA without seeking permission from the publishers (e.g., this is the case of the Dutch Taverne amendment that has declared self-archival of research after ‘a reasonable period of time’ a legal right ( Open Access NL, 2015 )).
The authors added the bulk of this at my request in an earlier comment. I'm glad they did, but it's still only partially true. It's true that universities can adopt rights-retention OA policies that make it unnecessary to seek permission from publishers. But it's not true that Dutch Taverne amendment is an example. It's not a university policy, but legislation. It's legislation that gives authors permission for green OA regardless of the contracts they signed, and regardless of the rights they might have retained through a university policy. It's a very good idea and I recommend it everywhere. (There is already a similar law in Germany, and one is now emerging in France.) But what the authors need here is an example of a university rights-retention OA policy, or a thorough explanation of this kind of OA policy.
I'm not seeking a citation to my own university's activity, though it adopted the first OA policies of this kind. Nor am I seeking a citation to the good-practices guide that I maintain with Stuart Shieber, though it's the standard reference on this kind of policy. In fact, I was reluctant to follow up the authors' response to my prior suggestion because I didn't want to appear to seek additional citations.
But since I've gone this far, I'll mention these two sources anyway:
The Harvard open-access policies
https://osc.hul.harvard.edu/policies/
Good practices for university open-access policies
http://bit.ly/goodoa
Of the two, I'd recommend the second in this situation. But even if the authors include no citation on this point, at least they should stop citing the Dutch law as an example, and treat it separately as another path to the same goal.
I have read this submission. I believe that I have an appropriate level of expertise to confirm that it is of an acceptable scientific standard, however I have significant reservations, as outlined above.
Gwilym lockwood.
1 Neurobiology of Language Department, Max Planck Institute for Psycholinguistics, Nijmegen, Netherlands
I have no further comments; my points or concerns have been addressed, and other issues are highlighted in further depth by other reviewers.
I have read this submission. I believe that I have an appropriate level of expertise to confirm that it is of an acceptable scientific standard.
In the first version I reviewed, you said, "This [green] route is dependent on journal or publisher policies on self-archiving (sherpa.ac.uk/romeo)."
That was untrue or incomplete and I offered this comment: It overlooks rights retention. Some individual authors retain enough rights to authorize green OA on their own. While this may be fairly rare, rights-retention OA policies at universities are increasingly common. More than 80 institutions in North America, Europe, Africa, and Asia have now adopted rights-retention OA policies. Under these policies, the faculty grant non-exclusive rights to their institution before they sign future publishing contracts. The institution then has permission to make those future articles OA without having to seek permission from publishers. (The institutions also grant the same non-exclusive rights back to authors.) For more detail on rights-retention OA policies, see Stuart Shieber and Peter Suber, "Good Practices for University OA Policies."
http://bit.ly/goodoa
You revised the text in a way that missed my point and misstated my position: "While academics themselves may have little power in debates regarding copyright, institutes could claim ownership of the work they likely already own by invoking their rights under the work made-for-hire doctrine (Denicola, 2006). However, it is difficult to imagine researchers favoring university-held rather than journal-held copyright, and a system of non-exclusive rights is preferred, as is reflected in OA policies and OA journals (Suber, 2012)."
Here's the main point: More than 80 universities around the world have adopted rights-retention OA policies. These policies are adopted by faculty votes, not administrative edicts. At these institutions the rights needed to authorize OA are not seized from faculty by the institution, or claimed by the institution through work-for-hire. These policies presuppose that the rights initially belong to faculty, not the institution. If the institution is to exercise them, faculty must voluntarily grant them. (There are details we needn't go into here, for example, that we're only talking about non-exclusive rights, and that these policies generally include waiver options when faculty don't want the institution to have rights to a given work.) At Harvard, which pioneered this type of policy, four of the school-level votes were unanimous. In short, it's not at all "difficult to imagine researchers favoring university-held...copyright." On the contrary, it's easy to imagine and widely attested. Green OA does not always depend on permission from publishers. Increasingly it depends on rights retention by authors, through carefully drafted and widely supported university policies. That's a fact. My opinion is that that's a good thing.
You needn't share my opinion, and needn't mention the fact. But please don't misrepresent the fact or reverse my opinion.
1 School of Psychology, Cardiff University, Cardiff, UK
All my suggestions have been addressed. I'm very happy approve this interesting and useful addition to the literature.
Tennant et al offer a timely and insightful review of the various effects of open access publishing on science and society. The paper is well structured and enjoyable to read. Although I am not an expert on open access publishing, I also found the discussion of the literature quite balanced and evidence-based.
I have just three recommendations for revisions:
p13 This sentence is difficult to parse: "Whereas this is hyper-variable, and strongly dependent on a range of factors, it is the fact that any public interest in science that is of importance."
1 Department of Learning and Teaching Enhancement, Edinburgh Napier University, Edinburgh, UK
This paper is a comprehensive review of the complexities of OA. I have come late to the discussion on this paper, and I find that the previous reviewers have been meticulous in their critique of the paper, to the point I have very little to add. However, there are a couple of points for consideration. What is the effect (if any) of the UK Research Excellence Framework on Open Access? To what extent is disciplinarity a factor in Open Access? I ask this question because of the high impact of the sciences and biomedical research, but wonder about Arts and Humanities (and other areas) as a comparision. As one of the other reviewers said, "this doesn't consider the fact that the prestige of some journals is advertising in and of itself." While this is true of scientific journals, the same can't be said, for example, of education journals, so there is a lack of parity between disciplines. There was also an assumption of the willingness of reviewers to continue to offer their services freely. This aspect of OA (and subscription-based) publishing is hardly ever critiqued, but it assumed to be part of the process.
All in all, this paper gives a lot of food for thought. I don't expect a rewrite of the paper, based on my comments, but I would welcome further discussion on where the authors (and readers) see OA going in the future.
1 Manship School of Mass Communication, Louisiana State University, Baton Rouge, LA, USA
This is an interesting and timely review of the issue of open access to scientific literature.
The two other reviewers have highlighted specific issues that should be addressed in the revision of this article, and I agree with these issues. I've added other notes below. I think this review article would benefit from a re-write to correct potentially biased language in support of open access and to round out the review with further evidence of open access impacts on citation rates, altmetrics, scientific literacy / public engagement and research quality.
The authors cite “fostering a culture of greater scientific literacy” as a benefit of open access. While this is theoretically a benefit, has more or less open access in particular scientific fields been tied to greater or lesser scientific literacy in those areas? Is this potential benefit supported by research literature? The authors should be clear on what the evidence-based benefits of open access are, and also what the potential drawbacks are. References to related research should be provided on this topic.
The structure the authors use for laying out their evidence and the language they use (e.g. “[the] case for Open Access” ) appear to lean more toward the positive impacts / benefits of open access from the outset. The authors should be very careful to review the evidence first before making value-based statements or arguments about open access, even if the evidence-based benefits outweigh any potential drawbacks, or lack of significant benefits, in the end.
Correct the typo in the following sentence: “[In] A longitudinal study Eysenbach (2006) compared…”
The authors write: “One alternative explanation for the citation advantage could be that researchers choose to publish OA when a finding is more impactful, but empirical evidence contradicts this selection effect. Gargouri et al. (2010) compared citation counts within a cohort of OA articles that had either been self-selected as OA or mandated as OA (e.g., by funders). The study concluded that both were cited significantly more than non-OA articles. As such, these findings rule out a selection bias from authors as the cause for the citation advantage (Gargouri et al., 2010).” However, couldn’t funded research also have a tendency to be considered “more impactful,” because it was chosen in the first place to be funded and mandated as OA? The authors should discuss this, and whether there is any research that experimentally investigates whether open access provides a citation advantage. This could perhaps be suggested as future research. The authors should also discuss how/why some studies have found no citation advantage for OA papers.
Related to social media mentions of research papers and citation counts, the authors might also consider citing Liang, X., Su, L. Y. F., Yeo, S. K., Scheufele, D. A., Brossard, D., Xenos, M., ... & Corley, E. A. (2014). Building Buzz (Scientists) Communicating Science in New Media Environments. Journalism & Mass Communication Quarterly , 1077699014550092. [ PDF ]
In discussing the economics of OA, the authors should also discuss any evidence of potential drawbacks for various stakeholders, such as where funds for pay-to-publish fees will come from and how these fees may affect individual researchers. Pay-to-publish models of OA may also burden early career researchers and researchers working in fields where research grants are more difficult to obtain.
The authors do not discuss the potential impact of OA on research quality or reproducibility (reproducibility is only mentioned in the context of open data). As this has been a controversial issue in the past (e.g. the mentioned 'sting' operations) the authors should discuss any research that has investigated the impact of open access on the rigor of peer reviews, research quality, presence of replication studies / reproducibility, etc. There has also been some discussion of whether open peer review (like that used by F1000Research) affects the quality of reviews, e.g. (Rooyen et al. 1999). The authors should mention this and/or subsequent literature when addressing open peer review.
There have also been studies on scientists' / journal article authors' perceptions and attitudes toward open access, e.g. Schroter and Tite (2005; 2006). The authors might considering summarizing some of this research, as it gives context to some of the existing barriers to open access and perceived drawbacks among researchers.
In summary, the topic of this review is important and timely. However, this paper falls short of what I would expect from a systematic review in terms of systematically summarizing previous research findings related to the impact of open access on scientific publishing, public engagement with science, science literacy and altmetrics. The authors should be careful to hold back value judgements / arguments related to the case for open access until having systemically reviewed the evidence-based benefits, drawbacks, and/or lack of significant benefits. The authors should also provide some discussion of how we might objectively weigh any evidence-based benefits with potential drawbacks for various stakeholders including researchers and especially early career researchers. The authors should avoid summarizing mostly the findings of previous studies that find positive impacts of open access on the various domains of potential impact they consider in their paper. The evidence already presented in the paper is rigorous and detailed. However, I would recommend a revision that rounds this review out with more systematic evidence.
The article is very well-done, unusually thorough and detailed. Here are a few ways to improve it.
When I refer to page numbers, I mean the page numbers in the PDF of v1, April 11, 2016.
http://f1000research.com/articles/5-632/v1
"You" refers to the authors.
Apologies in advance if I sometimes cite my own work in these comments.
p. 1. In the abstract you say, "The economic case for Open Access is less well-understood, although it is clear that access to the research literature is key for innovative enterprises, and a range of governmental and non-governmental services."
This understates the economic case. For example, some subscription journals convert to OA precisely for economic benefits.
See the preliminary version of David Solomon, Bo-Christer Björk, and Mikael Laakso, "Converting Scholarly Journals to Open Access: A Review of Approaches and Experiences" now open for public comment. (The final version will be published this summer.)
https://osc.hul.harvard.edu/programs/journal-flipping/public-consultation/
See especially section 4.6, "Increased revenue and financial viability."
https://osc.hul.harvard.edu/programs/journal-flipping/public-consultation/4/6/
p. 3. You say, "The Green route refers to author self-archiving, in which a version of the peer-reviewed article is posted online to a repository or website."
Green OA also applies to preprints, which are not peer-reviewed.
p. 3. You say, "This [green] route is dependent on journal or publisher policies on self-archiving ( sherpa.ac.uk/romeo) ."
This is importantly incomplete. It overlooks rights retention. Some individual authors retain enough rights to authorize green OA on their own. While this may be fairly rare, rights-retention OA policies at universities are increasingly common. More than 80 institutions in North America, Europe, Africa, and Asia have now adopted rights-retention OA policies. Under these policies, the faculty grant non-exclusive rights to their institution before they sign future publishing contracts. The institution then has permission to make those future articles OA without having to seek permission from publishers. (The institutions also grant the same non-exclusive rights back to authors.)
For more detail on rights-retention OA policies, see Stuart Shieber and Peter Suber, "Good Practices for University OA Policies."
p. 3. You say, "A subscription to all peer-reviewed journals is not affordable for any single individual, research institute or university (Odlyzko, 2006)."
This is true and important, but it's a pity you don't cite more recent evidence than 2006.
An important kind of evidence for this proposition is that not even Harvard University can afford all the journals needed by its faculty and students, and must cancel journals every year for budgetary reasons alone. I've collected seven public statements from Harvard to this effect (2008-2012) in the supplements to p. 30 of my 2012 book (Open Access, MIT Press, 2012).
http://bit.ly/oa-book#p30.2
p. 3. You say, "Much of the driving force behind this global change has been through a combination of direct, grassroots advocacy initiatives in conjunction with policy changes from funders and governments."
Please add *university policies* to this list. They're on a par with funder policies in importance, and they're far more numerous. ROARMAP shows that 7+ times more universities have OA policies than funders.
p. 3. You say, "The Open Access movement is intrinsically tied to the development of the Internet and how it redefined communication and publishing (Laakso et al., 2011)."
For more documentation on how the OA movement arose as soon as the internet arose, see my Timeline of the Open Access Movement.
http://legacy.earlham.edu/~peters/fos/timeline.htm
In 2009, I moved the timeline to the Open Access Directory wiki, and you should probably cite that version:
Timeline of the open access movement...
http://oad.simmons.edu/oadwiki/Timeline
...especially subsection on developments before 2000.
http://oad.simmons.edu/oadwiki/Timeline_before_2000
(You might cite this timeline again on p. 5, when you introduce your own timeline.)
p. 3. You say, "One result of the growing OA movement is the rise of OA-only publishers...."
Somewhere in this paragraph, I'd mention that some OA publishers are for-profit (e.g. BMC) and some are non-profit (e.g. PLoS).
p. 4. Your section on the impact advantage is very well-done. Most treatments are much briefer, less careful, and less detailed than yours.
I have just these suggestions. You cite authors of individual studies, and Alma Swan's 2010 literature review. But you don't cite the mother lode of literature on this topic: Steve Hitchcock's annotated bibliography, "The effect of open access and downloads ('hits') on citation impact: a bibliography of studies."
http://eprints.soton.ac.uk/354006/1/oacitation-biblio-snapshot0613.html
Or more precisely, you cite it once, 10 paragraphs before the section on the impact advantage begins. You should cite it again within the section on the impact advantage. You should mention that it's comprehensive and annotated.
Hitchcock stopped updating it in 2013. But you should mention that SPARC Europe has committed to update it through its Open Access Citation Advantage Service.
http://sparceurope.org/oaca/
Finally, in the same place where you cite Swan's literature review, you should cite Ben Wagner's literature review, "Open Access Citation Advantage: An Annotated Bibliography," Issues in Science and Technology Librarianship, Winter 2010.
http://www.istl.org/10-winter/article2.html
p. 5. In the timeline entry for 2002, the BOAI was released on February 14, not January 14.
p. 6. In the timeline entry for 2013, I'd say that the suicide of Aaron Swartz "increases" (not "gains") international attention for the OA movement, or "draws new attention" to the OA movement. The current language suggests that the OA movement didn't have international attention before that, which is very far from the truth.
p. 9. You say, "Shifting copyright to stay with the author allows for wider re-use, including TDM, and forms the basis for a robust and developing public domain."
You shouldn't use "public domain" here. In copyright law, the term has a specific meaning which you don't mean here.
p. 10. You say, "Only recently has any transparency into the detailed costs of subscriptions been gained by using Freedom of Information Requests to bypass non-disclosure agreements between libraries and publishers (Lawson & Meghreblian, 2015)."
Here you overlook the earlier Big Deal Contract Project in the US, from Ted Bergstrom, Paul Courant, and Preston McAfee. It too used public records laws and Freedom of Information requests. I'm not sure when it launched, but it was before 2009.
http://www.econ.ucsb.edu/~tedb/Journals/BundleContracts.html
p. 10. You say, "The average production cost for one paper is estimated to be around $3500–$4000 (Van Noorden, 2013)."
I've seen dozens of widely varying estimates of this cost, most of them much lower than Van Noorden's. Unfortunately I don't have time to hunt them down. I hope you can introduce at least a few more, if only to show that estimates differ widely here.
p. 10. You say, "Philip Campbell (Editor-in-Chief of Nature) stated that his journal’s internal costs were at $20,000–$30,000 per paper...."
To clarify, I think he meant that this was the cost per published paper. If Nature rejects x articles for every one it publishes, then this includes the cost of peer reviewing x rejected articles. Since Nature is very selective, x is high. But this "cost per published paper" should not be compared to costs for peer-reviewing a single paper or the production costs of publishing an accepted paper.
p. 10. You say, "OA publishing is most prevalent in the form of ‘pay-to-publish’...."
This is either false or misleading. About 70% of peer-reviewed OA journals charge no APCs at all. In that sense, the fee-based model is not the most prevalent. It's a minority model. On the other hand, about 50% of the articles published in peer-reviewed OA journals are published in the fee-based variety.
On my claim that most OA journals charge no APCs:
See my article, "Good facts, bad predictions," SPARC Open Access Newsletter, June 2006.
https://dash.harvard.edu/handle/1/4391309
And my article, "No-fee open-access journals," SPARC Open Access Newsletter, November 2, 2006. https://dash.harvard.edu/handle/1/4552050
The DOAJ used to make it easy to see what percentage of listed journals were fee-based and what percentage were no-fee. But it has temporarily made that difficult by combining the categories of "no-fee journals" and "journals for which we don't have enough information to say."
On my claim that about half the articles published in peer-reviewed OA journals are published in the fee-based variety, see the updates to p. 170 of my 2012 book. There I cite three studies and quote the relevant excerpts.
http://bit.ly/oa-book#p170
p. 10. When you describe ways in which fee-based OA journals mitigate some problems arising from the model, you mention the firewall between the editorial and business side of the journal (good), and you mention fee waivers (good).
You should also mention fee discounts, which many journals give in lieu of fee waivers.
You should also mention that most fees charged by fee-based journals are paid by funders (59%), or the author's employer (24%), and only 12% are paid by authors themselves. These numbers are from Suenje Dallmeier-Tiessen et al., "Highlights from the SOAP project survey. What Scientists Think about Open Access Publishing," arXiv, January 29, 2011, Table 4.
http://arxiv.org/abs/1101.5260
p. 12. You say, "In his article Sutton (2011)..."
please change "his" to "her". The article is by Caroline Sutton.
p. 12. You say, "While OA is not a solution to all aspects of research accessibility (e.g., language barriers and disability access remain continuing issues to be addressed)...."
See my 2012 book (Open Access, MIT Press, 2012, http://bit.ly/oa-book ), at pp. 26-27, where I make much the same point. "OA isn't universal access" and by itself doesn't overcome "filtering and censorship barriers", "language barriers", "handicap access barriers", or "connectivity barriers".
p. 13. You refer to "the fact that access to knowledge is actively prohibited in fields like public health...."
I don't know what you mean here by "actively prohibited".
p. 13. You say, "Some traditional publishers, and some academics, have argued that public access to research is not required because research papers cannot be understood by non-specialists...."
Here you might want to cite Section 5.5.1 ("OA for Lay Readers," pp. 115-119) of my 2012 book.
p. 13. You say, "The shift from a ‘reader pays’ to an ‘author pays’ mode...."
I recommend avoiding the term "author pays" for the reasons I gave in my fourth comment to p. 10 above. Most OA journals don't charge author-side fees, and among those who do, most fees are not paid by authors.
p. 13. You say, "This has been at least partially mitigated with waiver fees for authors from developing countries and additional provisions in research grants...."
Yes. But again, don't forget that the majority of peer-reviewed journals are no-fee journals. See my fourth comment to p. 10 above.
pp. 15-15. You say, "Fortunately, it seems that funders and research organisations are moving in that direction. Since 2005, the number policies supporting OA publishing increased steadily. Consequently, it is now the responsibility of researchers to ensure OA to their publications either by choosing the green or the gold road."
Since you're recapitulating some grounds for optimism here, I'd also reiterate the growth of rights-retention OA policies. See my second comment on p. 3 above.
p. 16. You say, "As Peter Suber, a leading voice in the OA movement, stated: 'As long as they do not have the power to stop Open Access, the toll-access publishers are not the enemy'."
Thanks for quoting me. I wanted to give you the source to cite. Unfortunately, I don't think you're using an exact quote. Here's the closest one I can find:
"TA [toll-access] publishers are not the enemy. They are only unpersuaded. Even when they are opposed, and not merely unpersuaded, they are only enemies if they have the power to stop OA. No publisher has this power, or at least not by virtue of publishing under a TA business model. If we have enemies, they are those who can obstruct progress to OA. The only people who fit this description are friends of OA who are distracted from providing OA by other work or other priorities."
It's from "Two distractions," SPARC Open Access Newsletter, May 3, 2004.
https://dash.harvard.edu/handle/1/4391169
Imperial College London, UK
Hi Alexander,
I've been looking into this, and the most up-to-date statistics for this based on the DOAJ come from this source: http://citesandinsights.info/civ16i4.pdf . Figures here seem to suggest that 71% of journals in the DOAJ do not levy an APC. This is likely to change slightly with the updated 'crackdown' from the DOAJ ( http://www.nature.com/news/open-access-index-delists-thousands-of-journals-1.19871 ), but I'll add a reference to this in.
Harvard University, USA
Here are the latest stats from the DOAJ (May 24, 2016).
https://goo.gl/LejTAw
Dear Alexander,
I'm just posting the link that Peter was kind enough to send us yesterday, with updated statistics on this matter: https://plus.google.com/+PeterSuber/posts/HjrRDcrZS8p
Important points:
Here are the numbers as of May 24, 2016:
Total number of journals listed in DOAJ = 8,858
Yes (fee-based) = 1,424 = 16%
No (no-fee) = 2,601 = 29%
No info = 4,833 = 55%
Further references and information are provided in that post, and will be integrated into the next version of this manuscript.
This is the first time I've written an open peer review, although I always sign private peer reviews. Normally I'd make comments directed to the authors and the authors alone, but since this is open, I've also included a section for other readers of this paper. This may sound a bit like an Amazon or Airbnb review or something.
Short summary for readers
This is an excellent paper about the academic, economic, and societal benefits and impacts of Open Access. It's a good introductory text for people who don't know much about OA and would like to know more. It's also a good persuasive text for stakeholders in policy, universities, publishing, funding, etc. positions who may be interested in including OA in their decision making.
In addition to its attention to detail, its main strengths are its focus, its brevity, and its relative impartiality.
One of the difficulties with writing about OA is that there are so many overlapping issues; this paper is very good at giving a brief overview or description of the other issues, pointing the reader in the direction of somewhere with more information, and then getting back onto the topic.
Another thing about OA is that its advocates are very passionate about it. As with any cause, that's a good thing for its supporters, but overwhelmingly pro-OA resources can seem potentially off-putting to neutrals. This paper does an excellent job of presenting an evidence-based pro-OA viewpoint in a measured tone and without coming across as ideological.
One possible caveat is that the paper presents extensive evidence of what OA does, but it doesn't tackle the meatier issue of how to implement it successfully. However, I feel that's a separate issue which is beyond the scope and purpose of this paper.
Suggestions and comments for authors
First of all, great article! Well done and thank you for pulling together what is a disparate collection of links and literature into a one-stop shop which is both useful and coherent. I like this article a lot... but my role here is to criticise and make it better, so the rest of this review will focus on that.
This article is well-written and well-structured. That's made it much easier as a reviewer to simply go through the article and highlight my issues with it paragraph by paragraph, rather than having to make it coherent first and then sort out the smaller things.
The vast majority of the issues I have with this paper are minor ones, so it didn't make sense to have separate major/minor sections; rather, I'll just go through them in order in the text.
(I printed this out to underline/comment on, so for me, tables 1 and 2 came during the academic case for OA section. Online, they're supplementary materials, and I think it's best that way, but this is why I'm commenting on the tables during that section)
"We recommend that OA supporters focus their efforts on working to establish viable new models and systems of scholarly communication, rather than trying to undermine the existing ones..."
In general, I agree with this sentiment. However, I feel that its inclusion in the abstract is a bit jarring as the text of the article doesn't really cover recommendations to OA supporters at all, other than in the very last paragraph. I think that's good, as I feel this paper is best suited as relatively neutral source of information rather than a preaching to the converted or ideology discussion kind of purpose. So, I think this part can be left out of the abstract; it doesn't refer to any particular "recommendations to OA supporters" bit in the text and potentially clouds the strength of the relative impartiality of the paper.
A brief history of OA
"BioMed Central and ... PLOS were founded in the early 2000s and remain successful businesses to date." (p3, col2)
Technically, PLOS is a non-profit. I suggest changing successful businesses to successful business models. This both highlights the financial sustainability of OA (increased APCs at PLOS notwithstanding) and also sets it apart from traditional publishers, which are definitely successful businesses.
The academic case for OA
figure 1 (p4, top)
I have difficulty interpreting the y-axis on figure 1. It's labelled as cumulative number of PubMed articles relative to 2000, but I'm not sure how to read it. Reading off 2014, non-OA is c.22 on the y-axis, and OA is c.33 on the y-axis. Based on the figure 1 caption about the ratio, I'm interpreting this as meaning that, in 2014, the ratio of cumulative PubMed articles was approx 33:22 OA to non-OA, or in other words, 60% of PubMed articles in 2000-2014 were OA. However, I'm not sure if this is how it's meant to be interpreted. I think that it's well visualised, and really makes it clear how OA has taken off, but exactly what the numbers represent on the y-axis is unclear to me: number of articles? number of times more articles? It could use some relabelling.
"Napster moment" (p4, col1)
I like the comparison, but it could use a citation (even just the Napster wikipedia article) and/or a little more explanation to clarify what that means.
"1991 ... by the American physician Paul Ginsparg"
He's a physicist, not a physician.
Xu et al (2011)
I don't think this reference was very well cited. Firstly, Oxford Open Journals are listed as a discipline, when they're the source of papers across disciplines. The actual disciplines were Medicine, Social Sciences, Mathematics & Physical Sciences, Life Sciences, and Humanities. Secondly, you list the citation advantage as 138.87%. However, one of the main findings of this paper was the disparity in citation advantages; it ranged from 163.16% for OA articles in Mathematics & Physical Sciences to an actual citation disadvantage of -49.24% for OA articles in Humanities. Given the pro-OA nature of the paper, I feel like you have an extra responsibility to report the few anti-OA pieces of evidence.
Gargouri et al (2010) (page 8, col1)
This paragraph is about a possible confound for the OA citation advantage, where it could be that researchers choose to publish OA for extra cool findings, and you use the Gargouri et al. study to counter this... which is totally correct. You write:
"Gargouri et al. (2010) compared citation counts [for articles which were] self-selected as OA or mandated as OA. The study concluded that both were cited significantly more than non-OA articles. As such, these findings rule out a selection bias"
This is true that both OA types were cited more than non-OA. However, it's also missing the crucial point that there was no difference in citation between self-selected OA articles and mandatory OA articles. Including this would strengthen your point to show that it's OA itself which leads to the citation advantage.
The whole section about altmetrics (subhead societal impact of the academic case for OA, p8, col2) could use some attention. It's not clear until much later what the difference is between alternative metrics (i.e. altmetrics), i.e. the various types of metrics which are alternative to journal impact factors, and Altmetric, i.e. the company which is often confusingly referred to as Altmetrics (not in this paper, to be fair, but elsewhere). A quick disambiguating sentence or two would be really useful here.
In the following paragraph (page 8, col2), you write about OA altmetrics advantage, and say that there's a logical assumption that OA articles should have one. However, this doesn't consider the fact that the prestige of some journals is advertising in and of itself. You can, and do, get a lot of closed-access papers which generate high altmetrics (social media attention, Mendeley readhership) from academics who do have access. And sure enough, in the next paragraph, (page 8 and 9), the Wang et al. 2015 article finds that the OA altmetric advantage doesn't extend to the most impactful articles. I think this section can be made more nuanced and informative by quickly discussing the role of journal prestige. Nothing in depth, just as something that exists and needs to change (for example, you could point people to Brembs et al. and the Deep Impact paper in Frontiers).
"Essentially, copyright is a tool wielded by traditional publishers for financial gain rather than fostering creativity..."
I don't disagree with this. However, I feel it comes on too strong. I think it's fair to say that most people's immediate opinion of copyright is "well, I'd like my stuff to be copyrighted, as that means people can't steal it and pass it off as their own". I think that you need a little more detail here, even just two or three sentences to explain how and why copyright is used for financial gain rather than author protection. Otherwise, it just sounds political/ideological, and counterintuitive for people who haven't read much about copyright.
Glenisson et al. (2005) citation (page 9, col2).
You write that TDM has "proven to be useful for a large variety of applications", and use the Glenisson citation to back this up. I have to say here that I know very little about TDM; however, following through to the Glenisson paper, I don't see how it supports that conclusion. I read it and it seems to show a proof-of-concept kind of study: that TDM can group a set of papers into themes in the same way that an expert can. This is really cool and everything, but I don't think that that substantiates your point that TDM is useful for a large variety of applications. Rather, I'd like to see a couple of specific examples, which you then describe more fully in the next paragraph. One good one is Swanson 1987 (I think - taken from here: http://people.ischool.berkeley.edu/~hearst/papers/acl99/acl99-tdm.html ), who used TDM to make the link between migraines and magnesium deficiency.
"...simply because one can no longer keep up with the published literature".
Small point, but I think it's worth stressing that this is due to the amount of literature that there is.
The economic case for OA
the pay-to-publish part (p10, col2)
I feel this glosses over problems with pay-to-publish. You come back to predatory OA later, but this isn't quite the same: I think it could use a couple of extra sentences describing what the conflict of interest for researchers is, and also stress that pay-to-publish makes it potentially in a journal's interest to accept more papers than they necessarily should. One of the most common anti-OA arguments I see in non-scientific media is that OA is pay-to-publish, which is often misrepresented as "pay-to-publish is publication bribery". I think this section needs a little more substance to it to acknowledge/address this.
"making publication costs dependent on the value added..." (page 11, col1)
When talking about the value added by journals, this paragraph ignores the elephant in the room: journal prestige. Again, I know that this isn't the purpose of this article, but I think it could really be strengthened by mentioning it before moving on.
"Much primary research actually takes place outside of academia inside research and development departments" (page 11, col2)
The part following this sentence is muddy. First, you talk about R&D outside academia (i.e. presumably private research), and then you talk about access to research results because they're publicly financed public goods. So, what does that mean, that R&D from private businesses who've invested their own capital in it should be made available to all? (maybe I agree with that, in some cases, but a lot of people sure won't)
I think this paragraph could be honed a bit; otherwise, it's straying into the ideological territory of saying that all private research should be made public for the public good. That transcends OA in scholarly publishing, and makes OA in scholarly publishing too easy to dismiss.
The cancer research paragraph (page 12, col1) is also unclear. It took me a while to figure out it's talking about UK expenditure - my first assumption of "total expenditure" meant worldwide. It's also not totally clear what the point is - the geographical origin of research is unrelated to its open status. I think that it's quite a leap to write (apologies for paraphrasing) "83% of UK economic benefit from cancer research comes from research outside UK, therefore open access is good", because I think it conflates two different things.
Also, small point, "17% of the annual net-monetary was estimated" is missing the word benefit after net-monetary.
The societal case for OA
Small point: as somebody who wears a linguistics hat quite often, it rankles to read on page 11 "Examples of [non-academic] groups who might benefit include... those who work in linguistics and translation". Translation, for sure, but linguistics is an academic field - you even mention the Lingua to Glossa movement organised by academic linguists later in the manuscript! To me, this is like writing "...those who work in biology and vets", lumping the academic field and a practical use of that field together. Just referring to translation is fine.
Citizen engagement (page 13, col 1)
I agree that these are great examples of citizen engagement with science, but at the risk of sounding like an Elsevier representative, interest in projects like Galaxy Zoo does not entail desire to download and read papers. In fact, you could even make the (spurious) argument that those projects come into existence precisely because citizens aren't interested in downloading and reading papers. I don't actually agree with that, I agree with your general point... but I think that citizen science project interest and citizen science paper interest. Obviously I think it is in the public interest to have science journals OA, but this isn't the right argument (and I think the sentence "Such statements conflate a lack of desire or need for access with the denial of opportunity to access research" is perfect). I think a stronger argument would be to look at existing OA journals, such as PLOS and Frontiers, and see how many views and downloads come from people who aren't academics. If you can point to, say, some of the most viewed/downloaded PLOS papers and say "look, 30% (or whatever, that's a random number) of these readers aren't academics, they're real people who are interested in it", that would make for a stronger argument.
Quibble about the "yes, we were warned about Ebola" example: the finding from that paper (that Liberians have Ebola antibodies in their blood, suggesting the endemic presence of Ebola) is actually written on the first page preview of the paper ( http://www.sciencedirect.com/science/article/pii/S0769261782800282/part/first-page-pdf , accessed from my laptop outside my institution). It could be argued that anybody could see this finding anywhere in the world, meaning that it's not a problem of OA, it's a problem about searching and indexing. A good counterargument to that is obviously that this paper would have been unsearchable with TDM at the start of the outbreak when people were combing through all West African Ebola literature.
" 'green' model of OA adoption" (page 13, col2)
You generally refer to Green and Gold routes, with the colours capitalised. Just a small terminology thing to keep consistent.
A much more important thing is also on page 13, col2:
"The pay-to-publish system is a potentially greater burden for authors in developed countries, considering that they are not used to paying publication costs, and funding systems for OA are not as well-established as those in the Western world."
--> developing countries, not developed countries!
Predatory publishers (page 14, col1)
I agree with Ross Mounce's comment on the paper: you give Beall too much importance. I think it can be a useful list and should be mentioned, but definitely include some caveats like the ones Ross writes, or the fact that he added Frontiers to the list because of a couple of editorial mistakes.
Peter Suber (page 16, col1)
You describe him as "a leading voice in the OA movement", but I think you should write what his positions are (see http://cyber.law.harvard.edu/~psuber/wiki/Peter_Suber ) in order to justify his importance.
Other general things
There are no proposed solutions in this paper, which is totally fine, because it's beyond the scope of the paper. I feel it could benefit by putting in a couple of sentences here and there about who is needed for driving this change: academics, funders, governments, etc.
I was disappointed not to see anything about the Dutch government and university library organisations' collective drive towards OA. They've changed the national law on copyright, they've reached agreements with most major publishing groups, they may well introduce mandatory OA publishing in the Netherlands in 2016, and they've made it one of the main priorities of their EU presidency this year. It's like the best example of how a whole country can take the lead and sort it out. I think including a quick reference to the Netherlands as an example of excellent OA policy (in the same way that you mention sciELO in Latin America) would go a long way towards convincing the people who are reading this thinking, "ah, yes, I guess OA makes sense in the developing world, but we're doing fine here in the West and it would be too difficult to change things". A good summary of that is here: http://openaccess.nl/en/in-the-netherlands/current-situation
Final remarks
That's the end of my 2800-odd word review. I really enjoyed reading this paper, going through it, and trying to find ways to improve it. Thanks to the authors for writing an excellent paper.
Help | Advanced Search
Title: can pre-trained language models generate titles for research papers.
Abstract: The title of a research paper communicates in a succinct style the main theme and, sometimes, the findings of the paper. Coming up with the right title is often an arduous task, and therefore, it would be beneficial to authors if title generation can be automated. In this paper, we fine-tune pre-trained and large language models to generate titles of papers from their abstracts. We also use ChatGPT in a zero-shot setting to generate paper titles. The performance of the models is measured with ROUGE, METEOR, MoverScore, BERTScore and SciBERTScore metrics.
Subjects: | Computation and Language (cs.CL); Artificial Intelligence (cs.AI) |
Cite as: | [cs.CL] |
(or [cs.CL] for this version) | |
Focus to learn more arXiv-issued DOI via DataCite |
Access paper:.
Code, data and media associated with this article, recommenders and search tools.
arXivLabs is a framework that allows collaborators to develop and share new arXiv features directly on our website.
Both individuals and organizations that work with arXivLabs have embraced and accepted our values of openness, community, excellence, and user data privacy. arXiv is committed to these values and only works with partners that adhere to them.
Have an idea for a project that will add value for arXiv's community? Learn more about arXivLabs .
IMAGES
COMMENTS
Research Policy Adviser. Aggregation plays an increasingly essential role in maximising the long-term benefits of open access, helping to turn the promise of a 'research commons' into a reality. The aggregation services that CORE provides therefore make a very valuable contribution to the evolving open access environment in the UK.
DOAJ is a unique and extensive index of diverse open access journals from around the world, driven by a growing community, committed to ensuring quality content is freely available online for everyone.
Free access to millions of research papers for everyone. OA.mg is a search engine for academic papers. Whether you are looking for a specific paper, or for research from a field, or all of an author's works - OA.mg is the place to find it. Universities and researchers funded by the public publish their research in papers, but where do we ...
The SpringerOpen portfolio has grown tremendously since its launch in 2010, so that we now offer researchers from all areas of science, technology, medicine, the humanities and social sciences a place to publish open access in journals. Publishing with SpringerOpen makes your work freely available online for everyone, immediately upon publication, and our high-level peer-review and production ...
ScienceOpen offers content hosting, context building and marketing services for publishers. See our tailored offerings For academic publishers to promote journals and interdisciplinary collections For open access journals to host journal content in an interactive environment For university library publishing to develop new open access paradigms for their scholars For scholarly societies to ...
DOAJ's mission is to increase the visibility, accessibility, reputation, usage and impact of quality, peer-reviewed, open access scholarly research journals globally, regardless of discipline, geography or language.
What is open access and open research? Open access (OA) refers to the free, immediate, online availability of research outputs such as journal articles or books, combined with the rights to use these outputs fully in the digital environment. OA content is open to all, with no access fees. Open research goes beyond the boundaries of publications ...
We have published over 124,000 open access articles via Gold open access across disciplines -from the life sciences to the humanities, representing 33% of all Springer Nature articles in 2020. Authors can also publish their article under an open access licence in more than 2,200 of our hybrid journals. Our portfolio focuses on robust and insightful research, supporting the development of new ...
PLOS is a nonprofit, Open Access publisher empowering researchers to accelerate progress in science and medicine by leading a transformation in research communication.
MDPI is a publisher of peer-reviewed, open access journals in various fields of science and technology since 1996.
The MIT Open Access Articles collection consists of scholarly articles written by MIT-affiliated authors that are made available through DSpace@MIT under the MIT Faculty Open Access Policy, or under related publisher agreements. Articles in this collection generally reflect changes made during peer-review. Version details are supplied for each paper in the collection: Original manuscript ...
SpringerOpen Journals offers a wide range of open access publications in various fields of study.
Nature and the Nature research journals - now with immediate gold open access options for all primary research. We are fully committed to open research and the benefits this brings for ...
The Open Research Library (ORL) is planned to include all Open Access book content worldwide on one platform for user-friendly discovery, offering a seamless experience navigating more than 20,000 Open Access books.
Hindawi journals have joined Wiley's open access journal portfolio Journal content is available to openly view, download, and share on Wiley Online Library. With a 200 year tradition of publishing excellence, Wiley is committed to expanding routes to open access publishing and ensuring the maximum reach and impact of high-quality, trusted research for the benefit of humankind.
Open Access at AAAS. AAAS and the Science family of journals believe in empowering authors with choice. We support Open Access (OA) options that are informed by the scientific community, contribute to the accurate record of published scientific content, and protect the overall integrity of that content. In 2015, AAAS began offering Gold Open ...
In contrast, open access ensures that the outputs of the research process can be read and built upon by everyone. Open access to publications is a component of Open Science, which encompasses a variety of efforts focused on making scientific research more transparent and accessible.
CORE harvests research papers from such as institutional and subject repositories, and open access and hybrid journals. CORE currently contains 207,255,818 open access articles collected from 10,286 data providers around the world."
Open access is vital to a collaborative, inclusive and transparent world of research where quality knowledge can be shared and built upon. Every day, we work to bring more insight into closer reach for the research community and the public. We offer a wide choice and flexibility for every researcher and institution around the world that wants ...
Abstract. This paper introduces CORE, a widely used scholarly service, which provides access to the world's largest collection of open access research publications, acquired from a global ...
Cell Press open access and hybrid research journals support open access publication for groups of authors from Research4Life (R4L) countries. For papers where all of the authors are from a Group A and/or Group B R4L country we will grant a waiver or discount of the standard publishing fee, as appropriate.
Making scholarly research outputs openly available is easy, legal, and has demonstrable benefits to authors, making it a good beginning step for a researcher just beginning to explore the open world. There is a set of knowledge required to navigate the Open Access landscape, involving copyright, article status, repositories, and economics.
In spite of this, there is a general lack of consensus regarding the potential pros and cons of Open Access at multiple levels. This review aims to be a resource for current knowledge on the impacts of Open Access by synthesizing important research in three major areas: academic, economic and societal.
The title of a research paper communicates in a succinct style the main theme and, sometimes, the findings of the paper. Coming up with the right title is often an arduous task, and therefore, it would be beneficial to authors if title generation can be automated. In this paper, we fine-tune pre-trained and large language models to generate titles of papers from their abstracts. We also use ...
ABSTRACT. The Flint Water Crisis exemplifies one of the most significant environmental disasters in the recent U.S. history. While hazard exposure and effects pertaining to the crisis have been addressed, and technical experts have pronounced Flint's water safe for consumption, media reports and stories from community members suggest that significant distrust remains among residents ...
Let's break this down step by step based on the example: 1. Example given: • Input: oyfjdnisdr rtqwainr acxz mynzbhhx • Output: Think step by step By examining the words: • The pattern involves selecting specific letters or transforming them. 2. Now, let's decode the new phrase: • Input: oyekaijzdf aaptcg suaokybhai ouow aqht mynznvaatzacdfoulxxz