The year 2012 has been an auspicious one for open access (OA) publishing. First, the Working Group on Expanding Access to Published Research Findings reported to the Minister of Universities and Science (the ‘Finch Report’).1 This was quickly followed by the new policy of Research Councils UK (RCUK), adopting the Finch Group recommendations for the publication of publicly funded research work.2 Next, the Higher Education Funding Council for England (HEFCE) announced the intention to require that research submitted to future research evaluation exercises – after the 2014 Research Excellence Framework (REF) – be open access.3
There is now a broad consensus, that includes for-profit publishers, that open access is the way of the future. The debate – and disagreement – is about the policies that are best suited to steering this transition. In this paper, I map out my perspective on this issue, both as the head of an institution with particular interests in the future of scholarly publication and also as a member of the Finch Group, with a shared responsibility for our recommendations.
Green and its limits
Like most universities in Britain, the University of Salford has an open access repository where anyone can go, online, to find publications and reports by our researchers.4 Since we launched our repository in 2009, we have built up a digital collection of 7,375 items, of which 4,105 have a full-text document attached, and of which 2,616 are publicly accessible.
“There is good evidence … that when research results are made freely available in online repositories, rates of citation rise.”
Over the last three years, downloads of papers in our collection have increased tenfold, from 29,135 in 2009 to 297,635 in 2011–2012. In this, we are typical of universities in the UK in general, as well as elsewhere.5 There is good evidence, from both the UK and elsewhere, that when research results are made freely available in online repositories, rates of citation rise.6,7 And increased rates of citation are a good indication that new knowledge is spreading. This is the energy that drives the research and innovation system.
The University of Salford Institutional Repository is ‘green’. This means that, for the most part, it contains either summary data about publications (rather than full text) or final drafts of publications which their authors have posted before submitting the text to a journal for publication.8 Submitting the final draft avoids violating copyright, which an author usually signs away to a publisher at the last stage of the publication cycle. Green repositories also contain unpublished, or self-published, reports and information; the so-called ‘grey’ literature which is increasingly important, given new and easily available forms of digital distribution.
Green repositories are of significance in making research results widely available, and will continue to be so. But their main limitations are that they do not – and cannot – contain everything. In particular, they cannot contain the ‘version of record’ when this is protected by a copyright that has been ceded to a publisher, and the publisher requires payment to view the paper, either via a journal subscription paid by an individual or an institution, or by means of a ‘pay-to-view’ charge via a commercial website. While access to the green version of a research paper is very useful in scanning a field for new work, only the version of record has research results corrected after review, final forms of diagrams, tables and photographs, and the final pagination for the purposes of citation. A researcher finding a new and exciting paper in a green repository will often need to get access to the published version as well, either via an online subscription to the journal licensed after payment by a library, or through a one-off payment via the publisher's web page.
A further limitation of green open access – and of growing significance – is the barriers it poses for text and data mining. Text and data mining have made remarkable advances in recent years, and now digital robots are able to trawl vast amounts of widely distributed information, finding and sorting that which is relevant to a search and stripping numbers from tables, and shapes and patterns from images.9 Increasingly vital research areas depend on digital robotics to collect information vital for drug design, or disease control, economic trends or climate change. To be effective – and accurate – scientific text- and data-mining systems need to trawl versions of record of research publications, and this will not work when thousands of pay-walls get in the way, or when restrictive formats, such as PDF, are used as attempts to restrict re-use of published papers.
These limitations are increasingly making the combination of green repositories and subscription publishing an unsatisfactory compromise. And this compromise may also be keeping the costs of subscriptions (or licences) to academic journals artificially high. Because publishers price whole journals (and bundles of journals, known as the ‘big deal’) as packages, and negotiate with large consortia of libraries, there is little direct relationship between researchers, who choose where to submit their articles, and their publishers. If a quality retailer, perhaps selling sports equipment, priced their products too high, customers would go elsewhere and the reputation of the shop would fall. It would have to innovate, lower its prices or go out of business. But until recently, the ever-rising price of academic journals has been unclear to many academics because subscription costs are paid through libraries, and electronic journals appear to be free at the point of use as long as the researcher has a recognized and authorized electronic identity. While many publishers' prices have been reasonable, others have not. Elsevier's reported profit margins of close to 40% have, rather dramatically, alerted the broader academic community to what university librarians have been saying for years – that the current system of distribution and pricing is unsustainable.10
“… limitations are increasingly making the combination of green repositories and subscription publishing an unsatisfactory compromise.”
Gold and its ambiguities
An alternative – already well established – is ‘gold’ open access, in which publication costs are paid before publication or by other means, allowing the publisher to permit wider distribution without damaging loss of revenue. But the term can be misleading, because it has been used to embrace rather different approaches. In some cases, gold means up-front payment for limited distribution rights; a paper may be distributed but not reused in any way, including text or data mining, without further charges. This form of gold does not help much with current challenges and may just be a way of adding additional publication charges, in parallel with licensing and pay-wall costs. Gold can also mean the up-front payment of all costs as well as a reasonable margin. This should allow unrestricted distribution and re-use rights, signalled by the Creative Commons' CC-BY licence. Rather than calling this approach ‘gold’, it is probably better, if more clumsy, to refer to this as full article processing charges (APCs).11
A central point of contention is whether or not publication with a CC-BY licence (and therefore with no restriction on distribution or re-use) will result in libraries cancelling journal subscriptions. It may seem intuitive that this would be the case; if all the major research papers are freely available, why subscribe? Some publishers have argued vehemently that, unless APCs are high, their industry will die. Some have advocated national licences as an alternative. These would make all research publications available while protecting publishers' revenue streams through mandatory subscriptions. Some publishers want a regulated system, in which open access to publications for which there have not been full APCs is prohibited for considerable periods of time. This is often conceptualized as a prohibition of open access for the ‘half-life’ of the article – the time over which half of all eventual readership will happen. In some cases, this half-life could be five years or more.
“A central point of contention is whether or not publication with a CC-BY licence … will result in libraries cancelling journal subscriptions.”
In reality, there is little independent and systematic evidence that academic journal subscriptions are cancelled if parts of the journal content are made open access, and some suggest that the opposite could be the case. This could be because academic journals often contain valuable editorial content that is additional to research papers, because researchers often need to browse through whole journals to keep up with specific fields, and because libraries need to ensure that they have full and coherent runs of journal titles online, as they did traditionally with paper copies. Innovations by a range of publishers are showing how new business models can move with, and anticipate, the changing world that is enabled by new digital technologies.
Important here will be the maximum embargoes on research results that are permitted by funding bodies such as the Wellcome Trust and the Research Councils on research results based on grants given by them. In particular, public interest in publicly funded research must be that these restrictions are short-lived and perhaps no longer than six months. It is particularly significant that Research Councils UK has now followed the lead of the Wellcome Trust in requiring open access publishing for the research that the Councils fund, and in providing appropriate mechanisms for funding APCs. These moves provide the conditions for the ‘mixed economy’ envisaged in the Finch Report, in which the transition towards full APCs and CC-BY licences continues and in which academic journal prices are not kept artificially high because subscriptions double up as toll-gates, restricting access to essential research results that have often been funded with public money.
The Finch Report and its critics
These are the issues that we have addressed in our report, Accessibility, sustainability, excellence: how to expand access to research publications.12 In essence, our proposals seek to map out the transition, that could take a decade or more, from the current uneasy combination of green open access, various forms of gold and online subscriptions and pay-to-view, and towards a system in which all APCs are met up-front, with CC-BY licences, across all major science and knowledge systems. Given the inherent nature of digital production and distribution, this is as inevitable for academic publishing as online music distribution and digital video on demand is for the entertainment industry. What are needed are careful and sensible policies that steer this transition, to the best interests of researchers and those who depend on the new knowledge that they make and need to share.
“Innovations by a range of publishers are showing how new business models can move with, and anticipate, the changing world …”
Some responses to the Finch Group Report have criticized us because, somehow, we did not propose that publishers' prices should be regulated.13 But it is difficult to see how this could have been done, since there is no evident national mechanism for price control, let alone for the sort of international agreements that would be required, since many leading academic publishers are not based in Britain.
Even if price controls were possible, they could be counter-productive. It is a reasonable expectation that moving to full APCs and CC-BY licences – particularly as this is now required by funding councils for research that is the outcome of public funds – could promote innovation and new business models that will drive publishing charges down. There is some evidence that this is already happening. For example, the co-founder of the Public Library of Science PLOS ONE OA journal, now the biggest journal in the world, has set up Peer-J, where for a lifetime membership of US$99, academics can use a journal infrastructure wherein they both contribute and benefit from peer review activity.14
“Even if price controls were possible, they could be counter-productive.”
The report has also been criticized because of its estimates of the costs of the transition, in an environment in which research funding is extremely tight. But it is important to go to the detail here. Although the headline figure in the Finch Group Report is £50-£60m per annum, this is a bundle of associated costs. The estimate for the move to full APCs is £38m1
An important piece of research by Alma Swan and John Houghton, commissioned by the Open Access Implementation Group and now published, adds to the Finch Group analysis by modelling the implications of the transition from green to gold for four different kinds of universities. The key point is that, once this inevitable transition is completed, all universities will have lower overall costs.15
David Price, University College London (UCL), has argued that we have got it only partially right, and that the best course is to stay with the present combination of green and subscription publishing, augmented by licence extensions to allow, for example, biomedical researchers in the NHS to get the same ‘free at the point of use’ effect as their colleagues who have the good fortune to be affiliated with universities.16 In fact, our report recommends licence extensions such as these (these are part of the composite headline figure of estimated costs during the transition). But it is difficult to see how this compromise can benefit anyone, because versions of record will remain protected by licence fees and pay-walls, important advances such as text and data mining will be inhibited, and some publishers' charges will probably remain artificially high.
Peter Suber, writing in the BMJ, has criticized the Finch Report, and the RCUK policy, for failing to take appropriate advantage of the opportunities that green repositories offer. While he agrees that gold (or, more accurately, full, up-front APCs and CCY-BY licences) is the objective, he stresses that funding agencies and universities should in the interim push for deposits in institutional and subject-area repositories, accepting that these may not be copies of record and that embargoes may be unavoidable. Suber argues that this flexible approach is a “fast and inexpensive first step to assuring free online access to research. That is a major advantage over the high access prices now shackling research, and that is the point. If we want to shorten embargoes and increase reuse rights, and we do, then we can take further steps, either by strengthening our green policies or paying for gold. What matters first is to use the tools we have to drive open access for the benefit of researchers and taxpayers … To fund the transition to gold without first harnessing the power of green incurs premature expense, leaves the transition incomplete, and puts the interests of publishers ahead of the interests of research. That may be justified for publishers, but not for the non-profit funding agencies and universities devoted to advancing research”.17
But this is exactly the approach that the Finch Report argued for: “at least for the short to medium term, the Government, the Research Councils and the Higher Education Funding Councils should seek increases in access through all three mechanisms – open access journals, extensions to licensing, and repositories … We see repositories fulfilling a subsidiary, but important role, for the short to medium term alongside open access journals and extensions to licensing. But it is important that they do so in a sustainable way, in the interests of the research communications system as a whole. That will require further investment in developing the UK-wide infrastructure of repositories”.18
“… the larger picture is clear: we are headed for an open access world …”
These debates will continue for a while, and appropriately so, since this is a complex area with many uncertainties. But the larger picture is clear: we are headed for an open access world that will replace traditional subscription publishing with systems of distributing new knowledge that are far more appropriate to the immense opportunities of new digital technologies.