Reviewing the Scourge of Self-Plagiarism

How to Cite

Green, L. (2005). Reviewing the Scourge of Self-Plagiarism. M/C Journal, 8(5).
Vol. 8 No. 5 (2005): 'review'
Published 2005-10-01

The task of the unpaid reviewer in academic publishing has always been a taxing one. Although the notion is one of blind peer review, the selection of reviewers is far from random. Journals try to balance a prospective reviewer’s expertise with their availability, and with their track record of returning a useful review on time. Ideally, the reviewer should have a specific (reasonably expert) knowledge of the paper’s topic, but should also retain enough in common with the interested, but jargon-averse, academic reader to empathise with non-specialist journal subscribers. Reviewers should be able to judge the quality of the argument, of the writing, and of the contribution of the article to the field. It’s a tough ask, and now there is a further layer of concern: will the reviewer – having satisfied all the foregoing – be able to spot ‘self-plagiarism’?

Self-plagiarism is a relatively new evil – at least, few people in the author’s circle appear aware of it. Googling the term results in some 8,000 hits (whereas plagiarism scores 3,150,000). At first blush, the usual interpretation of plagiarism – the pinching of someone else’s intellectual property without acknowledgement – seems to rule out the possibility of self-plagiarism. Surely, if the ideas and words are your own, a little judicious re-purposing is all grist to the mill? Indeed, most of the anti-plagiarism sites, for example What is Plagiarism? (Georgetown University), don’t include the term at all. Instead, the Georgetown site offers examples of five types of plagiarism, most of them familiar to seasoned markers of student work, which are sufficiently rigorous to include “the ‘apt phrase’”, defined as the lifting of a part sentence. Their comment on the example they give for ‘apt phrase’ plagiarism involves four words in an original paragraph: “This passage is almost entirely original, but the phrase ‘dissolved into a conglomeration’ is taken directly from Arendt [the example used for illustration]. Even though this is a short phrase, it must be footnoted. Only phrases that have truly become part of general usage can be used without citation.” Georgetown University, then, sees plagiarism predominantly as a matter of correct attribution of authorship.

Most journals have a requirement that no work offered to them for review should previously have been published, and that concurrent submissions to multiple journals are not permitted. The issue here, it seems, is that a journal’s reputation is built upon the originality and usefulness of its contents. Journal editors like to feel that they are ‘advancing the field’ with each edition and they are building a readership that can count upon learning something new (or, at least, provocative) for each hour invested in consuming their journal. Where papers have appeared in other forms (based, for example, on a presentation recorded in conference proceedings) this may be acceptable to the journal, provided it is acknowledged, and experienced editors will often check that papers developed from conference presentations have not previously been posted on the web.

If two journals in ignorance each accept and publish the same paper at the same time then that reflects very poorly on the academic who failed to deal honestly with the editors. The issue here is one of resources – the printed page, in particular, is expensive – and of the reviewers’ time. Given the unpaid and voluntary labour of reviewers, and the amount of time and energy that goes into deciding which papers to accept or reject, journals think very poorly of someone who ‘withdraws’ a paper after acceptance on the grounds that s/he has got a better offer/earlier publication elsewhere. Most journals would not welcome future papers from that author. If self-plagiarism were a simple matter of passing off published paper A as published paper B (say, by changing the title and offering it elsewhere), then it would be seen to be deceitful and perpetrators would receive little respect from their peers. But these extreme cases are not (generally) the kind of self-plagiarism against which authors are warned.

So what is the authorship problem widely referred to as ‘self-plagiarism’? The SPlaT website (SPlaT) is happy to explain:

Self-plagiarism occurs when an author reuses portions of their previous writings in subsequent research papers. Occasionally, the derived paper is simply a re-titled and reformatted version of the original one, but more frequently it is assembled from bits and pieces of previous work. … It is our belief that self-plagiarism is detrimental to scientific progress and bad for our academic community. Flooding conferences and journals with near-identical papers makes searching for information relevant to a particular topic harder than it has to be. It also rewards those authors who are able to break down their results into overlapping least-publishable-units over those who publish each result only once. Finally, whenever a self-plagiarised paper is allowed to be published, another, more deserving paper, is not.

Among the more chilling examples of self-plagiarism identified by the developers of SPlaT is “cryptomnesia (reusing one’s own previously published text while unaware of its existence)” (SPlaT). The avoidance of cryptomnesia is one reason why authors are encouraged to use the SPlaT tool. Academic and journal reviewers are also regarded as potential users, and the software is designed to work in three modes – ‘reviewer’s workbench’, ‘author’ and ‘web spider’. It is indeed a cryptomnesiac’s concern that the ‘apt phrase’ that came so creatively to the author in an earlier paper might appear again, unwittingly, in the guise of an original composition. However, the injunction to use SPlaT as a ‘reviewer’s workbench’ (where “SPlaT compares a paper under review to a record of the author’s previously published articles extracted from their web site and online article repositories” [SPlaT]) begs the question as to how a review may remain blind – in the sense of not identifying the author of the work to be reviewed – if the ‘workbench’ and/or ‘web spider’ modes of SPlaT are pressed into service.

Might it be the case, notwithstanding the foregoing, that the problem of self-plagiarism is as authentic as ‘social anxiety disorder’ (SAD), incidences of which multiplied dramatically once a drug, Paxil, had been shown effective in treating it? In a Washington Post article (Vedantam), the journalist-author comments: “according to a marketing newsletter, media accounts of social anxiety rose from just 50 stories in 1997 and 1998 to more than 1 billion references in 1999 alone” and goes on to say, “The education and advertising campaigns have raised concerns that pharmaceutical companies, traditionally in the business of finding new drugs for existing disorders, are increasingly in the business of seeking new disorders for existing drugs”. Prior to the publicity about SAD, Paxil was an anti-depressant with sales languishing way behind Prozac and Zoloft. The identification (and treatment) of social anxiety disorder did wonders for its marketing. Could it be that self-plagiarism has only come into existence as a major concern for academia now that there is a tool for its detection?

Social anxiety disorder may be an authentic scourge – as may self-plagiarism – and the fact that it has been publicised in concert with its cure (or detection) does not mean that the remedy serves no useful purpose. On the contrary, once a population of professionals is attuned to a new way of viewing symptoms and practices then valuable advances may result. However, such advances are only possible when the community concerned has had a chance to consider the matter and discuss the ramifications. At present, we run the risk of allowing the designers of anti-self-plagiarism software to be the judges and the jury of this new way to commit academic crime.

One way to avoid charges of self-plagiarism is self-citation. Leaving aside cryptomnesia, it is perfectly possible to cite the already-published reference when an author is aware of reusing a previously-published phrase or idea. Unfortunately, this remedy is also frowned upon in many academic circles. The practice undermines the principle of blind peer review – since the identity of the author soon becomes clear in such repeated instances – while readers may become irritated, suspecting that self-citation is a clumsy ruse to improve the citation index ratings of the originally-published article.

The issue is of concern to more than journal editors: it also relates to text- and reference-book editors and publishers. One ‘for instance’ was discussed a year ago by the World Association of Medical Editors (WAME) who conducted a hypothetical on “self-plagiarism of textbook chapters” and threw the discussion open to the members’ list. The initial self-plagiarism case-study situation was complicated by the supposition that Author A (of Book A) had self-plagiarised a previously-published chapter which had been jointly authored by Author A and Author B (Book B). Notwithstanding this complication, the WAME Ethics Committee addressed themselves to four questions:

Is [Does] reuse of a person’s writings in another textbook, but authored by the same person, meet the definition of plagiarism? If so, what degree of identical components needs to be present for this definition to be met?

Is it appropriate for authors to write for different textbooks in the same field? If so, can they write on the same topic? If not, what are the potential infringements on the author’s rights to pursue their career/income?

Should the editors of these textbooks agree to exclude authors that write for one another’s textbooks? Or is that unfair restraint of trade? For example, if all four textbooks were to agree to limit or completely avoid any overlap among authors, it could effectively deny entry of another textbook into that market.

For book A, the author had a co-author. Since this shared work was used for book B, what is the author’s responsibility to the original co-author? (WAME)

These are good questions and they are the kinds of questions we should be asking ourselves about self-plagiarism in our own ‘media and culture’ academic circles. In particular, in the case of textbooks, it is precisely because an author has a standing in the field, and has published on equivalent matters, that editors seek them out and ask them to contribute chapters. Whilst all reputable writers would expect to originate a new chapter according to the specific brief given, it is possible (some might even say likely) that there is an overlap in approach and phraseology. In the case of Books A and B, the overlap stretched the bounds of coincidence in that: “One table is essentially identical, although other tables in the two chapters are different. In addition, there are some passages that contain identical phrases. Most of these appear to have been reworded, but many identifiable words and phrases are identical between the two chapters. There are also areas where the text is completely different” (WAME). However, this hypothetical case is clearly not a situation where the same authorial product was disguised with a new title.

Although the whole debate is worth reading, the general consensus of the Ethics Committee was along the lines of (specifically citing one response):

I do not see a problem with the author reusing his own material to write a chapter in another textbook (readers of textbooks as opposed to research articles are not expecting originality). The problem is that he should have done this with the concurrence of the two editors and if he signed over his copyright the permission of publisher of textbook A. He should of course also have consulted with his co-author. I think the editors should inform the publishers and his employer of the facts and let them decide what course of action to take. (WAME)

The references to re-using the material transparently, and the editors of the textbooks informing the author’s employer, are a constant refrain from a number of contributors to the discussion.

Some WAME list discussants offer defences to the charge of self-plagiarism: “the main problem here is not whether the same, or very similar, information can or should be published in more than one place” commented Frank Davidoff, “that sort of thing is done all the time, and can serve important functions. After all, different people read different textbooks, and if it’s important for the information to get out there, why shouldn’t it be made as widely available as possible?” Andrew Herxheimer thought the readers’ perspective had not been given sufficient consideration: “If I were keenly interested in the contents of the chapter in textbook B, I might well wish to know how they had developed, and to look at earlier versions of the material, and to understand why the contents and emphases etc had changed in the way they had.” “The choice of an author for a review monograph or textbook chapter is based always on perusal of the existing reviews and chapters, hoping that the new publication can contain something just as good” argued Rick Nelson, going on to say, “that obligates the author to produce something as similar to his previous publication as possible, and yet different – an impossible task even if such writing were a priority endeavor, which it never is.” (WAME).

Irving Hexham, of the Department of Religious Studies, University of Calgary, appears to have been substantially ahead of the game in discussing self-plagiarism in the 1990s. His consideration of the issue is generally more sympathetic than SPlaT’s, or WAME’s. For example, “Self-plagiarism must be distinguished from the recycling of one’s work that to a greater or lesser extent everyone does legitimately”, and:

Academics are expected to republish revised versions of their Ph.D. thesis. They also often develop different aspects of an argument in several papers that require the repetition of certain key passages. This is not self-plagiarism if the complete work develops new insights. It is self-plagiarism if the argument, examples, evidence, and conclusion remain the same in two works that only differ in their appearance. (Hexham)

It appears that Hexham and SPlaT have very different ideas of what constitutes self-plagiarism. Their different perspectives may be influenced by disciplinary perspectives and wider contexts – journal article or textbook chapter, a cannibalised conference paper or thesis – and by whether or not they have authored software to catch the offending behaviour. However, at least one Australian academic (not in M/C – Media and Culture) has been asked by their university to justify their publications against a charge of self-plagiarism, which is how the topic has become visible and why the need for debate has become urgent.

Incidentally, the opening sentence of the opening paragraph to the Introduction of the paper on “Splat: A System for Self-Plagiarism Detection” is almost identical to the Abstract for a paper published two years later as “Self-Plagiarism in Computer Science”, viz:

“We are all too aware of the ravages of scientific misconduct in the academic community. Students submit assignments inherited from the [sic] their friends who took the course the year before, on-line paper-mills allow students to browse for term papers on popular topics, and occasionally researchers are found out when falsifying data or publishing the work of others as their own.” (Collberg et al.)

“We are all too aware of the ravages of misconduct in the academic community. Students submit assignments inherited from their friends, online papermills provide term papers on popular topics, and occasionally researchers are found falsifying data or publishing the work of others as their own.” (Collberg & Kobourov)

Further, in these two papers there is a difference in authorship line-up, as with the WAME example…

So what of the reviewers in all this? The Journal of Optical Networking, published by the Optical Society of America, comments that “self-plagiarism causes duplicate papers in the scientific literature, violates copyright agreements, and unduly burdens reviewers, editors, and the scientific publishing enterprise.” (JON). In an environment of blind peer review, where the reviewer does not know the author’s identity and is not in a position to check the body of their published work, the acid test becomes whether (in the reviewer’s opinion) the article advances the debate by offering something new. The submission should also repay the time and effort expended in reading and considering the contents. Other than that, issues of in/valid repurposing, repackaging, recycling and redeveloping arguments and findings require debate and determination at a discipline-wide level, rather than at the coalface of reviewers’ practice.

Author Biography

Lelia Green