Fun fact: he's using this to prove he didn't do anything wrong, as in "see? the people on my thesis committee didn't care I copied their own work, why should anyone else?"
The truth is, people on "thesis committee" don't read thesis. Some do. The director usually does, if he has the time. But many don't; they glance at the intro and conclusion and call it a day.
> He wrote the thesis at a time when it was impossible to identify lightly rephrased statements across a wide body of works. Now we can dump all of these documents into an LLM and have similar sentences surfaced for human review very quickly
He also uses this to say it's unfair to punish him now with tools that didn't exist when he did the crime, which I find quite rich. If you murdered someone before DNA testing was available, that doesn't exonerate you in any way.
I don't know about this case, but a lot of these kinds of cases truly are witch-hunts. It's not at all like the reproducibility crisis and faked data and images.
This is just how scholarship works. It's not needed in the kind of day to day most of us do, but when you're writing a thesis for a PhD, this stuff matters. You're making the argument that you're expanding the totality of human knowledge with your dissertation, and that requires strict source citing to separate your original scholarship from the sources that influenced it.
With another professor I caught a flagrant case in a student thesis and we faced attacks from the university administration because the student had a stellar transcript (also not the positive signal some might think). Punishment was almost inexistent.
It's difficult for me to imagine what it would take to get a doctoral thesis revoked.
Personal grudges. Academia is full of them.
If some in your experience erred on the side of leniency, then it stands to reason that others might err just as egregiously in the opposite direction.
In fact, your anecdote suggests erring is the norm. We should thus expect punishments to be inappropriate in one direction or another. An appropriate punishment seems rather unlikely.
They compiled a document with the source material side-by-side https://v42.arretsurimages.net/fichiers/documents/2024-08-02...
This goes well beyond accidentally triggering a plagiarism detector.
> Feed stuff you wrote into those tools and it will call you a plagiarist every day because you wrote something similar to the person you learned it from.
The examples in the article use very distinctive wording. One or two occurrences would be forgivable as coincidence or inspiration. An entire document full of examples points to something else.
I imagine any tool that’s good at plagiarism detection would also kill it at this kind of literature research.
An example of something where it worked like this is that I had some ideas around how tribes evolve and so on and wrote them as I could think of them and ChatGPT was able to find that Darwin’s Cathedral had a far better synthesis of various much more rigorous takes on the subject.
What exactly is the point of dedicating years of your life to create something exactly nobody is going to read?
Also, very dry, so it's easy to loose focus, and you can read a rephrasing of your own thesis as a "he has the same ideas" (also, if you do that, please reference the author?)
I find a few of the example damning (hje should totally have added a citation and build his argument around it). Most less so, and i understand that a reader could not catch them.
introduction1 -> main1 -> conclussion1
introduction2 -> main2 -> conclussion2
introduction3 -> main3 -> conclussion3
the thesis is something like
long introduction -> easy example -> main1 -> main2 -> main3 -> main of preprint -> long conclussion
It's a long time that the incentive and job structure make universities a very toxic environment. Professors are basically running a 40 years race (about from bachelor or master graduation to retirement). It is still amazing that some good comes out of it.
For me, it wasn't quite so apparent at the defense, and I don't know that all members read the final thesis carefully, but most of them had already seen me publish or present most of the research previously, often multiple times. I also know that some (and not just my advisor) did read the final thesis very closely. My thesis was only partially thesis by publication, however, which may have influenced this; it does now have a fair number of citations in its own right, which is somewhat unusual for the theses in the field, and potentially seen as awkward (it means there's significant work in the thesis that I never published elsewhere).
As a caveat, the American system (before current crises) does feel like it can have a two-tier system of PhD students who are expected to remain in academia (we both were) and ones who are not, even at strong universities. Expectations, and attention given, can vary considerably. The American system also tends to have larger and more closely involved committees than, for example, the UK/Irish system.
However, for the form of plagiarism discussed here: if someone had sentences from papers I published years ago interspersed in their work, and they weren't particularly notable sentences, I'm not confident I would notice. Depending on citations and what the sentences were, I'm not even sure I'd mind much, for example, if they were essentially copying a model definition.
But the process of creating that work, engaged throughought that process with those purported to be more practiced, is usually pretty good at seeding enough expertise and confidence that you might be able to proceed more independently and with real novelty, or might at least be prepared to share the trade with others new to it.
That's the point of those years, and so it's more than a little ironic that AI is being used to undermine a practicing expert while simultaneously eroding the traditional process for becoming one by making it so easy to just generate slop and engage with hallucinations than to actually practice writing deep work or engaging with primary sources.
It is not "early work" but the end of early work. The masterpiece: the piece of work that proves a subject has mastered their craft.
If you're still producing junk you haven't earned your PhD.
You probably have plenty of novel ideas in early career, but you almost certainly lack the experience and the basic understanding of your field to develop them properly. Most people have exhausted their own ideas by mid-career. But that that point, they should have the skills and the experience to work on the ideas they come across.
(Looking back at my PhD, it's quite amusing how little did I understand. On the other hand, many of the choices I intuitively made turned out to have some value. But in some cases, understanding that properly took a decade of work by other people.)
Your PhD work is an apprenticeship, after which you are expected to work as a journeyman. The masterpiece that qualifies you for independent work as a tenured professor is often called habilitation. Many academic cultures don't have those, because the expectations are so situational that they don't want to formalize them.
In some ways, people doing research now have it way more difficult than people of the past. They have hundreds of years worth of research to study before they are on top of things and making an original contribution that stands out among the huge amount of research that already exists is really hard. If we want to keep PhD as a proof of meaningful work, then we ought to lengthen the graduate studies considerably. How about a 10 year PhD program, at the end of which you can really say you have mastered the field?
Then academia is broken and the universities that operate like this should be dismantled (not to mention the accreditation organizations)
What's actually happening is people chasing items on a CV instead of actual knowledge is rotting the core of universities.
I find it rich how fast you are to jump to destroying the entirety of academia in one stroke. It's quite easy to say things we don't understand should not exist, of course I'm guilty of this myself from time to time. Have you done education beyond the bachelor's degree? It's a very different world.
If we apply your criteria, I'm not sure if any universities would be left.
There's a lot to be said about publishing in academia being broken and how nearly all the value comes from 10% of publications, while the rest are garbage spewed out for reasons orthogonal to the advancement knowledge. However, IMHO, none of that really applies to PhD theses.
It's really the "cheat yourself" problem, except we put some value on that paper and those letters.
What crime?
The article doesn’t really expand upon what having fragments copied from others means. Even if it fits the letter of the definition, on a phd thesis that may or may not be a big deal. If he’s passing off the ideas of others as his, or faking his research by using the results of others or making them up, then that’s really bad. If he’s just using phrases / wording from others to get his original points across, it looks bad but I don’t see it as a huge deal, especially 30 years out from the phd.
A PhD is supposed to be original research, if the originality or integrity is in question that’s one thing, the rest is much more pedantic, even if technically wrong.
They link to the document that shows the plagiarized sections side by side with their sources
https://v42.arretsurimages.net/fichiers/documents/2024-08-02...
I don't read enough French (especially at PhD thesis level!) to parse everything, but even I can see phrasings copied from the source documents in a lot of the examples. Some of them weren't even paraphrasing, they were lifting the exact distinctive word choices.
(Another one, unrelated, but also wild, argues that people who attack him are in fact against science itself, that they want to go back to the Middle Ages, etc.)
It's very obvious he pieced together interesting ideas from others to pass them as his own. And it worked very well, he has radio shows and TV shows and whatnot. And he still has a lot of supporters!
That is his defence:
"Having read many books throughout his career, he may have “assimilated” them and “not always consciously” used them in his own writing, he said."
I have never published anything in academic context, but I did write a lot and I often found that phrases of the same topic I read before creep into my own writing, so I could see this happening to me without intention.
"the plagiarism report shows that he was rephrasing all of the sentences rather than copying verbatim"
On the other hand, this is in fact the standard defence when someone is accused of plagiarism. But sometimes it might actually be correct? (No idea here, my french ain't good enough either)
In general the reaction seems out of proportion to me, I much rather would like to have focus on actual fraud in science.
I sometimes read this blog for example and there seem to be other things going on with less attention.
Also, I believe citation is usually limited to prior written work. I don't think citing personal communications is mandatory, but at least for me lots of ideas come up in personal communications, random discussions, etc.. I think actually we should give more credit in this case, but it shows that attributing fraud for failing to cite may be a little too harsh. Again, I don't know if that's the case here, or if his thesis is just some pastiche or prior work without any significant or original contribution.
Just to put it side-by-side in a form that will absolutely not work on mobile but might work if you're on desktop, here's a "two-column form" of an example plagiarism from the thesis:
[ THESIS ] [SOURCE]
les équations de la physique engendrent, les équations de la physique engendrent,
en plus des solutions correspondant aux en plus des solutions correspondant aux
phénomènes, des solutions sans aucune phénomènes, des solutions sans aucune
signification physique directe. signification physique directe (certaines
solutions sont rejetées, par exemple,
parce qu'elles violent la causalité
usuelle). La correspondance entre les
mathématiques et la physique n'est donc
pas biunivoque et naturelle. Ceci laisse
donc planer un doute sur une sorte
Un doute
très sérieux plane donc sur l'idee d'un
isomorphisme entre le language mathé- d'isomorphisme entre le langage (mathé-
matique et la nature. Quant matique) et la nature. [...]
à l'explication platonicienne de la L'explication platonicienne de la
réussite des mathématiques réussite des mathématiques peut être
divisée en deux thèses. La première
pourrait être qualifiée de « platonisme
faible » (celui qui peut être trouvé dans
des dialogues tels que la République).
elle consiste à admettre que
les mathématiques Selon cette conception, les mathématiques
constituent un langage intermédiaire qui constituent un langage intermédiaire qui
permet de passer du monde sensible au monde permet de passer du sensible aux monde
des Idées, qui forme la réalité profonde des Idées qui constitue la réalité profonde
des choses. Si les mathématiques sont des choses. Les mathématiques sont
efficaces, dira un platonicien, c'est efficaces ici
parce qu'elles permettent de viser les parce qu'elles offrent unmoyen de viser les
structures profondes du monde. véritables structures du monde.Just some Google Translate of some of the plagiarized paragraphs, the thesis "The Unity of Physics" has:
> The idea that the diversity of reality is underpinned by a deeper unity is as old as thought itself. Great mythologies recount it, early philosophers affirm it, and modern science has taken up the same agenda by first unifying concepts of motion, matter, and space. Indeed, the desire for intelligibility can arguably not do without the idea of the One. However, simply attributing such a tendency to human nature does not validate its realizations. The proclaimed unity may well prove false—stemming merely from incantation, decree, or fantasy—while exerting a purely dogmatic fascination. Yet, if thought were to discover—amidst the shifting mirrors of phenomena—eternal relationships capable of encapsulating them, one could certainly speak of a joy of the mind. While not necessarily an essential framework of thought, the desire for unity corresponds to a nostalgia, a craving for the absolute, an ontological impatience. Yet, the moment it is expressed, it clashes with the irreducible dispersion of things. From this arises a rift between the desiring mind and the disappointing world. At the close of the century, the increasingly assertive power of physical theories—with their all-encompassing nature and unifying aim—prompts us to examine the foundations of the physicists' quest for unity, to define its limits, and to consider its current prospects.
This is claimed to be a mashup of paragraphs from three different sources, first, the sentence starting "However..." is said to hail from Jean-Michel Besnier's "Theories of Knowledge",
> Could the "monist" tendency inherent in the act of knowing be suggested any more clearly? Yet, simply positing such a tendency within human nature is obviously not enough to validate its realizations. Indeed, that unity may well prove illusory, stemming from sheer fantasy while exerting a purely dogmatic fascination. That is precisely why critical philosophy sets out to distinguish between the scientific and...
Followed by a bit of Camus' "Myth of Sisyphus,"
> If man were to recognize that the universe, too, can love and suffer, he would be reconciled. If thought were to discover, within the shifting mirrors of phenomena, eternal relationships capable of summarizing them—and of summarizing themselves in a single principle—one could speak of a happiness of the spirit of which the myth of the blessed would be but a ludicrous counterfeit. This longing for unity, this craving for the absolute, illustrates the essential movement of the human drama. Yet the fact that this longing exists does not imply that it must be immediately appeased.
The last sentence of Sisyphus was changed except for the "Yet" to what appeared to be an original sentence or two in the thesis, "Yet ... irreducible dispersion of things. From this arises a rift between the desiring mind and the disappointing world" -- but only to immediately jump into a third line from Parrochia's "Grand Revolutions of the 20th Century,"
> The increasingly assertive power of modern physical theories—along with their all-encompassing nature and unifying aim—now enables the scientist to occupy, to some extent, the role held by the philosopher from antiquity through the classical age. This is by no means the least significant consequence of the revolution we have experienced...
My very very initial read of this style, I would almost guess that he paid someone else -- someone who did not have a science education -- to write his thesis for him. And probably if that were true, then he had to provide the sources, "I like this sentence from here, that one from there, you see I highlighted this paragraph of this paper -- I'll highlight and you just paste everything together into one big whole and I'll look through the word processor and tweak a couple of sentence beginnings and endings to make everything look nice for the committee and probably only one person on the committee really reads a bit of it but let's be honest that they're all busy with their own research." With that sort of origin, that's how you get the "blind copying without rephrasing" type of thing (The person who's copying doesn't trust their technical chops to rephrase anything! "What if I choose the wrong word and it has another meaning in science and I embarrass myself?" -- so they go verbatim, "this made sense to someone who was well educated in the sciences, it can't be too embarrassing") with a little bit of tweaks between the chunks.
The really incredible thing about the plagiarism report is the 16 copié-collé/copy-paste sections AFTER this one, where it's just like "Yep, he stole whole pages at a time from his sources in just this way."