The alpha test of the new version v6 of Midjourney shows that the popular image AI can generate images that are very close to copyrighted original images and unmistakably reproduce well-known brands or people.
Midjourney users share examples of this on X. The most striking is that of Joaquin Phoenix in the 2019 Joker movie. The prompt “Joaquin Phoenix Joker movie, 2019, screens from movie, movie scene” returns an image that almost exactly matches a movie scene.
The difference is mainly in the lighting and colors. The images are so similar that it seems possible that this is a slightly modified version of the training data (image 1: Midjourney, image 2: original movie scene).
The new model may have been trained repeatedly and intensively on the same data to maximize performance, resulting in outputs that are almost identical to the training data. This is called “overtraining” or “overfitting,” and studies have indicated that it can happen. ChatGPT can also show signs of text overfitting.
There are several examples of this on X. The scene is not always actually from a movie – but it could be, as the following generated images with characters from superhero movies show.
Again, some generated images are very close to the film originals, differing only in minor variations such as camera angle or posture.
The prompts for these images are easy to write, all you need is the name of the movie, maybe the year, and a tag like “movie scene” or “screenshot from a movie”. I was able to generate these images with Gandalf from “Lord of the Rings” using the same scheme. Images two and four look close to the original Gandalf.
Midjourney faces plagiarism accusations
Illustrator and film concept artist Reid Southen, who has worked for Marvel Productions, criticizes Midjourney on X. He reports that images have been deleted from his account because of his comments. According to Southen, his account was banned, and he was kicked out of the Midjourney Discord group. He made a video showing his plagiarism experiments with Midjourney v6.
Southen accuses Midjourney of “illegally using copyrighted IP without a license.” The AI software can create “exact copies of copyrighted IP, as well as infinite derivatives,” according to Southen.
Artists would be competing with their own work in the same market. “Or what about brand image issues and consumer confusion when 50% of all Marvel stuff online ends up AI knockoffs?” writes Southen.
Simple prompts that lead to plagiarism also exist for classic artworks like the Mona Lisa. In this case, the plagiarism would not be legally problematic because the image is in public domain due to its age.
For critics of AI systems who accuse model makers of data theft and misuse, the new v6 model is likely to be a bombshell. Numerous lawsuits are pending, and Midjourney is already involved in at least one.
Tech companies argue for “fair use” because training computer systems with data and letting them learn from it is supposedly a “transformative” use of data. But the examples above may convince courts otherwise. Midjourney has not yet responded to this criticism.