A new report from Data & Society raises doubts about automated solutions to deceptively altered videos, including machine learning-altered videos known as deepfakes. Authors Britt Paris and Joan Donovan argue that deepfakes, while new, are part of a long history of media manipulation — one that requires both a social and a technical fix. Relying on AI could actually make things worse by concentrating more data and power in the hands of private corporations.

As Paris and Donovan see it, deepfakes are unlikely to be fixed by technology alone. “The relationship between media and truth has never been stable,” the report reads. In the 1850s, when judges began allowing photographic evidence in court, people mistrusted the new technology and preferred witness testimony and written records. By the 1990s, media companies were complicit in misrepresenting events by selectively editing out images from evening broadcasts. During the Gulf War, reporters constructed an image of a conflict between evenly matched opponents by failing to show the starkly uneven death toll between US and Iraqi forces. “These images were real images,” the report says. “What was manipulative was how they were contextualized, interpreted, and broadcast around the clock on cable television.”
Today, deepfakes have taken this kind of manipulation even further, using machine learning to alter videos and images with results that are almost impossible to detect with the human eye. Now, the report says, “anyone with a public social media profile is fair game to be faked.” Once the fakes exist, they can go viral on social media in a matter of seconds.
To combat this issue, Facebook recently announced that it was releasing a dataset to allow people to test out new models aimed at detecting deepfakes. Startups like Truepic, which uses AI and blockchain technology to detect manipulated photos, have started to gain momentum as well. The US Defense Advanced Research Projects Agency (DARPA) has also invested in MediFor, which looks at differences in video pixels to determine when something has been doctored. But almost all of these solutions aim to fight manipulation at the point of capture (i.e., when a photo or video is taken) or at the detection level (to make it easier to differentiate between doctored and undoctored content).
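To give a rough sense of what pixel-level comparison means in practice, here is a minimal, illustrative sketch in Python. It is not MediFor's actual method — it assumes you already have a trusted reference frame to compare against, and the function name and tolerance value are invented for the example — but it shows the basic idea of flagging frames whose pixels diverge from a known-good copy.

```python
import numpy as np

def pixel_difference_score(original: np.ndarray, suspect: np.ndarray) -> float:
    """Return the fraction of pixels that differ noticeably between two
    same-sized RGB frames (uint8 arrays with values 0-255)."""
    if original.shape != suspect.shape:
        raise ValueError("frames must have the same dimensions")
    # Absolute per-pixel difference, averaged across the color channels
    diff = np.abs(original.astype(np.int16) - suspect.astype(np.int16)).mean(axis=-1)
    # Count pixels that changed by more than a small tolerance (threshold is arbitrary here)
    return float((diff > 10).mean())

# Example: a frame with a pasted-in patch scores higher than an untouched copy
frame = np.random.randint(0, 256, (720, 1280, 3), dtype=np.uint8)
doctored = frame.copy()
doctored[100:200, 300:450] = 255  # crude "edit"
print(pixel_difference_score(frame, frame))      # ~0.0
print(pixel_difference_score(frame, doctored))   # noticeably greater than 0
```

Real forensic tools are far more sophisticated — they look for statistical inconsistencies within a single file rather than relying on a pristine original — but the sketch illustrates why this entire class of fixes operates after the fact, at the detection level.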