Nashville News Post

collapse
Home / Daily News Analysis / OpenAI's new image watermarks make it easier to spot AI fakes - here's how

OpenAI's new image watermarks make it easier to spot AI fakes - here's how

May 24, 2026  Twila Rosenbaum  39 views
OpenAI's new image watermarks make it easier to spot AI fakes - here's how

OpenAI has taken a major step forward in the fight against AI-generated misinformation by introducing what it calls content provenance signals across its image ecosystem. In short, the company is now tagging every image created by its tools — from DALL-E 3 images in ChatGPT to outputs from Sora and the OpenAI API — with multiple layers of identifying information. This move is designed to make it significantly harder for bad actors to pass off AI-generated visuals as real photographs or human-made art.

Why provenance matters

The problem of fake images is not new, but the scale at which AI can now produce convincing visuals has raised alarms worldwide. Deepfakes have already been used in political propaganda, financial scams, and social media manipulation. Early efforts to embed metadata — text-based information hidden in image files — were a step in the right direction, but they proved easy to defeat. A simple screenshot stripped away all metadata, leaving the image with no trace of its artificial origins. OpenAI's latest approach addresses this weakness by using multiple overlapping techniques, each designed to survive common image-manipulation tactics.

C2PA compliance: a global standard

OpenAI has become a C2PA Conforming Generator Product. C2PA stands for the Coalition for Content Provenance and Authenticity, an industry initiative that creates technical standards for verifying the origin of digital content. By meeting C2PA's requirements, OpenAI ensures that the metadata attached to its images follows a secure, standardized format. That metadata includes information about the model used, the time of creation, and the platform that generated the image. Platforms like social media sites and news organizations can then read this metadata using Content Credentials tools to confirm whether an image was AI-generated. However, metadata alone is still vulnerable to stripping, which is why OpenAI has added a second, more robust layer.

SynthID: invisible watermarks in the pixels

Perhaps the most significant upgrade is OpenAI's adoption of Google DeepMind's SynthID technology. SynthID embeds a digital watermark directly into the pixels of an image at the moment of generation. This watermark is imperceptible to the human eye but can be detected by specialized software. What makes SynthID powerful is its durability: it remains intact even after the image is resized, cropped, compressed, color-adjusted, or screen-shot. The watermark is distributed across the entire image, not just in a small corner or area, making it nearly impossible to remove without destroying the image's quality. This technique is a modern form of steganography — the ancient practice of hiding messages in plain sight.

A brief history of steganography

Steganography dates back to ancient Greece. Around 440 BC, the historian Herodotus recorded how a Greek leader named Histiaeus shaved the head of a trusted slave, tattooed a message onto his scalp, and waited for the hair to grow back before sending the slave on his mission. The message was hidden in plain sight until the recipient shaved the slave's head again. During World War II, similar techniques were used with microdots and invisible inks. In the digital age, steganography involves embedding data into the least significant bits of image pixels, audio files, or video frames. SynthID uses a sophisticated variant of this concept, ensuring that the signal is both robust and invisible.

How SynthID differs from traditional metadata

Traditional metadata, like the kind embedded in JPEG files, is stored separately from the image data. When you take a screenshot or re-save an image in a different format, that metadata is often discarded. SynthID, by contrast, is part of the image's pixel data itself. Think of it like a digital watermark on paper money: you can't photocopy it away without distorting the underlying design. OpenAI's integration of SynthID means that every image produced by ChatGPT, Codex, and the OpenAI API carries this hidden fingerprint. Google already uses SynthID for its Gemini-generated images, including the Nano Banana model, which places a visible diamond logo alongside the invisible watermark. OpenAI's implementation is similar but does not include a visible mark by default — only the invisible signal.

The public verification tool

Concurrent with these changes, OpenAI is launching a public verification tool available at a dedicated URL. This tool allows anyone to upload an image and check whether it was generated by an OpenAI system. It works by analyzing the C2PA metadata and the SynthID watermark. Early tests suggest the tool can detect even heavily modified images, though its accuracy may vary with extreme alterations. OpenAI acknowledges that no single technique is foolproof, but the combination of standards, watermarks, and verification tools creates a robust ecosystem for provenance. The company hopes this will set a new baseline for the entire AI industry, encouraging other model providers to adopt similar measures.

Real-world implications

For journalists, fact-checkers, and ordinary users, the ability to verify the origin of an image is increasingly critical. A fake image of a politician at a false event or a doctored photo of a disaster scene can go viral within minutes, causing real-world harm. While OpenAI's system is limited to its own generated content, it sets a precedent. If every major AI image generator adopted C2PA and SynthID, it would become much harder for malicious actors to create convincing forgeries without leaving a trace. However, challenges remain: open-source models and offline generators may not follow the same standards. Moreover, adversarial techniques to defeat watermarks are constantly evolving. Researchers at several universities are already exploring ways to remove SynthID-like signals, though early attempts have had limited success.

Technical details of the watermark

SynthID works by modifying the pixel values in a way that creates a statistical pattern detectable by a trained classifier. The modification is tiny — often a single bit out of 16 or 24 bits per color channel — making it invisible to the human eye. The pattern is robust against common transformations. For example, if an image is resized to 75% of its original dimensions, the watermark can still be recovered with high confidence. If the image is cropped to show only a subsection, the watermark may be partially degraded but often remains identifiable. The technology has been tested extensively by Google DeepMind on millions of images, with false positive rates kept below 0.1%. This level of reliability is crucial for real-world deployment.

OpenAI has also confirmed that the watermark is applied to all images generated through its platforms, including those created via the API for third-party applications. This means that developers building apps that rely on OpenAI's image generation will automatically produce watermarked outputs. The company has updated its terms of service to reflect these requirements, ensuring that its partners comply with the provenance standards.

Comparison with earlier attempts

Before this announcement, OpenAI's main method of provenance was metadata embedded in the image file's EXIF region. That metadata included a link to Content Credentials, but it could be easily removed by converting the image to a different format or taking a screenshot. Independent tests showed that 90% of AI-generated images shared on social media had their metadata stripped within the first few days of posting, often unintentionally as platforms compressed or re-encoded images. With SynthID, the watermark persists through such transformations. Early internal tests by OpenAI indicate that over 95% of images that underwent typical social media processing still retained detectable watermarks.

The combination of robust watermarks and standardized metadata creates a layered defense. Even if one layer is somehow bypassed, the other can still provide evidence of AI origin. This redundancy is key to building trust in the verification process. OpenAI's move also pressures other companies, such as Midjourney and Stable Diffusion, to implement similar safeguards. The industry has been fragmented, with each provider using different — and often weaker — methods. Standardization around C2PA and SynthID could change that.

As the line between real and AI-generated content continues to blur, tools like these become essential for maintaining information integrity. The ability to quickly verify an image's provenance empowers journalists, researchers, and everyday users to make informed decisions about the content they encounter. While no system is perfect, the combination of open standards, invisible watermarks, and public verification represented by OpenAI's current offering marks a significant improvement over previous efforts. Ongoing collaboration between AI developers, metadata standards bodies, and verification tool makers will be needed to stay ahead of those who seek to exploit the technology for harm.


Source: ZDNET News


Share:

Your experience on this site will be improved by allowing cookies Cookie Policy