OpenAI Furious DeepSeek Might Have Stolen All the Data OpenAI Stole From Us

Chef’s kiss of a headline from 404media, “OpenAI Furious DeepSeek Might Have Stolen All the Data OpenAI Stole From Us”:

Here is how the Bloomberg article begins: “Microsoft Corp. and OpenAI are investigating whether data output from OpenAI’s technology was obtained in an unauthorized manner by a group linked to Chinese artificial intelligence startup DeepSeek, according to people familiar with the matter.” The story goes on to say that “Such activity could violate OpenAI’s terms of service or could indicate the group acted to remove OpenAI’s restrictions on how much data they could obtain, the people said.”

From a January 2024 OpenAI Blog Post:

Training AI models using publicly available internet materials is fair use, as supported by long-standing and widely accepted precedents.

OpenAI makes their Generative AI tools like Chat-GPT available, publicly, on their own website. Following their logic – DeepSeek is applying “fair use” to it. Via long-standing and widely accepted precedents. Can’t argue with OpenAI’s own logic, all seems above-board and perfectly fine.