The New York Times got its content removed from one of the biggest AI training datasets. Here's how it did it
Published
The New York Times' office and Sam Altman, OpenAI CEO. Lindsey Nicholson/UCG/Universal Images Group via Getty Images; Win McNamee/Getty Images The New York Times discovered a big AI training dataset contained links to its copyrighted content. The media company also found its content in other AI…
#samaltman #openai #webtext #commoncrawl #ccbot #gpt3 #google #infiniset #c4 #times