Artificial Intelligence

OpenAI Faces Allegations of Accidental Data Deletion in NY Times Copyright Case!

Published

2 years ago

November 25, 2024

OpenAI is currently embroiled in a copyright lawsuit with The New York Times and Daily News, facing scrutiny for allegedly erasing potentially critical evidence in the case. The lawsuit accuses OpenAI of using copyrighted content to train its AI models without proper authorization, raising significant concerns about intellectual property rights in the age of artificial intelligence.

The Incident

Earlier this year, OpenAI agreed to grant The Times and Daily News access to virtual machines (VMs) to search for their copyrighted content within its AI training datasets. These VMs are software-based environments commonly used for tasks like testing and data analysis.

Since November 1, legal teams and hired experts for the plaintiffs reportedly invested over 150 hours sifting through OpenAI’s training data. However, on November 14, OpenAI engineers inadvertently deleted the search data stored on one of the VMs, according to a letter filed in the U.S. District Court for the Southern District of New York.

Data Recovery Attempts

While OpenAI attempted to recover the lost data, they only partially succeeded. The restored files lacked their original folder structures and filenames, rendering them ineffective for determining where the plaintiffs’ copyrighted articles may have been used in training the AI models.

The plaintiffs’ attorneys criticized OpenAI for this mishap, highlighting that significant time and resources were wasted as their team was forced to start over. “The plaintiffs learned only yesterday that the recovered data is unusable,” the letter stated, adding that OpenAI is in a better position to search its own datasets using internal tools.

OpenAI’s Defense

OpenAI has denied the allegations, attributing the issue to a misconfiguration requested by the plaintiffs’ own team. In a response filed on November 22, OpenAI’s counsel stated:

“Plaintiffs requested a configuration change to one of several machines… implementing plaintiffs’ requested change resulted in removing the folder structure and some file names on one hard drive, which was intended as a temporary cache.”

OpenAI maintains that no files were permanently lost and emphasized that the deletion was not deliberate.

The Broader Legal Context

At the heart of the lawsuit is OpenAI’s use of publicly available data, including copyrighted content, to train its models. OpenAI contends that such practices fall under the doctrine of fair use, allowing the creation of AI systems like GPT-4, which rely on vast amounts of data, including books and articles.

Licensing Agreements

Despite its stance, OpenAI has been securing licensing agreements with numerous publishers, such as Associated Press, Axel Springer, and Dotdash Meredith. These deals remain confidential, though reports suggest that some partners, like Dotdash, receive payments exceeding $16 million annually.

What’s Next?

The legal battle raises broader questions about how AI companies should handle copyrighted materials and whether using such data for AI training constitutes fair use. OpenAI’s ability to demonstrate transparency and compliance will likely play a pivotal role in the case’s outcome.

Implications for AI Development

For now, the accidental deletion serves as a reminder of the technical and ethical complexities surrounding AI development and its intersection with intellectual property rights. As companies like OpenAI navigate these challenges, they must balance innovation with respect for creators’ rights.

Conclusion

The ongoing copyright lawsuit between OpenAI and major news organizations underscores critical issues in the rapidly evolving landscape of artificial intelligence. As this case unfolds, it will set important precedents regarding data usage and copyright law in AI development. The outcome could influence not only how AI companies operate but also how they engage with content creators moving forward.

Up Next

Amagi Acquires Argoid AI to Power AI-Driven Innovation in M&E!

Don't Miss

Microsoft Unveils Two New Chips to Boost AI Performance and Enhance Security in Data Centers!

Kontham Abhishek

Passionate about the entrepreneurial journey, I Konatham Abhishek shares the latest news on startups, founders, and funding, offering valuable insights and updates on the startup ecosystem.

9 Comments

Artificial Intelligence

UAE G42 Launches 8-Exaflop AI Supercomputer in India for Sovereign AI 2026

Published

3 months ago

February 24, 2026

Kontham Abhishek

UAE-based G42 has announced plans to deploy an 8 exaflop AI supercomputer in India, announced at the AI Impact Summit 2026 in Delhi. This national-scale project partners with Cerebras, MBZUAI, and India’s C-DAC, operating under full Indian data sovereignty as part of the India AI Mission.

The supercomputer boosts sovereign AI capabilities, enabling startups, researchers, academics, SMEs, and government access for tailored applications like public services and language tech. G42 India CEO Manu Jain highlighted its role in making India AI-native while prioritizing security.

This follows India-UAE tech pacts in late 2025, positioning India among global leaders in exaflop AI infrastructure amid rising demand for localized compute. Cerebras CSO Andy Hock noted it will accelerate large model training for India-specific needs.

Artificial Intelligence

Adopt AI Secures $6 Million to Power No-Code AI Agents for Business Automation

Published

1 year ago

May 14, 2025

Kontham Abhishek

Adopt AI, a San Jose and Bengaluru-based agentic AI startup, has raised $6 million in seed funding led by Elevation Capital, with participation from Foster Ventures, Powerhouse Ventures, Darkmode Ventures, and angel investors. The funding will be used to expand the company’s engineering and product teams and to scale enterprise deployments of its automation platform.

Founded by Deepak Anchala, Rahul Bhattacharya, and Anirudh Badam, Adopt AI offers a platform that lets businesses automate workflows and execute complex actions using natural language commands, without needing to rebuild existing systems. Its core products include a no-code Agent Builder, which allows companies to quickly create and deploy AI-driven conversational interfaces, and Agentic Experience, which replaces traditional user interfaces with text-based commands.

The startup’s technology is aimed at SaaS and B2C companies in sectors like banking and healthcare, helping them rapidly integrate intelligent agent capabilities into their applications. Adopt AI’s team includes engineers from Microsoft and Google, with Chief AI Officer Anirudh Badam bringing over a decade of AI experience from Microsoft.

The company has also launched an Early Access Program to let businesses pilot its automation solution and collaborate on new use cases.

Artificial Intelligence

Social Media Platforms Push for AI Labeling to Counter Deepfake Risks

Published

1 year ago

March 28, 2025

Kontham Abhishek

Social Media Platforms Push for AI Labeling to Counter Deepfake Risks,Startup Stories,Startup News,Startup Stories 2025,Startup Stories India,Tech News,Social Media Platforms Seek AI Labelling,Deepfakes,Social Media Platforms Push for AI Labeling,Social Media Platforms,Social Media,Social Media Deepfake Risks,Deepfake Risks,Deepfake Technology on Social Media,Deepfake on Social Media,AI,Deepfake Threat,Industry Stakeholders,Delhi,AI Content,Deepfake Technology,Stakeholders,Artificial intelligence,Online Platforms,AI Labeling,Deepfake,Digital Services,Digital News,Facebook,Instagram,Advanced Artificial Intelligence,Privacy,Made with AI,Elections,Politics,Personal Privacy

Social media platforms are intensifying efforts to combat the misuse of deepfake technology by advocating for mandatory AI labeling and clearer definitions of synthetic content. Deepfakes, created using advanced artificial intelligence, pose significant threats by enabling the spread of misinformation, particularly in areas like elections, politics, and personal privacy.

Meta’s New Approach

Meta has announced expanded policies to label AI-generated content across Facebook and Instagram. Starting May 2025, “Made with AI” labels will be applied to synthetic media, with additional warnings for high-risk content that could deceive the public. Meta also requires political advertisers to disclose the use of AI in ads related to elections or social issues, aiming to address concerns ahead of key elections in India, the U.S., and Europe.

Industry-Wide Efforts

Other platforms like TikTok and Google have introduced similar rules, requiring deepfake content to be labeled clearly. TikTok has banned deepfakes involving private figures and minors, while the EU has urged platforms to label AI-generated media under its Digital Services Act guidelines.

Challenges Ahead

Despite these measures, detecting all AI-generated content remains difficult due to technological limitations. Experts warn that labeling alone may not fully prevent misinformation campaigns, especially as generative AI tools become more accessible.

Election Implications

With major elections scheduled in 2025, experts fear deepfakes could exacerbate misinformation campaigns, influencing voter perceptions. Social media platforms are under pressure to refine their policies and technologies to ensure transparency while safeguarding free speech.

Startup Stories

OpenAI Faces Allegations of Accidental Data Deletion in NY Times Copyright Case!

Artificial Intelligence

OpenAI Faces Allegations of Accidental Data Deletion in NY Times Copyright Case!

The Incident

Data Recovery Attempts

OpenAI’s Defense

The Broader Legal Context

Licensing Agreements

What’s Next?

Implications for AI Development

Conclusion

Leave a Reply
Cancel reply

Leave a Reply

Artificial Intelligence

UAE G42 Launches 8-Exaflop AI Supercomputer in India for Sovereign AI 2026

Artificial Intelligence

Adopt AI Secures $6 Million to Power No-Code AI Agents for Business Automation

Artificial Intelligence

Social Media Platforms Push for AI Labeling to Counter Deepfake Risks

Meta’s New Approach

Industry-Wide Efforts

Challenges Ahead

Election Implications

Recent Posts

OpenAI’s Trusted Contact Feature Signals a New Direction in AI Safety

Apple MacBook Air M5 Launched: M5 Chip, 22-Hour Battery in India

₹290 Crore Boost: Rozana’s Series B Funding Scales Rural Retail Network Nationwide

OpenAI’s Trusted Contact Feature Signals a New Direction in AI Safety

T.N. Seshan: The Fearless Reformer Who Redefined Indian Democracy

Larry Page: The Visionary Co-Founder Behind Google’s Global Success

Playboy: The Success Story Of Hefner’s Empire

Startup Stories

OpenAI Faces Allegations of Accidental Data Deletion in NY Times Copyright Case!

The Incident

Data Recovery Attempts

OpenAI’s Defense

The Broader Legal Context

Licensing Agreements

What’s Next?

Implications for AI Development

Conclusion

You may like

Leave a Reply Cancel reply

Leave a Reply

Artificial Intelligence

UAE G42 Launches 8-Exaflop AI Supercomputer in India for Sovereign AI 2026

Artificial Intelligence

Adopt AI Secures $6 Million to Power No-Code AI Agents for Business Automation

Artificial Intelligence

Social Media Platforms Push for AI Labeling to Counter Deepfake Risks

Meta’s New Approach

Industry-Wide Efforts

Challenges Ahead

Election Implications

Recent Posts

OpenAI’s Trusted Contact Feature Signals a New Direction in AI Safety

Apple MacBook Air M5 Launched: M5 Chip, 22-Hour Battery in India

₹290 Crore Boost: Rozana’s Series B Funding Scales Rural Retail Network Nationwide

OpenAI’s Trusted Contact Feature Signals a New Direction in AI Safety

T.N. Seshan: The Fearless Reformer Who Redefined Indian Democracy

Larry Page: The Visionary Co-Founder Behind Google’s Global Success

Playboy: The Success Story Of Hefner’s Empire

Leave a Reply
Cancel reply