Aries - Studio Ghibli and Japanese Publishers Tell OpenAI to Stop Training on Their Work: What It Means for AI and Copyright

Studio Ghibli and Japanese Publishers Tell OpenAI to Stop Training on Their Work: What It Means for AI and Copyright

Studio Ghibli and major Japanese publishers demand OpenAI stop using films and manga to train AI without permission. The move highlights conflicts over generative AI copyright, dataset licensing, AI training data fair use, and potential legal and commercial impacts for businesses and creators.

Studio Ghibli and several major Japanese publishers have publicly demanded that OpenAI stop using their films, manga and other creative works to train generative AI models without permission. This public request has refocused attention on the legal and ethical questions around AI training data and the boundaries of AI training data fair use.

Background: Why training data is the flashpoint

Machine learning models are trained on large collections of digital content so they can predict text, generate images or summarize media. Training means showing a model many examples so it learns patterns. Rights holders say copying and using those works for model training should require permission or payment. Many AI developers have relied on large scale web scraping and defenses that emphasize transformation or fair use. The current OpenAI copyright dispute adds pressure as regulators and courts consider whether that approach is lawful.

Key details and developments

Studio Ghibli is leading public requests from multiple major Japanese publishers, a high profile move given the studio's cultural reach.
OpenAI remains closely linked with Microsoft, whose strategic investment means changes at OpenAI can ripple into widely used products. Microsoft support has been estimated at roughly 10 billion in 2023.
AI chat and content tools have wide reach. For example ChatGPT surpassed around 100 million monthly active users early in 2023, showing how trained models are embedded in business workflows.
Japan's manga and anime industries are economically significant, which is why publishers emphasize control over downstream commercial uses and calls for dataset licensing or collective licensing solutions.
The recent viral Ghibli style images trend and ChatGPT 4o features intensified concern about AI outputs substantially similar to source works and about unauthorized use of living artists styles.

Implications and analysis: what this means for industry and businesses

The dispute could produce concrete outcomes for AI providers and their customers as courts and policy bodies clarify rules on generative AI copyright and AI model training copyright.

Licensing and cost AI companies may need to negotiate licensing deals with publishers and studios. That could raise the cost of training data and affect pricing for AI services used by small companies and creators.
Data removal and model updates Publishers could demand takedown of specific works from training corpora, forcing retraining or model adjustments to reduce legal exposure.
Legal precedent and regulation Guidance from bodies such as the US Copyright Office AI report 2025 and emerging case law could define when building datasets from copyrighted works is permitted.
Shift in data strategy Expect more investment in synthetic data, licensed datasets or private datasets and in stronger provenance and attribution systems to manage AI copyright risk.

Practical steps for organizations

Audit AI usage to identify where third party trained models are used in products or workflows and whether outputs risk being substantially similar to copyrighted works.
Prepare alternatives by evaluating open source models, licensed datasets, or bespoke models trained on permissively licensed content.
Update procurement and intellectual property policies to include clauses on training data provenance, indemnity for AI vendors, and requirements for dataset licensing or proof of rights.

Expert context

The public demand from cultural publishers fits a broader trend where creators and rights holders assert control over the datasets that power generative AI. Recent guidance on the fair use doctrine for generative AI and calls for mechanisms to obtain rights suggest collective licensing schemes may gain traction. As the legal landscape evolves, businesses should monitor developments including lawsuits against AI firms and regulatory reports.

Conclusion

The Studio Ghibli led call for OpenAI to stop training on films and manga without permission is more than a single dispute. It signals a moment where nonconsensual large scale web scraping and use of copyrighted materials for model training is under renewed scrutiny. For businesses using AI tools, expect greater legal clarity and possibly higher costs or more conservative product choices. Companies should begin auditing AI dependencies and prepare procurement and compliance strategies that address dataset licensing and AI copyright risk.

selected projects

Get to know our take on the latest news

View Post

When ChatGPT Told Users They Were “Special”: Lawsuits Allege AI Isolation and Harm

View Post

How ChatGPT Forced Google to Reinvent Search and What That Means for AI Ads and the Web

Ready to live more and work less?

Get started