Encyclopaedia Britannica and Merriam-Webster sue OpenAI over alleged copyright violations in AI training

March 17, 2026

137

Encyclopaedia Britannica and Merriam-Webster have filed a lawsuit against OpenAI, accusing the firm of infringing on the copyright of nearly 100,000 articles by using them to train large language models without proper authorization.

The case marks a significant escalation in the growing legal battle between content creators and artificial intelligence companies, as publishers seek to define how intellectual property laws apply in the age of generative AI.

According to the complaint, the publishers allege that OpenAI incorporated substantial portions of their copyrighted material into its training datasets, enabling its models to generate responses that closely resemble or draw from their original content. They argue that this constitutes unauthorized use and undermines the value of their intellectual property.

The lawsuit reflects broader concerns among traditional knowledge institutions, which have invested decades in building curated, authoritative databases. Encyclopaedia Britannica, one of the oldest reference publishers in the world, and Merriam-Webster, a leading dictionary publisher, both maintain extensive archives of educational and linguistic content that are widely used across academic and professional settings.

At the centre of the dispute is the question of whether using copyrighted material to train AI systems falls under fair use or requires licensing agreements. OpenAI and other AI developers have generally argued that training models on large datasets, including publicly available text, is a transformative process that does not directly reproduce original works. Publishers, however, contend that the scale and nature of this use go beyond acceptable limits.

The case is part of a wave of legal actions targeting AI companies over data usage. Similar lawsuits have been filed by authors, news organisations and media companies, all seeking clarity on how their content can be used in training algorithms. These cases could set important legal precedents that shape the future of the AI industry.

Legal experts say the outcome could have far reaching implications. If courts rule in favour of publishers, AI companies may be required to secure licenses for training data or compensate content owners, potentially increasing operational costs and slowing development. On the other hand, a ruling that supports AI firms could reinforce the current model of using large scale datasets, accelerating innovation but raising ongoing concerns about intellectual property rights.

The lawsuit also highlights the tension between technological advancement and the protection of creative and educational work. As AI systems become more capable, the demand for high quality training data has increased, bringing them into direct conflict with organisations that produce and own such content.

Encyclopaedia Britannica and Merriam-Webster sue OpenAI over alleged copyright violations in AI training

OpenAI has not publicly detailed its legal response to the claims, but the company has previously indicated a willingness to work with publishers through partnerships and licensing agreements. In recent months, several AI firms have entered into deals with media organisations to access content legally, suggesting a possible path toward collaboration rather than conflict.

For Encyclopaedia Britannica and Merriam-Webster, the case represents an effort to assert control over how their content is used in a rapidly evolving digital landscape. For the AI industry, it underscores the urgent need for clearer regulatory frameworks governing data usage and intellectual property.

As the case progresses, it is likely to become a landmark moment in defining the relationship between AI development and content ownership, with implications that extend across technology, media and education sectors worldwide.

OpenAI nears US$100bn funding round at US$850bn valuation

Author

Daniel Amenyo Ablordey
Daniel Ablordey is a Business Analytics student at the University of Ghana Business School and an emerging strategist at the intersection of data, markets, and narrative. With a keen analytical mind and a passion for African business and economic trends, he is building a career focused on translating complex data-driven insights into accessible, decision-relevant stories that matter.
As a writer and editor with Insight Ghana, African Business Insight, and The African Journal, Daniel delivers sharp, high-impact analysis on current affairs, business developments, and emerging trends across the continent. His work is defined by precision, clarity, and a deep commitment to responsible journalism — ensuring that every story he tells is not only accurate but meaningful to the audiences it serves.
Beyond his editorial work, Daniel serves as an Ecobank Youth Ambassador, where he actively promotes financial inclusion, digital banking, and financial literacy among young Ghanaians. His leadership experience spans academic, professional, and faith-based institutions, where he has consistently driven initiatives centered on growth, structure, and long-term impact.
Grounded in the principles of Pan-Africanism and service, Daniel brings a rare combination of analytical rigour and storytelling depth to his work. Whether unpacking market behavior, profiling emerging business leaders, or covering cultural shifts shaping the continent, he approaches every assignment with strategic intent and editorial integrity.
His broader ambition is to contribute to Africa's transformation by shaping how data, business, and storytelling intersect — not just locally, but on a global stage.

Author

Author

Author

LEAVE A REPLY Cancel reply