{"id":38306,"date":"2025-09-22T17:13:08","date_gmt":"2025-09-22T17:13:08","guid":{"rendered":"https:\/\/publicknowledge.org\/?p=38306"},"modified":"2025-09-22T17:13:13","modified_gmt":"2025-09-22T17:13:13","slug":"is-there-a-middle-ground-in-the-tug-of-war-between-news-publishers-and-ai-firms-part-1-framing-the-problem","status":"publish","type":"post","link":"https:\/\/publicknowledge.org\/is-there-a-middle-ground-in-the-tug-of-war-between-news-publishers-and-ai-firms-part-1-framing-the-problem\/","title":{"rendered":"Is There a Middle Ground in the Tug of War Between News Publishers and AI Firms? Part 1: Framing the Problem"},"content":{"rendered":"\n<p><em>The tug of war between online news publishers and AI developers on the role of copyrighted content in AI model training may lead to a more closed internet for everyone. In this two-part blog series, we describe the situation as it\u2019s unfolding and propose policy solutions worth exploring to preserve incentives for publishers to keep creating the timely content that AI developers need and democracy requires. View the second post, &#8220;<a href=\"https:\/\/publicknowledge.org\/is-there-a-middle-ground-in-the-tug-of-war-between-news-publishers-and-ai-firms-part-2-framing-solutions\/\">Is There a Middle Ground in the Tug of War Between News Publishers and AI Firms? Part 2: Framing Solutions<\/a>&#8221; to continue the series.<\/em><\/p>\n\n\n\n<p>In July, the Senate Judiciary Subcommittee on Crime and Counterterrorism hosted a <a href=\"https:\/\/www.judiciary.senate.gov\/committee-activity\/hearings\/too-big-to-prosecute-examining-the-ai-industrys-mass-ingestion-of-copyrighted-works-for-ai-training\">hearing<\/a> with the provocative title, \u201cToo Big to Prosecute? Examining the AI Industry\u2019s Mass Ingestion of Copyrighted Works for AI Training.\u201d Happily, some of the conversation with the four witnesses was actually about the crime of online piracy, meaning how artificial intelligence, or AI, firms have downloaded content from shadow libraries (unofficial pirate repositories of digital books and articles) to train their large language models (LLMs). However, the majority of the hearing served only to <a href=\"https:\/\/publicknowledge.org\/piracy-vs-fair-use-how-ai-training-intersects-with-copyright-law\/\">conflate<\/a> \u201cpiracy of copyrighted works\u201d with \u201ctraining generative AI models on copyrighted works.\u201d The former may be illegal, but we believe the latter is generally a protected fair use under the legal doctrine that allows limited use of copyrighted works without permission.&nbsp;<\/p>\n\n\n\n<p>The hearing was a reflection of the tremendous public attention on the intersection of generative artificial intelligence, copyright, and online publishing. Although the hearing largely focused on book publishers, many of the issues discussed also pertain to another type of publisher Public Knowledge has written about quite a bit: online news publishers. They, too, often use the language of \u201cstealing,\u201d \u201ctaking,\u201d or \u201choovering\u201d to describe what happens to their copyrighted content by digital platforms and believe it to be a type of theft. We find the argument that digital platforms \u201csteal\u201d news content misconstrues copyright law and conflates two <em>very<\/em> different ideas: piracy, and AI model training.&nbsp;<\/p>\n\n\n\n<p>In our view, web crawling for the purpose of AI training implicates the freedom to learn, freely use information, and freely express creativity. We also acknowledge the genuine economic risks generative AI poses to creators, including news publishers. And we know that some aspects of AI model design may infringe on content owners\u2019 rights to control the reproduction, distribution, and public display of their work. For example, overfitting, including memorization, by models may result in infringing outputs. However, when <a href=\"https:\/\/publicknowledge.org\/generative-ai-is-disruptive-but-more-copyright-isnt-the-answer\/\">we apply existing copyright law to our best understanding of generative AI systems<\/a>, we find that their core elements are consistent with the law.<\/p>\n\n\n\n<p>In Part 1 of this two-part blog series, we discuss why news publishers fear the impact of AI on their business models and whether these concerns are warranted. In <a href=\"https:\/\/publicknowledge.org\/is-there-a-middle-ground-in-the-tug-of-war-between-news-publishers-and-ai-firms-part-2-framing-solutions\/\">Part 2<\/a>, we describe strategies publishers are already using to mitigate the impact of AI on their business models, list additional solutions that are emerging to empower them against the threat generative AI represents, and identify policy solutions worthy of additional exploration to preserve the benefits of fair use while preserving incentives for publishers to keep creating the content AI developers and the public need.<\/p>\n\n\n\n<h3 class=\"heading-3 wp-block-heading\" id=\"h-ai-developers-and-news-publishers-are-frenemies\">AI Developers and News Publishers Are\u2026 Frenemies?<\/h3>\n\n\n\n<p>Whether they acknowledge it or not, AI developers and news publishers are increasingly codependent. Online news publishers play a crucial role in AI training by providing large datasets of high-quality, human-generated content. Whether through model training or retrieval augmented generation (more on these later), journalism grounds AI models in reality and in the <em>now<\/em>. The currency and relevance of AI generated outputs depend on access to timely sources of content. Journalism provides factual reporting, context, and in-depth analysis of real-world events and issues happening in the moment. By training AI on diverse journalistic sources, models learn to recognize and mitigate biases present in their training data. And using journalism for model training can help improve fact-checking and combat propaganda and false information. That may be why <a href=\"https:\/\/www.newsmediaalliance.org\/release-news-media-alliance-study-finds-pervasive-unauthorized-use-of-publisher-content-to-power-generative-ai-technologies\/\">studies show<\/a> the training sets underlying LLMs \u201csignificantly overweight publisher content\u201d compared to the generic collection of content scraped by <a href=\"https:\/\/commoncrawl.org\/\">Common Crawl<\/a>. The result: If their outputs undermine all the viable business incentives and models that sustain online news publishing, AI developers will eventually be in a world of hurt.<\/p>\n\n\n\n<p>Conversely, journalism organizations need to understand and, where appropriate, leverage AI systems to adapt to the realities of a digital media landscape. With the rise of the internet, then search and social media, and now AI-mediated search and information distribution, news publishers have been forced to grapple with rapidly changing technology that disrupts their business models. Additionally, publishers are always looking for opportunities to reduce costs, streamline news gathering, facilitate translation, and <a href=\"https:\/\/www.niemanlab.org\/2025\/09\/europe-middle-east-and-africa-newsrooms-are-experimenting-with-conversational-ai\/\">engage customers<\/a>, and are therefore aggressively seeking ways to <a href=\"https:\/\/cnti.org\/article\/ai-in-journalism\/\">leverage the substantial benefits<\/a> of AI in their own operations. (This is <a href=\"https:\/\/www.niemanlab.org\/2025\/08\/politicos-recent-ai-experiments-shouldnt-be-subject-to-newsroom-editorial-standards-its-editors-testify\/\">not without controversy<\/a>, including as to whether AI tools must be subject to the same journalistic editorial standards that pertain to human journalists. There is also <a href=\"https:\/\/fortune.com\/2025\/08\/18\/mit-report-95-percent-generative-ai-pilots-at-companies-failing-cfo\/\">evidence<\/a> that the vast majority of businesses \u2013 95% \u2013 have yet to see real efficiencies from AI materialize.) Lastly, journalists will need to leverage AI to conduct forensics and ensure the legitimacy of images and videos that may have been <em>created<\/em> with AI tools.&nbsp;<\/p>\n\n\n\n<p>Finally, AI developers and publishers both have self-benefiting roles to play in maintaining the open and free nature of the internet. Certainly government-funded research, commercial innovation, public policy, and infrastructure have been critical to create the internet as we know it today. But a lot of content and services (like search) are accessible and free to internet users today because of appealing publisher content and the advertising that supports it. Ensuring the viability of online publishing \u2013 whether it takes the form of newspapers, personal blogs, Substacks, or other models \u2013 allows a wider range of content and services to <em>remain<\/em> accessible and free. At the same time, news publishers have relied on users (and platform algorithms) freely sharing and promoting their content to drive clicks, views, and advertising revenues. Allowing AI models to \u201cread\u201d and learn from online content is an <a href=\"https:\/\/www.techpolicy.press\/what-the-copyright-case-against-ed-sheeran-can-teach-us-about-ai\/\">essential aspect<\/a> of an Open Internet, and it may create substantial economic and societal benefits. All of this is built on the premise of a free and open web.&nbsp;<\/p>\n\n\n\n<p>Lastly, journalism shares an interest in permissive copyright rules and strong fair use protections. Journalists are themselves highly dependent on the legal doctrine of fair use \u2013 for criticism and commentary, news gathering and reporting, republishing source material, illustration, historical reference, and documenting claims. Hollowing out fair use or dramatically expanding intellectual property rights could whip around and harm journalism itself.<\/p>\n\n\n\n<p>Given these relationships, in our view <em>policy solutions must be developed to ensure that the benefits of generative AI are shared by the body politic writ large without undercutting the journalism necessary for democracy\u2019s survival.<\/em><\/p>\n\n\n\n<h3 class=\"heading-3 wp-block-heading\" id=\"h-why-publishers-fear-generative-ai-nbsp\">Why Publishers Fear Generative AI&nbsp;<\/h3>\n\n\n\n<p>News publishers have long believed that dominant digital platforms unfairly \u2013 and in some cases, <a href=\"https:\/\/publicknowledge.org\/the-google-ad-tech-search-cases\/\">illegally<\/a> \u2013 exploit their work. The platforms\u2019 aim, this theory goes, is to garner most or all of the joint value created through their <a href=\"https:\/\/www.economist.com\/business\/2025\/07\/14\/ai-is-killing-the-web-can-anything-save-it\">longstanding exchange<\/a> with publishers: user engagement from news content on search and social media platforms in exchange for referral traffic provided to publishers through links. The current challenges in the news industry <a href=\"https:\/\/baekdal.com\/monetization\/the-updated-and-scary-circulation-and-revenue-figures-for-newspapers\/\">predate the internet<\/a>, but there is no debate that digital disintermediation has dramatically impacted the structure and economics of news delivery. This has led publishers, in some cases, to pursue <a href=\"http:\/\/publicknowledge.org\/we-can-save-local-news-without-upending-copyright-law\/\">solutions (like link taxes) that are incompatible with copyright law<\/a> as well as the principle of an Open Internet.<\/p>\n\n\n\n<p>Now AI, especially generative AI and its embedment in search products, chatbots, and agents, has exacerbated news publishers\u2019 concerns about the devastation platforms have caused to their business models. For example, generative AI\u2019s ability to provide complete narrative answers to some of the most complex user search queries right on the search engine results page undermines the need to click through to online publishers for more information. (When Google rolled out AI Overviews, now <a href=\"https:\/\/search.google\/ways-to-search\/ai-mode\/\">AI Mode<\/a>, in May of 2024, the company actually <a href=\"https:\/\/blog.google\/products\/search\/generative-ai-google-search-may-2024\/\">explicitly promised<\/a> to users that AI overviews are the perfect solution when \u201cyou don\u2019t have time to piece together all the information you need.\u201d In other words, a zero-click search is the product benefit of AI Overviews.) Or, chatbots and agents trained on news publishers\u2019 copyrighted content answer user queries about current events <em>instead<\/em> of search engines. That means the flow of traffic, ad dollars, and profit could continue to shift toward the dominant AI companies and away from publishers, spiking the trend line in place for decades.&nbsp;<\/p>\n\n\n\n<p>This challenge isn\u2019t just about model training. AI models are now often complemented by <em>grounding processes<\/em>, by which AI models are connected to real-time information to improve the accuracy and relevance of their outputs. One example of a grounding process is retrieval augmented generation (RAG), a technique that accesses web pages as part of a query to improve the accuracy and currency of the AI model\u2019s outputs. These kinds of enhancements require access to current information \u2013 like today\u2019s news \u2013 to validate or update responses that would otherwise be based on prior generations of training data. While some of these responses include citations or links to the source, publishers believe these technologies will result in (even) less traffic, (even) fewer ad dollars, and (even) fewer subscription conversions. Publishers also highlight the risk of brand erosion due to <a href=\"https:\/\/www.404media.co\/wikipedia-editors-adopt-speedy-deletion-policy-for-ai-slop-articles\/\">AI slop<\/a>, hallucinations, and misattribution (or lack of attribution) to the right news sources.&nbsp;<\/p>\n\n\n\n<p>This line of thinking doesn\u2019t even account for the likely adoption of an advertising-based business model for AI products. That would mean even more ad dollars migrating from publishers to AI firms. Google is already selling ads <a href=\"https:\/\/www.theverge.com\/2024\/10\/3\/24260637\/googles-ai-overview-ads-launch\">within AI Overviews<\/a>. Other AI firms are likely to adopt ad-based business models, as well, for two reasons: it\u2019s the business model the dominant platforms already <a href=\"https:\/\/business.google.com\/us\/think\/ai-excellence\/5-ways-ai-makes-google-search-work-harder-for-your-brand\/\">rely on<\/a>, and newer AI companies have <a href=\"https:\/\/futurism.com\/openai-trouble-subprime\">had little success<\/a> in attracting paid users. (Publishers\u2019 concerns also do not factor in Google\u2019s brand-new \u201c<a href=\"https:\/\/blog.google\/products\/search\/preferred-sources\/\">Preferred Sources<\/a>\u201d feature, which lets users \u201cselect their favorite sources\u201d to be placed most prominently in search results \u201cwhen those sources have published fresh and relevant content for your search.\u201d Preferred Sources may serve to further marginalize small and diverse news sources, as users are generally more familiar with major, national media brands.)<\/p>\n\n\n\n<p>Lastly, news publishers, like many others, are concerned about <a href=\"https:\/\/www.newsguardtech.com\/ai-monitor\/august-2025-ai-false-claim-monitor\/\">false information from chatbots<\/a> and the impact that \u201c<a href=\"https:\/\/www.techpolicy.press\/how-ai-driven-search-may-reshape-democracy-economics-and-human-agency\/\">pre-digested verdicts<\/a>\u201d to important queries, shaped by opaque algorithms and advertising- and engagement-based financial incentives, will have on our overall information environment.&nbsp;<\/p>\n\n\n\n<h3 class=\"heading-3 wp-block-heading\" id=\"h-early-impact-of-generative-ai-on-news-publishers\">Early Impact of Generative AI on News Publishers<\/h3>\n\n\n\n<p>Publishers aren\u2019t crazy \u2013 they\u2019re already reporting the damage generative AI and its offspring are causing to their cost structure and revenue. (Yes, it\u2019s fair to say these trends are getting media coverage in part because publishers are trying to make the case for protective legislation. But without data from AI firms to refute it, this is the story legislators are hearing and acting on. More on that later.)<\/p>\n\n\n\n<h4 class=\"heading-4 wp-block-heading\" id=\"h-alarmed-by-declines-in-traffic\"><strong>Alarmed by Declines in Traffic<\/strong><\/h4>\n\n\n\n<p>The emergence of new AI tools (like OpenAI\u2019s ChatGPT, Google\u2019s Gemini, Anthropic\u2019s Claude, and Perplexity) has resulted in more search referrals to many publishers. However, the increase in referrals is <a href=\"https:\/\/digiday.com\/media\/in-graphic-detail-ai-platforms-are-driving-more-traffic-but-not-enough-to-offset-zero-click-search\/\">not compensating<\/a> for a higher rate of zero-click searches derived from Google\u2019s AI-powered search overviews. (Search engine optimization or SEO agencies working on behalf of advertisers are <a href=\"https:\/\/www.searchenginejournal.com\/google-ai-overviews-found-in-74-of-problem-solving-queries\/538504\/\">obsessing<\/a> over precisely which keywords, industries, and geographies are more likely to trigger an AI overview. The consensus seems to be that they currently show up in ~20% of searches.) As predicted, links from AI-powered search overviews <a href=\"https:\/\/www.wsj.com\/tech\/ai\/google-ai-news-publishers-7e687141\">have plummeted<\/a> relative to traditional search queries (which have also become <a href=\"https:\/\/digiday.com\/media\/googles-latest-core-update-leaves-publishers-rattled-but-its-consequences-are-still-to-be-determined\/\">unpredictable<\/a>). Consumer search behavior also seems to be changing: New <a href=\"https:\/\/www.pewresearch.org\/short-reads\/2025\/07\/22\/google-users-are-less-likely-to-click-on-links-when-an-ai-summary-appears-in-the-results\/?ref=therebooting.com\">consumer research<\/a> shows that Google users who encounter an AI summary are 50% less likely to click on links to other websites than users who see a traditional search result. Why click on blue links if everything you need to know appears upon your query? Google users who encountered an AI summary also rarely \u2013 1% of the time \u2013 click on a link in the summary itself. And Google users are more likely to end their browsing session entirely after visiting a search page with an AI summary than on pages without a summary. New <a href=\"https:\/\/digitalcontentnext.org\/blog\/2025\/08\/14\/facts-googles-push-to-ai-hurts-publisher-traffic\/\">data from Digital Content Next\u2019s membership of 19 digital publishers<\/a> shows median year-over-year referral traffic from Google Search down 10% for the most recent eight-week period. News brands, which may still be able to cover breaking news in ways AI cannot, fell 7%. (Early in August, Google <a href=\"https:\/\/blog.google\/products\/search\/ai-search-driving-more-queries-higher-quality-clicks\/\">maintained<\/a> that \u201ctotal organic click volume\u201d is stable, but goes on to emphasize other measures such as \u201cclick quality,\u201d the presence of more links on the search engine results page, and the shift in traffic to different kinds of content. The company\u2019s post generated <a href=\"https:\/\/pressgazette.co.uk\/platforms\/google-search-clicks-traffic-2025-ai-overviews\/\">spirited feedback<\/a> from publishers and SEO experts in the U.S. and U.K.)<\/p>\n\n\n\n<h4 class=\"heading-4 wp-block-heading\" id=\"h-overwhelmed-by-new-traffic-from-crawlers-and-bots\"><strong>Overwhelmed by New Traffic from Crawlers and Bots<\/strong><\/h4>\n\n\n\n<p>Well upstream from traffic and ad revenue, publishers are taking on new costs as AI training data crawlers and bots overwhelm their systems.&nbsp;<\/p>\n\n\n\n<p>For example: TollBit was one of the first platforms to enable websites to monetize their content by charging AI companies and bots for access, so it has some of the most extensive history regarding AI crawler behavior. In its most recent quarterly \u201c<a href=\"https:\/\/tollbit.com\/bots\/25q1\/\">State of the Bots<\/a>\u201d report, TollBit reported that total AI user agent traffic among the TollBit customer network grew 87% from the last quarter of 2024 to the first quarter of 2025. This is likely due to higher rates of adoption of these tools among users. Within this total, for the first time, traffic from retrieval augmented generation bots exceeded traffic from training bots, growing at nearly 2.5x the rate of training bot traffic. If this trend continues, it will mean that supporting AI user agent traffic will be an ongoing<em> and increasing<\/em> cost for publishers as adoption grows. And as noted above, TollBit found that referral traffic from AI bots was still minuscule \u2013 just 0.04% of all external referrals to network sites in Q1 2025 \u2013 and nowhere near enough to offset the broader decline in traffic from traditional search sites.&nbsp;<\/p>\n\n\n\n<h4 class=\"heading-4 wp-block-heading\" id=\"h-frustrated-by-lack-of-control-over-access\"><strong>Frustrated by Lack of Control Over Access<\/strong><\/h4>\n\n\n\n<p>Publishers, faced with heavy scraping loads from AI firms but seeing little return in monetizable traffic, are increasingly pushing to assert greater control over how their content is used for training and real-time AI queries. This can be technically complex. For example, some of the largest AI products, like <a href=\"https:\/\/developers.google.com\/search\/docs\/appearance\/ai-features\">Google AI Overviews<\/a>, <a href=\"https:\/\/www.microsoft.com\/en-us\/bing\/copilot-search\/\">Microsoft Copilot<\/a> (Blogbot), and <a href=\"https:\/\/support.apple.com\/en-us\/119829\">Apple\u2019s AI tools<\/a> (Applebot), do not separate their AI user agents from their search ranking crawlers. Publishers risk losing all their visibility to platform users if they try to manage or block these firms from accessing their content. Publishers see the need to control how their content is used for AI training as a way to counter these technology companies\u2019 monopolistic power. But blocking search ranking crawlers can be business suicide.&nbsp;<\/p>\n\n\n\n<p>Other AI firms simply <a href=\"https:\/\/www.bloomberg.com\/news\/articles\/2024-08-15\/google-s-search-dominance-leaves-sites-little-choice-on-ai-scraping\">ignore the robots\u2019 exclusion protocol<\/a>, robots.txt, that publishers use to notify technology platforms that they do not wish to have their content crawled. TollBit\u2019s network data, for example, suggests that disallowing real-time scraping by retrieval augmentation bots via robots.txt has zero impact on the referrals the AI apps deliver \u2013 they\u2019re still crawling. AI firms may also be using third-party scrapers, <a href=\"https:\/\/blog.cloudflare.com\/perplexity-is-using-stealth-undeclared-crawlers-to-evade-website-no-crawl-directives\/\">stealth scrapers<\/a>, or masked user agents that continue to scrape sites despite the exclusion protocol. They may also pull cached content from search engines or <a href=\"https:\/\/www.theverge.com\/news\/757538\/reddit-internet-archive-wayback-machine-block-limit\">scrape it from the Internet Archive<\/a>. This has resulted in online publishers blocking the Internet Archive to avoid their content being scrapable from the <a href=\"https:\/\/archive.org\/\">Wayback Machine<\/a>. This means both publishers and AI firms \u2013 as well as internet users in general \u2013 lose important pieces of digital history.&nbsp;<\/p>\n\n\n\n<p>In <a href=\"https:\/\/publicknowledge.org\/is-there-a-middle-ground-in-the-tug-of-war-between-news-publishers-and-ai-firms-part-2-framing-solutions\/\">Part 2<\/a>, we describe strategies publishers are using to respond; inventory solutions that are emerging to empower them against the threat generative AI represents for their business models; and identify some promising policy solutions that preserve the benefits of the fair use doctrine while still preserving incentives for publishers to keep creating the timely content that AI developers need for their business model \u2013 and citizens need to stay informed.<\/p>\n","protected":false},"excerpt":{"rendered":"<p>The tug of war between online news publishers and AI developers on the role of copyrighted content in AI model training may lead to a more closed internet for everyone.<\/p>\n","protected":false},"author":189,"featured_media":37422,"parent":0,"comment_status":"closed","ping_status":"closed","sticky":false,"template":"","format":"standard","meta":{"_acf_changed":false,"footnotes":""},"categories":[5],"tags":[12,14,29],"class_list":["post-38306","post","type-post","status-publish","format-standard","has-post-thumbnail","hentry","category-insights","tag-copyright-reform","tag-platform-regulation","tag-trustworthy-ai"],"acf":[],"yoast_head":"<!-- This site is optimized with the Yoast SEO Premium plugin v26.5 (Yoast SEO v26.5) - https:\/\/yoast.com\/wordpress\/plugins\/seo\/ -->\n<title>Is There a Middle Ground in the Tug of War Between News Publishers and AI Firms? Part 1: Framing the Problem - Public Knowledge<\/title>\n<meta name=\"description\" content=\"The battle between news publishers and AI developers on using copyrighted material in generative AI model training may have some unintended consequences.\" \/>\n<meta name=\"robots\" content=\"index, follow, max-snippet:-1, max-image-preview:large, max-video-preview:-1\" \/>\n<link rel=\"canonical\" href=\"https:\/\/publicknowledge.org\/is-there-a-middle-ground-in-the-tug-of-war-between-news-publishers-and-ai-firms-part-1-framing-the-problem\/\" \/>\n<meta property=\"og:locale\" content=\"en_US\" \/>\n<meta property=\"og:type\" content=\"article\" \/>\n<meta property=\"og:title\" content=\"Is There a Middle Ground in the Tug of War Between News Publishers and AI Firms? Part 1: Framing the Problem\" \/>\n<meta property=\"og:description\" content=\"The battle between news publishers and AI developers on using copyrighted material in generative AI model training may have some unintended consequences.\" \/>\n<meta property=\"og:url\" content=\"https:\/\/publicknowledge.org\/is-there-a-middle-ground-in-the-tug-of-war-between-news-publishers-and-ai-firms-part-1-framing-the-problem\/\" \/>\n<meta property=\"og:site_name\" content=\"Public Knowledge\" \/>\n<meta property=\"article:published_time\" content=\"2025-09-22T17:13:08+00:00\" \/>\n<meta property=\"article:modified_time\" content=\"2025-09-22T17:13:13+00:00\" \/>\n<meta property=\"og:image\" content=\"https:\/\/publicknowledge.org\/wp-content\/uploads\/2024\/10\/Website-Pictures-1-1440x720.png\" \/>\n\t<meta property=\"og:image:width\" content=\"1440\" \/>\n\t<meta property=\"og:image:height\" content=\"720\" \/>\n\t<meta property=\"og:image:type\" content=\"image\/png\" \/>\n<meta name=\"author\" content=\"Lisa Macpherson\" \/>\n<meta name=\"twitter:card\" content=\"summary_large_image\" \/>\n<meta name=\"twitter:label1\" content=\"Written by\" \/>\n\t<meta name=\"twitter:data1\" content=\"Lisa Macpherson\" \/>\n\t<meta name=\"twitter:label2\" content=\"Est. reading time\" \/>\n\t<meta name=\"twitter:data2\" content=\"13 minutes\" \/>\n<script type=\"application\/ld+json\" class=\"yoast-schema-graph\">{\"@context\":\"https:\/\/schema.org\",\"@graph\":[{\"@type\":\"Article\",\"@id\":\"https:\/\/publicknowledge.org\/is-there-a-middle-ground-in-the-tug-of-war-between-news-publishers-and-ai-firms-part-1-framing-the-problem\/#article\",\"isPartOf\":{\"@id\":\"https:\/\/publicknowledge.org\/is-there-a-middle-ground-in-the-tug-of-war-between-news-publishers-and-ai-firms-part-1-framing-the-problem\/\"},\"author\":{\"name\":\"Lisa Macpherson\",\"@id\":\"https:\/\/publicknowledge.org\/#\/schema\/person\/757e28331d7a5e31a3290be1d16d219b\"},\"headline\":\"Is There a Middle Ground in the Tug of War Between News Publishers and AI Firms? Part 1: Framing the Problem\",\"datePublished\":\"2025-09-22T17:13:08+00:00\",\"dateModified\":\"2025-09-22T17:13:13+00:00\",\"mainEntityOfPage\":{\"@id\":\"https:\/\/publicknowledge.org\/is-there-a-middle-ground-in-the-tug-of-war-between-news-publishers-and-ai-firms-part-1-framing-the-problem\/\"},\"wordCount\":2746,\"publisher\":{\"@id\":\"https:\/\/publicknowledge.org\/#organization\"},\"image\":{\"@id\":\"https:\/\/publicknowledge.org\/is-there-a-middle-ground-in-the-tug-of-war-between-news-publishers-and-ai-firms-part-1-framing-the-problem\/#primaryimage\"},\"thumbnailUrl\":\"https:\/\/publicknowledge.org\/wp-content\/uploads\/2024\/10\/Website-Pictures-1.png\",\"keywords\":[\"Copyright Reform\",\"Platform Regulation\",\"Trustworthy AI\"],\"articleSection\":[\"Insights\"],\"inLanguage\":\"en-US\"},{\"@type\":\"WebPage\",\"@id\":\"https:\/\/publicknowledge.org\/is-there-a-middle-ground-in-the-tug-of-war-between-news-publishers-and-ai-firms-part-1-framing-the-problem\/\",\"url\":\"https:\/\/publicknowledge.org\/is-there-a-middle-ground-in-the-tug-of-war-between-news-publishers-and-ai-firms-part-1-framing-the-problem\/\",\"name\":\"Is There a Middle Ground in the Tug of War Between News Publishers and AI Firms? Part 1: Framing the Problem - Public Knowledge\",\"isPartOf\":{\"@id\":\"https:\/\/publicknowledge.org\/#website\"},\"primaryImageOfPage\":{\"@id\":\"https:\/\/publicknowledge.org\/is-there-a-middle-ground-in-the-tug-of-war-between-news-publishers-and-ai-firms-part-1-framing-the-problem\/#primaryimage\"},\"image\":{\"@id\":\"https:\/\/publicknowledge.org\/is-there-a-middle-ground-in-the-tug-of-war-between-news-publishers-and-ai-firms-part-1-framing-the-problem\/#primaryimage\"},\"thumbnailUrl\":\"https:\/\/publicknowledge.org\/wp-content\/uploads\/2024\/10\/Website-Pictures-1.png\",\"datePublished\":\"2025-09-22T17:13:08+00:00\",\"dateModified\":\"2025-09-22T17:13:13+00:00\",\"description\":\"The battle between news publishers and AI developers on using copyrighted material in generative AI model training may have some unintended consequences.\",\"breadcrumb\":{\"@id\":\"https:\/\/publicknowledge.org\/is-there-a-middle-ground-in-the-tug-of-war-between-news-publishers-and-ai-firms-part-1-framing-the-problem\/#breadcrumb\"},\"inLanguage\":\"en-US\",\"potentialAction\":[{\"@type\":\"ReadAction\",\"target\":[\"https:\/\/publicknowledge.org\/is-there-a-middle-ground-in-the-tug-of-war-between-news-publishers-and-ai-firms-part-1-framing-the-problem\/\"]}]},{\"@type\":\"ImageObject\",\"inLanguage\":\"en-US\",\"@id\":\"https:\/\/publicknowledge.org\/is-there-a-middle-ground-in-the-tug-of-war-between-news-publishers-and-ai-firms-part-1-framing-the-problem\/#primaryimage\",\"url\":\"https:\/\/publicknowledge.org\/wp-content\/uploads\/2024\/10\/Website-Pictures-1.png\",\"contentUrl\":\"https:\/\/publicknowledge.org\/wp-content\/uploads\/2024\/10\/Website-Pictures-1.png\",\"width\":2000,\"height\":1000},{\"@type\":\"BreadcrumbList\",\"@id\":\"https:\/\/publicknowledge.org\/is-there-a-middle-ground-in-the-tug-of-war-between-news-publishers-and-ai-firms-part-1-framing-the-problem\/#breadcrumb\",\"itemListElement\":[{\"@type\":\"ListItem\",\"position\":1,\"name\":\"Home\",\"item\":\"https:\/\/publicknowledge.org\/\"},{\"@type\":\"ListItem\",\"position\":2,\"name\":\"Is There a Middle Ground in the Tug of War Between News Publishers and AI Firms? Part 1: Framing the Problem\"}]},{\"@type\":\"WebSite\",\"@id\":\"https:\/\/publicknowledge.org\/#website\",\"url\":\"https:\/\/publicknowledge.org\/\",\"name\":\"Public Knowledge\",\"description\":\"\",\"publisher\":{\"@id\":\"https:\/\/publicknowledge.org\/#organization\"},\"potentialAction\":[{\"@type\":\"SearchAction\",\"target\":{\"@type\":\"EntryPoint\",\"urlTemplate\":\"https:\/\/publicknowledge.org\/?s={search_term_string}\"},\"query-input\":{\"@type\":\"PropertyValueSpecification\",\"valueRequired\":true,\"valueName\":\"search_term_string\"}}],\"inLanguage\":\"en-US\"},{\"@type\":\"Organization\",\"@id\":\"https:\/\/publicknowledge.org\/#organization\",\"name\":\"Public Knowledge\",\"url\":\"https:\/\/publicknowledge.org\/\",\"logo\":{\"@type\":\"ImageObject\",\"inLanguage\":\"en-US\",\"@id\":\"https:\/\/publicknowledge.org\/#\/schema\/logo\/image\/\",\"url\":\"https:\/\/publicknowledge.org\/wp-content\/uploads\/2021\/12\/pk_social_logo-2.png\",\"contentUrl\":\"https:\/\/publicknowledge.org\/wp-content\/uploads\/2021\/12\/pk_social_logo-2.png\",\"width\":400,\"height\":200,\"caption\":\"Public Knowledge\"},\"image\":{\"@id\":\"https:\/\/publicknowledge.org\/#\/schema\/logo\/image\/\"}},{\"@type\":\"Person\",\"@id\":\"https:\/\/publicknowledge.org\/#\/schema\/person\/757e28331d7a5e31a3290be1d16d219b\",\"name\":\"Lisa Macpherson\",\"image\":{\"@type\":\"ImageObject\",\"inLanguage\":\"en-US\",\"@id\":\"https:\/\/publicknowledge.org\/#\/schema\/person\/image\/\",\"url\":\"https:\/\/secure.gravatar.com\/avatar\/a7ccfea9aef75381949570e9237ff7f0ef0efcd0f80308496086b5b8f6a2989e?s=96&d=mm&r=g\",\"contentUrl\":\"https:\/\/secure.gravatar.com\/avatar\/a7ccfea9aef75381949570e9237ff7f0ef0efcd0f80308496086b5b8f6a2989e?s=96&d=mm&r=g\",\"caption\":\"Lisa Macpherson\"},\"url\":\"https:\/\/publicknowledge.org\/author\/lisa-macpherson\/\"}]}<\/script>\n<!-- \/ Yoast SEO Premium plugin. -->","yoast_head_json":{"title":"Is There a Middle Ground in the Tug of War Between News Publishers and AI Firms? Part 1: Framing the Problem - Public Knowledge","description":"The battle between news publishers and AI developers on using copyrighted material in generative AI model training may have some unintended consequences.","robots":{"index":"index","follow":"follow","max-snippet":"max-snippet:-1","max-image-preview":"max-image-preview:large","max-video-preview":"max-video-preview:-1"},"canonical":"https:\/\/publicknowledge.org\/is-there-a-middle-ground-in-the-tug-of-war-between-news-publishers-and-ai-firms-part-1-framing-the-problem\/","og_locale":"en_US","og_type":"article","og_title":"Is There a Middle Ground in the Tug of War Between News Publishers and AI Firms? Part 1: Framing the Problem","og_description":"The battle between news publishers and AI developers on using copyrighted material in generative AI model training may have some unintended consequences.","og_url":"https:\/\/publicknowledge.org\/is-there-a-middle-ground-in-the-tug-of-war-between-news-publishers-and-ai-firms-part-1-framing-the-problem\/","og_site_name":"Public Knowledge","article_published_time":"2025-09-22T17:13:08+00:00","article_modified_time":"2025-09-22T17:13:13+00:00","og_image":[{"width":1440,"height":720,"url":"https:\/\/publicknowledge.org\/wp-content\/uploads\/2024\/10\/Website-Pictures-1-1440x720.png","type":"image\/png"}],"author":"Lisa Macpherson","twitter_card":"summary_large_image","twitter_misc":{"Written by":"Lisa Macpherson","Est. reading time":"13 minutes"},"schema":{"@context":"https:\/\/schema.org","@graph":[{"@type":"Article","@id":"https:\/\/publicknowledge.org\/is-there-a-middle-ground-in-the-tug-of-war-between-news-publishers-and-ai-firms-part-1-framing-the-problem\/#article","isPartOf":{"@id":"https:\/\/publicknowledge.org\/is-there-a-middle-ground-in-the-tug-of-war-between-news-publishers-and-ai-firms-part-1-framing-the-problem\/"},"author":{"name":"Lisa Macpherson","@id":"https:\/\/publicknowledge.org\/#\/schema\/person\/757e28331d7a5e31a3290be1d16d219b"},"headline":"Is There a Middle Ground in the Tug of War Between News Publishers and AI Firms? Part 1: Framing the Problem","datePublished":"2025-09-22T17:13:08+00:00","dateModified":"2025-09-22T17:13:13+00:00","mainEntityOfPage":{"@id":"https:\/\/publicknowledge.org\/is-there-a-middle-ground-in-the-tug-of-war-between-news-publishers-and-ai-firms-part-1-framing-the-problem\/"},"wordCount":2746,"publisher":{"@id":"https:\/\/publicknowledge.org\/#organization"},"image":{"@id":"https:\/\/publicknowledge.org\/is-there-a-middle-ground-in-the-tug-of-war-between-news-publishers-and-ai-firms-part-1-framing-the-problem\/#primaryimage"},"thumbnailUrl":"https:\/\/publicknowledge.org\/wp-content\/uploads\/2024\/10\/Website-Pictures-1.png","keywords":["Copyright Reform","Platform Regulation","Trustworthy AI"],"articleSection":["Insights"],"inLanguage":"en-US"},{"@type":"WebPage","@id":"https:\/\/publicknowledge.org\/is-there-a-middle-ground-in-the-tug-of-war-between-news-publishers-and-ai-firms-part-1-framing-the-problem\/","url":"https:\/\/publicknowledge.org\/is-there-a-middle-ground-in-the-tug-of-war-between-news-publishers-and-ai-firms-part-1-framing-the-problem\/","name":"Is There a Middle Ground in the Tug of War Between News Publishers and AI Firms? Part 1: Framing the Problem - Public Knowledge","isPartOf":{"@id":"https:\/\/publicknowledge.org\/#website"},"primaryImageOfPage":{"@id":"https:\/\/publicknowledge.org\/is-there-a-middle-ground-in-the-tug-of-war-between-news-publishers-and-ai-firms-part-1-framing-the-problem\/#primaryimage"},"image":{"@id":"https:\/\/publicknowledge.org\/is-there-a-middle-ground-in-the-tug-of-war-between-news-publishers-and-ai-firms-part-1-framing-the-problem\/#primaryimage"},"thumbnailUrl":"https:\/\/publicknowledge.org\/wp-content\/uploads\/2024\/10\/Website-Pictures-1.png","datePublished":"2025-09-22T17:13:08+00:00","dateModified":"2025-09-22T17:13:13+00:00","description":"The battle between news publishers and AI developers on using copyrighted material in generative AI model training may have some unintended consequences.","breadcrumb":{"@id":"https:\/\/publicknowledge.org\/is-there-a-middle-ground-in-the-tug-of-war-between-news-publishers-and-ai-firms-part-1-framing-the-problem\/#breadcrumb"},"inLanguage":"en-US","potentialAction":[{"@type":"ReadAction","target":["https:\/\/publicknowledge.org\/is-there-a-middle-ground-in-the-tug-of-war-between-news-publishers-and-ai-firms-part-1-framing-the-problem\/"]}]},{"@type":"ImageObject","inLanguage":"en-US","@id":"https:\/\/publicknowledge.org\/is-there-a-middle-ground-in-the-tug-of-war-between-news-publishers-and-ai-firms-part-1-framing-the-problem\/#primaryimage","url":"https:\/\/publicknowledge.org\/wp-content\/uploads\/2024\/10\/Website-Pictures-1.png","contentUrl":"https:\/\/publicknowledge.org\/wp-content\/uploads\/2024\/10\/Website-Pictures-1.png","width":2000,"height":1000},{"@type":"BreadcrumbList","@id":"https:\/\/publicknowledge.org\/is-there-a-middle-ground-in-the-tug-of-war-between-news-publishers-and-ai-firms-part-1-framing-the-problem\/#breadcrumb","itemListElement":[{"@type":"ListItem","position":1,"name":"Home","item":"https:\/\/publicknowledge.org\/"},{"@type":"ListItem","position":2,"name":"Is There a Middle Ground in the Tug of War Between News Publishers and AI Firms? Part 1: Framing the Problem"}]},{"@type":"WebSite","@id":"https:\/\/publicknowledge.org\/#website","url":"https:\/\/publicknowledge.org\/","name":"Public Knowledge","description":"","publisher":{"@id":"https:\/\/publicknowledge.org\/#organization"},"potentialAction":[{"@type":"SearchAction","target":{"@type":"EntryPoint","urlTemplate":"https:\/\/publicknowledge.org\/?s={search_term_string}"},"query-input":{"@type":"PropertyValueSpecification","valueRequired":true,"valueName":"search_term_string"}}],"inLanguage":"en-US"},{"@type":"Organization","@id":"https:\/\/publicknowledge.org\/#organization","name":"Public Knowledge","url":"https:\/\/publicknowledge.org\/","logo":{"@type":"ImageObject","inLanguage":"en-US","@id":"https:\/\/publicknowledge.org\/#\/schema\/logo\/image\/","url":"https:\/\/publicknowledge.org\/wp-content\/uploads\/2021\/12\/pk_social_logo-2.png","contentUrl":"https:\/\/publicknowledge.org\/wp-content\/uploads\/2021\/12\/pk_social_logo-2.png","width":400,"height":200,"caption":"Public Knowledge"},"image":{"@id":"https:\/\/publicknowledge.org\/#\/schema\/logo\/image\/"}},{"@type":"Person","@id":"https:\/\/publicknowledge.org\/#\/schema\/person\/757e28331d7a5e31a3290be1d16d219b","name":"Lisa Macpherson","image":{"@type":"ImageObject","inLanguage":"en-US","@id":"https:\/\/publicknowledge.org\/#\/schema\/person\/image\/","url":"https:\/\/secure.gravatar.com\/avatar\/a7ccfea9aef75381949570e9237ff7f0ef0efcd0f80308496086b5b8f6a2989e?s=96&d=mm&r=g","contentUrl":"https:\/\/secure.gravatar.com\/avatar\/a7ccfea9aef75381949570e9237ff7f0ef0efcd0f80308496086b5b8f6a2989e?s=96&d=mm&r=g","caption":"Lisa Macpherson"},"url":"https:\/\/publicknowledge.org\/author\/lisa-macpherson\/"}]}},"_links":{"self":[{"href":"https:\/\/publicknowledge.org\/wp-json\/wp\/v2\/posts\/38306","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/publicknowledge.org\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/publicknowledge.org\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/publicknowledge.org\/wp-json\/wp\/v2\/users\/189"}],"replies":[{"embeddable":true,"href":"https:\/\/publicknowledge.org\/wp-json\/wp\/v2\/comments?post=38306"}],"version-history":[{"count":0,"href":"https:\/\/publicknowledge.org\/wp-json\/wp\/v2\/posts\/38306\/revisions"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/publicknowledge.org\/wp-json\/wp\/v2\/media\/37422"}],"wp:attachment":[{"href":"https:\/\/publicknowledge.org\/wp-json\/wp\/v2\/media?parent=38306"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/publicknowledge.org\/wp-json\/wp\/v2\/categories?post=38306"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/publicknowledge.org\/wp-json\/wp\/v2\/tags?post=38306"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}