Blog Strategy 29 June 2026 10 min read

Your competitor
isn’t a paper.
It’s the feed.

Editors benchmark their newsroom against the paper across town. The reader never does. The reader has nearly seven hours online a day, forty-seven seconds of patience, and an infinite feed engineered to keep both. The competitor isn’t another publisher — it’s the feed. Audio is the one surface that reaches the minutes it can’t.

Every newsroom keeps a competitive set. The rival paper. The rival site. The outlet that broke the story first. The morning meeting measures the day against them. It’s the wrong scoreboard.

The reader doesn’t choose your article over a rival article. The reader chooses your article over a video, a group chat, a game, and a feed designed by some of the best-funded engineering teams on earth to never end. Average attention on any screen is now 47 seconds[1] before the swipe. The average news article needs three and a half minutes. The competitor isn’t the other paper. The competitor is the scroll.

This is what the attention economy actually means for a publisher. Not a slogan — a market structure. Your content competes for a fixed, shrinking budget of human attention against opponents who don’t play by editorial rules. The only honest question is which minutes of the reader’s day you can still win. Written from inside BotTalk, the orchestration layer running audio for thirty European newsrooms today.

You’re not losing readers to other publishers

The comforting story is that audiences are fragmenting across more outlets. The real story is that audiences are leaving text entirely. 44% of 18–24s now name social media or video platforms — not a news brand — as their main source of news[2]. The competitive set isn’t a list of mastheads. It’s a list of apps: TikTok, YouTube, Instagram, a messaging thread. And the scale of the opponent is the part editors underrate — the average TikTok user now spends roughly 40 hours a month inside one app, more than thirteen hours beyond YouTube[5]. That is the gravity well your three-and-a-half-minute article is pulling against.

The reader brings almost no patience to the fight. People spend close to seven hours a day online, nearly four of them on a phone[4], and give any single screen about forty-seven seconds before the thumb moves. Worse for a publisher chasing that screen: up to 85% of mobile-feed video is watched with the sound off[3]. The attention there is shallow, captioned, muted, and gone in a swipe.

Out-scrolling TikTok with a faster-loading article page is bringing a newspaper to a feed fight. You don’t win the screen war. You change the battlefield.

The screen war versus the background Two attention surfaces. The top surface, the screen, is a single bar divided between TikTok, YouTube, Instagram, messaging, and games, with news reduced to a thin coral sliver at the end — the war every publisher fights and loses, where 85% of video is muted and patience runs out at 47 seconds. The bottom surface, the background, is an open track running through the reader's screen-down moments — commute, kitchen, gym, walk, chores — where the feed has no product and audio wins uncontested. THE SCREEN · WHERE EVERY PUBLISHER FIGHTS — AND LOSES TIKTOK YOUTUBE INSTAGRAM CHAT GAMES NEWS — THIS IS YOU EYES LOCKED · 85% MUTED · 47-SECOND PATIENCE THE BACKGROUND · WHERE AUDIO WINS UNCONTESTED AUDIO COMMUTE KITCHEN GYM WALK CHORES PHONE DOWN · EARS OPEN · THE FEED HAS NO PRODUCT HERE STOP CONTESTING THE SLIVER · OWN THE OPEN SURFACE
Figure 1 · Two attention surfaces. Top: the screen, where news is a coral sliver against TikTok, YouTube, and the feed. Bottom: the background, the screen-down hours where audio runs uncontested.

Audio wins the hours the feed can’t reach

Here is the asymmetry the feed can’t close: the screen demands eyes. Audio only needs ears. And the reader has hours every day when their eyes are busy and their ears are free.

The commute. The kitchen. The gym. The school run. The walk to the station. The phone is in a pocket, face-down, screen-dark — and the muted feed has nothing to say to a dark screen. These are the background hours. TikTok can’t fill them. YouTube can’t fill them. A text article certainly can’t. Audio can.

This is why audio isn’t a smaller version of the screen war. It’s a different surface entirely — one the publisher can still own outright. Not Google. Not Meta. Not OpenAI. The reader who bounced off your article at forty-seven seconds listens to the same article for four minutes when it’s read to them in the car. The competitor never followed them there.

Three moves to win the background

1. Stop benchmarking the screen. Benchmark the day.

The first move is a change of scoreboard. Stop asking “did we beat the rival site’s pageviews” and start asking “how many of the reader’s screen-down hours did we reach.” Those are different questions with different answers. Pageviews measure the war you’re losing. Listening hours measure the one you can win.

A publisher with audio articles and a podcast feed competes for the commute, the workout, and the chores — surfaces where the feed has no product at all. That’s not a marginal gain on the existing battle. It’s a second front the competitor isn’t fighting on.

2. Make the article listenable everywhere the reader already is

Background attention is portable. Your audio has to be too. An audio player locked to the article page only wins the on-site minute. To win the background, the same article has to be available where the reader’s ears actually are: a per-publisher podcast feed, auto-generated, shipping to Apple Podcasts, Spotify, and the open ecosystem.

The reader who listened to two pieces on your site subscribes to the feed. The next morning your editor’s voice is the first thing in their car. That’s the moment the publisher beats TikTok — not on the screen, but in the fifteen minutes TikTok was never in the room. And the surface is real and growing: 40% of people now listen to a podcast every week, up from 15% in 2017[6]. The background isn’t a niche. It’s the fastest-growing block of attention left. For the engagement mechanics underneath it, see our piece on fuelling long sessions with audio articles.

3. Make the voice worth staying for

Background listening is forgiving of distraction and unforgiving of friction. A mispronounced name, a robotic cadence, a tonal mismatch — each one is a cue to reach for the phone and open the feed. Long sessions are a function of how rarely the listener has to forgive the audio.

That means two things. A named, trusted voice — the editor the reader came for, not a generic provider default — and a pre-synthesis quality engine that catches the numbers, names, and dialect failures before a single voice model speaks. The reader stays because the audio is worth staying for. The session compounds. The feed waits, unopened.

The feed owns the eyes. Audio owns the ears. Stop fighting for the wrong sense.

What the background is worth

This isn’t a thought experiment. It’s the operating reality across the BotTalk network. Numbers from June 2026:

47s
Attention before the swipe (UC Irvine)
44%
Under-25s whose main news source is social or video (Reuters)
24K
Hours of attention BotTalk captures daily — in the background
20M
Monthly listeners who chose the ear over the scroll
  • 24,000 hours of attention captured per day across the network — almost all of it in the background hours the feed never reaches.
  • 6,000 articles narrated per day across 30 publishers, each one now listenable in the car, the kitchen, the gym.
  • 20M monthly listeners who chose the ear over the scroll.
  • 5 AI voice providers orchestrated behind one policy so the voice always fits the moment — and never gives the listener a reason to reach for the feed.

These are BotTalk production figures, June 2026 — verifiable on request under the standard audit clause in our customer contracts.

Audio doesn’t replace the screen. Breaking news, the live blog, the visual scoop — those still belong to the feed’s own turf, and chasing audio there is the wrong fight. The screen war is real, and publishers are losing it. But the background is open, almost no one is contesting it, and it is a surface the publisher can still own. The attention economy didn’t end the publisher’s business. It moved it.

Two newsrooms that changed the scoreboard

Lena Kaiser, Head of Product at taz

“Audio gave the digital app a human face. We cloned the voices of our own colleagues — and TTS became the killer argument to keep the app. Seventy percent of our readers now listen rather than read.”

Lena Kaiser Head of Product · taz
Read the taz case study
Alexander Ottitzky, CTO at heute.at

“Plug and play from day one — no extensive config. The Austrian accent was make-or-break for us, and BotTalk got it right. Pricing has stayed predictable, unlike the other providers we tested.”

Alexander Ottitzky CTO · heute.at
Read the heute.at case study

Two newsrooms. Both stopped trying to out-scroll the feed. Both went where the feed couldn’t follow.

A four-question audit for your competitive set

Before the next strategy meeting benchmarks you against the wrong opponent:

  1. Who is actually taking the reader’s time? If the answer is “the rival site,” look again. It’s the feed. Plan for the real competitor.
  2. Which screen-down hours do you reach today? The commute, the kitchen, the gym. If the answer is “none,” that’s the open front.
  3. Is the article listenable off your site? No podcast feed means you’re still fighting on the screen, where you lose. Ship the feed.
  4. Is the voice worth staying for? A generic default voice and raw copy straight to a model hands the listener a reason to open the feed instead.

Four questions. Ten minutes. Most “how do we compete with TikTok” strategies never ask the one that matters: which sense are you fighting for.

Frequently asked

Six questions before you benchmark the wrong competitor.

Who is a publisher’s real competitor in the attention economy?

Not another newsroom. The reader’s attention is a fixed, shrinking budget spent mostly on TikTok, YouTube, Instagram, messaging, and games — not on choosing between two news brands. Up to 44% of 18–24s name social or video platforms, not a publisher, as their main source of news. The competitive set is a list of apps, not a list of mastheads.

Why can’t publishers just compete with TikTok and YouTube on screen?

Because the screen attention is already lost and shallow — up to 85% of mobile-feed video is watched on mute, and the feeds are engineered by some of the best-funded teams in tech to be endless. Out-scrolling them with a faster article page doesn’t work. Audio changes the battlefield to a sense the feed doesn’t compete for.

What are background hours and why do they matter to publishers?

Background hours are the parts of the day when the reader’s eyes are busy but their ears are free: the commute, the kitchen, the gym, the walk, the chores. The feed can’t fill them — its product needs a screen. Audio can. It’s the one large block of daily attention a publisher can still win outright.

How does audio help a publisher win attention back from the feed?

A reader who bounces off a text article in 47 seconds will listen to the same article for around four minutes when it’s narrated. A podcast feed then reaches them in the car or at the gym, where the feed has no presence. Across the BotTalk network, audio captures roughly 24,000 hours of attention per day, almost all in those background moments.

Does a publisher need a podcast feed, or is an on-site player enough?

Both, but the feed is what wins the background. An on-site player captures the on-site minute; the podcast feed captures the commute and the workout — far longer sessions in surfaces the feed never reaches. With 40% of people now listening to a podcast weekly, shipping only the on-site player leaves the largest block of winnable attention on the table.

Why does voice quality matter for winning attention?

Background listening is unforgiving of friction. A mispronounced name or a robotic cadence is a cue to reach for the phone and open the feed. A named, trusted voice plus a pre-synthesis quality engine keeps the listener in the session instead of handing them back to the competitor.

Sources

The research behind the numbers.

  1. [1] · UC Irvine · 2023

    Gloria Mark, Can’t pay attention? You’re not alone (University of California), tied to her book Attention Span. Twenty years of screen-attention logging: average attention on any screen fell from ~2.5 minutes in 2004 to 75 seconds in 2012 to 47 seconds in recent years.

    universityofcalifornia.edu ↗
  2. [2] · Reuters Institute · 2025

    Reuters Institute for the Study of Journalism, Oxford, Digital News Report 2025. Across 48 markets, 44% of 18–24s name social media and video networks as their main source of news; in the US, social and video (54%) overtook TV (50%) and news sites (48%) for the first time.

    reutersinstitute.politics.ox.ac.uk ↗
  3. [3] · Digiday · 2016

    Digiday, 85 percent of Facebook video is watched without sound. Publisher-reported finding from 2016: up to 85% of mobile-feed video views happen with the sound off — the canonical citation for the silent-video pattern that still governs feed consumption.

    digiday.com ↗
  4. [4] · DataReportal · 2025

    DataReportal (Kepios, with GWI data), Digital 2025: Global Overview Report. Internet users average 6 hours 38 minutes online per day, roughly 3 hours 46 minutes of it on mobile — the best-sourced figure for daily time online and the mobile share of it.

    datareportal.com ↗
  5. [5] · data.ai / Sensor Tower · 2024

    data.ai (now Sensor Tower), State of Mobile 2024. The average TikTok user spends ~40 hours per month in-app — up from 11 hours/month in early 2020, and about 13 hours/month more than the average YouTube user.

    sensortower.com ↗
  6. [6] · Edison Research · 2025

    Edison Research, The Infinite Dial 2025. 70% of Americans 12+ have listened to a podcast; 55% are monthly listeners and 40% are weekly — the weekly figure up from 15% in 2017, a record high in the industry’s longest-running consumer-audio study.

    edisonresearch.com ↗
Dr. Andrey Esaulov, co-founder and CEO of BotTalk

About the author

Dr. Andrey Esaulov

Co-founder & CEO · BotTalk

Andrey holds a doctorate in linguistics, and before founding BotTalk he spent more than six years leading a department at Axel Springer — one of the largest publishing houses in Europe. BotTalk now runs audio production for 30+ European newsrooms, including taz, heute.at, Tamedia, and Mediengruppe Pressedruck. Andrey writes about the attention economy, voice infrastructure, and the orchestration layer above commercial AI.

Reach Andrey directly: [email protected] · LinkedIn.

Article last reviewed by the author: . The attention-economy, news-consumption, and podcast-adoption references in the Sources section are re-verified on each material update.

Benchmark the right competitor

Talk to Andrey.

No deck. No pitch. Your CMS, your audience, your editorial voice. The background-hours walkthrough on your own articles. Thirty minutes. One call.