Inside Mel Robbins’ Audio-First Strategy For The Video Podcast Era.
- Inside Audio Marketing

- 6 hours ago
- 4 min read

Mel Robbins sees video as podcasting’s next major growth opportunity, but she is also warning that the rush to put cameras in every studio should not obscure a basic reality: audio and video are different products, serving audiences with different habits and expectations.
Speaking at Cannes Lions festival last week, the host of “The Mel Robbins Podcast” said she is enthusiastic about the arrival of video across podcast platforms and the bigger marketing opportunities it may create. But she said the operational and technical realities are becoming increasingly complicated, particularly as platforms build video products around their own business models.
“For us, video has always been a thing that was on YouTube, and it was its own thing,” Robbins said. “We’re an audio-first production.”
That distinction matters. Robbins said her company does not simply put a camera on an audio recording and distribute the same product everywhere. Instead, it creates separate versions and workflows for audio, video and short-form social distribution, based on how consumers use each format.
Audio listeners are often in cars, walking, working or doing household tasks, she said. They are more receptive to a longer introduction, repetition and recaps that help them rejoin a conversation after an interruption. Video viewers, by contrast, have less patience for a slow opening.
“If I do a long welcome-in on video, people drop off immediately,” Robbins said. “Video people hate that stuff.”
The contrast is one reason why she believes podcast producers need to be more deliberate as video becomes a bigger part of the business. A visual version of a show cannot simply be treated as an add-on, she said. It needs its own pacing, structure and editorial logic.
For Robbins, the answer is not to chase every platform or trend indiscriminately. It is to remain disciplined about the audience and the purpose of the work.
She said her company uses a three-part framework — “You, One, Many” — to guide every business and editorial decision. The first step is defining who the creator is, what they value and what the work is intended to accomplish. The second is identifying the one person that the content is meant to help, entertain or inspire. The third is recognizing that broader audience growth follows when that person finds the content useful enough to share.
“You don’t get the many without truly serving one,” Robbins said. She said the audience for her show is not built around a narrow demographic profile. Instead, she thinks about the person in front of her in the grocery store line: someone who may be overwhelmed, trying to do better and looking for useful, practical encouragement.
That philosophy shapes a production process Robbins said can require 100 hours or more for a single episode. Her team doesn’t approach the show as a casual conversation that listeners are invited to overhear. Each episode is constructed around a specific listener need.
“We do not just roll up to a mic, tape a conversation, and allow you to eavesdrop,” Robbins said. “We do something very different.”
The team begins with audience research, including sentiment analysis, online polls, feedback from the show’s inbox and signals from Robbins’ 40 million social followers. It also looks at search behavior, prior interviews with prospective guests and viewing behavior on YouTube, including moments when viewers replay or stop watching.
The research helps determine not only what topics should be covered, but also which guests are capable of communicating ideas in a way that is useful to a mainstream audience. Before a guest arrives at the company’s Boston studio, the team may have already spent 30 to 50 hours researching and shaping the conversation. Guests also participate in a required pre-interview that typically lasts two hours.
“We’ve reverse-engineered the questions,” Robbins said. “We’re trying to engineer a conversation for impact.”
Robbins said her team has even designed the studio process around the listener’s presence. During guest preparation, she tells interviewees that there are three people in the conversation: the host, the guest and the listener. Two cameras and an empty chair represent that third person.
“We want the tone to be that we’re here for you,” she said.
The interviews themselves can last three or four hours. Then comes another major production effort creating video episodes and social media clips.
Robbins also programs episodes dynamically rather than locking into a fixed release calendar. Robbins said the team may have 15 or 20 completed episodes ready to go, but it does not guarantee guests a release date. Instead, they continuously evaluate audience sentiment and the response to recent episodes to decide what subject is most appropriate, pointing to the importance of understanding the emotional state of the audience.
“There is no way a productivity episode, no matter how good it is, would work right now,” said Robbins, explaining audience research recently found “exhausted” is the one word coming up more than any other. That reinforced her belief that creators and marketers need to make their messages simpler, not more complicated. “It has to be simple, because if it’s not sticky and not memorable, it’s useless to people,” she said.
She also cautioned against becoming overly driven by download data. Robbins said she doesn’t have direct access to the show’s back-end performance numbers because she does not want editorial choices to become a contest between immediate audience metrics and long-term impact.
Instead, the company distributes a daily email to the team featuring listener comments. The purpose is to remind everyone involved that a single person is on the other end of every episode, clip and campaign.
“That is the person that we’re showing up for,” she said.




Comments