Hi there! My name is Janko Roettgers, and this is Lowpass. This week: Why phones may be better for AR than headsets, and how Meta makes video work globally.
After working on the Vision Pro, this AR veteran is going back to phones
When Mark Drummond was working on the Vision Pro at Apple, he had a bit of an epiphany that didn’t really fit Cupertino’s preferred narrative.
Drummond was managing the Character Intelligence Team, which among other things built the Encounter Dinosaurs demo. Preinstalled on the Vision Pro, the demo puts viewers eye to eye with interactive prehistoric creatures.
“We built that with Jon Favreau,” Drummond says, referring to Apple’s long-standing partnership with the Mandalorian director. “Before the headset was available to us to take down to Burbank, we used iPhones and iPads,” he recalls. Relying on mobile devices for demos made sense. VisionOS is essentially an iPadOS fork. “It did actually work out. We had a really good experience with iPhone and iPad,” he says.
Through that experience, Drummond realized that these mobile devices weren’t just decent stand-ins for the headset. “What we learned looking for sources of surprise and delight with interactive characters in mixed reality is that the headset is actually not the best [device] for this kind of thing,” he says.
“I still think it's a pretty fabulous piece of hardware,” Drummond says. However, headsets can also be alienating, and separate viewers from the world and from the people around them. “It's kind of lonely,” Drummond says. Having an AR app on a phone, on the other hand, makes it much easier to show it to others. “People can lean in over your shoulder,” Drummond says.
That’s why, shortly after leaving Apple in 2023, Drummond embraced mobile devices for augmented reality storytelling. For the past two and a half years, he has been working on a new iPhone AR app called Pixi that’s all about mobile-first interactive storytelling.
In short, Pixi is working on something that could best be described as the AR version of the email greeting card. When it launches in the coming weeks, Pixi will let anyone pick an interactive character and a scenario, add a personalized message, and then send it to their contacts via iMessage or WhatsApp. Once the recipient opens such a Pixi message, the character appears overlaid in the camera view of their real-world environment, and interacts with them.
During a recent demo, Pixi’s AR experiences included an animated cat and a robot. They could tell jokes, play tic-tac-toe with you, or challenge you to a game of whack-a-mole, right on your desk.
At first glance, none of it seemed especially groundbreaking if you’ve ever played with an AR Snapchat filter or any other mobile AR app. “If you're not slightly embarrassed by the first product, you launched way too [late],” Drummond says, suggesting that the company will incorporate additional characters and more advanced storylines in the future.
However, Pixi is incorporating some interesting interactive elements: The cat comedian, for instance, delivers one dad joke after another until the app detects the viewer cracking a smile via the phone’s front-facing camera. After that, the character thanks the viewer for their attention like a comedian ending their show and delivers a personalized greeting message.
“In order to make a character feel like it's present, it has to pay attention,” Drummond says. That includes not just paying attention to the viewer, but also the environment they are in. “This kind of attention is only possible through on-device AI,” he says. Pixi uses AI and machine learning to recognize facial expressions, and the app also downloads custom ML models on the fly to recognize objects and then incorporate them into a story.
When mobile AR first became popular with Snapchat filters and ARKit-based apps, developers had very little to work with when it came to interacting with a viewer’s real-world environment. Basically, apps would detect surfaces and just put any virtual object on top of those surfaces. Over time, AR apps became smarter about occlusion, allowing characters to hide behind real-world objects. Eventually, some even offered rudimentary object detection.
With recent advancements in AI, there’s a potential to supercharge object recognition, to the point where apps will be able to recognize classes of objects, understand how they work, and incorporate them into interactive experiences. Here’s how Drummond envisions Pixi’s AR greeting cards will work in the future:
“Let's say that I have a friend [who is] a lawyer. She got promoted, and I want to say congratulations. I send her a [Pixi with a] golden retriever, which appears on the office floor. It looks at her, then starts sniffing. It walks over to a filing cabinet, and is very focused on a drawer in that filing cabinet. She thinks: Dog, that’s just old tax returns. The dog will not relent. It looks, sniffs, looks winsome. She opens the drawer. Inside, we've hidden a huge virtual dog biscuit [with the message]: ‘Congratulations, see you Friday.’”
Would all this be technically possible with a Vision Pro app as well? Perhaps. But there are plenty of other reasons holding the headset back — including that $3,500 price tag. “It’s expensive,” Drummond acknowledges, adding, “I still think it's a steal given how much amazing technology you've got, just those displays alone.”
However, the high price of the device has also held back adoption, with IDC estimating that the company shipped just 45,000 units during last year’s holiday quarter. “It doesn't have great market penetration,” Drummond says.
Even people who own one may be reluctant to put it on every time someone sends them a two-minute clip or experience. “You don't always have it to hand, and [you’ve got] the setup, teardown costs,” Drummond says. “We have phones with us all the time.”
It’s no secret that the Vision Pro was just a first step for Apple, which is reportedly working on its own smart and AR glasses. However, Drummond believes that such glasses products won’t be able to replace smart phones for years to come due to physical constraints. Phones are now capable of running complex machine learning models locally, while glasses will have much more limited power and compute capacities. Because of those constraints, they’ll likely function as another display for notifications and directions, similar to the Apple Watch. “I think the Watch and the spectacles from Apple will do much the same job,” he says.
This means that AR developers looking to build interactive entertainment experiences may find that the phone remains the best platform for the foreseeable future — a platform that is getting more capable by the day, thanks to AI.
Enjoyed reading this story? Then please consider upgrading to the $8 a month / $80 a year paid tier to support my reporting, and get access to the full Lowpass newsletter every week.
SPONSORED
She got 100 million views. She doesn't exist.

A student in India built a fictional influencer on Google Gemini, ran her less than an hour a day, and hit 10 million views per reel. This is just the beginning.
Discover 44 more ways humans can thrive in the age of AI from former PCMag editor and VidCon CEO Jim Louderback's 2026 SXSW keynote..
Number of the moment: Half a billion
That’s how many Instagram and Facebook users each watch AI-translated videos every week, according to remarks made by Meta CFO Susan Li during the company’s Q1 2026 earnings call.
Meta stopped reporting detailed monthly user numbers for its individual apps some time ago. The company did have 3.5 billion monthly users across its “family of apps” in Q1, but acknowledged that it may count people with different user names across its apps multiple times. Regardless, it seems clear that a huge chunk of Meta’s userbase now consumers videos that have been translated by AI.
What else
Netflix goes vertical with its new mobile app. I wrote about Netflix’s mobile app refresh for Fast Company.
YouTube’s Q1 ad revenue was close to $10 million. Advertising revenue for the Google-owned service grew by 10.7 percent year-over-year.
Vine is back, and so are your cringe-worthy old videos. Divine, a Vine clone financed by Jack Dorsey, launches with 500,000 old Vine videos.
Google TV is bringing YouTube Shorts to the home screen. Other updates announced this week include the ability to create AI videos with Google’s Veo.
Claude connects to Blender, donates to its development. Anthropic has unveiled a number of partnerships with creative software solutions, including Adobe Creative Cloud, Ableton and Blender. As part of this, Antropic is also becoming a major Blender donor, committing to donate at least $280,000 every year.
The CW Network brings new shows to The Roku Channel. Starting this fall, select CW shows will become available on The Roku Channel the day after they air.
Spotify has a new non-AI badge. The music streaming service is awarding the badge to artists it deems to show “signals of a real artist presence.”
That’s it
Ever thought about upgrading to a paid Lowpass subsciption, but haven’t pulled the trigger yet? Then this one may be for you: Today is the last day to get your hands on a lifetime subscription for just $150, down from $250. That’s less than you’d pay for two years under the annual plan. Not ready for such a long-term commitment? Maybe want to keep things casual for a while? Then how about a three-month trial for just $5?
Either way, I appreciate all the support I’m getting from readers, whether it’s through paid subscriptions, forwarding of this newsletter to colleagues who may be interested in it, or social media posts promoting Lowpass stories to your followers. This month was the third birthday of this newsletter, and I couldn’t be more thankful about the support I;ve gotten from readers over the years …
Thanks for reading, have a great weekend!
Was this email forwarded to you? To get Lowpass for free every week, sign up here.
Got a news tip? Simply respond to this email.
Interested in partnering with Lowpass? Check out our sponsorship page.

