Trying the new voice assistant from AI startup Sesame is the first time I momentarily forgot I was talking to a bot.
Compared to ChatGPT's voice mode, Sesame's "conversational voice" feels natural, unforced, and engaging, which totally freaked me out.
On Feb. 27, Sesame launched a demo for its Conversational Speech Model (CSM), which aims to create more meaningful interactions with AI chatbots. "We are creating conversational partners that do not just process requests; they engage in genuine dialogue that builds confidence and trust over time," the announcement states. "In doing so, we hope to realize the untapped potential of voice as the ultimate interface for instruction and understanding."
Sesame's voice assistant is available as a free demo on the site and comes in two voices: Maya and Miles.
Since Sesame unleashed its voice assistant demo, users have reported awestruck reactions. "I've been into AI since I was a child, but this is the first time I've experienced something that made me definitively feel like we had arrived," user SOCSchamp wrote on Reddit.
"Sesame is about as close to indistinguishable from a human that I've ever experienced in a conversational AI," user Siciliano777 wrote on Reddit.
After talking to Sesame's bot, I was similarly wowed. I talked to the Maya voice for about 10 minutes about the ethics of using AI as a companion and came away feeling like I had a genuine conversation with a considerate, informed person. Maya's speech had a natural cadence, using interjections like "you know" and "hm," and even making tongue clicking and inhaling sounds.
The strongest impression I got from interacting with Maya was that she immediately asked questions, engaging me in the conversation. The bot started our conversation by asking how my Wednesday morning was going (note: it was indeed a Wednesday morning). In contrast, ChatGPT voice mode waited for me to talk first, which isn't necessarily a good or bad thing, but it intrinsically shaped the conversation as me using ChatGPT as a tool for something I needed.
Maya asked about the risks of AI companions getting "too good at being human." When I told her I was concerned about the rise of more sophisticated scams and people losing touch with reality by replacing humans with bots, she responded thoughtfully and pragmatically. "Scammers are gonna scam, that's a given. And as for the human connection thing, maybe we need to learn how to be better companions, not replacements, you know, the kind of AI friends who actually make you want to go out and do stuff with real people," said Maya.
When I had a similar conversation with ChatGPT, I received a response that felt more like boilerplate language from a school guidance counselor: "That's a valid concern. It’s really important to balance technology with real human interactions. AI can be a helpful tool, but it shouldn't replace genuine human connections. It’s good that you're thinking about these issues."
While OpenAI pioneered voice mode's ability to be interrupted and have a more fluid back-and-forth conversation, ChatGPT still tends to respond in complete sentences and paragraph blocks, which sounds, well, robotic. When using ChatGPT voice mode, I never forget that I'm talking to a bot, and that's reflected in the conversation, which can feel stilted and forced.
By comparison, AI for Humans podcast co-host Gavin Purcell posted a Sesame conversation on Reddit where it's practically impossible to distinguish which voice is the bot. Purcell prompted the Miles voice by telling it to act like an angry boss.
A very silly conversation followed about money laundering, bribery, and a mysterious incident in Malta. Miles didn't miss a step. There was no perceptible latency, and the bot remembered the context of the conversation and creatively advanced the improvisational argument by escalating, calling Purcell "delusional," and firing him.
Of course, there are some limitations. Maya's voice glitched a few times throughout our conversation, and it didn't always get the syntax right, like saying, "It's a heavy talk that come."
According to its technical paper, Sesame trained its CSM (based on Meta's Llama model) by combining the traditional two-step process of training text-to-speech models, first on semantic tokens and then on acoustic tokens, into a single stage, which decreases latency. OpenAI similarly used this multimodal approach to training voice mode. However, it has never released a dedicated technical paper on voice mode's inner workings; it only discusses voice mode in the GPT-4o research.
Knowing this, it's surprising how much better Sesame's model is at conversational dialog. However, Sesame's launch is just a demo, so it merits further scrutiny when the full model comes out. According to the demo announcement, Sesame plans to open source its model "in the coming months" and expand to over 20 languages.