Sesame, the startup behind the viral digital assistant Maya, releases its base AI model

March 13, 2025

AI company Sesame has released the base model that powers Maya, the impressively lifelike voice assistant.

The model, which is 1 billion parameters in size (“parameters” referring to individual components of the model), is under an Apache 2.0 license, meaning it can be used commercially with few restrictions. Called CSM-1B, the model generates “RVQ audio codes” from text and audio inputs, according to Sesame’s description on the AI dev platform Hugging Face.

RVQ refers to “residual vector quantization,” a technique for encoding audio into discrete tokens called codes. RVQ is used in a number of recent AI audio technologies, including Google’s SoundStream and Meta’s Encodec.
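To make the idea concrete, here is a minimal sketch of residual vector quantization in plain Python. It is an illustration only, not Sesame's implementation: the function names and the tiny hand-picked codebooks are hypothetical, and real codecs learn much larger codebooks from data and run them over learned audio features rather than raw two-number frames. Each stage picks the nearest codebook entry for whatever the previous stages left unexplained, and the list of chosen indices is the "codes."

```python
from typing import List, Sequence

Vector = Sequence[float]
Codebook = Sequence[Vector]


def nearest(codebook: Codebook, v: Vector) -> int:
    """Return the index of the codebook entry closest to v (squared Euclidean distance)."""
    def dist(entry: Vector) -> float:
        return sum((a - b) ** 2 for a, b in zip(entry, v))
    return min(range(len(codebook)), key=lambda i: dist(codebook[i]))


def rvq_encode(v: Vector, codebooks: Sequence[Codebook]) -> List[int]:
    """Quantize v stage by stage; each stage encodes the residual left by earlier stages."""
    residual = list(v)
    codes = []
    for cb in codebooks:
        idx = nearest(cb, residual)
        codes.append(idx)
        # Subtract the chosen entry so the next stage only sees the remaining error.
        residual = [r - c for r, c in zip(residual, cb[idx])]
    return codes


def rvq_decode(codes: Sequence[int], codebooks: Sequence[Codebook]) -> List[float]:
    """Reconstruct the vector by summing the selected entry from each stage."""
    out = [0.0] * len(codebooks[0][0])
    for idx, cb in zip(codes, codebooks):
        out = [o + c for o, c in zip(out, cb[idx])]
    return out


# Toy, hand-picked codebooks (hypothetical; real codecs learn these from data).
CODEBOOKS = [
    [(0.0, 0.0), (1.0, 1.0), (-1.0, 1.0), (1.0, -1.0)],      # coarse stage
    [(0.0, 0.0), (0.25, 0.0), (0.0, 0.25), (-0.25, -0.25)],  # fine stage
]

frame = (1.2, 0.9)
codes = rvq_encode(frame, CODEBOOKS)
approx = rvq_decode(codes, CODEBOOKS)
print(codes, approx)
```

In this toy setup the frame (1.2, 0.9) is stored as two small integers, one per stage, and decoding them gives an approximation of the original. Adding more stages shrinks the remaining error, which is why the scheme suits compact audio tokens.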

CSM-1B uses a model from Meta’s Llama family as its backbone, paired with an audio “decoder” component. A fine-tuned variant of CSM powers Maya, Sesame says.

“The model open-sourced here is a base generation model,” Sesame writes in CSM-1B’s Hugging Face and GitHub repositories. “It is capable of producing a variety of voices, but it has not been fine-tuned on any specific voice […] The model has some capacity for non-English languages due to data contamination in the training data, but it likely won’t do well.”

It’s unclear what data Sesame used to train CSM-1B. The company didn’t say.

It’s worth noting the model has no real safeguards to speak of. Sesame has an honor system and merely urges developers and users not to use the model to mimic a person’s voice without their consent, create misleading content like fake news, or engage in “harmful” or “malicious” activities.

I tried the demo on Hugging Face, and cloning my voice took less than a minute. From there, it was easy to generate speech to my heart’s desire, including on controversial topics like the election and Russian propaganda.

Consumer Reports recently warned that many popular AI-powered voice cloning tools on the market don’t have “meaningful” safeguards to prevent fraud or abuse.

Sesame, co-founded by Oculus co-creator Brendan Iribe, went viral in late February for its assistant tech, which comes close to clearing uncanny valley territory. Maya and Sesame’s other assistant, Miles, take breaths and speak with disfluencies, and can be interrupted while speaking, much like OpenAI’s Voice Mode.

Sesame has raised an undisclosed amount of capital from Andreessen Horowitz, Spark Capital, and Matrix Partners. In addition to building voice assistant tech, the company says it’s prototyping AI glasses “designed to be worn all day” that’ll be equipped with its custom models.
