ElevenLabs Unveils Innovative Speech-to-Text Model: Revolutionizing Voice Recognition Technology

February 27, 2025February 27, 2025

ElevenLabs, an innovative AI startup renowned for its audio-generation capabilities, has recently secured a substantial $180 million funding round, enhancing its valuation to $3.3 billion. The company is now venturing into the realm of speech-to-text technology with the launch of its first standalone model, Scribe.

Introducing ElevenLabs’ Scribe: A New Era in Speech-to-Text Technology

With its extensive library of voices, ElevenLabs has already supported numerous companies in delivering effective speech-to-text services. The introduction of Scribe marks the company’s intent to compete in the growing market of speech detection technologies, challenging established players such as Gladia, Speechmatics, AssemblyAI, Deepgram, and OpenAI’s Whisper.

Key Features of Scribe

The Scribe model is designed to support over 99 languages at launch, with a focus on delivering accuracy. Here are some notable highlights:

Excellent Accuracy: More than 25 languages boast a word error rate of less than 5%, including:

English (97% accuracy)
French
German
Hindi
Indonesian
Japanese
Kannada
Malayalam
Polish
Portuguese
Spanish
Vietnamese

Performance Benchmark: Scribe outperformed Google Gemini 2.0 Flash and Whisper Large V3 across various languages in FLEURS & Common Voice benchmark tests.

Innovation in Speech Detection

Previously, ElevenLabs developed a speech-to-text component for its conversational agent platform. However, Scribe represents the company’s first foray into standalone speech detection. In a recent interview with TechCrunch, CEO Mati Staniszewski emphasized the need for improved speech detection models, stating:

“We want to understand what’s being said by you in a conversation better. Many people say that speech-to-text is a solved problem. But for many languages, it is pretty bad. We think we can build better speech detection models because we have in-house teams to annotate data and give us quick feedback.”

Advanced Features

Scribe comes equipped with several advanced functionalities:

Smart Speaker Diarization: Identifies speakers in conversations.
Word-Level Timestamps: Provides accurate subtitles.
Auto-Tagging: Recognizes sound events, such as audience laughter.

Moreover, the platform enables customers to transcribe video content seamlessly for subtitle and caption integration.

Future Developments and Pricing

Currently, Scribe supports only pre-recorded audio formats. However, ElevenLabs plans to release a low-latency, real-time version of the model soon to facilitate meeting transcriptions and voice note-taking.

The pricing for Scribe is set at $0.40 per hour of transcribed audio, which remains competitive within the market, though some competitors may offer lower rates with different feature sets.

For more information about ElevenLabs and its innovative technologies, visit their official website here.

Payments and Digital Banking

Instabase Secures $100M Investment to Scale Its Cutting-Edge AI-Driven Data Platform

Bysupport January 20, 2025January 20, 2025

Instabase, specializing in applied AI for unstructured data management, has raised $100 million in its Series D funding round, led by the Qatar Investment Authority alongside investors like Greylock Partners and Andreessen Horowitz. Founded in 2015, Instabase develops AI solutions that help businesses extract insights from data such as emails and PDFs. The new funding will enhance its AI Hub platform, focusing on automation, data analysis, and search capabilities. With a growing customer base that includes AXA and Uber, Instabase aims to transform data management across industries, reinforcing its position as a trusted partner for major financial institutions.

Industry News

Unlocking MVP Success: How AI Can Elevate Your Startup with Chris Gardner at TechCrunch All Stage

Bysupport April 4, 2025April 4, 2025

TechCrunch All Stage, set for July 15 at the SoWa Power Station in Boston, is a vital event for startup founders seeking insights and networking opportunities. Highlighted by Chris Gardner from Underscore VC, the session “MVP in the Age of AI” will explore optimizing minimum viable products using AI. Attendees can benefit from exclusive discounts on passes, saving over $200, and connect with over 1,200 participants and leading VCs. The event will feature various sessions on market assessment, fundraising strategies, and more. Opportunities to showcase startups and sponsorships are also available. Secure your tickets now!

Last Chance: Save Up to $325 at TechCrunch Sessions: AI - Only 4 Days Left!

Industry News

Discover the Audience Choice Winners Shaping Breakout Sessions at TechCrunch Sessions: AI

Bysupport April 8, 2025April 8, 2025

Exciting news for AI enthusiasts! TechCrunch Sessions: AI on June 5 at UC Berkeley will feature two Audience Choice winners, Yann Stoneman and Hua Wang, sharing insights on artificial intelligence. Stoneman’s session, “Secure Generative AI for Regulated Enterprises,” will discuss using AI in regulated industries like healthcare and finance, highlighting real-world use cases and practical strategies. Wang will present “The AI Policy Playbook,” focusing on the challenges startups face with regulations while leveraging AI for growth. Attendees can engage in discussions and hands-on demos. Don’t miss this opportunity—grab your tickets now for early bird pricing!

Future of Fintech

Cohere Unveils ‘North’: The Game-Changing AI Solution for Privacy-Focused Enterprises

Bysupport January 11, 2025January 11, 2025

Cohere has launched its enterprise AI platform, North, which outperforms competitors like Microsoft Copilot and Google Vertex AI. Designed for regulated industries, North emphasizes security and compliance while delivering high performance. The Royal Bank of Canada is an early adopter, utilizing North to enhance operations in compliance-heavy sectors. Key features include a user-friendly interface, improved decision-making through AI insights, operational efficiency, and seamless compliance assurance. As organizations increasingly adopt North, its potential to transform enterprise AI solutions and boost efficiency in regulated industries becomes clear. For more details, visit Cohere’s Enterprise AI Solutions page or refer to Forbes.

Industry News

Unlock the Power of Google’s Gemini: Ask Questions with Videos and On-Screen Content!

Bysupport March 3, 2025March 3, 2025

Google has enhanced its AI assistant, Gemini, with new features unveiled at the Mobile World Congress 2025 in Barcelona. A key addition is the “Screenshare” feature, allowing users to share their smartphone screens with Gemini and ask questions about displayed content, enhancing the shopping experience. Another notable feature is real-time video search, enabling users to record videos and receive immediate assistance based on what’s being filmed. These features will be available to Gemini Advanced users on the Google One AI Premium plan for Android devices later this month, promising a more intuitive interaction with technology.

Industry News

Millie’s Maternity Clinic Secures $12M Series A Funding from Elite All-Female VC Team

Bysupport February 27, 2025February 27, 2025

Millie, a California-based maternity clinic founded by Anu Sharma, has secured $12 million in Series A funding to enhance maternal care. Inspired by her own challenging experience during her daughter’s birth in 2019, Sharma aims to address the overlooked needs of pregnant women and new mothers. Launched in 2022, Millie offers a range of services, including maternity care, postpartum coaching, and mental health support, utilizing a hybrid model of in-person and virtual consultations. With nearly $19 million raised overall, Millie plans to expand its presence in California and enhance its technology services, positioning itself as a leader in maternal healthcare.

ElevenLabs Unveils Innovative Speech-to-Text Model: Revolutionizing Voice Recognition Technology

Introducing ElevenLabs’ Scribe: A New Era in Speech-to-Text Technology

Key Features of Scribe

Innovation in Speech Detection

Advanced Features

Future Developments and Pricing

Instabase Secures $100M Investment to Scale Its Cutting-Edge AI-Driven Data Platform

Unlocking MVP Success: How AI Can Elevate Your Startup with Chris Gardner at TechCrunch All Stage

Discover the Audience Choice Winners Shaping Breakout Sessions at TechCrunch Sessions: AI

Cohere Unveils ‘North’: The Game-Changing AI Solution for Privacy-Focused Enterprises

Unlock the Power of Google’s Gemini: Ask Questions with Videos and On-Screen Content!

Millie’s Maternity Clinic Secures $12M Series A Funding from Elite All-Female VC Team

Revolutionizing Collaboration: Microsoft Empowers AI Agents to Communicate, Transforming the Future of Work

Revolutionizing Code Development: GitHub Copilot Transforms into an Autonomous Agent with Asynchronous Code Testing

Supercharge Your PC: Nvidia and Microsoft Revolutionize AI Processing

Join Our Newsletter

Recent Post

Revolutionizing Collaboration: Microsoft Empowers AI Agents to Communicate,…

Revolutionizing Code Development: GitHub Copilot Transforms into an…

Supercharge Your PC: Nvidia and Microsoft Revolutionize AI…

Newsletter

Subscribe to our MailChimp newsletter
and stay up to date with all events coming straight in your mailbox:

Introducing ElevenLabs’ Scribe: A New Era in Speech-to-Text Technology

Key Features of Scribe

Innovation in Speech Detection

Advanced Features

Future Developments and Pricing

Similar Posts

Join Our Newsletter

Recent Post

Newsletter

Subscribe to our MailChimp newsletter and stay up to date with all events coming straight in your mailbox:

Subscribe to our MailChimp newsletter
and stay up to date with all events coming straight in your mailbox: