Gate News message, April 17 — Google unveiled Gemini 3.1 Flash TTS, an advanced text-to-speech model with enhanced emotional expression and control features, on April 15. The new model will be rolled out progressively through developer APIs, enterprise Vertex AI, and collaboration tools.
The model’s core capabilities include natural language-based audio tags for fine-tuning speed, intonation, and emotion, plus a “Director Mode” for specifying scenes and character roles to generate more nuanced voice outputs. A multi-speaker feature enables simultaneous dialogue generation, allowing more natural conversation flows suitable for podcasts, audio content, and AI assistants. The model supports over 70 languages and dialects, reflecting regional accents and expressions for localized voice experiences globally.
Google emphasized performance and cost efficiency, achieving high scores on blind human evaluation benchmarks while reducing computational costs through its Flash architecture—designed for large-scale enterprise adoption. Generated audio includes SynthID watermarking to identify AI-generated content and combat misinformation.
The move reflects intensifying competition in voice interfaces. OpenAI is combining real-time voice features with conversational AI for human-like interactions, while Meta is expanding investments in AI characters with voice-based social experiences. Industry observers note that while high-level acting and creative work may remain human-driven for now, repetitive and large-scale production markets could see gradual AI adoption in dubbing, advertising, and audiobook sectors.
Disclaimer: The information on this page may come from third parties and does not represent the views or opinions of Gate. The content displayed on this page is for reference only and does not constitute any financial, investment, or legal advice. Gate does not guarantee the accuracy or completeness of the information and shall not be liable for any losses arising from the use of this information. Virtual asset investments carry high risks and are subject to significant price volatility. You may lose all of your invested principal. Please fully understand the relevant risks and make prudent decisions based on your own financial situation and risk tolerance. For details, please refer to
Disclaimer.
Articoli correlati
Huawei Launches First AI Glasses in China as Market Grows 322% Globally
Huawei has launched AI glasses in China, featuring an in-house chip and various smart functions. The global AI glasses market saw a significant increase in shipments, with China becoming the second-largest market.
GateNews33m fa
Google Expands Gemini Feature in Chrome Browser to Seven Asia-Pacific Countries
Google announced on April 20 that it will expand Gemini features in Chrome to several Asia-Pacific countries, including Australia and South Korea, enhancing access to its AI capabilities in the region.
GateNews1h fa
CuspAI Raises $200M at $1B+ Valuation to Accelerate AI-Powered Materials Discovery
CuspAI, a UK AI startup focused on materials discovery, seeks to raise $200 million, aiming for a $1 billion valuation. Its platform excels in creating materials for various applications and boasts high success rates compared to competitors.
GateNews1h fa
Dark Matter Labs Releases and Open-Sources Kimi K2.6 Model
Dark Matter Labs has released the Kimi K2.6 model, boasting enhanced long-context coding and improved autonomous execution abilities. It is now available on multiple platforms for all users.
GateNews1h fa
Claude Desktop Installation Reportedly Writes Backdoor File to Chromium-Based Browsers
The Claude Desktop application by Anthropic installs a backdoor file in Chromium-based browsers without user consent, posing serious security and privacy risks by potentially allowing attackers to control users' browsers.
GateNews1h fa
亞馬遜加碼 Anthropic 250 億美元:5GW 算力、千億 AWS 綁定
Amazon擴大與Anthropic的合作,投資最高250億美元,Anthropic承諾十年內在AWS支出超過1000億美元,並獲得5GW的新增算力。這是Anthropic算力擴展的第二波,估值達3800億美元。Anthropic年化營收突破300億美元,需求急增促使其基礎設施擴張,與AWS及Google同步推進算力發展。
ChainNewsAbmedia1h fa