Tech Product

AudioLM

Overview

最終更新: 2026年7月12日

Google DeepMindが開発した、高品質な音声生成を可能にする大規模言語音声モデル。テキストの介在なしに音声から音声への直接的な変換や生成を学習し、話者の声のトーン、抑揚、感情といった特徴を極めて自然に再現できる。Google Meetのリアルタイム翻訳機能の核となる技術であり、従来の機械的な合成音声とは一線を画す自然な会話体験を実現する。

Mentioned Articles

1 件

テクノロジー
Google Meet、AIリアルタイム翻訳のベータテストを開始： Geminiが実現する「声まで通じる」未来のコミュニケーション
Google I/O 2025で発表された数々の新技術の中でも、ひときわ大きな注目を集めているのが、ビデオ会議サービス「Google Meet」に搭載されるAIを活用したリアルタイム音声翻訳機能だろう。これは単なる文字起 […]
2025年5月21日約 8 分

External Mentions

10 件

arXivSpatio-Temporal Audio Language Modeling for Dynamic Sound Sources
▲ 0Oh Hyun-Bin2026年6月12日
Hacker NewsShow HN: Audiomass – a free, open-source multitrack audio editor for the web
▲ 549pantelisk2026年5月24日
Hacker NewsYouTube audio quality – How good does it get? (2022)
▲ 116fhinson2025年2月1日
Hacker NewsPython notebooks for fundamentals of music processing
▲ 277yeknoda2024年6月2日
arXivAudioPaLM: A Large Language Model That Can Speak and Listen
▲ 0Paul K. Rubenstein2023年6月22日
arXivLM-VC: Zero-shot Voice Conversion via Speech Generation based on Language Models
▲ 0Zhichao Wang2023年6月18日
arXivUniCATS: A Unified Context-Aware Text-to-Speech Framework with Contextual VQ-Diffusion and Vocoding
▲ 0Chenpeng Du2023年6月13日
arXivSoundStorm: Efficient Parallel Audio Generation
▲ 0Zalán Borsos2023年5月16日
arXivHiFi-Codec: Group-residual Vector quantization for High Fidelity Audio Codec
▲ 0Dongchao Yang2023年5月4日
Hacker NewsScaling up the Prime Video audio/video monitoring service and reducing costs
▲ 989debdut2023年5月4日

AudioLM

Overview

Mentioned Articles

Google Meet、AIリアルタイム翻訳のベータテストを開始： Geminiが実現する「声まで通じる」未来のコミュニケーション

External Mentions