book

図解まるわかり VR・AR・MRのしくみ

Name: 図解まるわかり VR・AR・MRのしくみ
Author: monoAI technology株式会社
ISBN: 9784798185811

by monoAI technology株式会社

October 2024

Intermediate

240 pages

6h 57m

Japanese

Shōeisha

Read now

Unlock full access

Content preview from 図解まるわかり VR・AR・MRのしくみ

108

 ボイスジェネレーション、DNN、GAN

人工的な歌声を生成する技術

5-12

音声合成AIの進化

AIを使って人の声や歌声を生成する、ボイスジェネレーションという技

術があります。この技術の中核となるのは、ディープラーニングを応用し

た音声合成モデルです。

代表的なモデルには、

DNN（ディープニューラルネットワーク）と

GAN（生成敵対ネットワーク）が挙げられます。DNNは大量の音声デー

タから自然な発音の特徴を学習し、テキストから音声を生成します（図

5-23）。一方の GAN は、2 つの AIが競い合うことで、よりリアルな音声生

成を実現します（図5-24）。

高品質で自然な音声

ボイスジェネレーションでは、自然で高品質な音声生成が求められてお

り、リアルタイム性の確保も重要な課題です。その課題を解消するため

に、高度なデータ前処理と効率的なモデル構築が行われています。

例えば、歌声生成では、歌手の呼吸や歌詞のリズムなど、音声以外の情

報もモデルに取り込むことで、より自然な歌声を実現しています。

また、ボイスジェネレーションは、XR体験を豊かにする役割も担って

います。

例えば、VRゲームや教育アプリケーションにおいて、キャラクターの

声がより自然で感情豊かになれば、ユーザーの没入感が高まります。AR

ではリアルタイムでより人間の発話に近い音声ガイドを実現します。また

MRでは、デジタルアシスタントがユーザーの指示に対して自然な音声で

応答することで、対話型の操作が可能になります。

これにより、ユーザーはより直感的にシステムと対話でき、操作性の向 ...

Become an O’Reilly member and get unlimited access to this title plus top books and audiobooks from O’Reilly and nearly 200 top publishers, thousands of courses curated by job role, 150+ live events each month,
and much more.

Read now

Unlock full access

More than 5,000 organizations count on O’Reilly

O’Reilly covers everything we've got, with content to help us build a world-class technology community, upgrade the capabilities and competencies of our teams, and improve overall team performance as well as their engagement.

Julian F.

Head of Cybersecurity

I wanted to learn C and C++, but it didn't click for me until I picked up an O'Reilly book. When I went on the O’Reilly platform, I was astonished to find all the books there, plus live events and sandboxes so you could play around with the technology.

Addison B.

Field Engineer

I’ve been on the O’Reilly platform for more than eight years. I use a couple of learning platforms, but I'm on O'Reilly more than anybody else. When you're there, you start learning. I'm never disappointed.

Amir M.

Data Platform Tech Lead

I'm always learning. So when I got on to O'Reilly, I was like a kid in a candy store. There are playlists. There are answers. There's on-demand training. It's worth its weight in gold, in terms of what it allows me to do.

Mark W.

Embedded Software Engineer

Publisher Resources

ISBN: 9784798185811

Cloud Computing

Data Engineering

Data Science

AI & ML

Programming Languages

Software Architecture

IT/Ops

Security

Design

Business

Soft Skills

図解まるわかり VR・AR・MRのしくみ

by monoAI technology株式会社

Become an O’Reilly member and get unlimited access to this title plus top books and audiobooks from O’Reilly and nearly 200 top publishers, thousands of courses curated by job role, 150+ live events each month,
and much more.

More than 5,000 organizations count on O’Reilly

Julian F.

Addison B.

Amir M.

Mark W.

You might also like

図解まるわかりプログラミングのしくみ

図解まるわかりクラウドのしくみ

図解まるわかり仮想化のしくみ

図解まるわかり Web技術のしくみ

Publisher Resources

Become an O’Reilly member and get unlimited access to this title plus top books and audiobooks from O’Reilly and nearly 200 top publishers, thousands of courses curated by job role, 150+ live events each month,and much more.

More than 5,000 organizations count on O’Reilly

Julian F.

Addison B.

Become an O’Reilly member and get unlimited access to this title plus top books and audiobooks from O’Reilly and nearly 200 top publishers, thousands of courses curated by job role, 150+ live events each month,
and much more.