Name: SP121-Multi-speaker Emotional Acoustic Modeling for CNN-based Speech Synthesis - Projects Lelo
Price: 13000.00 INR
Availability: InStock

In this paper, we investigate multi-speaker emotional acoustic modeling methods for convolutional neural network (CNN) based speech synthesis system. For emotion modeling, we extend to the speech synthesis system that learns a latent embedding space of emotion, derived from a desired emotional identity, and we use emotion code and mel-frequency spectrogram as an emotion identity. In order to model speaker variation in a text-to-speech (TTS) system, we use speaker representations such as trainable speaker embedding and speaker code. We have implemented speech synthesis systems combining speaker representation and emotion representation and compared them by experiments. Experimental results have demonstrated that the multi-speaker emotional speech synthesis approach using trainable speaker embedding and emotion representation from mel spectrogram achieves higher performance when compared with other approaches in terms of naturalness, speaker similarity, and emotion similarity.

Reviews

There are no reviews yet.

Be the first to review “SP121-Multi-speaker Emotional Acoustic Modeling for CNN-based Speech Synthesis”

You must be logged in to post a review.

Contact UsHere's your new discount product tab.

SP121-Multi-speaker Emotional Acoustic Modeling for CNN-based Speech Synthesis

Reviews

Company

Home

About Us

Shop

Projects

Software

Hardware

Mini Projects

Mechanical

Policy

Terms & Conditions

Privacy Policy

Refund & Cancellation policy

Shipping & Delivery Policy

Address

Saikrupa Mall, Dahisar Railway Station, F16, Lokmanya Tilak Rd, West, Dahisar East, Mumbai, Maharashtra 400068

Contact Us To Know More +918356839486

Copyright © ProjectsLelo 2025

Designed & Developed by Tech Cryptors IT Services

SP121-Multi-speaker Emotional Acoustic Modeling for CNN-based Speech Synthesis

Reviews

Related products

SP117-Self-supervised Audio-visual Co-segmentation

SP112-Packet-based Network Traffic Classification Using Deep Learning

SP114-Crowd-Robot Interaction: Crowd-Aware Robot Navigation With Attention-Based Deep Reinforcement Learning

Company

Projects

Policy

Address

Designed & Developed by Tech Cryptors IT Services