Application of Different Statistical Tests for Validation of Synthesized Speech Parameterized by Cepstral Coefficients and LSP

Authors

  • Carlos Angel Franco-Galvan Universidad Nacional Autónoma de Mexicana, Facultad de Artes BUAP, Laboratorio de Tecnologías del Lenguaje
  • José Abel Herrera-Camacho Universidad Nacional Autónoma de México, Facultad de Ingeniería UNAM
  • Boris Escalante-Ramirez Universidad Nacional Autónoma de México, Facultad de Ingeniería UNAM

DOI:

https://doi.org/10.13053/cys-23-2-2977

Keywords:

Speech synthesis, voice parameterization, line spectral pair

Abstract

The following document tries out different statistical norms to validate the quality of synthesized voices applied to an HTS-based spanish synthesizer, which uses LSP and Cepstral Coefficients parameterizations. The standard MOS tests were carried out. Other types of quality tests were performed as well to reinforce the MOS results. Such as: MUSHRA, ABX and CCR. The subjective test PESQ was also applied. To validate intelligibility a SUS test was used.

Author Biographies

Carlos Angel Franco-Galvan, Universidad Nacional Autónoma de Mexicana, Facultad de Artes BUAP, Laboratorio de Tecnologías del Lenguaje

Part Time Lecturer in Coegio de Música BUAPPhD student in Posgrado en CIC UNAM

José Abel Herrera-Camacho, Universidad Nacional Autónoma de México, Facultad de Ingeniería UNAM

Profesor Titular C, Tiempo Completo

Boris Escalante-Ramirez, Universidad Nacional Autónoma de México, Facultad de Ingeniería UNAM

B.Eng. (National University of Mexico), M.E.E. (Philips International Institute, The Netherlands), Ph.D. (Technical University of Eindhoven, The Neherlands)Research interests: Computational models of human vision and its applications to digital image processing.Member of the National Research System (SNI nivel 1)

Downloads

Published

2019-06-27