Interface for Dynamic Modification of the Transformation Parameters of the PSOLA Algorithm
DOI:
https://doi.org/10.18100/ijamec.90459Keywords:
speech synthesis, PSOLA algorithm, PSOLA parameters, prosodic parameters, dynamically changing, interfaceAbstract
The prosody of a speech signal is related to many factors: the social and geographical origin of the speaker, his or her emotional state, his physiological state (weariness, sickness, …) and the type of the sentence (interrogative, affirmative, etc.). A good synthesis or speech transformation system must account for all of these factors in order to produce a speech that sounds natural. In this paper, we propose a graphical interface for the modification of the prosodic features of the speech signal (the melodic curve - fundamental frequency and temporal organization of the syllables - and the formantic trajectories) using the PSOLA algorithm. The interface allows the user to manually introduce the desired trajectories of the transformation parameters of the PSOLA algorithm in order to produce a transformed signal which has the desired prosody. The results are acceptable, especially for the modification of the fundamental frequency and of the temporal organization of the source signal.Downloads
References
M. Schübiger: English intonation. Max Niemeyer Verlag, Tübingen (1958)
D. Crystal: Prosodic systems and intonation in English. Cambridge University Press, London (1969)
G. Peeters : Modèles et modification du signal sonore adaptés à ses caractéristiques locales, Thesis, Paris (2001)
H. Valbret, E. Mouline, J. Tubach : Voice transformation using PSOLA technique, Speech Communication 11 (1992), p. 175-187.
M. Kondoz: Digital Speech, Coding for low bit rate communication systems, Wiley, 2004
M. Jelinek, J. P. Adoul, Frequency-Domain Spectral Envelope Estimation for Low Rate Coding of Speech, ICASSP, 1999
P. Veprek, M. S. Scordilis, Analysis, enhancement, and evaluation of five pitch determination techniques, Speech Communication 37, 2002
F. Charpentier, Traitement de la parole par Analyse / Synthèse de Fourier application à la synthèse par diphones, Thèse, ENST, Paris, 1988
N. Henrich, B. Doval, C. d’Alessandro, and M.Castellengo. Open quotient measurements on EGG, speech and singing signals. In Proc. 4th International Workshop on Advances in Quantitative Laryngoscopy, Voice and Speech Research, Jena, Apr. 2000.
. G. Peeters. Analyse et synthèse des sons musicaux par la méthode PSOLA. In JIM98- Workshop, Agelonde, France, Mai 1998.
S. M. Metev and V. P. Veiko, Laser Assisted Microtechnology, 2nd ed., R. M. Osgood, Jr., Ed. Berlin, Germany: Springer-Verlag, 1998.
A. Mousa. Voice Conversion Using Pitch Shifting Algorithm by Time Stretching with PSOLA and Re-Sampling. Journal of Electrical Engineering, Vol 61, NO. 1, 2010, p. 57-61
P. Dutilleux, G. De Poli, U. Zölzer, DAFX - Digital Audio Effects, U. Zölzer, Ed. John Wiley & Sons, Sussex, England, 2002, p. 201-234.
D. Childers, Modern Spectrum Analysis, IEEE Press, Piscateway, New Jersey, U.S., 1978, p. 252-255.
W.H. Press, S.A. Teukolsky, W.T. Vetterling, B.P. Flannery, Numerical Recipes in C: the art of scientific computing, Second Edition, Cambridge University Press, 1992.
Downloads
Published
Issue
Section
License
Copyright (c) 2023 International Journal of Applied Methods in Electronics and Computers
This work is licensed under a Creative Commons Attribution-ShareAlike 4.0 International License.