Interface for Dynamic Modification of the Transformation Parameters of the PSOLA Algorithm

Authors

  • Demri LYES
  • Falek LEİLA
  • Teffahi HOCİNE

DOI:

https://doi.org/10.18100/ijamec.90459

Keywords:

speech synthesis, PSOLA algorithm, PSOLA parameters, prosodic parameters, dynamically changing, interface

Abstract

The prosody of a speech signal is related to many factors: the social and geographical origin of the speaker, his or her emotional state, his physiological state (weariness, sickness, …) and the type of the sentence (interrogative, affirmative, etc.). A good synthesis or speech transformation system must account for all of these factors in order to produce a speech that sounds natural. In this paper, we propose a graphical interface for the modification of the prosodic features of the speech signal (the melodic curve - fundamental frequency and temporal organization of the syllables - and the formantic trajectories) using the PSOLA algorithm. The interface allows the user to manually introduce the desired trajectories of the transformation parameters of the PSOLA algorithm in order to produce a transformed signal which has the desired prosody. The results are acceptable, especially for the modification of the fundamental frequency and of the temporal organization of the source signal.

Downloads

Download data is not yet available.

References

M. Schübiger: English intonation. Max Niemeyer Verlag, Tübingen (1958)

D. Crystal: Prosodic systems and intonation in English. Cambridge University Press, London (1969)

G. Peeters : Modèles et modification du signal sonore adaptés à ses caractéristiques locales, Thesis, Paris (2001)

H. Valbret, E. Mouline, J. Tubach : Voice transformation using PSOLA technique, Speech Communication 11 (1992), p. 175-187.

M. Kondoz: Digital Speech, Coding for low bit rate communication systems, Wiley, 2004

M. Jelinek, J. P. Adoul, Frequency-Domain Spectral Envelope Estimation for Low Rate Coding of Speech, ICASSP, 1999

P. Veprek, M. S. Scordilis, Analysis, enhancement, and evaluation of five pitch determination techniques, Speech Communication 37, 2002

F. Charpentier, Traitement de la parole par Analyse / Synthèse de Fourier application à la synthèse par diphones, Thèse, ENST, Paris, 1988

N. Henrich, B. Doval, C. d’Alessandro, and M.Castellengo. Open quotient measurements on EGG, speech and singing signals. In Proc. 4th International Workshop on Advances in Quantitative Laryngoscopy, Voice and Speech Research, Jena, Apr. 2000.

. G. Peeters. Analyse et synthèse des sons musicaux par la méthode PSOLA. In JIM98- Workshop, Agelonde, France, Mai 1998.

S. M. Metev and V. P. Veiko, Laser Assisted Microtechnology, 2nd ed., R. M. Osgood, Jr., Ed. Berlin, Germany: Springer-Verlag, 1998.

A. Mousa. Voice Conversion Using Pitch Shifting Algorithm by Time Stretching with PSOLA and Re-Sampling. Journal of Electrical Engineering, Vol 61, NO. 1, 2010, p. 57-61

P. Dutilleux, G. De Poli, U. Zölzer, DAFX - Digital Audio Effects, U. Zölzer, Ed. John Wiley & Sons, Sussex, England, 2002, p. 201-234.

D. Childers, Modern Spectrum Analysis, IEEE Press, Piscateway, New Jersey, U.S., 1978, p. 252-255.

W.H. Press, S.A. Teukolsky, W.T. Vetterling, B.P. Flannery, Numerical Recipes in C: the art of scientific computing, Second Edition, Cambridge University Press, 1992.

Downloads

Published

02-10-2014

Issue

Section

Research Articles

How to Cite

[1]
“Interface for Dynamic Modification of the Transformation Parameters of the PSOLA Algorithm”, J. Appl. Methods Electron. Comput., vol. 2, no. 4, pp. 26–30, Oct. 2014, doi: 10.18100/ijamec.90459.

Similar Articles

1-10 of 141

You may also start an advanced similarity search for this article.