A method for manipulating the pitch and duration of an acoustic speech signal. This method was realized by Moulines & Charpentier (1990), who called it Time-Domain Pitch-Synchronous Overlap-and-Add (TD-PSOLA). Nowadays, PSOLA™ and TD-PSOLA™ are trademarks owned by France Telecom®.

Overlap-add synthesis

When a Sound is created from a Manipulation object, the following steps are performed:

1. From the PitchTier, new points are generated along the entire time domain, with the method of PitchTier: To PointProcess.
2. The period information in the original pulses (available in the Manipulation object) is used to remove from the new pulses all points that lie within voiceless intervals (i.e., places where the distance between adjacent points in the original pulses is greater than 20 ms).
3. The voiceless parts are copied from the source Sound to the target Sound, re-using some parts if the local duration is greater than 1.
4. For each target point, we look up the nearest source point. A piece of the source Sound, centred around the source point, is copied to the target Sound at a location determined by the target point, using a bell-shaped window whose left-hand half-length is the minimum of the left-hand periods adjacent to the source and target points (and analogously for the right-hand half-length).

Links to this page

© ppgb, August 16, 2007