Sound: Change speaker...

A command to create a new Sound with manipulated characteristics.

Settings

The quality of the manipulation depends on the pitch measurement.

The arguments that control the pitch measurement are:

Pitch floor (Hz) (standard value: 75 Hz)
pitch candidates below this frequency will not be considered.
Pitch ceiling (Hz) (standard value: 600 Hz)
pitch candidates above this frequency will be ignored.

The arguments that control the manipulation are:

Multiply formants by
determines the formant frequencies of the newly created sound. The formant frequency of the new sound will equal the formant frequencies of the selected sound multiplied by this number. If this number equals 1, formant frequencies will not change. A number of 1.1 will change a male voice to a voice with approximate female formant characteristics. A ratio of 1/1.1 will change a female voice to a voice with approximate male formant characteristics.
Multiply pitch by
determines what the pitch of the new Sound will be. The pitch values of the new sound will equal the pitch values of the selected sound multiplied by this number. A value of 1.8 will approximately change a male's pitch to a female's pitch.
Multiply pitch range by (standard value: 1.0)
determines the pitch range of the newly created sound. A factor of 1.0 means that no additional pitch modification will occur (except the obvious one described above). A factor of 0.0 monotonizes the new sound to the new pitch median. A negative number inverses the pitch range with respect to the median.
Multiply duration by (standard value: 1.0)
determines how to modify the duration of the newly created sound. A value of 1.0 means that the new sound will have the same duration as the selected sound. A value less than 1.0 will result in a shortened new sound. A value larger than 2.5 will not work.

If you want more control over the synthesis you can supply your own Pitch object and use the Sound & Pitch: Change speaker... command.

Algorithm

The shifting of formant frequencies is done via manipulation of the sampling frequency. To multiply all formants by a factor of 1.10 (i.e. raising them by 10 percent), a sampling frequency of 44100 Hz is first raised to 48510 Hz (without changing the samples). After this, the sound is lengthened by a factor of 1.10 and the pitch is lowered by a factor of 1.10, so that the original duration and pitch are restored. After this, the sound is resampled to 44100 Hz (by sinc interpolation).Pitch and duration changes are generated with overlap-add synthesis.


© djmw 20080515