FAQ: Pitch analysis

FAQ: Pitch analysis

Please also consult how to choose a pitch analysis method.

Question: what algorithm is used for pitch analysis?

Answer: for how the raw pitch analysis method works, see Sound: To Pitch (ac).... The 1993 article is downloadable from https://www.fon.hum.uva.nl/paul/

Question: why does Praat consider my sound voiceless while I hear it as voiced?

There are at least five possibilities. Most of them can be checked by zooming in on the waveform.

The first possibility is that the pitch has fallen below the pitch floor. For instance, your pitch floor could be 50 Hz but the English speaker produces creak at the end of the utterance. Or your pitch floor could be 50 Hz but the Chinese speaker is in the middle of a third tone. If this happens, it may help to lower the pitch floor to e.g. 30 Hz (Pitch settings...), although that may also smooth the pitch curve too much in other places.

The second possibility is that the pitch has moved too fast. This could happen at the end of a Chinese fourth tone, which drops very fast. If this happens, it may help to use pitch analysis by raw cross-correlation, although Praat may then hallucinate pitches in other places that you would prefer to consider voiceless.

The third possibility is that the periods are very irregular, as in some pathological voices. If you want to see a pitch in those cases, it may help to use pitch analysis by raw cross-correlation. Or it may help to lower the voicing threshold setting to 0.25 (instead of the standard 0.50) or so (Pitch settings...).

The fourth possibility is that there is a lot of background noise, as in a recording on a busy street. In such a case, it may help to lower the voicing threshold setting to 0.25 (instead of the standard 0.50) or so. The disadvantage of lowering this setting is that for non-noisy recordings, Praat will become too eager to find voicing in some places that you would prefer to consider voiceless; so make sure to set it back to 0.50 once you have finished analysing the noisy recordings.

The fifth possibility is that the part analysed as voiceless is much less loud than the rest of the sound, or that the sound contains a loud noise elsewhere. This can be checked by zooming in on the part analysed as voiceless: if Praat suddenly considers it as voiced, this is a sign that this part is much quieter than the rest. To make Praat analyse this part as voiced, you can lower the silence threshold setting to 0.01 (instead of the standard 0.09) or so. The disadvantage of lowering this setting is that Praat may start to consider some distant background sounds (and quiet echos, for instance) as voiced.

Question: why do I get different results for the maximum pitch if...?

If you select a Sound and choose Sound: To Pitch (filtered ac)..., the time step will usually be 0.015 seconds. The resulting Pitch object will have values for times that are 0.015 seconds apart. If you then click Info or choose Get maximum pitch from the Query submenu, the result is based on those time points. By contrast, if you choose Get maximum pitch from the Pitch menu in the SoundEditor window, the result will be based on the visible points, of which there tend to be a hundred in the visible window. These different time spacings will lead to slightly different pitch contours.

If you choose Move cursor to maximum pitch, then choose Get pitch from the Pitch menu, the result will be different again. This is because Get maximum pitch can do a parabolic interpolation around the maximum, whereas Get pitch, not realizing that the cursor is at a maximum, does a stupid linear interpolation, which tends to lead to lower values.

Links to this page

FAQ (Frequently Asked Questions)