Detecting true perfect pitch

This article (also this) proposes that there are two types of perfect pitch, “ability to perceptually encode” and “heightened tonal memory”. And these groups perform differently on a tonal matching test. I take the first to mean the ability to match any tone whatsoever precisely, while the second one to mean the ability to have long-term memory of certain heard tones.

It is interesting to consider what kinds of test actually measure perfect pitch. Usually there are two abilities under consideration, one is the ability to recognize heard tones by their names, the other to generate tones upon calling their names. The proposed article seems to say these two in themselves are rather symptoms of either APE or HTM or even something else as manifested in an association task. Indeed, the recognition task (hear a tone, call a name) is not strict enough to identify either APE or HTM. A piano player may have tactile or visual idenfication of heard tone with position on keyboard, and mediated by this association, know the name of the note — although this is usually not the case. Same goes for all the tests involving reproducing a note on an instrument or using vocal chord position, etc. These are cases of a “hidden” external reference. The mediating step is not seen. The generation task is more interesting, as it must involve at least tonal memory in the form of an internal reference. If it can be done accurately then it could be either APE or HTM but it would not be able to distinguish between the two.

The test proposed by the article solves some of these problems by requiring generation, and by using distraction after the short target tone is produced. The point is to move on from the target tone faster than consultation with hidden external references can take place. If recognition is not immediate, then one must first hold the note in short-term memory, then after the distraction, compare it to internal reference pitches from tonal memory. This is not accurate since short-term tonal memory itself is not stable, being influenced by distraction. So for some small number of tones (could be all of the chromatic scale), HTM could do well, depending on the person, but maybe performance is not even…, and HTM should never be able to match lesser-heard (e.g. non-standard) pitches well… However, if recognition is by APE, then any tone can be immediately recognized into an abstract form and as something distinct, and easily matched later in the abstract forms.

Under this regime, it would seem that most people who recognize and generate tones upon request probably just have varying degrees of HTM and have developed a quick lookup table as internal reference, which would seem to be malleable by training as with other kinds of memory (for people with good associative memory anyway). APE, however, probably cannot be learned — it’s a kind of idiot savant skill like people who know large number multiplications in one second — it just cannot be done with a lookup table.


P.S., here is a highly enlightening thought experiment by somebody trying to learn perfect pitch, and I must say it expresses almost perfectly my thoughts on the subject.

No comments yet. Be the first.

Leave a reply