Auditory imagery is a form of mental imagery that is used to organize and analyze sounds when there is no external auditory stimulus present. This form of imagery is broken up into a couple of auditory modalities such as verbal imagery or musical imagery. This modality of mental imagery differs from other sensory images such as motor imagery or visual imagery. The vividness and detail of auditory imagery can vary from person-to-person depending on their background and condition of their brain. Through all of the research developed to understand auditory imagery behavioral neuroscientists have found that the auditory images developed in subjects' minds are generated in real time and consist of fairly precise information about quantifiable auditory properties as well as melodic and harmonic relationships. These studies have been able to recently gain confirmation and recognition due to the arrival of Positron emission tomography and fMRI scans that can confirm a physiological and psychological correlation.
The accuracy of tempo within an auditory image usually suffers when recalled however the consistency of a person’s perception of tempo is preserved. When surveying subject’s auditory imagery that their sense of tempo usually stays within 8% of the original tempo heard in a song that the subject heard at some point in the past. This was shown by having subjects compare the pitch of two words in a song. For instance, people can sing through “Jingle Bells” in their head and determine if there is a difference in pitch between the word ‘Snow’ and ‘Sleigh’. Experiments like this have shown it takes longer to compare the pitches of two words if the space between the two words is larger. Therefore the tempo structure of the melody is preserved in the auditory image. However, if someone had musical training then the person has more flexibility in his or her auditory imagery tempo representations.
Humans retain a relatively strong auditory image for details in pitch, which can be improved with musical training. The development of cultivating an auditory image with absolute pitch, which is being able to determine a note upon hearing a sound, however, is dependent on childhood musical training and genetic factors. People are able to improve their discrimination of pitch; however, they cannot improve their detection. Auditory image pitch detection studies have shown that response time decreases when judging two high pitches as opposed to judging two low pitches.
Of the many aspects of sound, loudness is a characteristic of auditory imagery that is usually lost or impaired. This is evident when people attempt to image a song and there is little noticeable volume dynamics in the auditory image. According to Pitt and Crowder, the encoding of loudness into our auditory imagery was shown to have little correlation with any physiological neural factors. Other scientists such as Intons-Petersons believe that there is encoding for loudness in our auditory images that if so it most likely occurs in a person’s motor cortex.
The auditory imagery developed from lyrics or words generally is also considered a part of inner speech. When people image their voice or the voices of others it is considered inner speech but some researchers argue that it is a lack of self-monitoring of speech. This generally refers to imagining speech which can occur when trying to remember what someone said or the sound of their voice which can be elicited voluntarily or involuntarily. Auditory verbal imagery is considered useful for practicing and organizing things people would like to say in person. For instance, practicing a speech or getting ready to sing a part in a song.
Cognitive scientists are very interested in finding out what brain structures are involved with mental imaging in order to provide consistent, localized, and more tangible evidence. It has been established that auditory imagery makes use of the right lobe since people with right lobe lesions tend to have difficulty generating auditory images. This is because auditory imaging requires the usage of the frontal and superior temporal right lobe as well as a lot of the right auditory association cortices. These portions of the brain are usually involved with interpreting the inflections of sounds (such as sad or angry sounds).
The supplementary motor area is also involved in image generation and encodes motor processes to do, while the right thalamus is hypothesized to be a part of auditory image retrieval. The activation of the supplementary motor area is also relevant since it is a portion of the brain that is involved when a motor task is imagined as opposed to overly executed. This shows that developing an auditory image is partially a motor task.
During auditory verbal imagery, the inferior frontal cortex and the insula were activated as well as the supplementary motor area, left superior temporal/inferior parietal region, the right posterior cerebellar cortex, the left precentral, and superior temporal gyri. Other areas of the brain have been activated during auditory imagery however there hasn’t been an encoding process attributed to it yet such as frontopolar areas, and the subcallosal gyrus.
As associations between pieces of sound such as music or repetitive dialogue become stronger and more complex even the silence involved in the sound can initiate auditory images in the brain. Studies have been done in which people listen to a CD over and over with silence in between tracks and the neural activity was analyzed using fMRI. It was consistently found the prefrontal cortex and premotor cortical areas were active during the anticipation of auditory imagery. The caudal PFC was used a lot during the early stages of learning of the song while in later stages the rostral PFC was used more indicating a shift in the cortex regions used during auditory imaging association.
Musical training and experienceEdit
Musical training has consistently shown to be a powerful way to refine auditory imagery enabling people to discern and manipulate various characteristics of sound such as pitch, timbre, tempo, etc. Musical training can cause localized networks of neurons to fire synchronously a lot more easily through spatial temporal firing patterns (Hebbian theory), which may explain why non-musical auditory imagery is enhanced in musically trained subjects. For musically naïve people, music is mainly an external experience. Naïve people are significantly worse than highly trained individuals on all auditory imagery tasks.
Difference in vividnessEdit
Even though subjects can’t confuse an auditory image as a perceived sound some people may experience very vivid auditory images. The difference in vividness from person to person can be an important neuronal correlation of sensory processes and higher-order cognition. The Bucknell Auditory Imagery Score assesses the vividness of a person’s auditory imagery and was shown to correlate directly with the neuronal activity of the superior temporal gyrus as well as the prefrontal cortex. Musical training does not produce an improvement in the vividness of auditory images however data showing if vividness can be improved or a circuit dedicated to vividness has shown to be inconclusive.
There have been a few studies conducted concerning auditory imagery generated in subjects during dreaming. There are different kinds of auditory imagery people experience in their dreams when waking up from rapid eye movement sleep. Auditory imagery is generally fairly common in rapid eye movement sleep with the majority of it being verbal auditory imagery. Studies found that the last auditory images in a dream are usually words spoken by the self-character in the dream. Some findings concerning the dream auditory imagery in patients with brain lesions and children’s dreams have been done as well but are more speculative.
There is a lot of anecdotal evidence that reading musical notation can cause musicians to sense an auditory image of the notes they are reading which is a phenomenon called notational audiation. Present studies show that only some musicians who can read musical notation can hear an inner voice emulating the melody while reading the notation which has served as an interesting mode of study to understand the way information is encoded in the brain. Musicians have their sense of notational audiation significantly impaired during phonatory distractions due to the conflicting signals induced onto a single sensory modality. Some musicians who are proficient at reading sheet music may experience an auditory image while reading over the excerpt for Symphony No. 40 from Mozart below.
Schizophrenic patients have a weakened sense of reality and how to respond to reality. Moreover, 60% of patients suffering from schizophrenia are hypothesized to have a much more vivid sense of auditory imagery. When normal subjects and schizophrenic subjects were both asked to generate an auditory image, schizophrenic patients were shown to have a much weaker activation of the posterior cerebral cortex, hippocampi, bilateral lenticular nuclei, right thalamus, middle and superior cortex, and left nucleus accumbens. These areas are important to inner speech and verbal self-monitoring which may explain why schizophrenia is more likely to induce auditory hallucinations. These auditory hallucinations differ from an internal monologue which is usually imagined in the first person. The hallucinations on the other hand are imagined in the second and third person which is speculated to be caused by increased activity in the left premotor, middle temporal and inferior parietal cortex, and supplementary motor area during second or third person imagery.
Implications and research directionsEdit
Studies on auditory imagery can give insight to involuntary intrusive images called earworms. A relatable phenomenon in which the lay person has experienced an earworm is when a jingle gets stuck in a person’s head. However, some people suffering from obsessive compulsive disorder may have stubborn earworms that stay for a much longer period of time on the order of years in which research in auditory imagery may be able to salvage them and get rid of their auditory image.
These studies are important for psychologists who want to understand how human memory and musical cognition works. For most modes of memory, people do not spontaneously remember facts or ideas throughout their day unless it is pressing to their current situation, however auditory imagery can spontaneously and constantly occur to people so evidence tells that this mode of memory differs from others. For instance, the auditory images that are remembered are usually 10–20 seconds long, however remembering facts or scenes do not necessarily hold time stamps like auditory images do. This insight would hold relevance on understanding the relationship of music and memory.
Moreover, musicians and music educators may be able to lessen the amount of practice they have to physically do by honing their auditory imagery due to the refinement of auditory discrimination and organization. By improving a person's ability to manipulate their 'inner ear' and concept of auditory images they can learn and play music better on a shorter time scale with less effort.
- Levitin, D. J., & Cook, P. R. (1996). Memory for musical tempo: Additional evidence that auditory memory is absolute. Attention, Perception, & Psychophysics, 58(6), 927-935.
- Hubbard, T. L. (2010). Auditory imagery: Empirical findings. Psychological bulletin, 136(2), 302.
- Jensen, M. (2005). Auditory imagery: a review and challenges ahead: Technical report, SSKKII-2005.01. SSKKII center for cognitive science, Göteborg University, Sweden.
- Shergill, S., Bullmore, E., Brammer, M., Williams, S., Murray, R., & McGuire, P. (2001). A functional study of auditory verbal imagery. Psychological medicine, 31(2), 241-253.
- Zatorre, R. J., & Halpern, A. R. (2005). Mental concerts: musical imagery and auditory cortex. Neuron, 47(1), 9-12.
- Halpern, A. R., & Zatorre, R. J. (1999). When that tune runs through your head: a PET investigation of auditory imagery for familiar melodies. Cerebral Cortex, 9(7), 697-704
- Zatorre, R. J., Halpern, A. R., Perry, D. W., Meyer, E., & Evans, A. C. (1996). Hearing in the mind's ear: A PET investigation of musical imagery and perception. Journal of Cognitive Neuroscience, 8(1), 29-46.
- Kraemer, D. J. M., Macrae, C. N., Green, A. E., & Kelley, W. M. (2005). Musical imagery: sound of silence activates auditory cortex. Nature, 434(7030), 158-158.
- Leaver, A. M., Van Lare, J., Zielinski, B., Halpern, A. R., & Rauschecker, J. P. (2009). Brain activation during anticipation of sound sequences. The Journal of Neuroscience, 29(8), 2477-2485
- Lotze, M., Scheler, G., Tan, H. R. M., Braun, C., & Birbaumer, N. (2003). The musician's brain: functional imaging of amateurs and professionals during performance and imagery. Neuroimage, 20(3), 1817-1829.
- Meister, I. G., Krings, T., Foltys, H., Boroojerdi, B., Müller, M., Töpper, R., & Thron, A. (2004). Playing piano in the mind—an fMRI study on music imagery and performance in pianists. Cognitive Brain Research, 19(3), 219-228.
- Aleman, A., Nieuwenstein, M. R., Böcker, K. B. E., & de Haan, E. H. F. (2000). Music training and mental imagery ability. Neuropsychologia, 38(12), 1664-1668.
- Herholz, S. C., Halpern, A. R., & Zatorre, R. J. (2012). Neuronal correlates of perception, imagery, and memory for familiar tunes. Journal of cognitive neuroscience, 24(6), 1382-1397.
- Brodsky, W., Henik, A., Rubinstein, B. S., & Zorman, M. (2003). Auditory imagery from musical notation in expert musicians. Attention, Perception, & Psychophysics, 65(4), 602-612.
- Oertel, V., Rotarska-Jagiela, A., van de Ven, V., Haenschel, C., Grube, M., Stangier, U., . . . Linden, D. E. J. (2009). Mental imagery vividness as a trait marker across the schizophrenia spectrum. Psychiatry Research, 167(1–2), 1-11. doi: 10.1016/j.psychres.2007.12.008
- Shergill, S. S., Bullmore, E., Simmons, A., Murray, R., & McGuire, P. (2000). Functional anatomy of auditory verbal imagery in schizophrenic patients with auditory hallucinations. American Journal of Psychiatry, 157(10), 1691-1693.
- Seal, M., Aleman, A., & McGuire, P. (2004). Compelling imagery, unanticipated speech and deceptive memory: Neurocognitive models of auditory verbal hallucinations in schizophrenia. Cognitive Neuropsychiatry, 9(1-2), 43-72.
- Communications, A. U. (2012). Involuntary Musical Imagery (earworms) - research by Lassi Liikkanen, Aalto University. aaltouniversity, from https://www.youtube.com/watch?v=gc5my6Lfipo
- Liikkanen, L. A. New Directions for Understanding Involuntary Musical Imagery. Archived April 28, 2016, at the Wayback Machine