The field of articulatory phonetics is a subfield of phonetics. In studying articulation, phoneticians explain how humans produce speech sounds via the interaction of different physiological structures.
Generally, articulatory phonetics is concerned with the transformation of aerodynamic energy into acoustic energy. Aerodynamic energy refers to the airflow through the vocal tract. Its potential form is air pressure; its kinetic form is the actual dynamic airflow. Acoustic energy is variation in the air pressure that can be represented as sound waves, which are then perceived by the human auditory system as sound.
The vocal tract can be viewed through an aerodynamic-biomechanic model that includes three main components:
- air cavities
- air valves
Air cavities are containers of air molecules of specific volumes and masses. The main air cavities present in the articulatory system are the supraglottal cavity and the subglottal cavity. They are so-named because the glottis, the openable space between the vocal folds internal to the larynx, separates the two cavities. The supraglottal cavity or the orinasal cavity is divided into an oral subcavity (the cavity from the glottis to the lips excluding the nasal cavity) and a nasal subcavity (the cavity from the velopharyngeal port, which can be closed by raising the velum). The subglottal cavity consists of the trachea and the lungs. The atmosphere external to the articulatory stem may also be considered an air cavity whose potential connecting points with respect to the body are the nostrils and the lips.
Pistons are initiators. The term initiator refers to the fact that they are used to initiate a change in the volumes of air cavities, and, by Boyle's Law, the corresponding air pressure of the cavity. The term initiation refers to the change. Since changes in air pressures between connected cavities lead to airflow between the cavities, initiation is also referred to as an airstream mechanism. The three pistons present in the articulatory system are the larynx, the tongue body, and the physiological structures used to manipulate lung volume (in particular, the floor and the walls of the chest). The lung pistons are used to initiate a pulmonic airstream (found in all human languages). The larynx is used to initiate the glottalic airstream mechanism by changing the volume of the supraglottal and subglottal cavities via vertical movement of the larynx (with a closed glottis). Ejectives and implosives are made with this airstream mechanism. The tongue body creates a velaric airsteam by changing the pressure within the oral cavity: the tongue body changes the mouth subcavity. Click consonants use the velaric airstream mechanism. Pistons are controlled by various muscles.
Valves regulate airflow between cavities. Airflow occurs when an air valve is open and there is a pressure difference between the connecting cavities. When an air valve is closed, there is no airflow. The air valves are the vocal folds (the glottis), which regulate between the supraglottal and subglottal cavities, the velopharyngeal port, which regulates between the oral and nasal cavities, the tongue, which regulates between the oral cavity and the atmosphere, and the lips, which also regulate between the oral cavity and the atmosphere. Like the pistons, the air valves are also controlled by various muscles.
To produce any kind of sound, there must be movement of air. To produce sounds that people can interpret as spoken words, the movement of air must pass through the vocal cords, up through the throat and, into the mouth or nose to then leave the body. Different sounds are formed by different positions of the mouth—or, as linguists call it, "the oral cavity" (to distinguish it from the nasal cavity).
The two classes of soundsEdit
Sounds of all languages fall under two categories: Consonants and Vowels.
Consonants are produced with some form of restriction or closing in the vocal tract that hinders the airflow from the lungs. Consonants are classified according to where in the vocal tract the airflow has been restricted. This is also known as the place of articulation.
Places of articulationEdit
Movement of the tongue and lips can create these constrictions and by forming the oral cavity in different ways, different sounds can be produced.
Bilabial sounds are produced with both lips, such as [b], [m], and [p].
[f] and [v] are articulated by placing the upper teeth against the lower lip.
[θ] and [ð] are both spelled as "th" (θ as in think) (ð as in the). They are pronounced by inserting the tip of the tongue between the teeth.
[t] [d] [n] [s] [z] [l] [r] are produced in many ways where the tongue is raised towards the alveolar ridge.
[t, d, n] the tip of the tongue is raised and touches the ridge.
[s, z] the sides of the front of the tongue are raised, but the tip is lowered so that air escapes over it.
[l] the tip of the tongue is raised while the rest of the tongue remains down, permitting air to escape over its sides. Hence, [l] is called a lateral sound (âm biên).
[r] [IPA ɹ] curl the tip of tongue back behind the alveolar ridge, or bunch up the top of the tongue behind the ridge, the air escapes through the central part of the mouth.
[ʃ] [ʒ] [tʃ] [dʒ] [j] are produced by raising the front part of the tongue to the palate.
[k] [ɡ] [ŋ] are produced by raising the back part of the tongue to the soft palate or the velum.
[ʀ] [q] [ԍ] these sounds are produced by raising the back of the tongue to the uvula. The 'r' in French and German may be an uvular trill (symbolized by [ʀ]). The uvular sounds [q] and [ԍ] occur in Arabic. These do not normally occur in English.
[h] [ʔ] the sound [h] is from the flow of air coming from an open glottis, past the tongue and lips as they prepare to pronounce a vowel sound, which always follows [h]. if the air is stopped completely at the glottis by tightly closed vocal cords the sound upon release of the cords is called a glottal stop [ʔ].
Vowels are produced by the passage of air through the larynx and the vocal tract. Most vowels are voiced (i.e. the vocal folds are vibrating). Except in some marginal cases, the vocal tract is open, so that the airstream is able to escape without generating fricative noise.
Variation in vowel quality is produced by means of the following articulatory structures:
The larynx is used to differentiate voiced and voiceless vowels. In addition, the pitch of the vowel is changed by altering the frequency of vibration of the vocal folds. In some languages there are contrasts among vowels with different phonation types.
Vowels may be made pharyngealized (also epiglottalized, sphincteric or strident) by means of a retraction of the tongue root.:306-310 Vowels may also be articulated with advanced tongue root.:298 There is discussion of whether this vowel feature (ATR) is different from the Tense/Lax distinction in vowels. :302-6
Soft palate (velum)Edit
Vowels are normally produced with the soft palate raised so that no air escapes through the nose. However, vowels may be nasalized as a result of lowering the soft palate. Many languages use nasalization contrastively. :298-300
The tongue is a highly flexible organ that is capable of being moved in many different ways. For vowel articulation the principal variations are vowel height and the dimension of backness and frontness.. A less common variation in vowel quality can be produced by a change in the shape of the front of the tongue, resulting in a rhotic or rhotacized vowel. 
What the above equations express is that given an initial pressure P1 and volume V1 at time 1 the product of these two values will be equal to the product of the pressure P2 and volume V2 at a later time 2. This means that if there is an increase in the volume of cavity, there will be a corresponding decrease in pressure of that same cavity, and vice versa. In other words, volume and pressure are inversely proportional (or negatively correlated) to each other. As applied to a description of the subglottal cavity, when the lung pistons contract the lungs, the volume of the subglottal cavity decreases while the subglottal air pressure increases. Conversely, if the lungs are expanded, the pressure decreases.
A situation can be considered where (1) the vocal fold valve is closed separating the supraglottal cavity from the subglottal cavity, (2) the mouth is open and, therefore, supraglottal air pressure is equal to atmospheric pressure, and (3) the lungs are contracted resulting in a subglottal pressure that has increased to a pressure that is greater than atmospheric pressure. If the vocal fold valve is subsequently opened, the previously two separate cavities become one unified cavity although the cavities will still be aerodynamically isolated because the glottic valve between them is relatively small and constrictive. Pascal's Law states that the pressure within a system must be equal throughout the system. When the subglottal pressure is greater than supraglottal pressure, there is a pressure inequality in the unified cavity. Since pressure is a force applied to a surface area by definition and a force is the product of mass and acceleration according to Newton's Second Law of Motion, the pressure inequality will be resolved by having part of the mass in air molecules found in the subglottal cavity move to the supraglottal cavity. This movement of mass is airflow. The airflow will continue until a pressure equilibrium is reached. Similarly, in an ejective consonant with a glottalic airstream mechanism, the lips or the tongue (i.e., the buccal or lingual valve) are initially closed and the closed glottis (the laryngeal piston) is raised decreasing the oral cavity volume behind the valve closure and increasing the pressure compared to the volume and pressure at a resting state. When the closed valve is opened, airflow will result from the cavity behind the initial closure outward until intraoral pressure is equal to atmospheric pressure. That is, air will flow from a cavity of higher pressure to a cavity of lower pressure until the equilibrium point; the pressure as potential energy is, thus, converted into airflow as kinetic energy.
Sound sources refer to the conversion of aerodynamic energy into acoustic energy. There are two main types of sound sources in the articulatory system: periodic (or more precisely semi-periodic) and aperiodic. A periodic sound source is vocal fold vibration produced at the glottis found in vowels and voiced consonants. A less common periodic sound source is the vibration of an oral articulator like the tongue found in alveolar trills. Aperiodic sound sources are the turbulent noise of fricative consonants and the short-noise burst of plosive releases produced in the oral cavity.
- Non-vocal fold vibration: 20–40 hertz (cycles per second)
- Vocal fold vibration
- Lower limit: 70–80 Hz modal (bass), 30–40 Hz creaky
- Upper limit: 1170 Hz (soprano)
Vocal fold vibrationEdit
- cricoid cartilage
- thyroid cartilage
- arytenoid cartilage
- interarytenoid muscles (fold adduction)
- posterior cricoarytenoid muscle (fold abduction)
- lateral cricoarytenoid muscle (fold shortening/stiffening)
- thyroarytenoid muscle (medial compression/fold stiffening, internal to folds)
- cricothyroid muscle (fold lengthening)
- hyoid bone
- sternothyroid muscle (lowers thyroid)
- sternohyoid muscle (lowers hyoid)
- stylohyoid muscle (raises hyoid)
- digastric muscle (raises hyoid)
- Magnetic resonance imaging (MRI) / Real-time MRI 
- Medical ultrasonography
- Electromagnetic articulography
In order to understand how sounds are made, experimental procedures are often adopted. Palatography is one of the oldest instrumental phonetic techniques used to record data regarding articulators. In traditional, static palatography, a speaker's palate is coated with a dark powder. The speaker then produces a word, usually with a single consonant. The tongue wipes away some of the powder at the place of articulation. The experimenter can then use a mirror to photograph the entire upper surface of the speaker's mouth. This photograph, in which the place of articulation can be seen as the area where the powder has been removed, is called a palatogram.
Technology has since made possible electropalatography (or EPG). In order to collect EPG data, the speaker is fitted with a special prosthetic palate, which contains a number of electrodes. The way in which the electrodes are "contacted" by the tongue during speech provides phoneticians with important information, such as how much of the palate is contacted in different speech sounds, or which regions of the palate are contacted, or what the duration of the contact is.
- Note that although sound is just air pressure variations, the variations must be at a high enough rate to be perceived as sound. If the variation is too slow, it will be inaudible.
- "Laver, John Principles of Phonetics, 1994, Cambridge University Press
- "Peter Ladefoged and Ian Maddieson The Sounds of the World's Languages, 1996, Blackwell; ISBN 0-631-19815-6
- Stated in a less abbreviatory fashion: pressure1 × volume1 = pressure2 × volume2
- volume1 divided by sum of volume1 and change in volume = sum of pressure1 and the change in pressure divided by pressure1
- Niebergall, A; Zhang, S; Kunay, E; Keydana, G; Job, M; et al. (2010). "Real-time MRI of Speaking at a Resolution of 33 ms: Undersampled Radial FLASH with Nonlinear Inverse Reconstruction". Magn. Reson. Med. doi:10.1002/mrm.24276..
- Ladefoged, Peter (1993). A Course In Phonetics (3rd ed.). Harcourt Brace College Publishers. p. 60.
- Interactive place and manner of articulation
- Observing your articulators
- QMU's CASL Research Centre site for ultrasound tongue imaging
- Seeing Speech – with reference examples of IPA sounds using MRI and ultrasound tongue imaging
- UCLA Electromagnetic articulography
- UCLA Aerometry
- UCLA Electrolaryngography
- Articulatory Phonetic Alphabet