Formulaic language

(Redirected from Formulaic sequence)

Formulaic language (previously known as automatic speech or embolalia) is a linguistic term for verbal expressions that are fixed in form, often non-literal in meaning with attitudinal nuances, and closely related to communicative-pragmatic context.[1] Along with idioms, expletives and proverbs, formulaic language includes pause fillers (e.g., "Like", "Er" or "Uhm") and conversational speech formulas (e.g., "You've got to be kidding," "Excuse me?" or "Hang on a minute").


The word embolalia comes from the Greek word embolos which means 'something thrown in', from the word emballo- meaning 'to throw in',[2] and -lalia meaning 'speech, chattering and babbling; abnormal or disordered form of speech.[3]

Modern linguists led by Leonard Bloomfield in 1933 call these "hesitation forms", the sounds of stammering (uh), stuttering (um, um), throat-clearing (ahem!), stalling (well, um, that is), interjected when the speaker is groping for words or at a loss for the next thought.[4]

French psychiatrist Jules Séglas, on the other hand, defined the term embolalia as "the regular addition of prefixes or suffixes to words" and mentioned that the behavior is sometimes used by normal individuals to demonstrate to their interlocutor that they are paying attention to the conversation.[5]

Harry Levin and Irene Silverman called formulaic language "vocal segregates" in their 1965 paper on hesitation phenomena and found out from their experiments on children that these segregates seem to be less voluntary hesitation phenomena and may be signs of uncontrolled emotionality under stress.[6]

The Irish poet William Butler Yeats argued for formulaic language experiments with his wife,[7] which provided him with symbols for his poetry as well as literary theories.[8]


Linguistic featuresEdit

Definition of formulaic sequencesEdit

According to The Canadian Modern Language Review, formulaic sequences are "fixed combinations of words that ... can facilitate fluency in speech by making pauses shorter and less frequent, and allowing longer runs of speech between pauses".[9]

A formulaic sequence is "a sequence, continuous or discontinuous, of words or other elements, which is, or appears to be, prefabricated: that is, stored and retrieved whole from memory at the time of use, rather than being subject to generation or analysis by the language grammar."[10]

They can be found everywhere in language use and "make up a large proportion of any discourse".[11] Formulaic sequences can be of any length and can be used to express messages, functions, social solidarity and process information very fast without communication misunderstanding.[12]

Morphology and phonologyEdit

Filled pauses

Filled pauses consist of repetitions of syllables and words, reformulation or false starts where speakers rephrase their speech to fit the representation they best perceive, grammatical repairs, and partial repeats that often involve searching for the right words in one's lexicon to carry across an intended meaning.[13] There are basically three distinct forms for filled pauses: (i) an elongated central vowel only; (ii) a nasal murmur only; and (iii) a central vowel followed by a nasal murmur.[14] Although a schwa-like quality [ə:], appears to be the most commonly used, some speakers consistently using the neutral vowel [ɨ:] instead, and others use both vowels in the same sentence, depending on the quality of the previous word last vowel.[14] Filled pauses vocalizations may be built around central vowels and speakers may differ in their preferences, but that they do not appear to behave as other words in the language.[14] The lengthening of words ending in a coronal fricative, for instance, could be obtained by prolonging the entire rhyme and/or the fricative only.[14] Most of the time, however, the neutral vowel [ɨ:] is appended to achieve the desired effect.[14]

Prolonged pauses

Similarly to filled pauses, single occurrences of prolonged pauses occurring between stretches of fluent speech, may be preceded and followed by silent pauses, as they most often occur on function words with a CV or V structure.[14] Even though they are not always central, the vowels of such syllables may be as long as the ones observed for filled pauses.[14]

Retraced and unretraced restarts

Riggenbach's 1991 study of fluency development in Chinese learners of English had an analysis of repair phenomena, which included retraced restarts and unretraced restarts.[15] Retraced restarts refer to the reformulations whereby a portion of the original utterance is duplicated.[15] They can either involve repetition, that is, the precise adjacent duplication of a sound, syllable, word or phrase, or insertion, which refers to a retraced restart with the addition of new unretraced lexical items.[15] Conversely, unretraced restarts refer to reformulations that reject the original utterance, similarly known as false starts.[15]

Semantics and pragmaticsEdit

The semantics of formulaic language have often been debated on, and to date, there lacks a consensus on whether or not filler words are intentional in speech and whether or not they should be considered as words or if they are simply side effects of difficulties in the planning process of speech by speakers. Bailey & Ferriera's (2007) paper[16] found that there is little evidence to suggest that the use of filler words are intentional in speech and that they should not be considered as words in the conventional sense.

Filler words consist of "Non-lexical fillers" and "Lexical fillers".[15] "Non-lexical fillers" are recognized as fillers that are not words and "Lexical fillers" are recognized as fillers that are words and both types of fillers are thought to contain little or no semantic information.[15] However, some filler words are used to express certain speech acts. "Yeah", a "Lexical filler", is used to give affirmation, introduce a new topic, shows speaker's perception and understanding, and occurs after a speech management problem when the speakers does not how to continue their speech.[17] Fillers like "Mmmm", a "Non-lexical filler", and "Well", a "Lexical filler", are also said to signal listener's understanding of the information provided.[17]

Research has shown that people were less likely to use formulaic language in general topics and domains they were more well-versed in, because they were more adept at selecting the appropriate terms.[18] To date, there is insufficient research done to say if fillers are a part of integral meaning, or if they are aspect of performance,[19] but we can say that they are useful in facilitating information for the listener.[20]


Formulaic language is more likely to occur at the beginning of utterance or phrase and the reason is because it is presumed that there is a greater demand on planning processes at these junctures.[21] Features of formulaic language, like filled pauses or repetitions, are most likely to occur immediately prior to the onset of a complex syntactic constituent.[22] Filled pauses are also likely after the initial word in a complex constituent, especially after function words.[22] Therefore, listeners might be able to use the presence of a recent filled pause to predict that an ambiguous structure, and this trait is in favor of a more complex analysis .[16]

There are several different types of formulaic language. One type is relatively universal, often transcending differences in language and to some degree culture. Simple fillers like "Uhm", "Uh", or "Er" are used by many different people in many different settings.[23] For the most part, these types of fillers are considered innocuous, and are often overlooked by listeners, as long as they are not utilized so often that they overshadow the remainder of the conversation.[24]

Other forms of formulaic language are ingrained within specific cultures, and in fact are sometimes considered an identifying characteristic of people who share a particular religion, or live in a specific geographical region.[24] Along with accents, formulaic language of this type is sometimes considered colorful and somewhat entertaining. Writers often make use of this type of speech to give the characters in their writings additional personality, helping to make them unique.[25]


The study conducted by Dechert (1980) that investigated the speech performance of a German student of English revealed that there is a tendency for speech pauses to be situated at breaks that are consistent with "episodic units".[26] Dechert (1980) found that the more fluent utterances exhibited more pauses at those junctures and lesser within the "episodic units", leading him to posit that the study subject was able to use the narrative structure to pace his own speech with natural breaks in order for him to scout for the words and phrases that are to follow subsequently.[26]

Through the comparison of the story retelling utterances collated of second language learners, Lennon (1984) discovered notable disparities in the distribution of pauses between recounting in the research subjects' first and second languages respectively.[27] The study found that all of the pauses were found to be located either at clause breaks or following nonintegral components of the clause, without pauses within the clauses.[27] On the other hand, the narrators who spoke using their second language exhibited different patterns, with a higher frequency of pauses occurring within the clauses, leading to the conclusion posited by Lennon to be that the speakers seem to be "planning within clauses as well as in suprasegmental units", and hence, the occurrence of pauses within clauses and not at the intersection of clauses could well be an indicator distinguishing fluent and confluent speech.[27]

Discourse featuresEdit

Cognitive loadEdit

Cognitive load is an important predictor of formulaic language.[20] More disfluency is found in longer utterances[28] and when the topic is unfamiliar.[20] In Wood's book, he suggested that when a high degree of cognitive load occurs, such as during expository speech or impromptu descriptions of complex interrelated topics, even native speakers can suffer from disfluency.[29]

Speech rateEdit

Formulaic phrases are uttered more quickly than comparable non-formulaic phrases.[30] Speech rate is closely related to cognitive load.[31] Depending on the cognitive load, the rates of a speaker's utterances are produced either faster or slower, in comparison to a fixed speaking rate which happens usually.[31] For example, speech rate becomes slower when having to make choices that are not anticipated, and tend to accelerate when words are being repeated.[31] In fast conditions, cognitive processes that result in a phonetic plan, fail to keep up with articulation, and thus, the articulation of the existing plan is restarted,[32] resulting in the repetition of words which is more likely to happen but no more likely than fillers.[20]

Frequency of wordsEdit

In Beattie and Butterworth's (1979) study, low frequency content words and those rated as contextually improbable were preceded by hesitations such as fillers.[33] Speakers, when choosing to use low frequency words in their speech, are aware, and are more likely to be disfluent.[33] This is further supported by Schnadt and Corley where they found that prolongations and fillers increased in words just before multiple-named or low frequency items.[21]

Domain (addressor vs. addressee)Edit

Humans are found to be more disfluent overall when addressing other humans than when addressing machines.[34] More instances of formulaic language is found in dialogues than in monologues.[34] The different roles the addresser played (such as a sister, a daughter or a mother) greatly influences the numbers of disfluencies, particularly, fillers produced, regardless of length or complexity.[35]


Comprehension cuesEdit

There is a common agreement that disfluencies are accompanied by important modifications both at the segmental and prosodic levels and that speakers and listeners use such cues systematically and meaningfully. Thus they appear as linguistic universal devices that are similar to other devices and are controlled by the speaker and regulated by language specific constraints.[14] In addition, speech disfluencies such as fillers can help listeners to identify upcoming words.[36]

While formulaic language can serve as a useful cue that more is to come, some people do develop an unconscious dependence on these filler words.[37] When this is the case, it is necessary to correct the problem by making the speaker be aware of their over-reliance on formulaic language production and by training the person to make more efficient use of other verbal strategies. As the individual gains confidence and is less apt to have a need for filler words, the predilection toward formulaic language is then able to gradually diminish.[25]

A study done by Foxtree (2001)[38] showed that both English and Dutch listeners were faster to identify words in a carrier sentence when it was preceded with an "Uh" instead of without an "Uh", which suggested that different fillers have different effects as they might be conveying different information.[20]

Fischer and Brandt-Pook also found out that discourse particles mark thematic breaks, signal the relatedness between the preceding and following utterance, indicate if the speaker has understood the content communicated, and support the formulation process by signalling possible problems in speech management.[17]

While fillers might give listeners cues about the information being conveyed, Bailey & Ferreira's study[39] made a distinction between "Good Cues" and "Bad Cues" in facilitating listener's comprehension. A "Good Cue" leads the listener to correctly predict the onset of a new constituent (Noun Phrase, Verb Phrase), whereas a "Bad Cue" leads the listener to incorrectly predict the onset of a new constituent.[39] "Good Cue" make it easier for listeners to process the information they have been presented while "Bad Cue" make it harder for listeners process the relevant information.[39]

There is strong empirical evidence that speakers use formulaic language in similar ways across languages and that formulaic language plays a fundamental role in the structuring of spontaneous speech, as they are used to achieve a better synchronization between interlocutors by announcing upcoming topic changes, delays related to planning load or preparedness problems, as well as speaker's intentions to take/give the floor or to revise/abandon an expression he/she had already presented.[14]

Communicative goalsEdit

A study conducted by Clark and Foxtree (2002)[40] mentioned that parts of formulaic language, such as fillers, serve a communicative function and are considered integral to the information the speaker tries to convey, although they do not add to the propositional content or the primary message.[40] Instead, they are considered part of a collateral message where the speaker is commenting on her performance.[40] Speakers produce filled pauses (e.g. "Uh" or "Um") for a variety of reasons, including the intention to discourage interruptions or to gain additional time to plan utterances.[16]

Another communicative goal includes the attention-impelling function,[4] which explores another purpose of hesitation forms as being to dissociate oneself slightly from the harsh reality of what is to follow.[4] With the use of a beat of time filled with a meaningless interjection, uncommitted people who are "into distancing" make use of such formulaic language to create a little distance between themselves and their words, as if it might lessen the impact of their words.[4]

However, not all forms of formulaic language are considered appropriate or harmless. There are examples of formulaic language production that lean towards being offensive, for instance, the use of anything considered to be profanity within a given culture.[25]

In this form, the speech is usually the insertion of swear words within the sentence structure used to convey various ideas. At times, this use of formulaic language comes about due to the individual being greatly distressed or angry.[25] However, there are situations where swear words are inserted unconsciously even if the individual is extremely happy.[25] When the use of swear words is called to the attention of the individual, he or she may not even have been aware of the usage of such formulaic language.[25]

Neurological basisEdit

Medical casesEdit


Many patients who suffer from aphasia retain the ability to produce formulaic language, including conversational speech formulas and swear words—in some cases, patients are unable to create words or sentences, but they are able to swear. Also, the ability to pronounce other words can change and evolve during the process of recovery, while pronunciation and use of swear words remain unchanged.[1]

Patients who are affected by transcortical sensory aphasia, a rare form of aphasia, have been found to exhibit formulaic language that is characterised by "lengthy chunks of memorized material".[41]

Apraxia of speechEdit

Apraxia of speech can also occur in conjunction with dysarthria (muscle weakness affecting speech production) or aphasia (language difficulties related to neurological damage).[42]

One of the articulatory characteristics of apraxia of speech found in adults includes speech behavior that "exhibits fewer errors with formulaic language than volitional speech".[43] Developmental verbal dyspraxia has also been found to have more effect on volitional speech than on formulaic language.[44]

The characteristics of apraxia of speech include difficulties in imitating speech sounds, imitating no-speech movements, such as sticking out the tongue, groping for sounds, and in severe cases, the inability to produce any sounds, inconsistent errors and a slow rate of speech. However, patients who suffer from apraxia of speech may retain the ability to produce formulaic language, such as "thank you" or "how are you?".[42] Apraxia of speech can also occur in conjunction with dysarthria, an illness which inflicts muscle weakness affecting speech production), or aphasia, which causes language difficulties related to neurological damage.[42]

Developmental coordination disorderEdit

Developmental coordination disorder is a chronic neurological disorder that affects the voluntary movements of speech.[45] Children with developmental coordination disorder are unable to formulate certain kinds of voluntary speech; however, they may speak set words or phrases spontaneously, constituting formulaic language—although they may not be able to repeat them on request.[45]

See alsoEdit


  1. ^ a b Stahl, Benjamin; Van Lancker Sidtis, Diana (2015), "Tapping into neural resources of communication: formulaic language in aphasia therapy", Frontiers in Psychology, 6 (1526): 1–5, doi:10.3389/fpsyg.2015.01526, PMC 4611089, PMID 26539131
  2. ^ mondofacto. "embolalia - Definition".
  3. ^ "lalo-, lallo-, lalio-, lal-, -lalia, -lalic + - Word Information".
  4. ^ a b c d Safire, William (16 June 1991). "On Language; Impregnating the Pause". The New York Times. p. 8.
  5. ^ Obler, Loraine K.; Albert, Martin L. (1985), "Historical Note: Jules Seglas on Language in Dementia", Brain and Language, 24 (2): 314–325, doi:10.1016/0093-934X(85)90138-5, PMID 3884087, S2CID 22724372
  6. ^ Levin, Harry; Silverman, Irene (1965), "Hesitation Phenomena in Children's Speech", Language and Speech, 8 (2): 67–85, doi:10.1177/002383096500800201, S2CID 143111880
  7. ^ Dekel, Gil (2008), "Wordless Silence of Poetic Mind: Outlining and Visualising Poetic Experiences through Artmaking", Forum: Qualitative Social Research, 9 (2)
  8. ^ An Overview of Yeats A Vision
  9. ^ Wood, David (September 2006). "Uses and Functions of Formulaic Sequences in Second-Language Speech: An Exploration of the Foundations of Fluency". The Canadian Modern Language Review. 63 (1): 13–33 – via Project MUSE.
  10. ^ Wray, Alison (2002). Formulaic Language and the Lexicon. Cambridge: Cambridge University Press. p. 9. ISBN 978-0521022125.
  11. ^ Schmitt (Ed.), Norbert (2004). Formulaic Sequences in Action: An Introduction. In: Schmitt, Norbert (Ed.) Formulaic Sequences: Acquisition, Processing and Use. Amsterdam: Benjamins. p. 1.
  12. ^ Schmitt (Ed.), Norbert (2004). Formulaic Sequences in Action: An Introduction. In: Schmitt, Norbert (Ed.) Formulaic Sequences: Acquisition, Processing and Use. Amsterdam: Benjamins. p. 3.
  13. ^ Freed, B. (1995). Second Language Acquisition in a Study Abroad Context. Amsterdam /Philadelphia: John Benjamins Publishing Company.
  14. ^ a b c d e f g h i Moniz, H.; Mata, A. I.; Viana, M. C. (2007). "On Filled Pauses and Prolongations in European Portuguese" (PDF). Interspeech: 2645–2648.
  15. ^ a b c d e f Riggenbach, H. (1991). Towards an understanding of fluency: A microanalysis of nonnative speaker conversations. Discourse Processes, 14: 423–41.
  16. ^ a b c Bailey, Karl G. D.; Ferreira, Fernanda (2007), "The processing of filled pause disfluencies in the visual world", Eye Movements a Window on Mind and Brain: 487–502, doi:10.1016/B978-008044980-7/50024-0, ISBN 9780080449807
  17. ^ a b c Fischer, K.; Brandt-Pook, H. (1998), Automatic Disambiguation of Discourse Particles (PDF), pp. 107–113
  18. ^ Schachter, S.; F. Rauscher; N. Christenfeld; K. Tyson Crone (1994). "The vocabularies of academia" (PDF). Psychological Science. 5: 37–41. doi:10.1111/j.1467-9280.1994.tb00611.x. S2CID 30543076.
  19. ^ Brennan, S. E.; Williams, M. (1995), "The feeling of another's knowing: Prosody and filled pauses as cues to listeners about the metacognitive states of speakers" (PDF), Journal of Memory and Language, 34 (3): 383–398, doi:10.1006/jmla.1995.1017
  20. ^ a b c d e Corley, M.; Stewart, O. W. (2008). "Hesitation Disfluencies in Spontaneous Speech: The Meaning of um" (PDF). Language and Linguistics Compass. 2 (4): 589–602. doi:10.1111/j.1749-818X.2008.00068.x. hdl:20.500.11820/0e5f2f2f-7383-42c5-a7ba-63f2587ad877.
  21. ^ a b Schnadt, M. J., and M. Corley. submitted. Buying time in spontaneous speech: How speakers accommodate lexical difficulty.
  22. ^ a b Clark, H. H.; Wasow, T. (1998). "Repeating words in spontaneous speech" (PDF). Cognitive Psychology. 37 (3): 201–242. CiteSeerX doi:10.1006/cogp.1998.0693. PMID 9892548. S2CID 6037669. Archived from the original (PDF) on 2013-05-10. Retrieved 2012-03-12.
  23. ^ "Blackwell Reference Online - Formulaic Sequences and Language Disorders".[permanent dead link]
  24. ^ a b Kuniper, K. (2000). "On the Linguistic Properties of Formulaic Speech" (PDF). Oral Tradition: 279–305.
  25. ^ a b c d e f "What is Automatic Speech?".
  26. ^ a b [Dechert, HW. (1980). Pauses and intonation as indicators of verbal planning in second-language speech productions: Two examples from a case study. In Dechert, HW & Raupach, M. (Eds.), Temporal variables in speech (pp. 271-285).]
  27. ^ a b c [Lennon, P. (1984). Retelling a story in English. In HW, Dechert, D. Mehle, 8c M. Raupauch (Eds.), Second Language Productions (pp. 50-68). Turbingen: Gunter Narr Verlag.]
  28. ^ Shriberg, E. (1996), "Disfluencies in Switchboard" (PDF), Proceedings, International Conference on Spoken Language Processing, Addendum: 11–14
  29. ^ David Wood (1 September 2010). Formulaic Language and Second Language Speech Fluency: Background, Evidence and Classroom Applications. Continuum International Publishing Group. ISBN 978-1-4411-5819-2. Retrieved 23 March 2012.
  30. ^ "2 Words are difficult to define". Lexical Lab. Retrieved 2018-09-25.
  31. ^ a b c O'Shaughnessy, D. (1995), "Timing patterns in fluent and disfluent spontaneous speech", Acoustics, Speech, and Signal Processing, 1: 600–603, doi:10.1109/ICASSP.1995.479669, ISBN 978-0-7803-2431-2, S2CID 27286131
  32. ^ Blacfkmer, Elizabeth R.; Mitton, Janet L. (1991), "Theories of monitoring and the timing of repairs in spontaneous speech", Cognition, 39 (3): 173–194, doi:10.1016/0010-0277(91)90052-6, PMID 1841032, S2CID 20263258
  33. ^ a b Beattie, G. W.; Butterworth, B. L. (1979), "Contextual probability and word frequency as determinants of pauses and errors in spontaneous speech" (PDF), Language and Speech, 22 (3): 201–211, doi:10.1177/002383097902200301, S2CID 5780695
  34. ^ a b Oviatt, S. (1995), "Predicting spoken disfluencies during human-computer interaction" (PDF), Computer Speech & Language, 9: 19–35, CiteSeerX, doi:10.1006/csla.1995.0002, archived from the original (PDF) on 2006-06-14, retrieved 2012-03-12
  35. ^ Bortfeld, H.; Leon; Bloom, J. E.; Schober, M. F.; Brennan, S. E. (2001), "Disfluency rates in conversation: Effects of age, relationship, topic, role, and gender" (PDF), Language and Speech, 44 (2): 123–147, CiteSeerX, doi:10.1177/00238309010440020101, PMID 11575901, S2CID 10985337
  36. ^ Brennan, S. E.; Schober, M. F. (2001), "How listeners compensate for disfluencies in spontaneous speech" (PDF), Journal of Memory and Language, 44 (2): 274–296, doi:10.1006/jmla.2000.2753
  37. ^ Yang, Li-Chiung (2001), "Visualizing Spoken Discourse: Prosodic Form and Discourse Functions of Interruptions" (PDF), In Proceedings of the 2001 Conference on Empirical Methods in Natural Language Processing, 16: 1–10, doi:10.3115/1118078.1118106
  38. ^ Fox Tree, J. E. (2001). "Listeners' uses of um and uh in speech comprehension". Memory & Cognition. 29 (2): 320–326. doi:10.3758/bf03194926. PMID 11352215.
  39. ^ a b c Bailey, K. G. B.; Ferreira, F. (2003), "Disfluencies influence syntactic parsing" (PDF), Journal of Memory and Language, 49 (2): 183–200, doi:10.1016/s0749-596x(03)00027-5
  40. ^ a b c Clark, H. H.; Fox Tree, J. E. (2002), "Using uh and um in spontaneous speaking" (PDF), Cognition, 84 (1): 73–111, CiteSeerX, doi:10.1016/S0010-0277(02)00017-3, PMID 12062148, S2CID 37642332, archived from the original (PDF) on 2012-10-14
  41. ^ McCaffrey, Patrick. Transcortical Sensory Aphasia. The Neuroscience on the Web Series: Neuropathologies of Language and Cognition
  42. ^ a b c Britchkow, Ela. (2005). Apraxia.
  43. ^ Ogar, J.; Slama, H.; Dronkers, N.; Amici, S.; Gorno-Tempini, M. L. (2005), "Apraxia of Speech: An overview" (PDF), Neurocase, 11 (6): 427–432, doi:10.1080/13554790500263529, PMID 16393756, S2CID 8650885
  44. ^ "Velleman, Shelley L. Childhood apraxia of speech (developmental verbal dyspraxia). Retrieved on 9 March 2012". Archived from the original on 7 March 2012. Retrieved 12 March 2012.
  45. ^ a b Portelli, J., "Developmental Verbal Dyspraxia" (PDF), Association of Speech and Language Pathologists of Malta, archived from the original (PDF) on 2016-03-03, retrieved 2012-03-12

External linksEdit