Semantic analysis (linguistics)

In linguistics, semantic analysis is the process of relating syntactic structures, from the levels of words, phrases, clauses, sentences and paragraphs to the level of the writing as a whole, to their language-independent meanings. It also involves removing features specific to particular linguistic and cultural contexts, to the extent that such a project is possible. The elements of idiom and figurative speech, being cultural, are often also converted into relatively invariant meanings in semantic analysis. Semantics, although related to pragmatics, is distinct in that the former deals with word or sentence choice in any given context, while pragmatics considers the unique or particular meaning derived from context or tone. To reiterate in different terms, semantics is about universally coded meaning, and pragmatics, the meaning encoded in words that is then interpreted by an audience.[1]

Semantic analysis can begin with the relationship between individual words. This requires an understanding of lexical hierarchy, including hyponymy and hypernymy, meronomy, polysemy, synonyms, antonyms, and homonyms.[2] It also relates to concepts like connotation (semiotics) and collocation, which is the particular combination of words that can be or frequently are surrounding a single word. This can include idioms, metaphor, and simile, like, "white as a ghost."

With the availability of enough material to analyze, semantic analysis can be used to catalog and trace the style of writing of specific authors.[3]

See also edit

References edit

  1. ^ Goddard, Cliff (2013). Semantic Analysis: An Introduction (2nd ed.). New York: Oxford University Press. p. 17.
  2. ^ Manning, Christopher; Scheutze, Hinrich (1999). Foundations of Statistical Natural Language Processing. Cambridge: MIT Press. p. 110. ISBN 9780262133609.
  3. ^ Miranda-Garcıa, Antonio; Calle-Martın, Javier (May 2012). "The Authorship of the Disputed Federalist Papers with an Annotated Corpus". English Studies. 93 (3): 371–390. doi:10.1080/0013838x.2012.668795. S2CID 162248379.