Open main menu

Henry Kučera (15 February 1925 – 20 February 2010), born Jindřich Kučera (Czech pronunciation: [ˈjɪndr̝ɪx ˈkutʃɛra]), was a Czech-American linguist who pioneered corpus linguistics, linguistic software, was a major contributor to the American Heritage Dictionary, and a pioneer in the development of spell checking computer software.

Early life and educationEdit

Kučera was born in the former Czechoslovakia in the village of Třebařov (between Pardubice and Olomouc). When the Communists came to power in February 1948, his studies in philosophy and linguistics at Charles University in the Czech capital of Prague were interrupted. He was forced to leave Czechoslovakia in April 1948 when it became clear that his political writings had placed him at risk of detention by the Communist authorities.

Kučera then moved to Allied-occupied Germany where he worked under the supervision of the U.S. CIC (Counterintelligence Corps) for refugee organizations, assisting Czech refugees with relocation programs and preparing passports. It was during this time that he met Pavel Tigrid, who went on to become Kučera's mentor and longtime friend. Following the Velvet Revolution in 1989, Tigrid went on to become the Minister of Culture for Czechoslovakia.

Teaching careerEdit

In 1949, he traveled to New York City aboard the USS General C. H. Muir. After receiving his PhD from Harvard University, he taught at the University of Florida at Gainesville for two years. Kučera then returned to Harvard as a research fellow. In 1955 he received an appointment at Brown University, where he was promoted to full professor in 1965. He spent the rest of his teaching career there. He retired in 1990 as the Fred M. Seed Professor Emeritus of Linguistics and Cognitive Sciences with, among other honors, a black tie retirement party and the publication of a Festschrift about his accomplishments.[1]


While at Brown, he was able to further pursue his interest in linguistics. There he became interested in the computational analysis of human language, though at the time there were scarcely any tools for this type of research.

In 1963 and 1964, Kučera collaborated with W. Nelson Francis to create the Brown Corpus of Standard American English, generally known as the Brown Corpus. This was a carefully compiled selection of current American English as published during the year 1961 in 1,000 sources on a wide variety of subjects. It has been very widely used in computational linguistics, and was for many years among the most-cited resources in the field. Kučera and Francis themselves subjected it to a variety of computational analyses from which derived their classic work Computational Analysis of Present-Day American English (1967), followed by Francis and Kučera's Frequency Analysis of English Usage: Lexicon and Grammar (1982).[2]

Shortly thereafter, Boston publisher Houghton-Mifflin approached Kučera to supply a million word, three-line citation base for its new American Heritage Dictionary. This ground-breaking new dictionary, which first appeared in 1969, was the first dictionary to be compiled using corpus linguistics for word frequency and other information.

Kučera wrote one of the first spell checkers over Christmas break, 1981, in PL/I for VAX machines, at the behest of Digital Equipment Corporation. It was a simple, rapid spelling verifier. Further development resulted in "International Correct Spell", a spell checking program which was used on word processing systems such as WordStar and Microsoft Word as well as numerous small computer applications. Kučera later oversaw the development of Houghton-Mifflin’s Correct Text grammar checker, which also drew heavily on statistical techniques for analysis. He founded Language Systems Incorporated (LSI), later Language Systems Software Incorporated (LSSI), to manage his software programs and updates until the patents expired in 2002.


In addition to his PhD in Linguistics from Harvard University, Kučera had a doctorate from Charles University in Prague which was restored after the Velvet Revolution ousted the communist rule in Czechoslovakia in 1989. He received honorary degrees from Pembroke College in Providence, RI, Bucknell University in Lewisburg, PA, and Masaryk University in Brno, Czech Republic (1990).[3][4] Kučera was a member of the academic honor society Phi Beta Kappa.


  1. ^ Harvey, Charlotte Bruce. "Teaching Computers to Spell". Brown Alumni Magazine. Providence, Rhode Island: Brown University (May/June 2010). ISSN 1520-863X. Retrieved 9 May 2017.
  2. ^ Henry Kucera at DBLP Bibliography Server  
  3. ^ "Henry Kucera". Tributes. February 2010. Retrieved 9 May 2017.
  4. ^ "Award information : Henry Kučera". Masaryk University. Retrieved 10 May 2017. A proposal to grant an honorary degree of Doctor honoris causa in the field of philological sciences was submitted by Masaryk University. The graduation ceremony was held on 6 June 1990.

Selected PublicationsEdit

  • H Kučera. 1961. The phonology of Czech.
  • H Kučera and WN Francis. 1967. Computational Analysis of Present-Day American English.
  • 1968. A Comparative Quantitative Phonology of Russian, Czech, and German
  • WN Francis and H Kučera. 1983. Frequency Analysis of English Usage: Lexicon and Grammar (with W. Nelson Francis)
  • WN Francis and H Kučera. 1989.Manual of information to accompany a standard corpus of present-day edited American English, for use with digital computers.
  • Mackie, Andrew, Tatyana K. McAuley, Cynthia Simmons, eds. 1962. For Henry Kučera. Ann Arbor, MI: Michigan Slavic Publications.

External linksEdit