Talk:Line breaking rules in East Asian languages

Linguistics Low‑importance

	Linguistics portal This article is within the scope of WikiProject Linguistics, a collaborative effort to improve the coverage of linguistics on Wikipedia. If you would like to participate, please visit the project page, where you can join the discussion and see a list of open tasks.LinguisticsWikipedia:WikiProject LinguisticsTemplate:WikiProject LinguisticsLinguistics articles
Low	This article has been rated as Low-importance on the project's importance scale.

Asia Low‑importance

	Asia portal This article is within the scope of WikiProject Asia, a collaborative effort to improve the coverage of Asia on Wikipedia. If you would like to participate, please visit the project page, where you can join the discussion and see a list of open tasks.AsiaWikipedia:WikiProject AsiaTemplate:WikiProject AsiaAsia articles
Low	This article has been rated as Low-importance on the project's importance scale.

Typography Low‑importance

	This article is within the scope of WikiProject Typography, a collaborative effort to improve the coverage of articles related to Typography on Wikipedia. If you would like to participate, please visit the project page, where you can join the discussion and see a list of open tasks.TypographyWikipedia:WikiProject TypographyTemplate:WikiProject TypographyTypography articles
Low	This article has been rated as Low-importance on the importance scale.

East Asia (defunct)

This article is within the scope of WikiProject East Asia, a project which is currently considered to be defunct.East AsiaWikipedia:WikiProject East AsiaTemplate:WikiProject East AsiaEast Asia articles

Wrong Chinese and Korean special characters

Latest comment: 13 years ago1 comment1 person in discussion

Hi,

The special characters listed for Chinese and Korean look wrong. For example, both "break before" and "break after" characters for Simplified Chinese contain "〈" (= U+3008).

I suspect that this is caused by the fact that the original OOXML PDF: "Office Open XML Part 4 - Markup Language Reference.pdf" has wrong annotations for them. Actually I can't cut&paste the same text from the PDF.

We could fix this by converting the PDF doc to HTML with pdftohtml, and then copy the text from the generated HTML. For example, "break before" characters for Simplified Chinese should be:

"!%),.:;>?]}¢¨°·ˇˉ―‖’”…‰′″›℃∶、。〃〉》」』】〕〗〞︶︺︾﹀﹄﹚﹜﹞！＂％＇），．：；？］｀｜｝～￠"

Regards, DaikiUeno (talk) 06:26, 13 December 2010 (UTC)Reply

Chinese word (digraph) breaking?

Latest comment: 3 years ago1 comment1 person in discussion

Is there a rule in Chinese about not breaking up words that consist of two different characters? Jimw338 (talk) 17:21, 22 October 2021 (UTC)Reply

Add topic