List of Wikipedias
Wikipedia is a multilingual online encyclopedia edited and maintained by a community of volunteer editors. As of October 2020 Wikipedia articles have been created in 314 languages, with 304 active and 10 closed.
Wikipedia edition codesEdit
Each Wikipedia has a code, which is used as a subdomain below wikipedia.org. Interlanguage links are sorted by that code. The codes represent the language codes defined by ISO 639-1 and ISO 639-3, and the decision of which language code to use is usually determined by the IETF language tag policy. Wikipedias also vary by how thinly they slice dialects and variants; for example, the English Wikipedia includes most modern varieties of English (American English, British English, Indian English, South African English, etc.), but does not include other related languages such as Scots or Old English/Anglo-Saxon, both of which have separate Wikipedias. The Spanish Wikipedia includes both Peninsular Castilian and Latin American Spanish; Malay Wikipedia includes a large number of Malay languages; and so on.
Differences between the ISO mappings and Wikipedia codes include:
|WP code||WP edition||ISO 639||Notes|
|sq||Albanian||multiple||'sq' is the ISO code for the Albanian macrolanguage, which includes four individual languages.|
|als||Alemannic||gsw||'als' is the ISO code for Tosk Albanian.|
Alemannic Wikipedia covers several dialects: 'gsw' for Swiss German, Alemannic German, and Alsatian; 'gct' for Colonia Tovar dialect; 'swg' for Swabian German, and 'wae' for Walser German
|roa-rup||Aromanian||rup||'roa' is the ISO code for Romance (Other).|
|map-bms||Banyumasan||none||'map' is the ISO code for Austronesian (Other).|
|nds-nl||Dutch Low Saxon||multiple||In ISO, nds is 'Low Saxon', restricted to Germany in Ethnologue.|
The various subdivisions of Dutch Low Saxon have separate ISO codes.
|bh||Bihari||bih||ISO code 'bih' is a macrolanguage which includes Bhojpuri (bho), Maithili (mai), Magahi (mag), and nine others.|
Bihari Wikipedia excludes Maithili (mai) and Fiji Hindi (hif) which exist as independent Wikipedias.
|zh-yue||Cantonese||yue||zh is the ISO 639-1 code for Chinese in general.|
|zh-classical||Classical Chinese||lzh||As above.|
|ms||Malay||ms||ISO collective code 'ms' is a macrolanguage that includes more than 30 individual languages and dialects. However, the wiki excludes Indonesian because Indonesian Wikipedia (id) exists independently.|
|zh-min-nan||Southern Min/Min Nan||nan||Same as the "zh" languages. Not written in Chinese characters, but uses Pe̍h-ōe-jī or a derived romanization. hak does the same.|
|no||Norwegian (Bokmål)||nb, nob||ISO uses no for Norwegian in general. (Norwegian Nynorsk is at 'nn' in both ISO and Wikipedia.)|
|ksh||Ripuarian||none||ISO ksh is for the Colognian dialect, the most prominent dialect of the Ripuarian language group. The other variants (e.g. the Aachen dialect) do not have ISO codes.|
|bat-smg||Samogitian||sgs||'bat' is the ISO code for Baltic (Other).|
|roa-tara||Tarantino||none||'roa' is the ISO code for Romance (Other). Neapolitan in general is nap.|
|fiu-vro||Võro||vro||fiu is the ISO code for Finno-Ugric languages.|
|cbk-zam||Zamboanga Chavacano||none||ISO cbk is for the Chavacano language. The individual variants do not have ISO codes.|
Additionally, Wikipedias vary in orthography at times. Chinese Wikipedia automatically translates from modern Mandarin Chinese into five standard forms: Mainland China and Singapore in simplified Chinese characters, and Taiwan, Hong Kong, and Macau in traditional Chinese characters. Belarusian, however, has a separate Wikipedia for the 'normative' orthography (be) and Taraškievica (be-tarask).
The table below lists the language editions of Wikipedia roughly sorted by the number of active users (registered users who have made at least one edit in the last thirty days); in particular, the "power of ten" of the count of active users (i.e., the common logarithm rounded down to a whole number) is used: so "5" means at least 105 (or 100,000), "4" means at least 104 (10,000), and so on. Script names are listed as their ISO codes.
- The "Total" column refers to the number of pages in all namespaces, including both articles (the official article count of each wiki) and non-articles (user pages, images, talk pages, "project" pages, categories, re-directs, and templates).
- "Active users" are registered users who have made at least one edit in the last thirty days.
- "Images" is the number of locally uploaded files. Note that some large Wikipedias don't use local images and rely on Commons completely, so the value 0 is not an error.
- The "Depth" column (Edits/Articles × Non-Articles/Articles × [1−Stub-ratio]) is a rough indicator of a Wikipedia's quality, showing how frequently its articles are updated. It does not refer to academic quality.
- Lsjbot, a bot run by Sverker Johansson, is responsible for much of the growth of the second and third largest Wikipedias, the Cebuano and the Swedish Wikipedias, respectively, as well as the rapid growth of the Waray Wikipedia.
- The statistics are derived from API:Siteinfo and updated at Commons:Data:Wikipedia statistics/data.tab every six hours and displayed with
- Indo-European languages — 128
- Altaic languages — 22
- Uralic languages — 14
- Dravidian languages — 5
- Ibero-Caucasian languages — 10
- Sino-Tibetan languages — 12
- Afroasiatic languages — 11
- Austronesian languages — 29
- Austroasiatic languages — 4
- Kra–Dai languages — 4
- Indigenous languages of the Americas — 12
- Niger–Congo languages — 28
- Nilo-Saharan languages — 1
- Language isolate — 1
- Pidgins and creole languages — 10
- Constructed languages — 8
- List of Wikipedias – Meta
- "Wikipedia Statistics - Bot article creations only". stats.wikimedia.org. Retrieved 3 July 2019.
- "ISO 639 code tables". www.sil.org. Retrieved 9 March 2018.
- "Browse by Language Family". Ethnologue. Retrieved 9 March 2018.
|Meta has related information at: List of Wikipedias|