Global Language Diversity on Wikipedia

A table showing the relationship between language speaker populations and Wikipedia article counts.
Author

Carwil Bjork-James

Published

February 23, 2026

Languages and Wikipedia Articles

The largest Wikipedia editions by number of articles, 2024 (CC-0 Arief Azazie Zain)

Top 250 Languages by Number of Speakers

Speaker estimates are L1 (first-language) counts from LinguaMeta, supplemented for a handful of languages where LinguaMeta lacks an estimate (sourced from Wikipedia). Background shading: blue = speaker population; green/gold = Wikipedia article count.

Language Language Description Speakers (L1 est.) Wikipedia Articles
1. Mandarin Chinese
普通话
major branch of Chinese spoken across most of northern and southwestern China 1288700000 0
2. Chinese
中文
NA 1288700000 1,528,688
3. English
English
West Germanic language 1267445366 7,155,715
4. Hindi
हिन्दी
Indo-Aryan language spoken chiefly in North India 544000000 168,468
5. Spanish
español
Romanic language originating in the Iberian Peninsula 468360000 2,102,562
6. Arabic
العربية
NA 346277000 1,304,590
7. Urdu
اردو
Indo-Aryan language spoken in South Asia 288000000 241,743
8. Bengali
বাংলা
Indo-Aryan language mostly spoken in Bangladesh and India 266000000 185,806
9. Portuguese
português
Romance language 225751001 1,167,497
10. French
français
Romance language 203194940 2,746,574
11. Punjabi
پنجابی
Indo-Aryan language spoken in the Punjab region of Pakistan and India 200000000 59,471
12. Indonesian
Bahasa Indonesia
official language of Indonesia 171000000 766,802
13. Swahili
Kiswahili
NA 171000000 108,115
14. Russian
русский
East Slavic language 170200000 2,091,844
15. Japanese
日本語
language spoken in East Asia 119000000 1,494,908
16. Western Panjabi
شاہ مُکھی
spurious identity for purported distinct language of Pakistani Punjab maintained by SIL International, as documented in ISO 639-3 standard and Ethnologue 113000000 75,288
17. Telugu
తెలుగు
Dravidian language native to South India 95000000 121,704
18. Lahnda
NA 93000000 0
19. Marathi
मराठी
Indo-Aryan language 93000000 101,651
20. German
Deutsch
West Germanic language spoken mainly in Central Europe 91739000 3,106,865
21. Javanese
Basa Jawa
Austronesian language 91000000 75,121
22. Vietnamese
Tiếng Việt
Austroasiatic language originating in Vietnam 85000000 1,299,316
23. Wu Chinese
吳語
Sinitic language 84000000 48,020
24. Persian
فارسی
NA 82000000 1,070,008
25. Caribbean Javanese
NA 82000000 0
26. Tamil
தமிழ்
Dravidian language native to South India and Sri Lanka 81530000 181,903
27. Yue Chinese
廣東話
primary branch of Chinese spoken in southern China 79000000 149,667
28. Egyptian Arabic
اللغه المصريه الحديثه
Arabic dialect spoken in Egypt 78000000 1,630,970
29. Turkish
Türkçe
Oghuz Turkic language of the Turkish people 76290000 673,350
30. Korean
한국어
language spoken in Korean Peninsula and some part of North-eastern China 75000000 741,201
31. Filipino
Filipino
national and official language of the Philippines 66000000 0
32. Italian
italiano
Romance language 65250820 1,961,929
33. Jinyu Chinese
晋语
branch of Chinese spoken in parts of northern China 63000000 0
34. Gujarati
ગુજરાતી
Indo-Aryan language that is spoken on the state of Gujarat 60000000 30,836
35. Thai
ภาษาไทย
language spoken by Thai people 55000000 180,846
36. Pashto
پښتو
NA 53000000 21,180
37. Kannada
ಕನ್ನಡ
Dravidian language 49000000 34,873
38. Nigerian Pidgin
Naija
English-based creole languages 45000000 1,543
39. Malayalam
മലയാളം
Dravidian language of India 42000000 87,618
40. Min Nan Chinese
Min Nan
branch of the Min Chinese language 42000000 433,979
41. Oromo
Afaan Oromoo
NA 42000000 1,968
42. Odia (Oriya)
ଓଡ଼ିଆ
NA 42000000 20,741
43. Xiang Chinese
湘語
Chinese language spoken mainly in Hunan province 40000000 0
44. Sindhi
سنڌي
Indo-Aryan language spoken in Pakistan and India 38400000 21,230
45. Sudanese Arabic
لهجة سودانية
NA 37000000 0
46. Fulah
Fulfulde
language of West Africa of the Senegambian branch of the Niger–Congo language family 37000000 15,915
47. Polish
polski
West Slavic language 37000000 1,689,025
48. Amharic
አማርኛ
Semitic language of Ethiopia 36000000 15,617
49. Algerian Arabic
الدارجة الجزائرية
Maghrebi dialect of the Arabic language spoken in Algeria 36000000 0
50. Myanmar (Burmese)
မြန်မာ
Sino-Tibetan language of Myanmar 36000000 110,368
51. Odia
NA 35000000 0
52. Malay
Bahasa Melayu
NA 34400000 437,427
53. Bhojpuri
भोजपुरी
Indo-Aryan language native to India and Nepal 33470000 0
54. Tagalog
Filipino
NA 33000000 48,838
55. Hakka Chinese
客家話
primary branch of Chinese originating in Southern China 32000000 10,402
56. Sundanese
Basa Sunda
NA 32000000 62,428
57. Moroccan Arabic
الداريجة المغريبية
Maghrebi dialect of the Arabic language spoken in Morocco 31000000 13,558
58. Azerbaijani
Azərbaycanca
NA 29230000 212,588
59. Ukrainian
українська
East Slavic language 29000000 1,413,312
60. Hausa
Hausa
Chadic language spoken by the Hausa people 28250000 94,304
61. Igbo
Igbo
Niger–Congo language of the Igbo people, mainly spoken in Nigeria 28000000 45,660
62. Saraiki
سرائیکی
Indo-Aryan language spoken in Pakistan 28000000 24,462
63. Northern Uzbek
NA 28000000 0
64. Yoruba
Èdè Yorùbá
Niger-Congo language spoken in West Africa 28000000 36,882
65. Cebuano
Cebuano
Austronesian language spoken in the Philippines by Cebuanos and other ethnic groups 26000000 6,115,165
66. Uzbek
o‘zbek
Turkic language 26000000 335,612
67. Saidi Arabic
صعيدى
variety of Arabic spoken by the Ṣa‘īdi people south of Cairo, Egypt to the border of Sudan 25000000 0
68. Awadhi
अवधी
Indo-Aryan language spoken in Northern India 25000000 2,625
69. Antankarana Malagasy
NA 25000000 0
70. Dutch
Nederlands
West Germanic language 24085200 2,215,081
71. South Azerbaijani
تۆرکجه
Turkic language 24000000 244,608
72. North Azerbaijani
NA 24000000 0
73. Gan Chinese
贛語
member of the Sinitic languages of the Sino-Tibetan language family spoken as the native language by the people in the Jiangxi province of China 24000000 6,817
74. Malagasy
Malagasy
NA 24000000 102,345
75. Marwari
मारवाड़ी
NA 22000000 0
76. Romanian
română
Eastern Romance language, official of Romania 21100000 541,706
77. Nepali
नेपाली
Northern Indo-Aryan language family 20400000 29,598
78. Maithili
मैथिली
Indo-Aryan language spoken in India and Nepal 19300000 14,309
79. Rajasthani
राजस्थानी
NA 19300000 0
80. Serbo-Croatian
srpskohrvatski jezik
South Slavic language 19000000 461,476
81. Mesopotamian Arabic
لهجة بلاد ما بين النهرين
continuum of mutually-intelligible varieties of Arabic 17000000 0
82. Assamese
অসমীয়া
Indo-Aryan language spoken in Assam, India 17000000 23,023
83. Madurese
Bhasa Madhura
language spoken in Indonesia 17000000 5,206
84. Northeastern Thai
ภาษาอีสาน
Thai dialects of the Lao language 17000000 0
85. Rangpuri
অংপুরি
Indo-Aryan language spoken in India, Bangladesh, and Nepal 16700000 0
86. Haryanvi
हरियाणवी
Western Hindi language closely related to Hindi widely spoken in the North Indian state of Haryana and in Delhi 16000000 0
87. Magahi
मगही
Indo-Aryan language spoken in India 16000000 0
88. Nepali
नेपाली
official language of Nepal 16000000 0
89. Sinhala
සිංහල
Indo-Aryan language native to Sri Lanka 16000000 25,069
90. Zhuang
Vahcuengh
any of various Tai languages used by the Zhuang people 16000000 3,008
91. Chhattisgarhi
छत्तीसगढ़ी
official language in the Indian state of Chhattisgarh 15000000 0
92. Khmer
ខ្មែរ
Austroasiatic language of Cambodia 15000000 11,963
93. Southern Pashto
Pashto dialect 14920000 0
94. Nigerian Fulfulde
language of Nigeria 14000000 0
95. Zulu
isiZulu
Nguni language of eastern South Africa and neighbouring countries 14000000 12,166
96. Kazakh
қазақ
Kipchak Turkic language of Central Asia 13200000 242,807
97. Sanaani Arabic
cluster of varieties of Arabic spoken in Yemen and southwestern Saudi Arabia 13000000 0
98. Deccan
Deccani
variety of Hindustani spoken in the Deccan region of India; native language of the Deccani people 13000000 0
99. Chichewa
Chichewa
language of the Bantu language family 13000000 1,102
100. Swedish
svenska
North Germanic language spoken in Sweden and Finland 12226000 2,622,133
101. Greek
Ελληνικά
dialects and varieties of the Greek language spoken in the modern era 12200000 266,986
102. Ta'Izzi-Adeni Arabic
لهجة تعزية-عدنية
dialect of Arabic spoken primarily in the Republic of Yemen and the Republic of Djibouti 12000000 0
103. Iranian Persian
dialect of the Persian language 12000000 0
104. Shona
Chishona
Bantu language of Zimbabwe and Mozambique 12000000 11,539
105. Hungarian
magyar
Uralic language 11800000 567,269
106. Kurmanji Kurdish
Kurdî
Western Iranian language 11500000 91,100
107. Low German
Plattdüütsch
West Germanic language spoken mainly in northern Germany and the eastern part of the Netherlands 11500000 85,815
108. Sorani Kurdish
سۆرانی
Kurdish language spoken in Iraq and Iran 11100000 80,390
109. Hijazi Arabic
حجازي
variety of Arabic spoken in the Hejaz region of Saudi Arabia 11000000 0
110. Twi
Twi
language of Akan lands in Ghana 11000000 0
111. Wolof
Wolof làkk
language of Senegal, the Gambia, and Mauritania 11000000 1,747
112. Norwegian Bokmål
bokmål
one of two official written standards for the Norwegian language 10501500 0
113. Tigrinya
ትግርኛ
Semitic language spoken in Ethiopia and Eritrea 10100000 358
114. North Mesopotamian Arabic
لهجة موصلية
Arabic dialect 10000000 0
115. Czech
čeština
West Slavic language 10000000 588,604
116. Ilocano
Ilokano
Austronesian language spoken by the Ilocano people of the Philippines 10000000 15,466
117. Nande
Yira
NA 10000000 0
118. Xhosa
isiXhosa
Nguni language of southern South Africa 10000000 2,412
119. Luba-Lulua
Ciluba
Bantu language spoken in DR Congo 9800000 0
120. Kinyarwanda
Ikinyarwanda
Bantu language spoken in Central Africa, official in Rwanda 9800000 9,570
121. Dhundari
ढूंढारी
NA 9600000 0
122. Kanuri
Kànùrí
NA 9600000 0
123. Dari
dialect of the Persian language spoken in Afghanistan 9600000 0
124. Belarusian
беларуская
East Slavic language 9500000 262,275
125. Min Dong Chinese
Mìng-dĕ̤ng-ngṳ̄
branch of the Min group of Sinitic languages of China 9500000 16,705
126. Umbundu
Umbundu
NA 9400000 0
127. Hiligaynon
Ilonggo
Hiligaynon language spoken in the Western Visayas region of the Philippines 9200000 0
128. Somali
Soomaali
Afroasiatic language belonging to the Cushitic branch 9200000 10,433
129. Kikuyu
Gĩkũyũ
Bantu language in Kenya 9100000 2,250
130. Congo Swahili
NA 9100000 0
131. Bambara
Bámánánkán
western African language spoken in Mali, with SVO structure and two lexical tones 9000000 864
132. Haitian Creole
Kreyòl Ayisyen
language spoken in Haiti 9000000 71,766
133. Twi
twi
NA 9000000 4,655
134. Tajik
тоҷикӣ
language spoken in Tajikistan 8900000 117,002
135. Hebrew
עִבְרִית
standard form of the revived Hebrew language spoken today mainly in Israel 8700000 392,894
136. Catalan
català
Western Romance language 8539000 790,651
137. Quechua
Runa Simi
NA 8500000 24,471
138. Bavarian
Boarisch
major group of Upper German varieties spoken in the southeast of the German language area Bavaria 8400000 27,222
139. Sichuan Yi
Nuosuhxop
the prestige language of the Yi people 8400000 3
140. Mossi
Mòoré
one of two official regional languages of Burkina Faso 8300000 1,315
141. Kimbundu
Kimbundu
Bantu language 8100000 0
142. Sylheti
সিলেঢী
Indo-Aryan language spoken in Sylhet 8100000 1,232
143. Kongo
Kikongo
NA 8000000 1,945
144. Minangkabau
Baso Minangkabau
Minangkabauic language spoken predominantly by the Minangkabau ethnic group 8000000 229,117
145. Serbian
српски
standardized variety of Serbo-Croatian language used by Serbs 7890000 717,549
146. Standard Moroccan Tamazight
ⵜⴰⵎⴰⵣⵉⵖⵜ
standardised form of Berber language of Morocco (including all regional accents), written in Tifinagh alphabet 7800000 12,189
147. Hmong
𖬌𖬣𖬵
NA 7700000 0
148. Uyghur
ئۇيغۇر تىلى
Turkic language spoken by the Uyghur people 7700000 9,705
149. Rundi
Ikirundi
Bantu language 7500000 703
150. Albanian
Shqip
Indo-European language, spoken in Albania, Kosovo, North Macedonia and Montenegro as well as Italy, Croatia, Romania and Sebia 7500000 105,186
151. Kanauji
कन्नौजी
NA 7400000 0
152. Afrikaans
Afrikaans
West Germanic language, spoken in South Africa and Namibia 7300000 128,415
153. Santali
সাঁওতালী
Kherwari language of the Austro-Asiatic family spoken in India, Bangladesh, Bhutan and Nepal 7300000 14,716
154. Eastern Maninkakan
NA 7100000 0
155. Bulgarian
български
South Slavic language 7000000 308,782
156. Varhadi-Nagpuri
वऱ्हाडी
dialect of Marathi spoken in Vidarbha region of Maharashtra and neighboring regions 7000000 0
157. Northern Thai
ᨣᩴᩤᨾᩮᩬᩥᨦ
NA 6600000 0
158. Mongolian
Монгол
official language of Mongolia 6500000 27,147
159. Central Pashto
NA 6500000 0
160. Sesotho
Sesotho
Southern Bantu language 6400000 1,674
161. Krio
Krio
English-based creole language spoken in Sierra Leone 6300000 0
162. Swiss German
Schwiizertüütsch
group of dialects of the Upper German branch of the Germanic language family 6120000 0
163. Balochi
Balòci
NA 6100000 0
164. Mewati
मेवाती
Indo-Aryan language of India 6100000 0
165. Tswana
Setswana
Bantu language of Botswana and South Africa 6000000 3,926
166. Luyia
Oluluhya
NA 5900000 0
167. Guarani
avañeʼẽ
NA 5852000 6,021
168. Libyan Arabic
ليبي
dialect of the language as spoken in the North African country 5600000 0
169. Betawi
Betawi
language spoken in Indonesia 5600000 3,197
170. Luganda
Oluganda
Bantu language of Uganda 5600000 4,590
171. Danish
dansk
North Germanic language 5510600 313,369
172. Norwegian
norsk
North Germanic language spoken in Norway 5500000 680,695
173. Southern Thai
ภาษาปักษ์ใต้
NA 5500000 0
174. Bemba
Chibemba
Bantu language spoken primarily in north-eastern Zambia by the Bemba people 5400000 0
175. Kashmiri
كٲشُر
language from the Dardic subgroup of the Indo-Aryan languages 5400000 9,671
176. Kituba
Kikongo-Kituba
language of the Democratic Republic of Congo 5400000 0
177. Malvi
मालवी
NA 5400000 0
178. Northeastern Dinka
language of South Sudan 5320000 0
179. Sepedi
Sepedi
NA 5300000 8,915
180. Finnish
suomi
Finno-Ugric language mostly spoken in Finland 5200000 614,978
181. Halh Mongolian
NA 5200000 0
182. Luo
Dholuo
NA 5200000 0
183. Tok Pisin
Tok Pisin
English creole spoken in Papua New Guinea 5200000 1,415
184. Hadrami Arabic
variety of the Arabic language 5100000 0
185. Lao
ລາວ
Kra–Dai language of Southeast Asia 5100000 5,451
186. Sukuma
Kisukuma
NA 5100000 0
187. Ghanaian Pidgin English
Ghanaian Pidgin
creole language 5000000 4,805
188. Koongo
NA 5000000 0
189. Sicilian
Sicilianu
Italo-Dalmatian language spoken in Southern Italy 5000000 26,290
190. Konkani
कोंकणी
Indo-Aryan language spoken in India 4900000 0
191. Slovak
slovenčina
West Slavic language spoken in Slovakia 4900000 258,929
192. Balinese
Basa Bali
Malayo-Polynesian language spoken on the island of Bali 4800000 36,303
193. Paraguayan Guaraní
NA 4800000 0
194. Mainfränkisch
Ostfränkisch
Upper German dialect family 4800000 0
195. Croatian
hrvatski
standardized variety of Serbo-Croatian language, used by Croats 4660000 229,760
196. Huizhou Chinese
徽州話
Sinitic language 4600000 0
197. Eastern Oromo
Oromoo
NA 4500000 0
198. Buginese
ᨅᨔ ᨕᨘᨁᨗ
South Sulawesi language predominantly spoken by the Bugis ethnic group 4300000 15,955
199. Dinka
Thuɔŋjäŋ
Nilotic dialect cluster spoken by the Dinka people, the major ethnic group of South Sudan 4200000 323
200. Konkani
कोंकणी
NA 4200000 3,642
201. Mazanderani
مازندرانی
Northwestern Iranian language spoken mainly in Iran's Mazandaran, Gilan and Golestan provinces 4200000 64,575
202. Tichurong
Sino-Tibetan language 4200000 0
203. Southern Uzbek
اوزبیک تورکچه
Variant of the Uzbek language spoken in modern day Afghanistan 4200000 0
204. Gheg Albanian
one of the two major varieties of Albanian 4100000 0
205. Bukit Malay
NA 4100000 0
206. Kamba
Kĩkamba
Bantu language spoken in Kenya 4100000 0
207. Kalenjin
Kalenjin
Southern Nilotic language family 4100000 0
208. Banjar
Banjar
NA 4000000 11,801
209. Northern Hindko
Hindko
Hindko dialect associated with the region around and north of Abottabad 4000000 0
210. Zarma
Zarma Sanni
Songhay language of southwestern Niger 3900000 0
211. Borana-Arsi-Guji Oromo
NA 3900000 0
212. Gilaki
گیلکی
Western Iranian language 3900000 48,304
213. Turkmen
türkmençe
Oghuz Turkic language of Central Asia 3900000 7,068
214. Makhuwa
Emakhuwa
NA 3900000 0
215. Merwari
मेरवारी
NA 3900000 0
216. Southern Balochi
بلوچی‎
group of Balochi dialects 3800000 0
217. Bosnian
bosanski
standardized variety of Serbo-Croatian 3800000 97,141
218. Sidamo
Sidaamu Afoo
NA 3800000 0
219. Achinese
Acèh
Northern Sumatran language 3700000 13,024
220. Chuanqiandian Cluster Miao
West Hmongic dialect continuum 3700000 0
221. Pulaar
language spoken by Fula people and Tukolor 3700000 0
222. Shekhawati
शेखावाटी
NA 3700000 0
223. Garhwali
गढ़वळि
Central Pahari language belonging to the Northern Zone of Indo-Aryan languages 3600000 0
224. Lambadi
लम्बाडी
language of India 3600000 0
225. Lombard
Lumbaart
Gallo-Italic language spoken in the Italian region of Lombardy 3600000 80,041
226. Shan
ၵႂၢမ်းတႆးယႂ်
native language of the Shan people and is mostly spoken in Shan State, Burma 3600000 14,660
227. Bangala
Ngala
Bantu language 3500000 0
228. Galician
Galego
Galician-Portuguese language 3500000 230,350
229. Central Atlas Tamazight
Tamaziɣt
Berber language of the Afro-Asiatic language family 3500000 0
230. Lingala
Lingála
Bantu language spoken in western Central Africa 3420000 5,187
231. Georgian
ქართული
official language of Georgia 3400000 192,392
232. Kabyle
القبايل
Berber language spoken by the Kabyle people 3400000 7,072
233. Pattani Malay
بهاس جاوي
dialect of Malay spoken in Pattani, Kelantan, and Terengganu 3400000 0
234. Peripheral Mongolian
NA 3400000 0
235. Hmong Daw
us Hmoob
NA 3400000 0
236. Tiv
Tiv
language of Nigeria 3400000 0
237. Bikol
Bikol
NA 3300000 0
238. Sankaran Maninka
NA 3300000 0
239. Omani Arabic
اللهجة العمانية
variety of Peninsular Arabic spoken in Oman 3200000 0
240. Bundeli
बुन्देली
Indo-Aryan language spoken in India 3200000 0
241. Ewe
Eʋegbe
Niger–Congo language spoken in southeastern Ghana and southern Togo 3200000 1,284
242. Fon
Fɔ̀ngbè
part of the Gbe language cluster and belongs to the Volta–Niger branch of the Niger–Congo languages 3200000 3,512
243. Gondi
गोंडी
NA 3200000 0
244. Central Kanuri
Saharan language 3200000 1,633
245. Waray
Waray
Visayan language primarily spoken in the islands of Samar and Eastern Leyte 3200000 1,266,873
246. Kenyi
NA 3100000 0
247. Musi
Baso Palembang
indigenous language spoken by Musi people native to Musi regions in South Sumatra 3100000 0
248. Southern Kurdish
کوردی خوارین
variety of Kurdish comprising several dialects of Northwestern Iranian languages spoken in the south of Kurdistan (in upper Mesopotamia, to the west of Iran and east of Iraq) 3100000 0
249. Tachelhit
Taclḥiyt
Berber language of southwestern Morocco 3100000 10,886
250. Yao
Ciyawo
language of Africa 3100000 0

Additional Languages with Wikipedia Editions

These languages fall outside the top 250 by speaker population but have active Wikipedia editions. Global rank (by estimated L1 speakers) shown where available.

Language Language Description Speakers (L1 est.) Wikipedia Articles
257. Armenian
հայերեն
Indo-European language 3000000 325,140
265. Kyrgyz
кыргызча
Kipchak Turkic language of Central Asia 2900000 76,187
266. Sango
Sängö
NA 2900000 352
268. Aymara
Aymar aru
native language in South America 2810000 5,255
270. Tibetan
བོད་ཡིག
Tibeto-Burman language 2800000 8,023
274. Jamaican Creole English
Patwa
an English-based creole spoken in and around Jamaica; it additionally takes influence from various African languages, particularly Akan 2700000 1,732
283. Batak Toba
ᯅᯖᯂ᯲ ᯖᯬᯅ
Batak language spoken by the Toba Batak ethnic group 2500000 1,400
284. Central Bikol
Bikol Sentral
Austronesian language spoken in the Philippines 2500000 21,663
290. Pampanga
Pampanga
Austronesian language spoken in the Philippines 2500000 10,208
292. Tsonga
Xitsonga
Bantu language of the Tsonga people of southern Africa 2500000 1,084
299. Lithuanian
lietuvių
Baltic language spoken in Lithuania 2300000 225,399
306. Swati
siSwati
language of the Swazi people 2140000 1,136
312. Esperanto
esperanto
NA 2000000 383,179
315. Occitan
occitan
Romance language of Western Europe 2000000 90,655
319. Tulu
ತುಳು
Indian Dravidian language of Tulu Nadu region 2000000 3,186
320. Tatar
татар теле
Turkic language spoken by Tatars 2000000 610,052
327. Fanti
NA 1900000 1,795
336. Tosk Albanian
Schwyzertüütsch
southern group of dialects of the Albanian language 1800000 31,625
337. Bashkir
Башҡортса
Turkic language in Russia 1800000 63,941
339. Chuvash
Чӑвашла
Turkic language spoken in central Russia, primarily in the Chuvash Republic and adjacent areas 1800000 58,769
341. Northern Luri
لری
NA 1800000 1
343. Slovenian
slovenščina
South Slavic language spoken primarily in Slovenia 1800000 196,968
344. Tumbuka
Chitumbuka
Niger-Congo language originating in parts of Malawi, Zambia, and Tanzania 1800000 18,808
358. Dimli
Dimilkî
NA 1600000 42,607
360. Igala
Igala
language of the Yoruboid branch of the Volta–Niger language family 1600000 1,068
364. Scots
Scots
Germanic language 1600000 34,157
379. Meiteilon (Manipuri)
Meitheiron
Sino-Tibetan language 1500000 10,460
380. Pangasinan
Pangasinense
Austronesian language spoken in the province of Pangasinan by Pangasinense people 1500000 2,619
394. Western Armenian
արեւմտահայերէն
Indo-European language 1400000 13,436
398. Macedonian
македонски
South Slavic language mostly spoken in North Macedonia and its neighbouring countries 1400000 159,859
401. Norwegian Nynorsk
nynorsk
one of two official written standards for the Norwegian language 1400000 177,568
408. Avaric
Магӏарул мацӏ
language belonging to the Avar–Andic group of the Northeast Caucasian language family 1310000 4,013
417. Venda
Tshivenḓa
language of the Venda people 1300000 892
425. Estonian
eesti
NA 1200000 258,539
445. Vlaams
West-Vlaams
Germanic language spoken in West Flanders, French Flanders and in the west of Zeelandic Flanders 1200000 8,322
447. Batak Mandailing
Saro Mandailing
language spoken by the Mandailing ethnic group 1100000 1,208
451. Irish
Gaeilge
language native to Ireland 1100000 63,499
453. Gorontalo
Bahasa Hulontalo
language in northern Sulawesi, Indonesia 1100000 15,094
456. Latvian
latviešu
Baltic language, official in Latvia and the European Union 1100000 141,714
458. Sardinian
Sardu
NA 1100000 7,799
459. Tigre
ትግረ
semetic language spoken in the Horn of Africa 1100000 44
462. Talysh
Tolışə
NA 1006000 10,090
468. Southern Dagaare
Dagaare
language of the Dagaara people in Ghana and Burkina Faso 1000000 3,116
469. Basque
euskara
language of the Basque people 1000000 482,145
470. Farefare
Frafra
language in West Africa 1000000 1,334
474. Kabiyè
kabɩyɛ
Eastern Gurunsi language primarily of northern Togo 1000000 1,714
479. Newari
नेपाल भाषा
Sino-Tibetan language of central-eastern Nepal 1000000 73,813
481. Nupe-Nupe-Tako
Nupe
NA 1000000 698
482. Rakhine
ရခိုင်ဘာသာ
NA 1000000 1,148
502. Limburgan
Limburgs
Low Franconian group of dialects 950000 15,178
506. Chechen
Нохчийн
Northeast Caucasian language spoken mostly in Chechnya and by Chechen people 940000 818,550
517. South Ndebele
isiNdebele seSewula
NA 900000 297
532. Welsh
Cymraeg
Brythonic language spoken natively in Wales 850000 284,040
533. Komering
Cawa Komering
Lampungic language spoken predominantly by the Komering ethmic group 850000 2,888
535. Mon
ဘာသာ မန်
Austroasiatic language spoken by the Mon in Burma and Thailand 850000 1,974
541. Iban
Iban
Coastal Dayak language spoken by Iban people 820000 2,316
544. Tetum
Tetum
Austronesian language spoken on the island of Timor 820000 1,384
547. Venetian
vèneto
Romance language spoken in the Italian region of Veneto 810000 69,574
550. Dagbani
Dagbani
Gur/Mabia language spoken in Ghana 800000 13,596
560. Nias
Li Niha
Austronesian language spoken in Indonesia 770000 1,770
562. Dotyali
डोटेली
Indo-Aryan language 760000 3,663
570. Angika
अंगिका
Bihari language of India and Nepal 740000 1,672
571. Frisian
Frysk
Germanic language native to the Dutch region of Friesland 740000 59,578
589. Picard
Picard
Gallo-Romance "langue d'oïl", spoken in northern France and southern Belgium 700000 6,089
595. Walloon
walon
Romance language indigenous to Belgium and France 680000 12,914
603. Asturian
asturianu
NA 650000 138,965
605. Gun
gungbe
language spoken in Nigeria and Benin 650000 1,617
615. N'Ko
ߒߞߏ
NA 630000 1,587
620. Neapolitan
napulitano
Italo-Dalmatian language spoken in southern Italy 610000 14,954
629. Kuanyama
Oshikwanyama
language of Angola and Namibia 600000 4
635. Ruthenian
NA 600000 1,196
642. Crimean Tatar
qırımtatar tili
Turkic language spoken in Crimea 580000 29,674
651. Breton
brezhoneg
Celtic language spoken in France 560000 90,834
661. Ndonga
ndonga
Bantu language spoken in Namibia and parts of Angola 550000 8
663. Papiamento
Papiamentu
creole language spoken in the Dutch West Indies 549000 5,328
665. Ligurian
lìgure
Gallo-Romance language (for the ancient extinct language use Q36104) 540000 11,495
667. Udmurt
Удмурт
Uralic language 540000 5,916
668. Ossetian
Ирон ӕвзаг
dialect of the Ossetian language 538000 21,540
673. Rusyn
русиньскый язык
East Slavic language 530000 10,207
681. Pa'O Karen
ပအိုဝ်ႏ
Pa'O language 500000 2,913
693. Silesian
ślůnski
West Slavic ethnolect 500000 59,974
699. Kara-Kalpak
Qaraqalpaq tili
Turkic language spoken in Uzbekistan 490000 11,734
707. Eastern Mari
олык марий
Mari language in the Uralic language family 470000 11,325
714. Maltese
Malti
Semitic language spoken mostly in Malta 460000 7,769
722. Jju
Diryem Jju
Niger–Congo language spoken in central Nigeria 450000 259
729. Yakut
Саха тыла
Turkic language 450000 18,081
734. Kabardian
Адыгэбзэ
Northwest Caucasian language 440000 1,640
735. Kusaal
Kʋsaal
NA 440000 1,310
740. Erzya
Эрзянь
Uralic language spoken in Russia 440000 7,869
745. Mingrelian
მარგალური
Kartvelian language spoken in Western Georgia 440000 22,218
760. Luxembourgish
Lëtzebuergesch
Germanic language or language variety spoken in Luxembourg 420000 66,881
768. Sranan Tongo
Sranantongo
creole language spoken in Suriname 410000 1,128
791. Pfaelzisch
Pälzisch
West Franconian dialect of German 400000 2,847
809. Pontic
Ποντιακά
Greek dialect 390000 538
814. Dhivehi
ދިވެހި
Indo-Aryan national language of the Maldives 380000 3,188
818. Fiji Hindi
फ़िजी हिंदी
language spoken by most Fijian citizens of Indian descent 380000 12,213
824. Dzongkha
རྫོང་ཁ་
Sino-Tibetan language spoken in Bhutan 370000 387
825. Fijian
Na Vosa Vakaviti
Austronesian language of the Malayo-Polynesian family spoken in Fiji 370000 1,715
840. Kalmyk
Хальмг Өөрдин
register of the Kalmyk language, natively spoken by the Kalmyk people of Kalmykia 360000 1,590
847. Icelandic
íslenska
North Germanic language mainly spoken in Iceland 350000 61,218
874. Obolo
Andoni
NA 320000 433
891. Vlax Romani
Romani čhib
dialect group of the Romani language 310000 755
905. Moksha
Мокшень
member of the Mordvinic branch of the Uralic languages and the majority language in the western part of Mordovia 300000 7,628
950. Bislama
Bislama
English-based creole language of Vanuatu 270000 1,487
969. Komi
Коми кыв
Uralic language that is spoken on the Republic of Komi, Russia 260000 5,751
971. Lezghian
Лезги
Northeast Caucasian language that belongs to the Lezgic languages 260000 4,459
981. Tai Nüa
ᥖᥭᥰ ᥘᥫᥴ
NA 260000 449
993. Extremaduran
Estremeñu
Romance language spoken in Spain 250000 4,178
1037. Karachay-Balkar
Къарачай-Малкъар
Turkic language 240000 2,784
1038. Kölsch
Kölsch
dialect of the Ripuarian Central German group of languages 240000 3,038
1049. Zeeuws
Zeêuws
language or collection of related dialects spoken on and around the Zeelandic islands 240000 7,215
1058. Ingush
ГӀалгӀай
language spoken by the Ingush people 230000 2,512
1089. Russia Buriat
Буряад хэлэн
Buryat language 218000 2,913
1107. West Coast Bajau
ling Sama
NA 200000 239
1127. Tyap
Katab
Niger–Congo language spoken in central Nigeria 200000 1,506
1150. Samoan
Gagana faʻa Sāmoa
language of the Samoan Islands 200000 1,208
1175. Kadazan Dusun
Kadazandusun
language spoken by the Dusun and Kadazan peoples of Sabah, Malaysia 180000 1,790
1193. Tuvinian
Тыва дыл
Turkic language in Russia 180000 4,128
1208. Latgalian
Latgalīšu
historical variety of Latvian, sometimes considered a separate Baltic language 170000 1,130
1212. Navajo
Diné bizaad
Athabaskan language of Na-Dené stock spoken in the southwestern United States 170000 22,664
1227. Corsican
Corse
Italo-Dalmatian language 160000 8,653
1253. Yiddish
יידיש
High German-derived language used by Ashkenazi Jews 160000 15,653
1274. Hiri Motu
Hiri Motu
Austronesian language of Papua New Guinea 150000 3
1317. Amis
Pangcah
language of the Amis 140000 1,146
1336. Maori
Māori
Polynesian language spoken in New Zealand 140000 8,033
1365. Wayuu
Wayuunaiki
NA 130000 697
1387. Pennsylvania German
Deitsch
variety of West Central German 130000 2,049
1397. Adyghe
Адыгабзэ
one of the official languages of the Republic of Adygea in Russia 120000 640
1457. Gagauz
Gagauz dili
Turkic language, spoken mainly by the Gagauz people and the official language of the autonomous Moldovan region of Gagauzia 110000 3,013
1470. Ladino
Ladino
language of Sephardic Jews and form of Spanish 110000 3,994
1471. Lak
Лакку
Northeast Caucasian language 110000 1,090
1546. Narom
Normaund
language spoken in Malaysia 100000 5,059
1564. Tonga
lea fakatonga
Polynesian language 100000 2,045
1620. Tahitian
Tahiti
language of French Polynesia without official language status 91000 1,250
1621. Bishnupriya
ইমার ঠার
Indo-Aryan language spoken in India and Bangladesh 90000 25,092
1637. Abkhazian
Аҧсшәа
Northwest Caucasian language native to northwestern Georgia 88000 6,494
1659. Atayal
Atayal
Austronesian language spoken in Taiwan 84000 2,582
1724. Scots Gaelic
Gàidhlig
Goidelic Celtic language of Scotland 72000 16,046
1772. Paiwan
pinayuanan
Austronesian language spoken in Taiwan 66000 376
1787. Arpitan
arpetan
NA 64000 5,825
1788. Komi-Permyak
Перем коми
Uralic language spoken in Perm Krai, Russia 64000 3,471
1873. Guianese Creole French
Kréyòl
French-based creole from French Guiana 52000 1,076
1895. Kashubian
Kaszëbsczi
West Slavic language spoken in Poland 50000 5,532
1930. Faroese
føroyskt
insular Nordic language spoken as a native language by the people of Faroe Islands 49000 14,200
1964. Inuktitut
ᐃᓄᒃᑎᑐᑦ
NA 45000 431
1994. Romansh
rumantsch
Romance language spoken predominantly in the southeastern Swiss canton of Grisons (Graubünden) 42000 3,840
2063. Chamorro
Chamoru
Malayo-Polynesian (Austronesian) language, spoken on the Mariana Islands 37000 559
2067. Friulian
Furlan
Romance language belonging to the Rhaeto-Romance family, spoken in the Friuli region of northeastern Italy 37000 4,934
2155. Ladin
Ladin
Romance language 31000 180,848
2156. Livvi
Livvinkarjala
Finno-Ugric language 31000 4,693
2176. Hawaiian
ʻŌlelo Hawaiʻi
Polynesian language 30000 2,974
2192. Western Mari
Мары йӹлмӹ
NA 30000 10,430
2277. Aragonese
Aragonés
Romance language 26000 63,115
2283. Cherokee
ᏣᎳᎩ
Iroquoian language spoken by the Cherokee people 26000 987
2435. Southern Altai
Алтай тил
Kipchak Turkic language of the Altai Republic, Russia 20000 1,105
2625. Northern Sami
davvisámegiella
most widely spoken of all Sámi languages 16000 7,907
2658. Mirandese
mirandés
Romance language belonging to the Astur-Leonese linguistic group, sparsely spoken in a small area of northeastern Portugal 15000 4,294
2744. Upper Sorbian
hornjoserbšćina
language spoken by Sorbs in Germany in the historical province of Upper Lusatia 13000 14,236
2850. Choctaw
Chahta
Muskogean language spoken in US 11000 6
2964. Sakizaya
Sakizaya a kamu
Eastern Taiwan language 10000 2,735
2993. Northern Frisian
Nordfriisk
minority language of Germany, spoken mostly by people in North Frisia 9600 21,025
3089. Inupiaq
Iñupiatun
group of dialects of the Inuit language 8000 594
3177. Lower Sorbian
dolnoserbšćina
Western Slavic language spoken in eastern Germany in the historical province of Lower Lusatia 7000 3,444
3240. Atikamekw
Atikamekw Nehiromowin
Algonquian language 6400 2,078
3256. Piemontese
Piemontèis
Romance language spoken mainly in Italy 6200 71,114
3468. Sediq
patas Taroko
NA 4700 1,201
3575. Creek
Mvskoke
Indigenous American language 4000 1
3667. Veps
Vepsän
Finnic language native to Northwest Russia 3500 7,110
4027. Cornish
Kernewek
Brythonic Celtic language in Southwestern Britain 2000 7,119
4092. Cheyenne
Tsêhesenêstsestôtse
indigenous language of the United States 1900 721
4141. Manx
Gaelg
Celtic language spoken on the Isle of Man 1700 7,093
4473. Saterfriesisch
Seeltersk
last living dialect of the East Frisian language 960 4,130
4668. Inari Sami
anarâškielâ
Sami language spoken by the Inari Sami of Finland 610 6,529
5414. Pipil
language of Central America 20 247

Classical, Constructed, and Special-Purpose Wikipedia Editions

These Wikipedia editions are not associated with a living natural language in the LinguaMeta dataset. They include classical and liturgical languages (Latin, Sanskrit, Old English), constructed languages (Esperanto is covered above; Ido, Interlingua, Volapük, etc. appear here), dialect/regional-code editions, and the Simple English Wikipedia.

Wikipedia Edition WP Code Articles
Simple English simple 280,183
Latin la 141,131
Belarusian(Taraškievica orthography) be-tarask 90,719
Ido io 61,480
Volapük vo 49,385
Interlingua ia 30,396
Kotava avk 29,900
Samogitian bat-smg 17,275
Classical Chinese zh-classical 14,069
Emilian–Romagnol eml 13,956
Banyumasan map-bms 13,943
Interlingue ie 13,509
Sanskrit sa 12,487
Tarantino roa-tara 9,505
Bihari (Bhojpuri) bh 8,935
Dutch Low Saxon nds-nl 8,078
Võro fiu-vro 6,890
Old English ang 5,174
Lingua Franca Nova lfn 4,547
Nahuatl nah 4,165
Toki Pona tok 3,765
Zamboanga Chavacano cbk-zam 3,229
Novial nov 2,069
Aramaic (Syriac) arc 1,918
Aromanian roa-rup 1,391
Lojban jbo 1,353
Old Church Slavonic cu 1,340
Gothic got 1,004
Pali pi 299