Localization Statistics

This page is the home of an effort to determine one possible metric of browser localization coverage - that is, the percentage of the Internet population who have the browser in their first language (a.k.a. "language spoken at home"). This is not the only way of measuring, but I think it is one valid one. The higher the percentage is, the more people you are giving the best service to.

Calculating this figure is difficult because available data for "Internet population" is given by country, and not by language. So, I have taken the net population figures for each country and split them up proportionally to the first languages spoken in that country. There is an assumption here that the internet population of a country is evenly distributed among its language communities, which is probably not true for a country like India. However, this method will give better results than just allocating languages to countries on a 1:1 basis - "USA - English; India - Hindi; Italy - Italian" and so on. (Did you know that only 82.1% of the population of the USA speaks English at home?)

Better data is always welcome - and if any country can provide data of internet population split up by language, that can be incorporated instead. But as yet, I have found none.

Download the spreadsheet (OpenDocument format).

Headline Coverage Figures of World Net Population

Total World Net Population: 1,231,026,000

CategoryCountPercentage
Firefox 2 official4488.3%
Firefox 3 official4588.6%
Firefox 2 and Firefox 3 official4889.7%
All Firefox localizations found7094.8%
Opera all3285.2%
Opera 9.53084.4%
IE 77798.3%

FAQ

Why does the "Firefox 3" line say there are 45 localizations and not 48, when there are 48 on the download page?
This work is an overview, which comes with some simplifications. I only have one column in my spreadsheet for each of English, Portuguese and Spanish (and some other languages like Tamil). That's not to rubbish the work of localizers like the en-GB team; but, as a Brit myself, I'm happy to admit that I could use en-US at a pinch, and it would still be much better than having to use a browser in Urdu or Simplified Chinese.
Why is "Firefox 2 and Firefox 3" more than either "Firefox 2" or "Firefox 3"?
At the time of writing, FF3 has Indonesian, Sinhala, Albanian and Serbian (welcome!), which FF2 doesn't have. FF2 has Bulgarian, Welsh and Persian, which FF3 doesn't have (yet). I include this combined line to show you where we will be soon, on the assumption that these localizations will appear sooner rather than later.
Which language communities do we serve which IE does not?
Hello to native speakers of Belarusian, Lower Sorbian, Friulian, Frisian, Upper Sorbian, Kurdish, Maori, Mongolian, Ndebele, Sepedi, Sinhala, Siswati, Sesotho, Tsonga, Venda, Wolof and Xhosa. We're meeting your needs :-) (And particularly large kudos to translate.org.za for being responsible for a load of these.)
What do you define as a localization?
Something I can download and install, either a complete bit of software or some sort of add-on, which lets me view the user interface of that software in that language. Some localizations may well offer more than this (such as regionalized resources e.g. search) but this is my minimum benchmark for inclusion.
Are IE's localizations or Opera's localizations better or worse than the Firefox ones?
That's a question of some debate. My view is that not all localizations are created equal, but any localization is much, much better than no localization. In this spreadsheet, I lump together 90%, 95% and 100% solutions and call them all "supported", which is by contrast to the 0% solution, no localization, which is "not supported". I think this is a reasonable simplification, and it also avoids having to have arguments about whether or not an IE LIP is superior or inferior to a Firefox addon language pack.
Why is the "Firefox 3 official" percentage lower than IE 7's?
There are a couple of relatively big languages (as well as a dozen smaller ones) that we don't have official localizations for yet which they do: Hindi (2.20%), Vietnamese (1.48%), Thai (0.70%), Malay (0.66%), Bengali (0.47%), Tagalog (0.39%), Tamil (0.41%), Telugu (0.38%), Urdu (0.32%), ... .
If Microsoft were reading, which language should they do next?
Belarusian (0.45%), a Slavonic language spoken in Belarus, Russia, Ukraine and Poland.

Original URL: http://www.gerv.net/hacking/l10n-stats/