Since yesterday, the following image from an article by liberal journalist Evgenya Albats has been making the rounds on the Internet. It shows that whereas Putin’s official tally was 65%, independent observers put it close to or below the 50% marker that would necessitate a second round, such as Golos’ 51% and Citizen Observer’s 45%. Predictably, these figures were seized upon by the liberals to condemn the legitimacy of the elections. As Putin ended up getting 63.6%, while the average of all observers was 50.2%, one could conclude that the level of fraud was 13% or more.
However, as pointed out by Kireev, this is a gross misuse of statistics for political ends, because of the severe sampling problems: Golos observers were concentrated in Moscow, St.-Petersburg, and a few other large cities where Putin is less popular, while Citizen Observer is almost entirely confined to the capital. The website http://sms.golos.org/ collates the results from all the big Russian observer projects, and from the regional data, we can see that about half the election protocols compiled to create these figures were from Moscow; almost another quarter were from Moscow oblast and St.-Petersburg.
Nonetheless, while looking through the regional data, I realized that if it were to be adjusted for its pro-Moscow (anti-Putin) sampling bias, we could get a fairly a good estimate for the level of fraud in this election; or at least, an upper limit for it. And so that’s what I proceeded to do.
After assembling the data, I came up with the following table. The first column are the different provinces. The second column is Putin’s vote according to the observer protocols for that region. The third column is Putin’s vote according to the Central Election Commission. The third column is the difference between the two. It may represent fraud, but it may also be (1) sampling bias – more on this later, (2) natural margins of error, which are especially high in regions where there were few observers. The fourth column is the total number of ballots (both real and spoiled) cast in this region.
ВВП (набл.) | ВВП (ЦИК) | Х | бул. | |
Республика Адыгея (Адыгея) | 59.72 | 64.07 | 4.35 | 220481 |
Республика Башкортостан | 63.21 | 76.38 | 13.17 | 2300258 |
Республика Карелия | 50.56 | 55.38 | 4.82 | 309439 |
Республика Коми | 70.28 | 65.02 | -5.26 | 525780 |
Республика Марий Эл | 57.11 | 59.98 | 2.87 | 381148 |
Республика Татарстан (Татарстан) | 72.21 | 82.70 | 10.49 | 2378904 |
Удмуртская Республика | 62.50 | 65.75 | 3.25 | 784405 |
Чувашская Республика – Чувашия | 59.01 | 62.32 | 3.31 | 702957 |
Алтайский край | 50.90 | 57.35 | 6.45 | 1175430 |
Краснодарский край | 59.96 | 63.72 | 3.76 | 2692090 |
Красноярский край | 55.33 | 59.53 | 4.20 | 1303846 |
Приморский край | 40.97 | 57.31 | 16.34 | 989669 |
Ставропольский край | 66.80 | 64.47 | -2.33 | 1195740 |
Хабаровский край | 52.21 | 56.15 | 3.94 | 653997 |
Амурская область | 59.35 | 62.84 | 3.49 | 399704 |
Архангельская область | 54.83 | 57.97 | 3.14 | 575014 |
Астраханская область | 60.46 | 68.76 | 8.30 | 432603 |
Белгородская область | 49.22 | 59.30 | 10.08 | 899973 |
Брянская область | 50.01 | 64.02 | 14.01 | 699848 |
Владимирская область | 50.03 | 53.49 | 3.46 | 638010 |
Волгоградская область | 58.91 | 63.41 | 4.50 | 1278416 |
Вологодская область | 58.88 | 59.44 | 0.56 | 608595 |
Воронежская область | 55.05 | 61.34 | 6.29 | 1304344 |
Ивановская область | 60.33 | 61.85 | 1.52 | 519239 |
Иркутская область | 50.49 | 55.45 | 4.96 | 1072723 |
Калининградская область | 47.82 | 52.55 | 4.73 | 457483 |
Калужская область | 54.18 | 59.02 | 4.84 | 506933 |
Кемеровская область | 66.51 | 77.19 | 10.68 | 1642580 |
Курганская область | 53.68 | 63.39 | 9.71 | 482391 |
Курская область | 56.91 | 60.45 | 3.54 | 606717 |
Ленинградская область | 55.86 | 61.90 | 6.04 | 810757 |
Липецкая область | 54.70 | 61.00 | 6.30 | 626535 |
Московская область | 52.71 | 56.85 | 4.14 | 3545368 |
Мурманская область | 58.58 | 60.05 | 1.47 | 407311 |
Нижегородская область | 53.58 | 63.90 | 10.32 | 1857953 |
Новгородская область | 51.49 | 57.91 | 6.42 | 309970 |
Новосибирская область | 52.15 | 56.34 | 4.19 | 1352726 |
Омская область | 42.80 | 55.55 | 12.75 | 974829 |
Оренбургская область | 50.91 | 56.89 | 5.98 | 1014937 |
Орловская область | 45.22 | 52.84 | 7.62 | 450151 |
Пензенская область | 53.43 | 64.27 | 10.84 | 765541 |
Пермский край | 59.62 | 62.94 | 3.32 | 1170209 |
Ростовская область | 58.15 | 62.66 | 4.51 | 2113180 |
Рязанская область | 54.16 | 59.74 | 5.58 | 620967 |
Самарская область | 52.98 | 58.56 | 5.58 | 1557667 |
Саратовская область | 60.52 | 70.64 | 10.12 | 1323161 |
Сахалинская область | 53.76 | 56.30 | 2.54 | 228350 |
Свердловская область | 60.06 | 64.50 | 4.44 | 2073983 |
Тверская область | 54.67 | 58.02 | 3.35 | 667496 |
Томская область | 50.93 | 57.07 | 6.14 | 458311 |
Тульская область | 58.16 | 67.77 | 9.61 | 867569 |
Тюменская область | 73.19 | 73.10 | -0.09 | 836179 |
Ульяновская область | 53.99 | 58.18 | 4.19 | 666159 |
Челябинская область | 60.76 | 65.02 | 4.26 | 1729399 |
Забайкальский край | 60.94 | 65.69 | 4.75 | 498407 |
Ярославская область | 48.58 | 54.53 | 5.95 | 670972 |
Город Москва | 45.11 | 46.95 | 1.84 | 4247438 |
Город Санкт-Петербург | 50.33 | 58.77 | 8.44 | 2388567 |
Ханты-Мансийский автономный округ | 65.12 | 66.41 | 1.29 | 707504 |
Чукотский автономный округ | 44.84 | 72.64 | 27.80 | 29337 |
Территория за пределами РФ | 65.17 | 73.19 | 8.02 | 441931 |
Всего в регионах с наблюдателями | 56.11 | 61.97 | 5.86 | 63151581 |
РОССИЯ | 63.60 | 71701665 |
Sources: CEC regions data, Kommersant elections map, Golos’ SMS-ЦИК observer data aggregation project.
As you can see, the figures are more or less as what we can expect from analysis already published on this blog. In Moscow, fraud is minimal, the difference between observer protocols and the official result being less than 2%. We can be fairly certain about this: The protocols analyzed have data on over a million, i.e. some 1,021,810 votes, out of a total of 4,247,438 cast; at almost 25%, this is excellent coverage. Furthermore, the real fraud figure may be smaller than the 1.84% given above because the observers made sure to cover all the stations with the most suspicious 2011 results.
Coverage in St.-Petersburg is far smaller at 5%, but the fraud figure of 8.44% can still be treated as very reliable. It is backed up by other statistical evidence.
To get a figure for the regions in SMS-ЦИК dataset, which accounted for 88% of Russia’s total votes, I took the regional observer protocols’ figures for Putin and weighing them by the total number of ballots in that region. My final fraud figure using this method came out to 5.86%.
Five Caveats
This is not a conclusive fraud figure, of course, there still being at least five factors that would further influence it. Two of them are negative, one is probably neutral, and two are positive.
(1) This is a negative factor, but one that is very hard to quantify. The pro-Putin votes are weighted according to turnout, however, it is also the case that regions with greater turnout tend to have more fraud – this is because one of the most common methods of fraud is inflating turnout that almost invariably benefits Putin. But it is important stress that this relationship does not necessarily imply fraud, for it is also the case that there are subgroups of the Russian population – primarily, rural dwellers – among whom turnout is naturally higher. So we can expect turnout to be higher in some of the more rural provinces without fraud being responsible. Separating out the two is extremely tricky and is closely tied to a related problem – to what extent is fraud, or subgroups with specific voting patterns, responsible for Putin’s and United Russia’s long tails?
(2) The neutral factor (more or less) are the margins of error that come from only having a very limited numbers of observers in the more remote regions. For instance, it seems pretty unlikely there was 5% fraud AGAINST Putin in the Komi republic. 😉 I am assuming that since there margins can either be positive or negative, they will largely cancel themselves out by the time we calculate the aggregate total.
(3) This is a negative factor. Some regions, accounting for 12% of the total votes, are missing from the SMS-ЦИК dataset: Altai Republic, Buryatia, Daghestan, Ingushetia, Kabardino-Balkaria, Kalmykia, Karachay-Cherkess, Mordovia, Sakha Republic, North Ossetia, Tyva, Khakassia, Chechnya, Kamchatka krai, Kirov oblast, Kostroma oblast, Magadan oblast, Smolensk oblast, Tambov oblast, Jewish autonomous oblast, Nenets autonomous oblast, Yamalo-Nenets autonomous oblast, and Baikonur (Kazakhstan).
The FOM exit poll data showed that even though the North Caucasus was the region most wracked by fraud, it also showed, at 68.4%, the highest genuine support for Putin. The election in Stavrapol krai appear to have been fair – the official figure there was actually higher than the observers’ – so let’s leave its result as is. Assuming that turnout in the ethnic minority republics of the North Caucasus was only 50% or so, as seems more likely based on anecdotal evidence rather than the 90%-like official turnout, then the real, average Putin vote across those areas would then be about 71% – still above the Russian national average, but only moderately so – as opposed to the official 89%. This would raise Putin’s real average score by a bit, but by less than he would lose from the large amount of fraud embodied in them.
Similar things can be said, albeit to a smaller extent, for the other ethnic republics (a few of which, like Buryatia, seem to be quiet fair; others, like Mordovia, which are as fraudulent as anything observed in the North Caucasus). The average Putin vote officially in all the non-North Caucasus, non-observed regions is 68%; of the ethnic Russian majority ones, only about 62%. These regions are already almost or entirely consistent with the national average, so they will have only the most insignificant impacts.
Including all the other regions will up the official score to 63.6% (by definition), but will also increase both the level of fraud and Putin’s real score. So perhaps Putin will go up to 57.0% (thanks to the genuine North Caucasus votes), but fraud will also increase to maybe 6.6%.
(4) Now this is already looking very bad, as bad as the 2011 elections, but fortunately there are two major mitigating factors. First, just as nationwide observers are biased towards Moscow, then logically at the regional level they would likewise be biased towards major urban areas. If a crew of observers volunteer in some Russian backwater province, after hearing Navalny’s call over the Internet, chances are they would hail from the big local urban center. And there are significant voting differences between town and city in Russia, with the rural voters consistently both turning out in greater numbers and giving the Kremlin candidate around 10% or even 15% more votes (e.g., in FOM’s last pre-elections poll, only 43% of Muscovites and 47% of people living in cities of more than one million said they’d vote for Putin, compared to 51% of small towners and 58% of rural folks).
Now some 25% of Russia’s population is rural, and another significant part lives in small towns; the observer presence there is all but minimal, in any one region. As such, the observer protocol figures would systemically understate Putin’s vote. To what extent? Crude back of the envelope calculation, but I think it’s valid: 25% of a subgroup that gives Putin 10% more would give him 2.5% more, and they are very much underrepresented in the poll; add another 0.5% for the small town people. Putin’s real score rises to 60.0%, while his fraud score is compressed to 3.6%; total remains, by definition, at 63.6%.
(5) It is also known that observers concentrated most on polling stations that had a legacy of suspicious results from the 2011 elections. Since it is likely that those stations are still more likely – relative to others – to be bad apples this time round, the focus on them means that the level of fraud may be further artificially skewed.
Interpretation
There are many ways one can interpret these results.
One can cite the 5.86% figure as the most precise one, but one that doesn’t take into account a number of complicating factors. Alternatively, one could argue for a significantly lower figure, like 3.6% – or even lower once you adjust for the last factor. Alternatively, one can argue that the positive factors cancel out the first factor, which is unknown in magnitude but surely significant, and so return to a fraud estimate of 4%-6%. This range would back the two most comprehensive exit polls, FOM which gives Putin 59.3% (possible fraud: 4.3%) and VCIOM, which gives him 58.3% (possible fraud: 5.3%).
Either way, one thing is absolutely clear: A proper analysis of the observer protocols statistics can in no way support the theory that Putin got less than 50%.
Thanks! What’s not included in your percentage is vote tampering is the carousels and other tricks with the additional lists. That’s an extra 3-4%. I’d also like to think that observers functioned as a deterrent. In other words fraude percentage may have been higher at unmonitored polling states. On the other hand one can also not completely believe ‘opposition’ observers on their blue eyes … In general i’d like to keep it simple: Fraude turned a slim Putin majority into a confident win.
Joera,
It’s an interesting addition. But I’m very skeptical about the influence of carousels. Moscow was deluged with them, at least if the media noise was to be believed, but there the difference between results at observed stations and the official tally was less than 2%. The tight circle around the turnout – Putin vote graph, and the almost perfect Gaussian shape, is further confirmation that the scale of the fraud there was minimal.
This is not surprising as you’d need nearly 50,000 stuffed ballots to increase Putin’s share by just 1% in Moscow alone. I would think their national influence at much less than 1%. It is after all so much easier to falsify in the privacy and comfort of the vote count.
I would also note that in this election, even the Shpilkin method of assuming that the turnout/vote distribution for Putin has to follow that of his opponents – a flawed one, because Russian rural voters have specific voting patterns involving higher turnouts and more support for the party of power – gives Putin a score of 57.5% (official: 63.6%). So I would treat 6% fraud as the absolute quasi-feasible maximum in this election.
Another outstanding analysis Anatoly. I hope it receives as much attention on the Runet as Yevgenia Albats’s original article. In my opinion this is another article that Inosmi should publish.
That haven’t published me in 4 months. 🙁 Perhaps the last one they did put them off.
I want to just quickly add that the one thing that disproves the Golos and Citizens’ Observer figures as a true reflection of the vote across the whole of Russia (as opposed to Moscow) is the impossibly high figures both give to Prokhorov. Golos puts him almost level with Zyuganov and Citizens’ Observer puts him ahead of Zyuganov. Whilst that was true of Moscow and whilst one can understand why the folks at Golos and Citizens’ Observer might want it to be so, I do not think that there is a single detached observer of the Russian political scene who believes it. Besides it is contradicted by the opinion and exit polls.
You are right. The reason Prokhorov scores so highly is simply because the observers are concentrated in the capital, Moscow oblast, and St.-Petersburg.
The differences between Golos and Citizen Observer are directly due to this factor. Citizen Observer (and Navalny’s RosVybory) are the most concentrated in the capital; Golos has a slightly broader geographic reach, so they still show Putin in the lead if barely so.
This doesn’t mean their results aren’t useful, but the problem is when partisans begin citing them as “proof” of Putin’s illegitimacy when they constitute no such thing due to sampling issues as discussed extensively in this post.
Well I had said from before that we shouldn’t be surprised if prokhorov is declared as being THE victim of a stolen vote. With the superjob poll giving him 21% support and those two organizations giving him second place and implying that a runoff should have occurred I expect that some in the western MSM will pick up on this meme as well despite all the other evidence which shows that prokhorov could not get 10% support nationally. But then for it seems that Moscow = Russia.
One very last comment. One does wonder which party is the principal victim of fraud? The analysis of the vote in the northern Caucasus suggests that it might be the KPRF. It appears to have done a great deal better there than the official figures indicate. Could that be true of the other Muslim republics as well?
I don’t think any one party suffered from fraud more than another; when ballots are stuffed or Putin’s figures are bumped up, the other parties are repressed equally.
Zyuganov saw the largest losses due to fraud in the North Caucasus, but only because the other three candidates have virtually no brand there. Same goes for their respective parties. Hence, natural result of both Putin and commie would be higher than in other regions due to the near absence of the other three, but extensive fraud would hurt the KPRF most in absolute terms.
There seem to be three distinct patterns of fraud in the North Caucasus. Chechnya just writes up 99%+ support for Putin, nice and simple; and the other Muslim republics seem to rely on the specifics characteristics of communal voting to achieve their 90%+ results, as described in my last post; North Ossetia is almost as top down as Chechnya, but to keep up pretenses splits the steak with the commies, by a 75% UR/Putin vs. 20% Communist/Zyuganov arrangement with the others fighting over the remaining 5% scraps. Their 2012 vote looks exactly the same as in 2011.
Anatoly,
why not simply do a random check of the numbers given in Golos data and TsIK for the same UIK? In Moscow, Golos has something like 1000 UIKs in their database, if you spend couple of hours you’ll be able to run a simple script selecting 100 of them, and compare the data. That would be a much, much better evidence.
Thanks for that idea! Perhaps I’ll do something along those lines this weekend, if someone else doesn’t get there by then.
That would be a good idea indeed.
The SMS-TSiK data: http://sms.golos.org/export.xml
Do you know where I can find the UIK-level official data?
Shipilev has the database with official results, I believe: http://shipilev.livejournal.com/99071.html
Of course, politrash did something like this already: http://politrash-ru.livejournal.com/73814.html
Great work, Anatoly! I needed something like this for an article I am writing. Greetings from Spain. Keep it up!
Ah Golos. “Russia’s only independent vote monitoring organisation” as we are ceaselessly told.
3-6% fraud isn’t bad by SWP’s hometown of Chicago standards.
These figures are making my head spin. In general, I think the reports from “Golos” and the so called “independent” observers are only good for wiping my behind with
Tony, AMEN!