I enjoyed Brian Wansink’s book Mindless Eating. It was well written and filled with creative experiments like the ever-filling soup bowl. In the ten years since, Wansink became not just a media star but an academic star with an h-index of 75 and over 24 thousand citations. In recent years, however, he has had to retract papers in the light of inconsistencies and questions about his data and statistics.

A Buzzfeed article, based in part on emails, now reveals that Wansink was running a brazen p-hacking factory:

The correspondence shows, for example, how Wansink coached Siğirci to knead the pizza data.

First, he wrote, she should break up the diners into all kinds of groups: “males, females, lunch goers, dinner goers, people sitting alone, people eating with groups of 2, people eating in groups of 2+, people who order alcohol, people who order soft drinks, people who sit close to buffet, people who sit far away, and so on…”

Then she should dig for statistical relationships between those groups and the rest of the data: “# pieces of pizza, # trips, fill level of plate, did they get dessert, did they order a drink, and so on…”

…“Work hard, squeeze some blood out of this rock, and we’ll see you soon.”…All four of the pizza papers were eventually retracted or corrected.

In essence, Wansink all but published a study finding green jelly beans cause acne. All hail XKCD.
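
To see why slicing the data this way is almost guaranteed to turn up something, here is a minimal simulation sketch. It is not Wansink's analysis or data: every outcome is pure noise, each of the 11 groupings and 5 outcomes named in the email is treated as an independent split or variable (a simplifying assumption), and all function names are hypothetical.

```python
import random
from statistics import NormalDist

STD_NORMAL = NormalDist()

def mean_var(xs):
    """Sample mean and unbiased sample variance."""
    m = sum(xs) / len(xs)
    v = sum((x - m) ** 2 for x in xs) / (len(xs) - 1)
    return m, v

def two_sample_p(a, b):
    """Two-sided p-value for a difference in means (normal approximation)."""
    ma, va = mean_var(a)
    mb, vb = mean_var(b)
    z = (ma - mb) / (va / len(a) + vb / len(b)) ** 0.5
    return 2 * (1 - STD_NORMAL.cdf(abs(z)))

def any_significant(n_diners, n_groupings, n_outcomes, alpha, rng):
    """Test every grouping against every outcome on noise; True if any p < alpha."""
    # Assumption: each grouping is an independent coin flip per diner.
    groupings = [[rng.random() < 0.5 for _ in range(n_diners)]
                 for _ in range(n_groupings)]
    # Assumption: each outcome is standard normal noise, unrelated to any grouping.
    outcomes = [[rng.gauss(0, 1) for _ in range(n_diners)]
                for _ in range(n_outcomes)]
    for g in groupings:
        for y in outcomes:
            inside = [v for v, flag in zip(y, g) if flag]
            outside = [v for v, flag in zip(y, g) if not flag]
            if two_sample_p(inside, outside) < alpha:
                return True
    return False

def false_positive_rate(n_sims=200, n_diners=100, n_groupings=11,
                        n_outcomes=5, alpha=0.05, seed=0):
    """Share of simulated noise-only studies with at least one 'finding'."""
    rng = random.Random(seed)
    hits = sum(any_significant(n_diners, n_groupings, n_outcomes, alpha, rng)
               for _ in range(n_sims))
    return hits / n_sims

if __name__ == "__main__":
    print("one planned test:  ", false_positive_rate(n_groupings=1, n_outcomes=1))
    print("11 x 5 fished tests:", false_positive_rate())
```

With a single planned test the false-positive share should sit near the nominal 5%. With 55 tests on the same noise, most simulated studies yield at least one "significant" result.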

1 Maz February 26, 2018 at 7:45 am

What’s amazing is that the whole thing started to unravel when Wansink himself published a blog post where he candidly described how he made his underlings dredge datasets for “interesting findings.” He seemed to be genuinely statistically illiterate and didn’t seem to know that sampling error exists or that there was anything really wrong with what he was doing.


2 Ray Lopez February 26, 2018 at 8:04 am

Never heard of this guy but another possibility is that he, like Lance Armstrong, got sick of the whole charade and decided to confess.


3 Maz February 26, 2018 at 8:10 am

No, he’s been fighting back all the time.


5 TheRiver February 26, 2018 at 9:53 am

I am shocked that anyone would use science and statistics to prove something that was not true but satisfied their own bias. Shocked, I tell you!


6 dude February 26, 2018 at 8:53 am

+1. This really is how it reads.

I imagine this is something close to standard operating procedure for a lot of the soft sciences.


7 dearieme February 26, 2018 at 9:51 am

If only it were restricted to the soft sciences.


14 Mark Thorson February 26, 2018 at 12:28 pm

A Google search on “wansink” and site or domain “dailymail.co.uk” gets 98 hits. Perhaps that should be used as an inverse indicator of research credibility.


15 Asher February 26, 2018 at 8:26 am

Andrew Gelman’s Statistical Modeling blog (MR links to it) has been on to this guy for years.



16 Erik Larsen February 26, 2018 at 9:40 am

No, the first post on Wansink is from December 2016 in relation to the above-mentioned blog post: http://andrewgelman.com/2016/12/15/hark-hark-p-value-heavens-gate-sings/

Tim van der Zee and co-authors are doing the heavy lifting here: http://www.timvanderzee.com/the-wansink-dossier-an-overview/


17 Ironman February 26, 2018 at 8:35 am

Legal advice for economists considering p-hacking from one of the best contributions to the Examples of Junk Science series: Antitrustworthy Analysis: “Don’t p-hack. It wastes time, it will be vulnerable to cross-examination and it undermines the legitimacy of the valuable contributions that economics can make to a case.”

Here’s the link to the entire series, where material covered by the “Ignoring Inconsistencies” contribution would also be very relevant to the emerging story of Wansink’s questionable analytical methods. These aren’t things that happened 10 years ago in the world of food and nutrition research – it is happening in economics and other sciences today.


18 Mister C February 26, 2018 at 9:51 am

Thanks! Good link. That comic in the first link (“try to grab the 84…”) is exactly true. Way too common of a methodology, but I totally understand why it happens.


19 Mark Thorson February 26, 2018 at 12:31 pm

The common term is “we tortured the data until it confessed”.


20 Per Kurowski February 26, 2018 at 8:43 am

And there are also some cases of mindless lack of research. For instance, bank regulators, when determining their risk weighted capital requirements for banks, never researched what assets were dangerous to the bank system, they just used how risky the assets were in general.


21 Your husband's cane February 26, 2018 at 8:43 am

How mathematically literate are grad students in the social and behavioral sciences? This is pure conjecture on my part, but my suspicion is that they’ve had one class on statistics, probably at the undergrad level, with no very high degree of comprehension required to pass; and after that, they’ve plugged their numbers into the computer and accepted whatever p-value it puts out, with no notion of which programs and which parameters should be used under which circumstances.

It might be interesting to assemble a roomful of sociology researchers, take away their devices, and give them a test on basic undergrad probability. What fraction of them could come up with the correct answer to, say, a basic Bayes’ Theorem problem?


22 john February 26, 2018 at 9:53 am

ScienceNews.org had an article (maybe even a couple) a few years back suggesting that it’s a lot more than just the social scientists that have problems with statistics. I think this is it but don’t have the subscription to check now: https://www.sciencenews.org/article/odds-are-its-wrong

In addition to the problem of journals pushing immediate results that look good and not caring much about failure to replicate, or about doing the types of studies needed to actually apply the statistical analysis (meaning more than just one data set), it pointed out that many have just gotten the stat packages and don’t know what the tool is actually doing. Hence misapplying concepts. Apparently too many researchers think a confidence interval of 95% means their results are true with a 95% level of confidence.


23 byomtov February 26, 2018 at 10:42 am

I don’t think mathematical literacy has a lot to do with it.

This kind of thing shouldn’t pass an honest person’s smell test, at least if they have had even one statistics class. And, by the way, that first statistics class ought to cover the point. Maybe there is too much emphasis on calculations and not enough on the underlying logic of what is going on. Come to think of it, there’s no “maybe” to it.


24 mkt42 February 26, 2018 at 2:26 pm

“ScienceNews.org had an article (maybe even a couple) a few years back suggesting that it’s a lot more than just the social scientists that have problems with statistics.”

Well yes, but we don’t need a citation to notice that the same p-hacking problem is common in medical research, health and nutrition research, and to a certain extent epidemiology.

All of these fields including the social sciences have a variety of methods to try to deal with the problem, but usually the best methods are complex and themselves prone to mis-use, and the researcher has to be honest enough to use them rather than pretending that the results were obtained without resorting to p-hacking, data-fishing, etc.


25 Alan Crowe February 26, 2018 at 3:16 pm

Look at this article: https://www.theguardian.com/science/occams-corner/2013/sep/19/science-religion-not-be-questioned

The author is Henry Gee, who is a senior editor of Nature, Britain’s leading science journal. You would expect him to get p-values right; it is a core part of his job, and he holds an important job, gatekeeping science. But no, he makes the basic error:

> If this all sounds rather rarefied, consider science at its most practical. As discussed in Dr McLain’s article and the comments subjacent, scientific experiments don’t end with a holy grail so much as an estimate of probability. For example, one might be able to accord a value to one’s conclusion not of “yes” or “no” but “P<0.05", which means that the result has a less than one in 20 chance of being a fluke. That doesn't mean it's "right"

That gets things the wrong way round. Next Henry Gee will be telling us that five twelfths is equal to 2.4 because division is commutative 🙂

Are grad students more mathematically literate than the senior figures who gate keep their careers? Since this is an economic blog we should think about incentives. Will mathematical literacy help or hinder them getting published in Nature?
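
The inversion complained about above can be made concrete with Bayes' rule. P<0.05 bounds the chance of seeing such a result when there is no real effect; the chance that a significant result is a fluke is a different quantity that also depends on how often tested hypotheses are true and on the power of the test. A minimal sketch follows, in which the function name and the numbers plugged in are purely illustrative assumptions, not figures from the article:

```python
def prob_fluke_given_significant(prior_true, power, alpha=0.05):
    """P(no real effect | p < alpha), computed with Bayes' rule.

    prior_true: assumed share of tested hypotheses that are actually true
    power:      assumed P(p < alpha | effect is real)
    alpha:      significance threshold, i.e. P(p < alpha | no effect)
    """
    true_positives = prior_true * power
    false_positives = (1 - prior_true) * alpha
    return false_positives / (true_positives + false_positives)

if __name__ == "__main__":
    # Assumed scenario: 1 in 10 hypotheses is true, tests have 50% power.
    print(round(prob_fluke_given_significant(prior_true=0.1, power=0.5), 3))  # 0.474
```

Under those assumed inputs, nearly half of the significant results are flukes, even though every one of them cleared P<0.05.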


26 Charbes A. February 26, 2018 at 8:45 am

It is sad how Mr. Wansink is being persecuted by minions of a dead orthodoxy. Meanwhile, America faces an unprecedented obesity crisis.


27 JFA February 26, 2018 at 9:16 am

Hahah. Is that you, Wansink?


28 Charbes A. February 26, 2018 at 10:37 am

No, I neither am Mr. Wansink nor have any relationship with his seminal research. But it is sad how Big Press’ scandalmongering rags are striving to destroy a straightaway researcher.


29 JFA February 26, 2018 at 11:06 am

“Straightaway researcher”. Hahaha. Whatever Wansink. Only the accused or someone equally inept at understanding science could/would make such obtuse statements.


30 Charbes A. February 26, 2018 at 12:48 pm

He is being accused of what? Having found variables that are related, a.k.a. doing his job. Meanwhile, Big Fast Food destroys and shortens American lives.

31 JFA February 26, 2018 at 1:16 pm

Finding variables that are related, making data up. 6 one way, 1/2 dozen the other. But whatever it takes to blame “Big Fast Food” for personal choices, right?

32 Charbes A. February 26, 2018 at 1:37 pm

I mean, drug dealers are just entrepreneurs…

33 JFA February 26, 2018 at 2:44 pm

1) Yes, drug dealers are entrepreneurs. 2) You decided that you couldn’t defend the whole “making up data thing”, so you changed the subject. I’m sure it was the personal choice part of my comment that you disagree with. But having grown up in the South (not known for its healthy eating habits (independent of the actions of those evil corporations)) and on fast food, I am somehow able to both avoid consuming excessive amounts of “bad” food and to maintain a simple exercise regimen to keep myself relatively fit and healthy. Much like someone who is addicted to heroin could have… you know… not taken heroin to begin with.

34 Charbes A. February 26, 2018 at 5:42 pm

You did not read the word “just”. Or pretended you did not read. You are shilling for Big Fast Food. Again, all the emails show is that Mr. Wansink broke some formal protocols. He committed no wrongdoing whatsoever.

35 JFA February 26, 2018 at 7:38 pm

I guess presenting data from 3 to 5 year olds as if they were generated from 8 to 11 year olds and presenting estimates with p values that could not have been generated by the data are now just frowned upon rather than being signs of fraud… but whatever. Also, no one is “just” one thing. I’m sure some drug dealers are guitar players. What else were you suggesting when saying they were “just” entrepreneurs?

36 JFA February 27, 2018 at 11:32 am


37 Mulp February 26, 2018 at 8:53 am

Collecting data costs a lot of money.

p-hacking is simply increasing productivity.

That makes it better, according to economists.


38 rayward February 26, 2018 at 9:10 am

How many times has one heard or read a statement by a social scientist that “I go where the data take me”. Of course, the intent is to confirm the absence of bias because the researcher doesn’t go looking for data that confirms a result already predicted. Ironically, “hypothesis first” would suggest bias to most people, not the best practice for conducting research. I’m an advocate so I go looking for authority that confirms a legal result (the “hypothesis”) that is best for my client. The difference between unbiased research and advocacy social science may be clear to Tabarrok and Cowen, but not to me. The new book about Peter Thiel and Gawker and the ongoing dispute between Thiel and Gawker (there’s a possible claim against Thiel for tortious interference) have revealed some interesting details, including the effort to select overweight women for the jury in the lawsuit by Hulk Hogan against Gawker that Thiel funded after several mock trials indicated that overweight women were more likely to punish a web site like Gawker that discloses personal and salacious material about people – overweight women feel that they are the subject of such personal and salacious disclosures. Think about that: Thiel won his lawsuit in large part because of overweight women, women who, fortunately for Thiel, didn’t follow the advice of Mr. Wansink to avoid binge pizza eating. https://www.thedailybeast.com/peter-thiel-got-his-revenge-on-gawker-he-may-yet-regret-it


39 Hadur February 26, 2018 at 9:25 am

Can we prove that Tyler really HAS visited all the gas station taquerias of Fairfax County?


48 Pollster February 26, 2018 at 11:31 am

What’s described here is literally the best practice in polling and survey-based market research, in general.


49 stasi February 26, 2018 at 12:40 pm

I remember in my University department of Finance in the early 1990s the academics used to snidely call p-hacking “data mining”.


50 mkt42 February 26, 2018 at 2:40 pm

“Data mining” was a really good phrase, but around maybe 15 or 20 years ago computer science types (I don’t think the term “data scientist” had become widespread yet) started popularizing semi-legitimate techniques that they called data mining. So now I call it “data fishing” instead.

And in the last few years, the phrase “data mining” has been eclipsed by “machine learning”. There’s some good work being done, but there’s a limit to how far you can go with pure empiricism and no underlying theory. Having tens of millions of observations and tens of thousands of variables doesn’t help if the variables lack explanatory power, the data cover only a few years of observations, and the researcher doesn’t know about endogeneity. (The many observations do however enable the researcher to do cross-validation, a clear improvement over within-sample standard error estimation.)
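
The point about out-of-sample checks can be sketched with a single holdout split, the simplest relative of cross-validation. This is a toy setup with assumed sizes and hypothetical function names: all variables are noise, the "best" one is picked by fishing in the first half of the data, and it is then re-scored on the untouched second half.

```python
import random

def corr(xs, ys):
    """Pearson correlation coefficient of two equal-length sequences."""
    n = len(xs)
    mx, my = sum(xs) / n, sum(ys) / n
    sxy = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    sxx = sum((x - mx) ** 2 for x in xs)
    syy = sum((y - my) ** 2 for y in ys)
    return sxy / (sxx * syy) ** 0.5

def fish_and_validate(n_obs=60, n_vars=500, seed=0):
    """Pick the noise variable that best 'explains' y in-sample, then retest it.

    Returns (in_sample_corr, holdout_corr) for the selected variable.
    """
    rng = random.Random(seed)
    # Assumption: the target and every candidate variable are independent noise.
    y = [rng.gauss(0, 1) for _ in range(n_obs)]
    candidates = [[rng.gauss(0, 1) for _ in range(n_obs)] for _ in range(n_vars)]
    half = n_obs // 2
    # Data fishing: keep whichever variable correlates most strongly on the first half.
    best = max(candidates, key=lambda x: abs(corr(x[:half], y[:half])))
    return corr(best[:half], y[:half]), corr(best[half:], y[half:])

if __name__ == "__main__":
    in_sample, holdout = fish_and_validate()
    print(f"in-sample r = {in_sample:+.2f}, holdout r = {holdout:+.2f}")
```

The fished variable typically looks impressive on the half it was selected on and unremarkable on the half it never saw, which is the gap that within-sample standard errors cannot reveal.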

The Google Flu Trends saga is perhaps the perfect example. Google thought it had invented a superior technique for early detection of flu epidemics. Turns out they hadn’t, they just got lucky one year. Which doesn’t mean that their work should be discarded; additional tools are always welcome. But machine learning isn’t going to solve the conundrums and complexities of social science research.


51 rayward February 26, 2018 at 11:46 am

Here’s an article in the NYT about a study conducted at the Mind and Body Lab that “exercise beliefs” affect both waistlines and life span: https://www.nytimes.com/2018/02/22/well/move/how-our-beliefs-can-shape-our-waistlines.html


52 Stephen February 26, 2018 at 12:58 pm

“How mathematically literate are grad students in the social and behavioral sciences? This is pure conjecture on my part, but my suspicion is that they’ve had one class on statistics, probably at the undergrad level, with no very high degree of comprehension required to pass; and after that, they’ve plugged their numbers into the computer and accepted whatever p-value it puts out, with no notion of which programs and which parameters should be used under which circumstances.”

“Pure conjecture on my part” X “my suspicion” X “probably” – from someone who wishes to denigrate the mathematical literacy of others.


53 Andy S. February 26, 2018 at 1:24 pm

Have you ever noticed that books that have Ph.D. after the author’s name on the cover are all dubious? Is it just me or does everyone know this?


54 mike davis February 26, 2018 at 1:39 pm

I wouldn’t have a problem if all this guy did was send his minions off to gather lots of data and then look through it to see if they could find some interesting relationship. The problem is that he claimed that what they found is true.

I haven’t read his stuff but if I understand correctly, he would say something like: “we found that if x happens (all you can eat pizza), then people do Y (eat too much pizza). So if you don’t want Y, don’t allow X.”

What he should have said is “Hmm…we found X and Y are related. That’s interesting. Let’s see if we can construct a logical theory to explain why that might be true. If we get that far, then we’ll go do lots of other tests to see if we can falsify that hypothesis.”


55 IVV February 26, 2018 at 4:57 pm

What is it with Buzzfeed getting the scoops lately? What are they doing well, suddenly?


56 Ted February 26, 2018 at 7:55 pm

@Alex – here’s an idea for a follow up post that I’d love to see you write: Exploring the idea of whether we lived in a world where green jelly beans *do* cause acne. How would we find out? How would we know?

(In my mind, the sin was not searching for non-preregistered relationships, the sin was using naive p-values to report confidence in those relationships. Certainly the relationships are worth exploring, and certainly we might find scientific truth where we did not know to look for it. This is a distinction worth clarifying, imo.)
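
One standard alternative to naive p-values when many relationships are explored is the Bonferroni correction, which divides the significance threshold by the number of tests run. The sketch below is only an illustration of that one technique (not something proposed in the comment above), with hypothetical function names and made-up p-values:

```python
def family_wise_error_rate(n_tests, alpha=0.05):
    """Chance of at least one false positive across independent tests of true nulls."""
    return 1 - (1 - alpha) ** n_tests

def bonferroni_significant(p_values, alpha=0.05):
    """Flag each p-value against the Bonferroni threshold alpha / number of tests."""
    threshold = alpha / len(p_values)
    return [p < threshold for p in p_values]

if __name__ == "__main__":
    # With 20 independent tests, a naive 0.05 cutoff misfires most of the time.
    print(round(family_wise_error_rate(20), 3))           # 0.642
    # Made-up p-values: only the second survives the corrected threshold.
    print(bonferroni_significant([0.04, 0.001, 0.2]))     # [False, True, False]
```

Bonferroni is conservative, so a relationship that fails it may still be worth a follow-up study. What it rules out is reporting the exploratory result with the confidence of a single planned test.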


