Daily Interesting News: blogger

Visualizzazione post con etichetta blogger. Mostra tutti i post

venerdì 10 marzo 2017

Women in Power

In 1915 Charlotte Perkins Gilman published a funny but unsettling story called Herland. As the title hints, it’s a fantasy about a nation of women – and women only – that has existed for two thousand years in some remote, still unexplored part of the globe. A magnificent utopia: clean and tidy, collaborative, peaceful (even the cats have stopped killing the birds), brilliantly organised in everything from its sustainable agriculture and delicious food to its social services and education. And it all depends on one miraculous innovation. At the very beginning of its history, the founding mothers had somehow perfected the technique of parthenogenesis. The practical details are a bit unclear, but the women somehow just gave birth to baby girls, with no intervention from men at all. There was no sex in Herland.

The story is all about the disruption of this world when three American males discover it: Vandyck Jennings, the nice-guy narrator; Jeff Margrave, a man whose gallantry is almost the undoing of him in the face of all these ladies; and the truly appalling Terry Nicholson. When they first arrive, Terry refuses to believe that there aren’t some men around somewhere, pulling the strings – because how, after all, could you imagine women running anything? When eventually he has to accept that this is exactly what they are doing, he decides that what Herland needs is a bit of sex and a bit of male mastery. The story ends with Terry unceremoniously deported after one of his bids for mastery, in the bedroom, goes horribly wrong.

There are all kinds of irony to this tale. One joke that Perkins Gilman plays throughout is that the women simply don’t recognise their own achievements. They have independently created an exemplary state, one to be proud of, but when confronted by their three uninvited male visitors, who lie somewhere on the spectrum between spineless and scumbag, they tend to defer to the men’s competence, knowledge and expertise; and they are slightly in awe of the male world outside. Although they have made a utopia, they think they have messed it all up.

As well as describing imaginary communities of women doing things their way, Herland raises bigger questions, from knowing how to recognise female power to the sometimes funny, sometimes frightening stories we tell ourselves about it, and indeed have told ourselves about it, in the West at least, for thousands of years.

I’ve talked before about the ways women get silenced in public discourse. And there’s plenty of that silencing still going on. We need only think of Elizabeth Warren being prevented a few weeks ago from reading out a letter by Coretta Scott King in the US Senate.1 What was extraordinary on that occasion wasn’t only that she was silenced and formally excluded from the debate (I don’t know enough about the rules of procedure in the Senate to have a sense of how justified, or not, that was); but that several men over the next couple of days did read the letter out and were neither excluded nor shut up. True, they were trying to support Warren. But the rules of speech that applied to her didn’t appear to apply to Bernie Sanders, or the three other male senators who did the same.

The right to be heard is crucially important. But I want to think more generally about how we have learned to look at women who exercise power, or try to; I want to explore the cultural underpinnings of misogyny in politics or the workplace, and its forms (what kind of misogyny, aimed at what or whom, using what words or images, and with what effects); and I want to think harder about how and why the conventional definitions of ‘power’ (or for that matter of ‘knowledge’, ‘expertise’ and ‘authority’) that we carry round in our heads have tended to exclude women.

Hillary Clinton portrayed as Medusa, with Trump as Perseus.

It is happily the case that in 2017 there are more women in what we would all probably agree are ‘powerful’ positions than there were ten, let alone fifty years ago. Whether that is as politicians, police commissioners, CEOs, judges or whatever, it’s still a clear minority – but there are more. (If you want some figures, around 4 per cent of UK MPs were women in the 1970s; around 30 per cent are now.) But my basic premise is that our mental, cultural template for a powerful person remains resolutely male. If we close our eyes and try to conjure up the image of a president or (to move into the knowledge economy) a professor, what most of us see isn’t a woman. And that’s just as true even if you are a woman professor: the cultural stereotype is so strong that, at the level of those close-your-eyes fantasies, it is still hard for me to imagine me, or someone like me, in my role. I put the phrase ‘cartoon professor’ into Google Images – ‘cartoon professor’ to make sure that I was targeting the imaginary ones, the cultural template, not the real ones. Out of the first hundred that came up, only one, Professor Holly from Pokémon Farm, was female.2

To put this the other way round, we have no template for what a powerful woman looks like, except that she looks rather like a man. The regulation trouser suits, or at least the trousers, worn by so many Western female political leaders, from Merkel to Clinton, may be convenient and practical; they may be a signal of the refusal to become a clothes horse, which is the fate of so many political wives; but they’re also a simple tactic – like lowering the timbre of the voice – to make the female appear more male, to fit the part of power.3 Elizabeth I knew exactly what the game was when she said she had ‘the heart and stomach of a king’. It’s that idea of the divorce between women and power that makes Melissa McCarthy’s parodies of the White House press secretary Sean Spicer on Saturday Night Live so effective. It’s said that these have annoyed President Trump more than most satires on his regime, because (according to one of the ‘sources close to him’), ‘he doesn’t like his people to appear weak.’ Decode that, and what it actually means is that he doesn’t like his men to be parodied by and as women. Weakness comes with a female gender.4

What follows from this is that women are still perceived as belonging outside power. Whether we sincerely want them to get to the inside of it or whether, by various often unconscious means, we cast women as interlopers when they make it (I still remember a Cambridge where, in most colleges, the women’s loos were tucked away across two courts, through the passage and down the stairs in the basement), the shared metaphors we use of female access to power – knocking on the door, storming the citadel, smashing the glass ceiling, or just giving them a leg up – underline female exteriority. Women in power are seen as breaking down barriers, or alternatively as taking something to which they are not quite entitled. A headline in the Times in early January captured this wonderfully. Above an article reporting on the possibility that women might soon gain the positions of Metropolitan Police commissioner, chair of the BBC Trust, and bishop of London, it read: ‘Women Prepare for a Power Grab in Church, Police and BBC.’ (Cressida Dick, the new commissioner of the Met, is the only one of these predictions yet to have come true.) Now, I realise that headline writers are paid to grab attention. But the idea that even under those circumstances you could present the prospect of a woman becoming bishop of London as a ‘power grab’ (and that probably thousands upon thousands of readers didn’t bat an eyelid when they read it) is a sure sign that we need to look a lot more carefully at our cultural assumptions about women’s relationship with power. Workplace nurseries, family-friendly hours, mentoring schemes and all those practical things are importantly enabling, but they are only part of what we need to be doing. If we want to give women as a gender – and not just in the shape of a few determined individuals – their place on the inside of the structures of power, we have to think harder about how and why we think as we do. If there is a cultural template, which works to disempower women, what exactly is it and where do we get it from?

At this point, it may be useful to start thinking about the classical world. More often than we may realise, and in sometimes quite shocking ways, we are still using Greek idioms to represent the idea of women in, and out of, power. There is at first sight a rather impressive array of powerful female characters in the repertoire of Greek myth and storytelling. In real life, ancient women had no formal political rights, and precious little economic or social independence; in some cities, such as Athens, respectable married women were almost never seen outside the home. But Athenian drama in particular, and the Greek imagination more generally, has offered our imaginations a series of unforgettable women: Medea, Clytemnestra, Antigone.

They are not, however, role models – far from it. For the most part, they are portrayed as abusers rather than users of power. They take it illegitimately, in a way that leads to chaos, to the fracture of the state, to death and destruction. They are monstrous hybrids, who aren’t – in the Greek sense – women at all. And the unflinching logic of their stories is that they must be disempowered, put back in their place. In fact, it is the unquestionable mess that women make of power in Greek myth that justifies their exclusion from it in real life, and justifies the rule of men. (I can’t help thinking that Perkins Gilman was lightly parodying this logic when she made the women of Herland believe that they had messed up.)

Go back to one of the very earliest Greek dramas to survive, the Agamemnon of Aeschylus, first performed in 458 BC, and you’ll find that its antiheroine, Clytemnestra, horribly encapsulates that ideology. In the play, she becomes the effective ruler of her city while her husband is away fighting the Trojan War; and in the process she ceases to be a woman. Aeschylus repeatedly uses male terms and the language of masculinity to refer to her. In the very first lines, for example, her character is described as androboulon – a hard word to translate neatly but something like ‘with manly purpose’, or ‘thinking like a man’. And, of course, the power that Clytemnestra illegitimately claims is put to destructive purpose when she murders Agamemnon in his bath on his return. The patriarchal order is only restored when Clytemnestra’s children conspire to kill her.5

There’s a similar logic in the stories of that mythical race of Amazon women, said by Greek writers to exist somewhere on the northern borders of their world.6 A more violent and more militaristic lot than the peaceful denizens of Herland, this monstrous regiment always threatened to overrun the civilised world of Greece and Greek men. An enormous amount of modern feminist energy has been wasted on trying to prove that these Amazons did once exist, with all the seductive possibilities of a historical society that really was ruled by and for women. Dream on. The hard truth is that the Amazons were a Greek male myth. The basic message was that the only good Amazon was a dead one, or – to go to back to awful Terry – one that had been mastered, in the bedroom. The underlying point was that it was the duty of men to save civilisation from the rule of women.7

There are, it is true, occasional examples where it might look as if we are getting a more positive version of ancient female power. One staple of the modern stage is Aristophanes’ comedy known by the name of its lead female character, Lysistrata. Written in the fifth century BC, it appears to be a perfect mixture of highbrow classics, feisty feminism, a stop-the-war agenda and a good sprinkling of smut (and it was once translated by Germaine Greer). It’s the story of a sex-strike, set not in the world of myth but in the contemporary world of ancient Athens. Under Lysistrata’s leadership, the women try to force their husbands to end the long-running war with Sparta by refusing to sleep with them until they do. The men go round for most of the play with enormously inconvenient erections (which now tends to cause some difficulty and hilarity in the costume department). Eventually, unable to bear their encumbrances any longer, they give in to the women’s demands and make peace. Girl power at its finest, you might think.8

Athena, the patron deity of the city, is often brought in on the positive side too. Doesn’t the simple fact that she was female suggest a more nuanced version of the imagined sphere of women’s influence? I’m afraid not.

If you scratch the surface and go back to the fifth-century context, Lysistrata looks very different. It’s not just that the original audience and actors consisted, according to Athenian convention, entirely of men – the female characters were probably played much like pantomime dames. It’s also the fact that, in the end, the fantasy of women’s power is firmly stamped down. In the final scene, the peace process consists of bringing a naked woman onto the stage (or a man somehow dressed up as a naked woman), who is used as if she were a map of Greece, and is metaphorically carved up in an uncomfortably pornographic way between the men of Athens and Sparta. Not much proto-feminism there.

As for Athena, it’s true that in those binary charts of gods and goddesses that we make for ourselves, she appears on the female side. But the crucial thing about her in the ancient context is that she is another of those difficult hybrids. In the Greek sense she’s not a woman at all. For a start she’s dressed as a warrior, when fighting was exclusively male work (that’s an underlying problem with the Amazons too). Then she’s a virgin, when the raison d’être of the female sex was breeding new citizens. And she wasn’t even born of a mother but directly from the head of her father, Zeus. It was almost as if Athena, woman or not, offered a glimpse of an ideal male world in which women could not just be kept in their place but dispensed with entirely.9

The point is simple but important: if we go back to the beginnings of Western history we find a radical separation – real, cultural and imaginary – between women and power. But one item of Athena’s costume brings this right up to our own day. On most images of the goddess, at the very centre of her body armour, fixed onto her breastplate, is the image of a female head, with writhing snakes for hair. This is the head of Medusa, one of three mythical sisters known as the Gorgons, and it was one of the most potent ancient symbols of male mastery of the dangers that the very possibility of female power represented. It’s no accident that we find her decapitated, her head proudly paraded as an accessory by this decidedly un-female female deity.

There are many ancient variations on Medusa’s story. One famous version has her as a beautiful woman raped by Poseidon in a temple of Athena, who promptly transformed her, as punishment for the sacrilege, into a monstrous creature with a deadly capacity to turn to stone anyone who looked at her face. It later became the task of the hero Perseus to kill this woman, and he cut her head off using his shiny shield as a mirror so as to avoid having to look directly at her. At first he used the head as a weapon since – even in death – it retained the capacity to petrify; but he then presented it to Athena, who displayed it on her own armour (one message being: take care not to look too directly at the goddess).10

It doesn’t need Freud to see those snaky locks as an implied claim to phallic power. This is the classic myth in which the dominance of the male is violently reasserted against the illegitimate power of the woman. And Western literature, culture and art have repeatedly returned to it in those terms. The bleeding head of Medusa is a familiar sight among our own modern masterpieces, often loaded with questions about the power of the artist to represent an object at which no one should look. In 1598 Caravaggio did an extraordinary version of the decapitated head with his own features, so it is said, screaming in horror, blood pouring out, the snakes still writhing. A few decades earlier Cellini made a large bronze statue of Perseus which still stands in the Piazza della Signoria in Florence: the hero is depicted trampling on the mangled corpse of Medusa, and holding her head up in the air, again with the blood and the gunge pouring out of it.11

What’s extraordinary is that this beheading remains even now a cultural symbol of opposition to women’s power. Angela Merkel’s features have again and again been superimposed on Caravaggio’s image.12 In one of the more silly outbursts in this vein, a column in the magazine of the Police Federation called Theresa May the ‘Medusa of Maidenhead’ during her time as home secretary. ‘The Medusa comparison might be a bit strong,’ the Daily Express responded: ‘We all know that Mrs May has beautifully coiffed hair.’ But May got off lightly compared with Dilma Rousseff, who had to open a major Caravaggio show in São Paolo. The Medusa was naturally in it, and Rousseff standing in front of the very painting proved an irresistible photo opportunity.

But it’s with Hillary Clinton that we see the Medusa theme at its starkest and nastiest. Predictably Trump’s supporters produced a great number of images showing her with snaky locks. But the most horribly memorable of them adapted Cellini’s bronze, a much better fit than the Caravaggio because it wasn’t just a head: it also included the heroic male adversary and killer. All you needed to do was superimpose Trump’s face on that of Perseus, and give Clinton’s features to the severed head (in the interests of taste, I guess, the mangled body on which Perseus tramples in the original was omitted). It’s true that if you crawl around some of the darker recesses of the web, you can find some very unpleasant images of Obama, but they’re very dark recesses. This scene of Perseus-Trump brandishing the dripping, oozing head of Medusa-Clinton was very much part of the everyday, domestic American decorative world: you could buy it on T-shirts and tank tops, on coffee mugs, on laptop sleeves and tote bags (sometimes with the logo TRIUMPH, sometimes TRUMP). It may take a moment or two to take in that normalisation of gendered violence, but if you were ever doubtful about the extent to which the exclusion of women from power is culturally embedded or unsure of the continued strength of classical ways of formulating and justifying it – well, I give you Trump and Clinton, Perseus and Medusa, and rest my case.

Hillary Clinton portrayed as Medusa, with Trump as Perseus.
Hillary Clinton portrayed as Medusa, with Trump as Perseus.
*

It isn’t enough to rest the case there without saying what we might actually do. What would it take to resituate women on the inside of power? Here I think we have to distinguish between an individual perspective and a more communal, general one. If we look at some of the women who have ‘made it’, we can see that the tactics and strategies behind their success don’t merely come down to aping male idioms. One thing that many of these women share is a capacity to turn the symbols that usually disempower women to their own advantage. Margaret Thatcher seems to have done that with her handbags, so that eventually the most stereotypically female accessory became a verb of political power: as in ‘to handbag’.13 And I suppose that at an incomparably more junior level I did something similar when I went for my first interview for an academic job, in Thatcher’s heyday as it happens. I bought a pair of blue tights specially for the occasion. It wasn’t my usual fashion choice, but the logic was satisfying: ‘If you interviewers are going to be thinking that I’m a right bluestocking, let me just show you that I know that’s what you’re thinking and that I got there first.’

As for Theresa May, it may be a bit too early to say, and there is a possibility that we will end up looking back to her as a woman who was put into power in order to fail. (I’m trying very hard here not to compare her to Clytemnestra.) But I do sense that her ‘shoe thing’ and those kitten heels are one of the ways she shows that she is refusing to be packaged into the male template. She is also rather good, as Thatcher was, at exploiting the weak spots in the armoury of traditional Tory male power. The fact that she isn’t part of the clubbable boys’ world, that she isn’t ‘one of the lads’, has helped her carve out independent territory for herself. She has gained power and freedom out of the exclusion. She is also allergic to ‘mansplaining’.

Many women could, I’m sure, share perspectives and tricks like this. But the big issues I’ve been trying to confront aren’t solved by tips on how to exploit the status quo. And I don’t think patience is likely to be the answer either, though gradual change very likely will take place. In fact, given that women in this country have only had the vote for a hundred years, we shouldn’t forget to congratulate ourselves for the revolution that we have all, women and men, brought about. That said, if the deep cultural structures legitimating women’s exclusion are as I have argued, gradualism is likely to take too long for me, thank you very much. We have to be more reflective about what power is, what it is for, and how it is measured. To put it another way, if women aren’t perceived to be fully within the structures of power, isn’t it power that we need to redefine rather than women?

So far the women’s power I’ve been referring to is the usual kind we imagine, possessed by national politicians, CEOs, prominent journalists, television executives and so on. This gives a very narrow version of what power is, largely correlating it with public prestige (or in some cases public notoriety). It’s very ‘high end’ in a very traditional sense, and bound up with the ‘glass ceiling’ image of power, which not only effectively positions women on the outside of power, but also imagines the female pioneer as the already successful superwoman with just a few last vestiges of male prejudice keeping her from the top. I don’t think this model speaks to most women, who even if they aren’t aiming to be president of the US or a company boss, still rightly feel that they want a stake in power. And it certainly didn’t appeal last year to sufficient numbers of American voters.

Even if we do restrict our sights to national politics the question of the way we judge women’s success within it is tricky. There are plenty of league tables charting the proportion of women within national legislatures. At the very top comes Rwanda, where more than 60 per cent of the members of the legislature are women, while the UK is almost fifty places down, at roughly 30 per cent. Strikingly, the Saudi Arabian National Council has a higher proportion of women than the US Congress. It’s hard not to lament some of these figures and applaud others, and a lot has rightly been made of the role of women in post-civil war Rwanda. But I do wonder if, in some places, the presence of large numbers of women in the national legislature means that that is where the power is not.

I’m also not sure that we’re being quite straight with ourselves about what we want women in parliaments for. A number of studies point to the role of women politicians in promoting legislation in women’s interests (in childcare, for example, equal pay and domestic violence). A report from the Fawcett Society recently suggested a link between the 50/50 balance between women and men in the Welsh Assembly and the number of times ‘women’s issues’ were raised there. I’m certainly not going to complain about childcare and the rest getting a fair airing, but I’m not sure it’s a good idea that such things continue to be perceived as ‘women’s issues’, and – for me at least – they aren’t the main reasons we want more women in parliaments. Those reasons are much more basic: it is flagrantly unjust to keep women out, by whatever unconscious means we do so; and we simply can’t afford to do without women’s expertise, whether it is in technology, the economy or social care. If that means fewer men get into the legislature, as it must do (social change always has its losers as well as its winners), I’m happy to look those men in the eye.

But this is still treating power as an elite thing, coupled to public prestige, to the individual charisma of so-called ‘leadership’, and often, though not always, to a degree of celebrity. It’s also treating power very narrowly as something only the few – mostly men – can possess or wield (that’s exactly what’s summed up by the image of Perseus or Trump, brandishing his sword). On those terms, women as a gender – not as some individuals – are by definition excluded from it. You can’t easily fit women into a structure that is already coded as male; you have to change the structure. That means thinking about power differently. It means decoupling it from public prestige. It means thinking collaboratively, about the power of followers not just of leaders. It means above all thinking about power as an attribute or even a verb (‘to power’), not as a possession: what I have in mind is the ability to be effective, to make a difference in the world, and the right to be taken seriously, together as much as individually. It’s power in that sense that many women feel they don’t have – and that they want. Why the popular resonance of ‘mansplaining’ (despite the intense dislike of the term felt by many men)? It hits home for us because it points straight to what it feels like not to be taken seriously: a bit like when I get lectured on Roman history on Twitter.

So should we be optimistic about change when we think about what power is and what it can do, and women’s engagement with it? Maybe, we should be a little. I’m struck, for example, that one of the most influential political movements of the last few years, Black Lives Matter, was founded by three women; few of us, I suspect, would recognise any of their names, but together they had the power to get things done in a different way.14

I’m not sure that culturally we’ve got anywhere near subverting those foundational stories of power that serve to keep women out of it, and turning them to our own advantage, as Thatcher did with her handbag. I have been playing the part of the pedant, objecting to Lysistrata being played as if it were about girl power (maybe that’s exactly what we should be doing). There have been all kinds of well-known feminist attempts over the last fifty years or more to reclaim Medusa for female power (‘Laugh with Medusa’, as the title of one recent collection of essays almost put it) – not to mention the use of her as the Versace logo – but it’s made not a blind bit of difference to the way she has been used in attacks on female politicians.

The power of those traditional narratives is very nicely, though fatalistically, captured by Perkins Gilman. There is a sequel to Herland, in which Vandyck decides to escort Terry back home to Ourland, taking with him his wife from Herland, Ellador: it’s called With Her in Ourland. In truth, Ourland doesn’t show itself off very well, not least because Ellador is introduced to it in the middle of World War One. And before long the couple, having ditched Terry, decide to go back to Herland. By now Van and Ellador are expecting a baby, and – you may have guessed it – the last words of this second novella are: ‘In due time a son was born to us.’ Perkins Gilman must have been well aware that there was no need for another sequel. Any reader in tune with the logic of the Western tradition would have been able to predict exactly who would be in charge of Herland in fifty years’ time. That boy.

The hi-tech war on science fraud

war

Unknown 13:55 1

One morning last summer, a German psychologist named Mathias Kauff woke up to find that he had been reprimanded by a robot. In an email, a computer program named Statcheck informed him that a 2013 paper he had published on multiculturalism and prejudice appeared to contain a number of incorrect calculations – which the program had catalogued and then posted on the internet for anyone to see. The problems turned out to be minor – just a few rounding errors – but the experience left Kauff feeling rattled. “At first I was a bit frightened,” he said. “I felt a bit exposed.”

Sign up to the long read email
Read more
Kauff wasn’t alone. Statcheck had read some 50,000 published psychology papers and checked the maths behind every statistical result it encountered. In the space of 24 hours, virtually every academic active in the field in the past two decades had received an email from the program, informing them that their work had been reviewed. Nothing like this had ever been seen before: a massive, open, retroactive evaluation of scientific literature, conducted entirely by computer.

Statcheck’s method was relatively simple, more like the mathematical equivalent of a spellchecker than a thoughtful review, but some scientists saw it as a new form of scrutiny and suspicion, portending a future in which the objective authority of peer review would be undermined by unaccountable and uncredentialed critics.

Susan Fiske, the former head of the Association for Psychological Science, wrote an op-ed accusing “self-appointed data police” of pioneering a new “form of harassment”. The German Psychological Society issued a statement condemning the unauthorised use of Statcheck. The intensity of the reaction suggested that many were afraid that the program was not just attributing mere statistical errors, but some impropriety, to the scientists.

The man behind all this controversy was a 25-year-old Dutch scientist named Chris Hartgerink, based at Tilburg University’s Meta-Research Center, which studies bias and error in science. Statcheck was the brainchild of Hartgerink’s colleague Michèle Nuijten, who had used the program to conduct a 2015 study that demonstrated that about half of all papers in psychology journals contained a statistical error. Nuijten’s study was written up in Nature as a valuable contribution to the growing literature acknowledging bias and error in science – but she had not published an inventory of the specific errors it had detected, or the authors who had committed them. The real flashpoint came months later, when Hartgerink modified Statcheck with some code of his own devising, which catalogued the individual errors and posted them online – sparking uproar across the scientific community.

Hartgerink is one of only a handful of researchers in the world who work full-time on the problem of scientific fraud – and he is perfectly happy to upset his peers. “The scientific system as we know it is pretty screwed up,” he told me last autumn. Sitting in the offices of the Meta-Research Center, which look out on to Tilburg’s grey, mid-century campus, he added: “I’ve known for years that I want to help improve it.” Hartgerink approaches his work with a professorial seriousness – his office is bare, except for a pile of statistics textbooks and an equation-filled whiteboard – and he is appealingly earnest about his aims. His conversations tend to rapidly ascend to great heights, as if they were balloons released from his hands – the simplest things soon become grand questions of ethics, or privacy, or the future of science.

“Statcheck is a good example of what is now possible,” he said. The top priority, for Hartgerink, is something much more grave than correcting simple statistical miscalculations. He is now proposing to deploy a similar program that will uncover fake or manipulated results – which he believes are far more prevalent than most scientists would like to admit.

The Guardian's Audio Long Reads The hi-tech war on science fraud – podcast
The problem of fake data may go far deeper than scientists admit. Now a team of researchers has a controversial plan to root out the perpetrators
Listen
When it comes to fraud – or in the more neutral terms he prefers, “scientific misconduct” – Hartgerink is aware that he is venturing into sensitive territory. “It is not something people enjoy talking about,” he told me, with a weary grin. Despite its professed commitment to self-correction, science is a discipline that relies mainly on a culture of mutual trust and good faith to stay clean. Talking about its faults can feel like a kind of heresy. In 1981, when a young Al Gore led a congressional inquiry into a spate of recent cases of scientific fraud in biomedicine, the historian Daniel Kevles observed that “for Gore and for many others, fraud in the biomedical sciences was akin to pederasty among priests”.

The comparison is apt. The exposure of fraud directly threatens the special claim science has on truth, which relies on the belief that its methods are purely rational and objective. As the congressmen warned scientists during the hearings, “each and every case of fraud serves to undermine the public’s trust in the research enterprise of our nation”.

But three decades later, scientists still have only the most crude estimates of how much fraud actually exists. The current accepted standard is a 2009 study by the Stanford researcher Daniele Fanelli that collated the results of 21 previous surveys given to scientists in various fields about research misconduct. The studies, which depended entirely on scientists honestly reporting their own misconduct, concluded that about 2% of scientists had falsified data at some point in their career.

If Fanelli’s estimate is correct, it seems likely that thousands of scientists are getting away with misconduct each year. Fraud – including outright fabrication, plagiarism and self-plagiarism – accounts for the majority of retracted scientific articles. But, according to RetractionWatch, which catalogues papers that have been withdrawn from the scientific literature, only 684 were retracted in 2015, while more than 800,000 new papers were published. If even just a few of the suggested 2% of scientific fraudsters – which, relying on self-reporting, is itself probably a conservative estimate – are active in any given year, the vast majority are going totally undetected. “Reviewers and editors, other gatekeepers – they’re not looking for potential problems,” Hartgerink said.

But if none of the traditional authorities in science are going to address the problem, Hartgerink believes that there is another way. If a program similar to Statcheck can be trained to detect the traces of manipulated data, and then make those results public, the scientific community can decide for itself whether a given study should still be regarded as trustworthy.

Hartgerink’s university, which sits at the western edge of Tilburg, a small, quiet city in the southern Netherlands, seems an unlikely place to try and correct this hole in the scientific process. The university is best known for its economics and business courses and does not have traditional lab facilities. But Tilburg was also the site of one of the biggest scientific scandals in living memory – and no one knows better than Hartgerink and his colleagues just how devastating individual cases of fraud can be.

In September 2010, the School of Social and Behavioral Science at Tilburg University appointed Diederik Stapel, a promising young social psychologist, as its new dean. Stapel was already popular with students for his warm manner, and with the faculty for his easy command of scientific literature and his enthusiasm for collaboration. He would often offer to help his colleagues, and sometimes even his students, by conducting surveys and gathering data for them.

As dean, Stapel appeared to reward his colleagues’ faith in him almost immediately. In April 2011 he published a paper in Science, the first study the small university had ever landed in that prestigious journal. Stapel’s research focused on what psychologists call “priming”: the idea that small stimuli can affect our behaviour in unnoticed but significant ways. “Could being discriminated against depend on such seemingly trivial matters as garbage on the streets?” Stapel’s paper in Science asked. He proceeded to show that white commuters at the Utrecht railway station tended to sit further away from visible minorities when the station was dirty. Similarly, Stapel found that white people were more likely to give negative answers on a quiz about minorities if they were interviewed on a dirty street, rather than a clean one.

Stapel had a knack for devising and executing such clever studies, cutting through messy problems to extract clean data. Since becoming a professor a decade earlier, he had published more than 100 papers, showing, among other things, that beauty product advertisements, regardless of context, prompted women to think about themselves more negatively, and that judges who had been primed to think about concepts of impartial justice were less likely to make racially motivated decisions.

His findings regularly reached the public through the media. The idea that huge, intractable social issues such as sexism and racism could be affected in such simple ways had a powerful intuitive appeal, and hinted at the possibility of equally simple, elegant solutions. If anything united Stapel’s diverse interests, it was this Gladwellian bent. His studies were often featured in the popular press, including the Los Angeles Times and New York Times, and he was a regular guest on Dutch television programmes.

But as Stapel’s reputation skyrocketed, a small group of colleagues and students began to view him with suspicion. “It was too good to be true,” a professor who was working at Tilburg at the time told me. (The professor, who I will call Joseph Robin, asked to remain anonymous so that he could frankly discuss his role in exposing Stapel.) “All of his experiments worked. That just doesn’t happen.”

A student of Stapel’s had mentioned to Robin in 2010 that some of Stapel’s data looked strange, so that autumn, shortly after Stapel was made Dean, Robin proposed a collaboration with him, hoping to see his methods first-hand. Stapel agreed, and the data he returned a few months later, according to Robin, “looked crazy. It was internally inconsistent in weird ways; completely unlike any real data I had ever seen.” Meanwhile, as the student helped get hold of more datasets from Stapel’s former students and collaborators, the evidence mounted: more “weird data”, and identical sets of numbers copied directly from one study to another.

In August 2011, the whistleblowers took their findings to the head of the department, Marcel Zeelenberg, who confronted Stapel with the evidence. At first, Stapel denied the charges, but just days later he admitted what his accusers suspected: he had never interviewed any commuters at the railway station, no women had been shown beauty advertisements and no judges had been surveyed about impartial justice and racism.

Stapel hadn’t just tinkered with numbers, he had made most of them up entirely, producing entire datasets at home in his kitchen after his wife and children had gone to bed. His method was an inversion of the proper scientific method: he started by deciding what result he wanted and then worked backwards, filling out the individual “data” points he was supposed to be collecting.

On 7 September 2011, the university revealed that Stapel had been suspended. The media initially speculated that there might have been an issue with his latest study – announced just days earlier, showing that meat-eaters were more selfish and less sociable – but the problem went much deeper. Stapel’s students and colleagues were about to learn that his enviable skill with data was, in fact, a sham, and his golden reputation, as well as nearly a decade of results that they had used in their own work, were built on lies.

Chris Hartgerink was studying late at the library when he heard the news. The extent of Stapel’s fraud wasn’t clear by then, but it was big. Hartgerink, who was then an undergraduate in the Tilburg psychology programme, felt a sudden disorientation, a sense that something solid and integral had been lost. Stapel had been a mentor to him, hiring him as a research assistant and giving him constant encouragement. “This is a guy who inspired me to actually become enthusiastic about research,” Hartgerink told me. “When that reason drops out, what remains, you know?”

Hartgerink wasn’t alone; the whole university was stunned. “It was a really difficult time,” said one student who had helped expose Stapel. “You saw these people on a daily basis who were so proud of their work, and you know it’s just based on a lie.” Even after Stapel resigned, the media coverage was relentless. Reporters roamed the campus – first from the Dutch press, and then, as the story got bigger, from all over the world.

On 9 September, just two days after Stapel was suspended, the university convened an ad-hoc investigative committee of current and former faculty. To help determine the true extent of Stapel’s fraud, the committee turned to Marcel van Assen, a statistician and psychologist in the department. At the time, Van Assen was growing bored with his current research, and the idea of investigating the former dean sounded like fun to him. Van Assen had never much liked Stapel, believing that he relied more on the force of his personality than reason when running the department. “Some people believe him charismatic,” Van Assen told me. “I am less sensitive to it.”

Van Assen – who is 44, tall and rangy, with a mop of greying, curly hair – approaches his work with relentless, unsentimental practicality. When speaking, he maintains an amused, half-smile, as if he is joking. He once told me that to fix the problems in psychology, it might be simpler to toss out 150 years of research and start again; I’m still not sure whether or not he was serious.

To prove misconduct, Van Assen said, you must be a pitbull: biting deeper and deeper, clamping down not just on the papers, but the datasets behind them, the research methods, the collaborators – using everything available to bring down the target. He spent a year breaking down the 45 studies Stapel produced at Tilburg and cataloguing their individual aberrations, noting where the effect size – a standard measure of the difference between the two groups in an experiment – seemed suspiciously large, where sequences of numbers were copied, where variables were too closely related, or where variables that should have moved in tandem instead appeared adrift.

The committee released its final report in October 2012 and, based largely on its conclusions, 55 of Stapel’s publications were officially retracted by the journals that had published them. Stapel also returned his PhD to the University of Amsterdam. He is, by any measure, one of the biggest scientific frauds of all time. (RetractionWatch has him third on their all-time retraction leaderboard.) The committee also had harsh words for Stapel’s colleagues, concluding that “from the bottom to the top, there was a general neglect of fundamental scientific standards”. “It was a real blow to the faculty,” Jacques Hagenaars, a former professor of methodology at Tilburg, who served on the committee, told me.

By extending some of the blame to the methods and attitudes of the scientists around Stapel, the committee situated the case within a larger problem that was attracting attention at the time, which has come to be known as the “replication crisis”. For the past decade, the scientific community has been grappling with the discovery that many published results cannot be reproduced independently by other scientists – in spite of the traditional safeguards of publishing and peer-review – because the original studies were marred by some combination of unchecked bias and human error.

After the committee disbanded, Van Assen found himself fascinated by the way science is susceptible to error, bias, and outright fraud. Investigating Stapel had been exciting, and he had no interest in returning to his old work. Van Assen had also found a like mind, a new professor at Tilburg named Jelte Wicherts, who had a long history working on bias in science and who shared his attitude of upbeat cynicism about the problems in their field. “We simply agree, there are findings out there that cannot be trusted,” Van Assen said. They began planning a new sort of research group: one that would investigate the very practice of science.

Illustration by Bratislav Milenkovic.
Illustration by Bratislav Milenkovic.
Van Assen does not like assigning Stapel too much credit for the creation of the Meta-Research Center, which hired its first students in late 2012, but there is an undeniable symmetry: he and Wicherts have created, in Stapel’s old department, a platform to investigate the sort of “sloppy science” and misconduct that very department had been condemned for.

Hartgerink joined the group in 2013. “For many people, certainly for me, Stapel launched an existential crisis in science,” he said. After Stapel’s fraud was exposed, Hartgerink struggled to find “what could be trusted” in his chosen field. He began to notice how easy it was for scientists to subjectively interpret data – or manipulate it. For a brief time he considered abandoning a future in research and joining the police.

There are probably several very famous papers that have fake data, and very famous people who have done it
Van Assen, who Hartgerink met through a statistics course, helped put him on another path. Hartgerink learned that a growing number of scientists in every field were coming to agree that the most urgent task for their profession was to establish what results and methods could still be trusted – and that many of these people had begun to investigate the unpredictable human factors that, knowingly or not, knocked science off its course. What was more, he could be a part of it. Van Assen offered Hartgerink a place in his yet-unnamed research group. All of the current projects were on errors or general bias, but Van Assen proposed they go out and work closer to the fringes, developing methods that could detect fake data in published scientific literature.

“I’m not normally an expressive person,” Hartgerink told me. “But I said: ‘Hell, yes. Let’s do that.’”

Hartgerink and Van Assen believe not only that most scientific fraud goes undetected, but that the true rate of misconduct is far higher than 2%. “We cannot trust self reports,” Van Assen told me. “If you ask people, ‘At the conference, did you cheat on your fiancee?’ – people will very likely not admit this.”

Uri Simonsohn, a psychology professor at University of Pennsylvania’s Wharton School who gained notoriety as a “data vigilante” for exposing two serious cases of fraud in his field in 2012, believes that as much as 5% of all published research contains fraudulent data. “It’s not only in the periphery, it’s not only in the journals people don’t read,” he told me. “There are probably several very famous papers that have fake data, and very famous people who have done it.”

But as long as it remains undiscovered, there is a tendency for scientists to dismiss fraud in favour of more widely documented – and less seedy – issues. Even Arturo Casadevall, an American microbiologist who has published extensively on the rate, distribution, and detection of fraud in science, told me that despite his personal interest in the topic, my time would be better served investigating the broader issues driving the replication crisis. Fraud, he said, was “probably a relatively minor problem in terms of the overall level of science”.

This way of thinking goes back at least as far as scientists have been grappling with high-profile cases of misconduct. In 1983, Peter Medawar, the British immunologist and Nobel laureate, wrote in the London Review of Books: “The number of dishonest scientists cannot, of course, be known, but even if they were common enough to justify scary talk of ‘tips of icebergs’, they have not been so numerous as to prevent science’s having become the most successful enterprise (in terms of the fulfilment of declared ambitions) that human beings have ever engaged upon.”

From this perspective, as long as science continues doing what it does well – as long as genes are sequenced and chemicals classified and diseases reliably identified and treated – then fraud will remain a minor concern. But while this may be true in the long run, it may also be dangerously complacent. Furthermore, scientific misconduct can cause serious harm, as, for instance, in the case of patients treated by Paolo Macchiarini, a doctor at Karolinska Institute in Sweden who allegedly misrepresented the effectiveness of an experimental surgical procedure he had developed. Macchiarini is currently being investigated by a Swedish prosecutor after several of the patients who received the procedure later died.

Even in the more mundane business of day-to-day research, scientists are constantly building on past work, relying on its solidity to underpin their own theories. If misconduct really is as widespread as Hartgerink and Van Assen think, then false results are strewn across scientific literature, like unexploded mines that threaten any new structure built over them. At the very least, if science is truly invested in its ideal of self-correction, it seems essential to know the extent of the problem.

But there is little motivation within the scientific community to ramp up efforts to detect fraud. Part of this has to do with the way the field is organised. Science isn’t a traditional hierarchy, but a loose confederation of research groups, institutions, and professional organisations. Universities are clearly central to the scientific enterprise, but they are not in the business of evaluating scientific results, and as long as fraud doesn’t become public they have little incentive to go after it. There is also the questionable perception, although widespread in the scientific community, that there are already measures in place that preclude fraud. When Gore and his fellow congressmen held their hearings 35 years ago, witnesses routinely insisted that science had a variety of self-correcting mechanisms, such as peer-review and replication. But, as the science journalists William Broad and Nicholas Wade pointed out at the time, the vast majority of cases of fraud are actually exposed by whistleblowers, and that holds true to this day.

And so the enormous task of keeping science honest is left to individual scientists in the hope that they will police themselves, and each other. “Not only is it not sustainable,” said Simonsohn, “it doesn’t even work. You only catch the most obvious fakers, and only a small share of them.” There is also the problem of relying on whistleblowers, who face the thankless and emotionally draining prospect of accusing their own colleagues of fraud. (“It’s like saying someone is a paedophile,” one of the students at Tilburg told me.) Neither Simonsohn nor any of the Tilburg whistleblowers I interviewed said they would come forward again. “There is no way we as a field can deal with fraud like this,” the student said. “There has to be a better way.”

In the winter of 2013, soon after Hartgerink began working with Van Assen, they began to investigate another social psychology researcher who they noticed was reporting suspiciously large effect sizes, one of the “tells” that doomed Stapel. When they requested that the researcher provide additional data to verify her results, she stalled – claiming that she was undergoing treatment for stomach cancer. Months later, she informed them that she had deleted all the data in question. But instead of contacting the researcher’s co-authors for copies of the data, or digging deeper into her previous work, they opted to let it go.

They had been thoroughly stonewalled, and they knew that trying to prosecute individual cases of fraud – the “pitbull” approach that Van Assen had taken when investigating Stapel – would never expose more than a handful of dishonest scientists. What they needed was a way to analyse vast quantities of data in search of signs of manipulation or error, which could then be flagged for public inspection without necessarily accusing the individual scientists of deliberate misconduct. After all, putting a fence around a minefield has many of the same benefits as clearing it, with none of the tricky business of digging up the mines.

As Van Assen had earlier argued in a letter to the journal Nature, the traditional approach to investigating other scientists was needlessly fraught – since it combined the messy task of proving that a researcher had intended to commit fraud with a much simpler technical problem: whether the data underlying their results was valid. The two issues, he argued, could be separated.

Scientists can commit fraud in a multitude of ways. In 1974, the American immunologist William Summerlin famously tried to pass a patch of skin on a mouse darkened with permanent marker pen as a successful interspecies skin-graft. But most instances are more mundane: the majority of fraud cases in recent years have emerged from scientists either falsifying images – deliberately mislabelling scans and micrographs – or fabricating or altering their recorded data. And scientists have used statistical tests to scrutinise each other’s data since at least the 1930s, when Ronald Fisher, the father of biostatistics, used a basic chi-squared test to suggest that Gregor Mendel, the father of genetics, had cherrypicked some of his data.

In 2014, Hartgerink and Van Assen started to sort through the variety of tests used in ad-hoc investigations of fraud in order to determine which were powerful and versatile enough to reliably detect statistical anomalies across a wide range of fields. After narrowing down a promising arsenal of tests, they hit a tougher problem. To prove that their methods work, Hartgerink and Van Assen have to show they can reliably distinguish false from real data. But research misconduct is relatively uncharted territory. Only a handful of cases come to light each year – a dismally small sample size – so it’s hard to get an idea of what constitutes “normal” fake data, what its features and particular quirks are. Hartgerink devised a workaround, challenging other academics to produce simple fake datasets, a sort of game to see if they could come up with data that looked real enough to fool the statistical tests, with an Amazon gift card as a prize.

By 2015, the Meta-Research group had expanded to seven researchers, and Hartgerink was helping his colleagues with a separate error-detection project that would become Statcheck. He was pleased with the study that Michèle Nuitjen published that autumn, which used Statcheck to show that something like half of all published psychology papers appeared to contain calculation errors, but as he tinkered with the program and the database of psychology papers they had assembled, he found himself increasingly uneasy about what he saw as the closed and secretive culture of science.

When scientists publish papers in journals, they release only the data they wish to share. Critical evaluation of the results by other scientists – peer review – takes place in secret and the discussion is not released publicly. Once a paper is published, all comments, concerns, and retractions must go through the editors of the journal before they reach the public. There are good, or at least defensible, arguments for all of this. But Hartgerink is part of an increasingly vocal group that believes that the closed nature of science, with authority resting in the hands of specific gatekeepers – journals, universities, and funders – is harmful, and that a more open approach would better serve the scientific method.

Hartgerink realised that with a few adjustments to Statcheck, he could make public all the statistical errors it had exposed. He hoped that this would shift the conversation away from talk of broad, representative results – such as the proportion of studies that contained errors – and towards a discussion of the individual papers and their mistakes. The critique would be complete, exhaustive, and in the public domain, where the authors could address it; everyone else could draw their own conclusions.

In August 2016, with his colleagues’ blessing, he posted the full set of Statcheck results publicly on the anonymous science message board PubPeer. At first there was praise on Twitter and science blogs, which skew young and progressive – and then, condemnations, largely from older scientists, who feared an intrusive new world of public blaming and shaming. In December, after everyone had weighed in, Nature, a bellwether of mainstream scientific thought for more than a century, cautiously supported a future of automated scientific scrutiny in an editorial that addressed the Statcheck controversy without explicitly naming it. Its conclusion seemed to endorse Hartgerink’s approach, that “criticism itself must be embraced”.

In the same month, the Office of Research Integrity (ORI), an obscure branch of the US National Institutes of Health, awarded Hartgerink a small grant – about $100,000 – to pursue new projects investigating misconduct, including the completion of his program to detect fabricated data. For Hartgerink and Van Assen, who had not received any outside funding for their research, it felt like vindication.

Yet change in science comes slowly, if at all, Van Assen reminded me. The current push for more open and accountable science, of which they are a part, has “only really existed since 2011”, he said. It has captured an outsize share of the science media’s attention, and set laudable goals, but it remains a small, fragile outpost of true believers within the vast scientific enterprise. “I have the impression that many scientists in this group think that things are going to change.” Van Assen said. “Chris, Michèle, they are quite optimistic. I think that’s bias. They talk to each other all the time.”

When I asked Hartgerink what it would take to totally eradicate fraud from the scientific process, he suggested that scientists make all of their data public; register the intentions of their work before conducting experiments, to prevent post-hoc reasoning, and that they have their results checked by algorithms during and after the publishing process.

To any working scientist – currently enjoying nearly unprecedented privacy and freedom for a profession that is in large part publicly funded – Hartgerink’s vision would be an unimaginably draconian scientific surveillance state. For his part, Hartgerink believes the preservation of public trust in science requires nothing less – but in the meantime, he intends to pursue this ideal without the explicit consent of the entire scientific community, by investigating published papers and making the results available to the public.

Even scientists who have done similar work uncovering fraud have reservations about Van Assen and Hartgerink’s approach. In January, I met with Dr John Carlisle and Dr Steve Yentis at an anaesthetics conference that took place in London, near Westminster Abbey. In 2012, Yentis, then the editor of the journal Anaesthesia, asked Carlisle to investigate data from a researcher named Yoshitaka Fujii, who the community suspected was falsifying clinical trials. In time, Carlisle demonstrated that 168 of Fujii’s trials contained dubious statistical results. Yentis and the other journal editors contacted Fujii’s employers, who launched a full investigation. Fujii currently sits at the top of the RetractionWatch leaderboard with 183 retracted studies. By sheer numbers he is the biggest scientific fraud in recorded history.

You’re saying to a person, ‘I think you’re a liar.’ How many fraudulent papers are worth one false accusation?
Carlisle, who, like Van Assen, found that he enjoyed the detective work (“it takes a certain personality, or personality disorder”, he said), showed me his latest project, a larger-scale analysis of the rate of suspicious clinical trial results across multiple fields of medicine. He and Yentis discussed their desire to automate these statistical tests – which, in theory, would look a lot like what Hartgerink and Van Assen are developing – but they have no plans to make the results public; instead they envision that journal editors might use the tests to screen incoming articles for signs of possible misconduct.

“It is an incredibly difficult balance,” said Yentis, “you’re saying to a person, ‘I think you’re a liar.’ We have to decide how many fraudulent papers are worth one false accusation. How many is too many?”

With the introduction of programs such as Statcheck, and the growing desire to conduct as much of the critical conversation as possible in public view, Yentis expects a stormy reckoning with those very questions. “That’s a big debate that hasn’t happened,” he said, “and it’s because we simply haven’t had the tools.”

For all their dispassionate distance, when Hartgerink and Van Assen say that they are simply identifying data that “cannot be trusted”, they mean flagging papers and authors that fail their tests. And, as they learned with Statcheck, for many scientists, that will be indistinguishable from an accusation of deceit. When Hartgerink eventually deploys his fraud-detection program, it will flag up some very real instances of fraud, as well as many unintentional errors and false positives – and present all of the results in a messy pile for the scientific community to sort out. Simonsohn called it “a bit like leaving a loaded gun on a playground”.

When I put this question to Van Assen, he told me it was certain that some scientists would be angered or offended by having their work and its possible errors exposed and discussed. He didn’t want to make anyone feel bad, he said – but he didn’t feel bad about it. Science should be about transparency, criticism, and truth.

“The problem, also with scientists, is that people think they are important, they think they have a special purpose in life,” he said. “Maybe you too. But that’s a human bias. I think when you look at it objectively, individuals don’t matter at all. We should only look at what is good for science and society.”

Daily Interesting News

ADS

Hot

Post Top Ad

venerdì 10 marzo 2017

Women in Power

Women in Power

The hi-tech war on science fraud

Post Top Ad

Author Details

Translate

Recent

Social

Recent

Popular

Comments

Sponsor

Daily News

Archive

Tags

About Us

Labels

Total Views

Modulo di contatto

Daily Interesting News

ADS

Hot

Post Top Ad

venerdì 10 marzo 2017

Women in Power

Women in Power

The hi-tech war on science fraud

Post Top Ad

Author Details

Translate

Recent

Social

Recent

Popular

Comments

Sponsor

Daily News

Archive

Tags

About Us

Labels

Total Views

Iscriviti a

Modulo di contatto