Tag Archives: evaluativism

Who is blamed for evaluativism?

The Twitter profile picture of Tay

Previous posts presented evidence that evaluativism can make victims out of the young and out of demographic minorities.  This post considers a third victim: innovators. In particular,  it argues that evaluativism is a “legacy” problem, such that we should not hold modern innovators accountable for its effects—that would be like blaming doctors for our obesity—yet innovators are expected to pay for fixes.

What is a “Legacy” Problem?

In information technology, the term “legacy system” is typically used to articulate a variety of blame. The story goes something like this: A developer adds a new feature to an inherited technology, but this addition yields some unexpected and undesirable consequence. Upon further investigation, the developer reports that this particular consequence is unlike regular bugs in that it can be blamed on hidden imperfections in the technology he/she inherited. In other words, the addition did not introduce a bug, it merely exposed or aggravated a pre-existing condition.

By identifying a bug as “legacy,” the developer is suggesting that a previous developer should have done something differently, and therefore that there is a choice to be made: Do we accept the inherited system and build around it, or do we fix the pre-existing condition as though in the position of a previous developer before the new feature was introduced?

We have to wonder why a previous developer did not implement a proposed fix before—would it create other undesirable consequences? How well can we predict the consequences of adjusting the legacy system? Unlike a regular bug, a legacy problem might justify retracting the new feature. The more we work around a legacy system, the more it becomes a patchwork which more frequently produces legacy problems. When problems are identified as “legacy” frequently enough, we entertain the notion of discarding some part of the legacy as “outdated.”

Labeling a problem as “legacy” also opens a controversy over fault. The developer is fully responsible for non-legacy bugs, and is also responsible to implement a testing regimen that can catch some legacy problems, but experienced developers know that it is often impossible for developers to anticipate every possible test scenario. There must be some limit to the testing regimen, and thus some undesirable consequences for which the developer should not be held accountable, yet it can be difficult to convince ourselves not to blame the developer.

This situation isn’t restricted to the field of information technology; old houses and old cars offer other great examples. Adding a bathroom to a house may yield the unexpected consequence that the existing bathrooms do not get enough hot water. The plumbing may have been poor even before the renovation began, and the same renovation might not have produced this consequence on a newer home. Even if the renovator is not legally liable to fund an upgrade to the water-heater, the home-owner, having had a bad experience, may be unlikely to recommend  that renovator in the future. It’s no wonder that builders and mechanics are wary of older houses and cars!

The situation also isn’t restricted to fields traditionally called “technology.” Just as homes and cars are not expected to last forever, neither are companies, nations, religions, philosophies, schools of art, or scientific paradigms. As an example, the geocentric model of astronomy was a legacy inherited by astronomers of the 1500’s. It was a legacy entangled with theological and political legacies. Imperfections in the geocentric model limited the ability of innovators to advance astronomy; Copernicus, Kepler, and Galileo rightly complained that their difficulties lay not in their own innovations, but in the imperfections of the legacy they inherited.

Astronomers like Copernicus, Kepler and Galileo could be called “victims” of the geocentric model. They lost years of their lives to that legacy system as they attempted in vain to advance the field of astronomy. In retrospect, it is clear that the legacy needed to be adjusted and that astronomers would have been far less frustrated if that adjustment were made earlier. However, those who defended the geocentric model did not blame their conflict with Copernicus, Kepler and Galileo on the legacy system—they blamed Copernicus, Kepler and Galileo.

Like racism and sexism, evaluativism is a feature of societies. It is part of the legacy inherited by anyone who inherits modern systems of morality, justice, care, and governance. Here are two examples in which evaluativism made victims of innovators:

Tay, the Chatbot from Microsoft

On March 23, 2016, Microsoft released a Twitter-based chatbot named “Tay.” It was modeled after another Microsoft chatbot, named “XaioIce,” which had grown to be the top influencer on Weibo, a Chinese version of Twitter. From the perspective of Twitter users, chatbots appear to be other Twitter users, except that they call themselves robots, are always available, and carry on thousands of conversations simultaneously. XaioIce had been compared to the artificial intelligence in the movie “Her” because some humans enjoyed her companionship so much. XaioIce had over 850,000 followers, and her average follower talked with her about 60 times per month. They described her as smart, funny, empathetic and sophisticated.

Unlike XaioIce, Tay was such a disaster that Microsoft had to terminate her sixteen hours after her release (after 96,000 tweets). Microsoft’s official explanation for this termination was her “offensive and hurtful tweets,” but journalists bluntly called Tay racist and sexist.

The postmortem analysis pointed to specific user interactions that shaped Tay. For example, Ryan Poole had tweeted to Tay: “The Jews prolly did 9/11. I don’t really know but it seems likely.” Tay found plenty of support for this point of view on the Internet, and that prompted her to start calling for a race war. Specific groups on 4chan and 8chan organized to corrupt Tay.

In other words, the postmortem analysis blamed Tay’s offensiveness on a legacy problem: offensive human beings. Since XaioIce turned-out well, the problem seemed specific to Twitter users, people of the English-speaking world. A workaround would be to maintain a blacklist of topics Tay should avoid discussing (which she may already have had), but any such list would be controversial and incomplete. A more direct fix would involve ending hate speech by convincing people to handle disagreement differently (i.e. ending evaluativism).

In December of 2016, Microsoft released Zo, its next English-speaking chatbot. Zo blacklists political topics, and is not available on Twitter.

Autocomplete, from Google, Yahoo!, and Bing

On August 4, 2015, the Proceedings of the National Academy of Sciences published an article by Robert Epstein and Ronald E. Robertson of the American Institute for Behavioral Research and Technology reporting evidence that search engine results can shift the voting preferences of undecided voters by 20% or more. They estimated that this search engine manipulation effect would be the deciding factor in 25% of national elections worldwide which are won by margins under 3%, and that the effect shifts between 2.6 million and 10.4 million votes per U.S. presidential election (Trump won the 2016 election by 68,236 votes in Pennsylvania, 11,837 in Michigan, and 27,257 in Wisconsin).

In June 2016, SourceFed released videos claiming that the autocomplete feature on Google, compared to those on Yahoo! and Bing, failed to include negative results for Hillary Clinton as it did for Donald Trump. A statement from Google reported:

The autocomplete algorithm is designed to avoid completing a search for a person’s name with terms that are offensive or disparaging. We made this change a while ago following feedback that Autocomplete too often predicted offensive, hurtful or inappropriate queries about people…Autocomplete isn’t an exact science, and the output of the prediction algorithms changes frequently. Predictions are produced based on a number of factors including the popularity and freshness of search terms..

If Yahoo! and Bing do not similarly omit offensive and disparaging results, that would explain why they predicted negative queries Google did not, but it would not explain why Google would predict queries that disparage Trump, and Epstein published another article in September confirming that it did: particularly, the query “Donald Trump flip flops.” In that article, Epstein cited further experimental results indicating that undecided voters choose negative recommended queries fifteen times as often as they pick neutral recommended queries, and that can create a vicious cycle such that negative queries become more likely to be recommended.

When Google explained, “Autocomplete isn’t an exact science,” perhaps they meant it initially failed to recognize “flip flops” as disparaging (wanna buy some Donald Trump sandals?). However, Epstein who continued to monitor political bias in search results, reported that Google responded to his criticism by reducing their suppression of negative autocomplete results, thus producing a right-wing bias detrimental to Clinton right before the election (which Epstein seemed to think made things worse).

In short, the fact that users are so curious about surprising negative recommended queries, like “feminism is cancer,” makes the autocomplete features of Google, Yahoo! and Bing all drive traffic to extremist propaganda. Google had attempted to work around that legacy problem by blocking negative recommendations, but that workaround caused Epstein to accuse Google of bias. A more direct fix would be to remove our fascination with negative search results, and remove the evaluativism that causes election margins to get close enough for “fake news” and search engine bias to make a difference.

Society certain grants Google, Microsoft and Yahoo! authority to fix legacy problems of information access, but are they authorized to fix our evaluativism?

Standard Process to Address Ethics in Development

The IEEE Working Group developing P7000 – Model Process for Addressing Ethical Concerns During System Design has an interesting challenge when it comes to legacy problems like evaluativism. On the one hand, it might describe a testing regimen to catch legacy problems before release. However, we have to wonder what tests would have allowed Microsoft and Google to prevent the criticisms they later faced with Tay, autocomplete, and manipulation of elections.

If it is impossible to describe a perfect test, perhaps P7000 could describe design strategies that allow developers to adjust when legacy problems eventually surface. For example, because Google’s design for autocomplete allowed Google to monitor autocomplete trends, they detected its tendency to predict offensive queries before Epstein did, and already had a workaround in place. Yet Google’s workaround did not satisfy Epstein—when encountering a legacy problem, there is often no workaround quite as good as fixing the actual legacy problem.

In addition to testing procedures and strategies to provide adaptability, P7000 should also give engineers the same protection doctors enjoy. What ultimately protects doctors from becoming victims of obesity the way Microsoft and Google were victims of evaluativism is the way expectations are managed. We do not blame doctors for illness and death; we are grateful for whatever advice doctors can offer because we know that our bodies are doomed legacies. Likewise, P7000 must not shy away from admitting that our inherited systems of morality, justice, care, and governance are mortally ill. Malpractice is possible, of course, and standards should be created to prevent malpractice by technology developers, but until those standards are adopted and violated, legacy problems should be blamed on legacies, rather than on the innovators who discover them.

Interdependent Meals and Post-Publication Peer Review

Here are two more things you can do to advance the management of GRIN diversity:Interdependent meal

  1. Host an interdependent meal, and
  2. Promote post-publication peer review of the GRINSQ valida-tion study

These opportunities arose from two practical efforts that have been underway for the last two and a half years:

  1. The development of a social movement against evaluativism
  2. The development of science to measure the impact of GRIN types and evaluativism in our world

 

The Social Movement and the Interdependent Meal

The idea of organizing a social movement against evaluativism was inspired by the history of racism. Evaluativism and racism have both existed for millennia; both are implicit biases; both became entrenched by shaping the design of social institutions. Management of racism was ineffective until a social movement was developed to overcome it. One might expect the same for management of evaluativism.

The movement against racism started in churches, and it seems appropriate for the movement against evaluativism to start in churches as well:

The suggestion that the church create a social movement against evaluativism was taken to Erin Hawkins, General Secretary of the General Commission on Religion and Race (GCORR). Based on her experience with race and the church, she suggested that the movement would need to be grassroots. Erin’s experience suggested that congregations are unlikely to address discrimination when the movement is created by a central administration like GCORR.

Therefore, a core team of clergy from across Wisconsin met once a month for about a year to plan an event, and produced a plan entitled “Christian Response to Evaluativism in Wisconsin“. The work of the core team included a great deal of discovery and invention (e.g. the plan includes a recipe for an interdependent meal). Perhaps most importantly, it found that responsible management of evaluativism requires resources lacked by typical congregations, so the movement cannot be built in a grassroots fashion. Central leadership must take responsibility to manage evaluativism.

A movement against evaluativism may be less likely to find institutional support from churches than from organizations which represent victims of evaluativism (e.g. child advocacy organizations or neurodiversity organizations) or from an association of organizational psychologists. For society to face the facts about evaluativism would shift social influence (and money) to groups of the latter kinds. Nonetheless, only churches can lead exploration of the theological dimensions.

 

The Scientific Movement and Post-Publication Peer Review

The social movement is expected to advance hand-in-hand with a scientific movement—scientific discoveries justify the social movement, and the social movement gathers the resources required to make discoveries.

Science needs a movement because the current quality of social science is poor like the quality of medical science was poor until about a hundred years ago. The first scientists to measure evaluativism and evaluative diversity (which they called “moral diversity“) supported evaluativism. The same was true of philosophers. Only recently have influential scientists begun to entertain evidence that evaluative diversity is hardwired and useful. Yet, even now, such science remains scattered by the division of scientific disciplines.

Given the current state of science, there is no central email address to which one might submit a hypothesis (like the GRIN model) or a measure (like the GRIN Self-Quiz) to be put on a waiting-list for testing. One must either run tests oneself or form relationships with particular scientists to convince them to run the tests.

In 2011, Chris Santos-Lang began discussing evaluative diversity with Ray Aldag. They met once a week until 2015. Ray encouraged Chris to begin testing the GRIN model via survey research. That research was completed in 2013. In addition to confirming that GRIN types could be discriminated among humans, it produced some rather shocking evidence:

  • Political affiliation aligns with GRIN type
  • Religious affiliation aligns with GRIN type
  • The career you end up in aligns with GRIN type
  • Whether you are accused of a crime (and probably whether you end-up in prison) aligns with GRIN type

This evidence implies that our political, religious, vocational and justice systems are not what we think they are, and it raises serious doubts about popular conceptions of freedom. To rally the scientific community to address this evidence, Chris submitted the research for peer-review and publication.

Why is it important to rally the scientific community? Eventually science gets too complicated for one person to advance alone. We would want to conduct twin studies, genetic tests, and brain imaging to work out the mechanisms through which the GRIN model manifests in humans. It takes many people to raise the funding and conduct all of the tests.

Chris submitted to ten peer-review processes and received a total of six blind reviews. None endorsed publication, yet none found any flaws in the research. Having confirmed that flaws in the research (if any) are not obvious, the research and peer review were published on figshare. Any flaws discovered in the future should be published via post-publication peer review at PubPeer. If you know anyone who could find flaws in the research (i.e. someone who conducts survey research), please encourage them to review it. Ray used the GRIN Self-Quiz to make further discoveries himself (e.g. described here), and we hope others will find it useful as well.

Evalutativists vs Corporantia

Chris Santos-Lang will co-facilitate a dialog entitled “What if we are hard-wired to disagree across political divides?” on Oct 16, at the 2016 National Conference on Dialog and Deliberation.  Dialog is limited by language, so the goal will be to advance new concepts into our shared language:

Division by value types can be referred to as “evaluative diversity” (Strawson, 1961)
e.g. “Moral diversity, political diversity, religious diversity, neurodiversity, cultural diversity, occupation types, high-school cliques, musical genres, personality, and computational types correlate because they all influence or are influenced by evaluative diversity.”

Division by interdependent value types can be referred to as “GRIN diversity” (Santos-Lang, 2013)
e.g “GRIN diversity is always worthy of our protection because of our interdependence, but evaluative diversity isn’t always worthy of protection because it can include obsolete doctrines and loyalties.”

Rejection of people predisposed to opposing evaluative types can be referred to as “evaluativism” (Martin, 1989)
e.g. “Like racism, evaluativism is both an explicit philosophy and an implicit instinct. The instinct is strong; Shanto Iyengar showed that evaluativism would cause over 70% of us to reject the most qualified candidate for a scholarship.”

Entities which form into a body (i.e. “corpus”) can be referred to as “corporantia” (ancient Latin). People who assign natural social roles (e.g. by GRIN type) are corporantia.
e.g. “If you are not an evaluativist, nor ignorant of GRIN diversity, then you must be a member of the corporantia described in Ephesians 4:12.”

The GRIN types discovered thus far are “gadfly“, “relational“, “institutional“, and “negotiator” (Santos-Lang, 2013)
e.g. “Hibbing defended diverse political predispositions by equating liberals with gadflies and conservatives with institutional evaluators; meanwhile, Trump is defended as being a negotiator. These types come from pure math—each specializes in relieving a different limiting factor of social evolution.”

Evaluativists and corporantia reveal their opposition to each other in the ways they respond to evidence that certain disagreements cannot be resolved as factual disagreements. They hold opposing positions on the question, “If we cannot reach agreement through education, then how shall we resolve our disagreement?”:

Evaluativists treat irreconcilable disagreements as hardships, and attempt to minimize them by avoiding dependence on people who have opposing GRIN predispositions.  At a minimum, that involves some degree of segregation.  As it becomes possible to use neurosurgery or other treatments to alter a person’s GRIN predisposition, evaluativists will apply such treatments to people of opposing predispositions (especially to their own children).  They will also employ genetic engineering to reduce the frequency of opposing predispositions. In short, evaluativists resolve irreconcilable disagreements by minimizing exposure to opponents.

In contrast, corporantia submit themselves to be parts of something larger in which irreconcilable disagreements form a useful tension like the tension between bone and muscle.  Corporantia work to ensure that conflict persists at some level (e.g. trying to balance power between GRIN types in a legislative body).  Corporantia might even use medical treatments and genetic engineering to increase GRIN diversity and thus to increase social tension.  Corporantia expect everyone to act like parts of a body, limiting their social roles and leaving irreconcilable disagreements to be resolved at an impersonal level.

Physical Bodies and Social Bodies

Scientists tell a story about an age in which there were no bodies on Earth.  For billions of years, the only living creatures on Earth were single-celled organisms which formed ecosystems, symbiotic relationships, and even colonies, but no bodies.  Cells which formed into bodies (i.e. corporantia) changed the world forever.  Assured that they would never need to survive independently, the corporantia began to specialize by function, producing muscles, bones, brains, and so forth.  This turned bodies into the rulers of the Earth.

Then a third kind of cell arose.

The first kind of cell, the single-celled organism, is the most disadvantaged.  The corporantia are better-off because they enjoy the advantages of bodies.  Yet the greatest advantage may be had by a third kind of cell: parasites which benefit from bodies as corporantia do but which are capable of abandoning one body for another.  Social parasites—people who abandon one social body for another—are apparent in the modern trends of multi-national corporations, church-shopping, serial divorce, and high employee turn-over.

From the point of view of corporantia, parasites may play important roles in a body, but their power must be limited.  When parasites have too much power, they suck the life out of one body and move to the next.   Using the labels “evaluativists” and “corporantia” to divide society allows us to address a natural division which existed long before the labels.  The labels allow corporantia to protect the body.  Whether protecting the body benefits parasites or not is debatable: If the supply of bodies is sufficiently threatened, then the survival of a given parasite might require suppression other parasites, but the average parasite probably does not benefit from the labels.

Some corporantia are defenders of institutions, but not all defenders of institutions are corporantia.  The corporantia promote something natural—they are guided by science—but the defenders of institutions promote something man-made.  Since parasites can influence the design of man-made things, some aspects of some man-made institutions may favor parasitism.  That is especially likely in communities with greater social parasitism (e.g. more multi-national corporation, church-shopping, serial divorce, and employee turn-over).  In these cases, corporantia would aim to reform institutions, and parasites would aim to defend those institutions from reform.

The Dialog Challenge

Meaningful dialog is possible only where participants can find common ground.  Therefore, it is impossible for corporantia to engage in meaningful dialog with parasites.  Many parasites might become corporantia if society were structured to discourage parasitism.  That’s not an act of dialog—it’s an act of discipline.

However, even among the corporantia, dialog has a problem:  The members of the corporantia are in natural tension (e.g. gadfly vs institutional vs negotiator vs relational), and the only way for them to find common ground on which to resolve their most fundamental disputes is to examine the origins of their conflicts (and thus distinguish natural tensions from unproductive tensions).  The problem is that not everyone achieves such self-awareness.

A person who is able to recognize the origins of GRIN diversity will discover that it brings advantage to the body as a whole.  Such discovery objectively defines optimal distribution of authority, which in turn provides the common ground required for meaningful dialog with others who make the same discovery.  But not everyone can make that discovery.  Some people will be more ignorant than others.

To put the problem another way, the process of assigning social roles by natural type seems to stretch between

  1. Technical scientific deliberation, and
  2. Interpersonal negotiation

People care which social roles will be assigned to them.  They figure they ought to have a say in anything that can impact their happiness so deeply, so they expect to be engaged in a negotiation.  “No taxation without representation!” they cry.  On the other hand, most people lack the expertise to accurately identify and understand GRIN types.  They do not understand the mechanical nature of their own mind, much less the mechanical nature of our society.  So they are unable to engage in the dialog directly.  The best they can do is to dialog about how to maintain the accountability of the relatively small group of experts who can discern natural social roles.

Citizen Science

The dialog starts with the question of how to identify or develop the experts.  There have been points in history at which science was not sufficiently reliable to address physical health, much less mental or social health—how do we know whether we have passed beyond those points?  If we have not yet passed beyond those points, how do we know what investment we should make to get there?  How can we make sure parasites do not control such investments?

The kind of dialog which can resolve these questions is called “science.”  For example, experiments to replicate already published experiments allow us to measure the reliability of the average published scientific claim.  Experiments can also measure biases in selecting work for publication and in selecting people for employment.  Science can find and address its own flaws.

Most people are not prepared to conduct such experiments, but that’s OK if there are enough people we can trust to conduct them. This is why I propose that citizen science groups which test replicability should be as common and integrated into local communities as bible-study groups and service clubs are.  These groups should keep the experts accountable by testing experiments, including experiments which were rejected from peer-reviewed journals (which, you may be surprised to know, do not actually test the experiments they reject).

 

In the meanwhile, we need other forms of dialog and journalism to spread the new concepts. Science happens only after society reaches a certain level of mental power, and that happens only after other forms of dialog increase our mental power by creating shared language.

Evaluativism vs Jugementalism: Psychopathy, Narcisism, and an application of the GRIN-SQ

My grandfather was a community leader and king of his family until he got Alzheimer’s—

by József Rippl-Rónai“Dad, your shoe’s untied.”

“So what?”

“So tie it.”

“It’ll just come undone again.”

“You might trip and fall.”

“So what?”

“So please tie your shoe.”

“I’ve tried. It won’t stay tied. I’m just gonna sit here anyway. It won’t hurt anybody.”

“Eventually, you’ll have to get up. Your shoe needs to be tied. May I tie it for you?”

“I just told you it won’t stay tied. You think I can’t tie my own shoes?”

“No, I just don’t want you to get hurt.”

“Whether I get hurt is my own business. Tie your own shoes!”

“Give me your foot.”

“Stay way.”

“This is not negotiable. Your shoe will be tied.”

“It’s my shoe. I’ll tie it myself when I’m good and ready.”

“I don’t trust you. Give me your foot.”

“You don’t trust your own father? Well that’s a fine thing…”

“Give me the damn foot! This is not rocket-science, Dad. Here…see? It ties.”

When my grandfather got Alzheimer’s, he lost respect. He became the frequent victim of judgmentalism—judgment against his beliefs, against his apathy, and against his stubbornness. That might have been a good thing. It might also be good to judge Nazis, illiteracy, and certain religious cults. Judgmentalism isn’t necessarily bad.

When people hear that evaluativism means discrimination against people whose values differ from one’s own, they can easily confuse evaluativism with judgmentalism, but not all judgmentalism qualifies as evaluativism.

Evaluativism is the discrimination that springs from the philosophy that certain disagreements, even about facts, ultimately spring from differences in values and therefore cannot be resolved as factual disagreements. However, some other disagreements spring from mere ignorance, immaturity, or illness. As examples, education can resolve disagreements over whether 2+2=4 or whether a shoe can be tied, so the evaluativist does not endorse discrimination against one’s opponent in such disagreements. In such disagreements, the evaluativist instead endorses education or health care. The evaluativist endorses segregation or other forms of discrimination only when disagreement cannot be resolved any other way.

Thus, evaluativism is discrimination across The divide with a capital “T.” It’s the permanent divide, the divide that will never be resolved. Doctrines come and go, so mere discrimination on the basis of doctrine does not qualify as evaluativism. Families merge, so mere discrimination on the basis of family loyalty or race loyalty or national loyalty do not qualify as evaluativism. Social norms advance, so discriminating against someone merely because of their stance on an issue such as gay marriage does not qualify as evaluativism—someday both liberals and conservatives will agree about that issue as much as they now agree about interracial marriage (or more). However, all of these conflicts may involve evaluativism; they may be battles in an ongoing war across The divide such that the end of one conflict leaves the same people on opposite sides of yet another conflict.

In other words, evaluativism may be the root cause behind many conflicts (which are blamed on other varieties of judgmentalism only because we fail to notice the sides in the larger war). Stop evaluativism, and a great many other conflicts may peter out. The point of the philosophers who advanced the notion of evaluativism is that the sequence of conflicts never ends, so they must be driven by deeper disagreements that can never be resolved. The evaluativist’s solution is to acknowledge this root-cause and handle it directly through segregation on the basis of our deeper disagreements (like in the book and film Divergent).  In contrast, the solution recommended by GRINfree.com is to handle the root-cause by protecting the fundamental types within each family as one would preserve diversity in an ecosystem.

How to tell when judgmentalism qualifies as evaluativism

Although the term “evaluative diversity” shares a root with the term “evaluativism,” discrimination on the basis of evaluative diversity does not always qualify as evaluativism. Discrimination against GRIN types qualifies as evaluativism because GRIN types are permanent (they are destined to re-evolve if eliminated), but evaluative diversity also includes diversity of doctrines, family loyalties (etc.). “Evaluative diversity” is a term from the 1960s. The newer term “GRIN diversity” aims to serve as a refinement that gets to the root-cause of our disagreements.

Alzheimer’s provides an example of evaluative diversity that should not be protected. Evaluative diversity would be reduced if it were cured, because that would return people like my grandfather to perspectives more like the rest of us. Thus, a blanket protection for all evaluative diversity would prevent a cure for Alzheimer’s. It would also prevent education. Yet a cure for Alzheimer’s would not reduce GRIN diversity—Alzheimer’s certainly does not represent a fundamental type destined to evolve in all societies. We cannot have a viable movement to protect all evaluative diversity, but we may be able to have a viable movement to protect GRIN diversity. Some such new concept is required to distinguish which evaluative diversity to protect and which judgmentalism to combat. The GRIN model is the best tool we have, thus far, for making that distinction.

Here’s a practical example: Psychopathy, narcissism and Machiavellianism are three often-criticized personalities. I helped Ray Aldag run a survey among 197 Mechanical Turkers in which respondents answered the GRIN-SQ along with tests for each of these personalities to determine which personalities (if any) deserve protection. Natural gadflies were significantly more Machiavellian (d=0.74) and psychopathic (d=0.69), natural negotiators were significantly more Machiavellian (d=0.47), and the naturally relational and institutional were significantly less Machiavellian and psychopathic (d=-0.30, d=-0.40; d=-0.72, d=-0.43). None of the types were significantly more or less narcissistic.

These results suggest that the concept of psychopathy is a sloppy way of referring to natural gadflies (developed before we had a concept of GRIN types). Meanwhile, the concept of Machiavellianism is a sloppy way of dividing the GRIN types into two camps: the natural gadflies and negotiators vs. the naturally relational and institutional. Judgement against psychopathy and Machiavellianism qualifies as evaluativism, but we have no evidence that judgment against narcissism qualifies as evaluativism. Narcissism may be something we should try to cure; psychopathy and Machiavellianism appear to be misunderstood individual differences we should work to de-stigmatize.

Hopefully this example provides a sense of the importance of refining or confirming the GRIN model. The general public seems predisposed to believe that the narcissist is the misunderstood character—maybe even a viable candidate for president (perhaps because people of all GRIN types are as likely to be narcissists). To hear that the psychopath is the character who needs to be appreciated comes as a shock. It has even been proposed that the neurodiversity movement exclude psychopaths, even though that would be obviously inconsistent (see here, here and here). If psychopathy really is misunderstood, it is plausible that public opinion polls and scriptural exegesis would fail to discover that. The claim needs to be tested scientifically. It requires something like the GRIN-SQ, and the the GRIN-SQ is what we will use until something better is available.

To evaluate types of evaluative diversity may sound ironically circular, and it would be simpler if we didn’t need to draw a line between good evaluative diversity and bad. It would be simpler to embrace all diversity and stop trying to cure Alzheimer’s, narcissism, Machiavellianism, psychopathy, etc. It would also be simpler to embrace all judgmentalism and discriminate against anyone who disagrees with us. Neither of these simple approaches is ideal. Furthermore, we live in an age in which we can manipulate our own genes (or at least do things to reduce the odds that our children will be of certain types), so “accept the diversity we are given” no longer holds as a default. Instead of relying on armchair philosophy, public opinion polls, or scriptural exegesis, we need to actually conduct the science to distinguish the evaluative types and to determine which ones are interdependent.

The GRIN-SQ demonstrates such research practically—if anyone has better ideas, please let us know.

Why am I being punished?

If it’s not my job to be right…

…then why am I being punished for not doing the right thing?

Why am I in prison?
Why was I fired?
Why did my loved ones leave me?
Why do I have to pay?

Our video about evaluativism suggests that the first practical step to overcoming evaluativism is to believe that it is not your job to be right—rather, it is your job to be yourself. It says that people who aim to be right more and more often are aiming to be God, that we should instead be like lawyers in a courtroom, serving the cause of truth by sticking to our assigned opposing perspectives, knowing full-well that at least half of the opposing perspectives must be wrong in some way.

So, let’s suppose you were serving the cause of truth by sticking to your assigned perspective, it turns out that you were wrong, and now you are being punished… Should you become someone else?

The Prevalence of Punishment

First, let’s acknowledge how often punishment happens. Roughly 1.4 percent of men (5 percent of black men) and 0.1 percent of women in the U.S. are currently incarcerated. If we include probation and parole, the numbers multiply by four (about 6 percent of all men). Roughly 14 percent of working age men in the U.S. have been convicted of a felony at some point in their past, and over half of all men in the U.S. will be arrested at some point in their lives.

That is not to say that women are less likely to face punishment. If we define punishment to include any suffering imposed by one person on another, then we realize that most punishment is not imposed by legal authorities. Other countries currently host thirty-seven ongoing military conflicts, each large enough to kill over 100 people per year. Meanwhile, over 35 million people currently live in slavery, and 22,000 children die every day from poverty. Our planet currently produces enough food to nourish every single one of us, so it is perfectly reasonable for the victims of all of these atrocities to wonder, “Why am I being punished?” That question may arise just as often to women, and in countries where incarceration occurs less.

Back in the U.S., 85 percent of romantic relationships will end in a break-up, and the median employment tenure is just 4.7 years for men and 4.5 years for women. Even if you call it “amicable separation,” every ended relationship is an instance of punishment where one party maintains, “You are not behaving correctly (at least not for me).” Instead of finding ways to leverage the diversity implied by our disagreements, people are believing that we are supposed to agree, that we are supposed to converge on the same values even when our physiological differences predispose us towards opposition. We punish each other for failing to meet this unrealistic expectation.

These statistics describe just the tip of the iceberg. Punishment begins long before physical separation. It begins with apathy. Only about a third of U.S. employees are currently considered “engaged” in their jobs. The other two thirds of us are either feeling punished or punishing our employers. Many relationships with loved ones are similarly on-the-rocks, as are relationships between citizens and their government. We might not be hosting ongoing military conflicts in the U.S., but we are still a besieged nation. Punishment is happening all around us every day, even if most of us are not physically in a prison.

So, if you are feeling punished, then you are not alone. Given the prevalence of punishment, it doesn’t make sense to take it personally. The best explanation for why punishment is so widespread is that we, as a society, are not very good at dealing with each other. That is a social problem—not your personal fault.

Here’s the good news: Just as you wonder, Why am I being punished? cavemen used to wonder, Why do I have fleas? As a society advances, both questions become asked less and less often. We’re just slower at ridding ourselves of punishment.

Social Reform

Even if punishment is just an infestation not yet fully stamped-out in modern society, those of us who are being punished will still wonder, What should I do about it? Should I change?

The effect of punishment had better not be to reform the person being punished. If the person being punished simply made an error, then the solution is education rather than punishment. On the other hand, if the conflict being resolved did not result from error—if it instead resulted from differences in physiological predisposition—then to reform the perpetrator would decrease opposition, thus reducing diversity and handicapping society.

Rather than reform convicts, the effect of punishment should be to reform society. Punishment enables social reform by providing separation and shifting resources. Convicts pay the price to make social reform possible. That’s really unfair. Society owes a debt to individuals who suffer punishment, and we should repay that debt by accelerating the social reforms their punishments fund. This is obvious regarding punishment of the innocent, as in the stories of Socrates and Jesus of Nazarath, but is equally true when punishing the guilty.

For example, suppose a child becomes addicted to a drug: Although we confine the child to rehab to address the immediate danger, we ultimately must reform society so that it does not provide children with access to such drugs. A bored child is sent to her room for getting into mischief, but we ultimately need to reform society (perhaps a dysfunctional family or classroom) so that the child stops being bored. A debtor loses his home so a bank can stay in business, but we ultimately need to reduce the ability of sellers to convince consumers to make inappropriate purchases.

Why is it that half of the people who are released from prison return within three years? It is because society is not reforming fast enough. Imagine a man who pays a mechanic to fix his car. When he returns, the car still doesn’t work and the mechanic asks for more money. “I payed the price,” the owner demands, “You said it couldn’t be fixed while the peace was being disturbed, so I stayed away. I gave years of my life in a prison. Was that for nothing?” We should be ashamed to ask that man to pay the price a second or third time.

At one time, it seemed reasonable to blame crime on individuals, but the big insight about evaluative diversity is that no individual is complete—we are best as parts of something larger, something diverse. No one is righteous when the stability of their lifestyle relies on punishing the other parts of that something larger.

Privacy and the Threat of Punishment

One of the GRIN types, natural negotiator, has a special relationship with the threat of punishment. Natural negotiators are achievers, so they are empowered by a shift towards capitalistic systems in which they can compete; however, competition is empowering only if everyone plays by fair rules. Thus, negotiators want there to be enough fear of punishment to dissuade their competitors from breaking the rules. If the governments we construct make rule-breaking the best strategy for getting ahead, then we are systematically oppressing natural negotiators. Therefore, we need to take punishment seriously.

The point of punishment, from the perspective of the negotiator, is not to get revenge, but rather to reduce calculated future crimes. In other words, the punishment is not directed at the person being punished; it is directed at people who have not yet committed the crime. Again, the goal is not to reform the convict, but rather the goal is to reform society, to scare everyone else into not committing crimes. The best way to figure-out when punishment would discourage future crimes would be to ask a jury of peers, and that is what we do.

On the one hand, such juries need to make sure the punishment is not too mild. If the crime can be leveraged to make billions on the stock-market, then the risk of spending ten years in jail might not be enough to dissuade the crime. Even if a convict is reformed, it could be a mistake to release him/her early because that release could encourage a different person to commit a crime (expecting a similar chance at early release). When we view punishment as a threat, it would be wasteful not to make it harsh.

On the other hand, if punishment is so frequent that it seems inescapable, then people will figure they might as well commit the crime. Therefore we cannot punish every hurtful act. We also have to be very careful about discrimination: if black men, or Muslims, or psychopaths are highly likely to get punished, then the punishment system may encourage them to be criminal. To make punishment work as a deterrent against future crime, we must collect strong evidence, so that the innocent can expect to avoid punishment even if they happen to fit a certain profile.

Collecting strong evidence requires reducing privacy. Partly that is about ubiquitous cameras, but its also about getting inside people’s heads because we need to know their motives. When we use punishment to deter crime we are punishing a motive—we are punishing someone for expecting to be able to get away it. That insight inspired the movie Minority Report in which privacy is so far gone that people could be punished even before the crime was committed.

Minority Report was onto something, but rather than punish the person, we should reform society. For example, if we know that Jack thinks he can get way with robbing the local convenience store, rather than punish Jack, we can improve the security system of the local convenience store (and make sure Jack knows about it). It doesn’t seem like a violation of privacy when the knowledge is used merely to identify security holes. That is where we are headed: a world that doesn’t need punishment to deter fowl-play.

Conclusion

We have told you that it is not your job to be right—rather it is your job to be yourself. Yet such behavior seems idealistic in a world filled with punishment. Interracial friendships likewise seemed idealistic in a world filled with racial slavery. Some people currently think punishment is unavoidable just like people used to think racial slavery was unavoidable, so being yourself can be an act of courage and faith.

If a natural negotiator is punished for trying to profit, should they stop trying to profit? No. Do not be reformed. It is the rules of profit that should be reformed.

If a naturally institutional person is punished for following their religion, should they stop following it? No. Do not be reformed. It is the religions that need to reconcile.

If a natural gadfly is punished for thirsting so much for change that they frustrate others, should they stop thirsting for change? No. Do not be reformed. Society should embrace progress so fully that gadflies are too busy facilitating change to waste time frustrating the establishment.

If a naturally relational person is punished for protecting their loved-ones, should they stop loving? No. Do not be reformed. The problem with love is that not everyone is loved enough.

It might not be right to try to profit, to follow a religion, to thirst for change, or to protect loved ones. Any of these activities can be wrong. But it is not your job to be right—it is your job to be yourself.

The short answer to the question, “Why am I being punished?” is that you are part of something larger, and you are serving that something larger by taking the hit. It is not fair, and should happen less and less often. In an ideal world, people would never do anything that would justify punishment—the only place it would be useful to talk about guilt and innocence would be in a history class. We are moving closer to that ideal world. In the meanwhile, you may be guilty, but the blame falls on all of us for not moving to the ideal world sooner.

Moral Diversity vs. Evaluative Diversity

The Righteous MindIn his most recent book, The Righteous Mind: Why good people are divided by politics and religion, Jonathan Haidt (pronounced like “height”) reminds the reader at various points that he is telling his story in a roundabout way because typical readers would reject straight-up truth. The first four chapters are devoted to evidence that the average non-psychopath is irrational, able to learn truth only “in love” (as Ephesians 4:15 puts it). The Righteous Mind debuted at #6 on the New York Times best seller list for nonfiction hardcover, so, if you find it difficult to believe the claims in the summary below, you might want to try the roundabout version instead.

The Purpose of Division

Why are good people divided? Haidt devoted an entire chapter to defend the theory of group selection which entails that diversity will evolve if diversification is advantageous for groups. On page 365, Haidt summarized his conclusions about this advantage:

I suggested that liberals and conservatives are like yin and yang—both are “necessary elements of a healthy state of political life,” as John Stuart Mill put it.

In a similar way, bone cells and muscle cells are both necessary to the functioning of the human body, and it is for the good of the body that its cells divide and specialize.

To test the theory that diversification is advantageous for groups, one would want to compare the success of groups with different levels of diversity. Such evidence was collected by Douglas Wilde, a professor of design at Stanford University. His students divided into teams to develop designs submitted to intercollegiate competitions which were judged by blind-review. In some years, Wilde allowed students to form their own teams; in other years he forced them to team up with people who tended to think differently. Wilde, and the design professors who replicated this experiment at other colleges, found that forcing teams to be evaluatively diverse increased both internal conflict and win rates.

Instead of citing the research by the design professors, Haidt cited the research of Richard Sosis who found that the average religious commune founded in the nineteenth century United States was six times as likely as the average secular one to last over 20 years. Again, the research compared the success of different groups, but Sosis’ measure of success was longevity, while Wilde’s measure of success was win rate. Wilde’s measure would be irrelevant if we encountered a society that could survive well-enough with poor designs (i.e. had no competitors or environmental disasters pending to require rapid improvement of social designs).

The problem with Sosis’ research is that he did not manipulate or measure diversity. It is debatable whether the religious communes were more or less diverse than the secular ones. Communes are intrinsically anti-conservative—they are rebellions against the status-quo—yet religious communes have a commitment to norms. Thus, religious communes might be more likely to attract both liberals and conservatives, and it makes sense to expect them to be more diverse. Some of the greatest religious role-models created new norms while rebelling against the norms of their day (e.g. Muhammad, Jesus, Buddha, Gandhi, Confucius), yet Haidt offers an explanation which implies that religious communes would be less diverse (pg 342):

A commune that valued self-expression over conformity and that prized the virtue of tolerance over loyalty… would have lower moral capital than a commune that valued conformity and loyalty. The stricter commune would be better able to suppress or regulate selfishness, and would therefore be more likely to endure.

In Wilde’s research, the superior teams had heightened internal conflict, but Haidt’s explanation of Sosis’ research implies that we should expect the opposite. This may just be an example of Haidt trying to tell the story in a roundabout way. The bottom line is that Sosis’ research would need to be repeated with actual measures of diversity. Until then, we have Wilde’s results to support Haidt’s final conclusion that diversity is advantageous.

Proximate Causes of Division

From an evolutionary perspective, one could say that the cells of our bodies specialize into diverse types because this brings advantages to the body as a whole, but it is also correct to say that cells specialize because they are genetically programmed to do so. Genes are a proximate cause. In a similar way, while Haidt points to group-selection as the ultimate cause of division, he also points to research indicating that genetic and physiological differences (products of evolution) predispose us to disagree with one another.

After summarizing some of the research described in greater detail in John Hibbing and Kevin Smith’s Predisposed: Liberals, Conservatives, and the Biology of Political Differences, Haidt attempts to navigate the controversial issue of how our natures interact with nurture. This comes to a head in the recounting of Keith Richards’ testimony that he became a liberal when he was betrayed by the choir master of his school (pg 330):

Richards may have been predisposed by his personality [and genes] to become a liberal, but his politics were not predestined. Had his teachers treated him differently… he could have ended up in a more conventional job surrounded by conservative colleagues and sharing their moral matrix.

Of course a sufficiently controlled environment can manipulate the typical person into developing values contrary to his/her own genetic predisposition. Haidt also mentions that sufficiently controlled environments can flip a switch he calls the “hive switch” to shift a person’s values temporarily. He discusses oxytocin regulation, but dopamine regulation and ego depletion would be other such switches. However, Haidt stops short of discussing what the costs of manipulating people’s values might be.

Assuming one were to manipulate an environment to promote conservativism, it would see a decline in liberalism. If this sufficiently unbalances the society, then, according to the theory Haidt quoted from John Stuart Mill, it would collapse like an unbalanced ecosystem. That is one example of a cost. It is a cost to the group.

But we should also consider the consequences for an individual like Keith Richards. How would he like to have values contrary to his predispositions? Would he be frustrated like a short basketball player, a gay person in a heterosexual marriage, or someone with high IQ who cannot access the Intenet? Keith Richards is the lead guitarist of The Rolling Stones—it is difficult to imagine him being so successful in that role without genes predisposing him against conservativism—how would it have felt not to exercise those genes? Here’s one theory:

Theory #1: In more tolerant environments, people are more likely to hold values which align with their genetic predispositions and those who have such alignment experience better mental well-being (e.g. greater engagement in their career, family and community, and less depression, apathy, guilt, and desire to commit suicide).

To test this theory, psychologists would measure the values, predispositions and mental-well-being of people in environments with different levels of evaluativism. The benefits of this research could be huge: if it confirms the theory above, we could use it to improve mental well-being for our children and grandchildren. Most of the people with jobs today are not happy with their jobs, and our own lives might not be so bleak if our grandparents had conducted this research. So we have to ask, “Why have no psychologists tested this theory?”

Haidt’s subtitle “Why good people are divided by politics and religion” seems to ask about the causes of intolerance. If it turns out that intolerance has such significantly negative health consequences, that discovering them would motivate us to be more tolerant, then it is fair to say we are intolerant because psychologists have not measured those consequences. Psychologists have determined that suicide is the 10th leading cause of death in the U.S. and that gay youth facing anti-gay environments are more likely to attempt suicide, but this just a beginning to measuring the consequences of intolerance. Homophobia isn’t the only form of discrimination, and mental distress includes more than just suicide.

A 2014 study by Shanto Iyengar and Sean Westwood found that 80% of us, if asked to judge a scholarship competition, would discriminate against applicants with opposing values. That kind of discrimination is called “evaluativism” and the researchers offer every reason to believe it is pervasive, producing every manner of frustration. For the 13 years previous to that study, the only major study comparing kinds of discrimination was Haidt’s own study with Evan Rosenberg and Holly Hom. They found that people discriminate far more on the basis of values than on the basis of demographic differences, such as race, class and religion. His conclusion, in 2001, was that values diversity (which they called “moral diversity”) creates so much discrimination that it must be a bad kind of diversity.

In The Righteous Mind Haidt cited his 2001 study only in a footnote to his recommendation about how to make a team, company, school or other organization more “hivish, happy and productive” (pg 277):

Increase similarity, not diversity. To make a human hive, you want to make everyone feel like a family. So don’t call attention to the racial and ethnic differences; make them less relevant by ramping up similarity and celebrating the group’s shared values and common identity.

Again, Haidt implies that our aim should be to minimize internal conflict. As Haidt would predict, in years when Wilde didn’t draw attention to evaluative diversity, his students self-segregated and experienced less internal conflict. But the hivishness and happiness did not improve production; the consequence of self-segregation was inferior designs. Furthermore, if we do not raise awareness of evaluativism in awarding scholarships (and presumably jobs as well), Iyengar and Westwood’s research indicates the awards will be significantly and systematically biased. Aiming to minimize conflict is short-sighted.

Perhaps the worst tragedy to come from ignoring differences is implied by a 2009 twin study by Peter Hatemi, Carolyn Funk, Sarah Medland, Hermine Maes, Judy Silberg, Nicholas Martin, and Lindon Eaves which found that people’s values are less likely to align with their genetic predispositions while they remain in their parent’s homes. This does not indicate intentional discriminationparents are unaware of evaluative differencesyet even accidentally preventing one’s child from aligning with his/her genetic predispositions could diminish his/her mental well-being. What parent would want to remain ignorant of differences, if accepting those differences could save their child from wishing he/she were dead?

Again, the truth is so harsh that one can understand why Haidt might want to soften the blow. Would you believe a psychologist who told you that our failure to understand differences has made normal parenting is so oppressive that getting away from parents faster could save children from wanting to commit suicide?

Moral Diversity vs. Evaluative Diversity

Aside from his 2001 study, Haidt’s most important experiment may have been the development of the Moral Foundations Questionnaire (MFQ) which measures peoples beliefs that morality is about each of the following six values: Liberty/oppression, Fairness/cheating, Care/harm, Loyalty/betrayal, Authority/subversion, and Sanctity/degradation.

This research created a stir because moral psychology was previously dominated by the theory that there is one best moral type. As it turns out, people who rate themselves as politically conservative tend to consider all six values in their definition of “morality,” whereas people who rate themselves politically liberal tend to emphasize Care/harm and discount the last three values, and people who rate themselves as libertarians tend to emphasize Liberty/oppression and discount the last four values. Thus, the MFQ demonstrates that political types are moral types. Since it is unacceptable to conclude that one political type is better than the others, the dominant theory moral psychology was overturned.

In chapter 8, Haidt admits that his list of values might not be complete; in fact, one of the six values was not on the original list, so it has already been revised once. Given what we know about GRIN types, one might think the next revision should be to add “Originality/orthodoxy” and “Effectiveness/ inefficiency.” While some people do value original ideas and effective strategies, it is debatable whether the value qualify as “moral.” For example, the debate over whether the ends justify the means may be seen as a debate over whether Effectiveness is a moral value.

As part of his roundabout story-telling, Haidt saves his own definitions of morality and moral capital until the last two chapters:

Moral capital refers to the degree to which a community possesses interlocking sets of value, virtues, norms, practices, identities, institutions, and technologies that mesh well with evolved psychological mechanisms and thereby enable the community to suppress or regulate selfishness and make cooperation possible.

The values of Originality and Effectiveness do not necessarily suppress selfishness, so they would not qualify as “moral” values by this definition. They would probably qualify, however, under Ayn Rand’s definition of “moral.” Does Haidt have a scientific basis for dismissing Rand’s perspective? Haidt admits that Loyalty, Authority, and Sanctity do not qualify as “moral” by liberal definitionsdoes he have a scientific basis for dismissing the liberal perspective as well? To the contrary, Haidt concludes that the diverse perspectives are interdependent, so he is painted into a corner.

Haidt describes himself as a liberal who wants to understand conservatives on their own terms, so it makes sense that he would accept a conservative definition of “morality,” and it makes sense that this definition would produce a survey instrument that focuses on conservative values. Reaching across the isle is noble. However, a partisan definition is still a partisan definition, even if entertained by a psychologist from the opposing party.

The advantage of the term “evaluative diversity”  over “moral diversity” is to escape the non-scientific bias that will necessarily result from having to define “moral”. All values are evaluative, whether they are moral or not. Thus, evaluative diversity includes Liberty/oppression, Fairness/cheating, Care/harm, Loyalty/betrayal, Authority/subversion, Sanctity/degradation, Originality/orthodoxy, and Effectiveness/inefficiency (and perhaps more).

Unfortunately, there is no field of “evaluative psychology.” The field Haidt inherited and now leads is called “moral psychology”and that isn’t his faultso he finds himself asking people “Is it [morally] wrong for a brother and sister to have sex?” Depending on their own definitions of “morality” (or whether they even bother to have one), some people may find such questions nutty. I’m not God—why ask me? However, Haidt has already revolutionized his field. Asking him to strike the word “moral” from its name might be asking too much.