Ethereum Blog

Superrationality and DAOs



Vitalik Buterin


Roundup Q2 08th July, 2017

Roundup Round III 24th May, 2017

crypto renaissance salon

Superrationality and DAOs

Posted on .

Warning: this post contains crazy ideas. Myself describing a crazy idea should NOT be construed as implying that (i) I am certain that the idea is correct/viable, (ii) I have an even >50% probability estimate that the idea is correct/viable, or that (iii) “Ethereum” endorses any of this in any way.

One of the common questions that many in the crypto 2.0 space have about the concept of decentralized autonomous organizations is a simple one: what are DAOs good for? What fundamental advantage would an organization have from its management and operations being tied down to hard code on a public blockchain, that could not be had by going the more traditional route? What advantages do blockchain contracts offer over plain old shareholder agreements? Particularly, even if public-good rationales in favor of transparent governance, and guarnateed-not-to-be-evil governance, can be raised, what is the incentive for an individual organization to voluntarily weaken itself by opening up its innermost source code, where its competitors can see every single action that it takes or even plans to take while themselves operating behind closed doors?

There are many paths that one could take to answering this question. For the specific case of non-profit organizations that are already explicitly dedicating themselves to charitable causes, one can rightfully say that the lack of individual incentive; they are already dedicating themselves to improving the world for little or no monetary gain to themselves. For private companies, one can make the information-theoretic argument that a governance algorithm will work better if, all else being equal, everyone can participate and introduce their own information and intelligence into the calculation – a rather reasonable hypothesis given the established result from machine learning that much larger performance gains can be made by increasing the data size than by tweaking the algorithm. In this article, however, we will take a different and more specific route.

What is Superrationality?

In game theory and economics, it is a very widely understood result that there exist many classes of situations in which a set of individuals have the opportunity to act in one of two ways, either “cooperating” with or “defecting” against each other, such that everyone would be better off if everyone cooperated, but regardless of what others do each indvidual would be better off by themselves defecting. As a result, the story goes, everyone ends up defecting, and so people’s individual rationality leads to the worst possible collective result. The most common example of this is the celebrated Prisoner’s Dilemma game.

Since many readers have likely already seen the Prisoner’s Dilemma, I will spice things up by giving Eliezer Yudkowsky’s rather deranged version of the game:

Let’s suppose that four billion human beings – not the whole human species, but a significant part of it – are currently progressing through a fatal disease that can only be cured by substance S.

However, substance S can only be produced by working with [a strange AI from another dimension whose only goal is to maximize the quantity of paperclips] – substance S can also be used to produce paperclips. The paperclip maximizer only cares about the number of paperclips in its own universe, not in ours, so we can’t offer to produce or threaten to destroy paperclips here. We have never interacted with the paperclip maximizer before, and will never interact with it again.

Both humanity and the paperclip maximizer will get a single chance to seize some additional part of substance S for themselves, just before the dimensional nexus collapses; but the seizure process destroys some of substance S.

The payoff matrix is as follows:

Humans cooperate Humans defect
AI cooperates 2 billion lives saved, 2 paperclips gained 3 billion lives, 0 paperclips
AI defects 0 lives, 3 paperclips 1 billion lives, 1 paperclip

From our point of view, it obviously makes sense from a practical, and in this case moral, standpoint that we should defect; there is no way that a paperclip in another universe can be worth a billion lives. From the AI’s point of view, defecting always leads to one extra paperclip, and its code assigns a value to human life of exactly zero; hence, it will defect. However, the outcome that this leads to is clearly worse for both parties than if the humans and AI both cooperated – but then, if the AI was going to cooperate, we could save even more lives by defecting ourselves, and likewise for the AI if we were to cooperate.

In the real world, many two-party prisoner’s dilemmas on the small scale are resolved through the mechanism of trade and the ability of a legal system to enforce contracts and laws; in this case, if there existed a god who has absolute power over both universes but cared only about compliance with one’s prior agreements, the humans and the AI could sign a contract to cooperate and ask the god to simultaneously prevent both from defecting. When there is no ability to pre-contract, laws penalize unilateral defection. However, there are still many situations, particularly when many parties are involved, where opportunities for defection exist:

  • Alice is selling lemons in a market, but she knows that her current batch is low quality and once customers try to use them they will immediately have to throw them out. Should she sell them anyway? (Note that this is the sort of marketplace where there are so many sellers you can’t really keep track of reputation). Expected gain to Alice: $5 revenue per lemon minus $1 shipping/store costs = $4. Expected cost to society: $5 revenue minus $1 costs minus $5 wasted money from customer = -$1. Alice sells the lemons.
  • Should Bob donate $1000 to Bitcoin development? Expected gain to society: $10 * 100000 people – $1000 = $999000, expected gain to Bob: $10 – $1000 = -$990, so Bob does not donate.
  • Charlie found someone else’s wallet, containing $500. Should he return it? Expected gain to society: $500 (to recipient) – $500 (Charlie’s loss) + $50 (intangible gain to society from everyone being able to worry a little less about the safety of their wallets). Expected gain to Charlie: -$500, so he keeps the wallet.
  • Should David cut costs in his factory by dumping toxic waste into a river? Expected gain to society: $1000 savings minus $10 average increased medical costs * 100000 people = -$999000, expected gain to David: $1000 – $10 = $990, so David pollutes.
  • Eve developed a cure for a type of cancer which costs $500 per unit to produce. She can sell it for $1000, allowing 50,000 cancer patients to afford it, or for $10000, allowing 25,000 cancer patients to afford it. Should she sell at the higher price? Expected gain to society: -25,000 lives (including Alice’s profit, which cancels’ out the wealthier buyers’ losses). Expected gain to Eve: $237.5 million profit instead of $25 million = $212.5 million, so Eve charges the higher price.

Of course, in many of these cases, people sometimes act morally and cooperate, even though it reduces their personal situation. But why do they do this? We were produced by evolution, which is generally a rather selfish optimizer. There are many explanations. One, and the one we will focus on, involves the concept of superrationality.


Consider the following explanation of virtue, courtesy of David Friedman:

I start with two observations about human beings. The first is that there is a substantial connection between what goes on inside and outside of their heads. Facial expressions, body positions, and a variety of other signs give us at least some idea of our friends’ thoughts and emotions. The second is that we have limited intellectual ability–we cannot, in the time available to make a decision, consider all options. We are, in the jargon of computers, machines of limited computing power operating in real time.
Suppose I wish people to believe that I have certain characteristics–that I am honest, kind, helpful to my friends. If I really do have those characteristics, projecting them is easy–I merely do and say what seems natural, without paying much attention to how I appear to outside observers. They will observe my words, my actions, my facial expressions, and draw reasonably accurate conclusions.
Suppose, however, that I do not have those characteristics. I am not (for example) honest. I usually act honestly because acting honestly is usually in my interest, but I am always willing to make an exception if I can gain by doing so. I must now, in many actual decisions, do a double calculation. First, I must decide how to act–whether, for example, this is a good opportunity to steal and not be caught. Second, I must decide how I would be thinking and acting, what expressions would be going across my face, whether I would be feeling happy or sad, if I really were the person I am pretending to be.
If you require a computer to do twice as many calculations, it slows down. So does a human. Most of us are not very good liars.
If this argument is correct, it implies that I may be better off in narrowly material terms–have, for instance, a higher income–if I am really honest (and kind and …) than if I am only pretending to be, simply because real virtues are more convincing than pretend ones. It follows that, if I were a narrowly selfish individual, I might, for purely selfish reasons, want to make myself a better person–more virtuous in those ways that others value.
The final stage in the argument is to observe that we can be made better–by ourselves, by our parents, perhaps even by our genes. People can and do try to train themselves into good habits–including the habits of automatically telling the truth, not stealing, and being kind to their friends. With enough training, such habits become tastes–doing “bad” things makes one uncomfortable, even if nobody is watching, so one does not do them. After a while, one does not even have to decide not to do them. You might describe the process as synthesizing a conscience.

Essentially, it is cognitively hard to convincingly fake being virtuous while being greedy whenever you can get away with it, and so it makes more sense for you to actually be virtuous. Much ancient philosophy follows similar reasoning, seeing virtue as a cultivated habit; David Friedman simply did us the customary service of an economist and converted the intuition into more easily analyzable formalisms. Now, let us compress this formalism even further. In short, the key point here is that humans are leaky agents – with every second of our action, we essentially indirectly expose parts of our source code. If we are actually planning to be nice, we act one way, and if we are only pretending to be nice while actually intending to strike as soon as our friends are vulnerable, we act differently, and others can often notice.

This might seem like a disadvantage; however, it allows a kind of cooperation that was not possible with the simple game-theoretic agents described above. Suppose that two agents, A and B, each have the ability to “read” whether or not the other is “virtuous” to some degree of accuracy, and are playing a symmetric Prisoner’s Dilemma. In this case, the agents can adopt the following strategy, which we assume to be a virtuous strategy:

  1. Try to determine if the other party is virtuous.
  2. If the other party is virtuous, cooperate.
  3. If the other party is not virtuous, defect.

If two virtuous agents come into contact with each other, both will cooperate, and get a larger reward. If a virtuous agent comes into contact with a non-virtuous agent, the virtuous agent will defect. Hence, in all cases, the virtuous agent does at least as well as the non-virtuous agent, and often better. This is the essence of superrationality.

As contrived as this strategy seems, human cultures have some deeply ingrained mechanisms for implementing it, particularly relating to mistrusting agents who try hard to make themselves less readable – see the common adage that you should never trust someone who doesn’t drink. Of course, there is a class of individuals who can convincingly pretend to be friendly while actually planning to defect at every moment – these are called sociopaths, and they are perhaps the primary defect of this system when implemented by humans.

Centralized Manual Organizations…

This kind of superrational cooperation has been arguably an important bedrock of human cooperation for the last ten thousand years, allowing people to be honest to each other even in those cases where simple market incentives might instead drive defection. However, perhaps one of the main unfortunate byproducts of the modern birth of large centralized organizations is that they allow people to effectively cheat others’ ability to read their minds, making this kind of cooperation more difficult.

Most people in modern civilization have benefited quite handsomely, and have also indirectly financed, at least some instance of someone in some third world country dumping toxic waste into a river to build products more cheaply for them; however, we do not even realize that we are indirectly participating in such defection; corporations do the dirty work for us. The market is so powerful that it can arbitrage even our own morality, placing the most dirty and unsavory tasks in the hands of those individuals who are willing to absorb their conscience at lowest cost and effectively hiding it from everyone else. The corporations themselves are perfectly able to have a smiley face produced as their public image by their marketing departments, leaving it to a completely different department to sweet-talk potential customers. This second department may not even know that the department producing the product is any less virtuous and sweet than they are.

The internet has often been hailed as a solution to many of these organizational and political problems, and indeed it does do a great job of reducing information asymmetries and offering transparency. However, as far as the decreasing viability of superrational cooperation goes, it can also sometimes make things even worse. Online, we are much less “leaky” even as individuals, and so once again it is easier to appear virtuous while actually intending to cheat. This is part of the reason why scams online and in the cryptocurrency space are more common than offline, and is perhaps one of the primary arguments against moving all economic interaction to the internet a la cryptoanarchism (the other argument being that cryptoanarchism removes the ability to inflict unboundedly large punishments, weakening the strength of a large class of economic mechanisms).

A much greater degree of transparency, arguably, offers a solution. Individuals are moderately leaky, current centralized organizations are less leaky, but organizations where randomly information is constantly being released to the world left, right and center are even more leaky than individuals are. Imagine a world where if you start even thinking about how you will cheat your friend, business partner or spouse, there is a 1% chance that the left part of your hippocampus will rebel and send a full recording of your thoughts to your intended victim in exchange for a $7500 reward. That is what it “feels” like to be the management board of a leaky organization.

This is essentially a restatement of the founding ideology behind Wikileaks, and more recently an incentivized Wikileaks alternative, came out to push the envelope further. However, Wikileaks exists, and yet shadowy centralized organizations also continue to still exist and are in many cases still quite shadowy. Perhaps incentivization, coupled with prediction-like-mechanisms for people to profit from outing their employers’ misdeeds, is what will open the floodgates for greater transparency, but at the same time we can also take a different route: offer a way for organizations to make themselves voluntarily, and radically, leaky and superrational to an extent never seen before.

… and DAOs

Decentralized autonomous organizations, as a concept, are unique in that their governance algorithms are not just leaky, but actually completely public. That is, while with even transparent centralized organizations outsiders can get a rough idea of what the organization’s temperament is, with a DAO outsiders can actually see the organization’s entire source code. Now, they do not see the “source code” of the humans that are behind the DAO, but there are ways to write a DAO’s source code so that it is heavily biased toward a particular objective regardless of who its participants are. A futarchy maximizing the average human lifespan will act very differently from a futarchy maximizing the production of paperclips, even if the exact same people are running it. Hence, not only is it the case that the organization will make it obvious to everyone if they start to cheat, but rather it’s not even possible for the organization’s “mind” to cheat.

Now, what would superrational cooperation using DAOs look like? First, we would need to see some DAOs actually appear. There are a few use-cases where it seems not too far-fetched to expect them to succeed: gambling, stablecoins, decentralized file storage, one-ID-per-person data provision, SchellingCoin, etc. However, we can call these DAOs type I DAOs: they have some internal state, but little autonomous governance. They cannot ever do anything but perhaps adjust a few of their own parameters to maximize some utility metric via PID controllers, simulated annealing or other simple optimization algorithms. Hence, they are in a weak sense superrational, but they are also rather limited and stupid, and so they will often rely on being upgraded by an external process which is not superrational at all.

In order to go further, we need type II DAOs: DAOs with a governance algorithm capable of making theoretically arbitrary decisions. Futarchy, various forms of democracy, and various forms of subjective extra-protocol governance (ie. in case of substantial disagreement, DAO clones itself into multiple parts with one part for each proposed policy, and everyone chooses which version to interact with) are the only ones we are currently aware of, though other fundamental approaches and clever combinations of these will likely continue to appear. Once DAOs can make arbitrary decisions, then they will be able to not only engage in superrational commerce with their human customers, but also potentially with each other.

What kinds of market failures can superrational cooperation solve that plain old regular cooperation cannot? Public goods problems may unfortunately be outside the scope; none of the mechanisms described here solve the massively-multiparty incentivization problem. In this model, the reason why organizations make themselves decentralized/leaky is so that others will trust them more, and so organizations that fail to do this will be excluded from the economic benefits of this “circle of trust”. With public goods, the whole problem is that there is no way to exclude anyone from benefiting, so the strategy fails. However, anything related to information asymmetries falls squarely within the scope, and this scope is large indeed; as society becomes more and more complex, cheating will in many ways become progressively easier and easier to do and harder to police or even understand; the modern financial system is just one example. Perhaps the true promise of DAOs, if there is any promise at all, is precisely to help with this.


Vitalik Buterin


Author Donald McIntyre

Posted at 12:22 am January 24, 2015.

I wonder if the heuristic described above is learned or genetic, I wonder if the fact that the non-virtuous and “sociopath” members of society have a normal distribution and are always present because they represent a positive evolutionary element of society.
Try to determine if the other party is virtuous.
If the other party is virtuous, cooperate.
If the other party is not virtuous, defect.


    Author Marcel Soanes

    Posted at 12:02 pm January 24, 2015.

    >And I wonder if the fact that the non-virtuous and “sociopath” members of society have a normal distribution across mankind’s existence and are always present because they represent a positive evolutionary element of society?

    You might be interested in looking up evolutionary game theory for this.


      Author Donald McIntyre

      Posted at 7:10 pm January 24, 2015.

      I read “The Origin Of Species” and “Evolution And Culture” and they were two of the most revealing books I ever read. I am taking a Game Theory course on Coursera, but I am definitely going to follow up with Evolutionary Game Theory. Thx!


Author Eugene Wang

Posted at 6:38 am January 24, 2015.

For a while, I was thinking of a lending DAO that lent money to DAOs, but first looked at their source code for a piece of code that ensures that they can’t renege their promises to pay back their loans (as long as they have enough money). This brought it back.


    Author Smithgift

    Posted at 3:09 am January 25, 2015.

    Interesting idea, but I think the key phrase is “as long as they have enough money.” The DAO could be set to transfer all of its assets to another the moment before the bill comes due. To prevent this would require the repayment code to have control over any possible expenditure, at which point we hit the Halting Problem (and simple practicality) almost instantly.

    That said, the entities that programmed the DAO to deceive the Lending-DAO might be easily discovered through the blockchain.


Author Meher Roy

Posted at 3:02 pm January 24, 2015.

Very interesting blog. How is this different from the older idea of radical transparency?


    Author Vitalik Buterin

    Posted at 5:48 pm January 24, 2015.

    Key difference is this:

    > Hence, not only is it the case that the organization will make it obvious to everyone if they start to cheat, but rather it’s not even possible for the organization’s “mind” to cheat.

    It’s not about having a transparent thought process, it’s about actually changing and provably committing to a new thought process on an organizational level.


      Author Meher Roy

      Posted at 6:28 pm January 24, 2015.

      Thanks, makes sense.

      Feel that’s how Government should be.. Seems a better line than attempting to be rid of Government all together.

      Society with better decentralised governance, rather than no governance.


      Author Joshua Davis

      Posted at 8:33 pm January 30, 2015.

      I find this conversation to be very pertinent in light of James D’Angelo’s carboard box reform video. We think of transparency as always being good but if that was the case why was there a need to institute the secret ballot? So transparency is good in the sense of accountability but it also makes the transparent subject vulnerable to bribes and to intimidation. Seems like the ideal solution is to be transparent in the execution of policy and to be secret in the process in which policy is created. The smoke-filled back room of secrecy is something people hate but people also hate the fact that lobbyists wield a lot of power relative to the money they spend to influence elections. If you want people to vote their conscience free from intimidation and from bribes you need to shield them from the retribution which might result if they choose to vote contrary to the interests of those who have money or power. The majority of voters don’t have money or power, they are not special interests lobbies but they want their elected officials to be transparent. In this way they actually desire something that is contrary to their best interest. So I think this article would be really helpful if it could differentiate between policy creation and policy execution. For the latter transparency is highly desired and for the former transparency can result those citizens with the most money and most power to lobby (aka bribe) and intimidate deciding how policy is created.


Author Tal

Posted at 2:48 pm January 25, 2015.

There is some interesting academic work on some of theses issues and I apologize for not giving here citations to some of them..

First, much of the behavior of humans comes from the evolutionary profit of being kind and virtuous toward those sharing genes with you (primarily your family, lets call them “us”) and being vicious towards others (“them”). Modern days created cultural issues that replaces “us” with people similar to us, and “them” as all the rest.

The big difference is that some people define “us” as themselves, while others defines “us” as the entire human species. Hence the big difference in behavior. Space faring people (Astronauts, Cosmonauts etc.) BTW report and “Overview effect”, where people being in space and seeing the entire glove first hand, return home with a desire to help humanity.

Second, many claim that human decision making is irrational by definition. The decision itself is being made first, and the rational applied later to explain and support it. The “feelings” on which those decisions are made were distilled by evolution and as such cause the decisions to “seem” rational, since, well, that’s what evolution do 🙂

Third. DAOs are definitely the future, for countless reasons. It’s just a matter of time and some tinkering with the correct algorithms and mechanisms that lay at their base to make them mathematically and economically robust. We already created airplanes, cell phones and genetically modified chickens, so how hard can that be?!

Thanks for the interesting post!


    Author Dor Konforty

    Posted at 2:56 pm January 30, 2015.

    Superrationality, enlightened self-interest, technology -> transparent computerized minds do not require trust, eliminating options of (game-theoretical) defect.

    This is not crazy, but inevitable.


Author A Hzg

Posted at 9:10 pm January 25, 2015.

very big fan of your blog post,

I wonder if you could continue the conversation on email ?
I think Superrationality as you said is matter of scale and this is must interesting parameter in human development when we are thinking about globalization. Now this is my next question, how reduce the nefast effect of globalization (for instance to protect our planet of ecological disaster)? Is human alone is able of super Superrationality at a large scale or he needs to have a DAO to keep track of this progress ?


Author Kyle MacDonald

Posted at 11:30 am January 26, 2015.

Full transparency is the only option ! Privacy Is The Enemy.


Author Dor Konforty

Posted at 9:42 pm January 31, 2015.

Superrationality, enlightened self-interest, technology -> transparent computerized minds do not require trust, eliminating options of (game-theoretical) defect.

This is not crazy, but inevitable.


Author James D’Angelo

Posted at 4:45 pm March 27, 2015.

Vitalik, this is a truly mind-bending article, and I’m only halfway in. I’ll have more comments later, but I almsot fell over when I read that a fairly straightforward use case for DAOS is a “one-ID-per-person data provision”. Is there anything more out there on this? I noticed you linked to all the other stuff. Thanks in advance – James


Author Georgi Karov

Posted at 9:38 pm May 24, 2015.

<<< This is not crazy, but inevitable.



Author Robbie

Posted at 1:48 pm May 25, 2016.

One question remains after reading, @vitalik_buterin:disqus or anyone else, could you help me with it.

Vitalik you say:
“With public goods, the whole problem is that there is no way to exclude anyone from benefiting, so the strategy fails. ”

Few things I missunderstand:
1. what you mean by saying “public goods”?
2. When you say “this strategy” – you mean SuperRationality? or something else?
2. If so, then follows a last question – Why this strategy I presume it’s “SuperRationality” – fails public good organisations?

Thank you in advance 🙂

P.S. Amazing and deep post. Thank you


Leave a Reply

You may use these HTML tags and attributes: <a href="" title=""> <abbr title=""> <acronym title=""> <b> <blockquote cite=""> <cite> <code> <del datetime=""> <em> <i> <q cite=""> <s> <strike> <strong>

View Comments (17) ...