April 17th, 2007
How Aggregate Displays Change User Behavior
A fascinating study demonstrates how simply displaying aggregate data like Top 10 lists heavily influences the way people make decisions on social web sites.
Aggregate displays are everywhere, from the book ratings at Amazon.com to the most-emailed articles at the New York Times to the number of diggs at Digg.com. They’re a primary element of social design. They not only let people know how their actions relate to others, but they also alter the behavior of those who view them.
Columbia sociology professor Duncan Watts has written the fascinating piece Is Justin Timberlake the product of Cumulative Advantage?, describing a sociology experiment that has huge implications for the display of aggregate data on social web sites. (thankfully, the article isn’t about Justin Timberlake at all. The author doesn’t even mention his name…probably titled by an editor at the Times)
Description of the Experiment
Watt’s study demonstrates how socially influenced we are as we use software and make decisions online. He describes an experiment in which they built two web sites. In one web site they showed a list of songs to users and had the users rate the songs as they listened to them, allowing the users to download the songs if they wanted to. In the other, they also allowed the users to rate and download the songs but they also showed the download numbers to the users. In the second group the users could see how often the songs had been downloaded as they used the site. The difference was in the display: users either saw the aggregate display of downloads (called the social influence group) or they did not (called the independent group).
In addition, they split up the social influence group into 8 parts. They replicated the same test 8 times to see if there were any differences over time.
Watts describes how the download numbers could be used to test:
“This setup let us test the possibility of prediction in two very direct ways. First, if people know what they like regardless of what they think other people like, the most successful songs should draw about the same amount of the total market share in both the independent and social-influence conditions — that is, hits shouldn’t be any bigger just because the people downloading them know what other people downloaded. And second, the very same songs — the “best” ones — should become hits in all social-influence worlds.”
In other words, the mere fact that users can see the downloads is being tested. A simple display difference. A design decision.
Does Seeing Aggregate Data Change Behavior?
Does displaying aggregate download data change the behavior of users? The answer is Yes. Their findings:
- The most popular songs in the social influence group were way more popular than those in the independent group. In other words, the rich got richer when people could see the aggregate data.
- The popular songs in the 8 social influence groups were not the same! That is, the download numbers affected the popularity of the songs. Early download leaders continued to lead not just because they were good songs, but because they were already leading.
- The independent group was considered the test for quality. The songs that were most downloaded were considered the highest quality because everybody voted independently…there was no social influence since download numbers were not displayed. These songs did correlate slightly with the songs in the social influence group that did well, but did not have much of an effect overall.
- The social influence group was influenced much more by the number of downloads than by the quality of the songs. The social influence had a stronger effect on the song downloads than independent, unbiased decision making.
This result could be seen as a confirmation of the bandwagon effect, a known bias resulting from our tendency to follow the crowd. This bias is probably the result of ignorance…if we don’t know something we tend to rely on the opinion of others. In this case users probably paid attention to the download numbers because they didn’t have any prior experience with the music.
Outcome is Unpredictable
One outcome is that predicting what will happen in social influenced situations is impossible. Watts explains:
“In our artificial market, therefore, social influence played as large a role in determining the market share of successful songs as differences in quality. It’s a simple result to state, but it has a surprisingly deep consequence. Because the long-run success of a song depends so sensitively on the decisions of a few early-arriving individuals, whose choices are subsequently amplified and eventually locked in by the cumulative-advantage process, and because the particular individuals who play this important role are chosen randomly and may make different decisions from one moment to the next, the resulting unpredictably is inherent to the nature of the market.”
In other words, early leaders tend to stay in the lead simply because people see they are leading and are influenced by it. If a song gets an early lead then people assume it is because of quality, but the case may simply be that it happened to be one of the first listened to.
Huge Implications for Social Site Design
This result has huge implications for all social web sites, especially those that show aggregate data. Digg, for example, shows aggregate data everywhere on the site. This experiment, in addition to several other issues that I wrote about in Digg’s Design Dilemma, suggest that the results there are socially influenced to such an extent that it would be hard indeed to know where the quality lies…
It also leads to interesting questions for those building social sites. What data do we aggregate? What should we display? Where? What influence will it have on the future behavior of those who see it? Does it influence the quality of content? How so?….etc…etc…
Another level of complexity can be added on top of this. What if the users knew more about the download numbers? For example, what if a user knew their best friend hated one of these songs? Of course that’s going to affect our decision as well, and maybe moreso, because the user trusts their friends more than a simple download number.
A Note of Caution
Finally, a note of caution. Over at Publishing 2.0 Scott Karp extrapolates this finding further, suggesting (somewhat tongue-in-cheek) that it explains other phenomena like Web 2.0 and the blogging A-List. He wonders if the A-List is just riding a wave of initial popularity.
We should be careful to push this that far, however, because some things that Karp mentions happen over a lot longer a period of time and have many other factors involved. The A-Listers, for example, drop like a rock if they don’t continue to post and get linked to. They quickly lose their advantage in the face of low post output and other changes over time. In the study Watts tested the ratings for song preferences. Songs, of course, don’t change over time. My guess is that the result will apply relatively well to things that don’t change…songs, movies, texts…and less to things that don’t.
If Watt’s experiment is solid, however, it should help inform anybody building a social web site. If people are unduly influenced by aggregate data, then we cannot make any assumptions about the quality of it. In other words, much of the success in social systems is based on that most horrible of reasons…luck.
Links to this Post
Comments
1. Damien Ravé 10:21am, Tue 17th, 2007
Interesting finding, although I’m not that surprised. Every elitist knows that quantity doesn’t equate with quality, or that democracy doesn’t produce high standards. Now with a scientific proof to support this, we still have to find a better way to define, filter and display quality contents on the social web (that’s what we’re all struggling for, aren’t we ?). I like the idea of being able to evaluate the interest of a judgement based on who made it (a friend, a guru). That’s what we do on a micro-scale when reading a blog entry, but can we bring it to an industrial scale ?
2. Bob 6:02am, Wed 18th, 2007
Interesting read, many Top 100 competition polls that we have held show similar results. Another influence is real-estate, if people are happy with the selection at the top of the page they may not bother scrolling further down.
3. Bell 7:13am, Thu 19th, 2007
An interesting read, but it raises a question for me:
Is some of the effect not based on ease of navigation or “first come first served”?
Bob mentioned this in relation to real estate sites, and I would be very surprised if it didn’t apply everywhere. The most interesting, although impossible to get, stats would be on the position in a page of clickthroughs on Google or AltaVista. My money’s on 1-3 getting the lions share with 8-10 in second and 4-7 a poor third.
It would be interesting to repeat the experiment with the top 10 displayed in reverse and a list of randoms as two other possible variations.
4. Ben Clemens 3:03pm, Thu 19th, 2007
I enjoyed the article and agree it has big implications for social web design: as a warning sign. Sites like Digg that replicate the distortions of traditional publicity (like cumulative advantage) are reflections of old models of attention, and they are made more rabid by being semi-anonymous and unedited; there is nothing wrong with that, but it’s not essentially new, just faster and more distorted than even the old standard of celebrity news. Social media sites that really are new models of attention like Flickr are those that focus on enabling individuals to contribute long-tail content & metadata, without aggregate displays or leaderboards, and connect people through friends, small groups or search. I gotta say I prefer the latter.
5. catherine 11:18pm, Thu 19th, 2007
Interesting research and results. It demonstrates too why reporting election results before polls close can affect the outcome. People like to back ‘winners’.
6. Noah Brier 12:22pm, Mon 23rd, 2007
I didn’t think of this until I read your entry, but Watts’ article works right along with the ideas put forth in Wisdom of Crowds. The aggregate displays are really no different than telling people how many jelly beans others were guessing: It takes the independence out of the decision. Interesting stuff, thanks Josh.
7. Michael 5:24am, Sat 28th, 2007
So if you are a musician and if you have uploaded a song to a music community, then just download the song as often as possible and your song will be famous.
8. tmulligan 1:19pm, Sat 28th, 2007
interesting indeed, but with reference to Bell’s comment i would point out that in a recent test of a community site for an open source software project, my colleagues and i were surprised to learn that the top and very bottom of almost every page were consistently hot spots…
-t
9. tmulligan 1:50pm, Sat 28th, 2007
also regarding catherine’s comment and the influence of bandwagon behavior on elections, my concern is more about how this can be (and i believe has been) effectively used to split, erode, and ultimately nullify support for fledgling parties with agendas that appear relatively palatable to a fairly large portion of the population — resulting in predictably short term strategies taken in hopes of suffering a lesser evil…
i’m speaking here about the US, of course, but it strikes me that a) we definitely need additional parties along with a more representative system of power sharing (coalitions, etc) as is the habit with many democracies the world over these days and b) that if bookmarking is effectively voting (as has been suggested elsewhere on Bokardo), it’s no small consolation that when it comes to the internet, at least there are no set limitations on our voting — we need only struggle with our own susceptibility to hype and hyperbole and not the cunning of the Karl Roves of the world…