<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	xmlns:georss="http://www.georss.org/georss" xmlns:geo="http://www.w3.org/2003/01/geo/wgs84_pos#" xmlns:media="http://search.yahoo.com/mrss/"
	>

<channel>
	<title>Skippy Records &#187; Networks and Webs</title>
	<atom:link href="http://skippyrecords.wordpress.com/category/networks-and-webs/feed/" rel="self" type="application/rss+xml" />
	<link>http://skippyrecords.wordpress.com</link>
	<description></description>
	<lastBuildDate>Tue, 17 Jan 2012 22:41:56 +0000</lastBuildDate>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.com/</generator>
<cloud domain='skippyrecords.wordpress.com' port='80' path='/?rsscloud=notify' registerProcedure='' protocol='http-post' />
<image>
		<url>http://s2.wp.com/i/buttonw-com.png</url>
		<title>Skippy Records &#187; Networks and Webs</title>
		<link>http://skippyrecords.wordpress.com</link>
	</image>
	<atom:link rel="search" type="application/opensearchdescription+xml" href="http://skippyrecords.wordpress.com/osd.xml" title="Skippy Records" />
	<atom:link rel='hub' href='http://skippyrecords.wordpress.com/?pushpress=hub'/>
		<item>
		<title>Beckstrom&#039;s &quot;New Model of Network Valuation&quot;</title>
		<link>http://skippyrecords.wordpress.com/2009/04/09/beckstroms-new-model-of-network-valuation/</link>
		<comments>http://skippyrecords.wordpress.com/2009/04/09/beckstroms-new-model-of-network-valuation/#comments</comments>
		<pubDate>Thu, 09 Apr 2009 16:28:48 +0000</pubDate>
		<dc:creator>Dr. Skippy</dc:creator>
				<category><![CDATA[Networks and Webs]]></category>
		<category><![CDATA[network]]></category>
		<category><![CDATA[networks]]></category>
		<category><![CDATA[scale-free]]></category>

		<guid isPermaLink="false">http://blog.drskippy.com/?p=191</guid>
		<description><![CDATA[Earlier this year, Rod Beckstrom released a research paper &#8220;A New Model for Network Valuation.&#8221;  In his model Beckstrom proposes that the value of a network is the sum of the benefits to networked individuals minus the cost to networked individuals.  From a perspective, this is obviously true. However, I have some criticisms of this [...]<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=skippyrecords.wordpress.com&amp;blog=13069636&amp;post=191&amp;subd=skippyrecords&amp;ref=&amp;feed=1" width="1" height="1" />]]></description>
			<content:encoded><![CDATA[<p>Earlier this year, Rod Beckstrom released a research paper &#8220;<a title="A new modle for network valuation" href="http://beckstrom.com/The_Economics_of_Networks" target="_blank">A New Model for Network Valuation</a>.&#8221;  In his model Beckstrom proposes that the value of a network is the sum of the benefits to networked individuals minus the cost to networked individuals.  From a perspective, this is obviously true.</p>
<p>However, I have some criticisms of this model:</p>
<p>(1) He proposes this be called &#8220;Beckstrom&#8217;s Law&#8221; analogously to <a title="Metcalfe's Law on wikipedia" href="http://en.wikipedia.org/wiki/Metcalfe%27s_law" target="_blank">Metcalfe&#8217;s Law</a> Law and <a title="Reed's Law on wikipedia" href="http://en.wikipedia.org/wiki/Reed%27s_law" target="_blank">Reed&#8217;s Law</a> Laws (discussed here previously <a title="What is the value of your network?" href="http://blog.drskippy.com/2007/02/21/what-is-local-part-3-what-is-the-value-of-your-network/" target="_self">here</a> and <a title="Affinity over proximity" href="http://blog.drskippy.com/2007/02/21/what-is-local-part-2-affinity-over-proximity/" target="_blank">here</a>).</p>
<p>What is a &#8220;law&#8221; good for?  In the case of other soft laws&#8211;the most famous being <a title="Moore's Law on wikipedia" href="http://en.wikipedia.org/wiki/Moore%27s_law" target="_blank">Moore&#8217;s Law</a>&#8211;the value of the law comes from a scaling relation that gives some rough predictive capabilities independent of scale.  Moore&#8217;s law is a soft law because there isn&#8217;t anything about the physics of the problem that allows deriving Moore&#8217;s scaling law.  But the social/business network has behaved this way for decades.  Moore&#8217;s law is widely believed to break down as we reach the power and frequency limits of conventional integrated circuits manufacturing methods.   The physics of current circuit manufacturing techniques says that either Moore&#8217;s law will stop being an accurate predictor of the scaling of computation power, or we will make a significant shift in the physics we are using.</p>
<p>Thus a law provides predictive and analytical power:  How does <em>this </em>change as <em>that </em>changes?  Even a law being violated (i.e. a model breaking down) gives us signals indicating when important shifts in the relevance of a model to a social or physical system are occurring.</p>
<p>&#8220;Beckstrom&#8217;s Law&#8221; isn&#8217;t a law in this sense.  It derives no scaling relations.  Further, you get an idea of how difficult it is to make estimates of value in this accounting system when Beckstrom starts his discussion of phone network value by discussing the &#8220;upper limit&#8221; as GDP.</p>
<p>(2) Beckstrom&#8217;s Rules of Accounting. Useful laws provide an &#8220;abstraction jump.&#8221;  With powerful models, can accumulate the particulars of a system and use this leverage to understand and manipulate at a different scale, i.e., we can derive laws. For example, we sum up the location and amount of all the little masses in the earth and calculate the acceleration of falling objects at the surface of the earth&#8211;the theory allows us to think about acceleration of falling objects at the surface of the earth without worrying about the location and mass of all the little grains of stuff that make up the earth.  Keeping track of all the pieces of the earth is &#8220;accounting&#8221; (vital to the theory) but the power of the law comes from the leverage of the abstractions derived.  Beckstrom&#8217;s law doesn&#8217;t seem to have this quality. It doesn&#8217;t derive high-leverage abstractions from underlying structure.  It is an accounting method.</p>
<p>(3) Beckstrom&#8217;s valuations are <em>essentially </em>relative.  In particular, benefits and costs are calculated compared to the prevailing contextual way of performing the transactions.  They depend on the path through history rather than depending on form. In his example of Amazon.com, for example, the benefits of for a user of Amazon.com are realized in savings over driving to the local bookstore. He points out that Amazon.com also gets a &#8220;network benefit&#8221; from the transaction because they have wholesale and operational costs less than the purchase price of the book.  These are &#8220;benefits&#8221; of Amazon.com over the local bookseller based on cost savings for a commodity assumed to be available through both channels.</p>
<p>Before you discount this problem with a &#8220;what else is there?&#8221; consider that the system of agents and interactions may reach a novel state through losses&#8211;a new state not used in the relative benefits calculation.  The network may enable new states of the system that were not contemplated before the existence of the network.  This has important implications for the network security arguments in Beckstrom&#8217;s paper.  Losses may exceed benefits.  More importantly, losses may come from reaching system states that were not visited on the way to realizing benefits.</p>
<p>Backing this thinking up a bit, we realize that relative gains pose a problem too: the system of agents and interactions may enable transactions that were not possible in the prevailing contextual way of performing transactions.  Then the benefits can no more be calculated than the losses using an accounting method.</p>
<p>(4) Transactions don&#8217;t seem to neatly belong to a single network. In the Amazon.com example, presumably, Beckstrom was calculating the value of the Internet, or maybe just the World-Wide Web. Or maybe the value of the network of the network of roads allowing me to drive to the store vs. the value of the Internet?  Or maybe the network of all booksellers (since they all buy from the same publishers)?  Which network does the transaction belong to?  Or do we allocate a fraction of the transaction to each of the overlapping networks that can be argued to enable the benefits of the transaction?</p>
<p>(5) Very carefully (!) add historical benefits to predicted losses. (I think <a title="Taleb Home Page" href="http://www.fooledbyrandomness.com/" target="_blank">Taleb would argue never</a>!) These are apples and oranges. See multiple explanations of the current state of the world&#8217;s banking system.  Beckstrom argues that &#8220;optimal security investment occurs on the loss function line where it is tangent to the 45 degree line, or where one dollar of security investment equals one dollar of decrease in expected losses.&#8221;  This is true in two places on Beckstrom&#8217;s curve&#8211;once for very low security investment and again for moderate security investment.  In fact, my losses are bounded by &#8220;everything I value&#8221; for $0 security spending (I can lose everything, and no more). So the existence of two inflexion points seems accurate and how we choose between them based on Beckstrom&#8217;s arguments isn&#8217;t clear.</p>
<p>How do we know the curve doesn&#8217;t look like this all the way down?</p>
<div class="wp-caption aligncenter" style="width: 190px"><img title="Knees all the way down..." src="http://drskippy.net/img/kneesallthewaydown_2009-04-09.jpg" alt="Pathelogical risk-cost curve" width="180" height="180" /><p class="wp-caption-text">&quot;Pathelogical&quot; risk-cost curve. Risk Investment on the x-axis; Cost of losses on the y-axis.</p></div>
<p>More troubling is that security investment is historical accounting while the loss curve represent one of the infinite possible world lines.  What if security spending creates value rather than merely stemming losses&#8211;then we have more to lose than when we started security spending.  Maybe the curve looks like this?</p>
<div class="wp-caption aligncenter" style="width: 262px"><img title="Risk-cost region" src="http://drskippy.net/img/region_2009-04-09.jpg" alt="Risk-cost region" width="252" height="252" /><p class="wp-caption-text">Risk-cost region. Risk Investment on the x-axis; Cost of losses on the y-axis.</p></div>
<p>What will a law describing network value look like?</p>
<ul>
<li>Scaling laws from underlying size, structure and dynamics</li>
<li>It is highly likely that the values of constants essential to practice will be missing from the law.  Actual risk and benefit calculations will depend on the particulars of the system we are analyzing.</li>
<li>Provide some level of causal explanation for the generative, emergent qualities we observe from real networks every day.  It seems clear that networks enable new transactions. So using relative transaction value seems to miss the most exciting source of value from networks. A useful network value law will say something about the way that the future is not always a simple extrapolation of the past.</li>
<li>(There may be no such law&#8211;then maybe Beckstrom&#8217;s &#8220;Rules of Accounting&#8221; are the best we can do.)</li>
</ul>
<br /> Tagged: network, networks, scale-free <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/skippyrecords.wordpress.com/191/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/skippyrecords.wordpress.com/191/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godelicious/skippyrecords.wordpress.com/191/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/delicious/skippyrecords.wordpress.com/191/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gofacebook/skippyrecords.wordpress.com/191/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/facebook/skippyrecords.wordpress.com/191/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gotwitter/skippyrecords.wordpress.com/191/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/twitter/skippyrecords.wordpress.com/191/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gostumble/skippyrecords.wordpress.com/191/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/stumble/skippyrecords.wordpress.com/191/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godigg/skippyrecords.wordpress.com/191/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/digg/skippyrecords.wordpress.com/191/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/goreddit/skippyrecords.wordpress.com/191/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/reddit/skippyrecords.wordpress.com/191/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=skippyrecords.wordpress.com&amp;blog=13069636&amp;post=191&amp;subd=skippyrecords&amp;ref=&amp;feed=1" width="1" height="1" />]]></content:encoded>
			<wfw:commentRss>http://skippyrecords.wordpress.com/2009/04/09/beckstroms-new-model-of-network-valuation/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
	
		<media:content url="http://1.gravatar.com/avatar/fd95bd67cd406fcb27a627a44570f2a2?s=96&#38;d=identicon&#38;r=G" medium="image">
			<media:title type="html">drskippy27</media:title>
		</media:content>

		<media:content url="http://drskippy.net/img/kneesallthewaydown_2009-04-09.jpg" medium="image">
			<media:title type="html">Knees all the way down...</media:title>
		</media:content>

		<media:content url="http://drskippy.net/img/region_2009-04-09.jpg" medium="image">
			<media:title type="html">Risk-cost region</media:title>
		</media:content>
	</item>
		<item>
		<title>Are you already &quot;socially networked&quot; with everyone you know?</title>
		<link>http://skippyrecords.wordpress.com/2008/11/16/are-you-already-socially-networked-with-everyone-you-know/</link>
		<comments>http://skippyrecords.wordpress.com/2008/11/16/are-you-already-socially-networked-with-everyone-you-know/#comments</comments>
		<pubDate>Mon, 17 Nov 2008 00:42:44 +0000</pubDate>
		<dc:creator>Dr. Skippy</dc:creator>
				<category><![CDATA[Networks and Webs]]></category>
		<category><![CDATA[social networks percolation simulation]]></category>

		<guid isPermaLink="false">http://h180745wp.setupmyblog.com/?p=109</guid>
		<description><![CDATA[Facebook and LinkedIn Network Connections Both my Facebook and LinkedIn social networks have gone through periods of rapid grown in the past year. Although I have been a member of LinkedIn for many years, my network expanded rapidly last year, approximately doubling in a couple of months.&#160; I have had a similar experience in the [...]<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=skippyrecords.wordpress.com&amp;blog=13069636&amp;post=109&amp;subd=skippyrecords&amp;ref=&amp;feed=1" width="1" height="1" />]]></description>
			<content:encoded><![CDATA[<h3>Facebook and LinkedIn Network Connections</h3>
<p>Both my Facebook and <a href="http://www.linkedin.com/pub/0/462/977">LinkedIn </a>social networks have gone through periods of rapid grown in the past year. Although I have been a member of LinkedIn for many years, my network expanded rapidly last year, approximately doubling in a couple of months.&nbsp; I have had a similar experience in the last few months with my Facebook network.</p>
<p>This is likely a convergence of factors.&nbsp; My contemporaries are reconnecting with high-school and college acquaintances on Facebook.&nbsp; More of the people I have worked with in the past have joined LinkedIn and started added colleagues from past employers.&nbsp; This has led to a critical mass of people both connecting and inviting others to connect.</p>
<p>In each case, it seems that when enough people in my circle of acquaintances are connected, new connections form rapidly because it isn&#8217;t far (in terms of network hops) to other people I know.</p>
<p>So am I already &quot;socially networked&quot; with everyone I know? I think the answer is yes.&nbsp; Because I am not very active at inviting new friends, it might be more accurate to say that I could be completely socially networked with an evening&#8217;s work.&nbsp; In terms of network dynamics, this is because the network of my local group of friends has undergone the phase transition network mathematicians refer to as &quot;percolation.&quot;<br />
<h3>A simple model of network percolation</h3>
<p>Where are without a model? Not very far on this blog. Instead of modeling a complete social network, let&#8217;s start with a simplified model and see if the behavior I want to describe pops out.</p>
<p>Here is a model that is easy to visualize. Imagine I am about to tile my kitchen floor. I draw a grid on the floor showing where each tile will be placed.&nbsp; Looking at the kitchen floor as I prepare to lay the tile, you see a simple grid of lines, like graph paper. Now, start adding tiles. But instead of working from one side of the kitchen to the other in rows (like a normal person), I add tiles randomly.</p>
<p>I lay one tile by the sink, then two by the fridge, then another by the stairwell, a couple in the center.&nbsp; The floor is getting cover red with tiles, but in a hodgepodge sprinkling.&nbsp; Not too many of them are sitting right next to each other at first.&nbsp; As I put down more and more tiles, however, there start to be little clusters of adjacent tiles. After a little while, the floor might look like this:</p>
<div style="text-align:center;"><img height="197" border="0" width="216" title="A few tiles in the kitchen tile network" alt="A few tiles in the kitchen tile network" src="http://drskippy.com/img/afewtiles_2008-11-16.png" /></div>
<p>But then, along comes my nephew Hayden, wanting to go to the sink for a drink of water. I don&#8217;t want him to walk where there are no tiles, because he will track glue everywhere. But he is small, and can&#8217;t take steps bigger than one tile. So I need to hurry to put down enough tiles so he can walk to the sink.&nbsp; But I don&#8217;t want to ruin my beautiful experiment with the random tiles.&nbsp; How many tiles do I have to put down (still placing them at random locations) before Hayden can walk on adjacent tiles from the living room to the sink to get a drink?</p>
<div style="text-align:center;"><img height="197" border="0" width="216" title="Kitchen tile network percolation" alt="Kitchen tile network percolation" src="http://drskippy.com/img/perctiles_2008-11-16.png" /></div>
<p>When he can do this, the edges of the network of tiles are connected.&nbsp; The network of tiles is said to have a percolating group of tiles.</p>
<p><span id="more-109"></span></p>
<h3>A slightly more complex simple network percolation model</h3>
<p> A mathematical description of the tile model goes like this.&nbsp; Each tile (graph nodes or vertices) can have at most 4 edges connecting it to other tiles (graph edges or links).&nbsp; The connections are created by placing adjacent tiles.</p>
<p> Let&#8217;s add a small generalization to the model to make it slightly more realistic. The tiles are connected by the fact that they are placed next to each other.&nbsp; But in a social network, you can be &quot;near&quot; someone, but choose not to connect.&nbsp; So let&#8217;s add the condition that each connection between adjacent people can be a friendship or not. And let&#8217;s allow this independently for each near neighbor. In the social network analogy, this means everyone person (node) can have at most 4 friendships (relationships, links to other nodes) but they don&#8217;t have to have be connected to someone they are &quot;near.&quot;&nbsp; Now people and friendships might look like this:</p>
<div style="text-align:center;"><img height="209" border="0" width="317" title="four-friend network" alt="four-friend network" src="http://drskippy.com/img/fourfriendnetwork_2008-11-16.png" /></div>
<p> The analogous percolation question for a social network: how many friendships have to be created before you can get anywhere in the network by traversing friendships?&nbsp; In other words, at what point is it possible to find everyone you know on a social network?</p>
<p> Further, regarding a period of rapid network growth: Is there some point in the evolution of the network when connections are created more rapidly?</p>
<p> To model this simple network, I created a simulation. You can <a title="simple percolation simulation" href="http://drskippy.com/python/simplePercolation.py">download the Python script</a> and run it yourself.&nbsp; I created a grid 6 x 6, giving 36 nodes.&nbsp; Central nodes can have at most 4 friends.&nbsp; Edge nodes can have 3 and corner nodes only 2. For the entire network, the number of possible edges is 2 x 6 x 5 = 60.</p>
<p> One reason to use a computer model, is that since I am placing the friendships randomly, each time we run the model on a 6 x 6 network, the details will come out slightly different. So we want to understand what happens on average.&nbsp; Using a computer model lets us run many (50 in this case) copies of the network experiment (without tiling a kitchen floor, and ripping it up over and over).</p>
<p> Instead of asking just about the path from the living room to the sink,&nbsp; or one side of our generalized network to another, let&#8217;s look at the overall connectivity of the nodes. How close are we to being able to traverse the graph from any point to any other point? For this, we measure the size of the largest group of connected nodes (people) as we add more edges (friendships).</p>
<p> The size of the largest group of connected people looks like this as we add friendships between the people:</p>
<div style="text-align:center;"><img border="0" title="Max group size vs edges in friendship graph" alt="Max group size vs edges in friendship graph" src="http://drskippy.com/img/MaxGroupSize_72_2008-11-16.png" /></div>
<p> From the plot above, you can see that the size of the largest group gets large very fast after about 30 friendships form.&nbsp; The network is fully connected after 40 friendships form&#8211;even though there are 60 friendships possible.</p>
<p> Okay, let&#8217;s close the loop on the analogy. If I had 36 high school friends on Facebook 3 months ago, they would have been difficult to find through each other.&nbsp; But over time, a few of them talked on the phone, met on the street etc. and invited each other to friend them on Facebook.&nbsp; This process was slow at first, but after about 30 friendships were formed, the process took off. One I was connected to a couple of sub-groups, I could find anyone I knew of my 36 high-school friends. Once the network percolates, I am socially networked with everyone I know.</p>
<br /> Tagged: social networks percolation simulation <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/skippyrecords.wordpress.com/109/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/skippyrecords.wordpress.com/109/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godelicious/skippyrecords.wordpress.com/109/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/delicious/skippyrecords.wordpress.com/109/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gofacebook/skippyrecords.wordpress.com/109/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/facebook/skippyrecords.wordpress.com/109/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gotwitter/skippyrecords.wordpress.com/109/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/twitter/skippyrecords.wordpress.com/109/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gostumble/skippyrecords.wordpress.com/109/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/stumble/skippyrecords.wordpress.com/109/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godigg/skippyrecords.wordpress.com/109/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/digg/skippyrecords.wordpress.com/109/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/goreddit/skippyrecords.wordpress.com/109/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/reddit/skippyrecords.wordpress.com/109/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=skippyrecords.wordpress.com&amp;blog=13069636&amp;post=109&amp;subd=skippyrecords&amp;ref=&amp;feed=1" width="1" height="1" />]]></content:encoded>
			<wfw:commentRss>http://skippyrecords.wordpress.com/2008/11/16/are-you-already-socially-networked-with-everyone-you-know/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
	
		<media:content url="http://1.gravatar.com/avatar/fd95bd67cd406fcb27a627a44570f2a2?s=96&#38;d=identicon&#38;r=G" medium="image">
			<media:title type="html">drskippy27</media:title>
		</media:content>

		<media:content url="http://drskippy.com/img/afewtiles_2008-11-16.png" medium="image">
			<media:title type="html">A few tiles in the kitchen tile network</media:title>
		</media:content>

		<media:content url="http://drskippy.com/img/perctiles_2008-11-16.png" medium="image">
			<media:title type="html">Kitchen tile network percolation</media:title>
		</media:content>

		<media:content url="http://drskippy.com/img/fourfriendnetwork_2008-11-16.png" medium="image">
			<media:title type="html">four-friend network</media:title>
		</media:content>

		<media:content url="http://drskippy.com/img/MaxGroupSize_72_2008-11-16.png" medium="image">
			<media:title type="html">Max group size vs edges in friendship graph</media:title>
		</media:content>
	</item>
		<item>
		<title>The Singlularity will always be near</title>
		<link>http://skippyrecords.wordpress.com/2008/10/01/the-singlularity-will-always-be-near/</link>
		<comments>http://skippyrecords.wordpress.com/2008/10/01/the-singlularity-will-always-be-near/#comments</comments>
		<pubDate>Wed, 01 Oct 2008 18:29:43 +0000</pubDate>
		<dc:creator>Dr. Skippy</dc:creator>
				<category><![CDATA[Networks and Webs]]></category>
		<category><![CDATA[systems conspiracy singularity]]></category>

		<guid isPermaLink="false">http://h180745wp.setupmyblog.com/?p=101</guid>
		<description><![CDATA[Kevin Kelly&#8217;s essay Thinkism is an important essay responding to apocalyptic thinking about the &#34;Singularity.&#34; This is a very clear and considered essay, written with deep understanding and experience of how order emerges in complex systems. Related, I highly recommend his book Out of Control. It is a little dated, but if you hold this [...]<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=skippyrecords.wordpress.com&amp;blog=13069636&amp;post=101&amp;subd=skippyrecords&amp;ref=&amp;feed=1" width="1" height="1" />]]></description>
			<content:encoded><![CDATA[<p>Kevin Kelly&#8217;s essay <a title="Thinkism" href="http://www.kk.org/thetechnium/archives/2008/09/thinkism.php">Thinkism</a> is an important essay responding to apocalyptic thinking about the &quot;<a title="Wikipedia - Singularity" target="_blank" href="http://en.wikipedia.org/wiki/Technological_singularity">Singularity</a>.&quot;  This is a very clear and considered essay, written with deep understanding and experience of how order emerges in complex systems.  Related, I highly recommend his book <a href="http://www.amazon.com/gp/product/0201483408?ie=UTF8&amp;tag=skipreco-20&amp;linkCode=as2&amp;camp=1789&amp;creative=9325&amp;creativeASIN=0201483408">Out of Control</a><img src="http://www.assoc-amazon.com/e/ir?t=skipreco-20&amp;l=as2&amp;o=1&amp;a=0201483408" width="1" height="1" border="0" alt="" style="border:none!important;margin:0!important;" />.  It is a little dated, but if you hold this in mind as you read, it is all the more wonderful for Kelly&#8217;s depth and prescience.</p>
<br /> Tagged: systems conspiracy singularity <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/skippyrecords.wordpress.com/101/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/skippyrecords.wordpress.com/101/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godelicious/skippyrecords.wordpress.com/101/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/delicious/skippyrecords.wordpress.com/101/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gofacebook/skippyrecords.wordpress.com/101/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/facebook/skippyrecords.wordpress.com/101/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gotwitter/skippyrecords.wordpress.com/101/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/twitter/skippyrecords.wordpress.com/101/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gostumble/skippyrecords.wordpress.com/101/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/stumble/skippyrecords.wordpress.com/101/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godigg/skippyrecords.wordpress.com/101/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/digg/skippyrecords.wordpress.com/101/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/goreddit/skippyrecords.wordpress.com/101/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/reddit/skippyrecords.wordpress.com/101/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=skippyrecords.wordpress.com&amp;blog=13069636&amp;post=101&amp;subd=skippyrecords&amp;ref=&amp;feed=1" width="1" height="1" />]]></content:encoded>
			<wfw:commentRss>http://skippyrecords.wordpress.com/2008/10/01/the-singlularity-will-always-be-near/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
	
		<media:content url="http://1.gravatar.com/avatar/fd95bd67cd406fcb27a627a44570f2a2?s=96&#38;d=identicon&#38;r=G" medium="image">
			<media:title type="html">drskippy27</media:title>
		</media:content>

		<media:content url="http://www.assoc-amazon.com/e/ir?t=skipreco-20&#38;l=as2&#38;o=1&#38;a=0201483408" medium="image" />
	</item>
		<item>
		<title>Day-to-Day Tall Head of URL Exploration</title>
		<link>http://skippyrecords.wordpress.com/2008/09/03/day-to-day-tall-head-of-url-exploration/</link>
		<comments>http://skippyrecords.wordpress.com/2008/09/03/day-to-day-tall-head-of-url-exploration/#comments</comments>
		<pubDate>Thu, 04 Sep 2008 00:53:38 +0000</pubDate>
		<dc:creator>Dr. Skippy</dc:creator>
				<category><![CDATA[Networks and Webs]]></category>
		<category><![CDATA[long tail]]></category>

		<guid isPermaLink="false">http://h180745wp.setupmyblog.com/?p=95</guid>
		<description><![CDATA[This is the 4th post on the statistics of URL exploration. In the previous three (The Long Tail of URL Exploration, What does the Nth Explorer of the Web Find? and The Tall Head of URL Exploration) I looked at how adding users grows the long tail and tall head of URLs for a single [...]<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=skippyrecords.wordpress.com&amp;blog=13069636&amp;post=95&amp;subd=skippyrecords&amp;ref=&amp;feed=1" width="1" height="1" />]]></description>
			<content:encoded><![CDATA[<p>This is the 4th post on the statistics of URL exploration. In the previous three (<a href="http://h180745wp.setupmyblog.com/2008/08/29/the-tall-head-of-url-exploration/" title="Long Tail URLs">The Long Tail of URL Exploration</a>, <a href="http://h180745wp.setupmyblog.com/2008/08/27/what-does-the-nth-explorer-of-the-web-find/" title="The Nth Explorer">What does the Nth Explorer of the Web Find?</a> and <a href="http://h180745wp.setupmyblog.com/2008/08/27/the-long-tail-of-url-exploration/" title="Tall head of URLs">The Tall Head of URL Exploration</a>) I looked at how adding users grows the long tail and tall head of URLs for a single day.&nbsp; Today, the data covers 20 days with a relatively constant population.</p>
<p>To get some idea how the tall head evolves, compare the tall head on day 0 with 19 successive days. The plot below shows the Top 10, Top 50, Top 100, Top 500 and Top 1000 URLs for day zero and the fraction of the top URLs on day zero appearing in the tall head on the nth day.</p>
<p>&nbsp;</p>
<div style="text-align:center;"><img height="331" width="331" border="0" src="http://drskippy.net/img/tallheaddays_2008-09-03.png" alt="Tall Head URLs by day" title="Tall Head URLs by day" /></div>
<div align="center">Figure 1.&nbsp; Red-Top 10 URLs; Blue Top 50 URLs; <br />Green-Top 100 URLs; Cyan-Top 500 URLs; Yellow-Top<br /> 1000 URLs. (URLs ranked by visits).</div>
<div align="center">&nbsp;</div>
<p>While the Top 10 and Top 50 URLs show stability day after day, the Top 500 and Top 1000 roll over at a fairly constant rate after day one.&nbsp; The plot can be used to estimate the size of the persistent tall head of URLs for this population and the rate at which the tall head evolves.</p>
<p>First, look for a change in behavior from maintaining a constant fraction of the day-0 URLs to a steady decline from day to day. By this heuristic, estimate the persistent tall head to be between 50 and 100 URLs.</p>
<p>Secondly, to estimate the turnover of the tall head, choose the approximate desired tall head, e.g., the Top 500 URLs (cyan), and look at the slope of the line for days 1-19. (Alternately, choose a timescale for which the tall head should turn over to a given fraction remaining, say, 75%, giving a timescale of approximately 15 days.)</p>
<div style="text-align:center;"><img height="331" width="331" border="0" src="http://drskippy.net/img/tallheaddaysfit_2008-09-03.png" alt="Tall Head Top 500 Fit" title="Tall Head Top 500 Fit" /></div>
<div style="text-align:center;">Figure 2.&nbsp; Red-Top 500 URLs; Blue Top 1000 URLs, shown<br />for comparison; Green-Fit to Top 500 URLs. (Days 1-20, <br />URLs ranked by visits).</div>
<p>The plot above shows the Top 500 URLs rollover about 0.5% per day from days 1 to 20.</p>
<br /><img alt="" border="0" src="http://feeds.wordpress.com/1.0/categories/skippyrecords.wordpress.com/95/" /> <img alt="" border="0" src="http://feeds.wordpress.com/1.0/tags/skippyrecords.wordpress.com/95/" /> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/skippyrecords.wordpress.com/95/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/skippyrecords.wordpress.com/95/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godelicious/skippyrecords.wordpress.com/95/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/delicious/skippyrecords.wordpress.com/95/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gofacebook/skippyrecords.wordpress.com/95/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/facebook/skippyrecords.wordpress.com/95/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gotwitter/skippyrecords.wordpress.com/95/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/twitter/skippyrecords.wordpress.com/95/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gostumble/skippyrecords.wordpress.com/95/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/stumble/skippyrecords.wordpress.com/95/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godigg/skippyrecords.wordpress.com/95/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/digg/skippyrecords.wordpress.com/95/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/goreddit/skippyrecords.wordpress.com/95/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/reddit/skippyrecords.wordpress.com/95/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=skippyrecords.wordpress.com&amp;blog=13069636&amp;post=95&amp;subd=skippyrecords&amp;ref=&amp;feed=1" width="1" height="1" />]]></content:encoded>
			<wfw:commentRss>http://skippyrecords.wordpress.com/2008/09/03/day-to-day-tall-head-of-url-exploration/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
	
		<media:content url="http://1.gravatar.com/avatar/fd95bd67cd406fcb27a627a44570f2a2?s=96&#38;d=identicon&#38;r=G" medium="image">
			<media:title type="html">drskippy27</media:title>
		</media:content>

		<media:content url="http://drskippy.net/img/tallheaddays_2008-09-03.png" medium="image">
			<media:title type="html">Tall Head URLs by day</media:title>
		</media:content>

		<media:content url="http://drskippy.net/img/tallheaddaysfit_2008-09-03.png" medium="image">
			<media:title type="html">Tall Head Top 500 Fit</media:title>
		</media:content>
	</item>
		<item>
		<title>The Tall Head of URL Exploration</title>
		<link>http://skippyrecords.wordpress.com/2008/08/29/the-tall-head-of-url-exploration/</link>
		<comments>http://skippyrecords.wordpress.com/2008/08/29/the-tall-head-of-url-exploration/#comments</comments>
		<pubDate>Fri, 29 Aug 2008 16:02:24 +0000</pubDate>
		<dc:creator>Dr. Skippy</dc:creator>
				<category><![CDATA[Networks and Webs]]></category>
		<category><![CDATA[long tail]]></category>

		<guid isPermaLink="false">http://h180745wp.setupmyblog.com/?p=93</guid>
		<description><![CDATA[In The Long Tail of URL Exploration, I looked at the distribution of URL visits by 102K people in a day.&#160; In What does the Nth Explorer of the Web Find?, I looked at how adding users grows the long tail and the number of unique URLs explored.&#160; In this 3rd (of 4) post, I [...]<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=skippyrecords.wordpress.com&amp;blog=13069636&amp;post=93&amp;subd=skippyrecords&amp;ref=&amp;feed=1" width="1" height="1" />]]></description>
			<content:encoded><![CDATA[<p>In <a title="long tail of url exploration" href="http://h180745wp.setupmyblog.com/2008/08/27/the-long-tail-of-url-exploration/">The Long Tail of URL Exploration</a>, I looked at the distribution of URL visits by 102K people in a day.&nbsp; In <a title="what does the nth explorer find" href="http://h180745wp.setupmyblog.com/2008/08/27/what-does-the-nth-explorer-of-the-web-find/">What does the Nth Explorer of the Web Find?</a>, I looked at how adding users grows the long tail and the number of unique URLs explored.&nbsp; In this 3rd (of 4) post, I look at the how the tall head changes as we add explorers.</p>
<p>The tall head is the short list of sites that get the most visits.&nbsp; We could call it the Top 10, Top 100 or whatever we think is relevant.&nbsp; Later I propose a couple of heuristics for determining tall head membership in real time.</p>
<p>The tall head is made up of URLs that much of the population visits.&nbsp; These are the &quot;winner take all&quot; URLs of user attention&#8211;URLs like cnn.com, google.com, or facebook.com.&nbsp; For this reason, we might expect that while the long tail is growing with more unique URLs and the number of URLs with 2-3 hits is growing rapidly, the tall head is relatively stable.</p>
<p>This is the case.</p>
<p>One way to think about how stable the tall head might be is to ask how well a subset of the population predicts membership in the tall head for the entire sample.&nbsp; The data from the last two posts is well-suited to look at this question.</p>
<p>Below is a plot of the accuracy of various subsets of the population (the same subsets we used previously) in predicting the Top 10, Top 20, Top 50 and Top 100 URLs of the entire population.&nbsp; Just over 40% percent of the population predicts the Top 100 URLs with 90% accuracy.&nbsp; The Top 10 are predicted to 90% accuracy by 10% of the population.</p>
<div style="text-align:center;"><img border="0" title="predicting the tall head" alt="predicting the tall head" src="http://drskippy.net/img/predtallhead_20080829" /></div>
<div align="center">Figure.&nbsp; Red-Top 10 URLs by visit; Blue Top 20 </div>
<div align="center">URLs by visit; Green-Top50 URLs by visits; Cyan-Top </div>
<div align="center">100 URLs by visits.</div>
<p>The composition of the tall head depends relatively weakly on the subset of the population doing the predicting.</p>
<p>How can I predict the tall head for the day by 9 am in the morning? This is the real-time problem of long tail distributions.&nbsp; The dynamics of the system are that real time Web exploration data appears as a time-ordered list of URLs from whatever users happen to be surfing.&nbsp; This means that a real time heuristic for determining top URLs for the day has to rely on the properties of the time series including a small surfer sample size and recent counts of visits.</p>
<p>Fortunately, by the results illustrated above, a small sample size is a pretty good bet for determining tall head URLs. What we are still missing is metrics or intuition for how the long tail distribution evolves over time.</p>
<p>We do know that for a URL to end up in the tall head, it must be visited by many Web explorers.&nbsp; This means that we can rule out all URLs that are visited by only one or two users.&nbsp; This assumption also leads to a heuristic based on the time between visits&#8211;URLs visited by many people should have the same visit/time distribution as the users/time distribution of the entire sample.&nbsp; More specifically, we might guess that if the time between visits has an average near 1 day/number of visits and relatively low variance, it is likely to be in the tall head.&nbsp; A project for a little later&#8230;</p>
<p>Next post: How does the composition of the tall head change from day to day?</p>
<br /><img alt="" border="0" src="http://feeds.wordpress.com/1.0/categories/skippyrecords.wordpress.com/93/" /> <img alt="" border="0" src="http://feeds.wordpress.com/1.0/tags/skippyrecords.wordpress.com/93/" /> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/skippyrecords.wordpress.com/93/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/skippyrecords.wordpress.com/93/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godelicious/skippyrecords.wordpress.com/93/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/delicious/skippyrecords.wordpress.com/93/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gofacebook/skippyrecords.wordpress.com/93/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/facebook/skippyrecords.wordpress.com/93/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gotwitter/skippyrecords.wordpress.com/93/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/twitter/skippyrecords.wordpress.com/93/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gostumble/skippyrecords.wordpress.com/93/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/stumble/skippyrecords.wordpress.com/93/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godigg/skippyrecords.wordpress.com/93/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/digg/skippyrecords.wordpress.com/93/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/goreddit/skippyrecords.wordpress.com/93/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/reddit/skippyrecords.wordpress.com/93/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=skippyrecords.wordpress.com&amp;blog=13069636&amp;post=93&amp;subd=skippyrecords&amp;ref=&amp;feed=1" width="1" height="1" />]]></content:encoded>
			<wfw:commentRss>http://skippyrecords.wordpress.com/2008/08/29/the-tall-head-of-url-exploration/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
	
		<media:content url="http://1.gravatar.com/avatar/fd95bd67cd406fcb27a627a44570f2a2?s=96&#38;d=identicon&#38;r=G" medium="image">
			<media:title type="html">drskippy27</media:title>
		</media:content>

		<media:content url="http://drskippy.net/img/predtallhead_20080829" medium="image">
			<media:title type="html">predicting the tall head</media:title>
		</media:content>
	</item>
		<item>
		<title>What does the Nth explorer of the Web find?</title>
		<link>http://skippyrecords.wordpress.com/2008/08/27/what-does-the-nth-explorer-of-the-web-find/</link>
		<comments>http://skippyrecords.wordpress.com/2008/08/27/what-does-the-nth-explorer-of-the-web-find/#comments</comments>
		<pubDate>Wed, 27 Aug 2008 20:17:59 +0000</pubDate>
		<dc:creator>Dr. Skippy</dc:creator>
				<category><![CDATA[Networks and Webs]]></category>
		<category><![CDATA[long tail]]></category>

		<guid isPermaLink="false">http://h180745wp.setupmyblog.com/?p=91</guid>
		<description><![CDATA[In The Long Tail of URL Exploration, I looked at the distribution of URLs and visits. This was on the way to trying to answer questions like: How much overlap is there between the URLs 10 people visit and those in the 11th person&#8217;s click stream? How about the 100th or 100,000th person? Does the [...]<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=skippyrecords.wordpress.com&amp;blog=13069636&amp;post=91&amp;subd=skippyrecords&amp;ref=&amp;feed=1" width="1" height="1" />]]></description>
			<content:encoded><![CDATA[<p>In <a href="http://h180745wp.setupmyblog.com/2008/08/27/the-long-tail-of-url-exploration/" title="The Long Tail or URL Exploration">The Long Tail of URL Exploration</a>, I looked at the distribution of URLs and visits. This was on the way to trying to answer questions like:</p>
<ul>
<li>How much overlap is there between the URLs 10 people visit and those in the 11th person&#8217;s click stream? </li>
<li>How about the 100th or 100,000th person? Does the millionth user explore any unique URLs at all? </li>
<li>Can we build a model to answer How many people are required to crawl 10% of the Web?</li>
</ul>
<p>The second part of the answer is to look at how the model of URLs and visits evolves as we add users.&nbsp; To get samples with of different sizes using the same click stream data set, randomly select a subset of the users and run the analysis from the previous post.&nbsp; Through everyone back into the pot and randomly select a slightly larger set.&nbsp; Repeat.</p>
<p>I reran the model for 3%, 4%, 5%, 7%, 9%, 11%, 15%, 20%, 30%, 40%, 50%, 60%, 70%, 80% and 90% of the overall users in the 1-day data set. The sample sized ranged from 3,000 to 91,200 users. For the entire data set, the average user made 184 URL visits during the day. In the randomly chosen subsets, users made an average of between 181 and 187 URL visits with most of the variation in the smaller sample size as expected.</p>
<p>Do I expect the number of unique URLs be linearly proportional to the number of users? Or if users are visiting many of the same URLs and URLs tend to have the &quot;winner take all&quot; properties we looked at before, we might expect the number of unique URLs added by the 90,000th user to be fewer than the number of unique URLs added by the 1,000th user.</p>
<p>I first plotted the number of unique URLs against the number of users in each sample.&nbsp; The curve looks straight but may be slightly concave downward.&nbsp; It is very subtly.&nbsp; I needed to look at the data in a way the amplified the change over the various subsamples.</p>
<p>Below is a plot of the number of unique URLs/user vs. the number of users. This line is flat if the number of URLs is growing linearly with the number of users.</p>
<div style="text-align:center;"><img height="331" width="331" border="0" src="http://drskippy.net/img/uniqperuser_20080827.png" alt="URLs per user" title="URLs per user" /></div>
<p>The blue curve is the best fit to another power function ( f(x)=ax^k ). The first few thousand users are contribute more original URLs (&gt;90 URLs per user) to the sample than the 100,000th (83 URLs).&nbsp; If you are the first explorer of the a new world, all of your discoveries are original; when you are a late comer, your contributions are around the margins. It may be surprising how much original content being explored by the 100,000th explorer.</p>
<p>Does the long tail get relatively longer or shorter?&nbsp; For simplicity, I use the URLs with only one visit to represent the long tail.&nbsp; Then ratio of 1-visit URLs to unique URLs decreases subtly. For the smallest samples size 70.0% of the unique URLs are hit only once; for the overall data set, the ratio is 69.2%. To amplify this change like above, the plot of 1-visit URLs per user is shown below.</p>
<div style="text-align:center;"><img height="331" width="331" border="0" src="http://drskippy.net/img/longtailperuser_20080827.png" alt="long tail per user" title="long tail per user" /></div>
<p>At 100,000 users, the long tail is growing at 57 URLs per additional user.&nbsp; The decrease with each additional user is slowing.&nbsp; The blue curve is the best fit to another power function.</p>
<p>If the power function is the best explanation of the underlying dynamics, the number of unique URLs and the long tail both continue to grow no matter how many people are exploring.&nbsp; Since an increasing number of people need to explore to keep the exploration rate constant, the cost of exploration per URL goes up as explorers are added.</p>
<p>Does anything interesting happen in the tall head where the big winners are? That will have to wait for another post.</p>
<p>&nbsp;</p>
<br /><img alt="" border="0" src="http://feeds.wordpress.com/1.0/categories/skippyrecords.wordpress.com/91/" /> <img alt="" border="0" src="http://feeds.wordpress.com/1.0/tags/skippyrecords.wordpress.com/91/" /> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/skippyrecords.wordpress.com/91/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/skippyrecords.wordpress.com/91/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godelicious/skippyrecords.wordpress.com/91/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/delicious/skippyrecords.wordpress.com/91/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gofacebook/skippyrecords.wordpress.com/91/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/facebook/skippyrecords.wordpress.com/91/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gotwitter/skippyrecords.wordpress.com/91/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/twitter/skippyrecords.wordpress.com/91/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gostumble/skippyrecords.wordpress.com/91/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/stumble/skippyrecords.wordpress.com/91/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godigg/skippyrecords.wordpress.com/91/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/digg/skippyrecords.wordpress.com/91/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/goreddit/skippyrecords.wordpress.com/91/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/reddit/skippyrecords.wordpress.com/91/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=skippyrecords.wordpress.com&amp;blog=13069636&amp;post=91&amp;subd=skippyrecords&amp;ref=&amp;feed=1" width="1" height="1" />]]></content:encoded>
			<wfw:commentRss>http://skippyrecords.wordpress.com/2008/08/27/what-does-the-nth-explorer-of-the-web-find/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
	
		<media:content url="http://1.gravatar.com/avatar/fd95bd67cd406fcb27a627a44570f2a2?s=96&#38;d=identicon&#38;r=G" medium="image">
			<media:title type="html">drskippy27</media:title>
		</media:content>

		<media:content url="http://drskippy.net/img/uniqperuser_20080827.png" medium="image">
			<media:title type="html">URLs per user</media:title>
		</media:content>

		<media:content url="http://drskippy.net/img/longtailperuser_20080827.png" medium="image">
			<media:title type="html">long tail per user</media:title>
		</media:content>
	</item>
		<item>
		<title>The Long Tail of URL Exploration</title>
		<link>http://skippyrecords.wordpress.com/2008/08/27/the-long-tail-of-url-exploration/</link>
		<comments>http://skippyrecords.wordpress.com/2008/08/27/the-long-tail-of-url-exploration/#comments</comments>
		<pubDate>Wed, 27 Aug 2008 18:47:33 +0000</pubDate>
		<dc:creator>Dr. Skippy</dc:creator>
				<category><![CDATA[Networks and Webs]]></category>
		<category><![CDATA[long tail URL click streams]]></category>

		<guid isPermaLink="false">http://h180745wp.setupmyblog.com/?p=90</guid>
		<description><![CDATA[Unlike robot crawlers, people visit only the links they think will be interesting.&#160; People follow news stories, follow links from friends, follow links to videos or pictures.&#160; People much more rarely follow links to boring stuff like privacy policies, lists of data or company mission statements. In order to understand the pattern or people discovering [...]<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=skippyrecords.wordpress.com&amp;blog=13069636&amp;post=90&amp;subd=skippyrecords&amp;ref=&amp;feed=1" width="1" height="1" />]]></description>
			<content:encoded><![CDATA[<p>Unlike robot crawlers, people visit only the links they think will be interesting.&nbsp; People follow news stories, follow links from friends, follow links to videos or pictures.&nbsp; People much more rarely follow links to boring stuff like privacy policies, lists of data or company mission statements.</p>
<p>In order to understand the pattern or people discovering content on the Web, I looked at the click streams of real Web users.&nbsp; (For privacy hounds, I have real data, but the user names and URLs are hashed so I can&#8217;t personally identify anyone or look up the actual URLs they visited.)</p>
<p>How much overlap is there between the URLs 10 people visit and those in the 11th person&#8217;s click stream?&nbsp; How about the 100th or 100,000th person? Does the millionth user explore any unique URLs at all? Can we build a model to answer How many people are required to crawl 10% of the Web?</p>
<p>The first part of the answer is to look at the distribution of URLs created by a group of users.&nbsp; The sample has 102,000 user&#8217;s click streams for 1 day.&nbsp; I use only 1 day, because using multiple days complicates the estimates due to nearly all users having a set of URLs they visit daily. In the sample, users make 18.6M visits to just under 8.4M unique URLs.</p>
<p>When the URLs are ranked by number of visits, the distribution of visits over the 8.4M URLs shows that a few URLs get many hits while the tail of the distribution (way out to the right) is a long flat curve with many URLs getting only a handful of hits.</p>
<p>The distribution of visits can be fitted fairly accurately with a power law, p(x)=ax^k.&nbsp; I don&#8217;t plot the curve, because the head is so tall compared to the tail and the distribution falls off so quickly that the plot is a very sharp &quot;L&quot; shape and we don&#8217;t get much from looking at it.&nbsp; It is more useful to look at the cumulative distribution function (CDF) of the distribution.&nbsp; This is the sum of the probabilities over the rank from highest to lowest.&nbsp; Summing the probabilities to the lowest ranked URL, gives 100% of the visits recorded. Using the CDF perspective gives insight we can apply to practical situations.</p>
<p>Below is a plot of the CDF.&nbsp; The red dots are the data points calculated from the sample while the blue line is the best fit to the CDF of the proposed power law distribution.&nbsp; (The fit parameters are a=0.00219 and k=-0.690.)</p>
<p>
<div style="text-align:center;"><img border="0" src="http://drskippy.net/img/cdf_20080827.png" alt="CDF of URL Visits" title="CDF of URL Visits" /></div>
<p>One conclusion that comes out of this view of the data is that URL visits follow the so-called &quot;80/20 Rule.&quot; This predicts 80% of the visits for the day went to roughly 20% of the URLs. Actually, for this data, the proportion is about 80/50&#8211;80% of the traffic when to the top 4.5M URLs or the top 57% of URLs. </p>
<p>This view shows that the tall head of big visit winners takes a significant fraction of the overall attention of the group.&nbsp; These are likely URLs like www.google.com or www.cnn.com.</p>
<p>What does the long tail look like? The tail is just as surprising. For this data set, 5.8M URLs or 69% of the URLs visited during the day were visited only once. The number of URLs visited twice is 1.4M. </p>
<p>How do these numbers scale with the size of the group? That&#8217;s coming in the next post.&nbsp; </p>
<br /><img alt="" border="0" src="http://feeds.wordpress.com/1.0/categories/skippyrecords.wordpress.com/90/" /> <img alt="" border="0" src="http://feeds.wordpress.com/1.0/tags/skippyrecords.wordpress.com/90/" /> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/skippyrecords.wordpress.com/90/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/skippyrecords.wordpress.com/90/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godelicious/skippyrecords.wordpress.com/90/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/delicious/skippyrecords.wordpress.com/90/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gofacebook/skippyrecords.wordpress.com/90/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/facebook/skippyrecords.wordpress.com/90/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gotwitter/skippyrecords.wordpress.com/90/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/twitter/skippyrecords.wordpress.com/90/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gostumble/skippyrecords.wordpress.com/90/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/stumble/skippyrecords.wordpress.com/90/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godigg/skippyrecords.wordpress.com/90/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/digg/skippyrecords.wordpress.com/90/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/goreddit/skippyrecords.wordpress.com/90/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/reddit/skippyrecords.wordpress.com/90/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=skippyrecords.wordpress.com&amp;blog=13069636&amp;post=90&amp;subd=skippyrecords&amp;ref=&amp;feed=1" width="1" height="1" />]]></content:encoded>
			<wfw:commentRss>http://skippyrecords.wordpress.com/2008/08/27/the-long-tail-of-url-exploration/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
	
		<media:content url="http://1.gravatar.com/avatar/fd95bd67cd406fcb27a627a44570f2a2?s=96&#38;d=identicon&#38;r=G" medium="image">
			<media:title type="html">drskippy27</media:title>
		</media:content>

		<media:content url="http://drskippy.net/img/cdf_20080827.png" medium="image">
			<media:title type="html">CDF of URL Visits</media:title>
		</media:content>
	</item>
		<item>
		<title>Qualifiers with Infinite Denominators Link</title>
		<link>http://skippyrecords.wordpress.com/2008/08/12/qualifiers-with-infinite-denominators-link/</link>
		<comments>http://skippyrecords.wordpress.com/2008/08/12/qualifiers-with-infinite-denominators-link/#comments</comments>
		<pubDate>Tue, 12 Aug 2008 20:32:23 +0000</pubDate>
		<dc:creator>Dr. Skippy</dc:creator>
				<category><![CDATA[Networks and Webs]]></category>
		<category><![CDATA[TED Long Tail probability free]]></category>

		<guid isPermaLink="false">http://h180745wp.setupmyblog.com/?p=88</guid>
		<description><![CDATA[I like digging through how we use and perceive words.&#160; Here is a fun one from Chris Andersen&#8217;s Blog:&#160; Many words we use imply a ratio.&#160; Here are the top 5 words whose meaning fizzles when the implied denominator becomes large: &#34;Most&#34; &#34;Average&#34; &#34;Typical&#34; &#34;All&#34; &#34;None/No&#34; It is not only large denominators that undermine these [...]<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=skippyrecords.wordpress.com&amp;blog=13069636&amp;post=88&amp;subd=skippyrecords&amp;ref=&amp;feed=1" width="1" height="1" />]]></description>
			<content:encoded><![CDATA[<p>I like digging through how we use and perceive words.&nbsp; Here is a <a href="http://www.longtail.com/the_long_tail/2008/08/thirteen-words.html" title="Thriteen Words Blog Post">fun one from Chris Andersen&#8217;s Blog</a>:&nbsp; </p>
<p>Many words we use imply a ratio.&nbsp; Here are the top 5 words whose meaning fizzles when the implied denominator becomes large:</p>
<ol>
<li>&quot;Most&quot;</li>
<li>&quot;Average&quot;</li>
<li>&quot;Typical&quot;</li>
<li>&quot;All&quot;</li>
<li>&quot;None/No&quot;</li>
</ol>
<p>It is not only large denominators that undermine these words (more accurately, the success we have using our intuition to make predictions with these words). They also fall apart when probability distributions are anything but Normal (peaked, symmetric and short-tailed).&nbsp; We don&#8217;t seem to do a great job of helping people understand multi-modal or skewed distributions with our basic statistics education.&nbsp; It is increasingly important in planning and executing many business ideas that we become more sophisticated with long-tailed, asymmetric and multi-modal probability distributions.
<p>Chris is the author of <a href="http://www.amazon.com/Long-Tail-Revised-Updated-Business/dp/1401309666/ref=pd_bbs_sr_2?ie=UTF8&amp;s=books&amp;qid=1218579636&amp;sr=8-2">The Long Tail</a> and editor of <a href="http://www.wired.com/">WIRED magazine</a> and curator of <a href="http://www.ted.com/">TED</a>.&nbsp; He is working on a new book about &quot;free&quot; products and services.&nbsp; All recommended&#8211;check them out.</p>
<br /><img alt="" border="0" src="http://feeds.wordpress.com/1.0/categories/skippyrecords.wordpress.com/88/" /> <img alt="" border="0" src="http://feeds.wordpress.com/1.0/tags/skippyrecords.wordpress.com/88/" /> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/skippyrecords.wordpress.com/88/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/skippyrecords.wordpress.com/88/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godelicious/skippyrecords.wordpress.com/88/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/delicious/skippyrecords.wordpress.com/88/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gofacebook/skippyrecords.wordpress.com/88/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/facebook/skippyrecords.wordpress.com/88/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gotwitter/skippyrecords.wordpress.com/88/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/twitter/skippyrecords.wordpress.com/88/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gostumble/skippyrecords.wordpress.com/88/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/stumble/skippyrecords.wordpress.com/88/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godigg/skippyrecords.wordpress.com/88/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/digg/skippyrecords.wordpress.com/88/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/goreddit/skippyrecords.wordpress.com/88/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/reddit/skippyrecords.wordpress.com/88/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=skippyrecords.wordpress.com&amp;blog=13069636&amp;post=88&amp;subd=skippyrecords&amp;ref=&amp;feed=1" width="1" height="1" />]]></content:encoded>
			<wfw:commentRss>http://skippyrecords.wordpress.com/2008/08/12/qualifiers-with-infinite-denominators-link/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
	
		<media:content url="http://1.gravatar.com/avatar/fd95bd67cd406fcb27a627a44570f2a2?s=96&#38;d=identicon&#38;r=G" medium="image">
			<media:title type="html">drskippy27</media:title>
		</media:content>
	</item>
		<item>
		<title>Competition III&#8211;Another Iterated Prisoner&#039;s Dilemma Tournament</title>
		<link>http://skippyrecords.wordpress.com/2008/02/13/competition-iii-another-iterated-prisoners-dilemma-tournament/</link>
		<comments>http://skippyrecords.wordpress.com/2008/02/13/competition-iii-another-iterated-prisoners-dilemma-tournament/#comments</comments>
		<pubDate>Wed, 13 Feb 2008 12:05:33 +0000</pubDate>
		<dc:creator>Dr. Skippy</dc:creator>
				<category><![CDATA[Networks and Webs]]></category>
		<category><![CDATA[iterated prisoner’s dilemma]]></category>

		<guid isPermaLink="false">http://h180745wp.setupmyblog.com/?p=66</guid>
		<description><![CDATA[These results are a follow on to my last post on competition and iterated prisoner&#8217;s dilemma simulation. In the tournament below, I used the tournament rule that every agent plays every agent at each round.&#160; This takes a lot longer to run and the results are different. AvgT4T is still the winner, but T4TForgive beats [...]<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=skippyrecords.wordpress.com&amp;blog=13069636&amp;post=66&amp;subd=skippyrecords&amp;ref=&amp;feed=1" width="1" height="1" />]]></description>
			<content:encoded><![CDATA[<p>These results are a follow on to my <a href="http://drskippy.net/blog/2008/02/competition_ii.html">last post</a> on competition and iterated prisoner&rsquo;s dilemma simulation. In the tournament below, I used the tournament rule that every agent plays every agent at each round.&nbsp; This takes a lot longer to run and the results are different. AvgT4T is still the winner, but T4TForgive beats out T4T.</p>
<p>Tournament 1 was one in which everyone played once each round.&nbsp; This seems more like a competitive environment in which the other players don&#8217;t get complete information about the how play proceeded in that round&#8211;maybe more like competing for jobs or making friends in high school.&nbsp;</p>
<p>Tournament 2 seems to give more complete information at each round as a public auction or public disclosure pricing might provide.&nbsp; I am sure the merits of using one style of tournament over another have been debated plenty.</p>
<p>The difference in outcome illustrates an important characteristic of complex interactions among agents: The initial conditions and the rules of play make all the difference in the world.&nbsp; For those who are committed to free markets, there seems to be a parallel assumption that the ultimate achievement in free markets is one with now rules at all. Citizens of a connected and crowded planet may choose to design the rules of interacting based on our initial conditions. When I play with these simulations for awhile, I start to understand Case&#8217;s argument against depending on the simplistic dogma of free-markets to answer all questions of our common wellbeing.</p>
<p><img width="500" height="500" border="0" align="absmiddle" title="Prisoner's Dilema Population by Round" alt="Prisoner's Dilema Population by Round" src="http://drskippy.net/img/IPDPopulation_epe_20080213" />&nbsp;</p>
<p>Data tables &#8230;&nbsp;</p>
<p><span id="more-66"></span></p>
<p>Match Win-Loss-Tie</p>
<table border="0">
<tbody>
<tr>
<td>Random-T4TForgive:</td>
<td>1.000, 0.000, 0.000</td>
</tr>
<tr>
<td>Defect-Random:</td>
<td>1.000, 0.000, 0.000</td>
</tr>
<tr>
<td>Random-T4T:</td>
<td>1.000, 0.000, 0.000</td>
</tr>
<tr>
<td>AvgT4T-Random:</td>
<td>0.537, 0.460, 0.003</td>
</tr>
<tr>
<td>Random-Pred1:</td>
<td>0.701, 0.299, 0.000</td>
</tr>
<tr>
<td>T4TDefect-Random:</td>
<td>0.946, 0.006, 0.048</td>
</tr>
<tr>
<td>Defect-T4TForgive:</td>
<td>1.000, 0.000, 0.000</td>
</tr>
<tr>
<td>T4T-T4TForgive:</td>
<td>0.000, 0.000, 1.000</td>
</tr>
<tr>
<td>AvgT4T-T4TForgive:</td>
<td>0.000, 0.000, 1.000</td>
</tr>
<tr>
<td>Pred1-T4TForgive:</td>
<td>1.000, 0.000, 0.000</td>
</tr>
<tr>
<td>T4TDefect-T4TForgive:</td>
<td>0.998, 0.000, 0.002</td>
</tr>
<tr>
<td>Defect-T4T:</td>
<td>1.000, 0.000, 0.000</td>
</tr>
<tr>
<td>Defect-AvgT4T:</td>
<td>1.000, 0.000, 0.000</td>
</tr>
<tr>
<td>Defect-Pred1:</td>
<td>1.000, 0.000, 0.000</td>
</tr>
<tr>
<td>Defect-T4TDefect:</td>
<td>1.000, 0.000, 0.000</td>
</tr>
<tr>
<td>AvgT4T-T4T:</td>
<td>0.000, 0.000, 1.000</td>
</tr>
<tr>
<td>Pred1-T4T:</td>
<td>1.000, 0.000, 0.000</td>
</tr>
<tr>
<td>T4TDefect-T4T:</td>
<td>0.981, 0.000, 0.019</td>
</tr>
<tr>
<td>Pred1-AvgT4T:</td>
<td>1.000, 0.000, 0.000</td>
</tr>
<tr>
<td>T4TDefect-AvgT4T:</td>
<td>1.000, 0.000, 0.000</td>
</tr>
<tr>
<td>T4TDefect-Pred1:</td>
<td>0.970, 0.015, 0.015</td>
</tr>
</tbody>
</table>
<p>Match Scores</p>
<table border="0">
<tbody>
<tr>
<td>T4T-T4TForgive:</td>
<td>0.500, 0.500</td>
</tr>
<tr>
<td>T4TDefect-T4TForgive:</td>
<td>0.510, 0.490</td>
</tr>
<tr>
<td>Random-T4TDefect:</td>
<td>0.490, 0.510</td>
</tr>
<tr>
<td>Defect-AvgT4T:</td>
<td>0.506, 0.494</td>
</tr>
<tr>
<td>T4TForgive-Random:</td>
<td>0.490, 0.510</td>
</tr>
<tr>
<td>Pred1-AvgT4T:</td>
<td>0.522, 0.478</td>
</tr>
<tr>
<td>Defect-T4T:</td>
<td>0.506, 0.494</td>
</tr>
<tr>
<td>T4TDefect-AvgT4T:</td>
<td>0.516, 0.484</td>
</tr>
<tr>
<td>Pred1-Random:</td>
<td>0.415, 0.585</td>
</tr>
<tr>
<td>Defect-Random:</td>
<td>0.857, 0.143</td>
</tr>
<tr>
<td>AvgT4T-T4TForgive:</td>
<td>0.500, 0.500</td>
</tr>
<tr>
<td>AvgT4T-T4T:</td>
<td>0.500, 0.500</td>
</tr>
<tr>
<td>T4TDefect-T4T:</td>
<td>0.504, 0.496</td>
</tr>
<tr>
<td>T4TDefect-Pred1:</td>
<td>0.511, 0.489</td>
</tr>
<tr>
<td>T4T-Random:</td>
<td>0.499, 0.501</td>
</tr>
<tr>
<td>Pred1-T4TForgive:</td>
<td>0.510, 0.490</td>
</tr>
<tr>
<td>Random-AvgT4T:</td>
<td>0.484, 0.516</td>
</tr>
<tr>
<td>Defect-T4TDefect:</td>
<td>0.506, 0.494</td>
</tr>
<tr>
<td>Defect-T4TForgive:</td>
<td>0.541, 0.459</td>
</tr>
<tr>
<td>Pred1-T4T:</td>
<td>0.503, 0.497</td>
</tr>
<tr>
<td>Defect-Pred1:</td>
<td>0.506, 0.494</td>
</tr>
</tbody>
</table>
<br /><img alt="" border="0" src="http://feeds.wordpress.com/1.0/categories/skippyrecords.wordpress.com/66/" /> <img alt="" border="0" src="http://feeds.wordpress.com/1.0/tags/skippyrecords.wordpress.com/66/" /> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/skippyrecords.wordpress.com/66/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/skippyrecords.wordpress.com/66/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godelicious/skippyrecords.wordpress.com/66/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/delicious/skippyrecords.wordpress.com/66/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gofacebook/skippyrecords.wordpress.com/66/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/facebook/skippyrecords.wordpress.com/66/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gotwitter/skippyrecords.wordpress.com/66/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/twitter/skippyrecords.wordpress.com/66/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gostumble/skippyrecords.wordpress.com/66/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/stumble/skippyrecords.wordpress.com/66/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godigg/skippyrecords.wordpress.com/66/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/digg/skippyrecords.wordpress.com/66/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/goreddit/skippyrecords.wordpress.com/66/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/reddit/skippyrecords.wordpress.com/66/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=skippyrecords.wordpress.com&amp;blog=13069636&amp;post=66&amp;subd=skippyrecords&amp;ref=&amp;feed=1" width="1" height="1" />]]></content:encoded>
			<wfw:commentRss>http://skippyrecords.wordpress.com/2008/02/13/competition-iii-another-iterated-prisoners-dilemma-tournament/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
	
		<media:content url="http://1.gravatar.com/avatar/fd95bd67cd406fcb27a627a44570f2a2?s=96&#38;d=identicon&#38;r=G" medium="image">
			<media:title type="html">drskippy27</media:title>
		</media:content>

		<media:content url="http://drskippy.net/img/IPDPopulation_epe_20080213" medium="image">
			<media:title type="html">Prisoner&#039;s Dilema Population by Round</media:title>
		</media:content>
	</item>
		<item>
		<title>Competition II</title>
		<link>http://skippyrecords.wordpress.com/2008/02/11/competition-ii/</link>
		<comments>http://skippyrecords.wordpress.com/2008/02/11/competition-ii/#comments</comments>
		<pubDate>Tue, 12 Feb 2008 03:48:45 +0000</pubDate>
		<dc:creator>Dr. Skippy</dc:creator>
				<category><![CDATA[Networks and Webs]]></category>
		<category><![CDATA[prisoner's dilemma games game_theory IPD competition]]></category>

		<guid isPermaLink="false">http://h180745wp.setupmyblog.com/?p=65</guid>
		<description><![CDATA[I couldn&#8217;t let this one alone.&#160; While reading Case&#8217;s book on Competition, I decided to simulate the theoretical game Iterated Prisoner&#8217;s Dilemma. The explanation on Wikipedia is great.&#160; I use the canonical PD payoff matrix {(3,3), (0,5), (5,0) (1,1)} where the scores represent points awarded to (agent1, agent2) for combinations of play {(mum, mum),(mum, snitch),(snitch, [...]<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=skippyrecords.wordpress.com&amp;blog=13069636&amp;post=65&amp;subd=skippyrecords&amp;ref=&amp;feed=1" width="1" height="1" />]]></description>
			<content:encoded><![CDATA[<p>I couldn&#8217;t let this one alone.&nbsp; While reading <a href="http://drskippy.net/blog/2008/02/competitionmobius_thought_loop.html">Case&#8217;s book on Competition</a>, I decided to simulate the theoretical game<em> Iterated Prisoner&#8217;s Dilemma</em>. The explanation on <a target="_blank" href="http://en.wikipedia.org/wiki/Prisoner's_dilemma">Wikipedia</a> is great.&nbsp; I use the canonical PD payoff matrix {(3,3), (0,5), (5,0) (1,1)} where the scores represent points awarded to (agent1, agent2) for combinations of play {(mum, mum),(mum, snitch),(snitch, mum),(snitch, snitch)} respectively.</p>
<p>The simulation runs 200-game matches for 40 rounds of play.&nbsp; I have coded both everybody-plays-once-per-round and everybody-plays-everybody-per-round tournaments, although the results here are only from&nbsp; everybody-plays-once-per-round tournaments. Tournaments start with 25 agents of each strategy type.</p>
<p>I programmed 7 agent strategies:
<ul>
<li>Defect (always snitch)</li>
<li>Random (fair coin toss)</li>
<li>Tit-for-tat (Do what your opponent did last play&#8211;T4T is best overall according to the literature)</li>
<li>T4T with occasional snitches</li>
<li>T4T with occasional forgiveness</li>
<li>Prey on cooperators</li>
<li>Average T4T (T4T based on the average of all plays to date)</li>
</ul>
<p>Below are the results of 40 rounds of play. After each round, the total points earned by each strategy are tallied, then the population is redistributed according to the point breakdown.&nbsp; Predatory strategies tend to become extinct after 15 to 20 rounds.</p>
<p>&nbsp;</p>
<div style="text-align:center;"><img border="0" title="IPD Population Plot" alt="IPD Population Plot" src="http://drskippy.net/img/IPDPopulation_20080211.png" /></div>
<p>Average T4T seems to be the population winner in my simulations, settling in at about 40% of the population.&nbsp; I haven&#8217;t seen any documentation on this effect.&nbsp; It may be an artifact of my particular mix of agents when the tournament starts.&nbsp; Or I may have discovered a new, effective agent strategy.&nbsp; This doesn&#8217;t seem likely as it is such an obvious strategy. Has anyone seen this effect before?&nbsp; Please drop me a line.</p>
<p>The game win-loss-tie record between agent&#8217;s strategies was a little surprising to me.&nbsp; Defection wins most of the matches. But it is clear that winning games doesn&#8217;t correspond closely to achieving the highest scores per round.&nbsp; Coming in a close second against a wide range of strategies is what explains a particular population&#8217;s survival and growth.</p>
<p>Match Win-Loss-Tie
<p>&nbsp;</p>
<blockquote>
<table border="0">
<tr>
<td>AvgT4T-Random:</td>
<td> 0.492, 0.508, 0.000</td>
</tr>
<tr>
<td>T4TDefect-Random:</td>
<td>1.000, 0.000, 0.000</td>
</tr>
<tr>
<td>T4TDefect-T4TForgive:</td>
<td> 1.000, 0.000, 0.000</td>
</tr>
<tr>
<td>AvgT4T-T4TForgive:</td>
<td>0.000, 0.000, 1.000</td>
</tr>
<tr>
<td>Defect-T4TDefect:</td>
<td> 1.000, 0.000, 0.000</td>
</tr>
<tr>
<td>Defect-Pred1:</td>
<td> 1.000, 0.000, 0.000</td>
</tr>
<tr>
<td>T4T-AvgT4T:</td>
<td> 0.000, 0.000, 1.000</td>
</tr>
<tr>
<td>Pred1-T4TForgive:</td>
<td>1.000, 0.000, 0.000</td>
</tr>
<tr>
<td>T4TDefect-T4T:</td>
<td> 0.974, 0.000, 0.026</td>
</tr>
<tr>
<td>Pred1-AvgT4T:</td>
<td>1.000, 0.000, 0.000</td>
</tr>
<tr>
<td>T4T-T4TForgive:</td>
<td> 0.000, 0.000, 1.000</td>
</tr>
<tr>
<td>Defect-AvgT4T:</td>
<td>1.000, 0.000, 0.000</td>
</tr>
<tr>
<td>T4TDefect-AvgT4T:</td>
<td> 1.000, 0.000, 0.000</td>
</tr>
<tr>
<td>Random-T4TForgive:</td>
<td> 1.000, 0.000, 0.000</td>
</tr>
<tr>
<td>Defect-Random:</td>
<td> 1.000, 0.000, 0.000</td>
</tr>
<tr>
<td>Pred1-Random:</td>
<td> 0.250, 0.750, 0.000</td>
</tr>
<tr>
<td>Defect-T4T:</td>
<td> 1.000, 0.000, 0.000</td>
</tr>
<tr>
<td>Defect-T4TForgive:</td>
<td>1.000, 0.000, 0.000</td>
</tr>
<tr>
<td>Pred1-T4T:</td>
<td> 1.000, 0.000, 0.000</td>
</tr>
<tr>
<td>Random-T4T:</td>
<td> 0.750, 0.000, 0.250</td>
</tr>
<tr>
<td>T4TDefect-Pred1:</td>
<td> 1.000, 0.000, 0.000</td>
</tr>
</table>
</blockquote>
<p>If you compare the percentage of points earned in the (7!)/(2!)(5!) = 21 possible pairings, it is clear that the point leaders fairly evening split the points with all opponents, while the agent strategies the result in rapid extinction lose to some strategies by wider margins. </p>
<p>Match Scores (Fraction of Points)</p>
<blockquote>
<table border="0">
<tr>
<td>T4T-T4TForgive: </td>
<td>0.500, 0.500</td>
</tr>
<tr>
<td>T4TDefect-T4TForgive:</td>
<td> 0.510, 0.490</td>
</tr>
<tr>
<td>Random-T4TDefect:</td>
<td>0.492, 0.508</td>
</tr>
<tr>
<td>Defect-AvgT4T:</td>
<td> 0.506, 0.494</td>
</tr>
<tr>
<td>T4TForgive-Random:</td>
<td>0.491, 0.509</td>
</tr>
<tr>
<td>Pred1-AvgT4T:</td>
<td> 0.522, 0.478</td>
</tr>
<tr>
<td>Defect-T4T:</td>
<td> 0.506, 0.494</td>
</tr>
<tr>
<td>T4TForgive-AvgT4T:</td>
<td> 0.500, 0.500</td>
</tr>
<tr>
<td>T4TDefect-AvgT4T:</td>
<td>0.518, 0.482</td>
</tr>
<tr>
<td>Pred1-Random:</td>
<td> 0.397, 0.603</td>
</tr>
<tr>
<td>Defect-Random:</td>
<td> 0.858, 0.142</td>
</tr>
<tr>
<td>T4T-T4TDefect:</td>
<td> 0.496, 0.504</td>
</tr>
<tr>
<td>AvgT4T-T4T:</td>
<td> 0.500, 0.500</td>
</tr>
<tr>
<td>T4TDefect-Pred1:</td>
<td>0.512, 0.488</td>
</tr>
<tr>
<td>Pred1-T4T:</td>
<td> 0.503, 0.497</td>
</tr>
<tr>
<td>Pred1-T4TForgive:</td>
<td> 0.510, 0.490</td>
</tr>
<tr>
<td>Random-T4T:</td>
<td>0.502, 0.498</td>
</tr>
<tr>
<td>Random-AvgT4T:</td>
<td> 0.503, 0.497</td>
</tr>
<tr>
<td>Defect-T4TDefect:</td>
<td> 0.506, 0.494</td>
</tr>
<tr>
<td>Defect-T4TForgive:</td>
<td>0.537, 0.463</td>
</tr>
<tr>
<td>Defect-Pred1:</td>
<td>0.506, 0.494</td>
</tr>
</table>
</blockquote>
<p>You can download the code (<a href="http://drskippy.net/python/IteratedPrisonersDilema.zip">zip</a>, <a href="http://drskippy.net/python/IteratedPrisonersDilema.tar.gz">gzip</a>) here to play with ideas for strategies or see how the dynamics of the system change based on the mix of agents. There is a Readme file explaining how to create new agents with your favorite strategies. Adding new agent types is fairly straight forward and requires only a few lines of new code in most cases. <a href="http://www.python.org/">Python</a> 2.5&nbsp; is required to run the simulations.&nbsp; The analysis programs require <a target="_blank" href="http://matplotlib.sourceforge.net/">MatPlotLib</a> to create the plots.&nbsp; This software is available under a <a target="_blank" href="http://creativecommons.org/licenses/by-nc/3.0/us/">non-commercial Creative Commons License</a>.&nbsp; Have fun!</p>
<br /><img alt="" border="0" src="http://feeds.wordpress.com/1.0/categories/skippyrecords.wordpress.com/65/" /> <img alt="" border="0" src="http://feeds.wordpress.com/1.0/tags/skippyrecords.wordpress.com/65/" /> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/skippyrecords.wordpress.com/65/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/skippyrecords.wordpress.com/65/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godelicious/skippyrecords.wordpress.com/65/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/delicious/skippyrecords.wordpress.com/65/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gofacebook/skippyrecords.wordpress.com/65/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/facebook/skippyrecords.wordpress.com/65/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gotwitter/skippyrecords.wordpress.com/65/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/twitter/skippyrecords.wordpress.com/65/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gostumble/skippyrecords.wordpress.com/65/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/stumble/skippyrecords.wordpress.com/65/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godigg/skippyrecords.wordpress.com/65/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/digg/skippyrecords.wordpress.com/65/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/goreddit/skippyrecords.wordpress.com/65/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/reddit/skippyrecords.wordpress.com/65/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=skippyrecords.wordpress.com&amp;blog=13069636&amp;post=65&amp;subd=skippyrecords&amp;ref=&amp;feed=1" width="1" height="1" />]]></content:encoded>
			<wfw:commentRss>http://skippyrecords.wordpress.com/2008/02/11/competition-ii/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
	
		<media:content url="http://1.gravatar.com/avatar/fd95bd67cd406fcb27a627a44570f2a2?s=96&#38;d=identicon&#38;r=G" medium="image">
			<media:title type="html">drskippy27</media:title>
		</media:content>

		<media:content url="http://drskippy.net/img/IPDPopulation_20080211.png" medium="image">
			<media:title type="html">IPD Population Plot</media:title>
		</media:content>
	</item>
	</channel>
</rss>
