<?xml version="1.0" encoding="utf-8"?><rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:series="http://unfoldingneurons.com/"
		>
<channel>
	<title>Comments on: Average length of a URL</title>
	<atom:link href="http://www.supermind.org/blog/559/average-length-of-a-url/feed" rel="self" type="application/rss+xml" />
	<link>http://www.supermind.org/blog/559/average-length-of-a-url</link>
	<description>A blog on Lucene, Solr, Nutch, crawling and vertical search</description>
	<lastBuildDate>Thu, 26 Aug 2010 08:00:19 +0000</lastBuildDate>
	<generator>http://wordpress.org/?v=abc</generator>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
		<item>
		<title>By: Average length of a URL (Part 2) :: Kelvin Tan - Lucene Solr Nutch Consultant</title>
		<link>http://www.supermind.org/blog/559/average-length-of-a-url/comment-page-1#comment-17165</link>
		<dc:creator>Average length of a URL (Part 2) :: Kelvin Tan - Lucene Solr Nutch Consultant</dc:creator>
		<pubDate>Mon, 16 Aug 2010 19:49:18 +0000</pubDate>
		<guid isPermaLink="false">http://www.supermind.org/?p=559#comment-17165</guid>
		<description>[...] 16 Aug 2010 at 02:49 pm &#124; Tagged as: programming Here&#8217;s a follow-up on my previous attempt at calculating the average length of a URL, which was naive and totally [...]</description>
		<content:encoded><![CDATA[<div style="background-color:#E9F7F6;">
<p>[...] 16 Aug 2010 at 02:49 pm | Tagged as: programming Here&#8217;s a follow-up on my previous attempt at calculating the average length of a URL, which was naive and totally [...]</p>
</div>
]]></content:encoded>
	</item>
	<item>
		<title>By: Kelvin</title>
		<link>http://www.supermind.org/blog/559/average-length-of-a-url/comment-page-1#comment-17092</link>
		<dc:creator>Kelvin</dc:creator>
		<pubDate>Fri, 02 Jul 2010 14:31:45 +0000</pubDate>
		<guid isPermaLink="false">http://www.supermind.org/?p=559#comment-17092</guid>
		<description>@Luying - you&#039;re absolutely right, and when I have a bit more time I&#039;m planning to perform a more comprehensive survey based on blog URLs, news feeds and the lot.</description>
		<content:encoded><![CDATA[<p>@Luying - you&#8217;re absolutely right, and when I have a bit more time I&#8217;m planning to perform a more comprehensive survey based on blog URLs, news feeds and the lot.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Luying</title>
		<link>http://www.supermind.org/blog/559/average-length-of-a-url/comment-page-1#comment-17090</link>
		<dc:creator>Luying</dc:creator>
		<pubDate>Fri, 02 Jul 2010 13:40:08 +0000</pubDate>
		<guid isPermaLink="false">http://www.supermind.org/?p=559#comment-17090</guid>
		<description>But these are mostly top level pages which usually have shorter URLs.</description>
		<content:encoded><![CDATA[<p>But these are mostly top level pages which usually have shorter URLs.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Brodie</title>
		<link>http://www.supermind.org/blog/559/average-length-of-a-url/comment-page-1#comment-16986</link>
		<dc:creator>Brodie</dc:creator>
		<pubDate>Tue, 23 Feb 2010 07:11:47 +0000</pubDate>
		<guid isPermaLink="false">http://www.supermind.org/?p=559#comment-16986</guid>
		<description>Interesting, but I would like to see more statistical analysis of the URLs, i.e. the smallest, the largest, and the 95%/98%/99.5% percentiles of URL length. I see that the raw mean helps with your requirement, but it doesn&#039;t answer questions like &quot;How long should I make a database field to store a URL?&quot;.

If we knew that (as an example) 98% of URLs will fit into a VARCHAR(100) field, and 99.5% will fit into VARCHAR(200), then it would be easier to make these choices. Perhaps you would like to contribute such knowledge to the benefit of all? :-)

Regards,
Brodie</description>
		<content:encoded><![CDATA[<p>Interesting, but I would like to see more statistical analysis of the URLs, i.e. the smallest, the largest, and the 95%/98%/99.5% percentiles of URL length. I see that the raw mean helps with your requirement, but it doesn&#8217;t answer questions like &#8220;How long should I make a database field to store a URL?&#8221;.</p>
<p>If we knew that (as an example) 98% of URLs will fit into a VARCHAR(100) field, and 99.5% will fit into VARCHAR(200), then it would be easier to make these choices. Perhaps you would like to contribute such knowledge to the benefit of all? <img src='http://www.supermind.org/blog/wp-includes/images/smilies/icon_smile.gif' alt=':-)' class='wp-smiley' /> </p>
<p>Regards,<br />
Brodie</p>
]]></content:encoded>
	</item>
</channel>
</rss>
