<?xml version="1.0" encoding="UTF-8"?><rss
version="2.0"
xmlns:content="http://purl.org/rss/1.0/modules/content/"
xmlns:dc="http://purl.org/dc/elements/1.1/"
xmlns:atom="http://www.w3.org/2005/Atom"
xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
> <channel><title>Comments on: Technorati Suffers Major Data Accuracy Loss</title> <atom:link href="http://www.daviddalka.com/createvalue/2007/05/31/technorati-suffers-major-data-accuracy-loss/feed/" rel="self" type="application/rss+xml" /><link>http://www.daviddalka.com/createvalue/2007/05/31/technorati-suffers-major-data-accuracy-loss/</link> <description>Digital Strategy Solutions, Change Management Leadership, Business Speaker, Payments Technology Convergence</description> <lastBuildDate>Sat, 11 Feb 2012 00:11:18 +0000</lastBuildDate> <sy:updatePeriod>hourly</sy:updatePeriod> <sy:updateFrequency>1</sy:updateFrequency> <generator>http://wordpress.org/?v=3.3.1</generator> <item><title>By: The Not-So-Solid Science of Technorati &#124; Skepticum</title><link>http://www.daviddalka.com/createvalue/2007/05/31/technorati-suffers-major-data-accuracy-loss/comment-page-1/#comment-35451</link> <dc:creator>The Not-So-Solid Science of Technorati &#124; Skepticum</dc:creator> <pubDate>Sun, 25 Nov 2007 19:09:17 +0000</pubDate> <guid
isPermaLink="false">http://www.daviddalka.com/createvalue/2007/05/31/technorati-suffers-major-data-accuracy-loss/#comment-35451</guid> <description>[...] May David Dalka reports an unusual loss in [...]</description> <content:encoded><![CDATA[<p>[...] May David Dalka reports an unusual loss in [...]</p> ]]></content:encoded> </item> <item><title>By: Technorati: Porn Peddlers and Your Blog Image</title><link>http://www.daviddalka.com/createvalue/2007/05/31/technorati-suffers-major-data-accuracy-loss/comment-page-1/#comment-33315</link> <dc:creator>Technorati: Porn Peddlers and Your Blog Image</dc:creator> <pubDate>Thu, 06 Sep 2007 14:08:27 +0000</pubDate> <guid
isPermaLink="false">http://www.daviddalka.com/createvalue/2007/05/31/technorati-suffers-major-data-accuracy-loss/#comment-33315</guid> <description>[...] Godin Joseph Jaffe Darren Rowse Mitch Joel Robert Scoble Elaine Vigneault B.L. Ochman David Dalka Andy Beal Valeria Maltoni Mack Collier Todd Andrlik Technorati Tags: bloggers, Scoble, Seth Godin, [...]</description> <content:encoded><![CDATA[<p>[...] Godin Joseph Jaffe Darren Rowse Mitch Joel Robert Scoble Elaine Vigneault B.L. Ochman David Dalka Andy Beal Valeria Maltoni Mack Collier Todd Andrlik Technorati Tags: bloggers, Scoble, Seth Godin, [...]</p> ]]></content:encoded> </item> <item><title>By: rod/techfold.com</title><link>http://www.daviddalka.com/createvalue/2007/05/31/technorati-suffers-major-data-accuracy-loss/comment-page-1/#comment-30739</link> <dc:creator>rod/techfold.com</dc:creator> <pubDate>Fri, 01 Jun 2007 17:34:05 +0000</pubDate> <guid
isPermaLink="false">http://www.daviddalka.com/createvalue/2007/05/31/technorati-suffers-major-data-accuracy-loss/#comment-30739</guid> <description>I notice my authority number has been jumpy too:
http://techfold.com/2007/05/25/technorati-authority-tweaks/
Fortunately, mine spiked up realtively speaking (without a linking event), so I&#039;m less inclined to complain... good luck getting that sorted, and good job on running down the details...</description> <content:encoded><![CDATA[<p>I notice my authority number has been jumpy too:</p><p><a
href="http://techfold.com/2007/05/25/technorati-authority-tweaks/" rel="nofollow">http://techfold.com/2007/05/25/technorati-authority-tweaks/</a></p><p>Fortunately, mine spiked up realtively speaking (without a linking event), so I&#8217;m less inclined to complain&#8230; good luck getting that sorted, and good job on running down the details&#8230;</p> ]]></content:encoded> </item> <item><title>By: Elaine Vigneault</title><link>http://www.daviddalka.com/createvalue/2007/05/31/technorati-suffers-major-data-accuracy-loss/comment-page-1/#comment-30730</link> <dc:creator>Elaine Vigneault</dc:creator> <pubDate>Fri, 01 Jun 2007 06:11:39 +0000</pubDate> <guid
isPermaLink="false">http://www.daviddalka.com/createvalue/2007/05/31/technorati-suffers-major-data-accuracy-loss/#comment-30730</guid> <description>I&#039;m pretty sure they don&#039;t count links from blogs that ask Technorati to stop scraping their content, too. So some link loss is due to blogs boycotting Technorati:
http://www.elainevigneault.com/2007/05/27/blog-tools-how-to-boycott-technorati.html</description> <content:encoded><![CDATA[<p>I&#8217;m pretty sure they don&#8217;t count links from blogs that ask Technorati to stop scraping their content, too. So some link loss is due to blogs boycotting Technorati:<br
/> <a
href="http://www.elainevigneault.com/2007/05/27/blog-tools-how-to-boycott-technorati.html" rel="nofollow">http://www.elainevigneault.com/2007/05/27/blog-tools-how-to-boycott-technorati.html</a></p> ]]></content:encoded> </item> <item><title>By: Bill</title><link>http://www.daviddalka.com/createvalue/2007/05/31/technorati-suffers-major-data-accuracy-loss/comment-page-1/#comment-30726</link> <dc:creator>Bill</dc:creator> <pubDate>Fri, 01 Jun 2007 00:54:22 +0000</pubDate> <guid
isPermaLink="false">http://www.daviddalka.com/createvalue/2007/05/31/technorati-suffers-major-data-accuracy-loss/#comment-30726</guid> <description>I seem to recall having reported a similar issue to Technorati support on March 14, 2006, and a followup email to David Sifry on March 27, 2006, after only receiving an automated response.  I never got an answer.  Hopefully, you&#039;ve figured it out.
Next problem I&#039;ll just blog about instead of sending an email to support.</description> <content:encoded><![CDATA[<p>I seem to recall having reported a similar issue to Technorati support on March 14, 2006, and a followup email to David Sifry on March 27, 2006, after only receiving an automated response.  I never got an answer.  Hopefully, you&#8217;ve figured it out.</p><p>Next problem I&#8217;ll just blog about instead of sending an email to support.</p> ]]></content:encoded> </item> <item><title>By: David Dalka</title><link>http://www.daviddalka.com/createvalue/2007/05/31/technorati-suffers-major-data-accuracy-loss/comment-page-1/#comment-30725</link> <dc:creator>David Dalka</dc:creator> <pubDate>Fri, 01 Jun 2007 00:29:39 +0000</pubDate> <guid
isPermaLink="false">http://www.daviddalka.com/createvalue/2007/05/31/technorati-suffers-major-data-accuracy-loss/#comment-30725</guid> <description>Your comments do not correlate well with our conversation. Please call me to continue it.
You should start with all the overcounted blogs in the Technorati 100 before you pick on the little guy next time. Start with - gigaom.com?reactions
You still owe me at least 10 links over the past three months that have never been counted. I expect action on those.</description> <content:encoded><![CDATA[<p>Your comments do not correlate well with our conversation. Please call me to continue it.</p><p>You should start with all the overcounted blogs in the Technorati 100 before you pick on the little guy next time. Start with &#8211; gigaom.com?reactions</p><p>You still owe me at least 10 links over the past three months that have never been counted. I expect action on those.</p> ]]></content:encoded> </item> <item><title>By: Adam Hertz</title><link>http://www.daviddalka.com/createvalue/2007/05/31/technorati-suffers-major-data-accuracy-loss/comment-page-1/#comment-30724</link> <dc:creator>Adam Hertz</dc:creator> <pubDate>Fri, 01 Jun 2007 00:03:02 +0000</pubDate> <guid
isPermaLink="false">http://www.daviddalka.com/createvalue/2007/05/31/technorati-suffers-major-data-accuracy-loss/#comment-30724</guid> <description>Dave, thanks for pointing out the problem, and sorry it hit your authority so hard.  And thanks for the phone call this morning -- it was great talking with you about all sorts of issues.
For everyone else, here&#039;s what happened. This is not really a new feature that we&#039;re testing -- rather, we noticed a data quality problem and are addressing it.
Recently we noticed that several blogs were pinging us using permalinks, rather than the URL of the blog as the ping protocol intends. Our spider was fooled into interpreting these pages as blogs rather than posts.  This created duplicate blogs in our database, as well as duplicate links in our indexes.  Thus, any blog linked to from one of these posts had its authority improperly inflated.
We have two things to do to address this -- first, stop processing pings like this, and second, cleaning out the duplicate links.  We&#039;ve done the first, and have started to do the second.
Our very rough estimate of the impact of this issue is that 1 in 500 of the new blogs we&#039;ve added is actually a post.  That means that, on average, the impact of the cleanup on people&#039;s authorities will be modest.
Unfortunately for Dave, a disproportionate number of the recent links to his blog fell into this category.  So the cleanup wound up lowering his blog&#039;s authority more significantly.
Again, thanks to Dave for his patience and problem reports.</description> <content:encoded><![CDATA[<p>Dave, thanks for pointing out the problem, and sorry it hit your authority so hard.  And thanks for the phone call this morning &#8212; it was great talking with you about all sorts of issues.</p><p>For everyone else, here&#8217;s what happened. This is not really a new feature that we&#8217;re testing &#8212; rather, we noticed a data quality problem and are addressing it.</p><p>Recently we noticed that several blogs were pinging us using permalinks, rather than the URL of the blog as the ping protocol intends. Our spider was fooled into interpreting these pages as blogs rather than posts.  This created duplicate blogs in our database, as well as duplicate links in our indexes.  Thus, any blog linked to from one of these posts had its authority improperly inflated.</p><p>We have two things to do to address this &#8212; first, stop processing pings like this, and second, cleaning out the duplicate links.  We&#8217;ve done the first, and have started to do the second.</p><p>Our very rough estimate of the impact of this issue is that 1 in 500 of the new blogs we&#8217;ve added is actually a post.  That means that, on average, the impact of the cleanup on people&#8217;s authorities will be modest.</p><p>Unfortunately for Dave, a disproportionate number of the recent links to his blog fell into this category.  So the cleanup wound up lowering his blog&#8217;s authority more significantly.</p><p>Again, thanks to Dave for his patience and problem reports.</p> ]]></content:encoded> </item> <item><title>By: David Dalka</title><link>http://www.daviddalka.com/createvalue/2007/05/31/technorati-suffers-major-data-accuracy-loss/comment-page-1/#comment-30709</link> <dc:creator>David Dalka</dc:creator> <pubDate>Thu, 31 May 2007 18:30:05 +0000</pubDate> <guid
isPermaLink="false">http://www.daviddalka.com/createvalue/2007/05/31/technorati-suffers-major-data-accuracy-loss/#comment-30709</guid> <description>Google does this all the time. They shouldn&#039;t do it either, but I find it hard for this to be anywhere near the top of the list of things Technorati needs to fix most urgently.</description> <content:encoded><![CDATA[<p>Google does this all the time. They shouldn&#8217;t do it either, but I find it hard for this to be anywhere near the top of the list of things Technorati needs to fix most urgently.</p> ]]></content:encoded> </item> <item><title>By: Mike Maddaloni</title><link>http://www.daviddalka.com/createvalue/2007/05/31/technorati-suffers-major-data-accuracy-loss/comment-page-1/#comment-30708</link> <dc:creator>Mike Maddaloni</dc:creator> <pubDate>Thu, 31 May 2007 17:18:50 +0000</pubDate> <guid
isPermaLink="false">http://www.daviddalka.com/createvalue/2007/05/31/technorati-suffers-major-data-accuracy-loss/#comment-30708</guid> <description>So they&#039;re testing features on the live system?!
What, in your opinion, should be the volatility percentage for links?
mp/m</description> <content:encoded><![CDATA[<p>So they&#8217;re testing features on the live system?!</p><p>What, in your opinion, should be the volatility percentage for links?</p><p>mp/m</p> ]]></content:encoded> </item> <item><title>By: Pete Prestipino</title><link>http://www.daviddalka.com/createvalue/2007/05/31/technorati-suffers-major-data-accuracy-loss/comment-page-1/#comment-30706</link> <dc:creator>Pete Prestipino</dc:creator> <pubDate>Thu, 31 May 2007 14:59:56 +0000</pubDate> <guid
isPermaLink="false">http://www.daviddalka.com/createvalue/2007/05/31/technorati-suffers-major-data-accuracy-loss/#comment-30706</guid> <description>I noticed this yesterday morning too Dave. I expected a few bugs with all of T&#039;s rollouts, however, this is over the top.</description> <content:encoded><![CDATA[<p>I noticed this yesterday morning too Dave. I expected a few bugs with all of T&#8217;s rollouts, however, this is over the top.</p> ]]></content:encoded> </item> </channel> </rss>
