<?xml version="1.0" encoding="UTF-8"?><rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
		>
<channel>
	<title>Comments on: The Fuzzy Lookup Transformation</title>
	<atom:link href="http://www.bimonkey.com/2009/06/the-fuzzy-lookup-transformation/feed/" rel="self" type="application/rss+xml" />
	<link>http://www.bimonkey.com/2009/06/the-fuzzy-lookup-transformation/</link>
	<description>James Beresford on Microsoft BI and Consulting in Sydney, Australia</description>
	<lastBuildDate>Mon, 06 Feb 2012 16:00:48 +0000</lastBuildDate>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=</generator>
	<item>
		<title>By: BI Monkey</title>
		<link>http://www.bimonkey.com/2009/06/the-fuzzy-lookup-transformation/comment-page-1/#comment-14377</link>
		<dc:creator>BI Monkey</dc:creator>
		<pubDate>Mon, 12 Dec 2011 02:21:05 +0000</pubDate>
		<guid isPermaLink="false">http://www.bimonkey.com/?p=346#comment-14377</guid>
		<description>@Mai - use an OLE DB transformation to update the target row</description>
		<content:encoded><![CDATA[<p>@Mai &#8211; use an OLE DB transformation to update the target row</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Mai Elsayed</title>
		<link>http://www.bimonkey.com/2009/06/the-fuzzy-lookup-transformation/comment-page-1/#comment-14359</link>
		<dc:creator>Mai Elsayed</dc:creator>
		<pubDate>Sun, 11 Dec 2011 14:05:34 +0000</pubDate>
		<guid isPermaLink="false">http://www.bimonkey.com/?p=346#comment-14359</guid>
		<description>Please can anyone tell me how i update records in the refrence table depend on their matches on the source table ?</description>
		<content:encoded><![CDATA[<p>Please can anyone tell me how i update records in the refrence table depend on their matches on the source table ?</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: BI Monkey</title>
		<link>http://www.bimonkey.com/2009/06/the-fuzzy-lookup-transformation/comment-page-1/#comment-12654</link>
		<dc:creator>BI Monkey</dc:creator>
		<pubDate>Tue, 04 Oct 2011 21:36:36 +0000</pubDate>
		<guid isPermaLink="false">http://www.bimonkey.com/?p=346#comment-12654</guid>
		<description>ashu.. if the company name data is reliable use a lookup. If it&#039;s inconsistent use a Fuzzy Lookup.</description>
		<content:encoded><![CDATA[<p>ashu.. if the company name data is reliable use a lookup. If it&#8217;s inconsistent use a Fuzzy Lookup.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: ashu</title>
		<link>http://www.bimonkey.com/2009/06/the-fuzzy-lookup-transformation/comment-page-1/#comment-12636</link>
		<dc:creator>ashu</dc:creator>
		<pubDate>Tue, 04 Oct 2011 06:05:54 +0000</pubDate>
		<guid isPermaLink="false">http://www.bimonkey.com/?p=346#comment-12636</guid>
		<description>hi..
i need to do same thing 
plz help i have created one ssis package.. i want to insert price into table checking based on company name column .....</description>
		<content:encoded><![CDATA[<p>hi..<br />
i need to do same thing<br />
plz help i have created one ssis package.. i want to insert price into table checking based on company name column &#8230;..</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: CozyRoc</title>
		<link>http://www.bimonkey.com/2009/06/the-fuzzy-lookup-transformation/comment-page-1/#comment-10817</link>
		<dc:creator>CozyRoc</dc:creator>
		<pubDate>Fri, 29 Jul 2011 21:58:44 +0000</pubDate>
		<guid isPermaLink="false">http://www.bimonkey.com/?p=346#comment-10817</guid>
		<description>Just wanted to mention apparently there is a bug in the Fuzzy Lookup component, which appears on 64bit systems. I have lost a couple of hours banging my head with this. Please vote it is important for MS to fix this issue here:

https://connect.microsoft.com/SQLServer/feedback/details/488387/fuzzy-lookup-triggers-sqldumper-even-with-very-small-dataset-with-run64bitruntime-set-to-false

The problem is not new because I was able to reproduce it even under SQL 2005 SP3.</description>
		<content:encoded><![CDATA[<p>Just wanted to mention apparently there is a bug in the Fuzzy Lookup component, which appears on 64bit systems. I have lost a couple of hours banging my head with this. Please vote it is important for MS to fix this issue here:</p>
<p><a href="https://connect.microsoft.com/SQLServer/feedback/details/488387/fuzzy-lookup-triggers-sqldumper-even-with-very-small-dataset-with-run64bitruntime-set-to-false" rel="nofollow">https://connect.microsoft.com/SQLServer/feedback/details/488387/fuzzy-lookup-triggers-sqldumper-even-with-very-small-dataset-with-run64bitruntime-set-to-false</a></p>
<p>The problem is not new because I was able to reproduce it even under SQL 2005 SP3.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Darren Green</title>
		<link>http://www.bimonkey.com/2009/06/the-fuzzy-lookup-transformation/comment-page-1/#comment-3645</link>
		<dc:creator>Darren Green</dc:creator>
		<pubDate>Mon, 13 Dec 2010 11:05:04 +0000</pubDate>
		<guid isPermaLink="false">http://www.bimonkey.com/?p=346#comment-3645</guid>
		<description>James, Setting WarmCaches to false is certainly worth trying when you have a large index and only a few rows to try and match. I&#039;ve gained significant benefit on occasions by turning it off. Consider the ratio between the fuzzy index rows (the reference data) and the rows to match (the input rows), the larger this ratio the more likely it is that you’ll get a performance gain. 

It is a bit like the traditional Lookup and the default cache option, it can sometimes spend more time populating the cache than it takes to just run the query direct.</description>
		<content:encoded><![CDATA[<p>James, Setting WarmCaches to false is certainly worth trying when you have a large index and only a few rows to try and match. I&#8217;ve gained significant benefit on occasions by turning it off. Consider the ratio between the fuzzy index rows (the reference data) and the rows to match (the input rows), the larger this ratio the more likely it is that you’ll get a performance gain. </p>
<p>It is a bit like the traditional Lookup and the default cache option, it can sometimes spend more time populating the cache than it takes to just run the query direct.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Fuzzy Thinking &#124; BI Monkey</title>
		<link>http://www.bimonkey.com/2009/06/the-fuzzy-lookup-transformation/comment-page-1/#comment-637</link>
		<dc:creator>Fuzzy Thinking &#124; BI Monkey</dc:creator>
		<pubDate>Wed, 11 Nov 2009 00:19:34 +0000</pubDate>
		<guid isPermaLink="false">http://www.bimonkey.com/?p=346#comment-637</guid>
		<description>[...] Fuzzy Lookup and Fuzzy Grouping transformations &#8211; guides to the practical use of the transformations from me, the BI Monkey [...]</description>
		<content:encoded><![CDATA[<p>[...] Fuzzy Lookup and Fuzzy Grouping transformations &#8211; guides to the practical use of the transformations from me, the BI Monkey [...]</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: BI Monkey</title>
		<link>http://www.bimonkey.com/2009/06/the-fuzzy-lookup-transformation/comment-page-1/#comment-459</link>
		<dc:creator>BI Monkey</dc:creator>
		<pubDate>Thu, 08 Oct 2009 22:52:57 +0000</pubDate>
		<guid isPermaLink="false">http://www.bimonkey.com/?p=346#comment-459</guid>
		<description>Hi Kevin. I haven&#039;t tested the &lt;strong&gt;Exhaustive&lt;/strong&gt; property thoroughly but I found one case where a data set of about 100,000 rows jumped from about a 5 minute to 8 hour processing time when I switched it to True. My gut feeling is the threshold is pretty low, but because of the number of variables in any such process around hardware and match complexity I would just test in your environment and see how much of a drag it adds.

However what I did find is that for some matching scenarios, the additional matches it brought in weren&#039;t all that valuable. How that would apply to your situation will depend on how well the matching algorithm will work with your data. Your best bet is to do a test on a sample of your data and see if the performance overhead is worth it in terms of additional matches obtained.</description>
		<content:encoded><![CDATA[<p>Hi Kevin. I haven&#8217;t tested the <strong>Exhaustive</strong> property thoroughly but I found one case where a data set of about 100,000 rows jumped from about a 5 minute to 8 hour processing time when I switched it to True. My gut feeling is the threshold is pretty low, but because of the number of variables in any such process around hardware and match complexity I would just test in your environment and see how much of a drag it adds.</p>
<p>However what I did find is that for some matching scenarios, the additional matches it brought in weren&#8217;t all that valuable. How that would apply to your situation will depend on how well the matching algorithm will work with your data. Your best bet is to do a test on a sample of your data and see if the performance overhead is worth it in terms of additional matches obtained.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Kevin Moormann</title>
		<link>http://www.bimonkey.com/2009/06/the-fuzzy-lookup-transformation/comment-page-1/#comment-458</link>
		<dc:creator>Kevin Moormann</dc:creator>
		<pubDate>Thu, 08 Oct 2009 21:12:52 +0000</pubDate>
		<guid isPermaLink="false">http://www.bimonkey.com/?p=346#comment-458</guid>
		<description>Have you determined a rule of thumb for how big a data set can be before performance becomes to big of an issue when exhaustive is set to true?</description>
		<content:encoded><![CDATA[<p>Have you determined a rule of thumb for how big a data set can be before performance becomes to big of an issue when exhaustive is set to true?</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Pardeep Dhull</title>
		<link>http://www.bimonkey.com/2009/06/the-fuzzy-lookup-transformation/comment-page-1/#comment-226</link>
		<dc:creator>Pardeep Dhull</dc:creator>
		<pubDate>Sun, 30 Aug 2009 15:42:59 +0000</pubDate>
		<guid isPermaLink="false">http://www.bimonkey.com/?p=346#comment-226</guid>
		<description>Hey thanks man....i ws looking for something like this...simple yet thorough one.</description>
		<content:encoded><![CDATA[<p>Hey thanks man&#8230;.i ws looking for something like this&#8230;simple yet thorough one.</p>
]]></content:encoded>
	</item>
</channel>
</rss>

