<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>BI Monkey &#187; DSV</title>
	<atom:link href="http://www.bimonkey.com/tag/dsv/feed/" rel="self" type="application/rss+xml" />
	<link>http://www.bimonkey.com</link>
	<description>James Beresford on Microsoft BI and Consulting in Sydney, Australia</description>
	<lastBuildDate>Mon, 06 Sep 2010 00:59:17 +0000</lastBuildDate>
	<generator>http://wordpress.org/?v=2.8.3</generator>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
			<item>
		<title>A caution on using Dimensional DSVs in Data Mining &#8211; part 2</title>
		<link>http://www.bimonkey.com/2009/06/a-caution-on-using-dimensional-dsvs-in-data-mining-part-2/</link>
		<comments>http://www.bimonkey.com/2009/06/a-caution-on-using-dimensional-dsvs-in-data-mining-part-2/#comments</comments>
		<pubDate>Thu, 18 Jun 2009 02:08:31 +0000</pubDate>
		<dc:creator>BI Monkey</dc:creator>
				<category><![CDATA[Data mining]]></category>
		<category><![CDATA[Data Mining]]></category>
		<category><![CDATA[DSV]]></category>

		<guid isPermaLink="false">http://www.bimonkey.com/?p=343</guid>
		<description><![CDATA[As a followup to this post I have found that not only does using a table external to the one being mined to provide a grouping fail to actually group within the model, it also confuses the Mining Legend in the Mining Model Viewer.
What I was seeing in the Mining Legend for a node in [...]]]></description>
			<content:encoded><![CDATA[<p>As a followup to <a title="A caution on using Dimensional DSVs in Data Mining" href="http://www.bimonkey.com/2009/06/a-caution-on-using-dimensional-dsvs-in-data-mining/">this post</a> I have found that not only does using a table external to the one being mined to provide a grouping fail to actually group within the model, it also confuses the Mining Legend in the Mining Model Viewer.</p>
<p>What I was seeing in the Mining Legend for a node in a Decision Tree was like this:</p>
<p>Total Cases: 100</p>
<p>Category A: 10 Cases</p>
<p>Category B: 25 Cases</p>
<p>Category C: 0 Cases</p>
<p>Category D: 9 Cases</p>
<p>&#8230; so the Total cases and the cases displayed didn&#8217;t tie up. By digging further using the Microsoft Mining Content Viewer and looking at the NODE_DISTRIBUTION I saw that there were multiple rows for the categories, and the Mining Legend was just picking one of those values.</p>
<p>So if you find youself looking at a node and wondering why the numbers don&#8217;t add up &#8211; it&#8217;s because your grouping hasn&#8217;t been used by the model.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.bimonkey.com/2009/06/a-caution-on-using-dimensional-dsvs-in-data-mining-part-2/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>A caution on using Dimensional DSVs in Data Mining</title>
		<link>http://www.bimonkey.com/2009/06/a-caution-on-using-dimensional-dsvs-in-data-mining/</link>
		<comments>http://www.bimonkey.com/2009/06/a-caution-on-using-dimensional-dsvs-in-data-mining/#comments</comments>
		<pubDate>Mon, 15 Jun 2009 23:23:39 +0000</pubDate>
		<dc:creator>BI Monkey</dc:creator>
				<category><![CDATA[Data mining]]></category>
		<category><![CDATA[Data Mining]]></category>
		<category><![CDATA[DSV]]></category>

		<guid isPermaLink="false">http://www.bimonkey.com/?p=338</guid>
		<description><![CDATA[If you are using a dimensional-style DSV in a Data Mining project, such as below:
Be aware that if you include a column from a Dimension table in your Mining Structure, the model will actually identify each key entry on the source table as a distinct value, rather than each distinct value in the Dimension table. [...]]]></description>
			<content:encoded><![CDATA[<div class="mceTemp">If you are using a dimensional-style DSV in a Data Mining project, such as below:</div>
<div class="wp-caption alignnone" style="width: 393px"><img title="A Dimensional DSV" src="http://www.bimonkey.com/uploads/ssas/dsv1.jpg" alt="b" width="383" height="223" /><p class="wp-caption-text">Fig 1: A Dimensional DSV</p></div>
<p>Be aware that if you include a column from a Dimension table in your Mining Structure, the model will actually identify each key entry on the <em>source</em> table as a distinct value, rather than each distinct value in the Dimension table. I found this out because I added a grouping category to one of my dimensional tables &#8211; a simple high &#8211; medium &#8211; low group &#8211; and there were multiple values in the attribute states for each grouping, as below:</p>
<div class="wp-caption alignnone" style="width: 218px"><img title="Mining Legend" src="http://www.bimonkey.com/uploads/ssas/mininglegend1.jpg" alt="b" width="208" height="157" /><p class="wp-caption-text">Fig 2: Mining Legend</p></div>
<p>To work around this you will need to add a Named Calculation to get the group on the main table, or convert the main table to a Named Query.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.bimonkey.com/2009/06/a-caution-on-using-dimensional-dsvs-in-data-mining/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
		</item>
	</channel>
</rss>
