<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>BI Monkey &#187; Analysis Services</title>
	<atom:link href="http://www.bimonkey.com/tag/analysis-services/feed/" rel="self" type="application/rss+xml" />
	<link>http://www.bimonkey.com</link>
	<description>James Beresford on Microsoft BI and Consulting in Sydney, Australia</description>
	<lastBuildDate>Mon, 06 Sep 2010 00:59:17 +0000</lastBuildDate>
	<generator>http://wordpress.org/?v=2.8.3</generator>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
			<item>
		<title>Managing your history data</title>
		<link>http://www.bimonkey.com/2010/05/managing-your-history-data/</link>
		<comments>http://www.bimonkey.com/2010/05/managing-your-history-data/#comments</comments>
		<pubDate>Tue, 11 May 2010 00:11:38 +0000</pubDate>
		<dc:creator>BI Monkey</dc:creator>
				<category><![CDATA[Analysis Services]]></category>
		<category><![CDATA[Integration Services]]></category>
		<category><![CDATA[SQL Server]]></category>

		<guid isPermaLink="false">http://www.bimonkey.com/?p=762</guid>
		<description><![CDATA[This post is to an extent a small rant about some design decisions I have been constrained by on my current project. These decisions were made predominantly for one fairly bad reason: it made the architect&#8217;s life easier (apologies to the architects if they are reading &#8211; but these were bad choices!)
The design choices in question [...]]]></description>
			<content:encoded><![CDATA[<p>This post is to an extent a small rant about some design decisions I have been constrained by on my current project. These decisions were made predominantly for one fairly bad reason: it made the architect&#8217;s life easier (apologies to the architects if they are reading &#8211; but these were bad choices!)</p>
<p>The design choices in question are around the managing of history data. In one component of the system it relates to Database storage design, the other relates to Cube storage design. In both cases the history data is stored in a separate location to the &#8220;current&#8221; data.</p>
<h2>Databases: Why separate history tables are a bad idea</h2>
<p>The first &#8211; and most compelling &#8211; reason for not storing your history data in separate tables to your current tables is that it increases complexity for users. Instead of having one location to look for data, your users now have to use two.</p>
<p>The second compelling reason is that there is no point to doing this from a storage point of view. SQL 2005 &amp; 2008 (<span style="text-decoration: underline;">Enterprise</span> editions only, admittedly) provide <a title="Partitioned Tables and Indexes in SQL Server 2005" href="http://msdn.microsoft.com/en-us/library/ms345146(SQL.90).aspx">partitioning</a>. This enables the contents of an individual table to be stored in separate locations on different <a title="Using Files and Filegroups" href="http://msdn.microsoft.com/en-us/library/ms187087.aspx">filegroups</a>. This means that you can store your current days data in one location and your history in a different one. The reason for doing this is the same as splitting it into separate tables &#8211; that querying the current section will be faster than the historic section.  In theory queries against partitioned tables should in fact be faster as the current data is now no longer in the same filegroup as the history data.</p>
<p>Now, there is an overhead associated with designing and maintaining partitions but I don&#8217;t see that it is significantly larger than that required to deal with the process required to archive data into separate tables on a daily basis. Additionally when maintaining separate history tables, you need to separate out every single table, whether it gets 10 rows a day or 10 million. With partitioning you can just target the large tables that need that focus.</p>
<p>There are other downsides to maintaining separate tables. If you make a change to a table design, you need to do it in 2 places.  You also need to remember to update your history processes. If your history process fails, you can end up with users getting unexpected query results or ETL process failures when the system loads the next day&#8217;s data into the current tables &#8211;  and untangling it becomes a real mess. If your partition processes fail to run, you just have too much data in one filegroup for a while &#8211; unlikely to be fatal.</p>
<p>So if you have large tables you need to split out for performance purposes &#8211; do it at the back end, using the power of the database &#8211; which <strong>is</strong> designed to store data efficiently. Keep it away from the users &#8211; they neither need to know or care about your need to keep the data separate. If you want to give them a single object to query with the current day&#8217;s data, just use views.</p>
<h2>Cubes: Why a separate history Cube is a bad idea</h2>
<p>Much of the above applies here &#8211; <a title="Managing Analysis Services Partitions" href="http://msdn.microsoft.com/en-us/library/ms175604.aspx">SSAS also has partitions</a> &#8211; so you can again store your historic and current data in separate physical locations with the users being totally unaware of this. Again there is overhead in maintenance, but this will also balance out with the maintenance and risks associated with maintaining two identical cubes that only differ in terms of data source.</p>
<h2>Use your storage options!</h2>
<p>So without banging on about the same things any further, please consider the following two points whenever considering managing your history data:</p>
<ol>
<li>How does what i&#8217;m planning affect my users?</li>
<li>How does what i&#8217;m planning leverage the platforms capabilities?</li>
</ol>
]]></content:encoded>
			<wfw:commentRss>http://www.bimonkey.com/2010/05/managing-your-history-data/feed/</wfw:commentRss>
		<slash:comments>3</slash:comments>
		</item>
		<item>
		<title>Cannot View Data Mining Model in BIDS &#8211; function does not exist</title>
		<link>http://www.bimonkey.com/2009/06/cannot-view-data-mining-model-in-bids-function-does-not-exist/</link>
		<comments>http://www.bimonkey.com/2009/06/cannot-view-data-mining-model-in-bids-function-does-not-exist/#comments</comments>
		<pubDate>Thu, 04 Jun 2009 02:08:56 +0000</pubDate>
		<dc:creator>BI Monkey</dc:creator>
				<category><![CDATA[Analysis Services]]></category>
		<category><![CDATA[Data Mining]]></category>

		<guid isPermaLink="false">http://www.bimonkey.com/?p=315</guid>
		<description><![CDATA[I&#8217;d been running some Naive Bayes Data Mining models without problems as part of initiating a Data Mining exercise, so it was time to move on and cut the data some different ways. So I set up a Decision Tree model and it processed fine, but when I tried to view it a message box [...]]]></description>
			<content:encoded><![CDATA[<p>I&#8217;d been running some Naive Bayes Data Mining models without problems as part of initiating a Data Mining exercise, so it was time to move on and cut the data some different ways. So I set up a Decision Tree model and it processed fine, but when I tried to view it a message box appeared telling me it wasn&#8217;t going to co-operate:</p>
<p><span style="color: #ff0000;">The tree graph cannot be created because of the following error:</span></p>
<p><span style="color: #ff0000;">&#8216;Query (1,6) The </span></p>
<p><span style="color: #ff0000;">&#8216;[System].[Microsoft].[AnalysisServices].[System].[DataMining].[DecisionTrees].[GetTreeScores] function does not exist.&#8217;.</span></p>
<p><a title="Error while trying to load the mining model in the mining model viewer" href="http://social.msdn.microsoft.com/forums/en-US/sqldatamining/thread/2b51695a-460f-4bfd-8502-fc2fe8b15a63/ ">Fortunately someone had hit this before</a>, as the solution is rather obscure. The install I am working against is non-standard, being split across two drives. What had happened is the path for the Data Mining dll&#8217;s set up in the install process didn&#8217;t actually match where they were placed.</p>
<p>So when I looked under the assembly location &#8211; SSMS &gt; AS Server &gt; Assemblies &gt; System &gt; Properties, the Source Path referenced a dll that didn&#8217;t actually exist &#8211; so it appears this incorrect path does not raise an error when trying to start the server. To fix it, I located <span>located where the dll really was, </span>then updated the config files where this path is stored &#8211; <span><span style="color: #800080;">System.0.asm.xml</span> and V<span style="color: #800080;">BAMDX.0.asm.xml</span> &#8211; to point to that path.</span></p>
<p><span>A restart of the server and the models reprocessed and I could happily view the output!<br />
</span></p>
]]></content:encoded>
			<wfw:commentRss>http://www.bimonkey.com/2009/06/cannot-view-data-mining-model-in-bids-function-does-not-exist/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Kilimanjaro, Projects Gemini and Madison Webcast</title>
		<link>http://www.bimonkey.com/2009/05/kilimanjaro-projects-gemini-and-madison-webcast/</link>
		<comments>http://www.bimonkey.com/2009/05/kilimanjaro-projects-gemini-and-madison-webcast/#comments</comments>
		<pubDate>Sat, 09 May 2009 05:06:25 +0000</pubDate>
		<dc:creator>BI Monkey</dc:creator>
				<category><![CDATA[Microsoft BI]]></category>
		<category><![CDATA[SQL Server]]></category>
		<category><![CDATA[Analysis Services]]></category>
		<category><![CDATA[Excel]]></category>
		<category><![CDATA[Kilimanjaro]]></category>
		<category><![CDATA[Project Gemini]]></category>
		<category><![CDATA[Project Madison]]></category>

		<guid isPermaLink="false">http://www.bimonkey.com/?p=134</guid>
		<description><![CDATA[For those who haven&#8217;t seen much of Project Gemini but have heard the buzz, this TechNet Webcast: An Early look at SQL Server ‘Kilimanjaro’ and project ‘Madison’ &#8211; will give you a good insight. It also has some features on reusable Reporting Services components which look very impressive and info on Project Madison, which provides [...]]]></description>
			<content:encoded><![CDATA[<p>For those who haven&#8217;t seen much of Project Gemini but have heard the buzz, this <span><span>TechNet</span></span> <span><span>Webcast</span></span>: <a title="TechNet Webcast: An Early look at SQL Server ‘Kilimanjaro’ and project ‘Madison’ (Level 100)" href="http://msevents.microsoft.com/CUI/WebCastEventDetails.aspx?EventID=1032413071&amp;EventCategory=5&amp;culture=en-US&amp;CountryCode=US">An Early look at SQL Server ‘Kilimanjaro’ and project ‘Madison’</a> &#8211; will give you a good insight. It also has some features on reusable Reporting Services components which look very impressive and info on Project Madison, which provides scalability features. Registration as usual is a pain, and forget trying to use the site using any browser other than IE &#8211; I wish Microsoft would make their content easier to access.</p>
<p>Anyway, onto the <span><span>webcast</span></span> &#8211; about the 1st quarter of the <span><span>webcast</span></span> is the usual generic <span><span>roadmap</span></span> blurb, but then the presenter gets into the real meat of Gemini &#8211; an Excel based &#8216;in memory&#8217; analysis tool that allows joining between entities without having to know about such things, <span><span>superfast</span></span> analytics &#8211; pivoting, calculation, charts etc. and then being able to publish to <span><span>Sharepoint</span></span>. From an OLAP analysis point of view, the Pivot Tables also has slicers (effctively table wide filters) displayed in the spreadsheet as well, and it would be good if that made it into Pivot Tables generally in the next release of Office. It looks like an incredible tool and very easy to use &#8211; and may be a powerful step towards the realisation of the &#8216;BI for the masses&#8217; vision. The presenter did let slip one weakness though &#8211; much has been made of the 100 million rows of data demo &#8211; but that data still has to be loaded into memory first and will still take significant time. I also suspect that how successful Gemini will be is going to depend on how much it will rely on good data structures being in place in an organisation to support it. The Data Warehouse is going to remain the core part of  any succesful BI delivery.</p>
<p>The next component of interest was the reusable Reporting Services components &#8211; there is the concept of a library of components that can be built &#8211; e.g. standard charts, logos, gauges etc &#8211; and then dropped into any report, either by a developer or a user in Report Builder 2.0.  What really grabbed my attention is that these components are version aware &#8211; i.e. if the library version of the component is updated then if you reopen the report in design mode it will let you know and give you the option to update. Again this points to &#8216;BI for the masses&#8217; as you can have developers create some great components which any user can then drop in to their home grown reports. Plus as any developer knows, there&#8217;s a lot of repetition and any options for code re-use are always appreciated.</p>
<p>Finally, Project Madison was covered &#8211; and seems more about scalability up to multi-terabyte warehouses. It was a bit infrastructure focused for me so most of it passed me by, but clearly Microsoft are stepping up to try and address the market perception that they can&#8217;t scale.</p>
<p>Anyway, this all will be dropped in late 2010 as Kilimanjaro &#8211; an interim release of SQL Server.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.bimonkey.com/2009/05/kilimanjaro-projects-gemini-and-madison-webcast/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Microsoft&#8217;s secret forecasting tool &#8211; the Office Suite</title>
		<link>http://www.bimonkey.com/2009/04/microsofts-secret-forecasting-tool-the-office-suite/</link>
		<comments>http://www.bimonkey.com/2009/04/microsofts-secret-forecasting-tool-the-office-suite/#comments</comments>
		<pubDate>Thu, 16 Apr 2009 23:51:32 +0000</pubDate>
		<dc:creator>BI Monkey</dc:creator>
				<category><![CDATA[Microsoft BI]]></category>
		<category><![CDATA[Access]]></category>
		<category><![CDATA[Analysis Services]]></category>
		<category><![CDATA[Data Mining]]></category>
		<category><![CDATA[Excel]]></category>
		<category><![CDATA[Office Suite]]></category>
		<category><![CDATA[Project Gemini]]></category>

		<guid isPermaLink="false">http://www.bimonkey.com/?p=98</guid>
		<description><![CDATA[Last night I attended an IAPA presentation on basic forecasting concepts and the tools used, presented by the ever interesting Eugene Dubossarsky (of Presciient, an analytics consultancy).  I will skip over the forecasting content as for the Microsoft BI community, the interesting part is which tool he used for most basic forecasting activities. It was [...]]]></description>
			<content:encoded><![CDATA[<p>Last night I attended an <a title="Institute of Analytics Professionals of Australia" href="http://www.iapa.org.au" target="_blank">IAPA</a> presentation on basic forecasting concepts and the tools used, presented by the ever interesting Eugene Dubossarsky (of <a title="Presciient is an independent consulting company that provides advisory and analytical services to businesses and government agencies" href="http://presciient.com/" target="_blank">Presciient</a>, an analytics consultancy).  I will skip over the forecasting content as for the <a title="Microsoft Business Intelligence" href="http://www.microsoft.com/BI/" target="_blank">Microsoft BI</a> community, the interesting part is which tool he used for most basic forecasting activities. It was <a title="Microsoft Excel" href="http://office.microsoft.com/excel">Excel</a>. Then, when he needed to do more advanced work, he used &#8211;  <a title="Microsoft Excel" href="http://office.microsoft.com/excel">Excel</a>. Only when he needed to do trickier stuff with larger amounts of data did he pull in a more heavyweight tool &#8211; <a title="Microsoft Access" href="http://office.microsoft.com/access">Access</a>.</p>
<p>That&#8217;s right &#8211; the office suite covers the majority of forecaster&#8217;s needs. <a title="SQL Server 2008 Overview, data platform, store data" href="http://www.microsoft.com/SQL/default.mspx">SQL Server</a> and <a title="Analysis Services" href="http://www.microsoft.com/sqlserver/2008/en/us/Analysis-Services.aspx">Analysis Services</a> didn&#8217;t get a look in until the really heavyweight analytics processes began. For his purposes however, Eugene much prefers <a title="The R Project for Statistical Computing" href="http://www.r-project.org/" target="_blank">R</a>, an open source stats program that is free, very powerful and now a serious competitor to <a title="SAS | Business Intelligence Software and Predictive Analytics" href="http://www.sas.com/" target="_blank">SAS </a>- much to their annoyance. Microsoft are rumoured to be talking to the people behind R, and an acquisition would make sense for both sides &#8211; R is not user friendly, which Microsoft could provide help with &#8211; and adding the capabilities of R would allow Microsoft to take a slug at SAS&#8217;s BI market.</p>
<p>So, this shows that most users <strong>still </strong>aren&#8217;t fully aware of, let alone using Excel&#8217;s capabilites &#8211; otherwise they wouldn&#8217;t be paying analytics consultants to to use it for them. Microsoft are always pushing Excel further, so now i&#8217;ll cover two features of Excel that the power users may not be aware of. It&#8217;s easy to forget sometimes that the 2007 Office suite wasn&#8217;t just a new, pretty interface &#8211; it also added huge BI capabilities.</p>
<p><strong>The Data Mining Add-In for Excel</strong> (download for SQL Server <a title="SQL Server 2008 Data Mining Add-In for Excel 2007" href="http://www.microsoft.com/downloads/details.aspx?displaylang=en&amp;FamilyID=896a493a-2502-4795-94ae-e00632ba6de7">2008</a> or <a title="SQL Server 2005 Data Mining Add-In for Excel 2007" href="http://www.microsoft.com/downloads/details.aspx?displaylang=en&amp;FamilyID=7c76e8df-8674-4c3b-a99b-55b17f3c4c51">2005</a>)</p>
<p>This Add-In allows you to leverage the <a title="Microsoft SQL Server 2008: Data Mining" href="http://www.microsoft.com/sqlserver/2008/en/us/data-mining.aspx" target="_blank">Data Mining</a> capabilities of Analysis Services through Excel. It allows you to use Excel as the front end for creating and working with Data Mining models that exist on your server. However what really makes it interesting for Excel users is that it allows you to perform Data Mining <em>on your spreadsheet data</em>.</p>
<p>There is a Virtual Lab <a title="MSDN Virtual Lab: Using the Microsoft SQL Server 2005 Data Mining Add-Ins for the 2007 Microsoft Office System" href="http://msevents.microsoft.com/CUI/WebCastEventDetails.aspx?culture=en-US&amp;EventID=1032346458&amp;EventCategory=3" target="_blank">here</a> explaining and demonstrating their use.</p>
<p><strong>Project Gemini</strong></p>
<p>This feature is slated for the next release of Excel, and is an in-memory tool for analysing large amounts of data in an OLAP style, but without all the fiddly data modelling normally required. It is a clear slug at other players in the in-Memory market, such as QlikTech. The models created will also be able to be ported back to SSAS with minimum effort as well. For more details read <a title="Project Gemini — Microsoft’s Brilliant OLAP Trojan Horse" href="http://www.olapreport.com/Comment_Gemini.htm">this commentary</a> from the OLAP Report.</p>
<p>Microsoft has one of the most powerful BI Tools in the world in Excel, users just need to be made aware!</p>
]]></content:encoded>
			<wfw:commentRss>http://www.bimonkey.com/2009/04/microsofts-secret-forecasting-tool-the-office-suite/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>SSAS Training Resource</title>
		<link>http://www.bimonkey.com/2009/04/ssas-training-resource/</link>
		<comments>http://www.bimonkey.com/2009/04/ssas-training-resource/#comments</comments>
		<pubDate>Wed, 15 Apr 2009 07:37:54 +0000</pubDate>
		<dc:creator>BI Monkey</dc:creator>
				<category><![CDATA[Analysis Services]]></category>
		<category><![CDATA[Professional Certification and Training]]></category>
		<category><![CDATA[Training Sites]]></category>

		<guid isPermaLink="false">http://www.bimonkey.com/?p=95</guid>
		<description><![CDATA[I have added a link to Craig Utley&#8217;s excellent SSAS training video resource site LearnMicrosoftBI.com, which contains training videos on a variety of subjects in SSAS &#8211; dimensional modelling, Actions and the one I found most useful explaining the tricky but critical subject of Attribute Relationships (Video SSAS 109). Recommended for anyone starting out in [...]]]></description>
			<content:encoded><![CDATA[<p>I have added a link to Craig Utley&#8217;s excellent SSAS training video resource site <a title="Craing Utley's LearnMicrosoftBI.com" href="http://www.learnmicrosoftbi.com/">LearnMicrosoftBI.com</a>, which contains training videos on a variety of subjects in SSAS &#8211; dimensional modelling, Actions and the one I found most useful explaining the tricky but critical subject of Attribute Relationships (Video SSAS 109). Recommended for anyone starting out in SSAS or needing concepts clarifying.</p>
<p>Registration is required to download the videos (not sure why) &#8211; but it seems to generate no spam so not a big issue, and the content is very high quality for free content.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.bimonkey.com/2009/04/ssas-training-resource/feed/</wfw:commentRss>
		<slash:comments>2</slash:comments>
		</item>
	</channel>
</rss>
