SlideShare a Scribd company logo
Business Intelligence Lynn Langit/MSDN Developer Evangelist Southern California http://blogs.msdn.com/SoCalDevGal
Session Prerequisites – Session One Working SQL Server 2005 Developer –OR- Working .Net developers
Session One Objectives and Agenda Understand what, why, when & how of SQL Server 2008 Business Intelligence Examine the core functionality SSAS OLAP cubes and Data Mining Models
What and Why BI? Presentation Exploration Discovery Passive Interactive Proactive Role of Software Business Insight Predictive Analytics Canned reporting Ad-hoc reporting OLAP Data mining
SQL Server 2008 BI & Tools OLTP – SQL Server Engine SSMS / Profiler and other mgmt tools Reporting – SSRS No need for IIS, BIDS / Report Manager to design Integrates with SharePoint ETL – SSIS Part of SQL Server, BIDS to design OLAP – SSAS Multidimensional Cubes,  BIDS / SSMS Data Mining – SSAS Algorithm-based models – BIDS / Excel / SSMS
Why BI? Faster reports OLAP can be 1,000% faster Flexible click to query using pivot tables, add calculated members, create custom views Proactive ‘ discover’ patterns in data, ‘predict’ future Reduce load on OLTP source systems Scalable no manual index tuning, data de-normalization
SQL Server 2008 Languages OLTP – SQL Server Engine T-SQL,  .NET (CLR), XML Reporting – SSRS RDL + queries ETL – SSIS XMLA metadata + queries, .NET extendable OLAP – SSAS MDX, XMLA  Data Mining – SSAS DMX, XMLA, PMML
Cubes vs. Data Mining
Where do I start? Understand OLAP modeling Star schema + grain statements Review AdventureWorks DW sample From www.CodePlex.com Realistically access source data quality Plan for ETL, learn SSIS Leverage Excel Light-weight data mining designer and client OLAP cube pivot table client
BI baked into the MS platform Enterprise – Performance Point Individual – Excel  Project – Share Point
But, I want to build my own app… Don’t under-estimate the learning curve Embed extended controls Consider buying extendable components
Microsoft’s Predictive Analytics Data Mining SQL extensions (DMX) Application Developer Data Mining Specialist Microsoft Dynamics CRM Analytics Foundation SQL Server 2005  Business Intelligence Development Studio Microsoft SQL Server 2008 Analysis Services Information  Worker Data Mining Add-ins for  the 2007 Microsoft Office system Microsoft SQL Server 2008 Data Mining BI Analyst Custom Algorithms
Demo 1 – SSAS Cubes
Data Mining Add-ins for Office 2007 Table Analysis Tools for Excel 2007 Data Mining Template for Visio 2007 Data Mining Client for Excel 2007 Information  Worker BI Analyst Data Mining Specialist
Demo – Data Mining
Microsoft Data Mining Lifecycle  CRISP-DM SSAS (Data Mining) Excel SSAS (DSV) Query Excel SSIS SSAS SSRS Excel Your Apps SSIS SSAS Excel Data www.crisp-dm.org Business Understanding Data Understanding Data Preparation Modeling Evaluation Deployment
DM - From Scenarios to Tasks
From Tasks to Techniques
Understand & Prepare specifics
Modeling Specifics
New to SQL Server 2008 Microsoft Time Series algorithm improved  ARIMA plus ARTxp method, and a blending algorithm  = better results  New prediction mode allows adding new data to time series models Holdout Support added Easily partition data into training and test sets that are stored in mining structure & available to query after processing Ability to build mining models based on filtered subsets added Results in less structures, i.e. can just filter existing Drillthrough functionality extended  makes all mining structure columns available, not just columns included in the model allows you to build more compact models Cross-validation added allows users to quickly validate their modeling approach by automatically building temporary models and evaluating accuracy measures across K folds. The feature is available through a new cross-validation tab under Accuracy Charts in BIDS, in addition to being accessible programmatically via a stored procedure call.
Summary Data Mining in SQL Server 2008 is mature, powerful and accessible Can use Excel 2007 Familiar client for BI – OLAP cubes AND Data Mining models Model Creators /  Users Excel Data or Server Data SSAS  and Excel both support the full DM Cycle Data Understanding Data Preparation Modeling Validation Deployment
DM Webcasts Fri, 02 Nov 2007 MSDN Webcast: Build Smart Web Applications with SQL Server Data Mining (Level 200) Thu, 08 Nov 2007 MSDN Webcast: Building Adaptive Applications with SQL Server Data Mining (Level 300) Mon, 19 Nov 2007 MSDN Webcast: Extending and Customizing SQL Server Data Mining (Level 300) Fri, 30 Nov 2007 MSDN Webcast: Creating Visualizations for SQL Server Data Mining (Level 300) Thu, 01 Nov 2007 TechNet Webcast: Deliver Actionable Insight Throughout Your Organization with Data Mining (Part 1 of 3): Your First Project with SQL Server Data Mining (Level 200) Thu, 15 Nov 2007 TechNet Webcast: Deliver Actionable Insight Throughout Your Organization with Data Mining (Part 2 of 3): Understand SQL Server Data Mining Add-ins for the 2007 Office System (Level 200) Thu, 29 Nov 2007 TechNet Webcast: Deliver Actionable Insight Throughout Your Organization with Data Mining (Part 3 of 3): Use Predictive Intelligence to Create Smarter KPIs (Level 200)
BI Resources from Lynn Langit 1. “Foundations of SQL Server 2005 Business Intelligence” (published by APress in April 2007) 2.   http://blogs.msdn.com/SoCalDevGal 3.   “Building Business Intelligence Solutions with SQL Server 2008”  (MSPress Fall 2008)
DM Resources Technical Communities, Webcasts, Blogs, Chats & User Groups http://www.microsoft.com/communities/default.mspx   Microsoft Developer Network (MSDN) & TechNet  http://microsoft.com/msdn   http://microsoft.com/technet   Trial Software and Virtual Labs http://www.microsoft.com/technet/downloads/trials/default.mspx   Microsoft Learning and Certification http://www.microsoft.com/learning/default.mspx   SQL Server Data Mining http://www.sqlserverdatamining.com http://www.microsoft.com/bi/bicapabilities/data-mining.aspx
END – END - END End of first set
Business Intelligence Lynn Langit/MSDN Developer Evangelist Southern California http://blogs.msdn.com/SoCalDevGal
Session Prerequisites – Session Two Working SQL Server 2005 Developer Understanding of OLAP concepts Working SQL Server Analysis Server 2005 Developer Interest in or basic knowledge of Data Mining concepts
Session Two Objectives and Agenda Understand what’s new SQL Server 2008 Business Intelligence SSAS OLAP cubes SSAS Data Mining Structures
Demo – Simplified Cube / Dim Wizards
Demo – New Aggregation Designer
Data Mining Are you using it now?
Data Mining – Logical Model algorithm Mining Model Mining Model Training Data DB data Client data Application data Data Mining Engine To Predict Predicted Data Mining Model DB data Client data Application data “ Just one row ” Data Mining Engine
Evaluation Specifics
Data Mining - Physical Model Analysis Services Server Mining Model Data Mining Algorithm Data Source Your Application OLE DB/ ADOMD/ XMLA Deploy BI Dev Studio  (Visual Studio) App Data
Data Mining Interfaces – APIs XMLA Over TCP/IP XMLA Over HTTP Analysis Server (msmdsrv.exe) OLAP Data Mining Server ADOMD.NET .Net Stored Procedures Microsoft Algorithms Third Party Algorithms OLEDB for OLAP/DM ADO/DSO Any Platform, Any Device C++ App VB App .Net App AMO Any App ADOMD.NET WAN DM Interfaces
Configuration Model Creation/Management Database Administrators Session Mining Models Model Application Permissions on models Permissions on data sources
Deployment Browse Copy to Excel Drillthrough Query Default Advanced Excel Services Manage models and structures Export/Import Rename Connection Database Trace
Excel Functions* DMPREDICTTABLEROW  ( Connection, ModelName,   PredictionResult, TableRowRange [, string CommaSeparatedColumnNames] ) DMPREDICT  ( Connection, Model, PredictionResult, Value1, Name1,   [...,Value32, Name32] ) DMCONTENTQUERY  (Connection, Model, PredictionResult [, WhereClause])
Data Mining Extensions (DMX)  CREATE MINING MODEL  CreditRisk (CustID   LONG KEY, Gender  TEXT DISCRETE, Income    LONG CONTINUOUS, Profession  TEXT DISCRETE, Risk   TEXT DISCRETE PREDICT) USING  Microsoft_Decision_Trees INSERT INTO   CreditRisk  (CustId, Gender, Income, Profession, Risk) Select  CustomerID, Gender, Income, Profession,Risk From Customers Select  NewCustomers.CustomerID, CreditRisk.Risk,  PredictProbability(CreditRisk.Risk) FROM  CreditRisk  PREDICTION JOIN  NewCustomers ON   CreditRisk.Gender=NewCustomer.Gender   AND  CreditRisk.Income=NewCustomer.Income AND  CreditRisk.Profession=NewCustomer.Profession
DMX Column Expressions Predictable Columns Source Data Columns Functions -  Predict “ Workhorse” Discrete scalar values Continuous scalar values Associative nested tables Sequence nested tables Time Series Overloaded to PredictAssociation PredictSequence PredictTimeSeries PredictProbability PredictSupport PredictHistogram Cluster ClusterProbability GetNodeId IsInNode Arithmetic operators Stored Procedure Subselect Select from nested tables
Data Mining Interfaces – XMLA ++ XMLA Over TCP/IP XMLA Over HTTP Analysis Server (msmdsrv.exe) OLAP Data Mining Server ADOMD.NET .Net Stored Procedures Microsoft Algorithms Third Party Algorithms OLEDB for OLAP/DM ADO/DSO Any Platform, Any Device C++ App VB App .Net App AMO Any App ADOMD.NET WAN DM Interfaces
New to SQL Server 2008 Microsoft Time Series algorithm improved  ARIMA plus ARTxp method, and a blending algorithm  = better results  New prediction mode allows adding new data to time series models Holdout Support added Easily partition data into training and test sets that are stored in mining structure & available to query after processing Ability to build mining models based on filtered subsets added Results in less structures, i.e. can just filter existing Drillthrough functionality extended  makes all mining structure columns available, not just columns included in the model allows you to build more compact models Cross-validation added allows users to quickly validate their modeling approach by automatically building temporary models and evaluating accuracy measures across K folds. The feature is available through a new cross-validation tab under Accuracy Charts in BIDS, in addition to being accessible programmatically via a stored procedure call.
Summary Data Mining in SQL Server 2008 is mature, powerful and accessible Can use Excel 2007 Familiar client for BI – OLAP cubes AND Data Mining models Model Creators /  Users Excel Data or Server Data SSAS  and Excel both support the full DM Cycle Data Understanding Data Preparation Modeling Validation Deployment
DM Webcasts Fri, 02 Nov 2007 MSDN Webcast: Build Smart Web Applications with SQL Server Data Mining (Level 200) Thu, 08 Nov 2007 MSDN Webcast: Building Adaptive Applications with SQL Server Data Mining (Level 300) Mon, 19 Nov 2007 MSDN Webcast: Extending and Customizing SQL Server Data Mining (Level 300) Fri, 30 Nov 2007 MSDN Webcast: Creating Visualizations for SQL Server Data Mining (Level 300) Thu, 01 Nov 2007 TechNet Webcast: Deliver Actionable Insight Throughout Your Organization with Data Mining (Part 1 of 3): Your First Project with SQL Server Data Mining (Level 200) Thu, 15 Nov 2007 TechNet Webcast: Deliver Actionable Insight Throughout Your Organization with Data Mining (Part 2 of 3): Understand SQL Server Data Mining Add-ins for the 2007 Office System (Level 200) Thu, 29 Nov 2007 TechNet Webcast: Deliver Actionable Insight Throughout Your Organization with Data Mining (Part 3 of 3): Use Predictive Intelligence to Create Smarter KPIs (Level 200)
BI Resources from Lynn Langit 1. “Foundations of SQL Server 2005 Business Intelligence” (published by APress in April 2007) 2.   http://blogs.msdn.com/SoCalDevGal 3.   “Building Business Intelligence Solutions with SQL Server 2008”  (MSPress Fall 2008)
DM Resources Technical Communities, Webcasts, Blogs, Chats & User Groups http://www.microsoft.com/communities/default.mspx   Microsoft Developer Network (MSDN) & TechNet  http://microsoft.com/msdn   http://microsoft.com/technet   Trial Software and Virtual Labs http://www.microsoft.com/technet/downloads/trials/default.mspx   Microsoft Learning and Certification http://www.microsoft.com/learning/default.mspx   SQL Server Data Mining http://www.sqlserverdatamining.com http://www.microsoft.com/bi/bicapabilities/data-mining.aspx

More Related Content

PPT
SQL Server 2008 Data Mining
PPT
SQL Server 2008 Data Mining
PPT
Data Mining 2008
PPTX
3 App Compat Win7
PPT
It ready dw_day3_rev00
PDF
Taming the shrew Power BI
PPT
Data Mining for Developers
PDF
Microsoft SQL Server Analysis Services (SSAS) - A Practical Introduction
SQL Server 2008 Data Mining
SQL Server 2008 Data Mining
Data Mining 2008
3 App Compat Win7
It ready dw_day3_rev00
Taming the shrew Power BI
Data Mining for Developers
Microsoft SQL Server Analysis Services (SSAS) - A Practical Introduction

What's hot (20)

PPTX
Azure Cosmos DB + Gremlin API in Action
PPTX
Building your first Analysis Services Tabular BI Semantic model with SQL Serv...
PPT
SQL Server 2008 for Developers
PPTX
Microsoft SQL Server 2012
PDF
Azure analysis services next step to bi in the cloud
PPTX
SSAS Tabular model importance and uses
PPTX
SSAS and MDX
PDF
Azure - Data Platform
PDF
Data Platform Overview
PPTX
SQL Server 2019 Master Data Service
PPTX
Microsoft Database Options
PDF
Creating a Tabular Model Using SQL Server 2012 Analysis Services
PPTX
SQL Azure
PDF
SQL Server 2008 Highlights
PPTX
Sql Saturday Costa Rica-SSAS Tabular Model
PDF
J1 T1 4 - Azure Data Factory vs SSIS - Regis Baccaro
PPTX
Getting power bi
PPT
A Primer To Sybase Iq Development July 13
PPTX
Developing ssas cube
PDF
Azure Cloud Dev Camp - Introduction
Azure Cosmos DB + Gremlin API in Action
Building your first Analysis Services Tabular BI Semantic model with SQL Serv...
SQL Server 2008 for Developers
Microsoft SQL Server 2012
Azure analysis services next step to bi in the cloud
SSAS Tabular model importance and uses
SSAS and MDX
Azure - Data Platform
Data Platform Overview
SQL Server 2019 Master Data Service
Microsoft Database Options
Creating a Tabular Model Using SQL Server 2012 Analysis Services
SQL Azure
SQL Server 2008 Highlights
Sql Saturday Costa Rica-SSAS Tabular Model
J1 T1 4 - Azure Data Factory vs SSIS - Regis Baccaro
Getting power bi
A Primer To Sybase Iq Development July 13
Developing ssas cube
Azure Cloud Dev Camp - Introduction
Ad

Viewers also liked (8)

PDF
Chengdulife
PDF
Bangkok Serviced Apartment Market Report Q3 2011 w
PDF
Vientiane Laos Property Spotlight 2011
PDF
Yangon - Myanmar Property Market Report H2 2011
PDF
Colliers Bangkok Office Report Q2 2011 (Half Year)
PDF
Q3 2013 Colliers Asia Pacific Office Rents Map
PDF
Project valuation of sansiri Marketing
PDF
150404 Condo Rental Analysis
Chengdulife
Bangkok Serviced Apartment Market Report Q3 2011 w
Vientiane Laos Property Spotlight 2011
Yangon - Myanmar Property Market Report H2 2011
Colliers Bangkok Office Report Q2 2011 (Half Year)
Q3 2013 Colliers Asia Pacific Office Rents Map
Project valuation of sansiri Marketing
150404 Condo Rental Analysis
Ad

Similar to BI 2008 Simple (20)

PPT
SQL Server 2008 Data Mining
PPTX
SQL Server: Data Mining
PPTX
MS Sql Server: Datamining Introduction
PPTX
Introduction To Sql Server Data Mining
PPT
BI2008newFeatures
PDF
24 Hours of PASS -- Enterprise Data Mining with SQL Server
PPTX
Mine craft:
PPT
BI in SQL Server 2008 for Architects
PPTX
Data mining tools for excel and sql server
PPT
AnalysisServices
PDF
SQL Saturday 109 -- Enterprise Data Mining with SQL Server
PDF
SQL Saturday 108 -- Enterprise Data Mining with SQL Server
PDF
SQL Saturday 86 -- Enterprise Data Mining with SQL Server
PDF
SQL Saturday 119 Chicago -- Enterprise Data Mining with SQL Server
PPT
Bi2008 Plus Cloud Preview
PPTX
Data Mining With SQL Server
PPTX
Data Mining with SQL Server 2008
PPTX
Minería de Datos en Sql Server 2008
PPTX
MS SQL SERVER: Using the data mining tools
PPTX
MS SQL SERVER: Using the data mining tools
SQL Server 2008 Data Mining
SQL Server: Data Mining
MS Sql Server: Datamining Introduction
Introduction To Sql Server Data Mining
BI2008newFeatures
24 Hours of PASS -- Enterprise Data Mining with SQL Server
Mine craft:
BI in SQL Server 2008 for Architects
Data mining tools for excel and sql server
AnalysisServices
SQL Saturday 109 -- Enterprise Data Mining with SQL Server
SQL Saturday 108 -- Enterprise Data Mining with SQL Server
SQL Saturday 86 -- Enterprise Data Mining with SQL Server
SQL Saturday 119 Chicago -- Enterprise Data Mining with SQL Server
Bi2008 Plus Cloud Preview
Data Mining With SQL Server
Data Mining with SQL Server 2008
Minería de Datos en Sql Server 2008
MS SQL SERVER: Using the data mining tools
MS SQL SERVER: Using the data mining tools

More from llangit (20)

PPTX
2 Win7 For Devs Ux Touch Sensors
PPTX
1 Win7 For Devs Fund Search
PPTX
3 Kodu
PPTX
5 Digigirlz Xna
PPTX
4 Making Movies
PPTX
2 Digi Girlz Small Basic
PPTX
1 Digi Girlz So Cal Databases Kims Final
PPT
SQL Server 2008 for Developers
PPT
SQL Server 2008 for .NET Developers
PPT
Tech Days09 Sqldev
PPT
Windows Azure for .NET Developers
PPT
Introduction To Sql Services
PPT
Making of GirlGamer
PPT
Kodu
PPT
DigiGirlz_SoCal_Databases
PPT
DigiGirlzSmallBasic
PPT
The Role Of An Architect
PPT
SQL Server 2008 for Developers
PPT
Parallel Programming and F#
PPT
Net35 Overview
2 Win7 For Devs Ux Touch Sensors
1 Win7 For Devs Fund Search
3 Kodu
5 Digigirlz Xna
4 Making Movies
2 Digi Girlz Small Basic
1 Digi Girlz So Cal Databases Kims Final
SQL Server 2008 for Developers
SQL Server 2008 for .NET Developers
Tech Days09 Sqldev
Windows Azure for .NET Developers
Introduction To Sql Services
Making of GirlGamer
Kodu
DigiGirlz_SoCal_Databases
DigiGirlzSmallBasic
The Role Of An Architect
SQL Server 2008 for Developers
Parallel Programming and F#
Net35 Overview

Recently uploaded (20)

PDF
20250805_A. Stotz All Weather Strategy - Performance review July 2025.pdf
PPT
Chapter four Project-Preparation material
PDF
How to Get Funding for Your Trucking Business
PDF
Unit 1 Cost Accounting - Cost sheet
PDF
MSPs in 10 Words - Created by US MSP Network
PDF
Ôn tập tiếng anh trong kinh doanh nâng cao
DOCX
Business Management - unit 1 and 2
PDF
kom-180-proposal-for-a-directive-amending-directive-2014-45-eu-and-directive-...
PDF
Types of control:Qualitative vs Quantitative
PDF
DOC-20250806-WA0002._20250806_112011_0000.pdf
PDF
BsN 7th Sem Course GridNNNNNNNN CCN.pdf
PDF
pdfcoffee.com-opt-b1plus-sb-answers.pdfvi
PPTX
Belch_12e_PPT_Ch18_Accessible_university.pptx
PDF
Business model innovation report 2022.pdf
PPTX
Business Ethics - An introduction and its overview.pptx
PDF
Elevate Cleaning Efficiency Using Tallfly Hair Remover Roller Factory Expertise
PPTX
New Microsoft PowerPoint Presentation - Copy.pptx
PPTX
5 Stages of group development guide.pptx
PDF
Training And Development of Employee .pdf
PDF
Power and position in leadershipDOC-20250808-WA0011..pdf
20250805_A. Stotz All Weather Strategy - Performance review July 2025.pdf
Chapter four Project-Preparation material
How to Get Funding for Your Trucking Business
Unit 1 Cost Accounting - Cost sheet
MSPs in 10 Words - Created by US MSP Network
Ôn tập tiếng anh trong kinh doanh nâng cao
Business Management - unit 1 and 2
kom-180-proposal-for-a-directive-amending-directive-2014-45-eu-and-directive-...
Types of control:Qualitative vs Quantitative
DOC-20250806-WA0002._20250806_112011_0000.pdf
BsN 7th Sem Course GridNNNNNNNN CCN.pdf
pdfcoffee.com-opt-b1plus-sb-answers.pdfvi
Belch_12e_PPT_Ch18_Accessible_university.pptx
Business model innovation report 2022.pdf
Business Ethics - An introduction and its overview.pptx
Elevate Cleaning Efficiency Using Tallfly Hair Remover Roller Factory Expertise
New Microsoft PowerPoint Presentation - Copy.pptx
5 Stages of group development guide.pptx
Training And Development of Employee .pdf
Power and position in leadershipDOC-20250808-WA0011..pdf

BI 2008 Simple

  • 1. Business Intelligence Lynn Langit/MSDN Developer Evangelist Southern California http://blogs.msdn.com/SoCalDevGal
  • 2. Session Prerequisites – Session One Working SQL Server 2005 Developer –OR- Working .Net developers
  • 3. Session One Objectives and Agenda Understand what, why, when & how of SQL Server 2008 Business Intelligence Examine the core functionality SSAS OLAP cubes and Data Mining Models
  • 4. What and Why BI? Presentation Exploration Discovery Passive Interactive Proactive Role of Software Business Insight Predictive Analytics Canned reporting Ad-hoc reporting OLAP Data mining
  • 5. SQL Server 2008 BI & Tools OLTP – SQL Server Engine SSMS / Profiler and other mgmt tools Reporting – SSRS No need for IIS, BIDS / Report Manager to design Integrates with SharePoint ETL – SSIS Part of SQL Server, BIDS to design OLAP – SSAS Multidimensional Cubes, BIDS / SSMS Data Mining – SSAS Algorithm-based models – BIDS / Excel / SSMS
  • 6. Why BI? Faster reports OLAP can be 1,000% faster Flexible click to query using pivot tables, add calculated members, create custom views Proactive ‘ discover’ patterns in data, ‘predict’ future Reduce load on OLTP source systems Scalable no manual index tuning, data de-normalization
  • 7. SQL Server 2008 Languages OLTP – SQL Server Engine T-SQL, .NET (CLR), XML Reporting – SSRS RDL + queries ETL – SSIS XMLA metadata + queries, .NET extendable OLAP – SSAS MDX, XMLA Data Mining – SSAS DMX, XMLA, PMML
  • 8. Cubes vs. Data Mining
  • 9. Where do I start? Understand OLAP modeling Star schema + grain statements Review AdventureWorks DW sample From www.CodePlex.com Realistically access source data quality Plan for ETL, learn SSIS Leverage Excel Light-weight data mining designer and client OLAP cube pivot table client
  • 10. BI baked into the MS platform Enterprise – Performance Point Individual – Excel Project – Share Point
  • 11. But, I want to build my own app… Don’t under-estimate the learning curve Embed extended controls Consider buying extendable components
  • 12. Microsoft’s Predictive Analytics Data Mining SQL extensions (DMX) Application Developer Data Mining Specialist Microsoft Dynamics CRM Analytics Foundation SQL Server 2005 Business Intelligence Development Studio Microsoft SQL Server 2008 Analysis Services Information Worker Data Mining Add-ins for the 2007 Microsoft Office system Microsoft SQL Server 2008 Data Mining BI Analyst Custom Algorithms
  • 13. Demo 1 – SSAS Cubes
  • 14. Data Mining Add-ins for Office 2007 Table Analysis Tools for Excel 2007 Data Mining Template for Visio 2007 Data Mining Client for Excel 2007 Information Worker BI Analyst Data Mining Specialist
  • 15. Demo – Data Mining
  • 16. Microsoft Data Mining Lifecycle CRISP-DM SSAS (Data Mining) Excel SSAS (DSV) Query Excel SSIS SSAS SSRS Excel Your Apps SSIS SSAS Excel Data www.crisp-dm.org Business Understanding Data Understanding Data Preparation Modeling Evaluation Deployment
  • 17. DM - From Scenarios to Tasks
  • 18. From Tasks to Techniques
  • 19. Understand & Prepare specifics
  • 21. New to SQL Server 2008 Microsoft Time Series algorithm improved ARIMA plus ARTxp method, and a blending algorithm = better results New prediction mode allows adding new data to time series models Holdout Support added Easily partition data into training and test sets that are stored in mining structure & available to query after processing Ability to build mining models based on filtered subsets added Results in less structures, i.e. can just filter existing Drillthrough functionality extended makes all mining structure columns available, not just columns included in the model allows you to build more compact models Cross-validation added allows users to quickly validate their modeling approach by automatically building temporary models and evaluating accuracy measures across K folds. The feature is available through a new cross-validation tab under Accuracy Charts in BIDS, in addition to being accessible programmatically via a stored procedure call.
  • 22. Summary Data Mining in SQL Server 2008 is mature, powerful and accessible Can use Excel 2007 Familiar client for BI – OLAP cubes AND Data Mining models Model Creators / Users Excel Data or Server Data SSAS and Excel both support the full DM Cycle Data Understanding Data Preparation Modeling Validation Deployment
  • 23. DM Webcasts Fri, 02 Nov 2007 MSDN Webcast: Build Smart Web Applications with SQL Server Data Mining (Level 200) Thu, 08 Nov 2007 MSDN Webcast: Building Adaptive Applications with SQL Server Data Mining (Level 300) Mon, 19 Nov 2007 MSDN Webcast: Extending and Customizing SQL Server Data Mining (Level 300) Fri, 30 Nov 2007 MSDN Webcast: Creating Visualizations for SQL Server Data Mining (Level 300) Thu, 01 Nov 2007 TechNet Webcast: Deliver Actionable Insight Throughout Your Organization with Data Mining (Part 1 of 3): Your First Project with SQL Server Data Mining (Level 200) Thu, 15 Nov 2007 TechNet Webcast: Deliver Actionable Insight Throughout Your Organization with Data Mining (Part 2 of 3): Understand SQL Server Data Mining Add-ins for the 2007 Office System (Level 200) Thu, 29 Nov 2007 TechNet Webcast: Deliver Actionable Insight Throughout Your Organization with Data Mining (Part 3 of 3): Use Predictive Intelligence to Create Smarter KPIs (Level 200)
  • 24. BI Resources from Lynn Langit 1. “Foundations of SQL Server 2005 Business Intelligence” (published by APress in April 2007) 2. http://blogs.msdn.com/SoCalDevGal 3. “Building Business Intelligence Solutions with SQL Server 2008” (MSPress Fall 2008)
  • 25. DM Resources Technical Communities, Webcasts, Blogs, Chats & User Groups http://www.microsoft.com/communities/default.mspx Microsoft Developer Network (MSDN) & TechNet http://microsoft.com/msdn http://microsoft.com/technet Trial Software and Virtual Labs http://www.microsoft.com/technet/downloads/trials/default.mspx Microsoft Learning and Certification http://www.microsoft.com/learning/default.mspx SQL Server Data Mining http://www.sqlserverdatamining.com http://www.microsoft.com/bi/bicapabilities/data-mining.aspx
  • 26. END – END - END End of first set
  • 27. Business Intelligence Lynn Langit/MSDN Developer Evangelist Southern California http://blogs.msdn.com/SoCalDevGal
  • 28. Session Prerequisites – Session Two Working SQL Server 2005 Developer Understanding of OLAP concepts Working SQL Server Analysis Server 2005 Developer Interest in or basic knowledge of Data Mining concepts
  • 29. Session Two Objectives and Agenda Understand what’s new SQL Server 2008 Business Intelligence SSAS OLAP cubes SSAS Data Mining Structures
  • 30. Demo – Simplified Cube / Dim Wizards
  • 31. Demo – New Aggregation Designer
  • 32. Data Mining Are you using it now?
  • 33. Data Mining – Logical Model algorithm Mining Model Mining Model Training Data DB data Client data Application data Data Mining Engine To Predict Predicted Data Mining Model DB data Client data Application data “ Just one row ” Data Mining Engine
  • 35. Data Mining - Physical Model Analysis Services Server Mining Model Data Mining Algorithm Data Source Your Application OLE DB/ ADOMD/ XMLA Deploy BI Dev Studio (Visual Studio) App Data
  • 36. Data Mining Interfaces – APIs XMLA Over TCP/IP XMLA Over HTTP Analysis Server (msmdsrv.exe) OLAP Data Mining Server ADOMD.NET .Net Stored Procedures Microsoft Algorithms Third Party Algorithms OLEDB for OLAP/DM ADO/DSO Any Platform, Any Device C++ App VB App .Net App AMO Any App ADOMD.NET WAN DM Interfaces
  • 37. Configuration Model Creation/Management Database Administrators Session Mining Models Model Application Permissions on models Permissions on data sources
  • 38. Deployment Browse Copy to Excel Drillthrough Query Default Advanced Excel Services Manage models and structures Export/Import Rename Connection Database Trace
  • 39. Excel Functions* DMPREDICTTABLEROW ( Connection, ModelName, PredictionResult, TableRowRange [, string CommaSeparatedColumnNames] ) DMPREDICT ( Connection, Model, PredictionResult, Value1, Name1, [...,Value32, Name32] ) DMCONTENTQUERY (Connection, Model, PredictionResult [, WhereClause])
  • 40. Data Mining Extensions (DMX) CREATE MINING MODEL CreditRisk (CustID LONG KEY, Gender TEXT DISCRETE, Income LONG CONTINUOUS, Profession TEXT DISCRETE, Risk TEXT DISCRETE PREDICT) USING Microsoft_Decision_Trees INSERT INTO CreditRisk (CustId, Gender, Income, Profession, Risk) Select CustomerID, Gender, Income, Profession,Risk From Customers Select NewCustomers.CustomerID, CreditRisk.Risk, PredictProbability(CreditRisk.Risk) FROM CreditRisk PREDICTION JOIN NewCustomers ON CreditRisk.Gender=NewCustomer.Gender AND CreditRisk.Income=NewCustomer.Income AND CreditRisk.Profession=NewCustomer.Profession
  • 41. DMX Column Expressions Predictable Columns Source Data Columns Functions - Predict “ Workhorse” Discrete scalar values Continuous scalar values Associative nested tables Sequence nested tables Time Series Overloaded to PredictAssociation PredictSequence PredictTimeSeries PredictProbability PredictSupport PredictHistogram Cluster ClusterProbability GetNodeId IsInNode Arithmetic operators Stored Procedure Subselect Select from nested tables
  • 42. Data Mining Interfaces – XMLA ++ XMLA Over TCP/IP XMLA Over HTTP Analysis Server (msmdsrv.exe) OLAP Data Mining Server ADOMD.NET .Net Stored Procedures Microsoft Algorithms Third Party Algorithms OLEDB for OLAP/DM ADO/DSO Any Platform, Any Device C++ App VB App .Net App AMO Any App ADOMD.NET WAN DM Interfaces
  • 43. New to SQL Server 2008 Microsoft Time Series algorithm improved ARIMA plus ARTxp method, and a blending algorithm = better results New prediction mode allows adding new data to time series models Holdout Support added Easily partition data into training and test sets that are stored in mining structure & available to query after processing Ability to build mining models based on filtered subsets added Results in less structures, i.e. can just filter existing Drillthrough functionality extended makes all mining structure columns available, not just columns included in the model allows you to build more compact models Cross-validation added allows users to quickly validate their modeling approach by automatically building temporary models and evaluating accuracy measures across K folds. The feature is available through a new cross-validation tab under Accuracy Charts in BIDS, in addition to being accessible programmatically via a stored procedure call.
  • 44. Summary Data Mining in SQL Server 2008 is mature, powerful and accessible Can use Excel 2007 Familiar client for BI – OLAP cubes AND Data Mining models Model Creators / Users Excel Data or Server Data SSAS and Excel both support the full DM Cycle Data Understanding Data Preparation Modeling Validation Deployment
  • 45. DM Webcasts Fri, 02 Nov 2007 MSDN Webcast: Build Smart Web Applications with SQL Server Data Mining (Level 200) Thu, 08 Nov 2007 MSDN Webcast: Building Adaptive Applications with SQL Server Data Mining (Level 300) Mon, 19 Nov 2007 MSDN Webcast: Extending and Customizing SQL Server Data Mining (Level 300) Fri, 30 Nov 2007 MSDN Webcast: Creating Visualizations for SQL Server Data Mining (Level 300) Thu, 01 Nov 2007 TechNet Webcast: Deliver Actionable Insight Throughout Your Organization with Data Mining (Part 1 of 3): Your First Project with SQL Server Data Mining (Level 200) Thu, 15 Nov 2007 TechNet Webcast: Deliver Actionable Insight Throughout Your Organization with Data Mining (Part 2 of 3): Understand SQL Server Data Mining Add-ins for the 2007 Office System (Level 200) Thu, 29 Nov 2007 TechNet Webcast: Deliver Actionable Insight Throughout Your Organization with Data Mining (Part 3 of 3): Use Predictive Intelligence to Create Smarter KPIs (Level 200)
  • 46. BI Resources from Lynn Langit 1. “Foundations of SQL Server 2005 Business Intelligence” (published by APress in April 2007) 2. http://blogs.msdn.com/SoCalDevGal 3. “Building Business Intelligence Solutions with SQL Server 2008” (MSPress Fall 2008)
  • 47. DM Resources Technical Communities, Webcasts, Blogs, Chats & User Groups http://www.microsoft.com/communities/default.mspx Microsoft Developer Network (MSDN) & TechNet http://microsoft.com/msdn http://microsoft.com/technet Trial Software and Virtual Labs http://www.microsoft.com/technet/downloads/trials/default.mspx Microsoft Learning and Certification http://www.microsoft.com/learning/default.mspx SQL Server Data Mining http://www.sqlserverdatamining.com http://www.microsoft.com/bi/bicapabilities/data-mining.aspx