ABOUT THIS COURSE
This third-level offering takes the tactical and methodological presentation of Data Mining: Level II and puts the material into action through team-driven, live data mining exercises. Comparative review sessions then reveal real-world obstacles, breakthroughs and results from which to interpret, learn and apply.
Data Mining: Level III is a hands-on application workshop, applying data mining methods and techniques presented in Data Mining: Level II to real-world data. Although the workshop may be attended exclusively, registrants should have experience with the breadth of material covered in the Level II offering.
Throughout the workshop, the CRISP-DM model will be used to guide participants through the steps of the data mining process, and the attendees themselves will complete the entire data mining process during the workshop by solving simple data mining problems through a staged progression.
In the morning, participants will begin with a database containing multiple tables of information. Participants will each have a networked computer and may choose to work on exercises in pairs or individually. Attendees will determine which business questions will be considered, how they will be addressed using data mining, and how the data will be prepared for data mining. A divide-and-conquer approach will be used to carry out the data understanding and data preprocessing steps as participants work alone or in pairs to prepare the data for data mining.
In the afternoon, regression, decision tree, and neural network models will be created, and performance assessed. Participants may optimize these models by using advanced algorithm options, and report model performance on held-out data and summaries of key variables used in the models. Data preprocessing will be re-applied if models do not meet performance requirements. The team will determine which model best addresses the business question, and score the model on validation data.
Throughout the day, emphasis will be placed not only on the data mining process from a technical perspective, but also how to interpret, explain and apply results that have been discovered during the process.
WHO SHOULD ATTEND
LEVEL II COURSE PARTICIPANTS with an interest in applying the methods and techniques first-hand as presented and illustrated in the course
DATA MINING PRACTITIONERS who wish to expand their skills and analytical toolbox as well as hone proficiencies in maneuvering elusive data mining obstacles that stand in the way of superior model accuracy
BUSINESS ANALYSTS who must develop and interpret models, communicate the results and make actionable recommendations
FUNCTIONAL ANALYSTS: Customer Relationship Managers, Risk Analysts, Statistical Analysts, Business Forecasters, Inventory Flow Analysts, Direct Marketing Analysts, Medical Diagnostic Analysts, Market Timers, e-commerce System Architects and Web Data Analysts
BENEFITS OF ATTENDING
Driving the Level II presentation material through team exercises
Hands-on experience through the data mining process via a staged progression of exercises using application data
First-hand vendor-neutral exposure to various data mining tools
Real-world perspective of data preparation for data mining, model optimization and results interpretation
Cross-learning through team exercise comparisons to reveal what worked, what didn't, and why?
WHAT MAKES THIS COURSE UNIQUE
Unlike any other application-oriented offering on the market, Data Mining: Level III offers a structured approach to team-oriented data mining exercises in a lab environment. Since The Modeling Agency is not a tools vendor, participants enjoy a balanced, broad, and non-promotional perspective of data mining.
Training Avaliability and Delivery
This is primarily ilt training
This is a workshop seminar
This class may involve group study
This class may be available at a classroom in Pittsburgh, PA,
Team Meeting 1
Introduction
Purpose of Data Mining III: Practice
CRISP-DM
Description of Data Source for Modeling
Business Understanding
Prioritize Questions to be Addressed
Determine Method to Score Results
Assign Data Understanding Responsibilities
Result: List of Prioritized Business Questions and Corresponding Data
Mining Approaches
Breakout Session 1: Data Understanding
Summary Statistics
Visualization
Outlier Analysis
Missing Data Analysis
Create Mini-Report
Team Meeting 2
Data Assessment Summary
Assign Data Preprocessing Responsibilities
Result: Summaries of Available Modeling Data, and Recommendations for
Their Use
Breakout Session 2: Data Preprocessing
Correct Data Problems
Create Features
Create Mini-Report
Team Meeting 3
Data Preprocessing Summary
Join Data Modified During Breakout Sessions
Determine Sampling Strategy
Assign Modeling Responsibilities
Result: Single Modeling Dataset
AFTERNOON
Breakout Session 3: Modeling and Evaluation
Build Decision Trees, Regression, and Neural Networks
Assess Results
Rebuild Models, Changing Modeling Parameters
Breakout Session 4: Model Evaluation and Assessment
Score Models on Testing Data
Rank Variable Importance to Models
Create Mini-Report
Team Meeting 4
Summarize Modeling Results
Assess Which Modeling Techniques Worked and Didn t Work
Determine Needs for More Data Pre-Processing and Modeling
Assign Responsibilities
Breakout Session 5: Re-visit Modeling
Create Final Models
Create Mini-Report
Score Models on Validation Data
Team Meeting 5
Select Final Model to Use (if any)
Explain Reason for Selection
Assess Trade-offs Between Models
Assign List of Action Items Pending
About The Modeling Agency - Training Provider
The Modeling Agency - The Modeling Agency (TMA) is a structured, redundant network of senior-level data mining consultants, associate modelers, application programmers, project managers and technical documentation specialists. The Modeling Agency (TMA) provides data mining guidance and results for those who are data-rich, yet information-poor through a structured network of independent consultants, or agents.
The...