ABOUT THIS COURSE
Data mining is essentially a discovery process -- a process riddled with common yet elusive strategic pitfalls. Project failure is rarely due to poor model development. Rather, data mining projects often fall short of their potential due to flawed or overlooked assessment, business understanding, project definition and strategic planning specifically for information discovery.
If you are looking for an intensive vendor-neutral and non-promotional introduction to data mining best practices and an approach to predictive analytics which is critical to modeling success, then this course is designed for you.
Data Mining: Level I offers a concentrated presentation of data mining terminology, capabilities, limitations, risks, rewards, case studies, best practices, standard process and strategy. Those in attendance will be exposed to popular methods of predictive modeling, application examples, live illustrations and resources to get started.
Practitioners seeking to drill down into the tactical implementation of predictive analytics may also attend the Data Mining: Level II offering: an additional two days immediately following this course at the same site.
WHO SHOULD ATTEND
IT/IS EXECUTIVES AND MANAGERS: CIOs, CKOs, CTOs, Stakeholders, Functional Officers, Technical Directors and Project Managers
LINE-OF-BUSINESS EXECUTIVES AND FUNCTIONAL MANAGERS: Risk Managers, Customer Relationship Managers, Business Forecasters, Inventory Flow Analysts, Financial Forecasters, Direct Marketing Analysts, Medical Diagnostic Analysts, eCommerce Company Executives
TECHNOLOGY PLANNERS: Who survey emerging technologies in order to prioritize corporate investment
CONSULTANTS: Whose competitive environment is intensifying and whose success requires competency with data mining and related emerging information technologies
BENEFITS OF ATTENDING
- Make better business decisions based on information hidden in your data
- Develop a strong vocabulary and understanding of data mining terminology
- Communicate with confidence among your developers and consultants
- Plan and manage your data mining projects effectively from the start
- Leave with resources, contacts and actionable plans to substantially reduce your project preparation time, costs and risks
THE BUSINESS CHALLENGE
Traditionally, organizations use data tactically - to manage operations. For competitive edge, leading organizations use data strategically - to expand the business, to improve profitability, to reduce costs, anticipate behavior, and market more effectively. The mining of data for predictive indicators creates information assets that an organization can leverage to achieve these strategic objectives.
Predictive analytics is a new component in an enterprise's decision support system (DSS) architecture. It complements and interlocks with other DSS capabilities such as query and reporting, on-line analytical processing (OLAP), data visualization, and traditional statistical analysis. These other DSS technologies are generally retrospective.
The predictive aspect of data mining may be defined as "the data-driven discovery and modeling of hidden patterns in large volumes of data." Predictive analytics differs from the retrospective technologies above because it produces models -- models that capture and represent hidden patterns and interactions in the data. Via data mining, a user can discover patterns and build models automatically, without knowing exactly what s/he's looking for.
The resulting models are both descriptive and prospective. They address why things happened and what is likely to happen next. A user can pose "what-if" questions to a data-mining model that cannot be queried directly from the database or warehouse. Examples include: "What is the expected lifetime value of every customer account," "Which customers are likely to open a money market account," or "Will this customer cancel our service if we introduce fees?"
WHAT YOU WILL LEARN
- Basic principles and terminology for predictive analytics
- Who is utilizing predictive analytics, and why
- What are common project pitfalls and how to avoid them
- Project deployment, performance and maintenance issues
- How to define business objectives for a discovery process
- How to get started
WHAT MAKES THIS COURSE UNIQUE
This course offers a balanced and non-promotional presentation of data mining topics and its role in enterprise decision support. For over nineteen years, the instructor has been deeply involved with the development and deployment of real-world data mining solutions.
The presentation divided into four sessions of approximately three-hours each. The first session is intended to provide a general overview of predictive analytics. Subsequent sessions address three specific issues critical to success in the application of data mining in business environments. This course does not drill deeply into specific algorithms or technical implementation issues. It is also not a comprehensive presentation of a development methodology as presented in the Level II offering. Rather, the Level I course presents strategic and process issues that are critical in the success of deploying applied models in real world business environments.
Leading products will be used from a vendor-neutral perspective to illustrate and compare methods. Results are drawn from actual data mining applications and interpreted in the context of business impact. Attendees will depart with a binder full of slides, supporting notes and a valuable index of data mining resources.
Training Avaliability and Delivery
This is primarily ilt training
This class may involve group study
This class may be available at a classroom in Pittsburgh, PA,
This session introduces participants to the conceptual foundation of data mining projects. It is intended to give an overview of the types of problems that are appropriate for data mining, offer an approach that is realistic for applied model development in a business environment, and explain why traditional approaches are insufficient.
Why Build Models?
Data Mining: What it is, and What it isn t
Matching Technologies and Problem Types
Belief in Degree of Set Membership vs Right Answers
Why Traditional Statistics is Not Enough
Where Data Mining Works
Defining Goals for Better Performance
SESSION II It s About the Data
Improved performance generally comes from one of two sources: getting more information content from the available data, or getting better data. This session explores sources of data, the strengths and weaknesses of various types of data, and techniques for manipulating data to extract information content.
Types of Data
Sources of Data
Data Errors
Missing Data
Outliers
Normalizing Data
Derived Variables
Variable Reduction
Data Transformation Ideas
SESSION III Conceptual Introduction to Core Modeling Technologies
The third session introduces a number of the advanced technologies commonly used in data mining. Discussion of these technologies in Level I is focused on conceptual understanding, and the strengths and weaknesses of the various methods. A technical drill-down into techniques for the various algorithms is reserved for the Level II offering.
Linear Regression
Logistic Regression
Clustering
Classification Trees
Chaos
Neural Networks
Genetic Algorithms
SESSION IV Making Predictive Analytics Work
The final session addresses the Experimental Design and Project Definition aspects of Data Mining projects. These areas are where most data mining projects fall short of their potential.
Experimental Design
The Data Mining Process
Case Studies
About The Modeling Agency - Training Provider
The Modeling Agency - The Modeling Agency (TMA) is a structured, redundant network of senior-level data mining consultants, associate modelers, application programmers, project managers and technical documentation specialists. The Modeling Agency (TMA) provides data mining guidance and results for those who are data-rich, yet information-poor through a structured network of independent consultants, or agents.
The...