This course provides the fundamentals of using IBM SPSS Modeler and introduces the participant to data science. The principles and practice of data science are illustrated using the CRISP-DM methodology. The course structure follows the stages of a typical data mining project, from collecting data, to data exploration, data transformation, and modeling to effective interpretation of the results. The course provides training in the basics of how to read, prepare, and explore data with IBM SPSS Modeler, and introduces the student to modeling.

**Course Length:** 2 day(s)

**Course Price:** $1670 CAD

**Available Course Formats:**

- In-class
- Instructor Led Online
- Self-Paced Virtual Classroom

**1.** Introduction to data science

- List two applications of data science
- Explain the stages in the CRISP-DM methodology
- Describe the skills needed for data science

**2.** Introduction to IBM SPSS Modeler

- Describe IBM SPSS Modeler's user-interface
- Work with nodes and streams
- Generate nodes from output
- Use SuperNodes
- Execute streams
- Open and save streams
- Use Help

**3.** Introduction to data science using IBM SPSS Modeler

- Explain the basic framework of a data-science project
- Build a model
- Deploy a model

**4.** Collecting initial data

- Explain the concepts "data structure", "of analysis", "field storage" and "field measurement level"
- Import Microsoft Excel files
- Import IBM SPSS Statistics files
- Import text files
- Import from databases
- Export data to various formats

**5.** Understanding the data

- Audit the data
- Check for invalid values
- Take action for invalid values
- Define blanks

**6.** Setting the of analysis

- Remove duplicate records
- Aggregate records
- Expand a categorical field into a series of flag fields
- Transpose data

**7.** Integrating data

- Append records from multiple datasets
- Merge fields from multiple datasets
- Sample records

**8.** Deriving and reclassifying fields

- Use the Control Language for Expression Manipulation (CLEM)
- Derive new fields
- Reclassify field values

**9.** Identifying relationships

- Examine the relationship between two categorical fields
- Examine the relationship between a categorical field and a continuous field
- Examine the relationship between two continuous fields
- 10. Introduction to modeling
- List three types of models
- Use a supervised model
- Use a segmentation model

Audience

Anyone who wants to become familiar with IBM SPSS Modeler

Prerequisites

General computer literacy.

#### Instructor Led In Classroom

Newcomp can directly deliver IBM Business Analytics courses for Business Intelligence, Performance Management, and IBM Advanced Analytics through the use of in-class training facilities.

Currently, in-class courses are offered in Markham, Ottawa, Vancouver, Halifax, and Edmonton. Please note that classes can be added to new areas based on demand.

#### Instructor Led Online

Students receive the same quality as an in-class course, with a live instructor and the ability to participate in hands-on labs through real-life examples

ILOs help cut costs by reducing time and travel as they can be taken from home or the office and require only the use of a computer, high-speed wired internet and a headset.

#### Self Paced

Students can receive the same high-quality training, with the same courseware at their own speed and schedule with SPVC. Individuals with busy schedules can complete a course over a 30-day timeframe at a lower price than in-class or ILO courses.* Please note that there is no live interaction with an instructor in this format.*