The Data Incubator

Businesses are drowning in data
but starving for insights
Forrester

Practical Machine Learning

Summary

Build useful machine learning models that deliver data-driven insights to help your company make better decisions that can improve revenue, reduce costs, create new opportunities, identify new ideas, improve the customer experience and more.


Associated project work

In this miniproject students will work with a housing data set to develop a model to predict house price based on various features about a house. They will start with a linear regression model using just one feature, then build a model with 2 features and finally use a linear regression model trained on all of the data. The miniproject requires students to be able to use scikit-learn's transformers, predictors and pipelines. To further improve the model, students will also engineer polynomial features and train a ridge model.


Students will develop a model to predict customer churn from various customer features. They will start with a logistic regression model that uses only numerical features, then they will also incorporate categorical features. Students will improve the model by using a radnom forest classifier.


Students will gain experience with implementing unsupervised learning algorithms on real-world data and drawing insightful conclusions. They will use a data set of credit card company customers and first perform principal components analysis. Then students will train a K-Means clustering model and determine the proper number of clusters according to silhouette score. Lastly they will find cluster centroids which correspond to average customers and extract characteristic customer information.


This module is currently part of our Data Science Essentials Course.

Prerequisites
Basic Python
Basic linear algebra
Basic statistics