Data Scientist Masters - V9
Data Scientist Masters - V9
Data Scientist Masters - V9
Masters
Table of Contents
Key Features of
04 Data Scientist Masters Program
05 Learning Path
06 Step 1 : R Programming
22 Step 6 : Tableau
26 Electives
2 | www.simplilearn.com
About the Course
The Data Scientist Masters program has been
designed to introduce you to the world of
analytics and elevate your skills to ultimately
become a Data Scientist. As a Data Scientist,
you must be able to work with multiple data
formats, have knowledge of the algorithms that
can help you extract useful data, master data
mining, data management and data exploration.
If you are pursuing a career in Data Science, this
is the program for you.
3 | www.simplilearn.com
The Data Scientist Masters
trains you along an industry
recommended learning path to
succeed in the field of
Data Science
Key Features
Industry Recommended Learning path
4 | www.simplilearn.com
Learning Path
R Programming
Data Science
Certification Training Data Science
with Python
Data Science and
Analytics Language
SAS
Data Science and Analytics -
Industry Leader
Tableau
Building visualization,
organizing data, and designing Optional Electives
dashboards using Tableau
> Certified SAS Base Programmer
> Power Bi
DATA SCIENTIST
5 | www.simplilearn.com
STEP 1 2 3 4 5 6 7
Learn the concepts of Data Science R
R
6 | www.simplilearn.com
Course curriculum
Introduction to Business Analytics: Know the need of Business
Analytics, Business Decisions, Features and Types of Business
Analytics, Descriptive, Predictive, Supply Chain, Health Care,
Marketing, Human Resource, Web Analytics, Applications of Business
Analytics, Big Data, Analytical Tools
7 | www.simplilearn.com
Project 1:
Healthcare: Predictive analytics can be used in healthcare to mediate
hospital readmissions. In healthcare and other industries, predictors are
most useful when they can be transferred into action. But historical
and real-time data alone are worthless without intervention. More
importantly, to judge the ecacy and value of forecasting a trend and
ultimately changing behavior, both the predictor and the intervention
must be integrated back into the same system and workflow where the
trend originally occurred.
Project 2:
Insurance: Use of predictive analytics has increased greatly in
insurance businesses, especially for the biggest companies, according to
the 2013 Insurance Predictive Modeling Survey. While the survey showed
an increase in predictive modeling throughout the industry, all respon-
dents from companies that write over $1 billion in personal insurance
employ predictive modeling, compared to 69% of companies with less
than that amount of premium.
Project 3:
Retail: Analytics is used in optimizing product placements on shelves or
optimization of inventory to be kept in the warehouses using industry
examples. Through this project, participants learn the daily cycle of
product optimization from the shelves to the warehouse. This gives them
insights into regular occurrences in the retail sector
Project 4:
Internet: Internet analytics is the collection, modeling and analysis of
user data in large-scale online services such as social networking, e-com-
merce, search and advertisement. In this class, we explore a number of
key functions of such online services that have become ubiquitous over
the last couple of years. Specifically, we look at social and information
networks,recommender systems, clustering and community detection,
dimensionality reduction, stream computing and online ad auctions.
8 | www.simplilearn.com
Project 5:
Education: An education department in the US needs to analyze the
factors that influence the admission of a student into a college. Analyze
the historical data and determine the key drivers.
Project 6:
E-commerce: A UK-based online retail store has captured the sales
data for different products for the period of one year (Nov 2016 to Dec
2017). The organization sells gifts primarily on the online platform. The
customers who make a purchase consume directly for themselves. There
are small businesses that buy in bulk and sell to other customers through
the retail outlet channel. Find significant customers for the business who
make high purchases of their favourite products.
9 | www.simplilearn.com
STEP 1 2 3 4 5 6 7
Start your Analytics journey
SAS
Outline data science principles and how SAS can help implement them
Explain the dierent methods used to combine and modify datasets
Explain what PROC SQL is and how it’s used to retrieve data from
tables
Describe how to use the macro function to manipulate the character
strings and text.
List the various statistical procedures and explore the various
testing techniques.
Understand how SAS handles missing values in your datasets
using various procedures.
Explain the ways to create a cluster and to perform cluster
analysis on the dataset.
List the various time series models of SAS.
10 | www.simplilearn.com
Course curriculum
Analytics Overview: This covers Types of Analytics, Areas of Analytics,
Analytical Tools and Techniques
SAS Macros: Know the ned for SAS Macros, Macro Functions, SQL
Clauses for Macros, The % Macro and Conditional Statements
Working with Time series Data: Comprehend the need for Time Series
Analysis and it’s Options, Reading Date and Date time Values, White
Noise Process, Stationarity of a Time Series, Plot Transform Transpose
and Interpolating Time Series Data
11 | www.simplilearn.com
Project 1:
Demand Forecasting for Walmart
Retail: Predict accurate sales for 45 stores of Walmart, one of the US-
based leading retail stores, considering the impact of promotional mark-
down events. Check if macroeconomic factors like CPI, unemployment
rate, etc. have an impact on sales.
Project 2:
Attrition Analysis
Telecommunication: Analyze the employee attrition rate of a leading
BPO company. The dataset is maintained for the attrition analysis, and
it has records of employee id, retain indicator, sex indicator, relocation
indicator, and marital status.
Project 3:
Retail Analysis
Retail: E-commerce: Forecast sales based on independent variables such
as profit, quantity, marketing cost, and expenses using the regression
model.
Project 4:
Data-driven Macro Calls
Internet: Sales: Generate a list of all data sets in SAS which have
sales-related information and pass it on as the macro variable.
Project 5:
Customer Segmentation
Internet: Perform customer segmentation with RFM methodology on an
e-commerce website’s customer data set. Segment customers based on
frequency, recency, and monetary value.
12 | www.simplilearn.com
STEP 1 2 3 4 5 6 7
Data Science with Python
Data Science with Python
13 | www.simplilearn.com
Course curriculum
Data Science Overview: Get introduced to Data Science, dierent
sectors using Data Science, and purpose and components of Python
14 | www.simplilearn.com
Project 1:
NYC 311 Service Request Analysis
Telecommunication: Perform a service request data analysis of New York
City 311 calls. You will focus on data wrangling techniques to understand
patterns in the data and visualize the major complaint types.
Project 2:
MovieLens Dataset Analysis
Engineering: The GroupLens Research Project is a research group in the
Department of Computer Science and Engineering at the University of
Minnesota. The researchers of this group are
involved in several research projects in the fields of information filtering,
collaborative filtering and recommender systems. Here, we ask you to
perform an analysis using the Exploratory Data Analysis technique for
user datasets.
Project 3:
Stock Market Data Analysis
Stock Market: As a part of this project, you will import data using Yahoo
data reader from the following companies: Yahoo, Apple, Amazon,
Microsoft and Google. You will perform fundamental analytics, including
plotting, closing price, plotting stock trade by volume, performing daily
return analysis, and using pair plot to show the correlation between all of
the stocks.
Project 4:
Titanic Dataset Analysis
Hazard: On April 15, 1912, the Titanic sank after colliding with an iceberg,
killing 1502 out of 2224 passengers and crew. This tragedy shocked the
world and led to better safety regulations for ships. Here, we ask you
to perform an analysis using the exploratory data analysis technique, in
particular applying machine learning tools to predict which passengers
survived the tragedy.
15 | www.simplilearn.com
STEP 1 2 3 4 5 6 7
Machine Learning
Machine Learning
16 | www.simplilearn.com
Course curriculum
Introduction to Artificial Intelligence and Machine Learning: Get
introduced to Machine Learning concepts, logarithms, and its
applications
17 | www.simplilearn.com
Project 1:
Build a Predictive Model for Housing Prices
This project involves building a predictive model for determining housing
prices in California using US census data. You will analyze various metrics
such as population, median income, median housing price, and more for
each block group to predict the home prices in any given district.
Project 2:
Build a Phishing Website Detector Using LR Algorithms
The purpose of the project is to build a machine learning model that is
trained to use LR algorithms to detect phishing website datasets..
Project 3:
Build a Phishing Website Detector Using KNN Algorithms
The purpose of the project is to build a machine learning model that is
trained to use KNN algorithms to detect phishing website datasets..
Project 4:
Build an MNIST Classifier
The purpose of the project is to train a model on the MNIST image data-
base to detect images with 5 digits.
18 | www.simplilearn.com
STEP 1 2 3 4 5 6 7
Harness the power of Big Data & Hadoop
Big Data Hadoop and Spark Developer
19 | www.simplilearn.com
Course curriculum
Introduction to Bigdata and Hadoop Ecosystem
Apache Pig: Learn about Pig and how to get datasets for Pig
Development
20 | www.simplilearn.com
Project 1:
Domain-Banking
Description: A Portuguese banking institution ran a marketing campaign
to convince potential customers to invest in a bank term deposit.
Their marketing campaigns were conducted through phone calls,
and sometimes the same customer was contacted more than once.
Your job is to analyze the data collected from the marketing campaign.
Project 2:
Domain-Telecommunication
Description: A mobile phone service provider has launched a new Open
Network campaign. The company has invited users to raise complaints
about the towers in their locality if they face issues with their mobile
network. The company has collected the dataset of users who raised a
complaint. The fourth and the fifth field of the dataset has a latitude and
longitude of users, which is important information for the company.
You must find this latitude and longitude information on the basis of
the available dataset and create three clusters of users with a
k-means algorithm
For additional practice, we have three more projects to help you start
your Hadoop and Spark journey.
Project 3:
Domain-Social Media
Description: As part of a recruiting exercise, a major social media
company asked candidates to analyze a dataset from Stack Exchange.
You will be using the dataset to arrive at certain key insights.
Project 4:
Domain-Website providing movie-related information
Description: IMDB is an online database of movie-related information.
IMDB users rate movies on a scale of 1 to 5 -- 1 being the worst and 5
being the best -- and provide reviews. The dataset also has additional
information, such as the release year of the movie. You are tasked to
analyze the data collected
Project 5:
Domain-Insurance
Description: A US-based insurance provider has decided to launch a new
medical insurance program targeting various customers. To help a
customer understand the market better, you must perform a series of
data analyses using Hadoop
21 | www.simplilearn.com
STEP 1 2 3 4 5 6 7
Tableau
Tableau
22 | www.simplilearn.com
Course curriculum
Getting Started With Tableau: Overview of dierent versions of Tableau
and installation process
Deep diving with Data and Connections: Work with Excel Data
Interpreter and learn how to Split fields, pivot and filter data
Creating Charts: Know about Crosstabs and Heat Maps, Pie Charts,
Line and Area Charts, Packed Bubble, Treemaps, Scatter Plot
23 | www.simplilearn.com
Project 1:
Category Performance Analysis:
This project involves ranking subcategories by performance. Accord-
ing to the Performance Evaluation Program, the Subcategories yielding
consistent profit across last 4 years are awarded as the Best Performing
Subcategories. Help the manager identify the top subcategories based
on the profits and use advanced dashboard features to portray a
complete picture for Subcategory sales.
24 | www.simplilearn.com
STEP 1 2 3 4 5 6 7
Data Science Capstone
Data Science Capstone
Key Features:
Flexibility to choose the domain/industry of your choice
Build on any technology covered within the Master’s program
Dedicated mentoring sessions to ensure high-quality learning
Capstone completion certificate
25 | www.simplilearn.com
Elective Course
Certified SAS Base Programmer
The SAS Base Programmer course in a beginner level course for
a SAS professional. This training has been designed to enable you
to start your analytics career with SAS and prepare for the SAS
Base Programmer certification. This SAS course explores the SAS
tool and dierent techniques to help you access and manage data,
create data structures, generate reports, and handle errors. These
techniques are mandatory for a professional to start working on
the next SAS assignment and forms a strong base for advanced
techniques and certifications.
Python Basics
This course is ideal for you to understand the basics of Python
Programming Language.
Core Java
This Java Certification Course is a one-stop training program
designed to guide you from the beginning concepts of Java to
advanced programming techniques. This Java course requires no
previous coding experience and will provide you with foundational
knowledge of Core Java 8, including the scope of variables,
operators, arrays, loops, methods and constructors, and much more.
26 | www.simplilearn.com
Power Bi
Microsoft Power BI is a suite of tools to analyze your data
and extract business insights from it through building
interactive dashboards. This Power BI Training course will
help you get the most out of Power BI, enabling you to solve
business problems and improve operations.
27 | www.simplilearn.com
Advisory board member
Ronald Van Loon
Big Data Expert, Director Adversitement
Named by Onalytica as one of the 3 most
influential people in Big Data, Ronald is an
author for a number of leading Big Data & Data
Science websites, including Datafloq, Data
Science Central, and The Guardian. He is also a
renowned speaker at industry events.
Mike Tamir
Head of Data Science - Uber ATG
Named by Onalytica as the No.1 influencer in
AI & Machine Learning space, Mike serves as
Head of Data Science for Uber ATG self-driving
engineering team and as UC Berkeley data
science faculty.
Sina Jamshidi
Big Data Lead at Bell Labs
Sina has over 10 years of experience in
Technology as a Big Data Architect at Bell
Labs and as a Platinum level trainer. He
is very passionate about building a Big
Data education ecosystem and has been a
contributor to a number of magazine and
journal publications.
Simon Tavasoli
Analytics Lead at Cancer Care Ontario
Simon is a Data Scientist with 12 years of
experience in Healthcare analytics. He has
a master’s degree in Biostatistics from the
University of Western Ontario. He is passionate
about teaching Data Science, and has a number
of journal publications in preventive medicince
and data analytics.
28 | www.simplilearn.com
Paul Sharkov
Data Scientist at BMO Financial Group, Member of
SAS Canada Community
Paul is a lead SAS Data Scientist at the Bank
of Montreal. As an SAS Certified Predictive
Modeler, SAS Statistical Business Analyst, and
SAS Certified Advanced Programmer, Paul is
passionate about sharing his knowledge on
how Data Science can support data-driven
business decisions.
Alvaro Fuentes
Founder and Data Scientist at Quant Company
Alvaro is a Data Scientist who founded
Quant Company. He has also worked as a
lead Economic Analyst in the Central Bank
of Guatemala. He has a master’s degree
in Quantitative Economics and Applied
Mathematics and is actively involved in
consulting and training in the Data Science
space.
29 | www.simplilearn.com
USA
Simplilearn Americas, Inc.
201 Spear Street, Suite 1100, San Francisco, CA 94105
United States
Phone No: +1-844-532-7688
INDIA
Simplilearn Solutions Pvt Ltd.
# 53/1 C, Manoj Arcade, 24th Main, Harlkunte
2nd Sector, HSR Layout
Bangalore - 560102
Call us at: 1800-212-7688
www.simplilearn.com