DataScience Minordegree 2023 Syllabus
DataScience Minordegree 2023 Syllabus
DataScience Minordegree 2023 Syllabus
UNIVERSITY DEPARTMENTS
REGULATIONS – 2023
FACULTY OF INFORMATION & COMMUNICATION ENGINEERING
RAMANUJAN COMPUTING CENTRE
MINOR DEGREE ON ‘DATA SCIENCE’
Total Credits 18
CSM507 FOUNDATIONS OF DATASCIENCE L T P C
WITH PYTHON 3 0 0 3
Course Objectives:
To understand fundamentals and the process of data science.
To comprehend different types and representation of data and analyze them.
To apply inferential techniques to extrapolate information from the available
data.
To utilize the Python libraries for Data Wrangling.
To interpret data and present it using visualization libraries in Python.
Unit I INTRODUCTION 9
Data Science: Benefits and uses – facets of data – Data Science Process: Overview –
Defining research goals – Retrieving data – Data preparation – Exploratory Data analysis
– build the model – presenting findings and building applications
COURSE OUTCOMES:
References
2. David Evans, Vladimir Kolesnikov and Mike Rosulek, A Pragmatic Introduction to Secure
Multi-Party Computation, NOW Publishers, 2022 (Free access at
https://securecomputation.org/)
3. William Stallings, Cryptography and Network Security - Principles and Practice, Seventh
Edition, Pearson, 2017
Describe the big data and use cases from selected business domains.
Explain NoSQL big data management.
Install, configure and run Hadoop and HDFS.
Perform map-reduce analytics using Hadoop.
Use Hadoop-related tools such as HBase, Cassandra, Pig and Hive for big data
analytics.
References
1. Michael Minelli, Michael Chambers, and AmbigDhiraj, “Big Data, Big Analytics:
Emerging Business Intelligence and Analytic Trends for Today’s Businesses”,
Wiley, 2013.
2. Eric Sammer, “Hadoop Operations”, O’Reilley, 2012.
3. Sadalage, Pramod J. “NoSQL distilled”, 2013
4. E. Capriolo, D. Wrampler, and J. Rutherglen, “Programming Hive”, O’Reilley,
2012.
5. Lars George, “HBase: The Definitive Guide: O’Reilley, 2011.
6. Eben Hewitt, “Cassandra: The Definitive Guide: O’Reilley, 2010.
7. Alan Gates, “Programming Pig”, O’Reilley, 2011.
EXPLORATORY DATA ANALYSIS L T P C Credits
3 0 0 3 3
Course Objectives:
To outline an overview of exploratory data analysis.
To implement data visualization using Matplotlib.
To perform univariate data exploration and analysis.
To apply bivariate data exploration and analysis.
To use Data exploration and visualization techniques for multivariate and time
series data.
Unit I EXPLORATORY DATA ANALYSIS 9
EDA fundamentals – Understanding data science - Significance of EDA – Making sense
of data – Comparing EDA with classical and Bayesian analysis – Software tools for EDA
– Visual Aids for EDA – Data transformation techniques-merging database, reshaping
and pivoting, Transformation techniques.
CO5: Use Data exploration and visualization techniques for multivariate and time series
data.
Total: 60 Periods
REFERENCES:
1 Suresh Kumar Mukhiya, Usman Ahmed, “Hands-On Exploratory Data Analysis
with Python”, Packt Publishing, 2020. (Unit 1)
2 Jake Vander Plas, “Python Data Science Handbook: Essential Tools for Working
with Data”. First Edition, O Reilly, 2017. (Unit 2)
3 Catherine Mars, Jane Elliott, “Exploring Data: An Introduction to Data Analysis for
Social Scientists”, Wiley Publications, 2nd Edition, 2008. (Unit 3,4,5)
4 Eric Pimpler, Data Visualization and Exploration with R, GeoSpatial Training
service, 2017.
5 Claus O. Wilke, “Fundamentals of Data Visualization”, O’reilly Publications, 2019.
6 Matthew O. Ward, Georges Grinstein, Daniel Keim, “Interactive Data
Visualization: Foundations, Techniques, and Applications”, 2 nd Edition, CRC
press, 2015.