Data Science Proposal For A Ms Degree
Data Science Proposal For A Ms Degree
Data Science Proposal For A Ms Degree
The Master of Science in Data Science is an interdisciplinary graduate program that combines the teaching of
domain-specific and technical skills for analyzing large data sets. Built upon a core of foundational data
science courses in Computer Science and Mathematics, and a selection of courses from data science
application domains, the program is interdisciplinary in nature. Students interested in data science tools will
be able to focus on tool principles and tool development, and students interest in data science applications
will be able to focus on the application of data science tools with a selection of courses that develops skills in
one of three application areas. The program also provides its students the option of doing an industrial
internship, to acquire professional experience. The program allows the various academic units involved to add
courses in their specific application domains, thus keeping the program updated and relevant to current
practice and industrial needs. The program is both academic and professional in nature, providing courses that
are true to a Master’s level degree, and courses that reflect the needs of the profession.
The curriculum consists of three components: data science tool courses, data science application courses, and
an internship. Students must complete at least 30 credits of graduate level courses to complete the degree.
Credit can be given for prior study. At least 15 credits of courses must be completed at the University of
Miami. Each student’s selection of courses must be approved by a member of the administrative group (to
avoid course overlap, ensure coherence, etc.). Several tracks have been defined to provide focused study
within the general requirements of the degree. Students may choose to follow a track, or may build their own
individualized program in consultation with their advisor.
Students in this program are likely to come from a range of backgrounds, however, they must meet specified
minimum quantitative GRE requirements. We aim to attract students with interests in data mining, artificial
intelligence, visualization, smart cities, media, geographic information systems (GIS), and climate modeling.
The program will be administered by the Center for Computational Science, jointly with the academic units
that offer the courses (Computer Science - College of Arts and Sciences, Electrical and Computer Engineering -
College of Engineering, Mathematics – College of Arts and Science, School of Architecture, Meteorology and
Physical Oceanography - RSMAS, Journalism and Media Management - School of Communication). The
Department of Computer Science will serve as the home department for the degree. A body of industrial
board members will provide professional advice regarding the program content, and will assist in internship
placement.
The University of Miami Library system holdings and online resources are adequate to support this program.
As the courses that comprise the program are already in existence, the various departments have adequate
facilities, equipment, and space for the teaching. The Center for Computational Sciences will provide
additional computing resources to the departments. The program uses existing courses, and courses that the
various academic units are already committed to creating, so that there is no budget required for course
creation.
Revenue distribution: 30% of tuition revenue from all Master’s degree programs returns to the Academy net
of any waivers or scholarships. After that the CCS will take 15% to cover the costs of administering the
program including the salary of a Program Coordinator to be housed at the CCS. After that all income will be
distributed to the colleges and schools whose academic units teach the program’s courses, according to the
number of credit hours taught per student. Each college and school will distribute income to the academic
units according to their individual college and school policies.
Approval from the Curriculum Committee of the College of Arts & Sciences
Letter of Support from the Director of the Center for Computational Science
This letter confirms the support of the Center for Computational Science for the Master of Science in Data
Science proposed to be housed under the Department of Computer Science in conjunction with the Center for
Computational Science and other units of the university. The Center for Computational Science is happy to
administer the program and offer the use of computing resources as needed.
Regards,
1
Letter of Support from the Chairperson of the Department of Computer Science
November 1, 2017
This letter confirms the support of the Department of Computer Science for the Master of Science in
Data Science, proposed by the Department of Computer Science in conjunction with the Center for
Computational Sciences and other units of the university. The Department of Computer Science is
pleased to have courses in the program.
Regards,
Geoff Sutcliffe
Professor and Chair of Computer Science
Letter of Support from the Department of Mathematics
Letter of Support from the School of Architecture
This letter confirms the support of the School of Architecture for the Master of Science in Data Science
proposed to be housed in the Department of Computer Science in conjunction with the Center for
Computational Science, the School of Communication, the Rosenstiel School of Marine and Atmospheric
Science and other units of the university. The interdisciplinary design of the Master in Data Science
program is extremely valuable for students who seek to combine rigorous technical training with data
science applications in architecture. The School of Architecture has been an integral collaborator in the
process of creating the Master in Data Science program, and is pleased to have a track in Smart Cities as
part of the program.
Regards,
2 November 2017
If you need any further information, please do not hesitate to contact me either by
phone (305-421-4046), fax (305-421-4696) or by e-mail ([email protected]).
Sincerely,
Ben Kirtman
Professor, Department of Atmospheric Sciences
Director, Cooperative Institute for Marine and Atmospheric Studies
Rosenstiel School of Marine and Atmospheric Sciences
Program Director, Climate and Environmental Hazards
Center for Computational Science
University of Miami
Letter of Support from the Big Data Advisory Board
T: (305) 390-2550 | F: (305) 386-7474 | www.thinergistics.com | P.O. Box 160397, Miami, FL 33116
Additional Letter of Support from the Big Data Advisory Board
Additional Letter of Support from the Big Data Advisory Board
PROPOSAL
1. Rationale
Many large companies today have data science departments. Data scientists who can not only perform various
data analysis techniques but also are able to interpret the results by drawing on their domain knowledge into
actionable items are in high demand, as executives seek talented individuals capable of unlocking the hidden
value in big data to garner strategic insights and business results. The challenges of modern data science
require data scientists to possess strong training in both data analysis technologies and also domain specific
issues. Founded on computer science, mathematics, statistics, and optimization techniques, data scientists
add deep content knowledge in specialized applications such as communications, architecture, and marine
sciences. Training data scientists requires an interdisciplinary approach that ensures that the students are
well-trained and able to take up the role of data scientist in any organization. The Harvard Business Review
has dubbed data science as “the sexiest job of the 21st century.”
Data science consists of many steps. It starts with project conception then moves on to collection and
harvesting data from possible data sources, preprocessing the collected and harvested data, integrating
multiple data sets for analysis, conducting analysis for modeling and prediction, visualizing obtained results for
interpretation, developing actionable plans from obtained results, and preserving the curated data. Because
of the size and complexity of the data sets, traditional data processing tools (for example, those available in
spreadsheet programs and statistical software) are inadequate. In practice, many of the steps in analyzing
large data sets are done using purpose-specific computer programs that require more than a click of a button;
they require a solid understanding of the principles that the computer programs embody.
The position of Data Scientist is a growing job market. The need for data science and the shortage of data
scientists are well articulated in various reports [Herold; Orihuele and Bass; Sents], including an in-depth
analysis [McKinsey]. Those who suspect that data science is a “hype” warn that the demands for data
scientists may lesson soon, when many of the by-hand tasks of the data analytics processes are incorporated
into software tools [Darrow]. A counter argument is that while software tools become more intelligent, the
size and complexity of the datasets keep increasing. The 2016 McKinsey Report states, “Back in 2011, the
McKinsey Global Institute published a report highlighting the transformational potential of big data.1 Five
years later, we remain convinced that this potential has not been overhyped. In fact, we now believe that our
2011 analyses gave only a partial view. The range of applications and opportunities has grown even larger
today.” The report estimates that there will be a shortfall of 250K data scientists, but 50% of the work could
be automated. As of April, 2017, there are 35K job openings listed at Glassdoor.com and 11K job openings
listed at LinkedIn.com. The median salary of data scientists is estimated to be around $118K while that of
skilled programmers is estimated to be $65K, according to Wired.com. Thus, the market potential of data
science degree programs appears to be high.
At least two dozen universities now have a Master of Science in Data Science (or a similar degree). The current
offerors include: Arizona State (Business Analytics), Carnegie Mellon (Computational Data Science), Central
Florida, Columbia, Cornell (MPS in Applied Statistics), Georgia Tech., Illinois at Urbana Champaign
(Professional MS), Illinois Tech, Indiana, Johns Hopkins (online), Minnesota, NYU, North Carolina State
(Analytics), Northwestern (Analytics), Rochester, Rutgers (MS Business and Science), San Francisco, Southern
California, Stanford, Texas A&M (Analytics), UC Berkeley (online), UC San Diego (Analytics), Virginia,
Washington – Seattle, and Wisconsin. The duration of these programs ranges from 10 to 12 months with
specific guidelines from selected programs highlighted in Section 6 of this proposal.
Interdisciplinary master’s degree programs with an emphasis on professional training already exist at the
University of Miami. For example, the College of Arts & Sciences currently offers interdisciplinary professional
master’s degree programs in International Administration, Liberal Arts, Latin American Studies, and
Mathematical Finance, as well as departmental professional master’s degrees in Applied Behavior Analysis,
Anthropology, Criminology and Criminal Justice, International Studies, and Public Administration. Additionally,
the Professional Science Master’s (PSM) program at RSMAS prepares its students for science careers in
industry, government, and nonprofit organizations, where employment demands are growing. The curricula
are structured to allow students to complete their degree in as little as 12 months, with the training and
internship experiences necessary to prepare them for careers in today’s professional job market.
There are two existing UM master’s degree programs whose contents have some overlap with the proposed
program: Master of Science in Business Analytics in the School of Business and Master of Fine Arts in
Interactive Media in the School of Communication. The MS in Data Science is distinctly different from these
other programs at the University of Miami, with a focus on teaching key skills for conducting science with data
and a strong emphasis on interdisciplinary education. It will draw from a separate pool of potential students,
and produce graduates with a different set of data science skills. As such, the students will emerge into
different job markets, outside of communication, finance, and business.
2. Curriculum
a. List the major divisions of the discipline in which graduate degree work will be offered.
The program will be administered by the Center for Computational Science, jointly with the academic units
that offer the courses: School of Architecture, Department of Computer Science, Department of Mathematics,
School of Communication, and RSMAS). Since computation serves as the backbone of data science and since
the Center has served and collaborated with the university-wide research community, the CCS is an ideal
organization for taking the lead role in administering the program, and will appoint a program coordinator to
take overall responsibility for the program. The program coordinator will be responsible for recruiting, finance,
internships, and program administration. The Department of Computer Science (College of Arts & Sciences)
will serve as the home department for the degree. Additionally, the program will be advised by the following:
● Administrative Group: This group will represent all the academic units teaching courses in the program,
and will be in charge of overseeing the validity and health of the program. No changes can be made to
the academic structure without approval of this group. These members will be in charge of advising
their respective students. The current members are:
o Prof. Alberto Cairo (Journalism and Media Management, School of Communications)
o Prof. Ben Kirtman (Meteorology and Physical Oceanography, RSMAS)
o Prof. Mitsunori Ogihara (Department of Computer Science, College of Arts and Sciences)
o Prof. Rodolphe el-Khoury (School of Architecture)
o Prof. Mei-Ling Shyu (Department of Electrical and Computer Engineering, College of
Engineering)
● Industrial Advisory Board: The advisory board will consist of representatives from companies who are
willing to take in and supervise interns from the program for six months. Presently the board members
include:
o Pete Martinez, Chairman and CEO, GameChangerTec
o Hector Irizarry, Founding Partner, Thinergistics
o Gang Wang, Manager, Operations & Analytics, Royal Caribbean Cruise Lines
o Matthew Pape, Director, Market Data Analytics & Insight, Ryder - Global Marketing
o Louis Gidel, Chief Medical Informatics and Quality Officer, Baptist Health South Florida
o Daniel Cohen, Senior Vice President, Digital Payments and Labs for MasterCard Latin America
and the Caribbean
3. Requirements
a. Prerequisites.
Admission to the program will be handled by the Center for Computational Science. Requirements are:
i. Completion of an application
ii. A Baccalaureate degree from an accredited institution
iii. A cumulative undergraduate GPA of 3.0
iv. Introduction to Probability and Statistics and Computer Programming I (or equivalents). Students
may be admitted with deficiencies, which must be completed in addition to the degree
requirements.
v. GRE general test scores
a. Applicants must rank in the 65% percentile or higher in the Quantitative Reasoning test. There
is no minimum score requirement for the other parts of the GRE.
vi. Students from non-English speaking countries must send either TOEFL or IELTS scores.
a. TOEFL minimum score: Internet based - 92; Computer based - 237; Paper based - 580.
b. IELTS minimum score: 6.5.
vii. A personal statement of intent in which the applicant details reasons for pursuing the degree.
Credit can be given for prior study. At least 15 credits of courses must be completed at the University of
Miami.
b. Courses.
The curriculum consists of three components: data science tool courses, data science application courses, and
an internship. Students must complete at least 30 credits of graduate level courses to complete the degree.
Each student’s selection of courses must be approved by a member of the administrative group (to avoid
course overlap, ensure coherence, etc.).
Tracks
Several tracks have been defined to provide focused study within the general requirements of the degree.
Students may choose to follow a track, or may build their own individualized program in consultation with
their advisor.
● Technical Data Science (Department of Computer Science)
Core: (9 credits)
MTH642 Statistical Analysis
ECE648 Machine Learning
xor ECE677 Data Mining
CSC688 Data Science and Visualization
Tools: (12 credits)
(A) Programming:
CSC6XX Programming in Python
xor CSC632 Introduction to Parallel Computing
xor CSC640 Algorithm Design and Analysis
(B) Machine Learning:
CSC6XX Principles and Practice of Deep Learning
or CSC746 Neural Networks and Deep Learning
or ECE648 Machine Learning
(C) Data Analysis:
ECE697 Advanced Big Data Analysis
xor ECE677 Data Mining
(D) Statistics:
MTH624 Introduction to Probability Theory
xor MTH625 Introduction to Mathematical Statistics
Applications (6 credits)
Internship (3 credits)
c. Examinations
Each course contributes independently to completion of the program, and there is no cumulative examination.
Students must finish with a GPA of 3.0 in order to be awarded the degree.
4. Students
Students in this program are likely to come from a range of backgrounds, however, they must meet the
minimum quantitative GRE requirements outline above. We aim to attract students with interests in data
mining, artificial intelligence, visualization, smart cities, media, geographic information systems (GIS), and
climate modeling.
5. Resources
c. Budget
The program uses existing courses, and courses that the various academic units are already committed to
creating, so that there is no budget required for course creation.
Revenue distribution: 30% of tuition revenue from all Master’s degree programs returns to the Academy net
of any waivers or scholarships. After that the CCS will take 15% to cover the costs of administering the
program including the salary of a Program Coordinator to be housed at the CCS. After that all income will be
distributed to the colleges and schools whose academic units teach the program’s courses, according to the
number of credit hours taught per student. Each college and school will distribute income to the academic
units according to their individual college and school policies.
6. Comparisons
● MS in Data Science, Columbia University, Offered by the Data Science Institute at Columbia
o 30 credits, 6 core courses covering the essentials of computer science, probability, statistics and
machine learning and a capstone project in the last semester.
o Remaining 3 courses can be taken as electives from across the university, including computer
science, statistics, business, and civil engineering
o Research and internship opportunities available.
● MS in Data Science, New York University, Offered by the Center for Data Science at NYU
o 36 credits, 6 core courses covering the essentials of statistics and machine learning and a
capstone project in the last semester.
o Remaining 6 courses can be taken as electives in applied statistics, bioinformatics, computer
science, mathematical finance, political science and engineering
o Research and internship opportunities available.
● MS in Computational Data Science, Carnegie Mellon University, Offered by the School of Computer
Science
o 36 credits, 2 concentrations – Analytics or Systems.
o 5 core courses, 3, electives, 2 seminar courses and 1 capstone project is required
o Electives which can be taken from the Department of Computer Science.
o Research and internship opportunities available.
Local/State Programs
● MS in Data Science, Florida International University, Offered by the School of Computing and
Information Science
o 30 credits, 4 specializations – Computational Data Analytics, Business Analytics, Hospitality
Analytics, Biostatistics Data Analytics
o 4 core courses, electives, capstone project
o No research component
● MS in Data Analytics, University of Central Florida, Offered jointly by the College of Sciences and the
College of Engineering and Computer Science
o 30 credits, 8 core courses, 2 electives, capstone project
o Paid internships available
o No research component
Other Related Degree Programs
● MS in Statistics: Data Science, Stanford, Offered jointly by the Department of Statistics and the
Institute for Computational and Mathematical Engineering
o Track within the MS in Statistics degree
o 45 credits/units, no thesis but a capstone project
o Lab work in the Stanford Data Lab
o Internship and research opportunities are available
7. References
[Darrow] Barb Darrow (2015) Data science is still white hot, but nothing lasts forever. Fortune Technology,
May 21, 2015. http://fortune.com/2015/05/21/data-science-white-hot/
[Herold] Benjamin Herold (2016) In Analytics Economy, Demand for ata Scientists Outpaces Supply, McKinsey
Says. Education Weekly, December 8, 2016.
http://blogs.edweek.org/edweek/DigitalEducation/2016/12/analytics_economy_data_science_mckins
ey.html
[McKinsey] The Age of Analytics. McKinsey Global Institute. December, 2016.
[Orihuela and Bass] Rodrigo Orihuela and Dina Bass (2015) Help Wanted: Black Belts in Data. Bloomberg, June
4, 2015. https://www.bloomberg.com/news/articles/2015-06-04/help-wanted-black-belts-in-data
[Sents] Rob Sents (2016) Want to Become a Data Scientist? Where the Jobs Are And What Employers Are
Looking For. Forbes Magazine, November 16, 2016.
https://www.forbes.com/sites/emsi/2016/11/16/want-to-become-a-data-scientist-where-the-jobs-
are-and-what-employers-are-looking-for/#7c82a9915760