Big Data - SRM University PDF
Big Data - SRM University PDF
Big Data - SRM University PDF
0 IT2015 SRM(E&T)
COURSE COURSE NAME L T P C
CODE
CORE COURSES - SEMESTER I & II
DA2001 Foundations of Data Science 3 0 2 4
DA2002 Big Data Technology 2 0 3 4
DA2003 Computing for Data Analytics 3 0 2 4
DA2004 Programming for Data Analytics 2 0 3 4
DA2005 Marketing Analytics 3 2 0 4
DA2006 Algorithms for Advanced Analytics 3 0 2 4
CORE COURSES - SEMESTER III
DA2047 Seminar 0 0 1 1
DA2049 Project work Phase I 0 0 12 6
CORE COURSE - SEMESTER IV
1 IT2015 SRM(E&T)
NOTE:
Students have to register for the courses as per the following guidelines:
Credits
Sl.
Category II IV Category
No. I Semester III Semester
Semester Semester total
1 Core courses 12 12 --- --- 24
( 3 courses) ( 3 courses)
2 Program Elective 18 (in I to III semesters) --- 18
courses
Interdisciplinary 3 (in I to III semesters) 3
elective courses
(any one program
elective from other
programs)
3 Supportive courses 3 (in I to III semesters) --- 3
- mandatory
4 Seminar --- --- 1 --- 1
6 Project work --- --- 06 16 22
Total 71
2 IT2015 SRM(E&T)
Course code Course Title L T P C
DA2001 FOUNDATIONS OF DATA SCIENCE 3 0 2 4
Total contact hours - 75
Prerequisite
Nil
PURPOSE
Modern scientific, engineering, and business applications are increasingly dependent on data,
existing traditional data analysis technologies were not designed for the complexity of the
modern world. Data Science has emerged as a new, exciting, and fast-paced discipline that
explores novel statistical, algorithmic, and implementation challenges that emerge in
processing, storing, and extracting knowledge from Big Data.
INSTRUCTIONAL OBJECTIVES
1. Able to apply fundamental algorithmic ideas to process data.
2. Learn to apply hypotheses and data into actionable predictions.
3. Document and transfer the results and effectively communicate the findings using
visualization techniques.
3 IT2015 SRM(E&T)
REFERENCES
1. Nina Zumel, John Mount, “Practical Data Science with R”, Manning Publications, 2014.
2. Jure Leskovec, Anand Rajaraman, Jeffrey D. Ullman, “Mining of Massive Datasets”,
Cambridge University Press, 2014.
3. Mark Gardener, “Beginning R - The Statistical Pr ogramming Language”, John Wiley &
Sons, Inc., 2012.
4. W. N. Venables, D. M. Smith and the R Core Team, “An Introduction to R”, 2013.
5. Tony Ojeda, Sean Patrick Murphy, Benjamin Bengfort, Abhijit Dasgupta, “Practical Data
Science Cookbook”, Packt Publishing Ltd., 2014.
6. Nathan Yau, “Visualize This: The FlowingData Guide to Design, Visualization, and
Statistics”, Wiley, 2011.
7. Boris lublinsky, Kevin t. Smith, Alexey Yakubovich, “Professional Hadoop Solutions”,
Wiley, ISBN: 9788126551071, 2015.
8. http://www.johndcook.com/R_language_for_programmers.html
9. http://bigdatauniversity.com/
10. http://home.ubalt.edu/ntsbarsh/stat-data/topics.htm#rintroduction
4 IT2015 SRM(E&T)
Course code Course Title L T P C
DA2002 BIG DATA TECHNOLOGY 2 0 3 4
Total contact hours – 75
Prerequisite
Nil
PURPOSE
This course provides practical foundation level training that enables immediate and effective
participation in big data projects. The course provides grounding in basic and advanced
methods to big data technology and tools, including MapReduce and Hadoop and its
ecosystem.
INSTRUCTIONAL OBJECTIVES
1. Learn tips and tricks for Big Data use cases and solutions.
2. Learn to build and maintain reliable, scalable, distributed systems with Apache
Hadoop.
3. Able to apply Hadoop ecosystem components.
REFERENCES
1. Boris lublinsky, Kevin t. Smith, Alexey Yakubovich, “Professional Hadoop Solutions”,
Wiley, ISBN: 9788126551071, 2015.
2. Chris Eaton, Dirk deroos et al. , “Understanding Big data ”, McGraw Hill, 2012.
3. Tom White, “HADOOP: The definitive Guide” , O Reilly 2012.
5 IT2015 SRM(E&T)
4. Vignesh Prajapati, “Big Data Analytics with R and Haoop”, Packet Publishing 2013.
5. Tom Plunkett, Brian Macdonald et al, “Oracle Big Data Handbook”, Oracle Press, 2014.
6. http://www.bigdatauniversity.com/
7. Jy Liebowitz, “Big Data and Business analytics”,CRC press, 2013.
6 IT2015 SRM(E&T)
REFERENCES:
1. Chris Eaton, Dirk Deroos, Tom Deutsch et al., “Understanding Big Data”, McGrawHIll,
2012.
2. Alberto Cordoba, “Understanding the Predictive Analytics Lifecycle”, Wiley, 2014.
3. Eric Siegel, Thomas H. Davenport, “Predictive Analytics: The Power to Predict Who Will
Click, Buy, Lie, or Die”, Wiley, 2013.
4. James R Evans, “Business Analytics – Methods, Models and Decisions”, Pearson 2013.
5. R. N. Prasad, Seema Acharya, “Fundamentals of Business Analytics”, Wiley, 2015.
6. S M Ross, “Introduction to Probability and Statistics for Engineers and Scientists”,
Academic Foundation, 2011.
7. David Hand, Heiki Mannila, Padhria Smyth, “Principles of Data Mining”, PHI 2013.
8. Spyros Makridakis, Steven C Wheelwright, Rob J Hyndman, “Forecasting methods and
applications”, Wiley 2013( Reprint).
9. David Hand, Heikki Mannila, Padhraic Smyth, “Principles of Data mining”, PHI 2013.
10. http://cran.r-project.org/doc/manuals/R-intro.html
11. W.N. Venables, D.M Smith, “An introduction to R”,
12. R in Nutshell , O Reilly,
7 IT2015 SRM(E&T)
UNIT IV – STREAMS AND FILES (6 hours)
Streams – Text Input and Output – Reading and Writing Binary Data – Zip Archives – Object
Streams and Serialization – Memory Mapped Files.
REFERENCES:
1. White, “Hadoop: The Definitive Guide”, Third Edition - 2012 – O’Reilly – ISBN:
9789350237564.
2. Cay S. Horstmann, Gary Cornell, “Core Java™ 2: Volume II–Advanced Features”, Prentice
Hall, 9th edition, ISBN: 978-0137081608.
3. Jean Dollimore, Tim Kindberg, George Coulouris, “Distributed Systems Concepts and
Design”, 4th Edition, Jun 2005, Hardback, 944 pages, ISBN: 9780321263544.
4. Y. Daniel Liang, Introduction to Java Programming, Tenth Edition, Pearson, 2015.
8 IT2015 SRM(E&T)
UNIT IV – MARKET SEGMENTATION (9 hours)
Cluster Analysis - User-Based Collaborative Filtering - Collaborative Filtering - Using
Classification Trees for Segmentation.
REFERENCES
1. Wayne.L.Winston, “Marketing Analytics: Data driven techniques with MS-Excel”, Wiley,
1st ed. 2014.
2. Stephan Sorger, “Marketing Analytics: Strategic models and metrics”, CreateSpace
Independent Publishing Platform, 1st ed., 2013.
9 IT2015 SRM(E&T)
UNIT IV – SUPPORT VECTOR MACHINES (11 hours)
Learning and Soft Computing: Rationale, Motivations, Needs, Basics: Examples of Applications in
Diverse Fields, Basic Tools of Soft Computing: Neural Networks, Fuzzy Logic Systems, and
Support Vector Machines, Basic Mathematics of Soft Computing, Learning and Statistical
Approaches to Regression and Classification - Support Vector Machines - Risk Minimization
Principles and the Concept of Uniform Convergence, The VC Dimension, Structural Risk
Minimization, Support Vector Machine Algorithms.
REFERENCES
1. Jiawei Han and Micheline Kamber, “Data Mining: Concepts and Techniques”, Morgan
Kaufmann Publishers, 3rd ed, 2010.
2. Lior Rokach and Oded Maimon, “Data Mining and Knowledge Discovery Handbook”,
Springer, 2nd edition, 2010.
3. Ronen Feldman and James Sanger, “The Text Mining Handbook: Advanced Approaches in
Analyzing Unstructured Data”, Cambridge University Press, 2006.
4. Vojislav Kecman, “Learning and Soft Computing”, MIT Press, 2010.
5. Jared Dean, “Big Data, Data Mining, and Machine Learning: Value Creation for Business
Leaders and Practitioners”, Wiley India Private Limited, 2014.
10 IT2015 SRM(E&T)
UNIT II – FACTOR ANALYSIS (9 hours)
Factor Analysis: Meanings, Objectives and Assumptions, Designing a factor analysis, Deriving
factors and assessing overall factors, Interpreting the factors and validation of factor analysis.
REFERENCES:
1. Joseph F Hair, William C Black etal , “Multivariate Data Analysis” , Pearson Education,
7th edition, 2013.
2. T. W. Anderson , “An Introduction to Multivariate Statistical Analysis, 3rd Edition”, Wiley,
2003.
3. William r Dillon, John Wiley & sons, “Multivariate Analysis methods and applications”,
Wiley, 1984.
4. Naresh K Malhotra, Satyabhusan Dash, “Marketing Research Anapplied Orientation”,
Pearson, 2011.
5. Hamdy A Taha, “Operations Research”, Pearson, 2012.
6. S R Yaday, A K Malik, “Operations Research”, Oxford, 2014.
11 IT2015 SRM(E&T)
UNIT I – PYTHON CONCEPTS , DATA STRUCTURES, CLASSES (6 hours)
Interpreter – Program Execution – Statements – Expressions – Flow Controls – Functions -
Numeric Types – Sequences - Strings, Tuples, Lists and - Class Definition – Constructors –
Inheritance – Overloading – Text & Binary Files - Reading and Writing.
REFERENCES:
1. Mark Lutz, “Programming Python”, O'Reilly Media, 4th edition, 2010.
2. Mark Lutz, “Learning Python”, O'Reilly Media, 5th Edition, 2013.
3. Tim Hall and J-P Stacey, “Python 3 for Absolute Beginners”, Apress, 1st edition, 2009.
4. Magnus Lie Hetland, “Beginning Python: From Novice to Professional”, Apress, Second
Edition, 2005.
5. Shai Vaingast, “Beginning Python Visualization Crafting Visual Transformation Scripts”,
Apress, 2nd edition, 2014.
6. Wes Mc Kinney, “Python for Data Analysis”, O'Reilly Media, 2012.
7. White, “Hadoop: The Definitive Guide”, Third Edition - O’Reilly , 2012.
8. Brandon Rhodes and John Goerzen, “Foundations of Python Network Programming: The
Comprehensive Guide to Building Network Applications with Python”,Apress, Second
Edition, 2010.
9. http://blog.matthewrathbone.com/2013/11/17/python-map-reduce-on-hadoop---a-beginners-
tutorial.html
10. http://www.michael-noll.com/tutorials/writing-an-hadoop-mapreduce-program-in-python/
11. http://allthingshadoop.com/category/python/
12 IT2015 SRM(E&T)
Course code Course Title L T P C
DA2102 DECISION MANAGEMENT SYSTEMS 2 2 0 3
Total contact hours – 60
Prerequisite
Nil
PURPOSE
The course elaborates on the principles that guide the development of Decision Management
System and lays out the framework for building them. It helps how to find suitable decisions
and develop the understanding of those decisions that helps to automate them.
INSTRUCTIONAL OBJECTIVES
1. Understanding how Decision Management Systems can transform the business.
2. Planning the systems “with the decision in mind”.
3. Identifying, modeling, and prioritizing the decisions.
4. Designing and implementing robust decision services.
5. Monitoring ongoing decision-making and learning how to improve it.
13 IT2015 SRM(E&T)
REFERENCES
1. James Taylor, “Decision Management Systems-A Practical guide to using Business rules
and Predictive Analytics”, IBM Press, 2012.
2. Efraim Turban , Jay E. Aronson , Ting-Peng Liang, “Decision Support Systems &
Intelligent Systems”, 9th edition, Prentice Hall, 2010.
3. Alberto Cordoba, “Understanding the Predictive Analytics Lifecycle”, Wiley, 2014.
4. Eric Siegel, Thomas H. Davenport, “Predictive Analytics: The Power to Predict Who Will
Click, Buy, Lie, or Die”, Wiley, 2013.
5. George M Marakas, “Decision support Systems”, 2nd Edition, Pearson/Prentice Hall,2002
6. V.S. Janakiraman, K. Sarukesi, “Decision Support Systems”,PHI, ISBN8120314441,
9788120314443, 2004.
7. Efrem G Mallach, “Decision Support systems and Data warehouse Systems”, McGraw Hill,
thirteenth reprint, 2008.
14 IT2015 SRM(E&T)
UNIT V – DATA SECURITY & EVENT LOGGING (6 hours)
Integrating Hadoop with Enterprise Security Systems - Securing Sensitive Data in Hadoop – SIEM
system – Setting up audit logging in hadoop cluster
REFERENCES:
1. Mark Van Rijmenam, “Think Bigger: Developing a Successful Big Data Strategy for Your
Business”, Amazon, 1 edition, 2014.
2. Frank Ohlhorst John Wiley & Sons, “Big Data Analytics: Turning Big Data into Big
Money”, John Wiley & Sons, 2013.
3. Sherif Sakr, “Large Scale and Big Data: Processing and Management”, CRC Press, 2014.
4. Sudeesh Narayanan, “Securing Hadoop”, Packt Publishing, 2013.
5. Ben Spivey, Joey Echeverria, “Hadoop Security Protecting Your Big Data Problem”,
O’Reilly Media, 2015.
6. Top Tips for Securing Big Data Environments: e-book
(http://www.ibmbigdatahub.com/whitepaper/top-tips-securing-big-data-environments-e-
book)
7. http://www.dataguise.com/?q=securing-hadoop-discovering-and-securing-sensitive-data-
hadoop-data-stores
8. Gazzang for Hadoop http://www.cloudera.com/content/cloudera/en/solutions/enterprise-
solutions/security-for-hadoop.html
9. eCryptfs for Hadoop https://launchpad.net/ecryptfs.
10. Project Rhino - https://github.com/intel-hadoop/project-rhino/
References:
1. Clark Abrahams and Mingyuan Zhang, “Credit Risk Assessment: The New Lending System for
Borrowers, Lenders, and Investors”, ISBN 978-0-470-46168-6
2. Naeem Siddiqi, “Credit Risk Scorecards: Developing and Implementing Intelligent Credit
Scoring”, ISBN 978-0-471-75451-0
3. Laura B. Madsen, “Data-Driven Healthcare: How Analytics and BI are Transforming the
Industry”, M.S.ISBN 978-1-118-77221-8
4. Jason Burke, “Health Analytics: Gaining the Insights to Transform Health Care”, John Wiley
Sons Inc., 2013, ISBN: 978-1-118-38304-9
5. Jac Fitz-Enz , John R. Mattox II, “Predictive Analytics for Human Resources”, ISBN-13: 978-
8126552153.
6. James C. Sesil, “Applying Advanced Analytics to HR Management Decisions: Methods for
Selection, Developing Incentives, and Improving Collaboration”, ISBN-13: 978-0133064605
Weblink:
1. http://www.capgemini.com/resource-file
access/resource/pdf/Analytics__A_Powerful_Tool_for_the_Life_Insurance_Industry.pdf
16 IT2015 SRM(E&T)
Course code Course Title L T P C
IT2110 INFORMATION STORAGE MANAGEMENT 3 0 0 3
Total contact hours - 45
Prerequisite
Knowledge of Database Management Systems, Computer
Networks is preferred
PURPOSE
Information Storage and Management have highly developed into a sophisticated pillar of
information technology, provides a variety of solutions for storing, managing, accessing,
protecting, securing, sharing and optimizing information.
INSTRUCTIONAL OBJECTIVES
1 Identify the components of managing the data center and Understand logical and
physical components of a storage infrastructure.
2 Evaluate storage architectures, including storage subsystems SAN, NAS, IPSAN,CAS
3 Understand thebusiness continuity, backup and recovery methods.
REFERENCES
1. EMC Corporation, Information Storage and Management, WileyIndia, 2nd Edition, 2011.
2. Robert Spalding, “Storage Networks: The Complete Reference”, Tata McGraw Hill,
Osborne, 2003.
3. Marc Farley, Building Storage Networks, Tata McGraw Hill , Osborne,2nd Edition, 2001.
4. Meeta Gupta, Storage Area Network Fundamentals, Pearson Education Limited, 2002.
17 IT2015 SRM(E&T)
Course code Course Title L T P C
IT2111 CLOUD COMPUTING 2 0 2 3
Total contact hours – 60
Prerequisite
Knowledge of Computer Networks is preferred
PURPOSE
Cloud Computing has drawn the attention of industries and researchers worldwide. Many
applications that are being built nowadays were developed to suit the needs of cloud environment.
Hence it becomes necessary to have course in cloud computing which deals with the basics of
cloud, different services offered by cloud, and security issues in cloud. In a nutshell, this course
on cloud computing provides information on fundamental aspects of the cloud environment.
INSTRUCTIONAL OBJECTIVES
1 Learn about different deployment models of cloud and different services offered by cloud
2 Understand the technique of virtualization through theoretical concepts and practical training
3 Become knowledgeable in the rudimentary aspects of cloud application development
REFERENCES
1. Anthony T .Velte, Toby J.Velte, Robert Elsenpeter, “Cloud Computing: A Practical
Approach”, Tata McGraw Hill Edition, Fourth Reprint, 2010.
2. Kris Jamsa, “Cloud Computing: SaaS, PaaS, IaaS, Virtualization, Business Models, Mobile,
Security and more”, Jones & Bartlett Learning Company LLC, 2013.
3. Ronald L.Krutz, Russell vines, “Cloud Security: A Comprehensive Guide to Secure Cloud
Computing”, Wiley Publishing Inc., 2010.
18 IT2015 SRM(E&T)
Course code Course Title L T P C
DA2105 CLUSTER COMPUTING 2 2 0 3
Total contact hours – 60
Prerequisite
Nil
PURPOSE
A computer cluster consists of a set of loosely or tightly connected computers that work
together so that, in many respects, they can be viewed as a single system. Unlike grid
computers, cluster computers have each node set to perform the same task, controlled and
scheduled by software cluster environment. To improve the performance and availability over
that of a single computer, cluster computers are essential.
INSTRUCTIONAL OBJECTIVES
1. Able to Understand the Cluster installation and configuration
2. Understand the Parallel programming models & paradigms
3. Familiarize with Job management system and cluster scheduling process.
19 IT2015 SRM(E&T)
REFERENCES:
1. Thomas Sterling, “Beowulf Cluster Computing with Linux”, MIT Press, Second Edition,
2003.
2. Rajkumar Buyya , “High Performance Cluster Computing: Architectures and Systems”,
Vol. 1, Prentice Hall PTR, 2007.
3. Rajkumar Buyya, “High Performance Cluster Computing: Programming and Applications”,
Vol 2, Prentice Hall PTR, NJ, USA, 1999.
20 IT2015 SRM(E&T)
UNIT V – APPLICATIVE TRAVERSABLE FUNCTORS & IO (6 hours)
Generalizing monads – Applicative trait – Monads vs Applicative functors – Traversable functors –
Uses of Traverse - Factoring Effects – Simple IO type – Avoiding Stack Overflow – Non
blocking and asynchronous – General purpose IO type.
REFERENCES:
1. Paul Chiusano and Rúnar Bjarnason, “Functional Programming in Scala”, Manning
Publishers, 2014.
2. Dean Wampler, Alex Payne, “Programming Scala”, O'Reilly Media, 2009.
3. Scala by Example www.scala-lang.org/docu/files/ScalaByExample.pdf
4. Martin Odersky, “Scala Language Specification”, 2008, http://www.scala-
lang.org/docu/files/ScalaReference.pdf
5. Scala Library Documentation : http://www.scala-lang.org/docu/files/api/index.html
INSTRUCTIONAL OBJECTIVES
1. Leveraging intelligent web applications.
2. Analyzing web applications in the real world.
3. Building intelligence in your web.
21 IT2015 SRM(E&T)
UNIT IV – REASONING (4 hours)
Reasoning: Logic and its Limits, Dealing with Uncertainty - Mechanical Logic - The Semantic Web
- Limits of Logic - Description and Resolution - Collective Reasoning.
REFERENCES
1. Gautam Shroff, “Intelligent Web - Search, Smart Algorithms, and Big Data”, Oxford
University Press, 2013.
2. Haralambos Marmanis, Dmitry Babenko, “Algorithms of the Intelligent Web”, Manning
publications, 2009.
3. Christopher D. Manning, Prabhakar Raghavan, Hinrich Schütze, “An Introduction to
Information Retrieval”, Cambridge University Press, 2009.
4. Mark Gardener, “Beginning R - The Statistical Pr ogramming Language”, John Wiley &
Sons, Inc., 2012.
5. W. N. Venables, D. M. Smith and the R Core Team, “An Introduction to R”, 2013.
6. http://www.coursetalk.com/coursera/web-intelligence-and-big-data
22 IT2015 SRM(E&T)
UNIT III - INFORMATION NETWORKS AND THE WORLD WIDE WEB (6 hours)
The Structure of the Web- World Wide Web- Information Networks, Hypertext, and Associative
Memory- Web as a Directed Graph, Bow-Tie Structure of the Web- Link Analysis and Web Search-
Searching the Web: Ranking,Link Analysis using Hubs and Authorities- Page Rank- Link Analysis
in Modern Web Search,Applications, Spectral Analysis, Random Walks, and Web Search.
REFERENCES:
1. Easley and Kleinberg, “Networks, Crowds, and Markets: Reasoning about a highly
connected world”, Cambridge Univ. Press, 2010.
2. Robert A. Hanneman and Mark Riddle, “Introduction to social network methods”,
University of California, 2005.
3. Jure Leskovec,Stanford Univ.Anand Rajaraman,Milliway Labs, Jeffrey D. Ullman, “Mining
of Massive Datasets”, Cambridge University Press, 2 edition, 2014.
4. Wasserman, S., & Faust, K, “Social Network Analysis: Methods and Applications”,
Cambridge University Press; 1 edition, 1994.
5. Borgatti, S. P., Everett, M. G., & Johnson, J. C., “Analyzing social networks”, SAGE
Publications Ltd; 1 edition, 2013.
6. John Scott , “Social Network Analysis: A Handbook” , SAGE Publications Ltd; 2nd edition,
2000.
REFERENCES:
1. Byron Ellis, “Real-Time Analytics: Techniques to Analyze and Visualize Streaming Data”,
Wiley, 1st edition, 2014.
2. Sherif Sakr, “Large Scale and Big Data: Processing and Management”, CRC Press, 2014.
2014.
3. Bill Franks, “Taming The Big Data Tidal Wave Finding Opportunities In Huge Data
Streams With Advanced Analytics”, Wiley, 2012.
4. Jure Leskovec, Anand Rajaraman, Jeffrey D. Ullman, “Mining of Massive Datasets”,
Cambridge University Press, 2014.
5. Paul C Zikopoulos, Chris Eaton, Paul Zikopoulos, “Understanding Big Data: Analytics for
Enterprise Class Hadoop and Streaming Data”, McGraw-Hil, 1st edition, 2011.
6. kafka.apache.org
7. flume.apache.org
8. zookeeper.apache.org
9. spark.apache.org
10. zeromq.org
24 IT2015 SRM(E&T)
Course code Course Title L T P C
DA2110 ADVANCED ALGORITHMS 2 0 2 3
Total contact hours - 60
Prerequisite
Nil
PURPOSE
A major thrust of computer science is the design, analysis, implementation, and scientific
evaluation of algorithms to solve critical problems. As parallel computing continues to merge
into the mainstream of computing, it becomes more and more important for students and
scientists to understand the application and analysis of algorithmic paradigms to both the
(traditional) sequential model of computing and to a variety of parallel models.
INSTRUCTIONAL OBJECTIVES
1. To integrate the parallel and sequential algorithms.
2. Design and analysis of paradigms for sequential and parallel models.
REFERENCES
25 IT2015 SRM(E&T)
Course code Course Title L T P C
DA2111 NATURAL LANGUAGE PROCESSING 2 2 0 3
Total contact hours - 60
Prerequisite
Nil
PURPOSE
An explosion of Web-based language techniques, merging of distinct fields, availability of
phone-based dialogue systems, and much more make this an exciting time in speech and
language processing. The first of its kind to thoroughly cover language technology – at all
levels and with all modern technologies, based on applying statistical and other machine-
learning algorithms to large corporations.
INSTRUCTIONAL OBJECTIVES
1. Able to illustrate the relative strengths and weaknesses of various approaches.
2. Understanding statistical sequence labeling, information extraction, question
answering and summarization, advanced topics in speech recognition, speech
synthesis.
3. Learn the language modeling, formal grammars, statistical parsing, machine
translation, and dialog processing.
26 IT2015 SRM(E&T)
REFERENCES
1. Daniel Jurafskey and James H. Martin “Speech and Language Processing”, Prentice Hall,
2009.
2. Christopher D.Manning and Hinrich Schutze, “Foundation of Statistical Natural Language
Processing”, MIT Press, 1999.
3. Ronald Hausser, “Foundations of Computational Linguistics”, Springer-Verleg, 1999.
4. James Allen, “Natural Language Understanding”, Benjamin/Cummings Publishing Co.
1995.
5. Steve Young and Gerrit Bloothooft, “Corpus – Based Methods in Language and Speech
Processing”, Kluwer Academic Publishers, 1997.
INSTRUCTIONAL OBJECTIVES
1. To learn the fundamentals of Pattern Recognition techniques
2. To learn the various Statistical Pattern recognition techniques
3. To learn the various Syntactical Pattern recognition techniques
4. To learn the Neural Pattern recognition techniques
27 IT2015 SRM(E&T)
REFERENCES
1. Robert Schalkoff, “Pattern Recognition: Statistical Structural and NeuralApproaches”, John
wiley & sons , Inc,1992.
2. Earl Gose, Richard johnsonbaugh, Steve Jost, “Pattern Recognition andImage Analysis”,
Prentice Hall of India,.Pvt Ltd, New Delhi, 1996.
3. Duda R.O., P.E.Hart & D.G Stork, “ Pattern Classification”, 2nd Edition, J.Wiley Inc 2001.
4. Duda R.O.& Hart P.E., “Pattern Classification and Scene Analysis”, J.wiley Inc, 1973.
5. Bishop C.M., “Neural Networks for Pattern Recognition”, Oxford University Press, 1995.
28 IT2015 SRM(E&T)