Final Report 1
Final Report 1
Final Report 1
A Project Report on
ADARSH S G 4MH19IS002
AKSHATHA M 4MH20IS009
DHANUSH G R 4MH20IS027
RAKESH J 4MH20IS066
2023-2024
MANDYA-571477
CERTIFICATE
Certified that the project work titled “COURSE RECOMMENDATION AND SKILL
PREDICTION BASED ON CAREER CHOICE” has been successfully carried out by
ADARSH S G [4MH19IS002], AKSHATHA M [4MH20IS009], DHANUSH G
R[4MH20IS027], RAKESH J [4MH20IS066] bonafide students of Maharaja Institute of
Technology, Mysore in partial fulfilment of requirements of Degree of Bachelor of
Engineering in Information Science & Engineering of Visvesvaraya Technological
University, Belgaum during the academic year 2019-2020. It is certified that all
corrections/suggestions indicated for the internal assessment have been incorporated in the
report deposited in the department library. The project report has been approved as it satisfies
the academic requirements with respect to the project work prescribed for Bachelor of
Engineering Degree.
1.
2.
ACKNOWLEDGEMENT
We sincerely owe our gratitude to all the persons who helped and guided us in completing this
technical seminar work
We are thankful to Dr. B.G. Naresh Kumar, Principal, Maharaja Institute of Technology
Mysore, for having supported us in our academic endeavors.
We are extremely thankful to Dr. Sharath Kumar Y H, Professor & Head, Department of
Information Science and Engineering, for his valuable support and his timely inquiries into
the progress of the work.
ADARSH S G 4MH19IS002
AKSHATHA M 4MH20IS009
DHANUSH G R 4MH20IS027
RAKESH J 4MH20IS066
ABSTRACT
In today's digital age, online learning has emerged as a popular and convenient way
for individuals to acquire new skills, expand their knowledge, and pursue educational
opportunities from the comfort of their homes. With the proliferation of online course
platforms offering a vast array of courses across various domains, the challenge for
learners lies in navigating this abundance of options to find the most relevant and
suitable courses that align with their interests, goals, and learning preferences. This is
where the role of an Online Course Recommendation System becomes invaluable.
Non-Functional Requirements 23
System Requirements 25
26
Chapter 6 :TESTING
Design of Test Cases 37
Unit Testing 37
Integration Testing 38
Acceptance Testing 38
System Testing 39
Test strategy and Approach 39
Test Cases 40
vi
Course Recommendation and Skill Prediction Based on Career Choice 2023-2024
Chapter 1
INTRODUCTION
Overview
The Course Recommendation and Skill Prediction system (CRSP) is a
sophisticated tool designed to guide individuals in making informed decisions about their
education based on their chosen career paths. Using advanced machine learning
algorithms, CRSP assesses user-provided career aspirations, industry trends, and job
market data to generate personalized course recommendations. By analyzing historical
job postings, CRSP predicts future skill demands, allowing users to identify areas for skill
enhancement. The system seamlessly integrates a vast database of courses from both
traditional and online platforms, employing a refined matching algorithm to provide a
ranked list of recommended courses. CRSP also emphasizes user feedback for continuous
improvement and a dynamic learning experience. Privacy and data security are
paramount, with stringent measures in place to safeguard user information. CRSP
empowers individuals to proactively shape their professional development, ensuring that
their educational pursuits are aligned with the demands of their chosen career path, thus
enabling lifelong learning and sustained career growth.
Problem statement
Every organization wants to be success. Proper plan and suitable human resources or
capable human resources may lead the organization success. It is very important to know
the right candidate for the right job and also factors associated with job types. There is no
automation for identification of factors associated with jobs in real time which may lead
to low candidate performance and loss to the organization. A proper automated system
required which proposed system does using machine learning algorithms.
The solution
Course Recommendation and Skill Prediction (CRSP) employs a multi-faceted
approach. Natural Language Processing (NLP) is used to extract and categorize user-
provided career preferences. Machine learning algorithms analyze industry trends, job
market data, and historical job postings to predict future skill demands. A matching
algorithm considers course content, duration, and user feedback in the extensive course
database. Adaptive learning techniques refine recommendations based on user
interactions. Data encryption and strict access controls ensure privacy and security.
Continuous integration with reputable data sources keeps recommendations up-to-date.
This comprehensive framework leverages advanced technologies to empower individuals
in making informed decisions about their educational pursuits aligned with their chosen
career .
Existing System
Student management system – tool which maintains student academic details such as
marks, attendance, admission, fees, sports, placement cell etc. but doesn’t provides any
useful information related to recruitment process.
General Counseling – this is manual system where college staffs will sit and conduct
counseling for the students to know the issues they are facing. This is manual process
and too time consuming.
Human Ad-visors – this system is also manual where experts gives their opinions,
suggestions for the issues. Manual process which requires time, experience and money.
Recruitment process and also identifying jobs factors which suits job type is a complex
process in any organization. Current recruitment process requires more time, more
money, human resources and advertisement etc. In the current system there is no system
to identify factors for the jobs and also online course recommendation. Complete
recruitment process is manual. It is a very challenging task for HR team to identify the
suitable applicants for the jobs. Few existing research papers just developed machine
learning models and shown results of around 90%, but these works are not suitable for
real time. We require a system that can automatically identifies factors associated with
Proposed System
System has 2 major objectives online course recommendation for the students and
predicting the relationship between educational factors with job types.
System is a browser based application where end users can use browsers such as
chrome, edge, and opera to access the project.
Chapter 2
LITERATURE SURVEY
Survey Papers
In this paper, the author proposes Skill-based Career Path Modeling and
Recommendation The development of new technologies at an unprecedented rate is
rapidly changing the landscape of the labor market. Therefore, for workers who want to
build a successful career, acquiring new skills required by new jobs through lifelong
learning is crucial. In this paper, we propose a novel and interpretable monotonic
nonlinear state-space model to analyze online user professional profiles and provide
actionable feedback and recommendations to users on how they can reach their career
goals. Specifically, we use a series of binary-valued and nondecreasing latent states to
represent the expanding skill set of each user throughout their career and propose an
efficient inference method under our model. Using a series of experiments on two large
real-world datasets, we show that our model (sometimes significantly) outperforms
existing methods on the tasks of company, job title, and skill prediction. More
importantly, our model is interpretable and can be used for other important tasks
including skill gap identification and career path planning. Using a series of case studies,
we show that our model can provide i) actionable feedback to users and guide them
through their upskilling and reskilling processes and ii) recommendations of feasible
paths for users to reach their career goals.
Disadvantages: The paper does not provide any user feedback or case study to
demonstrate the effectiveness or usability of the proposed system .
In this paper the authors Ming Gao, Yonghan Luo, and Xiaonan Hu propose the
In this paper, the author Pornthep Rojanavasu proposes The education crisis is now
widely spread globally in terms of decreasing the number of students and decreasing
degree requirements for some jobs. Educational data mining (EDM) is recently interested
in the data mining area to discover useful knowledge in educational data to help educators
improve their administration planning and student services. This paper proposes applying
two data mining techniques to educational data. First, the association rule was applied to
admission data to find some knowledge for supporting admission planning. Second, a
decision tree was applied to course grades and job data of graduated students to predict
jobs after graduation. The results of these studies give good knowledge for admission
planning and job prediction.
Disadvantages: Limitation is related to the quality and quantity of the available data. Data
quality issues such as missing values or noise in the data can affect the accuracy of the
results generated by the system.
In this paper the author M. F. Lee, N.F. Mat Nawi & C.S. Lai propose Job
performance of an individual can be predicted through personal highlights. The
capability of predicting job performance is benefiting from all perspectives. Job
In this paper the author Soude Fazeli, and Enayat Rajabi proposes Almost all studies
on course recommenders in online platforms target closed online platforms that belong to
a University or other provider. Recently, a demand has developed that targets open
platforms. Such platforms lack rich user profiles with content metadata. Instead, they log
user interactions. We report on how user interactions and activities tracked in open online
learning platforms may generate recommendations. We use data from the Open U online
learning platform in use by the Open University of the Netherlands to investigate the
application of several state-of-the-art recommender algorithms, including a graph-based
recommender approach. It appears that user-based and memory-based methods perform
In this paper, the author Vishal Garg, Dr. Ritu Tiwari proposes The
amount of educational resources spread over the Internet is huge and diverse. Massive
Online Open Courses(MOOC) websites such as Coursera, Udacity, and EdX are gaining
momentum. In this knowledge exploration age, it becomes increasingly difficult for a
learner to identify a suitable course for him. This task can be tedious because it involves
accessing each platform, searching available courses, selecting some courses, read
carefully each course syllabus, and choosing appropriate content. To counter this
problem Recommendation Systems are used. This paper proposes an effective and
efficient MOOC Recommendation System based on Machine Learning. The proposed
system will be able to provide effective recommendations to learners on e-learning sites.
Disadvantages: It does not evaluate the effectiveness or accuracy of the proposed
system on real-world data or users.
10. Course Recommendation of MOOC with Big Data Support A Contextual Online Learning
Approach:
In this paper the authors Yifan Hou, Pan Zhou, and Jie Xu propose With the advent
of the big data era of MOOC, enrolled students and offered courses become numerous
and diverse, resulting in a large amount of data and complex curriculum relationships.
Thus how to recommend appropriate coursesto improve students’ learning outcomes has
become a daunting task. The state-of-the-art works ignore some significant features in
course recommendation of MOOC: heterogeneity of large-scale user groups, sequence
Disadvantages: The paper does not compare the naive Bayes algorithm with other
In this paper, the author YANG Jing proposes This article analyzes the contradictions
between teaching personalization and teaching standardization. By drawing upon the
concept of “mass customization” in the manufacturing industry, this article puts forward
the model of the mass customization of students’ personalized course systems based on
online courses. Mass customization is based on the modularization and standardization of
course resources, and the personalized course system is customized through the assembly
of course resource modules; learning resource modules are recommended to students in
the collaborative filtering mode, and the resource recommendation strategy is developed
based on modularization to help students quickly identify the course resources they are
interested in and complete the customization process.
14. Predicting the Suitability of IT Students’ Skills for the Recruitment in Saudi
Labor Market:
In this paper the author Raghad Munirah Alghamlas and Reham Alabduljabbar propose
In a technical world, knowledge of information technology (IT) skills is an important
aspect, especially in Saudi Arabia, with the development in the technical labor market.
This paper proposed a solution to examining the demand for recruitment in the Saudi
market, using data extracted from online job portals. By utilizing machine learning
algorithms to develop a web-based application that helps in predicting the suitability of
IT students’skillsfor the recruitment in Saudi labor market. In addition, this solution
offers general recommendations of the highest needed IT skills in Saudi Arabia in order
to overcome the unemployment problem.
Disadvantages: The paper does not specify which machine learning algorithms are used
or how they are evaluated. The paper does not address the ethical or social implications of
using data mining techniques for career guidance.
In this paper the author Yuxin Ren. Zhifang proposes In recent years, online
education technology has developed rapidly, and the market scale of online education
platforms is growing. However, with the widespread application and development of this
technology, the requirements of online education users are gradually becoming stricter. A
personalized course recommendation system can greatly improve the satisfaction and
learning efficiency of student users, making the education platform stand out from many
competitors. Based on the analysis of the advantages and disadvantages of the existing
learning resource recommendation system on the online education platform, this paper
proposes a new learning resource recommendation model based on a hybrid
Dept of IS&E, MITM 12
Course Recommendation and Skill Prediction Based on Career Choice 2023-2024
recommendation algorithm which includes a course recommendation submodule based
on statistics and personalized course recommendation submodule based on professional
training requirements. It can not only help users to find high-quality information that they
are interested in, save users' time and cost, but also effectively solve the problem of cold
boot. This paper inspires the improvement of the online education course
recommendation system.
Disadvantages: The paper does not specify which machine learning algorithms are used
or how they are evaluated. The paper does not address the ethical or social implications
of using data mining techniques for career guidance
In this paper, the author Jian-Wei Tzeng, Nen-Fu Huang, An-Chi Chuang, Ting-
Wei Huang, and Hong-Yi Chang proposes Massive open online courses (MOOCs) are
open online courses designed based on teaching progress. Videos and learning exercises
are used as learning materials in these courses, which are open to numerous users.
However, determining the prerequisite knowledge and learning progress of learners is
difficult. Based on learners’ online learning trajectory, we designed a set of practice
questions for a recommendation system for MOOCs, provided suitable practice questions
to students through the LINE chatbot (a type of social media software), and used mobile
devices to encourage participation in MOOCs. Reinforcement learning, which involves
reward function design and iterative solution improvement, was used to set task goals,
including those related to course learning and practice question difficulty. The proposed
system encouraged certain learning behaviors among students. Students who used the
system exhibited an exercise completion rate of 89.97%, which was higher than that of
students who did not use the system (47.23%). The system also increased the student's
overall learning effectiveness. Students who used and did not use the proposed system
exhibited average midterm scores of 64.73 and 58.21, respectively. We also collected 227
online questionnaires from students. The results of the questionnaires indicated that 90%
of the students were satisfied with the system and hoped to continue using it.
Disadvantages: The paper uses a small and limited dataset of 100 students and 20
17. Massive Open Online Courses (MOOCs) Recommendation Modeling using Deep
Learning:
19. A New Pedagogy and Online Learning System on Pandemic COVID 19 Era at
Islamic Higher Education:
In this paper the author Triyo Supriyatno, Facrul Kurniawan proposes The purpose
of the article is the analysis of Covid-19 impact on Islamic Higher Education (IHE). The
following research methods were used in the article: empirical methods, methods of
theoretical analysis, questionnaire, and interview. The approaches to online education in
IHE Indonesia have been analyzed. The results of the article have a few significant
implications for the science and practice of online education and its perspectives on the
IHE system. Prospects of online education for students have been offered during Covid-
19. The main recommendations are the following: to organize training courses of online
education methods for lecturers; to organize in-depth training courses of online education
methods for lecturers of non-pedagogical specialties; the university’s management should
provide constant monitoring of the satisfaction of students and lecturers of the online
education organization for the accumulation of statistical data in the dynamics.
Disadvantages: The paper does not specify which machine learning algorithms are used
or how they are evaluated. The paper does not provide any empirical evidence or user
feedback to support the effectiveness or usability of the proposed system.
In this paper, the author Hai-Hui Wang, Chalothon Chootong, Ankhtuya Ochirbat,
Worapot Sommool, W K T M Gunarathn, Timothy K. Shih proposes MOOCs bring us to
a higher education with the concept of flipped classrooms, where students make use of
the online studying materials such as online textbooks, video tutorials, and all sorts of
documents which may take in forms of a web page, online learning platform, educational
learning management systems. We see the astonishing potential of MOOCs in education.
However, there has always been a problem that existed in Taiwan that is also often
discussed. It is known as the gap between industry and education, which means that the
students who have graduated from universities, do not always have the skills that the
industries need. We find that in most cases, students will only have some skills or
knowledge about some tools that are listed from the requirements of the industries. The
students have plentiful self-studying resources from the internet, we hope to encourage
the students to learn and empower themselves by correctly recommending what are the
most required skills of their desired occupation. Therefore, this paper proposed a
clustering method that shows the results of groups of skills that are commonly needed for
a particular type of job.
Disadvantages: The paper does not specify which machine learning algorithms are used
or how they are evaluated. The paper does not address the ethical or social implications
of using data mining techniques for career guidance
Survey Findings
Our survey aimed to understand the preferences and career aspirations of participants to
tailor course recommendations and predict necessary skills accordingly. The survey was
distributed to a diverse group of individuals across different age groups, educational
backgrounds, and career interests.
Firstly, we analyzed the respondents' career choices. The majority expressed interest
in technology-related fields such as software development, data science, and artificial
intelligence. However, there was also a notable interest in traditional professions like
medicine, law, and education.
Next, we delved int the factors influencing their career decisions. Interestingly, salary
prospects emerged as a significant driver across all age groups, particularly among
younger participants. Job satisfaction, work-life balance, and opportunities for growth
were also cited as crucial considerations.
Chapter 3
SOFTWARE REQUIREMENT
SPECIFICATIONS
Stakeholders
Stakeholders of the project are:
Team Members
Project Guide
Customers
Project Reviewers
Faculty Department
College Management
Organization’s Officials
▪ Customers
▪ Customer Care Representative
▪ Admin
Functional Requirements
For this project, several functional requirements are crucial to ensure its
effectiveness and usability. they are:
Personalized Learning Path: Based on the user's profile and career aspirations, the
system should generate a personalized learning path that outlines the sequence of courses
and activities necessary to acquire the desired skills. This path should be dynamic and
adjustable to accommodate changes in the user's goals or skill development.
By fulfilling these functional requirements, this project can effectively empower users to
make informed decisions about their education and career paths, ultimately enhancing their
employability and professional success.
NON-FUNCTIONAL REQUIREMENTS
2. Reliability: Users should be able to rely on the system consistently. This entails
minimizing downtime, ensuring data integrity, and implementing robust error handling
mechanisms. The system should recover gracefully from failures and maintain
availability even during maintenance activities.
4. Usability: The system should be intuitive and easy to use for users with varying
levels of technical expertise. Clear and concise user interfaces should facilitate smooth
navigation and interaction. Additionally, the system should support accessibility
standards to accommodate users with disabilities.
By addressing these non-functional requirements, this project can ensure that the
system not only meets its functional objectives but also delivers a secure, reliable, and
user-friendly experience for its stakeholders.
System Requirements
Hardware Requirements
Software Requirements
1. Anaconda- It is an open source software available to us which enables us to easily codein using
python or R on different operating systems such as the windows, Linux, and Mc OS. It has millions
of users worldwide, and is well known as the industry which helps us in developing systems, testing
them, and training the machines. This further enables us to:
Manage all the imported libraries, their dependencies, and the environments of developing with
Anaconda.
In developing techniques to train our machine with TensorFlow, scikit-learn, etc.
Analyze the datasets and manipulate the with Dask, NumPy, pandas, and Numba
Visualize or plot the results with Matplotlib, Holoviews, Bokeh, and Datashader.
It also provides us with jupyter notebooks which has all the in-built libraries
embedded in thealready. This eases our coding stress, and also helps us code with
more efficiency
2. Python: The most abundantly used general level programming language. It is used for both a small scale
and big scale systems. It can easily be interpreted. It is said to support multiple programming paradigms. It
includes features of procedural, object-oriented, and functional programming together. It is already garbage-
collected which makes it more efficient.
4. Pandas- It is a library that is written in python language. It helps us with the analyzing of data. It
also provides us with tools and functions to manipulate a large amount of data.
5. Sklearn- It is a library used in machine learning in python programming language. It mainly helps
us with the classification of data, regression of models, and in clustering algorithms. These
algorithms include SVM, random forest etc.
6. Tensorflow- It is a highly know open-source software, which is free and available to all. It is used
for differentiable programming with a large number of tasks. It is used against large data sets
to help us with the dataflow, and its manipulation. It is a basically a math library with various
feature.
Chapter 4
SYSTEM DESIGN AND ANALYSIS
System Analysis
Through the integration of advanced data analytics and machine learning algorithms, this system
aims to analyze career preferences, market trends, and individual skill sets to offer tailored
recommendations for courses and training programs. By leveraging predictive modeling
techniques, it further anticipates future skill demands, empowering users to make informed
decisions about their educational and professional endeavors. This analysis will delve into the
architectural design, functionality, and potential impact of such a system, outlining its role in
facilitating career development and lifelong learning in the digital age.
SYSTEM DESIGN
Systems design is the process of defining the architecture, modules, interfaces, and
data for a system to satisfy specified requirements. Systems design could be seen as the
application of systems theory to product development. There is some overlap with the
disciplines of systems analysis, systems architecture and systems engineering.
The Purpose of this design document is to explore the logical view of architecture design,
data flow diagrams, sequence diagram and an overview of the proposed system for
performing the operations such as signature capturing, pre-processing, feature extraction
Dept of IS&E, MITM 25
Course Recommendation and Skill Prediction Based on Career Choice 2023-2024
and validation which when combined to give the desired output. The design activity
module consist three outputs.
Architecture Design
System Architecture
The diagram portrays a dynamic process wherein learner data and feedback iteratively
enhance course recommendations through a deep reinforcement learning model, aiming for
more effective and personalized learning experiences.
1. User ID:
The process begins with the user providing their unique identification (User ID) to the
recommendation system.
2. Model-Based Collaborative Filter:
This step represents one of the recommendation techniques used by the system.
Collaborative filtering involves analyzing user behavior (such as ratings, preferences, and
interactions) to recommend items.
The model-based approach uses mathematical models to predict user preferences based on
patterns observed in historical data.
It may involve matrix factorization, latent factor models, or other techniques.
3. Movies:
The system considers movie-related data, such as movie attributes (genre, director, actors),
ratings, and user interactions.
Collaborative filtering algorithms compare user preferences with those of similar users to
recommend movies.
4. Content-Based Filter:
Content-based filtering focuses on the characteristics of items (in this case, movies) rather than
user behavior.
It analyzes movie features (e.g., genre, plot, keywords) and recommends similar movies based
on these features.
Content-based filtering is useful when collaborative data is sparse or when personalized
recommendations are needed.
5. Optimal Result:
The final output is the optimal recommendation for the user.
It combines the results from both collaborative filtering and content-based filtering.
The system aims to provide personalized movie recommendations that align with the user’s
preferences.
Use case diagram is a graph of actors, a set of use cases enclosed by a system boundary,
communication associations between the actor and the use case. The use case diagram describes how
a system interacts with outside actors
2) The training data is 70% and is given supervised inputs and outputs.
3) The testing data is 30% and shows us how well the system is trained.
4) The dataset we have chosen consists of 13 attributes according to which various algorithms
perform their calculations and approximations.
5) The system starts with first pre processing of the dataset we have fed to it.
6) It studies and analysis it, and then applies the required machine learning algorithm.
7) If it finds that the dataset is supervised, it will separate it into training data and testing data.
13) We compare the accuracies of all the algorithms and the algorithm that gives the highest
accuracy is the one, which is chosen for prediction, eventually.
14) In our system, the algorithm that gets the highest accuracy rate is DNN.
CHAPTER 5
IMPLEMENTATION
The Proposed System Make Use Different Algorithms and Methods for the
implementation of Hybrid Approach.
Formula:
Singular Value Decomposition (SVD):Let A be an n*d matrix with singular vectors v1, v2, .
. . , vr and corresponding singular values σ1, σ2, . . . , σr. Then ui = (1/σi )Avi , for i = 1, 2, . .
. , r, are the left singular vectors and by Theorem 1.5, A can be decomposed into a sum of rank
one matrices A.
We first prove a simple lemma stating that two matrices A and B are identical if
Av = Bv for all v. The lemma states that in the abstract, a matrix A can be viewed as a
transformation that maps vector v onto Av.
Methodology
Deep learning: (also known as deep structured learning) is part of a broader family
of machine learning methods based on artificial neural networks with representation
learning. Learning can be supervised, semi-supervised or unsupervised.[1][2][3]
The adjective "deep" in deep learning comes from the use of multiple layers in the
network. Early work showed that a linear perceptron cannot be a universal classifier, and
then that a network with a nonpolynomial activation function with one hidden layer of
unbounded width can on the other hand so be. Deep learning is a modern variation which is
concerned with an unbounded number of layers of bounded size, which permits practical
application and optimized implementation, while retaining theoretical universality under
mild conditions. In deep learning the layers are also permitted to be heterogeneous and to
deviate widely from biologically informed connectionist models, for the sake of
efficiency, trainability and understandability, whence the "structured" part.
In react.js we used axios npm module to fetch the data from the api that is generated from
flask. The front-end technology stack plays a pivotal role in shaping the user experience and
facilitating seamless interaction with the system. At its core, HTML (HyperText Markup
Language) forms the foundation of the front-end, providing the structural framework for web
pages. It delineates the layout of content elements, ensuring clarity and organization in
presenting information to users.
Complementing HTML, CSS (Cascading Style Sheets) comes into play to enhance the visual
appeal and aesthetics of the user interface. CSS allows for the customization of fonts, colors,
layouts, and other stylistic attributes, thereby creating a cohesive and visually pleasing design
that aligns with modern web standards and user expectations.
In the realm of front-end frameworks, options like React.js or Vue.js offer powerful tools for
building interactive and component-based UIs. These frameworks streamline the development
process by promoting code reusability, efficient state management, and seamless integration
with other libraries or APIs. By adopting such frameworks, developers can accelerate the pace
of development while maintaining a high level of flexibility and scalability.
Moreover, for ensuring consistency and responsiveness across different devices and screen
sizes, front-end frameworks like Bootstrap or Material-UI provide a rich set of pre-designed
UI components and responsive layout systems. These frameworks empower developers to
create adaptive and visually appealing interfaces that deliver a seamless user experience
regardless of the user's device.
In summary, the front-end technology stack for this project encompasses HTML, CSS,
JavaScript, and relevant frameworks/libraries. Together, these technologies enable the
creation of an intuitive, visually appealing, and interactive user interface that empowers users
to explore career options, receive personalized recommendations, and gain insights into future
skill requirements with ease and efficiency.
For backend we have use flask app to generate a local host api the resultant api is fetch in front to display
the result. We have developed our machine learning model in python .
By using flask, we generate resulting api which stores the data in the form of json format these
data is retrieved in react by using axios npm mode and then displaying the data
The choice of back-end technology is crucial in ensuring the efficient processing of data,
managing user accounts, and delivering personalized recommendations. At the heart of the
back end lies the server-side programming language, which handles the logic and functionality
behind the scenes.
One popular option for server-side development is Python, known for its simplicity,
versatility, and extensive ecosystem of libraries and frameworks. Frameworks like Django or
Flask provide robust tools for building scalable web applications, handling user
authentication, and interfacing with databases, which are essential components for a system
focused on course recommendation and skill prediction.
Another noteworthy technology in the back-end arsenal is the use of databases to store and
manage data efficiently. Relational databases like PostgreSQL or MySQL are commonly
employed for their ability to organize data into structured tables and perform complex queries.
Additionally, NoSQL databases such as MongoDB offer flexibility in handling unstructured
or semi-structured data, which could be beneficial for accommodating diverse user profiles
and preferences.
In the context of machine learning and predictive analytics, libraries like TensorFlow or
scikit-learn in Python provide powerful tools for building and deploying predictive models.
These libraries enable the system to analyze user data, predict future skill demands, and
generate personalized course recommendations based on career choices and individual
profiles.
Moreover, the back end often involves integrating with external APIs or services to enrich the
system's functionality. APIs provided by job boards, educational platforms, or professional
networks can supply valuable data on job listings, course offerings, and industry trends, which
can inform the recommendation and prediction algorithms.
Finally, considerations for scalability, security, and performance are paramount in the
selection of back-end technologies. Cloud platforms like Amazon Web Services (AWS),
Microsoft Azure, or Google Cloud Platform (GCP) offer a range of services for hosting,
scaling, and securing web applications, ensuring reliability and responsiveness even under
heavy user loads. Together, these technologies form a robust foundation for building a
dynamic and intelligent system that empowers users in their career development journey.
Chapter 6
TESTING
Testing is the process of evaluating a system or its components with the intent to
find that whether it satisfies the specific requirement or not. This activity results in actual,
expected and the difference between their results. In simple words testing is executing a
system in order to identify any gaps, errors or missing requirements in contrary to the actual
desire or requirements.
Testing is the practice of making objective judgments regarding the extent to which
the system (device) meets, exceeds or fails to meet stated objectives.
A good testing program is a tool for agency and integrator supplier it identifies the
end of the “Development” phase of the project, establishes the criteria for project
acceptance, and establishes the start of the warranty period.
Types of Test
There are many types of testing which can be carried out. Few of the testing types are: 1.
Unit testing 2. Integration testing 3. Functional testing 4. System Testing
Unit Testing: Unit testing involves the design of test cases that validate that the internal
program logic is functioning properly, and that program inputs produce valid outputs. All
decision branches and internal code flow should be validated. It is the testing of individual
software units of the application.
It is done after the completion of an individual unit before integration. This is a structural
testing, that relies on knowledge of its construction and is invasive. Unit tests perform basic
tests at component level and test a specific business process, application, and/or system
Integration Testing: Integration tests are designed to test integrated software components
to determine if they actually run as one program. Testing is event driven and is more
concerned with the basic outcome of screens or fields. Integration tests demonstrate that
although the components were individually satisfaction, as shown by successfully unit
testing, the combination of components is correct and consistent. Integration testing is
specifically aimed at exposing the problems that arise from the combination of
components. Software integration testing is the incremental integration testing of two or
more integrated software components on a single platform to produce failures caused by
interface defects. The task of the integration test is to check that components or software
applications, e.g. components in a software system or – one step up – software applications
at the company level – interact without error.
Acceptance Testing: User Acceptance Testing is a critical phase of any project and
requires significant participation by the end user. It also ensures that the system meets the
functional requirements.
Since our project is Online course recommendation system .one can develop a Online
course recommendation system by using either content based or collaborative filtering or
combining both.
In our project we have developed a hybrid approach i.e combination of both content and
collaborative filtering .Both the approaches have advantages and dis-advantages
.in content based filtering the it based on the user ratings or user likes only such kind of
Online course will recommended to the user.
Dis-advantages: the model can only make recommendations based on existing interests of
the user. In other words, the model has limited ability to expand on the users' existing
interests.
Dis-advantages: The prediction of the model for a given (user, item) pair is the dot product
of the corresponding embeddings. So, if an item is not seen during training, the system can't
create an embedding for it and can't query the model with this item. This issue is often
called the cold-start problem.
The hybrid approach will resolves all these limitations by combining both content and collaborative
filtering
SNAPSHOTS
FUTURE ENHANCEMENT
In the proposed approach, It has considered Genres of Online courses but, in future we can
also consider age of user as according to the age Online course preferences also changes,
like for example, during our childhood we like animated Online courses more as compared
to other Online courses. There is a need to work on the memory requirements of the
proposed approach in the future. The proposed approach has been implemented here on
different Online course datasets only. It can also be implemented on the Film Affinity and
Netflix datasets and the performance can be computed in the future.
CONCLUSION
In this project, to improve the accuracy, quality and scalability of Online course
recommendation system, a Hybrid approach by unifying content based filtering and
collaborative filtering; using Singular Value Decomposition (SVD) as a classifier and
Cosine Similarity is presented in the proposed methodology. Existing pure approaches and
proposed hybrid approach is implemented on three different Online course datasets and the
results are compared among them. Comparative results depicts that the proposed approach
shows an improvement in the accuracy, quality and scalability of the Online course
recommendation system than the pure approaches. Also, computing time of the proposed
approach is lesser than the other two pure approaches.
BIBILOGRAPHY
[1] K. K. Chandriah and R. V. Naraganahalli, “RNN/LSTM with modified Adam
optimizer in deep learning approach for auto◻mobile spare parts demand
forecasting,” Multimedia Tools and Applications, vol. 80, no. 17, pp. 26145–26159,
2021.
[3] WideResNet with Joint Representation Learning and Data Augmentation for
Course recommendation Identification Shichao Hu1, Bin Zhang1 , Jinhong Lu2 ,
Yiliang Jiang1 , Wucheng Wang1 , Lingcheng Kong1 , Weifeng Zhao1 , Tao Jiang.
[6] Chuang AC, Huang NF, Tzeng JW, Lee CA, Huang YX, Huang HH (2021)
MOOCERS: Exercise recommender system in MOOCs based on reinforcement
learning algorithm. In: 8th International Conference on Soft Computing & Machine
Intelli◻gence (ISCMI), Cario, Egypt, 2021, pp 186-190.6. Y. Yang, Y. Zhu, and Y.
Li, “Personalized recommendation with knowledge graph via dual#autoencoder,”
Applied Intelli◻gence, vol. 52, no. 6, pp. 6196–6207, 2022.
[9] Wang M, Peng J, Cheng B, Zhou H, Liu J (2011) Knowledge visualization for
self#regulated learning. EducTechnol Soc 14(3):28–42.
[10] Chuang AC, Huang NF, Tzeng JW, Lee CA, Huang YX, Huang HH (2021)
MOOCERS: Exercise recommender system in MOOCs based on reinforcement
learning algorithm. In: 8th International Conference on Soft Computing & Machine
Intelli◻gence (ISCMI), Cario, Egypt, 2021, pp 186-190.
[13] Patel PG, Carver N, Rahimi S (2011) Tuning computer gaming agents using
q#learning. In: 2011 federated conference on com◻puter science and information
systems (FedCSIS). IEEE, pp 581–588.
[20] Wei X, Saab N, Admiraal W (2023) Do learners share the same perceived
learning outcomes in MOOCs? Identifying the role of motivation, perceived learning
support, learning engagement, and self-regulated learning strategies. Internet Higher
Educ 56:100880
I. INTRODUCTION
Agriculture is the art and science of cultivating plants and
livestock. It is one of the major sectors of the Indian
economy as it contributes about 18% to the total GDP and
provides employment to over 60% of the population and is
present in the country for thousands of years. Over 75 per
cent of the rural households depend on agriculture.
Nowadays there is a tremendous loss in quality and quantity
of food crops yield because of various diseases affecting the
plant. For this purpose proper steps should be taken to rescue
the plants from disease and to increase the yield. Early
detection of these diseases can allow to take preventive
measures and mitigate economic and production losses. A
leaf is one of the most important part of a plant responsible
for photosynthesis. So, detecting the disease through leaf
will be one of the primary sectors in finding the disease.
So the objective of this work is to develop a software, that is
effective and error-free disease detection system for plant.
To overcome the leaf disease, a software in been developed
particularly for three species i.e., Potato, Capsicum and
Strawberry which automatically detect the disease using the
image of the leaf and also provide the remedies for those
diseases
IV. METHODOLOGY
Convolutional Neural Network: (CNN) is a Deep Learning
algorithm which can take in an input image, and process
them to classify the given image. The pre-processing required
in a CNN is much lower as compared to other algorithms and
Image Processing techniques. CNN is capable of applying
many filters automatically It mainly consist of four layers:
Convolution layer: In this layer series of
mathematical operations are performed to extract the VGG19 Architecture
feature map of the input image. We have used plant village dataset consisting of 3 crops
namely Pepper bell, Strawberry, Potato. The dataset is
Pooling layer: It is mainly used to reduce the size of
divided into 80% train and 20%test.5124 images were used
the output matrix from the convolution layer. There
to train the model and 1226 images for testing. The images
are 3 different types of pooling: Max, Min, Average
pooling. selected from dataset is cropped to the size of 224X224 each
input image matrix is been convoluted and reLU activation
Flattening layer: is converting the data into a 1-
function is been applied. The fully connected of VGG19 is
dimensional array for inputting it to the next layer.
modified as per our project requirements. The fully
The output from the convolutional layers are
connected layer is composed of 2 hidden layers consisting of
flattened to get a single feature vector.
512 and 256 neurons respectively, Adam is used as optimizer
Fully connected layer: The output from the as it converges fast and a batch size of 32 was used during
flattening layer is fed into the fully connected layer. training of the model. The output layer is multi neuron
Recognition and classification is performed in this output which gives out the final results. The system consists
layer. of 3 levels of classification. At the first stage the given leaf
input is classified to be healthy or unhealthy, if the result is
The system uses transfer learning technique i.e., VGG19
classified at healthy the further processing is not done else
model has described below
the image is passed to the second level of classification
VGG19 is a variant of VGG model which in short where the type of crop is identified at the last stage of
consists of 19 layers that includes 16 convolution classification. The final disease have been outputted. Based
layers, 3 Fully connected layer, 5 MaxPool layers on the type of disease identified the remedies for the same is
and 1 SoftMax layer. There are other variants of given out.
VGG like VGG11, VGG16 and others.
A fixed size of (224 * 224) RGB image was given
as input to this network which means that the matrix
was of shape (224,224,3).
They subtracted the mean RGB value from each
pixel, which was the only preprocessing which was
done and computed over the whole training set.
Used kernels of (3 * 3) size with a stride size of 1
pixel, this enabled them to cover the whole notion of
the image.
Spatial padding was used to preserve the spatial
resolution of the image.
Max pooling was performed over a 2 * 2 pixel
windows with stride 2. Data flow diagram
This was followed by Rectified linear unit(ReLu) to
introduce non-linearity to make the model classify
better and to improve computational time as the
previous
models used tanh or sigmoid functions this proved
much better than those.
It was constructed with three fully connected layers
from which first two were of size 4096 and the next
layer with 1000 channels for 1000-way ILSVRC
classification and the final layer is a softmax
Dept of IS&E, MITM
function. 54
System Architecture
Course Recommendation and Skill Prediction Based on Career Choice 2023-2024
V. EXPERIMENTAL RESULTS further hence, we have come with the system that
The snapshots of the results are shown below. The system is automatically detects the disease affected to a plant with the
capable of detecting the diseases with 95% accuracy. If the help of the leaf image. The symptoms of the disease appears
system is given with the crop that is not trained it will show a on the leaf region and hence taken as an input for further
message stating that the given crop was not trained. The detection and classification. These images are classified
Fig(a) shows the three level classification for an given potato using CNN and transfer learning methods by using VGG19
leaf as input. Fig(b) shows the description and remedy architecture. The system is capable of finding disease with
associated with that disease. Fig(c) shows first level of 95% accuracy. And shows three level classification if it is
classification since it is a healthy image. detected as unhealthy and one level of classification if it’s
healthy.
ACKNOWLEDGEMENT
We are indeed grateful to many groups of people of who
have helped us with various aspects of this study. We would
like to thank Prof Smithashree K P as well as special thanks
for our head of the department Dr Sharath Kumar Y H for
providing us the opportunity to do this amazing project on
“Plant leaf disease detection”.
Fig(a)
REFERENCES
[1] Melike Sardogan, Adem Tuncer, Yunus
Ozen, Plant Leaf Disease Detection and
Classification based on CNN with LVQ
Algorithm, 3rd International
Conference on Computer Science and
Engineering, IEEE, 2018.
[2] Jia Shijie, Jia Peiyi, Hu Siping, Liu
Haibo, Automatic Detection of Tomato
Diseases and Pests Based on Leaf
Images, 2017, IEEE.
[3] Eftekhar Hossain, Md. Farhad Hossain
and Mohammad Anisur Rahaman, A
Color and Texture Based Approach for
the Detection and Classification of
Plant Leaf Disease Using KNN
Classifier, 2019 International
Conference on Electrical, Computer and
Fig(b) Communication Engineering (ECCE),
7-9 February, 2019.
[4] Neha G. Kurale, Madhav V. Vaidya,
Classification of Leaf disease using
Texture Feature and Neural Network
Classifier, IEEE, 2018.
[5] Trimi Neha Tete, Sushma Kamlu,
Detection of Plant Disease Using
Threshold, K- Mean Cluster and ANN
Algorithm, 2017 2nd International
Conference for Convergence in
Fig(c) Technology (I2CT), IEEE.
[6] Robert G. de Luna, Elmer P. Dadios,
VI. CONCLUSION Argel Bandala, Automated Image
Dept
Detection of of
theIS&E,
plant MITM
disease at its early stages proves to be 55
Capturing System for Deep Learning-
one of the best method to avoid the spread of the disease based Tomato Plant Leaf Disease
Detection and Recognition, Proceedings
of TENCON 2018 - 2018 IEEE Region
Course Recommendation and Skill Prediction Based on Career Choice 2023-2024
10 Conference
(Jeju, Korea, 28-31 October 2018).