Ali Docs
Ali Docs
Ali Docs
2023-2024
1|P a ge
CERTIFICATE
Exam Seat No:
2|P a ge
DECLARATION
3|P a ge
ACKNOWLEDGEMENT
4|P a ge
Table of Contents/Index Page
1. Introduction 7-11
2. System Requirements 12
References/Bibliography
5|P a ge
ABSTRACT
6|P a ge
1. Introduction
8|P a ge
1.1 : - Scope and Purpose /Working of the proposed
project
Scope: -
A movie recommendation system’s scope includes
implementing diverse recommendation algorithms, data
collection, and user profiling. It encompasses user
interface design, privacy measures, scalability
considerations, and feedback mechanisms. The system
should provide accurate, diverse, and engaging movie
suggestions while aligning with business objectives,
ensuring continuous improvement, and adhering to
security and privacy standards.
Purpose: -
The purpose of a movie recommendation system
is to enhance the user experience by offering personalized
movie suggestions that match individual preferences,
fostering increased user engagement, content
consumption, and platform loyalty. It drives revenue
growth through content discovery, optimizes resource
allocation, and provides valuable data-driven insights.
9|P a ge
1.2 : - Advantages of project
11 | P a g e
2. System Requirements:
-Hardware requirements
-Software requirements
1. Python
2. Jupiter Notebook Chrome
3. PyCharm
12 | P a g e
3. Algorithms /Methodology Used:
Content-based filtering.
NLTK: -
13 | P a g e
Porter Stemmers: -
Count Vectorizer: -
Cosin Similarity: -
To recommend movies to a user, the systemcalculates
the cosine similarity between the user's profile vector and the
vectors representing the moviesin the database.
The cosine similarity between two vectors, A and B, is
calculated using the following formula:
14 | P a g e
Cosine Similarity (A, B) = (A • B) / (||A|| * ||B||)
--- A • B represents the dot product of vectors A and B.
--- ||A|| and ||B|| represent the magnitudes (lengths)
of vectors A and B.
Stream lit: -
15 | P a g e
enumerate: -
In content-based recommendation systems,
enumerate is used to create an index for items, making it
easier to retrieve item attributes or features for similarity
calculations. It allows efficient mapping between item
identifiers and their corresponding data. User profiles are
often represented as vectors. enumerate helps create user
profile vectors with unique indices for eachitem, reflecting
the user's interactions or preferences.
pickle: -
The pickle module implements binary protocols
for serializing and de-serializing a Python object structure.
“Pickling” is the process whereby a Python object hierarchy
is converted into a byte stream, and “unpickling” is the
inverse operation, whereby a byte stream (from a binary file
or bytes-like object) is converted back into an object
hierarchy. Pickling (and unpickling) is alternatively known as
“serialization”, “marshaling,” 1, or “flattening”; however, to
avoid confusion, the terms used here are “pickling” and
“unpickling”.
16 | P a g e
Requests: -
17 | P a g e
4. Output and Result analysis /Dataset analysis
Dataset Description: -
A movie recommendation dataset contains movie
details (ID, title, genre, etc.), user interactions (ratings,
timestamps), user profiles (demographics, preferences),
and optional metadata (keywords, reviews). It's used to
build recommendation systems, suggesting movies to users
based on their behavior and movie attributes like genre and
cast.
Features (Attributes):
The dataset includes details such as budget,
Genres, homepage, id, keywords, original language, original
title, Overview, popularity, runtime, spoken language,
status, crew, director, title, vote average, vote count,
tagline, and cast.
Target Variable:
The target variable is "Title" denoting the name of a
recommended set of movies.
Dataset Size:
It comprises a substantial number of movie
records, ranging from several hundred to several thousand
entries, sourced from online listings, automotive websites,
and government databases, OTT platforms.
18 | P a g e
Data Preparation:
Before building a model, data pre-processing is
required in which we extract the relevant features fromthe
dataset and, in cleaning replace the null values withnull
strings. After that converting the text data to feature vectors
or converting text data to numerical data.
Purpose:
A movie recommendation system aims to enhance
the user experience by offering personalized movie
suggestions that match individual preferences, fostering
increased user engagement, content consumption, and
platform loyalty. It drives revenue growth through content
discovery, optimizes resource allocation, and provides
valuabledata-driven insights.
19 | P a g e
-Result/Output Snapshots
1) Libraries Used: –
20 | P a g e
2) Dataset used: –
21 | P a g e
22 | P a g e
Data header: -
23 | P a g e
Checking rows and columns in the dataset: -
24 | P a g e
Removing the null values: -
25 | P a g e
Adding a tags column: -
26 | P a g e
3) Getting the similarity score: -
27 | P a g e
4) Recommended List of movies(output):-
28 | P a g e
5. Conclusion and Recommendations.
29 | P a g e
6. Future Enhancement /Further steps
30 | P a g e
-----References/Bibliography:
31 | P a g e
6. Ghazanfar M.A., Prugel-Bennett A. A scalable, accurate
hybrid recommender system; Proceedings of the 2010 Third
International Conference on Knowledge Discovery and Data
Mining; Washington, DC, USA. 9–10 January 2010. [Google
Scholar]
7. Deldjoo Y., Elahi M., Cremonesi P., Garzotto F., Piazzolla P.,
Quadrana M. Content-Based Video Recommendation
System Based on Stylistic Visual Features. J. Data Semant.
2016;5:99– 113. doi: 10.1007/s13740-016-0060-9.
[CrossRef] [Google Scholar]
32 | P a g e