Project Presentation
Project Presentation
Project Presentation
Prediction
More businesses are moving online these days, and consumers are ordering online
instead of traveling to the store to buy. Zomato and Swiggy are popular online
platforms for ordering food products. Other examples are Uber Eats, Food Panda, and
Deliveroo, which also have similar services. They provide food delivery options.
If the order is complete, a partner will pick up and deliver the meal to the given
address via a delivery service. In online food-ordering businesses, delivery time is
critical. As a result, estimated food delivery time prediction to reach the buyer’s
location is critical.
Objectives
1. What factors can affect the time in delivering food from the restaurant to the destination location?
2. How much food delivery time prediction accuracy performance?
Data Preparation
General Info
Data Preparation
Distance Calculation
To get the time needed to deliver food, the distance between
the restaurant and the delivery location is needed. To get the
required distance, we use geodesic function which can be
used to calculate the distance between 2 locations by
utilizing longitudes and latitudes.
Geodesic Function
Library- geopy.distance.geodesic
Exploratory Data Analysis
Univariate Analysis
There is consistent relationship There is a linear relationship between the time taken There is an inverse linear relationship
between the time taken and the to deliver the food and the age of the person who between the time taken to deliver the food
distance travelled to deliver the delivering the food. It looks like person with the young and the delivery person ratings. It looks
food. It looks like majority food age able to take less time than person with old age to like person with the higher ratings take
delivered within 25-27 minutes deliver the food to customers a less time to deliver the food than person
regardless of distance with low ratings
Exploratory Data Analysis
Bivariate Analysis
It looks like there is not much difference between the time taken depending on the vehicle they are driving and the type of food
they are delivering
Data Pipeline Architecture
1. AWS Instance Creation:
• Provision an Amazon EC2 instance to serve as the infrastructure for our data pipeline.
3. S3 Bucket Creation:
• Set up an Amazon Athena database to query data stored in the S3 bucket using SQL-like queries.
• Define tables and partitions to optimize query performance.
6.Power BI Connection through ODBC:
• Install and configure the Athena ODBC driver on the EC2 instance.
• Set up an ODBC data source to connect Power BI to Amazon Athena.
Feature Engineering
Parameters to be tuned
Best parameter
Prediction on N e w Data
Conclusion and Recommendation
Conclusion
1. Rating of person in previous delivers become is the most influential factor on the delivery time of food to the
destination location. Person with the higher ratings take a less time to deliver the food than person with low ratings
2. Model has R M S E score 7.28 and that means error between delivery time prediction and delivery time actual is 7.28 minutes
Recommendation
The rating obtained by the deliveryman is a representation of the deliveryman's performance in delivering food to the intended
location in terms of delivery time. Of course this is a potential loss of customers if this continues to happen. Delivery time
performance needs to be maintained so that the rating obtained is high and customer trust can still be maintained.
The R M S E value of the model can be used as a guarantee of delivery time performance which can be given to the customer so that
as much as possible the delivery time is not more than the existing R M S E score (delay in delivery time of not more than 7.28
minutes).
fooddeliveryx456