Final Project
on
Design and develop a system for Exploratory analysis of
Geolocational Data in Python
Submitted to the Department of Computer Science and Engineering
In partial fulfilment of the requirements
For the degree of
Bachelor of Technology
In
by
MADHAV SHARMA
(2200140100054)
SARTHAK YADAV
(2200140100094)
SHIVANSH GUPTA
(2200140100101)
Group No. 14
Guided By
We would like to express our special thanks to our mentor, Ms. Anu Saxena, for the time and
effort she provided throughout the year. Her useful advice and suggestions were really
helpful to us during the project’s completion. For this, we are eternally grateful to her.
We would like to acknowledge that this project was completed entirely by us and not by
anyone else.
Signature………………………………… Signature………………………………
Name…………………………………….. Name…………………………………..
Date…………………………………….. Date…………………………………
Signature…………………………………
Name……………………………………..
Roll No…………………………………..
Date……………………………………..
Abstract
This project uses K-Means Clustering to find the best accommodation for migrants by
classifying accommodations on the basis of their preferences regarding facilities, budget and
proximity to the location. It fetches, cleans and analyses geolocational data and runs K-Means
Clustering on it to recommend accommodations to immigrants to a city.
Keywords: Data, Dataset, Recommendation, Map
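As a rough illustration of the clustering step described in the abstract, the following minimal sketch runs K-Means (Lloyd's algorithm) in plain Python on a few made-up accommodation records. The features (rent and distance to the target location), the sample values and the `k_means` helper are assumptions for illustration, not the project's actual data or code. In practice a library implementation such as scikit-learn's `KMeans` would typically be used, with the features scaled first.

```python
import math

def k_means(points, k, iterations=20):
    """Minimal Lloyd's algorithm; the first k points seed the centroids."""
    centroids = [list(p) for p in points[:k]]
    labels = [0] * len(points)
    for _ in range(iterations):
        # Assignment step: attach each point to its nearest centroid.
        labels = [
            min(range(k), key=lambda c: math.dist(p, centroids[c]))
            for p in points
        ]
        # Update step: move each centroid to the mean of its members.
        for c in range(k):
            members = [p for p, lab in zip(points, labels) if lab == c]
            if members:  # keep the old centroid if a cluster is empty
                centroids[c] = [sum(dim) / len(members) for dim in zip(*members)]
    return labels, centroids

# Hypothetical accommodations: (monthly rent in thousands, distance in km).
accommodations = [
    (8.0, 1.0), (9.0, 1.5), (8.5, 0.8),     # cheaper and closer
    (20.0, 6.0), (22.0, 7.0), (21.0, 6.5),  # costlier and farther
]
labels, centroids = k_means(accommodations, k=2)
```

Each label identifies the group an accommodation falls into, so a migrant's preferences can be matched against the centroid of each group rather than against every record.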
TABLE OF CONTENTS
1 Acknowledgement
2 Abstract
3 Introduction of Project
5 Way of approach
6 Project stages
Reference
3. Introduction of Project
This project endeavors to design and develop a comprehensive system for exploratory
analysis of geolocational data using the versatile programming language Python. By
leveraging Python's rich ecosystem of libraries and tools, we aim to create a platform that
empowers users to extract actionable insights from geospatial datasets with ease and
efficiency.
The system will be tailored to cater to the diverse needs of users across different domains,
including urban planners seeking to optimize city infrastructure, environmental scientists
monitoring ecological changes, and businesses looking to understand consumer behavior
based on spatial dynamics.
Through this endeavor, we aim to democratize access to geospatial analysis tools, enabling
researchers, analysts, and decision-makers to unlock the hidden potential of geolocational
data and make informed choices that shape our world. From data import and cleaning to
advanced spatial analysis techniques and visualization, this system will provide a seamless
and intuitive user experience, paving the way for data-driven insights that drive positive
change and innovation.
3.1 Problem statement
Users feel uneasy when their location data is collected without their knowledge or
consent.
Location services are limited by the user's device and internet connection.
Location data is affected by a variety of factors, including businesses opening or
closing, people relocating, devices changing hands, and so on. Inaccurate data can
lead to poor business decisions, wasted resources, and reduced customer satisfaction,
so its accuracy needs to be improved.
Location data can be complex, and it may be difficult to integrate with other types of
data or into a product without requiring advanced analytics and data processing tools
to derive insights.
Location-based app development can benefit from ready-to-use map APIs provided by
Google Maps, Mapbox, TomTom or other providers. It is important to carefully
compare the pricing options these providers offer, as the cost of using each
geolocation feature varies from one API provider to another.
GPS
The Global Positioning System was originally developed for military navigation, but
nowadays anyone with a GPS device can receive the radio signals that GPS satellites
broadcast. This global satellite system provides geolocation and time information to a
GPS receiver almost anywhere on Earth, provided there are no obstacles and at least four
GPS satellites are in view.
A big plus of GPS is its accuracy. It can locate a receiver to within about five meters,
or even better with dual-band GPS receivers. The accuracy depends on many factors,
and it is also important to take into account the fix time, that is, the time it takes to
determine a position.
Another advantage is that GPS works everywhere outdoors and there is no specific
infrastructure required. The downside is that…
Bluetooth Low Energy
Bluetooth is a wireless short-range communications technology standard, designed
mainly for communicating over short distances. The signals do not carry very far:
even in optimal circumstances, devices need to be within 100 meters of each other.
Although Bluetooth has been around for two decades, its low-power variant, Bluetooth
Low Energy (BLE), is making big strides in geolocation and positioning.
Wi-Fi positioning
Wi-Fi positioning taps into wireless local area networks (WLANs), which are
networks of devices that communicate on a specific radio frequency band, usually
2.4 GHz or 5 GHz. A Wi-Fi device transmits signals over a range of up to one hundred
meters, which means Wi-Fi can cover both indoor and outdoor sites. A tracking device
will sniff for nearby Wi-Fi access points (APs). By determining a unique identifier of
the APs, the MAC address for example, a position can be determined. Local or public
databases provide the link between observed MAC addresses and geolocation.
Tracking devices only sniff for Wi-Fi signals; they do not have to connect to the
Wi-Fi network. Wi-Fi positioning can therefore also harness Wi-Fi networks that you
don’t own or can’t access. For instan…
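The MAC address lookup described for Wi-Fi positioning can be sketched as follows. This is a minimal illustration, assuming a small in-memory table `AP_DATABASE` that maps access-point MAC addresses to coordinates. The MAC addresses, coordinates and the `estimate_position` helper are hypothetical, and averaging the positions of the matched APs is only one simple way to turn matches into a position estimate.

```python
# Hypothetical local database: AP MAC address -> (latitude, longitude).
AP_DATABASE = {
    "00:11:22:33:44:55": (28.6700, 77.4500),
    "66:77:88:99:aa:bb": (28.6704, 77.4506),
    "cc:dd:ee:ff:00:11": (28.6702, 77.4497),
}

def estimate_position(observed_macs):
    """Average the coordinates of the observed APs found in the database.

    Returns None when none of the observed APs is known.
    """
    known = [
        AP_DATABASE[mac.lower()]
        for mac in observed_macs
        if mac.lower() in AP_DATABASE
    ]
    if not known:
        return None
    lat = sum(p[0] for p in known) / len(known)
    lon = sum(p[1] for p in known) / len(known)
    return lat, lon

# The device only listens for nearby APs; it never joins these networks.
scan = ["00:11:22:33:44:55", "66:77:88:99:AA:BB", "de:ad:be:ef:00:00"]
position = estimate_position(scan)
```

APs that are missing from the database are simply ignored, so the estimate degrades gracefully when only some of the observed networks are known.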
Network-based geolocation
Location can also be determined by using a service provider’s network infrastructure.
The accuracy of network-based techniques can vary. It depends on both the
concentration of base stations and the implementation of the most up-to-date timing
methods. A technique used by different network providers is network triangulation,
which determines the location of a point by forming triangles to it from known
points. To use a service provider’s network infrastructure, your tracking device
must be equipped with a module from that service provider.
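To make the triangle idea concrete, the sketch below locates a device from two known points and the bearing measured from each point to the device, by intersecting the two bearing lines. It assumes a flat local x/y grid in meters and bearings measured clockwise from north. The station coordinates and the `triangulate` helper are made up for illustration, and real networks typically rely on timing or signal measurements rather than exact angles.

```python
import math

def triangulate(p1, bearing1_deg, p2, bearing2_deg):
    """Intersect two bearing lines drawn from known points p1 and p2.

    Bearings are in degrees, clockwise from north (the +y axis).
    Returns the (x, y) intersection, or None if the lines are parallel.
    """
    # Direction vectors for each bearing: (east, north) components.
    a1, a2 = math.radians(bearing1_deg), math.radians(bearing2_deg)
    d1 = (math.sin(a1), math.cos(a1))
    d2 = (math.sin(a2), math.cos(a2))
    # Solve p1 + t*d1 == p2 + s*d2 for t using the 2D cross product.
    denom = d1[0] * d2[1] - d1[1] * d2[0]
    if abs(denom) < 1e-12:
        return None
    dx, dy = p2[0] - p1[0], p2[1] - p1[1]
    t = (dx * d2[1] - dy * d2[0]) / denom
    return (p1[0] + t * d1[0], p1[1] + t * d1[1])

# Two hypothetical base stations 1000 m apart on the x axis.
station_a, station_b = (0.0, 0.0), (1000.0, 0.0)
# The device is seen at 45 degrees from A and at 315 degrees from B.
device = triangulate(station_a, 45.0, station_b, 315.0)
```

The two stations and the device form the triangle: the baseline between the stations is known, the two bearings fix the angles at its ends, and the third corner is the device position.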
Clean the Datasets to prepare them for analysis. (Data Cleaning via Pandas)
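A minimal sketch of such a cleaning pass with Pandas is shown below. The column names (`name`, `latitude`, `longitude`, `rent`) and the sample rows are assumptions for illustration, not the project's actual dataset.

```python
import pandas as pd

# Hypothetical raw records with typical defects: a duplicate row,
# a non-numeric rent, a missing latitude and an out-of-range latitude.
raw = pd.DataFrame({
    "name": ["Hostel A", "Hostel A", "PG B", "Flat C", "Flat D", "PG E"],
    "latitude": ["28.67", "28.67", "28.70", None, "95.00", "28.66"],
    "longitude": ["77.45", "77.45", "77.40", "77.42", "77.41", "77.43"],
    "rent": ["8000", "8000", "n/a", "12000", "9000", "7000"],
})

def clean(df):
    """Return a copy without duplicates, bad numbers or invalid coordinates."""
    out = df.drop_duplicates().copy()
    # Convert text columns to numbers; unparseable values become NaN.
    for col in ["latitude", "longitude", "rent"]:
        out[col] = pd.to_numeric(out[col], errors="coerce")
    # Drop rows that are missing any required value.
    out = out.dropna(subset=["latitude", "longitude", "rent"])
    # Keep only coordinates inside the valid latitude/longitude ranges.
    valid = out["latitude"].between(-90, 90) & out["longitude"].between(-180, 180)
    return out[valid].reset_index(drop=True)

cleaned = clean(raw)
```

Only the rows that survive every check remain in `cleaned`, which is the form a clustering or search step would expect.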
1. Data Collection Module: collects data from the users and stores
it in the database for later use.
2. Searching Module: after giving their input, the user searches for
locations that fall within their budget and offer their required facilities.
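The Searching Module could look roughly like the sketch below, which filters stored records by a maximum budget and a set of required facilities. The record fields, the sample listings and the `search` function are hypothetical; the project's actual storage and matching logic may differ.

```python
# Hypothetical records as stored by the Data Collection Module.
LISTINGS = [
    {"name": "Hostel A", "rent": 8000, "facilities": {"wifi", "mess"}},
    {"name": "PG B", "rent": 11000, "facilities": {"wifi", "laundry", "mess"}},
    {"name": "Flat C", "rent": 15000, "facilities": {"wifi", "parking"}},
]

def search(listings, max_budget, required_facilities):
    """Return listings within budget that offer every required facility.

    Results are sorted with the cheapest listing first.
    """
    required = set(required_facilities)
    matches = [
        item for item in listings
        if item["rent"] <= max_budget and required <= item["facilities"]
    ]
    return sorted(matches, key=lambda item: item["rent"])

results = search(LISTINGS, max_budget=12000, required_facilities=["wifi", "mess"])
names = [item["name"] for item in results]
```

Using a set for the facilities keeps the "has every required facility" check to a single subset comparison.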
[1] Cao, L., & Cong, G. (2018). Big data analytics in geolocational data: a survey.
ACM Computing Surveys (CSUR), 51(1), 1-36.
[2] Long, Y., & Shekhar, S. (2019). Spatial big data: a review on data acquisition,
storage, and management. Information Systems Frontiers, 21(6), 1247-1269.
[3] Gao, S., et al. (2017). Exploring human mobility patterns using geolocated
tweets. Applied Geography, 81, 44-54.
Signature………………………….. Signature………………………
Signature…………………………..
Name: Shivansh Gupta
February, 2024