

BANGALORE UNIVERSITY

AN INTERNSHIP REPORT
On

“DATA ANALYSIS OF INDIAN AGRICULTURE”


Submitted to Bangalore University in partial fulfilment of the requirements for the award of the degree of

BACHELOR OF COMPUTER APPLICATIONS (BCA)


Submitted by

DHANUSH R (U03DQ21S0244)

GOVERNMENT FIRST GRADE COLLEGE, VIJAYANAGAR


DEPARTMENT OF COMPUTER SCIENCE
RPC Layout, Vijayanagar, Bangalore-104
2023-2024

CERTIFICATE
This is to certify that the internship report entitled “DATA ANALYSIS OF
INDIAN AGRICULTURE” is a bona fide work carried out by
DHANUSH R (U03DQ21S0244) in partial fulfilment of the requirements for
the Bachelor’s Degree in Computer Applications of Bangalore University,
Bangalore, during the academic year 2023-24.

………………………… …………………………
Signature of Guide Signature of HOD
DECLARATION
I, DHANUSH R, hereby declare that the internship report entitled “DATA
ANALYSIS OF INDIAN AGRICULTURE” with reference to “GOVERNMENT
FIRST GRADE COLLEGE” was prepared by me under the guidance of ______,
CS Department, Government First Grade College, with external assistance
from External Guide Narendra, Managing Director, The Edunet Foundation.

I also declare that this internship work is towards the partial fulfillment of
the university regulations for the award of degree of Bachelor of Computer
Applications by Bangalore University, Bangalore.

I undertook this project over a period of 6 weeks. I further declare that this
project is based on the original study undertaken by me and has not been
submitted for the award of any degree from any other university/institution.

DHANUSH R
ACKNOWLEDGEMENT

At the very outset, I wish to express my sincere gratitude to those who helped
me in completing this internship work.

My first salute goes to Bangalore University for the opportunity to do the
internship according to the syllabus.

I would like to express my special thanks and gratitude to our Prof. ______ as
well as our HOD Mrs. Nazia Hassan B, who gave me the golden opportunity to
do this internship report on the topic “DATA ANALYSIS OF INDIAN
AGRICULTURE”, which also helped me in doing a lot of research through which
I came to know about many new things. I am thankful to them.

I also express my heartfelt gratitude to the Lecturers, Department of Computer
Science, for their valuable, skilled guidance and encouragement at every
step, which helped me to complete my work successfully.

My sincere thanks to all who helped me directly or indirectly in this
internship.
ABSTRACT

This report presents an exhaustive analysis of the Indian
agriculture sector from 2005 to 2023 using Microsoft Excel and
Power BI. By examining comprehensive datasets encompassing
crop production, weather patterns, market prices, and policy
frameworks, the study reveals insights into crop yield variations,
regional disparities, and the impact of climate change. Through
descriptive analytics and predictive modeling, it highlights the
role of technology, infrastructure, and policy reforms in enhancing
productivity and sustainability. The findings offer valuable insights
for policymakers and stakeholders, emphasizing the need for
data-driven strategies to address challenges and optimize
opportunities in India's agriculture sector for sustainable
development.
TABLE OF CONTENTS

Acknowledgement
Abstract
Organization Information
Internship Objectives
Weekly overview of Internships
Introduction to Indian Agriculture
Problem Statement
Project Overview
Proposed Solution
Technology Used
Modelling and Result
Conclusion
Organization Information
About EY GDS:

Ernst & Young Global Delivery Services, a global leader in professional services, has
expanded its commitment to corporate social responsibility (CSR) to include a
robust skilling component. Recognizing the critical role that education and skill
development play in fostering sustainable communities, EY has launched numerous
initiatives aimed at equipping individuals with the skills they need to thrive in a
rapidly changing world. These programs extend beyond traditional business
interests and reflect EY's dedication to making a positive impact beyond its core
services. The Next Gen Employability Program is an initiative by Edunet & AICTE in
collaboration with EY GDS to enhance the employability of students in the technical
education ecosystem.

About Edunet Foundation:

Edunet Foundation (EF) was founded in 2015. The organization primarily focuses on
youth skilling, innovation, and entrepreneurship. Since its inception, the organization
has helped young people from different geographies in India to prepare for Industry 4.0
jobs. EF has a national footprint, and it works with regulators, state technical
universities, and a large network of engineering colleges and high schools around India.
The programs and initiatives undertaken by Edunet Foundation all focus on digital
skilling and conform to the organization’s mission 2025 goals, which aim at skilling and
impacting over 1,000,000 members of the future workforce for the IR 4.0 economy.
Edunet Foundation enjoys “Special Consultative Status” with the Economic and Social
Council at the United Nations.
Internship Objectives

1. Gain hands-on experience in data analytics tools such as Power BI and
Excel, along with understanding foundational concepts in data analytics.
2. Analyze 19 years of Indian agriculture data to extract meaningful insights
and identify key parameters related to crop production.
3. Learn to apply data analytics techniques to identify the top 5 crops by
production, top 3 states and districts by production, and production trends
across different seasons.
4. Develop skills in predictive analytics by predicting crop production for a
future year based on past data.
5. Explore the possibility of determining the Minimum Support Price (MSP)
for a given crop in a specific year, using data-driven approaches.
6. Enhance problem-solving abilities and critical thinking skills through the
application of data analytics to real-world scenarios in the agriculture
sector.
7. Collaborate with peers and mentors to discuss findings, share insights,
and propose recommendations for improving agricultural practices and
policies based on data analysis results.
Weekly overview of Internships

Week 1: Importing, Pre-Processing and Data Modelling

Weekly completion tasks:
• Understanding Data Analytics basics
• Understanding Data and application
• Understanding Power BI tool
• Project Planning (Module identification)
• Adding the Data to Power BI
• Preparations: Categorization of Data, Data Cleaning operations, Data Wrangling operations, etc.
• Identify relations among the data tables and Data Modelling
• Business Requirements generation

Weekly module completion progress:
• Data Analytics basic knowledge understanding level
• Power BI Knowledge
• Data Importing to Data Operations like Data Cleaning etc.
• Create relationships between tables
• Understanding the Business / Project Requirements

Week 2: DAX and Dashboard (Visualization)

Weekly completion tasks:
• Understanding Data Analysis Expressions for the Project Context
• DAX Functions to Project Context
• Prepare New Measures and New Columns using DAX Functions according to the Project requirements
• Understanding the Various Charts and their usage
• Select the appropriate Chart for each Project requirement.
• Visualize the text data into Charts
• Apply filter(s) on the Chart, if needed.

Weekly module completion progress:
• Prepare the DAX functions for the Project's betterment
• Advanced Visualization
• User Interaction using filtering, slicers, etc. to make the project interactive.
• Create new Columns and new Measures, if needed
• Use DAX functions to enhance the Chart

Week 3: Visualization and Dashboard Preparation

Weekly completion tasks:
• Power BI Analysis: Advanced Visualization
• Filters and Slicers
• Adding various columns and measures to charts to achieve Project requirements

Weekly module completion progress:
• Prepare Report(s)
• Use Advanced filtering techniques, if needed
• Prepare the Dashboard

Week 4: Formatting and Testing the Project Functionality

Weekly completion tasks:
• Testing and Iteration
• Formatting
• Submit the Project

Weekly module completion progress:
• Formatting visuals and canvas background.
• Approaches of testing strategies.
• Cross-checking with functionality
• Validation of the Project.

Week 5: Mock Presentations

Weekly module completion progress:
• Presenting the project PPT before subject matter experts

Week 6: Final Presentations

Weekly module completion progress:
• Presenting the project PPT before the EY industry expert panel.
Introduction to Indian Agriculture

Indian agriculture, deeply rooted in tradition and culture, stands
as the backbone of the country's economy, employing over half of
the nation's workforce and contributing significantly to its GDP.

With a diverse agro-climatic environment, India boasts a rich
variety of crops, ranging from grains like rice and wheat to cash
crops like cotton and sugarcane.

Despite advancements in technology and modern farming
practices, a majority of Indian farmers still rely on traditional
methods, facing challenges such as fragmented landholdings,
water scarcity, and unpredictable weather patterns.

However, initiatives like the Green Revolution and recent
government policies aim to modernize the sector, enhance
productivity, and ensure food security for the nation's growing
population.

Understanding the complexities and dynamics of Indian
agriculture is crucial for addressing its challenges and unlocking
its vast potential for sustainable growth and development.
Problem Statement
The agricultural landscape in India faces multifaceted challenges
that demand immediate attention and innovative solutions. One
of the primary concerns is the persistence of small and
fragmented landholdings, leading to suboptimal use of resources
and limited economies of scale. Moreover, inadequate
infrastructure, including transportation and storage facilities,
results in post-harvest losses and inefficiencies in the supply
chain. Additionally, the sector contends with unpredictable
weather patterns and climate change, exacerbating the
vulnerability of farmers and their livelihoods. Coupled with these
challenges are socio-economic issues such as farmer
indebtedness, lack of access to modern technology and quality
inputs, and the marginalization of smallholder farmers in the
marketplace.

Addressing these challenges requires a holistic approach
encompassing policy reforms, technological interventions, and
capacity-building initiatives. Improving access to credit and
markets, promoting sustainable farming practices, and investing
in agricultural infrastructure are critical steps towards enhancing
productivity, resilience, and income security for farmers.
Furthermore, leveraging digital technologies and data-driven
insights can revolutionize agricultural practices, enabling
precision farming, real-time monitoring of crop health, and
informed decision-making. By fostering innovation and
collaboration across stakeholders, India can unlock the full
potential of its agriculture sector, ensuring food security, rural
prosperity, and environmental sustainability in the years to come.
Project Overview
The project entails a comprehensive examination of 19 years'
worth of Indian agricultural data utilizing data analytics tools like
Power BI and Excel. Its core objective is to extract profound
insights and patterns concerning crop production, seasonal
fluctuations, and geographical dispersion. The analysis
endeavors to pinpoint the primary crops cultivated, top-
performing regions (states and districts), and the influence of
distinct seasons on agricultural yield. Furthermore, predictive
analytics will be leveraged to anticipate crop production for an
upcoming year, while also assessing the feasibility of establishing
Minimum Support Price (MSP) for various crops. By delving into
this project, participants will acquire a profound understanding of
data analytics principles and their practical application within the
agricultural domain, thereby fostering informed decision-making
processes and policy development.

The project's scope extends to exploring various facets of the
Indian agricultural landscape, integrating data-driven insights to
uncover underlying trends and patterns. By delving into detailed
analyses of crop production, regional disparities, and seasonal
effects, participants will gain valuable insights into the intricacies
of agricultural dynamics. The predictive aspect of the project
aims to forecast future production levels, facilitating proactive
planning and resource allocation. Moreover, the examination of
Minimum Support Price feasibility adds a critical dimension to the
project, offering insights into policy implications and potential
interventions to support farmers. Through this project,
participants will not only enhance their data analytics skills but
also contribute to addressing real-world challenges in the Indian
agriculture sector, thereby fostering sustainable growth and
development.
Proposed Solution

The proposed solution involves leveraging the capabilities of data
analytics tools like Power BI and Excel to conduct a
comprehensive analysis of the Indian agriculture sector spanning
19 years of data. Through meticulous examination and
visualization of this extensive dataset, the project aims to uncover
significant trends, patterns, and correlations within the
agricultural landscape. By delving into aspects such as crop
production volumes, regional disparities, and seasonal variations,
the analysis seeks to provide valuable insights into the dynamics
of the agricultural sector.

Furthermore, the project will explore predictive modeling
techniques to forecast future crop production and assess the
feasibility of implementing Minimum Support Prices (MSP) for
various crops. By employing advanced analytics methodologies,
stakeholders can gain actionable insights to make informed
decisions regarding crop planning, resource allocation, and policy
formulation. Ultimately, the proposed solution aims to facilitate
evidence-based policymaking, enhance agricultural productivity,
and contribute to the sustainable development of the Indian
agriculture sector.
Technology Used

- Power BI: Leveraging Power BI's advanced analytics features for
data visualization, interactive dashboards, and statistical
analysis.
- Excel: Utilizing Excel for data manipulation, cleansing, and pre-
processing to ensure data accuracy and integrity.
- Comprehensive Analysis: Conducting in-depth analysis of 19
years of Indian agricultural data to identify trends, patterns, and
insights.
- Interactive Dashboards: Creating interactive dashboards in
Power BI to visualize key metrics, crop production trends, and
geographical distribution.
- Statistical Analysis: Performing statistical analysis to identify
correlations, trends, and predictive models for crop production
and minimum support price.
- Decision Support: Providing stakeholders with actionable
insights to optimize agricultural practices, policy-making, and
resource allocation.
Modelling and Result

Excel Dataset:
The Excel dataset utilized in this project comprises
comprehensive records spanning 19 years, from 2005 to 2023,
capturing various aspects of Indian agriculture. It encompasses
diverse agricultural parameters such as crop production, yield,
acreage, and geographical data at the district and state levels.
The dataset is meticulously curated and standardized to ensure
consistency and reliability across all records. Additionally, it
includes historical information on climatic conditions, soil types,
and other pertinent factors that influence agricultural outcomes.
This rich dataset serves as the foundation for conducting
exhaustive analysis and deriving meaningful insights into the
Indian agriculture sector.
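
The cleaning and standardization described above was carried out in Excel and Power BI. As a rough illustration only, the same kind of pre-processing can be sketched in Python. The column names (state, district, crop, season, year, production) and the sample rows are assumptions made for this example and are not taken from the project dataset.

```python
import csv
import io

# Hypothetical sample rows; column names and values are illustrative assumptions.
RAW_CSV = """state,district,crop,season,year,production
 Karnataka ,Mandya,Sugarcane,Whole Year ,2005,1200.5
karnataka,Mandya,Rice,Kharif,2005,
Punjab,Ludhiana,Wheat,Rabi,2005,900
Punjab,Ludhiana,Wheat,Rabi,2005,900
"""


def clean_rows(text):
    """Trim whitespace, normalize case, drop rows without production, drop duplicates."""
    seen = set()
    cleaned = []
    for row in csv.DictReader(io.StringIO(text)):
        row = {key: (value or "").strip() for key, value in row.items()}
        if not row["production"]:
            continue  # skip records with a missing production value
        record = (
            row["state"].title(),
            row["district"].title(),
            row["crop"].title(),
            row["season"].title(),
            int(row["year"]),
            float(row["production"]),
        )
        if record in seen:
            continue  # skip exact duplicate records
        seen.add(record)
        cleaned.append(record)
    return cleaned


rows = clean_rows(RAW_CSV)
for record in rows:
    print(record)
```

In this sample, the row with an empty production value and the repeated Punjab row are dropped, leaving two cleaned records.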
Power BI:
Power BI is a powerful business intelligence tool developed by
Microsoft, renowned for its user-friendly interface and robust
analytical capabilities. Leveraging Power BI, users can seamlessly
connect to various data sources, including Excel files, databases,
and cloud services, to create interactive and visually appealing
reports and dashboards. With its array of built-in visualization
options and advanced analytics features such as predictive
modeling and natural language queries, Power BI empowers users
to gain deep insights into their data. Moreover, its integration with
other Microsoft products like Excel, SharePoint, and Teams
enhances collaboration and facilitates data-driven decision-
making across organizations.
DAX (Data Analysis Expressions):
DAX, or Data Analysis Expressions, is a formula language used in
Power BI and other Microsoft tools like Excel Power Pivot and
Analysis Services. It's designed to perform calculations, define
custom measures, and create calculated columns within data
models. DAX functions allow users to manipulate data, perform
aggregations, filter data dynamically, and create sophisticated
calculations. Its syntax is similar to Excel formulas, making it
accessible to users familiar with Excel functions. DAX plays a
crucial role in unlocking the full potential of Power BI by enabling
users to derive meaningful insights and drive data-driven
decision-making through powerful calculations and analyses.
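
The difference between a calculated column and a measure can be hard to see from a description alone. The sketch below is a Python analogy, not DAX, and the table, column names, and values are hypothetical: a calculated column is computed once per row and stored with it, while a measure is an aggregation that is re-evaluated over whichever rows pass the current filters (for example, a slicer selection).

```python
# Hypothetical table; values are illustrative only.
rows = [
    {"crop": "Rice", "season": "Kharif", "area": 10.0, "production": 30.0},
    {"crop": "Rice", "season": "Rabi", "area": 5.0, "production": 10.0},
    {"crop": "Wheat", "season": "Rabi", "area": 8.0, "production": 24.0},
]

# Analogue of a calculated column: computed row by row and stored on each row.
for row in rows:
    row["yield"] = row["production"] / row["area"]


def total_production(table, **filters):
    """Analogue of a measure: sum production over the rows matching the filters."""
    return sum(
        row["production"]
        for row in table
        if all(row[column] == value for column, value in filters.items())
    )


all_total = total_production(rows)                   # no filter applied
rabi_total = total_production(rows, season="Rabi")   # filtered to one season
print(all_total, rabi_total)
```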
The clustered bar chart in Power BI effectively displays the
top 5 crops based on production, offering a clear comparison of
their relative contributions. This visualization aids in identifying
crop priorities and understanding their production dynamics over
the years. It facilitates data-driven decision-making by
highlighting the crops with the highest output, allowing
stakeholders to focus on optimizing their cultivation and
distribution strategies. Overall, this chart enhances agricultural
analysis by providing concise insights into key production trends.
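
The ranking behind such a chart is a group-and-sort operation. A minimal Python sketch follows, assuming hypothetical (crop, production) records; the crop names and figures are invented for illustration and are not results from the project.

```python
from collections import defaultdict

# Hypothetical (crop, production) records; values are illustrative only.
records = [
    ("Rice", 120.0), ("Wheat", 100.0), ("Sugarcane", 300.0),
    ("Cotton", 40.0), ("Maize", 60.0), ("Rice", 30.0), ("Bajra", 20.0),
]


def top_n_by_total(pairs, n):
    """Sum production per key and return the n largest totals, highest first."""
    totals = defaultdict(float)
    for key, production in pairs:
        totals[key] += production
    return sorted(totals.items(), key=lambda item: item[1], reverse=True)[:n]


top_crops = top_n_by_total(records, 5)
for crop, total in top_crops:
    print(crop, total)
```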
The clustered column chart in Power BI vividly showcases
the top 3 states by agricultural production, offering a visual
representation of their relative performance. This visualization
enables stakeholders to discern the leading states in terms of
crop output, facilitating targeted resource allocation and policy
planning. By quickly identifying the most productive states,
decision-makers can prioritize initiatives to support and enhance
agricultural activities in these regions, fostering sustainable
growth and development in the agricultural sector.
Utilizing the clustered column chart in Power BI, we can
easily pinpoint the top 3 districts based on agricultural
production, providing valuable insights into regional performance.
This visual representation allows stakeholders to identify key
districts contributing significantly to overall crop output, aiding in
targeted resource allocation and strategic decision-making. By
highlighting the most productive districts, policymakers and
agricultural stakeholders can tailor interventions and investments
to bolster agricultural development at the local level, fostering
economic growth and food security.
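
The state-level and district-level rankings can be sketched in the same spirit. The records below are hypothetical. Districts are keyed by a (state, district) pair in this sketch, on the assumption that a district name alone may not be unique across states.

```python
from collections import Counter

# Hypothetical (state, district, production) records; values are illustrative only.
records = [
    ("Uttar Pradesh", "Meerut", 500.0),
    ("Uttar Pradesh", "Bareilly", 200.0),
    ("Maharashtra", "Pune", 350.0),
    ("Karnataka", "Mandya", 300.0),
    ("Punjab", "Ludhiana", 250.0),
    ("Karnataka", "Belagavi", 150.0),
]

state_totals = Counter()
district_totals = Counter()
for state, district, production in records:
    state_totals[state] += production
    district_totals[(state, district)] += production

# most_common(3) returns the three largest totals, highest first.
top_states = state_totals.most_common(3)
top_districts = district_totals.most_common(3)
print(top_states)
print(top_districts)
```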
Employing a pie chart visualization in Power BI facilitates the
identification of production distribution across various seasons,
offering a concise overview of agricultural output trends
throughout the year. This graphical representation enables
stakeholders to discern the proportion of production attributed to
different seasons, aiding in seasonal planning and resource
allocation. By visualizing production variations across seasons,
decision-makers can adapt strategies to optimize resource
utilization and mitigate risks associated with seasonal
fluctuations, enhancing overall agricultural productivity and
resilience.
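
Each slice of such a pie chart is a season's share of total production. A minimal sketch of that calculation, using hypothetical season totals:

```python
# Hypothetical production totals per season; values are illustrative only.
season_totals = {"Kharif": 600.0, "Rabi": 300.0, "Whole Year": 100.0}


def percentage_shares(totals):
    """Return each category's share of the grand total, in percent."""
    grand_total = sum(totals.values())
    if grand_total == 0:
        return {name: 0.0 for name in totals}
    return {name: 100.0 * value / grand_total for name, value in totals.items()}


shares = percentage_shares(season_totals)
for season, share in shares.items():
    print(f"{season}: {share:.1f}%")
```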
Utilizing a line chart in Power BI, the year-wise production trend
can be depicted, allowing for the visualization of production
fluctuations over the 19-year period. Additionally, incorporating
prediction models enables forecasting of future production
trends based on historical data, empowering stakeholders to
anticipate potential changes and make informed decisions. This
comprehensive visualization aids in identifying long-term
production patterns, facilitating strategic planning and resource
management to optimize agricultural output and enhance
sustainability.
Employing predictive analytics within Power BI, a one-year
projection of agricultural production can be integrated into
the year-wise production trend line chart. By leveraging historical
data and predictive algorithms, this feature enables stakeholders
to anticipate future production levels with greater accuracy,
thereby enhancing decision-making capabilities and enabling
proactive measures to address potential challenges or
opportunities in the agricultural sector. This predictive
functionality enhances the utility of the visualization tool,
providing valuable insights for stakeholders to optimize resource
allocation, mitigate risks, and maximize agricultural productivity
in the coming year.
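
The report does not state which forecasting algorithm was configured in Power BI. To make the idea of a one-year projection concrete, the sketch below swaps in a plain least-squares straight-line trend, which is an assumption of this example and not necessarily what the Power BI forecast feature computes. The yearly figures are hypothetical.

```python
def fit_linear_trend(years, values):
    """Ordinary least-squares fit of values = slope * year + intercept."""
    n = len(years)
    mean_x = sum(years) / n
    mean_y = sum(values) / n
    sxx = sum((x - mean_x) ** 2 for x in years)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(years, values))
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    return slope, intercept


# Hypothetical yearly production totals; values are illustrative only.
years = [2019, 2020, 2021, 2022, 2023]
production = [100.0, 104.0, 108.0, 112.0, 116.0]

slope, intercept = fit_linear_trend(years, production)
forecast_2024 = slope * 2024 + intercept
print(round(forecast_2024, 2))
```

A straight line is the simplest possible trend model. It ignores seasonality and year-to-year shocks such as weather, so any projection from it should be read as a rough indication only.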
By incorporating predictive modeling techniques, Power BI can
generate insights into the potential Minimum Support Price
(MSP) for specific crops in the upcoming year. Utilizing historical
MSP data, current market trends, and predictive algorithms, this
card visualization offers stakeholders valuable foresight into the
probable MSP levels, empowering informed decision-making and
strategic planning. This predictive feature enhances the
platform's utility by providing stakeholders with actionable
information to anticipate market conditions, optimize crop
selection, and ensure fair returns for farmers, thereby fostering a
more sustainable and resilient agricultural ecosystem.
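
The report does not specify the algorithm behind the MSP card. In practice the MSP is announced by the government as a policy decision, so any projection is only indicative. As one simple possibility, the sketch below extrapolates the next value from the compound annual growth rate of past values. Both the method and the MSP figures are assumptions made for illustration.

```python
def project_next_value(history):
    """Extrapolate one step ahead using the compound annual growth rate."""
    if len(history) < 2 or history[0] <= 0:
        raise ValueError("need at least two values and a positive starting value")
    periods = len(history) - 1
    growth = (history[-1] / history[0]) ** (1.0 / periods)
    return history[-1] * growth


# Hypothetical MSP values for one crop over three consecutive years.
msp_history = [1000.0, 1100.0, 1210.0]
projected_msp = project_next_value(msp_history)
print(round(projected_msp, 2))
```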
Final Output of the Analysis
The dashboard provides a comprehensive overview of the Indian
agriculture sector, leveraging data analytics and visualization
tools to offer insights into crop production, regional performance,
and market trends. Through interactive visualizations such as
clustered bar charts, line charts, and pie charts, stakeholders can
easily identify top-performing crops, states, and districts, as well
as analyze production trends across different seasons and years.
Additionally, predictive modeling techniques are employed to
forecast future production levels and assess the possibility of
Minimum Support Prices (MSP) for specific crops. With its user-
friendly interface and predictive capabilities, the dashboard
serves as a valuable decision-making tool for policymakers,
agricultural experts, and stakeholders across the industry,
facilitating data-driven strategies and fostering sustainable
agricultural practices.
Conclusion
In conclusion, the Power BI-driven exhaustive analysis of the Indian
agriculture sector offers invaluable insights into the dynamics of
crop production, market trends, and policy implications. By
harnessing the power of data analytics and visualization
techniques, the project provides a deeper understanding of the
agricultural landscape, enabling stakeholders to make informed
decisions and drive meaningful interventions. From identifying top-
performing crops and regions to predicting future production levels
and assessing MSP possibilities, the dashboard serves as a robust
platform for evidence-based policymaking and strategic planning.
Moving forward, continued efforts to refine data models, expand
predictive capabilities, and enhance user accessibility will further
strengthen the dashboard's utility and impact in addressing the
evolving challenges and opportunities within the Indian agriculture
sector.
Furthermore, the comprehensive nature of the dashboard fosters
collaboration among various stakeholders, including policymakers,
researchers, farmers, and industry experts, facilitating data-driven
discussions and initiatives aimed at enhancing agricultural
productivity and sustainability. By leveraging advanced analytical
tools and methodologies, such as predictive modeling and scenario
analysis, the dashboard empowers users to explore potential
outcomes and develop strategies to mitigate risks and capitalize on
emerging opportunities. Ultimately, the project exemplifies the
transformative potential of data-driven approaches in addressing
complex socio-economic issues and driving positive change in the
agriculture sector, paving the way for a more resilient and inclusive
agricultural ecosystem in India.
