Screenshot 2024-06-21 at 8.08.56 PM

Download as pdf or txt
Download as pdf or txt
You are on page 1of 26

INTERNSHIP REPORT

A report submitted in partial fulfillment of the requirements for the Award of Degree of

BACHELOR OF COMPUTER APPLICATION


by
ASWANTH MT (U18AJ21S0245)

DEPARTMENT OF COMPUTER APPLICATION


ACHARYA INSTITUTE OF GRADUATE STUDIES
(NAAC Re-Accredited ‘A+’ and Affiliated to Bengaluru City University)
1#89/90, Soldevanahalli, Hesaraghatta road, BENGALURU – 560107

2023-2024

i
ACHARYA INSTITUTE OF GRADUATE STUDIES
(NAAC Re-Accredited ‘A+’ and Affiliated to Bengaluru City University)
1#89/90, Soldevanahalli, Hesaraghatta Road, Bengaluru – 560107

DEPARTMENT OF COMPUTER APPLICATION

CERTIFICATE

This is to certify that ASWANTH M T USN NO : U18AJ21S0245 (A student of the III


year BCA program) has completed an internship program in Artificial Intelligence And Machine
Learning at AIROBOSOFT From 26 MARCH 2024 to 26 APRIL2024.
During this internship, ASWANTH MT demonstrated a high level of:
DETERMINATION
COMMITTED PROFESSIONALISM

We commend ASWANTH M T for their dedication and valuable contributions. This internship
experience has undoubtedly played a significant role in furthering their academic and
professional development.

Head of Department Internship Coordinator

ii
iii
ACKNOWLEDGEMENT

I would like to take this opportunity to thank my college Acharya Institute of Graduate
Studies, Bengaluru and the Department of Computer Application for allowing us to work
on this internship.

Life enhances better opportunities with better blessings with adequate space and time. It was
a great blessing to do this Internship titled “Artificial Intelligence And Machine Learning”,
which I have put into all my efforts and dedication towards it resulting in getting undiscovered
knowledge, better experiences, and ideas behind it. To give brighter and broader measures there
have been a few concerns supportive of making this project real real-time application, without
which my project would have been meaningless.

First, my heartfelt gratitude and respect to Dr. Gurunath Rao Vaidya, Principal of AIGS, and
to Prof. Ramakrishna C. N, HOD, Dept. of Computer Application. With utmost thanks and
dedication, I would like to thank my guide Asst Prof. Aswathy Ravi, who she/ was aside in
every step of the work that I have done and with some important advice and corrective
measures.

I would also like to extend my thanks and gratitude to every faculty of the BCA Department and
to my family inmates and friends who were concerned for the internship.

iv
ABSTRACT

During my AI and ML internship, I developed and fine-tuned machine learning models using
Python and libraries such as TensorFlow and scikit-learn. I implemented AI algorithms to enhance
data analysis and improve predictive capabilities, contributing to more efficient and accurate
decision-making processes. I also worked on data preprocessing, feature engineering, and model
evaluation to ensure the robustness and reliability of the models. This hands-on experience allowed
me to apply theoretical knowledge to real-world problems, collaborate with a team of skilled
professionals, and gain a deeper understanding of the practical applications of AI and machine
learning in various domains.

Organization Information:

AIRBOSOFT is a distinguished Indian engineering and electronics company headquartered in


Bangalore, Karnataka. Specializing in both product and service domains, ITABS excels in wireless
communication technology and delivers high-quality services to its esteemed clients. The
company's core product offerings include embedded components such as ICs, control boards,
controllers, microprocessors, fuel systems, water level controllers, security systems, biometric
login systems, and wireless devices. Additionally, ITABS caters to industrial needs with products
like network tower management systems and develops various applications, including e-commerce
apps and website design and development. With a strong focus on innovation and client
satisfaction, ITABS stands out as a leader in the engineering and electronics industry, consistently
pushing the boundaries of technology and delivering exceptional solutions.

Programs and opportunities:

AIRBOSOFT offers a comprehensive array of programs and opportunities designed to foster


learning, innovation, and career growth. Their internship programs in software development,
embedded systems, and AI/ML provide hands-on experience under the guidance of industry
experts. Specialized training programs cater to fresh graduates and professionals looking to
enhance their skills in areas like AI, machine learning, and software development. ITABS also

v
engages in cutting-edge research and development projects focusing on wireless communication,
IoT, and AI technologies. For career opportunities, ITABS regularly hires for full-time positions
in software development, embedded systems engineering, and AI/ML, and offers flexible project-
based roles for those seeking varied work experiences. Collaborative projects with universities and
research institutions further enrich the professional development landscape at ITABS, making it
an ideal environment for ambitious IT professionals.

Methodologies: Our vision is to be a leader in the software industry by providing enhanced


services, relationships, and profitability. We aim to provide quality services that exceed the
expectations of our esteemed customers. Our mission is to build long-term relationships with our
customers and clients, offering exceptional customer service through innovation and advanced
technology.

Key parts of the report:

Under each division, we further provide specific industry solutions on focused domains with
cutting-edge technologies. Benefits of the Company/Institution through our report:

Under each division, we further provide specific industry solutions on focused domains with
cutting-edge technologies. We emphasize on building relationships with our clients by delivering
projects on time and within budget.

vi
INDEX

S.NO CONTENTS PAGE NO

1. Introduction 1

1.1 Modules 2

2 Analysis 3

3 Software requirements specifications 4

4 Technology 5

4.1 ASP.NET 5

4.2 ADP.NET 6

4.3 C#.NET 6

4.4 JAVA 7

4.5 SQL Data Base 8

5 Coding 10

6 Screenshots 11

7 Conclusion 16

8 Bibliography 17
Learning Objectives/Internship Objectives

➢ Internships are generally thought to be reserved for college students looking to gain
experience in a particular field. However, a wide array of people can benefit from Training
Internships to receive real world experience and develop their skills.

➢ An objective for this position should emphasize the skills you already possess in the area
and your interest in learning more

➢ Internships are utilized in several different career fields, including architecture,


engineering, healthcare, economics, advertising, and many more.

➢ Some internship is used to allow individuals to perform scientific research while othersare
specifically designed to allow people to gain first-hand experience working.

➢ Utilizing internships is a great way to build your resume and develop skills that can be
emphasized in your resume for future jobs. When you are applying for a Training
Internship, make sure to highlight any special skills or talents that can make you standapart
from the rest of the applicants so that you have an improvedchance of landing theposition.

8
viii
WEEEKLY OVERVIEW OF INTERSHIP ACTIVITIES

DATE DAY NAME OF THE TOPIC/MODULE COMPLETED


F
26/03/2024 TUESDAY INTRODUCTION AND ORIENTATION
I
R 27/03/2024 WEDNESDAY DATA COLLECTION AND PREPROCESSING TECHNIQUE
S
28/03/2024 THURSDAY EXPLORATORY DATA ANALYSIS AND VISUALIZATION
T
29/03/2024 FRIDAY INTRODUCTION TO MACHINE LEARNINGALGORITHM
W
E 30/03/2024 SATURDAY HOLIDAY
E
31/03/2024 SUNDAY HOLIDAY
K
01/04/2024 MONDAY IMPLEMENTING LINEAR REGRESSION MODELS

S DATE DAY NAME OF THE TOPIC/MODULE COMPLETED


E
02/04/2024 TUESDAY IMPLEMENTING LOGISTIC REGRESSION MODELS
C
O 03/04/2024 WEDNESDAY DECISION TREES
N
D 04/04.2024 THURSDAY INTRODUCTION TO NEURAL NETWORKS AND DEEP
W LEARNING
E 05/04/2024 FRIDAY IMPLEMENTING A BASIC NEURAL NETWORK
E
06/04/2024 SATURDAY HOLIDAY
K
07/04/2024 SUNDAY HOLIDAY

T DATE DAY NAME OF THE TOPIC/MODULE COMPLETED


H
I 08/04/2024 MONDAY CONVOLUTIONAL NEURAL NETWORKS
R
09/04/2024 TUESDAY IMPLEMENTATION CNNS FOR IMAGE CLASSIFICATION
D
10/04/2024 WEDNESDAY RECURRENT NEURAL NETWORKS(RNNS)
W
E 11/04/2024 THURSDAY NATURAL LANGUAGE PROCESSING (NLP)BASICS
E
K 12/04/2024 FRIDAY TEXT PREPROESSING AND WORD EMEDDINGS

13/04/2024 SATURDAY HOLIDAY


DATE DAY NAME OF THE TOPIC/MODULE COMPLETED
F
14/04/2024 SUNDAY HOLIDAY
O
U 15/04/2024 MONDAY IMPLEMENTING RNNS FOR TEXT CLASSIFICATION
R
T 16/04/2024 TUESDAY SEQUENCE-TO-SEQUENCE MODELS AND APPLICATION
H
17/04/2024 WEDNESDAY ADVANCED NLP TECHNIQUES AND TRANSFORMERS
W 18/04/2024 THURSDAY IMPLEMENTING TRANSFORMER MODELS
E
E 19/04/2024 FRIDAY MODEL DEVELOPMENT AND SERVING
K
20/04/2024 SATURDAY HOLIDAY

21/04/2024 SUNDAY HOLIDAY

F 22/04/2024 MONDAY INTRODUCTION TO REINFORCEMENT LEARNING


I
F 23/04/2024 TUESDAY IMPLEMENTING A BASIC REINFORCEMENT
T LEARNING AGENT
H 24/04/2024 WEDNESDAY HOLIDAY

W
E 25/04/2024 THURSDAY HOLIDAY
A
K 26/04/2024 FRIDAY HOLIDAY

10
Internship Report 2023-2024 Title of Internship

INTERNSHIP REPORT

1. INTRODUCTION

Artificial Intelligence (AI) and Machine Learning (ML) have rapidly emerged as transformative
technologies across various industries, revolutionizing how businesses operateand how people
interact with technology. This internship report explores the practical application of AI and ML
within the context of AIRBOSOFT, providing insights into the development, challenges, and
outcomes of projects undertaken during the internship.

This report begins with an overview of the organization and its role in the AI/ML landscape,
followed by a discussion of the objectives, scope, and methodology employed during the
internship. The subsequent sections detail the AI and ML techniques utilized, the data sources
leveraged, and the tools and technologies implemented. Furthermore, the report evaluates the
project's performance against predefined metrics and analyze the impact and potential applications
of the solutions developed.

The internship experience not only facilitated hands-on learning of AI and ML methodologiesbut
also provided valuable insights into the practical implications of these technologies withina real-
world business environment. This report aims to provide a comprehensive understanding of the
internship journey, including key learnings, challenges encountered, andrecommendations for
future work in the field of AI and ML.
Internship Report 2023-2024 Title of Internship

1
Internship Report 2023-2024 Title of Internship

2. SYSTEM ANALYSIS

2.1 Requirement Analysis

Existing System:

In an Existing, the sales prediction project aims to leverage machine learning techniques to forecast
future sales based on historical data. By analyzing factors such as past sales figures, seasonal trends,
economic conditions, and marketing activities, the project seeks to develop accurate predictive models.
Data preprocessing involves cleaning and transforming data to handle missing values, outliers, and
categorical variables, while feature engineering includes creating new features such as time-related
variables and lag features. Model selection includes evaluating various algorithms such as Linear
Regression, Random Forest, and Gradient Boosting, with a focus on optimizing performance metrics
like Mean Absolute Error (MAE) and Root Mean Squared Error (RMSE).

Proposed System:
Following the previously proposed set Future updates for the sales prediction project will focus on
enhancing accuracy, scalability, and usability through advanced techniques and technologies. Firstly,
we plan to expand data integration by including additional external sources such as social media trends,
competitor pricing, and economic indicators to provide a more comprehensive view. Advanced data
preprocessing techniques, including automated cleaning and feature engineering, will be implemented
to handle complex data issues more effectively. We will explore advanced modeling techniques such
as ensemble learning and deep learning architectures like LSTM networks for improved forecasting
accuracy. Model interpretability will be enhanced using SHAP values and LIME to provide transparent
explanations for predictions. Real-time prediction capabilities and scalable deployment using cloud
platforms will enable immediate decision-making support. Continuous monitoring and automated
model retraining will ensure that models remain accurate and adaptive to changing market conditions.

Department of Computer Application

2 Acharya Institute of Graduate Studies


Internship Report 2023-2024 Title of Internship

3. SOFTWARE REQUIREMENTS SPECIFICATIONS

The software requirement specification can be produced at the culmination of the analysis task. The
function and performance allocated to software as part of system engineering are refined by
establishing a complete information description, a detailed functional description, a representation of
system behavior, an indication of performance and design constrain, appropriate validation criteria,
and other information pertinent to requirements.

Software Requirements:

• Operating system: Windows 11


• Coding Language: Python
• Front-End: Google Colab
• Data Base: SQL Server 2008.

Hardware Requirement:

• System : HP Pavilion Laptop

• SSD : 512 GB
• Ram : 16GB

3
Internship Report 2023-2024 Title of Internship

4. TECHNOLOGY

Effective management of resources is an essential task for companies that are managing
different projects .Its important for them to efficiently organize and allocate personal as well as
equipment for different projects, same time avoiding idle resources.
Resource utilization refers to the process of making the most of the resources available to you
in order to achieve the objective that you want to. Your individual project and the utilization of
your resources for the same could also be called resource utilization.
During this Internship period, learnt the lesson of resource utilization and its direct benefit on
the growth of the company. At the most basic level, utilization is the metric professional
services companies use to measure the profitability of the people.
3.2.1 Soft
ware Tools
Anaconda
Navigator tool
Anaconda Navigator is a desktop graphical user interface (GUI) included in Anaconda®
distribution that allows you to launch applications and easily manage conda packages,
environments, and channels without using command-line commands. Navigator can search for
packages on Anaconda Cloud or in a local Anaconda Repository. It is available for Windows,
macOS, and Linux.
The following applications are available by default in Navigator:
JupyterLab
Jupyter Notebook
Spyder
VSCode
Glueviz
Orange 3 App
RStudio

Jupyter Notebook

The Jupyter Notebook is an open source web application that you can use to create and share
documents that contain live code, equations, visualizations, and text. Jupyter Notebook is
Department of Computer Application
maintained by the people at Project Jupyter. Jupyter Notebooks are a spin-off project from the
4 Acharya Institute of Graduate Studies
Internship Report 2023-2024 Title of Internship

IPython project, which used to have an IPython Notebook project itself. The name, Jupyter,
comes from the core supported programming languages that it supports: Julia, Python, and R.
Jupyter ships with the IPython kernel, which allows you to write your programs in Python, but
there are currently over 100 other kernels that you can also use.

Spyder

Spyder is a powerful scientific environment written in Python, for Python, and designed by and
for scientists, engineers and data analysts. It features a unique combination of the advanced
editing, analysis, debugging, and profiling functionality of a comprehensive development tool
with the data exploration, interactive execution, deep inspection, and beautiful visualization
capabilities of a scientific package.
Spyder offers built-in integration with many popular scientific packages, including NumPy,
SciPy, Pandas, IPython, QtConsole, Matplotlib, SymPy, and more.Spyder’s abilities can be
extended even further via its plugin system and API. Spyder can also be used as a PyQt5
extension library, allowing you to build upon its functionality and embed its components, such
as the interactive console, in your own software.

3.2.2 La

nguages Used

Python

Python is a high-level, interpreted, interactive and object-oriented scripting language. Python is


designed to be highly readable and is Interpreted , Interactive, Object-Oriented Beginner's
Language.Python features a dynamic type system and automatic memory management. It
supported multiple programming paradigms, including object-oriented, imperative and has large
and comprehensive standard library.
Python become popular because of 5 libraries are TensorFlow, NumPy, SciPy, Pandas and
Matplotlib. Python is the most widely used programming language today. When it comes to
solving data science tasks and challenges, Python never ceases to surprise its users. Most data
scientists are already leveraging the power of Python programming every day. Python has been
built with extraordinary Python libraries that are used by programmers every day in solving
problems.
Department of Computer Application

5 Acharya Institute of Graduate Studies


Internship Report 2023-2024 Title of Internship

Python features include:

Easy-to-learn − Python has few keywords, simple structure, and a clearly defined
syntax. This allows the student to pick up the language quickly.
Easy-to-read − Python code is more clearly defined and visible to the eyes.
Easy-to-maintain − Python's source code is fairly easy-to-maintain.
A broad standard library − Python's bulk of the library is very portable and cross platform
compatible on UNIX, Windows, and Macintosh.
Interactive Mode − Python has support for an interactive mode which allows
interactive testing and debugging of snippets of code.
Portable − Python can run on a wide variety of hardware platforms and has the same
interface on all platforms.
Extendable − You can add low-level modules to the Python interpreter. These
modules enable programmers to add to or customize their tools to be more efficient.
Databases − Python provides interfaces to all major commercial databases.
GUI Programming − Python supports GUI applications that can be created and
ported to many system calls, libraries and windows systems, such as Windows MFC,
Macintosh, and the X Window system of Unix.
Scalable − Python provides a better structure and support for large programs than shell
scripting.

3.2.3 Libraries

Tensor Flow

TensorFlow is a library for high-performance numerical computations with around 35,000


comments and a vibrant community of about 1,500 contributors. It’s used across various
scientific fields. TensorFlow is a framework for defining and running computations that involve
tensors, which are partially defined computational objects that eventually produce a value.

Department of Computer Application

6 Acharya Institute of Graduate Studies


Internship Report 2023-2024 Title of Internship

Numpy

NumPy (Numerical Python) is the fundamental package for numerical computation in Python; it
contains a powerful N-dimensional array object. It has around 18,000 comments on GitHub and
an active community of 700 contributors. It’s a general-purpose array-processing package that
provides high-performance multidimensional objects called arrays and tools for working with
them.

SciPy

SciPy (Scientific Python) is another free and open-source Python library extensively used in
data science for high-level computations. SciPy has around 19,000 comments on GitHub and an
active community of about 600 contributors.

Pandas

Pandas (Python data analysis) is a must in the data science life cycle. It is the most popular and
widely used Python library for data science, along with NumPy in matplotlib. With around
17,00 comments on GitHub and an active community of 1,200 contributors, it is heavily used
for data analysis and cleaning. Pandas provide fast, flexible data structures, such as data frame
CDs, which are designed to work with structured data very quickly and intuitively.

Matplotlib

Matplotlib has powerful yet beautiful visualizations. It’s a plotting library for Python with
around 26,000 comments on GitHub and a very vibrant community of about 700 contributors.
Because of the graphs and plots that it produces, it’s extensively used for data visualization. It
also provides an object-oriented API, which can be used to embed those plots into applications.

Department of Computer Application

7 Acharya Institute of Graduate Studies


Internship Report 2023-2024 Title of Internship

4.5 DATABASE

GOOGLE COLLAB

Google Collab, short for Google Collaboratory, is a free cloud-based Jupyter notebook
environment provided by Google. It allows users to write and execute Python code through a
web browser, eliminating the need to set up and maintain a local development environment.

Google Collab integrates seamlessly with Google Drive. You can mount your Google Drive to
access and store data files directly from Collab notebooks. This makes it easy to read and write
data files (CSV, Excel, etc.) and even store Python objects

• CSV.FILES :

CSV (Comma-Separated Values) files are plain-text files that store tabular data in a structured
format, where each line represents a data record, and each record consists of one or more fields
separated by commas. CSV files are widely used for data storage and exchange due to their
simplicity and compatibility with various applications and programming languages.

EXAMPLE :

Department of Computer Application

8 Acharya Institute of Graduate Studies


Internship Report 2023-2024 Title of Internship

NOTE : In this project the csv file has been downloaded from google platform and all the
redundancies has been removed.

Department of Computer Application

9 Acharya Institute of Graduate Studies


Internship Report 2023-2024 Title of Internship

5. CODING
import pandas as pd
import matplotlib.pyplot as plt

data = pd.read_csv('advertising.csv')
data.head()
data.shape
fig, axs = plt.subplots(1, 3, sharey=True)
data.plot(kind='scatter', x='TV', y='Sales', ax=axs[0], figsize=(16, 8))
data.plot(kind='scatter', x='Radio', y='Sales', ax=axs[1])
data.plot(kind='scatter', x='Newspaper', y='Sales', ax=axs[2])
feature_cols = ['TV']
X = data[feature_cols]
y = data.Sales
from sklearn.linear_model import LinearRegression
lm = LinearRegression()
lm.fit(X, y)
print(lm.intercept_)
print(lm.coef_)
X_new = pd.DataFrame({'TV': [50]})
X_new.head()
lm.predict(X_new)
X_new = pd.DataFrame({'TV': [data.TV.min(), data.TV.max()]})
X_new.head()
preds = lm.predict(X_new)
preds
data.plot(kind='scatter', x='TV', y='Sales')
plt.plot(X_new, preds, c='red', linewidth=2)
import statsmodels.formula.api as smf
lm = smf.ols(formula='Sales ~ TV', data=data).fit()
lm.conf_int()
lm.pvalues
lm.rsquared
feature_cols = ['TV', 'Radio', 'Newspaper']
X = data[feature_cols]
y = data.Sales
lm = LinearRegression()
lm.fit(X, y)
print(lm.intercept_)
print(lm.coef_)
lm = smf.ols(formula='Sales ~ TV + Radio + Newspaper', data=data).fit()
lm.conf_int()
lm.summary()
lm = smf.ols(formula='Sales ~ TV + Radio', data=data).fit()
lm.rsquared
lm = smf.ols(formula='Sales ~ TV + Radio + Newspaper', data=data).fit()
lm.rsquared

Department of Computer Application

10 Acharya Institute of Graduate Studies


Internship Report 2023-2024 Title of Internship

import numpy as np
np.random.seed(12345)
nums = np.random.rand(len(data))
mask_large = nums > 0.5
data['Size'] = 'small'
data.loc[mask_large, 'Size'] = 'large'
data.head()
data['IsLarge'] = data.Size.map({'small':0, 'large':1})
data.head(
feature_cols = ['TV', 'Radio', 'Newspaper', 'IsLarge']
X = data[feature_cols]
y = data.Sales
lm = LinearRegression()
lm.fit(X, y)
print(feature_cols, lm.coef_)
np.random.seed(123456)
nums = np.random.rand(len(data))
mask_suburban = (nums > 0.33) & (nums < 0.66)
mask_urban = nums > 0.66
data['Area'] = 'rural'
data.loc[mask_suburban, 'Area'] = 'suburban'
data.loc[mask_urban, 'Area'] = 'urban'
data.head()
area_dummies = pd.get_dummies(data.Area, prefix='Area').iloc[:, 1:]
data = pd.concat([data, area_dummies], axis=1)
data.head()
feature_cols = ['TV', 'Radio', 'Newspaper', 'IsLarge', 'Area_suburban',
'Area_urban']
X = data[feature_cols]
y = data.Sales
lm = LinearRegression()
lm.fit(X, y)
print(feature_cols, lm.coef_)

Department of Computer Application

11 Acharya Institute of Graduate Studies


Internship Report 2023-2024 Title of Internship

6. SCREENSHOTS

Department of Computer Application

12 Acharya Institute of Graduate Studies


7. CONCLUSION

AI/ML internship has been a transformative experience, providing me with in-depth knowledge
and practical skills in machine learning and artificial intelligence. I gained hands-on experience
with various ML algorithms, from supervised and unsupervised learning to advanced neural
networks, using tools like Python, scikit-learn. Working on real-world projects allowed me to
apply theoretical concepts to practical scenarios, enhancing my data preprocessing, model
training, and evaluation skills. Collaborating with a professional team helped me understand
industry best practices, agile workflows, and the importance of teamwork and communication.
This internship has solidified my passion for AI/ML, preparing me to tackle future challenges
and contribute meaningfully to the field.
8. BIBLOGRAPHY

The following books are referred during the analysis and execution phase of the project

1. Multiple linear regression in data mining http://ocw.mit.edu/courses/sloan-school-


ofmanagement/15-062-data-mining-spring-2003/lecture-notes/lecture9.pdf

You might also like