Industrial Training Report
Industrial Training Report
Industrial Training Report
A PROJECT REPORT
Submitted by
BACHELOR OF ENGINEERING
IN
Chandigarh University
July 2024
BONAFIDE CERTIFICATE
SIGNATURE SIGNATURE
Introduction
Literature Review
Methodology
3.1 Data Collection
Results
4.1 Performance Metrics
Discussion
5.1 Interpretation of Results
5.2 Limitations
Conclusion
References
ABSTRACT
Lung cancer is the leading cause of cancer-related deaths, making early detection
critical for improving survival rates. The treatment method depends largely on
the type and location of the cancer, and early identification can save many lives.
Despite numerous methods proposed by various scholars, achieving high
prediction accuracy remains a challenge. This study explores the use of
Convolutional Neural Networks (CNN) to identify lung cancer using CT images.
A CNN model was created and trained using a collection of annotated lung
images, achieving a 95% accuracy rate and significant improvements in precision
and recall compared to traditional techniques. The findings highlight the potential
of CNNs to aid radiologists and enhance diagnostic accuracy. Future research
will focus on expanding the dataset and refining the model for clinical use.
Additionally, deep learning algorithms like CNN and Google Net, which leverage
the textural properties of images, have been developed to distinguish between
normal and cancerous images effectively.
CHAPTER 1.
INTRODUCTION
Previous studies investigated several machine learning algorithms for lung cancer
diagnosis. Early techniques relied on constructed features and conventional
classifiers, which had limited accuracy. Recent research has proved the
advantages of deep learning models, particularly CNNs, in terms of automatically
extracting meaningful characteristics from raw photos. CNNs have been found to
be highly accurate in identifying lung nodules, discriminating between benign
and malignant nodules, and even segmenting lung tumours. Several functions
were used. A paper described research that focused solely on the detection of lung
cancer medical pictures using deep neural networks. The purpose of this
investigation was to determine whether there were any signs of cancer in a
patient's lungs. To assist physicians with visual diagnostics by training deep
neural networks to identify lung cancer. The primary advantage is that clinicians
will have more support in diagnosing and treating lung cancer in its early stages.
Research was undertaken to use LUAD and LUSC for classifying and predicting
mutations in non-small cell lung cancer histopathology images. An author
produced a paper titled "Optimisation of features using artificial neural networks
for categorization of lung cancer types."
CHAPTER 3
METHODOLOGY
3.1 Data Collection
This analysis will use a publicly available dataset containing lung
histopathological images. The dataset utilized includes a diverse range of images
with various types of lung nodules, annotated by experienced radiologists. This
dataset, accessible on Kaggle [https://www.kaggle.com/datasets/andrewmvd/
lung-and-colon-cancer-histopathological-images], encompasses 5,000 images
categorized into three classes representing different lung conditions: normal, lung
adenocarcinomas, and lung squamous cell carcinomas. Each class includes 250
original images that have been augmented to create the total of 5,000. Due to this
prior augmentation, we will not perform additional data augmentation in this
instance. By leveraging this dataset, we can explore various deep learning
architectures and techniques to achieve accurate classification of lung conditions.
It was uploaded using Kaggle Api by using following command,
!kaggle datasets download -d andrewmvd/lung-and-colon-cancer-
histopathological-images
RESULTS
4.1 Performance Metrics
The CNN model achieved an accuracy of 95%, with a precision of 93%, recall of
92%, and an F1-score of 92.5%. The ROC curve showed an area under the curve
(AUC) of 0.97, indicating a high true positive rate.
Discussion
5.1 Interpretation of Results
The high accuracy and precision of the CNN model indicate its effectiveness in
detecting lung cancer from CT scans. The model's ability to generalize across
different image types and conditions demonstrates its potential for clinical
application. The results suggest that CNNs can assist radiologists in making more
accurate and timely diagnoses.
5.2 Limitations
Despite the promising results, the study has limitations. The dataset, while
comprehensive, may not cover all possible variations in lung cancer
presentations. Additionally, the model's performance may be affected by the
quality and resolution of the input images. Future research should focus on
expanding the dataset and improving the model's robustness.
Conclusion
This study developed a CNN-based model for lung cancer detection, achieving
high accuracy and demonstrating the potential of deep learning in medical
diagnostics. The findings highlight the importance of using advanced machine
learning techniques to improve diagnostic accuracy and patient outcomes. Future
work will aim to refine the model further and validate its performance in clinical
settings.