The document defines functions for preprocessing data, applying clustering algorithms, and evaluating clustering performance. It loads clustering algorithms like MiniBatchKMeans, SpectralClustering, and OPTICS. For different datasets, it applies the algorithms, calculates the Davies-Bouldin Index metric, and identifies the best performing algorithm and number of clusters. It plots the clustered data and prints the results.
import pandas as pd
from sklearn.cluster import MiniBatchKMeans, SpectralClustering, OPTICS
from sklearn.metrics import davies_bouldin_score
import matplotlib.pyplot as plt
# Data preprocessing function
def preprocess_data(file_path):
    # Reading the data from CSV
    data = pd.read_csv(file_path, header=None)
    # Normalizing the data (z-score standardization)
    data_normalized = (data - data.mean()) / data.std()
    return data_normalized
# Function to apply clustering and calculate the Davies-Bouldin Index
def apply_clustering_and_evaluate(data, algorithm):
    # Applying the clustering algorithm to the data
    labels = algorithm.fit_predict(data)
    # Calculating Davies-Bouldin Index
    db_index = davies_bouldin_score(data, labels)
    return labels, db_index
# Clustering algorithms to compare (parameter values are representative;
# the original settings are not shown in the source)
algorithms = {
    'MiniBatchKMeans': MiniBatchKMeans(n_clusters=3),
    'SpectralClustering': SpectralClustering(n_clusters=3),
    'OPTICS': OPTICS()
}

for path in [
    r"C:\Users\Vishal\Desktop\DM2\D01.csv",
    r"C:\Users\Vishal\Desktop\DM2\D02.csv",
    r"C:\Users\Vishal\Desktop\DM2\D03.csv"
]:
    data = preprocess_data(path)

    # Tracking the best (lowest) Davies-Bouldin Index across algorithms
    best_davies_bouldin_score = float('inf')
    best_algorithm_name = None
    best_labels = None

    for name, alg in algorithms.items():
        labels, db_index = apply_clustering_and_evaluate(data, alg)
        if db_index < best_davies_bouldin_score:
            best_davies_bouldin_score = db_index
            best_algorithm_name = name
            best_labels = labels

    # Counting number of clusters
    num_clusters = len(set(best_labels))

    # Plotting the results
    plt.scatter(data[0], data[1], c=best_labels, cmap='viridis', marker='o')
    plt.title(f'{best_algorithm_name} - Davies-Bouldin Index: '
              f'{best_davies_bouldin_score:.2f}, Number of Clusters: {num_clusters}')
    plt.xlabel('Feature 1')
    plt.ylabel('Feature 2')
    plt.show()
    print(f"The best algorithm for {path} is {best_algorithm_name} with a "
          f"Davies-Bouldin Index of {best_davies_bouldin_score:.2f} and "
          f"{num_clusters} clusters")
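As a standalone illustration of how the Davies-Bouldin Index ranks clusterings (lower scores indicate better-separated, more compact clusters), the sketch below scores KMeans at two candidate cluster counts on synthetic blob data. The synthetic dataset and the choice of k values are illustrative assumptions, not taken from the original CSV files:

```python
from sklearn.cluster import KMeans
from sklearn.datasets import make_blobs
from sklearn.metrics import davies_bouldin_score

# Synthetic data with three well-separated clusters (illustrative only)
X, _ = make_blobs(n_samples=300, centers=3, cluster_std=0.5, random_state=42)

# Score KMeans at two candidate cluster counts
scores = {}
for k in (2, 3):
    labels = KMeans(n_clusters=k, n_init=10, random_state=42).fit_predict(X)
    scores[k] = davies_bouldin_score(X, labels)

# On this data the true k=3 should yield the lower (better) index
print(scores)
```

The same pattern generalizes to comparing different algorithms, as in the loop above: compute the index for each candidate and keep the one with the minimum score.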