Machine Learning CNN
Machine Learning CNN
Machine Learning CNN
Learning Techniques
By Clara M. Pennock (Keele University, UK)
Outline
• Aim: to give a basic understanding of machine learning and how to
apply these.
• Start with basic explanation of machine learning.
• Give examples of machine learning I’ve used.
• End with tutorial on using machine learning.
Basics
Non-linear
Label
function with
e.g. Quiescent (0),
some
Star-forming (1)
parameter w.
Depends on 𝑓𝑊 (𝑥)
Ԧ = 𝑦Ԧ
the algorithm
chosen.
Features
e.g. U-V, V-J
Basics – The types
From :
https://www.astroml.org/book_figures/ch
apter7/fig_PCA_LLE.html
Python: Scikit-Learn
Convolution Pooling
Convolutional Neural Networks
CNN training set
• Forest – created
from multiple
decision trees.
• Random –
dataset is
randomly split
into subsets,
each decision
tree is trained on
a different
subset.
Random Forest to find AGN behind the
Magellanic Clouds
Training Set: Reasons I use it:
• AGN – 292 spectroscopically identified • Require very little data preparation
sources • Perform well on imbalanced datasets.
• Galaxies (no identified AGN) – 512 • Can handle both numerical and
spectroscopically identified sources categorical data.
• Stars – ~40,000 from Simbad • Can handle multi-output problems.
• Other galactic sources (e.g. YSOs, PNe, • Outputs feature importance.
SN, SNRs) – ~3000 from Simbad
• Can handle numerous features and
objects.
Have survey catalogue data ranging • Produces classification probabilities.
from Radio to X-rays.
• Generalises well to unseen datasets.
Quick tutorials
• I’ve created some simple python scripts for you to try out. You can use the
provided example data or data of your own.
Dropbox link:
https://www.dropbox.com/sh/6tbzg2pc5kk50mf/AADBi3_CYOixCmpLA3tdn
WOua?dl=0