Find - S
Find - S
Find - S
The S algorithm, also known as the Find-S algorithm, is a machine learning algorithm
that seeks to find a maximally specific hypothesis based on labeled training data. It
starts with the most specific hypothesis and generalizes it by incorporating positive
examples. It ignores negative examples during the learning process.
Find-S Algorithm:
Following are the steps for the Find-S algorithm:
Initialization − The algorithm starts with the most specific hypothesis, denoted as h. This initial
hypothesis is the most restrictive concept and typically assumes no positive examples. It may be
represented as h = <∅, ∅, ..., ∅>, where ∅ denotes "don't care" or "unknown" values for each
attribute.
Iterative Process − The algorithm iterates through each training example and refines the hypothesis
based on whether the example is positive or negative.
o For each positive training example (an example labeled as the target class), the algorithm
updates the hypothesis by generalizing it to include the attributes of the example. The
hypothesis becomes more general as it covers more positive examples.
o For each negative training example (an example labeled as a non-target class), the algorithm
ignores it as the hypothesis should not cover negative examples. The hypothesis remains
unchanged for negative examples.
Generalization − After processing all the training examples, the algorithm produces a final
hypothesis that covers all positive examples while excluding negative examples. This final
hypothesis represents the generalized concept that the algorithm has learned from the training data.