What are recommended methods for incorporating prior knowledge in machine learning tasks? By prior knowledge I am referring to information like which features are a priori considered more important or which samples should be weighted higher (e.g. due to higher data quality). I am looking for references that give a good overview of the available methods and also pointers to software implementations.
1 Answer
I know this is 2 years old, but it might help someone at some point. Background knowledge comes in many forms, from prioritized features to whole Ontologies or other meta knowledge constructs. Dealing with this problem before, I have, for example, been able to represent my background knowledge as a graph (which it normally is anyways), and used its hierarchical properties to prioritize features when actually learning. As mentioned before, you can always resort to Bayesian methods, but the problem with them is the computational complexity, so be prepared to wait. A lot.