This repository contains code to reproduce the results described in my blog post "How To Use Active Learning To Iteratively Improve Your Machine Learning Models". It borrows heavily from this repo.
To run the code, you will need:
- Microsoft Machine Learning Server to run the R code
- A Databricks cluster to run the notebook that creates the word embeddings (optional)
Run the `1_wiki_detox_active_learning_workshop` notebook to reproduce all the results reported in the blog post.
Feel free to raise a pull request if you have any questions, feedback, suggestions, etc.