ACTIVE LEARNING METHODS FOR LABELLING DATASETS
Main Article Content
Abstract
Data scientists are challenged with more data than they will ever be able to analyse as data collection and storage costs continue to drop. The most fascinating developments in machine learning require vast amounts of data. But it also creates a new challenge for the machine learning community, as all supervised learning-based machine learning applications remain practically useless without labelled data. Labelling large datasets has become a vital challenge. A specific instance of Supervised Machine Learning is Active Learning. By actively choosing the important data points, this method builds a highperformance classifier thus minimizing the size of the training dataset.