Unsupervised Learning#

Unsupervised learning?#

Most methods we cover are supervised learning. Supervised learning refers to using the dataset where the training data and their labels are both present. Unsupervised methods, on the other hand, do not use labels. They find out traits in different parts of the dataset all by themselves.

Most common unsupervised methods#

Clustering#

Clustering clusters the data entry that are close to each other together.

Decision tree#

Decision tree discriminate between different classes of points by making decisions.

Self supervised#

Self supervised methods usually refer to MLM (Masked Language Model) training.

Semi supervised#

Semi supervised methods are used when most data don’t have labels but a small portion of the dataset does have labels.