Skip to content

Supervised Learning

Supervised Learning is a paradigm in Machine Learning, where we train the computer to predict an output based on the input data. The dataset contains both features and targets, provided by a human.

Notation

NotationDescription
SDataset. Contains all the features and targets for our predictions
NTotal number of samples in the dataset
xFeature. Input value(s) from the dataset.
Also called: explicative variables, or class in classification
XMatrix of many features, i.e X=(x1,,xn)
yTarget. Output values from the dataset.
Also called: ground truth, response variables, or label
y^Prediction. Output calculated/suggested by the model

Example

Surface (m2)Number of roomsPrice / month
18021500
29021600
310031750
411031900

Dataset S, composed of 4 samples

  • Feature x1 → Surface
  • Feature x2 → Nbr. of rooms
  • Target y1 → Price / month