Recap
Approximate Q-Learning
- Q-Learning with linear Q-Functions
- transition = (s, a, r, s')
- difference = [r + γ max_a' Q(s', a')] - Q(s, a)  (see the sketch below)
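A minimal sketch of this update with a linear Q-function, assuming a user-supplied feature function features(s, a); the function names and the alpha/gamma defaults are illustrative, not from the notes.

```python
# Minimal sketch of approximate Q-learning with a linear Q-function.
# Assumes features(s, a) returns a list of floats; alpha and gamma
# values here are placeholders.

def q_value(weights, feats):
    # Q(s, a) = sum_i w_i * f_i(s, a)
    return sum(w * f for w, f in zip(weights, feats))

def update(weights, s, a, r, s_next, actions, features, alpha=0.1, gamma=0.9):
    feats = features(s, a)
    # difference = [r + gamma * max_a' Q(s', a')] - Q(s, a)
    best_next = max(q_value(weights, features(s_next, a2)) for a2 in actions)
    difference = (r + gamma * best_next) - q_value(weights, feats)
    # w_i <- w_i + alpha * difference * f_i(s, a)
    return [w + alpha * difference * f for w, f in zip(weights, feats)]
```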
Learning
- Essential for unknown environments
- Learning is useful as a system construction method
Supervised Learning
- Task: Learn a mapping/model from inputs to outputs
- Inputs are also called features
- Outputs are also called targets
- If y is categorical: classification model
- If y is real-valued: regression model
- The model has learnable parameters θ
- Experience: Given in the form of input-output pairs
- Training Set D = {(x_1, y_1), ..., (x_N, y_N)}  (toy example below)
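A toy illustration of this setup, sketching a training set as a list of (x, y) pairs; the datasets and values below are made up for illustration.

```python
# Toy training sets D as lists of input-output pairs (x_i, y_i).
# All values are invented for illustration.

# Classification: y is categorical (spam / not spam)
D_classification = [
    (["free", "winner", "click"], "spam"),
    (["meeting", "agenda", "tomorrow"], "not spam"),
]

# Regression: y is real-valued (e.g., a future price)
D_regression = [
    ([101.2, 102.5, 103.1], 104.0),
    ([55.0, 54.2, 53.9], 53.5),
]
```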
Examples
- Email spam detector
- Input: Words
- Output: Spam or not
- Classification model
- Digit Classification
- Input: image
- Output: Digits
- Classification Model
- Stock Forecasting
- Input: Price History
- Output: Future Price
- Regression Model
K-Nearest Neighbor Algorithm
- Memorize training set
- Given a distance metric, find the K closest samples to x and predict the average of their y values (regression).
- For classification, take the K closest samples and pick the most frequent label (majority vote); see the sketch below.
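A minimal sketch of both variants, assuming numeric feature vectors and Euclidean distance; the parameter names and defaults are illustrative.

```python
import math
from collections import Counter

def euclidean(a, b):
    # Assumed distance metric; the notes only require "a distance metric".
    return math.sqrt(sum((ai - bi) ** 2 for ai, bi in zip(a, b)))

def knn_predict(D, x, k=3, classify=True):
    # D is the memorized training set: a list of (x_i, y_i) pairs.
    neighbors = sorted(D, key=lambda pair: euclidean(pair[0], x))[:k]
    ys = [y for _, y in neighbors]
    if classify:
        # Classification: majority vote over the K nearest labels.
        return Counter(ys).most_common(1)[0][0]
    # Regression: average of the K nearest targets.
    return sum(ys) / len(ys)
```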
Probabilistic Modeling
- Coin Flipping: Want to predict whether a coin flip will come up heads or tails
Maximum Likelihood for Learning
- Training Set: D = {x_1, x_2, ..., x_N}
- Likelihood of D
- Assume samples are independent and identically distributed
- L(D; θ) = P(x_1; θ) · P(x_2; θ) · ... · P(x_N; θ) = ∏_i P(x_i; θ)
- Maximizing L(D; θ) is equivalent to maximizing log L(D; θ)
- To maximize, find θ where ∂/∂θ log L(D; θ) = 0 (worked out for the coin flip below)
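A worked instance of this recipe for the coin-flip example, assuming the training set contains H heads and T tails (so N = H + T); the symbols H and T are introduced here for the derivation.

```latex
% Coin-flip maximum likelihood, with H heads and T tails observed:
\begin{align*}
L(D;\theta) &= \theta^{H}(1-\theta)^{T} \\
\log L(D;\theta) &= H\log\theta + T\log(1-\theta) \\
\frac{\partial}{\partial\theta}\log L(D;\theta)
  &= \frac{H}{\theta} - \frac{T}{1-\theta} = 0
  \quad\Longrightarrow\quad
  \hat{\theta}_{\mathrm{MLE}} = \frac{H}{H+T}
\end{align*}
```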