Classification and Logistic Regression

迪丽瓦拉

2025-05-28 19:57:47

0次

文章目录

一、Classification: Probabilistic Generative Model
- Ideal Alternatives
- Two Boxes example
- Two Classes
- Gaussian Distribution
- Probability from Class
- Maximum Likelihood
- Now we can do classification
- Three Steps
二、Classification: Logistic Regression
- Step 1: Function Set
- Step 2: Goodness of a Function
- Step 3: Find the best function
- Logistic Regression + Square Error
- Generative v.s. Discriminative
- Multi-class Classification
- Limitation of Logistic Regression
总结

一、Classification: Probabilistic Generative Model

Ideal Alternatives

Function (Model):
在这里插入图片描述
Loss function:

The number of times f get incorrect results on training data.

Find the best function:
Example: Perceptron, SVM

Classification as Regression?
Binary classification as example ：
Training: Class 1 means the target is 1; Class 2 means the target is -1
Testing: closer to 1 → class 1; closer to -1 → class 2
在这里插入图片描述

Penalize to the examples that are “too correct”

Multiple class: Class 1 means the target is 1; Class 2 means the target is 2; Class 3 means the target is 3 …… problematic

Two Boxes example

在这里插入图片描述

From one of the boxes，where does it come from?
在这里插入图片描述

Two Classes

Estimating the Probabilities From training data
在这里插入图片描述
Given an x, which class does it belong to

Gaussian Distribution

在这里插入图片描述
Input: vector x, output: probability of sampling x
The shape of the function determines by mean μ and covariance matrix Σ

Probability from Class

在这里插入图片描述

Maximum Likelihood

在这里插入图片描述

The Gaussian with any mean μ and covariance matrix Σ can generate these points

在这里插入图片描述
Likelihood of a Gaussian with mean μ and covariance matrix Σ = the probability of the Gaussian samples x^1,x2,x^3, …… ,x^79

Now we can do classification

在这里插入图片描述

Testing data: 47% accuracy
All: hp, att, sp att,
de, sp de, speed (6 features)

Modifying Model：

All: hp, att, sp att, de, sp de, speed

Three Steps

Function Set (Model):
在这里插入图片描述
Goodness of a function:
The mean μ and covariance Σ that maximizing the likelihood (the probability of generating data)
Find the best function: easy

Probability Distribution
在这里插入图片描述
Posterior Probability：

二、Classification: Logistic Regression

Step 1: Function Set

在这里插入图片描述

Step 2: Goodness of a Function

在这里插入图片描述

Step 3: Find the best function

在这里插入图片描述

Logistic Regression + Square Error

在这里插入图片描述

Generative v.s. Discriminative

在这里插入图片描述

Usually people believe discriminative model is better
Benefit of generative model
With the assumption of probability distribution
less training data is needed
more robust to the noise
Priors and class-dependent probabilities can be estimated from different sources.