- Compare and understand the difference between gradient descent and mini-batch stochastic gradient descent.
- Compare and understand the differences and relationships between logistic regression and linear classification.
- Further understand the principles of SVM and practice on a larger dataset.
The experiment uses the a9a dataset from LIBSVM Data, which contains 32561 training samples and 16281 testing samples, each with 123 features. Please download the training set and validation set. The loaded testing matrix may appear to have the wrong dimension: the values in its last column are all zero, so the loader drops that column. Fix this either by appending a zero column yourself or by specifying n_features=123 when calling the loading function.
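A minimal sketch of the loading step with scikit-learn's `load_svmlight_file`. The toy in-memory data below stands in for the real `a9a` / `a9a.t` files you downloaded (just pass their paths instead); the point it demonstrates is that `n_features=123` pads the missing trailing columns so the training and testing matrices agree in shape.

```python
import io
from sklearn.datasets import load_svmlight_file

# Toy data in LIBSVM sparse format: the highest feature index present is 5,
# so without n_features the loaded matrix would have only 5 columns.
# With the real files, use e.g. load_svmlight_file("a9a", n_features=123).
toy = io.BytesIO(b"+1 1:0.5 3:1.0\n-1 2:0.3 5:0.7\n")

X, y = load_svmlight_file(toy, n_features=123)
print(X.shape)  # (2, 123)
```

The same `n_features=123` argument applied to both the training and validation files guarantees matching dimensions without manually adding a column.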
Python 3 is required, including at least the following packages: sklearn, numpy, jupyter, matplotlib. Installing Anaconda3 directly is recommended, since it ships with all of the packages above.
The experiment code and plots should be completed in Jupyter.
Logistic Regression and Batch Stochastic Gradient Descent
- Load the training set and validation set.
- Initialize the logistic regression model parameters (e.g., with zeros, random values, or draws from a normal distribution).
- Select the loss function and derive its gradient; see the slides (PPT) for details.
- Choose a batch_size, randomly sample that many training examples, and compute the gradient G of the loss function on this mini-batch.
- Use SGD to update the model parameters; you are also encouraged to try the Adam optimizer.
- Choose an appropriate threshold: mark samples whose predicted scores exceed it as positive, and the rest as negative. Predict on the validation set and compute the loss Lvalidation.
- Repeat steps 4 to 6 several times, and draw a graph of Lvalidation against the number of iterations.
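The logistic regression steps above can be sketched as follows. This is a minimal illustration, not the prescribed implementation: labels are assumed to be in {0, 1}, and `batch_size`, `learning_rate`, and `num_iters` are illustrative choices.

```python
import numpy as np

rng = np.random.default_rng(0)

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

def train_logreg(X_train, y_train, X_val, y_val,
                 batch_size=64, learning_rate=0.1, num_iters=200):
    """Mini-batch SGD for logistic regression; labels assumed in {0, 1}."""
    n, d = X_train.shape
    w = np.zeros(d)                            # step 2: zero initialization
    val_losses = []
    eps = 1e-12                                # avoids log(0)
    for _ in range(num_iters):
        idx = rng.choice(n, size=batch_size)   # step 4: random mini-batch
        Xb, yb = X_train[idx], y_train[idx]
        p = sigmoid(Xb @ w)
        g = Xb.T @ (p - yb) / batch_size       # cross-entropy gradient G
        w -= learning_rate * g                 # step 5: plain SGD update
        # step 6: validation cross-entropy loss Lvalidation
        pv = sigmoid(X_val @ w)
        val_losses.append(-np.mean(y_val * np.log(pv + eps)
                                   + (1 - y_val) * np.log(1 - pv + eps)))
    return w, val_losses
```

For step 6's thresholding, `y_pred = sigmoid(X_val @ w) >= 0.5` marks positives at a 0.5 threshold, and `plt.plot(val_losses)` produces the Lvalidation curve for step 7.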
Linear Classification and Batch Stochastic Gradient Descent
- Load the training set and validation set.
- Initialize the SVM model parameters (e.g., with zeros, random values, or draws from a normal distribution).
- Select the loss function and derive its gradient; see the slides (PPT) for details.
- Choose a batch_size, randomly sample that many training examples, and compute the gradient G of the loss function on this mini-batch.
- Use SGD to update the model parameters; you are also encouraged to try the Adam optimizer.
- Choose an appropriate threshold: mark samples whose predicted scores exceed it as positive, and the rest as negative. Predict on the validation set and compute the loss Lvalidation.
- Repeat steps 4 to 6 several times, and draw a graph of Lvalidation against the number of iterations.
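The linear SVM steps above can be sketched in the same style. This is one possible formulation, assuming labels in {-1, +1} and minimizing the L2-regularized hinge loss with mini-batch sub-gradient descent; `batch_size`, `learning_rate`, and `C` are illustrative assumptions.

```python
import numpy as np

rng = np.random.default_rng(0)

def train_svm(X_train, y_train, X_val, y_val,
              batch_size=64, learning_rate=0.01, C=1.0, num_iters=200):
    """Mini-batch sub-gradient descent for a linear SVM; labels in {-1, +1}."""
    n, d = X_train.shape
    w = np.zeros(d)                            # step 2: zero initialization
    val_losses = []
    for _ in range(num_iters):
        idx = rng.choice(n, size=batch_size)   # step 4: random mini-batch
        Xb, yb = X_train[idx], y_train[idx]
        margin = yb * (Xb @ w)
        mask = margin < 1                      # samples violating the margin
        # sub-gradient G of (1/2)||w||^2 + C * mean(hinge loss)
        g = w - C * (Xb[mask].T @ yb[mask]) / batch_size
        w -= learning_rate * g                 # step 5: plain SGD update
        # step 6: validation hinge loss Lvalidation
        val_losses.append(np.mean(np.maximum(0.0, 1.0 - y_val * (X_val @ w))))
    return w, val_losses
```

Here step 6's threshold sits at a score of 0: `y_pred = np.sign(X_val @ w)` classifies positive versus negative, and plotting `val_losses` against the iteration count gives the graph for step 7.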