M. Pawan Kumar
 
 

HOME

RESEARCH

PUBLICATIONS

GROUP

TALKS

TEACHING

CV

 

 

 

 

 

 

NOTES

Project Notes: Optimization Methods for Linear Regression
PDF

Lecture: Linear Regression
PPT   PDF

DATA SET

Please use the simplified version of the California Housing Data Set. Use the first 8 features as the input, and the final feature (median house price) as the target output. The entire data set consists of approximately 20,000 samples. You may wish to use a random subset of the data (say 500 samples) when developing your code. Once you're confident that your code is bug-free, use the entire data set to run your final set of experiments.

Note that the following modifications have been made to the original data set.

  • Samples with missing features have been deleted.
  • One feature (location of the house w.r.t. ocean/sea) has been removed.

ERRATA

Any errors found in the notes/code will be discussed here.