Book a Demo!
CoCalc Logo Icon
StoreFeaturesDocsShareSupportNewsAboutPoliciesSign UpSign In
UBC-DSCI
GitHub Repository: UBC-DSCI/dsci-100-assets
Path: blob/master/2019-spring/slides/10_regression3.ipynb
2051 views
Kernel: R

DSCI 100 - Introduction to Data Science

Lecture 10 - Introduction to linear regression

2019-03-21

Final exam details:

  • Friday, April 26th at 19:00 in BUCH A102

  • Format - same as quizzes (open book, on Canvas)

  • Length - it will be longer than the quizzes...

  • Covers entire course content, with more questions on topics that were not covered on the quizzes

Multivariate regression problem

For example, use house size and number of baths to predict house sale price

k-nn multivariate regression

linear multivariate regression

The two sides of regression

  1. Regression for prediction

  2. Regression for understanding/description (statistical inference)

Regression for understanding/description questions

  • How strong is the relationship between the outcome/response variable and the explanatory variable(s)?

  • Which of the explanatory variables contribute to the outcome/response variable?

  • How accurately can we estimate the effect of each explanatory variable on the outcome/response variable?

Simple example of statistical inference

Question: What proportion of undergraduate students have an iphone?

We can't possibly ask all undergraduates if they have an iphone... so what can we do?

What if we randomly selected a subset and then asked them if they have an iphone? We could then calculate a proportion that we could use as an estimate of the true population proportion!

We often want to report a plausible range, along with the proportion we estimate...

Population parameters can be proportions (iPhone example) or regression line slopes.

What did we learn today?