Regression models are very useful for building predictive models. For this exercise, we will play with some different data yet again. I do this for two reasons. First, it gets a bit old to use the same data set over and over again. Second, regression analysis relies on the notion that you are estimating a functional relationship between variables, in which the predictors explain variation in the response, and the various things measured in the classic Rice Center data set are not causally related in that way.

So instead, we will be using car data described as:

Cars were selected at random from among 1993 passenger car models that were listed in both the Consumer Reports issue and the PACE Buying Guide. Pickup trucks and Sport/Utility vehicles were eliminated due to incomplete information in the Consumer Reports source. Duplicate models (e.g., Dodge Shadow and Plymouth Sundance) were listed at most once.

The data can be loaded into your session as:

library( MASS )
names( Cars93 )
 [1] "Manufacturer"       "Model"              "Type"              
 [4] "Min.Price"          "Price"              "Max.Price"         
 [7] "MPG.city"           "MPG.highway"        "AirBags"           
[10] "DriveTrain"         "Cylinders"          "EngineSize"        
[13] "Horsepower"         "RPM"                "Rev.per.mile"      
[16] "Man.trans.avail"    "Fuel.tank.capacity" "Passengers"        
[19] "Length"             "Wheelbase"          "Width"             
[22] "Turn.circle"        "Rear.seat.room"     "Luggage.room"      
[25] "Weight"             "Origin"             "Make"              
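
Before fitting anything, it can help to take a quick first look at the data. The following is only a sketch of one way to do that, not part of the assignment: it checks the size of the data frame, summarizes the response variable `Weight`, and counts missing values per column, since rows with missing predictor values are dropped by `lm()` by default.

```r
library(MASS)  # provides the Cars93 data frame

# Number of rows (car models) and columns (variables)
dim(Cars93)

# Distribution of the response variable for this exercise
summary(Cars93$Weight)

# Missing values per column; only columns with at least one NA are shown
na_counts <- colSums(is.na(Cars93))
na_counts[na_counts > 0]
```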

The Question

The sole question for this topic is to have you figure out the best regression model that describes the variation in car weight in the Cars93 data set. In your evaluation of alternative models, make sure you are looking at the normality of predictors, the relative fit of the alternative models, etc.
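
As a starting point, here is a minimal sketch of how alternative models might be compared. The predictors used (`EngineSize` and `Horsepower`) are illustrative assumptions only, not a suggested answer, and the choice of AIC, adjusted R-squared, and the Shapiro-Wilk test is just one reasonable set of tools among several.

```r
library(MASS)  # provides the Cars93 data frame

# Candidate models; these predictors are illustrative choices only
fit_size  <- lm(Weight ~ EngineSize, data = Cars93)
fit_power <- lm(Weight ~ Horsepower, data = Cars93)
fit_both  <- lm(Weight ~ EngineSize + Horsepower, data = Cars93)

# Relative fit: a lower AIC indicates more support for a model
aic_table <- AIC(fit_size, fit_power, fit_both)
aic_table

# Variance explained, adjusted for the number of predictors
adj_r2 <- sapply(
  list(size = fit_size, power = fit_power, both = fit_both),
  function(m) summary(m)$adj.r.squared
)
adj_r2

# Normality checks for a predictor and for the residuals of one model
shapiro.test(Cars93$EngineSize)
shapiro.test(residuals(fit_both))

# Visual check: points should fall near the line if residuals are normal
qqnorm(residuals(fit_both))
qqline(residuals(fit_both))
```

The same pattern extends to any other set of predictors: fit each candidate with `lm()`, then compare them in a single `AIC()` call.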
