Midterm 2.... Flashcards Preview

Question 1

Q

Convert odds of 1:8 to a probability.

Answer

A

1/8 = .125 –> .125/(.125 + 1) = 11.11% probability. Because 1:8 means 1 out of every 9.

Question 2

Q

Odds EQ for Probability(for/against)

Answer

A

odds(for/against)/odds(for/against) + 1

Question 3

Q

What does a logistic regression model predict?

Answer

A

LogOdds! This will have a range of (-infinity, infinity)

Question 4

Q

How do you convert logOdds to odds?

Answer

A

e^logOdds = odds

Question 5

Q

How do you convert logOdds to probability?

Answer

A

e^logOdds/(e^logOdds + 1)

Question 6

Q

What does logOdds equal in terms of ln(x)?

Answer

A

ln(odds) = logOdds or log(odds) = logOdds

Question 7

Q

What is the range of odds (what are they bound by?)

Answer

A

[0, infinity)

Question 8

Q

What is the range of logOdds (what are they bound by?)

Answer

A

(-infinity, infinity)

Question 9

Q

What type of estimation model is logistic regression, and why?

Answer

A

Class probability estimation model. It is using a numeric value to estimate the probability of a categorical variable! Ex. What is the chance Marc goes to class? 0.3

Question 10

Q

What loss function does support vector machine use?

Answer

A

Hinge loss

Question 11

Q

Hinge loss (loss function)

Answer

A

An instance on the wrong side of the line does not incur a penalty. ONLY when it’s on the wrong side and outside of the margin.

Question 12

Q

Zero-one loss

Answer

A

An instance incurs a loss of 0 for a correct decision and 1 for an incorrect decision.

Question 13

Q

Squared error

Answer

A

Specifies a loss equal to the square of the distance from the boundary. A further instance would have a greater error. Usually used for numeric value prediction rather than classification.

Question 14

Q

Loss function

Answer

A

Determines how much penalty should be assigned to an instance based on the model’s predictive value

Question 15

Q

Finish this sentence. Accuracy of training data is sometimes called…

Answer

A

In-sample accuracy (train) vs. out-of sample accuracy (test)

Question 16

Q

When is logistic regression more accurate vs. decision tree and vice versa?

Answer

A

LR is more accurate with a smaller data set, DT on bigger sets

Question 17

Q

What’s the point of regularization?

Answer

A

It gives a penalty to more complicated models because those are more prone to overfitting.

Question 18

Q

In a confusion matrix what are the column headers? Row headers?

Answer

A

Column: Actual y and n
Row: Predicted y and n

Question 19

Q

False positive

Answer

A

Predicted positive, actual negative

Question 20

Q

False negative

Answer

A

Predicted negative, actual positive

Question 21

Q

True negative

Answer

A

Predicted negative, actual negative

Question 22

Q

True positive

Answer

A

Predicted positive, actual positive

Question 23

Q

True positive rate

Answer

A

True positive / all actual positives (both true and false)

Question 24

Q

False positive rate

Answer

A

False positive / all actual negatives (both true and false)

Question 25

Q

Positive predictive value (PPV)

Answer

A

True positive / all predicted positive (both correct and incorrect)

Question 26

Q

What’s the expected value of a game of roulette? Probability of hitting black = 48%. Bet = $100

Answer

A

EV = (0.48)(100) + (1-0.48)(-100) = - 4

Question 27

Q

What are the two uses for expected value?

Answer

A

Inform how to use our classifier for individual predictions.
Compare classifiers.

Question 28

Q

Class priors

Answer

A

The proportion of positive and negative instances in your data set. Ex. 40 of 100 people would buy a new car next year if they could. p(p) = .4, p(n) = .6

Question 29

Q

Two critical conditions underlying profit calculations:

Answer

A

Class priors

2. Costs and benefits

Question 30

Q

Where is the perfect point on an ROC curve (hint: x axis is FPR, y axis is TPR)

Answer

A

Top left. FPR of 0, TPR of 1

Question 31

Q

How are ROC curves created?

Answer

A

TPR and FPR are found at every cutoff point. Ex. Titanic threshold of 0 to 1, values would be found at every .01

Question 32

Q

What is the Area Under the ROC Curve used for (AUC)?

Answer

A

AUC is used when a single number is needed to summarize performance or when nothing

Question 33

Q

What are two alternatives to the ROC curve?

Answer

A

Cumulative response curve

2. Lift curve

Question 34

Q

How is a lift curve calculated

Answer

A

Cumulative response curve values. y/x

Question 35

Q

How can you calculate cumulative response curve values from lift?

Answer

A

Lift: x axis * y axis. Contact 20% with a lift of 2.5 means it will be .5 on the cumulative response curve.

Question 36

Q

Euclidian distance

Answer

A

Distance formula but using it with attributes between two people.

Question 37

Q

Manhattan distance

Answer

A

Distance using the two axes rather than the hypotenuse

Question 38

Q

Euclidian vs. Manhattan

Answer

A

Euclidian uses hypotenuse of triangle (where two instances are the edges). Manhattan uses two bases.

Question 39

Q

Nearest neighbors

Answer

A

Judge similarity by calculating distance to nearest neighbors and using those results to make a prediction. Ex. 3 nearest neighbors –> 2 no’s, 1 yes. Instance should be a no!

Question 40

Q

How do we give weight to closest neighbors

Answer

A

With similarity weight: Inverse of distance squared –> contribution = sim. weight/(sum of all sim. weights)

Question 41

Q

How do we get to a probability from nearest neighbors

Answer

A

Sum of all “no” contributions = p(no)

Question 42

Q

How do we avoid overfitting the data with nearest neighbors?

Answer

A

By choosing a higher k = # of neighbors!

Question 43

Q

What are the three issues with nearest neighbors?

Answer

A

Dimensionality and domain knowledge. Unimportant features might have too much influence over important ones!
Fast to train, slow to predict. Prediction requires plotting the entire dataset.
Easy to interpret, but no “knowledge” extracted from data.

Question 44

Q

Hierarchical clustering

Answer

A

Consider individual points and distance between them. Ex. Points with a Euclidian distance smaller than x will be clustered.

Question 45

Q

Link function (clustering)

Answer

A

Minimum req. that must be met before an item is clustered.

Question 46

Q

Centroid-based clustering

Answer

A

Decide k (number of centroids) and groups will be made around those. Points are grouped on which centroid they’re closest to. When a point is added the centroid is repositioned.

Midterm 2.... Flashcards Preview

MGMT3200 Business Analytics > Midterm 2.... > Flashcards

Decks in MGMT3200 Business Analytics Class (1):

Brainscape's Knowledge GenomeTM

Midterm 2.... Flashcards Preview

MGMT3200 Business Analytics > Midterm 2.... > Flashcards

Brainscape's Knowledge Genome^TM