1) When we fit a model to data, which is typically larger?
a) Test Error b)...
1) When we fit a model to data, which is typically larger?
a) Test Error b) Training Error
2) What are reasons why test error could be LESS than training
error? (Pick all that applies)
a) By chance, the test set has easier cases than the training
set.
b) The model is highly complex, so training error systematically
overestimates test error
c) The model is not very complex, so training error
systematically overestimates test error
3) Suppose we want to...
4. Suppose we have a classifier that classifies if an image
contains a Human face or...
4. Suppose we have a classifier that classifies if an image
contains a Human face or not. Suppose we have 100 images, 50 of
which contain human faces. If our classifier accurately classifies
that 30 images contains human faces, but at the same time wrongly
classifies that 30 images contains human faces. What is the
precision and recall. (10 points)
5. Suppose we use k-fold cross validation, how many times should
we train the classifier? (10 points)
6. I have...
Activity I: Mark and Recapture Method
Objective:
Perform the mark-recapture method to estimate the number of...
Activity I: Mark and Recapture Method
Objective:
Perform the mark-recapture method to estimate the number of
individuals within an a certain area.
Materials:
Plastic container with lid (represents a pond)
Bag or box of small objects such as editable goldfish, beans,
macaroni noodles, etc.
(represents the fish)
Permanent Marker
Calculator
Procedure:
Add all of your fish to the pond (no more than 150 fish)
Randomly capture 40 fish from the pond.
Mark those 20 fish with the marker.
Return the...
Inductive generalizations have this form:1. X percent of the
observed members of group A have property...
Inductive generalizations have this form:1. X percent of the
observed members of group A have property B. (the sample)2.Thus,
about X percent of A have property B. (the generalization from the
sample)For instance: 1. 40% of the pickles you have pulled out of
the barrel are very good.2.Therefore, about 40% of the pickles in
the barrel are very good.To assess the strength or weakness of
inductive generalizations (arguments which generalize from sample
sets), logicians run through 3 checks:
CHECK #1:...
2) Airline accidents: According to the U.S. National
Transportation Safety Board, the number of airline accidents...
2) Airline accidents: According to the U.S. National
Transportation Safety Board, the number of airline accidents by
year from 1983 to 2006 were 23, 16, 21, 24, 34, 30, 28, 24, 26, 18,
23, 23, 36, 37, 49, 50, 51, 56, 46, 41, 54, 30, 40, and 31.
a. For the sample data, compute the mean and its standard error
(from the standard deviation), and the median.
b. Using R, compute bootstrap estimates of the mean, median and
25% trimmed...