1) When we fit a model to data, which is typically larger?
a) Test Error b)...
1) When we fit a model to data, which is typically larger?
a) Test Error b) Training Error
2) What are reasons why test error could be LESS than training
error? (Pick all that applies)
a) By chance, the test set has easier cases than the training
set.
b) The model is highly complex, so training error systematically
overestimates test error
c) The model is not very complex, so training error
systematically overestimates test error
3) Suppose we want to...
Activity I: Mark and Recapture Method
Objective:
Perform the mark-recapture method to estimate the number of...
Activity I: Mark and Recapture Method
Objective:
Perform the mark-recapture method to estimate the number of
individuals within an a certain area.
Materials:
Plastic container with lid (represents a pond)
Bag or box of small objects such as editable goldfish, beans,
macaroni noodles, etc.
(represents the fish)
Permanent Marker
Calculator
Procedure:
Add all of your fish to the pond (no more than 150 fish)
Randomly capture 40 fish from the pond.
Mark those 20 fish with the marker.
Return the...
Inductive generalizations have this form:1. X percent of the
observed members of group A have property...
Inductive generalizations have this form:1. X percent of the
observed members of group A have property B. (the sample)2.Thus,
about X percent of A have property B. (the generalization from the
sample)For instance: 1. 40% of the pickles you have pulled out of
the barrel are very good.2.Therefore, about 40% of the pickles in
the barrel are very good.To assess the strength or weakness of
inductive generalizations (arguments which generalize from sample
sets), logicians run through 3 checks:
CHECK #1:...
2) Airline accidents: According to the U.S. National
Transportation Safety Board, the number of airline accidents...
2) Airline accidents: According to the U.S. National
Transportation Safety Board, the number of airline accidents by
year from 1983 to 2006 were 23, 16, 21, 24, 34, 30, 28, 24, 26, 18,
23, 23, 36, 37, 49, 50, 51, 56, 46, 41, 54, 30, 40, and 31.
a. For the sample data, compute the mean and its standard error
(from the standard deviation), and the median.
b. Using R, compute bootstrap estimates of the mean, median and
25% trimmed...
Problem 1: Relations among Useful Discrete Probability
Distributions. A Bernoulli experiment consists of
only one trial...
Problem 1: Relations among Useful Discrete Probability
Distributions. A Bernoulli experiment consists of
only one trial with two outcomes (success/failure) with probability
of success p. The Bernoulli distribution
is
P (X = k) =
pkq1-k,
k=0,1
The sum of n independent Bernoulli trials forms a binomial
experiment with parameters n and p. The binomial probability
distribution provides a simple, easy-to-compute approximation with
reasonable accuracy to hypergeometric distribution with parameters
N, M and n when n/N is less than or equal...