Question

1) For a given text, the outcome of the sentiment analysis (unknown variable of sentiment) will...

1) For a given text, the outcome of the sentiment analysis (unknown variable of sentiment) will be the same irrespective of the pre-built dictionaries used True or False

------------------------------------------------------------

------------------------------------------------------------

2) When growing a decision tree, when do you stop branching at a node?
a) The node is pure or the number of observations in the node is less than/equal to a preset threshold value

b) The node is impure or the number of observations in the node is less than/equal to a preset threshold value

c) The node is pure or the number of observations in the node is more than/equal to a preset threshold value

d) The node has a single observation

------------------------------------------------------------

------------------------------------------------------------

3) A small (close to 0) value of GINI index indicates that ______

a) The node is pure

b) The entropy of the node is very small

c) The node mostly contains observations from a single class

d) All of the above

------------------------------------------------------------

------------------------------------------------------------

4) Which of the following is true about Random Forests and Bagging?


1) In Random Forest, the decision trees are trained on a subset of the samples and a subset of the predictors at each split

2) In Random Forest, the decision trees are trained on a subset of the samples and the complete set of predictors at each split

3) In Bagging, the decision trees are trained on a subset of the samples and a subset of the predictors at each split

4) In Bagging, the decision trees are trained on a subset of the samples and the complete set of predictors at each split

a) 1 and 3

b) 2 and 3

c) 1 and 4

d) 2 and 4

Homework Answers

Answer #1

1. False. The outcome will not be same and it will depend on the pre-built dictionaries.

2. The branching will stop when the node is pure or the number of observations in the node is less than/equal to the threshold value. A) is the correct option.

3. If gini index = 0, it means all the observations are from a single class or the node is pure hence small entropy. Hence, all the options are correct. D) is the correct option here.

4. In random forest, only a subset of features are used while in Bagging all the features are used. Hence, 1 and 4 are correct. Correct option is C)

Know the answer?
Your Answer:

Post as a guest

Your Name:

What's your source?

Earn Coins

Coins can be redeemed for fabulous gifts.

Not the answer you're looking for?
Ask your own homework help question
Similar Questions
1. Given a poisson random variable, X = # of events that occur, where the average...
1. Given a poisson random variable, X = # of events that occur, where the average number of events in the sample unit (μ) is given on the right, determine the smallest critical value (critical value = c) for the random variable such that you have at least a 99% probability of finding c or fewer events.    μ = 4    2. Given a binomial random variable X where X = # of operations in a local hospital that...
In the following probability​ distribution, the random variable x represents the number of activities a parent...
In the following probability​ distribution, the random variable x represents the number of activities a parent of a 6th to 8th​-grade student is involved in. Complete parts​ (a) through​ (f) below. x 0 1 2 3 4 ​P(x) 0.395 0.075 0.199 0.195 0.136 ​(a) Verify that this is a discrete probability distribution. This is a discrete probability distribution because the sum of the probabilities is ___and each probability is ___ (Less than or equal to 1; Greater than or equal...
Suppose a baseball player had 229229 hits in a season. In the given probability​ distribution, the...
Suppose a baseball player had 229229 hits in a season. In the given probability​ distribution, the random variable X represents the number of hits the player obtained in a game. x 0 1 2 3 4 5 ​P(x) 0.13590.1359 0.49370.4937 0.26020.2602 0.07830.0783 0.02070.0207 0.01120.0112 ​(a) Compute and interpret the mean of the random variable X. mu Subscript xμxequals=nothing ​(Round to one decimal place as​ needed.) Which of the following interpretation of the mean is​ correct? A. The observed value of...
Suppose a baseball player had 231 hits in a season. In the given probability​ distribution, the...
Suppose a baseball player had 231 hits in a season. In the given probability​ distribution, the random variable X represents the number of hits the player obtained in a game. x 0 1 2 3 4 5 ​P(x) 0.1916 0.4193 0.2462 0.1047 0.0118 0.0264 ​(a) Compute and interpret the mean of the random variable X. mu Subscript xequals nothing ​(Round to one decimal place as​ needed.) Which of the following interpretation of the mean is​ correct? A. The observed value...
Suppose a baseball player had 208 hits in a season. In the given probability​ distribution, the...
Suppose a baseball player had 208 hits in a season. In the given probability​ distribution, the random variable X represents the number of hits the player obtained in a game. x 0 1 2 3 4 5 ​P(x) 0.1166 0.4774 0.2601 0.1094 0.0168 0.0197 ​(a) Compute and interpret the mean of the random variable X. mu Subscript xequals nothing ​(Round to one decimal place as​ needed.) Which of the following interpretation of the mean is​ correct? A. The observed value...
Suppose the following data represent the ratings​ (on a scale from 1 to​ 5) for a...
Suppose the following data represent the ratings​ (on a scale from 1 to​ 5) for a certain smart phone​ game, with 1 representing a poor rating. Complete parts​ (a) through​ (d) below. Stars   Frequency 1 2955 2 2537 3 3809 4 4537 5 10,637 a)Construct a discrete probability distribution for the random variable x. (round to 3 decimal places if needed) b) Graph the discrete probability distribution. Choose the correct graph below. ​(c) Compute and interpret the mean of the...
Suppose the following data represent the rating (on a scale from 1 to 5) for a...
Suppose the following data represent the rating (on a scale from 1 to 5) for a certain smart phone game, with 1 representing a poor rating. Complete parts (a) below Stars --- Frequency 1 =. 2524 2 =. 2947 3 =. 3955 4 =. 3938 5 = 10,170 (a) Construct a discrete probability distribution for the random variable X [ hint: p(xi) =fi-n] Stars (x)- P(x) 1    - ? 2        -       ? 3        -       ? 4        -...
1. Find the value z of a standard Normal variable Z that satisfies each of the...
1. Find the value z of a standard Normal variable Z that satisfies each of the following conditions. (If you use Table A, report the value of z that comes closest to satisfying the condition.) In each case, sketch a standard Normal curve with your value of z marked on the axis. 38% of the observations fall below z 70% of the observations fall above z 2.) Jorge scores 2090 on the SAT. Assuming that both tests measure the same...
1. Is it true that the sample mean is always equal to the population mean that...
1. Is it true that the sample mean is always equal to the population mean that we picked the sample from? Explain. 2. Is it true that the confidence interval is narrower for 95% confidence than for 90% confidence? Explain 3. Is it true that the Sample means are less variable than individual observations as n→∞ ? Explain 4. A newspaper article reports that the average income of Canadian adults is 45000 with a 90% confidence interval of 28000 to...
1- Helena Corporation declared a 2-for-1 stock split on 8,000 shares of $6 par value common...
1- Helena Corporation declared a 2-for-1 stock split on 8,000 shares of $6 par value common stock. If the market price of the stock had been $25 a share before the split, the par value, number of shares, and approximate market value after the split would be: Par Value No. of Shares Market Value A. $ 6.00 16,000 $ 12.50 B. $ 6.00 8,000 $ 25.00 C. $ 3.00 16,000 $ 12.50 D. $ 3.00 16,000 $ 25.00 2- The...