Question

1) For a given text, the outcome of the sentiment analysis (unknown variable of sentiment) will...

1) For a given text, the outcome of the sentiment analysis (unknown variable of sentiment) will be the same irrespective of the pre-built dictionaries used True or False

------------------------------------------------------------

------------------------------------------------------------

2) When growing a decision tree, when do you stop branching at a node?
a) The node is pure or the number of observations in the node is less than/equal to a preset threshold value

b) The node is impure or the number of observations in the node is less than/equal to a preset threshold value

c) The node is pure or the number of observations in the node is more than/equal to a preset threshold value

d) The node has a single observation

------------------------------------------------------------

------------------------------------------------------------

3) A small (close to 0) value of GINI index indicates that ______

a) The node is pure

b) The entropy of the node is very small

c) The node mostly contains observations from a single class

d) All of the above

------------------------------------------------------------

------------------------------------------------------------

4) Which of the following is true about Random Forests and Bagging?


1) In Random Forest, the decision trees are trained on a subset of the samples and a subset of the predictors at each split

2) In Random Forest, the decision trees are trained on a subset of the samples and the complete set of predictors at each split

3) In Bagging, the decision trees are trained on a subset of the samples and a subset of the predictors at each split

4) In Bagging, the decision trees are trained on a subset of the samples and the complete set of predictors at each split

a) 1 and 3

b) 2 and 3

c) 1 and 4

d) 2 and 4

Homework Answers

Answer #1

1. False. The outcome will not be same and it will depend on the pre-built dictionaries.

2. The branching will stop when the node is pure or the number of observations in the node is less than/equal to the threshold value. A) is the correct option.

3. If gini index = 0, it means all the observations are from a single class or the node is pure hence small entropy. Hence, all the options are correct. D) is the correct option here.

4. In random forest, only a subset of features are used while in Bagging all the features are used. Hence, 1 and 4 are correct. Correct option is C)

Know the answer?
Your Answer:

Post as a guest

Your Name:

What's your source?

Earn Coins

Coins can be redeemed for fabulous gifts.

Not the answer you're looking for?
Ask your own homework help question
Similar Questions
1. Given a poisson random variable, X = # of events that occur, where the average...
1. Given a poisson random variable, X = # of events that occur, where the average number of events in the sample unit (μ) is given on the right, determine the smallest critical value (critical value = c) for the random variable such that you have at least a 99% probability of finding c or fewer events.    μ = 4    2. Given a binomial random variable X where X = # of operations in a local hospital that...
In the following probability​ distribution, the random variable x represents the number of activities a parent...
In the following probability​ distribution, the random variable x represents the number of activities a parent of a 6th to 8th​-grade student is involved in. Complete parts​ (a) through​ (f) below. x 0 1 2 3 4 ​P(x) 0.395 0.075 0.199 0.195 0.136 ​(a) Verify that this is a discrete probability distribution. This is a discrete probability distribution because the sum of the probabilities is ___and each probability is ___ (Less than or equal to 1; Greater than or equal...
1- Helena Corporation declared a 2-for-1 stock split on 8,000 shares of $6 par value common...
1- Helena Corporation declared a 2-for-1 stock split on 8,000 shares of $6 par value common stock. If the market price of the stock had been $25 a share before the split, the par value, number of shares, and approximate market value after the split would be: Par Value No. of Shares Market Value A. $ 6.00 16,000 $ 12.50 B. $ 6.00 8,000 $ 25.00 C. $ 3.00 16,000 $ 12.50 D. $ 3.00 16,000 $ 25.00 2- The...
1. Let x be a continuous random variable. What is the probability that x assumes a...
1. Let x be a continuous random variable. What is the probability that x assumes a single value, such as a (use numerical value)? 2. The following are the three main characteristics of a normal distribution. The total area under a normal curve equals _____. A normal curve is ___________ about the mean. Consequently, 50% of the total area under a normal distribution curve lies on the left side of the mean, and 50% lies on the right side of...
1. Use the given values of n and p to find the minimum usual value muμminus−2sigmaσ...
1. Use the given values of n and p to find the minimum usual value muμminus−2sigmaσ and the maximum usual value muμplus+2sigmaσ. Round to the nearest hundredth unless otherwise noted. nequals=10141014​; pequals=0.860.86 Five males with an​ X-linked genetic disorder have one child each. The random variable x is the number of children among the five who inherit the​ X-linked genetic disorder. Determine whether a probability distribution is given. If a probability distribution is​ given, find its mean and standard deviation....
1.A fair die is rolled once, and the number score is noted. Let the random variable...
1.A fair die is rolled once, and the number score is noted. Let the random variable X be twice this score. Define the variable Y to be zero if an odd number appears and X otherwise. By finding the probability mass function in each case, find the expectation of the following random variables: Please answer to 3 decimal places. Part a)X Part b)Y Part c)X+Y Part d)XY ——- 2.To examine the effectiveness of its four annual advertising promotions, a mail...
1. Single Factor Anova is used to decide whether risk level should be regarded as a...
1. Single Factor Anova is used to decide whether risk level should be regarded as a source of mutual fund return variation. Three levels of risk are employed: low, medium and high. The annual % return for random samples of 10 funds at each risk level is recorded. State the hypotheses of the test using correct notation and complete sentences. a) In a complete sentence (using the terms ‘risk level’ and ‘return’) explain the decision that would correspond to a...
1) Psychologists know that people find it easier to remember items at the beginning and at...
1) Psychologists know that people find it easier to remember items at the beginning and at the end of a list of items. Suppose you think that level of education might make a difference to the tendency to remember items in the middle of the list. You randomly select two groups of subjects, with N = 10 in each group.   Participants in Group 1 have 12 years of education, and members of group 2 have 20 years of education. Which...
Example 1: Among females in the US in a given age cohort, a diastolic blood pressure...
Example 1: Among females in the US in a given age cohort, a diastolic blood pressure is normally distributed with mean µ = 95 mm Hg and standard deviation σ = 15 mm Hg. a) What is the probability that a randomly selected woman has a diastolic blood pressure less than 90 mm Hg? b) What is the probability that she has a diastolic blood pressure greater than 95 mm Hg? c) What is the probability that she has a...
Data For Tasks 1-8, consider the following data: 7.2, 1.2, 1.8, 2.8, 18, -1.9, -0.1, -1.5,...
Data For Tasks 1-8, consider the following data: 7.2, 1.2, 1.8, 2.8, 18, -1.9, -0.1, -1.5, 13.0, 3.2, -1.1, 7.0, 0.5, 3.9, 2.1, 4.1, 6.5 In Tasks 1-8 you are asked to conduct some computations regarding this data. The computation should be carried out manually. All the steps that go into the computation should be presented and explained. (You may use R in order to verify your computation, but not as a substitute for conducting the manual computations.) A Random...
ADVERTISEMENT
Need Online Homework Help?

Get Answers For Free
Most questions answered within 1 hours.

Ask a Question
ADVERTISEMENT