Question

Explain in simple terms the algorithm used by Decision Trees.

  1. Explain in simple terms the algorithm used by Decision Trees.

Homework Answers

Answer #1

Decision Tree algorithm has a place with the group of regulated learning algorithms. In contrast to other regulated learning algorithms, decision tree algorithm can be utilized for tackling relapse and order issues as well. The overall intention of utilizing Decision Tree is to make a preparation model which can use to foresee class or estimation of target factors by taking in decision rules induced from earlier information preparing information. The understanding degree of Decision Trees algorithm is so natural contrasted and other arrangement algorithms. The decision tree algorithm attempts to take care of the issue, by utilizing tree portrayal. Each inward hub of the tree relates to a trait, and each leaf hub compares to a class mark.

Decision Tree Algorithm Pseudocode

1-Place the best property of the informational index at the base of the tree.

2-Split the preparation set into subsets. Subsets ought to be made so that every subset contains information with a similar incentive for a quality.

3-Repeat stage 1 and stage 2 on every subset until you discover leaf hubs in all the parts of the tree.

Presumptions while making Decision Tree-

- At the start, the entire preparing set is considered as the root.

- Feature esteems are liked to be absolute. On the off chance that the qualities are constant, at that point they are discretized preceding structure the model.

- Records are dispersed recursively based on characteristic qualities.

- Order to putting qualities as root or inner hub of the tree is finished by utilizing some measurable methodology.

Characteristics Selection

In the event that dataset comprises of "n" qualities at that point choosing which credit to put at the root or at various degrees of the tree as inward hubs is a confounded advance. By just haphazardly choosing any hub to be the root can't settle the issue. In the event that we follow an irregular methodology, it might give us awful outcomes with low precision.

Data Gain

By utilizing data gain as a standard, we attempt to gauge the data contained by each property. We are going to utilize a few focuses deducted from data hypothesis. To quantify the irregularity or vulnerability of an arbitrary variable X is characterized by Entropy.

Gini Index

Gini Index is a measurement to quantify how frequently an arbitrarily picked component would be erroneously recognized. It implies a property with lower gini list ought to be liked.

Overfitting

Overfitting is a down to earth issue while building a decision tree model. The model is having an issue of overfitting is viewed as when the algorithm keeps on going further and more profound in the to lessen the preparation set mistake however results with an expanded test set blunder i.e, Accuracy of expectation for our model goes down. It by and large happens when it fabricates numerous branches because of exceptions and abnormalities in information.

Two methodologies which we can use to abstain from overfitting are:

1-Pre-Pruning

2-Post-Pruning

Know the answer?
Your Answer:

Post as a guest

Your Name:

What's your source?

Earn Coins

Coins can be redeemed for fabulous gifts.

Not the answer you're looking for?
Ask your own homework help question
Similar Questions
Explain in simple terms the algorithm used by neural networks.
Explain in simple terms the algorithm used by neural networks.
Describe a specific example of the use of decision trees in business context. For example, decision...
Describe a specific example of the use of decision trees in business context. For example, decision trees can be used to forecast home buyer’s willingness to close a deal. Directions: Explain what type of data (specific columns) can be used in the example you will find and what type of trees (classification or regression) will be used.
Please find and share algorithm about red/black trees. Explain it, give 2 real world usages and...
Please find and share algorithm about red/black trees. Explain it, give 2 real world usages and discuss its efficiency with your friends!
Please find and share algorithm about red/black trees. Explain it, give 2 real world usages and...
Please find and share algorithm about red/black trees. Explain it, give 2 real world usages and discuss its efficiency with your friends! Please don't copy my previous question:)
Write the SIMPLE algorithm to solve the incompressible flow, and explain each step briefly.
Write the SIMPLE algorithm to solve the incompressible flow, and explain each step briefly.
Explain the **Apriori property** that is used in the **Apriori algorithm** What is the **holdout method**?...
Explain the **Apriori property** that is used in the **Apriori algorithm** What is the **holdout method**? Explain. Draw a diagram.
Which type of sorting algorithm is best used to sort sequentially? Explain
Which type of sorting algorithm is best used to sort sequentially? Explain
Explain how the following terms are different: simple and stratified, in terms of the integumentary system...
Explain how the following terms are different: simple and stratified, in terms of the integumentary system and tissue
A systems analyst tests a new algorithm designed to work faster than the currently-used algorithm. Each...
A systems analyst tests a new algorithm designed to work faster than the currently-used algorithm. Each algorithm is applied to a group of 89 sample problems. The new algorithm completes the sample problems with a mean time of 17.64 hours. The current algorithm completes the sample problems with a mean time of 17.75 hours. The standard deviation is found to be 4.5614.561 hours for the new algorithm, and 4.210 hours for the current algorithm. Conduct a hypothesis test at the...
A systems analyst tests a new algorithm designed to work faster than the currently-used algorithm. Each...
A systems analyst tests a new algorithm designed to work faster than the currently-used algorithm. Each algorithm is applied to a group of 36 sample problems. The new algorithm completes the sample problems with a mean time of 22.83 hours. The current algorithm completes the sample problems with a mean time of 23.06 hours. The standard deviation is found to be 3.511 hours for the new algorithm, and 3.471 hours for the current algorithm. Conduct a hypothesis test at the...
ADVERTISEMENT
Need Online Homework Help?

Get Answers For Free
Most questions answered within 1 hours.

Ask a Question
ADVERTISEMENT