Question

How does Q-learning different from the state value-based learning in Reinforcement Learning

How does Q-learning different from the state value-based learning in Reinforcement Learning

Homework Answers

Answer #1

In Q-learning functions perform randomly that do not need any policy.It follows greedy approach.

It is defined for state and action.Q(S,A) to determine of it is good to take action A at state S.

It is a policy reinforcement learning ,so that the best action could be taken for a current state.

It is a model free learning, where agent does not know anything about transition. It discovers about good and bad action by trial and error.

But

In state value based learning the agent has prior knowledge about effect of its action.It is total reward starting from state S and it acts according to some policy.

It calculates cumulative score for each state and state with maximum value gets selected.

Conclusion

Q learning is based on learning policy to take the best action while value based performs according to predefined policy.

Know the answer?
Your Answer:

Post as a guest

Your Name:

What's your source?

Earn Coins

Coins can be redeemed for fabulous gifts.

Not the answer you're looking for?
Ask your own homework help question
Similar Questions
Question # 6: What is the difference between associative learning, reinforcement, conditioned stimuli, and discriminative stimuli?...
Question # 6: What is the difference between associative learning, reinforcement, conditioned stimuli, and discriminative stimuli? Question # 7: What is the difference between incentive salience and goal-directed behavior? Question # 8: Compare and contrast the drive theory of drug addiction and the opponent-process theory of drug addiction? Question # 9: How does animal models of drug self-administration and drug reinstatement related to human models of drug relapse? Question # 10: How does the nucleus accumbens relates to the theories...
1.How does the movie Higher Learning reflect its social context? ? How are the social the...
1.How does the movie Higher Learning reflect its social context? ? How are the social the social conditions depicted in the movie different or similar than those today?
what does Vcr mean ? how can a beam carry shear force without shear reinforcement
what does Vcr mean ? how can a beam carry shear force without shear reinforcement
Q-How many different 5-card hands can be dealt from a standard 52-card deck? Q How many...
Q-How many different 5-card hands can be dealt from a standard 52-card deck? Q How many different passwords of length 8 can be made if there must be one upper-caseletter, one lower-case letter, and 6 digits?2.
2. Why does organic growth often creates more value than growth from acquisitions? Describe how different...
2. Why does organic growth often creates more value than growth from acquisitions? Describe how different types of organic growth might create different amounts of value?
How does level of analysis impact learning in organizations? How does this information impact scenario planning...
How does level of analysis impact learning in organizations? How does this information impact scenario planning for an organization?
How does a nonparametic test different from a parametic test?
How does a nonparametic test different from a parametic test?
How can an instructor use performance-based assessment to verify psychomotor skill learning? Include an example of...
How can an instructor use performance-based assessment to verify psychomotor skill learning? Include an example of how you might complete a performance based assessment for a completely online course. Remember to use the vocabulary from the text in your discussion. 
For a harmonic oscillator in the ground-state, determine △p and △x. How does the value of...
For a harmonic oscillator in the ground-state, determine △p and △x. How does the value of △p△x compare to the Heisenberg Uncertainty Principle?
how will you plan learning experiences based on children's strengths, interests, abilities and knowledge?
how will you plan learning experiences based on children's strengths, interests, abilities and knowledge?
ADVERTISEMENT
Need Online Homework Help?

Get Answers For Free
Most questions answered within 1 hours.

Ask a Question
ADVERTISEMENT