Question

How does Q-learning differ from state-value-based learning in reinforcement learning?

Homework Answers

Answer #1

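Q-learning does not need to follow the policy it is learning about. The agent can act in an exploratory, partly random way, while the update always assumes the greedy (highest-valued) action in the next state.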

It is defined over state-action pairs: Q(S, A) estimates how good it is to take action A in state S.

It is an off-policy reinforcement learning method, so the best action for the current state can be read directly from the learned Q-values by choosing the action with the highest Q(S, A).

It is model-free: the agent knows nothing about the transition dynamics and discovers good and bad actions by trial and error.
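As a rough illustration of the points above, here is a minimal sketch of tabular Q-learning. The five-state chain environment, the `step` function, and all hyperparameter values are hypothetical choices made for this example, not part of the original answer. The agent explores with an epsilon-greedy behaviour policy, but the update target uses the maximum Q-value of the next state, and the agent never looks inside `step`.

```python
import random

N_STATES = 5      # hypothetical toy chain: states 0..4, state 4 is terminal
ACTIONS = (0, 1)  # 0 = move left, 1 = move right

def step(state, action):
    """Environment dynamics. The agent only sees the returned values."""
    if action == 0:
        next_state = max(0, state - 1)
    else:
        next_state = min(N_STATES - 1, state + 1)
    done = next_state == N_STATES - 1
    reward = 1.0 if done else 0.0
    return next_state, reward, done

def q_learning(episodes=500, alpha=0.5, gamma=0.9, epsilon=0.2, seed=0):
    rng = random.Random(seed)
    q = [[0.0 for _ in ACTIONS] for _ in range(N_STATES)]
    for _ in range(episodes):
        state, done = 0, False
        while not done:
            # Behaviour policy: explore with probability epsilon,
            # otherwise act greedily (ties broken at random).
            if rng.random() < epsilon:
                action = rng.choice(ACTIONS)
            else:
                best = max(q[state])
                action = rng.choice([a for a in ACTIONS if q[state][a] == best])
            next_state, reward, done = step(state, action)
            # Q-learning target: reward plus discounted value of the
            # greedy action in the next state (zero if terminal).
            target = reward if done else reward + gamma * max(q[next_state])
            q[state][action] += alpha * (target - q[state][action])
            state = next_state
    return q

q_table = q_learning()
# Read the learned policy directly from the Q-table.
greedy_policy = [max(ACTIONS, key=lambda a: q_table[s][a])
                 for s in range(N_STATES - 1)]
```

In this toy setting the learned greedy policy moves right in every non-terminal state, and it is obtained from the Q-table alone, without any transition model.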

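In contrast, state-value-based learning estimates V(S), the expected total reward starting from state S when the agent acts according to some policy. Because V(S) says nothing about individual actions, the agent needs prior knowledge of the effect of its actions (a transition model) in order to choose among them.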

It computes a cumulative reward estimate for each state, and the agent picks the action that leads to the successor state with the highest value.
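The state-value approach can be sketched in a similar way. This is only an illustration under assumptions: the five-state chain, the `model` function, and the fixed "always move right" policy are hypothetical. The values are computed for a given policy, and choosing an action requires calling the known model to see which successor state each action leads to.

```python
N_STATES = 5      # hypothetical toy chain: states 0..4, state 4 is terminal
ACTIONS = (0, 1)  # 0 = move left, 1 = move right
GAMMA = 0.9

def model(state, action):
    """Transition model known to the agent: returns (next_state, reward)."""
    if action == 0:
        next_state = max(0, state - 1)
    else:
        next_state = min(N_STATES - 1, state + 1)
    reward = 1.0 if next_state == N_STATES - 1 else 0.0
    return next_state, reward

def evaluate_policy(policy, sweeps=200):
    """Iterative policy evaluation: V(s) = r + gamma * V(s') under `policy`."""
    v = [0.0] * N_STATES  # the terminal state's value stays 0
    for _ in range(sweeps):
        for s in range(N_STATES - 1):
            next_state, reward = model(s, policy[s])
            v[s] = reward + GAMMA * v[next_state]
    return v

def lookahead_action(v, state):
    """Pick the action whose successor looks best; this needs the model."""
    def backed_up_value(action):
        next_state, reward = model(state, action)
        return reward + GAMMA * v[next_state]
    return max(ACTIONS, key=backed_up_value)

always_right = [1, 1, 1, 1]  # the policy being evaluated
values = evaluate_policy(always_right)
chosen = [lookahead_action(values, s) for s in range(N_STATES - 1)]
```

Here `values` scores each state under the given policy, and `lookahead_action` cannot work without `model`, which is the practical difference from reading actions off a Q-table.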

Conclusion

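Q-learning learns action values, from which the best action can be chosen directly without a model, while state-value-based learning evaluates states under a given policy and needs a model of the environment to turn those values into actions.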
