How does Q-learning different from the state value-based learning in Reinforcement Learning

Question

Question

How does Q-learning different from the state value-based learning in Reinforcement Learning

Engineering Computer-Science

0 0

Add a comment Transcribed image text

Answer 1

Answer #1

In Q-learning functions perform randomly that do not need any policy.It follows greedy approach.

It is defined for state and action.Q(S,A) to determine of it is good to take action A at state S.

It is a policy reinforcement learning ,so that the best action could be taken for a current state.

It is a model free learning, where agent does not know anything about transition. It discovers about good and bad action by trial and error.

But

In state value based learning the agent has prior knowledge about effect of its action.It is total reward starting from state S and it acts according to some policy.

It calculates cumulative score for each state and state with maximum value gets selected.

Conclusion

Q learning is based on learning policy to take the best action while value based performs according to predefined policy.

0 0

Add a comment

How does Q-learning different from the state value-based learning in Reinforcement Learning

Homework Answers

Post as a guest

Earn Coins

Not the answer you're looking for?

Similar Questions

Question # 6: What is the difference between associative learning, reinforcement, conditioned stimuli, and discriminative stimuli?...

1.How does the movie Higher Learning reflect its social context? ? How are the social the...

what does Vcr mean ? how can a beam carry shear force without shear reinforcement

Q-How many different 5-card hands can be dealt from a standard 52-card deck? Q How many...

2. Why does organic growth often creates more value than growth from acquisitions? Describe how different...

How does level of analysis impact learning in organizations? How does this information impact scenario planning...

How does a nonparametic test different from a parametic test?

How can an instructor use performance-based assessment to verify psychomotor skill learning? Include an example of...

For a harmonic oscillator in the ground-state, determine △p and △x. How does the value of...

how will you plan learning experiences based on children's strengths, interests, abilities and knowledge?

Need Online Homework Help?

Active Questions