Implement more options for actions in the neural nets (one hots, outputs) #73

sofian · 2014-07-12T01:29:42Z

Right now, the neural network used for the Q(s,a) function always represents actions the same way: one graded input per action dimension. We should add the following options:

Treat each action as a separate output.
Encode each action as a "one hot" input.
Encode some action types as "one hot" and others as graded.
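
To make the options concrete, here is a minimal sketch in plain Python. It is not taken from the project's code: the function names, the `n_values` list (number of discrete values per action dimension), the scaling of graded inputs to [0, 1], and the reading of "separate output" as one output unit per joint action are all assumptions made for illustration.

```python
from typing import List, Sequence


def encode_graded(action: Sequence[int], n_values: Sequence[int]) -> List[float]:
    """One input per action dimension, scaled to [0, 1] (assumed scaling)."""
    return [a / (n - 1) if n > 1 else 0.0 for a, n in zip(action, n_values)]


def encode_one_hot(action: Sequence[int], n_values: Sequence[int]) -> List[float]:
    """n inputs per action dimension, with a single 1.0 marking the chosen value."""
    encoded: List[float] = []
    for a, n in zip(action, n_values):
        encoded.extend(1.0 if i == a else 0.0 for i in range(n))
    return encoded


def encode_mixed(action: Sequence[int], n_values: Sequence[int],
                 one_hot_dims: Sequence[bool]) -> List[float]:
    """Per dimension, use one-hot where one_hot_dims is True, graded otherwise."""
    encoded: List[float] = []
    for a, n, one_hot in zip(action, n_values, one_hot_dims):
        if one_hot:
            encoded.extend(encode_one_hot([a], [n]))
        else:
            encoded.extend(encode_graded([a], [n]))
    return encoded


def output_index(action: Sequence[int], n_values: Sequence[int]) -> int:
    """Index of the output unit for a joint action (mixed-radix numbering).

    Assumes the network takes only the state as input and emits one Q-value
    per joint action, so no action inputs are needed at all.
    """
    index = 0
    for a, n in zip(action, n_values):
        index = index * n + a
    return index


# Hypothetical example: two action dimensions with 3 and 2 possible values.
n_values = [3, 2]
action = [1, 1]
graded = encode_graded(action, n_values)
one_hot = encode_one_hot(action, n_values)
mixed = encode_mixed(action, n_values, [True, False])
q_output = output_index(action, n_values)
```

With these assumptions the graded encoding uses 2 inputs, the one-hot encoding uses 5, the mixed encoding uses 4, and the separate-output variant needs 3 × 2 = 6 output units. That trade-off in network size is what these options would let users choose between.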
