If you think the move is valid but not optimal, give a score of 0. Otherwise, give a score of 1.\n\nEval 2 (Short Horizon Reward): Evaluate how close is the state to the goal state by rating it with a ...
Some results have been hidden because they may be inaccessible to you
Show inaccessible results