This is using a simple supervised function learner to learn the mapping
from joint angles to positions and velocities. From there it applies a
trivial hard-coded algorithm to reach for a random target point that moves
every few seconds (not explicitly shown).
It starts out naive in this video, so the the function learner is learning on the fly
from scratch just from the motion that happens at random initially; but
the nature of the function learner is that it learns very fast locally
so the behavior is reasonable almost immediately.
I upped the number of segments to push it to where it's barely working (it
worked too well with the default number of segments).
It doesn't understand the force and acceleration dynamics--those are implicit
in the simple hard-coded control code--which is why it's wobbly. Learning
those comes next...
-Simon Funk
|