Looking at the photo I don't think the AI is going to realize eating the piece i...

Looking at the photo I don't think the AI is going to realize eating the piece it is about to pick up and it will choke. I would actually like to see more reinforcement learning agents like that, the action space on infant movement is quite small so it's even really about "action space discovery" to some point. Things it discovers are way more interesting, like if food were not on the floor/level and it has to stand to get it, it will eventually get there after N attempts (over time), and then if you introduce another agent if learning to block the other agent will award more food, etc, then it discovers 50/50 and equilibriums (better to eat now than wait). PLATO seems like a step in that direction.