Watch the weird videos used to train AI what different actions look like

How do you teach a computer to recognize 'poking' or 'shopping'?