Solving credit assignment problem in behavior coordination learning via robot action decomposition

Wai Keung Fung*, Yun Hui Liu

*Awdur cyfatebol y gwaith hwn

Allbwn ymchwil: Cyfraniad at gyfnodolynErthygl Cynhadleddadolygiad gan gymheiriaid

Crynodeb

In behavior coordination, several primitive behaviors are 'combined' to generate a resultant action to drive the robot. The weights across the primitive behaviors should be properly determined according to the situations that the robot encounters in order to successfully avoid collisions with obstacles and accomplish the assigned task. Behavior coordination learning is proposed to learn the mapping between the situations encountered by the robot and the weight combinations on primitive behaviors from observed resultant behavior of the robot. This paper proposes an action decomposition algorithm to automatically derive the weights across primitive behaviors from an observed resultants behavior with minimum weight variations along time by local optimization scheme. Several examples on simulated and experimental data are presented to demonstrate the computation in action decomposition.

Iaith wreiddiolSaesneg
Tudalennau (o-i)II-716 - II-721
CyfnodolynProceedings of the IEEE International Conference on Systems, Man and Cybernetics
Cyfrol2
Dynodwyr Gwrthrych Digidol (DOIs)
StatwsCyhoeddwyd - 1999
Cyhoeddwyd yn allanolIe
Digwyddiad1999 IEEE International Conference on Systems, Man, and Cybernetics 'Human Communication and Cybernetics' - Tokyo, Jpn
Hyd: 12 Hyd 199915 Hyd 1999

Dyfynnu hyn