1、附录 Gait Adaptation in a Quadruped Robot 1. Introduction A short time after birth a foal can walk and then run. It is remarkable that the animal learns tocoordinate the many muscles of the legs and trunk in such a short period of time. It is not likely that any learning algorithm could program a nerv
2、ous system ab initio with so few training epochs. Nor is it likely that the foals locomotor controller is completely determined before birth. How can this a- bility be explained? How can this ability be incorporated into the control system of a walking machine? Researchers in biology have presented
3、clear evidence of a functional unit of the central nervous system, the Central Pattern Generator (CPG), which can cause rhythmic movement of the trunk and limb muscles(Grillner and Wallen, 1985). In adult animals, the output of these cells can generate muscle activity that is very similar to activit
4、y during normal walking, even when sensory feedback has been eliminated (Grillner and Zangger, 1975). The CPG begins its ac- tivity before birth, although its activity does not appear to imitate the details of a particular walking animal, it is apparently correlated with the animals class, i.e., amp
5、hibian, reptile, mammal, etc. (Bekoff, 1985; Cohen, 1988).Apparently, the basic structure of the CPG network is laid down by evolution. How is this basic structure adapted to produce the detailed coordination needed to control a walk- ing animal? The answer to this question is important to robotics
6、for the following reason. CPGs have been well studied as a basic coordinating mechanism (Cohen et al., 1982; Bay and Hemami, 1987; Matsuoka, 1987; Rand et al., 1988; Taga et al., 1991; Collins and Stewart, 1993; Murray, 1993; Zielinska, 1996; Jalics et al., 1997; Ito et al., 1998; Kimura et al., 199
7、9). However, the details of how this system can automatically adapt to control a real robot are not clear. A good goal would be to describe a general strategy for matching a generic CPG to a particular robot in real-time, with a minimal amount of interaction with the environment. Reinforcement learn
8、ing has been applied on long time scales to certain problems in walking (learning coordination and basic leg movement) (Ilg and Berns, 1995), but the time scales of such an approach is too long to explain the quick learning of animals just after birth. The author suggests that part of the answer may
9、 be in the use of a number of simple innate internal models to evaluate the performance of the rapidly developing nervous system. These innate internal models could be used to adaptively tune CPGs during phases of rapid development. Figure 3 illustrates the training concept. A CPG generates a signal
10、 destined for a group of actuators (muscle) as well as a second signal, which is a copy of the signal sent to the actuators destined for an innate forward model. In biology, a copy of the motor signal is called an efference copy (Sperry, 1950). A forward model as described by Kawato (Kawato and Wolp
11、ert, 1998) is a functional model of the forward dynamics of the system.We use very simplified, innate forward models. These forward models predict the sensory expectation, or the desired consequence of CPG activity. This information is compared, and an adaptive rule then modifies the CPG. The author
12、 suggests that a handful of adaptive mechanisms may be used for rapid tuning of a generic CPG. For example, the adaptive model can be used to ensure coordination of limbs with the environment. The use of simplified, innate models is the most conservative stance possible. The intent is to make the fe
13、west assumptions possible about the knowledge that the nervous system has about the body that it is trying to control. By demonstrating how this process may be used in a physical devicea robot we give compelling evidence that this approach is sufficient. This article reports on an investigation into
14、 how a group of forward models could be used to adaptively tune a CPG in a real robot. As these models are innate, we assume they are simple; it would not be satisfying if these models were as complex as the behaviors that they help generate. Secondly, these models should not be detailed, accurate m
15、odels of the forward dynamics of the robot. If innate models are used, their simplicity prohibits them from detailing accurate information about the structure of the pattern generator. The study here supposes that once the CPGs have been tuned to produce basic locomotion, other, more general learnin
16、g mechanisms would take over to create a more refined gait. These learning mechanisms might include reinforcement learning or supervised learning methods. 2. Experiments The results of three experiments using GEO-II are described in this section. These experiments require progressively more adaptati
17、on to the environment and culminate in adaptive walking behavior. The robot learns to adjust key parameters of the CPG network to allow the robot to walk within minutes. 2.1Experimental Setup The robot platform is a four-legged robot, “GEO-II” (Fig. 1). Sensors include a force sensor on each foot, a
18、nd a gyro scope which senses body roll. The unique features of this robot include a flexible, three-degrees of freedom spine. This allows spinal movement including twist. Model airplane servo actuators drive all axes.These servos are positional control devices. Geo II weighs 1.25 Kg. Figure 1.The GE
19、O-II robot. GEO-II features a flexible spine Computation is divided between an onboard processor, a 68HC11 based ServoX24 board by Digital Designs and Systems, Inc., and a dual Intel Pentium workstation. The ServoX24 board is responsible for generating command signals for the servos as well as A/D sampling of sensor signals. The workstation is responsible for computing the ARRs, the AMs, and reflexes modules. The workstation also hosts a graphical user interface. All code runs under Windows 2000(C+ Microsoft) in a multi-threaded, windowed environment.