1、 1 毕 业 设 计毕 业 设 计 (2012 届)届) 2012012 2 年年 6 6 月月 14 14 日日 设计题目设计题目 生产加工站优化控制仿真演示系统生产加工站优化控制仿真演示系统 学生姓名学生姓名 学学 号号 专业班级专业班级 计算机科学与技术计算机科学与技术 0 08 8- -5 5 班班 指导教师指导教师 2 生产加工站优化控制仿真演示系统生产加工站优化控制仿真演示系统 摘摘 要要 研究多站点传送带给料生产加工站(Conveyor-serviced production station, CSPS) 系统 的最优控制问 题, 其 优化目标是通过 合理选择每 个 CSPS 的
2、 Look-ahead 控制策略, 实现整个系统的工件处理率最大. 本文首先根据多 Agent 系统的反应扩散思想, 对每个 Agent 的原始性能函数进行改进, 引入了具 有扩散功能的局域信息交互项(原始项看作具有反应功能);并运用性能势理论, 构建一种适用于平均和折扣两种性能准则的Wolf-PHC 多 Agent学习算法, 以求 解决策时刻不同步的多站点的协作 Look-ahead 控制策略. 最后, 论文通过仿真 实验验证了该算法的有效性, 学习结果表明, 通过性能函数的改进, 各工作站 的负载平衡性得到改善, 整个系统的工件处理率也明显提高. 关键关键词词: 传送带给料生产加工站, L
3、ook-ahead 控制, 多 Agent 强化学习, 性 能函数 3 The simulation system of conveyor-serviced production station Abstract The optimal control problem of a multiple conveyor-serviced production station (CSPS) system is concerned.The objective is to maximize the part-processing rate of the entire system by choosing
4、a suitable look-ahead control strategy for each CSPS.According to the reaction-diffusion mechanism of multi-agent systems,the original performance function of each agent is first modified by introducing an item with a diffusion function that denotes the interaction of local information (The original
5、 item is assumed to have a reaction function).Then,combined with the concept of performance potentials,a multi-agent algorithm,Wolf-PHC algorithm,is proposed to derive the coordinated look-ahead control strategy for systems with either discounted or average performance criteria,where the decision ep
6、och of each agent is asynchronous.Finally,a simulation example is used to illustrate the effectiveness of the algorithm,and the simulation results show that due to the modification of the performance functions,the contributions of all the stations are well balanced,and the part-processing rate of the entire system is increased significantly. Key words : Conveyor-serviced production station (CSPS), look-ahead control, multi-agent reinfor