基于深度学习的变电站围栏布置机器人分层路径规划方法

Path Planning Method for Substation Protective Barriers Deploy Robot Based on Deep Learning

  • 摘要: 布放围栏构建隔离禁区是变电站电力设备检修作业的重要前提,机器代人自动布放隔离围栏已成为现场需求,但面临挑战。针对全局信息缺失时机器人运输围栏至待布放区和布放围栏形成禁区的路径寻优难题,提出一种基于深度强化学习的自适应分层路径规划方法。考虑站内全局信息缺失,采用马尔科夫决策链,建立机器人运输围栏至待布放区的局部信息交互导航模型;考虑运输围栏的空间几何尺寸和负载约束,基于多源传感信息融合,优化可行路径中机器人的动态位姿朝向;栅格化机器人布放围栏形成禁区的场景地图,实现栅格网络中围栏布放路径的动态交互避障,采用深度Q网络算法,提取机器人即时定位的超宽带位置信息生成复合状态特征,在奖励函数中引入避障设备距离、障碍物空间安全边界和碰撞奖惩机制三元素,实现机器人循序布放负载围栏形成禁区任务下的自适应避障路径规划。仿真和试验验证了所提方法的有效性。所提方法对布放隔离围栏任务中分时路径导航的自适应切换,提高了变电站围栏布置机器人导航成功率,为机器代人全自动布放围栏技术提供借鉴。

     

    Abstract: Deploying electrical protective barriers to build isolation exclusion zones is an important prerequisite for substation power equipment maintenance operations. Automatic deployment of protective barriers by machines instead of humans has become a field requirement but faces challenges. To solve the problem of optimizing the path for a robot to transport and deploy protective barriers to the area, and to form a restricted area when global information is missing, an adaptive hierarchical path planning method is proposed based on deep reinforcement learning. Considering the lack of global information in the station, a Markov decision chain is used to establish a local information interaction navigation model for the robot transporting and deploying the protective barriers to the area. Considering the spatial geometry and load constraints of the transporting protective barriers, the dynamic orientation of the robot in the feasible path is optimized based on the fusion of multi-source sensing information. The scenario map in which the robot deploys the fence to form an exclusion zone is rasterized, and a dynamic interactive obstacle avoidance is achieved for the fence deployment path in the raster network. Dynamic interaction obstacle avoidance, using deep Q-network algorithm, extracting the ultra-wideband position information of robot instant positioning to generate composite state features, introducing three elements of obstacle avoidance equipment distance, obstacle space safety boundary and collision reward and punishment mechanism in the reward function, to achieve adaptive obstacle avoidance path planning under the task of robots laying loaded fences to form forbidden zones in a sequential manner. Simulations and experiments verify the effectiveness of the proposed method. The proposed method for time-sharing path navigation in the task of laying the exclusion fence improves the success rate of the navigation of the substation fence placement robot, and lays the foundation for the fully automated laying of the protective barriers technology.

     

/

返回文章
返回