交通灯控制系统外文翻译_第1页
交通灯控制系统外文翻译_第2页
交通灯控制系统外文翻译_第3页
交通灯控制系统外文翻译_第4页
交通灯控制系统外文翻译_第5页
已阅读5页,还剩7页未读 继续免费阅读

下载本文档

版权说明:本文档由用户提供并上传,收益归属内容提供方,若内容存在侵权,请进行举报或认领

文档简介

1、本科生毕业设计(论文)外文文献翻译毕业设计题目: 交通灯智能控制系统 学 院: 信息科学与工程学院 专业班级: 测控技术与仪器0703班 学生姓名: 王欣 指导教师: 桑海峰 2011年 3月 19日外文原文intelligent traffic light controlmarco wiering, jelle van veenen, jilles vreeken, and arne koopman intelligent systems groupinstitute of information and computing sciences utrecht universitypadual

2、aan 14, 3508tb utrecht, the netherlandsemail: marcocs.uu.nljuly 9, 2004abstractvehicular travel is increasing throughout the world, particularly in large urban areas.therefore the need arises for simulating and optimizing traffic control algorithms to better accommodate this increasing demand. in th

3、is paper we study the simulation and optimization of traffic light controllers in a city and present an adaptive optimization algorithm based on reinforcement learning. we have implemented a traffic light simulator, green light district, that allows us to experiment with different infrastructures an

4、d to compare different traffic light controllers. experimental results indicate that our adaptive traffic light controllers outperform other fixed controllers on all studied infrastructures.keywords: intelligent traffic light control, reinforcement learning, multi-agent systems (mas), smart infrastr

5、uctures, transportation research1 introductiontransportation research has the goal to optimize transportation flow of people and goods.as the number of road users constantly increases, and resources provided by current infrastructures are limited, intelligent control of traffic will become a very im

6、portant issue in the future. however, some limitations to the usage of intelligent traffic control exist. avoiding traffic jams for example is thought to be beneficial to both environment and economy, but improved traffic-flow may also lead to an increase in demand levinson, 2003.there are several m

7、odels for traffic simulation. in our research we focus on microscopic models that model the behavior of individual vehicles, and thereby can simulate dynamics of groups of vehicles. research has shown that such models yield realistic behavior nagel and schreckenberg, 1992, wahle and schreckenberg, 2

8、001.cars in urban traffic can experience long travel times due to inefficient traffic light control. optimal control of traffic lights using sophisticated sensors and intelligent optimization algorithms might therefore be very beneficial. optimization of traffic light switching increases road capaci

9、ty and traffic flow, and can prevent traffic congestions. traffic light control is a complex optimization problem and several intelligent algorithms, such as fuzzy logic, evolutionary algorithms, and reinforcement learning (rl) have already been used in attempts to solve it. in this paper we describ

10、e a model-based, multi-agent reinforcement learning algorithm for controlling traffic lights.in our approach, reinforcement learning sutton and barto, 1998, kaelbling et al., 1996 with road-user-based value functions wiering, 2000 is used to determine optimal decisions for each traffic light. the de

11、cision is based on a cumulative vote of all road users standing for a traffic junction, where each car votes using its estimated advantage (or gain) of setting its light to green. the gain-value is the difference between the total time it expects to wait during the rest of its trip if the light for

12、which it is currently standing is red, and if it is green. the waiting time until cars arrive at their destination is estimated by monitoring cars flowing through the infrastructure and using reinforcement learning (rl) algorithms.we compare the performance of our model-based rl method to that of ot

13、her controllers using the green light district simulator (gld). gld is a traffic simulator that allows us to design arbitrary infrastructures and traffic patterns, monitor traffic flow statistics such as average waiting times, and test different traffic light controllers. the experimental results sh

14、ow that in crowded traffic, the rl controllers outperform all other tested non-adaptive controllers. we also test the use of the learned average waiting times for choosing routes of cars through the city (co-learning), and show that by using co-learning road users can avoid bottlenecks.this paper is

15、 organized as follows. section 2 describes how traffic can be modelled, predicted, and controlled. in section 3 reinforcement learning is explained and some of its applications are shown. section 4 surveys several previous approaches to traffic light control, and introduces our new algorithm. sectio

16、n 5 describes the simulator we used for our experiments, and in section 6 our experiments and their results are given. we conclude in section 7.2 modelling and controlling trafficin this section, we focus on the use of information technology in transportation. a lot of ground can be gained in this a

17、rea, and intelligent transportation systems (its) gained interest of several governments and commercial companies ten-t expert group on its, 2002, white paper, 2001, epa98, 1998.its research includes in-car safety systems, simulating effects of infrastructural changes, route planning, optimization o

18、f transport, and smart infrastructures. its main goals are: improving safety, minimizing travel time, and increasing the capacity of infrastructures. such improvements are beneficial to health, economy, and the environment, and this shows in the allocated budget for its.in this paper we are mainly i

19、nterested in the optimization of traffic flow, thus effectively minimizing average traveling (or waiting) times for cars. a common tool for analyzing traffic is the traffic simulator. in this section we will first describe two techniques commonly used to model traffic. we will then describe how mode

20、ls can be used to obtain real-time traffic information or predict traffic conditions. afterwards we describe how information can be communicated as a means of controlling traffic, and what the effect of this communication on traffic conditions will be. finally, we describe research in which all cars

21、 are controlled using computers.2.1 modelling traffic.traffic dynamics bare resemblance with, for example, the dynamics of fluids and those of sand in a pipe. different approaches to modelling traffic flow can be used to explain phenomena specific to traffic, like the spontaneous formation of traffi

22、c jams. there are two common approaches for modelling traffic; macroscopic and microscopic models.2.1.1 macroscopic models.macroscopic traffic models are based on gas-kinetic models and use equations relating traffic density to velocity lighthill and whitham, 1955, helbing et al., 2002. these equati

23、ons can be extended with terms for build-up and relaxation of pressure to account for phenomena like stop-and-go traffic and spontaneous congestions helbing et al., 2002, jin and zhang, 2003, broucke and varaiya, 1996. although macroscopic models can be tuned to simulate certain driver behaviors, th

24、ey do not offer a direct, flexible, way of modelling and optimizing them, making them less suited for our research.2.1.2 microscopic models.in contrast to macroscopic models, microscopic traffic models offer a way of simulating various driver behaviors. a microscopic model consists of an infrastruct

25、ure that is occupied by a set of vehicles. each vehicle interacts with its environment according to its own rules. depending on these rules, different kinds of behavior emerge when groups of vehicles interact.cellular automata. one specific way of designing and simulating (simple) driving rules of c

26、ars on an infrastructure, is by using cellular automata (ca). ca use discrete partially connected cells that can be in a specific state. for example, a road-cell can contain a car or is empty. local transition rules determine the dynamics of the system and even simple rules can lead to chaotic dynam

27、ics. nagel and schreckenberg (1992) describe a ca model for traffic simulation. at each discrete time-step, vehicles increase their speed by a certain amount until they reach their maximum velocity. in case of a slower moving vehicle ahead, the speed will be decreased to avoid collision. some random

28、ness is introduced by adding for each vehicle a small chance of slowing down. experiments showed realistic behavior of this ca model on a single road with emerging behaviors like the formation of start-stop waves when traffic density increases.cognitive multi-agent systems. a more advanced approach

29、to traffic simulation and optimization is the cognitive multi-agent system approach (cmas), in which agents interact and communicate with each other and the infrastructure. a cognitive agent is an entity that autonomously tries to reach some goal state using minimal effort. it receives information f

30、rom the environment using its sensors, believes certain things about its environment, and uses these beliefs and inputs to select an action. because each agent is a single entity, it can optimize (e.g., by using learning capabilities) its way of selecting actions. furthermore, using heterogeneous mu

31、lti-agent systems, different agents can have different sensors, goals, behaviors, and learning capabilities, thus allowing us to experiment with a very wide range of (microscopic) traffic models.dia (2002) used a cmas based on a study of real drivers to model the drivers response to travel informati

32、on. in a survey taken at a congested corridor, factors influencing the choice of route and departure time were studied. the results were used to model a driver population, where drivers respond to presented travel information differently. using this population, the effect of different information sy

33、stems on the area where the survey was taken could be simulated. the research seems promising, though no results were presented.a traffic prediction model that has been applied to a real-life situation, is described in wahle and schreckenberg, 2001. the model is a multi-agent system (mas) where driv

34、ing agents occupy a simulated infrastructure similar to a real one. each agent has two layers of control; one for the (simple) driving decision, and one for tactical decisions like route choice. the real world situation was modelled by using detection devices already installed. from these devices, i

35、nformation about the number of cars entering and leaving a stretch of road are obtained. using this information, the number of vehicles that take a certain turn at each junction can be inferred. by instantiating this information in a faster than real-time simulator, predictions on actual traffic can

36、 be made. a system installed in duisburg uses information from the existing traffic control center and produces real-time information on the internet. another system was installed on the freeway system of north rhine-westphalia, using data from about 2.500 inductive loops to predict traffic on 6000

37、km of roads.中文译文智能交通灯控制马克 威宁,简丽 范 威,吉尔 威瑞肯,安瑞 库普曼智能系统小组乌得勒支大学信息与计算科学研究所荷兰乌得勒支padualaan14号邮箱:marcocs.uu.nl2004年7月9日摘要世界各地的车辆运行逐渐增多,尤其是在一个大的本地区域。因此就需要有关交通控制的模拟与优化算法,来更好的地适应日益增长的需求。在文中,我们学习了在城市中的模拟与优化的交通灯控制器,以及目前基于强化学习的自适应优化算法。我们已经实行了一个交通等模拟器,绿灯区,这允许我们用不同的基础设施和不同的交通控制器去实验。实验结果表明,在所有基础设施的研究领域内,我们的自适应交通灯

38、控制器优于其他固定的控制器。关键字:智能交通灯控制,强化学习,多代理系统(mas),智能基础设施,运输研究1 介绍运输研究的目的是优化人流和物流。随着道路使用者的数量不断上涨,当前基础设施所提供的资源受到限制,在未来,交通智能控制将会成为一个非常重要的问题。然而,一些交通智能控制使用受限问题的存在。避免交通堵塞,例如,被认为是对环境和经济有益的,但是增加的交通流也可能导致资源需求的增加。莱文森,2003。这有几个交通仿真模型。在我们的研究中,我们专注于那些具有个体车辆行为的微观模型,从而更好的模拟群体车辆的动力学。研究表明,这种模型的出现具有现实意义nagel and schreckenber

39、g,1992,wahle and schreckenberg,2001。汽车在城市交通中经历了漫长的运行时间,要归因于低效的交通灯控制。因此,使用成熟传感器和智能优化算法的交通灯优化控制可能是有益的。优化的交通灯切换增加了道路的容量和人流,能阻止交通堵塞。交通灯控制是一个复杂的优化问题和几个智能算法,例如模糊逻辑、遗传算法和强化学习(rl)已被应用去试图解决问题。在本文中,我们描述了一种对交通灯控制,基于模型的、多代理的强化学习算法。我们的方法,强化学习sutton and barto,1998,kaelbling,1996和基于道路使用者的价值功能威宁,2000被用来决定每个交通灯的优化选择

40、。这个决定是基于道路使用者站了一个交叉路口的累积投票,在那里每辆汽车使用其估计选票的优势(或增益)设置它的光的绿色。在其余路程,它的所有等待时间里,如果信号灯现在是红色的或者绿色的,那么增益的值是不同的。汽车直到到达目的地后的等待时间,是通过监测汽车流过基础设施和应用强化学习(rl)算法而估算出来的。本文写作安排如下。第二部分描述了交通是如何被建立、预测和控制的。在第三部分解释了什么是强化学习和一些它的应用。第四部分调查了几个以前交通控制的方法,介绍了我们的新算法。第五部分描述了我们实验中所使用的仿真器,以及第六部分给出我们的实验和实验结果。在第七部分我们得出结论。2 建立和控制交通在这一部分

41、,我们专注于在交通运输方面所使用的信息技术。在这个区域增加了大量的土地,并且一些政府和商业公司在交通智能系统(its)方面获得了利润。ten-t expert group on its,2002,白皮书,2001,epa98,1998。交通智能系统(its)研究包括车内安全系统,基础设施改变所引起的仿真效果,路途规划,优化运输和智能的基础设施。其主要目标是:提高安全性、减少运行时间、增加基础设施的能力。这种改进有益健康、经济、环境,这表现在交通智能系统的分配预算方面。在本文中,我们主要对车流的优化感兴趣,从而有效减少平均运行(或者等待)的车辆次数。一种常见的分析交通的工具就是交通仿真器。在这部

42、分中,我们将首先描述两种常用于交通模型的技术。然后我们将描述模型是如何用来获取实时交通信息或者预测交通情况的。后来,我们描述信息是如何作为一种控制交通的手段来进行沟通的,在这样的交通条件下,沟通产生了什么样的影响。最后,我们描述了所有的汽车都使用计算机进行控制的研究。2.1 建立交通与交通动力学仅有的相似之处是,例如,流体力学和管内的沙子。建立车流模型的不同方法是用来解释交通的特殊现象的,就像自发形成的交通堵塞状况。有两种普遍的方法去建立交通:宏观和微观模型。2.1.1 宏观模型宏观交通模型是基于gas-kinetic模型的,利用了关于交通密度和速度的方程式lighthill and whit

43、ham,1955,helbing et al.,2002。这些方程式可以延长积累和放松压力,归因于类似的停停走走的交通和自发的拥堵的现象。helbing et al.,2002,jin and zhang, 2003,broucke and varaiya,1996。尽管宏观模型可以来模拟一些特定的可调驱动行为,但是他们不能提供一个直接的、灵活的建立和优化交通的方法,这使他们不太适合我们的研究。2.1.2 微观模型与宏观模型相对比的,微观交通模型提供了一种仿真各种各样司机行为的方法。一个微观模型由一组车辆占据的基础设施组成。每辆车都根据自己的规则,和周围的环境产生作用。根据这些规则,当很多车辆互相作用时,不同种类的行为就会出现。元胞自动机。一个在基础设施上的具体设计和仿真(简单的

温馨提示

  • 1. 本站所有资源如无特殊说明,都需要本地电脑安装OFFICE2007和PDF阅读器。图纸软件为CAD,CAXA,PROE,UG,SolidWorks等.压缩文件请下载最新的WinRAR软件解压。
  • 2. 本站的文档不包含任何第三方提供的附件图纸等,如果需要附件,请联系上传者。文件的所有权益归上传用户所有。
  • 3. 本站RAR压缩包中若带图纸,网页内容里面会有图纸预览,若没有图纸预览就没有图纸。
  • 4. 未经权益所有人同意不得将文件中的内容挪作商业或盈利用途。
  • 5. 人人文库网仅提供信息存储空间,仅对用户上传内容的表现方式做保护处理,对用户上传分享的文档内容本身不做任何修改或编辑,并不能对任何下载内容负责。
  • 6. 下载文件中如有侵权或不适当内容,请与我们联系,我们立即纠正。
  • 7. 本站不保证下载资源的准确性、安全性和完整性, 同时也不承担用户因使用这些下载资源对自己和他人造成任何形式的伤害或损失。

评论

0/150

提交评论