版权说明:本文档由用户提供并上传,收益归属内容提供方,若内容存在侵权,请进行举报或认领
文档简介
1、1System Busses / Networks-on-ChipEECE 579 - Advanced Topics in VLSI DesignSpring 2009Brad Quinton2OutlineSimple systems bussesOverviewAMBA APBAdvantages/Limitations Complex systems bussesOverviewAMBA AHBAdvantages/LimitationsNetworks-on-Chip (NoC)OverviewAMBA AXIResearch Topics: Topology, Protocol,
2、VLSI Implementation.Review: “A Generic Architecture for On-Chip Packet-Switched Interconnections”3Bluetooth “Platform” SoCMacintosh PICTimage form atis not supportedProcessorMemory ControllerApplication Specific LogicLow-speed I/O and Support LogicSystem Bus / Hardware I/F4Simple System Busses5Simpl
3、e System BussesThe primary goal of a simple system bus is to allow software (running on a processor) to communicate with other hardware in the SoCThere are many different implementation . but they are all very similar6Embedded Processor I/ORISC-based embedded processors communicate with external har
4、dware using two simple instructions:7Embedded Processor I/ORISC-based embedded processors communicate with external hardware using two simple instructions:Load Operation: Copies a word of data from a specific address to a local registerStore Operation: Copies a word of data from a local register to
5、a specific address8Embedded Processor I/ORISC-based embedded processors communicate with external hardware using two simple instructions:Load Operation: Copies a word of data from a specific address to a local registerStore Operation: Copies a word of data from a local register to a specific address
6、The simple system bus is just a direct extension of this model9Embedded Processor I/O10Embedded Processor I/OSoftware sets up the register with the address and data .11Embedded Processor I/OSoftware sets up the register with the address and data .Blocks decode addresses to see if they are the target
7、s.12Embedded Processor I/OSoftware sets up the register with the address and data .Blocks decode addresses to see if they are the targets.Data transferred between register and hardware13AMBA SpecificationAMBA: Advanced Microcontroller Bus ArchitectureCreated by ARM to enable standardized interfaces
8、to their embedded processorsActually three standards: APB, AHB, and AXIVery commonly used for commercial IP cores14AMBA SpecificationAMBA: Advanced Microcontroller Bus ArchitectureCreated by ARM to enable standardized interfaces to their embedded processorsActually three standards: APB, AHB, and AXI
9、Very commonly used for commercial IP coresSimple Bus15AMBA SpecificationAMBA: Advanced Microcontroller Bus ArchitectureCreated by ARM to enable standardized interfaces to their embedded processorsActually three standards: APB, AHB, and AXIVery commonly used for commercial IP coresSimple BusComplex B
10、us16AMBA SpecificationAMBA: Advanced Microcontroller Bus ArchitectureCreated by ARM to enable standardized interfaces to their embedded processorsActually three standards: APB, AHB, and AXIVery commonly used for commercial IP coresNoCSimple BusComplex Bus17AMBA APB: Read OperationMacintosh PICTimage
11、 formatis not supported18AMBA APB: Read OperationMacintosh PICTimage formatis not supportedTarget Address19AMBA APB: Read OperationMacintosh PICTimage formatis not supportedTarget AddressTransaction Type20AMBA APB: Read OperationMacintosh PICTimage formatis not supportedTarget AddressTransaction Typ
12、eAddress Decode21AMBA APB: Read OperationMacintosh PICTimage formatis not supportedTarget AddressTransaction TypeAddress DecodeOptional (for asynchronous implementations.)22AMBA APB: Read OperationMacintosh PICTimage formatis not supportedTarget AddressTransaction TypeAddress DecodeOptional (for asy
13、nchronous implementations.)Read Data23AMBA APB: Write OperationMacintosh PICTimage formatis not supported24AMBA APB: Write OperationMacintosh PICTimage formatis not supportedCommon Signals Between Read and Write25AMBA APB: Write OperationMacintosh PICTimage formatis not supportedWrite DataCommon Sig
14、nals Between Read and Write26Remember Our Case Study- data width:16 bits- address width: 16 bits- read cycle time: 50 ns- write cycle time: 50 ns Simple generic processor interface:27- data width:16 bits- address width: 16 bits- read cycle time: 50 ns- write cycle time: 50 ns Simple generic processo
15、r interface:System busRemember Our Case Study28Simple Bus AdvantagesSimple to implementEasy to understandSimple programming modelEasy to add new hardware blocksMinimal hardware requirements (most of the signals are shared)29Simple Bus LimitationsSingle Master - limits parallelismScalability - perfor
16、mance suffers as bus is loaded.Single outstanding request - poor throughput and multi-threading performance bottleneck30Case Study: Single MasterImagine a new partition: APS Bit Error Monitor communicates directly with SwitchSimple bus doesnt work.31Case Study: Single MasterNo PathImagine a new part
17、ition: APS Bit Error Monitor communicates directly with SwitchSimple bus doesnt work.32Case Study: Single MasterNo PathThis can make software the bottleneck in the system.Imagine a new partition: APS Bit Error Monitor communicates directly with SwitchSimple bus doesnt work.33Single Master SummaryA b
18、us that is limited to a single master:Makes inter-block communication inefficientLimits parallelism between hardware and softwareIncreases reliance on interruptsCreates software performance bottlenecksIs not compatible with multiple processors 34Scalability35ScalabilityBlocks are functionally easy t
19、o add, but. 36ScalabilityEach new block increases the delay on the address and data Blocks are functionally easy to add, but. 37Scalability SummarySimple busses are not scaleable because:The address and data “fan-out” to each targetAdding a new block increases the load on the bus Increased fanout +
20、greater load = reduce performance38Single Outstanding RequestMacintosh PICTimage formatis not supported39Single Outstanding RequestMacintosh PICTimage formatis not supportedProcessor is stalled waiting for response.40Single Outstanding RequestMacintosh PICTimage formatis not supportedProcessor is st
21、alled waiting for response.best-case backpressuredifficult to provide performance guarantees.Still potentially a bandwidth bottleneckStill doesnt scale well when blocks are addedMultiple outstanding transactions not handled well - no ordering information64Networks-on-Chip (NoCs)65Networks-on-ChipIt
22、is clear that even with significant design effort the bus-style interconnect is not going to sufficient for large SoCs:the physical implementation does not scale: bus fanout, loading, arbitration depth all reduce operating frequencythe available bandwidth does not scale: the single bus must be share
23、d by all masters and slaves66Networks-on-ChipIt is clear that even with significant design effort the bus-style interconnect is not going to sufficient for large SoCs:the physical implementation does not scale: bus fanout, loading, arbitration depth all reduce operating frequencythe available bandwi
24、dth does not scale: the single bus must be shared by all masters and slavesLets start again: Leverage research from data networking67What do we want?The SoCs of the future will:have 100s of hardware blocks,have billions of transistors, have multiple processors,have large wire-to-gate delay ratios,ha
25、ndle large amounts of high-speed data,need to support “plug-and-play” IP blocksOur NoC needs to be ready for these SoCs.68The Ideal NetworkWhat would the ideal network look like?:Low area overheadSimple implementationHigh-speed operationLow-latencyHigh-bandwidthOperate at a constant frequency even w
26、ith additional blocksIncrease available bandwidth as blocks are addedProvide performance guaranteesHave a “universal” interface69The Ideal NetworkWhat would the ideal network look like?:Low area overheadSimple implementationHigh-speed operationLow-latencyHigh-bandwidthOperate at a constant frequency
27、 even with additional blocksIncrease available bandwidth as blocks are addedProvide performance guaranteesHave a “universal” interfaceThese are competing requirements: Design a network that is the “best” fit.70What do we need to decide?Network InterfaceNetwork Protocol / Transaction FormatNetwork To
28、pologyVLSI Implementation71Network InterfaceWe want our network to be “plug-and-play” so industry standardization is keyHowever the standard be universal enough to address many different needsAMBA AXI is an example of an attempt at this72AMBA AXIARM added the AXI specification to Version 3.0 of the
29、AMBA standardNew approach: define the interface and leave the interconnect up to the designersGood plan since a specific bus implementation is no longer requiredIt is possible to use AXI to build many different NoCs73AMBA AXIInterface divided into 5 channels:Write AddressWrite DataWrite ResponseRead
30、 AddressRead Data/ResponseEach channel is independent and use two-way flow control74AMBA AXI Read ChannelsM a c i n t o s h P I C Ti m a g e f o r m a ti s n o t s u p p o r t e d75AMBA AXI Read ChannelsM a c i n t o s h P I C Ti m a g e f o r m a ti s n o t s u p p o r t e dIndependent76AMBA AXI Re
31、ad ChannelsM a c i n t o s h P I C Ti m a g e f o r m a ti s n o t s u p p o r t e dGive me some dataIndependent77AMBA AXI Read ChannelsM a c i n t o s h P I C Ti m a g e f o r m a ti s n o t s u p p o r t e dGive me some dataHere you goIndependent78AMBA AXI Read ChannelsM a c i n t o s h P I C Ti m
32、 a g e f o r m a ti s n o t s u p p o r t e dGive me some dataHere you goIndependentchannels synchronized with ID # or “tags”79AMBA AXI Write ChannelsM acintosh P IC Tim age form atis not supported80AMBA AXI Write ChannelsM acintosh P IC Tim age form atis not supportedIndependentIndependent81AMBA AX
33、I Write ChannelsM acintosh P IC Tim age form atis not supportedIm sending data. Please store it.IndependentIndependent82AMBA AXI Write ChannelsM acintosh P IC Tim age form atis not supportedIm sending data. Please store it.Here is the data.IndependentIndependent83AMBA AXI Write ChannelsM acintosh P
34、IC Tim age form atis not supportedIm sending data. Please store it.Here is the data.I received that data correctly.IndependentIndependent84AMBA AXI Write ChannelsM acintosh P IC Tim age form atis not supportedIm sending data. Please store it.Here is the data.I received that data correctly.Independen
35、tIndependentchannels synchronized with ID # or “tags”85AMBA AXI Flow-ControlInformation moves only when:Source is Valid, andDestination is ReadyOn each channel the master or slave can limit the flowVery flexibleMa c i n t o s h P I CTi ma g e f o r ma ti s n o t s u p p o r t e dM a c i n t o s h P
36、I C Ti m a g e f o r m a ti s n o t s u p p o r t e dM a c i n t o s h P I C Ti ma g e f o r ma ti s n o t s u p p o r t e d86M a c i n t o s h P I C Ti ma g e f o r ma ti s n o t s u p p o r t e dM a c i n t o s h P I C Ti m a g e f o r m a ti s n o t s u p p o r t e dMa c i n t o s h P I CTi ma g
37、e f o r ma ti s n o t s u p p o r t e dAMBA AXI Flow-ControlInformation moves only when:Source is Valid, andDestination is ReadyOn each channel the master or slave can limit the flowVery flexibleTransfer87AMBA AXI Flow-ControlThis definition of very independent, fully flow-controlled channels is ver
38、y usefulHowever, there is a potential problem: 88AMBA AXI Flow-ControlThis definition of very independent, fully flow-controlled channels is very usefulHowever, there is a potential problem: DEADLOCK89AMBA AXI Flow-ControlThis definition of very independent, fully flow-controlled channels is very us
39、efulHowever, there is a potential problem: DEADLOCKOn a write transaction the master must not wait for AWREADY before asserting WVALID90AMBA AXI Read Ma c i n t o s h P I C Ti ma g e f o r ma ti s n o t s u p p o r t e d91AMBA AXI Read Ma c i n t o s h P I C Ti ma g e f o r ma ti s n o t s u p p o r
40、 t e dRead Address ChannelRead Data Channel92AMBA AXI WriteM a c in to s h P IC Tim a g e fo r m a tis n o t s u p p o r te d93AMBA AXI WriteM a c in to s h P IC Tim a g e fo r m a tis n o t s u p p o r te dWrite Address ChannelWrite Response ChannelWrite Data Channel94A True Interface Specification
41、Because of the channel independence and the two-way flow-control the interface does not dictate the network protocol, transaction format, network topology, or VLSI implementationFor example:if you want to build a packet-based network, you can “backpressure” the data channel while you build the packe
42、t header from the address channel information,you can use store-and-forward, or cut-through, etc.95Network Protocol / Transaction FormatThere are many choice for network protocols and transactions formats:circuit-switched : plan and provision a connection before communication startspacket-switched :
43、 issues packets which compete for network resourceshybrids: schedule connectivity (dynamic or static)96Network Protocol / Transaction FormatThere are many choice for network protocols and transactions formats:circuit-switched : plan and provision a connection before communication startspacket-switched : issues packets which compete for network resourceshybrids: schedule connectivity (dynamic or static)There is still lots of research here.97Network TopologyHow should your network elements be interconnected:Ful
温馨提示
- 1. 本站所有资源如无特殊说明,都需要本地电脑安装OFFICE2007和PDF阅读器。图纸软件为CAD,CAXA,PROE,UG,SolidWorks等.压缩文件请下载最新的WinRAR软件解压。
- 2. 本站的文档不包含任何第三方提供的附件图纸等,如果需要附件,请联系上传者。文件的所有权益归上传用户所有。
- 3. 本站RAR压缩包中若带图纸,网页内容里面会有图纸预览,若没有图纸预览就没有图纸。
- 4. 未经权益所有人同意不得将文件中的内容挪作商业或盈利用途。
- 5. 人人文库网仅提供信息存储空间,仅对用户上传内容的表现方式做保护处理,对用户上传分享的文档内容本身不做任何修改或编辑,并不能对任何下载内容负责。
- 6. 下载文件中如有侵权或不适当内容,请与我们联系,我们立即纠正。
- 7. 本站不保证下载资源的准确性、安全性和完整性, 同时也不承担用户因使用这些下载资源对自己和他人造成任何形式的伤害或损失。
最新文档
- 2025年度临时工物业管理及安保服务合同4篇
- 二零二五版开发商与购房者共有产权住房分时度假合同范本3篇
- 2025年度全麦面包砖出口贸易合同范本4篇
- 2025年度个人旅游行程居间代理合同范本4篇
- 2025年度煤矿自卸车升级改造项目合同3篇
- 2025版高端酒店客房升级改造施工承包合同4篇
- 2024版商务车租赁协议3篇
- 二零二五年航空航天装备股东合作协议汇编3篇
- 2025年度自动化码头运输设备租赁合同4篇
- 2025版模具行业特种技能用工合同规范范本4篇
- 校园安全培训课件
- 化工厂施工安全质量冬季施工措施
- 移动商务内容运营(吴洪贵)项目五 运营效果监测
- 2023-2024学年广西壮族自治区玉林市小学语文一年级期末评估测试题详细参考答案解析
- 青少年自杀自伤行为预防与干预专家讲座
- 比较思想政治教育学
- 职业技能大赛:电工(五级)理论知识考核要素细目表(征求意见稿)
- 阿特拉斯拧紧工具维修培训
- 莱州市石材产业园控制性详细规划环境影响报告书
- GB/T 4882-2001数据的统计处理和解释正态性检验
- POCT血糖仪项目培训记录表、资质授权申请表
评论
0/150
提交评论