版权说明:本文档由用户提供并上传,收益归属内容提供方,若内容存在侵权,请进行举报或认领
文档简介
MemoryHierarchyReview1Outline7.1IntroductionPrincipleofLocalityMemoryHierarchy7.2BasicofCacheDirectedMappedCacheBitsinaCacheCacheWriteCacheMissesMultiwordCacheBlock7.3CachePerformanceImproveCachePerformanceMemoryisaBottleneckFullyAssociativeCacheSetAssociativeCacheMulti-LevelCache7.1PrincipleofLocalityProgramaccessarelativelysmallportionoftheiraddressspaceatanyinstantoftimeTwokindoflocality:1.TemporallocalityIfanitemisreferenced,itwilltendtobereferencedagainsoon2.SpatiallocalityIfanitemisreferenced,itemswhoseaddressareclosebywilltendtobereferencedsoon7.1MemoryHierarchy–Speedvs.SizeMemoryTypeDiskDRAMSRAM7.1MemoryHierarchy-OperationIfdataisfound(hit)transfertoprocessor,otherwise(miss)transferdatatoupperlevel.AccesstimeHittimeMisspenaltyUserswantlargeandfastmemories!
SRAMaccesstimesare2-25nsatcostof$100to$250perMbyte.
DRAMaccesstimesare60-120nsatcostof$5to$10perMbyte.
Diskaccesstimesare10to20millionnsatcostof$.10to$.20perMbyte.
Tryandgiveittothemanywaybuildamemoryhierarchy7.1MemoryHierarchy7.2CacheCacheMemoryhierarchybetweenCPUandmainmemoryThestoragemanagedtotakeadvantageoflocalityofaccessCachetwoissuesHowdoweknowifadataitemisinthecache?TagandvalidbitIfitis,howdowefindit?MappedapproachesHowdoescacheworkexampleAddXntocacheDirectMappingmapmanymemorywordsontoonelocationincacheAddressismodulothenumberofblocksinthecache(blockaddress)modulo(no.ofcacheblocksinthecache)Example:Cachehas8wordMapping=blockaddressmodule87.2DirectMappedCache7.2DirectMappedCacheTwoissues1.Whichmemorywordinthecache?Usetagtoidentify2.Whetherthememoryblockisvalid?Ex.Initially,thecacheisemptyUsevalidbittoidentifyThusthecachedatastructurearevalidtagdataword…CacheIndex7.2DirectMappedCacheIfatagismatchedandvalidbitison
ThenarequesthitTagiscomparedwithupperportionofaddressReadhitsReadvaliddataoncacheReadmissesstalltheCPU,fetchblockfrommemory,delivertocache,restart
Writehitscanreplacedataincacheandmemory(write-through)writethedataonlyintothecache(write-backthecachelater)
Writemissesreadtheentireblockintothecache,thenwritetheword7.2CacheReadWriteTerminology7.2CacheWriteIssuesTwocachewritescheme:1.WritebackWhenwriteoccurs,onlywritetothecache2.WritethroughWhenwriteoccurs,writetothecacheandmemoryWrite-backproblemcacheandmemoryinconsistence,andcomplextoimplementEx.Whenacacheentryisreplaced,itmustupdatethecorrespondingmemoryaddressWrite-throughproblemWritingtomainmemoryslowsdowntheperformanceEx.CPIwithoutcachemiss=1.2clockcycleswritetomemorycausesextra10cycles13%storeinstructionsingccSolution:writebuffer,storethedataintowritebufferwhilethedataiswaitingtobewrittentomemoryTheprocesscancontinueexecutionafterwritingdataintocacheandwritebuffer7.2DirectMappedCacheCacheExampleonDECstation3100UseMIPSR2000CPU64KBdata98KBcachesize
7.2MultiwordCacheBlockTakeadvantageofspatiallocalityWithacachemiss,wewillfetchmultiplewordsthatareadjacentIncreasingtheblocksizetendstodecreasemissrate:
Usesplitcachesbecausethereismorespatiallocalityincode:
7.2Performance–Missratevs.BlocksizeImprovementoninstructionmissMakereadingmultiplewordseasierbyusingbanksofmemory
7.2Memorysystem-hardwareIssues7.3ImproveCachePerformanceThreewaysLargercacheSetassociativecacheReducecachemissrateNewplacementruleotherthandirectmappingMulti-levelcacheReducecachemisspenalty7.3FlexiblePlacementofBlocksTherearetwomoreflexibleschemes,thendirectedmappedSetassociativecacheFullyassociativecacheExample:block12addressisplacedin8blockcachePlace12%8=4Place12%4=0DirectmappedSetassociativeFullyassociativePlacedanyatblock7.3FullyAssociativecacheAnextremeschemeAmemorydatacanbeplacedinanyblockinthecacheDisadvantage:SearchallentriesinthecacheforamatchParallelcomparators7.3SetAssociativeCacheBetweendirectmappedandfull-associativeAmemorydatacanbeplacedinasetofblocksinthecache(address)modulo(numberofsetsincache)Ex:12modulo4=0Disadvantage:SearchallentriesinthesetforamatchParallelcomparators7.3EightBlockCacheConfigurationTotalsizeofcacheinblocksisequaltothenumberofsetsThus,forfixedcachesize,increaseassociativitydecreasesthenumberofset,butincreasenumberofelementinaset7.3MissRatewithAssociativityHigherdegreeofassociativityLowermissrateMorehardwarecosttosearch7.3Implementationof4-waySet-AssociativeCacheParallelcomparators7.3UseMulti-LevelCachetoReduceMissPenaltyAddasecondlevelcache:oftenprimarycacheisonthesamechipastheprocessoruseSRAMstoaddanothercacheaboveprimarymemory(DRAM)misspenaltygoesdownifdataisin2ndlevelcacheUsingmultilevelcaches:tryandoptimizethehittimeonthe1stlevelcachetryandoptimizethemissrateonthe2ndlevelcachePrimarycache(L1)Secondarycache(L2)L1cachemissL2cachemissCachehit7.3DecreasingMissPenaltywithMultilevelCachesAddasecondlevelcache:oftenprimarycacheisonthesamechipastheprocessoru
温馨提示
- 1. 本站所有资源如无特殊说明,都需要本地电脑安装OFFICE2007和PDF阅读器。图纸软件为CAD,CAXA,PROE,UG,SolidWorks等.压缩文件请下载最新的WinRAR软件解压。
- 2. 本站的文档不包含任何第三方提供的附件图纸等,如果需要附件,请联系上传者。文件的所有权益归上传用户所有。
- 3. 本站RAR压缩包中若带图纸,网页内容里面会有图纸预览,若没有图纸预览就没有图纸。
- 4. 未经权益所有人同意不得将文件中的内容挪作商业或盈利用途。
- 5. 人人文库网仅提供信息存储空间,仅对用户上传内容的表现方式做保护处理,对用户上传分享的文档内容本身不做任何修改或编辑,并不能对任何下载内容负责。
- 6. 下载文件中如有侵权或不适当内容,请与我们联系,我们立即纠正。
- 7. 本站不保证下载资源的准确性、安全性和完整性, 同时也不承担用户因使用这些下载资源对自己和他人造成任何形式的伤害或损失。
最新文档
- 江苏省镇江市丹徒区高中政治 第九课 唯物辩证法的实质与核心教案 新人教版必修4
- 二年级品德与生活上册 诚实故事会教案2 北师大版
- 2024秋八年级物理上册 第4章 光的折射 透镜 第一节 光的折射教案2(新版)苏科版
- 2024年秋九年级历史上册 第2单元 古代欧洲文明 第4课 希腊城邦和亚历山大帝国教案 新人教版
- 2024-2025学年高中英语 Module 5 Newspapers and Magazines教案1 外研版必修2
- 2024年五年级语文上册 第四单元 13 少年中国说(节选)配套教案 新人教版
- 2023六年级数学下册 第4单元 比例 2正比例和反比例练习课(正比例和反比例)教案 新人教版
- 换热站管理制度
- 自建房屋外包合同(2篇)
- 设计师求职简历幻灯片模板
- 菜籽油销售方案
- 车站爱心驿站活动方案
- 少年中国说英文版
- 防洪堤与拦河坝钢筋工程施工方案及关键性技术措施
- 100个红色经典故事【十八篇】
- 5G网络安全架构设计
- 2024电力人工智能样本增广技术架构要求
- 特种设备安全法全文
- 2024年国家能源集团公司招聘笔试参考题库含答案解析
- 幼儿园的小小科学家实验室主题班会课件
- 变电运维管理规定(试行)第3分册组合电器运维细则
评论
0/150
提交评论