版权说明:本文档由用户提供并上传,收益归属内容提供方,若内容存在侵权,请进行举报或认领
文档简介
新一代数据仓库:HAWQ偶数科技CEO,ApacheHAWQ创始人www.oushu.io@Copyright2016.Allrightsreserved公司简介HAWQ成功案例公司简介HAWQ成功案例@Copyright2016.Allrightsreserved 2014 2014201520162017201820192020亿美元全球大数据市场规模市场体量增长率%60%40%20%0%45.36%63.01%58.69%34746.87%41.53%90726.47%517000248520142015201920200市场体量增长率201620172018据市场规模67.96%@Copyright2016.AllrightsreservedQlikPowerBI数据安全QlikPowerBI数据安全PwaWarehouseInformaticaTalendKettleNoSQLNewSQL挖掘/机器学习/AISAS,SPSS,TensorflowCloud(公有云和私有云)全球数据仓库市场规模2016年达数百亿美金•Database:1962年出现–InvertedFileDatabaseSystemon•数据库的几个阶段–1960s:NavigationalDBMS(网状&层次模型)•IntegratedDataStore(IDS)•InformationManagementSystem(IMS)–1970s-1990s:SQL/RelationalDBMS•OLTP,Datawarehouse,MPP–2000s-Present:PostRelational•NoSQL(XML,KV,Graph,Tree),NewSQL,NewDW@Copyright2016.Allrightsreserved•查询优化和执行•索引与存储•事务处理@Copyright2016.Allrightsreserved1973TuringAward住在Harrison的所有客户@Copyright2016.Allrightsreserved1981TuringAwardSelectcustomer_nameFromcustomerWherecustomer_city=‘Harrison’;找出住在Harrison的所有客户elationalModelofDataforLargeSharedDataBanksJimGray1998TuringAwardr2014TuringAward@Copyright2016.AllrightsreservedalueCassandra:CQL–HBase:API•GraphModeleoj–Giraph/Pregel–XMLDatabaseB@Copyright2016.Allrightsreserved@Copyright2016.Allrightsreserved硬件配 适用场景架构share-nothing硬件/软件架构大多工业标准的x86服务器复杂的计算需求缺乏弹性面向传统BI分析不易调整几十个节点caGreenplumRedshift支持数据湖弹性伸缩,支持CaaS平台灵活配置硬件配 适用场景架构share-nothing硬件/软件架构大多工业标准的x86服务器复杂的计算需求缺乏弹性面向传统BI分析不易调整几十个节点caGreenplumRedshift支持数据湖弹性伸缩,支持CaaS平台灵活配置share-nothing硬件架构+软件实现distributedshared-storage盘磁盘缺乏弹性不易调整IOracle,DB2工业标准的x86服务器上千个节点owflake面向大数据和人工智能可扩展性表传统数仓share-storage硬件/软件架构@Copyright2016.AllrightsreservedL开L开源&开放&线性可AmazonAthenaSQL私有软件&闭源&非线性可扩展@Copyright2016.Allrightsreserved•SQLonHadoope–Snowflake(onS3),AmazonAthena(onS3)•Hybrid:有自己的存储,对外部存储可插拔–Impala@Copyright2016.Allrightsreserved@Copyright2016.AllrightsreservedadoopSQLonObjectStoreridStoragekSQLnowflakepalamiddlehtopmiddlehhhhhhhhoodmiddlemiddlemiddleododmiddle@Copyright2016.AllrightsreservedApacheHAWQ发展历程•2011年-常雷博士在EMC/Pivotal提出创意,HAWQ项目启动。•2013年-HAWQ1.0发布,性能是Hive的数百倍。•2014年-HAWQ为全球多家大型企业客户采用。•2015年-HAWQ开源成为Apache项目。•2016年-常雷博士及HAWQ核心团队创立偶数科技。•2017年-偶数得到国际顶级VC投资,致力于HAWQ的发展。•2017年-OushuDatabase3.0企业版本发布,全新执行器,世界上最快的数据仓库10倍性能提升@Copyright2016.AllrightsreservedInterconnectMirrorSegmentMirrorSegmentPrimarySegmentPrimarySegmentSegmenthostPrimarySegmentPrimarySegmentMirrorSegmentMirrorSegmentSegmenthosttInterconnectMirrorSegmentMirrorSegmentPrimarySegmentPrimarySegmentSegmenthostPrimarySegmentPrimarySegmentMirrorSegmentMirrorSegmentSegmenthosttaryegmentrorSegmentrrorSegmentSegmenthostreplicationreplicationMasterhost@Copyright2016.AllrightsreservedDegreeofParallelism=8#SegmentPerNode=4rimarySegmentMirrorSegmentrimarySegmentMirrorSegmentPrimarySegmentMirrorSegmentSegmenthostogreplicationPrimarySegmentMirrorSegmentPrimarySegmentMirrorSegmentDatanoderimarySegmentMirrorSegmentPrimarySegmentMirrorSegmenttrorSegmentaryegmentrrorSegmentreplicationogDatareplicationRack1Rack2•DegreeofParallelism=8•#SegmentPerNode=ogreplicationPrimarySegmentMirrorSegmentPrimarySegmentMirrorSegmentDatanoderimarySegmentMirrorSegmentPrimarySegmentMirrorSegmenttrorSegmentaryegmentrrorSegmentreplicationogDatareplicationRack1Rack2•DegreeofParallelism=8•#SegmentPerNode=4Issues:•Recoverycomplexity•Expansioncomplexity•Managementcomplexity(manysegmentspernode)•FixedDegreeofParallelism DatanodeDatanodeBNamenodeMasterhosttMetaOps@Copyright2016.AllrightsreservedSegmenthostSegmenthostSegmenthostSegmenthostMirrorMirrorSegmentMirrorMirrorSegmentrimaryrimarySegmentPrimaryPrimarySegmentDaDatanodeMasterhost SegmentSegmenttDatanodeStatelessmentSegmentegmentData DatanodereplicationSegmenthostSegmenthostSegmenthostRack2Rack1•DegreeofParallelism=8•#SegmentPerMasterhost SegmentSegmenttDatanodeStatelessmentSegmentegmentData DatanodereplicationSegmenthostSegmenthostSegmenthostRack2Rack1•DegreeofParallelism=8•#SegmentPerNode=2Issues:•RecoverycomplexityDatanodeHAWQ1.0GAArchitecture(2013)tCatalogSegmenthostMetaOps•Expansioncomplexity•Managementcomplexity(manysegmentspernode)•FixedDegreeofParallelism@Copyright2016.AllrightsreservedntDDatanodeNamenodeNamenode和PaaS/Docker云平台原生结合的并行SQL引擎MasterhostInterconnectvsegvsegvsegvsegvsegvsegtvsegvsegvsegvsegSegmentDatanodevsegvsegStatelessDatareplicationSegmenthostSegmenthostSegmenthost和PaaS/Docker云平台原生结合的并行SQL引擎MasterhostInterconnectvsegvsegvsegvsegvsegvsegtvsegvsegvsegvsegSegmentDatanodevsegvsegStatelessDatareplicationSegmenthostSegmenthostSegmenthostRack2Rack1•DegreeofParallelism=Any(#vseg)•#SegmentPerNode=1•Recoverycomplexity•Expansioncomplexity•FixedDegreeofParallelismDatanodeSegmentDatanodeHAWQ2.0:ArchitectureChange(2016Q2)界上第一个SegmenthostMetaOps@Copyright2016.AllrightsreservedCatalogResourceManagervsegvsegSegmentvsegvsegDatanodevsegvsegSegmentvsegvsegDatanodeNamenodeNamenodeMasterhosttvsegvsegHornetSegmentvsegHornetSegmentvsegDatanodeStatelessDatareplicationDatanodeDatanodeSegmentvsegvsegHornetDatanodeCatalogResourceManagerNamenodevsegvsegHornettHAWQ++3.0:HornetExecutionEngine(2017Q3)MasterhosttvsegvsegHornetSegmentvsegHornetSegmentvsegDatanodeStatelessDatareplicationDatanodeDatanodeSegmentvsegvsegHornetDatanodeCatalogResourceManagerNamenodevsegvsegHornettSegmenthostSegmentSegmenthostSegmenthost SegmenthostSegmenthostRack1MetaOpsHornetExecutionEngine:SIMD/Newhardwaretimesfaster@Copyright2016.Allrightsreserved@Copyright2016.Allrightsreserved单位(毫秒ms)rkselectcount(*)fromlineitem;selectcount(*)fromlineitem;AVERAGE@Copyright2016.Allrightsreserved单位(毫秒ms)rkRatioselectcount(l_orderkey)fromlineitem;selectcount(l_partkey)fromlineitem;selectcount(l_suppkey)fromlineitem;selectcount(l_linenumber)fromlineitem;selectcount(l_quantity)fromlineitem;selectcount(l_extendedprice)fromlineitem;selectcount(l_discount)fromlineitem;selectcount(l_tax)fromlineitem;selectcount(l_returnflag)fromlineitem;selectcount(l_linestatus)fromlineitem;selectcount(l_shipdate)fromlineitem;selectcount(l_commitdate)fromlineitem;selectcount(l_receiptdate)fromlineitem;selectcount(l_shipinstruct)fromlineitem;selectcount(l_shipmode)fromlineitem;selectcount(l_comment)fromlineitem;AVERAGE@Copyright2016.Allrightsreserved单位(毫秒ms)rkRatioselectsum(l_orderkey)fromlineitem;selectsum(l_partkey)fromlineitem;selectsum(l_suppkey)fromlineitem;selectsum(l_linenumber)fromlineitem;selectsum(l_quantity)fromlineitem;selectsum(l_extendedprice)fromlineitem;selectsum(l_discount)fromlineitem;selectsum(l_tax)fromlineitem;selectavg(l_orderkey)fromlineitem;selectavg(l_partkey)fromlineitem;selectavg(l_suppkey)fromlineitem;selectavg(l_linenumber)fromlineitem;selectavg(l_quantity)fromlineitem;selectavg(l_extendedprice)fromlineitem;selectavg(l_discount)fromlineitem;selectavg(l_tax)fromlineitem;AVERAGE@Copyright2016.Allrightsreserved单位(毫秒ms)ycountfromlineitemgroupby14countfromlineitemgroupbylpartkey;4127.987.10omlineitemgroupby6191363.5126.33roupby370.1530.71lineitemgroupby4929.786.03y392.4126.43groupbyl_tax;352.9929.38temgroupby545.8620.79329.3034.06638.5125.18tdatecountfromlineitemgroupby642.3125.16untfromlineitemgroupby647.1224.18823.0902630.6303omlineitemgroupby032.16AVERAGE(除去sparkOOM语句)300721.66@Copyright2016.Allrightsreserved单位(毫秒ms)rkRatioselectl_partkey,sum(l_partkey),avg(l_partkey)fromlineitemgroupbylpartkey;selectl_suppkey,sum(l_suppkey),avg(l_suppkey)fromlineitemgroupbyl_suppkey;selectl_linenumber,sum(l_linenumber),avg(l_linenumber)fromlineitemgroupbyllinenumber;selectl_quantity,sum(l_quantity),avg(l_quantity)fromlineitemgroupbyl_quantity;mlextendedpriceavg(l_extendedprice)fromlineitemgroupbyselectl_discount,sum(l_discount),avg(l_discount)fromlineitemgroupbyldiscount;selectl_tax,sum(l_tax),avg(l_tax)fromlineitemgroupbyltax;AVERAGE@Copyright2016.Allrightsreserved单位(毫秒ms)rkRatioselectl_partkey,l_suppkey,count(*)fromlineitemgroupbyl_partkey,l_suppkey;electlpartkeyllinenumbercountfromlineitemgroupbyl_partkey,l_linenumber;selectl_suppkey,l_extendedprice,count(*)fromlineitemgroupbylsuppkey,lextendedprice;selectl_partkey,l_shipmode,count(*)fromlineitemgroupbylpartkey,l_shipmode;selectl_partkey,l_shipdate,count(*)fromlineitemgroupbyl_partkey,l_shipdate;selectl_suppkey,l_tax,count(*)fromlineitemgroupbyl_suppkey,l_tax;selectl_shipdate,l_commitdate,count(*)fromlineitemgroupbylshipdatel_commitdate;selectcount(l_orderkey)fromlineitemgroupbyl_linenumber,l_quantity,l_tax;AVERAGE@Copyright2016.Allrightsreserved单位(毫秒ms)Oushuselectl_partkey+l_suppkey,count(*)fromlineitemgroupbyl_partkey+l_suppkey;4050.55316017.80selectl_partkey+1000fromlineitemgroupbyl_partkey+1000;2869.51270839.44selectl_tax*100fromlineitemgroupby426.141000523.48AVERAGEgroupby表达式2448.7322896.3313.57@@Copyright2016.Allrightsreserved单位(毫秒ms)selectl_partkey,count(*),groupbylpartkey;22selectl_suppkey,count(*),count(l_orderkey),sum(l_orderkey),avg(l_orderkey)fromlineitemgroupbylsuppkey;99.989.89selectl_linenumber,count(*),count(l_orderkey),sum(l_orderkey),avg(l_orderkey)fromlineitemgroupbyl_linenumber;698.1867selectl_quantity,count(*),count(l_orderkey),sum(l_orderkey),avg(l_orderkey)fromlineitemgroupbylquantity;702.6021selectl_discount,count(*),count(l_orderkey),sum(l_orderkey),avg(l_orderkey)fromlineitemgroupbyl_discount;741.1709selectl_tax,count(*),count(l_orderkey),sum(l_orderkey),avg(l_orderkey)fromlineitemgroupbyl_tax;670.6396selectl_returnflag,count(*),count(l_orderkey),sum(l_orderkey),avg(l_orderkey)fromlineitemgroupbylreturnflag;913.2303selectl_linestatus,count(*),count(l_orderkey),sum(l_orderkey),avg(l_orderkey)fromlineitemgroupbyl_linestatus;675.9441selectl_shipdate,count(*),count(l_ord
温馨提示
- 1. 本站所有资源如无特殊说明,都需要本地电脑安装OFFICE2007和PDF阅读器。图纸软件为CAD,CAXA,PROE,UG,SolidWorks等.压缩文件请下载最新的WinRAR软件解压。
- 2. 本站的文档不包含任何第三方提供的附件图纸等,如果需要附件,请联系上传者。文件的所有权益归上传用户所有。
- 3. 本站RAR压缩包中若带图纸,网页内容里面会有图纸预览,若没有图纸预览就没有图纸。
- 4. 未经权益所有人同意不得将文件中的内容挪作商业或盈利用途。
- 5. 人人文库网仅提供信息存储空间,仅对用户上传内容的表现方式做保护处理,对用户上传分享的文档内容本身不做任何修改或编辑,并不能对任何下载内容负责。
- 6. 下载文件中如有侵权或不适当内容,请与我们联系,我们立即纠正。
- 7. 本站不保证下载资源的准确性、安全性和完整性, 同时也不承担用户因使用这些下载资源对自己和他人造成任何形式的伤害或损失。
最新文档
- 不同分公司劳动合同
- 变更劳动合同补充协议
- 北京技术合同登记实务
- 辽宁省兴城市2024-2025学年七年级上学期期中历史试题(含答案)
- 河南省周口市商水县2024-2025学年八年级上学期期中测试物理卷(含答案)
- 《压缩面膜》规范
- 移动护理信息系统的设计
- 存包柜相关行业投资方案范本
- 腹部视诊课件
- 防治害虫的物理防治法课件
- 家用暖通合同范本
- 河道水体生态修复治理施工方案
- 劳务派遣人员工作培训及管理方案
- 2024年长春二道区公益性岗位招聘133名工作人员历年高频难、易错点500题模拟试题附带答案详解
- 统编版六年级语文上册《字音辨析》专项测试题带答案
- 期中试卷(1~4单元)(试题)-2024-2025学年五年级上册数学人教版
- 2025届湖北省黄冈市黄冈市高三上学期9月调研考试一模英语试题(含答案解析)
- 医院健康体检科高危异常检查结果登记追访制度
- 小学数学人教版-六年级上-第一单元-分数乘法-教材分析
- 高中英语试卷分析
- 骨科中医护理方案理论考试试题题库及答案
评论
0/150
提交评论