




Good is good, but better carries it.

SAS Data Mining Example: Ad Targeting Problem (English)
SAS Tutorial: Enterprise Miner

Case description
Suppose that you work for a mail order enterprise that sends out a catalog of furnishings and housewares each month. As part of an upcoming sales campaign, you want to distribute a special catalog that is devoted to fine dining and contains kitchenware, dishes, and flatware. It's too expensive to send this catalog to all of your customers, so you need to target those most likely to buy. You do this by developing a targeting model and then using it to produce a new mailing list.

You have an extensive record of customer purchases. The data includes variables that indicate whether customers bought kitchenware, dishes, or flatware in the past two years. This purchase history has been used to create the CUSTDET1 data set, which contains 49 variables with the following labels: Purchase, Dollars Spent, Yearly Income, Home Value, Order Frequency, Recency, Married, Name Prefix, Age, Sex, Telemarket Ind., Rents Apartment, Occupied [...]

[...] 0 in the DINEBIN(N)= formula field at the bottom of the window.
Click OK. The Create Variable window reappears with dining > 0 displayed in the Formula field.
Click OK. The new variable DINEBIN appears in the Transform Variables window.
Close the Transform Variables window. A confirmation box appears.
Select Yes to save your changes.

Modifying Attributes
You now need to identify DINEBIN as the model's target. This is done with the Data Set Attributes node.
Drag and drop the Data Set Attributes node onto the workspace to the right of the input data source node.
Connect the Transform Variable node to the Data Set Attributes node.
Double-click the Data Set Attributes node. The Data Set Attributes window appears.
Click the Variables tab.
Scroll down the list of variables until DINEBIN appears. Notice the grayed-out column Model Role and the white column New Model Role. Grayed-out columns reflect the original data set attributes and cannot be edited. Role refers to the use of each variable. Most variables are treated as input variables in an attempt to predict the target. If you scroll down the list of variables, you will see that Enterprise Miner considers certain variables unsuitable as inputs (e.g., dates, or variables with a single value). Such variables are given the role rejected.
Right-click in the column New Model Role to the right of the variable DINEBIN.
Select Set New Model Role from the pop-up menu.
Select target.

You are trying to target the buyers of dining wares (for whom the variable DINEBIN = 1). However, other variables in the data set contain the same information: KITCHEN, DISHES, and FLATWARE. DINEBIN has a value of 1 if the customer bought any dining ware. It is therefore necessary to exclude KITCHEN, DISHES, and FLATWARE from the analysis (assign them a "reject" status).

Note: Within the Data Set Attributes window, the column Measurement refers to measurement level. This is the range of values that is found in each variable. There are five possible assignments:
unary - one value; for example, a variable with a particular value that was used to create a data subset
binary - two values; for example, the variable MARITAL that contains No or Yes
nominal - more than two non-numeric values, but no implied order; for example, STATECOD that contains AK, AL, AR, AZ, etc.
ordinal - more than two but not more than ten numeric values, with implied order; for example, NUMCARS that contains values from 0 to 3
interval - more than ten numeric values; for example, AMOUNT that contains many different dollar values

A new order for values in Target Variable
When you build a model, Enterprise Miner considers the target event to be the first sorted value of the target variable. The default sort order is ascending. But the new target variable, DINEBIN, contains values of 0 and 1, with 1 representing the purchase of any dining wares. The values need to be in descending order for Enterprise Miner to aim at the intended target. To change the order of a target variable:
Click the Class Variables tab in the Data Set Attributes window.
Scroll down the list of variables until the target variable appears.
Right-click in the New Order column to the right of the target variable.
Select Set New Order from the pop-up menu.
Select Descending.

A new level
In the case of the data in Data Mining 2003, the new DINEBIN variable has been assigned the wrong measurement level.
Scroll down the Data Set Attributes window and look at the column Measurement to the right of DINEBIN. Because the variable contains only values of 0 and 1, the correct measurement level is binary.
Right-click in the New Measurement column to the right of the variable DINEBIN.
Select Set New Measurement from the pop-up menu.
Select binary. The new measurement level of DINEBIN is reflected in the window.

Data Partition
You have control over how the partitions are created. The Data Partition node provides several options.
Drag and drop the Data Partition node onto the workspace.
Connect the Data Set Attributes node to the Data Partition node.
Double-click the Data Partition node. Its window appears with the Partition tab in the foreground.

Build a Decision Tree Model
Drag and drop the Tree node onto the workflow.
Connect the Data Partition node to the Tree node.
Double-click the Tree node to open it. The Tree window appears with the Variables tab in the foreground. The Basic and Advanced tabs contain a set of criteria for tree building and evaluation.
If you have made changes to the default settings for this node, a new window prompts you to enter a model name.
Enter Direct-Tree in the Model Name field.
Enter Direct Marketing Decision Tree Model in the Model Description field.
Select OK. The Save Model As window closes and returns you to the workflow.

Views of tree structure
Right-click the Tree node.
Select Run from the pop-up menu. An alert box appears.
Select Yes to view your results. The Tree Results window appears with the All tab in the foreground.
Select the Plot tab to view performance on training and validation sets.
Select View Tree to view tree structure.

Evaluation/Assessment
The Assessment node takes output from any modeling node and checks the model's accuracy against data in the test partition.
Drag and drop the Assessment node onto the workflow below the Tree node.
Connect the Tree node to the Assessment node.
Right-click the Assessment node.
Select Run from the pop-up menu. As the Data Mining 2003 data is processed in the workflow, each node in turn displays a thick green border. When it does, you are prompted to view the results.
Select Yes. The Assessment Tool window appears.
Note: The Assessment node does not analyze all available data; it relies on sampling to produce quick results. As a consequence, different lift charts show variations even when they use the same data.

Lift charts
The Models tab of the Assessment Tool window lists models that you have defined. Scrolling to the right displays statistics for each model.
Select the Direct-Tree model.
Select Tools > Lift Chart from the SAS menu bar. The Lift Chart window appears. The blue baseline represents the response rate that you obtain by not using a model, but by sending the catalog to everyone in your customer database.

Diagnosis Chart (Confusion Matrix)
Select Tools > Diagnosis chart.

Report results
The Reporter node is the Enterprise Miner tool for drawing a detailed map of the data mining process.
Drag and drop the Reporter node to the workspace.
Connect the Assessment node to the Reporter node.
Right-click the Reporter node.
Select Run from the pop-up menu.
Select Open when the report generation message box appears. An extensive report on your mining workflow appears in your default Web browser.

Applying the model on a data set (Training or other)
The Score! node performs this task.
Drag and drop the Score! node to the workspace.
Connect the Tree node to the Score! node.
Drag and drop the Input Data Source node onto the workspace above the Score! node.
Connect the Input Data Source node to the Score! node.
Double-click the Input Data Source node. The Input Data Source window appears.
As you did at the beginning of the workflow, specify a data set as the Source Data. For now, select the EMDATA directory, and select the file that begins with tst (plus some random characters) to be the source data. This is the training partition.
Change the Role of the data set from RAW to SCORE.
Close the Input Data Source window. A confirmation box appears.
Select Yes to save your changes.

Gain insight
Double-click the Score! node that is already in the workflow. The Score! window appears.
Select the radio button that is next to Apply training data score code to score data set.
Close the Score! window. A confirmation box appears.
Select Yes to save your changes.
Drag and drop the Insight node onto the workspace below the Score! node.
Connect the Score! node to the Insight node.
Double-click the Insight node. The Insight Settings window appears.
Click the Select button that is next to the Data set field. The Imports Map window appears.
You need to click through the hierarchical list of data sets to identify the data set that is associated with the score data. This data set typically has an SD prefix, followed by a string of random alphanumeric characters.

Find the file
Click the plus sign (+) to the left of Score! Apply in the data set list. It expands to show SAS_DATA_SETS.
Click the plus sign (+) to the left of SAS_DATA_SETS. It expands to show four data sets.
Select the data set whose fileref begins with the letters SD. The Role and Description fields identify it as score data.
Click the OK button. You return to the Data tab.
In the Insight based on field, click the radio button that is next to Entire data set.
Close the I...
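
Supplementary sketch: the tutorial describes three rules in prose (DINEBIN is 1 when the customer bought any dining ware, the five measurement levels, and the target event being the first sorted value of the target variable). The Python below is a minimal illustration of those rules outside Enterprise Miner. It is not Enterprise Miner's code; the function names measurement_level and target_event and the sample dining counts are hypothetical, and the handling of missing values is an assumption.

```python
from numbers import Number


def measurement_level(values):
    """Classify a column using the five levels listed in the tutorial's Note."""
    # Assumption: missing values (None) do not count as a level.
    distinct = {v for v in values if v is not None}
    if not distinct:
        raise ValueError("column has no non-missing values")
    if len(distinct) == 1:
        return "unary"
    if len(distinct) == 2:
        return "binary"
    numeric = all(
        isinstance(v, Number) and not isinstance(v, bool) for v in distinct
    )
    if not numeric:
        return "nominal"      # more than two non-numeric values
    if len(distinct) <= 10:
        return "ordinal"      # three to ten numeric values
    return "interval"         # more than ten numeric values


def target_event(values, order="ascending"):
    """Return the first sorted value, which the tutorial calls the target event."""
    levels = sorted(set(values), reverse=(order == "descending"))
    return levels[0]


# Hypothetical dining purchase counts for four customers.
dining = [0, 2, 0, 1]
# DINEBIN is 1 if the customer bought any dining ware, else 0.
dinebin = [1 if d > 0 else 0 for d in dining]

print(measurement_level(dinebin))            # binary
print(target_event(dinebin))                 # 0: non-buyers would be the event
print(target_event(dinebin, "descending"))   # 1: buyers are the event
```

With the default ascending order the event would be 0 (non-buyers), which is why the tutorial switches DINEBIN to descending order before building the tree.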