倾向评分配对简介培训课件_第1页
倾向评分配对简介培训课件_第2页
倾向评分配对简介培训课件_第3页
倾向评分配对简介培训课件_第4页
倾向评分配对简介培训课件_第5页
已阅读5页,还剩34页未读 继续免费阅读

下载本文档

版权说明:本文档由用户提供并上传,收益归属内容提供方,若内容存在侵权,请进行举报或认领

文档简介

倾向评分配对简介OutlineDay1Overview:WhyPSM?HistoryanddevelopmentofPSMCounterfactualframeworkThefundamentalassumptionGeneralprocedureSoftwarepackagesReview&illustrationofthebasicmethodsdevelopedbyRosenbaumandRubin2倾向评分配对简介Outline(continued)ReviewandillustrationofHeckman’sdifference-in-differencesmethodProblemswiththeRosenbaum&Rubin’smethodDifference-in-differencesmethodNonparametricregressionBootstrappingDay2Practicalissues,concerns,andstrategiesQuestionsanddiscussions3倾向评分配对简介PSMReferencesCheckwebsite:/VRC/Lectures/index.htm(Linktofile“Day1b.doc”)4倾向评分配对简介WhyPSM?(1)Need1:AnalyzecausaleffectsoftreatmentfromobservationaldataObservationaldata-thosethatarenotgeneratedbymechanismsofrandomizedexperiments,suchassurveys,administrativerecords,andcensusdata.Toanalyzesuchdata,anordinaryleastsquare(OLS)regressionmodelusingadichotomousindicatoroftreatmentdoesnotwork,becauseinsuchmodeltheerrortermiscorrelatedwithexplanatoryvariable.

5倾向评分配对简介WhyPSM?(2)Theindependentvariablewisusuallycorrelatedwiththeerrorterm.Theconsequenceisinconsistentandbiasedestimateaboutthetreatmenteffect.6倾向评分配对简介WhyPSM?(3)Need2:RemovingSelectionBiasinProgramEvaluationFisher’srandomizationidea.Whethersocialbehavioralresearchcanreallyaccomplishrandomizedassignmentoftreatment?ConsiderE(Y1|W=1)–E(Y0|W=0).AddandsubtractE(Y0|W=1),wehave{E(Y1|W=1)–E(Y0|W=1)}+{E(Y0|W=1)-E(Y0|W=0)}

Crucial:E(Y0|W=1)E(Y0|W=0)

Thedebateamongeducationresearchers:theimpactofCatholicschoolsvis-à-vispublicschoolsonlearning.TheCatholicschooleffectisthestrongestamongthoseCatholicstudentswhoarelesslikelytoattendCatholicschools(Morgan,2001).7倾向评分配对简介WhyPSM?(4)Heckman&Smith(1995)FourImportantQuestions:

Whataretheeffectsoffactorssuchassubsidies,advertising,locallabormarkets,familyincome,race,andsexonprogramapplicationdecision?Whataretheeffectsofbureaucraticperformancestandards,locallabormarketsandindividualcharacteristicsonadministrativedecisionstoacceptapplicantsandplacetheminspecificprograms?Whataretheeffectsoffamilybackground,subsidiesandlocalmarketconditionsondecisionstodropoutfromaprogramandonthelengthoftimetakentocompleteaprogram?Whatarethecostsofvariousalternativetreatments?8倾向评分配对简介HistoryandDevelopmentofPSMThelandmarkpaper:Rosenbaum&Rubin(1983).Heckman’searlyworkinthelate1970sonselectionbiasandhiscloselyrelatedworkondummyendogenousvariables(Heckman,1978)addressthesameissueofestimatingtreatmenteffectswhenassignmentisnonrandom.Heckman’sworkonthedummyendogenousvariableproblemandtheselectionmodelcanbeunderstoodasageneralizationofthepropensity-scoreapproach(Winship&Morgan,1999).Inthe1990s,Heckmanandhiscolleaguesdevelopeddifference-in-differencesapproach,whichisasignificantcontributiontoPSM.Ineconomics,theDIDapproachanditsrelatedtechniquesaremoregenerallycallednonexperimentalevaluation,oreconometricsofmatching.9倾向评分配对简介TheCounterfactualFrameworkCounterfactual:whatwouldhavehappenedtothetreatedsubjects,hadtheynotreceivedtreatment?Thekeyassumptionofthecounterfactualframeworkisthatindividualsselectedintotreatmentandnontreatmentgroupshavepotentialoutcomesinbothstates:theoneinwhichtheyareobservedandtheoneinwhichtheyarenotobserved(Winship&Morgan,1999).Forthetreatedgroup,wehaveobservedmeanoutcomeundertheconditionoftreatmentE(Y1|W=1)andunobservedmeanoutcomeundertheconditionofnontreatmentE(Y0|W=1).Similarly,forthenontreatedgroupwehavebothobservedmeanE(Y0|W=0)andunobservedmeanE(Y1|W=0).10倾向评分配对简介TheCounterfactualFramework(Continued)Underthisframework,anevaluationofE(Y1|W=1)-E(Y0|W=0)canbethoughtasaneffortthatusesE(Y0|W=0)toestimatethecounterfactualE(Y0|W=1).ThecentralinterestoftheevaluationisnotinE(Y0|W=0),butinE(Y0|W=1).Therealdebateabouttheclassicalexperimentalapproachcentersonthequestion:whetherE(Y0|W=0)reallyrepresentsE(Y0|W=1)?

11倾向评分配对简介FundamentalAssumption

Rosenbaum&Rubin(1983)Differentversions:“unconfoundedness”&“ignorabletreatmentassignment”(Rosenbaum&Robin,1983),“selectiononobservables”(Barnow,Cain,&Goldberger,1980),“conditionalindependence”(Lechner1999,2002),and“exogeneity”(Imbens,2004)12倾向评分配对简介1-to-1or1-to-nMatch

NearestneighbormatchingCalipermatching

Mahalanobis

MahalanobiswithpropensityscoreaddedRunLogisticRegression:

Dependentvariable:Y=1,ifparticipate;Y=0,otherwise.Chooseappropriateconditioning(instrumental)variables.Obtainpropensityscore:predictedprobability(p)orlog[(1-p)/p].GeneralProcedureMultivariateanalysisbasedonnewsample

1-to-1or1-to-nmatchandthenstratification(subclassification)KernelorlocallinearweightmatchandthenestimateDifference-in-differences(Heckman)EitherOr13倾向评分配对简介NearestNeighborandCaliperMatchingNearestneighbor:ThenonparticipantwiththevalueofPjthatisclosesttoPiisselectedasthematch.Caliper:Avariationofnearestneighbor:Amatchforpersoniisselectedonlyifwhereisapre-specifiedtolerance.Recommendedcalipersize:.25p1-to-1Nearestneighborwithincaliper(Theisacommonpractice)1-to-nNearestneighborwithincaliper14倾向评分配对简介MahalanobisMetricMatching:(withorwithoutreplacement)

Mahalanobiswithoutp-score:Randomlyorderingsubjects,calculatethedistancebetweenthefirstparticipantandallnonparticipants.Thedistance,d(i,j)canbedefinedbytheMahalanobisdistance:whereuandvarevaluesofthematchingvariablesforparticipantiandnonparticipantj,andCisthesamplecovariancematrixofthematchingvariablesfromthefullsetofnonparticipants.Mahalanobismetricmatchingwithp-scoreadded(touandv).NearestavailableMahalandobismetricmatchingwithincalipersdefinedbythepropensityscore(needyourownprogramming).15倾向评分配对简介Stratification(Subclassification)Matchingandbivariateanalysisarecombinedintooneprocedure(nostep-3multivariateanalysis):Groupsampleintofivecategoriesbasedonpropensityscore(quintiles).Withineachquintile,calculatemeanoutcomefortreatedandnontreatedgroups.Estimatethemeandifference(averagetreatmenteffects)forthewholesample(i.e.,allfivegroups)andvarianceusingthefollowingequations:16倾向评分配对简介MultivariateAnalysisatStep-3Wecouldperformanykindofmultivariateanalysisweoriginallywishedtoperformontheunmatcheddata.Theseanalysesmayinclude:multipleregressiongeneralizedlinearmodelsurvivalanalysisstructuralequationmodelingwithmultiple-groupcomparison,andhierarchicallinearmodeling(HLM)Asusual,weuseadichotomousvariableindicatingtreatmentversuscontrolinthesemodels.17倾向评分配对简介VeryUsefulTutorialforRosenbaum&Rubin’sMatchingMethodsD’Agostino,R.B.(1998).Propensityscoremethodsforbiasreductioninthecomparisonofatreatmenttoanon-randomizedcontrolgroup.StatisticsinMedicine17,2265-2281.18倾向评分配对简介SoftwarePackagesThereiscurrentlynocommercialsoftwarepackagethatoffersformalprocedureforPSM.InSAS,LoriParsonsdevelopedseveralMacros(e.g.,theGREEDYmacrodoesnearestneighborwithincalipermatching).InSPSS,Dr.JohnPainterofJordanInstitutedevelopedaSPSSmacrotodosimilarworksasGREEDY(/VRC/Lectures/index.htm).WehaveinvestigatedseveralcomputingpackagesandfoundthatPSMATCH2(developedbyEdwinLeuvenandBarbaraSianesi[2003],asauser-suppliedroutineinSTATA)isthemostcomprehensivepackagethatallowsuserstofulfillmosttasksforpropensityscorematching,andtheroutineisbeingcontinuouslyimprovedandupdated.19倾向评分配对简介DemonstrationofRunningSTATA/PSMATCH2:

Part1.Rosenbaum&Rubin’sMethods

(Linktofile“Day1c.doc”)20倾向评分配对简介ProblemswiththeConventional(PriortoHeckman’sDID)ApproachesEqualweightisgiventoeachnonparticipant,thoughwithincaliper,inconstructingthecounterfactualmean.Lossofsamplecasesdueto1-to-1match.Whatdoestheresamplerepresent?Externalvalidity.It’sadilemmabetweeninexactmatchandincompletematch:whiletryingtomaximizeexactmatches,casesmaybeexcludedduetoincompletematching;whiletryingtomaximizecases,inexactmatchingmayresult.21倾向评分配对简介WeightsW(i.,j)(distancebetweeniandj)canbedeterminedbyusingoneoftwomethods:Kernelmatching:whereG(.)isakernelfunctionandnisabandwidthparameter.

Heckman’sDifference-in-DifferencesMatchingEstimator(2)23倾向评分配对简介Locallinearweightingfunction(lowess):

Heckman’sDifference-in-DifferencesMatchingEstimator(3)24倾向评分配对简介AReviewofNonparametricRegression

(CurveSmoothingEstimators)IamgratefultoJohnFox,theauthorofthetwoSagegreenbooksonnonparametricregression(2000),forhisprovisionoftheRcodetoproducetheillustratingexample.25倾向评分配对简介WhyNonparametric?WhyParametricRegressionDoesn’tWork?26倾向评分配对简介Focalx(120)

The120thorderedxSaintLucia:x=3183y=74.8Thewindow,calledspan,contains.5N=95observationsTheTask:DeterminingtheY-valueforaFocalPointX(120)27倾向评分配对简介TricubekernelweightsWeightswithintheSpanCanBeDeterminedbytheTricubeKernelFunction28倾向评分配对简介TheY-valueatFocalX(120)IsaWeightedMeanWeightedmean=71.1130129倾向评分配对简介TheNonparametricRegressionLineConnectsAll190AveragedYValues30倾向评分配对简介ReviewofKernelFunctionsTricubeisthedefaultkernelinpopularpackages.Gaussiannormalkernel:Epanechnikovkernel–parabolicshapewithsupport[-1,1].Butthekernelisnotdifferentiableatz=+1.Rectangularkernel(acrudemethod).31倾向评分配对简介LocalLinearRegression

(Alsoknownaslowessorloess)AmoresophisticatedwaytocalculatetheYvalues.Insteadofconstructingweightedaverage,itaimstoconstructasmoothlocallinearregressionwithestimated0and1thatminimizes:whereK(.)isakernelfunction,typicallytricube.32倾向评分配对简介TheLocalAverageNowIsPredictedbyaRegressionLine,InsteadofaLineParalleltotheX-axis.33倾向评分配对简介AsymptoticPropertiesoflowessFan(1992,1993)demonstratedadvantagesoflowessovermorestandardkernelestimators.Heprovedthatlowesshasnicesamplingpropertiesandhighminimaxefficiency.InHeckman’sworkspriorto1997,heandhisco-authorsusedthekernelweights.Butsince1997theyhaveusedlowess.Inpracticeit’sfairlycomplicatedtoprogramtheasymptoticproperties.NosoftwarepackagesprovideestimationoftheS.E.forlowess.Inpractice,oneusesS.E.estimatedbybootstrapping.34倾向评分配对简介BootstrapStatisticsInference(1)Itallowstheusertomakeinferenceswithoutmakingstrongdistributionalassumptionsandwithouttheneedforanalyticformulasforthesamplingdistribution’sparameters.Basicidea:treatthesampleasifitisthepopulation,andapplyMonteCarlosamplingtogenerateanempiricalestimateofthestatistic’ssamplingdistribution.Thisisdonebydrawingalargenumberof“resamples”ofsizenfromthisoriginalsamplerandomlywithreplacement.AcloselyrelatedideaistheJackknife:“droponeout”.Thatis,itsystematicallydropsoutsubsetsofthedataoneatatimeandassessesthevariationinthesamplingdistributionofthestatisticsofinterest.35倾向评分配对简介BootstrapStatisticsInference(2)Afterobtainingestimatedstandarderror(i.e.,thestandarddeviationofthesampling

温馨提示

  • 1. 本站所有资源如无特殊说明,都需要本地电脑安装OFFICE2007和PDF阅读器。图纸软件为CAD,CAXA,PROE,UG,SolidWorks等.压缩文件请下载最新的WinRAR软件解压。
  • 2. 本站的文档不包含任何第三方提供的附件图纸等,如果需要附件,请联系上传者。文件的所有权益归上传用户所有。
  • 3. 本站RAR压缩包中若带图纸,网页内容里面会有图纸预览,若没有图纸预览就没有图纸。
  • 4. 未经权益所有人同意不得将文件中的内容挪作商业或盈利用途。
  • 5. 人人文库网仅提供信息存储空间,仅对用户上传内容的表现方式做保护处理,对用户上传分享的文档内容本身不做任何修改或编辑,并不能对任何下载内容负责。
  • 6. 下载文件中如有侵权或不适当内容,请与我们联系,我们立即纠正。
  • 7. 本站不保证下载资源的准确性、安全性和完整性, 同时也不承担用户因使用这些下载资源对自己和他人造成任何形式的伤害或损失。

评论

0/150

提交评论