版权说明:本文档由用户提供并上传,收益归属内容提供方,若内容存在侵权,请进行举报或认领
文档简介
1、We are crossing the bridge now .第1页/共96页Statistics OverviewDot PlotBox PlotHistogramBar ChartTrend ChartChartsFrequency DistributionTablesCharts & TablesMeanMedianModeLocationRangeStandard DeviationVarianceDispersionSkewnessKurtosisShapeNumerical MeasuresDescriptive StatisticsPoint EstimateInter
2、val EstimateParameter EstimationParametric MethodsNonparametric MethodsHypothesis TestingInferential StatisticsStatistics第2页/共96页What will be taught.Tools to be taught Parameter Estimation Hypothesis Testing ANOVAInferential Statisticscomprises those methods concerned with the analysis of a subset o
3、f data leading to predictions or inferences about the entire set of data第3页/共96页Learning ObjectiveslWhat is a Probability Distribution?Experiment, Sample Space, EventRandom Variable, Probability Functions (pmf, pdf, cdf)lDiscrete DistributionsBinomial DistributionPoisson DistributionlContinuous Dist
4、ributionsNormal DistributionExponential DistributionlSampling DistributionsZ Distributiont Distributionc2 DistributionF Distribution第4页/共96页lAs we progress from description of data towards inference of data, an important concept is the idea of a probability distribution.lTo appreciate the notion of
5、a probability distribution, we need to review various fundamental concepts related to it:Experiment, Sample Space, EventRandom VariableWhat do we mean by inference of data?第5页/共96页ExperimentlAn experiment is any activity that generates a set of data, which may be numerical or not numerical.1, 2, .,
6、6(a)Throwing a diceExperiment generates numerical / discrete dataPinsStainsRejectAccept(b)Inspecting for stain marksExperiment generates attribute dataPins(c)Measuring shaft 10.53 mm10.49 mm10.22 mm10.29 mm11.20 mmExperiment generates continuous data第6页/共96页Random ExperimentlIf we throw the dice aga
7、in and again, or produce many shafts from the same process, the outcomes will generally be different, and cannot be predicted in advance with total certainty.lAn experiment which can result in different outcomes, even though it is repeated in the same manner every time, is called a random experiment
8、.第7页/共96页Sample SpacelThe collection of all possible outcomes of an experiment is called its sample space.1, 2, ., 6- Tossing of a dicePass, Fail- Inspecting for stain marksAll possible values- Measuring shaft between 0 and 10mm)EventlAn outcome, or a set of outcomes, from a random experiment is cal
9、led an event, i.e. it is a subset of the sample space.第8页/共96页EventlExample 1: Some events from tossing of a dice.Event 1: the outcome is an odd numberEvent 2: the outcome is a number 4 lExample 2: Some events from measuring shaft :Event 1: the outcome is a diameter meanEvent 2: the outcome is a par
10、t failing specs. E2 = x USL E2 = 5, 6 E1 = 1, 3, 5 E1= x m第9页/共96页Random VariablelFrom a same experiment, different events can be derived depending on which aspects of the experiment we consider important.lIn many cases, it is useful and convenient to define the aspect of the experiment we are inter
11、ested in by denoting the event of interest with a symbol (usually an uppercase letter), e.g.: Let X be the event “the number of a dice is odd”.Let W be the event “the shaft is within specs.”.第10页/共96页Random VariablelWe have defined a function that assigns a real number to an experimental outcome wit
12、hin the sample space of the random experiment.lThis function (X or W in our examples) is called a random variable because: The outcomes of the same event are clearly uncertain and are variable from one outcome to another.Each outcome has an equal chance of being selected.PinsMeasuring shaft X = Part
13、s out of specs.(LSL = 8 mm,USL = 10 mm)0.,7.99998, 7.99999, 8, 8,00001,9.99999, 10, 10.00001, 10.00002, LSLUSL第11页/共96页ProbabilitylTo quantify how likely a particular outcome of a random variable can occur, we typically assign a numerical value between 0 and 1 (or 0 to 100%).This numerical value is
14、called the probability of the outcome.lThere are a few ways of interpreting probability. A common way is to interpret probability as a fraction (or proportion) of times the outcome occurs in many repetitions of the same random experiment.This method is the relative frequency approach or frequentist
15、approach to interpreting probability.第12页/共96页Probability DistributionlWhen we are able to assign a probability to each possible outcome of a random variable X, the full description of all the probabilities associated with the possible outcomes is called a probability distribution of X.lA probabilit
16、y distribution is typically presented as a curve or plot that has:All the possible outcomes of X on the horizontal axisThe probability of each outcome on the vertical axis第13页/共96页Normal DistributionExponential DistributionUniform DistributionBinomial DistributionDiscrete Probability Distributions (
17、Theoretical)Continuous Probability Distributions (Theoretical)第14页/共96页Empirical Distributions Created from actual observations. Usually represented as histograms. Empirical distributions, like theoretical distributions, apply to both discrete and continuous distributions.第15页/共96页 Three common impo
18、rtant characteristics:Shape- defines nature of distributionCenter- defines central tendency of dataSpread- defines dispersion of data(or Dispersion, or Scale)Properties of DistributionsExponential DistributionUniform Distribution第16页/共96页ShapelDescribes how the probabilities of all the possible outc
19、omes are distributed.lCan be described mathematically with an equation called a probability function, e.g:Probability functionLowercase letter represents a specific value of random variable X xexfx22121m f(x) means P(X = x)第17页/共96页Probability Functions For a discrete distribution,f(x) called is the
20、 probabilitymass function (pmf), e.g.: For a continuous distribution,f(x) is called the probabilitydensity function (pdf), e.g.: n,0,1,2,xp1pxnxPxnx 0,1tettft00f(t)123 = 4210.5第18页/共96页Properties of Distributions The total probability for any distribution sums to 1.In a discrete distribution,probabi
21、lity is representedas height of the bar.In a continuous distribution,probability is representedas area under the curve(pdf), between two points.Binomial DistributionNormal Distribution第19页/共96页Properties of DistributionsProbability of An Exact Value Under PDF is Zero! For a continuous random variabl
22、e, the probability of an exact value occurring is theoretically 0 because a line on a pdf has 0 width, implying: In practice, if we obtain a particular value, e.g. 12.57, of a random variable X, how do we interpret the probability of 12.57 happening?It is interpreted as the probability of X assuming
23、 a value within a small interval around 12.57, i.e. 12.565, 12.575.This is obtained by integrating the area under the pdf between 12.565 and 12.575.P(X = x) = 0for a continuousrandom variable第20页/共96页Properties of DistributionsExponential DistributionArea of a line is zero!f(9.5) = P(X = 9.5) = 0To
24、get probability of 20.0, integrate area between 19.995 and 20.005, i.e.P(19.995 X 10n) for inspection.lEach part is classified asaccept or reject.Binomial DistributionReject rate = pSample size (n)第28页/共96页Binomial ExperimentlAssuming we have a process that is historically known to produce p reject
25、rate.p can be used as the probability of finding a failed unit each time we draw a part from the process for inspection.lLets pull a sample of n partsrandomly from a large population ( 10n) for inspection.lEach part is classified asaccept or reject.Binomial Distribution1. For each trial (drawing a u
26、nit), the probability of success is constant.2. Trials are independent; result of a unit does not influence outcome of next unit3. Each trial results in only two possible outcomes.A binomial experiment!第29页/共96页Probability Mass FunctionlIf each binomial experiment (pulling n parts randomly for pass/
27、fail inspection) is repeated several times, do we see the same x defective units all the time?lThe pmf that describes how the x defective units (called successes) are distributed is given as:Binomial Distribution n ,0,1,2,xp1pxnxPxnxProbability of getting x defective units(x successes)Using a sample
28、 size of n units(n trials)Given that the overall defective rate is p(probability of success is p) 第30页/共96页ApplicationslThe binomial distribution is extensively used to model results of experiments that generate binary outcomes, e.g. pass/fail, go/nogo, accept/reject, etc.In industrial practice, it
29、is used for data generated from counting of defectives, e.g.:1. Acceptance Sampling2. p-chartBinomial DistributionBinomial Distribution0.000.050.100.150.200.250.30012345678Number of Rejects (X)Probability of Finding X Rejects xnxp1pxnxP第31页/共96页Example 1If a process historically gives 10% reject rat
30、e (p = 0.10), what is the chance of finding 0, 1, 2 or 3 defectives within a sample of 20 units (n = 20)?1.Binomial Distribution n ,0,1,2,xp1pxnxPxnx .,0020101100200P1xfor .,0etc101101201P2xfor121第32页/共96页Example 1 (contd)2. These probabilities can be obtained from Minitab:Calc Probability Distribut
31、ions BinomialBinomial DistributionP(x)n = 20p = 0.1Specify column containing x defectivesSpecify column to store results第33页/共96页Example 1 (contd)3. Create its pmf from Minitab and read off the answers:Binomial DistributionlSpecify column of possible outcomes, x: 0 to 20 defectives.lCompute and stor
32、e results, P(x), as shown previously:Calc Probability Distributions Binomial.lCreate a chart of the pmf:lGraph ChartDisplay data labelsSelect range of x to plotSpecify axis titles第34页/共96页Example 1 (contd)Binomial DistributionBinomial Distribution0.1220.2700.2850.1900.0900.0320.0090.0020.0000.000.05
33、0.100.150.200.250.30012345678No. of Defectives (x)Probability of Finding x Defectives n ,0,1,2,xp1pxnxPxnxFrom Excel:From Minitab:What is the probability of getting 2 defectives or less?第35页/共96页Example 1 (contd)lFor the 2 previous charts, the x-axis denotes the number of defective units, x.lIf we d
34、ivide each x valueby constant sample size, n,and re-express the x-axisas a proportion defectivep-axis, the probabilitiesdo not change.Binomial Distribution第36页/共96页The location, dispersion and shape of a binomial distribution are affected by the sample size, n, and defective rate, p.Parameters of Bi
35、nomial DistributionParameters of the distribution第37页/共96页Binomial DistributionNormal Approximation to the BinomialDepending on the values of n and p, the binomial distributions are a family of distributions that can be skewed to the left or right.Under certain conditions (combinations of n and p),
36、the binomial distribution approximately approaches the shape of a normal distribution:For p 0.5,np 5For p far from 0.5 (smaller or larger),np 10第38页/共96页Binomial DistributionMean and Variance Although n and p pin down a specific binomial distribution, often the mean and variance of the distribution
37、are used in practical applications such as the p-chart.The mean and variance of a binomial distribution:ornpnpp12ppmpnnpppnnp12第39页/共96页40第40页/共96页ApplicationslThe Poisson distribution is a useful model for any random phenomenon that occurs on a per unit basis:Per unit areaPer unit volumePer unit ti
38、me, etc.lA typical application is as amodel of number of defectsin a unit of product, e.g.:Number of cracks per 10m rollinspection in production of continuous rolls of sheetmetal.Number of particles per cm2 of partPoisson DistributionDefect rate = Inspection units (n)# defectsper unit第41页/共96页Poisso
39、n ProcesslThe Poisson distribution is derived based on a random experiment called a Poisson process.lLets look at the inspectionof 10m roll of sheetmetal again:Each 10m roll represents asubinterval of a continuousroll (interval) of real numbers.In the entire production of thecontinuous roll, defects
40、 occurrandomly and results of aninspected unit does not influenceoutcome of the next unit.Conceptually, the sheetmetal can be partitioned into sub-intervals until each is small enough that:1. There is either 1 defect or none within the subinterval, i.e. probability of more than 1 defect is zero.2. P
41、robability of 1 count in any subinterval is the same.3. Probability of 1 count increases proportionately as subinterval size increases. 4. Outcome in each.subinterval is independent of other subintervals第42页/共96页Probability Mass FunctionlIf each Poisson process (pulling an inspection unit randomly t
42、o count defects) is repeated over time, do we see the same x defects per unit all the time?lThe pmf that describes how the x defects (called counts) per unit are distributed is given as: 0,1,2,xx!exPxProbability of getting x defects per inspection unit(x counts)Given that the overall defects per uni
43、t is (defect rate is )第43页/共96页Example 2If a process is historically known to give 4.0 defects per unit ( = 4), what is the chance of finding 0, 1, 2 or 3 defects per unit?1. 0,1,2,xx!exPx .0!e0P0,xfor04 .etc1!e1P1,xfor14第44页/共96页Example 2 (contd)2. These probabilities can be obtained from Minitab:C
44、alc Probability Distributions Poisson = 4.0P(x)Specify column containing x defectsSpecify column to store results第45页/共96页Example 2 (contd)3. Create its pmf from Minitab and read off the answers:Poisson DistributionlSpecify column of possible outcomes, x: 0 to 20 defects.lCompute and store results,
45、P(x), as shown previously:Calc Probability Distributions Binomial.lCreate a chart of the pmf:lGraph ChartDisplay data labelsSelect range of x to plotSpecify axis titles第46页/共96页Example 2 (contd)Poisson DistributionFrom Excel:From Minitab:What is the probability of getting 2 defects or less? 0,1,2,xx
46、!exPx第47页/共96页The location, dispersion and shape of a Poisson distribution are affected by the mean, .Parameter of the Poisson DistributionParameter of the distribution第48页/共96页Poisson DistributionNormal Approximation to the PoissonThe Poisson distributions are generally skewed to the right. For 15,
47、 the Poisson distribution approximately approaches the normal distribution.Poisson Approximation to the BinomialThe binomial distribution can be shown to approach the Poisson distribution in its limiting conditions, i.e.:when p is very small (approaching zero) n is large (approaching infinity)This a
48、llows the Poisson pmf to be used (easier) when a binomial experiment assumes above conditionsnp = = constant第49页/共96页Summary of ApproximationsBinomial p 10 p Poisson Normal 第50页/共96页Poisson DistributionMean and Variance Although pins down a specific Poisson distribution, often the mean and variance
49、of the distribution are used in practical applications such as the c-chart.The mean and variance of a Poisson distribution:2What happens to the variability as the mean of the Poisson distribution increases?第51页/共96页ExercisesA process yields a defective rate of 10%. For a sampling plan of 10 units, d
50、etermine the probability distribution (pmf and cdf).A certain process yields a defect rate of 2.8 dpmo. For a million opportunities inspected, determine the probability distribution (pmf and cdf).第52页/共96页53lNormal DistributionlExponential DistributionlWeibull Distribution第53页/共96页54Normal Distribut
51、ion第54页/共96页The most widely used model for the distribution of continuous random variables.Arises in the study of numerous natural physical phenomena, such as the velocity of molecules, as well as in one of the most important findings, the Central Limit Theorem.第55页/共96页Many natural phenomena and ma
52、n-made processes are observed to have normal distributions, or can be closely represented as normally distributed.For example, the length of a machined part is observed to vary about its mean due to:temperature drift, humidity change, vibrations, cutting angle variations, cutting tool wear, bearing
53、wear, rotational speed variations, fixturing variations, raw material changes and contamination level changesIf these sources of variation are small, independent and equally likely to be positive or negative about the mean value, the length will closely approximate a normal distribution.第56页/共96页 dx
54、xfxXPxFx)(Cumulative Distribution Function xforexfx22121mNormal DistributionProbability Density FunctionNormal Distributionaa0.5dxexx22121m第57页/共96页 First introduced by French mathematician Abraham DeMoivre in 1733. Made famous in 1809 by German mathematician K.F. Gauss when he also developed a norm
55、al distribution independently and used it in his study of astronomy. As a result, it is also known as the Gaussian distribution.Karl Friedrich GausslDuring mid to late nineteenth century, many statisticians believed that it was “normal” for most well-behaved data to follow this curve, hence the “nor
56、mal distribution”.第58页/共96页A normal distribution can be completely described by knowing only the:Mean (m)Variance (2)Some Properties of the Normal DistributionDistribution OneDistribution TwoDistribution ThreeWhat is the difference between the 3 normal distributions?X N(m, 2)1Parameters of the distr
57、ibution第59页/共96页ANormal(mA,A)BNormal(mB,B)ANormal(mA,A)BNormal(mB,B)ANormal(mA,A)BNormal(mB,B)What is the difference between process A & B for each case?第60页/共96页The mean, median and mode all coincide at the same value m. There is perfect symmetry.Some Properties of the Normal Distribution+ - Do
58、es it mean that any data setwhich has mean, median and modeat the same value will automaticallybe a normal distribution?MeanMedianMode2第61页/共96页The area under sections of the curve can be used to estimate the probability of a certain “event” occurring:Some Properties of the Normal DistributionPoint
59、of Inflection1+ - 68.27%95.45%99.73%m +/- 3 is often referred to as the width of a normal distribution3第62页/共96页Lets compute the cumulative probabilities of the following distributions:Some Properties of the Normal Distribution+ - m = 3.5 = 0.61.8+ - 20.0m = 16.6 = 2.8+ - m = -1.5 = 0.9-2.80.5F(1.8)
60、 = P(X 20.0) = 1 F(20.0)?P(-2.8 X 1) = 1 P(T1) = 1 0.3297(b) P(1T2) = P(T2) P(T1) = 0.5507 0.3297Example 3 (contd)第72页/共96页Exponential DistributionExponential and PoissonlThe exponential and Poisson distributions are related:In a Poisson process (recall the 10m roll sheetmetal inspection), we observe the nu
温馨提示
- 1. 本站所有资源如无特殊说明,都需要本地电脑安装OFFICE2007和PDF阅读器。图纸软件为CAD,CAXA,PROE,UG,SolidWorks等.压缩文件请下载最新的WinRAR软件解压。
- 2. 本站的文档不包含任何第三方提供的附件图纸等,如果需要附件,请联系上传者。文件的所有权益归上传用户所有。
- 3. 本站RAR压缩包中若带图纸,网页内容里面会有图纸预览,若没有图纸预览就没有图纸。
- 4. 未经权益所有人同意不得将文件中的内容挪作商业或盈利用途。
- 5. 人人文库网仅提供信息存储空间,仅对用户上传内容的表现方式做保护处理,对用户上传分享的文档内容本身不做任何修改或编辑,并不能对任何下载内容负责。
- 6. 下载文件中如有侵权或不适当内容,请与我们联系,我们立即纠正。
- 7. 本站不保证下载资源的准确性、安全性和完整性, 同时也不承担用户因使用这些下载资源对自己和他人造成任何形式的伤害或损失。
最新文档
- 北京印刷学院《竹木雕技法》2023-2024学年第一学期期末试卷
- 业务部年度工作计划9篇
- 二零二五年度企业固定资产抵押贷款合同样本3篇
- 2024年樘木项目可行性研究报告
- 装载机租赁合同
- 二手车买卖合同
- 2024年眼镜角套项目可行性研究报告
- 2025年度智慧CRM系统升级版:助力企业销售合同高效管理的智能解决方案2篇
- 2025版空场地租赁合同范本-租赁合同解除后的处理83篇
- 北京信息职业技术学院《电视栏目专题与制作》2023-2024学年第一学期期末试卷
- 医药销售培训课程
- 2022-2023学年北京市海淀区七年级(上)期末语文试卷
- 膝关节炎阶梯治疗
- 设备日常维护及保养培训
- 行业背景、经济运行情况及产业未来发展趋势分析
- 配电室维护协议书
- 2024年度工作总结模板简约干练风格
- 2024年广东省第一次普通高中学业水平合格性考试历史试卷(解析版)
- 部编版一年级上册语文期末试题含答案
- 2025届东莞东华高级中学高一生物第一学期期末考试试题含解析
- 新疆巴音郭楞蒙古自治州库尔勒市2024-2025学年高一生物上学期期末考试试题
评论
0/150
提交评论