
Understanding Social Media with Machine Learning

Xiaojin Zhu
jerryzhu@
Department of Computer Sciences
University of Wisconsin–Madison, USA

CCF/ADL Beijing 2013

Outline

1. Spatio-Temporal Signal Recovery from Social Media
2. Machine Learning Basics
   - Probability
   - Statistical Estimation
   - Decision Theory
   - Graphical Models
   - Regularization
   - Stochastic Processes
3. Socioscope: A Probabilistic Model for Social Media
4. Case Study: Roadkill

Part 1: Spatio-Temporal Signal Recovery from Social Media

Spatio-temporal Signal: When, Where, How Much

- Direct instrumental sensing is difficult and expensive.

Humans as Sensors

- Not "hot trend" discovery: we know what event we want to monitor.
- Not natural language processing for social media: we are given a reliable text classifier for "hit".
- Our task: precisely estimating a spatiotemporal intensity function $f_{st}$ of a pre-defined target phenomenon.

Challenges of Using Humans as Sensors

- A keyword doesn't always mean the event:
  "I was just told I look like dead crow."
  "Don't blame me if one day I treat you like a dead crow."
- Human sensors aren't under our control.
- Location stamps may be erroneous or missing:
  - 3% have GPS coordinates: (-98.24, 23.22)
  - 47% have a valid user profile location: Bristol, UK; New York
  - 50% don't have valid location information: "Hogwarts", "In the traffic..blah", "Sitting On A Taco"

Problem Definition

- Input: a list of time and location stamps of the target posts.
- Output: $f_{st}$, the intensity of the target phenomenon at location s (e.g., New York) and time t (e.g., 0-1am).

Why Simple Estimation is Bad

- Naive estimate: $f_{st} = x_{st}$, the count of target posts in bin (s,t).
- Justification: MLE of the model $x \sim \mathrm{Poisson}(f)$.
- However:
  - Population bias: assume $f_{st} = f_{s't'}$; if there are more users in (s,t), then $x_{st} > x_{s't'}$.
  - Imprecise location: posts without a location stamp, noisy user profile locations.
  - Zero/low counts: if we don't see tweets from Antarctica, are there no penguins there?
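A minimal numpy sketch of the population-bias point (an added example, not from the slides; the per-user posting rate and the 100x population gap are made-up numbers): two bins share the same true intensity, yet the raw counts, which are exactly the per-bin Poisson MLE, differ wildly.

```python
import numpy as np

rng = np.random.default_rng(0)

f_true = 3.0                   # same target intensity in both bins
users = np.array([10, 1000])   # but populations differ 100x (made up)
rate_per_user = 0.01 * f_true  # hypothetical per-user posting rate

# Counts in each (s,t) bin; the Poisson MLE from a single count x is x
# itself, so the naive estimate f_st = x_st inherits the population bias.
counts = rng.poisson(rate_per_user * users)
print("raw counts (naive f_st):", counts)
```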

Part 2: Machine Learning Basics

Machine Learning Basics: Probability

Probability

- The probability of a discrete random variable A taking the value a is $P(A=a) \in [0,1]$. Sometimes written as $P(a)$ when there is no danger of confusion.
- Normalization: $\sum_a P(A=a) = 1$.
- Joint probability $P(A=a, B=b) = P(a,b)$: the two events both happen at the same time.
- Marginalization: $P(A=a) = \sum_b P(a,b)$.
- The product rule: $P(a,b) = P(a)P(b|a) = P(b)P(a|b)$.

95MachineLearningBasicsProbabiBayes

rule

P(a|b)

=P(b|a)P(a).In

general,

P(a|b,C)

=P(b|C)Rp(D|✓)p(✓)d✓

the

evidence,Machine

Learning

BasicsProbabilityBayes

RuleP(b)

P(b|a,C)P(a|C)where

C

can

be

one

or

morerandom

variables.Bayesian

approach:

when

is

model

parameter,

D

is

observed

data,we

havep(✓|D)

=p(D|✓)p(✓)

p(D),Rp(D|✓)d✓

6=

1),IIIIp(✓)

is

the

prior,p(D|✓)

the

likelihood

function

(of

✓,

not

normalized:p(D)

=p(✓|D)

the

posterior.Zhu

(U

Wisconsin)Understanding

Social

MediaCCF/ADL

Beijing

201313

/
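As a concrete illustration of $p(\theta|D) \propto p(D|\theta)\,p(\theta)$, a minimal grid-approximation sketch (an added example, not from the deck) for the bias of a coin:

```python
import numpy as np

theta = np.linspace(0.01, 0.99, 99)          # candidate parameter values
prior = np.full_like(theta, 1 / len(theta))  # uniform prior p(theta)

heads, flips = 7, 10
likelihood = theta**heads * (1 - theta)**(flips - heads)  # p(D|theta)

evidence = np.sum(likelihood * prior)      # p(D), the normalizer
posterior = likelihood * prior / evidence  # Bayes rule
print("posterior mean of theta:", np.sum(theta * posterior))
```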

Independence

- The product rule simplifies to $P(a,b) = P(a)P(b)$ iff A and B are independent.
- Equivalently, $P(a|b) = P(a)$ and $P(b|a) = P(b)$.

Probability Density

- A continuous random variable X has a probability density function (pdf) $p(x) \ge 0$.
- $p(x) > 1$ is possible! Integrates to 1: $\int_{-\infty}^{\infty} p(x)\,dx = 1$.
- $P(x_1 < X < x_2) = \int_{x_1}^{x_2} p(x)\,dx$.
- Marginalization: $p(x) = \int_{-\infty}^{\infty} p(x,y)\,dy$.

Expectation and Variance

- The expectation ("mean" or "average") of a function f under the probability distribution P is
  $E_P[f] = \sum_a P(a) f(a)$, or $E_p[f] = \int_x p(x) f(x)\,dx$.
- In particular, if $f(x) = x$, this is the mean of the random variable X.
- The variance of f is $\mathrm{Var}(f) = E[(f(x) - E[f(x)])^2] = E[f(x)^2] - E[f(x)]^2$.
- The standard deviation is $\mathrm{std}(f) = \sqrt{\mathrm{Var}(f)}$.
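A quick Monte Carlo check of the identity $\mathrm{Var}(f) = E[f(x)^2] - E[f(x)]^2$ with $f(x) = x$ (an added example):

```python
import numpy as np

rng = np.random.default_rng(0)
x = rng.normal(loc=2.0, scale=3.0, size=1_000_000)

var_direct = ((x - x.mean()) ** 2).mean()     # E[(x - E[x])^2]
var_identity = (x ** 2).mean() - x.mean() ** 2
print(var_direct, var_identity)               # both close to 9 = 3^2
```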

Multivariate Statistics

- When x, y are vectors, $E[x]$ is the mean vector.
- $\mathrm{Cov}(x,y)$ is the covariance matrix with (i,j)-th entry $\mathrm{Cov}(x_i, y_j)$:
  $\mathrm{Cov}(x,y) = E_{x,y}[(x - E[x])(y - E[y])^\top] = E_{x,y}[xy^\top] - E[x]E[y]^\top$

Some Discrete Distributions

- Point mass: X has a point mass at a if $P(X = a) = 1$.
- Binomial, with parameters n (number of trials) and p (head probability):
  $f(x) = \binom{n}{x} p^x (1-p)^{n-x}$ for $x = 0, 1, \ldots, n$, and 0 otherwise.
- Bernoulli: Binomial with n = 1.
- Multinomial, with $p = (p_1, \ldots, p_d)^\top$ (a d-sided die):
  $f(x) = \binom{n}{x_1, \ldots, x_d} \prod_{k=1}^{d} p_k^{x_k}$ if $\sum_{k=1}^{d} x_k = n$, and 0 otherwise.

More Discrete Distributions

- Poisson: $X \sim \mathrm{Poisson}(\lambda)$ if $f(x) = \frac{\lambda^x e^{-\lambda}}{x!}$ for $x = 0, 1, 2, \ldots$
- $\lambda$ is the rate or intensity parameter; mean: $\lambda$, variance: $\lambda$.
- If $X_1 \sim \mathrm{Poisson}(\lambda_1)$ and $X_2 \sim \mathrm{Poisson}(\lambda_2)$, then $X_1 + X_2 \sim \mathrm{Poisson}(\lambda_1 + \lambda_2)$.
- This is a distribution on unbounded counts, with a probability mass function "hump" (mode at $\lceil \lambda \rceil - 1$).
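The additivity of independent Poissons is easy to check empirically; a sketch (an added example using scipy.stats; the rates 2 and 5 are arbitrary):

```python
import numpy as np
from scipy import stats

rng = np.random.default_rng(0)
lam1, lam2 = 2.0, 5.0

s = rng.poisson(lam1, 100_000) + rng.poisson(lam2, 100_000)
print("empirical mean, var:", s.mean(), s.var())  # both near 7
print("P(S=7), model vs sample:",
      stats.poisson.pmf(7, lam1 + lam2), (s == 7).mean())
```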

Some Continuous Distributions

- Gaussian (Normal): $X \sim N(\mu, \sigma^2)$ with parameters $\mu \in \mathbb{R}$ (the mean) and $\sigma^2$ (the variance):
  $f(x) = \frac{1}{\sqrt{2\pi}\,\sigma} \exp\left(-\frac{(x-\mu)^2}{2\sigma^2}\right)$
- $\sigma$ is the standard deviation.
- If $\mu = 0$ and $\sigma = 1$, X has a standard normal distribution.
- If $X \sim N(\mu, \sigma^2)$, then $Z = (X - \mu)/\sigma$ has a standard normal distribution.
- If $X_i \sim N(\mu_i, \sigma_i^2)$ independently, then $\sum_i X_i \sim N\left(\sum_i \mu_i, \sum_i \sigma_i^2\right)$.

- Multivariate Gaussian: let $x, \mu \in \mathbb{R}^d$ and $\Sigma \in S_{+}^{d}$, a symmetric, positive definite matrix of size $d \times d$. Then $X \sim N(\mu, \Sigma)$ with pdf
  $f(x) = \frac{1}{(2\pi)^{d/2} |\Sigma|^{1/2}} \exp\left(-\frac{1}{2}(x-\mu)^\top \Sigma^{-1} (x-\mu)\right)$,
  where $|\Sigma|$ is the determinant of $\Sigma$ and $\Sigma^{-1}$ its inverse.

Marginal and Conditional of Gaussian

If two (groups of) variables x, y are jointly Gaussian:

$\begin{pmatrix} x \\ y \end{pmatrix} \sim N\left( \begin{pmatrix} \mu_x \\ \mu_y \end{pmatrix}, \begin{pmatrix} A & C \\ C^\top & B \end{pmatrix} \right) \qquad (1)$

- (Marginal) $x \sim N(\mu_x, A)$
- (Conditional) $y \mid x \sim N\left(\mu_y + C^\top A^{-1}(x - \mu_x),\; B - C^\top A^{-1} C\right)$
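A minimal sketch of the conditional formula (an added example with made-up scalar blocks, so $C^\top A^{-1}$ reduces to $C/A$):

```python
mu_x, mu_y = 0.0, 1.0
A, C, B = 2.0, 1.2, 1.5   # Var(x), Cov(x,y), Var(y): made-up values

def conditional_y_given_x(x):
    """Mean and variance of y | x, per the slide's formula."""
    mean = mu_y + (C / A) * (x - mu_x)   # mu_y + C^T A^{-1} (x - mu_x)
    var = B - (C / A) * C                # B - C^T A^{-1} C
    return mean, var

print(conditional_y_given_x(1.0))  # (1.6, 0.78)
```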

More Continuous Distributions

- The Gamma function: $\Gamma(\alpha) = \int_0^\infty x^{\alpha-1} e^{-x}\,dx$ with $\alpha > 0$.
  Generalizes the factorial: $\Gamma(n) = (n-1)!$ when n is a positive integer; $\Gamma(\alpha + 1) = \alpha\,\Gamma(\alpha)$ for $\alpha > 0$.
- The Gamma distribution, with shape parameter $\alpha > 0$ and scale parameter $\beta > 0$:
  $f(x) = \frac{1}{\Gamma(\alpha)\,\beta^{\alpha}}\, x^{\alpha-1} e^{-x/\beta}, \quad x > 0$
- Conjugate prior for the Poisson rate.
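Conjugacy here means the posterior stays in the Gamma family. A one-line derivation (a standard result, not spelled out on the slide): with a $\mathrm{Gamma}(\alpha, \beta)$ prior on the Poisson rate $\lambda$ and counts $x_1, \ldots, x_n$,

$p(\lambda \mid x_{1:n}) \propto \left(\lambda^{\sum_i x_i} e^{-n\lambda}\right)\left(\lambda^{\alpha-1} e^{-\lambda/\beta}\right) = \lambda^{\alpha + \sum_i x_i - 1}\, e^{-\lambda\,(n + 1/\beta)}$

so the posterior is $\mathrm{Gamma}\left(\alpha + \sum_i x_i,\ (n + 1/\beta)^{-1}\right)$ in the scale parametrization above.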

Machine Learning Basics: Statistical Estimation

Parametric Models

- A statistical model H is a set of distributions. In machine learning, we call H the hypothesis space.
- A parametric model can be parametrized by a finite number of parameters: $f(x) \equiv f(x;\theta)$ with parameter $\theta \in \Theta \subseteq \mathbb{R}^d$:
  $H = \{ f(x;\theta) : \theta \in \Theta \}$,
  where $\Theta$ is the parameter space.

- We denote the expectation $E_\theta(g) = \int_x g(x) f(x;\theta)\,dx$.
- $E_\theta$ means $E_{x \sim f(x;\theta)}$, not an expectation over different $\theta$'s.
- "All (parametric) models are wrong. Some are more useful than others."

Nonparametric Models

- A nonparametric model cannot be parametrized by a fixed number of parameters; model complexity grows indefinitely with sample size.
- Example: $H = \{P : \mathrm{Var}_P(X) < \infty\}$. Given iid data $x_1, \ldots, x_n$, the optimal estimator of the mean is again $\frac{1}{n}\sum_i x_i$.
- Nonparametric models make weaker assumptions and thus are preferred. But parametric models converge faster and are more practical.

Estimation

- A (point) estimator $\hat\theta_n$ is a function of the data $X_1, \ldots, X_n$ that attempts to estimate a parameter $\theta$. This is the "learning" in machine learning!
- Example: in classification, $X_i = (x_i, y_i)$ and $\hat\theta_n$ is the learned model.
- Consistent estimators learn the correct model with more training data, eventually.

Bias and Standard Error

- The bias of the estimator is $\mathrm{bias}(\hat\theta_n) = E_\theta(\hat\theta_n) - \theta$.
  $E_\theta$ is w.r.t. the joint distribution $f(x_1, \ldots, x_n; \theta) = \prod_{i=1}^{n} f(x_i; \theta)$.
- An estimator is unbiased if $\mathrm{bias}(\hat\theta_n) = 0$.
- The standard error of an estimator is $\mathrm{se}(\hat\theta_n) = \sqrt{\mathrm{Var}_\theta(\hat\theta_n)}$.
- Example: let $\hat\mu = \frac{1}{n}\sum_i x_i$, where $x_i \sim N(0,1)$. The standard deviation of each $x_i$ is 1 regardless of n. In contrast, $\mathrm{se}(\hat\mu) = 1/\sqrt{n} = n^{-\frac{1}{2}}$, which decreases with n.
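The example is easy to verify by simulation; a sketch (an added example):

```python
import numpy as np

rng = np.random.default_rng(0)

# se(mu_hat) = 1/sqrt(n), while each x_i keeps standard deviation 1.
for n in [10, 100, 1000]:
    mu_hats = rng.normal(0, 1, size=(20_000, n)).mean(axis=1)
    print(n, mu_hats.std(), 1 / np.sqrt(n))  # empirical vs. 1/sqrt(n)
```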

MSE

- The mean squared error of an estimator is $\mathrm{mse}(\hat\theta_n) = E_\theta\left((\hat\theta_n - \theta)^2\right)$.
- Bias-variance decomposition:
  $\mathrm{mse}(\hat\theta_n) = \mathrm{bias}^2(\hat\theta_n) + \mathrm{se}^2(\hat\theta_n) = \mathrm{bias}^2(\hat\theta_n) + \mathrm{Var}_\theta(\hat\theta_n)$
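For completeness, the standard derivation (not spelled out on the slide). Writing $m = E_\theta \hat\theta_n$, the cross term $2\,E_\theta[\hat\theta_n - m]\,(m - \theta)$ vanishes because $E_\theta[\hat\theta_n - m] = 0$, so

$E_\theta\left[(\hat\theta_n - \theta)^2\right] = E_\theta\left[(\hat\theta_n - m)^2\right] + (m - \theta)^2 = \mathrm{Var}_\theta(\hat\theta_n) + \mathrm{bias}^2(\hat\theta_n)$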

Maximum Likelihood

- Let $x_1, \ldots, x_n \sim f(x;\theta)$ where $\theta \in \Theta$. The likelihood function is
  $L_n(\theta) = f(x_1, \ldots, x_n; \theta) = \prod_{i=1}^{n} f(x_i; \theta)$
- The log likelihood function is $\ell_n(\theta) = \log L_n(\theta)$.
- The maximum likelihood estimator (MLE) is
  $\hat\theta_n = \mathrm{argmax}_{\theta \in \Theta}\, L_n(\theta) = \mathrm{argmax}_{\theta \in \Theta}\, \ell_n(\theta)$

MLE Examples

- The MLE for p(head) from n coin flips is count(head)/n.
- The MLE for $X_i \sim N(\mu, \sigma^2)$ is $\hat\mu = \frac{1}{n}\sum_i X_i$ and $\hat\sigma^2 = \frac{1}{n}\sum_i (X_i - \hat\mu)^2$.
- The MLE does not always agree with intuition. The MLE for $X_1, \ldots, X_n \sim \mathrm{uniform}(0,\theta)$ is $\hat\theta = \max(X_1, \ldots, X_n)$.
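A sketch for the uniform case (an added example), with a comment on why the maximum is the argmax:

```python
import numpy as np

rng = np.random.default_rng(0)
theta_true = 4.2
x = rng.uniform(0, theta_true, size=50)

# Any theta < max(x) gives likelihood 0, and for theta >= max(x) the
# likelihood (1/theta)^n is decreasing in theta, so argmax = max(x).
theta_mle = x.max()
print(theta_mle)  # slightly below theta_true: this MLE is biased low
```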

Properties of MLE

- When H is identifiable, under certain conditions (see Wasserman) the MLE $\hat\theta_n$ converges in probability to the true parameter $\theta$. That is, the MLE is consistent.
- Asymptotic normality: let $\mathrm{se} = \sqrt{1/I_n(\theta)}$, where $I_n(\theta)$ is the Fisher information; then $(\hat\theta_n - \theta)/\mathrm{se} \rightsquigarrow N(0,1)$.
- The MLE is asymptotically efficient (achieves the Cramér-Rao lower bound): "best" among unbiased estimators.

Frequentist Statistics

- Probability refers to limiting relative frequency. Data are random.
- Estimators are random because they are functions of data.
- Parameters are fixed, unknown constants not subject to probabilistic statements.
- Procedures are subject to probabilistic statements; for example, 95% confidence intervals trap the true parameter value 95% of the time.
- Classifiers, even learned with deterministic procedures, are random because the training set is random. The PAC bound is frequentist.
- Most procedures in machine learning are frequentist methods.

Bayesian Statistics

- Probability refers to degree of belief.
- Inference about a parameter $\theta$ is done by producing a probability distribution on it.
- Start with a prior distribution $p(\theta)$. The likelihood function is $p(x \mid \theta)$, a function of $\theta$, not x.
- After observing data x, apply Bayes rule to obtain the posterior
  $p(\theta \mid x) = \frac{1}{Z}\, p(x \mid \theta)\, p(\theta)$,
  where Z is the evidence.
- Prediction by integrating parameters out:
  $p(x \mid \mathrm{Data}) = \int p(x \mid \theta)\, p(\theta \mid \mathrm{Data})\, d\theta$

Frequentist vs. Bayesian in Machine Learning

- Frequentists produce a point estimate $\hat\theta$ from Data, and predict with $p(x \mid \hat\theta)$.
- Bayesians predict by integrating over $\theta$'s.
- Bayesian integration is often intractable; one needs either "nice" distributions or approximations.
- The maximum a posteriori (MAP) estimate $\theta_{\mathrm{MAP}} = \mathrm{argmax}_\theta\, p(\theta \mid x)$ is a point estimate, and not Bayesian.

Machine Learning Basics: Decision Theory

Comparing Estimators

- Training set $D = (x_1, \ldots, x_n) \sim p(x;\theta)$.
- Learned model: $\hat\theta \equiv \hat\theta(D)$, an estimator of $\theta$ based on data D.
- Loss function $L(\theta, \hat\theta) : \Theta \times \Theta \mapsto \mathbb{R}^+$, e.g.
  - squared loss $L(\theta, \hat\theta) = (\theta - \hat\theta)^2$
  - 0-1 loss: $L(\theta, \hat\theta) = 0$ if $\theta = \hat\theta$, and 1 if $\theta \neq \hat\theta$
  - KL loss $L(\theta, \hat\theta) = \int p(x;\theta) \log \frac{p(x;\theta)}{p(x;\hat\theta)}\, dx$
- Since D is random, both $\hat\theta(D)$ and $L(\theta, \hat\theta)$ are random variables.

Risk

- The risk $R(\theta, \hat\theta)$ is the expected loss
  $R(\theta, \hat\theta) = E_D[L(\theta, \hat\theta(D))]$,
  where $E_D$ averages over training sets D sampled from the true $\theta$.
- The risk is the "average training set" behavior of a learning algorithm when the world is $\theta$. Not computable: we don't know which $\theta$ the world is in.
- Example: a smart learning algorithm $\hat\theta_1 = \frac{1}{n}\sum_i x_i$ and a dumb one $\hat\theta_2 = 3.14$. Assume squared loss. Then $R(\theta, \hat\theta_1) = 1/n$ (hint: variance), while $R(\theta, \hat\theta_2) = E_D(\theta - 3.14)^2 = (\theta - 3.14)^2$.
- However, for tasks with $\theta \in (3.14 - 1/\sqrt{n},\, 3.14 + 1/\sqrt{n})$, the dumb algorithm is better.
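A Monte Carlo sketch of the comparison (an added example; it assumes the $x_i \sim N(\theta, 1)$ reading of the slide's example):

```python
import numpy as np

rng = np.random.default_rng(0)
n, trials = 10, 20_000

def risks(theta):
    # Risk under squared loss: average over many training sets D.
    D = rng.normal(theta, 1, size=(trials, n))
    smart = D.mean(axis=1)        # theta_hat_1: the sample mean
    dumb = np.full(trials, 3.14)  # theta_hat_2: always answers 3.14
    return ((smart - theta)**2).mean(), ((dumb - theta)**2).mean()

for theta in [0.0, 3.0, 3.14]:
    print(theta, risks(theta))  # the dumb rule wins only near 3.14
```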

Minimax Estimator

- The maximum risk is $R_{\max}(\hat\theta) = \sup_\theta R(\theta, \hat\theta)$.
- The minimax estimator $\hat\theta_{\mathrm{minimax}}$ minimizes the maximum risk:
  $\hat\theta_{\mathrm{minimax}} = \mathrm{arg}\inf_{\hat\theta} \sup_\theta R(\theta, \hat\theta)$,
  where the infimum is over all estimators $\hat\theta$.
- The minimax estimator is the "best" in guarding against the worst possible world.

Machine Learning Basics: Graphical Models

The Envelope Quiz

- $P(E=1) = P(E=0) = 1/2$
- $P(B=r \mid E=1) = 1/2$, $P(B=r \mid E=0) = 0$
- Having drawn a black ball, is $P(E=1 \mid B=b)$ still 1/2?
  $P(E=1 \mid B=b) = \frac{P(B=b \mid E=1)\, P(E=1)}{P(B=b)} = \frac{1/2 \cdot 1/2}{1/2 \cdot 1/2 + 1 \cdot 1/2} = 1/3$
- Switch.
- The graphical model: $E \rightarrow B$
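The posterior can be double-checked by brute-force enumeration of the tiny joint (an added example):

```python
from itertools import product

p_E = {1: 0.5, 0: 0.5}
p_B_given_E = {1: {"r": 0.5, "b": 0.5}, 0: {"r": 0.0, "b": 1.0}}

# Joint p(E, B) from the chain rule p(E) p(B|E).
joint = {(e, b): p_E[e] * p_B_given_E[e][b]
         for e, b in product(p_E, ["r", "b"])}

p_b = sum(p for (e, b), p in joint.items() if b == "b")
print(joint[(1, "b")] / p_b)  # P(E=1 | B=b) = 1/3
```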

Probabilistic Reasoning

- The world is reduced to a set of random variables $x_1, \ldots, x_n$, e.g. features $(x_1, \ldots, x_{n-1})$ and a label $x_n$.
- Inference: given the joint distribution $p(x_1, \ldots, x_n)$, compute
  $p(x_n \mid x_1, \ldots, x_{n-1}) = \frac{p(x_1, \ldots, x_{n-1}, x_n)}{\sum_v p(x_1, \ldots, x_{n-1}, x_n = v)}$
- Learning: estimate $p(x_1, \ldots, x_n)$ from training data $X^{(1)}, \ldots, X^{(N)}$, where $X^{(i)} = (x_1^{(i)}, \ldots, x_n^{(i)})$.

/

95MachineLearningBasicsGraphicMachine

Learning

BasicsGraphical

ModelsIt

is

di

cult

to

reason

with

uncertainty

joint

distribution

p(x1,...,xn)IIexponential

na¨ıve

storage

(2n

for

binary

r.v.)hard

to

interpret

(conditional

independence)

I

Often

can’t

a↵ord

to

do

it

by

brute

forceIf

p(x1,...,xn)

not

given,

estimate

it

from

dataIOften

can’t

a↵ord

to

do

it

by

brute

forceZhu

(U

Wisconsin)Understanding

Social

MediaCCF/ADL

Beijing

201344

/

Graphical Models

- Graphical models: efficient representation, inference, and learning on $p(x_1, \ldots, x_n)$, exactly or approximately.
- Two main "flavors":
  - directed graphical models = Bayesian networks (often frequentist instead of Bayesian)
  - undirected graphical models = Markov random fields
- Key idea: make conditional independence explicit.

Bayesian Networks

- Directed graphical models are also called Bayesian networks.
- A directed graph has nodes $X = (x_1, \ldots, x_n)$, some of them connected by directed edges $x_i \to x_j$.
- A cycle is a directed path $x_1 \to \ldots \to x_k$ where $x_1 = x_k$. A directed acyclic graph (DAG) contains no cycles.
- A Bayesian network on the DAG is the family of distributions satisfying
  $\{\, p \mid p(X) = \prod_i p(x_i \mid \mathrm{Pa}(x_i)) \,\}$,
  where $\mathrm{Pa}(x_i)$ is the set of parents of $x_i$.
- $p(x_i \mid \mathrm{Pa}(x_i))$ is the conditional probability distribution (CPD) at $x_i$. By specifying the CPDs for all i, we specify a particular distribution $p(X)$.

Example: Alarm (binary variables)

- Graph: B and E are parents of A; A is the parent of J and M.
- CPDs:
  - P(B) = 0.001, P(E) = 0.002
  - P(A | B, E) = 0.95, P(A | B, ~E) = 0.94, P(A | ~B, E) = 0.29, P(A | ~B, ~E) = 0.001
  - P(J | A) = 0.9, P(J | ~A) = 0.05
  - P(M | A) = 0.7, P(M | ~A) = 0.01
- Example query:
  P(B, ~E, A, J, ~M) = P(B) P(~E) P(A | B, ~E) P(J | A) P(~M | A)
                     = 0.001 × (1 − 0.002) × 0.94 × 0.9 × (1 − 0.7) ≈ 0.000253
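The factored joint is directly computable; a sketch reproducing the number above (an added example):

```python
# CPDs of the alarm network, keyed by parent values (1 = true, 0 = false).
P_B, P_E = 0.001, 0.002
P_A = {(1, 1): 0.95, (1, 0): 0.94, (0, 1): 0.29, (0, 0): 0.001}
P_J = {1: 0.9, 0: 0.05}   # P(J = 1 | A)
P_M = {1: 0.7, 0: 0.01}   # P(M = 1 | A)

# P(B, ~E, A, J, ~M) = P(B) P(~E) P(A | B,~E) P(J | A) P(~M | A)
p = P_B * (1 - P_E) * P_A[(1, 0)] * P_J[1] * (1 - P_M[1])
print(p)  # ~0.000253
```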

Example: Naive Bayes

- $p(y, x_1, \ldots, x_d) = p(y) \prod_{i=1}^{d} p(x_i \mid y)$
- Used extensively in natural language processing.
- [Figure: the naive Bayes graphical model, with label y pointing to features x_1, ..., x_d; plate representation on the right.]
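A minimal classification sketch with the factored joint (an added example; the CPD numbers are made up):

```python
import numpy as np

p_y = np.array([0.6, 0.4])                 # p(y) for classes 0 and 1
p_x1_given_y = np.array([[0.9, 0.2, 0.3],  # p(x_i = 1 | y = 0)
                         [0.1, 0.7, 0.8]]) # p(x_i = 1 | y = 1)

def posterior(x):
    """p(y | x) from p(y) prod_i p(x_i | y), normalized over y."""
    lik = np.prod(np.where(x, p_x1_given_y, 1 - p_x1_given_y), axis=1)
    joint = p_y * lik
    return joint / joint.sum()

print(posterior(np.array([0, 1, 1])))  # most mass on y = 1
```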
