147
BN definition BN Structure learning BNs and Causality BNs and Ontologies SemCaDo algorithm 2OC a Apprentissage de la structure des r´ eseaux bay´ esiens : ´ etat de l’art et int´ egration de connaissances s´ emantiques Montassar Ben Messaoud 12 , Mouna Ben Ishak 12 , Philippe Leray 2 , Nahla Ben Amor 1 1 LARODEC, ISG, University of Tunis, Tunisia 2 Knowledge and Decision Team, LINA UMR 6241, Nantes, France Journ´ ees MSTGA 7 nov. 2011, Toulouse, France Montassar Ben Messaoud, Mouna Ben Ishak, Philippe Leray, Nahla Ben Amor Apprentissage RB et Ontologies 1/73

Apprentissage de la structure des réseaux bayésiens : état

  • Upload
    others

  • View
    0

  • Download
    0

Embed Size (px)

Citation preview

Page 1: Apprentissage de la structure des réseaux bayésiens : état

BN definition BN Structure learning BNs and Causality BNs and Ontologies SemCaDo algorithm 2OC approach

Apprentissage de la structure des reseauxbayesiens : etat de l’art et integration de

connaissances semantiques

Montassar Ben Messaoud12, Mouna Ben Ishak12,Philippe Leray2, Nahla Ben Amor1

1LARODEC, ISG, University of Tunis, Tunisia2Knowledge and Decision Team, LINA UMR 6241, Nantes, France

Journees MSTGA7 nov. 2011, Toulouse, France

Montassar Ben Messaoud, Mouna Ben Ishak, Philippe Leray, Nahla Ben AmorApprentissage RB et Ontologies 1/73

Page 2: Apprentissage de la structure des réseaux bayésiens : état

BN definition BN Structure learning BNs and Causality BNs and Ontologies SemCaDo algorithm 2OC approach

Outline

1 BN definition

2 BN Structure learning

3 BNs and Causality

4 BNs and Ontologies

5 SemCaDo algorithm

6 2OC approach

Montassar Ben Messaoud, Mouna Ben Ishak, Philippe Leray, Nahla Ben AmorApprentissage RB et Ontologies 2/73

Page 3: Apprentissage de la structure des réseaux bayésiens : état

BN definition BN Structure learning BNs and Causality BNs and Ontologies SemCaDo algorithm 2OC approach

Bayesian network definition

Definition (Pearl 1985)

A Bayesian network (BN) is defined by

one qualitative description of (conditional) dependences /independences between variables

directed acyclic graph (DAG)one quantitative description of these dependences

conditional probability distributions (CPDs)

Montassar Ben Messaoud, Mouna Ben Ishak, Philippe Leray, Nahla Ben AmorApprentissage RB et Ontologies 3/73

Page 4: Apprentissage de la structure des réseaux bayésiens : état

BN definition BN Structure learning BNs and Causality BNs and Ontologies SemCaDo algorithm 2OC approach

Example

one topological order : B, E , A, R, T (not unique)

R Radio

E Earthquake

A Alarm

B Burglary

T TV

P(Alarm|Burglary,Earthquake)

Burglary,Earthquake = Y,Y Y,N N,Y N,N

Alarm=Y 0.75 0.10 0.99 0.10Alarm=N 0.25 0.90 0.01 0.90

P(TV|Radio)

Radio = Y N

TV=Y 0.99 0.50TV=N 0.01 0.50

P(Radio|Earthquake)

Earthquake = Y N

Radio=Y 0.99 0.01Radio=N 0.01 0.99

P(Burglary)=[0.001 0.999] P(Earthquake)=[0.0001 0.9999]

Montassar Ben Messaoud, Mouna Ben Ishak, Philippe Leray, Nahla Ben AmorApprentissage RB et Ontologies 4/73

Page 5: Apprentissage de la structure des réseaux bayésiens : état

BN definition BN Structure learning BNs and Causality BNs and Ontologies SemCaDo algorithm 2OC approach

Consequence

Chain rule

P(S) = P(S1)× P(S2|S1)× P(S3|S1, S2)× · · · × P(Sn|S1 . . . Sn−1)

Consequence with a BN

P(Si |S1 . . . Si−1) = P(Si |parents(Si )) so

P(S) = Πni=1P(Si |parents(Si ))

The (global) joint probability distribution is decomposed in aproduct of (local) conditional distributions

BN = compact representation of the joint distribution P(S)given some information about dependence relationshipsbetween variables

Montassar Ben Messaoud, Mouna Ben Ishak, Philippe Leray, Nahla Ben AmorApprentissage RB et Ontologies 5/73

Page 6: Apprentissage de la structure des réseaux bayésiens : état

BN definition BN Structure learning BNs and Causality BNs and Ontologies SemCaDo algorithm 2OC approach

Consequence

Chain rule

P(S) = P(S1)× P(S2|S1)× P(S3|S1, S2)× · · · × P(Sn|S1 . . . Sn−1)

Consequence with a BN

P(Si |S1 . . . Si−1) = P(Si |parents(Si )) so

P(S) = Πni=1P(Si |parents(Si ))

The (global) joint probability distribution is decomposed in aproduct of (local) conditional distributions

BN = compact representation of the joint distribution P(S)given some information about dependence relationshipsbetween variables

Montassar Ben Messaoud, Mouna Ben Ishak, Philippe Leray, Nahla Ben AmorApprentissage RB et Ontologies 5/73

Page 7: Apprentissage de la structure des réseaux bayésiens : état

BN definition BN Structure learning BNs and Causality BNs and Ontologies SemCaDo algorithm 2OC approach

Markov equivalence

Definition

B1 and B2 are Markov equivalent iff both describe exactly thesame conditional (in)dependence statements.

Graphical properties

B1 and B2 have the same skeleton, V-structures and inferrededges.

All the equivalent graphs (= equivalence class) can besummarized by one partially directed DAG named CPDAG orEssential Graph

Montassar Ben Messaoud, Mouna Ben Ishak, Philippe Leray, Nahla Ben AmorApprentissage RB et Ontologies 6/73

Page 8: Apprentissage de la structure des réseaux bayésiens : état

BN definition BN Structure learning BNs and Causality BNs and Ontologies SemCaDo algorithm 2OC approach

Markov equivalence

Definition

B1 and B2 are Markov equivalent iff both describe exactly thesame conditional (in)dependence statements.

Graphical properties

B1 and B2 have the same skeleton, V-structures and inferrededges.

All the equivalent graphs (= equivalence class) can besummarized by one partially directed DAG named CPDAG orEssential Graph

Montassar Ben Messaoud, Mouna Ben Ishak, Philippe Leray, Nahla Ben AmorApprentissage RB et Ontologies 6/73

Page 9: Apprentissage de la structure des réseaux bayésiens : état

BN definition BN Structure learning BNs and Causality BNs and Ontologies SemCaDo algorithm 2OC approach

Markov equivalence

A S

T L B

O

X D

A S

T L B

O

X D

Montassar Ben Messaoud, Mouna Ben Ishak, Philippe Leray, Nahla Ben AmorApprentissage RB et Ontologies 7/73

Page 10: Apprentissage de la structure des réseaux bayésiens : état

BN definition BN Structure learning BNs and Causality BNs and Ontologies SemCaDo algorithm 2OC approach

Outline

1 BN definition

2 BN Structure learning

3 BNs and Causality

4 BNs and Ontologies

5 SemCaDo algorithm

6 2OC approach

Montassar Ben Messaoud, Mouna Ben Ishak, Philippe Leray, Nahla Ben AmorApprentissage RB et Ontologies 8/73

Page 11: Apprentissage de la structure des réseaux bayésiens : état

BN definition BN Structure learning BNs and Causality BNs and Ontologies SemCaDo algorithm 2OC approach

BN structure learning

How to build / learn a Bayesian Network ?

1 DAG is known, how to determine the CPDs ?

from experts : knowledge elicitationfrom complete data / incomplete data

2 DAG is unknown, how to determine it (or the CPDAG) ?

from complete data / incomplete datalatent variable discovery ?

Montassar Ben Messaoud, Mouna Ben Ishak, Philippe Leray, Nahla Ben AmorApprentissage RB et Ontologies 9/73

Page 12: Apprentissage de la structure des réseaux bayésiens : état

BN definition BN Structure learning BNs and Causality BNs and Ontologies SemCaDo algorithm 2OC approach

Structure learning is a complex task

Size of the ”solution” space

the number of possible DAGs with n variables issuper-exponential w.r.t n (Robinson 77)

NS(n) =

{1 , n = 0 or 1∑n

i=1(−1)i+1(n

i

)2i(n−1)NS(n − i), n > 1

NS(5) = 29281 NS(10) = 4.2× 1018

an exhaustive search is impossible !

Montassar Ben Messaoud, Mouna Ben Ishak, Philippe Leray, Nahla Ben AmorApprentissage RB et Ontologies 10/73

Page 13: Apprentissage de la structure des réseaux bayésiens : état

BN definition BN Structure learning BNs and Causality BNs and Ontologies SemCaDo algorithm 2OC approach

Structure learning algorithms

How to search a good BN ?

Constraint-based methodsBN = independence model⇒ find CI in data in order to build the DAG

Score-based methodsBN = probabilistic model that must fit data as well as possible⇒ search the DAG space in order to maximize a scoringfunction

Hybrid methods

Montassar Ben Messaoud, Mouna Ben Ishak, Philippe Leray, Nahla Ben AmorApprentissage RB et Ontologies 11/73

Page 14: Apprentissage de la structure des réseaux bayésiens : état

BN definition BN Structure learning BNs and Causality BNs and Ontologies SemCaDo algorithm 2OC approach

Constraint-based methods

How to search a good BN ?

Constraint-based methodsBN = independence model⇒ find CI in data in order to build the DAG

Score-based methodsBN = probabilistic model that must fit data as well as possible⇒ search the DAG space in order to maximize a scoringfunction

Hybrid methods

Montassar Ben Messaoud, Mouna Ben Ishak, Philippe Leray, Nahla Ben AmorApprentissage RB et Ontologies 12/73

Page 15: Apprentissage de la structure des réseaux bayésiens : état

BN definition BN Structure learning BNs and Causality BNs and Ontologies SemCaDo algorithm 2OC approach

Constraint-based methods

Two reference algorithms

Pearl et Verma : IC, IC*

Spirtes, Glymour et Scheines : SGS, PC, CI, FCI

Common principle

build an undirected graph describing direct dependencesbetween variables (χ2 tests)

by adding edges (Pearl et Verma)by deleting edges (SGS)

detect V-structures (from previous statistical tests)

propagate some edge orientation (inferred edges) in order toobtain a CPDAG

Montassar Ben Messaoud, Mouna Ben Ishak, Philippe Leray, Nahla Ben AmorApprentissage RB et Ontologies 13/73

Page 16: Apprentissage de la structure des réseaux bayésiens : état

BN definition BN Structure learning BNs and Causality BNs and Ontologies SemCaDo algorithm 2OC approach

Constraint-based methods

Some inconvenients

reliability of CI test conditionally to several variables with alimited amount of data)

SGS heuristic : if df < N10 , then declare dependence

Combinatorial explosion of the number of tests

PC heuristic : begin with order 0 (XA⊥XB) then order 1(XA⊥XB | XC ), etc ...

Montassar Ben Messaoud, Mouna Ben Ishak, Philippe Leray, Nahla Ben AmorApprentissage RB et Ontologies 14/73

Page 17: Apprentissage de la structure des réseaux bayésiens : état

BN definition BN Structure learning BNs and Causality BNs and Ontologies SemCaDo algorithm 2OC approach

PC algorithm

Step 0 : undirected complete graph

Left : target BN used to generate 5000 samples.

A S

T L B

O

X D

A S

T L B

O

X D

Montassar Ben Messaoud, Mouna Ben Ishak, Philippe Leray, Nahla Ben AmorApprentissage RB et Ontologies 15/73

Page 18: Apprentissage de la structure des réseaux bayésiens : état

BN definition BN Structure learning BNs and Causality BNs and Ontologies SemCaDo algorithm 2OC approach

PC algorithm

Step 1a : delete all order 0 independences discovered

χ2: S⊥A L⊥A B⊥A O⊥A X⊥A D⊥A T⊥S L⊥T O⊥B X⊥B

A S

T L B

O

X D

A S

T L B

O

X D

Montassar Ben Messaoud, Mouna Ben Ishak, Philippe Leray, Nahla Ben AmorApprentissage RB et Ontologies 15/73

Page 19: Apprentissage de la structure des réseaux bayésiens : état

BN definition BN Structure learning BNs and Causality BNs and Ontologies SemCaDo algorithm 2OC approach

PC algorithm

Step 1a : delete all order 1 independences discovered

χ2: T⊥A|O O⊥S |L X⊥S |L B⊥T |S X⊥T |O D⊥T |O ...

A S

T L B

O

X D

A S

T L B

O

X D

Montassar Ben Messaoud, Mouna Ben Ishak, Philippe Leray, Nahla Ben AmorApprentissage RB et Ontologies 15/73

Page 20: Apprentissage de la structure des réseaux bayésiens : état

BN definition BN Structure learning BNs and Causality BNs and Ontologies SemCaDo algorithm 2OC approach

PC algorithm

Step 1a : delete all order 2 independences discovered

χ2: D⊥S |{L,B} X⊥O|{T , L} D⊥O|{T , L}

A S

T L B

O

X D

A S

T L B

O

X D

Montassar Ben Messaoud, Mouna Ben Ishak, Philippe Leray, Nahla Ben AmorApprentissage RB et Ontologies 15/73

Page 21: Apprentissage de la structure des réseaux bayésiens : état

BN definition BN Structure learning BNs and Causality BNs and Ontologies SemCaDo algorithm 2OC approach

PC algorithm

Step 2 : research V-structures

χ2 : one V-structure T → O ← L is discovered

A S

T L B

O

X D

A S

T L B

O

X D

Step 3 : inferred edges

no one in this example

Montassar Ben Messaoud, Mouna Ben Ishak, Philippe Leray, Nahla Ben AmorApprentissage RB et Ontologies 15/73

Page 22: Apprentissage de la structure des réseaux bayésiens : état

BN definition BN Structure learning BNs and Causality BNs and Ontologies SemCaDo algorithm 2OC approach

PC algorithm

From CPDAG to DAG

Orientation of the remaining undirected edges(only constraint : do not create any new V-structure)

A S

T L B

O

X D

A S

T L B

O

X D

Montassar Ben Messaoud, Mouna Ben Ishak, Philippe Leray, Nahla Ben AmorApprentissage RB et Ontologies 15/73

Page 23: Apprentissage de la structure des réseaux bayésiens : état

BN definition BN Structure learning BNs and Causality BNs and Ontologies SemCaDo algorithm 2OC approach

PC algorithm

Obtained DAG versus target one

χ2 test with 5000 samples fails to discoverA→ T , O → X and O → D

A S

T L B

O

X D

A S

T L B

O

X D

Montassar Ben Messaoud, Mouna Ben Ishak, Philippe Leray, Nahla Ben AmorApprentissage RB et Ontologies 15/73

Page 24: Apprentissage de la structure des réseaux bayésiens : état

BN definition BN Structure learning BNs and Causality BNs and Ontologies SemCaDo algorithm 2OC approach

Score-based methods

How to search a good BN ?

Constraint-based methodsBN = independence model⇒ find CI in data in order to build the DAG

Score-based methodsBN = probabilistic model that must fit data as well as possible⇒ search the DAG space in order to maximize a scoringfunction

Hybrid methods

Montassar Ben Messaoud, Mouna Ben Ishak, Philippe Leray, Nahla Ben AmorApprentissage RB et Ontologies 16/73

Page 25: Apprentissage de la structure des réseaux bayésiens : état

BN definition BN Structure learning BNs and Causality BNs and Ontologies SemCaDo algorithm 2OC approach

Notion of score

General principle : Occam razor

Pluralitas non est ponenda sine neccesitate(La pluralite (des notions) ne devrait pas etre posee sansnecessite) plurality should not be posited without necessity

Frustra fit per plura quod potest fieri per pauciora(C’est en vain que l’on fait avec plusieurs ce que l’on peutfaire avec un petit nombre) It is pointless to do with morewhat can be done with fewer

= Parcimony principle : find a model

fitting the data D : likelihood : L(D|θ,B)

the simplest possible : dimension of B : Dim(B)

Montassar Ben Messaoud, Mouna Ben Ishak, Philippe Leray, Nahla Ben AmorApprentissage RB et Ontologies 17/73

Page 26: Apprentissage de la structure des réseaux bayésiens : état

BN definition BN Structure learning BNs and Causality BNs and Ontologies SemCaDo algorithm 2OC approach

Score examples

AIC and BIC

Compromise between likelihood and complexity

Application of AIC (Akaıke 70) and BIC (Schwartz 78) criteria

SAIC (B,D) = log L(D|θMV ,B)− Dim(B)

SBIC (B,D) = log L(D|θMV ,B)− 1

2Dim(B) log N

Bayesian scores : BD, BDe, BDeu

SBD(B,D) = P(B,D) (Cooper et Herskovits 92)

BDe = BD + score equivalence (Heckerman 94)

SBD(B,D) = P(B)n∏

i=1

qi∏j=1

Γ(αij)

Γ(Nij + αij)

ri∏k=1

Γ(Nijk + αijk)

Γ(αijk)

Montassar Ben Messaoud, Mouna Ben Ishak, Philippe Leray, Nahla Ben AmorApprentissage RB et Ontologies 18/73

Page 27: Apprentissage de la structure des réseaux bayésiens : état

BN definition BN Structure learning BNs and Causality BNs and Ontologies SemCaDo algorithm 2OC approach

Score properties

Two important properties

Decomposability

(Global)Score(B,D) =n∑

i=1

(local)score(Xi , pai )

Score equivalence

If two BN B1 and B2 are Markov equivalent thenS(B1,D) = S(B2,D)

Montassar Ben Messaoud, Mouna Ben Ishak, Philippe Leray, Nahla Ben AmorApprentissage RB et Ontologies 19/73

Page 28: Apprentissage de la structure des réseaux bayésiens : état

BN definition BN Structure learning BNs and Causality BNs and Ontologies SemCaDo algorithm 2OC approach

Score properties

Two important properties

Decomposability

(Global)Score(B,D) =n∑

i=1

(local)score(Xi , pai )

Score equivalence

If two BN B1 and B2 are Markov equivalent thenS(B1,D) = S(B2,D)

Montassar Ben Messaoud, Mouna Ben Ishak, Philippe Leray, Nahla Ben AmorApprentissage RB et Ontologies 19/73

Page 29: Apprentissage de la structure des réseaux bayésiens : état

BN definition BN Structure learning BNs and Causality BNs and Ontologies SemCaDo algorithm 2OC approach

Heuristic exploration of search space

Search space and heuristics

espace Brestriction to tree space : Chow&Liu, MWST

DAG with node ordering : K2 algorithmgreedy searchgenetic algorithms, ...

espace Egreedy equivalence search

Montassar Ben Messaoud, Mouna Ben Ishak, Philippe Leray, Nahla Ben AmorApprentissage RB et Ontologies 20/73

Page 30: Apprentissage de la structure des réseaux bayésiens : état

BN definition BN Structure learning BNs and Causality BNs and Ontologies SemCaDo algorithm 2OC approach

Restriction to tree space

Principle

what is the best tree connecting all the nodes,i.e. maximizing a weight defined for each possible edge ?

Answer : maximal weighted spanning tree (MWST)

(Chow et Liu 68) : weight = mutual information :

W (XA,XB) =∑a,b

Nab

Nlog

NabN

Na.N.b

(Heckerman 94) : any local score :

W (XA,XB) = score(XA,Pa(XA) = XB)− score(XA, ∅)

Montassar Ben Messaoud, Mouna Ben Ishak, Philippe Leray, Nahla Ben AmorApprentissage RB et Ontologies 21/73

Page 31: Apprentissage de la structure des réseaux bayésiens : état

BN definition BN Structure learning BNs and Causality BNs and Ontologies SemCaDo algorithm 2OC approach

Restriction to tree space

Principle

what is the best tree connecting all the nodes,i.e. maximizing a weight defined for each possible edge ?

Answer : maximal weighted spanning tree (MWST)

(Chow et Liu 68) : weight = mutual information :

W (XA,XB) =∑a,b

Nab

Nlog

NabN

Na.N.b

(Heckerman 94) : any local score :

W (XA,XB) = score(XA,Pa(XA) = XB)− score(XA, ∅)

Montassar Ben Messaoud, Mouna Ben Ishak, Philippe Leray, Nahla Ben AmorApprentissage RB et Ontologies 21/73

Page 32: Apprentissage de la structure des réseaux bayésiens : état

BN definition BN Structure learning BNs and Causality BNs and Ontologies SemCaDo algorithm 2OC approach

Restriction to tree space

Remarks

MWST returns an undirected tree

this undirected tree = CPDAG of all the directed tree withthis skeleton

Obtain a directed tree by (randomly) choosing one root andorienting the edges with a depth first search over this tree

Montassar Ben Messaoud, Mouna Ben Ishak, Philippe Leray, Nahla Ben AmorApprentissage RB et Ontologies 22/73

Page 33: Apprentissage de la structure des réseaux bayésiens : état

BN definition BN Structure learning BNs and Causality BNs and Ontologies SemCaDo algorithm 2OC approach

Example : obtained DAG vs. target one

A S

T L B

O

X D

A S

T L B

O

X D

MWST can not discover cycles neither V-structures (tree space !)

Montassar Ben Messaoud, Mouna Ben Ishak, Philippe Leray, Nahla Ben AmorApprentissage RB et Ontologies 23/73

Page 34: Apprentissage de la structure des réseaux bayésiens : état

BN definition BN Structure learning BNs and Causality BNs and Ontologies SemCaDo algorithm 2OC approach

Heuristic exploration of search space

Search space and heuristics

espace Brestriction to tree space : Chow&Liu, MWSTDAG with node ordering : K2 algorithm

greedy search

genetic algorithms, ...

espace Egreedy equivalence search

Montassar Ben Messaoud, Mouna Ben Ishak, Philippe Leray, Nahla Ben AmorApprentissage RB et Ontologies 24/73

Page 35: Apprentissage de la structure des réseaux bayésiens : état

BN definition BN Structure learning BNs and Causality BNs and Ontologies SemCaDo algorithm 2OC approach

Greedy search

Principle

Exploration of the search space with traversal operators

add edgeinvert edgedelete edge

and respect the DAG definition (no cycle)

exploration can begin from any given DAG

Montassar Ben Messaoud, Mouna Ben Ishak, Philippe Leray, Nahla Ben AmorApprentissage RB et Ontologies 25/73

Page 36: Apprentissage de la structure des réseaux bayésiens : état

BN definition BN Structure learning BNs and Causality BNs and Ontologies SemCaDo algorithm 2OC approach

Example : obtained DAG vs. target one

A S

T L B

O

X D

A S

T L B

O

X D

start = empty graph. GS result = local optimum :-(

Montassar Ben Messaoud, Mouna Ben Ishak, Philippe Leray, Nahla Ben AmorApprentissage RB et Ontologies 26/73

Page 37: Apprentissage de la structure des réseaux bayésiens : état

BN definition BN Structure learning BNs and Causality BNs and Ontologies SemCaDo algorithm 2OC approach

Example : obtained DAG vs. target one

A S

T L B

O

X D

A S

T L B

O

X D

start = MWST result. GS result is better

Montassar Ben Messaoud, Mouna Ben Ishak, Philippe Leray, Nahla Ben AmorApprentissage RB et Ontologies 26/73

Page 38: Apprentissage de la structure des réseaux bayésiens : état

BN definition BN Structure learning BNs and Causality BNs and Ontologies SemCaDo algorithm 2OC approach

Heuristic exploration of search space

Search space and heuristics

espace Brestriction to tree space : Chow&Liu, MWST

DAG with node ordering : K2 algorithmgreedy searchgenetic algorithms, ...

espace Egreedy equivalence search

Montassar Ben Messaoud, Mouna Ben Ishak, Philippe Leray, Nahla Ben AmorApprentissage RB et Ontologies 27/73

Page 39: Apprentissage de la structure des réseaux bayésiens : état

BN definition BN Structure learning BNs and Causality BNs and Ontologies SemCaDo algorithm 2OC approach

What about changing our search space

Preliminaries

IC/PC result = CPDAG

MWST result = CPDAG

Score-based methods do not distinguish equivalent DAGs

Search in EE = CPDAG space

Better properties : YES

2 equivalents structures = 1 unique structure in EBetter size : NO

E size is quasi similar to DAG space(asymptotic ratio is 3,7 : Gillispie et Perlman 2001)

Montassar Ben Messaoud, Mouna Ben Ishak, Philippe Leray, Nahla Ben AmorApprentissage RB et Ontologies 28/73

Page 40: Apprentissage de la structure des réseaux bayésiens : état

BN definition BN Structure learning BNs and Causality BNs and Ontologies SemCaDo algorithm 2OC approach

Greedy Equivalent Search

Principe (Chickering 2002)

Greedy search in EPhase 1 : add edges until convergence

Phase 2 : delete edges until convergence

Montassar Ben Messaoud, Mouna Ben Ishak, Philippe Leray, Nahla Ben AmorApprentissage RB et Ontologies 29/73

Page 41: Apprentissage de la structure des réseaux bayésiens : état

BN definition BN Structure learning BNs and Causality BNs and Ontologies SemCaDo algorithm 2OC approach

Add edge examples in E

X1

X2

X3

X4

X1

X2

X3

X4

X1

X2

X3

X4

X1

X2

X3

X4

X1

X2

X3

X4

X1

X2

X3

X4

X1

X2

X3

X4

E0

neighborhood of E0

E1

X1

X2

X3

X4

X1

X2

X3

X4

X1

X2

X3

X4

X1

X2

X3

X4

X1

X2

X3

X4

X1

X2

X3

X4

neighborhood of E1

X1

X2

X3

X4

X1

X2

X3

X4

X1

X2

X3

X4

Montassar Ben Messaoud, Mouna Ben Ishak, Philippe Leray, Nahla Ben AmorApprentissage RB et Ontologies 30/73

Page 42: Apprentissage de la structure des réseaux bayésiens : état

BN definition BN Structure learning BNs and Causality BNs and Ontologies SemCaDo algorithm 2OC approach

Score-based methods

How to search a good BN ?

Constraint-based methodsBN = independence model⇒ find CI in data in order to build the DAG

Score-based methodsBN = probabilistic model that must fit data as well as possible⇒ search the DAG space in order to maximize ascoring/fitness function

Hybrid methods

Montassar Ben Messaoud, Mouna Ben Ishak, Philippe Leray, Nahla Ben AmorApprentissage RB et Ontologies 31/73

Page 43: Apprentissage de la structure des réseaux bayésiens : état

BN definition BN Structure learning BNs and Causality BNs and Ontologies SemCaDo algorithm 2OC approach

Hybrid methods = local search methods

Local search and global learning

search one local neighborhood for a given node T

reiterate for each T

learn the global structure with these local informations

which neighborhood ?

PC (T ) : Parents and Children T (without distinction)

MB(T ) : Markov Blanket of T - Parents, children andspouses

Montassar Ben Messaoud, Mouna Ben Ishak, Philippe Leray, Nahla Ben AmorApprentissage RB et Ontologies 32/73

Page 44: Apprentissage de la structure des réseaux bayésiens : état

BN definition BN Structure learning BNs and Causality BNs and Ontologies SemCaDo algorithm 2OC approach

Hybrid methods = local search methods

Local search and global learning

search one local neighborhood for a given node T

reiterate for each T

learn the global structure with these local informations

which neighborhood ?

PC (T ) : Parents and Children T (without distinction)

MB(T ) : Markov Blanket of T - Parents, children andspouses

Montassar Ben Messaoud, Mouna Ben Ishak, Philippe Leray, Nahla Ben AmorApprentissage RB et Ontologies 32/73

Page 45: Apprentissage de la structure des réseaux bayésiens : état

BN definition BN Structure learning BNs and Causality BNs and Ontologies SemCaDo algorithm 2OC approach

Local search identification

Identification of MB(T) or PC(T)

IAMB (Aliferis 2002)

MMPC (Tsamardinos et al. 2003), ...

Hybrid structure learning algorithms

MMHC (Tsamardinos et al. 2006) = MMPC + Greedy search

Montassar Ben Messaoud, Mouna Ben Ishak, Philippe Leray, Nahla Ben AmorApprentissage RB et Ontologies 33/73

Page 46: Apprentissage de la structure des réseaux bayésiens : état

BN definition BN Structure learning BNs and Causality BNs and Ontologies SemCaDo algorithm 2OC approach

Outline

1 BN definition

2 BN Structure learning

3 BNs and Causality

4 BNs and Ontologies

5 SemCaDo algorithm

6 2OC approach

Montassar Ben Messaoud, Mouna Ben Ishak, Philippe Leray, Nahla Ben AmorApprentissage RB et Ontologies 34/73

Page 47: Apprentissage de la structure des réseaux bayésiens : état

BN definition BN Structure learning BNs and Causality BNs and Ontologies SemCaDo algorithm 2OC approach

A BN is not a causal model

Usual BN

A→ B does not imply direct causal relationship between Aand B,only edges from the CPDAG represent causal relationships ∗

Confusion

when the DAG is given by an expert, this graph is very oftencausal

when the DAG is learnt from data, no reason to be causal !

But, is it important ?

equivalent DAGs ⇒ same joint distribution so same result for(probabilistic) inference

⇒ causality is not required for (probabilistic) inference

Montassar Ben Messaoud, Mouna Ben Ishak, Philippe Leray, Nahla Ben AmorApprentissage RB et Ontologies 35/73

Page 48: Apprentissage de la structure des réseaux bayésiens : état

BN definition BN Structure learning BNs and Causality BNs and Ontologies SemCaDo algorithm 2OC approach

A BN is not a causal model

Usual BN

A→ B does not imply direct causal relationship between Aand B,only edges from the CPDAG represent causal relationships ∗

Confusion

when the DAG is given by an expert, this graph is very oftencausal

when the DAG is learnt from data, no reason to be causal !

But, is it important ?

equivalent DAGs ⇒ same joint distribution so same result for(probabilistic) inference

⇒ causality is not required for (probabilistic) inference

Montassar Ben Messaoud, Mouna Ben Ishak, Philippe Leray, Nahla Ben AmorApprentissage RB et Ontologies 35/73

Page 49: Apprentissage de la structure des réseaux bayésiens : état

BN definition BN Structure learning BNs and Causality BNs and Ontologies SemCaDo algorithm 2OC approach

A BN is not a causal model

Usual BN

A→ B does not imply direct causal relationship between Aand B,only edges from the CPDAG represent causal relationships ∗

Confusion

when the DAG is given by an expert, this graph is very oftencausal

when the DAG is learnt from data, no reason to be causal !

But, is it important ?

equivalent DAGs ⇒ same joint distribution so same result for(probabilistic) inference

⇒ causality is not required for (probabilistic) inference

Montassar Ben Messaoud, Mouna Ben Ishak, Philippe Leray, Nahla Ben AmorApprentissage RB et Ontologies 35/73

Page 50: Apprentissage de la structure des réseaux bayésiens : état

BN definition BN Structure learning BNs and Causality BNs and Ontologies SemCaDo algorithm 2OC approach

Causal BN

Definition

each A→ B represents ont direct causal relationship, i.e. A isthe direct cause which generates B

if causality is not required for (probabilistic) inference, what isthe interest of CBNs ?

Montassar Ben Messaoud, Mouna Ben Ishak, Philippe Leray, Nahla Ben AmorApprentissage RB et Ontologies 36/73

Page 51: Apprentissage de la structure des réseaux bayésiens : état

BN definition BN Structure learning BNs and Causality BNs and Ontologies SemCaDo algorithm 2OC approach

Causal BN

Definition

each A→ B represents ont direct causal relationship, i.e. A isthe direct cause which generates B

if causality is not required for (probabilistic) inference, what isthe interest of CBNs ?

Montassar Ben Messaoud, Mouna Ben Ishak, Philippe Leray, Nahla Ben AmorApprentissage RB et Ontologies 36/73

Page 52: Apprentissage de la structure des réseaux bayésiens : état

BN definition BN Structure learning BNs and Causality BNs and Ontologies SemCaDo algorithm 2OC approach

Intervention vs. Observation

Probabilistic inference :we observe B = b,we compute P(A|B = b)

Causal inference [Pearl 00]:we manipulate/intervene upon B : do(B = b)

example with A→ B

P(A|do(B = b)) = P(A),

P(B|do(A = a)) = P(B|A = a)

example with A← B

P(A|do(B = b)) = P(A|B = b),

P(B|do(A = a)) = P(B)

Montassar Ben Messaoud, Mouna Ben Ishak, Philippe Leray, Nahla Ben AmorApprentissage RB et Ontologies 37/73

Page 53: Apprentissage de la structure des réseaux bayésiens : état

BN definition BN Structure learning BNs and Causality BNs and Ontologies SemCaDo algorithm 2OC approach

Manipulation Theorem

How is modified the joint distribution after one manipulationdo(M = m) ?

intuitive version

we forget the ”official” causes of M (its parents in the DAG)

we keep the fact that M = m for its raising effects (to Mchildren)

official version [Spirtes et al. 00]

P(v |do(m)) =

∏Vi∈V \M

P(vi |Pa(Vi ))

M=m

Montassar Ben Messaoud, Mouna Ben Ishak, Philippe Leray, Nahla Ben AmorApprentissage RB et Ontologies 38/73

Page 54: Apprentissage de la structure des réseaux bayésiens : état

BN definition BN Structure learning BNs and Causality BNs and Ontologies SemCaDo algorithm 2OC approach

Causal structure learning

usual situation : observational datawhatever the methods, the right results is the CPDAGpartial determination of the causal structure

How to find a full causal graph ?

use only experimental data, and decide at every step the moreinteresting experiment to realize (active learning [Murphy 01],...)

use only observational data, for a very specific distribution(LiNGAM models [Hoyer et al. 08])

Another idea MyCaDo algorithm [Meganck, Leray & Manderick 06]

use (already existing) observational data to find the CPDAG

complete the orientation with experimental data

Montassar Ben Messaoud, Mouna Ben Ishak, Philippe Leray, Nahla Ben AmorApprentissage RB et Ontologies 39/73

Page 55: Apprentissage de la structure des réseaux bayésiens : état

BN definition BN Structure learning BNs and Causality BNs and Ontologies SemCaDo algorithm 2OC approach

Causal structure learning

usual situation : observational datawhatever the methods, the right results is the CPDAGpartial determination of the causal structure

How to find a full causal graph ?

use only experimental data, and decide at every step the moreinteresting experiment to realize (active learning [Murphy 01],...)

use only observational data, for a very specific distribution(LiNGAM models [Hoyer et al. 08])

Another idea MyCaDo algorithm [Meganck, Leray & Manderick 06]

use (already existing) observational data to find the CPDAG

complete the orientation with experimental data

Montassar Ben Messaoud, Mouna Ben Ishak, Philippe Leray, Nahla Ben AmorApprentissage RB et Ontologies 39/73

Page 56: Apprentissage de la structure des réseaux bayésiens : état

BN definition BN Structure learning BNs and Causality BNs and Ontologies SemCaDo algorithm 2OC approach

Causal structure learning

usual situation : observational datawhatever the methods, the right results is the CPDAGpartial determination of the causal structure

How to find a full causal graph ?

use only experimental data, and decide at every step the moreinteresting experiment to realize (active learning [Murphy 01],...)

use only observational data, for a very specific distribution(LiNGAM models [Hoyer et al. 08])

Another idea MyCaDo algorithm [Meganck, Leray & Manderick 06]

use (already existing) observational data to find the CPDAG

complete the orientation with experimental data

Montassar Ben Messaoud, Mouna Ben Ishak, Philippe Leray, Nahla Ben AmorApprentissage RB et Ontologies 39/73

Page 57: Apprentissage de la structure des réseaux bayésiens : état

BN definition BN Structure learning BNs and Causality BNs and Ontologies SemCaDo algorithm 2OC approach

MyCaDo algorithm

Donnéesd'observation

Donnéesexpérimentales

SystèmeAlgorithmed'apprentissage

de structured'un RB

CPDAG

Algorithme MyCaDo

Choixde l'exp.

Réalisationde l'exp.

Analysedes résultats

Réseau Bayésiencausal

Montassar Ben Messaoud, Mouna Ben Ishak, Philippe Leray, Nahla Ben AmorApprentissage RB et Ontologies 40/73

Page 58: Apprentissage de la structure des réseaux bayésiens : état

BN definition BN Structure learning BNs and Causality BNs and Ontologies SemCaDo algorithm 2OC approach

MyCaDo algorithm

1 Choice of the experiment = what variable M manipulate ?

the one potentially orienting more edgesby taking into account experiment/observation cost

2 Experimentation

do(M = m) for all possible values mObserve all candidate variables C (C –M)

3 Result analysis

P(C |M) (observation) ' P(C |do(M)) (experiment) ?equal : C ← M else M ← Corient some other edges by applying specific rules (cf. PC algoand Meek rules)

Montassar Ben Messaoud, Mouna Ben Ishak, Philippe Leray, Nahla Ben AmorApprentissage RB et Ontologies 41/73

Page 59: Apprentissage de la structure des réseaux bayésiens : état

BN definition BN Structure learning BNs and Causality BNs and Ontologies SemCaDo algorithm 2OC approach

MyCaDo algorithm

1 Choice of the experiment = what variable M manipulate ?

the one potentially orienting more edgesby taking into account experiment/observation cost

2 Experimentation

do(M = m) for all possible values mObserve all candidate variables C (C –M)

3 Result analysis

P(C |M) (observation) ' P(C |do(M)) (experiment) ?equal : C ← M else M ← Corient some other edges by applying specific rules (cf. PC algoand Meek rules)

Montassar Ben Messaoud, Mouna Ben Ishak, Philippe Leray, Nahla Ben AmorApprentissage RB et Ontologies 41/73

Page 60: Apprentissage de la structure des réseaux bayésiens : état

BN definition BN Structure learning BNs and Causality BNs and Ontologies SemCaDo algorithm 2OC approach

MyCaDo algorithm

1 Choice of the experiment = what variable M manipulate ?

the one potentially orienting more edgesby taking into account experiment/observation cost

2 Experimentation

do(M = m) for all possible values mObserve all candidate variables C (C –M)

3 Result analysis

P(C |M) (observation) ' P(C |do(M)) (experiment) ?equal : C ← M else M ← Corient some other edges by applying specific rules (cf. PC algoand Meek rules)

Montassar Ben Messaoud, Mouna Ben Ishak, Philippe Leray, Nahla Ben AmorApprentissage RB et Ontologies 41/73

Page 61: Apprentissage de la structure des réseaux bayésiens : état

BN definition BN Structure learning BNs and Causality BNs and Ontologies SemCaDo algorithm 2OC approach

Outline

1 BN definition

2 BN Structure learning

3 BNs and Causality

4 BNs and Ontologies

5 SemCaDo algorithm

6 2OC approach

Montassar Ben Messaoud, Mouna Ben Ishak, Philippe Leray, Nahla Ben AmorApprentissage RB et Ontologies 42/73

Page 62: Apprentissage de la structure des réseaux bayésiens : état

BN definition BN Structure learning BNs and Causality BNs and Ontologies SemCaDo algorithm 2OC approach

Motivations

Knowledge-based systems aim to make expertise available fordecision making and information sharing.

To resolve some complex problems, the combination ofdifferent knowledge-based systems can be very powerful.

Our idea

Combine Two KBSs : Bayesian Networks and Ontologies, to helpboth.

Montassar Ben Messaoud, Mouna Ben Ishak, Philippe Leray, Nahla Ben AmorApprentissage RB et Ontologies 43/73

Page 63: Apprentissage de la structure des réseaux bayésiens : état

BN definition BN Structure learning BNs and Causality BNs and Ontologies SemCaDo algorithm 2OC approach

Motivations

Knowledge-based systems aim to make expertise available fordecision making and information sharing.

To resolve some complex problems, the combination ofdifferent knowledge-based systems can be very powerful.

Our idea

Combine Two KBSs : Bayesian Networks and Ontologies, to helpboth.

Montassar Ben Messaoud, Mouna Ben Ishak, Philippe Leray, Nahla Ben AmorApprentissage RB et Ontologies 43/73

Page 64: Apprentissage de la structure des réseaux bayésiens : état

BN definition BN Structure learning BNs and Causality BNs and Ontologies SemCaDo algorithm 2OC approach

Motivations

Knowledge-based systems aim to make expertise available fordecision making and information sharing.

To resolve some complex problems, the combination ofdifferent knowledge-based systems can be very powerful.

Our idea

Combine Two KBSs : Bayesian Networks and Ontologies, to helpboth.

Montassar Ben Messaoud, Mouna Ben Ishak, Philippe Leray, Nahla Ben AmorApprentissage RB et Ontologies 43/73

Page 65: Apprentissage de la structure des réseaux bayésiens : état

BN definition BN Structure learning BNs and Causality BNs and Ontologies SemCaDo algorithm 2OC approach

Motivations

Existing works – Bayesian Network =⇒ Ontology

BayesOWL [Ding & Peng, 2004]

OntoBayes [Yang & Calmet, 2005]

PR-OWL [Costa & Laskey, 2006]

Existing works – Ontology =⇒ Bayesian Network

BN ”basic” construction using ontologies [Devitt & al. 2006]

Semantical causal discovery SemCaDo 1.0 [Ben Messaoud &al. 2009]

Our contribution : SemCaDo 2.0

Ontology =⇒ Causal Bayesian Network =⇒ Ontology

Causal Discovery Ontology Evolution

Montassar Ben Messaoud, Mouna Ben Ishak, Philippe Leray, Nahla Ben AmorApprentissage RB et Ontologies 44/73

Page 66: Apprentissage de la structure des réseaux bayésiens : état

BN definition BN Structure learning BNs and Causality BNs and Ontologies SemCaDo algorithm 2OC approach

Motivations

Existing works – Bayesian Network =⇒ Ontology

BayesOWL [Ding & Peng, 2004]

OntoBayes [Yang & Calmet, 2005]

PR-OWL [Costa & Laskey, 2006]

Existing works – Ontology =⇒ Bayesian Network

BN ”basic” construction using ontologies [Devitt & al. 2006]

Semantical causal discovery SemCaDo 1.0 [Ben Messaoud &al. 2009]

Our contribution : SemCaDo 2.0

Ontology =⇒ Causal Bayesian Network =⇒ Ontology

Causal Discovery Ontology Evolution

Montassar Ben Messaoud, Mouna Ben Ishak, Philippe Leray, Nahla Ben AmorApprentissage RB et Ontologies 44/73

Page 67: Apprentissage de la structure des réseaux bayésiens : état

BN definition BN Structure learning BNs and Causality BNs and Ontologies SemCaDo algorithm 2OC approach

Motivations

Existing works – Bayesian Network =⇒ Ontology

BayesOWL [Ding & Peng, 2004]

OntoBayes [Yang & Calmet, 2005]

PR-OWL [Costa & Laskey, 2006]

Existing works – Ontology =⇒ Bayesian Network

BN ”basic” construction using ontologies [Devitt & al. 2006]

Semantical causal discovery SemCaDo 1.0 [Ben Messaoud &al. 2009]

Our contribution : SemCaDo 2.0

Ontology =⇒ Causal Bayesian Network =⇒ Ontology

Causal Discovery Ontology Evolution

Montassar Ben Messaoud, Mouna Ben Ishak, Philippe Leray, Nahla Ben AmorApprentissage RB et Ontologies 44/73

Page 68: Apprentissage de la structure des réseaux bayésiens : état

BN definition BN Structure learning BNs and Causality BNs and Ontologies SemCaDo algorithm 2OC approach

Ontology

Basics

Definition:

Shared understanding within a community of people.Declarative specification of entities and their relationships witheach other.Separate the domain knowledge from the operationalknowledge.

Construction: expertise, ontology evolution

Reasoning: Description logic reasoners

Montassar Ben Messaoud, Mouna Ben Ishak, Philippe Leray, Nahla Ben AmorApprentissage RB et Ontologies 45/73

Page 69: Apprentissage de la structure des réseaux bayésiens : état

BN definition BN Structure learning BNs and Causality BNs and Ontologies SemCaDo algorithm 2OC approach

Elements of an ontology

C: Classes (concepts)

P: Attributes (properties)

H: Hierarchical structure (is-a,part-of relations)

R: Other semantic relationships

I: Instances (individuals)

A: Axioms (logic statements)

Landslide

TsunamiFire

Catastrophes

Volcano

Earthquake

Flood

Montassar Ben Messaoud, Mouna Ben Ishak, Philippe Leray, Nahla Ben AmorApprentissage RB et Ontologies 46/73

Page 70: Apprentissage de la structure des réseaux bayésiens : état

BN definition BN Structure learning BNs and Causality BNs and Ontologies SemCaDo algorithm 2OC approach

Elements of an ontology

C: Classes (concepts)

P: Attributes (properties)

H: Hierarchical structure (is-a,part-of relations)

R: Other semantic relationships

I: Instances (individuals)

A: Axioms (logic statements)

Landslide

TsunamiFire

Catastrophes

Volcano

Earthquake

FloodNameLocalizationDateNb_killed

NameMgnitudeLocalizationDateNb_killed

Montassar Ben Messaoud, Mouna Ben Ishak, Philippe Leray, Nahla Ben AmorApprentissage RB et Ontologies 46/73

Page 71: Apprentissage de la structure des réseaux bayésiens : état

BN definition BN Structure learning BNs and Causality BNs and Ontologies SemCaDo algorithm 2OC approach

Elements of an ontology

C: Classes (concepts)

P: Attributes (properties)

H: Hierarchical structure (is-a,part-of relations)

R: Other semantic relationships

I: Instances (individuals)

A: Axioms (logic statements) Landslide TsunamiFire

Natural

VolcanoEarthquakeFlood

Catastrophes

Man-made

is-ais-ais-ais-ais-ais-a

is-ais-a

Montassar Ben Messaoud, Mouna Ben Ishak, Philippe Leray, Nahla Ben AmorApprentissage RB et Ontologies 46/73

Page 72: Apprentissage de la structure des réseaux bayésiens : état

BN definition BN Structure learning BNs and Causality BNs and Ontologies SemCaDo algorithm 2OC approach

Elements of an ontology

C: Classes (concepts)

P: Attributes (properties)

H: Hierarchical structure (is-a,part-of relations)

R: Other semantic relationships

I: Instances (individuals)

A: Axioms (logic statements)

Landslide TsunamiFire

Natural

VolcanoEarthquakeFlood

Catastrophes

Man-made

is-ais-ais-ais-ais-ais-a

is-ais-a

Causes

Causes Causes

CausesCausesCauses

Montassar Ben Messaoud, Mouna Ben Ishak, Philippe Leray, Nahla Ben AmorApprentissage RB et Ontologies 46/73

Page 73: Apprentissage de la structure des réseaux bayésiens : état

BN definition BN Structure learning BNs and Causality BNs and Ontologies SemCaDo algorithm 2OC approach

Elements of an ontology

C: Classes (concepts)

P: Attributes (properties)

H: Hierarchical structure (is-a,part-of relations)

R: Other semantic relationships

I: Instances (individuals)

A: Axioms (logic statements)

Landslide TsunamiFire

Natural

VolcanoEarthquakeFlood

Catastrophes

Man-made

is-ais-ais-ais-ais-ais-a

is-ais-a

Aleppo Shaanxi Haiti

is-instanceis-instance is-instance

Montassar Ben Messaoud, Mouna Ben Ishak, Philippe Leray, Nahla Ben AmorApprentissage RB et Ontologies 46/73

Page 74: Apprentissage de la structure des réseaux bayésiens : état

BN definition BN Structure learning BNs and Causality BNs and Ontologies SemCaDo algorithm 2OC approach

Elements of an ontology

C: Classes (concepts)

P: Attributes (properties)

H: Hierarchical structure (is-a,part-of relations)

R: Other semantic relationships

I: Instances (individuals)

A: Axioms (logic statements)

Landslide TsunamiFire

Natural

VolcanoEarthquakeFlood

Catastrophes

Man-made

is-ais-ais-ais-ais-ais-a

is-ais-a

Aleppo Shaanxi Haiti

is-instanceis-instance is-instance

Montassar Ben Messaoud, Mouna Ben Ishak, Philippe Leray, Nahla Ben AmorApprentissage RB et Ontologies 46/73

Page 75: Apprentissage de la structure des réseaux bayésiens : état

BN definition BN Structure learning BNs and Causality BNs and Ontologies SemCaDo algorithm 2OC approach

Ontology evolution

Ontology population

Get new instances of concept(s) already present in the ontology.

Ontology enrichment

Update (add or modify) concepts, properties and relations in agiven ontology.

Evolution vs. revolution

Evolution (ontology continuity): Add new knowledge.

Revolution (ontology discontinuity): Modify existingknowledge.

Montassar Ben Messaoud, Mouna Ben Ishak, Philippe Leray, Nahla Ben AmorApprentissage RB et Ontologies 47/73

Page 76: Apprentissage de la structure des réseaux bayésiens : état

BN definition BN Structure learning BNs and Causality BNs and Ontologies SemCaDo algorithm 2OC approach

Ontology evolution

Ontology population

Get new instances of concept(s) already present in the ontology.

Ontology enrichment

Update (add or modify) concepts, properties and relations in agiven ontology.

Evolution vs. revolution

Evolution (ontology continuity): Add new knowledge.

Revolution (ontology discontinuity): Modify existingknowledge.

Montassar Ben Messaoud, Mouna Ben Ishak, Philippe Leray, Nahla Ben AmorApprentissage RB et Ontologies 47/73

Page 77: Apprentissage de la structure des réseaux bayésiens : état

BN definition BN Structure learning BNs and Causality BNs and Ontologies SemCaDo algorithm 2OC approach

Ontology evolution

Ontology population

Get new instances of concept(s) already present in the ontology.

Ontology enrichment

Update (add or modify) concepts, properties and relations in agiven ontology.

Evolution vs. revolution

Evolution (ontology continuity): Add new knowledge.

Revolution (ontology discontinuity): Modify existingknowledge.

Montassar Ben Messaoud, Mouna Ben Ishak, Philippe Leray, Nahla Ben AmorApprentissage RB et Ontologies 47/73

Page 78: Apprentissage de la structure des réseaux bayésiens : état

BN definition BN Structure learning BNs and Causality BNs and Ontologies SemCaDo algorithm 2OC approach

Ontologies vs. BNs

Construction

Probabilistic inference

Parameters

Structure

Construction

Logical inference

Creation

Maintenance

Probabilistic Graphical Models (PGM)Ontologies

Merging…

Evolving

Montassar Ben Messaoud, Mouna Ben Ishak, Philippe Leray, Nahla Ben AmorApprentissage RB et Ontologies 48/73

Page 79: Apprentissage de la structure des réseaux bayésiens : état

BN definition BN Structure learning BNs and Causality BNs and Ontologies SemCaDo algorithm 2OC approach

Outline

1 BN definition

2 BN Structure learning

3 BNs and Causality

4 BNs and Ontologies

5 SemCaDo algorithm

6 2OC approach

Montassar Ben Messaoud, Mouna Ben Ishak, Philippe Leray, Nahla Ben AmorApprentissage RB et Ontologies 49/73

Page 80: Apprentissage de la structure des réseaux bayésiens : état

BN definition BN Structure learning BNs and Causality BNs and Ontologies SemCaDo algorithm 2OC approach

SemCaDo assumptions

BN ⇔ Ontology general assumptions

Nodes ⇐⇒ Concepts

Random variables ⇐⇒ Concept attributes

Causal dependencies ⇐⇒ Semantic causal relations

Data ⇐⇒ Concept-attribute instances

SemCaDo specific assumptions

Causal relations concern concepts sharing the same semantictype.

Ontology continuity (evolution).

Montassar Ben Messaoud, Mouna Ben Ishak, Philippe Leray, Nahla Ben AmorApprentissage RB et Ontologies 50/73

Page 81: Apprentissage de la structure des réseaux bayésiens : état

BN definition BN Structure learning BNs and Causality BNs and Ontologies SemCaDo algorithm 2OC approach

SemCaDo assumptions

BN ⇔ Ontology general assumptions

Nodes ⇐⇒ Concepts

Random variables ⇐⇒ Concept attributes

Causal dependencies ⇐⇒ Semantic causal relations

Data ⇐⇒ Concept-attribute instances

SemCaDo specific assumptions

Causal relations concern concepts sharing the same semantictype.

Ontology continuity (evolution).

Montassar Ben Messaoud, Mouna Ben Ishak, Philippe Leray, Nahla Ben AmorApprentissage RB et Ontologies 50/73

Page 82: Apprentissage de la structure des réseaux bayésiens : état

BN definition BN Structure learning BNs and Causality BNs and Ontologies SemCaDo algorithm 2OC approach

Example

Landslide TsunamiFire

Natural

VolcanoEarthquakeFlood

Catastrophes

Man-made

is-ais-ais-ais-ais-ais-a

is-ais-a

Causes

Causes Causes

CausesCausesCauses

Our causal BN will represent the grey part of the ontology

Montassar Ben Messaoud, Mouna Ben Ishak, Philippe Leray, Nahla Ben AmorApprentissage RB et Ontologies 51/73

Page 83: Apprentissage de la structure des réseaux bayésiens : état

BN definition BN Structure learning BNs and Causality BNs and Ontologies SemCaDo algorithm 2OC approach

SemCaDo three main steps

Obs.Data

1st stepStructure learning

3rd stepOntology evolutionChoice of experienceChoice of experience

Analyse the results

2nd stepCausal discovery

PDAG CBN

Perform the experience

Enriched ontology

Domainontology

Inter.Data

Montassar Ben Messaoud, Mouna Ben Ishak, Philippe Leray, Nahla Ben AmorApprentissage RB et Ontologies 52/73

Page 84: Apprentissage de la structure des réseaux bayésiens : état

BN definition BN Structure learning BNs and Causality BNs and Ontologies SemCaDo algorithm 2OC approach

1st step: initial BN structure learning

Extraction of the causal relationships (Rc) from the ontology.

Integration of these edge as constraints in the structurelearning algorithm [De Campos & al., 2007].

Continuity : these edges will not be ”questioned” duringlearning

Interest

Ontology helps in reducing the search task complexity.

Montassar Ben Messaoud, Mouna Ben Ishak, Philippe Leray, Nahla Ben AmorApprentissage RB et Ontologies 53/73

Page 85: Apprentissage de la structure des réseaux bayésiens : état

BN definition BN Structure learning BNs and Causality BNs and Ontologies SemCaDo algorithm 2OC approach

1st step: initial BN structure learning

Extraction of the causal relationships (Rc) from the ontology.

Integration of these edge as constraints in the structurelearning algorithm [De Campos & al., 2007].

Continuity : these edges will not be ”questioned” duringlearning

Interest

Ontology helps in reducing the search task complexity.

Montassar Ben Messaoud, Mouna Ben Ishak, Philippe Leray, Nahla Ben AmorApprentissage RB et Ontologies 53/73

Page 86: Apprentissage de la structure des réseaux bayésiens : état

BN definition BN Structure learning BNs and Causality BNs and Ontologies SemCaDo algorithm 2OC approach

2nd step: serendipitous causal discovery

Experimentations are needed in order to find the causalorientation of some edges

Our solution : MyCaDo [Meganck & al., 2006], iterativecausal discovery process

Adaptation to take into account ontological knowledge : Radadistance on H between one set of concepts and their mostspecific common subsumer.

Interest

Ontology helps in potentially orienting the more unexpected(serendipitous) links

Montassar Ben Messaoud, Mouna Ben Ishak, Philippe Leray, Nahla Ben AmorApprentissage RB et Ontologies 54/73

Page 87: Apprentissage de la structure des réseaux bayésiens : état

BN definition BN Structure learning BNs and Causality BNs and Ontologies SemCaDo algorithm 2OC approach

2nd step: serendipitous causal discovery

Experimentations are needed in order to find the causalorientation of some edges

Our solution : MyCaDo [Meganck & al., 2006], iterativecausal discovery process

Adaptation to take into account ontological knowledge : Radadistance on H between one set of concepts and their mostspecific common subsumer.

Interest

Ontology helps in potentially orienting the more unexpected(serendipitous) links

Montassar Ben Messaoud, Mouna Ben Ishak, Philippe Leray, Nahla Ben AmorApprentissage RB et Ontologies 54/73

Page 88: Apprentissage de la structure des réseaux bayésiens : état

BN definition BN Structure learning BNs and Causality BNs and Ontologies SemCaDo algorithm 2OC approach

3rd Step: Ontology evolution process

[Stojanovic et al., 2002]

Interest

BN structure learning from data helps in discovering new relationsin the ontology

Montassar Ben Messaoud, Mouna Ben Ishak, Philippe Leray, Nahla Ben AmorApprentissage RB et Ontologies 55/73

Page 89: Apprentissage de la structure des réseaux bayésiens : état

BN definition BN Structure learning BNs and Causality BNs and Ontologies SemCaDo algorithm 2OC approach

3rd Step: Ontology evolution process

[Stojanovic et al., 2002]

Interest

BN structure learning from data helps in discovering new relationsin the ontology

Montassar Ben Messaoud, Mouna Ben Ishak, Philippe Leray, Nahla Ben AmorApprentissage RB et Ontologies 55/73

Page 90: Apprentissage de la structure des réseaux bayésiens : état

BN definition BN Structure learning BNs and Causality BNs and Ontologies SemCaDo algorithm 2OC approach

Experimental study (1)

Benchmark : no existing real benchmark & system :-(

BN graph : random generation (50 to 200 nodes).

Ontology :

Causal relationships : BN edgesHierarchy of concept : generation by clustering BN nodes

Data is generated by using BN as a generative model.

Experimental protocol

Hierarchy of concepts and 10% to 40% of existing causalrelationships are given as inputs

Semantic gain : cumulative Rada distance of the discoveredrelationships

Montassar Ben Messaoud, Mouna Ben Ishak, Philippe Leray, Nahla Ben AmorApprentissage RB et Ontologies 56/73

Page 91: Apprentissage de la structure des réseaux bayésiens : état

BN definition BN Structure learning BNs and Causality BNs and Ontologies SemCaDo algorithm 2OC approach

Experimental study (1)

Benchmark : no existing real benchmark & system :-(

BN graph : random generation (50 to 200 nodes).

Ontology :

Causal relationships : BN edgesHierarchy of concept : generation by clustering BN nodes

Data is generated by using BN as a generative model.

Experimental protocol

Hierarchy of concepts and 10% to 40% of existing causalrelationships are given as inputs

Semantic gain : cumulative Rada distance of the discoveredrelationships

Montassar Ben Messaoud, Mouna Ben Ishak, Philippe Leray, Nahla Ben AmorApprentissage RB et Ontologies 56/73

Page 92: Apprentissage de la structure des réseaux bayésiens : état

BN definition BN Structure learning BNs and Causality BNs and Ontologies SemCaDo algorithm 2OC approach

Experimental study (2)

Ontology helps structure learning

SemCaDo performs better in less steps than MyCaDo

Montassar Ben Messaoud, Mouna Ben Ishak, Philippe Leray, Nahla Ben AmorApprentissage RB et Ontologies 57/73

Page 93: Apprentissage de la structure des réseaux bayésiens : état

BN definition BN Structure learning BNs and Causality BNs and Ontologies SemCaDo algorithm 2OC approach

Experimental study (2)

Structure learning helps ontology evolution

Original causal discoveries are discovered first and can beadded to the ontology

Montassar Ben Messaoud, Mouna Ben Ishak, Philippe Leray, Nahla Ben AmorApprentissage RB et Ontologies 57/73

Page 94: Apprentissage de la structure des réseaux bayésiens : état

BN definition BN Structure learning BNs and Causality BNs and Ontologies SemCaDo algorithm 2OC approach

Future works

SemCaDo 2.0

Test SemCaDo on a real system.

SemCaDo 3.0

Interact better with the ontology (not only H) during thecausal discovery process.

Soften the continuity hypothesis : ontology enrichment

Generalize to any type of semantic relations in R.

Extend to probabilistic relational models and probabilisticontologies.

Montassar Ben Messaoud, Mouna Ben Ishak, Philippe Leray, Nahla Ben AmorApprentissage RB et Ontologies 58/73

Page 95: Apprentissage de la structure des réseaux bayésiens : état

BN definition BN Structure learning BNs and Causality BNs and Ontologies SemCaDo algorithm 2OC approach

Future works

SemCaDo 2.0

Test SemCaDo on a real system.

SemCaDo 3.0

Interact better with the ontology (not only H) during thecausal discovery process.

Soften the continuity hypothesis : ontology enrichment

Generalize to any type of semantic relations in R.

Extend to probabilistic relational models and probabilisticontologies.

Montassar Ben Messaoud, Mouna Ben Ishak, Philippe Leray, Nahla Ben AmorApprentissage RB et Ontologies 58/73

Page 96: Apprentissage de la structure des réseaux bayésiens : état

BN definition BN Structure learning BNs and Causality BNs and Ontologies SemCaDo algorithm 2OC approach

Outline

1 BN definition

2 BN Structure learning

3 BNs and Causality

4 BNs and Ontologies

5 SemCaDo algorithm

6 2OC approach

Montassar Ben Messaoud, Mouna Ben Ishak, Philippe Leray, Nahla Ben AmorApprentissage RB et Ontologies 59/73

Page 97: Apprentissage de la structure des réseaux bayésiens : état

BN definition BN Structure learning BNs and Causality BNs and Ontologies SemCaDo algorithm 2OC approach

Motivations

Previous works between BNs and Ontologies

Do not explore all the expressive capabilities of ontologies.

Focus for the most part on reasoning

Our idea

Look for one BN extension : Object Oriented BNs.

Montassar Ben Messaoud, Mouna Ben Ishak, Philippe Leray, Nahla Ben AmorApprentissage RB et Ontologies 60/73

Page 98: Apprentissage de la structure des réseaux bayésiens : état

BN definition BN Structure learning BNs and Causality BNs and Ontologies SemCaDo algorithm 2OC approach

Motivations

Previous works between BNs and Ontologies

Do not explore all the expressive capabilities of ontologies.

Focus for the most part on reasoning

Our idea

Look for one BN extension : Object Oriented BNs.

Montassar Ben Messaoud, Mouna Ben Ishak, Philippe Leray, Nahla Ben AmorApprentissage RB et Ontologies 60/73

Page 99: Apprentissage de la structure des réseaux bayésiens : état

BN definition BN Structure learning BNs and Causality BNs and Ontologies SemCaDo algorithm 2OC approach

Object Oriented Bayesian Networks

An extension of BNs using the object paradigm

[Bangsø and Wuillemin, 2000a; Bangsø and Wuillemin,2000b; Koller and Pfeffer, 1997].

Support several aspects of the object oriented modeling. (e.g.,inheritance, instantiation).

Designed to model large and complex domains.

OOBN structure learning : OO-SEM

This algorithm [Langseth and Nielsen, 2003] is based on 2 steps

Generation of a prior OOBN based on a prior expert knowledge

Grouping nodes into instantiations and instantiations intoclasses.Giving a prior information about the candidate interfaces.

Adaptation of Structural EM algorithm to learn the final structure.

Montassar Ben Messaoud, Mouna Ben Ishak, Philippe Leray, Nahla Ben AmorApprentissage RB et Ontologies 61/73

Page 100: Apprentissage de la structure des réseaux bayésiens : état

BN definition BN Structure learning BNs and Causality BNs and Ontologies SemCaDo algorithm 2OC approach

Object Oriented Bayesian Networks

An extension of BNs using the object paradigm

[Bangsø and Wuillemin, 2000a; Bangsø and Wuillemin,2000b; Koller and Pfeffer, 1997].

Support several aspects of the object oriented modeling. (e.g.,inheritance, instantiation).

Designed to model large and complex domains.

OOBN structure learning : OO-SEM

This algorithm [Langseth and Nielsen, 2003] is based on 2 steps

Generation of a prior OOBN based on a prior expert knowledge

Grouping nodes into instantiations and instantiations intoclasses.Giving a prior information about the candidate interfaces.

Adaptation of Structural EM algorithm to learn the final structure.

Montassar Ben Messaoud, Mouna Ben Ishak, Philippe Leray, Nahla Ben AmorApprentissage RB et Ontologies 61/73

Page 101: Apprentissage de la structure des réseaux bayésiens : état

BN definition BN Structure learning BNs and Causality BNs and Ontologies SemCaDo algorithm 2OC approach

OOBN structure learning

Drawbacks

This prior knowledge is not always obvious to obtain.

The expert should be familiar with the object orientedmodeling.

Our idea

Harness ontologies representation capabilities in order to generatethe prior OOBN structure.

Ontologies OOBNsConcepts Cp Classes

Properties Pcpi Real nodes

Inheritance relations HR Class hierarchies

Semantic relations SR Links/ Interfaces

Montassar Ben Messaoud, Mouna Ben Ishak, Philippe Leray, Nahla Ben AmorApprentissage RB et Ontologies 62/73

Page 102: Apprentissage de la structure des réseaux bayésiens : état

BN definition BN Structure learning BNs and Causality BNs and Ontologies SemCaDo algorithm 2OC approach

OOBN structure learning

Drawbacks

This prior knowledge is not always obvious to obtain.

The expert should be familiar with the object orientedmodeling.

Our idea

Harness ontologies representation capabilities in order to generatethe prior OOBN structure.

Ontologies OOBNsConcepts Cp Classes

Properties Pcpi Real nodes

Inheritance relations HR Class hierarchies

Semantic relations SR Links/ Interfaces

Montassar Ben Messaoud, Mouna Ben Ishak, Philippe Leray, Nahla Ben AmorApprentissage RB et Ontologies 62/73

Page 103: Apprentissage de la structure des réseaux bayésiens : état

BN definition BN Structure learning BNs and Causality BNs and Ontologies SemCaDo algorithm 2OC approach

The 2OC approach

Ontology

Prior OOBN

Final OOBN

it seems interesting to refine the ontology

used initially !

Morphing process

Learning process

Change detection process

Proposals of evolution

Data (1) ontology to prior OOBN

[Ben Ishak et al. 2011a]

(2) final OOBN to ontology

[Ben Ishak et al. 2011b]

Montassar Ben Messaoud, Mouna Ben Ishak, Philippe Leray, Nahla Ben AmorApprentissage RB et Ontologies 63/73

Page 104: Apprentissage de la structure des réseaux bayésiens : état

BN definition BN Structure learning BNs and Causality BNs and Ontologies SemCaDo algorithm 2OC approach

Onto2PriorOOBN algorithm

Ontology based generation of a prior OOBN

Ontology graph traversal and morphing into a prior OOBNstructure.

3 steps

Initialization step: to generate the OOBN class and a classto each concept.Discovery step: to define input, internal and output sets foreach class of the OOBN.Closing step: to define instances to add to the global OOBNclass.

Montassar Ben Messaoud, Mouna Ben Ishak, Philippe Leray, Nahla Ben AmorApprentissage RB et Ontologies 64/73

Page 105: Apprentissage de la structure des réseaux bayésiens : état

BN definition BN Structure learning BNs and Causality BNs and Ontologies SemCaDo algorithm 2OC approach

Onto2PriorOOBN algorithm

Ontology based generation of a prior OOBN

Ontology graph traversal and morphing into a prior OOBNstructure.

3 steps

Initialization step: to generate the OOBN class and a classto each concept.Discovery step: to define input, internal and output sets foreach class of the OOBN.Closing step: to define instances to add to the global OOBNclass.

Montassar Ben Messaoud, Mouna Ben Ishak, Philippe Leray, Nahla Ben AmorApprentissage RB et Ontologies 64/73

Page 106: Apprentissage de la structure des réseaux bayésiens : état

BN definition BN Structure learning BNs and Causality BNs and Ontologies SemCaDo algorithm 2OC approach

Onto2PriorOOBN algorithm

Ontology based generation of a prior OOBN

Ontology graph traversal and morphing into a prior OOBNstructure.

3 steps

Initialization step: to generate the OOBN class and a classto each concept.Discovery step: to define input, internal and output sets foreach class of the OOBN.Closing step: to define instances to add to the global OOBNclass.

Montassar Ben Messaoud, Mouna Ben Ishak, Philippe Leray, Nahla Ben AmorApprentissage RB et Ontologies 64/73

Page 107: Apprentissage de la structure des réseaux bayésiens : état

BN definition BN Structure learning BNs and Causality BNs and Ontologies SemCaDo algorithm 2OC approach

Onto2PriorOOBN algorithm

Ontology based generation of a prior OOBN

Ontology graph traversal and morphing into a prior OOBNstructure.

3 steps

Initialization step: to generate the OOBN class and a classto each concept.Discovery step: to define input, internal and output sets foreach class of the OOBN.Closing step: to define instances to add to the global OOBNclass.

Montassar Ben Messaoud, Mouna Ben Ishak, Philippe Leray, Nahla Ben AmorApprentissage RB et Ontologies 64/73

Page 108: Apprentissage de la structure des réseaux bayésiens : état

BN definition BN Structure learning BNs and Causality BNs and Ontologies SemCaDo algorithm 2OC approach

Illustrative example : initial ontology

MedCostThisCarDamThisCarCostPropCostILiCostOtherCarCost

Insurance AgeGoodStudenSocioEconHomeBaseAntiTheftVehicleYearMakeModelOtherCar

CarOwner

AccidentAccident

AirbagCushioningMileageCarValueRuggedAutoAntilock

CarTheft

Theft

concerns

DrivQualitySeniorTrainDrivingSkillDrivHistRiskAversion

Driver

has drivercharacteristics

repaiesrepaiesconcerns

Montassar Ben Messaoud, Mouna Ben Ishak, Philippe Leray, Nahla Ben AmorApprentissage RB et Ontologies 65/73

Page 109: Apprentissage de la structure des réseaux bayésiens : état

BN definition BN Structure learning BNs and Causality BNs and Ontologies SemCaDo algorithm 2OC approach

Illustrative example : prior OOBN

MedCost

ThisCarCostThisCarDam

PropCost

OtherCarCost

ILiCost

GoodStuden

OtherCar

SocioEcon

Age

AntiTheft

MakeModel

HomeBase

VehicleYear

GoodStuden

OtherCar

SocioEcon

Age

AntiTheft

MakeModel

HomeBase

VehicleYear

CO:CarOwner

Global_Insurance

accident

A:Accident

accident

C:Car

Airbag Cushioning Mileage

CarValue RuggedAuto Antilock

Airbag Cushioning Mileage

CarValue RuggedAuto Antilock

Theft

T: Theft

Theft

I:Insurance

SeniorTrain

D:Driver

DrivingSkill

RiskAversion DrivQuality

DrivHist

SeniorTrain DrivingSkill

RiskAversion DrivQuality

DrivHist

Montassar Ben Messaoud, Mouna Ben Ishak, Philippe Leray, Nahla Ben AmorApprentissage RB et Ontologies 66/73

Page 110: Apprentissage de la structure des réseaux bayésiens : état

BN definition BN Structure learning BNs and Causality BNs and Ontologies SemCaDo algorithm 2OC approach

FinalOOBN2Onto

Ontology enrichment based on OOBN learning

Part of the ontology evolution process.

Consists in adding, removing or modifying concepts,properties and/or relations.

Montassar Ben Messaoud, Mouna Ben Ishak, Philippe Leray, Nahla Ben AmorApprentissage RB et Ontologies 67/73

Page 111: Apprentissage de la structure des réseaux bayésiens : état

BN definition BN Structure learning BNs and Causality BNs and Ontologies SemCaDo algorithm 2OC approach

Example : remove relations

No common interface identified between two classes⇒ their corresponding concepts should be independent.

After morphing

After learning

Possible change

p1.1p1.2

Cp1p2.1P2.2

Cp2

I1: cCp1

I2: cCp2

p1.1

p1.2p2.1p2.2

cp2.2 cp2.1

p1.1p1.2

Cp1p2.1P2.2

Cp2

I1: cCp1

I2: cCp2

p1.1

p1.2p2.1p2.2

SR

Montassar Ben Messaoud, Mouna Ben Ishak, Philippe Leray, Nahla Ben AmorApprentissage RB et Ontologies 68/73

Page 112: Apprentissage de la structure des réseaux bayésiens : état

BN definition BN Structure learning BNs and Causality BNs and Ontologies SemCaDo algorithm 2OC approach

Example : add concepts / relations

If ccp communicates with only one class → Add relation.

Otherwise, check classes similarities → Add concepts /relations.

p1.1p1.2

Cp1

p2.1p2.2

Cp2

p3.1

Cp3p4.1p4.2

Cp4

I1: cCp1I4: cCp4

p1.1 p1.2

p4.1p4.2

p4.2 p4.1

I3: cCp3

p3.2

p3.2

I2: cCp2

p2.1p2.2

p2.2 p2.1

I1: cCp1

p1.2

I2: cCp2

p2.1p2.2

p2.2

p1.1p1.2

Cp1

p3.1

Cp3

p super.1

CpSuper

p2.2

Cp2p4.1

Cp4

After morphing Possible changeAfter learning

is-a is-a

I4: cCp4

p4.1p4.2

p4.1

I3: cCp3

p3.2p3.2

p1.1

SR1 SR2

SR3

SR1 SR2

SR3

Montassar Ben Messaoud, Mouna Ben Ishak, Philippe Leray, Nahla Ben AmorApprentissage RB et Ontologies 69/73

Page 113: Apprentissage de la structure des réseaux bayésiens : état

BN definition BN Structure learning BNs and Causality BNs and Ontologies SemCaDo algorithm 2OC approach

Example : concepts redefinition

If the class contains more than one component, then Thecorresponding concept may be deconstructed into morerefined ones.

p1p2p3p4

Cp

Ccp

p1p2

p3p4

I1 I2

A disconnected graph

After learningp1p2p3

Cp1

p4

Cp2

Possible change

Montassar Ben Messaoud, Mouna Ben Ishak, Philippe Leray, Nahla Ben AmorApprentissage RB et Ontologies 70/73

Page 114: Apprentissage de la structure des réseaux bayésiens : état

BN definition BN Structure learning BNs and Causality BNs and Ontologies SemCaDo algorithm 2OC approach

Concepts and relations identification

Semi-automated process

The possible changes are communicated to an expert

The expert semantically identify the discovered relations and /or concepts

Montassar Ben Messaoud, Mouna Ben Ishak, Philippe Leray, Nahla Ben AmorApprentissage RB et Ontologies 71/73

Page 115: Apprentissage de la structure des réseaux bayésiens : état

BN definition BN Structure learning BNs and Causality BNs and Ontologies SemCaDo algorithm 2OC approach

Conclusion

We presented here two approaches for integrating semanticalknowledge in order to help BN structure learning :

SEMCADO : Causal BN, choice of ”original” experiments

O2C : Object-oriented BN, observational data

Originality

Use an ontology instead of expert knowledge : separationbetween expert acquisition and structure learning

The bidirectional benefit: a real cooperation, in both ways,between ontologies and OOBNs.

Difficulties

No similar work or benchmark for the comparative study

Montassar Ben Messaoud, Mouna Ben Ishak, Philippe Leray, Nahla Ben AmorApprentissage RB et Ontologies 72/73

Page 116: Apprentissage de la structure des réseaux bayésiens : état

BN definition BN Structure learning BNs and Causality BNs and Ontologies SemCaDo algorithm 2OC approach

Conclusion

We presented here two approaches for integrating semanticalknowledge in order to help BN structure learning :

SEMCADO : Causal BN, choice of ”original” experiments

O2C : Object-oriented BN, observational data

Originality

Use an ontology instead of expert knowledge : separationbetween expert acquisition and structure learning

The bidirectional benefit: a real cooperation, in both ways,between ontologies and OOBNs.

Difficulties

No similar work or benchmark for the comparative study

Montassar Ben Messaoud, Mouna Ben Ishak, Philippe Leray, Nahla Ben AmorApprentissage RB et Ontologies 72/73

Page 117: Apprentissage de la structure des réseaux bayésiens : état

BN definition BN Structure learning BNs and Causality BNs and Ontologies SemCaDo algorithm 2OC approach

Conclusion

We presented here two approaches for integrating semanticalknowledge in order to help BN structure learning :

SEMCADO : Causal BN, choice of ”original” experiments

O2C : Object-oriented BN, observational data

Originality

Use an ontology instead of expert knowledge : separationbetween expert acquisition and structure learning

The bidirectional benefit: a real cooperation, in both ways,between ontologies and OOBNs.

Difficulties

No similar work or benchmark for the comparative study

Montassar Ben Messaoud, Mouna Ben Ishak, Philippe Leray, Nahla Ben AmorApprentissage RB et Ontologies 72/73

Page 118: Apprentissage de la structure des réseaux bayésiens : état

BN definition BN Structure learning BNs and Causality BNs and Ontologies SemCaDo algorithm 2OC approach

Thank you for your attentionMeganck, S., Leray, P., and Manderick, B. (2006). Learning causal bayesian networks from observationsand experiments: A decision theoritic approach. In Proceedings of the Third International Conference,MDAI 2006, LNAI 3885, pages 58-69, Tarragona, Spain. Springer.

Ben Messaoud, M., Leray, P., and Ben Amor, N. (2009). Integrating ontological knowledge for iterativecausal discovery and vizualisation. In Proceedings of the 10th European Conference on Symbolic andQuantitative Approaches to Reasoning with Uncertainty (ECSQARU 2009), pages 168-179, Verona, Italy.

Ben Messaoud, M., Leray, P., and Ben Amor, N. (2011a). Semcado: a serendipitous strategy for learningcausal bayesian networks using ontologies. In Proceedings of the 11th European Conference on Symbolicand Quantitative Approaches to Reasoning with Uncertainty (ECSQARU 2011), pages 182-193, Belfast,Northern Ireland.

Ben Messaoud, M., Leray, P., and Ben Amor, N. (2011b). Semcado: a serendipitous causal discoveryalgorithm for ontology evolution. In The IJCAI-11 Workshop on Automated Reasoning about Context andOntology Evolution (ARCOE-11), pages 43-47, Barcelona, Spain.

Ben Ishak, M., Leray, P., and Ben Amor, N. (2011a). Ontology-based generation of object orientedbayesian networks. In Proceedings of the 8th Bayesian Modeling Applications Workshop (BMAW-11),workshop of the 27th Conference on Uncertainty in Artificial Intelligence (UAI 2011), pages 9-17,Barcelona, Spain.

Ben Ishak, M., Leray, P., and Ben Amor, N. (2011b). A two-way approach for probabilistic graphical modelsstructure learning and ontology enrichment. In Proceedings of the International Conference on KnowledgeEngineering and Ontology Development (KEOD 2011) part of the International Joint Conference onKnowledge Discovery, Knowledge Engineering and Knowledge Management IC3K, pages ?-?, Paris, France.

Montassar Ben Messaoud, Mouna Ben Ishak, Philippe Leray, Nahla Ben AmorApprentissage RB et Ontologies 73/73

Page 119: Apprentissage de la structure des réseaux bayésiens : état

Annexes

OOBN definition

Models the domain using fragments of a Bayesian networkknown as classes.

Each class is a DAG over three sets of nodes (I,H,O):

RuggedAuto

Antilock

CarValue

Airbag

Mileage

VehicleYear MakeModel

Car

Cushioning

I

H

O

I +O = the class interface.

Montassar Ben Messaoud, Mouna Ben Ishak, Philippe Leray, Nahla Ben AmorApprentissage RB et Ontologies 1/8

Page 120: Apprentissage de la structure des réseaux bayésiens : état

Annexes

OOBN nodes

Instantiations: representing the instantiation of a class insideanother class.

SocioEconAge

RiskAversionDrivQuality

D:DriverSocioEcon

Age

HomeBase

VehicleYear MakeModelAntiTheftDrivQuality

CO:CarOwner

Age

MedCost

Insurance

Montassar Ben Messaoud, Mouna Ben Ishak, Philippe Leray, Nahla Ben AmorApprentissage RB et Ontologies 2/8

Page 121: Apprentissage de la structure des réseaux bayésiens : état

Annexes

OOBN nodes

Simple nodes:

Reference nodes: for specification of input and output nodesonly.

Real nodes: represent variables.

SocioEconAge

RiskAversionDrivQuality

D:DriverSocioEcon

Age

HomeBase

VehicleYear MakeModelAntiTheftDrivQuality

CO:CarOwner

Age

MedCost

Insurance

DrivQuality MileageAntilock

AccidentA:Accident

Montassar Ben Messaoud, Mouna Ben Ishak, Philippe Leray, Nahla Ben AmorApprentissage RB et Ontologies 3/8

Page 122: Apprentissage de la structure des réseaux bayésiens : état

Annexes

OOBN links

Reference links: to link reference or real nodes to reference nodes.

SocioEconAge

RiskAversionDrivQuality

D:DriverSocioEcon

Age

HomeBase

VehicleYear MakeModelAntiTheftDrivQuality

CO:CarOwner

Age

MedCost

Insurance

DrivQuality MileageAntilock

AccidentA:Accident

Montassar Ben Messaoud, Mouna Ben Ishak, Philippe Leray, Nahla Ben AmorApprentissage RB et Ontologies 4/8

Page 123: Apprentissage de la structure des réseaux bayésiens : état

Annexes

OOBN links

Directed links: to link reference or real nodes to real nodes.

SocioEconAge

RiskAversionDrivQuality

D:DriverSocioEcon

Age

HomeBase

VehicleYear MakeModelAntiTheftDrivQuality

CO:CarOwner

Age

MedCost

Insurance

DrivQuality MileageAntilock

AccidentA:Accident

Montassar Ben Messaoud, Mouna Ben Ishak, Philippe Leray, Nahla Ben AmorApprentissage RB et Ontologies 5/8

Page 124: Apprentissage de la structure des réseaux bayésiens : état

Annexes

OOBN links

Construction links: to express that two nodes (or instantiations)are linked in some manner.

SocioEconAge

RiskAversionDrivQuality

D:DriverSocioEcon

Age

HomeBase

VehicleYear MakeModelAntiTheftDrivQuality

CO:CarOwner

Age

MedCost

Insurance

DrivQuality MileageAntilock

AccidentA:Accident

Montassar Ben Messaoud, Mouna Ben Ishak, Philippe Leray, Nahla Ben AmorApprentissage RB et Ontologies 6/8

Page 125: Apprentissage de la structure des réseaux bayésiens : état

Annexes

OOBN classes hierarchy

Classes may have subclasses.

Subclass inherits all the superclass nodes.Subclass may have additional nodes not yet represented in thesuperclass.

HomeBase AntiTheft CarValue

Theft

Theft

DrivQuality Mileage Antilock

Accident

Accident

Insurance_Services

ServiceDes

ServiceDes ServiceDes

Montassar Ben Messaoud, Mouna Ben Ishak, Philippe Leray, Nahla Ben AmorApprentissage RB et Ontologies 7/8

Page 126: Apprentissage de la structure des réseaux bayésiens : état

Annexes

Illustrative example : more details

MedCostThisCarDamThisCarCostPropCostILiCostOtherCarCost

Insurance AgeGoodStudenSocioEconHomeBaseAntiTheftVehicleYearMakeModelOtherCar

CarOwner

AccidentAccident

AirbagCushioningMileageCarValueRuggedAutoAntilock

CarTheft

Theft

concerns

DrivQualitySeniorTrainDrivingSkillDrivHistRiskAversion

Driver

has drivercharacteristics

repaiesrepaiesconcerns

Montassar Ben Messaoud, Mouna Ben Ishak, Philippe Leray, Nahla Ben AmorApprentissage RB et Ontologies 8/8

Page 127: Apprentissage de la structure des réseaux bayésiens : état

Annexes

Illustrative example : more details

Initialization step

Global_ Insurance

Insurance

Theft

Accident Driver

Car

CarOwner

Montassar Ben Messaoud, Mouna Ben Ishak, Philippe Leray, Nahla Ben AmorApprentissage RB et Ontologies 8/8

Page 128: Apprentissage de la structure des réseaux bayésiens : état

Annexes

Illustrative example : more details

MedCostThisCarDamThisCarCostPropCostILiCostOtherCarCost

Insurance AgeGoodStudenSocioEconHomeBaseAntiTheftVehicleYearMakeModelOtherCar

CarOwner

AccidentAccident

AirbagCushioningMileageCarValueRuggedAutoAntilock

CarTheft

Theft

concerns

DrivQualitySeniorTrainDrivingSkillDrivHistRiskAversion

Driver

has drivercharacteristics

repaiesrepaies

concerns

Montassar Ben Messaoud, Mouna Ben Ishak, Philippe Leray, Nahla Ben AmorApprentissage RB et Ontologies 8/8

Page 129: Apprentissage de la structure des réseaux bayésiens : état

Annexes

Illustrative example : more details

MedCost

ThisCarCostThisCarDam

PropCost

OtherCarCost

ILiCost

Insurance

Montassar Ben Messaoud, Mouna Ben Ishak, Philippe Leray, Nahla Ben AmorApprentissage RB et Ontologies 8/8

Page 130: Apprentissage de la structure des réseaux bayésiens : état

Annexes

Illustrative example : more details

MedCostThisCarDamThisCarCostPropCostILiCostOtherCarCost

Insurance AgeGoodStudenSocioEconHomeBaseAntiTheftVehicleYearMakeModelOtherCar

CarOwner

AccidentAccident

AirbagCushioningMileageCarValueRuggedAutoAntilock

CarTheft

Theft

concerns

DrivQualitySeniorTrainDrivingSkillDrivHistRiskAversion

Driver

has drivercharacteristics

repaiesrepaies

concerns

Montassar Ben Messaoud, Mouna Ben Ishak, Philippe Leray, Nahla Ben AmorApprentissage RB et Ontologies 8/8

Page 131: Apprentissage de la structure des réseaux bayésiens : état

Annexes

Illustrative example : more details

GoodStudenOtherCar

SocioEcon Age

AntiTheft

MakeModel

HomeBase

VehicleYear

CarOwner

Montassar Ben Messaoud, Mouna Ben Ishak, Philippe Leray, Nahla Ben AmorApprentissage RB et Ontologies 8/8

Page 132: Apprentissage de la structure des réseaux bayésiens : état

Annexes

Illustrative example : more details

MedCostThisCarDamThisCarCostPropCostILiCostOtherCarCost

Insurance AgeGoodStudenSocioEconHomeBaseAntiTheftVehicleYearMakeModelOtherCar

CarOwner

AccidentAccident

AirbagCushioningMileageCarValueRuggedAutoAntilock

CarTheft

Theft

concerns

DrivQualitySeniorTrainDrivingSkillDrivHistRiskAversion

Driver

has drivercharacteristics

repaiesrepaies

concerns

Montassar Ben Messaoud, Mouna Ben Ishak, Philippe Leray, Nahla Ben AmorApprentissage RB et Ontologies 8/8

Page 133: Apprentissage de la structure des réseaux bayésiens : état

Annexes

Illustrative example : more details

GoodStuden

OtherCar

SocioEcon

Age

AntiTheft

MakeModel

HomeBase

VehicleYear

CarOwner

SeniorTrain

D:Driver

DrivingSkill

RiskAversion DrivQuality

DrivHist

SeniorTrain DrivingSkill

RiskAversion DrivQuality

DrivHist

Montassar Ben Messaoud, Mouna Ben Ishak, Philippe Leray, Nahla Ben AmorApprentissage RB et Ontologies 8/8

Page 134: Apprentissage de la structure des réseaux bayésiens : état

Annexes

Illustrative example : more details

MedCostThisCarDamThisCarCostPropCostILiCostOtherCarCost

Insurance AgeGoodStudenSocioEconHomeBaseAntiTheftVehicleYearMakeModelOtherCar

CarOwner

AccidentAccident

AirbagCushioningMileageCarValueRuggedAutoAntilock

CarTheft

Theft

concerns

DrivQualitySeniorTrainDrivingSkillDrivHistRiskAversion

Driver

has drivercharacteristics

repaiesrepaies

concerns

Montassar Ben Messaoud, Mouna Ben Ishak, Philippe Leray, Nahla Ben AmorApprentissage RB et Ontologies 8/8

Page 135: Apprentissage de la structure des réseaux bayésiens : état

Annexes

Illustrative example : more details

GoodStuden

OtherCar

SocioEcon

Age

AntiTheft

MakeModel

HomeBase

VehicleYear

GoodStuden

OtherCar

SocioEcon

Age

AntiTheft

MakeModel

HomeBase

VehicleYear

CO:CarOwner

Insurance

SeniorTrain

D:Driver

DrivingSkill

RiskAversion DrivQuality

DrivHist

SeniorTrain DrivingSkill

RiskAversion DrivQuality

DrivHist

MedCost

ThisCarCostThisCarDam

PropCost

OtherCarCost

ILiCost

Montassar Ben Messaoud, Mouna Ben Ishak, Philippe Leray, Nahla Ben AmorApprentissage RB et Ontologies 8/8

Page 136: Apprentissage de la structure des réseaux bayésiens : état

Annexes

Illustrative example : more details

MedCostThisCarDamThisCarCostPropCostILiCostOtherCarCost

Insurance AgeGoodStudenSocioEconHomeBaseAntiTheftVehicleYearMakeModelOtherCar

CarOwner

AccidentAccident

AirbagCushioningMileageCarValueRuggedAutoAntilock

CarTheft

Theft

concerns

DrivQualitySeniorTrainDrivingSkillDrivHistRiskAversion

Driver

has drivercharacteristics

repaiesrepaies

concerns

Montassar Ben Messaoud, Mouna Ben Ishak, Philippe Leray, Nahla Ben AmorApprentissage RB et Ontologies 8/8

Page 137: Apprentissage de la structure des réseaux bayésiens : état

Annexes

Illustrative example : more details

MedCostThisCarDamThisCarCostPropCostILiCostOtherCarCost

Insurance AgeGoodStudenSocioEconHomeBaseAntiTheftVehicleYearMakeModelOtherCar

CarOwner

AccidentAccident

AirbagCushioningMileageCarValueRuggedAutoAntilock

CarTheft

Theft

concerns

DrivQualitySeniorTrainDrivingSkillDrivHistRiskAversion

Driver

has drivercharacteristics

repaiesrepaies

concerns

Montassar Ben Messaoud, Mouna Ben Ishak, Philippe Leray, Nahla Ben AmorApprentissage RB et Ontologies 8/8

Page 138: Apprentissage de la structure des réseaux bayésiens : état

Annexes

Illustrative example : more details

GoodStuden

OtherCar

SocioEcon

Age

AntiTheft

MakeModel

HomeBase

VehicleYear

GoodStuden

OtherCar

SocioEcon

Age

AntiTheft

MakeModel

HomeBase

VehicleYear

CO:CarOwner

Insurance

accident

A:Accident

accident

MedCost

ThisCarCostThisCarDam

PropCost

OtherCarCost

ILiCost

SeniorTrain

D:Driver

DrivingSkill

RiskAversion DrivQuality

DrivHist

SeniorTrain DrivingSkill

RiskAversion DrivQuality

DrivHist

Montassar Ben Messaoud, Mouna Ben Ishak, Philippe Leray, Nahla Ben AmorApprentissage RB et Ontologies 8/8

Page 139: Apprentissage de la structure des réseaux bayésiens : état

Annexes

Illustrative example : more details

MedCostThisCarDamThisCarCostPropCostILiCostOtherCarCost

Insurance AgeGoodStudenSocioEconHomeBaseAntiTheftVehicleYearMakeModelOtherCar

CarOwner

AccidentAccident

AirbagCushioningMileageCarValueRuggedAutoAntilock

CarTheft

Theft

concerns

DrivQualitySeniorTrainDrivingSkillDrivHistRiskAversion

Driver

has drivercharacteristics

repaiesrepaies

concerns

Montassar Ben Messaoud, Mouna Ben Ishak, Philippe Leray, Nahla Ben AmorApprentissage RB et Ontologies 8/8

Page 140: Apprentissage de la structure des réseaux bayésiens : état

Annexes

Illustrative example : more details

MedCostThisCarDamThisCarCostPropCostILiCostOtherCarCost

Insurance AgeGoodStudenSocioEconHomeBaseAntiTheftVehicleYearMakeModelOtherCar

CarOwner

AccidentAccident

AirbagCushioningMileageCarValueRuggedAutoAntilock

CarTheft

Theft

concerns

DrivQualitySeniorTrainDrivingSkillDrivHistRiskAversion

Driver

has drivercharacteristics

repaiesrepaies

concerns

Montassar Ben Messaoud, Mouna Ben Ishak, Philippe Leray, Nahla Ben AmorApprentissage RB et Ontologies 8/8

Page 141: Apprentissage de la structure des réseaux bayésiens : état

Annexes

Illustrative example : more details

GoodStuden

OtherCar

SocioEcon

Age

AntiTheft

MakeModel

HomeBase

VehicleYear

GoodStuden

OtherCar

SocioEcon

Age

AntiTheft

MakeModel

HomeBase

VehicleYear

CO:CarOwner

Insurance

accident

A:Accident

accident

C:Car

Airbag Cushioning Mileage

CarValue RuggedAuto Antilock

Airbag Cushioning Mileage

CarValue RuggedAuto Antilock

MedCost

ThisCarCostThisCarDam

PropCost

OtherCarCost

ILiCost

SeniorTrain

D:Driver

DrivingSkill

RiskAversion DrivQuality

DrivHist

SeniorTrain DrivingSkill

RiskAversion DrivQuality

DrivHist

Montassar Ben Messaoud, Mouna Ben Ishak, Philippe Leray, Nahla Ben AmorApprentissage RB et Ontologies 8/8

Page 142: Apprentissage de la structure des réseaux bayésiens : état

Annexes

Illustrative example : more details

MedCostThisCarDamThisCarCostPropCostILiCostOtherCarCost

Insurance AgeGoodStudenSocioEconHomeBaseAntiTheftVehicleYearMakeModelOtherCar

CarOwner

AccidentAccident

AirbagCushioningMileageCarValueRuggedAutoAntilock

CarTheft

Theft

concerns

DrivQualitySeniorTrainDrivingSkillDrivHistRiskAversion

Driver

has drivercharacteristics

repaiesrepaies

concerns

Montassar Ben Messaoud, Mouna Ben Ishak, Philippe Leray, Nahla Ben AmorApprentissage RB et Ontologies 8/8

Page 143: Apprentissage de la structure des réseaux bayésiens : état

Annexes

Illustrative example : more details

MedCostThisCarDamThisCarCostPropCostILiCostOtherCarCost

Insurance AgeGoodStudenSocioEconHomeBaseAntiTheftVehicleYearMakeModelOtherCar

CarOwner

AccidentAccident

AirbagCushioningMileageCarValueRuggedAutoAntilock

CarTheft

Theft

concerns

DrivQualitySeniorTrainDrivingSkillDrivHistRiskAversion

Driver

has drivercharacteristics

repaiesrepaies

concerns

Montassar Ben Messaoud, Mouna Ben Ishak, Philippe Leray, Nahla Ben AmorApprentissage RB et Ontologies 8/8

Page 144: Apprentissage de la structure des réseaux bayésiens : état

Annexes

Illustrative example : more details

GoodStuden

OtherCar

SocioEcon

Age

AntiTheft

MakeModel

HomeBase

VehicleYear

GoodStuden

OtherCar

SocioEcon

Age

AntiTheft

MakeModel

HomeBase

VehicleYear

CO:CarOwner

Insurance

SeniorTrain

D:Driver

DrivingSkill

RiskAversion DrivQuality

DrivHist

SeniorTrain DrivingSkill

RiskAversion DrivQuality

DrivHist

accident

A:Accident

accident

C:Car

Airbag Cushioning Mileage

CarValue RuggedAuto Antilock

Airbag Cushioning Mileage

CarValue RuggedAuto Antilock

Theft

T: Theft

Theft

MedCost

ThisCarCostThisCarDam

PropCost

OtherCarCost

ILiCost

Montassar Ben Messaoud, Mouna Ben Ishak, Philippe Leray, Nahla Ben AmorApprentissage RB et Ontologies 8/8

Page 145: Apprentissage de la structure des réseaux bayésiens : état

Annexes

Illustrative example : more details

MedCostThisCarDamThisCarCostPropCostILiCostOtherCarCost

Insurance AgeGoodStudenSocioEconHomeBaseAntiTheftVehicleYearMakeModelOtherCar

CarOwner

AccidentAccident

AirbagCushioningMileageCarValueRuggedAutoAntilock

CarTheft

Theft

concerns

DrivQualitySeniorTrainDrivingSkillDrivHistRiskAversion

Driver

has drivercharacteristics

repaiesrepaies

concerns

Montassar Ben Messaoud, Mouna Ben Ishak, Philippe Leray, Nahla Ben AmorApprentissage RB et Ontologies 8/8

Page 146: Apprentissage de la structure des réseaux bayésiens : état

Annexes

Illustrative example : more details

MedCostThisCarDamThisCarCostPropCostILiCostOtherCarCost

Insurance AgeGoodStudenSocioEconHomeBaseAntiTheftVehicleYearMakeModelOtherCar

CarOwner

AccidentAccident

AirbagCushioningMileageCarValueRuggedAutoAntilock

CarTheft

Theft

concerns

DrivQualitySeniorTrainDrivingSkillDrivHistRiskAversion

Driver

has drivercharacteristics

repaiesrepaies

concerns

Montassar Ben Messaoud, Mouna Ben Ishak, Philippe Leray, Nahla Ben AmorApprentissage RB et Ontologies 8/8

Page 147: Apprentissage de la structure des réseaux bayésiens : état

Annexes

Illustrative example : more details

MedCost

ThisCarCostThisCarDam

PropCost

OtherCarCost

ILiCost

GoodStuden

OtherCar

SocioEcon

Age

AntiTheft

MakeModel

HomeBase

VehicleYear

GoodStuden

OtherCar

SocioEcon

Age

AntiTheft

MakeModel

HomeBase

VehicleYear

CO:CarOwner

Global_Insurance

accident

A:Accident

accident

C:Car

Airbag Cushioning Mileage

CarValue RuggedAuto Antilock

Airbag Cushioning Mileage

CarValue RuggedAuto Antilock

Theft

T: Theft

Theft

I:Insurance

SeniorTrain

D:Driver

DrivingSkill

RiskAversion DrivQuality

DrivHist

SeniorTrain DrivingSkill

RiskAversion DrivQuality

DrivHist

Montassar Ben Messaoud, Mouna Ben Ishak, Philippe Leray, Nahla Ben AmorApprentissage RB et Ontologies 8/8