Causality in News Event Prediction

 
Kira Radinsky, Sagie Davidovich, Shaul Markovitch
Technion - Israel Institute of Technology
 
Learning Causality for News Events Prediction
 
What is Prediction?

“…a rigorous, often quantitative, statement, forecasting what will happen under specific conditions.” [Wikipedia]

“A description of what one thinks will take place in the future, based on previous knowledge.” [Online Dictionary]
 
Why is News Event Prediction Important?
Strategic Intelligence
Strategic planning
Strategic planning
Financial investments
 
Outline
Motivation
Problem definition
Solution
Representation
Algorithm
Evaluation
 
Problem Definition: Events Prediction
 
Ev is a set of events
T is discrete representation of time
 
 
Causality Mining Process: Overview
 
 
Modeling an Event

Event & Causality Representation
[Figure: event representation and causality representation along a time axis. The event “US Army bombs a weapon warehouse in Kabul with missiles” is linked by “Caused” to the event “5 Afghan troops were killed”.]
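
The event and causality representation above can be sketched as a small data structure. This is a minimal illustration, not the authors' implementation: the class names `Event` and `CausalityPair`, the field names, and the reading of "1/2/1987" as January 2 are all assumptions made for the example. The role names (action, actor, theme, instrument, location, attribute, quantifier, time-frame) follow the labels on the slide.

```python
from dataclasses import dataclass
from datetime import datetime, timedelta
from typing import Optional


@dataclass(frozen=True)
class Event:
    """One canonical event: an action, its thematic roles, and a time frame."""
    action: str
    actor: Optional[str] = None
    theme: Optional[str] = None        # the object acted upon
    instrument: Optional[str] = None
    location: Optional[str] = None
    attributes: tuple = ()             # e.g. quantifier and nationality of the theme
    start: Optional[datetime] = None
    duration: Optional[timedelta] = None


@dataclass(frozen=True)
class CausalityPair:
    """A (cause, effect) pair of events."""
    cause: Event
    effect: Event

    def is_temporally_valid(self) -> bool:
        """An effect should not start before its cause (unknown times pass)."""
        if self.cause.start is None or self.effect.start is None:
            return True
        return self.effect.start >= self.cause.start


# Assumption: "1/2/1987" is read as month/day/year.
bombing = Event(
    action="bomb", actor="US Army", theme="weapon warehouse",
    instrument="missiles", location="Kabul",
    start=datetime(1987, 1, 2, 11, 0), duration=timedelta(hours=2),
)
killing = Event(
    action="kill", theme="troops", attributes=("5", "Afghan"),
    start=datetime(1987, 1, 2, 11, 15), duration=timedelta(hours=3),
)
pair = CausalityPair(cause=bombing, effect=killing)
print(pair.is_temporally_valid())
```

Keeping every role optional reflects that extracted events are often incomplete; a comparison between events can then skip roles that are missing.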
 
 
Machine Learning Problem Definition
 
 
Algorithm Outline

Learning Phase
1. Generalize events
2. Causality prediction rule generation

Prediction Phase
1. Finding similar generalized event
2. Application of causality prediction rule
 
 
Algorithm Outline

Learning Phase
1. Generalize events
   1. How do we generalize objects?
   2. How do we generalize actions?
   3. How do we generalize an event?
2. Causality prediction rule generation
 
Generalizing Objects
[Figure: linked-data properties of the entity “Russian Federation”]
Name official English: the Russian Federation
ISO3 Code: RUS
FAOSTAT code: 185
DBPedia ID: Russia
Currency Name: Rouble (Rub)
UN Code: 643
Is in group: Eastern Europe
Land border: China
Is successor of / Is predecessor of: USSR
Ontology – Linked data
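
A rough sketch of how linked-data properties can drive object generalization: two entities generalize to whatever (relation, value) pairs they share. The triple store below is a toy stand-in for a real ontology. The Russia triples mirror the figure; the `rdf:type` assignments for Kabul, Baghdad, and the two facilities are one reading of the later event diagrams; the function names are hypothetical.

```python
from collections import defaultdict

# Toy triples (subject, relation, object); a real system would query linked data.
TRIPLES = [
    ("Russia", "is_in_group", "Eastern Europe"),
    ("Russia", "land_border", "China"),
    ("Russia", "is_successor_of", "USSR"),
    ("Kabul", "rdf:type", "City"),
    ("Baghdad", "rdf:type", "City"),
    ("weapon warehouse", "rdf:type", "Military facility"),
    ("army base", "rdf:type", "Military facility"),
]


def build_index(triples):
    """Map each subject to the set of (relation, object) pairs it has."""
    index = defaultdict(set)
    for subject, relation, obj in triples:
        index[subject].add((relation, obj))
    return index


INDEX = build_index(TRIPLES)


def generalizations(entity):
    """All (relation, object) pairs known for an entity."""
    return set(INDEX.get(entity, set()))


def common_generalizations(a, b):
    """Pairs shared by both entities: candidate generalizations."""
    return generalizations(a) & generalizations(b)


print(common_generalizations("Kabul", "Baghdad"))
print(common_generalizations("weapon warehouse", "army base"))
```

Entities with no shared pair (for example Kabul and Russia in this toy data) simply have no common generalization.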
 
Generalizing Actions

Levin classes (Levin 1993) – 270 classes
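
As an illustration of action generalization, verbs can be mapped to a shared verb class and treated as similar when the classes match. The sketch below uses a hand-written dictionary rather than a real VerbNet or Levin-class resource. Only the `hit-18.1` members come from the slides; the second class and all function names are hypothetical.

```python
# Toy verb-class table. "hit-18.1" members are the ones listed on the slide;
# "toy-flee" is a made-up class added only to show a non-match.
VERB_CLASSES = {
    "hit-18.1": {"bang", "bash", "hit", "kick"},
    "toy-flee": {"flee", "escape"},
}

# Invert the table: verb -> class name.
VERB_TO_CLASS = {
    verb: cls for cls, members in VERB_CLASSES.items() for verb in members
}


def generalize_action(verb):
    """Return the class of a base-form verb, or None if it is unknown."""
    return VERB_TO_CLASS.get(verb.lower())


def same_class(verb_a, verb_b):
    """Two verbs generalize together when they share a known class."""
    class_a = generalize_action(verb_a)
    return class_a is not None and class_a == generalize_action(verb_b)


print(generalize_action("Kick"))
print(same_class("hit", "bash"))
print(same_class("hit", "flee"))
```

The lookup assumes verbs were already reduced to base form, as the mining pipeline's normalization step suggests.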
 
Generalizing Events: Putting it all together
[Figure: the present event “NATO strikes an army base in Baghdad” is aligned with the past event “US Army bombs a weapon warehouse in Kabul with missiles”. Node labels: US, Country, Army, City, Military facility. Edge label: rdf:type.]
Generalizing Events: HAC algorithm
 
Generalizing Events: Event distance metric
[Figure: the same alignment of the present event “NATO strikes an army base in Baghdad” with the past event “US Army bombs a weapon warehouse in Kabul with missiles”. Node labels: US, Country, Army, City, Military facility. Edge label: rdf:type.]
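
The slides name an event distance metric and a HAC (hierarchical agglomerative clustering) step but the extracted text does not give their formulas. The following is therefore only a sketch under stated assumptions: the per-role distance (0 for identical values, 0.5 for values with the same parent concept, 1 otherwise), the average-link criterion, the stopping threshold, and the `PARENT` table are all invented for illustration.

```python
from itertools import combinations

ROLES = ("actor", "action", "theme", "location")

# Hypothetical one-level generalization table (value -> parent concept).
PARENT = {
    "US Army": "military force", "NATO": "military force",
    "bombs": "hit", "strikes": "hit",
    "weapon warehouse": "Military facility", "army base": "Military facility",
    "Kabul": "City", "Baghdad": "City",
}


def role_distance(a, b):
    """0 if equal, 0.5 if both share a parent concept, 1 otherwise."""
    if a == b:
        return 0.0
    if a in PARENT and PARENT.get(a) == PARENT.get(b):
        return 0.5
    return 1.0


def event_distance(e1, e2):
    """Mean role distance over a fixed set of roles."""
    return sum(role_distance(e1.get(r), e2.get(r)) for r in ROLES) / len(ROLES)


def hac(events, threshold):
    """Average-link agglomerative clustering; stop above the threshold."""
    clusters = [[i] for i in range(len(events))]

    def linkage(c1, c2):
        pairs = [(i, j) for i in c1 for j in c2]
        return sum(event_distance(events[i], events[j]) for i, j in pairs) / len(pairs)

    while len(clusters) > 1:
        (a, b), best = min(
            (((i, j), linkage(clusters[i], clusters[j]))
             for i, j in combinations(range(len(clusters)), 2)),
            key=lambda item: item[1],
        )
        if best > threshold:
            break
        clusters[a] = clusters[a] + clusters[b]  # a < b, so deleting b is safe
        del clusters[b]
    return clusters


events = [
    {"actor": "US Army", "action": "bombs", "theme": "weapon warehouse", "location": "Kabul"},
    {"actor": "NATO", "action": "strikes", "theme": "army base", "location": "Baghdad"},
    {"actor": "China", "action": "overtakes", "theme": "Germany", "location": None},
]
print(event_distance(events[0], events[1]))
print(hac(events, threshold=0.6))
```

With this toy data the two military events merge into one cluster, and the export event stays on its own.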
 
Algorithm Outline

Prediction Rule Generation
[Figure: along the time axis, the cause event “US Army bombs a weapon warehouse in Kabul with missiles” is linked by “Caused” to the effect event “5 Afghan troops were killed”. Ontology links labeled Type, Country, and Nationality connect the entities; Kabul leads to Afghanistan and Afghan.]
Generated rule:
Effect Theme Attribute = Cause Location Country Nationality
Effect Action = kill
Effect Theme = Troops
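
One way to read the rule above: each role of the effect is either a constant (kill, Troops) or a value reachable from some role of the cause by following ontology relations (Location, then Country, then Nationality). The sketch below generates such a rule from a single cause/effect pair. The tiny `ONTOLOGY` table, the breadth-first path search, and the rule encoding are assumptions for illustration, not the method described in the slides.

```python
# Toy ontology: (entity, relation) -> entity. Hypothetical data.
ONTOLOGY = {
    ("Kabul", "country"): "Afghanistan",
    ("Afghanistan", "nationality"): "Afghan",
}


def find_path(start, target, max_depth=2):
    """Breadth-first search for a relation path from start to target."""
    frontier = [(start, ())]
    for _ in range(max_depth):
        next_frontier = []
        for node, path in frontier:
            for (subject, relation), obj in ONTOLOGY.items():
                if subject == node:
                    new_path = path + (relation,)
                    if obj == target:
                        return new_path
                    next_frontier.append((obj, new_path))
        frontier = next_frontier
    return None


def generate_rule(cause, effect):
    """Express each effect role as a constant or as a path from a cause role."""
    rule = {}
    for effect_role, effect_value in effect.items():
        rule[effect_role] = ("const", effect_value)  # default: copy the value
        for cause_role, cause_value in cause.items():
            path = find_path(cause_value, effect_value)
            if path is not None:
                rule[effect_role] = ("path", cause_role, path)
                break
    return rule


cause = {"actor": "US Army", "action": "bomb", "theme": "weapon warehouse",
         "location": "Kabul", "instrument": "missiles"}
effect = {"action": "kill", "theme": "troops", "theme_attribute": "Afghan"}

rule = generate_rule(cause, effect)
print(rule)
```

The resulting rule keeps `kill` and `troops` as constants and stores the nationality as a path from the cause location, which is what lets it transfer to events in other places.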
 
Algorithm Outline

Prediction Phase
1. Finding similar generalized event
2. Application of causality prediction rule
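Finding Similar Generalized Event
[Figure: the input event “Baghdad bombing” is compared against generalized events; the similarity scores shown are 0.2, 0.3, 0.7, 0.8, 0.75, 0.2, 0.65, and 0.1.]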
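Prediction Rule Application
[Figure: along the time axis, an input event (Actor1, action “bomb”, Theme1, Location1, Instrument1) is linked by “Caused” to the predicted effect event (action “kill”, theme “Troops”).]
Applied rule:
Effect Theme Attribute = Cause Location Country Nationality
Effect Action = kill
Effect Theme = Troops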
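
Applying the rule to a new input event can be sketched as follows: constants are copied, and path-valued roles are resolved by walking the ontology from the matching role of the input event. The rule encoding, the `ONTOLOGY` entries, and the function names are hypothetical; the Baghdad example follows the “Baghdad bombing” input named on the earlier slide.

```python
# Toy ontology: (entity, relation) -> entity. Hypothetical data.
ONTOLOGY = {
    ("Kabul", "country"): "Afghanistan",
    ("Afghanistan", "nationality"): "Afghan",
    ("Baghdad", "country"): "Iraq",
    ("Iraq", "nationality"): "Iraqi",
}

# Rule in an assumed encoding: role -> ("const", value) or ("path", cause_role, relations).
RULE = {
    "action": ("const", "kill"),
    "theme": ("const", "troops"),
    "theme_attribute": ("path", "location", ("country", "nationality")),
}


def follow(start, path):
    """Walk a chain of relations from start; None if any step is missing."""
    node = start
    for relation in path:
        node = ONTOLOGY.get((node, relation))
        if node is None:
            return None
    return node


def apply_rule(rule, event):
    """Build the predicted effect event for a given input (cause) event."""
    predicted = {}
    for role, spec in rule.items():
        if spec[0] == "const":
            predicted[role] = spec[1]
        else:
            _, cause_role, path = spec
            predicted[role] = follow(event.get(cause_role), path)
    return predicted


new_event = {"actor": "NATO", "action": "bomb", "theme": "army base",
             "location": "Baghdad"}
print(apply_rule(RULE, new_event))
```

If the ontology lacks a link for the input location, the path-valued role comes back as `None` instead of a wrong guess.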
 
 
Prediction Evaluation

Human Group 1: Mark events E that can cause other events.
Human Group 2: Given a random sample of events from E, the predictions, and the time of the events, search the web and estimate the prediction accuracy.
 
Prediction Accuracy Results
 
Causality Evaluation

Human Group 1: Mark a set of test events E for the other two human groups and the algorithm.
Human Group 2: Given a random sample of events from E, state what you think would happen following each event.
Human Group 3: Given the algorithm's predictions and the human (2nd group) predictions, evaluate the quality of the predictions.
 
Causality Results
 
The results are statistically significant
 
Accuracy of Extraction
 
Extraction Evaluation
 
Entity Ontology Matching
 
Related work

Causality Information Extraction
Goal: Extract causality relations from a text
Techniques:
1. Usage of handcrafted domain-specific patterns [Kaplan and Berry-Rogghe, 1991]
2. Usage of handcrafted linguistic patterns [Garcia 1997], [Khoo, Chan, & Niu 2000], [Girju & Moldovan 2002]
3. Semi-supervised pattern learning approaches, based on text features [Blanco, Castell, & Moldovan 2008], [Sil & Huang & Yates 2010]
4. Supervised pattern learning approaches based on text features [Riloff 1996], [Riloff & Jones 1999], [Agichtein & Gravano, 2000; Lin & Pantel, 2001]
 
Related work

Temporal Information Extraction
Goal: Predicting the temporal order of events or time expressions described in text
Technique: learn classifiers that predict the temporal order of a pair of events based on predefined features of the pair.
[Ling & Weld, 2010; Mani, Schiffman, & Zhang, 2003; Lapata & Lascarides, 2006; Chambers, Wang, & Jurafsky, 2007; Tatu & Srikanth, 2008; Yoshikawa, Riedel, Asahara, & Matsumoto, 2009]

Future work
 
Going beyond human tagged examples
Incorporating time into the equation
When will correlation mean causality?
Using other sources than news
Incorporating real time data (Twitter, Facebook)
Incorporating numerical data (Stocks, Weather, Forex)
Can we predict general facts?
Can a machine predict better than an expert?
 
Summary
 
Canonical event representation
Machine learning algorithm for events prediction
Leveraging world knowledge for generalization
Using text as human tagged examples
Causality mining from text
Contribution to machine common-sense understanding
 
 
 
“The best way to predict the future is to invent it” [Alan Kay]

This presentation covers why predicting news events matters and how causality mining supports forecasting. It walks through the problem definition, the solution, event representation, the learning algorithm, and evaluation, with emphasis on events, time representation, the prediction function, and the extraction of causality relations from news articles for state inference.

  • Causality
  • News Event Prediction
  • Event Mining
  • Forecasting
  • Information Extraction

Uploaded on Sep 20, 2024 | 0 Views



Presentation Transcript


  1. Learning Causality for News Events Prediction Kira Radinsky, Sagie Davidovich, Shaul Markovitch Technion - Israel Institute of Technology

  2. What is Prediction? a rigorous, often quantitative, statement, forecasting what will happen under specific conditions. [Wikipedia] A description of what one thinks will take place in the future, based on previous knowledge. [Online Dictionary]

  3. Why is News Event Prediction Important?
     Event | Predicted event (Pundit) | Use
     Al-Qaida demands hostage exchange | A country will refuse the demand | Strategic Intelligence
     Volcano erupts in Democratic Republic of Congo | Thousands of people flee from Congo | Strategic planning
     7.0 magnitude earthquake strikes Haitian coast | Tsunami-warning is issued | Strategic planning
     China overtakes Germany as world's biggest exporter | Wheat price will fall | Financial investments

  4. Outline Motivation Problem definition Solution Representation Algorithm Evaluation

  5. Problem Definition: Events Prediction Ev is a set of events T is discrete representation of time Prediction Function , s.t.: occurred at time occurred at time

  6. Outline Motivation Problem definition Solution Representation Algorithm Evaluation

  7. Causality Mining Process: Overview. News articles acquisition: crawling [NYT 1851-2009]. Modeling & normalization. Causality pattern classification: <Pattern, Constraint, Confidence>. Event extraction: tagging, dependency parsing (Stanford parser). Causality relations extraction. Context inference. Thematic roles normalized; base forms; URIs assignment (contextual disambiguation). Thematic roles assignment, based on VerbNet. Index. Causality graph building: built on 20 machines, 300 million nodes, 1 billion edges, 13 million news articles in total. State inference.

  8. Outline Motivation Problem definition Solution Representation Algorithm Evaluation

  9. Modeling an Event. Comparison between events (canonical): 1. (Lexicon & Syntax) Language and wording independent; 2. (Semantic) Non-ambiguous. Generalization / abstraction. Reasoning. Many philosophies: Property Exemplification of Events theory (Kim 1993); Conceptual Dependency theory (Schank 1972).

  10. Event & Causality Representation. [Figure, along a time axis. Event1, “US Army bombs a weapon warehouse in Kabul with missiles”: actor US Army, action bombs, theme Weapon warehouse, location Kabul, instrument Missiles, time-frame 1/2/1987 11:00AM +(2h). Event2, “5 Afghan troops were killed”: action kill, theme Troops, attribute Afghan, quantifier 5, time-frame 1/2/1987 11:15AM +(3h).]

  11. Outline Motivation Problem definition Solution Representation Algorithm Causality Mining Process Evaluation

  12. Machine Learning Problem Definition Goal function: Learning algorithm receives a set of examples and produces a hypothesis which is good approximation of

  13. Algorithm Outline Learning Phase 1. Generalize events 2. Causality prediction rule generation Prediction Phase 1. Finding similar generalized event 2. Application of causality prediction rule

  14. Algorithm Outline Learning Phase 1. Generalize events 1. How do we generalize objects? 2. How do we generalize actions? 3. How do we generalize an event? 2. Causality prediction rule generation

  15. Generalizing Objects. [Figure: linked-data values attached to the entity “Russian Federation”: the Russian Federation, RUS, 185, Russia, Rouble (Rub), 643, Eastern Europe, China (Land border), USSR.]

  16. Ontology Linked data

  17. Generalizing Actions. Levin classes (Levin 1993), 270 classes.
      Class Hit-18.1
      Roles and Restrictions: Agent[+int_control], Patient[+concrete], Instrument[+concrete]
      Members: bang, bash, hit, kick, ...
      Frames:
      Name: Basic Transitive
      Example: Paula hit the ball
      Syntax: Agent V Patient
      Semantics: cause(Agent, E) manner(during(E), directedmotion, Agent) !contact(during(E), Agent, Patient) manner(end(E), forceful, Agent) contact(end(E), Agent, Patient)

  18. Generalizing Events: Putting it all together. [Figure: the present event “NATO strikes an army base in Baghdad” (time-frame 1/2/1987 11:00AM +(2h)) is aligned with the past event “US Army bombs a weapon warehouse in Kabul with missiles” (time-frame 1/2/1987 11:00AM +(2h)). Node labels: US, Country, Army, Military facility, City. Edge labels: Action, Location, Instrument, Similar verb, rdf:type.] Generalization rule: Actor: [state of Nato]; Property: [Hit1.1]; Theme: [Military facility]; Location: [Arab City].

  19. Generalizing Events: HAC algorithm

  20. Generalizing Events: Event distance metric. [Figure: the present event “NATO strikes an army base in Baghdad” (time-frame 1/2/1987 11:00AM +(2h)) is aligned with the past event “US Army bombs a weapon warehouse in Kabul with missiles” (time-frame 1/2/1987 11:00AM +(2h)). Node labels: US, Country, Army, Military facility, City. Edge labels: Action, Location, Instrument, Similar verb, rdf:type.]

  21. Algorithm Outline Learning Phase 1. Generalize events 2. Causality prediction rule generation

  22. Prediction Rule Generation. [Figure, along a time axis. Cause event, “US Army bombs a weapon warehouse in Kabul with missiles”: time-frame 1/2/1987 11:00AM +(2h). Effect event, “5 Afghan troops were killed”: action kill, theme Troops, attribute Afghan, quantifier 5, time-frame 1/2/1987 11:15AM +(3h). Ontology links labeled Type, Country, and Nationality connect the entities; Kabul leads to Afghanistan and Afghan.] Generated rule: Effect Theme Attribute = Cause Location Country Nationality; Effect Action = kill; Effect Theme = Troops.

  23. Algorithm Outline Prediction Phase 1. Finding similar generalized event 2. Application of causality prediction rule

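  24. Finding Similar Generalized Event. [Figure: the input event “Baghdad bombing” is compared against generalized events; the similarity scores shown are 0.2, 0.7, 0.3, 0.8, 0.65, 0.2, 0.1, and 0.75.]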

  25. Prediction Rule Application. [Figure, along a time axis. Input event at time-frame T1: Actor1, action bomb, Theme1, Location1, Instrument1. Predicted effect event at time-frame T1+: action kill, theme Troops, attribute resolved through Location, Country, Nationality.] Applied rule: Effect Theme Attribute = Cause Location Country Nationality; Effect Action = kill; Effect Theme = Troops.

  26. Outline Motivation Problem definition Solution Representation Algorithm Evaluation

  27. Prediction Evaluation Human Group 1: Mark events E that can cause other events. Human Group 2: Given: Random sample of events from E , predictions and time of events Search the web and give estimation on the prediction accuracy

  28. Prediction Accuracy Results
      | Highly certain | Certain
      Algorithm | 0.58 | 0.49
      Humans | 0.4 | 0.38

  29. Causality Evaluation. Human Group 1: Mark a set of test events E for the other two human groups and the algorithm. Human Group 2: Given a random sample of events from E, state what you think would happen following each event. Human Group 3: Given the algorithm's predictions and the human (2nd group) predictions, evaluate the quality of the predictions.

  30. Causality Results
      | [0,1) | [1,2) | [2,3) | [3,4] | Avg. Rank | Avg. Accuracy
      Algorithm | 0 | 2 | 19 | 29 | 3.08 | 77%
      Humans | 0 | 3 | 24 | 23 | 2.86 | 72%
      The results are statistically significant.

  31. Event | Predicted event (Human) | Predicted event (Pundit)
      Al-Qaida demands hostage exchange | Al-Qaida exchanges hostage | A country will refuse the demand
      Volcano erupts in Democratic Republic of Congo | Scientists in Republic of Congo investigate lava beds | Thousands of people flee from Congo
      7.0 magnitude earthquake strikes Haitian coast | Tsunami in Haiti affects coast | Tsunami-warning is issued
      2 Palestinians reportedly shot dead by Israeli troops | Israeli citizens protest against Palestinian leaders | War will be waged
      Professor of Tehran University killed in bombing | Tehran students remember slain professor in memorial service | Professor funeral will be held
      Alleged drug kingpin arrested in Mexico | Mafia kills people with guns in town | Kingpin will be sent to prison
      UK bans Islamist group | Islamist group would adopt another name in the UK | Group will grow
      China overtakes Germany as world's biggest exporter | German officials suspend tariffs | Wheat price will fall

  32. Accuracy of Extraction
      Extraction Evaluation
      Action | Actor | Object | Instrument | Location | Time
      93% | 74% | 76% | 79% | 79% | 100%
      Entity Ontology Matching
      Actor | Object | Instrument | Location
      84% | 83% | 79% | 89%

  33. Related work Causality Information Extraction Goal: Extract causality relations from a text Techniques: 1. Usage of handcrafted domain-specific patterns [Kaplan and Berry-Rogghe, 1991] 2. Usage of handcrafted linguistic patterns [Garcia 1997],[Khoo, Chan, &Niu 2000], [Girju &Moldovan 2002] 3. Semi-Supervised pattern learning approaches, based on text features [Blanco, Castell, &Moldovan 2008], [Sil & Huang & Yates 2010] 4. Supervised pattern learning approaches based on text features [Riloff 1996],[Riloff & Jones 1999], [Agichtein & Gravano, 2000; Lin & Pantel, 2001]

  34. Related work Temporal Information Extraction Goal: Predicting the temporal order of events or time expressions described in text Technique: learn classifiers that predict a temporal order of a pair of events based on a predefined features of the pair. [Ling & Weld, 2010; Mani, Schiffman, & Zhang, 2003; Lapata & Lascarides,2006; Chambers, Wang, & Jurafsky, 2007; Tatu & Srikanth, 2008; Yoshikawa, Riedel, Asahara, & Matsumoto, 2009]

  35. Future work Going beyond human tagged examples Incorporating time into the equation When will correlation mean causality? Using other sources than news Incorporating real time data (Twitter, Facebook) Incorporating numerical data (Stocks, Weather, Forex) Can we predict general facts? Can a machine predict better than an expert?

  36. Summary Canonical event representation Machine learning algorithm for events prediction Leveraging world knowledge for generalization Using text as human tagged examples Causality mining from text Contribution to machine common-sense understanding The best way to predict the future is to invent it [Alan Kay]
