SLIDE 1
Argument Mining: the Bottleneck of Knowledge and Reasoning Patrick - - PowerPoint PPT Presentation
Argument Mining: the Bottleneck of Knowledge and Reasoning Patrick - - PowerPoint PPT Presentation
Argument Mining: the Bottleneck of Knowledge and Reasoning Patrick Saint-Dizier IRIT - CNRS, Toulouse, France. stdizier@irit.fr April 14, 2016 The Relatedness Problem Given a controversial issue: argument mining from texts besides
SLIDE 2
SLIDE 3
Problem Analysis and Research questions
◮ Corpus construction: issues + arguments found in various texts ◮ How to tag arguments to characterize the need of knowledge
and reasoning ?
◮ How to categorize the knowledge involved ? ◮ How to pair NLP with KR for Argument mining ? ◮ Knowledge-driven argument mining: how to account for the
diversity of arguments w.r.t. an issue ?
◮ The Qualia of the Generative Lexicon: an useful lexical and
knowledge representation for argument mining?
◮ Case studies: knowledge vs. reasoning ?
SLIDE 4
Corpus Construction
Issue Corpus size
- nb. of arguments
(1) Ebola vaccination 16 texts, 50 is necessary 8300 words (2) Women’s condition has 9 texts, 24 improved in India 4600 words (3) The development of nu- 7 texts, 31 clear plants is necessary 5800 words (4) Organic agriculture 19 texts, 17 is the future 5800 words Total 51 texts, 122 24500 words
SLIDE 5
Arguments and argument compounds
⇒ Arguments seldom come in isolation. ⇒ They are often articulated within a context that indicates e.g.: circumstances, restrictions, illustrations, concessions, comparisons, purposes, and various forms of elaborations. ⇒ We call such a form an argument compound, where the argument is the kernel: —-> allows for a larger diversity of arguments.
SLIDE 6
Corpus Tagging
The following tags have been identified, but need to be further elaborated:
- 1. the text span involved that delimits the argument compound
and its kernel,
- 2. the polarity of the argument w.r.t. the issue: support, attack,
argumentative concession or contrast.
- 3. the conceptual relation(s) with the issue,
- 4. the knowledge involved, to identify the argument: list of the
main concepts used,
- 5. the a priori strength of the argument,
- 6. the discourse structures associated with the argument kernel.
SLIDE 7
Illustration for issue (1)
<argument nb= 11, polarity= attack with concession , relation To Issue= limited proofs of efficiency and safety of vaccination, concepts Involved= efficiency measures, safety measures, test and evaluation methods, strength= moderate > <concession> Even if the vaccine seems 100% efficient and without any side effects on the tested population, < /concession> <main arg> it is necessary to wait for more conclusive data before making large vaccination campaigns. < /main arg> <elaboration> The national authority of Guinea has approved the continuation of the tests on targeted populations. </elaboration> < /argument>.
SLIDE 8
Evidence for knowledge for argument mining
Need of knowledge: total nb of arguments / nb of those that require knowledge.
Issue need of knowledge total number of concepts nb of cases (rate) involved (estimate) (1) 44 (88%) 54 (2) 18 (75%) 23 (3) 18 (58%) 19 (4) 15 (88%) 25 Total 95 (78%) 121
SLIDE 9
Main concepts used in argument kernels and their expression in language (issue 1)
Supports: efficiency is very good, 100% protection; avoids or reduces dissemination of disease; limited side-effects, etc. Attacks: limited number of cases and deaths compared to other diseases; limited risks of contamination, ignorance of contamination forms; toxicity and high side-effects, etc. Concessions or Contrasts: some side-effects; high production and development costs; vaccine not yet available; ethical problems, etc. The above arguments are expressed in various ways:
- evaluative expressions: Vaccine development is very expensive,
- comparatives: number of sick people much smaller than for
Malaria.
- facts related to properties of the main concept(s) of the issue:
Vaccine is not yet available. There is no risk of dissemination.
- facts related to the consequences, purposes, uses or goals of
the issue: vaccine prevents bio-terrorism.
SLIDE 10
From Concepts to Knowledge Representation
The terms used in argument kernels concern: purposes, properties, parts, creation and development, etc. of the head terms of the issue or of derived concepts. These are relatively well defined in the Generative Lexicon. From issue (1): Vaccine(X):
CONSTITUTIVE:
- ACTIVE PRINCIPLE, ADJUVANT
- ,
TELIC:
- MAIN: PROTECT FROM(X,Y,D), AVOID(X,DISSEMINATION(D)),
MEANS: INJECT(Z,X,Y)
- ,
FORMAL:
- MEDICINE, ARTEFACT
- ,
AGENTIVE :
- DEVELOP(T,X), TEST(T,X), SELL(T,X)
-
SLIDE 11
Modeling the Diversity / the Generative expansion of arguments
◮ Arguments attack or support specific facets of the concepts of
the controversial issue (called root concepts). (protect from(X,Y, (infect(E1,ebola, Y) ⇒ get sick(E2,Y) ⇒ ♦ die(E3,Y))) ∧ avoid(X,dissemination(ebola)).
◮ Arguments may also attack or support concepts derived from
these initial concepts (related to functions, parts, etc.).
◮ For example, they may attack properties or purposes of the
adjuvant or of the protocols used to test the vaccine. Arguments must however remain functionally close to the root. ⇒ Develop a network of Concepts and their Qualias derived from those involved in the controversial issue, with a limited depth. ⇒ Qualias structure the concepts in terms of parts, functions, etc.
SLIDE 12
Generative expansion of arguments: a simple example
- Consider: constitutive(vaccine(X)) = {active principle, adjuvant}.
Network of Concepts and Qualias:
- ‘active principle’: terminal concept in the network, associated with
its lexicalizations (e.g. active principle, vaccine),
- ‘Adjuvant’: non-terminal concept, included with its Qualia into the
network: Adjuvant(Y,X1):
FORMAL :
- MEDICINE, CHEMICALS
- ,
TELIC:
- DILUTE(Y,X1), ALLOW(INJECT(X1,P))
-
- Then the non-terminal concepts (e.g. medicine, dilute(Y,X1),
inject(X1,P) introduce new Qualias in the network. Natural language terms are associated to these concepts, e.g.: medicine, chemicals, inject, injection, dilute, dilution.
SLIDE 13
Case study 1
Utterance A1: The adjuvant of the Ebola vaccine is toxic Utterance A1 matches with the language pattern: [np, ‘is’, evaluative expression] A1 negatively evaluates the adjuvant (lexical feature of the adjective ‘toxic’), but it does not explicitly say anything about the vaccine. Then, given: Ebola:
FORMAL:
- DISEASE
- ,
TELIC:
- INFECT(E1,EBOLA, P) ⇒ GET SICK(E2,P) ⇒ ♦ DIE(E3,P)
∧ E1 E2 E3.
-
SLIDE 14
The constitutive role of vaccine(X) says that the adjuvant is part of the
- vaccine. The Qualia of ‘adjuvant’ indicates that the active principle X1
is mixed by dilution with the adjuvant Y :
◮ Adjuvant Qualia: dilute(Y,X1) Y and X1 are mixed together to
form a single entity, the vaccine X.
◮ upwards inheritance of a property in a part-of relation: if a
(major) constitutive part K1 of an object K has a property P , then (probably) the entire object K has P: has property(K1,P) ∧ part of(K1,K) ⇒ has property(K,P).
◮ since Y and X1 are parts of X, then since Y is toxic for humans, it
follows that X is also toxic for humans. Therefore, A1 attacks the controversial issue. This statement may also be interpreted as a contrast to the controversial issue: ’the vaccine is necessary BUT it is toxic’. (Winterstein 2012)
SLIDE 15
A2: Seven persons died during the Ebola vaccine tests
◮ In the GL structure of vaccine(X), the ’test’ activity is
related to the agentive role.
◮ Axiomatization of the GL structure:
by definition, the agentive role is pre-telic: it occurs before the functions or the roles given in the telic role and their related properties are active: ∀ P(E) ∈ agentive-role, ∀ Q(E1) ∈ telic-role, E E1 ∧ ¬(P ⇒ Q).
◮ From that point of view, A2 is about tests, it does not say
anything about the vaccine roles, functions and consequences once it has been fully tested and approved.
◮ Argument2 is irrelevant or neutral w.r.t. the controversial
issue.
SLIDE 16
A3: No one is infected by Ebola in Europe
∀ Z, human(Z), in(Z,europe) ⇒ ¬ infect(E,ebola,Z) . (1) contradiction with telic role(epidemic(ebola)): infect(E1,ebola,Z). (2) Therefore, from the telic role of epidemic, there is no dissemination at the moment and therefore, from the telic of vaccine, no need of vaccine. (3) Therefore, Argument A3 is a partial attack of the controversial issue, with several restrictions: it is valid only in Europe and it relates a fact that occurs at the present time (compositionality rule again), (4) A3 may be analyzed as a contrast: vaccine is indeed necessary as a general principle, BUT since there are no cases in Europe at the moment it may not be necessary in Europe.
SLIDE 17
A4: The vaccine is not always efficient: 3 vaccined peopled died in Monrovia
The higher-order adjective ’efficient’ modifies the predicates in the telic role of vaccine: protect from(X,Y,Ebola), avoid(X, dissemination(X,Ebola)). (1) A4 says: ∃ Y, patient(Y), (inject(E1,X,Ebola vaccine,Y) ∧ get sick(E2,Y) ∧ ♦ die(E3,Y)) ⇒ ¬(protect from(X,Y,Ebola). (2) A4 contradicts the telic of vaccine(X). (3) Since the protection of the vaccine is not always guaranteed, statement 4 is an argument that weakly attacks the controversial issue, it is interpreted as a concession : Vaccine protects the population HOWEVER there are cases where it does not work. A concession basically supports the controversial issue, but adds some restrictions.
SLIDE 18
More complex cases
- Personal decision: everyone is free to have the vaccine
injected or not (therapeutic freedom principle),
- Temporal dependency of the statement, implicit conclusions:
there are now less cases in Africa,
- Debatable character of information sources on which the
statement is based: statistics are false, efficiency is evaluated by producer.
SLIDE 19
Required knowledge and inference: a Synthesis
◮ lexical knowledge: semantic features for lexical items, in
particular polarity, e.g. for verbs (avoid), intensifiers (for adverbs), scales, etc.,
◮ domain knowledge: encoded via the formalism of the
Generative Lexicon, including event structures and causal chains, via a network of Qualia structures,
◮ reasoning shemes:
- 1. inferences related to the semantics of the Qualia roles
- 2. inferences related to lexical semantics structures,
- 3. inferences related to general purpose or domain knowledge
- 4. inferences dedicated to argumentation, that allow to
compute relations and their strength between the controversial issue and the argument at stake. These are specific compositionality rules.
SLIDE 20